PdbID EntityID AsymChainIDs AuthorChainIDs UnpCode Name Sequence SeqLength HasWeakHits BestWeakEvalue BestWeakPfamID Source IsVirus Category IsValid 1a0n 1 A A P85A_HUMAN P2L PPRPLPVAPGSSKT 14 T 0.9 AAA_11 pdbhh F Eukaryota T 1a1m 3 C C PEPTIDE TPYDINQML TPYDINQML 9 T 12 Connexin40_C pdbhh F T 1a1n 3 C C PEPTIDE VPLRPMTY VPLRPMTY 8 T 8.1E-05 F-protein pdbhh F T 1a1o 3 C C PEPTIDE LS6 (KPIVQYDNF) KPIVQYDNF 9 T 5 NitrOD1 pdbhh F T 1a1p 1 A _ COMPSTATIN ICVVQDWGHHRCTX 14 T 2.2 RPN1_RPN2_N pdbhh F T 1a34 1 A,A10,A11,A12,A13,A14,A15,A2,A3,A4,A5,A6,A7,A8,A9 A,A,A,A,A,A,A,A,A,A,A,A,A,A,A COAT_STMV STMV MGRGKVKPNRKSTGDNSNVVTMIRAGSYPKVNPTPTWVRAIPFEVSVQSGIAFKVPVGSLFSANFRTDSFTSVTVMSVRAWTQLTPPVNEYSFVRLKPLFKTGDSTEEFEGRASNINTRASVGYRIPTNLRQNTVAADNVCEVRSNCRQVALVISCCFN 159 T 24 Csm4_C pdbhh T Viruses T 1a37 2 B,B2,D,D2 P,P,Q,P PS-RAF259 PEPTIDE LSQRQRST(SEP)TPNVHM KSQRQRSTSTPNVHM 15 T 26 PSRT pdbhh F T 1a38 2 B,D P,Q R18 PEPTIDE (PHCVPRDLSWLDLEANMCLP) FHCVPRDLSWLDLEANMCLP 20 T 0.33 PP_kinase_N pdbhh F T 1a4t 2 B B REGN_BPP22 20-MER BASIC PEPTIDE NAKTRRHERRRKLAIERDT 19 T 1.9 N36 unphh T Viruses T 1a9b 3 C,F C,F PEPTIDE LPPLDITPY LPPLDITPY 9 T 0.94 PINIT pdbhh F T 1ab9 1 A A CTRA_BOVIN GAMMA-CHYMOTRYPSIN CGVPAIQPVLSGL 13 T 2 CaM_bind pdbhh F Eukaryota T 1abz 1 A _ ATA XDWLKARVEQELQALEARGTDSNAELRAMEAKLKAEIQKX 40 T 0.03 DUF4148 pdb F T 1aft 1 A _ RIR2_MOUSE RIBONUCLEOSIDE-DIPHOSPHATE REDUCTASE XFTLDADF 8 T 15 GDE_N_bis pdbhh F Eukaryota T 1aj1 1 A A LANA_ACTGA LANTIBIOTIC ACTAGARDINE XSGWVCXLXIECGXVICAC 19 T 7.3E-05 L_biotic_typeA pdbhh F Bacteria T 1aja 1 A,B A,B PPB_ECOLI ALKALINE PHOSPHATASE TPEMPVLENRAAQGDITAPGGARRLTGDQTAALRDSLSDKPAKNIILLIGDGMGDSEITAARNYAEGAGGFFKGIDALPLTGQYTHYALNKKTGKPDYVTDSAASATAWSTGVKTYNGALGVDIHEKDHPTILEMAKAAGLATGNVSTAELQGATPAALVAHVTSRKCYGPSATSEKCPGNALEKGGKGSITEQLLNARADVTLGGGAKTFAETATAGEWQGKTLREQAEARGYQLVSDAASLNSVTEANQQKPLLGLFADGNMPVRWLGPKATYHGNIDKPAVTCTPNPQRNDSVPTLAQMTDKAIELLSKNEKGFFLQVEGASIDKQDHAANPCGQIGETVDLDEAVQRALEFAKKEGNTLVIVTADHAHASQIVAPDTKAPGLTQALNTKDGAVMVMSYGNSEEDSQEHTGSQLRIAAYGPHAANVVGLTDQTDLFYTMKAALGLK 449 T 4.2E-10 Alk_phosphatase pdbpercent F Bacteria T 1akj 3 C C HIV REVERSE TRANSCRIPTASE EPITOPE ILKEPVHGV 9 T 0.56 DUF2115 pdbhh F T 1ali 1 A,B A,B PPB_ECOLI ALKALINE PHOSPHATASE TPEMPVLENRAAQGDITAPGGARRLTGDQTAALRDSLSDKPAKNIILLIGDGMGDSEITAARNYAEGAGGFFKGIDALPLTGQYTHYALNKKTGKPDYVTDSAASATAWSTGVKTYNGALGVDIHEKDHPTILEMAKAAGLATGNVSTAELQDATPAALVAHVTSRKCYGPSATSEKCPGNALEKGGKGSITEQLLNARADVTLGGGAKTFAETATAGEWQGKTLREQAQARGYQLVSDAASLNSVTEANQQKPLLGLFADGNMPVRWLGPKATYHGNIDKPAVTCTPNPQRNDSVPTLAQMTDKAIELLSKNEKGFFLQVEGASIDKQDHAANPCGQIGETVDLDEAVQRALEFAKKEGNTLVIVTADHAHASQIVAPDTKAPGLTQALNTKDGAVMVMSYGNSEEDSQENTGSQLRIAAYGPHAANVVGLTDQTDLFYTMKAALGLK 449 T 6.8E-10 Alk_phosphatase unppssm F Bacteria T 1alx 1 A A VALYL GRAMICIDIN XGAXAXVXWXYWXWXWX 17 T 3.1 MAP17 pdbhh F T 1alz 1 A A ILE-GRAMICIDIN C XXGAXAXVXWXYWXWXWX 18 T 3.3 DUF5848 pdbhh F T 1amt 1 A,B,C A,B,C ALAMETHICIN XXPXAXAQXVXGLXPVXXEQX 21 T 23 RRT14 pdbhh F T 1ani 1 A,B A,B PPB_ECOLI ALKALINE PHOSPHATASE MPVLENRAAQGDITAPGGARRLTGDQTAALRDSLSDKPAKNIILLIGDGMGDSEITAARNYAEGAGGFFKGIDALPLTGQYTHYALNKKTGKPDYVTDSAASATAWSTGVKTYNGALGVDIHEKDHPTILEMAKAAGLATGNVSTAELQHATPAALVAHVTSRKCYGPSATSEKCPGNALEKGGKGSITEQLLNARADVTLGGGAKTFAETATAGEWQGKTLREQAQARGYQLVSDAASLNSVTEANQQKPLLGLFADGNMPVRWLGPKATYHGNIDKPAVTCTPNPQRNDSVPTLAQMTDKAIELLSKNEKGFFLQVEGASIDHQDHAANPCGQIGETVDLDEAVQRALEFAKKEGNTLVIVTADHAHASQIVAPDTKAPGLTQALNTKDGAVMVMSYGNSEEDSQEHTGSQLRIAAYGPHAANVVGLTDQTDLFYTMKAALGLK 446 T 6.8E-10 Alk_phosphatase unppssm F Bacteria T 1anj 1 A,B A,B PPB_ECOLI ALKALINE PHOSPHATASE MPVLENRAAQGDITAPGGARRLTGDQTAALRDSLSDKPAKNIILLIGDGMGDSEITAARNYAEGAGGFFKGIDALPLTGQYTHYALNKKTGKPDYVTDSAASATAWSTGVKTYNGALGVDIHEKDHPTILEMAKAAGLATGNVSTAELQDATPAALVAHVTSRKCYGPSATSEKCPGNALEKGGKGSITEQLLNARADVTLGGGAKTFAETATAGEWQGKTLREQAQARGYQLVSDAASLNSVTEANQQKPLLGLFADGNMPVRWLGPKATYHGNIDKPAVTCTPNPQRNDSVPTLAQMTDKAIELLSKNEKGFFLQVEGASIDHQDHAANPCGQIGETVDLDEAVQRALEFAKKEGNTLVIVTADHAHASQIVAPDTKAPGLTQALNTKDGAVMVMSYGNSEEDSQEHTGSQLRIAAYGPHAANVVGLTDQTDLFYTMKAALGLK 446 T 6.8E-10 Alk_phosphatase unppssm F Bacteria T 1aot 2 B P MT_POVHA PHOSPHOTYROSYL PEPTIDE EPQXEEIPIYL 11 T 3.2 Imm15 pdbhh T Viruses T 1aqg 1 A _ GNAT1_BOVIN GT(ALPHA)(340-350) IKENLKDCGLF 11 T 2.5 Peptidase_C48 pdbhh F Eukaryota T 1aqz 1 A,B A,B RNMG_ASPRE RESTRICTOCIN ATWTCINQQLNPKTNKWEDKRLLYSQAKAESNSHHAPLSDGKTGSSYPHWFTNGYDGNGKLIKGRTPIKFGKADCDRPPKHSQNGMGKDDHYLLEFPTFPDGHDYKFDSKKPKENPGPARVIYTYPNKVFCGIVAHQRGNQGDLRLCSH 149 T 46 Cuticle_2 pdbhh F Eukaryota T 1awu 2 B B PEPTIDE FROM THE HIV-1 CAPSID PROTEIN HVGPIA 6 T 100 LEM pdbhh F T 1axc 2 B,D,F B,D,F CDN1A_HUMAN P21/WAF1 GRKRRQTSMTDFYHSKRRLIFS 22 T 0.85 CDC27 pdbhh F Eukaryota T 1aya 2 B,D P,Q PGFRB_MOUSE PEPTIDE PDGFR-1009 SVLXTAVQPNE 11 T 38 Phage_holin_2_2 pdbhh F Eukaryota T 1ayb 2 B P IRS1_MOUSE PEPTIDE IRS-1-895 SPGEXVNIEFGS 12 T 0.7 CBM32 pdbhh F Eukaryota T 1ayc 2 B P PGFRB_MOUSE PEPTIDE PDGFR-740 DGGXMDMSKGS 11 F F Eukaryota T 1b07 2 B C SOS1_MOUSE PROTEIN (SH3 PEPTOID INHIBITOR) YEVPGPVPPRRR 12 T 11 Duffy_binding pdbhh F Eukaryota T 1b0g 3 C,F C,F EMC7_HUMAN PEPTIDE P1049 (ALWGFFPVL) ALWGFFPVL 9 T 0.51 MRP-L47 pdbhh F Eukaryota T 1b0q 1 A A MSH XCEHXRWCKPVX 12 F F T 1b0r 3 C C PROTEIN (INFLUENZA MATRIX PEPTIDE) GILGFVFTX 9 T 1.7 Flu_M1 pdbhh F T 1b8h 2 D D DPOL_BPR69 GP43 KKASLFDMFDF 11 T 0.82 Radial_spoke_3 pdbhh T Viruses T 1b9p 1 A A COEA1_CHICK ALPHA 1 TYPE XIV COLLAGEN CAVELRSPGISRFRRKIAKRSIKTLEHKRENAKE 34 T 1.8 Mrx7 pdbhh F Eukaryota T 1bbr 4 D,G,J F,G,I FIBA_HUMAN FIBRINOGEN ALPHA/ALPHA-E CHAIN PRECURSOR XDFLAEGGGVR 11 T 1.4 DUF4715 unphh F Eukaryota T 1bc5 2 B T TAR XNWETF 6 T 37 FDF pdbhh F T 1bcv 1 A _ POLG_FMDVA PEPTIDE CORRESPONDING TO THE MAJOR IMMUNOGEN SITE OF FMD VIRUS XGSGVRGDFGSLAPRVARQL 20 T 0.00016 Rhv unppercent T Viruses T 1bei 1 A _ K1A_STIHL SHK-DNP22 RSCIDTIPKSRCTAFQCKHSMXYRLSFCRKTCGTC 35 T 0.0045 ShK unp F Eukaryota T 1bfz 1 A _ HCMV PROTEASE R-SITE N-TERMINAL CLEAVAGE PRODUCT XSYVKA 6 T 150 DUF632 pdbhh F T 1bi6 1 A L IBRO_ANACO BROMELAIN INHIBITOR VI TACSECVCPLR 11 T 0.014 CID_GANP unp F Eukaryota T 1bmb 2 B I PROTEIN (PKF270-974) KPFXVNVEF 9 T 0.61 SH3-WW_linker pdbhh F T 1bog 3 C C PEPTIDE GATPEDLNQKL 11 T 8.6 DUF4605 pdbhh F T 1br8 2 C P PROTEIN (PEPTIDE) SEAAASTAVVIA 12 T 28 ACC_epsilon pdbhh F T 1bt6 2 C,D C,D ANXA2_CHICK ANNEXIN II XSTVHEILSKLSLE 14 T 8 DUF4581 pdbhh F Eukaryota T 1bw8 2 B P A8IP97_RAT PROTEIN (INTERNALIZATION SIGNAL FROM EGFR) FYRALM 6 T 0.2 GcnA_N pdbhh F Eukaryota T 1bxp 2 B B PEPTIDE MET-ARG-TYR-TYR-GLU-SER-SER-LEU-LYS-SER-TYR-PRO-ASP MRYYESSLKSYPD 13 T 3.3 Prion pdbhh F T 1bxx 2 B P PROTEIN (TGN38 PEPTIDE) DYQRLN 6 T 30 Fer4_24 pdbhh F T 1bz9 3 C C PROTEIN (PEPTIDE P1027 (FAPGVFPYM)) FAPGVFPYM 9 T 0.35 CT_C_D pdbhh F T 1c2u 1 A A K1A_STIHL SYNTHETIC PEPTIDE ANALOGUE OF SHK TOXIN RSXIDTIPKSRCTAFQCKHSAKYRLSFCRKTCGTX 35 T 0.0045 ShK unp F Eukaryota T 1c4e 1 A A GUR_GYMSY PROTEIN (GURMARIN) QQCVKKDELCIPYYLDCCEPLECKKVNWWDHKCIG 35 T 0.00036 Toxin_7 pdb F Eukaryota T 1c4v 3 C 3 HIRUGEN ACENEDFEEIPGEYL 15 T 0.033 Hirudin pdbhh F T 1c4y 3 C 3 HIRUGEN ENEDFEGIPGEYL 13 T 0.3 Hirudin pdbhh F T 1c9l 2 C,D C,D B-ADAPTIN 3 DTNLIEFE 8 T 55 DUF247 pdbhh F T 1ca9 2 G,H G,H TNR1B_HUMAN PROTEIN (TNF-R2) GQVPFSKEEC 10 T 3.2 Bac_export_2 pdbhh F Eukaryota T 1cdl 2 B,D,F,H E,F,G,H MYLK_CHICK CALCIUM/CALMODULIN-DEPENDENT PROTEIN KINASE TYPE II ALPHA CHAIN ARRKWQKTGHAVRAIGRLSS 20 T 7.3 PACT_coil_coil pdbhh F Eukaryota T 1cdm 2 B B KCC2A_RAT CALMODULIN LKKFNARRKLKGAILTTMLATRNFS 25 T 13 PACT_coil_coil pdbhh F Eukaryota T 1ce1 3 C P PROTEIN (PEPTIDE ANTIGEN) GTSSPSAD 8 T 9.5 Phage_T4_gp36 pdbhh F T 1cfn 3 C C PROTEIN (BOUND PEPTIDE) GATPQDLNTX 10 T 3.2 DNA_Packaging_2 pdbhh F T 1cfs 3 C C PROTEIN (ANTIGEN BOUND PEPTIDE) GLYEWGGARIT 11 T 3.4 DUF4873 pdbhh F T 1ckk 2 B B KKCC1_RAT CAMKK 1,CAM-KINASE IV KINASE,CALCIUM/CALMODULIN-DEPENDENT PROTEIN KINASE KINASE ALPHA,CAMKK ALPHA VKLIPSWTTVILVKSMLRKRSFGNPF 26 T 14 DUF4326 pdbhh F Eukaryota T 1clv 2 B I IAAI_AMAHP PROTEIN (ALPHA-AMYLASE INHIBITOR) CIPKWNRCGPKMDGVPCCEPYTCTSDYYGNCS 32 T 0.022 Toxin_12 pdbpssm F Eukaryota T 1cmi 2 C,D C,D NOS1_RAT BNOS, CONSTITUTIVE NOS, NC-NOS, NOS TYPE I, NEURONAL NOS, N-NOS, NNOS KAEMKDTGIQVDR 13 T 2.4 Exog_C pdbhh F Eukaryota T 1cmj 1 A A NOR_FUSOX P450NOR ASGAPSFPFSRASGPEPPAEFAKLRATNPVSQVKLFDGSLAWLVTKHKDVCFVATSEKLSKVRTRQGFPELSASGKQAAKAKPTFVDMDPPEHMHQRSMVEPTFTPEAVKNLQPYIQRTVDDLLEQMKQKGCANGPVDLVKEFALPVPSYIIYTLLGVPFNDLEYLTQQNAIRTNGSSTAREASAANQELLDYLAILVEQRLVEPKDDIISKLCTEQVKPGNIDKSDAVQIAFLLLVAGNATMVNMIALGVATLAQHPDQLAQLKANPSLAPQFVEELCRYHTATALAIKRTAKEDVMIGDKLVRANEGIIASNQSANRDEEVFENPDEFNMNRKWPPQDPLGFGFGDHRCIAEHLAKAELTTVFSTLYQKFPDLKVAVPLGKINYTPLNRDVGIVDLPVIF 402 T 2.3E-36 p450 unppercent F Eukaryota T 1cmn 1 A A NOR_FUSOX P450NOR ASGAPSFPFSRASGPEPPAEFAKLRATNPVSQVKLFDGSLAWLVTKHKDVCFVATSEKLSKVRTRQGFPELSASGKQAAKAKPTFVDMDPPEHMHQRSMVEPTFTPEAVKNLQPYIQRTVDDLLEQMKQKGCANGPVDLVKEFALPVPSYIIYTLLGVPFNDLEYLTQQNAIRTNGSSTAREASAANQELLDYLAILVEQRLVEPKDDIISKLCTEQVKPGNIDKSDAVQIAFLLLVAGNATMVNMIALGVATLAQHPDQLAQLKANPSLAPQFVEELCRYHTAVALAIKRTAKEDVMIGDKLVRANEGIIASNQSANRDEEVFENPDEFNMNRKWPPQDPLGFGFGDHRCIAEHLAKAELTTVFSTLYQKFPDLKVAVPLGKINYTPLNRDVGIVDLPVIF 402 T 2.3E-36 p450 pdbpercent F Eukaryota T 1cnl 1 A A CA1_CONIM PROTEIN (ALPHA-CONOTOXIN IMI) GCCSDPRCAWRCX 13 T 0.0098 Toxin_8 unphh F Eukaryota T 1cu4 3 C P RECOGNITION PEPTIDE APKTNMKHMA 10 T 22 MPP6 pdbhh F T 1cvu 2 C F PROTEIN (9-MER) TKTATINAS 9 T 100 Snu56_snRNP pdbhh F T 1cwd 2 B P (PHOSPHONOMETHYL)PHENYLALANINE-CONTAINING PEPTIDE PRO-GLU-GLY-ASP-PM3-GLU-GLU-VAL-LEU PEGDXEEVL 9 T 1.9 Ykof pdbhh F T 1cwu 1 A,B A,B FABI_BRANA ENOYL ACP REDUCTASE LPIDLRGKRAFIAGIADDNGYGWAVAKSLAAAGAEILVGTWVPALNIFETSLRRGKFDQSRVLPDGSLMEIKKVYPLDAVFDNPEDVPEDVKANKRYAGSSNWTVQEAAECVRQDFGSIDILVHSLGNGPEVSKPLLETSRKGYLAAISASSYSFVSLLSHFLPIMNPGGASISLTYIASERIIPGYGGGMSSAKAALESDTRVLAFEAGRKQNIRVNTISAGPLGSRAAKAIGFIDTMIEYSYNNAPIQKTLTADEVGNAAAFLVSPLASAITGATIYVDNGLNSMGVALDSPVF 296 T 3.1E-05 adh_short_C2 unppssm F Eukaryota T 1cz6 1 A A ANDT_ANDAU PROTEIN (ANDROCTONIN) RSVCRQIKICRRRGGCYYKCTNRPY 25 T 0.35 DUF4528 pdbhh F Eukaryota T 1czz 2 D,E D,E TNR5_HUMAN CD40 XPVQETLHGC 10 T 1.8 Ripply pdbhh F Eukaryota T 1d00 2 I,J,K,L,M,N,O,P I,J,K,L,M,N,O,P TNR5_HUMAN B-CELL SURFACE ANTIGEN CD40 XPVQETX 7 T 11 DUF3827 unphh F Eukaryota T 1d01 2 G,H,I G,H,I TNR8_HUMAN CD30 PEPTIDE XMLSVEEEG 9 T 64 DDRGK pdbhh F Eukaryota T 1d0w 1 A A C-TERMINAL ANALOGUE OF NEUROPEPTIDE Y, A POTENT Y2 RECEPTOR AGONIST ARHYKNLLERQRYX 14 T 1.1 Hormone_3 pdbhh F T 1d1e 1 A A C-TERMINAL ANALOGUE OF NEUROPEPTIDE Y, A POTENT Y2 RECEPTOR AGONIST XRHYKNLIERQRYX 14 T 0.00024 Hormone_3 pdbhh F T 1d4t 2 B B SLAF1_HUMAN SLAM KSLTIYAQVQK 11 T 0.1 MFS_1 unppssm F Eukaryota T 1d4w 2 C,D C,D SLAF1_HUMAN SLAM KSLTIXAQVQK 11 T 0.1 MFS_1 unppssm F Eukaryota T 1d5g 2 B B PEPTIDE FADSEADENEQVSAV FADSEADENEQVSAV 15 T 25 DUF1660 pdbhh F T 1d5m 4 D D INHIBITOR XXRAMXSLX 9 T 57 DUF3725 pdbhh F T 1d5q 1 A A CHIMERIC MINI-PROTEIN CNLARCQLSCKSLGLKGGCQGSFCTCG 27 T 0.027 Toxin_2 pdbhh F T 1d6x 1 A A ANTIMICROBIAL PEPTIDE, TRITRPTICIN VRRFPWWWPFLRR 13 T 1.5 DUF2841 pdbhh F T 1d7q 1 A B PROTEIN (N-TERMINAL HISTIDINE TAG) MRGSHHHHHHTDPM 14 T 8300 zf_CCCH_4 pdbhh F T 1d7t 1 A A YNK-CONTRYPHAN GCPXNPKX 8 T 0.038 zf-U11-48K pdbhh F T 1d8e 3 C P RASK_HUMAN K-RAS4B PEPTIDE SUBSTRATE KKKSKTKCVIM 11 T 0.035 Fer4_22 unppssm F Eukaryota T 1d8t 2 C,D C,D THCL_PLARO GE2270A SXNXVXGXXXXXSPX 15 T 1.2 CCER1 unphh F Bacteria T 1ddm 2 B B NAK GFSNMSFEDFP 11 T 1.9 Dodecin pdbhh F T 1de3 1 A A RNAS_ASPGI RIBONUCLEASE ALPHA-SARCIN AVTWTCLNDQKNPKTNKYETKRLLYNQNKAESNSHHAPLSDGKTGSSYPHWFTNGYDGDGKLPKGRTPIKFGKSDCDRPPKHSKDGNGKTDHYLLEFPTFPDGHDYKFDSKKPKENPGPARVIYTYPNKVFCGIIAHTKENQGELKLCSH 150 T 23 MtrE unphh F Eukaryota T 1de7 3 E,F A,B FACTOR XIII ACTIVATION PEPTIDE (28-37) TVELQGVVPXX 11 T 0.7 DUF4075 pdbhh F T 1dfy 1 A A COW_CONSE CONTRYPHAN-SM GCPXQPWX 8 T 0.45 EndIII_4Fe-2S pdbhh F Eukaryota T 1dit 3 C P PEPTIDE INHIBITOR CVS995 XDPXGGGGGNGDFEEIPEYL 20 T 0.16 Hirudin pdbhh F T 1dkd 2 B,D,F,H E,F,G,H 12-MER PEPTIDE SWMTTPWGFLHP 12 T 1.1 DUF6163 pdbhh F T 1dlz 1 A A ZERVAMICIN IIB XWIQXITXLXPQXPXPX 17 T 25 bpX0 pdbhh F T 1dmc 1 A _ MT1_CALSI CD6 METALLOTHIONEIN-1 SPCQKCTSGCKCATKEECSKTCTKPCSCCPK 31 T 1.5 Metallothio_5 pdbhh F Eukaryota T 1dme 1 A _ MT1_CALSI CD6 METALLOTHIONEIN-1 PGPCCNDKCVCQEGGCKAGCQCTSCRCS 28 T 0.53 Metallothio_5 pdbhh F Eukaryota T 1dn2 2 B,D E,F ENGINEERED PEPTIDE DCAWHLGELVWCTX 14 T 7.4 FAT pdbhh F T 1dng 1 A A HUMAN PLATELET FACTOR 4, SEGMENT 59-73 QAPAYEEAAEELAKS 15 T 0.91 Comm pdbhh F T 1dpu 2 B B UNG_HUMAN URACIL DNA GLYCOSYLASE (UNG2) RIQRNKAAALLRLAAR 16 T 3 ARL6IP6 unppssm F Eukaryota T 1dt7 2 C,D X,Y P53_HUMAN CELLULAR TUMOR ANTIGEN P53 SHLKSKKGQSTSRHKKLMFKTE 22 T 56 Class_IIIsignal pdbhh F Eukaryota T 1dtd 2 B B MCPI_HIRME METALLOCARBOXYPEPTIDASE INHIBITOR DESFLCYQPDQVCCFICRGAAPLPSEGECNPHPTAPWCREGAVEWVPYSTGQCRTTCIPYV 61 T 0.019 Inhibitor_I68 unp F Eukaryota T 1dtv 1 A A MCPI_HIRME LCI GSHTPDESFLCYQPDQVCCFICRGAAPLPSEGECNPHPTAPWCREGAVEWVPYSTGQCRTTCIPYVE 67 T 0.0093 Inhibitor_I68 pdb F Eukaryota T 1du1 1 A A CAC1S_RABIT SKELETAL DIHYDROPYRIDINE RECEPTOR TSAQKAKAEERKRRKMSRGL 20 T 0.59 DUF1682 pdbhh F Eukaryota T 1dum 1 A,B A,B MAGA_XENLA MAGAININ 2 GIGKYLHSAKKFGKAWVGEIMNS 23 T 1.6 TAFII28 pdbhh F Eukaryota T 1duy 3 C,F C,F HTLV-1 OCTAMERIC TAX PEPTIDE LFGYPVYV 8 T 0.076 Pecanex_C pdbhh F T 1dva 3 C,F X,Y PEPTIDE E-76 XALCDDPRVDRWYCQFVEGX 20 T 0.97 HTH_48 pdbhh F T 1dzi 2 B,C,D B,C,D COLLAGEN GPPGPPGFPGERGPPGPPGPPX 22 T 0.0013 Collagen pdbpssm F T 1e4w 3 C P CYCLIC PEPTIDE SHFNEYE 7 T 21 Phospho_p8 pdbhh F T 1e4x 4 E,F P,Q CYCLIC PEPTIDE VVSHFND 7 T 3.7 TnpW pdbhh F T 1e54 2 B B OMP32 DNWQNGTS 8 T 4.8 DUF1842 pdbhh F T 1e6i 2 B P H4_YEAST HISTONE H4 AXRHRKILRNSIQGI 15 T 4.2 Shadoo unppercent F Eukaryota T 1e74 1 A A CA1_CONIM ALPHA-CONOTOXIN IM1(R11E) GCCSDPRCAWECX 13 T 0.0098 Toxin_8 unphh F Eukaryota T 1e75 1 A A CA1_CONIM ALPHA-CONOTOXIN IM1(R7L) GCCSDPLCAWRCX 13 T 0.0098 Toxin_8 unphh F Eukaryota T 1e76 1 A A CA1_CONIM ALPHA-CONOTOXIN IM1(D5N) GCCSNPRCAWRCX 13 T 0.0098 Toxin_8 unphh F Eukaryota T 1e91 2 B B MAD1_HUMAN MAD PROTEIN (MAX DIMERIZER) NIQMLLEAADYLE 13 T 3.9 Rad10 pdbhh F Eukaryota T 1eb1 1 A A PEPTIDE INHIBITOR DYEPIPEEAF 10 T 0.018 Hirudin pdbhh F T 1ee5 2 B B NUPL_XENLA NUCLEOPLASMIN AVKRPAATKKAGQAKKKKL 19 T 0.0016 BSP_II unppercent F Eukaryota T 1ee7 1 A A CHRYSOSPERMIN C XFXSXXLQGXXAAXPXXXQX 20 T 21 DUF4141 pdbhh F T 1een 2 B B ALA-ASP-PBF-PTR-LEU-ILE-PRO ADXXLIP 7 T 0.67 SPOC pdbhh F T 1eeo 2 B B ACETYL-E-L-E-F-PTYR-M-D-Y-E-NH2 PEPTIDE XELEFXMDYEX 11 T 5.1 ATP1G1_PLM_MAT8 pdbhh F T 1eey 3 C,F C,F GP2 PEPTIDE ILSALVGIV 9 T 0.7 H2O2_YaaD pdbhh F T 1eez 3 C,F C,F GP2 PEPTIDE ILSALVGIL 9 T 0.63 H2O2_YaaD pdbhh F T 1ehf 1 A A NOR_FUSOX NITRIC-OXIDE REDUCTASE B MASGAPSFPFSRASGPEPPAEFAKLRATNPVSQVKLFDGSLAWLVTKHKDVCFVATSEKLSKVRTRQGFPELSASGKQAAKAKPTFVDMDPPEHMHQRSMVEPTFTPEAVKNLQPYIQRTVDDLLEQMKQKGCANGPVDLVKEFALPVPSYIIYTLLGVPFNDLEYLTQQNAIRTNGSSTAREASAANQELLDYLAILVEQRLVEPKDDIISKLCTEQVKPGNIDKSDAVQIAFLLLVAGNATMVNMIALGVATLAQHPDQLAQLKANPSLAPQFVEELCRYHTATALAIKRTAKEDVMIGDKLVRANEGIIASNQSANRDEEVFENPDEFNMNRKWPPQDPLGFGFGDHRCIAEHLAKAELTTVFSTLYQKFPDLKVAVPLGKINYTPLNRDVGIVDLPVIF 403 T 2.3E-36 p450 unppercent F Eukaryota T 1ehg 1 A A NOR_FUSOX NITRIC-OXIDE REDUCTASE B MASGAPSFPFSRASGPEPPAEFAKLRATNPVSQVKLFDGSLAWLVTKHKDVCFVATSEKLSKVRTRQGFPELSASGKQAAKAKPTFVDMDPPEHMHQRSMVEPTFTPEAVKNLQPYIQRTVDDLLEQMKQKGCANGPVDLVKEFALPVPSYIIYTLLGVPFNDLEYLTQQNAIRTNGSSTAREASAANQELLDYLAILVEQRLVEPKDDIISKLCTEQVKPGNIDKSDAVQIAFLLLVAGNATMVNMIALGVATLAQHPDQLAQLKANPSLAPQFVEELCRYHTAVALAIKRTAKEDVMIGDKLVRANEGIIASNQSANRDEEVFENPDEFNMNRKWPPQDPLGFGFGDHRCIAEHLAKAELTTVFSTLYQKFPDLKVAVPLGKINYTPLNRDVGIVDLPVIF 403 T 2.3E-36 p450 unppercent F Eukaryota T 1ejo 3 C P POLG_FMDVT FMDV PEPTIDE YTTSTRGDLAHVTTT 15 T 0.0013 Waikav_capsid_1 unphh T Viruses T 1ejy 1 A N NUPL_XENLA NUCLEOPLASMIN NLS PEPTIDE KRPAATKKAGQAKKKK 16 T 0.0016 BSP_II unppercent F Eukaryota T 1elw 2 C,D C,D HSC70-PEPTIDE GPTIEEVD 8 T 8.1 DUF4028 pdbhh F T 1elx 1 A,B A,B PPB_ECOLI ALKALINE PHOSPHATASE TPEMPVLENRAAQGDITAPGGARRLTGDQTAALRDSLSDKPAKNIILLIGDGMGDSEITAARNYAEGAGGFFKGIDALPLTGQYTHYALNKKTGKPDYVTDAAASATAWSTGVKTYNGALGVDIHEKDHPTILEMAKAAGLATGNVSTAELQDATPAALVAHVTSRKCYGPSATSEKCPGNALEKGGKGSITEQLLNARADVTLGGGAKTFAETATAGEWQGKTLREQAQARGYQLVSDAASLNSVTEANQQKPLLGLFADGNMPVRWLGPKATYHGNIDKPAVTCTPNPQRNDSVPTLAQMTDKAIELLSKNEKGFFLQVEGASIDKQDHAANPCGQIGETVDLDEAVQRALEFAKKEGNTLVIVTADHAHASQIVAPDTKAPGLTQALNTKDGAVMVMSYGNSEEDSQEHTGSQLRIAAYGPHAANVVGLTDQTDLFYTMKAALGLK 449 T 6.8E-10 Alk_phosphatase unppssm F Bacteria T 1ely 1 A,B A,B PPB_ECOLI ALKALINE PHOSPHATASE TPEMPVLENRAAQGDITAPGGARRLTGDQTAALRDSLSDKPAKNIILLIGDGMGDSEITAARNYAEGAGGFFKGIDALPLTGQYTHYALNKKTGKPDYVTDCAASATAWSTGVKTYNGALGVDIHEKDHPTILEMAKAAGLATGNVSTAELQDATPAALVAHVTSRKCYGPSATSEKCPGNALEKGGKGSITEQLLNARADVTLGGGAKTFAETATAGEWQGKTLREQAQARGYQLVSDAASLNSVTEANQQKPLLGLFADGNMPVRWLGPKATYHGNIDKPAVTCTPNPQRNDSVPTLAQMTDKAIELLSKNEKGFFLQVEGASIDKQDHAANPCGQIGETVDLDEAVQRALEFAKKEGNTLVIVTADHAHASQIVAPDTKAPGLTQALNTKDGAVMVMSYGNSEEDSQEHTGSQLRIAAYGPHAANVVGLTDQTDLFYTMKAALGLK 449 T 6.5E-11 Alk_phosphatase pdbpssm F Bacteria T 1elz 1 A,B A,B PPB_ECOLI ALKALINE PHOSPHATASE TPEMPVLENRAAQGDITAPGGARRLTGDQTAALRDSLSDKPAKNIILLIGDGMGDSEITAARNYAEGAGGFFKGIDALPLTGQYTHYALNKKTGKPDYVTDGAASATAWSTGVKTYNGALGVDIHEKDHPTILEMAKAAGLATGNVSTAELQDATPAALVAHVTSRKCYGPSATSEKCPGNALEKGGKGSITEQLLNARADVTLGGGAKTFAETATAGEWQGKTLREQAQARGYQLVSDAASLNSVTEANQQKPLLGLFADGNMPVRWLGPKATYHGNIDKPAVTCTPNPQRNDSVPTLAQMTDKAIELLSKNEKGFFLQVEGASIDKQDHAANPCGQIGETVDLDEAVQRALEFAKKEGNTLVIVTADHAHASQIVAPDTKAPGLTQALNTKDGAVMVMSYGNSEEDSQEHTGSQLRIAAYGPHAANVVGLTDQTDLFYTMKAALGLK 449 T 6.8E-10 Alk_phosphatase unppssm F Bacteria T 1eoj 2 B B THROMBIN INHIBITOR P798 XRXXXDYEPIPEEA 14 T 0.12 Hirudin pdbhh F T 1eol 2 B B THROMBIN INHIBITOR P628 XRXXXDYEPIPEEAA 15 T 0.16 Hirudin pdbhh F T 1epm 2 B I PS2, THR-PHE-GLN-ALA-PSA-LEU-ARG-GLU TFQAXLRE 8 T 0.26 SAC3 pdbhh F T 1eqx 1 A A UBE3A_HUMAN PAPILLOMAVIRUS E6-ASSOCIATED PROTEIN IPESSELTLQELLGEERR 18 T 3.5 DUF1413 pdbhh F Eukaryota T 1er8 2 B I ANGT_HORSE H-77 XPFHLLVY 8 T 0.86 Nairo_nucleo unphh F Eukaryota T 1eww 1 A A Q9GTP0_CHOFU ANTIFREEZE PROTEIN DGSCTNTNSQLSANSKCEKSTLTNCYVDKSEVYGTTCTGSRFDGVTITTSTSTGSRISGPGCKISTCIITGGVPAPSAACKISGCTFSAN 90 T 6.2E-25 CfAFP unppssm F Eukaryota T 1exy 2 B B REX_HTL1C PROTEIN X (HTLV-1), P27 PROTEIN (HTLV-1) MPKTRRRPRRSQRKRP 16 T 5.9 DUF1639 pdbhh T Viruses T 1ezg 1 A,B A,B ANPY1_TENMO THERMAL HYSTERESIS PROTEIN ISOFORM YL-1 QCTGGADCTSCTGACTGCGNCPNAVTCTNSQHCVKANTCTGSTDCNTAQTCTNSKDCFEANTCTDSTNCYKATACTNSSGCPGH 84 T 0.0023 AFP pdb F Eukaryota T 1f24 1 A A NOR_FUSOX NITRIC OXIDE REDUCTASE ASGAPSFPFSRASGPEPPAEFAKLRATNPVSQVKLFDGSLAWLVTKHKDVCFVATSEKLSKVRTRQGFPELSASGKQAAKAKPTFVDMDPPEHMHQRSMVEPTFTPEAVKNLQPYIQRTVDDLLEQMKQKGCANGPVDLVKEFALPVPSYIIYTLLGVPFNDLEYLTQQNAIRTNGSSTAREASAANQELLDYLAILVEQRLVEPKDDIISKLCTEQVKPGNIDKSDAVQIAFLLLVAGNAAMVNMIALGVATLAQHPDQLAQLKANPSLAPQFVEELCRYHTASALAIKRTAKEDVMIGDKLVRANEGIIASNQSANRDEEVFENPDEFNMNRKWPPQDPLGFGFGDHRCIAEHLAKAELTTVFSTLYQKFPDLKVAVPLGKINYTPLNRDVGIVDLPVIF 402 T 2.3E-36 p450 unppercent F Eukaryota T 1f25 1 A A NOR_FUSOX NITRIC OXIDE REDUCTASE ASGAPSFPFSRASGPEPPAEFAKLRATNPVSQVKLFDGSLAWLVTKHKDVCFVATSEKLSKVRTRQGFPELSASGKQAAKAKPTFVDMDPPEHMHQRSMVEPTFTPEAVKNLQPYIQRTVDDLLEQMKQKGCANGPVDLVKEFALPVPSYIIYTLLGVPFNDLEYLTQQNAIRTNGSSTAREASAANQELLDYLAILVEQRLVEPKDDIISKLCTEQVKPGNIDKSDAVQIAFLLLVAGNANMVNMIALGVATLAQHPDQLAQLKANPSLAPQFVEELCRYHTASALAIKRTAKEDVMIGDKLVRANEGIIASNQSANRDEEVFENPDEFNMNRKWPPQDPLGFGFGDHRCIAEHLAKAELTTVFSTLYQKFPDLKVAVPLGKINYTPLNRDVGIVDLPVIF 402 T 1.4999999999999999E-36 p450 pdbpssm F Eukaryota T 1f3r 1 A A ACETYLCHOLINE RECEPTOR ALPHA WNPGDYGGIX 10 T 0.45 CBM32 pdbhh F T 1f47 2 B A FTSZ_ECOLI CELL DIVISION PROTEIN FTSZ KEPDYLDIPAFLRKQAD 17 T 0.99 Drc1-Sld2 pdbhh F Bacteria T 1f4v 2 D,E,F D,E,F FLIM_ECOLI FLIM MGDSILSQAEIDALLN 16 T 0.027 CitT pdbhh F Bacteria T 1f59 2 C,D C,D NSP1P XDDSKPAFSFGXXXXXXXXXXXAFSFGX 28 T 16 SHIPPO-rpt pdbhh F T 1f7a 2 C P Q9YX54_9HIV1 CA-P2 SUBSTRATE KARVLAEAMS 10 T 13 GREB1 pdbhh T Viruses T 1f8h 2 B B PTGSSSTNPFR PTGSSSTNPFR 11 T 1.8 Yuri_gagarin pdbhh F T 1f8i 1 A,B,C,D A,B,C,D ACEA_MYCTU ICL MASVVGTPKSAEQIQQEWDTNPRWKDVTRTYSAEDVVALQGSVVEEHTLARRGAEVLWEQLHDLEWVNALGALTGNMAVQQVRAGLKAIYLSGWQVAGDANLSGHTYPDQSLYPANSVPQVVRRINNALQRADQIAKIEGDTSVENWLAPIVADGEAGFGGALNVYELQKALIAAGVAGSHWEDQLASEKKSGHLGGKVLIPTQQHIRTLTSARLAADVADVPTVVIARTDAEAATLITSDVDERDQPFITGERTREGFYRTKNGIEPCIARAKAYAPFADLIWMETGTPDLEAARQFSEAVKAEYPDQMLAYNCSPSFNWKKHLDDATIAKFQKELAAMGFKFQFITLAGFHALNYSMFDLAYGYAQNQMSAYVELQEREFAAEERGYTATKHQREVGAGYFDRIATTVDPNSSTTALTGSTEEGQFH 429 T 1.8E-47 ICL pdb F Bacteria T 1f90 3 C E ANTIGENIC NONAPEPTIDE KPLEEVLNL 9 T 5.2 IL2 pdbhh F T 1f95 2 C,D C,D B2L11_HUMAN BCL2-LIKE 11 (APOPTOSIS FACILITATOR) MSCDKSTQT 9 T 0.17 FAM117 pdbhh F Eukaryota T 1f96 2 C,D C,D PROTEIN (NNOS, NEURONAL NITRIC OXIDE SYNTHASE) MKDTGIQVDRDLDGKSHK 18 T 8.5 APOBEC1 pdbhh F T 1fbv 2 B B ZAP70_HUMAN ZAP-70 PEPTIDE SDGXTPEPA 9 T 1.5 FSIP1 pdbhh F Eukaryota T 1ff1 2 B B PTGSSSTNPFL PEPTIDE PTGSSSTNPFL 11 T 1.6 Yuri_gagarin pdbhh F T 1ffo 3 C,F C,F PEPTIDE WITH SEQUENCE ALA-ALA-VAL-TYR-ASN-PHE-ALA-THR-MET AAVYNFATM 9 T 5.9 DUF5607 pdbhh F T 1ffp 3 C,F C,F SYNTHETIC PEPTIDE WITH SEQUENCE SER-ALA-VAL-TYR-ASN-PHE-ALA-THR-MET SAVYNFATM 9 T 6.2 DUF5607 pdbhh F T 1fg2 3 C,F,I,L C,F,I,L LCMV PEPTIDIC EPITOPE GP33 KAVYNFATC 9 T 0.97 TOM6p pdbhh F T 1fiw 2 B L ACRO_SHEEP BETA-ACROSIN LIGHT CHAIN DNTTCDGPCGVRFRQNRQGGVR 22 T 130 Peptidase_C3 unp F Eukaryota T 1fiz 2 B L ACRO_PIG BETA-ACROSIN LIGHT CHAIN RDNATCDGPCGLRFRQKLESGMR 23 F F Eukaryota T 1fkn 2 C,D C,D inhibitor EVNXAEF 7 T 200 DUF1480 pdbhh F T 1fll 2 B,D X,Y TNR5_HUMAN B-CELL SURFACE ANTIGEN CD40 KTAAPVQETLHGSQPVTQEDG 21 T 11 DUF3827 unphh F Eukaryota T 1flt 2 C,D X,Y VGFR1_HUMAN FLT-1, VGR1 GRPFVEMYSEIPEIIHMTEGRELVIPCRVTSPNITVTLKKFPLDTLIPDGKRIIWDSRKGFIISNATYKEIGLLTCEATVNGHLYKTNYLTHRQT 95 T 0.00015 Ig_2 pdbpssm F Eukaryota T 1fme 1 A A FSD-EY PEPTIDE EQYTAKYKGRTFRNEKELRDFIEKFKGR 28 T 0.77 DUF4121 pdbhh F T 1foz 1 A A SYNTHETIC CYCLIC PEPTIDE XFELDKDF 8 T 10 Sda pdbhh F T 1fph 4 D F FIBA_HUMAN FIBRINOPEPTIDE A XDFLAEGGGVXX 12 T 1.4 DUF4715 unphh F Eukaryota T 1fpr 2 B B PEPTIDE PY469 EDTLTXADLD 10 T 2 G6B pdbhh F T 1fry 1 A A SC51_SHEEP SMAP29, SMAP-29 GENE PRODUCT RGLRRLGRKIAHGVKKYGPTVLRIIRIAG 29 T 0.095 CAP18_C unppercent F Eukaryota T 1fsd 1 A _ FULL SEQUENCE DESIGN 1 OF BETA BETA ALPHA MOTIF QQYTAKIKGRTFRNEKELRDFIEKFKGR 28 T 0.091 SpoVIF pdb F T 1fu5 2 B B MT_POVMA MT PEPTIDE EEEXMPMEDLXLDIL 15 T 3.6 DUF402 pdbhh T Viruses T 1fu9 1 A A USH_DROME U-SHAPED TRANSCRIPTIONAL COFACTOR GSAAEVMKKYCSTCDISFNYVKTYLAHKQFYCKNKP 36 T 0.0003 zf-met pdb F Eukaryota T 1ful 1 A A RGD PEPTIDE ISOMER-B ACDCRGDCFCG 11 T 0.48 Squash pdbhh F T 1g0y 2 B I ANTAGONIST PEPTIDE AF10847 ETPFTWEESNAYYWQPYALPL 21 T 0.41 PilJ_C pdbhh F T 1g1e 1 A A MAD1_HUMAN MAX DIMERIZATION PROTEIN RMNIQMLLEAADYLER 16 T 1.8 DUF6117 pdbhh F Eukaryota T 1g1f 2 B B TRI-PHOSPHORYLATED PEPTIDE FROM THE INSULIN RECEPTOR KINASE RDIXETDXXRK 11 T 4.9 Glyco_hydro_108 pdbhh F T 1g1g 2 B B MONO-PHOSPHORYLATED PEPTIDE FROM THE INSULIN RECEPTOR KINASE ETDYXRKGGKGLL 13 T 1.3 LEA_3 pdbhh F T 1g1h 2 B B BI-PHOSPHORYLATED PEPTIDE FROM THE INSULIN RECEPTOR KINASE ETDXXRKGGKGLL 13 T 1.3 LEA_3 pdbhh F T 1g1p 1 A A CO6A_CONER CONOTOXIN EVIA DDCIKPYGFCSLPILKNGLCCSGACVGVCADLX 33 T 0.018 Conotoxin unp F Eukaryota T 1g1s 2 C,D C,D SELPL_HUMAN PSGL-1 QATEYEYLDYDFLPETEPPRPMMDDDDK 28 T 7.6 Coilin_N pdbhh F Eukaryota T 1g37 2 B B THROMBIN NONAPEPTIDE INHIBITOR FEAIPAEYL 9 T 0.45 Hirudin pdbhh F T 1g6g 2 C,D E,F SER-LEU-GLU-VAL-TPO-GLU-ALA-ASPALA-THR-PHE-ALA-LYS SLEVTEADATFAK 13 T 14 TBK1_CCD1 pdbhh F T 1g6m 1 A A 3S1B2_NAJKA SHORT NEUROTOXIN 1 LECHNQQSSQTPTTTGCSGGENNCYKKEWRDNRGYRTERGCGCPSVKKGIGINCCTTDRCNN 62 T 0.032 Hyr1 pdbpercent F Eukaryota T 1g6r 5 E,J P,Q SIYR PEPTIDE SIYRYYGL 8 T 8.9 LEF-9 pdbhh F T 1g70 2 B B RSG-1.2 PEPTIDE DRRRRGSRPSGAERRRRRAAAA 22 T 9.5 BRD4_CDT pdbhh F T 1g7q 3 C P MUCIN 1, TRANSMEMBRANE SAPDTRPA 8 T 32 PNPase_C pdbhh F T 1g89 1 A A CTHL4_BOVIN INDOLICIDIN ILPWKWPWWPWRRX 14 T 0.12 CoV_S2 pdbhh F Eukaryota T 1g92 1 A A POTX_PARCV PAC-TX FLPLLILGSLLMTPPVIQAIHDAQR 25 T 0.84 Viral_Beta_CD pdbhh F Eukaryota T 1g9m 1 A G ENV_HV1H2 ENVELOPE GLYCOPROTEIN GP120 GARSEVVLVNVTENFNMWKNDMVEQMHEDIISLWDQSLKPCVKLTPLCVGAGSCNTSVITQACPKVSFEPIPIHYCAPAGFAILKCNNKTFNGTGPCTNVSTVQCTHGIRPVVSTQLLLNGSLAEEEVVIRSVNFTDNAKTIIVQLNTSVEINCTGAGHCNISRAKWNNTLKQIASKLREQFGNNKTIIFKQSSGGDPEIVTHSFNCGGEFFYCNSTQLFNSTWFNSTWSTEGSNNTEGSDTITLPCRIKQIINMWQKVGKAMYAPPISGQIRCSSNITGLLLTRDGGNSNNESEIFRPGGGDMRDNWRSELYKYKVVKIE 321 T 4.5000000000000003E-23 GP120 pdb T Viruses T 1gag 2 B B BISUBSTRATE PEPTIDE INHIBITOR PATGDFMNMSPVG 13 T 0.69 Glycoprot_B_PH1 pdbhh F T 1gbr 2 B B SOS2_MOUSE SOS-A PEPTIDE SPLLPKLPPKTYKRE 15 T 1.2 PHINT_rpt pdbhh F Eukaryota T 1geb 1 A A CPXA_PSEPU CYTOCHROME P450-CAM MTTETIQSNANLAPLPPHVPEHLVFDFDMYNPSNLSAGVQEAWAVLQESNVPDLVWTRCNGGHWIATRGQLIREAYEDYRHFSSECPFIPREAGEAYDFIPTSMDPPEQRQFRALANQVVGMPVVDKLENRIQELACSLIESLRPQGQCNFTEDYAEPFPIRIFMLLAGLPEEDIPHLKYLTDQMTRPDGSMTFAEAKEALYDYLIPIIEQRRQKPGTDAISIVANGQVNGRPITSDEAKRMCGLLLVGGLDIVVNFLSFSMEFLAKSPEHRQELIERPERIPAACEELLRRFSLVADGRILTSDYEFHGVQLKKGDQILLPQMLSGLDERENACPMHVDFSRQKVSHTTFGHGSHLCLGQHLARREIIVTLKEWLTRIPDFSIAPGAQIQHKSGIVSGVQALPLVWDPATTKAV 415 T 1.6E-05 p450 unppercent F Bacteria T 1gff 1 A 1 VGF_BPG4 BACTERIOPHAGE G4 CAPSID PROTEINS GPF, GPG, GPJ SNVQTSADRVPHDLSHLVFEAGKIGRLKTISWTPVVAGDSFECDMVGAIRLSPLRRGLAVDSRVDIFSFYIPHRHIYGQQWINFMKDGVNASPLPPVTCSSGWDSAAYLGTIPSSTLKVPKFLHQGYLNIYNNYFKPPWSDDLTYANPSNMPSEDYKWGVRVANLKSIWTAPLPPDTRTSENMTTGTSTIDIMGLQAAYAKLHTEQERDYFMTRYRDIMKEFGGHTSYDGDNRPLLLMRSEFWASGYDVDGTDQSSLGQFSGRVQQTFNHKVPRFYVPEHGVIMTLAVTRFPPTHEMEMHYLVGKENLTYTDIACDPALMANLPPREVSLKEFFHSSPDSAKFKIAEGQWYRTQPDRVAFPYNALDGFPFYSALPSTDLKDRVLVNTNNYDEIFQSMQLAHWNMQTKFNINVYRHMPTTRDSIMTS 426 T 2.1E-69 Phage_F pdb T Viruses T 1gg6 1 A A CTRA_BOVIN GAMMA CHYMOTRYPSIN CGVPAIQPVL 10 T 1.7 CaM_bind pdbhh F Eukaryota T 1gje 1 A A IGFBP-1 antagonist CRAGPLQWLCEKYFGX 16 T 2.3 DUF6497 pdbhh F T 1gjf 1 A A IGFBP-1 antagonist XRAGPLQWLAEKYQGX 16 T 9.2 Pico_P2B pdbhh F T 1gjg 1 A A IGFBP-1 antagonist XRPLQWLAEKYFQX 14 T 2.7 DUF5053 pdbhh F T 1gq0 1 A A ANTIAMOEBIN I XFXXXXGLXXPQXPXPX 17 T 0.21 Pep_deformylase pdbhh F T 1gvu 2 B I ANGT_BOVIN INHIBITOR, H189 PHPFHXVIHK 10 T 0.74 Ins134_P3_kin_N pdbhh F Eukaryota T 1gwk 1 A,B A,B Q9C171_PIREQ NCP1 MNVRATYTVIFKNASGLPNGYDNWGWGCTLSYYGGAMIINPQEGKYGAVSLKRNSGSFRGGSLRFDMKNEGKVKILVENSEADEKFEVETISPSDEYVTYILDVDFDLPFDRIDFQDAPGNGDRIWIKNLVHSTGSADDFVDPINLEHHHHHH 153 T 0.22 RNase_H pdbpssm F Eukaryota T 1gxc 2 B,D,F,H B,E,H,K SYNTHETIC PHOSPHOPEPTIDE RHFDTYLIRR 10 T 5.6 DUF4650 pdbhh F T 1gy3 3 E,F E,F SUBSTRATE PEPTIDE HHASPRK 7 T 9 DUF1324 pdbhh F T 1h24 3 E E E2F1_HUMAN E2F-1, PRB-BINDING PROTEIN E2F-1, PBR3, RETINOBLASTOMA-ASSOCIATED PROTEIN 1, RBAP-1 PVKRRLDLE 9 T 2.9 Humanin pdbhh F Eukaryota T 1h26 3 E E P53_HUMAN P53 RECRUITMENT PEPTIDE 11MER STSRHKKLMFK 11 T 29 DUF420 pdbhh F Eukaryota T 1h28 3 E,F E,F RBL1_HUMAN 107 KDA RETINOBLASTOMA-ASSOCIATED PROTEIN, PRB1, P107, P107 RECRUITMENT PEPTIDE 11MER AGSAKRRLFGE 11 T 0.74 PPV_E1_N pdbhh F Eukaryota T 1h3h 2 B B LCP2_HUMAN SLP-76 APSIDRSTKPA 11 T 39 TagF_N pdbhh F Eukaryota T 1h6e 2 B P CTLA4_HUMAN CYTOTOXIC T-LYMPHOCYTE-ASSOCIATED ANTIGEN 4, CTLA-4, CD152 ANTIGEN TTGVYVKMPPT 11 T 0.2 TMEM190 unppssm F Eukaryota T 1ha8 1 A A MER23_EUPRA ER-23 GECEQCFSDGGDCTTCFNNGTGPCANCLAGYPAGCSNSDCTAFLSQCYGGC 51 T 1.1 DUF3716 pdbhh F Eukaryota T 1haa 2 B B HIGH AFFINITY PEPTIDE WRYYESSLEPYPD 13 T 9.4 DUF1489 pdbhh F T 1hbt 3 C I P596 Inhibitor peptide XPXGGGGDYEPIPEEAXX 18 T 0.05 Hirudin pdbhh F T 1hc9 3 C,D C,D HIGH AFFINITY PEPTIDE WRYYESSLLPYPD 13 T 4 Cys_rich_VLP pdbhh F T 1hcw 1 A _ BBA1 XYTVPSXTFSRSDELAKLLRLHAGX 25 T 11 DUF3196 pdbhh F T 1hd9 1 A A BOWMAN-BIRK INHIBITOR DERIVED PEPTIDE XCTASIPPQCY 11 T 0.02 Bowman-Birk_leg pdb F T 1hes 2 B P LYAM3_HUMAN P-SELECTIN PEPTIDE SHLGTYGVFTNAAFDPSP 18 T 1 YoaP pdbhh F Eukaryota T 1hff 1 A A VMI2_HHV8P VMIP-II, VMIP-1B LGASWHRPDK 10 T 5 Apc15p pdbhh T Viruses T 1hgv 1 A A CAPSD_BPH75 PH75 BACTERIOPHAGE MAJOR COAT PROTEIN MDFNPSEVASQVTNYIQAIAAAGVGVLALAIGLSAAWKYAKRFLKG 46 T 0.00018 Phage_Coat_B pdb T Viruses T 1hh6 3 C C PEP-4 DATPEDLGARL 11 T 4.8 CdiI pdbhh F T 1hh9 3 C C PEP-2 DATPEDLNAKLX 12 T 6.6 DUF6489 pdbhh F T 1hhn 1 A A CALR_RAT CALRETICULIN SKKIKDPDAAKPEDWDERAKIDDPTDSKPEDWDKPEHIPDPDAKKPEDWDEEMDGEWEPPVIQNPEYKGEWKPRQIDNPDYKGTWIHPEIDNPEYSPDANI 101 T 2.1E-21 Calreticulin unp F Eukaryota T 1hi6 3 C C PEPTIDE 5 DATPEWLGARLX 12 T 4.1 Birna_VP3 pdbhh F T 1hin 3 C P INFLUENZA HEMAGGLUTININ HA1 (STRAIN X47) (RESIDUES 100-107) YDVPDYAS 8 T 7.1 DUF4535 pdbhh F T 1hjk 1 A,B A,B PPB_ECOLI ALKALINE PHOSPHATASE TPEMPVLENRAAQGDITAPGGARRLTGDQTAALRDSLSDKPAKNIILLIGDGMGDSEITAARNYAEGAGGFFKGIDALPLTGQYTHYALNKKTGKPDYVTDSAASATAWSTGVKTYNGALGVDIHEKDHPTILEMAKAAGLATGNVSTAELQDATPAALVAHVTSRKCYGPSATSEKCPGNALEKGGKGSITEQLLNARADVTLGGGAKTFAETATAGEWQGKTLREQAQARGYQLVSDAASLNSVTEANQQKPLLGLFADGNMPVRWLGPKATYHGNIDKPAVTCTPNPQRNDSVPTLAQMTDKAIELLSKNEKGFFLQVEGASIDKQDQAANPCGQIGETVDLDEAVQRALEFAKKEGNTLVIVTADHAHASQIVAPDTKAPGLTQALNTKDGAVMVMSYGNSEEDSQEHTGSQLRIAAYGPHAANVVGLTDQTDLFYTMKAALGLK 449 T 1.8E-10 Alk_phosphatase pdbpssm F Bacteria T 1hl3 2 B B PRO-ILE-ASP-LEU-SER-LYS-LYS PEPTIDE PIDLSKK 7 T 2.3 NRIP1_repr_2 pdbhh F T 1hoy 2 B B MIMOTOPE OF THE NICOTINIC ACETYLCHOLINE RECEPTOR HRYYESSLEPWYPD 14 T 4.1 NDUF_C2 pdbhh F T 1hqa 1 A,B A,B PPB_ECOLI ALKALINE PHOSPHATASE TPEMPVLENRAAQGDITAPGGARRLTGDQTAALRDSLSDKPAKNIILLIGDGMGDSEITAARNYAEGAGGFFKGIDALPLTGQYTHYALNKKTGKPDYVTDSAASATAWSTGVKTYNGALGVDIHEKDHPTILEMAKAAGLATGNVSTAELQDATPAALVAHVTSRKCYGPSATSEKCPGNALEKGGKGSITEQLLNARADVTLGGGAKTFAETATAGEWQGKTLREQAQARGYQLVSDAASLNSVTEANQQKPLLGLFADGNMPVRWLGPKATYHGNIDKPAVTCTPNPQRNDSVPTLAQMTDKAIELLSKNEKGFFLQVEGASIDKQDHAANPCGQIGETVDLDEAVQRALEFAKKEGNTLVIVTADHAHASQIVAPDTKAPGLTQALNTKDGAVMVMSYGNSEEDSQEQTGSQLRIAAYGPHAANVVGLTDQTDLFYTMKAALGLK 449 T 5.4E-10 Alk_phosphatase pdbpercent F Bacteria T 1hqj 1 A,B,C,D,E,F,G,H,I,J,K,L A,B,C,D,E,F,G,H,I,J,K,L SIN-ASP-GLU-LEU-GLU-ALA-ARG-ILE-ARG-GLU-LEU-GLU-ALA-ARG-ILE-LYS-NH2 DELERRIRELEARIK 15 T 0.024 DUF1192 pdbhh F T 1hqq 2 B,D,F,H E,F,G,H MINI-PROTEIN 2 RCCHPQCGAVEECR 14 T 0.74 Enterotoxin_ST pdbhh F T 1hr1 1 A A CTHL4_BOVIN INDOLICIDIN ILAWKWAWWAWRRX 14 T 0.21 SAG unp F Eukaryota T 1hr8 3 I,J,K,L O,P,Q,R COX4_YEAST COX4 LSLRQSIRFFKPATRTLCSSRYLL 24 T 9.1 OTCace_N pdbhh F Eukaryota T 1hr9 3 I,J,K,L O,P,Q,R MDHM_YEAST MDH1 LSRVAKRA 8 T 20 Peptidase_S58 pdbhh F Eukaryota T 1hu5 1 A A OVISPIRIN-1 KNLRRIIRKIIHIIKKYG 18 T 1.1 Lambda_CIII pdbhh F T 1hu6 1 A A G10 NOVISPIRIN KNLRRIIRKGIHIIKKYG 18 T 1.6 YabA pdbhh F T 1hu7 1 A A T7 NOVISPIRIN KNLRRITRKIIHIIKKYG 18 T 2.3 Lambda_CIII pdbhh F T 1hvz 1 A A RTD-1 GFCRCLCRRGVCRCICTR 18 T 0.63 DUF5354 pdbhh F T 1hxl 2 C,D C,D MINI-PROTEIN 2 RCCHPQCGMAEECR 14 T 0.56 Cys_rich_CWC pdbhh F T 1hxz 2 C,D C,D MINI-PROTEIN 2 RCCHPQCGMVEECR 14 T 0.64 Cys_rich_CWC pdbhh F T 1hy2 2 E,F,G,H E,F,G,H MINI-PROTEIN 1 CCHPQCGAAYSC 12 T 0.074 Enterotoxin_ST pdbhh F T 1i1f 3 C,F C,F I1F FLKEPVHGV 9 T 6.9 DUF2115 pdbhh F T 1i1y 3 C,F C,F I1Y YLKEPVHGV 9 T 8.3 DUF2115 pdbhh F T 1i2v 1 A A DEFN_HELVI DEFENSIN HELIOMICIN DKLIGSCVWGAVNYTSDCNGECLLRGYKGGHCGSFANVNCWCET 44 T 0.00019 Toxin_3 unppssm F Eukaryota T 1i3z 2 B B SLAF1_HUMAN SIGNALING LYMPHOCYTIC ACTIVATION MOLECULE VEKKSLTIXAQVQK 14 T 0.1 MFS_1 unppssm F Eukaryota T 1i5h 2 B B SCNNB_RAT RENAL BP2 PEPTIDE GSTLPIPGTPPPNYDSL 17 T 0.14 Myc_target_1 pdbhh F Eukaryota T 1i6y 1 A A ION-SELECTIVE LIGAND A1 XCRVVRGDYLDCX 13 T 0.96 YedD pdbhh F T 1i7r 3 C,F C,F 9 RESIDUE PEPTIDE FAPGFFPYL 9 T 2.7 LINES_C pdbhh F T 1i7t 3 C,F C,F 9 RESIDUE PEPTIDE ALWGVFPVL 9 T 0.95 PK_C pdbhh F T 1i7u 3 C,F C,F 9 RESIDUE PEPTIDE ALWGFVPVL 9 T 0.23 Tom7 pdbhh F T 1i8e 1 A A ION-SELECTIVE LIGAND A22 XCYCSLRGDCYCX 13 T 3.3 CRAM_rpt pdbhh F T 1i8g 1 A A MPIP3_XENLA M-PHASE INDUCER PHOSPHATASE 3 EQPLTPVTDL 10 T 12 DUF4636 pdbhh F Eukaryota T 1i8h 1 A A TAU_HUMAN PHF-TAU KVSVVRTPPKSPS 13 T 13 Disulph_isomer pdbhh F Eukaryota T 1i8i 3 C C EPIDERMAL GROWTH FACTOR RECEPTOR, EGFRVIII PEPTIDE ANTIGEN EEKKGNYVVTDH 12 T 1.1 MFA1_2 pdbhh F T 1i8n 1 A,B,C A,B,C LAPP_HAEOF ANTI-PLATELET PROTEIN QDEDAGGAGDETSEGEDTTGSDETPSTGGGGDGGNEETITAGNEDCWSKRPGWKLPDNLLTKTEFTSVDECRKMCEESAVEPSCYILQINTETNECYRNNEGDVTWSSLQYDQPNVVQWHLHACSK 126 T 0.0023 PAN_1 pdbpssm F Eukaryota T 1i93 1 A A ION-SELECTIVE LIGAND D16 XCHWLRGDMRRCX 13 T 3.5 DNA_photolyase pdbhh F T 1i98 1 A A ION-SELECTIVE LIGAND D18 XCRWLRGDWRQCX 13 T 1.8 PTN_MK_C pdbhh F T 1i9f 2 B B RSG-1.2 PEPTIDE RRGSRPSGAERRRRRAAAA 19 T 7.6 BRD4_CDT pdbhh F T 1ic9 1 A A TH10AOX SKYEYTIXSYTFRGPGCPTLKPXITVRCE 29 T 1.1 DUF4360 pdbhh F T 1icl 1 A A TH1OX SKYEYTVXSYTFRGPGCPTVKPXISLRCE 29 T 2.2 DUF4360 pdbhh F T 1ico 1 A A TH10BOX SKYEYTIXSYTFRGPGCPTVKPXVTIRCE 29 T 1.4 DUF4360 pdbhh F T 1id6 1 A A SYR6 SVQARWEAAFDLDLY 15 T 2.5 DUF3841 pdbhh F T 1ieo 1 A A CT1B_CONMR PROTEIN MRIB-NH2 VGVCCGYKLCHPCX 14 T 0.47 Oxidored-like unphh F Eukaryota T 1ifh 3 C P INFLUENZA HEMAGGLUTININ HA1 (STRAIN X47) (RESIDUES 101-107) XDVPDYAS 8 T 10 AbiTii pdbhh F T 1igw 1 A,B,C,D A,B,C,D ACEA_ECOLI ISOCITRASE, ISOCITRATASE, ICL MKTRTQQIEELQKEWTQPRWEGITRPYSAEDVVKLRGSVNPECTLAQLGAAKMWRLLHGESKKGYINSLGALTGGQALQQAKAGIEAVYLSGWQVAADANLAASMYPDQSLYPANSVPAVVERINNTFRRADQIQWSAGIEPGDPRYVDYFLPIVADAEAGFGGVLNAFELMKAMIEAGAAAVHFEDQLASVKKCGHMGGKVLVPTQEAIQKLVAARLCADVTGVPTLLVARTDADAADLITSDCDPYDSEFITGERTSEGFFRTHAGIEQAISRGLAYAPYADLVWCETSTPDLELARRFAQAIHAKYPGKLLAYNCSPSFNWQKNLDDKTIASFQQQLSDMGYKFQFITLAGIHSMWFNMFDLANAYAQGEGMKHYVEKVQQPEFAAAKDGYTFVSHQQEVGTGYFDKVTTIIQGGTSSVTALTGSTEESQF 434 T 2.3E-49 ICL pdb F Bacteria T 1ihj 2 C,D C,D NORPA GKTEFCA 7 T 0.054 cobW pdbhh F T 1iid 2 B O Octapeptide GLYASKLA GLYASKLA 8 T 8 RLAN pdbhh F T 1iij 1 A A ERBB2_RAT ERBB-2 RECEPTOR PROTEIN-TYROSINE KINASE EQRASPVTFIIATVVGVLLFLILVVVVGILIKRRR 35 T 0.0014 Mucin15 pdbhh F Eukaryota T 1ilp 2 C C CXCR1_HUMAN CXCR-1,CDW128A,HIGH AFFINITY INTERLEUKIN-8 RECEPTOR A,IL-8R A,IL-8 RECEPTOR TYPE 1 XMWDFDDXMPPADEDYSPX 19 T 0.01 FA_desaturase unppercent F Eukaryota T 1im1 1 A _ CA1_CONIM ALPHA-CONOTOXIN IM1 GCCSDPRCAWRC 12 T 0.0098 Toxin_8 unphh F Eukaryota T 1im9 3 C,G C,G HLA-Cw4-specific peptide QYDDAVYKL 9 T 22 Cas_Cas02710 pdbhh F T 1iq5 2 B B KKCC_CAEEL CA2+/CALMODULIN DEPENDENT KINASE KINASE VRVIPRLDTLILVKAMGHRKRFGNPFR 27 T 5.7 HCNGP pdbhh F Eukaryota T 1ir3 2 B B PEPTIDE SUBSTRATE KKKLPATGDYMNMSPVGD 18 T 0.064 Gram_pos_anchor pdb F T 1irs 2 B B IL4RA_HUMAN IL-4 RECEPTOR PHOSPHOPEPTIDE LVIAGNPAXRS 11 T 0.91 DUF1890 pdbhh F Eukaryota T 1isq 2 B B RFCL_PYRFU replication factor C large subunit XKQATLFDFLKK 12 T 0.018 Peptidase_C37 unppercent F Archaea T 1iw4 1 A A ITRP_HALRO trypsin inhibitor AHMDCTEFNPLCRCNKMLGDLICAVIGDAKEEHRNMCALCCEHPGGFEYSNGPCE 55 T 5.1 DUF5913 pdbhh F Eukaryota T 1iwk 1 A A CPXA_PSEPU CYTOCHROME P450-CAM MTTETIQSNANLAPLPPHVPEHLVFDFDMYNPSNLSAGVQEAWAVLQESNVPDLVWTRCNGGHWIATRGQLIREAYEDYRHFSSECPFIPREAGEAYDFIPTSMDPPEQRQFKALANQVVGMPVVDKLENRIQELACSLIESLRPQGQCNFTEDYAEPFPIRIFMLLAGLPEEDIPHLKYLTDQMTRPDGSMTFAEAKEALYDYLIPIIEQRRQKPGTDAISIVANGQVNGRPITSDEAKRMCGLLLVGGLDTVVNFLSFSMEFLAKSPEHRQELIERPERIPAACEELLRRFSLVADGRILTSDYEFHGVQLKKGDQILLPQMLSGLDERENACPMHVDFSRQKVSHTTFGHGSHLCLGQHLARREIIVTLKEWLTRIPDFSIAPGAQIQHKSGIVSGVQALPLVWDPATTKAV 415 T 1.6E-05 p450 unppercent F Bacteria T 1iyc 1 A A SCAB_ORYRH scarabaecin ELPKLPDDKVLIRSRSNCPKGKVWNGFDCKSPFAFS 36 T 0.083 DUF5615 unp F Eukaryota T 1j19 2 B B ICAM2_MOUSE ICAM-2 CYTOPLASMIC PEPTIDE, ICAM-2 CYTOPLASMIC TAIL RRRTGTYGVLAAWRRL 16 T 0.28 DUF4231 unppercent F Eukaryota T 1j4l 2 B P RAD9_YEAST DNA REPAIR PROTEIN RAD9 EVELTQELP 9 T 8.5 SidE_DUB pdbhh F Eukaryota T 1j4m 1 A A MBH12 RGKWTYNGITYEGR 14 T 3.8 DUF4923 pdbhh F T 1j4p 2 B B RAD9_YEAST DNA REPAIR PROTEIN RAD9 KKMTFQTPTDPLE 13 T 11 YugN pdbhh F Eukaryota T 1j4q 2 B B RAD9_YEAST DNA REPAIR PROTEIN RAD9 SLEVTEADATFVQ 13 T 41 Myticin-prepro pdbhh F Eukaryota T 1j4x 2 B D DDE(AHP)(TPO)G(PTR)VATR DDEXTGXVATR 11 T 2.8 BioW pdbhh F T 1j51 1 A,B,C,D A,B,C,D CPXA_PSEPU CAMPHOR 5-MONOOXYGENASE TTETIQSNANLAPLPPHVPEHLVFDFDMYNPSNLSAGVQEAWAVLQESNVPDLVWTRCNGGHWIATRGQLIREAYEDYRHFSSECPWIPREAGEAFDFIPTSMDPPEQRQFRALANQVVGMPVVDKLENRIQELACSLIESLRPQGQCNFTEDYAEPFPIRIFMLLAGLPEEDIPHLKYLTDQMTRPDGSMTFAEAKEALYDYLIPIIEQRRQKPGTDAISIVANGQVNGRPITSDEAKRMCGLLLLGGLDTVVNFLSFSMEFLAKSPEHRQELIERPERIPAACEELLRRFSLVADGRILTSDYEFHGVQLKKGDQILLPQMLSGLDERENAAPMHVDFSRQKVSHTTFGHGSHLCLGQHLARREIIVTLKEWLTRIPDFSIAPGAQIQHKSGIVSGVQALPLVWDPATTKAV 414 T 1.6E-05 p450 unppercent F Bacteria T 1j5b 1 A A ANPA_PSEAM Antifreeze protein type 1 analogue DVASDAKAAAELVAANAKAAAELVAANAKAAAEAVARX 38 T 9.3 DUF3157 unppssm F Eukaryota T 1j5l 1 A A MT1_HOMAM CUMT-1 PCEKCTSGCKCPSKDECAKTCSKPCSCCPT 30 T 1.5 Metallothio_5 pdbhh F Eukaryota T 1j5m 1 A A MT1_HOMAM CUMT-1 PGPCCKDKCECAEGGCKTGCKCTSCRCA 28 T 0.53 Metallothio_5 pdbhh F Eukaryota T 1jac 2 B,D,F,H B,D,F,H LECB1_ARTIN JACKFRUIT AGGLUTININ NEQSGKSQTVIVGSWGAKVS 20 T 2.9 DUF3842 pdbhh F Eukaryota T 1jbf 1 A A IGE06 XNLPRCTEGPWGWVCM 16 T 2.2 Mss4 pdbhh F T 1jbl 1 A A SFTI1_HELAN SFTI-1 GRCTKSIPPICFPD 14 T 0.0029 Bowman-Birk_leg pdb F Eukaryota T 1jbr 4 D,E A,B RNMG_ASPRE RIBONUCLEASE MITOGILLIN ATWTCINQQLNPKTNKWEDKRLLYSQAKAESNSHHAPLSDGKTGSSYPHWFTNGYDGNGKLIKGRTPIKFGKADCDRPPKHSQNGMGKDDHYLLEFPTFPDGHDYKFDSKKPKEDPGPARVIYTYPNKVFCGIVAHQRGNQGDLRLCSH 149 T 46 Cuticle_2 pdbhh F Eukaryota T 1jbu 3 C X Peptide exosite inhibitor A-183 EEWEVLCWTWETCER 15 T 0.65 Rad10 pdbhh F T 1jcs 3 C C SYNTHETIC HEXAPEPTIDE TKCVFM TKCVFM 6 T 2.4 Plk4_PB2 pdbhh F T 1jd5 2 B B GRIM_DROME cell death protein GRIM AIAYFIPDQA 10 T 0.61 DUF5521 unppercent F Eukaryota T 1jd6 2 B B HID_DROME head involution defective protein AVPFYLPEGG 10 T 0.62 DUF4367 pdbhh F Eukaryota T 1jdk 1 A A ACETYL GROUP XIWGESGKLIXTTA 14 T 0.038 GP41 pdbhh F T 1je9 1 A A 3S1C_NAJKA SHORT NEUROTOXIN 1 - MONOCLED COBRA LECHNQQSSQAPTTKTCSGETNCYKKWWSDHRGTIIERGCGCPKVKPGVNLNCCRTDRCNN 61 T 0.038 Toxin_TOLIP pdb F Eukaryota T 1jeg 2 B B PTN22_MOUSE HEMATOPOIETIC CELL PROTEIN-TYROSINE PHOSPHATASE 70Z-PEP SRRTDDEIPPPLPERTPESFIVVEE 25 T 6 DUF6436 pdbhh F Eukaryota T 1jg3 2 C,D C,D VYP(L-iso-ASP)HA VYPXHA 6 T 3.1 DUF4140 pdbhh F T 1jgd 3 C C peptide s10R RRLLRGHNQY 10 T 11 DUF2570 pdbhh F T 1jge 3 C C peptide m9 GRFAAAIAK 9 T 6.5 Ribosomal_L13 pdbhh F T 1jjg 1 A A Q9Q8E9_MYXVL M156R MTVIKPSSRPRPRKNKNIKVNTYRTSAMDLSPGSVHEGIVYFKDGIFKVRLLGYEGHECILLDYLNYRQDTLDRLKERLVGRVIKTRVVRADGLYVDLRRFF 102 T 1.6 RNase_II_C_S1 pdbhh T Viruses T 1jky 2 B B MP2K2_HUMAN MAPKK2, MEK2 MLARRKPVLPALTINP 16 T 0.4 DHHA2 pdbhh F Eukaryota T 1jlp 1 A A CM3F_CONPU PSI-CONOTOXIN PIIIF GPPCCLYGSCRPFPGCYNALCCRKX 25 T 0.11 Toxin_7 unphh F Eukaryota T 1jlz 1 A A KA131_TITOB Tityustoxin alpha-KTx ACGSCRKKCKGSGKCINGRCKCY 23 T 0.0072 Toxin_2 pdb F Eukaryota T 1jmt 2 B B U2AF2_HUMAN U2 SNRNP AUXILIARY FACTOR LARGE SUBUNIT KKKVRKYWDVPPPGFEHITPMQYKAMQA 28 T 0.0013 Transformer unp F Eukaryota T 1jn5 3 C C FG-repeat GQSPGFGQGGSV 12 T 1.4 PGF-CTERM pdbhh F T 1jn7 1 A A USH_DROME U-shaped TRANSCRIPTIONAL COFACTOR GSAAEVMKKYCSTCDISFNYVKTYLAHKQFYHKNKP 36 T 0.0003 zf-met pdb F Eukaryota T 1jo3 1 A,B A,B VAL-GRAMICIDIN B XGAXAXVXWXFXWXWX 16 T 0.53 MAP17 pdbhh F T 1jo4 1 A,B A,B GRAMICIDIN D XGAXAXVXWXYXWXWX 16 T 3.1 MAP17 pdbhh F T 1jot 2 B B LECB2_MACPO AGGLUTININ GRNGKSQSIIVGPWGDRVTN 20 T 0.9 DUF3842 pdbhh F Eukaryota T 1jp5 2 C,D C,D epitope peptide corresponding to N-terminus of HIV-1 protease PQITLWQRR 9 T 0.5 Tfb2_C pdbhh F T 1jpf 3 C C LCMV peptidic epitope gp276 SGVENPGGYCL 11 T 9.3E-05 Arena_glycoprot pdbhh F T 1jpg 3 C C LCMV peptidic epitope np396 FQPQNGQFI 9 T 1.1 Arena_ncap_C pdbhh F T 1jpl 2 B,D,F,H E,F,G,H MPRI_HUMAN Cation-Independent Mannose 6-phosphate receptor FHDDSDEDLLHI 12 T 8 NTF-like pdbhh F Eukaryota T 1jsp 1 A A P53_HUMAN tumor protein p53 SHLKSKKGQSTSRHKXLMFK 20 T 0.081 Zn_Tnp_IS1 pdbpssm F Eukaryota T 1ju5 2 B B CRK_MOUSE PROTO-ONCOGENE C-CRK, ADAPTER MOLECULE CRK, P38 EPGPXAQPSVNTK 13 T 1.6 Shugoshin_C pdbhh F Eukaryota T 1jui 2 E,F,G,H P,Q,R,S 10-mer Peptide MYWYPYASGS 10 T 3.9 DUF3263 pdbhh F T 1jwg 2 B,D C,D MPRI_HUMAN M6PR SFHDDSDEDLLHI 13 T 10 NTF-like pdbhh F Eukaryota T 1jy4 1 A,B A,B B4DIMER RGECKFTVXGRTALNTXAVQKWHFVLXGYKCEILA 35 T 2.6 NAAA-beta pdbhh F T 1jy9 1 A A DP-TT2 TTTTRYVEVXGKKILQTTTT 20 T 16 DUF6450 pdbhh F T 1jyc 2 E,F,G,H P,Q,R,S 15-mer peptide RVWYPYGSYLTASGS 15 T 2.2 DUF6375 pdbhh F T 1jyi 2 E,F,G,H P,Q,R,S 12-mer peptide DVFYPYPYASGS 12 T 1.8 XRN_M pdbhh F T 1jyr 2 B L peptide: PSpYVNVQN APSXVNVQN 9 T 0.8 SH3-WW_linker pdbhh F T 1jzp 1 A A CAC1S_RABIT Skeletal Dihydropydrine Receptor TSAQKAKAEERKRRKMSRGLX 21 T 0.66 DUF1682 pdbhh F Eukaryota T 1k2d 3 C P MBP_HUMAN MBP PEPTIDE HSRGGASQYRPSQRHGTGSGSGS 23 T 2.2 Selenoprotein_S pdbhh F Eukaryota T 1k3a 2 B B IRS1_HUMAN IRS-1 KKKSPGEYVNIEFG 14 T 0.41 DUF4834 pdbhh F Eukaryota T 1k83 11 K M AAMAT_AMAPH AMATOXIN XXGIGCNP 8 T 0.85 DUF3085 pdbhh F Eukaryota T 1k91 1 A A CALR_RAT CRP55; CALREGULIN; HACBP; ERP60; CALBP; CALCIUM-BINDING PROTEIN 3; CABP3 GKPEHIPDPDAKKPEDWDEEMDGEWEPPVIQNPEYKG 37 T 2.1E-21 Calreticulin unp F Eukaryota T 1k9c 1 A A CALR_RAT CRP55; CALREGULIN; HACBP; ERP60; CALBP; CALCIUM-BINDING PROTEIN 3; CABP3 SKKIKDPDAAKPEDWDERAKIDDPTDSKPEDWDKPEHIPDPDAKKPEDWDEEMDGEWEPPVIQNPEYKGEWKPR 74 T 2.1E-21 Calreticulin unp F Eukaryota T 1ka6 2 B B SLAF1_HUMAN SIGNALING LYMPHOCYTIC ACTIVATION MOLECULE RKSLTIXAX 9 T 0.1 MFS_1 unppssm F Eukaryota T 1ka7 2 B B SLAF1_HUMAN SIGNALING LYMPHOCYTIC ACTIVATION MOLECULE RKSLTIYAQVQK 12 T 0.1 MFS_1 unppssm F Eukaryota T 1kat 2 C,D X,Y V107 GGNECDIARMWEWECFERL 19 T 6 ARF7EP_C pdbhh F T 1kcn 1 A A e109 zeta peptide ALCPAVCYVGGKALCPDVCYVX 22 T 3.8 MVL pdbhh F T 1kco 1 A A e131 Zeta Peptide VQCPHFCYELDYELCPDVCYVX 22 T 1.7 Prot_inhib_II pdbhh F T 1kfp 1 A A GOME_ACAGO GOMESIN QCRRLCYKQRCVTYCRGRX 19 T 0.0046 PanZ unp F Eukaryota T 1kh4 1 A,B A,B PPB_ECOLI ALKALINE PHOSPHATASE TPEMPVLENRAAQGNITAPGGARRLTGDQTAALRNSLSDKPAKNIILLIGDGMGDSEITAARNYAEGAGGFFKGIDALPLTGQYTHYALNKKTGKPDYVTDSAASATAWSTGVKTYNGALGVDIHEKDHPTILEMAKAAGLATGNVSTAELQDATPAALVAHVTSRKCYGPSATSQKCPGNALEKGGKGSITEQLLNARADVTLGGGAKTFAETATAGEWQGKTLREEAEARGYQLVSDAASLNSVTEANQQKPLLGLFADGNMPVRWLGPKATYHGNIDKPAVTCTPNPQRNDSVPTLAQMTDKAIELLSKNEKGFFLQVEGASIDKQNHAANPCGQIGETVDLDEAVQRALEFAKKEGNTLVIVTADHAHASQIVAPDTKAPGLTQALNTKDGAVMVMSYGNSEEDSQEHTGSQLRIAAYGPHAANVVGLTDQTDLFYTMKAALGLK 449 T 6.8E-10 Alk_phosphatase unppssm F Bacteria T 1kh7 1 A,B A,B PPB_ECOLI alkaline phosphatase TPEMPVLENRAAQGNITAPGGARRLTGDQTAALRNSLSDKPAKNIILLIGDGMGDSEITAARNYAEGAGGFFKGIDALPLTGQYTHYALNKKTGKPDYVTDSAASATAWSTGVKTYNGALGVDIHEKDHPTILEMAKAAGLATGNVSTAELQGATPAALVAHVTSRKCYGPSKTSQKCPGNALEKGGKGSITEQLLNARADVTLGGGAKTFAETATAGEWQGKTLREEAEARGYQLVSDAASLNSVTEANQQKPLLGLFADGNMPVRWLGPKATYHGNIDKPAVTCTPNPQRNDSVPTLAQMTDKAIELLSKNEKGFFLQVEGASIDKQNHAANPCGQIGETVDLDEAVQRALEFAKKEGNTLVIVTADHAHASQIVAPDTKAPGLTQALNTKDGAVMVMSYGNSEEDSQEHTGSQLRIAAYGPHAANVVGLTDQTDLFYTMKAALGLK 449 T 6.8E-10 Alk_phosphatase unppssm F Bacteria T 1khj 1 A,B A,B PPB_ECOLI Alkaline phosphatase TPEMPVLENRAAQGNITAPGGARRLTGDQTAALRNSLSDKPAKNIILLIGDGMGDSEITAARNYAEGAGGFFKGIDALPLTGQYTHYALNKKTGKPDYVTDSAASATAWSTGVKTYNGALGVDIHEKDHPTILEMAKAAGLATGNVSTAELQHATPAALVAHVTSRKCYGPSATSQKCPGNALEKGGKGSITEQLLNARADVTLGGGAKTFAETATAGEWQGKTLREEAEARGYQLVSDAASLNSVTEANQQKPLLGLFADGNMPVRWLGPKATYHGNIDKPAVTCTPNPQRNDSVPTLAQMTDKAIELLSKNEKGFFLQVEGASIDKQNHAANPCGQIGETVDLDEAVQRALEFAKKEGNTLVIVTADHAHASQIVAPDTKAPGLTQALNTKDGAVMVMSYGNSEEDSQEHTGSQLRIAAYGPHAANVVGLTDQTDLFYTMKAALGLK 449 T 6.8E-10 Alk_phosphatase unppssm F Bacteria T 1kj7 2 C P POL_HV1JR gag polyprotein PATIMMQRGN 10 T 0.6 HypA unp T Viruses T 1kjh 2 C P POL_HV1B1 POL POLYPROTEIN IRKILFLDGI 10 T 0.011 Spermine_synt_N pdbhh T Viruses T 1kjv 3 C P UBQL1_RAT peptide NPR NPRAMQALL 9 T 1.1 STI1 unp F Eukaryota T 1kkq 2 E,F,G,H E,F,G,H NCOR2_HUMAN NUCLEAR COREPRESSOR SMRT C-TERMINAL RECEPTOR INTERACTING MOTIF NMGLEAIIRKALMGKYDQW 19 T 4.2 RHH_7 pdbhh F Eukaryota T 1kl3 2 E,F,G,H E,F,G,H strep-tag II peptide NWSHPQFEK 9 T 1.3 CreD pdbhh F T 1klq 2 B B MBP1 SWYSYPPPQRAV 12 T 8.6 NADHdh_A3 pdbhh F T 18 KDA PULMONARY-SURFACTANT PROTEIN CRALIKRIQAMIPKG 15 T 0.013 SapB_1 unphh F Eukaryota T 1ko6 2 B,D B,D NUP98_HUMAN NUCLEOPORIN NUP98, 98KDA NUCLEOPORIN SKYGLQDSDEEEEEHPSKTSTKKLKTAPLPPASQTTPLQMALNGKPAPPPQVEKKGQLEHHHHH 64 T 74 SIN1 pdbhh F Eukaryota T 1kp6 1 A A KP6T_UMV6 PROTEIN (TOXIN) NNAFCAGFGLSCKWECWCTAHGTGNELRYATAAGCGDHLSKSYYDARAGHCLFSDDLRNQFYSHCSSLNNNMSCRSLSK 79 T 0.3 YobH unp T Viruses T 1kpr 3 E,F P,Q Peptide VMAPRTVLL VMAPRTVLL 9 T 0.0013 UL40 pdbhh F T 1ktl 3 E,F P,Q PEPTIDE B27 VTAPRTLLL 9 T 0.24 UL40 pdbhh F T 1ku8 2 B,D,F,H B,D,F,H LECB1_ARTIN AGGLUTININ BETA CHAIN NEQSGISQTVIVGPWGAK 18 T 2.3 DUF3842 pdbhh F Eukaryota T 1kvd 1 A,C A,C TOXK_MILFA SMK TOXIN WSLRWRMQKSTTIAAIAGCSGAATFGGLAGGIVGCIAAGILAILQGFEVNWHNGGGGDRSNPV 63 T 0.63 DUF1056 pdbhh F Eukaryota T 1kvd 2 B,D B,D TOXK_MILFA SMK TOXIN GEATTIWGVGADEAIDKGTPSKNDLQNMSADLAKNGFKGHQGVACSTVKDGNKDVYMIKFSLAGGSNDPGGSPCSDD 77 T 11 IMS_HHH pdbhh F Eukaryota T 1kvf 1 A A PROTEIN: EMP-18 Receptor Agonist TYSCHFGPLTWVCKPQX 17 F F T 1kvg 1 A A Protein: EPO-3 Receptor Agonist SCHFGPLGWVCKX 13 F F T 1ky6 2 B P EPN1_RAT EPSIN 1 FSDPWGG 7 T 0.33 Imm32 pdbhh F Eukaryota T 1ky7 2 B P AMPH_HUMAN AMPHIPHYSIN SFFEDNFVPE 10 T 0.058 CCDC32 pdbhh F Eukaryota T 1kyd 2 B P EPN1_HUMAN EPSIN 1 GSDPWK 6 T 0.58 DUF5054 pdbhh F Eukaryota T 1kyf 2 B P EPS15_MOUSE PROTEIN EPS15, AF-1P PROTEIN GSDPFK 6 T 2.9 DUF5054 pdbhh F Eukaryota T 1l0s 1 A,B,C,D A,B,C,D Q9GTP0_CHOFU ANTIFREEZE PROTEIN ISOFORM 337 DGSCTNTNSQLSANSKCEKSTLTNCYVDKSEVFGTTCTGSRFDGVTITTSTSTGSRISGPGCKISTCIITGGVPAPSAACKISGCTFSAN 90 T 6.2E-25 CfAFP unppssm F Eukaryota T 1l2y 1 A A TC5b NLYIQWLKDGGPSSGRPPPS 20 T 2.5 Mastoparan_2 pdbhh F T 1l2z 2 B B CD2_HUMAN T-CELL SURFACE ANTIGEN T11/LEU-5, LFA-2, LFA-3 RECEPTOR, ERYTHROCYTE RECEPTOR, ROSETTE RECEPTOR SHRPPPPGHRV 11 T 8.9 Peptidase_C21 pdbhh F Eukaryota T 1l3q 1 A A ARAGONITE-ASSOCIATED PROTEIN FPGKNVNCTSGE 12 T 7.7 DUF5736 pdbhh F T 1l4x 1 A,B,C,D,E,F,G,H A,B,C,D,E,F,G,H SIN-ASP-GLU-LEU-GLU-ARG-ALA-ILE-ARG-GLU-LEU-ALA-ALA-ARG-ILE-LYS-NH2 XDELERAIRELAARIKX 17 T 1.6 DUF5320 pdbhh F T 1l6o 2 D,E,F D,E,F DISHEVELLED INTERACTING ANTAGONIST, DPR1 SLKLMTTV 8 T 0.0032 Dapper pdbhh F T 1lb5 2 B B TNR11_HUMAN receptor activator of nuclear factor-kappa B QMPTEDEY 8 T 0.24 KIX unp F Eukaryota T 1lb6 2 B B TNR5_HUMAN CD40 antigen KQEPQEIDF 9 T 0.027 DUF2207 unppercent F Eukaryota T 1lb7 1 A A IGF-1 ANTAGONIST F1-1 RNCFESVAALRRCMYG 16 T 2.6 DUF4695 pdbhh F T 1lck 2 B B LCK_HUMAN TAIL PHOSPHOPEPTIDE TEGQ(PHOSPHO)YQPQPA EGQXQPQPA 9 T 8.1 Sa_NUDIX pdbhh F Eukaryota T 1ld9 3 C,F C,F NANO-PEPTIDE YPNVNIHNF 9 T 1.1 DUF5454 pdbhh F T 1le0 1 A _ Tryptophan Zipper 1 SWTWEGNKWTWKX 13 T 2.6 WXXGXW pdbhh F T 1le1 1 A _ Tryptophan Zipper 2 SWTWENGKWTWKX 13 T 0.64 Chibby pdbhh F T 1lew 2 B B MEF2A_HUMAN SERUM RESPONSE FACTOR-LIKE PROTEIN 1 RKPDLRVVIPPS 12 T 5.8 PDDEXK_7 pdbhh F Eukaryota T 1lez 2 B B MP2K3_MOUSE MKK3B SKGKSKRKKDLRISCNSK 18 T 11 Paramyxo_NS_C pdbhh F Eukaryota T 1lj2 2 C,D C,D IF4G1_HUMAN EIF4GI APKRERKTIRIRDPNQGGKDITEEIMSG 28 T 0.036 PHB_acc_N pdbhh F Eukaryota T 1loi 1 A _ RNPDE4A1A, RAT TYPE IV CYCLIC AMP SPECIFIC PHOSPHODIESTERASE, ISOFORM SUBFAMILY A, SPLICE VARIANT 1 MPLVDFFCETCSKPWLVGWWDQFKRX 26 T 0.28 Rad50_zn_hook pdbhh F T 1lq7 1 A A Alpha3W GSRVKALEEKVKALEEKVKALGGGGRIEELKKKWEELKKKIEELGGGGEVKKVEEEVKKLEEEIKKL 67 T 0.00012 ZapB pdb F T 1ltx 3 C R RAE1_RAT RAB PROTEINS GERANYLGERANYLTRANSFERASE COMPONENT A 1 MADNLPSDFDVIVIGTGLPESIIAAACSRSGQRVLHVDSRSYYGGNWASFSFSGLLSWLKEYQENNDVVTENSMWQEQILENEEAIPLSSKDKTIQHVEVFCYASQDLHKDVEEAGALQKNHASVTSAQSAEAAEAAETSCLPTAVEPLSMGSCEIPAEQSQCPGPESSPEVNDAEATGKKENSDAKSSTEEPSENVPKVQDNTETPKKNRITYSQIIKEGRRFNIDLVSKLLYSRGLLIDLLIKSNVSRYAEFKNITRILAFREGTVEQVPCSRADVFNSKQLTMVEKRMLMKFLTFCVEYEEHPDEYRAYEGTTFSEYLKTQKLTPNLQYFVLHSIAMTSETTSCTVDGLKATKKFLQCLGRYGNTPFLFPLYGQGELPQCFCRMCAVFGGIYCLRHSVQCLVVDKESRKCKAVIDQFGQRIISKHFIIEDSYLSENTCSRVQYRQISRAVLITDGSVLKTDADQQVSILTVPAEEPGSFAVRVIELCSSTMTCMKGTYLVHLTCMSSKTAREDLERVVQKLFTPYTEIEAENEQVEKPRLLWALYFNMRDSSDISRDCYNDLPSNVYVCSGPDSGLGNDNAVKQAETLFQQICPNEDFCPAPPNPEDIVLDGDSSQQEVPESSVTPETNSETPKESTVLGNPEEPSE 650 T 4.3E-15 GDI pdbpssm F Eukaryota T 1lup 1 A A MTX2_GRARO GsMTx2 YCQKWMWTCDEERKCCEGLVCRLWCKRIINM 31 T 0.00052 Toxin_12 pdb F Eukaryota T 1lvb 2 B,D C,D POLG_TEV OLIGOPEPTIDE SUBSTRATE FOR THE PROTEASE TENLYFQSGT 10 T 6.2 CX pdbhh T Viruses T 1lvm 2 C,D C,D POLG_TEV OLIGOPEPTIDE SUBSTRATE FOR THE PROTEASE XENLYFQSGT 10 T 5.7 CX pdbhh T Viruses T 1lvm 3 E E POLG_TEV CATALYTIC DOMAIN OF THE NUCLEAR INCLUSION PROTEIN A (NIA) EATQLMN 7 T 8.7 DUF3460 pdbhh T Viruses T 1lvz 1 A A GNAT1_BOVIN TRANSDUCIN ALPHA-1 CHAIN IRENLKDSGLF 11 T 0.38 ssDNA_TraI_N pdbhh F Eukaryota T 1m02 1 A A PW2 HPLKQYWWRPSI 12 T 0.31 Leader_Trp pdbhh F T 1m08 1 A,B A,B CEA7_ECOLX Colicin E7 MRNKPGKATGKGKPVNNKWLNNAGKDLGSPVPDRIANKLRDKEFKSFDDFRKKFWEEVSKDPELSKQFSRNNNDRMKVGKAPKTRTQDVSGKRTSFELHHEKPISQNGGVYDMDNISVVTPKRHIDIHRGK 131 T 0.021 HNH pdbpercent F Bacteria T 1m24 1 A,B A,B TRICHOTOXIN_A50E XXGXLXQXXXAAXPLXXQX 19 T 22 FAD_oxidored pdbhh F T 1m26 2 B,D,F,H B,D,F,H LECB3_ARTIN AGGLUTININ BETA CHAIN SGISQTVIVGPWGAKSA 17 T 0.51 DUF3842 pdbhh F Eukaryota T 1m2e 1 A A KAIA_SYNE7 KaiA MLSQIAICIWVESTAILQDCQRALSADRYQLQVCESGEMLLEYAQTHRDQIDCLILVAANPSFRAVVQQLCFEGVVVPAIVVGDRDSEDPDEPAKEQLYHSAELHLGIHQLEQLPYQVDAALAEFLRLAPVETMA 135 T 0.066 CHAT pdb F Bacteria T 1m3w 1 A,B,C,D A,B,C,D H10H24 CGGGEIWKLHEEFLKKFEELLKLHEERLKKMX 32 T 4.6 DUF761 pdbhh F T 1m46 2 B B MYO2_YEAST IQ4 SVLRTITNLQKKIRKELKQRQLKQE 25 T 0.00018 IQ unppssm F Eukaryota T 1m4h 2 C,D C,D Inhibitor OM00-3 ELDXVEF 7 T 3.3 Endotoxin_C pdbhh F T 1m7t 1 A A THIO_ECOLI;THIO_HUMAN Chimera of Human and E. coli thioredoxin MVKQIESKTAFQEALDAAGDKLVVVDFSATWCGPCKMIKPFFHSLSEKYSNVIFLEVDVDDAQDVAPKYGIRGIPTLLLFKNGEVAATKVGALSKGQLKEFLDANLV 107 T 8.3E-39 Thioredoxin unppssm F Eukaryota T 1ma2 1 A A TAC1_TACTR Tachyplesin I KWCFRVCYRGICYRRCR 17 T 0.021 Myticin-prepro unp F Eukaryota T 1ma3 2 B B P53_HUMAN TUMOR SUPPRESSOR P53, PHOSPHOPROTEIN, P53, ANTIGEN NY-CO-13 KKGQSTSRHKXLMFKTEG 18 T 30 Class_IIIsignal pdbhh F Eukaryota T 1ma4 1 A A TAC1_TACTR Tachyplesin 1 KWYFRVYYRGIYYRRYR 17 T 0.021 Myticin-prepro unp F Eukaryota T 1mfg 2 B B ERBB2_HUMAN Erb-B2 carboxyl-terminal fragment EYLGLDVPV 9 T 8.1 FAM110_C pdbhh F Eukaryota T 1mfl 2 B B ERBB2_HUMAN PHOSPHORYLATED Erb-B2 carboxyl-terminal fragment. EXLGLDVPV 9 T 8.1 FAM110_C pdbhh F Eukaryota T 1mw4 2 B B ERBB2_HUMAN PY1139, NEU PROTO-ONCOGENE, C-ERBB-2, TYROSINE KINASE-TYPE CELL SURFACE RECEPTOR HER2, MLN 19 PQPEXVNQPD 10 T 0.38 CYSTM pdbhh F Eukaryota T 1mw5 1 A,B A,B Y1480_HAEIN HYPOTHETICAL PROTEIN HI1480 GSHMSETDLLMKMVRQPVKLYSVATLFHEFSEVITKLEHSVQKEPTSLLSEENWHKQFLKFAQALPAHGSASWLNLDDALQAVVGNSRSAFLHQLIAKLKSRHLQVLELNKIGSEPLDLSNLPAPFYVLLPESFAARITLLVQDKALPYVRVSMEYWHALEYKGELNDPAANKARKEAELAAATAEQ 187 T 11 DUF4211 unphh F Bacteria T 1mxe 2 B,D E,F KCC1A_RAT CAM KINASE I IKKNFAKSKWKQAFNATAVVRHMRK 25 T 0.12 Tyrosinase unp F Eukaryota T 1mxq 1 A A TKN_ELECI Eledoisin QPSKDAFIGLM 11 T 0.0001 Tachykinin pdbhh F Eukaryota T 1n09 1 A A bhpW, disulfide cyclized beta-hairpin peptide XCTWEGNKLTCX 12 T 0.74 Lipocalin_7 pdbhh F T 1n0a 1 A A bhpw_pdg, beta-hairpin peptide XCTWEPDGKLTCX 13 T 0.64 PD40 pdbhh F T 1n0c 1 A A bhp_HWLV, disulfide cyclized beta-hairpin peptide XCHWEGNKLVCX 12 T 0.37 Lipocalin_7 pdbhh F T 1n0d 1 A A bhp_VWLH, disulfide cyclized beta-hairpin peptide XCVWEGNKLHCX 12 T 2.6 PHA_gran_rgn pdbhh F T 1n0x 3 E,F P,R B2.1 peptide HERSYMFSDLENRCIAAEXKK 21 T 0.58 TEP1_N pdbhh F T 1n3n 3 I,J,K,L I,J,K,L mycobacterial hsp60 decameric epitope SALQNAASIA 10 T 9.7 dsRBD2 pdbhh F T 1n4m 2 C,D,E C,D,E E2F2_HUMAN E2F-2 DDYLWGLEAGEGISDLFD 18 T 5.2 Carcinustatin pdbhh F Eukaryota T 1n4p 3 M,N M,N RASK_HUMAN KKKSKTKCVIL PEPTIDE KKKSKTKCVIL 11 T 0.035 Fer4_22 unppssm F Eukaryota T 1n5z 2 B,D P,Q PEX14_YEAST PEROXIN-14 EAMPPTLPHRDWKD 14 T 3.4E-05 DUF1664 unphh F Eukaryota T 1n6e 2 B,D,F,H,J,L B,D,F,H,J,L DQTQKAAAELTFF DQTQKAAAELTFF 13 T 50 NEMP pdbhh F T 1n7f 2 C,D C,D LIPA1_HUMAN 8-mer peptide from interacting protein (liprin) ATVRTYSC 8 T 2.4 BLIP pdbhh F Eukaryota T 1n86 5 I,J I,J FIBB_HUMAN fibrin beta chain peptide ligand fragment Gly-His-Arg-Pro-Leu-Asp-Lys GHRPLDK 7 T 5.8 DUF1824 pdbhh F Eukaryota T 1n9u 1 A A ANGT_HUMAN ANG I DRVYIHPFHL 10 T 0.39 Nairo_nucleo pdbhh F Eukaryota T 1n9v 1 A A ANGT_HUMAN ANG II DRVYIHPF 8 T 0.67 Nairo_nucleo pdbhh F Eukaryota T 1nb3 2 B,E,H,K P,R,S,T CATH_PIG CATHEPSIN H MINI CHAIN EPQNCSAT 8 T 0.4 SCAN unp F Eukaryota T 1nex 3 E,F E,F GLL(TPO)PPQSG GLLTPPQSG 9 T 6.5 FTZ pdbhh F T 1nhg 2 C,D C,D Q9BH77_PLAFA ENOYL-ACP-REDUCTASE YTFIDYAIEYSEKYAPLRQKLLSTDIGSVASFLLSRESRAITGQTIYVDNGLNIMFLPDD 60 F F Eukaryota T 1niw 2 B,D,F,H B,D,F,H NOS3_HUMAN EC-NOS, NOS (TYPE III), NOSIII, ENDOTHELIAL NOS, ENOS, CONSTITUTIVE NOS, CNOS RKKTFKEVANAVKISASLMG 20 T 2.3 DUF2774 pdbhh F Eukaryota T 1nlt 2 B B Seven residue peptide GWLYEIS 7 T 2.2 DUF4907 pdbhh F T 1nop 3 E C topoisomerase I-derived peptide KLNYLDPR 8 T 0.0021 Topo_C_assoc pdbhh F T 1not 1 A A CAIA_CONGE GI ALPHA CONOTOXIN ECCNPACGRHYSCX 14 T 0.039 Enterotoxin_ST pdbhh F Eukaryota T 1nrn 3 C R PAR1_HUMAN RECEPTOR BASED PEPTIDE NRS LDPRSFLLRNPNDKYEPFWEDEE 23 T 2.1 DUF4710 pdbhh F Eukaryota T 1nro 3 C R PAR1_HUMAN RECEPTOR BASED PEPTIDE NRP LDPRPFLLRNPNDKYEPFWEDEEKNES 27 T 2.6 SYCP2_SLD pdbhh F Eukaryota T 1nrp 3 C R PAR1_HUMAN RECEPTOR BASED PEPTIDE NR'S LDPXSFLLRNPNDKYEPFWEDEE 23 T 2.1 DUF4710 pdbhh F Eukaryota T 1nrq 3 C R PAR1_HUMAN RECEPTOR BASED PEPTIDE D-FPR'S XPXSXLLRNPNDKYEPFWEDEE 22 T 1.7 DUF4710 pdbhh F Eukaryota T 1nrr 3 C R PAR1_HUMAN THROMBIN RECEPTOR, COAGULATION FACTOR II RECEPTOR FLLRNPNDKYEPFWEDEE 18 T 0.93 SYCP2_SLD pdbhh F Eukaryota T 1ntv 2 B B Apolipoprotein E Receptor-2 peptide NFDNPVYRKT 10 T 3.3 DUF3498 pdbhh F T 1ntx 1 A _ 3S11_DENPO ALPHA-NEUROTOXIN RICYNHQSTTRATTKSCEENSCYKKYWRDHRGTIIERGCGCPKVKPGVGIHCCQSDKCNY 60 T 0.051 Lentiviral_Tat pdb F Eukaryota T 1nwd 2 B,C B,C DCE_PETHY GAD GSHKKTDSEVQLEMITAWKKFVEEKKKK 28 T 0.013 DUF4951 pdb F Eukaryota T 1nxn 1 A A CONTRYPHAN-VN, MAJOR FORM (CIS CONFORMER) GDCPXKPWCX 10 T 0.53 Thioredoxin_4 pdbhh F T 1nyb 2 B A REGN_BPPH3 Probable regulatory protein N ESKGTAKSRYKARRAELIAERR 22 T 0.17 N36 unphh T Viruses T 1nzl 2 C C Doubly phosphorylated peptide ligand (PQpYEpYIPI) PQXEXIPA 8 T 0.29 Ac110_PIF pdbhh F T 1nzq 3 C D Decapeptide Hirudin Analogue XYEPIPEEXXQ 11 T 0.91 Hirudin pdbhh F T 1nzs 1 A A OPSD_BOVIN 19-mer peptide fragment of RHODOPSIN DDEASTTVSKTETSQVAPA 19 T 110 DUF5840 pdbhh F Eukaryota T 1nzv 2 C C Doubly phosphorylated peptide PQpYIpYVPA PQXIXVPA 8 T 1.7 DUF3300 pdbhh F T 1o06 1 A A VPS27_YEAST Vacuolar protein sorting-associated protein VPS27 EEDPDLKAAIQESLREAEEA 20 T 2.1E-05 UIM pdbhh F Eukaryota T 1o20 1 A A PROA_THEMA GPR, GLUTAMATE-5-SEMIALDEHYDE DEHYDROGENASE, GLUTAMYL-GAMMA-SEMIALDEHYDE DEHYDROGENASE, GSA DEHYDROGENASE MGSDKIHHHHHHMDELLEKAKKVREAWDVLRNATTREKNKAIKKIAEKLDERRKEILEANRIDVEKARERGVKESLVDRLALNDKRIDEMIKACETVIGLKDPVGEVIDSWVREDGLRIARVRVPIGPIGIIYESRPNVTVETTILALKSGNTILLRGGSDALNSNKAIVSAIREALKETEIPESSVEFIENTDRSLVLEMIRLREYLSLVIPRGGYGLISFVRDNATVPVLETGVGNCHIFVDESADLKKAVPVIINAKTQRPGTCNAAEKLLVHEKIAKEFLPVIVEELRKHGVEVRGCEKTREIVPDVVPATEDDWPTEYLDLIIAIKVVKNVDEAIEHIKKYSTGHSESILTENYSNAKKFVSEIDAAAVYVNASTRFTDGGQFGFGAEIGISTQRFHARGPVGLRELTTYKFVVLGEYHVRE 427 T 2.7E-07 Aldedh unppercent F Bacteria T 1o53 1 A A PTGA_SALTI EIIA-GLC, GLUCOSE-PERMEASE IIA COMPONENT, PHOSPHOTRANSFERASE ENZYME II, A COMPONENT, EIII-GLC GLFDKLKSLVSDDKK 15 T 1.1 Antimicrobial20 pdbhh F Bacteria T 1o6k 2 B C GSK3B_HUMAN GSK-3 BETA GRPRTTSFAE 10 T 6.3 DUF3084 pdbhh F Eukaryota T 1o6o 2 D,E,F D,E,F NSP1_YEAST NUCLEAR PORE PROTEIN NSP1, NUCLEOSKELETAL-LIKE PROTEIN, P110, NSP1, YJL041W, J1207 MGSSTKSNEKKDSGSSKPAFSFGAKPDEKKNDEVSKPAFSFGAKANEKKESDESKSAFSFGSKPTGKEEGDGAKAAISFGAKPEEQKSSDTSKPAFTFGAQKDNEKKTEESSTGKSMQA 119 T 0.26 SHIPPO-rpt pdbpercent F Eukaryota T 1o9k 3 I,J,K,L P,Q,R,S E2F1_HUMAN PBR3, PRB-BINDING PROTEIN E2F-1, RETINOBLASTOMA-ASSOCIATED PROTEIN 1 LDYHFGLEEGEGIRDLFD 18 T 7.5 Guanylin pdbhh F Eukaryota T 1o9u 2 B B AXIN1_HUMAN AXIS INHIBITION PROTEIN 1, HAXIN, AXIN1, AXIN VEPQKFAEELIHRLEAVQ 18 T 0.19 Cwf_Cwc_15 unppssm F Eukaryota T 1ob7 1 A A CEPHAIBOL C XFXXXXGLXXPQXXPXPX 18 T 7.7 Pep_deformylase pdbhh F T 1obx 2 B B IL5RA_HUMAN IL-5R-ALPHA, CD125 ANTIGEN ETLEDSVF 8 T 15 DUF5588 pdbhh F Eukaryota T 1oby 2 C,D P,Q SDC4_HUMAN AMPHIGLYCAN, SYND4, RYUDOCAN CORE PROTEIN TNEFYA 6 T 3.1 Herpes_gE unphh F Eukaryota T 1odf 1 A A TDA10_YEAST YGR205W MCDKSKTVLDYTIEFLDKYIPEWFETGNKCPLFIFFSGPQGSGKSFTSIQIYNHLMEKYGGEKSIGYASIDDFYLTHEDQLKLNEQFKNNKLLQGRGLPGTHDMKLLQEVLNTIFNNNEHPDQDTVVLPKYDKSQFKGEGDRCPTGQKIKLPVDIFILEGWFLGFNPILQGIENNDLLTGDMVDVNAKLFFYSDLLWRNPEIKSLGIVFTTDNINNVYGWRLQQEHELISKVGKGMTDEQVHAFVDRYMPSYKLYLNDFVRSESLGSIATLTLGIDSNRNVYSTKTRCIE 290 T 0.00012 AAA_16 pdbpercent F Eukaryota T 1oeb 2 C,D C,D LCP2_MOUSE SH2 DOMAIN-CONTAINING LEUCOCYTE PROTEIN OF 76 KDA, SLP-76 TYROSINE PHOSPHOPROTEIN, SLP76 PAPSIDRSTKPPL 13 T 51 Chi-conotoxin pdbhh F Eukaryota T 1oex 2 B B INHIBITOR H261 XHPFAXIH 8 T 13 SoDot-IcmSS pdbhh F T 1of5 1 A A MEX67_YEAST MEX67, YPL169C, P2520 QQFFFENDALGQSSTDFATNFLNLWDNNREQLLNLYSPQSQFSVSVDSTIPPSTVTDSDQTPAFGYYMSSSRNISKVSSEKSIQQRLSIGQESINSIFKTLPKTKHHLQEQPNEYSMETISYPQINGFVITLHGFFEETGKPELESNKKTGKNNYQKNRRYNHGYNSTSNNKLSKKSFDRTWVIVPMNNSVIIASDLLTVRAYSTGAWKTASIAIAQAAGS 221 T 9.9E-08 NTF2 unppssm F Eukaryota T 1om2 2 B B ALDH2_RAT ALDH GPRLSRLLSYA 11 T 9.7 TFIID_30kDa pdbhh F Eukaryota T 1om9 2 B,D P,Q CCD91_HUMAN 15-mer peptide fragment of p56 DDDDFGGFEAAETFD 15 T 0.086 DUF5102 pdbhh F Eukaryota T 1oo4 2 B B 8-mer peptide from PDGFr SVDXVPML 8 T 0.57 Frem_N pdbhh F T 1oqp 2 B B KAR1_YEAST Cell division control protein KAR1 KKRELIESKWHRLLFHDKK 19 T 6.1 TnpW pdbhh F Eukaryota T 1osg 2 G,H,I,J,K,L G,J,H,I,K,L BR3 derived PEPTIDE CHWDLLVRHWVC 12 T 0.38 TetM_leader pdbhh F T 1ou8 2 C,D C,D synthetic ssrA peptide GRHGAANDENY 11 T 10 Tox-HDC pdbhh F T 1ov3 2 C,D C,D P22 PHAGOCYTE B-CYTOCHROME, NEUTROPHIL CYTOCHROME B, 22 KDA POLYPEPTIDE, P22-PHOX, P22PHOX, CYTOCHROME B558, ALPHA CHAIN, CYTOCHROME B-245 ALPHA-SUBUNIT LIGHT CHAIN, SUPEROXIDE- GENERATING NADPH OXIDASE LIGHT CHAIN SUBUNIT KQPPSNPPPRPPAEARKK 18 T 0.00018 Cytochrom_B558a pdbhh F T 1ow6 2 D,E D,F PAXI_HUMAN Paxillin ATRELDELMASLS 13 T 0.99 SAM_LFY pdbhh F Eukaryota T 1ox1 2 B B SYNTHETIC PEPTIDE INHIBITOR SCTRSIPPQCY 11 T 0.0036 Bowman-Birk_leg pdb F T 1oxn 2 F F AEAVPWKSE peptide AEAVPWKSE 9 T 13 LodA_C pdbhh F T 1ozz 1 A A DEFN_ARCDE defensin ARD1 DKLIGSCVWGAVNYTSNCNAECKRRGYKGGHCGSFANVNCWCET 44 T 0.00026 Toxin_3 pdbpssm F Eukaryota T 1p00 1 A A DEFN_ARCDE defensin ARD1 DKLIGSCVWGAVNYTSNCRAECKRRGYKGGHCGSFANVNCWCET 44 T 0.00019 Toxin_3 pdbpssm F Eukaryota T 1p0a 1 A A DEFN_ARCDE DEFENSIN ARD1 DKLIGSCVWGAVNYTSNCNAECKRRGYKGGHCGSFLNVNCWCET 44 T 0.00019 Toxin_3 pdbpercent F Eukaryota T 1p0g 1 A A RL1_HELPY RIBOSOMAL PROTEIN L1 AKKVFKRLEKLFSKIQNDKX 20 T 0.042 Tower pdb F Bacteria T 1p0l 1 A A RL1_HELPY RIBOSOMAL PROTEIN L1 AKKVFKRLEKLFSKIWNDKX 20 T 0.088 Antimicrobial_7 pdbhh F Bacteria T 1p0o 1 A A RL1_HELPY RIBOSOMAL PROTEIN L1 AKKVFKRLEKLFSKIWNWKX 20 T 0.37 Antimicrobial_7 pdbhh F Bacteria T 1p13 2 C,D C,D Peptide CDXANFK 7 T 1.1 Whi5 pdbhh F T 1p22 3 C C CTNB1_HUMAN PRO2286 KAAVSHWQQQSYLDSGIHSGATTTAP 26 T 13 AvrPto pdbhh F Eukaryota T 1p4b 3 C P GCN4(7P-14P) peptide AHLENEVARLKK 12 T 1.1 WD40_alt pdbhh F T 1p5k 1 A A RL1_HELPY RIBOSOMAL PROTEIN L1 AKKVFKRLEKSFSKIQNDKX 20 T 3.5 PRTRC_E unppssm F Bacteria T 1p5l 1 A A RL1_HELPY RIBOSOMAL PROTEIN L1 AKKVSKRLEKLFSKIQNDKX 20 T 0.025 Tower pdb F Bacteria T 1p9f 1 A A TKNK_HUMAN NKB, NEUROMEDIN K, ZNEUROK1 DMHDFFVGLM 10 T 0.0032 Tachykinin pdbhh F Eukaryota T 1p9u 2 G,H G,H PHQ-VNSTLQ-CHLOROMETHYLKETONE INHIBITOR XVNSTLQX 8 T 11 Peptidase_C98 pdbhh F T 1pbz 1 A,B A,B De novo designed cyclic peptide XCGAEAAKAHAKAAEAGCX 19 T 14 DUF3721 pdbhh F T 1pcg 2 C,D E,F peptide inhibitor KXILCRLLQ 9 T 0.67 TMEM95 pdbhh F T 1pd1 2 B B DxE cargo sorting signal peptide of yeast Sys1 protein QLKDLESQI 9 T 2.3 NuA4 pdbhh F T 1pd7 2 B B MAD1_HUMAN Mad1 VRMNIQMLLEAADYLERREREAEH 24 T 3 LMBR1 unp F Eukaryota T 1pef 1 A A PEPTIDE F (EQLLKALEFLLKELLEKL) EQLLKALEFLLKELLEKL 18 T 1.4 RnlA-toxin_DBD pdbhh F T 1peh 1 A _ PCY1A_RAT CYTIDYLYLTRANSFERASE MEMBRANE BINDING DOMAIN PEPTIDE XNEKKYHLQERVDKVKKKVKDVEEKSKEWVQKVEX 35 T 0.02 AKNA pdbpercent F Eukaryota T 1pg1 1 A _ PG1_PIG PROTEGRIN-1 RGGRLCYCRRRFCVCVGRX 19 T 0.16 Defensin_1 pdbhh F Eukaryota T 1pgv 1 A A TMOD_CAEEL TMD-1; TROPOMODULIN PROTEIN 1, ISOFORM A GSHGTTFNGIMQSYVPRIVPDEPDNDTDVESCINRLREDDTDLKEVNINNMKRVSKERIRSLIEAACNSKHIEKFSLANTAISDSEARGLIELIETSPSLRVLNVESNFLTPELLARLLRSTLVTQSIVEFKADNQRQSVLGNQVEMDMMMAIEENESLLRVGISFASMEARHRVSEALERNYERVRLRRLGKDPNV 197 T 0.018 LRR_6 pdbpercent F Eukaryota T 1pjn 1 A A HIBN_XENLA Histone-binding protein N1/N2 RKKRKTEEESPLKDKAKKSKG 21 T 19 CMS1 pdbhh F Eukaryota T 1pp5 1 A A MCJA_ECOLX microcin J25 GGAGHVPEYFVGIGTPISFYG 21 T 0.13 Endonuc-BglII unp F Bacteria T 1psb 2 C,D C,D STK38_HUMAN Ndr Ser/Thr kinase-like protein KRLRRSAHARKETEFLRLKRTRLGLE 26 T 3.9 Pam17 pdbhh F Eukaryota T 1psm 1 A _ Q9NIG6_PLAFA SPAM-H1 EAYKKAKQASQDAEQAAKDAENASKEAEEAAKEAVNLK 38 T 0.0019 Alanine_zipper pdbpercent F Eukaryota T 1psv 1 A _ PDA8D KPYTARIKGRTFSNEKELRDFLETFTGR 28 T 0.56 Glyco_transf_61 pdbhh F T 1pts 2 C P PEPTIDE (FSHPQNT) FSHPQNT 7 T 14 PmoA pdbhh F T 1pwv 2 C,D C,D LF20 MLARRKKVYPYPMEPTIAEG 20 T 5.8 DHHA2 pdbhh F T 1pxd 2 B B LECB3_ARTIN JACALIN BETA-3 CHAIN DEQSGISQTVIVGPWGAKVS 20 T 3.2 DUF3842 pdbhh F Eukaryota T 1py1 2 E,F,G,H E,F,G,H BACE1_HUMAN BETA-SITE APP CLEAVING ENZYME, BETA-SITE AMYLOID PRECURSOR PROTEIN CLEAVING ENZYME, ASPARTYL PROTEASE 2, ASP 2, ASP2, MEMBRANE-ASSOCIATED ASPARTIC PROTEASE 2, MEMAPSIN-2 ADDISLLK 8 T 0.72 CD34_antigen unphh F Eukaryota T 1pyz 1 A,B A,B MIMOCHROME IV, MINIATURIZED METALLOPROTEIN XESQLHSNKRX 11 T 11 DUF3949 pdbhh F T 1pz5 3 C C Octapeptide (MDWNMHAA) MDWNMHAA 8 T 6.4 DUF2969 pdbhh F T 1q1a 2 B B H4_YEAST Histone H4 KGGAXRHRKI 10 T 4.2 Shadoo unppercent F Eukaryota T 1q1s 1 A,B A,B LT_SV40 Large T antigen PGSDDEAAADAQHAAPPKKKRKVE 24 T 0.28 FAM60A unppercent T Viruses T 1q1t 1 A,B A,B LT_SV40 Large T antigen PGSDDEAAADAQHAAPPKKKRKVEY 25 T 0.28 FAM60A unppercent T Viruses T 1q2c 2 B B Histone H4 peptide SGRGKGGKGLGKGGAKRHR 19 T 180 DUF1884 pdbhh F T 1q2d 2 B B 19-mer peptide fragment from p53 Tumor Suppressor NTSSSPQPKKKPLDGEYFT 19 T 0.3 P53_tetramer pdbhh F T 1q3m 1 A A OSTCN_BOVIN GAMMA-CARBOXYGLUTAMIC ACID-CONTAINING PROTEIN, BONE GLA-PROTEIN, BGP YLDHWLGAPAPYPDPLEPKREVCELNPDCDELADHIGFQEAYRRFYGPV 49 T 0.14 Toxin_23 unppercent F Eukaryota T 1q3p 2 C,D C,D C-TERMINAL HEXAPEPTIDE FROM GKAP EAQTRL 6 T 1.8 GKAP pdbhh F T 1q40 2 B,D B,D MEX67_CANAL MEX67 MSPETMFFQDEDSRNLATNFIANYLKLWDANRSELMILYQNESQFSMQVDSSHPHLIESGNSGYSGSTDFGYYLNNSRNLTRVSSIKARMAKLSIGQEQIYKSFQQLPKTRHDIIATPELFSMEVYKFPTLNGIMITLHGSFDEVAQPEVDGSASSAPSGPRGGSRYHSGPKHKRIPLSKKSFDRTFVVIPGPNGSMIVASDTLLIRPYTSDFPWKVQK 219 T 2E-07 NTF2 pdbpssm F Eukaryota T 1q4k 2 D,E,F D,E,F Phospho-peptide sequence Met.Gln.Ser.pThr.Pro.Leu MQSTPL 6 T 180 DUF5540 pdbhh F T 1q4q 2 K,L,M,N,O,P,Q,R,S,T K,L,M,N,O,P,Q,R,S,T DRONC_DROME DRONC SRPPFISLNERR 12 T 0.38 MMgT pdbhh F Eukaryota T 1q5z 1 A A SIPA_SALTY SIPA GPVDKAGTTDNDNSQTDKTGPFSGLKFKQNSFLSTVPSVTNMHSMHFDARETFLGVIRKALEPDTSTPFPVRRAFDGLRAEILPNDTIKSAALKAQCSDIDKHPELKAKMETLKEVITHHPQKEKLAEIALQFAREAGLTRLKGETDYVLSNVLDGLIGDGSWRAGPAYESYLNKPG 177 T 0.0058 DUF3288 pdbpercent F Bacteria T 1q68 2 B B LCK_HUMAN P56-LCK, LSK, T CELL-SPECIFIC PROTEIN-TYROSINE KINASE SHPEDDWLENIDVCENCHYPIVPLDGKGT 29 T 1.3 zf-ACC pdbhh F Eukaryota T 1q69 1 A A CD8A_HUMAN T-LYMPHOCYTE DIFFERENTIATION ANTIGEN T8/LEU-2 RNRRRVCKCPRPVVKSGDK 19 T 0.0033 RCR unphh F Eukaryota T 1q8h 1 A A OSTCN_PIG BONE GLA PROTEIN,BGP,GAMMA-CARBOXYGLUTAMIC ACID-CONTAINING PROTEIN YLDHGLGAPAPYPDPLEPRREVCELNPDCDELADHIGFQEAYRRFYGIA 49 T 6.7 Cytomega_UL84 pdbhh F Eukaryota T 1q90 5 E R UCRIA_CHLRE RIESKE IRON-SULFUR PROTEIN, RISP AASSEVPDMNKRNIMNLILAGGAGLPITTLALGYGAFFVPPSSGGGGGG 49 T 0.00019 UCR_Fe-S_N pdbhh F Eukaryota T 1qfn 2 B B RIR1_ECOLI RIBONUCLEOTIDE REDUCTASE, B1 PROTEIN, R1 PROTEIN GAEDAQDDLVPSIQDDGSESGACKI 25 T 0.68 JmjN pdbhh F Bacteria T 1qg1 2 B I SHC1_HUMAN PROTEIN (SHC-DERIVED PEPTIDE) DDPSXVNVQNLDK 13 T 0.23 SH3-WW_linker pdbhh F Eukaryota T 1qhf 1 A,B A,B PMG1_YEAST PROTEIN (PHOSPHOGLYCERATE MUTASE) PKLVLVRHGQSEWNEKNLFTGWVDVKLSAKGQQEAARAGELLKEKKVYPDVLYTSKLSRAIQTANIALEKADRLWIPVNRSWRLNERHYGDLQGKDKAETLKKFGEEKFNTYRRSFDVPPPPIDASSPFSQKGDERYKYVDPNVLPETESLALVIDRLLPYWQDVIAKDLLSGKTVMIAAHGNSLRGLVKHLEGISDADIAKLNIPTGIPLVFELDENLKPSKPSYYLDPEAAAAGAAAV 240 T 7.6E-05 His_Phos_1 pdb F Eukaryota T 1qix 1 A A BETA-CASOMORPHIN-7 YPFVEPI 7 T 21 PCP pdbhh F T 1qja 2 C,D Q,R PHOSPHOPEPTIDE RLYHSLPA 8 T 2.9 DUF668 pdbhh F T 1qjb 2 C,D Q,S MT_POVBG PHOSPHOPEPTIDE ARSHSYPA 8 T 24 DUF3637 pdbhh T Viruses T 1qls 2 B D ANXA1_HUMAN ANNEXIN I XAMVSAFLKQAW 12 T 3.3 DUF5680 pdbhh F Eukaryota T 1qp6 1 A,B A,B PROTEIN (ALPHA2D) GEVEELEKKFKELWKGPRRGEIEELHKKFHELIKG 35 T 0.0098 CZB pdb F T 1qr1 3 C,F C,F ERBB2_HUMAN GP2 PEPTIDE IISAVVGIL 9 T 0.014 RIFIN unppercent F Eukaryota T 1qrn 3 C C TAX PEPTIDE P6A LLFGYAVYV 9 T 0.96 CLPTM1 pdbhh F T 1qs3 1 A A CAIA_CONGE DES-GLU1-[CYS3ALA]-DES-CYS13-ALPHA CONOTOXIN GI CANPACGRHYSX 12 T 0.042 Enterotoxin_ST unphh F Eukaryota T 1qs7 2 B,D B,D MYLK_CHICK RS20 RRKWQKTGHAVRAIGRLSSSX 21 T 4.3 PACT_coil_coil pdbhh F Eukaryota T 1qsc 2 D,E,F D,E,F CD40 RECEPTOR XYPIQET 7 T 3.3 stn_TNFRSF12A pdbhh F T 1qse 3 C C Tax Peptide V7R LLFGYPRYV 9 T 2.3 DUF5759 pdbhh F T 1qsf 3 C C TAX PEPTIDE Y8A LLFGYPVAV 9 T 0.21 DUF4504 pdbhh F T 1qsv 1 A A VGFR1_HUMAN FLT-1 SDTGRPFVEMYSEIPEIIHMTEGRELVIPCRVTSPNITVTLKKFPLDTLIPDGKRIIWDSRKGFIISNATYKEIGLLTCEATVNGHLYKTNYLTHRQTNTI 101 T 0.00047 Ig_2 pdbpercent F Eukaryota T 1qur 3 C I BIVALENT INHIBITOR (BZA-2 HIRULOG) XXGGGGNGDYEPIPEEAXX 19 T 0.047 Hirudin pdbhh F T 1qwf 2 B B VSL12 VSLARRPLPPLP 12 T 0.73 DUF4522 pdbhh F T 1qx9 1 A A CTHL4_BOVIN CYCLOCP-11 ICLKKWPWWPWRRCKX 16 T 0.071 CoV_S2 pdbhh F Eukaryota T 1qxq 1 A A CTHL4_BOVIN CP-11 ILKKWPWWPWRRKX 14 T 0.055 CoV_S2 pdbhh F Eukaryota T 1r17 2 C,D C,D fibrinopeptide B NEEGFFFSARGHRPLD 16 T 0.26 PyrBI_leader pdbhh F T 1r1r 2 B,D,F,G D,E,F,P RIR2_ECOLI RIBONUCLEOTIDE REDUCTASE R2 PROTEIN YLVGQIDSEVDTDDLSNFQL 20 T 12 UL11 pdbhh F Bacteria T 1r1s 2 B,D,F,H B,D,F,H LAT pY226 peptide XPDXENL 7 T 0.48 LAT pdbhh F T 1r2b 2 C,D C,D NCOR2_HUMAN N-COR2, SILENCING MEDIATOR OF RETINOIC ACID AND THYROID HORMONE RECEPTOR, SMRT, SMRTE, THYROID-, RETINOIC-ACID-RECEPTOR-ASSOCIATED CO-REPRESSOR, T3 RECEPTOR- ASSOCIATING FACTOR, TRAC, CTG REPEAT PROTEIN 26 GSLVATVKEAGRSIHEIPR 19 T 7.6 DUF211 pdbhh F Eukaryota T 1r4y 1 A A RNAS_ASPGI RRNA ENDONUCLEASE AVTWTCGGLLYNQNKAESNSHHAPLSDGKTGSSYPHWFTNGYDGDGKLPKGRTPIKFGKSDCDRPPKHSKDGNGKTDHYLLEFPTFPDGHDYKFDSKKPKENPGPARVIYTYPNKVFCGIIAHTKENQGELKLCSH 136 T 23 MtrE unphh F Eukaryota T 1r5v 2 B,E E,F artificial peptide ADLIAYPKAATKF 13 T 9.8 DUF3800 pdbhh F T 1r5w 3 E,F E,F artificial peptide ADLIAYFKAATKF 13 T 8 TMEM192 pdbhh F T 1r8t 1 A A MP1 RCCHPQCGAAYSCRK 15 T 0.14 Enterotoxin_ST pdbhh F T 1rdt 4 D E CBP_HUMAN LxxLL motif coactivator NLVPDAASKHKQLSELLRGGSGS 23 T 0.95 SRC-1 pdbhh F Eukaryota T 1rf3 2 B B TNR3_HUMAN 24-residue peptide from Lymphotoxin-B Receptor PYPIPEEGDPGPPGLSTPHQEDGK 24 T 5.3 LAX unphh F Eukaryota T 1rff 3 E,F C,E Topoisomerase I-Derived Peptide KLNYYDPR 8 T 0.037 Topo_C_assoc pdbhh F T 1rgj 2 B B MIMOTOPE OF THE NICOTINIC ACETYLCHOLINE RECEPTOR FRYYESSLEPWDD 13 T 1.8 LicD pdbhh F T 1rh4 1 A A RIGHT-HANDED COILED COIL TETRAMER XAALAQXKKEIAYLLAKXKAEILAALKKXKQEIAX 35 T 2.6 Phe_tRNA-synt_N pdbhh F T 1rij 1 A A E6apn1 peptide ALQELLGQWLKDGGPSSGRPPPS 23 T 1.5 RE_NgoBV pdbhh F T 1rjk 2 B C MED1_HUMAN PBP, PPAR BINDING PROTEIN, THYROID HORMONE RECEPTOR-ASSOCIATED PROTEIN COMPLEX 220 KDA COMPONENT, TRAP220, THYROID RECEPTOR INTERACTING PROTEIN 2, TRIP2, P53 REGULATORY PROTEIN RB18A, VITAMIN D RECEPTOR-INTERACTING PROTEIN COMPLEX COMPONENT DRIP205, ACTIVATOR-RECRUITED COFACTOR 205 KDA COMPONENT, ARC205 KNHPMLMNLLKDN 13 T 3.8 Gb3_synth pdbhh F Eukaryota T 1rjq 1 A A Q9AGH8_ALCFA D-aminoacylase MRGSHHHHHHGSMSQPDATPFDYILSGGTVIDGTNAPGRLADVGVRGDRIAAVGDLSASSARRRIDVAGKVVSPGFIDSHTHDDNYLLKHRDMTPKISQGVTTVVTGNCGISLAPLAHANPPAPLDLLDEGGSFRFARFSDYLEALRAAPPAVNAACMVGHSTLRAAVMPDLRREATADEIQAMQALADDALASGAIGISTGAFYPPAAHASTEEIIEVCRPLITHGGVYATHMRDEGEHIVQALEETFRIGRELDVPVVISHHKVMGKLNFGRSKETLALIEAAMASQDVSLDAYPYVAGSTMLKQDRVLLAGRTLITWCKPYPELSGRDLEEIAAERGKSKYDVVPELQPAGAIYFMMDEPDVQRILAFGPTMIGSAGLPHDERPHPRLWGTFPRVLGHYSRDLGLFPLETAVWKMTGLTAAKFGLAERGQVQPGYYADLVVFDPATVADSATFEHPTERAAGIHSVYVNGAAVWEDQSFTGQHAGRVLNRAGA 496 T 5.9E-16 Amidohydro_3 unphh F Bacteria T 1rkk 1 A A PPM1_LIMPO POLYPHEMUSIN I RRWCFRVCYRGFCYRKCRX 19 T 1.9 ADAM_CR_2 pdbhh F Eukaryota T 1rpb 1 A _ 3CP1_STRS9 Tricyclic peptide RP 71955 CLGIGSCNDFAGCGYAVVCFW 21 T 0.31 CCAP unphh F Bacteria T 1rpq 2 E,F,G,H W,X,Y,Z Peptide E131 VQCPHFCYELDYELCPDVCYV 21 T 1.6 Prot_inhib_II pdbhh F T 1rqq 3 E,F E,F BISUBSTRATE INHIBITOR KKKLPATGDFMNMSPVGD 18 T 0.3 TagF_N pdbhh F T 1rst 2 B P STREP-TAG PEPTIDE AWRHPQFGG 9 T 1.8 TIMELESS pdbhh F T 1rsu 2 B P STREP-TAG II PEPTIDE SNWSHPQFEK 10 T 0.22 CreD pdbhh F T 1rtf 1 A A (TC)-T-PA SYQSTCGLRQYSQRQRR 17 T 9.5 Abp2 pdbhh F T 1rv6 2 C,D X,Y VGFR1_HUMAN FLT1 protein DTGRPFVEMYSEIPEIIHMTEGRELVIPCRVTSPNITVTLKKFPLDTLIPDGKRIIWDSRKGFIISNATYKEIGLLTCEATVNGHLYKTNYLTHRQTNTI 100 T 0.00097 Ig_2 pdbpssm F Eukaryota T 1rxm 2 B B consensus FEN-1 peptide KTTQSTLDSFFK 12 T 0.86 CitT pdbhh F T 1rxz 2 B B FEN_ARCFU Flap structure-specific endonuclease KSTQATLERWF 11 T 0.3 DUF494 unppercent F Archaea T 1rzx 2 B B Acetylated VKESLV Peptide XVKESLV 7 T 200 DUF5627 pdbhh F T 1s4z 2 C C CAF1A_MOUSE CAF-1 SUBUNIT A, CHROMATIN ASSEMBLY FACTOR I P150 SUBUNIT, CAF-I 150 KDA SUBUNIT, CAF-IP150 GSKAGDLLFIEKVPVVVLEDILATKPSIAS 30 T 0.84 DUF411 pdbhh F Eukaryota T 1s5p 2 B B HISTONE H4 (RESIDUES 12-19) KGGAXRHR 8 T 130 DUF2476 pdbhh F T 1s5r 1 A A HBP1_MOUSE high mobility group box transcription factor 1 DFTPMDSSAVYVLSSMARQRRAS 23 T 1.5 PAXIP1_C pdbhh F Eukaryota T 1s7p 1 A B MCJA_ECOLX microcin J25 VGIGTPISFYG 11 T 0.13 Endonuc-BglII unp F Bacteria T 1s7p 2 B A MCJA_ECOLX microcin J25 GGAGHVPEYF 10 T 0.13 Endonuc-BglII unp F Bacteria T 1s9v 3 C,F C,F alpha-I gliadin LQPFPQPELPY 11 T 3.7 Sod_Fe_N pdbhh F T 1s9x 3 C C NY-ESO-1 peptide analogue S9A SLLMWITQA 9 T 0.7 DUF6405 pdbhh F T 1s9y 3 C C NY-ESO-1 peptide analogue S9S SLLMWITQS 9 T 1.6 DUF6405 pdbhh F T 1sbu 1 A A delta-conotoxin EVIA GFASLXILKNG 11 T 0.56 Pneumo_NS1 pdbhh F T 1sdz 2 B B Reaper AVAFYIPDQA 10 T 2.3 Insulin_TMD pdbhh F T 1shc 2 B B NTRK1_HUMAN TRKA RECEPTOR PHOSPHOPEPTIDE HIIENPQXFSDA 12 T 1.6 DUF2399 pdbhh F Eukaryota T 1skv 1 A,B,C,D A,B,C,D D63_SSV1 ORF D-63 MSKEVLEKELFEMLDEDVRELLSLIHEIKIDRITGNMDKQKLGKAYFQVQKIEAELYQLIKVSHHHHHH 69 T 0.093 Oxidored_nitro unppssm T Viruses T 1sld 2 B P CYCLO-AC-CHPQFC-NH2 XCHPQFCX 8 T 1.4 Cytochrom_C_2 pdbhh F T 1sle 2 B,D M,P AC-CHPQGPPC-NH2 XCHPQGPPCX 10 T 2.8 Defensin_int pdbhh F T 1sm3 3 C P MUC1_HUMAN PEPTIDE EPITOPE TSAPDTRPAPGST 13 T 21 DUF3235 pdbhh F Eukaryota T 1smr 2 B,D,F,H B,D,F,H ANGT_RAT INHIBITOR CH-66 XHPFHXYYS 9 T 0.31 DUF5372 pdbhh F Eukaryota T 1sn9 1 A,B,C,D A,B,C,D BBAT XYRIXSYDFXDELAKLLRQAXGX 23 T 8 DUF5813 pdbhh F T 1sna 1 A,B,C,D A,B,C,D BBAT XYRIXSYDFXDELXKLLRQAXGX 23 T 8.6 DUF1949 pdbhh F T 1sne 1 A,B A,B BBAT XYRIXSYDFXDELAKLLRXAXGX 23 T 8.1 PelD_GGDEF pdbhh F T 1sol 1 A _ GELS_HUMAN GELSOLIN (150-169) KHVVPNEVVVQRLFQVKGRR 20 T 1.4 Sua5_yciO_yrdC pdbhh F Eukaryota T 1soz 2 D,E D,E activating peptide DNRLGLVYQF 10 T 1.2 POLO_box pdbhh F T 1sse 1 A A YAP1_YEAST PHENANTHROLINE RESISTANCE PROTEIN PAR1, PLEIOTROPIC DRUG RESISTANCE PROTEIN PDR4 NLDSNMFSNDFNFENQFDEQVSEFCSKMNQVCGTR 35 T 0.94 PAP1 pdbhh F Eukaryota T 1ssh 2 B B SLA1_YEAST 12-RESIDUE PEPTIDE FROM SLA1 EGPPPAMPARPT 12 T 21 p47_phox_C pdbhh F Eukaryota T 1str 2 C,D M,P AC-CHPQNT-NH2 XCHPQNTX 8 T 9.2 DHOR pdbhh F T 1sts 2 C,D M,P FCHPQNT-NH2 FCHPQNTX 8 T 1.8 DUF2799 pdbhh F T 1suy 2 C,D C,D KAIC_THEEB CIIABD AMAGIISGTPTRISVDEKTELARIAKGMQDLESE 34 T 5.4 Cep57_MT_bd pdbhh F Bacteria T 1svz 2 C,D C,D epitope peptide corresponding to N-terminus of HIV-2 protease PQFSLWKR 8 T 0.42 MTP_lip_bd pdbhh F T 1sy9 2 B B CNGA2_BOVIN CYCLIC-NUCLEOTIDE-GATED CATION CHANNEL 2, CNG CHANNEL 2, CNG-2, CNG2 QQRRGGFRRIARLVGVLREWAYRNFR 26 T 5.6 Adeno_E4 pdbhh F Eukaryota T 1t0j 3 C C CAC1C_HUMAN CALCIUM CHANNEL, L TYPE, ALPHA-1 POLYPEPTIDE, ISOFORM 1, CARDIAC MUSCLE GAQQLEEDLKGYLDWITQAE 20 T 1.8 Antimicrobial14 pdbhh F Eukaryota T 1t15 2 B B FANCJ_HUMAN BRCA1 interacting protein C-terminal helicase 1 STSPTFNK 8 T 4.7 DUF4675 pdbhh F Eukaryota T 1t1x 3 C C GAG PEPTIDE SLYLTVATL 9 T 5.1 Gag_p17 pdbhh F T 1t1y 3 C C GAG PEPTIDE SLYNVVATL 9 T 0.31 Gag_p17 pdbhh F T 1t29 2 B B FANCJ_HUMAN BACH1 phosphorylated peptide ISRSTSPTFNKQTK 14 T 5.4 DUF782 pdbhh F Eukaryota T 1t2v 2 F,G,H,I,J F,G,H,I,J BRCTide-7PS GAAYDISQVFPFAKKK 16 T 1.9 Thump_like pdbhh F T 1t2y 1 A A MT_NEUCR MT GDCGCSGASSCNCGSGCSCSNCGSK 25 T 0.003 Metallothio unphh F Eukaryota T 1t3l 2 B B CAC1S_RABIT CALCIUM CHANNEL, L TYPE, ALPHA-1 POLYPEPTIDE, ISOFORM 3, SKELETAL MUSCLE QQLEEDLRGYMSWITQGE 18 T 1.8 Antimicrobial14 pdbhh F Eukaryota T 1t4f 2 B P optimized p53 peptide RFMDYWEGL 9 T 0.51 Usg pdbhh F T 1t51 1 A A NDB41_OPIMA ISCT ILGKIWEGIKSLFX 14 T 0.005 SecG unppssm F Eukaryota T 1t52 1 A A NDB41_OPIMA ISCT ILGKIWKGIKSLFX 14 T 0.005 SecG unppssm F Eukaryota T 1t54 1 A A NDB41_OPIMA ISCT ILGKIAEGIKSLFX 14 T 0.005 SecG unppssm F Eukaryota T 1t55 1 A A NDB41_OPIMA ISCT ILGKIWKPIKKLFX 14 T 0.005 SecG unppssm F Eukaryota T 1t5w 3 C,F C,F MIG1_YEAST REGULATORY PROTEIN CAT4 AAYSDQATPLLLSPR 15 T 19 DUF5888 pdbhh F Eukaryota T 1t5z 2 B B NCOA4_HUMAN NCOA-4, 70 KDA ANDROGEN RECEPTOR COACTIVATOR, 70 KDA AR-ACTIVATOR, RET-ACTIVATING PROTEIN ELE1 RETSEKFKLLFQSYN 15 T 4 DUF1279 pdbhh F Eukaryota T 1t73 2 B B FxxFF motif peptide SRFADFFRNEGLGSRSGSGK 20 T 1.7 HATPase_c_4 pdbhh F T 1t74 2 B B WxxLF motif peptide SRWQALFDDGTDTSR 15 T 3.4 NUC153 pdbhh F T 1t76 2 B B WxxVW motif peptide SRWAEVWDDNSKVSR 15 T 3 ODC_AZ pdbhh F T 1t79 2 B B FxxLW motif peptide SSKFAALWDPPKLSRSGSGK 20 T 3.7 MOSP_N pdbhh F T 1t7f 2 B B LxxLL motif peptide SSRGLLWDLLTKDSRSGSGK 20 T 4.2 TPK_B1_binding pdbhh F T 1t7r 2 B B FxxLF motif peptide SSRFESLFAGEKESR 15 T 5.5 CoV_NSP15_C pdbhh F T 1t85 1 A A CPXA_PSEPU CAMPHOR 5-MONOOXYGENASE, P450CAM TTETIQSNANLAPLPPHVPEHLVFDFDMYNPSNLSAGVQEAWAVLQESNVPDLVWTRCNGGHWIATRGQLIREAYEDYRHFSSECPFIPREAGEAYDFIPTSMDPPEQRQFRALANQVVGMPVVDKLENRIQELACSLIESLRPQGQCNFTEDYAEPFPIRIFMLLAGLPEEDIPHLKYLTDQMTRPDGSMTFAEAKEALYDYLIPIIEQRRQKPGTDAISIVANGQVNGRPITSDEAKRMCGLLLVGGLDTVVNFLSFSMEFLAKSPEHRQELIERPERIPAACEELLRRFSLVADGRILTSDYEFHGVQLKKGDQILLPQMLSGLDERENAAPMHVDFSRQKVSHTTFGHGSHLCPGQHLARREIIVTLKEWLTRIPDFSIAPGAQIQHKSGIVSGVQALPLVWDPATTKAV 414 T 1.6E-05 p450 unppercent F Bacteria T 1t8j 1 A A BBA5 XYRVXSYDFSRSDELAKLLRQHAGX 25 T 8.3 EZH2_N pdbhh F T 1t9e 1 A A SFTI1_HELAN SFTI-1 GRXTKSIPPIXFPD 14 T 0.052 Bowman-Birk_leg unp F Eukaryota T 1tfs 1 A _ 3SL2_DENPO TOXIN FS2 RICYSHKASLPRATKTCVENTCYKMFIRTHREYISERGCGCPTAMWPYQTECCKGDRCNK 60 T 5.2 Activin_recp pdb F Eukaryota T 1tgg 1 A,B,C A,B,C right-handed coiled coil trimer XAEXEQXKKEIAYLXKKXKEEILEEXKKXKQEIA 34 T 1.2 YojJ pdbhh F T 1ths 3 C I SYNTHETIC INHIBITOR XYEPIPEEAXE 11 T 0.65 Hirudin pdbhh F T 1tjb 1 A,B A,B Lanthanide-Binding Peptide YIDTNNDGWYEGDELLAX 18 T 0.53 DUF5057 pdbhh F T 1tmc 3 C C DECAMERIC PEPTIDE (EVAPPEYHRK) EVAPPEYHRK 10 T 16 DUF6328 pdbhh F T 1tn6 3 C C peptide derived from the C-terminus of Rap2a DDPTASACNIQ 11 T 22 MTCP1 pdbhh F T 1tn7 3 C C Fusion protein KKSKTKCVIF 10 T 2.4 Acetyltransf_14 pdbhh F T 1tnu 3 M,N,O,P,Q,R M,N,O,P,Q,R Transforming protein RhoB GCINCCKVL 9 T 0.72 Gal_GalNac_35kD pdbhh F T 1tny 3 M,N,O,P,Q,R M,N,O,P,Q,R guanine nucleotide-binding protein G(I)/G(S)/G(O) gamma-2 subunit FREKKFFCAIL 11 T 11 ITAM_Cys-rich pdbhh F T 1toq 2 B,D,F,H B,D,F,H LECB3_ARTIN AGGLUTININ BETA CHAIN DENSGKSQTVIVGPWGAKVS 20 T 2.7 DUF3842 pdbhh F Eukaryota T 1tps 2 B B INHIBITOR A90720A XXTRELXV 8 T 5.4 Endotoxin_M pdbhh F T 1tsq 2 C P GAG_HV1H2 AP2V NC-P1 SUBSTRATE PEPTIDE RQVNFLGKIN 10 T 0.61 zf-CCHC_5 unphh T Viruses T 1tsu 2 C P GAG_HV1H2 NC-P1 SUBSTRATE PEPTIDE RQANFLGK 8 T 0.61 zf-CCHC_5 unphh T Viruses T 1tt5 3 E,F E,F UBC12_HUMAN UBC12N26, UBIQUITIN-PROTEIN LIGASE M, UBIQUITIN CARRIER PROTEIN M, NEDD8-CONJUGATING ENZYME UBC12 MIKLFSLKQQKKEEESAGGTKGSSKK 26 T 7.8E-11 UFC1 unphh F Eukaryota T 1tvb 3 C,F C,F PMEL_HUMAN epitope of Melanocyte protein Pmel 17 ITDQVPFSV 9 T 4.8 PatG_C pdbhh F Eukaryota T 1tvh 3 C,F C,F PMEL_HUMAN epitope of Melanocyte protein Pmel 17 IMDQVPFSV 9 T 4.3 DUF1422 pdbhh F Eukaryota T 1twb 2 C,D C,D ssrA peptide ACNDENYA 8 T 19 DUF6231 pdbhh F T 1txp 1 A,B,C,D A,B,C,D HNRPC_HUMAN HNRNP C IQAIKKELTQIKQKVDSLLENLEKIEKE 28 T 0.037 IES5 unppercent F Eukaryota T 1tze 2 B I PHOSPHOTYROSYL HEPTAPEPTIDE LYS-PRO-PHE-PTYR-VAL-ASN-VAL-NH2 KPFXVNVX 8 T 0.35 SH3-WW_linker pdbhh F T 1tzg 3 E,F P,Q GP41 KGWNWFDITNWGK 13 T 0.029 GP41 pdbhh F T 1tzs 3 C X 23-mer peptide from PelB-IgG kappa light chain fusion protein MKYLLPTAAAGLLLLAAQPAMAM 23 T 0.036 DUF6488 pdbhh F T 1u00 2 B P IscU recognition peptide ELPPVKIHC 9 T 7.1 DUF4528 pdbhh F T 1u3h 5 E,J P,I MBP_MOUSE Myelin basic protein (MBP)-peptide SRGGASQYRPSQ 12 T 10 Tsg pdbhh F Eukaryota T 1u67 1 A A PGH1_SHEEP CYCLOOXYGENASE-1, COX-1, PROSTAGLANDIN-ENDOPEROXIDE SYNTHASE 1, PROSTAGLANDIN H2 SYNTHASE 1, PGH SYNTHASE 1, PGHS-1, PHS 1 MSRQSISLRFPLLLLLLSPSPVFSADPGAPAPVNPCCYYPCQHQGICVRFGLDRYQCDCTRTGYSGPNCTIPEIWTWLRTTLRPSPSFIHFLLTHGRWLWDFVNATFIRDTLMRLVLTVRSNLIPSPPTYNIAHDYISWESFSNVSYYTRILPSVPRDCPTPMGTKGKKQLPDAEFLSRRFLLRRKFIPDPQGTNLMFAFFAQHFTHQFFKTSGKMGPGFTKALGHGVDLGHIYGDNLERQYQLRLFKDGKLKYQMLNGEVYPPSVEEAPVLMHYPRGIPPQSQMAVGQEVFGLLPGLMLYATIWLREHNRVCDLLKAEHPTWGDEQLFQTARLILIGETIKIVIEEYAQQLSGYFLQLKFDPELLFGAQFQYRNRIAMEFNQLYHFHPLMPDSFRVGPQDYSYEQFLFNTSMLVDYGVEALVDAFSRQPAGRIGGGRNIDHHILHVAVDVIKESRVLRLQPFNEYRKRFGMKPYTSFQELTGEKEMAAELEELYGDIDALEFYPGLLLEKCHPNSIFGESMIEMGAPFSLKGLLGNPICSPEYWKASTFGGEVGFNLVKTATLKKLVCLNTKTCPYVSFHVPDPRQEDRPGVERPPTEL 600 T 2.7E-05 An_peroxidase unp F Eukaryota T 1u7b 2 B B FEN1_HUMAN SRQGSTQGRLDDFFKVTGSL peptide of Flap endonuclease-1 SRQGSTQGRLDDFFKVTGSL 20 T 0.036 LRV_FeS pdbhh F Eukaryota T 1u7j 1 A,B A,B Four-helix bundle model MDYLRELYKLEQQAMKLYREASERVGDPVLAKILEDEEKHIEWLETING 49 T 0.00018 Rubrerythrin pdbhh F T 1u8h 3 C C GP41 PEPTIDE ALDKWAS 7 T 4.1 DUF148 pdbhh F T 1u8i 3 C C GP41 PEPTIDE ELDKWAN 7 T 0.46 TMEM154 pdbhh F T 1u8j 3 C C GP41 PEPTIDE ELDKWAG 7 T 2.4 TMEM154 pdbhh F T 1u8k 3 C C GP41 PEPTIDE LELDKWASL 9 T 3.6 Kri1_C pdbhh F T 1u8l 3 C C GP41 PEPTIDE DLDRWAS 7 T 1.2 YacG pdbhh F T 1u8m 3 C C GP41 PEPTIDE ELDKYAS 7 T 6.1 DUF3283 pdbhh F T 1u8n 3 C C GP41 PEPTIDE ELDKFAS 7 T 3.8 Gag_p17 pdbhh F T 1u8o 3 C C GP41 PEPTIDE ELDKHAS 7 T 2.2 DUF3283 pdbhh F T 1u8p 3 C C GP41 PEPTIDE ECDKWCS 7 T 0.62 Sex_peptide pdbhh F T 1u8q 3 C C GP41 PEPTIDE ELEKWAS 7 T 5.7 DUF1186 pdbhh F T 1u91 3 C C GP41 PEPTIDE ANALOG ENDKWAS 7 T 3.2 Sin3_corepress pdbhh F T 1u92 3 C C GP41 PEPTIDE ANALOG EADKWQS 7 T 1.7 DEC1 pdbhh F T 1u93 3 C C GP41 PEPTIDE ANALOG EQDKWAS 7 T 28 Tmemb_9 pdbhh F T 1u95 3 C C GP41 PEPTIDE ELDHWAS 7 T 7.4 DUF3606 pdbhh F T 1u9f 1 A,B,C,D A,B,C,D AMINO ACID BIOSYNTHESIS REGULATORY PROTEIN XRMKQIEDKLEEILSXYHIENELARIKKLLGER 33 T 0.0068 VGPC1_C pdbhh F T 1u9g 1 A,B A,B AMINO ACID BIOSYNTHESIS REGULATORY PROTEIN RMKQIEDXEEILSKLYHIENELARIKKLLGER 32 T 0.0046 DUF1192 pdbpercent F T 1u9h 1 A,B A,B AMINO ACID BIOSYNTHESIS REGULATORY PROTEIN XRMKQIEDKLEEILSKLYHIENXARIKKLLGER 33 T 0.048 VGPC1_C pdbhh F T 1u9l 2 C C Lambda N NRPILSL 7 T 1.8 CedA pdbhh F T 1uao 1 A A Chignolin GYDPETGTWG 10 T 0.046 DUF4585 pdbhh F T 1ucy 4 D,G,J F,G,I FIBA_MACFU FIBRINOPEPTIDE A-ALPHA XDFLAEGGGVRPR 13 T 4 ThuA unphh F Eukaryota T 1uef 2 C,D C,D RET_MOUSE POLYPEPTIDE CONTAINING A PHOSPHORYLATED TYROSINE STWIENKLXGMSD 13 T 2.7 DUF3541 pdbhh F Eukaryota T 1ugw 2 B,D,F,H B,D,F,H LECB3_ARTIN JACALIN BETA-3 CHAIN DEQSGISQTVIVGPWGAKSS 20 T 2.5 DUF6409 pdbhh F Eukaryota T 1uh0 2 B,D,F,H B,D,F,H LECB3_ARTIN JACALIN BETA-3 CHAIN DEQSGKSQTVIVGPWGAKVS 20 T 3 DUF3842 pdbhh F Eukaryota T 1uj0 2 B B UBP8_MOUSE UBPY-DERIVED PEPTIDE TPMVNRENKPP 11 T 1.1 DUF6440 pdbhh F Eukaryota T 1ujj 2 C C BACE1_HUMAN C-TERMINAL PEPTIDE FROM BACE HDDFADDISLLK 12 T 0.72 CD34_antigen unphh F Eukaryota T 1ujz 2 B B CEA7_ECOLX DC DNASE KRNKPGKATGKGKPVNNKWLNNAGKDLGSPVPDRIANKLRDKEFKSFDDFRKKFWEEVSKDPELSKQFSRNNNDRMKVGKAPQTRTQDVSGKRRSFELHHEKPISQNGGVYDMDNISVVTPKRAIDIH 128 T 0.0091 HNH pdbpercent F Bacteria T 1ukh 2 B B JIP1_MOUSE JIP1 RPKRPTTLNLF 11 T 12 CTP-dep_RFKase pdbhh F Eukaryota T 1umw 2 C,D E,F PEPTIDE PMQSTPL 7 T 20 KRE9 pdbhh F T 1uoc 1 A,B A,B POP2_YEAST CCR4-ASSOCIATED FACTOR 1 GAMPPIFLPPPNYLFVRDVWKSNLYSEFAVIRQLVSQYNHVSISTEFVGTLARPIGTFRSKVDYHYQTMRANVDFLNPIQLGLSLSDANGNKPDNGPSTWQFNFEFDPKKEIMSTESLELLRKSGINFEKHENLGIDVFEFSQLLMDSGLMMDDSVTWITYHAAYDLGFLINILMNDSMPNNKEDFEWWVHQYMPNFYDLNLVYKIIQEFKNPQLQQSSQQQQQQQYSLTTLADELGLPRFSIFTTTGGQSLLMLLSFCQLSKLSMHKFPNGTDFAKYQGVIYGIDGDQ 289 T 3.5E-30 CAF1 pdbhh F Eukaryota T 1upk 2 B B STRAA_HUMAN STRAD ALPHA NLEELEVDDWEF 12 T 6.3 ANAPC16 pdbhh F Eukaryota T 1ura 1 A,B A,B PPB_ECOLI ALKALINE PHOSPHATASE MPVLENRAAQGDITAPGGARRLTGDQTAALRDSLSDKPAKNIILLIGNGMGDSEITAARNYAEGAGGFFKGIDALPLTGQYTHYALNKKTGKPDYVTDSAASATAWSTGVKTYNGALGVDIHEKDHPTILEMAKAAGLATGNVSTAELQDATPAALVAHVTSRKCYGPSATSEKCPGNALEKGGKGSITEQLLNARADVTLGGGAKTFAETATAGEWQGKTLREQAQARGYQLVSDAASLNSVTEANQQKPLLGLFADGNMPVRWLGPKATYHGNIDKPAVTCTPNPQRNDSVPTLAQMTDKAIELLSKNEKGFFLQVEGASIDKQDHAANPCGQIGETVDLDEAVQRALEFAKKEGNTLVIVTADHAHASQIVAPDTKAPGLTQALNTKDGAVMVMSYGNSEEDSQEHTGSQLRIAAYGPHAANVVGLTDQTDLFYTMKAALGLK 446 T 6.8E-10 Alk_phosphatase unppssm F Bacteria T 1urc 3 E,F E,F PEPTIDE INHIBITOR XRKLFG 6 T 29 Mog1 pdbhh F T 1url 2 B B ALA-GLY-HIS-THR-TRP-GLY-HIA AGHTWGX 7 T 1.4 HSNSD pdbhh F T 1uti 2 B D M4K1_MOUSE HEMATOPOETIC PRGENITOR KINASE I, MAPK/ERK KINASE KINASE KINASE 1, MEK KINASE KINASE 1, MEKKK 1, HPK GQPPLVPPRKEKMRGK 16 T 5.1 NapB pdbhh F Eukaryota T 1uw1 1 A A ARTIFICIAL NUCLEOTIDE BINDING PROTEIN (ANBP) GAMDYKDDDDKKTNWLKRIYRVRPCVKCKVAPRNWKVKNKHLRIYNMCKTCFNNSIDIGDDTYHGHDDWLMYADSKEISN 80 T 0.013 ZZ pdbpssm F T 1uwi 1 A,B,C,D A,B,C,D BGAL_SULSO GLYCOSIDASE, LACTASE MYSFPNSFRFGWSQAGFQSEMGTPGSEDLNTDWYKWVHDPENMAAGLVSGDLPENGPGYWGNYKTFHNNAQKMGLKIARLNSEWSRQFPNPLPRPQNFDESKQDVTEVEINENELKRLDEYANKDALNHYREIFKDLKSRGLYFIQNMYHWPLPLWLHDPIRVRRGDFTGPSGWLSTRTVYEFARFSAYTAWKFDDLVDEYSTMNEPNVVGGLGYVGVKSGFPPGYLSFELSRRAMYNIIQAHARAYDGIKSVSKKPVGIIYANSSFQPLTDKDMEAVEMAENDNRWWFFDAIIRGEITRGNEKIVRDDLKGRLDWIGVNYYTRTVVKRTGKGYVSLGGYGHGCERNSVSLAGLPTSDFGWEFFPEGLYDVLTKYWNRYHLYMYVTENGIADDADYQRPYYLVSHVYQVHRAINSGADVRGYLHWSLADNYEWASGFSMRFGLLKVDYNTKRLYWRPSSLVYREIATNGAITDEIEHLNSVPPVKPLRH 489 T 1.3E-42 Glyco_hydro_1 unppercent F Archaea T 1v13 1 A,B A,B CEA9_ECOLX COLICIN E9 MESKRNKPGKATGKGKPVGDKWLDDAGKDSGAPIPDRIADKLRDKEFKSFDDFRKAVWEEVSKDPELSKNLNPSNKSSVSKGYSPFTPKNQQVGGRKVYELHADKPISQGGEVYDMDNIRVTTPKRHIDIHRGK 134 T 0.016 LHH unppercent F Bacteria T 1v1t 2 C,D S,T TNEYKV PEPTIDE TNEYKV 6 T 120 DUF1230 pdbhh F T 1v4q 1 A A CO7C_CONMA omega-conotoxin MVIIC CKGKGAPCRKTMYDCCKGRCGRRGRCX 27 T 0.0018 Conotoxin unphh F Eukaryota T 1v4y 1 A A Q9AGH8_ALCFA D-aminoacylase MRGSHHHHHHGSMSQPDATPFDYILSGGTVIDGTNAPGRLADVGVRGDRIAAVGDLSASSARRRIDVAGKVVSPGFIDSHTHDDNYLLKHRDMTPKISQGVTTVVTGNCGISLAPLAHANPPAPLDLLDEGGSFRFARFSDYLEALRAAPPAVNAACMVGHSTLRAAVMPDLRREATADEIQAMQALADDALASGAIGISTGAFYPPAAHASTEEIIEVCRPLITHGGVYATAMRDEGEHIVQALEETFRIGRELDVPVVISHHKVMGKLNFGRSKETLALIEAAMASQDVSLDAYPYVAGSTMLKQDRVLLAGRTLITWCKPYPELSGRDLEEIAAERGKSKYDVVPELQPAGAIYFMMDEPDVQRILAFGPTMIGSDGLPHDERPHPRLWGTFPRVLGHYSRDLGLFPLETAVWKMTGLTAAKFGLAERGQVQPGYYADLVVFDPATVADSATFEHPTERAAGIHSVYVNGAAVWEDQSFTGQHAGRVLNRAGA 496 T 2.1E-16 Amidohydro_3 pdbhh F Bacteria T 1v5a 1 A A Covalitoxin-I RCLPSGKACAGVTQKIPCCGSCVRGKCS 28 T 0.046 Conotoxin pdbhh F T 1v66 1 A A PIAS1_HUMAN PROTEIN INHIBITOR OF ACTIVATED STAT-1, PIAS-1, GU BINDING PROTEIN, GBP, RNA HELICASE II BINDING PROTEIN, DEAD/H BOX-BINDING PROTEIN 1 MADSAELKQMVMSLRVSELQVLLGYAGRNKHGRKHELLTKALHLLKAGCSPAVQMKIKELYRRRF 65 T 0.003 SAP_new25 pdbhh F Eukaryota T 1vd7 1 A A Q5FBS0_BOMMO FMBP-1 ETSEERAARLAKMSAYAAQRLAN 23 T 0.31 Lipase_chap unppssm F Eukaryota T 1vd8 1 A A Q5FBS0_BOMMO FMBP-1 ESPEQRATRLKRMSEYAAKRLSS 23 T 0.095 EF-1_beta_acid pdbpercent F Eukaryota T 1vd9 1 A A Q5FBS0_BOMMO FMBP-1 ETREQRAIRLARMSAYAARRLAN 23 T 0.15 DUF6366 unppercent F Eukaryota T 1vda 1 A A Q5FBS0_BOMMO FMBP-1 ETPAQRQARLLRMSAYAAKRQAS 23 T 0.15 DUF6366 unppercent F Eukaryota T 1vg0 1 A A RAE1_RAT RAB ESCORT PROTEIN 1, REP-1 MADNLPSDFDVIVIGTGLPESIIAAACSRSGQRVLHVDSRSYYGGNWASFSFSGLLSWLKEYQENNDVVTENSMWQEQILENEEAIPLSSKDKTIQHVEVFCYASQDLHKDVEEAGALQKNHASVTSAQSAEAAEAAETSCLPTAVEPLSMGSCEIPAEQSQCPGPESSPEVNDAEATGKKENSDAKSSTEEPSENVPKVQDNTETPKKNRITYSQIIKEGRRFNIDLVSKLLYSRGLLIDLLIKSNVSRYAEFKNITRILAFREGTVEQVPCSRADVFNSKQLTMVEKRMLMKFLTFCVEYEEHPDEYRAYEGTTFSEYLKTQKLTPNLQYFVLHSIAMTSETTSCTVDGLKATKKFLQCLGRYGNTPFLFPLYGQGELPQCFCRMCAVFGGIYCLRHSVQCLVVDKESRKCKAVIDQFGQRIISKHFIIEDSYLSENTCSRVQYRQISRAVLITDGSVLRTDADQQVSILTVPAEEPGSFAVRVIELCSSTMTCMKGTYLVHLTCMSSKTAREDLERVVQKLFTPYTEIEAENEQVEKPRLLWALYFNMRDSSDISRDCYNDLPSNVYVCSGPDSGLGNDNAVKQAETLFQQICPNEDFCPAPPNPEDIVLDGDSSQQEVPESSVTPETNSETPKESTVLGNPEEPSE 650 T 9.7E-16 GDI pdbpssm F Eukaryota T 1vgk 3 C C syvntnmgl SYVNTNMGL 9 T 4.3 Colipase_C pdbhh F T 1vlu 1 A,B A,B PROA_YEAST GPR, GLUTAMATE-5-SEMIALDEHYDE DEHYDROGENASE, GLUTAMYL-GAMMA-SEMIALDEHYDE DEHYDROGENASE, GSA DEHYDROGENASE MGSDKIHHHHHHMSSSQQIAKNARKAGNILKTISNEGRSDILYKIHDALKANAHAIEEANKIDLAVAKETGLADSLLKRLDLFKGDKFEVMLQGIKDVAELEDPVGKVKMARELDDGLTLYQVTAPVGVLLVIFESRPEVIANITALSIKSGNAAILKGGKESVNTFREMAKIVNDTIAQFQSETGVPVGSVQLIETRQDVSDLLDQDEYIDLVVPRGSNALVRKIKDTTKIPVLGHADGICSIYLDEDADLIKAKRISLDAKTNYPAGCNAMETLLINPKFSKWWEVLENLTLEGGVTIHATKDLKTAYFDKLNELGKLTEAIQCKTVDADEEQDFDKEFLSLDLAAKFVTSTESAIQHINTHSSRHTDAIVTENKANAEKFMKGVDSSGVYWNASTRFADGFRYGFGAEVGISTSKIHARGPVGLDGLVSYQYQIRGDGQVASDYLGAGGNKAFVHKDLDIKTVTL 468 T 9E-07 Aldedh pdbpercent F Eukaryota T 1vm2 1 A A peptide A2 GLFDKLKSLVSDFX 14 T 0.74 Antimicrobial20 pdbhh F T 1vpp 2 C,D X,Y PROTEIN (PEPTIDE V108) RGWVEICAADDYGRCLTEAQ 20 T 2.1 zf-LYAR pdbhh F T 1vtp 1 A _ Q40378_NICAL NA-PROPI SEYASKVDEYVGEVENDLQKSKVAVS 26 T 3 RasGEF_N_2 unp F Eukaryota T 1vwi 2 C,D M,P PEPTIDE LIGAND CONTAINING HPQ HPQGPPCKX 9 T 3.4 TMEM135_C_rich pdbhh F T 1vyj 2 B,D,F,H,J,L B,D,F,H,J,L SMALL PEPTIDE SAVLQKKITDYFHPKK SAVLQKKITDYFHPKK 16 T 2.4 Lactococcin_972 pdbhh F T 1vyq 1 A,B,C A,B,C Q8II92_PLAF7 DUTP PYROPHOSPHATASE MHLKIVCLSDEVREMYKNHKTHHEGDSGLDLFIVKDEVLKPKSTTFVKLGIKAIALQYKSNYYYKCEKSENKKKDDDKSNIVNTSFLLFPRSSISKTPLRLANSIGLIDAGYRGEIIAALDNTSDQEYHIKKNDKLVQLVSFTGEPLSFELVEELDETSRGEGGFGSTSNNKY 173 T 1.7E-06 dUTPase unppercent F Eukaryota T 1vyt 2 C,D E,F CAC1C_RAT CALCIUM CHANNEL L TYPE ALPHA-1 POLYPEPTIDE ISOFORM 1 FROM CARDIAC MUSCLE, RAT BRAIN CLASS C QKLREKQQLEEDLKGYLDWITQAED 25 T 3.1 Antimicrobial14 pdbhh F Eukaryota T 1vzm 1 A,B,C A,B,C OSTCN_ARGRE OSTEOCALCIN AAKELTLAQTESLREVCETNMACDEMADAQGIVAAYQAFYGPIPF 45 T 0.5 UCMA unphh F Eukaryota T 1w0v 3 C C TISD_HUMAN EGF-RESPONSE FACTOR 2, ERF-2, TIS11D PROTEIN RRLPIFSRL 9 T 11 Imm15 pdbhh F Eukaryota T 1w80 2 B P SYNJ1_HUMAN SYNAPTIC INOSITOL-1,4,5-TRISPHOSPHATE 5-PHOSPHATASE 1, SYJ-P3 NPKGWVTFEEEE 12 T 0.37 Stonin2_N pdbhh F Eukaryota T 1w80 3 C Q SYNJ1_HUMAN SYNAPTIC INOSITOL-1,4,5-TRISPHOSPHATE 5-PHOSPHATASE 1, SYJ-P3 LDGFKDSFDLQG 12 T 3.8 Glycoamylase pdbhh F Eukaryota T 1w8t 1 A A Q9C171_PIREQ NON CATALYTIC PROTEIN 1 SNVRATYTVIFKNASGLPNGYDNWGWGCTLSYYGGAMIINPQEGKYGAVSLKRNSGSFRGGSLRFDMKNEGKVAILVENSEADEKFEVETISPSDEYVTYILDVDFDLPFDRIDFQDAPGNGDRIWIKNLVHSTGSADDFVDPINLEHH 149 T 0.025 ShlB pdbpercent F Eukaryota T 1w8w 1 A,B A,B Q9C171_PIREQ NON-CATALYTIC PROTEIN 1 SNVRATYTVIFKNASGLPNGYDNWGWGCTLSYYGGAMIINPQEGKAGAVSLKRNSGSFRGGSLRFDMKNEGKVKILVENSEADEKFEVETISPSDEYVTYILDVDFDLPFDRIDFQDAPGNGDRIWIKNLVHSTGSADDFVDPINLEHH 149 T 0.17 RNase_H pdbpssm F Eukaryota T 1w8z 1 A,B A,B Q9C171_PIREQ NON CATALYTIC PROTEIN 1 SNVRATYTVIFKNASGLPNGYDNWGWGCTLSYYGGAMIINPQEGKYGAVSLKRNSGSFRGGSLRFDMKNEGKVKILVENSEADEAFEVETISPSDEYVTYILDVDFDLPFDRIDFQDAPGNGDRIWIKNLVHSTGSADDFVDPINLHHH 149 T 0.23 RNase_H pdbpssm F Eukaryota T 1w90 1 A,B A,B Q9C171_PIREQ NON-CATALYTIC PROTEIN 1 SNVRATYTVIFKNASGLPNGYDNWGWGCTLSYYGGAMIINPQEGKYGAVSLKRNSGSFRGGSLRFDMKNEGKVKILVENSEADEKFEVETISPSDEYVTYILDVDFDLPFDRIAFQDAPGNGDRIWIKNLVHSTGSADDFVDPINLEHHHHHH 153 T 0.1 RNase_H pdbpssm F Eukaryota T 1w94 1 A,B A,B BRIX_METTH MIL HMLLTTSRKPSQRTRSFSQRLSRIMGWRYINRGKMSLRDVLIEARGPVAVVSERHGNPARITFLDERGGERGYILFNPSFEMKKPELADKAVRVSSCPPGSEGLCNLMGLEVDESSSRDAWSIRTDEEYAWVMELMDARGTPAGFKLLIRDFRVGE 156 T 6.4E-05 Brix unphh F Archaea T 1w9f 1 A,B A,B Q9C171_PIREQ NON CATALYTIC PROTEIN 1 SNVRATYTVIFKNASGLPNGYDNWGWGCTLSYYGGAMIINPQEGKYGAVSLKRNSGSFRGGSLRFDMKNEGKVKILVENSEADEKFEVETISPSDEYVTYILDVDFDLPFDAIDFQDAPGNGDRIWIKNLVHSTGSADDFVDPINLHHH 149 T 0.26 RNase_H pdbpssm F Eukaryota T 1w9r 1 A A A0A0H2US50_STRPN CBPA-R2 GSHMPEKKVAEAEKKVEEAKKKAEDQKEEDRRNYPTNTYKTLELEIAESDVEVKKAELELVKEEAKEPRNEEKVKQAKAEVESKKAEATRLEKIKTDRKKAEEEAKRKAAEEDKVKEKP 119 T 3.3 UPF0231 pdb F Bacteria T 1wa7 2 B B TIP_SHV2C TIP WDPGMPTPPLPPRPANLGERQA 22 T 0.061 EVI2A unp T Viruses T 1wak 1 A A SRPK1_HUMAN SRPK1, SRPK1A PROTEIN KINASE, SERINE/ARGININE-RICH PROTEIN SPECIFIC KINASE 1, SR-PROTEIN-SPECIFIC KINASE 1, SFRS PROTEIN KINASE 1 PEQEEEILGSDDDEQEDPNDYCKGGYHLVKIGDLFNGRYHVIRKLGWGHFSTVWLSWDIQGKKFVAMKVVKSAEHYTETALDEIRLLKSVRNSDPNDPNREMVVQLLDDFKISGVNGTHICMVFEVLGHHLLKWIIKSNYQGLPLPCVKKIIQQVLQGLDYLHTKCRIIHTDIKPENILLSVNEQYIRRLAAEATEWQRSGAPPPSGSAVSTAPATAGNFLVNPLEPKNAEKLKVKIADLGNACWVHKHFTEDIQTRQYRSLEVLIGSGYNTPADIWSTACMAFELATGDYLFEPHSGEEYTRDEDHIALIIELLGKVPRKLIVAGKYSKEFFTKKGDLKHITKLKPWGLFEVLVEKYEWSQEEAAGFTDFLLPMLELIPEKRATAAECLRHPWLNS 397 T 3.2 RFXA_RFXANK_bdg unppercent F Eukaryota T 1wc2 1 A A GUN_MYTED CMCASE, ENDO-1,4-BETA-GLUCANASE, CELLULASE NQKCSGNPRRYNGKSCASTTNYHDSHKGACGCGPASGDAQFGWNAGSFVAAASQMYFDSGNKGWCGQHCGQCIKLTTTGGYVPGQGGPVREGLSKTFMITNLCPNIYPNQDWCNQGSQYGGHNKYGYELHLDLENGRSQVTGMGWNNPETTWEVVNCDSEHNHDHRTPSNSMYGQCQCAHQ 181 T 0.00088 DPBB_1 pdbpssm F Eukaryota T 1wcu 1 A A Q9C171_PIREQ CBM29_1 MVSATYSVVYETGKKLNSGFDNWGWDSKMSFKDNSLVLTADPDEYGAISLKNLNSNYYGKGGCIYLQVKTETEGLVKVQGVRGYDETEAFNVGSFRSSSDFTEYKFEVDDEYQFDRIIVQDGPASNIPIYMRYIIYSTGSCDDHILEHHHHHH 153 T 0.071 YegS_C unppercent F Eukaryota T 1weq 1 A A PHF7_MOUSE PHD finger protein 7 GSSGSSGELEPGAFSELYQRYRHCDAPICLYEQGRDSFEDEGRWRLILCATCGSHGTHRDCSSLRPNSKKWECNECLPASGPSSG 85 T 0.00095 PHD pdb F Eukaryota T 1wfa 1 A,B A,B ANPA_PSEAM ANTIFREEZE PROTEIN ISOFORM HPLC6 DTASDAAAAAALTAANAKAAAELTAANAAAAAAATARX 38 T 9.3 DUF3157 unppssm F Eukaryota T 1wh5 1 A A ZHD1_ARATH ZINC FINGER HOMEOBOX FAMILY PROTEIN GSSGSSGSSAEAGGGIRKRHRTKFTAEQKERMLALAERIGWRIQRQDDEVIQRFCQETGVPRQVLKVWLHNNKHSGPSSG 80 T 0.0036 Homeodomain pdbhh F Eukaryota T 1wh7 1 A A ZHD2_ARATH HYPOTHETICAL PROTEIN F22K18.140, AT4G24660/F22K18_140, ZINC FINGER HOMEOBOX FAMILY PROTEIN GSSGSSGSNPSSSGGTTKRFRTKFTAEQKEKMLAFAERLGWRIQKHDDVAVEQFCAETGVRRQVLKIWMHNNKNSGPSSG 80 T 0.0038 Homeodomain pdbhh F Eukaryota T 1wjj 1 A A Y4844_ARATH hypothetical protein F20O9.120 GSSGSSGSTVKRKPVFVKVEQLKPGTTGHTLTVKVIEANIVVPVTRKTRPASSLSRPSQPSRIVECLIGDETGCILFTARNDQVDLMKPGATVILRNSRIDMFKGTMRLGVDKWGRIEATGAASFTVKEDNNLSLVEYESGPSSG 145 T 0.13 DUF3253 unppercent F Eukaryota T 1wo0 1 A A TAC1_TACTR Tachyplesin I KWCFRVCYRGICYRRCRX 18 T 0.021 Myticin-prepro unp F Eukaryota T 1wqb 1 A A TXP7_APOSC PARALYTIC PEPTIDE VII, PP VII WLGCARVKEACGPWEWPCCSGLKCDGSECHPQ 32 T 0.027 Toxin_7 pdbhh F Eukaryota T 1wqc 1 A A KKX21_OPIMA OmTx1 DPCYEVCLQQHGNVKECEEACKHPVE 26 T 0.023 Thionin pdb F Eukaryota T 1wqd 1 A A KKX21_OPIMA OmTx2 DPCYEVCLQQHGNVKECEEACKHPVEY 27 T 0.026 Thionin pdb F Eukaryota T 1wqe 1 A A KKX23_OPIMA OmTx3 NDPCEEVCIQHTGDVKACEEACQ 23 T 0.55 DUF1289 unphh F Eukaryota T 1wrz 2 B B DAPK2_HUMAN DAP KINASE 2, DAP- KINASE RELATED PROTEIN 1, DRP-1 RRRWKLSFSIVSLCNHLTR 19 T 4.9 AAA_lid_8 pdbhh F Eukaryota T 1ws4 2 B,D,F,H B,D,F,H LECB3_ARTIN JACALIN BETA-3 CHAIN DEQSGISQTVIVGPWGAKSA 20 T 2.8 DUF3842 pdbhh F Eukaryota T 1wzc 1 A,B A,B MPGP_PYRHO MPGP MIRLIFLDIDKTLIPGYEPDPAKPIIEELKDMGFEIIFNSSKTRAEQEYYRKELEVETPFISENGSAIFIPKGYFPFDVKGKEVGNYIVIELGIRVEKIREELKKLENIYGLKYYGNSTKEEIEKFTGMPPELVPLAMEREYSETIFEWSRDGWEEVLVEGGFKVTMGSRFYTVHGNSDKGKAAKILLDFYKRLGQIESYAVGDSYNDFPMFEVVDKVFIVGSLKHKKAQNVSSIIDVLEVIKHHHHHH 249 T 1.5E-10 Hydrolase_3 pdbpercent F Archaea T 1x2r 2 B B NF2L2_MOUSE NF-E2 RELATED FACTOR 2, NFE2-RELATED FACTOR 2, NUCLEAR FACTOR, ERYTHROID DERIVED 2, LIKE 2 LDEETGEFL 9 T 0.055 Radial_spoke unppercent F Eukaryota T 1x3c 1 A A ZN292_HUMAN Zinc finger protein 292 GSSGSSGRKKPVSQSLEFPTRYSPYRPYRCVHQGCFAAFTIQQNLILHYQAVHKSDLPAFSAEVEEESGPSSG 73 T 0.0055 zf-C2H2 unppercent F Eukaryota T 1x5v 1 A A TXFK1_PSACA PcFK1 ACGILHDNCVYVPAQNPCCRGLQCRYGKCLVQVX 34 T 8.7E-05 Conotoxin unphh F Eukaryota T 1x7k 1 A A PPM1_LIMPO PV5 RRWCFRVCYRGRFCYRKCR 19 T 0.19 zf-CCHH pdbhh F Eukaryota T 1x8s 2 B B Pals1 peptide YPKHREMAVDCP 12 T 3.5 GSH_synthase pdbhh F T 1x9t 2 B,B10,B11,B12,B13,B14,B15,B16,B17,B18,B19,B2,B20,B21,B22,B23,B24,B25,B26,B27,B28,B29,B3,B30,B31,B32,B33,B34,B35,B36,B37,B38,B39,B4,B40,B41,B42,B43,B44,B45,B46,B47,B48,B49,B5,B50,B51,B52,B53,B54,B55,B56,B57,B58,B59,B6,B60,B7,B8,B9 B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B SPIKE_ADE02 N-terminal peptide of Fiber protein MKRARPSEDTFNPVYPYDTEC 21 T 0.34 DUF5449 pdbhh T Viruses T 1xb7 2 B P PRGC1_HUMAN PPAR GAMMA COACTIVATOR-1 ALPHA, PPARGC-1 ALPHA, PGC-1 ALPHA, LIGAND EFFECT MODULATOR-6 RPASELLKYLTT 12 T 4.4 MTBP_mid pdbhh F Eukaryota T 1xbh 1 A A PROTEIN (CYCLO(L-262)) CIYYKDGEALKYX 13 T 18 MORN_2 pdbhh F T 1xdk 3 C,D,G,H C,D,G,H MED1_MOUSE TRAP220 NHPMLMNLLKDNPA 14 T 7.6 DnaI_N pdbhh F Eukaryota T 1xe4 1 A A FEMX_WEIVI FemX PVLNLNDPQAVERYEEFMRQSPYGQVTQDLGWAKVMNNWEPVDVYLEDDQGAIIAAMSMLLGDTPTDKKFAYASKGPVMDVTDVDLLDRLVDEAVKALDGRAYVLRFDPEVAYSDEFNTTLQDHGYVTRNRNVADAGMHATIQPRLNMVLDLTKFPDAKTTLDLYPSKTKSKIKRPFRDGVEVHSGNSATELDEFFKTYTTMAERHGITHRPIEYFQRMQAAFDADTMRIFVAEREGKLLSTGIALKYGRKIWYMYAGSMDGNTYYAPYAVQSEMIQWALDTNTDLYDLGGIESESTDDSLYVFKHVFVKDAPREYIGEIDKVLDPEVYAELVKD 335 T 2.6E-25 FemAB unppssm F Bacteria T 1xf8 1 A A FEMX_WEIVI FemX PVLNLNDPQAVERYEEFMRQSPYGQVTQDLGWAKVKNNWEPVDVYLEDDQGAIIAAMSMLLGDTPTDKKFAYASKGPVMDVTDVDLLDRLVDEAVKALDGRAYVLRFDPEVAYSDEFNTTLQDHGYVTRNRNVADAGMHATIQPRLNMVLDLTKFPDAKTTLDLYPSKTKSKIKRPFRDGVEVHSGNSATELDEFFKTYTTMAERHGITHRPIEYFQRMQAAFDADTMRIFVAEREGKLLSTGIALKYGRKIWFMYAGSMDGNTYYAPYAVQSEMIQWALDTNTDLYDLGGIESESTDDSLYVFKHVFVKDAPREYIGEIDKVLDPEVYAELVKD 335 T 2.6E-25 FemAB unppssm F Bacteria T 1xgy 3 E,F P,Q Rhodopsin Epitope Mimetic Peptide TGALQERSK 9 T 30 BshC pdbhh F T 1xh3 3 C C aa 4-17 (LPAVVGLSPGEQEY) of alternative reading frame of M-CSF LPAVVGLSPGEQEY 14 T 6.3 DUF1127 pdbhh F T 1xhm 3 C C SIGK Peptide SIGKAFKILGYPDYD 15 T 1.1 UPF0175 pdbhh F T 1xkm 1 A,C A,C Distinctin chain A ENREVPPGFTALIKTLRKCKII 22 T 4 Bradykinin pdbhh F T 1xkm 2 B,D B,D Distinctin chain B NLVSGLIEARKYLEQLHRKLKNCKV 25 T 0.35 hGDE_central pdbhh F T 1xn2 2 E,F,G,H E,F,G,H OM03-4 REWWSEVNXAEF 12 T 2.2 PHTB1_N pdbhh F T 1xn3 2 E I Peptidic inhibitor KTEEISEVNXVAEF 14 T 13 DUF1805 pdbhh F T 1xoc 2 B B Nonapeptide VDSKNTSSW VDSKNTSSW 9 T 8.4 Ac76 pdbhh F T 1xof 1 A A BBAhetT1 XYRIXSYDFXDEAEKLLRDAXG 22 T 5.7 DUF4952 pdbhh F T 1xof 2 B B BBAhetT1 XYRIXSYDFXDKFKKLLRKAXG 22 T 11 DUF1949 pdbhh F T 1xqd 1 A A NOR_FUSOX P450NOR MASGAPSFPFSRASGPEPPAEFAKLRATNPVSQVKLFDGSLAWLVTKHKDVCFVATSEKLSKVRTRQGFPELGAGGKQAAKAKPTFVDMDPPEHMHQRSMVEPTFTPEAVKNLQPYIQRTVDDLLEQMKQKGCANGPVDLVKEFALPVPSYIIYTLLGVPFNDLEYLTQQNAIRTNGSSTARQASAANQELLDYLAILVEQRLVEPKDDIISKLCTEQVKPGNIDKSDAVQIAFLLLVAGNATMVNMIALGVATLAQHPDQLAQLKANPSLAPQFVEELCRYHTASALAIKRTAKEDVMIGDKLVRANEGIIASNQSANRDEEVFENPDEFNMNRKWPPQDPLGFGFGDHRCIAEHLAKAELTTVFSTLYQKFPDLKVAVPLGKINYTPLNRDVGIVDLPVIF 403 T 6.1E-07 p450 unp F Eukaryota T 1xqh 2 B,D B,F P53_HUMAN 9-mer peptide from tumor protein p53 LKSKKGQSTY 10 T 14 Flp_Fap pdbhh F Eukaryota T 1xr0 1 A A FGFR1_HUMAN FGFR-1, BFGF-R, FMS-LIKE TYROSINE KINASE-2, C-FGR HSQMAVHKLAKSIPLRRQVTVS 22 T 1.3 DUF1823 pdbhh F Eukaryota T 1xr8 3 C C EBNA3_EBVB9 EBNA-3A LEKARGSTY 9 T 4.5 DUF4872 pdbhh T Viruses T 1xu6 1 A A VSM2_TRYBB VSG 221, MITAT1.2 C-TERMINAL DOMAIN GSHMLEVLTQKHKPAESQQQAAETEGSCNKKDQNECKSPCKWHNDAENKKCTLDKEEAKKVADETAKDGKTGNTNTTGSS 80 T 0.00099 Trypan_glycop_C pdbpercent F Eukaryota T 1xxz 1 A A SRIF CKFFXXTXTSC 11 F F T 1xy4 1 A A SRIF YCKEFXXTFKSC 12 T 0.64 CRM1_repeat pdbhh F T 1xy5 1 A A SRIF YCKFEXXTFXSC 12 T 0.36 Laterosporulin pdbhh F T 1xy6 1 A A SRIF YCKFEXXTFKSC 12 T 0.41 Laterosporulin pdbhh F T 1xy9 1 A A SRIF CKFAXXTXTSC 11 T 0.41 DUF2195 pdbhh F T 1xyr 4 D 5 POLG_POL1M Genome polyprotein, Coat protein VP3 GLPVMNTPGSNQ 12 T 2 GSH-S_N pdbhh T Viruses T 1xyr 7 G 8 POLG_POL1M Genome polyprotein, Coat protein VP1 PALTAVETGAT 11 T 8.9 DUF6047 pdbhh T Viruses T 1y03 1 A A ANP3_MYOSC RSS3 GSMNAPARAAAKTAADALAAAKKTAADAAAAAAAA 35 T 160 DUF2443 pdbhh F Eukaryota T 1y19 1 A,C,E,G,I,K A,C,E,G,I,K PI51C_MOUSE Phosphatidylinositol-4-phosphate 5-kinase, type 1 gamma DERSWVYSPLHYSA 14 T 2.2 Invas_SpaK pdbhh F Eukaryota T 1y29 1 A A TXH10_HAPSC huwentoxin-x KCLPPGKPCYGATQKIPCCGVCSHNKCT 28 T 0.0049 Conotoxin unp F Eukaryota T 1y3a 2 E,F,G,H E,F,G,H KB752 peptide SRVTWYDFLMEDTKSR 16 T 2.1 DUF2760 pdbhh F T 1y5c 1 A A Q0PGA5_BUBBU LACTOFERRIN, LACTOFERRICIN B, LFCIN B RRWQWRMKKLG 11 T 0.00046 Transferrin unppercent F Eukaryota T 1y7a 1 A,B A,B PPB_ECOLI APASE TPEMPVLENRAAQGDITAPGGARRLTGDQTAALRDSLSDKPAKNIILLIGDGMGDSEITAARNYAEGAGGFFKGIDALPLTGQYTHYALNKKTGKPDYVTDSAASATAWSTGVKTYNGALGVDIHEKDHPTILEMAKAAGLATGNVSTAELQHATPAALVAHVTSRKCYGPSATSEKCPGNALEKGGKGSITEQLLNARADVTLGGGAKTFAETATAGEWQGKTLREQAQARGYQLVSDAASLNSVTEANQQKPLLGLFADGNMPVRWLGPKATYHGNIDKPAVTCTPNPQRNDSVPTLAQMTDKAIELLSKNEKGFFLQVEGASIDWQDHAANPCGQIGETVDLDEAVQRALEFAKKEGNTLVIVTADHAHASQIVAPDTKAPGLTQALNTKDGAVMVMSYGNSEEDSQEHTGSQLRIAAYGPHAANVVGLTDQTDLFYTMKAALGLK 449 T 6.8E-10 Alk_phosphatase unppssm F Bacteria T 1y7l 2 B P CYSE_HAEIN SAT FRAGMENT; E.C.2.3.1.30 GIDDGMNLNI 10 T 18 DUF2523 pdbhh F Bacteria T 1y98 2 B B CTIP_HUMAN CtIP PHOSPHORYLATED PEPTIDE PTRVSSPVFGAT 12 T 4.3 Pardaxin pdbhh F Eukaryota T 1ycp 3 C,G F,N FIBA_HUMAN FIBRINOPEPTIDE A-ALPHA ADSGEGDYLAEGGGVRGPRVVER 23 T 1.1 DUF2388 pdbhh F Eukaryota T 1yjm 2 D,E,F E,F,G X-RAY REPAIR CROSS-COMPLEMENTING PROTEIN 4 XYDESTDEESEKK 13 T 0.6 XRCC4 pdbhh F T 1ym0 2 B B fibrinotic enzyme component B QPPVWYPGGQCGVSQYSDAGDMELPPG 27 T 2.3 PHM7_ext pdbhh F T 1ymt 2 B B NR0B2_MOUSE ORPHAN NUCLEAR RECEPTOR SHP, SMALL HETERODIMER PARTNER RPTILYALLSPSPR 14 T 0.086 NR_Repeat unphh F Eukaryota T 1yp0 2 B B NR0B2_RAT Nuclear receptor subfamily 0, group B, member 2 HPTILYTLLSPG 12 T 0.02 NR_Repeat unphh F Eukaryota T 1yr5 2 B B DAPK1_HUMAN DAP KINASE 1 RKKWKQSVRLISLCQRLSR 19 T 11 AAA_lid_8 pdbhh F Eukaryota T 1yrk 2 B B 13-residue peptide MALYSIXQPYVFA 13 T 1.5 Monellin pdbhh F T 1yt6 1 A A peptide SD ACLPWSDGPC 10 T 0.31 VERL pdbhh F T 1ytr 1 A A PLNA_LACPL Bacteriocin plantaricin A KSSAYSLQMGATAIKQVKKLFKKWGW 26 T 0.045 Bacteriocin_IIc unp F Bacteria T 1yuc 2 C,D C,D NR0B2_HUMAN ORPHAN NUCLEAR RECEPTOR SHP; SMALL HETERODIMER PARTNER ASRPAILYALLSSS 14 T 9.2 NR_Repeat unphh F Eukaryota T 1yvh 2 B B SH2B2_RAT APS ADAPTER PROTEIN GRARAVENQXSFY 13 T 4 UPF0542 pdbhh F Eukaryota T 1ywh 2 B,D,F,H,J,L,N,P B,D,F,H,J,L,N,P antagonist peptide KSDXFXXYLWSSK 13 T 3.5 EF-hand_like pdbhh F T 1ywt 2 C,D C,D synthetic optimal phosphopeptide (mode-1) MARSHSYPAGKK 12 T 8.4 Ribosomal_S13_N pdbhh F T 1yy2 1 A A Leuprolide QHWSYXLRPX 10 T 0.00076 GnRH pdb F T 1yy6 2 B B EBNA1_EBVB9 EBNA1 DPGEGPSTGP 10 T 0.18 Herpes_IE1 pdbhh T Viruses T 1yyp 2 B B DPOL_HCMVA POL LPRRLHLEPAFLPYSVKAHECC 22 T 2.8 TP53IP5 pdbhh T Viruses T 1z7z 4 D 4 POLG_CXA21 human coxsackievirus A21 LPLTKVDSITTF 12 T 7 Flot pdbhh T Viruses T 1z7z 5 E 5 human coxsackievirus A21 LIGRTQ 6 T 33 Corona_7 pdbhh F T 1z9o 2 G,H,I,J,K,L G,H,I,J,K,L OSBL1_RAT ORP-1 SEDEFYDALS 10 T 1.2 AAA_assoc_2 pdbhh F Eukaryota T 1zh7 2 C,D C,D NR0B2_RAT nuclear receptor subfamily 0, group B, member 2 SHPTILYTLLS 11 T 0.02 NR_Repeat unphh F Eukaryota T 1zhb 3 C,F,I,L C,F,I,L DOPO_RAT DOPAMINE BETA- HYDROXYLASE, DBH KALYNYAPI 9 T 0.55 Ntox47 pdbhh F Eukaryota T 1zhk 3 C C EBV-peptide LPEPLPQGQLTAY LPEPLPQGQLTAY 13 T 14 AP-5_subunit_s1 pdbhh F T 1zi7 1 A,B,C A,B,C KES1_YEAST OXYSTEROL-BINDING PROTEIN HOMOLOG 4 GAMDPPFILSPISLTEFSQYWAEHPELFLEPSFINDDNYKEHCLIDPEVESPELARMLAVTKWFISTLKSQYCSRNESLGSEKKPLNPFLGELFVGKWENKEHPEFGETVLLSEQVSHHPPVTAFSIFNDKNKVKLQGYNQIKASFTKSLMLTVKQFGHTMLDIKDESYLVTPPPLHIEGILVASPFVELEGKSYIQSSTGLLCVIEFSGVDGKKNSFKARIYKDSKDSKDKEKALYTISGQWSGSSKIIKANKKEESRLFYDAARIPAEHLNVKPLEEQHPLESRKAWYDVAGAIKLGDFNLIAKTKTELEETQRELRKEEEAKGISWQRRWFKDFDYSVTPEEGALVPEKDDTFLKLASALNLSTKNAPSGTLVGDKEDRKEDLSSIHWRFQRELWDEEKEIVL 406 T 2.8E-13 Oxysterol_BP unppssm F Eukaryota T 1zkk 2 E,F,G,H E,F,G,H H4_HUMAN Peptide corresponding to residues 15-24 of histone H4 AKRHRKVLRD 10 T 0.27 UPF0137 unp F Eukaryota T 1zla 6 K K Q9DUM3_HHV8 latent nuclear antigen MAPPGMRLRSGRSTGAPLTRGS 22 T 7.1 PolC_DP2 pdbhh T Viruses T 1zns 2 C A CEA7_ECOLX Colicin E7 MESKRNKPGKATGKGKPVNNKWLNNAGKDLGSPVPDRIANKLRDKEFKSFDDFRKKFWEEVSKDPELSKQFSRNNNDRMKVGKAPKTRTQDVSGKRTSFELHEEKPISQNGGVYDMDNISVVTPKRHIDIHRGK 134 T 0.026 HNH pdbpercent F Bacteria T 1zpx 1 A A Q00484_9CNID mini-collagen XPCGSYCPSVCAPACAPVCCYPX 23 T 0.047 C_tripleX pdbhh F Eukaryota T 1zrv 1 A A SPING_PSEUS spinigerin HVDKKVADKVLLLKQLRIMRLLTRL 25 T 5.4 YfbU pdbhh F Eukaryota T 1zsd 3 C C BZLF1_EBVB9 EB1, ZEBRA EPLPQGQLTAY 11 T 7.4 AP-5_subunit_s1 pdbhh T Viruses T 1zsg 2 B B PAK1_HUMAN P21-ACTIVATED KINASE 1, PAK-1, P65-PAK, ALPHA-PAK, PAK PEPTIDE DATPPPVIAPRPEHTKSVYTRS 22 T 1.4 TFIIA unppercent F Eukaryota T 1zt1 3 C P Influenza virus epitope, FEANGNLI FEANGNLI 8 T 0.046 TTc_toxin_rep pdbhh F T 1zt7 3 E,F P,Q SV40 epitope, SEFLLEKRI SEFLLEKRI 9 T 20 OMS28_porin pdbhh F T 1zub 2 B B RB6I2_RAT ERC PROTEIN 1, ERC1, CAZ-ASSOCIATED STRUCTURAL PROTEIN 2, CAST2, RAB6 INTERACTING PROTEIN 2, C-TERMINAL PEPTIDE CDQDEEEGIWA 11 T 1.8 MgrB pdbhh F Eukaryota T 1zuz 2 B B DAPK2_HUMAN DRP-1 kinase RRRWKLDFSIVSLCNHLTR 19 T 5 AAA_lid_8 pdbhh F Eukaryota T 1zvs 3 C,F C,F Tat-Tl8 TTPESANL 8 T 70 SepA pdbhh F T 1zx3 1 A A Q82XL7_NITEU hypothetical protein NE0241 MGSSHHHHHHSSGRENLYFQGHMGKKKNKKTEVQQPDPMRKNWIMENMDSGVIYLLESWLKAKSQETGKEISDIFANAVEFNIVLKDWGKEKLEETNTEYQNQQRKLRKTYIEYYDREMKGS 122 T 0.0013 Ets pdbpssm F Bacteria T 1zzd 2 B B RIR4_YEAST RIBONUCLEOTIDE REDUCTASE SMALL SUBUNIT 2 KEINFDDDF 9 T 8.7 Etd1 pdbhh F Eukaryota T 2a1m 1 A,B A,B CPXA_PSEPU CAMPHOR 5-MONOOXYGENASE, P450CAM MTTETIQSNANLAPLPPHVPEHLVFDFDMYNPSNLSAGVQEAWAVLQESNVPDLVWTRCNGGHWIATRGQLIREAYEDYRHFSSECPFIPREAGEAYDFIPTSMDPPEQRQFRALANQVVGMPVVDKLENRIQELACSLIESLRPQGQCNFTEDYAEPFPIRIFMLLAGLPEEDIPHLKYLTDQMTRPDGSMTFAEAKEALYDYLIPIIEQRRQKPGTDAISIVANGQVNGRPITSDEAKRMCGLLLVGGLDTVVNFLSFSMEFLAKSPEHRQELIERPERIPAACEELLRRFSLVADGRILTSDYEFHGVQLKKGDQILLPQMLSGLDERENAAPMHVDFSRQKVSHTTFGHGSHLCLGQHLARREIIVTLKEWLTRIPDFSIAPGAQIQHKSGIVSGVQALPLVWDPATTKAV 415 T 1.6E-05 p450 unppercent F Bacteria T 2a3d 1 A A PROTEIN (DE NOVO THREE-HELIX BUNDLE) MGSWAEFKQRLAAIKTRLQALGGSEAELAAFEKEIAAFESELQAYKGKGNPEVEALRKEAAAIRDELQAYRHN 73 T 0.0085 DUF1202 pdb F T 2a3i 2 B B NCOA1_HUMAN NCOA-1, STEROID RECEPTOR COACTIVATOR-1, SRC-1, RIP160, HIN-2 PROTEIN QQKSLLQQLLTE 12 T 3.8 GFD1 pdbhh F Eukaryota T 2a4j 2 B B XPC_HUMAN XERODERMA PIGMENTOSUM GROUP C COMPLEMENTING PROTEIN P125 NWKLLAKGLLIRERLKR 17 T 9.9 MazG_C pdbhh F Eukaryota T 2a5z 1 A,B,C A,B,C Q8ED25_SHEON hypothetical protein SO2946 MGGSFGKKGASSATAAQVPLATETTPGLMSPSEKLKLSTLTTSIATSDFYASYDFMMHSIGLTSANNISLLSTGNISLQNILSEGNHFGVQPIVSSTTANASFLAGMLMAIFPKESELEVTVYFKTPSAFNPAQLTVIGSTSIGLGISDRSGLIIENGNAFGGIVKASAATETGSTYALSTSTWYICKFKMLTDDRFKVTLYSDSGTQLYSYTSTAAMFRADNATAHIGFKTQCKTATAGISLISIDLIEFKAKVSATRAKV 262 T 33 DUF1652 pdbhh F Bacteria T 2a6d 3 E P Dodecapeptide, RLLIADPPSPRE RLLIADPPSPRE 12 T 2.3 DUF4666 pdbhh F T 2a6i 3 C P Dodecapeptide: KLASIPTHTSPL KLASIPTHTSPL 12 T 18 IML1 pdbhh F T 2a6k 3 E P DODECAPEPTIDE: SLGDNLTNHNLR SLGDNLTNHNLR 12 T 8.5 DUF764 pdbhh F T 2a7u 1 A A ATPA_ECO57 F-ATPASE ALPHA CHAIN MQLNSTEISELIKQRIAQFNVV 22 T 0.5 GnsAB_toxin pdbhh F Bacteria T 2a9x 2 B 1 BIV-2 cyclic peptide RVRTRGKRRIRVPP 14 T 1.6 YhdX pdbhh F T 2ab9 1 A A SFTI1_HELAN pro-SFTI-1 GYKTSISTITIEDNGRCTKSIPPICFPDGRP 31 T 0.015 Bowman-Birk_leg pdb F Eukaryota T 2abz 2 C,D,E,F C,D,E,F MCPI_HIRME LEECH CARBOXYPEPTIDASE INHIBITOR, LCI, INHIBITOR OF A/B METALLOCARBOXYPEPTIDASES GSHTPDESFLCYQPDQVCAFICRGAAPLPSEGECNPHPTAPWAREGAVEWVPYSTGQCRTTCIPYVE 67 T 0.0093 Inhibitor_I68 pdbpssm F Eukaryota T 2ad9 2 B A PTBP1_HUMAN PTB, HETEROGENEOUS NUCLEAR RIBONUCLEOPROTEIN I, HNRNP I, 57 KDA RNA-BINDING PROTEIN PPTB-1 MGSSHHHHHHSSGLVPRGSHMGDSRSAGVPSRVIHIRKLPIDVTEGEVISLGLPFGKVTNLLMLKGKNQAFIEMNTEEAANTMVNYYTSVTPVLRGQPIYIQFSNHKELKTDSSPNQAR 119 T 0.00041 RRM_1 pdbpssm F Eukaryota T 2ag3 1 A A GCN4-pLI RMKQIEDKLEEILSXYHIENELARIKKLLGER 32 T 0.0053 VGPC1_C pdbhh F T 2agh 3 C C KMT2A_HUMAN ALL-1, TRITHORAX-LIKE PROTEIN SDDGNILPSDIMDFVLKNTPSMQALGESPES 31 T 9.5 ComFB pdbhh F Eukaryota T 2ain 2 B B BCR_HUMAN 6-mer peptide from Breakpoint cluster region protein LFSTEV 6 T 79 DUF3916 pdbhh F Eukaryota T 2ajj 1 A A POLG_BVDVC NS5A SGNYVLDLIYSLHKQINRGLKKIVLGWA 28 T 3.9 DUF5103 pdbhh T Viruses T 2aka 2 B L LINKER TRLVPRGSELALE 13 T 7.8 SsgA pdbhh F T 2amn 1 A A CTHL1_CHICK cathelicidin RVKRVWPLVIRTVIAGYNLYRAIKKK 26 T 2.2 Phage_coatGP8 pdbhh F Eukaryota T 2an6 2 E,F,G,H E,F,G,H peptide from Phyllopod LQQERTKLRPVAMVRPTVRVQPQL 24 T 5.9 PRR20 pdbhh F T 2anh 1 A,B A,B PPB_ECOLI ALKALINE PHOSPHATASE MPVLENRAAQGDITAPGGARRLTGDQTAALRDSLSDKPAKNIILLIGDGMGDSEITAARNYAEGAGGFFKGIDALPLTGQYTHYALNKKTGKPDYVTDSAASATAWSTGVKTYNGALGVDIHEKDHPTILEMAKAAGLATGNVSTAELQHATPAALVAHVTSRKCYGPSATSEKCPGNALEKGGKGSITEQLLNARADVTLGGGAKTFAETATAGEWQGKTLREQAQARGYQLVSDAASLNSVTEANQQKPLLGLFADGNMPVRWLGPKATYHGNIDKPAVTCTPNPQRNDSVPTLAQMTDKAIELLSKNEKGFFLQVEGASIDKQDHAANPCGQIGETVDLDEAVQRALEFAKKEGNTLVIVTADHAHASQIVAPDTKAPGLTQALNTKDGAVMVMSYGNSEEDSQEHTGSQLRIAAYGPHAANVVGLTDQTDLFYTMKAALGLK 446 T 6.8E-10 Alk_phosphatase unppssm F Bacteria T 2aof 2 C C PEPTIDE INHIBITOR RPGNXLQSRPX 11 T 46 IBV_3A pdbhh F T 2aoh 2 C C PEPTIDE INHIBITOR VSFNXPQITA 10 T 9.6 DUF3912 pdbhh F T 2aoj 2 C C PEPTIDE INHIBITOR VSFNXPQITAAX 12 T 16 DUF3912 pdbhh F T 2ap2 2 C,D P,Q MDR1_CRIGR EPITOPE PEPTIDE VVQEALDKAREGRT 14 T 10 Dodecin pdbhh F Eukaryota T 2ap7 1 A A BMNH5_BOMVA Bombinin H2 IIGPVLGLVGSALGGLLKKI 20 T 0.00098 Bombinin pdb F Eukaryota T 2ap8 1 A A BMNH5_BOMVA bombinin H4 IXGPVLGLVGSALGGLLKKI 20 T 0.00098 Bombinin pdb F Eukaryota T 2aq9 2 B X peptide inhibitor SSGWMLDPIAGKWSR 15 T 0.11 Kelch_1 pdb F T 2asq 2 B B PIAS2_HUMAN PROTEIN INHIBITOR OF ACTIVATED STAT X, MSX-INTERACTING ZINC FINGER PROTEIN, MIZ1, DAB2-INTERACTING PROTEIN, DIP, ANDROGEN RECEPTOR-INTERACTING PROTEIN 3, ARIP3, PIAS-NY PROTEIN KVDVIDLTIESSSDEEEDPPAKRQM 25 T 0.026 EF-1_beta_acid pdb F Eukaryota T 2ast 4 D D CDN1B_HUMAN CYCLIN-DEPENDENT KINASE INHIBITOR P27, P27KIP1 AGSVEQTPKK 10 T 52 DUF1850 pdbhh F Eukaryota T 2asu 1 A A HGFL_HUMAN MACROPHAGE STIMULATORY PROTEIN, MSP, MACROPHAGE STIMULATING PROTEIN FEKCGKRVDRLDQRRSKLR 19 F F Eukaryota T 2atp 2 B,E E,F artifact linker AGSADDARKDAARKDDARKDDARKDGSSA 29 T 78 Oberon_cc pdbhh F T 2auc 2 D D MYOA_PLAYO MYOA XLMRVQAHIRKRMVA 15 T 0.063 BORCS8 pdbhh F Eukaryota T 2aww 2 C C GRIA1_RAT 18-RESIDUE C-TERMINAL PEPTIDE FROM GLUR-A SIPCMSHSSGMPLGATGL 18 T 4.1 Glyco_hydr_116N pdbhh F Eukaryota T 2axf 3 C C BZLF1_EBVB9 EBV, EB1, ZEBRA APQPAPENAY 10 T 0.25 Mucin-like unp T Viruses T 2axi 2 B B cyclic 8-mer peptide PFEXLDWEFX 10 T 0.48 Pico_P2B pdbhh F T 2axz 3 H I TPPKEVT(MSE) peptide TPPKEVTM 8 T 21 TrwC pdbhh F T 2azm 2 C,D C,D H2AX_HUMAN HISTONE H2AFX KKATQASQEY 10 T 16 Class_IIIsignal pdbhh F Eukaryota T 2b05 2 B,D,F,H,J,L G,H,I,J,K,L peptide RAISLP 6 T 54 Tryp_alpha_amyl pdbhh F T 2b19 1 A A TKN1_HUMAN NPK DADSSIEKQVALLKALYGHGQISHKRHKTDSFVGLM 36 T 0.0027 Tachykinin pdbhh F Eukaryota T 2b5b 1 A A DBTEW_CARCR Defensin EKKCPGRCTLKCGKHERPTLPYNCGKYICCVPVKVK 36 F F Eukaryota T 2b5k 1 A A PPM1_LIMPO PV5; POLYPHEMUSIN I RRWCFRVCYRGRFCYRKCRX 20 T 0.22 zf-CCHH pdbhh F Eukaryota T 2b5p 1 A A CT6A_CONMR Lambda-conotoxin CMrVIA VCCGYKLCHPC 11 T 0.33 Oxidored-like pdbhh F Eukaryota T 2b7f 2 C,F,I I,J,K (ACE)APQV(STA)VMHP peptide XAPQVXVMHP 10 T 19 OTCace pdbhh F T 2b9h 2 B C STE7 RRNLKGLNLNLHPD 14 T 3.6 DUF3626 pdbhh F T 2b9i 2 B C MSG5 PRSLQNRNTKNLSLDIAALHP 21 T 28 DUF2000 pdbhh F T 2b9j 2 B C CKI, FAR1, FACTOR ARREST PROTEIN SKRGNIPKPLNLS 13 T 3.8 DUF5361 pdbhh F T 2bba 2 B P Agonist peptide TNYLFSPNGPIARAW 15 T 0.036 PufQ pdbhh F T 2bbl 1 A A POLG_POL1M Genome linked protein VPg GAYTGLPNKKPNVPTIRTAKVQ 22 T 11 DUF2111 pdbhh T Viruses T 2bbm 2 B B MYLK2_RABIT MYOSIN LIGHT CHAIN KINASE KRRWKKNFIAVSAANRFKKISSSGAL 26 T 0.024 PACT_coil_coil unppssm F Eukaryota T 2bbu 2 B B IL6RB_MOUSE GP130 PHOSPHOPEPTIDE STASTVEXSTVVHSG 15 T 10 DUF4244 pdbhh F Eukaryota T 2bc7 1 A A CA1_CONIM Alpha-conotoxin ImI GXCSDPRXAWRC 12 T 0.0098 Toxin_8 unphh F Eukaryota T 2bc8 1 A A CA1_CONIM Alpha-conotoxin ImI GXXSDPRXAWRX 12 T 0.0098 Toxin_8 unphh F Eukaryota T 2bcx 2 B B RYR1_RABIT SKELETAL MUSCLE-TYPE RYANODINE RECEPTOR, RYR1, RYR-1, SKELETAL MUSCLE CALCIUM RELEASE CHANNEL KSKKAVWHKLLSKQRRRAVVACFRMTPLYN 30 T 2.1 Spc110_C pdbhh F Eukaryota T 2bec 2 B B SL9A1_HUMAN NA(+)/H(+) EXCHANGER 1, NHE-1, SOLUTE CARRIER FAMILY 9 MEMBER 1, NA(+)/H(+) ANTIPORTER, AMILORIDE-SENSITIVE, APNH VDLLAVKKKQETKRSINEEIHTQFLDHLLTGIEDICGHYGHHH 43 T 0.99 Herpes_TK_C pdbpercent F Eukaryota T 2bey 1 A A BIKK CTKSIPPICTKSIPPI 16 T 0.016 Bowman-Birk_leg pdb F T 2bil 1 A A CONSENSUS PIM1 PEPTIDE PIMTIDE ARKRRRHPSGPPTA 14 T 1.5 DUF3019 pdbhh F T 2bj4 3 C,D C,D PEPTIDE ANTAGONIST LTSRDFGSWYA 11 T 0.6 HNH_repeat pdbhh F T 2bn5 2 B B RU17_DROME U1 SNRNP 70 KDA, SNRNP70, U1-70K PROTEIN RPPPAHHNMFSVPPPPILGRG 21 T 17 DUF1851 pdbhh F Eukaryota T 2bp3 2 C,D S,T GP1BA_HUMAN GLYCOPROTEIN B ALPHA, GLYCOPROTEIN IBALPHA, GP-IB ALPHA, GPIBA, GPIB-ALPHA, CD42B-ALPHA, CD42B LRGSLPTFRSSLFLWVRPNGRV 22 T 0.091 GGN unphh F Eukaryota T 2bp5 2 B P P2RX4_RAT ATP RECEPTOR, P2X4, PURINERGIC RECEPTOR VEDYEQGLSG 10 T 4.9 GM_CSF pdbhh F Eukaryota T 2bqz 2 B,D B,F SMYD5_HUMAN HISTONE H4 RHRKVLRDNY 10 T 3.9 Phage_X pdbhh F Eukaryota T 2br8 2 F,G,H,I,J F,G,H,I,J CA1A_CONPE ALPHA-PNIA GCCSLPPCALNNPKYCX 17 T 0.0013 Toxin_8 pdbpssm F Eukaryota T 2bss 3 C C Q98Y46_9HIV1 HIV PEPTIDE KRWIILGLNK 10 T 1 COX2-transmemb pdbhh T Viruses T 2bta 1 A _ B3AT_HUMAN B3P MEELQDDYEDMMEENX 16 T 1.1 DUF1265 pdbhh F Eukaryota T 2buo 2 B T INHIBITOR OF CAPSID ASSEMBLY ITFEDLLDYYGP 12 T 0.92 DUF2610 pdbhh F T 2bvo 3 C C Q70A61_9HIV1 HIV-P24 KAFSPEVIPMF 11 T 9.1E-05 Gag_p24 unphh T Viruses T 2byp 2 F,G,H,I,J F,G,H,I,J CA1_CONIM ALPHA-CONOTOXIN IMI GCCSDPRCAWRX 12 T 0.0098 Toxin_8 unphh F Eukaryota T 2bzk 1 A A PIMTIDE ARKRRRHPSGPPTAX 15 T 1.8 DUF3019 pdbhh F T 2c1d 1 A,C,E,G A,C,E,G SOXA_PARPN SOXA DPVEDGLVIETDSGPVEIVTKTAPPAFLADTFDTIYSGWHFRDDSTRDLERDDFDNPAMVFVDRGLDKWNAAMGVNGESCASCHQGPESMAGLRAVMPRVDEHTGKLMIMEDYVNACVTERMGLEKWGVTSDNMKDMLSLISLQSRGMAVNVKIDGPAAPYWEHGKEIYYTRYGQLEMSCANCHEDNAGNMIRADHLSQGQINGFPTYRLKDSGMVTAQHRFVGCVRDTRAETFKAGSDDFKALELYVASRGNGLSVEGVSVRH 264 T 6E-05 DUF1924 unphh F Bacteria T 2c2l 2 E,F,G,H E,F,G,H HS90A_HUMAN HSP90 DTSRMEEVD 9 T 6.5 Clathrin_lg_ch pdbhh F Eukaryota T 2c3i 1 A A PIMTIDE KRRRHPSG 8 T 3.5 RNA_GG_bind pdbhh F T 2c5i 1 A P VPS51_YEAST VPS51, APICAL BUD GROWTH PROTEIN 3 AEQISHKKSLRVSSLNKDRRLLLREFYNL 29 T 0.028 rRNA_processing unppercent F Eukaryota T 2c5k 1 A P VPS51_YEAST VPS51, APICAL BUD GROWTH PROTEIN 3 KSLRVSSLNKDRRLLLREFYNLEN 24 T 0.028 rRNA_processing unppercent F Eukaryota T 2c5v 3 E,F F,H ALA-ALA-ABA-ARG-SER-LEU-ILE-PFF-NH2 AAXRSLIXX 9 T 0.68 SDA1 pdbhh F T 2c9f 2 F,G,H,I,J S,T,U,V,W SPIKE_ADE02 N-TERMINAL PEPTIDE OF THE FIBER MKRARPSGDTFNPVYPYDT 19 T 0.44 DUF5449 pdbhh T Viruses T 2c9l 3 C,D Y,Z BZLF1_EBVB9 EB1, ZEBRA MLEIKRYKNRVAARKSRAKFKQLLQHYREVAAAKSSENDRLRLLLKQMCPSLDVDSIIPRTPD 63 T 0.00058 bZIP_2 pdb T Viruses T 2c9n 3 C,D Y,Z BZLF1_EBVB9 EB1, ZEBRA MLEIKRYKNRVASRKCRAKFKQLLQHYREVAAAKSSENDRLRLLLKQMCPSLDVDSIIPRTPD 63 T 0.0016 bZIP_2 pdb T Viruses T 2cbl 2 B B ZAP70_HUMAN ZAP-70 TLNSDGXTPEPA 12 T 0.99 Galanin pdbhh F Eukaryota T 2cch 3 E,F E,F CDC6_HUMAN CDC6-BIS, CDC6-RELATED PROTEIN, P62, CDC6, HSCDC6, HSCDC18 HTLKGRRLVFDN 12 T 1 Med26_C pdbhh F Eukaryota T 2cci 3 E,F F,I CDC6_HUMAN CDC6-RELATED PROTEIN,CDC18-RELATED PROTEIN,HSCDC18,P62(CDC6),HSCDC6 HHASPRKQGKKENGPPHSHTLKGRRLVFDN 30 T 9.8 Rhodanese_C pdbhh F Eukaryota T 2ce8 2 E,F X,Y EH1 PEPTIDE MFSIDNILA 9 T 0.28 TerC pdbhh F T 2cef 1 A A TF_HUMAN TFCD, TF, COAGULATION FACTOR III, THROMBOPLASTIN, CD142 ANTIGEN, TFPP CRKAGVGQSWKENSPLNVS 19 T 0.0002 Shisa unppssm F Eukaryota T 2ck0 3 C P PROTEIN (11-MER; CYCLIC PEPTIDE) CKEWLSTAPCG 11 T 0.75 PSI_PsaJ pdbhh F T 2clr 3 C,F C,F CALR_HUMAN DECAMERIC PEPTIDE FROM CALRETICULIN MLLSVPLLLG 10 T 1.6 DUF4634 pdbhh F Eukaryota T 2cm4 1 A A Q5YD59_ORNMO OMCI DSESDCTGSEPVDAFQAFSEGKEAYVLVRSTDPKARDCLKGEPAGEKQDNTLPVMMTFKQGTDWASTDWTFTLDGAKVTATLGQLTQNREVVYDSQSHHCHVDKVEKEVPDYEMWMLDAGGLEVEVECCRQKLEELASGRNQMYPHLKDC 150 T 7.8E-05 His_binding pdbhh F Eukaryota T 2cmy 2 B B TI_VERHE VERONICA HEDERIFOLIA TRYPSIN INHIBITOR NTDPEQCKVMCYAQRHSSPELLRRCLDNCEKEHD 34 T 0.0098 DUF842 pdb F Eukaryota T 2cp8 1 A A NBR1_HUMAN KIAA0049 PROTEIN, NEIGHBOR OF BRCA1 GENE 1 PROTEIN GSSGSSGQTAALMAHLFEMGFCDRQLNLRLLKKHNYNILQVVTELLQLSGPSSG 54 T 0.0021 UBA pdbpssm F Eukaryota T 2cp9 1 A A EFTS_HUMAN EF-TS, EF-TSMT GSSGSSGSSKELLMKLRRKTGYSFVNCKKALETCGGDLKQAEIWLHKEAQKEGWSKAASGPSSG 64 T 0.00034 EF_TS unphh F Eukaryota T 2csp 1 A A RIMB2_HUMAN RIM-BP2 GSSGSSGVEFSTLPAGPPAPPQDVTVQAGVTPATIRVSWRPPVLTPTGLSNGANVTGYGVYAKGQRVAEVIFPTADSTAVELVRLRSLEAKGVTVRTLSAQGESVDSAVAAVPPELLVPPTPHPSGPSSG 130 T 0.00022 DUF4998 unphh F Eukaryota T 2ctd 1 A A ZN512_HUMAN Zinc finger protein 512 GSSGSSGRIRKEPPVYAAGSLEEQWYLEIVDKGSVSCPTCQAVGRKTIEGLKKHMENCKQEMFTCHHCGKQLRSLAGMKYHVMANHNSLPSGPSSG 96 T 0.00024 zf-C2H2_4 pdbpercent F Eukaryota T 2cvy 2 B B RIR2_YEAST RNR2 C-TERMINAL 9 MER PEPTIDE GAFTFNEDF 9 T 2.1 DUF4295 pdbhh F Eukaryota T 2cwg 2 B,D D,E T5 SIALOGLYCOPEPTIDE OF GLYCOPHORIN A DTYAATPR 8 T 34 DUF2024 pdbhh F T 2czs 1 A,B A,B Q748S4_GEOSL DHC2 MVSGEVRTKKVPLDTNHKRFYDAFAQGAGKLDLDRQCVECHHEKPGGIPFPKNHPVKPADGPMRCLFCHKFKLEHHHHHH 80 T 3.6E-05 Cytochrom_NNT unphh F Bacteria T 2czy 2 B B REST_HUMAN NRSF/REST APQLIMLANVALTGE 15 T 0.93 zf-C2H2 unppssm F Eukaryota T 2d0n 2 B,D B,D SLP-76 binding peptide PSIDRSTKP 9 T 36 Protein_K pdbhh F T 2d1x 2 E,F P,Q ASAP1_HUMAN proline rich region from development and differentiation enhancing factor 1 SKKRPPPPPPGHKRT 15 T 3 DUF6059 pdbhh F Eukaryota T 2d3g 2 C P HGS_HUMAN ubiquitin interacting motif from hepatocyte growth factor-regulated tyrosine kinase substrate LQEEEELQLALALSQSEAEEK 21 T 6.3E-05 UIM pdbhh F Eukaryota T 2d4o 1 A A Q72J89_THET2 hypothetical protein TTHA1254 MRFRPFTEEDLDRLNRLAGKRPVSLGALRFFARTGHSFLAEEGEEPMGFALAQAVWQGEATTVLVTRMEGRSVEALRGLLRAVVKSAYDAGVYEVALHLDPERKELEEALKAEGFALGPLVLAVRVLGSRGARGETRGVLE 141 T 0.067 DUF1999 unppssm F Bacteria T 2d7s 2 B B Q9QCE4_9PICO VPg1 protein GPYAGPLERQRPLKVRAKLPRQE 23 T 5.1 RNase_HII pdbhh T Viruses T 2d8v 1 A A ANCHR_MOUSE Zinc finger FYVE domain-containing protein 19 GSSGSSGLPWCCICNEDATLRCAGCDGDLYCARCFREGHDNFDLKEHQTSPYHPRRPCQEHSGPSSG 67 T 0.00043 zf-B_box pdbpssm F Eukaryota T 2db2 1 A A DHX30_HUMAN KIAA0890 protein GSSGSSGASRDLLKEFPQPKNLLNSVIGRALGISHAKDKLVYVHTNGPKKKKVTLHIKWPKSVEVEGYGSKKIDAERQAAAAACQLFKGWGLLGPRNELFDAAKYRVLADRFGSGPSSG 119 T 0.00018 Dicer_dimer pdbhh F Eukaryota T 2dcx 1 A A DMS4_PHYSA DS IV ALWKTLLKKVLKAX 14 T 0.056 DD_K pdb F Eukaryota T 2dew 2 B A 10-mer peptide from histone H3 LQTARKSTGG 10 T 23 DUF5915 pdbhh F T 2dex 2 B A 10-mer peptide from histone H3 LAPRKQLATK 10 T 24 DUF3597 pdbhh F T 2dey 2 B A 10-mer peptide from histone H4 XSGRGKGGKGL 11 T 5.5 G3P_acyltransf pdbhh F T 2df6 2 C,D C,D PAK2_RAT 18-mer from PAK2 PPVIAPRPEHTKSIYTRS 18 T 2.2 TFIIA unppercent F Eukaryota T 2dhx 1 A A PAR10_HUMAN poly (ADP-ribose) polymerase family, member 10 variant GSSGSSGGVAVEVRGLPPAVPDELLTLYFENRRRSGGGPVLSWQRLGCGGVLTFREPADAERVLAQADHELHGAQLSLRPAPPRAPARLLLQGLPPGTSGPSSG 104 T 0.00023 NID pdbhh F Eukaryota T 2dhz 1 A A RPGFL_HUMAN LINK GUANINE NUCLEOTIDE EXCHANGE FACTOR II GSSGSSGDEIFCRVYMPDHSYVTIRSRLSASVQDILGSVTEKLQYSEEPAGREDSLILVAVSSSGEKVLLQPTEDCVFTALGINSHLFACTRDSYEALVPLPEEIQVSPGDTEISGPSSG 120 T 0.01 RA pdbpssm F Eukaryota T 2djy 2 B B SMAD7_HUMAN SMAD 7, MOTHERS AGAINST DPP HOMOLOG 7, SMAD7, HSMAD7 GPLGSELESPPPPYSRYPMD 20 T 0.051 WBP-1 pdbhh F Eukaryota T 2drm 2 E,F E,G 18-mer peptide from Acan125 AKPVPPPRGAKPAPPPRT 18 T 30 HCV_NS5a_C pdbhh F T 2ds8 2 B,D P,Q XB APALRVVK 8 T 9.8 ACC_epsilon pdbhh F T 2dun 1 A A DPOLM_HUMAN POL MU GSSGSSGSTRFPGVAIYLVEPRMGRSRRAFLTGLARSKGFRVLDACSSEATHVVMEETSAEEAVSWQERRMAAAPPGCTPPALLDISWLTESLGAGQPVPVECRHRLEVAGPRKGPLSPAWMPAYACSGPSSG 133 T 0.00019 BRCT pdbpercent F Eukaryota T 2dvq 2 D,E P,Q H4_YEAST histone H4 SGRGKGGKGLGXGGA 15 T 11 Shadoo unppercent F Eukaryota T 2dvr 2 D,E P,Q H4_YEAST histone H4 SGRGXGGKGLGXGGA 15 T 11 Shadoo unppercent F Eukaryota T 2dvs 2 D,E P,Q histone H4 LGXGGAKRHRKV 12 T 35 DUF4196 pdbhh F T 2dwf 1 A A PSPB_HUMAN SP-B, 6 KDA PROTEIN, PULMONARY SURFACTANT-ASSOCIATED PROTEOLIPID SPLPHE, 18 KDA PULMONARY-SURFACTANT PROTEIN CWLCRALIKRIQAMIPKGGRMLPQLVCRLVLRCS 34 T 4.2E-12 SapB_2 unppssm F Eukaryota T 2dwx 2 E,F P,Q GGA1_HUMAN hinge peptide from ADP-ribosylation factor binding protein GGA1 SLDGTGWNSFQSS 13 T 3.8 DpnII pdbhh F Eukaryota T 2dx2 1 A A Target Peptide INYWLAHAKAG 11 T 2.4 DUF3717 pdbhh F T 2dx3 1 A A DP5_conformation1 INYWLAHAKAGYIVHWTA 18 T 2 XkdW pdbhh F T 2dyf 2 B B BBC1_YEAST PROTEIN BBC1 GSTAPPLPR 9 T 12 FAA_hydro_N_2 pdbhh F Eukaryota T 2dyh 2 B B NF2L2_MOUSE NF-E2-RELATED FACTOR 2, NFE2-RELATED FACTOR 2, NUCLEAR FACTOR, ERYTHROID DERIVED 2, LIKE 2 ILWRQDIDLGVSREV 15 T 0.19 LicD pdbhh F Eukaryota T 2dzm 1 A A FAF1_HUMAN PROTEIN FAF1, HFAF1 GSSGSSGRMLDFRVEYRDRNVDVVLEDTCTVGEIKQILENELQIPVSKMLLKGWKTGDVEDSTVLKSLHLPKNNSLYVLTPDLPPPSSSSHAGALQESLN 100 T 2.5E-05 YukD pdbhh F Eukaryota T 2e4e 1 A A CHIGNOLIN GYDPATGTFG 10 T 0.06 BA14K pdbhh F T 2e4h 2 B B TBA1B_HUMAN ALPHA-TUBULIN UBIQUITOUS, TUBULIN K-ALPHA-1, ALPHA-TUBULIN 3 GEFSEAREDMAALEKDYEEVGVDSVEGEGEEEGEEY 36 T 1.8 Hrs_helical pdbhh F Eukaryota T 2e50 1 A,B,C,D A,B,P,Q SET_HUMAN SET/TAF-1BETA, PHOSPHATASE 2A INHIBITOR I2PP2A, I-2PP2A, TEMPLATE-ACTIVATING FACTOR I, TAF-I, HLA-DR ASSOCIATED PROTEIN II, PHAPII, INHIBITOR OF GRANZYME A-ACTIVATED DNASE, IGAAD MSAQAAKVSKKELNSNHDGADETSEKEQQEAIEHIDEVQNEIDRLNEQASEEILKVEQKYNKLRQPFFQKRSELIAKIPNFWVTTFVNHPQVSALLGEEDEEAMHYLTRVEVTEFEDIKSGYRIDFYFDENPYFENKVLSKEFHMNESGDPSSKSTEIKWKSGKDMTKRSSQTQNKASRKRQHEEPESFFTWFTDHSDAGADELGEVIKDDIWPNPLQYYLVPDM 225 T 7.5E-06 NAP pdb F Eukaryota T 2e6z 1 A A SPT5H_HUMAN HSPT5, DRB SENSITIVITY-INDUCING FACTOR LARGE SUBUNIT, DSIF LARGE SUBUNIT, DSIF P160, TAT-COTRANSACTIVATOR 1 PROTEIN, TAT-CT1 PROTEIN GSSGSSGFQPGDNVEVCEGELINLQGKILSVDGNKITIMPKHEDLKDMLEFPAQELRKY 59 T 0.00013 KOW pdbpercent F Eukaryota T 2e72 1 A A POGZ_HUMAN Pogo transposable element with ZNF domain GSSGSSGQDGGRKICPRCNAQFRVTEALRGHMCYCCPEMVEYQSGPSSG 49 T 4.8E-05 zf_C2H2_6 pdbhh F Eukaryota T 2e7m 1 A A K0319_HUMAN Protein KIAA0319 GSSGSSGPRTVKELTVSAGDNLIITLPDNEVELKAFVAPAPPVETTYNYEWNLISHPTDYQGEIKQGHKQTLNLSQLSVGLYVFKVTVSSENAFGEGFVNVTVKPARSGPSSG 113 T 0.00052 PKD unppercent F Eukaryota T 2ehp 1 A,B A,B Y1627_AQUAE aq_1627 protein MPAIFTHEGKVEGVPGNYPLTAENLFRIGLALCTLWILDKEIEEPTLSIPETNFVTLALSVGFMNAGGSVNVGKGGDIKLFLQKGEIYVLEFQPLSETDIKKLESILFGRAPIPKKTGEDIGSFKC 126 T 0.071 POB3_N pdbpercent F Bacteria T 2ejy 2 B B GLPC_HUMAN PAS-2', GLYCOPROTEIN BETA, GLPC, GLYCOCONNECTIN, SIALOGLYCOPROTEIN D, GLYCOPHORIN D, GPD, CD236 ANTIGEN DAGDSSRKEYCI 12 T 0.043 Herpes_gE unppercent F Eukaryota T 2era 1 A A 3S1EA_LATSE ERABUTOXIN A RICFNHQGSQPQTTKTCSPGESSCYNKQWSDFRGTIIERGCGCPTVKPGIKLSCCESEVCNN 62 T 0.0034 Toxin_TOLIP pdb F Eukaryota T 2erh 2 B B CEA7_ECOLX Colicin E7 RNKPGKATGKGKPVNNKWLNNAGKDLGSPVPDRIANKLRDKEFKSFDDFRKKFWEEVSKDPELSKQFSRTQNDRMKVGRAPQTRTQDVSGKRQSFELHHEKPISQNGGVYDMDNISVVTPKRHIDIH 127 T 0.0087 HNH pdbpercent F Bacteria T 2esx 1 A A O36236_9HIV1 Envelope polyprotein GP160 TRKSIHIGPGRAFYTTGEI 19 F T Viruses T 2evq 1 A A HP7 KTWNPATGKWTE 12 T 0.52 Collagen_bind_2 pdbhh F T 2ewr 1 A A Q9X0A5_THEMA hypothetical protein TM1012 MGSDKIHHHHHHMIRPEYLRVLRKIYDRLKNEKVNWVVTGSLSFALQGVPVEVHDIDIQTDEEGAYEIERIFSEFVSKKVRFSSTEKICSHFGELIIDGIKVEIMGDIRKRLEDGTWEDPVDLNKYKRFVETHGMKIPVLSLEYEYQAYLKLGRVEKAETLRKWLNERKG 170 T 0.00088 NTP_transf_5 unppercent F Bacteria T 2ezd 3 C A HMGA1_HUMAN HIGH MOBILITY GROUP PROTEIN HMG VPTPKRPRGRPKGSKNKGAAKTRKT 25 T 0.029 AT_hook pdbhh F Eukaryota T 2f3a 1 A A aurein 1.2 analog RLFDKIRQVIRKFX 14 T 7.5 DUF6200 pdbhh F T 2f4l 1 A,B,C,D A,B,C,D Q9WXX3_THEMA acetamidase, putative MGSDKIHHHHHHMKVVPAQRCVYSFSANMAPVEEVYPGEQVVFETLDALGGSYDKIDFSKVNPATGPVFVNGVKPGDTLKVRIKRIELPRRGMIVTGKGFGVLGDEVEGFHTKELEIEKWAVLFDGVRIPIHPMVGVIGVAPQEGEYPTGTAHRHGGNMDTKEITENVTVHLPVFQEGALLALGDVHATMGDGEVCVSACEVPAKVVVEIDVSKEEIKWPVVETNDAYYIIVSLPDIEEALKEVTRETVWFIQRRKTIPFTDAYMLASLSVDVGISQLVNPAKTAKARIPKYIFTGV 297 T 3.4E-14 FmdA_AmdA unppercent F Bacteria T 2f58 3 C P V3 LOOP HIGPGRAFGGG 11 T 0.065 GP120 pdbhh F T 2f69 2 B B TAF10_HUMAN TAF10 peptide, Acetyl-Ser-Lys-Ser-Mlz-Asp-Arg-Lys-Tyr-Thr-Leu XSKSKDRKYTL 11 T 0.0084 TFIID-31kDa unphh F Eukaryota T 2f8e 2 B A Q9DS05_9PICO VPg protein GPYAGPLERQRPLKVKAKLPQAE 23 T 3.7 RNase_HII pdbhh T Viruses T 2fci 1 A B Q32PK0_BOVIN Doubly phosphorylated peptide derived from Syk kinase comprising residues 338-350 XDTEVXESPXADPX 14 T 27 Holin_2-3 pdbhh F Eukaryota T 2fcl 1 A A Q9X0A5_THEMA hypothetical protein TM1012 MGSDKIHHHHHHMIRPEYLRVLRKIYDRLKNEKVNWVVTGSLSFALQGVPVEVHDIDIQTDEEGAYEIERIFSEFVSKKVRFSSTEKICSHFGELIIDGIKVEIMGDIRKRLEDGTWEDPVDLNKYKRFVETHGMKIPVLSLEYEYQAYLKLGRVEKAETLRKWLNERK 169 T 0.00088 NTP_transf_5 unppercent F Bacteria T 2ff3 2 B C WASL_HUMAN N-WASP ADGQESTPPTPAPTS 15 T 0.17 WH2 unppssm F Eukaryota T 2ff4 2 C,D E,F RAD9_YEAST DNA repair protein RAD9 SLEVTEADT 9 T 43 AglB_L1 pdbhh F Eukaryota T 2ffu 2 B P Q63549_RAT 13-Peptide EA2, PTTDSTTPAPTTK PTTDSTTPAPTTK 13 T 39 DUF1263 pdbhh F Eukaryota T 2ffw 1 A A TRI18_HUMAN TRIPARTITE MOTIF PROTEIN 18, PUTATIVE TRANSCRIPTION FACTOR XPRF, MIDIN, RING FINGER PROTEIN 59 QKASVSGPNSPSETRRERAFDANTMTSAEKVLCQFCDQDPAQDAVKTCVTCEVSYCDECLKATHPNKKPFTGHRLIEP 78 T 0.0015 Siva pdbpssm F Eukaryota T 2flu 2 B P NF2L2_HUMAN Nrf2 AFFAQLQLDEETGEFL 16 T 0.18 DUF4585 pdbhh F Eukaryota T 2fmc 1 A A RODL_NEUCR RODLET PROTEIN, CLOCK-CONTROLLED GENE PROTEIN 2, BLUE LIGHT-INDUCED PROTEIN 7, EAS ATTIGPNTCSIDDYKPYCCQSMSGPAGSPGLLNLIPVDLSASLGCVVGVIGSQCGASVKCCKDDVTNTGNSFLIINAANCVA 82 T 0.05 Hydrophobin pdbhh F Eukaryota T 2fns 2 C P NC-P1 SUBSTRATE PEPTIDE RQANFLGKIN 10 T 9.8 Phage_30_3 pdbhh F T 2fo4 3 C P MUC1_HUMAN MUCIN 1, TRANSMEMBRANE, MUC-1, POLYMORPHIC EPITHELIAL MUCIN, PEM, PEMT, EPISIALIN, TUMOR-ASSOCIATED MUCIN, CARCINOMA-ASSOCIATED MUCIN, TUMOR-ASSOCIATED EPITHELIAL MEMBRANE ANTIGEN, EMA, H23AG, PEANUT- REACTIVE URINARY MUCIN, PUM, BREAST CARCINOMA-ASSOCIATED ANTIGEN DF3, CD227 ANTIGEN SAPDFRPL 8 T 2.6 DUF724 pdbhh F Eukaryota T 2fot 2 B C SPTN1_HUMAN SPECTRIN, NON-ERYTHROID ALPHA CHAIN, SPECTRIN ALPHA CHAIN, FODRIN ALPHA CHAIN QQEVYGMMPRDETDSKTASASPWKSARLMVHTVATFNSIKER 42 T 0.13 Spectrin unppercent F Eukaryota T 2fq5 1 A A Peptide 2F XDWLKAFYDKVAEKLKEAFX 20 T 0.08 ApoC-I pdb F T 2fqc 1 A A CJEA_CONPO CONOTOXIN PL14A FPRPRICNLACRAGIGHKYPFCHCRX 26 T 0.3 DUF1181 pdbhh F Eukaryota T 2fr9 1 A A Alpha-conotoxin GI ECCNPACGRHYXC 13 T 0.017 Enterotoxin_ST pdbhh F T 2frb 1 A A Alpha-conotoxin GIA ECCXPACGRHYSC 13 T 0.048 Enterotoxin_ST unphh F T 2fym 2 B,E B,E RNE_ECOLI RNASE E ASPELASGKVWIRYPIVR 18 T 0.18 XisI pdbhh F Bacteria T 2fyy 3 C C EBNA1_EBVB9 EBNA-1 HPVGEADYFEY 11 T 6.4 Sel_put pdbhh T Viruses T 2fzt 1 A,B A,B Q9WZF7_THEMA hypothetical protein TM0693 GMNIDEIERKIDEAIEKEDYETLLSLLNKRKELMEGLPKDKLSEILEKDRKRLEIIEKRKTALFQEINVIREARSSLQK 79 T 0.00012 FliT pdb F Bacteria T 2g1t 2 E,F,G,H E,F,G,H ATP-Peptide Conjugate AEEEIFGEFEAKK 13 T 16 SNN_linker pdbhh F T 2g2f 2 C C ATP-Peptide Conjugate EAIFAAPFAKK 11 T 13 AmoC pdbhh F T 2g30 2 B P ARH_HUMAN AUTOSOMAL RECESSIVE HYPERCHOLESTEROLEMIA PROTEIN, ARH PEPTIDE DDGLDEAFSRLAQSRT 16 T 3.5 AalphaY_MDB pdbhh F Eukaryota T 2g35 2 B B PI51C_HUMAN peptide SWVXSPLH 8 T 4.6 Pox_F15 pdbhh F Eukaryota T 2g46 2 B,D C,D O24165_TOBAC meK27 H3 Peptide GKAPRKQLATKAARKSAPATG 21 T 0.023 PAF unp F Eukaryota T 2g57 1 A A Q0PNE9_RABIT Beta-catenin XKAAVSHWQQQSYLDSGIHSGATTTAPX 28 T 12 AvrPto pdbhh F Eukaryota T 2g5l 2 C,D X,Y (FME)(ASP)(VAL)(GLU)(ALA)(TRP)(LEU) MDVEAWL 7 T 1.1 DUF4276 pdbhh F T 2g6u 1 A A Miniprotein MP2 RCCHPQCGMVEECRK 15 T 0.76 Cys_rich_CWC pdbhh F T 2g80 1 A,B,C,D A,B,C,D ENOPH_YEAST UNKNOWN TRANSCRIPT 4 PROTEIN MGSDKIHHHHHHMVIGQKVLLARIPKMGDNYSTYLLDIEGTVCPISFVKETLFPYFTNKVPQLVQQDTRDSPVSNILSQFHIDNKEQLQAHILELVAKDVKDPILKQLQGYVWAHGYESGQIKAPVYADAIDFIKRKKRVFIYSSGSVKAQKLLFGYVQDPNAPAHDSLDLNSYIDGYFDINTSGKKTETQSYANILRDIGAKASEVLFLSDNPLELDAAAGVGIATGLASRPGNAPVPDGQKYQVYKNFETL 253 T 0.00011 Hydrolase_like unppssm F Eukaryota T 2g83 2 C,D C,D KB-1753 phage display peptide RGYYHGIWVGE 11 T 0.089 Clr2 pdbhh F T 2g9y 1 A,B A,B PPB_ECOLI APASE TPEMPVLENRAAQGDITAPGGARRLTGDQTAALRDSLSDKPAKNIILLIGDGMGDSEITAARNYAEGAGGFFKGIDALPLTGQYTHYALNKKTGKPDYVTDTAASATAWSTGVKTYNGALGVDIHEKDHPTILEMAKAAGLATGNVSTAELQDATPAALVAHVTSRKCYGPSATSEKCPGNALEKGGKGSITEQLLNARADVTLGGGAKTFAETATAGEWQGKTLREQAQARGYQLVSDAASLNSVTEANQQKPLLGLFADGNMPVRWLGPKATYHGNIDKPAVTCTPNPQRNDSVPTLAQMTDKAIELLSKNEKGFFLQVEGASIDKQDHAANPCGQIGETVDLDEAVQRALEFAKKEGNTLVIVTADHAHASQIVAPDTKAPGLTQALNTKDGAVMVMSYGNSEEDSQEHTGSQLRIAAYGPHAANVVGLTDQTDLFYTMKAALGLK 449 T 6.5E-10 Alk_phosphatase pdbpercent F Bacteria T 2gdl 1 A A myeloid antimicrobial peptide 27 LVQRGRFGRFLRKIRRFRPKVTITIQGSARF 31 T 11 Dapper pdbhh F T 2git 3 C,F C,F Transcriptional activator TAX LLFGKPVYV 9 T 0.28 PDU_like pdbhh F T 2gjh 1 A,B A,B DESIGNED PROTEIN MERVRISITARTKKEAEKFAAILIKVFAELGYNDINVTWDGDTVTVEGQLEGGSLEHHHHHH 62 T 0.024 Helicase_RecD pdb F T 2gkw 2 B B BAFF RECEPTOR, B CELL-ACTIVATING FACTOR RECEPTOR, BAFF-R, BLYS RECEPTOR 3, B-CELL MATURATION DEFECT SVPVPATELGSTELVTTKTAGPEQ 24 T 24 Methyltrans_RNA pdbhh F T 2gph 2 B B PTN7_HUMAN PROTEIN-TYROSINE PHOSPHATASE LC-PTP, HEMATOPOIETIC PROTEIN-TYROSINE PHOSPHATASE, HEPTP RLQERRGSNVALMLDC 16 T 6.1 PA_decarbox pdbhh F Eukaryota T 2gpv 2 G,H,I G,H,I Q4RA23_TETNG N-COR2, SILENCING MEDIATOR OF RETINOIC ACID AND THYROID HORMONE RECEPTOR, SMRT, SMRTE, THYROID-, RETINOIC-ACID-RECEPTOR-ASSOCIATED COREPRESSOR, T3 RECEPTOR- ASSOCIATING FACTOR, TRAC, CTG REPEAT PROTEIN 26, SMAP270, TNMGLEAIIRKALMGKYDQWEE 22 T 3.5 RHH_7 pdbhh F Eukaryota T 2gs6 2 B B Peptide AEEEIYGEFEAKK 13 T 12 NCD2 pdbhh F T 2h1c 2 B B FITA_NEIG1 Trafficking protein A VRLGSMLASIGQEIGGVEL 19 T 0.035 PSK_trans_fac unphh F Bacteria T 2h1p 3 C P PA1 GLQYTPSWMLVG 12 T 1.6 Polyoma_coat2 pdbhh F T 2h2f 2 B B P53_HUMAN TUMOR SUPPRESSOR P53, PHOSPHOPROTEIN P53, ANTIGEN NY-CO-13 KKGQSTSRHKKLMFKTEG 18 T 30 Class_IIIsignal pdbhh F Eukaryota T 2h6f 3 C P farnesylated peptide DDPTASACVLS 11 T 1.8 B pdbhh F T 2h7r 1 A A CPXA_PSEPU CAMPHOR 5-MONOOXYGENASE, P450CAM TTETIQSNANLAPLPPHVPEHLVFDFDMYNPSNLSAGVQEAWAVLQESNVPDLVWTRCNGGHWIATRGQLIREAYEDYRHFSSECPFIPREAGEAYDFIPTSMDPPEQRQFRALANQVVGMPVVDKLENRIQELACSLIESLRPQGQCNFTEDYAEPFPIRIFMLLAGLPEEDIPHLKYLTDQMTRPDGSMTFAEAKEALYDYLIPIIEQRRQKPGTDAISIVANGQVNGRPITSDEAKRMCGALLVGGLDTVVNFLSFSMEFLAKSPEHRQELIERPERIPAACEELLRRFSLVADGRILTSDYEFHGVQLKKGDQILLPQMLSGLDERENAAPMHVDFSRQKVSHTTFGHGSHLCLGQHLARREIIVTLKEWLTRIPDFSIAPGAQIQHKSGIVSGVQALPLVWDPATTKAV 414 T 1.6E-05 p450 unppercent F Bacteria T 2h9r 2 C C AKAP5_HUMAN AKAP79(391-412), AKAP75(391-412) LLIETASSLVKNAIQLSIEQLV 22 T 7.4 IpaB_EvcA pdbhh F Eukaryota T 2hdx 2 G,H,I,J,K,L G,H,I,J,K,L JAK2_MOUSE Jak2 protein TPDXELLTEND 11 T 6.5 SPT6_acidic pdbhh F Eukaryota T 2hev 1 A F TNFL4_HUMAN OX40 LIGAND, OX40L, GLYCOPROTEIN GP34, TAX TRANSCRIPTIONALLY-ACTIVATED GLYCOPROTEIN 1, CD252 ANTIGEN GSHMQVSHRYPRIQSIKVQFTEYKKEKGFILTSQKEDEIMKVQDNSVIINCDGFYLISLKGYFSQEVDISLHYQKDEEPLFQLKKVRSVNSLMVASLTYKDKVYLNVTTDNTSLDDFHVNGGELILIHQNPGEFCVL 137 T 0.016 tRNA_NucTran2_2 unppercent F Eukaryota T 2hfr 1 A A CTHL3_CHICK CATHELICIDIN KRFWPLVPVAINTVAAGINLYKAIRRK 27 T 6.2 PsaX pdbhh F Eukaryota T 2hgo 1 A A CASSI_CORCC CASSIICOLIN QTCVSCVNFGNGFCGDNCGNSWACSGC 27 T 0.32 CIAPIN1 pdbhh F Eukaryota T 2hh0 3 C P PRIO_BOVIN Prion protein HGQWNKPSK 9 T 1.1 ACTH_domain pdbhh F Eukaryota T 2hjk 3 C C Q70AA1_9HIV1 Gag protein KGFNPEVIPMF 11 T 4.1E-05 Gag_p24 unphh T Viruses T 2hm3 1 A A Q8IT70_HYDVU Nematocyst outer wall antigen GSQITGTCPSGCSGDCYPECKPGCCGQVNLN 31 T 0.41 DUF6331 pdbhh F Eukaryota T 2hm4 1 A A Q8IT70_HYDVU Nematocyst outer wall antigen GSQITGTCPSGCSGDCYPECPPGCCGQVNLN 31 T 0.39 DUF6331 pdbhh F Eukaryota T 2hm6 1 A A Q8IT70_HYDVU Nematocyst outer wall antigen GSQITGTCPSVCSGDCYPECPPGCCGQVNLN 31 T 1.6 Tme5_EGF_like pdbhh F Eukaryota T 2hmh 2 B B IL6RB_MOUSE IL-6R-BETA, INTERLEUKIN 6 SIGNAL TRANSDUCER, MEMBRANE GLYCOPROTEIN 130, GP130, CD130 ANTIGEN STVEXSTVVHS 11 T 4.9 S19 pdbhh F Eukaryota T 2hn7 3 C C DNA polymerase PEPTIDE HOMOLOGUE AIMPARFYPK 10 T 0.013 DNA_pol_viral_N pdbhh F T 2hqw 2 B B NMDZ1_RAT N-METHYL-D-ASPARTATE RECEPTOR SUBUNIT NR1, NR1C1 PEPTIDE KKKATFRAITSTLASSFKRRRSSK 24 T 14 Neuropeptide_S pdbhh F Eukaryota T 2hrp 3 C,F P,Q HIV-1 PROTEASE PEPTIDE MSLPGRWKPK 10 T 1.1 DUF3304 pdbhh F T 2ht9 2 C X 12-mer peptide LGTENLYFQSME 12 T 6.7 DUF6099 pdbhh F T 2htf 1 A A DPOLM_HUMAN POL MU GTPPSTRFPGVAIYLVEPRMGRSRRAFLTGLARSKGFRVLDACSSEATHVVMEETSAEEAVSWQERRMAAAPPGCTPPALLDISWLTESLGAGQPVPVECRHRLE 105 T 0.0002 BRCT pdbpercent F Eukaryota T 2hu2 2 B B ZN217_HUMAN 9-mer peptide from Zinc finger protein 217 RRTGAPPAL 9 T 48 CCDC84 pdbhh F Eukaryota T 2hug 2 B B SR54C_ARATH SRP54, 54, CHLOROPLAST PROTEIN, 54CP, FFC APPGTARRKRKADS 14 T 7.1 DapB_C pdbhh F Eukaryota T 2hwl 3 E P FIBG_HUMAN Fibrinogen gamma' peptide PAETEXDSLXPEDD 14 T 0.12 DUF3637 pdbhh F Eukaryota T 2hwn 2 E,F E,F Q4R5S0_MACFA A Kinase binding peptide QEELAWKIAKMIVSDVMQQCKK 22 T 2.6 Imm-NTF2-2 pdbhh F Eukaryota T 2hzs 3 I,J,K,L I,J,K,L MED8_YEAST RNA POLYMERASE II TRANSCRIPTIONAL REGULATION MEDIATOR 8 SKPSKPFNVDDVLKFTFTGEKHHHHHH 27 T 13 Tna_leader pdbhh F Eukaryota T 2i1d 1 A A PF11_PIG TRITRP1; PF-1; C6 VRRFPWWWPFLRRX 14 T 1.8 DUF2841 pdbhh F Eukaryota T 2i1e 1 A A 13-mer analogue of Prophenin-1 containing WWW VKKFPWWWPFLKKX 14 T 0.95 DUF2841 pdbhh F T 2i1f 1 A A PF11_PIG 13-mer analogue of Prophenin-1 containing WWW VRRFAWWWAFLRRX 14 T 0.7 DUF6499 pdbhh F Eukaryota T 2i1g 1 A A 13-mer analogue of Prophenin-1 containing WWW VRRYPWWWPYLRRX 14 T 0.9 DUF2841 pdbhh F T 2i1h 1 A A PF11_PIG 13-mer analogue of Prophenin-1 containing WWW VRRFAWWWPFLRRX 14 T 0.14 DUF6264 pdbhh F Eukaryota T 2i1i 1 A A PF11_PIG 13-mer analogue of Prophenin-1 containing WWW VRRFPWWWAFLRRX 14 T 0.58 DUF6499 pdbhh F Eukaryota T 2i7u 1 A,B A,B Four-alpha-helix bundle MKKLREEAAKLFEEWKKLAEEAAKLLEGGGGGGGGELMKLCEEAAKKAEELFKLAEERLKKL 62 T 0.00038 DUF1771 pdb F T 2i94 2 B B RK_BOVIN RK, G PROTEIN-COUPLED RECEPTOR KINASE 1 MDFGSLETVVANSAFIAARGSFDAS 25 T 3 DUF5465 pdbhh F Eukaryota T 2i9m 1 A A MHA6 SAAEAYAKRIAEAMAKG 17 T 2.7 PilA4 pdbhh F T 2i9n 1 A A MHB4A peptide RGKWTYNGITYEGGGGSAAEAYAKRIAEAMAKG 33 T 1.9 DUF4923 pdbhh F T 2i9o 1 A A MHB8A peptide RGKWTYNGITYEGGGGGGGGSAAEAYAKRIAEAMAKG 37 T 3.8 Cytidylate_kin pdbhh F T 2ifi 1 A A Alpha-conotoxin ImI GCCSDARCAWRCX 13 T 0.09 Toxin_8 pdbhh F T 2ifj 1 A A Alpha-conotoxin ImI GCCSDKRCAWRC 12 T 0.066 Toxin_8 pdbhh F T 2ifr 2 B B Octapeptide XFKFXALRX 9 T 53 Root_cap pdbhh F T 2ifz 1 A A Alpha-conotoxin ImI GCCSDKRCAWRCX 13 T 0.089 Toxin_8 pdbhh F T 2ig0 2 B B H4_HUMAN Dimethylated Histone H4-K20 peptide KRHRKVLRDN 10 T 0.27 UPF0137 unp F Eukaryota T 2igr 1 A A Anticancer peptide CB1a KWKVFKKIEKKWKVFKKIEKAGPKWKVFKKIEKX 34 T 0.16 Cecropin pdb F T 2ih6 1 A A Lambda-conotoxin CMrVIA VCCGYPLCHPC 11 T 0.073 RPAP2_Rtr1 pdbhh F T 2ih7 1 A A Lambda-conotoxin CMrVIA VCCGYPLCHPCX 12 T 0.095 RPAP2_Rtr1 pdbhh F T 2ihs 2 C,D C,D VASA1_DROME ANTIGEN MAB46F11 DINNNNNIVEDVERKREFYI 20 T 4 CppA_N pdbhh F Eukaryota T 2ii1 1 A,B,C,D A,B,C,D Q9KGN3_BACHD Acetamidase GMIRLSNENTIFFMDKENVPIASCQSGDTVIFETKDCFSDQITNEEQALTSIDFNRVNPATGPLYVEGARRGDMLEIEILDIKVGKQGVMTAAPGLGALGESLNSPTTKLFPIEGDDVVYSTGLRLPLQPMIGVIGTAPPGEPINNGTPGPHGGNLDTKDIKPGTTVYLPVEVDGALLALGDLHAAMGDGEILICGVEIAGTVTLKVNVKKERMFPLPALKTDTHFMTIASAETLDAAAVQATKNMATFLANRTALSIEEAGMLLSGAGDLYVSQIVNPLKTARFSLALHYFEKLGVDLCN 301 T 1.5E-21 FmdA_AmdA pdbpercent F Bacteria T 2ipu 3 E,F P,Q A4_HUMAN abeta 1-8 peptide DAEFRHDS 8 T 0.0001 Beta-APP unphh F Eukaryota T 2isq 2 B B SAT1_ARATH ATSAT-1, SAT-P, ATSERAT2;1 TEWSDYVI 8 T 0.23 Phage_T4_gp36 pdbhh F Eukaryota T 2itb 1 A,B A,B Q88KV1_PSEPK TRNA-(Ms(2)io(6)a)-hydroxylase, putative GMSLIPEIDAFLGCPTPDAWIEAALADQETLLIDHKNCEFKAASTALSLIAKYNTHLDLINMMSRLAREELVHHEQVLRLMKRRGVPLRPVSAGRYASGLRRLVRAHEPVKLVDTLVVGAFIEARSCERFAALVPHLDEELGRFYHGLLKSEARHYQGYLKLAHNYGDEADIARCVELVRAAEMELIQSPDQELRFHSGIPQALAA 206 T 2.1E-18 MiaE pdbpssm F Bacteria T 2iuh 2 B B KIT_HUMAN C-KIT PHOSPHOTYROSYL PEPTIDE TNEXMDMKPGV 11 T 15 AvrPtoB-E3_ubiq pdbhh F Eukaryota T 2iui 2 C,D C,D PGFRB_HUMAN PDGFR-BETA,BETA PLATELET-DERIVED GROWTH FACTOR RECEPTOR,BETA-TYPE PLATELET-DERIVED GROWTH FACTOR RECEPTOR,CD140 ANTIGEN-LIKE FAMILY MEMBER B,PLATELET-DERIVED GROWTH FACTOR RECEPTOR 1,PDGFR-1 SIDXVPMLDMK 11 T 2.1 DapH_N pdbhh F Eukaryota T 2iv8 2 B,C P,Q ARRB1_HUMAN B-ARRESTIN2 DDDIVFEDFARQRLKGMKDD 20 T 19 Lsm_interact pdbhh F Eukaryota T 2iv9 2 C P EPS15_HUMAN EPS15, PROTEIN EPS15, AF-1P PROTEIN SFGDGFADFSTL 12 T 1.6 Pico_P2B pdbhh F Eukaryota T 2ivf 3 C C Q5P5I2_AROAE ETHYLBENZENE DEHYDROGENASE GAMMA-SUBUNIT MKAKRVPGGKELLLDLDAPIWAGAESTTFEMFPTPLVMVKEVSPFLALSEGHGVIKRLDVAALHNGSMIALRLKWASEKHDKIVDLNSFVDGVGAMFPVARGAQAVTMGATGRPVNAWYWKANANEPMEIVAEGFSAVRRMKDKAGSDLKAVAQHRNGEWNVILCRSMATGDGLAKLQAGGSSKIAFAVWSGGNAERSGRKSYSGEFVDFEILK 214 T 5E-18 EB_dh pdbpercent F Bacteria T 2ivh 1 A A CEA7_ECOLX COLCIN-E7 KPGKATGKGKPVNNKWLNNAGKDLGSPVPDRIANKLRDKEFKSFDDFRKKFWEEVSKDPELSKQFSRNNNDRMKVGKAPKTRTQDVSGKRTSFELHQEKPISQNGGVYDMDNISVVTPKRHIDIHRGK 128 T 0.0014 HNH pdbpercent F Bacteria T 2iy2 1 A,B A,B DSBG_ECOLI DSBG MELPAPVKAIEKQGITIIKTFDAPGGMKGYLGKYQDMGVTIYLTPDGKHAISGYMYNEKGENLSNTLIEKEI 72 T 0.00056 DsbC_N pdbpssm F Bacteria T 2izq 1 A,B,C,D A,B,C,D GRAMICIDIN D XGAXAXVXWXWYFXWXWX 18 T 4.6 MAP17 pdbhh F T 2izx 2 C C AKAP-IS QIEYLAKQIVDNAIQQAK 18 T 0.0037 RII_binding_1 pdb F T 2j04 2 B,D B,D TFC6_YEAST TAU91 MGLLKDLSSARDKIERIYGLNKEKLLLLAKVKEGFETSVFDFPFKNIQPDSPYFVCLDPPCKKESAYNKVIGDKNRTVYHEINKTEFENMIKLRTKRLKLLIGEVDAEVSTGDKIEFPVLANGKRRGFIYNVGGLVTDIAWLNIEENTDIGKDIQYLAVAVSQYMDEPLNEHLEMFDKEKHSSCIQIFKMNTSTLHCVKVQTIVHSFGEVWDLKWHEGCHAPHLVGCLSFVSQEGTINFLEIIDNATDVHVFKMCEKPSLTLSLADSLITTFDFLSPTTVVCGFKNGFVAEFDLTDPEVPSFYDQVHDSYILSVSTAYSDFEDTVVSTVAVDGYFYIFNPKDIATTKTTVSRFRGSNLVPVVYCPQIYSYIYSDGASSLRAVPSRAAFAVHPLVSRETTITAIGVSRLHPMVLAGSADGSLIITNAARRLLHGIKNSSATQKSLRLWKWDYSIKDDKYRIDSSYEVYPLTVNDVSKAKIDAHGINITCTKWNETSAGGKCYAFSNSAGLLTLEYLSLEHHHHHH 524 T 4E-09 Lgl_C unphh F Eukaryota T 2j6o 2 B C CD2_HUMAN T-CELL SURFACE ANTIGEN T11/LEU-5, LFA-2, LFA-3 RECEPTOR, ERYTHROCYTE RECEPTOR, ROSETTE RECEPTOR, CD2 KGPPLPRPRV 10 T 4.6 Caskin-Pro-rich pdbhh F Eukaryota T 2j7x 2 B B NCOA5_HUMAN NCOA-5, COACTIVATOR INDEPENDENT OF AF-2, CIA, NCOA5 HPPAIQSLINLLADNRY 17 T 2.5 HEAT_PBS pdbhh F Eukaryota T 2j8a 1 A A SET1_YEAST COMPASS COMPONENT SET1, SET DOMAIN-CONTAINING PROTEIN 1, SET1 HISTONE METHYLTRANSFERASE MSCEIVVYPAQDSTTTNIQDISIKNYFKKYGEISHFEAFNDPNSALPLHVYLIKYASSDGKINDAAKAAFSAVRKHESSGCFIMGFKFEVILNKHSILNNIISKFVEINVKKLQKLQENLKKAKEKEAENHHHHHH 136 T 0.0005 DUF4618 pdb F Eukaryota T 2jam 2 C D POLYPEPTIDE GVSKFA 6 T 75 Toxin_36 pdbhh F T 2jaz 2 B,D B,D CEA7_ECOLX COLICIN E7 KRNKPGKATGKGKPVNNKWLNNAGKDLGSPVPDRIANKLRDKEFKSFDDFRKKFWEEVSKDPELSKQFSRNNNDRMKVGKAPKTRTQDVSGKRTSFELHHEKPISQNGGVYDMDDISVVTPKRHIDIHRGK 131 T 0.042 HNH pdbpercent F Bacteria T 2jb0 2 B B CEA7_ECOLX COLICIN E7 KRNKPGKATGKGKPVNNKWLNNAGKDLGSPVPDRIANKLRDKEFKSFDDFRKKFWEEVSKDPELSKQFSRNNNDRMKVGKAPKTRTQDVSGKRTSFELHHEKPISQNGGVYDMDNISVVTPKRHIDIARGK 131 T 0.021 HNH pdbpercent F Bacteria T 2jbg 2 B,D B,D CEA7_ECOLX COLICIN-E7 KRNKPGKATGKGKPVNNKWLNNAGKDLGSPVPDRIANKLRDKEFKSFDDFRKKFWEEVSKDPELSKQFSRNNNDRMKVGKAPKTRTQDVSGKRTSFELHHEKPISQNGGVYDMDAISVVTPKRHIDIHRGK 131 T 0.0015 HNH pdbpercent F Bacteria T 2jet 1 A A CTRB1_RAT CHYMOTRYPSINOGEN B CHAIN A MSTQACGVPTIQPVL 15 T 2.2 Zn_ribbon_2 pdbhh F Eukaryota T 2jf9 2 D,E,F P,Q,R AB5 PEPTIDE SPGSREWFKDMLS 13 T 0.56 Sorb pdbhh F T 2jfa 3 C,D P,Q COREPRESSOR PEPTIDE DAFQLRQLILRGLQDD 16 T 9.8 DUF5731 pdbhh F T 2jk9 2 B B PAWR_HUMAN PROSTATE APOPTOSIS RESPONSE 4 PROTEIN, PAR-4 PEPTIDE NELNNNLPGGAPAAP 15 T 15 RTT107_BRCT_6 pdbhh F Eukaryota T 2jmf 2 B B Q32UW5_DROME Neurogenic locus Notch protein GPLGSPNTGAKQPPSYEDCIK 21 T 0.56 TMEM52 pdbhh F Eukaryota T 2jms 1 A A A0FKY4_EUPNO Pheromone En-6 TDPEEHFDPNTNCDYTNSQDAWDYCTNYIVNSSCGEICCNDCFDETGTGACRAQAFGNSCLNW 63 T 0.0036 Euplotes_phero unp F Eukaryota T 2jmv 1 A A SVN_SCYVA SVN GSGPTYCWNEANNPGGPNRCSNNKQCDGARTCSSSGFCQGTSRKPDPGPKGPTYCWDEAKNPGGPNRCSNSKQCDGARTCSSSGFCQGTAGHAAA 95 T 0.0034 EB pdb F Bacteria T 2jmx 2 B B ATPA_BOVIN F1-ATPASE QKTGTAEVSSILEERILGADTSVDL 25 T 76 PspB pdbhh F Eukaryota T 2jmy 1 A A CM15 KWKLFKKIGAVLKVL 15 T 0.2 Melittin pdbhh F T 2jni 1 A A ANN2_AREMA Arenicin-2 RWCVYAYVRIRGVLVRYRRCW 21 T 2.4 Toxin_25 pdbhh F Eukaryota T 2jnr 1 A A VIR165 LEAIPCSIPPCFAFNKPFVF 20 T 0.93 Serpin pdbhh F T 2jnw 2 B B XPA_HUMAN XERODERMA PIGMENTOSUM GROUP A-COMPLEMENTING PROTEIN KIIDTGGGFILEEE 14 T 1.3 SDH_beta pdbhh F Eukaryota T 2jo4 1 A,B,C,D A,B,C,D KIA7 XAKAAAAAIKAIAAIIKAGGYX 22 T 4.4 DUF1726 pdbhh F T 2jo5 1 A,B,C,D A,B,C,D KIA7F XAKAAAAAIKAIAAIIKAGGFX 22 T 4.3 DUF1726 pdbhh F T 2joa 2 B B Peptide H1-C1 DSRIWWV 7 T 0.86 DUF4894 pdbhh F T 2jof 1 A A TRP-CAGE DAYAQWLKDGGPSSGRPPPS 20 T 1.8 Pam17 pdbhh F T 2jog 2 B B NFAT GPHPVIVITGPHEELE 16 T 0.24 Sigma_reg_C pdbhh F T 2jp5 1 A A ATWLPPR peptide ATWLPPR 7 T 17 SBE2 pdbhh F T 2jp6 1 A A KA181_TITOB TOXIN TC32 GSTGPQTTCQAAMCEAGCKGLGKSMESCQGDTCKCKA 37 T 0.0073 Defensin_2 unphh F Eukaryota T 2jp8 1 A P ANGT_HUMAN SERPIN A8, ANGIOTENSINOGEN DRVYIHP 7 T 3.4 PH_RBD pdbhh F Eukaryota T 2jpy 1 A A PHYL2_PHYHY Phylloseptin-2 protein FLSLIPHAINAVSTLVHHFX 20 T 0.0063 Clavanin unp F Eukaryota T 2jq0 1 A A PHYL1_PHYHY PS-1 FLSLIPHAINAVSAIAKHNX 20 T 3.4 BESS unphh F Eukaryota T 2jq1 1 A A PHYL3_PHYHY PS-3 FLSLIPHAINAVSALANHGX 20 T 3.8 BESS unphh F Eukaryota T 2jq9 2 B B CHM1A_HUMAN CHARGED MULTIVESICULAR BODY PROTEIN 1A, CHMP1A, VACUOLAR PROTEIN SORTING 46-1, VPS46-1, HVPS46-1 VRSQEDQLSRRLAALRN 17 T 4.2 VESA1_N pdbhh F Eukaryota T 2jqi 2 B B RAD53_YEAST SERINE-PROTEIN KINASE 1 NITQPTQQST 10 T 11 VlpA_repeat pdbhh F Eukaryota T 2jqk 2 B B CHM2B_HUMAN CHROMATIN-MODIFYING PROTEIN 2B, CHMP2B, CHMP2.5, VACUOLAR PROTEIN SORTING 2-2, VPS2-2, HVPS2- 2 KATISDEEIERQLKALGVD 19 T 0.021 LEM pdb F Eukaryota T 2jqs 1 A A ALLS_DIPPU Allatostatins DRLYSFGLX 9 T 0.094 Carcinustatin pdbhh F Eukaryota T 2jqu 1 A A ALLS_DIPPU Allatostatins GGSLYSFGLX 10 T 0.092 Carcinustatin pdbhh F Eukaryota T 2jqw 1 A A D0VWW5_ODOGR lectin-like peptide YASPKCFRYPNGVLACT 17 T 2 MORN_2 pdbhh F Eukaryota T 2jrv 1 A A PEPTIDE PEP.1 PMTLPENYFSERPYH 15 T 4.4 DUF4524 pdbhh F T 2jrw 1 A A Cyclic extended Pep.1 CAEPMTLPENYFSERPYHPPPPC 23 T 5.8 Tryp_FSAP pdbhh F T 2jsb 1 A A ANN1_AREMA Arenicin-1 RWCVYAYVRVRGVLVRYRRCW 21 T 2.4 Toxin_25 pdbhh F Eukaryota T 2jta 1 A A 10-mer ubiquitin peptide LEDGRTLSDY 10 T 0.011 FERM_f0 pdbhh F T 2jtd 1 A A MYOM1_MOUSE SKELEMIN GSSHHHHHHSSGLVPRGSHMEEEMKRLLALSQEHKFPTVPTKSELAVEILEKGQVRFWMQAEKLSSNAKVSYIFNEKEIFEGPKYKMHIDRNTGIIEMFMEKLQDEDEGTYTFQIQDGKATGHSTLVLIGDVYKKLQKEAEF 142 T 0.00037 V-set pdb F Eukaryota T 2jui 1 A A P71470_LACPN BACTERIOCIN PEPTIDE PLNE FNRGGYNFGKSVRHVVDAIGSVAGIRGILKSIR 33 T 4.7 DHH pdbhh F Bacteria T 2juq 1 A A CA1A_CONRE ALPHA-RGIA GCCSDVRCRYRCR 13 T 0.00026 Toxin_8 unp F Eukaryota T 2jur 1 A A CA1A_CONRE ALPHA-RGIA GCCSEPRCRYRCR 13 T 0.00026 Toxin_8 unp F Eukaryota T 2jus 1 A A CA1A_CONRE ALPHA-RGIA GCCSDPRCRWRCR 13 T 0.00026 Toxin_8 unp F Eukaryota T 2juy 1 A A Neopetrosiamide A FFCPFGCALVDCGPNRPCRDTGFMSCDC 28 T 4.1 Fib_alpha pdbhh F T 2jv8 1 A A Q82V59_NITEU Uncharacterized protein NE1242 MTHHTEVFEGGTIDIEDDTSLTINGKEISYVHDAVKNKWSSRYLPYTQYDSLLDLARAIIRDTVEFSGVKEGS 73 T 0.022 PFU unppercent F Bacteria T 2jve 1 A A A8D0E6_NOTVI Prod 1 MGSSHHHHHHSSGLVPRGSHMALKCFTRNGDDRTVTTCAEEQTRCLFVQLPYSEIQECKTVQQCAEVLEEVTAIGYPAKCCCEDLCNRSEQ 91 T 0.71 Toxin_TOLIP pdbpercent F Eukaryota T 2jvu 1 A A Q08JB9_ECOLX DISPERSIN GGSGWNADNVDPSQCIKQSGVQYTYNSGVSVCMQGLNEGKVRGVSVSGVFYYNDGTTSNFKGVVTPSTPVNTNQDINKTNKVGVQKYRALTEWVGSRSHHHHHH 104 T 0.076 Colicin_M unppercent F Bacteria T 2jw1 2 B B MXID_SHIFL Outer membrane protein mxiD XSETTLLEDEKSLVSYLNY 19 T 17 DUF3512 pdbhh F Bacteria T 2jx6 1 A A DDSK_PHYDS DD K GLWSKIKAAGKEAAKAAAKAAGKAALNAVSEAVX 34 T 0.00011 DD_K unp F Eukaryota T 2jy0 1 A A POLG_HCVCO Protease NS2-3 MDREMAASAGGAVFVGLVLLTLSPHYK 27 T 0.01 HCV_NS2 pdbhh T Viruses T 2jyp 1 A A Q9BP37_HALRU Aragonite protein AP7 TRHSFRRPFHECALCYSITDPGERQRCIDMYCSYTN 36 T 1.8 HMw1_D2 pdbhh F Eukaryota T 2jzi 2 B B PP2BA_HUMAN CALMODULIN-DEPENDENT CALCINEURIN A SUBUNIT ALPHA ISOFORM, CAM-PRP CATALYTIC SUBUNIT ARKEVIRNKIRAIGKMARVFSVLR 24 T 6.1 DUF2626 pdbhh F Eukaryota T 2k00 2 B B LAYN_MOUSE Layilin GRSKESGWVENEIYY 15 T 8.8 Ploopntkinase2 pdbhh F Eukaryota T 2k0f 2 B B MYLK_CHICK 19-MER PEPTIDE FROM TELOKIN; 19-MER PEPTIDE FROM KINASE-RELATED PROTEIN RRKWQKTGHAVRAIGRLSS 19 T 8.4 PACT_coil_coil pdbhh F Eukaryota T 2k13 1 A X D0VWW8_HAEOF Saratin EEREDCWTFYANRKYTDFDKSFKKSSDLDECKKTCFKTEYCYIVFEDTVNKECYYNVVDGEELDQEKFVVDENFTENYLTDCEGKDAGNAAGTGDESDEVDED 103 T 0.00083 PAN_3 pdb F Eukaryota T 2k20 2 B B O54857_RAT PROTEIN TYROSINE PHOSPHATASE AND TENSIN-LIKE PROTEIN DEDQHSQITKV 11 T 15 Invas_SpaK pdbhh F Eukaryota T 2k2f 1 A,B C,D RYR2_RAT Ryanodine receptor 1 peptide KKAVWHKLLSKQ 12 T 2.4 DUF3693 pdbhh F Eukaryota T 2k3u 2 B B C5aR(P7-28S) peptide XTTPDYGHYDDKDTLDLNTPVDKX 24 T 0.16 EAGR_box pdbhh F T 2k6a 1 A A RODL_NEUCR RODLET PROTEIN, CLOCK-CONTROLLED GENE PROTEIN 2, BLUE LIGHT-INDUCED PROTEIN 7 SATTIGPNTCSIDDYKPYCCQSMSGSASLGCVVGVIGSQCGASVKCCKDDVTNTGNSFLIINAANCVA 68 T 0.083 Hydrophobin unphh F Eukaryota T 2k6q 2 B B SQSTM_RAT UBIQUITIN-BINDING PROTEIN P62, PROTEIN KINASE C-ZETA-INTERACTING PROTEIN, PKC-ZETA-INTERACTING PROTEIN MSGGDDDWTHLSSKEVD 17 T 12 EpuA pdbhh F Eukaryota T 2k6r 1 A A Full Sequence Design 1 Synthetic Superstable GQQYTAXIKGRTFRNEKELRDFIEKFXGR 29 T 0.13 SpoVIF pdb F T 2k7l 1 A B CTDP1_HUMAN centFCP1-T584PO4 peptide EDTDEDDHLIYLEEILVRV 19 T 2.5 Es2 pdbhh F Eukaryota T 2k84 1 A A GAG_EIAVY P9 LYPDLSEIKKEYNVKEKDQVEDLNLDSLWE 30 T 8.3 LSPR pdbhh T Viruses T 2k8j 1 A X POLG_HCVJA p7tm2 RLVPGAAYALYGVWPLLLLLLALPPRAYA 29 T 9.1 DUF2244 pdbhh T Viruses T 2k8q 1 A A SHQ1_YEAST SMALL NUCLEOLAR RNAS OF THE BOX H/ACA FAMILY QUANTITATIVE ACCUMULATION PROTEIN 1 GITPRFSITQDEEFIFLKIFISNIRFSAVGLEIIIQENMIIFHLSPYYLRLRFPHELIDDERSTAQYDSKDECINVKVAKLNKNEYFEDLDLPTKLLARQGDLAGADALTENTDAKKTQKPLIQEVETDGVSNN 134 T 1.7E-05 PIH1_CS pdbhh F Eukaryota T 2k9e 1 A A K1A_STIHL KAPPA-SHTX-SHE3A,POTASSIUM CHANNEL TOXIN SHK XXRSCIDTIPKSRCTAFQCKHSXKYRLSFCRKTCGTCX 38 T 0.0045 ShK unp F Eukaryota T 2k9u 2 B B FBLI1_HUMAN FBLP-1, MITOGEN-INDUCIBLE 2-INTERACTING PROTEIN, MIG2-INTERACTING PROTEIN, MIGFILIN MASKPEKRVASSVFITLAPPRRDV 24 T 11 Pox_A3L pdbhh F Eukaryota T 2kaa 1 A A P78696_HIRTH HTA APIVTCRPKLDGREKPFKVDVATAQAQARKAGLTTGKSGDPHRYFAGDHIRWGVNNCDKADAILWEYPIYWVGKNAEWAKDVKTSQQKGGPTPIRVVYANSRGAVQYCGVMTHSKVDKNNQGKEFFEKCD 130 T 0.02 GLEYA pdb F Eukaryota T 2kb9 1 A A JAG1_HUMAN JAGGED1, HJ1 RCQYGWQGLYCDKCIPHPGCVHGICNEPWQCLCETNWGGQLCDK 44 T 0.0036 hEGF pdbhh F Eukaryota T 2kbb 1 A A TLN1_MOUSE Talin-1 GIDPFTAPGQLECETAIAALNSCLRDLDQASLAAVSQQLAPREGISQEALHTQMLTAVQEISHLIEPLASAARAEASQLGHKVSQMAQYFEPLTLAAVGAASKTLSHPQQMALLDQTKTLAESALQLLYTAKEAGGNPKQAAHTQEALEEAVQMMTEAVEDLTTTLNEAASAAG 174 T 0.00049 I_LWEQ unppssm F Eukaryota T 2kbq 1 A A USH1C_HUMAN USHER SYNDROME TYPE-1C PROTEIN, AUTOIMMUNE ENTEROPATHY-RELATED ANTIGEN AIE-75, ANTIGEN NY-CO-38/NY-CO-37, PDZ-73 PROTEIN, RENAL CARCINOMA ANTIGEN NY-REN-3 MDRKVAREFRHKVDFLIENDAEKDYLYDVLRMYHQTMDVAVLVGDLKLVINEPSRLPLFDAIRPLIPLKHQVEYDQLTPR 80 T 0.0072 DUF3567 pdb F Eukaryota T 2kbr 2 B B CAD23_HUMAN OTOCADHERIN DDDRYLREAIQEYDNIAK 18 T 27 DUF2686 pdbhh F Eukaryota T 2kc6 1 A A MEN1_EUPNO Mating pheromone En-1 NPEDWFTPDTCAYGDSNTAWTTCTTPGQTCYTCCSSCFDVVGEQACQMSAQC 52 T 41 eIF3g pdbhh F Eukaryota T 2kdq 1 A A L-22 CYCLIC PEPTIDE RVRTRKGRRIRIXP 14 T 0.24 DUF2835 pdbhh F T 2kdr 1 A X POLG_HCVH NS4B, P27 SDAAARVTAILSSLTVTQLLRRLHQWIS 28 T 14 SbcD_C pdbhh T Viruses T 2kdu 2 B B UN13A_RAT MUNC13-1 GSRAKANWLRAFNKVRMQLQEARGEGEMSKSLWFKG 36 T 18 MgrB pdbhh F Eukaryota T 2keg 1 A A P71460_LACPN BACTERIOCIN PLNK RRSRKNGIGYAIGYAFGAVERAVLGGSRDYNK 32 T 0.003 Bacteriocin_IIc unppssm F Bacteria T 2keq 1 A A B2J066_NOSP7;B2J821_NOSP7 DNA polymerase III alpha subunit, Nucleic acid binding OB-fold tRNA/helicase-type GGALSYETEILTVEYGLLPIGKIVEKRIECTVYSVDNNGNIYTQPVAQWHDRGEQEVFEYCLEDGSLIRATKDHKFMTVDGQMLPIDEIFERELDLMRVDNLPNIKIATRKYLGKQNVYDIGVERDHNFALKNGFIASN 139 T 4.8E-07 Intein_splicing pdbhh F Bacteria T 2ket 1 A A CTHL6_BOVIN ANTIBACTERIAL PEPTIDE BMAP-27, MYELOID ANTIBACTERIAL PEPTIDE 27 GRFKRFRKKFKKLFKKLSPVIPLLHLX 27 T 0.21 Stomoxyn pdb F Eukaryota T 2kfe 1 A A meucin-24 GRGREFMSNLKEKLSGVKEKMKNS 24 T 1.4 DUF5398 pdbhh F T 2kff 2 B B Rab11-FIP2 NPF peptide FNYESTNPFTAK FNYESTNPFTAK 12 T 6.1 DUF3729 pdbhh F T 2kfg 2 B B Rab11-FIP2 DPF peptide FNYESTDPFTAK FNYESTDPFTAK 12 T 4.8 SsgA pdbhh F T 2kfh 2 B B Rab11-FIP2 GPF peptide FNYESTGPFTAK FNYESTGPFTAK 12 T 9.9 DUF5973 pdbhh F T 2kft 2 B B Histone H3 ARTKQTARKSTGGKAPRKQLC 21 T 0.44 Histone pdbhh F T 2kgn 1 A A STE5_YEAST Protein STE5 PLSRGKKWTEKLARFQRSSAKKKR 24 T 41 DUF3579 pdbhh F Eukaryota T 2khf 1 A A P71461_LACPN BACTERIOCIN PLNJ, BACTERIOCIN PEPTIDE PLNJ GAWKNFWSSLRKGFYDGEAGRAIRR 25 T 0.004 ComC unphh F Bacteria T 2khv 1 A A Q2YAJ6_NITMU Phage integrase MTFSECAALYIKAHRSSWKNTKHADQWTNTIKTYCGPVIGPLSVQDVDTKLIMKVLDPIWEQKPETASRLRGRIESVLDWATVRGYREGDNPARWRGYLEHHHHHH 106 T 0.24 Toxin_5 pdbpssm F Bacteria T 2ki0 1 A A DS119 GSGQVRTIWVGGTPEELKKLKEEAKKANIRVTFWGD 36 T 0.029 Alpha-amylase pdbhh F T 2kib 1 A,B,C,D,E,F,G,H A,B,C,D,E,F,G,H NFGAIL segment from human islet amyloid polypeptide NFGAILS 7 T 3.9 SidC_N pdbhh F T 2kik 1 A,B A,B Artificial diiron protein XDYLRELLKGELQGIKQYREALEYTHNPVLAKILEDEEKHIEWLETILGX 50 T 0.00069 COQ7 pdbhh F T 2kj9 1 A A Q6D355_PECAS Integrase KSVQEKRNNTRAFKTVAKSWFATKTTWSEDYQRSVWTRLETYLFPDIGNKDIAELDTGDLLVPIKKIEKLGYLEIAMRVKQYATAIMRYAVQQKMIRFNPAYDLEGAVQKLEHHHHHH 118 T 0.015 Phage_int_SAM_3 pdbpercent F Bacteria T 2kjy 1 A A MYPT1_HUMAN MYOSIN PHOSPHATASE-TARGETING SUBUNIT 1, MYOSIN PHOSPHATASE TARGET SUBUNIT 1, PROTEIN PHOSPHATASE MYOSIN-BINDING SUBUNIT GPMSTTEVRERRRSYLTPVRDEESESQRKARSRQARQSRRSTQGVTLTDLQEAEKTIGRS 60 T 0.00014 DUF4695 pdb F Eukaryota T 2kk2 1 A A C4NXD5_EUPNO En-A1 YNPEDDYTPLTCPHTISVVWYECTENTANCGTACCDSCFELTGNTMCLLQAGAAGSGCDME 61 T 0.058 AWS pdb F Eukaryota T 2kke 1 A,B A,B O26567_METTH Uncharacterized protein MVGRRPGGGLKDTKPVVVRLYPDEIEALKSRVPANTSMSAYIRRIILNHLEDE 53 T 0.00052 DUF6290 pdb F Archaea T 2kl8 1 A A OR15 MEMDIRFRGDDLEAFEKALKEMIRQARKFAGTVTYTLDGNDLEIRITGVPEQVRKELAKEAERLAKEFNITVTYTIRLEHHHHHH 85 T 0.0033 DUF2067 pdb F T 2km9 1 A A omega_conotoxin-FVIA CKGTGKSCSRIAYNCCTGSCRSGKC 25 T 0.00087 Conotoxin pdbhh F T 2kna 1 A A XIAP_HUMAN E3 UBIQUITIN-PROTEIN LIGASE XIAP, INHIBITOR OF APOPTOSIS PROTEIN 3, X-LINKED INHIBITOR OF APOPTOSIS PROTEIN, X-LINKED IAP, IAP-LIKE PROTEIN, HILP GSAMADIGSEFEKTPSLTRRIDDTIFQNPMVQEAIRMGFSFKDIKKIMEEKIQISGSNYKSLEVLVADLVNAQKDSMQDESSQTSLQKEISTEEQLRRLQEEKL 104 T 0.022 Baculo_RING unphh F Eukaryota T 2knh 2 B B HTF4_HUMAN TRANSCRIPTION FACTOR HTF-4, E-BOX-BINDING PROTEIN, DNA-BINDING PROTEIN HTF4 IGTDKELSDLLDFSAMFS 18 T 9.6 HSV_VP16_C pdbhh F Eukaryota T 2knj 1 A A MPSIN_RHIMP Microplusin preprotein HHQELCTKGDDALVTELECIRLRISPETNAAFDNAVQQLNCLNRACAYRKMCATNNLEQAMSVYFTNEQIKEIHDAATACDPEAHHEHDH 90 T 0.037 zf-C2H2_aberr pdbpssm F Eukaryota T 2knp 1 A A D0VWX1_MOMCO MCoCC-1 GCEGKQCGLFRSCGGGCRCWPTVTPGVGICSSS 33 T 0.00057 Albumin_I pdbhh F Eukaryota T 2kon 1 A A Q7NW74_CHRVO Uncharacterized protein MNVAHYRGYEIEPGHQYRDDIRKYVPYALIRKVGVPDRTPIPTTYPEFYDLEADAERVSIACAKIIIDSHLDRHDQGLADLG 82 T 0.12 Sel_put pdbpssm F Bacteria T 2koz 1 A A nasonin-1 ACNDRDCSLDCIMKGYNTGSCVRGSCQCRRTSG 33 T 0.00035 Toxin_2 pdbpercent F T 2kp0 1 A A nasonin-1M ACNDRDCSLDCIMKGYNFGKCVRGSCQCRRTSG 33 T 0.00047 Toxin_2 pdbpercent F T 2kpa 1 A A ARNO(375-400) VSVDPFYEMLAARKKRISVKKKQEQP 26 T 0.87 KIF1B pdbhh F T 2kpl 2 B B VE6_HPV16 E6CT RSSRTRRETQV 11 T 0.34 FpoO unphh T Viruses T 2kpz 2 B B PRO_HTL1L PR76GAG-PRO, MATRIX PROTEIN P19, MA, SDPQIPPPYVEP 12 T 3.9 RAM pdbhh T Viruses T 2kq0 2 B B VP40_EBOZM MEMBRANE-ASSOCIATED PROTEIN VP40 ILPTAPPEYMEA 12 T 0.96 STAT1_TAZ2bind pdbhh T Viruses T 2kq6 1 A A PKD2_HUMAN POLYCYSTIC KIDNEY DISEASE 2 PROTEIN, AUTOSOMAL DOMINANT POLYCYSTIC KIDNEY DISEASE TYPE II PROTEIN, POLYCYSTWIN, R48321 NTVDDISESLRQGGGKLNFDELRQDLKGKGHTDAEIEAIFTKYDQDGDQELTEHEHQQMRDDLEKEREDLDLDHSSLP 78 T 0.00016 EF-hand_8 pdbpercent F Eukaryota T 2kqf 2 B B Q8JJY9_9RHAB C-terminal motif from Glycoprotein SWESHKSGGETRL 13 T 3.5 DUF5052 unphh T Viruses T 2kqr 1 A A SYNC_BRUMA ASPARAGINE--TRNA LIGASE, ASNRS, POTENTIALLY PROTECTIVE 63 KDA ANTIGEN GSMTVYICPETGDDGNDGSELKPLRTLYQAMIITKSSKGDFLIRTKKDGKQVWEAASKTALKKSWKRYEQEMLKNEKVAAKMLEKDATEVGVKAALEEAKKVQIELDTSLSYI 113 T 0.00065 DUF1565 pdbpssm F Eukaryota T 2kqs 2 B B DAXX_HUMAN DAXX, HDAXX, FAS DEATH DOMAIN-ASSOCIATED PROTEIN, ETS1-ASSOCIATED PROTEIN 1, EAP1 GSKTSVATQCDPEEIIVLSDSD 22 T 14 TMEM169 pdbhh F Eukaryota T 2ksp 2 B B MILK1_HUMAN MOLECULE INTERACTING WITH RAB13, MIRAB13 LESKPYNPFEEEEED 15 T 0.0047 NPF pdbhh F Eukaryota T 2ksw 1 A A O96050_ORYRH Oryctin VPVGSDCEPKLCTMDLVPHCFLNPEKGIVVVHGGCALSKYKCQNPNHEKLGYTHECEEAIKNAPRP 66 T 1.2 DUF5437 unphh F Eukaryota T 2kub 1 A A FAP1_STRPA Fimbriae-associated protein Fap1 ENLDKMISEAEVLNDMAARKLITLDAEQQLELMKSLVATQSQLEATKNLIGDPNATVADLQIAYTTLGNNTQALGNELIKL 81 T 0.0082 FIVAR pdbpssm F Bacteria T 2kup 2 B B ALK_HUMAN HALK, ANAPLASTIC LYMPHOMA KINASE LFRLRHFPCGNVNYGYQQQ 19 T 0.4 Ntox44 pdbhh F Eukaryota T 2kvm 2 B B histone H3 peptide (residues 15-30) with dimethylated lysine 27 APRKQLATKAARKSAP 16 T 16 Rsc14 pdbhh F T 2kwf 2 B B ITF2_HUMAN TCF-4, IMMUNOGLOBULIN TRANSCRIPTION FACTOR 2, ITF-2, SL3-3 ENHANCER FACTOR 2, SEF-2, CLASS B BASIC HELIX-LOOP-HELIX PROTEIN 19, BHLHB19 GSGTDKELSDLLDFSAMFS 19 T 6.3 HSV_VP16_C pdbhh F Eukaryota T 2kwh 1 A A RBP1_HUMAN RALBP1, RAL-INTERACTING PROTEIN 1, 76 KDA RAL-INTERACTING PROTEIN, DINITROPHENYL S-GLUTATHIONE ATPASE, DNP-SG ATPASE GSETQAGIKEEIRRQEFLLNSLHRDLQGGIKDLSKEERLWEVQRILTALKRKLREA 56 T 0.0087 SAB pdbpssm F Eukaryota T 2kwn 1 A B H4_HUMAN Histone peptide GLGKGGAXRHRKVLR 15 T 0.27 UPF0137 unp F Eukaryota T 2kwo 1 A B H4_HUMAN Histone peptide XGRGKGGKGLGKGGAKRHRK 20 T 11 Shadoo unppercent F Eukaryota T 2kwu 1 A A POLI_MOUSE RAD30 HOMOLOG B GSPEFDSAEEKLPFPPDIDPQVFYELPEEVQKELMAEWERAGAARPSAHR 50 T 0.0011 UBM pdbhh F Eukaryota T 2kwv 1 A A POLI_MOUSE RAD30 HOMOLOG B GSDTSDLPLQALPEGVDQEVFKQLPADIQEEILSGKSRENLKGKGSLS 48 T 0.00043 UBM pdb F Eukaryota T 2kx5 2 B B Cyclic peptide mimetic of Tat protein RVRCRQRKGRRICIRIXP 18 T 0.79 DUF3877 pdbhh F T 2kxe 1 A A DP2S_PYRHO DP1 SUBUNIT, POL II GSHMDEFVKGLMKNGYLITPSAYYLLVGHFNEGKFSLIELIKFAKSRETFIIDDEIANEFLKSIGAEVELPQEIK 75 T 0.031 Leu_Phe_trans pdb F Archaea T 2kxh 2 B B FUBP1_HUMAN FUSE-BINDING PROTEIN 1, FBP, DNA HELICASE V, HDH V GAMGYVNDAFKDALQRARQIAAKIGGDAGTS 31 T 23 DUF4312 pdbhh F Eukaryota T 2ky5 1 A A PECA1_HUMAN PECAM-1, ENDOCAM, GPIIA', PECA1 GSSDVQYTEVQVSSAESHKDLGKKDTETVYSEVRKAVPDAVESRYSRTEGSLDGT 55 T 0.11 Shisa unp F Eukaryota T 2kyg 2 C C MTG8_HUMAN PROTEIN MTG8, PROTEIN ETO, EIGHT TWENTY ONE PROTEIN, CYCLIN-D-RELATED PROTEIN, ZINC FINGER MYND DOMAIN-CONTAINING PROTEIN 2 AMADIGSASGYVPEEIWKKAEEAVNEVKRQAMTELQKA 38 T 0.038 DUF3731 unp F Eukaryota T 2kyj 1 A A TXS2B_LIOWA LITX DFPLSKEYESCVRPRKCKPPLKCNKAQICVDPNKGW 36 T 0.6 IL8 pdbhh F Eukaryota T 2kyl 2 B B PTEN_HUMAN C-TERMINUS OF PTEN PFDEDQHTQITKV 13 T 2.1 Surface_antigen pdbhh F Eukaryota T 2kym 2 B B STE20_YEAST Peptide form Serine/threonine-protein kinase STE20 GKFIPSRPAPKPPSSA 16 T 0.00039 TFIIA unppssm F Eukaryota T 2kzu 2 B B RASF1_HUMAN RAS ASSOCIATION (RALGDS/AF-6) DOMAIN FAMILY 1, ISOFORM CRA_A GSQEDSDSELEQYFTARW 18 T 0.76 HSV_VP16_C pdbhh F Eukaryota T 2l07 1 A A BRAZZEIN DCKRKVYPNGSISDYCEY 18 T 1.1 EBA-175_VI pdbhh F T 2l0l 1 A A Oxidoreductase that catalyzes reoxidation of DsbA protein disulfide isomerase I KKLSIYERVALFGVLGAALIGAIAPKK 27 T 1.7E-05 DsbB pdbhh F T 2l0n 1 A A Oxidoreductase that catalyzes reoxidation of DsbA protein disulfide isomerase I KKRYVAMVIWLYSAFRGVQLTYEHTMLQKK 30 T 0.012 DsbB pdbpssm F T 2l2y 1 A A THCL_STRAJ ALANINAMIDE, BRYAMYCIN, GARGON, THIACTIN XIAXASXTXCXXTXXXXX 18 T 1.3 DUF4803 pdbhh F Bacteria T 2l3i 1 A A TOP4A_OXYTA AOXKI4A, antimicrobial peptide in spider venom GIRCPKSWKCKAFKQRVLKRLLAMLRQHAF 30 T 3.2 DUF2615 pdbhh F Eukaryota T 2l3n 1 A A RAP1_SCHPO;TAZ1_SCHPO DNA-binding protein rap1,Telomere length regulator taz1 SVSILRSSVNHREVDEAIDNILRYTNSTEQQFLEAMESTGGRVRIAIAKLLSKQTSGGSGGSKLGGSGGSRKDLSVKGMLYDSDSQQILNRLRERVSGSTAQSA 104 T 0.34 HYPK_UBA pdbhh F Eukaryota T 2l4t 2 B B Glutaminase L peptide KENLESMV 8 T 22 DUF1128 pdbhh F T 2l56 1 A A General control protein GCN4 XNYHLENEVARLKKLVGX 18 T 0.039 VGPC1_C pdbhh F T 2l5e 2 B B GATA1_MOUSE GATA-1 KASGXGKXKRGSN 13 T 29 DUF1087 pdbhh F Eukaryota T 2l5r 1 A A H0USY4_ALYOB Antimicrobial peptide Alyteserin-1C GLKEIFKAGLGSLVKGIAAHVAS 23 T 0.094 Bombinin pdbhh F Eukaryota T 2l6e 2 B B NYAD-13 stapled peptide inhibitor ITFXDLLXYYGKKK 14 T 6.5 YaaC pdbhh F T 2l6s 1 A A VIR-576 LEAIPCSIPPEFLFGKPFVF 20 T 3 DUF5759 pdbhh F T 2l7l 2 B B KCC1A_RAT CAM KINASE I, CAM-KI, CAM KINASE I ALPHA, CAMKI-ALPHA AKSKWKQAFNATAVVRHMRKLQ 22 T 0.12 Tyrosinase unp F Eukaryota T 2l7t 1 A A USH1G_HUMAN 11-MER PEPTIDE FROM USHER SYNDROME TYPE-1G PROTEIN, SANS EELPWDELDLG 11 T 0.61 DUF4099 pdbhh F Eukaryota T 2l87 1 A A CCR5_HUMAN CCR5, C-C CKR-5, CC-CKR-5, CCR5, CHEMR13, HIV-1 FUSION CORECEPTOR MDYQVSSPIYDINYYTSEPAQKINVKQ 27 T 6 Polysacc_syn_2C pdbhh F Eukaryota T 2l8j 2 B B NBR1_HUMAN NBR1-LIR peptide GAMGSASSEDYIIILPES 18 T 0.2 CENP-B_dimeris unppercent F Eukaryota T 2l9x 1 A A Uncharacterized protein GNAACVIGCIGSCVISEGIGSLVGTAFXLG 30 T 0.95 Bacteriocin_IIc unphh F T 2la0 1 A A Uncharacterized protein GWVACVGACGTVCLASGGVGTEFAAASXFL 30 T 0.4 Herpes_US9 pdbhh F T 2laj 2 B B SMAD3_HUMAN MAD HOMOLOG 3, MAD3, MOTHERS AGAINST DPP HOMOLOG 3, HMAD-3, JV15-2, SMAD FAMILY MEMBER 3, SMAD 3, SMAD3, HSMAD3 AGSPNLSPNP 10 T 2.5 DUF1930 pdbhh F Eukaryota T 2law 2 B B SMAD1_HUMAN MAD HOMOLOG 1, MOTHERS AGAINST DPP HOMOLOG 1, JV4-1, MAD-RELATED PROTEIN 1, SMAD FAMILY MEMBER 1, SMAD 1, SMAD1, HSMAD1, TRANSFORMING GROWTH FACTOR-BETA-SIGNALING PROTEIN 1, BSP-1 TPPPAYLPPEDP 12 T 0.72 Myc_target_1 pdbhh F Eukaryota T 2laz 2 B B SMAD1_HUMAN MAD HOMOLOG 1, MOTHERS AGAINST DPP HOMOLOG 1, JV4-1, MAD-RELATED PROTEIN 1, SMAD FAMILY MEMBER 1, SMAD 1, SMAD1, HSMAD1, TRANSFORMING GROWTH FACTOR-BETA-SIGNALING PROTEIN 1, BSP-1 SDPGSPFQ 8 T 4.4 DUF5667 pdbhh F Eukaryota T 2lb0 2 B B SMAD1_HUMAN MAD HOMOLOG 1, MOTHERS AGAINST DPP HOMOLOG 1, JV4-1, MAD-RELATED PROTEIN 1, SMAD FAMILY MEMBER 1, SMAD 1, SMAD1, HSMAD1, TRANSFORMING GROWTH FACTOR-BETA-SIGNALING PROTEIN 1, BSP-1 TSSDPGSPFQ 10 T 2 DUF3297 pdbhh F Eukaryota T 2lb1 2 B B SMAD1_HUMAN MAD HOMOLOG 1, MOTHERS AGAINST DPP HOMOLOG 1, JV4-1, MAD-RELATED PROTEIN 1, SMAD FAMILY MEMBER 1, SMAD 1, SMAD1, HSMAD1, TRANSFORMING GROWTH FACTOR-BETA-SIGNALING PROTEIN 1, BSP-1 ADTPPPAYLPPEDPX 15 T 2.5 Myc_target_1 pdbhh F Eukaryota T 2lb2 2 B B SMAD3_HUMAN MAD HOMOLOG 3, MAD3, MOTHERS AGAINST DPP HOMOLOG 3, HMAD-3, JV15-2, SMAD FAMILY MEMBER 3, SMAD 3, SMAD3, HSMAD3 ETPPPGYLSEDG 12 T 0.85 Gsf2 pdbhh F Eukaryota T 2lcn 1 A A WALP19-P10 peptide XGWWLALALAPALALALWWAX 21 T 1.4 DUF4381 pdbhh F T 2lco 1 A A WALP19-P8 peptide XGWWLALAPALALALALWWAX 21 T 1.3 DUF4381 pdbhh F T 2lct 2 B B KSYK_MOUSE SPLEEN TYROSINE KINASE DTEVXESPXADPE 13 T 23 Holin_2-3 pdbhh F Eukaryota T 2lcu 1 A A H3JQU2_BABCA Bc28.1 SSGIEGCTEDEKRDSVVEGATSVEASLKEQIDWLAERYSADLTNKDTSKWNTDEKVKELLNEKAVGIESRLLAIAKEFHKLKSVLCTGVNETPAHVANRVSPGDAISMLYVLSITHRELSSLKNKIDEWKKVKASEDGTKVIQNIKDDRTNTWFVAHGFKVAELNDVTLEKLATVVNELVSHKDMIYINDAMKQNVDKWTKEESERLAMMAEQGISGAKGKKD 223 T 0.71 ERp29 pdb F Eukaryota T 2ld0 1 A A HD_HUMAN HUNTINGTON DISEASE PROTEIN, HD PROTEIN MATLEKLMKAFESLKSFX 18 T 2 Mito_fiss_reg unphh F Eukaryota T 2ld3 1 A A MYO6_MOUSE Myosin VI QGPGSLVKVGTLKKRLDKFNEVVSALKDGKPEVNRQIKNLEISIDALMAKIKSTMMTREQIQKEYDALVKSSEDLLSALQKKKQQEEE 88 T 0.00048 XhlA pdb F Eukaryota T 2ldj 1 A A Trp-Cage mini-protein NLYIQWLKDXGPSSGRPPPS 20 T 0.12 NDUF_B6 pdbhh F T 2lds 1 A A LAIT1_LIOAU Insecticidal toxin LaIT1 DFPLSKEYETCVRPRKCQPPLKCNKAQICVDPKKGW 36 T 0.63 IL8 pdbhh F Eukaryota T 2le2 1 A,B A,B P56_BPPH2 P56 MVQNDFVDSYDVTMLLQDDDGKQYYEYHKGLSLSDFEVLYGNTADEIIKLRLDKVL 56 T 0.55 GhoS unphh T Viruses T 2ler 1 A A CUGA_CONPB Conotoxin pc16a SCSCKRNFLCC 11 T 1.4 Argos pdbhh F Eukaryota T 2lfk 1 A A Q1EG59_RHIAP Tryptase inhibitor GDKEECTVPIGWSEPVKGLCKARFTRYYCMGNCCKVYEGCYTGGYSRMGECARNCPG 57 T 0.03 DUF3788 unppssm F Eukaryota T 2lfn 1 A A RODL_NEUCR BLUE LIGHT-INDUCED PROTEIN 7, CLOCK-CONTROLLED GENE PROTEIN 2, RODLET PROTEIN SATTIGPNTCSIDDYKPYCCQSMSGSASLGCVVGVIGSQCGASVKCCKDDVTNTGNSGLIINAANCVA 68 T 0.083 Hydrophobin unphh F Eukaryota T 2lg7 1 A A A6L9X6_PARD8 Uncharacterized protein GDDDEPGGKGAMYEVTIEQSGDFRSFIKSVVVVANGTQLKDGATGESLASPVILSDEELAVEKVTLSTTGKAIEFAVSGGVVDGEDGVVNEPMQWVVTVYKNGKEIEKKSLVFRDGKEISTDDLNLYYN 129 T 0.00015 PLCC unp F Bacteria T 2lgf 2 B B LYAM1_HUMAN CD62 ANTIGEN-LIKE FAMILY MEMBER L, LEUKOCYTE ADHESION MOLECULE 1, LAM-1, LEUKOCYTE SURFACE ANTIGEN LEU-8, LEUKOCYTE-ENDOTHELIAL CELL ADHESION MOLECULE 1, LECAM1, LYMPH NODE HOMING RECEPTOR, TQ1, GP90-MEL AFIIWLARRLKKGKK 15 T 0.38 MWFE unp F Eukaryota T 2lhr 1 A A ISDH_STAAW HAPTOGLOBIN RECEPTOR A, STAPHYLOCOCCUS AUREUS SURFACE PROTEIN I SDDYVDEETYNLQKLLAPYHKAKTLERQVYELEKLQEKLPEKYKAEYKKKLDQTRVELADQVKSAVTEFENVTPTNDQ 78 T 0.0033 Tropomyosin pdbpssm F Bacteria T 2lht 1 A A A8W3P3_VENIN Cellophane-induced protein 1 ADVFDPPTQYGYDGKPLDASFCRTAGSREKDCRKDVQACDKKYDDQGRETACAKGIREKYKPAVVYGYDGKPLDLGFCTLAGIREVDCRKDAQTCDKKYESDKCLNAIKEKYKPVVDPNPPA 122 T 0.14 Brr6_like_C_C pdbpercent F Eukaryota T 2li3 1 A A KA20X_TITTR Potassium channel toxin kappa-KTX3.1 GSGCMPEYCAGQCRGKVSQDYCLKNCRCIR 30 T 13 Yeast_MT pdbhh F Eukaryota T 2li5 2 B B ATG7_YEAST ATG7C30, ATG12-ACTIVATING ENZYME E1 ATG7, AUTOPHAGY-RELATED PROTEIN 7, CYTOPLASM TO VACUOLE TARGETING PROTEIN 2 GPHMISGLSVIKQEVERLGNDVFEWEDDESDEIA 34 T 0.12 VMAP-M18 pdb F Eukaryota T 2lid 1 A A Vitellogenin EHKHSDESTSESFESIADNNDDSYFQRKPKLTEAP 35 T 24 KIP1 pdbhh F T 2lk9 1 A A BST2_HUMAN BST-2, HM1.24 ANTIGEN, TETHERIN KRSKLLLGIGILVLLIIVILGVPLIIFTIKKKKKK 35 T 0.00067 UPF0242 unppssm F Eukaryota T 2lkq 1 A A IGLL1_HUMAN CD179 ANTIGEN-LIKE FAMILY MEMBER B, IG LAMBDA-5, IMMUNOGLOBULIN OMEGA POLYPEPTIDE, IMMUNOGLOBULIN-RELATED PROTEIN 14.1 SRSSLRSRWGRFLLQRGSWTGPRC 24 T 26 Toxin_7 pdbhh F Eukaryota T 2lkw 1 A A Q918V6_9REOV Membrane fusion protein p15 XGQRHSIVQPPAPPPNAFVEIX 22 T 2 DUF4381 unphh T Viruses T 2ll1 1 A A TX1_SELPU U1-TRTX-Sp1a DCGHLHDPCPNDRPGHRTCCIGLQCRYGKCLVR 33 T 0.0017 Tachystatin_A pdbhh F Eukaryota T 2ll2 1 A A CXA1_HUMAN CONNEXIN-43, CX43, GAP JUNCTION 43 KDA HEART PROTEIN KGVKDRVKGKSDPYHATSGALSPAKD 26 T 0.52 7tm_1 unp F Eukaryota T 2ll5 1 A A Cyclo-TC1 GDAYAQWLADGGPSSGRPPPSG 22 T 5.1 MOSC_N pdbhh F T 2ll6 2 B B NOS2_HUMAN HEPATOCYTE NOS, HEP-NOS, INDUCIBLE NO SYNTHASE, INDUCIBLE NOS, INOS, NOS TYPE II, PEPTIDYL-CYSTEINE S-NITROSYLASE NOS2 LKVLVKAVLFACMLMRK 17 T 1.2 DUF488 unppercent F Eukaryota T 2ll7 2 B B NOS3_HUMAN CONSTITUTIVE NOS, CNOS, EC-NOS, ENDOTHELIAL NOS, ENOS, NOS TYPE III, NOSIII KKTFKEVANAVKISASL 17 T 0.028 DUF2774 pdbhh F Eukaryota T 2llo 2 B B ESR1_HUMAN ER, ER-ALPHA, ESTRADIOL RECEPTOR, NUCLEAR RECEPTOR SUBFAMILY 3 GROUP A MEMBER 1 RAANLWPSPLMIKRSKKNS 19 T 4 Tom5 pdbhh F Eukaryota T 2llp 1 A,B,C A,B,C CO1A1_HUMAN ALPHA-1 TYPE I COLLAGEN PPGPQGIAGQRGVVGLPG 18 T 0.019 Collagen pdb F Eukaryota T 2llr 1 A A Alvinellacin RGCYTRCWKVGRNGRVCMRVCT 22 T 0.00044 Toxin_25 pdbhh F T 2lm8 1 A A CDT-LPS KWFRVYRGIYRRR 13 T 2.7 DUF2161 pdbhh F T 2lma 1 A A Thp5 peptide WRPYLQTEYYDVMTVISPPEFG 22 T 9.9 Fmp27_SW pdbhh F T 2lmb 1 A A RAGE_HUMAN RECEPTOR FOR ADVANCED GLYCOSYLATION END PRODUCTS MWQRRQRRGEERKAPENQEEEEERAELNQSEEPEAGESSTGGP 43 T 0.0011 TMEM154 unphh F Eukaryota T 2lmz 1 A A CANA_CONIM Conotoxin im17a IPYCGQTGAECYSWCIKQDLSKDWCCDFVKDIRMNPPADKCP 42 T 0.0032 TSGP1 unphh F Eukaryota T 2ln3 1 A A DE NOVO DESIGNED PROTEIN OR135 MGLTRTITSQNKEELLEIALKFISQGLDLEVEFDSTDDKEIEEFERDMEDLAKKTGVQIQKQWQGNKLRIRLKGSLEHHHHHH 83 T 0.19 Cas_APE2256 pdb F T 2lnd 1 A A DE NOVO DESIGNED PROTEIN, PFK fold MGKVLLVISTDTNIISSVQERAKHNYPGRYIRTATSSQDIRDIIKSMKDNGKPLVVFVNGASQNDVNEFQNEAKKEGVSYDVLKSTDPEELTQRVREFLKTAGSLEHHHHHH 112 T 0.0034 DUF3801 pdb F T 2lny 1 A A ShB peptide MAAVAGLYGLGEDRQHRKKQ 20 T 0.53 AcrZ pdbhh F T 2lo7 1 A A KA20_TITSE TITYUSTOXIN-16 GSGCMKEYCAGQCRGKVSQDYCLKHCKCIPR 31 T 1.1 TCR pdb F Eukaryota T 2lob 2 B B CFTR_HUMAN CFTR, ATP-BINDING CASSETTE SUB-FAMILY C MEMBER 7, CHANNEL CONDUCTANCE-CONTROLLING ATPASE, CAMP-DEPENDENT CHLORIDE CHANNEL EEVQDTRL 8 T 4.7 DUF1507 pdbhh F Eukaryota T 2lox 2 B B RAD2_YEAST DNA repair protein RAD2 GSEILERESEKESSNDENKDDDLEVLSEELFEDVPTKSQISKEAEDNDSRKY 52 T 18 DUF3161 pdbhh F Eukaryota T 2loz 2 B B RHG07_HUMAN DELETED IN LIVER CANCER 1 PROTEIN, DLC-1, HP PROTEIN, RHO-TYPE GTPASE-ACTIVATING PROTEIN 7, START DOMAIN-CONTAINING PROTEIN 12, STARD12, STAR-RELATED LIPID TRANSFER PROTEIN 12 EDHKPGTFPKALTN 14 T 4.9 HAV_VP pdbhh F Eukaryota T 2lpb 2 B B GCN4_YEAST AMINO ACID BIOSYNTHESIS REGULATORY PROTEIN STDSTPMFEYENLEDNSKEWTSLFDNDIPVTTDD 34 T 0.53 Iwr1 unppssm F Eukaryota T 2lq0 1 A A D0EKL2_9BASI de novo designed antifreeze peptide 1m QRSNFHPLAASFIVRCAFEHSRRFT 25 T 3.1 DUF5677 pdbhh F Eukaryota T 2lq4 1 A p Lysophosphatidic acid receptor 1 MQALEKELAQNEWELQALEKELAQLEKELQAWNCICDIENCSNMAPLYSDQALKKKLAQLKWKLQALKKKNAQLKKKLQA 80 T 0.0065 DUF489 pdb F T 2lqc 2 B B CAC1C_HUMAN CALCIUM CHANNEL, L TYPE, ALPHA-1 POLYPEPTIDE, ISOFORM 1, CARDIAC MUSCLE, VOLTAGE-GATED CALCIUM CHANNEL SUBUNIT ALPHA CAV1.2 GTGAALSWQAAIDAARQAKLMGSA 24 T 17 DUF1911 pdbhh F Eukaryota T 2lqg 1 A A TLN1_MOUSE Talin-1 GIDPFTLVQRLEHAAKQAAASATQTIAAAQHAASAPKASAGPQPLLVQSCKAVAEQIPLLVQGVRGSQAQPDSPSAQLALIAASQSFLQPGGKMVAAAKASVPTIQDQASAMQLSQCAKNLGTALAELRTAAQKAQEA 138 T 0.0032 I_LWEQ pdbpercent F Eukaryota T 2lqx 1 A A Trypsin inhibitor BWI-2c SEKPQQELEECQNVCRMKRWSTEMVHRCEKKCEEKFERQQR 41 T 0.001 Vicilin_N pdb F T 2lr0 1 A A P-loop ntpase fold MKILILINTNNDELIKKIKKEVENQGYQVRDVNDSDELKKEMKKLAEEKNFEKILIKSNDKQLLKEMLELISKLGYKVFLLLADQDENELEEFKRKIESQGYEVRKVTDDEEALKIVREFMQKAGSLEHHHHHH 134 T 0.002 NLBH pdb F T 2lr1 2 B B VGLI_HCMVA Immediate early glycoprotein CEALKKALRRHRFLWQRRQRA 21 T 0.064 AbfB unppercent T Viruses T 2lr2 1 A A Immunoglobulin G-binding protein A MGSSHHHHHHSSGVDNKFNKEQQNAFYEILHLPNLNEEQRNAFIQSLKDDSYIDTNNDGAYEGDELSGSQSANLLAEAKKLNDAQAPK 88 T 0.0071 B pdbpssm F T 2lr7 1 A A M9MMP3_LITCT Cathelicidin-PY RKCNFLCKLKEKLRTVITSHIDKVLRPQG 29 T 2.6 Importin_rep_5 pdbhh F Eukaryota T 2lrd 1 A A I3NI56_9EUKA Acanthaporin AMGKCSVLKKVACAAAIAGAVAACGGIDLPCVLAALKAAEGCASCFCEDHCHGVCKDLHLC 61 T 0.13 PNTB pdbpercent F Eukaryota T 2lrh 1 A A De novo designed protein MKELILINTNNDELIKKIKKEVENQGYQVRDVNDSDELKKEMKKLAEEKNFEKILIISNDKQLLKEMLELISKLGYKVFLLLQDQDENELEEFKRKIESQGYEVRKVTDDEEALKIVREFMQKAGSLEHHHHHH 134 T 0.0021 NLBH pdb F T 2ls1 1 A A B5I0A0_9ACTN Uncharacterized protein CVWGGDCTDFLGCGTAWICV 20 T 0.36 CCAP pdbhh F Bacteria T 2lsa 1 A A MAGA_XENLA MAGAININ II GIGKFLHSAKKFGKAFVGEIMNS 23 T 0.87 TAFII28 pdbhh F Eukaryota T 2lse 1 A A Four Helix Bundle Protein MQEERKKLLEKLEKILDEVTDGAPDEARERIEKLAKDVKDELEEGDAKNMIEKFRDEMEQMYKDAPNAVMEQLLEEIEKLLKKAGSLVPRGSYLEHHHHHH 101 T 0.00029 Prominin pdb F T 2lsi 2 B B POLK_HUMAN DINB PROTEIN, DINP GSHKKSFFDKKRSERKW 17 T 12 FDF pdbhh F Eukaryota T 2lsj 2 B B POLK_MOUSE DINB PROTEIN, DINP SHMSHKKSFFDKKRSERISNCQDTS 25 T 0.0065 DUF4113 unphh F Eukaryota T 2lsk 2 B B POLH_HUMAN RAD30 HOMOLOG A, XERODERMA PIGMENTOSUM VARIANT TYPE PROTEIN QSTGTEPFFKQKSLLL 16 T 4.1 Med28 pdbhh F Eukaryota T 2lsp 1 A A TF65_HUMAN NF-kB-K310ac peptide RTYETFXSIMKKS 13 T 2.1 Pab87_oct pdbhh F Eukaryota T 2lsr 2 B B CAD23_HUMAN peptide from Cadherin-23 GSLLKEVLEDYLRLKK 16 T 5.8 Nup54_57_C pdbhh F Eukaryota T 2lsv 2 B B HSP82_YEAST 82 KDA HEAT SHOCK PROTEIN, HEAT SHOCK PROTEIN HSP90 HEAT-INDUCIBLE ISOFORM ADTEMEEVD 9 T 19 CHZ pdbhh F Eukaryota T 2lti 1 A A E8RMD3_ASTEC ASTEXIN1 GLSQGVEPDIGQTYFEESRINQD 23 T 4.6 LSPR pdbhh F Bacteria T 2lto 2 B B RPB1_HUMAN RNA POLYMERASE II SUBUNIT B1, DNA-DIRECTED RNA POLYMERASE II SUBUNIT A, DNA-DIRECTED RNA POLYMERASE III LARGEST SUBUNIT, RNA-DIRECTED RNA POLYMERASE II SUBUNIT RPB1 YSPSSPXYTPQSP 13 T 0.00021 RNA_pol_Rpb1_R pdbhh F Eukaryota T 2ltu 1 A A AAPK2_HUMAN AMPK SUBUNIT ALPHA-2 GPHMSYDANVIDDEAVKEVCEKFECTESEVMNSLYSGDPQDQLAVAYHLIIDNRRIMNQASE 62 T 0.00043 UBA_2 unppssm F Eukaryota T 2ltv 2 B B SMAD7_HUMAN Smad7 derived peptide SPPPPYSRYPMD 12 T 0.082 WBP-1 pdbhh F Eukaryota T 2ltw 2 B B SMAD7_HUMAN Smad7 derived peptide GESPPPPYSRYPMD 14 T 7.5 SlyX pdbhh F Eukaryota T 2ltx 2 B B SMAD7_HUMAN Smad7 derived peptide ELESPPPPYSRYPMD 15 T 2.4 WBP-1 pdbhh F Eukaryota T 2lu2 1 A A H4, PUTATIVE RTMDTQNDVESAGRQSEPMEAADRQAEHPGAPTQSEMKEFQEEIKEGVEETKHEGDPEMTRLMVTEKQESKNFSKMAKSQSFSTRIEELGGSISFLTETGVTMIELPKTVSEHDMDQLLHDILAAGGVVGLDSEVKLA 138 T 0.024 THF_DHG_CYH pdb F T 2lue 2 B B OPTN_HUMAN E3-14.7K-INTERACTING PROTEIN, FIP-2, HUNTINGTIN YEAST PARTNER L, HUNTINGTIN-INTERACTING PROTEIN 7, HIP-7, HUNTINGTIN-INTERACTING PROTEIN L, NEMO-RELATED PROTEIN, OPTIC NEUROPATHY-INDUCING PROTEIN, TRANSCRIPTION FACTOR IIIA-INTERACTING PROTEIN, TFIIIA-INTP NSSGSSEDSFVEIRMAE 17 T 5.1 Pea-VEAacid pdbhh F Eukaryota T 2luf 1 A A Retro Trp-cage peptide SPPPRGSSPGGDKLWQIYLN 20 T 5 DUF1822 pdbhh F T 2lvb 1 A A DE NOVO DESIGNED PFK fold PROTEIN MGKVLLVISTDTNIISSVQERAKHNYPGREIRTATSSQDIRDIIKSMKDNGKPLVVFVNGASQNDVNEFQNEAKKEGVSYDVLKSTDPEELTQRVREFLKTAGSLEHHHHHH 112 T 0.0027 DUF3801 pdb F T 2lvh 1 A A Y059A_AFV1Y Putative zinc finger protein ORF59a MIEVSSMERVYQCLRCGLTFRTKKQLIRHLVNTEKVNPLSIDYYYQSFSVSLKDVNKII 59 T 0.00023 zf-C2H2 pdb T Viruses T 2lvm 2 B B H4_HUMAN Histone H4 GAKRHRKVLRDNIQ 14 T 0.27 UPF0137 unp F Eukaryota T 2lw5 1 A A L7P7M1_9CAUD ACR30-35 GMKFIKYLSTAHLNYMNIAVYENGSKIKARVENVVNGKSVGARDFDSTEQLESWFYGLPGSGLGRIENAMNEISRRENP 79 T 0.091 UXS1_N pdbpercent T Viruses T 2lw6 1 A A H2DQR0_MAGOR AvrPiz-t protein SFVQCNHHLLYNGRHWGTIRKKAGWAVRFYEEKPGQPKRLVAICKNASPVHCNYLKCTNLAAGFSAGTSTDVLSSGTVGS 80 T 12 DUF3918 unphh F Eukaryota T 2lwb 1 A A C5GR14_AJEDR Adhesin WI-1 NCDWDKSHEKYDWELWDKWC 20 T 8.6 NinD pdbhh F Eukaryota T 2lwq 1 A A PawS derived peptide 11 (PDP-11) GCWPVPYPPFFDCKPN 16 T 0.12 Antimicrobial23 pdbhh F T 2lws 1 A A PawS Derived Peptide 4 (PDP-4) GSCFGAFCFRRD 12 T 0.43 LIX1 pdbhh F T 2lwt 1 A A PawS Derived Peptide 5 (PDP-5) GRYRRCIPGMFRAYCYMD 18 T 9.6 IGF2_C pdbhh F T 2lwu 1 A A PawS Derived Peptide 7 (PDP-7) GHCIPTTSGPICLRD 15 T 4.5 Toxin_29 pdbhh F T 2lwv 1 A A PawS Derived Peptide 6 (PDP-6) GHCIQVPPMATEICFSD 17 T 0.78 YcgL pdbhh F T 2lww 2 B B TF65_MOUSE V-REL RETICULOENDOTHELIOSIS VIRAL ONCOGENE HOMOLOG A (AVIAN) GSHMKSTQAGEGTLSEALLHLQFDADEDLGALLGNSTDPGVFTDLASVDNSEFQQLLNQGVSMSHSTAEP 70 T 0.19 HBS1_N pdb F Eukaryota T 2lx0 1 A A Membrane fusion protein p14 KKHTIWEVIAGLVALLTFLAFGFWLFKYLQKK 32 T 0.0041 GAPT pdbhh F T 2lx4 1 A A VPP2_MOUSE V-type proton ATPase 116 kDa subunit a isoform 2 MGSLFRSESMCLAQLFL 17 T 0.0014 V_ATPase_prox unphh F Eukaryota T 2lx5 1 A A ATPE_MYCTU ATP SYNTHASE F1 SECTOR EPSILON SUBUNIT, F-ATPASE EPSILON SUBUNIT DPRIAARGRARLRAVGAI 18 T 0.00011 ATP-synt_DE unppssm F Bacteria T 2lx6 1 A A D5VKJ9_CAUST CAULOSEGNIN I GAFVGQPEAVNPLGREIQG 19 T 0.044 DUF5972 unphh F Bacteria T 2lxg 1 A A CM3A_CONKI Mu-conotoxin KIIIA CCNCSSKWCRDHSRCCX 17 T 0.55 C5HCH pdbhh F Eukaryota T 2lxs 2 B B KMT2A_HUMAN ALL-1, CXXC-TYPE ZINC FINGER PROTEIN 7, LYSINE N-METHYLTRANSFERASE 2A, KMT2A, TRITHORAX-LIKE PROTEIN, ZINC FINGER PROTEIN HRX, MLL CLEAVAGE PRODUCT N320, N-TERMINAL CLEAVAGE PRODUCT OF 320 KDA, P320, MLL CLEAVAGE PRODUCT C180, C-TERMINAL CLEAVAGE PRODUCT OF 180 KDA, P180 DAGNILPSDIMDFVLKNTP 19 T 7.3 PLN_propep pdbhh F Eukaryota T 2lzo 1 A A TX9A_URTGR UGTX ISIDPPCRFCYHRDGSGNCVYDAYGCGAV 29 T 1.3 DUF1247 pdbhh F Eukaryota T 2lzx 1 A A Asteropsin B QGCAFEGESCNVEFYPCCPGLGLTCIPGNPDGTCYYL 37 T 0.059 Tachystatin_A pdbhh F T 2lzy 1 A A ABU8-3 QDCPGEGEQCDVEFNPCCPPLTCIPGDPYGICYII 35 T 0.00043 Tachystatin_A pdbhh F T 2m0j 2 B B CNGA2_RAT CYCLIC NUCLEOTIDE-GATED CATION CHANNEL 2, CYCLIC NUCLEOTIDE-GATED CHANNEL ALPHA-2, CNG CHANNEL ALPHA-2, CNG-2, CNG2, CYCLIC NUCLEOTIDE-GATED OLFACTORY CHANNEL SUBUNIT OCNC1 TPRRGRGGFQRIVRLVGVIRDWANKNFR 28 T 0.16 CtsR pdbhh F Eukaryota T 2m0w 1 A A ALPS peptide DFLNSAMSSLYSGWSSFTTGASK 23 T 46 DUF4748 pdbhh F T 2m14 2 B B RAD4_YEAST DNA repair protein RAD4 GSTDDSVEEIQSSEEDYDSEEFEDVTDGNEVAGVEDISVEIK 42 T 15 UL11 pdbhh F Eukaryota T 2m20 1 A,B A,B EGFR_HUMAN PROTO-ONCOGENE C-ERBB-1, RECEPTOR TYROSINE-PROTEIN KINASE ERBB-1 KIPSIATGLVGALLLLLVVALGIGLFIRRRHIVRKRTLRRLLQERELVEPLTPSGEKLWS 60 T 0.0014 GAPT pdb F Eukaryota T 2m2q 1 A A V5IRT8_MOMCH Inhibitor cystine knot peptide MCh-1 GCAGKSCNILGSDPCDAGCFCLPVGIVAGVCV 32 T 0.0001 Albumin_I pdbhh F Eukaryota T 2m2r 1 A A V5IRT9_MOMCH Inhibitor cystine knot peptide MCh-2 GCAGKACNLLGLTCDAGCFCRPDGVGIVAGVCV 33 T 0.00011 Albumin_I unphh F Eukaryota T 2m32 2 B,C,D B,C,D GLOGEN peptide XGPPGPPGLPGENGPPGPPGPPX 23 T 0.00073 Collagen pdbpssm F T 2m35 1 A A TXK1A_SCOMU k-Ssm1a TDDESSNKCAKTKRRENVCRVCGNRSGNDEYYSECCESDYRYHRCLDLLRNF 52 T 2.6 DUF2614 unphh F Eukaryota T 2m37 1 A A E8RMD3_ASTEC ASTEXIN-1 GLSQGVEPDIGQTYFEESR 19 T 2.8 LSPR pdbhh F Bacteria T 2m3a 1 A A KNL2_CAEEL Protein KNL-2 GPLGSVAKKITWRKQDLDRLKRVIALKKPSASDADWTEVLRLLAKEGVVEPEVVRQIAITRLKWVEP 67 T 0.012 Kdo pdb F Eukaryota T 2m3j 1 A A I1SB10_9METZ Asteropsin_E CPGEGEQCDVEFNPCCPPLTCIPGDPYGICYII 33 T 0.0004 Tachystatin_A pdbhh F Eukaryota T 2m3m 2 B B VE6_HPV51 Protein E6 QRTRQRNETQV 11 T 0.072 DUF3716 unp T Viruses T 2m3o 2 B P SCNNA_HUMAN ALPHA-NACH, EPITHELIAL NA(+) CHANNEL SUBUNIT ALPHA, ALPHA-ENAC, ENACA, NONVOLTAGE-GATED SODIUM CHANNEL 1 SUBUNIT ALPHA, SCNEA TAPPPAYATLG 11 T 5.2 Myc_target_1 pdbhh F Eukaryota T 2m41 1 A A CIC_HUMAN Protein capicua homolog VFPWHSLVPFLAPSQ 15 T 15 DUF6356 pdbhh F Eukaryota T 2m45 1 A A MCM_SULSO Minichromosome maintenance protein MCM GSHMGESGKIDIDTIMTGKPKSAREKMMKIIEIIDSLAVSSECAKVKDILKEAQQVGIEKSNIEKLLTDMRKSGIIYEAKPECYKKV 87 T 0.0016 RPA_C pdbpercent F Archaea T 2m4i 1 A A MINC_BACSU Septum site-determining protein MinC GSHMKTKKQQYVTIKGTKNGLTLHLDDACSFDELLDGLQNMLSIEQYTDGKGQKISVHVKLGNRFLYKEQEEQLTELIASKKDLFVHSIDSEVITKKEAQQIREE 105 T 0.009 AF0941-like pdbpssm F Bacteria T 2m56 1 A A CPXA_PSEPU CYTOCHROME P450-CAM, CYTOCHROME P450CAM LAPLPPHVPEHLVFDFDMYNPSNLSAGVQEAWAVLQESNVPDLVWTRCNGGHWIATRGQLIREAYEDYRHFSSECPFIPREAGEAYDFIPTSMDPPEQRQFRALANQVVGMPVVDKLENRIQELACSLIESLRPQGQCNFTEDYAEPFPIRIFMLLAGLPEEDIPHLKYLTDQMTRPDGSMTFAEAKEALYDYLIPIIEQRRQKPGTDAISIVANGQVNGRPITSDEAKRMCGLLLVGGLDTVVNFLSFSMEFLAKSPEHRQELIQRPERIPAACEELLRRFSLVADGRILTSDYEFHGVQLKKGDQILLPQMLSGLDERENACPMHVDFSRQKVSHTTFGHGSHLCLGQHLARREIIVTLKEWLTRIPDFSIAPGAQIQHKSGIVSGVQALPLVWDPATTKAV 404 T 1.6E-05 p450 unppercent F Bacteria T 2m5z 1 A A Q1A2D3_ENTFL ENTEROCIN 7A, ENTEROCIN MR10A, ENTEROCIN NA MGAIAKLVAKFGWPIVKKYYKQIMQFIGEGWAINKIIDWIKKHI 44 T 0.0087 Bacteriocin_IIi pdbhh F Bacteria T 2m60 1 A A Q1A2D2_ENTFL ENTEROCIN 7B, ENTEROCIN MR10B, ENTEROCIN NB MGAIAKLVAKFGWPFIKKFYKQIMQFIGQGWTIDQIEKWLKRH 43 T 9E-05 Bacteriocin_IIi unphh F Bacteria T 2m61 1 A A TXM46_CONAO CONOTOXIN AR1430 CCRLACGLGCHPCCX 15 T 0.24 Radical_SAM_2 pdbhh F Eukaryota T 2m62 1 A A CXT48_CONAO CONOTOXIN AR1232 GVCCGVSFCYPC 12 T 0.18 Oxidored-like unphh F Eukaryota T 2m6c 1 A A Contryphan-In GCVXYPWC 8 T 0.027 ProRS-C_1 pdbhh F T 2m6j 1 A A A0A023GPI4_9ARAC Toxin AbTx IKSCETFIVACDGGKACREVKCKTIX 26 T 1.4 TFIIA_gamma_C pdbhh F Eukaryota T 2m6x 1 A,B,C,D,E,F A,B,C,D,E,F POLG_HCVEV p7 GAKNVIVLNAASAAGNHGFFWGLLVVTLAWHVKGRLVPGATYLSLGVWPLLLVRLLRPHRALA 63 T 0.23 FixQ pdbpercent T Viruses T 2m77 1 A A [Asp2]RTD-1 GDCRCLCRRGVCRCICTR 18 T 0.89 DUF5354 pdbhh F T 2m78 1 A A [Asp11]RTD-1 GFCRCLCRRGDCRCICTR 18 T 0.43 Albumin_I pdbhh F T 2m79 1 A A [Asp2,11]RTD-1 GDCRCLCRRGDCRCICTR 18 T 0.48 Albumin_I pdbhh F T 2m7a 1 A A Uncharacterized protein GSMKRGVEMSIHDLCEDQEQWAMQTLMGSGVLARCRIHNDVILDSGNDASSAYKLGTYLYQKDNSCNLFNTLTEARDAIKDAYESYCGIDDCPQCSKYIDD 101 T 0.071 Phage_FRD3 unppssm F T 2m7b 1 A A Q88G17_PSEPK uncharacterized protein GSMGGIKRLMEEEDAKYSEAVYIAIEAGTLAECEVHEGTYFSDSGDISEAEELAREKFEKGEVSNFDDVEELVKKVVAVCEELGAEECFSCDFD 94 T 0.55 DUF5789 unphh F Bacteria T 2m7c 1 A A Trp-Cage mini-protein RPPPSDXAAYAQWLADXGWAS 21 T 1.1 DUF3349 pdbhh F T 2m7d 1 A A Trp-Cage mini-protein DAYAQWLADXGWASXRPPPS 20 T 3 Sec16_C pdbhh F T 2m7i 1 A A Beta-Hairpin Peptidomimetic antibiotic TWL(DAB)(ORN)(DLY)RW(ORN)(DAB)AK(DPR)P TWLXXXRWXXAKXP 14 T 1.5 Mak_N_cap pdbhh F T 2m7j 1 A A beta-Hairpin Peptidomimetic Antibiotic TWLKKRRWKKAK(DPR)P TWLKKRRWKKAKXP 14 T 0.82 Mak_N_cap pdbhh F T 2m7r 1 A A CON BK-B GEEEYSEAIX 10 T 0.19 Toxin_36 pdbhh F T 2m8f 1 A A E8RUP8_ASTEC astexin3 GPTPMVGLDSVSGQYWDQHAPLAD 24 T 2.1 Cut12 pdbhh F Bacteria T 2m8s 2 B B HBEGF_HUMAN HEPARIN-BINDING EGF-LIKE GROWTH FACTOR, HB-EGF, HBEGF, DIPHTHERIA TOXIN RECEPTOR, DT-R RYHRRGGYDVENEEKVKLGMTNSH 24 T 0.011 DAG1 unppssm F Eukaryota T 2ma3 1 A A O27798_METTH DNA replication initiator (Cdc21/Cdc54) GAMGETGKIDIDKVEGRTPKSERDKFRLLLELIKEYEDDYGGRAPTNILITEMMDRYNVSEEKVEELIRILKDKGAIFEPARGYLKIV 88 T 0.0019 Sigma70_r3 unppercent F Archaea T 2maa 1 A A TEMA_RANTE Temporin-A FLPLIGRVLSGIL 13 T 0.59 Endotoxin_N pdbhh F Eukaryota T 2mae 1 A A TACD2_HUMAN CELL SURFACE GLYCOPROTEIN TROP-2, MEMBRANE COMPONENT CHROMOSOME 1 SURFACE MARKER 1, PANCREATIC CARCINOMA MARKER PROTEIN GA733-1 TNRRKSGKYKKVEIKELGELRKEPSL 26 T 0.0044 DAG1 unppercent F Eukaryota T 2mag 1 A _ MAGA_XENLA MAGAININ 2 GIGKFLHSAKKFGKAFVGEIMNSX 24 T 0.98 TAFII28 pdbhh F Eukaryota T 2mai 1 A A Lassomycin GLRRLFANQLVGRRNX 16 T 5.1 Rod_cone_degen pdbhh F T 2mak 2 B,D B,D CRCM1_HUMAN PROTEIN ORAI-1, TRANSMEMBRANE PROTEIN 142A GSELNELAEFARLQDQLDHRGDH 23 T 0.029 DUF2207 unppssm F Eukaryota T 2mbd 1 A A W5IDB3_LASLA lasiocepsin GLPRKILCAIAKKKGKCKGPLKLVCKC 27 T 1.3 Antimicrobial_1 pdbhh F Eukaryota T 2mbl 1 A A Top7 Fold Protein Top7m13 MSGKKVEVQVKITCNGKTYERTYQLYAVRDEELKEKLKKVLNERMDPIKKLGCKRVRISIRVKHSDAAEEKKEAKKFAAILNKVFAELGYNDSNVTWDGDTVTVEGQLEGVDLEHHHHHH 120 T 0.0046 N-glycanase_N pdb F T 2mc1 2 B B KSYK_MOUSE SPLEEN TYROSINE KINASE DTEVYESPXADPE 13 T 23 Holin_2-3 pdbhh F Eukaryota T 2mc3 1 A A MUS81_HUMAN CDNA FLJ44872 FIS, CLONE BRAMY2022320, HIGHLY SIMILAR TO CROSSOVER JUNCTION ENDONUCLEASE MUS81 (EC 3.1.22.-) GPTMGSGSYWPARHSGARVILLVLYREHLNPNGHHFLTKEELLQRCAQKSPRVAPGSAPPWPALRSLLHRNLVLRTHQPARYSLTPEGLELAQKLAESEGLSLLNVGIG 109 T 0.00099 DUF6429 pdbpercent F Eukaryota T 2mc4 1 A A O52732_STRCH BLDD MEPPPKLVLDLERLATVPAEKAGPLQRYAATIQSQRGDYNGKVLSIRQDDLRTLAVIYDQSPSVLTEQLISWGVLDADARRAVASHDEL 89 T 0.053 DUF43 pdbpercent F Bacteria T 2mc5 1 A A Q8LTJ5_9CAUD RNA POLYMERASE INHIBITOR P7 MNEFTQISGYVNAFGSQRGSVLTVKVENDEGWTLVEEDFDRADYGSDPEFVAEVSSYLKRNGGIKDLTKVLTR 73 T 0.18 DUF1494 pdb T Viruses T 2mc6 2 B B RPOC_XANOR RNAP SUBUNIT BETA', RNA POLYMERASE SUBUNIT BETA', TRANSCRIPTASE SUBUNIT BETA' MKDLLNLFNQ 10 T 6.5 CRM1_repeat_3 pdbhh F Bacteria T 2mc7 1 A A B0LJC7_SALTM Regulatory peptide MNRSPDKIIALIFLLISLLVLCLALWQIVF 30 T 0.038 DUF202 pdb F Bacteria T 2mcd 1 A A Q80J95_9CALI Murine norovirus 1 MRGSHHHHHHGSVSFGAPSPLSSESEDEINYMTPPEQEAQPGALAALHAEGPLAGLPVTRSDARVLIFNEWEERKKSEPWLRLDMSDKAIFRRYPHLR 98 T 5.3 Amidase pdbhh T Viruses T 2mce 1 A A TKN1_RABIT NPGAMMA, PROTACHYKININ-1 DAGHGQISHKRHKTDSFVGLM 21 T 0.0051 Tachykinin pdbhh F Eukaryota T 2mcf 1 A A C5A217_THEGJ TGAM_1934 MKYDVVIIPESFHRFDKHNMEHICPPMVIGDRSYDIAMEIVNGVDRVIKASFNASVEELEGEDCDVLYRKYTLEKEGKKGIVHVKLRKITENCPPVDGNRCSVLEFERDIECIVKAIEECLAKGELNSKLEGKPIPNPLLGLDSTRTG 148 T 0.088 DDE_Tnp_1 pdbpssm F Archaea T 2mch 1 A A Q80J95_9CALI Murine norovirus 1 MRGSHHHHHHGSGALAALHAEGPLAGLPVTRSDARVLIFNEWEERKKSDPWLRLDMSDKAIFRRYPHLR 69 T 2.9 DUF3539 pdbhh T Viruses T 2mck 1 A A H6WEV7_9CALI Polyprotein MRGSHHHHHHGSGALAALHADGPHAGLPVTRSDARVLIFNDWEERKRSEPWLRLDMSDKAIFRRYPHLR 69 T 2.7 DUF3539 pdbhh T Viruses T 2mfa 1 A A 3SX2_DENPO MAMB-2, PI-DP2 LKCFQHGKVVTCHRDMKFCYHNTGMPFRNLKLILQGCSSSCSETENNKCCSTDRCNK 57 T 0.0012 Toxin_TOLIP pdb F Eukaryota T 2mfm 1 A A G7K427_MEDTR CEP11 AFRXTAPGHSXGVGH 15 T 0.22 RNA_pol_Rpb1_R unp F Eukaryota T 2mfo 1 A A G7K427_MEDTR CEP1 AFQXTTPGNSXGVGH 15 T 0.22 RNA_pol_Rpb1_R unp F Eukaryota T 2mfq 2 B B NTRK2_HUMAN GP145-TRKB, TRK-B, NEUROTROPHIC TYROSINE KINASE RECEPTOR TYPE 2, TRKB TYROSINE KINASE, TROPOMYOSIN-RELATED KINASE B GPDAVIIGMTKIPVIENPQXFGI 23 T 8.9 DUF6330 pdbhh F Eukaryota T 2mfs 1 A A Ep-AMP1 CVLIGQRCDNDRGPRCCSGQGNCVPLPFLGGVCAV 35 T 0.0023 Toxin_7 pdb F T 2mfv 1 A A F0CAT1_9XANT Xanthomonin II GGPLAGEEMGGITT 14 T 4.9 Rhabdo_M2 pdbhh F Bacteria T 2mg5 2 B B NOS3_HUMAN target peptide TFKEVANAVKISASLM 16 T 0.013 DUF2774 pdbhh F Eukaryota T 2mgw 1 A A NBR1_HUMAN CELL MIGRATION-INDUCING GENE 19 PROTEIN, MEMBRANE COMPONENT CHROMOSOME 17 SURFACE MARKER 2, NEIGHBOR OF BRCA1 GENE 1 PROTEIN, PROTEIN 1A1-3B GPLGSSEDQTAALMAHLFEMGFCDRQLNLRLLKKHNYNILQVVTELLQLNNN 52 T 0.0014 UBA_5 pdbhh F Eukaryota T 2mh0 1 A A TFE2_HUMAN CLASS B BASIC HELIX-LOOP-HELIX PROTEIN 21, BHLHB21, IMMUNOGLOBULIN ENHANCER-BINDING FACTOR E12/E47, IMMUNOGLOBULIN TRANSCRIPTION FACTOR 1, KAPPA-E2-BINDING FACTOR, TRANSCRIPTION FACTOR 3, TCF-3, TRANSCRIPTION FACTOR ITF-1 GSMNQPQRMAPVGTDKELSDLLDFSMMFPLPVTNGKGRP 39 T 32 DUF1480 pdbhh F Eukaryota T 2mh5 1 A A LAN91_MICS0 Lantibiotic 107891 VXXXXLCXPGCTXPGGGXNCXFCX 24 T 0.00093 Gallidermin unppssm F Bacteria T 2mhy 1 A A Q0GB44_9SALA Plethodontid modulating factor LQCNTLDGGTEECIPGIYNVCVHYKSEDEEYKSCGIQEECEDAEGATVLCCPEDLCN 57 T 0.042 Defensin_propep unppercent F Eukaryota T 2mid 1 A A CLE10_ARATH CLE10P RLVPSGPNPLHN 12 T 21 DUF502 pdbhh F Eukaryota T 2mie 1 A A CLE41_ARATH TRACHEARY ELEMENT DIFFERENTIATION INHIBITORY FACTOR-LIKE PROTEIN, TDIF-LIKE PROTEIN, CLE44P HEVPSGPNPISN 12 T 2.6 DUF502 pdbhh F Eukaryota T 2mif 1 A A CLAVATA-like encoded peptide of Meloidogyne hapla - MhCLE4 HEVPSGPNPSSN 12 T 1.1 DUF2315 pdbhh F T 2mig 1 A A CLAVATA-like encoded peptide of Meloidogyne hapla - MhCLE5 RKVPTGSNPQKN 12 T 4.6 Bradykinin pdbhh F T 2mih 1 A A CLAVATA-LIKE ENCODED PEPTIDE OF MELOIDOGYNE HAPLA - MHCLE6/7 HQVPSGPNPLHNKK 14 T 3.3 DUF3581 pdbhh F T 2mip 2 E,F,G,H E,F,G,H INHIBITOR BI-LA-398 FVFLEIX 7 T 51 HVSL pdbhh F T 2mix 1 A A T3A_TERVA venom peptide toxin TRICCGCYWNGSKDVCSQSCC 21 T 1.1 Hepcidin pdbhh F Eukaryota T 2mjf 1 A A RSA1_YEAST Ribosome assembly 1 protein GPHMFANENSQLLDFIRELGDVGLLEYELSQQEKDVLFGS 40 T 0.17 SpoIISA_toxin unppssm F Eukaryota T 2mjq 1 A A ANOP_ANOSM AS-183 GLLKRIKTLLX 11 T 4.2 PspA_IM30 unphh F Eukaryota T 2mjr 1 A A ANOP_ANOSM AS-183 GLLKWIKTLLX 11 T 0.52 DUF4653 pdbhh F Eukaryota T 2mjs 1 A A ANOP_ANOSM AS-183 GLLKKIKWLLX 11 T 4.2 PspA_IM30 unphh F Eukaryota T 2mjt 1 A A ANOP_ANOSM AS-183 GLLKFIKWLLX 11 T 0.44 Lipoprotein_10 pdbhh F Eukaryota T 2mjv 1 A A TWST1_HUMAN CLASS A BASIC HELIX-LOOP-HELIX PROTEIN 38, BHLHA38, H-TWIST SPAQGXRGXKSA 12 T 10 Parecho_VpG pdbhh F Eukaryota T 2mk0 1 A A O22015_CYLFU PLEURALIN-1, FORMERLY HEP200 SYYHHHHHHTMMPSPEPSSQPSDCGEVIEECPIDACFLPKSDSARPPDCTAVGRPDCNVLPFPNNIGCPSCCPFECSPDNPMFTPSPDGSPPNCSPTMLPSPSPSAVTVPLTPTMLPSPS 120 T 40 DUF35_N pdbhh F Eukaryota T 2mkc 2 B B PML1_YEAST Pre-mRNA leakage protein 1 GSKSQYIDIMPDFSPSGLLELES 23 T 7.1 VirE_N pdbhh F Eukaryota T 2mkr 2 B B EBNA2_EBVB9 EBNA-2, EBV NUCLEAR ANTIGEN 2 DLDESWDYIFETT 13 T 0.75 DUF3841 pdbhh T Viruses T 2ml5 1 A A A7LT22_BACO1 Uncharacterized protein GDSELTTQDGEDFKSFLDKFTSSAAFQYTRVKFPLKTPITLLADDGETEKTFPFTKEKWPLLDSETMKEERITQEEGGIYVSKFTLNEPKHKIFEAGYEESEVDLRVEFELQADGKWYVVDCYTGWYGYDLPIGELKQTIQNVKEENAAFKEIHP 155 T 0.00055 DUF4348 unppssm F Bacteria T 2ml6 1 A A A7V0E7_BACUC Uncharacterized protein GAEEEDFKTFLQKFTSSASFQYSRIKFPLKSPIALLKDDGETEQTFPFTREKWALLDEETLKEGRTTEEEGGTYISHFTVNEPAHKEFEAGYDESEPSLRVVFELTDGKWYVTDCYNDWYNFDLPINELEETIQAVQEENKAFEELHP 148 T 0.00018 DUF4348 pdbpercent F Bacteria T 2mlj 1 A A A0A0H2UKY1_CAUSK Caulonodin V SIGDSGLRESMSSQTYWP 18 T 7.8 Herpes_UL47 pdbhh F Bacteria T 2mlp 1 A _ MCBA_ECOLX MCBA PROPEPTIDE MELKASEFGVVLSVDALKLSRQSPLGX 27 T 2.8 DUF3905 pdbhh F Bacteria T 2mlu 1 A A Q7X2B5_LACLL LsbB MKTILRFVAGYDIASHKKKTGGYPWERGKA 30 T 0.82 DUF4262 pdbhh F Bacteria T 2mm5 1 A A A0A0S0ZR47_9GENT Alpha amylase Alstotide S4 CVPQYGVCDGIINQCCDPYYCSPPIYGHCI 30 T 0.0051 Toxin_35 unp F Eukaryota T 2mm6 1 A A A0A0S0ZR07_9GENT Alpha amylase Alstotide S1 CRPYGYRCDGVINQCCDPYHCTPPLIGICL 30 T 0.0063 Conotoxin unphh F Eukaryota T 2mmj 1 A A MCU11_LITGE maculatin G15 GLFGVLAKVAXHVVGAIAEHFX 22 T 4.7E-05 Caerin_1 unphh F Eukaryota T 2mmt 1 A A MCJA_ECOLX MCCJ25(RGDF) GGAGHVPEYFVRGDFPISFYG 21 T 0.13 Endonuc-BglII unp F Bacteria T 2mmw 1 A A MCJA_ECOLX MCCJ25 GGAGHVPEYFVRGDTPISFYG 21 T 0.13 Endonuc-BglII unp F Bacteria T 2mni 1 A A Q4D059_TRYCC HP_Q4D059 GSAMGHMPAVDVEIHFPLKRIAAEGYAEDELLLNQMGKVNDTPEEEGMPLRAWVIKCAHEALEKNPKIREVYLKPRAVKNSSVQFHVIFDEE 92 T 8.9 TetR_C_1 unphh F Eukaryota T 2mnu 2 B B APT SSSPIQGSWTWENGKWTWKGIIRLEQ 26 T 0.8 WXXGXW pdbhh F T 2mnw 1 A A SHQ1_HUMAN Protein SHQ1 homolog GSTAIGMKETAAAKFERQHMDSPDLGTGGGSGDDDDKMLTPAFDLSQDPDFLTIAIRVSYARVSEFDVYFEGSDFKFYAKPYFLRLTLPGRIVENGSEQGSYDADKGIFTIRLPKETPGQHFEGLNMLTALLA 133 T 0.002 PIH1_CS pdbpercent F Eukaryota T 2moa 1 A A CA1_CONIM ALPHA-CTX IMI GXASDPRCAWRCX 13 T 0.0098 Toxin_8 unphh F Eukaryota T 2moc 1 A A TKN4_HUMAN ENDOKININ-A/B TGKASQFFGLM 11 T 0.00015 Tachykinin unphh F Eukaryota T 2mow 2 B B PAP2_YEAST TRF4P, DNA POLYMERASE KAPPA, DNA POLYMERASE SIGMA, TOPOISOMERASE 1-RELATED PROTEIN TRF4 DDDEDGYNPYTL 12 T 3.8 IreB pdbhh F Eukaryota T 2mp2 3 C C RNF4_MOUSE RING FINGER PROTEIN 4 TVGDEIVDLTCESLEPVVVDLTHND 25 T 0.012 Bac_luciferase unppercent F Eukaryota T 2mp9 1 A A AFP_CENMR CM-P1 SRSELIVHQRLFX 13 T 5.7 Nbs1_C unphh F Eukaryota T 2mpl 1 A A FOG1_MOUSE FRIEND OF GATA PROTEIN 1, FOG-1, FRIEND OF GATA 1, ZINC FINGER PROTEIN MULTITYPE 1 PWSGPEELELALQDGQRCVRARLSLTEGLSWGPFYGSIQTRALSPEREEPGPAVTLMVDESCWLRMLPQVLTEEAANSEIYRKDDALWCRVTKVVPSGGLLYVRLVTEPHGAPRHPVQEPVEPGGLA 127 T 0.069 SET unphh F Eukaryota T 2mpm 2 B B CCR3 VETFGTTSYYDDVGLL 16 T 2.3 G6PD_C pdbhh F T 2mpo 1 A A Q967S9_TOXGO MIC2-associated protein TFLELVEVPCNSVHVQGVMTPNQMVKVTGAGWDNGVLEFYVTRPTKTGGDTSRSHLASIMCYSKDIDGVPSDKAGKCFLKNFSGEDSSEIDEKEVSLPIKSHNDAFMFVCSSNDGSALQCDVFALDNTNSSDGWKVNTVDLGVSVSPDLAFGLTADGVKVKKLYASSGLTAINDDPSLGCKA 182 T 3.9E-05 Etmic-2 unphh F Eukaryota T 2mps 2 B B P73_HUMAN P53-LIKE TRANSCRIPTION FACTOR, P53-RELATED PROTEIN DGGTTFEHLWSSLEPD 16 T 0.019 P53_TAD unphh F Eukaryota T 2mpv 1 A A O30595_ECOLX Major fimbrial subunit of aggregative adherence fimbria II AafA NFCDITITPATNRDVNVDRSANIDLSFTIRQPQRCADAGMRIKAWGEANHGQLLIKPQGGNKSAGFTLASPRFSYIPNNPANIMNGFVLTNPGVYQLGMQGSITPAIPLRPGLYEVVLNAELVTNDNKQNATAVAKTATSTITVV 145 T 0.0003 SEF14_adhesin unphh F Bacteria T 2mq2 1 A A CDP-1 peptide, Cysteine Deleted Protegrin-1 RGGRLYRRRFVVGR 14 T 6.9 Sid-5 pdbhh F T 2mq4 1 A A RR11 peptide from Cysteine Deleted Protegrin-1 RLYRRRFVVGR 11 T 2.9 Sid-5 pdbhh F T 2mq5 1 A A LR10 peptide from Cysteine Deleted Protegrin-1 LYRRRFVVGR 10 T 3.7 DUF2623 pdbhh F T 2mq8 1 A A De novo designed protein LFR1 MLTVEVEVKITADDENKAEEIVKRVIDEVEREVQKQYPNATITRTLTRDDGTVELRIKVKADTEEKAKSIIKLIEERIEEELRKRDPNATITRTVRTEVGSSWSLEHHHHHH 112 T 0.00044 CinA_KH pdb F T 2mqd 1 A A A5VHK8_LACRD Uncharacterized protein GHMKFTDQQIGVLAGLAISPEWLKQNIAANQLVYGIVKPSDTVPAGVDDYSYLVAADDQDGTIIFFKAEGQTVIIKYTSQRNTKLKAKALTLSQLKKEFYQTRSQKREVDDYVAGLRTE 119 T 0.012 Imm42 pdbpssm F Bacteria T 2mr5 1 A A De novo designed Protein OR457 MGTVVIVVSNDERILEELLEVVLKSDPNVKTVRTDDKEKVKEEIEKARKQGRPIVIFIRGAYEEVVRDIVEYAQKEGLRVLVIKVAQDQELLERFYEQLKKDGVDVRVTDNEDEAKKRLKELLEKVGSLEHHHHHH 136 T 0.00094 ANF_receptor pdbpercent F T 2mra 1 A A De novo designed protein OR459 MAGKELRVEIKIDCGNDDKETTYDLYFSKAEEAKELLKKVAEKAADKIKKQGCKRVKIRFEKKGLDDDARKKAKKWALEVANKIANELGAKQSTTTTDGDTFEVEVILELEHHHHHH 117 T 0.0058 DUF4230 pdbpercent F T 2mrk 2 B B FYN_HUMAN PROTO-ONCOGENE SYN, PROTO-ONCOGENE C-FYN, SRC-LIKE KINASE, SLK, P59-FYN EPQXQPGENL 10 T 5.5 Leader_Erm pdbhh F Eukaryota T 2mrl 1 A A Q2SV23_BURTA Uncharacterized protein BTH I2711 MDRIFMTRTEALEFLLKAHQTAVDKIGHPSHKQTPADHAAIEALDRLLLDVRARRVDQFQINASAAQIIVTD 72 T 7.6 Thioredoxin_11 pdbhh F Bacteria T 2ms4 2 B B CRK_HUMAN Peptide PEPGPYAQP 9 T 8.9 HMMR_N pdbhh F Eukaryota T 2msa 1 A A CSP_PLAFA Circumsporozoite protein peptide KNSFSLGENPNANPX 15 T 3.1 DUF1930 pdbhh F Eukaryota T 2msf 1 A A KEX11_TITSE TS11 KPKCGLCRYRCCSGGCSSGKCVNGACDCS 29 T 0.85 Toxin_2 pdb F Eukaryota T 2msq 1 A A Conotoxin cBru9a SCGGSCFGGCWPGCSCYARTCFRDGLP 27 T 0.048 Cyclotide pdbhh F T 2msr 1 A A KMT2A_HUMAN LYSINE N-METHYLTRANSFERASE 2A, ALL-1, CXXC-TYPE ZINC FINGER PROTEIN 7, MYELOID/LYMPHOID OR MIXED-LINEAGE LEUKEMIA, MYELOID/LYMPHOID OR MIXED-LINEAGE LEUKEMIA PROTEIN 1, TRITHORAX-LIKE PROTEIN, ZINC FINGER PROTEIN HRX, MLL CLEAVAGE PRODUCT N320, N-TERMINAL CLEAVAGE PRODUCT OF 320 KDA, P320, MLL CLEAVAGE PRODUCT C180, C-TERMINAL CLEAVAGE PRODUCT OF 180 KDA, P180 GGSGEDEQFLGFGSDEEVRVR 21 T 4.7 EF-1_beta_acid pdbhh F Eukaryota T 2mtg 1 A A LARP6_HUMAN ACHERON, ACHN, LA RIBONUCLEOPROTEIN DOMAIN FAMILY MEMBER 6 GNENLPSKMLLVYDLYLSPKLWALATPQKNGRVQEKVMEHLLKLFGTFGVISSVRILKPGRELPPDIRRISSRYSQVGTQECAIVEFEEVEAAIKAHEFMITESQGKENMKAVLIGMKP 119 T 6.7E-05 Nup35_RRM pdbhh F Eukaryota T 2mtl 1 A A De novo designed protein FR55 OR109 MGEMDIRFRGDDLEALEKALKEMIRQARKFAGTVTYTLDGNDLEIRITGVPEQVRKELAKEAERLAKEFNITVTYTIRGSLEHHHHHH 88 T 0.0035 DUF2067 pdbpercent F T 2mtm 1 A A B9T5G6_RICCO STABLE PEPTIDE BIOMARKER RCB-1 ARCCLVMPVPPFACVKFCS 19 T 0.017 GRP unp F Eukaryota T 2mto 1 A A CA1A_CONRE Alpha-conotoxin RgIA GXCSDPRXRYRCR 13 T 0.00026 Toxin_8 unp F Eukaryota T 2mtq 1 A A Designed Peptide MGSWAEFKQRLAAIKTRCQALGGSEAECAAFEKEIAAFESELQAYKGKGNPEVEALRKEAAAIRDECQAYRHN 73 T 0.021 DUF1202 pdb F T 2mts 1 A A POLG_HCVJ4 HEPATITIS C VIRUS P7 PROTEIN ALENLVVLNAASVAGAHGILSFLVFFSAAWYIKGRLAPGAAYAFYGVWPLLLLLLALPPRAYA 63 T 0.23 FixQ pdbpercent T Viruses T 2mtw 1 A A EBA1_PLAFC EBA-175 YTNQNINISQERDLQKHGFH 20 T 0.023 DBP pdbhh F Eukaryota T 2mty 1 A A Q9U3Y8_PLAFA STARP antigen VIKHNRFLSEYQSNFLGGGY 20 T 0.83 Yos9_DD pdbhh F Eukaryota T 2mu6 1 A A Q9U3Y8_PLAFA STARP antigen KSMINAYLDKLDLETVRKIH 20 T 1.2 zf-Nse unppssm F Eukaryota T 2mu7 1 A A MSP1_PLAFW 1513 MSP-1 peptide GYSLFQKEKMVLNEGTSGTA 20 T 4.4 NOP5NT pdbhh F Eukaryota T 2mu8 1 A A MSA2_PLAF7 MSP-2 peptide KNESKYSNTFINNAYNMSIR 20 T 4.3 GatD_N pdbhh F Eukaryota T 2mu9 1 A A ABRA_PLAF7 P101/acidic basic repeat antigen KMNMLKENVDYIQKNQNLFK 20 T 0.91 ComX pdbhh F Eukaryota T 2muf 1 A A M1EUE6_PLAFA TRSP SDVRYNKSFINNRLLNEHAH 20 T 0.65 preATP-grasp_3 unp F Eukaryota T 2mug 1 A A SERA_PLAFG Serine-repeat antigen protein XNEVSERVHVYHILKHIKDGKX 22 T 8.4 Gemini_AC4_5 pdbhh F Eukaryota T 2muh 1 A A PG2_PIG PG-2 RGGRLCYCRRRFCVCV 16 T 0.075 Defensin_1 pdbhh F Eukaryota T 2muj 1 A A T1RTG8_PLAFA 111 KDA ANTIGEN, P126 YDNILVKMFKTNENNDKSELI 21 T 16 DUF4643 pdbhh F Eukaryota T 2mun 1 A A TX6A_SCOMU MU-SLPTX-SSM6A ADNKCENSLRREIACGQCRDKVKTDGYFYECCTSDSTFKKCQDLLH 46 T 4.1 Ribosomal_L32p unphh F Eukaryota T 2muz 1 A,B,C,D A,B,C,D designed rocker protein YYKEIAHALFSALXALSELYIAVRYX 26 T 9.4 DUF6332 pdbhh F T 2mv3 1 A A YME1_YEAST PROTEIN OSD1, TAT-BINDING HOMOLOG 11, YEAST MITOCHONDRIAL ESCAPE PROTEIN 1, YME1-N MVAVSHAMLATREQEANKDLTSPDAQAAFYKLLLQSNYPQYVVSRFETPGIASSPECMELYMEALQRIGRHSEADAVRQNLEHHHHHH 88 T 0.027 Imm49 pdbpssm F Eukaryota T 2mv7 2 B B DOT1L_HUMAN DOT1-LIKE PROTEIN, HISTONE H3-K79 METHYLTRANSFERASE, H3-K79-HMTASE, LYSINE N-METHYLTRANSFERASE 4 TNKLPVSIPLASVVLPSRAERARST 25 T 3.4 CCDC73 unphh F Eukaryota T 2mva 1 A A TX41A_SCOMU RhTx toxin LNNPCNGVTCPSGYRCSIVDKQCIKKE 27 T 0.018 Secretogranin_V unppssm F Eukaryota T 2mvt 1 A A TX31A_SCOSD Scoloptoxin SSD609 ADDKCEDSLRREIACTKCRDRVRTDDYFYECCTSESTFKKCQTMLHQ 47 T 2.5 DUF2614 unphh F Eukaryota T 2mw3 1 A A A0A0C2JEQ8_9ACTN Lasso peptide SLGSSPYNDILGYPALIVIYP 21 T 0.0013 DUF5972 unp F Bacteria T 2mw7 1 A A A0A0R4I952_CONMO Mo3964 DGECGDKDEPCCGRPDGAKVCNDPWVCILTSSRCENP 37 T 0.19 Sin3_corepress pdb F Eukaryota T 2mwi 1 A A TDIF1_HUMAN TERMINAL DEOXYNUCLEOTIDYLTRANSFERASE-INTERACTING FACTOR 1, TDIF1, TDT-INTERACTING FACTOR 1 GAREGPKWDPARLNESTTFVLGSRANKALGMGGTRGRIYIKHPHLFKYAADPQDKHWLAEQHHMRATGGKMAYLLIEEDIRDLAASDDYRGCLDLKLEELKSFVLPSWMVEKMRKYMETLRT 122 T 5.7E-06 CRC_subunit pdbhh F Eukaryota T 2mwl 1 A A antimicrobial peptide VARGWKRKCPLFGKGG 16 T 0.0078 Flavi_glycoprot pdbhh F T 2mwn 1 A A AB1IP_HUMAN APBB1-INTERACTING PROTEIN 1, PROLINE-RICH EVH1 LIGAND 1, PREL-1, PROLINE-RICH PROTEIN 73, RAP1-GTP-INTERACTING ADAPTER MOLECULE, RIAM, RETINOIC ACID-RESPONSIVE PROLINE-RICH PROTEIN 1, RARP-1 DIDQMFSTLLGEMDLLTQSLGVDT 24 T 1.9 Drf_DAD pdbhh F Eukaryota T 2mwo 2 B B P53_HUMAN P53K370ME2, ANTIGEN NY-CO-13, PHOSPHOPROTEIN P53, TUMOR SUPPRESSOR P53 RAHSSHLKSKKGQST 15 T 8.2 RE_NgoPII pdbhh F Eukaryota T 2mwp 2 B B P53_HUMAN P53K382ME2, ANTIGEN NY-CO-13, PHOSPHOPROTEIN P53, TUMOR SUPPRESSOR P53 STSRHKKLMFKT 12 T 34 DUF420 pdbhh F Eukaryota T 2mwt 1 A A CAMP_CRODU Cathelicidin-like peptide KRFKKFFKKVKKSVKKRLKKIFKKPMVIGVTIPF 34 T 0.0032 Sigma70_ner unp F Eukaryota T 2mxx 1 A A A8AZZ3_STRGC Amylase-binding protein AbpA MADEATDAARNNDGAYYLQTQFTNADKVNEYLAQHDGEIRAEAAADPAVVAAKAALDAVEGGSHNYGEVKAAYEAAFNNAFNAVRNKYVQRFQATYNNATEQEGKTYIQGETPEQANARYLKRVGAANNQNPAAEDKGATTPASKEEAKKSEAAAKNAGKAAGKALPKTSAVKHHHHHH 179 T 0.089 DUF3752 pdbpssm F Bacteria T 2myh 1 A A A0A0G3F8Z3_9ARAC Omega-Tbo-IT1 toxin CASKNERCGNALYGTKGPGCCNGKCICRTVPRKGVNSCRCM 41 T 0.012 Conotoxin unphh F Eukaryota T 2myv 1 A A Q8J180_MAGGR Uncharacterized protein APQDNTSMGSSHHHHHHSSGRENLYFQGHMAWKDCIIQRYKDGDVNNIYTANRNEEITIEEYKVFVNEACHPYPVILPDRSVLSGDFTSAYADDDESC 98 T 5.4 Ceramidase_alk unphh F Eukaryota T 2myw 1 A A B9WZW9_MAGOR AVR-Pia protein APQDNTSMGSSHHHHHHSSGRENLYFQGHMAAPARFCVYYDGHLPATRVLLMYVRIGTTATITARGHEFEVEAKDQNCKVILTNGKQAPDWLAAEPY 97 T 0.012 Pirin_C unppssm F Eukaryota T 2mz0 1 A A DEF32_ARATH Defensin-like protein 32 KDIDGRKPLLIGTCIEFPTEKCNKTCIESNFAGGKCVHIGQSLDFVCVCFPKYYI 55 T 0.00026 Gamma-thionin unppssm F Eukaryota T 2mz6 1 A,B A,B PG3_PIG PG-3 RGGGLCYCRRRFCVCVGR 18 T 0.16 Defensin_1 pdbhh F Eukaryota T 2n01 1 A A Q8PJB3_XANAC VirB7 protein XTKPAPDFGGRWKHVNHFDEAPTEX 25 T 0.056 BNR_6 pdbhh F Bacteria T 2n08 1 A A Short hydrophobic peptide with cyclic constraints HAEGTFTSDFFX 12 T 0.00035 Hormone_2 pdbhh F T 2n09 1 A A Short hydrophobic peptide with cyclic constraints HXEGTFTSDFFX 12 T 0.00035 Hormone_2 pdbhh F T 2n0i 1 A A di-sulfide 11mer peptide HXEGXFTSDFXX 12 T 0.019 Hormone_2 pdbhh F T 2n0n 1 A A lactam (5,9) 11mer peptide HXEGKFTSEFXX 12 T 0.032 Hormone_2 pdbhh F T 2n0o 1 A A ALBO1_HYPAB HY-A1 IFGAILPLALGALKNLIKX 19 T 4 SH3_7 unphh F Eukaryota T 2n0v 1 A A CN-AMP1 SVAGRAQGMX 10 T 10 Dirigent pdbhh F T 2n0y 2 B B NSS_RVFVZ Non-structural protein NS-S GGGGYDVEMESEEESDDDGFVEVD 24 T 0.5 LRR19-TM pdbhh T Viruses T 2n0z 1 A A MYO6_HUMAN Unconventional myosin-VI GPLGSPNSGTKKYDLSKWKYAELRDTINTSCDIELLAACREEFHRRLKVYH 51 T 3 Caldesmon unphh F Eukaryota T 2n10 1 A A MYO6_HUMAN UNCONVENTIONAL MYOSIN-6 GPLGSPNSGTKKYDLSKWKYAELRDTINTSCDIELLAACREEFHRRLKVYHAWKSKNKKR 60 T 3 Caldesmon unphh F Eukaryota T 2n11 1 A A MYO6_HUMAN UNCONVENTIONAL MYOSIN-6 QQQAVLEQERRDRELALRIAQSEAELISDEAQADLALRRSLDSYPVSKNDGTRPKMTPEQMAKEMSEFLSRGPA 74 T 0.00031 BUD22 unp F Eukaryota T 2n12 1 A A MYO6_HUMAN UNCONVENTIONAL MYOSIN-6 RPKMTPEQMAKEMSEFLSRGPAVLATKAAAGTKKYDLSKWKYAELRDTINTSCDIELLAACREEFHRRLKVYHAWKSKNKKR 82 T 0.027 BUD22 unppercent F Eukaryota T 2n13 1 A,D A,D MYO6_HUMAN UNCONVENTIONAL MYOSIN-6 GTKKYDLSKWKYAELRDTINTSCDIELLAACREEFHRRLKVYH 43 T 3 Caldesmon unphh F Eukaryota T 2n16 1 A A DHX36_HUMAN DEAH BOX PROTEIN 36, G4-RESOLVASE 1, G4R1, MLE-LIKE PROTEIN 1, RNA HELICASE ASSOCIATED WITH AU-RICH ELEMENT ARE SMHPGHLKGREIGMWYAKKQ 20 T 2.6 PsaL pdbhh F Eukaryota T 2n1p 1 A A POLG_HCVH Non-structural protein 5B, NS5B HSVSHARPRWFWFSLLLLAAGVGIYLLPNR 30 T 0.081 BSMAP pdbhh T Viruses T 2n24 1 A A O2VC1_CONVC O2_contryphan_Vc1 QWCQPGYAYNPVLGICTITLSRIEHPGNYDY 31 T 1.4 ANATO pdbhh F Eukaryota T 2n2a 1 A,B A,B ERBB2_HUMAN METASTATIC LYMPH NODE GENE 19 PROTEIN, MLN 19, PROTO-ONCOGENE NEU, PROTO-ONCOGENE C-ERBB-2, TYROSINE KINASE-TYPE CELL SURFACE RECEPTOR HER2, P185ERBB2 AEQRASPLTSIISAVVGILLVVVLGVVFGILIKRRQQKIRKYTMRRLLQETELVEPLG 58 T 0.0017 Mucin15 pdbhh F Eukaryota T 2n2c 1 A A TADBP_HUMAN TDP-43 MGGGMNFGAFSINPAMMAAAQAALQSSWGMMGMLASQQNQSGP 43 T 0.0072 Glucosaminidase pdb F Eukaryota T 2n2f 1 A A PDYN_HUMAN PROENKEPHALIN-B, BETA-NEOENDORPHIN-DYNORPHIN, PREPRODYNORPHIN YGGFLRRIRPKLK 13 T 0.025 Op_neuropeptide pdbhh F Eukaryota T 2n2g 1 A A A0A1A9T938_9METZ Asteropsin_F CPGEGEECDVEFNPCCPPLTCIPGDPYGICYII 33 T 0.00039 Tachystatin_A pdbhh F Eukaryota T 2n2h 1 A A SDS3_MOUSE SUPPRESSOR OF DEFECTIVE SILENCING 3 PROTEIN HOMOLOG SNAAQLNYLLTDEQIMEDLRTLNKLKS 27 T 2.9 DUF1639 pdbhh F Eukaryota T 2n2j 1 A,B A,B EBNA2_EBVB9 EBNA-2, EBV NUCLEAR ANTIGEN 2 GAMEMPTFYLALHGGQTYHLIVDTDSLGNPSLSVIPSNPYQEQLSDTPLIPLTIFVGENTGV 62 T 2.4 Swi6_N pdbhh T Viruses T 2n2s 1 A A A0A182DV16_9SPIT pheromone Ep-1 SCGSECAPEPDCWGCCLVQCAPSICAGWCGGS 32 T 1.4 DUF3079 pdbhh F Eukaryota T 2n2t 1 A A OR303 MGQWQIKIYSENEREFRELIERLEEERPSVQYTETTRNGRRQLTIRSNDKNEVDRILEEVRRKVPNARVRETETGSLEHHHHHH 84 T 0.025 F_actin_bind pdb F T 2n2u 1 A A OR358 MVDLKIDVSDDEEAEKIIREIREQWPKATVTRTNGDIKLDAQTEKEAEKMEKAVKKVKPNATIRKTGGSLEHHHHHH 77 T 0.0075 MmoB_DmpM pdb F T 2n31 1 A A TOLIP_HUMAN Toll interacting protein variant GPLGSMATTVSTQRGPVYIGELPQDFLRITPTQQQRQVQLDAQAAQQLQYGGAVGTVG 58 T 0.11 RNA_pol_Rpb1_1 unppssm F Eukaryota T 2n37 1 A A B9WZW9_MAGOR AVR-Pia protein APARFCVYYDGHLPATRVLLMYVRIGTTATITARGHEFEVEAKDQNCKVILTNGKQAPDWLAAEPY 66 T 0.012 Pirin_C unppssm F Eukaryota T 2n3a 1 A A POGZ_HUMAN SUPPRESSOR OF HAIRY WING HOMOLOG 5, ZINC FINGER PROTEIN 280E, ZINC FINGER PROTEIN 635 EGESETESFYGFEEAD 16 T 1.1 Sororin pdbhh F Eukaryota T 2n3p 1 A A A0A1A9T940_9METZ Asteropsin_G QWCAEEGESCEVYPCCDGLICYPTFPEPICGV 32 T 0.0075 Tachystatin_A pdbhh F Eukaryota T 2n3x 1 A A TADBP_HUMAN TDP-43 MNFGAFSINPAMMAAAQAALQSSWGMMGMLASQQNQSGPSGNNQNQGNMQ 50 T 0.0075 Glucosaminidase pdbpercent F Eukaryota T 2n3z 1 A A OR446 MGRLVVVVTSEQLKEEVRKKFPQVEVRLVTTEEDAKQVIKEIQKKGVQKVVLVGVSEKLLQKIKQEANVQVYRVTSNDELEQVVKDVKGSGLEHHHHHH 99 T 0.00041 PrpR_N pdbpssm F T 2n4g 1 A A TADBP_HUMAN TDP-43 MNFGAFSINPAMMAAAQAALQSSWDMMGMLASQQNQSGPSGNNQNQGNMQ 50 T 0.0075 Glucosaminidase pdbpercent F Eukaryota T 2n4h 1 A A TADBP_HUMAN TDP-43 MNFGAFSINPAMMAAAQAALQSSWGMMGMLASRQNQSGPSGNNQNQGNMQ 50 T 0.0075 Glucosaminidase pdbpercent F Eukaryota T 2n4n 1 A A DESIGNED BETA SHEET XERFYEKXXVQKFIRVXGVTIREKX 25 T 15 DUF3692 pdbhh F T 2n4q 1 A A CBX8_HUMAN POLYCOMB 3 HOMOLOG, PC3, HPC3, RECTACHROME 1 TQGGRPSLIARIPVARILGDPEEE 24 T 86 VARLMGL pdbhh F Eukaryota T 2n5c 1 A A A0A0F7VRL1_9ACTN chaxapeptin GFGSKPLDSFGLNFF 15 T 16 CHB_HEX_C pdbhh F Bacteria T 2n5d 1 A A A4PHN0_STRVG fusion protein of two PKS domains GPGSYTGAGEPSQADLDALLSAVRDNRLSIEQAVTLLTPRRGGGSGGGSMDAKEILTRFKDGGLDRAAAQALLAGRTPAAAPRP 84 T 0.56 VbhA pdbhh F Bacteria T 2n5q 1 A A A0A0S2KUN2_9LAMI cysteine-rich peptide jS1 QLCLQCRSNSDCNIIWRICRDGCCNVI 27 T 4.4 ACI44 unphh F Eukaryota T 2n5s 1 A A EGFR_HUMAN PROTO-ONCOGENE C-ERBB-1, RECEPTOR TYROSINE-PROTEIN KINASE ERBB-1 GSCKIPSIATGMVGALLLLLVVALGIGLFMRRRHIVRKRTLRRLLQERELVEGG 54 T 0.0005 GAPT pdb F Eukaryota T 2n5w 1 A A Octyl-tridecaptin A1 XXGSWSXXFEVXA 13 T 12 DUF5626 pdbhh F T 2n5y 1 A A Octyl-tridecaptin A1 XXGXXSXXFEVXA 13 T 12 DUF5626 pdbhh F T 2n67 1 A B Q81AN8_BACCR Hemolysin II DNQKALEEQMNSINSVNDKLNKGKGKLSLSMNGNQLKATSSNAGYGISYEDKNWGIFVNGEKVYTFNEKSTVGNISNDINKLNIKGMYIEIKQI 94 T 0.0097 Gal-bind_lectin pdbpercent F Bacteria T 2n69 1 A A DEF_PENBA BRAZZEIN QDKCKKVYENYPVSKCQLRIANQCNYDCKLDKHARSGECFYDEKRNLQCICDYCEY 56 T 0.00073 Toxin_3 pdb F Eukaryota T 2n6h 1 A A designed 2-stranded parallel beta-sheet XERFYEKXXVQKFIRX 16 T 3.8 DUF3692 pdbhh F T 2n6i 1 A A designed 2-stranded parallel beta-sheet XQKFIRVXGVTIREKX 16 T 7.6 TIP41 pdbhh F T 2n6n 1 A A TXAG4_AGEOR U4-AGTX-AO1A, MU-2AAGA_15 GYCAEKGIKCHNIHCCSGLTCKCKGSSCVCRK 32 T 0.04 Toxin_7 pdbpercent F Eukaryota T 2n6u 1 A A E8RUP9_ASTEC Astexin2-dC4 GLTQIQALDSVSGQFRDQLG 20 T 7.9 BCMA-Tall_bind unphh F Bacteria T 2n72 1 A A GCP60_HUMAN ACYL-COA-BINDING DOMAIN-CONTAINING PROTEIN 3, GOLGI COMPLEX-ASSOCIATED PROTEIN 1, GOCAP1, GOLGI PHOSPHOPROTEIN 1, GOLPH1, PBR- AND PKA-ASSOCIATED PROTEIN 7, PERIPHERAL BENZODIAZEPINE RECEPTOR-ASSOCIATED PROTEIN PAP7 MQQKQQIMAALNSQTAVQFQQYAAQQYPGNYEQQQILIRQLQEQHYQQYMQQLYQVQLAQQQAALQKQQ 69 T 0.011 Sulfatase pdbpercent F Eukaryota T 2n73 2 B B PI4KB_HUMAN Phosphatidylinositol 4-kinase beta GAMVEARSLAVAMGDTVVEPAPLKPTSEPTSGPPGNNGGSLLSVITEGVGELSVIDPEVAQKACQEVLEKVKLLHGGVAV 80 T 47 Pik1 pdbhh F Eukaryota T 2n77 2 B B PCP4_HUMAN BRAIN-SPECIFIC ANTIGEN PCP-4, BRAIN-SPECIFIC POLYPEPTIDE PEP-19 MAERQGAGATNGKDKTSGENDGQKKVQEEFDIDMDAPETERAAVAIQSQFRKFQKKKAGSQS 62 T 0.0063 IQ pdbhh F Eukaryota T 2n7f 1 A A muO-conotoxin MfVIA RDCQEKWEYCIVPILGFVYCCPGLICGPFVCV 32 T 0.00046 Conotoxin pdbhh F T 2n7i 1 A A PRLR_HUMAN PRL-R GSFTMNDTTVWISVAVLSAVICLIIVWAVALKGYSMV 37 T 0.00011 IFNGR1 unphh F Eukaryota T 2n7n 1 A A Peptide PG-989 XXDPPXRWKX 10 T 1.1 DUF2678 pdbhh F T 2n7t 1 A A Peptide PG-992 XXDWPXRWKX 10 T 1.5 Xpo1 pdbhh F T 2n85 1 A A SPN1A_OXYTA OTTX1A KFKWGKLFSTAKKLYKKGKKLSKNKNFKKALKFGKQLAKNL 41 T 0.0033 Latarcin unphh F Eukaryota T 2n86 1 A A SPN1A_OXYTA OTTX1A GTPVGNNKCWAIGTTCSDDCDCCPEHHCHCPAGKWLPGLFRCTCQVTESDKVNKCPPAE 59 T 1.6 DUF5814 pdbpssm F Eukaryota T 2n8d 1 A A antimicrobial peptide Lavracin WDPYFAGVKKLTKAILAVRAX 21 T 8.8 YceD pdbhh F T 2n8j 2 B B NOS3_HUMAN CONSTITUTIVE NOS, CNOS, EC-NOS, ENDOTHELIAL NOS, ENOS, NOS TYPE III, NOSIII TRKKTFKEVANAVKISASLMGT 22 T 4.1 DUF2774 pdbhh F Eukaryota T 2n9a 1 A A DCRLN_OREDC Decoralin SLLSLIRKLITX 12 T 3.2 BDV_P10 unphh F Eukaryota T 2n9e 1 A A UIMC1_HUMAN RECEPTOR-ASSOCIATED PROTEIN 80, RETINOID X RECEPTOR-INTERACTING PROTEIN 110, UBIQUITIN INTERACTION MOTIF-CONTAINING PROTEIN 1 XEDAFIVISDSDGEX 15 T 0.064 MLIP pdbhh F Eukaryota T 2n9x 2 B B FUND1_HUMAN FUN14 domain-containing protein 1 DYESDDDSYEVLDLTEY 17 T 1.2 DUF6327 unphh F Eukaryota T 2n9z 1 A A DKTX_HAPSC TAU-TRTX-HS1A, DOUBLE-KNOT TOXIN, DKTX DCAKEGEVCSWGKKCCDLDNFYCPMEFIPHCKKYKPYVPVTT 42 T 0.15 Ragweed_pollen pdbhh F Eukaryota T 2na6 1 A,B,C A,B,C TNR6_MOUSE APO-1 ANTIGEN, APOPTOSIS-MEDIATING SURFACE ANTIGEN FAS, FASLG RECEPTOR RNRLWLLTILVLLIPLVFIYRKYRKRKS 28 T 0.093 DAG1 pdbhh F Eukaryota T 2na7 1 A,B,C A,B,C TNR6_HUMAN APO-1 ANTIGEN, APOPTOSIS-MEDIATING SURFACE ANTIGEN FAS, FASLG RECEPTOR RSNLGWLSLLLLPIPLIVWVKRKEVQKT 28 T 0.027 KdpC unppercent F Eukaryota T 2na8 1 A A IL3RB_HUMAN CDW131, GM-CSF/IL-3/IL-5 RECEPTOR COMMON BETA SUBUNIT GKRSWDTESVLPMWVLALIVIFLTIAVLLALRFCGIYGYRLRRK 44 T 0.0006 Interfer-bind unppssm F Eukaryota T 2na9 1 A A IL3RB_HUMAN CDW131, GM-CSF/IL-3/IL-5 RECEPTOR COMMON BETA SUBUNIT GKRSWDTESVLAMWVLALIVIFLTIAVLLALRFCGIYGYRLRRK 44 T 0.0006 Interfer-bind unppssm F Eukaryota T 2nae 1 A A CD28_MOUSE T-cell-specific surface glycoprotein CD28 GTNSRRNRLLQSDYMNMTPRRPGLTRKPYQPYAPARDFAAYRP 43 T 0.0075 LAX unppercent F Eukaryota T 2naj 1 A A DKTX_HAPSC TAU-TRTX-HS1A, DOUBLE-KNOT TOXIN, DKTX NCAKEGEVCGWGSKCCHGLDCPLAFIPYCEKYR 33 T 0.0099 Conotoxin_I2 pdb F Eukaryota T 2nal 1 A A Retro-KR-12 RLFDKIRQVIRK 12 T 6.3 TnpW pdbhh F T 2nau 1 A A entity KYEITTIHNLARKLTHRLARRNAGATLR 28 T 2.7 CbtA_toxin pdbhh F T 2nb2 1 A A A0A1S4NYD1_NIGSA nigellin-1.1 DRYQDCLSECNSRCTYIPDYAGMRACIGLCAPACLTSR 38 T 0.006 TIL pdb F Eukaryota T 2nb5 1 A A A0A023GYK7_PERMO Preproalbumin PawS1 GDCYWTSTPPFFTCTPD 17 T 1.8 ESCRT-II pdbhh F Eukaryota T 2nb6 1 A A A0A023GYI2_GALQU Preproalbumin PawS1 GCYPVPYPPFFTCDPN 16 T 0.1 ESCRT-II pdbhh F Eukaryota T 2nbc 1 A A PON1A_ANOEM poneritoxin WCASGCRKKRHGGCSCX 17 T 0.025 Fib_alpha unphh F Eukaryota T 2nbi 1 A A O22015_CYLFU HEP200 protein QPSDLNPSSQPSECADVLEECPIDECFLPYSDASRPPSCLSFGRPDCDVLPTPQNINCPRCCATECRPDNPMFTPSPDGSPPICSPTMLPTNQPTPPEPSSAPSDCGEVIEECPLDTCFLPTSDPARPPDCTAVGRPDCDVLPFPNNLGCPACCPFECSPDNPMFTPSPDGSPPNCSPTMLPTPQPSTPTVITSPAPSSQPSQCAEVIEQCPIDECFLPYGDSSRPLDCTDPAVNRPDCDVLPTPQNINCPACCAFECRPDNPMFTPSPDGSPPICSPTMMPSPEPSSQPSDCGEVIEECPIDACFLPKSDSARPPDCTAVGRPDCNVLPFPNNIGCPSCCPFECSPDNPMFTPSPDGSPPNCSPTMLPSPSPSAVTVPLTPAPSSAPTRQPSSQPTGPQPSSQPSECADVLELCPYDTCFLPFDDSSRPPDCTDPSVNRPDCDKLSTAIDFTCPTCCPTQCRPDNPMFSPSPDGSPPVCSPTMMPSPLPSPTE 494 T 8.8 Mito_fiss_reg unphh F Eukaryota T 2nbl 1 A A Designed beta-arch XTEIRVXGVTIRMRXSHXFWVQVXXKEFKHX 31 T 2.3 RisS_PPD pdbhh F T 2nc7 1 A A PG5_PIG PG-5 RGGRLCYCRPRFCVCVGR 18 T 0.0091 Tmpp129 pdbhh F Eukaryota T 2ncz 2 B B NSD3_HUMAN NUCLEAR SET DOMAIN-CONTAINING PROTEIN 3, PROTEIN WHISTLE, WHSC1-LIKE 1 ISOFORM 9 WITH METHYLTRANSFERASE ACTIVITY TO LYSINE, WOLF-HIRSCHHORN SYNDROME CANDIDATE 1-LIKE PROTEIN 1, WHSC1-LIKE PROTEIN 1 EIKLKITKTIQN 12 T 18 GIT1_C pdbhh F Eukaryota T 2nd0 2 B B LANA1_HHV8P LANA NLQSSIVKFKKPLPLTQPG 19 T 0.00062 EBV-NA1 unphh T Viruses T 2nd1 2 B B NSD3_HUMAN NUCLEAR SET DOMAIN-CONTAINING PROTEIN 3, PROTEIN WHISTLE, WHSC1-LIKE 1 ISOFORM 9 WITH METHYLTRANSFERASE ACTIVITY TO LYSINE, WOLF-HIRSCHHORN SYNDROME CANDIDATE 1-LIKE PROTEIN 1, WHSC1-LIKE PROTEIN 1 VVPKKKIKKEQVE 13 T 7.7 S1FA pdbhh F Eukaryota T 2nd2 1 A A De novo mini protein HHH_06 APCEDLKERLKKLGMSEECRQRLEKMCKEGTSEDAERMARNCES 44 T 0.66 VMAP-M14 pdbhh F T 2nd3 1 A A De novo mini protein EEH_04 QCYTFRSECTNKEFTVCRPNPEEVEKEARRTKEEECRK 38 T 0.047 YTV pdb F T 2nd4 1 A A I1ZJ30_STRPA Amylase-binding protein AbpA GENPSASNQLIQKKYVSWRDAADEANTQVAAHEAEIKEETLRQPGVVAAQQALDKANAIVGHDHEQAVKRAQEDYNTAYNEAYNTVRNRYIQVLQQKYIEAAKAQGNYYDETAVEANRTNEQRIADDIKAQTGKDVTVTKDENGNYVVKDEKGNVVATVDKDGKTVKADAKAG 173 T 0.0016 DUF4988 pdbpssm F Bacteria T 2ndc 1 A A CTHL5_BOVIN ANTIBACTERIAL PEPTIDE BMAP-28, MYELOID ANTIBACTERIAL PEPTIDE 28 GGLRSLGRKILRAWKKYG 18 T 0.85 Fungal_KA1 pdbhh F Eukaryota T 2ndd 1 A A KKX51_HETLA HELATX1 SCKKECSGSRRTKKCMQKCNREHGHX 26 T 0.023 ETRAMP unp F Eukaryota T 2nde 1 A A CTHL5_BOVIN ANTIBACTERIAL PEPTIDE BMAP-28, MYELOID ANTIBACTERIAL PEPTIDE 28 IGLRGLGRKIALIHKKYG 18 T 2.3 IMS_HHH pdbhh F Eukaryota T 2ndi 1 A A Q4PN35_IXOSC Putative secreted salivary protein GLCSENGDCAADECCVDTVFEGDMVTRSCEKTTGNFTECPGLTPIA 46 T 0.037 Conotoxin_I2 unppercent F Eukaryota T 2ndl 1 A A PawS derived peptide GPCFPMGPWGPFCIPD 16 T 0.35 Psg1 pdbhh F T 2ndm 1 A A A0A1C7D043_9ASTR PawS derived peptide 21 GRPCYTLQSCFPD 13 T 1.3 Comm pdbhh F Eukaryota T 2ndn 1 A A A0A0A0V2B6_9ASTR PawS1a Derived Peptide 20 GICFKDPFGSTLCAPD 16 T 0.99 C_GCAxxG_C_C pdbhh F Eukaryota T 2nm1 2 B B SYT2_RAT SYTII EDMFAKLKDKFFNEINK 17 T 0.027 DUF4713 unphh F Eukaryota T 2nmb 2 B B PROTEIN (GPPY PEPTIDE) AYIGPXL 7 T 0.29 Crl pdbhh F T 2nou 1 A A TKN1_SCYCA Scyliorhinin I AKFDKFYGLM 10 T 0.00029 Tachykinin pdbhh F Eukaryota T 2np0 2 B B SYT2_MOUSE SYNAPTOTAGMIN II, SYTII GESQEDMFAKLKEKFFNEINK 21 T 0.34 Alpha_E2_glycop unphh F Eukaryota T 2nr5 1 A,B,C,D,E,F,G,H A,B,C,D,F,G,H,E Q8EDS4_SHEON Hypothetical protein SO2669 SNAMMTKKERIAIQRSMAEEALGKLKAIRQLCGAEDSSDSSDMQEVEIWTNRIKELEDWLWGESPIA 67 T 0.031 DUF2385 unp F Bacteria T 2ns4 1 A A L-22 CYCLIC PEPTIDE RVRTRKGRRIRIPP 14 T 0.24 DUF2835 pdbhh F T 2ns8 2 E,F,G,H,I H,E,F,G,Z 16 residue peptide Tip (Transcription inducing peptide) XWTWNAYAFAAPSGGGS 17 T 4.1 DUF3710 pdbhh F T 2nsw 1 A A MEN2_EUPNO Mating pheromone En-2 DIEDFYTSETCPYKNDSQLAWDTCSGGTGNCGTVCCGQCFSFPVSQSCAGMADSNDCPNA 60 T 30 Inhibitor_I67 pdbhh F Eukaryota T 2nwn 2 B B upain-1 CSWRGLENHRMC 12 T 0.95 DUF2632 pdbhh F T 2nx6 1 A A Q8IT70_HYDVU Nematocyst outer wall antigen GSSSCPQFPSCSPSCAPQCSQQCCQQP 27 T 0.0015 C_tripleX pdbhh F Eukaryota T 2nx7 1 A A Q8IT70_HYDVU Nematocyst outer wall antigen AQNPCSLQQPGCSSACAPACRLSCCSLG 28 T 0.13 C_tripleX pdbhh F Eukaryota T 2nxd 2 C P Analogue of RT-RH pol protease substrate peptide GADIFYLDGA 10 T 3.5 XPG_N pdbhh F T 2nxl 2 C P Analogue of RT-RH pol protease substrate peptide GAEVFYVDGA 10 T 1.8 MHC_II_alpha pdbhh F T 2nxm 2 C P Analogue of RT-RH pol protease substrate peptide GAQTFYVDGA 10 T 2.4 BOFC_N pdbhh F T 2o02 2 C,D P,Q ExoS (416-430) peptide GHGQGLLDALDLAS 14 T 0.58 CTP_transf_1 pdbhh F T 2o0s 1 A A YW12 YVLWKRKRMIFI 12 T 6.7 Transport_MerF pdbhh F T 2o5g 2 B B MYLK_CHICK Smooth muscle Myosin light chain kinase peptide XARRKWQKTGHAVRAIGRLSX 21 T 13 PACT_coil_coil pdbhh F Eukaryota T 2o60 2 B B NOS1_MOUSE Peptide corresponding to calmodulin binding domain of neuronal nitric oxide synthase KRRAIGFKKLAEAVKFSAKLMGQX 24 T 0.094 EDR1 pdbpssm F Eukaryota T 2o6n 1 A A RH4B designed peptide XAEIEQAKKEIAYLIKKAKEEILEEIKKAKQEIAX 35 T 0.037 Endotoxin_C2 pdb F T 2o8z 1 A A cCRF(30-41) Peptide XEAHKNRKLMEIIX 14 T 0.01 CRF pdbhh F T 2o98 2 C,D P,Q PMA3_NICPL H-ATPASE PMA2 TNFNELNQLAEEAKRRAEIARQRELHTLKGHVESVVKLKGLDIETIQQSYDI 52 T 0.023 DUF4398 pdb F Eukaryota T 2obh 2 C,D C,D XPC_HUMAN XERODERMA PIGMENTOSUM GROUP C-COMPLEMENTING PROTEIN, P125 XNWKLLAKGLLIRERLKR 18 T 16 S1FA pdbhh F Eukaryota T 2od2 2 B B Acetylated H4 peptide KGGAXRHRKILTAQ 14 T 32 DUF4196 pdbhh F T 2od4 1 A,B A,B hypothetical protein GMFAGSIPMYIRVVSITAQSKLQFDMTVTYFENVWSPKVISLGAISAEFVQSNENSGMYIIHYPDKQTAISVFDKIKPEVDEVRTQNRIQITEGKRLFRVD 101 T 2.2 ABM pdbhh F T 2od6 1 A,B,C,D A,B,C,D hypothetical protein GMAEPKFTSFTTADFINDVDMELFIDAVEKTAPVWVKEMKSRGLLKFSMNRVWNKGEVFRVVMTYEYKDRASFEANIAYLEDTFGKNPVFLQLVTTAKFTTSRCLVVMEV 110 T 0.0042 DUF3906 pdbpercent F T 2od8 2 B B DNLI1_YEAST CDC9, POLYDEOXYRIBONUCLEOTIDE SYNTHASE AGKKPKQATLARFFTSMKNKPT 22 T 1.6 RXLR_WY pdbhh F Eukaryota T 2odd 1 A B NCOR2_HUMAN SMRT GSGSTISNPPPLISSAK 17 T 28 Connexin43 pdbhh F Eukaryota T 2ofq 2 B B Q79SE5_SALTM TraN PPPEPDWSNTVPVNKTIPVDTQ 22 T 0.12 Cag12 unphh F Bacteria T 2oi3 2 B B artificial peptide PD1 XHSKYPLPPLPSLX 14 T 9.6 DUF5855 pdbhh F T 2ojx 2 B E MPIP3_HUMAN Synthetic peptide LLCSTPNGL 9 T 2.1 DUF3038 pdbhh F Eukaryota T 2okr 2 B,D C,F MAPK2_HUMAN MAPK-ACTIVATED PROTEIN KINASE 2, MAPKAP KINASE 2, MAPKAPK-2, MK2 IKIKKIEDASNPLLLKRRKKARAL 24 T 0.52 DUF6278 pdbhh F Eukaryota T 2oob 1 A A CBLB_HUMAN SIGNAL TRANSDUCTION PROTEIN CBL-B, SH3-BINDING PROTEIN CBL-B, CASITAS B-LINEAGE LYMPHOMA PROTO-ONCOGENE B, RING FINGER PROTEIN 56 GSGPEAALENVDAKIAKLMGEGYAFEEVKRALEIAQNNVEVARSILREFAFP 52 T 0.00014 UBA pdbpssm F Eukaryota T 2op5 1 A,B,C,D,E,F A,B,C,D,E,F hypothetical protein GMKDTDETAFLNSLFMDFTSENELELFLKSLDEVWSEDLYSRLSAAGLIRHVISKVWNKEQHRISMVFEYDSKEGYQKCQEIIDKEFGITLKEKLKKFVFKIHNNRGVVVSEFIRST 117 T 0.052 Ion_trans_N pdbpercent F T 2oq9 1 A A A6N8P1_HYDVU Minicollagen-5 APMQAPVQAAPACMASCAPQCCGR 24 T 0.22 C_tripleX pdbhh F Eukaryota T 2oqj 3 C,F,I,L C,F,I,L peptide 2G12.1 (ACPPSHVLDMRSGTCLAAEGK) ACPPSHVLDMRSGTCLAAEGK 21 T 1.3 Glyco_hydro_65N pdbhh F T 2oru 1 A A xtz1-peptide KAWTWTWNPATGKWTWRKNE 20 T 0.31 LPD24 pdbhh F T 2os2 2 C,D C,D histone 3 peptide STGGVKKPHRY 11 T 7.1 UPF0715 pdbhh F T 2os6 2 B B PLXB1_HUMAN SEMAPHORIN RECEPTOR SEP VENKVTDL 8 T 0.41 TMCCDC2 pdbhh F Eukaryota T 2ot0 2 E,F,G,H E,F,G,H WASP_HUMAN WASP EDQAGDEDEDDEWDD 15 T 0.15 SMN pdbhh F Eukaryota T 2ovh 2 B B SMRT peptide TNMGLEAIIRKALMGKY 17 T 2.8 RuvA_C pdbhh F T 2ovm 2 B B NCoR GHSFADPASNLGLEDIIRKALMGSF 25 T 4.3 RuvA_C pdbhh F T 2ovq 3 C C cyclinE C-terminal degron LPSGLLTPPQSG 12 T 17 Cuticle_1 pdbhh F T 2ovr 3 C C cyclinE N-terminal degron SLIPTPDK 8 T 5 PH_18 pdbhh F T 2p05 1 A A a non-biological ATP binding protein 1819 GSMDYKDDDDKKTNWLKRIYRVRPCVKCKVAPRNWKVKNKHLRIYNMCKTCFNNSIDIGDDTYHGHDDWLMYADSKEISNT 81 T 0.011 ZZ pdbpercent F T 2p09 1 A A a non-biological ATP binding protein with two mutations N32D and D65V GSMDYKDDDDKKTNWLKRIYRVRPCVKCKVAPRDWKVKNKHLRIYNMCKTCFNNSIDIGDDTYHGHVDWLMYADSKEISNT 81 T 0.025 Glyco_tranf_2_4 pdbpssm F T 2p0w 2 C,D P,Q Histone peptide H4 KGGKGLGKGGAKRHR 15 T 130 DUF1884 pdbhh F T 2p0x 1 A A abiotic ATP-binding, folding optimized protein GSFRVKPCVVCKVAPRDWRVKNRHLRIYNMCKTCFNNSIKSGDDTYHGHVDWLMYTDAKEFSST 64 T 0.0067 ZZ pdbpercent F T 2p4r 2 B T ITCH_HUMAN ITCH, ATROPHIN-1-INTERACTING PROTEIN 4, AIP4, NFE2-ASSOCIATED POLYPEPTIDE 1, NAPP1 GGFKPSRPPRPSRPPPPTPRRPASV 25 T 3.5 UPF0449 pdbhh F Eukaryota T 2p5b 2 C,D I,J H3_URECA Histone H3 RKSAPATGGVKKPHRYRPGTVL 22 T 1.8 YlzJ pdbhh F Eukaryota T 2p5h 1 A A pip9 VDIHVWDGV 9 T 0.45 DUF4883 pdbhh F T 2p5j 1 A A pip17 LGRVDIHVWDGVYIRGR 17 T 0.24 DUF4883 pdbhh F T 2p6b 1 A E PVIVIT 14-mer Peptide GPHPVIVITGPHEEX 15 T 0.95 DUF4609 pdbhh F T 2p6j 1 A A designed engrailed homeodomain variant UVF MKQWSENVEEKLKEFVKRHQRITQEELHQYAQRLGLNEEAIRQFFEEFEQRK 52 T 0.0061 DUF72 pdb F T 2p8l 3 C C gp41 peptide ELLELDKWASLNW 13 T 4 Sex_peptide pdbhh F T 2p8p 3 C C gp41 peptide LELDKWASLWX 11 T 0.72 Chrome_Resist pdbhh F T 2p9w 1 A A Q6WIF3_MALSM Mal s 1 allergenic protein ALPDQIDVKVKNLTPEDTIYDRTRQVFYQSNLYKGRIEVYNPKTQSHFNVVIDGASSNGDGEQQMSGLSLLTHDNSKRLFAVMKNAKSFNFADQSSHGASSFHSFNLPLSENSKPVWSVNFEKVQDEFEKKAGKRPFGVVQSAQDRDGNSYVAFALGMPAIARVSADGKTVSTFAWESGNGGQRPGYSGITFDPHSNKLIAFGGPRALTAFDVSKPYAWPEPVKINGDFGTLSGTEKIVTVPVGNESVLVGARAPYAISFRSWDNWKSANIKKTKRSELQNSGFTAVADYYQGSEQGLYAVSAFFDNGAHGGRSDYPLYKLDNSIQNFHHHHHH 334 T 4.3E-07 MRJP pdbhh F Eukaryota T 2pbd 3 C V VASP_HUMAN VASP GPPPAPPLPAAQGPGGGGAGAPGLAAAIAGAKLRKVSKQEEAS 43 T 0.00045 DUF4106 unphh F Eukaryota T 2pg4 1 A,B A,B Q9YDN4_AERPE Uncharacterized protein GMDDETLRLQFGHLIRILPTLLEFEKKGYEPSLAEIVKASGVSEKTFFMGLKDRLIRAGLVKEETLSYRVKTLKLTEKGRRLAECLEKCRDVLGS 95 T 0.0017 HTH_27 unppercent F Archaea T 2pgc 1 A,B,C,D,E A,B,C,D,E uncharacterized protein GMSNINYVILTVASVDFSYRETMARLMSSYSKDLIDNAGAKGTRFGSIGTGDHAGSLIFIQFYDDLTGYQKALEIQSKSSVFKEIMDSGKANIYLRNISTSLPTKFEQSYEHPKYIVLTRAEAAMSDKDKFLNCINDTASCFKDNGALTLRFGNLLTGSNVGNYLLGVGYPSMEAIEKTYDELLAHSSYKELMTFAKVNMRNIIKIL 207 T 1.4E-05 DUF6039 pdbhh F T 2ph7 1 A,B A,B Y2093_ARCFU Uncharacterized protein AF_2093 GMDVEIVEELSKMLAGRKAVTEEEIRRKAIRCALKIMGARLVGIDAELIEDVTCSLIDCPITLKSLHFSEKVKIGDVLFYHPHVIKPEKEDFEQAYFEYKQSKKFLDAFDIMREVTDRFFEGYEAEGRYMRKYTKDGRNYYAFFSTIDDTFEDVDIHLRMVDEVDGDYVVIVPTENELNPFLKFFKQYSEDAKRAGLKIWVVNPDEKTIDPFIGYPKDFRLLKGFKNPKAAALVSAYWRVTVTDLD 246 T 7 DUF1882 unphh F Archaea T 2phk 2 B B MC-PEPTIDE RQMSFRL 7 T 20 OAM_dimer pdbhh F T 2pie 2 B F phosphopeptide ELKTERY 7 T 52 MvaI_BcnI pdbhh F T 2pld 2 B B PGFRB_HUMAN PHOSPHOPEPTIDE FROM PDGF DNDXIIPLPDPK 12 T 2 PA28_alpha pdbhh F Eukaryota T 2plx 2 B B TI_VERHE Peptide Inhibitor QCKVMCYAQRHSSPELLRRCLDNCEK 26 T 0.0098 DUF842 unp F Eukaryota T 2pqw 2 B B Histone H4 RHRKVLRDN 9 T 2.9 Phage_X pdbhh F T 2pr9 2 B P GBRG2_RAT GABA(A) receptor subunit gamma-2 peptide DEEYGYECLD 10 T 5.1 DUF5816 pdbhh F Eukaryota T 2pux 3 C C PAR3_MOUSE PAR-3, THROMBIN RECEPTOR-LIKE 2, COAGULATION FACTOR II RECEPTOR-LIKE 2 QNTFEEFPLSDIE 13 T 0.87 Hirudin pdbhh F Eukaryota T 2pv2 2 E,F E,F C-peptide NFTLKFWDIFRK 12 T 2.2 Fmp27_GFWDK pdbhh F T 2pv9 3 C C PAR4_MOUSE PAR-4, THROMBIN RECEPTOR-LIKE 3, COAGULATION FACTOR II RECEPTOR-LIKE 3 KSSDKPNPRGYPGKFCANDSDTLELP 26 T 25 Colicin_Ia pdbhh F Eukaryota T 2pw1 3 C C peptide epitope ELDKWNSL 8 T 1.9 Lar_restr_allev pdbhh F T 2pw2 3 C C peptide epitope ELDKWKSL 8 T 1.2 DUF4720 pdbhh F T 2pxy 5 E P Myelin basic protein (MBP)-peptide HSRGGASQYRPSQ 13 T 13 Tsg pdbhh F T 2q0n 2 B B Synthetic peptide RRRRRSWYFDG 11 T 0.35 CFIA_Pcf11 pdbhh F T 2q2k 2 B,C A,B O87365_STAAU Hypothetical protein MGSSHHHHHHSSGLVPGSHMDKKETKHLLKIKKEDYPQIFDFLENVPRGTKTAHIREALRRYIEEIGENP 70 T 0.051 PutA_N unppssm F Bacteria T 2q3y 2 B B NR0B2_HUMAN ORPHAN NUCLEAR RECEPTOR SHP, SMALL HETERODIMER PARTNER PAILYALLSS 10 T 6.6 NR_Repeat pdbhh F Eukaryota T 2q5y 2 B,D B,D NUP98_HUMAN Nuclear pore complex protein Nup96 SKYGLQD 7 T 15 DUF2683 pdbhh F Eukaryota T 2q82 1 A A Q94M07_9VIRU Core protein P7 MDFITDMSKNQRLELQNRLAQYETSLMVMSHNGDVPVITGFNVMRVTTMLDALKVELPAVAVLGDDAQDLAYVFGARPLAVGVNIIRVVDVPGQQPSALVDAELGALHEVSMVRVLNDIADEQLVKANM 129 T 15 C_Hendra pdbhh T Viruses T 2q8d 2 C,D F,G PEPTIDE RKSAPATGGVKKPHRY 16 T 31 DUF5976 pdbhh F T 2qas 2 B B C. crescentus ssrA peptide KKGRHGAANDNFAEEFAVAA 20 T 15 KCTD4_C pdbhh F T 2qbl 1 A A CPXA_PSEPU CAMPHOR 5-MONOOXYGENASE, P450CAM MTTETIQSNANLAPLPPHVPEHLVFDFDMYNPSNLSAGVQEAWAVLQESNVPDLVWTRCNGGHWIATRGQLIREAYEDYRHFSSECPFIPREAGEAYDFIPTSMDPPEQRQFRALANQVVGMPVVDKLENRIQELACSLIESLRPQGQCNFTEDYAEPFPIRIFMLLAGLPEEDIPHLKYLTDQMTRPDGSMTFAEAKEALYDYLIPIIEQRRQKPGTDAISIVANGQVNGRPITSDEAKRMCGLLLVTGLDTVVNFLSFSMEFLAKSPEHRQELIERPERIPAACEELLRRFSLVADGRILTSDYEFHGVQLKKGDQILLPQMLSGLDERENACPMHVDFSRQKVSHTTFGHGSHLCLGQHLARREIIVTLKEWLTRIPDFSIAPGAQIQHKSGIVSGVQALPLVWDPATTKAVHHHHHH 421 T 1.6E-05 p450 unppercent F Bacteria T 2qbn 1 A A CPXA_PSEPU CAMPHOR 5-MONOOXYGENASE, P450CAM MTTETIQSNANLAPLPPHVPEHLVFDFDMYNPSNLSAGVQEAWAVLQESNVPDLVWTRCNGGHWIATRGQLIREAYEDYRHFSSECPFIPREAGEAYDFIPTSMDPPEQRQFRALANQVVGMPVVDKLENRIQELACSLIESLRPQGQCNFTEDYAEPFPIRIFMLLAGLPEEDIPHLKYLTDQMTRPDGSMTFAEAKEALYDYLIPIIEQRRQKPGTDAISIVANGQVNGRPITSDEAKRMCGLLLVVGLDTVVNFLSFSMEFLAKSPEHRQELIERPERIPAACEELLRRFSLVADGRILTSDYEFHGVQLKKGDQILLPQMLSGLDERENACPMHVDFSRQKVSHTTFGHGSHLCLGQHLARREIIVTLKEWLTRIPDFSIAPGAQIQHKSGIVSGVQALPLVWDPATTKAVHHHHHH 421 T 1.6E-05 p450 unppercent F Bacteria T 2qbw 2 B B Polypeptide PQPVDSWV 8 T 4 Spt4 pdbhh F T 2qbx 2 C,D D,P antagonistic peptide SNEWIQPRLPQH 12 T 13 B3_4 pdbhh F T 2qc5 1 A A O87275_9STAP Streptogramin B lactonase GSEAWMNFYLEEFNLSIPDSGPYGITSSEDGKVWFTQHKANKISSLDQSGRIKEFEVPTPDAKVMCLIVSSLGDIWFTENGANKIGKLSKKGGFTEYPLPQPDSGPYGITEGLNGDIWFTQLNGDRIGKLTADGTIYEYDLPNKGSYPAFITLGSDNALWFTENQNNSIGRITNTGKLEEYPLPTNAAAPVGITSGNDGALWFVEIMGNKIGRITTTGEISEYDIPTPNARPHAITAGKNSEIWFTEWGANQIGRITNDNTIQEYQLQTENAEPHGITFGKDGSVWFALKCKIGKLNLNE 300 T 0.0004 SGL unppssm F Bacteria T 2qhr 3 C P VGP_EBOEC Envelope glycoprotein peptide VEQHHRRTDND 11 T 0.0011 SOG2 unppercent T Viruses T 2qiy 2 B,D C,D UBP3_YEAST UBIQUITIN THIOESTERASE 3, UBIQUITIN-SPECIFIC-PROCESSING PROTEASE 3, DEUBIQUITINATING ENZYME 3 GSASVTKLKNLKENSSNLIQLPLFINTTEAEFAAASVQRYELNMKALN 48 T 0.031 Caldesmon unppssm F Eukaryota T 2qki 4 G,H G,H compstatin XICVWQDWGAHRCTX 15 T 2 DX pdbhh F T 2qn6 3 C C IF2B_SULSO EIF-2-BETA, AIF2-BETA SSEKEYVEMLDRLYSKLP 18 T 0.8 DUF6103 pdbhh F Archaea T 2qos 2 B A CO8A_HUMAN COMPLEMENT COMPONENT 8 SUBUNIT ALPHA LRYDSTAERLY 11 T 9.3 MHC_II_beta pdbhh F Eukaryota T 2qqf 2 B B H4_YEAST Histone H4 KGGAXRHRKIL 11 T 11 Shadoo unppercent F Eukaryota T 2qrv 2 B,C,F,G B,C,F,G DNM3L_HUMAN DNA (cytosine-5)-methyltransferase 3-like GSMWRSQLKAFYDRESENPLEMFETVPVWRRQPVRVLSLFEDIKKELTSLGFLESGSDPGQLKHVVDVTDTVRKDVEEWGPFDLVYGATPPLGHTCDRPPSWYLFQFHRLLQYARPKPGSPRPFFWMFVDNLVLNKEDLDVASRFLEMEPVTIPDVHGGSLQNAVRVWSNIPAIRSRHWALVSEEELSLLAQNKQSSKLAAKWPTKLVKNCFLPLREYFKYFSTELTSSL 230 T 1E-09 DNA_methylase pdbhh F Eukaryota T 2qt5 2 C,D X,Y FRAS1 NNLQDGTEV 9 T 2.3 DUF4288 pdbhh F T 2r03 2 B B GAG_EIAVY p6-Gag NLYPDLSE 8 T 0.24 LSPR pdbhh T Viruses T 2r0l 4 D B HGFA_HUMAN Hepatocyte growth factor activator VQLSPDLLATLPEPASPGRQACGRRHKKRTFLRPR 35 T 2.4E-22 DUF316 unphh F Eukaryota T 2r0y 2 B B Histone H3 peptide TARKSTGGXAPRK 13 T 0.1 Sirohm_synth_M pdbpercent F T 2r0z 3 C Q GLUTAMATE RECEPTOR INTERACTING PROTEIN 1 AKFRHD 6 T 46 PIG-Y pdbhh F T 2r28 2 B,D C,D PP2BA_HUMAN CALMODULIN-DEPENDENT CALCINEURIN A SUBUNIT ALPHA ISOFORM, CAM-PRP CATALYTIC SUBUNIT AAARKEVIRNKIRAIGKMARVFSVL 25 T 3.3 7kD_DNA_binding pdbhh F Eukaryota T 2r3y 2 D,E,F D,E,F Synthetic peptide YWF DNRLGLVYWF 10 T 2.5 DUF3325 pdbhh F T 2r9b 2 B,D C,D peptide-based inhibitor KPFSXLQF 8 T 15 PPARgamma_N pdbhh F T 2r9q 2 E X Synthetic peptide 1 SNPACVA 7 T 0.58 MFA1_2 pdbhh F T 2r9q 3 F Y Synthetic peptide 2 VEVPLAGAV 9 T 24 BAMBI_C pdbhh F T 2rfi 2 C,D P,Q Histone H3 TKQTARKSTGG 11 T 2.2 Histone pdbhh F T 2rje 2 D,E P,Q H4_HUMAN Histone H4 AKRHRKVLRDN 11 T 0.27 UPF0137 unp F Eukaryota T 2rjf 2 B,D B,D Histone H4 YKGGAKRHRKVLRDNIQGIT 20 T 8.6 DUF1938 pdbhh F T 2rky 2 C,D B,D FNBA_STAA8 STAPHYLOCOCCUS AUREUS FIBRONECTIN BINDING PROTEIN, FNBP NEKNGPIIQNNKFEYKEDTIKET 23 T 6.9 IPU_b_solenoid pdbhh F Bacteria T 2rkz 2 G,H,I,J,K,L M,N,O,P,Q,R FNBA_STAA8 FNBPA XETLTGQYDKNLVTTVEEEYDSX 23 T 40 DUF1372 pdbhh F Bacteria T 2rl0 2 B,D,F,H,J,L G,C,E,H,J,L FNBA_STAA8 STAPHYLOCOCCUS AUREUS FIBRONECTIN BINDING PROTEIN, FNBP GQVTTESNLVEFDEESTK 18 T 0.015 Fn_bind unppssm F Bacteria T 2rlg 1 A A antimicrobial peptide RP-1 ALYKKFKKKLLKSLKRLG 18 T 4 NAC pdbhh F T 2rlj 1 A A VGP_EBOZM Envelope glycoprotein GAAIGLAWIPYFGPAA 16 T 0.89 DUF4855 pdbhh T Viruses T 2rll 1 A A CCR5_HUMAN C-C CKR-5, CC-CKR-5, CCR-5, CCR5, HIV-1 FUSION CORECEPTOR, CHEMR13, CD195 ANTIGEN SPIYDINYY 9 T 3.1 Pico_P1A pdbhh F Eukaryota T 2rlw 1 A A P71469_LACPN BACTERIOCIN PEPTIDE PLNF VFHAYSARGVRNNYKSAVGPADWVISAVRGFIHG 34 T 0.0046 Bacteriocin_IIc unp F Bacteria T 2rmx 2 B B NKG2A_HUMAN NKG2-A/B-ACTIVATING NK RECEPTOR, NK CELL RECEPTOR A, CD159A ANTIGEN MDNQGVIXSDLNLPP 15 T 10 MAP pdbhh F Eukaryota T 2rny 2 B B H4_HUMAN Histone H4 GGAKRHRXVLRDNIQ 15 T 0.27 UPF0137 unp F Eukaryota T 2ror 2 B B LCP2_HUMAN SH2 DOMAIN-CONTAINING LEUKOCYTE PROTEIN OF 76 KDA, SLP-76 TYROSINE PHOSPHOPROTEIN, SLP76 GEDDGDXESPNEEEE 15 T 0.087 SDA1 unppercent F Eukaryota T 2rp5 1 A,B A,B CEP1_CAEEL TRANSCRIPTION FACTOR CEP-1 GPLGSHENCQSPSMKRSRCTNYSFRTLTLSTAEYTKVVEFLAREAKVPRYTWVPTQVVSHILPTEGLERFLTAIKAGHDSVLFNANGIYTMGDMIREFEKHNDIFERIGIDSSKLSKYYEAFLSFYRIQEAMKLPK 136 T 0.0011 SAM_2 pdbpssm F Eukaryota T 2rpa 1 A A KTNA1_MOUSE KATANIN P60 SUBUNIT A1, P60 KATANIN, LIPOTRANSIN GSDHMTMSLQMIVENVKLAREYALLGNYDSAMVYYQGVLDQMNKYLYSVKDTHLRQKWQQVWQEINVEAKQVKDIMKT 78 T 0.00078 MIT pdbpercent F Eukaryota T 2rpn 2 B B ARK1_YEAST Actin-regulating kinase 1 AKKTKPTPPPKPSHLKPK 18 T 9.6 Dynein_attach_N pdbhh F Eukaryota T 2rpq 2 B B MCAF1_HUMAN ATFA-ASSOCIATED MODULATOR, HAM, ATF-INTERACTING PROTEIN, ATF-IP, MBD1-CONTAINING CHROMATIN-ASSOCIATED FACTOR 1, P621 GSPEFKTIDASVSKKAADSTSQCGKATGSDSSGVIDLTMDDEESGASQD 49 T 18 Tox-PLDMTX pdbhh F Eukaryota T 2rps 1 A A B7XBA7_MYTSE Chemokine SVQILRCPDGMQMLRSGQCVATTEPPFDPDSY 32 T 0.13 Bowman-Birk_leg unppssm F Eukaryota T 2rqo 1 A A polytheonamide B XGXGXXXXXXAGAXAXXGAGXXXXAGGXIXXXGXIXVXAXVXVXXXQXT 49 F F T 2rqw 2 B B STE20_YEAST STE20P-PRR PEPTIDE SSSANGKFIPSRPAPKPPSSASAS 24 T 0.00039 TFIIA unppssm F Eukaryota T 2rr3 2 B B OSBP1_HUMAN OSBP PLGSDHWGKGDMSDEDDENEFFDAPEIITMPENLGHKRTGSHHHHHH 47 T 10 DUF1180 pdbhh F Eukaryota T 2rs9 1 A A H4_HUMAN H4K5AC SGRGXGGKGL 10 T 4.7 G3P_acyltransf pdbhh F Eukaryota T 2rsk 2 C,D C,D PRIO_BOVIN partial binding peptide of Major prion protein GQWNKPSKPKTN 12 T 0.19 OATP unppssm F Eukaryota T 2rt4 1 A A AF.2A1 GVVRQWSGYDPRTGTWRSSIAYGGG 25 T 0.7 XPB_DRD pdbhh F T 2rt5 2 B B NCOR2_HUMAN peptide from Silencing mediator of retinoic acid and thyroid hormone receptor YETLSDSE 8 T 7.1 Lactococcin pdbhh F Eukaryota T 2rvb 1 A A XPC_HUMAN XERODERMA PIGMENTOSUM GROUP C-COMPLEMENTING PROTEIN, P125 GSHMAHHLKRGATMNEDSNEEEEESENDWEEVEELSEPVLGDVRESTAFSRS 52 T 0.043 DUF5810 pdbpercent F Eukaryota T 2rvd 1 A A CLN025 YYDPETGTWY 10 T 0.011 OCRE pdb F T 2seb 4 D E CO2A1_HUMAN PEPTIDE FROM COLLAGEN II AYMRADAAAGGA 12 T 0.022 DUF2600 unphh F Eukaryota T 2uux 1 A A Q1EG59_RHIAP TRYPTASE INHIBITOR TDPI AAECTVPIGWSEPVKGLCKARFTRYYCMGNCCKVYEGCYTGGYSRMGECARNCPG 55 T 0.03 DUF3788 unppssm F Eukaryota T 2uuy 2 B B Q1EG59_RHIAP TRYPTASE INHIBITOR TDPI CTVPIGWSEPVKGLCKARFTRYYCMGNCCKVYEGCYTGGYSRMGECARNCPA 52 T 0.03 DUF3788 unppssm F Eukaryota T 2v17 1 A A PEPTIDE FRAGMENT TDHGAE 6 T 68 Exo_endo_phos pdbhh F T 2v1r 2 C,D,E P,Q,R PEX14_YEAST PEX14 XEAMPPTLPHRDWKD 15 T 3.4E-05 DUF1664 unphh F Eukaryota T 2v1s 2 H,I,J,K,L,M,N H,I,J,K,L,M,N ALDH2_RAT ALDH CLASS 2, ALDH1, ALDH-E2 GPRLSRLLSYAGX 13 T 8.8 TFIID_30kDa pdbhh F Eukaryota T 2v1t 2 C,D C,D ALDH2_RAT ALDH CLASS 2, ALDH1, ALDH-E2 GPRLSRLLSAAGX 13 T 5.4 Atypical_Card pdbhh F Eukaryota T 2v2x 3 C,F C,F HIV P17 SLFNTVATL 9 T 0.0057 Gag_p17 pdbhh F T 2v3s 2 C,D C,D WNK4_HUMAN PROTEIN KINASE WITH NO LYSINE 4, PROTEIN KINASE LYSINE-DEFICIENT 4 GRFQVT 6 T 36 DUF3446 pdbhh F Eukaryota T 2v7x 1 A,B,C A,B,C FLA_STRCT 5'-FLUORO-5'-DEOXY ADENOSINE SYNTHETASE MAANSTRRPIIAFMSDLGTTDDSVAQCKGLMYSICPDVTVVDVCHSMTPWDVEEGARYIVDLPRFFPEGTVFATTTYPATGTTTRSVAVRIKQAAKGGARGQWAGSGAGFERAEGSYIYIAPNNGLLTTVLEEHGYLEAYEVTSPKVIPEQPEPTFYAREMVAIPSAHLAAGFPLSEVGRPLEDHEIVRFNRPAVEQDGEALVGVVSAIDHPFGNVWTNIHRTDLEKAGIGYGARLRLTLDGVLPFEAPLTPTFADAGEIGNIAIYLNSRGYLSIARNAASLAYPYHLKEGMSARVEAR 299 T 3.6E-42 SAM_adeno_trans pdbpercent F Bacteria T 2v9k 1 A A PUS10_HUMAN PSEUDOURIDINE SYNTHASE GMFPLTEENKHVAQLLLNTGTCPRCIFRFCGVDFHAPYKLPYKELLNELQKFLETEKDELILEVMNPPPKKIRLQELEDSIDNLSQNGEGRISVSHVGSTASKNSNLNVCNVCLGILQEFCEKDFIKKVCQKVEASGFEFTSLVFSVSFPPQLSVREHAAWLLVKQEMGKQSLSLGRDDIVQLKEAYKWITHPLFSEELGVPIDGKSLFEVSVVFAHPETVEDCHFLAAICPDCFKPAKNKQSVFTRMAVMKALNKIKEEDFLKQFPCPPNSPKAVCAVLEIECAHGAVFVAGRYNKYSRNLPQTPWIIDGERKLESSVEELISDHLLAVFKAESFNFSSSGREDVDVRTLGNGRPFAIELVNPHRVHFTSQEIKELQQKINNSSNKIQVRDLQLVTREAIGHMKEGEEEKTKTYSALIWTNKAIQKKDIEFLNDIKDLKIDQKTPLRVLHRRPLAVRARVIHFMETQYVDEHHFRLHLKTQAGTYIKEFVHGDFGRTKPNIGSLMNVTADILELDVESVDVDWPPALDD 530 T 0.00012 TruB_N unphh F Eukaryota T 2vda 2 B B LAMB_ECOL6 MALTOSE-INDUCIBLE PORIN MMITLRKRRKLPLAVAVAAGVMSAQAMA 28 T 0.4 PAGK pdbhh F Bacteria T 2vdn 3 C C MPT HRG GLY ASP TRP PRO CYS NH2 XXGDWPCX 8 T 5.3 Ferlin_C pdbhh F T 2vdo 3 C C FIBG_HUMAN FIBRINOGEN, GAMMA POLYPEPTIDE HHLGGAKQAGDV 12 T 37 Tox-HNH-HHH pdbhh F Eukaryota T 2vdp 3 C C FIBG_HUMAN FIBRINOGEN LGGAKQAGDV 10 T 56 DUF5974 pdbhh F Eukaryota T 2vdq 3 C C FIBG_HUMAN FIBRINOGEN, GAMMA POLYPEPTIDE HHLGGAKQRGDV 12 T 5.1 DUF6305 pdbhh F Eukaryota T 2vdr 3 C C FIBG_HUMAN FIBRINOGEN LGGAKQRGDV 10 T 69 DUF5974 pdbhh F Eukaryota T 2ve6 3 C,F,I,L C,F,I,L SENDAI VIRUS EPITOPE RESIDUES 324-332 MODIFIED AT P7 FAPGNYXAL 9 T 0.12 Paramyxo_ncap pdbhh F T 2vf1 1 A,B A,B Q9Q1V2_9VIRU CAPSID PROTEIN DWSWYAPSELVAKQIANVPFNVLAGTPIKASVHLRYDPSLVSGLKDQLFVGNNASIMGARLLYLPSFGISTTVLDGLSMAANQLYAYVRKSNSGAKVYEAPDLMMTVLAIQEAYRVLFEIRRAITFANYWNFWNKYLPKQVFEQLLAIDFDDLMSNKANYCAQFNLMAQKINTFALPKYFKSILRMAYVSSNIFMDSDAVTGQMYAFVSSGYYRYSATTSESGTSLVYRDWPVGAAMPRKLNRLFTVLRELLDAIYGDADAQTMFGDIYKAFGSDGLYSIAEISVDETSTPVFDVDILAQIENCTILEANAGLAWTLDSCNVTQSKGQVLLWQPTGTITSSDNTEHIAGDIAVALGDRVLNSHIMEPQYSDVLEWTRLMATIEFDKASVTSSEKVTFKVTSCGAELIRNVLYFKNVWNDAAEDASQRVITYFSHFSQITVTNATDDPTSAYGLMSNTLDFTQLDWHPIIYVTETSVHNVANLNSILIGGDLKRPTVITTDVVKRINSAANYALYYSANLLSNIST 525 T 4 R2K_2 pdb T Viruses T 2vh3 1 A A RANSM_POLLE RSF-1 AXACSFPPSEIPGSKECLAEALQKHQGFKKKSYALICAYLNYKEDAENYERAAEDFDSAVKCTGCKEGVDLHEGNPELIEEGFEKFLASLKIDRKALGSLCTLFQKLXAIPHN 113 T 0.17 Oxidored-like pdbpssm F Eukaryota T 2vh3 2 B B RANSM_POLLE RSF-1 AXACSFPPXEIPGSKECLAEALQKHQGFKKKSYALICAYLNYKEDAENYERAAEDFDSAVKCTGCKEGVDLHEGNPELIEEGFEKFLASLKIDRKALGSLCTLFQKLYAIPHN 113 T 4.2 PHtD_u1 pdbhh F Eukaryota T 2vif 2 B P KIT_HUMAN SCFR, PROTO-ONCOGENE TYROSINE-PROTEIN KINASE KIT, C-KIT, CD117 ANTIGEN NGNNXVYIDPT 11 T 1 DNA_pack_C pdbhh F Eukaryota T 2vj0 3 C Q AMPH_RAT AMPHIPHYSIN1 FEDNFVP 7 T 0.0069 CCDC32 pdbhh F Eukaryota T 2vkn 2 B C PBS2_YEAST POLYMYXIN B RESISTANCE PROTEIN 2, SUPPRESSOR OF FLUORIDE SENSITIVITY 4, PBS2 NKPLPPLPLAGS 12 T 0.23 DUF4554 unppercent F Eukaryota T 2vln 2 B B CEA9_ECOLX COLICIN E9 MESKRNKPGKATGKGKPVGDKWLDDAGKDSGAPIPDRIADKLRDKEFKSFDDFRKAVWEEVSKDPELSKNLNPSAKSSVSKGYSPFTPKNQQVGGRKVYELHHDKPISQGGEVYDMDNIRVTTPKRHIDIHRGK 134 T 0.016 LHH unppercent F Bacteria T 2vlo 2 B B CEA9_ECOLX COLICIN E9 MESKRNKPGKATGKGKPVGDKWLDDAGKDSGAPIPDRIADKLRDKEFKSFDDFRKAVWEEVSKDPELSKNLNPSNKSSVSKGYSPFTPKNQQVGGRAVYELHHDKPISQGGEVYDMDNIRVTTPKRHIDIHRGK 134 T 0.016 LHH unppercent F Bacteria T 2vlp 2 B B CEA9_ECOLX COLICIN E9 MESKRNKPGKATGKGKPVGDKWLDDAGKDSGAPIPDRIADKLRDKEFKSFDDFAKAVWEEVSKDPELSKNLNPSNKSSVSKGYSPFTPKNQQVGGRKVYELHHDKPISQGGEVYDMDNIRVTTPKRHIDIHRGK 134 T 0.016 LHH unppercent F Bacteria T 2vlq 2 B B CEA9_ECOLX COLICIN E9 MESKRNKPGKATGKGKPVGDKWLDDAGKDSGAPIPDRIADKLRDKEFKSFDDFRKAVWEEVSKDPELSKNLNPSNKSSVSKGYSPATPKNQQVGGRKVYELHHDKPISQGGEVYDMDNIRVTTPKRHIDIHRGK 134 T 0.00018 LHH pdbpercent F Bacteria T 2voy 5 E E AT2A1_RABIT CA2+-ATPASE, SERCA1, COPA DELTA C TAFVEPFVILLILIANAIVGVWQERNAENA 30 T 0.078 Chi-conotoxin pdbpercent F Eukaryota T 2voy 11 K K AT2A1_RABIT CA2+-ATPASE, SERCA1, COPA DELTA C EGRAIYNNMKQFIRYLISSNVGEVVCIFLTAA 32 T 0.018 PhoLip_ATPase_C unphh F Eukaryota T 2vp7 2 B B BCL9_HUMAN B-CELL LYMPHOMA 9 PROTEIN, BCL-9, PROTEIN LEGLESS HOMOLOG, BCL9 AAKVVYVFSTEMANKAAEAVLKGQVETIVSFHI 33 T 0.24 Ribosomal_L23eN pdb F Eukaryota T 2vpb 2 B B BCL9_HUMAN B-CELL LYMPHOMA 9 PROTEIN, BCL-9, PROTEIN LEGLESS HOMOLOG, BCL9 AMAAKVVYVFSTEMANKAAEAVLKGQVETIVSFHI 35 T 0.26 Ribosomal_L23eN pdb F Eukaryota T 2vpe 2 B,D B,D BCL9_HUMAN B-CELL LYMPHOMA 9 PROTEIN, BCL-9, PROTEIN LEGLESS HOMOLOG, BCL9 GAMVYVFSTEMANKAAEAVLKGQVETIVSFHI 32 T 0.21 Ribosomal_L23eN pdb F Eukaryota T 2vr3 2 C,D C,D FIBG_HUMAN FIBRINOGEN GAMMA-CHAIN QHHLGGAKQAGAV 13 T 17 Tox-HNH-HHH pdbhh F Eukaryota T 2vum 13 M M AAMAT_AMAPH ALPHA AMANITIN, GAMMA-AMANITIN NPXXGIGC 8 T 0.55 Wzy_C pdbhh F Eukaryota T 2vvd 1 A A SPIKE_BPPM2 P1-RECEPTOR BINDING PROTEIN VNYWVSDEEIRVFKEYSARAKYAQNEGRTALEANNVPFFDIDVPPELDGVPFSLKARVRHKSKGVDGLGDYTSISVKPAFYITEGDETTDTLIKYTSYGSTGSHSGYDFDDNTLDVMVTLSAGVHRVFPVETELDYDAVQEVQHDWYDESFTTFIEVYSDDPLLTVKGYAQILMERT 177 T 0.58 PNPase_C pdbpercent T Viruses T 2vve 1 A,B A,B SPIKE_BPPM2 P1-RECEPTOR BINDING PROTEIN SFQEQTTKSRDVNSFQIPLRDGVRELLPEDASRNRASIKSPVDIWIGGENMTALNGIVDGGRKFEAGQEFQINTFGSVNYWVSDEEIRVFKEYSARAKYAQNEGRTALEANNVPFFDIDVPPELDGVPFSLKARVRHKSKGVDGLGDYTSISVKPAFYITEGDETTDTLIKYTSYGSTGSHSGYDFDDNTLDVMVTLSAGVHRVFPVETELDYDAVQEVQHDWYDESFTTFIEVYSDDPLLTVKGYAQILMERT 254 T 3.3 Phage_T4_Ndd pdbpssm T Viruses T 2vwf 2 B B GAB2_HUMAN GROWTH FACTOR RECEPTOR BOUND PROTEIN 2-ASSOCIATED PROTEIN 2, GRB2-ASSOCIATED BINDER 2, PP100, GAB2 IQPPPVNRNLKPDRK 15 T 12 DUF5898 pdbhh F Eukaryota T 2vxg 1 A,B A,B EDC4_DROME LD41624, GE-1 GAMGDSIKQLLMAGQINKAFHQALLANDLGLVEFTLRHTDSNQAFAPEGCRLEQKVLLSLIQQISADMTNHNELKQRYLNEALLAINMADPITREHAPKVLTELYRNCQQFIKNSPKNSQFSNVRLLMKAIITYRDQLK 139 T 0.0069 Csm2_III-A pdb F Eukaryota T 2vzd 2 C,D C,D PAXI_HUMAN PAXILLIN MDDLDALLADLESTTSHISK 20 T 0.036 DUF883 pdb F Eukaryota T 2vzi 1 A A PAXI_HUMAN Paxillin,Paxillin ATRELDELMASLSDFKFMAQ 20 T 1.2 SAM_LFY pdbhh F Eukaryota T 2w0c 2 K,K10,K11,K12,K13,K14,K15,K16,K17,K18,K19,K2,K20,K21,K22,K23,K24,K25,K26,K27,K28,K29,K3,K30,K31,K32,K33,K34,K35,K36,K37,K38,K39,K4,K40,K41,K42,K43,K44,K45,K46,K47,K48,K49,K5,K50,K51,K52,K53,K54,K55,K56,K57,K58,K59,K6,K60,K7,K8,K9 L,L,L,L,L,L,L,L,L,L,L,L,L,L,L,L,L,L,L,L,L,L,L,L,L,L,L,L,L,L,L,L,L,L,L,L,L,L,L,L,L,L,L,L,L,L,L,L,L,L,L,L,L,L,L,L,L,L,L,L SPIKE_BPPM2 PROTEIN 2 MIVKKKLAAGEFAETFKNGNNITIIKAVGELVLRAYGADGGEGLRTIVRQGVSIKGMNYTSVMLHTEYAQEIEYWVGDLDYSFQEQTTKSRDVNSFQIPLRDGVRELLPEDASRNRASIKSPVDIWIGGENMTALNGIVDGGRKFEAGQEFQINTFGSVNYWVSDEEIRVFKEYSARAKYAQNEGRTALEANNVPFFDIDVPPELDGVPFSLKARVRHKSKGVDGLGDYTSISVKPAFYITEGDETTDTLIKYTSYGSTGSHSGYDFDDNTLDVMVTLSAGVHRVFPVETELDYDAVQEVQHDWYDESFTTFIEVYSDDPLLTVKGYAQILMERT 335 T 6 DUF4115 pdbhh T Viruses T 2w0c 3 L,L10,L11,L12,L13,L14,L15,L16,L17,L18,L19,L2,L20,L21,L22,L23,L24,L25,L26,L27,L28,L29,L3,L30,L31,L32,L33,L34,L35,L36,L37,L38,L39,L4,L40,L41,L42,L43,L44,L45,L46,L47,L48,L49,L5,L50,L51,L52,L53,L54,L55,L56,L57,L58,L59,L6,L60,L7,L8,L9,M,M10,M11,M12,M13,M14,M15,M16,M17,M18,M19,M2,M20,M21,M22,M23,M24,M25,M26,M27,M28,M29,M3,M30,M31,M32,M33,M34,M35,M36,M37,M38,M39,M4,M40,M41,M42,M43,M44,M45,M46,M47,M48,M49,M5,M50,M51,M52,M53,M54,M55,M56,M57,M58,M59,M6,M60,M7,M8,M9,N,N10,N11,N12,N13,N14,N15,N16,N17,N18,N19,N2,N20,N21,N22,N23,N24,N25,N26,N27,N28,N29,N3,N30,N31,N32,N33,N34,N35,N36,N37,N38,N39,N4,N40,N41,N42,N43,N44,N45,N46,N47,N48,N49,N5,N50,N51,N52,N53,N54,N55,N56,N57,N58,N59,N6,N60,N7,N8,N9,O,O10,O11,O12,O13,O14,O15,O16,O17,O18,O19,O2,O20,O21,O22,O23,O24,O25,O26,O27,O28,O29,O3,O30,O31,O32,O33,O34,O35,O36,O37,O38,O39,O4,O40,O41,O42,O43,O44,O45,O46,O47,O48,O49,O5,O50,O51,O52,O53,O54,O55,O56,O57,O58,O59,O6,O60,O7,O8,O9 P,P,P,P,P,P,P,P,P,P,P,P,P,P,P,P,P,P,P,P,P,P,P,P,P,P,P,P,P,P,P,P,P,P,P,P,P,P,P,P,P,P,P,P,P,P,P,P,P,P,P,P,P,P,P,P,P,P,P,P,Q,Q,Q,Q,Q,Q,Q,Q,Q,Q,Q,Q,Q,Q,Q,Q,Q,Q,Q,Q,Q,Q,Q,Q,Q,Q,Q,Q,Q,Q,Q,Q,Q,Q,Q,Q,Q,Q,Q,Q,Q,Q,Q,Q,Q,Q,Q,Q,Q,Q,Q,Q,Q,Q,Q,Q,Q,Q,Q,Q,R,R,R,R,R,R,R,R,R,R,R,R,R,R,R,R,R,R,R,R,R,R,R,R,R,R,R,R,R,R,R,R,R,R,R,R,R,R,R,R,R,R,R,R,R,R,R,R,R,R,R,R,R,R,R,R,R,R,R,R,S,S,S,S,S,S,S,S,S,S,S,S,S,S,S,S,S,S,S,S,S,S,S,S,S,S,S,S,S,S,S,S,S,S,S,S,S,S,S,S,S,S,S,S,S,S,S,S,S,S,S,S,S,S,S,S,S,S,S,S P3_BPPM2 PROTEIN III MNTSVPTSVPTNQSVWGNVSTGLDALISGWARVEQIKAAKASTGQGRVEQAMTPELDNGAAVVVEAPKKAAQPSETLVFGVPQKTLLLGFGGLLVLGLVMRGNK 104 T 0.069 RseC_MucC pdb T Viruses T 2w0c 4 P,P10,P11,P12,P13,P14,P15,P16,P17,P18,P19,P2,P20,P21,P22,P23,P24,P25,P26,P27,P28,P29,P3,P30,P31,P32,P33,P34,P35,P36,P37,P38,P39,P4,P40,P41,P42,P43,P44,P45,P46,P47,P48,P49,P5,P50,P51,P52,P53,P54,P55,P56,P57,P58,P59,P6,P60,P7,P8,P9 T,T,T,T,T,T,T,T,T,T,T,T,T,T,T,T,T,T,T,T,T,T,T,T,T,T,T,T,T,T,T,T,T,T,T,T,T,T,T,T,T,T,T,T,T,T,T,T,T,T,T,T,T,T,T,T,T,T,T,T P6_BPPM2 PROTEIN VI MANFLTKNFVWILAAGVGVWFYQKADNAAKTATKPIADFLAELQFLVNGSNYVKFPNAGFVLTRDALQDDFIAYDDRIKAWLGTHDRHKDFLAEILDHERRVKPVYRKLIGNIIDASTIRAASGVEL 127 T 0.06 Phageshock_PspG pdbpssm T Viruses T 2w0p 2 C C FBLI1_HUMAN MITOGEN-INDUCIBLE 2-INTERACTING PROTEIN, MIGFILIN PEKRVASSVFITLAP 15 T 12 Ycf15 pdbhh F Eukaryota T 2w0t 1 A A LMBL2_HUMAN L(3)MBT-LIKE 2 PROTEIN, H-L(3)MBT-LIKE PROTEIN GSGSEPAVCEMCGIVGTREAFFSKTKRFCSVSCSRSYSSNSKK 43 T 0.0088 zf-FCS pdbhh F Eukaryota T 2w10 2 C,D C,D PTN23_MOUSE HD-PTP PPPRPTAPKPLL 12 T 7.3 UPF0449 pdbhh F Eukaryota T 2w3o 2 C,D C,D XRCC1_HUMAN X-RAY REPAIR CROSS-COMPLEMENTING PROTEIN 1, XRCC1-DERIVED PHOSPHOPEPTIDE YAGSTDEN 8 T 12 M3 pdbhh F Eukaryota T 2w5v 1 A,B A,B Q9KWY4_9BACT TAB5 ALKALINE PHOSPHATASE MUTANT MKLKKIVFTLIALGLFSCKTTSVLVKNEPQLKTPKNVILLISDGAGLSQISSTFYFKSGTPNYTQFKNIGLIKTSSSREDVTDSASGATAFSCGIKTYNAAIGVADDSTAVKSIVEIAALNNIKTGVVATSSITDATPASFYAHALNRGLEEEIAMDMTESDLDFFAGGGLNYFTKRKDKKDVLAILKGNQFTINTTALTDFSSIASNRKMGFLLADEAMPTMEKGRGNFLSAATDLAIQFLSKDNSAFFIMSEGSQIDWGGHANNASYLISEINDFDDAIGTALAFAKKDGNTLVIVTSDHETGGFTLAAKKNKREDGSEYSDYTEIGPTFSTGGHSATLIPVFAYGPGSEEFIGIYENNEIFHKILKVTKWNQ 375 T 2.1E-11 Alk_phosphatase pdbpssm F Bacteria. T 2w5x 1 A,B A,B Q9KWY4_9BACT TAB5 ALKALINE PHOSPHATASE MUTANT MKLKKIVFTLIALGLFSCKTTSVLVKNEPQLKTPKNVILLISDGAGLSQISSTFYFKSGTPNYTQFKNIGLIKTSSSREDVTDSASGATAFSCGIKTYNAAIGVADDSTAVKSIVEIAALNNIKTGVVATSSITEATPASFYAHALNRGLEEEIAMDMTESDLDFFAGGGLNYFTKRKDKKDVLAILKGNQFTINTTALTDFSSIASNRKMGFLLADEAMPTMEKGRGNFLSAATDLAIQFLSKDNSAFFIMSEGSQIDWGGHANNASYLISEINDFDDAIGTALAFAKKDGNTLVIVTSDHETGGFTLAAKKNKREDGSEYSDYTEIGPTFSTGGHSATLIPVFAYGPGSEEFIGIYENNEIFHKILKVTKWNQ 375 T 2.4E-11 Alk_phosphatase unppssm F Bacteria. T 2w65 3 E,F E,F COLLAGEN DERIVED PEPTIDE PCII-CIT1 AXGLTGRPG 9 T 4.2 Glyco_hydro_15 pdbhh F T 2w73 2 E,F,G,H K,L,M,O PP2BA_HUMAN CALMODULIN-DEPENDENT CALCINEURIN A SUBUNIT ALPHA ISOFORM, CAM-PRP CATALYTIC SUBUNIT VIRNKIRAIGKMARVFS 17 T 26 DUF5435 pdbhh F Eukaryota T 2w84 2 B B PEX5_HUMAN PEROXISOME RECEPTOR 1, PEROXISOMAL C-TERMINAL TARGETING SIGNAL IMPORT RECEPTOR, PTS1-BP, PEROXIN-5, PTS1 RECEPTOR, PTS1R, PEX5 GVADLALSENWAQEFLAAGD 20 T 11 Drf_GBD pdbhh F Eukaryota T 2w97 3 C,D E,F IF4G1_HUMAN P220, EIF-4-GAMMA 1, EIF-4G 1, EIF-4G1, EIF4GI KKRYDREFLLGFQF 14 T 0.00025 eIF_4G1 unphh F Eukaryota T 2w9r 2 B B DPS_ECOLI PEXB, VTM LVKSKATNLLY 11 T 0.1 7TMR-HDED unppercent F Bacteria T 2wa8 2 B,D B,D N-END RULE PEPTIDE FRSKGEELFT 10 T 14 DNAP_B_exo_N pdbhh F T 2wb6 1 A A Y102_AFV1Y AFV1-102 MSYYHHHHHHLESTSLYKKAGSENLYFQGIVDKNKIVIPMSEFLDSMFLVIEKLGVHAEKKGSMIFLSSERVKLADWKQLGAMCSDCYHCKLPLSSFIEIVTRKAKDKFLVMYNEKEVTLVARGVQTIQK 130 T 0.069 DYW_deaminase pdbpercent T Viruses T 2wb7 1 A,B A,B PT26-6P MNATINDDDIDDVKKALDHATQAAHKAAAELTAKLRSDFVEYGNGGTAGQVLIHIYGPGLIYGFSAFPVQIRLEIPNQPVPFNKVHITEVTAYVIDENNRTYWTRVWNSSTFRQGGYIADTLDLVTVMKAPDPLVYQIRDAIVTGQISRELYDKIWNTSTTHFEIRVIVKGYQEAWKTDSSVSNQSSCPSDGHWYEDACWVHDKDIDFTLKAETTTAWGHVTGTNDVATIDGGMLGSLPIKFLQSLDLSGKWVLYQNKYAGALSDFIIITAASPVHVLNSTAMYKFLITPNPGYFQPANPKISDEYRFVTLRVIEGGRMELADTTTGHIGDLTEPTFFGLTAHYTDAPGTLDYHALGLVYAYVERDDGVKIPIWLAAEPMISVLSNTYTVMKDQDVKNLIDLYKKKDREKINATTKAMINSLQEKIDEAEQLLAKAKGMNNENAIEYAQGAIDEYKAAINDLQKAAQQDDYQMFLNYLNAAKKHEMAGDYYVNAARKALNGDLEQAKIDAEKAKEYSNLAKEYEPG 526 T 0.00057 TPR_12 pdbpssm F T 2wgo 1 A A B5DCK2_9NEOB RSN-2 GSLILDGDLLKDKLKLPVIDNLFGKELLDKFQDDIKDKYGVDTKDLKILKTSEDKRFYYVSVDAGDGEKCKFKIRKDVDVPKMVGRKCRKDDDDDDGY 98 T 8.4 CDI unphh F Eukaryota T 2wh0 2 E,F Q,R KPCE_HUMAN PKCEV3 DRSKSAPTSPCDQEIKELENNIRKALSFDNR 31 T 54 Arc_MA pdbhh F Eukaryota T 2wo6 2 C C CRUM2_HUMAN ARTIFICIAL CONSENSUS SEQUENCE ARPGTPAL 8 T 6.1 Microvir_J pdbhh F Eukaryota T 2wp2 2 C,D P,Q H4_MOUSE HISTONE H4 SGRGXGGXGLGKGGAKRHRK 20 T 11 Shadoo unppercent F Eukaryota T 2wpt 2 B B CEA9_ECOLX E9 DNASE MESKRNKPGKATGKGKPVGDKWLDDAGKDSGAPIPDRIADKLRDKEFKSFDDFRKAVWEEVSKDPELSKNLNPSNKSSVSKGYSPFTPKNQQVGCRKVYELHHDKPISQGGEVYDMDNIRVTTPKRHIDIHRGK 134 T 0.016 LHH unppercent F Bacteria T 2wq4 1 A,B,C A,B,C B4EH86_BURCJ BC2L-C N-TERMINAL DOMAIN MPLLSASIVSAPVVTSETYVDIPGLYLDVAKAGIRDGKLQVILNVPTPYATGNNFPGIYFAIATNQGVVADGCFTYSSKVPESTGRMPFTLVATIDVGSGVTFVKGQWKSVRGSAMHIDSYASLSAIWGTAAPSSQGSGNQGAETGGTGAGNIGGG 156 T 0.002 DUF1543 unppercent F Bacteria T 2wv4 2 C,D C,D POLG_FMDV1 FOOT AND MOUTH DISEASE VIRUS (SEROTYPE A) VARIANT VP1 CAPSID PROTEIN XAPAKQLLNFD 11 T 8.3 Fn3-like pdbhh T Viruses T 2wv5 2 E,F,G,H E,F,G,H POLG_FMDV1 FOOT AND MOUTH DISEASE VIRUS (SEROTYPE A) VARIANT VP1 CAPSID PROTEIN XAPAKELLNFD 11 T 16 Fn3-like pdbhh T Viruses T 2wwx 2 B B DRRA_LEGPH SIDM MPYSDAKAMLDEVAKIRELGVQRVTRIENLENAKKLWDNANSMLEKGNISGYLKAANELHKFMKEKNLKEDDLRPELSDKTISPKGYAILQSLWGAASDYSRAAATLTESTVEPGLVSAVNKMSAFFMDCKLSPNERATPDPDFKVGKSKILVGIMQFIKDVADPTSKIWMHNTKALMNHKIAAIQKLERSNNVNDETLESVLSSKGENLSEYLSYK 217 T 0.036 GlnD_UR_UTase unppssm F Bacteria T 2x3g 1 A A Q5TJA9_SIRV1 SIRV1 HYPOTHETICAL PROTEIN ORF119 GDLKKVLNFHFSYIYTYFITITTNYKYGDTEKIFRKFRSYIYNHDKNSHVFSIKETTKNSNGLHYHILVFTNKKLDYSRVHKHMPSHSDIRIELVPKSISDIKNVYKYMLKTKKDIKMS 119 T 0.0041 Phage_GPA pdbhh T Viruses T 2x3m 1 A A Q6ZYJ2_PSVY HYPOTHETICAL PROTEIN ORF239 GMSAFDEFNEGFGLDVSDTPEELAFETESAIEEIESETSPGDQPKGSEPEEIRVWAEEKARKAVEEGREVTNWADWIMGWRTPNASEKKMEFMYWYTRTYLEEAKDIRPDIADALARGMAGLAFGRTDWVASMLDPQIMRHIYTDPEVARIYSETRDMLRRVSDYYISLTTMELGKVADIIAEAKAKGENPEVVAREIAEAVPRLSPKSLYFNLYYIGRSIGDNYVLEVARVLSKMRRR 239 T 0.52 HD_assoc pdb T Viruses T 2x4n 3 C,F C,F HLA-A2.1-RESTRICTED INFLUENZA A MATRIX EPITOPE KILGXVFXV 9 T 28 COPIIcoated_ERV pdbhh F T 2x4p 3 C,F C,F HLA-A2.1-RESTRICTED INFLUENZA A MATRIX EPITOPE MILGXVFXV 9 T 0.53 COPIIcoated_ERV pdbhh F T 2x4r 3 C,F C,F PP65_HCMVA LOWER MATRIX PROTEIN PP65,64 KDA MATRIX PHOSPHOPROTEIN NLVPMVATV 9 T 15 GDH_N pdbhh T Viruses T 2x4t 3 C,F C,F PP65_HCMVA LOWER MATRIX PROTEIN PP65,64 KDA MATRIX PHOSPHOPROTEIN NLVXMVATV 9 T 18 Pilus_CpaD pdbhh T Viruses T 2x5c 1 A,B A,B Q6ZYH1_PSVY HYPOTHETICAL PROTEIN ORF131 GMGETPEGPMPNKKGKSEGGQIRTIPLKYYKQEYDMAADLVRMLRGLGVFMHAKCPRCGAEGSVSIVETKNGYKYLVIRHPDGGTHTVPKTDISAILKELCEVKKDLEYVLKRYKEYEEEGGVKFCAEGRK 131 T 0.038 zf-ISL3 unp T Viruses T 2x5g 1 A A Y131_SIRV1 UNCHARACTERIZED PROTEIN 131, CAG38830 GASLKEIIDELGKQAKEQNKIASRILKIKGIKRIVVQLNAVPQDGKIRYSMTIHSQNNFRKQIGITPQDAEDLKLIAEFLEKYSDFLNEYVKFTPR 96 T 0.0019 ANTH unp T Viruses T 2x5h 1 A,B,C,D A,B,C,D Y131_SIRV1 UNCHARACTERIZED PROTEIN 131, CAG38830 GMASLKEIIDELGKQAKEQNKIASRIMKIKGIKRIVVQLNAVPQDGKIRYSMTIHSQNNFRKQIGITPQDAEDLKLIAEFLEKYSDFLNEYVKFTPR 97 T 0.0019 ANTH unp T Viruses T 2x5r 1 A A Q6ZYF6_PSVY HYPOTHETICAL PROTEIN ORF126 GAMARVGPKIEITHGGKKYTVFSKVTHLVPRTENGEEAEYVVFGPEKEGVISVVVLAPKDLNEEALALRVKWFNDTKPRCVKCGAAYNGKNHFRVVAIRNGTYYLDAVCDKCEPRITWLSAIVIGRS 127 T 0.025 DUF2321 unp T Viruses T 2x5t 1 A A Y131_SIRV1 UNCHARACTERIZED PROTEIN 131, CAG38830 GASLKEIIDELGKQAKEQNKIASRILKIKGIKRIVVQLNAVPQDGKIRYSLTIHSQNNFRKQIGITPQDAEDLKLIAEFLEKYSDFLNEYVKFTPR 96 T 0.0019 ANTH unp T Viruses T 2x6m 2 B B SYUA_HUMAN ALPHA-SYNUCLEIN PEPTIDE GYQDYEPEA 9 T 4.6 DUF3270 pdbhh F Eukaryota T 2x6p 1 A,B,C A,B,C COIL SER L19C XEWEALEKKLAALESKLQACEKKLEALEHG 30 T 0.00043 DUF5320 pdbhh F T 2x72 2 B B GNAT1_BOVIN GACT PEPTIDE, TRANSDUCIN ALPHA-1 CHAIN ILENLKDCGLF 11 T 0.75 Phage_holin_4_1 pdbhh F Eukaryota T 2xac 2 C,D C,X VGFR1_HUMAN VEGFR-1, VASCULAR PERMEABILITY FACTOR RECEPTOR, TYROSINE-PROTEIN KINASE RECEPTOR FLT, TYROSINE-PROTEIN KINASE FRT, FLT-1, FMS-LIKE TYROSINE KINASE 1 SDTGRPFVEMYSEIPEIIHMTEGRELVIPCRVTSPNITVTLKKFPLDTLIPDGKRIIWDSRKGFIISNATYKEIGLLTCEATVNGHLYKTNYLTHRQT 98 T 0.0013 Ig_2 pdb F Eukaryota T 2xb2 6 F,J G,U REN3B_HUMAN UPF3B, NONSENSE MRNA REDUCING FACTOR 3B, UP-FRAMESHIFT SUPPRESSOR 3 HOMOLOG B, UP-FRAMESHIFT SUPPRESSOR 3 HOMOLOG ON CHROMOSOME X, HUPF3P-X EVVKRDRIRNKDRPAMQLYQPGARSRNRLCPPDDSTKSGDSAAERKQESGISHRKEGGEE 60 T 10 UPF0561 pdbhh F Eukaryota T 2xc8 1 A,B,C A,B,C O48465_BPSPP BACTERIOPHAGE SPP1 COMPLETE NUCLEOTIDE SEQUENCE GIEIVNRKAVWYLTSEIKETETGIEVSAGELHKGDEEVFPVEEVSFDLTPDDTYPVEYMLYLHMNVQTKKVSWSLCKAYLDGEGYCDYQGNERLIMYPVSVTVFPNGTREGTIFLYEKEDREPDRKPPVIVEPQPVGEIGTPDIDE 146 T 71 Gpi16 pdbhh T Viruses T 2xfx 3 C C Q4MYJ2_THEPA UNCHARACTERIZED PROTEIN VGYPKVKEEML 11 T 2.6 HrpB_C pdbhh F Eukaryota T 2xjh 1 A,B A,B MBCTN_METTR COPPER-BINDING COMPOUND, CBC, HYDROGEN PEROXIDE REDUCTASE, SUPEROXIDE DISMUTASE, MINUS-MET METHANOBACTIN XXGSCYPXSCM 11 T 0.0043 QueC pdbhh F Bacteria T 2xl2 2 C,D C,D RBBP5_MOUSE RBBP5, RBBP-5 YAAEDEEVDVTSVD 14 T 0.0099 DUF2457 unppercent F Eukaryota T 2xl4 1 A A LNTA_LISMO Listeria nuclear targeted protein A GSMGEDEGEQTKTKKDSNKVVKTASRPKLSTKDLALIKADLAEFEARELSSEKILKDTIKEESWSDLDFANDNINQMIGTMKRYQQEILSIDAIKRASEASADTEAFKKIFKEWSEFKIERIQVTIDLLNGKKDSEAVFKKTYPNQIIFKKVRTNKLQTALNNLKVGYELLDSQK 175 T 0.012 T4SS_pilin unppssm F Bacteria T 2xnx 4 M,N M,N M1-BC1 MVWDRQRLEKELEEKKEALELAIDQASRDYHRATALEKELEEKKKALELAIDQASQDYNRANVLEKELEAITREQEINRNLLGNAKLELDQLSSEKEQLTIEKAKLEEEKQISDASRQSLRRDLDASREAKKQVEKDLLEHHHHHH 146 T 0.0088 M pdbpercent F T 2xny 4 G,H M,N M PROTEIN MVNGDGNPREVIEDLAANNPAIQNIRLRHENKDLKARLENAMEVAGRDFKRAEELEKAKQALEDQRKDLETKLKELQQDYDLAKESTSWDRQRLEKHHHHHH 102 T 0.016 ATG14 pdb F T 2xpn 2 B B Q8SRG7_ENCCU SPT6 GSHMFFEIFGTGEEYRYVLESDP 23 T 8.5 STE3 pdbhh F Eukaryota T 2xpp 2 B B Q8SRG7_ENCCU SPT6 GSHMREISEESISSIDYGDRDSLFFEIFGTGEEYRYVLESDP 42 T 18 DUF2887 pdbhh F Eukaryota T 2xqq 2 E,F,G,H E,F,G,H SAC-ARG-GLY-THR-GLN-THR-GLU XRGTQTE 7 T 61 Toxin_27 pdbhh F T 2xrw 2 B B NFAC3_HUMAN PEPNFAT4, NF-ATC3, NFATC3, T-CELL TRANSCRIPTION FACTOR NFAT4, NF-AT4, NFATX LERPSRDHLYLPLE 14 T 2.8 DUF1101 pdbhh F Eukaryota T 2xu6 1 A,B A,B MDV1_YEAST MDV1 COILED COIL GPQTLVNSLEFLNIQKNSTMSEIRDIEVEVENLRQKKEKLLGKIANIEQNQLMLEDNLKQIDDRLDFLEEYG 72 T 0.00065 Pil1 pdb F Eukaryota T 2xu7 2 C,D C,D FOG1_HUMAN FRIEND OF GATA PROTEIN 1, FOG-1, FRIEND OF GATA 1, ZINC FINGER PROTEIN MULTITYPE 1 MSRRKQSNPRQIKRS 15 T 78 DHHA2 pdbhh F Eukaryota T 2xvc 2 B B Q97ZJ5_SULSO SSO0911 SEIPLPIPVKVINTL 15 T 3 FpoO pdbhh F Archaea T 2xvo 1 A,B,C,D A,B,C,D CMR7B_SULSO SSO1725 GAMGSPGGSQQVEWVFIPVIKDVTYEFKVDNNDNITELYVNGNKLGPASSLEMDFYFDVDVSNNQVRKFNNVFVLFGVIATKDSNKIKMQLTLNPCDFVRGFVFPSQDPSQLNNIFASNNKVSVSEKAFAILNRKKEGAVSSTINVYITQNTYTGNTKIEKIQQNTIIIEKNTGIVFKIPNDMLNIFRYSTT 192 T 0.034 OstA_2 unp F Archaea T 2xxm 3 C T INHIBITOR OF CAPSID ASSEMBLY ITFEDLLDYYG 11 T 0.69 DUF2610 pdbhh F T 2xxn 2 B B VIRF4_HHV8P VIRF-4 SVWIPVNEGASTSGM 15 T 7 Calponin pdbhh T Viruses T 2xzq 3 C P PHAGE DISPLAY DERIVED ANTIGEN YQLRPNAETLRF 12 T 5.2 7TM_GPCR_Srb pdbhh F T 2y06 3 C P PHAGE DISPLAY DERIVED ANTIGEN GDPRPSYISHLL 12 T 1.7 Tom7 pdbhh F T 2y07 3 C P PHAGE DISPLAY DERIVED ANTIGEN PPYPAWHAPGNI 12 T 1.3 DUF3612 pdbhh F T 2y36 3 C P DODECAPEPTIDE (DLWTTAIPTIPS) DLWTTAIPTIPS 12 T 12 IER pdbhh F T 2y48 3 C C SNAI1_HUMAN TRANSCRIPTION FACTOR SNAIL, PROTEIN SNAIL HOMOLOG 1, PROTEIN SNA PRSFLVRKPSDPNRKPNYSE 20 T 0.29 bCoV_NS6 unppssm F Eukaryota T 2y4v 2 B B DAPK1_HUMAN DAP KINASE 1 RKKYKQSVRLISLCQRLSR 19 T 11 AAA_lid_8 pdbhh F Eukaryota T 2y65 2 E,F,G W,X,Y KINH_DROME KINESIN GSGPQAQIAKPIRSGQGATS 20 T 0.23 FPP unphh F Eukaryota T 2y6s 3 E,F P,Q VGP_EBOZ5 GP, GP1 GKLGLITNTIAGVAGLI 17 T 5.5 DUF4731 pdbhh T Viruses T 2y7l 2 B B FIBG_HUMAN FIBRINOGEN GAMMA CHAIN, ISOFORM CRA_A GEGQQHHLGGAKQAGDV 17 T 23 Rhodopsin_N pdbhh F Eukaryota T 2y8o 2 B B MP2K6_HUMAN MAPK/ERK KINASE 6, SAPKK3 SKGKKRNPGLKIPK 14 T 3 GHL15 pdbhh F Eukaryota T 2y8s 2 B,D B,E RON2_TOXGM RON2 DIVQHMEDIGGAPPVSCVTNEILGVTCAPQAIAKATT 37 T 7.1 LisH_TPL pdbhh F Eukaryota T 2y8t 2 B,D B,E RON2_TOXGM RON2 DIVQHMEDIGGAPPVSCVTNEILGVTCAPQAIAKATX 37 T 7.5 LisH_TPL pdbhh F Eukaryota T 2y9q 2 B B MKNK1_HUMAN MAP KINASE SIGNAL-INTEGRATING KINASE 1, MNK1 MKLSPPSKSRLARRRALA 18 T 0.15 HNOBA unphh F Eukaryota T 2y9w 2 C,D C,D G1K3P4_AGABI LECTIN-LIKE FOLD PROTEIN MAQARKIPLDLPGTRILNGANWANNSATENLATNSGTLIIFDQSTPGQDADRWLIHNYLDGYKIFNMGSNNWASVSRGNTVLGVSEFDGQTCKWSIEYSGNGEEFWIRVPREGGGGAVWTIKPASSQGPTTVFLDLLKETDPNQRIKFAV 150 T 0.002 Inhibitor_I66 pdbhh F Eukaryota T 2yb8 1 A A SUZ12_DROME SUPPRESSOR 12 OF ZESTE PROTEIN, SUZ12 NPIFLNRTLSYMK 13 T 0.082 DUF4085 unp F Eukaryota T 2ybf 2 B B RAD18_HUMAN POSTREPLICATION REPAIR PROTEIN RAD18, HHR18, HRAD18, RING FINGER PROTEIN 73 SKYRKKHKSEFQLLVDQARKGYKKIAG 27 T 0.012 PhoU_div unp F Eukaryota T 2ych 2 B B Q72IW7_THET2 COMPETENCE PROTEIN PILN MIRLNLLPKNLRRRV 15 T 0.0034 RskA unppercent F Bacteria T 2ydq 2 B T OGA_HUMAN MENINGIOMA-EXPRESSED ANTIGEN 5, NUCLEAR CYTOPLASMIC O-GLCNACASE AND ACETYLTRANSFERASE, PROTEIN O-GLCNACASE, GLYCOSIDE HYDROLASE O-GLCNACASE, HEXOSAMINIDASE C, N-ACETYL-BETA-D-GLUCOSAMINIDASE, N-ACETYL-BETA-GLUCOSAMINIDASE, O-GLCNACASE, OGA VAHSGAK 7 T 69 HEPN_AbiA_CTD pdbhh F Eukaryota T 2yds 2 B T TAB1_HUMAN MITOGEN-ACTIVATED PROTEIN KINASE KINASE KINASE 7-INTERACTING PROTEIN 1, TGF-BETA-ACTIVATED KINASE 1-BINDING PROTEIN 1, TAK1-BINDING PROTEIN 1 VPYSSAQ 7 T 18 UPF0160 pdbhh F Eukaryota T 2yen 1 A A GM3C_CONCN Mu-conotoxin CnIIIC QGCCNGPKGCSSKWCRDHARCCX 23 T 0.089 Mu-conotoxin pdbpssm F Eukaryota T 2yev 3 C,F C,F Q5SH67_THET8 UNCHARACTERIZED PROTEIN TTHA1863 MVYIALFALGAALVTLFFYLILNPRVLTTEGETFDLRFVLFMLLLILLAAGTVALMLLIGKAHHLL 66 T 0.039 7tm_3 pdbpercent F Bacteria T 2ygu 1 A,B,C,D,E,F,G,H A,B,C,D,E,F,G,H VA2_SOLIN ALLERGEN SOL I II, VENOM ALLERGEN II, ALLERGEN=SOL I 2, SOL I 2 DNKELKIIRKDVAECLRTLPKCGNQPDDPLARVDVWHCAMAKRGVYDNPDPAVIKERSMKMCTKIITDPANVENCKKVASRCVDRETQGPKSNRQKAVNIIGCALRAGVAETTVLARKKHHHHHH 125 T 0.03 UPAR_LY6 pdb F Eukaryota T 2ygv 2 E,F,G,H E,F,G,H RAD53_YEAST RAD53, CHK2 HOMOLOG, SERINE-PROTEIN KINASE 1 SKKVKRAKLDQTSKGPENLQFS 22 T 19 GGA_N-GAT pdbhh F Eukaryota T 2yka 2 B B ICP27_SHV21 ORF57 PROTEIN GPLGSSCKTSWADRVREAAAQRR 23 T 0.021 RE_BsaWI pdbhh T Viruses T 2yle 2 B B FMN2_HUMAN FMN2 PROTEIN VCRQKKGKSLYKIKPRHDSGIKAKISMKT 29 T 350 YL1_C pdbhh F Eukaryota T 2ymb 2 E,F F,H CHM1A_HUMAN CHMP1A, CHROMATIN-MODIFYING PROTEIN 1A, CHMP1A, VACUOLAR PROTEIN SORTING-ASSOCIATED PROTEIN 46-1, VPS46-1, HVPS46-1 MEDQLSRRLAALRN 14 T 3.9 DUF4549 pdbhh F Eukaryota T 2ymt 2 B B PHAGE DISPLAY DERIVED GAMMA 2 ADAPTIN EAR DOMAIN BINDING PEPTIDE GEEWGPWVX 9 T 0.69 Phage_antitermQ pdbhh F T 2ynr 2 B,C B,C B54NLS SVLGKRKRHPKV 12 T 6.3 DUF4668 pdbhh F T 2ypy 1 A,B,C,D,E,F,G,H,I,J A,B,C,D,E,F,G,H,I,J Q76SB0_HHV8 KSHV LANA GSRYQQPPVPYRQIDDCPAKARPQHIFYRRFLGKDGRRDPKCQWKFAVIFWGNDPYGLKKLSQAFQFGGVKAGPVSCLPHPGPDQSPITYCVYVYCQNKDTSKKVQMARLAWEASHPLAGNLQSSIVKFKKPLPLTQPG 139 T 0.00011 EBV-NA1 unphh T Viruses T 2yq1 1 A,B,C,D A,B,C,D O41974_MHV68 MHV-68 LANA, IMMEDIATE-EARLY PROTEIN GSKRYSRYQKPHNPSDPLPKKYQGMRRHLQVTAPRLFDPEGHPPTHFKSAVMFSSTHPYTLNKLHKCIQSKHVLSTPVSCLPLVPGTTQQCVTYYLLSFVEDKKQAKKLKRVVLAYCEKYHSSVEGTIVKAKPYFPLPE 139 T 6.9E-05 EBV-NA1 pdbhh T Viruses T 2yrk 1 A A ZFHX4_MOUSE ZINC FINGER HOMEODOMAIN PROTEIN 4, ZFH-4 GSSGSSGGTDGTKPECTLCGVKYSARLSIRDHIFSKQHISKVRETVGSQLDREKD 55 T 0.00021 zf_C2H2_6 pdbhh F Eukaryota T 2yu7 2 B B NKG2A_HUMAN NKG2A ATEQEITXAELNLQK 15 T 0.02 Fez1 unppercent F Eukaryota T 2yvc 2 D,E,F D,E,F NEP_MOUSE NEUTRAL ENDOPEPTIDASE 24.11, NEUTRAL ENDOPEPTIDASE, NEP, ENKEPHALINASE, ATRIOPEPTIDASE, CD10 ANTIGEN GRSESQMDITDINAPKPKKKQR 22 T 7 Asp4 unphh F Eukaryota T 2z2p 1 A,B A,B VGB_STAAU STREPTOGRAMIN B LYASE MEFKLQELNLTNQDTGPYGITVSDKGKVWITQHKANMISCINLDGKITEYELPNKGAKVMCLTISSDGEVWFTENAANKIGRITKKGIIKEYTLPNPDSAPYGITEGPNGDIWFTEMNGNRIGRITDDGKIREYELPNKGSYPSFITLGSDNALWFTENQNNAIGRITESGDITEFKIPTPASGPVGITKGNDDALWFVEIIGNKIGRITTSGEITEFKIPTPNARPHAITAGAGIDLWFTEWGANKIGRLTSNNIIEEYPIQIKSAEPAGICFDGETIWFAMECDKIGKLTLIKDNME 299 T 0.00037 SGL pdbpssm F Bacteria T 2z31 5 E P Myelin basic protein (MBP)-peptide RGGASQYRPSQ 11 T 7 Tsg pdbhh F T 2z3f 2 B,D,F,H,J,L,N,P,Q,R,S I,J,K,L,M,N,O,P,Q,R,T YEG3_SCHPO CAC2 RKVESSKVSKKRIAPTPVYP 20 T 6.3E-18 PALB2_WD40 unphh F Eukaryota T 2z5k 2 B B NXF1_HUMAN TIP-ASSOCIATING PROTEIN, TIP-ASSOCIATED PROTEIN, MRNA EXPORT FACTOR TAP, TAP NLS EEDDGDVAMSDAQDGPRVRYNPYTTRPNRR 30 T 1.6 DUF4687 pdbhh F Eukaryota T 2z8p 2 B B (GLY)(GLU)(ALA)(TPO)(VAL)(PTR)(ALA) GEATVXA 7 T 34 B_solenoid_ydck pdbhh F T 2zck 2 B S KGISSQY KGISSQY 7 T 23 DUF4133 pdbhh F T 2zd7 2 C C EVDLPLSDEEPSS EVDLPLSDEEPSS 13 T 11 DUF1375 pdbhh F T 2zdj 1 A,B,C,D A,B,C,D D0VWQ2_9ZZZZ hypothetical protein TTMA177 MKMRKLVKDFGDDYTLIQDSQEVKAILEYIGSEEEPHALFVKVGDGDYEEVWGIDSFVPYNFLEAYRLK 69 T 2.8 Ribosomal_L30 pdbhh F unclassified sequences. T 2zgh 2 B B SSGKVPL SSGKVPL 7 T 7.6 DUF1859 pdbhh F T 2zgj 2 B B SSGKVPLS SSGKVPLS 8 T 14 La_HTH_kDCL pdbhh F T 2zjd 2 B,D B,D SQSTM_MOUSE UBIQUITIN-BINDING PROTEIN P62, STONE14 SGGDDDWTHLS 11 T 7.6 DUF5888 pdbhh F Eukaryota T 2zl2 2 O,P,Q,T,U,V,W,X O,P,Q,T,U,V,W,X A peptide substrate-NVLGFTQ NVLGFTQ 7 T 3.4 GRAB pdbhh F T 2zne 2 C,D C,D PDC6I_HUMAN PDCD6-INTERACTING PROTEIN, ALG-2-INTERACTING PROTEIN 1, HP95 QGPPYPTYPGYPGYSQ 16 T 5.1 CYYR1 pdbhh F Eukaryota T 2zok 3 I,J,K,L I,L,J,K SPIKE_CVMJC PEPTIDIC EPITOPE S510 XSLWNGPHL 9 T 5.1 Mut7-C pdbhh T Viruses T 2zol 3 E,F F,E SPIKE_CVMJC PEPTIDIC EPITOPE S510 XSLSNGPHL 9 T 6.1 WEF-hand pdbhh T Viruses T 2zpk 3 C,F P,Q PAR4_HUMAN PAR-4, THROMBIN RECEPTOR-LIKE 3, COAGULATION FACTOR II RECEPTOR-LIKE 3 PRGYPGQV 8 T 0.16 Gag_p19 pdbhh F Eukaryota T 2zpy 2 B B CD44_MOUSE CD44 antigen SRRRCGQKKKLVINGGNGTV 20 T 0.046 RCR unphh F Eukaryota T 2zui 1 A A CPXA_PSEPU CYTOCHROME P450-CAM, P450CAM MTTETIQSNANLAPLPPHVPEHLVFDFDMYNPSNLSAGVQEAWAVLQESNVPDLVWTRCNGGHWIATRGQLIREAYEDYRHFSSECPFIPREAGEAYDFIPTSMDPPEQRQFRALANQVVGMPVVDKLENRIQELACSLIESLRPQGQCNFTEDYAEPFPIRIFMLLAGLPEEDIPHLKYLTDQMTRPDGSMTFAEAKEALYDYLIPIIEQRRQKPGTDAISIVANGQVNGRPITSDEAKRMCGLLLVGGLDTVVNFLSFSMEFLAKSPEHRQELIERPERIPAACEELLRRFSLVANGRILTSDYEFHGVQLKKGDQILLPQMLSGLDERENACPMHVDFSRQKVSHTTFGHGSHLCLGQHLARREIIVTLKEWLTRIPDFSIAPGAQIQHKSGIVSGVQALPLVWDPATTKAV 415 T 1.6E-05 p450 unppercent F Bacteria T 2zvk 2 B,D,F U,V,W DNA polymerase eta CKRPRPEGMQTLESFFKPLTH 21 T 2.5 RC-P840_PscD pdbhh F T 2zvl 2 B,D,F,H,J,L U,V,W,X,Y,Z DNA polymerase kappa PKHTLDIFFKPLTH 14 T 1.5 DUF4387 pdbhh F T 2zvm 2 B,D,F U,V,W DNA polymerase iota ALNTAKKGLIDYYLMPSLSTTSR 23 T 3 DUF2620 pdbhh F T 3a7p 1 A,B A,B ATG16_YEAST ATG16, CYTOPLASM TO VACUOLE TARGETING PROTEIN 11, SAP18 HOMOLOG GPMGNFIITERKKAKEERSNPQTDSMDDLLIRRLTDRNDKEAHLNELFQDNSGAIGGNIVSHDDALLNTLAILQKELKSKEQEIRRLKEVIALKNKNTERLNAALISGTIENNVLQQKLSDLKKEHSQLVARWLKKTEKETEAMNSEIDGTK 152 T 0.044 ATG16 unphh F Eukaryota T 3a7q 1 A A RELN_MOUSE REELER PROTEIN GRDGNNLNNPVLLLDTFDFGPREDNWFFYPGGNIGLYCPYSSKGAPEEDSAMVFVSNEVGEHSITTRDLSVNENTIIQFEINVGCSTDSSSADPVRLEFSRDFGATWHLLLPLCYHSSSLVSSLCSTEHHPSSTYYAGTTQGWRREVVHFGKLHLAGSVRFRWYQGFYPAGSQPVTWAIDNVYIGPQCEEMCYGHGSCINGTKCICDPGYSGPTCKISTKNPDFLKDDFEGQLESDRFLLMSGGKPSRKCGILSSGNNLFFNEDGLRMLVTRDLDLSHARFVQFFMRLGCGKGVPDPRSQPVLLQYSLNGGLSWSLLQEFLFSNSSNVGRYIALEMPLKARSGSTRLRWWQPSENGHFYSPWVIDQILIGGNISGNTVLEDDFSTLDSRKWLLHPGGTKMPVCGSTGDALVFIEKASTRYVVTTDIAVNEDSFLQIDFAASCSVTDSCYAIELEYSVDLGLSWHPLVRDCLPTNVECSRYHLQRILVSDTFNKWTRITLPLPSYTRSQATRFRWHQPAPFDKQQTWAIDNVYIGDGCLDMCSGHGRCVQGSCVCDEQWGGLYCDEPETSLPTQLKDNFNRAPSNQNWLTVSGGKLSTVCGAVASGLALHFSGGCSRLLVTVDLNLTNAEFIQFYFMYGCLITPSNRNQGVLLEYSVNGGITWNLLMEIFYDQYSKPGFVNILLPPDAKEIATRFRWWQPRHDGLDQNDWAIDNVLISRLENLYFQ 725 T 0.00014 EGF_2 pdb F Eukaryota T 3aa1 3 C C PKHO1_HUMAN CKIP-1, CASEIN KINASE 2-INTERACTING PROTEIN 1, C-JUN-BINDING PROTEIN, OSTEOCLAST MATURATION ASSOCIATED GENE 120 PROTEIN SYLAHPTRDRAKIQHSRRPPTRG 23 T 0.0038 CAP-ZIP_m pdbhh F Eukaryota T 3aa6 3 C C CD2AP_HUMAN CD2AP, CAS LIGAND WITH MULTIPLE SH3 DOMAINS, ADAPTER PROTEIN CMS NLLHLTANRPKMPGRRLPGRFNG 23 T 0.018 CARMIL_C pdbhh F Eukaryota T 3abd 2 B,D X,Y REV3L_HUMAN REV3, HREV3 MLTPTPDSSPRSTSSPSQSKNGSFTPRTANILKPLMSPPSREEIMATLLDHD 52 T 7 CaM_bind pdbhh F Eukaryota T 3ade 2 B B SQSTM_MOUSE Sequestosome-1 KEVDPSTGELQSLQ 14 T 1.2 DUF2396 pdbhh F Eukaryota T 3al3 2 B B FANCJ_HUMAN PROTEIN FANCJ, ATP-DEPENDENT RNA HELICASE BRIP1, BRCA1-INTERACTING PROTEIN C-TERMINAL HELICASE 1, BRCA1-INTERACTING PROTEIN 1, BRCA1-ASSOCIATED C-TERMINAL HELICASE 1 SIYFTPELYD 10 T 2 NpwBP pdbhh F Eukaryota T 3alo 2 B E p38 peptide DDEMTGYA 8 T 0.16 YicC_N pdbhh F T 3apr 2 B I REDUCED PEPTIDE INHIBITOR XPFHXVY 7 T 0.78 DUF5372 pdbhh F T 3at0 2 B B FIBA_HUMAN 16-MER FROM FIBRINOGEN ALPHA CHAIN, FIBRINOPEPTIDE A GSWNSGSSGTGSTGNQ 16 T 1.8 DUF4603 pdbhh F Eukaryota T 3av9 2 C,D X,Y LENS EPITHELIAL DERIVED GROWTH FACTOR PEPTIDE SAKIDNLD 8 T 8.4 DUF5399 pdbhh F T 3ava 2 C,D X,Y LENS EPITHELIAL DERIVED GROWTH FACTOR PEPTIDE ALKIDNLD 8 T 14 DUF5399 pdbhh F T 3avb 2 C,D X,Y LENS EPITHELIAL DERIVED GROWTH FACTOR PEPTIDE SLKIDNLD 8 T 11 DUF3389 pdbhh F T 3avc 2 C,D D,F LENS EPITHELIAL DERIVED GROWTH FACTOR SDKIDNLD 8 T 15 Pal1 pdbhh F T 3avg 2 C,D D,F LENS EPITHELIAL DERIVED GROWTH FACTOR PEPTIDE ADKIDNLD 8 T 6.9 tRNA_synt_2f pdbhh F T 3avh 2 C,D E,F LENS EPITHELIAL DERIVED GROWTH FACTOR PEPTIDE ARKIDNLD 8 T 1.1 DUF3663 pdbhh F T 3avi 2 C,D D,F LENS EPITHELIAL DERIVED GROWTH FACTOR PEPTIDE SLKIDNMD 8 T 12 DUF3389 pdbhh F T 3avj 2 C,D D,F LENS EPITHELIAL DERIVED GROWTH FACTOR PEPTIDE ALKIDNMD 8 T 15 Dak1_2 pdbhh F T 3avk 2 C,D D,F LENS EPITHELIAL DERIVED GROWTH FACTOR PEPTIDE SLKIDNED 8 T 1.5 DUF3389 pdbhh F T 3avl 2 C,D F,E LENS EPITHELIAL DERIVED GROWTH FACTOR PEPTIDE ATKIDNLD 8 T 12 DUF5399 pdbhh F T 3avm 2 C,D D,F LENS EPITHELIAL DERIVED GROWTH FACTOR PEPTIDE SRKIDNLD 8 T 5.9 DUF5399 pdbhh F T 3avn 2 C,D G,H LENS EPITHELIAL DERIVED GROWTH FACTOR PEPTIDE SHKIDNLD 8 T 15 DUF5399 pdbhh F T 3awr 2 C,D C,D ALDH2_RAT ALDH CLASS 2, ALDH-E2, ALDH1 GPRLSRLLSSAGC 13 T 5.5 Trp_DMAT pdbhh F Eukaryota T 3ax2 2 B,D,F,H B,D,F,H ALDH2_RAT ALDH CLASS 2, ALDH-E2, ALDH1 GPRLSRLLSYAGSGCX 16 T 0.92 DUF4360 pdbhh F Eukaryota T 3ax3 2 B,D,F,H B,D,F,H ALDH2_RAT ALDH CLASS 2, ALDH-E2, ALDH1 GXRLCRLLSYAX 12 T 0.43 Lentiviral_Tat pdbhh F Eukaryota T 3ax5 2 B,D B,D ALDH2_RAT Aldehyde dehydrogenase, mitochondrial GXRLCRLLSYA 11 T 0.33 Lentiviral_Tat pdbhh F Eukaryota T 3axy 3 E,F,K,L E,F,K,L Rice FD homolog OsFD1 LQRVLSAPF 9 T 8.7 ArgoL2 pdbhh F T 3ayu 2 B B A4_HUMAN ABPP, APPI, APP, ALZHEIMER DISEASE AMYLOID PROTEIN, CEREBRAL VASCULAR AMYLOID PEPTIDE, CVAP, PREA4, PROTEASE NEXIN-II, PN-II ISYGNDALMP 10 T 5.1 ESAG1 pdbhh F Eukaryota T 3b1m 2 B B PRGC1_HUMAN PGC-1-ALPHA, PPAR-GAMMA COACTIVATOR 1-ALPHA, PPARGC-1-ALPHA, LIGAND EFFECT MODULATOR 6 QEAEEPSLLKKLLLAPANT 19 T 3.3 Apo-CIII pdbhh F Eukaryota T 3b21 1 A A Q8VSD5_SHIFL OSPI GPLGSPEFMINGVSLQGTAGYEAHTEEGNVNVKKLLESLNSKSLGDMDKDSELAATLQKMINPSGGDGNCSGCALHACMAMLGYGVREAPVPNEISEYMTGFFHRHLEQIDSEGIVSHPNETYSKFRERIAENILQNTSKGSVVMISIEQATHWIAGFNDGEKIMFLDVQTGKGFNLYDPVEKSPDAFVDENSSVQVIHVSDQEFDHYANSSSWKSKRLC 220 T 0.13 Gln_amidase unppercent F Bacteria T 3b23 3 C C VARI_AMBVA Variegin SDQGDVAEPKMHKTAPPFDFEAIPEEYLDDES 32 T 0.038 Hirudin pdbhh F Eukaryota T 3b3i 3 C C VIPR1_HUMAN VIP-R-1, PITUITARY ADENYLATE CYCLASE-ACTIVATING POLYPEPTIDE TYPE II RECEPTOR, PACAP TYPE II RECEPTOR, PACAP-R-2 RRKWXRWHL 9 F F Eukaryota T 3b7f 1 A A Q46R99_CUPNJ Glycosyl hydrolase, BNR repeat GMTASTAPQTEPHKTSAPESGPVMLLVATIKGAWFLASDPARRTWELRGPVFLGHTIHHIVQDPREPERMLMAARTGHLGPTVFRSDDGGGNWTEATRPPAFNKAPEGETGRVVDHVFWLTPGHASEPGTWYAGTSPQGLFRSTDHGASWEPVAGFNDHPMRRAWTGGEQDGTPDGPKMHSILVDPRDPKHLYIGMSSGGVFESTDAGTDWKPLNRGCAANFLPDPNVEFGHDPHCVVQHPAAPDILYQQNHCGIYRMDRREGVWKRIGDAMPREVGDIGFPIVVHQRDPRTVWVFPMDGSDVWPRVSPGGKPAVYVTRDAGESWQRQDRGLPTDQAWLTVKRQAMTADAHAPVGVYFGTTGGEIWASADEGEHWQCIASHLPHIYAVQSARPV 394 T 0.0005 Sortilin-Vps10 pdb F Bacteria T 3b9t 1 A,B,C,D A,B,C,D Q1GZG6_METFK Twin-arginine translocation pathway signal protein GMSDHVCQEGCRHHSHGEDSPEIQQEFQEGRRDFMRDFAVGGVLASAASLGISSSAFGQTMPKTGLTSGHATHYYIPASDKTVSWGFFSKSLKPVVELESGDFATIETLTHHSNDDASLMVKGDPGAESVFYWDSKRKNVDRRGMGPMDHKLGAGGGMGVHILTGPVAIKGAEPGDVLEVRIVDVALRPSANPEFKGKTFGSNVAANWGFHYNELIEEPKKREVVTIYELDATGERNWARAFYNYRWTPQKDPFGVVHPIVDYPGVPVDHSTISKNYNVLKNIRVPVRPHFGTMGLAPKEADLVNSVPPSHFGGNIDNWRIGKGATMYYPVSVAGGLFSVGDPHASQGDSEMCGTAIECSLTGTFQFILHKKADLPGTPLADLQYPLLETQDEWVLHGFSYANYLAELGPDAQNSIFSKSSLDLALKDAFRKMRHFLMQTQNLTEDEAVSLMSIGVDFGITQVVDGNWGVHAVVKKGIFPGRDV 484 T 0.34 TAT_signal unppssm F Bacteria T 3bdf 1 A,B A,B PPB_ECOLI APASE RTPEMPVLENRAAQGDITAPGGARRLTGDQTAALRDSLSDKPAKNIILLIGDGMGDSEITAARNYAEGAGGFFKGIDALPLTGQYTHYALNKKTGKPDYVTDSAASATAWSTGVKTYNGALGVDIHEKDHPTILEMAKAAGLATGNVSTAELQDAVPAALVAHVTSRKCYGPSATSEKCPGNALEKGGKGSITEQLLNARADVTLGGGAKTFAETATAGEWQGKTLREQAQARGYQLVSDAASLNSVTEANQQKPLLGLFADGNMPVRWLGPKATYHGNIDKPAVTCTPNPQRNDSVPTLAQMTDKAIELLSKNEKGFFLQVEGASIDKQDHAANPCGQIGETVDLDEAVQRALEFAKKEGNTLVIVTADHAHASQIVAPDTKAPGLTQALNTKDGAVMVMSYGNSEEDSQEHTGSQLRIAAYGPHAANVVGLTDQTDLFYTMKAALGLKLEHHHHHH 458 T 1.3E-10 Alk_phosphatase pdbpssm F Bacteria T 3bdg 2 B B PPB_ECOLI APASE RTPEMPVLENRAAQGDITAPGGARRLTGDQTAALRDSLSDKPAKNIILLIGDGMGDSEITAARNYAEGAGGFFKGIDALPLTGQYTHYALNKKTGKPDYVTDSAASATAWSTGVKTYNGALGVDIHEKDHPTILEMAKAAGLATGNVSTAELQDATPAALVAHVTSRKCYGPSATSEKCPGNALEKGGKGSITEQLLNARADVTLGGGAKTFAETATAGEWQGKTLREQAQARGYQLVSDAASLNSVTEANQQKPLLGLFADGNMPVRWLGPKATYHGNIDKPAVTCTPNPQRNDSVPTLAQMTDKAIELLSKNEKGFFLQVEGASIDKQDHAANPCGQIGETVDLDEAVQRALEFAKKEGNTLVIVTADHAHASQIVAPDTKAPGLTQALNTKDGAVMVMSYGNSEEDSQEHTGSQLRIAAYGPHAANVVGLTDQTDLFYTMKAALGLKLEEEEEEE 458 T 6.8E-10 Alk_phosphatase unppssm F Bacteria T 3bdz 1 A,B A,B CINA_CITBR CYTOCHROME P450CIN TSLFTTADHYHTPLGPDGTPHAFFEALRDEAETTPIGWSEAYGGHWVVAGYKEIQAVIQNTKAFSNKGVTFPRYETGEFELMMAGQDDPVHKKYRQLVAKPFSPEATDLFTEQLRQSTNDLIDARIELGEGDAATWLANEIPARLTAILLGLPPEDGDTYRRWVWAITHVENPEEGAEIFAELVAHARTLIAERRTNPGNDIMSRVIMSKIDGESLSEDDLIGFFTILLLGGIDATARFLSSVFWRLAWDIELRRRLIAHPELIPNAVDELLRFYGPAMVGRLVTQEVTVGDITMKPGQTAMLWFPIASRDRSAFDSPDNIVIERTPNRHLSLGHGIHRCLGAHLIRVEARVAITEFLKRIPEFSLDPNKECEWLMGQVAGMLHVPIIFPKGKRLSE 397 T 1.3E-34 p450 pdbpercent F Bacteria T 3bef 3 C,F C,F PAR1_HUMAN PAR-1, THROMBIN RECEPTOR, COAGULATION FACTOR II RECEPTOR NDKYEPFWE 9 T 0.26 DUF5848 pdbhh F Eukaryota T 3bgm 3 C C KPCD2_HUMAN NPKC-D2 RQASLSISV 9 T 0.0054 TCAD9 unppercent F Eukaryota T 3bh9 3 C C POF1B_HUMAN PREMATURE OVARIAN FAILURE PROTEIN 1B RTYSGPMNKV 10 T 0.062 CCDC158 unphh F Eukaryota T 3bhb 3 C C N4BP2_HUMAN N4BP2, BCL-3-BINDING PROTEIN KMDSFLDMQL 10 T 15 Rrp15p pdbhh F Eukaryota T 3bim 2 B,D,F,H,J,L,N,P I,J,K,L,M,N,O,P BCOR_HUMAN BCOR GSRSEIISTAPSSWVVPGP 19 T 0.4 GPR15L pdbhh F Eukaryota T 3bin 2 B B CADM1_HUMAN IMMUNOGLOBULIN SUPERFAMILY MEMBER 4, NECTIN-LIKE PROTEIN 2, NECL-2, TUMOR SUPPRESSOR IN LUNG CANCER 1, TSLC-1, SYNAPTIC CELL ADHESION MOLECULE, SPERMATOGENIC IMMUNOGLOBULIN SUPERFAMILY, SGIGSF ARHKGTYFTHEA 12 T 0.16 DAG1 unphh F Eukaryota T 3bk9 1 A,B,C,D,E,F,G,H A,B,C,D,E,F,G,H T23O_XANCP TDO MPVDKNLRDLEPGIHTDLEGRLTYGGYLRLDQLLSAQQPLSEPAHHDEMLFIIQAQTSELWLKLLAHELRAAIVHLQRDEVWQCRKVLARSKQVLRQLTEQWSVLETLTPSEYMGFRDVLGPSSGFQSLQYRYIEFLLGNKNPQMLQVFAYDPAGQARLREVLEAPSLYEEFLRYLARFGHAIPQQYQARDWTAAHVADDTLRPVFERIYENTDRYWREYSLCEDLVDVETQFQLWRFRHMRTVMRVIGFKRGTGGSSGVGFLQQALALTFFPELFDVRTSVGVDNRPPQGSADAGKRLEHHHHHH 306 T 6.3E-41 Trp_dioxygenase unp F Bacteria T 3boo 2 B B N-Ac-CRATKML inhibitory peptide XCRATKML 8 T 12 DUF3136 pdbhh F T 3bp4 3 C C PPGB_HUMAN CATHEPSIN A, CARBOXYPEPTIDASE C, PROTECTIVE PROTEIN FOR BETA-GALACTOSIDASE IRAAPPPLF 9 T 20 DUF6023 pdbhh F Eukaryota T 3bqd 2 B B NCOA1_HUMAN NCOA-1, STEROID RECEPTOR COACTIVATOR 1, SRC-1, RIP160, PROTEIN HIN-2, RENAL CARCINOMA ANTIGEN NY-REN-52 AQQKSLLQQLLTE 13 T 1 GFD1 pdbhh F Eukaryota T 3bqo 2 B B TINF2_HUMAN TRF1-INTERACTING NUCLEAR PROTEIN 2 SHFNLAPLGRRRVQSQWASTR 21 T 1.9 COX7B pdbhh F Eukaryota T 3brd 4 D D LIN12_CAEEL ABNORMAL CELL LINEAGE PROTEIN 12 SPGNRTRKRRMINASVWMPPMENEEKNRK 29 T 0.039 OSTbeta unppercent F Eukaryota T 3brf 4 D D LIN12_CAEEL ABNORMAL CELL LINEAGE PROTEIN 12 SRMINASVWMPPME 14 T 0.039 OSTbeta unppercent F Eukaryota T 3brl 2 B C SWA_DROME Protein swallow 10-resiude peptide ATSAKATQTD 10 T 15 KxDL unp F Eukaryota T 3bts 2 C,D E,F GAL4_YEAST Regulatory protein GAL4 GMFNTTTMDDVYNYLFDDEDT 21 T 4.5 T6PP_N pdbhh F Eukaryota T 3bu3 2 B B IRS2_MOUSE IRS-2, 4PS AYNPYPEDYGDIEIG 15 T 12 STAT1_TAZ2bind pdbhh F Eukaryota T 3bu6 2 B B IRS2_MOUSE IRS-2, 4PS AYNPYPEDXGDIEIG 15 T 12 STAT1_TAZ2bind pdbhh F Eukaryota T 3bu8 2 C,D C,D TINF2_HUMAN TRF1-INTERACTING NUCLEAR PROTEIN 2 SFNLAPLGRRRVQSQWAST 19 T 2.7 COX7B pdbhh F Eukaryota T 3bua 2 E,F,G,H E,F,G,H DCR1B_HUMAN HSNM1B SEFRGLALKYLLTPVNFFQAGYSSRRFDQQVEKYHK 36 T 7.7 Sedlin_N pdbhh F Eukaryota T 3bum 1 A A SPY2_HUMAN SPRY-2 IRNTNEXTEGPTV 13 T 3.3 KAR9 unp F Eukaryota T 3bun 1 A A SPY4_HUMAN SPRY-4, SPROUTY-4 SHVENDXIDNPSL 13 T 0.88 MRP-L51 pdbhh F Eukaryota T 3buo 1 A,C A,C EGFR_HUMAN RECEPTOR TYROSINE-PROTEIN KINASE ERBB-1, EGFR DSFLQRXSSDPTG 13 T 1.2 DUF4348 pdbhh F Eukaryota T 3buw 1 A,C A,C KSYK_HUMAN SPLEEN TYROSINE KINASE TVSFNPXEPELAP 13 T 0.13 Herpes_UL36 pdbhh F Eukaryota T 3bux 1 A,C A,C MET_HUMAN HGF RECEPTOR, SCATTER FACTOR RECEPTOR, SF RECEPTOR, HGF/SF RECEPTOR, MET PROTO-ONCOGENE TYROSINE KINASE, C-MET SNESVDXRATFPE 13 T 3.8 MtaB pdbhh F Eukaryota T 3by7 1 A,B,C,D,E A,B,C,D,E uncharacterized protein GMKNIKIMRLVTGEDIIGNISESQGLITIKKAFVIIPMQATPGKPVQLVLSPWQPYTDDKEIVIDDSKVITITSPKDDIIKSYESHTSEIITPSGLITET 100 T 0.0012 Sm_like pdbpercent F T 3bze 3 I,J,K,L P,Q,R,S HLAG_HUMAN HLA G ANTIGEN VMAPRTLFL 9 T 0.00093 UL40 pdbhh F Eukaryota T 3bzf 3 C,F P,Q 1C07_HUMAN MHC CLASS I ANTIGEN CW*7 VMAPRALLL 9 T 0.095 UL40 pdbhh F Eukaryota T 3c0t 2 B B MED8_SCHPO MEDIATOR COMPLEX SUBUNIT 8, CELL SEPARATION PROTEIN SEP15 MEEQNANQMLTDILSFMKSGKRAAALEHHHHHH 33 T 30 YbeY pdbhh F Eukaryota T 3c2g 1 A,B A,B Q9XVI2_CAEEL Sys-1 protein MNITQAAEQAIRLWFNTPDPMQRLHMAKTIRTWIRQDKFAQVDQANMPNCVQQILNIIYDGLKPQPVQLPISYYAQLWYNLLDILRRFTFLPIISPYIHQVVQMFCPRENGPQDFRELICNLISLNWQKDPHMKHCANQVFQIFNCIIMGVKNEKLRTEFAQHLKFEKLVGTLSEYFNPQVHPGMINPAIFIIFRFIISKDTRLKDYFIWNNNPHDQPPPPTGLIIKLNAVMIGSYRLIAGQNPETLPQNPELAHLIQVIIRTFDLLGLLLHDSDAIDGFVRSDGVGAITTVVQYPNNDLIRAGCKLLLQVSDAKALAKTPLENILPFLLRLIEIHPDDEVIYSGTGFLSNVVAHKQHVKDIAIRSNAIFLLHTIISKYPRLDELTDAPKRNRVCEIICNCLRTLNNFLMMWIPTPNGETKTAGPNEKQQVCKFIEIDILKKLMSCLSCEGMDTPGLLELRSTILRSFILLLRTPFVPKDGVLNVIDENRKENLIGHICAAYSWVFRQPNNTRTQSTKQQLVERTISLLLVLMEQCGAEKEVAQYSYSIDCPLNLLNGNQVKPTFIHNVLVVCDKILEHCPTRADIWTIDRPMLEGLTNHRNSDIAKAANSLLSRFPEN 619 T 1.4E-05 Insc_C unphh F Eukaryota T 3c2g 2 C,D C,D POP1_CAEEL Pop-1 8-residue peptide GDEVKVFR 8 T 4.1 DUF5065 pdbhh F Eukaryota T 3c2p 2 C,D A,B RPOLV_BPN4 Virion RNA polymerase MGGSHHHHHHRSESTVTEELKEGIDAVYPSLVGTADSKAEGIKNYFKLSFTLPEEQKSRTVGSEAPLKDVAQALSSRARYELFTEKETANPAFNGEVIKRYKELMEHGEGIADILRSRLAKFLNTKDVGKRFAQGTEANRWVGGKLLNIVEQDGDTFKYNEQLLQTAVLAGLQWRLTATSNTAIKDAKDVAAITGIDQALLPEGLVEQFDTGMTLTEAVSSLAQKIESYWGLSRNPNAPLGYTKGIPTAMAAEILAAFVESTDVVENIVDMSEIDPDNKKTIGLYTITELDSFDPINSFPTAIEEAVLVNPTEKMFFGDDIPPVANTQLRNPAVRNTPEQKAALKAEQATEFYVHTPMVQFYETLGKDRILELMGAGTLNKELLNDNHAKSLEGKNRSVEDSYNQLFSVIEQVRAQSEDISTVPIHYAYNMTRVGRMQMLGKYNPQSAKLVREAILPTKATLDLSNQNNEDFSAFQLGLAQALDIKVHTMTREVMSDELTKLLEGNLKPAIDMMVEFNTTGSLPENAVDVLNTALGDRKSFVALMALMEYSRYLVAEDKSAFVTPLYVEADGVTNGPINAMMLMTGGLFTPDWIRNIAKGGLFIGSPNKTMNEHRSTADNNDLYQASTNALMESLGKLRSNYASNMPIQSQIDSLLSLMDLFLPDINLGENGALELKRGIAKNPLTITIYGSGARGIAGKLVSSVTDAIYERMSDVLKARAKDPNISAAMAMFGKQAASEAHAEELLARFLKDMETLTSTVPVKRKGVLELQSTGTGAKGKINPKTYTIKGEQLKALQENMLHFFVEPLRNGITQTVGESLVYSTEQLQKATQIQSVVLEDMFKQRVQEKLAEKAKDPTWKKGDFLTQKELNDIQASLNNLAPMIETGSQTFYIAGSENAEVANQVLATNLDDRMRVPMSIYAPAQAGVAGIPFMTIGTGDGMMMQTLSTMKGAPKNTLKIFDGMNIGLNDITDASRKANEAVYTSWQGNPIKNVYESYAKFMKNVDFSKLSPEALEAIGKSALEYDQRENATVDDIANAASLIERNLRNIALGVDIRHKVLDKVNLSIDQMAAVGAPYQNNGKIDLSNMTPEQQADELNKLFREELEARKQKVAKA 1117 T 0.0039 RNA_pol pdbhh T Viruses T 3c3g 1 A A alpha/beta peptide with the GCN4-pLI side chain sequence on an (alpha-alpha-beta) backbone XMKXIEXKLXEIXSKXYHXENXLAXIKXLLXER 33 T 0.0016 VGPC1_C pdbhh F T 3c3h 1 A A alpha/beta-peptide based on the GCN4-pLI side chain sequence, with an (alpha-alpha-beta) backbone and cyclic beta-residues at positions 1, 4, 10, 19, 22, and 28 XXMKXIEXKLXEIXSKXYHXENXLAXIKXLLXER 34 T 0.04 DUF5082 pdbpssm F T 3c3o 2 B B CHM4A_HUMAN CHROMATIN-MODIFYING PROTEIN 4A, CHMP4A, VACUOLAR PROTEIN-SORTING-ASSOCIATED PROTEIN 7-1, SNF7- 1, HSNF-1, SNF7 HOMOLOG ASSOCIATED WITH ALIX-2 DEEALKQLAEWVS 13 T 2.8 DUF3884 pdbhh F Eukaryota T 3c3q 2 B B CHM4B_HUMAN CHROMATIN-MODIFYING PROTEIN 4B, CHMP4B, VACUOLAR PROTEIN-SORTING-ASSOCIATED PROTEIN 7-2, SNF7- 2, HSNF7-2, SNF7 HOMOLOG ASSOCIATED WITH ALIX 1, HVPS32 KEEEDDDMKELENWAGSM 18 T 1.4 TMEM154 pdbhh F Eukaryota T 3c3r 2 B B CHM4C_HUMAN CHROMATIN-MODIFYING PROTEIN 4C, CHMP4C, VACUOLAR PROTEIN-SORTING-ASSOCIATED PROTEIN 7-3, SNF7- 3, HSNF7-3, SNF7 HOMOLOG ASSOCIATED WITH ALIX 3 EDDDIKQLAAWAT 13 T 1.1 Ribosomal_60s unppssm F Eukaryota T 3c5i 2 E E Cleaved fragment of N-terminal expression tag ENLYFQ 6 T 40 Phage_holin_2_4 pdbhh F T 3c94 2 B,C B,C A0A0H3GL04_KLEPH Single-stranded DNA-binding C-terminal tail peptide WMDFDDDIPF 10 T 0.36 Phage_SSB pdbhh F Bacteria T 3c9c 2 B B H4_DROME Histone H4, 27-residue peptide AKRHRKVLRDNIQGITKPAIRRLARRG 27 T 8.5E-08 CENP-T_C unp F Eukaryota T 3c9n 3 C C Peptide antigen VQQESSFVM 9 T 3.5 DUF1615 pdbhh F T 3cal 2 B,D B,D FNBA_STAA8 FNBPA XKGIVTGAVSDHTTVEDTKX 20 T 0.015 Fn_bind unppssm F Bacteria T 3cbl 2 B B Synthetic peptide XIYESL 6 T 89 NUC205 pdbhh F T 3cbm 2 B B ESR1_HUMAN ER, ESTRADIOL RECEPTOR, ER-ALPHA, NUCLEAR RECEPTOR SUBFAMILY 3 GROUP A MEMBER 1 IKRSKKNSLA 10 T 8.3 Chordopox_A30L pdbhh F Eukaryota T 3cc5 3 C,F C,F PMEL_HUMAN SILVER LOCUS PROTEIN HOMOLOG, MELANOCYTE LINEAGE-SPECIFIC ANTIGEN GP100, MELANOMA-ASSOCIATED ME20 ANTIGEN, ME20M, ME20-M KVPRNQDWL 9 T 3.2 ER pdbhh F Eukaryota T 3cch 3 C,F,I,L C,F,I,L nonameric peptide murine gp100 EGSRNQDWL 9 T 3.1 DUF5136 pdbhh F T 3cdw 2 B H POLG_CXB3N VPG GAYTGVPNQKPRVPTLRQAKVQ 22 T 6.8 DUF2111 pdbhh T Viruses T 3cfs 2 B E H4_HUMAN Histone H4 QGITKPAIRRLARRG 15 T 2.9 Phage_Cox pdbhh F Eukaryota T 3cfv 2 B,D E,F H4_HUMAN Histone H4 peptide DNIQGITKPAIRRLARRG 18 T 8.5E-08 CENP-T_C unp F Eukaryota T 3ch1 3 I,J,K,L C,F,I,L nonameric peptide chimeric gp100 EGPRNQDWL 9 T 2.8 APOC4 pdbhh F T 3cmr 1 A,B A,B PPB_ECOLI APASE TPEMPVLENRAAQGDITAPGGARRLTGDQTAALRDSLSDKPAKNIILLIGDGMGDSEITAARNYAEGAGGFFKGIDALPLTGQYTHYALNKKTGKPDYVTDSAASATAWSTGVKTYNGALGVDIHEKDHPTILEMAKAAGLATGNVSTAELQDATPAALVAHVTSSKCYGPSATSEKCPGNALEKGGKGSITEQLLNARADVTLGGGAKTFAETATAGEWQGKTLREQAQARGYQLVSDAASLNSVTEANQQKPLLGLFADGNMPVRWLGPKATYHGNIDKPAVTCTPNPQRNDSVPTLAQMTDKAIELLSKNEKGFFLQVEGASIDKQDHAANPCGQIGETVDLDEAVQRALEFAKKEGNTLVIVTADHAHASQIVAPDTKAPGLTQALNTKDGAVMVMSYGNSEEDSQEHTGSQLRIAAYGPHAANVVGLTDQTDLFYTMKAALGLK 449 T 6.8E-10 Alk_phosphatase unppssm F Bacteria T 3cnf 1 A,B A,B CAPSD_CPVBM VP1 MHSTNNNSNKRNNEEKHKQPEIDSSANNGEGTSGTRAQTVGDTATEAGVRNETEAGASTRRQTDGTGLSGTNAKIATASSARQADVEKPADVTFTIENVDDVGIMQQKKPPTVVQSRTDVFNEQFANEALHPTTKVIFNGLDVNTEVQPLSDDFKQISDPKGYLTYSVKYEDQFTKKDKLRASEADDRIVGPTVNLFKYGAAVVNIDLNRDFFDTATGIDLTKGIPLVQDLLVPIGVTAGAEQSAEYVSGLLMVLFKVMTDNRLVIVGETTTPMSNTLSTVVNNVLRTTYHNNVGVNPALLRDFTQVNWLNRDITNMLQQAGTKYGLGLTETRLDYVRLVKTIVGHALNIDHFAASVLNINLRALMEANVTADDRIKALQAHSMISTQFHGPNQGALRPELAFDHDHIIRCLMLAAANYPRLEGIIVQINTGYVASANVIRPVSEKRYFPENLEQNQSAARLVSAVKARASEADISSIHLAIAREVSPMFNVHELKKIAESFEDPSSIVVVLEFILFALFFPTEFNRIKGDIQNVLLLFFSRWYPVEYGIFVQRGATYTINAAGEFEFSGRNEKWDQALYLSEHFPALFSDVPLAGANTIIAIMRLFTPQGFLRTDDLAIAANFPRASRNPQTYIPYTNQRGTVTNEFASRFRTIVATLANVVNERAVQDDMQKATRSCTKQWLRHLETQFDNIAVAHTDHLSVVYATMSNFMLNFTNNFSGNHATFKPDQYVITSPEGSYKPIIERQGETVDGLTIIDTSIVWPILCQCTYPLVRQSGKGVDAVSIMEEIVYPDPSTTLSQSLSVAQVLSKLTLPDAFINMILSGGDSVVMRTYQTEADDDLDEGIRMTTYDQYLSHIRERLHITNVPDPIYITGASTPDQIAASVQATHVAVVLYQSGVINGPASTYLRENEVLVVMPDYYDVVSRFANANLQMNNNRYHESVLEIADIFDQADFIQTSDAVRQLRALMPTLSTSQIRHAIERIAQITDVDSTDYGKLTLRFLGTLTRSLKMQNAQIRRIRPDGTVLRYDDQIDIEAFRWSRYFLDELQLRRLSVGLRLITNPRIARRFNGVRIMYLTDDDPDPDFVPDVPEGYVAVQYAHRLFSSSLANKRNRVTYTHPPTGMAYPSPTGRPHVHMTINERAGMSKLVADNIIASVIKSNWVVDILDIEYTAEVMTPSEGYTQHVDAESIMTAPKGKLFHLQFMDGLLRPEPSAFDPPASGEDMRLIYPLQPISVARSMRAIVNHNEVDRPRGAVAPSSYEMDTGTLSRNGDLLYSPVANGQVGIPKLEVDHISFSNVVSMMTANIRTGDDMAVERVNPDDVRAINIRNA 1333 T 130 SRP19 pdbhh T Viruses T 3cnf 2 C T Q9E957_CPVBM VP3 MWHYTSINNDTRVALDPKPNQIRTITKPNTVPQLGTDYLYTFNSQRRSHTLRLLGPFQYFNFSETDRGHPLFRLPLKYPSKAIPADELIDNLHSWMRSVHLLHVRSEDNTLRYNWMLGVYARSTNYTTPVGQLVVNAPAILNYSNPQDAFNSVFVALGIDYIDIPITNSNIFDDSSTPYNVRIWHAPTMTEVNHILALMRKSTLVSTHSSWHWDVLHTFHYRSESDMIDHFAAKILEDWRQKEKLDKGALVEADRVVQRLIPLSSSTYVQRLAAIGALYPNEFTENVLDLSRLSTALLQLSDTYYQHANDQLRRLYRRMYNDSRTLYMTQRHQELLLAQITADPNILLYPYTYIFTTAYTSMNYISNTGQGRIKHSLAVTGTTEHTIADITLGPMSEDVVTISMVEPMSIAAEDMYGYVLDTPTRDIWPADEQIEQKGDAVALYDTKTSRALGMFNNTVRIDDLLSPLLGLVYRTYIKGDTMTMTQGSLDHLTLCAAVDSDITFVGNRMIAPLAEGYIPKAMHRNNSTMKMLSLYVALKKLENFTTNSYLMAPDTSIILLGAEREPAVSILRRFNRSVSNVRIIGMGDRAVEPNIRVRVPFPIDKNISADFIICDINSYEDQSFESMFGETISVVTTCASAATRVLVKINHPSEYMINSVIERLSQLGGVFYHTALLKTASQNPYSYETYIYITPIAAAVRFPFYSNSAIINRYMTAVADDETPIIPSIHTVIKGHSNTYSPGLFCGCIDVQSAPFALSQLKSYCSEATTWRVDSDDNLVNIIARIDPARIALEFRTRSNTSAYHEYQRYVPNGLGFKGRKTREFRYIHREVTFIHKLMTYALIREQISLTENMTQVVSIGGRNLADISVVPLNMKYVVIDPATRIETLTQEKKNIEVQSRPFSFDAASMDLENNSIYLFIAVIMNEPNGAATPARTQMDKIRNVATAMLTRTNCVAYISFYEAGIITRLDQSTAHKTIRVEEGRLKVANYVPVDTLVEADVTLMLRDIGITHEIIRPSTPELINACSNYGIRLGSTGGAVLDVFNHYSPVIKLVRS 1057 T 0.32 Ig_mannosidase pdbpssm T Viruses T 3cpl 3 C,F E,F NP366 peptide ASNENAETM 9 T 26 DUF4690 pdbhh F T 3cpx 1 A,B,C A,B,C Aminopeptidase, M42 family MGSDKIHHHHHHENLYFQGMQLLKELCSIHAPSGNEEPLKDFILEYIRSNAGSWSYQPVIYADNDLQDCIVLVFGNPRTAVFAHMDSIGFTVSYNNHLHPIGSPSAKEGYRLVGKDSNGDIEGVLKIVDEEWMLETDRLIDRGTEVTFKPDFREEGDFILTPYLDDRLGVWTALELAKTLEHGIIAFTCWEEHGGGSVAYLARWIYETFHVKQSLICDITWVTEGVEAGKGVAISMRDRMIPRKKYVNRIIELARQTDIPFQLEVEGAGASDGRELQLSPYPWDWCFIGAPEKDAHTPNECVHKKDIESMVGLYKYLMEKL 321 T 1.7E-14 Peptidase_M42 pdbpssm F T 3cu8 2 C,D P,Q RAF1_HUMAN RAF-1, C-RAF, CRAF RSTSTPNVH 9 T 46 ALC pdbhh F Eukaryota T 3cv0 2 B B G6PI_TRYBB T. brucei PGI PTS1 peptide Ac-FNELSHL FNELSHL 7 T 1.7 ATG9 pdbhh F Eukaryota T 3cvf 1 A,B,C,D A,B,C,D HOME3_HUMAN HOMER-3 GSHMAAEREETQQKVQDLETRNAELEHQLRAMERSLEEARAERERARAEVGRAAQLLDVSLFELSELREGLARLAEAAP 79 T 0.00048 Cast unppercent F Eukaryota T 3cvo 1 A,B,C,D A,B,C,D Q5LRV1_RUEPO Methyltransferase-like protein of unknown function GMDDQSGDQMRPELTMPPAEAEALRMAYEEAEVILEYGSGGSTVVAAELPGKHVTSVESDRAWARMMKAWLAANPPAEGTEVNIVWTDIGPTGDWGHPVSDAKWRSYPDYPLAVWRTEGFRHPDVVLVDGRFRVGCALATAFSITRPVTLLFDDYSQRRWQHQVEEFLGAPLMIGRLAAFQVEPQPIPPGSLMQLIRTMTSP 202 T 0.0049 Methyltransf_24 pdbpssm F Bacteria T 3cvp 2 B B 10-SKL PTS1 peptide Ac-GTLSNRASKL GTLSNRASKL 10 T 12 DUF2434 pdbhh F T 3cvq 2 B B PTS1 peptide 7-SKL (Ac-SNRWSKL) XNRWSKL 7 T 3.9 PilI pdbhh F T 3d1e 2 C P decamer from polymerase II C-terminal TLMTGQLGLF 10 T 18 Chlorosome_CsmC pdbhh F T 3d1f 2 C,D P,Q Nonapeptide from polymerase III C-terminal SEQVELEFD 9 T 2.8 DUF4462 pdbhh F T 3d24 2 B,D B,D PRGC1_HUMAN PPAR-GAMMA COACTIVATOR 1-ALPHA, PPARGC-1-ALPHA, PGC-1-ALPHA, LIGAND EFFECT MODULATOR 6 QQQKPQRRPCSELLKYLTTNDD 22 T 0.78 HPIP pdbhh F Eukaryota T 3d25 3 C C HMHA1_HUMAN MINOR HISTOCOMPATIBILITY ANTIGEN HA-1, MHAG HA-1 VLHDDLLEA 9 T 7.1 Flu_M1_C pdbhh F Eukaryota T 3d32 2 C,D C,D K1 peptide DATYTWEHLAWPX 13 T 1.2 DUF4172 pdbhh F T 3d39 3 C C Modified HTLV-1 TAX (Y5(4fluoro)F) peptide LLFGXPVYV 9 T 0.35 YvrJ pdbhh F T 3d4b 2 B D Acetyl P53 peptide TSRHKXLMA 9 T 12 AHD pdbhh F T 3d81 2 B C S-alkylamidate intermediate SRHKXLMF 8 T 8 DUF420 pdbhh F T 3d8a 2 I,J,K,L,M,N,O,P S,T,U,V,W,X,Y,Z TRAD1_ECOLI Protein traD GEDVEPGDDF 10 T 0.89 Taq-exonuc pdbhh F Bacteria T 3d9t 2 C,D C,D CASP9_HUMAN CASP-9, ICE-LIKE APOPTOTIC PROTEASE 6, ICE-LAP6, APOPTOTIC PROTEASE MCH-6, APOPTOTIC PROTEASE-ACTIVATING FACTOR 3, APAF-3, CASPASE-9 SUBUNIT P35, CASPASE-9 SUBUNIT P10 ATPFQE 6 F F Eukaryota T 3da9 3 C D Hirudin peptide DFEEIPGEX 9 T 0.014 Hirudin pdbhh F T 3dda 2 B B SNP25_HUMAN SNAP-25, SYNAPTOSOMAL-ASSOCIATED 25 KDA PROTEIN, SUPER PROTEIN, SUP QRATKMX 7 F F Eukaryota T 3ddb 2 B B SNP25_HUMAN SNAP-25, SYNAPTOSOMAL-ASSOCIATED 25 KDA PROTEIN, SUPER PROTEIN, SUP RRATKMX 7 F F Eukaryota T 3dep 2 B B LHCP L18 REGION YPGGSFDPLGLA 12 T 0.0011 Chloroa_b-bind pdbhh F T 3dfe 1 A,B,C,D,E,F A,B,C,D,E,F Q3M8P8_ANAVT Putative Pii-Like Signaling Protein GMSKRANKLVIVTEKVLLKKVAKIIEEAGATGYTVVDTGGKGSRNVRSTGKPNTSDTDSNVKFEVLTENREMAEKIADQVAIKFFTDYAGIIYICEAEVLYGRTFCGPDGC 111 T 0.0009 P-II unppercent F Bacteria T 3dgj 1 A A NNFGAIL peptide NNFGAIL 7 T 5.3 SidC_N pdbhh F T 3dgo 1 A A ATP Binding Protein-DX GSMDYKDDDDKKTNWLKRIYRVRPCVKCKVAPRDWKVKNKHLRIFNMCKTCFNNSIDIGDDTYHGHVDWLMYADSKEISNT 81 T 0.025 Glyco_tranf_2_4 pdbpssm F T 3diw 2 C,D C,D CTNB1_HUMAN BETA-CATENIN NQLAWFDTDL 10 T 3 Tobravirus_2B pdbhh F Eukaryota T 3dks 2 E,F E,F siga peptide XPIPFLXQKD 10 T 1.1 DUF5450 pdbhh F T 3dkt 2 K,L,M,N,O,P,Q,R,S,T K,L,M,N,O,P,Q,R,S,T Q9WZP3_THEMA FERRITIN-LIKE PROTEIN GGDLGIRK 8 T 21 CDCA pdbhh F Bacteria T 3dm1 2 B,D,F,H B,D,F,H EHMT2_HUMAN HISTONE H3-K9 METHYLTRANSFERASE 3, H3-K9-HMTASE 3, EUCHROMATIC HISTONE-LYSINE N-METHYLTRANSFERASE 2, HLA-B-ASSOCIATED TRANSCRIPT 8, PROTEIN G9A, LYSINE N-METHYLTRANSFERASE 1C KVHRARKTMSKP 12 T 2 RNA_pol_Rpb5_N pdbhh F Eukaryota T 3dm7 1 A,B A,B VPS75_YEAST Vacuolar protein sorting-associated protein 75 GSMMSDQENENEHAKAFLGLAKCEEEVDAIEREVELYRLNKMKPVYEKRDAYIDEIAEFWKIVLSQHVSFANYIRASDFKYMDTIDKIKVEWLALESEMYDTRDFSITFHFHGIEGDFKEQQVTKVFQIKKGKDDQEDGILTSEPVPIEWPQSYDSINPDLMKDKRSPEGKKKYRQGMKTIFGWFRWTGLKPGKEFPHGDSLASLFSEEIYPFCVKYYAEAQRDLEDEEGESGL 234 T 0.0013 NAP pdbpercent F Eukaryota T 3dnj 2 C,D C,D synthetic N-end rule peptide YLFVQRDSKE 10 T 0.97 DUF4642 pdbhh F T 3dpc 1 A,B A,B PPB_ECOLI APASE TPEMPVLENRAAQGDITAPGGARRLTGDQTAALRDSLSDKPAKNIILLIGDGMGDSEITAARNYAEGAGGFFKGIDALPLTGQYTHYALNKKTGKPDYVTDLAASATAWSTGVKTYNGALGVDIHEKDHPTILEMAKAAGLATGNVSTAELQDATPAALVAHVTSRKCYGPSATSEKCPGNALEKGGKGSITEQLLNARADVTLGGGAKTFAETATAGEWQGKTLREQAQARGYQLVSDAASLNSVTEANQQKPLLGLFADGNMPVRWLGPKATYHGNIDKPAVTCTPNPQRNDSVPTLAQMTDKAIELLSKNEKGFFLQVEGASIDKQDHAANPCGQIGETVDLDEAVQRALEFAKKEGNTLVIVTADHAHASQIVAPDTKAPGLTQALNTKDGAVMVMSYGNSEEDSQEHTGSQLRIAAYGPHAANVVGLTDQTDLFYTMKAALGLKHHHHHH 455 T 1.1E-10 Alk_phosphatase pdbpssm F Bacteria T 3dpc 2 C C Phosphorylated Peptide HATPPKKEAD 10 T 33 Feld-I_B pdbhh F T 3dpo 2 C,D C,D PYRRH_PYRAP inhibitor peptide VDKLYXLPRPT 11 T 2.3 PHtD_u1 pdbhh F Eukaryota T 3dpp 2 C,D C,D PYRRH_PYRAP inhibitor peptide VDKLYXLPRPTPPRPIYNRN 20 T 2.5 Apidaecin unphh F Eukaryota T 3dpy 3 C C caged substrate TKCVIM 6 T 3.1 Plk4_PB2 pdbhh F T 3ds9 2 B B octapeptide I1 inhibitor XRWTXMLG 8 T 0.55 Lentiviral_Tat pdbhh F T 3dt5 1 A A Y924_ARCFU Uncharacterized protein AF_0924 GHSNRQVQLMARQQRLKAIEDRLEKFYIPLIKAFSSYVYTAQTEDEIETIITCRRYLAGNNLLRVLPMHFKFKADKIAGSANWTFYAKEDFEQWKEALDVLWEEFLEVLKEYYTLSGTEISLPEKPDWLIGYKGS 135 T 0.03 RE_HaeIII pdb F Archaea T 3dtx 3 C C VIPR1_HUMAN Double citrullinated vasoactive intestinal polypeptide receptor RRKWXXWHL 9 F F Eukaryota T 3dvp 2 C,D C,D PAK1_HUMAN P21 activated Kinase peptide TPTRDVATSP 10 T 1.4 TFIIA unppercent F Eukaryota T 3dw8 2 B,E B,E 2ABA_HUMAN PP2A, SUBUNIT B, B-ALPHA ISOFORM, PP2A, SUBUNIT B, B55-ALPHA ISOFORM, PP2A, SUBUNIT B, PR55-ALPHA ISOFORM, PP2A, SUBUNIT B, R2-ALPHA ISOFORM MAGAGGGNDIQWCFSQVKGAVDDDVAEADIISTVEFNHSGELLATGDKGGRVVIFQQEQENKIQSHSRGEYNVYSTFQSHEPEFDYLKSLEIEEKINKIRWLPQKNAAQFLLSTNDKTIKLWKISERDKRPEGYNLKEEDGRYRDPTTVTTLRVPVFRPMDLMVEASPRRIFANAHTYHINSISINSDYETYLSADDLRINLWHLEITDRSFNIVDIKPANMEELTEVITAAEFHPNSCNTFVYSSSKGTIRLCDMRASALCDRHSKLFEEPEDPSNRSFFSEIISSISDVKFSHSGRYMMTRDYLSVKVWDLNMENRPVETYQVHEYLRSKLCSLYENDCIFDKFECCWNGSDSVVMTGSYNNFFRMFDRNTKRDITLEASRENNKPRTVLKPRKVCASGKRKKDEISVDSLDFNKKILHTAWHPKENIIAVATTNNLYIFQDKVN 447 T 0.11 ANAPC4_WD40 unppercent F Eukaryota T 3dyc 1 A,B A,B PPB_ECOLI APASE TPEMPVLENRAAQGDITAPGGARRLTGDQTAALRDSLSDKPAKNIILLIGDGMGDSEITAARNYAEGAGGFFKGIDALPLTGQYTHYALNKKTGKPDYVTDSAASATAWSTGVKTYNGALGVDIHEKDHPTILEMAKAAGLATGNVSTAELQDATPAALVAHVTSRKCYGPSATSEKCPGNALEKGGKGSITEQLLNARADVTLGGGAKTFAETATAGEWQGKTLREQAQARGYQLVSDAASLNSVTEANQQKPLLGLFADGNMPVRWLGPKATYHGNIDKPAVTCTPNPQRNDSVPTLAQMTDKAIELLSKNEKGFFLQVYGASIDKQDHAANPCGQIGETVDLDEAVQRALEFAKKEGNTLVIVTADHAHASQIVAPDTKAPGLTQALNTKDGAVMVMSYGNSEEDSQEHTGSQLRIAAYGPHAANVVGLTDQTDLFYTMKAALGLK 449 T 6.8E-10 Alk_phosphatase unppssm F Bacteria T 3dze 1 A A ATP5S_BOVIN ATP SYNTHASE-COUPLING FACTOR B, MITOCHONDRIAL ATP SYNTHASE REGULATORY COMPONENT FACTOR B SFWEWLNAVFNKVDHDRIRDVGPDRAASEWLLRCGAMVRYHGQQRWQKDYNHLPTGPLDKYKIQAIDATDSCIMSIGFDHMEGLQYVEKIRLCKCHYIEDGCLERLSQLENLQKSMLEMEIISCGNVTDKGIIALHHFRNLKYLFLSDLPGVKEKEKIVQAFKTSLPSLELKLDLK 176 T 0.0023 FBXL18_C pdbhh F Eukaryota T 3e08 1 A,B,C,D,E,F,G,H A,B,C,D,E,F,G,H T23O_XANCP Tryptophan 2,3-dioxygenase MPVDKNLRDLEPGIHTDLEGRLTYGGYLRLDQLLSAQQPLSEPAHHDEMLFIIQSQTSELWLKLLAHELRAAIVHLQRDEVWQCRKVLARSKQVLRQLTEQWSVLETLTPSEYMGFRDVLGPSSGFQSLQYRYIEFLLGNKNPQMLQVFAYDPAGQARLREVLEAPSLYEEFLRYLARFGHAIPQQYQARDWTAAHVADDTLRPVFERIYENTDRYWREYSLCEDLVDVETQFQLWRFRHMRTVMRVIGFKRGTGGSSGVGFLQQALALTFFPELFDVRTSVGVDNRPPQGSADAGKR 298 T 6.3E-41 Trp_dioxygenase unp F Bacteria T 3e0m 2 E,F,G E,F,G Short peptide SHMAEI SHMAEI 6 T 76 Nt_Gln_amidase pdbhh F T 3e1k 2 B,D,F,H,J,L,N,P B,D,F,H,J,L,N,P LAC9_KLULA Lactose regulatory protein LAC9 TQQLFNTTTMDDVYNYIFDNDE 22 T 1.4 BTP pdbhh F Eukaryota T 3e1r 2 C C PDC6I_HUMAN PDCD6-INTERACTING PROTEIN, ALG-2-INTERACTING PROTEIN 1, HP95 QAQGPPYPTYPGY 13 T 1.1 Antimicrobial_5 pdbhh F Eukaryota T 3e2b 2 B C SWA_DROME Protein swallow 16-residue peptide MYHIRSATSAKATQTD 16 T 0.0069 KASH_CCD unppercent F Eukaryota T 3e2j 1 A,B,C,D A,B,C,D ATP5S_BOVIN ATP SYNTHASE-COUPLING FACTOR B, MITOCHONDRIAL ATP SYNTHASE REGULATORY COMPONENT FACTOR B SFWGWLNAVFNKVDHDRIRDVGPDRAASEWLLRCGAMVRYHGQQRWQKDYNHLPTGPLDKYKIQAIDATDSCIMSIGFDHMEGLQYVEKIRLCKCHYIEDGCLERLSQLENLQKSMLEMEIISCGNVTDKGIIALHHFRNLKYLFLSDLPGVKEKEKIVQAFKTSLPSLELKLDLK 176 T 0.0023 FBXL18_C pdbhh F Eukaryota T 3e2n 1 A A APX1_PEA;CCPR_YEAST CCP TTPLVHVASVEKGRSYEDFQKVYNAIALKIAEKKCGPVLVRLAWHTSGTWDKHDNTGGSYGGTYRFKKEFNDPSNAGLQNGFKFLEPIHKEFPWISSGDLFSLGGVTAVQEMQGPKIPWRCGRVDTPEDTTPDNGRLPDADKDADYVRTFFQRLNMNDREVVALMGAHALGKTHLKRSGYEGPFGAANNVFTNEFYLNLLNEDWKLEKNDANNEQWDSKSGYMMLPTDYSLIQDPKYLSIVKEYANDQDKFFKDFSKAFEKLLENGITFPKDAPSPFIFKTLEEQGL 287 T 3E-05 peroxidase pdbpssm F Eukaryota T 3e39 1 A,B A,B Q314Q8_DESAG Putative Nitroreductase GMLTENPVLQAIRQRRSIRRYTDEAVSDEAVRLILEAGIWAPSGLNNQPCRFLVIRADDPRCDILAAHTRYGHIVRGAKVIILVFLDREAMYNEVKDHQAAGAAVQNMLLAAHALQLGAVWLGEIINQAATLLPALALDPARLSFEAAIAAGHPAQNGSSSRRPLAELLLEEPFPQPE 178 T 5.1E-20 Nitroreductase pdbpercent F Bacteria T 3e8u 3 C P BNP peptide epitope GVQGSGAFGRG 11 T 0.27 Mannitol_dh pdbhh F T 3ebb 2 E,F,G,H E,F,G,H TERA_HUMAN 15S MG(2+)-ATPASE P97 SUBUNIT, VALOSIN-CONTAINING PROTEIN, VCP TEDNDDDLYG 10 T 17 DUF228 pdbhh F Eukaryota T 3ech 2 C C Q9HXS2_PSEAE 25-mer fragment of protein ArmR RRDYTEQLRRAARRNAWDLYGEHFY 25 T 0.68 PLD_C pdbhh F Bacteria T 3efd 3 C K KcsA SEKAAEEAYTRTTRALHERFDRLERMLDDN 30 T 1.1 PspB pdbhh F T 3eg6 2 B C KMT2A_HUMAN MLL-1 peptide XGSARAEVHLRKS 13 T 1.1 N-SET unphh F Eukaryota T 3eg9 3 C C GOSR2_HUMAN peptide TTIPMDS 7 T 0.044 SLX9 unp F Eukaryota T 3ejh 2 B,D E,F CO1A1_HUMAN Collagen type-I a1 chain GQRGVVGLPGQRGERGFPGLPGY 23 T 0.00026 Collagen pdb F Eukaryota T 3emh 2 B B KMT2A_HUMAN MLL1 ARAEVHLRKSAFD 13 T 1.1 N-SET unphh F Eukaryota T 3eqs 2 B B 12-mer peptide inhibitor TSFAEYWNLLSP 12 T 0.051 P53_TAD pdbhh F T 3era 1 A,B A,B 3S1EA_LATSE ERABUTOXIN A RICFNHQTSQPQTTKTCSPGESSCYNKQWSDFRGTIIERGCGCPTVKPGIKLSCCESEVCNN 62 T 0.003 Toxin_TOLIP pdb F Eukaryota T 3es5 1 A,A2,A3,A4,A5,B,B2,B3,B4,B5 A,A,A,A,A,B,A,B,A,B Q4G3H1_9VIRU Putative capsid protein MSFETSEGMSRPGDNPNKLNAKPRQSARPKTRNSTAQSNQTMRLGWIDPLPQVDTIFPLGLEPNVESIPAGEVELDFNLPETIAKPFADTVTSVGDRIQLVDDDKENIATSIYGLSFFKAARQLYSTMLDHEKAVNQPLKAVYYDETPIPAHMSGALGIIGHMKTKVGDVLVKDAGVLFKRGTAAGVTKFSEIDNDKTWNLDCSKLVWADHSSLSMIKRLASEKISQLVKQRYRVTDAQGHVYSVSMPQLTDQALPDYYDSIPDVAPNSDQLRVLTAALQMSLAQFRNDELPHDEDRSDLLTTLDLLYADGAYEISALRDQFELLMARYTTDFKWRVESIFKVGPPPAGTTGYGAQTVSSTGNTARWQFPLSDADINIGYLFSPSKSFSLFPKMVGYSKRAREDASASFANSDAKKFYAD 420 T 0.023 DUF5463 pdbpssm T Viruses T 3esk 2 B B HSP7C_HUMAN HEAT SHOCK 70 KDA PROTEIN 8 GASSGPTIEEVD 12 T 6.7 DUF4028 pdbhh F Eukaryota T 3etb 2 E,F,G,H J,K,L,M PAG_BACAN PA, PA-83, PA83, ANTHRAX TOXINS TRANSLOCATING PROTEIN [CONTAINS: PROTECTIVE ANTIGEN PA-20 AND PROTECTIVE ANTIGEN PA-63] RDKRFHYDRNNIAVGADESVVKEAHREVINSSTEGLLLNIDKDIRKILSGYIVEIEDTEGLKEVINDRYDMLNISSLRQDGKTFIDFKKYNDKLPLYISNPNYKVNVYAVTKENTIINPSENGDTSTNGIKKILIFSKKGYEIG 144 T 0.12 Fve pdbhh F Bacteria T 3eu7 2 B X BRCA2_HUMAN BRCA2, FANCONI ANEMIA GROUP D1 PROTEIN KADLGPISLNWFEELSSEA 19 T 0.091 DNAP_B_exo_N pdb F Eukaryota T 3eyf 3 E,F E,F GB_HCMVT Synthetic peptide ETIYNTTLKYX 11 T 12 HCMVantigenic_N unphh T Viruses T 3eys 3 C Q pyro-Glu3-A-Beta (3-8) peptide QFRHDS 6 T 59 Importin_rep pdbhh F T 3f58 3 C P V3 LOOP SIGPGRAFGGG 11 T 0.24 CRISPR_assoc pdbhh F T 3f7d 2 B B PRGC1_MOUSE PPAR-GAMMA COACTIVATOR 1-ALPHA, PPARGC-1-ALPHA, PGC-1-ALPHA EEPSLLKKLLLAPA 14 T 7 Cnl2_NKP2 pdbhh F Eukaryota T 3f7f 1 A,B,C,D A,B,C,D NU120_YEAST NUCLEAR PORE PROTEIN NUP120 MACLSRIDANLLQYYEKPEPNNTVDLYVSNNSNNNGLKEGDKSISTPVPQPYGSEYSNCLLLSNSEYICYHFSSRSTLLTFYPLSDAYHGKTINIHLPNASMNQRYTLTIQEVEQQLLVNVILKDGSFLTLQLPLSFLFSSANTLNGEWFHLQNPYDFTVRVPHFLFYVSPQFSVVFLEDGGLLGLKKVDGVHYEPLLFNDNSYLKCLTRFFSRSSKSDYDSVISCKLFHERYLIVLTQNCHLKIWDLTSFTLIQDYDMVSQSDSDPSHFRKVEAVGEYLSLYNNTLVTLLPLENGLFQMGTLLVDSSGILTYTFQNNIPTNLSASAIWSIVDLVLTRPLELNVEASYLNLIVLWKSGTASKLQILNVNDESFKNYEWIESVNKSLVDLQSEHDLDIVTKTGDVERGFCNLKSRYGTQIFERAQQILSENKIIMAHNEDEEYLANLETILRDVKTAFNEASSITLYGDEIILVNCFQPYNHSLYKLNTTVENWFYNMHSETDGSELFKYLRTLNGFASTLSNDVLRSISKKFLDIITGELPDSMTTVEKFTDIFKNCLENQFEITNLKILFDELNSFDIPVVLNDLINNQMKPGIFWKKDFISAIKFDGFTSIISLESLHQLLSIHYRITLQVLLTFVLFDLDTEIFGQHISTLLDLHYKQFLLLNLYRQDKCLLAEVLLKDSSEFSFGVKFFNYGQLIAYIDSLNSNVYNASITENSFFMTFFRSYII 729 T 0.21 Nup160 pdbpssm F Eukaryota T 3faj 1 A A Y131_ATV ORF131 MGSSHHHHHHSSGLVPRGSHMAKYEPKKGDYAGGAVKILDMFENGQLGYPEVTLKLAGEEANARRAGDERTKEAIHAIVKMISDAMKPYRNKGSGFQSQPIPGEVIAQVTSNPEYQQAKAFLASPATQVRNIEREEVLSKGAKKLAQAMAS 151 T 0.15 Mononeg_RNA_pol unppssm T Viruses T 3fbd 1 A,D A,D CEA7_ECOLX Colicin-E7 SKRNKPGKATGKGKPVNNKWLNNAGKDLGSPVPDRIANKLRDKEFKSFQDFRKKFWEEVSKDPELSKQFSRNNNDRMKVGKAPKTRTQDVSGKRTSFELHHEKPISQNGGVYDMDNISVVTPKRHIDIHRGK 132 T 0.038 HNH pdbpercent F Bacteria T 3fdm 2 D,E,F D,E,F alpha/beta-peptide foldamer XXAXRXLXKXGDAFNRX 17 T 7.9 Bclx_interact pdbhh F T 3fdo 2 B B Synthetic high affinity peptide LTFEHYWAQLTS 12 T 1.4 ASXH pdbhh F T 3fe7 2 B L p53-peptidomimetic Ac-Phe-Met-Aib-Pmp-Trp-Glu-Ac3c-Leu-NH2 XFMXXWEXLX 10 T 2.2 CNTF pdbhh F T 3fhv 2 C C cftr peptide SXDXDNKEXX 10 T 0.75 DUF3243 pdbhh F T 3fma 2 F,G,H,I,J L,M,N,O,P BBP_YEAST SPLICING FACTOR 1, ZINC FINGER PROTEIN BBP, MUD SYNTHETIC-LETHAL 5 PROTEIN SSIAPPPGLSG 11 T 1.1 HMMR_N pdbhh F Eukaryota T 3fn0 3 C P Envelope polyprotein gp160 WNWFDITNK 9 T 0.22 Tna_leader pdbhh F T 3fn2 1 A,B A,B Putative sensor histidine kinase domain SNANGYTMQRDNQKTLAVYMFEEINRDVEYLSGRLSEKELKDKYRYYGRGYVRITDKDGQVITYEDGSVQDKTVFLTNEGANKLGWKLEFLIDEKMFEEEILEKQN 106 T 0.19 MucB_RseB pdbpssm F T 3fol 3 C P 8 residue synthetic peptide VNDIFERI 8 T 0.58 WASH_WAHD pdbhh F T 3fon 3 C,F P,E Peptide VNDIFEAI 8 T 2.1 PRC2_HTH_1 pdbhh F T 3fp2 2 B Q HSP82_YEAST HEAT SHOCK PROTEIN HSP90 HEAT-INDUCIBLE ISOFORM, 82 KDA HEAT SHOCK PROTEIN EVPADTEMEEVD 12 T 25 UbiD pdbhh F Eukaryota T 3fp4 2 B Q HSP71_SCHPO Ssa1 GADNGPTVEEVD 12 T 5.8 CBP_CCPA pdbhh F Eukaryota T 3fqn 3 C C CTNB1_HUMAN peptide 30-39 from beta-Catenin: YLDSGIHSGA YLDSGIHSGA 10 T 2.3 DUF3094 pdbhh F Eukaryota T 3fqt 3 C C MPIP2_HUMAN peptide 38-46 from cell division cycle 25b (CDC25b): GLLGSPVRA GLLGSPVRA 9 T 7.7 Lep_receptor_Ig pdbhh F Eukaryota T 3fqw 3 C C IRS2_HUMAN peptide 1097-1105 from insulin receptor substrate 2 (IRS2): RVASPTSGV RVASPTSGV 9 T 3.2 Frataxin_Cyay pdbhh F Eukaryota T 3ft2 3 C P citrulline variant HA-1 peptide VLXDDLLEA 9 T 13 Trypco2 pdbhh F T 3ft4 3 C P arginine variant HA-1 peptide VLRDDLLEA 9 T 13 Trypco2 pdbhh F T 3ftg 3 C C NP366-N3A variant peptide from influenza virus ASAENMETM 9 T 26 DUF1128 pdbhh F T 3fvh 2 B B Acetyl-Leu-His-Ser-phosphoThr-Ala-NH2 peptide XLHSTAX 7 T 500 HEPN_AbiA_CTD pdbhh F T 3fwg 1 A,B A,B CPXA_PSEPU CYTOCHROME P450-CAM, P450CAM NLAPLPPHVPEHLVFDFDMYNPSNLSAGVQEAWAVLQESNVPDLVWTRCNGGHWIATRGQLIREAYEDYRHFSSECPFIPREAGEAYDFIPTSMDPPEQRQFRALANQVVGMPVVDKLENRIQELACSLIESLRPQGQCNFTEDYAEPFPIRIFMLLAGLPEEDIPHLKYLTDQMTRPDGSMTFAEAKEALYDYLIPIIEQRRQKPGTDAISIVANGQVNGRPITSDEAKRMCGLLLVGGLDTVVNFLSFSMEFLAKSPEHRQELIQRPERIPAACEELLRRFSLVADGRILTSDYEFHGVQLKKGDQILLPQMLSGLDERENACPMHVDFSRQKVSHTTFGHGSHLCLGQHLARLQIIVTLKEWLTRIPDFSIAPGAQIQHKSGIVSGVQALPLVWDPATTKAV 405 T 1.6E-05 p450 unppercent F Bacteria T 3fxd 2 B,D B,D Q5ZYC9_LEGPH Protein IcmR EIGEPDVTDATLGSVYSEIISPVKDCILTVAKAVSFNPGGKDNTDAVEVLTELNTKVERAALNQPILTTKTER 73 T 2.8 MOSC_N pdbhh F Bacteria T 3fxe 2 B B Q5ZYC9_LEGPH Protein IcmR EIGEPDVTDATLGSVYSEIISPVKDCILTVAKAVSFNPGGKDNTDAVEVLTELNTKVERAAMNQPILTTKTER 73 T 2.8 MOSC_N pdbhh F Bacteria T 3fxh 1 A A B0BHE4_9BACT Integron gene cassette protein HFX_CASS2 MGSSHHHHHHSSGRENLYFQGMNNKHATSAVHEIIREICRLVDSGHSMTRDQFHELSEQERFIAFLAEKYSSTIKLYYLADSSPLFEKDTSSFIENAFGRHANTVVMEDFGLKSNALLLAINICLAILREINGEV 135 T 0.073 WASH-7_C unppercent F Bacteria T 3fxx 2 B B peptide substrate KQWDNYEXIW 10 T 0.12 DUF3896 pdbhh F T 3fy2 2 B B peptide substrate KQWDNYEFIW 10 T 3 DUF3896 pdbhh F T 3fy6 1 A,B,C,D A,B,C,D M1E1E6_VIBCL Integron cassette protein RENLYFQGMTEVNLNIYSPRWGRHETYIVELHKDYMEISMGAVTIKATYSENQDPEWSEETLQDIMNNDSVYPPEITQNLFQHAWLEWRKGALDNDEVTRELELVAQWVNKVTEAKPNSDFWRKYF 126 T 0.33 DUF768 pdbhh F Bacteria T 3fzx 1 A A Q5LD59_BACFN Putative exported protein GAQNQDCAFFFPNQEGEQITRNCYTADGKLTNILVYRVDQAYEYPSGMEVVANYTFADAAGKTLNSGQMVARCSDGNFSMSMGDVATFPTALNMMNADVYMMGDLMNYPDAFSNPMNPGDDDEFDDGTLRLYQKGNKNNRAEISVFDREFVTTETVNTPAGAFYCTKVKYEMNIWTPKETIKGYGYEWYAPNIGIVRSEQYNNKKELQSYSVLERIKK 218 T 0.0026 DUF3108 unppercent F Bacteria T 3g1b 2 C,D C,D 10-residue peptide WLFVQRDSKE 10 T 1 DUF4642 pdbhh F T 3g2s 2 C,D C,D SORL_HUMAN SORTING PROTEIN-RELATED RECEPTOR CONTAINING LDLR CLASS A REPEATS, SORLA, SORLA-1, LOW-DENSITY LIPOPROTEIN RECEPTOR RELATIVE WITH 11 LIGAND-BINDING REPEATS, LDLR RELATIVE WITH 11 LIGAND-BINDING REPEATS, LR11 ITGFSDDVPMVIA 13 T 0.18 TMEM154 unphh F Eukaryota T 3g2u 2 C,D C,D SORT_HUMAN NEUROTENSIN RECEPTOR 3, NTS3, NTR3, NT3 SGYHDDSDEDLLE 13 T 5.8 SiaC pdbhh F Eukaryota T 3g2w 2 C,D C,D GGA1_HUMAN Internal peptide of the Hinge domain of ADP-ribosylation factor-binding protein GGA1 SASVSLLDDELMSL 14 T 6 EGL-1 pdbhh F Eukaryota T 3g3p 2 C D Peptide (NLE)LFVQRDSKE XLFVQRDSKE 10 T 1.1 DUF4642 pdbhh F T 3g7m 1 A A Q0WX48_WHEAT Xylanase inhibitor TL-XI APLTITNRCHFTVWPAVALVLAQGGGGTELHPGASWSLDTPVIGSQYIWGRTGCSFDRAGKGRCQTGDCGGSSLTCGGNPAVPTTMAEVSVLQGNYTYGVTSTLKGFNVPMNLKCSSGDALPCRKAGCDVVQPYAKSCSAAGSRLQIVFCP 151 T 6.1 Thaumatin unphh F Eukaryota T 3gco 2 B B DNRDGNVYQF peptide DNRDGNVYQF 10 T 4.6 DUF4651 pdbhh F T 3gd1 3 D Z clathrin TNLIELDA 8 T 2.6 LRR_3 pdbhh F T 3gd2 2 B B activator peptide AHQLLRYLLDA 11 T 0.00089 DUF4927 pdbhh F T 3gds 2 B B DNRDGNVYYF peptide DNRDGNVYYF 10 T 0.29 Ribosomal_L24e pdbhh F T 3ge5 1 A,B A,B Q7MX99_PORGI NITROREDUCTASE FAMILY PROTEIN MGSDKIHHHHHHENLYFQGMKQIPQDFRLIEDFFRTRRSVRKFIDRPVEEEKLMAILEAGRIAPSAHNYQPWHFLVVREEEGRKRLAPCSQQPWFPGAPIYIITLGDHQRAWKRGAGDSVDIDTSIAMTYMMLEAHSLGLGCTWVCAFDQALCSEIFDIPSHMTPVSILALGYGDPTVPPREAFNRKTIEEVVSFEKL 198 T 2.5E-17 Nitroreductase unp F Bacteria T 3ggw 3 E,F E,F PEPTIDE B1 YLEDWIKYNNQK 12 T 0.0059 DUF3439 pdbhh F T 3ghb 3 C,F P,Q P88213_9HIV1 Envelope glycoprotein KGVRIGPGQA 10 T 0.085 GP120 pdbhh T Viruses T 3ghe 3 C P P88403_9HIV1 Envelope glycoprotein RKRIHIGPGRAFYAT 15 T 0.00016 GP120 pdbhh T Viruses T 3gj9 2 C,D C,D KCNJ4_HUMAN C-TERMINAL PEPTIDE OF INWARD RECTIFIER K(+) CHANNEL KIR2.3 NISYRRESAI 10 T 1.3 Glyco_tran_10_N pdbhh F Eukaryota T 3gjn 2 B,C B,C CEA7_ECOLX Colicin-E7 MHHHHHHSMGKRNKPGKATGKGKPVNNKWLNNAGKDLGSPVPDRIANKLRDKEFKSFDDFRKKFWEEVSKDPELSKQFSRNNNDRMKVGKAPKTRTQDVSGKRTSFELHAEKPISQNGGVYDMDNISVVTPKRHIDIHRGK 141 T 0.011 HNH pdbpssm F Bacteria T 3gjo 2 E,F,G,H E,F,G,H DYST_HUMAN Dystonin GSRPSTAKPSKIPTPQRKSPASKLDKSSKR 30 T 13 DUF3697 pdbhh F Eukaryota T 3gn6 1 A,B,C,D A,B,C,D Q8KDY2_CHLTE CT0912, ORFan protein with a ferredoxin-like domain repeat GMTGLSQSQASPMQIQPGNAAFNPWTDAALDTIRDVNQALTLYAEMRVVPAHHDAFLAAIDTVSAKLRVLPGFLSLALKQMSGDSTMVKNYPETYKGVLATAYLDGVAAGTQPYFYNLFVRFADGRAARAAGFEALFETHIHPLLHAMAPRGGDGPELLAYRAVLQSVVAGDRHAIYRGAEEIRSFLRRPVELPERETVTVENHVMVPEDKHAAWEPQVAILLQVAQDTFEPQDEPSGVGLPGARDNRYYRKALSTEILRNAHADGGLRAYIMHGVWESVWDHENSHLDPRFLAAAGPVGAAAVVGPVEPFYLTRRLVVAD 321 T 0.0014 ABM pdbpercent F Bacteria T 3gof 2 C,D C,D NOS2_MOUSE INDUCIBLE NO SYNTHASE, INDUCIBLE NOS, INOS, NOS TYPE II, MACROPHAGE NOS, MAC-NOS RRREIRFRVLVKVVFF 16 T 5.9 RNA_pol_Rbc25 pdbhh F Eukaryota T 3gsq 3 C P HCMV pp65 fragment 495-503, variant M5S (NLVPSVATV) NLVPSVATV 9 T 7 CtnDOT_TraJ pdbhh F T 3gsr 3 C P HCMV pp65 fragment 495-503, variant M5V (NLVPVVATV) NLVPVVATV 9 T 1.6 GDH_N pdbhh F T 3gsu 3 C P HCMV pp65 fragment 495-503, variant M5T (NLVPTVATV) NLVPTVATV 9 T 13 CtnDOT_TraJ pdbhh F T 3gsv 3 C P HCMV pp65 fragment 495-503, variant M5Q (NLVPQVATV) NLVPQVATV 9 T 23 DUF5464 pdbhh F T 3gsw 3 C P HCMV pp65 fragment 495-503, variant T8A (NLVPMVAAV) NLVPMVAAV 9 T 29 DUF2714 pdbhh F T 3gsx 3 C P HCMV pp65 fragment 495-503, variant T8V (NLVPMVAVV) NLVPMVAVV 9 T 6.9 ExbD pdbhh F T 3gxq 1 A,B A,B A0A0H2XIU6_STAA3 Putative regulator of transfer genes ArtA ENSVFFGKKKKVSLHLLVDPDMKDEIIKYAQEKDFDNVSQAGREILKKGLEQIA 54 T 0.033 DUF108 pdbpercent F Bacteria T 3gxv 2 C C DNAB_HELPY Replicative DNA helicase IKNASIKRKLFGLANTIREQAL 22 T 5.9 DNA_ligase_A_N unppssm F Bacteria T 3gz1 2 C,D P,Q IPAB_SHIFL 62 KDA ANTIGEN INTTNAHSTSNILIPELKAPKS 22 T 18 Phage_Treg pdbhh F Bacteria T 3gz2 2 C P IPAB_SHIFL 62 KDA ANTIGEN MGSSHHHHHHSSGLVPRGSHMILTSTELGDNTIQAANDAANKLFSLTIADLTANQNINTTNAHSTSNILIPELKAPKS 78 T 5 Aft1_HRR pdbhh F Bacteria T 3h1z 2 B P PI51C_HUMAN PHOSPHATIDYLINOSITOL-4-PHOSPHATE 5-KINASE TYPE I GAMMA, PTDINS(4)P-5-KINASE GAMMA, PTDINSPKIGAMMA, PIP5KIGAMMA YFPTDERSWVYSPLH 15 T 0.63 PIG-S pdbhh F Eukaryota T 3h2h 1 A A Q5H5J0_XANOR PUTATIVE UNCHARACTERIZED PROTEIN APARGTLLTSNFLTSYTRDAISAMLASGSQPASGSQPEQAKCNVRVAEFTYATIGVEGEPATASGVLLIPGGERCSGPYPLLGWGHPTEALRAQEQAKEIRDAKGDDPLVTRLASQGYVVVGSDYLGLGKSNYAYHPYLHSASEASATIDAMRAARSVLQHLKTPLSGKVMLSGYSQGGHTAMATQREIEAHLSKEFHLVASAPISGPYALEQTFLDSWSGSNAVGENTFFILLGSYAIVAMQHTYKNIYLEPGQVFQDPWAAKVEPLFPGKQSLTDMFLNDTLPSIDKVKSYFQPGFYSDFPSNPANPFRQDLARNNLLEWAPQTPTLLCGSSNDATVPLKNAQTAIASFQQRGSNQVALVDTGTGNASDNSAFAHMLTKESCIVVVRDQLLDKQR 397 T 2.8E-13 LIP unppercent F Bacteria T 3h2i 1 A A Q5H5J0_XANOR PUTATIVE UNCHARACTERIZED PROTEIN APARGTLLTSNFLTSYTRDAISAMLASGSQPASGSQPEQAKCNVRVAEFTYATIGVEGEPATASGVLLIPGGERCSGPYPLLGWGHPTEALRAQEQAKEIRDAKGDDPLVTRLASQGYVVVGSDYLGLGKSNYAYHPYLHSASEASATIDAMRAARSVLQHLKTPLSGKVMLSGYSQGGHTAMATQREIEAHLSKEFHLVASAPISGPYALEQTFLDSWSGSNAVGEWTFGILLGSYAIVAMQHTYKNIYLEPGQVFQDPWAAKVEPLFPGKQSLTDMFLNDTLPSIDKVKSYFQPGFYSDFPSNPANPFRQDLARNNLLEWAPQTPTLLCGSSNDATVPLKNAQTAIASFQQRGSNQVALVDTGTGNASDNSAFAHMLTKESCIVVVRDQLLDKQR 397 T 2.8E-13 LIP unppercent F Bacteria T 3h3p 3 E,F S,T 4E10_S0_1TJLC_004_N HHHHHHTNEAYLAHERRELEAKRNQLRDEVDRTKTHMQDEAANDPNWFDITAQLWEFSQELRNRDREEKLIKKIEQTLKKVENED 85 T 0.053 Ran-binding pdbpercent F T 3h52 2 E,F N,M NCOR1_HUMAN N-COR1, N-COR ASNLGLEDIIRKALMGSFD 19 T 3.6 RHH_7 pdbhh F Eukaryota T 3h5f 1 A,B,C A,B,C COIL SER L16L-Pen XEWEALEKKLAALESKXQALEKKLEALEHGX 31 T 0.00045 DUF5320 pdbhh F T 3h5r 2 E,F,G,H E,F,G,H MCCC7, MICROCIN C51, MCCC51, MICROCIN C, MCC MRTGNAD 7 T 110 FARP pdbhh F T 3h7b 3 C,F C,F ATM_YEAST Tel1p peptide MLWGYLQYV 9 T 1 Rgp1 pdbhh F Eukaryota T 3h7z 1 A A YADA2_YEREN Adhesin yadA HTLKTANSYTDVTVSNSTKKAIRESNQYTDHKFHQLDNRLDKLDTRLLKLLASSAALNSLL 61 T 0.0018 CLZ pdb F Bacteria T 3h85 2 B P PI51C_HUMAN PHOSPHATIDYLINOSITOL-4-PHOSPHATE 5-KINASE TYPE I GAMMA, PTDINS(4)P-5-KINASE GAMMA, PTDINSPKIGAMMA, PIP5KIGAMMA SWVYSPLH 8 T 4.6 Pox_F15 pdbhh F Eukaryota T 3h8a 2 E,F E,F RNE_ECOLI RNase E QSPMPLTVAAASPELASGKVWIRYPIVR 28 T 1 XisI pdbhh F Bacteria T 3h8d 2 E,F,G,H E,F,G,H DAB2_RAT DOC-2, MITOGEN-RESPONSIVE PHOSPHOPROTEIN, C9 GSSSGGGSSSSGTSSAFSSYFNNKVGIPQEHVDHDDFDANQLLNKINE 48 T 2.4 EGL-1 pdbhh F Eukaryota T 3h9g 2 E,F,G,H E,F,G,H Microcin C7 analog MRTGNAX 7 T 110 FARP pdbhh F T 3h9j 2 E,F,G,H E,F,G,H MCCC7, MICROCIN C51, MCCC51, MICROCIN C, MCC MRTGNAN 7 T 110 FARP pdbhh F T 3hf0 1 A A GCN4-pLI side chain sequence on an (alpha-alpha-beta-alpha-beta-alpha-beta) backbone with cyclic beta-residues XRMXQXEXKLXEXLXKLXHXEXELXRXKXLLXEX 34 T 1 ATP-synt_DE pdbpssm F T 3hgk 2 E,F,G,H E,F,G,H HPAB2_PSESM AVRPTOB, AVIRULENCE PROTEIN AVRPTOB, E3 UBIQUITIN-PROTEIN LIGASE PRRGAVAHANSIVQQLVSEGADISHTRNMLRNAMNGDAVAFSRVEQNIFRQHFPNMPMHGISRDSELAIELRGALRRAVHQQAAS 85 T 0.11 Peptidase_C58 pdbpssm F Bacteria T 3hik 2 B B Pentamer phosphopeptide XPLHST 6 T 200 zf-C2H2_4 pdbhh F T 3hki 3 C,F C,F PAR1_HUMAN PAR-1, THROMBIN RECEPTOR, COAGULATION FACTOR II RECEPTOR SFLLRNPNDKYEPFWEDEEKN 21 T 1.3 SYCP2_SLD pdbhh F Eukaryota T 3hqh 2 B M H2AY_HUMAN MacroH2A KAASADSTTEGTPAD 15 T 0.047 DUF1764 unp F Eukaryota T 3hql 2 C,D C,D Q9VHV8_DROME SD08157P ENLACDEVTSTTSSST 16 T 1.2 DUF1163 pdbhh F Eukaryota T 3hqm 2 C,D C,D CI_DROME Protein cubitus interruptus NTLFPDVSSSTH 12 T 1.7 LSPR pdbhh F Eukaryota T 3hr5 3 I,J,K,L R,S,T,V M1prime-derived peptide SAQSQRAPDRVLCHSGQQQGLPRAAGGSVPHPRCH 35 T 15 HupF_HypC pdbhh F T 3hsv 2 C M H2AY_HUMAN MH2A1,HISTONE H2A.Y,H2A/Y,MEDULLOBLASTOMA ANTIGEN MU-MB-50.205 XDSTTEGTPADGFTVL 16 T 0.047 DUF1764 unp F Eukaryota T 3huf 2 D E COM1_SCHPO NBS1-INTERACTING PROTEIN 1, MEIOTICALLY UP-REGULATED GENE 38 PROTEIN IQELDSTTDEDEI 13 T 0.0085 CCDC144C unppercent F Eukaryota T 3i1g 1 A A GCN4_YEAST AMINO ACID BIOSYNTHESIS REGULATORY PROTEIN RMAQLEAKVEELLSKNWNLENEVARLKKLVGER 33 T 0.00052 bZIP_1 pdbpercent F Eukaryota T 3i5r 2 B B Peptide ligand HSKRPLPPLPSL 12 T 0.69 Herpes_LAMP2 pdbhh F T 3i7l 2 B B DDB2_HUMAN DAMAGE-SPECIFIC DNA-BINDING PROTEIN 2, DDB P48 SUBUNIT, DDBB, UV-DAMAGED DNA-BINDING PROTEIN 2, UV-DDB 2 SIVRTLHQHKLGRA 14 T 2.6 YrzK pdbhh F Eukaryota T 3i7n 2 B B WDTC1_HUMAN WD and tetratricopeptide repeats protein 1 NITRDLIRRQIKE 13 T 0.77 DUF6483 pdbhh F Eukaryota T 3i7o 2 B B DCAF6_HUMAN NRIP, NUCLEAR RECEPTOR INTERACTION PROTEIN, ANDROGEN RECEPTOR COMPLEX-ASSOCIATED PROTEIN, ARCAP HLLWDVRKRSLGL 13 T 1.8 SCAPER_N pdbhh F Eukaryota T 3i7p 2 B B DCA12_HUMAN CENTROSOME-RELATED PROTEIN TCC52, TESTIS CANCER CENTROSOME-RELATED PROTEIN SLVYYLKNREVRL 13 T 2.9 DUF760 pdbhh F Eukaryota T 3i89 2 B B DCAF5_HUMAN BREAKPOINT CLUSTER REGION PROTEIN 2, BCRP2 SVVGFLSQRGLHG 13 T 2.9 Blt1_C pdbhh F Eukaryota T 3i8c 2 B B DCAF4_HUMAN WD repeat-containing protein 21A NASSMLRKSQLGF 13 T 4 MRP-S35 pdbhh F Eukaryota T 3i8e 2 C,D C,D DCAF8_HUMAN WD repeat-containing protein 42A QALPALRERELGS 13 T 1.7 DUF4661 unp F Eukaryota T 3i91 2 C C H3K9 peptide QTARKSTG 8 T 290 Ribosomal_S8e pdbhh F T 3iax 2 B B CEA_CITFR Colicin-A MPGFNYGGKGDGTGWSSERGSGPEPGGGSHGNSGGHDRGDSSNVGNESVTVMKPGDSYNTPWGKVIINAAGQPTMNGTVMTADNSSMVPYGRGFTRVLNSLVNNPVSLEHHHHHH 115 T 0.32 Cloacin pdbpercent F Bacteria T 3idi 3 C C gp41 MPER peptide ALDKWQN 7 T 2.6 AATF-Che1 pdbhh F T 3idj 3 C C gp41 MPER peptide analog ELDXWAS 7 T 7.3 Med13_C pdbhh F T 3iee 1 A A Q5LA60_BACFN Putative exported protein GASCSGGDKSKAPVVSTADIENAAEVIKYYNTSLGVLKDMVKEKDVNAVLDYMEQKGKTPALSAIVPPAVVSKDSAIVLNPGNCFNEETRRNLKQNYTGLFQARTEFYANFDTYLSYLKKKDVTNAKKLLDVNYQLSTQMSEYKQNIFDILSPFTEQAELVLLVDNPLKAQIMSVRKMSSTMQSILNLYARKHRMDGPRIDLKVAELTKQLDAAKKLPVVNGHEGEMKSYQAFLSQVETFIKQVKKVREKGEYSDADYDMLTSAFETSII 270 T 0.029 LPAM_1 unphh F Bacteria T 3if2 1 A,B A,B Q4FPU3_PSYA2 Aminotransferase GMKFSKFGQKFTQPTGISQLMDDLGDALKSDQPVNMLGGGNPAKIDAVNELFLETYKALGNDNDTGKANSSAIISMANYSNPQGDSAFIDALVGFFNRHYDWNLTSENIALTNGSQNAFFYLFNLFGGAFVNEHSQDKESKSVDKSILLPLTPEYIGYSDVHVEGQHFAAVLPHIDEVTHDGEEGFFKYRVDFEALENLPALKEGRIGAICCSRPTNPTGNVLTDEEMAHLAEIAKRYDIPLIIDNAYGMPFPNIIYSDAHLNWDNNTILCFSLSKIGLPGMRTGIIVADAKVIEAVSAMNAVVNLAPTRFGAAIATPLVANDRIKQLSDNEIKPFYQKQATLAVKLLKQALGDYPLMIHKPEGAIFLWLWFKDLPISTLDLYERLKAKGTLIVPSEYFFPGVDVSDYQHAHECIRMSIAADEQTLIDGIKVIGEVVRELYDNK 444 T 0.00056 Aminotran_1_2 pdbpercent F Bacteria T 3ifl 3 C P A4_HUMAN Amyloid beta A4 protein DAEFRHD 7 T 1 DUF5973 pdbhh F Eukaryota T 3im4 2 C C AKA10_HUMAN KINASE ANCHOR PROTEIN 10, PROTEIN KINASE A-ANCHORING PROTEIN 10, PRKA10, D-AKAP-2 GSPEFVQGNTDEAQEELAWKIAKMIVSDVMQQAQYDQPLEKSTKL 45 T 0.073 TnpW pdbpssm F Eukaryota T 3ino 1 A,B A,B PAG_BACAN PA63 GSRFHYDRNNIAVGADESVVKEAHREVINSSTEGLLLNIDKDIRKILSGYIVEIEDTEGLKEVINDRYDMLNISSLRQDGKTFIDFKKYNDKLPLYISNPNYKVNVYAVTKENTIINPSENGDTSTNGIKKILIFSKKGYEIG 143 T 0.11 Fve pdbhh F Bacteria T 3iqj 2 B P RAF1_HUMAN C-RAF, CRAF, RAF-1 QRSTSTPNVH 10 T 58 ALC pdbhh F Eukaryota T 3isw 2 C C CFTR_HUMAN CFTR, CHANNEL CONDUCTANCE-CONTROLLING ATPASE, CAMP-DEPENDENT CHLORIDE CHANNEL, ATP-BINDING CASSETTE TRANSPORTER SUB-FAMILY C MEMBER 7 PLEKASVVSKLFFSWTAP 18 T 1.4 ACTH_domain pdbhh F Eukaryota T 3it3 1 A,B A,B HISTIDINE ACID PHOSPHATASE MVGYSSKLIFVSMITRHGDRAPFANIENANYSWGTELSELTPIGMNQEYNLGLQLRKRYIDKFGLLPEHYVDQSIYVLSSHTNRTVVSAQSLLMGLYPAGTGPLIGDGDPAIKDRFQPIPIMTLSADSRLIQFPYEQYLAVLKKYVYNSPEWQNKTKEAAPNFAKWQQILGNRISGLNDVITVGDVLIVAQAHGKPLPKGLSQEDADQIIALTDWGLAQQFKSQKVSYIMGGKLTNRMIEDLNNAVNGKSKYKMTYYSGHALTLLEVMGTLGVPLDTAPGYASNLEMELYKDGDIYTVKLRYNGKYVKLPIMDKNNSCSLDALNKYMQSINEKFQKHHHHHH 342 T 0.012 His_Phos_2 pdbpercent F T 3it8 2 D,E,F,J,K,L D,E,F,J,K,L Q9DHW0_YLDV 2L protein ITLKYNYTVTLKDDGLYDGVFYDHYNDQLVTKISYNHETRHGNVNFRADWFNISRSPHTPGNDYNFNFWYSLMKETLEEINKNDSTKTTSLSLITGCYETGLLFGSYGYVETANGPLARYHTGDKRFTKMTHKGFPKVGMLTVKNTLWKDVKAYLGGFEYMGCSLAILDYQKMAKGKIPKDTTPTVKVTGNELEDGNMTLECTVNSFYPPDVITKWIESEHFKGEYKYVNGRYYPEWGRKSNYEPGEPGFPWNIKKDKDANTYSLTDLVRTTSKMSSQPVCVVFHDTLEAQVYTCSEGCNGELYDHLYRKTEEGEGGSHHHHHH 324 T 0.0046 C1-set unppercent T Viruses T 3iux 2 B,D B,D miniature protein inhibitor CNCKAPETFLCYWRCLQX 18 T 0.00097 Bee_toxin pdbhh F T 3iym 1 A,B A,B Q6YDQ6_9VIRU Capsid protein MSSIAPTDSVSSSGKRSKPGKRERQQARSAVGSAGGKPASASKAAAFAQGGSSDPVPMPGKYPVVFSTGAGEPTRDQEFALPVHKAFPLFGSVSDKYRRNPRYAEFRAHSEFTDGVFGTHLAVSSLLRLAQQLVHAHVNMGLPLGDFAPLASSDVRIPSALASVVNQFGEFSSPSIGTRFLLRDFEHAVSRVVFLADQLWTNGNSHHIFARSWLPMSNNDGNFKTIVASRLLEFISAGDLSILPTVLEDAVLSGEVPEAWEQVKDLLGDAPGVGQVDRRDRFDFLFKSYADVGQFTTAFTTQAASDVLTELGLPWNSPSAGHLNWQYSTKQRFTFLADTWAKLSAAYSQFFELSSGLATRQSATGSHAQMVDLTSVEGVTVLKAALALSAPEFSLAACFPPSCIFVGGLTRRVVVTTSLSVSQRATEFCQMDWR 434 T 0.13 BON unppercent T Viruses T 3iz3 1 A A Q914N6_CPVBM Structural protein VP3 MWHYTSINNDTRVALDPKPNQIRTITKPNTVPQLGTDYLYTFNSQRRSHTLRLLGPFQYFNFSETDRGHPLFRLPLKYPSKAIPADELIDNLHSWMRSVHLLHVRSEDNTLRYNWMLGVYARSTNYTTPVGQLVVNAPAILNYSNPQDAFNSVFVALGIDYIDIPITNSNIFDDSSTPYNVRIWHAPTMTEVNHILALMRKSTLVSTHSSWHWNVLHTFHYRSESDMIDHFAAKILEDWRQKEKLDKGALVEADRVIQRLIPLSSSTYVQRLAAIGALYPNEFTENVLDLSRLSTALLQLSDTYYQHANDQLRRLYRRMYNDSRTLYMTQRHQELLLAQITADPNILLYPYTYIFTTIPTSMNYISNTGQGRIKHSLTVTGATEHDTVADIVLGQTGEDVITISMVEPMSIAVEDMYGYVLDTPTRDIWPADEQIEQKGDAVALYDTKTSRALGMFNNTVRIDDLLSPLLSLVYRTYIKGDTMTMTQGSLDHLTLCAAVDSDITFVGNRMIAPLPEGYIPKPMHRNNSTMKMLSLYVALKKLENFATNSYLMAPDTSIILLGAEREPAVNILRRFNRNVSNVRIIGMGDRAVEPNIRVRVPFPIDKNISADFIICDINSYEDQSFESMFSETISVVTTCASAATRALVKINHPSEYMINSVIERLSQLGGVFYHTALLKTASQNPYSYETYIYITPIAAAVRFPFYSNSAMINRYMTAVADDEMPIIPSIHTVIKGHSNTYSPGLFCGCVDVQSAPLALSQLKSYCSEATTWRVDSDDNLVNIIARIDPARIALEFRTRSNTSAYHEYQRYVPNGLGFKVRKTREFRYMHREVTFIHKLMMYALIREQISLTENMTQVVSIGGRNLADISVVPLNMKYVVIDPATRIETLTQEKKNIEVQSRPFQFDAANMDLENNSIYLFIAVIMNEPNGAATPARMQMDKIRNVATAMLTRTNCVAYISFYEAGIITRLDQSTAHKTIRVEEGRLKVANYVPVDTLVEADVTLMLRDIGITHEIIRPSTPELIDACSNYGIRLGSTGGAVLDVFNHYSPVIKLVRS 1058 T 0.52 DUF5705 pdbpssm T Viruses T 3iz3 2 B,C B,C CAPSD_CPVBM Structural protein VP1 MHSTNNNSNKRNNEEKHKQPEIDSSANNGEGTSGTRAQTVGDTATEAGVRNETKAGASTRRQTDGTGLSGTNAKIATASSARQTDVEKPADVTFTIENVDDVGIMQQKKPPTVVQSRTDVFNEQFANEALHPMTKVIFNGLDVNTEVQPLSDDFKQISDPKGYLTYSVKYEDQFTKKDKLRASEADDRIVGPTVNLFKYGAAVVNIDLNRDFFDTATGIDLTKGIPLVQDLLVPIGVTAGAEQSAEYVSGLLMVLFKVMTDNRLVIVGETTTPMSNTLSTVVNNVLRTTYHNNVGVNPALLRDFTQVNWLNRDITNMLQQAGTKYGLGLTETRLDYVRLVKTIVGHALNIDHFAASVLNINLRALMEANVTADDRIKALQAHSMISTQFHGPNQGALRPELAFDHDHIIRCLMLAAANYPRLEGIIVQINTGYVASANVIRPVSEKRYFPENLEQNQSAARLVSAVKARASEADISSIHLAIAREVSPMFNVHELKKIAESFEDPSSIVVVLEFILFALFFPTEFNRIKGDIQNVLLLFFSRWYPVEYGIFIQRGATYTINAAGEFEFSGRNEKWDQSLYLSEHFPALFSDVPLAGANTIIAIMRLFTPQGFLRTDDLAIAANFPRASRNPQTYIPYTNQRGTVTNEFASRFRTIVATLANVVNERAVQDDMQKATRSCTKQWLRHLETQFDNIAVAHTDHLSVVYATMSNFMLNFTNNFSGNHATFKPDQYVITSPEGSYKPIIERQGETVDGLTIIDTSIVWPILCQCTYPLVRQSGKGVDAVSIMEEIVYPDPSTTLSQSLSVAQVLSKLTLPDAFINMILSGGDSVVMRTYQTEADDDLDEGIRMTTYDQYLSHIRERLHITNVPDPIYITGASTPDQIAASVQATHVAVVLYQSGVINGSASTYLRENEVLVVMPDYYDVVSRFANANLQMNNNRYHESVLEIADIFDQADFIQTSDAVRQLRALMPTLSTSQIRHAIERIAQITDVDSTDYGKLTLRFLGTLTRSLKMQNAQIRRIRPDGTVLRYDDQIDIEAFRWSRYFLDELRLRRLSVGLRLITNPRIARRFDGVRIMYLTDDDPDPDFVPDVPEGYVAVQYAHRLFSSSLANKRNRVTYTHPPTGMAYPSPTGRPHVHMTINERAGMSKLVADNIIASVIKSNWVVDIHDIEYTAEVMTPSEGYTQHVDAESIMTAPKGKLFHLQFMDGLLRPEPSAFDPPASGEDMRLIYPLQPISVARSMRAIVNHNEVDRPRGAVAPSSYEMDTGTLSRNGDLLYSPVANGQVGIPKLEVDHISFSNVVSMMTANIRTGDDMAVERVNPDDVRAINIRNA 1333 T 0.96 DUF2717 pdbpercent T Viruses T 3iz3 3 D,E D,E C6K2M8_CPVBM Viral structural protein 5 MLQQPTGGYTTLEQFAFTIRNDGTNATPTQFLQLLSYEATENELVKKTIPTPETHLPSARNVPGNVYIEDAITQALFGISAQNVNAHGYFSRLSALALPNTSARLGLDGVIYNSETINIPFYDPAAVANFAATYAKLGNASTPRYRADMIDIYAHVGLELAGTDAERAAGVMPVKRAKFDSWEGSLISLSRDVVNWKILAFLIDLCSLEGEALRAFKTRNRDVFRMMLFIMSTAVAANVVNRKVTKRVDRVLEYIGVNSMRTAGRTATITYDLSRHEFAAKFLQLTFTRWN 291 T 13 HAD_SAK_2 pdbhh T Viruses T 3izx 3 D,E D,E C6K2M8_CPVBM Viral structural protein 5 MLQQPTGGYTTLEQFAFTIRNDGTNATPTQFLQLLSWEATENELVKKTIPTPETHLPSARNVPGNVYIEDAITQALFGISAQNVNAHGYFSRLSALALPNTSARLGLDGVIYNSETINIPFYDPAAVANFAATYAKLGNASTPRYRADMIDIYAHVGLELAGTDAERAAGVMPVKRAKFDSWEGSLISLSRDVVNWKILAFLIDLCSLEGEALRAFKTRNRDVFRMMLFIMSTAVAANVVNRKVTKRVDRVLEYIGVNSMRTAGRTATITYDLSRHEFAAKFLQLTFTRWNAASAMIRSMPDMHTPRTSITPAGENALVRHNRYMTENFKGLSPIALAQKKHEMMLHTHEIHSMDIDGSIKNMVERETVNKMNEIDAMNTAPWTEEFAEVEPTTVYERHQIGTDPEQTQLISQDAAVIVHQASSDVDENEYGNSVSELTIDTQSDSVL 448 T 0.022 CI-B14_5a unppssm T Viruses T 3j0h 1 A,B,C,D,E,F A,B,C,D,E,F Q8SDD3_BPDPK PHIKZ029 LRPEDAANPSRLIVAIEIVEDEIPLTIRRLSGFNYPNSVRDIGNAPVPTTDKVDGLKARIILIEDNTSEVGTQRVLPGTLVSDKDGSQSLVYPLFEAPVSFFGKLGDSNGMRVWSTTTADIEEFDEAAMAKFKTRQFRIQLIEKPEVGTSPVIVKTADQQDYLNITFDKGVYSDMYNADLYVGDVLVDSYSDDGVVSGLSPLYSPFSQFYVYHENIDLVRQMIYDTEMRVNPAAAAHTTAPGEIDFLTFLAVDGDPYQGIQVLGPLDGGITLGKDGNIYASGGTDGTTDLEEYAK 295 T 0.21 N-Term_TEN pdb T Viruses T 3j17 1 A A Q914N6_CPVBM Structural protein VP3 MWHYTSINNDTRVALDPKPNQIRTITKPNTVPQLGTDYLYTFNSQRRSHTLRLLGPFQYFNFSETDRGHPLFRLPLKYPSKAIPADELIDNLHSWMRSVHLLHVRSEDNTLRYNWMLGVYARSTNYTTPVGQLVVNAPAILNYSNPQDAFNSVFVALGIDYIDIPITNSNIFDDSSTPYNVRIWHAPTMTEVNHILALMRKSTLVSTHSSWHWNVLHTFHYRSESDMIDHFAAXILEDWRQKEKLDKGALVEADRVIQRLIPLSSSTYVQRLAAIGALYPNEFTENVLDLSRLSTALLQLSDTYYQHANDQLRRLYRRMYNDSRTLYMTQRHQELLLAQITADPNILLYPYTYIFTTIPTSMNYISNTGQGRIKHSLTVTGATEHDTVADIVLGQTGEDVITISMVEPMSIAVEDMYGYVLDTPTRDIWPADEQIEQKGDAVALYDTKTSRALGMFNNTVRIDDLLSPLLSLVYRTYIKGDTMTMTQGSLDHLTLCAAVDSDITFVGNRMIAPLPEGYIPKPMHRNNSTMKMLSLYVALKKLENFATNSYLMAPDTSIILLGAEREPAVNILRRFNRNVSNVRIIGMGDRAVEPNIRVRVPFPIDKNISADFIICDINSYEDQSFESMFSETISVVTTCASAATRALVKINHPSEYMINSVIERLSQLGGVFYHTALLKTASQNPYSYETYIYITPIAAAVRFPFYSNSAMINRYMTAVADDEMPIIPSIHTVIKGHSNTYSPGLFCGCVDVQSAPLALSQLKSYCSEATTWRVDSDDNLVNIIARIDPARIALEFRTRSNTSAYHEYQRYVPNGLGFKVRKTREFRYMHREVTFIHKLMMYALIREQISLTENMTQVVSIGGRNLADISVVPLNMKYVVIDPATRIETLTQEKKNIEVQSRPFQFDAANMDLENNSIYLFIAVIMNEPNGAATPARMQMDKIRNVATAMLTRTNCVAYISFYEAGIITRLDQSTAHKTIRVEEGRLKVANYVPVDTLVEADVTLMLRDIGITHEIIRPSTPELIDACSNYGIRLGSTGGAVLDVFNHYSPVIKLVRS 1058 T 0.52 DUF5705 pdbpssm T Viruses T 3j17 3 D,E D,E C6K2M8_CPVBM Structural protein VP5 MLQQPTGGYTTLEQFAFTIRNDGTNATPTQFLQLLSYEATENELVKKTIPTPETHLPSARNVPGNVYIEDAITQALFGISAQNVNAHGYFSRLSALALPNTSARLGLDGVIYNSETINIPFYDPAAVANFAATYAKLGNASTPRYRADMIDIYAHVGLELAGTDAERAAGVMPVKRAKFDSWEGSLISLSRDVVNWKILAFLIDLCSLEGEALRAFKTRNRDVFRMMLFIMSTAVAANVVNRKVTKRVDRVLEYIGVNSMRTAGRTATITYDLSRHEFAAKFLQLTFTRWNAASAMIRSMPDMHTPRTSITPAGENALVRHNRYMTENFKGLSPIALAQKKHEMMLHTHEIHSMDIDGSIKNMVERETVNKMNEIDAMNTAPWTEEFAEVEPTTVYERHQIGTDPEQTQLISQDAAVIVHQASSDVDENEYGNSVSELTIDTQSDSVL 448 T 0.022 CI-B14_5a pdbpssm T Viruses T 3j26 1 A,B,C,D,E,F,G,H,I,J,K,L,M A,B,C,D,E,F,G,H,I,J,K,L,M CAPSD_SPTNK MAJOR CAPSID PROTEIN MSNSAIPLNVVAVQEPRLELNNERTWVVVKGGQQVTYYPFPSTSFSSNQFNFICNPPSAQTVLDRLVFIQVPYDITFTANPSHAGITENLLQPGRDAFRAFPISSITNTLNATINGFPVNIELAQIIHALSRYHTPLKVKNGWMSMQPSFEDNYQSYRDADGANNNPLGVFTSAAGLSELPRGSYTMNVVTNTTTTARITGVLYEQVFLPPFLWDGEQAGGLANLTSLTFNWVLNNNLARIWSHSDITNDVSGNSTIGSMNISFQQPSMYLGFVTPRLNIPIPPRITYPYFKLSRYTTQFQNTLAPNASSTFKSNVVQLDSIPRKLYLFVKQSDNVIYQNLNNQITTPDVFLQINNLNLTWNNQQGILSGASSQNLYDFSVQNGYNKTWSEFNGVTQQFNGVSGQPTKVIGLEGGIVCLELGKDVGLRDDEAEGVIGNFNLQVQMTVTNTNQYVTVTPDMYIVAVYDGTLVISNTSAMASIGVASKEEVLNARITHGVSYNELQRIYG 508 T 0.11 IU_nuc_hydro pdb T Viruses T 3j26 2 N N I0CES9_9VIRU PENTON PROTEIN MSYSHSIKDCQEPDTVYYDILIPFKPNDQGFSPAIFQAQLTQPIVHNPSEYFLSVVRFSIPTQNIPLTIPQIQPYPNTNVNNTIYSVSIGYNGTYSSQNFVQFDPSLTSPNIPAPNAPTVTSPNVEVTPYYYIYDYSTFLQMINTALENAFNEISAPVGADAPFFFYDSNTEKISLIAQAAYYDRTLTTPIEIYCNVNLFTFFDSIKHIGLGYNTPTGRDILFDVRFLGNNYYQDPETAPSYPPEFIQMQQEYPTLSNWNAVKTIQLVSNLLPINKESIPSFRNSNVGIINAQGILADFVPLVTNGPEARISIDFVATGPWRLIDMFGSVPIYMVDLYVYWTDQTGGQYLINIPPGRILTCKLVFIKKSLSKYLVSEK 378 T 0.037 ATP_bind_2 pdb T Viruses T 3j31 1 A Q Q6Q0L4_9VIRU A223 penton base MGEVFKEVKEKFERYKFDVVYVDREYPVSSNNLNVFFEIGERNSFSGLLINEGQAVIDVLLLKKSHEGLSPIPGEGTGIQLSAGQILKFYNVPIAEIIVEYDPSNVSGVSSNVKLKGTIHPLFEVPSQISIENFQPTENYLIYSGFGTSLPQTYTIPANGYLIISITNTSTGNIGQITLTIGSTTMTFNLQTGENKIPVIAGTQITNLTLTSSSAILIYEEVI 223 T 6.7 PSP1 pdbhh T Viruses T 3j31 4 R P Q6Q0L3_9VIRU C381 turret protein MSVTTLGQSFPANAKVKYYYKLSEKQDLDAFVNSIFVGSYKLKQISYLLYGNTKIVSAPVVPLGPNASIIIDDELQEGLYLIRIKVYNTNSFSVTVTPFFNNNNTMTYSIGANSEFEIYDIFTKEQGNIYYIQLPPGLAILEFSLERVFEKGNRINIPKIIHTSGNGYISFRLRKGTYAIKMPYSYNNTTSTTFTNFQFGTISTSVATIPLVISSIPANGSGSGTFLVYLKITGDYEDVKFSVTYGGGLGVPFTFGLEVEEINELVENTNFVTQSVTLSGSQVTQSILNVQGSGSHLRLKYASVSGLTTAVTQCQLQATNLNRSTTYSTVWDFIAGGSSTPPSWDIREINSIQLVANGGSSTSSVTITLILVYEQIAGELS 381 T 0.2 CBM_48 pdb T Viruses T 3j3i 1 A A CAPSD_PCVC CP, COAT PROTEIN MAAPVLYGGAGGTATGPGDMRRSLMHEKKQVFAELRREAQALRVAKEARGKMSVWDPSTREGARGYREKVVRFGRQIASLLQYFENMHSPALDIIACDKFLLKYQIYGDIDRDPAFGENTMTAEVPVVWDKCEVEVKLYAGPLQKLMSRAKLVGAAREGIPNRNDVAKSTGWNQDQVQKFPDNRMDSLISLLEQMQTGQSKLTRLVKGFLILLEMAERKEVDFHVGNHIHVTYAIAPVCDSYDLPGRCYVFNSKPTSEAHAAVLLAMCREYPPPQFASHVSVPADAEDVCIVSQGRQIQPGSAVTLNPGLVYSSILTYAMDTSCTDLLQEAQIIACSLQENRYFSRIGLPTVVSLYDLMVPAFIAQNSALEGARLSGDLSKAVGRVHQMLGMVAAKDIISATHMQSRTGFDPSHGIRQYLNSNSRLVTQMASKLTGIGLFDATPQMRIFSEMDTADYADMLHLTIFEGLWLVQDASVCTDNGPISFLVNGEKLLSADRAGYDVLVEELTLANIRIEHHKMPTGAFTTRWVAAKRDSALRLTPRSRTAHRVDMVRECDFNPTMNLKAAGPKARLRGSGVKSRRRVSEVPLAHVFRSPPRRESTTTTDDSPRWLTREGPQLTRRVPIIDEPPAYESGRSSSPVTSSISEGTSQHEEEMGLFDAEELPMQQTVIATEARRRLGRGTLERIQEAALEGQVAQGEVTAEKNRRIEAMLSARDPQFTGREQITKMLSDGGLGVREREEWLELVDKTVGVKGLKEVRSIDGIRRHLEEYGEREGFAVVRTLLSGNSKHVRRINQLIRESNPSAFETEASRMRRLRADWDGDAGSAPVNALHFVGNSPGWKRWLENNNIPSDIQVAGKKRMCSYLAEVLSHGNLKLSDATKLGRLVEGTSLDLFPPQLSSEEFSTCSEATLAWRNAPSSLGVRPFAQEDSRWLVMAATCGGGSFGIGKLKSLCKEFSVPKELRDALRVKYGLFGGKDSLE 982 T 0.27 TT_ORF2 unppercent T Viruses T 3j40 1 A,B,D,I,L,M,N N,M,H,K,I,J,L Q858G5_BPE15 gp10 MKTVNMKTGTDSFVGEDGKPETKDQYPWGLRITLDNESLQRLGLNAKSLPAVGDSVSVMAMANVCSVSTRTTDHGEDNYVELQITDIGLAPQKRDDAKELKDAFYPDGEDD 111 T 0.045 RNA_pol_Rpb1_7 pdb T Viruses T 3j46 4 D n NC100 XAKKIWLALAGLVLAFSASCAQYEDGSSGELERQHTFALHQRSISGDGDSPHSYHSLPEGVKMTKYLQEQKLAVAAVAAQADLELFSTPVWISQAQGIRAG 101 T 0.0023 SecM pdbhh F T 3j47 6 F R RPN7_YEAST 26S proteasome regulatory subunit RPN7 NAQYHLLVKQGDGLLTKLQKYGAAVR 26 T 0.62 Paf67 unphh F Eukaryota T 3j47 8 H T RPN12_YEAST NUCLEAR INTEGRITY PROTEIN 1 KTNIIEKAMDYAISIEN 17 T 1.9 DUF4576 pdbhh F Eukaryota T 3j4u 2 H,I,J,K,L,M,N H,I,J,K,L,M,N Q775C8_BPBPP BBP16 MIIDKLLQVSDGQAVTASAASTDVIDFGQANPNTGMDDRSKMVITVDESADAAGAATVTFSVQDSADNATFADVAATGAIGKANLAAGKQVVIPMPTKLRRYCRVYYTVATGPLTAGKFSAQVVTGIQQNVAYPDSPRIA 140 T 0.019 DUF6385 pdbpercent T Viruses T 3j89 1 A,B,C,D,E,F,G,H,I,J,K,L,M,N,O,P,Q,R,S A,B,C,D,E,F,G,H,I,J,K,L,M,N,O,P,Q,R,S synthetic peptide QARILEADAEILRAYARILEAHAEILRAQ 29 T 3.6 DUF2563 pdbhh F T 3j9w 23 W AZ MIFM_BACSU MifM MTMFVESINDVLFLVDFFTIILPALTAIGIAFLLRECRAGEQWKSKRTDEHQTVFHINRTDFLIIIYHRITTWIRKVFRMNSPVNDEEDAGSLLL 95 T 0.052 PqiA pdb F Bacteria T 3jaj 49 WA 2 Nascent chain LLLLLLLLLLLLKVGPVPVLVMSLLFIASMV 31 T 0.00017 Sec61_beta pdb F T 3jap 46 TA r EIF3B_YEAST eIF3b LHQRELLKQWTEYREKIGQEMEKSMNFKIFD 31 T 0.44 Phage_TAC_8 unp F Eukaryota T 3jau 1 A A Q5DW45_9ENTO Capsid protein VP1 GYPTFGEHKQEKDLEYG 17 T 9.9E-08 Waikav_capsid_1 unphh T Viruses T 3jb6 1 A A D0EZK6_CPVBM RNA-dependent RNA polymerase MLPNTKLHNTIFSETRKFTRESFKEIEHLTARLANDRVARHDFLFNTSIVLISDYSGEDSNGNQLQATITIPNEIINPKEYDPSDYPLAEDESFFKQGHKYDYLVTFRAGSLTNTYEPKTKMYKLHAALDKLMHVKQRKSRFADLWRELCAVIASLDVWYQTTNYPLRTYVKLLFHKGDEFPFYESPSQDRIIFNDKSVASILPTFVYTCCQVGTAIMSGILTHVESIVAMNHFLHCAKDSYIDEKLKIKGIGRSWYQEALHNVCQATVPVWSQFNEVIGHRTKSTSEPHFVSSTFISLRAKRAELLYPEFNAYINRAIQLSKTQNDVANYYAACRAMTNDGTFLATLTELSLDAAVFPRIEQHLVTRPAVLMSNTRHESLKQKYTNGVGSIAQSYLSSFTDEIAKRVNGRHHDEAWLNFLTTSSPGRKLTEIEKLEVGGDVAAWSNSRIVMQAVFAREYRTPERIFKSLKAPIKLVERQQSDRRQRAISGLDNDRLFLSFMPYTIGKQIYELNDNAAQGKQAGNAFDIGEMLYWTSQRNVLLSSIDVAGMDASVTTNTKDIYNTFVLDVASKCTVPRFGPYYAKNMEVFEAGNRQSQVRYVNAAWQACALEAANSQTSTSYESEIFGQVKNAEGTYPSGRADTSTHHTVLLQGLVRGNELKRASDGKNSCLATIKILGDDIMEIFQGSESDTYDHAVSNASILNESGFATTAELSQNSIVLLQQLVVNGTFWGFADRISLWTREDTKDIGRLNLAMMELNALIDDLVFRVRRPEGLKMLGFFCGAICLRRFTLSVDNKLYDSTYNNLSKYMTLTKYDKNPDSDSTLMSLILPLAWLFMPRGGEYPAYPFERRDGTFTEDESMFTARGAYKRRLLYDVSNIGEMIQQNSMALDDDLLHEYGFTGALLLIDLNILDLIDEVKKEDISPVKVSELATSLEQLGKLGEREKSRRAASDLKIRGHALSNDIVYGYGLQEKIQKSAMATKETTVQSKRVSSRLHDVIVAKTRDYKISTIPADALHLHEFEVEDVTVDLLPHAKHTSYSSLAYNMSFGSDGWFAFALLGGLDRSANLLRLDVASIRGNYHKFSYDDPVFKQGYKIYKSDATLLNDFFTAISAGPKEQGILLRAFAYYSLYGNVEYHYVLSPRQLFFLSDNPVSAERLVRIPPKYYVSTQCRALYNIFSYLHILRSIANNRGKRLKMVLHPGLIAYVRGTSQGAILPEADNV 1225 T 0.59 DUF2779 pdbpercent T Viruses T 3jb6 2 B B Q9IR43_CPVBM Viral structural protein 4 MFAIDPLKHSKLYEEYGLYLRPHQINQEIKPTTIKKKELAPTIRSIKYASLIHSMLAKHAARHNGTLINPRMYADMITLGNTKVTVTKGTPKAQIDTLKMNGLTVVSKSRRNNKKKPVSDTTATIDENTDDIVTYKALTEMSTLIESFRLPSGLALIIFDDEKYQSLIPNYINQLIAYTQPHIIPTWQGIADFSDTYLRSYFKRPFELTASNLAAPQKYNLSPMTRSIFNNTGREDAVIRKLYGYGEYVFIRYEGCLITWTGIYGEVTMMVNLSKRDLGLDVGDDYLKEYKKLLFYGVITDAIPSGISARSTIMKISPHKMMNPSGGALAVLSKFLEAVVSTNVINATLVVYAEKGAGKTSFLSTYAEQLSLASGQVVGHLSSDAYGRWLAKNKDVEEPSFAYDYVLSLDTDDNESYYEQKASELLISHGISEVAQYELLSVRKKIKMMDEMNEVLIAQLENADTHSERNFYYMVSTGKTTPRTLIVEGHFNAQDATIARTDTTVLLRTINDTTQAMRDRQRGGVVQLFLRDTYYRLLPALHTTVYPFEMLESIRRWKWVH 561 T 0.00013 Zeta_toxin pdbhh T Viruses T 3jb6 3 C,D C,D D3JWE6_CPVBM VP1 CSP PTVVQSRTDVFNEQFANEALHPMT 24 T 8.5 DASH_Dam1 pdbhh T Viruses T 3jbu 49 WA z OMPA_ECOLI SecM-glycine MASWSHPQFEKGGGARGGSGGGSWSHPQFEKGFENLYFQGMKKTAIAIAVALAGFATVAQAEQKLISEEDLFSTPVWISQAQGIRAG 87 T 1.6999999999999998E-75 OmpA_membrane unp F Bacteria T 3jcu 20 TA,U u,U A0A0K9RHP1_SPIOL Photosystem II Reaction Center Tn protein MASITMTASFLGTTVSKQPPTHHLRRGVVMAKAMPETTTTTKEETSSKRRDLVFAVAAAAACSVARIAMAEEPKRGTPEAKKKYAPVCVTMPSARICYK 99 T 0.014 PsbQ pdbpercent F Eukaryota T 3jpx 2 B B H4_HUMAN HISTONE PEPTIDE GGAKRHRKVLRDNIQ 15 T 0.27 UPF0137 unp F Eukaryota T 3jqo 3 AA,C,DA,F,GA,I,JA,L,MA,O,PA,R,U,X a,C,d,F,g,I,j,L,m,O,p,R,U,X Q46702_ECOLX TraN protein CSSGHKPPPEPDWSNTVPVNKTIPVDTQGGRNES 34 T 0.0023 LPAM_1 unphh F Bacteria T 3jr3 2 B D Acetylated Peptide KKGQSTSRHKXLRFKTEG 18 T 21 DUF986 pdbhh F T 3jrv 2 C,D,E C,D,E DDX3X_HUMAN DEAD BOX PROTEIN 3, X-CHROMOSOMAL, HELICASE-LIKE PROTEIN 2, HLP2, DEAD BOX, X ISOFORM SFGSRSDSRGKSSFFSDRGS 20 T 26 DUF5725 pdbhh F Eukaryota T 3jz9 1 A A DRRA_LEGPH Uncharacterized protein DrrA GHMVTRIENLENAKKLWDNANSMLEKGNISGYLKAANELHKFMKEKNLKEDDLRPELSDKTISPKGYAILQSLWGAASDYSRAAATLTESTVEPGLVSAVNKMSAFFMDCKLSPNERATPDPDFKVGKSKILVGIMQFIKDVADPTSKIWMHNTKALMNHKIAAIQKLERSNNVNDETLESVLSSKGENLSEYLSYK 197 T 0.88 DUF3800 unppssm F Bacteria T 3jzp 2 B P pDI6W peptide (12mer) LTFEHWWAQLTS 12 T 0.19 Potyvirid-P3 pdbhh F T 3jzq 2 B,D P,Q pDIQ peptide (12mer) ETFEHWWSQLLS 12 T 0.14 Potyvirid-P3 pdbhh F T 3k1q 2 B B Q9E3V8_9REOV VP3A ANGPELIIEDTGLCTSFMLLDNIPSAHLTKELIGFTWFMQMYQMTPPLPEGAVNRIVCMTNWASLGDEGRGLEVRLPPPTDSSVHAYKTVLSRGYIDNAQFNPLALRSNVLLMLLQFTLSNLKINKSSTFTSDVTTITSGRMIRAFEGRPELLALAYPGRAVLPTQTKNAQFLSTAIADRIGRLDRANLIGGEVSAMVECMELCDALTLHIRETYIMLLRSMHQDPTQIVQIVNECANNLLNSTIPISLRPTILCPWFASSEDLRLQEVMHLVNISSNTAAALPLVEALSTLLRSVTPLVLDPTVLTNAITTISESTTQTISPISEILRLLQPMGNDYAAFWKCIASWAYNGLVTTVLSEDAFPDSSQSITHLPSMWKCLFLTLAGPMTSDPHSPVKVFMALANLLAQPEPIAIGVPGMHQTTPASQFSHPGVWPPGFLNPQLINPQQAPLLRAFAEHIRANWPQPSEFGYGSTLQGSANLFIPSNRMVYPWPNQPLPRLTVAPTYDSAMSNWISTTIAFFIRVVNSVNMTATVNDLTRRTMTGVMTAMRQVKTMTPFYIQHMCPTELSVLASVTVTPPFQVPFTRLVQNDVITNVLVARVDPAQRGDAAVDIRATHATFAAALPVDPAAIVVAMLCGQTETNLIPSHHYGKAFAPLFASNAMFTRNQRAVITREAFVCARSAVAQCQDAGFLVPRPLDALRQFDVTSAAAAEIMHAVNDAFKTAFDLDGALLDGLALYGDPRIADLSAAYLQYGGNVVREHVPPGPSHIHRALQQVESTFMAEMNLFNVARGNLYLVQTATNGNWSPMAPVAAPPFVRGGPNVRVVGRFGTIVPRPNGLEPQLIDDGNVPRDIAGDWVYPSDVLQVSVAVFRDYVWPMVKAGRTRVLVELGHYVYTLHYYDPQISLDEAPILEEWLSKINPAGIPPVPFCIPIPQVYPCITARRVHYAFTSENNNDSLFSTNAASIDTAFGENAAVSPLRWPGLVDPNYRVGTNDLPNRITLYNSLYRYNFTYPTLDGIMYVRSAT 1027 T 27 Peptidase_C36 pdbhh T Viruses T 3k26 2 B B HISTONE PEPTIDE ARTKKQTARKST 12 T 150 DUF3042 pdbhh F T 3k27 2 B B HISTONE PEPTIDE KQTARKSTG 9 T 300 Ice_nucleation pdbhh F T 3k48 2 D,E,F R,S,T peptide SGWCDPRWYDPFMCEH 16 T 0.36 Yuri_gagarin pdbhh F T 3k8g 1 A,B A,B TP453_TREPA PUTATIVE UNCHARACTERIZED PROTEIN GSGAWKASVDPLGVVGSGADVYLYFPVAGNENLISRIIENHESKADIKKIVDRTTAVYGAFFARSKEFRLFGSGSYPYAFTNLIFSRSDGWASTKTEHGITYYESEHTDVSIPAPHFSCVIFGSSKRERMSKMLSRLVNPDRPQLPPRFEKECTSEGTSQTVALYIKNGGHFITKLLNFPQLNLPLGAMELYLTARRNEYLYTLSLQLGNAKINFPIQFLISRVLNAHIHVEGDRLIIEDGTISAERLASVISSLYSKKGSS 262 T 0.11 DUF5618 unppercent F Bacteria T 3k93 1 A A Q0I4G3_HAES1 phage related exonuclease GMNNLYHLKVRCSSLHKIIGEPKSKADKEAGKLTDTAKSAVREMAKFDLFGYNAFEGNKYTQKGNELEEQAIKLSGVTRGLALKKNTERRENEFITGECDIYVPSRKLIIDTKCSWDIGSHPFFTDEAQEKAKKAGYDIQMQGYMWLWDCDQAQIDFVLFPTPLNLISAYDSDFKLIDLVEQIPQIRRITTVIIQRDNELIDKIKERVSAAQKYYDQLISEMS 223 T 0.0007 PDDEXK_1 unppercent F Bacteria T 3kf9 2 B,D B,D MYLK2_HUMAN MLCK2 KRRWKKNFIAVSAANRFKKISS 22 T 0.024 PACT_coil_coil unppssm F Eukaryota T 3kl4 2 B B DAP2_YEAST DPAP B, YSCV ARSGSGSGSGSKLIRVGIILVLLIWGTVLLLKSIPHHHHHHH 42 T 0.011 Holin_BlyA pdb F Eukaryota T 3kmz 2 B,D C,D NCOR1_HUMAN N-COR1, N-COR RLITLADHICQIITQDFAR 19 T 6.3 Es2 pdbhh F Eukaryota T 3knt 1 A,B,C,D A,B,C,D OGG1_METJA 8-OXOGUANINE DNA GLYCOSYLASE, DNA-(APURINIC OR APYRIMIDINIC SITE) LYASE, AP LYASE MMLIKKIEELKNSEIKDIIDKRIQEFKSFKNKSNEEWFKELCFCILTANFTAEGGIRIQKEIGDGFLTLPREELEEKLKNLGHRFYRKRAEYIVLARRFKNIKDIVESFENEKVAREFLVRNIKGIGYQEASHFLRNVGYDDVAIIDRHILRELYENNYIDEIPKTLSRRKYLEIENILRDIGEEVNLKLSELDLYIWYLRTGKVLK 207 T 0.0002 HhH-GPD unppercent F Archaea T 3kny 1 A A Q8A1X3_BACTN hypothetical protein BT_3535 GACEQNEDWVVNEPMQSFEENPEYAPLNTIPDWVSEKVTPKEYELWRTMSSRYEINYSFLKKDISEKRKKEIYDCINNICERIEKGQINKYEGFLNIADEDGTTLSDSQYFGRIATRSPEGGAEYKTNGCTLYTHSLGPYIKAAVTYKKSDDDVTITSSSVYTGSPYLGNDPSFSGASSVSYDKDKKLIAASCSGTLSFKDGSRKVEVTVQKTGFMIP 218 T 0.2 FtsH_ext unppercent F Bacteria T 3kpl 3 C C EEYLQAFTY, self peptide from the ATP binding cassette protein ABCD3 EEYLQAFTY 9 T 1.2 DUF3921 pdbhh F T 3kpm 3 C C EEYLKAWTF, mimotope peptide EEYLKAWTF 9 T 8.9 IcmF-related pdbhh F T 3kxs 1 A,B,C,D,E,F F,E,C,D,A,B CAPSD_HBVD1 CORE PROTEIN, CORE ANTIGEN, HBCAG, P21.5 MDIDPYKEFGATVELLSFLPSDFFPSVRDLLDTAAALYRDALESPEHCSPHHTALRQAILCWGDLMTLATWVGTNLEDPASRDLVVSYVNTNVGLKFRQLLWFHISCLTFGRETVLEYLVSFGVWIRTPPAARPPNAPILSTL 143 T 3.9E-25 Hepatitis_core unp T Viruses T 3kyn 3 C P KGPPAALTL peptide KGPPAALTL 9 T 73 Holin_BhlA pdbhh F T 3kyo 3 E,F P,Q KLPAQFYIL peptide KLPAQFYIL 9 T 0.2 RRP14 pdbhh F T 3kze 2 D,E D,E Synthetic Peptide SSRKEYYA 8 T 10 DUF4052 pdbhh F T 3l3q 2 B,C B,C pepTM GSEFESPFKKKRREA 15 T 0.65 DUF240 pdbhh F T 3l41 2 B B phosphorylated H2A tail KPSQEL 6 T 11 POX pdbhh F T 3l8l 2 B,D B,D VAL-GRAMICIDIN A XGAXAXVXWXFWXWXWX 17 T 0.53 MAP17 pdbhh F T 3l9a 1 A X A2V8B8_STRMG uncharacterized protein NKREETDMRDFFVITNSEYTFAGVHYAKGAVLHVSPTQKRAFWVIADQENFIKQVNKNIEYVEKNASPAFLQRIVEIYQVKFEGKNVH 88 T 3.5 DUF4604 pdbhh F Bacteria T 3l9k 2 E,F,G,H W,X,Y,Z DYIN_DROME DH IC, CYTOPLASMIC DYNEIN INTERMEDIATE CHAIN, PROTEIN SHORT WING LSEEQKQMIILSENFQRFVVRAGRVIERALSENVDIYT 38 T 0.36 Mrx7 pdbhh F Eukaryota T 3lca 2 B Q HSP71_YEAST HEAT SHOCK PROTEIN YG100 PEAEGPTVEEVD 12 T 12 DUF6246 pdbhh F Eukaryota T 3lfk 1 A,B,C,D A,B,C,D Q97AP8_THEVO MSCTV GSHMSAMAESKVLVKGTPFNKPVIKGKLENNYDMSQDEVSLLLFLKTHGGKIPLYRIKNETGLKDPESVLKNLMDYGFALEDKERLGEKIVLTSEGEFVAQAIRVRDEELRLKEMKQKKNVNRSSAPPQ 129 T 0.0036 MotA_activ unphh F Archaea T 3lge 2 E,F,G,H E,F,G,H SNX9_HUMAN SH3 AND PX DOMAIN-CONTAINING PROTEIN 1, PROTEIN SDP1, SH3 AND PX DOMAIN-CONTAINING PROTEIN 3A QAYQGPATGDDDDWDEDWDGPKSSSYFKDSE 31 T 0.92 DUF4594 unp F Eukaryota T 3lgf 2 B B DIMETHYLATED p53 Lysine 370 PEPTIDE SSHLKSKKGQ 10 T 37 TEX12 pdbhh F T 3lgl 2 B B DIMETHYLATED p53 LYSINE 382 PEPTIDE TSRHKKLMFKT 11 T 26 DUF420 pdbhh F T 3lh0 2 B B DIMETHYLATED p53 LYSINE 372 PEPTIDE SHLKSKKGQST 11 T 17 Flp_Fap pdbhh F T 3lk4 3 AA,C,DA,F,GA,I,JA,L,O,R,U,X 0,C,3,F,6,I,9,L,O,R,U,X CD2AP_HUMAN CAS LIGAND WITH MULTIPLE SH3 DOMAINS, ADAPTER PROTEIN CMS VNFDDIASSENLLHLTANRPKMPGRRLPG 29 T 0.042 CARMIL_C pdbhh F Eukaryota T 3lkn 3 C C NP418 epitope from 1918 influenza strain LPFERATIM 9 T 2.1 Shal-type pdbhh F T 3lko 3 C C NP418 epitope from 1934 influenza strain LPFDRTTIM 9 T 0.53 DUF5775 pdbhh F T 3lkp 3 C C NP418 epitope from 1972 influenza strain LPFDKSTIM 9 T 3.7 Pas_Saposin pdbhh F T 3lkq 3 C C NP418 epitope from 1977 influenza strain LPFDKTTIM 9 T 4.2 Pas_Saposin pdbhh F T 3lkr 3 C C NP418 epitope from 2009 swine-influenza strain LPFERATVM 9 T 1.3 Shal-type pdbhh F T 3lks 3 C C NP418 epitope from 1980 influenza strain LPFEKSTVM 9 T 2.5 DUF724 pdbhh F T 3ll8 1 A E AKAP5_HUMAN AKAP79 peptide EPIAIIITDTE 11 T 3.2 Copine pdbhh F Eukaryota T 3lly 2 B B LECB2_MACPO MPA GRNGKSQSIIVGPWGD 16 T 0.7 DUF3842 pdbhh F Eukaryota T 3llz 2 B B LECB2_MACPO MPA NGKSQSIIVGPWGD 14 T 0.48 DUF3842 pdbhh F Eukaryota T 3lm1 2 B,D,F,H,J,L,N,P B,D,F,H,J,L,N,P LECB2_MACPO MPA RNGKSQSIIVGPWGD 15 T 0.57 DUF3842 pdbhh F Eukaryota T 3ln4 3 C C HNRPC_HUMAN 16-mer peptide from Heterogeneous nuclear ribonucleoproteins C1/C2 AEMYGSVTEHPSPSPL 16 T 7.6 NepR pdbhh F Eukaryota T 3lnz 2 B,D,F,H,J,L,N,P B,D,F,H,J,L,N,P 12-mer peptide inhibitor TSFAEYWALLSP 12 T 0.31 P53_TAD pdbhh F T 3lqa 2 B G Q1PHM6_9HIV1 Envelope glycoprotein gp160 EIVLENVIENFNMWKNDMVDQMHQDIISLWDQSLKPCVKLTPLCVGAGNCNTSTIAQACPKVSFDPIPIHYCAPAGYAILKCNDKTFNGIGPCNNVSTVQCTHGIKPVVSTQLLLNGSLAEEEVVIRSENISNNVKTIIVHLTESVNITCIGAGHCNINEKAWNETLKKVVEKLVKYFPNKTIEFAPPVGGDLEITTHSFNCGGEFFYCNTTKLFNSIHNSTDSTVNSTDSTAETGNSTNTNITLPCRIRQIINMWQEVGRAMYAPPSKGNITCISDITGLLLTRDGGENKTENNDTEIFRPGGGDMKDNWRSELYKYKVVEIKSGHHHHHH 332 T 1.1E-51 GP120 unp T Viruses T 3lrh 2 B,D,F,H,J,L,N,P B,D,F,H,J,L,N,P HD_HUMAN HUNTINGTON DISEASE PROTEIN, HD PROTEIN EKLMKAFESLKSFQ 14 T 2 Mito_fiss_reg unphh F Eukaryota T 3lt8 1 A A ATP BINDING PROTEIN-D65V GSMDYKDDDDKKTNWLKRIYRVRPCVKCKVAPRNWKVKNKHLRIYNMCKTCFNNSIDIGDDTYHGHVDWLMYADSKEISNT 81 T 0.013 ZZ pdbpssm F T 3lu9 3 C,F C,F PAR1_HUMAN PAR-1, THROMBIN RECEPTOR, COAGULATION FACTOR II RECEPTOR ATNATLDPRSFLLRNPNDKYEPFWE 25 T 4.4 DUF5848 pdbhh F Eukaryota T 3lw1 2 B P P53_HUMAN TUMOR SUPPRESSOR P53, PHOSPHOPROTEIN P53, ANTIGEN NY-CO-13 FKTEGPDSD 9 T 54 DoxA pdbhh F Eukaryota T 3m17 3 I,J,K,L I,J,K,L monomeric peptide inhibitor XRFXTGHFGXXYPCX 15 T 1.8 DUF5617 pdbhh F T 3m1b 3 I,J I,J DIMERIC PEPTIDE INHIBITOR XRFXTGHFGXXYPCKX 16 T 2.1 DUF5617 pdbhh F T 3m4c 2 E E HEME-PEPTIDE FRAGMENT KTTCNACHQ 9 T 1.9E-05 Cytochrom_C_2 pdbhh F T 3m50 2 B P Q42932_NICPL N.plumbaginifolia H+-translocating ATPase mRNA RRELHTLKGHVEAVVKLKGLDIETIQQSYDI 31 T 18 DUF1990 pdbhh F Eukaryota T 3m61 2 B P upain-1 W3A CSARGLENHRMC 12 T 7.9 LRRNT pdbhh F T 3m8f 1 A,B A,B Q8KNP2_BACTI Putative DNA-binding protein MGSSHHHHHHSSGLVPRGSHMNRDHFYTLNIAEIAERIGNDDCAYQVLMAFINENGEAQMLNKTAVAEMIQLSKPTVFATVNWFYCAGYIDETRVGRSKIYTLSDLGVEIVECFKQKAMEMRNL 124 T 1.1E-05 DUF3116 pdbhh F Bacteria T 3mhp 2 C C TIC62_PEA TIC62_peptide KTEQPLSPYTAYDDLKPPSSPSPTKP 26 T 4.1 LEA_6 pdbhh F Eukaryota T 3mhr 2 B P YAP1_HUMAN YAP phosphopeptide RAHSSPASLQ 10 T 0.00014 FAM181 unp F Eukaryota T 3mjh 2 B,D B,D EEA1_HUMAN ENDOSOME-ASSOCIATED PROTEIN P162, ZINC FINGER FYVE DOMAIN-CONTAINING PROTEIN 2 SSSEGFICPQCMKSLGSADELFKHYEAVHDAGND 34 T 0.00027 ATG14 unp F Eukaryota T 3ml4 2 E,F,G,H E,F,G,H MUSK_MOUSE MUSCLE-SPECIFIC TYROSINE-PROTEIN KINASE RECEPTOR, MUSCLE-SPECIFIC KINASE RECEPTOR, MUSK LDRLHPNPMXQRM 13 T 3.4 ETC_C1_NDUFA5 pdbhh F Eukaryota T 3mls 3 C,F,I,L P,Q,R,S Rationally designed V3 mimotope ACQAFYASSPRKSIHIGACA 20 T 1.1 Peptidase_U57 pdbhh F T 3mlu 3 C P A0A0K0KAD3_9HIV1 HIV-1 gp120 third variable region (V3) crown NNTRKSIRIGPGQAFYATGGIIG 23 T 1.9E-05 GP120 pdbhh T Viruses T 3mmg 2 C,D C,D POLG_TVMV Nuclear inclusion protein B fragment ETVRFQSD 8 T 1.2 CzcE pdbhh T Viruses T 3mmy 2 B,D,F,H B,D,F,H NUP98_HUMAN NUCLEAR PORE COMPLEX PROTEIN NUP98, NUCLEOPORIN NUP98, 98 KDA NUCLEOPORIN TGTTIKFNPPTGTDTMVKAGVSTNISTKHQCITAMKEYESKSLEELRLEDYQANRK 56 T 0.06 DUF5023 pdbpssm F Eukaryota T 3mn7 2 B S SPIR_DROME Spire DDD PSPREQLMESIRKGKELKQSRPPLKKASDRQLGPPRMCEPSPREQLMESIRKGKELKQSRPPLKKASDRQLGPPRMCEPSPREQLMESIRKGKELKQA 98 T 0.0014 WH2 pdbpssm F Eukaryota T 3mpn 1 A A O67854_AQUAE Transporter REHWATRLGLILAMAGNAVGLGNFLRFPVQAAENGGGAFMIPYIIAFLLVGIPLMWIEWAMGRYGGAQGHGTTPAIFYLLWRNRFAKILGVFGLWIPLVVAIYYVYIESWTLGFAIKFLVGLVPEPPPNATDPDSILRPFKEFLYSYIGVPKGDEPILKPSLFAYIVFLITMCINVSILIRGISKGIERFAKIAMPTLFILAVFLVIRVFLLETPNGTAADGLNFLWTPDFEKLKDPGVWIAAVGQIFFTLSLGFGAIITYASYVRKDQDIVLSGLTAATLNEKAEVILGGSISIPAAVAFFGVANAVAIAKAGAFNLGFITLPAIFSQTAGGTFLGFLWFFLLFFAGLTSSIAIMQPMIAFLEDELKLSRKHAVLWTAAIVFFSAHLVMFLNKSLDEMDFWAGTIGVVFFGLTELIIFFWIFGADKAWEEINRGGIIKVPRIYYYVMRYITPAFLAVLLVVWAREYIPKIMEETHWTVWITRFYIIGLFLFLTFLVFLAERRRNHE 507 T 7.399999999999999E-33 SNF unppercent F Bacteria T 3mpq 1 A A O67854_AQUAE Transporter KREHWATRLGLILAMAGNAVGLGNFLRFPVQAAENGGGAFMIPYIIAFLLVGIPLMWIEWAMGRYGGAQGHGTTPAIFYLLWRNRFAKILGVFGLWIPLVVAIYYVYIESWTLGFAIKFLVGLVPEPPPNATDPDSILRPFKEFLYSYIGVPKGDEPILKPSLFAYIVFLITMFINVSILIRGISKGIERFAKIAMPTLFCLAVFLVIRVFLLETPNGTAADGLNFLWTPDFEKLKDPGVWIAAVGQIFFTLSLGFGAIITYASYVRKDQDIVLSGLTAATLNEKAEVILGGSISIPAAVAFFGVANAVAIAKAGAFNLGFITLPAIFSQTAGGTFLGFLWFFLLFFAGLTSSIAIMQPMIAFLEDELKLSRKHAVLWTAAIVFFSAHLVMFLNKSLDEMDFWAGTIGVVFFGLTELIIFFWIFGADKAWEEINRGGIIKVPRIYYYVMRYITPAFLAVLLVVWAREYIPKIMEETHWTVWITRFYIIGLFLFLTFLVFLAERRRNH 507 T 7.399999999999999E-33 SNF unppercent F Bacteria T 3mqr 2 B B MDM4_HUMAN HdmX Peptide LDLAHSSESQ 10 T 0.93 DUF6143 unppercent F Eukaryota T 3mqs 2 B D MDM2_HUMAN Hdm2 peptide YSQPSTSSSI 10 T 11 UBA2_C pdbhh F Eukaryota T 3mr9 3 C P PP65_HCMVM 9-meric peptide from Tegument protein pp65 NLVPAVATV 9 T 15 5-FTHF_cyc-lig pdbhh T Viruses T 3mrb 3 C P PP65_HCMVM 9-meric peptide from Tegument protein pp65 NLVPMVHTV 9 T 9.9 ExbD pdbhh T Viruses T 3mrc 3 C P PP65_HCMVA 9-meric peptide from Tegument protein pp65 NLVPMCATV 9 T 1.8 STAT1_TAZ2bind pdbhh T Viruses T 3mrd 3 C P PP65_HCMVA 9-meric peptide from Tegument protein pp65 NLVPMGATV 9 T 0.3 APS-reductase_C pdbhh T Viruses T 3n00 2 B B NCOR1_HUMAN N-COR1, N-COR THRLITLADHICQIITQDFAR 21 T 6.8 Es2 pdbhh F Eukaryota T 3n5e 3 C D Synthetic peptide LR LSCQLYQR 8 T 2 SgrT pdbhh F T 3na2 1 A,B,C,D A,B,C,D Uncharacterized protein MGSSHHHHHHSSGRENLYFQGHVEPGVTDRIGQMILEMFRTGMCLFSVRSPGGVAELYGGEARKVEITGTSLTIEREDWHLHCKLETVETVVFDLSPKDNGGIRMAVVFRDKHQAPVLRAAWLPRLMPETPSPPEQFWAFTQRYIDLPMVVDARNRQLVFPGSGQGGFTEGS 172 T 2.9E-05 HemS pdbhh F T 3nf3 2 B C JTH-NB72-39 inhibitor RRFXAMLA 8 T 6 DUF2052 pdbhh F T 3ni3 1 A,B,C,D,E,F,G,H,I,J,K,L A,B,C,D,E,F,G,H,I,J,K,L 54-membered ring macrocyclic beta-sheet peptide XTYFTYXSXXKX 12 T 4.5 EAV_GP5 pdbhh F T 3njw 1 A A Bicyclic peptide BI-32169 GLPWGCPSDIPGWNTPWAC 19 T 0.94 CIMR pdbhh F T 3nk3 2 C,D C,D ZP3_CHICK Zona pellucida 3 AFAADAGKEVAADVVIGPVLLSADHHHHHH 30 T 7.2 Psg1 unphh F Eukaryota T 3nmx 2 D,E,F D,E,F ARHG4_HUMAN APC-STIMULATED GUANINE NUCLEOTIDE EXCHANGE FACTOR, ASEF SSSHHYSHPGGGGEQLAINELISDG 25 T 3.6 CDC24 unppercent F Eukaryota T 3noh 1 A A A7B039_RUMGV putative peptide binding protein GVTGATPKAKKAAQSSAQLEGSYIFCMNPLLDKLSDEDIREQLKAFVTGKTDSIRTDTELSFDIYVSETDYALIRYADSLCERLNDAGADVQIKQYSGTMLRSRAVSGKYEAFLSESDLVSTDALENADYIILDSAEMR 139 T 0.0019 SBP_bac_5 pdbhh F Bacteria T 3nqi 1 A,B,C,D A,B,C,D A0A380YR22_BACFN Putative lipoprotein GMDSGESGPQQWAGVVKVNDRMGYVTFTDAAGTELIPTNTIPVTLNARMAYIYCQVDEGQDLSTNPKSIKITLLADPTGIDATAITTPKVGESGDVTTNAPVGSLSFVSGYSTVAPFQFSENTIVLPVLYRVKNVTTTEDIKNELAKHTFTLVCYTDDIKSGDTILKLYLRYKVEDEPAAIAERATRTSSFKAYEISQILREYTLKSGQTKPAKITIVAQQNEYNNKLEDTSTIEKVYEIEYKTAE 246 T 0.00042 NigD_C pdbhh F Bacteria T 3nsw 1 A,B,C,D,E,F,G A,B,C,D,E,F,G Q6R7N7_9BILA Excretory-secretory protein 2 GSHMEYCPKMLSEIRQEDINDVETVAYVTVTGKTARSYNLQYWRLYDVPKTAPSQWPSFGTLRDDCGNIQLTADTDYVLGCKSGNQDCFVKLHDGLSQKEKDLLKE 106 T 0.098 Augurin unppssm F Eukaryota T 3nti 2 B C AUB_DROME AUB[R15(ME2S)] NPVIARGRGXGRK 13 T 0.023 Tristanin_u2 pdbhh F Eukaryota T 3o0e 2 G,H,I,J,K,L L,M,N,O,P,Q CEA9_ECOLX Colicin-E9 SGGDGRGHNTGAHSTSG 17 T 10 Spore_II_R pdbhh F Bacteria T 3o17 2 B,D F,G JIP1_MOUSE JNK-INTERACTING PROTEIN 1, JIP-1, JNK MAP KINASE SCAFFOLD PROTEIN 1 PKRPTTLNLF 10 T 6.8 Lipoprotein_19 pdbhh F Eukaryota T 3o2i 1 A,B,C A,B,C Uncharacterized protein MGSSHHHHHHSSGRENLYFQGMRGDDMHIYELVSRDRTHPVRIYLLHSEYWTEDEFYNLLLEAFQRSSASDWHLQILEVSKYLVTAHGFVEAGGLQEIGFPGELSKTEVRRRINAFLGKDRSDGS 125 T 0.6 EndoU_bacteria pdbpssm F T 3o3b 3 C,F C,F Peptidomimetic ELA-1.1 ELAXXLTV 8 T 49 RasGEF pdbhh F T 3oa6 4 G G H4 peptide monomethylated at lysine 20 GLGKGGAKRHRKVLRDNIQGITKY 24 T 18 DUF1938 pdbhh F T 3oa8 1 A,C,E A,C,E SOXA_STAND SoxA MRRFAAGCLALALLVLPFVLTGARAAEDESEKEIERYRQMIEDPMANPGFLNVDRGEVLWSEPRGTRNVSLETCDLGEGPGKLEGAYAHLPRYFADTGKVMDLEQRLLWCMETIQGRDTKPLVAKPFSGPGRTSDMEDLVAFIANKSDGVKIKVALATPQEKEMYAIGEALFFRRSSINDFSCSTCHGAAGKRIRLQALPQLDVPGKDAQLTMATWPTYRVSQSALRTMQHRMWDCYRQMRMPAPDYASEAVTALTLYLTKQAEGGELKVPSIKR 275 T 1.3E-05 Dehyd-heme_bind pdbhh F Bacteria T 3oak 2 C,D C,D SPT6_YEAST CHROMATIN ELONGATION FACTOR SPT6 DPFTHMSDKIDEMYDIFGDGHDYDWALEIEN 31 T 3.3 SPT6_acidic unppssm F Eukaryota T 3ob1 1 A A SPY2_HUMAN SPRY-2 IRNTNEXTEGPT 12 T 3.3 KAR9 unp F Eukaryota T 3ob2 1 A A EGFR_HUMAN RECEPTOR TYROSINE-PROTEIN KINASE ERBB-1, PROTO-ONCOGENE C-ERBB-1 DSFLQRXSSDPT 12 T 0.91 DUF4348 pdbhh F Eukaryota T 3obq 2 B B HGS_HUMAN PROTEIN PP110, HRS PTPSAPVPL 9 T 5.4 AKAP28 pdbhh F Eukaryota T 3oe0 2 B I Polyphemusin analog, CXC chemokine receptor antagonist RRXCYQKXPYRXCRGX 16 T 4.7 Cytomega_TRL10 pdbhh F T 3oiq 2 B B DPOA_YEAST DNA polymerase alpha catalytic subunit A SPLKLQSRKLRYANDVQDLLDDVENSPVVATKRQNV 36 T 0.53 Wap1 pdbhh F Eukaryota T 3oka 2 C,D C,D N-terminal His-affinity tag MGHHHHHHHHHHSSGHIEGRH 21 T 9500 zf_CCCH_4 pdbhh F T 3olr 2 E,F,G,H E,F,G,H SKAP2 YGEEXDDLY 9 T 12 Rox3 pdbhh F T 3omg 2 C,D C,D dimethylated arginine peptide R14me2s RGRAXGQE 8 T 37 Aim21 pdbhh F T 3omh 2 E,F,G,H E,F,G,H SKAP2_HUMAN SRC FAMILY-ASSOCIATED PHOSPHOPROTEIN 2, SRC KINASE-ASSOCIATED PHOSPHOPROTEIN 55-RELATED PROTEIN, SKAP55 HOMOLOG, SKAP-55HOM, SKAP-HOM, SRC-ASSOCIATED ADAPTER PROTEIN WITH PH AND SH3 DOMAINS, PYK2/RAFTK-ASSOCIATED PROTEIN, RETINOIC ACID-INDUCED PROTEIN 70 DGEEXDDPF 9 T 8 S100PBPR pdbhh F Eukaryota T 3oo3 1 A A Q6ZZI7_ACTTI P450 MONOOXYGENASE MALPLPHQRLRLDPVPEFEELQKAGPLHEYDTEPGMDGRKQWLVTGHDEVRAILADHERFSSMRPVDDEADRALLPGILQAYDPPDHTRLRRTVAPAYSARRMERLRPRIEEIVEECLDDFESVGAPVDFVRHAAWPIPAYIACEFLGVPRDDQAELSRMIRESRESRLPRQRTLSGLGIVNYTKRLTSGKRRDPGDGMIGVIVREHGAEISDEELAGLAEGNLIMAAEQMAAQLAVAVLLLVTHPDQMALLREKPELIDSATEEVLRHASIVEAPAPRVALADVRMAGRDIHAGDVLTCSMLATNRAPGDRFDITREKATHMAFGHGIHHCIGAPLARLQLRVALPAVVGRFPSLRLAVPEEDLRFKPGRPAPFAVEELPLEW 384 T 1.5E-26 p450 pdbpercent F Bacteria T 3op0 2 C,D C,D EGFR_HUMAN RECEPTOR TYROSINE-PROTEIN KINASE ERBB-1, PROTO-ONCOGENE C-ERBB-1 LQRXSSDPTGA 11 T 11 RNase_Y_N pdbhh F Eukaryota T 3opy 3 I,J,K,L I,J,K,L PFKA3_PICPA 6-phosphofructo-1-kinase gamma-subunit MVTKDSIIRDLERENVGPEFGEFLNTLQTDLNSEKPPIEQVKSQLETHFNLAHETQEFSRKNDNAPVDKLLTNYYNNYEVNVLEFVLQMGFSRDLSIPLNVWFVLDMISQLSTSKQDLPLDYYLVLNNSQTGKYSDFVRYLIYEAVGAEIHCFEQGSMPEQYRSSRWEDKVKGPALANRGPIRGNVGAGDRKITFHLLCKKTARMILVGDDRETDFEMSDRSFVTLLLDYYQRVGTTKKIDLLLLTNNFDTNMNNKLQQLKILESLNMLKSNCYVLDYQITVDQVTANFNSYVEGIPAFRRHEIANFLKKRKTPKNADELIFKYVGRWNICYQKKFHQGNISIHQISGYLD 351 T 0.11 DNA_III_psi pdbpercent F Eukaryota T 3oq5 2 D,E D,E P53_HUMAN Cellular tumor antigen p53 TSRHKKLMFK 10 T 21 DUF420 pdbhh F Eukaryota T 3oqg 1 A,B A,B Q9KJ88_HELPX Hpy188I MGHHHHHHEFMAKRKSDIILKSVDDLKDEIDYKDFEYKEYFNLLCELVPNNSLEKLEINAIDEKNMKNEGLVYVFVIQGKIFKIGHSITPITKRVQSYNCGKVEYRKNGTCSTTNYFVLQSLLKINKIVQVYAFFPEQPTYTLFGKTYQDSFSTSKRAENVILENFIKNHNKKPIGCTQT 180 T 0.059 MUG113 pdbpssm F Bacteria T 3os5 2 B B Dnmt1 TPRRSKSA 8 T 1.7 DUF4808 pdbhh F T 3ots 2 C P POL_HV1Y2 MA/CA substrate peptide QNYPIVQ 7 T 43 Rep-A_N pdbhh T Viruses T 3ou1 3 C P POL_HV1Y2 RH/IN substrate peptide KVLFLDG 7 T 0.016 Spermine_synt_N pdbhh T Viruses T 3ou3 2 C C POL_HV1Y2 PR/RT substrate peptide LNFPISP 7 T 0.6 Peptidase_A2B unphh T Viruses T 3ou4 3 C C POL_HV1Y2 TF/PR substrate peptide FNFPQIT 7 T 20 DUF1810 pdbhh T Viruses T 3oua 2 C P POL_HV1Y2 p1/p6 substrate peptide GNFLQSR 7 T 0.41 HypA unp T Viruses T 3oub 2 C P POL_HV1Y2 NC/p1 substrate peptide QVNFLGK 7 T 3.8 HypA unppercent T Viruses T 3ouc 2 C P POL_HV1Y2 p2/NC substrate peptide TIMMQRG 7 T 0.41 HypA unp T Viruses T 3oud 3 C P POL_HV1Y2 CA/p2 substrate peptide RVLFEAM 7 T 0.41 HypA unp T Viruses T 3owr 1 A,B,C,D A,B,C,D Q5L7M9_BACFN uncharacterized hypothetical protein GSKEDLPAYEEAEITKVGAYHRFYSGDKDAITGENIVAEKELDRTNNIDSEHGVATAVFTIPAAGGKFTEAERAKVSLSNLVVYVNVSTAARVTPLDGSPKFGVPADWTREHKYSVMAADGTKKIWTVKVTLNK 134 T 0.0012 DUF5018 pdbpercent F Bacteria T 3owt 2 C C SIR3_YEAST SILENT INFORMATION REGULATOR 3 SEKGNAKMIDFATLSKLKKKYQIILDR 27 T 0.16 IPK pdbhh F Eukaryota T 3ox7 2 B P MH027 MGSADGACSWRGLENHAMCGAAG 23 T 3.2 DUF2632 pdbhh F T 3oy6 2 B P MH036 MGSADGACSWRGLENHRMCGAAG 23 T 5 DUF2632 pdbhh F T 3p2z 2 B B phosphopeptide XPLHSTAX 8 T 130 Dip pdbhh F T 3p34 2 B B phosphopeptide XMQSTPLX 8 T 160 DUF5792 pdbhh F T 3p35 2 D,E D,E phosphopeptide XMQSSPLX 8 T 49 LEA_6 pdbhh F T 3p36 2 B B phosphopeptide XDPPLHSTAX 10 T 32 IML1 pdbhh F T 3p37 2 D,E,F E,D,F phosphopeptide XFDPPLHSTAX 11 T 17 FAF pdbhh F T 3p46 1 A,B,C A,B,C Synthetic collagen peptide XGPPGPPGLPGEAGPPGPPX 20 T 0.0035 Collagen pdbpssm F T 3p4f 2 B B RBBP5_HUMAN RBBP-5, RETINOBLASTOMA-BINDING PROTEIN RBQ-3 EDEEVDVTSVY 11 T 0.014 DUF2457 unppercent F Eukaryota T 3p4f 3 C C KMT2A_HUMAN ZINC FINGER PROTEIN HRX, ALL-1, TRITHORAX-LIKE PROTEIN, LYSINE N-METHYLTRANSFERASE 2A HGAARAEVHL 10 T 1.1 N-SET unphh F Eukaryota T 3p4k 2 B P MAP kinase 14 AADLRISCNSK 11 T 7.6 mRNA_triPase pdbhh F T 3p6z 3 C,F C,I FA5_HUMAN ACTIVATED PROTEIN C COFACTOR AHHHHHHVGTWENLYFQSIPDDDEDSYEIFEPPESTVMATRKMHDRLEPEDEESDADYDYQNRLAAALGIR 71 T 0.022 2OG-FeII_Oxy_5 pdb F Eukaryota T 3p72 2 B B OS1 peptide CTERMALHNLC 11 T 8.1 PHC2_SAM_assoc pdbhh F T 3p87 2 B,D,F,H,J,L G,H,I,J,K,L RNH2B_HUMAN RNASE H2 SUBUNIT B, AICARDI-GOUTIERES SYNDROME 2 PROTEIN, AGS2, DELETED IN LYMPHOCYTIC LEUKEMIA 8, RIBONUCLEASE HI SUBUNIT B DKSGMKSIDTFFGVKNKKKIGKV 23 T 1.4 Mif2_N pdbhh F Eukaryota T 3pbj 1 A,B,C,D,E,F A,B,C,D,E,F COIL SER L9L-Pen L23H XEWEALEKKXAALESKLQALEKKHEALEHGX 31 T 0.0034 DUF5320 pdbhh F T 3pbp 1 A,D,G,J A,D,G,J NUP82_YEAST NUCLEAR PORE PROTEIN NUP82 MSQSSRLSALPIFQASLSASQSPRYIFSSQNGTRIVFIQDNIIRWYNVLTDSLYHSLNFSRHLVLDDTFHVISSTSGDLLCLFNDNEIFVMEVPWGYSNVEDVSIQDAFQIFHYSIDEEEVGPKSSIKKVLFHPKSYRDSCIVVLKEDDTITMFDILNSQEKPIVLNKPNNSFGLDARVNDITDLEFSKDGLTLYCLNTTEGGDIFAFYPFLPSVLLLNEKDLNLILNKSLVMYESLDSTTDVIVKRNVIKQLQFVSKLHENWNSRFGKVDIQKEYRLAKVQGPFTINPFPGELYDYTATNIATILIDNGQNEIVCVSFDDGSLILLFKDLEMSMSWDVDNYVYNNSLVLIERVKLQREIKSLITLPEQLGKLYVISDNIIQQVNFMSWASTLSKSINESDLNPLAGLKFESKLEDIATIERIPNLAYINWNDQSNLALMSNKTLTFQNISS 452 T 4.9E-12 Nup88 pdbpercent F Eukaryota T 3pbp 3 C,F,I,L C,F,I,L NU159_YEAST NUCLEAR PORE PROTEIN NUP159 SSITKDMKGFKVVEVGLAMNTKKQIGDFFKNLNMAK 36 T 7.2 DUF1413 pdbhh F Eukaryota T 3pe4 2 B,D B,D CSK21_HUMAN CK II ALPHA YPGGSTPVSSANMM 14 T 13 Pr_beta_C pdbhh F Eukaryota T 3pes 1 A,B A,B A9J573_BPPYU Uncharacterized protein gp49 SNAMLAEFEDRVAGIPCLIVVTYWEPYVPAKVSGPPEYCYPAEGGCGEWEVRDRRGRPAPWLERKLTEAERERIDQAVFDRMEGR 85 T 5.2 Sulfotransfer_1 pdbhh T Viruses T 3pf6 1 A,B,C,D A,B,C,D C8ZKC7_9CAUD hypothetical protein PP-LUZ7_gp033 GMSQFQEVRPVAQALYPTHPSTKDALEEARLLFPGGTHHDFMRALMGYHNTLVKVMEEQCGS 62 T 0.006 Arabinose_Iso_C pdbpssm T Viruses T 3pgm 1 A,B A,B PMG1_YEAST Phosphoglycerate mutase 1 PKLVLVRHGQSEWNEKNLFTGWVDVKLSAKGQQEAARAGELLKEKGVNVLVDYTSKLSRAIQTANIALEKADRLWIPVNRSWRLNERHYGDLQGKDKAQTLKKFGEEKFNTYRRSFDVPPPPIDASSPFSQKGDERYKYVDPNVLPETESLALVIDRLLPYWQDVIAKLVGKTSMIAAHGNSLRGLVKHLEGISDADIAKLNIPPGTILVFELDENLKPSKPSYYLDPEAAAAGAAAVANQGKK 244 T 5E-07 His_Phos_1 pdb F Eukaryota T 3pkn 2 B B LARP4_HUMAN LA RIBONUCLEOPROTEIN DOMAIN FAMILY MEMBER 4 TGLNPNAKVWQEIA 14 T 0.013 PAM2 pdbhh F Eukaryota T 3plf 1 A,C A,C MetRD peptide NESVRXDATFP 11 T 17 2-thiour_desulf pdbhh F T 3poa 2 B B synthetic phosphopeptide DTAPTEKIAYKK 12 T 7.1 DUF1299 pdbhh F T 3pqr 2 B B GNAT1_BOVIN GALPHA SUBUNIT OF TRANSDUCIN, TRANSDUCIN ALPHA-1 CHAIN ILENLKDVGLF 11 T 2.1 Phage_holin_4_1 pdbhh F Eukaryota T 3pqz 2 E,F L,M cyclic peptide WFEGYDNTFPX 11 T 0.77 RestrictionMunI pdbhh F T 3psl 2 C,D C,D N-alpha acetylated form of histone H3 XARTKQ 6 T 380 SLBP_RNA_bind pdbhh F T 3pth 2 B B LAR4B_HUMAN LA RIBONUCLEOPROTEIN DOMAIN FAMILY MEMBER 4B, LA RIBONUCLEOPROTEIN DOMAIN FAMILY MEMBER 5, LA-RELATED PROTEIN 5 ELNPNAEVWGAPVLH 15 T 0.84 TT_ORF2 unppercent F Eukaryota T 3puj 2 B,D C,D STX4_MOUSE Syntaxin-4 N-terminal peptide MRDRTHELRQ 10 T 0.13 Syntaxin-5_N pdbhh F Eukaryota T 3pvl 2 B B USH1G_HUMAN SCAFFOLD PROTEIN CONTAINING ANKYRIN REPEATS AND SAM DOMAIN SEVSTDSGHDSLFTRPGLGTMVFRRNYLSSGLHGLGREDGGLDGVGAPRGRLQSSPSLDDDSLGSANSLQDRSCGEELPWDELDLGLDEDLEPETS 96 T 2.1 DUF452 unphh F Eukaryota T 3pwj 3 C,F C,F HuD (G2L,I9V) peptide LLYGFVNYV 9 T 7.6 OMS28_porin pdbhh F T 3pwl 3 C,F C,F HuD peptide LGYGFVNYI 9 T 3.1 NapE pdbhh F T 3pwn 3 C,F C,F HuD (G2L) peptide LLYGFVNYI 9 T 9.1 OMS28_porin pdbhh F T 3pxe 2 E,F,G,H E,F,G,H phospho peptide SRSTSPTFNK 10 T 1.5 DUF782 pdbhh F T 3q0a 1 A,C A,B RPOLV_BPN4 Virion RNA polymerase MGGSHHHHHHRSESTVTEELKEGIDAVYPSLVGTADSKAEGIKNYFKLSFTLPEEQKSRTVGSEAPLKDVAQALSSRARYELFTEKETANPAFNGEVIKRYKELMEHGEGIADILRSRLAKFLNTKDVGKRFAQGTEANRWVGGKLLNIVEQDGDTFKYNEQLLQTAVLAGLQWRLTATSNTAIKDAKDVAAITGIDQALLPEGLVEQFDTGMTLTEAVSSLAQKIESYWGLSRNPNAPLGYTKGIPTAMAAEILAAFVESTDVVENIVDMSEIDPDNKKTIGLYTITELDSFDPINSFPTAIEEAVLVNPTEKMFFGDDIPPVANTQLRNPAVRNTPEQKAALKAEQATEFYVHTPMVQFYETLGKDRILELMGAGTLNKELLNDNHAKSLEGKNRSVEDSYNQLFSVIEQVRAQSEDISTVPIHYAYNMTRVGRMQMLGKYNPQSAKLVREAILPTKATLDLSNQNNEDFSAFQLGLAQALDIKVHTMTREVMSDELTKLLEGNLKPAIDMMVEFNTTGSLPENAVDVLNTALGDRKSFVALMALMEYSRYLVAEDKSAFVTPLYVEADGVTNGPINAMMLMTGGLFTPDWIRNIAKGGLFIGSPNKTMNEHRSTADNNDLYQASTNALMESLGKLRSNYASNMPIQSQIDSLLSLMDLFLPDINLGENGALELKRGIAKNPLTITIYGSGARGIAGKLVSSVTDAIYERMSDVLKARAKDPNISAAMAMFGKQAASEAHAEELLARFLKDMETLTSTVPVKRKGVLELQSTGTGAKGKINPKTYTIKGEQLKALQENMLHFFVEPLRNGITQTVGESLVYSTEQLQKATQIQSVVLEDMFKQRVQEKLAEKAKDPTWKKGDFLTQKELNDIQASLNNLAPMIETGSQTFYIAGSENAEVANQVLATNLDDRMRVPMSIYAPAQAGVAGIPFMTIGTGDGMMMQTLSTMKGAPKNTLKIFDGMNIGLNDITDASRKANEAVYTSWQGNPIKNVYESYAKFMKNVDFSKLSPEALEAIGKSALEYDQRENATVDDIANAASLIERNLRNIALGVDIRHKVLDKVNLSIDQMAAVGAPYQNNGKIDLSNMTPEQQADELNKLFREELEARKQKVAKAR 1118 T 0.0038 RNA_pol pdbhh T Viruses T 3q47 2 B C SMAD1_HUMAN Smad1 peptide SPHNPISDVD 10 T 0.84 DUF2733 pdbhh F Eukaryota T 3q4a 2 B C SMAD1_HUMAN Smad1 peptide SPHNPISSVS 10 T 2.6 DUF4943 pdbhh F Eukaryota T 3q6s 2 E,F E,F SGO1_HUMAN HSGO1, SEROLOGICALLY DEFINED BREAST CANCER ANTIGEN NY-BR-85 TNVSLYPVVKIRRLSLSPK 19 T 64 PUB_1 pdbhh F Eukaryota T 3q8d 2 C,D E,F SSB_ECOLI Single-stranded DNA-binding protein YMDFDDDIPF 10 T 0.22 Phage_SSB pdbhh F Bacteria T 3q9g 1 A A Cyclic pseudo-peptide VQIV(4BF)(ORN)(HAO)KL(ORN) VQIVXXXKLX 10 T 40 DUF6332 pdbhh F T 3q9h 1 A,B,C,D,E,F,G,H A,B,C,D,E,F,G,H Cyclic pseudo-peptide LVFFA(ORN)(HAO)LK(ORN) LVFFAXXLKX 10 T 8.7 DUF5347 pdbhh F T 3q9i 1 A,B,C,D,E,F,G,H A,B,C,D,E,F,G,H Cyclic pseudo-peptide LV(4BF)FA(ORN)(HAO)LK(ORN) LVXFAXXLKX 10 T 10 DUF5347 pdbhh F T 3q9j 1 A,B,C,D,E,F,G,H A,B,C,D,E,F,G,H Cyclic pseudo-peptide AIIFL(ORN)(HAO)YK(ORN) AIIFLXXYKX 10 T 21 Corona_NS3b pdbhh F T 3qdr 2 B B CEA_CITFR Colicin-A GSKPGDSYNTPWGKVIINAAGQPTMNGTVMTADNSSMVPYGRGFTRVLNSLVNNPVSHHHHHH 63 T 0.069 DUF6162 pdb F Bacteria T 3qdz 3 E,F E,F PAR4_HUMAN PAR-4, COAGULATION FACTOR II RECEPTOR-LIKE 3, THROMBIN RECEPTOR-LIKE 3 TPSILPAPR 9 T 2.6 Abhydrolase_9_N pdbhh F Eukaryota T 3qfj 3 C C TAX(Y5F) peptide LLFGFPVYV 9 T 0.35 YvrJ pdbhh F T 3qg6 3 E,F C,D Q9F6Z3_STAAU Agr autoinducing peptide YSTCYFIM 8 T 0.7 Ly49 pdbhh F Bacteria T 3qis 2 B B SESQ1_HUMAN SES1 PFARLHECYGQEI 13 T 8 MEIOC pdbhh F Eukaryota T 3qkr 3 C C MRE11_PYRFU MRE11 NUCLEASE, PFMRE11 SDFFTEFELKIIDILGEKDFDDFDYIIKLITEGK 34 T 0.23 PufQ unppssm F Archaea T 3qn7 2 B B UK18 ACSRYEVDCRGRGSACG 17 T 1.5 Toxin_24 pdbhh F T 3qnj 2 C,D C,D antimicrobial peptide oncocin VDKPPYLPRPRPPRXIYNX 19 T 0.14 Apidaecin pdbhh F T 3qnz 3 C C TBB5_HUMAN TUBULIN BETA-5 CHAIN TAEEEEDFGE 10 T 20 DUF1639 pdbhh F Eukaryota T 3qo0 3 C C TBB5_HUMAN TUBULIN BETA-5 CHAIN YQQYQDATAEEEEDFGEEAE 20 T 10 Hrs_helical unphh F Eukaryota T 3qq3 3 C,F C,F NRAM_I96A0 PEPTIDE OF SLA-1*0401-S-OIVNW9 NSDTVGWSW 9 T 2.2 DUF4902 pdbhh T Viruses T 3qxy 2 B,D P,Q TF65_HUMAN NUCLEAR FACTOR NF-KAPPA-B P65 SUBUNIT, NUCLEAR FACTOR OF KAPPA LIGHT POLYPEPTIDE GENE ENHANCER IN B-CELLS 3 RKRTYETFKSIMKKS 15 T 1.4 Dynein_attach_N pdbhh F Eukaryota T 3qz0 1 A,B A,B Q73RI0_TREDE Factor H binding protein MAHHHHHHVDDDDKTFKMNTAQKAHYEKFINALENELKTRHIPAGAVIDMLAEINTEALALDYQIVDKKPGTSIAQGTKAAALRKRFIPKKIK 93 T 0.0084 DUF4969 unphh F Bacteria T 3qzs 2 C,D C,D H4_HUMAN Histone H4 KGGAXRHRKV 10 T 11 Shadoo unppercent F Eukaryota T 3qzv 2 B C H4_HUMAN Histone H4 GKGLGXGGAKR 11 T 11 Shadoo unppercent F Eukaryota T 3r0h 2 B,D,F,H,J,L,N,P a,b,c,d,e,f,g,h NG2 ALRNGQYWV 9 T 0.34 FixS pdbhh F T 3r29 2 C,D C,D NCOR2_HUMAN SMRT, N-COR2, CTG REPEAT PROTEIN 26, SMAP270, SILENCING MEDIATOR OF RETINOIC ACID AND THYROID HORMONE RECEPTOR, T3 RECEPTOR-ASSOCIATING FACTOR, TRAC, THYROID-RECEPTOR-ASSOCIATED COREPRESSOR, RETINOIC-ACID-RECEPTOR-ASSOCIATED COREPRESSOR TNMGLEAIIRKALMGK 16 T 3.6 FYTT pdbhh F Eukaryota T 3r42 2 B B VPS27_YEAST VPS27, GOLGI RETENTION DEFECTIVE PROTEIN 11 QVPSDPYNY 9 T 14 DUF3460 pdbhh F Eukaryota T 3r46 1 A,B,C,D,E,F A,B,C,E,F,G coiled coil helix L24D XGELKAIAQELKAIAKELKAIAWEDKAIAQGAGYX 35 T 1.8 DUF5320 pdbhh F T 3r47 1 A,B,C,D,E,F,G,H,I,J,K,L A,B,C,E,F,G,H,I,J,K,L,M coiled coil helix L24H XGELKAIAQELKAIAKELKAIAWEHKAIAQGAGX 34 T 0.95 Rho_N pdb F T 3r48 2 B,C,D B,C,E coiled coil helix Y15-L24D XGELKAIAQELKAIAYELKAIAKEDKAIAQGX 32 T 2.1 DUF5660 pdbhh F T 3r4a 1 A,B,C,D A,B,C,D coiled coil helix CC-tet XGELAAIKQELAAIKKELAAIKWELAAIKQGAGX 34 T 0.0033 DUF5320 pdbhh F T 3r4h 1 A,B,C,D,E,F A,B,C,D,E,F coiled coil helix CC-Tet-phi22 XGELAAIKQELAAIKKELAAIKXELAAIKQGAGX 34 T 0.0067 DUF5320 pdbhh F T 3r7g 2 B B FMN2_HUMAN Formin-2 KSLYKIKPRHDSGIKAKISMKT 22 T 14 DUF6140 pdbhh F Eukaryota T 3rbq 2 G,H,I,J,K,L G,H,I,J,K,L GNAT1_HUMAN TRANSDUCIN ALPHA-1 CHAIN XGAGASAEEKH 11 T 2.8 DUF4917 pdbhh F Eukaryota T 3rc5 2 B B MAVS_HUMAN Product MAVS XQEREVPC 8 T 3.6 GRA6 unphh F Eukaryota T 3rce 2 B B Substrate Mimic Peptide GDQNATXG 8 T 3.9 S-AdoMet_synt_M pdbhh F T 3rgv 5 E E peptide WIYVYRPMGCGGS 13 T 0.19 BAALC_N pdbhh F T 3rh3 1 A,B A,B Q8A6H6_BACTN Uncharacterized DUF3829-like protein GQTVSSESTEELDDASKVINYYHMSLAVLRHVANAKDINAVLGYMEQTGKVPEVDPIAPPEIAARDTAELLDPGDYFNPEVRQNLKQNYAGLFNVRTQFYDNFNKFLAYKKSKDTAKTAQLLDENYKLSVELSEYKQVIFDILSPLTEQAESELLADEPLKDQIMAMRKMSGTVQSIMNLYSRKHAMDGVRIDLKMAELEKELKAAEKIPAVTGYDEELKNFQSFLSTVKSFMNDMQKARSKGAYSDKEYQAMSEAYEYGLSVI 264 T 0.0075 PPR_3 pdbpercent F Bacteria T 3rj2 1 A X Q5YFA7_9VIRU ORF158L PROTEIN MGWAIVANCEFVNATGKKTTILVNENWAKYCWIWTYKFPEKYTLLRYSVDGEMFMRHRVTFFNATGRYITHTHLNHGLEDVLEGSLAVPKDAAYARIHAAINVSLTNPGDVHMHYDETEGEQIRSYDAAEFARTLAAV 138 T 0.11 Terminase_6N pdb T Viruses T 3rmr 1 A,B,C A,B,C ATR1_HYAAE ATR1 AQTALDDDEERWPFGPSAVEALIETIDRHGRVSLNDEAKMKKVVRTWKKLIERDDLIGEIGKHYFEAPGPLHDTYDEALATRLVTTYSDRGVARAILHTRPSDPLSKKAGQAHRLEEAVASLWKGRGYTSDNVVSSIATGHDVDFFAPTAFTFLVKCVESEDDANNAIFEYFGSNPSRYFSAVLHAMEKPDADSRVLESSKKWMFQCYAQKQFPTPVFERTLAAYQSEDYAIRGARNHYEKLSLSQIEELVEEYSRIYSV 260 T 0.0016 RXLR unphh F Eukaryota T 3ro2 2 B B NUMA1_HUMAN NUMA PROTEIN, SP-H ANTIGEN RNSFYMGTCQDEPEQLDDWNRIAELQQR 28 T 11 FliT pdbhh F Eukaryota T 3rqe 2 E E PAXI_HUMAN Paxillin LD1 peptide DDLDALLADLESTT 14 T 2.2 DUF2525 pdbhh F Eukaryota T 3rss 2 B B Unknown peptide, probably from expression host APAWLFEA 8 T 0.48 Xin pdbhh F T 3rte 2 B B Unknown peptide, probably from expression host PAWLFEA 7 T 2 Ribosomal_L37 pdbhh F T 3ru4 3 C C CTRA_BOVIN CHYMOTRYPSIN A CHAIN A CGVPAIQPVLS 11 T 1.9 SH pdbhh F Eukaryota T 3ryl 1 A,B A,B Q87GE5_VIBPA ACTIN FILAMENT POINTED END-BINDING DOMAIN GHMRLLSEDLFKQSPKLSEQELDELANNLADYLFQAADIDWHQVISEKTRGLTTEEMAKSEHRYVQAFCREILKYPDCYKSADVASPESPKSGGGSVIDVALKRLQTGRERLFTTTDEKGNRELKKGDAILESAINAARMAISTEEKNTILSNNVKSATFEVFCELPCMDGFAEQNGKTAFYALRAGFYSAFKNTDTAKQDITKFMKDNLQAGFSGYSYQGLTNRVAQLEAQLAALSAKLS 241 T 0.0032 ABC_tran_CTD unppssm F Bacteria T 3rz2 2 C,D C,D Prl-1 (PTP4A1) GWWSLIPPKYIT 12 T 0.068 RRP14 pdbhh F T 3s1b 2 B A mini-Z FNKECLLRYKEAALDPNLNLYQRIAKIVSIDDDC 34 T 2.7 TRCF pdbhh F T 3s1t 2 C C AK_MYCTU Aspartokinase EATVYAGTGRL 11 T 3.6 PC_rep pdbhh F Bacteria T 3s3h 2 C C phosphopeptide GP4 KNSFVXQKLSE 11 T 1.6 DUF244 pdbhh F T 3s63 1 A,B A,B H2L2M0_NECAM NA-SLP-1 LTPKETCDLCQIALRTVFGHFGGNIPSRRKLVHQLKHECKRHFNYRRRCLLLMKVNSDLIFREMTDGSFKPMEVCLIMRECNPHDSPLEPEMIDKSGQPEAFALVSSSDDNYDTSEE 117 T 0.41 SapB_1 pdbhh F Eukaryota T 3s7d 2 B I Monomethylated p53 peptide SSHLKSKKGQSTS 13 T 29 Class_IIIsignal pdbhh F T 3s9c 2 B B FA5_HUMAN ACTIVATED PROTEIN C COFACTOR, PROACCELERIN, LABILE FACTOR SRDPDNIAAWYLRS 14 T 0.055 PPTA pdb F Eukaryota T 3sbn 1 A,B A,B Trichovirin I-4A XXNLXPAVXPXLXPX 15 T 22 Ribosomal_L11_N pdbhh F T 3sfj 2 B,D B,D decameric peptide iCAL36 ANSRWPTSII 10 T 3.7 C9orf72-like pdbhh F T 3sge 3 E,F K,M R13 peptide EEEDDDMGFGLFD 13 T 0.0078 Ribosomal_60s pdb F T 3shw 2 B B CXG1_HUMAN CONNEXIN-45, CX45, GAP JUNCTION ALPHA-7 PROTEIN SGDGKTSVWI 10 T 0.11 SKI pdbhh F Eukaryota T 3si5 2 C,D X,Y KNL1_HUMAN ALL1-FUSED GENE FROM CHROMOSOME 15Q14 PROTEIN, AF15Q14, BUB-LINKING KINETOCHORE PROTEIN, BLINKIN, CANCER SUSCEPTIBILITY CANDIDATE GENE 5 PROTEIN, CANCER/TESTIS ANTIGEN 29, CT29, KINETOCHORE-NULL PROTEIN 1, PROTEIN D40/AF15Q14 GPLGSSSENKIDFNDFIKRLKTGK 24 T 0.065 FtsH_ext pdbpssm F Eukaryota T 3sj9 2 B B FAGLRQAVTQ peptide FAGLRQAVTQ 10 T 7.8 TOH_N pdbhh F T 3sjk 2 B B KPVLRTATVQGPSLDF peptide KPVLRTA 7 T 0.92 zf_C2H2_13 pdbhh F T 3sks 1 A A A0A6H3ACK0_BACAN Putative Oligoendopeptidase F SNAMSFKDYEYKRPNIEELKEKFTVALEKFDNAKTVEEQKQVIHSINEIRNDFGTMGNLCYIRHSVDTTDAFYKEEQDFFDEFSPVVQGYGTKYYNALIHSPFREELEAYYGKQLFALAECDLKTYSDEVVKDLQLENKLSSQYTQLLASAKIDFAGEERTLSQLIPFMQGKERSERKAASEAYYGFLAENEEELDRIYDELVKVRTKIAKSLGFKNFVELGYARMYRTDYNAEMVANYRQQVLDYIVPVTTELRKRQQARIGVEKLAYYDENFEFPTGNPTPKGDADWIVNHGKTMYKELSAETDEFFNFMLDNDLLDLVAKKGKAGGGYCTYIENYKAPFIFSNFNGTSGDIDVLTHEAGHAFQVYESRKFEIPEYNWPTYEACEIHSMSMEFFTWPWMKLFFEEDADKYYFSHLSSALLFLPYGVSVDEYQHYVYENPEASPEERKTAWRNIEKKYLPHRDYEDNDYLERGGFWQRQGHIYSSPFYYIDYTLAQICALQFWKRARDNRQEAWEDYVNLCQQGGSKSFLELVEVANLTSPFAEGCVKSVITEIEAWLHAIDDTKL 567 T 0.00047 Peptidase_M3 unppercent F Bacteria T 3so6 2 B Q LDLR_HUMAN LDL RECEPTOR NSINFDNPVYQKTT 14 T 3.4 PARM unphh F Eukaryota T 3soq 2 B Z DKK1_HUMAN DICKKOPF-1, DKK-1, HDKK-1, SK XNSNAIKNX 9 T 35 GSH_synthase pdbhh F Eukaryota T 3sp6 2 B B PRGC2_HUMAN PGC-1-BETA, PPAR-GAMMA COACTIVATOR 1-BETA, PPARGC-1-BETA, PGC-1-RELATED ESTROGEN RECEPTOR ALPHA COACTIVATOR LSLLQKLLLAT 11 T 18 DUF3014 pdbhh F Eukaryota T 3spv 3 C C BZLF1_EBVB9 EB1, ZEBRA RAKFKQLL 8 T 0.0067 bZIP_2 unppssm T Viruses T 3sri 2 B B Q8IKV6_PLAF7 Rhoptry neck protein 2 KDIGAGPVASCFTTRMSPPQQICLNSVVN 29 T 15 Stealth_CR4 pdbhh F Eukaryota T 3srj 2 C,D,E,F C,D,E,F R1 peptide VFAEFLPLFSKFGSRMHILK 20 T 2.6 DUF3898 pdbhh F T 3sui 2 B B TRPV1_RAT TRPV1, CAPSAICIN RECEPTOR, OSM-9-LIKE TRP CHANNEL 1, OTRPC1, VANILLOID RECEPTOR 1, VANILLOID RECEPTOR TYPE 1-LIKE GPEGVKRTLSFSLRSGRVSGRNWKNFALVPLLRDAST 37 T 0.46 Cgr1 pdbhh F Eukaryota T 3svi 1 A A Type III effector HopAB2 LYTGAVPRANRIVQQLVEAGADLANIRTMFRNMLRGEEMILSRAEQNVFLQHFPDMLPCGIDRNSELAIALREALRRADSQQA 83 T 0.014 Peptidase_C58 pdbpssm F T 3svm 2 B P DNM3A_HUMAN DNMT3A, DNA METHYLTRANSFERASE HSAIIIA, DNA MTASE HSAIIIA, M.HSAIIIA YEPSTTARKVGRPGR 15 T 9.2 AT_hook pdbhh F Eukaryota T 3sw9 2 B,D P,Q DNM3A_MOUSE DNMT3A, DNA METHYLTRANSFERASE MMUIIIA, DNA MTASE MMUIIIA, M.MMUIIIA SATARKVGRPGR 12 T 6 AT_hook pdbhh F Eukaryota T 3t4g 1 A,B A,B Cyclic pseudo-peptide (ORN)AIIGLMV(ORN)KF(HAO)(4BF)K XAIIGLMVXKFXXK 14 T 0.18 Beta-APP pdbhh F T 3t4r 1 A A VP4A_LNYV3 PROTEIN P, PROTEIN 4A MARIRHEKEKLLADLDWEIGEIAQYTPLIVDFLVPDDILAMAADGLTPELKEKIQNEIIENHIALMALEEYSSLEHHHHHH 81 T 0.11 Pox_C4_C10 pdbpercent T Viruses T 3t7k 2 B,D C,D H2A1_YEAST Histone H2A.1 ATKASQEL 8 T 57 POX pdbhh F Eukaryota T 3t7z 1 A A Y694_METJA UNCHARACTERIZED NOP5 FAMILY PROTEIN MJ0694 MIYVTFTPYGAFGVKDNKEVSGLEDIEYKKLFNEEEIPDIMFKLKTQPNKIADELKEEWGDEIKLETLSTEPFNIGEFLRNNLFKVGKELGYFNNYDEFRKKMHYWSTELTKKVIKSYA 119 T 0.013 U3_assoc_6 unppssm F Archaea T 3tbh 2 B B E9BR69_LEIDB Serine acetyl transferase derived octapeptide LERDGSGI 8 T 2.1 DUF2551 pdbhh F Eukaryota T 3tdi 2 C,D C,D UBC12_YEAST RUB1-CONJUGATING ENZYME, RUB1-PROTEIN LIGASE, UBIQUITIN CARRIER PROTEIN 12 XMLKLRQLQKKKQKENENSSSIQPN 25 T 2.5E-10 UFC1 unphh F Eukaryota T 3tdu 3 E,F E,F UBC12_HUMAN NEDD8 CARRIER PROTEIN, NEDD8 PROTEIN LIGASE, UBIQUITIN-CONJUGATING ENZYME E2 M XMIKLFSLKQQKKEEE 16 T 3.3 YpmT pdbhh F Eukaryota T 3tdz 3 E,F E,F UBC12_HUMAN NEDD8-CONJUGATING ENZYME UBC12, NEDD8 CARRIER PROTEIN, NEDD8 PROTEIN LIGASE, UBIQUITIN-CONJUGATING ENZYME E2 M XMIKLXSLKXQKK 13 T 8 FAM53 pdbhh F Eukaryota T 3tei 2 B B KS6A1_HUMAN S6K-ALPHA-1, 90 KDA RIBOSOMAL PROTEIN S6 KINASE 1, P90-RSK 1, P90RSK1, P90S6K, MAP KINASE-ACTIVATED PROTEIN KINASE 1A, MAPK-ACTIVATED PROTEIN KINASE 1A, MAPKAP KINASE 1A, MAPKAPK-1A, RIBOSOMAL S6 KINASE 1, RSK-1 PQLKPIESSILAQRRVRKLPSTTL 24 T 20 Sbi-IV pdbhh F Eukaryota T 3tfk 2 B B p4B10 peptide QLSDVPMDL 9 T 15 KH_9 pdbhh F T 3tfy 2 B,D,F D,E,F HNRPF_HUMAN hnRNP F MLGPEGGRWG 10 F F Eukaryota T 3tg5 2 B B P53_HUMAN ANTIGEN NY-CO-13, PHOSPHOPROTEIN P53, TUMOR SUPPRESSOR P53 HSSHLKSKKGQ 11 T 51 TEX12 pdbhh F Eukaryota T 3thk 2 C,D C,D Proline-rich peptide PPPVPPYSAG 10 T 2.8 GvpL_GvpF pdbhh F T 3tiw 2 B,D C,D AMFR_HUMAN AUTOCRINE MOTILITY FACTOR RECEPTOR, ISOFORM 2, AMF RECEPTOR, ISOFORM 2, RING FINGER PROTEIN 45, GP78 VTLRRRMLAAAAERRLQKQ 19 T 0.072 SVIP pdbhh F Eukaryota T 3tix 2 B,D B,D CHP1_SCHPO Chromo domain-containing protein 1 MISESEDLSSASTLSDYFRFVLRVGKSLYYAGELSFDISKLKAETEHQQLLRSLVSCKQVDVLRFVTSQYLEVFGTCLTKVLSGSLCIRSDVDMTHFKNILNRGNGAGIVLGSNYTLLLFTEDNNALMNLYDCQGQSNSPFWMVIFEPLESILVEWSAKNLRPKKPYHKSQSYLSYLLQLGHIDLHKIGAFQATQILIVSKQPSPEAEELEDTFREAAIPTFRGLEIPESLFLSQNVFVFLNVSLEDDFDQLQFLTLAKRKSCKFFLFGLSLPLKSPNDSHVGTDFKKNNEPLDKLTYSQYLRPMFPKGGVVSVTLSALIKTPRLLELISPFLEIKKDSWILILPPSIVDMVKSYFVTNNPDKSLLEIQNLLNTLQRYLTNPALKNVTLYQDWDIVIDDSADVSLASTLQLYQKKNYDKYRRFVLIHELKNELTPVNGLDIVDYDEFKETFMRAIGLK 458 T 11 PsiB pdbhh F Eukaryota T 3tj5 2 B B B0BXR4_RICRO Antigenic heat-stable 120 kDa protein GSHMNLLNAATALSGSMQYLLNYVNAG 27 T 3.2 SipA_VBS pdbhh F Bacteria T 3tjh 2 B B p3A1 SPLDSLWWI 9 T 2.2 FWWh pdbhh F T 3tjv 2 B B PTSYAGDDSG PTSYAGDDSG 10 T 8.6 DUF5837 pdbhh F T 3tjy 1 A A HPAB3_PSEYM AVIRULENCE PROTEIN HOPPMAL TGAVPRANRIVQQLVEAGADLANIRTMFRNMLRGEEMILSRAEQNVFLQHFPDMLPCGIDRNSELAIALREALRRADSQQAARAPARTPPRSSV 94 T 0.012 Peptidase_C58 pdbpssm F Bacteria T 3tkn 2 B,E,H B,E,H NU159_YEAST NUCLEAR PORE PROTEIN NUP159 GPHSSITKDMKGFKVVEVGLAMNTKKQIGDFFKNLNMAK 39 T 8.3 DUF1413 pdbhh F Eukaryota T 3tkz 2 B,C P,Q PROTEIN (RVIpYFVPLNR peptide) RVIXFVPLNR 10 T 1.1 DUF6271 pdbhh F T 3tl0 2 B B RLNpYAQLWHR peptide RLNXAQLWHR 10 T 0.44 MOSP_N pdbhh F T 3to6 2 B B H4_YEAST K16COA BISUBSTRATE INHIBITOR GKGGAKRHRKIL 12 T 4.2 Shadoo unppercent F Eukaryota T 3tpu 4 D,H,L,P J,F,L,R p5E8 peptide FLSPFWFDI 9 T 0.26 T6SS_VasJ pdbhh F T 3tsz 2 B B JAM1_HUMAN JAM-A, JUNCTIONAL ADHESION MOLECULE 1, JAM-1, PLATELET F11 RECEPTOR, PLATELET ADHESION MOLECULE 1, PAM-1 EGEFKQTSSFLV 12 T 17 DUF4193 pdbhh F Eukaryota T 3twe 1 A,B A,B alpha4H GNADELYKELEDLQERLRKLRKKLRSG 27 T 0.0068 DUF5798 pdb F T 3twf 1 A,B A,B alpha4F3a GNADELYKEXEDLQERXRKLRKKXRSG 27 T 0.53 DUF5798 pdbhh F T 3twg 1 A,B A,B alpha4F3af3d GNADEXYKEXEDXQERXRKXRKKXRSG 27 T 5.9 PRP1_N pdbhh F T 3twr 2 E,F,G,H E,F,G,H 3BP2_HUMAN 3BP-2 LPHLQRSPPDGQSFRX 16 T 4.1 DUF3375 pdbhh F Eukaryota T 3tws 2 E,F,G,H E,F,G,H human TERF1 LPHLQRGCADGQSFRX 16 T 7.5 GSIII_N pdbhh F T 3twt 2 E,F,G,H E,F,G,H human MCL1 LPHLQRPPPIGQSFRX 16 T 3.1 DUF3375 pdbhh F T 3twu 2 B B MCL1_HUMAN BCL-2-LIKE PROTEIN 3, BCL2-L-3, BCL-2-RELATED PROTEIN EAT/MCL1, MCL1/EAT SRRVARPPPIGAEVPX 16 T 7.6 DUF4653 pdbhh F Eukaryota T 3twv 2 E,F,G,H E,F,G,H human NUMA1 LPHLQRTQPDGQSFRX 16 T 4.8 DUF3375 pdbhh F T 3tww 2 C,D C,D human LNPEP LPHLQRQSPDGQSFRX 16 T 3.1 DUF3375 pdbhh F T 3twx 2 C,D C,D human FNBP1 LPHLQRESPDGQSFRX 16 T 3.1 DUF3375 pdbhh F T 3tzd 2 B T H14_HUMAN HISTONE H1B YPVKKKARKSAGAAKRKAS 19 T 0.2 DUF5797 unp F Eukaryota T 3tzg 1 A,B A,B A6L2L1_BACV8 hypothetical protein BVU_2266 GKEKKADTYVTKVTDLTGEEEQVLKLEYDRDGKIIKYGDTPVRYEGDQITIGQMNCLNTGNKLCNVTFQIGKGKARESRARCMLKVGEEVYEADKQTVYDYKGDTIFINSDYRATSDYRFLKKVQGKYVFDQLGRLKEVMTVFTEANDSVSSCHTYYNYDNNINYQANLNLQAYVIDYDGVDSFFYFLLNLGQLRNRTALPNDIGYCMNHGLSTYNVHANYRLDDENPVRIEVLYNYTKLLSRIDLSYNPLN 252 T 0.019 UPF0257 unphh F Bacteria T 3tzw 2 B D CO-CRYSTALLIZED PEPTIDE SDKENFWGMAVA 12 T 1.1 NPFF pdbhh F T 3u23 2 B B RIN3_HUMAN RAS INTERACTION/INTERFERENCE PROTEIN 3 TAKQPPVPPPRKKRISX 17 T 0.0029 HCV_NS5a_C pdbhh F Eukaryota T 3u3f 2 E,F,G,H,I,J E,F,G,H,I,J PAXI_HUMAN Paxillin LD2 peptide SATRELDELMASLSDFK 17 T 0.99 SAM_LFY pdbhh F Eukaryota T 3u7d 2 B,D B,D HEG1_HUMAN Protein HEG homolog 1 SRHSCIFPGQYNPSFISDESRRRDYF 26 T 3.4 LAX unphh F Eukaryota T 3u85 2 B B KMT2A_HUMAN LYSINE N-METHYLTRANSFERASE 2A,ALL-1,CXXC-TYPE ZINC FINGER PROTEIN 7,MYELOID/LYMPHOID OR MIXED-LINEAGE LEUKEMIA,MYELOID/LYMPHOID OR MIXED-LINEAGE LEUKEMIA PROTEIN 1,TRITHORAX-LIKE PROTEIN,ZINC FINGER PROTEIN HRX SRWRFPARPGTTGGGGGGGRR 21 T 10 DUF5877 pdbhh F Eukaryota T 3u88 2 C,E M,N KMT2A_HUMAN LYSINE N-METHYLTRANSFERASE 2A,ALL-1,CXXC-TYPE ZINC FINGER PROTEIN 7,MYELOID/LYMPHOID OR MIXED-LINEAGE LEUKEMIA,MYELOID/LYMPHOID OR MIXED-LINEAGE LEUKEMIA PROTEIN 1,TRITHORAX-LIKE PROTEIN,ZINC FINGER PROTEIN HRX SRWRFPARPGTGRRGLGGAPRQRVPALLRVGPGFDAALQVSAAIGTNLRRFRAVFGESGGGGGSGEDEQFLGFGS 75 T 10 DUF3467 pdbhh F Eukaryota T 3ua0 1 A,B A,B FIBH_BOMMO FIB-H, H-FIBROIN MGHHHHHHMRVKTFVILCCALQYVAYTNANINDFDEDYFGSDVTVQSSNTTDEIIRDASGAVIEEQITTKKMQRKNKNHGILGKNEKMIKTFVITTDSDGNESIVEEDVLMKTLSDGTVAQSYVAADAGAYSQS 134 T 0.053 DUF809 pdb F Eukaryota T 3ueo 2 E,F E,F MDC1_HUMAN phospho-peptide GFIDSDTDVEEE 12 T 1.4 LAGLIDADG_1 pdbhh F Eukaryota T 3ui2 2 B B SR54C_ARATH 54 CHLOROPLAST PROTEIN, 54CP, SRP54, CPSRP54, FFC QKAPPGTARRKRK 13 T 6.1 DUF6490 pdbhh F Eukaryota T 3ukw 2 B C Bimax1 peptide GSRRRRPRKRPLEWDEDEEPPRKRKRLW 28 T 1.5 ROKNT pdbhh F T 3ukx 2 B C Bimax2 peptide GSRRRRRRKRKREWDDDDDPPKKRRRLD 28 T 0.74 Med24_N pdbhh F T 3uky 2 B C NCBP1_YEAST CAP-BINDING PROTEIN 80, CBP80 GSMFNRKRRGDFDEDENYRDFRPRMPKRQRIP 32 T 61 DUF2970 pdbhh F Eukaryota T 3ukz 2 B C NCBP1_MOUSE CAP-BINDING PROTEIN 80, CBP80 GSMSRRRHSYENDGGQPHKRRKTSD 25 T 15 DUF4500 pdbhh F Eukaryota T 3ul0 2 B C NCBP1_MOUSE CAP-BINDING PROTEIN 80, CBP80 GSMSRRRHSDENDGGQPHKRRKTSD 25 T 23 DUF2205 pdbhh F Eukaryota T 3ul1 2 B A NUPL_XENLA Nucleoplasmin GSAVKRPAATKKAGQAKKKKLD 22 T 0.0016 BSP_II unppercent F Eukaryota T 3ulr 3 C C ABL2_HUMAN ABELSON MURINE LEUKEMIA VIRAL ONCOGENE HOMOLOG 2, ABELSON-RELATED GENE PROTEIN, TYROSINE-PROTEIN KINASE ARG SSVVPYLPRLPILPSKT 17 T 0.31 PLU-1 unppercent F Eukaryota T 3ult 1 A,B A,B B5T007_LOLPR Ice recrystallization inhibition protein-like protein MDEQPNTISGSNNTVRSGSKNVLAGNDNTVISGDNNSVSGSNNTVVSGNDNTVTGSNHVVSGTNHIVTDNNNNVSGNDNNVSGSFHTVSGGHNTVSGSNNTVSGSNHVVSGSNKVVTDAAKLAAALEHHHHHH 133 T 0.077 NSP2-B_epitope pdb F Eukaryota T 3um0 2 B B CHMP5_HUMAN CHROMATIN-MODIFYING PROTEIN 5, SNF7 DOMAIN-CONTAINING PROTEIN 2, VACUOLAR PROTEIN SORTING-ASSOCIATED PROTEIN 60, VPS60, HVPS60 TKNKDGVLVDEFGLPQIPAS 20 T 1.3 Castor1_N pdbhh F Eukaryota T 3unn 2 B B MDC1_HUMAN phospho-T4 peptide from Mediator of DNA damage checkpoint protein 1 MEDTQAID 8 T 32 HD_assoc pdbhh F Eukaryota T 3uot 2 C,D D,E MDC1_HUMAN NUCLEAR FACTOR WITH BRCT DOMAINS 1 MEDTQMIDWD 10 T 1.7 DUF4502 pdbhh F Eukaryota T 3up6 1 A,B A,B A7M1U4_BACO1 hypothetical protein BACOVA_04078 GDVAYELPAHTTRAQLSIDLVNNGDVEQQEKINSMRFIVFGSTPGGVRLDVNEHILLSTPETATDIDAQLLEVTSSNDILVVVIANEPQSLTSQLDGIANLLTLQEMIYDISSILNSDGQIISATGMPMTGVIRDISIAPDETKTVQMVIERAVARVDVFIEAIDGGAVTGYTAGSTSVTLHNFSHDSYFVMGNVGNGTRDNADSSKNYGKVKEDVSESNLLTHSWTAATTETWAYSSAPGAENRKLLCSFYTAERLFKSDYSDRLSISMANVLKGPSDVTGITGKVIESVTKVDGTGSPTAQPFTEIRRNNVYQVTARVGKIGIQILTISVEDWGERQDIDLDMDL 347 T 0.00017 P_gingi_FimA unp F Bacteria T 3upr 2 B,D P,Q pep-V HSITYLLPV 9 T 4 PRCC pdbhh F T 3uri 2 B B DB5 peptide HPHLSXAH 8 T 15 DUF1045 pdbhh F T 3url 2 B B DB6 peptide HSLFHXTP 8 T 1.5 Integrase_Zn pdbhh F T 3utq 3 C C INS_HUMAN Insulin ALWGPDPAAA 10 T 2.8 Lipid_DES pdbhh F Eukaryota T 3uvk 2 B B KMT2D_HUMAN ALL1-RELATED PROTEIN, LYSINE N-METHYLTRANSFERASE 2B, KMT2B, MYELOID/LYMPHOID OR MIXED-LINEAGE LEUKEMIA PROTEIN 2 GCARSEPKILT 11 T 0.12 N-SET pdbhh F Eukaryota T 3uvl 2 B B KMT2C_HUMAN HOMOLOGOUS TO ALR PROTEIN, LYSINE N-METHYLTRANSFERASE 2C, KMT2C, MYELOID/LYMPHOID OR MIXED-LINEAGE LEUKEMIA PROTEIN 3 GSARAEPKMSA 11 T 17 N-SET pdbhh F Eukaryota T 3uvm 2 B B KMT2B_HUMAN LYSINE N-METHYLTRANSFERASE 2D, KMT2D, MYELOID/LYMPHOID OR MIXED-LINEAGE LEUKEMIA PROTEIN 4, TRITHORAX HOMOLOG 2, WW DOMAIN-BINDING PROTEIN 7, WBP-7 GAARAEVYLR 10 T 11 Trp_DMAT pdbhh F Eukaryota T 3uvu 1 A B FEN1_HUMAN FEN-1, DNASE IV, FLAP STRUCTURE-SPECIFIC ENDONUCLEASE 1, MATURATION FACTOR 1, MF1, HFEN-1 SAKRKEPEPKGSTKKKAKT 19 T 140 DUF4647 pdbhh F Eukaryota T 3uvw 2 B B H4_HUMAN PEPTIDE (H4K5ACK8AC) SGRGXGGXGLGY 12 T 7.6 HTH_Tnp_Mu_1 pdbhh F Eukaryota T 3uvx 2 B B H4_HUMAN PEPTIDE (H4K12ACK16AC) GXGGAXRHRKV 11 T 11 Shadoo unppercent F Eukaryota T 3uvy 2 B B H4_HUMAN PEPTIDE (H4K16ACK20AC) AXRHRXVLRDN 11 T 0.27 UPF0137 unp F Eukaryota T 3uw9 2 E,F E,F H4_HUMAN PEPTIDE (H4K8ACK12AC) GXGLGXGGAKR 11 T 11 Shadoo unppercent F Eukaryota T 3uxg 2 B B HDAC4_HUMAN HD4 LPLYTSPSLPNITLGLP 17 T 16 RepA1_leader pdbhh F Eukaryota T 3v2o 2 B B LRP2_RAT LRP-2, GLYCOPROTEIN 330, GP330, MEGALIN HYRKTGSLLPTLPKLPSLS 19 T 0.21 Amnionless unppercent F Eukaryota T 3v30 2 B B RFX5_HUMAN REGULATORY FACTOR X 5 KTLVSMPPLPGLDLKGS 17 T 10 XRN_M pdbhh F Eukaryota T 3v31 2 B B HDAC4_HUMAN HD4 XLPLYTSPSLPNITLGLP 18 T 21 RepA1_leader pdbhh F Eukaryota T 3v3b 2 C,D C,D SAH-p53-8 stapled-peptide QSQQTFXNLWRLLXQN 16 T 0.0013 P53_TAD pdbhh F T 3v62 3 C,F C,F SRS2_YEAST ATP-dependent DNA helicase SRS2 SHNPDDTTVDNRPIISNAKFLADAAMKKTQKFSKKVKNEPASSQMDIFSQLSRAKKKSKLNNGEIIVID 69 T 0.013 AD pdbpercent F Eukaryota T 3v79 6 F R NOTC1_HUMAN RAM KRRRQHGQLWFPEGFKVSE 19 T 0.48 DUF4381 unppssm F Eukaryota T 3v7d 3 E E SIC1_YEAST CDK INHIBITOR P40 MTSPFNGLTSPQRSPFPKS 19 T 1.8 RbcS pdbhh F Eukaryota T 3va4 2 B C CHK2_HUMAN CHK2 CHECKPOINT HOMOLOG, CDS1 HOMOLOG, HUCDS1, HCDS1, CHECKPOINT KINASE 2 LETVSTQELYS 11 T 0.39 RAP1 unppercent F Eukaryota T 3vb6 2 C,D E,F C6Z inhibitor XTSAVLX 7 T 470 DUF5550 pdbhh F T 3ve6 2 B B POLS_EEVVT Venezuelan equine encephalitis virus capsid protein NLS EGPSAKKPKKEA 12 T 2.8 AKAP2_C unp T Viruses T 3vfr 3 C C BZLF1_EBVB9 LPEP peptide from EBV, P4A, LPEALPQGQLTAY LPEALPQGQLTAY 13 T 26 AP-5_subunit_s1 pdbhh T Viruses T 3vfs 3 C C LPEP peptide from EBV, P5A, LPEPAPQGQLTAY LPEPAPQGQLTAY 13 T 17 Casc1_N pdbhh F T 3vft 3 C C LPEP peptide from EBV, P6A, LPEPLAQGQLTAY LPEPLAQGQLTAY 13 T 13 DUF99 pdbhh F T 3vfu 3 C C LPEP peptide from EBV, P7A, LPEPLPAGQLTAY LPEPLPAGQLTAY 13 T 19 DUF2808 pdbhh F T 3vfv 3 C C LPEP peptide from EBV, P9A, LPEPLPQGALTAY LPEPLPQGALTAY 13 T 23 GSAP-16 pdbhh F T 3vfw 3 C C LPEP peptide from EBV, P10A, LPEPLPQGQATAY LPEPLPQGQATAY 13 T 29 GSAP-16 pdbhh F T 3vg8 1 A,B,C,D,E,F,G,H,I,J G,H,I,J,A,B,C,D,E,F Q53VW9_THET8 Hypothetical Protein TTHB210 MNVSEALKGALPNFIPGLGTLYVDPSTLPEGPFLAYDRAGNLVKVVFMVPLKKLNESHKYVDIGTKTLRALGITRIDHVNMIPSGPHPGVSEPHYHIELVLVSVDQERKVLEGEPY 116 T 0.0015 DUF5602 pdbpssm F Bacteria T 3viv 2 C C PSTOM_PYRHO UNCHARACTERIZED PROTEIN PH1511 NVIVLMLPME 10 T 2.9 Polysacc_deac_2 pdbhh F Archaea T 3vj6 3 C P HA1L_MOUSE Qdm peptide AMAPRTLLL 9 T 0.014 UL40 pdbhh F Eukaryota T 3vpj 2 C,D,G,H E,F,G,H Q9I2Q0_PSEAE Tse1-specific immunity protein MGSSHHHHHHSSGLVPRGSHMKLLAGSFAALFLSLSAQAADCTFTQLEIVPQFGSPNMFGGEDEHVRVMFSNEDPNDDNPDAFPEPPVYLADRDSGNDCRIEDGGIWSRGGVFLSQDGRRVLMHEFSGSSAELVSYDSATCKVVHREDISGQRWAVDKDGLRLGQKCSGESVDSCAKIVKRSLAPFCQTAKK 192 T 1.5 Me-amine-dh_H unphh F Bacteria T 3vqg 2 B B IGSF5_MOUSE JAM-4 YKVRNVTLV 9 T 0.79 Chisel unppercent F Eukaryota T 3vrp 2 B B EGFR_HUMAN PROTO-ONCOGENE C-ERBB-1, RECEPTOR TYROSINE-PROTEIN KINASE ERBB-1 EDSFLQRXSSDPT 13 T 1.4 DUF4348 pdbhh F Eukaryota T 3vu5 2 B B SC22 XWEEWDKKIEEYTKKIEELIKKS 23 T 0.052 GP41 pdbhh F T 3vvi 1 A,B,C,D,E,F,G,H A,B,C,D,E,F,G,H I1RJZ4_GIBZE TRANSIENT RECEPTOR POTENTIAL CHANNEL VRKLRAEMEELKSMLSQLGKT 21 T 0.035 Vps51 pdb F Eukaryota T 3vvr 2 B B MAD5 XXVYSAVCAAAA 12 T 12 DUF711 pdbhh F T 3vvs 2 B B MAD3S XXVYSAVCLYV 11 T 6.1 TraL_transposon pdbhh F T 3w11 6 F F INSR_HUMAN INSULIN RECEPTOR SUBUNIT ALPHA TFEDYLHNVVFVPRPS 16 T 0.00017 DUF4998 unphh F Eukaryota T 3w13 6 F F INSR_HUMAN INSULIN RECEPTOR SUBUNIT ALPHA EESSFRKTFEDYLHNVVFVPRPS 23 T 0.00017 DUF4998 unphh F Eukaryota T 3w15 2 B B PEX21_YEAST PEROXIN-21 GSKWFDQDQSELQRIATDIVKCCTPPPSSASSSSTLSSSVESKLSESKFIQLMRNISSGDVTLKKNADGNSASELFSSNNGELVGNRHIFVKDEIHKDILD 101 T 0.61 DUF3446 pdbpercent F Eukaryota T 3w1b 2 B B DCR1C_HUMAN DNA CROSS-LINK REPAIR 1C PROTEIN, PROTEIN A-SCID, SNM1 HOMOLOG C, HSNM1C, SNM1-LIKE PROTEIN DVPQWEVFFKR 11 T 1.6 DUF4570 pdbhh F Eukaryota T 3w30 1 A,B A,B Q8VSD5_SHIFL ORF169b GPLGSPEFMINGVSLQGTAGYEAHTEEGNVNVKKLLESLNSKSLGDMDKDSELAATLQKMINPSGGDGNASGCALHACMAMLGYGVREAPVPNEISEYMTGFFHRHLEQIDSEGIVSHPNETYSKFRERIAENILQNTSKGSVVMISIEQATHWIAGFNDGEKIMFLDVQTGKGFNLYDPVEKSPDAFVDENSSVQVIHVSDQEFDHYANSSSWKSKRLC 220 T 0.12 Gln_amidase pdbpercent F Bacteria T 3w3w 2 B B STE12_YEAST Protein STE12 PRRRTVGMKSSQGNVPTGNKQSVGKSAKISKPLHIKTSAYQKQYKINLETKARPSAGDEDSAHPDKNKE 69 T 480 LAG1-DNAbind pdbhh F Eukaryota T 3w3x 2 B B PHO4_YEAST Phosphate system positive regulatory protein PHO4 SANKVTKNKSNSSPYLNKRRGKPGPDS 27 T 0.72 TonB_N unp F Eukaryota T 3w3y 2 B B NUP53_YEAST NUCLEAR PORE PROTEIN NUP53 RNAEFKVSKNSTSFKNPRRLEIKDGRSLFLRNRGKIHSGVLSSIESDL 48 T 3.4 DUF3994 pdbhh F Eukaryota T 3w6k 1 A,D A,D A0A0E0TE00_GEOS2 ScpA ERALLFTKPPSDLSAYAD 18 T 7.5 Reo_sigmaC pdbhh F Bacteria T 3wa0 2 G,H G,H DCAF1_HUMAN DDB1- AND CUL4-ASSOCIATED FACTOR 1, HIV-1 VPR-BINDING PROTEIN, VPRBP, VPR-INTERACTING PROTEIN GPLGSYDDDTDDLDELDTDQLLEAELEEDDNNENAGEDGDNDFSPSDEELANLLEEGEDGEDEDSDADEEVELILGDTDSSDNSDLEDDIILSLNE 96 T 5 Cwf_Cwc_15 unphh F Eukaryota T 3wa4 2 B B CD28_HUMAN TP44 SDXMNMTP 8 T 0.089 DUF2207 unppercent F Eukaryota T 3wa5 1 A A Q9HYC5_PSEAE Type VI secretion exported 3 MTATSDLIESLISYSWDDWQVTRQEARRVIAAIRNDNVPDATIAALDKSGSLIKLFQRVGPPELARSLIASIAGRTTMQRYQARNALIRSLINNPLGTQTDNWIYFPTITFFDICADLADAAGRLGFAAAGATGVASQAIQGPFSGVGATGVNPTDLPSIAFGDQLKLLNKDPATVTKYSNPLGDLGAYLSQLSPQDKLNQAQTLVGQPISTLFPDAYPGNPPSRAKVMSAAARKYDLTPQLIGAIILAEQRDQTRDEDAKDYQAAVSIKSANTSIGLGQVVVSTAIKYELFTDLLGQPVRRGLSRKAVATLLASDEFNIFATARYIRYVANLASQQDLRKLPKTRGAFPSIDLRAYAGNPRNWPRDNVRALASEYTSRPWDDNLSPGWPMFVDDAYATFLDPGMRFPLEHHHHHH 416 T 0.0021 DUF1402 unphh F Bacteria T 3wa5 2 B B Q9HYC4_PSEAE Tse3-specific immunity protein MKTVALILASLALLACTAESGVDFDKTLTHPNGLVVERPVGFDARRSAEGFRFDEGGKLRNPRQLEVQRQDAPPPPDLASRRLGDGEARYKVEEDDGGSAGSEYRLWAAKPAGARWIVVSASEQSEDGEPTFALAWALLERARLQLEHHHHHH 153 T 0.0052 PsbP_2 unphh F Bacteria T 3wbn 2 B B MaL6 AFTFRYSPSLYTWFLFPCG 19 T 5.3 HXXEE pdbhh F T 3wim 2 B B WDFY3_HUMAN AUTOPHAGY-LINKED FYVE PROTEIN, ALFY DEKDGFIFVNYSEG 14 T 0.42 ATG8 pdbhh F Eukaryota T 3wit 1 A A Q8XBY5_ECO57 UNCHARACTERIZED PROTEIN GTIAGSVHVDAVNNGGEGNGIQAYTAIKEIMLAVEESKIALTPDGIQLQVGESTVIRLSKDGITIVGGSVFINGLEHHHHHH 82 T 0.052 DUF2345 pdbpssm F Bacteria T 3wkn 2 C,D,G,H,K,L,O,P E,F,G,H,K,L,O,P AF.P17 GPGISAFSPGRGVYDPETGTWYDAAWHLGELVWATYYDPETGTWEPDWQRMLGQ 54 T 0.00013 OCRE pdb F T 3wmg 2 B B anti-CmABCB1 peptide XXLDQIVWFNAPGDLHLCG 19 T 2.5 Endothelin pdbhh F T 3wn7 2 B,D B,M NF2L2_MOUSE NF-E2-RELATED FACTOR 2, NFE2-RELATED FACTOR 2, NUCLEAR FACTOR, ERYTHROID DERIVED 2, LIKE 2 MDLIDILWRQDIDLGVSREVFDFSQRQKDYELEKQ 35 T 0.055 Radial_spoke unppercent F Eukaryota T 3wne 2 C,D C,D LEDGF peptide PKIDNG 6 T 11 Mak10 pdbhh F T 3wnf 2 C,D C,D CKIDNC peptide XCKIDNCX 8 T 0.55 Hormone_4 pdbhh F T 3wng 2 C,D C,D PKIDN(DPR) peptide PKIDNX 6 T 4.6 OrfA pdbhh F T 3wod 6 G,H G,H A7XX65_9CAUD GP39 MVEGFVEPYIRLFEAIPDAETELATFYDADLDTLPPRMFLPSGDLYTPPGPVRLEEIKRKRRVRLVKVSIYRFEHVGLGLAARPYAYAYAWQGDNGILHLYHAPVVLEDVPEVLELDEVTYNESYVRLMRAMGHVDAFIDL 141 T 2.5 Abp2 pdbhh T Viruses T 3woe 2 B,D B,D A7XX65_9CAUD GP39 GPVEPYIRLFEAIPDAETELATFYDADLDTLPPRMFLPSGDLYTPPGPVRLEEIKRKRRVRLVKVSIYRFEHVGLGLAARPYAYAYAWQGDNGILHLYHAPVVLED 106 T 14 CSRNP_N pdbhh T Viruses T 3wof 2 B,D,F,H,J,L,N,P,R,T,V,X B,D,F,H,J,L,N,P,R,T,V,X A7XX65_9CAUD GP39 GPVEPYIRLFEAIPDAETELATFYDADLDTLPPRMFLPSGDLYTPPGPVRLEEIKRKRRVRLVKVSIYRFEHVGLGLAARPYAYAYAWQGDNGILHLYHAPVVLEDVPEVLELDEVTYNESYVRLMRAM 129 T 2.5 Abp2 unphh T Viruses T 3woo 2 C,D C,D ANGT_HUMAN Angiotensin II VYIHPF 6 T 0.64 Adeno_PVIII pdbhh F Eukaryota T 3wp0 2 B B L2GL2_HUMAN HGL LSRVKSLKKSLRQSF 15 T 18 DUF3511 pdbhh F Eukaryota T 3wp1 2 B A L2GL2_HUMAN HGL LKKSLRQSFRRMRRSRV 17 T 27 Neuropeptide_S pdbhh F Eukaryota T 3ws6 3 E,F E,F Mimotope 9-mer peptide YAIENYLEL 9 T 2.6 DUF4744 pdbhh F T 3wsy 2 B C SORL_HUMAN peptide from Sortilin-related receptor LPQDRGFLVVQGDPR 15 T 0.46 Inhibitor_I69 pdbhh F Eukaryota T 3wut 2 C,F,I,L C,F,I,L TEX14_HUMAN PROTEIN KINASE-LIKE PROTEIN SGK307, SUGEN KINASE 307, TESTIS-EXPRESSED SEQUENCE 14, TESTIS-EXPRESSED SEQUENCE 14 PROTEIN DLAVGPPSLNYIPP 14 T 1.6 Topo_C_assoc pdbhh F Eukaryota T 3wuu 2 C,F,I,L C,F,I,L TEX14_HUMAN TEX-14 DLAVGPPSLNYPGY 14 T 13 CitX pdbhh F Eukaryota T 3wuv 2 C,F,I,L,O,R C,F,I,L,O,R PDC6I_HUMAN ALG-2-INTERACTING PROTEIN X, ALIX DQAQGPPYPTYIPP 14 T 1.5 N1221 pdbhh F Eukaryota T 3ww1 1 A,B A,B L0N3Y0_9CELL L-ribose isomerase HHHHHHGSTRTAISRREYDEWLSEAASLARALRYPVTPEMVNDSAGIVFGDDQYEAFAHGLWSREPYEVMVILESLNEPAVDGLPAAGAAHAEYSGLCDKLMIVHPGKFCPPHFHQRKTESYEVVLGEMEVFYAPEPVTVGDDDVLSFSPMPEGSPWPEGVALPAGREDSYAGLTSYVRLRAGDPKFVMHRKHLHAFRCPADSPVPLVVREVSTYSHEPTEHAHDKAAPLPQWRGLHDNTFVAEAANSGRLATAIA 256 T 0.00016 Cupin_2 unp F Bacteria T 3wx4 1 A A ARN_BPT4 ANTI-RGL NUCLEASE MIIDSQSVVQYTFKIDILEKLYKFLPNLYHSIVNELVEELHLENNDFLIGTYKDLSKAGYFYVIPAPGKNIDDVLKTIMIYVHDYEIEDYFELEHHHHHH 100 T 2.3 DUF3198 unphh T Viruses T 3wxa 2 C,D C,D SC31A_HUMAN ABP125, ABP130, SEC31-LIKE PROTEIN 1, SEC31-RELATED PROTEIN A, WEB1-LIKE PROTEIN NPPPPGFIMHGN 12 T 1.6 DUF2173 pdbhh F Eukaryota T 3wyd 1 A,B A,B A0A0A6YVN5_9ZZZZ LC-Est1C MGSSHHHHHHSSGLVPRGSHMPYRLYVPTTYDGTKAFPLVIALHGMGGDENSYFDSYQRGAFMIEAENRGYIVACPKGRQPASMYVGPAERDVMDVIAEVRRDYKIDPDRIYMTGHSMGGYGTWSIAMNHPDVFAALAPVAGGGNPLGMANIAHIPQLVVHGDNDKTVPVERSRVMVEAAKKHGTEIKYIEIPGGDHVSVAARTFKDVFDWFDSHKRKRPAAKAATNK 228 T 2E-08 Peptidase_S9 unp F unclassified sequences T 3x0t 1 A,B A,B PIRA MSNNIKHETDYSHDWTVEPNGGVTEVDSKHTPIIPEVGRSVDIENTGRGELTIQYQWGAPFMAGGWKVAKSHVVQRDETYHLQRPDNAFYHQRIVVINNGASRGFCTIYYHLEHHHHHH 119 T 3.4 DUF916 pdbhh F T 3zbe 1 A A Q8XAD5_ECO57 PAAA2 MDYKDDDDKNRALSPMVSEFETIEQENSYNEWLRAKVATSLADPRPAIPHDEVERRMAERFAKMRKERSKQ 71 T 0.059 HAGH_C pdb F Bacteria T 3zbi 3 AA,C,DA,F,GA,I,JA,L,MA,O,PA,R,U,X a,C,d,F,g,I,j,L,m,O,p,R,U,X Q46702_ECOLX TRAN PROTEIN MRSLLLMGVLLISACSSGHKPPPEPDWSNTVPVNKTIPVDTQGGRNES 48 T 0.0023 LPAM_1 pdbhh F Bacteria T 3zd0 1 A A Q9WLK8_9HEPC P7 PROTEIN GPLGSPEFAAMDYKDDDDKALENLVVLNAASVAGAHGILSFLVFFCAAWYIKGRLAPGAAYAFYGVWPLLLLLLALPPRAYAAAAS 86 T 120 GBV-C_env pdbhh T Viruses T 3zdl 2 B B AB1IP_HUMAN APBB1-INTERACTING PROTEIN 1, PROLINE-RICH EVH1 LIGAND 1, PREL-1, PROLINE-RICH PROTEIN 73, RAP1-GTP-INTERACTING ADAPTER MOLECULE, RIAM, RETINOIC ACID-RESPONSIVE PROLINE-RICH PROTEIN 1, RARP-1 MGESSEDIDQMFSTLLGEMDLLTQSLGVDTLY 32 T 0.86 Drf_DAD pdbhh F Eukaryota T 3zfw 2 C,D X,Y PKHM2_HUMAN PH DOMAIN-CONTAINING FAMILY M MEMBER 2, SALMONELLA-INDUCED FILAMENTS A AND KINESIN-INTERACTING PROTEIN, SIFA AND KINESIN-INTERACTING PROTEIN MGSSHHHHHHSSGLVPRGSHMTNLEWDDSAITGSTGSTGSTGSHM 45 T 0.71 CLSTN_C pdbhh F Eukaryota T 3zgh 1 A A A0A0H2URK1_STRPN PNEUMOCOCCAL SERINE RICH REPEAT PROTEIN, SRRP HHHHHHSGNTIVNGAPAINASLNIAKSETKVYTGEGVDSVYRVPIYYKLKVTNDGSKLTFTYTVTYVNPKTNDLGNISSMRPGYSIYNSGTSTQTMLTLGSDLGKPSGVKNYITDKNGRQVLSYNTSTMTTQGSGYTWGNGAQMNGFFAKKGYGLTSSWTVPITGTDTSFTFTPYAARTDRIGINYFNGGGKVVESSTTSQSLSQ 205 T 0.26 FlgD_ig pdbpercent F Bacteria T 3zhe 1 A,C A,C G5ECF1_CAEEL PROTEIN SMG-5 MQKSDEVTEKFKRYCNQLEKYGQTENVHSPVMAMLRRKGRKQLIEIMKRDGDCTSSINKLWIVGYYHPFQFFIRDKEKNMAIAVLLTMFCGELQEMLSLPDDKYPALWNMYIGDFHRYMPDEEIQKCLAVGYYSRAIDLDPNQGRAFHVLAGLRADLNVAQKLRLMILGQLADAPYKKGTELLEYLKFPQKESTDKLMVDFVIWALNEKSKRMDYQMTGIKIVNEFKAEIEQKLEFDWSLIMSTCRLASKLAMKKFGFQQFYNCFDTISTLYITIYSRTISSKCLLAEAISWISDSAEILGHLDEQKNEPHFQKLSVFAKTKWNELNDLVMNHINSVFTSMSLTINPSISMTSFLLNGPISEPNVEFLSQLINYLVSVEFPPMEIIHDREESGPLLRRINQSEQKRLDIQIKTQNDEVNR 420 T 2.8E-05 EST1_DNA_bind unphh F Eukaryota T 3zin 2 B,C B,C DDX21_MOUSE DEAD BOX PROTEIN 21, GU-ALPHA, NUCLEOLAR RNA HELICASE GU, NUCLEOLAR RNA HELICASE II, RH II/GU SRGQKRSFSKAFGQ 14 T 2.7 DUF1413 pdbhh F Eukaryota T 3zio 2 B,C B,C A28NLS IGRKRGYSVAFG 12 T 2.2 TrbH pdbhh F T 3zip 2 B,C B,C A58NLS WAGRKRTWRDAF 12 T 3.2 DUF5419 pdbhh F T 3ziq 2 B,C B,C B6NLS SSHRKRKFSDAF 12 T 2.7 Mating_C pdbhh F T 3zir 2 B,C B,C B141NLS RQRKRKWSEAF 11 T 0.25 DUF3020 pdbhh F T 3zke 2 B,D,F,H,J,L B,D,F,H,J,L NEK9_HUMAN NEK9 VGMHSKGTQTA 11 T 0.21 KASH_CCD unppssm F Eukaryota T 3zkt 1 A A CT5A_CONCN TAU-CNVA ECCHRQLLCCLRFVX 15 T 1.4 DUF488 unphh F Eukaryota T 3zld 2 B B RON22_TOXGM RHOPTRY NECK PROTEIN 2 GSASDIAQFLTDSGMKAIEDCSWNPIMQQMACVVVAGSGS 40 T 0.064 DUF4040 unp F Eukaryota T 3zlj 2 C,D C,D MUTS_ECOLI DNA MISMATCH REPAIR PROTEIN MUTS PNAAATQVDGTQMSLLSVPEETSPAVEALENLDPRSLTPRQALEWIYRLKSLV 53 T 2.3 DUF5830 pdbhh F Bacteria T 3zmn 1 A,B A,B C8CHL5_9VIRU VP17 MGVFDRIRGALGRGLDVFRGDLPQVQPPAPQPAPAPAITPAAVQVGGWGFAWIDNEDFSPTGLAWRSGEYFALAQMKTPETAHFRIAAQERRLRIYLRGQKVVNGRNLSDPDSRTVNLPFLMQTPQGAPTLPSTYHPDVAVWAKVGSTWQPCVITAINYSTGDVTFTEPAGVTASDGIEIYYVHGDGQFRLRVARDAGGVDDSAATVFNQSFSTMHSVDQNNVETMIAWPQQVELVPGTRLVLEVFTTQVPMVWNERSGHYIQIAAMGRRIEVLDKGGLQRLAELEARGGL 291 T 0.063 DEC-1_N pdbpercent T Viruses T 3zmo 1 A A C8CHL4_9VIRU VP16 MQEAFNRIKALRPGARPATILRSGPEFSVYSGTQRVKVGEFVVPAGASWVLPNPVPVILKLYDTGGNQLPHTTDVFLAKRTKGFDFPEFLAKVQYASYYDLTEAQLRDAKFYQNILQTLSPLRAPQPPQGVVLREGDVLEVYVEAPAGVTVNLNDPRTRIELPIGVDNSNPTL 173 T 2.6 TraI_2B pdbhh T Viruses T 3zmq 2 B C SRC_HUMAN PROTO-ONCOGENE C-SRC, PP60C-SRC, P60-SRC EAQXQPGENL 10 T 4.5 Leader_Erm pdbhh F Eukaryota T 3zmt 3 C C PEPTIDE PRSFLV 6 T 14 IBV_3A pdbhh F T 3zmu 3 C C PKSFLV PEPTIDE PKSFLV 6 T 14 IBV_3A pdbhh F T 3zmz 3 C C PEPTIDE PRSFAV 6 T 34 tRNA-synt_1c_C pdbhh F T 3zn8 5 E S DAP2_YEAST DPAP B, YSCV GIILVLLIWGTVLL 14 T 2 DUF4808 pdbhh F Eukaryota T 3zpe 1 A A Q2TLC1_9ADEN TURKEY ADENOVIRUS TYPE 3 FIBRE MGSSHHHHHHSSGLVPRGSHMASMTGGQQMGRGSEFASIGPPTTMVTGTVSPGRATNGQFVTKTAKVLRYKFVRWDALLIIQFIDNIGVMENPTFYRNKSIELRSADFLSPMLNNTYIVPLNGGVRVESPTIPVQLEVILENNSSFIQVGFVRLTVKNGNPHMIIQCNPVPGNIKMIKIKSVMLFTCLIG 190 T 67 DUF1491 pdbhh T Viruses T 3zpv 1 A,BA,C,DA,E,FA,G,HA,I,JA,L,N,P,R,T,V,X,Z 0,R,2,T,4,V,6,X,8,Z,B,D,F,H,J,L,N,P BCL9_DROME PROTEIN LEGLESS, PROTEIN LEGLESS GAMANHIFVFSTQLANKGAESVLSGQFQTIIAYHCTQ 37 T 6.7 Csm2_III-A pdbhh F Eukaryota T 3zqf 2 B C ANTI-INDUCER PEPTIDE TAP1 KASEGLARVAALARSR 16 T 54 Spond_N pdbhh F T 3zqg 2 B C ANTI-INDUCER PEPTIDE TAP2 TGERGRWQVWGLAKRC 16 T 3.5 DUF5691 pdbhh F T 3zqh 2 B C INDUCER PEPTIDE TIP3 KKESRVVVWRLPPLH 15 T 1.2 MHC_II_alpha pdbhh F T 3zqi 2 C,D C,D INDUCER PEPTIDE TIP2 DDSVLAARARMWMWHW 16 T 4.7 Metal_hydrol pdbhh F T 3zrj 2 C,D X,Y Q9KN57_VIBCH VIPB KKWAQGSLLDEIMAQTRCKK 20 T 0.054 DMP12 unp F Bacteria T 3zwz 2 B B Q8IKV6_PLAF7 RON2 DITQQAKDIGAGPVASCFTTRMSPPQQICLNSVVNTALS 39 T 3.1 zf-XS pdbhh F Eukaryota T 3zx7 1 A A TXL_EISFE LYSENIN SAKAAEGYEQIEVDVVAVWKEGYVYENRGSTSVDQKITITKGMKNVNSETRTVTATHSIGSTISTGDAFEIGSVEVSYSHSHEESQVSMTETEVYESKVIEHTITIPPTSKFTRWQLNADVGGADIEYMYLIDEVTPIGGTQSIPQVITSRAKIIVGRQIILGKTEIRIKHAERKEYMTVVSRKSWPAATLGHSKLFKFVLYEDWGGFRIKTLNTMYSGYEYAYSSDQGGIYFDQGTDNPKQRWAINKSLPLRHGDVVTFMNKYFTRSGLCYDDGPATNVYCLDKREDKWILEVVGLVPRGSGHHHHHH 309 T 0.027 Toxin_10 unphh F Eukaryota T 3zxu 2 B,D B,D CTF19_KLULA CTF19 MDFTSSSGVLDSERNTGSNDSDEPSSHSDVIETEELKLIKLQEHKNNLLRQRSELLDQLSQTRVVEPRSVQLDDKLLLKLLRRNDNAVSDSSQSSNNPLPRVLPSLNIEQRKKYLDITLNDVTVTCEKDMILLRKGSFTASFRIAVENESIRSMAIDLNAFEVELQPIIQYAEDTQNVNVAMMAVVQFLRIKELHEQMISKIVEASKFIRASNNTITLNDLEVSFHCYWNLPSPYPETLILTNKVQKILDFLIYQYGIQLGVIKYGSTII 270 T 0.0022 CENP-P pdbhh F Eukaryota T 3zzy 2 C,D C,D RAVR1_MOUSE RAVER1, PROTEIN RAVER-1 GAMGPGVSLLGAPPKD 16 T 2.9 Ste5 pdbhh F Eukaryota T 3zzz 2 C,D C,D RAVR1_MOUSE RAVER1, PROTEIN RAVER-1 GAMGSSEGLLGLGPGP 16 T 0.49 DUF6027 pdbhh F Eukaryota T 4a1t 2 C,D C,D CP5-46-A PEPTIDE GELGRLVYLLDGPGYDPIHCD 21 T 0.79 DUF5685 pdbhh F T 4a1v 2 C,D C,D CP5-46A-4D5E GELDELVYLLDGPGYDPIHS 20 T 2.4 IreB pdbhh F T 4a2a 2 C,D C,D FTSZ_THEMA CELL DIVISION PROTEIN FTSZ EGDIPAIYRYGLEGLL 16 T 1.6 DUF3510 pdbhh F Bacteria T 4a54 2 B B DCP2_SCHPO DCP2 GATTKEKNISVDVDADASSQLLSLLKSSTAPSDLATPQPSTFPQPPVESHSS 52 T 0.17 DUF1869 pdbpercent F Eukaryota T 4a5x 2 C,D C,D CHM1A_HUMAN CHROMATIN-MODIFYING PROTEIN 1A, CHMP1A, VACUOLAR PROTEIN SORTING-ASSOCIATED PROTEIN 46-1, VPS46-1, HVPS46-1 SHMEDQLSRRLAALRN 16 T 6.1 DUF4549 pdbhh F Eukaryota T 4a62 2 C,D C,D STBB_ECOLX PARR FROM PLASMID R1 EQKSDEETKKNAMKLIN 17 T 0.12 Nexin_C unppssm F Bacteria T 4a94 2 C,D C,D MCPI_NERVS CARBOXYPEPTIDASE INHIBITOR FHVPDDRPCINPGRCPLVPDATCTFVCKAADNDFGYECQHVWTFEGQRVGCYA 53 T 0.88 NPBW pdbhh F Eukaryota T 4aa2 2 B P BNP_GLOBL POTENTIATOR B EGLPPRPKIPP 11 T 0.12 UPF0449 pdbhh F Eukaryota T 4aai 1 A,B A,B Q6TRU9_9VIRU ORF E73 MVESKKIAKKKTTLAFDEDVYHTLKLVSVYLNRDMTEIIEEAVVMWLIQNKEKLPNELKPKIDEISKRFFPAK 73 T 0.0004 Omega_Repress unphh T Viruses T 4abi 2 B B SFTI1_HELAN PTA-SFTI INHIBITOR GRCTKSIXICFPD 13 T 0.052 Bowman-Birk_leg unp F Eukaryota T 4abj 2 B B SFTI1_HELAN ICA-SFTI INHIBITOR, SFTI-1 GRCTKSXPICFPD 13 T 0.052 Bowman-Birk_leg unp F Eukaryota T 4aid 2 D,E,F F,G,H Q9A749_CAUCR RNASE E TAPPEKPRRGWWRR 14 T 0.24 Leader_Trp pdbhh F Bacteria T 4aif 2 C,D D,E HS90A_HUMAN HEAT SHOCK 86 KDA, HSP 86, HSP86, RENAL CARCINOMA ANTIGEN NY-REN-38 SRMEEVD 7 T 26 CAP_N pdbhh F Eukaryota T 4aj5 1 A,AA,B,BA,C,CA,D,DA,Y,Z 1,W,2,X,3,Y,4,Z,U,V SKA3_HUMAN SKA3 MDPIRSFCGKLRSLASTLDCETARLQRALDGEESDFEDYPMRILYDLHSEVQTLKDDINILLDKARLENQEGIDFIKATKVLMEKNSMDIMKIREYFQKYG 101 T 0.0074 RNA_pol_RpbG pdb F Eukaryota T 4ak4 2 B,D,F,H,J,L,N,P B,D,F,H,J,L,N,P LECB4_ARTIN JACALIN BETA-4 CHAIN NEQSGISQTVIVGPWGAQVST 21 T 0.96 DUF3842 pdbhh F Eukaryota T 4akt 2 C C SUBSTRATE ANALOGUE VGAPIPFPAYDG 12 T 2.8 Ibs_toxin pdbhh F T 4amq 1 A A YL544_MIMIV L544 SYYHHHHHHLESTSLYKKAGLRMLIFTYKLERYIKNKILPKILVVPDRDKYQIKGSFRRRIPYITDIDIVNNVHPEYDDTNIYQRIVDLINSFTNDNQIKLIYVICGTDDRFLLTEYSDEEIEKIKILLNPTELVELNNVLSKYQDDLNKKVFYINEIIWDLYKLRWTSSEVLAGKKILRGGIEVSFQDVVKNNSILLLQYFVKIEYYPIGFDIAVRYKPINLITAYQNAAFYQLKLANYSKEYYFMLFPLRFYFKNDPTISKQLEYIIETKFGLYKQLLVRIDSYRTIYESGNLDLDTAKSIIISIIKDIRKLNGIDMNIIDKIQEVSNNSAGQDKIIAWNTLLTQLYTNINKSVNKQSKKYFTRYINIIPKEDRKLCCLEEEHVLQSGGINFESTNFLTKKKLIY 407 T 0.42 NPV_P10 unppssm T Viruses T 4ams 1 A A G5CQN7_9VIRU MG662 GSSHHHHHHSLEVLFQGPGSLIYTYKLEKYVRTKIFPKILLIPDKNRYIIKGSFRRRVPFVTDIDVVNNVYPEISRENIYDEIIKLVNNIQSDPNIILAYLSCGTDERFKISTGSSKELSNIQSLLPDNEKNEFQLVLNKYYNDQQKKLFFLNELIWDHYKLRWKPEDVLIGSMNLANNVSVNFRETVENNSTILLQYYVKLGSYPVGIDVVINYQKIDLTPAYKNAALYQLQLANYSREYYYMLFPLRYYFKNNQDISQRLENIIEKKYGLYKQLMVRIDDYHTLYKSGNLKIDMATNIVIGILRDIEKLPGFESDTIYQIKKVATNNSPSIKIEEWDILLKVLYQEINTAVNNKSRKYFYRYIAMVPPQDRSKNYISENQDMRLKMVN 390 T 0.26 DNA_pol_B_palm unp T Viruses T 4aom 2 B T MYOA_PLAF7 PFM-A, MYOA KNIPSLLRVQAHIRKKMV 18 T 0.22 BORCS8 pdbhh F Eukaryota T 4apj 2 B P BNP_GLOBL POTENTIATOR B QGLPPRPKIPP 11 T 1.5 UPF0449 pdbhh F Eukaryota T 4art 1 A,B A,B Y273_ATV STRUCTURAL PROTEIN ORF273 MGEKITEEREFQSISEIPEEEIDATNDEEKLADIVENEIEKEIRKSKTRKCKTIENFYYYILRDGKIYPASDYDIEVEKGKRSANDIYAFVETDVTRDFDEFLFDIDYGLPSISDILKFYLEKAGFRIANEVPTPNLKYYIHAVVEFGEDRPQYLAVNIYDIDSLARALRIPQIVEQKLGNKPRTITADEFNDIERIVAEEQPILAGYTYDEALRIPYHYYVDHNNSFKDDALKIAHAYLQLFPTPYQVCYEWKARWFNKIDCLKLERLKPSSHHHHHH 279 T 3.9 Transglut_core2 unphh T Viruses T 4ats 1 A A Y273_ATV STRUCTURAL PROTEIN ORF273 MGSSHHHHHHSSGLVPRGSHMGEKITEEREFQSISEIPEEEIDATNDEEKLADIVENEIEKEIRKSKTRKCKTIENFYYYILRDGKIYPASDYDIEVEKGKRSANDIYAFVETDVTRDFDEFLFDIDYGLPSISDILKFYLEKAGFRIANEVPTPNLKYYIHAVVEFGEDRPQYLAVNIYDIDSLARALRIPQIVEQKLGNKPRTITADEFNDIERIVAEEQPILAGYTYDEALRIPYHYYVDHNNSFKDDALKIAHAYLQLFPTPYQVCYEWKARWFNKIDCLKLERLKPSS 293 T 3.9 Transglut_core2 unphh T Viruses T 4au7 2 C C H4_MOUSE HISTONE H4 PEPTIDE RHRKVLRDY 9 T 0.27 UPF0137 unp F Eukaryota T 4axg 2 C,D C,D CUP_DROME OSKAR RIBONUCLEOPROTEIN COMPLEX 147 KDA SUBUNIT STGIHKPGSLRAPKAVRPTTAPVVSSKPVKSYTRSRLMDIRNGMFNALMHRSKESFVMPRIATCDDIELEGRLRRMNIWRTSDGTRFRTRSTTANLNMNNNNNNECMPAFFKNKNKPNLISDESIIQSQP 130 T 6.9E-21 EIF4E-T unppercent F Eukaryota T 4ay5 2 E,F,G,H I,J,K,L TAB1_HUMAN GTAB1TIDE PVSVPYSSAQS 11 T 9.2 YABBY pdbhh F Eukaryota T 4ay6 2 E,F,G,H E,F,G,H TAB1_HUMAN MITOGEN-ACTIVATED PROTEIN KINASE KINASE KINASE 7-INTERACTING PROTEIN 1, TGF-BETA-ACTIVATED KINASE 1-BINDING PROTEIN 1, TAK1-BINDING PROTEIN 1 PVSVPYXSAQSTS 13 T 13 DUF4128 pdbhh F Eukaryota T 4az0 2 B B PPGB_HUMAN CARBOXYPEPTIDASE C, CARBOXYPEPTIDASE L, CATHEPSIN A, PROTECTIVE PROTEIN CATHEPSIN A, PPCA, PROTECTIVE PROTEIN FOR BETA-GALACTOSIDASE MDPPCTNTTAASTYLNNPYVRKALNIPEQLPQWDMCNFLVNLQYRRLYRSMNSQYLKLLSSQKYQILLYNGDVDMACNFMGDEWFVDSLNQKMEVQRRPWLVKYGDSGEQIAGFVKEFSHIAFLTIKGAGHMVPTDKPLAAFTMFSRFLNKQPYE 155 F F Eukaryota T 4aza 2 B,D B,D IF4G1_HUMAN EIF4G1_D5S PEPTIDE XKKRYSREFLLGFX 14 T 0.00025 eIF_4G1 unphh F Eukaryota T 4b18 2 B B TERT_HUMAN HEST2, TELOMERASE CATALYTIC SUBUNIT, TELOMERASE-ASSOCIATED PROTEIN 2, TP2 RRRGGSASRSLPLPKRPRRA 20 T 19 KN_motif pdbhh F Eukaryota T 4b2u 1 A A KNO67_HEXDO S67 GTYCIELGERCPNPREGDWCCHKCVPEGKRFYCRDQ 36 T 1.2 Conotoxin pdbhh F Eukaryota T 4b2v 1 A A KNO64_HEXDO S64 SECVENGGFCPDPEKMGDWCCGRCIRNECRNG 32 T 2.5 Conotoxin pdbhh F Eukaryota T 4b45 2 B B CETZ2_HALVD CETZ2 MWHSDDLDDLLGSHHHHHH 19 T 84 Nop52 pdbhh F Archaea T 4b4n 2 B B CPSF6_HUMAN CPSF6, CLEAVAGE AND POLYADENYLATION SPECIFICITY FACTOR 68 KDA SUBUNIT, CFIM68, CPSF 68 KDA SUBUNIT, PRE-MRNA CLEAVAGE FACTOR IM 68 KDA SUBUNIT, PROTEIN HPBRII-4/7 PVLFPGQPFGQPPLG 15 T 2.2 MF_alpha pdbhh F Eukaryota T 4b4p 1 A,B A,B Q47212_ECOLX FEDF NSSASSAQVTGTLLGTGKTNTTQMPALYTWQHQIYNVNFIPSSSGTLTCQAGTILVWKNGRETQYALECRVSIHHSSGSINESQWGQQSQVGFGTACGNKKCRFTGFEISLRIPPNAQTYPLSSGDLKGSFSLTNKEVNWSASIYVPAIAK 151 T 2.5 DUF2511 pdbhh F Bacteria T 4b7e 2 B B CONSENSUS ANKYRIN REPEAT DOMAIN-LEU EVVKLLLEHGADVLAQD 17 T 0.00035 Shigella_OspC pdbhh F T 4b8o 2 B,C B,C A7XWN5_SV40 SV40TAGNLS GSPPKKKRKVG 11 T 0.42 ACTH_domain pdbhh T Viruses T 4b8p 2 C,D C,D A89NLS VHKTVLGKRKYW 12 T 0.11 MIER1_beta_C pdbhh F T 4b9w 2 C,D P,S PIWL2_MOUSE MILI GRAGPAGXGLVFR 13 T 21 RNR_Alpha pdbhh F Eukaryota T 4be5 1 A,B A,B RBMA MGSSHHHHHHSSGLVPRGSHMEVDCELQPVIEANLSLNQNQLASNGGYISSQLGIRNESCETVKFKYWLSIKGPEGIYFPAKAVVGVDTAQQESDALTDGRMLNVTRGFWVPEYMADGKYTVSLQVVAENGKVFKANQEFVKGVDLNSLPELNGLTIDIKNQFGINSVESTGGFVPFTVDLNNGREGEANVEFWMTAVGPDGLIIPVNAREKWVIASGDTYSKVRGINFDKSYPAGEYTINAQVVDIVSGERVEQSMTVVKK 262 T 0.015 BsuPI pdbhh F T 4bea 2 B B STAPLED EIF4E INTERACTING PEPTIDE KKRYSRXQLLXLX 13 T 2.3 BURAN pdbhh F T 4bg6 2 C,D Q,R RND3_HUMAN PROTEIN MEMB, RHO FAMILY GTPASE 3, RHO-RELATED GTP-BINDING PROTEIN RHO8, RND3 DLRKDKAKSC 10 T 36 DUF6306 pdbhh F Eukaryota T 4bh6 2 I,J,K,L,M,N,O,P I,J,K,L,M,N,O,P ACM1_YEAST APC/C-CDH1 MODULATOR 1 AQFMLYEETAEERNIAVHRHNEIYNNNNSVSNENNPSQVKENLSPAKICPYERAFLREGGRIALKDLSVD 70 T 0.12 HPD unp F Eukaryota T 4bj1 1 A A RIF2_YEAST RAP1-INTERACTING FACTOR 2 GGGRVDHVFYQKFKSMALQELGTNYLSISYVPSLSKFLSKNLRSMKNCIVFFDKVEHIHQYAGIDRAVSETLSLVDINVVIIEMNDYLMKEGIQSSKSKECIESMGQASYSGQLDFEASEKPSNHTSDLMMMVMRKINNDESIDHIVYFKFEQLDKLSTSTIIEPSKLTEFINVLSVLEKSNNIAFKVLIYSNNVSISSLLSTSLKKKLNTKYTVFEMPILTCAQEQEYLKKMIKFTFDSGSKLLQSYNSLVTCQLNNKESNLAIFFEFLKVFPHPFTYLFNAYTEIIVQSRTFDELLDKIRNRLTIKNYPHSAYNFKK 319 T 0.3 PHP_C pdbpercent F Eukaryota T 4bj5 1 A,B A,B RIF2_YEAST RAP1-INTERACTING FACTOR 2 GGGRMEHVDSDFAPIRRSKKVVDSDKIVKAISDDLEQKNFTVLRKLNLVPIKKSVSSPKVCKPSPVKERVDHVFYQKFKSMALQELGTNYLSISYVPSLSKFLSKNLRSMKNCIVFFDKVEHIHQYAGIDRAVSETLSLVDINVVIIEMNDYLMKEGIQSSKSKECIESMGQASYSGQLDFEASEKPSNHTSDLMMMVMRKINNDESIDHIVYFKFEQLDKLSTSTIIEPSKLTEFINVLSVLEKSNNIAFKVLIYSNNVSISSLLSTSLKKKLNTKYTVFEMPILTCAQEQEYLKKMIKFTFDSGSKLLQSYNSLVTCQLNNKESNLAIFFEFLKVFPHPFTYLFNAYTEIIVQSRTFDELLDKIRNRLTIKNYPHSAYNFKKNQRLPLKLTRKVHDR 399 T 0.4 PHP_C pdbpercent F Eukaryota T 4bj5 3 D,F D,F RIF2_YEAST REPRESSOR/ACTIVATOR SITE-BINDING PROTEIN, SBF-E, TUF, RAP1 FTVLRKLNLVPIK 13 T 0.46 DUF5771 pdbhh F Eukaryota T 4bj6 1 A,B A,B RIF2_YEAST RAP1-INTERACTING FACTOR 2 GGGRNFTVLRKLNLVPIKKSVSSPKVCKPSPVKERVDHVFYQKFKSMALQELGTNYLSISYVPSLSKFLSKNLRSMKNCIVFFDKVEHIHQYAGIDRAVSETLSLVDINVVIIEMNDYLMKEGIQSSKSKECIESMGQASYSGQLDFEASEKPSNHTSDLMMMVMRKINNDESIDHIVYFKFEQLDKLSTSTIIEPSKLTEFINVLSVLEKSNNIAFKVLIYSNNVSISSLLSTSLKKKLNTKYTVFEMPILTCAQEQEYLKKMIKFTFDSGSKLLQSYNSLVTCQLNNKESNLAIFFEFLKVFPHPFTYLFNAYTEIIVQSRTFDELLDKIRNRLTIKNYPHSAYNFKKNQRLPLKLTRKVHDR 365 T 0.36 PHP_C pdbpercent F Eukaryota T 4bjs 2 D D RIF1_YEAST RAP1-INTERACTING FACTOR 1, RAP1 INTERACTING FACTOR 1 PSLKLHFFSKKSRRLVARLRGFTPGDLNGISVEERRNLRIELLDFMMRLEYYSNRDNDMN 60 T 0.027 POB3_N pdbpercent F Eukaryota T 4bjt 2 D,E,F D,E,F RIF1_YEAST RAP1-INTERACTING FACTOR 1 ADISVLPEIRIPIFNSLKMQ 20 T 9.1 FTP pdbhh F Eukaryota T 4bl0 3 C,F C,F SP105_YEAST 105 KDA SPINDLE POLE COMPONENT PROTEIN DPTSMEMTEVFPRSIRQKN 19 T 5.2 MELT_2 pdbhh F Eukaryota T 4blb 2 E,F,G,H E,F,G,H GLI1_HUMAN TRANSCRIPTIONAL ACTIVATOR GL1, GLIOMA-ASSOCIATED ONCOGENE, ONCOGENE GLI TSPGGSYGHLSIGTMSP 17 T 96 Ntox11 pdbhh F Eukaryota T 4bld 2 E,F,G,H E,F,G,H GLI3_HUMAN GLI3 FORM OF 190 KDA, GLI3-190, GLI3 FULL LENGTH PROTEIN, G LI3FL, GLI3 C-TERMINALLY TRUNCATED FORM, GLI3 FORM OF 83 KDA, GLI3-8 GLI3 SSASGSYGHLSASAISP 17 T 9.3 Sulfakinin pdbhh F Eukaryota T 4blg 1 A,B A,B O41974_MHV68 IMMEDIATE-EARLY PROTEIN GPGYQKDPPKKYQGMRRHLQVTAPRLFDPEGHPPTHFKSAVMFSSTHPYTLNKLHKCIQSKHVLSTPVSCLPLVPGTTQQCVTYYLLSFVEDKKQAKKLKRVVLAYCEKYHSSVEGTIVKAKPYFPLPEPPTEPPTDPEQP 141 T 0.0001 EBV-NA1 pdbhh T Viruses T 4bpl 2 B B NUPL_XENLA NUCLEOPLASMIN NLS SAVKRPAATKKAGQAKKKKLD 21 T 0.0016 BSP_II unppercent F Eukaryota T 4bqd 2 C,D C,D PEPTIDE XFQSKPNVHVDGYFERLXAKL 21 T 0.54 Pea-VEAacid pdbhh F T 4bqk 2 C,D C,D VIRD2_AGRFC VIRD2NLS LSKRPREDDDGEPSERKRER 20 T 4.7 ROKNT pdbhh F Bacteria T 4btg 1 A,A10,A11,A12,A13,A14,A15,A16,A17,A18,A19,A2,A20,A21,A22,A23,A24,A25,A26,A27,A28,A29,A3,A30,A31,A32,A33,A34,A35,A36,A37,A38,A39,A4,A40,A41,A42,A43,A44,A45,A46,A47,A48,A49,A5,A50,A51,A52,A53,A54,A55,A56,A57,A58,A59,A6,A60,A7,A8,A9,B,B10,B11,B12,B13,B14,B15,B16,B17,B18,B19,B2,B20,B21,B22,B23,B24,B25,B26,B27,B28,B29,B3,B30,B31,B32,B33,B34,B35,B36,B37,B38,B39,B4,B40,B41,B42,B43,B44,B45,B46,B47,B48,B49,B5,B50,B51,B52,B53,B54,B55,B56,B57,B58,B59,B6,B60,B7,B8,B9 A,A,A,A,A,A,A,A,A,A,A,A,A,A,A,A,A,A,A,A,A,A,A,A,A,A,A,A,A,A,A,A,A,A,A,A,A,A,A,A,A,A,A,A,A,A,A,A,A,A,A,A,A,A,A,A,A,A,A,A,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B P1_BPPH6 CAPSID SUBUNIT OF THE BACTERIOPHAGE PHI6 GFNLKVKDLNGSARGLTQAFAIGELKNQLSVGALQLPLQFTRTFSASMTSELLWEVGKGNIDPVMYARLFFQYAQAGGALSVDELVNQFTEYHQSTACNPEIWRKLTAYITGSSNRAIKADAVGKVPPTAILEQLRTLAPSEHELFHHITTDFVCHVLSPLGFILPDAAYVYRVGRTATYPNFYALVDCVRASDLRRMLTALSSVDSKMLQATFKAKGALAPALISQHLANAATTAFERSRGNFDANAVVSSVLTILGRLWSPSTPKELDPSARLRNTNGIDQLRSNLALFIAYQDMVKQRGRAEVIFSDEELSSTIIPWFIEAMSEVSPFKLRPINETTSYIGQTSAIDHMGQPSHVVVYEDWQFAKEITAFTPVKLANNSNQRFLDVEPGISDRMSATLAPIGNTFAVSAFVKNRTAVYEAVSQRGTVNSNGAEMTLGFPSVVERDYALDRDPMVAIAALRTGIVDESLEARASNDLKRSMFNYYAAVMHYAVAHNPEVVVSEHQGVAAEQGSLYLVWNVRTELRIPVGYNAIEGGSIRTPEPLEAIAYNKPIQPSEVLQAKVLDLANHTTSIHIWPWHEASTEFAYEDAYSVTIRNKRYTAEVKEFELLGLGQRRERVRILKPTVAHAIIQMWYSWFVEDDRTLAAARRTSRDDAEKLAIDGRRMQNAVTLLRKIEMIGTTGIGASAVHLAQSRIVDQMAGRGLIDDSSDLHVGINRHRIRIWAGLAVLQMMGLLSRSEAEALTKVLGDSNALGMVVA 761 T 0.22 STAG pdb T Viruses T 4btp 1 A,B,C,D,E,F,G,H,I,J A,B,C,D,E,F,G,H,I,J Q9MC13_9VIRU p1 MSKLDLGRVDLLSMLGNSSSAGVDTAKGVIPFSTSGATWAVPRLSEDGITSHFLRRRGYVTMTQGGSRDQNAAVRKILSLIIAYDIQTQACFFISNEESMRITMAETMGVKDRPNARTNSWAEVSDSDINRGIAKALKEGNLTLDENQKDGFMKLVHAFVADILAQSGHYKPVTSVTYFSAPIDMESDYLDPFSIAIIRDVLDDSPFSELRYDARAMSELEDRDVPITRFSRVMAQMGNAMVRNIMVLNEAAQRKLRGLAVVGEIVHGRVRAPVRYLNDSFIQTLRSNINFHLLTRTTPERWAQSWIQAFGSLKGWVDAINGIADATTEEEKKKLAMQTSMDLELLSDLTPLIRDAATSVEKFVTFAPLSFYQGLGSVTQIRALDSSTNLAAVIVRYAAKEINLIPAYQSFQVPTVDVAVKKTAIMDQRLSLQLPEFSEDQFFGMLEQRMQNMSDSEVAELVDRIAKGETPFGDVVKQLPGTSTLLVTNGYYMGGLLTNEDKIIPGDASVPALLYMQAASFASSVRFPPGEYPVFHHESSNGDVRLTDQVSADAQLSHSAVETANPLNFLVACNVSVHTPSIAIDIIEPMPDLTRRGTTEYVHKGEIKVAAIPSLPPKSADRKAQVSRETAKFERVLYKARKGGAQVAAPIDLESLFGIAVNLAVPTVKHVYSPDSKTKLALDIIKGLESDGDKAAATRLLMTLARAYTGTYSSLALRRRDEITGIAAQPSDVAMQEFALQSGVQTLKAVAKHTGIMEVATIEMVEEKVRSLDDNRFYEIAAEVVLRALKGM 792 T 12 DUF445 pdbhh T Viruses T 4bu0 2 B,C B,C RHP9_SCHPO RAD9 HOMOLOG, CRB2 GYGEVLVPETVAQHRT 16 T 1.7 RTC4 pdbhh F Eukaryota T 4bu1 2 C,D C,D RHP9_SCHPO RAD9 HOMOLOG, CRB2 GYGRVESTPPAFLP 14 T 1.5 DUF2104 pdbhh F Eukaryota T 4buz 2 B P P53_HUMAN ANTIGEN NY-CO-13, PHOSPHOPROTEIN P53, TUMOR SUPPRESSOR P53, P53 RHKXLMFK 8 T 15 DUF420 pdbhh F Eukaryota T 4bv2 2 C,D E,H P53_HUMAN DEACETYLATED P53-PEPTIDE, ANTIGEN NY-CO-13, PHOSPHOPROTEIN P53, TUMOR SUPPRESSOR P53, P53 STSRHKKLMFKTE 13 T 40 DUF420 pdbhh F Eukaryota T 4bwq 2 B,D,F,H B,D,F,H PQBP1_HUMAN PQBP-1,38 KDA NUCLEAR PROTEIN CONTAINING A WW DOMAIN, NPW38, POLYGLUTAMINE TRACT-BINDING PROTEIN 1, PQBP-1 KRNEAKTGADTTAAGPLFQQRPYPSPGAVLRANAEASRTKQQD 43 T 5 Cortexin unphh F Eukaryota T 4bws 2 B,E B,E PQBP1_HUMAN PQBP-1,38 KDA NUCLEAR PROTEIN CONTAINING A WW DOMAIN, NPW38, POLYGLUTAMINE TRACT-BINDING PROTEIN 1, PQBP-1 TGADTTAAGPLFQQRPYPSPGAVLRANAEASRTKQQD 37 T 5.4 REV pdbhh F Eukaryota T 4bx4 1 A,B A,B Q9MC13_9VIRU P1 MSKLDLGRVDLLSMLGNSSSAGVDTAKGVIPFSTSGATWAVPRLSEDGITSHFLRRRGYVTMTQGGSRDQNAAVRKILSLIIAYDIQTQACFFISNEESMRITMAETMGVKDRPNARTNSWAEVSDSDINRGIAKALKEGNLTLDENQKDGFMKLVHAFVADILAQSGHYKPVTSVTYFSAPIDMESDYLDPFSIAIIRDVLDDSPFSELRYDARAMSELEDRDVPITRFSRVMAQMGNAMVRNIMVLNEAAQRKLRGLAVVGEIVHGRVRAPVRYLNDSFIQTLRSNINFHLLTRTTPERWAQSWIQAFGSLKGWVDAINGIADATTEEEKKKLAMQTSMDLELLSDLTPLIRDAATSVEKFVTFAPLSFYQGLGSVTQIRALDSSTNLAAVIVRYAAKEINLIPAYQSFQVPTVDVGVKKTAIMDQRLSLQLPEFSEDQFFGMLEQRMQNMSDSEVAELVDRIAKGETPFGDVVKQLPGTSTLLVTNGYYMGGLLTNEDKIIPGDASVPALLYMQAASFASSVRFPPGEYPVFHHESSNGDVRLTDQVSADAQLSHSAVETANPLNFLVACNVSVHTPSIAIDIIEPMPDLTRRGTTEYVHKGEIKVAAIPSLPPKSADRKAQVSRETAKFERVLYKARKGGAQVAAPIDLESLFGIAVNLAVPTVKHVYSPDSKTKLALDIIKGLESDGDKAAATRLLMTLARAYTGTYSSLGLRRRDEITGIAAQPSDVAMQEFALQSGVQTLKAVAKHTGIMEVATIEMVEEKVRSLDDNRFYEIAAEVVLRALKGM 792 T 14 DUF445 pdbhh T Viruses T 4bxu 2 B B PEX5_HUMAN PTS1 RECEPTOR, PTS1R, PTS1-BP, PEROXIN-5, PEROXISOMAL C-TERMINAL TARGETING SIGNAL IMPORT RECEPTOR, PEROXISOME RECEPTOR 1, PEX5 ASEDELVAEFLQDQN 15 T 2.3 DUF5748 pdbhh F Eukaryota T 4bxw 2 C F FA5_PSETE FACTOR V A2 PEPTIDE GNEEEEEDDGDIFADIFI 18 T 5.3 CAF1-p150_C2 pdbhh F Eukaryota T 4by8 1 A A PARACELSIN-X XXAXXAXAXQXVIXGXXPVIXXQQX 25 T 18 DUF3824 pdbhh F T 4c1a 1 A,B,C,D A,B,C,D Q3LG57_DANRE ZFL2-1 ORF1P GPAMEALELELEEVESQIRALVVRRSRLRERLLAVP 36 T 0.069 ABC_tran_CTD unppercent F Eukaryota T 4c2g 3 C C CTPB_BACSU PEPTIDE VPA, CTPB, C-TERMINAL PROCESSING PROTEASE EMDKPQTAAVPA 12 T 0.26 Phyto-Amp unppssm F Bacteria T 4c31 3 C,F,G,H C,F,X,Y NUP1_YEAST NUCLEAR PORE PROTEIN NUP1, NUP1 GSPKKDKESIVLPTVGFDFIKDNETPSKKTSPKATS 36 T 0.81 zf-C2H2_assoc2 pdbhh F Eukaryota T 4c5a 2 C C PEPTIDE ENLYFQGA 8 T 4.7 RIP pdbhh F T 4c5e 2 E,F,G,H E,F,G,H PHO_DROME PROTEIN PLEIOHOMEOTIC, TRANSCRIPTION FACTOR YY1 HOMOLOG GAMASRRWEQKLVHIKTMEGEFSVTMWASGIS 32 T 0.24 zf-H2C2_2 unppssm F Eukaryota T 4c5g 2 B B PHO_DROME PROTEIN PLEIOHOMEOTIC, TRANSCRIPTION FACTOR YY1 HOMOLOG AGMASRRWEQKLVHIKTMEGEFSVTMWASGIS 32 T 0.092 INO80_Ies4 pdbpercent F Eukaryota T 4c5h 2 B B PHO_DROME PROTEIN PLEIOHOMEOTIC, TRANSCRIPTION FACTOR YY1 HOMOLOG GAMADINTEESGVVDKNSPFLTLGTTILNSNGKSRRWEQKLVHIKTMEGEFSVTMWASGISDDEYSGSDQIVGASDLLKGKEEFGIDGFTSQQNKEYQKMESKFTNAQTLEMPHPISSVQIMDHLIKERGNLSQE 135 T 0.24 BCL_N pdb F Eukaryota T 4c5i 2 C C TYY1_HUMAN DELTA TRANSCRIPTION FACTOR, INO80 COMPLEX SUBUNIT S, NF-E1, YIN AND YANG 1, YY-1, YY1 DPGNKKWEQKQVQIKTLEGEFSVTMWSSDE 30 T 0.98 INO80_Ies4 pdbhh F Eukaryota T 4c93 2 D,E D,E DPOA_YEAST DNA POLYMERASE I SUBUNIT A, DNA POLYMERASE ALPHA\: PRIMASE COMPLEX P180 SUBUNIT, DNA POLYMERASE-PRIMASE COMPLEX P180 SUBUNIT, POL ALPHA-PRIMASE COMPLEX P180 SUBUNIT, DNA POLYMERASE ALPHA CATALYTIC SUBUNIT IDNFDDILGEFES 13 T 0.8 DUF4927 pdbhh F Eukaryota T 4c95 2 D,E D,E SLD5_YEAST SLD5 MDINIDDILAELDKETTAV 19 T 0.43 Bombolitin pdbhh F Eukaryota T 4cay 3 C C AN32E_HUMAN LANP-LIKE PROTEIN, LANP-L, ANP32E GSHMEVGLSYLMKEEIQDEEDDDDYVEEGE 30 T 0.0014 BUD22 unp F Eukaryota T 4cc3 2 B,D,F,H B,D,F,H ENAH_MOUSE NPC-DERIVED PROLINE-RICH PROTEIN 1, NDPP-1, MURINE MENA PPPPLPSGPAYA 12 T 3.9 FAF pdbhh F Eukaryota T 4cc9 3 C C SAMH1_HUMAN DNTPASE, DENDRITIC CELL-DERIVED IFNG-INDUCED PROTEIN, DCIP, MONOCYTE PROTEIN 5, MOP-5, SAM DOMAIN AND HD DOMAIN-CONTAINING PR OTEIN 1, SAMHD1 MASWSHPQFEKGALEVLFQGPGYQDPQDGDVIAPLITPQKKEWNDSTSVQNPTRLREASKSRVQLFKDDPM 71 T 20 DUF3674 pdbhh F Eukaryota T 4cfh 3 C C AAPK1_RAT AMPK SUBUNIT ALPHA-1 FQVAPRPGSHTIEFFEMCANLIKILAQ 27 T 0.00049 AdenylateSensor pdbhh F Eukaryota T 4cg6 4 D D PEPTIDE VFIVSVGSFISVLFIVI 17 T 2 DUF5383 pdbhh F T 4ch2 3 E,F P,Q GP1BA_HUMAN GP-IB ALPHA, GPIB-ALPHA, GPIBA, GLYCOPROTEIN IBALPHA, ANTIGEN CD42B-ALPHA, GPIBALPHA PEPTIDE GDTDLXDXXPEEDT 14 T 0.34 UPF0300 pdbhh F Eukaryota T 4ch9 2 C,D C,D WNK4_HUMAN PROTEIN KINASE LYSINE-DEFICIENT 4, PROTEIN KINASE WITH NO LYSINE 4 EPEEPEADQHQ 11 T 2.8 AgrD pdbhh F Eukaryota T 4cih 1 A,B,C,D A,B,C,D LNTA_LISMO LNTA RPKLSTKDLALIKADLAEFEARELSSEKILKDTIKEESWSDLDFANDNINQMIGTMKRYQQEILSIDAIKRSSEASADTEAFKKIFKEWSEFKIERIQVTIDLLNGKKDSEAVFKKTYPNQIIFDDVRTNKLQTALNNLKVGYELLDSQK 150 T 0.042 DUF5697 pdbpercent F Bacteria T 4cii 1 A A O25272_HELPY CAG PATHOGENICITY ISLAND PROTEIN 18 EDITSGLKQLDSTYQETNQQVLKNLDEIFSTTSPSANNEMGEEDALNIKKAAIALRGDLALLKANFEANELFFISEDVIFKTYMSSPELLLTYMKINPLDQNTAEQQCGISDKVLVLYCEGKLKIEQEKQNIRERLETSLKAYQSNIGGTASLITASQTLVESLKNKNFIKGIRKLMLAHNKVFLNYLEELDALERSLEQSKRQYLQERQSSKIIVKLEHHHHHH 225 T 0.0046 IDO pdbpssm F Bacteria T 4clq 2 B B BMS1_YEAST BMS1P WNIGKLIYMDNISPEECIRRWRGEDDDSKDESDIEEDVDDDFFRKKDGTVTKEGNKDHAVDLEKFVPYFDTFEKLAKKWKSVDAIKERFL 90 T 0.1 Sigma70_ner pdbpssm F Eukaryota T 4cqo 2 B,D B,D NANO1_HUMAN NOS-1, EC_REP1A FSSWNDYLGLATLITKA 17 T 2 DUF3243 pdbhh F Eukaryota T 4cu5 1 A,B,C,D,E,F A,B,C,D,E,F B6SBV8_9CAUD ENDOLYSIN MYKHTIVYDGEVDKISATVVGWGYNDGKILICDIKDYVPGQTQNLYVVGGGACEKISSITKEKFIMIKGNDRFDTLYKALDFINR 85 T 0.16 DUF1161 unp T Viruses T 4cvo 1 A A ERCC6_HUMAN ATP-DEPENDENT HELICASE ERCC6, COCKAYNE SYNDROME PROTEIN CSB SMEPSAQALELQGLGVDVYDQDVLEQGVLQQVDNAIHEASRASQLVDVEKEYRSVLDDLTSCTTSLRQINKIIEQLSPQ 79 T 0.0069 DegQ pdbpercent F Eukaryota T 4cvz 3 C C PEPTIDE YELDEKFDRL 10 T 0.4 HJURP_C pdbhh F T 4cw1 3 C,F C,F PEPTIDE SWFRKPMTR 9 T 2.7 Tenui_NS4 pdbhh F T 4cw8 1 A A Q0GF90_9ADEN TURKEY ADENOVIRUS TYPE 3 FIBRE MGSSHHHHHHSSGLVPRGSHMASMTGGQQMGRGSEFASIGPPTTMVTGTVSPGRATNGQFVTKTAKVLRYKFVRWDALLIIQFIDNIGVIENPTFYRNKSIELRSADFLSPTLNNTYIVPLNGGVRVESPTIPVQLEVILENNSSFIQVGFVRLTVKNGNPHMIIQCNPVPGNIKMIKIKSVMLFTCLIG 190 T 51 DUF1491 unphh T Viruses T 4cy1 2 C,D C,D KANL1_HUMAN KANSL1, MLL1/MLL COMPLEX SUBUNIT KANSL1, MSL1 HOMOLOG 1, HMSL1V1, NSL COMPLEX PROTEIN NSL1, NON-SPECIFIC LETHAL 1 HOMOLOG DGTCVAARTRPVLSY 15 T 5.3 DUF436 pdbhh F Eukaryota T 4cy2 2 B C KANL2_HUMAN KANSL2, NSL COMPLEX PROTEIN NSL2, NON-SPECIFIC LETHAL 2 HOMOLOG YEFSDDLDVVGDG 13 T 3.8 Rsa3 pdbhh F Eukaryota T 4cy3 2 B D A4V2Z1_DROME CG4699, ISOFORM D GSDYLCSRARPLVLSE 16 T 0.66 Papilloma_E5 pdbhh F Eukaryota T 4cy5 2 B C Q9VAF4_DROME NSL2, LD12439P YRDDDEIDVVSPH 13 T 0.068 Myc_N pdbhh F Eukaryota T 4cyd 2 E,F F,H PROBABLE EXPRESSION TAG AHHHHDYDIPTTENLYFQGHM 21 T 0.72 DUF5704 pdbhh F T 4cyj 2 E,F E,F PAN2_CHATD PAN2 GSMPLSSIGLPYYREPLFSAWPADIISDVGAPPLQLEPSFVATLKQAEWGLYGKNTRNVRRNQVEDTRNTNKQSNALQAPKFLSERARESALSSGGDSSSDPQVDQEPEDPNEIESLKP 119 T 0.74 PKI unppercent F Eukaryota T 4cyk 1 A A PAN3_YEAST PAB1P-DEPENDENT POLY(A)-NUCLEASE, PAN3P MDKINPDWAKDIPCRNITIYGYCKKEKEGCPFKHSDNTTAT 41 T 0.37 zf-CCCH unppercent F Eukaryota T 4d07 2 B B MYO5A_HUMAN MYO5A GSHMSQKEAIQPKDDKNTMTDSTILLE 27 T 0.77 DUF2046 unphh F Eukaryota T 4d0b 3 C C PEPTIDE TAGQEDYDRL 10 T 8.5 CitT pdbhh F T 4d0c 3 C C 10MER PEPTIDE TAGQSNYDRL 10 T 1.2 SASA pdbhh F T 4d0d 3 C,F,I,L C,F,I,L Q9DG07_CHICK SLP-76 ADAPTOR PROTEIN VIFPAKSL 8 T 2.9 DcpS pdbhh F Eukaryota T 4d0u 1 A,B,C,D A,B,C,D SPIKE_ADES1 SPIKE, PROTEIN IV, FIBER PROTEIN OF THE ATADENOVIRUS SNAKE ADENOVIRUS 1 GSSHHHHHHSSGLVPRGSHMASMTGGQQMGRGSPSPPSKTSLDIAEELQNDKGVSFAFQAREEELGAFTKRTLFAYSGDGLTGPFKAPASAELSSFLTAHPKGRWLIAFPLGTGIVSVDEGIMTMEISRSLPEVGSGSSFYLTEK 145 T 0.1 BRK pdb T Viruses T 4d0v 1 A,B,C,D A,B,C,D SPIKE_ADES1 SPIKE, PROTEIN IV, FIBER PROTEIN OF THE ATADENOVIRUS SNAKE ADENOVIRUS 1 MGSSHHHHHHSSGLVPRGSHMASMTGGQQMGRIPSPPSKTSLDIAEELQNDKGVSFAFQAREEELGAFTKRTLFAYSGDGLTGPFKAPASAELSSFLTAHPKGRWLIAFPLGTGIVSVDEGILTLEISRSLPEVGSGSSFYLTEK 145 T 0.1 BRK pdb T Viruses T 4d1f 1 A,B,C,D,E,F,G,H,I,J,K,L A,B,C,D,E,F,G,H,I,J,K,L SPIKE_ADES1 SPIKE, PROTEIN IV, FIBER PROTEIN OF THE ATADENOVIRUS SNAKE ADENOVIRUS 1 GSSHHHHHHSSGLVPRGSHMASMTGGQQMGRGSPSPPSKTSLDIAEELQNDKGVSFAFQAREEELGAFTKRTLFAYSGDGLTGPFKAPASAELSSFLTAHPKGRWLIAFPLGTGIVSVDEGILTLEISRSLPEVGSGSSFYLTEK 145 T 0.1 BRK pdb T Viruses T 4d62 1 A A Q2TLC1_9ADEN TURKEY ADENOVIRUS TYPE 3 FIBRE MGSSHHHHHHSSGLVPRGSHMASMTGGQQMGRGSEFGPPTTMVTGTVSPGRATNGQFVTKTAKVLRYKFVRWDALLIIQFIDNIGVMENPTFYRNKSIELRSADFLSPMLNNTYIVPLNGGVRVESPTIPVQLEVILENNSSFIQVGFVRLTVKNGNPHMIIQCNPVPGNIKMIKIKSVMLFTCLIG 187 T 65 DUF1491 pdbhh T Viruses T 4day 3 C C FANCM_HUMAN PROTEIN FACM, ATP-DEPENDENT RNA HELICASE FANCM, FANCONI ANEMIA-ASSOCIATED POLYPEPTIDE OF 250 KDA, FAAP250, PROTEIN HEF ORTHOLOG GHMEDIFDCSRDLFSVTFDLGFCSPDSDDEILEHTSD 37 T 9.4 EDR2_C pdbhh F Eukaryota T 4dc2 2 B Z PARD3_RAT PAR-3, PARD-3, ATYPICAL PKC ISOTYPE-SPECIFIC-INTERACTING PROTEIN, ASIP, ATYPICAL PKC-SPECIFIC-BINDING PROTEIN, ASBP DPVLAFQREGFGRQSMSEKRTKQFSNAS 28 T 0.19 LamB_YcsF pdbhh F Eukaryota T 4djc 2 B B SCN5A_HUMAN HH1, SODIUM CHANNEL PROTEIN CARDIAC MUSCLE SUBUNIT ALPHA, SODIUM CHANNEL PROTEIN TYPE V SUBUNIT ALPHA, VOLTAGE-GATED SODIUM CHANNEL SUBUNIT ALPHA NAV1.5 SNAQKKYYNAMKKLGSKKPQKPIPRPLNKYQGFIF 35 T 12 Prp18 pdbhh F Eukaryota T 4djs 2 B B stapled peptide RRWPQ(MK8)ILD(MK8)HVRRVWR RRWPQXILDXHVRRVWR 17 T 0.00013 Axin_b-cat_bind pdb F T 4dmi 1 A,B,C,D,E A,B,C,D,E Capsid Protein ASQQFRIDSESIRDKLNTLLPSQSRGSIGVDLSGSTTIIPVVDLTETAEGGAQREDLQKAFTLINTIDFDVENTTTTIANTPGFYKVVGNLSSRDEASGAIAVIEVTDGITTKILANNRIVSPDGTTAVQSVPVPFDLMVKLVAGDTLQARSNNAEVRVQGIARQIADVSGNLINP 176 T 27 DUF5606 pdbhh F T 4dny 1 A A STCE_ECO57 MUCINASE, NEUTRAL ZINC METALLOPROTEASE STCE, SECRETED PROTEASE OF C1 ESTERASE INHIBITOR FROM EHEC GSHMASHLDGVPEGGIDFTPHNGTKKIINTVAEVNKLSDASGSSIHSHLTNNALVEIHTANGRWVRDIYLPQGPDLEGKMVRFVSSAGYSSTVFYGDRKVTLSVGNTLLFKYVNGQWFRSGELENN 126 T 0.32 CMV_1a_C pdb F Bacteria T 4dow 2 C,D C,D H4_MOUSE Histone H4 GAKRHRKVLRDN 12 T 0.27 UPF0137 unp F Eukaryota T 4dqm 2 B,D B,D NCOA1_HUMAN NCOA-1, CLASS E BASIC HELIX-LOOP-HELIX PROTEIN 74, BHLHE74, PROTEIN HIN-2, RIP160, RENAL CARCINOMA ANTIGEN NY-REN-52, STEROID RECEPTOR COACTIVATOR 1, SRC-1 KSLLQQLLTE 10 T 7.3 E3_UbLigase_RBR pdbhh F Eukaryota T 4drw 2 E,F E,F AHNK_HUMAN DESMOYOKIN GKVTFPKMKIPKFTFSGREL 20 T 11 DUF5476 pdbhh F Eukaryota T 4ds1 2 B,D B,D NU159_YEAST NUCLEAR PORE PROTEIN NUP159 NYAESGIQTDL 11 T 4.8 PilX_N pdbhh F Eukaryota T 4dt5 1 A,B A,B E5LR38_9CUCU Antifreeze protein GYSCRAVGVDGRAVTDIQGTCHAKATGAGAMASGTSEPGSTSTATATGRGATARSTSTGRGTATTTATGTASATSNAIGQGTATTTATGSAGGRATGSATTSSSASQPTQTQTITGPGFQTAKSFARNTATTTVTASHHHHHH 143 T 0.8 Sporozoite_P67 pdb F Eukaryota T 4e0e 1 A,B,C,D A,B,C,D Q8A074_BACTN Putative uncharacterized protein GAQQLTPPAGTFRLGISKGTDSHWLAPQEKVKGIAFRWKALPDTRGFILEVAVTSLQQADTLFWSFGNCQPDMDINVFSVEGQAFTCYYGESMKLRTLQAVTPTDDIRLSNGRQDKTPLLLYESGKRTDRPVLAGRCPLAANSKLYFCFYEQNARADYNYFMLPDLFAKIDESKHSKK 178 T 4.5E-08 DUF4450 unppercent F Bacteria T 4e27 1 A,B,C,D,E A,B,C,D,E Capsid Protein SQQFRIDSESIRDKLNTLLPSQSRGSIGVDLSGSTTIIPVVDLTETAEGGAQREDLQKAFTLINTIDFDVENTTTTIANTPGFYKVVGNLSSRDEASGAIAVIEVTDGITTKILANNRIVSPDGTTAVQSVPVPFDLMVKLVAGDTLQARSNNAEVRVQGIARQIADVSGNLINP 175 T 27 DUF5606 pdbhh F T 4e35 2 C,D C,D iCAL50 peptide ANSRWPTSIL 10 T 9.5 CBP_BcsR pdbhh F T 4e73 2 B B JIP1_HUMAN JIP-1, JNK-INTERACTING PROTEIN 1, ISLET-BRAIN 1, IB-1, JNK MAP KINASE SCAFFOLD PROTEIN 1, MITOGEN-ACTIVATED PROTEIN KINASE 8-INTERACTING PROTEIN 1 KPKRPTTLNLF 11 T 6.3 Baculo_8kDa pdbhh F Eukaryota T 4e9c 2 B B LDPPLHSpTA phosphopeptide XLDPPLHSTAX 11 T 13 MethyltransfD12 pdbhh F T 4edn 2 K,L,M,N,O,P,Q K,L,M,N,O,P,Q PAXI_HUMAN Paxillin XMDDLDALLADLESTTSHISKX 22 T 0.021 DUF883 pdbpssm F Eukaryota T 4ehp 1 A A VINC_HUMAN METAVINCULIN MMPVFHTRTIESILEPVAQQISHLVIMHEEGEVDGKAIPDLTAPVAAVQAAVSNLVRVGKETVQTTEDQILKRDMPPAFIKVENACTKLVQAAQMLQSDPYSVPARDYLIDGSRGILSGTSDLLLTFDEAEVRKIIRVCKGILEYLTVAEVVETMEDLVTYTKNLGPGMTKMAKMIDERQQELTHQEHRVMLVNSMNTVKELLPVLISAMKIFVTTKNSKNQGIEEALKNRNFTVEKMSAEINEIIRVLQLTS 253 F F Eukaryota T 4eje 2 C,D C,D VP40_EBOZM MEMBRANE-ASSOCIATED PROTEIN VP40 ILPTAPPEY 9 T 1.7 MLANA pdbhh T Viruses T 4ejf 2 E,F,G,H E,F,G,H phage-derived peptide 419 TEKEKGRLHCVEWTILER 18 T 1.4 UBA_e1_thiolCys pdbhh F T 4eo0 1 A A G3P_BPIKE GENE 3 PROTEIN, G3P, MINOR COAT PROTEIN MDNWESITKSYYTGFAISKTVESKDKDGKPVRKEVITQADLTTACNDAKASAQNVFNQIKLTLSGTWPNSQFRLVTGDTCVYNGSPGEKTESWSIRAQVEGDIQRSVPDHHHHHH 115 T 0.058 DUF1579 pdbpercent T Viruses T 4eoy 2 D,E,F D,E,F C0H519_PLAF7 Autophagy-related protein 3 NDWLLPSY 8 T 1.5 DUF1566 pdbhh F Eukaryota T 4ep3 2 B E Q9YP46_9HIV1 substrate CA-p2 KARVLAEAM 9 T 0.18 HypA unp T Viruses T 4epj 2 B D Q9YP46_9HIV1 substrate p2-NC ATIMMQRG 8 T 0.18 HypA unp T Viruses T 4eqa 2 C,D C,D Q9I2Q0_PSEAE PA1845 PROTEIN ADCTFTQLEIVPQFGSPNMFGGEDEHVRVMFSNEDPNDDNPDAFPEPPVYLADRDSGNDCRIEDGGIWSRGGVFLSQDGRRVLMHEFSGSSAELVSYDSATCKVVHREDISGQRWAVDKDGLRLGQKCSGESVDSCAKIVKRSLAPFCQTAKK 153 T 1.4 Me-amine-dh_H pdbhh F Bacteria T 4er4 2 B I H-142 PHPFHXIHK 9 T 3.8 IucA_IucC pdbhh F T 4erq 2 D,E,F D,E,F KMT2D_HUMAN ALL1-RELATED PROTEIN, LYSINE N-METHYLTRANSFERASE 2B, KMT2B, MYELOID/LYMPHOID OR MIXED-LINEAGE LEUKEMIA PROTEIN 2 INPTGCARSEPKIL 14 T 0.0018 N-SET pdbhh F Eukaryota T 4ery 2 B D KMT2C_HUMAN HOMOLOGOUS TO ALR PROTEIN, LYSINE N-METHYLTRANSFERASE 2C, KMT2C, MYELOID/LYMPHOID OR MIXED-LINEAGE LEUKEMIA PROTEIN 3 VNPTGCARSEPKMS 14 T 0.00076 N-SET pdbhh F Eukaryota T 4erz 2 D,E,F D,E,F KMT2B_HUMAN LYSINE N-METHYLTRANSFERASE 2D, KMT2D, MYELOID/LYMPHOID OR MIXED-LINEAGE LEUKEMIA PROTEIN 4, TRITHORAX HOMOLOG 2, WW DOMAIN-BINDING PROTEIN 7, WBP-7 LNPHGAARAEVYLR 14 T 0.39 N-SET pdbhh F Eukaryota T 4es8 1 A,B A,B A0A0H3BY62_STRPZ Epf GDHGPEFNGVMVVKAAEAEELPDDLMNFKGTWEVSADGSSGRFFSKGATDSYVFHLIPAKDVKKPGWREHNEVKDSYIKIDKQSIAARYKTSTTAPYSVAFKVNTKSLIKDHDYKITFEQGQIASGITVDYRIGSAFNKTTDDSFKISDESKYASNVKIEGEEQGFKQREQGDKTISFRTLKEGPMSLVLLSKVEKKPQGDLDVEFKNLKIIDVTNPSQLDKGVAYVGNKNVQLTLKSDDGRTNFEGDEISLFNSRGELLQTVTVTKDQQNPISITLSEDQAKSLKNKEKLKVSIKQKQSKKTSKDFFFEVGIDPKVEAK 320 T 7 DUF4493 pdbpssm F Bacteria T 4esg 2 C,D C,D KMT2A_HUMAN ALL-1, CXXC-TYPE ZINC FINGER PROTEIN 7, LYSINE N-METHYLTRANSFERASE 2A, KMT2A, TRITHORAX-LIKE PROTEIN, ZINC FINGER PROTEIN HRX, MLL CLEAVAGE PRODUCT N320, N-TERMINAL CLEAVAGE PRODUCT OF 320 KDA, P320, MLL CLEAVAGE PRODUCT C180, C-TERMINAL CLEAVAGE PRODUCT OF 180 KDA, P180 EPPLNPHGSARAEVHLR 17 T 1.1 N-SET unphh F Eukaryota T 4ext 2 B B REV3L_HUMAN PROTEIN REVERSIONLESS 3-LIKE, REV3-LIKE, HREV3 RTANILKPLMSPPSREEIMATLL 23 T 2.8 CaM_bind pdbhh F Eukaryota T 4eyy 1 A R Q5ZYC9_LEGPH IcmR MGNNTDDSARNPFGFYTPPRVKEIGEPDVTDATLGSVYSEIISPVKDCILTVAKAVSFNPGGKDNTDAVEVLTELNTKVERAALNQPILTTKTERMFGAAESEKSSEPPSHDERGFKLSS 120 T 8 MOSC_N pdbhh F Bacteria T 4eyz 1 A,B A,B M9MMP4_RUMFL Cellulosome-related protein module from Ruminococcus flavefaciens that resembles papain-like cysteine peptidases MASMYNSDGWYMGEAINMASLNTCAADLGKWQNFIDDYTSNDYYKGTPYIDWVFASSPKGDRWQMNEWSVSEMLKVGGTYEEGGLNXMGFVWHAIAKGLSVESGLDISQTGQYVPFSSYFNGLGLSRKCWATPGGSGGWTVFVDYYNLHYYEFPTKEEMLSSGVLQKGDIIWCVDGSVGLGMAGLRTIADNHHIGIYTGNGTSDSWWQSGPVKADGDLVNVGTDVCPIYGAAAKNTYVVLPWAKKA 246 T 0.005 Beta-lactamase unppercent F Bacteria T 4ezn 2 C,D C,D PYRRH_PYRAP Pyrrhocoricin VDKGSYLPRPTPPRPIYNRN 20 T 2.5 Apidaecin pdbhh F Eukaryota T 4ezp 2 C,D C,D APO-monomer XRPDKPRPYLPRPRPPRPVR 20 T 0.83 Apidaecin pdbhh F T 4ezr 2 B B DROS_DROME Drosocin SHPRPIRV 8 T 3.1 Antimicrobial11 unphh F Eukaryota T 4ezx 2 C,D C,D synthetic peptide NRLMLTG NRLMLTG 7 T 44 Beta-Casp pdbhh F T 4ezy 2 B B synthetic peptide NRLILTG NRLILTG 7 T 4.4 hemP pdbhh F T 4ezz 2 B B synthetic peptide ELPLVKI ELPLVKI 7 T 40 Gal_mutarotas_3 pdbhh F T 4f02 3 C,F C,F IF4G1_HUMAN EIF-4-GAMMA 1, EIF-4G 1, EIF-4G1, P220 KTIRIRDPNQGGKDITEEIMSGARTAY 27 T 4.3 TrbI_Ftype pdbhh F Eukaryota T 4f27 2 B Q FIBA_HUMAN FIBRINOPEPTIDE A, FIBRINOGEN ALPHA CHAIN ASGSSGTGSTGNQ 13 T 5.8 NAD_kinase_C pdbhh F Eukaryota T 4f87 1 A,B,C,D A,B,C,D Q7Y3F3_9CAUD PlyCB MSKINVNVENVSGVQGFLFHTDGKESYGYRAFINGVEIGIKDIETVQGFQQIIPSINISKSDVEAIRKAMKK 72 T 2.6 DUF3213 pdbhh T Viruses T 4fas 2 D,E,F D,E,F Q82V11_NITEU NE1300 SGNLESSLAPISAKDMLDYLACKDKKPTDVVKSHTEVENGKIVRVKCGDIVALVQKAREQSGDAWQGGY 69 T 0.039 DUF6488 unphh F Bacteria T 4fbw 2 C,D C,D NBS1_SCHPO DNA repair and telomere maintenance protein nbs1 GESEDDKAFEENRRLRNLGSVEYIRIMSSEKSNANSRHTSKYYSGRKNFKKFQKKASQK 59 T 0.24 Nbs1_C pdbhh F Eukaryota T 4fc9 1 A,B,C B,A,C Q3BQL2_XANC5 uncharacterized protein MGSSHHHHHHSSGRENLYFQGSATASELLLTAALERIEDTAQAMLSTVIDEERNPFLEGAPSYLPGKRPTDVTTFGQVPALRDMLAESRDLEFLQRVSDMAGPSPRIEDPSEEGLARHYTNVSNWKAQKSAHLGIVDHLGQFVYHEGSPLDVATLAKAVQMWKTRELIVHAHPQDRARFPELAVHIPEQVSDDSDSEQQTSPEPSGHQ 208 T 5.8E-05 LRR_9 unphh F Bacteria T 4fdd 2 B B FUS_HUMAN 75 KDA DNA-PAIRING PROTEIN, ONCOGENE FUS, ONCOGENE TLS, POMP75, TRANSLOCATED IN LIPOSARCOMA PROTEIN RGGGDRGGFGPGKMDSRGEHRQDRRERPY 29 T 130 Pro-NT_NN pdbhh F Eukaryota T 4ffe 1 A,B,C X,Y,Z Q8QN43_COWPX OMCP, ORTHOPOX VIRUS MHC CLASS I-LIKE PROTEIN MGHKLAFNFNLEINGSDTHSTVDVYLDDSQIITFDGKDIRPTIPFMIGDEIFLPFYKNVFSEFFSLFRRVPTSTPYEDLTYFYECDYTDNKSTFDQFYLYNGEEYTVKTQEATNKNMWLTTSEFRLKKWFDGEDCIMHLRSLVRKMEDSKR 151 T 0.078 Thioredoxin_11 unppssm T Viruses T 4fgi 2 B,D,F,H B,D,F,H Q9I2Q0_PSEAE Tsi1 MAFTQLEIVPQFGSPNMFGGEDEHVRVMFSNEDPNDDNPDAFPEPPVYLADRDSGNDCRIEDGGIWSRGGVFLSQDGRRVLMHEFSGSSAELVSYDSATCKVVHREDISGQRWAVDKDGLRLGQKCSGESVDSCAKIVKRSLAPFCQTAKK 151 T 1.3 Me-amine-dh_H pdbhh F Bacteria T 4fj3 2 C P RAF1_HUMAN PROTO-ONCOGENE C-RAF, CRAF, RAF-1 QHRYSTPHAFTFNTSSPSSEGSLSQRQRSTSTPNVH 36 T 0.23 DUF1780 pdbpssm F Eukaryota T 4fjo 2 B B POLK_MOUSE DINB PROTEIN, DINP SFFDKKRSER 10 T 0.0065 DUF4113 unphh F Eukaryota T 4fjo 4 D D REV3L_MOUSE PROTEIN REVERSIONLESS 3-LIKE, REV3-LIKE, SEIZURE-RELATED PROTEIN 4 GSFTPRTAHILKPLMSPPSREEIVATLLDH 30 T 2.2 CaM_bind pdbhh F Eukaryota T 4fmn 3 C C NTH2_YEAST NTG2 (DNA N-GLYCOSYLASE AND APURINIC OR APYRIMIDINIC LYASE) XVRSKYFKK 9 T 1.1 DUF1748 pdbhh F Eukaryota T 4fmo 3 C C EXO1_YEAST EXODEOXYRIBONUCLEASE I, EXO I, EXONUCLEASE I, PROTEIN DHS1 TRSKFFNK 8 T 1.5 Tna_leader pdbhh F Eukaryota T 4fmq 2 B B MAPK DOCKING PEPTIDE LSLSSLAASSLAKRRQQ 17 T 6 Rotavirus_VP1 pdbhh F T 4fq3 2 B B FUS_HUMAN Fusion (Involved in t(12;16) in malignant liposarcoma) GPLGSRGGRGGGDRGGFGPGKMDSRGEHRQDRRERPY 37 T 190 Pro-NT_NN pdbhh F Eukaryota T 4fqb 2 B,D,F,H B,D,F,H Q9I2Q0_PSEAE immune protein Tsi1 MADCTFTQLEIVPQFGSPNMFGGEDEHVRVMFSNEDPNDDNPDAFPEPPVYLADRDSGNDCRIEDGGIWSRGGVFLSQDGRRVLMHEFSGSSAELVSYDSATCKVVHREDISGQRWAVDKDGLRLGQKCSGESVDSCAKIVKRSLAPFCQTAKKLEHHHHHH 162 T 1.5 Me-amine-dh_H unphh F Bacteria T 4fqx 3 C E Synthetic peptide GKQNCLKLATK 11 T 2.5 DUF373 pdbhh F T 4ftg 2 C,D C,D ANXA2_HUMAN ANNEXIN II, ANNEXIN-2, CALPACTIN I HEAVY CHAIN, CALPACTIN-1 HEAVY CHAIN, CHROMOBINDIN-8, LIPOCORTIN II, PLACENTAL ANTICOAGULANT PROTEIN IV, PAP-IV, PROTEIN I, P36 XSTVHEILSKLSLEGDX 17 T 4 MJ1316 pdbhh F Eukaryota T 4ftg 3 E E AHNK_HUMAN DESMOYOKIN XGKVTFPKMKIPKFTFSGRELX 22 T 7.7 DUF5476 pdbhh F Eukaryota T 4fvd 2 B C A9XG43_9ENTO 10-mer peptide from 2A proteinase GSITTLGKFG 10 T 3 PAC3 pdbhh T Viruses T 4fvs 1 A,B,C,D,E,F A,B,C,D,E,F A6LGE3_PARD8 putative lipoprotein GQDCTFFFPQTEGTVWVRKGYDAKGNLQSVMSYQVDEVETLPSGQEVEADYVYTNPSGTIVNKGDIKAYCQNGEFFLDSKETLSYPGVVSEMNTNVDITENFINYPNPYAANFDKNNVYFDEASVKIYDKKNRKNRKDMAIKDREFIKTESITTPAGTFDCAKVKYNIATRSPKSKETITGYGYEWYSPNVGLVRTEQYDKNNVLQSYTVLEELK 215 T 0.0011 DUF3108 unppercent F Bacteria T 4g13 1 A A EMERIMICIN IV, STILBELLIN I XFXXXVGLXXPQXPXX 16 T 0.13 Pep_deformylase pdbhh F T 4g2v 2 B B FRPD1_HUMAN FERM DOMAIN-CONTAINING PROTEIN 2 ALGLLAPLRETKSTNPASRVMEMEPETMETKSVIDSRV 38 T 71 PNP_phzG_C pdbhh F Eukaryota T 4g3b 1 A,B A,B alpha4F3d GNADEXYKELEDXQERLRKXRKKLRS 26 T 0.016 SNARE pdbpssm F T 4g4m 1 A,B A,B alpha4F3(6-13) GNADEXYKEXEDXQERLRKLRKKLRSG 27 T 0.9 YggL_50S_bp pdbhh F T 4g6d 2 B B Q4Z9Y5_9CAUD ORF067 MKLKILDKDNATLNVFHRNKEHKTIDNVPTANLVDWYPLSNAYEYKLSRNGEYLELKRLRSTLPSSYGLDDNNQDIIRDNNHRCKIGYWYNPAVRKDNLKIIEKAKQYGLPIITEEYDANTVEQGFRDIGVIFQSLKTIVVTRYLEGKTEEELRIFNMKSEESQLNEALKESDFSVDLTYSDLGQIYNMLLLMKKISK 198 T 0.073 PglD_N pdbpssm T Viruses T 4g6t 2 B B Q87UE5_PSESM Type III effector HopA1 IPALKANGQLEVDGKRYEIRAADDGTISVLRPEQQSKAKSFFKGASQLIGGSSQRAQIAQALNEKVASARTVLHQSAMTGGR 82 T 0.37 Gifsy-2 pdbhh F Bacteria T 4g6u 1 A A F2WK69_ECO57 EC869 CdiA-CT MGTNQSLTFDKELSDCRKSGGNCQDIIDKWEKISDEQSAEIDQKLKDNPLEAQVIDKEVAKGGYDMTQRPGWLGNIGVEVMTSDEAKAYVQKWNGRDLTKIDVNSPEWTKFAVFASDPENQAMLVSGGLLVKDITKAAISFMSRNTATATVNASEIGMQWGQGNMKQGMPWEDYVGKSLPADARLPKNFKIFDYYDGATKTATSVKSIDTQTMAKLANPNQVYSSIKGNIDAAAKFKEYALSGRELTSSMISNREIQLAIPADTTKTQWAEINRAIEYGKSQGVKVTVTQVK 292 T 0.13 Glyco_hydro_97 unppercent F Bacteria T 4g6v 2 B,D,F,H B,D,F,H H9T8H3_BURPE CdiI MAIDLFCYLSIDRGAAESDLNKIRSNHSELFEGKFLISPVRDADFSLKEIAAEHGLVAESFFLVSLNDKNSADLIPIVSKILVDGFNGGAILILQDNEYRRTSLEHHHHHH 111 T 1.5 T3SS_TC unphh F Bacteria T 4g8i 3 C C POL_HV1B1 Gag protein KRWIIMGLNK 10 T 0.6 DUF5790 pdbhh T Viruses T 4g9j 2 C,D C,D synthetic peptide RRKRPKRKRKNARVTFAEAAEII 23 T 7.8 Consortin_C pdbhh F T 4gao 2 C,E,F,H C,E,F,H UBC12_HUMAN NEDD8-conjugating enzyme Ubc12 XIKLFSLKQQKK 12 T 4.6 DUF3637 pdbhh F Eukaryota T 4gba 2 C,D F,G UBE2F_HUMAN NEDD8 CARRIER PROTEIN UBE2F, NEDD8 PROTEIN LIGASE UBE2F, NEDD8-CONJUGATING ENZYME 2, UBIQUITIN-CONJUGATING ENZYME E2 F XLTLASKLKRDDGLKGSRTAATASD 25 T 37 Spore_YtrH pdbhh F Eukaryota T 4gbx 5 E E synthetic peptide GKQNCLKLAT 10 T 5.7 KRBA1 pdbhh F T 4geq 3 E,F E,F CNN1_YEAST CO-PURIFIED WITH NNF1 PROTEIN 1 NKDPNEVRSFLQDLSQVLARKSQGN 25 T 0.24 DivIVA pdbhh F Eukaryota T 4gfu 2 B F ERBB2_HUMAN HER2-pY1248 phosphor-peptide PEXLGLD 7 T 2.2 DUF2666 pdbhh F Eukaryota T 4ggd 2 C,D C,D BUB1B_HUMAN MAD3/BUB1-RELATED PROTEIN KINASE, HBUBR1, MITOTIC CHECKPOINT KINASE MAD3L, PROTEIN SSK1 DEWELSKENVQPLRQGRIMSTLQ 23 T 1.8 DivIC unppssm F Eukaryota T 4ggn 2 D,E,F D,E,F MYOA_PLAYO Myosin-A SLMRVQAHIRKRMVA 15 T 0.063 BORCS8 pdbhh F Eukaryota T 4ghu 2 B B MAVS_MOUSE CARDIF, MAVS, CARD ADAPTER INDUCING INTERFERON BETA, INTERFERON BETA PROMOTER STIMULATOR PROTEIN 1, IPS-1, VIRUS-INDUCED-SIGNALING ADAPTER, VISA PSCPKPVQDTQPPESPVENSE 21 T 38 rpo132 pdbhh F Eukaryota T 4gkg 1 A,B A,F DCTB_RHIME C4-dicarboxylate transport sensor protein dctB MGSSHHHHHHSSGLVPRGSHMEERLARNALEASVEERTRDLRMARDRLETEIADHRQTTEKLQAVQQ 67 T 0.91 Spectrin unp F Bacteria T 4gkn 3 C,F C,F FAT Cognate peptide FATGIGIITV 10 T 5.7 MLANA pdbhh F T 4gks 3 C,F C,F FLT Cognate peptide FLTGIGIITV 10 T 5.5 IGR pdbhh F T 4gkv 2 E P cleaved peptide fragment corresponding to the C-terminal His tag AIPNPLLGLA 10 T 3.2 Pigment_DH pdbhh F T 4glr 1 A,B A,B TAU_HUMAN phospho-peptide KKVAVVRTPPKSPSSAKC 18 T 12 DUF1067 pdbhh F Eukaryota T 4gly 2 B B BICYCLIC PEPTIDE INHIBITOR UK504 CCLGRGCENHRCLX 14 T 1.1 Ivy pdbhh F T 4gnt 2 B B MLXPL_MOUSE CHREBP, MLX INTERACTOR, MLX-INTERACTING PROTEIN-LIKE, WILLIAMS-BEUREN SYNDROME CHROMOSOMAL REGION 14 PROTEIN HOMOLOG RDKIRLNNAIWRAWYIQYVQR 21 T 0.0087 DUF1752 pdb F Eukaryota T 4gpk 2 M,N,O,P,Q,R,S,T,U,V,W,X M,N,O,P,Q,R,S,T,U,V,W,X NprX peptide SSKPDIVG 8 T 2.5 Bse634I pdbhh F T 4gq6 2 B B KMT2A_HUMAN ALL-1, CXXC-TYPE ZINC FINGER PROTEIN 7, LYSINE N-METHYLTRANSFERASE 2A, KMT2A, TRITHORAX-LIKE PROTEIN, ZINC FINGER PROTEIN HRX, MLL CLEAVAGE PRODUCT N320, N-TERMINAL CLEAVAGE PRODUCT OF 320 KDA, P320, MLL CLEAVAGE PRODUCT C180, C-TERMINAL CLEAVAGE PRODUCT OF 180 KDA, P180 SARWRFPARPGT 12 T 2.3 Xin pdbhh F Eukaryota T 4gqb 3 C C H4_HUMAN Histone H4 peptide XSGRGKGGKGLGKGGAKRHRKV 22 T 11 Shadoo unppercent F Eukaryota T 4gqz 1 A,B,C,D A,B,C,D Q8ZL99_SALTY CUEP AMASSESAFLAQHGLAGKTVEQIVDTIDQTPQSRPLPYSASITSTELKLSDGEQIYTLPLGDKFYLSFAPYEWRTHPCFNHSLSGCQGEMPNKPFTVKVTDSKGAVIVQKEMQSYRNGFIGVWLPRNMEGTLEVSYNGKTASHAIATSDDSQTCLTELPLR 161 T 0.031 DUF3244 unppssm F Bacteria T 4gur 2 B B GLYR1_HUMAN NPAC, 3-HYDROXYISOBUTYRATE DEHYDROGENASE-LIKE PROTEIN, CYTOKINE-LIKE NUCLEAR FACTOR N-PAC, GLYOXYLATE REDUCTASE 1 HOMOLOG, NUCLEAR PROTEIN NP60, NUCLEAR PROTEIN OF 60 KDA PLGSPEFSERGSKSPLKRAQEQSPRKRGRPPKDEKDLTIPESSTVKGMMAGPMAAFKWQPTASEPVKDADPHFHHFLLSQTEKPAVCYQAITKKLKICEEETGSTSIQAADSTAVNGSITPTDK 124 T 0.017 AT_hook pdb F Eukaryota T 4gvb 1 A A KP6T_UMV6 VP10 NNAFCAGFGLSCKWECWCTAHGTGNELRYATAAGCGDHLSKSYYDARAGHCLFSDDLRNQFYSHCSSLNNNMSCRSLSKR 80 T 0.3 YobH unp T Viruses T 4gvb 2 B B KP6T_UMV6 VP12.5 GKRPRPVMCQCVDTTNGGVRLDAVTRAACSIDSFIDGYYTEKDGFCRAKYSWDLFTSGQFYQACLRYSHAGTNCQPDPQYE 81 T 0.0014 DUF5948 pdb T Viruses T 4gvc 2 B B SDC1_HUMAN SYND1 TKQEEFXA 8 F F Eukaryota T 4gvd 2 C,D C,D SDC1_HUMAN SYND1 TKQEEFYA 8 F F Eukaryota T 4gw1 3 E,F E,F cQFD meditope CQFDLSTRRLKC 12 T 3.1 Flavi_NS1 pdbhh F T 4gw5 3 E,F E,F cQYN meditope CQYNLSSRALKC 12 T 1.9 DUF6464 pdbhh F T 4gxb 2 B B LYAM3_MOUSE P-selectin GASAGSSKRLRKKDDGKCPLNPHSHLGTYGVFTNAAYDPTP 41 T 0.58 Syndecan unppercent F Eukaryota T 4gxl 2 B B CACO2_HUMAN ANTIGEN NUCLEAR DOT 52 KDA PROTEIN, NUCLEAR DOMAIN 10 PROTEIN NDP52, NUCLEAR DOMAIN 10 PROTEIN 52, NUCLEAR DOT PROTEIN 52 ARQNPGLAYGNPYS 14 T 2.2 Bac_GH3_C pdbhh F Eukaryota T 4h1l 3 C,F C,F mimotope peptide QHIRCNIPKRISA 13 T 1.4 DUF3091 pdbhh F T 4h25 3 C,F C,F peptide QHIRCNIPKRIGPSKVATLVPR 22 T 6.6 SpdB pdbhh F T 4h26 3 C,F C,F peptide QWIRVNIPKRI 11 T 0.54 DUF2096 pdbhh F T 4h36 2 B B ATF2_HUMAN CAMP-DEPENDENT TRANSCRIPTION FACTOR ATF-2, ACTIVATING TRANSCRIPTION FACTOR 2, CYCLIC AMP-RESPONSIVE ELEMENT-BINDING PROTEIN 2, CREB-2, CAMP-RESPONSIVE ELEMENT-BINDING PROTEIN 2, HB16, CAMP RESPONSE ELEMENT-BINDING PROTEIN CRE-BP1 KHEMTLKF 8 T 0.0017 zf_C2H2_6 unphh F Eukaryota T 4h3b 2 B,D B,D 3BP5_HUMAN SH3BP-5, SH3 DOMAIN-BINDING PROTEIN THAT PREFERENTIALLY ASSOCIATES WITH BTK VVRPGSLDLP 10 T 0.44 DUF5748 pdbhh F Eukaryota T 4h3p 2 C,D B,E KS6A1_HUMAN S6K-ALPHA-1, 90 KDA RIBOSOMAL PROTEIN S6 KINASE 1, P90-RSK 1, P90RSK1, P90S6K, MAP KINASE-ACTIVATED PROTEIN KINASE 1A, MAPK-ACTIVATED PROTEIN KINASE 1A, MAPKAP KINASE 1A, MAPKAPK-1A, RIBOSOMAL S6 KINASE 1, RSK-1 PQLKPIEASILAARRVRKLPSTTL 24 T 7.4 COX5A pdbhh F Eukaryota T 4h3q 2 B B MP2K2_HUMAN MAP KINASE KINASE 2, MAPKK 2, ERK ACTIVATOR KINASE 2, MAPK/ERK KINASE 2, MEK 2 RRKPVLPALTINP 13 T 1.4 DHHA2 pdbhh F Eukaryota T 4h4f 3 C Q CTRC_HUMAN CALDECRIN CGVPSFPPNL 10 T 2.9 POPLD pdbhh F Eukaryota T 4h4n 1 A A A0A6L7H4C2_BACAN hypothetical protein BA_2335 SNAMEKKPIAFKVPPNSKLKVTFFGPYNEVITNVSIINQLSTPKCQTITRYPNYTKYETEVRSLSSC 67 T 2.7 PA-IIL pdbhh F Bacteria T 4h8f 1 A,B A,B CC-Hex-II-Phi22 XGEIKAIAQEIKAIAKEIKAIAXEIKAIAQGYX 33 T 0.0015 DUF2312 pdbpssm F T 4h8l 1 A,B,C,D,E,F A,B,C,D,E,F CC-Hex-D24-A5/7C XGELKCICQELKAIAWELKAIAKEDKAIAQGAGX 34 T 2.6 DUF5741 pdbhh F T 4h8m 1 A,B A,B CC-Hex-H24-A5/7C XGELKCICQELKAIAKELKAIAWEHKAIAQGX 32 T 9.4 KELAA pdbhh F T 4h8o 1 A,B,C,D,E,F A,B,C,D,E,F CC-Hex-N24 XGELKAIAQELKAIAYELKAIAKENKAIAQGX 32 T 0.083 DUF5660 pdbpssm F T 4h9n 3 C C DAXX_HUMAN DAXX, HDAXX, ETS1-ASSOCIATED PROTEIN 1, EAP1, FAS DEATH DOMAIN-ASSOCIATED PROTEIN SPRTRGSRRQIQRLEQLLALYVAEIRRLQEKELDLSELDDPDSAYLQEARLKRKLIRLFGRLCELKDCSSLTGRVIEQRIPYRGTRYPEVNRRIERLINKPGPDTFPDYGDVLRAVEKAAARHSLGLPRQQLQLMAQDAFRDVGIRLQERRHLDLIYNFGCHLTDDYRPGVDPALSDPVLARRLRENRSLAMSRLDEVISKYAMLQDKSEEG 212 T 0.0026 Syntaxin_2 pdbpercent F Eukaryota T 4h9q 3 C C DAXX_HUMAN DAXX, HDAXX, ETS1-ASSOCIATED PROTEIN 1, EAP1, FAS DEATH DOMAIN-ASSOCIATED PROTEIN SPRTRGSRRQIQRLEQLLALYVAEIRRLQEKELDLSELDDPDSAYLQAARLKRKLIRLFGRLCELKDCSSLTGRVIEQRIPYRGTRYPEVNRRIERLINKPGPDTFPDYGDVLRAVEKAAARHSLGLPRQQLQLMAQDAFRDVGIRLQERRHLDLIYNFGCHLTDDYRPGVDPALSDPVLARRLRENRSLAMSRLDEVISKYAMLQDKSEEG 212 T 0.0026 Syntaxin_2 pdbpercent F Eukaryota T 4hab 2 D,E,F D,E,F PL-49 XLHSTMX 7 T 330 Thioredoxin_16 pdbhh F T 4han 2 C,D C,D CACO2_HUMAN ANTIGEN NUCLEAR DOT 52 KDA PROTEIN, NUCLEAR DOMAIN 10 PROTEIN NDP52, NUCLEAR DOMAIN 10 PROTEIN 52, NUCLEAR DOT PROTEIN 52 PGLAYGNPYSGIQE 14 T 1.9 DUF4326 pdbhh F Eukaryota T 4hga 1 A A DAXX_HUMAN DAXX, HDAXX, ETS1-ASSOCIATED PROTEIN 1, EAP1, FAS DEATH DOMAIN-ASSOCIATED PROTEIN GPLQDPSRRQIQRLEQLLALYVAEIRRLQEKELDLSELDDPDSAYLQEARLKRKLIRLFGRLCELKDCSSLTGRVIEQRIPYRGTRYPEVNRRIERLINKPGPDTFPDYGDVLRAVEKAAARHSLGLPRQQLQLMAQDAFRDVGIRLQERRHLDLIYNFGCHLTDDYRPGVDPALSDPVLARRLRENRSLAMSRLDEVISKYAMLQDKSEEGE 213 T 0.017 Latarcin pdbpssm F Eukaryota T 4hgc 2 B I SFTI1_HELAN SFTI-1 GRCTXSIPPICFPD 14 T 0.022 Bowman-Birk_leg pdb F Eukaryota T 4hh6 2 B Z Peptide from EAEC T6SS Sci1 SciI protein KKWDSVYASLFEKINLKK 18 T 0.51 Rtf2 pdbhh F T 4hha 3 C P Q9YKI2_HCMV Glycoprotein B ETIYNTTLKY 10 T 5.1E-05 HCMVantigenic_N unphh T Viruses T 4hic 1 A,B A,B Q8L1C9_ENTFL TraK MKHHHHHHHSDYDIPTTENLYFQGSGSTNKNQPPVTPTATTASKESNQSETSGEATENSSQAVQGSSDHLLKLSAKERADEATEAFESWYKSFSNGDVILEINKELLKEGSGGTSPIELQTKLIDNLKAKFGDKVSDDFYTSLQASFNFNPVIVDGTKGLTISKQNDDESQWFSTWFLDTEKKEKNTKIIVRNDFPFEWVDWRNKGQHDEKVGKIFKNVDWDNDLSYEVIGIDFTEATKNIETNQILFVQMHYNEKIGKWQVTGNVGGVY 270 T 1.4 RCR unphh F Bacteria T 4hjb 1 A,B,C,D C,A,B,D GCN4pLI(alpha/beta/cyclic-gamma) XRMKQIEDKLEEILSKLYHIEXELXIKXLLGER 33 T 0.0073 VGPC1_C pdbhh F T 4hjd 1 A,B A,B GCN4pLI(alpha/beta/acyclic gamma) XRMKQIEDKLEEILXKLXIEXELARIKKLLYER 33 T 0.0071 VGPC1_C pdbhh F T 4hlb 1 A A B6WUJ7_9DELT Uncharacterized protein GAEQQADTVTENSDSEVFVDDSDRFTAFEEELLARYADKGIRSVDVAAYAKGIDIVFVAADRKMTRAEFSAIASRSIRELKERFGFDKDVPIGAVLDYKKDAATDTRTRFVLKLR 115 T 0.01 DUF4999 unp F Bacteria T 4hr6 1 A A U3KRF6_TRIAN SGSL, A ALPHA ANLRLSEANSGTYKTFIGRVREELGSETYRLYGIPVLKHSL 41 T 0.00035 RIP pdbhh F Eukaryota T 4hre 3 E,F,K,L G,H,K,L HLTF_HUMAN DNA-BINDING PROTEIN/PLASMINOGEN ACTIVATOR INHIBITOR 1 REGULATOR, HIP116, RING FINGER PROTEIN 80, SWI/SNF-RELATED MATRIX-ASSOCIATED ACTIN-DEPENDENT REGULATOR OF CHROMATIN SUBFAMILY A MEMBER 3, SUCROSE NONFERMENTING PROTEIN 2-LIKE 3 PRLSYPTFFPRFEF 14 T 14 LegC3_N pdbhh F Eukaryota T 4hrg 2 C,D C,D AHNK_HUMAN DESMOYOKIN QKVTFPKMKIPKFTF 15 T 5 DUF5476 pdbhh F Eukaryota T 4ht6 2 B,D,F B,D,F PAC11_YEAST WD repeat-containing protein PAC11 ITYDKGIQTDQ 11 T 0.55 SAICAR_synt unp F Eukaryota T 4hvu 2 B B SYNTHETIC PEPTIDE Acetyl-APPLPPRNRP XAPPLPPRNRP 11 T 0.21 SCIMP pdbhh F T 4hvw 2 B B SYNTHETIC PEPTIDE Acetyl-VSLARRPLPPLP XVSLARRPLPPLP 13 T 0.95 DUF4522 pdbhh F T 4hw4 2 C,D C,D Mcl-1 BH3 peptide XALETLRRVGDGVQRNHX 18 T 14 BALF1 pdbhh F T 4hx0 1 A A Q9X0A5_THEMA Putative nucleotidyltransferase TM1012 HHHHHMIRPEYLRVLRKIYDRLKNEKVNWVVTGSLSFALQGVPVEVHDIDIQTDEEGAYEIERIFSEFVSKKVRFSSTEKICSHFGELIIDGIKVEIMGDIRKRLEDGTWEDPVDLNKYKRFVETHGMKIPVLSLEYEYQAYLKLGRVEKAETLRKWLNERKG 163 T 0.00088 NTP_transf_5 unppercent F Bacteria T 4hy9 2 C,D C,D PYRRH_PYRAP Pyrrhocoricin VDKLYXXPRPTT 12 T 2.5 Apidaecin unphh F Eukaryota T 4hyb 2 C,D C,D PYRRH_PYRAP Pyrrhocoricin VDKLYXIPRPP 11 T 2.5 Apidaecin unphh F Eukaryota T 4i2w 2 B B HSP7A_CAEEL Heat shock 70 kDa protein A AGGPTIEEVD 10 T 0.49 EcsC unppssm F Eukaryota T 4i2z 2 B B HSP90_CAEEL ABNORMAL DAUER FORMATION PROTEIN 21 EDASRMEEVD 10 T 8.1 TEX12 pdbhh F Eukaryota T 4i4o 1 A,B A,B R4GRU5_BOLED BEL beta-trefoil VNFPNIPAEGVQFRLRARDTGYVIYSRTENPPLVWQYNGPPYDDQLFTLIYGTGPRKNLYAIKSVPNGRVLFSRTSASPYVGNIAGDGTYNDNWFQFIQDDNDPNSFRIYNLASDTVLYSRTTADPKFGNFTGAKYDDQLWHFELV 146 T 2.2E-05 RicinB_lectin_2 pdbpercent F Eukaryota T 4i4q 1 A A R4GRU4_BOLED BEL-beta trefoil VNFPNIPAEGVRFRLRARDSGYVIYSRTENDPLVWHYNGPPYDDQLFTLIHGTGSRLNLYAIKSVPNGRVLFSRNSASPTVGNIVGDGTYNDNWFQFIQDDNDANSFRIYSLASDSVLYSRTTGAPQFGNYTGPKFDDQLWHFEIV 146 T 0.00014 RicinB_lectin_2 pdb F Eukaryota T 4i4w 3 C C Immunogenic peptide ILAKFLHRL 9 T 5.7 SRC-1 pdbhh F T 4i4x 1 A,B,C,D A,B,C,D R4GRU9_BOLED BEL beta-trefoil VNFPNIPAEGVQFRLRARDTGYVIYSRTENPPLVWQYNGPPYDDQLFTLIYGTGPRKNLYAIKSVPNGRVLFSRTSASPYVGNIAGDGTYNDNWFQFIQDDNDPNSFRIYDLASDTVLYSRTTADPKFGNFTGAKYDDQLWHFELV 146 T 0.00012 RicinB_lectin_2 pdbpercent F Eukaryota T 4i5b 3 C,F C,F truncated hemagglutinin peptide VVKQNCLKLATK 12 T 19 Hemagglutinin pdbhh F T 4i7b 2 B,D B,D PHYL_DROME Protein phyllopod XKLRPVXMVRPTVR 14 T 0.45 RNase_PH unppercent F Eukaryota T 4i7c 2 B,D B,D PHYL_DROME Protein phyllopod XKLRPVXMVRPWVR 14 T 0.45 RNase_PH unppercent F Eukaryota T 4i7d 2 B,D B,D PHYL_DROME Protein phyllopod XKLRPVAMVRPXVR 14 T 0.45 RNase_PH unppercent F Eukaryota T 4i80 2 B B macrocyclic peptidomimetic XRWXFPARP 9 T 4.4 DUF2842 pdbhh F T 4ib5 2 D,E,F,G D,E,F,G CK2beta-derived cyclic peptide GCRLYGFKIHGCG 13 T 1.2 Speriolin_C pdbhh F T 4iea 2 B P RAF1_HUMAN PROTO-ONCOGENE C-RAF, CRAF, RAF-1 RSASEPSL 8 T 10 Baculo_LEF5_C pdbhh F Eukaryota T 4if6 2 B B SIR1_HUMAN NAD-dependent protein deacetylase sirtuin-1 GPHMGSQYLFLPPNRYIFHGAEVYSDSEDDV 31 T 13 Rxt3 pdbhh F Eukaryota T 4ifd 11 K K RRP6_YEAST RIBOSOMAL RNA-PROCESSING PROTEIN 6 RSMEATPIPSSETKADGILLETISVPQIRDVMERFSVLCNSNISKSRAKPVTNSSILLGKILPREEHDIAYSKDGLPNKVKTEDIRIRAQNFKSALANLEDIIFEIEKPLVVPVKLEEIKTVDPASAPNHSPEIDNLDDLVVLKKKNIQKKQPAKEKGVTEKDAVDYSKIPNILSNKPG 179 T 0.21 Laminin_N pdb F Eukaryota T 4igk 2 C,D C,D ATRIP_HUMAN ATM AND RAD3-RELATED-INTERACTING PROTEIN ACSPQFG 7 T 0.12 Toxin_18 pdbhh F Eukaryota T 4iik 1 A A SIDD_LEGPH DE-AMPYLASE SIDD, DEAMPYLASE SIDD, ADENYLYL-[RAB1] HYDROLASE MRSIITQICNGVLHGQSYQSGSNDLDKGNSEIFASSLFVHLNEQGKEIIKHKDSDDKIVIGYTKDGMAFQIVVDGFYGCERQAVFSFIDNYVLPLIDNFSLDLTRYPDSKKVTESLIHTIYSLRSKHAPLAEFTMSLCVTYQKDEQLFCAGFGIGDTGIAIKRNEGTIEQLVCHTEVDGFKDAFDNYSSANIDLVIERNSVFNTKVMPGDELVGYTYVPPMLEMTEKEFEVETVDGKKINKRIVRHLNLDPGNFDDKDPLFSQLLQVVKSKQKQLVEQAKETGQIQRFGDDFTVGRLVIPDQLLINQLRIHALSHHHHH 319 T 0.18 Glycos_trans_3N pdbpercent F Bacteria T 4iim 2 B,D,E C,D,E peptide ligand WRDSSGYVMGPW 12 T 1 Galanin pdbhh F T 4iio 2 C C Synthetic Peptide XWRGSLSYLKGPL 13 T 0.56 CRPV_capsid pdbhh F T 4iip 1 A A SIDD_LEGPH DE-AMPYLASE SIDD, DEAMPYLASE SIDD, ADENYLYL-[RAB1] HYDROLASE MRSIITQICNGVLHGQSYQSGSNDLDKGNSEIFASSLFVHLNEQGKEIIKHKDSDDKIVIGYTKDGMAFQIVVAGFYGCERQAVFSFIDNYVLPLIDNFSLDLTRYPDSKKVTESLIHTIYSLRSKHAPLAEFTMSLCVTYQKDEQLFCAGFGIGDTGIAIKRNEGTIEQLVCHTEVDGFKDAFDNYSSANIDLVIERNSVFNTKVMPGDELVGYTYVPPMLEMTEKEFEVETVDGKKINKRIVRHLNLDPGNFDDKDPLFSQLLQVVKSKQKQLVEQAKETGQIQRFGDDFTVGRLVIPDQLLINQLRIHALSHHHHH 319 T 0.17 PP2C_2 pdbpssm F Bacteria T 4ijy 1 A A Q93I65_ECOLX CofJ SPSSSEGGAFTVNMPKTSTVDDIRGCPTLETPLKLTFTEDIQPRKENGSTYFYYDGWRGVGQTVNPWSPVLDNHKYAATEHEIHIYVEFFQTPSNRFADKNGAYSYIDANGVMYTNGEYSWEHVPALGKNIYKVVISDWNKGQTKSIYLPGRDFKTVEVFHFQNNRPQWDDRNSYENVKSRINNNISKSYSKAKLNEQLSTYVHDDGTDSLFLYQKLSRASLKESQINYYQLRGKFNGVNLGYWAQEYILFGGEGAEQLKNKIPDMSNYSMEDNGSFKNALKIESLDLRLMDNNRMAYGSTGTYIASFNRTDFSMTPENLKACGLD 326 T 9.5 DUF4999 unphh F Bacteria T 4ika 2 B D B2ZUN0_9ENTO VPg GAYSGAPKQVLKKPALRTATVQ 22 T 6.7 DUF2111 pdbhh T Viruses T 4il7 1 A A Q6Q0L4_9VIRU HYPOTHETICAL PROTEIN A223 MSHEGLSPIPGEGTGIQLSAGQILKFYNVPIAEIIVEYDPSNVSGVSSNVKLKGTIHPLFEVPSQISIENFQPTENYLIYSGFGTSLPQTYTIPANGYLIISITNTSTGNIGQITLTIGSTTMTFNLQTGENKIPVIAGTQITNMTLTSSSAILIYEEVIHHHHHH 166 T 14 DUF1989 pdbhh T Viruses T 4ind 1 A,B,C,D,E,F,G,H,I,J,K,L,M,N,O,P,Q,R,S,T A,B,C,D,E,F,G,H,I,J,K,L,M,N,O,P,Q,R,S,T Q6Q0L3_9VIRU C381 turret protein MSVTTLGQSFPANAKVKYYYKLSEKQDLDAFVNSIFVGSYKLKQISYLLYGNTKIVSAPVVPLGPNASIIIDDELQEGLYLIRIKVYNTNSFSVTVTPFFNNNNTMTYSIGANSEFEIYDIFTKEQGNIYYIQLPPGLAILEFSLERVFEKGNRINIPKIIHTSGNGYISFRLRKGTYAIKMPYSYNNTTSTTFTNFQFGTISTSVATIPLVISSIPANGSGSGTFLVYLKITGDYEDVKFSVTYGGGLGVPFTFGLEVEEINELVENTNFVTQSVTLSGSQVTQSILNVQGSGSHLRLKYASVSGLTTAVTQCQLQATNLNRSTTYSTVWDFIAGGSSTPPSWDIREINSIQLVANGGSSTSSVTITLILVYEQIAGELSHHHHHH 387 T 0.2 CBM_48 pdb T Viruses T 4ip3 1 A A Q8VSD5_SHIFL ORF169b GSMINGVSLQGTAGYEAHTEEGNVNVKKLLESLNSKSLGDMDKDSELAATLQKMINPSGGDGNASGCALHACMAMLGYGVREAPVPNEISEYMTGFFHRHLEQIDSEGIVSHPNETYSKFRERIAENILQNTSKGSVVMISIEQATHWIAGFNDGEKIMFLDVQTGKGFNLYDPVEKSPDAFVDENSSVQVIHVSDQEFDHYANSSSWKSKRLC 214 T 0.13 Gln_amidase unppercent F Bacteria T 4iqj 5 M,N,O,P M,N,O,P A0A0M9ACL9_THEAQ DNA polymerase III subunit gamma/tau HHHHHHKAGEAQDLAEGWRAFLEALKPTLRAFVREARPHLEGKTLVLRFPESKAFHHKKAEEQKAHLLPLARAQFGVEELAFVLEKKSLSGASPPPPTKPVPPREAPPPVAAPPPEPEPPLEDPPWEAEEGEDPSEELRRLARLLGGRLLWVRKPKAPEAEEPVSEDGIGGNGIMPP 177 T 0.00029 DNA_pol3_a_NII unppssm F Bacteria T 4irv 2 E,F,G,H E,F,G,H ASPP2_HUMAN BCL2-BINDING PROTEIN, BBP, RENAL CARCINOMA ANTIGEN NY-REN-51, TUMOR SUPPRESSOR P53-BINDING PROTEIN 2, 53BP2, P53-BINDING PROTEIN 2, P53BP2 GPKLASNAPRPLKKRSSITEPEGPNGPNIQKLLYQRTTIAAMETISVPSYPSKSASVTASSE 62 T 0.16 Dermcidin pdb F Eukaryota T 4is6 3 C C PMEL_HUMAN ME20-M, ME20M, MELANOCYTE PROTEIN PMEL 17, MELANOCYTES LINEAGE-SPECIFIC ANTIGEN GP100, MELANOMA-ASSOCIATED ME20 ANTIGEN, P1, P100, PREMELANOSOME PROTEIN, SILVER LOCUS PROTEIN HOMOLOG WNRQLYPEWTEAQRLD 16 T 0.96 Rv0078B pdbhh F Eukaryota T 4isq 2 D,E,F D,E,F SYT1_HUMAN SYNAPTOTAGMIN I, SYTI, P65 GEGKEDAFSKLKEKFMNELHK 21 T 0.01 PRIMA1 unphh F Eukaryota T 4isr 2 D,E,F D,E,F SYT2_RAT SYNAPTOTAGMIN II, SYTII GESQEDMFAKLKDKFFNEINK 21 T 0.027 DUF4713 unphh F Eukaryota T 4ivh 1 A A cyclo[Gln-Lys-Leu-Val-Phe-Phe-Ala-Glu-Asp-(delta-linked-Orn)-Hao-Lys-Hao-(p-bromoPhe)-Thr-(delta-linked-Orn)] TXQKLVFFAEDXXKXX 16 T 0.47 Beta-APP pdbhh F T 4j24 2 E,F,G,H K,I,J,L 19-mer peptide LTARHPLLLRHLLQNSPSD 19 T 3.5 T4_Gp59_C pdbhh F T 4j26 2 B,D I,J 12-mer Peptide HPLLMRLLHHPS 12 T 2.9 HEAT pdbhh F T 4j2c 2 B,D B,D VPS51_HUMAN ANOTHER NEW GENE 2 PROTEIN, PROTEIN FAT-FREE HOMOLOG AHGMLKLYYGLSEGEAA 17 T 1.7 Ribosomal_S4 pdbhh F Eukaryota T 4j2j 2 D,E,F D,E,F CIC_HUMAN Protein capicua homolog EPRSVAVFPWHSLVPFLAPSQ 21 T 2.6 DUF5988 pdbhh F Eukaryota T 4j2l 2 C,D C,D CIC_HUMAN Protein capicua homolog MFVWTNVEPRSVAVFPWHSLVPFLAPSQ 28 T 2.3 DUF2605 pdbhh F Eukaryota T 4j2x 2 B,D B,D FHL-1, KYOT, RBP-ASSOCIATED MOLECULE 14-1, RAM14-1, SKELETAL MUSCLE LIM-PROTEIN 1, SLIM, SLIM-1 SGLVKAPVWWPMKDNPGTTTASTAKNAP 28 T 10 QH-AmDH_gamma pdbhh F T 4j73 2 B B TMED9_BOVIN P25, P24 FAMILY PROTEIN ALPHA-2, P24ALPHA2 FEAKKLV 7 T 130 WSK pdbhh F Eukaryota T 4j7b 3 C,F C,F MA205_DROME 205 kDa microtubule-associated protein MGHHHHHHLDDLVAESPRKEFARINMDGIAVPDEREFDIEADMRPHELEQESDTFGAG 58 T 4 BNIP2 pdbhh F Eukaryota T 4j7o 1 A A SCA2_RICCN Putative surface cell antigen sca2 ASFKDLVSKTPAWEKHNSTQQQNIWKDLTPNEKIKKWQEAALVPSFTQAQNDLGIKYKETDLSSFLDNTRHKARQARAEILLYIERVKQQDFDTKKQAYINQGVVPTDIEAATNLGISYDPSKIDNNVEHDQKVRRAEKDKKAVIELYVSSINRGIKYKHYVDNDIIPEIQEVRTALNMNKDDAQSFVASIRTEIMENAKGQYIADSHIPTEKELKKKFGISRDDNRDGYIKSIRLKVMDKEKPQYIADSHIPTEKELEQKFGADKGEATNYIASIATQMMLDKKSYYIDNNIIPNADELMNEFKIGPVKATSYINQIRAGIEANQFLNNNDTTKPSTGRSQKKSGSKNDHWYMSNQSINNTGTSAR 367 T 0.11 GntR pdb F Bacteria T 4j83 2 B B TAF10_HUMAN STAF28, TRANSCRIPTION INITIATION FACTOR TFIID 30 KDA SUBUNIT, TAF(II)30, TAFII-30, TAFII30 XSKSADRKYTL 11 T 0.0084 TFIID-31kDa unphh F Eukaryota T 4j8g 2 C,D C,D membrane glycoprotein E3 gp19K AASFIDAKKMP 11 T 21 Adeno_GP19K pdbhh F T 4j8s 2 B B TTP_HUMAN TTP, G0/G1 SWITCH REGULATORY PROTEIN 24, GROWTH FACTOR-INDUCIBLE NUCLEAR PROTEIN NUP475, PROTEIN TIS11A, TIS11, ZINC FINGER PROTEIN 36 HOMOLOG, ZFP-36 APRRLPIFNRISVSE 15 T 6.2 Hormone_recep pdbhh F Eukaryota T 4j9c 2 B B P17 XAPTYSPPLPP 11 T 6.8 TAF8_C pdbhh F T 4j9d 2 B,D,F B,D,F 3BP1_HUMAN P0 XAPTYPPPLPP 11 T 1.1 HPS6 pdbhh F Eukaryota T 4jaa 2 B S CONSENSUS ANKYRIN REPEAT DOMAIN-(d)LEU HLEVVKLLLEHGADVXAQDK 20 T 0.00016 Ank pdb F T 4jdh 2 B B Paktide T GGRRRRRTWYFGGGK 15 T 1.1 Microvir_J pdbhh F T 4jdt 1 A G Q0ED31_9HIV1 gp120 VWKDADTTLFCASDAKAHETECHNVWATHACVPTDPNPQEIHLENVTENFNMWKNNMVEQMQEDVISLWDQCLQPCVKLTGGSVIKQACPKISFDPIPIHYCTPAGYVILKCNDKNFNGTGPCKNVSSVQCTHGIKPVVSTQLLLNGSLAEEEIIIRSENLTNNAKTIIVHLNKSVEINCTRPSNGGSGSGGDIRKAYCEINGTKWNKVLKQVTEKLKEHFNNKTIIFQPPSGGDLEITMHHFNCRGEFFYCNTTQLFNNTCIGNETMKGCNGTITLPCKIKQIINMWQGTGQAMYAPPIDGKINCVSNITGILLTRDGGANNTSNETFRPGGGNIKDNWRSELYKYKVVQIEGSHHHHHH 361 T 1.7E-34 GP120 unp T Viruses T 4jdw 1 A A GATM_HUMAN TRANSAMIDINASE, AT38 MLRVRCLRGGSRGAEAVHYIGSRLGRTLTGWVQRTFQSTQAATASSRNSCAADDKATEPLPKDCPVSSYNEWDPLEEVIVGRAENACVPPFTIEVKANTYEKYWPFYQKQGGHYFPKDHLKKAVAEIEEMCNILKTEGVTVRRPDPIDWSLKYKTPDFESTGLYSAMPRDILIVVGNEIIEAPMAWRSRFFEYRAYRSIIKDYFHRGAKWTTAPKPTMADELYNQDYPIHSVEDRHKLAAQGKFVTTEFEPCFDAADFIRAGRDIFAQRSQVTNYLGIEWMRRHLAPDYRVHIISFKDPNPMHIDATFNIIGPGIVLSNPDRPCHQIDLFKKAGWTIITPPTPIIPDDHPLWMSSKWLSMNVLMLDEKRVMVDANEVPIQKMFEKLGITTIKVNIRNANSLGGGFHAWTCDVRRRGTLQSYLD 423 T 0.014 ADI pdbpercent F Eukaryota T 4jfd 3 C C Melanoma peptide ELAAIGILTV 10 T 6 DUF3527 pdbhh F T 4jfe 3 C C Melanoma peptide L7A ELAGIGALTV 10 T 4.6 MLANA pdbhh F T 4jfo 3 C,F C,F E1A heteroclitic Melanoma peptide ALAGIGILTV 10 T 2.5 MLANA pdbhh F T 4jfq 3 C,F C,F L8A heteroclitic Melanoma peptide ELAGIGIATV 10 T 1.5 MLANA pdbhh F T 4jfx 3 E P Phosphopeptide GEKKGNYVVTXA 12 T 1.1 MFA1_2 pdbhh F T 4jfz 3 C P Phosphopeptide GEKKGNYVVTSH 12 T 0.78 MFA1_2 pdbhh F T 4jg1 3 C P Phosphopeptide GEKKGNYVVTTH 12 T 0.64 MFA1_2 pdbhh F T 4jgl 1 A A hypothetical protein GGAKKNVQDAEGQAEAGGNAPSGYLMPAISANNFCGDFTTMTPDYGYLMPEKGLFLKMHDIRGAYGINIYTYVMDGDNIQCTPGHFVMIVPRGGDKLEITIKKSSMKNTPSFTFIPTPDCENSAYVATEKVAGKYYYLCGDAEARYKFEDLFEDERCAEFKNLVDNYGK 169 T 2.2 Surp pdbhh F T 4jhj 2 C,D C,D Q7ZU28_DANRE DEAD/H (Asp-Glu-Ala-Asp/His) box polypeptide 19 (DBP5 homolog, yeast) MATDSWAQAVDEQEAAAESISTLQISEKEEKP 32 T 0.028 FliN_N pdb F Eukaryota T 4jhk 2 B C Q7SXI8_DANRE Sb:cb157 protein PLGSMSRIKNWGDEVEEQEMRT 22 T 0.24 Meiosis_expr pdbhh F Eukaryota T 4jij 1 A,C P,Q fluorogenic peptidic substrate (8MC)PLG(PHI)(DNW)AR(NH2) XPLGXXARX 9 T 11 FYRN pdbhh F T 4jiz 2 B B phosphopeptide YHSVVRYA 8 T 2.9 BioT2 pdbhh F T 4jk5 2 B B bicyclic peptide UK18-D-Ser, uPA inhibitor ACSRYEVDCRGRXSACGX 18 T 7.1 Kp4 pdbhh F T 4jl0 2 C,D C,D Q840U9_PSEAI PopB TGVALTPPS 9 T 0.7 GluR_Homer-bdg pdbhh F Bacteria T 4jlq 2 B B NAB2_YEAST Nuclear polyadenylated RNA-binding protein NAB2 RFTQRGGGAVGKNRRGGRGGNRGGRNNNSTRFNPLAKA 38 T 0.0034 DDHD unp F Eukaryota T 4jlu 2 B B F175A_HUMAN COILED-COIL DOMAIN-CONTAINING PROTEIN 98, PROTEIN FAM175A GFGEYSRSPTF 11 T 0.48 PipA pdbhh F Eukaryota T 4jm1 1 A A A6LA98_PARD8 hypothetical protein GADKYIDTITGFSCEKAAVTDNGFLVIAIDADSDSGYDMLASQFLEEAKKEGVSGLKGVLIVDIKNAKFEQGAVVGKRIGKAYK 84 T 0.3 DUF6503 unphh F Bacteria T 4jmg 2 B B PTN11_HUMAN PROTEIN-TYROSINE PHOSPHATASE 1D, PTP-1D, PROTEIN-TYROSINE PHOSPHATASE 2C, PTP-2C, SH-PTP2, SHP-2, SHP2, SH-PTP3 DSARVXENVGLMQ 13 T 1.6 CSM2 pdbhh F Eukaryota T 4jmh 2 B B SHC1_HUMAN SHC-TRANSFORMING PROTEIN 3, SHC-TRANSFORMING PROTEIN A, SRC HOMOLOGY 2 DOMAIN-CONTAINING-TRANSFORMING PROTEIN C1, SH2 DOMAIN PROTEIN C1 PPDHQXXNDFPGK 13 T 4.8 Herpes_TK_C pdbhh F Eukaryota T 4jo6 2 E,F Y,Z SBP-Tag MDEKTTGWRGGHVVEGLAGELEQLRARLEHHPQGQREP 38 T 2.2 BLUF pdbhh F T 4joe 2 C,D C,D A-iCAL36 peptide ANSRAPTSII 10 T 17 PMBR pdbhh F T 4jof 2 C,D C,D L-iCAL36 peptide ANSRLPTSII 10 T 5 PMBR pdbhh F T 4jog 2 C,D C,D V-iCAL36 peptide ANSRVPTSII 10 T 5.2 DUF2570 pdbhh F T 4joh 2 C,D C,D H-iCAL36 peptide ANSRHPTSII 10 T 13 TssN pdbhh F T 4joj 2 C,D C,D F-iCAL36 peptide ANSRFPTSII 10 T 3.4 LemA pdbhh F T 4jok 2 C,D C,D Y-iCAL36 peptide ANSRYPTSII 10 T 11 C9orf72-like pdbhh F T 4jol 2 E,F,G,H E,F,G,H HTF4_HUMAN TCF-12, CLASS B BASIC HELIX-LOOP-HELIX PROTEIN 20, BHLHB20, DNA-BINDING PROTEIN HTF4, E-BOX-BINDING PROTEIN, TRANSCRIPTION FACTOR HTF-4 SPLQAKKVRKVPPGLPSSVYAPSPN 25 T 33 Mvb12 pdbhh F Eukaryota T 4jor 2 C,D C,D VE6_HPV18 Protein E6 RLQRRRETQV 10 T 0.19 Mu-like_Com unphh T Viruses T 4jqi 4 D V V2R_HUMAN Vasopressin V2 receptor phosphopeptide ARGRTPPSLGPQDESCTTASSSLAKDTSS 29 T 21 DUF6352 pdbhh F Eukaryota T 4jqv 3 C B BZLF1_EBVG EB1, ZEBRA SELEIKRY 8 T 0.0044 bZIP_2 unppercent T Viruses T 4jqx 3 C B BZLF1_EBVG EB1, ZEBRA EECDSELEIKRY 12 T 0.0044 bZIP_2 unppercent T Viruses T 4js0 2 B B BAIP2_HUMAN BAI-ASSOCIATED PROTEIN 2, BAI1-ASSOCIATED PROTEIN 2, PROTEIN BAP2, FAS LIGAND-ASSOCIATED FACTOR 3, FLAF3, INSULIN RECEPTOR SUBSTRATE P53/P58, IRS-58, IRSP53/58, INSULIN RECEPTOR SUBSTRATE PROTEIN OF 53 KDA, IRSP53, INSULIN RECEPTOR SUBSTRATE P53 ASKSNLVISDPIPGAKPLPVPPELAPFVGRMS 32 T 2.9 DUF6248 pdbhh F Eukaryota T 4jtm 1 A,B A,B E3PJ86_ECOH1 Type II secretion system protein D GAMATFTANFKDTDLKSFIETVGANLNKTIIMGPGVQGKVSIRTMTPLNERQYYQLFLNLLEAQGYAVVPMENDVLKVVKS 81 T 0.14 Corona_NS2A pdbpercent F Bacteria T 4jwd 2 C,D C,D CTHL3_BOVIN BACTENECIN-7, BAC7, PR-59 PRPLPFPRPGPRPI 14 T 0.027 TonB_N unppercent F Eukaryota T 4jwi 2 C,D C,D CTHL3_SHEEP BACTENECIN-7, BAC7, PR-59 PRPILLPWRX 10 T 0.025 Trypan_PARP unp F Eukaryota T 4k0u 2 B B GSPD2_DICD3 T2SS PROTEIN D, GENERAL SECRETION PATHWAY PROTEIN D, PECTIC ENZYMES SECRETION PROTEIN OUTD RTFRQVQSSISDFYD 15 T 0.96 DUF643 pdbhh F Bacteria T 4k1e 2 B B SFTI1_HELAN SFTI-1 GFCQRSIPPICFPD 14 T 0.052 Bowman-Birk_leg unp F Eukaryota T 4k38 2 C,D C,D Kp18Cys peptide YYTSPMCAPARSMLLTGN 18 T 0.0024 Alk_phosphatase pdbhh F T 4k39 2 C,D D,C Cp18Cys peptide YTAVPSCIPSRASILTGM 18 T 0.00017 Alk_phosphatase pdbhh F T 4k3o 2 C E (ACE)QADLF XQADLF 6 T 81 Zn_peptidase pdbhh F T 4k3q 2 C E (ACE)QLDAF XQLDAF 6 T 61 DUF565 pdbhh F T 4k45 2 B B PLCG1_RAT PHOSPHOINOSITIDE PHOSPHOLIPASE C-GAMMA-1, PHOSPHOLIPASE C-GAMMA-1, PLC-GAMMA-1 DYGALYEGRNPGFXVEAN 18 T 37 DUF4207 pdbhh F Eukaryota T 4k6y 2 C,D C,D iCAL36-Q peptide ANSRWQTSII 10 T 0.093 ENOD40 pdbhh F T 4k72 2 C,D C,D iCAL36-VQD peptide ANSRVQDSII 10 T 4.2 DUF5608 pdbhh F T 4k75 2 B B iCAL36-QDTRL peptide ANSRWQDTRL 10 T 5.9 Glyco_transf_8C pdbhh F T 4k76 2 E,F,G,H E,F,G,H iCAL36-TRL peptide ANSRWPTTRL 10 T 9.3 CBP_BcsR pdbhh F T 4k7h 1 A,B,C,D,E A,B,C,D,E P1_BPPH6 Major inner protein P1 GFNLKVKDLNGSARGLTQAFAIGELKNQLSVGALQLPLQFTRTFSASMTSELLWEVGKGNIDPVMYARLFFQYAQAGGALSVDELVNQFTEYHQSTACNPEIWRKLTAYITGSSNRAIKADAVGKVPPTAILEQLRTLAPSEHELFHHITTDFVCHVLSPLGFILPDAAYVYRVGRTATYPNFYALVDCVRASDLRRMLTALSSVDSKMLQATFKAKGALAPALISQHLANAATTAFERSRGNFDANAVVSSVLTILGRLWSPSTPKELDPSARLRNTNGIDQLRSNLALFIAYQDMVKQRGRAEVIFSDEELSSTIIPWFIEAMSEVSPFKLRPINETTSYIGQTSAIDHMGQPSHVVVYEDWQFAKEITAFTPVKLANNSNQRFLDVEPGISDRMSATLAPIGNTFAVSAFVKNRTAVYEAVSQRGTVNSNGAEMTLGFPSVVERDYALDRDPMVAIAALRTGIVDESLEARASNDLKRSMFNYYAAVMHYAVAHNPEVVVSEHQGVAAEQGSLYLVWNVRTELRIPVGYNAIEGGSIRTPEPLEAIAYNKPIQPSEVLQAKVLDLANHTTSIHIWPWHEASTEFAYEDAYSVTIRNKRYTAEVKEFELLGLGQRRERVRILKPTVAHAIIQMWYSWFVEDDRTLAAARRTSRDDAEKLAIDGRRMQNAVTLLRKIEMIGTTGIGASAVHLAQSRIVDQMAGRGLIDDSSDLHVGINRHRIRIWAGLAVLQMMGLLSRSEAEALTKVLGDSNALGMVVATTDIDPSLHHHHHH 775 T 0.22 STAG pdb T Viruses T 4k7t 1 A A bacitracin A2 ICLXIKXIXHXN 12 T 1.2 DUF4092 pdbhh F T 4ka3 2 B B TAB1_HUMAN TGF-beta-activated kinase 1 and MAP3K7-binding protein 1 SSAQSTSKTSVTLSLVMPSQGLEHHHHHH 29 T 7.6 SCIFF pdbhh F Eukaryota T 4kbb 2 C,D C,D SYT2_MOUSE SYNAPTOTAGMIN II, SYTII EGWTENQEPNVAPATTTATMPLAPVAPADNSTESTGPGESQEDMFAKLKEKFFNEINKIVLEHHHHHH 68 T 0.0051 PRIMA1 unphh F Eukaryota T 4kdi 2 C,D C,D OTU1_YEAST OTU DOMAIN-CONTAINING PROTEIN 1 GSHMASMTGGQQMGRGSMKLKVTGAGINQVVTLKQDATLNDLIEHINVDVKTMRFGYPPQRINLQGEDASLGQTQLDELGINSGEKITIE 90 T 0.00015 UBX pdbhh F Eukaryota T 4ke2 1 A,B,C A,B,C ANPM_PSEAM Type I hyperactive antifreeze protein MNIDPAARAAAAAAASKAAVTAADAAAAAATIAASAASVAAATAADDAAASIATINAASAAAKSIAAAAAMAAKDTAAAAASAAAAAVASAAKALETINVKAAYAAATTANTAAAAAAATATTAAAAAAAKATIDNAAAAKAAAVATAVSDAAATAATAAAVAAATLEAAAAKAAATAVSAAAAAAAAAIAFAAAP 196 T 72 NADH_dh_m_C1 unphh F Eukaryota T 4kel 2 B B SFTI1_HELAN SFTI-1 GFCQRSIPPICFPN 14 T 0.051 Bowman-Birk_leg pdb F Eukaryota T 4kkp 1 A,B A,B RbmA protein EVDCELQPVIEANLSLNQNQLASNGGYISSQLGIRNESCETVKFKYWLSIKGPEGIYFPAKAVVGVDTAQQESDALTDGRMLNVTRGFWVPEYMADGKYTVSLQVVAENGKVFKANQEFVKGVDLNSLPELNGLTIDIKNQFGINSVESTGGFVPFTVDLNNGREGEANVEFWMTAVGPDGLIIPVNAREKWVIASGDTYSKVRGINFDKSYPAGEYTINAQVVDIVSGERVEQSMTVVKK 241 T 0.014 BsuPI pdbhh F T 4kmd 2 B B GLI1_HUMAN GLIOMA-ASSOCIATED ONCOGENE, ONCOGENE GLI SRCTSPGGSYGHLSIGT 17 T 1.6 HMMR_N pdbhh F Eukaryota T 4kp3 3 C,F E,F MELPH_MOUSE EXOPHILIN-3, LEADEN PROTEIN, SLP HOMOLOG LACKING C2 DOMAINS A, SLAC2-A, SYNAPTOTAGMIN-LIKE PROTEIN 2A GPGSDLDTEARDQPLNSKKKKRLLSFRDVDFEEDSDHLVQPCS 43 T 5.3 GLTSCR1 pdbhh F Eukaryota T 4ksn 1 A,B,C,D A,B,C,D Q5ZSX5_LEGPH SdbC GNSDGQLDTHLADLYLLKYDTGLGVYESFICKYLEDSNDYIASHPQKLSLDEMPRPLESETVSLRQLIVSVLPSRPSI 78 T 1.2 DUF3213 pdbhh F Bacteria T 4kt3 2 B B Q4KC91_PSEF5 Putative lipoprotein GSHMATDSLQPARIKDSGLTREQAEQVLRVALKHQDYQLQRPGVFIDGDLQDENGKPPHPGYYDFSLGYNDPKAGATEYWGLFSVSLNTGDTWEINSCKRLDGAELRALQRRVMARTGKSLADEKSQREGLGCEDQQ 137 T 5.2 DUF4969 unphh F Bacteria T 4kv1 2 B,D C,D TF65_HUMAN Rel peptide TFXSIMK 7 T 7.6 Adenylate_cycl pdbhh F Eukaryota T 4kvt 1 A,B,C,D,E,F A,B,C,D,E,F 6-helix coiled coil CC-Hex-L24C peptide XGELKAIAQELKAIAKELKAIAWECKAIAQGAG 33 T 0.92 Rho_N pdb F T 4kxq 2 B B SIR1_HUMAN SIRT1, HSIRT1, REGULATORY PROTEIN SIR2 HOMOLOG 1, SIR2-LIKE PROTEIN 1, HSIR2 GPHMGSQYLFLPPNRYIFHGAEVYSDSEDV 30 T 4.1 Rxt3 pdbhh F Eukaryota T 4l0k 1 A,B,C,D A,B,C,D A0A067XG67_9DEIO DraIII MELCHKTVKSRTAYSKHFPHKCQLPLGHSGKCLEFPFLVSLSKTHPRIAAKIVRDATMTTGAAWKSSQAGPNRMPRYVAILDDDILLEKFNLDMQSLPEITRLKIREKAADYDSCIDVARKLTWLAYQLHGAPIPDSFTKNYLEEFFGPMVAGSTNCEICKLPLTIDLFSENRVGKAAVETAHKTPRLHNAENVGFAHRFCNVAQGNKSLDEFYLWMEEVLTRVKML 227 T 2.9E-05 RE_BstXI pdbhh F Bacteria T 4l1u 2 G,H,I,J G,H,I,J SPT5H_HUMAN HSPT5, DRB SENSITIVITY-INDUCING FACTOR 160 KDA SUBUNIT, DSIF P160, DRB SENSITIVITY-INDUCING FACTOR LARGE SUBUNIT, DSIF LARGE SUBUNIT, TAT-COTRANSACTIVATOR 1 PROTEIN, TAT-CT1 PROTEIN YGSGSRTPMYGSQ 13 T 0.063 CTD unphh F Eukaryota T 4l29 3 AA,C,DA,F,GA,I,JA,L,MA,O,PA,R,U,X p,m,o,i,c,k,g,f,q,l,j,h,e,n NY-ESO1 DOUBLE MUTANT (1Y, 9V) YLLMWITQV 9 T 5.7 Thyroglob_assoc pdbhh F T 4l3o 2 E,F,G,H E,F,G,H cyclic peptide S2iL5 GYHTYHVXRRTNYYCX 16 T 11 UCH_C pdbhh F T 4l5n 2 C,D,E,F C,D,E,F P56_BPPZA Early protein GP1B VQNDFLDSYDVTMLLQDDNGKQYYEYHKGLSLSDFEVLYGNTVDEIIKLRVDKIS 55 T 9 DUF2603 pdbhh T Viruses T 4l8b 3 C C NP-N5H peptide ASNEHMETM 9 T 21 YgaB pdbhh F T 4l8c 3 I,J,K,L I,J,K,L NP-N3D peptide ASDENMETM 9 T 22 YpmT pdbhh F T 4l8d 3 E,F E,F NP-N5D peptide ASNEDMETM 9 T 6.4 Lsm_interact pdbhh F T 4lcd 2 C,D C,D SNA3_YEAST Protein SNA3 AQPPAYDEDDEAGADVPLMDNAQQ 24 T 14 TMEM252 unphh F Eukaryota T 4lg6 2 B B CCDC8_HUMAN Coiled-coil domain-containing protein 8 RAFWHTPRLPTLPKRVP 17 T 6.1 RGS_DHEX pdbhh F Eukaryota T 4li3 2 B A CYSE_SALTY Serine acetyltransferase TFEYGDGI 8 T 2.1 Cyanate_lyase pdbhh F Bacteria T 4lkl 2 B B PL-55 XPLHSTMX 8 T 140 Aminopep pdbhh F T 4lkm 2 B,D B,D PL-74 XXPLHSTMX 9 T 170 Aminopep pdbhh F T 4lkx 3 C R CemX segment LAGGSAQSQRAPDR 14 T 1.3 DUF6032 pdbhh F T 4loo 2 B B TAB1_MOUSE MITOGEN-ACTIVATED PROTEIN KINASE KINASE KINASE 7-INTERACTING PROTEIN 1, TGF-BETA-ACTIVATED KINASE 1-BINDING PROTEIN 1, TAK1-BINDING PROTEIN 1 RVYPVSVPYSSAQSTSKTSVTLSLVMPSQ 29 T 6 DUF2584 pdbhh F Eukaryota T 4lp9 2 B I Ser-Leu-Phe-His-Phenylalanyl-reduced-peptide-bond-Tyrosyl-Thr-Pro SLFHXTP 7 T 12 FCP1_C pdbhh F T 4lr4 1 A,B,C,D A,B,C,D C4ZEB7_AGARV hypothetical protein GASIDNGNKVHFNTEDNDTDLTLLQSKIATEEVTCDFTDATNDGASAYADTRRVSNKYMWSASTMEYNFSDQKWTSNTEIFSTYAKTSEGFVMSGFLLNPKGQSNYNSALREGYLNDSAYDENQGHYYQCVVSDEDCNNITFMLESNVNVFIFDNDINLIYRSSDEAGVTSYFDRYYSTTKTIAGTSNKVISLGLIDGNYYIVFKVKDATATTGYHYGYYAGQPLPIAQTTTFSDLTHYTTIKWNRSSSSQSASTQTLTINCPSGSEDEYALTGVKFSDKSKAFANNTYASSIDYYYTPATASYSKKLAQTGGWWSDLVDNNPPSGSIDGNYATSVTVHWVSGISYVNASCTTMTQMTLDYLVPFGIIVG 370 T 0.15 DUF1684 unp F Bacteria T 4lsj 2 B B D30 peptide HSSRLWELLMEAT 13 T 1.6 CemA pdbhh F T 4luq 1 A,B A,B Q9HYC5_PSEAE VIRULENCE EFFECTOR TSE3 MGSSHHHHHHSSGLVPRGSHMTATSDLIESLISYSWDDWQVTRQEARRVIAAIRNDNVPDATIAALDKSGSLIKLFQRVGPPELARSLIASIAGRTTMQRYQARNALIRSLINNPLGTQTDNWIYFPTITFFDICADLADAAGRLGFAAAGATGVASQAIQGPFSGVGATGVNPTDLPSIAFGDQLKLLNKDPATVTKYSNPLGDLGAYLSQLSPQDKLNQAQTLVGQPISTLFPDAYPGNPPSRAKVMSAAARKYDLTPQLIGAIILAEQRDQTRDEDAKDYQAAVSIKSANTSIGLGQVVVSTAIKYELFTDLLGQPVRRGLSRKAVATLLASDEFNIFATARYIRYVANLASQQDLRKLPKTRGAFPSIDLRAYAGNPRNWPRDNVRALASEYTSRPWDDNLSPGWPMFVDDAYATFLDPGMRFP 428 T 0.0021 DUF1402 unphh F Bacteria T 4luq 2 C,D C,D Q9HYC4_PSEAE ANTITOXIN TSI3 MGSSHHHHHHSSGLVPRGSHMDFDKTLTHPNGLVVERPVGFDARRSAEGFRFDEGGKLRNPRQLEVQRQDAPPPPDLASRRLGDGEARYKVEEDDGGSAGSEYRLWAAKPAGARWIVVSASEQSEDGEPTFALAWALLERARLQ 144 T 0.0052 PsbP_2 unphh F Bacteria T 4lx2 2 B B MELPH_MOUSE EXOPHILIN-3, LEADEN PROTEIN, SLP HOMOLOG LACKING C2 DOMAINS A, SLAC2-A, SYNAPTOTAGMIN-LIKE PROTEIN 2A RDQPLNSKKKKRLLSFRDVDFEEDSD 26 T 2.9 Phage_Treg pdbhh F Eukaryota T 4m1x 1 A,B,C,D A,B,C,D B3FK35_9CAUD uncharacterized protein 201phi2-1p060 GSHMASQDNDDIFGNDSPEVPIFRKNLEKFKFSKGDGIKFSNTTFHIYEATRNYVTIHILKKYATAELMEFMHTRHDAVYIGPILEWTDGVHLTFRRKS 99 T 16 SelB-wing_3 unphh T Viruses T 4m1z 2 C,D C,D MYCP1_MYCS2 PEPTIDASE S8 AND S53, SUBTILISIN, KEXIN, SEDOLISIN RVKEVPPPVYIPPPDRGPIT 20 T 0.33 JCAD pdbhh F Bacteria T 4m38 2 C,D E,F H4_HUMAN Histone H4 SGRGKGGKGLGKGGAKRHRKV 21 T 11 Shadoo unppercent F Eukaryota T 4m5e 1 A A Q9HYC5_PSEAE Uncharacterized protein MTATSDLIESLISYSWDDWQVTRQEARRVIAAIRNDNVPDATIAALDKSGSLIKLFQRVGPPELARSLIASIAGRTTMQRYQARNALIRSLINNPLGTQTDNWIYFPTITFFDICADLADAAGRLGFAAAGATGVASQAIQGPFSGVGATGVNPTDLPSIAFGDQLKLLNKDPATVTKYSNPLGDLGAYLSQLSPQDKLNQAQTLVGQPISTLFPDAYPGNPPSRAKVMSAAARKYDLTPQLIGAIILAEQRDQTRDEDAKDYQAAVSIKSANTSIGLGQVVVSTAIKYELFTDLLGQPVRRGLSRKAVATLLASDEFNIFATARYIRYVANLASQQDLRKLPKTRGAFPSIDLRAYAGNPRNWPRDNVRALASEYTSRPWDDNLSPGWPMFVDDAYATFLDLEHHHHHH 410 T 0.0021 DUF1402 unphh F Bacteria T 4m5f 1 A A Q9HYC5_PSEAE Uncharacterized protein MTATSDLIESLISYSWDDWQVTRQEARRVIAAIRNDNVPDATIAALDKSGSLIKLFQRVGPPELARSLIASIAGRTTMQRYQARNALIRSLINNPLGTQTDNWIYFPTITFFDICADLADAAGRLGFAAAGATGVASQAIQGPFSGVGATGVNPTDLPSIAFGDQLKLLNKDPATVTKYSNPLGDLGAYLSQLSPQDKLNQAQTLVGQPISTLFPDAYPGNPPSRAKVMSAAARKYDLTPQLIGAIILAEQRDQTRDEDAKDYQAAVSIKSANTSIGLGQVVVSTAIKYELFTDLLGQPVRRGLSRKAVATLLASDEFNIFATARYIRYVANLASQQDLRKLPKTRGAFPSIDLRAYAGNPRNWPRDNVRALASEYTSRPWDDNLSPGWPMFVDDAYATF 400 T 0.0021 DUF1402 unphh F Bacteria T 4m5f 2 B B Q9HYC4_PSEAE Uncharacterized protein SHGVDFDKTLTHPNGLVVERPVGFDARRSAEGFRFDEGGKLRNPRQLEVQRQDAPPPPDLASRRLGDGEARYKVEEDDGGSAGSEYRLWAAKPAGARWIVVSASEQSEDGEPTFALAWALLERARLQ 127 T 0.0052 PsbP_2 unphh F Bacteria T 4m63 1 A,B A,B T3SS2 effector VopL nucleation of actin polymerization GHMRLLSEDLFKQSPKLSEQELDELANNLADYLFQAADIDWHQVISEKTRGLTTEEMAKSEHRYVQAFCREILKYPDCYKSADVASPESPKSGGGSVIDVALKRLQTGRERLFTTTDEKGNRELKKGDAILESAINAARMAISTEEKNTILSNNVKSATFDVFCELPCMDGFAEQNGKTAFYALRAGFYSAFKNTDTAKQDITKFMKDNLQAGFSGYSYQGLTNRVAQLEAQLAALSAKLS 241 T 0.0017 ABC_tran_CTD pdbpssm F T 4m6b 2 B,D C,F SWR1_YEAST Helicase SWR1 GSHMDRESDDKTPSVGLSALFGKGEESDGDLDLDDSEDFTVNSSSVEGEELEKDW 55 T 14 DUF5945 pdbhh F Eukaryota T 4m6e 1 A A tyrocidine A XPFXNQYVXL 10 T 1.2 Inhibitor_I10 pdbhh F T 4m7c 2 C,D C,D SLX4_HUMAN BTB/POZ DOMAIN-CONTAINING PROTEIN 12 SRGLEVSHRLAPW 13 T 5.5 DUF5673 pdbhh F Eukaryota T 4m91 2 B B CRBN_HUMAN Protein cereblon KRKFHCANLTSW 12 T 27 Chordopox_A33R pdbhh F Eukaryota T 4m9s 2 B,D,F,H E,F,G,H CED-3 fragment PMFNFMGC 8 T 0.78 NADH_u_ox_C pdbhh F T 4m9x 2 B,D C,D CED-3 fragment PLFNFLCG 8 T 3.2 TPR_3 pdbhh F T 4m9y 2 B,D C,D CED-3 fragment PLFNFMGC 8 T 2.2 NADH_u_ox_C pdbhh F T 4m9z 2 B,D,F,H E,F,G,H CED-3 fragment PMFNFLGC 8 T 0.74 NADH_u_ox_C pdbhh F T 4mbe 4 G,H,I,J H,G,X,Y NUP1_YEAST NUCLEAR PORE PROTEIN NUP1 LKKNIEPKKDKESIVLPTVGFDFIK 25 T 0.12 DUF4519 pdbhh F Eukaryota T 4mdd 2 C,D C,D NCOR1_HUMAN N-COR, N-COR1 NLGLEDIIRKALMGS 15 T 3.8 baeRF_family3 pdbhh F Eukaryota T 4mgp 1 A A MAGA_XENLA Magainin 2 Derivative GIGKFLHAAKKFAKAFVAEIMNS 23 T 1.3 TAFII28 pdbhh F Eukaryota T 4mgx 2 B B GP1BA_HUMAN GP-IB ALPHA, GPIB-ALPHA, GPIBA, GLYCOPROTEIN IBALPHA, ANTIGEN CD42B-ALPHA, GLYCOCALICIN PTFRSSLFL 9 T 1.6 TALPID3 unphh F Eukaryota T 4mi7 1 A A H9L447_SALTY Bacteriophage encoded virulence factor GPVDEGIDIIECENLLKEMNVQKIPESSLFTNIKEALQAEVFNSTVEDDFESFISYELQNHGPLMLIRPSLGSECLHAECIVGYDSEVKKVLIYDSMNTSPEWQSNIDVYDKLTLAFNDKYKNEDCSICGLYYDGVYEPK 140 T 0.021 Peptidase_C70 unppssm F Bacteria T 4mjt 2 J,K,L,M,N,O,P,Q,R J,K,L,M,N,O,P,Q,R MONAL_PSEE4 Monalysin QPQSHSIELDEVSKEAASTRAALTSNL 27 T 17 DUF3618 pdbhh F Bacteria T 4mli 2 C,D B,D SpyTag AHIVMVDAYKPTK 13 T 4.7 NAMPT_N pdbhh F T 4mn3 2 B B peptide XFAYKSX 7 T 12 ITAM_Cys-rich pdbhh F T 4mnv 2 B B acyl-enzyme intermediate of bicyclic peptide UK729 TCRQSMCTAR 10 T 4.6 DUF5497 pdbhh F T 4mnw 2 B B bicyclic peptide UK749 QCWDRGCENRKCNX 14 T 2.6 Pox_G9-A16 pdbhh F T 4mnx 2 B B bicyclic peptide UK811 LCSDRGCENRWCKX 14 T 0.81 LRRCT_2 pdbhh F T 4mny 2 C,D C,D bicyclic peptide UK903 GCQVNYCPPVPCLX 14 T 0.4 Antimicrobial23 pdbhh F T 4mod 1 A,B A,B SPIKE_MERS1 HR1 of S protein, LINKER, HR2 of S protein MENQKLIANKFNQALGAMQTGFTTTNEAFQKVQDAVNNNAQALSKLASELSNTFGAISASIGDILVPRGSGGSGGSGGLEVLFQGPLTQINTTLLDLTYEMLSLQQVVKALNESYIDLKELLEHHHHHH 129 T 1.7E-08 CoV_S2 pdbpssm T Viruses T 4moy 2 B B PP1RA_RAT MHC CLASS I REGION PROLINE-RICH PROTEIN CAT53, PHOSPHATASE 1 NUCLEAR TARGETING SUBUNIT, PROTEIN PNUTS GAMGRKRKTVTWPEEGKLREYFYFELDETERVNVNKIKDFGEAA 44 T 2.3 GAAD pdbhh F Eukaryota T 4mqv 2 B,D B,D SMAL1_HUMAN HEPA-RELATED PROTEIN, HHARP, SUCROSE NONFERMENTING PROTEIN 2-LIKE 1 LTEEQRKKIEENRQKALARRAEKLLA 26 T 0.08 ETAA1 pdbhh F Eukaryota T 4ms8 4 D B pCPB9 SPAEAGFFL 9 T 0.047 DUF1148 pdbhh F T 4mtm 1 A A I2GUG0_9CAUD Putative tail fiber protein RSLIANNTVNPNNGLGGAWEVYSGQGSIPTATSTTAGITKVLNVLNSNDVGSALSAAQGKVLNDKFNFQNSKNQSGYVRLGDSGLIIQWGVFTSTKTQSNLIFPLAFPNALLSITGNLNSNTPDVIGIDFDLSTATKTSIKTGAAQVGASWLSGKKISWIAIGY 164 T 0.02 UPF0164 pdb T Viruses T 4mvb 4 D B pCPB7 QPAEGGFQL 9 T 7.6 Turandot pdbhh F T 4mxq 4 D B pCPC5 SPAPRPLDL 9 T 5.6 Rhabdo_M1 pdbhh F T 4myy 1 A,B A,B F4Y428_9CYAN;F4Y429_9CYAN POLYKETIDE SYNTHASE MODULE, ZN-DEPENDENT OXIDOREDUCTASE/POLYKETIDE SYNTHASE MODULE NSALEAKLLDEIKQSSNQELESSIDQILESIINGGGSGGGSMLNKFTKKEQILSEKQQIKQLSPLQRAALALKKLETKLNNTLHE 85 T 0.055 Tubulin pdb F Bacteria T 4mz5 1 A,B A,C DUT_HUMAN DUTPASE, DUTP PYROPHOSPHATASE AISPSKRARPAEV 13 T 0.021 DSBA unppercent F Eukaryota T 4mz6 1 A,B A,C DUT_HUMAN DUTPASE, DUTP PYROPHOSPHATASE AIEPSKRARPAEV 13 T 0.021 DSBA unppercent F Eukaryota T 4mzj 2 B T MYOA_PLAF7 PFM-A XKNXPSLXRVQAHIRKKMV 19 T 0.27 BORCS8 pdbhh F Eukaryota T 4mzk 2 B T MYOA_PLAF7 PFM-A XKNIPSLLRXQAHXRKKMV 19 T 0.55 IQ unppssm F Eukaryota T 4mzl 2 C,D C,D MYOA_PLAF7 hydrogen bond surrogate (HBS) myoA helix mimetic NIXSLLRVQAHIRKKMV 17 T 0.14 BORCS8 pdbhh F Eukaryota T 4mzz 1 A,B A,B OSTCN_BOVIN BONE GLA PROTEIN, BGP, GAMMA-CARBOXYGLUTAMIC ACID-CONTAINING PROTEIN EPKREVCELNPDCDELADHIGFQEAYRRFYGPV 33 T 0.099 Toxin_23 pdbpercent F Eukaryota T 4n0c 2 B,F B,F pCPE3 MPAGRPWDL 9 T 0.35 DUF4516 pdbhh F T 4n0p 1 A,B,C,D,E,F,G,H A,B,C,D,E,F,G,H Q9A495_CAUCR Pilus assembly protein CpaE MGSDKIHHHHHHENLYFQGIPRITIHAFCARPETAALIEKAAADRRMSRAATIVRDGGLEAAVDYYQNQPTPSLVMVETLDGAQRLLHLLDSLAQVCDPGTKVVVVGQTNDIALYRELMRRGVSEYLTQPLGPLQVIRAVGALYADPAAPF 151 T 0.00037 Response_reg pdbpssm F Bacteria T 4n39 2 B B HCFC1_HUMAN Host cell factor 1 THETGTTNTATTATSN 16 T 54 Ice_nucleation pdbhh F Eukaryota T 4n3a 2 B B HCFC1_HUMAN Host cell factor 1 VRVCSNPPCATHETGTTNTATTATSN 26 T 7.8 Ntox1 pdbhh F Eukaryota T 4n3b 2 B B HCFC1_HUMAN Host cell factor 1 VRVCSNPPCQTHETGTTNTATTATSN 26 T 3.2 Ntox1 pdbhh F Eukaryota T 4n3c 2 B B HCFC1_HUMAN Host cell factor 1 VRVCSNPPCETHETGTTNTATTATSN 26 T 15 DUF1936 pdbhh F Eukaryota T 4n4f 2 B C H4_HUMAN Histone 4 Peptide KGGKGLGXGGAXRHRKVLRDN 21 T 0.27 UPF0137 unp F Eukaryota T 4n5e 4 D B pCPA12 VPYMAEFGM 9 T 0.13 UPA_2 pdbhh F T 4n5t 2 B B ATSP-7041 stapled-peptide XLTFXEYWAQXXSAA 15 T 0.74 PBP-Tp47_a pdbhh F T 4n78 6 F P WIRS WGAERSMSTFGKEKA 15 T 3.5 YicC_N pdbhh F T 4n7h 2 B B ARRD3_HUMAN TBP-2-LIKE INDUCIBLE MEMBRANE PROTEIN, TLIMP RPEAPPSYAEVVT 13 T 0.062 TMEM252 pdbhh F Eukaryota T 4n7s 1 A,C A,C Q9HYC5_PSEAE Uncharacterized protein TATSDLIESLISYSWDDWQVTRQEARRVIAAIRNDNVPDATIAALDKSGSLIKLFQRVGPPELARSLIASIAGRTTMQRYQARNALIRSLINNPLGTQTDNWIYFPTITFFDICADLADAAGRLGFAAAGATGVASQAIQGPFSGVGATGVNPTDLPSIAFGDQLKLLNKDPATVTKYSNPLGDLGAYLSQLSPQDKLNQAQTLVGQPISTLFPDAYPGNPPSRAKVMSAAARKYDLTPQLIGAIILAEQRDQTRDEDAKDYQAAVSIKSANTSIGLGQVVVSTAIKYELFTDLLGQPVRRGLSRKAVATLLASDEFNIFATARYIRYVANLASQQDLRKLPKTRGAFPSIDLRAYAGNPRNWPRDNVRALASEYTSRPWDDNLSPGWPMFVDDAYATFLD 401 T 0.0018 DUF1402 pdbhh F Bacteria T 4n7s 2 B,D B,D Q9HYC4_PSEAE inhibitor MGSSHHHHHHSSGENLYFQSHMSMTLTHPNGLVVERPVGFDARRSAEGFRFDEGGKLRNPRQLEVQRQDAPPPPDLASRRLGDGEARYKVEEDDGGSAGSEYRLWAAKPAGARWIVVSASEQSEDGEPTFALAWALLERARLQ 143 T 0.0052 PsbP_2 unphh F Bacteria T 4n7v 2 C C CE152_HUMAN CEP152 MSLDFGSVALPVQNEDEEYDEEDYEREKELQQLLTDLPHDMLDDDLSSPELQYSDCSEDG 60 T 0.035 BING4CT pdbpssm F Eukaryota T 4n7z 2 B B CE192_HUMAN Centrosomal protein of 192 kDa EKLILPTSLEDSSDDDIDDEMFYDDHLEAYFEQLAIPGMIYEDLEGPEPPEKGFKLPT 58 T 29 BDHCT_assoc pdbhh F Eukaryota T 4n80 1 A A Q9HYC5_PSEAE Uncharacterized protein TATSDLIESLISYSWDDWQVTRQEARRVIAAIRNDNVPDATIAALDKSGSLIKLFQRVGPPELARSLIASIAGRTTMQRYQARNALIRSLINNPLGTQTDNWIYFPTITFFDICADLADAAGRLGFAAAGATGVASQAIQGPFSGVGATGVNPTDLPSIAFGDQLKLLNKDPATVTKYSNPLGDLGAYLSQLSPQDKLNQAQTLVGQPISTLFPDAYPGNPPSRAKVMSAAARKYDLTPQLIGAIILAEQRDQTRDEDAKDYQAAVSIKSANTSIGLGQVVVSTAIKYELFTDLLGQPVRRGLSRKAVATLLASDEFNIFATARYIRYVANLASQQDLRKLPKTRGAFPSIDLRAYAGNPRNWPRDNVRALASEYTSRPWDDNLSPGWPMFVDDAYATF 399 T 0.0021 DUF1402 unphh F Bacteria T 4n80 2 B B Q9HYC4_PSEAE Uncharacterized protein SHMDFDKTLTHPNGLVVERPVGFDARRSAEGFRFDEGGKLRNPRQLEVQRQDAPPPPDLASRRLGDGEARYKVEEDDGGSAGSEYRLWAAKPAGARWIVVSASEQSEDGEPTFALAWALLERARL 125 T 0.0052 PsbP_2 unphh F Bacteria T 4n88 2 B,D B,D Q9HYC4_PSEAE Uncharacterized protein SHMMTLTHPNGLVVERPVGFDARRSAEGFRFDEGGKLRNPRQLEVQRQDAPPPPDLASRRLGDGEARYKVEEDDGGSAGSEYRLWAAKPAGARWIVVSASEQSEDGEPTFALAWALLERARLQ 123 T 0.0052 PsbP_2 unphh F Bacteria T 4nag 1 A,B A,B F0CAT0_9XANT Xanthomonin I GGPLAGEEIGGFNVPG 16 T 6.6 Rhabdo_M2 pdbhh F Bacteria T 4nb3 2 C,D C,D ATRIP_HUMAN 3,4 dichlorophenylalanine ATRIP derived peptide XDFTADDLEEWXALA 15 T 0.48 TT_ORF2 pdbhh F Eukaryota T 4nds 1 A,B A,B AGBL_LYODE Alpha-galactosyl-binding lectin ACWKANSCPGSAFESKDRLRSFALLYCRYNYKPPYGQGAFGYASAVSTHGWETEAQCINTFEQIITSCHGQSNGGTLELNSGRLSLAFGNCEEL 94 T 0.065 Fungal_lectin_2 pdb F Eukaryota T 4nf9 2 C,D C,D NSL1_HUMAN Kinetochore-associated protein NSL1 homolog LKRKQTKDCPQRKWYPLRPKKINLDT 26 T 4.7 DUF3410 pdbhh F Eukaryota T 4nft 2 E,F E,F AN32E_HUMAN ANP32E, LANP-LIKE PROTEIN, LANP-L GSHMEEEEEEEEEEDEDEDEDEDEAGSELGEGEEEVGLSYLMKEEIQDEEDD 52 T 0.0014 BUD22 unp F Eukaryota T 4ngh 3 C P MODIFIED FRAGMENT OF HIV GLYCOPROTEIN (GP41) XNWFNITNXLWXIXKKK 17 T 0.029 GP41 pdbhh F T 4nhc 3 C P MODIFIED FRAGMENT OF HIV GLYCOPROTEIN (GP41) XNWFNITNXLWXIKKKK 17 T 0.034 GP41 pdbhh F T 4nio 1 A A SODC_HUMAN SUPEROXIDE DISMUTASE 1, HSOD1 GVTGIAQ 7 F F Eukaryota T 4nm0 2 B B AXIN1_HUMAN AXIS INHIBITION PROTEIN 1, HAXIN GGILVEPQKFAEELIHRLEAVQRT 24 T 0.19 Cwf_Cwc_15 unppssm F Eukaryota T 4nmo 2 C,D C,D iCAL36(Ac-K-1) peptide ANSRWPTSXI 10 T 9.5 CBP_BcsR pdbhh F T 4nmp 2 C,D C,D iCAL36(Ac-K-3) peptide ANSRWPXSII 10 T 4.9 Arc_MA pdbhh F T 4nmq 2 C,D C,D iCAL36(Ac-K-4) peptide ANSRWXTSII 10 T 1.2 Arc_MA pdbhh F T 4nmr 2 C,D C,D iCAL36(Ac-K-5) peptide ANSRXPTSII 10 T 5.3 CX pdbhh F T 4nmx 3 C Z peptide 2-8 XTVFTSWEEYLDWVX 15 T 0.21 DUF5575 pdbhh F T 4nnd 2 B,D,F,H F,C,E,H ERBB2_HUMAN METASTATIC LYMPH NODE GENE 19 PROTEIN, MLN 19, PROTO-ONCOGENE NEU, PROTO-ONCOGENE C-ERBB-2, TYROSINE KINASE-TYPE CELL SURFACE RECEPTOR HER2, P185ERBB2 LQRXSE 6 T 21 LELP1 pdbhh F Eukaryota T 4no3 3 C C AMPD2_HUMAN AMP deaminase 2 RQISQDVKL 9 T 40 DUF2590 pdbhh F Eukaryota T 4nqj 1 A,B,C A,B,C TRI69_HUMAN RFP-LIKE DOMAIN-CONTAINING PROTEIN TRIMLESS, RING FINGER PROTEIN 36, TRIPARTITE MOTIF-CONTAINING PROTEIN 69 SVGQSKEFLQISDAVHFFMEELAIQQGQLETTLKELQTLRNMQKEAIAAHKENKLHLQQHVSMEFLKLHQFLHSKEKDILTELREEGKALNEEMELNLSQLQEQCLLAKDMLVSIQAKTEQQNSFDFLKDITTLLHSLEQGMKVLATRELISRKLNLGQYKGPIQYMVWREMQDTLCPG 179 T 0.00026 DUF1043 pdbpssm F Eukaryota T 4nso 2 B B Q9KN41_VIBCH Immunity protein MGENCNDTSGVHQKILVCIQNEIAKSETQIRNNISSKSIDYGFPDDFYSKQRLAIHEKCMLYINVGGQRGELLMNQCELSMLQGLDIYIQQYIEDVDNSLLEHHHHHH 108 T 0.015 Fmp27_GFWDK pdbpercent F Bacteria T 4nsr 1 A,B,C,D,E,F B,C,A,E,F,D Q9KN41_VIBCH Immunity protein MGENCNDMGENCNDTSGVHQKILVCIQNEIAKSETQIRNNISSKSIDYGFPDDFYSKQRLAIHEKCMLYINVGGQRGELLMNQCELSMLQGLDIYIQQYIEDVDNSLLEHHHHHH 115 T 0.013 Fmp27_GFWDK pdbpercent F Bacteria T 4ntp 1 A,B,C,D,E,F,G,H,I,J,K,L,M,N,O,P A,B,C,D,E,F,G,H,I,J,K,L,M,N,O,P Cyclic hexadecapeptide (ORN)LV(PHI)FAED(ORN)AII(SAR)L(ORN)V XLVXFAEDXAIIXLXV 16 T 0.012 Beta-APP pdbhh F T 4ntr 1 A,B,C,D,E,F,G,H,I,J,K,L,M,N,O,P A,B,C,D,E,F,G,H,I,J,K,L,M,N,O,P Cyclic hexadecapeptide (ORN)LVFFAED(ORN)AII(SAR)L(ORN)V XLVFFAEDXAIIXLXV 16 T 0.012 Beta-APP pdbhh F T 4nuf 2 B P EID1_MOUSE ORPHAN NUCLEAR RECEPTOR SHP, SMALL HETERODIMER PARTNER MHRVSAALEEANKVFL 16 T 8.1 DUF4646 pdbhh F Eukaryota T 4nuu 2 C C ACKR1_HUMAN ATYPICAL CHEMOKINE RECEPTOR 1, FY GLYCOPROTEIN, GPFY, GLYCOPROTEIN D, PLASMODIUM VIVAX RECEPTOR GPTGNSSQLDFEDVWNSSYGVNDSFPDGDYGA 32 T 1.4 DUF2603 pdbhh F Eukaryota T 4nuv 2 C,D C,D ACKR1_HUMAN ATYPICAL CHEMOKINE RECEPTOR 1, FY GLYCOPROTEIN, GPFY, GLYCOPROTEIN D, PLASMODIUM VIVAX RECEPTOR GPTGTENSSQLDFEDVWNSSYGVNDSFPDGDYGA 34 T 1.7 Myosin-VI_CBD pdbhh F Eukaryota T 4nw2 2 B,D B,D NS1_I72A2 Nonstructural protein 1 PKQKRKMARTARSKV 15 T 22 LPP20 pdbhh T Viruses T 4nw8 1 A,B,C,D,E,F,G,H,I,J,K,L A,B,C,D,E,F,G,H,I,J,K,L Cyclic hexadecapeptide (ORN)LV(PHI)(MEA)AED(ORN)AIIGL(ORN)V XLVXXAEDXAIIGLXV 16 T 0.012 Beta-APP pdbhh F T 4nw9 1 A,B,C,D,E,F,G,H,I,J,K,L A,B,C,D,E,F,G,H,I,J,K,L Cyclic hexadecapeptide (ORN)LVF(MEA)AED(ORN)AIIGL(ORN)V XLVFXAEDXAIIGLXV 16 T 0.012 Beta-APP pdbhh F T 4nxq 2 D,E,F D,E,F CNTP4_HUMAN CASPR4 PEPTIDE ENQKEYFF 8 T 3.8 La pdbhh F Eukaryota T 4nxr 2 B B NRX2B_HUMAN NEUREXIN II-BETA PEPTIDE NKDKEYYV 8 T 5.3 TMEM154 unphh F Eukaryota T 4ny3 2 C,D C,D PP2AA_HUMAN PP2A-ALPHA, REPLICATION PROTEIN C, RP-C TPDYFL 6 T 7.6 TDH pdbhh F Eukaryota T 4nzr 3 C M Y281_MYCGE PROTEIN MG281 MGSSHHHHHHSSGLVPRGSHMSLSLNDGSYQSEIDLSGGANFREKFRNFANELSEAITNSPKGLDRPVPKTEISGLIKTGDNFITPSFKAGYYDHVASDGSLLSYYQSTEYFNNRVLMPILQTTNGTLMANNRGYDDVFRQVPSFSGWSNTKATTVSTSNNLTYDKWTYFAAKGSPLYDSYPNHFFEDVKTLAIDAKDISALKTTIDSEKPTYLIIRGLSGNGSQLNELQLPESVKKVSLYGDYTGVNVAKQIFANVVELEFYSTSKANSFGFNPLVLGSKTNVIYDLFASKPFTHIDLTQVTLQNSDNSAIDANKLKQAVGDIYNYRRFERQFQGYFAGGYIDKYLVKNVNTNKDSDDDLVYRSLKELNLHLEEAYREGDNTYYRVNENYYPGASIYENERASRDSEFQNEILKR 416 T 0.13 RE_endonuc pdbpercent F Bacteria T 4o1v 2 B B PTEN_HUMAN MUTATED IN MULTIPLE ADVANCED CANCERS 1, PHOSPHATASE AND TENSIN HOMOLOG PSNPEASSSTSVTPD 15 T 60 DUF3636 pdbhh F Eukaryota T 4o2c 3 C C DDX3X_HUMAN DEAD BOX PROTEIN 3, X-CHROMOSOMAL, DEAD BOX, X ISOFORM, HELICASE-LIKE PROTEIN 2, HLP2 XSHVAVENAL 10 T 17 HTH_SUN2 pdbhh F Eukaryota T 4o2e 3 C,F C,F DDX3X_HUMAN DEAD BOX PROTEIN 3, X-CHROMOSOMAL, DEAD BOX, X ISOFORM, HELICASE-LIKE PROTEIN 2, HLP2 SHVAVENAL 9 T 13 HTH_SUN2 pdbhh F Eukaryota T 4o2f 3 C,F C,F DDX3X_HUMAN DEAD BOX PROTEIN 3, X-CHROMOSOMAL, DEAD BOX, X ISOFORM, HELICASE-LIKE PROTEIN 2, HLP2 HVAVENAL 8 T 9.2 HTH_SUN2 pdbhh F Eukaryota T 4o3t 3 C P ZAP.14 IVGGYPWWMDV 11 T 0.13 Laps pdbhh F T 4o3u 3 C P ZAP 2.3 IIGGCPYWMDREECI 15 T 0.39 DUF779 pdbhh F T 4o4a 1 A,B,C,D A,B,C,D A0A6L8PAP6_BACAN PUTATIVE LIPOPROTEIN MHHHHHHSSGVDLGTENLYFQSNAKETTDTIYLIPEEYEGDLIVVYNVPGAELLPKEEEFSVVTFAADGTAVTSTKNMKFGTVNDLYYTVNKEGQRTKIDSSCIHFSSTGSRTENSWEFPFANLEVTRTACSQEFSANGREVPENQEHPAEKKMRDLMQRIQERYMNKVK 170 T 0.019 Lysis_col unphh F Bacteria T 4o56 2 B B synthetic peptide GPMTSTPK 8 T 3.4 RPN1_RPN2_N pdbhh F T 4o6f 2 B B ESR1_HUMAN ER, ER-ALPHA, ESTRADIOL RECEPTOR, NUCLEAR RECEPTOR SUBFAMILY 3 GROUP A MEMBER 1 GGRMLKHKRQR 11 T 11 Gemini_mov pdbhh F Eukaryota T 4o7j 1 A,B B,A CarG LTPVTLKNGVNQLDINQDGLKDYVVLAQFDNNTSHPNLGLTFFIHRPDGGYSIMPVTNSSEFTWFDYRLSASADFLVQDNRLFKIKKHYYLVTARKTEEDLFDVGKVSLTIYRFKVSRDDPGVPLYEWSMSKTVTAQRSYQSADEAYQEVDEAMLTRHHHHHH 163 T 0.008 FG-GAP_3 pdbpssm F T 4o7k 1 A A OSA_SHIFL ONCOGENIC SUPPRESSION ACTIVITY PROTEIN HHMLLWRRCRAWLEIRRLDKELAQSSGLPLELPQIVPNAWNEVVWRLPVPNHPDAFMTASNAAQSDFIVYVNGLAFYRAWLALGVEDSQACPLKQDMPKDRKYPSSAAHFAVGIDSPVPLADVSPTMILGHFAVCFTDGMTRSMWLLAHEVAVFPVLSRDEASAVMLAEHVGVAAPIQVSKLREQCRKIL 190 T 0.061 DUF3613 pdb F Bacteria T 4o87 1 A,B A,B Q707V3_9ASCO N-tagged Nuclease SMNPTTCLNEGAIGYMAIDILQSQNIETITINDNEYKLNKFNNIKDYISKVWGAASVYNLDLGNDYTKWQSSLDNVETDNIKNYINGHDNVYYNPGGKNKYLIIEASKELKWKGNLNNNKFNVNLKSIFSNAENLKVGHSDLLKLFSSIVNSKGSDNQKKVLNSLLDNINDRRLKKLVSTGQWTEAISDSVANEIAKNNKLTSIKAQLGSQKTQNVMIDANGHDLLKIDYDKTFVTANDLKNKIIDKNKLENAKNYFKIQNNDKILEDIKSKFSKNINENIKGSIRDHAKLIEFTENKKFNTINDNSNSDSKIKSITCKV 320 T 0.011 DWNN pdb F Eukaryota T 4o88 1 A,B A,B Q707V3_9ASCO N-tagged Nuclease PTTCLNEGAIGYMAIDILQSQNIETITINDNEYKLNKFNNIKDYISKVWGAASVYNLDLGNDYTKWQSSLDNVETDNIKNYINGHDNVYYNPGGKNKYLIIEASKELKWKGNLNNNKFNVNLKSIFSNAENLKVGHSDLLKLFSSIVNSKGSDNQKKVLNSLLDNINDRRLKKLVSTGQWTEAISDSVANEIAKNNKLTSIKAQLGSQKTQNVMIDANGHDLLKIDYDKTFVTANDLKNKIIDKNKLENAKNYFKIQNNDKILEDIKSKFSKNINENIKGSIRDHAKLIEFTENKKFNTINDNSNSDSKIKSITCKVLEHHHHHH 325 T 0.012 DWNN pdb F Eukaryota T 4oaj 2 B B 5HT2A_RAT 5-hydroxytryptamine receptor 2A peptide NEKVSCV 7 T 62 ELF pdbhh F Eukaryota T 4od7 2 D,E,F D,E,F (ACE)PWATCDS(NH2) Peptide XPWATCDSX 9 T 5.8 Toxin_37 pdbhh F T 4odq 2 B B RS3_ECOLI 30S ribosomal protein S3 RLGIVKPWNSTWFANX 16 T 0.0042 MRP-S24 unphh F Bacteria T 4oez 2 B B co-regulator peptide SDSAFSRLYTRS 12 T 2.9 Msap1 pdbhh F T 4ofb 2 B B nonphosphopeptide inhibitor XTIDXDEYRXRKTX 14 T 2.5 UCR_TM pdbhh F T 4ofr 2 B B co-regulator peptide ANSSFRDWYTSS 12 T 0.97 DUF1122 pdbhh F T 4ofu 2 B B co-regulator peptide SDSAFSRYYTRS 12 T 1.7 DUF3486 pdbhh F T 4oh4 2 C,D F,E BKI1_ARATH BRI1 kinase inhibitor 1 STMEELQAAIQAAIAHCKNSY 21 T 0.075 DUF5765 pdbpercent F Eukaryota T 4oih 2 B B RCC1_YEAST PRP20, PHEROMONE RESPONSE PATHWAY COMPONENT SRM1, PRE-MRNA-PROCESSING PROTEIN 20, REGULATOR OF CHROMOSOME CONDENSATION, SUPPRESSOR OF RECEPTOR MUTATIONS 1, MRNA TRANSPORT PROTEIN 1 GSMVKRTVATNGDASGAHRAKKMSKTH 27 T 0.0078 RCC1 unppssm F Eukaryota T 4oil 2 B B co-regulator peptide NTTDTLFSQHYR 12 T 5.7 Nairo_nucleo pdbhh F T 4okv 3 E,F E,F Q7YT37_ANOST GE RICH SALIVARY GLAND PROTEIN KYSKIKECFDSLADDVKSLVEKSETSYEECSKDKNNPHCGSEGTRELDEGLIEREQKLSDCIVEKR 66 T 0.028 DUF725 pdb F Eukaryota T 4oni 2 B,D C,D NR0B2_HUMAN NUCLEAR RECEPTOR NR0B2, ORPHAN NUCLEAR RECEPTOR SHP, SMALL HETERODIMER PARTNER QGAASRPAILYALLSSSLK 19 T 9.2 NR_Repeat unphh F Eukaryota T 4oo6 2 B B RBM39_HUMAN HEPATOCELLULAR CARCINOMA PROTEIN 1, RNA-BINDING MOTIF PROTEIN 39, RNA-BINDING REGION-CONTAINING PROTEIN 2, SPLICING FACTOR HCC1 RSRSKERRRSRSRSRDRRFRGRYRSPY 27 T 0.67 CDC45 unp F Eukaryota T 4os1 2 B B bicyclic peptide UK601 (bicyclic 1) GXALGRGCENHRCLX 15 T 1.4 Ivy pdbhh F T 4os2 2 B B bicyclic peptide UK602 (bicyclic 1) GXLGRGCENHRCLX 14 T 0.96 Ivy pdbhh F T 4ovb 1 A A OSA_SHIFL Protein osa MLLWRRCRAWLEIRRLDKELAQSSGLPLELPQIVPNAWNEVVWRLPVPNHPDAFMTASNAAQSDFIVYVNGLAFYRAWLALGVEDSQACPLKQDMPKDRKYPSSAAHFAVGIDSPVPLADVSPTMILGHFAVCFTDGMTRSMWLLAHEVAVFPVLSRDEASAVMLAEHVGVAAPIQVSKLREQCRKIL 188 T 0.06 DUF3613 pdb F Bacteria T 4owr 2 B B NUP98_HUMAN Nuclear pore complex protein Nup98-Nup96 GSPTGTTIKFNPPTGTDTMVKAGVSTNISTKHQCITAMKEYESKSLEELRLEDYQANRK 59 T 0.44 Nucleoporin_FG unp F Eukaryota T 4oyk 2 C,D C,D OTUL_HUMAN DEUBIQUITINATING ENZYME OTULIN, OTU DOMAIN-CONTAINING DEUBIQUITINASE WITH LINEAR LINKAGE SPECIFICITY, UBIQUITIN THIOESTERASE GUMBY, DEUBIQUITINATING ENZYME OTULIN, OTU DOMAIN-CONTAINING DEUBIQUITINASE WITH LINEAR LINKAGE SPECIFICITY, UBIQUITIN THIOESTERASE GUMBY AEHEEDMYRAADEIEKEKE 19 T 0.44 40S_SA_C pdbhh F Eukaryota T 4oz7 1 A,B A,B Methanobactin XASCSXGPNC 10 T 0.31 DUF1499 pdbhh F T 4ozf 5 E J GDA2_WHEAT deamidated Gliadin-alpha2 peptide APQPELPYPQPGS 13 T 3 FAP unp F Eukaryota T 4ozi 5 I,J I,J GDA2_WHEAT deamidated Gliadin-alpha1 peptide QPFPQPELPYPGS 13 T 3 FAP unp F Eukaryota T 4p0b 2 B,D B,D OTUL_HUMAN DEUBIQUITINATING ENZYME OTULIN, OTU DOMAIN-CONTAINING DEUBIQUITINASE WITH LINEAR LINKAGE SPECIFICITY, UBIQUITIN THIOESTERASE GUMBY EEDMYRAADE 10 T 10 Ribosomal_L18_c pdbhh F Eukaryota T 4p1n 2 C,D C,D W0TA43_KLUMA Atg13 MIM ETPPEDLLEFVKLLEDKKELNMKPSTILPQQDISSSLIKFQSMKPNNDTLSDNLSMSMSID 61 T 14 E2_bind pdbhh F Eukaryota T 4p1w 4 G G C5DB94_LACTC KLTH0A00704P SKYSSSFGRLRRQ 13 T 6.2 Corona_5a pdbhh F Eukaryota T 4p2o 5 E P 2A peptide ADPADPLAFFSSAIKGGGGSLV 22 T 0.37 Rib_5-P_isom_A pdbhh F T 4p2q 3 C,H,M,R C,H,M,R 5c2 peptide ADGLAYFRSSFKGG 14 T 10 DUF1338 pdbhh F T 4p2r 3 C,H,M,R C,H,M,R 5c1 peptide ANGVAFFLTPFKA 13 T 9.8 DUF5699 pdbhh F T 4p3w 2 C,D,G,H,K,L G,H,K,L,J,I FBLI1_HUMAN FBLP-1,MIGFILIN,MITOGEN-INDUCIBLE 2-INTERACTING PROTEIN,MIG2-INTERACTING PROTEIN PEKRVASSVFITLAPPRRDVAVAE 24 T 8.4 Pox_A3L pdbhh F Eukaryota T 4p4w 1 A,B,C,D,E,F,G,H,I,J,K,L A,B,C,D,E,F,G,H,I,J,K,L CYCLIC HEXADECAPEPTIDE (ORN)YLL(PHI)YTE(ORN)KVA(MVA)AVK XYLLXYTEXKVAXAVK 16 T 1.2 Peptidase_C98 pdbhh F T 4p4y 1 A,B,C,D,E,F,G A,B,C,D,E,F,G CYCLIC HEXADECAPEPTIDE (ORN)YLL(PHI)YTE(ORN)KVT(MAA)TVK XYLLXYTEXKVTXTVK 16 T 1.1 Peptidase_C98 pdbhh F T 4p6j 1 A,B A,B Computationally Designed Transporter of Zn(II) and Proton YXKEIAHALFSALFALSELYIAVRYX 26 T 9.4 DUF6332 pdbhh F T 4p6l 1 A,B A,B Computationally Designed Transporter of Zn(II) and proton YYKEIAHALFSALFALSELYIAVRYX 26 T 9.4 DUF6332 pdbhh F T 4p6z 6 F T BST2_HUMAN BST-2,HM1.24 ANTIGEN,TETHERIN AGFSMASTSYDYCRVPMEDGDKRCK 25 T 0.36 UL42 unphh F Eukaryota T 4p7i 2 C,D C,D DCAF1_HUMAN DDB1- AND CUL4-ASSOCIATED FACTOR 1,HIV-1 VPR-BINDING PROTEIN,VPRBP,SERINE/THREONINE-PROTEIN KINASE VPRBP,VPR-INTERACTING PROTEIN GPGSEFGEDGDNDFSPSDEELANLLEEGEDGEDEDSDADEEVELILGDTDSSDNSDLEDDIILSLNE 67 T 8 ACC_epsilon pdbhh F Eukaryota T 4p9h 2 B G Q0ED31_9HIV1 Envelope glycoprotein gp160 VWKDADTTLFCASDAKAHETECHNVWATHACVPTDPNPQEIHLEQVTENFNMWKNNMVEQMQEDVISLWDQCLQPCVKLTGGSVIKQACPKISFDPIPIHYCTPAGYVILKCNDKNFNGTGPCKNVSSVQCTHGIKPVVSTQLLLNGSLAEEEIIIRSENLTNNAKTIIVHLQKSVEINCTRPSNGGSGSGGDIRKAYCEIQGTKWNKVLKQVTEKLKEHFNNKTIIFQPPSGGDLEITMHHFNCRGEFFYCNTTQLFQNTCIGNETMKGCNGTITLPCKIKQIINMWQGTGQAMYAPPIDGKINCVSQITGILLTRDGGANNTSNETFRPGGGNIKDNWRSELYKYKVVQIEGSHHHHHH 361 T 2.6E-48 GP120 unp T Viruses T 4pby 2 C,D C,D MTA1_HUMAN MTA1 DVFYMATEETRKIRKLLSSSETKRAARRPYK 31 T 0.74 MTA_R1 unp F Eukaryota T 4pbz 2 B B MTA1_HUMAN Metastasis-associated protein MTA1 KLLSSSETKRAARRPYKPIALRQSQA 26 T 0.74 MTA_R1 unp F Eukaryota T 4pc0 2 C,D C,D MTA1_HUMAN HUMAN MTA1 KLLSSSETKRAARRPYKPIALRQSQALPPRPPPPAPVNDEPI 42 T 0.14 MTA_R1 pdbpssm F Eukaryota T 4pdc 2 E,F E,F Q8QN43_COWPX CPXV018 protein GHKLAFNFNLEINGSDTHSTVDVDLDDSQIITFDGKDIRPTIPFMIGDEIFLPFYKNVFSEFFSLFRRVPTSTPYEDLTYFYECDYTDNKSTFDQDYLYNGEEYTVKTQEATNKNMWLTTSEFRLKKWFDGEDCIMHLRSLVRKMEDSKR 150 T 0.078 Thioredoxin_11 unppssm T Viruses T 4pew 1 A,B A,B LAM55_STREK Putative secreted protein SFGSDVRPAAAQEVVGGGDLGPNVLVFDPSTPDIQGKVDEVFRKQESNQFGTDRYALMFKPGTYNDINAQIGFYTSIAGLGLNPDDTTFNGDVTVDAGWFDGNATQNFWRSAENLALNPVNGTNRWAVSQAAPFRRMHVKGGLNLAPDGYGWASGGYIADSKIDGEVGPYSQQQWYTRDSSVGGWGNGVWNMTFSGVEGAPAQSFPEPPYTTLETTPVSREKPFLYLDGDDYKVFVPAKRTNARGTSWGNGTPEGESLPLDQFYVVKPGATAETINAAVDQGLHLLFTPGVYHVDQPIEIDRANTVALGLGLATIIPDNGVTALKVGDVDGVKVAGLLVDAGPVNSETLVEVGSDGASGDHAANPTSLQDVFVRIGGAGPGKATTSIVVNSNDTIIDHTWVWRADHGEGVGWETNRADYGVHVKGDNVLATGLFVEHFNKYDVQWSGENGKTIFYQNEKAYDAPDQAAIQNGDIKGYAAYKVDDSVTTHEGWGMGSYCYFNVNPDIRQQHGFQAPVKPGVKFHDLLVVSLGGKGQYEHVINDIGDPTSGDTTIPSQVVSFP 561 T 0.0083 Pectate_lyase_3 unphh F Bacteria T 4pex 1 A,B A,B LAM55_STREK Putative secreted protein SFGSDVRPAAAQEVVGGGDLGPNVLVFDPSTPDIQGKVDEVFRKQESNQFGTDRYALMFKPGTYNDINAQIGFYTSIAGLGLNPDDTTFNGDVTVDAGWFDGNATQNFWRSAENLALNPVNGTNRWAVSQAAPFRRMHVKGGLNLAPDGYGWASGGYIADSKIDGEVGPYSQQQWYTRDSSVGGWGNGVWNMTFSGVEGAPAQSFPEPPYTTLETTPVSREKPFLYLDGDDYKVFVPAKRTNARGTSWGNGTPEGESLPLDQFYVVKPGATAETINAAVDQGLHLLFTPGVYHVDQPIEIDRANTVALGLGLATIIPDNGVTALKVGDVDGVKVAGLLVDAGPVNSETLVEVGSDGASGDHAANPTSLQDVFVRIGGAGPGKATTSIVVNSNDTIIDHTWVWRADHGEGVGWETNRADYGVHVKGDNVLATGLFVEHFNKYDVQWSGENGKTIFYQNAKAYDAPDQAAIQNGDIKGYAAYKVDDSVTTHEGWGMGSYCYFNVNPDIRQQHGFQAPVKPGVKFHDLLVVSLGGKGQYEHVINDIGDPTSGDTTIPSQVVSFP 561 T 0.0083 Pectate_lyase_3 unphh F Bacteria T 4pg2 3 C D SPIKE_CVMJC CYS-SER-LEU-TRP-ASN-GLY-PRO-HIS-LEU CSLWNGPHL 9 T 3.1 RGM_N pdbhh T Viruses T 4ph8 1 A,B A,B AGGREGATIVE ADHERENCE FIMBRIAE, TYPE I (AAF/I), MAJOR SUBUNIT, AGGA, SHIGA-TOXIN PRODUCING E.COLI ASQHHHHHHVTNDCPVTITTTPPQTVGVSSTTPIGFSAKVTTSDQCIKAGAKVWLWGTGPANKWVLQHAKVAKQKYTLNPSIDGGADFVNQGTDAKIYKKLTSGNKFLNASVSVNPKTQVLIPGEYTMILHAAVDFDNKQGGASQQTTQTIRLTVT 156 T 0.082 DNA_ligase_aden pdbpssm F T 4pju 2 B B RAD21_HUMAN HHR21,NUCLEAR MATRIX PROTEIN 1,NXP-1,SCC1 HOMOLOG VDPVEPMPTMTDQTTLVPNEEEAFALEPIDITVKETKAKRKRKLIVDSVKELDSKTIRAQLSDYSDIVTTLDLAPPTKKLMMWKETGGVEKLFSLPAQPLWNNRLLKLFTRCLTPLVPEDLRKRRKGGEADNLDEFLKEF 140 T 14 Ashwin pdb F Eukaryota T 4pk7 2 B B RAD21_HUMAN HHR21,NUCLEAR MATRIX PROTEIN 1,NXP-1,SCC1 HOMOLOG GPLGSGRPVDPVEPMPTMTDQTTLVPNEEEAFALEPIDITVKETKAKRKRKLIVDSVKELDSKTIRAQLSDYSDIVTTLDLAPPTKKLMMWKETGGVEKLFSLPAQPLWNNRLLKLFTRCLTPLVPEDLRKRRKGGEADNLDEFLKEF 148 T 23 CheC pdbhh F Eukaryota T 4pn8 1 A,B,C,D,E,F,G,H,I,J A,B,C,D,E,F,G,H,I,J CC-Pent XGKIEQILQKIEKILQKIEWILQKIEQILQG 31 T 0.034 DUF4298 pdb F T 4pn9 1 A,B,C,D,E,F A,B,C,D,E,F CC-Hex2 XGEIAKSLKEIAKSLKEIAWSLKEIAKSLKG 31 T 0.032 MCPsignal pdbpssm F T 4pnd 1 A,B,C,D,E A,B,C,D,E CC-Pent_Variant XGNILQKIENILKKIENILWKIENILQKIEG 31 T 1.9 Fer4_24 pdbhh F T 4pqz 1 A A SWT1_YEAST SYNTHETICALLY LETHAL WITH TREX PROTEIN 1 MGSSHHHHHHSSGENLYFQGSYAHIPGIETPPLQFDKVSQNVFEQVKETIFFAIDHTLRKEYGEDIGFIDYNPDKLTTIENASNYIYLFWVSVFSELFTCSKIKKNEWKSLPTVLKSKPTNLNDLRTFEQFWETVLHFLFSKFTNEEKQSLEKQIHEWKTSINAIST 167 T 0.036 Borrelia_REV unp F Eukaryota T 4pr5 3 C C EBNA1_EBVG EBNA-1, EBV NUCLEAR ANTIGEN 1 HPVGDADYFEY 11 T 9.6 Sel_put pdbhh T Viruses T 4pra 3 C C EBNA1_EBVB9 EBNA-1, EBV NUCLEAR ANTIGEN 1 HPVGQADYFEY 11 T 4.1 DUF2620 pdbhh T Viruses T 4prb 3 C C EBNA1_EBVA8 EBNA-1, EBV NUCLEAR ANTIGEN 1 HPVAEADYFEY 11 T 13 Fip1 pdbhh T Viruses T 4psi 2 C,D D,E TELO2_HUMAN Telomere length regulation protein TEL2 homolog ALDSDDEFVPY 11 T 6.1 Glyco_transf_21 pdbhh F Eukaryota T 4pv8 3 E,F E,F S598 peptide modified Q600F RXFIFANI 8 T 2.8 TBP-binding pdbhh F T 4pv9 3 E,F E,F S598 peptide modified Q600V RXVIFANI 8 T 2.7 DUF3099 pdbhh F T 4pvz 2 C,D C,D HEH2_YEAST HELIX-EXTENSION-HELIX DOMAIN-CONTAINING PROTEIN 2 GPLGSTNKRKREQISTDNEAKMQIQEEKSPKKKRKKRSSKANK 43 T 20 CMS1 pdbhh F Eukaryota T 4pw1 1 A,B A,B A7VV57_9FIRM Uncharacterized protein GYKGTIEEREQPQNFNLLYLNSGEELNLYPWNLYTGQEQELFEEEIVSFAANSVRILGGGSWTDEELYPLIKFRYSGQDLRFLKDMALTEKDGRRYLVNMALDPNGLCYFSYVNQDEREATADEMDQALGKLQEDWEKFLSDPLPADSEVDLYEEKPSGSYQLDDGELKTDNAFYMFFMRCQMLSDQMRKEQYSDYIGDNLYTIWELVLKSEFTSLSYDNHIYAMYSNDGGTSMVLIYSPIEERFVGFSLKY 252 T 0.091 Glyco_hydro_43 pdbpssm F Bacteria T 4pz5 2 B B O50835_BORBU Fibronectin-binding protein BBK32 XSISYTDEIEEEDYDQ 16 T 0.052 Fn_bind unppercent F Bacteria T 4q0p 1 A A Q93UQ5_9GAMM L-Ribose isomerase MRGSHHHHHHGSARTSITRREYDEWVREAAALGKALRYPITEKMVNDSAGIVFGADQYDAFKNGMWSGEPYEAMIIFESLNEPAVDGLPTGAAPYAEYSGLCDKLMIVHPGKFCPPHHHGRKTESYEVVLGEMEVFYSPTPSAESGVELLNFSGMPVGSPWPEGVALPKGRESSYEKLTSYVRLRAGDPKFVMHRKHLHAFRCPPDSDVPLVVREVSTYSHEPTEAAAGNHAPIPSWLGMHDNDFVSDAANTGRLQTAIS 260 T 0.0062 Cupin_2 unppercent F Bacteria T 4q0u 1 A A Q93UQ5_9GAMM L-Ribose isomerase MRGSHHHHHHGSARTSITRREYDEWVREAAALGKALRYPITEKMVNDSAGIVFGADQYDAFKNGMWSGEPYEAMIIFESLNEPAVDGLPTGAAPYAEYSGLCDKLMIVHPGKFCPPHHHGRKTESYEVVLGEMEVFYSPTPSAESGVELLNFSGMPVGSPWPEGVALPKGRESSYEKLTSYVRLRAGDPKFVMHRKHLHAFRCPPDSDVPLVVRQVSTYSHEPTEAAAGNHAPIPSWLGMHDNDFVSDAANTGRLQTAIS 260 T 0.0062 Cupin_2 unppercent F Bacteria T 4q0y 1 A,B,C A,B,C J7SH17_CLOS1 Uncharacterized protein GRMEISSLSSIDVFKFNSFSKFSNDKIGVIYDEEKLSKFKVIMNSLDTSEGIKKIEVPKDANIESFKYSYHIQPNLKYVEDNNVYDGYFLLYILVGDSEGKSYIIFSGTELSYVLDKNNTNILKEIFLNVKKQQ 134 T 0.31 ABC2_membrane_3 pdb F Bacteria T 4q2l 1 A A PREDICTED TRANSPORTER, YAJR MFS TRANSPORTER GSHMKEPPYVSSLRIEIPADIAANEALKVRLLETEGVKEVLIAEEEHSAYVKIDSKVTNRFEVEQAIRQA 70 T 0.024 Spore_YhcN_YlaJ pdbhh F T 4q2s 1 A A YE38_SCHPO UNCHARACTERIZED PROTEIN C20G4.08 GAMGAQGIAESLRRLKEYVKAGSVKECVAEWCNMPSVAGFDVLSEISYDRMLENCSNLLLLTFIYHISLLDSVDDDRLSKRMEYISRICLNIDVNDPKVETVVHPVLTLTREALLRQSEFFSPIFKRRLVVLLRALDGKISEI 143 T 7.9 CLTH pdbhh F Eukaryota T 4q6h 2 B B iCAL36-VQDTRL peptide VQDTRL 6 T 200 STE3 pdbhh F T 4q6s 2 C,D C,D BT-L-iCAL36 peptide XWXFKKANSRLPTSII 16 T 2.1 PMBR pdbhh F T 4q8d 1 A,B A,B macrocyclic beta-sheet peptide incorporating residues amyloid beta 15-23 XQKLVFXAEDXQKLVXED 18 T 0.5 Beta-APP pdbhh F T 4qan 1 A,B A,B A7B4B4_RUMGV hypothetical protein GKKEESEVLNVTESLQKESEITSFSEEEEAVLYMLSALKKNDLDMALRGCAIDETALQINFVKTAEELPGMQLIDLPAPTSDYSYYFPLTSAEMTKAYIEQFEELSTEIPEIETLEVLEIAEKKEKEREEQLAECLAAQEVSELEIYVKCGEQSYRLGFTAVQYEKNWKIHSLKEGLLYETDIPACVQMEEMREAKKTYVLPNQLTGANYFQAMPISEKTPQRAVEQFIYAIEKGDLTRALAFATTESSQDTSPELLKKQGEYAKELKTMLYGFLGTEDARLYGKSEEQLNKLRGKLNPEYMVYLDLIKVIPIETEENTETVKQYAGLYSYNGKNYLTGYTLCRQEDGWQIQSLSAPALSLESGEVMRLSKEESRKTSEQSVLKAEKNER 390 T 0.0056 DUF4864 pdbhh F Bacteria T 4qbm 2 C,D C,D histone H4 peptide with sequence Gly-Ala-Lys(ac)-Arg-His-Arg-Lys(ac)-Val-Leu GAXRHRXVL 9 T 29 GIY_YIG_domain pdbhh F T 4qh7 2 C,D,G,H C,D,G,H Q9XZ31_DROME Anastral spindle 2 NYTICAGTQTDP 12 T 0.013 Macoilin unppssm F Eukaryota T 4qh8 2 C C Q9XZ31_DROME Anastral spindle 2 NYSSTTGTQCDIA 13 T 0.088 SKA2 unp F Eukaryota T 4qh8 3 D D Q9XZ31_DROME Anastral spindle 2 NYSSTTGTQCDI 12 T 0.088 SKA2 unp F Eukaryota T 4qj8 2 E,F E,F GAG_HV1A2 p1-p6 peptide RPGNFLQSRL 10 T 9.9 DUF2851 pdbhh T Viruses T 4qlb 2 E,F,G,H G,E,H,F GYG1_CAEEL Protein GYG-1, isoform b PSTEERRAAWEAGQPDYLGRDAFVHIQEALNRALNE 36 T 0.62 CysG_dimeriser unp F Eukaryota T 4qli 2 B B SNAI1_HUMAN PROTEIN SNAIL HOMOLOG 1, PROTEIN SNA SHTLPC 6 T 1.5 zf-C2H2_8 unp F Eukaryota T 4qn8 1 A,B,C A,B,C Q5ZRR7_LEGPH VipE GHMPLTQTQRLINTYGASLKNGTISNEELIILLDPNTFTKSEGYVDPNAPVSDSNHSKMDAIKDFVLTIGPTLDSEILHQLTSRMIELSPPGDRNTFMRGSSLEKAFLAFEMAHYPTKAEEHFNSTRVRTEFPGENDIDNLKAVILNPIIAFFQS 155 T 6.1 PSD5 pdbhh F Bacteria T 4qqi 2 B X RFX7_HUMAN REGULATORY FACTOR X 7, REGULATORY FACTOR X DOMAIN-CONTAINING PROTEIN 2 KAFVHMPTLPNLDFHKT 17 T 0.5 DUF4739 pdbhh F Eukaryota T 4qs4 1 A A Q93I73_ECOLX CofB GSHMEKEADEARRQIVSNALISEIAGIVDFVAEEQITVIEQGIEKEITNPLYEQSSGIPYINRTTNKDLNSTMSTNASEFINWGAGTSTRIFFTRKYCISTGTQGNYEFSKDYIPCEEPAILSNSDLKIDRIDFVATDNTVGSAIERVDFILTFDKSNANESFYFSNYVSSLEKAAEQHSISFKDIYVVERNSSGAAGWRLTTISGKPLTFSGLSKNIGSLDKTKNYGLRLSIDPNLGKFLRADGRVGADKLCWNIDNKMSGPCLAADDSGNNLVLTKGKGAKSNEPGLCWDLNTGTSKLCLTQIEGKDNNDKDASLIKLKDDNGNPATMLANILVEEKSMTDSTKKELRTIPNTIYAAFSNSNASDLVITNPGNYIGNVTSEKGRIELNVQDCPVSPDGNKLHPRLSASIASIVADTKDSNGKYQADFSSLAGNRNSGGQLGYLSGTAIQVNQSGSKWYITATMGVFDPLTNTTYVYLNPKFLSVNITTWCSTEPQT 498 T 0.94 PilX_N unphh F Bacteria T 4qsy 2 B B GAB1_HUMAN GRB2-ASSOCIATED BINDER 1, GROWTH FACTOR RECEPTOR BOUND PROTEIN 2-ASSOCIATED PROTEIN 1 GDKQVEXLDLDLD 13 T 0.94 GHBP pdbhh F Eukaryota T 4quu 2 B B H4_HUMAN Histone H4 RGXGGXGLGXGGAY 14 T 11 Shadoo unppercent F Eukaryota T 4qwn 2 B,D B,D KDM2A_MOUSE F-BOX AND LEUCINE-RICH REPEAT PROTEIN 11, F-BOX/LRR-REPEAT PROTEIN 11, JMJC DOMAIN-CONTAINING HISTONE DEMETHYLATION PROTEIN 1A, [HISTONE-H3]-LYSINE-36 DEMETHYLASE 1A QVHLTHFELEGLRCLVDKLESLPLHKKCVPTGIEDEDALIADVKILLEELASSDPKLALTGVPIVQWP 68 T 0.0015 JHD pdbhh F Eukaryota T 4qxt 3 C Q MSA2_PLAFF MEROZOITE SURFACE PROTEIN 2, MSA-2 XNAYNMSIRRSMANEGSNX 19 T 1.1 DUF6494 pdbhh F Eukaryota T 4qy8 3 C Q MSA2_PLAF7 MEROZOITE SURFACE PROTEIN 2, MSA-2, 45 KDA MEROZOITE SURFACE ANTIGEN XNAYNMSIRRSMAESKPSX 19 T 0.87 DUF6494 pdbhh F Eukaryota T 4qyd 2 B B H4_HUMAN Histone H4 GKGGKGLGXGGAKR 14 T 11 Shadoo unppercent F Eukaryota T 4qyo 3 C Q MSA2_PLAF7 MEROZOITE SURFACE PROTEIN 2, MSA-2 XNAYNMSIRRX 11 T 0.68 DUF6494 pdbhh F Eukaryota T 4r1d 1 A A Q9I3K2_PSEAE Uncharacterized protein MSSEPLEPNQDVIIPRSRDSLGRPVYKAQLTRTDNQSEKVALIRQTAPLPVIFIPGIMGTNLRNKADKSEVWRPPNGLWPMDDLFASIGALWTWAWRGPKARQELLKAEQVEVDDQGTIDVGQSGLSEEAARLRGWGKVMRSAYNPVMGLMERRLDNIVSRRELQAWWNDEALSPPGDQGEEQGKVGPIDEEELLRASRYQFDVWCAGYNWLQSNRQSALDVRDYIENTVLPFYQKECGLDPEQMRRMKVILVTHSMGGLVARALTQLHGYERVLGVVHGVQPATGSSTIYHHMRCGYEGIAQVVLGRNAGEVTAIVANSAGALELAPSAEYREGRPWLFLCDAQGQVLKDIDGKPRAYPQNQDPYEEIYKNTTWYGLVPEQNSQYLDMSDKKEGLRVGPRDNFEDLIDSIANFHGELSAAGYHSETYAHYGADDSRHSWRDLIWKGDPTPLETPGATLNDDENGTYNSWFRRGLPTIVQGPLETGNPLDASGSGGDETVPTDSGQAPALAGVKASFRHGSKGKGQANTKRGYEHQESYNDARAQWAALYGVIKITQLADWHPNDKGGT 569 T 2.8E-09 LCAT pdbhh F Bacteria T 4r1e 2 B B MYOA_PLAF7 PFM-A GSLLRVQAHIRKKMV 15 T 0.11 BORCS8 pdbhh F Eukaryota T 4r29 1 A,B,C,D A,B,C,D Q7DBA6_ECO57 CYSTEINE METHYLTRANSFERASE, NLEE MINPVTNTQGVSPINTKYAEHVVKNIYPEIKHDYFNESPNIYDKKYISGITRGVAELKQEEFVNEKARRFSYMKTMYSVCPEAFEPISRNEASTPEGSWLTVISGKRPMGQFSVDSLYNPDLHALCELPDICCKIFPKENNDFLYIVVVYRNDSPLGEQRANRFIELYNIKRDIMQELNYALPELKAVKSEMIIAREMGEIFSYMPGEIDSYMKYINNKLSKIE 224 T 0.27 Arm-DNA-bind_2 pdb F Bacteria T 4r3p 2 B B ERRFI_HUMAN MITOGEN-INDUCIBLE GENE 6 PROTEIN, MIG-6 THYXLLP 7 T 2.3 DUF1435 pdbhh F Eukaryota T 4r3s 3 C Q MSA2_PLAF7 Merozoite surface protein XFINNAYNMSIRRSX 15 T 2 DUF6494 pdbhh F Eukaryota T 4r4k 1 A,B,C,D A,B,C,D A5ZF42_9BACE Uncharacterized protein GKNEIAQSGEDFKSFLDKFTSSAAFQYTRIKFPLKTPITLLADDGETEKTFPFTKEKWPLLDSETMKEERIEQEEGGIYVSKFTLNEPVHKVFEAGYEESEIDLRVEFEQAADGKWYVVDCYTGWYGYDLPIGELKQTIQQVKEENAAFKEIHP 154 T 0.00025 DUF4348 unppssm F Bacteria T 4r6n 2 B,D,F,H B,D,F,H LECB3_ARTIN JACALIN BETA-3 CHAIN EQSGISQTVIVGPWGAKVS 19 T 2.9 DUF3842 pdbhh F Eukaryota T 4r7a 1 A A PHF6_HUMAN PHD-LIKE ZINC FINGER PROTEIN KSKKKSRKGRPRKTN 15 T 0.29 AT_hook pdbhh F Eukaryota T 4rav 3 E,F E,F HD_HUMAN HUNTINGTON DISEASE PROTEIN, HD PROTEIN MATLEKLMKAFESLKSF 17 T 2 Mito_fiss_reg unphh F Eukaryota T 4rfn 4 D,H M,D T-CELL SURFACE GLYCOPROTEIN CD4 mimetic M48 XNLHFCQLRCKSLGLLGRCAXTFCACVX 28 T 0.0091 Toxin_38 pdbpssm F T 4rh5 2 B B EPS15_HUMAN PROTEIN EPS15, PROTEIN AF-1P FSAXPSEED 9 T 1.5 RAP80_UIM unphh F Eukaryota T 4rhz 2 B B Q9KKG7_BACTU Cry37AA1 MTVYNATFTINFYNEGEWGGPEPYGYIKAYLTNPDHDFEIWKQDDWGKSTPERSTYTQTIKISSDTGSPINQMCFYGDVKEYDVGNADDILAYPSQKVCSTPGVTVRLDGDEKGSYVTIKYSLTPA 126 T 3 DUF4091 pdbpssm F Bacteria T 4riq 2 C,F,I,L,O,R,U,W C,F,I,L,O,R,U,X ASH2L_HUMAN ASH2-LIKE PROTEIN GAMGSVEHTLADVLYHVETEVENLYFQ 27 T 3.3 PRA-PH pdbhh F Eukaryota T 4rjf 2 B,D,F B,D,F CDN1A_HUMAN CDK-INTERACTING PROTEIN 1, MELANOMA DIFFERENTIATION-ASSOCIATED PROTEIN 6, MDA-6, P21 GRKRRQTSMTDFFHSKRRLIFS 22 T 0.88 CDC27 pdbhh F Eukaryota T 4rmh 2 B B Ac-Lys-H3 peptide TGGXAPR 7 T 4.5 Importin_rep_3 pdbhh F T 4ro3 1 A,B A,B M1RHE3_VIBCL Hypothetical Protein SNAMSKFYQINTTLLESNEAVNKQTGEVVPLSPETKLVYAYMLNQYRMYRKYGNRRYTESWDKIFTVCCDVAAQKQKRLAKELTTLGLIEVIGNKNAYKVVHSVESIIETWEFTNSKLNT 120 T 2.4E-05 RepA_N pdbhh F Bacteria T 4rof 2 C,D C,D TXNIP_HUMAN THIOREDOXIN-BINDING PROTEIN 2, VITAMIN D3 UP-REGULATED PROTEIN 1,TXNIP PEPTIDE XTPEAPPCYMDVIX 14 T 14 MYEOV2 pdbhh F Eukaryota T 4roj 2 D,E,F D,E,F TXNIP_HUMAN THIOREDOXIN-BINDING PROTEIN 2, VITAMIN D3 UP-REGULATED PROTEIN 1 XTPEAPPCXMDVIX 14 T 14 MYEOV2 pdbhh F Eukaryota T 4rt4 2 E E BRE2_YEAST BREFELDIN-A SENSITIVITY PROTEIN 2, COMPLEX PROTEINS ASSOCIATED WITH SET1 PROTEIN BRE2, SET1C COMPONENT BRE2 NTLDTLYKEQIAEDIVWDIIDELEQIALQQ 30 T 0.11 Ectoine_synth unp F Eukaryota T 4rtv 2 B B APP12 peptide XAPPLPPRNRPRL 13 T 0.42 SCIMP pdbhh F T 4ru2 2 B,D,F,H,J,L,N,P,R B,D,F,H,J,L,N,P,R U2AF2_MOUSE U2 AUXILIARY FACTOR 65 KDA SUBUNIT, U2 SNRNP AUXILIARY FACTOR LARGE SUBUNIT GKKKVRKYWDVPPPGFEHITPMQYKAMQA 29 T 0.0013 Transformer unp F Eukaryota T 4rud 1 A,B A,B U3EPL2_MICFL Three-finger toxin 3b LKCYSSRTETMTCPEGEDKCEKYAVGLMHGSFFFIYTCTSKCHEGAYNVCCSTDLCNK 58 T 0.0003 Toxin_TOLIP unppercent F Eukaryota T 4rwg 2 D,E,F D,E,F CGRP analog FVPTDVGPFAFX 12 T 1.2 Carcinustatin pdbhh F T 4rxv 1 A A Q5ZWY9_LEGPH hypothetical protein lpg0944 GMAIAPQQIQERLKQEQYQKFVVADIGNFPHCLAQTPEGIASGQRYQKYSTNSLSRTPPFSQWGAPQLLTPKSAQEYIKFAQQRNKKSSFKIDGEAVRVSECSNFAYHSAGVLLDDPQIRTQYDVAVIGSMHSNGRYLHNITLLVPKGSRLPQPPEQLTAEVFPIGTLIVDPWAVGMGHPPEQALAIPKEQFAYNRSLFPATVNYQSALDESLTSTRTGQLTPYTGTPS 229 T 0.28 Ail_Lom pdbpssm F Bacteria T 4rxx 1 A A UBP38_HUMAN DEUBIQUITINATING ENZYME 38, HP43.8KD, UBIQUITIN THIOESTERASE 38, UBIQUITIN-SPECIFIC-PROCESSING PROTEASE 38 MDKILEGLVSSSHPLPLKRVIVRKVVESAEHWLDEAQCEAMFDLTTRLILEGQDPFQRQVGHQVLEAYARYHRPEFESFFNKTFVLGLLHQGYHSLDRKDVAILDYIHNGLKLIMSCPSVLDLFSLLQVEVLRMVCERPEPQLCARLSDLLTDFVQCIPKGKLSITFCQQLVRTIGHFQCVSTQERELREYVSQVTKVSNLLQNIWKAEPATLLPSLQEVFASISSTDASFEPSVALASLVQHIPLQMITVLIRSLTTDPNVKDASMTQALCRMIDWLSWPLAQHVDTWVIALLKGLAAVQKFTILIDVTLLKIELVFNRLWFPLVRPGALAVLSHMLLSFQHSPEAFHLIVPHVVNLVHSFKNDGLPSSTAFLVQLTELIHCMMYHYSGFPDLYEPILEAIKDFPKPSEEKIKLILNQSAWTSHHHHHH 430 T 0.073 DUF2228 pdb F Eukaryota T 4s0g 2 B B EPS15_HUMAN PROTEIN EPS15, PROTEIN AF-1P FSAXVSEED 9 T 1.5 RAP80_UIM unphh F Eukaryota T 4s0r 2 AA,BA,O,P,Q,R,S,T,U,V,W,X,Y,Z 1,2,O,P,Q,R,S,T,U,V,W,X,Y,Z TNRA_BACSU TnrA peptide KMLEGQNAHFRYKNR 15 T 0.00015 MerR-DNA-bind unppssm F Bacteria T 4s3h 1 A,B,C,D A,B,C,D MDB1_SCHPO Mdb1 MGSSHHHHHHSSGLEVLFQGPHMEIQFGNQRCRMVNSGGFLATDGSHLKEMETDDVLVEFLNIEHQLFIRNIRAIVKIADTTVLPSASDKKLLYYVFDETRVRINDTPVIFSKLEEDNANVNEGSK 126 T 8.1 ATG19 pdbhh F Eukaryota T 4thn 3 C I HIRUNORM IV XRXTDXGXPESHXGGDYEEIPXXYXX 26 T 0.16 Hirudin pdbhh F T 4tjx 2 B B Aleurain peptide ADSNPIRPVT 10 T 21 DUF6446 pdbhh F T 4tk1 2 C,D C,D GBRA3_RAT GABA(A) RECEPTOR SUBUNIT ALPHA-3 FNIVGTTYPIN 11 T 0.98 TraW_N pdbhh F Eukaryota T 4tk3 2 C,D C,D GBRA3_RAT GABA(A) RECEPTOR SUBUNIT ALPHA-3 FSIVGTLYPIN 11 T 1.4 SH3_7 pdbhh F Eukaryota T 4tky 2 E,F,G,H F,E,G,H PRO-PHE-ALA-THR-CYS-ASP-SER PFATCDS 7 T 0.54 Hexapep_loop pdbhh F T 4tq1 2 B B TCPR1_HUMAN Tectonin beta-propeller repeat-containing protein 1 MAQTAAWRKQIFQQLTERTKRELENFRHYEQAVEQSVWV 39 T 0.031 Unpaired pdbpssm F Eukaryota T 4tqe 3 C A TAU_HUMAN NEUROFIBRILLARY TANGLE PROTEIN,PAIRED HELICAL FILAMENT-TAU,PHF-TAU LPTPPTREPKKVAVVR 16 T 38 DUF3982 pdbhh F Eukaryota T 4tt0 1 A,B A,B LTP_HHV11 TEGUMENT PROTEIN VP1-2,TEGUMENT PROTEIN VP1/2, HSV1 UL36 GPLGSAKQQRAEATERVTAGLREVLAARERRAQLEAEGLANLKTLLKVVAVPATVAKTLDQARSAEEIADQVEILVDQTEKARELDVQAVAWLEHAQRTFETHPLSAASGDGPGLLTRQGARLQALFDTRRRVEALRR 138 T 0.25 RNA_pol_Rpb2_4 pdb T Viruses T 4tuj 3 C,F E,F peptide1 RCNPNMEPPRCWAAEGD 17 T 0.57 DUF4683 pdbhh F T 4tuk 3 C I peptide2 VCNPLTGALLCSAAEGD 17 T 6.6 DUF1847 pdbhh F T 4tvq 2 E E CCM2_HUMAN MALCAVERNIN STIDFLDRAIFDGAST 16 T 0.023 PID_2 unphh F Eukaryota T 4twi 2 B B H4_YEAST Succinylated H4 Peptide (aa8-20) KGLGKGGAXRHRKW 14 T 4.2 Shadoo unppercent F Eukaryota T 4twt 2 C,D E,F PEPTIDE M21 ACPPCLWQVLCG 12 T 0.053 Ragweed_pollen pdbhh F T 4txq 2 C,D C,D CHM1B_HUMAN CHMP1.5,CHROMATIN-MODIFYING PROTEIN 1B,CHMP1B,VACUOLAR PROTEIN SORTING-ASSOCIATED PROTEIN 46-2,HVPS46-2 SVGTSVASAEQDELSQRLARLRDQV 25 T 2.5 PHA_synth_III_E pdbhh F Eukaryota T 4txy 1 A,B A,B DNCV_VIBCH C-AMP-GMP SYNTHASE,DINUCLEOTIDE CYCLASE DNCV GSMTWNFHQYYTNRNDGLMGKLVLTDEEKNNLKALRKIIRLRTRDVFEEAKGIAKAVKKSALTFEIIQEKVSTTQIKHLSDSEQREVAKLIYEMDDDARDEFLGLTPRFWTQGSFQYDTLNRPFQPGQEMDIDDGTYMPMPIFESEPKIGHSLLILLVDASLKSLVAENHGWKFEAKQTCGRIKIEAEKTHIDVPMYAIPKDEFQKKQIALEANRSFVKGAIFESYVADSITDDSETYELDSENVNLALREGDRKWINSDPKIVEDWFNDSCIRIGKHLRKVCRFMKAWRDAQWDVGGPSSISLMAATVNILDSVAHDASDLGETMKIIAKHLPSEFARGVESPDSTDEKPLFPPSYKHGPREMDIMSKLERLPEILSSAESADSKSEALKKINMAFGNRVTNSELIVLAKAL 413 T 0.03 AAA pdbpssm F Bacteria T 4tyv 1 A,B A,B LAM55_STREK Putative secreted protein AQEVVGGGDLGPNVLVFDPSTPDIQGKVDEVFRKQESNQFGTDRYALMFKPGTYNDINAQIGFYTSIAGLGLNPDDTTFNGDVTVDAGWFDGNATQNFWRSAENLALNPVNGTNRWAVSQAAPFRRMHVKGGLNLAPDGYGWASGGYIADSKIDGEVGPYSQQQWYTRDSSVGGWGNGVWNMTFSGVEGAPAQSFPEPPYTTLETTPVSREKPFLYLDGDDYKVFVPAKRTNARGTSWGNGTPEGESLPLDQFYVVKPGATAETINAAVDQGLHLLFTPGVYHVDQPIEIDRANTVALGLGLATIIPDNGVTALKVGDVDGVKVAGLLVDAGPVNSETLVEVGSDGASGDHAANPTSLQDVFVRIGGAGPGKATTSIVVNSNDTIIDHTWVWRADHGEGVGWETNRADYGVHVKGDNVLATGLFVEHFNKYDVQWSGENGKTIFYQNAKAYDAPDQAAIQNGDIKGYAAYKVDDSVTTHEGWGMGSYCYFNVNPDIRQQHGFQAPVKPGVKFHDLLVVSLGGKGQYEHVINDIGDPTSGDTTIPSQVVSFP 551 T 0.0083 Pectate_lyase_3 unphh F Bacteria T 4tz1 1 A A LAM55_STREK Putative secreted protein EVVGGGDLGPNVLVFDPSTPDIQGKVDEVFRKQESNQFGTDRYALMFKPGTYNDINAQIGFYTSIAGLGLNPDDTTFNGDVTVDAGWFDGNATQNFWRSAENLALNPVNGTNRWAVSQAAPFRRMHVKGGLNLAPDGYGWASGGYIADSKIDGEVGPYSQQQWYTRDSSVGGWGNGVWNMTFSGVEGAPAQSFPEPPYTTLETTPVSREKPFLYLDGDDYKVFVPAKRTNARGTSWGNGTPEGESLPLDQFYVVKPGATAETINAAVDQGLHLLFTPGVYHVDQPIEIDRANTVALGLGLATIIPDNGVTALKVGDVDGVKVAGLLVDAGPVNSETLVEVGSDGASGDHAANPTSLQDVFVRIGGAGPGKATTSIVVNSNDTIIDHTWVWRADHGEGVGWETNRADYGVHVKGDNVLATGLFVEHFNKYDVQWSGENGKTIFYQNAKAYDAPDQAAIQNGDIKGYAAYKVDDSVTTHEGWGMGSYCYFNVNPDIRQQHGFQAPVKPGVKFHDLLVVSLGGKGQYEHVINDIGDPTSGDTTIPSQVVSFP 549 T 0.0083 Pectate_lyase_3 unphh F Bacteria T 4tzl 2 C,D C,D G5EBG0_CAEEL C. elegans HIM-3 closure motif SNARDSPYGLSQGITKKNKD 20 T 5.1 DUF5699 pdbhh F Eukaryota T 4tzm 2 C,D C,D O01820_CAEEL C. elegans HTP-3 closure motif1 TARYGVSNTSINRKKP 16 T 9.3 DUF4090 pdbhh F Eukaryota T 4tzn 2 C,D C,D O01820_CAEEL Protein HTP-3 AMRYGQSPNMPSRRGN 16 T 6.5 zf-CDGSH pdbhh F Eukaryota T 4tzq 2 B,D B,D O01820_CAEEL Protein HTP-3 STARYGVSNTSINRKKP 17 T 10 DUF4090 pdbhh F Eukaryota T 4u03 1 A,B A,B DNCV_VIBCH C-AMP-GMP SYNTHASE, DINUCLEOTIDE CYCLASE DNCV MRMTWNFHQYYTNRNDGLMGKLVLTDEEKNNLKALRKIIRLRTRDVFEEAKGIAKAVKKSALTFEIIQEKVSTTQIKHLSDSEQREVAKLIYEMDDDARDEFLGLTPRFWTQGSFQYDTLNRPFQPGQEMDIDDGTYMPMPIFESEPKIGHSLLILLVDASLKSLVAENHGWKFEAKQTCGRIKIEAEKTHIDVPMYAIPKDEFQKKQIALEANRSFVKGAIFESYVADSITDDSETYELDSENVNLALREGDRKWINSDPKIVEDWFNDSCIRIGKHLRKVCRFMKAWRDAQWDVGGPSSISLMAATVNILDSVAHDASDLGETMKIIAKHLPSEFARGVESPDSTDEKPLFPPSYKHGPREMDIMSKLERLPEILSSAESADSKSEALKKINMAFGNRVTNSELIVLAKALPAFAQELEHHHHHH 427 T 0.94 ApoLp-III pdbpercent F Bacteria T 4u0l 1 A,B A,B DNCV_VIBCH C-AMP-GMP SYNTHASE, DINUCLEOTIDE CYCLASE DNCV MRMTWNFHQYYTNRNDGLMGKLVLTDEEKNNLKALRKIIRLRTRDVFEEAKGIAKAVKKSALTFEIIQEKVSTTQIKHLSDSEQREVAKLIYEMDDDARDEFLGLTPRFWTQGSFQYDTLNRPFQPGQEMAIADGTYMPMPIFESEPKIGHSLLILLVDASLKSLVAENHGWKFEAKQTCGRIKIEAEKTHIDVPMYAIPKDEFQKKQIALEANRSFVKGAIFESYVADSITDDSETYELDSENVNLALREGDRKWINSDPKIVEDWFNDSCIRIGKHLRKVCRFMKAWRDAQWDVGGPSSISLMAATVNILDSVAHDASDLGETMKIIAKHLPSEFARGVESPDSTDEKPLFPPSYKHGPREMDIMSKLERLPEILSSAESADSKSEALKKINMAFGNRVTNSELIVLAKALPAFAQE 419 T 0.017 WEMBL pdbpssm F Bacteria T 4u0m 1 A,B A,B DNCV_VIBCH C-AMP-GMP SYNTHASE,DINUCLEOTIDE CYCLASE DNCV MRMTWNFHQYYTNRNDGLMGKLVLTDEEKNNLKALRKIIRLRTRDVFEEAKGIAKAVKKSALTFEIIQEKVSTTQIKHLSDSEQREVAKLIYEMDDDARDEFLGLTPRFWTQGSFQYDTLNRPFQPGQEMDIDDGTYMPMPIFESEPKIGHSLLILLVDASLKSLVAENHGWKFEAKQTCGRIKIEAEKTHINVPMYAIPKDEFQKKQIALEANRSFVKGAIFESYVADSITDDSETYELDSENVNLALREGDRKWINSDPKIVEDWFNDSCIRIGKHLRKVCRFMKAWRDAQWDVGGPSSISLMAATVNILDSVAHDASDLGETMKIIAKHLPSEFARGVESPDSTDEKPLFPPSYKHGPREMDIMSKLERLPEILSSAESADSKSEALKKINMAFGNRVTNSELIVLAKALPAFAQELEHHHHHH 427 T 0.048 WEMBL pdbpssm F Bacteria T 4u0n 1 A,B A,B DNCV_VIBCH C-AMP-GMP SYNTHASE, DINUCLEOTIDE CYCLASE DNCV MRMTWNFHQYYTNRNDGLMGKLVLTDEEKNNLKALRKIIRLRTRDVFEEAKGIAKAVKKSALTFEIIQEKVSTTQIKHLSDSEQREVAKLIYEMDDDARDEFLGLTPRFWTQGSFQYDTLNRPFQPGQEMDIDDGTYMPMPIFESEPKIGHSLLILLVDASLKSLVAENHGWKFEAKQTCGRIKIEAEKTHIDVPMYAIPKDELDSENVNLALREGDRKWINSDPKIVEDWFNDSCIRIGKHLRKVCRFMKAWRDAQWDVGGPSSISLMAATVNILDSVAHDASDLGETMKIIAKHLPSEFARGVESPDSTDEKPLFPPSYKHGPREMDIMSKLERLPEILSSAESADSKSEALKKINMAFGNRVTNSELIVLAKALPAFAQELEHHHHHH 391 T 0.0023 SMODS pdbpercent F Bacteria T 4u1e 2 B B EIF3B_YEAST EIF3B,CELL CYCLE REGULATION AND TRANSLATION INITIATION PROTEIN,EUKARYOTIC TRANSLATION INITIATION FACTOR 3 90 KDA SUBUNIT,EIF3 P90,TRANSLATION INITIATION FACTOR EIF3 P90 SUBUNIT SNAEADTAMRDLILHQRELLKQWTEYREKIGQEMEKSMNFKIFDVQP 47 T 0.026 WWE pdbpssm F Eukaryota T 4u1h 3 C C POL_HV1H2 TL9 PEPTIDE TPQDLNTML 9 T 0.13 Gag_p24 unphh T Viruses T 4u4c 2 B B AIR2_YEAST;PAP2_YEAST ARGININE METHYLTRANSFERASE-INTERACTING RING FINGER PROTEIN 2,DNA POLYMERASE KAPPA,DNA POLYMERASE SIGMA,TOPOISOMERASE 1-RELATED PROTEIN TRF4 GAASMEKNTAPFVVDTAPTTPPDKLVAPSIEEVNSNPNELRALRGQGRYFGVSDDDKDAIKEAAPKHGDEKDLANNDDFISLSASSEDEQAEQEEEREKQELEIKKEKQKEILNTD 116 T 0.051 RD3 unp F Eukaryota T 4u5b 2 E,F E,F CONII_CONST Con-ikot-ikot GPGSSGPADCCRMKECCTDRVNECLQRYSGREDKFVSFCYQEATVTCGSFNEIVGCCYGYQMCMIRVVKPNSLSGAHEACKTVSCGNPCA 90 T 0.032 DUF902 pdb F Eukaryota T 4u6x 3 C P ALQDA peptide, ALQDAGDSSRKEYFI ALQDAGDSSRKEYFI 15 T 0.34 SOTI pdbhh F T 4u6y 3 C P FLNKD peptide, FLNKDLEVDGHFVTM FLNKDLEVDGHFVTM 15 T 2.1 DUF4603 pdbhh F T 4u7e 2 B A IST1_HUMAN HIST1,PUTATIVE MAPK-ACTIVATING PROTEIN PM28 STSASEDIDFDDLSRRFEELKKKTW 25 T 1.5 INCA1 pdbhh F Eukaryota T 4u90 2 B,C D,E GBRA3_RAT GABA(A) RECEPTOR SUBUNIT ALPHA-3 FNIVGTTYPC 10 T 0.97 DUF749 pdbhh F Eukaryota T 4u91 2 B,C B,E GLYCINE RECEPTOR 58 KDA SUBUNIT FSIVGSLPRDC 11 T 0.35 MucB_RseB_C pdbhh F T 4ubf 2 E P KIF2C_HUMAN KINESIN-LIKE PROTEIN 6,MITOTIC CENTROMERE-ASSOCIATED KINESIN,MCAK QLEEQASRQISS 12 T 0.023 Fib_alpha unp F Eukaryota T 4ud7 2 E,F,G,H F,G,H,I YS-02 XTSFXEYWXLLPENYX 16 T 0.05 P53_TAD pdbhh F T 4uda 2 B B NCOA1_HUMAN NCOA-1, CLASS E BASIC HELIX-LOOP-HELIX PROTEIN 74, BHLHE74, PROTEIN HIN-2, RIP160, RENAL CARCINOMA ANTIGEN NY-REN-52, STEROID RECEPTOR COACTIVATOR 1, SRC-1, NCOA1 PEPTIDE PQAQQKSLLQQLLTE 15 T 1.8 GFD1 pdbhh F Eukaryota T 4ue0 1 A,B,C A,B,C Q997H2_ADEB4 FIBER GALTTSTRQGSRVVGFMDFIIALGWQIIPSNIRYIYILNCSQFMPTSDVTTIYFQADSGLESIFVMDSPFYASCTQQLPDKTIKTYGVTISKKQSIISINFSSSLEPNIMVSAWTASITRTQ 122 T 4.5 DUF2534 pdbhh T Viruses T 4ue1 2 E,F,G,H F,G,H,I YS-01 XTSFXEYWXLLPENFX 16 T 0.055 P53_TAD pdbhh F T 4ue4 2 B B FTSQ_ECOLI FTSQ SIGNAL SEQUENCE LFLLTVCTTVLVSGWVVLGWME 22 T 0.23 DUF5818 unppssm F Bacteria T 4uea 2 B,D,F B,D,F DESIGNED 4E-BP GPHMLERYSKVDLLALRYSPLSQTPPGIELEGRLRRMNIWRTGS 44 T 0.0054 EIF4E-T pdb F T 4uec 2 B B O61380_DROME EUKARYOTIC TRANSLATION INITIATION FACTOR 4G, ISOFORM C, FI02056P, TRANSLATION INITIATION FACTOR EIF4G GHMLEPETTLNDKQDSTDLKVKVSAKISSIINYNEGQWSPNNPSGKKQYDREQLLQLREVKASRIQPEVKNVSILPQP 78 T 0.00012 eIF_4G1 pdbhh F Eukaryota T 4uhp 1 A,C,E,G A,C,E,G Q51502_PSEAI PYOCIN AP41 LARGE COMPONENT DEPGVATGNGQPVTGNWLAGASQGDGVPIPSQIADQLRGKEFKSWRDFREQFWMAVSKDPSALENLSPSNRYFVSQGLAPYAVPEEHLGSKEKFEIHHVVPLESGGALYNIDNLVIVTPKRHSEIHKELKLKRKEK 136 T 0.0013 HNH pdb F Bacteria T 4ui9 18 T U FBX5_HUMAN PEPTIDE MSRRPCSCALRPPAAAAAAAAAAA 24 T 2.2 Toxin_14 pdbhh F Eukaryota T 4uj3 2 B,E,H,K,N,Q,T,W B,E,H,K,N,Q,T,W RAB3I_HUMAN RAB3A-INTERACTING PROTEIN, RABIN-3, SSX2-INTERACTING PROTEIN, RABIN8 GAASNKSTSSAMSGSHQDLSVIQPIVKDCKEADLSLYNEFRLWKDEPTMDRTCPFLDKIYQEDIFPCLTFSKSELASAVLEAVENNTLSIEPVGLQPIRFVKASAVECGGPKKCALTGQSKSCKHRIKLGDSSNYYYISPFCRYRITSVCNFFTYIRYIQQGLVKQQDVDQMFWEVMQLRKEMSLAKLGYFKEEL 195 T 0.11 SHE3 unphh F Eukaryota T 4um9 4 E,F E,F TGFB3_HUMAN LAP XHGRGDLGRLKKX 13 T 14 DUF1843 pdbhh F Eukaryota T 4umi 1 A A SPIKE_ADES1 SPIKE, PROTEIN IV GSSHHHHHHSSGLVPRGSHMASMTGGQQMGRGSLESYPLPPLVWDYSSKSLTLDIGPGLTVVNGKLQVIGATFSNQMSRMAPAPRADLQSNSIEPLPSPPSKTSLDIAEELQNDKGVSFAFQAREEELGAFTKRTLFAYSGDGLTGPFKAPASAELSSFLTAHPKGRWLIAFPLGTGIVSVDEGILTLEISRSLPEVGSGSSFYLTEK 208 T 0.0017 Adeno_shaft unppercent T Viruses T 4umn 2 C,D C,D M06 XTSFXEYWYLLXX 13 T 4.5 P53_TAD pdbhh F T 4uot 1 A,B,C,D,E A,B,C,D,E DESIGNED HELICAL BUNDLE 5H2L XTQEYLLKEIMKLLKEQIKLLKEQIKMLKELEKQ 34 T 0.023 DUF5320 pdbhh F T 4upu 2 B B IP3KA_HUMAN INOSITOL 1\,4\,5-TRISPHOSPHATE 3-KINASE A, IP3 3-KINASE A, IP3K A, INSP 3-KINASE A GEDVGQKNHWQKIRTMVNLPVISPFK 26 T 0.68 SR-25 unppercent F Eukaryota T 4uq2 3 E,F E,G AZOBENZENE-CONTAINING PEPTIDE AIMXYPK 7 T 21 FokI_C pdbhh F T 4usl 2 B D SORCN_HUMAN 22 KDA PROTEIN, CP-22, CP22, V19 MAYPGHPGAGGGYYPGGYGGAPGGPAFPGQTQ 32 T 220 Antimicrobial_5 pdbhh F Eukaryota T 4utn 2 C D SUCCINYL-CPS1-PEPTIDE XGVLXEYGV 9 T 21 DUF3744 pdbhh F T 4utr 2 C C CPSM_HUMAN 3-NITRO-PROPIONYL-CPS1 PEPTIDE XGVLKEYGV 9 F F Eukaryota T 4uu5 2 B B CRUM1_HUMAN PROTEIN CRUMBS HOMOLOG 1 RVEMWNLMPPPAMERLI 17 T 3 DUF1180 unphh F Eukaryota T 4uwx 2 C,D C,D LIPA3_MOUSE PROTEIN TYROSINE PHOSPHATASE RECEPTOR TYPE F POLYPEPTIDE-IN TERACTING PROTEIN ALPHA-3, PTPRF-INTERACTING PROTEIN ALPHA-3, LIPR IN-ALPHA3 TPRSARLERMAQALALQAGSP 21 T 6.5 WSN pdbhh F Eukaryota T 4ux6 1 A A NOS2_MOUSE INDUCIBLE NO SYNTHASE, INDUCIBLE NOS, INOS, MACROPHAGE NOS, MAC-NOS, NOS TYPE II, PEPTIDYL-CYSTEINE S-NITROSYLASE NOS2, INDUCIBLE NITRIC OXIDE SYNTHASE QYVRIKNWGSGEILHDTLHHKATS 24 T 5.8 EFP_N pdbhh F Eukaryota T 4ux9 2 E,F,G,H F,G,H,I MP2K7_HUMAN MAP KINASE KINASE 7, MAPKK 7, JNK-ACTIVATING KINASE 2, MAPK/ERK KINASE 7, MEK 7, STRESS-ACTIVATED PROTEIN KINASE KINASE 4, SAPK KINASE 4, SAPKK-4, SAPKK4, C-JUN N-TERMINAL KINASE KINASE 2, JNK KINASE 2, JNKK 2, MKK7 QRPRPTLQLPLA 12 T 23 Sec-ASP3 pdbhh F Eukaryota T 4uxe 1 A,B,C A,B,C FIBP_BPT4 PROXIMAL LONG TAIL FIBRE PROTEIN GP34, PROTEIN GP34 MGSSHHHHHHSQDPSQFIRRDIAQTVNGSLTLTQQTNLSAPLVSSSTGEFGGSLAANRTFTIRNTGAPTSIVFEKGPASGANPAQSMSIRVWGNQFGGGSDTTRSTVFEVGDDTSHHFYSQRNKDGNIAFNINGTVMPININASGLMNVNGTATFGRSVTANGEFISKSANAFRAINGDYGFFIRNDASNTYFLLTAAGDQTGGFNGLRPLLINNQSGQITIGEGLIIAKGVTINSGGLTVNSRIRSQGTKTSDLYTRAPTSDTVGFWSIDINDSATYNQFPGYFKMVEKTNEVTGLPYLERGEEVKSPGTLTQFGNTLDSLYQDWITYPTTPEARTTRWTRTWQKTKNSWSSFVQVFDGGNPPQPSDIGALPSDNATMGNLTIRDFLRIGNVRIVPDPVNKTVKFEWVE 410 T 5.8 Auxin_repressed pdbhh T Viruses T 4uzy 2 B B Q946G4_CHLRE INTRAFLAGELLAR TRANSPORT PROTEIN 52 NLIPPSFETPLPPLQPAVFPPTIREPPPPALELFDLDESFASLTNKCHGEED 52 T 24 Antimicrobial18 pdbhh F Eukaryota T 4uzz 2 B B I7LT74_TETTS INTRAFLAGELLAR TRANSPORT PROTEIN 52 GAASDEFASEKVRLAQLTNKCNNNDLDYYIKESGDILGVTDKVKNKHDAKAILRYVLEELINFKKLNN 68 T 0.018 RRM_1 pdbpercent F Eukaryota T 4v11 2 B B SV2A_HUMAN SV2A SDATEGHDED 10 T 13 Toxin_25 pdbhh F Eukaryota T 4v3p 53 AB Ld 60S ribosomal protein L29 KFLRNQRYSRKHNKKSGEAESEE 23 T 15 MAT1-1-2 pdbhh F T 4v4u 2 F,G,H,I,J S,T,U,V,W N-TERMINAL PEPTIDE OF FIBER PROTEIN TFNPVYPYDT 10 T 0.25 DUF3463 pdbhh F T 4v5z 76 XB B8 60S Ribosomal protein L35 ARVLTVINQT 10 T 0.0087 Ribosomal_L29 pdbhh F T 4v6u 32 FA BO RL18_PYRFU PFL18, 50S RIBOSOMAL PROTEIN L18 MAHGPRYRVPFRRRREGKTNYRKRLKLLKSGKPRLVVRKSLNHHIAQIIVYDPKGDRTLVSAHTRELIRDFGWKGHCGNTPSAYLLGLLIGYKAKQAGIEEAILDIGLHPPVRGSSVFAVLKGAVDAGLNVPHSPEIFPDEYRIRGEHIAEYAKMLKEQDEEKFRRQFGGYLVKGLDPEKLPEHFEEVKARIIEKFEGEGARE 203 T 1E-08 Ribosomal_L5e pdb F Archaea T 4w4z 2 E,F,G,H E,F,G,H APY-bAla8.am peptide APYCVYRXSWSCX 13 T 0.89 DUF1684 pdbhh F T 4w50 2 E,F,G,H E,F,G,H APY peptide APYCVYRGSWSC 12 T 1 DUF1684 pdbhh F T 4w5y 1 A,B A,B Prp peptide GYMLGSA 7 T 3 G0-G1_switch_2 pdbhh F T 4w67 1 A,B A,B PrP peptide GYVLGSA 7 T 11 DUF2148 pdbhh F T 4w8p 2 B B AB1IP_MOUSE APBB1-INTERACTING PROTEIN 1,PROLINE-RICH EVH1 LIGAND 1,PREL-1,PROLINE-RICH PROTEIN 48 NEDIDQMFSTLLGEMDLLTQS 21 T 6.8 Mvb12 pdbhh F Eukaryota T 4wa0 1 A A E4SDB5_CALK2 possible adhesin TSVPSSPLDYAIFSKGALNTNKNLTVENGSVYSGGDLTIDGGAVFNIDNLISKGEMVINQDSDSRCRDNNIVVRNIIYVEKSLKANRISPRSTNIDAKTIYVGQEMQLYGAGSYKFVQLFSDSNVKLAGPGVNMEVSTLASIRGTLEVIDGATVTLKSNSAVYCNSLVVRNGSRLILENGAKLYLATTPDASTIISIQNNGGTISYSSSFSYPSPPAEIDEIRNRDYTSGLLTTPLPADSVGSNQLGSTADTSQTPPQIVIYGESYINDNEARIEISARLGSPIVDFSTLQLHLISRGNITFVGGGLTIMNGSIISLGSTFNINATGNPYAGLTLKYQMPSPPIQQDIESNTGIQPSQ 358 T 0.032 FecR pdb F Bacteria T 4wci 2 B,D,F B,D,F RIN3_HUMAN RAS INTERACTION/INTERFERENCE PROTEIN 3 AKKNLPTAPPRRRVSE 16 T 11 COX8 pdbhh F Eukaryota T 4wfd 3 C,F,I C,F,I MTR4_YEAST MRNA TRANSPORT REGULATOR MTR4 MDSTDLFDVFEETPVELPTK 20 T 8.8 eIF3h_C pdbhh F Eukaryota T 4wj7 2 E,F,G,H W,X,Y,Z KRIT1 NPxY/F3 VDKVVINPYFGLG 13 T 0.029 MT-A70 pdbhh F T 4wjg 5 DA,E,J,O,T,Y 4,E,J,O,T,Y I7BA80_TRYBB Haptoglobin-hemoglobin receptor AEGLKTKDEVEKACHLAQQLKEVSITLGVIYRTTERHSVQVEAHKTAIDKHADAVSRAVEALTRVDVALQRLKELGKANDTKAVKIIENITSARENLALFNNETQAVLTARDHVHKHRAAALQGWSDAKEKGDAAAEDVWVLLNAAKKGNGSADVKAAAEKCSRYSSSSTSETELQKAIDAAANVGGLSAHKSKYGDVLNKFKLSNASVGAVRDTSGRGGKHMEKVNNVAKLLKDAEVSLAAAAAEIEEVKNAHETKAQEEMKRNGNPIENESETNSGGNAESQGNGDREDKNDEQQQVDEEETKVENGSSEEGSCCGNESNGPHVMKKRHGVEGPRPVDVVS 343 T 8.4E-05 GARP unphh F Eukaryota T 4wjp 2 B,D B,D Daxx GSGEAEERIIVLSDSDY 17 T 1.5 Rnk_N pdbhh F T 4wjv 3 I,J,K,L I,J,K,L NSA2_YEAST NOP7-ASSOCIATED PROTEIN 2 MDTDGDALPTYLLDREQNNTAK 22 T 5 Sec62 unppssm F Eukaryota T 4wjw 3 C P CHS3_YEAST CHITIN-UDP ACETYL-GLUCOSAMINYL TRANSFERASE 3,CLASS-IV CHITIN SYNTHASE 3 DDYYLNLNQDEESLLRSRC 19 T 3.2 DUF3305 pdbhh F Eukaryota T 4wk4 3 C C ALA-CYS-ARG-GLY-ASP-GLY-TRP-CYS ACRGDGWC 8 T 0.14 Peptidase_C65 pdbhh F T 4wnd 2 B B FRPD4_HUMAN PDZ DOMAIN-CONTAINING PROTEIN 10,PSD-95-INTERACTING REGULATOR OF SPINE MORPHOGENESIS,PRESO GPLGSDLPPKVVPSKQLLHSDHMEMEPETMETKSVTDYFSKLHMGSVAYSCTS 53 T 100 EB1 pdbhh F Eukaryota T 4wne 2 B B FRPD4_HUMAN PDZ DOMAIN-CONTAINING PROTEIN 10,PSD-95-INTERACTING REGULATOR OF SPINE MORPHOGENESIS,PRESO KQLLHSDHMEMEPETMETKSVTDYF 25 T 83 SfsA pdbhh F Eukaryota T 4wng 2 B B FRPD4_HUMAN PDZ DOMAIN-CONTAINING PROTEIN 10,PSD-95-INTERACTING REGULATOR OF SPINE MORPHOGENESIS,PRESO GPGSDLPPKVVPSKQLLHSDHMEMEPETMETKSVTDYFSKLHMGSVAYSCTSEFHHHHHH 60 T 130 EB1 pdbhh F Eukaryota T 4wnl 2 E,F,G,H E,F,G,H SHE3_YEAS6 SWI5-dependent HO expression protein 3 RSFYTASPLLSSGSIPKSASPVLPGVKRTASVR 33 T 0.00026 CCDC73 unphh F Eukaryota T 4wnn 3 I T SPT16_YEAST SPT16 GIKKTDDEASDESEEEVSEY 20 T 0.11 SAPS unppssm F Eukaryota T 4wpb 2 C,D C,D alpha/beta-VEGF-1 VXNKXNKEXCNXRAIEXALDPNLNDQQFHXKIWXIIXDCX 40 T 6.5 Vel1p pdbhh F T 4wph 2 C,D C,D ICP0_HHV11 ICP0 GPRKCARKTRH 11 T 2.8 Adeno_E4_34 pdbhh T Viruses T 4wpx 2 B,E B,E G0SGL4_CHATD Putative SAC3 family protein GHMKPKRDLMADFTKWFVTGDGGIMEEFTEETLRHLLWDVWQRHQREEAERKRKAEEEESWRLAREHLTHRLQVKYFYRWREKARALAT 89 T 0.12 PV_NSP1 pdb F Eukaryota T 4wsf 2 B B Q9VHP9_DROME FI18815P1 PDESSADVVFKKPLAPAPR 19 T 0.4 TSSC4 pdbhh F Eukaryota T 4wsi 2 C,D X,Y CRB_DROME 95F GPGSEFRNKRATRGTYSPSAQEYCNPRLEMDNVLKPPPEERLI 43 T 0.031 TMEM154 unphh F Eukaryota T 4wv6 2 B,C B,C TAF8_HUMAN PROTEIN TAUBE NUSS,TBP-ASSOCIATED FACTOR 43 KDA,TBP-ASSOCIATED FACTOR 8,TRANSCRIPTION INITIATION FACTOR TFIID 43 KDA SUBUNIT,HTAFII43 PVKKPKIRRKKSLS 14 T 24 Ribosomal_L29e pdbhh F Eukaryota T 4wvd 1 A,C C,D NCOR1_HUMAN N-COR1 SNLGLEDIIRKALMGSF 17 T 3.1 RuvA_C pdbhh F Eukaryota T 4wvi 2 B D substrate peptide (pep2) GGGGAVPTAKA 11 T 24 DUF3034 pdbhh F T 4wvj 2 B D inhibitor peptide (PEP3) GGGGGAPTAKAPSK 14 T 29 DUF4023 pdbhh F T 4wwr 2 B,D,F,H A,G,C,E BAG6_HUMAN BAG FAMILY MOLECULAR CHAPERONE REGULATOR 6,BCL2-ASSOCIATED ATHANOGENE 6,BAG6,HLA-B-ASSOCIATED TRANSCRIPT 3,PROTEIN G3,PROTEIN SCYTHE MQLLLSEAVSRAAKAAGARPLTSPESLSRDLEAPEVQESYRQQLRSDIQKRLQ 53 T 0.48 DUF2939 unp F Eukaryota T 4wx4 2 B C peptide VKSLKRRRCY 10 T 0.00019 MCPVI pdbhh F T 4wym 2 M,N,O,P,Q,R,S,T,U,V,W M,N,O,P,Q,R,S,T,U,V,W CPSF6_HUMAN CLEAVAGE AND POLYADENYLATION SPECIFICITY FACTOR 68 KDA SUBUNIT,CPSF 68 KDA SUBUNIT,PRE-MRNA CLEAVAGE FACTOR IM 68 KDA SUBUNIT,PROTEIN HPBRII-4/7 GTPVLFPGQPFGQPPLG 17 T 2.2 MlaD pdbhh F Eukaryota T 4wyq 1 A,D A,D DICER_HUMAN HELICASE WITH RNASE MOTIF,HELICASE MOI YERLLMELEEALNFINDCNISVHSKERDSTLISKQILSDCRAVLVVLGPWCADKVAGMMVRELQKYIKHEQEELHRKFLLFTDTFLRKIHALCEEHFSPASLDLKFVTPKVIKLLEILRKYKP 123 T 0.088 Tyrosinase pdbpssm F Eukaryota T 4wyu 2 B,D D,C SYNTHETIC PDZ BINDING MOTIF SWFQTDL 7 T 12 DOR pdbhh F T 4wzn 1 A,B A,B POLG_HAVHM Genome polyprotein SMMSRIAAGDLESSVDDPRSEEDKRFESHIECRKPYKELRLEVGKQRLKYAQEELSNEVLPPPRKMKGLFSQAKISLFYTEEHEIMKFSWRGVTADTRALRRFGFSLAAGRSVWTLEMDAGVLTGRLIRLNDEKWTEMKDDKIVSLIEKFTSNKYWSKVNFPHGMLDLEEIAANSKDFPNMSETDLCFLLHWLNPKKINLADRMLGLSGVQEIKEQG 217 T 46 APC_15aa pdbhh T Viruses T 4wzx 2 B E IST1_HUMAN HIST1,PUTATIVE MAPK-ACTIVATING PROTEIN PM28 TSASEDIDFDDLSRRFEELKKKT 23 T 2.6 TACC_C pdbhh F Eukaryota T 4x01 1 A,B,C,D,E,F,G,H A,B,C,D,E,F,G,H COM1_SCHPO DOUBLE-STRAND BREAK REPAIR PROTEIN CTP1,MEIOTICALLY UP-REGULATED GENE 38 PROTEIN,NBS1-INTERACTING PROTEIN 1,SPORULATION IN THE ABSENCE OF SPO11 PROTEIN 2 HOMOLOG,SAE2 MEHNKSVHWSIVYRQLGNLLEQYEVEIARLKSQLVLEKKLRIQVEKEMESVKTKQIS 57 T 0.00099 Lzipper-MIP1 unppercent F Eukaryota T 4x0w 1 A P mupain-1-17 CPAYSXYLDC 10 T 1.4 DUF6438 pdbhh F T 4x1h 2 B C C-terminal derived peptide of guanine nucleotide-binding protein G(t) subunit alpha-1 VLEDLKSCGLF 11 T 2.7 Defensin_RK-1 pdbhh F T 4x1n 1 A P mupain-1-16 CPAYSAYLDC 10 T 1.4 DUF6438 pdbhh F T 4x1q 1 A P mupain-1 CPAYSRYLDC 10 T 0.32 Hormone_2 pdbhh F T 4x1s 1 A P mupain-1-16 CPAYSAYLAC 10 T 2.1 DUF6438 pdbhh F T 4x1v 2 B B ARAP1_HUMAN CENTAURIN-DELTA-2,CNT-D2 RPTPRPVPMKRHIFRS 16 T 27 DUF3864 pdbhh F Eukaryota T 4x23 7 K,L,W,X V,U,X,W Q66LH7_RAT CENP-C PNVRRSNRIRLKPLEYWRGERIDYQ 25 T 0.6 CENP-C_mid pdbhh F Eukaryota T 4x2h 3 C C G0SGL4_CHATD SER-SER-VAL-PHE-GLY-ALA-PRO-ALA MMAPANNPFGAPPAQVNNPF 20 T 2.9 NpwBP pdbhh F Eukaryota T 4x2m 1 A,B A,B G0SG92_CHATD Mtr2 MLSRRYAAKSFVEWYYRQINENKPVASGYVNNNATYTKAGHPPADITINGRVVATPEEWDTMLKEQRAQHNTSSSSTLPIGRKPVRYDVDCFDVHVINADYRFAAPQRMIEQHAPTDGVRMMMALTVSGSVYFGASPRSTDDYVIKQHFNDVFILVPNWDVLEKPGARSGRKYLIASHKYRAY 183 T 0.17 NTF2 unppercent F Eukaryota T 4x2o 3 C C G0SGL4_CHATD Putative SAC3 family protein FASPAPSNQGSSVFGAPAQST 21 T 4.1 DUF765 pdbhh F Eukaryota T 4x34 2 C,D C,D P53_HUMAN THR-SER-ARG-HIS-ALY-MLY-LEU-MET-PHE-LYS TSRHXKLMFK 10 T 21 DUF420 pdbhh F Eukaryota T 4x3e 2 B B ALA-GLN-ARG-M3L-PHE-ALA-GLN-SER RLQAQRKFAQSQY 13 T 28 DUF4395 pdbhh F T 4x3i 2 B B KCC2A_MOUSE CAM KINASE II SUBUNIT ALPHA, CAMK-II SUBUNIT ALPHA ATRNFSG 7 T 1.7 IER unppercent F Eukaryota T 4x6s 2 C,D L,M Phosphotyrosine mimetic inhibitor peptide G7-TEM1 WFEGXDNTFPX 11 T 0.65 Caf4 pdbhh F T 4x6z 15 CA,DA a,e synthetic peptide (polymer) RRRPRPPYLPRFG 13 T 6.4 TAF8_C pdbhh F T 4x86 2 B B BAG6_HUMAN BAG FAMILY MOLECULAR CHAPERONE REGULATOR 6,BCL2-ASSOCIATED ATHANOGENE 6,BAG6,HLA-B-ASSOCIATED TRANSCRIPT 3,PROTEIN G3,PROTEIN SCYTHE GPLGSAKRRKTMQGEGPQLLLSEAVSRAAKAAGARPLTSPESLSRDLEAPEVQESYRQQLRSDIQKRLQEDPNYSPQRFPN 81 T 0.038 Phosducin pdbpssm F Eukaryota T 4x8n 2 B B RBBP5_HUMAN RBBP-5,RETINOBLASTOMA-BINDING PROTEIN RBQ-3 ERESEFDIED 10 T 0.014 DUF2457 unppercent F Eukaryota T 4x8p 2 B B RBBP5_HUMAN RBBP-5,RETINOBLASTOMA-BINDING PROTEIN RBQ-3 EYEERESEFDIE 12 T 0.014 DUF2457 unppercent F Eukaryota T 4x9z 1 A,B A,B CDKA_CONGR alphaD-conotoxin GeXXA from the venom of Conus generalis DVHRPCQSVRPGRVWGKCCLTRLCSTMCCARADCTCVYHTWRGHGCSCVM 50 T 12 Tachystatin_A pdbhh F Eukaryota T 4xa9 2 B,D,F,H,J,L,N,P a,b,c,d,e,f,g,h Q5ZWY9_LEGPH Uncharacterized protein GMAIAPQQIQERLKQEQYQKFVVADIGNFPHCLAQTPEGIASGQRYQKYSTNSLSRTPPFSQWGAPQLLTPKSAQEYIKFAQQRNKKSSFKIDGEAVRVSECSNFAYHSAGVLLDDPQIRTQYDVAVIGSMHSNGRYLHNITLLVPKGSRLPQPPQQLTAEVFPIGTLIVDPWAVGMGHPPEQALAIPKEQFAYNRSLFPATVNYQSALDESLTSTRTGQLTPYTGTPSRT 231 T 0.55 eIF_4EBP pdbpssm F Bacteria T 4xc2 2 E,F,G,H E,F,G,H KBTB6_HUMAN Kelch repeat and BTB domain-containing protein 6 SDDDFWVRVAP 11 T 0.5 BSD pdbhh F Eukaryota T 4xdn 2 B B SCC2_YEAST Sister chromatid cohesion protein 2 MKSSHHHHHHENLYFQSNAMSYPGKDKNIPGRIIEALEDLPLSYLVPKDGLAALVNAPMRVSLPFDKTIFTSADDGRDVNINVLGTANSTTSSIKNEAEKERLVFKRPSNFTSSANSVDYVPTNFLEGLSPLAQSVLSTHKGLNDSINIEKKSEIVSRPEAKHKLESVTSNAGNLSFNDNSSNKKTKTSTGVTMTQANLA 200 T 0.64 DUF3910 pdbpercent F Eukaryota T 4xef 2 B,C,E,F B,C,E,F LPXN_HUMAN 20-mer peptide containing LD1 motif of leupaxin MEELDALLEELERSTLQDSD 20 T 1.8 Paxillin unphh F Eukaryota T 4xek 2 B C LPXN_HUMAN 19-mer peptide containing Leupaxin LD4 motif KTSAAAQLDELMAHLTEMQ 19 T 0.057 GET2 unppssm F Eukaryota T 4xh2 3 M,N,O,P,Q,R a,c,e,g,h,j PAXI_HUMAN paxillin LD4 WGGSATRELDELMASLSD 18 T 1.8 SAM_LFY pdbhh F Eukaryota T 4xhv 2 B B Q3KN41_DROME Neurexin 1 DSKDVKEWYV 10 T 12 DUF3929 pdbhh F Eukaryota T 4xi7 2 B C JAG1_HUMAN Jagged 1 N-box peptide NQIKNPIEKHG 11 T 0.036 SID-1_RNA_chan unppssm F Eukaryota T 4xib 2 B C DL_DROME Delta N-box peptide NIIKNTWDKSV 11 T 0.16 DAG1 unphh F Eukaryota T 4xif 2 E,F,G,H E,F,G,H K2C7_HUMAN CYTOKERATIN-7,CK-7,KERATIN-7,K7,SARCOLECTIN,TYPE-II KERATIN KB7 GPVFTSRSAAG 11 T 0.047 Keratin_2_head unppssm F Eukaryota T 4xng 1 A,B,C,D A,B,C,D Y218A_MYCGE Uncharacterized protein MG218.1 ASSFHNFSKETLQKQAKRGFLLLERCSLVGLQQLELEYVNLLGRSFDSYQQKTELLNNLKELVDEHFSDTEKIINTLEKIFDVIGGSEYTPVLNSFFNKLLSDPDPMQREIGLRQFIITLRQRFKKLSQKIDSSLKQIETEAKA 144 T 0.51 DUF1043 pdb F Bacteria T 4xoj 2 B B SFTI1_HELAN SFTI-1 GRCTKSIPPICFP 13 T 0.0023 Bowman-Birk_leg pdb F Eukaryota T 4xpm 1 A A MEH1_YEAST EGO COMPLEX SUBUNIT 1,GSE COMPLEX SUBUNIT 2 SPDSAKISKEQLKKLHSNILNEIFSQSQVNKPGPLTVPF 39 T 0.15 SDA1 unppercent F Eukaryota T 4xst 2 B F INSR_RAT IR ESSFRKTFEDYLHNVVFVPRKTS 23 T 3.7 YvbH_ext pdbhh F Eukaryota T 4xvn 1 A,B,C,D,E,F A,B,C,D,E,F TERS_BPG20 Small terminase GSHMSVSFRDRVLKLYLLGFDPSEIAQTLSLDVKRKVTEEEVLHVLAEARELLSALPSLEDIRAEVGQALERARIFQKDLLAIYQNMLRNYNAMMEGLTEHPDGTPVIGVRPADIAAMADRIMKIDQERITALLNSLKVLGHVGSTTAGALPSATELVSVEELVAEVVDEAPKT 174 T 0.031 PLU-1 unppssm T Viruses T 4xxc 3 C B ASP-GLU-LEU-GLU-ILE-LYS-ALA-TYR DELEIKAY 8 T 1.6 Wyosine_form pdbhh F T 4xzr 1 A A SRC1_YEAST HELIX-EXTENSION-HELIX DOMAIN-CONTAINING PROTEIN 1 SDTRKKRKDPDSDDWSESNSKENKIDNKHLNLLSSDSEIEQDYQKAKKRKTSDL 54 T 0.092 Nop14 unp F Eukaryota T 4xzx 1 A A Q8VSD5_SHIFL OSPI GPLGSPEFMINGVSLQGTAGYEAHTEEGNVNVKKLLESLNSKSLGDMDKDSELAATLQKMINPSGGDGNSSGCALHACMAMLGYGVREAPVPNEISEYMTGFFHRHLEQIDSEGIVSHPNETYSKFRERIAENILQNTSKGSVVMISIEQATHWIAGFNDGEKIMFLDVQTGKGFNLYDPVEKSPDAFVDENSSVQVIHVSDQEFDHYANSSSWKSKRLC 220 T 0.031 Gln_amidase pdbpercent F Bacteria T 4y2g 2 B B F175A_HUMAN COILED-COIL DOMAIN-CONTAINING PROTEIN 98,PROTEIN FAM175A YSRSPTF 7 T 0.082 PipA pdbhh F Eukaryota T 4y5i 2 C,D F,G TAU_HUMAN NEUROFIBRILLARY TANGLE PROTEIN,PAIRED HELICAL FILAMENT-TAU,PHF-TAU XRTPSLPTX 9 T 2.9 UPF0167 pdbhh F Eukaryota T 4y6o 2 C,D C,D TNFA_HUMAN peptide LEU-PRO-LYS-MYK-THR-GLY-GLY LPKXTGG 7 T 15 SpoV pdbhh F Eukaryota T 4ycz 3 C C G2Q2S2_MYCTT Nup120 GPGSEFELMQGGSSTNHETAGLRTEMLSRLFTAATSISHFEEAHSALLSMDDEAMQKSYLRRLVEKMCETGQSSELITLPFSGLQTKVDDILVEKCRATRDVLNGVPYHQILYAWRINHNDYRGGAAILLDRLQKLRRAGEGDKVIANEHGNEDALDTQVTRQYLLLINALSCVPPQEAYILEDVLPGDGRGGDDADGDRNGGKAGDDLEADIDELEKKLDVEGGADAAKGDEMAAEEDAALIEKMKRFSTRNGQNLPARRLLMLADLRKQYQQELDRIVAIQNNQFGFGAEDDLMDLAGGSGHHHHHHHHHH 313 T 0.02 ELYS pdbpercent F Eukaryota T 4yh8 2 B B U2AF2_SCHPO U2 AUXILIARY FACTOR 59 KDA SUBUNIT,U2AF59,U2 SNRNP AUXILIARY FACTOR LARGE SUBUNIT GGSSVGRSRSPPPSRERSVRSIEQELEQLRDVTPINQWKRKRSLWDIKPPGYELVTADQAKMSGVFPLPGA 71 T 9.8 Transformer unphh F Eukaryota T 4yiz 2 D,E,F B,D,F U6KQJ2_EIMTE Rhoptry neck protein 2, putative GSASDITQHLNDSGLGPAVECLENLVVGPVCPAAVVAPAV 40 T 8.5 LisH_TPL pdbhh F Eukaryota T 4yl6 2 B B M3K3_HUMAN MAPK/ERK KINASE KINASE 3,MEKK 3 MDEQEALNSIMNDLVALQMNRR 22 T 4.5 DUF3040 pdbhh F Eukaryota T 4ym4 2 B B TIFA_HUMAN THR9 PHOSPHORYLATED N-TERMINAL PEPTIDE MTSFEDADTEET 12 T 120 Soyouz_module pdbhh F Eukaryota T 4ynh 1 A,B A,B SAS5_CAEEL SAS-5 GPLGSKIASAREVIKRDGVIPPEALTIIEQRLRSDPMFRQQIDNVLADAECDANRAAYSP 60 T 0.17 T3SS_needle_E pdbhh F Eukaryota T 4yom 2 B A BRSK2_MOUSE SADA,SERINE/THREONINE-PROTEIN KINASE SAD-A MKKSWFGNFINLEKEEQIFVVIKDKPLSSIKADIVHAFLSIPSLSHSVISQTSFRAEYKATGGPAVFQKPVKFQVDITYTEGGEAQKENGIYSVTFTLLSGPSRRFKRVVETIQAQLLSTHDQPSAQHLSGIIPKSLEHHHHHH 144 T 0.12 Fungal_KA1 pdb F Eukaryota T 4yr6 3 C,F C,F GP1BA_HUMAN ACE-LYS-LEU-ARG-GLY-VAL-LEU-GLN-GLY-HIS-LEU XKLRGVLQGHL 11 T 9.8 Pinin_SDK_memA pdbhh F Eukaryota T 4yuu 20 EC,IB,NA,T w2,W2,w1,W1 PEPTIDE CHAIN UNASSIGNED AAWFAVSAVALVVVAAVLVAVAAAA 25 T 2.2 UL42 pdbhh F T 4yvm 1 A,B A,B Q75XL3_HELPX CAGL GSHMEDITSGLKQLDSTYKETNQQVLKNLDEIFSTTSPSANDKIGKEDALNIKKAAIALRGDLALLKANFEANELFFISEDVIFKTYMSSPELLLTYMKINPLDQKTAEQQCGISDKVLVLYCEGKLKIEQEKQNIRERLETSLKAYQSNIGGTASLITASQTLVESLKNKNFIKGIRKLMLAHDKVFLNYLEKLDALEISLEQSKRQYLQERQSSKVIVK 221 T 0.0044 IDO pdbpssm F Bacteria T 4yxy 1 A,B,C,D A,B,C,D dTor_9x31L MASSHHHHHHSSGLVPRGSSMASGISVEELLKLAKAAYYSGTTVEEAYKLALKLGISVEELLKLAEAAYYSGTTVEEAYKLALKLGISVEELLKLAKAAYYSGTTVEEAYKLALKLG 117 T 0.021 T2SSF pdbpssm F T 4yyp 2 B B STIL_HUMAN TAL-1-INTERRUPTING LOCUS PROTEIN PDAYRFLTEQDRQLRLLQAQIQRLLEAQSLMP 32 T 0.091 ACT_5 pdbpercent F Eukaryota T 4yzh 2 B B CB1A_ARATH CHLOROPHYLL A-B PROTEIN 165,CAB-165,LHCII TYPE I CAB-2 RKTVAKPKGPSGSPW 15 T 2 Peptidase_S29 pdbhh F Eukaryota T 4z09 2 B C Q8IKV6_PLAF7 Rhoptry neck protein 2 CFTARMSPPQQIC 13 T 1.1 Bowman-Birk_leg pdbhh F Eukaryota T 4z0d 2 B C Q8IKV6_PLAF7 Rhoptry neck protein 2 CWTTRMSPPQQIC 13 T 0.61 Bowman-Birk_leg pdbhh F Eukaryota T 4z0f 2 B C Q8IKV6_PLAF7 Rhoptry neck protein 2 CXTTRMSPPQQIC 13 T 0.61 Bowman-Birk_leg pdbhh F Eukaryota T 4z0w 1 A,B A,B PEPTAIBOL GICHIGAMIN XXPXPFXPAXXAXXLXXLXXLXG 23 T 50 DUF688 pdbhh F T 4z29 1 A,B A,B A4TUL6_9PROT Magnetotaxis protein MtxA MASWSHPQFEKGADDDDKSEPPVSMLMQVAGAVETSKGGEKWAPVTRNKFLFVGTQVRTGADGGGKLIDQNSGMAQTIGANSVVEITAAGPKAVSGSLSAPEAASGDLVAGLSNRFAEAQRYTTVRRSVKKEAADLKLRVASDITLSPTYPDLVWENMGAQYGYTLVIDGTSHAVPATSGEMVRFRVPSLTPGAHSFGVTVTEGGQAVGQTEKGGTIVWLSATEDKALVDGVARVKAASTGDEFALGNYLDSKGVTVAAMDAYRKHFASHKDDNDMRPLLIKTYNDLKLRDLRQKEALVYNEQLEGNPGFSSISAHHHHHHHHHH 325 T 3.2E-05 DUF928 unphh F Bacteria T 4z2o 2 B P HOAVI_HOEPD Hoef-peptide SVATVSESLLTE 12 T 15 Pollen_allerg_2 pdbhh F Bacteria T 4z2p 2 B,D P,C HOAVI_HOEPD Hoef-peptide (L9F) SVATVSESFLTE 12 T 12 DUF4325 pdbhh F Bacteria T 4z33 2 C,D C,D FZD7_HUMAN LYS-GLY-GLU-THR-ALA-VAL KGETAV 6 T 190 Phage_SSB pdbhh F Eukaryota T 4z6y 1 A,C,E,G B,G,E,A TBCD7_HUMAN CELL MIGRATION-INDUCING PROTEIN 23 GVEEKKSLEILLKDDRLDTEKLCTFSQRFPLPSMYRALVWKVLLGILPPHHESHAKVMMYRKEQYLDVLHALKVVRFVSDATPQAEVYLRMYQLESGKLPRSPSFPLEPDDEVFLAIAKAMEEMVEDSVDCYWITRRFVNQLNTKYRDSLPQLPKAFEQYLNLEDGRLLTHLRMCSAAPKLPYDLWFKRCFAGCLPESSLQRVWDKVVSGSCKILVFVAVEILLTFKIKVMALNSAEKITKFLENIPQDSSDAIVSKAIDLWHKHCGTPVHSS 273 T 1.8 RabGAP-TBC pdbpercent F Eukaryota T 4z7i 2 C,D C,D DG025 transition-state analogue enzyme inhibitor XXKHHAFSFK 10 T 18 SmaI pdbhh F T 4z80 2 B,D C,D B6KQU6_TOXGV Cytoadherence-linked asexual protein GSASQIVQNQSSLAPELSGCPPMGICMDGTIGDPIAS 37 T 0.18 MSC pdbhh F Eukaryota T 4z88 2 M,N,O,P,Q,R,S,T,U,V,W,X M,N,O,P,Q,R,S,T,U,V,W,X JIP1_DROME JIP-1,APP-LIKE-INTERACTING PROTEIN 1,APLIP1,PROTEIN EYE DEVELOPMENTAL SP512 XTRRRRKLPEIPKNKKX 17 T 17 Curto_V3 pdbhh F Eukaryota T 4z89 2 K,L,M,N,O,P,Q,R,S,T a,b,c,d,e,f,g,h,i,j CAC1A_DROME PROTEIN CACOPHONY,PROTEIN NIGHTBLIND A,PROTEIN NO-ON-TRANSIENT B,DMCA1A XIGRRLPPTPSKPSTLX 17 T 16 Oxidored-like pdbhh F Eukaryota T 4z8c 55 CB,FD 1z,2z Oncocin VDKPPYLPRPRPPRRIYNR 19 T 0.18 Apidaecin pdbhh F T 4z8j 2 B B PTH1R_HUMAN C-terminal PDZ binding motif from parathyroid hormone receptor (PTHR) QEEWETVM 8 T 0.21 Prp19 pdbhh F Eukaryota T 4z8m 2 C,D C,D MAVS_HUMAN MAVS,CARD ADAPTER INDUCING INTERFERON BETA,CARDIF,INTERFERON BETA PROMOTER STIMULATOR PROTEIN 1,IPS-1,PUTATIVE NF-KAPPA-B-ACTIVATING PROTEIN 031N,VIRUS-INDUCED-SIGNALING ADAPTER,VISA GPCHGPEENEYKSEGTFGI 19 T 3.6 GRA6 unphh F Eukaryota T 4z8q 2 B B Q6TKR9_9XANT AvrRxo1-ORF2 MKTLTGADALEFHKKLKERNKALHASDLELALVHADAVGKERFDLEELEKICDTSDAGRLTDAKERNDIYERMYYVEYPNVMTLKEFAHIVETLFSWS 98 T 0.39 Rnk_N pdbpssm F Bacteria T 4z96 2 B C DNMT1_HUMAN DNMT1,CXXC-TYPE ZINC FINGER PROTEIN 9,DNA METHYLTRANSFERASE HSAI,M.HSAI,MCMT SDWPNHARSPGNKGKGKGKGKGKPKSQACEPSE 33 T 37 Ribosomal_L16 pdbhh F Eukaryota T 4z97 2 B C DNMT1_HUMAN DNMT1,CXXC-TYPE ZINC FINGER PROTEIN 9,DNA METHYLTRANSFERASE HSAI,M.HSAI,MCMT SDWPNHARSPGNKGKGKGQGKGKPKSQACEPSE 33 T 38 Ribosomal_L16 pdbhh F Eukaryota T 4za1 1 A,B,C A,B,C C6FX40_STRAS NosA MTEHPAQQLYCTVVLWDLSRSAATVASLRAYLRDHAVDAYTTVPGLRQKTWISSTGPEGEQWGAVYLWDSPEAAYGRPPGVSKVVELIGYRPTERRYYSVEAATEGPAAAAAPFGKGLGLAFDPASPEPLTRPQEFVPPGADAFIPSRPPA 151 T 0.0033 ydhR pdb F Bacteria T 4zc4 1 A,B,C,D A,B,C,D LARP1_HUMAN LA RIBONUCLEOPROTEIN DOMAIN FAMILY MEMBER 1 GHSGGGGGGHMQHPSHELLKENGFTQHVYHKYRRRCLNERKRLGIGQSQEMNTLFRFWSFFLRDHFNKKMYEEFKQLALEDAKEGYRYGLECLFRYYSYGLEKKFRLDIFKDFQEETVKDYEAGQLYGLEKFWAFLKYSKAKNLDIDPKLQEYLGKFRRLED 162 T 0.62 Frankia_peptide pdbpercent F Eukaryota T 4zdt 1 A,C A,C SLX1_SCHPO Structure-specific endonuclease subunit slx1 MEPVKCNLCYECIESDELRANCPFTDCNSINHLTCLASSFLTEECQVLPIEGMCTKCKRVLRWREFLSTVFTT 73 T 0.00013 FANCL_C pdbpercent F Eukaryota T 4zhl 2 B P mupain-1-IG CPAYSRYIGC 10 T 1.9 DUF6438 pdbhh F T 4zhm 1 A P mupain-1-16-IG CPAYSAYIGC 10 T 0.88 DUF6438 pdbhh F T 4zi1 2 B B NCOA5_HUMAN NCOA-5,COACTIVATOR INDEPENDENT OF AF-2,CIA AIESLIDLLADN 12 T 2.8 HEAT_PBS pdbhh F Eukaryota T 4zj7 2 B B CDC14_YEAST Tyrosine-protein phosphatase CDC14 TILRQLLPKNRRVTSGRRTTSAAGGIRKISGSIKK 35 T 0.074 BATS pdbpssm F Eukaryota T 4zjx 2 B B circular peptide inhibitor CXRWTKCLX 9 T 1 Brr6_like_C_C pdbhh F T 4zko 3 C Q C-terminal fragment of upain-1-W3A GLENHRMC 8 T 6.8 SARS_3b pdbhh F T 4zkq 1 A A E9M5R0_9GAMA Putative uncharacterized protein GPVGEPVASEINEASKVSSRLLTQDILFRKDRQATISLPIKLPVEDIITQTCDKITYGPLKFLDLLEKETAVLPLSTDITCPACLGRAVLVGKWECPAHVAVNESDLTVFGPNKEEHVPQFVTVQQPSDGKMQRLFFAKFLGTEESLAVLRVPGPDGHLCIQEALIHFKELSGAGVCSLWKANDSREEGLEMKQVDCLETTVLENQTCIATTLSKKIYHRLYCGERLMTGGQVSTRVLLTALGFYKRQPYTFHRVPKGMVYVHLIDSGSEDYMEYSECEEVTPGRYEDKQISYTFYTDLFQTADGEPVLASVWGTSGLKDSAYESCAFVIPTKGRRKLVPRRIMSKCYPFRLTYHPSTMTVRLDVRVEKHHGATDQGFVFLKMESGTYSEGREYYLDRVLWGEDSSTNNVLQHHHHHHHH 420 T 0.2 DUF4787 pdb T Viruses T 4zks 2 B P upain-1-W3A CSARGLENHAAC 12 T 6.4 LRRNT pdbhh F T 4zlt 1 A,B B,A E9M5R0_9GAMA Putative uncharacterized protein GPVGEPVASEINEASKVSSRLLTQDILFRKDRQATISLPIKLPVEDIITQTCDKITYGPLKFLDLLEKETAVLPLSTDITCPACLGRAVLVGKWECPAHVAVNESDLTVFGPNKEEHVPQFVTVQQPSDGKMQRLFFAKFLGTEESLAVLRVPGPDGHLCIQEALIHFKELSGAGVCSLWKANDSREEGLEMKQVDCLETTVLENQTCIATTLSKKIYHRLYCGERLMTGGQVSTRVLLTALGFYKRQPYTFHRVPKGMVYVHLIDSGSEDYMEYSECEEVTPGRYEDKQISYTFYTDLFQTADGEPVLASVWGTSGLKDSAYESCAFVIPTDGEEDLVPRRIMSKCYPFRLTYHPSTMTVRLDVRVEKHHGATDQGFVFLKMESGTYSEGREYYLDRVLWGEDSSTNNVLQHHHHHHHH 420 T 0.2 DUF4787 pdb T Viruses T 4zmk 1 A A TAZ1_SCHPO Telomere length regulator taz1 DTFSERTLGLNSIDNTEISEVVSLGLVSSALDKITGLLSADNLSETVSQARDFSHTLSKSLKSRAKSLSQK 71 T 16 Leptin pdbhh F Eukaryota T 4zny 2 B B POL_HTL1C T-cell leukemia virus type I, partial gag gene; HTLV1 (human T-lymphotropic virus type I) YVEPTAPQVL 10 T 19 DUF2992 pdbhh T Viruses T 4zoq 1 A,C,E,G,I,K,M,O A,B,C,D,E,F,G,H Q65DC7_BACLD LANP PROTEIN MKRIYIFLLCFAVLLPVGGKTAQAKEQAGEQYLLLEHVKDKSKLLDTAEQFHIHADVIEEIGFAKVTGEKQKLAPFTKKLAEKVGADVIEKPIANTAVNE 100 T 1 Mfp-3 unphh F Bacteria T 4zqu 1 A A CDIA_YERPY CdiA-CT toxin, Conserved domain protein MPWEDYVGKTLPVGSRLPPNFKTYDYFDRATGAVVSAKSLDTQTMAKLSNPNQVYSSIKKNIDVTAKFEKASLSGVTVNSSMITSKEVRLAVPVNTTKAQWTEINRAIEYGKNQGVKVTVTQVK 124 T 0.068 Glyco_hydro_97 pdbpercent F Bacteria T 4zqw 2 B,D A,C CDIA4_ECO5C macrocyclic peptide SXKEYALSGRELT 13 T 0.033 Glyco_hydro_97 unppercent F Bacteria T 4zri 2 C,D C,D LATS2_HUMAN KINASE PHOSPHORYLATED DURING MITOSIS PROTEIN,LARGE TUMOR SUPPRESSOR HOMOLOG 2,SERINE/THREONINE-PROTEIN KINASE KPM,WARTS-LIKE KINASE PKFGPYQKALREIRYSLLPFANESGTSAAAEV 32 T 5.3 DUF3928 pdbhh F Eukaryota T 4zrk 2 E,F,G,H E,F,G,H LATS1_HUMAN LARGE TUMOR SUPPRESSOR HOMOLOG 1,WARTS PROTEIN KINASE,H-WARTS PKFGTHHKALQEIRNSLLPFANETNSSRSTSE 32 T 0.049 EcoEI_R_C unp F Eukaryota T 4zrl 2 B B GLD3_CAEEL GERMLINE DEVELOPMENT DEFECTIVE 3 MAHSYNPFVRSAVEYDADTRLQMAENAASARKLFVSSALKDIIVNPENFYHDFQQSAQMAEDANQRRQVSYNTKREA 77 T 0.0096 Glyco_transf_54 pdb F Eukaryota T 4zrt 2 B B NPHN_HUMAN GLY-PRO-LEU-PTR-ASP-GLU AWGPLXDEVQM 11 T 1.4 GIT_SHD pdbhh F Eukaryota T 4zrz 1 A,B A,B Q7Y3F3_9CAUD PlyCB SKINVNVENVSGVQGFLFHTDGKESYGYRAFINGVEIGIKDIETVQGFQQIIPSINISKSDVEAIEKAMKK 71 T 2.5 DUF3213 pdbhh T Viruses T 4ztd 2 D,E D,E TRAIP_HUMAN ALA-PHE-GLN-ALA-LYS-LEU-ASP-THR-PHE-LEU-TRP-SER AFQAKLDTFLWS 12 T 1 DNA_pol3_chi pdbhh F Eukaryota T 4zu1 2 B A CYSE_SALTY SAT,SERINE TRANSACETYLASE HHTFEYGDGI 10 T 3.3 DUF2023 pdbhh F Bacteria T 4zw2 2 B B CAC1S_MOUSE CALCIUM CHANNEL,L TYPE,ALPHA-1 POLYPEPTIDE,ISOFORM 3,SKELETAL MUSCLE,VOLTAGE-GATED CALCIUM CHANNEL SUBUNIT ALPHA CAV1.1 XQQLEEDLRGYMSWITQGEX 20 T 2.4 Antimicrobial14 pdbhh F Eukaryota T 4zxa 1 A,B,C,D A,B,C,D C1I210_PSEWB Hydroquinone dioxygenase small subunit GPGSMSNVAVNTVFASLDNFRKGTVEIISGEARHYAFSNIFEVAQNSKPYEKVVVGLNLGYVVETLRAEGQSPWFTAAHDEFAIVMDGEVRVEFLKLDAPSKHGEGTHLAGELPVGKPMGYVLLKRGHQCLLPAGSAYRFEASRPGVILQQTIKGPLSVEKWAEICLK 168 T 0.069 Ppnp unphh F Bacteria T 4zya 1 A,B A,B SYNC_HUMAN ASPARAGINYL-TRNA SYNTHETASE,ASNRS GHMAELYVSDREGSDATGDGTKEKPFKTGLKALMTVGKEPFPTIYVDSQKENERWNVISKSQLKNIKKMWHREQMKS 77 T 0.086 DUF1565 pdbpssm F Eukaryota T 4zzj 2 B B P53_HUMAN Ac-p53 RHKXLXF 7 T 36 NifQ pdbhh F Eukaryota T 5a0n 1 A A A5MCJ6_STREE PROTEIN F2 LIKE FIBRONECTIN-BINDING PROTEIN GAMGGFPNDAKGISGNGKYYSLGQIEKLYSNQFATYNNLTVITSDTHENSDNFAFCLANGKRFPSFTDEKPKGIYTLVKDINKEQYTKLLKENHKWSSIPNLNQAWDTFSRLSYMYLKDPTDIVKRAWGTDLNTARTYFHQVIQYEIWRYTDGMRVSSDTNVYIYEKFSPQQKKALEMIRTDLYNFTVPYENLEYRFYKPDWVFGLGFQALATVRWKIEP 220 T 0.06 TED unphh F Bacteria T 5a1q 1 A,B A,B Y1502_ARCFU AF1502 GSHMITYKKLLDELKKEIGPIAKIFLNKAMESLGYDDVDDSNYKEILSVLKMNKELREYVEIVEERLEKEG 71 T 0.0016 DUF1322 pdb F Archaea T 5a29 2 B B PECTATE LYASE HHHHHHHSSGLVPRGSHA 18 T 3000 zf-CCHC_2 pdbhh F T 5a2q 37 KA r RL19_HUMAN RIBOSOMAL PROTEIN EL19 RRSKTKEARKRRE 13 T 4.7 SUIM_assoc pdbhh F Eukaryota T 5a2q 38 LA w RL24_HUMAN RIBOSOMAL PROTEIN EL24 XXXXXXXXXXXXXXQRAITGASLADIMAKRNQKPEVRKAQREQAIRAAKEAKKAKQASKKTA 62 T 0.15 UreF unppercent F Eukaryota T 5a31 19 U V THE ANAPHASE-PROMOTING COMPLEX CHAIN V AAFRIALKSVQKS 13 T 7.8 DIMCO_N pdbhh F T 5a4h 1 A A ABHD5_MOUSE ABHYDROLASE DOMAIN-CONTAINING PROTEIN 5, LIPID DROPLET-BINDING PROTEIN CGI-58, PROTEIN CGI-58, WR10_43 GAMGSVDSADAGGGSGWLTGWLPTWCPTSTSHLKEAEEK 39 T 5.8 CSN7a_helixI pdbhh F Eukaryota T 5a6w 2 C C C4B8B8_MAGOR AVR-PIK PROTEIN METGNKYIEKRAIDLSRERDPNFFDHPGIPVPECFWFMFKNNVRQDAGTCYSSWKMDMKVGPNWVHIKSDDNCNLSGDFPPGWIVLGKKRPGF 93 T 0.1 TMEM18 unp F Eukaryota T 5a8l 9 I Z NASCENT CHAIN MEPLVLSAKKLSSLLTCKYIPP 22 T 3.8 BLOC1S3 pdbhh F T 5ab0 2 C,D E,F DG025 XXKHHAFSFX 10 T 18 SmaI pdbhh F T 5abk 1 A A PRTV_VIBCH METALLOPROTEASE PRTV GAMAQTPIDLGVVNEDKLIEMLVRTGQIPADASDVDKRIALERYLEEKIRSGFKGDAQFGKKALEQRAKILKVIDKQKGPHKAR 84 T 0.038 SLT_2 pdbpercent F Bacteria T 5abu 2 B B MXT_DROME 4E-BINDING PROTEIN MEXTLI GPHMLESRVSYDIEHLLYYSMSPHSWTLPTDWQKMQETAPSILRNKDLQDESQRFDGDKYLASIKTAAKR 70 T 0.051 eIF_4EBP unphh F Eukaryota T 5abx 2 B B MXT_CAEEL 4E-BINDING PROTEIN MEXTLI GPHMIRYNRDTLMTARDTKRAPIPDEMLQEINRVAPDILIA 41 T 0.072 Gal11_ABD1 pdb F Eukaryota T 5acz 3 C C 11MER PEPTIDE GRAEEYGADTL 11 T 4.5 DUF4156 pdbhh F T 5ad0 3 C C 11MER PEPTIDE GHAEEYGADTL 11 T 9.7 DUF4156 pdbhh F T 5adx 17 Z d A0A0J9X293_PIG DYNACTIN SUBUNIT 2 MADPKYADLPGIARNEPDVY 20 T 5.8 SmAKAP pdbhh F Eukaryota T 5afg 2 B B STAPLED PEPTIDE XTFAEYWAQLAS 12 T 0.071 PBP-Tp47_a pdbhh F T 5afp 2 C,D C,D RK_HUMAN RK, G PROTEIN-COUPLED RECEPTOR KINASE 1 MDFGSLETVVANSAFIAARGSFDGS 25 T 3.1 DUF5465 pdbhh F Eukaryota T 5afu 20 EA c A0A0J9X299_PIG DYNACTIN MADPKYADLPGIARNEPDVYAAAAAAAAAAA 31 T 0.16 Dynamitin pdbhh F Eukaryota T 5afw 2 B B KDM1A_HUMAN BRAF35-HDAC COMPLEX PROTEIN BHC110, FLAVIN-CONTAINING AMINE OXIDASE DOMAIN-CONTAINING PROTEIN 2 RRTSRRKRAKVE 12 T 5.8 DUF3306 pdbhh F Eukaryota T 5aiw 1 A A A0A140UHJ9_ENTFL TRAH SNTNQSESEKIIKEFYKTVYNYEKSQKEISMTTVKELATDNVYQELQNEINVNNSYSPQQNTIQKSSVNENEIKILAYESKDNSQQYLVTAPIHQVFNGTKNDFEINQLIQIKNQKITQRTTIQLGEE 128 T 0.0033 Tim44 pdbpssm F Bacteria T 5aj1 1 A A SNF5_HUMAN BRG1-ASSOCIATED FACTOR 47, BAF47, INTEGRASE INTERACTOR 1 PROTEIN, SNF5 HOMOLOG, HSNF5, SMARC B1 DOMAIN GGSMMMALSKTFGQKPVKFQLEDDGEFYMIGSEVGNYLRMFRGSLYKRYPSLWRRLATVEERKKIVASSHGKKTKPNTKDHGYTTLATSVTLLKASEVEEILDGNDEKYKAVSIS 115 T 0.00076 CRC_subunit pdbhh F Eukaryota T 5aj4 32 GA Ao K7GKS8_PIG MITORIBOSOMAL PROTEIN MS39, MRPS39 MAAVASARWLGVRSGLCLPLTGRRVGPCGRTPRSRFYSGSAAHPEVEGANVTGIEEVVIPKKKTWDKVAILQALASTVHRDSTAAPYVFQDDPYLIPTSSVESHSFLLAKKSGENAAKFIINSYPKYFQKDIAEPHIPCLMPEXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 530 T 46 RBR pdbpssm F Eukaryota T 5ajd 2 B,D,F,H,J,L B,D,F,H,J,L NOT4_YEAST MODULATOR OF TRANSCRIPTION 2, NOT4 GPDSMDPYDALGNAVDFLDARLHSLSNYQKRPISIKSNIIDEETYKKYPSLFSWDKIEASKKSDN 65 T 0.71 DUF3140 unppercent F Eukaryota T 5ajn 2 B P MUC5A_HUMAN MUC5AC GTTPSPVPTTSTCSAA 16 T 5.4 TYW3 pdbhh F Eukaryota T 5ajo 2 B B MUC5A_HUMAN MUC5AC AGTTPSPVPTTSTTSAA 17 T 9.2 Inhibitor_I53 pdbhh F Eukaryota T 5amt 1 A,B A,B A0Q7H2_FRATN IGLE MGSSHHHHHHSSGLVPRGSHGSHMDGLYINNNIPKTKIVLESKPDKNIFYSDNYQSISQRIYDDNVKVLNLKTGKNEFPLDKDIKDYALYFILPENKKTENWKYLISSDSVNEFTIKNDSSIEKD 125 T 0.0061 DUF5006 unphh F Bacteria T 5amu 1 A,B A,B A0Q7H2_FRATN IGLE MGSSHHHHHHSSGLVPRGSHMAIMDGLYINNNIPKTKIVLESKPDKNIFYSDNYQSISQRAADDNVKALNLKTGKNEFPLDKDIKDYALYFILPENKKTENWKYLISSDSVNEFTIKNDSSIEKD 125 T 0.0061 DUF5006 unphh F Bacteria T 5aoq 1 A,B A,B TORSO_BOMMO RECEPTOR TYROSINE KINASE TORSO HHHHHHHHGEVVSQRYPPAPGLLKYLEQDVCYSLYYYLNWTSLADCKTNFEETGISDVPSTVKVRCQSKNSIRFETEPSEHWQLFILMEHDNFDPIPFTLIEPNNVFGELITTANKEYQIWSTYLDEYGTLQDWMEGPIVLKFDQRNQQPDDIKYNVTQEFKYIILGNDSYTINGKFVWNTTGDRDLCFDIANICQNTNMKHAKIWPTAHPSFDVENLVLNDECEIHVKGIHGTTKHKYKTPSCFELPECFLNNMEPEIPQDVAIAADQDLR 272 T 0.00078 fn3 unppercent F Eukaryota T 5aot 1 A A A0A1A9TAF4_RUMFL CBM74-RFGH5 MGAEEEDTAILYPFTISGNDRNGNFTINFKGTPNSTNNGCIGYSYNGDWEKIEWEGSCDGNGNLVVEVPMSKIPAGVTSGEIQIWWHSGDLKMTDYKALEHHHHHH 106 T 4.3 DUF5766 pdbhh F Bacteria T 5apr 2 B I PEPSTATIN-LIKE RENIN INHIBITOR HPFCXLFX 8 T 3.9 RNR_inhib pdbhh F T 5aum 3 E,F C,D peptide RENLYFQGKDG RENLYFQGKDG 11 T 3 PNPase_C pdbhh F T 5avp 1 A,B,C,D A,B,C,D D2S5K0_GEOOG UNCHARACTERIZED PROTEIN MNHKVHHHHHHIEGRHMEGLLARTSVTRREYDEWLNEAAALGRALRYPVRPEMVNDSAGIVFGEDQYDAFENGLWSREPYEAMVIFESLNEPAVDGLPAAGAPFAEYSGLCDKLMIVHPGKFCPPHYHQRKTESYEVVLGEMELFYSPKPVQVGEEEVLSFTGMHEGSPWPDGVALPIGREESYAALTSYRRLRVGDPKFVMHRKHLHAFRCPADSDVPLVVREVSTYSHEPTEEAADKAAPLPDWAGLHDNSFVAAAANSGRLRTAIQ 269 T 0.0036 Cupin_2 unp F Bacteria T 5awt 2 B B EPS15_HUMAN EPS15 YDPFGGDPFKG 11 T 0.46 CsgF pdbhh F Eukaryota T 5awu 2 B B EPS15_HUMAN EPS15 YDPFKGSDPFA 11 T 1.5 MT-A70 pdbhh F Eukaryota T 5ax6 1 A A Q93I73_ECOLX CofB GPDEARRQIVSNALISEIAGIVDFVAEEQITVIEQGIEKEITNPLYEQSSGIPYINRTTNKDLNSTMSTNASEFINWGAGTSTRIFFTRKYCISTGTQGNYEFSKDYIPCEEPAILSNSDLKIDRIDFVATDNTVGSAIERVDFILTFDKSNANESFYFSNYVSSLEKAAEQHSISFKDIYVVERNSSGAAGWRLTTISGKPLTFSGLSKNIGSLDKTKNYGLRLSIDPNLGKFLRADGRVGADKLCWNIDNKMSGPCLAADDSGNNLVLTKGKGAKSNEPGLCWDLNTGTSKLCLTQIEGKDNNDKDASLIKLKDDNGNPATMLANILVEEKSMTDSTKKELRTIPNTIYAAFSNSNASDLVITNPGNYIGNVTSEKGRIELNVQDCPVSPDGNKLHPRLSASIASIVADTKDSNGKYQADFSSLAGNRNSGGQLGYLSGTAIQVNQSGSKWYITATMGVFDPLTNTTYVYLNPKFLSVNITTWCSTEPQT 492 T 2.1 PulG unphh F Bacteria T 5azg 2 C,D C,D UNC51_CAEEL UNC-51 AIM YQESTDFTFL 10 T 15 DUF1957 pdbhh F Eukaryota T 5b0u 1 A,B A,B LUCI_OPLGR 19 KDA PROTEIN OF OPLOPHORUS LUCIFERASE, NANOKAZ MNHKVHHHHHHMELGTLEGSEFFTLEDFVGDWRQTAGYNLDQVLEQGGVSSLFQNLGVSVTPIQRIVLSGENGLKIDIHVIIPYEGLSGDQMGQIEKIFKVVYPVDDHHFKVILHYGTLVIDGVTPNMIDYFGRPYEGIAVFDGKKITVTGTLWNGNKIIDERLINPDGSLLFRVTINGVTGWRLCERILA 191 T 0.0098 Lipocalin_7 pdbhh F Eukaryota T 5b16 2 B,C B,C DGCR8_HUMAN DIGEORGE SYNDROME CRITICAL REGION 8 MANLHILSKLQEEMKRLAEEREETRHGGSRGDMLEVLFQ 39 T 7.1 Tma16 pdbhh F Eukaryota T 5b4w 2 G,H,I,J,K,L G,H,I,J,K,L Synthesized cyclic peptide XXRPRVARWTGQIIYCSX 18 T 0.56 DUF3104 pdbhh F T 5b56 2 C,D,E,F C,D,E,F VPR_HV1B9 R ORF PROTEIN,VIRAL PROTEIN R QQRRTRNGASKS 12 T 6.8 EcoRII-C pdbhh T Viruses T 5b6c 2 B B UFD1_HUMAN Peptide from Ubiquitin fusion degradation protein 1 homolog FRAFSGSGNRL 11 T 0.52 SEP pdbhh F Eukaryota T 5b6i 1 A,B A,B W0W999_9ACTN Fluorinase MGSSHHHHHHSSGLVPRGSHMAANGSQRPIIAFMSDLGTTDDSVAQCKGLMHSICPGVTVVDVCHSMTPWDVEEGARYIVDLPRFFPEGTVFATTTYPATGTTTRSVAVRIRQAAKGGARGQWAGSGDGFERADGSYIYIAPNNGLLTTVLEEHGYIEAYEVTSTKVIPANPEPTFYSREMVAIPSAHLAAGFPLAEVGRRLDDSEIVRFHRPAVEISGEALSGVVTAIDHPFGNIWTNIHRTDLEKAGIGQGKHLKIILDDVLPFEAPLTPTFADAGAIGNIAFYLNSRGYLSLARNAASLAYPYNLKAGLKVRVEAR 319 T 2.6E-42 SAM_adeno_trans pdbpercent F Bacteria T 5b7i 2 B,C B,C L7P7R7_9CAUD Uncharacterized protein AcrF3 MGSSHHHHHHSQDPMSNTISDRIVARSVIEAARFIQSWEDADPDSLTEDQVLAAAGFAARLHEGLQATVLQRLVDESNHEEYREFKAWEEALLNADGRVASSPFADWGWWYRIANVMLATASQNVGVTWGSRVHGRLMAIFQDKFKQRYEEQA 153 T 0.23 Rtt102p unppssm T Viruses T 5bjt 3 I,J,K,L,M,N,O P,Q,R,S,T,U,V peptide inhibitor XRYFCTKWKHGWCEEVGTX 19 T 4.8 GnRH pdbhh F T 5bmt 1 A,B A,B A7AJI6_9PORP Uncharacterized protein GDGGGNTQQLSSYAIVDYSSTMRTLIYPLGYYPLYVATIANDPTYRAGDCVLANFTVDFDSADNANASTNGFYVATGAASSPLAKYDLSYSPLDSMALDNELLLSGSESALLFSNNYKRIVVIPTFTSVLTDQKNTYIMSMDSNQEPETVDGTDRVYTLCLRAQKREEGKAPTISNAMDPIAVEGGTLYSMLKGKESAAGKKIVSYRVKYPLTFNADSTKIATWGYSKISQFSIEEATN 239 T 0.0074 DUF4969 unphh F Bacteria T 5bn6 1 A,C,E,G A,B,C,D Q38720_ARTHE Jacalin AEQSGKSQTVIVGPWGAQV 19 T 2.8 DUF3842 pdbhh F Eukaryota T 5bnd 1 A,B,C A,B,C Q5HN63_STAEQ ABC transporter, ATP-binding protein TYEEKLAYGLALDGSVTLNGSKDLKVPKYSLITITGENNKRYRVEMNQRRYSVSKNQVFYFNPAGLYESHTFKKLSPYIKSNYSTYVEYFNSHLHQKHDKVTETLRPDKDKKYVVPITQQPIKMIFGDNDKLSGFVIPMTNKTELKKTFNITKDVWITKSGSGYFIADMKEEKWIYIEL 179 T 0.051 DUF2393 unppercent F Bacteria T 5bnw 2 B D LMNB2_HUMAN laminB1 residues 179-191 KLSPSPSSRVTVS 13 T 0.41 CCDC85 unppercent F Eukaryota T 5boa 1 A,B,C,D,E,F A,B,C,D,E,F A4VT01_STRSY Translation initiation factor 2 (IF-2 GTPase) MGSSHHHHHHSSGLVPRGSHMKQQSPLIQTSNADYKSGKDQEKLRTSVSINLLKAEEGQIQWKVTFDTSEWSFNVKHGGVYFILPNGLDLTKIVDNNQHDITASFPTDINDYRNSGQEKYRFFSSKQGLDNENGFNSQWNWSAGQANPSETVNSWKSGNRLSKIYFINQITDTTELTYTLTAKVTEPNQQSFPLLAVMKSFTYTNSKSTEVTSLGAREITLEKEKT 226 T 4.2 DUF5377 unphh F Bacteria T 5bpz 1 A A Q0IH16_XENLA Anapc5 protein MASVHESLYFNPMMTNGVVHANVFGIKDWVTPYKISVLVLLSEMSKNTKISLVEKRRLNKQILPLLQGPDMTLSKLIKIVEECCPNVSSSVHIRIKLMAEGELKDMEQFFDDLADSFTGTEPEVHKTSVVGLFLRHMILAYNKLSFSQVYKLYTSLQQYFQSDENLYFQ 169 T 0.071 CEP19 unp F Eukaryota T 5brm 2 G,H,I,J,K,L,M,N,O G,H,I,J,K,L,M,N,O STK3_HUMAN MAMMALIAN STE20-LIKE PROTEIN KINASE 2,MST-2,STE20-LIKE KINASE MST2,SERINE/THREONINE-PROTEIN KINASE KRS-1 DEEEEDGTMKRNATSPQVQRPSFMDYFDKQD 31 T 0.28 Fib_succ_major unp F Eukaryota T 5bs0 3 C C TITIN_HUMAN CONNECTIN,RHABDOMYOSARCOMA ANTIGEN MU-RMS-40.14 ESDPIVAQY 9 T 13 DUF6497 pdbhh F Eukaryota T 5btw 1 A,B A,B Q5ZVE4_LEGPH Uncharacterized protein MVTKIIWVSNNGKPNLKIEFVSEEEKSNFFKEVKKKASELGLNFPLVQGSGNSLLIEASNYPINPCGCYISPGGKLAINFGKVELSHFILPKVGVKTEHAEIFKDHNTIFFHKHKLPGVNSELTFIPTGTPVIVPVTKLEHHHHHH 146 T 0.73 PPV_E2_C pdbhh F Bacteria T 5bty 1 A A lpg1496 SGDSSISISAIGNVDSPMIRITFQNQTEREFFLNKITDKAKSLGVNISTHPFEIKEPNMVLIKPSKYPDNKLGCYISKNKEIAINFGRTDFRDFVLSNLGVGSHLGTCPTKNETGNDTFYFHQENLSLNGPALSVNTK 138 T 0.019 SL4P pdbpssm F T 5btz 1 A A lpg1496 KSGDSSISISAIGNVDSPMIRITFQNQTEREFFLNKITDKAKSLGVNISTHPFEIKEPNMVLIKPSKYPDNKLGCYISKNKEIAINFGRTDFRDFVLSNLGVGSHLGTCPTKNETGNDTFYFHQENLSLNGPALSVNTK 139 T 0.017 SL4P pdbpssm F T 5bvl 1 A A designed TIM barrel sTIM11 MDKDEAWKCVEQLRREGATQIAYRSDDWRDLKEAWKKGADILIVDATDKDEAWKQVEQLRREGATQIAYRSDDWRDLKEAWKKGADILIVDATDKDEAWKQVEQLRREGATQIAYRSDDWRDLKEAWKKGADILIVDATDKDEAWKQVEQLRREGATQIAYRSDDWRDLKEAWKKGADILICDATGLEHHHHHH 194 T 0.00015 NanE pdbhh F T 5bxo 2 C,D C,D TNKS2_HUMAN TANK2,ADP-RIBOSYLTRANSFERASE DIPHTHERIA TOXIN-LIKE 6,ARTD6,POLY [ADP-RIBOSE] POLYMERASE 5B,TNKS-2,TRF1-INTERACTING ANKYRIN-RELATED ADP-RIBOSE POLYMERASE 2,TANKYRASE II,TANKYRASE-LIKE PROTEIN,TANKYRASE-RELATED PROTEIN XREAGDGAEX 10 T 7.5 DUF5840 pdbhh F Eukaryota T 5bxu 2 B B cp4n4m5 XREAGDGAX 9 T 5.5 DUF5840 pdbhh F T 5c07 3 C,H C,H Marker peptide YQFGPDFPIA 10 T 0.58 TGT_C2 pdbhh F T 5c08 3 C,H C,H Marker peptide RQWGPDPAAV 10 T 2.1 LEA_3 pdbhh F T 5c09 3 C,H C,H Marker peptide YLGGPDFPTI 10 T 6.4 GlfT2_domain3 pdbhh F T 5c0a 3 C,H C,H Marker peptide MVWGPDPLYV 10 T 0.74 Tachykinin pdbhh F T 5c0b 3 C,H C,H Marker peptide RQFGPDFPTI 10 T 8.1 Synapsin pdbhh F T 5c0c 3 C,H C,H Marker peptide RQFGPDWIVA 10 T 1.4 Rab15_effector pdbhh F T 5c0d 3 C C INS_HUMAN Marker peptide AQWGPDPAAA 10 T 3.1 LEA_3 pdbhh F Eukaryota T 5c1b 2 G,H V,U UFD1_HUMAN UB FUSION PROTEIN 1 GELGFRAFSGSGNRLDGKKKG 21 T 1.6 SEP pdbhh F Eukaryota T 5c56 1 A B ICP0_HHV11 Ubiquitin E3 ligase ICP0 SGPRGPRKCARKTRHAETSGA 21 T 4.6 DUF6395 pdbhh T Viruses T 5c5e 2 B,D G,H KAIC_SYNE7 KaiC C-terminal peptide DEKSELSRIVRGVQEKGPES 20 T 2 Lambda_CIII pdbhh F Bacteria T 5c6d 2 C,D C,D UHRF1_HUMAN INVERTED CCAAT BOX-BINDING PROTEIN OF 90 KDA,NUCLEAR PROTEIN 95,NUCLEAR ZINC FINGER PROTEIN NP95,HNP95,RING FINGER PROTEIN 106,TRANSCRIPTION FACTOR ICBP90,UBIQUITIN-LIKE PHD AND RING FINGER DOMAIN-CONTAINING PROTEIN 1,HUHRF1,UBIQUITIN-LIKE-CONTAINING PHD AND RING FINGER DOMAINS PROTEIN 1 SEGGFASPRTGKGKWKRKSAGGGPSRAGSPRRT 33 T 43 Ribosomal_L35p pdbhh F Eukaryota T 5c6g 2 B,D B,D SCC2_ASHGO Sister chromatid cohesion protein 2 MSTFPGEDTRIPKRISEALSHQPLNHLVPKRELSRLLSKPVQISVQLESEDAFEEVPEELWQYPHPIDLDPLRLEESQPLRFRRPRGARLDYREDSSEIADLPGMGQLARACLSGTQLVDSAAIVESIESNAKKRKQTLAIGDVEMVSPDKKTKVMASVSPVSLNRVALGSQHLKTLERLMQYIGADESSAEFGDFEYWITLEDRATHILSEQCIDKL 218 T 32 GhoS pdbhh F Eukaryota T 5c6h 2 B,D,F,H,J,L,N,P,R,T,V,X B,D,F,H,J,L,N,P,R,T,V,X HUWE1_HUMAN Mule BH3 peptide from E3 ubiquitin-protein ligase HUWE1 PGVMTQEVGQLLQDMGDDVYQQYRSL 26 T 4.4 KIP1 pdbhh F Eukaryota T 5c6v 2 E,F,G,H E,F,G,H NINJA_ARATH NINJA-FAMILY PROTEIN AT4G28910,NOVEL INTERACTOR OF JAZ DNGLELSLGLS 11 T 0.00015 EAR unppercent F Eukaryota T 5c7f 2 E,F,G,H E,F,G,H IAA1_ARATH INDOLEACETIC ACID-INDUCED PROTEIN 1 KDTELRLGLPG 11 T 1.2E-05 AUX_IAA pdbhh F Eukaryota T 5c9n 1 A,B A,B GEMC1_HUMAN Geminin coiled-coil domain-containing protein 1 DSNFPLPDLCSWEEAQLSSQLYRNKQLQDTLVQKEEELARLHEENNHLRQYLNSALVKCEEEKAKKELSSDEFSKAYGKFRKGKR 85 T 0.00086 YabA pdb F Eukaryota T 5cfa 2 C,D D,C FIBA_HUMAN Peptide from Fibrinogen alpha chain SKQFTSSTSYNRGDS 15 T 1.5 DUF5326 pdbhh F Eukaryota T 5cgn 2 E,F,G,H E,F,G,H MAGA_XENLA L-ACPC8-Ala-Magainin GIGKFLHXAKKFAKAFVAEIMNS 23 T 1.3 TAFII28 pdbhh F Eukaryota T 5cgo 1 A,B A,B MAGA_XENLA ACPC-13 derivative of Ala-Magainin 2 GIGKFLHAAKKFXKAFVAEIMNS 23 T 1 TAFII28 pdbhh F Eukaryota T 5cje 1 A A Q82LM3_STRAW CYTOCHROME P450 107L2 MGNVIDLGEYGARFTEDPYPVYAELRERGPVHWVRTPPPEAFEGWLVVGHEEARAALADPRLSKDGTKKGLTSLDVELMGPYLLVVDPPEHTRLRSLVARAFTMRRVEALRPRIQEITDGLLDEMLPRGRADLVDSFAYPLPITVICELLGVPDIDRVTFRALSNEIVAPTGGDAELAAYERLAAYLDELIDDKRSTAPADDLLGDLIRTRAEDDDRLSGEELRAMAFILLVAGHETTVNLITNGVHTLLTHPDQLAALRADMTLLDGAVEEVLRFEGPVETATYRYAAESMEIGGTAIAEGDPVMIGLDAAGRDPARHPDPHVFDIHRAPQGHLAFGHGIHYCLGAPLARLEARVALRSLLERCPDLALDGPPGARPPGMLIRGVRRLPVRW 393 T 1.3E-06 p450 unppssm F Bacteria T 5ck3 2 B,D,F B,D,F G0S401_CHATD Putative signal recognition particle protein MGATTQYTTLPSVLLIGPSGAGKTALLTLFERGPLLNPDGTSVGAADLKNPYRKPIVTSPVAQTHTSQVPTSVELAVGANEDGTPTSYKVDLDAAGATARKFLLIDTPGHPKLRGTTLQHLLNPSPSLTIIPTNAPNKKTSTDSHSDPYKSKLKAVIFLLDAAALADSDGDYLSQTASYLYDVLLSLQKRFHSRKNSRAPSSIPVLIAANKQDLFTAVPASLVKSRLEHELGRIRKTRQKGLLEASVTSEDEIRADDEEGWLGAVGSKEFKFEEMMEFDMEVEVMGGNVIGDGPGAERWWRWIGERI 307 T 2.3E-10 MnmE_helical pdbhh F Eukaryota T 5ck4 1 A,B A,B G0S401_CHATD Putative signal recognition particle protein MKHHHHHHPMGATTQYTTLPSVLLIGPSGAGKTALLTLFERGPLLNPDGTSVGAADLKNPYRKPIVTSPVAQTHTSQVPTSVELAVGANEDGTPTSYKVDLDAAGATARKFLLIDTPGHPKLRGTTLQHLLNPSPSLTIIPTNAPNKKTSTDSHSDPYKSKLKAVIFLLDAAALADSDGDYLSQTASYLYDVLLSLQKRFHSRKNSRAPSSIPVLIAANKQDLFTAVPASLVKSRLEHELGRIRKTRQKGLLEASVTSEDEIRADDEEGWLGAVGSKEFKFEEMMEFDMEVEVMGGNVIGDGPGAERWWRWIGERI 316 T 4.7E-10 MnmE_helical pdbhh F Eukaryota T 5cmu 1 A,B,C A,B,C ENV_HV1H2 Envelope glycoprotein,AP1 SSGIVQQQNNLLRAIEAQQHLLQLTVWGIKQLQARILSGGRGGWMEWDREIEELIKKSEELIKKIEEQIKKQE 73 T 0.11 GP41 pdbpssm T Viruses T 5cmz 2 B,D B,D Artificial HIV entry inhibitor AP3 XMTWEEWDKKIEELIKKSEELIKKIEEQIKKQEESIKK 38 T 0.0055 DUF1351 pdbpercent F T 5cn0 1 A A ENV_HV1H2 GP41 SSGIVQQQNNLLRAIEAQQHLLQLTVWGIKQLQARILSGGRGGWEEWDKKIEELIKKSEELIKKIEEQIKKQE 73 T 0.058 GP41 pdbpercent T Viruses T 5cow 1 A A E3M3V1_CAERE Putative uncharacterized protein KLLLEGVKEQDPVDKFTYLLLQPLTEATLSDAVNFIVEKYSAELPDEGDASLVVRSQLGCQFFFLVTRTLAHDQRELAKLVQTLIPRPVRLEVFPGLQRSVFKSSVFLGHHIIQIFMGAKKPFQDWSFVGLAQDFECPWRRLAIAELLKKFSVSVVEKVFDNPVALIPQHESDNEALIELVTNALRFALWIVEFYETETNEKSIKELAFLDHSSKTLLIESFTKFLQGKDVKDQDHLKRIIDALEKS 247 T 0.12 CbtA_toxin pdb F Eukaryota T 5coz 1 A A C4ZHW1_AGARV Uncharacterized protein GIDDGTQANTTDLNDYENVLNSLDEEQIGKLPQNIKCVVNDKLNIDSEINIWDATSYYVKSGKVKAINFSENKDKCYDLMEKLAKAINLNKDVCVQSHRSENGNEIYLWDNNYTQDSIAIRNDSALAETHDGKLAVSASKFGTYYSPFNDKDKFRTDKQLMFMSAEEAEELAVKTAKELEINVCEKNELYVLDDKNTLIFPEDDTDKQNDTYVFFMFPDVYGIPYSRCPENEALTGYANQENHLVIAMDEKGISFLDIPPLYDWVETTETGEILHPSSILSKEVDKLKKYVTSGDIEVSEISLEYMLFADKNETYDIKPVWVVYYYQNQLVTGENSYTQKMALYDVYDAYTGEEYRIQ 358 T 0.14 HTH_26 unppercent F Bacteria T 5cqc 1 A A Q5ZUV9_LEGPH putative RavZ protein GSKLIVDEFEELGEQELDKTTSIYPPETSWEVNKGMNSSRLHKLYSLFFDKSSAFYLGDDVSVLEDKPLTGAYGFQSKKNDQQIFLFRPDSDYVAGYHVDAKSDAGWVNDKLDRRLSEISEFCSKATQPATFILPFVEMPTDITKGVQHQVLLTISYDPKSKQLTPTVYDSIGRDTYSESLSSYFKGKYRTTCDEILTQSIEKAIKSTDFTLGKFTRAAYNHQNRLTEGNCGSYTFRTIKEVISSSAQGTEVKIPGSGYITSNSYLTSQHVQDIESCIKYRNLGVVDIESALTEGKTLPVQLSEFIVALEDYGKLRSQQSEKSMLNFIGYSKTAKLTAVELLIGILNDIKGKNEISESQYDKLVKEVDCLMDSSLGKLVQFHLKNLGAESLQKLVLPCVKFDDELFDVPDITGEELASKK 420 T 18 DUF438 unphh F Bacteria T 5cqx 2 C C MAZE_ECOLI Antitoxin MazE HENIDWGEPKDKEVW 15 T 1.1 DUF2389 pdbhh F Bacteria T 5csf 2 C C KS6A1_HUMAN S6K-ALPHA-1,90 KDA RIBOSOMAL PROTEIN S6 KINASE 1,P90S6K,MAP KINASE-ACTIVATED PROTEIN KINASE 1A,MAPKAPK-1A,RIBOSOMAL S6 KINASE 1,RSK-1 GSQSQLSHQDLQLVKGAMAATYSALNSSKPTPQLKPIESSILAQRRVRKLPSTTL 55 T 55 DUF4603 pdbhh F Eukaryota T 5csi 2 C C KS6A1_HUMAN S6K-ALPHA-1,90 KDA RIBOSOMAL PROTEIN S6 KINASE 1,P90S6K,MAP KINASE-ACTIVATED PROTEIN KINASE 1A,MAPKAPK-1A,RIBOSOMAL S6 KINASE 1,RSK-1 GSQDLQLVKGAMAATYSALNSSKPTPQLKPIESSILAQRRVRKLPSTTL 49 T 0.026 Vitelline_membr pdbpssm F Eukaryota T 5csj 2 C C KS6A1_HUMAN S6K-ALPHA-1,90 KDA RIBOSOMAL PROTEIN S6 KINASE 1,P90S6K,MAP KINASE-ACTIVATED PROTEIN KINASE 1A,MAPKAPK-1A,RIBOSOMAL S6 KINASE 1,RSK-1 GSGAMAATYSALNSSKPTPQLKPIESSILAQRRVRKLPSTTL 42 T 35 DUF4603 pdbhh F Eukaryota T 5csm 1 A A CHMU_YEAST CHORISMATE PYRUVATE MUTASE MDFTKPETVLNLQNIRDELVRMEDSIIFKFIERSHFATCPSVYEANHPGLEIPNFKGSFLDWALSNLEIAHSRIRRFESPDETPFFPDKIQKSFLPSINYPQILAPYAPEVNYNDKIKKVYIEKIIPLISKRDGDDKNNFGSVATRDIECLQSLSRRIHFGKFVAEAKFQSDIPLYTKLIKSKDVEGIMKNITNSAVEEKILERLTKKAEVYGVDPTERRIERRISPEYLVKIYKEIVIPITKEVEVEYLLRRLEE 256 T 0.048 CM_2 pdb F Eukaryota T 5csn 2 C C KS6A1_HUMAN S6K-ALPHA-1,90 KDA RIBOSOMAL PROTEIN S6 KINASE 1,P90S6K,MAP KINASE-ACTIVATED PROTEIN KINASE 1A,MAPKAPK-1A,RIBOSOMAL S6 KINASE 1,RSK-1 GSQSQLSHQDLQLVKGAMAATYSALNSSKPTPQLKPIESS 40 T 26 RRT14 pdbhh F Eukaryota T 5ctt 2 B B SART3_HUMAN SART-3,TAT-INTERACTING PROTEIN OF 110 KDA,TIP110,P110 NUCLEAR RNA-BINDING PROTEIN LVPRGSRKRARAEKKALKKKKKIRGPEKRGADEDDEKEWGDDEEEQPSKRRRVEN 55 T 0.31 Pox_Ag35 pdb F Eukaryota T 5cv1 1 A A PGL1_CAEEL P granule abnormality protein 1 KQLMLDGPKSEPADPFISLLMDPLEESVGKVVNHIAQLFEEASKNEGDESLVLRSQLGYQLFFLIVRSLADGKREVSKKILSGIPTSVRAEVFPGLQRSVYKSAVFLGNHIIQVLLGSKKSFEDWDVVGVAKDLESAWKRRAIAELIKKFQVSILEQCFDKPVPLIPQSPLNNDAVIDNVNKALQFALWLTEFYGSENETEALGELRFLDSTSKNLLVDSFKKFVQGINSKTHVTRIVESLEK 243 T 3.6 DMA pdbhh F Eukaryota T 5cv3 1 A A E3M3V1_CAERE Putative uncharacterized protein GKLLLEGVKEQDPVDKFTYLLLQPLTEATLSDAVNFIVEKYSAELPDEGDASLVVRSQLGCQFFFLVTRTLAHDQRELAKLVQTLIPRPVRLEVFPGLQRSVFKSSVFLGHHIIQIFMGAKKPFQDWSFVGLAQDFECPWRRLAIAELLKKFSVSVVEKVFDNPVALIPQHESDNEALIELVTNALRFALWIVEFYETETNEKSIKELAFLDHSSKTLLIESFTKFLQGKDVKDQDHLKRIIDALEKS 248 T 0.13 CbtA_toxin pdb F Eukaryota T 5cve 2 C,D D,E H2B_DROME N-terminal peptide from Histone H2B XPKTSGKAA 9 T 110 DUF6143 pdbhh F Eukaryota T 5cw9 1 A A De novo designed ferredoxin-ferredoxin domain insertion protein MEMDIRFRGDDPEAYYKALREMIRQARKFAGTVTVTLIIRFRGDDLEALEKALKEMIRQARKFAGTVTYTLDGNDLEIRITGVPPQVILELVKEAIRLAKEFNITVTVELVIRITGVPEQVRKELAKEAERLAKEFNITVTYTIRL 146 T 0.0034 Radical_SAM pdb F T 5cws 6 F,L F,L NIC96_CHATD NUCLEAR PORE PROTEIN NIC96 SALFDSLLARNKKQAEGETALGELPSLQLGLADLRQRLRKLGPSSDRPIEPGKAHYFLAASGVDPGAAVRDLGA 74 T 0.012 TFIIA unppssm F Eukaryota T 5cww 2 B B NUP82_CHATD NUCLEAR PORE PROTEIN NUP82 MPKIKSFAPAWLNEPAPGHKLFAPAADDGTATVPLAYGKKIKPGPRRTIARRGTEIFVACGKQIRWGDLAQLKESWESRPSRSSVGPTSTKKDSSDFDDGAATAGYRIIKTPVADDIRQLVMSPNQDFLAVLTSHTVHICILPDSSHLHIQDTTPFKPKFWTLGPTTHVTSRSAVVSAVWHPLGVNGHALVTVTEDAIVRVWELSTADRWTFDAPTLAIDLKKLADATYLDQDFGVSTSATNKGFSPDAFDMEVAAACFPTRDSGGWAPMTLWLAMTSGDVYALCPLLPQRWTPPPTLIPSLSASIVAKVAAAEDNPESTPEERLVAQQQLEWMSEIDNQEPKLVEEATGEATIEVYTRPSRPGLVPKLQGPFDFDLNPEDEQDDEVELKDIYVIGEKPRVADLMRGEEEELEMMKEDQHNGLSLNIICLLSTSGQVKICLDIDGVEAQWLPPRSKNKRLFAPPPEPPSLLTFQTFDTLKPAEVTPDGWPMFSEDATSPYSFYVTHPAGITYISLTPWVFRLESELQSDSEAGTEFRIDLLAKGQGSERDRIFTQTRTQSPLAAATSIDDPDLGYFILSATQTDPIALFFETPER 595 T 3E-15 Nup88 unppercent F Eukaryota T 5cww 3 C C NU159_CHATD NUCLEAR PORE PROTEIN NUP159 LRAREAKRKATLRMLRESLARVGPNVVRLRDD 32 T 0.0026 DUF5768 unppssm F Eukaryota T 5cx3 2 E,F,G,H E,F,G,H FYCO1_HUMAN ZINC FINGER FYVE DOMAIN-CONTAINING PROTEIN 7 RPPDDAVFDIITDEELCQIQESGSSLVPRGS 31 T 2.9 ComC pdbhh F Eukaryota T 5czf 1 A,B A,B Q8XAD5_ECO57 PaaA2 METIEQENSYNEWLRAKVATSLADPRPAIPHDEVERRMAERFAKMRKERSKQ 52 T 0.044 HAGH_C pdb F Bacteria T 5czh 2 B B CITRULLINE--ASPARTATE LIGASE DEEDYXEIP 9 T 12 N-SET pdbhh F T 5czi 2 B B SHC1_HUMAN SHC-TRANSFORMING PROTEIN 3, SHC-TRANSFORMING PROTEIN A, SRC HOMOLOGY 2 DOMAIN-CONTAINING-TRANSFORMING PROTEIN C1, SH2 DOMAIN PROTEIN C1 PDHQYXNDF 9 T 2.4 HAND pdbhh F Eukaryota T 5d0j 2 E,F L,M G7-TEdFP peptide SFEGYDNSC 9 T 1.2 Crr6 pdbhh F T 5d23 1 A A Q5FBS0_BOMMO UNCHARACTERIZED PROTEIN MHHHHHHETSEERAARLAKMSAYAAQRLANESPEQRATRLKRMSEYAAKRLSSETREQRAIRLARMSAYAARRLANETPAQRQARLLRMSAYAAKRQASKKS 102 T 0.072 DUF3752 pdbpssm F Eukaryota T 5d2m 4 G G ZN451_HUMAN COACTIVATOR FOR STEROID RECEPTORS GAMDHVEFGSGDPGSEIIESVPPAGPEASESTTDENEDDIQFVSEGPLRPVLEYIDLVSSDDEEP 65 T 57 SelB-wing_3 pdbhh F Eukaryota T 5d50 2 E,F,K,L,M,N,O,P E,G,M,O,F,H,N,P T1SA45_9CAUD Anti-repressor protein MQRQYHHPLEEGFEERIHTPVGVRSLVEDSHLMKLLRELDKDGFNVDGPLAELVALVNYVTSSQMTMQDLQTHLDYCAEQLRKQTT 86 T 0.0051 DUF724 pdb T Viruses T 5d5k 2 B B PARP2_HUMAN HPARP-2,ADP-RIBOSYLTRANSFERASE DIPHTHERIA TOXIN-LIKE 2,ARTD2,NAD(+) ADP-RIBOSYLTRANSFERASE 2,ADPRT-2,POLY[ADP-RIBOSE] SYNTHASE 2,PADPRT-2 MGSSHHHHHHSSGLVPRGSHMAARRRRSTGGGRARALNESKRVNNGNTAPEDSSPAKKTRRCQRQESKKMPVAGGKANKDRTEDKQDESVKALLLKGK 98 T 96 DUF5757 pdbhh F Eukaryota T 5d5y 1 A,B B,A G0SB31_CHATD CTSKN7 SQQQIAALSESLQATQQQLQALQQQCYELEKTNRLLVSEVMTLQKMVKAQ 50 T 0.0033 CENP-H pdb F Eukaryota T 5d60 1 A,B,C,D A,B,C,D G0SB31_CHATD CTSKN7 SQQQIAALSESLQATQQQLQALQQQCYELEKTNRLLVSEVMTLQKMVKAQNQASNEIINHL 61 T 6.9E-05 Sulfotransfer_4 unphh F Eukaryota T 5d94 2 B B FYCO1_HUMAN Peptide from FYVE and coiled-coil domain-containing protein 1 DDAVFDIITDEEL 13 T 4.5 Flavi_NS2B pdbhh F Eukaryota T 5d9e 1 A A D5VKJ8_CAUST Caulosegnin II GTLTPGLPEDFLPGHYMMPG 20 T 0.087 DUF5974 unp F Bacteria T 5d9q 1 A,F,K G,A,J Q2N0S6_9HIV1 Envelope glycoprotein gp120 AENLWVTVYYGVPVWKDAETTLFCASDAKAYETEKHNVWATHACVPTDPNPQEIHLENVTEEFNMWKNNMVEQMHTDIISLWDQSLKPCVKLTPLCVTLQCTNVTNNITDDMRGELKNCSFNMTTELRDKKQKVYSLFYRLDVVQINENQGNRSNNSNKEYRLINCNTSAITQACPKVSFEPIPIHYCAPAGFAILKCKDKKFNGTGPCPSVSTVQCTHGIKPVVSTQLLLNGSLAEEEVMIRSENITNNAKNILVQFNTPVQINCTRPNNNTRKSIRIGPGQAFYATGDIIGDIRQAHCNVSKATWNETLGKVVKQLRKHFGNNTIIRFANSSGGDLEVTTHSFNCGGEFFYCNTSGLFNSTWISNTSVQGSNSTGSNDSITLPCRIKQIINMWQRIGQAMYAPPIQGVIRCVSNITGLILTRDGGANNTSTETFRPGGGDMRDNWRSELYKYKVVKIEPLGVAPTRCKRR 472 T 3.5E-54 GP120 pdbpercent T Viruses T 5da7 2 C,D B,E Q5JH72_THEKO Thermococcales inhibitor of PCNA MDRKLDEFIGDATPKKVSKEKPVRRKKRLKPTSLDSFLPEEHINYFRDLRIGSKKIRNAKIEEL 64 T 0.011 Inos-1-P_synth unppercent F Archaea T 5dai 2 B B FEN_THEKO C-terminus of FEN-1 protein KQRTLESWFGR 11 T 0.15 PaaX unppercent F Archaea T 5day 1 A,B A,B NRP1_ARATH NUCLEOSOME/CHROMATIN ASSEMBLY FACTOR GROUP A6,PROTEIN SET HOMOLOG 1 SNLEQIDAELVLSIEKLQEIQDDLEKINEKASDEVLEVEQKYNVIRKPVYDKRNEVIQSIPGFWMTAFLSHPALGDLLTEEDQKIFKYLNSLEVEDAKDVKSGYSITFHFTSNPFFEDAKLTKTFTFLEEGTTKITATPIKWKEGKGLPNGVNHDDKKGNKRALPEESFFTWFTDAQHKEDAGDEIHDEVADIIKEDLWSNPLTYFNN 208 T 1.1E-10 NAP pdbpercent F Eukaryota T 5dbr 2 B C SCN5A_HUMAN HH1,SODIUM CHANNEL PROTEIN CARDIAC MUSCLE SUBUNIT ALPHA,SODIUM CHANNEL PROTEIN TYPE V SUBUNIT ALPHA,VOLTAGE-GATED SODIUM CHANNEL SUBUNIT ALPHA NAV1.5 GPGSQDIFMTEEQKKYYNAMKKLGSKKPQKPIPRPLNKYQGFIFDIVTKQA 51 T 7.9 NPA pdbhh F Eukaryota T 5de2 2 C,D C,D NEK9_MOUSE NERCC1 KINASE,NEVER IN MITOSIS A-RELATED KINASE 9,NIMA-RELATED PROTEIN KINASE 9 GWLRKELENAEFIPMPDSP 19 T 2.2 DUF1456 pdbhh F Eukaryota T 5df6 2 B,C B,C TXNIP_HUMAN txnip KFMPPPTXTEVDX 13 T 0.53 Tryp_FSAP pdbhh F Eukaryota T 5dfn 1 A,B A,B Q6JXI5_TETTH Telomerase associated protein p45 GWKQQQIPQIKSNQENINTLKYKELIAGELMRITHKLLIQKLQQQQPANNNKQINEMDVESNELAEKKEVIIKIQEIAKDQQLYDTLSIQYQVDQKEQYYAKIAQSLEDFVSISALKMVSYIYPNISYQVSIGFFQNILDIATKTVKDRGALGCNYKYLKDKLTKALNLQQISYPLISESYISYLVHLFQDFNIIEIENEHKFYYKQAFQYDDS 214 T 0.014 DUF2204 pdb F Eukaryota T 5dgo 1 A A CDC45_HUMAN PORC-PI-1 MFVSDFRKEFYEVVQSQRVLLFVASDVDALCACKILQALFQCDHVQYTLVPVSGWQELETAFLEHKEQFHYFILINCGANVDLLDILQPDEDTIFFVCDTHRPVNVVNVYNDTQIKLLIKQDDDLEVPAYEDIFRDEEEDEEHSGNDSDGSEPVEQTMRRRQRREWEARRRDILFDYEQYEYHGTSSAMVMFELAWMLSKDLNDMLWWAIVGLTDQWVQDKITQMKYVTDVGVLQRHVSRHNHRNEDEENTLSVDCTRISFEYDLRLVLYQHWSLHDSLCNTSYTAARFKLWSVHGQKRLQEFLADMGLPLKQVKQKFQAMDISLKENLREMIEESANKFGMKDMRVQTFSIHFGFKHKFLASDVVFATMSLMESPEKDGSGTDHFIQALDSLSRSNLDKLYHGLELAKKQLRATQQTIASCLCTNLVISQGPFLYCSLMEGTPDVMLFSRPASLSLLSKHLLKSFVCSTKNRRCKLLPLVMAAPLSMEHGTVTVVGIPPETDSSDRKNFFGRAFEKAAESTSSRMLHNHFDLSVIELKAEDRSKFLDALISLLS 555 T 7E-37 CDC45 pdbpercent F Eukaryota T 5dha 4 D D Engineered Nuclear Export Signal Peptide (CPEB4 NES reverse mutant) GGSYRMIDILSSELSHMDFTR 21 T 5.8 SAC3 pdbhh F T 5dhf 4 D D RIOK2_HUMAN Serine/threonine-protein kinase RIO2 GGSYRSFEMTEFNQALEEI 19 T 0.9 RhoGEF67_u1 pdbhh F Eukaryota T 5di8 3 C C Fc-III peptide DCAWHLGELVWCT 13 T 6.1 FAT pdbhh F T 5di9 4 D D Engineered Nuclear Export Signal Peptide (hRio2 NES reverse mutant) GGSYGKIEELAQNFETMEFSR 21 T 1.6 OTOS pdbhh F T 5dif 4 D D CPEB4_HUMAN Cytoplasmic polyadenylation element-binding protein 4 GGSYRTFDMHSLESSLIDI 19 T 6.4 DUF1959 pdbhh F Eukaryota T 5djn 1 A,B A,C F8VQ75_MOUSE Kinesin-like protein GPGSHAVVNEDPNAKVIRELREEVEKLREQLSQAEAMKAELKEKLEESEKLIKELTVTWEEKLRKTEAIAQERQRQLESMGISLETSGIKVGDD 94 T 2.1E-08 Kinesin_assoc pdb F Eukaryota T 5djq 4 M,N,O,P N,O,P,Q H7ESS5_PSEST Putative uncharacterized protein MFVDNVVLAGVVTVGLMVAFLAGFGYFIWRDAHKKS 36 T 0.0048 FixQ pdbhh F T 5dmg 3 G,H,I Z,P,X TAU_HUMAN Microtubule-associated protein SIDMVDSPQLATLAD 15 T 19 YkpC pdbhh F Eukaryota T 5dms 2 B,D B,D FBX43_MOUSE ENDOGENOUS MEIOTIC INHIBITOR 2 FSQHKTSTI 9 T 5.9 SVS_QK pdbhh F Eukaryota T 5dmu 1 A A D1Z0H5_METPS NHEJ Polymerase GLVPRGSHMTEVLHIEGHDIKVTNPDKVLFPEDGITKGELVDYYRRISGVMVPLVRGRPMTMQRFPDGIGKEGFFQKEASDYFPDWVHRATLELGKGGIQHQVVCDDAATLVYLASQAMITPHVFLSRIDKVHYPDRLIFDLDPPDNNFETVRSAAKTIREALDAEGYPVYLMTTGSRGLHVVVPLDRSADFDTVRAFARGFGEKLTKKYPDRFTIELSKEKRRGRLFLDYLRNSYGQTGVAPYGVRARSGAPVATPITWDELDDISGSQEYNIRNIMGRMDKRGDAWKYIDKDRTSIKNL 301 T 0.0052 S-methyl_trans unppercent F Archaea T 5dmv 2 B D FBX43_MOUSE ENDOGENOUS MEIOTIC INHIBITOR 2 SPLVTSTIKTEDVVSNSQNSRLHFSQHKTSTI 32 T 4 SVS_QK pdbhh F Eukaryota T 5dof 1 A,B,C,D A,B,C,D D2CVN7_TETTH P19 QQPKRNFDLYKLITDKQIDFQVADLIQDEQSSFVSVRIYGQFKCFVPKSTIQEQLDKIKNLSSKELAKNKIFKFLSEYNKNNQKQDELSHDYYGYFKVQQHQFILNLENAQREASLAVDDFYFINGRIYKTNHDILILQAHHVYQMQKPTLQLLQAASEINQN 163 T 0.53 TMF_DNA_bd pdb F Eukaryota T 5doi 2 E,F,G,H E,F,G,H Q6JXI5_TETTH P45 EDNFELVFLKELPSLPDFSKVCFTGLILSFSNFPSSEQNQQKDVPHKIAIIQDSTGEAELFLDMYKFCQEEISVFKAITGIGVLKKKNIGAGQVCKIIVERFRIIHSADEEMLQYLLIQKYKLSKTLN 128 T 0.21 YkpC pdb F Eukaryota T 5dok 1 A,B A,B Q6JXI5_TETTH P45 KSNQENINSLKYKELIAGELMRITHKLLIQKLQQQQPANNNKQINEMDVESNELAEKKEVIIKIQEIAKDQQLYDTLSIQYQVDQKEQYYAKIAQSLEDFVSISALKMVSYIYPNISYQVSIGFFQNILDIATKTVKDRGALGCNYKYLKDKLTKALNLQQISYPLISESYISYLVHLFQDFNIIEIENEHKFYYKQAFQYDDS 204 T 0.013 DUF2204 pdb F Eukaryota T 5doq 3 C C A0A0Q0UXS2_9BACI Putative membrane protein MQTFLIMYAPMVVVALSVVAAFWVGLKDVHVNE 33 T 0.16 FixS pdbpssm F Bacteria T 5dow 2 B,D,F,H B,D,F,H S26A3_MOUSE SLC26A3 TRANSPORTER, DOWN-REGULATED IN ADENOMA, PROTEIN DRA, SOLUTE CARRIER FAMILY 26 MEMBER 3 KRNKALKKIRKLQKRGLIQMTX 22 T 0.82 DUF2786 pdbhh F Eukaryota T 5dpw 2 B,D,F,H,J,L,N,P B,D,F,H,J,L,N,P PKHM1_HUMAN PH DOMAIN-CONTAINING FAMILY M MEMBER 1,162 KDA ADAPTER PROTEIN,AP162 PQQEDEWVNVQYPD 14 T 2 DOR pdbhh F Eukaryota T 5dqs 2 B D EF1B_HUMAN EF-1-BETA GAMGFGDLKSPAGLQVLNDYLADKSYIEGYVPSQADVAVFEAVSSPPPADLCHALRWYNHIKSYEKEKASLPGVKKALGKYGPADVEDTT 90 T 0.00027 GST_C_4 pdbhh F Eukaryota T 5drv 2 B B POLN_SFV NSP3 LTFGDFDE 8 T 0.16 DUF5102 pdbhh T Viruses T 5dx1 2 E,F,G,H F,G,H,I PABP1_HUMAN PABP1 peptide NMPGAIRPAAPXPPFSTMX 19 T 65 MIP-T3 pdbhh F Eukaryota T 5dx8 2 E,F,G,H E,F,G,H PABP1_HUMAN POLY(A)-BINDING PROTEIN 1 NMPGAIXPAAPXPPFSTMX 19 T 65 MIP-T3 pdbhh F Eukaryota T 5e0l 2 B C FA83D_HUMAN Protein Chica peptide SYRKAIDAATQTEE 14 T 0.054 TMEM131_like unppercent F Eukaryota T 5e0m 2 B C FA83D_HUMAN Protein Chica peptide SYWSRSTTTQTDM 13 T 0.054 TMEM131_like unppercent F Eukaryota T 5e0u 2 D,E,F D,E,F CDN1A_HUMAN CDK-INTERACTING PROTEIN 1,MELANOMA DIFFERENTIATION-ASSOCIATED PROTEIN 6,MDA-6,P21 CGRKRRQTSMTDFYHSKRRLIFS 23 T 0.94 CDC27 pdbhh F Eukaryota T 5e0v 2 C,D C,D FEN1_HUMAN FEN-1,FLAP STRUCTURE-SPECIFIC ENDONUCLEASE 1 STQGRLDDFFKVTGSL 16 T 0.15 LRV_FeS pdbhh F Eukaryota T 5e1b 2 C,D D,E RCC1_HUMAN RCC1 SPKRIA 6 T 9.3 DUF1107 pdbhh F Eukaryota T 5e1d 2 C,D D,E RCC1_HUMAN RCC1 YPKRIA 6 T 5.6 DUF1719 pdbhh F Eukaryota T 5e24 2 B,D B,D HLES_DROME Protein hairless GGRLQFFKDGKFILELARSKDGDKSGWVSVTRKTFRPP 38 T 0.43 Tryp_FSAP pdbhh F Eukaryota T 5e2a 2 C,D D,E RCC1_HUMAN RCC1 XPKRIA 6 T 7.2 DUF5394 pdbhh F Eukaryota T 5e2v 3 C P TAU_HUMAN TAU PHOSPHOPEPTIDE RSGYSSPGSPGTPGSRSR 18 T 15 Tachystatin_A pdbhh F Eukaryota T 5e5a 6 K K VIE1_HCMVT C-terminal domain of Regulatory protein IE1 GGKSTHPMVTRSKADQ 16 T 20 p53-inducible11 pdbhh T Viruses T 5e5x 1 A A ANFLVH (residues 13-18) from islet amyloid polypeptide ANFLVH 6 T 5 DUF1160 pdbhh F T 5e61 1 A,B A,B FGAILSS (residues 23-29) from islet amyloid polypeptide FGAILSS 7 T 6.2 Amelotin pdbhh F T 5e6q 2 B A XRCC1_HUMAN X-RAY REPAIR CROSS-COMPLEMENTING PROTEIN 1 SPKGKRKLDLNQEEKKTPSKPPAQLSPSVPKRPKLP 36 T 0.21 XRCC1_N pdbhh F Eukaryota T 5e7t 2 B B Q9AYV4_BPTU2 Minor structural protein 5 MADKNYLHTAYANSADGTDGFTTVYPNLNLLVNSSAKNKEGFFKNFDKVENGYGEVTMKGTNAWVNKDLGEGFSIQPINYKPGDKYTMSVDVMFTSWNVPAGTTISAFWMRQRYTENSWKEICTIDLPKDPSKMLNQWIRITQTSTIPPYEDPSVGTQAILNVGFFGQQEGSFTIRVRNPKQELGSIATPYMPSASEVTTADWPKFVGTYVDTNPVSSTVSSKYDWDEMKYRVYLDGTPVGGSKLLSFDLENLKAGTSYNVQVSQINGNVESDKSESVAFKTTLPK 286 T 0.012 CBM_4_9 pdbpercent T Viruses T 5e8n 3 C,F,I,L C,F,I,L TRH4, CERS5,LAG1 LONGEVITY ASSURANCE HOMOLOG 5,TRANSLOCATING CHAIN-ASSOCIATING MEMBRANE PROTEIN HOMOLOG 4,TRAM HOMOLOG 4 MCLRMTAVM 9 T 12 Adeno_E4_ORF3 pdbhh F T 5e8o 2 B,F C,F CERS5,LAG1 LONGEVITY ASSURANCE HOMOLOG 5,TRANSLOCATING CHAIN-ASSOCIATING MEMBRANE PROTEIN HOMOLOG 4,TRAM HOMOLOG 4 MXLRMTAVM 9 T 21 HV_small_capsid pdbhh F T 5e8p 3 C,F C,F CERS5,LAG1 LONGEVITY ASSURANCE HOMOLOG 5,TRANSLOCATING CHAIN-ASSOCIATING MEMBRANE PROTEIN HOMOLOG 4,TRAM HOMOLOG 4 MCLRXTAVM 9 T 19 DUF6401 pdbhh F T 5eay 2 E,F,G,H E,F,G,H DNA2_HUMAN HDNA2,DNA REPLICATION ATP-DEPENDENT HELICASE-LIKE HOMOLOG NELELLMEKSFWE 13 T 4.5 Retinal pdbhh F Eukaryota T 5ec5 1 A,B,C,D,E,F,G,H,I,J,K,L,M,N,O,P,Q,R A,B,C,D,E,F,G,H,I,J,K,L,M,N,O,P,R,S TXL_EISFE EFL1 GMSAKAAEGYEQIEVDVVAVWKEGYVYENRGSTSVDQKITITKGMKNVNSETRTVTATHSIGSTISTGDAFEIGSVEVSYSHSHQKSQVSMTQTEVYSSKVIEHTITIPPTSKFTRWQLNADVGGAGIEYMYLIDEVTPIGGTQSIPQVITSRAKIIVGRQIILGKTEIRIKHAERKEYMTVVSRKSWPAATLGHSKLFKFVLYEDWGGFRIKTLNTMYSGYEYAYSSDQGGIYFDQGTDNPKQRWAINKSLPLRHGDVVTFMNKYFTRSGLCYDDGPATNVYCLDKREDKWILEVVG 298 T 0.027 Toxin_10 unphh F Eukaryota T 5ed9 1 A,B,C A,B,C SUN2_MOUSE PROTEIN UNC-84 HOMOLOG B,SAD1/UNC-84 PROTEIN-LIKE 2 GPGSEFKSMTQEAFQESSVKELGRLEAQLASLRQELAALTLKQNSVADEVGLLPQKIQAARADVESQFPDWIRQFLLG 78 T 0.0003 ADIP pdbpercent F Eukaryota T 5eel 2 G,H,I,J,K,L L,M,N,P,Q,R Bicyclic Peptide Inhibitor SFEGYDNXX 9 T 0.52 DUF4911 pdbhh F T 5eeq 2 C,D L,M Bicyclic Peptide Inhibitor SFEGYDNSFPXX 12 T 0.6 Rox3 pdbhh F T 5efi 3 C C p99p YEHDFHHIREKGNHWKNFLAVM 22 T 7.9 DUF1925 pdbhh F T 5efv 1 A,B,C A,B,C Q8SDT4_BPPHA MINOR STRUCTURAL PROTEIN HHHHHHLVPRGSMSNKLITDLSRVFDYRYVDENEYNFKLISDMLTDFNFSLEYHRNKEVFAHDGEQIKYEHLNVTSNVSDFLTYLNGRFSNMVLGHNGDGINEVKDARVDNTGYGHKTLQDRLYHDYSTLDVFTKKVEKAVDEHYKEYRATEYRFEPKEQEPEFITDLSPYTNAVMQSFWVDPRTKIIYMTQARPGNHYMLSRLKPNGQFIDRLLVKNGGHGTHNAYRYIDGELWIYSAVLDSNKNNKFVRFQYRTGEITYGNEMQDVMPNIFNDRYTSAIYNPVENLMIFRREYKPTERQLKNSLNFVEVRSADDIDKGIDKVLYQMDIPMEYTSDTQPMQGITYDAGILYWYTGDSNTANPNYLQGFDIKTKELLFKRRIDIGGVNNNFKGDFQEAEGLDMYYDLETGRKALLIGVTIGPGNNRHHSIYSIGQRGVNQFLKNIAPQVSMTDSGGRVKPLPIQNPAYLSDITEVGHYYIYTQDTQNALDFPLPKAFRDAGWFLDVLPGHYNGALRQVLTRNSTGRNMLKFERVIDIFNKKNNGAWNFCPQNAGYWEHIPKSITKLSDLKIVGLDFYITTEESNRFTDFPKDFKGIAGWILEVKSNTPGNTTQVLRRNNFPSAHQFLVRNFGTGGVGKWSLFEGKVVE 648 T 0.1 Baculo_PEP_C pdbpercent T Viruses T 5eg2 2 B B TAF10_HUMAN STAF28,TRANSCRIPTION INITIATION FACTOR TFIID 30 KDA SUBUNIT,TAFII30 SKSKDRKYTL 10 T 0.0084 TFIID-31kDa unphh F Eukaryota T 5eha 1 A A G1K3P4_AGABI Lectin-like fold protein ARKIPLDLPGTRILNGANWANNSATENLATNSGTLIIFDQSTPGQDADRWLIHNYLDGYKIFNMGSNNWASVSRGNTVLGVSEFDGQTCKWSIEYSGNGEEFWIRVPREGGGGAVWTIKPASSQGPTTVFLDLLKETDPNQRIKFAVENLYFQ 153 T 0.002 Inhibitor_I66 unphh F Eukaryota T 5ehb 1 A,B A,B pHiosYI TDKIXDALEKLAEIQKEIAEFLRELIEAAEKT 32 T 0.1 Ribosomal_L22 pdb F T 5eib 4 D F CENPJ_HUMAN CENP-J,CENTROSOMAL P4.1-ASSOCIATED PROTEIN,LAG-3-ASSOCIATED PROTEIN,LYST-INTERACTING PROTEIN 1 KQPFLKRGEGLARFTNAKSKFQK 23 T 7 DUF6200 pdbhh F Eukaryota T 5ejo 1 A A RLF2_YEAST CAF-1 90 KDA SUBUNIT,RAP1 LOCALIZATION FACTOR 2 GPLGSMKQKAMITDPMDLLRLFDGVQDSTFSLGTVTEIAQKNLPQYNKQTIKNTIKEYAIRSSGKGDLPRKWVIKDAQNWENLRANANMPTPSL 94 T 0.00059 CAF1-p150_C2 unphh F Eukaryota T 5ejv 2 B,D C,D EBI96 Coactivator Peptide VESEFPYLLSLLGEVSPQP 19 T 0.82 DUF4576 pdbhh F T 5ekf 1 A,B B,C ERCC5_HUMAN DNA EXCISION REPAIR PROTEIN ERCC-5,XERODERMA PIGMENTOSUM GROUP G-COMPLEMENTING PROTEIN KTQKRGITNTLEESSSLKRKRLSD 24 T 7 DUF503 pdbhh F Eukaryota T 5ekg 1 A,B B,C ERCC5_HUMAN XPG2 peptide VFGKKRRKLRRARGRKRKT 19 T 6.4 AT_hook pdbhh F Eukaryota T 5elq 2 B,D P,C DGKZ_HUMAN GLU-ASP-GLN-GLU-THR-ALA-VAL REDQETAV 8 T 33 SNN_linker pdbhh F Eukaryota T 5ema 2 B B LRC3B_HUMAN ASP-ASP-ILE-SEP-THR-VAL-VAL PDDISTVV 8 T 29 B2 pdbhh F Eukaryota T 5emb 2 B B PTH1R_HUMAN GLU-GLU-TRP-SEP-THR-VAL-MET QEEWSTVM 8 T 0.54 Prp19 pdbhh F Eukaryota T 5enw 3 C C Peptide G9L GLKEGIPAL 9 T 10 CENP-M pdbhh F T 5eoa 2 C,D C,D TBK1_HUMAN NF-KAPPA-B-ACTIVATING KINASE,T2K,TANK-BINDING KINASE 1 GPGSYPSSNTLVEMTLGMKKLKEEMEGVVKELAENNHILERFGSLTMDGGLRNVDCL 57 T 0.0016 DUF713 unp F Eukaryota T 5eoj 1 A,B,C A,B,C ACC-Hex-PheI XELKAIAQEFKAIAKEFKAIAXEFKAIAQKX 31 T 2.6 DUF5741 pdbhh F T 5eok 2 B K P39 HIYPDFPTD 9 T 5.5 DUF4012 pdbhh F T 5eon 1 A,B,C A,B,C ACC-Hex XELKAIAQEFKAIAKEFKAIAWEFKAIAQKX 31 T 3.9 DUF5320 pdbhh F T 5eot 3 C C Peptide G13E GLLPELPAVGG 11 T 3.5 Fapy_DNA_glyco pdbhh F T 5ep6 2 C,D B,D TBK1_HUMAN NF-KAPPA-B-ACTIVATING KINASE,T2K,TANK-BINDING KINASE 1 SGSGSYPSSNTLVEMTLGMKKLKEEMEGVVKELAENNHILERFGSLTMDGGLRNVDCL 58 T 0.0016 DUF713 unp F Eukaryota T 5epp 2 B B RHBL4_HUMAN RRP4,RHOMBOID DOMAIN-CONTAINING PROTEIN 1,RHOMBOID-LIKE PROTEIN 4 SPEEMRRQRLHRFDS 15 T 0.71 SUIM_assoc pdbhh F Eukaryota T 5eqw 1 A,B,C,D,E A,B,C,D,E A0A125SJ78_9VIRU Putative major coat protein MHHHHHHMAKYEATKGDYAGGVLAILTQYFNNMVGYPEVSLKLAGEEANMSREGMINQKEIVHQMVETIRRASEPIRQGRGFHDAYVYFASVPENAPPNSIALPPQAQSEVQAKLTELMQKLANRNPQGVAEEEQELATQGI 142 T 0.08 KfrA_N pdbpercent T Viruses T 5esq 3 E,F E,F Cyclic beta-alanine-linked meditope XQFDLSTRRLKX 12 T 10 AAA_lid_8 pdbhh F T 5et0 2 B,D B,D MYO3B_MOUSE Myosin-IIIb SQRKPRKLGQIKVLDGEDQYYKCLSPGACAPEETHSVHPFFFSSSPREDPFAQH 54 T 0.078 VHL pdb F Eukaryota T 5et1 2 C,D C,D MYO3B_MOUSE Myosin-IIIb QKQRAPRRRCQQPKMLSSPEDTMYYNQLNGTLEYQG 36 T 15 Tho2 pdbhh F Eukaryota T 5eta 2 B,D D,C B6KJB6_TOXGV Putative transmembrane protein GLLERRGVSELPPLYI 16 T 1.8 ComFB pdbhh F Eukaryota T 5etf 2 B B MP2K6_HUMAN MAPKK 6,MAPK/ERK KINASE 6,MEK 6,STRESS-ACTIVATED PROTEIN KINASE KINASE 3,SAPKK3 SKGKKRNPGLKIPKA 15 T 3.4 GHL15 pdbhh F Eukaryota T 5etu 3 E,F E,F L5E meditope variant CQFDESTRRLKC 12 T 7.7 YsaB pdbhh F T 5euk 3 E,F E,F F3H meditope CQHDLSTRRLKC 12 T 11 DUF3637 pdbhh F T 5eur 1 A,B,C A,B,C SF216 MSISYRKLDIALSADKETVLVFGQELSTKYFTEIVVTTMLNSTGSDMANSNRILNDIHAAGLDAGDYGKYSRWWAQSNAQERQEAERRRKEAKAHQERMAAIHATPEEIAKAVAERKAREEALIKRFGNKGAAFGL 136 T 0.019 Topoisom_I_N pdb F T 5evf 1 A A A0Q625_FRATN Francisella virulence factor GSHMETKGVYLPKYSAELPPTDPSQVRVYNLQYQSDTQGNIGQVRTSTHVSNEKDFQKLCDKNLKEAIKLAAQHGAHEIKYICLYPEGQINELSSVQLRGYAFRD 105 T 0.078 DUF2757 pdb F Bacteria T 5ewz 2 C C GAB2_HUMAN GRB2-ASSOCIATED BINDER 2,GROWTH FACTOR RECEPTOR BOUND PROTEIN 2-ASSOCIATED PROTEIN 2,PP100 PRRNTLPAM 9 T 1.5 PKI pdbhh F Eukaryota T 5ex0 2 B D M3K2_HUMAN MAP3K2 peptide EKFGKGGTYP 10 T 12 Cyanate_lyase pdbhh F Eukaryota T 5ex8 1 A A Q8KLL7_STRTO STAF MHHHHHHGKPIPNPLLGLDSTENLYFQGIDPFTMFEEINVVRASQLHRRDRFDPVPELHSLMKEGGLTVLGTEDSTEGRTAWLATGIDEVRQVLGSDKFSARLLYGGTAAGITWPGFLTQYDPPEHTRLRRMVVPAFSHRRMQKFRPRVEQIVQDSLDTIESLGGPVDFVPHFGWAIATPATCDFLGIPRDDQADLARILLASRTDRSDKRRTAAGNKFMTYMKQHVAQSRRGSGDDLFGIVGRENGDAITDAELTGVAAFVMGAAADQVARLLAAGAWLMVEQPAQFALLREKPETVPEWLDETMRYLTTDEKTHPRVATQDVRIGNQLVKAGDTVTCSLLAANRPNYPSAEDEFDITREKAEHLAFGHGIHHCLGRAMAELMFKVSIPALAHRFPTLRLADPQREITLGPPPFDVEALLLDW 424 T 8.8E-29 p450 pdbpercent F Bacteria T 5exa 2 C,D C,D GAB2_HUMAN GRB2-ASSOCIATED BINDER 2,GROWTH FACTOR RECEPTOR BOUND PROTEIN 2-ASSOCIATED PROTEIN 2,PP100 PRRNTLPAMDQ 11 T 1.7 PKI pdbhh F Eukaryota T 5eyz 2 E,F,G,H E,F,G,H CYTO8-RETEV SWESHKSGRETEV 13 T 8.1 Svs_4_5_6 pdbhh F T 5ez0 2 E,F,G,H E,F,G,H B5MDL5_HUMAN MITOGEN-ACTIVATED PROTEIN KINASE 12,ISOFORM CRA_A SWARVSKETPL 11 T 0.51 CdhC pdbhh F Eukaryota T 5ez8 1 A,B,C,D,E,F,G A,B,C,D,E,F,G CC-Hept-I-C-I XGEIAQALKEIAKALKEIAWACKEIAQALKG 31 T 0.028 MCPsignal pdbpssm F T 5ez9 1 A,B,C,D,E,F,G A,B,C,D,E,F,G CC-Hept-L22H XGEIAKALREIAKALREIAWAHREIAKALRG 31 T 0.036 WXG100 pdbpssm F T 5eza 1 A,B,C,D,E,F,G A,B,C,D,E,F,G CC-Hept-C-H-I XGEIAKALREIAKALRECAWAHREIAKALRG 31 T 6.4 RecR pdbhh F T 5ezc 1 A,B,C,D,E,F,G A,B,C,D,E,F,G CC-Hept-C-H-E XGEIAKALREIAKALRECAWAHREEAKALRG 31 T 2.8 EDS1_EP pdbhh F T 5eze 1 A,B,C,D,E,F,G A,B,C,D,E,F,G CC-Hept-bMeCys-His-Glu XGEIAKALREIAKALREXAWAHREEAKALRG 31 T 2.8 EDS1_EP pdbhh F T 5f0l 4 D D NRAM2_HUMAN NRAMP 2,DIVALENT CATION TRANSPORTER 1,DIVALENT METAL TRANSPORTER 1,DMT-1,SOLUTE CARRIER FAMILY 11 MEMBER 2 HLGLTAQPELYLLNTMDADSLVSR 24 T 17 CagA pdbhh F Eukaryota T 5f0m 4 D D NRAM2_HUMAN NRAMP 2,DIVALENT CATION TRANSPORTER 1,DIVALENT METAL TRANSPORTER 1,DMT-1,SOLUTE CARRIER FAMILY 11 MEMBER 2 TAQPELYLLNTMSHHHHH 18 T 220 DUF1143 pdbhh F Eukaryota T 5f0o 2 B E C5DNF8_LACTC KLTH0G16610p LTNPSQYLLQDAVTEREVLLVP 22 T 14 AglB_L1 pdbhh F Eukaryota T 5f0p 4 D D NRAM2_HUMAN NRAMP 2,DIVALENT CATION TRANSPORTER 1,DIVALENT METAL TRANSPORTER 1,DMT-1,SOLUTE CARRIER FAMILY 11 MEMBER 2, DIVALENT CATION TRANSPORTER II TAQPELYLMNTMSHHHHH 18 T 220 DUF1143 pdbhh F Eukaryota T 5f1i 3 C,F,I,L,O,R,U,X C,F,I,L,O,R,U,X 9-mer peptide KLFSGELTK 9 T 6.4 DUF5823 pdbhh F T 5f1t 1 A,B,C,D,E,F A,B,C,D,E,F Macrocyclic peptide XAVLXVGSXVHGXATV 16 T 2.3 DUF6001 pdbhh F T 5f3o 1 A,B A,B C4LU64_ENTHI EHRNASEIII MSSTTLHNAMQYTAFDVLSSILNLMKADPLYDLLQLNQAYSSQDQEYEKNEFYGDSYLEERASSLVLKFLRKYEQIPFEMYSGLRIHTVKNQTLGEIFDLLHLGDTKTFEKKKKGDLVESLIGGCVLLSQRENATLFLLFAHALIDYIFYHSSYIYFNANPPKLVKEEIITDIQNWFKDKLFYYRSSLEKYQTDP 195 T 0.025 Ribonuclease_3 pdbpssm F Eukaryota T 5f3q 1 A,B A,B C4LU64_ENTHI EH.RNASEIII SSSTTLHNAMQYTAFDVLSSILNLMKADPLYDLLQLNQAYSSQDQEYEKNEFYGDSYLEERASSLVLKFLRKYEQIPFEMYSGLRIHTVKNQTLGEIFDLLHLGDTKTFEKKKKGDLVESLIGGCVLLSQRENATLFLLFAHALIDYIFYHSSYIYFNANPPKLVKEEIITDIQNWFKDKLFYYRSSLEKYQT 193 T 0.025 Ribonuclease_3 pdbpssm F Eukaryota T 5f3y 2 B B ANS4B_MOUSE ANKS4B GSVEEDDDVQHESILNRPGLGSIVFSRNRVLDFEDISDSKRELGFKMPSELFQRQGAAGTVEEEEEEEEEEEEEKREANGTAGDLPWDEEEVEWEEDAVDAT 102 T 0.37 PBP_sp32 pdb F Eukaryota T 5f67 2 B,D C,D TRP_DROME TRP C terminal Tail GPGSRGKSTVTGRMISGWL 19 T 0.019 Mur_ligase_M pdbhh F Eukaryota T 5f6k 3 D,F D,F RBBP5_HUMAN RBBP-5,RETINOBLASTOMA-BINDING PROTEIN RBQ-3 SAFAPDFKELDENVEYEERESEFDIED 27 T 0.014 DUF2457 unppercent F Eukaryota T 5f74 2 B B MLXPL_RAT CHREBP,CLASS D BASIC HELIX-LOOP-HELIX PROTEIN 14,BHLHD14,MLX INTERACTOR,MLX-INTERACTING PROTEIN-LIKE,WS BASIC-HELIX-LOOP-HELIX LEUCINE ZIPPER PROTEIN,WS-BHLH,WILLIAMS-BEUREN SYNDROME CHROMOSOMAL REGION 14 PROTEIN MARALADLSVNLQVPRVVPSPDSDSDTDLEDPSPRRSAGGLHRSQVIHSGHFMVSSPHSDSLTRRRDQEGPVGLADFGPRSIDPTLTRLFECLSLAYSGKLVSPKWKNFKGLKLLCRDKIRLNNAIWRAWYIQYVQRRKSPVCGFVTPLQGSEADEHRKPEAVVLEGNYWKRRIEVVMREYHKWRIYYKKRLRKSS 196 T 0.17 DUF1752 pdbhh F Eukaryota T 5f7d 3 C C Peptide G11N GLKEGIPALD 10 T 14 ATPase pdbhh F T 5f88 3 E,F E,F L5Y meditope CQFDYSTRRLKC 12 T 10 Flavi_NS1 pdbhh F T 5f8t 2 B P CYS-PRO-LYS-ARG-PHE-M70-ALA-LEU-PHE-CYS CPKRFAALFC 10 T 1.1 DUF4395 pdbhh F T 5f8x 2 B B CYS-PRO-ALA-ARG-PHE-M70-ALA-LEU-TRP-CYS CPARFAALWC 10 T 1.5 BTRD1 pdbhh F T 5f8z 2 B B CYS-PRO-ALA-ARG-PHE-M70-ALA-LEU-PHE-CYS CPARFAALFC 10 T 2.3 NUC153 pdbhh F T 5f9j 3 C C Peptide Y9L YLSPIASPL 9 T 3.3 Fe_hyd_lg_C pdbhh F T 5fa3 3 C C G9V GLLPELPAV 9 T 2 Fapy_DNA_glyco pdbhh F T 5fa4 3 C C Peptide Y16R YLSPIASPLLD 11 T 6.6 Fe_hyd_lg_C pdbhh F T 5fa5 3 C C H4_HUMAN Histone H4 SGRGKGGKGLGKGGAKRHRK 20 T 11 Shadoo unppercent F Eukaryota T 5fby 2 B B cleaved peptide MGSSHHHHHHSQLEVLFQGPLGSGRP 26 T 280 Glyco_hyd_65N_2 pdbhh F T 5fc2 1 A A pAMK, peptide containing a phospho-serine DKSIEVGRX 9 T 11 DUF2244 pdbhh F T 5fcm 1 A,B,C,D A,B,C,D A8ID55_CHLRE Basal body protein GSMAIDVDRTLAVLRRKLEALGYSDPLEPASLQLVQKLVEDLVHTTDSYTAVKQQCAKQAQEIAAFDTRLES 72 T 0.00048 ADIP unphh F Eukaryota T 5fdu 54 BB,DD 1y,2y Metalnikowin I VDKPDYRPRP 10 T 5.7 NinD pdbhh F T 5fdv 54 CB,DD 1y,2y PYRRH_PYRAP Pyrrhocoricin VDKGSYLPRPTPPRPI 16 T 1.4 Apidaecin pdbhh F Eukaryota T 5fdw 3 C C Peptide Y10L YLSPIASPLL 10 T 4.9 Fe_hyd_lg_C pdbhh F T 5ff6 3 E,F E,F L10Q meditope CQFDLSTRRQKC 12 T 7.5 DUF1254 pdbhh F T 5fg0 1 A,B A,B LTN1_YEAST RING DOMAIN MUTANT KILLED BY RTF1 DELETION PROTEIN 1 SLNTDLGLGHNGVRISLNYFDGLPDPSLLNSLYSNELKLIFKSLLKRDETTKEKALMDLSNLISDFNQNEYFFNDIFLLCWSQIYAKLIISDYKVIRLQSHQITIMLVKSLRKKISKFLKDFIPLILLGTCELDYSVSKPSLNELTECFNKDPAKINALWAVFQEQLLNLVKEIVVNENEDTISDERYSSKEESEFRYHRVIASAVLLLIKLFVHNKDVSERNSSSLKVILSDESIWKLLNLKNGQNTNAYETVLRLIDVLYTRGYMPSHKNIMKLAVKKLLKSLTHITSKNILKVCPVLPSILNLLATLDDYEDGTIWSYDKSSKEKVLKFLSVSRTSPSPGFFNAVFALYSSTKRHSFLDYYLEWLPFWQKSVQRLNEKGFSARNSAEVLNEFWTNFLKFAEDSSEERVKKM 414 T 0.0085 CLASP_N pdbhh F Eukaryota T 5fg8 2 B B KCNAE_DROME ETHER-A-GO-GO PROTEIN GVLPKAPKLQASQATLARQDTIDEGGEVDSSPPSRDSRVVIEGAAVSSATVGPS 54 T 30 DUF4491 pdbhh F Eukaryota T 5fkp 3 C C P99 YEHDFHHIREWGNHWKNFLAVM 22 T 3.2 Xpo1 pdbhh F T 5fl2 2 B K RAVA_ECOLI REGULATORY ATPASE VARIANT A, REGULATORY ATPASE VARIANT A, R AVA ATPASE DKTALTVIRLGGIFSRRQQYQLPVNVTASTLTLLLQKPLKLHDMEVVHISFERSALEQWLSKGGEIRGKLNGIGFAQKLNLEVDSAQHLVVRDVSLQGSTLALPGS 106 T 0.17 HTH_17 unppssm F Bacteria T 5frp 2 C,D C,D SCC1_YEAST MCD1-LIKE PROTEIN RLNTVTRVHQLMLEDAVTEREVLVTPGLEFLDDTTIPVGLMAQE 44 T 4.2 EF-hand_13 pdbhh F Eukaryota T 5frq 2 E,F G,L DNLJ_HELPY DNA LIGASE QEFIRSLF 8 T 1.2 IFRD_C pdbhh F Bacteria T 5frs 2 B C SCC1_YEAST SCC1 LMMEDAVTEREVLVTPG 17 T 0.18 RPW8 unp F Eukaryota T 5fs4 1 A,B A,B Q9AZ42_9VIRU AP205 BACTERIOPHAGE COAT PROTEIN GSMANKPMQPITSTANKIVWSDPTRLSTTFSASLLRQRVKVGIAELNNVSGQYVSVYKRPAPKPEGGADAGVIMPNENQSIRTVISGSAENLATLKAEWETHKRNVDTLFASGNAGLGFLDPTAAIVSSDTTA 133 T 15 Packaging_FI unphh T Viruses T 5ft1 2 C,D,F,H,J,L C,D,F,H,J,L Q9HZM8_PSEAE RNASE E ERPRRRSRGQRRRSNRRERQ 20 T 26 Tristanin_u2 pdbhh F Bacteria T 5fu7 4 D,H D,H A0A0B4KGY5_DROME NANOS, ISOFORM B GPHMLESHQQTDEIARSLKIFAQVTTGAAENAAGSMQDVMQEFATNGYASDDLG 54 T 0.053 Tape_meas_lam_C pdb F Eukaryota T 5fw5 2 C C POLN_SFV NON-STRUCTURAL PROTEIN 3 LTFGDFDEHEVDALASGITFGDFDD 25 T 0.21 DUF5102 pdbhh T Viruses T 5fwe 2 C,D C,D H4_HUMAN SYNTHETIC PEPTIDE SGXGKGGKGLGKGGA 15 T 11 Shadoo unppercent F Eukaryota T 5fzt 2 B B RHG07_HUMAN DELETED IN LIVER CANCER 1 PROTEIN, DLC-1, HP PROTEIN, RHO-TYPE GTPASE-ACTIVATING PROTEIN 7, START DOMAIN-CONTAINING PROTEIN 12, STARD12, STAR-RELATED LIPID TRANSFER PROTEIN 12, DLC1 PELDDILYHVKGMQRIVNQWSEK 23 T 4.7 Leptin pdbhh F Eukaryota T 5fzv 1 A A A0A0A0V662_9ARAC Venom peptide U3-SYTX-Sth1a GLIESIACIQKGLPCMEHSDCCRGVCEALFCQ 32 T 9.9E-05 Toxin_18 pdbhh F Eukaryota T 5fzw 1 A A A0A0A0VBR5_9ARAC Venom peptide U3-SYTX-Sth1h GLIESIACMQKGLPCMEHVDCCHGVCDSLFCLY 33 T 0.00011 Toxin_18 pdbhh F Eukaryota T 5fzx 1 A A A0A0A0V633_9ARAC U5-SCYTOTOXIN-STH1A DETPDECVTRGNFCATPEVHGDWCCGSLKCVSNSCR 36 T 0.00036 Conotoxin unphh F Eukaryota T 5g04 15 R S HSL1_YEAST HSL1 QNSASKRSLYSLQSISKRSLNLNDLLVFDDPLPSKKPASENVNKSEPHSLESDSDFEILCDQILFGNALDRILEEEEDNEKERDTQRQRQNDTKSSADTFTISGVSTNKENEGPEYPTKIEKNQFNMSYKPSENMSGLSSFPIFEKENTLSSSYLEEQKPKRAALSDITNSFNKMNKQEGMRIEKKIQREQLQKKNDRPSPLKPIQ 206 T 0.068 CANIN pdb F Eukaryota T 5g51 1 A A Q8B3M2_9VIRU DWV-VP3-P-DOMAIN EEYRAKTGYAPYYAGVWHSFNNSNSLVFRWGSASDQIAQWPTISVPRGELAFLRIKDGKQAAVGTQPWRTMVVWPSGHGYNIGIPTYNAERARQLAQHLYGGGSLTDEKAKQLFVPANQQGPGKVSNGNPVWEVMRAPLATQRAHIQDFEFIEAIPE 157 T 9.2 GatD_N pdbhh T Viruses T 5gad 36 JA k PPB_ECOLI 1A9L SS MKQSTLALLLLLLLLTPV 18 T 0.1 LPAM_1 pdb F Bacteria T 5gaq 1 A,B,C,D,E,F,G,H,I A,B,C,D,E,F,G,H,I TXL_EISFE EFL1 MSAKAAEGYEQIEVDVVAVWKEGYVYENRGSTSVDQKITITKGMKNVNSETRTVTATHSIGSTISTGDAFEIGSVEVSYSHSHEESQVSMTETEVYESKVIEHTITIPPTSKFTRWQLNADVGGADIEYMYLIDEVTPIGGTQSIPQVITSRAKIIVGRQIILGKTEIRIKHAERKEYMTVVSRKSWPAATLGHSKLFKFVLYEDWGGFRIKTLNTMYSGYEYAYSSDQGGIYFDQGTDNPKQRWAINKSLPLRHGDVVTFMNKYFTRSGLCYDDGPATNVYCLDKREDKWILEVVGLVPRGSGHHHHHH 310 T 0.027 Toxin_10 unphh F Eukaryota T 5gds 3 C I HIRUNORM V XVXTDXGXPESHXGGDYEEIPXXYXX 26 T 0.16 Hirudin pdbhh F T 5gg4 2 E,F,G,H E,F,G,H RN169_HUMAN RING FINGER PROTEIN 169 RGRKRHCKTKHLE 13 T 0.0061 BCOR pdbhh F Eukaryota T 5ggp 2 C,D C,D DAG1_HUMAN DYSTROPHIN-ASSOCIATED GLYCOPROTEIN 1 ATPTPVTAIG 10 T 1.3 NiFe_hyd_3_EhaA pdbhh F Eukaryota T 5ghr 2 B,D B,D Q5JF31_THEKO Putative uncharacterized protein MGSSHHHHHHSSGENLYFQGHMSKEVPKEAYIIQIDLPAVLGPDMKEYGPFMAGDMAIIPTVIGRALVEREAARRVRIFL 80 T 0.31 SSURE unppercent F Archaea T 5gi0 1 A A MORF9_ARATH RNA EDITING-INTERACTING PROTEIN 9 MEQRETIMLPGSDYNHWLIVMEFPKDPAPSRDQMIDTYLNTLATVLGSMEEAKKNMYAFSTTTYTGFQCTIDEETSEKFKGLPGVLWVLPDSYIDVKNKDYGGDKYINGEIIPSTYPTYQPKQLEHHHHHHHH 133 T 0.27 Inhibitor_I9 pdbhh F Eukaryota T 5gic 2 B C MED1_HUMAN SRC1 NHPMLMNLLK 10 T 14 CoV_NSP8 pdbhh F Eukaryota T 5gim 3 C C Q6F3E8_AMBVA N-terminal peptide from Putative uncharacterized protein avahiru SGGHQTAVPK 10 T 4.2 Tsp45I pdbhh F Eukaryota T 5gim 4 D D Q6F3E8_AMBVA C-terminal peptide from Putative uncharacterized protein avahiru ISKQGLGGDFEEIPSDEIIE 20 T 0.0093 Hirudin pdbhh F Eukaryota T 5glf 2 B,D,F,H B,D,F,H DERL1_HUMAN DEGRADATION IN ENDOPLASMIC RETICULUM PROTEIN 1,DERTRIN-1,DER1-LIKE PROTEIN 1 RHNWGQGFRLGD 12 T 0.31 DUF6123 pdbhh F Eukaryota T 5gmi 2 C,D C,D JAM3_MOUSE JAM-C,JAM-2,JUNCTIONAL ADHESION MOLECULE 3,JAM-3 NYIRTSEEGDFRHKSSFVI 19 T 0.2 ASTN_1_2_N unphh F Eukaryota T 5gmj 2 C,D C,D JAM2_MOUSE JAM-B,JUNCTIONAL ADHESION MOLECULE 2,JAM-2,VASCULAR ENDOTHELIAL JUNCTION-ASSOCIATED MOLECULE,VE-JAM SKVTTMSENDFKHTKSFII 19 T 3.7 RhoGEF67_u1 pdbhh F Eukaryota T 5gmv 2 B,D D,C FUND1_HUMAN FUNDC1 PEPTIDE DSYEVLDL 8 T 21 DUF6417 pdbhh F Eukaryota T 5gmy 2 B B acceptor peptide, ARG-TYR-ASN-VAL-THR-ALA-CYS RYNVTAC 7 T 0.53 DUF5735 pdbhh F T 5gnf 1 A,B A,B L7P7R7_9CAUD Uncharacterized protein AcrF3 SMSNTISDRIVARSVIEAARFIQSWEDADPDSLTEDQVLAAAGFAARLHEGLQATVLQRLVDESNHEEYREFKAWEEALLNADGRVASSPFADWGWWYRIANVMLATASQNVGVTWGSRVHGRLMAIFQDKFKQRYEEQA 140 T 0.23 Rtt102p unppssm T Viruses T 5gnv 2 B B MAP1A_MOUSE MAP-1A AELEGGPYSPLGKDYRKAEGEREGEG 26 T 8.1 DUF2059 pdbhh F Eukaryota T 5go3 1 A,B A,B DNCV_VIBCH C-GMP-AMP SYNTHASE,3'3'-CGAMP SYNTHASE,CYCLIC AMP-GMP SYNTHASE,C-AMP-GMP SYNTHASE,DINUCLEOTIDE CYCLASE DNCV GPLGSMRMTWNFHQYYTNRNDGLMGKLVLTDEEKNNLKALRKIIRLRTRDVFEEAKGIAKAVKKSALTFEIIQEKVSTTQIKHLSDSEQREVAKLIYEMDDDARDEFLGLTPRFWTQGSFQYDTLNRPFQPGQEMNINDGTYMPMPIFESEPKIGHSLLILLVDASLKSLVAENHGWKFEAKQTCGRIKIEAEKTHIDVPMYAIPKDEFQKKQIALEANRSFVKGAIFESYVADSITDDSETYELDSENVNLALREGDRKWINSDPKIVEDWFNDSCIRIGKHLRKVCRFMKAWRDAQWDVGGPSSISLMAATVNILDSVAHDASDLGETMKIIAKHLPSEFARGVESPDSTDEKPLFPPSYKHGPREMDIMSKLERLPEILSSAESADSKSEALKKINMAFGNRVTNSELIVLAKALPAFAQEPSSASKPEKISSTMVSGHHHHHH 447 T 0.035 zf-NF-X1 pdbpssm F Bacteria T 5gow 1 A A TFDP1_HUMAN DP1 YVGEDDEEDDDFNENDEDD 19 T 5.5 DUF4820 pdbhh F Eukaryota T 5gp7 2 B B UBP25_HUMAN USP25 SLSRTPADGR 10 T 3.7 tRNA-synt_2e pdbhh F Eukaryota T 5gpk 1 A,B A,B YO48_SCHPO CCP1 MEAAQAFENLANLEQEFGKAEIEILKKQNELFQPLFEQRRDILKTINNFWVVVLEAAGDEISQYITPEDSVLLEKLENIYVERFNEKEPRDVRISLTFQPNEYLQDDNLTLVKEVRMKEEKAKDDEGLEKKITKYTSQPVDIHWKPGKSMFRKNKKLPPNFFDYFQWTGEEEDDDFDGATLTIFLAEDLFPNAVKYFTEAMTEEASDEDESVDLEEDEEEEDEEDEEGDEEKQEPPSKKSKKSNAAAENLYFQGLEDYKDDDDKHHHHHHHHHH 274 T 3.4E-06 NAP unppercent F Eukaryota T 5gqh 2 B,C B,C L7P7R7_9CAUD ACRF3,UNCHARACTERIZED PROTEIN MSNTISDRIVARSVIEAARFIQSWEDADPDSLTEDQVLAAAGFAARLHEGLQATVLQRLVDESNHEEYREFKAWEEALLNADGRVASSPFADWGWWYRIANVMLATASQNVGVTWGSRVHGRLMAIFQDKFKQRYEEQA 139 T 0.23 Rtt102p unppssm T Viruses T 5grq 2 C,D C,D ATRX_HUMAN ATP-DEPENDENT HELICASE ATRX,X-LINKED HELICASE II,X-LINKED NUCLEAR PROTEIN,XNP,ZNF-HX VTVDDDDDDNDPENRIAKKMLLEEIKANLS 30 T 15 APC_rep pdbhh F Eukaryota T 5grs 2 B,D,F,H I,J,K,L SCAP_SCHPO SREBP CLEAVAGE-ACTIVATING PROTEIN AHMNTHSGGETQVWEVWMYSQSEKKHRSKSLKMYNSLIIADPGPSLAVSDRCVAIVLGNYVALVGYGSEIFRDFYQIRNSDEMDRILRRKRKNLQRKRSGTIG 103 T 17 BBS1 pdbhh F Eukaryota T 5gsf 1 A A A0A1S4NYD9_9ROSI roseltide rT1 CIPRGGICLVALSGCCNSPGCIFGICA 27 T 0.0001 DUF5637 pdbhh F Eukaryota T 5gtb 2 B B PDV2_ARATH PROTEIN PLASTID DIVISION2 LVKERVEIPFDSVVAKRDVTYGYG 24 T 1 DUF1163 pdbhh F Eukaryota T 5gtc 6 K K ORF73_HHV8P LANA peptide GMRLRSGRSTGX 12 T 1.4 RNR_inhib pdbhh T Viruses T 5gvo 1 A A A0A171DJY5_9ACTN SPHAERICIN GLPIGWWIERPSGWYFPI 18 T 0.0011 DUF5972 unp F Bacteria T 5gwg 1 A,B A,B Q4JEI2_RAT RATTUSIN, PROTEIN DEFAL1 LRVRRTLQCSCRRVCRNTCSCIRLSRSTYAS 31 T 0.31 F-box pdb F Eukaryota T 5gwm 1 A A Q9BML7_DROME Metabotropic GABA-B receptor subtype 1 MDSAISKEDEERYQKLVTENEQLQRLITQKEEKIRVLRQRLVERGDAKGTELN 53 T 0.0015 Csm1_N pdbpssm F Eukaryota T 5gwm 2 B B Q9VPS7_DROME GABAB RECEPTOR 3,METABOTROPIC GABA-B RECEPTOR SUBTYPE 3,ISOFORM D,ISOFORM E,ISOFORM G GPLGSRRFVVDDRRELQYRVEVQNRVYKKEIQALDAEIRKLERLLESGLT 50 T 0.0041 CCDC14 pdbpercent F Eukaryota T 5gxw 2 B B NUMA1_HUMAN NUCLEAR MATRIX PROTEIN-22,NMP-22,NUCLEAR MITOTIC APPARATUS PROTEIN,NUMA PROTEIN,SP-H ANTIGEN RQQRKRVSLEPHQGPGTPESKKATSCF 27 T 0.0067 DUF4023 pdbpssm F Eukaryota T 5gzt 1 A A K7ZLW6_9BACL Chitinase MNHKVHHHHHHIEGRHMELGTLEVILDRAAAFKNEANAIAYDKAGTYGPASGTETIDGNVKVTVPGVTLRNLVIKGDLLLSEGVGSGDVTLDKVSVHGLTTVSGGGEN 108 T 0.0052 DUF5649 pdbpercent F Bacteria T 5h0r 1 A F A0A0S1LIW6_CPVBM RNA-dependent RNA polymerase MLPNTKLHNTIFSETRKFTRESFKEIEHLTARLANDSVARHDFLFNTSIALISDYSGEDSNGNQLQATITIPNEIINPKEYDPSDYPLAEDESFFKQGHKYDYLVTFRAGSLTNTYEPKTKMYKLHAALDKLMHVRQRKSRFADLWRELCAVIASLDVWYQTTNYPLRTYVKLLFHRGDEFPFYESPSQDRIIFNDKSVASILPTFVYTCCQVGTAIMSGILTHVESIVAMNHFLHCAKDSYIDEKLKIKGIGRSWYQEALHNVGQATVPVWSQFNEVIGHRRKSTSEPHFVSSTFISLRAKRAELLYPEFNAYINRAIQLSKTQNDVANYYAACRAMTNDGTFLATLTELSLDAAVFPRIEQRLVTRPAVLMSNTRHESLKQKYTNGVGSIAQSYLSSFTDEIAKRVNGIHHDEAWLNFLTTSSPGRKLTEIEKLEVGGDVAAWSNSRIVMQAVFAREYRTPERIFKSLKAPIKLVERQQSDRRQRAISGLDNDRLFLSFMPYTIGKQIYELNDNAAQGKQAGNAFDIGEMLYWTSQRNVLLSSIDVAGMDASVTTNTKDIYNTFVLDVASKCTVPRFGPYYAKNMEVFEVGKRQSQVRYVNAAWQACALEAADSQTSTSYESEIFGQVKNAEGTYPSGRADTSTHHTVLLQGLVRGNELKRASDGKNSCLATIKILGDDIMEIFQGSESDTYDHAMSNANILNESGFATTAELSQNSIVLLQQLVVNGTFWGFADRISLWTREDTKDIGRLNLAMMELNALIDDLVFRVRRPEGLKMLGFFCGAICLRRFTLSVDNKLYDSTYNNLSKYMTLIKYDKNPDFDSTLMSLILPLAWLFMPRGGEYPAYPFERRDGTFTEDESMFTARGAYKRRLLYDVSNIREMIQQNSMALDDDLLHEYGFTGALLLIDLNILDLIDEVKKEDISPVKVSELATSLEQLGKLGEREKSRRAASDLKIRGHALSNDIVYGYGLQEKIQKSAMATKETTVQSKRVSSRLHDVIVAKTRDYKISTIPADALRLHEFEVEDVTVDLLPHAKHTSYSSLAYNMSFGSDGWFAFALLGGLDRSANLLRLDVASIRGNYHKFSYDDPVFKQGYKIYKSDATLLNDFFTAISAGPKEQGILLRAFAYYSLYGNVEYHYVLSPRQLFFLSDNPVSAERLVRIPPKYYVSTQCRALYNIFSYLHILRSIANNWGKRLKMVLHPGLIAYVRGTSQGAILPEADNV 1225 T 0.48 DUF445 pdbpercent T Viruses T 5h0r 2 B G Q80A92_CPV1 VP4 protein MFAIDPLKHPKLYEEYGLYLRPHQINQEIKPTTIKKKELAPTIRSIKYASLIHSMLAKHAARHNGTLINPRMYADMITLGNTKVTVTKGTPKAQIDTLKMNGLTVVSKSRRNNKKKPVSDTTASTDETTDDVVTYKALTEMSTLVESFRLPSGLTLIVFDDEKYQSLIPDYINQLITYTQPHIIPTWQGITDFSDTYLRSYFKRPFELTASNLAVPQKHNLSPITRSIFNNTGREDAIIRKLYGYGEYVFIKYEGCLITWTGLYGAVTMMVNLPKRDLGLDVGDDFLKEYKKLLFHGVITDAIPSGISAKSTVMRISPHKMMNPSGGALAVLSKYIEAVVSTNVINATLVVYAEKGAGKTSFLSTYAQQLSLASGQIVGHLSSDAYGRWLAKNKDVEEPSFEYDYVLSLDTDDNESYYEQKASELLTSHGISELSQYELLSVRRKVKMMNEMDEILIAQLDNANTHSERNFYYMVSTGKNTPRTLIVEGHFNAQDATIARTDTTILLRTINDTTQAMRDRQRSGVVQLFLRDTYYRLLPSLHTTVYPFEMLESIKRWKWVH 561 T 6.5E-05 Zeta_toxin unphh T Viruses T 5h1h 1 A A Bradykinin-trypsin inhibitor secondary loop chimera CFPDGRCKRPPGFSPL 16 T 0.0039 Bradykinin pdbhh F T 5h1i 1 A A Bradykinin-trypsin inhibitor secondary loop chimera CKRPPGFSPLCTKSIPPI 18 T 0.0047 Bradykinin pdbhh F T 5h1z 1 A A A0A1S4NYE0_9SPHN putative CYP alkane hydroxylase CYP153D17 ATLAQDVIDRFDVSRPELYRDDLWQAPFRELRATAPVHRVEHSDFGPYWSVSSYKPIITVESLPDLYSSAGGITLADFIENNPTDVRMPMFIAMDRPKHTGQRRTVAPAFTPSEMVRMSDNIRMRTAEVLDSLEWNTPFDWVDTVSVELTTQMLAILFDFPWEERRKLTFWSDWAGDIELVKNEELRLERLRHMYECGGYFQNLWNAKIGKPPTPDLISMMIHSDAMAEMDQMEFLGNLILLIVGGNDTTRNTMSAVAYGLDLFPDQRAKLEADPSMIPNTVQEIIRWQTPLAHMRRTATVDSELEGQQIKAGDKLALWYISANRDESVFENADRIIVDRPNARRHLAFGHGIHRCVGARLAELQIAVLLEEMAKRRMRVNVLGEPERVAACFVHGYRKLPVEISRY 407 T 4.5E-25 p450 pdbpssm F Bacteria T 5h2c 2 B B NVJ1_YEAST Nucleus-vacuole junction protein 1 KHYNDGERAVLQFGKNRSEPIILSYKD 27 T 2.8 Mg-por_mtran_C pdbhh F Eukaryota T 5h2v 2 B B ULP1_YEAST Ubiquitin-like-specific protease 1 MSVEVDKHRNTLQYHKKNPYSPLFSPISTYRCYPRVLNNPSESRRSASFSGIYKKRTNTSRFNYLNDRRVLSMEESMKDGSDRASKAGFIGGIRETLWNSGKYLWHTFVKNEPRNFDGSEVEASGNSDVESRSSGSRSSDVPYGLRENYS 150 T 12 DUF1412 pdbhh F Eukaryota T 5h2w 2 B,D B,D ULP1_YEAST Ubiquitin-like-specific protease 1 SSDTRKHKFDTSTWALPNKRRRIESEGVGTPSTSPISSLASQKSNCDSDNSITFSRDPFGWNKWKTSAIGSNSENNTSDQKNSYDRRQYGTAFIRKKKVAKQNINNTKLVSRAQSEEVTYLRQIFNGEYKVPKILKEERERQLKLMDMDKEKDTGLKKSIIDLTEKIKTILIENNKNRLQTRNENDDDLVF 191 T 23 DUF6203 pdbpssm F Eukaryota T 5h2x 2 B B ULP1_YEAST Ubiquitin-like-specific protease 1 SSDTRKHKFDTSTWALPNKRRRI 23 T 9.8 DUF3579 pdbhh F Eukaryota T 5h3j 2 B B GO45_MOUSE BASIC LEUCINE ZIPPER NUCLEAR FACTOR 1 GPEFHPYTRYENITFNCCNHCQGELIAL 28 T 0.23 zf_Rg pdbhh F Eukaryota T 5h43 3 C C KAT8_HUMAN LYSINE ACETYLTRANSFERASE 8,MOZ,YBF2/SAS3,SAS2 AND TIP60 PROTEIN 1,HMOF SELAEQPERKITRNQ 15 T 0.0039 Myosin_head unp F Eukaryota T 5h4p 44 RA z REH1_YEAST REI1-HOMOLOG 1,PRE-60S FACTOR REH1 TITAADRRMVSGVTEKQYKKGMKKMQQLEKNAINTQIRREIKRVNFQTHYRDELLQ 56 T 0.034 Phage_Mu_F pdb F Eukaryota T 5h5m 1 A,B A,B HMP1_CAEEL PROTEIN HUMPBACK-1 GGIQGDLINEIDTFQNRIEIDPAHYRRGTDRPDLEGHCERIVSGSASIADAESTRENRKQKIVAECNNLRQALQELLTEYEKSTGRRDDNDDIPLGIAEVHKRTKDLRRHLRRAIVDHISDAFLDTRTPLILLIEAAKEGHEENTRYRSKMFQEHANEIVSVARLSCQLSSDVESVSVIQHTAAQLEKLAPQVAQAAILLCHQPTSKTAQENMETYKNAWFDKVRLLTTALDNITTLDDFLAVSEAHIVEDCERGIKGITANASTPDENAANCETVDCAAGSIRGRALRVCDVVDAEMDFLQNSEYTETVKQAVRILKTQRVDQFAERASALANRQEAHGLTWDPKTKEEEMNEFINACTLVHDAVKDIRHALLMNRSMND 381 T 4.4E-80 Vinculin unp F Eukaryota T 5h5q 2 B B GXpep-1 XCRVDLQGWRRCRRX 15 T 1.1 WYL_2 pdbhh F T 5h5r 2 B B GXpep-2 XCRAWYQNYCALRRX 15 T 0.031 LIN37 pdbhh F T 5h5s 2 B B GXpep-3 VPCPYLPLWNCAGK 14 T 1.4 DUF4708 pdbhh F T 5h5y 1 A,B A,B A0A0D7C3R7_ECOLX T3SS EFFECTOR NLEB MLSPIRTTFHNSVNIVQSSPSQTVSFAGKEYELKVIDEKTPILFQWFEPNPERYKKDEVPIVNTKQHPYLDNVTNAARIESDRMIGIFVDGDFSVNQKTAFSKLERDFENVMIIYREDVDFSMYDRKLSDIYHDIICEQRLRTEDKRDEYLLNLLEKELREISKAQDSLISMYAKKRNHAWFDFFRNLALLKAGEIFRSTYNTKNHGISFGEGCIYLDMDMILTGKLGTIYAPDGISMHVDRRNDSVNIENSAIIVNRSNHPALLEGLSFMHSKVDAHPYYDGLGKGVKKYFNFTPLHNYNHFCDFIEFNHPNIIMNTSQYTCSSW 326 T 2.5E-05 Glyco_transf_88 pdbhh F Bacteria T 5h60 1 A A Q9L9J3_SALTY Transferase MIPPLNRYVPALSKNELVKTVTNRDIQFTSFNGKDYPLCFLDEKTPLLFQWFERNPARFGKNDIPIINTEKNPYLNNIIKAATIEKERLIGIFVDGDFFPGQKDAFSKLEYDYENIKVIYRNDIDFSMYDKKLSEIYMENISKQESMPEEKRDCHLLQLLKKELSDIQEGNDSLIKSYLLDKGHGWFDFYRNMAMLKAGQLFLEADKVGCYDLSTNSGCIYLDADMIITEKLGGIYIPDGIAVHVERIDGRASMENGIIAVDRNNHPALLAGLEIMHTKFDADPYSDGVCNGIRKHFNYSLNEDYNSFCDFIEFKHDNIIMNTSQFTQSSWARHVQ 336 T 2.2E-05 Glyco_transf_88 pdbhh F Bacteria T 5h61 1 A,B A,B Q8ZNP4_SALTY Transferase MARFNAAFTRIKIMFSRIRGLISCQSNTQTIAPTLSPPSSGHVSFAGIDYPLLPLNHQTPLVFQWFERNPDRFGQNEIPIINTQKNPYLNNIINAAIIEKERIIGIFVDGDFSKGQRKALGKLEQNYRNIKVIYNSDLNYSMYDKKLTTIYLENITKLEAQSASERDEVLLNGVKKSLEDVLKNNPEETLISSHNKDKGHLWFDFYRNLFLLKGSDAFLEAGKPGCHHLQPGGGCIYLDADMLLTDKLGTLYLPDGIAIHVSRKDNHVSLENGIIAVNRSEHPALIKGLEIMHSKPYGDPYNDWLSKGLRHYFDGSHIQDYDAFCDFIEFKHENIIMNTSSLTASSWR 348 T 2.1E-05 Glyco_transf_88 pdbhh F Bacteria T 5h7g 2 C,D C,D F1324 peptide LWYTDIRMSWRVP 13 T 7.8 OTT_1508_deam pdbhh F T 5h7y 2 B B Q9I3K2_PSEAE Uncharacterized protein DDLFASIGALWTWAWRGPKARQELLKA 27 T 9.5 Lentiviral_Tat pdbhh F Bacteria T 5hau 58 FB,LD 1x,2x CTHL3_BOVIN BACTENECIN-7,BAC7,PR-59 RRIRPRPPRLPRPRPRPLPFPRPGPRPIPRPLPFP 35 T 0.027 TonB_N unppercent F Eukaryota T 5haw 3 D,E L,K FTSZ_ECOLI FtsZ CTT DYLDIPAFLR 10 T 1.8 Duffy_binding pdbhh F Bacteria T 5hax 2 B B NUP53_CHATD NUCLEAR PORE PROTEIN NUP53 SQDDEFCRVIPTVRKAKLLPMEEALLPAPTFTQ 33 T 16 FtsX_ECD pdbhh F Eukaryota T 5hb0 2 E,F,G,H E,F,G,H NU145_CHATD NUCLEAR PORE PROTEIN NUP145 SHKKLVINKDMRTDLFSPPNKD 22 T 5.2 MethyTransf_Reg pdbhh F Eukaryota T 5hb3 2 B,D B,D NUP53_CHATD NUCLEAR PORE PROTEIN NUP53 SQQDGSLRSRKANLETGAFGKSTRRTRSKAATPAKREDPTIAAADKIFSNWLASQ 55 T 23 DUF5830 pdbhh F Eukaryota T 5hcc 4 D D C5I3_DERAN Dermacentor andersoni RaCI3 GPMSGESQSIQRKGQCEEVICHRKLNHLGERVTSGCPTGCLCVIREPDNVDNANGTCYALMSSTTTTTTTPDGTTTSEEEE 81 T 0.05 UPAR_LY6_2 pdbpercent F Eukaryota T 5hcd 4 D D C5I2_RHIMP Rhipicephalus microplus RaCI2 GPMEEANTTPISVKDQCANVTCRRTVDNRGKRHIDGCPPGCLCVLKGPDSKDNLDGTCYLLATTPKSTTTSTEQSFNMEE 80 T 0.061 CBM_19 pdb F Eukaryota T 5hce 4 D D C5I1_RHIAP Rhipicephalus appendiculatus RaCI1 GPMEEVKTTPIPNHQCVNATCERKLDALGNAVITKCPQGCLCVVRGASNIVPANGTCFQLATTKPPMAPGDNKDNKEEESN 81 T 0.0095 UPAR_LY6_2 unppercent F Eukaryota T 5hcp 55 CB,FD 1z,2z MK1_PALPR METALNIKOWIN I VDKPDYRPRPRPPNM 15 T 2.3 Toxin_33 pdbhh F Eukaryota T 5hcq 55 CB,FD 1z,2z Oncocin d15-19 VDKPPYLPRPRPPR 14 T 0.11 Apidaecin pdbhh F T 5hda 2 C,D B,D EBNA2_EBVB9 EBV NUCLEAR ANTIGEN 2 SMPELSPVL 9 T 1.8 Fapy_DNA_glyco pdbhh T Viruses T 5hdt 2 C E WAPL_HUMAN FRIEND OF EBNA2 PROTEIN,WAPL COHESIN RELEASE FACTOR MTSRFGKTYSRKGGNGSSKFDEVFSNKRTTLST 33 T 0.21 BRCT_assoc pdbhh F Eukaryota T 5hhm 3 C,H C,H M1-F5L, GILGLVFTL GILGLVFTL 9 T 0.72 Asp4 pdbhh F T 5hho 5 E C M1-G4E, GILEFVFTL GILEFVFTL 9 T 24 Cas9_PI2 pdbhh F T 5hhq 3 C C M1-L3W, GIWGFVFTL GIWGFVFTL 9 T 2 Tr-sialidase_C pdbhh F T 5hhv 3 D I IL-17A peptide inhibitor XIHVTIPADLWDWINK 16 T 0.59 VapB_antitoxin pdbhh F T 5hi8 1 A,B A,B E3SMK9_9CAUD CPET NSMIDKFCDWFEGEFDNWTQAASNPTKWAHIIVKHEKISEYKYHTSSRYSYMDKPYREQTVDIEYVCPELIIVHNPACDIIFKWTGIYFEGESEPDCQWNGQPLDSKARLYADEYHTWDVGYWEGSEGFFHFKKNV 136 T 3E-16 CpeT pdbpercent T Viruses T 5hit 2 B B KCNH1_MOUSE Potassium voltage-gated channel subfamily H member 1 APLILPPDHPVRRLFQR 17 T 1.2 DUF4196 pdbhh F Eukaryota T 5hkh 2 B B UBA5_HUMAN ASP-ASN-GLU-TRP-GLY-ILE-GLU-LEU-VAL DNEWGIELV 9 T 2.3 LT-IIB pdbhh F Eukaryota T 5hkp 2 C,D C,D TERF1_HUMAN NIMA-INTERACTING PROTEIN 2,TTAGGG REPEAT-BINDING FACTOR 1,TELOMERIC PROTEIN PIN2/TRF1 SHMAEDVSSAAPSPRGCADGRDADPTEEQMAETERNDEEQFECQELLECQVQVGAPE 57 T 19 VCX_VCY pdbhh F Eukaryota T 5hky 2 B B SPY2_HUMAN SPRY-2 XQQVHVLSLDQIRAIRNTNEXTEGPT 26 T 3.3 KAR9 unp F Eukaryota T 5hog 2 D,E D,E DNA2_YEAST Dna2p SLRNIDDILDDIEGDLT 17 T 5.4 RMP pdbhh F Eukaryota T 5hoi 2 D,E,F D,E,F TOF2_YEAST Topoisomerase 1-associated factor 2 SHAKDVKIQETIRKLNRFKPT 21 T 2.2 DUF5611 pdbhh F Eukaryota T 5hpm 3 E,F E,F Cyclic amidated, acetylated linked meditope XQFDLSTRRLK 11 T 11 DUF4180 pdbhh F T 5hpp 1 A A ORN-THR-ILE-ALA-MAA-LEU-LEU-SER-ORN-SER-PHI-SER-THR-THR-ALA-VAL XTIAXLLSXSXSTTAV 16 T 4.6 PAGK pdbhh F T 5hq8 2 C,D I,J M3K2_HUMAN MEKK2 peptide YDNPIFEKFGKGGTYX 16 T 3.8 Thrombin_light pdbhh F Eukaryota T 5hs5 1 A,B A,B SARX_STAA8 STAPHYLOCOCCAL ACCESSORY REGULATOR X ETLLGFYKQYKALSEYIDKKYKLSLNDLAVLDLTMKHCKDEKVLMQSFLKTAMDELDLSRTKLLVSIRRLIEKERLSKVRSSKDERKIYIYLNNDDISKFNALFEDVEQFLNILEHHHHHH 121 T 0.00054 AphA_like unphh F Bacteria T 5hsz 2 C K FTSZ_ECOLI C-terminal Tail of FtsZ LDIPAFLRKQA 11 T 0.8 Drc1-Sld2 pdbhh F Bacteria T 5htb 2 B B ARC-3353 inhibitor ARKKQTAX 8 T 15 DHHA2 pdbhh F T 5hu6 4 D D Q581F2_TRYB2 Haptoglobin-hemoglobin receptor GLKTKDEVEKACHLAQQLKEVSITLGVIYRTTERHSVQVEAHKTAIDKHADAVSRAVEALTRVDVALQRLKELGKANDTKAVKIIENITSARENLALFNNETQAVLTARDHVHKHRAAALQGWSDAKEKGDAAAEDVWVLLNAAKKGNGSADAKAAAEKCSRYSSSSTSETELQKAIDAAANVGGLSAHKSKYGDVLNKFKLSNASVGAVRDTSGRGGKHMEKVNNVAKLLKDAEVSLAAAAAEIEEVKNAHETKVQEEM 260 T 8.4E-05 GARP unphh F Eukaryota T 5huw 2 B,C A,B TRM3_HHV11 HSV1 large terminase NLS GPPKKRAKVDVA 12 T 4.7 DUF4611 pdbhh T Viruses T 5huy 2 B,C B,A D3YRZ5_HCMVO HCMV small terminase VSRRVRATRKRPRRAS 16 T 5.6 DUF2569 pdbhh T Viruses T 5hx2 1 A A BP07_BPT4 GENE PRODUCT 7, GP7 MTVKAPSVTSLRISKLSANQVQVRWDDVGANFYYFVEIAETKTNSGENLPSNQYRWINLGYTANNSFFFDDADPLTTYIIRVATAAQDFEQSDWIYTEEFETFATNAYTFQNMIEMQLANKFIQEKFTLNNSDYVNFNNDTIMAALMNESFQFSPSYVDVSSISNFIIGENEYHEIQGSIQQVCKDINRVYLMESEGILYLFERYQPVVKVSNDKGQTWKAVKLFNDRVGYPLSKTVYYQSANTTYVLGYDKIFYGRKSTDVRWSADDVRFSSQDITFAKLGDQLHLGFDVEIFATYATLPANVYRIAEAITCTDDYIYVVARDKVRYIKTSNALIDFDPLSPTYSERLFEPDTMTITGNPKAVCYKMDSICDKVFALIIGEVETLNANPRTSKIIDSADKGIYVLNHDEKTWKRVFGNTEEERRRIQPGYANMSTDGKLVSLSSSNFKFLSDNVVNDPETAAKYQLIGAVKYEFPREWLADKHYHMMAFIADETSDWETFTPQPMKYYAEPFFNWSKKSNTRCWINNSDRAVVVYADLKYTKVIENIPETSPDRLVHEYWDDGDCTIVMPNVKFTGFKKYASGMLFYKASGEIISYYDFNYRVRDTVEIIWKPTEVFLKAFLQNQEHETPWSPEEERGLADPDLRPLIGTMMPDSYLLQDSNFEAFCEAYIQYLSDGYGTQYNNLRNLIRNQYPREEHAWEYLWSEIYKRNIYLNADKRDAVARFFESRSYDFYSTKGIEASYKFLFKVLYNEEVEIEIESGAGTEYDIIVQSDSLTEDLVGQTIYTATGRCNVTYIERSYSNGKLQWTVTIHNLLGRLIAGQEVKAERLPSFEGEIIRGVKGKDLLQNNIDYINRSRSYYVMKIKSNLPSSRWKSDVIRFVHPVGFGFIAITLLTMFINVGLTLKHTETIINKYKNYKWDSGLPTEYADRIAKLTPTGEIEHDSVTGEAIYEPGPMAGVKYPLPDDYNAENNNSIFQGQLPSERRKLMSPLFDASGTTFAQFRDLVNKRLKDNIGNPRDPENPTQVKIDE 1032 T 0.033 fn3 pdb T Viruses T 5hyn 5 E,J,O,T E,J,P,U JARD2_HUMAN JARID2 K116me3 RLQAQRKFAQSQ 12 T 25 DUF4395 pdbhh F Eukaryota T 5hyp 2 B B W0T1Y4_STRPY M28 protein GPGSAESPKSTETSANGADKLADAYNTLLTEHEKLRDEYYTLIDAKEEEPRYKALRGENQDLREKEGKYQDKIKKLEEKEKNLEKKSEDVERHYLKKLDQEHKE 104 T 0.0036 TMF_DNA_bd pdbpssm F Bacteria T 5hyq 3 E,F E,F Amidated meditope CQFDLSTRRLKX 12 T 3.1 Flavi_NS1 pdbhh F T 5hyu 1 A A M21_STRPY M protein, serotype 2.1 GPGSNSKNPVPVKKEAKLSEAELHDKIKNLEEEKAELFEKLDKVEEEHKKVEEEHKKDHEKLEKKSEDVERHYLRQLDQEYKEQQERQKNLEELERQSQREVEKR 105 T 0.0091 APG6_N pdb F Bacteria T 5hyx 2 B A RGF1_ARATH PTR-SER-ASN-PRO-GLY-HIS-HIS-PRO-HYP-ARG-HIS-ASN DXSNPGHHPXRHN 13 T 22 Pterin_4a pdbhh F Eukaryota T 5hz0 2 B A RGF2_ARATH ASP-PTR-TRP-LYS-PRO-ARG-HIS-HIS-PRO-HYP-ARG-ASN-ASN DXWKPRHHPXRNN 13 T 13 Metal_hydrol pdbhh F Eukaryota T 5hz1 2 B A RGF3_ARATH ASP-PTR-TRP-ARG-ALA-LYS-HIS-HIS-PRO-HYP-LYS-ASN-ASN DXWRAKHHPXKNN 13 T 0.16 N_formyltrans_C pdbpercent F Eukaryota T 5hz3 2 B A RGF5_ARATH ASP-PTR-PRO-LYS-PRO-SER-THR-ARG-PRO-HYP-ARG-HIS-ASN DXPKPSTRPXRHN 13 T 6.6 DUF2101 pdbhh F Eukaryota T 5hzp 1 A,B C,A M49_STRP9 M protein, serotype 49 GPGSAEKKVEAKVEVAENNVSSVARREKELYDQIADLTDKNGEYLERIGELEERQKNLEKLEHQSQVAADKHYQEQAKKHQEYKQEQEER 90 T 0.026 HCR unphh F Bacteria T 5hzy 1 A A Q5ZUV9_LEGPH Uncharacterized protein RavZ MHHHHHHENLYFQGSSIYPPETSWEVNKGMNSSRLHKLYSLFFDKSSAFYLGDDVSVLEDKPLTGAYGFQSKKNDQQIFLFRPDSDYVAGYHVDAKSDAGWVNDKLDRRLSEISEFCSKATQPATFILPFVEMPTDITKGVQHQVLLTISYDPKSKQLTPTVYDSIGRDTYSESLSSYFKGKYRTTCDEILTQSIEKAIKSTDFTLGKFTRAAYNHQNRLTEGNSGSYTFRTIKEVISSSAQGTEVKIPGSGYITSNSYLTSQHVQDIESCIKYRNLGVVDIESALTEGKTLPVQLSEFIVALEDYGKLRSQQSEKSMLNFIGYSKTAKLTAVELLIGILNDIKGKNEISESQYDKLVKEVDCLMDSSLGKLVQFHLKNLGAESLQKLVLPCVKFDDTIDDFVTIEKDELFDVPDITGEELASKKGIEQGALDKEALLKQKQIKTDLLDLREEDKTGLKKPLHGGIKVK 469 T 16 Crr6 pdbhh F Bacteria T 5i22 2 B B POLN_CHIKS CHIKV nsP3 peptide STVPVAPPRRRRGRNLT 17 T 2.4 HCV_NS5a_C pdbhh T Viruses T 5i25 2 B B KNG1_HUMAN ASN-PRO-ILE-SER-ASP-PHE-PRO-ASP NPISDFPD 8 T 1.6 Ku_PK_bind pdbhh F Eukaryota T 5i2i 3 E,F E,F Meditope GQQDLSTRRLKG 12 T 12 DUF262 pdbhh F T 5i4q 1 A A CDIA_ECONC CDIA LSYLGIGKKISFDGDFYTVDGMKFSKSYYEKLWEQGRPAPFVQAREVLNSNPKIEPDPRGAPGYLRYEGAGLEMIYNPKTGQVGHIQPVKVK 92 T 2.1 SspH pdbhh F Bacteria T 5i4q 2 B B CDII_ECONC CDII MDIWPEFQRDLEMYRDVVLSIKRNLRLYEECIESLVHQIGSTNFDNAQPLFDDLFRMQSELATMLYKYEYKPGKRIQDLIYHLDRDDFYSRKYWHKKFSDGLAWPEAGHHHHHH 114 T 0.0028 DUF4041 unppercent F Bacteria T 5i6a 1 A,B A,B ALA-PHE-GLY-LYD-VAL-PHE-PRO-GLN-ALA-GLY AFGXVFPQAG 10 T 5.5 TraW_N pdbhh F T 5i7p 1 A A FKB1A_HUMAN;SLYD_ECOLI PPIASE FKBP1A,12 KDA FK506-BINDING PROTEIN,FKBP-12,CALSTABIN-1,FK506-BINDING PROTEIN 1A,FKBP-1A,IMMUNOPHILIN FKBP12,ROTAMASE,PPIASE,HISTIDINE-RICH PROTEIN,METALLOCHAPERONE SLYD,ROTAMASE,SENSITIVITY TO LYSIS PROTEIN D,WHP,PPIASE FKBP1A,12 KDA FK506-BINDING PROTEIN,FKBP-12,CALSTABIN-1,FK506-BINDING PROTEIN 1A,FKBP-1A,IMMUNOPHILIN FKBP12,ROTAMASE GVQVETISPGDGRTFPKRGQTAVVHYTGMLEDGKKFDSSRDRNKPFKFMLGKQEVIRGWEEGVAQMSVGQRAKLTISPDYAYGQYDENLVQRVPKDVFMGVDELQVGMRFLAETDQGPVPVEITAVEDDHVVVDGNHMLAGQNLVFDVELLKLEAHHHHHH 161 T 3.9E-16 FKBP_C unppercent F Bacteria T 5i7q 1 A A FKB1A_HUMAN;FKBX_ECOLI PPIASE FKBP1A,12 KDA FK506-BINDING PROTEIN,FKBP-12,CALSTABIN-1,FK506-BINDING PROTEIN 1A,FKBP-1A,IMMUNOPHILIN FKBP12,ROTAMASE,PPIASE,ROTAMASE,PPIASE FKBP1A,12 KDA FK506-BINDING PROTEIN,FKBP-12,CALSTABIN-1,FK506-BINDING PROTEIN 1A,FKBP-1A,IMMUNOPHILIN FKBP12,ROTAMASE GVQVETISPGDGRTFPKRGQTAVVHYTGMLEDGKKFDSSRDRNKPFKFMLGKQEVIRGWEEGVAQMSVGQRAKLTISPDYAYGVPSPDLIQYFSRREFMDAGEPEIGAIMLFTAMDGSEMPGVIREINGDSITVDFNHPLAGQTLVFDVELLKLEAHHHHHH 162 T 2.3E-18 FKBP_C unppercent F Bacteria T 5i8c 3 C C Q2N0S7_9HIV1 HIV-1 Clade A BG505 Fusion Peptide (residue 512-520) AVGIGAVFL 9 T 2.2 OAD_gamma pdbhh T Viruses T 5iay 2 B B UHRF1_HUMAN Spacer TGKGKWKRKSAGGGPS 16 T 6.1 Ribosomal_L35p pdbhh F Eukaryota T 5ibo 1 A,B A,B LUCI_OPLGR 19KOLASE SDNMVFTLEDFVGDWRQTAGYNLDQVLEQGGVSSLFQNLGVSVTPIQRIVLSGENGLKIDIHVIIPYEGLSGDQMGQIEKIFKVVYPVDDHHFKVILHYGTLVIDGVTPNMIDYFGRPYEGIAVFDGKKITVTGTLWNGNKIIDERLINPDGSLLFRVTINGVTGWRLCERILA 174 T 0.0086 Lipocalin_7 pdbhh F Eukaryota T 5icn 3 C C GLY-ALA-6A0-ARG-HIS LGKGGAXRH 9 T 18 PNPase_C pdbhh F T 5icx 3 E,F E,F Meditope XQFDLSTRRLRCGGSK 16 T 3 zf-CDGSH pdbhh F T 5icy 3 E,F E,F Meditope SQFDLSTRRLKS 12 T 12 Lambda_CIII pdbhh F T 5icz 3 E,F E,F Meditope GQFDLSTRRLKG 12 T 6.5 Pet127 pdbhh F T 5id0 3 E,F E,F Cyclic meditope QFDLSTRRLKX 11 T 7.9 AAA_lid_8 pdbhh F T 5id1 3 E,F E,F Meditope XQFDLSTRRLKC 12 T 6.5 DUF5947 pdbhh F T 5iec 1 A A C5I2_RHIMP RaCI2 GPMEEANTTPISVKDQCANVTCRRTVDNRGKRHIDGCPPGCLCVLKGPDSKDNLDGTCYLLATTPKSTTT 70 T 0.0052 CBM_19 pdbpercent F Eukaryota T 5ieh 3 C C INCE_HUMAN Inner centromere protein REFSKEPEL 9 T 37 DUF3966 pdbhh F Eukaryota T 5igo 2 B,D,F,H U,V,W,X TRIB1_HUMAN TRB-1,G-PROTEIN-COUPLED RECEPTOR-INDUCED GENE 2 PROTEIN,GIG-2,SKIP1 SDQIVPEY 8 T 9 Cryptochrome_C pdbhh F Eukaryota T 5igq 2 B U TRIB1_HUMAN TRB-1,G-PROTEIN-COUPLED RECEPTOR-INDUCED GENE 2 PROTEIN,GIG-2,SKIP1 SDQIVPEYQED 11 T 27 DUF4851 pdbhh F Eukaryota T 5ih2 2 C,D M,N ABL1_MOUSE Proline rich Peptide XYEKPALPRKRX 12 T 4 DUF5972 pdbhh F Eukaryota T 5ii6 1 A A ZP2_MOUSE ZONA PELLUCIDA GLYCOPROTEIN 2,ZP-2,ZONA PELLUCIDA PROTEIN A VSLPQSENPAFPGTLICDKDEVRIEFSSRFDMEKWNPSVVDTLGSEILSCTYALDLERFVLKFPYETCTIKVVGGYQVNIRVGDTTTDVRYKDDMYHFFCPAIQLEHHHHHH 112 T 4.8 DUF5374 pdbhh F Eukaryota T 5ijh 1 A,B A,B XPR1_HUMAN PROTEIN SYG1 HOMOLOG,XENOTROPIC AND POLYTROPIC MURINE LEUKEMIA VIRUS RECEPTOR X3,X-RECEPTOR, MKFAEHLSAHITPEWRKQYIQYEAFKDMLYSAQDQAPSVEVTDEDTVKRYFAKFEEKFFQTCEKELAKINTFYSEKLAEAQRRFATLQNELQSSLDAQKESTGVTTLRQRRKPVFHLSHEERVQHRNIKDLKLAFSEFYLSLILLQNYQNLNFTGFRKILKKHDKILETSRGADWRVAHVEVAPFYTCKKINQLISETEAVVTNELEHHHHHH 213 T 7.2E-16 SPX pdb F Eukaryota T 5ikf 2 B B CLR1_SCHPO Cryptic loci regulator protein 1 MASMTGGQQMGPFLTPDNIASSILYSTASFSRSKPDRPRLNLSLELKLMQNELNKGQLKKQFKGDLRNLADWNNLSLVSSKFPSLPITNLRPDGSFLKHRRFNEEIAYNRQTLEKAIKQLDLSPDKVIQLREQNGVAVNGRVCYPTRNKHSEISA 155 T 4.1 Lyase_catalyt pdbhh F Eukaryota T 5ikj 2 B B CLR1_SCHPO Cryptic loci regulator protein 1 SSLLSRLTQSNQSKDKIIAALAKRNVYKSFAGLYDSKGKNDNTGYDFDSNYARVGRHGSFILPVSKSVPTPSLLIEGSIVQRKNIKIE 88 T 14 ESP pdbhh F Eukaryota T 5io3 1 A A Q5ZUV9_LEGPH Uncharacterized protein RavZ MKGKLTGKDKLIVDEFEELGEQESDIDEFDLLEGDEKLPGDSELDKTTSIYPPETSWEVNKGMNSSRLHKLYSLFFDKSSAFYLGDDVSVLEDKPLTGAYGFQSKKNDQQIFLFRPDSDYVAGYHVDAKSDAGWVNDKLDRRLSEISEFCSKATQPATFILPFVEMPTDITKGVQHQVLLTISYDPKSKQLTPTVYDSIGRDTYSESLSSYFKGKYRTTCDEILTQSIEKAIKSTDFTLGKFTRAAYNHQNRLTEGNSGSYTFRTIKEVISSSAQGTEVKIPGSGYITSNSYLTSQHVQDIESCIKYRNLGVVDIESALTEGKTLPVQLSEFIVALEDYGKLRSQQSEKSMLNFIGYSKTAKLTAVELLIGILNDIKGKNEISESQYDKLVKEVDCLMDSSLGKLVQFHLKNLGAESLQKLVLPCVKFDDTIDDFVTIEKDELFDVPDITGEELASKKGIEQGALDKEALLKQKQIKTDLLDLREEDKTGLKKPLHGGIKVK 502 T 18 DUF438 pdbhh F Bacteria T 5ioo 1 A,B A,B A0A1L1QK08_9ARCH AvpA MEINRKQAKEFYNSDMATALESCQKYGHALFMPELIDAKILATKGSSLLSNWLTAPSIRATGRTKQGNPVVVYVHVDNYLSNPENIRNAERINGAGVMPVDEFQRLLDLGDNKNVFVIDYDKLKSSSSGVIPVERALEHPQTIPFIGGEERAQRYLEKFKQVYGNNIGIWHCDDLKDEPLGRLLFVGDYCNNGLIGNYGIGNYARFVGVRGSASAEGTAQKISAPTIEQILKVSKNFVPKATRKEYENKIKALYK 255 T 0.093 Transglut_C pdb F Archaea T 5iop 3 E,F E,F Meditope variant GQXDLSTRRLKG 12 T 6.1 Rit1_C pdbhh F T 5ip7 13 M Q T2FA_YEAST PHE-ILE-LYS-ARG-ASP-ARG-MET-ARG-ARG-ASN-PHE-LEU-ARG-MET-ARG FIKRDRMRRNFLRMR 15 T 2.2 DUF5928 pdbhh F Eukaryota T 5ipy 1 A,B A,B A3SLM3_ROSNI Flavin-containing monooxygenase MTKRVAVIGAGPSGLAQLRAFQSAADQGAEIPEIVCFEKQANWGGLWNYTWRTGLDENGEPVHCSMYRYLWSNGPKEGLEFADYSFEEHFGKQIASYPPRAVLFDYIEGRVHKADVRKWIRFNSPVRWVSYDAETAKFTVTAHNHETDSTYSAAFDHVICASGHFSTPNVPFYEGFDTFNGRIVHAHDFRDAREFEGKDVLVMGASYSAEDIGSQCWKYGAKSITSCYRSAPMGYAWPDNWEEKPALEKLTGKTAHFADGSTRDVDAIILCTGYKHFFSFLPDDLRLKTANRLATADLYKGVAYVHNPAMFYLGMQDQWFTFNMFDAQAWWVRDAILGRITLPKDKAAMLADVAERETREEASDDVKYAIRYQADYVKELVAETDYPSFDIDGACDAFFEWKKHKAKDIMAFRDNSYKSVITGTMAPVHHTPWKEALDDSMEAYLQNHHHHHH 453 T 4.1E-11 FMO-like unppercent F Bacteria T 5iq4 1 A,B A,B A3SLM3_ROSNI Flavin-containing monooxygenase MTKRVAVIGAGPSGLAQLRAFQSAADQGAEIPEIVCFEKQANWGGLWNYTWRTGLDENGEPVHCSMYRYLWSNGPKEGLEFADYSFEEHFGKQIASYPPRAVLFDYIEGRVHKADVRKWIRFNSPVRWVSYDAETAKFTVTAHNHETDSTYSEDFDHVICASGHFSTPNVPFYEGFDTFNGRIVHAHDFRDAREFEGKDVLVMGASSSAEDIGSQCWKYGAKSITSCYRSAPMGYAWPDNWEEKPALEKLTGKTAHFADGSTRDVDAIILCTGYKHFFSFLPDDLRLKTANRLATADLYKGVAYVHNPAMFYLGMQDQWFTFNMFDAQAWWVRDAILGRITLPKDKAAMLADVAERETREEASDDVKYAIRYQADYVKELVAETDYPSFDIDGACDAFFEWKKHKAKDIMAFRDNSYKSVITGTMAPVHHTPWKEALDDSMEAYLQNHHHHHH 453 T 4.1E-11 FMO-like unppercent F Bacteria T 5ir0 1 A,B A,B M1Q7T5_VIBCL Uncharacterized protein ORF19 GMYTNTIIKTEIDEKVIKAFKLDALTRSKLFFKLTTKLAVPFAGVIDGAFSADRSLVSASVASLLSQHLDQETFEETQLILFGSIVEDGEALATPEAINKWFEYNDVNPMDLFVWLVDENLVTLFKGSKQLQSLKPKFDEFYKKFEDFIPQTVISDDKAEE 161 T 0.00039 Phage_TAC_9 unphh F Bacteria T 5iri 1 A,B A,B BRSK1_MOUSE SERINE/THREONINE-PROTEIN KINASE SAD-B MKRSWFGNFISLDKEEQIFLVLKDKPLSSIKADIVHAFLSIPSLSHSVLSQTSFRAEYKASGGPSVFQKPVRFQVDISSSEGPEPSPRRDGSSGGGIYSVTFTLISGPSRRFKRVVETIQAQLLSTHDQLEHHHHHH 137 T 0.016 KA1 pdbpssm F Eukaryota T 5irx 2 E,F E,F DKTX_HAPSC TAU-TRTX-HS1A, DOUBLE-KNOT TOXIN, DKTX DCAKEGEVCSWGKKCCDLDNFYCPMEFIPHCKKYKPYVPVTTNCAKEGEVCGWGSKCCHGLDCPLAFIPYCEKYR 75 T 0.079 Conotoxin_I2 unp F Eukaryota T 5itz 3 C D CENPJ_HUMAN CENP-J,CENTROSOMAL P4.1-ASSOCIATED PROTEIN,LAG-3-ASSOCIATED PROTEIN,LYST-INTERACTING PROTEIN 1 MAHHHHHHGSLVPRGSAQKHDDSSEVANIEERPIKAAIGERKQTFEDYLEEQIQLEEQELKQKQLKEAEGPLPIKAKPKQPFLKRGEGLARFTNAKSKFQKGKESKLVTNQSTSEDQPLFKMDRQQLQR 129 T 0.64 DUF1654 unppssm F Eukaryota T 5iue 3 I,J,K,L K,L,M,N Peptide LEU-ILE-LEU-ARG-TRP-GLU-GLN-ASP LILRWEQD 8 T 0.94 DUF1246 pdbhh F T 5iv2 3 E,F E,F Meditope variant GQFDLSTRXLKG 12 T 6.5 Pet127 pdbhh F T 5ivn 2 B B G9GAG7_HUMAN Cadherin derived peptide XDRKAAVSHWQX 12 T 0.15 DUF2288 unp F Eukaryota T 5ivz 3 E,F E,F Meditope variant GQFDLSTXRLKG 12 T 6.5 Pet127 pdbhh F T 5ix9 1 A A A1YIY2_9GAMM Antifreeze protein MSDNQFPFATLGNAIGFITKLDGSVTVQSINGQERVLKLGDPIFFGETVLTGGSGSVTIAFVDGTDVVIGGDSIVEMTDEIYNTGDNEDLVADSSSEIDALQNAILAGDDPTLIQDAPAAGNTLADQQRVDVSIERNDNSAQAGFGVDTQSSLPTYGYDTDNGNGGQATEREYSAPSLSRTLNQSPLLEHHHHHH 195 T 0.0019 FecR pdbpercent F Bacteria T 5ixf 2 B B STABP_HUMAN STAM-binding protein AKPPVVDRSLKPGA 14 T 9.4 DUF1681 pdbhh F Eukaryota T 5ixq 2 B B IDA_ARATH PROTEIN INFLORESCENCE DEFICIENT IN ABSCISSION PIPPSAPSKRHN 12 T 0.67 Disulph_isomer pdbhh F Eukaryota T 5ixt 2 B B IDA_ARATH PROTEIN INFLORESCENCE DEFICIENT IN ABSCISSION YPKGVPIPPSAPSKRHN 17 T 2.7 Disulph_isomer pdbhh F Eukaryota T 5iy4 2 B,D,F B,D,F SPRTN_HUMAN DVC1 PIP box SNSHQNVLSNYFPRVS 16 T 11 FAD_SOX pdbhh F Eukaryota T 5iyv 2 B B IDL1_ARATH Protein IDA-LIKE 1 LVPPSGPSMRHN 12 T 0.021 Sperm_Ag_HE2 unp F Eukaryota T 5iyx 2 B B IDA_ARATH PROTEIN INFLORESCENCE DEFICIENT IN ABSCISSION YVPIPPSAPSKRHN 14 T 0.34 Disulph_isomer pdbhh F Eukaryota T 5iz0 2 B,D,F,H C,E,F,H GLU-PHE-PRO-TYR-LEU-LEU-SER-LEU-LEU-GLY-GLU-VAL-SER-PRO-GLN EFPYLLSLLGEVSPQ 15 T 1.6 DUF4576 pdbhh F T 5iz6 2 B B PHQ-ALA-GLY-GLU-ALA-LEU-TYR-GLU-NH2 XAGEALYEX 9 T 30 NYAP_N pdbhh F T 5iz8 2 C,D C,D ACE-ALA-GLY-GLU-ALA-LEU-ALA-ASP-NH2 XAGEALADX 9 T 34 SWI-SNF_Ssr4 pdbhh F T 5iz9 2 B B ACE-GLY-GLY-GLU-ALA-LEU-ALA-ASP-NH2 XGGEALADX 9 T 72 DUF898 pdbhh F T 5iza 2 B B ACE-GLY-GLY-GLU-ALA-LEU-ALA-TRP-NH2 XGGEALAWX 9 T 3.2 Nmad4 pdbhh F T 5ize 1 A,B A,B L_HANTV PROTEIN L,LARGE STRUCTURAL PROTEIN,REPLICASE,TRANSCRIPTASE GMDKYREIHNKLKEFSPGTLTAVECIDYLDRLYAVRHDIVDQMIKHDWSDNKDSEEAIGKVLLFAGVPSNIITALEKKIIPNHPTGKSLKAFFKMTPDNYKISGTTIEFVEVTVTADVDKGIREKKLKYEAGLTYIEQELHKFFLKGEIPQPYKITFNVVAVRTDGSNITTQWPSRRNDG 180 T 0.075 L_protein_N pdbpssm T Viruses T 5j19 2 C,D C,D O96561_DROME PON ESCFTNAAFSSTPKK 15 T 0.26 RskA unppercent F Eukaryota T 5j2y 1 A,B A,B Q9X7H4_PSEAI REGULATORY PROTEIN RSAL,RSAL PROTEIN,UNCHARACTERIZED PROTEIN,VIRULENCE GENE REPRESSOR RSAL MASHERTQPQNMAFRAKATRTARRESQETFWSRFGISQSCGSRFENGENLPFPIYLLLHFYIEGQITDRQLADLRGKIRE 80 T 0.00017 DUF4447 pdbhh F Bacteria T 5j3h 1 A B Peptide S519C16 GSLDESFYDWFERQLG 16 T 0.086 YozE_SAM_like pdbhh F T 5j4a 2 B,D B,D CDII9_BURPE Immunity protein CdiI KMAGSIVISKEVRVPVSTSQFDYLVSRIGDQFHSSDMWIKDEVYLPMEEGGMSFISTESLNSSGLSIFLATVMRARAASQAEESFPLYENVWNQLVEKLRQDARLGVSGNTSLEHHHHHH 120 T 0.15 Adeno_E4_ORF3 pdbpssm F Bacteria T 5j6t 1 A A ALBO1_HYPAB HY-A1 XIFGAIWPLALGALKNLIKX 20 T 4 SH3_7 unphh F Eukaryota T 5j6v 1 A A ALBO1_HYPAB Hylin-D DIFGAIWPLALGALKNLIKX 20 T 2.7 DUF3275 pdbhh F Eukaryota T 5j6w 1 A A ALBO1_HYPAB Hylin-K KIFGAIWPLALGALKNLIKX 20 T 4 SH3_7 unphh F Eukaryota T 5j7j 2 B B DLG4_HUMAN POSTSYNAPTIC DENSITY PROTEIN 95,PSD-95,SYNAPSE-ASSOCIATED PROTEIN 90,SAP90 MDCLCIVTTKKYRYQDEDT 19 T 0.0058 MAGUK_N_PEST unp F Eukaryota T 5j8h 2 B B EF2K_HUMAN EEF-2K,CALCIUM/CALMODULIN-DEPENDENT EUKARYOTIC ELONGATION FACTOR 2 KINASE SPANSFHFKEAWKHAIQKAKHMPDPWA 27 T 5.7 MPLKIP pdbhh F Eukaryota T 5j9q 5 M,N,O L,M,O H2AZ_YEAST Htz1 SGAKDSGSLR 10 T 0.0022 Histone unppercent F Eukaryota T 5jcy 2 B B SPIR2_HUMAN SPIR-2 QRPRPRVLLKAPTLAEMEEMNTSEEEE 27 T 14 DUF5395 pdbhh F Eukaryota T 5jej 1 A,B,C C,D,E STING_HUMAN HSTING,ENDOPLASMIC RETICULUM INTERFERON STIMULATOR,ERIS,MEDIATOR OF IRF3 ACTIVATION,HMITA,TRANSMEMBRANE PROTEIN 173 STVGSLKTSAVPSTSTMSQEPELLISGMEKPLPLRTDWS 39 T 10 Herpes_IE68 pdbhh F Eukaryota T 5jek 2 C,D C,D MAVS_HUMAN MAVS peptide SGCFEDLAISASTSLGWG 18 T 3.6 GRA6 unphh F Eukaryota T 5jel 2 B B TCAM1_HUMAN Phosphorylated TRIF peptide SPASLASNLEISQSPTMPFWS 21 T 17 DUF4675 pdbhh F Eukaryota T 5jg9 1 A,B,C A,B,C de novo design, hyper stable, disulfide-rich mini protein GSEERRYKRCGQDEERVRRECKERGERQNCQYQIRKEGNCYVCEIRC 47 T 18 Rad50_zn_hook pdbhh F T 5jge 1 A,B,D,E A,B,D,E ATG19_YEAST CYTOPLASM-TO-VACUOLE TARGETING PROTEIN 19 GPHMLDNFMKQLLKLEESLNKLELEQKVTNKE 32 T 2.7 NCKAP5 pdbhh F Eukaryota T 5jge 2 C,F C,F AMPL_YEAST Ape1 propeptide GPMEEQREILEQLKKTLQMLTVY 23 T 0.74 PKHD_C pdbhh F Eukaryota T 5jhc 1 A,AA,B,BA,C,CA,D,DA,E,EA,F,FA,G,GA,H,HA,I,IA,J,JA,K,KA,L,LA,M,MA,N,O,P,Q,R,S,T,U,V,W,X,Y,Z A,g,C,P,E,h,G,R,I,i,K,T,M,j,O,V,Q,k,S,X,U,l,W,Z,Y,m,B,a,b,D,F,c,H,d,J,e,L,f,N AMPL_YEAST AMINOPEPTIDASE YSCI,LEUCINE AMINOPEPTIDASE IV,LAPIV,LYSOSOMAL AMINOPEPTIDASE III,POLYPEPTIDASE,VACUOLAR AMINOPEPTIDASE I GPMEEQREILEQLKKTLQMLTVEL 24 T 0.86 PKHD_C pdbhh F Eukaryota T 5jhf 5 I,J I,J C5DB94_LACTC Atg13 17LR LQPFKAGSVGSGS 13 T 0.6 DUF565 pdbhh F Eukaryota T 5jhi 1 A A DE NOVO MINIPROTEIN EHE_06 CKQRRRYRGSEEECRKYAEELSRRTGCEVEVECET 35 T 0.048 Ribosomal_S4 pdb F T 5jhj 1 A A R9RX08_MAGOR Antivirulence protein AVR-Pia APQDNTSMGSSHHHHHHSSGRENLYFQGHMAAPARSCVYYDGHLPATRVLLMYVRIGNTATITARGHEFEVEAKDQNCKVILTNGKQAPDWLAAEPY 97 T 0.012 Pirin_C unppssm F Eukaryota T 5jhq 2 E,F,G,H,I,J,K,L E,F,G,H,I,J,K,L LCAP_HUMAN Peptide derived from insulin-responsive aminopeptidase (IRAP) ATGYRQSPDGACSVPS 16 T 0.0072 GBP_PSP pdb F Eukaryota T 5ji4 1 A A DE NOVO MINIPROTEIN EEHE_02 APCECDVNGETYTVSSSEECERLCRKLGVTNCRVHCG 37 T 0.82 DUF6482 pdbhh F T 5jie 1 A,B,C,D,E B,C,E,D,A E9KNV6_9VIRU Protein delta MPSEDYAIWYARATIAALQAAEYRLAMPSASYTAWFTDAVSDKLDKISESLNTLVECVIDKRLAVS 66 T 0.048 DNMT1-RFD pdb T Viruses T 5jiu 2 C,D C,D DDX4_MOUSE DEAD BOX PROTEIN 4,MVH,VASA HOMOLOG KSETEGGESSDSQGPKVTYI 20 T 26 Polo_box_3 pdbhh F Eukaryota T 5jja 2 C,D C,D BUB1B_HUMAN MAD3/BUB1-RELATED PROTEIN KINASE,HBUBR1,MITOTIC CHECKPOINT KINASE MAD3L,PROTEIN SSK1 GKTSEDQQTACGTIYSQTLSIKKLDPIIEDDREADHSSGFSGSSASVASTSSIKCLQIPEKLELTNETSENPTQS 75 T 0.066 NifU_N pdbpssm F Eukaryota T 5jjz 2 B B H14_HUMAN LYS-LYS-LYS-ALA-ARG-MLY-SER-ALA-GLY-ALA-ALA-LYS-TYR KKKARKSAGAAKY 13 T 0.2 DUF5797 unp F Eukaryota T 5jkq 1 A,B,C,D A,B,D,C C6KSR6_PLAF7 PfVFT1 GSMGVEEVVNNKAKRLIDIYHAAVKELIQNEELIDLIDKHNVDYSVIESIENLPNLADINVKDDIDDVLSEIIKKKEVKIGALKNKNWGIIGNYEQNPPVGFWPDVMYIIWETISKHIFNDEDAINIAYNYYDNVFVALNDKDIHMTDNYFLSNSRLVDQSGNNLPKLTSGLPIIKHSNKIMILKEYNINNLEDLKSYISKNEGLKIACLTEANCNALKNIFLDKVTYDYKSFSSYIDLSKSVLSKSHIIGVISGIPFNFNEHKINVFDSFLKTGHSAYFKAAA 284 T 8.4E-05 SBP_bac_3 unphh F Eukaryota T 5jm4 2 C,D D,E GLN-GLY-MKD-ANG-ASP-MKD-LEU-ASP-LEU-ALA-CLU QGXXDXLDLAX 11 T 120 DUF1797 pdbhh F T 5jmb 1 A,B A,B B3JI28_9BACE Uncharacterized protein SNAVTVDDLVEGIAFSITHDSENPNIVYLKSLMPSSYQVCWQHPQGRSQEREVTLQMPFEGKYEVTFGVQTRGGIVYGNPATFTIDSFCADFVN 94 T 0.0055 ARL6IP6 pdbpercent F Bacteria T 5jnb 2 E,F,G,H E,F,G,H O61711_CAEEL RNP (RRM RNA binding domain) containing TLFDNHPVQQYSGFNPIDFRFDDYVEGAKRFDNLANLIRSSTPTDPFANYQKPCESTSTSRSRTNSAKDQKHGP 74 T 0.056 Toxin_YhaV pdbpercent F Eukaryota T 5jp2 1 A,C E,F EPS15_HUMAN PROTEIN EPS15,PROTEIN AF-1P TNLDFFQSDPFVGSDPFKDDPFGGAGA 27 T 8.5 Taeniidae_ag pdbhh F Eukaryota T 5jpl 1 A A J7LF03_NOCAA Uncharacterized protein GRPNWGFENDWSCVRVC 17 T 5.9 DUF4710 pdbhh F Bacteria T 5jpo 2 E E EF1D_HUMAN EF-1-DELTA,ANTIGEN NY-CO-4 GAMATNFLAHEKIWFDKFKYDDAERRFYEQMN 32 T 5.1 PRR18 pdbhh F Eukaryota T 5jqf 1 A,B A,B A0A1D5B387_SPHAL Sphingopyxin I GIEPLGPVDEDQGEHYLFAGG 21 T 11 SpecificRecomb pdbhh F Bacteria T 5jqz 1 A,B A,B De novo designed homotetramer GSHMGTAIEANSRMLKALIEIAKAIWKALWANSLLLEATSRGDTERMRQWAEEARKIYKEAEKIIDRADEIVEEAKKRHD 80 T 0.19 Dec-1 pdb F T 5jr2 2 E,F,G,H E,F,G,H APYd3 peptide XPYCVYRXSWSCX 13 T 0.7 DUF1684 pdbhh F T 5jte 57 EB B5 ErmBL AVFQMRNVD 9 T 1.4E-05 ErmC pdbhh F T 5jtm 2 E,F,G,H E,F,G,H PPB_ECOLI APASE MKQSTIALALLPLLFTPVTKARTPE 25 T 3.6 Mfp-3 pdbhh F Bacteria T 5jts 1 A A A0A1L1QK12_STRSQ beta-1,4-mannanase GPLGSSACPSGATCGSYTVGGLGSRKQQVRNAGGSSLDLAVAMLETERMDTAYPYGDNKSGDAANFGIFKQNWLMLRSACAQFGGQGAGQYDNGAALNSSLGQDVSCLHQSQSHYGLDAWFAGHRNGASGLSSPNTADIAAYKAAVYWIKAQLDADSANLGNDTRFWVQVPAI 173 T 0.16 Lys pdb F Bacteria T 5ju9 1 A A A0A1L1QK13_9ACTN beta-1,4-mannanase SACPSGATCGSYTVGGLGSRKQQVRNAGGSSLDLAVAMLETERMDTAYPYGDNKSGDAANFGIFKQNWLMLRSACAQFGGQGAGQYDNGAALNSSLGQDVSCLHQSQSHYGLDAWFAGHRNGASGLSSPNTADIAAYKAAVYWIKAQLDADSANLGNDTRFWVQVPAI 168 T 0.15 Lys pdb F Bacteria T 5jub 2 C,D C,D ComS LPYFAGCL 8 T 1.4 IL17R_fnIII_D2 pdbhh F T 5jug 1 A A A0A1L1QK16_9ACTN beta-1,4-mannanase SACPSGATCGSYTVGGLGSRKQQVRNAGGSSLDLAVAMLQTERMDTAYPYGDNKSGDAANFGIFKQNWLMLRSACAQFGGQGAGQYDNGAALNSSLGQDVSCLHQSQSHYGLDAWFAGHRNGASGLSSPNTADIAAYKAAVYWIKAQLDADSANLGNDTRFWVQVPAI 168 T 0.14 Lys pdb F Bacteria T 5jui 1 A,B,C A,B,C A0A0H2URK1_STRPN Cell wall surface anchor family protein HHHHHHSGNTIVNGAPAINASLNIAKSETKVYTGEGVDSVYRVPIYYKLKVTNDGSKLTFTYTVTYVNPKTNDLGNISSMRPGYSIYNSGTSTQTMLTLGSDLGKPSGVKNYITDKNGRQVLSYNTSTMTTQGSGYTWGNGAQMNGFFAKKGYGLTSSWTVPITGTDTSFTFTPYAARTDRIGINYFNGGGKVVESST 198 T 0.41 FlgD_ig pdbpercent F Bacteria T 5jxt 2 I,J,K,T,U,V,W R,S,U,Q,T,V,W A0A0E9NAT8_9ASCO Histone H4 SGRGKGGKGLGKGGAKRHRKI 21 T 84 DUF4196 pdbhh F Eukaryota T 5jzr 1 A,B A,B Q9AZ42_9VIRU Coat protein MANKPMQPITSTANKIVWSDPTRLSTTFSASLLRQRVKVGIAELNNVSGQYVSVYKRPAPKPEGCADACVIMPNENQSIRTVISGSAENLATLKAEWETHKRNVDTLFASGNAGLGFLDPTAAIVSSDTTA 131 T 0.19 Glycoprot_B_PH2 pdbpssm T Viruses T 5k0y 26 Z d IF2B_SACS2 eukaryotic initiation factor 2 subunit Beta (eIF2-Beta) SEKEYVEMLDRLYSKLP 17 T 0.77 DUF6103 pdbhh F Archaea T 5k18 3 E,F F,E Bisubstrate inhibitor XMDSEVAALVID 12 T 0.89 Trm56 pdbhh F T 5k57 1 A A DDI2_HUMAN Protein DDI1 homolog 2 SQQSHSSPGEITSSPQGLDNPALLRDMLLANPHELSLLKERNPPLAEALLSGDLEKFSRVLVEQQQDRARREQERIRLFSADPFDLEAQAKIEEDIRQ 98 T 0.00065 XPC-binding pdbpssm F Eukaryota T 5k58 3 G,H,I,J L,K,N,M FTSZ_ECOLI Octapeptide LDIPAFLR 8 T 4.6 DUF1848 pdbhh F Bacteria T 5k6s 2 B B BUB1B_HUMAN BubR1 TLSIKKLSPIIEDDREADH 19 T 8.3 YwhD pdbhh F Eukaryota T 5k99 2 B,D C,D Microcin C MRTGNAXX 8 T 75 RGM_C pdbhh F T 5kdg 1 A A Q9XC73_SALTM UNCHARACTERIZED PROTEIN,VIRULENCE PROTEIN MTATPQGQIIHHRNFQSLYNNSWVCSLNVIKSRDGNNYSALEDITSDNQAFNNILEGIDIIECENLLKEMNVQKIPESSLFTNIKEALQAEVFNSTVEDNFESFISYELQNHGPLMLIRPSLGSECLHAECIVGYDSEVKKVLIYDSMNTSPEWQSNIDVYDKLTLAFNDKYKNEDCSICGLYYDGVYEPKLEHHHHHH 199 T 0.002 YsaB unppssm F Bacteria T 5kev 1 A A Q87GI4_VIBPA VtrA Protein MTAKDDYPSLSFQQDYVYIFSSDFQLSEELGVALINALSAKEIVPERLYVMLNDKTISFSFISKNKKSKNRVLSTEKKLNYKHISEYIVNEIEY 94 T 0.014 NUP214 pdb F Bacteria T 5kev 2 B B Q87GI3_VIBPA VtrC Protein MGSSHHHHHHSQDPVHFYETSYKYQAADSTYMHDVAINVSIKGNHFTSDIIIRELVKSENKNYYNVIGHGDIIQKNTHQYYLNFDNIDVYTGTNKANMKPYKEPTSISSLINKSNNIRVVYLSEEYVVVEFFFYDGQIITLHRY 144 T 0.17 Gp13-like unppercent F Bacteria T 5kez 2 B B ACE-DTY-PRO-TYR-SER-CYS-TRP-VAL-ARG-HIS-NH2 XXPYSCWVRHX 11 T 1.5 DUF6006 pdbhh F T 5kgf 7 K,N L,K TP53B_HUMAN Tumor suppressor p53-binding protein 1 LTKAADISLDNLVEGKRKRRS 21 T 11 CHZ pdbhh F Eukaryota T 5kgn 2 C,D C,D macrocyclic peptide inhibitor XXDYPGDYCYLYX 13 T 3.8 Gln_deamidase_2 pdbhh F T 5kgq 1 A A Q4DY78_TRYCC Uncharacterized protein GSAMGHMVKISHEDTQRIKTAFLSYAQGQDKVTEAMIDQLICGAFPGLSWEQLQEKKKGRAAANGYDRSAFFSLVASDEQYVRFIAQHFPCAPEEEKPPEIDALELKTQKGF 112 T 0.13 DICT unppercent F Eukaryota T 5khr 15 R S HSL1_YEAST HSL1 peptide NKENEGPEYPTKIEXYLEEQKPKRAALSDITNS 33 T 13 NTS_2 pdbhh F Eukaryota T 5ki0 1 A A K2C6A_HUMAN Antimicrobial peptide KAMP-19 RAIGGGLSSVGGGSSTIKY 19 T 14 DUF4244 pdbhh F Eukaryota T 5kko 1 A,B,C,D,E,F A,B,C,D,E,F Uncharacterised protein SNAMKYFQIDELTLNAMLRITTIESLTPEQRLELIKAHLLNIKTPSDDNEPWDEF 55 T 0.16 DUF6291 unppssm F T 5kkv 1 A A GCN4-p2L XGMKQIEDKIEEILSKIYHIENEIARIKKLIGEGHH 36 T 0.0018 VGPC1_C pdbhh F T 5klc 1 A A A0A0R5P8X1_9BACT Carbohydrate binding module E1 GSHMSASCGSGNFNKTAAKGVEFSAVAGDCIKYNKSSGTLQIGSWTGVASSYNITSGPQGITNTGNGWTTVANAANGDLYIKIVSASRSFNVKFDNW 97 T 0.41 Pox_T4_N unp F Bacteria T 5klh 1 A,B A,B Q26806_9TRYP Surface glycoprotein GSAMGSSDDPRDNFKKAVSAFDPKPLESWTGTFSDVKATVRRQSLSVAGLGSIPSVYTEATVPVSGNTDGSQLVVKVNINTVAPFTRRSPLHATRERWFSCSSSQCSGYSRKCDCQEKHEQFRNKCYSQGGQYSTQSSKCRLGEKCGYCKQEVYLSKLYLVAASDGKGEYRESTQYQSALYSFGHLSQGYEAVPQDKVQVQLYSEGDPFIALERETMGEGEFGVPNRTAAA 231 T 0.0019 Shisa unphh F Eukaryota T 5klr 2 B A Prototypical P4[R]cNLS SKKAGFPAKKRKVEAA 16 T 20 IGR pdbhh F T 5klt 2 B A Prototypical P4[M]cNLS SKKAGFPAKKMKVEAA 16 T 9.7 DUF4543 pdbhh F T 5kmx 1 A,B,C,D A,B,C,D G0UXP9_TRYCI Putative uncharacterized protein TCIL3000_10_9440 GSAMGSSDEPRDDFKEAVNAFNPNPIEKWTGRFNTENASVRRRTLNVPGFKSIPTVYTEATLPLNKDVTDGRLTVVVNINTVQPFTRRTPLRVKREKWYTCSSSQCSGSSSKCDCHRKHDEFRNKCISEGGRYTTESSKCRLGEKCGYCKQNVYLATLYLVAGSVGGGMYRESDKYQSALYPFYDISQGYEPRQPSSVNVRLYSEGDPFIAFQQLTEGREEFGIPNRTVGAAA 233 T 0.001 DUF4106 unphh F Eukaryota T 5knm 4 D N Peptide ILE-LEU-ARG-TRP-GLU-GLN ILRWEQ 6 T 0.72 DUF1216 pdbhh F T 5koa 2 C D FTSZ_ECOLI C-terminal tail of FtsZ DYLDIPAFLRKQ 12 T 1.4 F-box-like_2 pdbhh F Bacteria T 5kpe 1 A A De novo Beta Sheet Design Protein OR664 MQDIVEAAKQAAIAIFQLWKNPTDPEAQELLNKILSPDVLDQVREHARELQKQGIHFEVKRVEVTTDGNTVNVTVELEETTGGTTTNTTYELRFEVDGDTIRRVTVTQNGGSLEHHHHHH 120 T 0.0013 SnoaL_2 pdb F T 5kph 1 A A De novo Beta Sheet Design Protein OR485 MPSEEEEKRQVKQVAKEKLLEQSPNSKVQVRRVQKQGNTIRVELELRTNGKKENYTVEVERQGNTWTVKRITRTVGSLEHHHHHH 85 T 0.00085 DUF3828 pdb F T 5ks5 1 A A EF2K_HUMAN EEF-2K,CALCIUM/CALMODULIN-DEPENDENT EUKARYOTIC ELONGATION FACTOR 2 KINASE GSHMSPDRCQDWLEALHWYNTALEMTDCDEGGEYDGMQDEPRYMMLAREAEMLFTGGYGLEKDPQRSGDLYTQAAEAAMEAMKGRLANQYYQKAEEAWAQMEE 103 T 0.00017 Sel1 unphh F Eukaryota T 5ksa 5 E J GDB0_WHEAT DQ8.5-glia-gamma1 peptide QPQQSFPEQEA 11 T 1.4 DUF3067 pdbhh F Eukaryota T 5ksb 5 I,J I,J GDB0_WHEAT DQ8.5-glia-gamma1 peptide GPQQSFPEQEA 11 T 1.3 DUF3067 pdbhh F Eukaryota T 5kvn 1 A A Designed peptide NC_HEE_D1 NDKCKELKKRYPNCEVRCDXPRYEVHC 27 T 0.31 LPD29 pdbhh F T 5kwn 2 B U HY5_ARATH peptide 16-mer IESDEEIRRVPEFGGEAVG 19 T 0.2 Macoilin unppercent F Eukaryota T 5kwo 1 A A Designed peptide NC_EHE_D1 CQTWRXVSPEECRKYKEEYXCVRCTE 26 T 0.2 zf-CW pdbpssm F T 5kwp 1 A A Designed peptide NC_EEH_D2 TCVECXXVKVCRPDPEEARREAEERCX 27 T 2 Herpes_IE1 pdbhh F T 5kwx 1 A A Designed peptide NC_EEH_D1 CSYTCXPQTYTFPTCEEAKKMKKRC 25 T 0.8 Fer4_5 pdbhh F T 5kwz 1 A A Designed peptide NC_cHH_D1 HDPEKRKECEKKYTDPKKREECKRKA 26 T 0.24 Antimicrobial21 pdb F T 5kx0 1 A A Designed peptide NC_cHh_DL_D1 NPELQRKCKELXTRXXXXXXXXXXSD 26 T 11 DUF3511 pdbhh F T 5kx1 1 A A Designed peptide NC_cHHH_D1 NPEDCRQDPEANKSPEECKKLK 22 T 3.1 DUF1388 pdbhh F T 5kx2 1 A A Designed peptide NC_cEE_D1 PVTWCVRIXPTVRCTVRX 18 T 6.5 SapA pdbhh F T 5l0l 1 A,B A,B Q5ZYD3_LEGPH Uncharacterized protein GKKEFLKHEYSPGHWSIDYTRAGTSIAVITVRNKYHYSVILNPTDCRGYRIIIRYLNEGDSTLSSAFNRPYTVSEQRGLNDVASLMTQVYEKLGLIVQFSQLGNNSQSFDKGTGVTLIGSEEEPSMLHLHMWGRGDPDMEYIAGVPLRGPEPGLMFDLIAKNKTHPINQHAIKWNEEELKACLAMFKLKLAEYVNSPEFTEEFGDTLKVTIHDKK 215 T 0.001 DUF3762 pdbpercent F Bacteria T 5l23 2 B B RPGF1_HUMAN C3G derived peptide XDNSPPPALPKKRQSYX 17 T 8.5 Ribosomal_L32p pdbhh F Eukaryota T 5l3x 1 A A NELFA_HUMAN NELF-A,WOLF-HIRSCHHORN SYNDROME CANDIDATE 2 PROTEIN ESDTGLWLHNKLGATDELWAPPSIASLLTAAVIDNIRLCFHGLSSAVKLKLLLGTLHLPRRTVDEMKGALMEIIQLASLDSDPWVLMVADILKSFPDTGSLNLELEEQNPNVQDILGELREKVGECEASAMLPLECQYLNKNALTTLAGPLTPPVKHFQLKRKPKSATLRAELLQKS 177 T 0.00076 Adaptin_N pdbpssm F Eukaryota T 5l82 1 A A Enterococcin K1 MKFKFNPTGTIVKKLTQYEIAWFKNKHGYYPWEIPRC 37 T 0.016 Psg1 pdb F T 5l85 1 A A ZNHI3_HUMAN HNF-4A COACTIVATOR,THYROID HORMONE RECEPTOR INTERACTOR 3,THYROID RECEPTOR-INTERACTING PROTEIN 3,TRIP-3 GPHMDRVSLQNLKNLGESATLRSLLLNPHLRQLMVNLDQGEDKAKLMRAYMQEPLFVEFADCCLGIVEPSQNEES 75 T 0.00049 STI1 unphh F Eukaryota T 5l85 2 B B NUFP1_HUMAN NUCLEAR FMRP-INTERACTING PROTEIN 1 DIRHERNVILQCVRYIIKKDFFGLDTNSAKSKDV 34 T 0.18 Nup188 unppssm F Eukaryota T 5l9v 2 C,D C,D HIF1A_HUMAN HIF1-ALPHA,ARNT-INTERACTING PROTEIN,BASIC-HELIX-LOOP-HELIX-PAS PROTEIN MOP1,CLASS E BASIC HELIX-LOOP-HELIX PROTEIN 78,BHLHE78,MEMBER OF PAS PROTEIN 1,PAS DOMAIN-CONTAINING PROTEIN 8 DACTLLAPAAGDTIISLCF 19 T 13 DUF5913 pdbhh F Eukaryota T 5lah 1 A A TX121_URTEQ tau-AnmTx Ueq 12-1 CYPGQPGCGHCSRPNYCEGARCESGFHDCGSDHWCDASGDRCCCA 45 T 1.4 TerY_C pdbhh F Eukaryota T 5lb7 3 C C ASPM_MOUSE CALMODULIN-BINDING PROTEIN SHA1,CALMODULIN-BINDING PROTEIN 1,SPINDLE AND HYDROXYUREA CHECKPOINT ABNORMAL PROTEIN LSPDSFLND 9 T 0.89 Cmyb_C pdbhh F Eukaryota T 5lgm 1 A A V5557_BPT7 Fusion protein 5.5/5.7 MSDYLKVLQAIKSCPKTFQSNYVRNNASLVAEAASRGHISCATTSGRNGGAWEITASGTRFLKRMGGCV 69 T 0.012 DUF3116 pdbhh T Viruses T 5lgp 2 E,F,G,H E,F,G,H PABP1_HUMAN POLY(A)-BINDING PROTEIN 1 XFQNMPGAIRPAA 13 T 5.8 DUF1992 pdbhh F Eukaryota T 5lgq 2 E,F,G,H F,E,G,H PABP1_HUMAN POLY(A)-BINDING PROTEIN 1 PAAPRPPFSTM 11 T 19 Spore_YtrH pdbhh F Eukaryota T 5lgr 2 E,F,G,H E,F,G,H PABP1_HUMAN POLY(A)-BINDING PROTEIN 1 FQNMPGAIRPAA 12 T 4.4 DUF1992 pdbhh F Eukaryota T 5lhw 1 A A STIL_HUMAN TAL-1-INTERRUPTING LOCUS PROTEIN GGSLTEQDRQLRLLQAQIQRLLEAQSLM 28 T 1.8 SlyX pdbhh F Eukaryota T 5li1 2 B B Q28E03_XENTR UNCHARACTERIZED PROTEIN LAFQREGFGRQSMSEKRTKQ 20 T 0.088 LamB_YcsF pdbhh F Eukaryota T 5lih 2 C,D F,G KPCE_HUMAN PKC Epsilon pseudo substrate sequence ERMRPFKRQGSVRRRV 16 T 20 NumbF pdbhh F Eukaryota T 5ljn 2 C,D C,D SPAT2_HUMAN SPERMATOGENESIS-ASSOCIATED PROTEIN PD1 DVDLYTDS 8 T 19 Noda_Vmethyltr pdbhh F Eukaryota T 5lm1 2 B B UBAP1_HUMAN UBAP-1 SNIKSLSFPKLDSDDSNQKT 20 T 2 UPF0728 pdbhh F Eukaryota T 5lm5 2 C,D C,D DCP2_YEAST DCP2 DECAPPING FACTOR SSSPGQLLDILNSK 14 T 3.7 TaqI_C pdbhh F Eukaryota T 5lmf 2 C,D C,D DCP2_YEAST DCP2 DECAPPING FACTOR TAHSNSQALLDLLKKPT 17 T 3.6 RMMBL pdbhh F Eukaryota T 5lmg 2 C,D C,D DCP2_YEAST DCP2 DECAPPING FACTOR TSGSNELLSILHRK 14 T 16 RRM_9 pdbhh F Eukaryota T 5lmz 1 A,B A,B W0W999_9ACTN Fluorinase GAMVAANGSQRPIIAFMSDLGTTDDSVAQCKGLMHSICPGVTVVDVCHSMTPWDVEEGARYIVDLPRFFPEGTVFATTTYPATGTTTRSVAVRIRQAAKGGARGQWAGSGDGFERADGSYIYIAPNNGLLTTVLEEHGYIEAYEVTSTKVIPANPEPTFYSREMVAIPSAHLAAGFPLAEVGRRLDDSEIVRFHRPAVEISGEALSGVVTAIDHPFGNIWTNIHRTDLEKAGIGQGKHLKIILDDVLPFEAPLTPTFADAGAIGNIAFYLNSRGYLSLARNAASLAYPYNLKAGLKVRVEAR 302 T 3.8E-42 SAM_adeno_trans unppercent F Bacteria T 5ln4 1 A,B,C A,B,C PSAA_YERPE ADHESIN,ANTIGEN 4,ADHESIN,ANTIGEN 4 TFHVDFAPNTGEIFAGKQPGDVTMFTLTMGDTAPHGGWRLIPTGDSKGGYMISADGDYVGLYSYMMSWVGIDNNWYINDDSPKDIKDHLYVKAGTVLKPTTYKFTGRVEEYVFNDKQSTVINSKDVSGEVTVK 133 T 0.018 SEF14_adhesin unp F Bacteria T 5lnd 1 A,B A,B MYFA_YEREN C-AG,MYF ANTIGEN,C-AG,MYF ANTIGEN,C-AG,MYF ANTIGEN SFSVEFKATENEIVSGKLDADTPAFHLVMSDSGEHKGWNVRPTGASEGGQMVSADGTRVDLHTNELSWDNDHWWIDDGSERVEATFFLAAGDEVKAGEYQFTGRVEEYVEDNKQEPTVINSKDISATKTVKE 132 T 7.3 DUF3836 unppssm F Bacteria T 5los 1 A A G4TKU4_SERID PIIN_05872 GPGSAPLPNPPMTPAQHYAQAIHHEGLARHHTTVAEDHRQTANLHDNRIKAAKARYNAGLDPNGLTSAQKHQIERDHHLSLAAQAERHAATHNREAAYHRLHSQTPAPGTKRSIDELD 118 T 0.13 DSBA pdb F Eukaryota T 5lpc 1 A A B0C4R0_ACAM1 Vanadium-dependent bromoperoxidase MGSSHHPHHHHHSSGLEVLFQGPLGSHMNTRRQQAQNIRNNAAELAANRPHPQHINNKEEYEYRRPKKDGNEPSHIANFTKGLPHDEHTGLLLNSADYDQFVLGIQSGDTTDFARTPLGPAELPKVHGCLSKQKIDCDDDHRSGFWKSQIAQGAAGGDGAKLRAWESAGAGLVFDLEGPDAQAVTMPPAPRLESPELTSEIAEVYSQALLRDIHFSQLRDPGLGDQVNACDSCPTQLSIYEAIDILNTVQIEGQNWFSANCCDLTDDEQARQRPLVTRQNIFRGIAPGDDVGPYLSQFLLIGNNALGGGVFGQEAGHIGYGAIRIDQRVRKATPCKDFMTNFETWLDVQNGADLRGLETYVDADPGKCREFPAYRVITTPRDLATYVHYDALYEAYLNACLILLGMGAPFDPGIPFQKPDVEDKQQGFAHFGGPQILTLVCEAATRGLKAVRFQKFNVHRRLRPEALGGLVDRYKHGKGAGDELKPVAALVEALENVGLLSKVVAHNQLQNQNLDRSGDPSSAGDNYFLPMAFPEGSPMHPSYGAGHATVAGACVTMLKAFFDHGWQLNLGMANGKYISYEPNQDGSSLQQVLLDCPLTVEGELNKIAANISIGRDWAGVHYFTDYIESLRLGEKIAIGILEEQKLTYGENFTMTVPLYDGGSIQI 666 T 0.0014 PAP2 unppssm F Bacteria T 5lqp 1 A,AA,AB,AC,AD,AE,AF,B,BA,BB,BC,BD,BE,BF,C,CA,CB,CC,CD,CE,CF,D,DA,DB,DC,DD,DE,DF,E,EA,EB,EC,ED,EE,EF,F,FA,FB,FC,FD,FE,FF,G,GA,GB,GC,GD,GE,GF,H,HA,HB,HC,HD,HE,HF,I,IA,IB,IC,ID,IE,IF,J,JA,JB,JC,JD,JE,JF,K,KA,KB,KC,KD,KE,KF,L,LA,LB,LC,LD,LE,LF,M,MA,MB,MC,MD,ME,MF,N,NA,NB,NC,ND,NE,NF,O,OA,OB,OC,OD,OE,OF,P,PA,PB,PC,PD,PE,PF,Q,QA,QB,QC,QD,QE,QF,R,RA,RB,RC,RD,RE,RF,S,SA,SB,SC,SD,SE,SF,T,TA,TB,TC,TD,TE,TF,U,UA,UB,UC,UD,UE,UF,V,VA,VB,VC,VD,VE,VF,W,WA,WB,WC,WD,WE,WF,X,XA,XB,XC,XD,XE,XF,Y,YA,YB,YC,YD,YE,Z,ZA,ZB,ZC,ZD,ZE AB,BB,CB,DB,EB,FB,GB,AC,BC,CC,DC,EC,FC,GC,AD,BD,CD,DD,ED,FD,GD,AE,BE,CE,DE,EE,FE,GE,AF,BF,CF,DF,EF,FF,GF,AG,BG,CG,DG,EG,FG,GG,AH,BH,CH,DH,EH,FH,GH,AI,BI,CI,DI,EI,FI,GI,AJ,BJ,CJ,DJ,EJ,FJ,GJ,AK,BK,CK,DK,EK,FK,GK,AL,BL,CL,DL,EL,FL,GL,AM,BM,CM,DM,EM,FM,GM,AN,BN,CN,DN,EN,FN,GN,AO,BO,CO,DO,EO,FO,GO,AP,BP,CP,DP,EP,FP,GP,AQ,BQ,CQ,DQ,EQ,FQ,GQ,AR,BR,CR,DR,ER,FR,GR,AS,BS,CS,DS,ES,FS,GS,AT,BT,CT,DT,ET,FT,GT,AU,BU,CU,DU,EU,FU,GU,AV,BV,CV,DV,EV,FV,GV,AW,BW,CW,DW,EW,FW,GW,AX,BX,CX,DX,EX,FX,GX,AY,BY,CY,DY,EY,FY,GY,AZ,BZ,CZ,DZ,EZ,FZ,BA,CA,DA,EA,FA,GA Q9AZ42_9VIRU Coat protein ANKPMQPITSTANKIVWSDPTRLSTTFSASLLRQRVKVGIAELNNVSGQYVSVYKRPAPKPEGCADACVIMPNENQSIRTVISGSAENLATLKAEWETHKRNVDTLFASGNAGLGFLDPTAAIVSSDTT 129 T 15 Packaging_FI pdbhh T Viruses T 5ls6 4 M,N,O,P Q,R,S,T JARD2_HUMAN Jarid2 K116me3 RLQAQRKFAQS 11 T 21 DUF4395 pdbhh F Eukaryota T 5lsf 4 D,D10,D11,D12,D13,D14,D15,D16,D17,D18,D19,D2,D20,D21,D22,D23,D24,D25,D26,D27,D28,D29,D3,D30,D31,D32,D33,D34,D35,D36,D37,D38,D39,D4,D40,D41,D42,D43,D44,D45,D46,D47,D48,D49,D5,D50,D51,D52,D53,D54,D55,D56,D57,D58,D59,D6,D60,D7,D8,D9 D,D,D,D,D,D,D,D,D,D,D,D,D,D,D,D,D,D,D,D,D,D,D,D,D,D,D,D,D,D,D,D,D,D,D,D,D,D,D,D,D,D,D,D,D,D,D,D,D,D,D,D,D,D,D,D,D,D,D,D A0A2S0CUG6_9VIRU VP4 DNPHRFLPANVSNRWNEYSSAYLPRV 26 T 2.7 YLP pdbhh T Viruses T 5lso 2 B,D C,D LYS-SER-ARG-TRP-ASP-GLU KSRWDE 6 T 0.034 SF3b1 pdbhh F T 5lsp 4 G,H X,Y MET_HUMAN HGF RECEPTOR,HGF/SF RECEPTOR,PROTO-ONCOGENE C-MET,SCATTER FACTOR RECEPTOR,SF RECEPTOR,TYROSINE-PROTEIN KINASE MET ETRECKEALAKSEM 14 T 10 YedD pdbhh F Eukaryota T 5lsw 2 B,D B,D Q9VV48_DROME ROQUIN,ISOFORM A,ISOFORM B,ISOFORM C EGGIDSGMMLQLEKNLVDIVD 21 T 0.18 QueC pdbhh F Eukaryota T 5lvp 2 E,F,G,H E,F,G,H hydrophobic-motif peptide of PKB/Akt KGAGGGGFPQFSYSA 15 T 6.9 DUF4172 pdbhh F T 5lvy 1 A A C9K5V2_ECOLX Adhesin protein NSCSLSISSPDPVTYTIPTDKGDKYINFKLDVPDPRCKALGGTVYFWGADTRDGKLVMKKGQDKYTLMTTYGGAVQQQLGGGYGYYHVSQKTPPQTISGVVSKNVGYKPGQYTVELTGFFSLNDNKQANPTPSSLTSKAAGKNIVSSTGTITIS 154 T 0.00036 SEF14_adhesin unphh F Bacteria T 5lwc 1 A A BS222_STAPS Bacteriocin BacSp222 MAGLLRFLLSKGRALYNWAKSHVGKVWEWLKSGATYEQIKEWIENALGWR 50 F F Bacteria T 5lx2 2 B B PI4KB_HUMAN Phosphatidylinositol 4-kinase beta TASNPK 6 T 13 EndIII_4Fe-2S pdbhh F Eukaryota T 5lxh 2 D,E,F E,F,G ATG4B_HUMAN AUT-LIKE 1 CYSTEINE ENDOPEPTIDASE,AUTOPHAGIN-1,AUTOPHAGY-RELATED CYSTEINE ENDOPEPTIDASE 1,AUTOPHAGY-RELATED PROTEIN 4 HOMOLOG B,HAPG4B EDEDFEILSL 10 T 1.2 Vault_3 pdbhh F Eukaryota T 5lxl 1 A A DECO_BPT5 CAPSID PROTEIN PB10 MGIDYSGLRTIFGEKLPESHIFFATVAAHKYVPSYAFLRRELGLSSAHTNRKVWKKFVEAYGKAIPPAPPAPPLTLSKLEHHHHHH 86 T 0.13 DUF2063 pdbpssm T Viruses T 5ly1 2 E E CP2 XXVYNTRSGWRWYT 14 T 0.14 TraF pdbhh F T 5ly2 2 E,F,G,H E,F,G,H CP2_R6Kme3 XXVYNTKSGWRWYT 14 T 0.11 TraF pdbhh F T 5ly3 2 B B RKD2_PYRCJ ARCADIN-2 GGIGENEWVKILRSKR 16 T 2.4 DUF4287 pdbhh F Archaea T 5lz3 2 B B POLG_AIVA8 3A GNRVIDAEPREIPLEYADDLLEAMAHHRPVPCSLGLSQAIANNTPIQQISETFWKYRK 58 T 0.061 RsdA_SigD_bd pdbpssm T Viruses T 5lz6 2 B B Q8BES6_9PICO 3A GAHSERTFETAPSEIDADEVLEILSKSKPAPTHLTLER 38 T 0.19 Nitroreductase pdb T Viruses T 5lzx 45 SA 1 Nascent chain DSPGLKV 7 T 4.1 Peptidase_M18 pdbhh F T 5m0i 3 G,H,I H,J,I SHE3_YEAST SWI5-dependent HO expression protein 3 KTNVTHNNDPSTSPTISVPPGVTR 24 T 10 B277 pdbhh F Eukaryota T 5m21 1 A,C,E,G A,C,E,G F8TW82_9SPHN Hydroquinone dioxygenase small subunit MADVVTEFGALTDYRKGGVEIIDDDPRNYVFSNVFEVAANAAPYERVAVGKNFEYVIESARAEGTSGWFSCAHDEFVLAMDGQIEVHLLKLDNSDAYVDPDSEGAVAIGEALPEGRKMGRIVLRRGHMALLPVGAAYRFYAEQPAAMLFQSIEGAVTVQKWGEICQTEAA 170 T 0.18 Lyx_isomer pdbhh F Bacteria T 5m41 1 A A Nigritoxine MSLPSNPTPVIPANLDLGGINHSAVANRYRNLTKEAQQNLYQFAIIEVLSQIREERPDKNLDAYNALIGIDKVTTVDIYTYGATNMFFMPDARGSKTGILVNLNSPDKPYTNIQQPSDFNNINDESFRQNFTSWEKRDGTTYSGVDTALDGLQEGQGGWNLGYFNQKTPRTINISELSKILVERLDYHVSQENNDDQILSTLLLDVLPRSAKGAAREPLGVSASGIPFQLEFTFEGFTSPTDELRAIQSPFSHLAKYFDLLVASTNGSDLQDVEYSQEQAENIGAWIDSGTQLLMSASGIGAAVSVIQGAAGLTADAIEGKEIDPLDVISLSLAAIPGGKIVAKLSKVSKNLGQVVRGGISIAETGVDIVGSSRDLIEGFKKGNFTDIINGLVSVASSSASGRPGKSKIGNAIKKGNPDAPLPTRPTYRNHEGEVRPIPTAQTKSFFERVAIVRREGLSGRGAIGLDLTAAQKRGAELSGMGGTISKSNPNGNVSQVYINEAEGIEKNITYRKVPVPNEPGNFENRLQESFLDNNGQTKWRDFPYAGEEFDFRLQHKDDFNNIGDLGVGKQGIIAVNNPYSFVHHSHTFEQKGISNNHLTLESNAFLTYIEGKKTGDFENKYGNEMEWLVRKFKTKKNDFDLKDIPDNIHFRTDREKGDHSLTTYTLQDFITVVENAPTKMRKVKNDEFALNNIVESMRATAKNMGASPDTLFLDVASTNYMTQLMGQVLTNGRQELNLQGLSNAAQKLRNGASSSV 757 T 0.26 Ago_PAZ pdb F T 5m4t 1 A A VSM1_TRYBB VSG GSKKQQTESAENKEKICNAAKDNQKACENLKEKGCVFNTESNKCELKKDVKEKLEKESKETEGKDEKANTTGS 73 T 0.00018 Trypan_glycop_C unphh F Eukaryota T 5m5g 4 D E G0RYC6_CHATD Fragment from molecular 2 (region containing putative polycomb protein Suz12) VMLPGRGVP 9 T 1.2 SSPI pdbhh F Eukaryota T 5m5s 2 C,D,E,F E,F,G,H AMPH_HUMAN Amphiphysin ETLLDLDFDP 10 T 1.7 DUF5331 pdbhh F Eukaryota T 5m5u 2 C,D,E,F E,F,G,H LHDAG_HDVIT L-HDAG,P27 SDILFPADS 9 T 2.6 Pyocin_S pdbhh T Viruses T 5m61 2 C,D,E,F E,F,G,H AMPH_HUMAN Amphiphysin ETLLDLDFDPFK 12 T 0.12 UPF0489 pdbhh F Eukaryota T 5m9e 2 E,F,G,H E,F,G,H DIS1_SCHPO Phosphoprotein p93 RRSLAGSMLQKPTQFSRPSF 20 T 0.076 AF-4 pdbhh F Eukaryota T 5m9f 1 A,B,C A,B,C Q6Y7P9_BPPGK PUTATIVE RECEPTOR BINDING PROTEIN GSSHHHHHHSSGLVPRGSHMASMTGGQQMGRGPKDMEKYLLSSIRDDGSASFPLLVYTSDSKTFQQAIIDHIDRTGQTTFTFYVQGGVSGSPMSNSCRGLFMSDTPNTSSLHGVYNAIGTDGRNVTGSVVGSNWTSPKTSPSHKELWTGAQSFLSTGTTKNLSDDISNYSYVEVYTTHKTTEKTKGNDNTGTICHKFYLDGSGTYVCSGTFVSGDRTDTKPPITEFYRVGVSFKGSTWTLVDSAVQNSKTQYVTRIIGINMP 262 T 0.024 Ig_4 pdb T Viruses T 5m9n 2 C C E2F1_HUMAN E2F peptide SSGPARGXGXHPGKGVK 17 T 0.11 TP1 unp F Eukaryota T 5m9o 2 B B E2F1_HUMAN E2F peptide ARGXGXHPG 9 T 0.11 TP1 unp F Eukaryota T 5m9u 1 A A ANN1_AREMA Arenicin-1 RWCVYAYRRVRGVLVRYRRCW 21 T 1.8 CBP_BcsN pdbhh F Eukaryota T 5mao 1 A,B A,B Q72GF3_THET2 Heat resistant RNA dependent ATPase GGMAERSLLTGEEGWRTYKATGPRLSLPRLVALLKGQGLEVGKVAEAEGGFYVDLRPEARPEVAGLRLEPA 71 T 0.00048 GUCT pdbpercent F Bacteria T 5mas 1 A A A0A1U7Q1Y9_9HYPO Bergofungin A XVXXXVGLXXPQXPXX 16 T 0.12 Pep_deformylase pdbhh F Eukaryota T 5mav 2 G,H,I,J,K,L G,H,K,L,N,M Q0MQR4_HUMAN Poly (ADP-ribose) glycohydrolase QHGKKDSKITDHFMRLPKA 19 T 0.32 DUF4334 pdbhh F Eukaryota T 5mb9 1 A,B A,B G0RZX9_CHATD Putative heat shock protein SAMGWSHPQFEKMAESASKAAPGERVVIGITFGNSNSSIAHTVDDKAEVIANEDGDRQIPTILSYVDGDEYYGQQAKNFLVRNPKNTVAYFRDILGQDFKSVDPTHNHASAHPQEAGDNVVFTIKDKAEEDAEPSTLTVSEIATRYLRRLVGAASEYLGKKVTSAVITIPTNFTEKQKAALIAAAAAADLEVLQLISEPAAAVLAYDARPEATISDKIIVVADLGGSRSDVTVLASRSGMYTILATVHDYEYHGIALDKVLIDHFSKEFLKKNPGAKDPRENPRSLAKLRLEAESTKRALSRSTNASFSVESLIDGLDFASTINRLRYETIARTVFEGFNRLVESAVKKAGLDPLDVDEVIMSGGTSNTPRIAANFRYIFPESTRILAPSTDPSALNPSELQARGAALQASLIQEFETEDIEQSTHAAVTTMPHVTNAIGVVSVSESGEEKFVPIIAPETAVPARRTVHLDAPKEGGDVLVKVVEGSTHINVIKPEPKAKEDGETKEKTEDADDDGDFDDDDEEEEEEEEEEEKREKVWKIGSTLAEAAVRGVKKGAKVEVTINVNTDLTVIVTAREVGGKGGVRGTLSA 590 T 0.053 DUF3221 pdbpercent F Eukaryota T 5mb9 2 C,D C,D G0RYD6_CHATD Putative ribosome associated protein GAMAEKDFKAIGKLTQEGSSMRTLEPVGPHFLAHARRVRHKRTFS 45 T 4.2 Suv3_N pdbhh F Eukaryota T 5mbw 2 B B BACE1 INHIBITOR PEPTIDE Pep#3 EVNXVAEXKX 10 T 18 Musclin pdbhh F T 5mco 2 B B BACE-1 EXOSITE PEPTIDE XALYPYFLPISAK 13 T 0.96 Suv3_N pdbhh F T 5mcq 2 B D BACE-1 ACTIVE AND EXOSITE BINDING INHIBITOR GGGYPYFIPXGXGEVNXVAEXX 22 T 3.3 Img2 pdbhh F T 5me5 2 B B A0A1S3C4H6_CUCME eIF4G NEAIKEDAGALSKAEPDDWEDAADIATPDLESANGDGVGTSMLDSGDRTGDMAKKYSRDFLLKFAEQFLDLPHNFEVTSDIESLMSTHTN 90 T 0.00032 eIF_4G1 pdbhh F Eukaryota T 5mhc 2 B P P53_HUMAN LYS-LEU-MET-PHE-LYS-TPO-GLU-GLY-PRO-ASP-SER-ASP KLMFKTEGPDSD 12 T 44 CopB pdbhh F Eukaryota T 5mhl 2 C,D K,O inhibitor Mi0621 XFXVKVIGX 9 T 9.3 DUF5580 pdbhh F T 5miy 1 A,B,C A,B,C E3 ubiquitin ligase RavN GAMGSMPTYFDPIMQEDTVLDENTIVYLVKIGDNKFSIKAISSGLEHLPSDPTTHAEKYWPIPAKSLIDHSSNKLLFEEDKLTNQPISKDQVIELFAVDPDKTEPKQFSDSVKRELTENWAREVLQDQ 128 T 0.74 GET2 unppssm F T 5mjy 2 E,F E,F ZFYV9_HUMAN MOTHERS AGAINST DECAPENTAPLEGIC HOMOLOG-INTERACTING PROTEIN,MADH-INTERACTING PROTEIN,NOVEL SERINE PROTEASE,NSP,RECEPTOR ACTIVATION ANCHOR,HSARA,SMAD ANCHOR FOR RECEPTOR ACTIVATION MENYFQAEAYNLDKVLDEFEQN 22 T 6.4 OmpH pdbhh F Eukaryota T 5mk0 2 B,D B,D ZFY16_HUMAN ENDOFIN,ENDOSOME-ASSOCIATED FYVE DOMAIN PROTEIN MDSYFKAAVSDLDKLLDDFEQN 22 T 1.2 OmpH pdbhh F Eukaryota T 5mk1 2 E,F,G,H E,F,H,K CHM4A_HUMAN CHROMATIN-MODIFYING PROTEIN 4A,CHMP4A,SNF7 HOMOLOG ASSOCIATED WITH ALIX-2,SNF7-1,HSNF-1,VACUOLAR PROTEIN SORTING-ASSOCIATED PROTEIN 32-1,HVPS32-1 PKVDEDEEALKQLAEWVS 18 T 1.8 ZapA pdbhh F Eukaryota T 5mk2 2 C C CHM4B_HUMAN CHROMATIN-MODIFYING PROTEIN 4B,CHMP4B,SNF7 HOMOLOG ASSOCIATED WITH ALIX 1,SNF7-2,HSNF7-2,VACUOLAR PROTEIN SORTING-ASSOCIATED PROTEIN 32-2,HVPS32-2 KKKEEEDDDMKELENWAGSM 20 T 1.7 TMEM154 pdbhh F Eukaryota T 5mk3 2 E,F,G,H E,F,G,H CHM4C_HUMAN CHROMATIN-MODIFYING PROTEIN 4C,CHMP4C,SNF7 HOMOLOG ASSOCIATED WITH ALIX 3,SNF7-3,HSNF7-3,VACUOLAR PROTEIN SORTING-ASSOCIATED PROTEIN 32-3,HVPS32-3 QRAEEEDDDIKQLAAWAT 18 T 1.1 Ribosomal_60s unppssm F Eukaryota T 5mku 2 B B HIS-LYS-ILE-LEU-HIS-ARG-LEU-LEU-GLN-ASP-SER HKILHRLLQDS 11 T 0.0023 SRC-1 pdb F T 5mlo 2 B,D,F B,D,F ZRAB3_HUMAN ZRANB3 PIP box peptide EKEKQHDIRSFFVPQ 15 T 0.95 DUF4651 pdbhh F Eukaryota T 5mlw 2 B,D,F B,D,F ZRAB3_HUMAN APIM motif peptide GSDITRFLVKK 11 T 0.15 DUF3460 pdbhh F Eukaryota T 5mm2 1 A A CAPS4_NORAV capsid protein VP4C SLPENAPNAVSNPQQFITPATALSAEEYNVHEALGETEELELDEFPVLVFKGNVPVDSVTSIPLDLATIYDFAWDGEQNAISQKFQRFAHLIPKSAGGFGPVIGNYTITANLPTGVAGRILHNCLPGDCVDLAVSRIFGLKSLLGVAGTAVSAIGGPLLNGLVNTAAPILSGAAHAIGGNVVGGLADAVIDIGSNLLTPKEKEQPSANSSAISGDIPISRFVEMLKYVKENYQDNPVFPTLLVEPQNFISNAMTALKTIPIEVFANMRNVKVERNLFDRTVVPTVKEATLADIVIPNHMYGYILRDFLQNKRAFQSGTKQNVYFQQFLTVLSQRNIRTHITLNDITSCSIDSESIANKIERVKHYLSTNSSGETTEEFSRTDTGLLPITTRKIVLGESKRRTERYVAETVFPSVRQ 416 T 0.096 Iso_dh pdb T Viruses T 5mm2 2 B B CAPS4_NORAV capsid protein VP4B ADNEVTAEGGKLVQELVYDHSAIPVAPVVETQAEQPEVPVSLVATRKNDTGHLATKWYDFAKISLSNPANMNWTTLTIDPYNNVTLSRDGESMVLPWRRNVWTTGSKSIGYIRTMVAQINIPRPPQISGVLEVKDSINNSSISLVEFGGKVEIPIIPKVMNGLATTASLPRHRLNPWMRTAESKVELQYRIIAFNRTSDIADLNVSVLLRPGDSQFQLPMKPDNNVDTRHFELVEALMYHYDSLRIRGEEQ 251 T 0.24 Waikav_capsid_1 pdbhh T Viruses T 5mm2 3 C C D2WFA0_NORAV Capsid protein VP4A MQNPTQTMHIYDMPLRVIAGLSTLAKTTEEDDNTSTGIVVSEVGEPQVVNHPAWIDPFVAYQLRAPRKNITPDFIFGRADIGNAFSAFLPRRFSAPAVGTRLVVDPVFTYQQRTVLGLYNYFHADFYYIVHVPAPLGTGIYLKIYAPEFDTTTVTRGIRFKPSASPTIALSVPWSNDLSTVETSVGRVGQSGGSIVIETIEDNSNETVNTPLSITVWCCMANIKATGYRHADTSAYNEKGMNFIPVPVPKPPVPPTKPITGEEQ 264 T 0.0028 Waikav_capsid_1 pdbhh T Viruses T 5mmi 7 G 6 PSRP5_SPIOL plastid ribosomal protein cL37, PSRP5 MALLSPLLSLSSVPPITSIAVSSSSFPIKLQNVSVALLPTLGQRLMTHGPVIAQKRGTVVAMVSAAADETAGEDGDQSKVEEANISVQNLPLESKLQLKLEQKMKMKMAKKIRLRRNRLMRKRKLRKRGAWPPSKMKKLKNV 142 T 0.084 DUF3381 pdbpssm F Eukaryota T 5mmk 1 A A GLY-ILE-LEU-SER-SER-LEU-TRP-LYS-LYS-LEU-LYS-LYS-ILE-ILE-ALA-LYS GILSSLWKKLKKIIAKX 17 T 3.6 TnpW pdbhh F T 5mn3 1 A A domain-swapped metallothionein from Littorina Littorea GSMSSVFGAGCTDTCKQTPCGCGSGCNCKEDCRCQSCKYGAGCTDVCKQTPCGCATSGCNCTDDCKCQSCSTACKCAAGSCKCGKGCTGPDSCKCDRSCSCK 102 T 0.00054 Metallothio_Euk pdbpercent F T 5mn9 2 C C MINY1_HUMAN DEUBIQUITINATING ENZYME MINDY-1,PROTEIN FAM63A GPLGSQVDQDYLIALSLQQQQPRGPLGLTDLELAQQLQQEEYQQ 44 T 0.22 CCDC50_N pdbhh F Eukaryota T 5mrc 71 SB 55 RT13_YEAST 37S RIBOSOMAL PROTEIN MRP13, MITOCHONDRIAL,YMS-A MGTITVVINEGPILLIRALHRATTNKKMFRSTVWRRFASTGEIAKAKLDEFLIYHKTDAKLKPFIYRPKNAQILLTKDIRDPKTREPLQPRPPVKPLSKQTLNDFIYSVEPNSTELLDWFKEWTGTSIRKRAIWTYISPIHVQKMLTASFFKIGKYAHMVGLLYGIEHKFLKAQNPSVFDIEHFFNTNIMCALHRNRLKDYKDAEIAQRKLQVAWKKVLNRKNNTGLANILVATLGRQIGFTPELTGLQPVDISLPDIPNSSSGAELKDLLSKYEGIYLIARTLLDIDQHNAQYLELQEFIRQYQNALSESSDPYDTHLKALGLLETPPPQESTEKEEK 339 T 32 NMU unphh F Eukaryota T 5mrc 73 UB 77 RSM28_YEAST 37S RIBOSOMAL PROTEIN RSM28, MITOCHONDRIAL SSEYVLEEPTPLSLLEYTPQVFPTKESRLVNFTLDSLKKSNYPIYRSPNLGILKVHDFTLNTPNFGKYTPGSSLIFAKEPQLQNLLIEEDPEDFHRQVTGEYQLLKPYVKKDFEKLTKSKDTVSKLVQNSQVVRLSLQSVVMGSEEKKLVYDVCSGMKPISELQQ 165 T 0.33 bCoV_NS6 pdbpercent F Eukaryota T 5ms2 1 A A Q5ZUV9_LEGPH Legionella pneumophila effector protein RavZ GPMKGKLTGKDKLIVDEFEELGEQESDIDEFDLLEGDEKLPGDSELDKTTSIYPPETSWEVNKGMNSSRLHKLYSLFFDKSSAFYLGDDVSVLEDKPLTGAYGFQSKKNDQQIFLFRPDSDYVAGYHVDAKSDAGWVNDKLDRRLSEISEFCSKATQPATFILPFVEMPTDITKGVQHQVLLTISYDPKSKQLTPTVYDSIGRDTYSESLSSYFKGKYRTTCDEILTQSIEKAIKSTDFTLGKFTRAAYNHQNRLTEGNCGSYTFRTIKEVISSSAQGTEVKIPGSGYITSNSYLTSQHVQDIESCIKYRNLGVVDIESALTEGKTLPVQLSEFIVALEDYGKLRSQQSEKSMLNFIGYSKTAKLTAVELLIGILNDIKGKNEISESQYDKLVKEVDCLMDSSLGKLVQFHLKNLGAESLQKLVLPCVKFDDT 433 T 8.9 DUF438 pdbhh F Bacteria T 5ms7 1 A A Q5ZUV9_LEGPH Legionella pneumophila effector protein RavZ GPGEQESDIDEFDLLEGDEKLPGDSELDKTTSIYPPETSWEVNKGMNSSRLHKLYSLFFDKSSAFYLGDDVSVLEDKPLTGAYGFQSKKNDQQIFLFRPDSDYVAGYHVDAKSDAGWVNDKLDRRLSEISEFCSKATQPATFILPFVEMPTDITKGVQHQVLLTISYDPKSKQLTPTVYDSIGRDTYSESLSSYFKGKYRTTCDEILTQSIEKAIKSTDFTLGKFTRAAYNHQNRLTEGNCGSYTFRTIKEVISSSAQGTEVKIPGSGYITSNSYLTSQHVQDIESCIKYRNLGVVDIESALTEGKTLPVQLSEFIVALEDYGKLRSQQSEKSMLNFIGYSKTAKLTAVELLIGILNDIKGKNEISESQYDKLVKEVDCLMDSSLGKLVQFHLKNLGAESLQKLVLPCVKFDDTIDDFVTIEKDELFDVPDITGEELASKKGIEQGALDKEALLKQKQIKTDLLDLREEDKTGLKKPLHGGIKVK 485 T 15 DUF438 pdbhh F Bacteria T 5ms8 1 A A Q5ZUV9_LEGPH Legionella pneumophila effector protein RavZ GPMKGKLTGKDKLIVDEFEELGEQESDIDEFDLLEGDEKLPGDSELDKTTSIYPPETSWEVNKGMNSSRLHKLYSLFFDKSSAFYLGDDVSVLEDKPLTGAYGFQSKKNDQQIFLFRPDSDYVAGYHVDAKSDAGWVNDKLDRRLSEISEFCSKATQPATFILPFVEMPTDITKGVQHQVLLTISYDPKSKQLTPTVYDSIGRDTYSESLSSYFKGKYRTTCDEILTQSIEKAIKSTDFTLGKFTRAAYNHQNRLTEGNCGSYTFRTIKEVISSSAQGTEVKIPGSGYITSNSYLTSQHVQDIESCIKYRNLGVVDIESALTEGKTLPVQLSEFIVALEDYGKLRSQQSEKSMLNFIGYSKTAKLTAVELLIGILNDIKGKNEISESQYDKLVKEVDCLMDSSLGKLVQFHLKNLGAESLQKLVLPCVKFDDTIDDFVTIEKDELFDVPDITGEELASKKGIEQGALDKEALLKQKQIKTDLLDLREED 489 T 17 DUF438 pdbhh F Bacteria T 5msm 3 C,F C,F CTF18_YEAST Chromosome transmission fidelity protein 18 SGKVKTGLNSSSSTIDFFKNQYGLLKQTQELEETQKTIGSDETNQADDCNQTVKIWVKYNEGFSNAVRKNVTWNNLWE 78 T 2 Jnk-SapK_ap_N unppssm F Eukaryota T 5mtw 2 E,F,G E,F,G HIGA1_MYCTU Antitoxin HigA1 EVPTWHRLSSYRG 13 T 2.6 WW_like pdbhh F Bacteria T 5mu0 3 Q,R,S,T,U,V,W,X Q,R,S,T,U,V,W,X CO2A1_MOUSE ALPHA-1 TYPE II COLLAGEN GPPGARGLTGXPGDAGPP 18 T 0.0042 Collagen unppercent F Eukaryota T 5mu2 3 Q,R,S,T,U,V,W,X X,Q,R,S,T,U,V,W synthetic peptide containing the CII583-591 epitope of collagen type II GPPGPPGPPGPPGGRGLTGPIGPPGPPGPP 30 T 0.00036 Collagen pdb F T 5mu3 2 B B CENPP_KLULA CHROMOSOME TRANSMISSION FIDELITY PROTEIN 19 MNIEQRKKYLDITLNDVTVTCEKDMILLRKGSFTASFRIAVENESIRSMAIDLNAFEVELQPIIQYAEDTQNVNVAMMAVVQFLRIKELHEQMISKIVEASKFIRASNNTITLNDLEVSFHCYWNLPSPYPETLILTNKVQKILDFLIYQYGIQLGVIKYGSTII 165 T 0.0022 CENP-P unphh F Eukaryota T 5muu 1 A,B A,B P1_BPPH6 Major inner protein P1 MFNLKVKDLNGSARGLTQAFAIGELKNQLSVGALQLPLQFTRTFSASMTSELLWEVGKGNIDPVMYARLFFQYAQAGGALSVDELVNQFTEYHQSTACNPEIWRKLTAYITGSSNRAIKADAVGKVPPTAILEQLRTLAPSEHELFHHITTDFVCHVLSPLGFILPDAAYVYRVGRTATYPNFYALVDCVRASDLRRMLTALSSVDSKMLQATFKAKGALAPALISQHLANAATTAFERSRGNFDANAVVSSVLTILGRLWSPSTPKELDPSARLRNTNGIDQLRSNLALFIAYQDMVKQRGRAEVIFSDEELSSTIIPWFIEAMSEVSPFKLRPINETTSYIGQTSAIDHMGQPSHVVVYEDWQFAKEITAFTPVKLANNSNQRFLDVEPGISDRMSATLAPIGNTFAVSAFVKNRTAVYEAVSQRGTVNSNGAEMTLGFPSVVERDYALDRDPMVAIAALRTGIVDESLEARASNDLKRSMFNYYAAVMHYAVAHNPEVVVSEHQGVAAEQGSLYLVWNVRTELRIPVGYNAIEGGSIRTPEPLEAIAYNKPIQPSEVLQAKVLDLANHTTSIHIWPWHEASTEFAYEDAYSVTIRNKRYTAEVKEFELLGLGQRRERVRILKPTVAHAIIQMWYSWFVEDDRTLAAARRTSRDDAEKLAIDGRRMQNAVTLLRKIEMIGTTGIGASAVHLAQSRIVDQMAGRGLIDDSSDLHVGINRHRIRIWAGLAVLQMMGLLSRSEAEALTKVLGDSNALGMVVATTDIDPSL 769 T 0.22 STAG pdb T Viruses T 5muu 3 D,E,F,G,H,I,J,K,L,M D,E,F,G,H,I,J,K,L,M CAPSD_BPPH6 PROTEIN P8 MLLPVVARAAVPAIESAIAATPGLVSRIAAAIGSKVSPSAILAAVKSNPVVAGLTLAQIGSTGYDAYQQLLENHPEVAEMLKDLSFKADEIQPDFIGNLGQYREELELVEDAARFVGGMSNLIRLRQALELDIKYYGLKMQLNDMGYRS 149 T 2.7 DnaI_N pdbhh T Viruses T 5muz 1 A,B A,B J7HBG8_9VIRU L protein GPHADGDQNLFDYQFTGTPEEPIKGYWTTTISYRDSKPKISLTIRQEFVEGGVESQAVLATVVGRPHLQDFLLLKRKHLEYSDYPESIDLIEFGDVKVIEKTV 103 T 1.8 Viral_Rep pdbhh T Viruses T 5mwe 2 B,C C,D CNN_DROME PROTEIN ARROW GPMDQQNSAVIGQLRLELQQARTEVETADKWRLECIDVCSVLTNRLEELAGFLNSLLKHKDVLGVLAADRRNAMRKAVDRS 81 T 0.0034 FPP unphh F Eukaryota T 5mxs 1 A A TRP-TYR-HIS-ARG-LEU-SER-HIS-LEU-HIS-SER-ARG-LEU-GLN-ASP-NH2 WYHRLSHLHSRLQDX 15 T 8.1 FA_hydroxylase pdbhh F T 5mxt 1 A A TRP-TYR-HIS-ARG-LEU-SER-HIS-ILE-HIS-SER-ARG-LEU-GLN-ASP-NH2 WYHRLSHIHSRLQDX 15 T 3.1 Endotoxin_M pdbhh F T 5my9 2 B P LRRK2_HUMAN DARDARIN LQRHSNSLGPIFD 13 T 15 DCP1 pdbhh F Eukaryota T 5myc 2 B P LRRK2_HUMAN DARDARIN VKKKSNSISVGEFYRDAVLQRCSPNLQRHSNSLGPIFD 38 T 48 US30 pdbhh F Eukaryota T 5mz6 2 B B IFY1_CAEEL Interactor of FizzY protein MEDLNFEERGSTQIPASLQQHFSAKLGRQNELEKTPSRGGLGLVVNSSKTPGGKSLQSLASACKVPPSTKKNTIPIAFECYEDETDDQIADVATIKKTEKHPCSPIDTANRCETFDSLAADIEDDMLNLEDQDVVLSEDRPYGDVIDPAESEAEALAELGVEEWDSYPPIDPASRIGDDFNYVLRTEDFAEEGDVKLEETRHRTVIADIDEVKMSKAERNELFSMLADDLDSYDLLAEEANLPL 244 T 35 Sgf11_N pdbhh F Eukaryota T 5mzm 3 C,F C,F Ceramide synthase 5 derived peptide Trh4 p3P MCPRMTAVM 9 T 6.8 Mob1_phocein pdbhh F T 5n4b 2 C,D D,C AAMA1_GALM3 Alpha-amanitin proprotein IWGIGCNPWTAEHVDQTLASGNDIC 25 T 1.1 Sld7_N unphh F Eukaryota T 5n4c 2 B,F,G,H E,F,G,H AAMA1_GALM3 Alpha-amanitin proprotein MFDTNATRLPIWGIGCNPWTAEHVDQTLASGNDIC 35 T 1.1 Sld7_N pdbhh F Eukaryota T 5n5r 2 B P WWTR1_HUMAN TAZ pS89 peptide SHSSPASLQ 9 T 0.13 TFIIA unppercent F Eukaryota T 5n5w 2 B P WWTR1_HUMAN TAZ pS89 peptide RSHSSPASLQ 10 T 0.13 TFIIA unppercent F Eukaryota T 5n7x 2 C,D,G,I,J,L,N,P C,D,G,I,J,L,N,P GLU-TRP-VAL-HIS-PRO-GLN-PHE-GLU-GLN-LYS-ALA-LYS Peptide EWVHPQFEQKAK 12 T 0.052 RHINO pdbhh F T 5n85 2 B B PRIPO_HUMAN HPRIMPOL1,COILED-COIL DOMAIN-CONTAINING PROTEIN 111 DNGIDDAYFLEATED 15 T 25 DUF5302 pdbhh F Eukaryota T 5n89 2 C,E,G,I,J,L,N,P C,E,G,I,J,L,N,P GLY-ASN-SER-PHE-ASP-ASP-TRP-LEU-ALA-SER-LYS-GLY-NH2 GNSFDDWLASKGX 13 T 0.86 CnrY pdbhh F T 5n8a 1 A X PRIPO_HUMAN HPRIMPOL1,COILED-COIL DOMAIN-CONTAINING PROTEIN 111 MGSSHHHHHHSSGLVPRGSHMTTDEADETRSNETQNPHKPSPSRLSTGASADAVWDNGIDDAYFLEATEDAELAEAAENSLLSYNSEVDEIPDELIIEVLQE 102 T 31 Hydin_ADK pdbhh F Eukaryota T 5n8b 2 B,D,F,H E,C,F,H ALA-PHE-PRO-ASP-TYR-LEU-ALA-GLU-TYR-HIS-GLY-GLY-NH2 AFPDYLAEYHGGX 13 T 1.6 Not1 pdbhh F T 5n8e 2 E,F,G E,F,H ARG-ASP-PRO-ALA-PRO-ALA-TRP-ALA-HIS-GLY-GLY-GLY-NH2 RDPAPAWAHGGGX 13 T 5.7 DUF2591 pdbhh F T 5naf 2 E,F E,F MECP2_MOUSE MECP2 KKAVKESSIRSVHETVLPIKKRKTR 25 T 0.57 Humanin pdbhh F Eukaryota T 5nam 1 A A TLR4_HUMAN HTOLL MNITSQMNKTIIGVSVLSVLVVSVVAVLVYKFYFHLMLLAGCIKYGRG 48 T 0.082 Serinc pdbpssm F Eukaryota T 5nao 1 A A TLR4_HUMAN HTOLL MNITSQMNKTIIGVSVLSVLVVSVVAVLVYKFYFH 35 T 0.014 Phageshock_PspG pdb F Eukaryota T 5nas 2 C,D C,D PI4KB_HUMAN PTDINS 4-KINASE BETA,NPIK,PI4K92 LKRTASNPK 9 T 1.4 RE_HindIII pdbhh F Eukaryota T 5ncl 3 C D SSD1_YEAST PROTEIN SRK1 TTEQSDFKFP 10 T 23 Interferon pdbhh F Eukaryota T 5ncm 2 B B CBK1_YEAST CELL WALL BIOSYNTHESIS KINASE GSASSPVQSGFNNGTISNYMYFERRPDLLTKGTQDKAAAVKLKIENFYQSSVKYAIERNERRVELETELTSHNWSEERKSRQLSSLGKKESQFLRLRRTRLSLED 105 T 0.076 DNA_pol_D_N pdb F Eukaryota T 5ncn 2 B B DBF2_YEAST DUMBBELL FORMING PROTEIN 2 GSASKKLPPKFYERATSNKTQRVVSVCKMYFLEHYCDMFDYVISRRQRTKQVLEYLQQQSQLPNSDQIKLNEEWSSYLQREHQVLRKRRLKPK 93 T 0.0039 PPPI_inhib pdbpercent F Eukaryota T 5nco 39 MA k Signal sequence (1A9L) KQSTLALLLLLLLLTPVAAAAAA 23 T 1.5 Mfp-3 pdbhh F T 5nd1 1 A A M1VMJ0_RNQV1 Capsid protein MATQMQQRDNGEETLANIKHSARSELKDMNVSGLAGINIQAGGYDLTGLSLTDELIAQGLSNVGLLPFEHSRVVLELAEAITVTANNTDMGGSGQYCEQGAWRAWLHVGLNMAKHHVRIRSSAAMDFGTHRMMACDPASLDASTISAMSRNVTTATMNAVSREMKAMKALAAGRSRMSQSDIADNNDCRGFAFGVLSRMVMHNSARRHGVVNGRLQELGENDASTADTYLTWELACAHGKGEVAITPVPAAWLDPEAQLTGRERVFSEALARLVDPDVGCVHVKIDGVTQNAAENARVHYATRPDPMSWLDDNTGLSADSNAGRISGEHYTLWKGRHSKVHLTIQLKQLYHRMSTTAATAEPRADSIVYYLKGFEGLGACAEFLLANSRFGHHSFLPGVFGVTADVHEAAYAQNQALFLAGIGDRMPPATFTKAQLATATYALMRRYDISERTCHFAITTIGHMVAQTAVRDLNNGSLSPLPFRVNLSPFLVQGVQFWDMNDTEGSVSVHDMGIGKELLTATYALGAMASLAHVCEQGGTGEEMSAIEVSRFTDVHTVATDLFRKVVMTELGDLKLRGSEVTHSSQEALFAQKMKAVWSSMAEGSTRLYNLNQAYGPFVDVQLARIRSSFVRDIGASRQMMDASALIKHAQNVTYDWPQNESGCPVQFIALPVPSTITHYATPAIGTERWFATTRLNAAGSKVISEIRWTNGLNSDDRAGVHVFAYGRSTIVSSPLGCAEAMAAMVIAGEHKVVRRHTSIARAQSARTANIVAGAVLGARNGDMMTIIRPSTSVASSAVHLRGYIPMAAMNMLPITDGDCDLVVADTRTRPGRMSTSPEAHRRGVLASDYHVDIATDGNIRHVAREVYTVADIPTVSERVSGLALRPYERSCVRDASTLHSMLCGAVPLLYGGGEPMKLGDNTPVTNRQALRPPEYNRNPALRMPARFQMGTTACAFTKALGDVRAQMELREGDVVTEEVTEPDTTIVPQGSITERVVVGEMTEALIMADQPMFDQDVVQNIMYNSPGIRGTERANIQAELMAAKDWPSILQATAKSSHGDIASAKTPYDLIKACKVDWSKVKGETQLKLIMNKIHPEYMTLSRAISAQVDARIVPNVPKSAMSTLLFWACATDMGLHTAVMANLAGLQRTTGFKGGIVYDLEGGANWPATDVRARIIEGWNAHARRIAQSGLISRDLTVMKIQHDMNADDIMALPAHIGDSWVLTIGELTANIATDEQSVAYAKDAQSAYYAIDRLRRLVAGREEGVDEILSKAEVMARVLAENKGLADDQALYHREWNRRTMAYVYMTTYLGSLDVEARLGVVSDDAYAERRAEWKAERAKLAVPAVSGQGQGRR 1357 T 4.8 CmlA_N pdbhh T Viruses T 5nd1 2 B B M1VHN2_RNQV1 Capsid protein MSAPSDQSQETRSPTSVGNTVAADVQTSVHDKPTGELKGSDGTGIHEATGLPIDKRGEVPTVQLERTAESIAKMMDLLRSEKFTAAAADAKLMLQQEFQNIVACAKNAPQMTVNAGRFYLGCNSTTAIIAGDTADGYEIEYSGKRIEGQCVVALEPLTITLSGSTSSTQDNSDSAKLFALAVSQVWGGASTVGIVAPMLQTVAQEQTFRARVERDSGFQHHAALTTVVTTIVGWLMHVGDSAAKRSRDGWLDHQTDFAVKGMLTPHIASGMDWAGVQTYSASAMETTTDRVRADYAGRMVVHSTLRKQTLRSRGTGDTTETENSGRYLLALPKCDAGVAAAALALTWGKPKLGGAGHANLTAVMSEAGVGYITGVNGTRATPHADTVFGREELVYLLGFALRHMADAQEQVIRNVLAQVASLFRPAACSAHEWMNVHGALMPKVSRPMNEPAFREVWNVANSSSDLQMIDRDKLNGEHFLRQLAQQITVNCTGTAMAIYQAVLAGPTGITDGDTTRLQKDLYHHLFQYATTTYADGVQVMQANTRMANKMVPPVNALAAWGLGSSMDSFTGPHCAYYFGLADAADGCFYSTTTGRTLSVYAVDVNHTSSDSYLAMAQLEPGLIATATGTGSTITTNVEAAGVVDGGLVTEGHVSLYTTISAQWNGLQREVYNWLLWHACKTEDSSHADIVGAEEVKSAVEWLSSNSVEAHRFRSSAGLGATEAAGSPGRRAWRLHHYDGQIFSNVIADTERHPYMRRLYTPSELRDARNDLFVVDRIWKIVMAMRAQLMLISVQEDGGRHQHSKHYFGEAAAIGVMGHGFTNLFAYCASTVHGGREARLISNCTDTPMYKKEANDLVPPMMKVAQLSTLLAHGGAWCNAVNMGGNSTSIGLSILGDGTMPLQTVPWTVNEITYLSEEGARHGIEAIIDTNGSVSVKVKMTMLEPRQRFCLYDDNKTSSYITAQESRTATYVTLKLGGTKNANTISGLVAHDYKLATTILASTYDKGRKTGLTLEDLQKVGGITGGQGMTGRGGGSSSGRGGRGRGGSSTGGAETIGDSE 1059 T 24 Prion_octapep pdbhh T Viruses T 5ngn 1 A,B,C A,B,C A0A2D0TC93_LYCBA lybatide 2 DSCSEYCSNRCPSCDGQTQTQYTLCCINICCPS 33 T 0.12 Radical_SAM_2 pdbhh F Eukaryota T 5nif 15 CA,DA 3,4 TRP-ARG-SER-TYR-TYR-ALA KYFTGSKLWRSYYA 14 T 3.4 DUF4130 pdbhh F T 5njc 3 E,F E,F VAL-LEU-GLU-ASP-ARG-ILE VLEDRI 6 T 63 DUF5805 pdbhh F T 5njx 2 B B Q96HX7_HUMAN HSP90AA1 protein HHHHHHDDTSRMEEVD 16 T 4900 NHL pdbhh F Eukaryota T 5nkp 2 C,D D,C WNK3_HUMAN PROTEIN KINASE LYSINE-DEFICIENT 3,PROTEIN KINASE WITH NO LYSINE 3 ECEETEVDQHV 11 T 17 GP79 pdbhh F Eukaryota T 5nne 2 B C TOP2A_HUMAN GKA(ALY)GK(ALY)TQMY GKAXGKXTQMY 11 T 17 TAT_ubiq pdbhh F Eukaryota T 5nnf 2 B B BAZ1B_HUMAN FLPH(ALY)YDVKL FLPHXYDVKL 10 T 1.9 HMMR_N pdbhh F Eukaryota T 5nny 1 A,B A,B Q5GA15_LEGPN WipB GPMTDISMGDLHANALLFLNILVRQGIIAISPENYAKFAEIYTLPELQADYWGTEAPVFSAENKQERLEEIKKQYNALIAQIKIINTKKLIRLIGDELVDRGVIDYFILKLLQALYDQGADFEILLSNHGIEFVEACELFKENGNKLVAKRLGNIQHGNSFHALQEAIAAGAISNEEVLNIYHQVYKKHLKIISYSLDPDANEIKVFSHAGIGLNHIRGLARKFKVPYSEESAVDLAKTIDAINKKFAEKASSGEIHTLYTHDMMYRGYAGEHLNSTDEVVAATVWGREYGDLIRTSKKFKITFIHGHDSYDPEKVEHVTLN 322 T 6.4E-05 Metallophos pdbpercent F Bacteria T 5no2 10 J L RS12_ECOLI SMALL RIBOSOMAL SUBUNIT PROTEIN US12 ATVNQLVRKPRARK 14 T 1.6 MobC pdbhh F Bacteria T 5no7 1 A,B A,B A0A060SRI5_PYCCI Lytic polysaccharide monooxygenase HIAFWHNSMYGFNVTEQTFPYDNRPVVPLQYMTFQEWWFHNHLDYPPHPGDFFDFPAGKAATAELACNKGATTWFNSSEGGNIQNGNDPCPGSPPSEYHTTGIDDVKGCAMAIAYESDVRKIKPEDFTVFSVNQTCVWYRFTDFQVPERMPPCPPGGCHCAWFWIHSPDSGGEQIYMNGFQCNITGSTSHVPLAKPKVARRCGADPDHGKPDAVPGNCTYGAKQPLYWLQKEGNNEFDDYIAPPFYNDLYNFKDGAQNDIFVDSYPDGIPLEQKLISEEDLNSAVDHHHHHH 292 T 6.9 Tetradecapep pdbhh F Eukaryota T 5npj 2 C,D D,E POLG_HCVJF Epitope peptide WGENETDVFLLN 12 T 0.00057 HCV_NS1 pdbhh T Viruses T 5npr 2 B E bisubstrate inhibitor XVTPVCTAX 9 T 0.17 hNIFK_binding pdbhh F T 5nps 2 B D 5,6-DIHYDRO-BENZO[H]CINNOLIN-3-YLAMINE XVTPVSTAX 9 T 25 RB_A pdbhh F T 5nqf 2 B B A5K3N8_PLAVS Rhoptry neck protein 2 MDISQHATDIGMGPATSCYTSTIPPPKQVCIQQAVKATL 39 T 3.5 zf-XS pdbhh F Eukaryota T 5nr5 1 A A Q54HW9_DICDI MatA protein GSHMASMDPLDKIINDIKKEANDSGVTLAPLSVPKPKLEELSEQQKIILAEYIAEVGLQNITAITLSKKLNITVEKAKNYIKNSNRLGRTNNLKTIGILQEEVSSMEAKSMTW 113 T 0.014 P4Ha_N pdbpercent F Eukaryota T 5nvk 2 B,D,F,H B,D,F,H GGYF1_HUMAN PERQ AMINO ACID-RICH WITH GYF DOMAIN-CONTAINING PROTEIN 1 GPHMKYKLADYRYGREEMLALYVKENKVPEELQDKEFAAVLQDEPLQPLALEPLTEEEQRNFSLSVNSVAVLRLM 75 T 2.6 T4SS_TraI pdbhh F Eukaryota T 5nvl 2 B,D B,D GGYF2_HUMAN PERQ AMINO ACID-RICH WITH GYF DOMAIN-CONTAINING PROTEIN 2,TRINUCLEOTIDE REPEAT-CONTAINING GENE 15 PROTEIN GPHMKYKLADYRYGREEMLALFLKDNKIPSDLLDKEFLPILQEEPLPPLALVPFTEEEQRNFSMSVNSAAVLRLT 75 T 3.1 T4SS_TraI pdbhh F Eukaryota T 5nvm 2 B,D B,D GGYF2_HUMAN PERQ AMINO ACID-RICH WITH GYF DOMAIN-CONTAINING PROTEIN 2,TRINUCLEOTIDE REPEAT-CONTAINING GENE 15 PROTEIN GPHMKYKLADYRYGREEMLALFLKDNKIPSDLLDKEFLPILQ 42 T 14 TTRAP pdbhh F Eukaryota T 5nwj 2 B P KAT1_ARATH Potassium channel KAT1 HLYFSSN 7 F F Eukaryota T 5nwy 1 A s A0A0P7EF65_VIBAL VemP nascent chain MHHHHHHHHHHGDYKDDDDKENLYFQGSAQIDQKAHVPHFSKLQPFVAVSVSPNSSVDFSEASEESSQSPVSEGHASLDSVALFNSQRWTSYLREGLDDEHVDFVGDLTTPFYADAGYAYSLMDINWRHNQSTFYHFTSDHRISGWKETNAMYVALNSQFSALEVLFQGPYPYDVPDYA 179 T 6.3 DUF4022 unphh F Bacteria T 5nxf 1 A,B,C A,B,C FIBP_BPT4 GENE PRODUCT 34,GP34 STEAQEGVIKVATQSETVTGTSANTAVSPKNLKWIAQSEPTWAATTAIRGFVKTSSGSITFVGNDTVGSTQDLELYEKNSYAVSPYELNRVLANYLPLKAKAADTNLLDGLDSSQFIRRDIAQTVNGSLTLTQQTNLSAPLVSSSTGEFGGSLAANRTFTIRNTGAPTSIVFEKGPASGANPAQSMSIRVWGNQFGGGSDTTRSTVFEVGDDTSHHFYSQRNKDGNIAFNINGTVMPININASGLMNVNGTATFGRSVTANGEFISKSANAFRAINGDYGFFIRNDASNTYFLLTAAGDQTGGFNGLRPLLINNQSGQITIGEGLIIAKGVTINSGGLTVNSRIRSQGTKTSDLYTRAPTSDTVGFWSIDINDSATYNQFPGYFKMVEKTNEVTGLPYLERGEEVKSPGTLTQFGNTLDSLYQDWITYPTTPEARTTRWTRTWQKTKNSWSSFVQVFDGGNPPQPSDIGALPSDNATMGNLTIRDFLRIGNVRIVPDPVNKTVKFEWVE 509 T 0.16 gp12-short_mid unphh T Viruses T 5nxh 1 A,B,C A,B,C FIBP_BPT4 GENE PRODUCT 34,GP34 SGLVESGTLWDHYTLNILEANETQRGTLRVATQVEAAAGTLDNVLITPKKLLGTKSTEAQEGVIKVATQSETVTGTSANTAVSPKNLKWIAQSEPTWAATTAIRGFVKTSSGSITFVGNDTVGSTQDLELYEKNSYAVSPYELNRVLANYLPLKAKAADTNLLDGLDSSQFIRRDIAQTVNGSLTLTQQTNLSAPLVSSSTGEFGGSLAANRTFTIRNTGAPTSIVFEKGPASGANPAQSMSIRVWGNQFGGGSDTTRSTVFEVGDDTSHHFYSQRNKDGNIAFNINGTVMPININASGLMNVNGTATFGRSVTANGEFISKSANAFRAINGDYGFFIRNDASNTYFLLTAAGDQTGGFNGLRPLLINNQSGQITIGEGLIIAKGVTINSGGLTVNSRIRSQGTKTSDLYTRAPTSDTVGFWSIDINDSATYNQFPGYFKMVEKTNEVTGLPYLERGEEVKSPGTLTQFGNTLDSLYQDWITYPTTPEARTTRWTRTWQKTKNSWSSFVQVFDGGNPPQPSDIGALPSDNATMGNLTIRDFLRIGNVRIVPDPVNKTVKFEWVE 564 T 0.16 gp12-short_mid unphh T Viruses T 5nxk 1 A,B,C A,B,C Serine-rich secreted cell wall anchored (LPXTG-motif ) protein IEEVSNEEELKAALRDASITTIKLKNNITLNNAITINNGNRNITIIGDGHYINALNSDGGIILNNRGGSAKIDLTIENATLYNTSKYGFVNMSSNGVDTVTYKDVTAYGGTLVWSKTGAGVKTLNLVGNTTLNSVKSYEVDGQSCGTEAFSHRTPDGDKTTALYVSNAINIAENANVVLNNSATDIDMWLLTAVPSTSGISTVTVGNNASLTMENIGNTEYNIKLDGGRENHFIVNENAAVKMSAKVDNVRIIPQLENIFTRGNIELAKGSNVHLEVITGSNFRVAGTVANRIDFNGTATLIKQEGASGP 310 T 0.18 Cas5fv_helical pdb F T 5nxq 2 D,E D,E SLD5_YEAST MET-ASP-ILE-UA1-ILE-ASP-ASP-ILE-LEU-UA2-GLU-LEU-ASP-LYS-GLU MDIXIDDILXELDKETTAV 19 T 0.5 Bombolitin pdbhh F Eukaryota T 5ny0 1 A A A0A384E0N5_LACR1 L. reuteris SRRP binding region EDIQADATAANASELKKALQDTSVHTIKLTDNITLTSAIELTNVSRDVTIYGNGKYINATDGNGGIFIHNTKSYTVNLTIEKATLYNQSQYGFVHMNDEGTDNITYKNITAYGGTLVWSQTHVGTKTLSLEGTVNFYSVPSYTVGGQTYSTDAFKIGTHYPNGENKDTTPAIYVSNEINIADNANIALENSATKIDIWMIADIGIHPHTTALTIGNNATLTMENGNNSALNIKLDGDTSNSFTVGEGSTVKLSAKVDNVRILPYEDSNTANVSFAKGSDVTLHAGTGSNLRMGASISNQIDFNGKATFIKDSGAYANTAYADQTRGNIEFDYYWNDQQKTGSTGVANFNPGSNVLFQAGPGASNVNTY 368 T 0.0055 MAP pdb F Bacteria T 5o3u 2 E,F,G,H L,M,N,O F6LNM3_9CARY Putative presegetalin F1 MATSFQFDGLKPSFSASYSSKPIQTQVSNGMDNASAPV 38 T 0.02 MAGI_u1 pdb F Eukaryota T 5o3v 2 C,D D,C F6LNL6_9CARY Putative presegetalin B1 MSPILAHDVVKPQGVAWAFQAKDVENASAPV 31 T 16 DPRP pdbhh F Eukaryota T 5o3w 2 E,F,G,H W,X,Y,Z F6LNL5_9CARY Presegetalin A1 MSPILAHDVVKPQGVPVWAFQAKDVENASAPV 32 T 10 Choline_sulf_C pdbhh F Eukaryota T 5o45 2 B B PHE-MEA-9KK-SAR-ASP-VAL-MEA-TYR-SAR-TRP-TYR-LEU-CCS-GLY-NH2 FXXXDVXYXWYLXGX 15 T 0.79 Selenoprotein_S pdbhh F T 5o4y 2 D,E,F A,D,F PHE-MAA-ASN-PRO-HIS-LEU-SER-TRP-SER-TRP-9KK-9KK-ARG-CCS-GLY-NH2 FXNPHLSWSWXXRXGX 16 T 2.9 DUF4462 pdbhh F T 5o60 1 A 3 A0QTP4_MYCS2 BL37 MAKRGRKKRDRKHSKANHGKRPNA 24 T 0.18 DUF6254 pdb F Bacteria T 5o6u 3 C,D,E C,D,E A4Y6G1_SHEPC Uncharacterized protein MQKVTGIKSVDFKIKALGHGVVNWNGPTTLTGDDGKTVDNHTLPKLRGYTNLTGKVKDETGYKYKKQATDINFKETPLYISQNCIRHHLFREQAFDLHYASDKNLKNVLASITGLIRGYVVPSSQCKRTSPLLLEDFVDQLGNGNFEQYGQAGARDSTSFFSKTTFGDTEYISYGSISIEQLQFISLDKKFDRAAMVIKEGEGEVIAAELQNYIQSLNPSLNPQAIFHSNYVRRGTIFEEGECGILLNDDAVKALVAETLERLANLSIRQAKGYMYVDDITVDYNDSHKMMRIKRDESEIINEQHAPFAQYFYAK 315 T 0.025 MecA_N pdbpssm F Bacteria T 5o74 1 A,C,E,G,I,K A,C,E,G,I,K DRRA_LEGPN DEFECTS IN RAB1 RECRUITMENT PROTEIN A GHMVTRIENLENAKKLWDNANSMLEKGNISGYLKAANELHKFMKEKNLKEDDLRPELSDKTISPKGYAILQSLWGAASDYSRAAATLTESTVEPGLVSAVNKMSAFFMDCKLSPNERATPDPDFKVGKSKILVGIMQFIKDVADPTSKIWMHNTKALMNHKIAAIQKLERSNNVNCETLESVLSSKGENLSEYLSYK 197 T 0.099 LuxQ-periplasm unppercent F Bacteria T 5o8k 2 B B REV3L_HUMAN PROTEIN REVERSIONLESS 3-LIKE,HREV3 MGRTANILKPLMSPPSREEIMATLLDHD 28 T 3.6 FAM110_C pdbhh F Eukaryota T 5o9t 2 C,D C,D 1IP-CYS-PHE-SER-LYS-PRO-ARG XNCFSKPR 8 T 5.6 DUF1244 pdbhh F T 5o9v 2 C,D C,D AIFM3_HUMAN APOPTOSIS-INDUCING FACTOR-LIKE PROTEIN GGCFSKPK 8 T 0.062 NifU unphh F Eukaryota T 5oac 1 A,B,C,D,E,F,G,H,I,J A,B,C,D,E,F,G,H,I,J A0A2D0TC94_9VIRU Major capsid protein MTIKYLSSETEKLMNQTVSGIDVCFTLIGVDDDSFASGSKNDYISDTPKFLDPSNVHIKATLKRGGKDYVLFSENLALLAKYSTITQGRDQWEEGVKLAAKEMVHLVYIPFSGNTNWPAHINLKDNDVLEVYVNVVRGAYGAELDANACICDVRTSPSIGVEKFIPFMTSYSIRANQATDLVNLGNDVTRIALLSMTNDVSNIPNAFTDVTLSSDRLDKNFNSNQLILEHSKCIEDSVRSHANEVDSYLIHEDIEIDSAKVHLKMNPAKIRENTIYLVRSHFQTSLEILQKAVAMEEKHQSADIAKVPAT 310 T 0.19 AcetylCoA_hydro pdbpercent T Viruses T 5oap 1 A A DRE2A_ARATH DREB2A SSDMFDVDELLRDLNGDD 18 T 6.5 DUF2525 pdbhh F Eukaryota T 5od4 1 A A A0A0C4DI32_FUSO4 SECRETED IN XYLEM 3 PROTEIN,AVR2 GPPYCVFPGRRTSSTSFTTSFSTEPLGYARMLHRDPPYERAGNSGLNHRIYERSRVGGLRTVIDVAPPDGHQAIANYEIEVRRIPVATPNAAGDCFHTARLSTGSRGPATISWDADASYTYYLTISED 128 T 11 DUF4377 pdbhh F Eukaryota T 5ods 2 E,F,G,H E,F,G,H TACC3_HUMAN LYS-GLU-SER-ALA-LEU-ARG-LYS-GLN-SEP-LEU-TYR-LEU-LYS-PHE-ASP-PRO-LEU-LEU KESALRKQSLYLKFDPLL 18 T 2 DUF4293 pdbhh F Eukaryota T 5odt 2 B B TACC3_HUMAN ERIC-1 MELKEESFRDPAEVLGTGAEVDYLEQFGTSSFKESALRKQSLYLKF 46 T 0.99 DUF2095 pdbhh F Eukaryota T 5oec 1 A A Q6EAT3_SALER VIRULENCE PROTEIN GHMQGQIIHHRNFQSQFDTTGNTLYNNAWVCSLNVIKSRDGNNYSALEDITSDNQAFNNILEGIDIIECENLLKEMNVQKIPESSLFTNIKEALQAEVFNSTVEDDFESFISYELQNHGPLMLIRPSLGSECLHAECIVGYDSEVKKVLIYDSMNTSPEWQSNIDVYDKLTLAFNDKYKNEDCSICGLYYDGVYEPK 197 T 0.002 YsaB unppssm F Bacteria T 5oed 1 A A Q6EAT3_SALER VIRULENCE PROTEIN GHMLRHIQNSLGSVYRSNTATPQGQIIHHRNFQSQFDTTGNTLYNNAWVCSLNVIKSRDGNNYSALEDITSDNQAFNNILEGIDIIECENLLKEMNVQKIPESSLFTNIKEALQAEVFNSTVEDDFESFISYELQNHGPLMLIRPSLGSECLHAECIVGYDSEVKKVLIYDSMNTSPEWQSNIDVYDKLTLAFNDKYKNEDCSICGLYYDGVYEPKPLHSSSWKDWCTIL 230 T 0.002 YsaB pdbpssm F Bacteria T 5oek 1 A,B A,B GHR_HUMAN GH RECEPTOR,SOMATOTROPIN RECEPTOR GSMSQFTCEEDFYFPWLLIIIFGIFGLTVMLFVFLFSKQQRIK 43 T 8.8E-05 IFNGR1 unphh F Eukaryota T 5oeo 2 B C TRPV5_HUMAN TRPV5,CALCIUM TRANSPORT PROTEIN 2,CAT2,EPITHELIAL CALCIUM CHANNEL 1,ECAC1,OSM-9-LIKE TRP CHANNEL 3,OTRPC3 GADKEDDQEHPSEKQPSGAESGTLARASLALPTSSLSRTASQSSSHRGWEILRQNTLGHLNLGLNLSEGDGEE 73 T 0.16 Lipase3_N pdbpssm F Eukaryota T 5oh5 1 A A RidL GPSILEEYIRMAKNKEFFDALEEIAESAKNDETLRNELAKVLDDILKTDPSDPEAFRKIVAEHQEFWDEHDPSLMEFNEGRFFGKSRKQYLKSDDFLNSTDPTYNFQKLHQFAAEQRVKLGLEKSDTDTLVAILKNNPEECRAYIESKKPGLGNFSEGNVHGWLKEEYTPTIPPKAINKSTGVLSDEAIKRIKEQARDLLLLKLINSSGNTQLLKDLRDAMSKPEAERAANALGFPTEGNGVLFLSREVVDALEERVEKLEQEAAKRGFDSYVQSLSHNALLA 283 T 0.026 SE pdb F T 5oh6 1 A,B A,B Interaptin GPSAKNKEFFDALEEIAESAKNDETLRNELAKVLDDILKTDPSDPEAFRKIVAEHQEFWDEHDPSLMEFNEGRFFGKSRKQYLKSDDFLNSTDPTYNFQKLHQFAAEQRVKLGLEKSDTDTLVAILKNNPEECRAYIESKKPGLGNFSEGNVHGGSGGVLSDEAIKRIKEQARDLLLLKLINSSGNTQLLKDLRDAMSKPEAERAANALGFPTEGNGVLFLSREVVDALEERVEKLA 237 T 0.14 HEAT_2 pdbpercent F T 5ohg 2 C C RNE_ECOLI RNASE E RRYRDERYPTQSPMPLTVACASPELASGKVWIRYPI 36 T 1.2 XisI pdbhh F Bacteria T 5ohg 3 F J RNE_ECOLI RNASE E RDERYPTQSPMPLTVACASPELASGKVWIRYPIVR 35 T 0.87 XisI pdbhh F Bacteria T 5oj5 2 B B PSBA1_THEEB PHE-PRO-LEU-ASP-LEU-ALA NAHNFPLDLA 10 T 0.49 IL34 pdbhh F Bacteria T 5ojo 2 C C CPSM_HUMAN CARBAMOYL-PHOSPHATE SYNTHETASE I,CPSASE I XVLKEYGV 8 F F Eukaryota T 5ojr 2 E,F E,F PSBA3_THEVB PSII D1 PROTEIN 3,PHOTOSYSTEM II Q(B) PROTEIN 3 NAHNFPLDLASAESAPVA 18 T 3.3 IL34 pdbhh F Bacteria T 5ok6 2 C,D C,D ALA-GLU-GLY-GLU-PHE-TYR-LYS-LEU-LYS-ILE-ARG-THR-PRO-AAR AEGEFYKLKIRTPR 14 T 1.3 RsgI_N pdbhh F T 5okc 3 D,F G,I CTF18_YEAST Chromosome transmission fidelity protein 18 TVKIWVKYNEGFSNAVRKNVTWNNLWE 27 T 6.5 BRCA2 pdbhh F Eukaryota T 5oki 4 E,H E,I CTF18_YEAST Chromosome transmission fidelity protein 18 TVKIWVKYNEGFSNAVRKNVTWNNLW 26 T 6.6 Hairy_orange pdbhh F Eukaryota T 5oll 1 A A GUR_GYMSY SWEET TASTE-SUPPRESSING PEPTIDE EQCVKKDELCIPYYLDCCEPLECKKVNWWDHKCIG 35 T 0.00036 Toxin_7 pdb F Eukaryota T 5ons 2 B B DENR_HUMAN DRP,PROTEIN DRP1,SMOOTH MUSCLE CELL-ASSOCIATED PROTEIN 3,SMAP-3 MHHHHHHDADYPLRVLYCGVCSLPTEYCEYMPDVA 35 T 0.012 PHM7_cyt unppssm F Eukaryota T 5oob 3 E,J,K E,K,Z L9KL62_TUPCH NELF-E,RNA-BINDING PROTEIN RD DKRTQIVYSDDVYKENLVDGF 21 T 0.85 DUF5820 pdbhh F Eukaryota T 5oqt 2 B C MGTS_ECOLI Uncharacterized protein YneM MLGNMNVFMAVLGIILFSGFLAAYFSHKWDD 31 T 0.23 Gram_pos_anchor pdb F Bacteria T 5osh 3 C,F,I,L C,F,I,L Interaptin MALEEYIRMAKNKEFFDALEEIAESAKNDETLRNELAKVLDDILKTDPSDPEAFRKIVAEHQEFWDEHDPSLMEFNEGRFFGKSRKQYLKSDDFLNSTDPTYNFQKLHQFAAEQRVKLGLEKSDTDTLVAILKNNPEECRAYIESKKPGLGNFSEGNVHGWLKEEYTPTIPPKAINKSTGVLSDEAIKRIKEQARDLLLLKLINSSGNTQLLKDLRDAMSKPE 223 T 0.016 SE pdb F T 5osi 3 C,F,I C,F,I Q5ZT54_LEGPH Interaptin KEEYTPTIPPKAIN 14 T 0.25 SE unp F Bacteria T 5ouo 1 A A V5BCL0_TOXGV Perforin-like protein 1 ETGRNLPKQLTQATQVAWSGPPPGFAKCPGGQVVILGFAMHLNFKEPGTDNFRIISCPPGREKCDGVGTASSETDEGRIYILCGEEPINEIQQVVAESPAHAGASVLEASCPDETVVVGGFGISVRGGSDGLDSFSIESCTTGQTICTKAPTRGSEKNFLWMMCVDKQYPGLRELVNVAELGSHGNANKRAVNSDGNVDVKCPANSSIVLGYVMEAHTNMQFVRDKFLQCPENASECKMTGKGVDHGMLWLFDRHALFGWIICKTVNEGTKHHHHHH 277 T 0.23 HZS_alpha pdb F Eukaryota T 5ov3 2 C C RBBP5_MOUSE RBBP-5 EPKQTG 6 T 0.0099 DUF2457 unppercent F Eukaryota T 5ow5 3 E,F E,F CAMP3_MOUSE Calmodulin-regulated spectrin-associated protein IEEALQIIHS 10 T 3.2 BsuBI_PstI_RE_N pdbhh F Eukaryota T 5owo 1 A,B,C,D A,B,C,D DYHC1_HUMAN CYTOPLASMIC DYNEIN HEAVY CHAIN 1,DYNEIN HEAVY CHAIN,CYTOSOLIC MSEPGGGGGEDGSAGLEVSAVQNVADVSVLQKHLRKLVPLLLEDGGEAPAALEAALEEKSALEQMRKFLSDPQVHTVLVERSTLKEDVGDEGEEEKEFISYNINIDIHYGVKSNSLAFIKRTPVIDADKPVSSQLRVLTLSEDSPYETLHSFISNAVAPFFKSYIRESGKADRDGDKMAPSVEKKIAELEMGLLHLQQNIE 201 T 0.13 DUF4042 pdbpercent F Eukaryota T 5owp 2 B D 5,6-DIHYDRO-BENZO[H]CINNOLIN-3-YLAMINE SAXDTRPA 8 T 32 PNPase_C pdbhh F T 5oxe 1 A A D4QF72_APBV1 Major virion protein MAPKATLVKKFKGLAVGVGALLAAPPIMGLASYAVNGISSYLSITINSTTYDFAPLAQAVMVFGGIGLVAYGLHRILGRGL 81 T 0.3 MSP1b pdb T Viruses T 5oxw 1 A,B,C,D A,B,C,D Q74N74_NANEQ NEQ068 SIMDTEIEVIENGIKKKEKLSDLFNKYYAGFQIGEKHYAFPPDLYVYDGERWVKVYSIIKHETETDLYEINGITLSANHLVLSKGNWVKAKEYENKNN 98 T 3.6E-44 DNA_pol_B unp F Archaea T 5oxw 2 E,F,G,H E,F,G,H ALA-SER-GLY-SER-PHE-LYS-VAL-ILE-TYR-GLY-ASP ASGSFKVIYGD 11 T 5.2 DapH_N pdbhh F T 5qsm 1 A,B A,B STAG1_HUMAN SCC3 HOMOLOG 1,STROMAL ANTIGEN 1 SMSPNGNLIRMLVLFFLESELHEHAAYLVDSLWESSQELLKDWECMTELLLEEPVQGEEAMSDRQESALIELMVCTIRQAAEAHPPVGRGTGKRVLTAKERKTQIDDRNKLTEHFIITLPMLLSKYSADAEKVANLLQIPQYFDLEIYSTGRMEKHLDALLKQIKFVVEKHVESDVLEACSKTYSILCSEEYTIQNRVDIARSQLIDEFVDRFNHSVEDLLQEGEEADDDDIYNVLSTLKRLTSFHNAHDLTKWDLFGNCYRLLKTGIEHGAMPEQIVVQALQCSHYSILWQLVKITDGSPSKEDLLVLRKTVKSFLAVCQQCLSNVNTPVKEQAFMLLCDLLMIFSHQLMTGGREGLQPLVFNPDTGLQSELLSFVMDHVFIDQDEENQSMEGDEEDEANKIEALHKRRNLLAAFSKLIIYDIVDMHAAADIFKHYMKYYNDYGDIIKETLSKTRQID 459 T 0.0089 Cnd3 pdb F Eukaryota T 5sbg 1 A C METP, miniaturized rubredoxin XYCSDCGADXSQVRGGYCTNCGASXDRIRX 30 T 0.0005 OrfB_Zn_ribbon pdbpssm F T 5suj 1 A,B A,B Q5ZTL3_LEGPH Uncharacterized protein MIVRGINMTKIKLESPGFMVHKKLKSMSQSYGVMMTGVPAEVLGQMQAERSIPSINKTGNLKQQIAKEVSKVCHMMTEPTQSCGQASNDVCELLLGKIEAEKFHFTKYEALSADGDNLKNVLENTAPSSTNLLIRFEIDREDPPIVLVKTKNENFNPETAVKNKIYLLENKLYFIDKMGNLFNLGPGKKKCTQLFNAIGDSAEYSLCDPFVLEEPEKPEDFAISEIVDIFNEQKERFDFWIGSHSFTIYIPQTLGESPRQFYPYQAYFGSHTLQDWFVSDKDEYLSRIGIDKYIEKLAVLGKTTNTKERSDIYAEFFSKRGREAFFCAHLNEKRQPLRVKFKITEINPELALKNLQETQEFIDTHPGENPSDKVENYRNRAKLAMTEHLESLLDIKPESS 400 T 0.023 AgrD pdb F Bacteria T 5sur 1 A,B,C,D A,B,C,D 16mer A-beta peptide: ORN-CYS-VAL-PHE-MEA-CYS-GLU-ASP-ORN-ALA-ILE-ILE-GLY-LEU-ORN-VAL XCVFXCEDXAIIGLXV 16 T 0.079 Beta-APP pdbhh F T 5sut 1 A,B A,B 16mer A-beta peptide: ORN-CYS-VAL-PHE-PHE-CYS-GLU-ASP-ORN-ALA-ILE-ILE-SAR-LEU-ORN-VAL XCVFFCEDXAIIXLXV 16 T 0.079 Beta-APP pdbhh F T 5sve 3 C C NFAC1_HUMAN NFATc1 LxVP peptide DDQYLAVPQHPYQWAKPK 18 T 0.13 IucA_IucC pdb F Eukaryota T 5sw9 2 B B CDCA2_HUMAN RepoMan RDIASKKPLLSPIPELPEVPE 21 T 7.1 Fapy_DNA_glyco pdbhh F Eukaryota T 5swf 2 B B BUB1B_HUMAN Double phosphorylated BubR1 KLSPIIEDS 9 T 6.6 TBCC pdbhh F Eukaryota T 5sxm 2 C,D D,C ACE-ALA-ARG-THR-GLU-VAL-TYR-NH2 XARTEVYX 8 T 14 TcdB_toxin_midC pdbhh F T 5sxp 2 E,F F,G ITCH_HUMAN ITCH,ATROPHIN-1-INTERACTING PROTEIN 4,AIP4,NFE2-ASSOCIATED POLYPEPTIDE 1,NAPP1 GSGGGKPSRPPRPSRPPPPTPRRPASY 27 T 3.7 UPF0449 pdbhh F Eukaryota T 5syq 1 A A Y1974_AQUAE Uncharacterized protein aq_1974 GSEEKEEKKVRELTPQELELFKRAMGITPHNYWQWASRTNNFKLLTDGEWVWVEGYEEHIGKQLPLNQARAWSWEFIKNRLKELNL 86 T 0.042 DUF3621 unp F Bacteria T 5szx 3 C,D A,B BZLF1_EBVB9 EB1,ZEBRA LEIKRYKNRVASRKSRAKFKQLLQHYREVAAAKSSENDRLRLLLKQMCPSLDVDSIIPRTPD 62 T 0.0011 bZIP_2 pdb T Viruses T 5t0f 2 B B TIF9_ARATH JASMONATE ZIM DOMAIN-CONTAINING PROTEIN 10,PROTEIN JASMONATE-ASSOCIATED 1,PROTEIN JAZ10 KQTNNAPKPKFQKFLDRRRSFRDIQGAISKIDPEIIKSLLAST 43 T 2.8 DUF3666 pdbhh F Eukaryota T 5t1k 4 E,F E,F CQFDA(PH)2STRRLKC PEPTIDE CQFDXSTRRLKC 12 T 7.2 YsaB pdbhh F T 5t1l 4 E,F E,F CYCLIC MEDITOPE CQA(Ph)2DLSTRRLKC CQXDLSTRRLKC 12 T 3.3 DUF2089 pdbhh F T 5t1m 3 E,F,G E,F,H CYCLIC PEPTIDE CQYDLSTRRLKC CQYDLSTRRLKC 12 T 7.6 DUF1254 pdbhh F T 5t2s 2 B,D B,D CDC7_YEAST ASP-GLY-GLU-SER-TPO-ASP-GLU-ASP-ASP DGESTDEDDVVS 12 T 2.2 CRF1 pdbhh F Eukaryota T 5t47 2 B,D B,D O61380_DROME EUKARYOTIC TRANSLATION INITIATION FACTOR 4G,ISOFORM C,FI02056P,TRANSLATION INITIATION FACTOR EIF4G GPHMSIINYNEGQWSPNNPSGKKQYDREQLLQLREVKASRIQPEVKNVSILPQPNLMPSFIRNN 64 T 0.00012 eIF_4G1 pdbhh F Eukaryota T 5t56 1 A,C A,C MCJA_ECOLX MCCJ25 GGAGHVPEYFVR 12 T 0.13 Endonuc-BglII unp F Bacteria T 5t56 2 B,D B,D MCJA_ECOLX MCCJ25 CGTPISFYC 9 T 0.24 KSHV_K1 pdbhh F Bacteria T 5t5o 2 B,D,F,H,J,L,N,P,R,T a,b,c,d,e,f,g,h,i,j TN-peptide ACE-GLY-VAL-THR-SER-ALA XGVTSA 6 T 180 GIDA_C pdbhh F T 5t6y 3 C C THADA_HUMAN Decapeptide: THR-SER-THR-PHE-GLU-ASP-VAL-LYS-ILE-LEU-ALA-PHE TSTFEDVKILAF 12 T 7.3 Peroxin-3 pdbhh F Eukaryota T 5t6z 3 C C POL_HV1B1 Decapeptide: THR-SER-THR-LEU-GLN-GLU-GLN-ILE-GLY-TRP TSTLQEQIGW 10 T 3.2 Red1 pdbhh T Viruses T 5t70 4 D C POL_HV1B1 Decapeptide: THR-SER-ASN-LEU-GLN-GLU-GLN-ILE-GLY-TRP TSNLQEQIGW 10 T 3.3 Red1 pdbhh T Viruses T 5t7a 1 A,B A,B Q9KG76_BACHD BH0236 protein MGSSHHHHHHSSGLVPRGSHMASQGNGDSHTHPDYTAGIRGITGNEVTIFFAPTTEARYVDVHLKVNNGQQLNYRMTERNGEWERVVENLSSGDVLEYSFTYEKLGPQYTTEWFTYSR 118 T 0.0019 CBM_48 pdb F Bacteria T 5t7q 1 A A TIRAP_HUMAN TIR DOMAIN-CONTAINING ADAPTER PROTEIN,ADAPTOR PROTEIN WYATT,MYD88 ADAPTER-LIKE PROTEIN,MYD88-2 KKPLGKMADWFRQTLLKKPKK 21 T 1 Hfx_Cass5 pdbhh F Eukaryota T 5t86 1 A A Q1RPM1_ECOLX CdiA toxin IEQILKPEKNWETARNKALDLVGNLGADSKPVIGRLEVSAGNGKVIGRQSSDGKVGWRVDYDPEKGTHINIWDYSQGKGPGKAVKQVIPFEGNEKSFETILKQLNR 106 T 25 DUF1818 pdbhh F Bacteria T 5t86 2 B I A0A0B0W5A7_ECOLX CdiI immunity protein MTLFDECREALSADFNIVEGLAQQEALGILNKYPLAKGSVTWSEIRHSDYESFDELLSANSVKNDDMFVFADDASIPVFRSNLRLIAENIYDVTALSPKLFIFNDEVIIQPLFPTDMFRLGIKKHHHHHH 130 T 0.0023 DUF2947 unphh F Bacteria T 5t87 1 A,B,C,D A,B,C,D B3R1C2_CUPTR CdiI immunity protein MTMRYQEPARIPNAEIDHVLASGNPEAIADACLSIAYYEDDWEWAFKRLKSVAFDLNRPDSLRSLAVTCVGHLARRIHDLDVAMAEEFLLSLGGDQAVASAASDALDDLRIFRMSD 116 T 0.0029 HEAT_2 pdbpercent F Bacteria T 5t87 2 E,F,G,H E,F,G,H B3R1C1_CUPTR CdiA toxin SRGPSNGQSVLENSVQVKETSPRRVSVDPQTGEFVVFDRTLGDVYHGHVRAWKDLTSDMQNALVRGGYVDRKGNPK 76 T 0.02 DUF3945 pdbhh F Bacteria T 5tce 1 A A PYRD_HUMAN DHODEHASE,DIHYDROOROTATE OXIDASE GDERFYAEHLMPTLQGLLDPESAHRLAVRFTSLGX 35 T 1.1 DUF2240 pdbhh F Eukaryota T 5teg 2 C,D D,E H4_HUMAN Histone H4 mutant peptide with H4K20norleucine KRHRXVLR 8 T 0.27 UPF0137 unp F Eukaryota T 5tfp 1 A,B A,B SETB2_HUMAN CHRONIC LYMPHOCYTIC LEUKEMIA DELETION REGION GENE 8 PROTEIN,LYSINE N-METHYLTRANSFERASE 1F,SET DOMAIN BIFURCATED 2 MGEKNGDAKTFWMELEDDGKVDFIFEQVQNVLQSLKQKIKDGSATNKEYIQAMILVNEATIINS 64 T 0.01 Pectinesterase pdb F Eukaryota T 5tgq 1 A A A0A1S4NYF7_STAWA R.SwaI protein MNFKKYEENLVASIEEVIQRIIDDKHRPNIIGKTRVGAEVSDYLEDEFVKYISSGKSSSLYDAQGAPKEKTKNPWDARCKFKFMDREEEIWIDFKAFKITNMDSNPDIGTPNKIVKFIHEGNFYLVFVLVYYESKQDGVEFVKYNNDYKKVYLLKDVNESFRINPKPQMQVNIAAEPTYRTREEFIHFFVKKWKESFERQIKSLEKKEIMLKDLEDKLKNSNDNSI 226 T 0.013 CK2S unppercent F Bacteria T 5th2 3 E,F E,F L5Q meditope CQFDQSTRRLKC 12 T 7.1 PriA_CRR pdbhh F T 5tj1 1 A A V4RMX4_9CAUL Benenodin-1 GVGFGRPDSILTQEQAKPM 19 T 0.15 DUF5974 unphh F Bacteria T 5tja 1 A A MCLN1_HUMAN MG-2,MUCOLIPIDIN GSGLSNQLAVTFREENTIAFRHLFLLGYSDGADDTFAAYTREQLYQAIFHAVDQYLALPDVSLGRYAYVRGGGDPWTNGSGLALCQRYYHRGHVDPANDTFDIDPMVVTDCIQVDPPERPPPPPSDDLTLLESSSSYKNLTLKFHKLVNVTIHFRLKTINLQSLINNEIPDCYTFSVLITFDNKAHSGRIPISLETQAHIQECKHPSVFQHGDNSLEHHHHHH 223 T 0.08 DUF1866 pdb F Eukaryota T 5tp6 2 B B NOS2_HUMAN HEPATOCYTE NOS,HEP-NOS,INDUCIBLE NO SYNTHASE,INOS,NOS TYPE II,PEPTIDYL-CYSTEINE S-NITROSYLASE NOS2 AGHMRPKRREIPLKVLVKAVLFACMLMRK 29 T 1.2 DUF488 unppercent F Eukaryota T 5tq1 2 B B INSR_RAT Insulin receptor PSSVXVPDEWE 11 T 9.2 RHH_1 pdbhh F Eukaryota T 5tqs 2 E,F,G,H E,F,H,G ERBB2_HUMAN Receptor protein-tyrosine kinase DNLYXWDQDPP 11 T 0.65 DUF2093 pdbhh F Eukaryota T 5tsc 1 A,B A,B Q5ZTL4_LEGPH Uncharacterized protein GMTTSKLEKTGLHVHEKIKHMVKNYGTMITGIPAEILGQNEAEISVGYVKKMGNMKENIAEVVRKSEMTQPTNSAGKASNEVCDLLLGTEGASEFEKSSYQVLSGDGSNLKGSLPNKNLLVRVEMDRFNAPQKYQKIKREEFNPETAEKNKIYLLEDQLVYLDIFGKVIDLGQTSDTCHRLFNAITTPFYQNYILYDEYIDPEESAEEAAMFEMGEIVKAKMKNIDCWTATHSFTIFVPESDSEDTRTLYPYQAYWTSHTLQQWFSGDKDEKLSRLGIDGYIEKLALLGTTTDSKIRSSIYGELFSPPGKEHVFCTGMNEKFSPLRVKFKVTEVNPEIALQNLEEVQEFIDTNYPGENAKDQCELYKIKAQEAMTKQLEMRLLIEPTHRKESVTPYKEKNTQSFFSKNLDTTSRIDKMISSVITMENLKILAKEADTLSGKDREKLVEYFKTLPSEQLAEIKA 463 T 0.035 V-ATPase_H_N unppercent F Bacteria T 5tvz 1 A A PO152_YEAST NUCLEAR PORE PROTEIN POM152,P150,PORE MEMBRANE PROTEIN POM152 MSLRVKPSASLKLHHDLKLCLGDHSSVPVALKGQGPFTLTYDIIETFSSKRKTFEIKEIKTNEYVIKTPVFTTGGDYILSLVSIKDSTGCVVGLSQPDAKIQVRRDEGHHHHHH 114 T 0.0071 PKD_4 pdbpssm F Eukaryota T 5twg 2 B E STK4_HUMAN T353 peptide VASTMTDGANTMIEP 15 T 13 BBS2_N pdbhh F Eukaryota T 5twh 2 B E STK4_HUMAN T367 peptide DDTLPSQLGTMVINAED 17 T 1.7 OspE pdbhh F Eukaryota T 5two 2 B B PRGC1_HUMAN PRO-SER-LEU-LEU-LYS-LYS-LEU-LEU-LEU-ALA-PRO AEEPSLLKKLLLAPA 15 T 5.4 DUF1467 pdbhh F Eukaryota T 5tx1 3 N O A4ZKM1_9ADEN Fiber AKRLRVEDDFNPVYPYGYA 19 T 0.48 DUF5449 pdbhh T Viruses T 5tx8 1 A A HH2 AEDCERIRKELEKNPNDEIKKKLEKCQA 28 T 2.4 UPF0228 pdbhh F T 5txh 1 A,B,C,D A,B,C,D IFAEDV IFAEDV 6 T 7.9 EBV-NA1 pdbhh F T 5txs 3 C C anapestic lymphoma kinase-derived neuroblastoma tumor antigen AQDIYRASY 9 T 0.21 ChaB pdbhh F T 5tyi 2 E,F,G,H L,M,N,P Peptide inhibitor KFEGXDNEX 9 T 33 DUF5840 pdbhh F T 5tzs 41 QA i BMS1_YEAST Bms1,Ribosome biogenesis protein BMS1,Bms1 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXWNIGKLIYMDNISPEECIRRWRGEDDDSKDESDIEEDVDDDFFRKKDGTVTKEGNKDHAVDLEKFVPYFDTFEKLAKKWKSVDAIKERFLXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 511 T 0.12 Inhibitor_I67 pdbpssm F Eukaryota T 5u1m 2 B B INSR_HUMAN IR XLYASSNPAX 10 T 6.8 MPS-4 pdbhh F Eukaryota T 5u1q 2 E,F,G,H L,M,N,P LYS-PHE-GLU-GLY-TYR-ASP-ASN-GLU-CST KFEGYDNEX 9 T 4.1 Crr6 pdbhh F T 5u30 1 A A C2C1_ALIAG AACC2C1 SMAVKSIKVKLRLDDMPEIRAGLWKLHKEVNAGVRYYTEWLSLLRQENLYRRSPNGDGEQECDKTAEECKAELLERLRARQVENGHRGPAGSDDELLQLARQLYELLVPQAIGAKGDAQQIARKFLSPLADKDAVGGLGIAKAGNKPRWVRMREAGEPGWEEEKEKAETRKSADRTADVLRALADFGLKPLMRVYTDSEMSSVEWKPLRKGQAVRTWDRDMFQQAIERMMSWESWNQRVGQEYAKLVEQKNRFEQKNFVGQEHLVHLVNQLQQDMKEASPGLESKEQTAHYVTGRALRGSDKVFEKWGKLAPDAPFDLYDAEIKNVQRRNTRRFGSHDLFAKLAEPEYQALWREDASFLTRYAVYNSILRKLNHAKMFATFTLPDATAHPIWTRFDKLGGNLHQYTFLFNEFGERRHAIRFHKLLKVENGVAREVDDVTVPISMSEQLDNLLPRDPNEPIALYFRDYGAEQHFTGEFGGAKIQCRRDQLAHMHRRRGARDVYLNVSVRVQSQSEARGERRPPYAAVFRLVGDNHRAFVHFDKLSDYLAEHPDDGKLGSEGLLSGLRVMSVALGLRTSASISVFRVARKDELKPNSKGRVPFFFPIKGNDNLVAVHERSQLLKLPGETESKDLRAIREERQRTLRQLRTQLAYLRLLVRCGSEDVGRRERSWAKLIEQPVDAANHMTPDWREAFENELQKLKSLHGICSDKEWMDAVYESVRRVWRHMGKQVRDWRKDVRSGERPKIRGYAKDVVGGNSIEQIEYLERQYKFLKSWSFFGKVSGQVIRAEKGSRFAITLREHIDHAKEDRLKKLADRIIMEALGYVYALDERGKGKWVAKYPPCQLILLAELSEYQFNNDRPPSENNQLMQWSHRGVFQELINQAQVHDLLVGTMYAAFSSRFDARTGAPGIRCRRVPARCTQEHNPEPFPWWLNKFVVEHTLDACPLRADDLIPTGEGEIFVSPFSAEEGDFHQIHAALNAAQNLQQRLWSDFDISQIRLRCDWGEVDGELVLIPRLTGKRTADSYSNKVFYTNTGVTYYERERGKKRRKVFAQEKLSEEEAELLVEADEAREKSVVLMRDPSGIINRGNWTRQKEFWSMVNQRIEGYLVKQIRSRVPLQDSACENTGDI 1130 T 0.0038 RuvC_1 unphh F Bacteria T 5u34 1 A A C2C1_ALIAG AACC2C1 SMAVKSIKVKLRLDDMPEIRAGLWKLHKEVNAGVRYYTEWLSLLRQENLYRRSPNGDGEQECDKTAEECKAELLERLRARQVENGHRGPAGSDDELLQLARQLYELLVPQAIGAKGDAQQIARKFLSPLADKDAVGGLGIAKAGNKPRWVRMREAGEPGWEEEKEKAETRKSADRTADVLRALADFGLKPLMRVYTDSEMSSVEWKPLRKGQAVRTWDRDMFQQAIERMMSWESWNQRVGQEYAKLVEQKNRFEQKNFVGQEHLVHLVNQLQQDMKEASPGLESKEQTAHYVTGRALRGSDKVFEKWGKLAPDAPFDLYDAEIKNVQRRNTRRFGSHDLFAKLAEPEYQALWREDASFLTRYAVYNSILRKLNHAKMFATFTLPDATAHPIWTRFDKLGGNLHQYTFLFNEFGERRHAIRFHKLLKVENGVAREVDDVTVPISMSEQLDNLLPRDPNEPIALYFRDYGAEQHFTGEFGGAKIQCRRDQLAHMHRRRGARDVYLNVSVRVQSQSEARGERRPPYAAVFRLVGDNHRAFVHFDKLSDYLAEHPDDGKLGSEGLLSGLRVMSVDLGLRTSASISVFRVARKDELKPNSKGRVPFFFPIKGNDNLVAVHERSQLLKLPGETESKDLRAIREERQRTLRQLRTQLAYLRLLVRCGSEDVGRRERSWAKLIEQPVDAANHMTPDWREAFENELQKLKSLHGICSDKEWMDAVYESVRRVWRHMGKQVRDWRKDVRSGERPKIRGYAKDVVGGNSIEQIEYLERQYKFLKSWSFFGKVSGQVIRAEKGSRFAITLREHIDHAKEDRLKKLADRIIMEALGYVYALDERGKGKWVAKYPPCQLILLAELSEYQFNNDRPPSENNQLMQWSHRGVFQELINQAQVHDLLVGTMYAAFSSRFDARTGAPGIRCRRVPARCTQEHNPEPFPWWLNKFVVEHTLDACPLRADDLIPTGEGEIFVSPFSAEEGDFHQIHADLNAAQNLQQRLWSDFDISQIRLRCDWGEVDGELVLIPRLTGKRTADSYSNKVFYTNTGVTYYERERGKKRRKVFAQEKLSEEEAELLVEADEAREKSVVLMRDPSGIINRGNWTRQKEFWSMVNQRIEGYLVKQIRSRVPLQDSACENTGDI 1130 T 0.0038 RuvC_1 unphh F Bacteria T 5u4k 2 B B TF65_HUMAN NUCLEAR FACTOR NF-KAPPA-B P65 SUBUNIT,NUCLEAR FACTOR OF KAPPA LIGHT POLYPEPTIDE GENE ENHANCER IN B-CELLS 3 GSPGYPNGLLSGDEDFSSIADMDFSALLSQISS 33 T 17 Orthopox_F14 pdbhh F Eukaryota T 5u4w 3 G,I,K G,I,K A0A024B7W1_ZIKV Protein E GALNSLGKGIHQIFGAAFKSLFGGMSWFSQILIGTLLMWLGLNTKNGSISLMCLALGGVLIFLSTA 66 T 0.33 COPI_assoc pdb T Viruses T 5u5c 1 A,B,C,D,E,F A,B,C,D,E,F Designed tetrameric coiled coil peptide with one terpyridine side chain XELAAIKEELAAIKXELAAIKQELAAIKQX 30 T 0.00064 DUF5320 pdbhh F T 5u5f 4 D D 5-DIPHENYL LONG MEDITOPE XCQFDXSTRRLRCGGSK 17 T 2.5 Flavi_NS1 pdbhh F T 5u5m 5 E D AZIDO-PEG4-MEDITOPE XCQFDXSTXRLRC 13 T 4.7 DUF6464 pdbhh F T 5u5p 1 A,B C,B MLH1_HUMAN MUTL PROTEIN HOMOLOG 1 SNPRKRHRED 10 T 4.8 T_cell_tran_alt pdbhh F Eukaryota T 5u5r 2 B B PMS2_HUMAN DNA MISMATCH REPAIR PROTEIN PMS2,PMS1 PROTEIN HOMOLOG 2 TPNTKRFKKEE 11 T 6.3 Nbs1_C pdbhh F Eukaryota T 5u66 1 A B STAPLED PEPTIDE FROM DOMAIN B OF PROTEIN A XFNMXQQRRFYXALH 15 T 2.1 B pdbhh F T 5u6a 5 E D meditope peptide XCQFDXSTXRLRCG 14 T 3.4 zinc_ribbon_12 pdbhh F T 5u75 1 A A G0Z026_STAAU Enterotoxin-like toxin X STQNSSSVQDKQLQKVEEVPNNSEKALVKKLYDRYSKDTINGKSNKSRNWVYSERPLNENQVRIHLEGTYTVAGRVYTPKRNITLNKEVVTLKELDHIIRFAHISYGLYMGEHLPKGNIVINTKNGGKYTLESHKELQKNRENVEINTDDIKNVTFELVKSVNDIEQV 168 T 0.00015 Stap_Strp_tox_C unppercent F Bacteria T 5u96 1 A,B,C,D,E,F,G,H A,B,C,D,E,F,G,H Q928V6_LISIN Putative integrase ASLNEKLKIEHAKKKRLFDLYINGSYEVSELDSMMNDIDAQINYYEAQIEAN 52 T 0.00093 AAA_23 unppercent F Bacteria T 5u98 3 C,F C,F SPT5H_HUMAN VAL-THR-THR-ASP-ILE-GLN-VAL-LYS-VAL VTTDIQVKV 9 T 1.9 DUF460 pdbhh F Eukaryota T 5uae 1 A,B,C,D A,B,C,D Q928V6_LISIN Putative integrase KEDELDSLNEKLKIEHAKKKRLFDLYINGSYEVSELDSMMNDIDAQINYYEAQIEANEELKK 62 T 0.00093 AAA_23 unppercent F Bacteria T 5ud5 1 A,B A,B PYLS_METMA PYRROLYSINE--TRNA(PYL) LIGASE,PYRROLYSYL-TRNA SYNTHETASE,PYLRS MGHHHHHHMDKKPLNTLISATGLWMSRTGTIHKIKHHEVSRSKIYIEMACGDHLVVNNSRSSRTARALRHHKYRKTCKRCRVSDEDLNKFLTKANEDQTSVKVKVVSAP 109 T 0.017 Zn_ribbon_recom pdbpercent F Archaea T 5uf5 1 A,B A,B Q5ZWW6_LEGPH effector protein SidK SNAEQYHSQVVGKIGYIARCMQTIDPENNLKKIREDYQDVLIWAEKNYRFEEILEASKSGKCPNDLDALSRRSLILQELLRLVSSISPFKMKLDLIESQYEKMKQHVNLWKSDYHVKLNQLNQLTDYLKNAAPTPKNNFLRAMTSVLQMQIAQYGITEDNEGINQLFKLGLHLLAMANEKIDEQYHLFKGYVKDQPEESPFEGILPAEDQKILVKTMIDYAMPKLSSKVLQDKLSALSSSDVLTKTLLDSIDRIVKENEKLNALSK 266 T 1 DUF3276 unppssm F Bacteria T 5ufs 2 C,D C,D NR0B2_HUMAN SHP NR Box 1 Peptide APAILYALLSS 11 T 8.5 NR_Repeat pdbhh F Eukaryota T 5uhr 1 A,B,C,D A,B,C,D ORN-CIR-LEU-ALA-ASN-PHE-LEU-VAL-ORN-ILE-LYS-HAO-LYS-A8E XXLANFLVXIKXKX 14 T 9.3 PsaX pdbhh F T 5ui6 1 A A Acinetodin GGKGPIFETWVTEGNYYG 18 T 2.2 YqeC pdbhh F T 5ui7 1 A A A0A1S4NYG0_KLEPN Klebsidin GSDGPIIEFFNPNGVMHYG 19 T 0.83 tRNA-synt_1c pdbhh F Bacteria T 5ujr 1 A A Q46313_CARML Bacteriocin WGWKEVVQNGQTIFSAGQKLGNMVGKIVPLPFG 33 T 0.048 Bacteriocin_IIc unp F Bacteria T 5ujt 3 C,F,I C,F,I insulin mimotope GVEELYLVAGEEGCGG 16 T 1.8 NTPase_1 pdbhh F T 5ukh 1 A A T1ZG69_STRIT Uncharacterized protein MGSSHHHHHHSQDPSDLSWSKRLSAYAALKDLTLSKQDKVFLEHLMTEYGFDSTTARQILKLKQGLERKFSSIFDDYTQEERDYLLFRIIGSVSYNGVKWDETAGYLSRYFYKEVVSNPVTGEKQKVPKSLLDIFQELGLSKAEAKQLQYNLSLQHEMAGGTLSTTGDMVKQDPDYYETAKNSYKLVYGTTEGFDKFWDERLKAYSNDGRGNADFTHQSITMATHLNPTSVQLSDIYGGRKHVKNLAGWEGDTTYNANERKPSIGEDDYKADLDSVNIIGRMKKGQSYQSAMSSYYSDVQKGHSVREKEFLKNKDWEKVKKTIYDSLVPNGINKNADSVVKDYIAKNYPDVSKFLSRLESVAGGQ 365 T 0.021 Seryl_tRNA_N unppercent F Bacteria T 5ulo 2 C,D C,D TBCD7_HUMAN TBC1 domain family member 7 XESGKLPRSPSFPX 14 F F Eukaryota T 5uml 2 B,D,F,H C,D,F,H PEPTIDE INHIBITOR M3 LTFLEYWAQLMQ 12 T 2.6 ParD_like pdbhh F T 5uoi 1 A A HHH_rd1_0142 RKWEEIAERLREEFNINPEEAREAVEKAGGNEEEARRIVKKRL 43 T 0.00066 DUF3606 pdbhh F T 5up1 1 A A EEHEE_rd3_1049 MGSSHHHHHHSSGLVPRGSHMTTVKLGDIKVTFDNPEKAKKYAQKLAKIYQLTVHVHGDTIHVK 64 T 0.46 DUF2188 pdbhh F T 5up5 1 A A EHEE_rd1_0284 TQTQEFDNEEEARKAEKELRKENRRVTVTQENGRWRVTWD 40 T 0.0015 SPOR pdb F T 5uqd 1 A A DPY21_CAEEL DumPY: shorter than wild-type MKSSWSHPQFEKGAMTGWSHPQFEKENLYFQSNATMRITNRNLKMLTRQFDLPKMSSRFRKFVRIRRHPNGMATIISCDYNQIKQHLGPNEMKHFERQFVRLGFAENNGVPLFAIGVMENAAEALHDQFEWLAKNSPNTQVKVGSLTNKQFIETMPMKKYYESAMETLDMGTFRFGPLMSLSMVGTKNEEAGGNFKEMLDALNAAPFLGPIMPWGDFSEVQGIKEDTSDDGPIFWVRPGEQMVPTDGKNRSTEPRHPLATRGNDRRETAFNDRTNAHADQVRESTEDDPTTTTTTTTTTSSSSSSSKSKKSAKSDPTFVKSTAAVGVLQGIRNPDANDDDEYYEDERKAVKEVIVFDAHDLHKVAHHLAMDLYEPPVSQCHRWVDDAILNTMRREGIRYAKLELHENDMYFLPRNVIHQFRTVSACSSVAWHVRLRHYYDVD 442 T 0.017 Cupin_8 unphh F Eukaryota T 5utf 1 A G Q2N0S6_9HIV1 Envelope glycoprotein gp120 AENLWVTVYYGVPVWKDAETTLFCASDAKAYETEKHNVWATHACVPTDPNPQEIHLENVTEEFNMWKNNMVEQMHTDIISLWDQSLKPCVKLTPLCVTLQCTNVTNNITDDMRGEMKNCSFNMTTELRDKKQKVYSLFWRLDVVQINENQGNRSNNSNKEYRLINCNTSACTQACPKVSFEPIPIHYCAPAGFAILKCKDKKFNGTGPCPSVSTVQCTHGIKPVVSTQLLLNGSLAEEEVMIRSENITNNAKNILVQFNTPVQINCTRPMNMTRKSIRIGPGQAFYALGDIIGDIRQAHCNVSKATWNETLGKVVKQLRKHFGNNTIIRFANSSGGDLEVTTHSFNCGGEFFYCNTSGLFNSTWISNTSVQGSNSTGSNDSITLPCRMKQIINMWQRIGQCMYAPPIQGVIRCVSNITGLILTRDGGSTNSTTETFRPGGGDMRDNWRSELYKYKVVKIEPLGVAPTRCKRRVVGRRRRRR 481 T 3.5E-54 GP120 pdbpercent T Viruses T 5uty 4 D G Q2N0S6_9HIV1 ENVELOPE GLYCOPROTEIN GP160 MPMGSLQPLATLYLLGMLVASVLAAENLWVTVYYGVPVWKDAETTLFCASDAKAYETEKHNVWATHACVPTDPNPQEIHLENVTEEFNMWKNNMVEQMHTDIISLWDQSLKPCVKLTPLCVTLQCTNVTNNITDDMRGEMKNCSFNMTTELRDKKQKVYSLFYRLDVVQINENQGNRSNNSNKEYRLINCNTSACTQACPKVSFEPIPIHYCAPAGFAILKCKDKKFNGTGPCPSVSTVQCTHGIKPVVSTQLLLNGSLAEEEVMIRSENITNNAKNILVQFNTPVQINCTRPMNMTRKSIRIGPGQAFYALGDIIGDIRQAHCNVSKATWNETLGKVVKQLRKHFGNNTIIRFANSSGGDLEVTTHSFNCGGEFFYCNTSGLFNSTWISNTSVQGSNSTGSNDSITLPCRIKQIINMWQRIGQCMYAPPIQGVIRCVSNITGLILTRDGGSTNSTTETFRPGGGDMRDNWRSELYKYKVVKIEPLGVAPTRCKRRVVGRRRRRR 505 T 3.8E-54 GP120 pdbpercent T Viruses T 5uup 2 B B Bfl-1-specific selected peptide XGVREIAYGLRRAADDVNAQVERX 24 T 0.0029 PUMA pdbhh F T 5uw3 2 B,D,F,H E,F,G,H F6LNL5_9CARY Presegetalin A1 GVPVWAFQAKDVENASAPV 19 T 10 Choline_sulf_C unphh F Eukaryota T 5uwh 4 D D PAXI_HUMAN Paxillin GGSYRELDELMASLSDFKFMAQ 22 T 0.86 KNOX2 pdbhh F Eukaryota T 5uwi 4 D D HDAC5_HUMAN Histone deacetylase 5 GGSYEAETVSAMALLSVG 18 T 11 GPHR_N pdbhh F Eukaryota T 5uwp 4 D D DIAP3_HUMAN Protein diaphanous homolog 3 GGSYSVPEVEALLARLRAL 19 T 0.98 DUF1128 pdbhh F Eukaryota T 5uws 4 D D APBA3_HUMAN Amyloid beta A4 precursor protein-binding family A member 3 GGSYSSLQELVQQFEALPGDLV 22 T 0.9 NikR_C pdbhh F Eukaryota T 5uww 4 D D DEAF1_HUMAN Deformed epidermal autoregulatory factor 1 homolog GGSSWLYLEEMVNSLLNTAQQ 21 T 0.16 Latarcin unppssm F Eukaryota T 5uy9 2 B B Brd4 peptide QASTPRX 7 T 110 Matrix pdbhh F T 5uyo 1 A A HEEH_rd4_0097 MGSSHHHHHHSSGLVPRGSHMDVEEQIRRLEEVLKKNQPVTWNGTTYTDPNEIKKVIEELRKSM 64 T 0.13 DUF6466 pdb F T 5uz9 4 I,J I,J L7P7M1_9CAUD ACRF1 KFIKYLSTAHLNYMNIAVYENGSKIKARVENVVNGKSVGARDFDSTEQLESWFYGLPGSGLGRIENAMNEISRRENP 77 T 0.16 DUF4982 pdb T Viruses T 5uz9 5 K K ACR30_BPD31 GENE PRODUCT 30, GP30, ACRF2 MHHHHHHIAQQHKDTVAACEAAEAIAIAKDQVWDGEGYTKYTFDDNSVLIQSGTTQYAMDADDADSIKGYADWLDDEARSAEASEIERLLESVEEE 96 T 0.13 Transglycosylas unp T Viruses T 5uzl 1 A A K9LL63_BRANA O-acyltransferase NVDVRYTYRPSVPAHRRVRESPLSSDAIFKQSH 33 T 140 GUCT pdbhh F Eukaryota T 5uzu 1 A B Q2G0X2_STAA8 Uncharacterised protein GSTKVYSQNGLVLHDDANFLEHELSYIDVLLDKNADQATKDNLRSYFADKGLHSIKDIINKAKQDGFDVSK 71 T 0.0051 CompInhib_SCIN unphh F Bacteria T 5uzz 2 B B 14-mer Peptide RVRTRGKRRIRRXP 14 T 13 Defensin_int pdbhh F T 5v0y 1 A A A0A3F2YLM5_AREMA arenicin-3 GFCWYVCVYRNGVRVCYRRCN 21 T 1.4 PilI pdbhh F Eukaryota T 5v11 1 A A AA139 GFCWYVCARRNGARVCYRRCN 21 T 3.4 Toxin_25 pdbhh F T 5v1a 1 A B ULP2_YEAST Ubiquitin-like-specific protease 2 AEFTSPYFGRPSLKTRAKQFEGVSSP 26 T 12 LtuB pdbhh F Eukaryota T 5v1d 2 B,E,G E,F,G 12-mer peptide ADPQPWRFYAPR 12 T 0.33 TM1586_NiRdase pdbhh F T 5v1e 1 A A Guavanin 2 RQYMRQIEQALRYGYRISRRX 21 T 1.4 Tyrosinase pdbhh F T 5v1t 2 B B SuiA 22mer MSKELEKVLESSAMAKGDGWHV 22 T 4.1 DUF1952 pdbhh F T 5v1u 2 E,F,G,H E,F,G,H D1CIY7_THET1 TbiA(beta) Thr(-5)Glu Leader MTKTYTAPTLVEYGGLERLT 20 T 2.1E-05 DUF5972 pdbhh F Bacteria T 5v1v 2 C,D C,D D1CIZ1_THET1 TbiA(alpha) Leader Peptide MKEYRSPELKEYGRVEDRTAG 21 T 0.029 DUF5972 unphh F Bacteria T 5v1y 3 E,F E,F PSMD1_HUMAN 26S PROTEASOME REGULATORY SUBUNIT RPN2, 26S PROTEASOME REGULATORY SUBUNIT S1, 26S PROTEASOME SUBUNIT P112, RPN2 GPQEPEPPEPFEYIDD 16 T 29 AgrD pdbhh F Eukaryota T 5v1z 3 E,F F,E PSMD1_HUMAN 26S PROTEASOME REGULATORY SUBUNIT RPN2, 26S PROTEASOME REGULATORY SUBUNIT S1, 26S PROTEASOME SUBUNIT P112, RPN2 GPKIEEEEQEPEPPEPFEYIDD 22 T 51 Bacillus_PapR pdbhh F Eukaryota T 5v2g 1 A,B,C A,B,C 20-mer Peptide KNPEAEEITRCKKLLDDSSS 20 T 0.12 DUF3151 pdb F T 5v2p 2 B B CAC1C_HUMAN CALCIUM CHANNEL, L TYPE,ALPHA-1 POLYPEPTIDE, ISOFORM 1, CARDIAC MUSCLE, VOLTAGE-GATED CALCIUM CHANNEL SUBUNIT ALPHA CAV1.2 CSPLECDLKGYLDWITQAE 19 T 5.4 MIIP pdbhh F Eukaryota T 5v2q 2 B B CAC1C_HUMAN CALCIUM CHANNEL, L TYPE,ALPHA-1 POLYPEPTIDE, ISOFORM 1, CARDIAC MUSCLE, VOLTAGE-GATED CALCIUM CHANNEL SUBUNIT ALPHA CAV1.2 ASPLEEDLCGYLCWITQAE 19 T 4.5 3-PAP pdbhh F Eukaryota T 5v3n 2 B B H0GHZ9_SACCK;TOF2_YEAST Ulp2p,Topoisomerase 1-associated factor 2 chimera SNAPYFGRPSLKTRAKQFEGVSSKDIGENCRRIEAFSD 38 T 1.6 DUF1499 pdbhh F Eukaryota T 5v4b 3 C C DISC1_HUMAN DISC1 peptide PEVPPTPPGSHSAFT 15 T 3.2 GSAP-16 pdbhh F Eukaryota T 5v4c 1 A A Q8I5P1_PLAF7 Peptide 38136 NVHTFRGINGHNSSSSL 17 T 0.0082 SseC unppercent F Eukaryota T 5v62 2 B I KS6A1_HUMAN S6K-ALPHA-1,90 KDA RIBOSOMAL PROTEIN S6 KINASE 1,P90S6K,MAP KINASE-ACTIVATED PROTEIN KINASE 1A,MAPKAPK-1A,RIBOSOMAL S6 KINASE 1,RSK-1 XGTAQLKPIESSILAQRRVRK 21 T 15 Microvir_lysis pdbhh F Eukaryota T 5v63 1 A A ORN-LYS-LEU-VAL-PHI-PHE-ALA-GLU-ORN-ALA-ILE-ILE-SAR-LEU-MET-VAL XKLVXFAEXAIIXLMVV 17 T 0.0033 Beta-APP pdbhh F T 5v64 1 A A ORN-GLN-LYS-LEU-VAL-PHI-PHE-ALA-ORN-ALA-ILE-ILE-SAR-LEU-MET-VAL XQKLVXFAXAIIXLMV 16 T 0.02 Beta-APP pdbhh F T 5v65 1 A,B,C,D,E,F,G,H,I,J,K,L,M,N,O,P A,B,C,D,E,F,G,H,I,J,K,L,M,N,O,P ORN-LEU-VAL-PHI-PHE-ALA-GLU-ASP-ORN-ALA-ILE-ILE-SAR-LEU-MET-VAL XLVXFAEDXAIIXLMV 16 T 0.00035 Beta-APP pdbhh F T 5v6e 1 A,C,E,G,I A,C,E,G,I GIPC1_MOUSE GAIP C-TERMINUS-INTERACTING PROTEIN,RGS-GAIP-INTERACTING PROTEIN,RGS19-INTERACTING PROTEIN 1,SEMAF CYTOPLASMIC DOMAIN-ASSOCIATED PROTEIN 1,SEMCAP-1,SYNECTIN GPHMSAFEEKAIEKVDDLLESYMGIRDTELAATMVELGKDKRNPDELAEALDERLGDFAFPDEFVFDVWGAIGDAKVGRY 80 T 0.00069 PWI pdbhh F Eukaryota T 5v6e 2 B,D,F,H,J B,D,F,H,J MYO6_MOUSE UNCONVENTIONAL MYOSIN-6 GPGSHDLSKWKYAELRDTINTSCDIELLAACREEFHRRLKVYHAWKSKN 49 T 0.12 Pox_MCEL pdbpssm F Eukaryota T 5v6h 1 A,C,E,G,I A,C,E,G,I GIPC2_MOUSE SEMAF CYTOPLASMIC DOMAIN-ASSOCIATED PROTEIN 2,SEMCAP-2 GPHMSEAKAKAIGKVDDLLELYMGIRDIDLATTMFEAGKDKSNPDEFAVALDETLGDFAFPDEFLFDVWGAISDMKQGR 79 T 0.0007 PWI pdbhh F Eukaryota T 5v6i 1 A,B A,B G3BK00_COPCM Y3 PROTEIN QDPLSCYDNFGNRDVAACARFIDDFCDTLTPNIYRPRDNGQRCYVVNGHKCDFTVFNTNNGGSPIRASTPNCKTVLRAAANRCPTGGRGKINPSAPFLFAIDPNDGDCSTDF 112 T 0.00034 Fungal_lectin_2 unphh F Eukaryota T 5v6x 1 A,B A,B PYLS_METMA PYRROLYSINE--TRNA(PYL) LIGASE,PYRROLYSYL-TRNA SYNTHETASE,PYLRS MGHHHHHHMNNKPLNTLISATGLWMSRTGTIHKIKHHEVSRSKIYIEMACGDHLVVNNSRSSRPARALRYHKYRKTCKRCRVSDEDLNKFLTKANEDQTSVKVKVVSAP 109 T 0.041 Zn_ribbon_recom pdbpercent F Archaea T 5v77 1 A,B A,B Q5F7U7_NEIG1 Uncharacterized protein MAHHHHHHMKKNIFHNVSLYEIIFSDNGNTLTLSFTDTIEGNYFGYIKCSNILNFKLDTNNFVDYEDKEDSLFPLFIPEIELYKYQFYSEIIIDVGIIIKISAETINFEPLGK 113 T 0.28 DUF6329 unppercent F Bacteria T 5v7j 1 A G Q2N0S6_9HIV1 Envelope glycoprotein gp160 ENLWVTVYYGVPVWKDAETTLFCASDAKAYETEKHNVWATHACVPTDPNPQEIHLENVTEEFNMWKNNMVEQMHTDIISLWDQSLKPCVKLTPLCVTLQCTNVTNNITDDMRGELKNCSFNMTTELRDKKQKVYSLFYRLDVVQINENQGNRSNNSNKEYRLINCNTAAITQACPKVSFEPIPIHYCAPAGFAILKCKDKKFNGTGPCPSVSTVQCTHGIKPVVSTQLLLNGSLAEEEVMIRSENIANNAKNILVQFNTPVQINCTRPNNNTRKSIRIGPGQAFYATGDIIGDIRQAHCNVSKATWNETLGKVVKQLRKHFGNNTIIRFANSAGGDLEVTTHSFNCGGEFFYCNTSGLFNSTWISNTSVQGSNSTGSNDSITLPCRIKQIINMWQRIGQAMYAPPIQGVIRCVSNITGLILTRDGGSTNSATETFRPGGGDMRDNWRSELYKYKVVKIEPLGVAPTRCKRRVVGRRRRRR 480 T 3.4E-54 GP120 pdbpercent T Viruses T 5v8k 2 B B B0TAT4_HELMI proteinsubunit pshX YSPTFNVAHILAFFFLFLHIPFYFV 25 T 5 DUF4834 unphh F Bacteria T 5v8w 1 A,C,E,G A,C,E,G INT9_HUMAN INT9,PROTEIN RELATED TO CPSF SUBUNITS OF 74 KDA,RC-74 MKPLLSGSIPVEQFVQTLEKHGFSDIKVEDTAKGHIVLLQEAETLIQIEEDSTHIICDNDEMLRVRLRDLVLKFLQKF 78 T 0.086 FAM167 pdb F Eukaryota T 5v8w 2 B,D,F,H B,D,F,H INT11_HUMAN INT11,CLEAVAGE AND POLYADENYLATION-SPECIFIC FACTOR 3-LIKE PROTEIN,CPSF3-LIKE PROTEIN,PROTEIN RELATED TO CPSF SUBUNITS OF 68 KDA,RC-68 GSHMRLVSSEQALKELGLAEHQLRFTCRVHLHDTRKEQETALRVYSHLKSVLKDHCVQHLPDGSVTVESVLLQAAAPSEDPGTKVLLVSWTYQDEELGSFLTSLLKKGLPQAPS 114 T 0.021 CPSF73-100_C unppssm F Eukaryota T 5va9 2 C,D C,D Peptide Inhibitor piHA-L5(d10Y) XYGHSHIRFGYSYHVSYCGX 20 T 4.5 ZinT pdbhh F T 5vaq 3 C C FGF21_HUMAN FGF-21 PDVGSSDPLSMVGGSQGRSPSYES 24 T 23 DUF1335 pdbhh F Eukaryota T 5vav 1 A A cyc-MC12 GRCTQAWPPICFPD 14 T 0.38 Bowman-Birk_leg pdbhh F T 5vb9 2 C,D C,D Peptide inhibitor CWVLEYDMFGALHCR 15 T 1.8 Cytochrom_B559a pdbhh F T 5vbl 1 A A agonist peptide KFRRQRPXXEHKKXXPX 17 T 0.076 Apelin pdb F T 5vbn 2 B,D B,F DPOE1_HUMAN DNA POLYMERASE II SUBUNIT A AQFRDPCRSYVLPEVICRSCNFCRDLDLCKDSSFSEDGAVLPQWLCSNCQAPYDSSAIEMTLVEVLQKKLMAFTLQDLVCLKCRGVKETSMPVYCSCAGDFALTIHTQVFMEQIGIFRNIAQHYGMSYLLETLEWLLQKNPQLGH 145 T 0.0011 zinc_ribbon_15 pdbhh F Eukaryota T 5vey 3 C C RN169_HUMAN RING FINGER PROTEIN 169,RING-TYPE E3 UBIQUITIN TRANSFERASE RNF169 GHMDPVLREMEQKLQQEEEDRQLALQLQRMFDNERRTVSRRKGSVDQYLLRSSNMAGAK 59 T 0.011 DUF4788 pdbpssm F Eukaryota T 5vf1 1 A,B,C,D,E,F A,B,C,D,E,F ORN-LYS-LEU-VAL-PHI-PHE-ALA-GLU-ORN-GLU-ALA-PHE-MEA-VAL-LEU-LYS XKLVXFAEXEAFXVLK 16 T 4.3 DUF4065 pdbhh F T 5vfw 1 A A ANXA1_HUMAN ANNEXIN I,ANNEXIN-1,CALPACTIN II,CALPACTIN-2,CHROMOBINDIN-9,LIPOCORTIN I,PHOSPHOLIPASE A2 INHIBITORY PROTEIN,P35 AMVSEFLKQAWFIENEEQEYVQTVK 25 T 16 DUF807 pdbhh F Eukaryota T 5vgb 2 B B A0A2D0TCG3_NEIME Anti-CRISPR protein (AcrIIC1) MANKTYKIGKNAGYDGCGLCLAAISENEAIKVKYLRDICPDYDGDDKAEDWLRWGTDSRVKAAALEMEQYAYTSVGMASCWEFVEL 86 T 6.9 WIYLD pdbhh F Bacteria T 5vgd 3 C C SER-ALA-GLU-PRO-VAL-PRO-LEU-GLN-LEU SAEPVPLQL 9 T 21 REV pdbhh F T 5vid 2 F,G,H,I F,G,H,I Bot.0671.2 MGSSHHHHHHSSGLVPRGSHMQPMFAELKAKFFLEIGDRDAARNALRKAGYSDEEAERIIRKYELE 66 T 0.099 RuvA_C pdbhh F T 5vji 1 A,B,D,E A,B,D,E CLOCK_MOUSE MCLOCK GAMDPEFSAQLGAMQHLKDQLEQRTRMIEANIHRQQEELRKIQEQLQMVHG 51 T 0.011 DUF641 pdbpercent F Eukaryota T 5vjj 1 A,B A,B B2ZCS6_MELLI Avirulence protein AvrP123 SNAQSNPNQELGVVQCLCRRIAPLTQPPFGVRCRATLNCPCDYIGDCPGPAEQYMYRCPNCGPRSHVACSGVHQGTCQQVHPGKDSVEYGG 91 T 0.032 LSR unppssm F Eukaryota T 5vjs 1 A A Reaction Center Maquette GSPELRQEHQQLAQEFQQLLQEIQQLGRELLKGELQGIKQLREASEKARNPEKKSVLQKILEDEEKHIELLETLQQTGQEAQQLLQELQQTGQELWQLGGSGGPELRQKHQQLAQKIQQLLQKHQQLGAKILEDEEKHIELLETILGGSGGDELRELLKGELQGIKQYRELQQLGQKAQQLVQKLQQTGQKLWQLG 196 T 0.0003 Rubrerythrin pdbpercent F T 5vk0 2 B,D,F,H,J,L,N,P,R,T,V,X B,D,F,H,J,L,N,P,R,T,V,X Lysine-cysteine side chain dithiocarbamate stapled peptide inhibitor PMI XTSFAEYWXLLSCX 14 T 0.42 P53_TAD pdbhh F T 5vk1 2 B,D,F,H,J,L,N,P B,D,F,H,J,L,N,P Lysine-cysteine side chain dithiocarbamate stapled peptide inhibitor PMI XTSFXEYWCLLSPX 14 T 0.21 PDDEXK_7 pdbhh F T 5vkl 2 B B RPB1_YEAST RNA POLYMERASE II SUBUNIT B1,DNA-DIRECTED RNA POLYMERASE III LARGEST SUBUNIT,RNA POLYMERASE II SUBUNIT B220 ESGLVNADLDVKDELMFSPLVDS 23 T 0.16 Hemolysin_N pdbhh F Eukaryota T 5vko 2 B B RPB1_YEAST RNA POLYMERASE II SUBUNIT B1,DNA-DIRECTED RNA POLYMERASE III LARGEST SUBUNIT,RNA POLYMERASE II SUBUNIT B220 CGGVTPYSNESGLVNADLDVKDELMFSPLVDSGS 34 T 0.36 Hemolysin_N pdbhh F Eukaryota T 5vl6 1 A A Q8I5P1_PLAF7 Peptide 38138 NVHTFRGDNVHNSSSSL 17 T 0.0082 SseC unppercent F Eukaryota T 5vla 2 B Z THR-VAL-PHE-THR-SER-TRP-GLU-GLU-TYR-LEU-ASP-TRP-VAL-MET-PRO-TRP-ASN-LEU-VAL-ARG-ILE-GLY-LEU-LEU TVFTSWEEYLDWVGSGDLMPWNLVRIGLLR 30 T 0.9 SseB pdbhh F T 5vlh 2 B Y CYS-ARG-LEU-PRO-TRP-ASN-LEU-GLN-ARG-ILE-GLY-LEU-PRO-CYS CRLPWNLQRIGLPC 14 T 0.19 CIS_TMP pdbhh F T 5vli 3 C C Computationally designed peptide HB1.6928.2.3 CIEQSFTTLFACQTAAEIWRAFGYTVKIMVDNGNCRLHVC 40 T 4.4 DUF4468 pdbhh F T 5vlk 3 C Z ACE-TRP-ASN-LEU-VAL-HRG-ILE-GLY-LEU-LEU peptide XWNLVXIGLLR 11 T 3.3 Abi_alpha pdbhh F T 5vll 2 B Y CYS-PHE-ILE-PRO-TRP-ASN-LEU-GLN-ARG-ILE-GLY-LEU-LEU-CYS CFIPWNLQRIGLLC 14 T 0.41 DUF2982 pdbhh F T 5vlp 4 D Z LDLR antagonist peptide XMESFPGWNLVXIGLLR 17 T 1 BOFC_N pdbhh F T 5vob 3 C C UL128_HCMVA Envelope glycoprotein UL128 MSPKDLTPFLTTLWLLLGHSRVPRVRAEECCEFINVNHPPERCYDFKMCNRFTVALRCPDGEVCYSPEKTAEIRGIVTTMTHSLTRQVVHNKLTSCNYNPLYLEADGRIRCGKVNDKAQYLLGAAGSVPYRWINLEYDKITRIVGLDQYLESVKKHKRLDVCRAKMGYMLQ 171 T 32 Auto_anti-p27 pdbhh T Viruses T 5vob 5 E E U131A_HCMVM Envelope glycoprotein UL131A MRLCRVWLSVCLCAVVLGQCQRETAEKNDYYRVPHYWDACSRALPDQTRYKYVEQLVDLTLNYHYDASHGLDNFDVLKRINVTEVSLLISDFRRQNRRGGTNKRTTFNAAGSLAPHARSLEFSVRLFAN 129 T 0.94 DDDD pdbpercent T Viruses T 5vox 16 EA,FA,GA e,f,g effector protein SidK MSFIKVGIKMGGLTSEQYHSQVVGKIGYIARCMQTIDPENNLKKIREDYQDVLIWAEKNYRFEEILEASKSGKCPNDLDALSRRSLILLELLRLVSSISPFKMKLDLIESQYEKMKQHVNLWKSDYHVKLNQLNQLTDYLKNAAPTPKNNFLRAMTSVLQMQIAQYGITEDNEGINQLFKLGLHLLAMANEKIDEQYHLFKGYVKDQPEESPFEGILPAEDQKILVKTMIDYAMPKLSSKVLQDKLSALSSSDVLTKTLLDSIDRIVKENEKLNALSKVKLGKFGLDIREIEVIYSQALKISPQDALQYTAQQCDAQLLSMAFPDSQNYIIESISNKKVKTIAELIHSKEFIYQIIKTEVFKQVDPNEKIRLQAATELYQLLGRIMDKQINLFTKMNLEQINEYIQTKTKAILDKIPERVELLTFMGFEIPTFKGIETLMTDISHSQDNETLAIAQEFYTNIKNAKNQLLGDKLIEDITPQDVEKFFNQCSQYGSEAAEKLADNRPVLTKIADILTAIARWAISLIGFNTPPQFLAPTRTCVDQVSDEITKIKLKLEDTLGSLQKVQEESLSL 573 T 0.15 Chalcone pdb F T 5vqi 2 B,C A,C NIT2_NEUCR Nuclear localization sequence of NIT2 transcription factor (NIT2-NLS) TISSKRQRRHSKS 13 T 12 DUF4543 pdbhh F Eukaryota T 5vr1 1 A A Turripeptide DCCPCPAGAVRCRFACCX 18 T 1.3 MSC pdbhh F T 5vt9 2 B,D C,D MYOA_TOXGO MYOA,TGM-A GASKKTPFIIRAQAHIRRHLVDNNVSPATVQPAFAAA 37 T 0.00015 IQ unppssm F Eukaryota T 5vtb 2 B B BC11A_HUMAN BCL-11A, B-CELL CLL/LYMPHOMA 11A, COUP-TF-INTERACTING PROTEIN 1, ECOTROPIC VIRAL INTEGRATION SITE 9 PROTEIN HOMOLOG, EVI-9, ZINC FINGER PROTEIN 856 SRRKQGKPQHLSKRE 15 T 460 VMAP-M12 pdbhh F Eukaryota T 5vte 1 A A de novo peptide 1 XELEAIAQKFEAIAKKFEAIAXKFEAIAQKX 31 T 4.2 DUF2967 pdbhh F T 5vud 3 C C Nonamer peptide: LEU-SER-SER-PRO-VAL-THR-LYS-SER-TRP LSSPVTKSW 9 T 24 HTH_WhiA pdbhh F T 5vue 3 C C Nonamer peptide: LEU-THR-VAL-GLN-VAL-ALA-ARG-VAL-TRP LTVQVARVW 9 T 2.2 TraV pdbhh F T 5vuf 3 C C Nonamer peptide: LEU-THR-VAL-GLN-VAL-ALA-ARG-VAL-TYR LTVQVARVY 9 T 8.7 APOBEC_C pdbhh F T 5vvt 2 B,D B,D ELK1_HUMAN ELK1 peptide FWSTLSPI 8 T 0.43 DUF5848 pdbhh F Eukaryota T 5vw1 3 C B A0A0E0UT28_LISMM anti-CRISPR protein AcrIIA4 GSMNINDLIREIKNKDYTVKLSGTDSNSITQLIIRVNNDGNEYVISESENESIVEKFISAFKNGWNQEYEDEEEFYNDMQTITLKSELN 89 T 0.041 DUF4930 pdb F Bacteria T 5vwi 2 C,D C,D beta-PIX PAWDETNL 8 T 1.1 IPP-2 pdbhh F T 5vwl 1 A A Q6TAN6_9HIV1 Cytoplasmic tail of HIV-1 gp41 protein SLALIWDDLRSLCLFSYHRLRDLLLIVTRIVELLGRRGWEALKYWWNLLQYWSQELKNSAVNLLNATAIAVAEGTDRVIEVLQAAYRAIRHIPRRIRQGLERILL 105 T 4.9 DUF6307 pdbhh T Viruses T 5vxv 1 A A PEX15_YEAST PEROXIN-15,PEROXISOME BIOSYNTHESIS PROTEIN PAS21 MSEVFQECVNLFIKRDIKDCLEKMSEVGFIDITVFKSNPMILDLFVSACDIMPSFTKLGLTLQSEILNIFTLDTPQCIETRKIILGDLSKLLVINKFFRCCIKVIQFNLTDHTEQEEKTLELESIMSDFIFVYITKMRTTIDVVGLQELIEIFIFQVKVKLHHKKPSPNMYWALCKTLPKLSPTLKGLYLSKDVSIEDAILNSIDNKIQKDKLEVLFQ 218 T 0.008 FGF-BP1 unppssm F Eukaryota T 5vzl 3 C C A0A2D0TCG7_9VIRU phage anti-CRISPR AcrIIA4 MNINDLIREIKNKDYTVKLSGTDSNSITQLIIRVNNDGNEYVISESENESIVEKFISAFKNGWNQEYEDEEEFYNDMQTITLKSELN 87 T 0.033 DUF4930 pdb T Viruses T 5vzu 3 E,F E,F CCND1_HUMAN Cyclin D1 EEVDLACTPTDVRDVDI 17 T 8.2 RE_HaeIII pdbhh F Eukaryota T 5w0j 1 A,B A,B peptide 1 XELAQAFKEIAKAFKEIAKAFEXIAQAIEKX 31 T 4.3 DUF1241 pdbhh F T 5w0k 5 E,J X,Y GP42_EBVB9 GP42 KPNVEVWPVDPPPPVNFNKTAEQEYGDKEVKLPHW 35 T 5.4 MarB unphh T Viruses T 5w1h 1 A A A0A2D0TCG9_9FIRM LbaCas13a (C2c2) SNAMKISKVREENRGAKLTVNAKTAVVSENRSQEGILYNDPSRYGKSRKNDEDRDRYIESRLKSSGKLYRIFNEDKNKRETDELQWFLSEIVKKINRRNGLVLSDMLSVDDRAFEKAFEKYAELSYTNRRNKVSGSPAFETCGVDAATAERLKGIISETNFINRIKNNIDNKVSEDIIDRIIAKYLKKSLCRERVKRGLKKLLMNAFDLPYSDPDIDVQRDFIDYVLEDFYHVRAKSQVSRSIKNMNMPVQPEGDGKFAITVSKGGTESGNKRSAEKEAFKKFLSDYASLDERVRDDMLRRMRRLVVLYFYGSDDSKLSDVNEKFDVWEDHAARRVDNREFIKLPLENKLANGKTDKDAERIRKNTVKELYRNQNIGCYRQAVKAVEEDNNGRYFDDKMLNMFFIHRIEYGVEKIYANLKQVTEFKARTGYLSEKIWKDLINYISIKYIAMGKAVYNYAMDELNASDKKEIELGKISEEYLSGISSFDYELIKAEEMLQRETAVYVAFAARHLSSQTVELDSENSDFLLLKPKGTMDKNDKNKLASNNILNFLKDKETLRDTILQYFGGHSLWTDFPFDKYLAGGKDDVDFLTDLKDVIYSMRNDSFHYATENHNNGKWNKELISAMFEHETERMTVVMKDKFYSNNLPMFYKNDDLKKLLIDLYKDNVERASQVPSFNKVFVRKNFPALVRDKDNLGIELDLKADADKGENELKFYNALYYMFKEIYYNAFLNDKNVRERFITKATKVADNYDRNKERNLKDRIKSAGSDEKKKLREQLQNYIAENDFGQRIKNIVQVNPDYTLAQICQLIMTEYNQQNNGCMQKKSAARKDINKDSYQHYKMLLLVNLRKAFLEFIKENYAFVLKPYKHDLCDKADFVPDFAKYVKPYAGLISRVAGSSELQKWYIVSRFLSPAQANHMLGFLHSYKQYVWDIYRRASETGTEINHSIAEDKIAGVDITDVDAVIDLSVKLCGTISSEISDYFKDDEVYAEYISSYLDFEYDGGNYKDSLNRFCNSDAVNDQKVALYYDGEHPKLNRNIILSKLYGERRFLEKITDRVSRSDIVEYYKLKKETSQYQTKGIFDSEDEQKNIKKFQEMKNIVEFRDLMDYSEIADELQGQLINWIYLRERDLMNFQLGYHYACLNNDSNKQATYVTLDYQGKKNRKINGAILYQICAMYINGLPLYYVDKDSSEWTVSDGKESTGAKIGEFYRYAKSFENTSDCYASGLEIFENISEHDNITELRNYIEHFRYYSSFDRSFLGIYSEVFDRFFTYDLKYRKNVPTILYNILLQHFVNVRFEFVSGKKMIGIDKKDRKIAKEKECARITIREKNGVYSEQFTYKLKNGTVYVDARDKRYLQSIIRLLFYPEKVNMDEMIEVKEKKKPSDNNTGKGYSKRDRQQDRKEYDKYKEKKKKEGNFLSGMGGNINWDEINAQLKN 1440 T 1.2 SesA unppercent F Bacteria T 5w2j 2 C F unidentified peptide AKGALQELGAGLTA 14 T 11 zf-C2HCIx2C pdbhh F T 5w3n 1 A,B,C,D,E,F,G,H,I A,B,C,D,E,F,G,H,I FUS_HUMAN 75 KDA DNA-PAIRING PROTEIN,ONCOGENE FUS,ONCOGENE TLS,POMP75,TRANSLOCATED IN LIPOSARCOMA PROTEIN MSYYHHHHHHDYDIPTTENLYFQGAMDPASNDYTQQATQSYGAYPTQPGQGYSQQSSQPYGQQSYSGYSQSTDTSGYGQSSYSSYGQSQNTGYGTQSTPQGYGSTGGYGSSQSSQSSYGQQSSYPGYGQQPAPSSTSGSYGSSSQSSSYGQPQSGSYSQQPSYGGQQQSYGQQQSYNPPQGYGQQNQYNSSSGGGGGGGGGGNYGQDQSSMSSGGGSGGGYGNQDQSGGGGSGGYGQQDRG 241 T 720 SelK_SelG pdbhh F Eukaryota T 5w4a 1 A,B,C,D A,B,C,D P-granule scaffold MDTNKREIVEFLGIRTYFFPNLALYAVNNDELLVSDPNKANSFAAYVFGASDKKPSVDDIVQILFPSGSDSGTILTSMDTLLALGPDFLTEFKKRNQDLARFNLTHDLSILAQGDEDAAKKKLNLMGRKAKLQKTEAAKILAILIKTINSEENYEKFTELSELCGLDLDFDAYVFTKILGLEDEDTADEVEVIRDNFLNRLDQTKPKLADIIRNGP 216 T 0.0014 SidE_PDE pdbpssm F T 5w4e 2 B,C A,D TDT_HUMAN human DNA repair polymerase Tdt SHLSPRKKRPRQTGAL 16 T 21 Doppel pdbhh F Eukaryota T 5w4f 2 B,C A,D DPOLM_HUMAN POL MU,TERMINAL TRANSFERASE LPKRRRARVGSPSGDAASSTPPSTRFPGV 29 T 0.0018 BRCT unppercent F Eukaryota T 5w4g 2 B A DPOLL_HUMAN POL LAMBDA,DNA POLYMERASE BETA-2,POL BETA2,DNA POLYMERASE KAPPA RGILKAFPKRQKIHADASSKVLAKIPRRE 29 T 13 Luteo_coat pdbhh F Eukaryota T 5w4h 1 A,B,C A,B,C A-beta 17_36 peptide: ORN-LYS-LEU-VAL-MEA-PHE-ALA-GLU-ORN-ALA-ILE-ILE-GLY-LEU-MET-VAL XKLVXFAEXAIIGLMV 16 T 0.0038 Beta-APP pdbhh F T 5w4j 1 A,B,C,D,E,F A,B,C,D,E,F A-beta 17_36 peptide: ORN-LYS-VAL-PHE-MEA-ALA-ALA-ASP-ORN-ALA-ILE-ILE-GLY-LEU-MET-VAL XKVFXAADXAIIGLMV 16 T 0.031 Beta-APP pdbhh F T 5w4k 58 ID,JD A,B Klebsazolicin XSPGNXASXSNSASANXX 18 T 1.1 Cytochrom_C pdbhh F T 5w54 1 A A A0A2D0TCH0_MANSE Stress Response Peptide-2 FGVKDGKCPSGRVRRLGICVPDDDY 25 T 0.78 NRF pdbhh F Eukaryota T 5w5s 3 C D CYCLIC PEPTIDE CP141019 (P5) XXXLEYXEWLSX 12 T 3.6 TerB_N pdbhh F T 5w5u 3 C D CYCLIC PEPTIDE CP141037 (P4) XXXXEYFEWLSX 12 T 3.6 TerB_N pdbhh F T 5w6i 3 C D CYCLIC PEPTIDE CP141046 (P3) XXXLEYFEWLSX 12 T 3.6 TerB_N pdbhh F T 5w6t 3 C F CYCLIC PEPTIDE CP151070 (P7) XXXXEYXEWLSX 12 T 3.6 TerB_N pdbhh F T 5w6u 3 C D CYCLIC PEPTIDE CP121068 (P2) XRXLEYFEWLSX 12 T 3.9 TerB_N pdbhh F T 5w6y 1 A,B A,B A9S498_PHYPA Chorismate mutase MACALSVSGILCASQAATSFSSAKPTKSQPHPVQLKAFVPISQPAALKSASLVVSPSRTSHASVEAETEPFTLANIRESLIRQEDTIIYALLQRAQFSFNAPTYDENSFSIPGFKGSLVEFMLKETETLHAKVRRYQAPDEHPFFPEDLSQPILPSLPKSRVLHPAAEKININKSIWSMYLQDLLPKLTVPDDDGNYGSASVCDVLCLQALSKRIHYGKFVAEAKFIEDPARFEGHIKAQDGDAILRELTFKNVEDNVKRRVANKARAYGQEVNEHGKVDNARYKIDPDLAGALYEDWVMPLTKQVQVAYLLRRLD 316 T 0.072 DUF5788 pdbpercent F Eukaryota T 5w7x 2 E,F,G,H H,E,F,G XRCC1_HUMAN X-RAY REPAIR CROSS-COMPLEMENTING PROTEIN 1 PYAGSTDEN 9 T 19 M3 pdbhh F Eukaryota T 5w7y 2 B,D D,C XRCC1_HUMAN X-RAY REPAIR CROSS-COMPLEMENTING PROTEIN 1 PYAGETDE 8 T 7.5 GH43_C pdbhh F Eukaryota T 5w82 1 A,B,C,D,E C,B,D,E,A E9KNV6_9VIRU Protein delta HMMPSEDYAIWYARATIAALQAAEYRLAMPSASYTAWFTDAVSDKLDKISESLNTLVECVIDKRLAVSVPEPLPVRVENKVQVEVEDEVRVRVENKVDVEVKN 103 T 0.12 DNMT1-RFD pdb T Viruses T 5w94 2 B,D B,D SCC2_YEAST Sister chromatid cohesion protein 2 SNAMSYPGKDKNIPGRIIEALEDLPLSYLVPKDGLAALVNAPMRVSLPFDKTIFTSADDGRDVNINVLGTANSTTSSIKNEAEKERLVFKRPSNFTSSANSVDYVPTNFLEGLSPLAQSVLSTHKGLNDSINIEKKSEIVSRPEAKHKLESVTSNAGNLSFNDNSSNKKTKTSTGVTMTQANLA 184 T 0.4 HTH_25 pdbpercent F Eukaryota T 5w96 1 A,B A,B FZ7 LPSDDLEFWCHVMY 14 T 0.45 v110 pdbhh F T 5w9f 1 A A De novo mini protein gHEEE_02 SQETRKKCTEMKKKFKNCEVRCDESNHCVEVRCSDTKYTLC 41 T 14 DUF5651 pdbhh F T 5wa1 2 B B CHM4C_HUMAN CHROMATIN-MODIFYING PROTEIN 4C, CHMP4C, SNF7 HOMOLOG ASSOCIATED WITH ALIX 3, SNF7-3, HSNF7-3, VACUOLAR PROTEIN SORTING-ASSOCIATED PROTEIN 32-3, HVPS32-3 QRAEEEDDDIKQLAAWTT 18 T 1.1 Ribosomal_60s unppssm F Eukaryota T 5wa4 2 G,H,I,J,K,L M,N,O,P,Q,R D6Y501_THEBD TbtA 16-mer peptide MDLNDLPMDVFELADS 16 T 6.7 NPH3 pdbhh F Bacteria T 5wah 1 A A BAG_STRAG BETA ANTIGEN,B ANTIGEN GVEKTAGETSATDTGKREKQLQQWKNNLKNDVDNTILSHEQKNEFKTKIDETNDSDALLELENQFNETNRLLHIKQHEEVEKDKKAKQQKTLKQSDTKV 99 T 2.1 RtcB pdb F Bacteria T 5wai 2 B,F B,F SUZ12_HUMAN CHROMATIN PRECIPITATED E2F TARGET 9 PROTEIN,CHET 9 PROTEIN,JOINED TO JAZF1 PROTEIN,SUPPRESSOR OF ZESTE 12 PROTEIN HOMOLOG MEHVQADHELFLQAFEKPTQIYRFLRTRNLIAPIFLHRTLTYMSHRNSRTNIKRKTFKVDDMLSKVEKMKGEQESHSLSAHLQLTFTGFFHKNDKPSPNSENEQNSVTLEVLLVKVCHKKRKDVSCPIRQVPTGKKQVPLNPDLNQTKPGNFPSLAVSSNEFEPSNSHMVKSYSLLFRVTRPGRREFNGMINGETNENIDVNEELPARRKRNREDGEKTFVAQMTVFDKNRRLQLLDGEYEVAMQEMEECPISKKRATWETILDGKRLPPFETFSQGPTLQFTLRWTGETNDKSTAPIAKPLATRNSESLHQENKPGSVKPTQTIAVKESLTTDLQTRKEKDTPNENRQKLRIFYQFLYNNNTRQQTEARDDLHCPWCTLNCRKLYSLLKHLKLCHSRFIFNYVYHPKGARIDVSINECYDGSYAGNPQDIHRQPGFAFSRNGPVKRTPITHILVCRPKRTKASMSEFLEWSHPQFEK 478 T 0.17 zf_C2H2_6 unphh F Eukaryota T 5wai 3 C,G C,G AEBP2_HUMAN ADIPOCYTE ENHANCER-BINDING PROTEIN 2,AE-BINDING PROTEIN 2 SNARHRAICFNLSAHIESLGKGHSVVFHSTVIAKRKEDSGKIKLLLHWMPEDILPDVWVNESERHQLKTKVVHLSKLPKDTALLLDPNIYRTMPQKRLKR 100 T 0.011 Mtf2_C pdbhh F Eukaryota T 5wai 4 D,H D,H JARD2_HUMAN Jumonji, AT-rich interactive domain 2 LSKRKPKTEDFLTFLCLRG 19 T 0.86 GMAP pdbhh F Eukaryota T 5wb5 2 B B E9AFM3_LEIMA Uncharacterized protein GSPSVRTMYTREELLRIATLASAMDLGPEVLRKFDVIEVAEPVPTPKRRDAES 53 T 0.43 Spore_YtrH pdbhh F Eukaryota T 5wbh 2 F W KS6B1_HUMAN S6K1,70 KDA RIBOSOMAL PROTEIN S6 KINASE 1,P70-S6K 1,RIBOSOMAL PROTEIN S6 KINASE I,SERINE/THREONINE-PROTEIN KINASE 14A,P70 RIBOSOMAL S6 KINASE ALPHA,P70 S6KA TYVAPSVLESVKEKFSFEPKIRSPRR 26 T 19 PLN_propep pdbhh F Eukaryota T 5wbk 2 B T KS6B1_HUMAN S6K1,70 KDA RIBOSOMAL PROTEIN S6 KINASE 1,P70-S6K 1,RIBOSOMAL PROTEIN S6 KINASE I,SERINE/THREONINE-PROTEIN KINASE 14A,P70 RIBOSOMAL S6 KINASE ALPHA,P70 S6KA MAGVFDIDLDQPED 14 T 0.86 DUF1805 pdbhh F Eukaryota T 5wco 1 A,B,C A,B,C Q910W0_9ORTO NON-STRUCTURAL PROTEIN 2,ORF1 MGSSHHHHHHSSGLVPRGSHMNESQWIQKHLPCMREANPKPRELIRHALKKKKRPEVVYAMGVLLTLGGESGLTVEFPVPEGKTVKVKTLNQLVNGMISRATMTLYCVMKDPPSGSMATLMRDHIRNWLKEESGCQDADGGEEKWAMVYGMISPDMAEEKTMLKELKTMLHSRMQMYALGASSKALENLEKAIVAAVHRLPASCSTEKMVLLGYLK 216 T 0.081 CbiD unphh T Viruses T 5wcv 1 A A A0A2H4A2Y1_ANESU ShK homolog AsK132958 CENTISGCSRADCLLTHRKQGCQKTCGLC 29 T 0.0051 ShK pdb F Eukaryota T 5wd8 1 A,B A,B LPG2328 SNAPVTELTRLKEYMEDQIAKAKESSSLTAQLKFLENAHTEHFVKMGSLTTIYKGGSEVVDRLKIEIRSLYEEMLELKDKCRDQIQQYETS 91 T 0.21 Siah-Interact_N pdbpercent F T 5wdu 1 A,C,E G,F,Q Q2N0S6_9HIV1 ENV POLYPROTEIN ENLWVTVYYGVPVWKDAETTLFCASDAKAYETEKHNVWATCACVPTDPNPQEIHLENVTEEFNMWKNNMVEQMHTDIISLWDQSLKPCVKLTPLCVTLQCTNVTNNITDDMRGELKNCSFNMTTELRDKKQKVYSLFYRLDVVQINENQGNRSNNSNKEYRLINCNTSAITQACPKVSFEPIPIHYCAPAGFAILKCKDKKFNGTGPCPSVSTVQCTHGIKPVVSTQLLLNGSLAEEEVMIRSENITNNAKNILVQFNTPVQINCTRPNNNTRKSIRIGPGQAFYATGDIIGDIRQAHCNVSKATWNETLGKVVKQLRKHFGNNTIIRFANSSGGDLEVTTHSFNCGGEFFYCNTSGLFNSTWISNTSVQGSNSTGSNDSITLPCRIKQIINMWQRIGQAMYAPPIQGVIRCVSNITGLILTRDGGANNTSTETFRPGGGDMRDNWRSELYKYKVVKIEPLGVAPTRCKRR 471 T 3.5E-54 GP120 pdbpercent T Viruses T 5we0 1 A,D,G,J A,D,G,J POZ1_SCHPO POT1-ASSOCIATED PROTEIN POZ1 SNEKIRSQSVLNTLETFFIKENHYDMQREESSIVNACLRYLGYSKSMCHEKMPIFMDIAFIEYCFNLSLDPSSFQNLPITQTQPDSQQILWEYSLISNALERLENIELERQNCMREDGLSKYTNSLLLNKETLNNEALKLYSCAKAGICRWMAFHFLEQEPIDHINFTKFLQDWGSHNEKEMEALQRLSKHKIRKRLIYVSQHKKKMPWSKFNSVLSRYIQCTKLQLEVFCDYDFKQREIVKMLTSNIN 249 T 0.0063 Dcc1 pdbpercent F Eukaryota T 5we0 2 B,E,H,K B,E,H,K TPZ1_SCHPO MEIOTICALLY UP-REGULATED GENE 169 PROTEIN SEACEMCRLGLPHGSFFELLRDWKKIEEFRNKS 33 T 0.75 RPAP3_C pdbhh F Eukaryota T 5we1 1 A,C A,C POZ1_SCHPO POT1-ASSOCIATED PROTEIN POZ1,POT1-ASSOCIATED PROTEIN POZ1 SESSIVNACLRYLGYSKSMCHEKMPIFMDIAFIEYCFNLSLDPSSFQGGASQQILWEYSLISNALERLENIELERQNCMREDGLSKYTNSLLLNKETLNNEALKLYSCAKAGICRWMAFHFLEQEPIDHINFTKFLQDWGSHNEKEMEALQRLSKHKIRKRLIYVSQHKKKMPWSKFNSVLSRYIQCTKLQLEVFCDYDFKQREIVKMLTSNIN 214 T 0.005 Dcc1 pdbpercent F Eukaryota T 5we2 1 A,C A,C POZ1_SCHPO POT1-ASSOCIATED PROTEIN POZ1 SNEKIRSQSVLNTLETFFIKENHYDMQREESSIVNACLRYLGYSKSMCHEKMPIFMDIAFIEYCFNLSLDPSSFQNLPITQTQPDSQQILWEYSLISNALERLENIELERQNCMREDGLVKYTNELLLNKETLNNEALKLYSCAKAGICRWMAFHFLEQEPIDHINFTKFLQDWGSHNEKEMEALQRLSKHKIRKRLIYVSQHKKKMPWSKFNSVLSRYIQCTKLQLEVFCDYDFKQREIVKMLTSNIN 249 T 0.14 DUF5896 unppssm F Eukaryota T 5we2 3 E F RAP1_SCHPO DNA-binding protein rap1 SDNIFVKPGEDLEIPLLSDYSDSENISEKS 30 T 0.89 DUF3983 pdbhh F Eukaryota T 5wg1 2 C P Nrf2 EAGE mutant peptide LDEEAGEFL 9 T 1.3 Herpes_US9 pdbhh F T 5wgd 3 D F (ACE)AILHKLLQDS(NH2) XAILHKLLQDSX 12 T 0.0022 SRC-1 pdbhh F T 5wir 1 A,C D,C TERB1_HUMAN TERB1-TBM SKKILLTPRRRQRLS 15 T 0.73 WSK pdbhh F Eukaryota T 5wjc 2 B B MIS19_SCHPO Eic1 protein MDLMPLEKARAIEIAFDNVFHNTKIPDNLQQFDAILKRLERRRFIPTENQKPRVYETELLVLRFREFGVKDNHNHPINLHSLRSKSLIRAQGKKLDLHNRVFLRRNVRAVKM 112 T 6.6 Ins_allergen_rp pdbhh F Eukaryota T 5wk1 1 A,B,C,D,E,F,G X,L,Y,M,S,Z,K S5MS27_9CAUD Capsid Stabilizing Protein MANSKNSIFVGGAGRVKQTIEGLAQSAFKPGQLLARAAGDAIDVTAKASTTYGNEFLICDDQPQTLGGGTDVAVTAGDTVQAISVLPGQYVLLSFAATQNVTTKGAAVASNGDGNFKLGNPATEQTFAVTEEIINVTTAGTLVLCRAI 148 T 13 PP_kinase_N pdbhh T Viruses T 5wlb 2 B,E B,E 225-15 a SGPRRPRXPGDQASLEELHEYWARLWNYLYRVAH 34 T 0.00013 Hormone_3 pdbpssm F T 5wlc 17 Q LI UTP8_YEAST Utp8 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXRLFKQAIVTCPNLPLNELLEELFSIRNRELLLDISFRILQDFTRDSIKQEMKKLSKLDVQNFIEFITSGGEDSSPECFNPSQSTQLFQLLSLVLDSIGLFSLEGALLENLTLYIDKQVEIAERNTELWNLIDTKGFQHGFASSTFDNGTSQKRALPTYTMEYLDI 713 T 8.500000000000002E-245 Utp8 unppssm F Eukaryota T 5wlc 38 MA NE FAF1_YEAST Faf1 MTLDDDDYIKQMELQRKAFESQFGSLESMGFEDKTKNIRTEVDTRDSSGDEIDNSDHGSDFKDGTIESSNSSDEDSGNETAEENNQDSKPKXXXXXXXXXXXXXXXXXXXXXXXXXXXSGKTLTQINKKLESTEAKEEKEDETLEAENLQNDLELQQFLRESHLLSAFNNGGSGSTNSGVSLTLQSMGGGNDDGIVYQDDQVIGKARSRTLEMRLNRLSRVNGHQDKINKLEKVPMHIRRGMIDKHVKRIKKYEQEAAEGGIVLSKVKKGQFRKIESTYKKDIERRIGGSIKARDKEKATKRERGLKISSVGRSTRNGLIVSKRDIARISGGERSGKFNGKKKSRR 346 T 1.9E-09 DUF4602 pdbpssm F Eukaryota T 5wlh 1 A A A0A2D0TCH1_LACNK LbaCas13a H328A (C2c2) SNAMKISKVREENRGAKLTVNAKTAVVSENRSQEGILYNDPSRYGKSRKNDEDRDRYIESRLKSSGKLYRIFNEDKNKRETDELQWFLSEIVKKINRRNGLVLSDMLSVDDRAFEKAFEKYAELSYTNRRNKVSGSPAFETCGVDAATAERLKGIISETNFINRIKNNIDNKVSEDIIDRIIAKYLKKSLCRERVKRGLKKLLMNAFDLPYSDPDIDVQRDFIDYVLEDFYHVRAKSQVSRSIKNMNMPVQPEGDGKFAITVSKGGTESGNKRSAEKEAFKKFLSDYASLDERVRDDMLRRMRRLVVLYFYGSDDSKLSDVNEKFDVWEDAAARRVDNREFIKLPLENKLANGKTDKDAERIRKNTVKELYRNQNIGCYRQAVKAVEEDNNGRYFDDKMLNMFFIHRIEYGVEKIYANLKQVTEFKARTGYLSEKIWKDLINYISIKYIAMGKAVYNYAMDELNASDKKEIELGKISEEYLSGISSFDYELIKAEEMLQRETAVYVAFAARHLSSQTVELDSENSDFLLLKPKGTMDKNDKNKLASNNILNFLKDKETLRDTILQYFGGHSLWTDFPFDKYLAGGKDDVDFLTDLKDVIYSMRNDSFHYATENHNNGKWNKELISAMFEHETERMTVVMKDKFYSNNLPMFYKNDDLKKLLIDLYKDNVERASQVPSFNKVFVRKNFPALVRDKDNLGIELDLKADADKGENELKFYNALYYMFKEIYYNAFLNDKNVRERFITKATKVADNYDRNKERNLKDRIKSAGSDEKKKLREQLQNYIAENDFGQRIKNIVQVNPDYTLAQICQLIMTEYNQQNNGCMQKKSAARKDINKDSYQHYKMLLLVNLRKAFLEFIKENYAFVLKPYKHDLCDKADFVPDFAKYVKPYAGLISRVAGSSELQKWYIVSRFLSPAQANHMLGFLHSYKQYVWDIYRRASETGTEINHSIAEDKIAGVDITDVDAVIDLSVKLCGTISSEISDYFKDDEVYAEYISSYLDFEYDGGNYKDSLNRFCNSDAVNDQKVALYYDGEHPKLNRNIILSKLYGERRFLEKITDRVSRSDIVEYYKLKKETSQYQTKGIFDSEDEQKNIKKFQEMKNIVEFRDLMDYSEIADELQGQLINWIYLRERDLMNFQLGYHYACLNNDSNKQATYVTLDYQGKKNRKINGAILYQICAMYINGLPLYYVDKDSSEWTVSDGKESTGAKIGEFYRYAKSFENTSDCYASGLEIFENISEHDNITELRNYIEHFRYYSSFDRSFLGIYSEVFDRFFTYDLKYRKNVPTILYNILLQHFVNVRFEFVSGKKMIGIDKKDRKIAKEKECARITIREKNGVYSEQFTYKLKNGTVYVDARDKRYLQSIIRLLFYPEKVNMDEMIEVKEKKKPSDNNTGKGYSKRDRQQDRKEYDKYKEKKKKEGNFLSGMGGNINWDEINAQLKN 1440 T 0.19 APEH_N unppercent F Bacteria T 5wlj 1 A,B,C,D A,B,C,D De Novo Metal Binding Helical Bundle XIEELLRKILEDEARHVAELEDIEKWLX 28 T 0.027 Ribonuc_red_sm pdbhh F T 5wlk 1 A,B,C,D A,B,C,D Helical Bundle 4EH2 XIEELLRKIIEDEVRHIAELEDIEKWLX 28 T 0.09 Ribonuc_red_sm pdbhh F T 5wll 1 A,B,C,D A,B,C,D Helical Bundle 4DH1 XIEELLRKILEDDARHVAELEDIEKWLX 28 T 0.66 Ald_deCOase pdbhh F T 5wlm 1 A,B,C,D A,B,C,D Helical Bundle 4DH2 XIEELLRKIIEDDVRHIAELEDIEKWLX 28 T 1.5 Rubrerythrin pdbhh F T 5wlp 1 A A ATG32_YEAST EXTRACELLULAR MUTANT PROTEIN 37 SNATNSFVMPKLSLTQKNPVFRLLILGRTGSSFYQSIPKEYQSLFELPKYHDSATFPQYTGIVIIFQELREMVSLLNRIVQYSQGKPVIPICQPGQVIQVKNVLKSFLRNKLVKLLFPPVVVTNKRDLKKMFQRLQDLSLEYGED 145 T 0.0048 FleQ pdbpercent F Eukaryota T 5wmn 3 E,F E,F SPI peptide from Influenza A virus SPIVPSFDM 9 T 1.4 Bul1_N pdbhh F T 5wmp 3 C C TPR peptide from CMV TPRVTGGGAM 10 T 7.7 PGK pdbhh F T 5wmr 3 C C QIK peptide from CMV QIKVRVDMV 9 T 1.5 Herpes_IE1 pdbhh F T 5woc 1 A,B A,B SER-PRO-GLU-GLU-ARG-ALA-GLN-LEU-CYS-THR-ALA-ALA-GLU-LYS-ALA-ASP-GLU-LEU-GLY SPEERAQLCTAAEKADELG 19 T 1.7 DNA_pol_P_Exo pdbhh F T 5wod 1 A A 38-mer peptide SPEERAQLLTAAEKADELGCPEERAQLLTAAEKADELG 38 T 1.7 DUF3721 pdb F T 5wou 2 B V Q9VE13_DROME GUK-holder, isoform A LPSFETAL 8 T 5.5 Comm pdbhh F Eukaryota T 5wpp 1 A,B A,B A0A076NBW8_9CYAN HPIU5 MGSSHHHHHHSSGLVPRGSHMASTSVVSIPINNAGFEDPFIEVVDDYTVDTPPGWTTYNPNNLVPEKRTTMTSNNGVGYVGPGTQFYNQLAPEGRNIGYIYLAQKPGSGVAGFEQILDATLEPDTKYTLMVDVGNFGGEFQKISLAGFPGYRVELLAGDTVLAADHNNLYIKDGEFKTSTVTFTATPDNPYLDQKLGIRLINLLQGTFSGLDFDNVRLTVEPAQT 225 T 0.27 CBM_4_9 pdbpercent F Bacteria T 5wps 1 A A A0A076NBW8_9CYAN HPIU5 MGSSHHHHHHSSGLVPRGSHMASTSVVSIPINNAGFEDPFIEVVDDYTVDTPPGWTTYNPNNLVPEKRTTWTSNNGVGYVGPGTQFYNQLAPEGRNIGFIYLAQKPGSGVAGFEQILDATLEPDTKYTLKVDVGNFGGEFQKISLAGFPGYRVELLAGDTVLAADHNNLYIKDGEFKTSTVTFTATPDNPYLDQKLGIRLINLLQGTFSGLDFDNVRLTVEPAQT 225 T 3.4 DUF642 pdbhh F Bacteria T 5wpu 1 A A A0A076NBW8_9CYAN HPIU5 MGSSHHHHHHSSGLVPRGSHMASTSVVSIPINNAGFEDPFIEVVDDYTVDTPPGWTTYNPNNLVPEKRTTWTSNNGVGYVGPGTQFYNQLAPEGRNIGSIYLAQKPGSGVAGFEQILDATLEPDTKYTLKVDVGNFGGEFQKISLAGFPGYRVELLAGDTVLAADHNNLYIKDGEFKTSTVTFTATPDNPYLDQKLGIRLINLLQGTFSGLDFDNVRLTVEPAQT 225 T 2.5 DUF642 pdbhh F Bacteria T 5wqd 2 H,I,J,K,L,M,N H,I,J,K,L,M,N NBN_HUMAN NBS1 KMRIPNYQLSPTKLPS 16 T 1.1 SpoIISB_antitox pdbhh F Eukaryota T 5wqe 1 A A C2C1_ALIAG AACC2C1 MAVKSIKVKLRLDDMPEIRAGLWKLHKEVNAGVRYYTEWLSLLRQENLYRRSPNGDGEQECDKTAEECKAELLERLRARQVENGHRGPAGSDDELLQLARQLYELLVPQAIGAKGDAQQIARKFLSPLADKDAVGGLGIAKAGNKPRWVRMREAGEPGWEEEKEKAETRKSADRTADVLRALADFGLKPLMRVYTDSEMSSVEWKPLRKGQAVRTWDRDMFQQAIERMMSWESWNQRVGQEYAKLVEQKNRFEQKNFVGQEHLVHLVNQLQQDMKEASPGLESKEQTAHYVTGRALRGSDKVFEKWGKLAPDAPFDLYDAEIKNVQRRNTRRFGSHDLFAKLAEPEYQALWREDASFLTRYAVYNSILRKLNHAKMFATFTLPDATAHPIWTRFDKLGGNLHQYTFLFNEFGERRHAIRFHKLLKVENGVAREVDDVTVPISMSEQLDNLLPRDPNEPIALYFRDYGAEQHFTGEFGGAKIQCRRDQLAHMHRRRGARDVYLNVSVRVQSQSEARGERRPPYAAVFRLVGDNHRAFVHFDKLSDYLAEHPDDGKLGSEGLLSGLRVMSVDLGLRTSASISVFRVARKDELKPNSKGRVPFFFPIKGNDNLVAVHERSQLLKLPGETESKDLRAIREERQRTLRQLRTQLAYLRLLVRCGSEDVGRRERSWAKLIEQPVDAANHMTPDWREAFENELQKLKSLHGICSDKEWMDAVYESVRRVWRHMGKQVRDWRKDVRSGERPKIRGYAKDVVGGNSIEQIEYLERQYKFLKSWSFFGKVSGQVIRAEKGSRFAITLREHIDHAKEDRLKKLADRIIMEALGYVYALDERGKGKWVAKYPPCQLILLEELSEYQFNNDRPPSENNQLMQWSHRGVFQELINQAQVHDLLVGTMYAAFSSRFDARTGAPGIRCRRVPARCTQEHNPEPFPWWLNKFVVEHTLDACPLRADDLIPTGEGEIFVSPFSAEEGDFHQIHADLNAAQNLQQRLWSDFDISQIRLRCDWGEVDGELVLIPRLTGKRTADSYSNKVFYTNTGVTYYERERGKKRRKVFAQEKLSEEEAELLVEADEAREKSVVLMRDPSGIINRGNWTRQKEFWSMVNQRIEGYLVKQIRSRVPLQDSACENTGDILEHHHHHH 1137 T 0.0038 RuvC_1 pdbhh F Bacteria T 5wrd 2 C,D C,D FYCO1_MOUSE Peptide from FYVE and coiled-coil domain-containing protein 1 DDAVFDIITDEELCQIQES 19 T 1.8 ComC pdbhh F Eukaryota T 5wrl 2 B P IRS1_RAT IRS-1,PP185 DYMPMSPK 8 T 0.082 STAT1_TAZ2bind pdbhh F Eukaryota T 5wrx 1 A A analogue peptide VG13P VARGWGRKCPLFG 13 T 0.0016 Flavi_glycoprot pdbhh F T 5wsh 3 C C GLY-VAL-TRP-ILE-ARG-THR-PRO-THR-ALA GVWIRTPTA 9 T 0.13 Hepatitis_core pdbhh F T 5wti 3 C Z A0A0D0F5I0_9BACI UNCHARACTERIZED PROTEIN MATRSFILKIEPNEEVKKGLWKTHEVLNHGIAYYMNILKLIRQEAIYEHHEQDPKNPKKVSKAEIQAELWDFVLKMQKCNSFTHEVDKDVVFNILRELYEELVPSSVEKKGEANQLSNKFLYPLVDPNSQSGKGTASSGRKPRWYNLKIAGDPSWEEEKKKWEEDKKKDPLAKILGKLAEYGLIPLFIPFTDSNEPIVKEIKWMEKSRNQSVRRLDKDMFIQALERFLSWESWNLKVKEEYEKVEKEHKTLEERIKEDIQAFKSLEQYEKERQEQLLRDTLNTNEYRLSKRGLRGWREIIQKWLKMDENEPSEKYLEVFKDYQRKHPREAGDYSVYEFLSKKENHFIWRNHPEYPYLYATFCEIDKKKKDAKQQATFTLADPINHPLWVRFEERSGSNLNKYRILTEQLHTEKLKKKLTVQLDRLIYPTESGGWEEKGKVDIVLLPSRQFYNQIFLDIEEKGKHAFTYKDESIKFPLKGTLGGARVQFDRDHLRRYPHKVESGNVGRIYFNMTVNIEPTESPVSKSLKIHRDDFPKFVNFKPKELTEWIKDSKGKKLKSGIESLEIGLRVMSIDLGQRQAAAASIFEVVDQKPDIEGKLFFPIKGTELYAVHRASFNIKLPGETLVKSREVLRKAREDNLKLMNQKLNFLRNVLHFQQFEDITEREKRVTKWISRQENSDVPLVYQDELIQIRELMYKPYKDWVAFLKQLHKRLEVEIGKEVKHWRKSLSDGRKGLYGISLKNIDEIDRTRKFLLRWSLRPTEPGEVRRLEPGQRFAIDQLNHLNALKEDRLKKMANTIIMHALGYCYDVRKKKWQAKNPACQIILFEDLSNYNPYEERSRFENSKLMKWSRREIPRQVALQGEIYGLQVGEVGAQFSSRFHAKTGSPGIRCSVVTKEKLQDNRFFKNLQREGRLTLDKIAVLKEGDLYPDKGGEKFISLSKDRKLVTTHADINAAQNLQKRFWTRTHGFYKVYCKAYQVDGQTVYIPESKDQKQKIIEEFGEGYFILKDGVYEWGNAGKLKIKKGSSKQSSSELVDSDILKDSFDLASELKGEKLMLYRDPSGNVFPSDKWMAAGVFFGKLERILISKLTNQYSISTIEDDSSKQSM 1108 T 0.0023 RuvC_1 pdbhh F Bacteria T 5wtj 1 A,B A,B C2C2_LEPSD ENDORNASE,LSHC2C2 MGNLFGHKRWYEVRDKKDFKIKRKVKVKRNYDGNKYILNINENNNKEKIDNNKFIRKYINYKKNDNILKEFTRKFHAGNILFKLKGKEGIIRIENNDDFLETEEVVLYIEAYGKSEKLKALGITKKKIIDEAIRQGITKDDKKIEIKRQENEEEIEIDIRDEYTNKTLNDCSIILRIIENDELETKKSIYEIFKNINMSLYKIIEKIIENETEKVFENRYYEEHLREKLLKDDKIDVILTNFMEIREKIKSNLEILGFVKFYLNVGGDKKKSKNKKMLVEKILNINVDLTVEDIADFVIKELEFWNITKRIEKVKKVNNEFLEKRRNRTYIKSYVLLDKHEKFKIERENKKDKIVKFFVENIKNNSIKEKIEKILAEFKIDELIKKLEKELKKGNCDTEIFGIFKKHYKVNFDSKKFSKKSDEEKELYKIIYRYLKGRIEKILVNEQKVRLKKMEKIEIEKILNESILSEKILKRVKQYTLEHIMYLGKLRHNDIDMTTVNTDDFSRLHAKEELDLELITFFASTNMELNKIFSRENINNDENIDFFGGDREKNYVLDKKILNSKIKIIRDLDFIDNKNNITNNFIRKFTKIGTNERNRILHAISKERDLQGTQDDYNKVINIIQNLKISDEEVSKALNLDVVFKDKKNIITKINDIKISEENNNDIKYLPSFSKVLPEILNLYRNNPKNEPFDTIETEKIVLNALIYVNKELYKKLILEDDLEENESKNIFLQELKKTLGNIDEIDENIIENYYKNAQISASKGNNKAIKKYQKKVIECYIGYLRKNYEELFDFSDFKMNIQEIKKQIKDINDNKTYERITVKTSDKTIVINDDFEYIISIFALLNSNAVINKIRNRFFATSVWLNTSEYQNIIDILDEIMQLNTLRNECITENWNLNLEEFIQKMKEIEKDFDDFKIQTKKEIFNNYYEDIKNNILTEFKDDINGCDVLEKKLEKIVIFDDETKFEIDKKSNILQDEQRKLSNINKKDLKKKVDQYIKDKDQEIKSKILCRIIFNSDFLKKYKKEIDNLIEDMESENENKFQEIYYPKERKNELYIYKKNLFLNIGNPNFDKIYGLISNDIKMADAKFLFNIDGKNIRKNKISEIDAILKNLNDKLNGYSKEYKEKYIKKLKENDDFFAKNIQNKNYKSFEKDYNRVSEYKKIRDLVEFNYLNKIESYLIDINWKLAIQMARFERDMHYIVNGLRELGIIKLSGYNTGISRAYPKRNGSDGFYTTTAYYKFFDEESYKKFEKICYGFGIDLSENSEINKPENESIRNYISHFYIVRNPFADYSIAEQIDRVSNLLSYSTRYNNSTYASVFEVFKKDVNLDYDELKKKFKLIGNNDILERLMKPKKVSVLELESYNSDYIKNLIIELLTKIENTNDTLLEHHHHHH 1397 T 0.067 PET117 pdbpercent F Bacteria T 5wtt 3 C,F P,C Epitope peptide of Cyr61 CGLECNFG 8 T 0.17 DUF3330 pdbhh F T 5wuj 1 A A O25118_HELPY Flagellar M-ring protein FSEEEVRYEIILEKIRGTLKERPDEIAMLFKLLIKDE 37 T 0.061 Peptidase_C34 pdbpssm F Bacteria T 5wxe 1 A A A0A2R2JFU3_9LAMI jasmintide js3 QLCLLCQTSRDCNYIIWTVCRDGCCNIS 28 T 0.021 PAN_4 pdbpercent F Eukaryota T 5wxf 2 B P upain-2-2 peptide CSWXGLENHAAC 12 T 0.53 DUF2632 pdbhh F T 5wxn 2 C,D C,D STK11_HUMAN Serine/threonine-protein kinase STK11 RWRSMTVVPYLED 13 T 0.034 WWamide pdbhh F Eukaryota T 5wxo 2 B P upain-2-2-W3A peptide CSAXGLENHAAC 12 T 6.4 LRRNT pdbhh F T 5wxq 2 B P upain-2-4 peptide GACSWRGLENHAAC 14 T 1 DUF2632 pdbhh F T 5wxr 2 B P upain-2-4-W3A peptide GACSARGLENHAAC 14 T 5.5 RE_SacI pdbhh F T 5wyh 2 B,D B,D Interaptin LEEYIRMAKNKEFFDALEEIAESAKNDETLRNELAKVLDDILKTDPSDPEAFRKIVAEHQEFWDEHDPSLMEFNEGRFFGKSRKQYLKSDDFLNSTDPTYNFQKLHQFAAEQRVKLGLEKSDTDTLVAILKNNPEECRAYIESKKPGLGNFSEGNVHGWLKEEYTPTIPPKAINKSTGVLSDEAIKRIKEQARDLLLL 198 T 0.013 SE pdb F T 5wyl 2 B,D B,D G0SCS8_CHATD UTP17 DLDMEDNEDTHAVVVAPQRLAEIFNAAPAFAMPPIEDVFYQVASLFSTKPVINA 54 T 2.7 VQ pdbhh F Eukaryota T 5wzz 2 E,F,G,H E,F,G,H AXIN1_HUMAN AXIS INHIBITION PROTEIN 1,HAXIN YRVPKEVRVEPQKFAEELIH 20 T 0.19 Cwf_Cwc_15 unppssm F Eukaryota T 5x0s 1 A A TXF1A_SCOSU SsTx EVIKKDTPYKKRKFPYKSECLKACATSFTGGDESRIQEGKPGFFKCTCYFTTG 53 T 0.08 DUF5760 pdbpercent F Eukaryota T 5x1e 2 B,E B,E Q5ZS31_LEGPH IcmW PDLSHEASAKYWFEYLDPMIYRVITFMESVENWTLDGNPELEEAMKQLGQELDDIEKIDLGLLAEEDKFIRIVGNIKSGRGLRLLQAIDTVHPGSASRVLIHAEETSLSSSDPAGFFLKRNIVFERLRLLSRVFCQYRLKLVLRALEG 148 T 0.11 DUF2335 pdbpercent F Bacteria T 5x1e 3 C C Q5ZYC6_LEGPH IcmO (DotL) EGALTIFSKLRIDPNAPPILVADKEVFSEPLLPINETRNQMITIERLAGAKDKYAGTVANELIKDFQIATSYPPEERDVIDVQELTGIIRDLSAKISAEREK 102 T 0.74 RecC_C unppssm F Bacteria T 5x1e 5 F F Q5ZYC6_LEGPH IcmO (DotL) ALTIFSKLRIDPNAPPILVADKEVFSEPLLPINETRNQMITIERLAGAKDKYAGTVANELIKDFQIATSYPPEERDVIDVQELTGIIRDLSAKISAEREK 100 T 0.1 Glypican pdbpssm F Bacteria T 5x1g 1 A C WHAMM_HUMAN WAS PROTEIN HOMOLOGY REGION 2 DOMAIN-CONTAINING PROTEIN 1,WH2 DOMAIN-CONTAINING PROTEIN 1 IQMKRDKIKEEEQKKKEWINQERQKTLQRLRSFK 34 T 0.042 Trimer_CC pdbpssm F Eukaryota T 5x1u 1 A,B A,B Q5WZ95_LEGPL Uncharacterized protein ALTPMEFARKYNLLRKDDALLDNPVPGEEMTAGIRRGDAKRVFTMQLGPYWDGFERCSPQAYALSAVFMARMNRDRDAANNILKVLDKTFVDGKPDFSVARPVMKKYQNSELVQEVVAKHAYVLTVIASLLEAAREDGVVPSSEFLWLKPVDRRLWYMLNCVGRQTPYSEVAGPFAHWKAEKEMGRRSLVPMIDEAIRALEIAVKEVR 208 T 0.31 Pkip-1 pdb F Bacteria T 5x42 2 B,D B,D Q5ZYC6_LEGPH IcmO (DotL) VEPPPDDYLMKLQKQLASFQSILESGDLSINKAVENEEITLISKALKESTIVEPIERGVAALIAFHGQNE 70 T 0.12 DUF2433 pdb F Bacteria T 5x6x 1 A,B,C,D C,A,B,D MCE_RDVA mRNA capping enzyme P5 GGSMSNPDYCIPNFSQTVNERTIIDIFTICRYRSPLVVFCLSHNELAKKYAQDVSMSSGTHVHIIDGSVEITVSLYRTFRTIATQLLGRMQIVVFVTVDKSVVSTQVMKSIAWAFRGSFVELRNQSVDSSTLVSKLENLVSFAPLYNVPKCGPDYYGPTVYSELLSLATNARTHWYATIDYSMFTRSVLTGFVAKYFNEEAVPIDKRIVSIVGYNPPYVWTCLRHGIRPTYIEKSLPNPGGKGPFGLILPVINELVLKSKVKYVMHNPQIKLLCLDTFMLSTSMNILYIGAYPATHLLSLQLNGWTILAFDPKITSDWTDAMAKATGAKVIGVSKEFDFKSFSVQANQLNMFQNSKLSVIDDTWVETDYEKFQSEKQAYFEWLIDRTSIDVRLISMKWNRSKDTSVSHLLALLPQPYGASIREMRAFFHKKGASDIKILAAETEKYMDDFTAMSVSDQINTQKFMHCMITTVGDALKMDLDGGRAVIASYSLSNSSNSKERVLKFLSDANKAKAMVVFGAPNTHRLAYAKKVGLVLDSAIKMSKDLITFSNPTGRRWRDYGYSQSELYDAGYVEITIDQMVAYSSDVYNGVGYFANSTYNDLFSWYIPKWYVHKRMLMQDIRLSPAALVKCFTTLIRNICYVPHETYYRFRGILVDKYLRSKNVDPSQYSIVGSGSKTFTVLSHFEVPHECGPLVFEASTDVNISGHLLSLAIAAHFVASPMILWAEQMKYMAVDRMLPPNLDKSLFFDNKVTPSGALQRWHSREEVLLAAEICESYAAMMLNNKHSPDIIGTLKSAINLVFKI 804 T 5.8E-05 PARP_regulatory unphh T Viruses T 5x7v 1 A,B,C,D,E,F A,B,C,D,E,F Q9Y010_PLAFA Nucleosome assembly protein FMQDFEDIQKDIEQLDIKCAHEQMNIQKQYDEKKKPLFEKRDEIIQKIPGFWANTLRKHPALSDIVPEDIDILNHLVKLDLKDNMDNNGSYKITFIFGEKAKEFMEPLTLVKHVTFDNNQEKVVECTRIKWKEGKNPIAAVTHNRSDLDNEIPKWSIFEWFTTDELQDKPDVGELIRREIWHNPLSYYLGLEE 193 T 7.5E-07 NAP pdbpssm F Eukaryota T 5x90 2 B F Q5ZS31_LEGPH IcmW PDLSHEASAKYWFEYLDPMIYRVITFMESVENWTLDGNPELEEAMKQLGQELDDIEKIDLGLLAEEDKFIRIVGNIKSGRGLRLLQAIDTVHPGSASRVLIHAEETSLSSSDPAGFFLKRNIVFERLRLLSRVFCQYRLKLVLRALEGD 149 T 0.11 DUF2335 pdbpercent F Bacteria T 5x90 3 C G Q5ZYC6_LEGPH IcmO (DotL) EGALTIFSKLRIDPNAPPILVADKEVFSEPLLPINETRNQMITIERLAGAKDKYAGTVANELIKDFQIATSYPPEERDVIDVQELTGIIRDLSAKISAEREKANKKAA 108 T 0.061 Csm2_III-A pdbpssm F Bacteria T 5x90 4 D H Q5ZY48_LEGPH Hypothetical virulence protein LTMIDDLNNPLAIVERVYLIWWHWADFHLHVISPHIDTITPAIVIEPELIPGSNDHEFVYSIHDSGSKLSTSKSQDMFSAGMSMCKLFYTIEKMVYILVERLKSGGVSMEAEVQIAFAGHEIAQRKAFESIINLPYNVVVTNFDPGIWGEKYLQNVKRLADKGYGYPPESPR 172 T 0.21 Herpes_TK_C pdbpercent F Bacteria T 5x90 6 G C Q5ZYC6_LEGPH IcmO (DotL) EGALTIFSKLRIDPNAPPILVADKEVFSEPLLPINETRNQMITIERLAGAKDKYAGTVANELIKDFQIATSYPPEERDVIDVQELTGIIRDLSAKISAEREKANKKA 107 T 0.7 DUF3811 pdbpercent F Bacteria T 5x90 7 H D Q5ZY48_LEGPH Hypothetical virulence protein IDDLNNPLAIVERVYLIWWHWADFHLHVISPHIDTITPAIVIEPELIPGSNDHEFVYSIHDSGSKLSTSKSQDMFSAGMSMCKLFYTIEKMVYILVERLKSGGVSMEAEVQIAFAGHEIAQRKAFESIINLPYNVVVTNFDPGIWGEKYLQNVKRLADKGYGYPPESPRKI 171 T 0.21 Herpes_TK_C pdbpercent F Bacteria T 5x9x 1 A A Q9BML7_DROME Metabotropic GABA-B receptor subtype 1 MDSKEDEERYQKLVTENEQLQRLITQKEEKIRVLRQRLVERGDA 44 T 0.0039 Csm1_N pdb F Eukaryota T 5x9x 2 B B Q9BML6_DROME GABAB RECEPTOR 2 GPLGSSVSELEQRLRDVKNTNSRFRKALMEKENELQALIRKLGPE 45 T 0.011 MIP-T3_C pdbpercent F Eukaryota T 5xa5 2 B B HMP2_CAEEL PROTEIN HUMPBACK-2 GGIQTSAAEATNSTTSIVEMMQMPTQQLKQSVMDLLTYEGSNDMSGLS 48 T 3.7E-05 Adaptin_N unppssm F Eukaryota T 5xad 2 C,D C,D Q5ZUV9_LEGPH Uncharacterised protein GSIVDEFEELGEQESDIDEFDLLEG 25 T 14 LMP pdbhh F Bacteria T 5xbd 1 A A A0A2R2JFU8_9CARY pB1 QCKPNGAKCTEISIPPCCSNFCLRYAGQKSGTCANR 36 T 0.00093 Antifungal_pept pdb F Eukaryota T 5xco 2 B B ACE-ARG-ARG-ARG-ARG-CYS-PRO-LEU-TYR-ILE-SER-TYR-ASP-PRO-VAL-CYS-ARG-ARG-ARG-ARG-NH2 XRRRRCPLYISYDPVCRRRRX 21 T 1.5 YliH pdbhh F T 5xhz 2 C,D C,D ARAP1_MOUSE CENTAURIN-DELTA-2,CNT-D2 RPVPMKRHIFR 11 T 21 DUF924 pdbhh F Eukaryota T 5xiu 1 A A RN168_HUMAN HRNF168,RING FINGER PROTEIN 168,RING-TYPE E3 UBIQUITIN TRANSFERASE RNF168 GPGHMEETEINFTQKLIDLEHLLFERHKQEEQDRLLALQLQKEVDKEQM 49 T 7.2 DUF3629 pdbhh F Eukaryota T 5xiv 1 A A A0A247D712_GINBI beta-ginkgotide, beta-gB1 YETGCKRCCYLDEYGCIRCC 20 T 2.5 Antistasin pdbhh F Eukaryota T 5xj0 6 G,H G,H A7XX65_9CAUD gp39 GSHMVEGFVEPYIRLFEAIPDAETELATFYDADLDTLPPRMFLPSGDLYTPPGPVRLEEIKRKRRVRLVKVSIYRFEHVGLGLAARPYAYAYAWQGDNGILHLYHAPVVLEDVPEVLELDEVTYNESYVRLMRAMGHVDAFIDL 144 T 2.5 Abp2 unphh T Viruses T 5xjg 2 B,D B,D NVJ1_YEAST Nucleus-vacuole junction protein 1 NREKDCSSSSEVESQSKCRKESTAEPDSLSRDTRTTSSLKSSTSFPISFKGSIDLKSLNQPSSLLHIQVSPTKSSNLDAQVNTEQAYSQPFRY 93 T 0.06 Trypan_PARP unp F Eukaryota T 5xjm 4 D B ANGT_HUMAN Sar1, Ile8-angiotensin II XRVYIHPI 8 T 3 Ion_trans_N pdbhh F Eukaryota T 5xll 1 A,B A,B PKNI_MYCTU Serine/threonine-protein kinase PknI TAPVPLDGTYRIEIQRSKQTYDYTPTPQPPDVNTWWAFRTSCTPTECLAAATMLDDNDHTQAKTPPVRPFLMQFGEGQWKSRPETVQFPCVGPNGSPSTQATTQLLALRPQPQGDLVGEMVVTVHSNECGQQGAVIRIPAVASRSGDLPPAVTVPDPATIPDTPDTTSTATLTPPTTTAPGPGR 184 T 1.9 DEC-1_N pdb F Bacteria T 5xlm 1 A,B A,B PKNI_MYCTU Serine/threonine-protein kinase PknI RKTNTTATEVARPPTSGSAVPSAPTTTVAVTAPVPLDGTYRIEIQRSKQTYDYTPTPQPPDVNTWWAFRTSCTPTECLAAATMLDDNDHTQAKTPPVRPFLMQFGEGQWKSRPETVQFPCVGPNGSPSTQATTQLLALRPQPQGDLVGEMVVTVHSNECGQQGAVIRIPAVASRSGDLPPAVTVPDPATIPDTPDTTSTATLTPPTTTAPGPGR 214 T 0.24 Acyl-CoA_dh_C unp F Bacteria T 5xln 2 B B SYTC_HUMAN THREONYL-TRNA SYNTHETASE,THRRS GGKKKNKEGSGDGGRAELNPWPEYIYTRLEMYNILKAEHDSILAE 45 T 4.4 CAAP1 pdbhh F Eukaryota T 5xlo 3 H,I N,M L7P7M1_9CAUD Uncharacterized protein AcrF1 MKFIKYLSTAHLNYMNIAVYENGSKIKARVENVVNGKSVGARDFDSTEQLESWFYGLPGSGLGRIENAMNEISRRENP 78 T 0.16 DUF4982 pdb T Viruses T 5xm4 1 A A A0A0B8ZWE6_9SPHN SUBTERISIN GPPGDRIEFGVLAQLPG 17 T 0.16 DUF5974 unphh F Bacteria T 5xn3 2 B B NOS2_HUMAN cR8 peptide from NOS2 RGDINNNV 8 T 3.8 DUF6373 pdbhh F Eukaryota T 5xnb 1 A,D,G,J,M,P A,D,G,J,M,P Q5ZYC6_LEGPH ICMO PROTEIN EPVEDIVEEEVEGALTIFSKLRIDPNAPPILVADKEVFSEPLLPINETRNQMITIERLAGAKDKYAGTVANELIKDFQIATSYPPEERDVIDVQELTGIIRDLSAKISAEREK 113 T 0.74 RecC_C unppssm F Bacteria T 5xnb 3 C,F,I,L,O,R C,F,I,L,O,R Q5ZS31_LEGPH ICMW PROTEIN MPDLSHEASAKYWFEYLDPMIYRVITFMESVENWTLDGNPELEEAMKQLGQELDDIEKIDLGLLAEEDKFIRIVGNIKSGRGLRLLQAIDTVHPGSASRVLIHAEETSLSSSDPAGFFLKRNIVFERLRLLSRVFCQYRLKLVLRALEGDE 151 T 0.11 DUF2335 unppercent F Bacteria T 5xo3 1 A A THAN_PODMA Thanatin GSKKPVPIIYCNRRTGKCQRF 21 T 2.6 YihI pdbhh F Eukaryota T 5xo4 1 A A THAN_PODMA Thanatin GSKKPVPIIYCNRRTGKCQRM 21 T 2.6 YihI pdbhh F Eukaryota T 5xo5 1 A A THAN_PODMA Thanatin GSKKPVPIIACNRRTGKCQRA 21 T 0.14 YihI pdbhh F Eukaryota T 5xod 2 B B SKI_HUMAN PROTO-ONCOGENE C-SKI GPGLQKTLEQFHLSSMSSLGGPAAFSASDED 31 T 2.7 DUF2520 pdbhh F Eukaryota T 5xoj 4 D,E,F E,F,G E7Q297_YEASB Nup42p KPSAFGAPAFGSSAPINVNPPSTTSAFGAPSFGST 35 T 18 DUF2673 pdbhh F T 5xol 1 A,B A,B THAN_PODMA Thanatin GSKKPVPIIYCNAATGKCQRM 21 T 2.6 YihI unphh F Eukaryota T 5xpt 2 B B CHAP1_HUMAN ZINC FINGER PROTEIN 828 MSNPSASSGPWKPAKPAPSVS 21 T 5.2 GvpL_GvpF pdbhh F Eukaryota T 5xqz 2 B,D C,D ST38L_HUMAN NDR2 PROTEIN KINASE,NUCLEAR DBF2-RELATED KINASE 2 SSGHMKLTLENFYSNLILQHEERETRQKKLEVAMEEEGLADEEKKLRRSQHARKETEFLRLKRTRLGL 68 T 0.12 Anoct_dimer pdbpssm F Eukaryota T 5xsj 2 B L A6LW08_CLOB8 Signal transduction histidine kinase, LytS MGSSHHHHHHSQGSMLNNMLITNEIKQHVDSSLDNFNQYILNGTPSKKESYNNEVILAKQKIGNLKKNSDDVNQYILRDLDNTLDSYIESSKNTISAYENKEGYVFYYDDFVAAKNIASYCDAYASTLMQNFLEANSIAYKELNRNSS 148 T 0.00042 HBM pdbpercent F Bacteria T 5xtc 1 A Q NDUS2_HUMAN COMPLEX I-49KD,CI-49KD,NADH-UBIQUINONE OXIDOREDUCTASE 49 KDA SUBUNIT VRQWQPDVEWAQQFGGAVMYPSKETAHWKPPPWNDVDPPKDTIVKN 46 T 36 CCSAP pdbhh F Eukaryota T 5xtj 1 A,B A,B A0A2U8ZTY7_RHIZD ENDO BETA-1,4-MANNANASE ADRGTETVPGLGQRKQQILNSGGGVWDLAIAMLETKNLGTDYVYGDGKTYDSANFGIFKQNWFMLRTSTSQFKGQTTNQWNNGAVLNSNLQQDIKARQESQNYYGPDKWFAGHRNGESGLSNPYTQDITNYKDAVNWIHDQLASDPKYLSDDTRFWVDVTAI 162 T 0.036 Lys pdbpercent F Eukaryota T 5xup 2 C,D C,D TERB1_HUMAN COILED-COIL DOMAIN-CONTAINING PROTEIN 79 KILLTPRRRQRL 12 T 0.86 WSK pdbhh F Eukaryota T 5xv8 1 A A UVSSA_HUMAN UV-stimulated scaffold protein A GSMRRRTEALGDAEEDEDDEDFVEVPEKEGYEPHIPDHLRPEYGLEAA 48 T 15 AgrD pdbhh F Eukaryota T 5xvw 2 C,F D,F RNG1A_ARATH RING 1A ILAWGRGGTRSNTR 14 T 0.25 PetN pdbhh F Eukaryota T 5xw5 2 C C SWI6_YEAST CELL-CYCLE BOX FACTOR SUBUNIT SWI6,MBF SUBUNIT P90,TRANS-ACTING ACTIVATOR OF HO ENDONUCLEASE GENE HRELGSPLKK 10 T 15 DUF4416 pdbhh F Eukaryota T 5xwe 1 A,B A,B 3S11H_OPHHA WTX DE-1 HOMOLOG 1 MKPVLLTLVVVTIVCLDLGYTRICLKQEPFQPETTTTCPEGEDACYNLFWSDHSEIKIEMGCGCPKTEPYTNLYCCKIDSCNK 83 T 0.021 Endomucin pdb F Eukaryota T 5xwp 1 A,F A,B CS13A_LEPBD Uncharacterized protein SMKVTKVGGISHKKYTSEGRLVKSESEENRTDERLSALLNMRLDMYIKNPSSTETKENQKRIGKLKKFFSNKMVYLKDNTLSLKNGKKENIDREYSETDILESDVRDKKNFAVLKKIYLNENVNSEELEVFRNDIKKKLNKINSLKYSFEKNKANYQKINENNIEKVEGKSKRNIIYDYYRESAKRDAYVSNVKEAFDKLYKEEDIAKLVLEIENLTKLEKYKIREFYHEIIGRKNDKENFAKIIYEEIQNVNNMKELIEKVPDMSELKKSQVFYKYYLDKEELNDKNIKYAFCHFVEIEMSQLLKNYVYKRLSNISNDKIKRIFEYQNLKKLIENKLLNKLDTYVRNCGKYNYYLQDGEIATSDFIARNRQNEAFLRNIIGVSSVAYFSLRNILETENENDITGRMRGKTVKNNKGEEKYVSGEVDKIYNENKKNEVKENLKMFYSYDFNMDNKNEIEDFFANIDEAISSIRHGIVHFNLELEGKDIFAFKNIAPSEISKKMFQNEINEKKLKLKIFRQLNSANVFRYLEKYKILNYLKRTRFEFVNKNIPFVPSFTKLYSRIDDLKNSLGIYWKTPKTNDDNKTKEIIDAQIYLLKNIYYGEFLNYFMSNNGNFFEISKEIIELNKNDKRNLKTGFYKLQKFEDIQEKIPKEYLANIQSLYMINAGNQDEEEKDTYIDFIQKIFLKGFMTYLANNGRLSLIYIGSDEETNTSLAEKKQEFDKFLKKYEQNNNIKIPYEINEFLREIKLGNILKYTERLNMFYLILKLLNHKELTNLKGSLEKYQSANKEEAFSDQLELINLLNLDNNRVTEDFELEADEIGKFLDFNGNKVKDNKELKKFDTNKIYFDGENIIKHRAFYNIKKYGMLNLLEKIADKAGYKISIEELKKYSNKKNEIEKNHKMQENLHRKYARPRKDEKFTDEDYESYKQAIENIEEYTHLKNKVEFNELNLLQGLLLRILHRLVGYTSIWERDLRFRLKGEFPENQYIEEIFNFENKKNVKYKGGQIVEKYIKFYKELHQNDEVKINKYSSANIKVLKQEKKDLYIANYIAAFNYIPHAEISLLEVLENLRKLLSYDRKLKNAVMKSVVDILKEYGFVATFKIGADKKIGIQTLESEKIVHLKNLKKKKLMTDRNSEELCKLVKIMFEYKMEEKKSEN 1160 T 0.66 DUF2316 unppercent F Bacteria T 5xwr 2 C,D C,D SALL4_HUMAN MET-SER-ARG-ARG-LYS-GLN-ALA-LYS-PRO-GLN-HIS-ILE MSRRKQAKPQHI 12 T 160 Loricrin pdbhh F Eukaryota T 5xwy 1 A A CS13A_LEPBD A type VI-A CRISPR-Cas RNA-guided RNA ribonuclease, Cas13a MKVTKVGGISHKKYTSEGRLVKSESEENRTDERLSALLNMRLDMYIKNPSSTETKENQKRIGKLKKFFSNKMVYLKDNTLSLKNGKKENIDREYSETDILESDVRDKKNFAVLKKIYLNENVNSEELEVFRNDIKKKLNKINSLKYSFEKNKANYQKINENNIEKVEGKSKRNIIYDYYRESAKRDAYVSNVKEAFDKLYKEEDIAKLVLEIENLTKLEKYKIREFYHEIIGRKNDKENFAKIIYEEIQNVNNMKELIEKVPDMSELKKSQVFYKYYLDKEELNDKNIKYAFCHFVEIEMSQLLKNYVYKRLSNISNDKIKRIFEYQNLKKLIENKLLNKLDTYVRNCGKYNYYLQDGEIATSDFIARNRQNEAFLRNIIGVSSVAYFSLRNILETENENDITGRMRGKTVKNNKGEEKYVSGEVDKIYNENKKNEVKENLKMFYSYDFNMDNKNEIEDFFANIDEAISSIRHGIVHFNLELEGKDIFAFKNIAPSEISKKMFQNEINEKKLKLKIFRQLNSANVFRYLEKYKILNYLKRTRFEFVNKNIPFVPSFTKLYSRIDDLKNSLGIYWKTPKTNDDNKTKEIIDAQIYLLKNIYYGEFLNYFMSNNGNFFEISKEIIELNKNDKRNLKTGFYKLQKFEDIQEKIPKEYLANIQSLYMINAGNQDEEEKDTYIDFIQKIFLKGFMTYLANNGRLSLIYIGSDEETNTSLAEKKQEFDKFLKKYEQNNNIKIPYEINEFLREIKLGNILKYTERLNMFYLILKLLNHKELTNLKGSLEKYQSANKEEAFSDQLELINLLNLDNNRVTEDFELEADEIGKFLDFNGNKVKDNKELKKFDTNKIYFDGENIIKHRAFYNIKKYGMLNLLEKIADKAGYKISIEELKKYSNKKNEIEKNHKMQENLHRKYARPRKDEKFTDEDYESYKQAIENIEEYTHLKNKVEFNELNLLQGLLLRILHRLVGYTSIWERDLRFRLKGEFPENQYIEEIFNFENKKNVKYKGGQIVEKYIKFYKELHQNDEVKINKYSSANIKVLKQEKKDLYIANYIAAFNYIPHAEISLLEVLENLRKLLSYDRKLKNAVMKSVVDILKEYGFVATFKIGADKKIGIQTLESEKIVHLKNLKKKKLMTDRNSEELCKLVKIMFEYKMEEKKSEN 1159 T 0.66 DUF2316 unppercent F Bacteria T 5xxe 1 A,C A,B POZ1_SCHPO POT1-ASSOCIATED PROTEIN POZ1 GSNEKIRSQSVLNTLETFFIKENHYDMQREESSIVNACLRYLGYSKSMCHEKMPIFMDIAFIEYCFNLSLDPSSFQNLPITQTQPDSQQILWEYSLISNALERLENIELERQNCMREDGLVKYTNELLLNKETLNNEALKLYSCAKAGICRWMAFHFLEQEPIDHINFTKFLQDWGSHNEKEMEALQRLSKHKIRKRLIYVSQHKKKMPWSKFNSVLSRYIQCTKLQLEVFCDYDFKQREIVKMLTSNIN 250 T 0.14 DUF5896 unppssm F Eukaryota T 5xxe 2 B,D C,D TPZ1_SCHPO MEIOTICALLY UP-REGULATED GENE 169 PROTEIN EACEMCRLGLPHGSFFELLRDWKKIEEFRNKS 32 T 0.7 RPAP3_C pdbhh F Eukaryota T 5xxf 1 A,D A,B POZ1_SCHPO POT1-ASSOCIATED PROTEIN POZ1 GSNEKIRSQSVLNTLETFFIKENHYDMQREESSIVNACLRYLGYSKSMCHEKMPIFMDIAFIEYCFNLSLDPSSFQNLPITQTQPDSQQILWEYSLISNALERLENIELERQNCMREDGLVKYTNELLLNKETLNNEALKLYSCAKAGICRWMAFHFLEQEPIDHINFTKFLQDWGSHNEKEMEALQRLSKHKIRKRLIYVSQHKKKMPWSKFNSVLSRYIQCTKLQLEVFCDYDFKQREIVKMLTSN 248 T 0.14 DUF5896 unppssm F Eukaryota T 5xxf 2 B,E C,D TPZ1_SCHPO MEIOTICALLY UP-REGULATED GENE 169 PROTEIN ACEMCRLGLPHGSFFELLRDWKKIEEFRNKS 31 T 0.64 RPAP3_C pdbhh F Eukaryota T 5xxf 3 C,F E,F Rap1 NSDNIFVKPGEDLEIPL 17 T 0.27 DUF3983 pdbhh F T 5xxk 2 C,D C,D Hydrocarbon stapled peptide THC-SER-PHE-0EH-GLU-TYR-6CW-ALA-LEU-LEU-MK8-NH2 XSFXEYXALLXX 12 T 0.54 P53_TAD pdbhh F T 5xxq 2 C,D C,D ZN827_HUMAN Zinc finger protein 827 MPRRKQEQPKRLPS 14 T 21 Co_AT_N pdbhh F Eukaryota T 5xy9 2 C,D C,D STK26_HUMAN MST3 AND SOK1-RELATED KINASE,MAMMALIAN STE20-LIKE PROTEIN KINASE 4,STE20-LIKE KINASE MST4,SERINE/THREONINE-PROTEIN KINASE MASK TSRENNTHPEWS 12 T 3.7 DUF2811 pdbhh F Eukaryota T 5xyf 2 B C TERF2_HUMAN TTAGGG REPEAT-BINDING FACTOR 2,TELOMERIC DNA-BINDING PROTEIN SLQPKNKRMTISRLVLEE 18 T 10 VasL pdbhh F Eukaryota T 5xyf 3 C B ACD_HUMAN POT1 AND TIN2-INTERACTING PROTEIN ADPRSSLCARVQAARLPPQLMAWALHFLMDAQPGSEPTPM 40 T 15 DUF6525 pdbhh F Eukaryota T 5xyn 3 C C SHU1_YEAST Suppressor of HU sensitivity involved in recombination protein 1 MQFEERLQQLVESDWSLDQSSPNVLVIVLGDTARKYVELGGLKEHVTTNTVAGHVASRERVSVVFLGRVKYLYMYLTRMQAQANGPQYSNVLVYGLWDLTATQDGPQQLRLLSLVLRQCLSLPSKVEFYPEPPSSSVPARLLRFWDHIIR 150 T 28 RepA1_leader pdbhh F Eukaryota T 5xyn 4 D D SHU2_YEAST Suppressor of hydroxyurea sensitivity protein 2 MSKDVIEYSKLFAKLVNTNDDTKLDDTIASFLYYMFPRELFIRAISLLESSDMFIYILDRVHNKEGNEHTSLIDVLVDEFYKGSSNSLLEYRLIVKDTNDGAPPILVDIAHWFCSCEEFCKYFHEALEKTDEKEELHDVLINEVDDHLQFSDDRFAQLDPHSLSKQWYFKFDKVCCSHLLAFSILLRSSINVLKFFTVNSNKVFVIAIDNIDEWLNLHINIVE 223 T 0.5 SWIM pdbpssm F Eukaryota T 5xyv 2 C,D C,D DEL_DROME Protein deadlock MEKLDKIRMSQKLSCWQHILTTLGTSSKTEQEWNTFFKGFLESWRKPYCIQTSCDPSIPL 60 T 0.068 Herpes_IE68 pdbpssm F Eukaryota T 5xyw 2 C,D C,D B4Q3Z0_DROSI GD21652 MENLAKIRMSQKLACWQQILTTLGTSSMSEQEWNTFFRGFLESWQNPYCIQTSCDPSIPL 60 T 0.21 DUF4543 pdbpssm F Eukaryota T 5xzk 1 A,B,C A,B,C A0A384E107_9AGAR lectin (PhoSL) APVPVTKLVCDGDTYKCTAYLDYGDGKWVAQWDTAVFHTT 40 T 0.069 C2-set pdbhh F Eukaryota T 5xzx 2 B B RANB3_HUMAN RANBP3-B GSSPEGGEDSDREDGNYCPPVKRERTSSLT 30 T 12 Fib_alpha pdbhh F Eukaryota T 5y0h 1 A A N6 GFAWNVCVYRNGVRVCHRRAN 21 T 1.5 PilI pdbhh F T 5y0i 1 A A NZ17074(N1) GFCWNVCVYRNGVRVCHRRCN 21 T 1 PilI pdbhh F T 5y0j 1 A A N2 AFCWNVCVYRNAVRVCHRRCN 21 T 4.2 DUF2760 pdbhh F T 5y14 2 D,E,F F,E,D LP-40 YTSLIHSLIEESQNQQEKNEQELLELDK 28 T 0.00015 GP41 pdb F T 5y18 2 B B ATRX_HUMAN ATP-DEPENDENT HELICASE ATRX,X-LINKED HELICASE II,X-LINKED NUCLEAR PROTEIN,XNP,ZNF-HX SENRIAKKMLLEEIKANLSSDED 23 T 12 DUF6481 pdbhh F Eukaryota T 5y21 2 C,D C,D RNG1A_ARATH RING 1A EVRQKKRRKRSTSR 14 T 3.3 CDC45 unppssm F Eukaryota T 5y53 2 C,F D,F DRIP1_ARATH PEPTIDE FROM E3 UBIQUITIN PROTEIN LIGASE DRIP1 ETVTPKRMRTTQRKRSAT 18 T 36 Ribosomal_L41 pdbhh F Eukaryota T 5y59 2 B C Sir4p NSKLLSLLRSKT 12 T 8.8 SRC-1 pdbhh F T 5y5w 2 E,F,G E,F,G Histone peptide H4K20(me3) KRHRKVLDN 9 T 15 Phage_X pdbhh F T 5y6p 2 C,D C1,D1 LRC4 MAAAFTAPVNLKGSSLTSNTLPAVCSRPAPLTLTPRAQADLPPPGIPSGQDPLDNAPLRHYVPRPVETYEDRGFATILPRTWEGETNTIGAGDIEPVTKEEVEESRKVPVDAASTGAFVEYARMMKEERAQALADQARRNSAPTSGRPTCGETEGTEFVSNARPILVDGVKVVEYWGVPNGPVPRLFGGPGE 192 T 0.026 DUF4786 pdbpssm F T 5y6p 3 E,F E1,F1 LRC5 MAFVSATPVSQAVRPAPALGAQLAASPLRPEIAHASNSSTPRMGYGAYSYITDKTKGHVNQYYVDKFRIASDWTKGTPKTQADAVLGRTFKGAVLVPTEGIPQEFDPAIAPRDNTVDPDPRIAESEGEVYPWDINYFDPQFLPSAYSDVNDPETVDSSFADFRSSMWESRRESLTAQDFGAVARVQRIKNGLDEKYLMTLDGMLDARYARFQKIAEPAVLSPTGTPMTEIPGTPYLGSVGAMDFIAQEEESVAFWKSGPSTTPVNYKRPSGAQTPNLPYNTAAPVAAINEAQEAQKGQMQLSAGDDE 307 T 14 DUF5953 pdbhh F T 5y6p 12 AC,AE,AO,AY,BY,CAA,CG,CI,CT,CV,DAA,DP,DR,VN A2,a3,b9,dw,dx,ey,34,Y5,aY,bY,ez,U8,Z9,b8 LR_gamma4 MDSPAFAVNGMFSAVKVGNSSFTENKVTAVSKTAPTASVRMVVDPFQRKFQSIGKIGIDYSRPKKLATYKRVGYSVGLDFPNAVSMAGHYSLTDCTRAGGAAKILMKYDEYCAKGMLQVYKRSAVSTGVYTTKCTEATQPGVAYDVRVFNRTAAFRQAQKPVNVRLGEQYAARKACVTLAHNCSREEAQFKNMPMSCATFLAGKMEAMGTCYRTVRPSSKAEDYMAGSVRMQVYQKGNASGVYPVGGCEDGHAKGDADLRRVIALASEYRAAQQGAAAVTGAQYASSKMAIQLYGHSCNHEEGQFCDYPAVAAAMCRY 318 T 0.29 ACC_epsilon pdbpercent F T 5y6p 14 BO,CC,CE,CO,CP,CR,DC,DE,DG,DI,DO,DT,DV,WN,XN,YN c9,C2,c3,d9,T8,Y9,D2,d3,44,Z5,e9,aZ,bZ,c8,d8,e8 LR_gamma5 MYAFAPNTPFTASKAVVGKTSFTSPLPAQSESRPTAAPTMVLRTVLRSPVPSGAATVYGYVGRGNISVILAKADEYMAKSVRKQYLAKSNPYGTFGVQCTEGSVKFAADFSRIRALNAEFRAKLGSASKKTFDMYENRKNAISNSHGCHHEETQFVGYKGVSSMYNVSKSEASGSCSRYASPETVVEAAMLRFMDIQVKMAANPTGVYNISCNEGAARGQAEDVRVAALNAAFRQGQKSLGKLLDEKYQQKKQGYSFAHGCNYEEGLINKYPALGAAFRSKSYGY 285 T 0.052 rRNA_methylase pdbpssm F T 5y6p 22 CBA,DBA eY,eZ LR_gamma7 MTAPAFTAPISLTTPHAFSARGLRPATTSSAAPTAVPTPRMSAADKYMARTVTRTAKSAAAGFGVYTPQCTEASGGANTAEATRLAVLAADFRLRQAPLGARFADLYETRRAAVIQACNSSAEEGYATSFPSRAAASVAGRAEGLRACSRYFPQKPPVEEYMAACVDRQYKQMRVHGGVYSTLCADGRSAGDADTARIAALGARFRAQHLSKSQQTQMRYNAMSEARMLARGLCTYEEAQFNAYPKMAGMMRYGTGVYAASVRGPELVVGNKSMTVAEQVNGVNAESYWPSSKVRPAVARGTSPWMGLGVVKSYAAMSEAAMAYGIEQQSKPYVPQKYEGWSSGWKPKSSLM 352 T 0.39 DUF2477 pdbpercent F T 5y6p 23 CCA,CFA,DCA,DFA ly,hY,lz,hZ LR_gamma8 MEPAFVSSFAPKPVITTSLTASSPLSVTARKNAVSTPTMAAYSLDKYAQMSGANAVDTSGASPAASSTWWVAYRDSLKERFNPFRAPANPEVDVGKSKEYFFAQTAYGRILNMVNASRFGKGGDPDELVPPPGAQPADQYMANCIVKQYKAMATPTGVYTTQCTEGVVRGQAEEARNAALSAAFRMKQRSSAQKFGDFCESRRMAVIGAHGCSYEESLLTKFPAAARAYTTASSEAKGNCVRYADGTSPAETYMAACVDKQMKFRSVPMGVYDVLCSDGNTKGVAEYKRVSAMSVRFRSNQMSTLYKMQAKYNNAAYARNYFGHGCSYEENLFNKYPAVSASMRPSTARY 350 T 1.9 rRNA_methylase pdbpercent F T 5y6p 24 CEA,DEA gy,gz LR_gamma6 MAFITSFTPRNLASRSEFTSTSVSTRRPTLARNTIRALFTPPVDEFMASSVQSQYIQKACPSGVPPIQCIEGVTSDQPYAARTLKRQTELRYHQLPVAVKLRKAYETRRAAVVATHGCSHEEGRVLSYPRMASAMLIGQAEASKACSRYFVPNGPAEKHMLQAVENRYMAAVNGSGVFSGACTDGQTRYEAYLMQLRGKSAEFRAKQYSTFEKESMKYAARKQALIQKGHDCNAEEVIFSNYPIVASAMRPTFGYYTPIVKNPGIGSVINIMRPVWDKNSSISSPATLVGVGGFVQP 297 T 0.095 PRMT5 pdb F T 5y7d 1 A A CX04A_HUMAN ENDOTHELIAL-OVEREXPRESSED LIPOPOLYSACCHARIDE-ASSOCIATED FACTOR 1 GMKFGCLSFRQPYAGFVLNGIKTVETRWRPLLSSQRNCTIAVHIAHRDWEGDAWRELLVERLGMTPAQIQTLLRKGEKFGRGVIAGLVDIGETLQCPEDLTPDEVVELENQAVLTNLKQKYLTVISNPRWLLEPIPRKGGKDVFQVDIPEHLIPLGHEVLE 161 T 0.00028 ASCH pdbpercent F Eukaryota T 5y7w 2 C,D C,D YL-2 peptide LLPPTEQDLXKLXXYX 16 T 0.049 STAT6_C pdb F T 5yay 2 B B KI21A_MOUSE Kinesin-like protein KIF21A LMKLCGEVKPKNKARRRTTTQMELLYAD 28 T 4.2 Imm63 pdbhh F Eukaryota T 5yb2 3 D,E,F,J,K,L,N H,G,I,K,J,L,P LP-11 ELTWEEWEKKIEEYTKKIEEILK 23 T 0.069 GP41 pdbhh F T 5ybe 2 B B KI21A_MOUSE KIF21A PKNKARRRTTTQMELLYAD 19 T 2.3 Imm63 pdbhh F Eukaryota T 5ybu 2 B B KI21A_HUMAN KINESIN-LIKE PROTEIN KIF2,RENAL CARCINOMA ANTIGEN NY-REN-62 EVKPKNKARRRTTTQMELLYAD 22 T 2.9 Imm63 pdbhh F Eukaryota T 5yc0 2 D,E,F,J,K,L Q,W,P,H,I,G LP-46 WQEWEQKITALLEQAQIQQEKNEYELQKLDK 31 T 0.00047 GP41 pdbhh F T 5yca 2 B C LEM2_SCHPO LEM DOMAIN PROTEIN 2 GSAEEDDELFQNYVLQQTRK 20 T 0.52 TFIIA unppercent F Eukaryota T 5yco 2 E,F E,F UHRF2_HUMAN E3 ubiquitin-protein ligase UHRF2 NEILQTLLDLFFPGYSK 17 T 0.0058 zf-RING_6 unphh F Eukaryota T 5yd3 2 B,D,F,H B,D,F,H CCR5_HUMAN Epitope peptide DINYYTSEP 9 T 5.5 Pico_P1A pdbhh F Eukaryota T 5yd4 2 B,D,F,H B,D,F,H CCR5_HUMAN Epitope peptide (mutation T6A) DINYYASEP 9 T 3.6 DEC-1_C pdbhh F Eukaryota T 5yd5 2 B,D B,D CCR5_HUMAN Peptide epitope (mutation N3A) DIAYYTSEP 9 T 4.3 DUF3417 pdbhh F Eukaryota T 5ye3 3 C C H4_HUMAN di-acetylated histone H4 SGRGXGGXGLGK 12 T 11 Shadoo unppercent F Eukaryota T 5yf4 2 B B STK26_HUMAN MST3 AND SOK1-RELATED KINASE,MAMMALIAN STE20-LIKE PROTEIN KINASE 4,STE20-LIKE KINASE MST4,SERINE/THREONINE-PROTEIN KINASE MASK THPEWSFTTVRKKPDP 16 T 1.7 MRPL52 pdbhh F Eukaryota T 5ygd 2 B D PIWI_DROME ASP-GLN-GLY-ARG-GLY-ARG-2MR-ARG-PRO-LEU-ASN DQGRGRXRPLN 11 T 7.6 M157 pdbhh F Eukaryota T 5yhr 1 A,B A,B ACR30_BPD31 GENE PRODUCT 30,GP30 GMTKTAQMIAQQHKDTVAACEAAEAIAIAKDQVWDGEGYTKYTFDDNSVLIQSGTTQYAMDADDADSIKGYADWLDDEARSAEASEIERLLESVEEE 97 T 0.13 Transglycosylas unp T Viruses T 5yi7 2 B,D B,D Q9W4I7_DROME RE65495P QQKLPTNPFEVLRQPPKKKKREHACFENPGLNLE 34 T 2.8 LAG1-DNAbind pdbhh F Eukaryota T 5yi8 2 B B Q9W4I7_DROME RE65495P KKKKREHACFENPGLNLELPEKQFNPYEVVRSA 33 T 14 HAGH_C pdbhh F Eukaryota T 5yip 2 B B ANK3_RAT ANK-3,ANKYRIN-G PEDDWTEFSSEEIREARQAAASHAPS 26 T 0.34 GHBP pdbhh F Eukaryota T 5yir 2 D,E,F C,G,H ANK2_HUMAN ANK-2,ANKYRIN-B,BRAIN ANKYRIN,NON-ERYTHROID ANKYRIN VEEEWVIVSDEEIEEARQKAPLEITEY 27 T 3 Pex14_N pdbhh F Eukaryota T 5ykk 1 A A Andersonin-Y1 (AY1) FLPKLFAKITKKNMAHIR 18 T 0.15 Antimicrobial_1 pdbhh F T 5ykl 1 A A designed AY1C FLPKLFAKITKKNMAHIRC 19 T 0.18 Antimicrobial_1 pdbhh F T 5ykq 1 A A designed CAY1 CFLPKLFAKITKKNMAHIR 19 T 0.15 Antimicrobial_1 pdbhh F T 5ylx 3 C C RPOA_PRRS1 PRRSV-NSP9-TMP9 peptide TMPPGFELY 9 T 0.24 MucB_RseB_C pdbhh T Viruses T 5ym9 1 A,B A,B Q5ZTL3_LEGPH DEAMIDASE KLESPGFMVHKKLKSMSQSYGVMMTGVPAEVLGQMQAERSIPSINKTGNLKQQIAKEVSKVCHMMTEPTQSCGQASNDVCELLLGKIEAEKFHFTKYEALSADGDNLKNVLENTAPSSTNLLIRFEIDREDPPIVLVKTKNENFNPETAVKNKIYLLENKLYFIDKMGNLFNLGPGKKKCTQLFNAIGDSAEYSLCDPFVLEEPEKPEDFAISEIVDIFNEQKERFDFWIGSHSFTIYIPQTLGESPRQFYPYQAYFGSHTLQDWFVSDKDEYLSRIGIDKYIEKLAVLGKTTNTKERSDIYAEFFSKRGREAFFCAHLNEKRQPLRVKFKITEINPELALKNLQETQEFIDTHPGENPSDKVENYRNRAKLAMTEHLESLLDI 384 T 0.022 AgrD pdb F Bacteria T 5ymv 3 C,F C,F ALA-VAL-LYS-GLY-VAL-GLY-THR-MET-VAL AVKGVGTMV 9 T 1.6 Spec3 pdbhh F T 5ymw 3 C,F,I,L C,F,I,L SRC_RSVP LEU-PRO-ALA-CYS-VAL-LEU-GLU-VAL LPACVLEV 8 F T Viruses T 5ypo 2 C,D C,D DLGP1_HUMAN SAPAP AARRESYLKATQPSL 15 T 37 EABR pdbhh F Eukaryota T 5ypr 2 B B Synthesized GK inhibitor RIRREEYRRAINGQSF 16 T 5.9 DUF6026 pdbhh F T 5ypu 2 B,D B,D COBL_MOUSE Cordon-Bleu WH2 motif SLHSALXEAIHSSGGREKLRKV 22 T 0.00025 WH2 unppercent F Eukaryota T 5ypz 2 D,E,F D,E,F Q93I65_ECOLX CofJ SPSSSEGGAFTVNMPKTSTVDDIR 24 T 4.9 DUF5808 pdbhh F Bacteria T 5ytp 1 A,B A,B Q5SM04_THET8 TTHA0139 MAKKEKKRLQVVISEEQDALLTRAAYALSSPERAVSKSEVVRLAIEKIARELEEGKAKEELEALLKHLKAEEGEEEA 77 T 0.0032 TAN unppercent F Bacteria T 5yvi 2 B B FUS_HUMAN 75 KDA DNA-PAIRING PROTEIN,ONCOGENE FUS,ONCOGENE TLS,POMP75,TRANSLOCATED IN LIPOSARCOMA PROTEIN GSGGGPGGSHMGGNYGDDRRGGRGGYDRGGYRGRGGDRGGFRGGRGGGDRGGFGPGKMDSRGEHRQDRRERPY 73 T 860 DUF2219 pdbhh F Eukaryota T 5yvk 1 A A V5TER4_9CYAN AMBU4 MGSSHHHHHHSSGLVPRGSHMASTSAVSIPINNAGFENPFMDVVDDYTIDTPPGWTTYDPNNLVPEKRTTWTSNNGVGYVGPGTQFYNQLAPEGRNIGYIYLSQNPGSGVAGFEQILDATLEPDTKYTLTVDVGNLAGTFKGLSFAGFPGYRVELLAGDTVLAADHNNLFIKEGEFKTSTVTYTSTAKDLHLGQKLGIRLVNLLQDKFSGLDFDNVRLTTEPTET 225 T 2.1 DUF642 pdbhh F Bacteria T 5yvp 1 A,B,C,D A,B,C,D A0A1P8VSI6_9CYAN FILC1 MGSSHHHHHHSSGLVPRGSHMASTSVVSIPINNAGFEDPFIEVVDDYTVDTPPGWTTYNPNNLVPEKRTTWTSNNGVGYVGPGTQFYNQLAPEGRNIGYIYLAQKPGSGVAGFEQILDATLEPDTNYTLKVDVGNLAGTFKGLSFAGFPGYRVELLAGDTVLAADHNNLFIKEGEFKTSTVTYTSTAKDLHLGQKLGIRLVNLLQDKFSGLDFDNVRLTAEPTEA 225 T 2.2 DUF642 pdbhh F Bacteria T 5yy9 2 C,D C,D DNLI1_HUMAN Ligase 1 IPKRRTARKQLPK 13 T 26 EAV_GS pdbhh F Eukaryota T 5yyf 2 B,D B,D Peptide inhibitor PHQ-H3(Q5-K9) XQTARKX 7 T 390 MC1 pdbhh F T 5yyz 2 B B HOP1_YEAST Meiosis-specific protein HOP1 QASIQPTQFVSNN 13 T 4.4 ParBc_2 pdbhh F Eukaryota T 5yz9 1 A A MTA70_HUMAN METHYLTRANSFERASE-LIKE PROTEIN 3,HMETTL3,N6-ADENOSINE-METHYLTRANSFERASE 70 KDA SUBUNIT,MT-A70 AHMSIVEKFRSRGRAQVQEFCDYGTKEECMKASDADRPCRKLHFRRIINKHTDESLGDCSFLNTCFHMDTCKYVHYEIDASMDSEAPGSKDHTPSQELALTQ 102 T 0.0081 DUF445 unp F Eukaryota T 5z08 2 C C G2R3T1_THITE Cenp-K RQKDEWAKKTSSLMKQLDWFIGEHLGAMLAAEELGGPVVGELMEIDPDDLSAGFNAHGKLKKATSQPDLDRRQRRIDDIWGPQDEQGQAHKRKRGADEALAASAEMRDLIEQLMNKLVEAGGDNSATYVEIPRESAAARFLVRSKVAMFHPNDARRLRLVDFGRDLDD 168 T 4E-08 CENP-K pdb F Eukaryota T 5z1v 1 A,B,C,D A,B,C,D A0A0H4ITX1_MAGOR AvrPib protein MSHHHHHHSMAMTQVTILKKGERITWVEVPKGESREFNIRGKYFTVSVSDDGTPSISGSKYTVE 64 T 0.24 Picorna_P3A unppercent F Eukaryota T 5z1y 1 A A C3Z8S4_BRAFL mBjAMP1 peptide NLCASLRARHTIPQCRKFGRR 21 T 6.7 CENP-O pdbhh F Eukaryota T 5z26 1 A A SC51_SHEEP SMAP-18 RGLRRLGRKIAHGVKKYG 18 T 0.095 CAP18_C unppercent F Eukaryota T 5z28 1 A,B A,B VAL2_ARATH PROTEIN HIGH-LEVEL EXPRESSION OF SUGAR-INDUCIBLE-LIKE 1,PROTEIN VP1/ABI3-LIKE 2 AIKVCMNALCGAASTSGEWKKGWPMRSGDLASLCDKCGCAYEQSIFCEVFHAKESGWRECNSCDKRLHCGCIASRFMMELLENGGVTCISCAKKSGLISMNVS 103 T 0.19 FrhB_FdhB_N pdb F Eukaryota T 5z2c 1 A,B,C,D,E,F,G,H,I A,B,C,D,E,F,G,H,I ALPK1_HUMAN CHROMOSOME 4 KINASE,LYMPHOCYTE ALPHA-PROTEIN KINASE MNNQKVVAVLLQECKQVLDQLLLEAPDVSEEDKSEDQRCRALLPSELRTLIQEAKEMKWPFVPEKWQYKQAVGPEDKTNLKDVIGAGLQQLLASLRASILARDCAAAAAIVFLVDRFLYGLDVSGKLLQVAKGLHKLQPATPIAPQVVIRQARISVNSGKLLKAEYILSSLISNNGATGTWLYRNESDKVLVQSVCIQIRGQILQKLGMWYEAAELIWASIVGYLALPQPDKKGLSTSLGILADIFVSMSKNDYEKFKNNPQINLSLLKEFDHHLLSAAEACKLAAAFSAYTPLFVLTAVNIRGTCLLSYSSSNDCPPELKNLHLCEAKEAFEIGLLTKRDDEPVTGKQELHSFVKAAFGLTTVHRRLHGETGTVHAASQLCKEAMGKLYNFSTSSRSQDREALSQEVMSVIAQVKEHLQVQSFSNVDDRSYVPESFECRLDKLIL 446 T 0.24 RNPP_C pdbhh F Eukaryota T 5z2o 1 A A G2,7,13A SMAP-18 analogue RALRRLARKIAHAVKKYG 18 T 3.4 DUF5664 pdbhh F T 5z31 1 A A LYS-ASN-LYS-SER-ARG-VAL-ALA-ARG-GLY-TRP-GLY-ARG-LYS-CYS-PRO-LEU-PHE-GLY KNKSRVARGWGRKCPLFG 18 T 0.0093 Flavi_glycoprot pdbhh F T 5z32 1 A A VAL-ALA-ARG-GLY-TRP-GLY-ARG-LYS-CYS-PRO-LEU-PHE-GLY-LYS-ASN-LYS-SER-ARG VARGWGRKCPLFGKNKSR 18 T 0.0017 Flavi_glycoprot pdbhh F T 5z3a 1 A A IMGH_KRIFD Glycoside hydrolase 15-related protein MGSSHHHHHHSSGLVPRGSHMTTSARDTGLDSHELARLHELARHSHAVITRHQDAGGAYPAAPTFSAYRGYAWLRDGSFTAEGISRYGDVASAGRFHDWVDGVLRRRRGQVDDLLAAVDRGEVPSNEGMLPTRFTFDGNDGSDPWWDFQTDGYGMWLWSVVTHAARHGLDLERWRAGIDVAVDYLLAFWDRPCYDWWEEHVEHRHVSTLGAIHGGLVAVGTCAALRSAPWSAATLQVAARIRSLVSAEGVVDGHLVKWLGSSAVDGSLPACVVPFGLVPPDDDVAAMTRAAVAKDLDVDGGVHRFAADVFYGGGQWILLSALLGWNLAAAGDTAGALRHLRWIADQADADGDLPEQVPHHLLHPGSRAEWVARWGTVATPLLWSHGMYLILADELGLLPPAAKDA 405 T 7.9 Glyco_hydro_15 unppssm F Bacteria T 5z3b 1 A A IMGH_KRIFD Glycoside hydrolase 15-related protein MGSSHHHHHHSSGLVPRGSHMTTSARDTGLDSHELARLHELARHSHAVITRHQDAGGAYPAAPTFSAFRGYAWLRDGSFTAEGISRYGDVASAGRFHDWVDGVLRRRRGQVDDLLAAVDRGEVPSNEGMLPTRFTFDGNDGSDPWWDFQTDGYGMWLWSVVTHAARHGLDLERWRAGIDVAVDYLLAFWDRPCYDWWEEHVEHRHVSTLGAIHGGLVAVGTCAALRSAPWSAATLQVAARIRSLVSAEGVVDGHLVKWLGSSAVDGSLPACVVPFGLVPPDDDVAAMTRAAVAKDLDVDGGVHRFAADVFYGGGQWILLSALLGWNLAAAGDTAGALRHLRWIADQADADGDLPEQVPHHLLHPGSRAEWVARWGTVATPLLWSHGMYLILADELGLLPPAAKDA 405 T 7.9 Glyco_hydro_15 unppssm F Bacteria T 5z3c 1 A A IMGH_KRIFD Glycoside hydrolase 15-related protein MGSSHHHHHHSSGLVPRGSHMTTSARDTGLDSHELARLHELARHSHAVITRHQDAGGAYPAAPTFSAYRGYAWLRDGSFTAEGISRYGDVASAGRFHDWVDGVLRRRRGQVDDLLAAVDRGEVPSNEGMLPTRFTFDGNDGSDPWWDFQTDGYGMWLWSVVTHAARHGLDLERWRAGIDVAVDYLLAFWDRPCYDWWAEHVEHRHVSTLGAIHGGLVAVGTCAALRSAPWSAATLQVAARIRSLVSAEGVVDGHLVKWLGSSAVDGSLPACVVPFGLVPPDDDVAAMTRAAVAKDLDVDGGVHRFAADVFYGGGQWILLSALLGWNLAAAGDTAGALRHLRWIADQADADGDLPEQVPHHLLHPGSRAEWVARWGTVATPLLWSHGMYLILADELGLLPPAAKDA 405 T 7.9 Glyco_hydro_15 unppssm F Bacteria T 5z3d 1 A A IMGH_KRIFD Glycoside hydrolase 15-related protein MGSSHHHHHHSSGLVPRGSHMTTSARDTGLDSHELARLHELARHSHAVITRHQDAGGAYPAAPTFSAYRGYAWLRDGSFTAEGISRYGDVASAGRFHDWVDGVLRRRRGQVDDLLAAVDRGEVPSNEGMLPTRFTFDGNDGSDPWWDFQTDGYGMWLWSVVTHAARHGLDLERWRAGIDVAVDYLLAFWDRPCYDWWEEHVEHRHVSTLGAIHGGLVAVGTCAALRSAPWSAATLQVAARIRSLVSAEGVVDGHLVKWLGSSAVDGSLPACVVPFGLVPPDDDVAAMTRAAVAKDLDVDGGVHRFAADVYYGGGQWILLSALLGWNLAAAGDTAGALRHLRWIADQADADGDLPEQVPHHLLHPGSRAEWVARWGTVATPLLWSHGMYLILADELGLLPPAAKDA 405 T 7.9 Glyco_hydro_15 unppssm F Bacteria T 5z3e 1 A A IMGH_KRIFD Glycoside hydrolase 15-related protein MGSSHHHHHHSSGLVPRGSHMTTSARDTGLDSHELARLHELARHSHAVITRHQDAGGAYPAAPTFSAYRGYAWLRDGSFTAEGISRYGDVASAGRFHDWVDGVLRRRRGQVDDLLAAVDRGEVPSNEGMLPTRFTFDGNDGSDPWWDFQTDGYGMWLWSVVTHAARHGLDLERWRAGIDVAVDYLLAFWDRPCYDWWEEHVEHRHVSTLGAIHGGLVAVGTCAALRSAPWSAATLQVAARIRSLVSAEGVVDGHLVKWLGSSAVDGSLPACVVPFGLVPPDDDVAAMTRAAVAKDLDVDGGVHRFAADVFYGGGQWILLSALLGWNLAAAGDTAGALRHLRWIADQADADGDLPAQVPHHLLHPGSRAEWVARWGTVATPLLWSHGMYLILADELGLLPPAAKDA 405 T 7.9 Glyco_hydro_15 unppssm F Bacteria T 5z53 1 A,B,C,D A,B,C,D A0A1P8VSI6_9CYAN FILC1 MKRKLIVAVVCLIFICFGINTPAHATSVVSIPINNAGFEDPFIEVVDDYTVDTPPGWTTYNPNNLVPEKRTTWTSNNGVGYVGPGTQFYNQLAPEGRNIGYIYLAQKPGSGVAGFEQILDATLEPDTNYTLKVDVGNLAGTFKGLSFAGFPGYRVELLAGDTVLAADHNNLFIKEGEFKTSTVTYTSTAKDLHLGQKLGIRLVNLLQDKFSGLDFDNVRLTAEPTEA 227 T 3.2 DUF642 pdbhh F Bacteria T 5z54 1 A,B,C,D A,B,C,D A0A076NBW8_9CYAN HPIU5 MKRKLIVAVVCLIFICFGINTPAHATSVVSIPINNAGFEDPFIEVVDDYTVDTPPGWTTYNPNNLVPEKRTTWTSNNGVGYVGPGTQFYNQLAPEGRNIGYIYLAQKPGSGVAGFEQILDATLEPDTKYTLKVDVGNFGGEFQKISLAGFPGYRVELLAGDTVLAADHNNLYIKDGEFKTSTVTFTATPDNPYLDQKLGIRLINLLQGTFSGLDFDNVRLTVEPAQT 227 T 3.4 DUF4969 pdbhh F Bacteria T 5z8h 2 B B Peptide inhibitor XAGESLYEX 9 T 24 DUF3928 pdbhh F T 5z93 2 B B V9H1G0_HUMAN Gene for histone H3 (germline gene) TARXSTGGKA 10 T 0.044 PAF unp F Eukaryota T 5z94 2 C,D C,D V9H1G0_HUMAN Gene for histone H3 (germline gene) ARTXQTARKSTGGKA 15 T 0.044 PAF unp F Eukaryota T 5zc3 1 A,B B,A A0A3F2YLV0_PHYCP RxLR effector MDTTDIKPVRPAINLQQPPFVVGRLLRTVQDEERGFTLPGAGKLADLFESTALKLAQSARINTWLVKGTSVDDAFLKLELNTAGSRIFENPKLLTWAVYVTKVEKQNPEEIILAKLSKQFTEGSLAKMIASAKLDSKTEGLATILQAQQRQVWVDAGKSSDEVFKLLQLDEAGTKLFKNQQFSTWTSFVDAFNRKYPEKAVSIFSKLAKTYDGFTLWKMLEAAKKVPKTEIIASKLQAQQIDAWLDAGKSTDEVFNLLKLQRTGDKLFKNSQFLTWVSYVEKFNKKDPDQAIAIFSKLAGVYDQVTLSSMLEAAKHVPSTKRIASYLQGQQNQHWLADGKSTDDIFKLLKLNTPSPENLIDPRLDAWTSFMRAFNMANEGKETTLIATLTTHYKDRGLAQLLQEGTKFASTKKIAEELQTAQFARWLQLGKTEDDIFALLKLKLTTPTTDPEAIVFYQYKLFMDAHMKLAAA 472 T 0.0012 RXLR pdbhh F Eukaryota T 5zcn 1 A A A0A381AKI5_BREDI brevunsin DGMGEEFIEGLVRDSLYPPAG 21 T 4.5 DUF4090 unphh F Bacteria T 5zgb 12 L O M1VFJ4_CYAM1 PsaO MYGFVSVLPVASALQRQQCTCAARCSFTTRAARVAPVRIALSRPQRLVGASSLRMFEVSDGEPYPLNPAVIFIALIGWSAVAAIPSNIPVLGGTGLTQAFLASIQRLLAQYPTGPKLDDPFWFYLIVYHVGLFALLIFGQIGYAGYAKGTYNRSA 155 T 23 YbgT_YccB pdbhh F Eukaryota T 5zgc 2 G,H,I,J,K,L G,H,I,J,K,L H4_HUMAN Histone H4K16bhb peptide GKGGAXRHRKV 11 T 11 Shadoo unppercent F Eukaryota T 5zia 3 C,F,I,L,O,R R,C,F,J,N,Q TAU_HUMAN phosphorylated tau peptide SPSSAKSRL 9 T 3.4 MinE pdbhh F Eukaryota T 5zji 17 Q O B6SQZ7_MAIZE 16kDa membrane protein MHLLASCCFTRGSRVSARNPLMSRNLERNGRITCMTFPRDWLRRDLSVIGFGLIGWMGPSSVPAINGNSLTGLFFSSIGQELAHFPTPPPVTSQFWLWLVTWHLGLFIVLTFGQIGFKGRTEDYFEK 127 T 0.0085 Plasmid_RAQPRD pdbpercent F Eukaryota T 5zjy 2 B B LYS-LYS-ARG-TYR-SER-ARG-2JN-GLN-LEU-LEU-2JN-PHE XKKRYSRXQLLXFX 14 T 6.6 Tachystatin_A pdbhh F T 5zjz 2 B B EIF-4G1 XKKRYSRXQLLXFWX 15 T 3.1 BURAN pdbhh F T 5zk5 2 B B IF4G1_HUMAN LYS-ARG-TYR-SER-ARG-GLU-GLN-LEU-LEU-MK8-PHE-GLN-ARG-MK8 XKKRYSREQLLXFQRXX 17 T 0.00025 eIF_4G1 unphh F Eukaryota T 5zk7 2 C,D C,D ACE-ARG-TYR-SER-ARG-MK8-GLN-LEU-LEU-MK8-LEU-PHE-ARG-NH2 XRYSRXQLLXLFRX 14 T 8.4 Hat1_N pdbhh F T 5zk9 2 B B ACE-ARG-ILE-ILE-TYR-SER-ARG-MK8-GLN-LEU-LEU-MK8-LEU-LYS-NH2 XRIIYSRXQLLXLKX 15 T 0.14 eIF_4EBP pdbhh F T 5zml 2 B B ACE-LYS-LYS-ARG-TYR-SER-ARG-MK8-GLN-LEU-LEU-MK8-PHE-ARG-ARG XKKRYSRXQLLXFRRR 16 T 5.4 BURAN pdbhh F T 5zmo 1 A A Q9L0M9_STRCO Uncharacterized protein McrA GSREAPKTFHRRVGDVRPARRAMGPALHRPVLLLWAIGQAVARAPRLQPWSTTRDAVAPLMEKYGQVEDGVDGVRYPFWALVRDDLWCVEQAEELTLTSRGRRPTLESLNAVDPSAGLREDDYNLLRSQPEAAASAAAGLIARYFHLLPAGLLEDFGLHELLAGRWPDALRP 172 T 29 RepA_C pdbhh F Bacteria T 5zmr 1 A A RPN5_YEAST PROTEASOME NON-ATPASE SUBUNIT 5 MSRDAPIKADKDYSQILKEEFPKIDSLAQNDCNSALDQLLVLEKKTRQASDLASSKEVLAKIVDLLASRNKWDDLNEQLTLLSKKHGQLKLSIQYMIQKVMEYLKSSKSLDLNTRISVIETIRVVTENKIFVEVER 136 T 0.013 ERp29 unppercent F Eukaryota T 5zng 2 B C Q8J180_MAGGR AVR1-CO39 MAWKDCIIQRYKDGDVNNIYTANRNEEITIEEYKVFVNEACHPYPVILPDRSVLSGDFTSAYADDDESCYRHHHHHH 77 T 3.8 Ceramidase_alk pdbhh F Eukaryota T 5zoo 2 B A NCOR2_HUMAN SMRT corepressor SP1 fragment HIRGSITQGIPRSY 14 T 8.8 DUF1149 pdbhh F Eukaryota T 5zop 2 B A NCOR2_HUMAN SMRT corepressor SP2 fragment EGSITQGTPLKY 12 T 8.6 PrmC_N pdbhh F Eukaryota T 5zpw 2 B,D,F B,D,F MET-THR-TRP-GLU-GLU-TRP-ASP-MK8-LYS-ILE-GLU-MK8-TYR-THR-MK8-LYS-ILE-GLU-MK8-LEU-ILE-LYS-LYS-SER MTWEEWDXKIEXYTXKIEXLIKKS 24 T 0.036 GP41 pdbhh F T 5zqg 2 C C PEPTIDE LEU-ALA-GLN-LEU-GLN-VAL-ALA KLAQLQVAYHQ 11 T 14 GTP-bdg_M pdbhh F T 5zqv 2 E,F,G,H E,F,G,H PPR3A_HUMAN PROTEIN PHOSPHATASE 1 GLYCOGEN-ASSOCIATED REGULATORY SUBUNIT,PROTEIN PHOSPHATASE TYPE-1 GLYCOGEN TARGETING SUBUNIT,RG1 MEPSEVPSQISKDNFLEVPNLSDSLCEDEEVTFQPGFSPQPSRRGSDSSEDIYLDTPSSGTRRVSFADSFGFNLVSVKEFDSWELPSASTTFDLGTDIF 99 T 4.8 RSD-2 pdbhh F Eukaryota T 5zt0 2 G,H,I,J G,H,I,J PPR3B_HUMAN Protein phosphatase 1 regulatory subunit 3B SKPLRPCIQLSSKNEASGMVAPAVQEKKVKKRVSFADNQGLALTMVKVFSEFDDPLDMPFNITELLDNIVSLTTA 75 T 0.018 DUF4913 pdbpssm F Eukaryota T 5zt3 1 A A M1SWB3_ORYSI WA352 AHMQEAANRSPPYAPYPYPVDEIIGGDSVQSIQRRLLGTNWNPSAHDMQMSRIQAEDLFELKVEIIRKMAGLHPSGDWMGWGARALDNPRTATGEEDLARLHQMLDDLQSRNEQSATFWRLVERVRLRAD 130 T 0.18 Rnk_N pdb F Eukaryota T 5zuj 2 B I TIFA_HUMAN TRAF2-BINDING PROTEIN SSQSQSPTEDDENES 15 T 41 Eapp_C pdbhh F Eukaryota T 5zut 2 B E Q5T4P3_HUMAN PHOSPHATIDYLINOSITOL 3-KINASE REGULATORY SUBUNIT GAMMA EVMMPYSTELIFYIEMDP 18 T 1.9 Colicin_Pyocin pdbhh F Eukaryota T 5zv3 1 A A TAU_HUMAN TAU PEPTIDE TEDGSEEPGSETSDAKSTPT 20 T 55 DUF6318 pdbhh F Eukaryota T 5zvf 1 A A CTHL4_BOVIN non glycosylated analogue of Indolicidin ILPWKWKWTPWRRX 14 T 0.21 SAG unp F Eukaryota T 5zys 2 B B Nephrin LPFELRGHLV 10 T 2 Saccharop_dh_N pdbhh F T 5zz9 2 D,E,F D,E,F DREB_HUMAN DEVELOPMENTALLY-REGULATED BRAIN PROTEIN LLNFDELPEPPATFCDPEEVEGSGENLQ 28 T 31 DUF4604 pdbhh F Eukaryota T 6a27 1 A,B A,B PPRA_DEIRA PLEIOTROPIC PROTEIN PROMOTING DNA REPAIR MARAKAKDQTDGIYAAFDTLMSTAGVDSQIAALAASEADAGTLDAALTQSLQEAQGRWGLGLHHLRHEARLTDDGDIEILTDGRPSARVSEGFGALAQAYAPMQALDERGLSQWAALGEGYRAPGDLPLAQLKVLIEHARDFETDWSAGRGETFQRVWRKGDTLFVEVARPASAEAALSDAARDVIASIKDRAFQRELMRRSEKDGMLGALLGARHAGAKANLAQLPEAHFTVQAFVQTLSGAAARNAEEYRAALKTAAAALEEYQGVTTRQLSEVLRHGLRES 284 T 8.9 Ldt_C pdbhh F Bacteria T 6a29 1 A,B,C,D,E,F,G,H A,B,C,D,E,F,G,H PPRA_DEIRA PLEIOTROPIC PROTEIN PROMOTING DNA REPAIR MARAKAKDQTDGIYAAFDTLMSTAGVDSQIAALAASEADAGTLDAALTQSLQEAQGRWGLGLHHLRHEARLTDDGDIEILTDGRPSARVSEGFGALAQAYAPMQALDERGLSQWAALGEGYRAPGDLPLAQLKVLIEHRRDFETDWSAGRGETFQRVWRKGDTLFVEVARPASAEAALSDAAWDVIASIKDRAFQRELMRRSEKDGMLGALLGARHAGAKANLAQLPEAHFTVQAFVQTLSGAAARNAEEYRAALKTAAAALEEYQGVTTRQLSEVLRHGLRES 284 T 8.9 Ldt_C pdbhh F Bacteria T 6a2b 3 C C TYR-MET-MET-PRO-ARG-HIS-TRP-PRO-ILE YMMPRHWPI 9 T 2.9 RS4NT pdbhh F T 6a33 2 B I TIFA_HUMAN PUTATIVE MAPK-ACTIVATING PROTEIN PM14,PUTATIVE NF-KAPPA-B-ACTIVATING PROTEIN 20,TRAF2-BINDING PROTEIN SSQSSSPTEMDENES 15 T 22 RhoGEF67_u1 pdbhh F Eukaryota T 6a38 4 D D NS2_MUMIP MVM NS2 NES GGSTVDEMTKKFGTLTIHDT 20 T 0.33 DUF6118 unppssm T Viruses T 6a3a 4 D D NS2_MUMIP MVM NES mutant Nm2 GGSTVEDMTKKFGTLTIHDT 20 T 0.33 DUF6118 unppssm T Viruses T 6a3b 4 D D NS2_MUMIP MVM NES mutant Nm13 DDTVDEMTKKFGTLTIHD 18 T 0.33 DUF6118 unppssm T Viruses T 6a3c 4 D D NS2_MUMIP MVM NES mutant Nm12 GGSTVDEMTKKFGTLTIHDDD 21 T 0.33 DUF6118 unppssm T Viruses T 6a3e 4 D D NS2_MUMIP MVM NES mutant Nm15 GGSDDTVDELTKKFGTLTIHDDD 23 T 0.33 DUF6118 unppssm T Viruses T 6a48 1 A A RELN_MOUSE REELER PROTEIN EIHSDSVILRDDFDSYQQLELNPNIWVECSNCEMGEQCGTIMHGNAVTFCEPYGPRELTTTCLNTTTASVLQFSIGSGSCRFSYSDPSITVSYAKNNTADWIQLEKIRAPSNVSTVIHILYLPEEAKGESVQFQWKQDSLRVGEVYEACWALDNILVINSAHREVVLEDNLDPVDTGNWLFFPGATVKHSCQSDGNSIYFHGNEGSEFNFATTRDVDLSTEDIQEQWSEEFESQPTGWDILGAVVGADCGTVESGLSLVFLKDGERKLCTPYMDTTGYGNLRFYFVMGGICDPGVSHENDIILYAKIEGRKEHIALDTLTYSSYKVPSLVSVVINPELQTPATKFCLRQKSHQGYNRNVWAVDFFHVLPVLPSTMSHMIQFSINLGCGTHQPGNSVSLEFSTNHGRSWSLLHTECLPEICAGPHLPHSTVYSSENYSGWNRITIPLPNAALTRDTRIRWRQTGPILGNMWAIDNVYIGPSCLKFCSGRGQCTRHGCKCDPGFSGPACEMASQTFPMFISESFGSARLSSYHNFYSIRGAEVSFGCGVLASGKALVFNKDGRRQLITSFLDSSQSRFLQFTLRLGSKSVLSTCRAPDQPGEGVLLHYSYDNGITWKLLEHYSYVNYHEPRIISVELPDDARQFGIQFRWWQPYHSSQGEDVWAIDEIVMTSRLENLYF 677 T 0.0021 EGF_2 pdb F Eukaryota T 6a51 1 A,B A,B Q0PBQ6_CAMJE CYSTEINE PERMEASE MGSSHHHHHHSSMKSLILPPNEFLDHYILNAEFHRFAGISKNAYKFWKNVEIGRYQGTRIIFLHRNCILEKHQQALRQCSGLNGFVLASAFCSFTGLAPSHLVEKNNSSIYKLLELKEICGIKFVNLKKFYDFLGLNYHQHIYIEKCHFFSPAPFEKRIKITESMCVGYY 170 T 0.1 DUF1247 pdbpssm F Bacteria T 6a56 1 A,B A,B A0A2Z5WLM1_ANTJA AJLec QRCGGWVKLNTAPVCFSAKGNRPGSFTPSHHGFLKSVKLRHLRGLVTCQSSTDAHDSYWGCKNRDGFHNYPLNVFVTDKHNKVMFPKTGATYYLDPYVIKNRFYGVQGYNAMSPELVLQHGCNSPSDYIGPDSQLRVWYGEDLYNTMESDNSGKVCADVFGYFV 164 T 0.027 CTP_transf_like pdbpssm F Eukaryota T 6a5d 1 A,B A,B LLG1_ARATH LORELEI-LIKE-GPI-ANCHORED PROTEIN 1 SFISDGVFESQSLVLGRNLLQTKKTCPVNFEFMNYTIITSKCKGPKYPPKECCGAFKDFACPYTDQLNDLSSDCATTMFSYINLYGKYPPGLFANQCKEGKEGLECPAGSQLPPETSAEVNAATTSSSRLWLTVSA 136 T 0.53 MIT_LIKE_ACTX pdbpercent F Eukaryota T 6a5e 2 B,E C,D LLG2_ARATH LORELEI-LIKE-GPI-ANCHORED PROTEIN 2 TTCKEDFANKNYTIITSRCKGPNYPANVCCSAFKDFACPFAEVLNDEKNDCASTMFSYINLYGRYPPGIFANMCKEGKEGLDCT 84 T 16 SPARK pdbhh F Eukaryota T 6a5q 2 D,E,F D,E,F TFEB pS211-peptide LVGVTSSSCPADLTQ 15 T 7.5 Hormone_4 pdbhh F T 6a6c 1 A A A0A1S4NYE1_9BACL Beta-1,3-glucanase ADFTQGADVSGNNVTLWFKSSVNTTWVDVHYKVNSGVQQNVRMSFNAGAARFEHTILTAAQAEIEYFFTYNNGVPAYDTTTFTYR 85 T 0.0051 DUF6209 pdbpercent F Bacteria T 6a6i 1 A,C,E,G A,C,E,G Q59FF6_HUMAN Excision repair cross-complementing rodent repair deficiency, complementation group 6 variant GPGHMLPERLESESGHLREASALLPTTEHDDLLVEMRNFIAFQAHTDGQASTREILQEFESKLSASQSCVFRELLRNLCTFHRTSGGEGIWKLKPEYC 98 T 0.0018 TFIIF_beta pdbpssm F Eukaryota T 6a6w 2 B B SAD1_SCHPO Spindle pole body-associated protein sad1 GPLSDNEEFENVVKNGH 17 T 0.013 UPF0257 unppercent F Eukaryota T 6a8g 1 A,D P,E muPAin-1-IG XCPAYSRYIGCX 12 T 3.2 DUF6438 pdbhh F T 6a98 1 A,B,C,D A,B,C,D A0A1P8VSL7_9CYAN aromatic prenyltransferase MGSSHHHHHHSSGLVPRGSHMASAVSIPIKNAGFEEPSLTVEDYYTIDTPPGWITYDPNGLVPAKRTRITSNNGVGYTGPNSAYYNHKAPEGRNVAYVYLAQEIGSGIAGLEQTLDAVLKPNTKYTLTVDIGNSGGSFQGFPLDGFPGYRVELLAGDTVLAADQNNLYIKEKDFKTTTVTFIATPESPYLGQHLGIRLINPLQGKFSGVDFDNVRLTAEPAET 223 T 0.26 CBM_4_9 unppercent F Bacteria T 6a9c 2 C E C4M4E9_ENTHI FP10(GEF) PEPTIDE KVAPPIPHR 9 T 19 NapB pdbhh F Eukaryota T 6a9w 1 A A M5AAG8_9CAUD Primase MGSSHHHHHHSSGLVPRGSHMIMEIPAIKALSRYAQWVIWKKERDTKIPYNPNNGKKASSTDPLAWGDIDEAQAGLVRYGANGLGFVLTKSDPFVFIDLDHVLDENKRVKCEWARQLLKEIKSYTEISPSGDGLHVVVSGKLPDYIKHKTKFDDGSALEVYESGRYMTITGEVFDGRDDIKELDLSILGEFAEHKIETKNAPVQIESATTLDDEAIIDLMKRKGQWPDAPKDGDDWSSLDMSFANRLAFWCGKDIERMDRIFRQSPLMRQKWDRPTAGSTYGRITLKKACDFVDSVYDPALRNESDCPFEPYNEEGGPRN 320 T 0.0011 VirE_N pdbhh T Viruses T 6a9x 2 B A ANK3_RAT ANKG, ANK-3,ANKYRIN-G DDWTEFSSEEIREARQAAASHAPS 24 T 1.6 CFIA_Pcf11 pdbhh F Eukaryota T 6aaf 2 B B TM184_SCHPO HFL1(386-409) MLQFEIDDEMEPLYNQAKQMRYGDYLEVLFQ 31 T 0.028 DUF6022 pdbpssm F Eukaryota T 6aaw 2 B B ACE-LEU-THR-PHE-STQ-GLU-TYR-DTR-GLN-LEU-CBA-MK8-SER-ALA-ALA XLTFXEYXQLXXSAAX 16 T 2.6 Nmad2 pdbhh F T 6aay 1 A A K1LVU1_9FLAO Bergeyella zoohelcum Cas13b (R1177A) mutant MENKTSLGNNIYYNPFKPQDKSYFAGYFNAAMENTDSVFRELGKRLKGKEYTSENFFDAIFKENISLVEYERYVKLLSDYFPMARLLDKKEVPIKERKENFKKNFKGIIKAVRDLRNFYTHKEHGEVEITDEIFGVLDEMLKSTVLTVKKKKVKTDKTKEILKKSIEKQLDILCQKKLEYLRDTARKIEEKRRNQRERGEKELVAPFKYSDKRDDLIAAIYNDAFDVYIDKKKDSLKESSKAKYNTKSDPQQEEGDLKIPISKNGVVFLLSLFLTKQEIHAFKSKIAGFKATVIDEATVSEATVSHGKNSICFMATHEIFSHLAYKKLKRKVRTAEINYGEAENAEQLSVYAKETLMMQMLDELSKVPDVVYQNLSEDVQKTFIEDWNEYLKENNGDVGTMEEEQVIHPVIRKRYEDKFNYFAIRFLDEFAQFPTLRFQVHLGNYLHDSRPKENLISDRRIKEKITVFGRLSELEHKKALFIKNTETNEDREHYWEIFPNPNYDFPKENISVNDKDFPIAGSILDREKQPVAGKIGIKVKLLNQQYVSEVDKAVKAHQLKQRKASKPSIQNIIEEIVPINESNPKEAIVFGGQPTAYLSMNDIHSILYEFFDKWEKKKEKLEKKGEKELRKEIGKELEKKIVGKIQAQIQQIIDKDTNAKILKPYQDGNSTAIDKEKLIKDLKQEQNILQKLKDEQTVREKEYNDFIAYQDKNREINKVRDRNHKQYLKDNLKRKYPEAPARKEVLYYREKGKVAVWLANDIKRFMPTDFKNEWKGEQHSLLQKSLAYYEQCKEELKNLLPEKVFQHLPFKLGGYFQQKYLYQFYTCYLDKRLEYISGLVQQAENFKSENKVFKKVENECFKFLKKQNYTHKELDARVQSILGYPIFLERGFMDEKPTIIKGKTFKGNEALFADWFRYYKEYQNFQTFYDTENYPLVELEKKQADRKRKTKIYQQKKNDVFTLLMAKHIFKSVFKQDSIDQFSLEDLYQSREERLGNQERARQTGERNTNYIWNKTVDLKLCDGKITVENVKLKNVGDFIKYEYDQRVQAFLKYEENIEWQAFLIKESKEEENYPYVVEREIEQYEKVRREELLKEVHLIEEYILEKVKDKEILKKGDNQNFKYYILNGLLKQLKNEDVESYKVFNLNTEPEDVNINQLKQEATDLEQKAFVLTYIANKFAHNQLPKKEFWDYCQEKYGKIEKEKTYAEYFAEVFKKEKEALIKLEHHHHHH 1232 T 0.32 HcgB pdbpssm F Bacteria T 6aci 1 A A B7UI21_ECO27 T3SS secreted effector NleB homolog SGRPSFAGKEYSLEPIDERTPILFQWFEARPERYEKGEVPILNTKEHPYLSNIINAAKIENERIIGVLVDGNFTYEQKKEFLNLENEHQNIAIIYRADVDFSMYDKKLSDIYLENIHKQESYPASERDNYLLGLLREELKNIPEGKDSLIESYAEKREHTWFDFFRNLAILKAGSLFTETGKTGCHNISPCSGCIYLDADMIITDKLGVLYAPDGIAVHVDCNDEIKSLENGAIVVNRSNHPALLAGLDIMKSKVDAHPYYDGLGKGIKRHFNYSSLHNYNAFCDFIEFKHENIIPNTSMYTSSSW 306 T 2.2E-05 Glyco_transf_88 unphh F Bacteria T 6aco 2 B B H2B1C_HUMAN succinyl peptide H2BK120 AVTXYTS 7 T 42 DUF5611 pdbhh F Eukaryota T 6adq 5 E,Q I,U A0R1B6_MYCS2 Cytochrome c oxidase subunit CtaJ MSTALTHGLIGGVPLVLFAVLALIFLTRKGPHPDTYKMSDPWTHAPILWAAEEPREHGHGGHGHDSHGVVIGGGASGKW 79 T 0.08 VPDSG-CTERM pdbpssm F Bacteria T 6af0 3 C C G2Q3X1_MYCTT Cdc73 protein SAASGRAGRGTLDPRLAQIYSGERRMGDRNTALRGIKPTDFSHVRKLAAPFVTRKPGAAPSAGVGASATLALNQ 74 T 0.095 DUF5529 unppercent F Eukaryota T 6aht 1 A A A1BZ87_BACCE DNA-BINDING PROTEIN LSNISMSSSEIIDVLCENLNDGIWALRVLYAEGAMNKEKLWDYINQYHKDYQIENEKDYEGKKILPSRYALDIMTARLEGAGLISFKAIGRVRIYDVTDLGNVLIKELEKR 111 T 5E-05 DUF3116 pdbhh F Bacteria T 6aht 2 B B A1BYM8_BACCE DNA-BINDING PROTEIN ISMSSSEIIDVLCENLNDGIWALRVLYAEGAMNKEKLWDYINQYHKDYQIENEKDYEGKKILPSRYALDIMTARLEGAGLISFKAIGRVRIYDVTDLGNVLIKELEKRVEKNN 113 T 6.9E-05 DUF3116 unphh F Bacteria T 6aif 2 B B CYSE_SALTY SAT,SERINE TRANSACETYLASE WHTFEYGDGI 10 T 3.2 Cyanate_lyase pdbhh F Bacteria T 6ak0 1 A A A0A493R6M6_9ACTN CYS-LEU-GLY-VAL-GLY-SER-CYS-VAL-ASP-PHE-ALA-GLY-CYS-GLY-TYR-ALA-VAL-VAL-CYS-PHE-DTR CLGVGSCVDFAGCGYAVVCFX 21 T 1.4 CCAP pdbhh F Bacteria T 6ak2 2 C,D D,E peptide inhibitor KSL-128018 SHWXXDI 7 T 7.8 DUF3950 pdbhh F T 6al5 1 A A CD19_HUMAN B-LYMPHOCYTE SURFACE ANTIGEN B4,DIFFERENTIATION ANTIGEN CD19,T-CELL SURFACE ANTIGEN LEU-12 EEPLVVKVEEGDNAVLQCLKGTSDGPTQQLTWSRESPLKPFLKLSLGLPGLGIHMRPLAIWLFIFNVSQQMGGFYLCQPGPPSEKAWQPGWTVNVEGSGELFRWNVSDLGGLGCGLKQRSSEGPSSPSGKLMSPKLYVWAKDRPEIWEGEPPCLPPRDSLNQSLSQDLTMAPGSTLWLSCGVPPDSVSRGPLSWTHVHPKGPKSLLSLELKDDRPARDMWVMETGLLLPRATAQDAGKYYCHRGNLTMSFHLEITARGSHHHHHH 265 T 0.00011 G6B unphh F Eukaryota T 6al7 1 A,B,C,D A,B,D,E A0A076NBW8_9CYAN HPIU5 MGSSHHHHHHSSGLVPRGSHMASTSVVSIPINNAGFEDPFIEVVDDYTVDTPPGWTTYNPNNLVPEKRTTWTSNNGVGYVGPGTQFYNQLAPEGRNIGYIYLAQKPGSGVAGFEQILDATLEPDTKYTLKVDVGNSGGEFQKISLAGFPGYRVELLAGDTVLAADHNNLYIKDGEFKTSTVTFTATPDNPYLDQKLGIRLINLLQGTFSGLDFDNVRLTVEPAQT 225 T 2.7 DUF642 pdbhh F Bacteria T 6al8 1 A,B,C,D A,B,D,E A0A076NBW8_9CYAN HPIU5 MGSSHHHHHHSSGLVPRGSHMASTSVVSIPINNAGFEDPFIEVVDDYTVDTPPGWTTYNPNNLVPEKRTTWTSNNGVGYVGPGTQFYNQLAPEGRNIGFIYLAQKPGSGVAGFEQILDATLEPDTKYTLKVDVGNSGGEFQKISLAGFPGYRVELLAGDTVLAADHNNLYIKDGEFKTSTVTFTATPDNPYLDQKLGIRLINLLQGTFSGLDFDNVRLTVEPAQT 225 T 3.5 DUF3868 unphh F Bacteria T 6alg 8 I N VNUN_BPHK0 Transcription termination factor nun VKKTIYVNPDSGQNRKVSDRGLTSRDRRRIARWEKRIAYALKNGVTPGFNAIDDGPEYKINEDPMDKVDKALATPFPRDVEKIEDEKYEDVMHRVVNHAHQRNPNKKWS 109 T 0.0082 N36 unphh T Viruses T 6aly 1 A A MED15_YEAST AUTONOMOUS REPLICATION REGULATORY PROTEIN 3,BASAL EXPRESSION ACTIVATOR PROTEIN 1,DEFECTIVE SILENCING SUPPRESSOR PROTEIN 4,MEDIATOR COMPLEX SUBUNIT 15,TRANSCRIPTION REGULATORY PROTEIN GAL11,TY INSERTION SUPPRESSOR PROTEIN 13 NNPLQQQSSQNTVPNVLNQINQIFSPEEQRSLLQEAIETCKNFEKTQLGSTMTEPVKQSFIRKYINQKALRKIQALRDVKNNNNANNNGSNL 92 T 0.022 Gliadin pdb F Eukaryota T 6am5 3 C C SER-MET-LEU-GLY-ILE-GLY-ILE-VAL-PRO-VAL SMLGIGIVPV 10 T 4.4 Dehydratase_MU pdbhh F T 6amt 3 C,F C,F MET-MET-TRP-ASP-ARG-GLY-LEU-GLY-MET-MET MMWDRGLGMM 10 T 6.8 RNA_pol_Rpb5_N pdbhh F T 6anf 1 A A Capped-strapped peptide XTPRQARAARAAXCX 15 T 13 BssS pdbhh F T 6anw 1 A,B,C A,B,C A0A073KP86_9GAMM anti-CRISPR protein AcrF10 GSMTTFRIENVRIETINDFDMVKFDLVTDLGRVELAEHVNYDSEGDFKSVEYTDSNIRYNMVDELCSVFDLTDKPSLMPAIDYVTFAEIIEAVEEMLEA 99 T 3.9 DUF6156 unphh F Bacteria T 6anz 1 A A B4RQJ2_NEIG2 NEGOA.19190.A.B1 MAHHHHHHMKTSTIVFGGFFITDNGERIQIPILENPNIKEINNFFSVSNFEKKAGVLVFRIIPEPEFGNTELTIYFEKGYYLPIIQTILEDGDIEVKNLKTENYSGNTMEILGDVYPIEHISKNISIIQDIISEFIMKNKPITIMI 146 T 0.025 Imm1 unphh F Bacteria T 6ar2 2 C,D C,D STK3_HUMAN ASP-GLY-TPO-MET-LYS-ARG EEEDGTMKRN 10 T 0.28 Fib_succ_major unp F Eukaryota T 6arz 1 A,B,C A,B,C L7P7L6_9CAUD PHAGE ANTI-CRISPR PROTEIN MEKKLSDAQVALVAAWRKYPDLRESLEEAASILSLIVFQAETLSDQANELANYIRRQGLEEAEGACRNIDIMRAKWVEVCGEVNQHGIRVYGDAIDRDVDLEHHHHHH 108 T 0.99 DUF1040 unphh T Viruses T 6as3 1 A,B,C,D A,B,C,D L7P7L6_9CAUD NHis AcrE1 protein HHHHHHMEKKLSDAQVALVAAWRKYPDLRESLEEAASILSLIVFQAETLSDQANELANYIRRQGLEEAEGACRNIDIMRAKWVEVCGEVNQYGIRVYGDAIDRDVD 106 T 0.13 GAF_3 pdbpssm T Viruses T 6as4 1 A,B,C A,B,C L7P7L6_9CAUD PHAGE ANTI-CRISPR PROTEIN HHHHHHMEKKLSDAQVALVAAWRKYPDLRESLEEAASILSLIVFQAETLSDQANELANYIRRQGLEEAEGACRNIDIMRAKWVEVCGEVNQHGIRVYGDAIDRDVD 106 T 0.99 DUF1040 unphh T Viruses T 6at5 3 C C CTG1B_HUMAN AUTOIMMUNOGENIC CANCER/TESTIS ANTIGEN NY-ESO-1,CANCER/TESTIS ANTIGEN 6.1,CT6.1,L ANTIGEN FAMILY MEMBER 2,LAGE-2 APRGPHGGAASGL 13 T 10 FTCD_C pdbhh F Eukaryota T 6atz 3 E,F E,F FIBB_HUMAN FIBRINOGEN BETA- 74CIT69-81 GGYRAXPAKAAT 12 T 1.5 AT_hook pdbhh F Eukaryota T 6au5 3 E,F E,F meditope XQFDLSTXRLK 11 T 21 DUF4180 pdbhh F T 6au8 2 B C BAG6_HUMAN BAG FAMILY MOLECULAR CHAPERONE REGULATOR 6,BCL2-ASSOCIATED ATHANOGENE 6,BAG-6,HLA-B-ASSOCIATED TRANSCRIPT 3,PROTEIN G3,PROTEIN SCYTHE EPWAAAVPPEWVPIIQQDIQSQRKVKPQPPLSDAYLSGMPAKR 43 T 8.9 DUF5928 pdbhh F Eukaryota T 6awb 6 G B 30S ribosomal protein S1 ESFAQLFEXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 153 T 350 DUF5572 pdbhh F T 6awk 1 A A PLP-12 FVGGTSFD 8 T 0.36 BDV_M pdbhh F T 6ax2 1 A A TX22A_MACGS MU-HXTX-MG2A,NEUROTOXIN MAGI-3 GGCIKWNHSCQTTTLKCCGKCVVCYCHTPWGTNCRCDRTRLFCTED 46 T 0.012 Toxin_9 pdb F Eukaryota T 6axi 1 A A ASP-LEU-PHE-VAL-PRO-PRO-ILE-ASP DLFVPPID 8 T 7.7 DUF5651 pdbhh F T 6ay9 2 B B CPSF6_HUMAN CLEAVAGE AND POLYADENYLATION SPECIFICITY FACTOR 68 KDA SUBUNIT,CPSF 68 KDA SUBUNIT,PRE-MRNA CLEAVAGE FACTOR IM 68 KDA SUBUNIT,PROTEIN HPBRII-4/7 PVLFPGQPFGQP 12 T 0.89 MF_alpha pdbhh F Eukaryota T 6aza 1 A A KPHAB_ACTTE ARG-CYS-LYS-THR-CYS-SER-LYS-GLY-ARG-CYS-ARG-PRO-LYS-PRO-ASN-CYS-GLY-NH2 RCKTCSKGRCRPKPNCGX 18 T 1.4 DUF35_N unphh F Eukaryota T 6azf 1 A A GLY-SER-PRO-LEU-PHE-ASP GSPLFD 6 T 0.17 Peptidase_C24 pdbhh F T 6azk 3 E,F E,F meditope QFDLSTXRLKX 11 T 19 DUF4180 pdbhh F T 6azp 2 B B Q2G0X2_STAA8 Staphylococcal Peroxidase Inhibitor ANFLEHELSYIDVLLDKNADQATKDNLRSYFADKGLHSIKDIINKAKQDGFDVSKYEHVK 60 T 0.0076 Drf_FH3 pdbpssm F Bacteria T 6b12 2 B,C B,C Q4K3B5_PSEF5 Tni2 MISDFERIREDGKVIDENMTVDQMIALGWSPCRVVEARWRWQEQLLSVVNSRGLLAIVVPDRQHLAILWNDDDTGVAATLYVVSGDRQQQIRIADQLLINGQLEAGIYSWFEQFPQVSPSIFTCMFSRQRDQAMFRVDIDASTGDIVSIQHSR 153 T 0.49 Skp1_POZ unppercent F Bacteria T 6b27 2 G,H,I,J,K,L G,H,I,J,K,L CAC1S_HUMAN CALCIUM CHANNEL,L TYPE,ALPHA-1 POLYPEPTIDE,ISOFORM 3,SKELETAL MUSCLE,VOLTAGE-GATED CALCIUM CHANNEL SUBUNIT ALPHA CAV1.1 EDEPEIPLSPRPRP 14 T 18 OGFr_III pdbhh F Eukaryota T 6b34 1 A A Tyrocidine A analogue D-PHE-BE2-PHE-D-PHE-ASN-GLN-TYR-VAL-ORN-LEU XXFXNQYVXL 10 T 3.2 MFP2b pdbhh F T 6b35 1 A A Tyrocidine A analogue D-PHE-BE2-PHE-D-PHE-ASN-LYS-TYR-VAL-ORN-LEU XXFXNKYVXL 10 T 7.9 MSA_2 pdbhh F T 6b46 2 G,H I,J L7P7M1_9CAUD Anti-CRISPR protein AcrF1 GSMKFIKYLSTAHLNYMNIAVYENGSKIKARVENVVNGKSVGARDFDSTEQLESWFYGLPGSGLGRIENAMNEISRRENP 80 T 0.075 UXS1_N pdb T Viruses T 6b47 4 I K ACR30_BPD31 Anti-CRISPR protein AcrF2 GSMIAQQHKDTVAACEAAEAIAIAKDQVWDGEGYTKYTFDDNSVLIQSGTTQYAMDADDADSIKGYADWLDDEARSAEASEIERLLESVEEE 92 T 0.13 Transglycosylas unp T Viruses T 6b4e 2 C,D C,D NUP42_YEAST NUCLEAR PORE PROTEIN NUP42 GPSGSELADLAEETLKIFRANKFELGLVPDIPPPPALVA 39 T 14 DUF5767 unphh F Eukaryota T 6b4f 2 C,D C,D NUPL2_HUMAN Nucleoporin like 2 GPSGSIIATDNVLFTPRDKLTVEELEQFQSKKFTLGKIPLKPPPLELLNV 50 T 7.7 Arcadin_1 pdbhh F Eukaryota T 6b4g 2 B,C,D,E B,D,F,H AMO1_CHATD NUCLEAR PORE PROTEIN AMO1 GPHMGSPEFDGTLVRIWMPDGAPAYTADTEAEDPKVYEDEGVKRQWQSFLEKGRFEGGMPEVPPRREWCVWDF 73 T 3.4 BTHB pdbhh F Eukaryota T 6b4j 2 B,D C,D NUPL2_HUMAN Nucleoporin like 2 GPSGSIIATDNVLFTPRDKLTVEELEQFQSKKFTLGKIPLKPPPLELLN 49 T 10 Arcadin_1 pdbhh F Eukaryota T 6b67 2 D,E,F D,E,F cyclic peptide c(MpSIpYVA) MSIXVAX 7 T 6.4 DOR pdbhh F T 6b7l 1 A A immune modulator A MEKAANSIAKRVPLALPEAGLYQANLMSRDGDKATPRMIKDLDGLALVYPKGETVQHWGVWVDHQVGKVETNSQWLGQADQKADKDGIYPVQLIRNSERLGTSTALSSVTNDHNLITFQDQPVIDLQGKEIKRWVFDFTRTGTKFSDNSPIYSGFSGHVAVTALTTKAVTTASWSATDSDGFSSEMVGKVDTTNNGGKLTVAIEFPAAGCTLVGEGSATAGLSKLTMTGFGKCNFKQSAAATPIENLWNAALARAMDNRVAYVTTFTADAKKEALVIGFPDTNGLLITADKRLEHHHHHH 300 T 0.68 Choline_bind_1 pdbpssm F T 6b9l 2 E,F,G,H E,F,G,H peptide 135E2, (DUG)SAYPDSVPFR XSAYPDSVPFR 11 T 6.1 DUF5623 pdbhh F T 6b9m 2 D D UHRF1_HUMAN INVERTED CCAAT BOX-BINDING PROTEIN OF 90 KDA,NUCLEAR PROTEIN 95,NUCLEAR ZINC FINGER PROTEIN NP95,HNP95,RING FINGER PROTEIN 106,RING-TYPE E3 UBIQUITIN TRANSFERASE UHRF1,TRANSCRIPTION FACTOR ICBP90,UBIQUITIN-LIKE PHD AND RING FINGER DOMAIN-CONTAINING PROTEIN 1,HUHRF1,UBIQUITIN-LIKE-CONTAINING PHD AND RING FINGER DOMAINS PROTEIN 1 ASPRTGKGKWKRKSAGGGPSRAGSPRRTSKKTKVEPYSLTA 41 T 6.4 PMAIP1 pdbpercent F Eukaryota T 6b9y 5 E D meditope XSQFDFCTRRLQSGGK 16 T 2.1 Lambda_CIII pdbhh F T 6bae 5 E D meditope XCQFDLSTRRLKCX 14 T 4.4 Flavi_NS1 pdbhh F T 6bah 5 E D meditope XSQFDXCTRRLQS 13 T 1.7 Lambda_CIII pdbhh F T 6bb4 3 C,F,I P,Q,R TAU_HUMAN NEUROFIBRILLARY TANGLE PROTEIN,PAIRED HELICAL FILAMENT-TAU,PHF-TAU TDHGAEIVYKSPVVSGDTSPRHL 23 T 0.37 Tmemb_cc2 unp F Eukaryota T 6bc8 2 B B REV3L_HUMAN PROTEIN REVERSIONLESS 3-LIKE,HREV3 MEDKKIVIMPCKCAPSRQLVQVWLQAKE 28 T 0.42 SAM_3 pdbhh F Eukaryota T 6bcr 2 C,D,G,H C,D,G,H BAIP2_HUMAN Insulin receptor substrate protein of 53 kDa, peptide (IRSp53) LSDSYSNTLPVRKS 14 T 2.4 Glycolipid_bind pdbhh F Eukaryota T 6bcy 2 C,D,G,H C,D,G,H BAIP2_HUMAN Insulin receptor substrate protein of 53 kDa, peptide (IRSp53) ATTENKTLPRSSS 13 T 5.7 EAR pdbhh F Eukaryota T 6bd1 2 C,D,G,H C,D,G,H BAIP2_HUMAN Insulin receptor substrate protein of 53 kDa, peptide (IRSp53) TLPRSSSMAAGLEK 14 T 27 EAR pdbhh F Eukaryota T 6bd2 2 C C BAIP2_HUMAN PROTEIN BAP2,FAS LIGAND-ASSOCIATED FACTOR 3,FLAF3,INSULIN RECEPTOR SUBSTRATE P53/P58,IRSP53/58,INSULIN RECEPTOR SUBSTRATE PROTEIN OF 53 KDA,INSULIN RECEPTOR SUBSTRATE P53 DSYSNTLPVRKSVTPKNSYATTENKTLPRSSSMAAGLE 38 T 0.1 EAR pdb F Eukaryota T 6bdu 1 A,B A,B PPRA_DEIRA PLEIOTROPIC PROTEIN PROMOTING DNA REPAIR MRSGSHHHHHHRSDITSLYKKAGLENLYFQGQTDGIYAAFDTLMSTAGVDSQIAALAASEADAGTLDAALTQSLQEAQGRWGLGLHHLRHEARLTDDGDIEILTDGRPSARVSEGFGALAQAYAPMQALDERGLSQWAALGEGYRAPGDLPLAQLKVLIEHARDFETDWSAGRGETFQRVWRKGDTLFVEVARPASAEAALSKAAWKVIASIKDRAFQRELMRRSEKDGMLGALLGARHAGAKANLAQLPEAHFTVQAFVQTLSGAAARNAEEYRAALKTAAAALEEYQGVTTRQLSEVLRHGLRES 307 T 9.2 Ldt_C unphh F Bacteria T 6beo 1 A A (DPR)PY(DHI)PKDL(DGN) XPYXPKDLX 9 T 2.6 Gemin6 pdbhh F T 6bet 1 A A H(DPR)(DVA)CIP(DPR)E(DLY)VC(DGL) HXXCIPXEXVCX 12 T 1.1 ARMET_N pdbhh F T 6beu 1 A A (DCY)N(DVA)(DPR)DVYC(DPR)(DSG)KY(DVA)(DPR) XNXXDVYCXXKYXX 14 T 1.5 CcmE pdbhh F T 6bf4 1 A,D A,G D7S2G1_9HIV1 HIV-1 clade AE gp120 core VWRDADTTLFCASDAKAHETEVHNVWATHACVPTDPNPQEIHLVNVTENFNMWKNKMVEQMQEDVISLWDESLKPCVKLTGGSVIKQACPKVSFDPIPIHYCTPAGYVILKCNDKNFNGTGPCKNVSSVQCTHGIKPVVSTQLLLNGSLAEEEIIIRSENLTDNAKNIIVHLNKSVEINCTRPSNGGSGSGGDIRKAYCEIDGTEWNKTLTQVAEKLKEHFNKTIVYQPPSGGDLEITMHHFNCRGEFFYCNTTQLFNNSVGNSTIKLPCRIKQIINMWQGVGQAMYAPPISGAINCLSNITGILLTRDGGGNNRSNETFRPGGGNIKDNWRSELYKYKVVEIE 344 T 2.2E-50 GP120 unppercent T Viruses T 6bga 5 E E Velcro peptide YVVVPDGTGGGSGSG 15 T 0.54 CFIA_Pcf11 pdbhh F T 6bgg 1 A A CHD4_HUMAN CHD4 KVAPLKIKLGGF 12 T 3.3 DUF2577 pdbhh F Eukaryota T 6bgh 2 B B SMCA4_HUMAN Brd3_ET RSVKVKIKLGRK 12 T 5.5 ProQ_C pdbhh F Eukaryota T 6bij 3 C C FIBB_HUMAN Citrullinated Fibrinogen 72,74Cit69-81 GGYXAXPAKAAAT 13 T 3.3 AT_hook pdbhh F Eukaryota T 6bil 3 C C FIBB_HUMAN Fibrinogen beta 74cit69-81 GGYRAXPAKAAAT 13 T 3.3 AT_hook pdbhh F Eukaryota T 6bin 2 B C CO2A1_HUMAN Type II Collagen 1240Cit 1237-1249 QYMXADQAAGGLR 13 T 0.022 DUF2600 unphh F Eukaryota T 6bir 3 C C VIME_HUMAN Vimentin 424Cit419-431 SSLNLXETNLDSL 13 T 11 LRR_1 pdbhh F Eukaryota T 6bj5 2 B,D,F F,G,H SPI1_MYXVL SERPIN-1 LIPRNALTAIVANKPFMFLIYHKPTTTVLFMGTITKGEKVIYDTEGRDDVVSSV 54 F T Viruses T 6bj8 5 E C VAL-PRO-LEU-THR-GLU-ASP-ALA-GLU-LEU VPLTEDAEL 9 T 6.3 EST1_DNA_bind pdbhh F T 6bl5 1 A A A7XXR5_9CAUD Head decoration protein DKIQLFRTIGRVQYWERVPRLHAYGVFALPFPMDPDVEWGNWFAGPHPKAFLVSVHPSGPKAGHVYPTDLSDPDSVANVIGMVLDGHDYEADHNVTVTLRAAVPIEYVQQGIEAPPLQPDPAVLNAAPQLKLKVIKGHYFFDYTR 145 T 0.77 TRI9 pdbpssm T Viruses T 6bl9 1 A A Sm2a toxin EETEEPIRHAKKNPSEGECKKACADAFANGDQSKIAKAENFKDYYCNCHIIIH 53 T 0.022 BSMAP pdbpssm F T 6bmt 2 B B A0A2A4GXB5_9STAP Hypothetical Protein GSTGSMKKTLVAGFAVAALSTGIFAVSNEANAQVTSQNGIILHDDSRMLDHELQYVDVLINPNANPQTKERLKAYFESQGLNTVSEIVQKAKQDGLDTSKYDHLI 105 T 0.046 Drf_FH3 unppssm F Bacteria T 6bnt 2 B B IRS1_HUMAN IRS-1 CHTDDGYMPMSPGVA 15 T 0.14 ComGF pdbhh F Eukaryota T 6bo3 1 A,B A,B Q6Q0L4_9VIRU Uncharacterized protein MGEVFKEVKEKFERYKFDVVYVDREYPVSSNNLNVFFEIGERNSFSGLLINEGQAVIDVLLLKKSHEGLSPIPGEGTGIQLSAGQILKFYNVPIAEIIVEYDPSNVSGVSSNVKLKGTIHPLFEVPSQISIENFQPTENYLIYSGFGTSLPQTYTIPANGYLIISITNTSTGNIGQITLTIGSTTMTFNLQTGENKIPVIAGTQITNMTLTSSSAILIYEEVIHHHHHH 229 T 6.4 PSP1 pdbhh T Viruses T 6bqb 3 C P Q7K740_PLAF7 N-terminal junction peptide XKQPADGNPDPNANPX 16 T 5.1 Nup54 pdbhh F Eukaryota T 6bqt 2 C,F,I,L C,F,I,L BAIP2_HUMAN PROTEIN BAP2,FAS LIGAND-ASSOCIATED FACTOR 3,FLAF3,INSULIN RECEPTOR SUBSTRATE P53/P58,IRSP53/58,INSULIN RECEPTOR SUBSTRATE PROTEIN OF 53 KDA,INSULIN RECEPTOR SUBSTRATE P53 DSYSNTLPVRKSVTPKNSYATTENKTLPRSSS 32 T 20 Glycolipid_bind pdbhh F Eukaryota T 6bra 2 C S Phage display-optimized HIV-1 protease substrate SGIFLETS 8 T 3.7 DUF2016 pdbhh F T 6buu 2 C,D F,G GSK3B_HUMAN GLY-ARG-PRO-ARG-THR-THR-ZXW-PHE-ALA-GLU GRPRTTXFAE 10 T 7.5 AT_hook pdbhh F Eukaryota T 6bvh 2 B I SFTI1_HELAN Trypsin inhibitor 1 GTCTRSIPPICNPN 14 T 0.0038 Bowman-Birk_leg pdb F Eukaryota T 6bvu 1 A A SFTI1_HELAN Trypsin inhibitor 1 HFRW-1 CTASIPPICHXRWR 14 T 0.052 Bowman-Birk_leg unp F Eukaryota T 6bvv 2 B B W_NIPAV Protein W SRNIHLLGRKTCLGRRVVQPGMFEDHPPTKKARVSMRRMSN 41 T 1.3 Paramyxo_P_V_N unphh T Viruses T 6bvw 1 A A SFTI1_HELAN Trypsin inhibitor 1 HFRW-3 CTASIPPICHXXXR 14 T 0.052 Bowman-Birk_leg unp F Eukaryota T 6bvx 1 A A SFTI1_HELAN Trypsin inhibitor 1 HFRW-2 CTHXXWPICFPDGR 14 T 0.052 Bowman-Birk_leg unp F Eukaryota T 6bvy 1 A A SFTI1_HELAN Trypsin inhibitor 1 HFRW-4 CTASIPPICXXXWR 14 T 0.052 Bowman-Birk_leg unp F Eukaryota T 6bw1 1 A C W_HENDH Protein W SRSLNMLGRKTCLGRRVVQPGMFADYPPTKKARVLLRRMSN 41 T 8.1 Paramyxo_P_V_N unphh T Viruses T 6bw3 2 B,D B,D MECOM_HUMAN MYELODYSPLASIA SYNDROME 1 PROTEIN,MYELODYSPLASIA SYNDROME-ASSOCIATED PROTEIN 1 MRSKGRARKLAT 12 T 11 DUF3824 pdbhh F Eukaryota T 6bw4 2 B,D B,D PRD16_HUMAN PR DOMAIN-CONTAINING PROTEIN 16,TRANSCRIPTION FACTOR MEL1,MDS1/EVI1-LIKE GENE 1 MRSKARARKLAK 12 T 10 TCP pdbhh F Eukaryota T 6bx3 4 E F SPP1_YEAST COMPLEX PROTEINS ASSOCIATED WITH SET1 PROTEIN SPP1,SET1C COMPONENT SPP1,SUPPRESSOR OF PRP PROTEIN 1 HGREFVNDIWSRLKTDEDRAVVKKMVEQTGHIDKFKKFGQLDFIDNNIVVKTDDEKEIFDQIVVRDMTLKTLEDDLQEVQEISLPLFKKKLELLEVYLGWLDNVYTEMRKLDDDAASHVECGKEDSKGTKRKKKKNSSRSRARKNICGYCSTYERIPCSVEEFVRDFGSNEEATKIHEVCTKWKCNRHLDWVSTNQEQYLQQIDSLESMQERLQHLIQARKKQLNIQYYEEILRRGL 237 T 0.0087 Mod_r pdbpssm F Eukaryota T 6bxp 3 C C ENV_HV1H2 HIV peptide RKV-Kyn RVKEKYQHLX 10 T 0.25 FAS_meander pdbhh T Viruses T 6bxq 1 A A ENV_HV1H2 HIV peptide RKV RVKEKYQHLW 10 T 0.25 FAS_meander pdbhh T Viruses T 6bxr 1 A A A0A140H546_TOXGO Mitochondrial association factor 1 SQTVDLSCLSGTTVRFFGPSHHFGGFTPLYDPAPDKRVATVDAGANALFIGGGGLNGQFAKTLLEEAEKHGIRLTPEELSQHSQRIQQSLLRRAVKSPGKLVELDTGVASPVFARSFGFVPVVPGLMWEESEVGPNVGVTFVHILKPEVTPYGNLNNNVMMYTVAPSGAAPDKTYSLAYKTTIAGVIGAAAAYNDTPAGQQYPVQGLRLPLLGGGIFRRNRSLESIGRANAEGTSLAITRYGPNFELQYMYDPSNAALHGLQEAESTYLASAA 273 T 0.5 FAP unp F Eukaryota T 6bxs 1 A,B,C A,B,C A0A193AUK9_TOXGO mitochondrial association factor 1 GSMGTPDPLTLRFTCLGDRNVIFFGPSGRQDGFTPLYDPSPSKRVATVDAGTYGLFIGGVGMNGEFADTIIEEARRNRIPLTATELSAESQEIQERLLHDAERQPGTLVEIDSGRFSRVFARSFAYVAIVPNTVWDESETGKNVGATFLHILKPEVTPHGNEMNDVMLYTVAPFGNASDSAYNMAYKATMLGIVGAVSEYNKTPWGEVKPVEAIRLPLLGAGHFRGRRGLHSIGRANAVAVEAAITRFDPRVELQFMYEPSDTALRGLMESERKYKFPQGD 281 T 0.052 DUF6479 unppercent F Eukaryota T 6bxw 1 A A A0A140H546_TOXGO Mitochondrial association factor 1 GSMGSQTVDLSCLSGTTVRFFGPSHHFGGFTPLYDPAPDKRVATVDAGANALFIGGGGLNGQFAKTLLEEAEKHGIRLTPEELSQHSQRIQQSLLRRAVKSPGKLVELDTGVASPVFARSFGFVPVVPGLMWEESEVGPNVGVTFVHILKPEVTPYGNLNNNVMMYTVAPSGAAPDKTYSLAYKTTIAGVIGAAAAYNDTPAGQQYPVQGLRLPLLGGGIFRRNRSLESIGRANAEGTSLAITRYGPNFELQYMYDPSNAALHGLQEAESTYLASAAA 278 T 0.5 FAP unp F Eukaryota T 6byj 2 G,H,I G,P,T TSTTATPPVSQASSTTTSTW O-GlcNac peptide TSTTATPPVSQASSTTTSTW 20 T 84 Polyoma_coat pdbhh F T 6byk 2 E,F,G,H G,J,K,R ATPPVSQASSTT O-GlcNac peptide ATPPVSQASSTT 12 T 39 Luteo_coat pdbhh F T 6byl 2 G,H,I G,P,T TSASTTVPVTTATTTTTSTW O-GlcNac peptide TSASTTVPVTTATTTTTSTW 20 T 45 YjbE pdbhh F T 6c23 2 B E JARD2_HUMAN JUMONJI/ARID DOMAIN-CONTAINING PROTEIN 2 SNARKRPRLQAQRKFAQSQPNSPSTTPVKIVEPLLPPPATQISDLSKRKPKTEDFLTFLCLRGSPALPNSMVYFGSSQDEEEVEEEDDETEDVKTATNNASSSCQSTPRKGKTHKHVHNGHVFNGSSRSTREKEPVQKHKSKEATPAKEKHSDHRADSRREQASANHPAAAPSTGSSAKGLAATHHHPPLHRSAQDLRKQVSKVNGVTRMSSLGAGVTSAKKMREVRPSPSKTVKYTATVTKGAVTYTKAKRELVKDTKPNHHKPSSAVNHTISGKTESSNAKTRKQVLSLGGASKSTGPAVNGLKVSGRLNPKSCTKEVGGRQLREGLQLREGLRNSKRRLEEAHQA 348 T 0.054 Actin_micro pdbpercent F Eukaryota T 6c23 9 L B JARD2_HUMAN JUMONJI/ARID DOMAIN-CONTAINING PROTEIN 2 RKRPRLQAQRKFAQSQPNSPSTTPVKIVEPLLPPPATQISDLSKRKPKTEDFLTFLCLRGSPALPNSMVYFGSSQDEEEVEEEDDETEDVKTATNNASSSCQSTPRKGKTHKHVHNGHVFNGSSRSTREKEPVQKHKSKEATPAKEKHSDHRADSRREQASANHPAAAPSTGSSAKGLAATHHHPPLHRSAQDLRKQVSKVNGVTRMSSLGAGVTSAKKMREVRPSPSKTVKYTATVTKGAVTYTKAKRELVKDTKPNHHKPSSAVNHTISGKTESSNAKTRKQVLSLGGASKSTGPAVNGLKVSGRLNPKSCTKEVGGRQLREGLQLREGLRNSKRRLEEAHQA 345 T 0.053 Actin_micro pdbpercent F Eukaryota T 6c3g 1 A,B A,B LYS-ALA-LEU-GLY-ILE-SER KALGIS 6 T 0.55 Tat pdbhh F T 6c3r 1 A,B A,B POLN_CRPVC Cricket paralysis virus 1A protein INSLEELAAQELIAAQFEGNLDGFFCTFYVQSKPQLLDLESECYCMDDFDCGCDRIKREEELRKLIFLTSDVYGYNFEEWKGLVWKFVQNYCPEHRYGSTFGNGLLIVSPRFFMDHLDWFQQWKLVSSNDECRAFLRKRTQ 141 T 7.1 AAA_lid_6 pdbhh T Viruses T 6c48 2 C,D F,C MYBB_HUMAN B-MYB,MYB-LIKE PROTEIN 2 APMSSAWKTVACGGTRDQLFMQEKARQLLGRL 32 T 14 Atracotoxin pdbhh F Eukaryota T 6c4a 1 A,B,C,D,E,F,G,H A,B,C,D,E,F,G,H ACEA_MYCTU ICL1,ISOCITRASE,ISOCITRATASE,METHYLISOCITRATE LYASE,MICA MHHHHHHLVPRGSHMSVVGTPKSAEQIQQEWDTNPRWKDVTRTYSAEDVVALQGSVVEEHTLARRGAEVLWEQLHDLEWVNALGALTGNMAVQQVRAGLKAIYLSGWQVAGDANLSGHTYPDQSLYPANSVPQVVRRINNALQRADQIAKIEGDTSVENWLAPIVADGEAGFGGALNVYELQKALIAAGVAGSHWEDQLASEKKXGHLGGKVLIPTQQHIRTLTSARLAADVADVPTVVIARTDAEAATLITSDVDERDQPFITGERTREGFYRTKNGIEPCIARAKAYAPFADLIWMETGTPDLEAARQFSEAVKAEYPDQMLAYNCSPSFNWKKHLDDATIAKFQKELAAMGFKFQFITLAGFHALNYSMFDLAYGYAQNQMSAYVELQEREFAAEERGYTATKHQREVGAGYFDRIATTVDPNSSTTALTGSTEEGQFH 442 T 1.8E-47 ICL unp F Bacteria T 6c4x 1 A,B,C,D,E,F,G,H A,B,C,D,E,F,G,H cross-alpha amyloid-like membrane peptide alpha-AmMEM XSKLLLLLIILSEALHLAILLLIKWGX 27 T 2.9 GRP pdbhh F T 6c4y 1 A,B,C,D,E,F,G,H,I,J,K,L,M,N,O,P,Q,R A,B,C,D,E,F,G,H,K,I,J,L,M,N,O,P,Q,R Cross-alpha Amyloid-like Structure alphaAmG XSKLLELLRKLGEALHKAIELLEKWGX 27 T 2.1 BssS pdbhh F T 6c50 1 A,AA,AB,AC,AD,AE,AF,AG,AH,AI,B,BA,BB,BC,BD,BE,BF,BG,BH,BI,C,CA,CB,CC,CD,CE,CF,CG,CH,D,DA,DB,DC,DD,DE,DF,DG,DH,E,EA,EB,EC,ED,EE,EF,EG,EH,F,FA,FB,FC,FD,FE,FF,FG,FH,G,GA,GB,GC,GD,GE,GF,GG,GH,H,HA,HB,HC,HD,HE,HF,HG,HH,I,IA,IB,IC,ID,IE,IF,IG,IH,J,JA,JB,JC,JD,JE,JF,JG,JH,K,KA,KB,KC,KD,KE,KF,KG,KH,L,LA,LB,LC,LD,LE,LF,LG,LH,M,MA,MB,MC,MD,ME,MF,MG,MH,N,NA,NB,NC,ND,NE,NF,NG,NH,O,OA,OB,OC,OD,OE,OF,OG,OH,P,PA,PB,PC,PD,PE,PF,PG,PH,Q,QA,QB,QC,QD,QE,QF,QG,QH,R,RA,RB,RC,RD,RE,RF,RG,RH,S,SA,SB,SC,SD,SE,SF,SG,SH,T,TA,TB,TC,TD,TE,TF,TG,TH,U,UA,UB,UC,UD,UE,UF,UG,UH,V,VA,VB,VC,VD,VE,VF,VG,VH,W,WA,WB,WC,WD,WE,WF,WG,WH,X,XA,XB,XC,XD,XE,XF,XG,XH,Y,YA,YB,YC,YD,YE,YF,YG,YH,Z,ZA,ZB,ZC,ZD,ZE,ZF,ZG,ZH A1,G3,N1,T3,11,73,e1,k3,r1,x3,A2,G4,N2,T4,12,74,e2,k4,r2,x4,A3,H1,N3,U1,13,81,e3,l1,r3,A4,H2,N4,U2,14,82,e4,l2,r4,B1,H3,O1,U3,21,83,f1,l3,s1,B2,H4,O2,U4,22,84,f2,l4,s2,B3,I1,O3,V1,23,91,f3,m1,s3,B4,I2,O4,V2,24,92,f4,m2,s4,C1,I3,P1,V3,31,93,g1,m3,t1,C2,I4,P2,V4,32,94,g2,m4,t2,C3,J1,P3,W1,33,a1,g3,n1,t3,C4,J2,P4,W2,34,a2,g4,n2,t4,D1,J3,Q1,W3,41,a3,h1,n3,u1,D2,J4,Q2,W4,42,a4,h2,n4,u2,D3,K1,Q3,X1,43,b1,h3,o1,u3,D4,K2,Q4,X2,44,b2,h4,o2,u4,E1,K3,R1,X3,51,b3,i1,o3,v1,E2,K4,R2,X4,52,b4,i2,o4,v2,E3,L1,R3,Y1,53,c1,i3,p1,v3,E4,L2,R4,Y2,54,c2,i4,p2,v4,F1,L3,S1,Y3,61,c3,j1,p3,w1,F2,L4,S2,Y4,62,c4,j2,p4,w2,F3,M1,S3,Z1,63,d1,j3,q1,w3,F4,M2,S4,Z2,64,d2,j4,q2,w4,G1,M3,T1,Z3,71,d3,k1,q3,x1,G2,M4,T2,Z4,72,d4,k2,q4,x2 Cross-alpha Amyloid-like Structure alphaAmS XSKLLELLRKLSEALHKAIELLEKWGX 27 T 3.1 BssS pdbhh F T 6c51 1 A,B,C,D A,C,B,D Cross-alpha Amyloid-like Structure alphaAmL XSKLLELLRKLLEALHKAIELLEKWGX 27 T 2.3 Antimicrobial19 pdbhh F T 6c52 1 A,B,C,D A,B,C,D Cross-alpha Amyloid-like Structure alphaTet XSKLEELRRKLQEAEHKARELQEKWGX 27 T 0.0094 DMPK_coil pdb F T 6c5x 4 G G IL6RB_MOUSE GP130 peptide fragment TVEXSTVVHS 10 T 7.8 DUF2536 pdbhh F Eukaryota T 6c90 2 B B ZCHC8_HUMAN TRAMP-LIKE COMPLEX RNA-BINDING FACTOR ZCCHC8 SGDPIPDMSKFATGITPFEFENMAESTGMYLRIRSLLKNSPRNQQKNKKASE 52 T 6.5 DUF2621 pdbhh F Eukaryota T 6cae 56 ID,JD,KD A,B,C NOSO-95179 antibiotic KXAGXPHKX 9 T 30 Chisel pdbhh F T 6cbi 2 G,H,I,J H,I,J,K CDN1A_HUMAN GLY-ARG-LYS-ARG-ARG-GLN-DAB-SER-MET-THR-GLU-PHE-TYR-HIS GRKRRQXSMTEFYH 14 T 1.4 HD_assoc pdbhh F Eukaryota T 6cbz 3 C,D C,D grip peptide AILHRLLQ 8 T 0.0019 SRC-1 pdbhh F T 6cdg 2 B B Hexapeptide PGLWKS PGLWKS 6 T 0.43 Herpes_TK_C pdbhh F T 6ce7 4 E P INSR_HUMAN IR QILKELEESSFRKTFEDYLHNVVFVPRPSR 30 T 0.00017 DUF4998 unphh F Eukaryota T 6cej 1 A A CDN1A_HUMAN CDK-INTERACTING PROTEIN 1, MELANOMA DIFFERENTIATION-ASSOCIATED PROTEIN 6, MDA-6, P21 GRKRRQTSMTDFYH 14 T 2.4 INCA1 pdbhh F Eukaryota T 6cf6 2 C,D C,D RN146_HUMAN RNF146 NLARESSADGADS 13 T 47 DUF2788 pdbhh F Eukaryota T 6cfa 1 A A peptide PaAMP1R3 PMARNKKLLKKLRLKIAFK 19 T 7.9 RR_TM4-6 pdbhh F T 6cfb 1 A A A0A384E130_9METZ barrettide A DVSPCFCVEDETSGAKTCVPDNCDASRGTNP 31 T 8.4 NRF pdbhh F Eukaryota T 6cfh 1 A,B A,B TADBP_HUMAN TDP-43 SWGMMGMLASQ 11 T 0.29 Glucosaminidase unppercent F Eukaryota T 6cfw 4 D I I6U847_9EURY MBH subunit MFGYWDPLYFIIVFIIGLILAYLLNLWAKKSGMGTREVGEGTKIFISGEDPEKVIPGFEHLEGYYTGRNTMWGLVNGVKKFFATLKNDHTGLLPDYVSYLLMTTAFILVILLLRG 115 T 0.00077 Oxidored_q3 pdbpssm F Archaea T 6cfw 8 H E I6V287_9EURY MBH subunit MKRALGFLSLLVIFASLLVALSPEYGIKFGVGGEDWLKYRYTDNYYIEHGIEEVGGTNIVTDIVFDYRGYDTLGEATVLFTAIAGAVALLRPWRREENE 99 T 0.002 DUF2106 pdbhh F Archaea T 6cgi 1 A,B,C,D A,B,C,D A0A0H3NMP8_SALTS Type III secretion system effector protein SNAPSSGSANFAGVEYPLLPLDQHTPLLFQWFERNPSRFGENQIPIINTQQNPYLNNIINAAIIEKERTIGVLVDGNFSAGQKKALAKLEKQYENIKVIYNSDLDYSMYDKKLSDIYLENIAKIEAQPANVRDEYLLGEIKKSLNEVLKNNPEESLVSSHDKRLGHVRFDFYRNLFLLKGSNAFLEAGKHGCHHLQPGGGCIYLDADMLLTGKLGTLYLPDGIAVHVSRKGNSMSLENGIIAVNRSEHPALKKGLEIMHSKPYGDPYIDGVCGGLRHYFNCSIRHNYEEFCNFIEFKHEHIFMDTSSLTISSWR 314 T 1.9E-05 Glyco_transf_88 unphh F Bacteria T 6cgw 1 A A JZTX5_CHIGU BETA/KAPPA-TRTX-CG2A, JINGZHAOTOXIN-5, JINGZHAOTOXIN-V, JZTX-V, PEPTIDE F8-15.73 YCQKWXWTCDSKRACCEGLRCKLWCRKEI 29 T 0.0016 Conotoxin unppercent F Eukaryota T 6ch7 4 D G Q2N0S6_9HIV1 ENV POLYPROTEIN AENLWVTVYYGVPVWKDAETTLFCASDAKAYETEKHNVWATHACVPTDPNPQEIHLENVTEEFNMWKNNMVEQMHTDIISLWDQSLKPCVKLTPLCVTLQCTNVTNNITDDMRTELKNCSFNMTTELRDKKQKVYSLFYRLDVVQINENQGNRSNNSNKEYRLINCNTSAITQACPKVSFEPIPIHYCAPAGFAILKCKDKKFNGTGPCPSVSTVQCTHGIKPVVSTQLLLNGSLAEEEVMIRSENITNNAKNILVQFNTPVQINCTRPNNNTRKSIRIGPGQAFYATGDIIGDIRQAHCNVSKATWNETLGKVVKQLRKHFGNNTIIRFANSSGGDLEVTTHSFNCGGEFFYCNTSGLFNSTWISNTSVQGSNSTGSNDSITLPCRIKQIINMWQRIGQAMYAPPIQGVIRCVSNITGLILTRDGGSTNSTTETFRPGGGDMRDNWRSELYKYKVVKIEPLGVAPTRCKRRVVGREKR 479 T 3.4E-54 GP120 pdbpercent T Viruses T 6cht 2 C,F,I,L C,F,I,L PA2G4_HUMAN CELL CYCLE PROTEIN P38-2G4 HOMOLOG,HG4-1,ERBB3-BINDING PROTEIN 1 VQDAELKALLQSSASRKTQK 20 T 0.25 TraC unp F Eukaryota T 6cit 4 D D NS2_MUMIP NS2 GGSYSTVDEMTKKFGTLTIHD 21 T 0.33 DUF6118 unppssm T Viruses T 6cix 1 A B CDN1A_HUMAN p21 GRKRRQKSMTEFYH 14 T 1.6 HD_assoc pdbhh F Eukaryota T 6cjd 1 A A A0A0H3NF83_SALTS TYPE III SECRETION SYSTEM EFFECTOR PROTEIN ORGC GHMVSLSARAAMLNNMDSAPLSNGGDVDLYDAFYQRLLALPESASSETLKDSIYQEMNAFKDPNSGDSAFVSFEQQTAMLQNMLAKVEPGTHLYEALNGVLVGSMNAQSQMTSWMQEIILSGGENKEAIDW 131 T 0.0082 AAA_35 pdbpercent F Bacteria T 6cka 1 A,B A,B A0A0H2UWN8_STRP3 Paratox MLYIDEFKEAIDKGYILGDTVAIVRKNGKIFDYVLPHEKVRDDEVVTVERVEEVMVELDKLEHHHHHH 68 T 0.019 Mesothelin unppercent F Bacteria T 6cl1 3 E,F E,F ACE-1MH-ASP-B3L-PHE-1U8 ACEXDXFX 8 T 16 Arabinose_Isome pdbhh F T 6cl5 1 A,B,C,D,E,F A,B,C,D,E,F Q9KW03_PSEAI TAIL FIBER PROTEIN SGSEFVTAGMALAATDIPGLDASKLVSGVLAEQRLPVFARGLATAVSNSSDPNTATVPLMLTNHANGPVAGRYFYIQSMFYPDQNGNASQIATSYNATSEMYVRVSYAANPSIREWLPWQRCDIGGSFTKTTDGSIGNGVNINSFVNSGWWLQSTSEWAAGGANYPVGLAGLLIVYRAHADHIYQTYVTLNGSTYSRCCYAGSWRPWRQNWDDGNFDPASYLPKAGFTWAALPGKPATFPPSGHNHDTSQITSGILPLARGGLGANTAAGARNNIGAGVPATASRALNGWWKDNDTGLIVQWMQVNVGDHPGGIIDRTLTFPIAFPSACLHVVPTVKEVGRPATSASTVTVADVSVSNTGCVIVSSEYYGLAQNYGIRVMAIGY 384 T 0.0049 H_lectin pdbpssm F Bacteria T 6cl6 1 A,B,C,D,E,F A,B,C,D,E,F G3XD71_PSEAE Tail fiber protein SGSVTAGMALAATDIPGLDASKLVSGVLAEQRLPVFARGLATAVSNSSDPNTATVPLMLTNHANGPVAGRYFYIQSMFYPDQNGNASQIATSYNATSEMYVRVSYAANPSIREWLPWQRCDIGGSFTKEADGELPGGVNLDSMVTSGWWSQSFTAQAASGANYPIVRAGLLHVYAASSNFIYQTYQAYDGESFYFRCRHSNTWFPWRRMWHGGDFNPSDYLLKSGFYWNALPGKPATFPPSAHNHDVGQLTSGILPLARGGVGSNTAAGARSTIGAGVPATASLGASGWWRDNDTGLIRQWGQVTCPADADASITFPIPFPTLCLGGYANQTSAFHPGTDASTGFRGATTTTAVIRNGYFAQAVLSWEAFGR 372 T 0.6 Big_2 pdbpercent F Bacteria T 6cnl 2 M,N,O,P,Q,R,S,T,U,V,W,X M,O,X,V,R,T,W,P,N,U,S,Q PGAM5 Multimerization Motif Peptide GPGVWDPNWDRREP 14 T 1.9 IL17R_fnIII_D2 pdbhh F T 6co4 2 B B PSMD1_HUMAN 26S PROTEASOME REGULATORY SUBUNIT RPN2,26S PROTEASOME REGULATORY SUBUNIT S1,26S PROTEASOME SUBUNIT P112 GPGSQEPEPPEPFEYIDD 18 T 38 AgrD pdbhh F Eukaryota T 6cou 1 A A CSP2_STRPN CSP-2 AMRISRIILXFLFLRKK 17 T 0.95 DUF5841 unphh F Bacteria T 6cp8 1 A,B A,B A0A2A2CAY5_ECOLX CONTACT-DEPENDENT INHIBITOR A SNSFEVSSLPDANGKNHITAVKGDAKIPVDKIELYMRGKASGDLDSLQAEYNSLKDARISSQKEFAKDPNNAKRMEVLEKQIHNIERSQDMARVLEQAGIVNTASNNSMIMDKLLDSAQGATSANRKTSVVVSGPNGNVRIYATWTILPDGTKRLSTVTGTFK 163 T 0.016 DUF1090 pdb F Bacteria T 6cp8 2 C,D C,D A0A2A2C800_ECOLX CdiI SNAMINVNSTAKDIEGLESYLANGYVEANSFNDPEDDALECLSNLLVKDSRGGLSFCKKILNSNNIDGVFIKGSALNFLLLSEQWSYAFEYLTSNADNITLAELEKALFYFYCAKNETDPYPVPEGLFKKLMKRYEELKNDPDAKFYHLHETYDDFSKAYPLNN 164 T 0.06 DUF4007 pdbpssm F Bacteria T 6cp9 1 A,C,E,G A,C,E,G B5Y0C2_KLEP3 FILAMENTOUS HAEMAGGLUTININ FAMILY PROTEIN VPEITTAQTIANSVVDAKKFDYLFGKATGNSHTLDRTNQLALEMKRLGVADDINGHAVLAEHFTQATKDSNNIVKKYTDQYGSFEIRESFFIGPSGKATVFESTFEVMKDGSHRFITTIPKNGVTK 126 T 1.9 Exog_C pdbhh F Bacteria T 6cp9 2 B,D,F,H B,D,F,H CdiI MFIENKPGEIELLSFFESEPVSFERDNISFLYTAKNKCGLSVDFSFSVVEGWIQYTVRLHENEILHNSIDGVSSFSIRNDNLGDYIYAEIITKELINKIEIRIRPDIKIKSSSVIR 116 T 11 Imm50 pdbhh F T 6cpd 1 A,B A,B A0A452CSS7_9RHIZ PmoD SMGNMCMVMFGYDMIHITVFQPDKSRSEYCDEIPATGRTIMAFDIENPAFRDLPLELRIIRDPLTPVLPTGEKELDALTELHLPAKKYSKGTFSVEHNFANNGHYIGLVTLTRESGQQETAQFKFMVG 128 T 0.016 PKD_4 pdbpssm F Bacteria T 6csu 1 A,C B,D CE152_HUMAN CEP152 MGALEELRGQYIKAVKKIKCDMLRYIQESKERAAEMVKAEVLRERQETARKMRK 54 T 25 Viral_cys_rich pdbhh F Eukaryota T 6csu 2 B,D C,A CEP63_HUMAN CEP63 ACLNTRFLEEEELRSHHILERLDAHIEELKRESEKTVRQFTALK 44 T 0.021 HalX pdb F Eukaryota T 6ct4 1 A A O06514_ECOLX MERP PROTEIN PMKKLKLALRLAAKIAPVW 19 T 0.041 Mfp-3 unphh F Bacteria T 6ct8 1 A A G3XD71_PSEAE R2-type pyocin MHHHHHHSSGVDLGTENLYFQSNAGSFTKEADGELPGGVNLDSMVTSGWWSQSFTAQAASGANYPIVRAGLLHVYAASSNFIYQTYQAYDGESFYFRCRHSNTWFPWRRMWHGGDFNPSDYLLKSGFYWNALPGKPATFPPSAHNHDVGQLTSGILPLARGGVGSNTAAGARSTIGAGVPATASLGASGWWRDNDTGLIRQWGQVTCPADADASITFPIPFPTLCLGGYANQTSAFHPGTDASTGFRGATTTTAVIRNGYFAQAVLSWEAFGR 273 T 20 Neuraminidase pdbhh F Bacteria T 6ctg 1 A A AFP_CENMR CM-P1 SRSELIVHQRX 11 T 5.7 Nbs1_C unphh F Eukaryota T 6cuc 1 A A DKTX_CYRSC TAU-TRTX-HS1A,DOUBLE-KNOT TOXIN,DKTX GDCAKEGEVCSWGKKCCDLDNFYCPMEFIPHCKKYKPYVPVTTSFNCAKEGEVCGWGSKCCHGLDCPLAFIPYCEKYRGRND 82 T 0.071 Conotoxin_I2 pdb F Eukaryota T 6cvz 1 A,B,C A,B,C RFWD3_HUMAN RING FINGER AND WD REPEAT DOMAIN-CONTAINING PROTEIN 3,RING FINGER PROTEIN 201 GSPSSQGQHKHKYHFQKTFTVSQAGNCRIMAYCDALSCLVISQPSPQASFLPGFGVKMLSTANMKSSQYIPMHGKQIRGLAFSSYLRGLLLSASLDNTIKLTSLETNTVVQTYNAGRPVWSCCWCLDEANYIYAGLANGSILVYDVRNTSSHVQELVAQKARCPLVSLSYMPRAASAAFPYGGVLAGTLEDASFWEQKMDFSHWPHVLPLEPGGCIDFQTENSSRHCLVTYRPDKNHTTIRSVLMEMSYRLDDTGNPICSCQPVHTFFGGPTCKLLTKNAIFQSPENDGNILVCTGDEAANSALLWDAASGSLLQDLQTDQPVLDICPFEVNRNSYLATLTEKMVHIYKWE 351 T 0.002 WD40_2 pdb F Eukaryota T 6cwp 2 C F VAL-GLU-TYR-THR-LYS-HIS VEYTKH 6 T 16 DUF5428 pdbhh F T 6cxg 3 E,F A,C 10V1S glycopeptide XATKTNSKREKTXDNHVTIXRSIPWYTYRWLPNGSGSGXA 40 T 5.3 Peptidase_C54 pdbhh F T 6czo 2 B,D B,D Q05C46_HUMAN CASC5 protein GAMGHSSILKPPRSPLQDLRGGNETVQESNALRNKKNSRRVSFADTIKVFQTESHMKIVRKS 62 T 2.3 Consortin_C pdbhh F Eukaryota T 6d0y 3 C B PRGC2_HUMAN PPARGC-1-BETA,PGC-1-RELATED ESTROGEN RECEPTOR ALPHA COACTIVATOR XSEEALPASGKSKXEAMDFDSLLKEAQQSLH 31 T 0.074 HALZ pdbpssm F Eukaryota T 6d29 3 C C THR-SER-MET-SER-PHE-VAL-PRO-ARG-PRO-TRP TSMSFVPRPW 10 T 0.99 LSM_int_assoc pdbhh F T 6d2b 3 C C LEU-SER-ASP-SER-THR-ARG-ASP-VAL-THR-TRP LSDSTARDVTW 11 T 47 Alpha_TIF pdbhh F T 6d2c 1 A,B A,B A0A084JZF2_9FLAO Ulvan lyase MRKLKYNTTRVILMIAFISLSACSSEDAMIEEEQVIPDPDPVAQTDEDTGPVVDCTNQGTNPTRDTDIPNPRNIGDIDDRSCYANYSESSILGKFWGIYNITDGSNHMDAPNTLQPRIERSLSRSQATGAGSYARFRGVLRILEVGDTGTFSSSGSYFMQAKGKHTGGGGSPDPAICLYRAHPVYGDDGNGNQVQVSFDIWREQINFRGGSGSAGRTEVFLKNVLKNEQIDIELEVGFRDDPNNPGQTLHYADAKIGGEEFNWNIPEPERGIESGIRYGAYRVKGGRAQFRWANTSYTKDEVN 303 T 0.031 DUF4999 pdbhh F Bacteria T 6d2h 1 A A SER-ARG-PHE-GLU-LEU-ILE-VAL-HIS-GLN-ARG-NH2 SRFELIVHQRX 11 T 7.5 Ribosomal_S10 pdbhh F T 6d2r 3 C C GLY-SER-PHE-ASP-TYR-SER-GLY-VAL-HIS-LEU-TRP GSFDYSGVHLW 11 T 3.6 DUF2399 pdbhh F T 6d2t 3 C C LEU-ALA-LEU-LEU-THR-GLY-VAL-ARG-TRP LALLTGVRW 9 T 3.6 TMEM252 pdbhh F T 6d2u 1 A A DAB-VAL-ARG-THR-ARG-LYS-GLY-ARG-ARG-ILE-NOR-ILE-DPR-PRO XVRTRKGRRIXIXP 14 T 0.35 DUF2835 pdbhh F T 6d37 1 A A ALA-TYR-ALA-GLN-TRP-LEU-ALA-ASP-DAL-GLY-PRO-ALA-SER-DAL-NVA-PRO-PRO-PRO-SER XAYAQWLADXGPASXXPPPSX 21 T 5.6 Sec16_C pdbhh F T 6d3o 3 C,D C,D HH4 alpha/beta-Peptide NCDIHVXXEWXCFXR 15 T 9.4 PHP_C pdbhh F T 6d3u 1 A,B A,B A0A084JZF2_9FLAO Ulvan lyase MRKLKYNTTRVILMIAFISLSACSSEDAMIEEEQVIPDPDPVAQTDEDTGPVVDCTNQGTNPTRDTDIPNPRNIGDIDDRSCYANYSESSILGKFWGIYNITDGSNHMDAPNTLQPRIERSLSRSQATGAGSYARFRGVLRILEVGDTGTFSSSGSYFMQAMGKHTGGGGSPDPAICLYRAHPVYGDDGNGNQVQVSFDIWREQINFRGGSGSAGRTEVFLKNVLKNEQIDIELEVGFRDDPNNPGQTLHYADAKIGGEEFNWNIPEPERGIESGIRYGAYRVKGGRAQFRWANTSYTKDEVN 303 T 0.031 DUF4999 pdbhh F Bacteria T 6d3x 2 C,D C,D SFTI1_HELAN SFTI-1 GRCYKSKPPICFPD 14 T 0.052 Bowman-Birk_leg unp F Eukaryota T 6d3y 2 B C SFTI1_HELAN SFTI-1 GRCTKSRPPICFPD 14 T 0.011 Bowman-Birk_leg pdb F Eukaryota T 6d3z 2 B C SFTI1_HELAN SFTI-1 GRCYKSRPPICFPN 14 T 0.052 Bowman-Birk_leg unp F Eukaryota T 6d40 2 B C SFTI1_HELAN SFTI-1 GRCYKSIPPICFPD 14 T 0.052 Bowman-Birk_leg unp F Eukaryota T 6d53 1 A C Q81AN8_BACCR Hemolysin II GSHMDNQKALEEQMNSINSVNDKLNKGKGKLSLSMNGNQLKATSSNAGYGISYEDKNWGIFVNGEKVYTFNEKSTVGNISNDINKLNIKGPYIEIKQI 98 T 0.0039 CE2_N pdbpercent F Bacteria T 6d5f 1 A,B,C,D,E,F,G,H,I,J,K,L,M,N,O,P,Q,R,S,T,U,V,W,X,Y,Z A,B,C,D,E,F,G,H,I,J,K,L,M,N,O,P,Q,R,S,T,U,V,W,X,Y,Z Fimbrial protein MARKRTSKNDPLRMYLNYVRKLQTMGDAYDESAKYRIANFENGFKSLHMVENEFKQYLANVIDEAIKSGASPQDLPYVNEIKLALMKIFTSWLKYSNEKLGANEIAINVAGTATMTLTENLYGTRVSCEEAVSLINSIFAVWVGVEPFEAEEREGACLVTPRSPLPPVPISSPTGFSAPIQEVLQAKSPEEIIGVKGGA 199 T 0.00082 Sulf_coat_C pdbhh F T 6d6v 4 D E A0A0U8TRG9_TETTH Telomerase holoenzyme Teb2 subunit MSNRVQGGFDNNSGNNQSAQKQQAEKIPQITVPLNCFMINQIVKAAKENPQAHSGNHYEWYGAFENAIITAKFEFLQSINDSPKIMGKLSDSTGCIEVVIQKSKMSDELPEFVQAYEIELQNNGNRHKYVRAMLKMRKNAQIQLLYFSIVNDANEISRHGLDLCLRYLQRKHGIEDFMHMTNDKAHNNHNASAQKVHYQIDRNQQPKEQVLELMRQILKHNPNDQIPKSKIIEFFQSQLNQVQINQILQQLVSANEIFSVGSDNYLLNV 269 T 0.012 HSM3_N pdb F Eukaryota T 6d7a 1 A,B A,B A1E348_TOXGO Perforin-like protein 1 SNAVGLTPQDLSALTGVTRNLPKQLTQATQVAWSGPPPGFAKCPGGQVVILGFAMHLNFKEPGTDNFRIISCPPGREKCDGVGTASSETDEGRIYILCGEEPINEIQQVVAESPAHAGASVLEASCPDETVVVGGFGISVRGGSDGLDSFSIESCTTGQTICTKAPTRGSEKNFLWMMCVDKQYPGLRELVNVAELGSHGNANKRAVNSDGNVDVKCPANSSIVLGYVMEAHTNMQFVRDKFLQCPENASECKMTGKGVDHGMLWLFDRHALFGWIICKTVNEPAMHVATDVGKAKGNGKKKKGRKGKNKTNAPNEVEEGQQLGADSPSQVSVPADADSGPTSKTMSSLKLAPVKLLDL 359 T 10 Cutinase pdb F Eukaryota T 6d7k 4 D,H D,H Q27RN3_METSR Methane monooxygenase hydroxylase, MmoD SNAMAHSAEPTTEASRILIHSDARYEAFTVDLDYMWRWEILRDGEFVQEGCSLSFDSSRKAVAHVLSHFKRQDEAAQRPGDNSAEIKRLLQSLGTPIPVNEQNDSTKNELAQPE 114 T 1.9 DUF1508 pdbhh F Bacteria T 6d7y 1 A A Hemagglutinin IKTVLDTAQAPYKGSTVIGHALSKHAGRHPEIWGKVKGSMSGWNEQAMKHFKEIVRAPGEFRPTMNEKGITFLEKRLIDGRGVRLNLDGTFKGFID 96 T 35 Tcp10_C pdbhh F T 6d7y 2 B B immune protein MKELFEVIFEGVNTSRLFFLLKEIESKSDRIFDFNFSEDFFSSNVNVFSELLIDSFLGFNGDLYFGVSMEGFSVKDGLKLPVVLLRVLKYEGGVDVGLCFYMNDFNSAGKVMLEFQKYMNGISADFGFENFYGGLEPASDQETRFFTNNRLGPLL 155 T 0.017 Psb28 pdbpssm F T 6d94 2 B B MED1_HUMAN ACTIVATOR-RECRUITED COFACTOR 205 KDA COMPONENT,ARC205,MEDIATOR COMPLEX SUBUNIT 1,PEROXISOME PROLIFERATOR-ACTIVATED RECEPTOR-BINDING PROTEIN,PPAR-BINDING PROTEIN,THYROID HORMONE RECEPTOR-ASSOCIATED PROTEIN COMPLEX 220 KDA COMPONENT,TRAP220,THYROID RECEPTOR-INTERACTING PROTEIN 2,TRIP-2,VITAMIN D RECEPTOR-INTERACTING PROTEIN COMPLEX COMPONENT DRIP205,P53 REGULATORY PROTEIN RB18A VSSMAGNTKNHPMLMNLLKDNPAQ 24 T 12 HEAT pdbhh F Eukaryota T 6da1 2 C C serine-rich region (SRR) peptide PSXDSXDXEDXPAALWX 17 T 2.6 SAS-6_N pdbhh F T 6dc8 3 C P TAU_HUMAN NEUROFIBRILLARY TANGLE PROTEIN,PAIRED HELICAL FILAMENT-TAU,PHF-TAU RENAKAKTDHGAEIVYKSPVVSGDTSPRHLX 31 T 0.37 Tmemb_cc2 unp F Eukaryota T 6dc9 3 C,F P,Q TAU_HUMAN NEUROFIBRILLARY TANGLE PROTEIN,PAIRED HELICAL FILAMENT-TAU,PHF-TAU RENAKAKTDHGAEIVYKSPVVSGDTSPRHL 30 T 0.37 Tmemb_cc2 unp F Eukaryota T 6dei 2 C,D C,D DSE3_YEAST DAUGHTER SPECIFIC EXPRESSION PROTEIN 3 SNAFGGTLKLKKRLESVPELFLHD 24 T 4 DUF3805 pdbhh F Eukaryota T 6dex 1 A A Q75DL0_ASHGO ABR011WP SNAERALLQLVVEDDAKALVFVLGQDARRYFEEELPASPFEFPSPQAVANSRQNVGVMFLDKLQYLYMYLTKLEVDEAPEYRTLVVYGLEQLLGAGGELDADQVRLASLIYNTAFRVRVRHGAAVRFVAHGAPHAQLQQLEAHWRLFT 148 T 0.099 CutC pdb F Eukaryota T 6dex 2 B B SHU2_ASHGO Suppressor of hydroxyurea sensitivity protein 2 MAETNFNYSKLLRNLVTEDNVLNEVVVSFLYQLFPRDLFVRAFSLLESADMFIYVWMPTPKEADELLESLYNGTPLYRPIVRPRGPDDRPVCVDLDHWFCSCTEFAATCRPHLVGDTPLSDALFRPTEAADPDDCFGMLAGLQHLRADPEKLMCEHLFAFAILLQTDLRVLRHFSTGPGAQVFVLGITSIDEWLKLHLNVV 201 T 1 SWIM pdbpssm F Eukaryota T 6dfd 1 A,B A,B CNNM3_HUMAN ANCIENT CONSERVED DOMAIN-CONTAINING PROTEIN 3,CYCLIN-M3 GPLGSSEDYRDTVVKRKPASLMAPLKRKEEFSLFKVSDDEYKVTISPQLLLATQRFLSREVDVFSPLRMSEKVLLHLLKHPSVNQEVRFDESNRLATHHYLYQRSQPVDYFILILQGRVEVEIGKEGLKFENGAFTYYGVSALMVPSSVHQSPVSSLQPIRHDLQPDPGDGTHSSMYCPDYTVRALSDLQLIKVTRLQYLNALMATRAQNLPQSPENTDLQMMPGSQTRLLGEKTTTAAGSSHSRPGVPVEGSPGRNPGV 260 T 0.0053 cNMP_binding pdbhh F Eukaryota T 6dfg 1 A,E,I A,C,D Q2N0S6_9HIV1 ENV POLYPROTEIN AENLWVTVYYGVPVWKDAETTLFCASDAKAYETEKHNVWATHACVPTDPNPQEIHLENVTEEFNMWKNNMVEQMHEDIISLWDQSLKPCVKLTPLCVTLQCTNVTNNITDDMRGELKNCSFNMTTELRDKKQKVYSLFYRLDVVQINENQGNRSNNSNKEYRLINCNTSAITQACPKVSFEPIPIHYCAPAGFAILKCKDKKFNGTGPCPSVSTVQCTHGIKPVVSTQLLLNGSLAEEEVIIRSENITNNAKNILVQLNTPVQINCTRPNNNTVKSIRIGPGQAFYYTGDIIGDIRQAHCNVSKATWNETLGKVVKQLRKHFGNNTIIRFAQSSGGDLEVTTHSFNCGGEFFYCNTSGLFNSTWISNTSVQGSNSTGSNDSITLPCRIKQIINMWQRIGQAMYAPPIQGVIRCVSNITGLILTRDGGSTNSTTETFRPGGGDMRDNWRSELYKYKVVKIEPLGVAPTRCKRRVVGR 476 T 3.1E-54 GP120 pdbpercent T Viruses T 6dg5 1 A A Neoleukin-2/15 GSHMPKKKIQLHAEHALYDALMILNIVKTNSPPAEEKLEDYAFNFELILEEIARLFESGDQKDEAEKAKRMKEWMKRIKTTASEDEQEEMANAIITILQSWIFS 104 T 0.0088 UvrD_C pdb F T 6dgp 2 C,D C,D TRAP220 Coactivator Peptide (Mediator of RNA polymerase II transcription subunit 1) NTKNHPMLMNLLKDNPAQD 19 T 8.5 HEAT pdbhh F T 6dhx 1 A,B,C A,B,C T1ZH71_STRIT TipC2 MGSSHHHHHHSQDPMNQPKNIFDEIYQETEKTYRLNNIFNKLTDVEVHSYQEYSDDSKFYPSILYKDIAKTGNYTKIAIDFSFLNKNNNILIYFEKEIGPNVRVRIWNKYTRQDRTLTKSVKIALEKGDSDKYIEDETQVRAYLKKYGITAKDLDAHYEKIVNQKVLKDWCSIYKSKYSPKDYGQVTVKMQWEKW 195 T 0.0095 YflT unppercent F Bacteria T 6dig 3 C C 13-mer peptide: ALA-GLY-ASN-HIS-ALA-ALA-GLY-ILE-LEU-THR-LEU-GLY-LYS AGNHAAGILTLGK 13 T 1.2E-05 Orexin pdbhh F T 6dj3 1 A,B A,B CNNM2_HUMAN ANCIENT CONSERVED DOMAIN-CONTAINING PROTEIN 2,CYCLIN-M2 GPLGSTDLYTDNRTKKKVAHRERKQDFSAFKQTDSEMKVKISPQLLLAMHRFLATEVEAFSPSQMSEKILLRLLKHPNVIQELKYDEKNKKAPEYYLYQRNKPVDYFVLILQGKVEVEAGKEGMKFEASAFSYYGVMALTASPVIDAVTPTLGSSNNQLNSSLLQVYIPDYSVRALSDLQFVKISRQQYQNALMASRMD 199 T 0.0026 cNMP_binding pdbhh F Eukaryota T 6djy 1 A A A0A0A0UEE5_9REOV Clamp protein MTLTYWDKEKRMTLKQMIQQVAINEQENELTHYVFTTPLSMPTFGKPMLGYVPLNEVATSKFFSNVNDFDRDNQLAMAHFPDTTITQAYNLTNSIKPGDTSLPDAEVAALKWFWKFFTSINLVRQPPMDNVMYWACQFLSSGTSFLPLERDVEIVFSGFKGSHICMFSNLRQMNLSPILCPYYDLITNFKTTTEIRAYVDAHEELKSLLTYLCLCTIVGLCDTFTETRNMDTGEYVWKVRDVVSRNHTPAQNVEKFCYTIQNAKYMIQLVHVLLFPLTDNKYADLPNYVAVITQGAINQSRSHNVINTTDESNSNTTSDTAASTSGIVSGDTGTVASLYPDEFKYVQS 348 T 31 DUF5659 pdbhh T Viruses T 6djy 2 B,C B,C A0A0A0U7Z7_9REOV Major capsid protein MRPIRMYKNNQERTNLKHQEINEEQQNEQTTSNQGFTRSDNSGKINIERISSSRNQITDGKTVSSYSKIETNRSSQDSVQHGGSSITYTSDTTGNPRITNARTNNDETHATGPIEDLNSTSHGREPEIESFADRAELAMMIQGMTVGALTVQPMRSIRSTFANLANVLIFHDVFTTEDKPSAFIEYHSDEMIVNMPKQTYNPIDNLAKILYLPSLEKFKYGTGIVQLNYSPHISKLYQNTNNIINTITDGITYANRTEFFIRVMVLMMMDRKILTMEFYDVDTSAISNTAILPTIPTTTGVSPLLRIDTRTEPIWYNDAIKTLITNLTIQYGKIKTVLDANAVKRYSVVGYPIDQYRAYLYNHNLLEYLGKKVKREDIMSLIKALSYEFDLITISDLEYQNIPKWFSDNDLSRFIFSICMFPDIVRQFHALNIDYFSQANVFTVKSENAIVKMLNSNQNMEPTIINWFLFRICAIDKTVIDDYFSLEMTPIIMRPKLYDFDMKRGEPVSLLYILELILFSIMFPNVTQHMLGQIQARILYISMYAFRQEYLKFITKFGFYYKIVNGRKEYIQVTNQNERMTENNDVLTGNLYPSLFTDDPTLSAIAPTLAKIARLMKPTTSLTPDDRAIAAKFPRFKDSAHLNPYSSLNIGGRTQHSVTYTRMYDAIEEMFNLILRAFASSFAQRPRAGVTQLKSLLTQLADPLCLALDGHVYHLYNVMANMMQNFIPNTDGQFHSFRACSYAVKDGGNIYRVVQNGDELNESLLIDTAIVWGLLGNTDSSYGNAIGATGTANVPTKVQPVIPTPDNFITPTIHLKTSIDAICSVEGILLLILSRQTTIPGYEDELNKLRTGISQPKVTERQYRRARESIKNMLGSGDYNVAPLHFLLHTEHRSTKLSKPLIRRVLDNVVQPYVANLDPAEFENTPQLIENSNMTRLQIALKMLTGDMDDIVKGLILHKRACAKFDVYETLTIPTDVKTIVLTMQHISTQTQNNMVYYVFLIDGVKILAEDIKNVNFQIDITGIWPEYVITLLLRAINNGFNTYVSMPNILYKPTITADVRQFMNTTKAETLLISNKSIVHEIMFFDNALQPKMSSDTLALSEAVYRTIWNSSIITQRISARGLMNLEDARPPEAKISHQSELDMGKIDETSGEPIYTSGLQKMQSSKVSMANVVLSAGSDVIRQAAIKYNVVRTQEIILFE 1202 T 0.054 DUF6279 pdb T Viruses T 6djy 3 D D A0A0A0U955_9REOV Turret protein MIDLRLEEDILTATLPEFLSTRPKYRYAYTNTKQQDIRFQGPMRHVRLTHLYKQTKLWNLQYIERELAISEIDDALDEFIQTFSLPYVIEQGTYKYNMLLGMHAHNVNYQDDVSELIANNPQLLNYLDDNPFSAIFELVNVDLQIYQYGQNIFNNEAEHTILFLKDNTNYGVIQALQKHPFSATHINWHLHKHIFVFHSREQLLNKLLSAGLEDSQLYQRQKTYSTKRGDRPTERMVTYIEDDHIRRIQAVFPLLLDNIFDVKLHKDSSMTWLKSYADMIYDSVKNSNSTITPEIRKLYLRMYNQYMRIFLPIEQYMLYDNTCWPFSEKITLKINVRLISSRENQPVLWKTPIDTENLISIVQPDEPINKLNFTAIPSTMIRLNDNITMYRAVKDMFSAIEYLPDAIENIPTLTMKEQALSRYISPDSEAQNFFNNQPPYLNSIMNVNRQVFEAVKRGNIQVSTGSMEHLCLCMHVKSGLIVGRTVLIDDKVVLRRNFNASTAKMITCYVKAFAQLYGEGSLINPGLRMVFFGVETEPAIDILKLFYGDKSLYIQGFGDRGIGRDKFRTKIEDALTLRIGCDILISDIDQADYEDPNEEKFDDITDFVCYVTELVISNATVGLVKISMPTYYIMNKISSTLNNKFSNVAINIVKLSTQKPYTYEAYIMLSHGSTLTNKGYLRNPVCDVYLEKISLQPMDLKIISTISNEINYDKPTLYRFVVDKNDVTDVSIAMHILSIHCSTITTRSVMVRSDNTGAFVTMSGIKDMKRVAIMNRMTDGTSANSYMHEQNGKLYLQKVPYLEDLISAFPNGFGSTYQNDYDSSMSVINVNALIRQVVYRVISKSIPVALLESLSRIRIIGGRDLGEMNAVYKLYKTPIEVYDAVGITREYPHVQISYRAQRYSFTESIPNHTLLLANYVIMNDVDGAPISSLEQINTIKKIISKISLGSIAYIQVYTDIVARNINVMTKNDSFLISANADKTVFKVQVSGYKAVEMCNYEQLLQLVSDNTGVNIIKLTYQDVLESCVLSSGILGDTGSWLLDLVLASTYIIEIRG 1056 T 1.3 Reovirus_L2 pdbhh T Viruses T 6dkm 1 A,C,E,G A,C,E,G DHD131_A GSDESDRIRKIVEESDEIVKESRKLAERARELIKESEDKRVSEERNERLLEELLRILDENAELLKRNLELLKEVLYRTR 79 T 0.18 Syntaxin_2 pdb F T 6dlc 2 B B Designed protein DHD1:234_B HGDPKVVETYVELLKRHEKAVKELLEIAKTHAKKVE 36 T 0.46 Nup54_C pdbhh F T 6dm3 1 A,B A,B Q5ZWF6_LEGPH RavO ESEKIYKVMEEIFVDRHYKENIRTGEEVKQYFSKSKAEFILRWSSANESDTENKYVFIAASFQASDGIHSIRYGINKNGELFSINTASNKVTPIDILPLGVMATLTQHITQNKELIEKAL 120 T 6.9 IBP39 pdbhh F Bacteria T 6dm4 2 B,E,G E,G,H SHC1_HUMAN Shc1 phospho-Tyr317 peptide PSXVNVQ 7 T 0.25 SH3-WW_linker pdbhh F Eukaryota T 6dm9 1 A,C A,C DHD15_extended_A MTREELLRENIELAKEHIEIMREILELLQKMEELLEKARGADEDVAKTIKELLRRLKEIIERNQRIAKEHEYIARERS 78 T 0.0058 Hormone_1 pdb F T 6dmp 1 A A Designed orthogonal protein DHD13_XAAA_A GTKEDILERQRKIIERAQEIHRRQQEILEELERIIRKPGSSEEAMKRMLKLLEESLRLLKELLELSEESAQLLYEQR 77 T 0.0023 Ku_C pdbpercent F T 6dmp 2 B B Designed orthogonal protein DHD13_XAAA_B TEKRLLEEAERAHREQKEIIKKAQELHRRLEEIVRQSGSSEEAKKEAKKILEEIRELSKRSLELLREILYLSQEQKGSLVPR 82 T 0.00053 Prefoldin_2 pdb F T 6dmx 1 A,F E,J HBZ_HTL1A BZIP factor GSHMASGLFRALPVSAPEDLLVEELVDGLLSLEEELKDKEEEKAVLDGLLSLEEESRG 58 T 0.6 Cupin_8 unp T Viruses T 6dnm 1 A A Export chaperone SatS SHMVVLGGDRDFWLQVGIDPIQIMTGTATFYTLRCYLDDRPIFLGRNGRISVFGSERALARYLADEHDHDLSDLSTYDDIRTAATDGSLAVAVTDDNVYVLSGLVDDFADGPDAVDREQLDLAVELLRDIGDYSEDSAVDKALETTRPLGQLVAYVLDPHSVGKPTAPYAAAVREWEKLERFVESRLRRE 190 T 0.04 DUF6482 pdbhh F T 6dno 2 B B PPR3A_RABIT PROTEIN PHOSPHATASE 1 GLYCOGEN-ASSOCIATED REGULATORY SUBUNIT,PROTEIN PHOSPHATASE TYPE-1 GLYCOGEN TARGETING SUBUNIT,RG1 RRVSFADNFGFNLVSVKEFDTWELPSVSTT 30 T 1.3 RSD-2 pdbhh F Eukaryota T 6dnq 1 A E Q2Q067_9DELA BZIP factor GSHMASGLFRALPVSAPEDLLVEELVDGLLSLEEELKDKEEEKAVLDGLLSLEEESRGRLRRGPPGEKAPPRGETHRDR 79 T 0.16 Cupin_8 unp T Viruses T 6dr4 1 A,B,C,D A,C,B,D ORN-CYS-VAL-PHE-MEA-CYS-GLU-ASP-ORN-ALA-VAL-ILE-GLY-LEU-ORN-VAL XCVFXCEDXAVIGLXV 16 T 0.35 Beta-APP pdbhh F T 6dr5 1 A,B,C,D,E,F A,B,C,D,E,F ORT-CYS-VAL-PHE-MEA-CYS-GLU-ASP-ORT-ALA-CHG-ILE-GLY-LEU-ORA-VAL XCVFXCEDXAXIGLXV 16 T 1.1 Beta-APP pdbhh F T 6drd 13 M M GRL1A_HUMAN DNA-DIRECTED RNA POLYMERASE II SUBUNIT M,GLUTAMATE RECEPTOR-LIKE PROTEIN 1A XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXPTQLLSIEESLALQKQQ 62 T 5.2 DUF6465 pdbpssm F Eukaryota T 6drq 1 A A Primosomal protein LGGDRDFWLQVGIDPIQIMTGTATFYTLRCYLDDRPIFLGRNGRISVFGSERALARYLADEHDHDLSDLSTYDDIRTAATDGSLAVAVTDDNVYVLSGLVDDFADGPDAVDREQLDLAVELLRDIGDYSEDSAVDKALETTRPLGQLVAYVLDPHSVGKPTAPYAAAVREWEKLERFVESRLRRE 185 T 0.037 DUF6482 pdbhh F T 6dsl 1 A A Consensus engineered intein CatN EFEALSGDTMIEILDDDGIIQKISMEDLYQRLA 33 T 1 Ish1 pdbhh F T 6dsl 2 B B Consensus engineered intein CatC DYKDDDDKMFKLNTKNIKVLTPSGFKSFSGIQKVYKPFYHHIIFDDGSEIKCSDNHSFGKDKIKASTIKVGDYLQGKKVLYNEIVEEGIYLYDLLNVGEDNLYYTNGIVSHACESRGK 118 T 0.12 DUF3857 pdb F T 6dt7 2 B B Q8LTE3_BPN4 RNAP2 MQTFTAREYLKIDIANNYGLDKEDWDDRIAWFDKNENNLLNLVREAEEPALFYAGVKAWMDVKEGKPIGYPVALDATSSGLQILACLTGDRRAAELCNVVNYRDESGKVKRRDAYTVIYNKMLNTLGKGARIKRNDCKQAIMTALYGSEAKPKEVFGEGIMLNVFESTMNVEAPAVWELNKFWLQCGNPEAFVYHWVMPDGFNVYIKVMVNEVETVHFLDKPYDCVRKVQGTEEKTRMLSANTTHSIDGLVVRELVRRCDYDKNQIEYIKALCNGEAEYKASEKNYGKAMELWGYYEKTGFLTARIFDYLDSETIKLVNTQDILDLIESMPKKPFHVLTVHDCFRCLPNYGNDIRRQYNNLLATIAKGDLLSFIMSQVIGQEVTIGKLDPTLWEDVLETEYALS 404 T 7.5E-42 RNA_pol pdbpercent T Viruses T 6dtd 1 A A E6K398_9BACT nuclease MQKQDKLFVDRKKNAIFAFPKYITIMENKEKPEPIYYELTDKHFWAAFLNLARHNVYTTINHINRRLEIAELKDDGYMMGIKGSWNEQAKKLDKKVRLRDLIMKHFPFLEAAAYEMTNSKSPNNKEQREKEQSEALSLNNLKNVLFIFLEKLQVLRNYYSHYKYSEESPKPIFETSLLKNMYKVFDANVRLVKRDYMHHENIDMQRDFTHLNRKKQVGRTKNIIDSPNFHYHFADKEGNMTIAGLLFFVSLFLDKKDAIWMQKKLKGFKDGRNLREQMTNEVFCRSRISLPKLKLENVQTKDWMQLDMLNELVRCPKSLYERLREKDRESFKVPFDIFSDDYNAEEEPFKNTLVRHQDRFPYFVLRYFDLNEIFEQLRFQIDLGTYHFSIYNKRIGDEDEVRHLTHHLYGFARIQDFAPQNQPEEWRKLVKDLDHFETSQEPYISKTAPHYHLENEKIGIKFCSAHNNLFPSLQTDKTCNGRSKFNLGTQFTAEAFLSVHELLPMMFYYLLLTKDYSRKESADKVEGIIRKEISNIYAIYDAFANNEINSIADLTRRLQNTNILQGHLPKQMISILKGRQKDMGKEAERKIGEMIDDTQRRLDLLCKQTNQKIRIGKRNAGLLKSGKIADWLVNDMMRFQPVQKDQNNIPINNSKANSTEYRMLQRALALFGSENFRLKAYFNQMNLVGNDNPHPFLAETQWEHQTNILSFYRNYLEARKKYLKGLKPQNWKQYQHFLILKVQKTNRNTLVTGWKNSFNLPRGIFTQPIREWFEKHNNSKRIYDQILSFDRVGFVAKAIPLYFAEEYKDNVQPFYDYPFNIGNRLKPKKRQFLDKKERVELWQKNKELFKNYPSEKKKTDLAYLDFLSWKKFERELRLIKNQDIVTWLMFKELFNMATVEGLKIGEIHLRDIDTNTANEESNNILNRIMPMKLPVKTYETDNKGNILKERPLATFYIEETETKVLKQGNFKALVKDRRLNGLFSFAETTDLNLEEHPISKLSVDLELIKYQTTRISIFEMTLGLEKKLIDKYSTLPTDSFRNMLERWLQCKANRPELKNYVNSLIAVRNAFSHNQYPMYDATLFAEVKKFTLFPSVDTKKIELNIAPQLLEIVGKAIKEIEKSENKN 1127 T 0.13 Cdh1_DBD_1 pdbpssm F Bacteria T 6dtn 2 B A (6D6)PPKRIA(NH2), DC100-1 XPPKRIAX 8 T 27 DUF5394 pdbhh F T 6du2 2 C,D C,D REST_HUMAN REST-pS861/4 EDLSPPSPPLPK 12 T 12 Tir_receptor_N pdbhh F Eukaryota T 6dus 1 A,B A,B A0A0H3NMP8_SALTS Type III secretion system effector protein SSGSANFAGVEYPLLPLDQHTPLLFQWFERNPSRFGENQIPIINTQQNPYLNNIINAAIIEKERTIGVLVDGNFSAGQKKALAKLEKQYENIKVIYNSDLDYSMYDKKLSDIYLENIAKIEAQPANVRDEYLLGEIKKSLNEVLKNNPEESLVSSHDKRLGHVRFDFYRNLFLLKGSNAFLEAGKHGCHHLQPGGGCIYLDADMLLTGKLGTLYLPDGIAVHVSRKGNSMSLQNGIIAVNRSEHPALKKGLEIMHSKPYGDPYIDGVCGGLRHYFNCSIRHNYEEFCNFIEFKHEHIFMDTSSLTISSWR 310 T 1.9E-05 Glyco_transf_88 unphh F Bacteria T 6dym 1 A A EBO_DROME Ebony MLKMEAVPLRLEHRQEVIDIIVASFYNKADLEQWLKPGVLRTDYSDILNDIWNVLVERDLSFVVYDTNTDRIIGTALNFDARNEPEVDIKSKLLIVFEFLEFCEGPIRDNYLPKGLNQILHSFMMGTAEKLNPRENIACMHFMEHEVLRVAREKQFAGIFTTNTSPLTQQLADVYHYKTLLNFQVNEYVHSDGSRPFGDAPDEQRAIVHWKEVGKGSHHHHHH 223 T 0.00056 Acetyltransf_9 unphh F Eukaryota T 6dz9 1 A A CPfox2 GSKRFRXPIIFNER 14 T 7.3 Hum_adeno_E3A pdbhh F T 6dza 1 A A CPfox4 GSKRFRFXPEIIFNER 16 T 5.7 PRC2_HTH_1 pdbhh F T 6dzb 1 A A CPfox5 GSRGFRFXPKIIFNER 16 T 1.6 PsbT pdbhh F T 6dzc 1 A A CPfox6 GSRGFRFXPKIIRNER 16 T 2.9 DUF3368 pdbhh F T 6dze 1 A A CPfox7 GSRRFRFXPKIIFNQR 16 T 2.4 PsbT pdbhh F T 6dzi 56 DB 3 A0QTP4_MYCS2 Uncharacterized protein AKRGRKKRDRKHSKANHGKRPNA 23 T 0.16 DUF6254 pdb F Bacteria T 6e10 2 G,I,K,M,O,Q,S B,A,G,F,E,D,C Q8IKC8_PLAF7 Exported protein 2 MKVSYIFSFFLLFFVYKNTNTVVCDNGYGDLAATSALTTVIKDPISLTIKDIYEHGVKNPFTKIIHKLKKFIRYRKVLRWSRMWWVLLVREIVGDNTIEKKTEKALREIWDQCTIAVYNNTLNAVESKPLLFLHGILNECRNNFATKLRQDPSLIVAKIDQIIKSQIYRFWVSEPYLKIGRSHTLYTHITPDAVPQLPKECTLKHLSSYMEEKLKSMESKKNIESGKYEFDVDSSETDSTKDDGKPDDDDDDDDNFDDDDNFDDDTVEEEDASGDLFKNEKKDENKE 287 T 0.086 Y_Y_Y pdbpercent F Eukaryota T 6e10 3 H,J,L,N,P,R,T,V a,g,f,e,d,c,b,h Q8ILA1_PLAF7 Translocon component PTEX150 SVKDIKKLIEEGILDYEDLTENELRKLAKPDDNFYELSPYASDEKDLSLNETSGLTNEQLKNFLGQNGTYHMSYDSKSIDYAKQKKSEKKEDQQEDDDGFYDAYKQIKNSYDGIPNNFNHEAPQLIGNNYVFTSIYDTKENLIKFLKKNSEYDLYDXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 207 T 0.02 Latexin pdb F Eukaryota T 6e11 5 N,O,P,R,T,V,X d,c,b,a,g,f,e Q8ILA1_PLAF7 Translocon component PTEX150 MRIIILALLIVCTIINYYCAVQNNGNKSLNVMPTCSMPGNDSDSNDNETGDVDNDKNNELGNANDNNEMNNENAESKNMQGENSNNQEQLNENVHANDDAMYEGTPSSDNPPQENVDANNNEQEYGPPQEEPVSENNVENVEVATDDSGNDNINNNDNFNNNDNYNDNDNFNEEPPSDDGNKNEDELTEGNQSDDKPMNEEEATINEMGKITNPFEDMLKGKVDDMDIGKMMNKDNLQSFLSSLTGNKDGSGKNPLSDMMNIFGVPQTGKEGAEGGVNKENQMKQINELKDKLETMLKGAGVNVDKIKDSIKNNDLLKNKQLLKEAISKLTLDPSMMNMLNNKDGANGKPFDINPDSMMKMFNALSNENGNLDDLKMKPTDGSFDSFNDGVDNNLVPSNPKGQNNNEEDDEEGGDDDDYDDKSFVVNSKYADNSFEDKFNTFDEKDDDVKYELFGENEEAEELNNNTTTASSKGDANNSVNTQEGEGEEESFSANEENINNNNNHNNKNYNNYNTSQQEEDDNSFNENDEPLISSSQFDNNKKNKMSVSTHNKKSKNLMDSLDLESTNYGSNSSSSMSNNYNSKNKNSKKNNKKKSSQKDYIRTDGKVSFDMATLQKTIKNFGGADNEIVQNILKKYVTIDNDDDNDADEDEDEDDDDDDDLDEDEFSVKDIKKLIEEGILDYEDLTENELRKLAKPDDNFYELSPYASDEKDLSLNETSGLTNEQLKNFLGQNGTYHMSYDSKSIDYAKQKKSEKKEDQQEDDDGFYDAYKQIKNSYDGIPNNFNHEAPQLIGNNYVFTSIYDTKENLIKFLKKNSEYDLYDDDDKEGGNFKSPLYDKYGGKLQKFKRQRAFNILKQWRAKEKKLKEKKKKEEMEENKEFDFSKNYNFSSKNDGGVTMFSKDQLEDMVKNFGGKPSAHVTDSFSRKENPFVPTNTKNNSNDDDDMDNGYVTFDGKNKVSENDDDEKGNNNDDENDNDDSNDEEELDEEEDDN 993 T 0.14 CLP_protease pdbpssm F Eukaryota T 6e1r 1 A,B,C,D,E,F A,B,C,D,E,F A0A221SBY4_9CAUD Tailspike protein NNPNLDMSGWLMNLKGVVNSKVELEGLSGSDGQVVLMTGYYAGQYMGGDHFKYDSTQALINNGVTVINGWVKQFSAGVLTVSACGADPSASDHSAALDLAVNTATSLKRKLVVDFDLRVNTTTELDATLRIEGDGGAVQFSRSITATADIPIFTVKAGFSSESSYFGKLMFKASTGGTATAFRSTSNGYLSQSTFDHCVFDRSLRYGIDANLILCDFQKCDFGTYMSTTNSIGFKAIRSLGVVGTREPNANTFYNCIFRKGTDDCMIEWDSYGTQWHFFACDLEQNLCTEALIKCTASSPIMFVGGYIEANTSTPYVIKTLGNSATGFVPLIKFQGIHMNRPCSVAIGKNTMANYPKYIFEGCYGQLISAVVESSTGVLNDVALIENSIANHFTLATGGSIGDIRTLTMPSGFNADSRNFQAAKITNLTSYKHNYKKTINRDFTVGSSVGVASLSHPSISGASYGGRLLVNAIFGTTAAAGTNSAVYELLVTSVGTAKYISQIGSAGLTSGAAASHPSFTWSINSSNVLVATAVGSTAGRFAMEVFTTGNVQAT 554 T 0.006 Pectate_lyase_3 pdbhh T Viruses T 6e2p 2 C,D C,D LEPR_HUMAN LEPR, LEP-R, HUB219, OB RECEPTOR, OB-R GSHQRMKKLFWEDVPNPKNCSWAQGLNFQKPETFEHLFIKHTASVTCGPLLLEPETISEDISVDTSWKNKDEGNS 75 T 3.5 RCR unphh F Eukaryota T 6e2q 2 E,F,G,H M,N,O,P EPOR_HUMAN EPOR, EPO-R GSGSGSGSGSGSGSSHRRALKQKIWPGIPSPESEFEGLFTTHKGNFQLWLYQNDGCLWWSPCTPFTEDPPASLEVLSERCGNS 83 T 0.0066 IFNGR1 unphh F Eukaryota T 6e3c 1 A C Q5C838_9VIRU Dec protein GSHMANPNFTPSWPLYKDADGVYVSALPIKAIKYANDGSANAEFDGPYADQYMSAQTVAVFKPEVGGYLFRSQYGELLYMSKTAFEANYTSASGSVANAETADKLSTARTITLTGAVTGSASFDGSANVTIETTSGS 137 T 0.067 DUF5853 unppssm T Viruses T 6e3i 2 B B peptide srt.F4 XQRVVHIAAGLRRTGDQLEAYGX 23 T 2.9 PMAIP1 pdbhh F T 6e3j 2 B B peptide srt.F10 XRRVVQIAAGLRRAGDQLEKYGX 23 T 0.79 BID pdbhh F T 6e49 2 D,E,F D,E,F PIF1_YEAST DNA REPAIR AND RECOMBINATION HELICASE PIF1, PETITE INTEGRATION FREQUENCY PROTEIN 1, TELOMERE STABILITY PROTEIN 1 NGIAAMLQRHSRKRFQL 17 T 2.3 CrtO pdbhh F Eukaryota T 6e4h 1 A,B A,B PALB2_MOUSE Partner and localizer of BRCA2 MEELSGKPLSYAEKEKLKEKLAFLKKEYSRTLARLQRAKRAEKAKNSKKAIEDGVPQPEALEHHHHHH 68 T 0.024 DUF1564 pdbpercent F Eukaryota T 6e4j 1 A A I6V394_9EURY Uncharacterized protein PF2048.1 MAHHHHHHGSVVKEKLEKALIEVRPYVEYYNELKALVSKISSSVNDLEEAIVVLREEEKKASEPFKTDIRILLDFLESKP 80 T 0.0002 Rnk_N unppssm F Archaea T 6e4y 3 C P PCSK9_HUMAN NEURAL APOPTOSIS-REGULATED CONVERTASE 1, NARC-1, PROPROTEIN CONVERTASE 9, PC9, SUBTILISIN/KEXIN-LIKE PROTEASE PC9 EDEDGDYEELVLALRSEEDGLA 22 T 8.7 PIN7 pdbhh F Eukaryota T 6e5h 1 A A Designed peptide NC_HEE_D1: Aib turn mutant NDKCKELKKRYXGCEVRCDXPRYEVHCX 28 T 1.1 DUF6410 pdbhh F T 6e5i 1 A A Designed peptide NC_HEE_D1: Orn turn mutant NDKCKELKKRYXCEVRCDXPRYEVHCX 27 T 9.4 DUF2152 pdbhh F T 6e5j 1 A A Designed peptide NC_HEE_D1: Aib turn, beta3 helix, N-methyl hairpin mutant NDXCKXLKXRYXGCEXRCDXPRYEXHCX 28 T 1.1 DUF6410 pdbhh F T 6e5n 1 A B MYO6_HUMAN UNCONVENTIONAL MYOSIN-6 GPLGSRPKMTPEQMAKEMSEFLSRGPAVLATKAAAGTKKYDLSKWKYAELRDTINTSCDIELLAACREEFHRRLKVYHAWKSKNKKR 87 T 0.027 BUD22 unppercent F Eukaryota T 6e66 1 A A NLEB1_ECO27 NleB MLSSLNVLQSSFRGKTALSNSTLLQRPSFAGKEYSLEPIDERTPILFQWFEARPERYEKGEVPILNTKEHPYLSNIINAAKIENERIIGVLVDGNFTYEQKKEFLNLENEHQNIAIIYRADVDFSMYDKKLSDIYLENIHKQESYPASERDNYLLGLLREELKNIPEGKDSLIESYAEKREHTWFDFFRNLAILKAGSLFTETGKTGCHNISPCSGCIYLDADMIITDKLGVLYAPDGIAVHVDCNDEIKSLCNGAIVVNRSNHPALLAGLDIMKSKVDAHPYYDGLGKGIKRHFNYSSLHNYNAFCDFIEEGNPGIIIPNTSMYTSSSW 330 T 1.7E-05 Glyco_transf_88 pdbhh F Bacteria T 6e8k 2 B B IL2RB_HUMAN INTERLEUKIN-2 RECEPTOR SUBUNIT BETA,IL-2RB,HIGH AFFINITY IL-2 RECEPTOR SUBUNIT BETA,INTERLEUKIN-15 RECEPTOR SUBUNIT BETA,P70-75,P75 YFTYDPXSEEDPD 13 T 1.3 Membrane_bind pdbhh F Eukaryota T 6e8m 2 B B DNJA1_HUMAN DNAJ HOMOLOG SUBFAMILY A MEMBER 1,DNAJ PROTEIN HOMOLOG 2,HSDJ,HEAT SHOCK 40 KDA PROTEIN 4,HEAT SHOCK PROTEIN J2,HSJ-2,HUMAN DNAJ PROTEIN 2,HDJ-2 HYNGEAXEDDEHH 13 T 9.7 BLOC1S3 pdbhh F Eukaryota T 6e9e 2 B A B0MS50_9FIRM EsCas13d MGKKIHARDLREQRKTDRTEKFADQNKKREAERAVPKKDAAVSVKSVSSVSSKKDNVTKSMAKAAGVKSVFAVGNTVYMTSFGRGNDAVLEQKIVDTSHEPLNIDDPAYQLNVVTMNGYSVTGHRGETVSAVTDNPLRRFNGRKKDEPEQSVPTDMLCLKPTLEKKFFGKEFDDNIHIQLIYNILDIEKILAVYSTNAIYALNNMSADENIENSDFFMKRTTDETFDDFEKKKESTNSREKADFDAFEKFIGNYRLAYFADAFYVNKKNPKGKAKNVLREDKELYSVLTLIGKLRHWCVHSEEGRAEFWLYKLDELKDDFKNVLDVVYNRPVEEINNRFIENNKVNIQILGSVYKNTDIAELVRSYYEFLITKKYKNMGFSIKKLRESMLEGKGYADKEYDSVRNKLYQMTDFILYTGYINEDSDRADDLVNTLRSSLKEDDKTTVYCKEADYLWKKYRESIREVADALDGDNIKKLSKSNIEIQEDKLRKCFISYADSVSEFTKLIYLLTRFLSGKEINDLVTTLINKFDNIRSFLEIMDELGLDRTFTAEYSFFEGSTKYLAELVELNSFVKSCSFDINAKRTMYRDALDILGIESDKTEEDIEKMIDNILQIDANGDKKLKKNNGLRNFIASNVIDSNRFKYLVRYGNPKKIRETAKCKPAVRFVLNEIPDAQIERYYEACCPKNTALCSANKRREKLADMIAEIKFENFSDAGNYQKANVTSRTSEAEIKRKNQAIIRLYLTVMYIMLKNLVNVNARYVIAFHCVERDTKLYAESGLEVGNIEKNKTNLTMAVMGVKLENGIIKTEFDKSFAENAANRYLRNARWYKLILDNLKKSERAVVNEFRNTVCHLNAIRNININIKEIKEVENYFALYHYLIQKHLENRFADKKVERDTGDFISKLEEHKTYCKDFVKAYCTPFGYNLVRYKNLTIDGLFDKNYPGKDDSDEQK 954 T 0.18 Orthopox_F14 pdbpercent F Bacteria T 6e9f 1 A A B0MS50_9FIRM EsCas13d MGKKIHARDLREQRKTDRTEKFADQNKKREAERAVPKKDAAVSVKSVSSVSSKKDNVTKSMAKAAGVKSVFAVGNTVYMTSFGRGNDAVLEQKIVDTSHEPLNIDDPAYQLNVVTMNGYSVTGHRGETVSAVTDNPLRRFNGRKKDEPEQSVPTDMLCLKPTLEKKFFGKEFDDNIHIQLIYNILDIEKILAVYSTNAIYALNNMSADENIENSDFFMKRTTDETFDDFEKKKESTNSREKADFDAFEKFIGNYRLAYFADAFYVNKKNPKGKAKNVLREDKELYSVLTLIGKLAHWCVASEEGRAEFWLYKLDELKDDFKNVLDVVYNRPVEEINNRFIENNKVNIQILGSVYKNTDIAELVRSYYEFLITKKYKNMGFSIKKLRESMLEGKGYADKEYDSVRNKLYQMTDFILYTGYINEDSDRADDLVNTLRSSLKEDDKTTVYCKEADYLWKKYRESIREVADALDGDNIKKLSKSNIEIQEDKLRKCFISYADSVSEFTKLIYLLTRFLSGKEINDLVTTLINKFDNIRSFLEIMDELGLDRTFTAEYSFFEGSTKYLAELVELNSFVKSCSFDINAKRTMYRDALDILGIESDKTEEDIEKMIDNILQIDANGDKKLKKNNGLRNFIASNVIDSNRFKYLVRYGNPKKIRETAKCKPAVRFVLNEIPDAQIERYYEACCPKNTALCSANKRREKLADMIAEIKFENFSDAGNYQKANVTSRTSEAEIKRKNQAIIRLYLTVMYIMLKNLVNVNARYVIAFHCVERDTKLYAESGLEVGNIEKNKTNLTMAVMGVKLENGIIKTEFDKSFAENAANRYLRNARWYKLILDNLKKSERAVVNEFANTVCALNAIRNININIKEIKEVENYFALYHYLIQKHLENRFADKKVERDTGDFISKLEEHKTYCKDFVKAYCTPFGYNLVRYKNLTIDGLFDKNYPGKDDSDEQK 954 T 0.18 Orthopox_F14 unppercent F Bacteria T 6ee9 1 A X Stress-response Peptide-1 FGVRVGTCPSGYVRRGTFCFPDDDY 25 T 0.013 CPW_WPC pdbhh F T 6ef0 14 N s CCNB_ARBPU model substrate polypeptide SARLGGASIAVQ 12 T 3.9 DUF3182 pdbhh F Eukaryota T 6ef1 14 N s CCNB_ARBPU model substrate polypeptide NENVSARLGGASIAV 15 T 6.3 DUF3182 pdbhh F Eukaryota T 6ef2 14 N s CCNB_ARBPU model substrate polypeptide NNENVSARLGGASIAV 16 T 8.1 DUF3182 pdbhh F Eukaryota T 6ef3 21 U n PSB7_YEAST Proteasome subunit beta type-7 KWDFAKDIKGYGTQK 15 T 2.4 Ice_nucleation pdbhh F Eukaryota T 6ef3 23 W s CCNB_ARBPU Model substrate polypeptide GGKHTFNNENVSARLGGASIAVQAPAQPPPYSHHHHHH 38 T 22 Con-6 pdbhh F Eukaryota T 6ef5 2 E,H S,Q KKCC2_HUMAN ARG-SER-LEU-SEP-ALA-PRO-GLY RSLSAPG 7 T 9.1 DUF6439 pdbhh F Eukaryota T 6ef8 1 A,B,C,D,E,F,G A,B,C,D,E,F,G OMCS_GEOSL OUTER MEMBRANE CYTOCHROME S FHSGGVAECEGCHTMHNSLGGAVMNSATAQFTTGPMLLQGATQSSSCLNCHQHAGDTGPSSYHISTAEADMPAGTAPLQMTPGGDFGWVKKTYTWNVRGLNTSEGERKGHNIVAGDYNYVADTTLTTAPGGTYPANQLHCSSCHDPHGKYRRFVDGSIATTGLPIKNSGSYQNSNDPTAWGAVGAYRILGGTGYQPKSLSGSYAFANQVPAAVAPSTYNRTEATTQTRVAYGQGMSEWCANCHTDIHNSAYPTNLRHPAGNGAKFGATIAGLYNSYKKSGDLTGTQASAYLSLAPFEEGTADYTVLKGHAKIDDTALTGADATSNVNCLSCHRAHASGFDSMTRFNLAYEFTTIADASGNSIYGTDPNTSSLQGRSVNEMTAAYYGRTADKFAPYQRALCNKCHAKD 407 T 9.8E-05 Cytochrom_NNT unphh F Bacteria T 6efe 1 A A CLEA_CONVL Kappa-conotoxin vil14a GGLGRCIYNCMNSGGGLSFIQCKTMCY 27 T 0.024 Eclosion pdbhh F Eukaryota T 6ego 1 A A Hg(II)(GRAND CoilSerL12AL16C)3- EWEALEKKLAAAESKCQALEKKLQALEKKLEALEHG 36 T 0.00015 Cep57_CLD pdb F T 6eik 1 A,B,C,D,E,F,G A,B,C,D,E,F,G CC-Hept-I24E XGEIAKALREIAKALREIAWALREEAKALRGX 32 T 0.019 WXG100 pdbpssm F T 6eiz 1 A,B,C,D,E,F A,B,C,D,E,F CC-Hex2 XGEIAKSLKEIAKSLKEIAWSLKEIAKSLKGX 32 T 0.031 MCPsignal pdbpssm F T 6eje 2 B B SDC1_HUMAN SYND1 PAAEGSGEQDFT 12 T 32 BING4CT pdbhh F Eukaryota T 6ejl 2 C,D C,D M3K5_HUMAN APOPTOSIS SIGNAL-REGULATING KINASE 1,ASK-1,MAPK/ERK KINASE KINASE 5,MEKK 5 RSISLPVP 8 T 4 Imm9 pdbhh F Eukaryota T 6ek1 1 A A A0A452CST7_PSEFL restriction endonuclease PfoI MQKYRLYEKDGSPVQDFNRFVKGWLDIEFGLKEHQPPKVFDTIRDKYNEAIEAVVLSGVAPRTAHKAALSTLTELLFGHDLAKELSARLDIQPIGVGGFRSAHSQAFAKNVGENFVNLMVYALACILKDNDDVLVDKGLPPHLKKALTLSRECRIKDTLREIKIPIEGDLCVFSRSNHCNAIVISAKTRLKEVFHIGTMWALFSDVAKDEYCLNKWGLKVESSESLKDTMYVFATADMINKDGARSQGCDVERETPRNLIAMDASFFDYVFVSKMGIGHVSSDLSLKYGRESLFHELGCIIDMIEQKFDILL 312 T 0.037 ChaB unppercent F Bacteria T 6eka 1 A,B,C,D,E A,B,C,D,E B2B1E9_PODAN Podospora anserina S mat+ genomic DNA chromosome 3, supercontig 2 MKTLSATRACRTGQKFGEMKTDDHSIAMQGIVGVAQPGVDQSFGSLTTTKSSRAFQGQMDAGSFSNLFSKLEHHHHHH 78 T 0.029 Fez1 unppssm F Eukaryota T 6eke 1 A,B,C A,C,B A0A3B6UEU4_9AGAR lectin GAMAPVPVTKLVCDGDTYKCTAYLDFGDGRWVAQWDTNVFHTG 43 T 0.1 C2-set pdbhh F Eukaryota T 6ekj 2 B B CHAP1_HUMAN ZINC FINGER PROTEIN 828 MSASSGPWKPAKPAPSVSPGPWKPIPSVS 29 T 19 FGAR-AT_N pdbhh F Eukaryota T 6ekm 2 B B REV3L_HUMAN PROTEIN REVERSIONLESS 3-LIKE,HREV3 MGDKKIVIMPCKCAPSRQLVQVWLQAKE 28 T 0.37 SAM_3 pdbhh F Eukaryota T 6eko 1 A,B A,B A0A452CST9_PSEFL Restriction endonuclease PfoI MQKYRLYEKDGSPVQDFNRFVKGWLDIEFGLKEHQPPKVFDTIRDKYNEAIEAVVLSGVAPRTAHKAALSTLTELLFGHDLAKELSARLDIQPIGVGGFRSAHSQAFAKNVGENFVNLMVYALACILKDNDDVLVDKGLPPHLKKALTLSRECRIKDTLREIKIPIEGDLCVFSRSNHCNAIVISAATRLKEVFHIGTMWALFSDVAKDEYCLNKWGLKVESSESLKDTMYVFATADMINKDGARSQGCDVERETPRNLIAMDASFFDYVFVSKMGIGHVSSDLSLKYGRESLFHELGCIIDMIEQKFDILL 312 T 0.64 RE_BsaWI pdbhh F Bacteria T 6ekr 1 A A Q93K38_KLEPN Type ii site-specific deoxyribonuclease MDILKEKIDVASRLYNLNLDHIPATLQVIEHAMLLLKNNAGYGYFGSFNGKNTQEYHSFTFNGEYSRPVRDDLFITDYDFFVSGFREFNESLRDIGSKWSSFDSRRANKIIYTSVMSVACCFDLWKSGSRKTPGTFFEIFMAAVLKWMIPDEIFSKHIPLIDQLESDDESIDPSSVSTDIVIKSAYANASVVIPLKITTRERIVQPFAQQRILDSYFGNGVYFSFLACISETQQDKKKKKVNHICVPGTIRLYQKYLSSLSGMYYCDIPERYLERDLTDIIPVRTMGDFLFDIYSFFRSQGAAALEHHHHHH 312 T 0.033 Nop52 pdbpercent F Bacteria T 6ena 1 A A NEMA1_LINLO Nemertide alpha-1 GCIATGSFCTLSKGCCTKNCGWNFKCNPPNQ 31 T 0.001 Conotoxin_I2 pdb F Eukaryota T 6epg 1 A,C,E,G A,C,E,G D5K9E3_NEIGO Epsilon_1 antitoxin MNKVEPQESNAIRMIKEACEKNRRMMTDEAFRKEVEKRLYAGPSPELLAKLRVLWAANKEQ 61 T 1.5 DUF6033 pdbhh F Bacteria T 6er6 1 A B Endonuclease colEdes7 MESKRNKPGKATGKGKPVGDKWLDDAGKDSGAPIPDRIADKLRDKEFKNFDDFRKKFWEEVSKDPDLAKQFKRSNRKRIQQGYAPFAPQKDQVGGRTTFELHHDKPISQDGGVYDMNNIRVTTPKRAIDIHRGK 134 T 0.0047 HNH pdbpssm F T 6ere 1 A,D B,A colicin MESKRNKPGKATGKGKPVGDKWLDDAGKDSGAPIPDRIADKLRDKEFKNFDDFRKKFWKEVAKDPDLAKQFSKANQRNIKDGNAPFARESDQVGGRTTYELHHDKPISQDGGVYDMNNIRVTTPKRAIDIHRGK 134 T 0.0033 HNH pdbpssm F T 6erf 5 Q,R,S,T Q,R,S,T APLF_HUMAN APURINIC-APYRIMIDINIC ENDONUCLEASE APLF,PNK AND APTX-LIKE FHA DOMAIN-CONTAINING PROTEIN,XRCC1-INTERACTING PROTEIN 1 KQQPILAERKRILPTWML 18 T 0.032 PNISR pdbhh F Eukaryota T 6erg 3 C,F C,F NHEJ1_HUMAN PROTEIN CERNUNNOS,XRCC4-LIKE FACTOR SKVKRKKPRGLFS 13 T 2.4 DUF3487 pdbhh F Eukaryota T 6erh 5 G,J M,T NHEJ1_HUMAN PROTEIN CERNUNNOS,XRCC4-LIKE FACTOR LQRPQLSKVKRKKPRGLFS 19 T 7.6 DUF3487 pdbhh F Eukaryota T 6et5 7 AB,BB,G,NA,OA,PA,QA,RA,SA,TA,UA,VA,WA,XA,YA,ZA y,5,2,I,O,R,U,X,a,d,g,j,m,p,s,v LHG_BLAVI Light-harvesting protein B-1015 gamma chain SDWNLWVPLGILGIPTIWIALTYR 24 T 0.72 Proton_antipo_C pdbhh F Bacteria T 6ewc 3 C,G C,G RETR2_HUMAN Reticulophagy regulator 2 RLSSPLHFV 9 T 5.7 Pox_F15 pdbhh F Eukaryota T 6ewo 3 C,G C,G SYNEM_HUMAN DESMUSLIN RTFSPTYGL 9 T 1.6 Adipokin_hormo pdbhh F Eukaryota T 6eww 2 E,F,G,H E,F,G,H KKCC2_HUMAN ARG-LYS-LEU-SEP-LEU-GLN-GLU-ARG RKLSLQER 8 T 3.9 DUF2660 pdbhh F Eukaryota T 6ex9 2 B B Inhibitor Peptide WSYFYDGSYSYYDYE 15 T 2.8 DUF6058 pdbhh F T 6exa 1 A,B A,B Q5ZYC7_LEGPH IcmP (DotM) GPSGGGADVNKGPWAMALTPMEFARKYNLLRKDDALLDNPVPGEEMTAGIEEGDAKRVFTMQLGPYWDGFERCSPQAYALSAVFMARMNRDRDAANNILKVLDKTFVDGKPDFSVARPVMKKYQNSELVQEVVAKHAYVLTVIASLLEAAREDGVVPSSEFLWLKPVDRRLWYMLNCVGRQTPYSEVAGPFAHWKAEKEMGRRSLVPMIDEAIRALEIAVKEVRLTPRQMEELEP 235 T 17 DUF6015 pdbhh F Bacteria T 6exb 1 A,B A,B Q5ZYC7_LEGPH IcmP (DotM) GPSGGGADVNKGPWAMALTPMEFARKYNLLRKDDALLDNPVPGEEMTAGIRRGDAKRVFTMQLGPYWDGFERCSPQAYALSAVFMARMNRDRDAANNILKVLDKTFVDGKPDFSVARPVMKKYQNSELVQEVVAKHAYVLTVIASLLEAAREDGVVPSSEFLWLKPVDRRLWYMLNCVGRQTPYSEVAGPFAHWKAEKEMGRRSLVPMIDEAIRALEIAVKEVRLTPRQMEELEP 235 T 18 DUF6015 pdbhh F Bacteria T 6exc 1 A,B A,B Q5ZYC7_LEGPH IcmP (DotM) GPSGGGADVNKGPWAMALTPMEFARKYNLLRKDDALLDNPVPGEEMTAGIRRGDAKRVFTMQLGPYWDGFERCSPQAYALSAVFMARMNRDRDAANNILKVLDKTFVDGKPDFSVARPVMKKYQNSELVQEVVAKHAYVLTVIASLLEAAREDGVVPSSEFLWLKPVDEELWYMLNCVGRQTPYSEVAGPFAHWKAEKEMGRRSLVPMIDEAIRALEIAVKEVRLTPRQMEELEP 235 T 20 DUF6015 pdbhh F Bacteria T 6exe 1 A,B A,B Q5ZYC7_LEGPH IcmP (DotM) GPSGGGADVNKGPWAMALTPMEFARKYNLLRKDDALLDNPVPGEEMTAGIRRGDAKRVFTMQLGPYWDGFEECSPQAYALSAVFMARMNRDRDAANNILKVLDKTFVDGKPDFSVARPVMKKYQNSELVQEVVAKHAYVLTVIASLLEAAREDGVVPSSEFLWLKPVDRRLWYMLNCVGRQTPYSEVAGPFAHWKAEKEMGRRSLVPMIDEAIRALEIAVKEVRLTPRQMEELEP 235 T 18 DUF6015 pdbhh F Bacteria T 6exj 2 B,D B,D SSR2_RAT SSTR2 XDLQTSI 7 T 39 RLL pdbhh F Eukaryota T 6ey3 1 A A CYS-ARG-PRO-LEU-TRP-THR-ALA-CYS-GLY CRPLWTACG 9 T 0.58 Tmpp129 pdbhh F T 6eyr 1 A,B A,B A0A0H3NMP8_SALTS Type III secretion system effector protein GPLGSYSHTATPAITLPSSGSANFAGVEYPLLPLDQHTPLLFQWFERNPSRFGENQIPIINTQQNPYLNNIINAAIIEKERTIGVLVDGNFSAGQKKALAKLEKQYENIKVIYNSDLDYSMYDKKLSDIYLENIAKIEAQPANVRDEYLLGEIKKSLNEVLKNNPEESLVSSHDKRLGHVRFDFYRNLFLLKGSNAFLEAGKHGCHHLQPGGGCIYLDADMLLTGKLGTLYLPDGIAVHVSRKGNSMSLENGIIAVNRSEHPALKKGLEIMHSKPYGDPYIDGVCGGLRHYFNCSIRHNYEEFCNFIEFKHEHIFMDTSSLTISS 325 T 1.8E-05 Glyco_transf_88 pdbhh F Bacteria T 6eys 1 A,B,C,D B,A,C,D A0A0H2ZBG1_PSEAB PvdP HHHHHHSSGLEVLFQGTTHRHLTREEEPQTPDEASLDLAATDGIRLGDRLRGLWDLRLVGGDAELPGLPREGLQLVLDVAPKGRGLIGYLDTPERLLAAEPPRFRVLGDLLGASSASIRWRLVDQASGSVAPTHDCSAVFDEVWADYANAGDGTLSGRIQRLERSPLSPNEDFRFVAVKRHFPLAHERIVLNEKLLGWLVSPQHRLFHQLWHASRDKWHRLSEKQRNALRGVGWQPGPLDRERDARGPRKDRNASGIDFFFMHRHMLHTARSMQDLPSWERLPRPVVPLEYDRPGFIRYFDNPDGFSVPPAWVAVDDDEYSEWLHGLKSAEAYHANFLVWESQYQDPAYLAKLTLGQFGSELELGMHDWLHMRWASVTRDPSNGAPVMTDRFPADFAPRWFRPENDFLGDPFSSHVNPVFWSFHGWIDDRIEDWYRAHERFHPGEVQRREVEGIQWFAPGRWVEVGDPWLGPATHGCGLSDVQASSNSVELDVETMKLALRIIFSEEDQLSGWLKRAPRRPWYARNLKLARDQLRR 536 T 0.0015 Hemocyanin_M pdbhh F Bacteria T 6eyt 1 A,B A,B A0A0H3NMP8_SALTS Type III secretion system effector protein GPLGSYSHTATPAITLPSSGSANFAGVEYPLLPLDQHTPLLFQWFERNPSRFGENQIPIINTQQNPYLNNIINAAIIEKERTIGVLVDGNFSAGQKKALAKLEKQYENIKVIYNSDLDYSMYDKKLSDIYLENIAKIEAQPANVRDEYLLGEIKKSLNEVLKNNPEESLVSSHDKRLGHVRFDFYRNLFLLKGSNAFLEAGKHGCHHLQPGGGCIYLDADMLLTGKLGTLYLPDGIAVHVSRKGNSMSLENGIIAVNRSEHPALKKGLEIMHSKPYGDPYIDGVCGGLRHYFNCSIRHNYEEFCNFIEFKHEHIFMDTSSLTISSWR 327 T 1.8E-05 Glyco_transf_88 pdbhh F Bacteria T 6eyx 1 A,B A,B Q9XJC1_9CAUD AcrIIa6 MKINDDIKELILEYMSRYFKFENDFYKLPGIKFTDANWQKFKNGGTDIEKMGAARVNAMLDCLFDDFELAMIGKAQTNYYNDNSLKMNMPFYTYYDMFKKQQLLKWLKNNRDDVIGGTGRMYTASGNYIANAYLEVALESSSLGSGSYMLQMRFKDYSKGQEPIPSGRQNRLEWIENNLENIRLE 185 T 0.062 PDH_E1_M pdb T Viruses T 6eyy 1 A,B A,B A0A1S5PRR0_9CAUD AcrIIa6 MKINDDIKELILEYMSRYFKFENDFYKLPGIKFTDANWQKFKNGGTDIEKMGAARVNAMLDCLFDDFELAMIGKAQTNYYNDNSLKMNMPFYTYYDMFKKQQLLKWLKNNRDDVIGGTGRMYTASGNYIANAYLEVALESSSLGSGSYMLQMRFKDYSKGQEPIPSGRQNRLEWIENNLENIR 183 T 0.06 PDH_E1_M pdb T Viruses T 6f09 2 B,D,F,H A,B,C,D UBP8_HUMAN DEUBIQUITINATING ENZYME 8,UBIQUITIN ISOPEPTIDASE Y,HUBPY,UBIQUITIN THIOESTERASE 8,UBIQUITIN-SPECIFIC-PROCESSING PROTEASE 8 KLKRSYSSPDITQ 13 T 1.9 STAC2_u1 pdbhh F Eukaryota T 6f0f 2 B B ip2_s GAMGTLTPKEAELARRIRGAGGRTLNGFG 29 T 0.032 cIII pdb F T 6f0g 2 C,D C,D ip3 ASTERKWAELARRIRGAGGVTLNGFG 26 T 1.7 ELF pdbhh F T 6f0h 2 B,D B,D ip4 ASTEEKWARLARRIAGAGGVTLDGFG 26 T 1.2 DUF1654 pdbhh F T 6f0w 2 B S B3RXX2_TRIAD Hypoxia inducible factor, alpha subunit EKEDYDDLAPFVPPPSFDNRL 21 T 0.096 Pilt unppercent F Eukaryota T 6f0y 2 B B RT109_YEAST histone acetyltransferase Rtt109 C-terminus LAITMLKPRKKAKAL 15 T 2.9 SOXp pdbhh F Eukaryota T 6f1s 1 A A H7C664_CORGT CglIIR protein MPTRANVLDKRKVGNLSGGVNYFAADPRIKNVEALDKKLLAYLDKHGEDSTIGMRAIITILNAFTVDPNDLDLATFKAALLDFERNQPHLTARMVLRTNRKVNQGTGALLSPTDQALSRAEVAHPLLILYRIEGVNDAAAQRGEPTWSSDPIWVPNIKLPGQRQFWCVDGGHHHHHHG 178 T 0.024 Imm30 pdbpssm F Bacteria T 6f34 2 B C MGTS_ECOLI MgtS MLGNMNVFMAVLGIILFSGFLAAYFSH 27 T 0.23 Gram_pos_anchor unp F Bacteria T 6f36 3 L N D7P897_9CHLO Mitochondrial ATP synthase subunit ASA6 MMLRTLTRSSAVAGQAVRLFKTSAAAAEGNSVAGIIKSVNETSGANLLSSLKTIKAQAAPIYPAAASSTGYSTQAKIALFGALSWILYRADGQSKAHEWIVDLNLNVLQAAWLISFSSLIPFRAVYFAFRGMAPATASTLNGLKTFSSISL 151 T 0.00025 NrsF pdbpercent F Eukaryota T 6f46 1 A A B2CL1_HUMAN BCL2-L-1,APOPTOSIS REGULATOR BCL-X GSGESRKGQERFNRWFLTGMTVAGVVLLGSLFSRK 35 T 0.014 DUF3094 pdbpssm F Eukaryota T 6f4p 2 B B RS6_HUMAN PHOSPHOPROTEIN NP33,SMALL RIBOSOMAL SUBUNIT PROTEIN ES6 VPRRLGPKRASRIRKL 16 T 41 DUF6408 pdbhh F Eukaryota T 6f4q 2 B B RS6_HUMAN PHOSPHOPROTEIN NP33,SMALL RIBOSOMAL SUBUNIT PROTEIN ES6 VPRRLGPKRCSRIRKL 16 T 40 DUF6408 pdbhh F Eukaryota T 6f55 2 B B TRPV4_CHICK TRANSIENT RECEPTOR POTENTIAL CATION CHANNEL SUBFAMILY V MEMBER 4 TKGPAPNPPPILKVW 15 T 0.1 Ank_2 unppercent F Eukaryota T 6f61 1 A A A0A4P1LYD9_9ARAC purotoxin-6 GYCATKGIKCNDIHCCSGLKCDSKRKVCVKG 31 T 0.0041 Toxin_7 pdb F Eukaryota T 6f8f 2 B G PDX1_HUMAN PDX-1,GLUCOSE-SENSITIVE FACTOR,GSF,INSULIN PROMOTER FACTOR 1,IPF-1,INSULIN UPSTREAM FACTOR 1,IUF-1,ISLET/DUODENUM HOMEOBOX-1,IDX-1,SOMATOSTATIN-TRANSACTIVATING FACTOR 1,STF-1 PEQDCAVTSGE 11 T 2.5 Rieske_3 pdbhh F Eukaryota T 6f8g 2 E,F,G,H E,F,G,H PDX1_MESAU HOMEODOMAIN PROTEIN PDX1,INSULIN PROMOTER FACTOR 1,IPF-1 EPEQDSAVTSGE 12 T 52 DUF5577 pdbhh F Eukaryota T 6f9i 2 B,D X,C CSTN1_MOUSE ALCADEIN-ALPHA,ALC-ALPHA NATRQLEWDDSTLSY 15 T 0.0031 CDC45 unppercent F Eukaryota T 6f9w 2 B B 4ET_HUMAN EIF4E TRANSPORTER,EUKARYOTIC TRANSLATION INITIATION FACTOR 4E NUCLEAR IMPORT FACTOR 1 GPLGSGLAKWFGSDMLQQPLPSMPAKVISVDELEYRQ 37 T 0.18 AbfS_sensor pdbpssm F Eukaryota T 6fad 1 A,B,C,D A,B,C,D SRPK1_HUMAN SFRS PROTEIN KINASE 1,SERINE/ARGININE-RICH PROTEIN-SPECIFIC KINASE 1,SR-PROTEIN-SPECIFIC KINASE 1 GSHMPEQEEEILGSDDDEQEDPNDYCKGGYHLVKIGDLFNGRYHVIRKLGWGHFSTVWLSWDIQGKKFVAMKVVKSAEHYTETALDEIRLLKSVRNSDPNDPNREMVVQLLDDFKISGVNGTHICMVFEVLGHHLLKWIIKSNYQGLPLPCVKKIIQQVLQGLDYLHTKCRIIHTDIKPENILLSVNEQYIRRLAAEATEWQRSGAPPPSGSAVSTAPQPKPADKMSKNKKKKLKKKQKRQAELLEKRMQEIEEMEKESGPGQKRPNKQEESESPVERPLKENPPNKMTQEKLEESSTIGQDQTLMERDTEGGAAEINCNGVIEVINYTQNSNNETLRHKEDLHNANDCDVQNLNQESSFLSSQNGDSSTSQETDSCTPITSEVSDTMVCQSSSTVGQSFSEQHISQLQESIRAEIPCEDEQEQEHNGPLDNKGKSTAGNFLVNPLEPKNAEKLKVKIADLGNACWVHKHFTEDIQTRQYRSLEVLIGSGYNTPADIWSTACMAFELATGDYLFEPHSGEEYTRDEDHIALIIELLGKVPRKLIVAGKYSKEFFTKKGDLKHITKLKPWGLFEVLVEKYEWSQEEAAGFTDFLLPMLELIPEKRATAAECLRHPWLNS 618 T 1.3E-06 WaaY unphh F Eukaryota T 6fau 2 B B TAU_HUMAN ACE-ARG-THR-PRO-SEP-LEU-PRO-GLY XRTPSLPG 8 T 6.1 UPF0167 pdbhh F Eukaryota T 6fbk 2 B P WNK1_HUMAN ERYTHROCYTE 65 KDA PROTEIN,P65,KINASE DEFICIENT PROTEIN,PROTEIN KINASE LYSINE-DEFICIENT 1,PROTEIN KINASE WITH NO LYSINE 1,HWNK1 LTQVVHSAGRRFIVSPVPESRLR 23 T 0.73 NUC pdbhh F Eukaryota T 6fbw 2 B,D B,D TAU_HUMAN ARG-THR-PRO-SEP-LEU-PRO-GLY RTPSLPG 7 T 4.1 UPF0167 pdbhh F Eukaryota T 6fc1 2 B,D B,D EAP1_YEAST EIF4E-ASSOCIATED PROTEIN 1 GPHMTDPITNYKPMDLQYKTYAYSMNELYHLKPSLASASYEEDPLISELVRSLPKRKFWRLRMG 64 T 0.047 CNTF pdbpssm F Eukaryota T 6fc6 2 B B BIM1_YEAST Protein BIM1 SNNLIIDEETF 11 T 1.8 DUF3797 pdbhh F Eukaryota T 6fcp 2 B P SHRM3_HUMAN SHROOM-RELATED PROTEIN,HSHRML AGPVHVRSRSSLATA 15 T 2.2 CCDC14 unphh F Eukaryota T 6fdt 2 B B HS71B_HUMAN HEAT SHOCK 70 KDA PROTEIN 2,HSP70.2 SGPTIEEVD 9 T 6 DUF3567 pdbhh F Eukaryota T 6fe8 3 D D CBF3C_YEAST CHROMOSOME TRANSMISSION FIDELITY PROTEIN 13,KINETOCHORE PROTEIN CTF13 MGPSFNPVRFLELPIDIRKEVYFHLDGNFCGAHPYPIDILYKSNDVELPGKPSYKRSKRSKKLLRYMYPVFATYLNIFEYSPQLIEKWLEYAFWLRYDCLVLDCFKVNHLYDGTLIDALEWTYLDNELRLAYFNKASMLEVWYTFKEYKKWVIDSVAFDELDLLNVSNIQFNIDNLTPQLVDKCLSILEQKDLFATIGEVQFGQDEEVGEEKDVDVSGANSDENSSPSSTIKNKKRSASKRSHSDNGNVGATHNQLTSISVIRTIRSMESMKSLRKITVRGEKLYELLINFHGFRDNPGKTISYIVKRRINEIRLSRMNQISRTGLADFTRWDNLQKLVLSRVAYIDLNSIVFPKNFKSLTMKRVSKIKWWNIEENILKELKVDKRTFKSLYIKEDDSKFTKFFNLRHTRIKELDKSEINQITYLRCQAIVWLSFRTLNHIKLQNVSEVFNNIIVPRALFDSKRVEIYRCEKISQVLVIGSRSGSENLYFQGSKRRWKKNFIAVSAANRFKKISSSGAL 519 T 0.088 Glft2_N unppercent F Eukaryota T 6fel 2 E,F,G,H E,F,G,H KKCC2_HUMAN CAMKK 2,CALCIUM/CALMODULIN-DEPENDENT PROTEIN KINASE KINASE BETA,CAMKK BETA RSLSAPGN 8 T 13 DUF6439 pdbhh F Eukaryota T 6fgm 1 A A ALA-CYS-PHE-LEU-THR-ARG-LEU-GLY-THR-TYR-VAL-CYS ACFLTRLGTYVC 12 T 3.9 zf-C3HC pdbhh F T 6fkr 53 AB 1y Tur1A peptide RRIRFRPPYLPRPGRRPRFPPP 22 T 13 Consortin_C pdbhh F T 6fkz 2 C E 3(S)-(phenylthio)succinyl-CPS1 peptide XVLXEYGV 8 T 25 Sulf_coat_C pdbhh F T 6fm1 1 A,B A,B G3FFN6_9CAUD Adenylosuccinate synthetase MGSSHHHHHHSSGLVPRGSHMKNVDLVIDLQFGSTGKGLIAGYLAEKNGYDTVINANMPNAGHTYINAEGRKWMHKVLPNGIVSPNLKRVMLGAGSVFSINRLMEEIEMSKDLLHDKVAILIHPMATVLDEEAHKKAEVGIATSIGSTGQGSMAAMVEKLQRDPTNNTIVARDVAQYDGRIAQYVCTVEEWDMALMASERILAEGAQGFSLSLNQEFYPYCTSRDCTPARFLADMGIPLPMLNKVIGTARCHPIRVGGTSGGHYPDQEELTWEQLGQVPELTTVTKKVRRVFSFSFIQMQKAMWTCQPDEVFLNFCNYLSPMGWQDIVHQIEVAAQSRYCDAEVKYLGFGPTFNDVELREDVM 363 T 1.1E-68 Adenylsucc_synt pdbpercent T Viruses T 6fmb 1 A A N1JJ94_BLUG1 CSEP0064 putative effector protein AAAYWDCDGTEIPERNVRAAVVLAFNYRKESFHGYPATFIIGSTFSGVGEVRQFPVEDSDANWQGGAVKYYILTNKRGSYLEVFSSVGSGNKCTFVEG 98 T 18 T2SS_PulS_OutS unphh F Eukaryota T 6fmp 2 C C ACY-ASP-GLU-GLU-THR-GLY-GLU-PHE XDEETGEF 8 T 4 DUF4585 pdbhh F T 6fos 14 O O M1VFJ4_CYAM1 PsaM SSLRMFEVSDGEPYPLNPAVIFIALIGWSAVAAIPSNIPVLGGTGLTQAFLASIQRLLAQYPTGPKLDDPFWFYLIVYHVGLFALLIFGQIGYAGYAK 98 T 23 YbgT_YccB unphh F Eukaryota T 6fpf 1 A,B,C,D A,C,D,E CMU1_USTMA Chromosome 16, whole genome shotgun sequence MAAVSGKSEAAEIEAGDRLDALRDQLQRYETPIIQTILARSALGGRAPSEQDEVRAALSRNAFEPSEVISEWLQTESGARFRSTRPLPPAVEFITPVVLSRDTVLDKPVVGKGIFPIGRRPQDPTNMDEFLDTSLLSLNQSSTVDLASAVSLDVSLLHLVSARVLLGYPIALAKFDWLHDNFCHILTNTTLSKSQKLANIIQQLTDHKQEVNVLSRVEQKSKSLSHLFRNDIPYPPHTQDRILRLFQAYLIPITTQIEAAAILDHANKCTLEHHHHHH 278 T 0.26 CM_2 pdbpssm F Eukaryota T 6fq4 2 B B Q824H6_CHLCV TarP-VBS1 LLEAARNTTTMLSKTLSKVC 20 T 0.02 SipA_VBS pdb F Bacteria T 6fto 2 C C MIT1_SCHPO MI2-LIKE INTERACTING WITH CLR3 PROTEIN 1,SNF2/HDAC-CONTAINING REPRESSOR COMPLEX PROTEIN MIT1,SHREC PROTEIN MIT1 MPKEDDSLCKIVVRREPLDVLLPYYDASETTVQKILHENDSTLSVKFLAGVEALIKKDELDKYKNGKACLRVWLKHKSGKR 81 T 0.12 Mad3_BUB1_I pdb F Eukaryota T 6fub 1 A B C4B8C2_MAGOR AVR-Pik protein METGNKYIEKRAIDLSRERDPNFFDNPGIPVPECFWFMFKNNVRQDAGTCYSSWKMDMKVGPNWVHIKSDDNCNLSGDFPPGWIVLGKKRPGF 93 T 0.086 TMEM18 unp F Eukaryota T 6fud 2 B B C4B8B9_MAGOR AVR-PIKM PROTEIN METGNKYIEKRAIDLSRERDPNFFDNADIPVPECFWFMFKNNVRQDAGTCYSSWKMDMKVGPNWVHIKSDDNCNLSGDFPPGWIVLGKKRPGF 93 T 0.34 DIM unppssm F Eukaryota T 6fwn 1 A A VWF_HUMAN VWF GSMASACEVVTGSPRGDSQSSWKSVGSQWASPENPCLINECVRVKEEVFIQQRNVSCPQLEVPVCPSGFQLSCKTSACCPSCRCE 85 T 2.2 Antistasin pdbhh F Eukaryota T 6g0o 2 B B ATRX_HUMAN ATP-DEPENDENT HELICASE ATRX,X-LINKED HELICASE II,X-LINKED NUCLEAR PROTEIN,XNP,ZNF-HX HFPXGIXQIKY 11 T 0.27 PI_PP_C pdbhh F Eukaryota T 6g0p 2 B B E2F1_HUMAN E2F-1,PBR3,RETINOBLASTOMA-ASSOCIATED PROTEIN 1,RBAP-1,RETINOBLASTOMA-BINDING PROTEIN 3,RBBP-3,PRB-BINDING PROTEIN E2F-1 HPGXGVXSPGEKSRYE 16 T 0.17 Cucumo_2B pdbhh F Eukaryota T 6g0q 2 B B GATA1_HUMAN ERYF1,GATA-BINDING FACTOR 1,GF-1,NF-E1 DNA-BINDING PROTEIN ASGXGKXKRGY 11 T 9.7 RELT pdbhh F Eukaryota T 6g3j 3 C,F C,F MET-THR-SER-ALA-ILE-GLY-ILE-LEU-PRO-VAL MTSAIGILPV 10 T 5 CLLAC pdbhh F T 6g3k 3 C,F C,F ILE-THR-SER-GLY-ILE-GLY-VAL-LEU-PRO-VAL ITSGIGVLPV 10 T 2.9 NAD_binding_6 pdbhh F T 6g41 1 A,B,C,D,E,F,G,H,I,J A,B,C,D,E,F,G,H,I,J A0A1L4BKA3_9VIRU Minor capsid protein GAMGMKQYIWLNETIKSNKQLAGPRGSYKRPVSVDIFRSSTILDPDKNYLLIVEEFHLHKIRLPLFKPAGHDYQVGIFNRSTDEIMGVREVDFSTFVDEDGYMYDYVDVGTAINETLAGLCDGIIGEEDIPVFSFNKHSKKFEITTTENFRNGHFIMFNDDMRVDFNSFEFDDIDEEYSLVILNEDVETQDASTLEFLTPISHIVIESNDLPVSYELLPSISKNTTISDNTGVFLTNYKYLQQNNQDYNSILFRVENSSNKYHNILQTNFNRFNLSFTIYDYDNEKHPLTLLPQTVIQLKLLFESID 307 T 0.051 SIN1 unp T Viruses T 6g43 1 A,B,C A,B,C A0A1L4BK98_9VIRU Putative major capsid protein GAMGMNTPPELDTVLQAPYAYNWPTSKNVKIASRIGIPYSTFQTIQPVSDAPNNGIGQITFNQPLGNLTGGAPRLRVSFTAEIKNILADSSLKDQIGLKSFPVNRSIPVAVINMNGKTFTSYPAQLIKLHQYNADPLELALLSPCSDVDEYNKIKAVSMNNPYRQGTESTDSRMSRGLGCNYAYYIHPRAAGSTSVKIDFVVDEALVANPTQYKNIKDPVPFRNLNTFKVILDGQFKPENMIGIADDVKLVAGKADFEVDITGFKINMLVQNWVAPLEIGDIPKTIIYNTPLISLEGNISSMCLNTKDPYGIPGERNKHILTTHSMAMNNVPSMFAVMVSQETPTKKFAPDQLAGIIGLEIKVDSDVGIFRELEQQQLYELSSSNGYNKRFSCFSGALANGLTVADPAVAAGNKFKEAIFGAGSVIFFRPSDLGLKDYNVMANANKSINMQVQATFVTPEAAGTGAHYKLEVFSIRDNLTYSFEDGTFMDDLTLYTPDQLLRSPLKLTDDNNKLMRVMGG 520 T 16 DUF4223 pdbhh T Viruses T 6g45 1 A,B,C A,B,C A0A1L4BK98_9VIRU Putative major capsid protein GAMGMNTPPELDTVLQAPYAYNWPTSKNVKIASRIGIPYSTFQTIQPVSDAPNNGIGQITFNQPLGNLTGGAPRLRVSFTAEIKNILADSSLKDQIGLKSFPVNRSIPVAVINMNGKTFTSYPAQLIKLHQYNADPLELALLSPCSDVDEYNKIKAVSMNNPYRQGTESTDSRMSRGLGCNYAYYIHPRAAGSTSVKIDFVVDEALVANPTQYKNIKDPVPFRNLNTFKVILDGQFKPENMIGIADDVKLVAGKADFEVDITGFKINMLVQNWVAPLEIGDIPKTIIYNTPLISLEGNISSMCLNTKDPYGIPGERNKHILTTHSMAMNNVPSMFAVMVSQETPTKKFAPDQLAGIIGLEIKVDSDVGIFRELEQQQLYELSSSNGYNKRFSCFSGALANGLTVADPAVAAGNKFKEAIFGAGSVIFFRPSDLGLKDYNVMANANKSINMQVQATFVTPEAAGTGAHYKLEVFSIRDNLTYSFEDGTFMDDLTLYTPDQLLRSPLKLTDDNNKLMRVMGGSFMGDVMTNFNHMAAHPVTKTVTKLLRNAGPLKDYAGDGTMMGNIASVYGYGKKKTTTRKKKGGEIVLLGSGKKGGKKLSDKQLHDLRNL 610 T 19 DUF4223 pdbhh T Viruses T 6g52 1 A,B,C,D,E,F,G,H,I I,B,C,D,E,F,G,H,A CNNM4_HUMAN ANCIENT CONSERVED DOMAIN-CONTAINING PROTEIN 4,CYCLIN-M4 AGMKISPQLLLAAHRFLATEVSQFSPSLISEKILLRLLKYPDVIQELKFDEHNKYYARHYLYTRNKPADYFILILQGKVEVEAGKENMKFETGAFSYYGTMALTSVPSDRSPAHPTPLSRSASLSYPDRTDVSTAATLAGSSNQFGSSVLGQYISDFSVRALVDLQYIKITRQQYQNGLLASRMENSPQ 189 T 0.00033 cNMP_binding pdb F Eukaryota T 6g57 1 A,B,C,D A,B,C,D KCTD8_HUMAN BTB/POZ domain-containing protein KCTD8 SMAQDKRSGFLTLGYRGSYTTVRDNQADAKFRRVARIMVCGRIALAKEVFGDTLNESRDPDRQPEKYTSRFYLKFTYLEQAFDRLSEAGFHMVACNSSGTAAFVNQYRDDKIWSSYTEYIFFRP 124 T 0.61 GFRP pdbhh F Eukaryota T 6g5g 2 C P SYT2_HUMAN SYNAPTOTAGMIN II,SYTII GESQEDMFAKLKEKLFNEINK 21 T 0.64 DUF4312 pdbhh F Eukaryota T 6g65 1 A,B,C,D,E,F,G,H,I,J,K,L A,B,C,D,E,F,G,H,I,J,K,L CC-Type2-VV XGEVAQAVKEVAKAVKEVAWAVKEVAQAVKGX 32 T 0.0064 MCPsignal pdbpssm F T 6g66 1 A,B,C,D,E,F,G A,B,C,D,E,F,G CC-Type2-IV XGEVAQAIKEVAKAIKEVAWAIKEVAQAIKGX 32 T 0.007 DUF1241 pdb F T 6g69 1 A,B,C,D,E,F,G,H,I,J,K,L,M,N A,B,C,D,E,F,G,H,I,J,K,L,M,N CC-Type2-IL-Sg-L17E XGELAQSIKELAKSIKEEAWSIKELAQSIKGX 32 T 0.18 HTH_52 pdbhh F T 6g6e 1 A,B,C,D,E,F,G A,B,C,D,E,F,G CC-Type2-deLI XGEIAQAXKEIAKAXKEIAWAXKEIAQAXKGX 32 T 2.8 DUF1328 pdbpssm F T 6g6g 1 A,B,C,D,E,F,G,H,I,J,K,L A,B,C,D,E,F,G,H,I,J,K,L CC-Type2-FI XGEIAQAFKEIAKAFKEIAWAFKEIAQAFKGX 32 T 0.029 WXG100 pdbpssm F T 6g6h 1 A,B,C,D,E A,B,C,D,E 5H2L_2.1-I9L XTQEYLLKELMKLLKEQIKLLKEQIKMLKELEKQX 35 T 0.027 DUF5320 pdbhh F T 6g6x 2 B P YAP1_HUMAN YES-ASSOCIATED PROTEIN 1,PROTEIN YORKIE HOMOLOG,YES-ASSOCIATED PROTEIN YAP65 HOMOLOG XRAHSSXASLQ 11 T 0.00014 FAM181 unp F Eukaryota T 6g84 2 B D CBK1_YEAST CBK1 FTDVPALNYPATPPPH 16 T 0.8 CbtA pdbhh F Eukaryota T 6g84 3 D C CBK1_YEAST CBK1 AFTDVPALNYPATPPPH 17 T 0.48 CbtA pdbhh F Eukaryota T 6g86 2 C,D D,C SIC1_YEAST CDK INHIBITOR P40 PSTTKSFKNAPLLAPP 16 T 12 BHD_1 pdbhh F Eukaryota T 6g8i 2 B P YAP1_HUMAN ALA-HIS-SEP-SER-PRO-ALA-SER-LEU-GLN XXAHSSPASLQ 11 T 0.00014 FAM181 unp F Eukaryota T 6g8j 2 B P YAP1_HUMAN ACE-ARG-ALA-HIS-SEP-SER-PRO-BAL-SER-LEU-GLN XRAHSSPXSLQ 11 T 0.00014 FAM181 unp F Eukaryota T 6g8k 2 B P YAP1_HUMAN ACE-ARG-ALA-HIS-SEP-SER-PRO-ALA-BSE-LEU-GLN XRAHSSPAXLQ 11 T 0.00014 FAM181 unp F Eukaryota T 6g8l 2 B P YAP1_HUMAN ACE-ARG-ALA-HIS-SEP-SER-PRO-ALA-SER-BLE-GLN XRAHSSPASXQ 11 T 0.00014 FAM181 unp F Eukaryota T 6g8p 2 B P YAP1_HUMAN YES-ASSOCIATED PROTEIN 1,PROTEIN YORKIE HOMOLOG,YES-ASSOCIATED PROTEIN YAP65 HOMOLOG XRAHSSXASXQ 11 T 0.00014 FAM181 unp F Eukaryota T 6g8q 2 B P YAP1_HUMAN YES-ASSOCIATED PROTEIN 1,PROTEIN YORKIE HOMOLOG,YES-ASSOCIATED PROTEIN YAP65 HOMOLOG XRAHSSPXSLX 11 T 0.00014 FAM181 unp F Eukaryota T 6g9q 5 E P DOPO_MOUSE DOPAMINE BETA-MONOOXYGENASE KAPYDYAPI 9 T 5.4 DUF1043 pdbhh F Eukaryota T 6gb7 5 M,N P,R TPSN_MOUSE GLY-GLY-LEU-SER EDAGGGGLSK 10 T 11 DUF5672 pdbhh F Eukaryota T 6gc3 2 B B SEN1_YEAST TRNA-SPLICING ENDONUCLEASE POSITIVE EFFECTOR DDDEDDYTPSIS 12 T 2.4 CM1 pdbhh F Eukaryota T 6gcs 30 DA g B5FVF3_YARLI NI9M SUBUNIT MINANPGFWNGPFRYLRWSAHNRPHLFFAFAIGIAGPVAALTLTPLRRKYLYPDHSPLPQSYP 63 T 0.054 DUF998 pdb F Eukaryota T 6gdj 1 A,B A,B YHP6_SCHPO Mto2 GGPAPLSTMQTALMRLRTYHPSPIILKPVEQAVNHAITLVNTSPSSVVDALCRSLAELCLGLVQEAIDASILSQQESSNSLDLVRHTP 88 T 7.4 SPDY pdbhh F Eukaryota T 6gf6 1 A,B A,B A0A140JXP0_CHICK Zona pellucida sperm-binding protein 1,Zona pellucida sperm-binding protein 1 DAAQPALLQYHYDCGDFGMQLLAYPTRGRTVHFKVLDEFGTRFEVANCSICMHWLNTGEDGGLIFSAGYEGCHVLVKDGRYVLRVQLEEMLLSGVVAASYEVQMTCPRPAGYEILRDEKVHHHHHHHHQRPDRGNS 136 T 0.2 Translat_reg pdbpercent F Eukaryota T 6gif 1 A A AAPA1_HELPY AapA1 MATKHGKNSWKTLYLKISFLGCKVVALLKR 30 T 0.81 protein_MS5 unphh F Bacteria T 6gig 1 A A AAPA1_HELPY AapA1 MATKHGKNSWKTLYLKISFLGCKVVVLLKR 30 T 0.81 protein_MS5 unphh F Bacteria T 6gij 1 A A TEMB_RANTE temporinB_KKG6A KKLLPIVANLLKSLL 15 T 3.9 Nup188_C pdbhh F Eukaryota T 6gik 1 A A temporinB_L1FK FLPIVGLLKSLLK 13 T 2.6 PSI_8 pdbhh F T 6gil 1 A A TEMB_RANTE Temporin-B LLPIVGNLLKSLL 13 T 1.8 DUF5665 pdbhh F Eukaryota T 6gje 2 B,C,D B,C,D CUBN_HUMAN 460 KDA RECEPTOR,INTESTINAL INTRINSIC FACTOR RECEPTOR,INTRINSIC FACTOR-COBALAMIN RECEPTOR,INTRINSIC FACTOR-VITAMIN B12 RECEPTOR GELELQRQKRSINLQQPRMATERGNLVFLTGSAQNIEFRTGSLGKIKLNDEDLSECLHQIQKNKEDIIELKGSAIGLPQNISSQIYQLNSKLVDLERKFQGLQQTVDKKV 110 T 0.0088 hEGF unppercent F Eukaryota T 6gkf 2 I,J,K,L,M,N,O,P I,J,K,L,M,N,O,P CASP2_HUMAN Caspase-2 YDLSLPFP 8 T 1.4 TUG-UBL1 pdbhh F Eukaryota T 6gkg 2 I,J,K,L,M,N I,J,K,L,M,N CASP2_HUMAN Caspase-2 VEHSLDNK 8 T 38 Lectin_leg-like pdbhh F Eukaryota T 6gmg 2 C,D C,D PAPI_STRMB SPI DIPIGXKMT 9 F F Bacteria T 6gml 14 N U NELFA_HUMAN NELF-A,WOLF-HIRSCHHORN SYNDROME CANDIDATE 2 PROTEIN MASMRESDTGLWLHNKLGATDELWAPPSIASLLTAAVIDNIRLCFHGLSSAVKLKLLLGTLHLPRRTVDEMKGALMEIIQLASLDSDPWVLMVADILKSFPDTGSLNLELEEQNPNVQDILGELREKVGECEASAMLPLECQYLNKNALTTLAGPLTPPVKHFQLKRKPKSATLRAELLQKSTETAQQLKRSAGVPFHAKGRGLLRKMDTTTPLKGIPKQAPFRSPTAPSVFSPTGNRTPIPPSRTLLRKERGVKLLDISELDMVGAGREAKRRRKTLDAEVVEKPAKEETVVENATPDYAAGLVSTQKLGSLNNEPALPSTSYLPSTPSVVPASSYIPSSETPPAPSSREASRPPEEPSAPSPTLPAQFKQRAPMYNSGLSPATPTPAAPTSPLTPTTPPAVAPTTQTPPVAMVAPQTQAPAQQQPKKNLSLTREQMFAAQEMFKTANKVTRPEKALILGFMAGSRENPCQEQGDVIQIKLSEHTEDLPKADGQGSTTMLVDTVFEMNYATGQWTRFKKYKPMTNVS 528 T 0.008 PRCC unppercent F Eukaryota T 6go0 1 A A O32830_LACPN Plantaricin S beta protein KKKKQSWYAAAGDAIVSFGEGFLNAW 26 T 0.00052 LcnG-beta unppssm F Bacteria T 6gos 1 A A MCBA_ECOLX MCCB17 MGHHHHHHMELKASEFGVVLSVDALKLSRQSPLGVGIGGGGGGGGGXGGQGGXGXNXGGNGXGXGSHI 68 T 0.51 Dehydratase_MU pdbhh F Bacteria T 6gp7 2 C D PBP1A MSDQFNSREARRKANSK 17 T 5.2 Birna_RdRp pdbhh F T 6gpz 2 C E LmPBPA1 MADKPQTRSQYRNKQ 15 T 4.1 Pox_A12 pdbhh F T 6gqn 2 C,D C,G SpPBP2a TILRRSRSDRKKLA 14 T 6.4 DUF1408 pdbhh F T 6grh 1 A A MCBA_ECOLX MCCB17 MGHHHHHHMELKASEFGVVLSVDALKLSRQSPLGVGIGGGGGGGGGXGGQGG 52 T 0.86 FeoB_associated unphh F Bacteria T 6gs5 1 A A TEML_RANTE Temporin-L FVQWFSKFLGRIL 13 T 0.063 MOSC_N pdbhh F Eukaryota T 6gt7 1 A A Q54324_SULIS functional pRN1 primase TVVEFEELRKELVKRDSGKPVEKIKEEICTKSPPKLIKEIICENKTYADVNIDRSRGDWHVILYLMKHGVTDPDKILELLPRDSKAKENEKWNTQKYFVITLSKAWSVVKKYLEA 115 T 1E-08 pRN1_helical pdbpssm F Archaea T 6gvk 2 B B 230 KDA BULLOUS PEMPHIGOID ANTIGEN,230/240 KDA BULLOUS PEMPHIGOID ANTIGEN,BULLOUS PEMPHIGOID ANTIGEN 1,BULLOUS PEMPHIGOID ANTIGEN,DYSTONIA MUSCULORUM PROTEIN,HEMIDESMOSOMAL PLAQUE PROTEIN DSNENLLLVHCGPTLINSCISFGSESFDGH 30 T 13 DUF4556 pdbhh F T 6gvw 5 E,J E,J UIMC1_MOUSE RECEPTOR-ASSOCIATED PROTEIN 80,UBIQUITIN INTERACTION MOTIF-CONTAINING PROTEIN 1 GGGRHYYWGIPFCPAGVDPNQYTNVILCQLEVYQKSLKMAQRQLVKKRGFGEPVLPRPPFLIQN 64 T 6.2 PRTRC_E pdbhh F Eukaryota T 6gw7 1 A A A0A2A6XLY0_HELPX DNA protecting protein DprA MLKDYHLKEMPEMEDEFLEYCAKNPSYEEAYLKFGDKLLEYELLGKIKRINHIVVLAHH 59 T 1.5 DnaI_N pdbhh F Bacteria T 6gx9 2 C,D C,D CPSF6_HUMAN CLEAVAGE AND POLYADENYLATION SPECIFICITY FACTOR 68 KDA SUBUNIT,CPSF 68 KDA SUBUNIT,CLEAVAGE FACTOR IM COMPLEX 68 KDA SUBUNIT,CFIM68,PRE-MRNA CLEAVAGE FACTOR IM 68 KDA SUBUNIT,PROTEIN HPBRII-4/7 ESKSYGSGSRRERSRERDHSRSREKSRRHKSRSRDRHDDYYRERSRERERHRDRDRDRDRERDREREYRH 70 T 0.12 PRP38_assoc unppercent F Eukaryota T 6gxc 2 B B GLY-ASP-GLN-DAB-ALA-THR-PPN-GLY GDQXATXG 8 T 1.3 S-AdoMet_synt_M pdbhh F T 6gy2 2 C,D C,D BRCA2_HUMAN Phosphopeptide of BRCA2 WSSSLATPPTLSSTVLI 17 T 18 CoV_NSP4_C pdbhh F Eukaryota T 6gyp 2 B A CBF3C_YEAST CHROMOSOME TRANSMISSION FIDELITY PROTEIN 13,KINETOCHORE PROTEIN CTF13 MPSFNPVRFLELPIDIRKEVYFHLDGNFCGAHPYPIDILYKSNDVELPGKPSYKRSKRSKKLLRYMYPVFATYLNIFEYSPQLIEKWLEYAFWLRYDCLVLDCFKVNHLYDGTLIDALEWTYLDNELRLAYFNKASMLEVWYTFKEYKKWVIDSVAFDELDLLNVSNIQFNIDNLTPQLVDKCLSILEQKDLFATIGEVQFGQDEEVGEEKDVDVSGANSDENSSPSSTIKNKKRSASKRSHSDNGNVGATHNQLTSISVIRTIRSMESMKSLRKITVRGEKLYELLINFHGFRDNPGKTISYIVKRRINEIRLSRMNQISRTGLADFTRWDNLQKLVLSRVAYIDLNSIVFPKNFKSLTMKRVSKIKWWNIEENILKELKVDKRTFKSLYIKEDDSKFTKFFNLRHTRIKELDKSEINQITYLRCQAIVWLSFRTLNHIKLQNVSEVFNNIIVPRALFDSKRVEIYRCEKISQVLVI 478 T 0.088 Glft2_N pdbpercent F Eukaryota T 6gzj 2 B B MAG_MOUSE SIGLEC-4A SEKRLGSERRLLGLRGESPELDLSYSHSDLGKRPTKDSYTLTEELAEYAEIRVK 54 T 0.23 DAG1 unphh F Eukaryota T 6gzl 2 B B MAG_MOUSE PRO-THR-LYS-ASP-SER-TYR-THR-LEU-THR-GLU-GLU-LEU KRPTKDSYTLTEELAEY 17 T 15 IFNGR1 unphh F Eukaryota T 6h06 3 I,J,K,L I,G,J,K TAU_HUMAN NEUROFIBRILLARY TANGLE PROTEIN,PAIRED HELICAL FILAMENT-TAU,PHF-TAU SPRHLSNVSSTGSIDMVDSPQLATLA 26 T 25 BAGE pdbhh F Eukaryota T 6h0e 3 I,J,K,L I,G,J,K TAU_MOUSE NEUROFIBRILLARY TANGLE PROTEIN,PAIRED HELICAL FILAMENT-TAU,PHF-TAU RHLSNVSSTGSIDMVDSPQLATLA 24 T 24 BAGE pdbhh F Eukaryota T 6h0i 1 A A F4RME6_MELLP Secreted protein MCEFIEDSEDIQGLKSLRKSHTSLEDDDDGSRGGDCEGCSGTACSSDAQCRARGCDGCSTSGVCVLSSLHHHHHH 75 T 0.1 SPDY unppssm F Eukaryota T 6h1q 1 A,B A,B B4EYH7_PROMH Fimbrial adhesin IADPLVVTPPPMNFDGAADGTPAGTPITSTWIGETSVHNGFKCEKKFLQKCWVETLYANATGSKISGIYYYEGSNRYPVYSLPGVKGIGYAFGLKDNNDSVAYVPIDVDNGSGATVIYPAVGSTVNHNVDRVSLKGKVVFVVTDKHLETGVYNIPYTVIANTWSEYGGGHKGNNTSIVAINPVTITAHHHHHH 193 T 0.25 TMEM151 unppssm F Bacteria T 6h22 2 C,D C,D Stapled peptide XLTFAEYWAQLASX 14 T 0.11 PBP-Tp47_a pdbhh F T 6h41 2 B B VAL-ASP-GLU-CYS-TRP-ARG-ILE-ILE-ALA-SER-HIS-THR-TRP-PHE-CYS-ALA-GLU-GLU VDECWRIIASHTWFCAEE 18 T 2.6 DUF3750 pdbhh F T 6h48 1 A A A0A2S6DEV9_STAAU STL GPGKKREVTIEEIGEFHEKYLKLLFTNLETHNDRKKALAEIEKLKEESIYLGEKLRLVPNHHYDAIKGKPMYKLYLYEYPDRLEHQKKIILEKDTN 96 T 0.00083 LRRFIP pdb F Bacteria T 6h4k 1 A A UBP25_HUMAN DEUBIQUITINATING ENZYME 25,USP ON CHROMOSOME 21,UBIQUITIN THIOESTERASE 25,UBIQUITIN-SPECIFIC-PROCESSING PROTEASE 25 GPSHEHEDKSPETVLQSAIKLEYARLVKLAQEDTPPETDYRLHHVVVYFIQNQAPKKIIEKTLLEQFGDRNLSFDERCHNIMKVAQAKLEMIKPEEVNLEEYEEWHQDYRKFRETTMYLIIGLENFQRESYIDSLLFLICAYQNNKELLSKGLYRGHDEELISHYRRECLLKLNEQAAELFESGEDREVNNGLIIMNEFIVPFLPLLLVDEMEEKDILAVEDMRNRWCSYLGQEMEPHLQEKLTDFLPKLLDCSMEIKSFHEPPKLPSYSTHELCERFARIMLSLSRTPADGR 293 T 0.21 Imm30 pdb F Eukaryota T 6h6a 2 B,D,F E,H,K LCK_HUMAN GLY-CYS-GLY-CYS-SER-SER GCGCSSHPED 10 T 1.2 EPV_E5 pdbhh F Eukaryota T 6h7b 2 B,D B,D Q4Q7R3_LEIMA HIS-HIS-MET-ASN-PRO-ASN-ALA-THR-GLU-PHE-MET-PRO HHMNPNATEFMPGR 14 T 9.4E-05 PAM2 pdbhh F Eukaryota T 6h82 1 A,A10,A11,A12,A13,A14,A15,A16,A17,A18,A19,A2,A20,A21,A22,A23,A24,A25,A26,A27,A28,A29,A3,A30,A31,A32,A33,A34,A35,A36,A37,A38,A39,A4,A40,A41,A42,A43,A44,A45,A46,A47,A48,A49,A5,A50,A51,A52,A53,A54,A55,A56,A57,A58,A59,A6,A60,A7,A8,A9,C,C10,C11,C12,C13,C14,C15,C16,C17,C18,C19,C2,C20,C21,C22,C23,C24,C25,C26,C27,C28,C29,C3,C30,C31,C32,C33,C34,C35,C36,C37,C38,C39,C4,C40,C41,C42,C43,C44,C45,C46,C47,C48,C49,C5,C50,C51,C52,C53,C54,C55,C56,C57,C58,C59,C6,C60,C7,C8,C9,E,E10,E11,E12,E13,E14,E15,E16,E17,E18,E19,E2,E20,E21,E22,E23,E24,E25,E26,E27,E28,E29,E3,E30,E31,E32,E33,E34,E35,E36,E37,E38,E39,E4,E40,E41,E42,E43,E44,E45,E46,E47,E48,E49,E5,E50,E51,E52,E53,E54,E55,E56,E57,E58,E59,E6,E60,E7,E8,E9,G,G10,G11,G12,G13,G14,G15,G16,G17,G18,G19,G2,G20,G21,G22,G23,G24,G25,G26,G27,G28,G29,G3,G30,G31,G32,G33,G34,G35,G36,G37,G38,G39,G4,G40,G41,G42,G43,G44,G45,G46,G47,G48,G49,G5,G50,G51,G52,G53,G54,G55,G56,G57,G58,G59,G6,G60,G7,G8,G9,I,I10,I11,I12,I13,I14,I15,I16,I17,I18,I19,I2,I20,I21,I22,I23,I24,I25,I26,I27,I28,I29,I3,I30,I31,I32,I33,I34,I35,I36,I37,I38,I39,I4,I40,I41,I42,I43,I44,I45,I46,I47,I48,I49,I5,I50,I51,I52,I53,I54,I55,I56,I57,I58,I59,I6,I60,I7,I8,I9,K,K10,K11,K12,K13,K14,K15,K16,K17,K18,K19,K2,K20,K21,K22,K23,K24,K25,K26,K27,K28,K29,K3,K30,K31,K32,K33,K34,K35,K36,K37,K38,K39,K4,K40,K41,K42,K43,K44,K45,K46,K47,K48,K49,K5,K50,K51,K52,K53,K54,K55,K56,K57,K58,K59,K6,K60,K7,K8,K9,M,M10,M11,M12,M13,M14,M15,M16,M17,M18,M19,M2,M20,M21,M22,M23,M24,M25,M26,M27,M28,M29,M3,M30,M31,M32,M33,M34,M35,M36,M37,M38,M39,M4,M40,M41,M42,M43,M44,M45,M46,M47,M48,M49,M5,M50,M51,M52,M53,M54,M55,M56,M57,M58,M59,M6,M60,M7,M8,M9,O,O10,O11,O12,O13,O14,O15,O16,O17,O18,O19,O2,O20,O21,O22,O23,O24,O25,O26,O27,O28,O29,O3,O30,O31,O32,O33,O34,O35,O36,O37,O38,O39,O4,O40,O41,O42,O43,O44,O45,O46,O47,O48,O49,O5,O50,O51,O52,O53,O54,O55,O56,O57,O58,O59,O6,O60,O7,O8,O9,Q,Q10,Q11,Q12,Q13,Q14,Q15,Q16,Q17,Q18,Q19,Q2,Q20,Q21,Q22,Q23,Q24,Q25,Q26,Q27,Q28,Q29,Q3,Q30,Q31,Q32,Q33,Q34,Q35,Q36,Q37,Q38,Q39,Q4,Q40,Q41,Q42,Q43,Q44,Q45,Q46,Q47,Q48,Q49,Q5,Q50,Q51,Q52,Q53,Q54,Q55,Q56,Q57,Q58,Q59,Q6,Q60,Q7,Q8,Q9,S,S10,S11,S12,S13,S14,S15,S16,S17,S18,S19,S2,S20,S21,S22,S23,S24,S25,S26,S27,S28,S29,S3,S30,S31,S32,S33,S34,S35,S36,S37,S38,S39,S4,S40,S41,S42,S43,S44,S45,S46,S47,S48,S49,S5,S50,S51,S52,S53,S54,S55,S56,S57,S58,S59,S6,S60,S7,S8,S9,U,U10,U11,U12,U13,U14,U15,U16,U17,U18,U19,U2,U20,U21,U22,U23,U24,U25,U26,U27,U28,U29,U3,U30,U31,U32,U33,U34,U35,U36,U37,U38,U39,U4,U40,U41,U42,U43,U44,U45,U46,U47,U48,U49,U5,U50,U51,U52,U53,U54,U55,U56,U57,U58,U59,U6,U60,U7,U8,U9,Y,Y10,Y11,Y12,Y13,Y14,Y15,Y16,Y17,Y18,Y19,Y2,Y20,Y21,Y22,Y23,Y24,Y25,Y26,Y27,Y28,Y29,Y3,Y30,Y31,Y32,Y33,Y34,Y35,Y36,Y37,Y38,Y39,Y4,Y40,Y41,Y42,Y43,Y44,Y45,Y46,Y47,Y48,Y49,Y5,Y50,Y51,Y52,Y53,Y54,Y55,Y56,Y57,Y58,Y59,Y6,Y60,Y7,Y8,Y9 A,A,A,A,A,A,A,A,A,A,A,A,A,A,A,A,A,A,A,A,A,A,A,A,A,A,A,A,A,A,A,A,A,A,A,A,A,A,A,A,A,A,A,A,A,A,A,A,A,A,A,A,A,A,A,A,A,A,A,A,C,C,C,C,C,C,C,C,C,C,C,C,C,C,C,C,C,C,C,C,C,C,C,C,C,C,C,C,C,C,C,C,C,C,C,C,C,C,C,C,C,C,C,C,C,C,C,C,C,C,C,C,C,C,C,C,C,C,C,C,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,Q,Q,Q,Q,Q,Q,Q,Q,Q,Q,Q,Q,Q,Q,Q,Q,Q,Q,Q,Q,Q,Q,Q,Q,Q,Q,Q,Q,Q,Q,Q,Q,Q,Q,Q,Q,Q,Q,Q,Q,Q,Q,Q,Q,Q,Q,Q,Q,Q,Q,Q,Q,Q,Q,Q,Q,Q,Q,Q,Q,P,P,P,P,P,P,P,P,P,P,P,P,P,P,P,P,P,P,P,P,P,P,P,P,P,P,P,P,P,P,P,P,P,P,P,P,P,P,P,P,P,P,P,P,P,P,P,P,P,P,P,P,P,P,P,P,P,P,P,P,O,O,O,O,O,O,O,O,O,O,O,O,O,O,O,O,O,O,O,O,O,O,O,O,O,O,O,O,O,O,O,O,O,O,O,O,O,O,O,O,O,O,O,O,O,O,O,O,O,O,O,O,O,O,O,O,O,O,O,O,U,U,U,U,U,U,U,U,U,U,U,U,U,U,U,U,U,U,U,U,U,U,U,U,U,U,U,U,U,U,U,U,U,U,U,U,U,U,U,U,U,U,U,U,U,U,U,U,U,U,U,U,U,U,U,U,U,U,U,U,Z,Z,Z,Z,Z,Z,Z,Z,Z,Z,Z,Z,Z,Z,Z,Z,Z,Z,Z,Z,Z,Z,Z,Z,Z,Z,Z,Z,Z,Z,Z,Z,Z,Z,Z,Z,Z,Z,Z,Z,Z,Z,Z,Z,Z,Z,Z,Z,Z,Z,Z,Z,Z,Z,Z,Z,Z,Z,Z,Z,V,V,V,V,V,V,V,V,V,V,V,V,V,V,V,V,V,V,V,V,V,V,V,V,V,V,V,V,V,V,V,V,V,V,V,V,V,V,V,V,V,V,V,V,V,V,V,V,V,V,V,V,V,V,V,V,V,V,V,V,H,H,H,H,H,H,H,H,H,H,H,H,H,H,H,H,H,H,H,H,H,H,H,H,H,H,H,H,H,H,H,H,H,H,H,H,H,H,H,H,H,H,H,H,H,H,H,H,H,H,H,H,H,H,H,H,H,H,H,H,G,G,G,G,G,G,G,G,G,G,G,G,G,G,G,G,G,G,G,G,G,G,G,G,G,G,G,G,G,G,G,G,G,G,G,G,G,G,G,G,G,G,G,G,G,G,G,G,G,G,G,G,G,G,G,G,G,G,G,G,X,X,X,X,X,X,X,X,X,X,X,X,X,X,X,X,X,X,X,X,X,X,X,X,X,X,X,X,X,X,X,X,X,X,X,X,X,X,X,X,X,X,X,X,X,X,X,X,X,X,X,X,X,X,X,X,X,X,X,X H9AZX2_9VIRU VP4 QTQEYTINHTGGVLGDSYVTTASNQTSPQRETAVLSFECPRKFEEINYVGQRDATRFVPRTTESITGSANDDTVVDLTANIQPVAGEEVIAEQDYPVAVAYNVTQGVEVDVVDADYAADTVTLGTNPADGDEVKVWPIMSDGDVQFRLINQFGQEEGRVYPWSTPLYRWHDFPQLKRGREINLHGSASWSENETLEILLDAPQALTWEDSDYPRGQYVTTLEQDVEITL 229 T 6.3 DUF1344 unphh T Viruses T 6h82 2 B,B10,B11,B12,B13,B14,B15,B16,B17,B18,B19,B2,B20,B21,B22,B23,B24,B25,B26,B27,B28,B29,B3,B30,B31,B32,B33,B34,B35,B36,B37,B38,B39,B4,B40,B41,B42,B43,B44,B45,B46,B47,B48,B49,B5,B50,B51,B52,B53,B54,B55,B56,B57,B58,B59,B6,B60,B7,B8,B9,H,H10,H11,H12,H13,H14,H15,H16,H17,H18,H19,H2,H20,H21,H22,H23,H24,H25,H26,H27,H28,H29,H3,H30,H31,H32,H33,H34,H35,H36,H37,H38,H39,H4,H40,H41,H42,H43,H44,H45,H46,H47,H48,H49,H5,H50,H51,H52,H53,H54,H55,H56,H57,H58,H59,H6,H60,H7,H8,H9,N,N10,N11,N12,N13,N14,N15,N16,N17,N18,N19,N2,N20,N21,N22,N23,N24,N25,N26,N27,N28,N29,N3,N30,N31,N32,N33,N34,N35,N36,N37,N38,N39,N4,N40,N41,N42,N43,N44,N45,N46,N47,N48,N49,N5,N50,N51,N52,N53,N54,N55,N56,N57,N58,N59,N6,N60,N7,N8,N9,R,R10,R11,R12,R13,R14,R15,R16,R17,R18,R19,R2,R20,R21,R22,R23,R24,R25,R26,R27,R28,R29,R3,R30,R31,R32,R33,R34,R35,R36,R37,R38,R39,R4,R40,R41,R42,R43,R44,R45,R46,R47,R48,R49,R5,R50,R51,R52,R53,R54,R55,R56,R57,R58,R59,R6,R60,R7,R8,R9 D,D,D,D,D,D,D,D,D,D,D,D,D,D,D,D,D,D,D,D,D,D,D,D,D,D,D,D,D,D,D,D,D,D,D,D,D,D,D,D,D,D,D,D,D,D,D,D,D,D,D,D,D,D,D,D,D,D,D,D,T,T,T,T,T,T,T,T,T,T,T,T,T,T,T,T,T,T,T,T,T,T,T,T,T,T,T,T,T,T,T,T,T,T,T,T,T,T,T,T,T,T,T,T,T,T,T,T,T,T,T,T,T,T,T,T,T,T,T,T,J,J,J,J,J,J,J,J,J,J,J,J,J,J,J,J,J,J,J,J,J,J,J,J,J,J,J,J,J,J,J,J,J,J,J,J,J,J,J,J,J,J,J,J,J,J,J,J,J,J,J,J,J,J,J,J,J,J,J,J,W,W,W,W,W,W,W,W,W,W,W,W,W,W,W,W,W,W,W,W,W,W,W,W,W,W,W,W,W,W,W,W,W,W,W,W,W,W,W,W,W,W,W,W,W,W,W,W,W,W,W,W,W,W,W,W,W,W,W,W H9AZX1_9VIRU VP7 PEIGNNGAEKQISLHKGQPFIDTQDVGAADPNTPAVTIEGPSDYVIAIDAGTPVAPEFRDANGDKLDPSTRVTIQKCDKQGNPLGDGIVFSDTLGRFEYSKMRSDPDYMRKTTTSLMIDEREIVKIFVEVPPNANGMDADNSRITIGDDTSDYGKAVGIVEHGDLSPAESKA 172 T 0.82 aRib unppercent T Viruses T 6h82 3 D,D10,D11,D12,D13,D14,D15,D16,D17,D18,D19,D2,D20,D21,D22,D23,D24,D25,D26,D27,D28,D29,D3,D30,D31,D32,D33,D34,D35,D36,D37,D38,D39,D4,D40,D41,D42,D43,D44,D45,D46,D47,D48,D49,D5,D50,D51,D52,D53,D54,D55,D56,D57,D58,D59,D6,D60,D7,D8,D9,F,F10,F11,F12,F13,F14,F15,F16,F17,F18,F19,F2,F20,F21,F22,F23,F24,F25,F26,F27,F28,F29,F3,F30,F31,F32,F33,F34,F35,F36,F37,F38,F39,F4,F40,F41,F42,F43,F44,F45,F46,F47,F48,F49,F5,F50,F51,F52,F53,F54,F55,F56,F57,F58,F59,F6,F60,F7,F8,F9,J,J10,J11,J12,J13,J14,J15,J16,J17,J18,J19,J2,J20,J21,J22,J23,J24,J25,J26,J27,J28,J29,J3,J30,J31,J32,J33,J34,J35,J36,J37,J38,J39,J4,J40,J41,J42,J43,J44,J45,J46,J47,J48,J49,J5,J50,J51,J52,J53,J54,J55,J56,J57,J58,J59,J6,J60,J7,J8,J9,L,L10,L11,L12,L13,L14,L15,L16,L17,L18,L19,L2,L20,L21,L22,L23,L24,L25,L26,L27,L28,L29,L3,L30,L31,L32,L33,L34,L35,L36,L37,L38,L39,L4,L40,L41,L42,L43,L44,L45,L46,L47,L48,L49,L5,L50,L51,L52,L53,L54,L55,L56,L57,L58,L59,L6,L60,L7,L8,L9,P,P10,P11,P12,P13,P14,P15,P16,P17,P18,P19,P2,P20,P21,P22,P23,P24,P25,P26,P27,P28,P29,P3,P30,P31,P32,P33,P34,P35,P36,P37,P38,P39,P4,P40,P41,P42,P43,P44,P45,P46,P47,P48,P49,P5,P50,P51,P52,P53,P54,P55,P56,P57,P58,P59,P6,P60,P7,P8,P9,T,T10,T11,T12,T13,T14,T15,T16,T17,T18,T19,T2,T20,T21,T22,T23,T24,T25,T26,T27,T28,T29,T3,T30,T31,T32,T33,T34,T35,T36,T37,T38,T39,T4,T40,T41,T42,T43,T44,T45,T46,T47,T48,T49,T5,T50,T51,T52,T53,T54,T55,T56,T57,T58,T59,T6,T60,T7,T8,T9,V,V10,V11,V12,V13,V14,V15,V16,V17,V18,V19,V2,V20,V21,V22,V23,V24,V25,V26,V27,V28,V29,V3,V30,V31,V32,V33,V34,V35,V36,V37,V38,V39,V4,V40,V41,V42,V43,V44,V45,V46,V47,V48,V49,V5,V50,V51,V52,V53,V54,V55,V56,V57,V58,V59,V6,V60,V7,V8,V9,Z,Z10,Z11,Z12,Z13,Z14,Z15,Z16,Z17,Z18,Z19,Z2,Z20,Z21,Z22,Z23,Z24,Z25,Z26,Z27,Z28,Z29,Z3,Z30,Z31,Z32,Z33,Z34,Z35,Z36,Z37,Z38,Z39,Z4,Z40,Z41,Z42,Z43,Z44,Z45,Z46,Z47,Z48,Z49,Z5,Z50,Z51,Z52,Z53,Z54,Z55,Z56,Z57,Z58,Z59,Z6,Z60,Z7,Z8,Z9 E,E,E,E,E,E,E,E,E,E,E,E,E,E,E,E,E,E,E,E,E,E,E,E,E,E,E,E,E,E,E,E,E,E,E,E,E,E,E,E,E,E,E,E,E,E,E,E,E,E,E,E,E,E,E,E,E,E,E,E,F,F,F,F,F,F,F,F,F,F,F,F,F,F,F,F,F,F,F,F,F,F,F,F,F,F,F,F,F,F,F,F,F,F,F,F,F,F,F,F,F,F,F,F,F,F,F,F,F,F,F,F,F,F,F,F,F,F,F,F,R,R,R,R,R,R,R,R,R,R,R,R,R,R,R,R,R,R,R,R,R,R,R,R,R,R,R,R,R,R,R,R,R,R,R,R,R,R,R,R,R,R,R,R,R,R,R,R,R,R,R,R,R,R,R,R,R,R,R,R,S,S,S,S,S,S,S,S,S,S,S,S,S,S,S,S,S,S,S,S,S,S,S,S,S,S,S,S,S,S,S,S,S,S,S,S,S,S,S,S,S,S,S,S,S,S,S,S,S,S,S,S,S,S,S,S,S,S,S,S,K,K,K,K,K,K,K,K,K,K,K,K,K,K,K,K,K,K,K,K,K,K,K,K,K,K,K,K,K,K,K,K,K,K,K,K,K,K,K,K,K,K,K,K,K,K,K,K,K,K,K,K,K,K,K,K,K,K,K,K,M,M,M,M,M,M,M,M,M,M,M,M,M,M,M,M,M,M,M,M,M,M,M,M,M,M,M,M,M,M,M,M,M,M,M,M,M,M,M,M,M,M,M,M,M,M,M,M,M,M,M,M,M,M,M,M,M,M,M,M,N,N,N,N,N,N,N,N,N,N,N,N,N,N,N,N,N,N,N,N,N,N,N,N,N,N,N,N,N,N,N,N,N,N,N,N,N,N,N,N,N,N,N,N,N,N,N,N,N,N,N,N,N,N,N,N,N,N,N,N,a,a,a,a,a,a,a,a,a,a,a,a,a,a,a,a,a,a,a,a,a,a,a,a,a,a,a,a,a,a,a,a,a,a,a,a,a,a,a,a,a,a,a,a,a,a,a,a,a,a,a,a,a,a,a,a,a,a,a,a H9AZX1_9VIRU VP7 PEIGNNGAEKQISLHKGQPFIDTQDVGAADPNTPAVTIEGPSDYVIAIDAGTPVAPEFRDANGDKLDPSTRVTIQKCDKQGNPLGDGIVFSDTLGRFEYSKMRSDPDYMRKTTTSLMIDEREIVKIFVEVPPNANGMDADNSRITIGDDTSDYGKAVGIVEHGDLSPAESKAVRQ 175 T 0.82 aRib unppercent T Viruses T 6h82 4 AA,AA2,AA3,AA4,AA5,AA6,AA7,AA8,AA9,W,W10,W11,W12,W13,W14,W15,W16,W17,W18,W19,W2,W20,W21,W22,W23,W24,W25,W26,W27,W28,W29,W3,W30,W31,W32,W33,W34,W35,W36,W37,W38,W39,W4,W40,W41,W42,W43,W44,W45,W46,W47,W48,W49,W5,W50,W51,W52,W53,W54,W55,W56,W57,W58,W59,W6,W60,W7,W8,W9,X,X10,X11,X12,X13,X14,X15,X16,X17,X18,X19,X2,X20,X21,X22,X23,X24,X25,X26,X27,X28,X29,X3,X30,X31,X32,X33,X34,X35,X36,X37,X38,X39,X4,X40,X41,X42,X43,X44,X45,X46,X47,X48,X49,X5,X50,X51,X52,X53,X54,X55,X56,X57,X58,X59,X6,X60,X7,X8,X9 Y,Y,Y,Y,Y,Y,Y,Y,Y,L,L,L,L,L,L,L,L,L,L,L,L,L,L,L,L,L,L,L,L,L,L,L,L,L,L,L,L,L,L,L,L,L,L,L,L,L,L,L,L,L,L,L,L,L,L,L,L,L,L,L,L,L,L,L,L,L,L,L,L,I,I,I,I,I,I,I,I,I,I,I,I,I,I,I,I,I,I,I,I,I,I,I,I,I,I,I,I,I,I,I,I,I,I,I,I,I,I,I,I,I,I,I,I,I,I,I,I,I,I,I,I,I,I,I,I,I,I,I,I H9AZX1_9VIRU VP7 IGNNGAEKQISLHKGQPFIDTQDVGAADPNTPAVTIEGPSDYVIAIDAGTPVAPEFRDANGDKLDPSTRVTIQKCDKQGNPLGDGIVFSDTLGRFEYSKMRSDPDYMRKTTTSLMIDEREIVKIFVEVPPNANGMDADNSRITIGDDTSDYGKAVGIVEHG 161 T 0.82 aRib unppercent T Viruses T 6h82 5 BA b H9AZX8_9VIRU Uncharacterized protein QTADGRVGLVPVNSYVTLETDDLDTDEHPVTDAGTVALEPGESAPIVRYDLGQPAAVYAVGATDEANVEYELKVNNSKTVGGRTNSPLGVLNTPFSFVEKLGGAIPCETAATYWAHYSSDATGTVELAGRMHIEV 135 T 0.035 T4BSS_DotH_IcmK pdb T Viruses T 6h8c 2 B B UBA5_HUMAN UBIQUITIN-ACTIVATING ENZYME 5,THIFP1,UFM1-ACTIVATING ENZYME,UBIQUITIN-ACTIVATING ENZYME E1 DOMAIN-CONTAINING PROTEIN 1 GAMEIIHEDNEWGIELVSE 19 T 5.7 DUF2172 pdbhh F Eukaryota T 6h8p 2 C,D C,D H14_HUMAN HISTONE H1B,HISTONE H1S-4 TPVKKKARKSAGAAK 15 T 0.2 DUF5797 unp F Eukaryota T 6h8q 2 C,D G,H SCC1_YEAST Sister chromatid cohesion protein 1 TDAMTESQPKQTGTRRNSKLLNTKSIQIDEETENSESIASSNTYKEERSNNLLTPQPTNFTTKRLWSEITESMSYLPDPILKNFLSYESLKKRKIHNGRE 100 T 0.12 ABC2_membrane_7 pdb F Eukaryota T 6h9c 1 A,AA,BA,CA,DA,EA,FA,S,T,U,V,W,X,Y,Z D,M,N,I,L,a,Y,F,E,R,T,S,W,K,J A0A1C7A3R1_9VIRU VP7 MGNIGNLSAEKQISLYDGQPFISEQDVAAGDPNTPALTIEGPDGYVIAVDAGTPIAPEFRDSNGEKLDPSTRVIVQKCDRQGNPLGDGIIFNDTLGRFNYNKMRTDPDYMRKTAKSLMVDEREIVKVFVDVPDGANGYDAERSRFTLGDDTSDFGKAVEIVDHDDLTEGETQAVKSASQRSGGA 184 T 0.27 aRib pdbpercent T Viruses T 6h9c 2 B b A0A1C7A3R7_9VIRU VP9 MRDNQDLLVKRLGRLVNVLESKEFGGTTTVDKDLDVTKNVTRTDEPNEDNTPDYFSTGKDRVLVPDTEEWERLGFGIVAKTVNVRTTDDVLLAFANPNTNGPTFKIRSNESPFTIGGDAGIDTAFMWLKKAESAQNDPAVEIIAYR 146 T 0.058 Aft1_OSA unppercent T Viruses T 6h9c 7 G,H,I,J,K,L,M,N,O,P,Q,R Z,A,B,C,P,Q,O,V,U,H,G,X A0A1C7A3R2_9VIRU VP4 MADQTQEYTLSHTGGLLGSSKVTTASNQTAPQRETAIISFEVPRKFSEIEYVGQRDATRFVPRTTEEITGTANDDTVVQLQANIQPIAGEEDMADQDYPVVVAYNVTQGAQVEIADVNYATDEVTLATDPADGDTVKLWPIMGDGEVQFRLVNQFGQEEGRVYPWATPLYRWHDFPQLKRGREINLHGSVTWQENETVEVLLDAPQAITWEDADYPEGQYVSTFEQDVEITL 232 T 6.3 DUF1344 pdbhh T Viruses T 6h9h 1 A,B B,A Q5NWP0_AROAE Csf5 MGQQHLLRFALPAGKKLWPNDLREALAKHDLPPLFFSRDPQTGHAITRAMRNEKRVRGYIEQHGHEPPPPTEEQRANPLAIPGIRIVGSSTWVGILATGERYKPLLEAATLPAIQIVTQRCGRGVGVELEQHTLSIKGLDDPKRYFVRNLVMKRGLTKTAENTTQVASRILSALERQAVAYSLDLPPTAQVDIHVESVVRPRGMRLVTSTGATEQFVGLADVEFYACLDLKGYWFAGNLTSRGYGRIIADHPAMSTGRYAHHHHHH 266 T 0.0038 Cas6b_C unphh F Bacteria T 6hc0 1 A A Q6MPU8_BDEBA DgcB N-terminus KTSIVAS 7 T 7.3 Clink pdbhh F Bacteria T 6hc1 2 C C Q6MPU8_BDEBA DgcB N-terminus, phosphorylated LEKTSIVASDTX 12 T 5.2 Zea_mays_MuDR pdbhh F Bacteria T 6hc2 2 B,D,F,H,J,L,N,P,R,T,V,X B,D,F,H,J,L,N,P,R,T,V,X NUMA1_HUMAN NUCLEAR MATRIX PROTEIN-22,NMP-22,NUCLEAR MITOTIC APPARATUS PROTEIN,NUMA PROTEIN,SP-H ANTIGEN GPLGSPDYGNSALLSLPGYRPTTRSSARRSQAGVSSGAPPGRNSFYMGTCQDEPEQLDDWNRIAELQQRNR 71 T 0.072 zf-FPG_IleRS pdbpssm F Eukaryota T 6hem 1 A A UBP25_HUMAN DEUBIQUITINATING ENZYME 25,USP ON CHROMOSOME 21,UBIQUITIN THIOESTERASE 25,UBIQUITIN-SPECIFIC-PROCESSING PROTEASE 25 GPDFSKHLKEETIQIITKASHEHEDKSPETVLQSAIKLEYARLVKLAQEDTPPETDYRLHHVVVYFIQNQAPKKIIEKTLLEQFGDRNLSFDERCHNIMKVAQAKLEMIKPEEVNLEEYEEWHQDYRKFRETTMYLIIGLENFQRESYIDSLLFLICAYQNNKELLSKGLYRGHDEELISHYRRECLLKLNEQAAELFESGEDREVNNGLIIMNEFIVPFLPLLLVDEMEEKDILAVEDMRNRWCSYLGQEMEPHLQEKLTDFLPKLLDCSMEIKSFHEPPKLPSYSTHELCERFARIMLSLS 303 T 0.22 Imm30 pdb F Eukaryota T 6hfa 2 C,D C,D LM266, 1-[(2~{S})-2-azanyl-3-methyl-butyl]urea TSFAEYWXXXX 11 T 3.9 PBP-Tp47_a pdbhh F T 6hiv 1 A DA Q57UJ2_TRYB2 ms48 MMRRVASTGSNGGHCYVGSSLWADSVYMEVSSITLQKRFSFKYATKLQHDEMRQPYYIHEKRYGIFSNERNIAKARRGLPFITPLYTKHMNLWDTDTDASNNRFFRGYYYGQRELHQLLGRPHSLEATNADGSNDLSTYEANTSQLYKGIPRPAITNLHYEPAWRYTLYQAGAHGAQLSNPRSPFTAKVLGDELMQVRDIKSVEHCKAWFDRLQYLINLHYEAVGDIGEFKSRHTRHVHEFFVAFHDALSSLDFRDTYLFDQFKAVRPPELSDLFGIFLEMEANYVHEDYCPRCSLPYSTTRYCGEGDVDTPFRKHRGRWAPHQKWGREWYDVVARRAEALWYRATEDPYFGTPQHTQRQAEALLKVYVQAKQRGKAIDFMNKLRGSVEYLTGDITITTVMQESYDALLDTTPHPHLLTNGFTLESDAAKYTGEVNKAPLSPLQFRIDMEMNKYRRQQKEEGVVRVPPALWRLDTSAIVPYKVDSKTRRIVNWREVKEGIEKSFLSVGLPKEAYTGNEWREMLYLHDVIANREAKAAVLEQQQKLDREKLKARERASLKGEKSSNSASNGIGIIFPDKDGYQVFFVDEASLRPFGIGASGTLFKAVARVYPSPAAVPYDDPVHGKQFLIDTTDETCHLFGGFEHGDRLLLRAKSSGGDVDGGNNSNCRNTGSFTDEEEEVIVMGVSTEGSDAERMLCAMHVDTGKQRERGIVFLGTDCIDIRERWESVRYAASAYVKGRVTLLESQRTAREEFMGQVVGIRDGVLFVQWRLLKGGGSVLDRSVAVPIGDSTQVRELFVETKVLGVEPLQSPPSWRTPFRNDFAEERLKELQRAPFKREKYVSLIQGKYTPKVKKFGYTQHTTVDDFETKEYKDRLLSKQFFQNPQAFEVVPDRHERSVQFGGKWEYQRTHGLPTVDRNELENGWSEVEPISDGEMKVIEQALRDISGPRPGNFVKPPSKTKSLQLSESWWEPLGYGWEQHNEEQKALCDVTEQRLIDGSNLPFGGKLPPFGTSYGMGERMRAIIEDYSKGFGLGPHGHSPTHDTIHYNTLNAEGERVRDLGYTDALGRLFSEKLGDRDVHQWAVESCADGEADVRQLLLSLHEWRERGRPPSLLLANVLSKYLEEEIAAFNKGIPENAPKLQLETSDGTLAHSGSGGSRSGTMWADVDPTTFALHQASQSTRSTRCDEPFILHLVKRAKLGECVTNFTDTAYIAHLESSVINEFHLALTKLVGKGISPTLLAQKTGQLHRGSVRVGGNVVPFVKSRELSRLLERMGLTSESIAVITRGLANCPEQESVGDDFAVPVSLILSWDGPGSSGGSGSGDRRLNTTAARNEQQRKGSAALSSAIRQLSQHQKQRGSASGRGWQNEDKMVVHVLEELSLRNDGLVMDIQYAIRENKKNRRLRWEFVTTLLPVFGGNEAKVEQLYSDYCDGKYVPNITVAVEAFIAFLHNATQHPETYSASDYFDIDDGKSSTASGTGDQPSAGQYTTLKLLDPLEGPFVFDNVKVEYIQTVERFRRHGIRAGPVMAPATGFIAANCKSLNYFTRRQEEVVYVTNDSDQGLRRSLENSAYHKTIAANPALQYLLKARRGAALVETFNRFFYRTMPMLNFYQNVLKHYSETIQPLRQAAKSSTRGLARALESERSAAMEEFRRNSERYWRNIIEGRSVEQVVLANGESNRRTVGGGAGEQLQKPQGNRDAVRTNFTGEATDGSRKGQRQHQQPPLSQQQQHSRGEGERSAVHTSPRQGGSRGMADLLSKLNTPKTGSGVKGSTAKNTGNNRRGPKS 1788 T 12 DUF5053 pdbhh F Eukaryota T 6hiv 2 B DD Q385L8_TRYB2 ms51 MLRCTVVGHHRSGKARAFVFRDPSLRMMRAGSGYQQLRRMGMPMQVGMGWRKVDSFHANTQYQHAWPLLSHDDLGNSDQSNNTKNIMYSMYMPKRNKGTAPWFRGADTYSVKYCEQGRYEYQRYLMINRFPSEYKKHFLSFLSNIRMSSGSATIPQEALHWLLRMIVDNFNPQHVHYIAAMKTLQSAGELDMARDVWKIMERQQTWPCTATICAYLDVCVEAGEKTWAMEAWNRYCTELKFLEPGEVDPKPISRVPFSLTREELLYLPKWKKHFDHDPNLDVMDLNRFNRTREVYLRMAQVMLAGGERNAFQHFFTKLEEAMLNKPTPVPEPPNPHLVRRPRWAPYEHCKSVHHSPWRLQNNGRALALGPPVTIEDEMQSRFFSNDQFLVHSVKEVLRIVLQEHKRAHPTECTRCKTEAFFYKTKDADETLKFCDDLIERLFASLGVRLSNLNTSSLLSTILEVFRVVGKESGAALLQRANEFLERKASLGDAEGSRENLTASNYLQVLSGFADESAFVYNTKKDGTCQYKTGFDPRTTMRHLADVVQEIAGNPHVTWAADMHLQVVETMVGCGTMKANDYFVRNVLRQFSWDSRFLEALYVEYRRQDDVDMWAELTKRALVWTARYNAPASERLRRLIEDDYDTIRVQTRTFRELAVFQFRDVEERRHSRDVVNELPNPWYDYVAHALPFPDRDAGYPDEYGDLGQWRAPGGPGSPVRGPGYYAPPMEGEHQRGYTAEWRDLRNPMRPPEFPTPWERKYRQYARGQHPSYDMVYAGPMPEIFPMRRDFRKPTRWDFHDIEKQGKYRTSGPY 812 T 0.001 PPR_long pdbhh F Eukaryota T 6hiv 3 C DI Q587C2_TRYB2 ms56 MRKFCHFMANCWNSARSHATYGAVPLTHSQVTSVYATDGGKVDELGLLELVEERIFSWKLNKWEMRIPPNLPNDQKELIRQEQENLKQILSEWRKCFGALNADILQISSLTGVPKDVVREKNRTWLQEEVAKLRWMGEVNKAALLRDAFMRLEAFGSRDFMFMERLCCIYGLARQGTFDEAFTNYITEDPVTNDIFVDERNPFKELVAHIVRNYSQIDIIYDFLGFNYSEGYRSSLRRYMEYLQCKTAENVRASGRLVTGDKGEHNILFDYCVSRESLVSGDSCQGIIDFLYINGNDVTLIIIASDNPWLRNRQLPHRRQMEGIARRVCFVLGIPPSEVRIRNLLLPPTYLDKGSIVRLNDIVFRLSNEQSNLLIPWLTNYNKELDPKDVDYTALAKTTNEEEWLTL 407 T 0.2 ApoC-I pdbpercent F Eukaryota T 6hiv 4 D DL Q38BS2_TRYB2 ms59 MRCSCAFLDKSVFAAKRRVIVPIHPTPNFPAHFIKSAFTTDPLKEKQKARFSSGGEAMREVQDIPKNLEGERSRRDLASRGDTEFQALVEFIEGASYDQLISGRRFKKVYDVLSENDDMFIWLCHTAMSVLNPGDMRSRLIYNHLRILAESVASGEMTQRTAFRFFESAVRSPAYREIAKRQLEGGAATRLAGISAAADVMRRMGLTRRPMSSYFELYQRIVERSEAMTPWGFPPLFQFEERLSLEPRLKFFSRASQQSLERRRRGHVMTPHTTLHGRRIFWIPPTWNRAGRFLGPHVTLYPGMTPD 307 T 0.37 DUF2840 pdbpssm F Eukaryota T 6hiv 6 F DN Q38D60_TRYB2 ms61 MLCRTFLRQFRMSGGDMFVEYKVLSRDHRRSIRVEDAIVDPTFKRTVLPLGWLELLRSPSLRLPTGYFVEETVHVSLPNATSNGGKKEARPQKGGFASGSPSVGRNEANAIIAGPVVLYITGQSVPVVLNPYFVPEGTWDMRTRDGELDLRLGMDAIEQCTLFSELRPGGLLYGKLPENPNVRRNESLRATLGRYGMKCDLAESPLVPRPWTRMRYMFIDELQRGPKLTEFVGHNPRNGTPWRFSQNTKYFRIGIWRDTIRRNDMNEGLHAHSSWQKSPQQSVPEVRFLAPYP 293 T 5.9 AAA_11 pdbhh F Eukaryota T 6hiv 7 G DO Q383D1_TRYB2 ms62 MRRFCTARVSSYVKRAPLLLPRGGRRWRSGAGTTADGGGEKEYIADSFESTAGSNAYAAMELAAEARREIHELWLSAETALEREKRVQQVAALIEKYKLDPSTPREADVSRGLGDAFDRLLLLCLPLGKTDAKGTDNLERLMHLAGRNGRELSVRTIQHLFARTDSFAEALAVFYTMRRCHVAMNMEAYYSMLYSLQRLEEEGWGQHFRNEYEENGAPSEQAMDFIVKGISNALLPENKPWLGRVMFQDRNVPDRRYDTRDFDELDTAWTQRYKSGTPAGAH 282 T 0.084 ECSIT pdbhh F Eukaryota T 6hiv 8 H DP Q38F25_TRYB2 ms63 MLHGTPVRRASLRYRRPYWMMFLKGVDNWKIYTVIQQPDHQRTEMLYQAWLGGLDRPYTRPKCMANQPLWLSKKRHMLRKERLDGPETPLEKYVLEWHKKFHSFQGTERPTPDDLHTALDLVERPLDLSYALQLLGQCRNLNNIRFAKETFLVFLEACLRVGRRDCAEYALEHAEPLGFWFIDEDHRRYLQGEQTWYKLSPLDNLYYPVEENAKLNEGRKPITRLSPATESPGSGTAISDGEPSTDVEGETTVDDEIAQLEAELAALEREGGGK 274 T 0.19 MNE1 pdbpssm F Eukaryota T 6hiv 10 J DR C9ZPP1_TRYB9 ms65 MFSESLCILSRRFRYNTKFPALVSYNKLPWEVVNHETPQFHMHVAPHYEQLLTLAASSPVPHIIGSKHIDVPREHRLRLLPGMLYLLDGDTLPGEFTINRVLDPTALQYYGRLSSQIVTVEAVRMLVPDDLRLLCNCITFKGPLHLPVAPYASLASLRGASQGGTTGSETGSNCFTLYHFVRPNRPPKELQLEKYYIHAPCVAPLSEFASNSDERGNWRPRLQAPKRTRRATPLPAYRPPQSYLMGLAERLAVVPGGCFGRRSLMWGHWF 270 T 0.47 DUF3475 unp F Eukaryota T 6hiv 13 M DZ Q587C4_TRYB2 ms73 MLRRSILFRMKYADLELTTRGEFPHGMKEPAFVKKLDQNIPWYFSTYRSMYHWPITGDNWSDLNEAEKHHDLHMFYTLAWWKLGEGIFGVDEDS 94 T 9.7 Mastoparan_2 pdbhh F Eukaryota T 6hiv 14 N Da D0A3P2_TRYB9 ms74 MFSLTQTWLIAHWYCGHKFRHRFMRDKRFHPSLQASHDARNRFSKRRHFKTNRWNYQQAYRDMP 64 T 0.093 MLANA pdb F Eukaryota T 6hiv 15 O DB C9ZJE4_TRYB9 ms49 MIRRRVCDGARLAPNIRATFSAVRYQSGLHTFIRDSKPSNFSSVRRSENANGDATASPGATAGENPASSGDWASHMQRELFGEVDPLGGQAHKDYYRDVTRGYSPQYAPRNFANGGAVAYPHIQSPYEYEEAAHRRVWLDHDVDRMREEFTQHRASLRSLASAQEREELLRSRAAEYQVANTVHESESVHPIQQLYNSGGTSRSALKQQAVADRYSIAEQHSPLPLTTGVDRDALDEAQRTKDRILNDSFTAENLLITHGLREKEKHDFTILQRTVRIPFQGYDMDRFLAQQKGTPYGAQQLPPNVVPSSMEEAQRTLRGSSATATPLVDAVAQKVYARNTVVDRPAIGEQLTEQIINIMRASRTTAEQQREEERAQRFGLGRQGALVQDGGPDQRTLKKHTNDERIVDAMLFQQNAYRKTPTDEHWNPYIRRSTENGVGHLLQNKFDIMRREDRLSKGEQDLTERNTIHYGVPIQQIVDEFVFRHRNARGERPLDYFKPFPNFRALRLNRMYRDVEGFSLMKQRPEFLEWELFTRYRQHHQQRRRLALLHGLEPVANETAQERDTRRHRLDEICERTPFDEREMRVNDDEMRVSVETLRSWFGVYMLPSPTVVNAVLGGSASVNLHLYHLADEMGTADTREHVLSSRYLNRLLLLESYQNRVGRGFMNHVVGRAPEPVVPHEQPQEVLRHFSAEERAMYEQHVKEQTSRQLGEWERAMKRRRWLTDHQQYGHVVSHGLETSVVDLSHTETGAVLTVSTKAYEQEIEAVRMKTNATIKVDGMVYNLLPNSERRVVPLTVQLDSGEKIDMTSEDFDRCELEAFPRNLNHALNYGIANYAYNRGNYVETQDSIWEEQTASGQEGWSPATHADGLREGLPVRARRPIFSSSAEQRIAGGPQRAVIIQYHHQPFFNPEPRLVKVAFQCDGTIMEVPISDVMIWQRRYHGPERTVGDESRRYNPAAMRRYVDVTDPFNEKTSNTEHFLDKYEPKRNADTVADKYRTTKQITEIDKWTRYDSARADNYRPLSISHRRDYIRMGYIPRYTPWEWIAIQEADQPLIAEQIRQDNIGTSYFFSLNRYWRYKASPHGYIRHFENEVRDLLQYVDGVTPWKQAQKIRTYWEVRSHHPMPQFNRPEVAMHRNTVGLLPAHMWETDKKTGKVKSVKDSVRDYQTKTPYPKWVQL 1181 T 0.098 ACD unp F Eukaryota T 6hiv 16 P DC C9ZSK8_TRYB9 ms50 MFRVRRLAGFIQCSPCGKMGPLKQQTRRYTPIWKSDPAVDNVAPLRDEDERRALWAEVGPISDVGSAVTAWIRFGNDPVLHTAVPTMLGGKFRNQQREKESLLPNSSSPFAYVEDYMGTNLVFGSPVHAKESAAVWATYFERRYASRLRLSRRTVANYVGLINSPEVFDDESDRPETRWSQDTFFRECAYLSEKFLKEKVSNMQQFEAALKRASPEAYLAFFDAFQQQTQTQIPLPSPSVWHYEGERRKQWAEKFISISHKAQAFFKDVLSEDVKKYQEVPGKLLQKVKPVLADVGKILVKRHERWLKGRVWTSLTEEEREAYCMKEVKRQQMQVEDGEFDPMMEDDVDDTELEEWQREHDAIMKLMNSPIDGLHFTTLELWLHTMRCEELETEHIYTSARVRAIQVAARKKLYDTTSYEEVIQAVVESIARGTLDLGAGVLRPHFNEVWCQLNYAKFGSSTITQHTTTSRRQLLFFHAGSLKDIAATATLYYATKPLSNSLDYASPYKYRRSLITLCSNYGVETAYTTQRPLLRSAANLARAEDLIHAVVTAAAQPFGERRRAATRDLHMEFQRLAVPVERVIVANPVSALLESGADPDEKPVEGEKVNMWPLGAKRVVLYKWSAPNVEKLKAMESDAASAVSGSSLTAKRLREIQELKRRGFLEVSLWRRVTAQERKQRNEIVEAKKKQVEEVVRTVPSLAHLHQYATSLYSRIEERVAFPTETSTVTETTNMKEETNKPLEDSEWEFAVLLDDRVLLNKEESVELYLPYRDANGELLAQGEYRALVRAFDLEANPNLHPAYCSVGYSESFQVFDALPQLIAQFFRVKDATAEAAGVTHIPAADFTPFCAFLRDAGLDVPLRCEFEAGQAVTTDGDVYMDYFLQLLRGEAFHQSHAQAGLTEAQRAIEPLCRAHWVVHHPGADESEWATARRSVLDHAMQHEREWWFPNEMLDVKDVVTGSTNGLTPQMYPAAVRYGVELCTVLTAEGKFVDERGSGLSARCVVNGTGAAESVVFDTANCNGTNTTSVEDALRVAHGALRSAQDRHNTLAAFRLGPLSKQSQVLLFCGVNAYEFGGKYARTYAYAFEKAKKELEATAASGFMAPSLSHEDTERLSDQPTTSPSVDRFASTTHPEQRKAQFVPRVGPGSTPLEDPAADQKSEWS 1165 T 0.23 CT_C_D pdbpercent F Eukaryota T 6hiv 17 Q DE Q386Q7_TRYB2 mS52 MRRCSFASTCRYRSADDVPLGLGSAYVWTCRNITSRGQFNPIHNFSYAMERGVRARDVKAFEKLITNPGPLRVAYTPDYLDWLHRCYKAKGTYMDARAVAEKKFNGNIVSSELSAAVNRREGKRGDTRGDTVDNDHHNLPGAPPPGMFLRPAHSFRRLAGELKRRRAQSILDEVARAQGMLDLFERQPHFPAIHIDRCSRFHLVELFKEMVLERSLDSNMIWEKALLYRAILSERKPSYPTSFHYIFTAVEDTVFAPTISTPMEMAGSGGEGHDRSSHPLAAKCPTLEAYYYYVYLVKKYYIDNAVEAHVVLRCHREPNAADLLFSNPPPKDDTEIMKAVELLRNADIQRGVAAAAAVSDPTLPPGGEGSVIGNSDNXKNSEKXSEGSRGRPARPPVLPGAYPPIDMLWRCEENLPLLKVLLFGEFNLIVSENPFVKFPSAHGFLTRPYSTDSSRTLADGMSLANVMAEKRGHLLPSLPRNTATSIDARAQDIRRLQQKHHRDDIVSFQKLLRTHAEDSPSAFSSYSDWSYFNPRAVRAEERDRLTRKAVEALKLYDSATNDIYRHSFEDVQACHTQRVTERDRTMPPYLPTLPHFVAIIKKDPHISFLLHIGLPDRNSSEEGSAKHKELEKRIYYLARALYHTALEYHNETVRRVNRQKVNVAASLLDNFVEQEWTTILRDKHDVTDVTKTLNDTQNDKKQLARRLGRYMLFANRSLDDTGFPTDARADDYTRWMAPPSVGKVSL 746 T 7.6 Complex1_LYR_2 pdbpssm F Eukaryota T 6hiv 18 R DF Q38ET1_TRYB2 ms53 MSVSGVFSKGRGIGHEATTSILRYIPRARVPWQPSRFGRENLTAADMARLWSRGRYRDGPGNYNSGYCTERTHVLEENTVSIIPRRELEKYMPDITIGPKALVTPVSLMNARNGHRVTHDLLHSYDPHIGRLGKPAVVDHDNITVEDPNRVGLNAATLDCRGRIYRWLRRGPFFQVDNYFRRSVKLNRDGTLPTDFVHEAPLMRKIIRLAHRGHLKAACEEYRRVTTVPPVEVYRALTACCVPGAKLADAVSIFEDGDSKLFYVSRDGEVLHNLMRCAIAARHRARIMWVYNVMRGRFYENVVVRAEVDLIWRYRIAMIALEYLLDHECAEEAAAIYSYLVEEELLRCDVHVRVGLHMREAIAAGKPITLNNDVMNATSLVRDATAVAPEVARELQRRHAQTLQNNAVEAVGAENVREGSAPWSILGPLTAIGPTAEDTMVWLQQHYGDVDVMSIMRWARFRKGKDLMAKDRPQYLARAAAWIELLSKRNREMEEVPLTYMRKSKPLVLDTNSNVRVAWQTPLMRSGGPPRLLAREEGYVFHHSNSSRFVEETYRHPGESLQSRYLALQPLHTEVSAKEDFQRLYYQAQKHHKQQERLLPVSTAAIPSRIVHHSVMSALHGVSGKGEPANRSLFTXNKSNDSHSNTGVASGTSACTDGAGSATPEF 666 T 0.0009 PPR_long pdbhh F Eukaryota T 6hiv 21 U DJ Q584U8_TRYB2 ms57 MFRSTRPRSVGYTPVNPDTSPMVAYSQYHWHYNLPQGMERPHSVNRTFAAPFQSNHSLVNKYRGVWIEFDMHPAFSVALEPQLRKLPRGRTLPKTPAEEVIADYTALAPLVDDEKTRDLWLAKVFQHCAFQRCGGAMELWERYCHQRFTAEGATAKPPLSLVKSVLFYCNKTDNSGWRALFDRCLKDGWNYTPLFDTAQWSFMLKSIGRMGDEDGVRAVLEEMLDVQADLDRVEARSVVIALNAVTNADVYEFVKKYLFNFGERKVKFLRTTYSDLRGHGAGKLRIPLKENDNMYYHVCWHSSIRSPRQFSPRQLYFDYTPSTLGSSSHNPNAKIDDIVKDKIEKWKAEGLLPEDYVHEDRVYDRSAAFKNVARQEKWKKMPKILKSKRMGYTGDP 396 T 1.1 RPM2 pdbhh F Eukaryota T 6hiv 24 X DV Q57UZ6_TRYB2 ms69 MRRVRDGLFLSHPSSALQCSVAVVSTGEFDHPPFQFRQRHTFNTTPLHDANRFGGRTAYLREIGPVNIKKQGRRFKKDPRTVQFNVDVWCAQQTLRKRWKQRDWEVIEIPFRLVPREQQRVIPELYTDIPQMTDPARNDFSNIRNKVYDREELQGVLFPAAGAMLYPPLQRVDKQAMTLDKYL 183 T 0.13 CNRIP1 pdb F Eukaryota T 6hiv 25 Y DW D0A8P6_TRYB9 ms70 MLRFTHRALTATPERFSVLGTTHPKPKRTGFGRNNKMRSKPSDNVAWYDKGPVEWLPRPVRLTYDHLDQLQQWTMRATLDGRTEEFNRIRDLHREWSQHPLMPVLGDVEPKFPLNLFKQNHRAKKRFLVRWHKANTPANWLWMPRGPTVVTPLHRTNPTQYPENWKQMVQRKSGTGTPS 179 T 16 THDPS_M pdbhh F Eukaryota T 6hiv 26 Z DX Q383G5_TRYB2 ms71 MFHRAFVSSSDLTGCTIALSSVCTQKRYWAKPKKRPKVGQGFHEKAQKWREEYLLDRHRALADSLRAYVEFSTSKRVEPWDARFKPFDRIEKDGVYVLMRYMMEEKLQLCNYHHRPVKRLFCNIGLMGPQITTKARWKPYRFATNPAGTSKAERMYQRDKTVYTHGHND 169 T 0.18 Tox-MPTase5 pdbpercent F Eukaryota T 6hiv 27 AA DY Q57YD4_TRYB2 ms72 MLRSTLTSLNTFLTSSVATPPISVIRTGPKWWADPERMVRQKLMYFTLGVDQLPLRRTAVIQRDLQRFHMCKPPPRVGDSTGYKRSRAAQLNTWYRRIQYQEYHMQHLFTRHVWGLLRVYPGNTTKIQGKADDGYVGYDSVPFHRYNRAPLPFPARELYERRK 163 T 37 MBDa pdbhh F Eukaryota T 6hiv 28 BA CC E0A3K1_LEIAM uS3m MFLIHFVHYKTILQKYTFKFKHIFLSIDKYNSLFFNISGILIWLNIIHINIILIKYSFFILINNFEYLIILIST 74 T 0.022 Prophage_tail unppercent F Eukaryota T 6hiv 36 JA CN Q580I0_TRYB2 uS14m MFCFSRGLWMRHIGQDVPKRHTHFVLESRLMYEKSFRDCWLHSVCRAISQLDEPLSKTVVGTHQKMLQRKVTCFQYNQYGLFKTPYYRLANVDRYHAVQGVAGTREWVPYVNVSYWTMNKMVRGGNLLVHRVHYTGWGTDSHLKKGGWEHRWNKVLQRNVLQYSRI 166 T 12 DUF6462 pdbhh F Eukaryota T 6hiv 41 OA CS Q584T8_TRYB2 uS19m MAFRNTFTTPGKFSTVSKNIVLLLIWRVKVFLRAEGFAHSLVMLPVSLYSKILLCDVKKKIVYFHCCTRKKSMLRRCPCVWPDGPTKSSVSIGTAFTLQKRFLKIAKSAFGFYLARRGQRKYPFLRRPHIKNTHSMNPSAPYFWSFMTAKSQMAFLPEENYITGDWTGKFFVSKRQVYTLQHATSGAKVRVKSFPSIFEFNSPSRWNIGKEMNTLTKPRMDLIDEQMLTKKQRLDYVRAGLLPK 244 T 4.9 LIN52 unphh F Eukaryota T 6hiv 42 PA CU Q580M9_TRYB2 bS12m MLHTTRLWLGGYMMYHRKAMGTMKYSKWKGAHGGISHFYGRTPMVEEVRPNEPITLVDRRIMHYVHHSRLRHFQLFRSYQEKSNSTECKLREGEMLRRRWHRRLQKSFIAFMQFKTMKVLEDQARLVNTYGQAAVNAALGDPWNATDNVARERKSAAVRRQVRALPMVNVVPKHVATMKQIHNDRFNYRWRVN 193 T 2.6 HMD pdbhh F Eukaryota T 6hiv 43 QA CZ C9ZRZ4_TRYB9 mt-IF-3 MMQRCSTSLSRLCFRRLLRTPLLVYSIPPTRDVPSGIAHCPLSCSMRMVTSSNDDEFVFDPTLSIQKDAAIHTAKKSFETIVLEYVPAHAPEEARQKVKSYLTQHPIDILITQPKVQITHLEDAESGAETKVSLSPCDLPEALQQARERGMNLVQMGARGDVAYCRIRRESTRILSLIHTELEALREQEEKQQGKGRGGVQAAAKMGELIDHTFRDAVDAHFVGWRSKKIVEDIRRRHPVKLTIKEFQSPECAIGKLREMCQAMQHYAQEKVIYHHFTSIVANDREASITFAPALPMAKSDSWKHIKYPGEKEWTNALRRMEDACRKSGRYGTYAKSNKLKLRSLGQTSYRVDKYGRKMD 360 T 0.0011 mIF3 pdbhh F Eukaryota T 6hiv 44 RA Ca Q38DR3_TRYB2 mS22 MLRRAYIQRRYPFNKRGPREHKSWKHHVLTEPPKPLQWRDPKVWTRDLSVMKSFDAPQWDLWQSRPRSEDMDEALQPFMDMPKSLKDRRYDIPWWANPFGAWYLQNILSLELLKLKSKTNAEKIATYRSYMRSLASGKDNTMSDDDVIRNIIKERWKTLEFGDRNAGYPCTFGDYIQFLNEWFKSLDEEGMQRLREHFDRRIRPLLAVMSPVDILWLEALTQNSPHNKEQLQRKIAFQTSLGTPEFFDMSKRLRYEINEDYKVRDELGPELFALWSKAPERWPPERLSKMYGLDFTLVRKILVWHHFKACYDACVEPDWSLPKRLFALEWIRDVRARKHGLFYGKMRFAEQKITFYSDRFLFRDLVNRREASYANVWEMDDPYRFLQTEQDYEDYWGDNYDVYRRMFPEMIGRTGEPVQQYGQMPIWAGPHRQHANKSEHNWMFAEIGVNVGHEALKKLELDPTNEKRRRFVIRQPDGTLRSAKMSEMRAWYWKEEWADFRFWAPQMEWGIENTPSQEQYQEHVPDTTDADFRKQRRIQSRPVKWFYESHYTRTGSFAGFQPLRFMQRRTEREVRWPDVINAAVQIQKRKPAAYIFKAIPEL 602 T 0.26 MRP-S22 pdb F Eukaryota T 6hiv 45 SA Cb Q57VB2_TRYB2 mS23 MRQCVVRRYKMPKNMGVAPRFDTWNEKYEPWEHMKRMGRLVGTGFYIPPEWYNHFRMFPPINHNFQQEKTLNPHNASEPTQDDTSTLSPERVALRDELARKSRLVASEGMRYYNIFWVRKPLDTMEKEYYELKRRGVDHGEAIRKVLQGFYSGLAVKKRVAAIQAEEAKLTGRFITMREATVVLGVLAKLHKEQLTPHQVSLLAKEQGETTQSGAKLTAIVSRTQPHVNKEASSPATSEAVGSSTEESLSADALASMLSEDGEQSAVGTRYQVEVKETANDSVRQLREKAEDQTGFPDWYTGESPTYSGTSCSRDGFALMKANK 324 T 0.026 Herpes_ICP4_N unppercent F Eukaryota T 6hiv 46 TA Cd Q38DK6_TRYB2 mS26 MMRSSSFCRRQIRPYYNLPSKSEHGRKMTGFLTPYRHWMWKQNELWRNVHEAQFEHLRRVYKRQWLESFRVNADEYIYKYNITKAAQLAQWECEMKEQEKKRIEARQMMDGRQALKKKHLDLLREFHERQFFFWYERASERLQNMNLINYVPHAQLREHIDKELDKYVAGKNEPYPLNFVGQMPFLEDGDGNIVEVPESLLSNHMAEHPDSTAKPHEPHTSSSISEAAAFEERMLRAMVSAKEEDLKEWLGDDSRALSETIDDISREEEEREADIRVARSMEETDAEREVSRRAYIERGKTGSRSIFRPPTVSEGAGGTPSAPAGDANTPMRRRKKGKLDRVHALQAHQDELLAKLSSQGLKEGVDASSVPERGKIVQSRGRIRDKAVIPTHEVLMQKPELAAGSTPGARIQTKDMVDKMYHRGKYKKSGSGDKSDGEDL 440 T 0.028 ThylakoidFormat pdbpssm F Eukaryota T 6hiv 48 VA Ci Q57WW0_TRYB2 mS33 MVLRRWFPLLGVHRVGYTHPSTLPVPCAQRWDLRLARARIFQEYIEEKAPGAWQLEDERSMSPEFKTFTGYPMRDMRPGYGQNLPDYIMKKRLPNNTHYELFARRDIPNEDNAMYGKYLYDMTVHGTSLPSTYRMHKDINKAQRNDRKLSGNRFKVLCSSGAKNPPSGWEPIPDATEEEED 181 T 1.1 Nmad3 pdbhh F Eukaryota T 6hiv 51 YA Cm Q38C96_TRYB2 mS37 MKSSDIFFAYRLTPVVFKSRQHDSGVNQYGLKPTNAYDYINPTNLINFGRGTTFDNLGVRRAGRGEIDSSPSHSGSPVFTQAKLIGLSGEEQLTMCQSETMALRLCMAKAGKETCERESRALDSCLGRVGHLRRAMSEACWEFNDWFIQNVSDNHTKPFQHRPHDWRHFYAQEKLVRERQQNGHAYGRRPKQFSFGARYVKTDGYGKRPRLPYNK 215 T 0.054 Gypsy pdbpercent F Eukaryota T 6hiv 52 ZA Cn Q57VQ9_TRYB2 mS38 MRAGCVACSRIPKLGIAALGSTTTSMQATCQETQELRCATTQMAPAVIFPLPPPLRGSYIDRKPTAASFNEHHATASFRHHMIASADVSHRPRHFYFASGTRNDTGTRSVKEGERKQLKQQTNVEVDSVAYGDQLDELSARWAAKFYGQVTFGPRNYPYPSSRWLARRFQMKKHRIIKRFRFRRYKLAAVANLPFAKMIRVGMLPELKSSKTKRGDVVDPTLSGQLVSAVKNTGEKRKGQRTRPKSKYQV 250 T 28 DUF2996 pdbhh F Eukaryota T 6hiv 68 PB A4 Q38AB0_TRYB2 bL31m MVLHKWAVVSRSAPPPRGLRPIARTIPTHPRLRPVDYKIPYVLRTFIKDRHTSEVQHLENRGMFAEELSIERSRFPRFHSTFTIQTDGSLNEREFEFAVPPIVTLFHDRLSAHRERQLELAKIGKLRKERNWETEQKGEESVSMACNALAFPYCIPKNMLKRSRVVDPLNSKSSTQGVTSGGG 183 T 12 RNA_pol_L pdbhh F Eukaryota T 6hiv 69 QB A5 Q584F4_TRYB2 bL32m MFRRTFFTPMIAQPTLLMLGNKGGTPKRKKNPMQLRRKVYGLHFKEKYLKMEEWYYCPLCAEPKKPGEWCRREDCRQIKP 80 T 0.089 HVO_2753_ZBP pdbhh F Eukaryota T 6hiv 71 SB A8 D0A1K1_TRYB9 bL35m MGSEESNNICAYKRTISLAKIYIVLLVKTAMLRYSRLCFPKMGCEEITRKARRVQLQPTEYLAQHRMQVWQLRFKEMGPPFSRVWVALGGKMRRRRVGRQVDVKDMRYYWRPIEPQYQRLYMSRLRIRDHSNKLRQPMRLRATNADIGSGSSSIEWERASNRKYGAMLAPPKRQDFEFRVV 181 T 0.13 Cytochrom_B559a pdbpssm F Eukaryota T 6hiv 76 XB AJ C9ZPD1_TRYB9 uL10m MSPASPLPVAALSRLRITHRSFLTRSRGRHVCRSAVGVEYRPEQQKKVLDHSYARVINAEVVHGDEQKFWGERRTFYTQRNIFFPMWDRCAQALILITREVPRVPQEMAFRLMAVFLKLMLLPRLMMNTELMLPMWIASNAEGAMAAAKDGSKGKEQSSKQQGESKDDAKKEGDNTK 177 T 12 DUF5783 pdbhh F Eukaryota T 6hiv 82 DC AT Q4GZ98_TRYB2 bL19m MGYTRERTNRHFFVARANAFFSRLPIARVQRSLAMEAVREGRMRPWKYTKEQILGAPVTCNFEYNPRPVRLIGTVMDAHTEETSIKGGLKVYARNEETNMMLWIPPGNPKLRHEVTSTKGSFQHYLDERDKWDEAWITGRARMK 144 T 3.8 DUF2760 pdbhh F Eukaryota T 6hiv 83 EC AU Q383R2_TRYB2 bL20m MLRRTVCVQHYRAKLELDRIRSMLRGRARLERKVGLKRLFFLMRTQTRYRVEQQAHWERAIVRKNVDSAAREHGTGWQHLRNELGRQNVMLLPRSQQLLAQYEPLAFRAVVELCASRIPPPPPPVVASVPEESYTLWPPASHDNSECASTDGSDAPHGQQQSLSHPAARVELRCGVERVLRRGPSGLGNNVNELIDAWKEFDVSPLRKGEVNK 213 T 7.7E-05 Ribosomal_L20 pdbhh F Eukaryota T 6hiv 90 LC Ae D0A8I6_TRYB9 mL41 MLRCSCACRRGVYHNAPSVYPFVKPFHDTPYDQDRGRHDSVGQRYRKNSWPEWMDNGADGTGYGIGLHRTHPLSRLRGNLKRSPSHVPRVLGMMIQGVWHKSGVKLYFRGGKPPNPSVHPYLTGEPCPLYGWKVTDESVIRQFNMPSIDGTNFRYKPYVALQERKIMGVQVPSDSVSLASGRTTESKPLAKRLFFWR 197 T 1.9 MRP-L27 pdbhh F Eukaryota T 6hiv 91 MC Af Q383S1_TRYB2 mL42 MLRLCRVSLRVQSHQKKRAQHPNAGTRFGRVYNRGFIRYGFGGFGMSVYTPKKDRRFRVQPLPSLHANSLADDTPLVTTTRTLSPNFRAFALQDGGVFFTHPSHEQVMRVGQNILAEETKATGMTSMDTYVNSRIQSIIAENTVENVALSHWRRRHMWNLVRTHGKLQRHWGVSDATKGARSNIYGRPS 189 T 0.12 Toxin_10 pdb F Eukaryota T 6hiv 95 QC Ao Q385V2_TRYB2 mL52 MRLHTVRAPIITRAAMRGYSEARSNYDGTSLPAWPAPGKKPTYPAALSELRLPQPRMRKTRTEWMYYHGHGGCPGKYGPSREIADFEYADGTPASISGRRFAFKHHQDHLLVQLIRAAATVERYDASGLLPRIPGTAEQRNWDPAIPLFLDDVDEQGRPAPLRTAGDAPGTMVSHVCSRVVDERMGTPTHTPNELANRHEGETLEANTMFATNDPSAFVSDTVKLRDDKRPYWSRRRWALTDKFLVPKSPKPKNTIKDE 259 T 0.00017 MRPL52 unppssm F Eukaryota T 6hiv 96 RC Ap Q57YA9_TRYB2 mL53 MLNPPKHYSVESLRTVGLLPAQLALSRKPRLRPHVGNLKGLVYPLPYYAMWRGNHNKYTYNKSTVCLWGEGDTRSMYHQHYAHAKCPTDYGRGGREFEYLTVKRGKMLQKPLPRVQYVAEGSKPVWLFKSWHTPLSSPSMWEREVQYAEHTPEHIGAKRPLAVVAPRTMHRYLFLMHMEKVTITVSPLLFGYGHTIQKAVLDFYRRAISARSPFPKDKVFLFYAIDHITPRIEVTWLDGTSYVPPVLEGASSQDLIQMVMEEAWLAADRMAAEGRVLNPLAIDDYKWDQLVVFKKVRDKEASKGGGRKK 309 T 5.5 MRP_L53 unphh F Eukaryota T 6hiv 97 SC At C9ZU82_TRYB9 mL63 MLRHCTAHRRYRTAWRELLHPLPVRARKMEWLKRDAVEENEEILRRPYYTIKSYALPPAVGRQESIHNSNNIRGGMHSSHSLDLIMRQPRRVKTPEQLRALRDRLRFIGVTGPMPQATSVSTKSYTDTYGSRLRPRYPESWDTVPPHQPSRELL 154 T 20 DUF4113 pdbhh F Eukaryota T 6hiv 103 ZC BA D0A5V6_TRYB9 mL67 MLRRFALTSSVALRLRFERDSGHNTVRYRPIPESMQPKHLEDNFTPFPLPKFDESLEYGPVRLRNIPDIEAAKERRRGSRLAATEVLLQETLQEENQFATSGKGDGNMAIAITERHTEDVTTPAADSRFPSQTMSPCSHEEEMRGYVVSRDYPLIDRLHCTRSIEELVAQFEDRPQIESRVAALADMASTVSFRSDEELLRMFTAISAPFSVDGRGLNFLTVKVSKFGRPYYVPNSLLPAYVNLVDATTIALVREQPWRLSASPALFIQVLQFMALIKVFEPNKWFTFSDHAPSNRADYRHAIGVNHSTAFWGTGEELYDFMVELLRVEDDGRIPTMLDLCTREQMVDLLSGFCGVMPCGKAVGDVFKTITDAFLRRVRNDISGPWSAHDWAIVERMYLVTVLCDAGNNEILQLLLSDTASPRGPDFFAAVSRTKDTPTKKRALCLLQEAIDNASAKADKVTLLGLLESGSEFLLSLVDKGVAHTFATQNLFDYRILNSFLHCSLVADRLRVEQSVITSLIPSSLRDVQVQMLMSNERNALNPLTSSLPGNSGAIATAPKLKRPLMTMLSQLEYLNSIDSVFILHSSLMATSTDQLVSAVRRLPSGKDSLIVTMSCLRALSVKSLTSPSMKERIACARALEIVSYELEKGRAVLLPFSEEILLHDAGAYCDEDLMLWTVAAFLARELPLVKVHTLMHSNCTARTPYRFLKGGHNLLVSSRSLYDKGAPLLSSLHSKELRLVTHNVRLRTPVRDRKCTLQYYNPIRARFVYRRDKPLFDKYHVTARNLAPGFSRGALKHDWRALGVYTPDHPQVPYHPLQTWMLGETTVEAE 831 T 0.047 DUF5642 pdb F Eukaryota T 6hiv 106 CD BD C9ZR91_TRYB9 mL70 MCLKRKAPHLFCFCLWSIFLSFRCFCFRSYAIMLPLLSFPTIQISIFLSFKLPITTFLLSPCFVFVFVFAIRYCGELTLNAQLVLFLLYHCAQTQRGPLKEGEMPICPGLCGELAAVPFRVFLGTLPTLAVEERFLRQLQPVFAWYSSRKRVKEQANEFIEIDLASCDAELLLRYSHIYYVRRQLFDELIERQMTLLDSGKAPKMAEPSLLQCLAGCNMTIADRLQLEIRQLGAAKRAASVPGRRELDPVARLEVYDYACMMRLVEEDAGAVGDAEMKARAYLPREVIESKLGHLTQLLLGSDARAALDKKDVKLLNRMIPPDYTRVGCVEKLRPFDVTAYFRFYGERINNVKVENYFKRALWGHVYRRFATTPSFLSGVSTYWARHSGLDASFTTTTMPQEVAVAVCDQQIQFPAIKFRAQYVYTSPETARQLWRTDAAVPLMRLFPLMGSRTAEDLAAGVLTDAFWMHLGLSEEENLLQDSLLLKVRRFVDEVGDMYETNIDSVLKRVDDNFKQVVPQLKAEDLQVDAPLQDGEGEDTVRETVAA 547 T 0.26 MGAT2 unppssm F Eukaryota T 6hiv 107 DD BE Q57WG1_TRYB2 mL71 MLFRSVSCKNYQRGGWSPGSKHQKHMTLNPTLYLYRFPGPHGPGPYTMKYWWTLGCFPTGMEVPFRLHEFLSTYQQEHVPVEVEEWLRCYIKDPLSELVNASNDFFKAVEVYPEVESARGYKTLQPSIAPLLVPMKKFEEQLGVKISPVGLRSVLSNPVLKDRFLDDLFDYKSYVEKGGSTPHRRLARSRFEGSLSVLGECEKCLPEQHQVEISESLGTFIGATVSPAETTADDERSLILLLTTISEGCINAGNYSDAASVLADALMFCHDPDSQATTHANISFASLLNADFKGAEYNGREAALLQPQVKPTSTACARGYVGWAAAAAYQDDFEKAEAIVKDGLTLYVGNEHLEKLANKLQALREEQPSVYKQVPRSLRESRSHLPSQQSRGLLSGSGKGFSNEFDWVEFKNKLYPSKMDPRNNEMGSVFRRVGDLGSFISTSRSMERL 449 T 2.4E-05 TPR_21 pdbhh F Eukaryota T 6hiv 109 FD BG Q57Y49_TRYB2 mL73 MRLARTLHHVASATTGGGQMLEGLVNDGPEVAHKQHASFTPFSIQPWQARCVGASRRKLLPQMLMYHGARLGPRPLIILDHSTKGEAGVAEAARKYESILSQLSWDYGAVYIPLHAQCTDSSKDLLEQSCQRICAVMDALDVRWTHFLTYSYGALVAVRMASSQEFPHRVGTLMSLDTPLVTREFLRNMEQREDIAKAERDINVPEDGLAFAKQALLSSLEGPLPCPAAEDESLYRDYLFDPNRIFGAGGLVRDESRYVPLKSLLGVRHPVQLIVPSANPLSDAAAHSEVFGHRRPAVVKCCQRHEDLFKESAAKEVAGVLGAWMRRFEPDCFISKRYEQAANEMGQLMLSTAQVSSESAGKGGGEPRKKKEKKKSKA 378 T 7.3E-10 Ndr pdbhh F Eukaryota T 6hiv 110 GD BH Q38AM5_TRYB2 mL74 MLVPGLSLTRRAVTSSCCRPLHVVRGFSTTCTLFGLEQLQDVPTSTSRRPTGLHRGPGKRQTSEREAAQYKFIRRWELQMRDEWDQLEPFKGLPKPKRQFGNEAAEVIWPYALLLERVVKVHPFTKSIYVYYAQRQSTARGKLAAEIARSFAREFLIPITFHNSQVYTEAEMLLEYSETPWVVLHSLDNGQKPRILPVAPVEGTPAHTAVEQLLAEVVQGCEALGASVADPVTATRVLNERPLQNQYVRVDYQWFGDTPDERASHLVRWEFEPEQIEPKIRHRTRHVLDWLNYDGNLPTHRAVHVNAMREKARQKAPRTVAGPRTFYNSAGSRANARSSRFGGQAAVGK 349 T 0.023 L51_S25_CI-B8 pdbhh F Eukaryota T 6hiv 112 ID BJ Q383M2_TRYB2 mL76 MLRLSSWNLKSQHHNVLRRSRPHIHKYRELNRWQRQAQGISKWDQSHSHRPLPYVERFNPESVGLTRGTSAFAWKWWHTQYPWLPNVPPEAAQIDEAQKQERRSHRPPAWDDEFAKVVLNMNDAEIREYLMSKLTDVIFLETQRDGYELRRLDFEGKPLTSLPEPRIIENFVLEEETIRERVIYQVVEGVFRLSPTSADRRELRSVANIIDYVLTHVRAARPTDRERRQERPITSAALAVMQKCPIQPQLGFVHALPHDTRDALLQEWERMHHLDWQFGKAVYTPRSKENVRGNLTWLREDRHYDQRMKFMQEVESGEARAKHMKLIAEAAGN 333 T 1.8 KIX pdbpssm F Eukaryota T 6hiv 114 KD BL D0A4P5_TRYB9 mL78 MGASAGLIRRGGGVFPDAVSLTLTPSRRVYGGSGRGDLLYENPDARRHSGRALGVLNGVRHSSQATMPESGQLYYRKLILHSRPPNGSCAGLQRHCHDTCNWSYLIPSLHRCAESAISAKLWEKMCQLGLEDRSKAWVNLTQYERQRVRDGQNLYRYEVHQRLPLLEESIGWAQLDDLLGWFRSARRAWVRLPTSVTLSRESSEAGVASVVSPSSAMSCRLEGHADSRDTTPGRNQVFDTPERVEQLTEATVHRIREELQRLNRSERSDCEGSAAMRASARRLARDEELSRCVEEELGWHGVALQHRIPVPK 312 T 0.091 CAF1-p150_C2 pdb F Eukaryota T 6hiv 115 LD BM Q381N5_TRYB2 mL79 MRPTFPALGSRAKGYENRVMVYAHRRHRAWYLPPKLAHARSPLANKSPDEYGNTWDPRTGVEWYHRLRRRGAYRHWPWARWNDDPVRQHQELSCRRTFSAAVTGANEGVPLWNYYAEVGQEYGLPSHFPLSFMAPFIHQYTSRAWSRKEIERHLKVVEERTGLRTIQQACDATSELLEWGEEEMGVVPHGLLQHVVMLAEDIVLQNKKKAYRKAAHERGILRTTTMERYYALPHLRTGPPMPTTLEQPSGEFPWGKFSTMVGGTRIHPLYRPDGFFKDNMYPA 283 T 0.055 POTRA_2 pdbpssm F Eukaryota T 6hiv 116 MD BN Q585A3_TRYB2 mL80 MCCLYTSDVFFWSSLVVSPLPRIVRCVSAPQIRSIPCGSGDFSVMKRSLIARWQSGVHTPHGVVYRGAKMKNWPEQRIPENFKFTEEQRFRTKAIPRDVGTIPRNFVLGVLYRHQPCEVGGLWEHCTNDPEIVLDSKRHLREVLKQAREEGFVTFERDAISNEWLCFLTRERYEEVQRIVTAKSEAVDSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSHARRLEEEVANTTRYLRRFQQREIDYLPYTDLNGKVNFMWWYETRDVQQRADEVLTDSSSSKALAGEEEHKAGSQLEATTASTS 302 T 0.004 CCDC106 pdb F Eukaryota T 6hiv 117 ND BO D0A755_TRYB9 mL81 MKRLFPSAGVSVVLTSSSIVMSCPCNHIFTSRRAYYWPYNEDFVPEGAETSRFQSSGSPGTRRRVLQEYALSPLFGARVPCCVVGTLRTAKEIIVLKRDIQSLLCELMELPQGSVTLGPLQQLREMMLYRHGTPLSPRLGDERQLNMDAYARRPIAARTMMDVFLAEDMSLDEIVNFGRLSLGKLQIALNNLRKENAEKSSADCEDGSAVEPAQLLVDRMGISYCEIPSLDESDYVFAEVGGVAVTDEEAERVAQRWAERCE 262 T 10 DUF1382 unphh F Eukaryota T 6hiv 118 OD BP D0A0S6_TRYB9 mL82 MRRCAACLTSLPVSSPTGAAVGAPAAPLKTQAPSNRSLLGYPLRRAAAMEMLYGGICIQHLAQPPFPLRKIQSESLPPPSLQGERDDLELEVKDSTGNVMGYRLFPVNIGIRARTESVRVRSEDCYKRFLAQKHCAAAGVPLQFPAPSSITNSNCLATPRAASHFHPPSSSLSLFTRPADSQGGDVGRTTPADVAAYHPRAWRPYQMLKPMPHNWGPAVRSSGVRGPHMQLLQERIDKKGFGWKRKSRSLWQQDISTAGFRPKRYF 266 T 5.2 BH4 pdbhh F Eukaryota T 6hiv 120 QD BR Q586A6_TRYB2 mL84 MLRFTRLFREMAPELQLEYMPVLFTRTILGPQGGFAGEERLVKLEVARKYMEAGHAVTPTEELRRGLWCYNPDTDKYDCFIERNEEFLDFAARKRQWLDVYWRVNTGYLLFGRQSWGQGFLINCPLRKRDVAQKLWEQYKVRIDPRLIEFREKDRRTGIQELGHNWCWLYLPGAEELGINREVYDNKRVKVRIHVRKMNSMFALY 205 T 2.2 Ribosomal_L9_C pdbhh F Eukaryota T 6hiv 121 RD BS Q38FG8_TRYB2 mL85 MRRLGVFCRGQRNSPLLTRGIATGGRVTNEDRRWWLVHLECAPDITPGTFIAWLDCCGTHTCKKLIERNIWTIEQVAALDSDQVDELKYREGCLKMDVVWEHARTIITPLRQREVTGGVESELQGRIMELRKKRELERRREEILKERANVSEQREETLRK 160 T 0.048 RNase_Y_N unp F Eukaryota T 6hiv 122 SD BT C9ZPU8_TRYB9 mL86 MRRCIPARGGFTMKYKKGTGLWDEDHVNDYKTNRYLSARATMRWYQEMERHQTRNSLNARRATQSHNNNRGLHHTGRGAFERELERRGVQVEKYPLTTTTGTMRVAELVILRRMELEKRAEEALAEQRGELQKKNPTPSEWYDESKGPLNPNFLRSMRSHYEVDIANLPDTPLIRGQREFFIGEERGNGAA 191 T 3 DUF2663 pdbhh F Eukaryota T 6hiv 123 TD BU D0A7Z9_TRYB9 mL87 MLNPTFSLYRKTLQSYPVPPKIRHYDRRWSGSRTNPYNRQYWRVIMNENYSRPSFWVSDFRHRYLMRTGTDYQGQVPSSPQPGLYQGFSDVHKLLANHPKPQRESRHLPVLPMTPRIVFEHANEKRIDTAKRMRRDRRRIEELKTLEFWGWYMKLQRVRGRWCREQGVSSRGVYGPAVDAAELWG 185 T 1.3 Crl pdbhh F Eukaryota T 6hiv 124 UD BV C9ZUW3_TRYB9 mL88 MLQINAFKLVRATPFLLKRTGKPADTPDYKQVYLPYDAAPTERELERERRRFKQAYHGRMEHRKLVEVKEVPLNVYTYGKEGMSLPIAIFKDQKDPVIGPEWTYPGIYENKIAAQHWYTEELFDKESKEAFESPWQQQILDNQVKRRMAKVMFRMRQVNMKAVDLFQKERGSSRRSGGAGEKGKDGGGKK 190 T 2.5 Ribosomal_L37 pdbhh F Eukaryota T 6hiv 125 VD BW Q57WW5_TRYB2 mL89 MSGAFGCGSYRSVVAGTQNVPRRMTFYPSAYELIQLHKAHREVIRHFYVRDKIFDNKFPGNALANGLFKFVPNRRENYHMRELMESIRRRSIWMHRIKQQREINAKVVENMEVKYGKKAAASMLCFTTPDSNAYFAPHRYQDVANSWPNYWQHPSVNHVVPKPRWRRHRELGGITRVEDPFAVQASDY 188 T 24 Babuvirus_MP pdbhh F Eukaryota T 6hiv 127 XD BY Q580U5_TRYB2 mS91 MLFTRCLLAVTTINSSTASAAGRLIRIRKKSKWIDRRSKRIPHNGKDVWQFGEQPSCALCHVRFRFKQDYEAHKESELHQNRLRWVETMKWWEEIGEPHHQQHAASEWEWFRQRVLPAKAAAMGLSEEDAARELRRAVMHETPRWYSRIQPPNARSEIKEPRDQRWPSSPKW 172 T 0.0021 zf-met pdb F Eukaryota T 6hiv 129 ZD Ba D0A4T0_TRYB9 mL93 FKQGQGMFSHQLKRLLQKKSIHRYNWDPLPMYDPRKLVHASRHMDVETWREVPDPHWDERSYLVPDQMFYNIPVPPEYKDAYWWRELQARRVQCPVEWVSHRMYNKGDRQRYDFQDLAFRKKFEFSYEEVVKNAKDMRS 139 T 7.9 Pox_VP8_L4R unphh F Eukaryota T 6hiv 131 BE Bc Q389K3_TRYB2 mL95 MFRPTTAIADSFKEHYHRVHLPRRLALQRYARQQSLRNAAKGNVKAEEVPYKYNRWWVNEEHEFVHQYAFVEDPEVTKAKRETLPPVTRENIWKEPQQTFFLPFAPYVRVVDYPKDPDAKFLKPVNIPRWKDYMQRTKPVIPRTWY 146 T 1.8 DUF4653 pdbhh F Eukaryota T 6hiv 132 CE Bd Q381W9_TRYB2 mL96 MNDIYARRLAQATMFHQLMRCHGTLWAATQVTKEQMDYNFIREEFMRVNGRRAMPLLLGAAANENLHQLHLSHLSEHCAWGESARALAVQRQTPLSQRVAALGRMAETIHQVKTASTVQNLFNEQISCMEGISSFEEEPLIEGE 144 T 0.083 CCP_MauG pdb F Eukaryota T 6hiv 133 DE Be Q388L8_TRYB2 mL97 MSSRFFQKYFIRCGNCQTIQRYAKGYKPIPNPILFDSDAHCRSYHRERRDCTGLTGTLVTCRCDKCARVHSHWTVMDFQEFLDAKLVMTPEERTALLWPGAGSRAEPSSGTSN 113 T 0.15 Myticin-prepro pdb F Eukaryota T 6hiv 134 EE Bf Q388M2_TRYB2 mL98 MVLRGVRLRSVAVSCYGSSLTAATRCLSVRTEDFFSKEAISHARRVSWAPHTTEKKQGAFAKLARSNFGDPLPSSFAQEPYFEEEIEAHRKHHRPDVYIYKYNVSPTHFSLRE 113 T 6.1 DUF2975 pdbhh F Eukaryota T 6hiv 135 FE Bg Q587H8_TRYB2 mL99 MYQRTRFLWSSWRDYPLGSRDRRGRFNMDEAAAALQLNPAYAAALYRPLNYTFHIRGQLYPAQKGRPSRPGSLAASQGRMFPLYQRNDRLDKELFRLNSRGLTTE 105 T 1.5 Tenui_NS4 pdbhh F Eukaryota T 6hiv 136 GE Bh A0A1G4HYZ0_TRYEQ mL100 MALFSCFRCGYMYEFAVSNSYCRKLTLRNDHCPRCDQLTLFRFMSVSGMVGNMPFKPIGVPGPSYATLWWRKTREGKEASAPLDAVCKSDRW 92 T 0.004 DUF1178 unphh F Eukaryota T 6hiw 17 Q DE Q386Q7_TRYB2 mS52 MRRCSFASTCRYRSADDVPLGLGSAYVWTCRNITSRGQFNPIHNFSYAMERGVRARDVKAFEKLITNPGPLRVAYTPDYLDWLHRCYKAKGTYMDARAVAEKKFNGNIVSSELSAAVNRREGKRGDTRGDTVDNDHHNLPGAPPPGMFLRPAHSFRRLAGELKRRRAQSILDEVARAQGMLDLFERQPHFPAIHIDRCSRFHLVELFKEMVLERSLDSNMIWEKALLYRAILSERKPSYPTSFHYIFTAVEDTVFAPTISTPMEMAGSGGEGHDRSSHPLAAKCPTLEAYYYYVYLVKKYYIDNAVEAHVVLRCHREPNAADLLFSNPPPKDDTEIMKAVELLRNADIQRGVAAAAAVSDPTLPPGGEGSVIGNSDNKKNSEKTSEGSRGRPARPPVLPGAYPPIDMLWRCEENLPLLKVLLFGEFNLIVSENPFVKFPSAHGFLTRPYSTDSSRTLADGMSLANVMAEKRGHLLPSLPRNTATSIDARAQDIRRLQQKHHRDDIVSFQKLLRTTHAEDSPSAFSSYSDWSYFNPRAVRAEERDRLTRKAVEALKLYDSATNDIYRHSFEDVQACHTQRVTERDRTMPPYLPTLPHFVAIIKKDPHISFLLHIGLPDRNSSEEGSAKHKELEKRIYYLARALYHTALEYHNETVRRVNRQKVNVAASLLDNFVEQEWTTILRDKHDVTDVTKTLNDTQNDKKQLARRLGRYMLFANRSLDDTGFPTDARADDYTRWMAPPSVGKVSL 747 T 7.7 Complex1_LYR_2 unppssm F Eukaryota T 6hiw 45 SA Cb Q57VB2_TRYB2 mS23 MRQCVVRRYKMPKNMGVAPRFDTWNEKYEPWEHMKRMGRLVGTGFYIPPEWYNHFRMFPPINHNFQQEKTLNPHNASEPTQDDTSTLSPERVALRDELARKSRLVASEGMRYYNIFWVRKPLDTMEKEYYELKRRGVDHGEAIRKVLQGFYSGLAVKKRVAAIQAEEAKLTGRFITMREATVVLGVLAKLHKEQLTPHQVSLLAKEQGETTQSGAKLTAIVSRTQPHVNKEASSPATSEAVGSSTEESLSADALASMLSEDGEQSAVGTRYQVEVKETANDSVRQLREKAEDQTGFPDWYTGESPTYSGTSACSRDGFALMKANK 325 T 0.026 Herpes_ICP4_N unppercent F Eukaryota T 6hiw 46 TA Cd Q38DK6_TRYB2 mS26 MMRSSSFCRRQIRPYYNLPSKSEHGRKMTGFLTPYRHWMWKQNELWRNVHEAQFEHLRRVYKRQWLESFRVNADEYIYKYNITKAAQLAQWECEMKEQEKKRIEARQMMDGRQALKKKHLDLLREFHERQFFFWYERASERLQNMNLINYVPHAQLREHIDKELDKYVAGKNEPYPLNFVGQMPFLEDGDGNIVEVPESLLSNHMAEHPDSTAKPHEPHTSSSISEAAAFEERMLRAMVSAKEEDLKEWLGDDSRALSETIDDISREEEEREADIRVARSMEETDAEREVSRRAYIERXKTGSRSIFRPPTVSEGAGGTPSAPAGDANTPMRRRKKGKLDRVHALQAHQDELLAKLSSQGLKEGVDASSVPERGKIVQSRGRIRDKAVIPTHEVLMQKPELAAGSTPGARIQTKDMVDKMYHRGKYKKSGSGDKSDGEDL 440 T 0.028 ThylakoidFormat pdbpssm F Eukaryota T 6hix 41 PA BB D0A135_TRYB9 ml68 MLYTRRLMTTGGSATADGAVSYSKGSYHIVPKKYTVGKRIAVRSYLDRNRTELSDRTYMPQKAWFEPYTPKKFDMEHQRISHNFYNLETKLIWTAFDTPELIGILLHDETIKGAPHLYDAEFLESAVHWTRESRYWRCIGITKPFYNKTTLRAQCWHDRGLQVGTLVFSQAMRDALMDLERAVRRKELGLEPNYVWDRWGPVGFIDGARTDHLPRFAHNPYVDPDGVEVTEVDIAPFNTHEQIKERYGAFIDPDLRPFEGVFRAPSHGALTLDDVPHQEAVRLYRDLMEKADMPVMLGNGAEIPPMDMRALFHLSANPERMKAASELSSWREVRGMLAPVQEVCDEKVEALRLMENTRHDAARVRTFYEEKCGFSDFMRTPDKVITAAVLCYLQELQRICTETDWGKPLARCLTDLERVNVMGKDAFLVYRHIEDAILDKKRRVWATRFAGEANEESTLDYLLENFGRRTEQTRNVGTTGTEFDREQEPIGRQVQRRVLDSDKASKLAEVRQKRGKMWSNKKSVFDSLHQKQLQNVTYGVH 541 T 4.7 VirE_N pdbhh F Eukaryota T 6hix 58 GB BS Q38FG8_TRYB2 ml85 MRRLGVFCRGQRNSPLLTRGIATGGRVTNEDRRWWLVHLECAPDITPGTFIAWLDCCGTHTCKKLIERNIWTIEQVAALDSDQVDELKYREGCLKMDVVWEHARTIITPLRQREVTGGVESELQGRIMELRKKRELERRREEILKERANVSEQREETLRKLREAIAAKKAAMXQKKQAASEAYGGSSDGGARKEGAEE 198 T 0.02 RNase_Y_N pdbpssm F Eukaryota T 6hix 66 OB Ba D0A4T0_TRYB9 ml93 MFRVTGLQLKNPVVFKQGQGMFSHQLKRLLQKKSIHRYNWDPLPMYDPRKLVHASRHMDVETWREVPDPHWDERSYLVPDQMFYNIPVPPEYKDAYWWRELQARRVQCPVEWVSHRMYNKGDRQRYDFQDLAFRKKFEFSYEEVVKNAKDMRS 153 T 7.9 Pox_VP8_L4R pdbhh F Eukaryota T 6hk5 1 A,B,C,D,E,F,G,H B,C,E,G,A,D,F,H P72321_RHORU CooJ MTESPERGRKRLGIYLAHFLDHVEGHMGEIGVQRDALAEDARLGALIDRALADMAVARASLNAVLRDL 68 T 2.4 DUF1569 pdbhh F Bacteria T 6hks 2 G,H,I,J,K,L G,H,I,J,K,L VE6_HPV16 Protein E6 RSSRTRRETQL 11 T 0.34 FpoO unphh T Viruses T 6hm4 2 B B MDB1_SCHPO BRCT DOMAIN PROTEIN MDB1,MIDZONE AND DNA BREAK-LOCALIZING PROTEIN 1 GVMTVPNTPQKPNLQ 15 T 22 TPX2_importin pdbhh F Eukaryota T 6hm5 2 B B RAD9A_HUMAN HRAD9,DNA REPAIR EXONUCLEASE RAD9 HOMOLOG A SPVLAEDSEGE 11 T 9.3 Ham1p_like pdbhh F Eukaryota T 6hn9 1 A A A0A3G2WH77_9ANNE Nicomicin-1 GFWSSVWDGAKNVGTAIIKNAKVCVYAVCVSHK 33 T 2 MCPVI pdbhh F Eukaryota T 6hne 1 A A GLY-LEU-PHE-ASP-ILE-VAL-LYS-LYS-VAL-LEU-LYS-LEU-LEU-LYS-NHE GLFDIVKKVLKLLKX 15 T 0.00042 Antimicrobial20 pdbhh F T 6hnh 1 A A LYS-LEU-LEU-LYS-LEU-LEU-LYS-LYS-VAL-VAL-GLY-ALA-LEU-GLY-NHE KLLKLLKKVVGALGX 15 T 1.9 Antimicrobial20 pdbhh F T 6hoi 2 C,D F,G BECN1_HUMAN COILED-COIL MYOSIN-LIKE BCL2-INTERACTING PROTEIN,PROTEIN GT197 SANSFTLIGE 10 T 0.53 Stm1_N pdbhh F Eukaryota T 6hol 2 C,D C,D BAKOR_HUMAN BARKOR,AUTOPHAGY-RELATED PROTEIN 14-LIKE PROTEIN,ATG14L TDLGTDWENLPSPRF 15 T 0.4 Rop-like pdbhh F Eukaryota T 6hom 2 B,D B,D M9PCL9_DROME CCR4-NOT transcription complex subunit 4, isoform L DDDLGFDPFVETQKGLAELMENEVVQ 26 T 0.69 DUF6021 pdbhh F Eukaryota T 6hpg 2 G,H,I,J,K,L a,b,c,d,e,f HS904_ARATH ATHSP90-4,HEAT SHOCK PROTEIN 81-4,HSP81-4 GSKMEEVD 8 T 8 TMEM191C pdbhh F Eukaryota T 6hq6 1 A,B A,B Bacterial beta-1,3-oligosaccharide phosphorylase MGSSHHHHHHSSGLVPAGSMSQSPNTLANEETTSIDKSITMDMVSMNGEMFYKIANNDAMRPFFMTIVSDSNHWMFVSSNGGLTAGRKNAEYALFPYYTDDKITESADITGSKSIFQIQYNNELIVWEPFSERFTNKFKITRNLYKNYYGNKIIFEEINEDLGLTYRYQWCSSNQFGFVRKSELSNHSKNVYEISLLDGIQNIMPYGVSSDLQSSTSNLVDAYKRSELHPKSGLGIFALSAIIVDKAEPSEALKANIAWSLGLNNPKYLVSSLQLNHFRNGKSISPEDDIKGEKGAYFLNTVMTLEANTQKEWMIIANVNQDHSDIIAITETIQNNKKIAEDINTDIELGTKRLIELNASSDALQLTADNLRDTRHFSNTLFNIMRGGIFDNNYQIEKGDFSNYIKKANKLVFDKIDLNALGEIFSLNDLNEFASKQKDVDFDRLALEYLPLKFSRRHGDPSRPWNKFSINTQSEIDGSKVLDYEGNWRDIFQNWEALAHSFPNFIDSMIHKFLNASTFDGYNPYRVTKEGFDWETIEEDDPWSYIGYWGDHQIIYLLKFLEFIEKHQPGKLHSYFESECFVYAAVPYTIKPYEEILNNPKDTIGYNHEWEKVINERKKSIGADGALLKSNDKSIYHVNFIEKILATVLAKMSNFIPEAGIWLNTQRPEWNDANNALVGNGVSMVTLYYLRRFLKFFDQLLENSTLENIKISNEMVEFYHKVRETLMENQHLLAGSISDTDRKVILDKLGNAAADYRFQIYNSGFWGKKRTHSMQGLKNFTKVSLQFIDHSIKANQRPDKLYHAYNLMSVEKNKEIAISYLSEMLEGQVAVLSSGFLSSKENLAVLDGLKNSALFREDQYSYLLYPNKELPKFLDKNTISKEAVSKSELLSLLVSKSNKQVIEKDSIGEYHFNGEFNNASNLKQALEDLSQQNEYKDLVAKESKTVEAIFEDVFNHKAFTGRSGTFYGYEGLGSIYWHMVSKLQLAVLECCLKAVEEKESEEVIGRLLEHYYEINEGIGVHKSPSLYGAFPTDAYSHTPAGKGAQQPGMTGQVKEDILSRFGELGIFVKNGCLELNPCLLRKDEFLKEAKTFDYVTVNFQHQSLELVEKSLAFTYCQIPIIYKIANQKCIEVFTNDGKSAKAASLILDKQTSQDVFGRTGIINKIEVSILESDLR 1175 T 0.18 Glucodextran_N pdbhh F T 6hqc 1 A A TAPA_BACSU BIOFILM ASSEMBLY ACCESSORY PROTEIN TAPA DQSDLHISDQTDTKGTVCSPFALFAVLENTGEKLKKSKWKWELHKLENARKPLKDGNVIEKGFVSNQIGDSLYKIETKKKMKPGIYAFKVYKPAGYPANGSTFEWSEPMRLAKCDE 116 T 0.002 Herpes_PAP unp F Bacteria T 6hqe 1 A,AA,B,BA,C,CA,D,DA,E,EA,F,FA,G,GA,H,HA,I,IA,J,JA,K,KA,L,LA,M,MA,N,NA,O,OA,P,PA,Q,QA,R,RA,S,SA,T,TA,U,UA,V,VA,W,WA,X,XA,Y,YA,Z,ZA A,a,B,b,C,c,D,d,E,e,F,f,G,g,H,h,I,i,J,j,K,k,L,l,M,m,N,n,O,o,P,p,Q,q,R,r,S,s,T,t,U,u,V,v,W,w,X,x,Y,y,Z,z peptide LRV_M3delta1 PEALERLAADPDREVRAAVARRL 23 T 0.017 LRV pdb F T 6hrr 1 A,B A,B MCLN2_HUMAN TRANSIENT RECEPTOR POTENTIAL CHANNEL MUCOLIPIN 2,TRPML2 AFKEDNTVAFKHLFLKGYSGTDEDDYSCSVYTQEDAYESIFFAINQYHQLKDITLGTLGYGENEDNRIGLKVCKQHYKKGTMFPSNETLNIDNDVELDCVQLDLQDLSKKPPDWKNSSFFRLEFYRLLQVEISFHLKGIDLQTIHSRELPDCYVFQNTIIFDNKAHSGKIKIYFDSDAKIEECKDLNIFGS 191 T 0.79 Baseplate pdb F Eukaryota T 6hrs 1 A,B,C,D,E,F,G,H A,B,C,D,E,F,G,H MCLN2_HUMAN TRANSIENT RECEPTOR POTENTIAL CHANNEL MUCOLIPIN 2,TRPML2 SNQLVVAFKEDNTVAFKHLFLKGYSGTDEDDYSCSVYTQEDAYESIFFAINQYHQLKDITLGTLGYGENEDNRIGLKVCKQHYKKGTMFPSNETLNIDNDVELDCVQLDLQDLSKKPPDWKNSSFFRLEFYRLLQVEISFHLKGIDLQTIHSRELPDCYVFQNTIIFDNKAHSGKIKIYFDSDAKIEECKDLNIFGSTQ 199 T 0.8 Baseplate pdb F Eukaryota T 6hu9 22 FA,RA l,x YD19A_YEAST Cox26 MFFSQVLRSSARAAPIKRYTGGRIGESWVITEGRRLIPEIFQWSAVLSVCLGWPGAVYFFSKARKA 66 T 0.077 Phage_holin_Dp1 unppercent F Eukaryota T 6hua 2 C,D C,D XIP signaling peptide VPFFMIYY 8 T 2 Toxin_10 pdbhh F T 6hum 8 H P Proton-translocating NADH-quinone dehydrogenase subunit P NdhP AVISVKPILLAMTPVFILLCLFFGTRNGFYDTDQYHGNAAAH 42 T 0.13 MLANA pdbpssm F T 6hum 18 R Q Proton-translocating NADH-quinone dehydrogenase subunit Q NdhQ ATDFRAIMKFDGADSPAMIAISAVLILGFIAGLIWWALH 39 T 1.4 Rax2 pdbhh F T 6hv6 1 A A PATOX_PHOAA PHOTORHABDUS ASYMBIOTICA TOXIN,PATOX MGSSHHHHHHSSGLVPRGSHMASMTGGQQMGRGSKDTAWEFHTDVGLKGGGLKDFIDRFTKEPKEITISGYKFKRIKYNQENFDTMQRMALDYAYNPDSKGKIAQAQQAYKTGKEDYNAPQYDNFNGLSLDKKIERYISPDTDATTKGVLAGKMNESIKDINAFQTAKDAQSWKKSANKANKVVLTPQNLYLKGKPSEALPESVLMGWALQSSQDAKLSKMLMGIYSSNDITSNPLYKSLKELHANGNASKFNASATSISNINVSNLATSETKLFPTEISSVRVDAPKHTMLISKIKNRENKIKYVFYDPNYGMAYFDKHSDMAAFFQKKMQQYDFPDDSVSFHPLDYSNVSDIKISGRNLNEIIDGEIPLLYKQEGVQLEGITPRDGIYRVPPKNTLGVQETKHYIIVNNDIYQVEWDQTNNTWRVFDPSNTNRSRPTVPVKQDTNGVDKLAAALEHHHHHH 463 T 0.00014 Peptidase_C58 pdb F Bacteria T 6hvo 2 D,E,F D,F,E DPOD4_HUMAN DNA POLYMERASE DELTA SUBUNIT P12 MGRKRLITDSYPVVKRREG 19 T 0.012 Adeno_terminal unppssm F Eukaryota T 6hxt 1 A,B,C A,B,C CCD61_HUMAN Coiled-coil domain-containing protein 61 GSMDQPAGLQVDYVFRGVEHAVRVMVSGQVLELEVEDRMTADQWRGEFDAGFIEDLTHKTGNFKQFNIFCHMLESALTQSSESVTLDLLTYTDLESLRNRKMGGRPGSLAPRSAQLNSKRYLILIYSVEFDRIHYPLPLPYQGKP 145 T 0.01 XRCC4 unphh F Eukaryota T 6hxv 1 A,B A,B CCD61_DANRE Coiled-coil domain-containing protein 61 GPHMEVGTVVQEEMKFRGSEFAVKVEMAERLLIVEISDVVTADQWRGEFGPAYIEDLTRKTGNFKQFPVFCSMLESAVHKSSDSVTLDLLTYSDLELLRNRKAGVVGRPRAQPQSPALSAKRYLILIYTVEEARIHYPLPLPYLGKPDPAELQKEIRALRSELKTLGLRGD 171 T 0.00045 XRCC4 unphh F Eukaryota T 6hxy 1 A,B A,B CCD61_DANRE Coiled-coil domain-containing protein 61 GGSMEVGTVVQEEMKFRGSEFAVKVEMAERLLIVEISDVVTADQWRGEFGPAYIEDLTRKTGNFKQFPVFCSMLESAVHKSSDSVTLDLLTYSDLELLRNRKAGVVGRPRAQPQSPALSAKRYLILIYTVEFDRIHYPLPLPYLGKPDPAELQKEIRALRSELKTLGLRGDHK 173 T 0.00045 XRCC4 unphh F Eukaryota T 6hy2 2 B A TRP-MET-LEU-ASP-PRO-ILE-ALA-GLY-LYS-TRP-SER-ARG WMLDPIAGKWSR 12 T 0.055 FBPase pdbhh F T 6hyd 1 A A MDN1_YEAST DYNEIN-RELATED AAA-ATPASE REA1,MIDAS-CONTAINING PROTEIN,RIBOSOME EXPORT/ASSEMBLY PROTEIN 1,DYNEIN-RELATED AAA-ATPASE REA1,MIDAS-CONTAINING PROTEIN,RIBOSOME EXPORT/ASSEMBLY PROTEIN 1,DYNEIN-RELATED AAA-ATPASE REA1,MIDAS-CONTAINING PROTEIN,RIBOSOME EXPORT/ASSEMBLY PROTEIN 1 PIEESLAAVIPISHLGEVGKWANNVLNCTEYSEKKIAERLYVFITFLTDMGVLEKINNLYKPANLKFQKALGLHDKQLTEETVSLTLNEYVLPTVSKYSDKIKSPESLYLLSSLRLLLNSLNALKLINEKSTHGKIDELTYIELSAAAFNGRHLKNIPRIPIFCILYNILTVMSENLKTESLFCGSNQYQYYWDLLVIVIAALETAVTKDEARLRVYKELIDSWIASVKSKSDIEITPFLNINLEFTDVLQLSRGHSITLLWDIFRKNYPTTSNSWLAFEKLINLSEKFDKVRLLQFSESYNSIKDLMDVFRLLNDDVLNNKLSEFNLLLSKLEDGINELELISNKFLNKRKHYFADEFDNLIRYTFSVDTAELIKELAPASSLATQKLTKLITNKYNYPPIFDVLWTEKNAKLTSFTSTIFSSQFLEDVVRKSNNLKSFSGNQIKQSISDAELLLSSTIKCSPNLLKSQMEYYKNMLLSWLRKVIDIHVGGDCLKLTLKELCSLIEEKTASETRVTFAEYIFPALDLAESSKSLEELGEAWITFGTGLLLLFVPDSPYDPAIHDYVLYDLFLKTKTFSQNLMKSWRNVRKVISGDEEIFTEKLINTISDDDAPQSPRVYRTGMSIDSLFDEWMAFLSSTMSSRQIKELVSSYKCNSDQSDRRLEMLQQNSAHFLNRLESGYSKFADLNDILAGYIYSINFGFDLLKLQKSKDRASFQISPLWSMDPINISCAENVLSAYHELSRFFKKGDMEDTSIEKVLMYFLTLFKFHKRDTNLLEIFEAALYTLYSRWSVRRFRQEQEENEKSNMFKFNDNSDDYEADFRKLFPDYEDTALVTNEKDISSPENLDDIYFKLADTYISVFDKDHDANFSSELKSGAIITTILSEDLKNTRIEELKSGSLSAVINTLDAETQSFKNTEVFGNIDFYHDFSIPEFQKAGDIIETVLKSVLKLLKQWPEHATLKELYRVSQEFLNYPIKTPLARQLQKIEQIYTYLAEWEKYASSEVSLNNTVKLITDLIVSWRKLELRTWKGLFNSEDAKTRKSIGKWWFYLYESIVISNFVSEKKETAPNATLLVSSLNLFFSKSTLGEFNARLDLVKAFYKHIQLIGLRSSKIAGLLHNTIKFYYQFKPLIDERITNGKKSLEKEIDDIILLASWKDVNVDALKQSSRKSHNNLYKIVRKYRDLLNGDAKTIIEAGLLYXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXRNIDTVASNMDSYLEKISSQEFPNFADLASDFYAEAERLRKETPNVYTKENKKRLAYLKTQKSKLLGDALKELRRIGLKVNFREDIQKVQSSTTTILANIAPFNNEYLNSSDAFFFKILDLLPKLRSAASNPSDDIPVAAIERGMALAQSLMFSLITVRHPLSEFTNDYCKINGMMLDLEHFTCLKGDIVHSSLKANVDNVRLFEKWLPSLLDYAAQTLSVISKYSATSEQQKILLDAKSTLSSFFVHFNSSRIFDSSFIESYSRFELFINELLKKLENAKETGNAFVFDIIIEWIKANKGGPIKKEQKRGPSVEDVEQAFRRTFTSIILSFQKVIGDGIESISETDDNWLSASFKKVMVNVKLLRSSVVSKNIETALSLLKDFDFTTTESIYVKSVISFTLPVITRYYNAMTVVLERSRIYYTNTSRGMYILSTILHSLAKN 1676 T 0.0088 SseC pdbpssm F Eukaryota T 6i1j 1 A A A helical peptide containing a trinuclear Cu(II) center: HisAD GEIAAIKQEIAAHKKEHAAIKWEIAAIKQGYG 32 T 0.42 DUF5320 pdbhh F T 6i1m 1 A A A0A2H1BUS1_FASHE Cystatin VGGYTEPRSVTPEERSVFQPMILSKLLTAGSVVSSCELELLQVSTQVVAGTNYKFKVSGGATCPGCWEVVVFVPLYSSKSATSVGTPTRVSCT 93 T 2.1E-05 Cystatin pdbhh F Eukaryota T 6i2g 2 B B N7P-SER-ARG-LEU-GLU-GLU-GLU-LEU-ARG-ARG-ARG-LEU-THR-GLU-LPD XSRLEEELRRRLTEX 15 T 5.2 TF_AP-2 pdbhh F T 6i31 1 A,B A,B EVA3_RHISA Evasin-3 LVSTIESRTSGDGADNFDVVSCNKNCTSGQNECPEGCFCGLLGQNKKGHCYKIIGNLSGEPPVVRR 66 T 0.059 Toxin_11 unppercent F Eukaryota T 6i4x 4 D D Erythropoietin receptor ASFEXTILDPS 11 T 7.1 SmpB pdbhh F T 6i56 1 A,B,C,D,E D,A,C,B,E XEPA_BACSU PROTEIN XKDY MVKYQYEFPLDKAGKAGAVKPYRGGKNDFVTPVSNLSGVAEILTNAALKATEAYSQLGQDRLGAVLISKVKGWAYADREGTLFIEESDNNNVWTTTAAVNVAAGVLTATDWVYLSKRYYRFRYVNGNLQQSEFVLYQSVGAGEMDVRVNEKTPLQIDFAENQTHDGRLKVEARKTFDFVFHENAESASEGAALPVDGAAHLLVEVYGTAEMSEVKFWGKSVSGQKLPIRGVKTDDATTASSTLGKAEAWAFDIKGFKEIIMEIISITGGTLSVKGTAVS 279 T 0.72 DUF6385 pdbhh F Bacteria T 6i5j 5 G,H,I,J I,J,K,L Growth hormone receptor peptide PVPDXTSIHIX 11 T 0.00021 GHBP pdbhh F T 6i5o 1 A,B,C,D,E A,B,C,D,E YOMS_BACSU SPBc2 prophage-derived uncharacterized protein YomS MTETTENVVITIPDKTSFTFHEAATSPSEGEEFVVGHFRELTVKISGSSTSREIKFYAVDENGEKTALSGTNKTDFQLGSSTLNTNEYWDFDIAGLFKVMFEVVSVTGDVTVKGIVVS 118 T 0.84 DUF4251 pdbhh F Bacteria T 6i9e 2 H,I,J,K,L,M,N H,I,J,K,L,M,N A7XXC1_9CAUD Auxiliary protein MDKVKLFQTIGRVEYWERVPRLHAYGVFALPFPMDPDVNWAQWFTGPHPRAFLVSIHKYGPKAGHVYPTNLTDEDALLNVIGMVLDGHDYENDPNVTVTLKAAVPIEYVQQDPQAPALQPHQAVLDAAEVLKLKVIKGHYFFDYTR 146 T 0.78 TRI9 pdbpssm T Viruses T 6iac 2 B B Q859I5_9CAUD Lower collar protein MARYTMTLYDFIKSELIKKGFNEFVNDNKLTFYDDEFQFMQKMLKFDKDVLAIVNEKVFKGFSLKDELSDLLFKKSFTIHFLDREINRQTVEAFGMQVITVCITHEDYLNVVYSSSEVEKYLQSQGFTEHNEDTTSNTDETSNQNATSLDNSTGMTANRNAYVSLPQSEVNIDVDNTTLRFADNNTIDNGKTVNKSSNESNQNAKRNQNQKGNAKGTQFTKQYLIDNIDKAYDLRKKILNEFDKKCFLQIW 251 T 0.12 AKAP95 pdb T Viruses T 6iac 4 F,G,H,I,J,K G,H,I,J,K,L Q859I1_9CAUD inner core protein MTEFDEIVKPDDKEETSESTEENLESTEETSESTEESTEESTEESTEDKTVETIEEENENKLEPTTTDEDSSKFDPVVLEQRIASLEQQVTTFLSSQMQQPQQVQQTQSDVTESNKEDNDYSDEELVDKLDLD 133 T 0.0095 TolA_bind_tri pdb T Viruses T 6iai 1 A,B,C,D A,B,C,D Q8Z7T2_SALTI STOD MGSSHHHHHHSSGLVPRGSHMFLTFPNVAITRDNRIDKLSENDLELIRDTAIQNGGRKIQVQLRDLLYEVSNRAVEGDNNTFKVSFSTTDRAMFRERHIEWQGNAIRLERQLNTGLNVSRG 121 T 0.029 Calici_MSP pdb F Bacteria T 6iam 2 B B SER-ALA-ARG-ALA-XY5-VAL-HIS-LEU-ARG-LYS-SER-ALA SARAXVHLRKSA 12 T 22 Peptidase_S31 pdbhh F T 6iat 1 A,B,C,D C,A,B,D Q859I3_9CAUD Major head protein MAQQSTKNETALLVAKSAKSALQDFNHDYSKSWTFGDKWDNSNTMFETFVNKYLFPKINETLLIDIALGNRFNWLAKEQDFIGQYSEEYVIMDTVPINMDLSKNEELMLKRNYPRMATKLYGNGIVKKQKFTLNNNDTRFNFQTLADATNYALGVYKKKISDINVLEEKEMRAMLVDYSLNQLSETNVRKATSKEDLASKVFEAILNLQNNSAKYNEVHRASGGAIGQYTTVSKLKDIVILTTDSLKSYLLDTKIANTFQIAGIDFTDHVISFDDLGGVFKVTKEFKLQNQDSIDFLRAYGDYQSQLGDTIPVGAVFTYDVSKLKEFTGNVEEIKPKSDLYAFILDINSIKYKRYTKGMLKPPFHNPEFDEVTHWIHYYSFKAISPFFNKILITDQDVNPKPEEELQE 408 T 12 ER pdbhh T Viruses T 6iat 2 E,F,G,H E,F,G,H Q859I2_9CAUD Arstotzka protein MYEGNNMRSMMGTSYEDSRLNKRTELNENMSIDTNKSEDSYGVQIHSLSKQSFTGDVEEE 60 T 0.048 DUF4958 pdb T Viruses T 6ibh 1 A,B A,B A0A4P9I8G4_9AGAM Auxiliary activity CAZyme HFQLQWPGARGAFVANDEVYFCGAHNNVTTNRTDFPLDGSGFVSIKSGHAPYTVGAIISLETDADAWEDFKNSSGGDQIAIAYRQVDNSGTYCVPFNPSSLNIAGIQDGANATIQVVYTGGDGNLYQCADVTFRTTVANLNSSVCTNSTHHHHHH 155 T 0.64 Big_1 unp F Eukaryota T 6idx 2 B C AGRB1_MOUSE BRAIN-SPECIFIC ANGIOGENESIS INHIBITOR 1 RKSRYAELDFEKIMHTRKRHQDMFQ 25 T 7.7 YppG pdbhh F Eukaryota T 6ieh 2 B A NRDE2_HUMAN Protein NRDE2 homolog SFRTDKKPDPANWEYKSLYRGDIARYKRKGDSCLGINPKKQCISWEGTSTEKKHSRKQVERYFTKKSVGLMNIDGVAISSKTEPPSSEPISFIPVKDLEDAAPVT 105 T 0.043 DUF1283 pdbpercent F Eukaryota T 6ifo 3 E,F F,E AcrIIA2 MTLTRAQKKYAEAMHEFINMVDDFEESTPDFAKEVLHDSDYVVITKNEKYAVALCSLSTDECEYDTNLYLDEKLVDYSTVDVNGVTYYINIVETNDIDDLEIATDEDEMKSGNQEIILKSELK 123 T 0.13 DUF6376 pdb F T 6iha 1 A A A0A0U2QEK1_9DIPT SibaCec-A KINKQKIKNGAKKALGVASKVAPVVAAFAR 30 T 1.3E-05 Cecropin unphh F Eukaryota T 6ilc 3 C C HEV-1 DFANTFLP 8 T 0.0062 AAA_23 unppercent F T 6ilg 3 C C PHOSP_HENDH HEV-1-P8L DFANTFLL 8 T 0.0062 AAA_23 unppercent T Viruses T 6ilu 1 A,B A,B A0A218KCJ1_9CAUD Lysin GETAPVSEPEGIGVALSIYPDGYGVNLYERPSDPIYAGNITKKIPYKVFAGYWGGGDKDMICLGGEKQWAYNKHFTIDWYKVRSKYPVGWGVNFYDGPSGNFLGNIDGSEVYNAHNRVGGYVDIGGNRWIKEEHVTITAK 140 T 41 Mastoparan_2 pdbhh T Viruses T 6img 1 A A (ACE)-GLY-CYS-PRO-CYS-ILE-TRP-PRO-GLU-LEU-CYS-PRO-TRP-ILE-ARG-SER-CYS-(NH2) XGCPCIWPELCPWIRSCX 18 T 1.8 Toxin_26 pdbhh F T 6imh 1 A A (ACE)-GLY-CYS-PRO-CYS-GLU-PRO-SER-TYR-LEU-CYS-PRO-TRP-LEU-PRO-GLY-CYS-(NH2) XGCPCEPSYLCPWLPGCX 18 T 0.13 FOXP-CC pdbhh F T 6imu 1 A,B A,B A0A4V8H012_TALFU Endo-beta-1,2-glucanase AGIHHHHHHSSEPSCRFAHQYTQEQVLQNPSKFINDVLFWEGKFHQNNISYNSGNGMSYDGTNIDWVTGEGTVKHPFSAASKESLQVMLYAHAIAGSADAARFLSPNNPSAAPGIAASIMDTKLQTYLRFNETYPGFGGFLPWFTSSSQDLTPTWDWNNRVPGLDNGELLWAVYAFIQAAENTSNKSFIDLAKKWQTWMDYTKTTAAHIFYQGEGKVCAVTDIKNQSLPVYHPEQTYACEGTSYLNDPYEGELFTWWLQFFGGLSDADIEALWEYKRPQLVSVDYHIGNVGPITVQKGYWFSSHETWKVLEMPYYDIDIIRRVFQNAERARTCNSVVTQVPGMFASINNVTDPATGDVVGYISNAGIPSIANQTIQELDVITPYSVFPTVLFDKGVGMAWWRNMAIGKKMQNIYGSTESTRRDGTGVSALLTWDSKVSTVNAILGGVSGLVSQKMKAENIYNTFVERIEAEYSRVFKNLKGEHVPFCLPQETVPDTGLVDFTTCN 505 T 0.19 Glycoamylase pdbhh F Eukaryota T 6imw 1 A,B A,B A0A4V8H013_TALFU Endo-beta-1,2-glucanase AGIHHHHHHSSEPSCRFAHQYTQEQVLQNPSKFINDVLFWEGKFHQNNISYNSGNGMSYDGTNIDWVTGEGTVKHPFSAASKESLQVMLYAHAIAGSADAARFLSPNNPSAAPGIAASIMDTKLQTYLRFNETYPGFGGFLPWFTSSSQDLTPTWDWNNRVPGLDNGELLWAVYAFIQAAENTSNKSFIDLAKKWQTWMDYTKTTAAHIFYQGEGKVCAVTDIKNQSLPVYHPEQTYACEGTSYLNDPYQGELFTWWLQFFGGLSDADIEALWEYKRPQLVSVDYHIGNVGPITVQKGYWFSSHETWKVLEMPYYDIDIIRRVFQNAERARTCNSVVTQVPGMFASINNVTDPATGDVVGYISNAGIPSIANQTIQELDVITPYSVFPTVLFDKGVGMAWWRNMAIGKKMQNIYGSTESTRRDGTGVSALLTWDSKVSTVNAILGGVSGLVSQKMKAENIYNTFVERIEAEYSRVFKNLKGEHVPFCLPQETVPDTGLVDFTTCN 505 T 0.16 Glycoamylase unphh F Eukaryota T 6ip5 82 DC zx nascent peptide LSAKKLSSLLTCKYIPP 17 T 2 BLOC1S3 pdbhh F T 6ipv 1 A,B,C,D A,B,C,D A0A5A4PV77_STREX CqsB2 MSQRVPDESGLAQNYVLDRSDLQGLDLVWNENTGMDDMMKLMESKTKETYDHGEIFGQYCSLAEHINVPYDIVFEYAANARSLEEWTYSIRNMKHLGGGLYRADEMIQPNTDIYIRAEAQKGPEHGLVVYPCAWDQGHELWMRYYMTIIDSSKVLDKPGTVVLWTNCKHPYYDRSTENVPDYIAEGRARTDRVWVGDIWPVFHAGHSIEMGNLKRILEHRFGAGKAKLAAALEHHHHHH 239 T 0.0007 Polyketide_cyc2 unppercent F Bacteria T 6iqg 2 C,D C,D 18-mer peptide G(HCS)DCAYHRGELVWCT(HCS)H(NH2) GXDCAYHRGELVWCTXHX 18 T 3.1 CHORD pdbhh F T 6iqh 2 B,D C,D 17-mer peptide (GPDCAYHKGELVWCTFH) GPDCAYHKGELVWCTFH 17 T 0.47 DUF1247 pdbhh F T 6ist 1 A,B,D A,B,D S5MRN1_9CAUD Lysin MFIYYKRTKQGSTEQWFVIGGKRIYLPTMTYVNEANDLIKRYGGNTNVTTYNHDNFGLKMMEAALPQVKV 70 T 0.022 Sial-lect-inser pdb T Viruses T 6itc 5 E B OMPA_ECOLI Translocating peptide MAKKTAIAIAVALAGFATVASYAQYEDGCSGELERQHTFAGGARSISGDGDSPHSYHSG 59 T 1.6999999999999998E-75 OmpA_membrane unp F Bacteria T 6iu7 2 B B TP53B_HUMAN P53BP1 SGKRKLITSEEERSPAKRGRKS 22 T 35 GMAP pdbhh F Eukaryota T 6iua 2 B B TP53B_HUMAN P53BP1 SGKRKLITSEEERDPAKRGRKS 22 T 44 KCTD18_C pdbhh F Eukaryota T 6iui 2 C,D C,D PAXI_HUMAN Paxillin GPGSEFSATRELDELMASLSDFKFMAQG 28 T 2.5 SAM_LFY pdbhh F Eukaryota T 6iv8 1 A,D A,C A0A1C5SD84_9FIRM The selenomethionine (SeMet)-labeled Cas13d MAKKNKMKPRELREAQKKARQLKAAEINNNAAPAIAAMPAAEVIAPVAEKKKSSVKAAGMKSILVSENKMYITSFGKGNSAVLEYEVDNNDYNKTQLSSKDNSNIELGDVNEVNITFSSKHGFGSGVEINTSNPTHRSGESSPVRGDMLGLKSELEKRFFGKTFDDNIHIQLIYNILDIEKILAVYVTNIVYALNNMLGIKDSESYDDFMGYLSARNTYEVFTHPDKSNLSDKVKGNIKKSLSKFNDLLKTKRLGYFGLEEPKTKDTRASEAYKKRVYHMLAIVGQIAQCVFHDKSGAKRFDLYSFINNIDPEYRDTLDYLVEERLKSINKDFIEGNKVNISLLIDMMKGYEADDIIRLYYDFIVLKSQKNLGFSIKKLREKMLEEYGFRFKDKQYDSVRSKMYKLMDFLLFCNYYRNDVAAGEALVRKLRFSMTDDEKEGIYADEAAKLWGKFRNDFENIADHMNGDVIKELGKADMDFDEKILDSEKKNASDLLYFSKMIYMLTYFLDGKEINDLLTTLISKFDNIKEFLKIMKSSAVDVECELTAGYKLFNDSQRITNELFIVKNIASMRKPAASAKLTMFRDALTILGIDDNITDDRISEILKLKEKGKGIHGLRNFITNNVIESSRFVYLIKYANAQKIREVAKNEKVVMFVLGGIPDTQIERYYKSCVEFPDMNSSLEAKRSELARMIKNISFDDFKNVKQQAKGRENVAKERAKAVIGLYLTVMYLLVKNLVNVNARYVIAIHCLERDFGLYKEIIPELASKNLKNDYRILSQTLCELCDDRNESSNLFLKKNKRLRKCVEVDINNADSSMTRKYANCIAHLTVVRELKEYIGDIRTVDSYFSIYHYVMQRCITKRGDDTKQEEKIKYEDDLLKNHGYTKDFVKALNSPFGYNIPRFKNLSIEQLFDRNEYLTEKLEHHHHHH 930 T 0.0023 RB_A pdbpercent F Bacteria T 6iw8 1 A A GOGA2_HUMAN 130 KDA CIS-GOLGI MATRIX PROTEIN,GM130,GM130 AUTOANTIGEN,GOLGIN-95 GPLGSMSEETRQSKLAAAKKKLREYQQRNSPGVPTGAKKKKKIKNGSNPETTT 53 T 0.0086 DUF812 unphh F Eukaryota T 6iwa 1 A A GOGA2_HUMAN 130 KDA CIS-GOLGI MATRIX PROTEIN,GM130,GM130 AUTOANTIGEN,GOLGIN-95 AKKKLREYQQRNSPGVPTGAKKKKKIKN 28 T 0.0086 DUF812 unphh F Eukaryota T 6ixk 1 A,B A,B A0A0H3NK84_SALTS GLYCOSYLTRANSFERASE MIPPLNRYVPALSKNELVKTVTNRDIQFTSFNGKDYPLCFLDEKTPLLFQWFERNPARFGKNDIPIINTEKNPYLNNIIKAATIEKERLIGIFVDGDFFPGQKDAFSKLEYDYENIKVIYRNDIDFSMYDKKLSEIYMENISKQESMPEEKRDCHLLQLLKKELSDIQEGNDSLIKSYLLDKGHGWADFYRNMAMLKAGQLFLEADKVGCYDLSTNSGCIYLDADMIITEKLGGIYIPDGIAVHVERIDGRASMENGIIAVDRNNHPALLAGLEIMHTKFDADPYSDGVCNGIRKHFNYSLNEDYNSFCDFIEFKHDNIIMNTSQFTQSSWARHVQ 336 T 2.2E-05 Glyco_transf_88 pdbhh F Bacteria T 6ixp 2 B,C,E,F B,C,E,F MMR1_YEAST MMR1 GPGSEFGNSARIPCPKTRLARVSVLDLKKIEEQPDSSSG 39 T 0.024 DUF2080 pdb F Eukaryota T 6ixq 2 B B SMY1_YEAST SUPPRESSOR PROTEIN SMY1 GPGSSSSSIATTGSQESFVARPFKKGLNLHSIKVTSSTPKGSENLYFQ 48 T 7.7 CCDC85 pdbhh F Eukaryota T 6ixr 2 B B INP2_YEAST INP2 SGSGSGSGSGSEFNHGFHLDILKGRK 26 T 0.016 Serglycin pdb F Eukaryota T 6j03 1 A A V5TER4_9CYAN AMBU4 SAVSIPINNAGFENPFMDVVDDYTIDTPPGWTTYDPNNLVPEKRTTWTSNNGVGYVGPGTQFYNQLAPEGRNIGYIYLSQNPGSGVAGFEQCLDATLEPDTKYTLTVDVGALAGTFKGLSFAGFPGYRVELLAGDTVLAADHNNLFIKEGEFKTSTVTYTSTAKDLHLGQKLGIRLVNLLQDKFSGLDFDNVRLTTEPTE 200 T 0.22 CBM_4_9 pdbpercent F Bacteria T 6j07 2 B B TERB1_HUMAN COILED-COIL DOMAIN-CONTAINING PROTEIN 79 YRCSGCIAVEKSLNSRNFSKLLHSCPYQCDRHKVIVEAEDRYKSELRKSLICNKKILLTP 60 T 5.7 WCCH pdbhh F Eukaryota T 6j0n 3 M,N,O,P,Q,R P,Q,R,S,T,U B6VNN3_PHOAA Pvc12 MSNQDALFHSVKDDIHFDTLLEQAHQVIEKQAEKLWSDTAEHDPGITFLQGISYGVSDLAYRHTLPLKDLLTPAPDEQQQEGIFPAEFGPHNTLTCGPVTADDYRKALLDLHSSDSLDGTQQDEGDFLFRSVQLVREPEKQRYTYWYDATKREYSFVNSEGAKEFTLRGNYWLYLEPTRWTQGNIAAATRQLTEFLTKNRNIGESVSNIIWLQPVDLPLLLDVELDDDVGAQDVPGIFAAVYSTAEQYLMPGAQRYRTEVLQNAGMSNDQIFEGPLLEHGWIPELPAARDYTQRLTLNLSRLVNSLLEIEGIKHVNRLRLDDSFDKTAIEPVKGDTWSWSIKEGYYPRLWGEDPLNQLAQQNGPLRVIAKGGISVSVSKEQIQASLPSQSLIQNEPVILAYGQHRDVGSYYPVSDTLPPCYGLQHSLSESEHLLPLHQFMLPFEQLLACGCQQIAMLPRLLAFQREGYEVWGDQWPFKSGSVNDDAHQDYAPALKDLLGQIALDSDHELDIINYLLGYFGTQRAPRTFTTQLDDFRAVQQGYLAQQPTLTYHRSNIRIDQVSSLQKRIAARMGLGGELFKPQPDLSQLPFYLIEHRALLPVKPNSQFDKEQKPASVTEEGGSQTGQHYVVIEQKGIDGKLTQGQVINLILYEGEQGETQFTIRGQMVFKTEGDKFWLDVNNSAQLEYNLARVMTAAKASKLFWQNSPVWMEDMGYRLAYASDQSSLPVNQRRLTRTVQTPFPPMVVVGSEITLLKQVGIVNLKKAESEKLYAKVVSFDRIEGTLIIERLGNSTLAFPTSEEAWRYSWYFSGEKYERTDRFSFVISVVVNSDLIKLPGVDPYKLEEWVKETILTEFPAHISMIIHWMDREAFLNFANTYQRWQNNGTPLGDAAYSILESLTLGKLPSALKGVGTMRIATSSQREEVVGSNGDQWNTDGITQNELFYVPKES 950 T 0.016 DUF276 pdbhh F Bacteria T 6j0x 2 B,D,F,H E,F,G,H MMS22_YEAST METHYL METHANESULFONATE-SENSITIVITY PROTEIN 22,SYNTHETICALLY LETHAL WITH MCM10 PROTEIN 2 SIIYEPEFNENYLWAE 16 T 1.9 Pept_S41_N pdbhh F Eukaryota T 6j0y 2 B,D C,D SLX4_YEAST Peptide from Structure-specific endonuclease subunit SLX4 GPLGSGSSIRVKLLQESVVKLNPKLVKHNFYRVEANDSEEEETEFDDQFCIADIQLVD 58 T 0.063 RNA_pol_Rpo13 pdbpssm F Eukaryota T 6j3q 2 N,O,P,Q,R,S,T,U,V,W,X,Y,Z 0,3,1,4,2,6,5,7,b,8,c,9,d A0A4Y5TPY8_9CAUD cement protein MPLVYTPAVRGGANPASGSYLLDPQYVNSGVDILQATYGYNINGTANADQLLQRDAILAILEYALKDTAFVNAIQAVAAGSGVTTPASFVSACVTKLTA 99 T 0.14 PilW pdb T Viruses T 6j4v 3 C C TBA1B_HUMAN ALPHA-TUBULIN UBIQUITOUS,TUBULIN K-ALPHA-1,TUBULIN ALPHA-UBIQUITOUS CHAIN DYEEVGVDSVEGEGEEEGECY 21 T 28 Hrs_helical unphh F Eukaryota T 6j56 2 B,D C,D TOM1_HUMAN Peptide from Target of Myb protein 1 GVTSEGKFDKFLEERAKAADRLPNLSS 27 T 0.72 FUSC unp F Eukaryota T 6j68 2 C,D C,D LATS1_MOUSE LARGE TUMOR SUPPRESSOR HOMOLOG 1,WARTS PROTEIN KINASE GPGSVAEAPSYQGPPPPYPKHLLHQNPS 28 T 9.1 Nt_Gln_amidase pdbhh F Eukaryota T 6j7v 1 A A H9ABP6_9VIRU VP5 IAPLVGYAIGAAAISAVGGIGVGWTLREFEVVGSDDPAEGLTPDVLRNQLSDSVVKRKSNNQSTMVDNQNILDGVEHTAYTEAKIAAIEELNAGSSESAVLSAANSAIDSYETTVRTNFYKSWNETVRELEAMTQTVIAHADVGLSYITDFGDPRFGNLASGTSPNTLKDTTVSMPDGTNFTLLTFRHNTGWDSGNAAYSVVEYNPKEVVTSTNSNTYNTVDGTQYMKFSEWNAVETEMDTVFQNVRNGISTWVTNVYGDVQSGAIEISDLVTPRERATMMAQEEGMSQAIADLIALNVPVDAEREATITIQDTGATLPGTFALTDSSDGPLSAGQTYDPSTFSGDVYFTADMSLVEGPWDAINSGVDGGTITITSEPYEGTAIEVTTVESETVSVPAADWTDNGDGTWSYDASGDLETTITNVDSARFVSTATETTYDTLQLKGAFTVDKLVNKQSGEEVSSTSFTSSEPQTDSNYITQDEWDQLEQQNKELIEKYEQSQSGGGLDLGGLDMFGVPGEMVAVGAAAVIGFLMLGNN 537 T 0.062 B56 pdbpssm T Viruses T 6j8e 3 C D CM3A_CONKI Mu-conotoxin KIIIA CCNCSSKWCRDHSRCC 16 T 0.49 C5HCH pdbhh F Eukaryota T 6j9e 8 I J Q8LTJ5_9CAUD RNA POLYMERASE INHIBITOR P7 GAMAMNEFTQISGYVNAFGSQRGSVLTVKVENDEGWTLVEEDFDRADYGSDPEFVAEVSSYLKRNGGIKDLTKVLTR 77 T 0.18 DUF1494 unp T Viruses T 6j9k 1 A A A0A425B3G2_NEIME AcrIIC2 SMASKNNIFNKYPTIIHGEARGENDEFVVHTRYPRFLARKSFDDNFTGEMPAKPVNGELGQIGEPRRLAYDSRLGLWLSDFIMLDNNKPKNMEDWLGQLKAACDRIAADDLMLNEDAADLEGWDD 125 T 0.029 SfsA_N pdbpercent F Bacteria T 6j9l 2 C E CAS9_FRATN HNH endonuclease family protein SKDSYTLLMNNRTARRHQRRGIDRKQLVKRLFKLIWTEQLNLEWDKD 47 T 5.5 RRXRR unphh F Bacteria T 6j9m 1 A,F A,F CAS9_NEIM8 CRISPR-associated endonuclease Cas9 SVPKTGDSLAMARRLARSVRRLTRRRAHRLLRTRRLLKREGVLQAANFDENGLIKSLPNTPWQLRAAALDRKLT 74 T 0.00058 Cas9-BH pdbhh F Bacteria T 6j9n 2 B B A0A425B395_NEIME AcrIIC3 SMAFKRAIIFTSFNGFEKVSRTEKRRLAKIINARVSIIDEYLRAKDTNASLDGQYRAFLFNDESPAMTEFLAKLKAFAESCTGISIDAWEIEESEYVRLPVERRDFLAAANGKEIFKI 118 T 1.8 Iron_traffic unphh F Bacteria T 6jcu 2 B,D B,D COBL_MOUSE Peptide from Protein cordon-bleu SLHSALMEAIRSSGGREKLRKV 22 T 0.00025 WH2 unppercent F Eukaryota T 6jd7 1 A,B,C A,B,C AcrIIC2 SMSKNNIFNKYPTIIHGEARGENDEFVVHTRYPRFLARKSFDDNFTGEMPAKPVNGELGQIGEPRRLAYDSRLGLWLSDFIMLDNNKPKNMEDWLGQLKAACDRIAADDLMLNEDAADLEGWDD 124 T 0.028 SfsA_N pdbpercent F T 6jdj 2 C C CAS9_NEIM8 CRISPR-associated endonuclease Cas9 SMAAFKPNSINYILGLDIGIASVGWAMVEIDEEENPIRLIDLGVRVFERAEVPKTGDSLAMARRLARSVRRLTRRRAH 78 T 0.00011 Pox_A22 pdbhh F Bacteria T 6je4 5 E,J,S,T I,J,S,T A0A425B395_NEIME AcrIIC3 SMFKRAIIFTSFNGFEKVSRTEKRRLAKIINARVSIIDEYLRAKDTNASLDGQYRAFLFNDESPAMTEFLAKLKAFAESCTGISIDAWEIEESEYVRLPVERRDFLAAANGKEIFKI 117 T 1.8 Iron_traffic unphh F Bacteria T 6jhc 1 A A A0A0H5BR52_9MYRI Hydroxynitrile lyase LTCDQLPKAAINPIQEFIDSNPLEFEYVLTETFECTTRIYVQPARWSTTKAPTALDIKGTQIMAYDFVGGPENSAHLNECHTGDKQVWYFQYTNLLTDNGSSYCAYRCNGTEIIEYKCASNNNGTDPLQHQAMEVAKTVPNGDKIHYAKSNCPETHGCFAFY 162 T 29 RNR_inhib pdbhh F Eukaryota T 6jhv 1 A,B A,B A0A425B395_NEIME AcrIIC3 MMFKRAIIFTSFNGFEKVSRTEKRRLAKIINARVSIIDEYLRAKDTNASLDGQYRAFLFNDESPAMTEFLAKLKAFAESCTGISIDAWEIEESEYVRLPVERRDFLAAANGKEIFKI 117 T 1.8 Iron_traffic unphh F Bacteria T 6jhw 1 A,C A,C A0A425B395_NEIME AcrIIC3 MAFKRAIIFTSFNGFEKVSRTEKRRLAKIINARVSIIDEYLRAKDTNASLDGQYRAFLFNDESPAMTEFLAKLKAFAESCTGISIDAWEIEESEYVRLPVERRDFLAAANGKEIFKI 117 T 1.8 Iron_traffic unphh F Bacteria T 6ji7 1 A A coffeetide EGECSPLGEPCAGNPWGCCPGCICIWQLTDRCVGNC 36 T 0.0084 DUF5637 pdbhh F T 6jja 1 A A Q15BP7_9VIRU Nucleocapsid protein CP17 MNKRINNNRRTMRSRRGRGRTMGSNLIPYANSPVPIPYTPPVTPVTVIGNPRKTTWIDIDLSSEESGIYTLTVGSYRNRITKLGPSKPNFIIEKVAAYAAPGDYKVVLNDFKTGIQVVDEGSYAHRAAAGILYPPAAQMFYGISATGTLNTITTTAKDPVPVVRALVTYWDSEQ 174 T 0.036 ALMS_repeat pdb T Viruses T 6jjw 2 B U PTN14_HUMAN PROTEIN-TYROSINE PHOSPHATASE PEZ GPGSSHRHSAIIVPSYRPTPDYETVMRQMKRG 32 T 8.7 SpoIISB_antitox pdbhh F Eukaryota T 6jjx 2 C,D D,C AMOT_HUMAN Peptide from Angiomotin GPGSGRTEGQLMRYQHPPEYGAARPA 26 T 0.46 DUF6092 pdbhh F Eukaryota T 6jk2 1 A A A0A3B6UEU4_9AGAR Lectin APVPVTKLVCDGDTYKCTAYLDFGDGRWVAQWDTNVFHTG 40 T 0.07 C2-set pdbhh F Eukaryota T 6jky 1 A,D A,D Q5ZTL3_LEGPH MvcA ASLESPGFMVHKKLKSMSQSYGVMMTGVPAEVLGQMQAERSIPSINKTGNLKQQIAKEVSKVCHMMTEPTQSAGQASNDVCELLLGKIEAEKFHFTKYEALSADGDNLKNVLENTAPSSTNLLIRFEIDREDPPIVLVKTKNENFNPETAVKNKIYLLENKLYFIDKMGNLFNLGPGKKKCTQLFNAIGDSAEYSLCDPFVLEEPEKPEDFAISEIVDIFNEQKERFDFWIGSHSFTIYIPQTLGESPRQFYPYQAYFGSHTLQDWFVSDKDEYLSRIGIDKYIEKLAVLGKTTNTKERSDIYAEFFSKRGREAFFCAHLNEKRQPLRVKFKITEINPELALKNLQETQEFIDTHPGENPSDKVENYRNRAKLAMTEHLESLLDI 385 T 0.022 AgrD pdb F Bacteria T 6jle 2 B E MYO3A_HUMAN Myosin-IIIa GSDNKDSKATSEREACGLAIFSKQISKLSEEYFILQKKLNEMILSQQLKS 50 T 15 Uds1 pdbhh F Eukaryota T 6jmu 2 C,D C,D PAXI_MOUSE Paxillin GSGSGSGSGSSATRELDELMASLSDFKMQGLE 32 T 0.094 Serglycin pdb F Eukaryota T 6joz 3 C C ALA-THR-ILE-GLY-THR-ALA-MET-TYR-LYS ATIGTAMYK 9 T 1.6 DUF3362 pdbhh F T 6jpp 1 A A ELMO1_HUMAN PROTEIN CED-12 HOMOLOG GMPPPADIVKVAIEWPGAYPKLMEIDQKKPLSAIIKEVCDGWSLANHEYFALQHADSSNFYITEKNRNEIKNGTILRLTTSPAQNAQQLHERIQSSSMDAKLEALKDLASLSRD 114 T 0.003 FERM_N pdb F Eukaryota T 6jqa 1 A A X5IFG3_ONYPH Phytoplasmal effector causing phyllody 1 MNKDIASASNNNQNITNXSIEENIINLKXKIRKNAVKKINTEREIQQLSNNDPNKNTLLALKQNLENLIHNQKEQLKTYQKLLKTLNDENN 91 T 0.15 DUF3349 unppercent F Bacteria T 6jqa 2 B,C B,C X5IFG3_ONYPH Phytoplasmal effector causing phyllody 1 MNKDIASASNNNQNITNYSIEENIINLKYKIRKNAVKKINTEREIQQLSNNDPNKNTLLALKQNLENLIHNQKEQLKTYQKLLKTLNDENN 91 T 0.15 DUF3349 unppercent F Bacteria T 6jqa 3 D D X5IFG3_ONYPH Phytoplasmal effector causing phyllody 1 MNKDIASASNNNQNITNYSIEENIINLKXKIRKNAVKKINTEREIQQLSNNDPNKNTLLALKQNLENLIHNQKEQLKTYQKLLKTLNDENN 91 T 0.15 DUF3349 unppercent F Bacteria T 6jsh 3 C,F,I C,H,I FAS2_YEAST FATTY ACID SYNTHASE SR2 HELICES LNMKYRKRQLVTREAQIKDWVENELEALKLEAEEIPSEDQNEFLLERTREIHNEAESQLRAAQQQWGNDFY 71 T 5.5E-10 SpoVAD unphh F Eukaryota T 6jue 2 B A PAR6B_MOUSE THR-ILE-ILE-THR-LEU LEEDGTIITL 10 T 0.28 CIDE-N pdbhh F Eukaryota T 6jwj 2 B C UFD1_YEAST UB FUSION PROTEIN 1,POLYMERASE-INTERACTING PROTEIN 3 GPGHMEPAKLDLPEGQLFFGFPM 23 T 19 AP-5_subunit_s1 pdbhh F Eukaryota T 6jwn 2 B,D B,D NOS2_HUMAN CR9 PEPTIDE RGDINNNVE 9 T 5.5 DUF6373 pdbhh F Eukaryota T 6jxu 2 B B ICP0_HHV11 viral protein NNRDPIVISDSP 12 T 5 MTD pdbhh T Viruses T 6jxv 2 B B ICP0_HHV11 Phosphorylated SLS4-SIM from ubiquitin E3 ligase ICP0 LANNRDPIVISDSPPASPHR 20 T 3.3 Myf5 pdbhh T Viruses T 6jzd 3 C C TBA1A_MOUSE GLU-GLY-GLU-GLU-TYR VDSVEGEGEEEGEEY 15 T 29 Hrs_helical unphh F Eukaryota T 6jzn 2 E,F,G,H G,F,H,E PDV1_ARATH PROTEIN PLASTID DIVISION1 DHLDVMMARG 10 T 9.3 LicD pdbhh F Eukaryota T 6k06 1 A A GOGA2_HUMAN PHOSPHOMIMETIC GM130,130 KDA CIS-GOLGI MATRIX PROTEIN,GM130,GM130 AUTOANTIGEN,GOLGIN-95 GPLGSMSEETRQSKLAAAKKKLREYQQRNDPGVPTGAKKKKKIKNGSNPETTT 53 T 0.0086 DUF812 unphh F Eukaryota T 6k07 2 B B SHLD3_HUMAN REV7-INTERACTING NOVEL NHEJ REGULATOR 1,SHIELD COMPLEX SUBUNIT 3 MGSKLPLRPKRSPPVISEEAAEDVKQYLTI 30 T 33 Sm_like unphh F Eukaryota T 6k0t 2 B,D B,D PRGC1_HUMAN Peroxisome proliferator-activated receptor gamma coactivator 1-alpha EEPSLLKKLLLA 12 T 12 Neurokinin_B pdbhh F Eukaryota T 6k11 1 A,B A,B Q5ZTL3_LEGPH Lpg2148(MvcA) LESPGFMVHKKLKSMSQSYGVMMTGVPAEVLGQMQAERSIPSINKTGNLKQQIAKEVSKVCHMMTEPTQSCGQASNDVCELLLGKIEAEKFHFTKYEALSADGDNLKNVLENTAPSSTNLLIRFEIDREDPPIVLVKTKNENFNPETAVKNKIYLLENKLYFIDKMGNLFNLGPGKKKCTQLFNAIGDSAEYSLCDPFVLEEPEKPEDFAISEIVDIFNEQKERFDFWIGSHSFTIYIPQTLGESPRQFYPYQAYFGSHTLQDWFVSDKDEYLSRIGIDKYIEKLAVLGKTTNTKERSDIYAEFFSKRGREAFFCAHLNEKRQPLRVKFKITEINPELALKNLQETQEFIDTHPGENPSDKVENYRNRAKLAMTEHLESLLDI 383 T 0.022 AgrD pdb F Bacteria T 6k15 8 I E HTL1_YEAST CHROMATIN STRUCTURE-REMODELING COMPLEX PROTEIN HTL1 MSQNNTISSMNPERAYNNVTLKNLTAFQLLSQRENICELLNLVESTERHNSIINPERQRMSLEEMKKMLDALKNERKK 78 T 0.064 DUF4976 pdbpercent F Eukaryota T 6k31 1 A,B A,B AiPEPCK GPGHMSSSPAAPTNSANSAIALRLELLGAPVPDHAARSDFDRETDRLVAPILARQRELTRRLANRPCAADRRIQAFLDSYLDGAAAQPKLPGATLVLDQPGLARALSLPVDATSFTSDYVESYRVLSGVLHNPRNDRRTTAGVFHVAEGGLPIPDDKKAVPRDVFARVLAAAVDAPDDLMTLPWASTQADPARCFVSLLLRPVVVPEVPGFSAERSMEIRFIAPGGLVSNLDFVEGIFGNGGDPYLPENDASLAPESWTGHTGCVILAPHLTRLTKKELGLPAWEEATERQRRDGMCWRGADELYNDGKAFKLVARDERGVIVTIIADNYYGYCKKEVKTQISYSANLFGCVEEEHSGGALAFPRYNLGQEYTDVHTPAGATVERVLARNPGRFEARADGSAVLLDDDGRPDEGIVLVPAGAHFSMRTQTVTWDRADGREASIPLLADRVYIAPGGYRVHAKHREGDATQWHLVGTAPWATQAHKPATVSGGGKSEISKSLLDAFVFGEAYVGDVDADLDAVQKILDGNYADRFVDPANKSAHHRPILSERRSLGSVIKLLTPSSMYTEEYNAFLESIPAHIKELIFTVKRYYQPGWGADWRSHFSVGIINGRKGNSLRLDGEVIKVNMLRVGFEDDGAWRLLSLRPDFSPAAKVQTEDDITSSIVAPGGLESTAGSSVSRKFVTNCESLLFQRPDDAIVRGYDKQTERDMSGTGLFISNYQPLTPADARAMVADAPGLSRFTEPMQELVRRAAAIPEAADPREETYWTSTANPRLVGGAPTRNPRYLQVRPDIANPRDVALADLSIHLYRDAPLAAPARHGVDVVAAGRRNNPPEPGVPALCAYNPLHYMELPELFMEFISSMTGKSPSTTGAGSEGALTKSPFNALPPVYDLNAALLSYALGGYDGWLSSAGYIGPKVKVAHDISLLVPEIFSRMTPQERDARALIEAGYLERLEDFDHEGRRIEASRLGYRMNAAFATAYFGRIFLHPDVVFTEEMLRPELQDPAIFADSVEVIVATHRAVAKHYVDDGSIQWAVPPLKALLEIMYSGRSEEGWTLSSPELRALFERENILASDWYAERVDAKVERDRKQAESAIAALTRFTTTQGNEEVTERLDIEGRLASARAWLDEVTSPAYRAHLVGTLGLQPSLA 1153 T 3 SKI pdbhh F T 6k32 2 B B C7EWL9_CPVBM VP4 FAIDPLKHSKLYEEYGLYLRPHQINQEIKPTTIKKKELAPTIRSIKYASLIHSMLAEHAARHNGTLINPRMYADMITLGNTKVTVTKGTPKAQIDTLKMNGLTVVSKSRRNNKKKPVSDTTATIDENTNAIVTYKALTEMSTLIESFRLPSGLTLIIFDDEKYQSLIPNYINQLIAYTQPHIIPTWQGIADFSDTYLRSYFKRPFELTASNLAAPQKHNLSPMTRSIFNNTGREDAVIRKLYGYGEYVFIKYEGCLITWTGIYGEVTMMVNLSKRDLGLDVGDDYLKEYKKLLFYGVITDAIPSGISTRSTIMKISPHKMMNPSGGALAVLSKFLEAVVSTNVINATLVVYAEKGAGKTSFLSTYAEQLSLASGQVVGHLSSDAYGRWLAKTKDIEEPSFAYDYVLSLDTDDNESYYEQKASELLMSHGISEVAQYELLSVRKKIKMMDEMNEVLIAQLENADTHSERNFYYMVSTGKTTPRILIVEGHFNAQDATIARTDTTVLLRTINDTTQAMRDRQRGGVVQLFLRDTYYRLLPALHTTVYPFEMLESIKRWKWV 559 T 0.00013 AAA_33 unppssm T Viruses T 6k32 5 E C CAPSD_CPVBM VP1 VQSRTDVFNEQFANEALHPMTKVIFNGLDVNTEVQPLSDDFKQISDPKGYLTYSVKYEDQFTKKDKLRASEADDRIVGPTVNLFKYGAAVVNIDLNRDFFDTATGIDLTKGIPLVQDLLVPIGVTAGAEQSAEYVSGLLMVLFKVMTDNRLVIVGETTTPMSNTLSTVVNNVLRTTYHNNVGVNPALLRDFTQVNWLNRDITNMLQQAGTKYGLGLTETRLDYVRLVKTIVGHALNIDHFAASVLNINLRALMEANVTADDRIKALQAHSMISTQFHGPNQGALRPELAFDHDHIIRCLMLAAANYPRLEGIIVQINTGYVASANVIRPVSEKRYFPENLEQNQSAARLVSAVKARASEADISSIHLAIAREVSPMFNVHELKKIAESFEDPSSIVVVLEFILFALFFPTEFNRIKGDIQNVLLLFFSRWYPVEYGIFIQRGATYTINAAGEFEFSGRNEKWDQSLYLSEHFPALFSDVPLAGANTIIAIMRLFTPQGFLRTDDLAIAANFPRASRNPQTYIPYTNQRGTVTNEFASRFRTIVATLANVVNERAVQDDMQKATRSCTKQWLRHLETQFDNIAVAHTDHLSVVYATMSNFMLNFTNNFSGNHATFKPDQYVITSPEGSYKPIIERQGETVDGLTIIDTSIVWPILCQCTYPLVRQSGKGVDAVSIMEEIVYPDPSTTLSQSLSVAQVLSKLTLPDAFINMILSGGDSVVMRTYQTEADDDLDEGIRMTTYDQYLSHIRERLHITNVPDPIYITGASTPDQIAASVQATHVAVVLYQSGVINGSASTYLRENEVLVVMPDYYDVVSRFANANLQMNNNRYHESVLEIADIFDQADFIQTSDAVRQLRALMPTLSTSQIRHAIERIAQITDVDSTDYGKLTLRFLGTLTRSLKMQNAQIRRIRPDGTVLRYDDQIDIEAFRWSRYFLDELRLRRLSVGLRLITNPRIARRFDGVRIMYLTDDDPDPDFVPDVPEGYVAVQYAHRLFSSSLANKRNRVTYTHPPTGMAYPSPTGRPHVHMTINERAGMSKLVADNIIASVIKSNWVVDIHDIEYTAEVMTPSEGYTQHVDAESIMTAPKGKLFHLQFMDGLLRPEPSAFDPPASGEDMRLIYPLQPISVARSMRAIVNHNEVDRPRGAVAPSSYEMDTGTLSRNGDLLYSPVANGQVGIPKLEVDHISFSNVVSMMTANIRTGDDMAVERVNPDDVRAINIRNA 1220 T 120 SRP19 pdbhh T Viruses T 6k32 6 F,G,H D,E,F CAPSD_CPVBM VP1 ALHPMTKVIFNGLDVNTEVQPLSDDFKQISDPKGYLTYSVKYEDQFTKKDKLRASEADDRIVGPTVNLFKYGAAVVNIDLNRDFFDTATGIDLTKGIPLVQDLLVPIGVTAGAEQSAEYVSGLLMVLFKVMTDNRLVIVGETTTPMSNTLSTVVNNVLRTTYHNNVGVNPALLRDFTQVNWLNRDITNMLQQAGTKYGLGLTETRLDYVRLVKTIVGHALNIDHFAASVLNINLRALMEANVTADDRIKALQAHSMISTQFHGPNQGALRPELAFDHDHIIRCLMLAAANYPRLEGIIVQINTGYVASANVIRPVSEKRYFPENLEQNQSAARLVSAVKARASEADISSIHLAIAREVSPMFNVHELKKIAESFEDPSSIVVVLEFILFALFFPTEFNRIKGDIQNVLLLFFSRWYPVEYGIFIQRGATYTINAAGEFEFSGRNEKWDQSLYLSEHFPALFSDVPLAGANTIIAIMRLFTPQGFLRTDDLAIAANFPRASRNPQTYIPYTNQRGTVTNEFASRFRTIVATLANVVNERAVQDDMQKATRSCTKQWLRHLETQFDNIAVAHTDHLSVVYATMSNFMLNFTNNFSGNHATFKPDQYVITSPEGSYKPIIERQGETVDGLTIIDTSIVWPILCQCTYPLVRQSGKGVDAVSIMEEIVYPDPSTTLSQSLSVAQVLSKLTLPDAFINMILSGGDSVVMRTYQTEADDDLDEGIRMTTYDQYLSHIRERLHITNVPDPIYITGASTPDQIAASVQATHVAVVLYQSGVINGSASTYLRENEVLVVMPDYYDVVSRFANANLQMNNNRYHESVLEIADIFDQADFIQTSDAVRQLRALMPTLSTSQIRHAIERIAQITDVDSTDYGKLTLRFLGTLTRSLKMQNAQIRRIRPDGTVLRYDDQIDIEAFRWSRYFLDELRLRRLSVGLRLITNPRIARRFDGVRIMYLTDDDPDPDFVPDVPEGYVAVQYAHRLFSSSLANKRNRVTYTHPPTGMAYPSPTGRPHVHMTINERAGMSKLVADNIIASVIKSNWVVDIHDIEYTAEVMTPSEGYTQHVDAESIMTAPKGKLFHLQFMDGLLRPEPSAFDPPASGEDMRLIYPLQPISVARSMRAIVNHNEVDRPRGAVAPSSYEMDTGTLSRNGDLLYSPVANGQVGIPKLEVDHISFSNVVSMMTANIRTGDDMAVERVNPDDVRAINIRNA 1205 T 120 SRP19 pdbhh T Viruses T 6k32 7 I G CAPSD_CPVBM VP1 KKPPTVVQSRTDVFNEQFANEALHPMTKVIFNGLDVNTEVQPLSDDFKQISDPKGYLTYSVKYEDQFTKKDKLRASEADDRIVGPTVNLFKYGAAVVNIDLNRDFFDTATGIDLTKGIPLVQDLLVPIGVTAGAEQSAEYVSGLLMVLFKVMTDNRLVIVGETTTPMSNTLSTVVNNVLRTTYHNNVGVNPALLRDFTQVNWLNRDITNMLQQAGTKYGLGLTETRLDYVRLVKTIVGHALNIDHFAASVLNINLRALMEANVTADDRIKALQAHSMISTQFHGPNQGALRPELAFDHDHIIRCLMLAAANYPRLEGIIVQINTGYVASANVIRPVSEKRYFPENLEQNQSAARLVSAVKARASEADISSIHLAIAREVSPMFNVHELKKIAESFEDPSSIVVVLEFILFALFFPTEFNRIKGDIQNVLLLFFSRWYPVEYGIFIQRGATYTINAAGEFEFSGRNEKWDQSLYLSEHFPALFSDVPLAGANTIIAIMRLFTPQGFLRTDDLAIAANFPRASRNPQTYIPYTNQRGTVTNEFASRFRTIVATLANVVNERAVQDDMQKATRSCTKQWLRHLETQFDNIAVAHTDHLSVVYATMSNFMLNFTNNFSGNHATFKPDQYVITSPEGSYKPIIERQGETVDGLTIIDTSIVWPILCQCTYPLVRQSGKGVDAVSIMEEIVYPDPSTTLSQSLSVAQVLSKLTLPDAFINMILSGGDSVVMRTYQTEADDDLDEGIRMTTYDQYLSHIRERLHITNVPDPIYITGASTPDQIAASVQATHVAVVLYQSGVINGSASTYLRENEVLVVMPDYYDVVSRFANANLQMNNNRYHESVLEIADIFDQADFIQTSDAVRQLRALMPTLSTSQIRHAIERIAQITDVDSTDYGKLTLRFLGTLTRSLKMQNAQIRRIRPDGTVLRYDDQIDIEAFRWSRYFLDELRLRRLSVGLRLITNPRIARRFDGVRIMYLTDDDPDPDFVPDVPEGYVAVQYAHRLFSSSLANKRNRVTYTHPPTGMAYPSPTGRPHVHMTINERAGMSKLVADNIIASVIKSNWVVDIHDIEYTAEVMTPSEGYTQHVDAESIMTAPKGKLFHLQFMDGLLRPEPSAFDPPASGEDMRLIYPLQPISVARSMRAIVNHNEVDRPRGAVAPSSYEMDTGTLSRNGDLLYSPVANGQVGIPKLEVDHISFSNVVSMMTANIRTGDDMAVERVNPDDVRAINIRNA 1226 T 130 SRP19 pdbhh T Viruses T 6k3a 2 B,D,F B,D,F DNMT1_HUMAN Peptide from DNA (cytosine-5)-methyltransferase 1 STRQTTITSHFAKGPAKRKP 20 T 0.14 AIB pdbhh F Eukaryota T 6k3b 1 A A Q5ZTL4_LEGPH Lpg2147 GSHMEKTGLHVHEKIKHMVKNYGTMITGIPAEILGQNEAEISVGYVKKMGNMKENIAEVVRKSEMTQPTNSAGKASNEVCDLLLGTEGASEFEKSSYQVLSGDGSNLKGSLPNKNLLVRVEMDRFNAPQKYQKIKREEFNPETAEKNKIYLLEDQLVYLDIFGKVIDLGQTSDTCHRLFNAITTPFYQNYILYDEYIDPEESAEEAAMFEMGEIVKAKMKNIDCWTATHSFTIFVPESDSEDTRTLYPYQAYWTSHTLQQWFSGDKDEKLSRLGIDGYIEKLALLGTTTDSKIRSSIYGELFSPPGKEHVFCTGMNEKFSPLRVKFKVTEVNPEIALQNLEEVQEFIDTNYPGENAKDQCELYKIKAQEAMTKQLEMRLLIE 382 T 0.035 V-ATPase_H_N unppercent F Bacteria T 6k3f 2 B,D,F,H,J,L U,V,W,X,Y,Z ACKR3_HUMAN CHEMOKINE-RELATED PROTEIN 1,C-X-C CHEMOKINE RECEPTOR TYPE 7,CXCR-7,CHEMOKINE ORPHAN RECEPTOR 1,G-PROTEIN COUPLED RECEPTOR 159,G-PROTEIN COUPLED RECEPTOR RDC1 HOMOLOG,RDC-1 IFKYSAKTGLTKLID 15 T 48 DUF2589 pdbhh F Eukaryota T 6k4k 1 A,B A,B SIDJ_LEGPH SidJ MFGFIKKVLDFFGVDQSEDNPSETAVETTDVSTKIKTTDTTQEESSVKTKTVVPTQPGGSVKPETIAPDQQKKHQIKTETTTSTTKQKGPKVTLMDGHVKQYYFARRGETSTHDTSLPPPVKVLSGRSIPLKEIPFEATRNELVQIYLTSIDKLIKSNKLNSIPSQQIASHYLFLRSLANSETDGIKKNQILSLAKPLGTYLASKEPHVWKMINELIEKSEYPIIHYLKNNRAHSNFMLALIHEYHKEPLTKNQSAFVQKFRDSSVFLFPNPIYTAWLAHSYDEDSSFNPMFRERLSTNFYHSTLTDNLLLRTEPKEVTLSSEHHYKKEKGPIDSSFRYQMSSDRLLRIQGRTLLFSTPQNDVVAVKVQKKGEPKSTLEEEFEMADYLLKHQRRLDVHSKLPQPLGQYSVKKSEILEISRGSLDFERFKTLIDDSKDLEVYVYKAPQSYFTYLHDKNQDLEDLTASVKTNVHDLFVLLREGIVFPQLADIFHTHFGEDEREDKGRYQALVQLLNVLQFQLGRIDKWQKAVEYVNLRSSGLADLGDSLPITSLFTSSDFTKHYFSELLTGGYHPTFFDKSSGTANSLFTGKRRLFGNYLYLNTIAEYLLVIQLTLGSYGDKVTRDMMDKPKKEAVWRELANVMFTSCAEAIHIMTGIPQSRALTLLKQRANIEKHFRQTQFWMTPDYSKLDEDTLQMEQYSIYSGEPEYEFTDKLVSGVGLSVDGVHQDLGGYNRESPLRELEKLLYATVTLIEGTMQLDKEFFKQLEQVEKILSGEIKTDANSCFEAVAQLLDLARPGCHFQKRLVLSYYEEAKLKYPSAPTDAYDSRFQVVARTNAAITIQRFWREARKNLSEKSDIDSEKPESERTTDKRL 873 T 0.34 IQ pdb F Bacteria T 6k4v 1 A A smart chimeric peptide G6 RVQGRWKVRASFFKGGGGSGFAWNVCVYRNGVRVCHRRAN 40 T 0.36 EipB_like pdbhh F T 6k4w 1 A A SCP-A6 RVQGRWKVRASFFKEAAAKEAAAKGFAWNVCVYRNGVRVCHRRAN 45 T 0.012 EipB_like pdb F T 6k5r 2 B B VIE2_HCMVA VIRAL TRANSCRIPTION FACTOR IE2, IE2,PROTEIN UL122 DTAGCIVISDSE 12 T 0.55 DUF2778 pdbhh T Viruses T 6k7w 1 A A UBP19_HUMAN UBIQUITIN SPECIFIC PEPTIDASE 19,DEUBIQUITINATING ENZYME 19,UBIQUITIN THIOESTERASE 19,UBIQUITIN-SPECIFIC-PROCESSING PROTEASE 19,ZINC FINGER MYND DOMAIN-CONTAINING PROTEIN 9 TPELALDWRQSAEEVIVKLRVGVGPLQLEDVDAAFTDTDCVVRFAGGQQWGGVFYAEIKSSCAKVQTRKGSLLHLTLPKKVPMLTWPSLLVE 92 T 0.00018 CS pdbhh F Eukaryota T 6k8e 1 A,B A,D SARX_STAA8 STAPHYLOCOCCAL ACCESSORY REGULATOR X HHHHHHMGSMNTEKLETLLGFYKQYKALSEYIDKKYKLSLNDLAVLDLTMKHCKDEKVLMQSFLKTAMDELDLSRTKLLVSIRRLIEKERLSKVRSSKDERKIYIYLNNDDISKFNALFEDVEQFLNI 128 T 0.00054 AphA_like unphh F Bacteria T 6k8k 2 B,D,F,H E,C,F,H BIC2_ARATH BLUE-LIGHT INHIBITOR OF CRYPTOCHROMES 2 PETTVLSGRDRLKRHREEVAGKVPIPDSWGKEGLLMGWMDFSTFDAAFTSSQIVSARAALMADSGHHHHHH 71 T 0.16 BRD4_CDT pdbpssm F Eukaryota T 6k9a 1 A A M5AAG8_9CAUD Primase SMIMEIPAIKALSRYAQWVIWKKERDTKIPYNPNNGKKASSTDPLAWGDIDEAQAGLVRYGANGLGFVLTKSDPFVFIDLDHVLDENKRVKCEWARQLLKEIKSYTEISPSGDGLHVVVSGKLPDYIKHKTKFDDGSALEVYESGRYMTITGEVFDGRDDIKELDLSILGEFAEHKIETKNAPVQIESATTLDDEAIIDLMKRKGQWPDAPKDGDDWSSLDMSFANRLAFWCGKDIERMDRIFRQSPLMRQKWDRPTAGSTYGRITLKKACDFVDSVYDPALRNESDCPFEPYNEEGGPRNDKEEK 306 T 0.00098 VirE_N pdbhh T Viruses T 6kac 28 BA,DB 4,3 Unindentified Stromal Protein (USP) AWAAAAGAAGAGYGVYRYEAAYGAA 25 T 0.017 PsbR pdb F T 6kbb 3 C,F F,E SWC5_YEAST SWR1-complex protein 5 GSMPEVETKIIPNEKEDEDEDGYIEEEDEDFQPEKDKLGGGSDDSDASDGGDDYDDGVNRDKGRNKVDYSRIESESGGLIK 81 T 10 DUF4637 pdbpercent F Eukaryota T 6kbm 2 B B ATG13_YEAST Autophagy-related protein 13 GGNSSTSALNSRRNSLDKSSNKQGMSGLPPIFGGESTSYHHDNKIQKYNQLGVEEDDDDENDRLLNQMGNSATKFKSSISPRSIDSISSSFIKSRIPIRQPYHYSQPTTAPFQAQAKFHKPANKLIDNG 129 T 20 RCS1 pdbhh F Eukaryota T 6kbw 1 A,B A,B A0A0B5RNJ4_9FLAO FLAVIN-CONTAINING MONOOXYGENASE MLNLKVGIIGAGPSGLAMLRAFESEQKKGNPIPEIKCYEKQDNWGGMWNYTWRTGVGKYGEPIHGSMYKYLWSNGPKECLEFSDYTFMEHFKQPISSYPPREVLFDYIQGRIKQSNARDFIKFNTVARWVDYLEDKKQFRVIFDDLVKNETFEEYFDYLVVGTGHFSTPNMPYFKGIDSFPGTVMHAHDFRGADQFIDKDILLIGSSYSAEDIGVQCFKHGSKSVTISYRTNPIGAKWPKGIEEKPIVTHFEDNVAHFKDGSKKEYDAVILCTGYQHKFPFLPDNLRLKTKNNLYPDNLYKGVVFNENERLIFLGMQDQYYTFNMFDTQAWFARDYMLGRIALPNKEIRDKDIAKWVELEKTSVTGEEHVDFQTDYIKELIEMTDYPTFDLDRVAEMFKSWLNDKETNILNYRDKVYTSVMTGVTAEEHHTPWMKELDDSLERYLDEVEVDELELSKENYYHHHHHH 467 T 1.6E-26 FMO-like unp F Bacteria T 6ke6 66 RB RV FAF1_YEAST FORTY S ASSEMBLY FACTOR MTLDDDDYIKQMELQRKAFESQFGSLESMGFEDKTKNIRTEVDTRDSSGDEIDNSDHGSDFKDGTIESSNSSDEDSGNETAEENNQDSKPKTQPKVIRFNGPSDVYVPPSKKTQKLLRSGKTLTQINKKLESTEAKEEKEDETLEAENLQNDLELQQFLRESHLLSAFNNGGSGSTNSGVSLTLQSMGGGNDDGIVYQDDQVIGKARSRTLEMRLNRLSRVNGHQDKINKLEKVPMHIRRGMIDKHVKRIKKYEQEAAEGGIVLSKVKKGQFRKIESTYKKDIERRIGGSIKARDKEKATKRERGLKISSVGRSTRNGLIVSKRDIARISGGERSGKFNGKKKSRR 346 T 4.1E-08 DUF4602 unppssm F Eukaryota T 6key 2 B B NOS2_HUMAN HEPATOCYTE NOS,HEP-NOS,INDUCIBLE NO SYNTHASE,INOS,NOS TYPE II,PEPTIDYL-CYSTEINE S-NITROSYLASE NOS2 KDINNNVEK 9 T 8.2 ZirS_C pdbhh F Eukaryota T 6kf3 3 C C RPOA2_THEKO DNA-directed RNA polymerase subunit A'' MVAEKTIKSMVSKAELPDNIKEELYAKLIEYNEKYKLKKDEIQAIIDETVREYQKALIEPGEAVGTVAAQSIGEPSTQMTLNTFHYAGVAEINVTLGLPRIIEIVDARKNPSTPIMTVYLDEEHRYDRDKALEVARRIEGTTLENLAREETIDILNMEYVVEIDPERLEKAGLDMEKVVRKLTGSFKSAEFEAEGYTLVVRPKKVTKLSDLRKIAEKVKKHRLKGLSGVGKTIIRKEGDEYVIYTEGSNFKQVLKVPGVDPTRTRTNNIWEIAEVLGIEAARNAIIDEIVSTMREQGLEVDVRHIMLVADMMTLDGVIRPIGRHGIVGEKASVLARAAFEITTQHLFAAAERGEVDPLNGVVENVLIGQPVPVGTGIVKLAMSLPLRPKRE 391 T 0.0004 RNA_pol_Rpb1_6 pdbpercent F Archaea T 6kfp 1 A A Q5ZTL4_LEGPH MavC EKTGLHVHEKIKHMVKNYGTMITGIPAEILGQNEAEISVGYVKKMGNMKENIAEVVRKSEMTQPTNSAGKASNEVCDLLLGTEGASEFEKSSYQVLSGDGSNLKGSLPNKNLLVRVEMDRFNAPQKYQKIKREEFNPETAEKNKIYLLEDQLVYLDIFGKVIDLGQTSDTCHRLFNAITTPFYQNYILYDEYIDPEESAEEAAMFEMGEIVKAKMKNIDCWTATHSFTIFVPESDSEDTRTLYPYQAYWTSHTLQQWFSGDKDEKLSRLGIDGYIEKLALLGTTTDSKIRSSIYGELFSPPGKEHVFCTGMNEKFSPLRVKFKVTEVNPEIALQNLEEVQEFIDTNYPGENAKDQCELYKIKAQEAMTKQLEMRLLIE 378 T 0.035 V-ATPase_H_N unppercent F Bacteria T 6kg6 1 A B Q5ZTL4_LEGPH MavC GPLGSEKTGLHVHEKIKHMVKNYGTMITGIPAEILGQNEAEISVGYVKKMGNMKENIAEVVRKSEMTQPTNSAGKASNEVCDLLLGTEGASEFEKSSYQVLSGDGSNLKGSLPNKNLLVRVEMDRFNAPQKYQKIKREEFNPETAEKNKIYLLEDQLVYLDIFGKVIDLGQTSDTCHRLFNAITTPFYQNYILYDEYIDPEESAEEAAMFEMGEIVKAKMKNIDCWTATHSFTIFVPESDSEDTRTLYPYQAYWTSHTLQQWFSGDKDEKLSRLGIDGYIEKLALLGTTTDSKIRSSIYGELFSPPGKEHVFCTGMNEKFSPLRVKFKVTEVNPEIALQNLEEVQEFIDTNYPGENAKDQCELYKIKAQEAMTKQLEMRLLIE 383 T 0.035 V-ATPase_H_N unppercent F Bacteria T 6kgx 3 BB,CE,M,RF 21,M4,M1,24 LR7 MAAFVVGAQVSSGFMGCAAPRKEQTRVGGAAKAAVSMRIIIQDKNSYRKYQTNTSTSKWDKLLNTKPMKRQVQPNPPTNETRALNLGNTFRSPAFKFLGTLKRSKDPSGLRLGFYGRKADDFMARSIAMQAKASAAGSGVYTTQCSEGASKGMAENARTASLAKQFRQAQRSAREMSFDYYEGRKYAMKAVGHICNYEEKIFQQYNKTAAAYVMGKQETLLSCDRYAQPANKAEEYIQKSVQMQMKKRSIPYGVYTTSCADGTVKGMAENARVAKESANFRARQMSAGAKAAARFNARRVANDWHNNGCNYEEKLTSRFPAAASSVRPTTNRY 333 T 0.022 APC_u5 pdb F T 6kgx 7 BS,EB,EG,EN,FZ,PD,RH,RK,SX,ZP AF,A2,A6,AB,AJ,Y3,A7,A9,AI,YD LR4 MAAFVSGFHGVQVGAPAENKLVCRAAKPAQLTMLTGYDSKSSPNFPNRAATRERRTVSFNARVARNKSQAKKILEKADEFFARSVTMQYKAFACPNGVYDIQCTEGTVKGAAYEKRAMAVSAAFRAKQASPAAKARALFENRRHAIIASHECQHEEDLFVRFPKLSAAYMMGKTEAMRTCSRYVVPDSLEEEYMAASVDRQMKERACPGGVYASSCVEGNAKGQAEQARVAALATAFRSAQKSASKTTAERYSSAAYGRDHFAHGCSYEESVFNTYPATAAAMRSKSYNY 290 T 0.14 Amidohydro_1 pdbpercent F T 6kgx 13 EJ,JV,KV,RL,WR,XR A8,wG,xG,AA,wE,xE LR5 MAAFVGSAASAFTGASAVKANEKRSVCSLQMVAMPQTGLVNSKFSARMAKKTAKQTKNKVDEYMARSVQRQYKQAAVATGVYGTQCTEGTVKGAAEASRSAALSRQFRIKQRSAFSKAHDLFEFRKHAIIAAAGCSYEEKMVTRFPKLAAAMVLGQTEMMRTCSRYVVPESVEEEYMAASVDKQMKRRGAPGGVYSLSCAEGVAKGQAEIARVSALGAAYRAASKSASAVTAERYNSMAYGRVHFAHGCSYEEQQFNKYPAAAAAMRSDSYGY 273 T 1.6 rRNA_methylase pdbpercent F T 6kgx 14 FJ,LV,SL,YR B8,yG,BA,yE LR8 METAFVSGFMGKAAVAKFGATAVCDKTARRSSSSNSQVHMVTGAVSSVNMRRFQRVPKVSGFSAKVTKKNVNKALDKADMFFAKSVTMEGKAAAIPYGVYGIQCMEGSAKGMAHEKRAMALSAAFRMNQRSAAEKTGAMYENRRLALILAQNDHQEKQYIKYPKLAAAALMASTEVTRACQRYAVPESIEEEFLAASVDKVNKMRGTTASGVYKSSCVEGNAKGQAEQARVAALAVAFRSAQKSASQFAAERYAQSKYGRDLFSSTHFEEGYANTYPAMAAAKRASSYGY 290 T 0.011 THF_DHG_CYH pdb F T 6kgx 23 MW,OX ZH,2H LRC4 MAFVACGPLRAGEGGARLGARKAACSMQLAPPGIPPGEDARNNQSLRQYVARPVETYQKRSFATPLPLTWTGETETVGAFDVVVPPQEKDLPVSGEATSAFVKYSDMVRAERKAALQALLSASAAGEGRPTCGAEGRKFVSNANPVLVNGVKCVEYWRK 159 T 14 CRM1_repeat unphh F T 6kgx 24 NW,PX aH,3H LRC5 MAFVSGAGVAVPAGAKASAPLCALRMSGYGDYSYSTDRTKGHVNQYYVDKARSRSDWGNRNVLPASEGDAVLGRTAKGAVAVPEFGIPQLDDPVLGFGPDSMVDPRIAEADGAVWRWDAGFVDESMTLASCADISDEAVADEAFAKFRGSVLAERGAMITKAESATASVITSLRDGLYSGEAQLLTASGQRLANVAGQEKIATISGYTWDGQPQTEIPGKPFVKSIGAMDYMDGVEGGDVVAAKVGAFWKPKAPKEVPYKRPMGANTPELPYNTVPRLVQAAGLAVQE 288 T 14 DUF5953 pdbhh F T 6khi 16 P P V5V507_9CYAN proton-translocating NADH-quinone dehydrogenase subunit P MDAVISVKPILLAMTPVFILLCLFFGTRNGFYDTDQYHGNGSAH 44 T 0.12 SPC25 unppercent F Bacteria T 6khi 17 Q Q V5V791_9CYAN proton-translocating NADH-quinone dehydrogenase subunit Q MATDFNRGIMKFDGADSPAMIAISAVLILGFIAALIWWALHTAYA 45 T 0.019 FixS pdb F Bacteria T 6ki9 1 A,B,C A,B,C A0A1C9HA64_9BACT FabMG, novel types of Enoyl-acyl carrier protein reductase MKSPIPLRDVPQSNIFRKGDVFVLFGELFGRGYANGLINEARDAGMTIVGITVGRRDENNALRALTAEELATAEANLGGRIINVPLMAGFDLDAPAGEPTPTDLLADMTLKSWQDDKLDWAHIEKCRAVGVQRFKDGVAKVMAELDGMIPDGANAFFAHTMAGGIPKVKVFLAIANRIYKGRGERFLSSSALLNSDLGKLILMNFDEVTANTFLHLIEGSAAIRARLEKSGGQVRYSAYGYHGTEILIDDKYQWQTYTSYTQGKAKMRLERIAEDAWKQGIKATVYNCPEIRTNSSDIFVGVELSLFPLLKALKKENGGAWAEAQWQACREVLSEGHTLESLLQKIDDYNASDVMKGFRNFEAWPMPNTAELADIMIGTSDEITKMHKSRDALVTDVLSALVLEGTGPLMFHESSNPAGPVLWLSHDVIAKQLNLMHRLEHHHHHH 446 T 0.16 DUF1566 pdb F Bacteria T 6kir 1 A A CX040_MOUSE Uncharacterized protein CXorf40 homolog GGSMKFPCLSFRQPYAGLILNGVKTLETRWRPLLSSVQKYTIAIHIAHKDWEDDEWQEVLMERLGMTWTQIQTLLQAGEKYGRGVIAGLIDIGETFQCPETLTAEEAVELETQAVLTNLQLKYLTQVSNPRWLLEPIPRKGGKDIFQVDIPEHLIPLEKE 160 T 0.00014 ASCH unp F Eukaryota T 6kit 1 A A CX040_MOUSE Uncharacterized protein CXorf40 homolog SMKFPCLSFRQPYAGLILNGVKTLETRWRPLLSSVQKYTIAIHIAHKDWEDDEWQEVLMERLGMTWTQIQTLLQAGEKYGRGVIAGLIDIGETFQCPETLTAEEAVELETQAVLTNLQLKYLTQVSNPRWLLEPIPRKGGKDIFQVDIPEHLIPLEK 157 T 0.00014 ASCH pdb F Eukaryota T 6kl4 1 A A Q5ZTL4_LEGPH MavC EKTGLHVHEKIKHMVKNYGTMITGIPAEILGQNEAEISVGYVKKMGNMKENIAEVVRKSEMTQPTNSCGKASNEVCDLLLGTEGASEFEKSSYQVLSGDGSNLKGSLPNKNLLVRVEMDRFNAPQKYQKIKREEFNPETAEKNKIYLLEDQLVYLDIFGKVIDLGQTSDTCHRLFNAITTPFYQNYILYDEYIDPEESAEEAAMFEMGEIVKAKMKNIDCWTATHSFTIFVPESDSEDTRTLYPYQAYWTSHTLQQWFSGDKDEKLSRLGIDGYIEKLALLGTTTDSKIRSSIYGELFSPPGKEHVFCTGMNEKFSPLRVKFKVTEVNPEIALQNLEEVQEFIDTNYPGENAKDQCELYKIKAQEAMTKQLEMRLLIE 378 T 0.035 V-ATPase_H_N unppercent F Bacteria T 6klm 1 A A Roseltide rT7 CVSSGIVDACSECCEPDKCIIMLPTWPPRYVCSV 34 T 2.4 Benyvirus_14KDa pdbhh F T 6kls 3 C,F C,F O66458_AQUAE Cytochrome c MNTWGLIKTIFFAGSTLVFFFLLWFYNPFKHVEHYEVDEEVKAIIDNPWKKTESGKTIAEEGRELFIASCSSCHSLRYDGIYIMSVAANPKWKNIEKTSGRPVYRFGTLYKDRFFVPKDVYEAFAHDDIQGLKASLGQVPPDLSSMYLARGEGYLYQFILNPQKVLPGTTMPQLFNPQFDPQAKEKVAKIVAYMKSVNTPPPKESAKRTVMGVIVIAYFIVMGLLLWKYRENLLKRLGYH 240 T 7E-06 Cytochrom_C1 pdbpssm F Bacteria T 6km7 2 C,D C,D RBBP5_HUMAN RBBP-5,RETINOBLASTOMA-BINDING PROTEIN RBQ-3 DEELEDSKALLYLPIAPEVEDPEENPYGPPPDGSQPPKKKPKTTNIELQGVPNDEVHPLLGVKGDGKSK 69 T 0.014 DUF2457 unppercent F Eukaryota T 6kmh 2 C,D C,D APBA1_RAT ADAPTER PROTEIN X11ALPHA,NEURON-SPECIFIC X11 PROTEIN,NEURONAL MUNC18-1-INTERACTING PROTEIN 1,MINT-1 GPGSEFQRYSKEKRDAISLAIKDIKEAIEEVKTRTIRSPYTPDEPKEPIWVMRQDISPTRDCDDQR 66 T 25 GCIP pdbhh F Eukaryota T 6kmy 1 A A Czon1107-P5A GFRSACPPFCX 11 T 0.088 Pellino pdbhh F T 6kn2 1 A A CX07_CONZO Czon1107-WT (Conformer A) GFRSPCPPFCX 11 T 0.098 Peroxidase_2 unphh F Eukaryota T 6kno 1 A A Czon1107-P7A(minor conformer) GFRSPCAPFCX 11 T 3.1 Chlam_OMP3 pdbhh F T 6kny 1 A,B A,B B2UR41_AKKM8 Protein Amuc_1100 IVNSKRSELDKKISIAAKEIKSANAAEITPSRSSNEELEKELNRYAKAVGSLETAYKPFLASSALVPTTPTAFQNELKTFRDSLISSCKKKNILITDTSSWLGFQVYSTQAPSVQAASTLGFELKAINSLVNKLAECGLSKFIKVYRPQLPIETPANNPEESDEADQAPWTPMPLEIAFQGDRESVLKAMNAITGMQDYLFTVNSIRIRNERMMPPPIANPAAAKPAAAQPATGAASLTPADEAAAPAAPAIQQVIKPYMGKEQVFVQVSLNLVHFNQPKAQEPSED 287 T 0.019 T2SSM unphh F Bacteria T 6kpb 2 B B IDD10_ARATH ID1-LIKE ZINC FINGER PROTEIN 3,PROTEIN INDETERMINATE-DOMAIN 10 SPMSATALLQKAAQMGS 17 T 1.9 fvmX5 pdbhh F Eukaryota T 6kpd 2 B B IDD9_ARATH ID1-LIKE ZINC FINGER PROTEIN 1,PROTEIN INDETERMINATE-DOMAIN 9 GPQIASMSATALLQKAAQMGSKRSSSSSSNSKTFGLMT 38 T 0.41 Vfa1 unppssm F Eukaryota T 6ks5 1 A,B A,B Q5ZV21_LEGPH Type IV secretion protein Dot MHTKKDKKVISLQERVENAVDVSGAFDNCFFHNFALYLLTNNLPLPDDLFHFKSIINRNSKAEQLFEFFHNPESLNLFSILDKENDVSEPSGYLFEKSLILGFLLREWFPTQLVNNSAVKAEMLEGEKGVFSAFKNYKEYRSFMSKEELKSTEFGALYEANEAFLEYFYNRSESTLINKDSPFEKYFVGSSSDEEAIKNYWDAEGYTLYCQHLAKPQVKLSYIEIMTMMKVINQPLTIYDRSTSSIVAEYVNPKVNLPDFEVAIDALQGHYFLLKTEETEKELEEYERSYAQYKRDRSEILAHSDKPVSSLLVRATCPKGHLDEDPFIALIESLSEINSLSQIDTNLKNENT 352 T 0.25 Ycf54 pdbpssm F Bacteria T 6kto 2 C C SHLD3_HUMAN REV7-INTERACTING NOVEL NHEJ REGULATOR 1,SHIELD COMPLEX SUBUNIT 3 MTTEVILHYRPCESDPTQLPKIAEKAIQDFPTRPLSRFIPWFPYDGSKLPLRPKRSPPVISEEA 64 T 6.8 Sm_like pdbhh F Eukaryota T 6kto 3 D D SHLD2_HUMAN PROTEIN FAM35A,RINN1-REV7-INTERACTING NOVEL NHEJ REGULATOR 2,SHIELD COMPLEX SUBUNIT 2 MGMSGGSQVHIFWGAPIAPLKITVSEDTASLMSVADPWKKIQLLYSQHSLYLKD 54 T 3 LPD38 pdbhh F Eukaryota T 6ku0 2 B,D B,D MICA1_HUMAN MOLECULE INTERACTING WITH CASL PROTEIN 1,MICAL-1,NEDD9-INTERACTING PROTEIN WITH CALPONIN HOMOLOGY AND LIM DOMAINS GPGSQPTRRQIRLSSPERQRLSSLNLT 27 T 3.1 DUF3156 pdbhh F Eukaryota T 6kva 3 E,F B,b CXCR2_HUMAN CXCR2 PEPTIDE DSFEDFWKGED 11 T 1.1 LRR_3 pdbhh F Eukaryota T 6kwo 3 C C NRAM_I96A0 peptide ESDTVGWSW 9 T 2 CDC24_OB1 pdbhh T Viruses T 6kx1 3 C C MUC1_HUMAN Synthetic MUC1 glycopepide XVTSAPDTRPAPGSTA 16 T 32 DUF3235 pdbhh F Eukaryota T 6kx9 3 C C 8-pepide (ARG-ARG-ALA-LEU-ARG-GLU-GLY-TYR) RRALREGY 8 T 4.6 RNR_inhib pdbhh F T 6kxx 2 B B PRGC1_HUMAN PGC1alpha PQEAEEPSLLKKLLLAPANTQL 22 T 4.1 Apo-CIII pdbhh F Eukaryota T 6kyf 1 A A AcrF11 GPLGSMSMELFHGSYEEISEIRDSGVFGGLFGAHEKETALSHGETLHRIISPLPLTDYALNYEIESAWEVALDVAGGDENVAEAIMAKACESDSNDGWELQRLRGVLAVRLGYTSVEMEDEHGTTWLCLPGCTVEKI 137 T 0.027 Strep_his_triad pdbpssm F T 6kyu 3 C C peptide LRKRQLTVL 9 T 5.9 FAM181 pdbhh F T 6kz1 2 B B MYO15_HUMAN Myosin XVa EKRLTLPPSEITLL 14 T 0.36 DUF4875 pdbhh F Eukaryota T 6kza 2 C,D C,D DNAC_ECOLI DNA replication protein DnaC MKNVGDLMQRLQKMMPAHIKPAFKTGEELLAWQKEQGAIRSAALERENRAMKMQRT 56 T 0.014 DUF6434 pdbpssm F Bacteria T 6kzg 2 B,D C,Q CIC_HUMAN 8-MER FROM PROTEIN CAPICUA HOMOLOG RSMSETGT 8 T 1.6 Tis11B_N pdbhh F Eukaryota T 6kzh 2 B,D C,Q CIC_HUMAN 8-MER FROM PROTEIN CAPICUA HOMOLOG RTQSLSAL 8 T 15 APC_r pdbhh F Eukaryota T 6kzj 1 A A ANK2_HUMAN ANK-2,ANKYRIN-B,BRAIN ANKYRIN,NON-ERYTHROID ANKYRIN GPGSASPDLLSEVSEMKQDLIKMTAILTTDVSDKAGSIKVKELVKAAEEEPGEPFEIVERVKEDLEKVNEILRSGT 76 T 15 PKHD_C pdbhh F Eukaryota T 6l0o 1 A A MCM8_HUMAN MINICHROMOSOME MAINTENANCE 8 MARSMSNRSTAKRFISALNNVAERTYNNIFQFHQLRQIAKELNIQVADFENFIGSLNDQGYLLKKGPKVYQLQTMHHHHHH 81 T 0.0086 TMP_3 pdbpercent F Eukaryota T 6l0v 2 B,D,F,H B,D,F,H Q5XVG3_ARATH LZY3 GPKWVKTDSDFIVLEI 16 T 1.3 DUF2286 pdbhh F Eukaryota T 6l2w 1 A,B A,B A0A4Y5TR47_9CAUD freshwater cyanophage protein MMFVRLSYHSFDYLFNLFDAGVIDLNTKCPVSLSEIEDYDNFGWLELTAENLENVCEYCAKLGIEANGSLGDFRYWYSGDMSYHLELKSDQSENLEVKIREINLKLKELELIKNECLEHHHHHH 124 T 0.036 RNA_pol_Rpb1_1 pdb T Viruses T 6l4u 12 L 2u Photosystem I reaction center subunit Psa28 FAPMPRAAISTTQARTASMPSAPFTSLSMASEDMTWEGEYPPSKVLGPIMSKMPSGLLGLISIACAAVCAYSIAQSGVLQQQPGAYENGSWVKWYYVLGSFGGPLAWGTHVASWIQRKNGM 121 T 0.02 PIG-Y pdbpercent F T 6l63 2 B,D B,D F3 XFAYDRRXLSNNXRNYXG 18 T 8 E2 pdbhh F T 6l66 2 B C PRO-ARG-LYS-GLN-LEU-ALA PRKQLA 6 T 89 DUF3597 pdbhh F T 6l6g 1 A,B A,B Q5ZZ22_LEGPH Uncharacterized protein Lpg0189 NSDNNTDGLIFSPLPQNKNTVVRHYSNEQEMPNLSQMAQRTIDFPTQIVRVSGNLTGLELSCDDVENEIDQVFSKKISPNLFTYNTYVSCGYDVNDPEQHAINFSIQSYFDPLTDNAVDYLKSYLKEYNGYNLFNTTTLQIENAKGIIVSMNLNAGLKSNPDKTPFTLYRQDRNNFYFKSNFDVRKELISDIYQRFYSNDPDMILPFFDKWIFSYAGSVYYSILMASNYLELQPERIFVMENEGDIFVSDLRYYFANLCMKRNPNKHCL 269 T 0.02 DUF5012 unppercent F Bacteria T 6l6v 1 A A GP44_BPSP1 GP44, GENE 44 PROTEIN MAKSNNVYVVNGEEKVSTLAEVAKVLGVSRVSKKDVEEGKYDVVVEEAAVSLADT 55 T 0.012 HTH_31 pdb T Viruses T 6l7c 3 AA,S,T,U,V,W,X,Y,Z a,S,T,U,V,W,X,Y,Z CSGA_ECOLI Major curlin subunit CsgA GVVPQYGGGGNHGGGGNNSGPN 22 T 0.35 YjbE unphh F Bacteria T 6l7i 1 A,B,C,D,E A,B,C,D,E Q9RN43_PHOLU TOXIN A RALEVERTVSLAEVYAGLPKDNGPFSLAQEIDKLVSQGSGSAGSGNNNLAFGAGTDTKTSLQASVSFADLKIREDYPASLGKIRRIKQISVTLPALLGPYQDVQAILSYGDKAGLANGCEALAVSHGMNDSGQFQLDFNDGKFLPFEGIAIDQGTLTLSFPNASMPEKGKQATMLKTLNDIILHIRYTIK 190 T 4.3999999999999996E-48 TcA_TcB_BD unppssm F Bacteria T 6l7i 4 H H Q93EP1_PHOLU TccC2 APEKGKYTKEVNFFDE 16 T 0.1 Ntox47 unphh F Bacteria T 6l7o 17 Q Q V5V791_9CYAN NAD(P)H-quinone oxidoreductase subunit Q MATDFNRGIMKFDGADSPAMIAISAVLILGFIAGLIWWALHTAYA 45 T 0.019 FixS unp F Bacteria T 6l7q 1 A A F8AFT0_PYRYC hypothetical protein MMLLTRHAKERIAKRLAKKRSLSHIYSSLWAFLERAVRIEIAEGVVAFTDGRKTLVCVPLDCERLSRGEILEKVRGVGVYECIFPEGRLAKLTRPEKFLESVPPGEYYFYMNDEKKVLYVGKRRPLLAITFRPAKRDERLFYIWA 145 T 1.2 DUF4258 pdbhh F Archaea T 6l7r 1 A A G0SED1_CHATD GCP3 GPLGSMQRINNAIDSLIGHLVPAAAGDDDDARTRRQAVFDLVRALLEQPGSNIPSDVNHASDLIKRRLISTNPSQALRFSNLYTRLLALPVLNQKWAILYLLHQLAD 107 T 0.068 DUF6415 pdb F Eukaryota T 6l8r 1 A A PD1L1_HUMAN HPD-L1,B7 HOMOLOG 1,B7-H1 GPRLRKGRMMDVKKCGIQDTNSKKQSDTHLEET 33 T 0.14 ASFV_J13L unphh F Eukaryota T 6l9k 2 B Q SER-PRO-SER-TYR-ALA-TYR-HIS-GLN-PHE SPSYAYHQF 9 T 8.4 F-112 pdbhh F T 6l9m 3 C,F,I,L C,F,I,L SER-PRO-SER-TYR-VAL-TYR-HIS-GLN-PHE SPSYVYHQF 9 T 0.51 DIPSY pdbhh F T 6lad 1 A,B,C,D,E,F A,B,C,D,E,F B2UR41_AKKM8 Amuc_1100 MIVNSKRSELDKKISIAAKEIKSANAAEITPSRSSNEELEKELNRYAKAVGSLETAYKPFLASSALVPTTPTAFQNELKTFRDSLISSCKKKNILITDTSSWLGFQVYSTQAPSVQAASTLGFELKAINSLVNKLAECGLSKFIKVYRPQLPIETPANNPEESDEADQAPWTPMPLEIAFQGDRESVLKAMNAITGMQDYLFTVNSIRIRNERMMPPPIANPAAAKPAAAQPATGAASLTPADEAAAPAAPAIQQVIKPYMGKEQVFVQVSLNLVHFNQPKAQEPSEDLEHHHHHH 296 T 0.019 T2SSM unphh F Bacteria T 6laf 1 A,B A,B B2UR41_AKKM8 Amuc_1100 MSLETAYKPFLASSALVPTTPTAFQNELKTFRDSLISSCKKKNILITDTSSWLGFQVYSTQAPSVQAASTLGFELKAINSLVNKLAECGLSKFIKVYRPQLPIETPANNPEESDEADQAPWTPMPLEIAFQGDRESVLKAMNAITGMQDYLFTVNSIRIRNERMMPPPIANPAAAKPAAAQPATGAASLTPADEAAAPAAPAIQQVIKPYMGKEQVFVQVSLNLVHFNQPKAQEPSEDLEHHHHHH 246 T 0.019 T2SSM unphh F Bacteria T 6lar 3 D,I F,J ECCC3_MYCS2 ESX CONSERVED COMPONENT C3,TYPE VII SECRETION SYSTEM PROTEIN ECCC3,T7SS PROTEIN ECCC3 MSRLIFEHQRRLTPPTTRKGTITIEPPPQLPRVVPPSLLRRVLPFLIVILIVGMIVALFATGMRLISPTMLFFPFVLLLAATALYRGGDNKMRTEEVDAERADYLRYLSVVRDNVRAHAAEQRAALEWSHPEPEVLATIPGTRRQWERDPRDRDFLVLRAGRHDVPLDAALKVKDTADEIDLEPVAHSALRGLLDVQRTVRDAPTGLDVAKLARITVIGEADEARAAIRAWIAQAVTWHDPTMLGVALAAPDLESGDWSWLKWLPHVDVPNEADGVGPARYLTTSTAELRERLAPALADRPLFPAESGAALKHLLVVLDDPDADPDDIARKPGLTGVTVIHRTTELPNREQYPDPERPILRVADGRIERWQVGGWQPCVDVADAMSAAEAAHIARRLSRWDSNPGYIRSTSTGSATFTTLLGIPDASALDVHLGGIKAFHHHHHHHHHH 449 T 0.0012 TauE pdbpercent F Bacteria T 6lbu 2 B B Q6CNW4_KLULA TEN1 GSSKIITDLDTIAGKIEEYAGDTLLRLRIFAQFQDISHSHERTDGIYLHFSNVPDFNAETNRERSYYFLIDETIYDEAFINTKSGERPHKGDILDMRCCYRKYDKVVEIMHLKVISIADLDSLREFLAKADDDSEIRSFLR 141 T 1.4 DUF2059 unphh F Eukaryota T 6lcn 1 A,B,C,D,E,F A,B,C,D,E,F D5SUT9_PLAL2 Serine O-acetyltransferase MGSSHHHHHHSSGLVPRGSHMATDLRLKDQLPEITDRIVESYRDFATTHHLGHCPLPSSEAVYEIAQDLQEILFPGYRRRQNLHMGNVTYHVGDLVDSLHDRLTQQIARALRHDYRRQHGISCADEVSHDFEALAQAKTITLLELLPRLRRTLALDVQAAFDGDPAAGSLDEIIFCYPGLHAVTIYRLAHELYLLDVPLIPRMLTEWAHSQTGIDIHPGATIGHSFFIDHGTGVVIGETCEIANHVKLYQGVTLGALSFPKDEQGNLLRRHKRHPTIEDHVVIYANATVLGGETVIGSHAVIGSSVSLSHSVPPNTIVTIEKPSLRYREAS 331 T 3.1E-10 DUF4954 pdbhh F Bacteria T 6ldv 3 C P M3K2_HUMAN GLY-M3L-GLY-GLY-THR-TYR-PRO NPIFEKFGKGGTYP 14 T 1.8 Thrombin_light pdbhh F Eukaryota T 6lek 1 A A Q9GRC4_MEGRO Cement protein-20k MAHEEDGVCNSNAPCYHCDANGENCSCNCELFDCEAKKPDGSYAHPCRRCDANNICKCSCTAIPCNEDHPCHHCHEEDDGDTHCHCSCEHSHDHHDDDTHGECTKKAPCWRCEYNADLKHDVCGCECSKLPCNDEHPCYRKEGGVVSCDCKTITCNEDHPCYHSYEEDGVTKSDCDCEHSPGPSEHHHHHH 191 T 0.7 Inhibitor_I53 unphh F Eukaryota T 6lfn 1 A,B A,B LpCGTb GMSPPAPADVVSSAKPHVAVIPAAGMGHLNPTLRLAGELASRGCVVTFINPSPPVSLAEATSVAEFVASTPGVRLLDLPVQPLDPSCFPAHEDPFLRQFEAVRRSAPLLTPLLSDVSPPLAAIVCDIAICSTFLTVAAEISLPAYVFFSLSAQMLSLNLAFPTVADQVYGAGEGDEIRFPGLPESIPRSWLPPPLLDPAHLFAVHFVENGKAMPRAAGILVHSWEALEPEALAALRGGRVLAGLPPVLPIGPLYQKEKSNAVFLPWLDAQRDRSVLFVCFGNRSTHSPEQLREMAAGLERSGCRFVWVLKTKVVDKDEDEGAQKEILGEGYLERVKERGVVINGWVDQMTILSHRAVGGFFSHSGSSSVAEAAIGGQPLLLWPMGGDQRMSALVAERRGMGVWPRGWGWSADDKLIPGEEIARRIKDFMGDNALRAVAAKMKKETASAMAPGGSKDQWFDDFIARINRV 469 T 4E-27 UDPGT pdbhh F T 6lfz 1 A,B A,B SbCGTb GMSKSENAGQRPHVAVFPCAGMGHLLPYLRLAAMLHSRGCAVSVISAHPTISDAESRSLSSFFSLYPQIRSLEIQLLPLKRNPRFTNDDPFFIQRESIGNSIHLLRPLLASLSPPLSAIFVDFPVLTEFSPIAADFSLPTYTLIVTSARFFSLMAHLPRLLEQEDDISKKSEVCVPHLDPIQVSSIPPQMLDRRHFFVETITSNVASLSYLKGVLINTFTWLEPEAVEALKRNGVDHILPIGPLEAIKAEESDMDLPWLEEQAPKSVLFISFGSRGAHTKEQLREFAAALEKSGWRFLWVLKSGKVDREDKEETEDILGSSFLERTKNRGVVIKGWADQERILAHSAIGGFVSHCGWNSVVEAAKLGVPVLAWPPHGDQRVNAEVVEKVGLGLWVRGWGWAGERLIGRDEIAEKLIELRNDERLRERVKEVREKAREERESGGISETLIRDLIHSLKIK 459 T 9.6E-27 UDPGT pdbhh F T 6lg0 1 A,B,C,D,E,F A,B,C,D,E,F SbCGTa GMASTTKSENVGAHIALFPCAGMGHLLPFLRLAAMLDARGCAVTVITVKPTVSAAESDHLSAFFTIHPRITRLEFQLLPYQKSGLRNDDPFFIQMETIATSVHLLRPLLSSLSPPLSAIVSDFTLTSQVTDLVSDLPISTYTLMTSSAAFFCLMAYLPKLLQIDVANRDAIEIPDLGPISMSSIPPKMLDPSDFFSAFISSNVSSLHKVKGVLINTFNSFESEAIEAVRRNGVDHILPIGPLESYDAKKAHDLPWLDEQPPESVLFVSFGSRTALSKEQIRELGAALEKSGCRFLWVLKGGKVDKEDKEEVEDMLGASFVERTKKKGLIVKGWVKQEQILAHPAIGGFVSHCGWNSVIEAARLGVPVLAWPQHGDQSVNAGVVEKAGLGLWVREWGWGQTKLIGREEIAEKMIEVMQDEKLRVSAGEVRAKAKETREVDGDSEALLQRLIHSFNNITQNS 460 T 2.7E-26 UDPGT pdbhh F T 6lhf 3 C,H C,F RY0808 peptide RRREQTDY 8 T 15 HEPN_SAV_6107 pdbhh F T 6ljk 2 B B BE2-SER-ALA-ILE-LYS-SER-NIY-GLY-SET XSAIKSXGX 9 T 2.8 LAG1-DNAbind pdbhh F T 6lkf 1 A A A0A220GHA5_9CAUD AcrIIA5 MAYGKSRYNSYRKRSFNRSNKQRREYAQEMDRLEKAFENLDGWYLSSMKDSAYKDFGKYEIRLSNHSADNKYHDLENGRLIVNIKASKLNFVDIIENKLDKIIEKIDKLDLDKYRFINATNLEHDIKCYYKGFKTKKEVI 140 T 0.033 DUSP pdbpssm T Viruses T 6llq 1 A A VAL88 GRVVVVVTSEQVKEEVRKKFPQVEVRVVTTEEDAKQVVKEVQKKGVQKVVVVGVSEKVVQKVKQEANVQVYRVTSNDEVEQVVKDVKGSGLEHHHHHH 98 T 0.00031 PrpR_N pdbpssm F T 6ln4 2 B B PRGC1_HUMAN PPARGC-1-ALPHA,LIGAND EFFECT MODULATOR 6 ASELLKYLTT 10 T 2.2 MTBP_mid pdbhh F Eukaryota T 6lnl 1 A,B,C A,B,C PP62_ASFB7 60 kDa polyprotein MPSNMKQFCKISVWLQQHDPDLLEIINNLCMLGNLSAAKYKHGVTFIYPKQAKIRDEIKKHAYSNDPSQAIKTLESLILPFYIPTPAEFTGEIGSYTGVKLEVEKTEANKVILKNGEAVLVPAADFKPFPDRRLAVWIMESGSMPLEGPPYKRKKEGGGNLEHHHHHHHH 170 T 0.28 mRNA_decap_C pdbpercent T Viruses T 6lnm 2 B,D,F B,D,F APBA1_MOUSE ADAPTER PROTEIN X11ALPHA,NEURON-SPECIFIC X11 PROTEIN,NEURONAL MUNC18-1-INTERACTING PROTEIN 1,MINT-1 GPGSEFISLAIKDIKEAIEEVKTRTIRSPYTPDEPKEPIWVMRQDISPTR 50 T 18 UPF0561 pdbhh F Eukaryota T 6lp2 1 A A Q5ZTL3_LEGPH Uncharacterized protein lpg2148 GSLESPGFMVHKKLKSMSQSYGVMMTGVPAEVLGQMQAERSIPSINKTGNLKQQIAKEVSKVCHMMTEPTQSAGQASNDVCELLLGKIEAEKFHFTKYEALSADGDNLKNVLENTAPSSTNLLIRFEIDREDPPIVLVKTKNENFNPETAVKNKIYLLENKLYFIDKMGNLFNLGPGKKKCTQLFNAIGDSAEYSLCDPFVLEEPEKPEDFAISEIVDIFNEQKERFDFWIGSHSFTIYIPQTLGESPRQFYPYQAYFGSHTLQDWFVSDKDEYLSRIGIDKYIEKLAVLGKTTNTKERSDIYAEFFSKRGREAFFCAHLNEKRQPLRVKFKITEINPELALKNLQETQEFIDTHPGENPSDKVENYRNRAKLAMTEHLESLLDIK 386 T 0.021 CIF pdbhh F Bacteria T 6lp3 1 A,B,D,E A,B,D,E YM11_YEAST Uncharacterized protein YMR124W GSPSKTKSAPVSYDKDGMNASEEDFSFDNTLAKPYEPLYARRGDITSAGSTSGEDSSQPKMITISGEQLNLITENKELMNELTLVSTELAESIKRETELEERIRLYETNNSAPSFDDSSSVSFSDFEKELRKKSSKIVQLIQQLNDERLKRFIAEEQLLLQENGTKPSSMELVGRIENLNKLIDERDSEIEMLKGRLQ 198 T 0.039 Yop-YscD_ppl pdbpercent F Eukaryota T 6lph 2 B B FUSED_DROME Serine/threonine-protein kinase fused AAPVINSHTCFVSGNSNMILNHMNDNFA 28 T 0.23 DUF4193 unp F Eukaryota T 6lqz 2 B B STH1_YEAST ATP-DEPENDENT HELICASE STH1,CHROMATIN STRUCTURE-REMODELING COMPLEX PROTEIN STH1,SNF2 HOMOLOG SEVKSSSVEIINGSESKKKKPKLTVKIKLNKTTVLENNDGKRAEEKPESKSPAKKTAAKY 60 T 70 DUF167 pdbhh F Eukaryota T 6lrp 1 A A A9WDE7_CHLAA Isocitrate lyase MRGSHHHHHHGSMDRAAQIKQIADSWNTPRFAGIVRPYTPEDVYRLRGSVQIEYTLARMGAERLWNLLHTEPYINALGALTGNQAMQQVKAGLKAIYLSGWQVAADANLAGQMYPDQSLYPANSGPQLVRNINNALRRADQIYHSEGRNDIYWFAPIVADAEAGFGGPLNVFEIMKAYIEAGAAGVHFEDQLASEKKCGHMGGKVLIPTQAAIRNLVAARLAADVMGVPTIIVARTDANAATLLTSDIDERDRPFCTGERTSEGFYRVRAGLDQAIARGLAYAPYADMIWCETSEPNLEEARRFAEAIHAQFPGKLLAYNCSPSFNWKKKLDDATIAAFQRELGAMGYKFQFVTLAGFHALNYSMFELARNYRDRGMAAYSELQQAEFAAEAYGYTATRHQREVGTGYFDEVAQVIAGGEISTTALTGSTEEEQFH 436 T 1.4999999999999998E-47 ICL unp F Bacteria T 6ls6 2 C,D C,D H31_HUMAN 2 TKQTARXS 8 T 260 SpecificRecomb pdbhh F Eukaryota T 6lsb 2 B B H3_ACRFO Histone H3 ARTKQTARKSTGGXAPRKQLATKAAX 26 T 0.26 PAF pdbpercent F Eukaryota T 6ltp 1 A,D A,G Cas12i2 SMSSAIKSYKSVLRPNERKNQLLKSTIQCLEDGSAFFFKMLQGLFGGITPEIVRFSTEQEKQQQDIALWCAVNWFRPVSQDSLTHTIASDNLVEKFEEYYGGTASDAIKQYFSASIGESYYWNDCRQQYYDLCRELGVEVSDLTHDLEILCREKCLAVATESNQNNSIISVLFGTGEKEDRSVKLRITKKILEAISNLKEIPKNVAPIQEIILNVAKATKETFRQVYAGNLGAPSTLEKFIAKDGQKEFDLKKLQTDLKKVIRGKSKERDWCCQEELRSYVEQNTIQYDLWAWGEMFNKAHTALKIKSTRNYNFAKQRLEQFKEIQSLNNLLVVKKLNDFFDSEFFSGEETYTICVHHLGGKDLSKLYKAWEDDPADPENAIVVLCDDLKNNFKKEPIRNILRYIFTIRQECSAQDILAAAKYNQQLDRYKSQKANPSVLGNQGFTWTNAVILPEKAQRNDRPNSLDLRIWLYLKLRHPDGRWKKHHIPFYDTRFFQEIYAAGNSPVDTCQFRTPRFGYHLPKLTDQTAIRVNKKHVKAAKTEARIRLAIQQGTLPVSNLKITEISATINSKGQVRIPVKFDVGRQKGTLQIGDRFCGYDQNQTASHAYSLWEVVKEGQYHKELGCFVRFISSGDIVSITENRGNQFDQLSYEGLAYPQYADWRKKASKFVSLWQITKKNKKKEIVTVEAKEKFDAICKYQPRLYKFNKEYAYLLRDIVRGKSLVELQQIRQEIFRFIEQDCGVTRLGSLSLSTLETVKAVKGIIYSYFSTALNASKNNPISDEQRKEFDPELFALLEKLELIRTRKKKQKVERIANSLIQTCLENNIKFIRGEGDLSTTNNATKKKANSRSMDWLARGVFNKIRQLAPMHNITLFGCGSLYTSHQDPLVHRNPDKAMKCRWAAIPVKDIGDWVLRKLSQNLRAKNIGTGEYYHQGVKEFLSHYELQDLEEELLKWRSDRKSNIPCWVLQNRLAEKLGNKEAVVYIPVRGGRIYFATHKVATGAVSIVFDQKQVWVCNADHVAAANIALTVKGIGEQSSDEENPDGSRIKLQLTS 1055 T 0.11 RHSP pdb F T 6ltr 1 A A Cas12i2 SMSSAIKSYKSVLRPNERKNQLLKSTIQCLEDGSAFFFKMLQGLFGGITPEIVRFSTEQEKQQQDIALWCAVNWFRPVSQDSLTHTIASDNLVEKFEEYYGGTASDAIKQYFSASIGESYYWNDCRQQYYDLCRELGVEVSDLTHDLEILCREKCLAVATESNQNNSIISVLFGTGEKEDRSVKLRITKKILEAISNLKEIPKNVAPIQEIILNVAKATKETFRQVYAGNLGAPSTLEKFIAKDGQKEFDLKKLQTDLKKVIRGKSKERDWCCQEELRSYVEQNTIQYDLWAWGEMFNKAHTALKIKSTRNYNFAKQRLEQFKEIQSLNNLLVVKKLNDFFDSEFFSGEETYTICVHHLGGKDLSKLYKAWEDDPADPENAIVVLCDDLKNNFKKEPIRNILRYIFTIRQECSAQDILAAAKYNQQLDRYKSQKANPSVLGNQGFTWTNAVILPEKAQRNDRPNSLDLRIWLYLKLRHPDGRWKKHHIPFYDTRFFQEIYAAGNSPVDTCQFRTPRFGYHLPKLTDQTAIRVNKKHVKAAKTEARIRLAIQQGTLPVSNLKITEISATINSKGQVRIPVKFDVGRQKGTLQIGDRFCGYDQNQTASHAYSLWEVVKEGQYHKELGCFVRFISSGDIVSITENRGNQFDQLSYEGLAYPQYADWRKKASKFVSLWQITKKNKKKEIVTVEAKEKFDAICKYQPRLYKFNKEYAYLLRDIVRGKSLVELQQIRQEIFRFIEQDCGVTRLGSLSLSTLETVKAVKGIIYSYFSTALNASKNNPISDEQRKEFDPELFALLEKLELIRTRKKKQKVERIANSLIQTCLENNIKFIRGAGDLSTTNNATKKKANSRSMDWLARGVFNKIRQLAPMHNITLFGCGSLYTSHQDPLVHRNPDKAMKCRWAAIPVKDIGDWVLRKLSQNLRAKNIGTGEYYHQGVKEFLSHYELQDLEEELLKWRSDRKSNIPCWVLQNRLAEKLGNKEAVVYIPVRGGRIYFATHKVATGAVSIVFDQKQVWVCNADHVAAANIALTVKGIGEQSSDEENPDGSRIKLQLTS 1055 T 0.11 RHSP pdb F T 6lui 1 A A SAMD1_HUMAN STERILE ALPHA MOTIF DOMAIN-CONTAINING PROTEIN 1,SAM DOMAIN-CONTAINING PROTEIN 1 SASPHYQEWILDTIDSLRSRKARPDLERICRMVRRRHGPEPERTRAELEKLIQQRAVLRVSYKGSISYRNAARVQPPRRG 80 T 0.025 Linker_histone pdbpercent F Eukaryota T 6lum 3 C,H,M E,I,O A0R4D1_MYCS2 Succinate dehydrogenase subunit F MVLFFEILLVAAVLVITWFAVYALYRLVTDES 32 T 0.0077 CoxIIa unphh F Bacteria T 6lup 3 C,F C,F PHE-ALA-ASN-PHE-PHE-ILE-ARG-GLY-LEU FANFFIRGL 9 T 3.9 DUF2199 pdbhh F T 6lvb 2 B,D,F,H B,D,F,H I6NWZ0_9RHOB N,N-dimethylformamidase small subunit MTEASESCVRDPSNYRDRSADWYAFYDERRRKEIIDIIDEHPEIVEEHAANPFGYRKHPSPYLQRVHNYFRMQPTFGRYYIYSEREWDAYRIATIREFGELPELGDERFKTEEEAMHAVFLRRIEDVRAELA 132 T 0.18 UPF0158 pdb F Bacteria T 6lw4 1 A A Q5ZTL3_LEGPH Uncharacterized protein Lpg2148 LESPGFMVHKKLKSMSQSYGVMMTGVPAEVLGQMQAERSIPSINKTGNLKQQIAKEVSKVCHMMTEPTQSAGQASNDVCELLLGKIEAEKFHFTKYEALSADGDNLKNVLENTAPSSTNLLIRFEIDREDPPIVLVKTKNENFNPETAVKNKIYLLENKLYFIDKMGNLFNLGPGKKKCTQLFNAIGDSAEYSLCDPFVLEEPEKPEDFAISEIVDIFNEQKERFDFWIGSHSFTIYIPQTLGESPRQFYPYQAYFGSHTLQDWFVSDKDEYLSRIGIDKYIEKLAVLGKTTNTKERSDIYAEFFSKRGREAFFCAHLNEKRQPLRVKFKITEINPELALKNLQETQEFIDTHPGENPSDKVENYRNRAKLAMTEHLESLLDI 383 T 0.021 CIF pdbhh F Bacteria T 6lw5 2 B B TRP-LYS-TYR-MET-VAL-QXV WKYMVX 6 T 32 DUF5891 pdbhh F T 6ly5 31 HA h A0A6J4B118_9STRA PsaR MSKLSFFILSAVLAVSAAFAPMPRAAISTTHARTASMPSASFTSLSMASEDMTWEGEYPPSKVLGPIMSKMPSGLLGLISIACAAVCVYSIAQSGVLQQQPGAYENGSWVKWYYVLGSFGGPLAWGTHVASWIQRKNGM 139 T 0.51 DUF6520 pdbhh F Eukaryota T 6lyc 2 B B D4-2 XXRYSAVYSIHPSWCGX 17 T 0.95 BUD22 pdbhh F T 6lzx 1 A,B A,B B5MGN9_PHYAM Glycosyltransferase MNHKVHHHHHHLQENLYFQGMGAEPQQLHVVFFPIMAHGHMIPTLDIARLFAARNVRATIITTPLNAHTFTKAIEMGKKNGSPTIHLELFKFPAQDVGLPEGCENLEQALGSSLIEKFFKGVGLLREQLEAYLEKTRPNCLVADMFFPWATDSAAKFNIPRLVFHGTSFFSLCALEVVRLYEPHKNVSSDEELFSLPLFPHDIKMMRLQLPEDVWKHEKAEGKTRLKLIKESELKSYGVIVNSFYELEPNYAEFFRKELGRRAWNIGPVSLCNRSTEDKAQRGKQTSIDEHECLKWLNSKKKNSVIYICFGSTAHQIAPQLYEIAMALEASGQEFIWVVRNNNNNDDDDDDSWLPRGFEQRVEGKGLIIRGWAPQVLILEHEAIGAFVTHCGWNSTLEGITAGVPMVTWPIFAEQFYNEKLVNQILKIGVPVGANKWSRETSIEDVIKKDAIEKALREIMVGDEAEERRSRAKKLKEMAWKAVEEGGSSYSDLSALIEELRGYHA 505 T 8.000000000000001E-28 UDPGT pdbhh F Eukaryota T 6m0q 2 B,D,F,H,J,L B,D,F,H,J,L Q82V11_NITEU Uncharacterized protein MNKVIVAAFVSAFVLGSTATFASGNLESSLAPISAKDMLDYLACKDKKPTDVVKSHTEVENGKIVRVKCGDIVALVQKAREQSGDAWQGGY 91 T 0.042 DUF6488 pdbpercent F Bacteria T 6m0r 6 F O YP17B_YEAST Uncharacterized protein YPR170W-B TGKAWCCTVLSAFGVVILSVIAHLFNTNHESFVGSINDPEDGPAVAHTVYLAALVYLVFFVFCGFQVYL 69 T 0.055 Tetraspanin pdbpssm F Eukaryota T 6m0y 1 A A LYS-ARG-ILE-VAL-LYS-ARG-ILE-LYS-LYS-TRP-LEU-ARG KRIVKRIKKWLR 12 T 0.78 FAM110_C pdbhh F T 6m19 1 A A lasso peptide LVVIVQADWNAPGFF 15 T 1.2 Exo_endo_phos pdbhh F T 6m1h 2 B B MAXA_LUTLO Maxadilan CDATCQFRKAIDDCQKQAHHSNVLQTSVQTTATFTSMDTSQLPGNSVFKECMKQKKKEFKA 61 T 0.74 Clavanin unphh F Eukaryota T 6m1u 1 A A MET16_HUMAN METHYLTRANSFERASE 10 DOMAIN-CONTAINING PROTEIN,METHYLTRANSFERASE-LIKE PROTEIN 16,N6-ADENOSINE-METHYLTRANSFERASE METTL16,U6 SMALL NUCLEAR RNA (ADENINE-(43)-N(6))-METHYLTRANSFERASE,METHYLTRANSFERASE 10 DOMAIN-CONTAINING PROTEIN,METHYLTRANSFERASE-LIKE PROTEIN 16,N6-ADENOSINE-METHYLTRANSFERASE METTL16,U6 SMALL NUCLEAR RNA (ADENINE-(43)-N(6))-METHYLTRANSFERASE MKPITFVVLASVMKELSLKASPLRSETAEGIVVVTTWIEKILTDLKVQHKRVPCGKEEVSLFLTAIENSWIHLRRKKRERVRQLREVPRAPEDVIQALEEKKGVAGQYLFKCLINVKKEVDDALVEMHWVEGQNRDLMNQLCTYIRNQIFRLVAVNLEHHHHHH 164 T 15 BLUF pdbhh F Eukaryota T 6m24 3 C C POLG_RHDVF VP60-2 ALMPGQFFV 9 T 0.46 tRNA_Me_trans pdbhh T Viruses T 6m2k 1 A C POLG_RHDVF VP60-10 FVPFNSPNI 9 T 6.5 DUF1919 pdbhh T Viruses T 6m3n 1 A A anti-CRIPSR AcrIF7 GHMTTFTSIVTTNPDFGGFEFYVEAGQQFDDSAYEEAYGVSVPSAVVEEMNAKAAQLKDGEWLNVSHEA 69 T 0.034 DUF3085 pdbpssm F T 6m64 2 B,D,F B,D,F CBP_HUMAN CBP GPPPAAVEAARQILREAQQQQHLYSDED 28 T 6.4 DUF2007 pdbhh F Eukaryota T 6m6q 1 A,B A,B Q93413_CAEEL Dicer Related Helicase MQPTAIRLEDYDKSKLRLPFESPYFPAYFRLLKWKFLDVCVESTRNNDIGYFKLFESLFPPGKLEEIARMIIDEPTPVSHDPDMIKIRNADLDVKIRKQAETYVTLRHAHQQKVQRRRFSECFLNTVLFDEKGLRIADEVMFNYDKELYGYSHWEDLPDGWLTAETFKNKFYDEEEVTNNPFGYQKLDRVAGAARGMIIMKHLKSNPRCVSETTILAFEVFNKGNHQLSTDLVEDLLTEGPAFELKIENGEEKKYAVKKWSLHKTLTMFLAIIGFKSNDKKEKNEHEEWYYGFIDAMKNDPANRAALYFLDKNWPEELEEREKERDRIRLTLLKS 335 T 0.29 RE_NgoFVII unphh F Eukaryota T 6m7a 2 B,D D,C SHLD3_HUMAN REV7-INTERACTING NOVEL NHEJ REGULATOR 1,SHIELD COMPLEX SUBUNIT 3 QDFPTRPLSRFIPWFPYDGSKLPLRPKRSPPVISEEAAEDVKQYLT 46 T 4.6 Sm_like pdbhh F Eukaryota T 6m7b 2 C,D C,D SHLD3_HUMAN REV7-INTERACTING NOVEL NHEJ REGULATOR 1,SHIELD COMPLEX SUBUNIT 3 RFIPWFPYDGSKLPLRPKRSPPVISEEAAEDVKQYLT 37 T 1.2 Sm_like pdbhh F Eukaryota T 6m8r 2 K,L K,L GABR2_HUMAN GB2, G-PROTEIN COUPLED RECEPTOR 51, HG20 GPEKDPIEDINSPEHIQRRLSLQLPILHHAYLPSIGGVDAS 41 T 5.2 DUF776 pdbhh F Eukaryota T 6m8s 3 I,J,K,L,O A,O,P,B,M KCD12_HUMAN PFETIN,PREDOMINANTLY FETAL EXPRESSED T1 DOMAIN GPESLDGSRRSGYITIGYRGSYTIGRDAQADAKFRRVARITVCGKTSLAKEVFGDTLNESRDPDRPPERYTSRYYLKFNFLEQAFDKLSESGFHMVACSSTGTCAFASSTDQSEDKIWTSYTEYVFCRE 129 T 0.13 Baculo_VP91_N pdb F Eukaryota T 6m90 3 C C CTNB1_HUMAN BETA-CATENIN CDRKAAVSHWQQQSYLDSGIHSGATTTAPSLSG 33 T 25 AvrPto pdbhh F Eukaryota T 6m91 3 C C CTNB1_HUMAN BETA-CATENIN CDRKAAVSHWQQQSYLDSGIHAGATTTAPSLSG 33 T 12 AvrPto pdbhh F Eukaryota T 6m9k 2 D,E,F D,E,F VBET_LAMBD Recombination protein bet ITPVNDETMQEINTLLIALDKTWDDDLLPLCSQIFRRDIRASSELTQAEAVKALGFLKQKAAEQKVA 67 T 0.25 DUF1018 pdbhh T Viruses T 6mbb 2 B B dF1 XSYVDKIADVMREVAEKINSDLTX 24 T 1.7 DUF5806 pdbhh F T 6mbc 2 B B dF4 XSLLEKLAEYLRQMADEINKKYVKX 25 T 0.11 Amphi-Trp pdb F T 6mbd 2 C,D C,D dM1 XAPKEKEVAETLRKIGEEINEALKX 25 T 0.11 Bclx_interact pdb F T 6mbe 2 B B dM7 XDKTLEEIARELLKLALEIDKEIX 24 T 1.3 DUF2497 pdbhh F T 6mbm 1 A A MYO1H_HUMAN MYOSIN-1H KWAVRIIRKFIKGFIS 16 T 0.00051 IQ unppercent F Eukaryota T 6mc6 1 A,B A,B PPRA_DEIRA PLEIOTROPIC PROTEIN PROMOTING DNA REPAIR GMQTDGIYAAFDTLMSTAGVDSQIAALAASEADAGTLDAALTQSLQEAQGRWGLGLHHLRHEARLTDDGDIEILTDGRPSARVSEGFGALAQAYAPMQALDERGLSQWAALGEGYRAPGDLPLAQLKVLIEHARDFETDWSAGRGETFQRVWRKGDTLFVEVARPASAEAALSKAAWKVIASIKDRAFQRELMRRSEKDGMLGALLGARHAGAKANLAQLPEAHFTVQAFVQTLSGAAARNAEEYRAALKTAAAALEEYQGVTTRQLSEVLRHGLRES 278 T 8.6 Ldt_C pdbhh F Bacteria T 6mc8 1 A,B A,B C1D318_DEIDV DNA repair protein PprA MRSGSHHHHHHRSDITSLYKKAGLENLYFQGSVNPLARFAELVATAGLQSDVQALADSGADDTTLEAQLTQELRLAHDRWGLGLLHLQHSARLIHTDGVPSDIALLVDGAPRAQLSDGARAIAGTYASMQAPGPEGRSEWGILPEGHRVTLRPGLGQLRVLIEDARDFETHWTPGAAQTWTRTWRQGETLAVEVHRPATPATALAKAAWKVITSIKDRTFQRELMERSNQVGMLGALLGARHSGAGDALNQLPEAHFAVSSAVVRETGREGREVDRWKAMQREATETLDELQKAATRRLAAVLSGGLR 308 T 0.2 Asp_protease_2 unp F Bacteria T 6mcc 3 C C A0A4V8H027_9CAUD ACRIIA2B.3 MTTARKKFYQAISEFEAMTGKDVERTPQIADEVLNDAEYIAFTKTEKYALYLCTSNVEGLEDRYFLDEECLDSTFLETEDNETYYIHFLQETEFSEDDNEDELPLATEEQIEAYDKQEELKAVILKKELN 130 T 0.051 CARDB pdbpercent T Viruses T 6mcd 1 A A Pb(II)(GRAND Coil Ser L12CL16A)- XEWEALEKKLAACESKAQALEKKLQALEKKLEALEHGX 38 T 0.0005 Lebercilin pdbpssm F T 6mct 1 A,B,C,D,E,F,G,H,I,J,K,L,M,N,O A,B,C,D,E,F,G,H,I,J,K,L,M,N,O mini-eVgL membrane protein DSLKWIVFLLFLIVLLLLAIVFLLRGX 27 T 0.002 RCR pdbhh F T 6me1 3 C,F F,E ENV_HV1B1 ENV POLYPROTEIN AVGLGAVFLGHHHHHH 16 T 52 PBP_N unphh T Viruses T 6mf5 2 C,D C,D SPC72_YEAST Spc72 SLAQSSPAGSQ 11 T 82 NAD_kinase_C pdbhh F Eukaryota T 6mf6 2 C,D D,C DBF4_YEAST DUMBBELL FORMING PROTEIN 4 RARIERARSIEGAVQVSKGTG 21 T 2.2 PDGF_N pdbhh F Eukaryota T 6mf8 1 A A TRAC_MOUSE T-cell receptor alpha chain C region DATLTEKSFETDMNLNFQNLSVMGLRILLLKVAGFNLLMTLRLWSS 46 T 0.048 Ribonucleas_3_3 pdb F Eukaryota T 6mgq 2 D,E,F D,E,F Phosphinic inhibitor DG014 XXTFPETLTY 10 T 22 Tcp11 pdbhh F T 6mhf 2 B C GRDN_HUMAN AKT PHOSPHORYLATION ENHANCER,APE,COILED-COIL DOMAIN-CONTAINING PROTEIN 88A,G ALPHA-INTERACTING VESICLE-ASSOCIATED PROTEIN,GIV,GIRDERS OF ACTIN FILAMENT,HOOK-RELATED PROTEIN 1,HKRP1 KTGSPGSEVVTLQQFLEESNKLTSVQIKSSS 31 T 4.3 DFRP_C pdbhh F Eukaryota T 6mi9 1 A A PRO-MET-ALA-ARG-ASN-LYS-ILE-LEU-GLY-LYS-ILE-LEU-ARG-LYS-ILE-ALA-ALA-PHE-LYS PMARNKILGKILRKIAAFKX 20 T 0.88 HATPase_c_4 pdbhh F T 6mic 1 A A A0A0H3AKH0_VIBC3 Toxin co-regulated pilus biosynthesis protein B GSHMFLEDSELCWDTAAGSAKSCLSVRYDTVGNKTELDLKQIDVVSAKGLSFESDGKTKTPVVSTYETFQDGGRAKTINAIECPTGLNNRFAAVVSSFSTAGQNANFSSESAKDSQGTTQKDGSKGPHALLSGISLNWTLTNKVWDVTASIGIESGILPTSGIDSGSLLRNPKSLSFIAFQWCEN 185 T 23 DapH_N pdbhh F Bacteria T 6mjb 2 C C Q6FKQ5_CANGA Kinetochore-associated protein DSN1 GDKDNGLHAGETDGDDEGFEFRRHSNLGVPTLGERLDSLHEIKSARRMDHFNSSRNSLR 59 T 6.9 DUF1752 pdbhh F Eukaryota T 6mjc 2 B B Q6FKQ5_CANGA Kinetochore-associated protein DSN1 SNAPTLGERLDSLHEIKSARRMDHFNDD 28 T 3 DUF1752 pdbhh F Eukaryota T 6mje 2 B,D,F,H B,D,F,H DSN1_YEAST Dsn1p DLKFKRHKNKHIQGFPTLGERLDNLQDIKKAKRVENFNSS 40 T 2.5 Cytadhesin_P30 unphh F Eukaryota T 6mjl 2 B A MLXPL_HUMAN ChREBP Peptide ASN-TYR-TRP-LYS-ARG-ARG-ILE-GLU-VAL NYWKRRIEV 9 T 0.015 DUF2635 pdbhh F Eukaryota T 6mk4 1 A A TXPR2_THRPR BETA/OMEGA-TRTX-TP2A, PROTX-II, PT-II, PROTOXIN-2, PROTX2 YCQKWMWTCDSERKCCKGMVCRLWCKKKLW 30 T 0.001 Toxin_12 unppercent F Eukaryota T 6ml1 3 E G Proteolyzed N-terminal tag of Ubv.15.1a construct MAHHHHHHDTSLYKKAGSTENLYFQG 26 T 52 PTase_Orf2 pdbhh F T 6mm1 1 A,B,C,D A,B,C,D EHMT2_HUMAN EUCHROMATIC HISTONE-LYSINE N-METHYLTRANSFERASE 2,HLA-B-ASSOCIATED TRANSCRIPT 8,HISTONE H3-K9 METHYLTRANSFERASE 3,H3-K9-HMTASE 3,LYSINE N-METHYLTRANSFERASE 1C,PROTEIN G9A GSGFEELPLCSCRMEAPKIDRISERAGHKCMATESVDGELSGCNAAILKRETMRPSSRVALMVLCETHRARMVKHHCCPGCGYFCTAGTFLECHPDFRVAHRFHKACVSQLNGMVFCPHCGEDASEAQEVTIPRGD 136 T 0.029 DZR pdb F Eukaryota T 6mm5 2 B C RYR2_MOUSE RYR2,CARDIAC MUSCLE RYANODINE RECEPTOR,CARDIAC MUSCLE RYANODINE RECEPTOR-CALCIUM RELEASE CHANNEL,TYPE 2 RYANODINE RECEPTOR LYNRTRRISQTS 12 T 18 SNN_linker pdbhh F Eukaryota T 6mon 2 C,D C,D LYS-LEU-NLE-SER-LYS-ARG-GLY KLXSKRG 7 T 32 YuiB pdbhh F T 6mpz 2 E,F,G,H M,N,O,P peptide aldehyde inhibitor 1 based on the ProcA2.8 leader peptide GNLSDDELEGVAGX 14 T 0.00047 L_biotic_typeA pdbhh F T 6mrq 2 B I inhibitor from Tityus obscurus scorpion venom (TopI1) ILKRCKTYDDCKDVCKARKGKCEFGICKCMIK 32 T 0.012 Toxin_2 pdbhh F T 6mrr 1 A A Foldit1 GWSTELEKHREELKEFLKKEGITNVEIRIDNGRLEVRVEGGTERLKRFLEELRQKLEKKGYTVDIKIE 68 T 0.0019 HMA pdb F T 6mrs 1 A A Peak6 GSGRQEKVLKSIEETVRKMGVTMETHRSGNEVKVVIKGLHESQQEQLKKDVEETSKKQGVETRIEFHGDTVTIVVRE 77 T 0.0072 Phage_TAC_5 pdb F T 6ms1 2 C,D C,D APC C-terminus peptide GSYLVTSV 8 T 0.068 EB1_binding pdbhh F T 6ms4 2 B B DENR_HUMAN DRP,PROTEIN DRP1,SMOOTH MUSCLE CELL-ASSOCIATED PROTEIN 3,SMAP-3 GDYPLRVLYCGVCSLPTEYCEYMPDVAKCRQWLEKNFPNEFAKLTV 46 T 0.012 PHM7_cyt unppssm F Eukaryota T 6msp 1 A A De novo Designed Protein Foldit3 MGHHHHHHENLYFQSHMTDELLERLRQLFEELHERGTEIVVEVHINGERDEIRVRNISKEELKKLLERIREKIEREGSSEVEVNVHSGGQTWTFNEK 97 T 0.0032 DUF6175 pdb F T 6mt3 2 B B NP338 peptide FEDLRVLSF 9 T 0.74 Flu_NP pdbhh F T 6mt4 3 C C NP338-L7S peptide FEDLRVSSF 9 T 9.1 EKR pdbhh F T 6mt5 3 C C NP338-V6L peptide FEDLRLLSF 9 T 2.6 Flu_NP pdbhh F T 6mtu 2 C,D C,D CRCM_HUMAN PROTEIN MCC PHTNETSL 8 T 8.4 TPR_MLP1_2 unphh F Eukaryota T 6mv5 3 C P PCSK9_HUMAN NEURAL APOPTOSIS-REGULATED CONVERTASE 1, NARC-1, PROPROTEIN CONVERTASE 9, PC9, SUBTILISIN/KEXIN-LIKE PROTEASE PC9 XKDEDGDYEELVLALRSEEDGLA 23 T 7.9 PIN7 pdbhh F Eukaryota T 6mw6 1 A A Citrocin GGVGKIIEYFIGGGVGRYG 19 T 8.1 Bac_chlorC pdbhh F T 6mwm 1 A A R1AB_BCHK4 NSP3, PAPAIN-LIKE PROTEINASE SHMQTPETAFINNVTSNGGYHSWHLVSGDLIVKDVCYKKLLHWSGQTICYADNKFYVVKNDVALPFSDLEACRAYLTSRAA 81 T 0.25 DUF3954 pdb T Viruses T 6myd 2 B,D B,D STING_DANRE STING CTT, Transmembrane protein 173 EPVETTDY 8 T 15 Swm2 pdbhh F Eukaryota T 6mye 2 B B ARHGQ_HUMAN SH3 DOMAIN-CONTAINING GUANINE EXCHANGE FACTOR XKPNGLLITDFPX 13 T 0.82 DUF1968 pdbhh F Eukaryota T 6mzn 1 A A E7FDB6_DANRE Transforming growth factor beta receptor III GSPCELLPVGVGHPVQAMLKSFTALSGCASRGTTSHPQEVHIINLRKGSAQGAREKTAEVALHLRPIQSLHVHQKPLVFILNSPQPILWKVRTEKLAPGVKRIFHVVEGSEVHFEVGNFSKSCEVKVETLPHGNEHLLNWAHHRYTAVTSFSELRMAHDIYIKVGEDPVFSETCKIDNKFLSLNYLASYIEPQPSTGCVLSGPDHEQEVHIIELQAPNSSSAFQVDVIVDLRPLDGDIPLHRDVVLLLKCEKSVNWVIKAHKVMGKLEIMTSDTVSLSEDTERLMQVSKTVKQKLPAGSQALIQWAEENGFNPVTSYTNTPVANHFNLRLREHHHHHH 338 T 0.17 DUF108 pdbpssm F Eukaryota T 6n05 1 A,B A,B A0A425B3G2_NEIME AcrIIC2 MNTIHHHHHNTSGSGGGGGRLVPRGSMSENLYFQGSMSKNNIFNKYPTIIHGEARGENDEFVVHTRYPRFLARKSFDDNFTGEMPAKPVNGELGQIGEPRRLAYDSRLGLWLSDFIMLDNNKPKNMEDWLGQLKAACDRIAADDLMLNEDAADLEGWDD 159 T 0.05 SfsA_N pdbpercent F Bacteria T 6n0s 1 A A A3DM20_STAMF MCRB GPMNKILGFSKYWVEINNWILPTLDHIGLTLWGMIKKHASEYRGIRYSLEKFGELKIIHYIGSRASHDLGKTFIGESVLSKENNKIVKEYSWPEIKKVLRKIFLDNGISSDTIDQYFIAIRRIIRPSRSDRFFLFRLAEYRKYENPVKYDQVRDIISHITWTGRYLVPIRPEDYEAIHSRG 181 T 2.9 Colicin_Pyocin pdbhh F Archaea T 6n2b 1 A,B A,B E4S4B2_CALKI Tapirin MKTSTTYGGESLTEAYLYYFNQICEDAREAAYSYYFDGSGNFKSTYIGGKVSPQNEPKRIWDDLTAGHITQDEAKDRILSGMREIIVSEVNNFMNGLPSSISFKVSPSSPITINKLDDLKNYILKELKDISNFGVGSFTVSSWSAGDVKGYTVEFEVYKEQNTGTSPAKDTVRNMRIDIAVNKGVMPDIGTLNPTSSTSSWNDLFEYAVYSRGSFLPNYKFTVRGGSIYSGERIQTQGEFKAIGVNNLICKGPEVIVNGGGNSIEIKEIMYIQNKLVFNGAPNTNPNTLNANKIYTGLGGMELNGYGYYKANEIYSDGEVQVKNYGNFEIGSIGIVKKLTVTDNGRTTIKSGATLYCDQLEVRNNGRVFIEAGATLVTRAISISGGTIEGPGTRQVNPSATFPSYPPFIDDIKNFDFDSRMSVTTLPADPVGATTLGSVYDKSATPWEIVVYGESGINDSELITEVNSKLGSFPSNVRLYLASKGNITFSNPTSLPLYNPTTGKLVIEGAIITLGSTFNINISGAGIELIYKRAGSTIESSITSTLNYIPPPRSYSSSSAQTVNTMYQVKRRGMIIK 577 T 0.00018 PilX_N unphh F Bacteria T 6n2c 1 A,B A,B E4Q7C4_CALH1 Tapirin MLTSLIHSKETINKTQTSTAADSAMEYILFYISKAIAQAKRLTYAQFFDSTGRLIYTGDSFENDYLNTFNSYIADFFENRGNRIGIDMKLADNSSVQVSNVSELILLARQSCEYISNISFSRSGNSYILEVEALDSTTKTKRVERCVFTIPSPFEKVEIVSNSSSPDTLLPYLLAWDSNIFDFTTYGLFSSDKIIFNNNITVTTRNMYSSSDITLRSDNNRPGDYTIKADNIIVKNGSFIFGGNNKVVVNNLMYTKNGITFNGNNNRLESNSLLFSDGTISLSGKDEIVANALFCDTLDIRNGSSNLVTINEFAYFNKLNIWTDKMVLKSNSKLFGGDIEIRNDGILSADVGTVVYANNLDIIGSSATIDAPDTVLYCNNLKIDGEVKLNVKKIVCSGTITISNLNSGTNIRVSDKIECRSIPQNIPSGIRNLFVQNPNVNFQIPYPTIPAIIEEIKKNTFPTNWIRLDNIVEDKKDINGANYYSLVSTGQNSNDINEIFNKNKPNNPHSNVQIFVITKSGINVPPDQNHLDGVLIANGSLQFNGGNLNIEYVRMPQPLIDYLLSKNIIKIENVQPPVISNPTVTFLPRDVNLFIIARHFVVK 603 T 0.00019 PilX_N unphh F Bacteria T 6n3a 2 K,L,M,N,O,P,Q,R,S,T K,L,M,N,O,P,Q,R,S,T TADBP_HUMAN segA long small GMLASQQNQS 10 T 0.29 Glucosaminidase unppercent F Eukaryota T 6n3c 1 A,B,C,D,E,F,G,H,I,J,K,L,M,N,O,P,Q,R,S,T A,B,C,D,E,F,G,H,I,J,K,L,M,N,O,P,Q,R,S,T TADBP_HUMAN TDP-43, SEGB QGGFGNSRGGGAGLGNNQGSNMGGGMNFGEFSINPAMMAAAQAALQ 46 T 0.29 Glucosaminidase unppercent F Eukaryota T 6n3e 2 B B SF3B1_HUMAN SF3b1 U2AF ligand motif NRWDETP 7 T 0.022 SF3b1 pdbhh F Eukaryota T 6n61 8 I I Capistruin GTPGFQTPDARVISRFGFN 19 T 4.4 P_C10 pdbhh F T 6n68 1 A A PROTO_AGEPP AGELAIA-CHEMOTACTIC PEPTIDE,AGELAIA-CP ILGTILGLLKGLX 13 T 0.47 DUF445 unphh F Eukaryota T 6n7o 1 A,B B,A Q7WSG2_9VIRU GIL01 gp7 GSMRDKLLDFIIELSQSSKQVVSKSYVIDRLMQVTKEDYKELEKNVEGKKDD 52 T 14 SpoOE-like pdbhh T Viruses T 6n7p 8 H H SNU71_YEAST U1 small nuclear ribonucleoprotein component SNU71,U1 small nuclear ribonucleoprotein component SNU71,Snu71 MRDIVFVSPQLYLSSQEGWKSDSAKSGFIPILKNDLQRFQDSLKHIVDARNSXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 105 T 1.1 PUD1_2 pdbpssm F Eukaryota T 6n7q 2 B C RON2 peptide CWTTRMSPPMQIP 13 T 3.2 Antimicrobial23 pdbhh F T 6n7r 8 H H SNU71_YEAST U1 small nuclear ribonucleoprotein component SNU71,U1 small nuclear ribonucleoprotein component SNU71,Snu71 MRDIVFVSPQLYLSSQEGWKSDSAKSGFIPILKNDLQRFQDSLKHIVDARNSXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 89 T 0.2 Vps5 pdbpssm F Eukaryota T 6n8c 1 A,B,C,D A,B,C,D HD_HUMAN HUNTINGTON DISEASE PROTEIN,HD PROTEIN ATLEKLMKAFESLKSFQQQQQQQ 23 T 2 Mito_fiss_reg unphh F Eukaryota T 6n9t 2 B,D E,F Photo-affinity peptide DCAWHLGELXWCT 13 T 0.46 YbjM pdbhh F T 6ncl 1 A a0 Q98550_PBCV1 P14 MQTPSIIQCGLLNSFARKMTDAISDNQIIATSRFFNIARDVADVVVSNTKLAQQYEQLSIDSLKEYLVSVAKFVAVDYSNTTSADVDDLIHKLRLFIEEECYQYNIDKEETCDGDVCVSDEYNEPAPKPKPKPKPKPAPKPKPAPKPKPAPKPAPKPAPKPAPKPAPKPAPKPAPKPAPEPAPEPAPEPAPKPAPEPAPEPAPIRPARRCDENPSNLETCCTNKALYGDFTDSSCDIVKKKTNWWLWGGIAILVIVLMIGGYFIYKRYFSAPKFENTGEFVNDMNFNNDVNFNNDVNFDNDMNYGNEGIDVSDLEILNLPVPSVSPVPSASIVPSVSPIPRGSPVPSASPIK 352 T 0.0076 Trypan_PARP pdbpssm T Viruses T 6ncl 3 C,D a2,a3 Q98505_PBCV1 P10 MMNFILVLLIVAMIGTILVSESKYLFSKPVCKNCGVKAVTLPVDISAGKLAKVAEAVKKQTEEIKTLLKQKQSAPKAPELTNPIEHIKASTTVVSGANGLENVIDEDLPFSDFKGVPVAETTVEGMIKGIRPPTYADPRVMNPALAAAPVQFSDPTQFGTFGVTDDVSPAFSTEDKIPKTNAKISSDISVEGYENSYDANGARLVMDGKVVKSECQLPSYQIRNSKHHTQLPMRSLNEPPPMVEDLVDESLFEGLQGYPVDEKLDLLTPPGTATPSSEWAAINYGLTNN 289 T 0.11 DUF4330 pdbpssm T Viruses T 6ncl 4 E a4 Q84580_PBCV1 P7 MQIYSEYYEKIGPRKLRLLVKRRLLFASTWLYINNYILLSIIMKLQTKHMILLGFVAVVVVFIIFMLTRKKKEGFSIGNIFGKVKGAVTGTVGKVVNVVKPQGYKPEFVNRVNFGKFWACPEGTTDWGSEDKQCLVSQYGPMMWRNKGGNEWGWSCPAGSAPNNSDDWNQKCVQGYSMKKLIDGQWRCTDTEIDTGKDWSNSDWFTAQQQCDRGNNKVFTRRMYIDGKWQCPDGTWDTGFTWSDGENGGKQCKYYP 256 T 0.014 Wzy_C pdb T Viruses T 6ncl 5 F a5 Q84523_PBCV1 P6 MILVGIAVLILLAVFAILYYKQKEKFVVVGKFVEPIPSNPGQDFTLLPMDQTYTFADPVPDTATAFDVVLSRFTDKKAPADLLKGATFPEAAPYTDSEVENISKLALSRVKGPDAPVLSFISVEYAAKGVDNKKNTHYDIAFMVYDQVKNFSLKLVLVAVLDAKNKLWIKKFSSFNSFTPKDKGPKGVENIDETPLAEFIPDFVQFSRLYKDNANV 216 T 3.7 Mid2 pdbhh T Viruses T 6ncl 6 G a6 Q84626_PBCV1 P1 MVETTQHFVSIESSNRPDPANTTPANYSIQLPQRYRNIWSAMLVNIALPAVSPPQKYVYLDIDKLNSIDSTSPSGGVNFALAKIPLSIAGTGNVFFADTMTSSFPNVPLQNPVATMDKLNIKLKDANGNVLTIPAGNEHSFMIQLTCGDYIPRGGGSTITQNGRVLGGTR 170 T 1.8 DUF2433 pdbhh T Viruses T 6ncl 7 H a7 Q84459_PBCV1 P12 MGNGPPMERAVSSDDILTYYNTFIFFIYFNFTNENIYIIYTIYMKVQNTIVYIVLLLIVVVIIWNFTRKEGWSDYNAPNDFMKIYYSNIVEDKKLAEKYPFFGTGPFTGLRCRKPNNVGCNTTWVSGQLVELTPKLKEQIECKFGIQYVKT 151 T 0.042 ID pdbpercent T Viruses T 6ncl 8 I a8 Q98576_PBCV1 P5 MDSRLSAAYAIRAARISMIPGGVDGLVINYAEGGEPAWVQYPLKKQKPLPNNLCYTPTLEDIARKREAVIAKYTKQPLETGTTFTHVLNASHLNEQYTRVKKSALPDKEFPIIETEKYPEPPILWETTIGAPSRLFDRSDGVKYVR 146 T 8.9 XRN1_DBM pdbhh T Viruses T 6ncl 9 J,K,KD,L,M,N,O,P,R,S,T,U a9,b0,l5,b1,b2,b3,b4,b5,b7,b8,c0,c1 Q84666_PBCV1 P11 MDMHMIVKVVAILAVLFLVYKLWESMNKPNASPLKIQNPYEKYMNSAEGGEYDAEDDDIYYPETDAEDDDIYTGETDDMYDGEDDDIYVQEGDDIEDAEDEPYDDSADMEQDVPKVQQPMMPLLTPSSQLLPKPSPEAADFAQFAPKNLQAQNFLTATQWIGVNTQGSSLKNANYDLRADPIIPKADVGPWMMSSVDPNIYQKPLFG 207 T 0.14 FeoB_associated pdbhh T Viruses T 6ncl 11 V,W,X,Y c2,c3,c4,c5 O41054_PBCV1 P4 MFSAFRDTASIGFSDTHQDEKTLRFLKKQISQFIKHLKEYYPNNELTKKLVMKYSDVQLLPYTKGATKDTYTSGLFDHTTGVIKIAPRDGLGNVRDEQSLNKSICHELAHGTRVKYPGESSHSDEWKDAWKTFLKIAADELGWKIEVPCSSVSFYGLTKDDCENCVWDQDPETCPKTAKLA 181 T 0.002 WLM pdbhh T Viruses T 6ncl 12 AA,BA,Z c7,c8,c6 Q98573_PBCV1 P3 MAMKTQRKENVLFQNVKPREIPLVDNPFSTYPYKHVITETQPTQAKNQAIWGLVQMGLSGEAAAMYGDVVVQKTTRACRKSEGGFKDVNTELWGTSPYLGRGDGEVYNMPASNQLLRGFESSLRGSRVRTQIDDKSFIPYTWQMIDVPLAAAKTSFIAGLDTRQQLAYGNP 171 T 3.2 B_solenoid_ydck pdbhh T Viruses T 6ncl 15 JD l4 Q98473_PBCV1 P13 MHKITPFLIAAVVAVIVLAVWLFKKDNKKETWFSRDLNYGKANSKIWNATVAKGLKGIANENAEIRKMYPYLGYGDFTGAICKGPNNQGCTYYANYTR 98 T 0.0083 DUF4381 pdbpssm T Viruses T 6nd4 5 E I UTP8_YEAST Utp8 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXRLFKQAIVTCPNLPLNELLEELFSIRNRELLLDISFRILQDFTRDSIKQEMKKLSKLDVQNFIEFITSGGEDSSPECFNPSQSTQLFQLLSLVLDSIGLFSLEGALLENLTLYIDKQVEIAERNTELWNLIDTKGFQHGFASSTFDNGTSQKRALPTYTMEYLDI 519 T 8.500000000000002E-245 Utp8 unppssm F Eukaryota T 6ndy 2 F G Designed Cyclic Peptide GGDEIVNKVLGGSSGGXXXXXXXXGGKGCK 30 T 19 Eno-Rase_NADH_b pdbhh F T 6nf2 1 A,G,Q A,G,Q Q2N0S6_9HIV1 Envelope glycoprotein gp120 AENLWVTVYYGVPVWKDAETTLFCASDAKAYETEKHNVWATHACVPTDPNPQEIHLENVTEEFNMWKNNMVEQMHTDIISLWDQSLKPCVKLTPLCVTLQCTNVTNNITDDMRGELKNCSFNMTTELRDKKQKVYSLFYRLDVVQINENQGNRSNNSNKEYRLINCNTSACTQACPKVSFEPIPIHYCAPAGFAILKCKDKKFNGTGPCPSVSTVQCTHGIKPVVSTQLLLNGSLAEEEVMIRSENITNNAKNILVQFNTPVQINCTRPNNNTRKSIRIGPGQAFYATGDIIGDIRQAHCNVSKATWNETLGKVVKQLRKHFGNNTIIRFANSSGGDLEVTTHSFNCGGEFFYCNTSGLFNSTWISNTSVQGSNSTGSNDSITLPCRIKQIINMWQRIGQCMYAPPIQGVIRCVSNITGLILTRDGGSTNSTTETFRPGGGDMRDNWRSELYKYKVVKIEPLGVAPTRCKRRVVGRRRRR 480 T 3.4E-54 GP120 pdbpercent T Viruses T 6nfj 3 C,F C,F FGF19_HUMAN FGF-19 PMLPMVPEEPEDLRGHLESDMFSSPLETDSMDPFGLVTGLEAVRSPSFEK 50 T 4.6 Mrx7 pdbhh F Eukaryota T 6nfw 1 A A POLG_PVYN VPg GKNKSKRIQALKFRHARDKRAGFEIDNNDDTIEEFFGSAYRKKGKGKGTTVGMGKSSRRFINMYGFDPTEYSFIQFVDPLTGAQIEENVYADIRDIQERFSEVRKKMVENDDIEMQALGSNTTIHAYFRKDWSDKALKIDLMPHNPLKVCDKTNGIAKFPERELELRQTGPAVEVDVKDIPAQEVEHE 188 T 0.1 DUF4447 pdb T Viruses T 6nhw 1 A,B,C,D,E,F A,B,C,D,E,F TR10B_HUMAN DEATH RECEPTOR 5,TNF-RELATED APOPTOSIS-INDUCING LIGAND RECEPTOR 2,TRAIL-R2 MPGSLSGIIIGVTVAAVVLIVAVFVCKSLLWKKVLP 36 T 0.16 Psg1 pdbpssm F Eukaryota T 6nhy 1 A,B,C A,B,C TR10B_HUMAN DEATH RECEPTOR 5,TNF-RELATED APOPTOSIS-INDUCING LIGAND RECEPTOR 2,TRAIL-R2 MPGSLSGIIIYVTVAAVVLIVAVFVCKSLLWKKVLP 36 T 0.016 Psg1 pdbpssm F Eukaryota T 6ni2 5 E V V2R_HUMAN V2R,AVPR V2,ANTIDIURETIC HORMONE RECEPTOR,RENAL-TYPE ARGININE VASOPRESSIN RECEPTOR ARGRTPPSLGPQDESCTTASSSLAKD 26 T 16 DUF6352 pdbhh F Eukaryota T 6nid 2 D,E,F D,E,F NRX1A_HUMAN NEUREXIN I-ALPHA,NEUREXIN-1-ALPHA KKNKDKEYYV 10 T 8 Topo_Zn_Ribbon pdbhh F Eukaryota T 6nii 1 A,B B,A Uncharacterized protein RavD GPLGSMNLKAEVFLNQNCAEMMIKKAAQLILGSDLDFEYTRGVQDIQVDLGPAFMFSPDEEKTLWVSGKNQETLEKDLATLNKSSVYFFRTGTQGGAGHWQVLYYEAAKSGWVSYSSQSNHFQVTDSNGKLTASGKGLLVPHANWGKENGNYAFLLVNASAENIIHAANFVYILRTQNEVAAIEYCALNHEFHPEIKRTARAKAE 205 T 16 DUF2846 pdbhh F T 6nj8 2 E,F,G E,F,G targeting peptide TVGSLIQ 7 T 6.6 Chlorosome_CsmC pdbhh F T 6njd 2 B,D B,D A0A509GV61_LEGPN RavD GPLGSMNLKAEVFLNQNCAEMMIKKAAQLILGSDLDFEYTRGIQDIQVDLGPAFMFSPDEEKTLWVSGKNQETLEKDLATLNKSSVYFFRTGTQGGAGHWQVLYYEAAKSGWVSYSSQSNHFQVTDSNGKLTASGKGLLVPHANWGKENGNYAFLLVNASAENIIHAANFVYILRTQNEVAAIEYCALNHEFHPEIKRTARAKAE 205 T 19 SWC7 pdbhh F Bacteria T 6njv 1 A A B0RTN2_XANCB Xcc_CTR_I RDEEDLQRYIDVTRGEIFFSRGVILVEGDAERFIVPAFAEVLNIPLDMLGITVCSVGGTNFTPYVKLLGPEGLNIPHVILTDRDPTNGNHPLVRRRLINVLDVIEGGVDHEELDADEVIKLAEQYGYFVNENTLEPELFAGGLAEDMQEVIREELPRLRRETLNALQQWVDDPAQIDEDLLLRLIERIGKGRFAQALAPSVSEDVCPAYIRSALEHIRDAIALEHHHHHH 230 T 0.0022 DUF3226 pdbhh F Bacteria T 6njz 2 C,D C,D YSA-GSGSK-bio peptide YSAYPDSVPMMSGSGSK 17 T 6.1 DUF4810 pdbhh F T 6nk0 2 C,D C,D bA-WLA-Yam XWLAYPDSVPYX 12 T 4.1 DUF3052 pdbhh F T 6nk1 2 C,D C,D bA-WLA-YRPKbio XWLAYPDSVPYRPK 14 T 6.5 DUF3052 pdbhh F T 6nk9 1 A A Aca Toxin 1 CGGAGAKCSTKSDCCSGLWCSGSGHCYHRRYT 32 T 1.2E-05 Toxin_30 pdbhh F T 6nkp 2 C,D C,D bA-WLA-YSKbio peptide XWLAYPDSVPYSK 13 T 3.9 FXR_C1 pdbhh F T 6nl1 1 A A B6SBM0_9TRYP Mitochondrial edited mRNA stability factor 1 GSHMRKQLFFTLARPCVAVGRRFISGDNKSIDSSAFISDDDALRGELASALDTEGHALPFDVHLQQPHSSGDGTAGDTSTIQLEKLSHPPARFDLLTNSFVYKWQTKAALARKVSGPMREWAAELKYRTGVHIELEPTYPERLSENAVKGSGSDDGDGTQWGAYETADDVDITVYLFGSERGIFNCHKLMEAAIQQDPVYVRLGIFRRLANSSEVEWLMLRRINRELRPPDIPPISLKLPGKWTLLYERYKEAAIRTLWEETGITVDASNVYPTGHLYQTVPQYYWRVPVRYFVAEVPSDIRVEGPQVVPLQYMRNWDARLLRQSPDPIDRAWAQLADPATGCAWMKASMIDQLQKPLRGDNYMAIRYTPPPYSNLQEVVGLGDGSITPSTGNGEDAS 398 T 0.00034 NUDIX pdb F Eukaryota T 6nm2 1 A A WW291 peptide WWWLRKIWX 9 T 0.32 DUF6273 pdbhh F T 6nm3 1 A A WW295 peptide RKIWWWWLX 9 T 0.57 DUF5976 pdbhh F T 6nmc 3 C,D B,C A0A5H1ZR46_9GAMM AcrVA1 SKAMYEAKERYAKKKMQENTKIDTLTDEQHDALAQLCAFRHKFHSNKDSLFLSESAFSGEFSFEMQSDENSKLREVGLPTIEWSFYDNSHIPDDSFREWFNFANYSELSETIQEQGLELDLDDDETYELVYDELYTEAMGEYEELNQDIEKYLRRIDEEHGTQYC 165 T 0.0032 ZnuA pdbpssm F Bacteria T 6nmd 3 C B A0A5H1ZR47_9GAMM AcrVA1 MSKAMYEAKERYAKKKMQENTKIDTLTDEQHDALAQLCAFRHKFHSNKDSLFLSESAFSGEFSFEMQSDENSKLREVGLPTIEWSFYDNSHIPDDSFREWFNFANYSELSETIQEQGLELDLDDDETYELVYDELYTEAMGEYEELNQDIEKYLRRIDEEHGTQYCPTGFARLR 174 T 0.0043 ZnuA pdbpssm F Bacteria T 6nnv 2 E,F,G,H I,J,K,L macrocyclic peptide XFXXXDVXYXWYLCKX 16 T 0.51 CNPase pdbhh F T 6nox 1 A A SFTI-KLK5 Peptide GFCHRSYPPECWPN 14 T 1.1 Bowman-Birk_leg pdbhh F T 6nqw 1 A A B0STJ8_LEPBP Flagellar coiling protein A GSAKDQVDELLKGELVPENDDAELTEDQKKKKKEIMEQESLWKNPDFKGYNKTFQELHQLSKTFANNQFRLALSNYQSGVNTIMKNRDWVEQYRKEEAEKKRLDEKWYWQKVDRKAREERVVYREKMKAKQDALNYFSKAINHLDEIKNPDLRERPEFKRLLSDVYRSWIMAEYDLQNLPQTIPILELYIEIDDNEKEYPAHKYLASAYSFEENMIKKTKGPDDMLFKYRYKKNVHLLRATELKYGKDSPEYKHIVNVINRDEVISVAQ 269 T 0.0087 TED_complement pdbpercent F Bacteria T 6nqz 1 A,B A,B Q72RA0_LEPIC Flagellar coiling protein B GSGSQQNSGSDQKSQPSSAQLGQSILETERKLDEKIFELNQRLTRHTVLMKMKVRVLPFRTVLFKGKANNDECTPAINQEDPANNCIRVEVYDFIRDEERGLNKNVQGALAKYMEIYFEGQNSNDPEPRTEPPRNINKLKSKIYKNNMVLEDKIISEVMDRGPNTQPSHNDKVEVFFQKDNYPEYGRPETPAEKGVGKYILAGVENTKTHPIRNSFKKEFYIKHLDQFDRLFTKIFDYNDQLGNENYKENVDALKDSLRY 260 T 0.074 VTC unppssm F Bacteria T 6nsx 2 B B Hsh155 SRWDVK 6 T 33 DUF6507 pdbhh F T 6nu2 46 TA l RM54_HUMAN MRP-L54,MITOCHONDRIAL LARGE RIBOSOMAL SUBUNIT PROTEIN ML54 SREYWRRLRKQNIWRHNRLSKNK 23 T 4.9 PPV_E2_N pdbhh F Eukaryota T 6nuw 7 G I CENPU_YEAST ASSOCIATED WITH MICROTUBULES AND ESSENTIAL PROTEIN 1,CENP-U HOMOLOG,CONSTITUTIVE CENTROMERE-ASSOCIATED NETWORK PROTEIN AME1 MDRDTKLAFRLRGSHSRRTDDIDDDVIVFKTPNAVYREENSPIQSPVQPILSSPKLANSFEFPITTNNVNAQDRHEHGYQPLDAEDYPMIDSENKSLISESPQNVRNDEDLTTRYNFDDIPIRQLSSSITSVTTIDVLSSLFINLFENDLIPQALKDFNKSDDDQFRKLLYKLDLRLFQTISDQMTRDLKDILDINVSNNELCYQLKQVLARKEDLNQQIISVRNEIQELKAGKDWHDLQNEQAKLNDKVKLNKRLNDLTSTLLGKYEGDRKIMSQDSEDDSIRDDSNILDIAHFVDLMDPYNGLLKKINKINENLSNELQPSLHHHHHH 330 T 0.0029 DUF1640 pdb F Eukaryota T 6nw8 1 A A A0A5H1ZR48_CENNO Cn29 LCLSCRGGDYDCRVKGTCENGKCVCGS 27 T 0.0047 EGF_2 pdb F Eukaryota T 6nxf 2 C,D V,U ANR31_MOUSE Ankyrin repeat domain 31 SSRESMQTIPHYLQIKEILQISKQELLPCHVMEQHWKFYVGRSHSEALLSW 51 T 15 DUF525 pdbhh F Eukaryota T 6nz2 1 A A BCD1_YEAST Box C/D snoRNA protein 1 GPHMRDSTECQRIIRRGVNCLMLPKGMQRSSQNRSKWDKTMDLFVWSVEWILCPMQEKGEKKELFKHVSHRIKETDFLVQGMGKNVFQKCCEFYRLAGTSSCIEGEDGSETKEERTQILQKSGLKFYTKTFPYNTTHIMDSKKLVELAIHEKCIGELLKNTTVIEFPTIFVAMTEADLPEGYEVLHQE 188 T 0.045 MobA_MobL unppssm F Eukaryota T 6o09 2 B,D,F,H,J,L I,B,E,G,J,L K7MRE7_SOYBN Uncharacterized protein YPLVQTKIIDFFRIQRSPEA 20 T 11 DUF1378 pdbhh F Eukaryota T 6o0c 1 A,B,C A,B,C Design construct XAA_GVDQ mutant M4L GSHLGDLKYSLERLREILERLEENPSEKQIVEAIRAIVENNAQIVEAIRAIVENNAQIVENNRAIIEALEAIGVDQKILEEMKKQLKDLKRSLERG 96 T 0.00067 PLU-1 pdbpercent F T 6o0i 1 A,B,C A,B,C Design construct XAA GSHMGTEDLKYSLERLREILERLEENPSEKQIVEAIRAIVENNAQIVEAIRAIVENNAQIVENNRAIIEALEAIGGGTKILEEMKKQLKDLKRSLERG 98 T 0.0013 KinB_sensor pdbpercent F T 6o1q 1 A A NPHP1_HUMAN JUVENILE NEPHRONOPHTHISIS 1 PROTEIN GPMAMLARRQRDPLQALRRRNQELKQQVDSLLSESQLKEALEPNKRQHIYQRCIQLKQAIDENKNALQKLSKADESAPVANYNQRKEEEHTLLDKLTQQLQGLAVTISRENITEVGAPT 119 T 0.012 DUF6100 pdbpssm F Eukaryota T 6o26 3 C C CSP_PLAFO CS PNRNVDENANANSA 14 T 54 Tir_receptor_M pdbhh F Eukaryota T 6o28 3 E,F E,F CSP_PLAFO CS KQPADGNPDPNANPN 15 T 0.25 PT unppercent F Eukaryota T 6o2k 1 A,B A,B Q9VHP9_DROME Centromeric protein-C, isoform A TPLRDEQEEASTKLMQWLRGVGDAPPSASMSDENASVSSANELIFCQVDGIDYAFYNTKEKAMLGYMRFKPYQKRSMKQAKVHPLKLLVQFGEFNVETLAVGEEKEVHSVLRVGDMIEIDRGTRYSIQNAIDKVSVLMCIRS 142 T 1.8E-05 CENP-C_C pdbhh F Eukaryota T 6o35 1 A,B,C,D A,B,C,D de novo designed WSHC8 GSSAEELLRRSREYLKKVKEEQERKAKEFQELLKELSERSEELIRELEEKGAASEAELARMKQQHMTAYLEAQLTAWEIESKSKIALLELQQNQLNLELRHI 102 T 0.024 MitMem_reg pdb F T 6o38 2 G,H,I,J,K,L G,H,K,L,M,N A0A2T7FJI6_ACINO Type II secretion chaperone CpaB MQSSSALTFSPESRQQSGAKMIESQNILNLSPSEKERLSQQQIVFNEVEKDQLHSKANFPLLKNAKGMVIKYDPKVIELKKVGDTVKFQMLEYGINRTGKIVEIEPVDQDIVRWTGRFDQGDPNQNFFTITQSQKDHYTIMQIFTEKGNYSAEIKDGVGLVQTMDEGVTDQELHHDHP 178 T 0.18 DUF2969 pdbpssm F Bacteria T 6o3h 2 H,I,J,K,L,M,N H,I,J,K,L,M,N A7XXR5_9CAUD P74-26 Head Decoration Protein MDKIQLFRTIGRVQYWERVPRLHAYGVFALPFPMDPDVEWGNWFAGPHPKAFLVSVHPSGPKAGHVYPTDLSDPDSVANVIGMVLDGHDYEADHNVTVTLRAAVPIEYVQQGIEAPPLQPDPAVLNAAPQLKLKVIKGHYFFDYTR 146 T 0.78 TRI9 pdbpssm T Viruses T 6o3n 1 A,B,C,D,E,F,G,H,I,J,K,L,M,N,O,P A,B,C,D,E,F,G,H,I,J,K,L,M,N,O,P Cross-alpha Amyloid-like Structure alphaAmA XSKLLELLRKLAEALHKAIELLEKWGX 27 T 1.4 BssS pdbhh F T 6o3w 1 A,B C,D SEN1_YEAST TRNA-SPLICING ENDONUCLEASE POSITIVE EFFECTOR EQEDDYKLPMEYIT 14 T 2.1 DUF3228 pdbhh F Eukaryota T 6o3x 2 D,E,F D,E,F SEN1_YEAST TRNA-SPLICING ENDONUCLEASE POSITIVE EFFECTOR DDEDDYTPSISD 12 T 1.9 CM1 pdbhh F Eukaryota T 6o3y 2 D,E,F D,E,F SEN1_YEAST TRNA-SPLICING ENDONUCLEASE POSITIVE EFFECTOR EAEDPYDLNPHPQ 13 T 1.1 Caveolin pdbhh F Eukaryota T 6o43 1 A A Q859J2_9CAUD Orf11 MNDQEKIDKFTHSYINDDFGLTIDQLVPKVKGYGRFNVWLGGNESKIRQVLKAVKEIGVSPTLFAVYEKNEGFSSGLGWLNHTSARGDYLTDAKFIARKLVSQSKQAGQPSWYDAGNIVHFVPQDVQRKGNADFAKNMKAGTIGRAYIPLTAAATWAAYYPLGLKASYNKVQNYGNPFLDGANTILAWGGKLDGKGGSPS 200 T 0.25 PPR_3 pdb T Viruses T 6o5l 1 A A L0A1P5_DEIPD PprA MRSGSHHHHHHRSDITSLYKKAGLENLYFQGREDALRGFDALMATAGVESTIVKHAASGADSQTLNDELTRSLQLAHDRWGLGLLHLRHEARLDRGEDTDVILLVDGREVARLSQGAAAISATYETMRAQNADDLSDWGVLPEGHRVTLKAGNNQMRVLVEDARDFETHWSSERGGAFVRTWRQGETLAVEVHRPASPGTALAKAAWKAIMSIKDRNFQRELMERSNSVGMLGALLGARHKDAGRALERLPEAHFAVRSTVVRMTGGAQREFDQWRSMVREGLDQLDELQKTTTRHLTEILRHGLK 306 T 0.46 ZapA pdbpercent F Bacteria T 6o5o 2 C,D C,D ACE-QNGFDNPNYQPQENMQA XQNGFDNPNYQPQENMQA 18 T 1 APP_amyloid pdbhh F T 6o7g 2 B A Histone H4 XGKGGAXRHRKVX 13 T 23 DUF4196 pdbhh F T 6o8c 2 C,D D,E STING_HUMAN HSTING,ENDOPLASMIC RETICULUM INTERFERON STIMULATOR,ERIS,MEDIATOR OF IRF3 ACTIVATION,HMITA,TRANSMEMBRANE PROTEIN 173 STWGSLKTSAVPSTSTMSQEPELLISGMEKPLPLRTDFS 39 T 9.9 Herpes_IE68 pdbhh F Eukaryota T 6o8p 1 A X Circular bacteriocin, circularin A/uberolysin family AGKEKIRKKLKNEIKKKGRKAVIAW 25 T 0.00084 Bacteriocin_IId pdbhh F T 6o8r 1 A X Circular bacteriocin, circularin A/uberolysin family AWKEKIRKKLKNEIKKKGRKAVIAW 25 T 0.0011 Bacteriocin_IId pdbhh F T 6o8s 1 A X Circular bacteriocin, circularin A/uberolysin family AGKEKIRKKLKNEIKKKWRKAVIAW 25 T 0.0011 Bacteriocin_IId pdbhh F T 6o8t 1 A X Circular bacteriocin, circularin A/uberolysin family AWKEKIRKKLKNEIKKKWRKAVIAW 25 T 1.6 Bacteriocin_IId pdbhh F T 6o9b 3 C C CTNB1_HUMAN BETA-CATENIN TTAPSLSGK 9 T 6.2 LEA_6 pdbhh F Eukaryota T 6o9c 3 C C CTNB1_HUMAN BETA-CATENIN TTAPFLSGK 9 T 20 Fst_toxin pdbhh F Eukaryota T 6obi 1 A A A0A5H1ZR50_MERUN Myosin-VI KQQEEEAERLRRIQEEMEKERKRREEDEQRRRKEEEERRMKLEMEAKRKQEEEERKKREDDEKRIQAE 68 T 9.6 Caldesmon pdbpssm F Eukaryota T 6obk 1 A A D3WAF4_BPLP2 Uncharacterized protein ORF47 MNKEHILAQKEVLTPIEYEHYVKHLFDIGEITKELYIELSSDL 43 T 2.1 DUF6442 pdbhh T Viruses T 6ocp 2 P,Q,R P,Q,R GABR2_HUMAN GB2,G-PROTEIN COUPLED RECEPTOR 51,HG20 QLPILHHAYLPSIGG 15 T 30 UCH_C pdbhh F Eukaryota T 6ocx 2 E,F,G,H F,H,J,L Peptide inhibitor UNC10245109 DGGSFWYRAMKALYG 15 T 1.1 OCIA pdbhh F T 6od0 2 C,D D,E Peptide inhibitor UNC10245092 SFWYGAMKALYG 12 T 6.2 DUF5806 pdbhh F T 6od2 1 A A SPC42_YEASB Spindle pole body component SPC42 SDDDIMMYESAELKRVEEEIEELKRKILVRKKHDLRKLSLNNQLQELQSMMDG 53 T 0.11 TPD52 pdb F Eukaryota T 6oe6 1 A,B A,B Q72Q74_LEPIC Uncharacterized protein MAHHHHHHCFKPTGEFGWVLLDEEKFNIIEKKIMTVGEYTITRKNLIFPDDKTICYIYRFSRSVSESAETYVSLSKFQLGYNEMDVLRKRPNPVSQTIEGSFQGLSPGKYLLKVAYEGDVIDEVEFLVRSTRTPYIEDTSSSADDIEKAMK 151 T 1.3 DUF4969 unphh F Bacteria T 6ofa 1 A A KKX1U_UROMN Wasabi Receptor Toxin ASPQQAKYCYEQCNVNKVPFDQCYQMCSPLERS 33 T 0.15 OATP unppssm F Eukaryota T 6ohz 1 A A Q04PE5_LEPBJ Uncharacterized protein MAHHHHHHMTEIDDLLRKNPELQKEWKRTVWTAAISSGVIAYRPPLLERAFREFPMETAKSALNLFVAAHKSKNRQSVDIITQNLKDAKTFPLGQLEEEIVTDILKYPNLLEKLLQTGWNPNLILEWEKHKSLSQNSKRSHRRPEILIKSNGKEFIEKQETTLLILAMQNDFIPMETVQILLKYGADPSLGVKRKSEGKEYLLYPLANINSNGNTILKELKQKTLIDWKK 230 T 0.098 Spore_III_AB unppssm F Bacteria T 6oi4 3 C,F E,F PSMD1_HUMAN 26S PROTEASOME REGULATORY SUBUNIT RPN2, 26S PROTEASOME REGULATORY SUBUNIT S1, 26S PROTEASOME SUBUNIT P112, RPN2 PQEPEPPEPFEXID 14 T 6.5 PrmC_N pdbhh F Eukaryota T 6ois 2 C,D,E,F C,D,E,F DMS3_ARATH PROTEIN INVOLVED IN DE NOVO 1 MADLYPTGQQISFQTTPLNVQDPTRMMNLDQSSPVARNETQNGGGIAHAEFAMFNSKRLESDLEAMGNKIKQHEDNLKFLKSQKNKMDEAIVDLQVHMSKLNSSPTPRSENSDNSLQGEDINAQILRHENSAAGVLSLVETLHGAQASQLMLTKGVVGVVAKLGKVNDENLSQILSNYLGTRSMLAVVCRNYESVTALEAYDNHGNIDINAGLHCLGSSIGREIGDSFDAICLENLRPYVGQHIADDLQRRLDLLKPKLPNGECPPGFLGFAVNMIQIDPAYLLCVTSYGYGLRETLFYNLFSRLQVYKTRADMISALPCISDGAVSLDGGIIRKTGIFNLGNRDEVNVRFAKPTASRTMDNYSEAEKKMKELKWKKEKTLEDIKREQVLREHAVFNFGKKKEEFVRCLAQSSCTNQPMNTPRGTLESGKETAAAKFERQHMDSSTSAA 449 T 0.001 DUF724 unp F Eukaryota T 6oit 3 G G CHR35_ARATH PROTEIN DEFECTIVE IN MERISTEM SILENCING 1,PROTEIN DEFECTIVE IN RNA-DIRECTED DNA METHYLATION 1 GEFFAVSNMLEALDSGKFGSVSKELEEIADMRMDLVKRSIWLYPSLAYTVFEAEKTMDGGGGSDYKDDDDK 71 T 5.4 CSTF2_hinge pdbhh F Eukaryota T 6oj0 1 A,AA,B,BA,C,CA,D,DA,E,EA,F,FA,G,GA,H,HA,I,IA,J,JA,K,KA,L,LA,M,MA,N,NA,O,OA,P,PA,Q,R,S,T,U,V,W,X,Y,Z T,J,U,K,V,L,W,M,X,a,G,b,n,c,o,d,p,e,q,f,r,A,g,B,N,C,O,D,P,E,Q,F,R,S,h,i,j,k,l,m,H,I A0A1W6I187_9VIRU Structural protein VP4 MSESVTQQVFNFAVTKSQPFGGYVYSTNLTASTSSAVTSTQLTPLNLSITLGQITLSGNSLVIPATQIWYLTDAYVSVPDYTNITNGAEADGVILIYKDGVKLMLTTPLISSMSISNPARTHLAQAVKYSPQSILTMYFNPTKPATASTSYPNTVYFTVVVVDFSYAQNPARAVVSANAVM 181 T 32 DUF1684 pdbhh T Viruses T 6oj0 2 QA Z A0A1W6I162_9VIRU Uncharacterized protein MLSLDNYSYVHNITTQTNIDLSSQQTIHLASINGKGYIIFLRFFCEGSSACFTNVKFSVKANGLVLYSFRYIQLLELGQAIATAIPSSSQGFSTLLSNYNVLISSPIGTLPQLTLYDSYDNRYGAMLQPAFPLPFVNTLSLDVDILPVSQSSYDPIPYSLNDNQISTNAPTGKGNISIEYLLYNCLV 187 T 7.3 Class_IIIsignal pdbhh T Viruses T 6ole 83 EC y CADH1_HUMAN CAM 120/80,EPITHELIAL CADHERIN,E-CADHERIN,UVOMORULIN GVCRKAAQPVEAGLQIPAILGILGGILALLILILLLLLF 39 T 0.016 ASFV_J13L unphh F Eukaryota T 6olg 85 GC A CADH1_HUMAN CAM 120/80,EPITHELIAL CADHERIN,E-CADHERIN,UVOMORULIN GVCRKAAQPEEAGLQIPAILGILGGILALLILILLLLLF 39 T 0.016 ASFV_J13L unphh F Eukaryota T 6olo 1 A A Designed trimeric coiled coil peptide XQIAAIKXAIAAIKQQIAAIKEAIAAIKQX 30 T 0.092 DUF5320 pdbhh F T 6olz 82 DC A PCSK9_HUMAN NEURAL APOPTOSIS-REGULATED CONVERTASE 1,NARC-1,PROPROTEIN CONVERTASE 9,PC9,SUBTILISIN/KEXIN-LIKE PROTEASE PC9 SWWPLPLLLLLLLLLGPAGARAQEDE 26 T 0.55 Chi-conotoxin pdbhh F Eukaryota T 6oni 2 B D NCOR1_HUMAN NCOR isoform c DPASNLGLEDIIRKALMGSFDDK 23 T 3.3 RuvA_C pdbhh F Eukaryota T 6opd 3 C C Melanoma antigen variant ILNAMIVKI 9 T 7.1 DUF4408 pdbhh F T 6oqp 1 A A SER-LYS-TRP-ILE-CYS-ALA-ASN-ARG-SER-VAL-CYS-PRO-ILE SKWICANRSVCPI 13 T 2.1 Sprouty pdbhh F T 6ori 1 A A Q9Z4N7_ENTFL SURFACE PROTEIN NAQMGEGRLANYSASGNTFQENPGYTKNYNFSDLQFNPKAITGDVLQGNTIDFEVYGKHNIAASTANWEIRLQLDERLAQYVEKIQVDPKKGVGNSRRTFVRINDSLGRPTNIWKVNYIRANDGLFAGAETTDTQTAPNGVITFEKNLDEIFKEIGADNLKSDRLMYRIYLVSHQDDDKIVPGIESTGYFLTDQDDFYNKLDVSENNSDQFKHGSVNTKYEEANIQTKDGSGSTGANGAIILDHKLTKEKNFSYSTSAKGTPWYANYKIDERLVPYVSGIQMHMVQADKVAYNVAFESGKKVADLAIERREGHENYGMGSITDNDLTKLIDFANASPRPIVVRYVLQLTKPLDEILEEMKAADKIEENAPFGEDFIFDSWLSDTNKKLIQNTYGTGYYYLQDIDGLEVLFQ 411 T 0.55 ARL6IP6 pdbpssm F Bacteria T 6orj 1 A A Q8SCZ8_BPDPK PHIKZ164 MDEAVSLLSNMQDSEIQTSEFRLWSIGRATENKPRNSFTLMVLPIESATATDGETTFNPVEEVVDGVDADGRAYTTKVSVSRDIPCIWLPNEDNRATPPDVMRGEKIAIYRLGDTSQFYWRSMGLSNDLRTLESVVYTFNASLSPGGAGKNFDTCYFMQFSAHDKHVTIGTSKANGEPYRYSVQINTGTGAVYILDDIGNRFELVSKDKRLMLMNADNSFVKVEKKAIDLNADQYIKLTSGGSTLELNPTEFKVNTTNTTIKSSGTHIQEAGGTMTHKAGGNMLFTAPRYDFT 293 T 0.0017 DUF2345 pdbpercent T Viruses T 6os1 3 C B TRV023 peptide XRVYKHPA 8 T 9.5 DUF3782 pdbhh F T 6os2 3 C B TRV026 peptide XRVYYHPX 8 T 1.5 VEFS-Box pdbhh F T 6osw 1 A A Q7T2G3_DANRE FORKHEAD BOX M1-LIKE GEFMRESPRRPIILKRRKLPFAKSTARSFPDGIRVMDHPTMPDTQVVVIPKSADLQSVISVLTAKGKEAGPQGRNKFILLSGDTSAEEENLYFQ 94 T 0.042 uDENN pdbpssm F Eukaryota T 6osw 2 B B Q7T2G3_DANRE FORKHEAD BOX M1-LIKE GAQAGAANRSLTEGFVLDTMNDSLSKILVDISFSGLEDEDLGMGNISWSQFIPEAK 56 T 8 LRR_RI_capping pdbhh F Eukaryota T 6ov6 1 A,B,C A,B,C G2EBB4_9FLAO C24 PROTEIN MNKQNFLQTGGFPLETDTLNAMQEAYSVFNALGELAGNKAIIKGCVVSGSTTTDGVVYINGEVFKFVGGQTQSRVKILETSTSKEFEDGSTNAVHFERYVTFASGTGSISWAEFAKLTTLRELSRRLLPAGTNPQLYSGSVNNIPSGWQLCDGTNGTENLKGSFIVGYDPNDSDYNAIGKVGGTKKVTPSGNLDSRSINVTVPRDGWSTFGSGLGAVKSGRIVVGSGQQENSEYLESLRASGIDRTLTSTPHSHTFTGNQQDNRAPYYTLAYIIYIG 277 T 0.058 DUF859 pdbhh F Bacteria T 6ov7 2 C,D C,D kCAL01 peptide ANSRWQVTRV 10 T 2.1 DUF6245 pdbhh F T 6ovf 2 C,D C,D STA03 XQNGFDNPNYQPQ 13 T 0.34 APP_amyloid pdbhh F T 6ox6 2 B B A0A090A233_PSEAI PA14_01140 MAIEKGEAFARRDIYIDYDFEDVTYRWDHRQGTIHVRFYGEAESPEPVEHDNRLFNDALRFGREITREEYETGFPKG 77 T 7.9 LSM14 pdbhh F Bacteria T 6oyl 2 B B KIF4A_HUMAN CHROMOKINESIN-A GHMELKHVATEYQENKAPGKKKKRALASNTSFFSGLEPIEEEPE 44 T 0.03 DUF3584 unphh F Eukaryota T 6p0f 1 A A C5A3Z3_THEGJ GTPase subunit of restriction endonuclease MENQLFIIGIGTGTDEYENFEETILKGVKRNELEGQIGPDILDNCCSDVCYFWGRSKETIYEKKIDKGDMVLFYVGKRISRNKVDLNQETAVYLGIICETVEISENDVSFLNDFWRKGENFRFLMFFKKKPEKLHHSINEINSKLGYNPDYFPIAGYVKPERMSGVYDILKNILKKRGILKESDS 185 T 62 Endonuc-EcoRV pdbhh F Archaea T 6p23 3 C C MHC I-peptide RXRAAAKKKYCL 12 T 1.5 SBP_bac_10 pdbhh F T 6p2c 3 C C MHC I-peptide RXRARARARARAAAKKKYCL 20 T 0.054 TCP pdbhh F T 6p2f 3 C C MHC I-peptide RXRARAAAKKGYCL 14 T 3.9 KGG pdbhh F T 6p2s 3 C C MHC I-peptide RXAAAKKKYCL 11 T 2.9 RNF111_N pdbhh F T 6p5b 1 A A Q5ZTL4_LEGPH MavC GPLGSMTTSKLEKTGLHVHEKIKHMVKNYGTMITGIPAEILGQNEAEISVGYVKKMGNMKENIAEVVRKSEMTQPTNSAGKASNEVCDLLLGTEGASEFEKSSYQVLSGDGSNLKGSLPNKNLLVRVEMDRFNAPQKYQKIKREEFNPETAEKNKIYLLEDQLVYLDIFGKVIDLGQTSDTCHRLFNAITTPFYQNYILYDEYIDPEESAEEAAMFEMGEIVKAKMKNIDCWTATHSFTIFVPESDSEDTRTLYPYQAYWTSHTLQQWFSGDKDEKLSRLGIDGYIEKLALLGTTTDSKIRSSIYGELFSPPGKEHVFCTGMNEKFSPLRVKFKVTEVNPEIALQNLEEVQEFIDTNYPGENAKDQCELYKIKAQEAMTKQLEMRLLIE 389 T 0.035 V-ATPase_H_N unppercent F Bacteria T 6p5h 1 A,B A,B Q5ZTL4_LEGPH MavC GPLGSRFNAPQKYQKIKREEFNPETAEKNKIYLLEDQLVYLDIFGKVIDLGQTSDTCHRLFNAITTPFYQNYILYDEYIDPEESAEEAAMFEMGEIVKAKMKNID 105 T 0.08 HUN pdbpssm F Bacteria T 6p5r 1 A A B6SBM0_9TRYP Mitochondrial edited mRNA stability factor 1 MGSSHHHHHHSSGLVPRGSHMDDALRGELAMGSSHHHHHHSSGLVPRGSHMDDALRGELASALDTEGHALPFDVHLQQPHSSGDGTAGDTSTIQLEKLSHPPARFDLLTNSFVYKWQTKAALARKVSGPMREWAAELKYRTGVHIELEPTYPERLSENAVKGSGSDDGDGTQWGAYETADDVDITVYLFGSERGIFNCHKLMEAAIQQDPVYVRLGIFRRLANSSEVEWLMLRRINRELRPPDIPPISLKLPGKWTLLYERYKEAAIRTLWEETGITVDASNVYPTGHLYQTVPQYYWRVPVRYFVAEVPSDIRVEGPQVVPLQYMRNWDARLLRQSPDPIDRAWAQLADPATGCAWMKASMIDQLQKPLRGDNYMAIRYTPPPYSNLQEVVGLGDGSITPSTGNGEDAS 410 T 0.00034 NUDIX unp F Eukaryota T 6p64 5 I,J C,H HHAT_HUMAN Neoantigen peptide KQWLVWLFL KQWLVWLFL 9 T 0.54 DUF446 pdbhh F Eukaryota T 6p6e 2 B,C B,C PACC_NEUCR PAC3 NLS FDARKRQFDDLNDFFGSVKRRQIN 24 T 1.1 FKS1_dom1 pdbhh F Eukaryota T 6p7o 1 A A D7Y2H5_ECOLX E. coli MS115-1 NucC MSDWSLSQLFASLHEDIQLRLGTARKAFQHPGAKGDASEGVWIEMLDTYLPKRYQAANAFVVDSLGNFSDQIDVVVFDRQYSPFIFKFNEQIIVPAESVYAVFEAKQSASADLVAYAQRKVASVRRLHRTSLPIPHAGGTYPAKPLIPILGGLLTFESDWSPALGMSFDKALNGDLSDGRLDMGCVASHGHFYFNNIDSKFNFEHGNKPATAFLFRLIAQLQFSGTVPMIDIDAYGKWLAN 241 T 0.71 NERD pdbhh F Bacteria T 6p7p 1 A,B,C A,B,C D7Y2H5_ECOLX E. coli MS115-1 NucC 2-241 SNASDWSLSQLFASLHEDIQLRLGTARKAFQHPGAKGDASEGVWIEMLDTYLPKRYQAANAFVVDSLGNFSDQINVVVFDRQYSPFIFKFNEQIIVPAESVYAVFEAKQSASADLVAYAQRKVASVRRLHRTSLPIPHAGGTYPAKPLIPILGGLLTFESDWSPALGMSFDKALNGDLSDGRLDMGCVASHGHFYFNNIDSKFNFEHGNKPATAFLFRLIAQLQFSGTVPMIDIDAYGKWLAN 243 T 0.71 NERD unphh F Bacteria T 6p7v 1 A C Q6CK37_KLULA KLLA0F13816P MFDTKLFLSLPIDIRYTVYFFLGDVVQNVRPPAKSDIFNDELIAYPNIREFNQSLVDKYSKHIGVYDYIPNFIPNWCRDFDLLRHDIILTDRLRVCLQYEEQWFSVQWIVVSGELEIGIFTTDEQFLQVSYTINEYCHLLSIAQQDLRLGINVSDINDVNELCKEIQHRWLFDTVSYISFINCWDLDHENVVSIIPCMESFNNLHMLRIESKNMFNNLINTQGVRENPGKTIVYNVRQNIFELELYTLRDLGYKSVVDLQKWEQLQCLSLSGCEFIDLNNLILPQHCKMLILKEVKYIIWWDLSHLLKRIRPQWIINGQVKKPTKKEEEEESEWYNLYLEVVQTYQPLNFIELHNAKRVKGNLILPARLVTESRIKISNGTKVDSVLLI 389 T 0.024 DUF5420 pdbpssm F Eukaryota T 6p8b 2 D L FITC-RJPXD33 TNLYMLPKWDIP 12 T 1.7 MG3 pdbhh F T 6p8p 1 A,B,C,D A,B,C,D CAP8_PSEAI Uncharacterized protein MTTVVSRTFRSSPHRDALQTWDAIVELLTQGKDGTARSELRAVTGVAASLIADQAPKSAPIVATCDGPRTRIYCLFDEDAIDGDDANEEVLGFEPLKGDWGMSLPCPKEQLGWVQSALKKHSSRIIARDLSQGIATQAQADAGQAMSLDLGGFLKS 156 T 0.28 DUF3944 pdbpercent F Bacteria T 6p8s 2 C,D C,D CAP8_PSEAI HORMA1 MTTVVSRTFRSSPHRDALQTWDAIVELLTQGKDGTARSELRAVTGVAASLIADQAPKSAPIVATCDGPRTRIYCLFDEDAIDGDDANEEVLGFEPLKGDWGVSLPCPKEQLGWVQSALKKHSSRIIARDLSQG 133 T 0.28 DUF3944 unppercent F Bacteria T 6p8s 3 E,F E,F Peptide 1 SNAEVMEFNP 10 T 5.6 DUF1885 pdbhh F T 6pdk 1 A A B2FJJ6_STRMK Uncharacterized protein MGSSHHHHHHSSGLVPRGSHMATQTGRTINGHTYTDAPVDVKLGPNTFRIPANYLDSQIAPWPGEGVTLVIEWPDMKPTAPGARANPRTNDFRKEIPIRINYVDRVPVETLLSRLSSNEAITEEGSVERGDPRDRLDQRVAKPQTLGLTPYAIDEAKMVVYAKKYEARYGKPPVRNPAYERDWYIARQGDGRISSFIKCDGEEFRRDGVRLEGREVISEPGEVAAGCVHYFVDIDNKLSVSLDYKRAFLKDWKRMEEAVRDVIARTRSK 269 T 1.3 PsbP_2 unphh F Bacteria T 6pe4 9 P Q A0A4Z7TVW3_VIBPH Cation transporter MGSSHHHHHHSQDLDEVDAGSMVNTTQKISQSPVPDLEQFRAIAAQKDDRVISKRGEVKEPSTFHKGHKFASVSEGVLRKKYTKFFQENIKTHLDLKQALLKEEKPETALLAYSLVSPSGYRGEPLTERKILEVVSLLDEVKVDGDTYQQLKNTFDSISKDPRMQVSLENQYPGKMDGFGAQLLEMGKEKLKGSGVNAAINLALPGVGLLVATGRELHKASVNGDAEAYHHQLEQISQLPGRDQRLSMPMQQTLAIGHAMLSAEGAVGATLGMATGGLGTFGVSSVATAGVTPIAKEAIGTALTTGIISGGGFVAGQAGAYGLNNEVQDQLKQGPMSGVLPRLEISNVKGDFTFSMQEPAAVRALMAYLGPKEDTSMSSPQAPKEAQEMEAARLTLKQMLGSSPNEHLVPDVDSLLKLSDEDMPSQTESTANGAFKKLLSEDWDWLMPAVRAMDKGEAGKINEKLTYKLPLDAANGRVYLDKSPNLSDAQLDALDKLGSPSQLRLMYLAEGWI 513 T 0.14 VP4_helical unppercent F Bacteria T 6peu 2 E,F,G,H M,N,P,Q MCRA_METJA GLY-ARG-LEU-GLY-PHE-TYR-GLY-TYR-ASP-LEU-GLN-ASP GRLGFYGYDLQD 12 T 3.2 Anth_synt_I_N pdbhh F Archaea T 6pfj 1 A T F2RFR7_STRVP RSIG GSRPPAQRTAESALPDRARPELGALRLPELRTLRREAQSDEADLSYVRRMLQGRIDILRAELARRTDGEAPVLDRLSEILADVPSRHRSSARHVTLSTPRGEEYRRLAAEMLSEVELSDLTARTDEELHAAMGRLAGYEQQISRRRHHLQRTADDCSAEIARRYREGEAQVDDLLA 176 T 0.13 OrfB_IS605 unppercent F Bacteria T 6pi2 1 A A PPM1_LIMPO Tachyplesin II RWCFRVCYRGICYRKCRX 18 T 0.51 YlaC pdbhh F Eukaryota T 6pi3 1 A A TAC3_TACGI TACHYPLESIN III KWCFRVCYRGICYRKCRX 18 T 0.66 YlaC unphh F Eukaryota T 6pin 1 A A TAC1_TACTR TACHYPLESIN I KWCFRVCYRGICYRRCRG 18 T 0.021 Myticin-prepro unp F Eukaryota T 6pio 1 A A TAC2_TACTR TACHYPLESIN II RWCFRVCYRGICYRKCRG 18 T 0.04 Myticin-prepro unppercent F Eukaryota T 6pip 1 A A TAC3_TACGI TACHYPLESIN III KWCFRVCYRGICYRKCRG 18 T 0.53 YlaC pdbhh F Eukaryota T 6pir 1 A,B,C A,B,C Q5ZT21_LEGPH MavE SNATRFERNFLINSLMFLETILSVDKKLDDAIHHFTQGQYENPRYQINSRITNADDWSKEDKLKFTSAIAEAIALVSEKYENPTSETTEQIQSARNILLDNYVPLLTANTDPENRLKSVRENSSQIRKELIAKLKDE 137 T 0.008 DUF3502 unppercent F Bacteria T 6pit 3 C,D D,C Stapled Peptide 41A XHKKLHRXLQDS 12 T 0.0031 SRC-1 pdbhh F T 6pjp 2 B B GRK_DROME Peptide aldehyde inhibitor RKVRMAAIVFSFP 13 T 0.0014 DUF3844 unphh F Eukaryota T 6plh 3 C C IL21R_HUMAN IL-21R,NOVEL INTERLEUKIN RECEPTOR AGPMPGSSYQGTWSEWSDPVIFQTQSEELKEHHHHHH 37 T 0.00047 fn3 unppssm F Eukaryota T 6plm 1 A,B A,B SIDJ_LEGPH SidJ protein GHVKQYYFARRGETSTHDTSLPPPVKVLSGRSIPLKEIPFEATRNELVQIYLTSIDKLIKSNKLNSIPSQQIASHYLFLRSLANSETDGIKKNQILSLAKPLGTYLASKEPHVWKMINELIEKSEYPIIHYLKNNRAHSNFMLALIHEYHKEPLTKNQSAFVQKFRDSSVFLFPNPIYTAWLAHSYDEDSSFNPMFRERLSTNFYHSTLTDNLLLRTEPKEVTLSSEHHYKKEKGPIDSSFRYQMSSDRLLRIQGRTLLFSTPQNDVVAVKVQKKGEPKSTLEEEFEMADYLLKHQRRLDVHSKLPQPLGQYSVKKSEILEISRGSLDFERFKTLIDDSKDLEVYVYKAPQSYFTYLHDKNQDLEDLTASVKTNVHDLFVLLREGIVFPQLADIFHTHFGEDEREDKGRYQALVQLLNVLQFQLGRIDKWQKAVEYVNLRSSGLADLGDSLPITSLFTSSDFTKHYFSELLTGGYHPTFFDKSSGTANSLFTGKRRLFGNYLYLNTIAEYLLVIQLTLGSYGDKVTRDMMDKPKKEAVWRELANVMFTSCAEAIHIMTGIPQSRALTLLKQRANIEKHFRQTQFWMTPDYSKLDEDTLQMEQYSIYSGEPEYEFTDKLVSGVGLSVDGVHQDLGGYNRESPLRELEKLLYATVTLIEGTMQLDKEFFKQLEQVEKILSGEIKTDANSCFEAVAQLLDLARPGCHFQKRLVLSYYEEAKLKYPSAPTDAYDSRFQVVARTNAAITIQRFWREARKNLS 757 T 0.29 IQ pdb F Bacteria T 6pm9 2 E,F,G,H E,F,G,H OGA_HUMAN OGA,BETA-N-ACETYLGLUCOSAMINIDASE,BETA-N-ACETYLHEXOSAMINIDASE,BETA-HEXOSAMINIDASE,MENINGIOMA-EXPRESSED ANTIGEN 5,N-ACETYL-BETA-D-GLUCOSAMINIDASE,N-ACETYL-BETA-GLUCOSAMINIDASE,NUCLEAR CYTOPLASMIC O-GLCNACASE AND ACETYLTRANSFERASE,NCOAT MTLEDLQLLADLFYLPYEHGPKGAQMLREFQWLRANSSVVSVNCKGKDSEKIEEWRSRAAKFEEMCGLVMGMFTRLSNCANRTILYDMYSYVWDIKSIMSMVKSFVQWLGCRSHSSAQFLIGDQEPWAFRGGLAGEFQRLLPIDGANDLFFQPHHHHHHHH 161 T 12 Pinin_SDK_memA pdbhh F Eukaryota T 6por 1 A A A0A105L2P0_9BURK Ubonodin GGDGSIAEYFNRPMHIHDWQIMDSGYYG 28 T 0.2 MmoB_DmpM pdbhh F Bacteria T 6ppc 1 A A CG2RA_CONMI CONOPEPTIDE MI045 EDCGSDCMPCGGECCCEPNSCIDGTCHHESSPN 33 T 0.57 FeoB_associated pdbhh F Eukaryota T 6pqf 1 A A OlvA(BCS) ACGXGXGCAKXCAASCAAS 19 T 0.096 C_tripleX pdbhh F T 6pqg 1 A A OlvA(BC) ACGXGDGCAKXCAASCAAS 19 T 0.096 C_tripleX pdbhh F T 6pqt 1 A A G0SCF1_CHATD Dynein intermediate chain protein GAHMMQARREELLAKKARLAEIKRQRELRAQQAAGRSITPSELVSPTPSRANSRREIESLIDSILSSSAGANSPRRGSRPNSVISTGELSTD 92 T 0.097 kleA_kleC pdbpssm F Eukaryota T 6psh 1 A A ANTIH_BPT4 PROTEIN RI MNVDPHFDKFMESGIRHVYMLFENKSVESSEQFYSFMRTTYKNDPCSSDFECIERGAEMAQSYARIMNIKLETEKLAAALEHHHHHH 87 T 4 LT-IIB unphh T Viruses T 6psk 1 A R ANTIH_BPT4 Antiholin MNVDPHFDKFMESGIRHVYMLFENKSVESSEQFYSFMRTTYKNDPCSSDFECIERGAEMAQSYARIMNIKLETE 74 T 4 LT-IIB unphh T Viruses T 6pu1 2 B B SC24C_HUMAN SEC24-RELATED PROTEIN C GPLLPGQSFGGPSVS 15 T 5.7 LT-IIB pdbhh F Eukaryota T 6pun 3 E,F E,F P91820_CAEEL LST-1 GSNSSGLRSQKLHLTYIEKNKRVRAMIPQ 29 T 0.15 N_Asn_amidohyd unp F Eukaryota T 6pv9 2 B B macrocyclic peptide XFXNPHLXWSWXXRXGX 17 T 9.9 DUF5701 pdbhh F T 6pvb 2 B A AMINO GROUP-()-(2~{S})-2-azanylpropanal-()-ISOLEUCINE-()-ARGININE-()-LYSINE-()-PROLINE-()-AMINO-ACETALDEHYDE-()-9-(5-{[(3S)-3-amino-3-carboxypropyl](pentyl)amino}-5-deoxy-beta-L-arabinofuranosyl)-9H-purin-6-amine XXPKRIAX 8 T 3.8 HIRA_B pdbhh F T 6pw3 1 A,B,C,D C,A,B,D LARP1_HUMAN LA RIBONUCLEOPROTEIN DOMAIN FAMILY MEMBER 1 GHSGGGGGGHMQHPSHELLKENGFTQHVYHKYRRRCLNERKRLGIGQSQEMNTLFRFWSYFLEDHFNKKMYEEFKQLALEDAKEGYRYGLECLFRYYSYGLEKKFRLDIFKDFQEETVKDYEAGQLYGLEKFWAFLKYSKAKNLDIDPKLQEYLGKFRRLED 162 T 0.62 Frankia_peptide pdbpercent F Eukaryota T 6pwb 2 AA,AF,BD,D,E,FC,FE,GA,GB,H,HB,HD,HE,IA,IB,ID,JB,JD,KD,M,MB,N,O,P,QA,QC,QD,RC,RE,SC,SD,TC,UA,V,WA,WB,X,XB,XD,Y,YB,YD,Z,ZB,ZC,ZD,ZE CC,GW,EV,AL,AM,DY,GJ,CI,CU,AU,CV,FF,GL,CK,CW,FG,CX,FH,FI,BE,EA,BF,BG,BH,BZ,EK,FO,EL,HE,EM,FQ,EN,DD,BN,DF,DP,BP,DQ,GA,CA,DR,GB,CB,DS,ET,GC,GV B0STJ8_LEPBP Flagellar coiling protein A (FcpA) LTEDQKKKKKEIMEQESLWKNPDFKGYNKTFQELHQLSKTFANNQFRLALSNYQSGVNTIMKNRDWVEQYRKEEAEKKRLDEKWYWQKVDRKAREERVVYREKMKAKQDALNYFSKAINHLDEIKNPDLRERPEFKRLLSDVYRSWIMAEYDLQNLPQTIPILELYIEIDDNEKEYPAHKYLASAYSFEENMIKKTKGPDDMLFKYRYKKNVHLLRATELKYGKDSPEYKHIVNVIN 237 T 0.029 RVT_2 pdbpssm F Bacteria T 6pwb 3 AC,AD,AE,BA,BC,BE,BF,CA,CC,CE,DA,F,GC,GE,HA,KB,LB,LD,MD,ND,PE,Q,R,RA,RD,S,SE,UC,VA,VC,W,WC DT,EU,GE,CD,DU,GF,GZ,CE,DV,GG,CF,AO,DZ,GK,CJ,CY,CZ,FJ,FK,FL,HB,BI,BJ,DA,FP,BK,HF,EO,DE,EP,BO,EQ B0SR03_LEPBP Flagellar coiling protein B (FcpB) SGKSMADTEKELDDNISEVNKRLRLHTVLFKMKVRTLPHKTVLYKGKPSADGERCEAADKQEAQDNTCLHLEVFDFVGSEDGKSSKNLGAKFKKMELFFEGSNNADPDPRKEQPRNLTKIRTYIYQNNFLLEDKVISVIADVAPNGEPAHNDKIELFYQHDDYPVWGTPETPSEKGVGKYILSNVENTKSNPIRNNFKKQFYFKNLDYFDKLFTKIFDYND 221 T 0.11 HTH_1 unppssm F Bacteria T 6pwd 1 A A A0A085GHR3_9GAMM Type III effector HopBF1 SMFNVSNNVAPSRYQGPSSTSVTPNAFHDVPSLGQKVGAGSQKDVFHSRQDPRQCICLFRPGTTGSIPAEQYAQKELETTKQLKNLGFPVVDAHALVKHQGSVGVAKDFIHNALDSEDIVNNKKSLPDNLKFNKNVLEDCNAIIRRLKNLEVHIEDLQFLVDHNGHVLINDPRDVVRSSPDKSISKVNELRSHALNNLLDIDSD 204 T 0.0036 Pkinase pdbhh F Bacteria T 6px6 3 C C GLTC_WHEAT DQ2.2-glut-L1 APFSEQEQPVLG 12 T 21 BAGE pdbhh F Eukaryota T 6pxr 3 C A TAU_HUMAN NEUROFIBRILLARY TANGLE PROTEIN,PAIRED HELICAL FILAMENT-TAU,PHF-TAU AGTYGLGD 8 T 0.16 GshA pdbhh F Eukaryota T 6pxu 2 C,D C,D GAGATGAGAGYYITPRTGAGA GAGATGAGAGYYITPRTGAGA 21 T 0.32 YtxH pdbhh F T 6q0m 2 C,D C,D peptide YYESDWL 7 T 0.43 MIX pdbhh F T 6q0n 2 C,D C,D peptide TGYETWV 7 T 0.49 DUF4860 pdbhh F T 6q0r 3 C C DCA15_HUMAN DDB1- and CUL4-associated factor 15 MDWSHPQFEKSAVGLNDIFEAQKIEWHEGGGGSGENLYFQGGGRMEPGYVNYTKLYYVLESGEGTEPEDELEDDKISLPFVVTDLRGRNLRPMRERTAVQGQYLTVEQLTLDFEYVINEVIRHDATWGHQFCSFSDYDIVILEVCPETNQVLINIGLLLLAFPSPTEEGQLRPKTYHTSLKVAWDLNTGIFETVSVGDLTEVKGQTSGSVWSSYRKSCVDMVMKWLVPESSGRYVNRMTNEALHKGCSLKVLADSERYTWIVL 263 T 15 LuxS pdbhh F Eukaryota T 6q0u 2 C,D C,D peptide YYESGWL 7 T 1 MORN_2 pdbhh F T 6q1h 1 A,B,C,E,F,G A,B,C,E,F,G NUCC_PSEAI Bacterial protein ORF C62 MSQWSLSQLLSSLHEDIQQRLSVVRKTFGHPGTKGDASENVWIDMLDTYLPKRYQAAKAHVVDSLGNFSQQINVVVFDRQYSPFIFTYENETIIPAESVYAVFEAKQTADAGLVAYAQEKVASVRRLHRTSLPIPHAGGTYPAKPLIPILGGLLTFESEWSPALGPSMDKALNANLTEGRLDIGCVAAHGHFFYDQASGAYSYTNENKPATAFLFKLIAQLQFSGTVPMIDVEAYGQWLTK 241 T 0.37 AdoMet_Synthase pdbpercent F Bacteria T 6q1u 2 C,D C,D SFTI1_HELAN GLY-ARG-ALA-TYR-LYS-SER-LYS-PRO-PRO-ILE-ALA-PHE-PRO-ASP GRAYKSKPPIAFPD 14 T 0.052 Bowman-Birk_leg unp F Eukaryota T 6q1x 1 A A A0A0U2WJJ9_9BURK Pandonodin GVLGNDAEGITLLPLCFKPICIPTLPPLTGGHA 33 T 0.17 PEP-utilizers_C unppercent F Bacteria T 6q36 2 C,D C,D ACE-PRO-6CW-ARG-LEU-ARG-LYS-2JH-HYP-ASP-SER-PHE-ALN-LYS-GLU-PRO-NH2 XPXRLRKXPDSFXKEPX 17 T 1.8 FAM181 pdbhh F T 6q3v 1 A A N4BP1_HUMAN N4BP1 GSDEFTAPAEKAELLEQSRGRIEGLFGVSLAVLGALGAEEPLPARIWLQLCGAQEAVHSAKEYIKGICEPELEERECYPKDMHCIFVGAESLFLKSLIQDTCADLCILDIGLLGIRGSAEAVVMARSHIQQFVKLFENKENLPSSQKESEVKREFKQFVEAHADNYTMDLLILPTSLKKELLTLTQGE 188 T 0.25 YafQ_toxin pdb F Eukaryota T 6q4q 2 C,D C,D Stapled peptide XRLYGFKWH 9 T 1.1 Speriolin_C pdbhh F T 6q5h 1 A,B A,B CC-Hex*-L24D XGELKAIAQELKAIAKELKAIAWEDKAIAQGX 32 T 1.7 DUF5320 pdbhh F T 6q5i 1 A,B A,B CC-Hex*-L24E XGELKAIAQELKAIAKELKAIAWEEKAIAQGX 32 T 2 DUF5320 pdbhh F T 6q5k 1 A,B A,B CC-Hex*-L24K XGELKAIAQELKAIAKELKAIAWEKKAIAQGX 32 T 0.57 DUF2312 pdbhh F T 6q5l 1 A,B B,A CC-Hex*-L24H XGELKAIAQELKAIAKELKAIAWEHKAIAQGX 32 T 0.83 Rho_N pdbpssm F T 6q5m 1 A,B A,B CC-Hex*-L24Dab XGELKAIAQELKAIAKELKAIAWEXKAIAQGX 32 T 1.7 DUF5320 pdbhh F T 6q5o 1 A A CC-Hex*-LL XGELKALAQELKALAKELKALAWELKALAKGX 32 T 0.038 DUF5320 pdbhh F T 6q5p 1 A,B,C,D,E,F A,B,C,D,E,F CC-Hex*-II XGEIKAIAQEIKAIAKEIKAIAWEIKAIAQGX 32 T 0.0073 DUF2312 pdb F T 6q5q 1 A A CC-Hex-KgEb XGKLEAIAQKLEAIAKKLEAIAWKLEAIAQGAGX 34 T 0.053 Matrilin_ccoil pdb F T 6q5r 1 A,B,C,D,E,F,G,H,I,J,K,L A,B,C,D,E,F,G,H,I,J,K,L CC-Hex*-LL-KgEb XGKLEALAQKLEALAKKLEALAWKLEALAQGX 32 T 0.0071 Matrilin_ccoil pdb F T 6q5s 1 A,B,C,D A,B,C,D apCC-Tet XGELEALAQELEALAKKLKALAWKLKALAQGX 32 T 0.021 DUF5320 pdbhh F T 6q5z 1 A A H72_CONVC H_VC7.2 GAMGNVNCGGVPCKFGCCREDRCREIDCD 29 T 2.7 DUF4801 pdbhh F Eukaryota T 6q67 2 B B B8R1T8_9PICO 3A GLTIEAEPTELSYQDALEMLAESKPVSTTLSFER 34 T 0.15 Toprim_C_rpt pdbpercent T Viruses T 6q6r 2 E,F,G,H E,F,G,H DHX36_HUMAN DEAD/H BOX POLYPEPTIDE 36,DEAH-BOX PROTEIN 36,G4-RESOLVASE-1,G4R1,MLE-LIKE PROTEIN 1,RNA HELICASE ASSOCIATED WITH AU-RICH ELEMENT PROTEIN HPGHLKGREIGMWYAKKQGQKNKEAERQE 29 T 1.7 PsaL pdbhh F Eukaryota T 6q76 2 B B B9WZW9_MAGOR AVR-Pia protein GPAPARFCVYYDGHLPATRVLLMYVRIGTTATITARGHEFEVEAKDQNCKVILTNGKQAPDWLAAEPY 68 T 0.012 Pirin_C unppssm F Eukaryota T 6qax 1 A A LEU-GLY-GLN-GLN-GLN-PRO-PHE-PRO-PRO-GLN-GLN-PRO-TYR LGQQQPFPPQQPY 13 T 22 DUF3910 pdbhh F T 6qay 1 A A TAPA_BACSU BIOFILM ASSEMBLY ACCESSORY PROTEIN TAPA AFHDIETFDVSLQTCKDFQHTDKNCHYDKRWDQSDLHISDQTDTKGTVCSPFALFAVLENTGEKLKKSKWKWELHKLENARKPLKDGNVIEKGFVSNQIGDSLYKIETKKKMKPGIYAFKVYKPAGYPANGSTFEWSEPMRLAKCDE 147 T 0.002 Herpes_PAP unp F Bacteria T 6qb0 1 A A LEU-GLY-GLN-GLN-GLN-ALA-PHE-PRO-PRO-GLN-GLN-PRO-TYR LGQQQAFPPQQPY 13 T 24 NinD pdbhh F T 6qb1 1 A A LEU-GLY-GLN-GLN-GLN-PRO-ALA-PRO-PRO-GLN-GLN-PRO-TYR LGQQQPAPPQQPY 13 T 17 Oxidored-like pdbhh F T 6qb7 1 A,B,C,D,E A,B,C,D,E KCD16_HUMAN POTASSIUM CHANNEL TETRAMERIZATION DOMAIN-CONTAINING PROTEIN 16 SMEIKQSPDEFCHSDFEDASQGSDTRICPPSSLLPADRKWGFITVGYRGSCTLGREGQADAKFRRVPRILVCGRISLAKEVFGETLNESRDPDRAPERYTSRFYLKFKHLERAFDMLSECGFHMVACNSSVTASFINQYTDDKIWSSYTEYVFYREPSRWSPS 163 T 1.1 GFRP pdbhh F Eukaryota T 6qbb 2 B,D P,Q Strep-tag II peptide XSAWSHPQFEKX 12 T 2.7 PqqA pdbhh F T 6qc0 2 D,E,F B,D,F DTL_HUMAN DDB1- AND CUL4-ASSOCIATED FACTOR 2,LETHAL(2) DENTICLELESS PROTEIN HOMOLOG,RETINOIC ACID-REGULATED NUCLEAR MATRIX-ASSOCIATED PROTEIN SSMRKICTYFHRKS 14 T 2.7 Flexi_CP pdbhh F Eukaryota T 6qcg 2 G,H,I,J,K,L I,G,H,J,K,L CDT1_HUMAN DOUBLE PARKED HOMOLOG,DUP MEQRRVTDFFARRR 14 T 4.3 DAO_C pdbhh F Eukaryota T 6qdr 2 B B PAK6_HUMAN PAK-5,P21-ACTIVATED KINASE 6,PAK-6 XVISSNTLRGRS 12 T 1.4 Csm1_B pdbhh F Eukaryota T 6qdv 23 W R SRRM2_HUMAN 300 KDA NUCLEAR MATRIX ANTIGEN,SERINE/ARGININE-RICH SPLICING FACTOR-RELATED NUCLEAR MATRIX PROTEIN OF 300 KDA,SER/ARG-RELATED NUCLEAR MATRIX PROTEIN OF 300 KDA,SPLICING COACTIVATOR SUBUNIT SRM300,TAX-RESPONSIVE ENHANCER ELEMENT-BINDING PROTEIN 803,TAXREB803 MYNGIGLPTPRGSGTNGYVQRNLSLV 26 T 27 Antimicrobial14 pdbhh F Eukaryota T 6qet 1 A A GLL11_CHICK GAL-11,BETA-DEFENSIN 11,VITELLINE MEMBRANE OUTER LAYER PROTEIN 2,VITELLINE MEMBRANE OUTER LAYER PROTEIN II,VMOII DTTSDFHTCQDKGGHCVSPKIRCLEEQLGLCPLKRWTCCKEI 42 T 1.7E-05 DEFB136 unphh F Eukaryota T 6qev 2 B D PRO-LYS-SER-ILE-ARG-ILE-GLY-PRO-GLY-GLN-ALA-PHE-TYR-ALA-DPR PKSIRIGPGQAFYAX 15 T 0.00049 GP120 pdbhh F T 6qgi 1 A A H9ABL9_9VIRU VP5 IAPLVGVGLAAGAVGVGWALREFEIVGSDAPPEGLTADALKQQVYQTAKTRKSTNASTIVDNQNILDGVKHTAYTDAKIAAIEELNAGSAESAVLDAATTEVNSYLTTVQSNFLKTWNESVAELDSILSTVVNHPDIGKGDVFLMLNGSDNTIEDLLANPSGSTDATSFTLADGTTMSVGTVEVDRGTESYYYDPMSGLVGDLGDLKNGGPTVQYDGDSLVYLNASNWKPIYDEMDTVLQNVRSGISTWVSNVYGDVQSGEIEVSDLVTPRERAAMMAQEEGMSQAIADLIALNVPVDAEREATITIQDTGATLPGTFALTDASDGPLESGKTYDPSTFSGDVYFTADMSLVEGDWTAYQSGVDGGNVTLTSEPYSGTAVELNTAANETVAVDAGNWTATGNGTWYHDVSPELETDITSIESARFLSTAEQTQYETIQLQGSFTIDKLTNTQTGEEVTATSFDSSEPHTDSNYITQEEWDQLEQQNKELIEKYEQSQSGGGLDLGQFDMFGIPGEIVAVGVAALVGLGVLGNN 533 T 0.02 Sporozoite_P67 unppssm T Viruses T 6qgl 1 A,B A,B H9ABP6_9VIRU VP5 IAPLVGYAIGAAAISAVGGIGVGWTLREFEVVGSDDPAEGLTPDVLRNQLSDSVVKRKSNNQSTMVDNQNILDGVEHTAYTEAKIAAIEELNAGSSESAVLSAANSAIDSYETTVRTNFYKSWNETVRELEAMTQTVIAHADVGLSYITDFGDPRFGNLASGTSPNTLKDTTVSMPDGTNFTLLTFRHNTGWDSGNAAYSVVEYNPKEVVTSTNSNTYNTVDGTQYMKFSEWNAVETEMDTVFQNVRNGISTWVTNVYGDVQSGAIEISDLVTPRERATMMAQEEGMSQAIADLIALNVPVDAEREATITIQDTGATLPGTFALTDSSDGPLSAGQTYDPSTFSGDVYFTADMSLVEGPWDAINSGVDGGTITITSEPYEGTAIEVTTVESETVSVPAADWTDNGDGTWSYDASGDLETTITNVDSARFVSTATETTYDTLQLKGAFTVDKLVNKQSGEEVSSTSFTSSEPQTDSNYITQDEWDQLEQQNKELIEKYEQSQSGGGLDLGGLD 512 T 0.058 B56 pdbpssm T Viruses T 6qhg 1 A,B A,B L_RVFV Polymerase GPGAGTVGGFIKRQQSKVVQNKVVYYGVGIWRGFMDGYQVHLEIENDIGQPPRLRNVTTNCQSSPWDLSIPIRQWAEDMGVTNNQDYSSKSSRGARYWMHSFRMQGPSKPFGCPVYIIK 119 T 0.2 UNC80 pdbpssm T Viruses T 6qiu 2 B P ATX1_HUMAN Ataxin-1 phosphopeptide KRRWSAPESR 10 T 0.85 ACTH_domain pdbhh F Eukaryota T 6qix 1 A,B A,B P43 MLVLFFPLLLTVGLSTAGHVKCPDFGDWKPWTDCLWYPPQHMYSKLSHACGMHAHRNLTGVMDLPHGHKTPPPCGHCSFKFRCRRRPNTEGCYPLDGEVEVCHDHSDICTLPKLPHLGCGYAFINEKLKQCFTRPDTPSYVRLGYRKMFESIPKKHCIEKDGMCKCCCGDYEPNESGTECIKPPAHDCPAYGPPSEWSECLWFPLKNIVSHVYDHCHVHKEPDGYEPHSVAPANVHIPEKCGFCSFRVKCMKRDKKDGCFPLKLGKKSCGKDDCPTCGDICTLDKINGSCAFPRVMKEKIWDDFTATSKEKHMPHWKRDGYAKMLMQLPYSNCKEVGDKCKCCCHPYEPNKDGTACVVKEYCKRVHELHHHDHHGHGEEHHKSSSSESKEHHHH 394 T 29 PNISR pdbhh F T 6qjb 1 A A EVA3_RHISA Evasin-3 FDVVSCNKNCTSGQNECPEGCFCGLLGQNKKGHCYKIIGN 40 T 0.032 Toxin_11 pdbhh F Eukaryota T 6qkf 1 A A PG4_PIG PG-4 RGGRLCYCRGWICFCVGR 18 T 0.51 PCAF_N pdbhh F Eukaryota T 6qlc 1 A A C8ZKB3_9CAUD SSDNA-BINDING RNA POLYMERASE COFACTOR DRC GPLGSMALVKKNQARNTQATDNKGASAYLNFHFPTRDGKDVRLVSLGLRADDALHMQLQEFLTVDDKGKPLSETAYAERCKKLVSRLIIKLGVTRSEEERALDL 104 T 0.086 Hemolysin_N unppssm T Viruses T 6qld 1 A C CENPC_YEAST CENP-C HOMOLOG,CONSTITUTIVE CENTROMERE-ASSOCIATED NETWORK PROTEIN MIF2,MITOTIC FIDELITY OF CHROMOSOME TRANSMISSION PROTEIN 2 LRKSTRVKVAPLQYWRNEKIVY 22 T 1.4 CENP-C_mid pdbhh F Eukaryota T 6qld 12 L U CENPU_YEAST ASSOCIATED WITH MICROTUBULES AND ESSENTIAL PROTEIN 1,CENP-U HOMOLOG,CONSTITUTIVE CENTROMERE-ASSOCIATED NETWORK PROTEIN AME1 SVTTIDVLSSLFINLFENDLIPQALKDFNKSDDDQFRKLLYKLDLRLFQTISDQMTRDLKDILDINVSNNELCYQLKQVLARKEDLNQQIISVRNEIQELKAGKDWHDLQNEQAKLNDKVKLNKRLNDLTSTLLGKYEGDRKIMSQDSEDDSIRDDSNILDIAHFVDLMDPYNGLLKKINKINENLSNEL 190 T 0.0042 DUF1640 pdb F Eukaryota T 6qld 13 M Y NKP1_YEAST CONSTITUTIVE CENTROMERE-ASSOCIATED NETWORK PROTEIN NKP1,NON-ESSENTIAL KINETOCHORE PROTEIN 1 TDTYNSISNFIENELTALLSSDDYLMDDLAGELPNEVCRLLKAQVIEKRKDAMSRGKQDLLSKEIYDNESELRASQSQQIMELVGDIPKYSLGSELRNRVEGEPQSTSIERLIEDVLKLPQMEVADEEEVEVENDLKVLSEYSNLRKDLILKCQALQIGESKLSDILSQTNSINSLTTSIKEASEDDDISEYFATYNGKLVVALEEMKLLLEEAVKTFGNSPEKREKIKKILSELKK 237 T 0.00018 FTA4 unppercent F Eukaryota T 6qle 10 J Y NKP1_YEAST CONSTITUTIVE CENTROMERE-ASSOCIATED NETWORK PROTEIN NKP1,NON-ESSENTIAL KINETOCHORE PROTEIN 1 MTDTYNSISNFIENELTALLSSDDYLMDDLAGELPNEVCRLLKAQVIEKRKDAMSRGKQDLLSKEIYDNESELRASQSQQIMELVGDIPKYSLGSELRNRVEGEPQSTSIERLIEDVLKLPQMEVADEEEVEVENDLKVLSEYSNLRKDLILKCQALQIGESKLSDILSQTNSINSLTTSIKEASEDDDISEYFATYNGKLVVALEEMKLLLEEAVKTFGNSPEKREKIKKILSELKK 238 T 0.00018 FTA4 pdbpercent F Eukaryota T 6qlf 6 F U CENPU_YEAST ASSOCIATED WITH MICROTUBULES AND ESSENTIAL PROTEIN 1,CENP-U HOMOLOG,CONSTITUTIVE CENTROMERE-ASSOCIATED NETWORK PROTEIN AME1 MDRDTKLAFRLRGSHSRRTDDIDDDVIVFKTPNAVYREENSPIQSPVQPILSSPKLANSFEFPITTNNVNAQDRHEHGYQPLDAEDYPMIDSENKSLISESPQNVRNDEDLTTRYNFDDIPIRQLSSSITSVTTIDVLSSLFINLFENDLIPQALKDFNKSDDDQFRKLLYKLDLRLFQTISDQMTRDLKDILDINVSNNELCYQLKQVLARKEDLNQQIISVRNEIQELKAGKDWHDLQNEQAKLNDKVKLNKRLNDLTSTLLGKYEGDRKIMSQDSEDDSIRDDSNILDIAHFVDLMDPYNGLLKKINKINENLSNEL 320 T 0.0047 DUF1640 unp F Eukaryota T 6qnn 2 B B GTSE1_HUMAN GTSE-1,PROTEIN B99 HOMOLOG SQPLIDLPLIDFCDTPEAHVAVGSESRPLIDLMTNTPDMNKNVAKPSPVVGQLIDLSSPLIQLSPE 66 T 3 Gag_p12 pdbhh F Eukaryota T 6qnp 2 E,F,G,H H,I,J,K GTSE1_HUMAN GTSE-1,PROTEIN B99 HOMOLOG LAVTPDAASQPLIDLPLIDFCDTPEAHVAVGSESRPLIDLMTNTPDMNKNVAKPSPVVGQLIDLSSP 67 T 3.9 Gag_p12 pdbhh F Eukaryota T 6qpk 1 A,B A,B A0A2H1G421_ZYMTR Uncharacterized protein GHMAVVYAARCKFGNPLVQNNRITRAVCDLTNEHTTKDGSWHYVEVDNECKYLAGDNPRDQPGWAVFVKYCTYYKGVPDA 80 T 0.016 DUF5948 unppssm F Eukaryota T 6qrm 2 C,D C,D AIFM3_HUMAN APOPTOSIS-INDUCING FACTOR-LIKE PROTEIN GNCFSKRRAA 10 T 0.062 NifU unphh F Eukaryota T 6qs0 1 A A Y2503_BORBU Putative outer membrane protein BBA03 GAMGTPLEKLVSRLNLNNTEKETLTFLTNLLKEKLVDPNIGLHFKNSGGDESKIEESVQKFLSELKEDEIKDLLAKIKENKDKKEKDPEELNTYKSILASGFDGIFNQADSKTTLNKLKDTI 122 T 0.00042 RRP36 pdbpercent F Bacteria T 6qsz 1 A,C,E,G,I,K,M,O A,C,E,G,I,K,M,O SIR4_YEAST SILENT INFORMATION REGULATOR 4 GPKPKNTKENLSKSSWRQEWLANLKLISVSLVDEFPSELSDSDRQIINEKMQLLKDIFANNLKSAISNNFRESDIIILKGEIEDYPMSSEIKIYYNELQNKPDAKKARFWSFMKTQRFVSNMGFDIQ 127 T 0.092 DUF6120 pdbpercent F Eukaryota T 6qsz 2 B,D,F,H,J,L,N,P B,D,F,H,J,L,N,P ESC1_YEAST ESTABLISHES SILENT CHROMATIN PROTEIN 1 IPSTDLPSDPPSDKEE 16 T 30 bCoV_SUD_M pdbhh F Eukaryota T 6qt9 1 A,C A,D Q4KPG2_9VIRU ORF 25 EYTISHTGGTLGSSKVTTAANQTSPQRETAIIGFECPRKFAEIEYVGQRDSTRFIPRTTESITGTAGDDTVVSLTANIQPVAGETAIEDQDYPVAVAYNVTQGVQVDIDAVDYAADEVTLADNPADGDTVKVWPIMGDGDVQFRLVNQFGQEEGRVYPWATPLYRWHDFPQLKRGREINLHGSVTWEENETVEVLLDAPQAITWEDSDYPEGQYVSTFEQDVEITL 226 T 6.3 DUF1344 unphh T Viruses T 6qt9 2 B,D,E,F,G,H,I,J,K,L C,E,F,G,H,I,J,K,L,M Q4KPG2_9VIRU ORF 25 YTISHTGGTLGSSKVTTAANQTSPQRETAIIGFECPRKFAEIEYVGQRDSTRFIPRTTESITGTAGDDTVVSLTANIQPVAGETAIEDQDYPVAVAYNVTQGVQVDIDAVDYAADEVTLADNPADGDTVKVWPIMGDGDVQFRLVNQFGQEEGRVYPWATPLYRWHDFPQLKRGREINLHGSVTWEENETVEVLLDAPQAITWEDSDYPEGQYVSTFEQDVEITL 225 T 6.3 DUF1344 unphh T Viruses T 6qt9 3 AA,M,N,O,P,Q,R,U,V,W,Y o,a,b,c,d,e,f,i,j,k,m Q4KPG3_9VIRU ORF 24 GNIGNLSAEKQISVYDGQPFVDEQDVPADDPNTPALTIEGPDGYVIAVDAGTPIAPEFRDSNGNKLDPSTRVIVQKCDRQGNPLGDGIVFNDTLGRFDYEQMRTDPDFMRKTAKSLMIDEREIVKVFVDIPAGANGYDADKSRLTLGDDTSDFGKAVEIVDHDELSDAETRAV 173 T 6.6 HEF_HK pdbhh T Viruses T 6qt9 4 S,X,Z g,l,n Q4KPG3_9VIRU ORF 24 SAEKQISVYDGQPFVDEQDVPADDPNTPALTIEGPDGYVIAVDAGTPIAPEFRDSNGNKLDPSTRVIVQKCDRQGNPLGDGIVFNDTLGRFDYEQMRTDPDFMRKTAKSLMIDEREIVKVFVDIPAGANGYDADKSRLTLGDDTSDFGKAVEIVDHDELSDAETRAV 167 T 6.5 HEF_HK pdbhh T Viruses T 6qt9 5 T h Q4KPG3_9VIRU ORF 24 GNIGNLSAEKQISVYDGQPFVDEQDVPADDPNTPALTIEGPDGYVIAVDAGTPIAPEFRDSNGNKLDPSTRVIVQKCDRQGNPLGDGIVFNDTLGRFDYEQMRTDPDFMRKTAKSLMIDEREIVKVFVDIPAGANGYDADKSRLTLGDDTSDFGKAVEIVDHDELSDAETRAVKA 175 T 6.6 HEF_HK pdbhh T Viruses T 6qt9 6 BA Y Q4KPF6_9VIRU ORF 31 ERLGRLVDVLETKEFGDTTVERSVTQNIDRTRTDSPNNENQPIYFSTGPEAIAVENTEEWERLDFGIVAETVNIRTTDDIDIAFADPNKNGPVIRVREGESPFTIGGDAGIESAFIWLRQAETASNTPGIQIIAF 135 T 23 MREG pdbhh T Viruses T 6qtm 2 D,E,F D,E,F O42838_SACPA Ribonuclease H ESPPSLDSSPPNTSFNA 17 T 31 TOC159_MAD pdbhh F Eukaryota T 6qto 2 B B HY5_ARATH PROTEIN LONG HYPOCOTYL 5,BZIP TRANSCRIPTION FACTOR 56,ATBZIP56 XEIRRVPEFGGY 12 T 0.2 Macoilin unppercent F Eukaryota T 6qtq 2 B B UVR8_ARATH PROTEIN UV-B RESISTANCE 8,RCC1 DOMAIN-CONTAINING PROTEIN UVR8 XRYAVVPDE 9 T 2 CFIA_Pcf11 pdbhh F Eukaryota T 6qtt 2 B B HYH_ARATH HY5 HOMOLOG,BZIP TRANSCRIPTION FACTOR 64,ATBZIP64 XELLMVPDMY 10 T 0.0077 CASP_C unppercent F Eukaryota T 6qtu 2 B B BBX24_ARATH SALT TOLERANCE PROTEIN XEHFIVPDLY 10 T 0.89 MRP-L27 pdbhh F Eukaryota T 6qtv 2 B B HFR1_ARATH BASIC HELIX-LOOP-HELIX PROTEIN 26,BHLH 26,PROTEIN LONG HYPOCOTYL IN FAR-RED 1,PROTEIN REDUCED PHYTOCHROME SIGNALING,REDUCED SENSITIVITY TO FAR-RED LIGHT,TRANSCRIPTION FACTOR EN 68,BHLH TRANSCRIPTION FACTOR BHLH026 XYLQIVPEIHK 11 T 0.065 MRC1 unppercent F Eukaryota T 6qtx 2 B B COL3_ARATH Zinc finger protein CONSTANS-LIKE 3 XGFGVVPSFY 10 T 3 YqzE pdbhh F Eukaryota T 6qu1 2 B D SMRCD_HUMAN ATP-DEPENDENT HELICASE 1,HHEL1 LSELEDLKDAKLQTLKELFPQRSDNDLLKLIESTSTMDGAIAAALLMF 48 T 0.00016 CUE pdbpssm F Eukaryota T 6qvp 1 A,B,C,D,E,F A,E,C,D,B,F H9L4G5_SALTM MEMBRANE PROTEIN,PUTATIVE INNER MEMBRANE OR EXPORTED PROTEIN KTDITSTKNELVITYHGRLRSFSEEDTYKIKAWLEDKINSNLLIEMVIPQADISFSDSLRLGYERGIILMKEIKKIYPDVVIDMSVNSAASSTTSKAIITTINK 104 T 0.15 Na_Ca_ex_C unppssm F Bacteria T 6qxb 1 A A PHE-VAL-CAP-TRP-PHE-SER-LYS-PHE-LEU-GLY-ARG-ILE-LEU-NH2 FVXWFSKFLGRILX 14 T 0.013 Mim2 pdbhh F T 6qyv 1 A A PHE-SER-DAL-LEU-ALA-LEU-CYS-ALA FSXLALCA 8 T 11 PAGK pdbhh F T 6qz9 2 M,N,O,P,Q,R,S,T,U,V,W,X 0A,0B,0C,0D,0E,0F,0G,0H,0I,0J,0K,0L TUB11_BPPH2 GENE PRODUCT 11,GP11,LOWER COLLAR PROTEIN,PROTEIN P11 MSSYTMQLRTYIEMWSQGETGLSTAEKIEKGRPKLFDFNYPIFDESYRTIFETHFIRNFYMREIGFETEGLFKFHLETWLMINMPYFNKLFESELIKYDPLENTRVGVKSNTKNDTDRNDNRDVKQDLTSNGTSSTDAKQNDTSKTTGNEKSSGSGSITDDNFKRDLNADTADDRLQLTTKDGEGVLEYASQIEEHNENKKRDTKTSNTTDTTSNTTGTSTLDSDSKTSNKANTTSNDKLNSQINSVEDYIEDRVGKIGTQSYARLVMDYREALLRIEQRIFNEMQELFMLVY 293 T 0.018 LAP1C pdbpssm T Viruses T 6qzl 1 A,B,C,D,E A,B,C,D,E KCD12_HUMAN PFETIN,PREDOMINANTLY FETAL EXPRESSED T1 DOMAIN SMDGSRRSGYITIGYRGSYTIGRDAQADAKFRRVARITVCGKTSLAKEVFGDTLNESRDPDRPPERYTSRYYLKFNFLEQAFDKLSESGFHMVACSSTGTCAFASSTDQSEDKIWTSYTEYVFCRE 126 T 0.12 Baculo_VP91_N pdb F Eukaryota T 6qzr 2 I,J,K,L,M,N,O,P R,J,M,N,O,P,T,U FOXO1_HUMAN FORKHEAD BOX PROTEIN O1A,FORKHEAD IN RHABDOMYOSARCOMA RPRSCTWPLPR 11 T 5.9 PCSK9_C1 pdbhh F Eukaryota T 6qzs 2 B,D P,C FOXO1_HUMAN FOXO1 pS256 site RRRAASMDNNSK 12 T 12 Lys_export pdbhh F Eukaryota T 6qzy 2 B B A0A2R2JFI5_OMPOL ASN-GLY-PHE-PRO-TRP-MVA-ILE-MVA-VAL-GLY-PRO-ILE-GLY NGFPWXIXVGPIGVIGSVMSTE 22 T 1.3 DUF2897 pdbhh F Eukaryota T 6r00 2 B B A0A2R2JFI5_OMPOL PHE-PRO-TRP-MVA-ILE-MVA-PHE-GLY-VAL-ILE-GLY-VAL-ILE-GLY FPWXIXFGVIGVIG 14 T 0.043 DUF2897 pdbhh F Eukaryota T 6r0j 1 A A GLUP_BACSU Rhomboid family serine protease MFLLEYTYWKIAAHLVNSGYGVIQAGESDEIWLEAPDKSSHDLVRLYKHDLDFRQEMVRDIEEQAERVERVRHQLGRRRMKLLNVFFSTEAPVDDWEEIAKKTFEKGTVSVEPAIVRGTMLRDDLQAVFPSFRTEDCSEEHASFENAQMARERFLSLVLKQEEQRKTEAAVFQNGKLERENLYFQ 185 T 0.013 TBP unppercent F Bacteria T 6r17 1 A,B A,B SYCE2_HUMAN CENTRAL ELEMENT SYNAPTONEMAL COMPLEX PROTEIN 1 GSMGLYFSSLDSSIDILQKRAQELIENINKSRQKDHALMTNFRNSLKTKVSDLTEKLEERIYQIYNDHNKIIQEKLQEFTQKMAKISHLETELKQVCHSVETVYKDLCLQPE 112 T 0.00044 Dynamitin unppssm F Eukaryota T 6r1g 1 A,B A,B P22_BORBU ANTIGEN IPLA7 GAMGSNEYVEEQEAENSSKPDDSKIDEHTIGHVFHAMGVVHSKKDRKSLGKNIKVFYFSEEDGHFQTIPSKENAKLIVYFYDNVYAGEAPISISGKEAFIFVGITPDFKKIINSNLHGAKSDLIGTFKDLNIKNSKLEITVDENNSDAKTFLESVNYIIDGVEKISPMLTN 171 T 0.0071 DUF4969 unp F Bacteria T 6r21 3 AA,BA,CA,DA,Y,Z c,d,e,f,a,b TUBE2_BPT7 GENE PRODUCT 12,GP12 MALISQSIKNLKGGISQQPDILRYPDQGSRQVNGWSSETEGLQKRPPLVFLNTLGDNGALGQAPYIHLINRDEHEQYYAVFTGSGIRVFDLSGNEKQVRYPNGSNYIKTANPRNDLRMVTVADYTFIVNRNVVAQKNTKSVNLPNYNPNQDGLINVRGGQYGRELIVHINGKDVAKYKIPDGSQPEHVNNTDAQWLAEELAKQMRTNLSDWTVNVGQGFIHVTAPSGQQIDSFTTKDGYADQLINPVTHYAQSFSKLPPNAPNGYMVKIVGDASKSADQYYVRYDAERKVWTETLGWNTEDQVLWETMPHALVRAADGNFDFKWLEWSPKSCGDVDTNPWPSFVGSSINDVFFFRNRLGFLSGENIILSRTAKYFNFYPASIANLSDDDPIDVAVSTNRIAILKYAVPFSEELLIWSDEAQFVLTASGTLTSKSVELNLTTQFDVQDRARPFGIGRNVYFASPRSSFTSIHRYYAVQDVSSVKNAEDITSHVPNYIPNGVFSICGSGTENFCSVLSHGDPSKIFMYKFLYLNEELRQQSWSHWDFGENVQVLACQSISSDMYVILRNEFNTFLARISFTKNAIDLQGEPYRAFMDMKIRYTIPSGTYNDDTFTTSIHIPTIYGANFGRGKITVLEPDGKITVFEQPTAGWNSDPWLRLSGNLEGRMVYIGFNINFVYEFSKFLIKQTADDGSTSTEDIGRLQLRRAWVNYENSGTFDIYVENQSSNWKYTMAGARLGSNTLRAGRLNLGTGQYRFPVVGNAKFNTVYILSDETTPLNIIGCGWEGNYLRRSSGI 794 T 0.031 MelC1 pdbpercent T Viruses T 6r25 2 B L GLYR1_HUMAN NPAC DPHFHHFLLSQT 12 T 11 GREB1 unppercent F Eukaryota T 6r28 1 A A peptide P7 PSIXHVHRPDWPCWYR 16 T 1.9 DUF4172 pdbhh F T 6r2i 2 B B KASH5_HUMAN KASH5 GSMTSGTSGTSGGPSPPPTWPHLQLCYLQPPPV 33 T 0.28 B56 pdbhh F Eukaryota T 6r2l 3 C C AN30A_HUMAN SER-LEU-SER-LYS-ILE-LEU-ASP-THR-VAL SLSKILDTV 9 T 1.6E-05 SCP-1 unphh F Eukaryota T 6r4w 2 C,D C,D ACE-GLU-VAL-ASN-ALA-PRO-VAL-LPD XEVNAPVX 8 T 0.17 HIF-1a_CTAD pdbhh F T 6r4x 2 C,D C,D ACE-GLU-VAL-ASN-PRO-ALA-VAL-LPD XEVNPAVX 8 T 7.3 DUF5974 pdbhh F T 6r51 2 B,E,F E,D,F ACE-SER-LEU-ARG-PRO-ALA-PRO-LPD XSLRPAPX 8 T 0.28 RhoGEF67_u2 pdbhh F T 6r5g 2 B B PDCD1_HUMAN ITSM EQTEXATIVFP 11 T 0.002 DUF4578 pdbhh F Eukaryota T 6r5m 1 A,B,C A,B,C 3SX_DENPO Dendroaspis polylepis MT9 TICHIQISKTHGILKTCEENSCYKMSVRGWIIGRGCGCPSAVRPRQVQCCTSDKCNY 57 F F Eukaryota T 6r5q 2 B 1 XBP1_HUMAN XBP-1,TAX-RESPONSIVE ELEMENT-BINDING PROTEIN 5,TREB-5 DPVPYQPPFLCQWGRHQPAWKPLM 24 T 30 Prok-E2_E pdbhh F Eukaryota T 6r5w 1 A,B,C A,B,C Q8W5Z4_9CAUD Gp15 protein NPAQFAQKTVLDEHVNDADIHVTATDKTNWNAKETVEGAQAKADKALADAKAFFELSSSVQSVTLTPKNGFVASQPLIARYIKFGNRFLVIVSGIVGKGTGSGTGICATLPTFLAPDASWNKLYSAAQQSTAASNQANIYLSVSADINIVGVGSVDVNTGLDGIIYLTKEVTT 173 T 0.00052 Caudo_bapla_RBP pdbhh T Viruses T 6r64 1 A,B A,B MCRA_ECOLI ECOKMCRA GHHHHHHEFMHVFDNNGIELKAECSIGEEDGVYGLILESWGPGDRNKDYNIALDYIIERLVDSGVSQVVVYLASSSVRKHMHSLDERKIHPGEYFTLIGNSPRDIRLKMCGYQAYFSRTGRKEIPSGNRTKRILINVPGIYSDSFWASIIRG 152 T 0.3 DUF5616 pdbpssm F Bacteria T 6r7w 2 B B G8ULV2_TANFA Putative lipoprotein KRDPVYFIKLSTIK 14 T 6.8 DUF4786 pdbhh F Bacteria T 6r8i 2 B B SER-LEU-PRO-PHE-THR-PHE-LYS-VAL-PRO-ALA-PRO-PRO-PRO-SER-LEU-PRO-PRO-SER SLPFTFKVPAPPPSLPPSW 19 T 0.29 LDB19 pdbhh F T 6r8m 2 C,F C,G C4B8C2_MAGOR AVR-Pik protein ETGNKYIEKRAIDLSRERDPNFFDNPGIPVPECFWFMFKNNVRQDAGTCYSSWKMDMKVGPNWVHIKSDDNCNLSGDFPPGWIVLGKKRPGF 92 T 0.086 TMEM18 unp F Eukaryota T 6rao 9 J J Q6HAC7_9GAMM Afp12 MSKENALFPAVKDAIVFDALWQQAHEKVTALSGEIWTDTGDHDPGVTLLQSATWNCSDLSYRASLSLNDLLTHQDQSTLFPEEFGPEQVLTCNTVTAEDYRRALLDVHSSDIQALDTPEQDFLFSDVSLTQEPKEHRFHWWYNAEKREYSFRKPTDSGEVNELKLRGNLWLSLVPTRYTQSLSPENLAAVEQCLAEFLAAHRNLGEVVSRITWLQPATFSPRMTIELADNIGDINQVAAQIYQVTDAFLRPAVARYTTEQRRALGDADDAIFEGPRLKHGWQQTAPSQITSGGYVLNLGPLVNLLLAIPGVASLSTLSVDKGDGHITAVTGDNLRWQVADGYYPLLWGAPPLSLLAGDDSPLTLVSKGGIRNTLESEAMAGYLTQADLIVTTPTVLPAGRFRDQTLYIPIGQRQPECYALQQPDTVIDDQTRAVHQFLLPVDQLLADGTAELAQLPTLLAFKNRGDAIRGTRWPYTNAMVQQAIHQPYAKTLEAIAQQDAAIFTQDKQPVGGNYARELDFLQYLLGYFGTQRAALPLTLDLPDFLATQRAYLAQQPALGYDRINIRIDQVSALQKRIAARIGLDSICFADNPDLGQLPFYLIEHRQLLPQTPDSTFDSEQTPSGFAVAEPDITLTQAGSVGKVVQGQLIDLIAIEGGSRLHVSRLLVIKAEGDSFTVSTENSQQLHNTLSRLETAWASHNLRWQNSNVWLQDMDYRLNYAEAKLQPANPQQRLLASNAQSPYPAMVSVGDGIVLRPAGLQFYMPGANATRAATLDADWQLAATVKAVDPIAGTLLIEKAAGSTEDFPSAESSFRYQWAFSQANYATTDRFSFVVSAVLNRRLIENPNIVPEQLVAWIQETIMAEFPAHVSLINHWLDDATFNNFGVTYSRWQNSGMPLGDDAFALMQILTLGHLPVTQLDIGLMRIATEEQRTEVIGDGSQWHEDVILREELFYVPKDVQTTL 963 T 0.059 DUF276 pdbhh F Bacteria T 6rd4 1 A 0 A0A5H1ZR95_9CHLO ASA-10: Polytomella F-ATP synthase associated subunit 10 MSYSAYFAKAGFQFPAGLSALVAGIVALNVCTGRPTKGTKEISNAEYNATPIGYLQSPDQHPTAFPKVPGMKDVHGSPHHHH 82 T 0.1 DUF6506 pdb F Eukaryota T 6rd4 2 B 1 Q85JD5_9CHLO ATP synthase associated protein ASA1 MMRAAQKAKQELPATVLTQTRSYLAPLRSDFTEEITAPKVASASNLVNEWNNKKQATENLMKLLQAYKDIGDAKSEPLLKNHNPRTFEDRDYPVPDFRTQNLKAGDVPKFFDTVISTRASAAIASKDKFWAGRKTEAEAASAKASAAFPRVAVPEWKKGKTVSIENLNTVTDKYAAALVPKRKLALPVLPEGVKKAVEDFAASVGQAKNASEVSELLAKSLAEKAVVTEGGKVVEGFSYVSKAVAAKVIATRRAEVHERLLKLWAKRLLVSPELAIVPLNEFDAQLASKFEGISPKYQELLSAVAQGNKTFAQRLNSSPAFSSFLLKREKAESEVPPSELELEAAQKAAELEDPEVALRTLLGPQMEALGASDLLLSEQIRVITEHRYTPDRLQYKEGMKLADKIAAQEAALKEELKVIYGDNVDVKHFQASPRTPVQQLFDSLKNAAANKERAAKEAAAAASPYLAYAVTKKQEVQADPSNIPFDEVLYPQLSEELLELELSDIREDEIALEKAEEEELWLLTLTQQFKHIQKHFGIDLPHSVVAHMDPLLIKKIDWETTNALEDFDITLDDMGAEDAKEQWGAENLSHHFLPLIRYRRDLARKNGDRYGPDLVNGN 618 T 0.21 Lipoprotein_10 pdb F Eukaryota T 6rd4 4 D 3 K0J903_9CHLO Mitochondrial F1F0 ATP synthase associated 32 kDa protein MRQASRLALSIRQAGNVEAASAVPAMTRQFSAPGSHEHHETPLSKVMPTVVSIPRKVACLALGATKKVVCGLASSGPSQNLVSTFANKVIVEENLVNVAEIDVPFWSYWLSSAGFTSKDAFVKFAEAVKPKVAALSTSDITNLTVAFKRANYYDKDLFTGIEANVSANFTKFETEQLLQIVATFDAFNHSSVAFLDDVADSITYCNHYLAPVRAGADELATLLTYYAKNGHERADLLATVARGFSEVSLGKLSAAQRKDTVLSALKAFQTFGFYPESIEAVIGAALVSPAEYSAEELKEVEAVKVAAENALGGEFVLIQEGAHGH 325 T 0.042 FAST_1 pdbpercent F Eukaryota T 6rd4 8 H 7 D8V7I2_9CHLO Mitochondrial ATP synthase associated protein ASA7 MSSVRAGVEAGRRDLTTFTFSGLQDAPVAALSGSIKLNVAAKAGKAEVTVAAGAAKAATQVSAAALRKLSGSKISLAEVARISVLHSSIQNYLLSLSNERYQLLSQWPDFTTMYGKDFYYRAHPEDLKKFYDAADEYYKLYETVTEFDSLSALASQVVPNYAARRRSTVHPAIGSTVADGAFTNFLLSKQ 190 T 0.96 DUF4296 pdbhh F Eukaryota T 6rd4 9 I 8 D8V7I7_9CHLO Mitochondrial ATP synthase subunit ASA8 MVLGEVYLKDILRTPPTGAIPANVPHPFQTSFYTYATKKLIPRHWYLLGGFTFTITLYGILDGLRDSGKKKAYDEAIHAGKTPYTAGGH 89 T 0.3 Selenoprotein_S pdbhh F Eukaryota T 6rd4 10 J 9 A0A5H1ZR73_9CHLO Mitochondrial ATP synthase subunit ASA9 MAVTSFLGKAFEKYFYDFSAYEQFGLNRFLSSKGQYVALRHVGFVMVGVNVLLAANFPFNPPFPTIGMCPAGWEGTWVCQADKAKALEMYKEWKKSN 97 T 0.39 1-cysPrx_C pdbhh F Eukaryota T 6rfm 1 A A CYAA_BORPE AC-HLY,ACT,CYCLOLYSIN GSRSFSLGEVSDMAAVEAAELEMTRQVLHAGARQDDAEPGVSGASAHWGQRALQGAQAVAAAQRLVHAIALMTQFGRAGS 80 T 35 Hol_Tox pdbhh F Bacteria T 6rfq 17 Q S F2Z673_YARLI Subunit NESM of NADH:Ubiquinone Oxidoreductase (Complex I) MLKLHYRNFITAQHSTTNTTPTMIASVCKRAGLRAGPRAYPGVRQFALRAYNEEKELALKQRLSQLPPPGKAFVTAEGEPRPAKEAELAELAEIAALYKTDRVGILDILLLGNKHARLYRDNTALLKDYYYNGRRILDKIPVKDKQTGKVTWEIKREGAEKEDWVNQMYFLYAPSLILLLIVMVYKSREDITFWAKKELDQRVLDKHPEINDAPENERDALIVERIIAGDYDKLASLQKKATPTPATLI 249 T 0.0013 ESSS pdbpercent F Eukaryota T 6rfq 22 V Z Q6CI10_YARLI Subunit NUZM of NADH:Ubiquinone Oxidoreductase (Complex I) MLPGGPVPVFKKYTVGSKGIWEKLRVLLAIAPNRSTGNPIVPLYRVPTPGSRPEANVYQDPSSYPTNDIAENPYWKRDHRRAYPQTAFFDQKTVTGLLELGSEATPRIADGEAGTKALANIANGGVSFTQALGKSSKDVIYGEVLTVNGLPPVAPTLAPKQWKIIEGEAAIYPKGYPCRTFH 182 T 0.00017 CI-B14_5a pdbhh F Eukaryota T 6rfq 30 DA i A0A1H6Q311_YARLL Subunit N7BM of NADH:Ubiquinone Oxidoreductase (Complex I) MGGGRYPFPKDVISMTGGWWANPSNWKLNGLFATGIAVGLALWVSTATLPYTRRREGITSESDISKWNAAAGVWRERHGKISTGEAAESE 90 T 6 Sex_peptide pdbhh F Eukaryota T 6rfq 33 GA n Q6C1R9_YARLI Subunit NUNM of NADH:Ubiquinone Oxidoreductase (Complex I) FGSHGHGPELTPAVPFFQPYVLKWAGVSLGLVAFYQFNSSYEAKNGHTWVETFFHPKSREDILNEEAKIVQALNNQRELTIKMHELKREEKDYAHSYSPLFSDPVPQGGSIGKAPGSSRE 120 T 0.023 DUF5950 pdb F Eukaryota T 6rh6 2 B B AP2M1_RAT AP-2 MU CHAIN,ADAPTOR PROTEIN COMPLEX AP-2 SUBUNIT MU,ADAPTOR-RELATED PROTEIN COMPLEX 2 SUBUNIT MU,CLATHRIN ASSEMBLY PROTEIN COMPLEX 2 MU MEDIUM CHAIN,CLATHRIN COAT ASSEMBLY PROTEIN AP50,CLATHRIN COAT-ASSOCIATED PROTEIN AP50,MU2-ADAPTIN,PLASMA MEMBRANE ADAPTOR AP-2 50 KDA PROTEIN SQITSQVTGQIGWRR 15 T 74 DUF2553 pdbhh F Eukaryota T 6rhc 2 B P WWTR1_HUMAN TRANSCRIPTIONAL COACTIVATOR WITH PDZ-BINDING MOTIF RSHSSPASLQLGT 13 T 0.13 TFIIA unppercent F Eukaryota T 6rjx 1 A A O51302_BORBU LysM domain protein GAMGESRESKNAKIAQPDNKNFQLRDIKDIKNELIRERGHLFYSKEFNEAERLEEAMKQSFSKKKAIEGNEIALKVLERYKTIIRETREKKEKTNYLKENIEKYLNDAEANEAYIWIPLEIDEVNNLYFEATRKYKNYDLDNALDMYSKAFNRAQQAAKNAKEAKALKETDERMYKQLKALEAASNLPI 189 T 0.063 DUF2686 pdbpercent F Bacteria T 6rko 3 C H YNHF_ECOLI Uncharacterized protein YnhF MSTDLKFSLVTTIIVLGLIVAVGLTAALH 29 T 1.1 DUF6520 pdbhh F Bacteria T 6rm3 32 FA SH0 eS7 MSAQTETEKQISEIIKKFVTDIDEKKLQEINIQTFIRDNKRKVMTVKVPVEIISKSQINFGSIIKNIKQKFQDYYIILVENIKNEEKTSWNDCKKIFKGACYPFNINGIRTDVISPEEEIVNVLLEKKCTFNEDEFKMIETAIKGLVGMNVVVSTNFHSLN 161 T 0.0037 Ribosomal_S7e pdbpercent F T 6rm3 43 QA LM0 eL14 MKFIELGRLVAPIIKKERNIKAIIIGIIDSTFVVLKKSNGENEVCPVSSLILLDEVYDIKNLSSEEIVKLIENKKEEGGASNDFERFKNKLREEVKKNILREKGI 105 T 0.0025 Ribosomal_L14e pdbhh F T 6rm3 48 VA LNN MDF2 MEKQQNEKKLNEEETEKLALTEEHPKKKVNEEDNLDTLPEKREEDIVFKKVNVEKNKEKEEDHNFSSNYADHKIDLLSVENKDFPKKQKKVKDSLHLLKENRDRDFGRSKRGHVRNKKQGRETGTRRIKIRKNNYESNDINNYNIKTISKKSRRKEAERQ 160 T 58 DUF2970 pdbhh F T 6rm3 72 TB LXX msL1 MASKLKKSWKDEKKKSKTAIFSLEKQKVMKQERLQKAKILREKIKGLKEKKKQYYLDLARQKADKLEADKLLN 73 T 0.086 FliJ pdbpssm F T 6rml 2 C C TP53B_HUMAN 53BP1 EVEEIPETPCESQGE 15 T 9 Perm-CXXC pdbhh F Eukaryota T 6rmm 2 E,F P,R TP53B_HUMAN 53BP1 SSDLVAPSPDAFRST 15 T 11 Tachykinin pdbhh F Eukaryota T 6rmv 3 C C TRPM1_MOUSE Transient receptor potential cation channel, subfamily M, member 3 KRPKALKLLGMEDDI 15 T 8.4 RPT pdbhh F Eukaryota T 6ro1 2 B B NVL_HUMAN NUCLEAR VCP-LIKE PROTEIN GPDSMKDSEGGWFIDKTPSVKKDSFFLDLSCEKSNPKKPITEIQDSKDSSLLESD 55 T 6.9 Amdo_NSP pdbhh F Eukaryota T 6ro6 1 A,B,C,D,E,F,G,H A,B,C,D,E,F,G,H DDROC_DEIDV HTH-type transcriptional regulator DdrOC MEGALPKGLSDLIADPTLGPQITPDWVRTLSRIELRGKRPRDKQDWYEIYLHLKRILS 58 T 3.6 DUF4936 pdbhh F Bacteria T 6roy 2 B,D D,C PDCD1_HUMAN immune receptor tyrosine-based inhibitory motif (ITIM) FSVDXGELDFQ 11 T 0.0063 SIT unphh F Eukaryota T 6rp4 1 A,B,C,D A,B,C,D SIDD_LEGPH SIDD SIGVSDGLLSYIKNENENKGFLGIYGFFTGADKNIEKATLYKNLIAKYQNNHFISLIILSALVSDSKTPLMTQYLVGYLDFPSKALLANKITELLLKELENPDMREILGSRLATDVIEELETKIIRYIHNPAGSDIHSTLNLWTADKIKAATNSSLTI 158 T 0.0095 Yuri_gagarin pdbpercent F Bacteria T 6rqj 2 B C C5I_ORNMO Complement inhibitor MASHHHHHHHHHHSGDSESDCTGSEPVDAFQAFSEGKEAYVLVRSTDPKARDCLKGEPAGEKQDNTLPVMMTFKQGTDWASTDWTFTLDGAKVTATLGQLTQNREVVYDSQSHHCHVDKVEKEVPDYEMWMLDAGGLEVEVECCRQKLEELASGRNQMYPHLKDC 165 T 8.1E-05 His_binding pdbhh F Eukaryota T 6rqx 2 B B PSE-LYS-HIS-HIS-ALA-PHE-SER-PHE-LYS XKHHAFSFK 9 T 13 SmaI pdbhh F T 6rrc 2 B,D B,D RAD21_HUMAN HHR21,NUCLEAR MATRIX PROTEIN 1,NXP-1,SCC1 HOMOLOG KRKLIVDSVKELDSKTIRAQLSDYS 25 T 3.2 E2 pdbhh F Eukaryota T 6rre 1 A,B,C,D,E,F A,B,C,D,E,F SIDD_LEGPH SIDD MRSIITQICNGVLHGQSYQSGSNDLDKGNSEIFASSLFVHLNEQGKEIIKHKDSDDKIVIGYTKDGMAFQIVVDGFYGCERQAVFSFIDNYVLPLIDNFSLDLTRYPDSKKVTESLIHTIYSLRSKHAPLAEFTMSLCVTYQKDEQLFCAGFGIGDTGIAIKRNEGTIEQLVCHTEVDGFKDAFDNYSSANIDLVIERNSVFNTKVMPGDELVGYTYVPPMLEMTEKEFEVETVDGKKINKRIVRHLNLDPGNFDDKDPLFSQLLQVVKSKQKQLVEQAKETGQIQRFGDDFTVGRLVIPDQLLINQLRIHALSIGVSDGLLSYIKNENENKGFLGIYGFFTGADKNIEKATLYKNLIAKYQNNHFISLIILSALVSDSKTPLMTQYLVGYLDFPSKALLANKITELLLKELENPDMREILGSRLATDVIEELETKIIRYIHNPAGSDIHSTLNLWTADKIKAATNSSLTI 471 T 0.08 PP2C_2 pdbpssm F Bacteria T 6rrk 2 C,D C,D RAD21_HUMAN HHR21,NUCLEAR MATRIX PROTEIN 1,NXP-1,SCC1 HOMOLOG PTKKLMMWKETGGVEKLFSLPAQPLWNNRLLKLFTRCLTP 40 T 3.2 Vac_ImportDeg pdbhh F Eukaryota T 6rrl 1 A A peptide 3967 FRIMRILRVLKL 12 T 6.6 OGFr_N pdbhh F T 6rro 1 A A peptide 536_2 GFIVKRFKILV 11 T 1.8 CHDCT2 pdbhh F T 6rrp 1 A,B A,B Q9I188_PSEAE PvdP MTVSRRGFMAGLALTGAAALPVAYYTHRHLTREEEPQTPDEASLDLAATDGIRLGDRLRGLWDLRLVGGDAELPGLPREGLQLVLDVAPKGRGLIGYLDTPERLLAAEPPRFRVLGDLLGASSASIRWRLVDQASGSVAPTHDCSAVFDEVWADYANAGDGTLSGRIQRLERSPLSPNEDFRFVAVKRHFPLAHERIVLNEKLLGWLVSPQHRLFHQLWHASRDKWHRLSEKQRNALRGVGWQPGPLDRERDARGPRKDRNASGIDFFFMHRHMLHTARSMQDLPSWERLPRPVVPLEYDRPGFIRYFDNPDGFSVPPAWVAVDDDEYSEWLHGLKSAEAYHANFLVWESQYQDPAYLAKLTLGQFGSELELGMHDWLHMRWASVTRDPSNGAPVMTDRFPADFAPRWFRPENDFLGDPFSSHVNPVFWSFHGWIDDRIEDWYRAHERFHPGEVQRREVEGIQWFAPGRWVEVGDPWLGPATHGCGLSDVQASSNSVELDVETMKLALRIIFSEEDQLSGWLKRAPRRPWYARNLKLARDQLRR 544 T 0.0015 Hemocyanin_M pdbhh F Bacteria T 6rsm 1 A A peptide 12530 KFKKVIWKSFL 11 T 8.7 DUF5665 pdbhh F T 6ru6 2 C C P63_HUMAN P63,CHRONIC ULCERATIVE STOMATITIS PROTEIN,CUSP,KERATINOCYTE TRANSCRIPTION FACTOR KET,TRANSFORMATION-RELATED PROTEIN 63,TP63,TUMOR PROTEIN P73-LIKE,P73L,P40,P51 RTPSSASTVSVGY 13 T 24 Vac7 pdbhh F Eukaryota T 6ru7 2 C,D C,D P63_HUMAN P63,CHRONIC ULCERATIVE STOMATITIS PROTEIN,CUSP,KERATINOCYTE TRANSCRIPTION FACTOR KET,TRANSFORMATION-RELATED PROTEIN 63,TP63,TUMOR PROTEIN P73-LIKE,P73L,P40,P51 YTPSSASTVSVGSSET 16 T 26 GerPB pdbhh F Eukaryota T 6ru8 2 E,F,G,H E,F,G,H P63_HUMAN P63,CHRONIC ULCERATIVE STOMATITIS PROTEIN,CUSP,KERATINOCYTE TRANSCRIPTION FACTOR KET,TRANSFORMATION-RELATED PROTEIN 63,TP63,TUMOR PROTEIN P73-LIKE,P73L,P40,P51 SSASTVSVGSSY 12 T 8.5 HET-S pdbhh F Eukaryota T 6ruj 2 B B CONSENSUS ANKYRIN REPEAT DOMAIN-(D)3-hydroxy-Leu EVVKLLLEHGADVXA 15 T 0.00029 Shigella_OspC pdbhh F T 6rvc 1 A,B,C A,B,C PTC1_HUMAN PTC1 ETGHHHHHHRDGLDLTDIVPRETREYDFIAAQFKYFSFYNMYIVTQKADYPNIQHLLYDLHRSFSNVKYVMLEENKQLPKMWLHYFRDWLQGLQDAFDSDWETGKIMPNNYKNGSDDGVLAYKLLVQTGSRDKPIDISQLTKQRLVDADGIINPSAFYIYLTAWVSNDPVAYAASQANIRPHRPEWVHDKADYMPETRLRIPAAEPIEYAQFPFYLNGLRDTSDFVEAIEKVRTICSNYTSLGLSSYPNGYPFLFWEQYIG 261 T 3.3E-50 Patched unppercent F Eukaryota T 6rw2 2 B B ALA-ARG-ASP-CYS-PRO-LEU-VAL-ASN-PRO-LEU-CYS-LEU-HIS-PRO-GLY-TRP-THR-CYS ARDCPLVNPLCLHPGWTC 18 T 2.6 NusG_II pdbhh F T 6rxq 2 E,F,G,H E,F,G,H H4_YEAST Histone H4 KGGAKRHRKIL 11 T 11 Shadoo unppercent F Eukaryota T 6rxt 47 YA UH Y1780_CHATD Utp8 MAAKLQIHAPYVLHALPRPLDRSDGLGRYFSGEVFGQKQGGKRKKRTELAVAIDGVAVYLYDILSSQVVTSYLVSPQSCFTCPPSSLRWRPASSKTVTRYTYVSVATGDSVLAKREIRLFREETLSTGNTVVACISRTICSDSPIVHIFTSSPRNFLTNVPGKDIPNHDLIIITANGSIFALNGETLEEKWQVSPSVLSREILSDSKLALQVDFVQQTSAADVADGLFGGKNDLFGVFQERIHRDGFNPDFFVVITSQSGADSANARHLHVLALPSEREARQTGKENVISVFVAPLAVEETCRSFQLDVRSGTLQAISNKALVTYQFANGIAKLENRLQVPGLSSYLRLSKTSVLTSATDSLSVYNPIYRSLQAAARLEPTDDTNGHACEFVSYLASRELAVAIRGGSLVVIQIEAPKNRTAKRRAEGLLTDAIRRGISRKIAFEKRTKPEHVSDSTILADAVPGSLSDPSWSEWQNKAMQADELLQNNDIQSWEELMAEVFKVPIKPDETADAEKQTAPNPVVKLPEWEWPSSRSDYARVDRRWVVYAINKVFGWEGQLESNTGRLTCRLPESSVLIYLVDAGHLSTSNVKSAFKDDVREVDKVEELIGEQLPIILAEVDPTMELLVGYLSGTQLGSSELVSSIKLLLCSLGLFEDGSRLPAVGDNTHIEQVTGQENEVVNMELDRAEEELQITEHYLDEHRTRGLGIAFSKLAACPAAETVKSLRRLFKPDEVLVLLNVLRAELIKDGWTTRYLDKINADQEDDAPPDASIQLIADLMSRCIDAVGLSGWMAADVMLSSSRTHQDSANFFSQFQAEISVALEGVMEAVRLKGVIAEAANYAKRARRALADSAKGKAMTVHMSAELPLGLKTDNKISTERVRSGGEIVARSSRQIGHFISKRRGIYSIHRISEEMLLGAAGPTVVQEAR 930 T 3.1E-10 Utp8 pdbpssm F Eukaryota T 6s0k 35 IA k Cytoskeleton protein RodZ RKKRDGWLMTFTWLVLFVVIGLSGAWWWQAAAAAAAAAAAAKALIVYGAAAAAAAAA 57 T 0.006 TspO_MBR pdbpercent F T 6s0n 1 A A GLN-ASP-VAL-ASN-THR-ALA-VAL-ALA-TRP QDVNTAVAW 9 T 10 gp12-short_mid pdbhh F T 6s1c 4 E,H D,H CTF18_YEAST Chromosome transmission fidelity protein 18 GAMGNQTVKIWVKYNEGFSNAVRKNVTWNNLWE 33 T 2 Jnk-SapK_ap_N unppssm F Eukaryota T 6s1u 2 C I PRO-0A1-VAL-PSA-ALA-MET-THR PXVXAMT 7 T 34 Allatostatin pdbhh F T 6s22 2 B F FGF23_HUMAN FGF-23,PHOSPHATONIN,TUMOR-DERIVED HYPOPHOSPHATEMIA-INDUCING FACTOR NTPIPRRHTRSA 12 T 2.1 MGTL pdbhh F Eukaryota T 6s29 2 B,D B,D MIS19_SCHPO EIGHTEEN-INTERACTING CENTROMERE PROTEIN 1,KINETOCHORE PROTEIN MIS19 PRVYETELLVLRFREFGVKDNHNHPINLHSLRSKSLIRAQGKKLDLHNRVFLRRNVRAVKM 61 T 14 DHOase pdbhh F Eukaryota T 6s2c 1 A A E1CI69_9VIRU Capsid protein PISADFSEVENAPSFLSLAENTDEVLKPYTGLEIQTIITNIVGDANPNQSRIFDQDRLRGNQYSAGGLVTQNAVSAIPFTNLIPRTIRVGNILVNSANRLQITETNVSEYYSNPIIATKLSEMISDQVKNNQFSTWRRDNTSLQGFNAFDIATINTAILPNGLSLESMLLKLSLLHSIKAMNVDAASINRSQYQVIDHNTVPTIGAPAVVGVNNSPVFGEDCGGNNPVYPFGGGTGAIAFHVTLQTVPDERKSYAIFVPPAILQATSDANEALALFALSMSEWPHALYTVTKQTTDLAGANAGQQVFIPTQSTIHIGGRRVLDLIIPRREIAPNPTTLVAANAMCMVRPQAGPDATAGAIPLAAGQLFNMNFIGAPAFEEWPLTSYLYSWAGRFDITTIRQYMGRLATMVGVKDAYWAAHELNVALSQVAPKMTTAAGGWAAQAANSAQQSDVCYSSLLTVTRSAANFPLANQPAADMRVYDTDPATWNKVALGLATAANLVPEQSMDVPFVVGDARTSFWERLQAIPMCIAWTMYYHSRGITTLAWDNAYTDNTNKWLQKMVRNTFSTTQSVGTIIPARYGKIVCNLYKNMFHRAPAYVATSVGGKELHITHFERWLPGGTYANVYSGAGAVVNCFSPVLIPDIWCQYFTAKLPLFAGAFPPAQGQNSTKGFNSKQGLMIHRNQNNNLVAPYLEKFADNSSYFPVGQGPEINDMATWNGRLWMTTGNVQYLDYSGAAIVEAVPPAGELPVGKQIPLLAGENAPIELTNAATTCVPRYSNDGRRIFTYLTTAQSVIPVQACNRAANLARSCWLLSNVYAEPALQALGDEVEDAFDTLTNS 840 T 0.16 TPP_enzyme_C pdb T Viruses T 6s2c 2 B B E1CI69_9VIRU Capsid protein PISADFSEVENAPSFLSLAENTDEVLKPYTGLEIQTIITNIVGDANPNQSRIFDQDRLRGNQYSAGGLVTQNAVSAIPFTNLIPRTIRVGNILVNSANRLQITETNVSEYYSNPIIATKLSEMISDQVKNNQFSTWRRDNTSLQGFNAFDIATINTAILPNGLSLESMLLKLSLLHSIKAMNVDAASINRSQYQVIDHNTVPTIGAPAVVGVNNSPVFGEDCGGNNPVYPFGGGTGAIAFHVTLQTVPDERKSYAIFVPPAILQATSDANEALALFALSMSEWPHALYTVTKQTTDLAGANAGQQVFIPTQSTIHIGGRRVLDLIIPRREIAPNPTTLVAANAMCMVRPQAGPDATAGAIPLAAGQLFNMNFIGAPAFEEWPLTSYLYSWAGRFDITTIRQYMGRLATMVGVKDAYWAAHELNVALSQVAPKMTTAAGGWAAQAANSAQQSDVCYSSLLTVTRSAANFPLANQPAADMRVYDTDPATWNKVALGLATAANLVPEQSMDVPFVVGDARTSFWERLQAIPMCIAWTMYYHSRGITTLAWDNAYTDNTNKWLQKMVRNTFSTTQSVGTIIPARYGKIVCNLYKNMFHRAPAYVATSVGGKELHITHFERWLPGGTYANVYSGAGAVVNCFSPVLIPDIWCQYFTAKLPLFAGAFPPAQGQNSTKGFNSKQGLMIHRNQNNNLVAPYLEKFADNSSYFPVGQGPEINDMATWNGRLWMTTGNVQYLDYSGAAIVEAVPPAGELPVGKQIPLLAGENAPIELTNAATTCVPRYSNDGRRIFTYLTTAQSVIPVQACNRAANLARSCWLLSNVYAEPALQALGDEVEDAFDTLTNSSF 842 T 0.16 TPP_enzyme_C pdb T Viruses T 6s2d 1 A A Q90VX5_PSEAM Pleurocidin-like prepropolypeptide GKGRWLERIGKAGGIIIGGALDHL 24 T 5600 Antimicrobial12 unppssm F Eukaryota T 6s35 3 C C ALA-ARG-(D)LYS-MET-GLN-GLU-ALA-ARG-LYS-SER-THR ARXMQEARKST 11 T 15 DUF5915 pdbhh F T 6s3d 3 I,J,K,L M,N,O,P S0_2.126 ASPCDKQKNYIDKQLLPIVNKAGCSRPEEVEERIRRALKKMGDTSCFDEILKGLKEIKCGGSWLEHHHHHH 71 T 0.18 Spo0A_C pdb F T 6s6q 2 C,D C,D CIF2_ARATH Protein CASPARIAN STRIP INTEGRITY FACTOR 2 DYGHSSPKPKLVRPPFKLIPN 21 T 0.2 Inhibitor_I53 unphh F Eukaryota T 6s8r 2 B B GGYF1_DROME GIGYF family protein CG11148 DENLPEWAIENPSKLGGSFDASGAFHG 27 T 1.6 Tipalpha pdbhh F Eukaryota T 6s9k 2 B B CASP2_HUMAN CASP-2,NEURAL PRECURSOR CELL EXPRESSED DEVELOPMENTALLY DOWN-REGULATED PROTEIN 2,NEDD-2,PROTEASE ICH-1 DYDLSLPFPVCESCPLYKKLRLSTDTVEHSLDNK 34 T 10 DUF4073 pdbhh F Eukaryota T 6saa 1 A A A0A1V0FWW5_9ARAC U1-theraphotoxin-Pf3 RCLHAGAACSGPIQKIPCCGTCSRRKCT 28 T 0.066 Conotoxin pdbhh F Eukaryota T 6sat 2 C,D P,Q FTSZ_CORGL Cell division protein FtsZ DDLDVPSFLQ 10 T 2.2 DUF4809 pdbhh F Bacteria T 6sb1 1 A,B A,B MPEG1_MOUSE MPG-1,PERFORIN-2,P-2,PROTEIN MPS1 ETGGCTNVDSPNFNFQANMDDDSCDAKVTNFTFGGVYQECTELSGDVLCQNLEQKNLLTGDFSCPPGYSPVHLLSQTHEEGYSRLECKKKCTLKIFCKTVCEDVFRVAKAEFRAYWCVAAGQVPDNSGLLFGGVFTDKTINPMTNAQSCPAGYIPLNLFESLKVCVSLDYELGFKFSVPFGGFFSCIMGNPLVNSDTAKDVRAPSLKKCPGGFSQHLAVISDGCQVSYCVKAGIFTGGSLLPVRLPPYTKPPLMSQVATNTVIVTNSETARSWIKDPQTNQWKLGEGTKHHHHHH 295 T 0.37 UN_NPL4 pdb F Eukaryota T 6sba 2 B B VGLL4_MOUSE Vestigial like 4 (Drosophila), isoform CRA_a SVEDHFAKALGDTWLQIKAA 20 T 0.00088 Vg_Tdu pdbhh F Eukaryota T 6sft 1 A A A0A0H3CCM2_CAUVN Two-component receiver protein CleD SKPREWVEAVAYVGPDRRRFNSADYKGPRKRKADAS 36 T 14 DUF1816 pdbhh F Bacteria T 6sg9 4 D FJ D0A8R9_TRYB9 mt-SAF18 MLCHNRLLLVNKQTQKYRTKLRYRFRQPSVVPLRQTLQQRHNTILEVLRRRRINSGDQSPYRYVEERLYSKPSRLDREGVKVNKTYALQGLGDLEPLRYGANFGISEKDALKYETVAEKAKYMEPPIPYSSLAARKLAAGALWPAAPDPEGMISKEVRLLRHESSMSPSARAFSERVAYHLRRSLKACPGHIAEHIDFTQLIIQEVLGSRRSKEIYIVWFTVDPGARFELEPRLHQLNHWVQQLIIKRVKRRPHIPRVTWIYDGGRLERELPRDVKQELQSFVADAATTLESRVKYLKELDTMNQRMKDIPWFMPYLWSKEEKAARQKSMLADLEEVERRKNEHSSGRSAPPRTSPPPQFVR 362 T 0.041 RBFA pdbhh F Eukaryota T 6sg9 5 E FY A0A3L6L5M9_9TRYP mt-SAF28 MWRLSRSLRSNSLHNPGPFLDGALQLIKLHLAHKNAAADKNTKACSDIEGEFLRELEAFRACFTMSSSLKVAKLYTKKLHGALSYFQLYDDPLMRQLDMIIGKQTMQPSAGRQHGVFKAPVAARLDPFFLDEREETVLPSELPNPPKPDPSTPLRERALKVPAQHRGHWVLRDPDIAITREERRTDPW 188 T 7.4 Hist_rich_Ca-bd pdbhh F Eukaryota T 6sg9 6 F Fa Q57VU7_TRYB2 mt-SAF30 MPPNSATRWLPFVSSDLKDYLNRYWAVMFTVGARPIETGHIRHYVSWYCTRMKVVLLDHHVYVEPLRQQLQEASRTPELPLLFVNKKLVGTLRDVELLEREKKLKDVLHFGFEWRVGGSVAATNGQKSLMGALPAPYGDAEFFRGRYRGPPVARPVVSLPTLHPFALRSEE 171 T 0.15 Glutaredoxin pdbpercent F Eukaryota T 6sg9 18 R DE Q386Q7_TRYB2 mS52 MRRCSFASTCRYRSADDVPLGLGSAYVWTCRNITSRGQFNPIHNFSYAMERGVRARDVKAFEKLITNPGPLRVAYTPDYLDWLHRCYKAKGTYMDARAVAEKKFNGNIVSSELSAAVNRREGKRGDTRGDTVDNDHHNLPGAPPPGMFLRPAHSFRRLAGELKRRRAQSILDEVARAQGMLDLFERQPHFPAIHIDRCSRFHLVELFKEMVLERSLDSNMIWEKALLYRAILSERKPSYPTSFHYIFTAVEDTVFAPTISTPMEMAGSGGEGHDRSSHPLAAKCPTLEAYYYYVYLVKKYYIDNAVEAHVVLRCHREPNAADLLFSNPPPKDDTEIMKAVELLRNADIQRGVAAAAAVSDPTLPPGGEGSVIGNSDNKKNSEKTSEGSRGRPARPPVLPGAYPPIDMLWRCEENLPLLKVLLFGEFNLIVSENPFVKFPSAHGFLTRPYSTDSSRTLADGMSLANVMAEKRGHLLPSLPRNTATSIDARAQDIRRLQQKHHRDDIVSFQKLLRSTHAEDSPSAFSSYSDWSYFNPRAVRAEERDRLTRKAVEALKLYDSATNDIYRHSFEDVQACHTQRVTERDRTMPPYLPTLPHFVAIIKKDPHISFLLHIGLPDRNSSEEGSAKHKELEKRIYYLARALYHTALEYHNETVRRVNRQKVNVAASLLDNFVEQEWTTILRDKHDVTDVTKTLNDTQNDKKQLARRLGRYMLFANRSLDDTGFPTDARADDYTRWMAPPSVGKVSL 747 T 7.7 Complex1_LYR_2 pdbpssm F Eukaryota T 6sg9 19 S DF mS53 MSVSGVFSKGRGIGHEATTSILRYIPRARVPWQPSRFGRENLTAADMARLWSRGRYRDGPGNYNSGYCTERTHVLEENTVSIIPRRELEKYMPDITIGPKALVTPVSLMNARNGHRVTHDLLHSYDPHIGRLGKPAVVDHDNITVEDPNRVGLNAATLDCRGRIYRWLRRGPFFQVDNYFRRSVKLNRDGTLPTDFVHEAPLMRKIIRLAHRGHLKAACEEYRRVTTVPPVEVYRALTACCVPGAKLADAVSIFEDGDSKLFYVSRDGEVLHNLMRCAIAARHRARIMWVYNVMRGRFYENVVVRAEVDLIWRYRIAMIALEYLLDHECAEEAAAIYSYLVEEELLRCDVHVRVGLHMREAIAAGKPITLNNDVMNATSLVRDATAVAPEVARELQRRHAQTLQNNAVEAVGAENVREGSAPWSILGPLTAIGPTAEDTMVWLQQHYGDVDVMSIMRWARFRKGKDLMAKDRPQYLARAAAWIELLSKRNREMEEVPLTYMRKSKPLVLDTNSNVRVAWQTPLMRSGGPPRLLAREEGYVFHHSNSSRFVEETYRHPGESLQSRYLALQPLHTEVSAKEDFQRLYYQAQKHHKQQERLLPVSTAAIPSRIVHHSVMSALHGVSGKGEPANRSLFTGNKSNDSHSNTGVASGTSACTDGAGSATPEF 666 T 0.00091 PPR_long pdbhh F T 6sg9 28 BA DY Q57YD4_TRYB2 mS72 MLRSTLTSLNTFLTSSVATPPISVIRTGPKWWAHPERMVRQKLMYFTLGVDQLPLRRTAVIQRDLQRFHMCKPPPRVGDSTGYKRSRAAQLNTWYRRIQYQEYHMQHLFTRHVWGLLRVYPGNTTKIQGKADDGYVGYDSVPFHRYNRAPLPFPARELYERRK 163 T 37 MBDa pdbhh F Eukaryota T 6sg9 31 EA FD Q583U2_TRYB2 mt-SAF12 (KRIPP18) MLIRRLRRPHSTVRGCRISSASNSDSAGAFSANGTQLPDPPYYLPHSPRFDAERCGTFNKKWLLNLPALKPLVRNSTYLPKKEELWRAPTHEALETIIGHLPYHDALRYITEHSLFLLFPTVLRARDAPLPHVIYEDFMKSCTFASLQNPPEEQFALPSVLLRTLLCMAAYHCTLDADYFTTCQMLFGRMEQQQQTTPEVLSAWVYCCTASGRVDEALTYAKYMADCSAPFDVTVFSLMQHPSLNPIEVEDGSVPHSAKGLLLQRRLGNRLHTAYRSDAVAAHGMFVYYALTLSHVRKWEVIRAAAALGVTLAERTVVLAVEVFAREKGMRCGPKTVKALTHFLAQDGTVGHLLYVLLRARKNELLPEFRDLPHTTFSEEEQELVLQCVAQRARHDDSFAVAATLVSSLVREDDPSELLMAFARAARNHHSADVCGGDGDGSVCADVPAPVPESPPSNSSEIIEKDRWAVVQASVRSLLLDVNALDQASRRDAYHKHWKNGNKGVKNKENVLAKTLTLPHDSMHTATIQRKEKELWELMRSDTPVGVRELAQLNIMEELQEAKRLERAEMAWVNPDGTF 579 T 0.3 PPR_long pdbhh F Eukaryota T 6sg9 33 GA FG mt-SAF15 MRWSAVLCKVSKVPTRSVIPADALRYHPRYSVPRKMVLSNTFNVVGENNRYTSLKLILEKLSGHVGRRQYKMLCNLEKHYDKLKDEGIDWHELKWLSREELIVLFDKVLHLTRTERAALLPAIEAKVCGVLRQTDSRHSTVVCMNRGNANHGWRCGRNGHTNDVYFEGKAKANTAALQLCRRSSVRSVERSGVLVEVRSEPDFSVVGRFDHSLSPRVWSTPENPTFQVTTIGYEFRVHQEDPRVIPQIVEAAEEWELHANITKQVIWEMLEMYAVERDRQPLDLKPGEMGDPDIPSRHATAFNVQVVPPLSEGDEAIQVVEQRIVRADGSEVPWFQEPPPQLFSGGIPVILPFAPSIIVKSTFRQVTRSSAQDVTRQLLQPVVDVTCFLHPNVCFWWNAEDEQRCLGHIVDYAKRIPFALPFNLYFRVNLSKDLRGVQNYTEELGKRMSMKAHYFNLRSYGVR 463 T 0.089 DUF1285 pdbpercent F T 6sg9 34 HA FH mt-SAF16 MLVCCRSSLSLLARATMPLCCSRRFLTHQNNIDDISGPVDTNSNSVSDGRLHCSTGEGGKASTCERVSLRTIAESLGAAAAAELRAEVERDTRDGVAAIPPLPPLGWRVRHPSGSNYFVMTRTLKNGVQSAELNNRRYRSVHDIFLQSLQKGGHKYAQKGKGSEGRQKDAKEEEEPPSQEDGKSSPKVTVGYDREGRGHRATLQRMDELHDSPKLSRADVHLTVFAPFRVYDPSLHDPTVDICEWSSFDLVVQKTVPDNMVANKLLQPLSCTPQDGALSMYVCLASVNSEMRIRSIQLLSMKEAQALVEHACFGNGEPLFLELLRRRGRRRPLVERRFDDPRLRYEEVAQPQQVADEAAVACSSSCYGPYYPAFEMLMDSCGSAGEYSRALCYGGPYVSELSRELCDALLDYIKGDLGVSDQLCEYVCQMQFFLEQEEYMTWLGQVQHVANAVSRTA 457 T 3.8E-14 MAM33 pdbpssm F T 6sg9 35 IA FI mt-SAF17 MQRTLRSAARRKWGQKTWSPTATNGGAAPANGVSAQEALQIAYRPMPPSQTVEYEEDFGHNLMIHREYISKRCRDRVSFELSALSYSNLELRRGQEHLAGIMNRERRGVSVGASGAPDDQVQMQTDVDANSREVLSARYLFNERRLQFCDRFQNFFQSKLENSAASDSNGHEKQHLFSLMEACAVIFGCETEAARETYYRMFLGLDSETLLEEDEALRNRIADAKLVQRVLENNKGRQEVTQSPKLQQQQDQGKPLHAVSSGTSLLNDCEEERFISSIPELSLFEDETEARANGFVEGEDDVKANNGDLTAGSFSSPASSVNLPEEFEEYAPLYKAYITHAVGKGPVASYDISTLGSTGLTAERRRWRTLMEKIVREDYHTMTEVEQMDAIVLNEQLHTVKFFDLKIGDAIRDILQLLQRETGVGSSVNRDTPVGISPNNPERRV 445 T 4.8 DUF3221 pdbhh F T 6sg9 36 JA FK Q57XS8_TRYB2 mt-SAF19 MHSSLIILRHAYFSALHPARRVVPGSLLPVRTQFYTRHFTSTAGPTCGDGGETYKSEPTKVGASVEGTNSGNGVTDSPSLFSSSAPTVRRRALPPSDFPENALLKCIEKEIEDEALRLDKEECPPPPPTGWEMYHAPGTSVFYGRRWWLPATASAETRATPERHTIRVQLTKRDPSLDPECDVRGEHFPFSFFVQRAPSKGEAVRRDGTFRMGDSAAAGDVKGRTEGKEEEEEEEELGLYDQSIEVRADFVDGELLVDNVVFHGTFKTGSSCSKRSGNTSPEAAAATAAGQHDNTTGGRGKVEEVRYNNIFNGYPGPNLDEAEEEVLDGLQAWLAERCVDDQFGEFVGQYSVWVEQQEYEMWLKRLRDFVAA 372 T 1.9999999999999998E-26 MAM33 unppssm F Eukaryota T 6sg9 38 LA FV Q38C60_TRYB2 mt-SAF25 MMKRTILQRCIQNKSLEIARISRSDINSRAHLPFNFDVCYELGSREFTLFSSVGSTSVLVFCNVSSRRLRSVKGGQGETEFPPKRLNVKRRRQSGSADRSPVVFSAFVSMPTSGLTIEALCCSSLGLLVVDGVSFHQGPLTESMIPQDAHPGPEGHYQGPLLNQRSLAEMVVNTRGCITSVDPFRQQESFFDGHVNPWNARSLRFGHVPVHTAKPGFSDALCHFLEVFGVNDELAFFVEDFAHLVHREEETAWTNVLKTMMGGR 264 T 1E-07 MAM33 pdbhh F Eukaryota T 6sg9 39 MA Fe Q387L0_TRYB2 mt-SAF34 MSRSTVFGPGSLYSFTKFGSFNRSPTNCTLNKRMKDIFRLENQKHIRNDFDRERRYRMCTKCGITTVTINFNNVPSARVGLWGRCADDKDYTHHRMVDITQREYEVLRESPVEKRLNWWRYER 123 T 0.022 zf_C2H2_13 pdbpercent F Eukaryota T 6sga 10 J Cb mS23 MRQCVVRRYKMPKNMGVAPRFDTWNEKYEPWEHMKRMGRLVGTGFYIPPEWYNHFRMFPPINHNFQQEKTLNPHNASEPTQDDTSTLSPERVALRDELARKSRLVASEGMRYYNIFWVRKPLDTMEKEYYELKRRGVDHGEAIRKVLQGFYSGLAVKKRVAAIQAEEAKLTGRFITMREATVVLGVLAKLHKEQLTPHQVSLLAKEQGETTQSGAKLTAIVSRTQPHVNKEASSPATSEAVGSSTEESLSADALASMLSEDGEQSAVGTRYQVEVKETANDSVRQLREKAEDQTGFPDWYTGESPTYSGTS 311 T 0.038 CHAD pdb F T 6sga 24 X F3 Q38E61_TRYB2 mt-SAF3 MLRGWHPNSSAMQVGMRHITIGGRHSRGGFRQPLGKHPQVKQGTVEGVPRRIPGTTKVTYTNKKGRTFSFSVPVSELTHPQVTLESAAGTWREMDTSFCELGDIEDDMPSPVDECLRGGSSLDKRLIQEVRERFVSFCREYVLMDTSGMKSTILSTELNAGPDYEHYDRRLRRKRHWLAIRHRFEDVRYVIWPDVVEETARGDSAQADVSLTNPSLTAGEMLEALLWLDAASTFCVRKVHPSDLGDKSEFLPLDLQREVEVVACHARRDLDFFDPSATSLEQFTACAALCVNHRVPFSLFFPAQDVCGDASVSTGQCIVANAPSPHTALGAVRIMALISEGSGSDIGKTIMFSDAFGAVTRFGILRGLSRVMSVEAFGCKDALENVNESELCIILHFCAEVREQNAAFFRRYEASEEDSDPQQVSFLAKYQQLSQIALARCKRLLYHPDSPRAQVMSEDGYIPLVELQRHAEGTNKAALIHYNLGIRSAQGMRRVALGAQSSARLAELVSRLEEASARVSGNTLVNDLVHHLSHKAAAGKMSLTLREVNTLLPLLSRMRRESPNGALDARFDRVFNAIDTAIGAAMRHNCTLDELLDLAEGLAACEMVPSALKQVEMVLIRSVMMHECSPMHLRRMLQAMFTLMRTSVPQVLLQSVASRVADYIKEASHMDSSSSNGGGDEKVKNHEECEQLLELLVVLGKCGYGALPGLVTIYWEAQLIDSMQLNPRLRCSYASLLASAAFALKKHDKRAWEGLADESHRLFMEYTRCNKENDIGRFAECVTGLAVLTQIKDNTNSSDVAFLKEYLSATSLELKSCEVIRVQELTDLLGRTLEWSEALGVVAPDVVIQLEKALFVMLENVSHTAPGVGIPDELVTAACCLVDMSSASLELRKAAAGVVGGAIVHAEEALETLRSGAPTQVRPGHSFDVAALASAERENVYKNSILQYCAALQRSGMSTHVEELWS 966 T 0.074 DNA_ligase_A_N unppercent F Eukaryota T 6sga 25 Y F5 mt-SAF5 MRHTIPFFRRSAFVPAPGSSLLNPRSQRAKVRRMVAAQKAQGENFERQALYAELGGSPSARAPRSKGERSKEATRRVGCEVAERAKHMTDAEWEGVPVDEKHAFAKYMHKVLQEHPTETTEQQRRRYFETTMADVFELDPRKTVRDEYERVKLGLPVHLKNPQYSLGVSQAVYDAADASLFDPENVHRLENAMTHVKQVFADYVHKKREGVSTEAERRMLANLTAELNLETQKHLANMFKYAEMRLRQVKLEERHHQLAEIERLRRMAQQRGGVKGRKGGSRKMSRMERLKRVINRAVGLDIAVAETVLTEMQAQEEFLQFCEVFARLTLGSGFKHTGKDENLSAYIESLRKLYSMDAATLSTLDVVQYYSSKEGAHPVDWAKRWYERALLLPLQSTPEYQKLLQIQQRDESTVKHIKETAGTGHAFACEAEAEVARIKTQKVVNLVEKMFMDPKDKRLESLHEKRLRYLAHMQMERQIRCVRENAKLFDGVENMPEAAQCRELYEKIMEKKTAQCNMTSPPEGEGSAIQSAKTEGDHCEVGPGMFNVYDDAEASSLFEKIREITLRVIRDRRVQSAAATKARMLNRIIRSLKGGERSIAEELRALHQQRKEKMTMRILGIIENDVKTEMEWLQNMEEAERPPLLPIPENMSYVSAADVQAWRELREDDERKAANPFERRRRTFQPELLGQAWSVPNKPLLFWGTGVSAVQQALRHVAEDAERKRQGLLLAPPYPCAENPWGWRLAKDILDDNN 754 T 0.12 FhuF pdb F T 6sga 28 BA F8 mt-SAF8 MMHRTAARLRMPRKPTPYVRKFLEGCPLPETLVDDIAGANLKSMAPFFTTAPRYIVAAESRLSKLFFHHALYPAGGARRPCRVLIVRGGRSVREPSFTINTGGGRGEVGGGSRGYRDPARRAYFYARPSTVGPFYSGNGGVSSNVAKCHSGVGSEGAGLVKRASVDGLLSPLCGVIEAHFAVGGTCNDAVATEGDGTESLTKGGEKLCVTLAGLLSGGHGGLMMDDGNCASNVRAAKRVARLLHDAAHHLSSFFYVHTQLPDSALFVSRPEASIASDGKGDGLLPSHLAREKDNDGRPSVEGVAVFRLAGGLEPTVHFAVGAPLSVLQRGVDGTASRGEKNSAEEGLNSAASTTVLPFGHIQCLLRVRTRGGKHCAAGKEGSEGTSNTPWCNTAGNDDITSNFAASGPQIAGGIVEPWKLGVSLDPKVPFFMRTLTEKRPSFSCGEGYLGTCSRSGDVDGNTVNDVSSSSGGSRTRAPGEVHMNHLLVRNDCETYLLPQRELLLSFHVPEEAEAMCKEQNEERMRRQAALGYGSPSHVFAEGPRTFARVLHGMKANLAAVEEASSTFRQGAAEGISPQVNGGSTTSGSSRVYEVRALPGDVVFVPRGWKYSVERIVGTAIIDAVAASTASPREALRAVFRTAPDPPLPQDVVRCDEGHAGAGEMPGGDSSNAEIVGVEVDAFVLCYKPYPVLSNAQASTYVAANYVHSGIDDFYAKGGNDVYHKYT 726 T 0.02 Cupin_4 pdbhh F T 6sga 29 CA F9 mt-SAF9 MTGPTRALFLSSGINLGRLRLAEQFSSMNGWQSKEDPAFDAYVKERRRKENYEAFDQRVERGYAAAAKLHKAEIQNAVKRRLKSSGAKFTAETLREMSSAVTERLAWLRDVWAQIDADYRSGDSARQETAAQEISAALRGEPNDYMRWVYETKRELRFAGPVGRRAIQEELQAAELPEVLDEEVNRYHDLKLNMMEIEREVKAKYGVAGQQHWAELQAAKDEEYIQKLDEAAEVYKQLLDQSARLDESRRSELQRSYVERVHQAQVRFKAAMELEGQREQLIEAHQAMKEERMRTEREKRRQLLREAAELRAQGKKSADVLTALKERQLDANAKRQAEYELKECEDILKRKSEMLDMIAHFKHDVEEREGREMLQRQKSDEERQVNVFGFYEEVGVEDGLSISSEGTTSQGGSSGLGTVSTSTSCAKSADSNSSAQPSQKLRKEELWKVINADTYEDPFRTVHQARLDAVKTYDPAYARTFPLNLVLGRKYSRQGAGEMAAGNETDKQILQKGNNILYSFQWGLNNGTVHDLDADGGTDYFMDGAFHVRDKETGDIDWRYEKKRGGPVFRGPKFYRLGAQREAADPGERAMDPTPYTSTPREHKWRSS 608 T 0.11 MDMPI_C pdbpssm F T 6sga 30 DA FA Q386U1_TRYB2 mt-SAF10 MPSASSGYTFADFLRRLERSPDSHMAPLYHEHRELFVRRHDMFARVISSVTWSKGVALVAAAGYTQAVNVTIYRALLARMLLHNRHVRQCGAGSVVPWSAALRTYSEAIATHGNAVPTRMTLSALRLCTPARQWVAAISLLMLSQANDKLTLPMLIDAAGCCATPAAWEKAMALLGRFHAQSLQVLPDSIQSLRPVGTSASTVDAAAHALLPRSEGPTPEQKHILTVINKVVSAVPWQVALSNEMCRSYLTHLVASTTLRPTEKTASLTTAVQQLPWEAFVTLMKTVTATVQEGSQGVLGSRSTPQLPPPQDGMERGDVAKSLLSNSIIREGVNLLQSEPETAIPFITTILYKLPSAEAAALFLSEATSAYRNSSSAVVAAAIRHPVVVGALLKRCADSNSWYLAASIFKSTSPTAIPCDVASDLVIQMRRANQAPLVVDVLQKYIVPSRTKLTEEAIEAALLCVLVHNRALAKASAVVAGTSPDNRTGKPNGIGVANGVHWISALSWATDLLEEGVESRILQTGTTPSVGGVNHEDPTVLLRKKTLSPRILSLLIYICVNAGSPRGGLFALGYARTVSKTELELSEEITALLYCMMYDRPREAESIIQHAVKKHGEYKGKYLGRLLVASQEAKGSALRNQT 642 T 1.4 Luteo_PO pdb F Eukaryota T 6sga 41 UA FZ mt-SAF29 MRRKTTLNIGQVICFSSWNDGSEGYEWKSRALSEKRSLALEFLGNVNKRVSIHDAIRLKADINKKAISNVSCPSFFSGIEGADEDEDQSDMSLCSLLGVLEGEIETDCITHLSPSDASLLKEEFLCDYDPSDTKRMAKWVNLRSETSDYQSYGAIPEGERSLWSAWYLRNIKAGKKPI 178 T 0.02 PH_11 pdbpssm F T 6sga 43 WA Fb Q581U4_TRYB2 mt-SAF31 MNCSSTLACHAVVSAPSTASLITSCWTPQQHCYRQLMKSLRAAYFHDRSKLFWSRHRVLVEFYKYSEEANEEAVKQLVAIGLEVAAFIDHHMRTDVERIVKHNETMMALPVAQAKKFRSDYLLAEKQHDSWCKQKIKNIMKRRPPPPYPFF 151 T 0.019 Complex1_LYR pdb F Eukaryota T 6sgf 1 A,B,C,D,E,F A,B,C,D,E,F C7G9B5_9FIRM Beta-xylanase GAMGVKKVFTADQLKVAWGDADYELADGQWKLSFAKQYNQVKWTLPESIEMSQVNAVTFQVADQKVPISLKVYNGGDDATAANTQYGLSGQTEYTINPSGDGAIDAVGIMITEDKPENATVSLVSVTFELKAGAGDAKLGD 141 T 0.00088 BspA_v pdbpssm F Bacteria T 6sgo 1 A A F4S7L2_MELLP MLP124017 MELPESFEFILTEDMVTDLDVKGLGYDFIDLVTKSPDSVNSEHELAHFLGPHDPEIYVNGKIQTTTAFLQFFRQGLFKKLKDAEFAINVSGKVKEGEGYKLVWKSAAQRSHDQKIRWDEAEAYIWRRKDGSCWLHSVKFIMSKAAPYVAIDHHHHHH 157 T 4.3 WLM pdbhh F Eukaryota T 6sgw 3 D,H F,J ECCC3_MYCS2 ESX CONSERVED COMPONENT C3,TYPE VII SECRETION SYSTEM PROTEIN ECCC3,T7SS PROTEIN ECCC3 SRLIFEHQRRLTPPTTRKGTITIEPPPQLPRVVPPSLLRRVLPFLIVILIVGMIVALFATGMRLISPTMLFFPFVLLLAATALYRGGDNKMRTEEVDAERADYLRYLSVVRDNVRAHAAEQRAALEWSHPEPEVLATIPGTRRQWERDPRDRDFLVLRAGRHDVPLDAALKVKDTADEIDLEPVAHSALRGLLDVQRTVRDAPTGLDVAKLARITVIGEADEARAAIRAWIAQAVTWHDPTMLGVALAAPDLESGDWSWLKWLPHVDVPNEADGVGPARYLTTSTAELRERLAPALADRPLFPAESGAALKHLLVVLDDPDADPDDIARKPGLTGVTVIHRTTELPNREQYPDPERPILRVADGRIERWQVGGWQPCVDVADAMSAAEAAHIARRLSRWDS 401 T 0.07 Bac_export_3 pdb F Bacteria T 6sgz 3 D J ESX-3 secretion system protein EccC3 MSRLIFEHQRRLTPPTTRKGTITIEPPPQLPMRTEEVDAERADYLRYLSVVRDNVRAHAAEQRAALEWSHPEPEVLATIPGTRRQWERDPRDRDFLVLRAGRHDVPLDAALKVKDTADEIDLEPVAHSALRGLLDVQRTVRDAPTGLDVAKLARITVIGEADEARAAIRAWIAQAVTWHDPTMLGVALAAPDLESGDWSWLKWLPHVDVPNEADGVGPARYLTTSTAELRERLAPALADRPLFPAESGAALKHLLVVLDDPDADPDDIARKPGLTGVTVIHRTTELPNREQYPDPERPILRVADGRIERWQVGGWQPCVDVADAMSAAEAAHIARRLSRWDSN 343 T 22 FtsK_SpoIIIE pdbhh F T 6sjl 2 D,E,F D,E,F A0A377LA80_ECOLX Putative type VI secretion protein MTKYQGYDVTDATHKTSIHNDWKVVVAKKKPARGVTLTIGIFFDGTGNNRENTASRLMKFNECSAARQGVNQKDAQSCEDFLKEINKNSISNGSYRGYYSNIHWLNILYHPDQVLKKDQTSAQIKTYISGIGTAAGEADSVIGMGLGTSILDIFEGVVTKTDEAMERITQALSEFMGFNLSPDFCIAKIQFDVFGFSRGAAAARHFANRVMEQDPAIARAIAKGLRGDFYDGKPSGEVRFLGLFDTVAAIGGISNFFDINGRSNPGVKLELRPSVAKKVFQITAMNEYRYNFSLNSIKGMWPELALPGAHSDIGGGYNPVGSPLQENESLFLSCPEFEIVSDDTREMDTRVYRKAEQVRKMLMTLPALKHILPHGKLTTKIRSIGVNNSNQRRAGVIQKQVGAAVFFERMAVPNDWANVCLRVMLDAAQEAGVLFEPIRQTNTELQLPSELIFLADKAIAQGKAVRLGQEPQAFTEEELYIIGKYTHCSANWNIESDGNLWVDPTTGEIFIHRFGPKGNKAFVFPNKPNDRWIRSVWYMDDQQRLNDNAVKNTKVMMSGV 560 T 1.3E-11 DUF2235 pdbpssm F Bacteria T 6sjt 1 A,B AAA,BBB A0A0C9MKT2_LEGPN NttC MAHHHHHHVDDDDKMAPAYLTTHNRTGEESNAYIAGSIPSLYPTAAYSTNQVYWNLVRLACYGHTTNGQCPALIKMATNTANPIDIGYVTMDLNTGDITPKTLSAKGYSLRVIGPGEAEITKN 123 T 0.96 DUF6488 unphh F Bacteria T 6sjw 1 A A FRPC_NEIMC Iron-regulated protein FrpC PLALDLDGDGIETVATKGFSGSLFDHNRDGIRTATGWVSADDGLLVRDLNGNGIIDNGAELFGDNTKLADGSFAKHGYAALAELDSNGDNIINAADAAFQSLRVWQDLNQDGISQANELRTLEELGIQSLDLAYKDVNKNLGNGNTLAQQGSYTKTNGTTAKMGDLLLAADNLHSRFLE 179 T 0.08 SdrD_B pdb F Bacteria T 6sjx 1 A A FRPC_NEIMC Iron-regulated protein FrpC GSDALALDLDGDGIETVATKGFSGSLFDHNRDGIRTATGWVSADDGLLVRDLNGNGIIDNGAELFGDNTKLADGSFAKHGYAALAELDSNGDNIINAADAAFQSLRVWQDLNQDGISQANELRTLEELGIQSLDLAYKDVNKNLGNGNTLAQQGSYTKTNGTTAKMGDLLLAADNLHSRFLE 182 T 0.083 SdrD_B pdb F Bacteria T 6sjz 2 C,D E,F AIFM3_HUMAN APOPTOSIS-INDUCING FACTOR-LIKE PROTEIN XGNCFSKPR 9 T 0.062 NifU unphh F Eukaryota T 6sk2 2 C,D D,F AIFM3_HUMAN APOPTOSIS-INDUCING FACTOR-LIKE PROTEIN XGKSFSKPR 9 T 0.062 NifU unphh F Eukaryota T 6sk3 2 C,D C,D AIFM3_HUMAN Apoptosis-inducing factor 3 GNCFSKPR 8 T 0.062 NifU unphh F Eukaryota T 6sk8 2 C,D C,D AIFM3_HUMAN Apoptosis-inducing factor 3 GDCFSKPR 8 T 0.062 NifU unphh F Eukaryota T 6skw 1 A,B AAA,BBB A0A4Q5N6R9_LEGPN NttE MAHHHHHHVDDDDKMNSDDNADGLIFSPLPQNKNTVVRHYSNEQEMPNLSQMAQRTIDFPTQIVRVSGNLTGLELSCDDVENEIDQVFSKKISPNLFTYNTYVSCGYDVNDPEQHATNFSIQSYFDPLTDNAVDYLKSYLKEYNGYNLFNTTTLQIENAKGIIVSMNLNAGLKSNPDKTPFTLYRQDRNNFYFKSNFDVRKELISDIYQRFYSNDPDMILPFFDKWIFSYAGSVYYSILMASNYLELQPERIFVMENEGDIFVSDLRYYFANLCMKRNPNKHCL 284 T 0.07 FAM117 pdbpercent F Bacteria T 6sl5 13 M O PsaO LRVDPIVPAISFVGWTLPSNIGTSALNGQSLFGAFYESIGQNLAHWPTGFALDDKFWLYMVTWHTGLFIVMLLGQVGFKGRTEDYF 86 T 23 YWFCY pdbhh F T 6sli 3 C,I P,H ALA-SER-THR-THR-GLY-GLY-ASN-SER-GLN-ARG-GLY-SER-GLY ASTTGGNSQRGSG 13 T 56 CtsR pdbhh F T 6sli 4 F,L E,K ASTTGGNSQRGGG ASTTGGNSQRGGG 13 T 5.1 CtsR pdbhh F T 6slj 3 E P ALA-SER-THR-THR-GLY-ALA-ASN-SER-GLN-ARG-GLY-SER-GLY ASTTGANSQRGSG 13 T 56 CtsR pdbhh F T 6slj 4 F Q ALA-SER-THR-THR-GLY-ALA-ASN-SER-GLN-ARG ASTTGANSQR 10 T 40 Orbi_NS3 pdbhh F T 6sln 3 E,F P,Q GLN-THR-ALA-GLY-ALA-ASN-SER-GLN-ARG-GLY-SER-ALA-GLY QTAGANSQRGSAG 13 T 21 Ice_nucleation pdbhh F T 6snt 80 BC NC YEG7_YEAST Uncharacterized protein YEL057C MANDGIQRNDNRKGFKTVQFSAYSKEIDVIMKKISFLERNITQQLDTLPHFPKTLPPNHKDCVSRKHRARRGWSSQLKNLLGIYSKEEIFTLDNLAATLHDQVLKLQATLFPNAILKQVHLDNANIENKRILKEITYKYLSNENCKEENKFGTFIVKRIFFGDLSLGVSVLINRIAFESATSSIMVVRSSFIESDFFYEDYLIFDCRAKRRKKLKRKILFISTTMNFNYQTKV 233 T 1.8 DpnI_C unppssm F Eukaryota T 6sok 2 E,F,G,H P,Q,R,S Twin-Strep-tag peptide XSAWSHPQFEKGGGSGGGSGGSAWSHPQFEKX 32 T 3.1 PNPase_C pdbhh F T 6sr6 2 B,D B,D G0RYD6_CHATD Putative ribosome associated protein GPAMNATVVSLPLPTLPEGWAAEKDFKAIGKLTQEGSSMRTLEPVGPHFLAHARRVRHKRTFS 63 T 5.6 Suv3_N pdbhh F Eukaryota T 6swg 2 C C TASOR_HUMAN Protein TASOR MSETTERTVLGEYNLFSRKIEEILKQKNVSYVSTVSTPIFSTQEKMKRLSEFIYSKTSKAGVQEFVDGLHEKLNTIIIKASAK 83 T 0.021 ERAP1_C pdb F Eukaryota T 6swy 1 A 5 VID28_YEAST GLUCOSE-INDUCED DEGRADATION PROTEIN 5 MTVAYSLENLKKISNSLVGDQLAKVDYFLAPKCQIFQCLLSIEQSDGVELKNAKLDLLYTLLHLEPQQRDIVGTYYFDIVSAIYKSMSLASSFTKNNSSTNYKYIKLLNLCAGVYPNCGFPDLQYLQNGFIQLVNHKFLRSKCKIDEVVTIIELLKLFLLVDEKNCSDFNKSKFMEEEREVTETSHYQDFKMAESLEHIIVKISSKYLDQISLKYIVRLKVSRPASPSSVKNDPFDNKGVDCTRAIPKKINISNMYDSSLLSLALLLYLRYHYMIPGDRKLRNDATFKMFVLGLLKSNDVNIRCVALKFLLQPYFTEDKKWEDTRTLEKILPYLVKSFNYDPLPWWFDPFDMLDSLIVLYNEITPMNNPVLTTLAHTNVIFCILSRFAQCLSLPQHNEATLKTTTKFIKICASFAASDEKYRLLLLNDTLLLNHLEYGLESHITLIQDFISLKDEIKETTTESHSMCLPPIYDHDFVAAWLLLLKSFSRSVSALRTTLKRNKIAQLLLQILSKTYTLTKECYFAGQDFMKPEIMIMGITLGSICNFVVEFSNLQSFMLRNGIIDIIEKMLTDPLFNSKKAWDDNEDERRIALQGIPVHEVKANSLWVLRHLMYNCQNEEKFQLLAKIPMNLILDFINDPCWAVQAQCFQLLRNLTCNSRKIVNILLEKFKDVEYKIDPQTGNKISIGSTYLFEFLAKKMRLLNPLDTQQKKAMEGILYIIVNLAAVNENKKQLVIEQDEILNIMSEILVETTTDSSSYGNDSNLKLACLWVLNNLLWNSSVSHYTQYAIENGLEPGHSPSDSENPQSTVTIGYNESVAGGYSRGKYYDEPDGDDSSSNANDDEDDDNDEGDDEGDEFVRTPAAKGSTSNVQVTRATVERCRKLVEVGLYDLVRKNITDESLSVREKARTLLYHMDLLLKVK 921 T 0.0017 HEAT_2 unppercent F Eukaryota T 6syi 2 B B PB1-11 DYNPYLLFLK 10 T 0.3 Flu_PB1 pdbhh F T 6sz9 2 B B Q5ZYC7_LEGPH IcmP (DotM) MYIEMAQQQQQSGSDNSMAPVWIVILLFITAYFVWALAHQYIVSFVFTINIWQARLVNLFLNNQLLANQIYLMQTLDPNTVNWDQMVTVMRAVGDYMRYPVICILVVLAFVLYNSNVTLKYRKTYDMKSLRAQEQFNWPAIMPIVKEDLVSQDVNKGPWAMALTPMEFARKYNLLRKDDALLDNPVPGEEMTAGIRRGDAKRVFTMQLGPYWDGFERCSPQAYALSAVFMARMNRDRDAANNILKVLDKTFVDGKPDFSVARPVMKKYQNSELVQEVVAKHAYVLTVIASLLEAAREDGVVPSSEFLWLKPVDRRLWYMLNCVGRQTPYSEVAGPFAHWKAEKEMGRRSLVPMIDEAIRALEIAVKEVRLTPRQMEELEP 380 T 0.031 B277 pdb F Bacteria T 6sz9 4 D D Q5ZV91_LEGPH DotZ MDEIKKDDELSQWLSTYGTITAERILGRYNISLPQDEILEAINIPSSFYRHLLQIPLKNVLNGIVIQQASDYHVYAQKLLIDYLLSGESSKEPDSQGAGTRESLEDERQRLVQLGDEFHKLELEQDNLIASSQASLMKISIDWNTKLETTLSKLNSLYKNTNSKIKKNAIRKALIKAFIHCDLVKDQSQKNKYQLIDKLNQTLAVSVGAELKESILTNLSELFQILEALNTKLDEFTDRTNHLSQQAKSFRTQFYEVILRIIELIKLLPEYKIDPAQDAINREPLYFDRTIGER 294 T 0.0097 EAP30 pdbpssm F Bacteria T 6sz9 5 E E Q5ZYR7_LEGPH DotY MPKYTLPTRDALLKAMQVGETSIEAAEYMATRFEQILTKAKLLPECNDMLEKIKEYAQFVKFKLLSSAQVWSGQERPTSDYQNTQENKAEFLASHLEGLPSGLKLEVAIGDDAKILRGFSSNGKMVEGDQLKTMDGLLEGWLAKNSLAISGGAVVKIDNTGNQTKVDPQEIRQLINDSEKGVAKYFADKGVGMEVAQRTYQEPKALETKREEIRQEIESGAEAPTTQSIR 230 T 0.019 GPW_gp25 pdbpercent F Bacteria T 6szs 55 CB y YQKK_BACSU Uncharacterized protein YqkK MAKSQAKKKRGHRLRNGGRDVLLSRGSTPSFSTHGRMTKSKKEILNKRKHKNPYDHTAVDDKDFFVPQKAA 71 T 0.068 DUF5988 pdbpercent F Bacteria T 6t2d 2 B B Stapled peptide GAR300-Gp LTFEQYWAQLESAA 14 T 0.7 PBP-Tp47_a pdbhh F T 6t2f 2 B B MDM2 in complex with GAR300-Am LTFDQYWAQLDSAA 14 T 0.15 PBP-Tp47_a pdbhh F T 6t33 1 A A F8QV07_RUMGN Ruminococcin C WGCVCSGSTAVANSHNAGPAYCVGYCGNNGVVTRNANANVAKTA 44 T 2.6 EPV_E5 pdbhh F Bacteria T 6t3o 1 A A MYOM1_HUMAN 190 KDA CONNECTIN-ASSOCIATED PROTEIN,190 KDA TITIN-ASSOCIATED PROTEIN,MYOMESIN FAMILY MEMBER 1 MGSSHHHHHHSSGLVPRGSHMKSELAVEILEKGQVRFWMQAEKLSGNAKVNYIFNEKEIFEGPKYKMHIDRNTGIIEMFMEKLQDEDEGTYTFQLQDGKATNHSTVVLVGDVFKKLQKEAEFQRQEWIRKQG 132 T 0.00041 V-set pdb F Eukaryota T 6t46 2 B,D,F,H B,D,F,H E9RIY7_BACNA Quorum-sensing secretion protein (processed) MKKINGWIVVALLAVTTVGAAAAIQYTNNADSPGQFQVAQKGMY 44 T 0.0046 PhrC_PhrF pdb F Bacteria T 6t7v 2 B I NF2L2_HUMAN LEU-ASP-PRO-GLU-THR-GLY-GLU-PHE-LEU LDPETGEFL 9 T 0.0068 DUF4585 pdbhh F Eukaryota T 6t7y 2 B B DP2L_PYRAB cPIP motif from the DP2 large subunit of PolD KKRVISLEEFFS 12 T 1.9 Med29 pdbhh F Archaea T 6t80 2 E,F,G,H E,F,G,H SNAT_HUMAN AANAT peptide GSGSLRRNSGCG 12 T 26 YopE pdbhh F Eukaryota T 6t84 1 A A A0QQF4_MYCS2 Uncharacterized protein GHMIDRRRGLGRRRKSWAKSHGFDYEYESEDLLKRWKRGVMSTVGDVTAKNVVLGQIRGEAVFIFDIEEVATVIALHRKVGTNVVVDLRLKGLKEPRENDIWLLGAIGPRMVYSTNLDAARRACDRRMVTFAHTAPDCAEIMWNEQNWTLVAMPVTSNRAQWDEGLRTVRQFNDLLRVLPPVPQNAS 187 T 0.51 PepSY_TM unppssm F Bacteria T 6t9q 1 A A TIM_HUMAN HTIM DPGTHIVLWTGDQELELQRLFEEFRDSDDVLGHIMKNITAKRSRARIVDKLLALGLVAERRELYKKR 67 T 0.0043 Myb_DNA-bind_6 pdbpercent F Eukaryota T 6taz 1 A B TIM_HUMAN HTIM DPSRRAPTWSPEEEAHLRELYLANKDVEGQDVVEAILAHLNTVPRTRKQIIHHLVQMGLADSVKDFQRKGTHIVLWTGDQELELQRLFEEFRDSDDVLGHIMKNITAKRSRARIVDKLLALGLVAERRELYKKRQKKLASS 141 T 0.003 DEP pdbpssm F Eukaryota T 6tb9 2 DA,EA,FA,GA,HA,JA,KA,LA,MA,NA,OA E3,D3,A3,C3,B3,A1,E2,D2,A2,C2,B2 Head spike base Rcc01079 MDVFAKHAVSLESPAVRHYEITPSDSTDLARRPRALRVQTGGTLVLRDETGITVTYTVFAGEILPVRPVRVLATGTTATAVGWE 84 T 0.26 DUF2835 pdbpssm F T 6tb9 3 IA,PA F3,F2 Head spike fiber Rcc01080 MIALGLGLGLAANGGPALRRYAVNGVAPVAVLDFERHFLSHPLALTRATSATYADALRAVQTAPADTPRYDYSTGKRALLLEASATNLLPNSAQFEAASWGKTRASVLANAALAPNGTMTADKLVEDTSNNSHFVARTGTQIAAGTSVTASIFVKAAERRWFALVTADSANAFRTTYFDLQTGTLGVVSQGAAGHVAQIVAAGNGWYRCSVTQTQAASGNFNFYPSVASANGATSYPGDGASGLYLWGAQLEAGAAVSSVIPTEAAAVTRAADLASVAVAAGSYDLRRVDAAGTAVTKGVAHPGGALTIGAGSLYLLSLFPAGAL 325 T 0.012 CBM_4_9 pdbpssm F T 6tbt 2 C,D C,D Apt48 peptide GPHGPRDWCLFGGP 14 T 1.5 Prog_receptor pdbhh F T 6tcb 1 A,B A,B Q9I0B9_PSEAE Uncharacterized protein PA2723 GHMDELFEEHLEIAKALFAQRLPYWCDVFLRPADQAFNAYLNARGQASTYLVLEGFDPVYVPRGCDLDAVRATARARARLREAGLGEDALPVLL 94 T 0.63 DUF2992 unphh F Bacteria T 6tcj 2 C,D C,D Hybrid BTB-binding (HBP) peptide PGGFLCWDGRSIHEIPR 17 T 1.8 DUF1996 pdbhh F T 6tdd 1 A,B A,B Q0B304_BURCM Beta-ketoacyl synthase GPGSMNKPTSSDGWKDDYLSRLSRLSKNQLMALALKLKQQQLEQG 45 T 4.2 LIN52 pdbhh F Bacteria T 6tdm 1 A,B A,B Q0B308_BURCM;Q0B309_BURCM Beta-ketoacyl synthase,Beta-ketoacyl synthase GPGSYDAALPIDELSALLRQEMGDDGGGSGGGSMQDIQQLLAKSLTEIKRLKAANQALEQARRE 64 T 0.00011 Docking unppssm F Bacteria T 6tdn 1 A,B A,B Q0B303_BURCM;Q0B304_BURCM Beta-ketoacyl synthase,Beta-ketoacyl synthase GPGSYAPLDTELSEIEGLQDDDLAALLGKEFIREGGGSGGGSGGGSMNKPTSSDGWKDDYLSRLSRLSKNQLMALALKLKQQQLEQG 87 T 8.5 DUF4266 pdbhh F Bacteria T 6tdu 2 B,T D,d ATPTB6 MTHAELHLFDLDEFMQTYKRLQTRQDWLIENKCKKSRLFSYVAAVIAFTVGKSATMSDEAILAKIDPYVTSEVRVQRGAWWRSGYFTKEEVEMMTPKGPIARYYKFLLGVRRFPLKHGALSWACGFVPAWLTFTSLNHWAQNRRLNRYLTQESVFGEMARELVRGKTADEATTSVMARVEKEILGVH 187 T 0.25 Phg_2220_C pdbpssm F T 6tdu 3 C,U E,e ATPTB12 MSSYTGAALAPKSERLRLAFEEKQKDHQKCIEEAKGKGLKKDELIDACAWTHRKTILALKDWFAYRPPFQDRRSKWAEYCSIRHDSGSWLGWSQKFF 97 T 0.074 Cofac_haem_bdg pdb F T 6tdu 4 D,V F,f ATP synthase subunit a MLNSNIYIIIYGGIIMYSIMIIIQMFLYNFSNKIYIEVEINKYILSKNNIDIYWIICNCTIIIIITTLNHIINKIGIYNMIEYNICYWLIGTGLGLYISPFIVFGYKFFVYIMDLNNYSLNIYHNNNKMNDIQQIYNGTNYNDTMIFFIKDINNIFTIYRSINFFMNWLYQMIYYGVRMWLVFVLHSFSLGSFGELITVITDNNLIFNVFYIGLLGLGFILYLIVIFYLGIQIYVYISFSLSFLHSTILLFLVNYIPHYNNKSIFNTFTNKSIY 274 T 4.4 DUF4514 pdbhh F T 6tdu 5 E,W G,g ATP synthase subunit b MPSTSPADKDVPMSILHTHGLSYVNWCMSLAPGLLVFEGFFRARYYRSRVPPSRTVLMNGLKMRMFSLARQQAPKIVHKPVLSPIPEHLRLVKNVAQVQIDMLKLLNAQAAK 112 T 0.097 TMEM33_Pom33 pdb F T 6tdu 6 F,X H,h ATP synthase subunit d MMRRACRIIRPSHVRGVSGVAPTIYLRSKAALPATSTTDVRPQLYALQRFAKAQLKTATEAERAAIEADIARYQEYLDSDLEKLKQDVAEDTAKKQKLIPLLDRYPDVPIEKIPEHANVLLKKIDACLEILSKDIGEVTDAEAHEMYFETSKFQILHIYTGCVASFPEGDVPPGAVECLPGQVIRTKVNGEDVMLEIDEVDPGYQVCWFKPDVPLPENAEILWSYPYEPTAALPTGTTWEEGQANVLIPAEPTPEAAVWPPTPVTNVYAPMAEKLALKSNPELKVLFKEALLQPAKLLPLDVDYQCSHDREVVEAKRDRYLTALVEAEQAPPLPFTPDVLQLQLEHNVLKGELIDRLRALEYTIVTEQLQARLHERRLRGDVIDEWEELDYHPLVRDDTYLAIDFGDPTFGRYIWKLFPHTDGDEECMFKDTRLDVLPPQVNPLNAILAQHTAQTPVHRSLEKRLWTEVRATAVSE 476 T 0.016 Apolipoprotein pdb F T 6tdu 7 G,Y I,i ATP synthase subunit f MAPLYPVLSQASLYKRHFFKNIKLFHVVFYVGAPCVTFGTAAWSGSNRNSREAIFMVIEERHGWDNFKKLSSHQQGVIMQEAAQESLLARNKGELHLP 98 T 10 Arm_3 pdbhh F T 6tdu 8 H,Z J,j ATP synthase subunit i/j MVYTNWQSSYTRLFVSKPWMWHPLAWMTLSVGIWWKFGKESLCNERSFYIHTHPKWAPHKFHTVYNWSRDPIKWTLAEQYASIIRNTNTDIEAVLKIKLPANAN 104 T 31 Spectrin_like pdbhh F T 6tdu 9 AA,I k,K ATP synthase subunit k MAFGRTRPTLSSPLVPVWNDLRALQVFTSQEYMQKRGPGFTNTLEYKLSCLNPVKWYDMMKVMPGGKAFVGTALGLALFGGWGVEFVKNISVMTKEKPPIDWNNEKLGHLTRS 113 T 9.8 Mt_ATP-synt_D pdbhh F T 6tdu 10 BA,J l,L ATP synthase subunit 8 LIPVSLVDLININIIFYILLLYTLLLFFIPLFLASINYTYHYIYKYYNYNYNFINNN 57 T 0.86 UPF0542 pdbhh F T 6tdu 11 CA,K m,M ATPEG1 MSLAKVWMYASWIPRGIPKAMANELSSAAAALAHPEAIARVAQLESQGKNPYRVARAEFWQMYLACWPYRFRNTVVEWETCKAKVLKGSVDLQDIVDLLYLLAWAYLFWILGEIYGRGSLYGYRFDGEIHRQEAQNVILYKEKEAQEMAVVMEKLEKEIQEWLKTMEQE 169 T 0.015 ATP-synt_G pdbhh F T 6tdu 12 DA,L n,N ATPEG2 MPLPNAVVQGYTSVRGPKRPLDHFYGRTPLNIDTLWHWVKFPHRYDNLRFAVCFWAFLVSAHFANKKQRNLRVEWEKNMEIQKKLHPSGLWSEEQAFAAAEKLGRPKAGHPMRVFEDGYQQFDLKPKLFDPDEEAHH 137 T 8.9 MF_alpha pdbhh F T 6tdu 13 EA,M o,O ATPEG3 MADHNKKDVGSWASPNEHLMFFDFSSWLLVDFGKRWERWVSFKKSFLTTTRSPYWSPQFFLLTFFQLRNSNVKLCENWNWAPKGDDFNLLHNSAAEPFGRDLKAHLEREAGAKHHH 116 T 2.4 BNR_6 pdbhh F T 6tdu 14 FA,N p,P ATPEG4 MGGDAHAAPAEKPDPALDATKALPKALEEVEFFQSYAVRRKTGFHLFNRATGSPTIVGPMFYNLYNFVRIGRVSKYVCWLSLPLVFQRMWMKNRATGMEYDIDLENYAPFEAKKNPMHGH 120 T 19 DUF3274 pdbhh F T 6tdu 16 HA,P r,R ATPEG6 MFGVTRKLLGELSEYVEVNEKGMPKPQALSLWNMPYAKRRALTKFARGVRWQFIVLFIALYNFKNRDDSHLLRRGAYN 78 T 7.5 DUF2845 pdbhh F T 6tdu 17 IA,Q s,S ATPEG7 MMRISRKLLVPVANFRPKKPWDGPWGIQISQKKDRPFIAMWILFPLLLVDHLTREYYAYWHSSKVPVTDVFGDF 74 T 17 Mem_trans pdbhh F T 6tdu 18 JA,R t,T ATPEG8 MGGKASEAVTIAFRFPHRTTFLVKQNVGQKLNKGHQTFWQLVAGGWLFFLLINRTSFKPKLAAPKV 66 T 2.8 SCIMP pdbhh F T 6tdu 26 XA,XB AN,BN inhibitor of F1 (IF1) MAAACAVRGFTTARPMLTPNKVKVPGRKPQDEEDLTWAEADRKLTPEERYARDKQMALLDKMTSQVEELEKSHTEQKKSNKGVKAQIEAISRQLEALKAQLKE 103 T 0.027 FlxA pdb F T 6tdu 29 JB,JC C,c ATPTB4 MFRGFRPVLAADAVKFQTLYNVLTGKQHLKDQVPVKDCNLTAIFGASWKADLNKWFDSEYAPKLPAAERDSAKKSLDLYLKRVDLTRYTREELTTYGILACGPGKVDALTEKHLLETGKARLEELTAGLGNKDEGVNAFRKEVEQEGKYANWPAEKSKALADKVIAASP 169 T 0.14 Hydantoinase_A pdbpssm F T 6tdw 7 G T subunit 8 LIPVSLVDLININIIFYILLLYTLLLFFIPLFLASINYTYHYIYKYYNYNYNFINNNT 58 T 0.9 UPF0542 pdbhh F T 6tf4 1 A A Q9KTB3_VIBCH Transcription/translation regulatory transformer protein RfaH GAMGEQLKHATKQLPEKGQTVRVARGQFAGIEAIYLEPDGDTRSIMLVKMISQQVPMSIENTDWEVT 67 T 2.8E-05 KOW pdbhh F Bacteria T 6tfl 1 A,B,C,D,E,F,G,H,I,J,K,L,M,N A,B,C,D,E,F,G,H,I,J,K,L,M,N B0R5R2_HALS3 RNA-binding protein Lsm GAMSGRPLDVLEESLEETVTVRLKDGDEFTGVLTGYDQHMNVVIEGEDTTIIRGDNVVTIKP 62 T 0.087 LSM pdbpssm F Archaea T 6tg8 2 B PPP VAL-ILE-ASN-PRO-GLU-THR-GLY-GLU-GLN-ILE-GLN VINPETGEQIQ 11 T 0.17 2C_adapt pdbhh F T 6th1 1 A R O57046_9BETA Immediate early protein 1 GPLGSEQQPGDRCPRHVARIIAENDPPIRCDLTLQELLSEVQVDFEPSASEVVAMEGLMDEQHFIPHDPHSKKAAVQSLVIAIKTADLLLQMIHENVKRDIRTTCIQMANESYARADIVRDSLIAASQGKYTALGKIVFHSYTNFMPVNANESEKRAWMEMLGECTSHGNKLCEMANAQVEQETRDIINIMFKNIDDVVTQTTRAMRGVFDPPDTVKALSAAAQLIRVWEHDNVINDQSVSTSSVVTAALEANENLAKALRDVSGYAEVQFNRLCLSILTSAKERIDIIYHSARSQHLACNVRMNVAQQNLATFILTNARERPNDAVIRTRRAVANTGILLFTGQHITRDALDKAAESKSVEEIVGMS 368 T 7.1 Herpes_IE1 pdbhh T Viruses T 6thh 2 C C M9U4Y8_SULIS CRISPR-associated protein, CscA MRNLKRIVMGENKLIGLVRTALDSITLGQGVNEAKIKSPQSYAFHTISVGTISLDICKAIYSSSEIGRKQLENLSKKYNMPFEDLWFYGGFLHDWNKLSGKEESLENKEELTKKIIDKLKLPNEFLHGISTMAEGHLPDNLHLPLWVSIKLADMLLISDIGSVRDVFYFANSDSYRNAIEALKEYNLELNYVSSTFRLFTLIASKELLNDVFNEKSGYFPLISYADGIVFLKRKNSQPVLLSKIVDLLSRQVFSSSSEVIEEKISDIEKCIKNKEELFRQMNIDVKSAIYDEEGKVKQINAFLPTKVCKPFEDVVGNLDNKSKLQVAREVIERNRKDIPFGLLIYFVNKFSKNEEDYIRKGLGINEKSLKYLLNIGDVQKALDKILELLEKRYAEQSSDKTLLYYVKFSSSGNIIDDLPKITDRPNDYCVVCGMPIYSSNPVRFVQYASELGGRAEIWIPREKALDEIDNVRDDWKVCPICIYEANLMKDRVKPPYFIVTFYPGVPISLLNIIDFDFSQSSIKYYIDEEKDTYFTAFEKMGGRLEPYVKKVLPAYFSSKVIIKASEVSNFSLSTRLSKSELNKLLPYAPMISMIFLTSPVLISSNLYEMPIAHERVISITSTYNYTFMKSLNSNLLTLYSIFAYSAKYDAMRKICGRSDLDNCLGYLTEEMDLYSSVDPALGVLSIGMGVGTPIDTDEKFFSAFLPVSGYLLKVTGKVSKMGETLKSSIFSIAYALKDIIKSQKVSKYDVTGFLRDGVDMFFKTTSVIKDKEDRIGISVNAAISSLENKYALDDQHRAQVYSALQDIFKTLYSIEEESDRSLAISIANTLSNWLYIAYKLVLQGDKSLEHHHHHH 855 T 0.057 DUF2225 unphh F Archaea T 6thl 2 B B BCD1_YEAST Box C/D snoRNA protein 1 MRDSTECQRIIRRGVNCLMLPKGMQRSSQNRSKWDKTMDLFVWSVEWILCPMQEKGEKKELFKHVSHRIKETDFLVQGMGKNVFQKCCEFYRLAGTSSCIEGEDGSETKEERTQILQKSGLKFYTKTFPYNTTHIMDSKKLVELAIHEKCIGELLKNTTVIEFPTIFVAMTEADLPEGYEVLHQE 185 T 0.045 MobA_MobL unppssm F Eukaryota T 6tid 1 A AAA B4EH86_BURCJ Lectin GHMPLLSASIVSAPVVTSETYVDIPGLYLDVAKAGIRDGKLQVILNVPTPYATGNNFPGIYFAIATNQGVVADGCFTYSSKVPESTGRMPFTLVATIDVGSGVTFVKGQWKSVRGSAMHIDSYASLSAIWGTAA 134 T 0.002 DUF1543 unppercent F Bacteria T 6tip 2 C,D P,Q Strep-tag II peptide SAWSHPQFEK 10 T 1.8 PqqA pdbhh F T 6tj1 1 A,B,C A,B,C De novo designed WSHC6 MGSSHHHHHHSSGLVPRGSHMTEDEIRKLRKLLEEAEKKLYKLEDKTRRSEEISKTDDDPKAQSLQLIAESLMLIAESLLIIAISLLLSSRNG 93 T 0.01 Halogen_Hydrol pdb F T 6tj3 1 A A Q8IJM4_PLAF7 PfELC MASDMEEKFREAFILFSSCSDHIEMYKFFELMNSFGIILTNDEKAALPNDINMDYWLNFAKKHYNYEQPFKHIN 74 T 0.024 Na_Ca_ex_C unppercent F Eukaryota T 6tj4 1 A,B A,B Q8IJM4_PLAF7 PfELC SMASDMEEKFREAFILFSSCSDHIEMYKFFELMNSFGIILTNDEKAALPNDINMDYWLNFAKKHYNYEQPFKHIN 75 T 0.024 Na_Ca_ex_C unppercent F Eukaryota T 6tj5 3 C C MYOA_TOXGO MYOA,TGM-A GAMASSWEPLVSVLEAYYAGRRHKKQLLKKTPFIIRAQAHIRRHLV 46 T 0.00015 IQ unppssm F Eukaryota T 6tj7 3 C C MYOA_TOXGO MYOA,TGM-A SSWEPLVSVLEAYYAGRRHKKQLLKKTPFIIRAQAHIRRHLV 42 T 0.00015 IQ unppssm F Eukaryota T 6tkg 3 C I TTI_GLOMM Tsetse thrombin inhibitor MKFFTVLFFLLSIIYLIVAAPGEPGAPIDYDEYGDSSEEVGGTPLHEIPGIRL 53 T 0.048 Cytochrom_B558a unppssm F Eukaryota T 6tkl 3 C I TTI_GLOMM Tsetse thrombin inhibitor MKFFTVLFFLLSIIYLIVAAPGEPGAPIDYDEYGDSSEEVGGTPLHEIPGIRX 53 T 0.048 Cytochrom_B558a unppssm F Eukaryota T 6tkt 1 A A Q53805_9ACTN Pre-phenomycin GAMANPKTIKAAAYNQARSTLADAGSRTAAKSHPIHGKTDVPVSYGTSLLAAARDEFRQADKKLPAKDKKSDMSIAHYNAVHSAAKTMGIDTW 93 T 0.85 DUF6388 pdbhh F Bacteria T 6tlx 1 A A A0A0L1KLX8_9EUGL Protein kinase GSSQKFSTSPSPTLDDGLDRIKCPKKHGMKLLRAFPKLNDTAGGTSDYGWGFWCDRCHKEVPALIKSKKRISKAQDERTHAPEENTFFYHCHCGYDLCKACGASIIHASNTLKENYSTELKNLAACFSTPS 131 T 0.0031 zf-RING_4 pdbpercent F Eukaryota T 6tmg 1 A,Y q,Q A0A125YPS4_TOXGG ATPTG11 MVRNQRYPASPVQEIFLPEPVPFVQFDQTAPSPNSPPAPLPSPSLSQCEEQKDRYRDISSMFHRGVAGAEQVREAYNSMAKCFRRVSVAEVLESDPAFRQARNFTMDLKQAEDDQRYKQLQYGRVPSILTKYHL 134 T 15 Chor_lyase unphh F Eukaryota T 6tmg 2 B,Z i,I S7VXW3_TOXGG ATPTG7 MPSSSSEDAQGGNRFECVSNSTSPRRKNATKDEAACLQPRRSAVSGPREDVLCIRTTPQPHVRRGKSGPGRRKRMRFGRERERRDKKRGEGERKRTRFPFLRLHIEGGNANSRRPLCFPSRHSLLRNHYGSLSMAFRKVSPPKAPMSVFEARSSFLDLEQCARAAGPQRWEAECQGVRQRALQAAADVMSRECGAYGDSFFQCYRHGFRLEACQGEKATMQLLRCQRMVADRLVPL 236 T 0.068 FokI_N pdbpercent F Eukaryota T 6tmg 3 AA,C T,t A0A125YLH9_TOXGG ATPTG14 MPAPAASGAAAVLSKDIARSFRWMQAFAAVKGKPTAGSCAAGTAVVNPEDPTKVTLKGRYTNFSLQHIWEKYDYLQTHLLLRECMLSQVAKNPRLLDPEINAGLTPTVFMRVPPETQDPETQAKAAPQKGQAN 133 T 0.16 DUF5106 pdbpercent F Eukaryota T 6tmg 4 BA,D G,g S7WD71_TOXGG ATPTG5 MQNGVFTRENADFLVKSGADSPSSQSLLLRTSPSPLSLPRRRFIFLRSASVDLSERSSLACLAPFFCLASGVCLRSAFSLPFFARRGRPCLFFIFIFFFRVSFTANFRGKRVKMAASTIPISQWPSLLYAPPSSPANPAVEALPEMQFDDLHYPRQMLLCRGAGYSLEQCNRMAQPDARVTPENPAEKLLKEEAVAAIACLSQREGGKDEQCRYYIERMYKLANKEKQPEPGTLSKASTLACKLLGIHRPEA 252 T 0.027 CHCH pdbpssm F Eukaryota T 6tmg 6 DA,F K,k A0A125YSI9_TOXGG subunit a MAAGSRFPFCTAARLSSRGTLPRLGEATFFAGAESQRSAGAFAKTLQRPFLRAPSTQLFPVGNRLGVSSARALVANAMEPRRFFAAAASAKATHALQPTGTGSVAFTRPGQGSNAQFQTSLADKTRGLLGVGFLRPTKMASFAATFLLNFRFYFMYMARTTFQAVRPLLAFSVFGEVMKLVLATMSSGLFSFLFSFVLAFEVFYFFLQCYISYTFLTMFFTVLF 224 T 53 DUF5090 unphh F Eukaryota T 6tmg 7 EA,G J,j S7UQ82_TOXGG subunit i/j MGLSPAFAATAGCRLASPVANSSRFLSLLRLSRPRLNAAAPAAEAAKTLERNVPMKEILQPLWVVEPPNFLRQPVWKQFWEAQFANRSFFFFGNAWTSAAAFAFFIWWSRVFDPPPKERLDRYWLNSPKFRILSAFHNPGKRPGLKISLMTYEARYCYRGLDHPFTLNEMKDFLFKLREQYLVNKYEGIQFPFVFRQFNRVSTPGTLEVHTSPALQQQPHFHEEAAGHH 229 T 3.7 YokU pdbhh F Eukaryota T 6tmg 8 FA,H S,s A0A125YLN4_TOXGG ATPTG13 MSWATRLLRMSSPRLGLLPLGRSVKLGGAKERVSFSQFFDSEYFWTKANVGPFFLFLFTSPFWYQGIKTVYASCRYRKLNEREIISDRYTWLHERMLEDEVERVLLEQVPAGGFDKTRPGLLLGPSTL 128 T 0.24 DUF5378 pdbpercent F Eukaryota T 6tmg 9 GA,I U,u A0A125YRP0_TOXGG ATPTG15 MATPPLQDGAPTNGGAATKPSCGARLQNFARMAIKGPSVPHSILFGVGAGCCAYAGYYLYRAMRLTFFDTESVALQSRLRYAEKQKLFHQELDRELAAGHIASLVAEYDPVATRLPFQPMQDRYRV 126 T 3.2 DUF3067 unphh F Eukaryota T 6tmg 10 HA,J H,h A0A125YL08_TOXGG ATPTG6 MAETREGGQSGAASILGAEAFPELLSKVPLNPQMDEDKHFNKYKWGNEPIPVNRRTGSRMNSSIYDNRNHEAVRHPWSTDARTFHPNDNPEADRINTQYSNMVSDSFPEGGFSDAPRFSSNWERLLAYHHGLYSPEKFNSTTKTADEIRLAVNDFAAKVHADDPKNACKYLMIEEFKCLQSAQARIDPQGAATKCVKWFNEWRQCAWDQEKMVKGYNYIEDRRARKHKPYIGAPDLQYS 239 T 1.8 DnaJ pdb F Eukaryota T 6tmg 11 IA,K E,e A0A125YLR0_TOXGG ATPTG3 MGEKQEEEGEEEKEGKGEGGGEGGREEEDEDGSAPVVSWLRIVEERECHEETDEAPETKIALPFSAQRSSRGFEARQVEVLVSANSAFLLSVLLASLFLSSSLPSFCPPRFLLSVLLALFKDKMAGDAPAAAAAPQQAGRTASASGVRTPGYLDLVGHSLKATSMDHGMQYSSIYWETSHRTYLPFWASLTQKFSWKIMDDQIRSFLRLPKPVTTEPFVFSSGSPYIRRYFGDADISVPVPLHAPAHFAFVPTGTVSPWEETGMETGPQGAAARGAAATAFRAVLESAWKCDIDEQIKEKLHSRAGAGAFHASGSTGGCPIPTDF 325 T 4.7 DUF512 unphh F Eukaryota T 6tmg 12 JA,L X,x S7W180_TOXGG ATPTG17 MSTSPGLAFANLTLLLDVPQLPAIWAVNAWRELNGLFTEMKTLAGTSDLLYPSNRYNPQNEKTNRMGRPRKYNHGEWMFGNSY 83 T 4.8 AT_hook pdbhh F Eukaryota T 6tmg 13 KA,M B,b S7V2T0_TOXGG subunit b MNFSSSARWLAVRQSQTLGHTTRATVAAGRRVLAHSPAATEFTSFQSLHIGGDVCKLPLAVALGAAPSALGYGSAKHNQQRQYATLGSGWSFSKVQYTKYRITKPWTTDTTFDDIILSQPSKEDFAKFTKEAPLFLRFLKLVTDVEGRQEAFIQFAKRCENGLTVEKDVYVTKKELVDCLWKNGYTDTEINAFEIAFPADYKFHYPELAVLFDLTEEDCYKYCIRQRAATPEELVELKYTKPKNLVSSYGLCFLGVWFGLSNTVLSNAWFYSKTFPFGAVFYMLGSYFYRDIREKLWKEEKSLIHTAQENKNMGEESVYKQMKKYATDTKCLDYLSTFRTEVEDQIANYKVALVSQMRRQLTERLVEKLNGIQQAEKLIQGSLQDVMIREIVSSFKDLYKSRPELHDAAMQSAIQGLSGSDGAMDPVGAHFKASLQELAKVNLSTATADPMGTVVQRVAAVFQKREKEFLDTFTVKATEAQEIKTIVDKCHKGNTFDFHALSDEELRRLEQLYSTVNNRVGFETIHENSIKPVAPLSENSKGFVEFVNTQLEITKAKLRNARLTAFAHAFV 571 T 0.00013 Mt_ATP-synt_B pdbpercent F Eukaryota T 6tmg 14 LA,N R,r A0A125YKF7_TOXGG ATPTG12 MLNFIPKRCPSVSLLFGKRPVQRIEVGQARHQLEIPVETIEKIYEGVDSRLEYHNKDYNAMKWKDFMKLKLDAYHLLEASQSETAAKSALSDLNWFSDLADIYSGQQTMAEMDVALKAQGEQKLSYPIQGKNIK 134 T 12 DUF4416 pdbhh F Eukaryota T 6tmg 15 MA,O P,p A0A125YMA7_TOXGG ATPTG10 MSPPTASASVASSGSSPHMDRLLGDLKLLAAYDSAAGWQEPKAMESAFQSLSWDDADVLKALPQYLNCRGEQKRRVDFAYAALCPRPVDEKDPKQTLMSLWMKARLFSYDQKHPFVLSPFAATDKSTSAGAMTAEKPF 138 T 9.6 DUF6103 pdbhh F Eukaryota T 6tmg 16 NA,P V,v S7UQT7_TOXGG subunit f MGFHFQQYIAMAGRAINPVQWTRAWRRMEGKSATEVYRDALAWTNNQFAQISRASQYRAWWWQNPLGMGLVLYGTYKAWHMIYMVRKQKKTAQLVAAAYGQGGQWLNPVPR 111 T 0.11 DUF4468 unp F Eukaryota T 6tmg 17 OA,Q L,l S7W7F1_TOXGG ATPTG8 MTALPPPPSANVAVSFTAAPAEPLSRGEVKAASLKLELQNIERELKDWWMSRKILRDRNIGLFNLLQHHNFAGLSVNNAKLSDSQRVMWTDLVQGKPDVEDKLSVDAREMKVDMYEKLFKQAADLENPCRMPGVAYLRCLRDTLTETQSARRSSCLNAFSSFDACRTGLLKQQSAAVENSLVRQNMADVRAKALFERRAVLLDLVEGK 208 T 0.12 CHCH pdbhh F Eukaryota T 6tmg 19 QA,S D,d A0A125YV76_TOXGG ATPTG2 MSPVGRLFLGSKLPAQTWQSFRLQPALPQFAQKRFFSGGAAKPSWHVAREHRFGPTLPDHAYYGEHATYNYFVLFIRGMRPYLEKIFGDCASTIKNAAVAVYRPVNAFVVKHNPDLRLQFVAFASFIATHMAITKEFNDMYQRLVDITSLLELQAAQLHASEGFWDSESEQQEARLQRHAEHRNDLETTWEEALREATLARNFDVLVSYLNHGTSDGCGEHGACGHSGQNGIPPSVTWNFNAMPYGKENPDTKTFPIPDHEQPYRAFSLGFTANNLSGNWGDYIDRQDNKNALMRPARMMFTDVFIPTTK 310 T 10 PerC unphh F Eukaryota T 6tmg 20 RA,T M,m A0A125YPQ4_TOXGG subunit 8 MNTFFLTPAAAAARRVAVSFFARSSASGFPQHRVALRPFPSQRPAERAHNLAKSQTLRSVKAHGRQSGKKEQSTESGGRRGFRAAVGAGTGCMLAASPMLFTDYDNTASPKSELIFMAGNALGYCTERFFENEYGQSIFMFALGLAYLAMLGHEGKIHGAVWRMKHLFATNFKMVGHPRYAYALPKNPLLQDAAPTKTGSTSAKK 205 T 27 TMEM132D_C unphh F Eukaryota T 6tmg 23 UA,W W,w S7VTI0_TOXGG ATPTG16 MPFMWRQRAYCAPVPSAFASQQPNGLGGEAGVRKPLLRSNSESLSVFSQIPDGLLGHTTSVTMGNSDIFFLPKPSNLLKIALPAFVFMPNLTIFTRAFPFYAHTSA 106 T 7.1 DUF3561 pdbhh F Eukaryota T 6tmh 1 A i A0A125YJP2_TOXGG Inhibitor of F1 MSSPCCVAIRRVARTTLESGRRQVDSKSTDVSPFFTGTQQMSLPSAGMVTKIRNFSSVKFMDQKRSGEETVYFKKEDEALLRNLLANHPEYDPKYSVDHMNAEVGSIARDITLACQKHGMKDPSAAFMKDLISIFGAHGYAKNSK 145 T 1.4 DUF3223 pdbhh F Eukaryota T 6tml 21 EE,GD,QH,SA,SG,U N8,n8,N9,N7,n9,n7 A0A125YUZ2_TOXGG ATPTG9 MSGDSVAPHQRAACEQLHSEYKQCLAKNGRTHFSACTDFHSKLRACENMLGTSYCIDEGINLMKCTKNPDPSFCAKEFVAMRECNRPQGPHLVLSSSPSSPPHYELRPEVKHLYNVDSTDLGSAVAPVRSKEQLDRVADSLKADLNLPGYGHIPYKWESLRPNPGA 166 T 0.0045 Cmc1 pdb F Eukaryota T 6tms 1 A,B,C,D,E,G,H,I,J,K A,B,C,E,F,D,H,I,J,K a novel designed pore protein TEDEIRKLRKLLEEAEKKLYKLEDKTRRSEEISKTDDDPKAQSLQLIAESLMLIAESLLIIAISLLLSS 69 T 0.1 Matrilin_ccoil pdb F T 6tms 2 F,L G,L a novel designed pore protein TEDEIRKLKKLLEEAEKKLYKLEDKTRRSEEISKTDDDPKAQSLQLIAESLMLIAESLLIIAISLLLSS 69 T 0.075 Matrilin_ccoil pdb F T 6tms 3 M,N Q,R affinity purification tag HHHHHHSGLVPRGSHM 16 T 4000 zinc_ribbon_2 pdbhh F T 6tnh 1 A,B A,B G3FFN6_9CAUD Adenylosuccinate synthetase MGHHHHHHHHHHGLVPRGSHMENVDLVIDLQFGSTGKGLIAGYLAEKNGYDTVINANMPNAGHTYINAEGRKWMHKVLPNGIVSPNLKRVMLGAGSVFSINRLMEEIEMSKDLLHDKVAILIHPMATVLDEEAHKKAEVGIATSIGSTGQGSMAAMVEKLQRDPTNNTIVARDVAQYDGRIAQYVCTVEEWDMALMASERILAEGAQGFSLSLNQEFYPYCTSRDCTPARFLADMGIPLPMLNKVIGTARCHPIRVGGTSGGHYPDQEELTWEQLGQVPELTTVTKKVRRVFSFSFIQMQKAMWTCQPDEVFLNFCNYLSPMGWQDIVHQIEVAAQSRYCDAEVKYLGFGPTFNDVELREDVM 363 T 2.9999999999999996E-68 Adenylsucc_synt unppercent T Viruses T 6tnq 2 B,D,F B,D,F DLGP1_HUMAN Chains: B,D,F TSPKFRSR 8 T 0.77 ArfA pdbhh F Eukaryota T 6tob 1 A A P71658_MYCTU Integration host factor MIHF GSHMVALPQLTDEQRAAALEKAAAARRARAELKDRLKRGGTNLTQVLKDAESDEVLGKMKVSALLEALPKVGKVKAQEIMTELEIAPTRRLRGLGDRQRKALLEKFGSA 109 T 0.00038 Ribosomal_S13 pdbhh F Bacteria T 6tq0 2 B,D,F,H,J,L,N,P B,D,F,H,J,L,N,P DLGP1_HUMAN repeat peptide 5 from GKAP MPGCFRMR 8 T 0.36 ANP pdbhh F Eukaryota T 6tqs 2 G,H,I,J,K G,H,I,J,K OSBL1_HUMAN OSBP-RELATED PROTEIN 1 GAMRSILSEDEFYDALSDSES 21 T 0.0027 DUF4298 pdb F Eukaryota T 6ts3 2 C,D C,D KCC2A_MOUSE ACE-ASN-ALA-ARG-ARG-LYS-LEU-LYS-GLY-ALA-ILE-LEU-THR-THR-MET-LEU-ALA-THR-ARG-ASN-PHE NARRKLKGAILTTMLATRNFSG 22 T 12 HycH pdbhh F Eukaryota T 6tsc 2 B B A0A2R2JFI5_OMPOL GLY-PHE-PRO-TRP-MVA-ILE-MVA-VAL-GLY-VAL-PRO-GLY GFPWXIXVGVPG 12 T 0.49 dCache_2 pdbhh F Eukaryota T 6tt6 1 A A PD-i6 peptide WXVXEAXD 8 T 0.81 ApeA_NTD1 pdbhh F T 6ttu 8 H I IKBA_HUMAN CYS-LYS-LYS-ALA-ARG-HIS-ASP-SEP-GLY CKKERLLDDRHDSGLDSMKDEEDYKDDDDK 30 T 16 GlutR_N pdbhh F Eukaryota T 6tvj 1 A A PD-i3 peptide LXXRYXDTMY 10 T 0.46 Ima1_N pdbhh F T 6tvw 2 B DDD MET-THR-TRP-MET-GLU-TRP-ASP-ARG-GLU EQIWNNMTWMEWDRE 15 T 0.00021 GP41 pdbhh F T 6tvw 3 C DbD ASN-ASN-TYR-THR-SER-LEU-ILE-HIS-SER-LEU-ILE-GLU-GLU NNYTSLIHSLIEESQ 15 T 7.9 DUF5470 pdbhh F T 6twb 2 C B Double Bridged Peptide F19 XVNIMXCRCPX 11 T 4.9 DUF4668 pdbhh F T 6twc 3 C C Double Bridged Peptide F21 TCVNIMCCRCPX 12 T 5.2 DUF4668 pdbhh F T 6twg 1 A A CRBL_VESCR Crabrolin Plus, mutant of Crabrolin peptide FLPKILRKIVRAL 13 T 1.9 Antimicrobial_8 unphh F Eukaryota T 6twq 2 B,C C,D VE6_HPV16 THR-ARG-ARG-GLU-THR-GLN-LEU SSRTRRETQL 10 T 0.34 FpoO unphh T Viruses T 6twu 2 C C VE6_HPV16 Protein E6 SSRTRREEQL 10 T 0.34 FpoO unphh T Viruses T 6twy 2 C C KS6A1_HUMAN Phosphomimetic RSK1 peptide RRVRKLPETTL 11 T 9.8 CITED pdbhh F Eukaryota T 6txs 2 B BBB CD44_HUMAN CDW44,EPICAN,EXTRACELLULAR MATRIX RECEPTOR III,ECMR-III,GP90 LYMPHOCYTE HOMING/ADHESION RECEPTOR,HUTCH-I,HEPARAN SULFATE PROTEOGLYCAN,HERMES ANTIGEN,HYALURONATE RECEPTOR,PHAGOCYTIC GLYCOPROTEIN 1,PGP-1,PHAGOCYTIC GLYCOPROTEIN I,PGP-I QKKKLVIN 8 T 0.044 RCR unphh F Eukaryota T 6tyt 2 B B A0A1L8ENT6_XENLA ALA-LYS-GLY-LEU-PHE-MET RPPAGASKPKKKAKGLFM 18 T 24 Aft1_HRR pdbhh F Eukaryota T 6tyt 3 C C APLF_HUMAN ARG-LYS-ARG-ILE-LEU-PRO-THR-TRP-MET-LEU-ALA LAERKRILPTWMLAEH 16 T 0.054 PNISR pdbhh F Eukaryota T 6tyu 2 B B CYREN_HUMAN LYS-THR-ARG-VAL-LEU-PRO-SER-TRP-LEU-THR-ALA SETKTRVLPSWLTAQV 16 T 0.14 PNISR pdbhh F Eukaryota T 6tyv 2 B B WRN_HUMAN THR-THR-ALA-GLN-GLN-ARG-LYS-CYS-PRO-GLU-TRP-MET-ASN TTAQQRKCPEWMNVQN 16 T 0.078 Polyoma_coat2 pdbhh F Eukaryota T 6u19 1 A A PSMD4_HUMAN 26S PROTEASOME REGULATORY SUBUNIT RPN10,26S PROTEASOME REGULATORY SUBUNIT S5A,ANTISECRETORY FACTOR 1,ASF,MULTIUBIQUITIN CHAIN-BINDING PROTEIN SADIDASSAMDTSEPAKEEDDYDVMQDPEFLQSVLENLPGVDPNNEAIRNAMGSLASQATKDGKKDKKEEDKK 73 T 0.081 DUF5797 pdbpercent F Eukaryota T 6u22 2 B C SFTI1_HELAN GLY-ARG-ALA-THR-LYS-SER-ILE-PRO-PRO-ILE-ALA-PHE-PRO-ASP GRATKSIPPIAFPD 14 T 0.052 Bowman-Birk_leg unp F Eukaryota T 6u2f 2 B B Organo-peptide PCSK9 inhibitor XWNLKXIGLLR 11 T 6 SARG pdbhh F T 6u3i 2 B B cis-1-amino-4-phenylcyclohexaneacyl-WNLK(hR)I(D-ser)LLR - NH2 XWNLKXIXLLR 11 T 21 RNA_polI_A14 pdbhh F T 6u3m 3 E,F E,F Alpha1a peptide AQPMPMPELPYPGSGGSIEGR 21 T 7 DUF3148 pdbhh F T 6u3n 5 E C Peptide APMPMPELPYPGSGGSIEGR 20 T 15 Pertus-S5-tox pdbhh F T 6u42 8 BN,CN,DN 4V,4W,4X A8J0X0_CHLRE FLAGELLAR ASSOCIATED PROTEIN MPSPAREKLMTIKAMEEAKGRSQHARAPAIFRDTALDTHKSIQPEYFGPSTVPEKKEFSTRLSSGRTRSVTKHQRAAMEALQRTSQMAGQGEVRTVFMPTAEQMPVCAAAGERRGNVANSEWALLDTLEVNLYLNEKDARLRSQKAVQQTQRAILDTQVGMLAQAKLAAETAKAAERVELLATVAAHQAEERQRAEEQRAALTRLRTDREAMLAETRVQREAALSRKREEEAKLVAAAQAQLEADRQAAARKAAELKEQAAKTMADNEARLVARKAAEAAQRVADAETTKRMIEMAEAQDRARDRNMKSFHDMIQARARGVGQKAVDDRRDRLEREERLIAEAERAAAQREAERAAAEAERKARLKSDLVSGNEALKRAKAEKLAVEREAEARERAAAEQRVLAEKEAAERQMAGMRERATATKRFVAGQAAAVAERAKTDDIFMSEQERLLNKRLLEQAVATVQRPMQYSVKLY 475 T 0.043 OTCace pdb F Eukaryota T 6u42 11 KN 5E A0A2K3E5X9_CHLRE RIB30 MLNVTGGRRPVASWRTPPGFLERLADAWPAVLDGAVEQAGGDPARVTRDSFLAALREALPGLSAAEDDYARQVSLSVIQQVRGSNVFFPDLDYLQAALLQGRVPPQELDQPRSTLSLATFTTTTRSGTKSLDLFKTTGVTWKIPKGFLNRYNDCNHEVLRRAAALVGARHDGARDVVAGVWGRVDVPTFVEACRQVLGEISADEEEYLIALASEQVQDGTAYIRDLPFLDKCIQNGKTPTSIKGPELLPSIFLNDTTSGKTDGMTLRHTGGRIF 274 T 0.2 FliX pdb F Eukaryota T 6u42 13 NN,ON,PN,QN 5H,5I,5J,5K FLTOP_CHLRE CILIA- AND FLAGELLA-ASSOCIATED PROTEIN 126,FLAGELLUM-ASSOCIATED PROTEIN 126 MSRSYPGEQVEHAFNSKRLKNWEVPAVDKSQAISTSTGTRFGTLQPRSGRTQFIVDDNGHLKSGVPKLEKSAFNFTQTTPVFMDSAPRWPKENPTWPKNMKATMGYKGIQSNYLPTNTVTLKAVEVPGTTERNFNFM 137 T 6.8 DUF3697 pdbhh F Eukaryota T 6u42 19 JO 6E A0A2K3E1X6_CHLRE FAP143 MAEETQPYTSYNKQDEVPTLIGNWVEERELKELTGVTRNLAASQALKDTSDGTSPTRSLGDALTATHPRVIEHVQAQTHAADWQSTVQATYRPPSDATRNAAAYVNTSKMGPRERMLHEQLMREAQDLPPELQATLTGPAVPVTTASTYGADFHQHDLTGIVVGAKVMKDRDGRPAVRDPTFLAETQMMKKDAADRLMGETARQSGARDTTMLPNPDVPVTIYTEAVANKTYGGVFPGTTTLNTAAPFGKSTNFSKPMSDYSKVVVDE 268 T 0.093 DUF1143 pdbhh F Eukaryota T 6u42 21 LO,MO 6G,6H A0A2K3DZF0_CHLRE FAP166 MSLTLNGLDESMRRMQGYEVTRAPEDVGNSIPNFKEGIFTYKGSRQAPWKSEQTHSFSLPNAYTARVLNGTIVHTGGATEMAITTHHTVERPMMPPGTIRGSTWVKPQYIPTDDPALDELHAVAYVVSPQLPALMDACNSYHLHSADGWITTAGFMTAARRAGLTLSRAEYLALERALTKDTMGRINYLQLEALVQAVTAADQTGEGGAEPAAE 214 T 0.028 EF-hand_11 pdbhh F Eukaryota T 6u42 23 VO,WO,XO 6Q,6R,6S A0A2K3DTN6_CHLRE FAP276 MDLKQQVKNYTMTIRNTRPPTMIKEQDKSEFSHFRALQVLANGDEVPYEATLRNVIHDGARQPKLPPRQTQKHPGYIRNESGGFFTS 87 T 0.092 DUF3337 pdbpercent F Eukaryota T 6u42 26 IP,JP,KP 7D,7E,7F A8JF23_CHLRE FLAGELLAR ASSOCIATED PROTEIN 222 MATNSTGPWATGTFSPNSTGTVTQYNHPMFVSQRLTGNFTSQFEMNSLPSHKYETLPIRSGHLPGYQGHVPGGVGAIAQRKPAAAMHTMTHLATSGSLPKGSPQTDMSLVDLRPEQRSMAKVYMYAEGAKTSFLKFPTPKTFDHRN 146 T 0.74 DUF2475 pdbhh F Eukaryota T 6u42 27 LP 7G A8HSW0_CHLRE FLAGELLAR ASSOCIATED PROTEIN 95 MAAYAHNDGAPDISQAFQNTVLVKNWYEDRFQSQVASATGRTLRELPTHERVVHKAVPPGHPGLFQTTKQAAEEKLLTTPPPAKVKKPSMYTEANVAERLQTYGLADNIHYTIGPNAATEASWAPVHNLTTTNKEFYEIKPEAARAADPDTFRASGPSPFAKTGFCAKSVKGEASDETTVAGGKGARGEITRRPGESGNPYGVSVFVDEYGKWGSAIQGMPLTETRARMQTKYFP 235 T 0.25 DUF1143 pdbpercent F Eukaryota T 6u42 29 NP,OP 7I,7J A0A2K3DZI2_CHLRE FAP129 MVHKGPNQAGNKGLLTYNNAVGIPGYTGFMPSTNALALPVKGFEHTGRPAASAEVEKLTVKSVDPRKTSQYADDYHKKPADTKAFSKTGGGYWISQRVLPPHTAFTATTTYRAETLNAEPNTAAILDRSQGLASTLVGYEAARQAGEVRRSISADPRARAEDTARGIGTQTVLGRPGSGGNTSILATVAAAAPSSPQAGSSVMVSSARRPATVPTKYGELPGYQTTYGAATDKMARMQADNELNGTGSFAPSNMGDPRFKTLPRVMNPGMGRNYSSYVAEYGGDGHDPMARQAANKDTMTRISVTRDLAGGTTRNVSHIPRYTGHIPASEYATPEARAQGEAAEPRPDHKSQALTYTLDQYPRGRLPGYTGFKAQAPANIDAGLKHSMKLPCHSTTSGDATLRGTQFGVPHQDHTHYINSRAGLNSFFSNSVVGTEFVSDNGLFNAQVYYKEAKSQGALGIKTAQPSKLTHYGAPFRAAASMV 483 T 0.0018 SPATA48 pdb F Eukaryota T 6u42 30 PP,QP 7K,7L FAP21 MSLTTQSLRRTNYEAEMTQPQIPPAGITGKLHETAKDALTWNDERPSTPDDIKKYRQSTVHEPGKIVRHPGHADDPVPQGPFGVKSAASGGQNINEALKNYPDSELARWKLEQAEGVYASAQREPLGAGYVRGHRLPEGLGSERPFGVTYDARGKDLSRQAAAVIFPTDRPAEEDAATRAMYTRSHQDFQPGEQRRRDYNWDAAGIDPAQHRFGAVDRNGVGDGVRKALQPGLDPSLQAPKVLPKLHEDFKATATDYLGRPRQLGTGDRPQLAPDHAFGQPSMRKGREPGVGELLTGRFGADEQQPDADLGKSLREGYRNQPKPGDEGRAFGVPTIRTDVRLPRLRSVANACNYGNEPDAGQVLRPPRAADLGISDEAFVALRPKSELRQLVDEAGLALSDADFEAAWALAAEADGGAAAAGEGGGAAEGPEGRACVDTFFRARHHLLAQTLQIEPTF 458 T 0.035 DUF6395 pdbpssm F T 6u42 31 RP,SP 7M,7N A8IXN7_CHLRE FLAGELLAR ASSOCIATED PROTEIN 273 MSILGPADRRPELALTGTTISHLKTWRTEYLDEYSDIKLAAGVPEQRMEMAGITAHIGTITGRHTHMHKETTRLPTGHPPSSTYRAQDAVPIGTMTRGTGTITKLGDSCLYDKEQTWAHWRVAVDGKPADTRRKYRGVS 139 T 0.12 Autoind_bind pdbpercent F Eukaryota T 6u42 32 TP,UP 7O,7P A8JC52_CHLRE FLAGELLAR ASSOCIATED PROTEIN 107 MQGDRWSRNCGSGGVGHSGTVNEYRSGVLIGNFVENAAKTTGRMGETILSHTGPGAQTGIPTTTQKRSYTAEGKTGEYLVEASTRHDLNQPGVKGELLTRHGRFDEPPVQCLGTTYQLTYGRADGTDRRVQSYLWHGRKQVDYFVPHSTGGPSTLSLTARKQQEWGTQGATDAYLTTKMAATQPAALATAENPTRTQTLRPLGDSGLMPQPGQKPKGFARDELDKPHHRTGLRVNYRS 238 T 3.1 DUF1143 pdbhh F Eukaryota T 6u42 33 VP 7Q A0A2K3D7C7_CHLRE RIB21 MDATTKTLKSTTRVDNSTNPNFKHTSTFHTRGQWTPESPPPLTSTYTIFHGERPELPRYVPKYAVSPETAALTSRHGSSPYSFRATAERAGSTPDGRATYRFSGLPAGVSPYSTGTKLSSSTLGSSGLPPVQYKSYLTEYVDEYREPLEQLDTQRSLTLKYGTTGGYRTTQRSTRSDGQPKYQTRVVAF 189 T 76 DUF4851 pdbhh F Eukaryota T 6u42 34 WP,XP 7R,7S A8IPZ5_CHLRE OUTER DYNEIN ARM-DOCKING COMPLEX SUBUNIT 1 MAQKSTLKLPRLRTKEELLKTSPELCKLLGEDSDDGRSMSPFTAPPPAGTVKPPSRGLPAVSTKATKGPGMDTPRGLGEEELTEEELLRLELEKIKNERQVLLDSIKLVKAQAGTAGGEAQQNDIKALRRELELKKAKLNELHEDVRRKENVLNKQRDDTTDASRLTPGELSEEQAYIQQLQDEMKQIDEELVEAEAKNRLYYLLGERTRREHLAMDMKVRASQQLKKDSADDLYTLTAHFNEMRAAKEQAERELARMKRMLEETRVDWQKKLRERRREVRELKKRQQKQLERERKMREKQLERERQERELQAKLKMEQDSYEMRVAALAPKVEAMEHSWNRIRTISGADTPEEVLAYWEGLKAKEEQMRSLVSLAEQRESSAKSEIAALLENRSGMYEKGSAAAADVGEGSEERATLITEVERNMEGAKGKFNKLRSVCIGAEQGLRSLQERLMIALEEIHPDQLRASHMKGGHDAKARGKGAASAGARRGSAHAHTPDRNKRGPATGSRSQSPALVPHSPAGDKPSSPLHGTSPEHGHEPIPEGAEELAGEAEMVSPLGADGNTIDDEHFFPELPELLTSVTDRLNRVLVLAAELDAQEPAGAGEDGLPLSGEPGADGAEGAAPASPSRGAPEGLSESERTLVKGMNRRTWTGAPLLETINASPSEAALTLNIKRKKGKKKEQQVQPDLNRILGYTGSDVEEEEPESEEETEEEANKDDGVVDRDYIKLRALKMSQRLANQQRAIKV 749 T 0.00034 CALCOCO1 pdbhh F Eukaryota T 6u42 37 DQ 7Y A8HPK6_CHLRE FLAGELLAR ASSOCIATED PROTEIN 68 MGAANENIHMTDGIRRETMKKETLARERSLAAQSPYMAQVATYRARNPPLDHSRLMQDPKVQDWASIAGTRRSLATNVPDGGPRVNVNLLKYKRDADFISTTPYDGGPSYNAETCMQNWAEDRRDKHYKSGFHPKELRRSTRYDSEYSARFKPTSADYVGRLTHTYNTTSRFEGLTRVGTNGIAAPVLPKRSADTSGEHVFYAKDGYGPTPWMDHTAPTARGRFWVGTAPHVAHDTITHSTLRSEPLEFQQRCPTEDARSKILMGNKPLTHESDRTLRIRDDLVATNTFTRTWRTMYQSDHVDFSRRPATVR 312 T 0.034 DUF1143 pdbpercent F Eukaryota T 6u48 32 FA A phazolicin TXARXDSXSRXGAXGKXSGXAS 22 T 6.7 zf-Dof pdbhh F T 6u4a 2 C,D D,C cyclic peptide 3.1_3 XWWIIPXVKXGCX 13 T 4.2 DUF5989 pdbhh F T 6u6l 2 B B Cyclic peptide 3.1_2 XWKTIXGXTWRTXQC 15 T 5.8 CyRPA pdbhh F T 6u72 2 C C 3.1_2_AcK5toA XWKTIAGXTWRTXQC 15 T 6 BNR pdbhh F T 6u74 2 E,F E,F cyclic peptide 3.1_2 XWKTIXGXTWRTXQCX 16 T 6.8 CyRPA pdbhh F T 6u7q 1 A A SFTI1_HELAN GLY-ARG-CYS-THR-LYS-SER-ILE-PRO-PRO-ARG-CYS-PHE-PRO-ASP inhibitor GRCTKSIPPRCFPD 14 T 0.0013 Bowman-Birk_leg pdb F Eukaryota T 6u7r 1 A A SFTI1_HELAN GLY-LYS-CYS-LEU-PHE-SER-ASN-PRO-PRO-ILE-CYS-PHE-PRO-ASN inhibitor GKCLFSNPPICFPN 14 T 0.052 Bowman-Birk_leg unp F Eukaryota T 6u7u 1 A A SFTI1_HELAN GLY-ARG-ALA-THR-LYS-SER-ILE-PRO-PRO-ARG-ALA-PHE-PRO-ASP GRATKSIPPRAFPD 14 T 0.052 Bowman-Birk_leg unp F Eukaryota T 6u7w 1 A A GLY-LYS-ALA-LEU-PHE-SER-ASN-PRO-PRO-ILE-ALA-PHE-PRO-ASN GKALFSNPPIAFPN 14 T 7.8 ANAPC16 pdbhh F T 6u8g 2 E,F E,F cyclic peptide 3.1_2_AcK7toA XWKTIXGATWRTXQCX 16 T 8.1 PSP94 pdbhh F T 6u8h 2 B C cyclic peptide 3.2_2 XWSWLCKXYNLIH 13 T 3.2 Tet_res_leader pdbhh F T 6u8m 2 C C cyclic peptide 3.2_1 XWXKAILPGXILKTLHIC 18 T 2.8 Packaging_FI pdbhh F T 6u9e 1 A,B,C A,C,E Q7X3I9_FRANO PdpA MIAVKDITDLNIQDIISQLTSEVINGDTTPSSAKFACEINSYIINYNLSNINLINTQLKNTKILYRKGLISKLDYEKYKRYCIISRFKNNIDEFILYFSTNYKDSQSLKIAIKELQNSCSSSLILELPHDYIRKIDVLLTSIDSAIQRSSDLNKTIIKQLNKLRSSLSRYIGYNNVLQKQEITINIKPINKNFELEDISFVSTRNKQYFKHNSLTLKNPHIEKLEVCENIYGINGWLTFDLAYINNHKDFNFLLSPNQPILLDIQINDSFNFYKKESKKDHHKRTTRFIAIGFNSNSIDIHENFEYSIYSYTKNVSSGVKKFKIQFHDPLKALWTKHKPSYIALNKSLDDIFKDNFFFDSLFSLDTNKSNNLKIRIPQAFISTVNRNFYDFFIQQLEQNKCYLKYFCDKKSGKVSYHVVDQVDNDLQRNIVNSDEDLKDKLSPYDISCFKKQILISNKSNFYVKEKNICPDVTLNTQRKEDRKISDTLVKPFSSILKDNLQSVEYIQSNNDDKQEIITTGFEILLTSRNTLPFLDTEITLSKLDNDQNYLLGATDIKSLYISQRKLLFKRSKYCSKQLYENLHNFHYKSDSESDVYEKIAFTKYPSLTHDNSITYKIKDYSNLTPEYPKYKSFSNFYINGRITIGENVNNDSKKAYKFFKNHKPEESSIAEFQENGEKGTSAILNSKADILYAIEIAKEMLSDKSSDKPIIYLPLKVNINSANNQFIPLRNDDIILIEIQSFTKGEIIELISNSAISTKKAQQQLLQRQLLGSKENCEMAYTQTSDSETFSLTQVNEDCENSFLINDKKGIFLRYKSKGN 820 T 0.4 Usg pdbpercent F Bacteria T 6u9e 2 D,E,F B,D,F Q7X3I8_FRANO VgrG MDYKDDDDKDYKDDDDKDYKDDDDKGSKADHIFNLEEQGLLIDIKDDSKGCTTKLESSGKITHNATESIESSADKQIIENVKDSKISITEKEILLATKKSSIMLSEDKIVIKIGNSLIILDDSNISLESATINIKSSANINIQASQNIDIKSLNNSIKADVNLNAEGLDVNIKGSVTASIKGSAATMVG 189 T 0.0018 DUF2345 pdbpssm F Bacteria T 6u9x 1 A,B D,A B6SBM0_9TRYP Mitochondrial edited mRNA stability factor 1 GSHMDDALRGELASALDTEGHALPFDVHLQQPHSSGDGTAGDTSTIQLEKLSHPPARFDLLTNSFVYKWQTKAALARKVSGPMREWAAELKYRTGVHIELEPTYPERLSENAVKGSGSDDGDGTQWGAYETADDVDITVYLFGSERGIFNCHKLMEAAIQQDPVYVRLGIFRRLANSSEVEWLMLRRINRELRPPDIPPISLKLPGKWTLLYERYKEAAIRTLWEETGITVDASNVYPTGHLYQTVPQYYWRVPVRYFVAEVPSDIRVEGPQVVPLQYMRNWDARLLRQSPDPIDRAWAQLADPATGCAWMKASMIDQLQKPLRGDNYMAIRYTPPPYSNLQEVVGLGDGSITPSTGNGEDAS 363 T 0.00029 NUDIX pdb F Eukaryota T 6ubh 2 E,F,G,H E,F,G,H peptide KNFDFWV 7 T 0.55 DUF5926 pdbhh F T 6udr 1 A A S2-3, Lurch crystal form 1 XTRPDQXXXX 10 T 56 SH3_11 pdbhh F T 6uf2 1 A A Q31PX7_SYNE7 Biofilm-related protein MRIDELVPADPRAVSLYTPYYSQANRRRYLPYALSLYQGSSIEGSRAVEGGAPISFVATWTVTPLPADMTRCHLQFNNDAELTYEILLPNHEFLEYLIDMLMGYQRMQKTDFPGAFYRRLLGYDS 125 T 4.3 ATP13 pdbhh F Bacteria T 6uf7 1 A A S2-5, Uncle Fester XSEXRPXXIX 10 F F T 6uf8 1 A A S2-6, London Bridge XNXXPXAXKHXE 12 F F T 6uf9 1 A A S4-1, Tim apo-form KLXXXHXXQEXXKLXXXHXXQEXX 24 T 59 SRC-1 pdbhh F T 6uib 3 C C Peptide 23-652 DTLTKSFCYFGTWCQMYGST 20 T 3.6 DUF1911 pdbhh F T 6uka 2 B B ELMO2_MOUSE PROTEIN CED-12 HOMOLOG A MPPPSDIVKVAIEWPGANAQLLEIDQKRPLASIIKEVCDGWSLPNPEYYTLRYADGPQLYVTEQTRNDIKNGTILQLAVSA 81 T 0.029 DUF3697 pdbpssm F Eukaryota T 6uke 1 A X I3DBY6_HAEPH HhaI Restriction Endonuclease MNWKEFEVFCVTYLNKTYGNKFAKKGESDSTTSDILFTGNNPFYIEAKMPHSQCGQFVLIPNRAEYKFDYSPKNKSEINPYTQKIMQFMSENFSEYANLSTKGKIIPLPESVFVNWIKEYYKSKSVKFFITSNGDFIIFPIEHFEHYFNVSCTYRIKKSGSRHLNSKSLPDFKQALDKKGISYTMRGLELHSDENIHDKRISGDDKDFLIKENNGAYHVKILSNTFNANVIFSISLKNNISLFILNEDRKAFEAAISL 258 T 0.03 DUF4105 pdbpssm F Bacteria T 6ulo 1 A,B A,B Q72TN2_LEPIC Uncharacterized protein MAHHHHHHKIEENQNVSLNEGDIVSKLKETPQETLVPTKWDVGDTTVSNEDRLDLLIPHVQNLGNVYVGVGSEQNLTIAAWAKSDFIYLMDFTQIVVHANTITILFLQKSEKKEDFIRLWGKEGEKEALELIQVSFSDPEVYKKVYKQASPFIRKRHKTNLMLSKKYNYKMFQTDDEQYSYIRKLAIEGKILPIRGNLLGNITLTGIGNTLKKIGRKVGIIYFSNAEEYFAYPQEFKNSILNLPVSESSLVVRTISVRKDLFPWSPGSEISTDRGFHYCVQKISNFQKWLSSGKPGLRSLQVMVEGGTVDKKNGITVVDKEPVVTEDKLPKTGG 334 T 0.014 Hypoth_Ymh unppssm F Bacteria T 6ulp 2 C C Cyclic peptide 3.2_3 XWXQWKXYGLKICX 14 T 0.39 Fungal_KA1 pdbhh F T 6ulq 2 D,E,F D,E,F Cyclic peptide 4.2_3 XWXGYLCLRXRIQRTYNX 18 T 4.2 DUF2569 pdbhh F T 6uls 2 B B E2F1_HUMAN Diacetylated E2F1 Peptide (K117ac and K120ac) HPGXGVXSPGX 11 T 0.11 TP1 unp F Eukaryota T 6ult 2 I,J,K,L I,J,K,L Cyclic peptide 4.2_3 XWXGYLCLRXRIQ 13 T 2.2 RRM_DME pdbhh F T 6ulv 2 E,F E,G Cyclic peptide 4.2_3 XWXNWCWLXRXLLLRX 16 T 1.1 UL17 pdbhh F T 6umm 4 E,J E,J ECCC3_MYCS2 ESX CONSERVED COMPONENT C3,TYPE VII SECRETION SYSTEM PROTEIN ECCC3,T7SS PROTEIN ECCC3 MSRLIFEHQRRLTPPTTRKGTITIEPPPQLPRVVPPSLLRRVLPFLIVILIVGMIVALFATGMRLISPTMLFFPFVLLLAATALYRGGDNKMRTEEVDAERADYLRYLSVVRDNVRAHAAEQRAALEWSHPEPEVLATIPGTRRQWERDPRDRDFLVLRAGRHDVPLDAALKVKDTADEIDLEPVAHSALRGLLDVQRTVRDAPTGLDVAKLARITVIGEADEARAAIRAWIAQAVTWHDPTMLGVALAAPDLESGDWSWLKWLPHVDVPNEADGVGPARYLTTSTAELRERLAPALADRPLFPAESGAALKHLLVVLDDPDADPDDIARKPGLTGVTVIHRTTELPNREQYPDPERPILRVADGRIERWQVGGWQPCVDVADAMSAAEAAHIARRLSRWDSN 403 T 0.071 Bac_export_3 pdb F Bacteria T 6upw 1 A,C L,M VINC_HUMAN METAVINCULIN,MV MPVFHTRTIESILEPVAQQISHLVIMHEEGEVDGKAIPDLTAPVAAVQAAVSNLVRVGKETVQTTEDQILKRDMPPAFIKVENACTKLVQAAQMLQSDPYSVPARDYLIDGSRGILSGTSDLLLTFDEAEVRKIIRVCKGILEYLTVAEVVETMEDLVTYTKNLGPGMTKMAKMIDERQQELTHQEHRVMLVNSMNTVKELLPVLISAMKIFVTTKNSKNQGIEEALKNRNFTVEKMSAEINEIIRVLQLTSWDEDAWASKDTEAMKRALASIDSKLNQAKGWLRDPSASPGDAGEQAIRQILDEAGKVGELCAGKERREILGTCKMLGQMTDQVADLRARGQGSSPVAMQKAQQVSQGLDVLTAKVENAARKLEAMTNSKQSIAKKIDAAQNWLADPNGGPEGEEQIRGALAEARKIAELCDDPKERDDILRSLGEISALTSKLADLRRQGKGDSPEARALAKQVATALQNLQTKTNRAVANSRPAKAAVHLEGKIEQAQRWIDNPTVDDRGVGQAAIRGLVAEGHRLANVMMGPYRQDLLAKCDRVDQLTAQLADLAARGEGESPQARALASQLQDSLKDLKARMQEAMTQEVSDVFSDTTTPIKLLAVAATAPPDAPNREEVFDERAANFENHSGKLGATAEKAAAVGTANKSTVEGIQASVKTARELTPQVVSAARILLRNPGNQAAYEHFETMKNQWIDNVEKMTGLVDEAIDTKSLLDASEEAIKKDLDKCKVAMANIQPQMLVAGATSIARRANRILLVAKREVENSEDPKFREAVKAASDELSKTISPMVMDAKAVAGNISDPGLQKSFLDSGYRILGAVAKVREAFQPQEPDFPPPPPDLEQLRLTDELAPPKPPLPEGEVPPPRPPPPEEKDEEFPEQKAGEVINQPMMMAARQLHDEARKWSSKPGIPAAEVGIGVVAEADAADAAGFPVPPDMEDDYEPELLLMPSNQPVNQPILAAAQSLHREATKWSSKGNDIIAAAKRMALLMAEMSRLVRGGSGTKRALIQCAKDIAKASDEVTRLAKEVAKQCTDKRIRTNLLQVCERIPTISTQLKILSTVKATMLGRTNISDEESEQATEMLVHNAQNLMQSVKETVREAEAASIKIRTDAGFTLRWVRKTPWYQ 1134 T 1.9E-200 Vinculin pdb F Eukaryota T 6utc 1 A A Q9RP86_VIBVL TRANSCRIPTIONAL REGULATOR,TRANSMEMBRANE TRANSCRIPTION ACTIVATOR MAHHHHHHTNPSESKFRLLENVNGVEVLTPLNHPPLQAWMPSIRQCVNKYAETHTGDSAPVKVIATGGQGNQLILNYIHTLPHSNENVTLRIFSEQNDLGSICK 104 T 0.12 TPPK_C unppercent F Bacteria T 6uud 3 C A CSP_PLAFA Circumsporozoite protein EDNEKLRKPKHKKLKQPA 18 T 8.1 P120R pdbhh F Eukaryota T 6ux5 1 A A ACR1_ACTEQ U-AITX-AEQ5A,ACRORHAGIN I,ACRORHAGIN-1 SSTPDGTWVKCRHDCFTKYKSCQMSDSCHDEQSCHQCHVKHTDCVNTGCP 50 T 0.027 DUF4802 pdbhh F Eukaryota T 6uxc 1 A,B A,B A0A0H3MBU8_CHLT2 CT253 SNSGSYNARLYTKGSKAKGVVAMLPVFYRTEKSAELLPWNLQAEFSEEISRRLHSSDKLLLIKHHASAGVAAQFFSPTPNISPELATQLLPAEFVVAAEILEQKTTEDVLNPSISASVRVRVFDIRHNKVSMIYQEILDASQSLASGSNDYHRYGWRSKNFDSTPMGLMHQRLFREIVARVEGYVCANYS 190 T 6.1E-05 CsgG unphh F Bacteria T 6uxd 1 A,B A,B A0A0H3MCU1_CHLT2 CT021 AHSPLQSSIQEKILTARPGDYAVLSRGSQKFFFLIRQSSSEATWVEMSEFASLTQQEKKLVEQSSWKNAFHQLQSSKKVYLLRISKNPLMIFVLKNAQWMPLSEKDPLPFFVKILRLPLSPAPSHLIKYKGKERTPWSPRTSLNGELITLPSSAWISVWPKDSSPLSEKNILIYFSNNERLAFPLWTSIDTPTGTVIIKTIEMGHQAASSYPALPNF 217 T 2.7 DUF3868 unphh F Bacteria T 6uxf 1 A A NUCC_VIBMT Vibrio meotecus sp. RC341 NucC MAQDWQLSELLENLHADVQHKLTTVRKSFKHSVVKGDGAENVWVDLFNQYLPERYRASRAFVVDSENQFSEQIDVVIYDRQYSPFIFHYAEQLIIPAESVYAVFEVKQTLNKQHIDAARKKVASVRALHRTSLPIPHAGGVHSPRELIGIIGGLLTLENELKIPDTLMGHLDHDKADKGMLNIGCAADDCFFYYDNDHQRMQVMQHKKATTAFLFELLSQLQKCGTVPMIDIHAYGKWLTPRISE 245 T 0.53 NERD pdbhh F Bacteria T 6uxv 6 I I SNF6_YEAST SWI/SNF COMPLEX COMPONENT SNF6 MGVIKKKRSHHGKASRQQYYSGVQVGGVGSMGAINNNIPSLTSFAEENNYQYGYSGSSAGMNGRSLTYAQQQLNKQRQDFERVRLRPEQLSNIIHDESDTISFRSNLLKNFISSNDAFNMLSLTTVPCDRIEKSRLFSEKTIRYLMQKQHXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 179 T 0.017 Glyco_transf_34 pdbpercent F Eukaryota T 6uzn 3 C C Synthetic peptide THR-VAL-ARG-ALA-SER-GLY-HIS-SER-TYR TVRASGHSY 9 T 5.3 Gly_radical pdbhh F T 6uzo 3 C C Synthetic peptide HIS-LEU-ALA-SER-SER-GLY-HIS-SER-TYR HLASSGHSY 9 T 9.3 DUF562 pdbhh F T 6v0n 3 C C RIOK1_HUMAN Riok1 PBM peptide VVPGQFDDADSSD 13 T 0.00026 COPR5 pdbhh F Eukaryota T 6v0o 3 C D ICLN_HUMAN PBM peptide TVAGQFEDADVDH 13 T 0.007 COPR5 pdbhh F Eukaryota T 6v4b 1 A,B A,B V4JF97_9DELT Neur_chan_LBD domain-containing protein MHNLQQLLPTRSLIWIFSFLTSISIWCTVAHAETEGRVQHFTGYIEDGRGIFYSLPDMKQGDIIYASMQNTGGNLDPLVGIMAEEIDPAVSLGQVLEKALASENDLISELTAVADRIFLGWDDDGGKGYSASLEFTIPRDGTYHIFAGSTITNQRLDKFQPTYTTGSFQLILGLNAPQVISGEGEPEGEVFASLASLEIKPE 202 T 0.088 PPC pdbpercent F Bacteria T 6v4e 2 C,D C,D Stapled peptide QSQQTF(0EH)NLWRLL(MK8)QN(NH2) QSQQTFXNLWRLLXQNX 17 T 0.0017 P53_TAD pdbhh F T 6v4g 2 B B Stapled peptide QSQQTF(0EH)NLWRLE(MK8)QN(NH2) QSQQTFXNLWRLEXQNX 17 T 0.011 P53_TAD pdbhh F T 6v67 1 A,B A,B PD-1 Binding Miniprotein GR918.2 GSCFCVCITGPQWDYRYGNKEQCKKFLTECEQKNPGAEVEIQC 43 T 3.5 SBP_bac_8 pdbhh F T 6v6a 2 B,D B,D S7V0W9_TOXGG Apical Cap Protein 9 (AC9) STRPKFVPCLSTAAAGAGSWMSGNREPSEYPQGM 34 T 1.6 DUF1168 pdbhh F Eukaryota T 6v7b 3 C,D,E,F,G,H,I,J,K,L,M,N,O,P,Q,R,S,T,U,V,W,X,Y A,B,C,D,E,F,G,H,I,J,K,L,M,N,O,P,Q,R,S,T,U,V,W A0A140F3K6_9VIRU Structural protein VP1 MSVVTTRARIAETLTEKHTLGIEKVVATDSWRVGITSREKKLERINISAEISRRIQDEAIAYARNKGIPYLPGINGIAWKLLRLKWLGYTDQINVVMRTVPAEWRDFLTQIMENTQMESMYSELRKVRV 129 T 0.15 NDUFA12 pdb T Viruses T 6v7k 2 C X alpha/beta-Peptide HH4 XEXNCDIHVXXEWXCFXRX 19 T 16 Desulfoferrodox pdbhh F T 6v7p 2 B,D D,B Protein PIAS GSGEAEERIISLD 13 T 20 Lsm_C pdbhh F T 6v7u 1 A,B A,B A0SML3_9CAUD Quorum sensing anti-activator protein Aqs1 MTNTDLKPLLDNLRNATEFWNLVAAASATDESTVHNRSYRDALDWLESAALALGDALIAQRKAVGGDHE 69 T 0.42 YqaH unphh T Viruses T 6v7w 2 B,C,E,F A,C,D,F A0SML3_9CAUD QUORUM SENSING ANTI-ACTIVATOR PROTEIN AQS1 MTNTDLKPLLDNLRNATEFWNLVKEASATDESTVHNRSYRDALDWLESAALALGDALIAQRKAVGGDHE 69 T 0.42 YqaH unphh T Viruses T 6v84 2 B,D C,D LyCALAc ANSRLPTSXI 10 T 10 DUF3697 pdbhh F T 6v8i 3 D,H,L CF,BF,AF A4ZFC2_9CAUD Tape Measure Protein, gp57 MTEYKIKATIEASVAKFKRQIDSAVKSVQRFKRVADQTKDVELNANDKKLQKTIKVAKKSLDAFSNKNVKAKLDASIQDLQQKILESNFELDKLNSKEASPEVKLQKQKLTKDIAEAENKLSELEKKRVNIDVNADNSKFNRVLKVSKASLEALNRSKAKAILDVDNSVANSKIKRTKEELKSIPNKTRSRLDVDTRLSIPTIYAFKKSLDALPNKKTTKVDVDTNGLKKVYAYIIKANDNFQRQMGNLANMFRVFGTVGSNMVGGLLTSSFSILIPVIASVVPVVFALLNAIKVLTGGVLALGGAVAIAGAGFVAFGAMAISAIKMLNDGTLQASSATNEYKKALDGVKSAWTDIIKQNQSAIFTTLANGLNTVKTAMQSLQPFFSGISRGMEEASQSVLKWAENSSVASRFFNMMNTTGVSVFNKLLSAAGGFGDGLVNVFTQLAPLFQWSADWLDRLGQSFSNWANSAAGENSITRFIEYTKTNLPIIGNIFKNVFAGINNLMNAFSGSSTGIFQSLEQMTAKFREWSEQVGQSQGFKDFVSYIQTNGPLIMQLIGNIARGLVAFATAMAPIASAVLRVAVAITGWIANLFEAHPATAQLVGVIITLVGAFRFLIAPILAVMDFLGPLAARLVALVTKFGWAKTGTLVLSKAMTSLKGPIKLVTAIFQLLFGKIGLIRNAITGLVTVFGILGGPITIVIGVIAALIAIFVLLWNKNEGFRNFIINAWNAIKTFMVNVWNVLKAVASVVWNAILTAITTAVSNVYNFIMIVWNQIVAYLQGLWNGIIAIATTVWNLLVTIITTVFTTIMTIVMTIWTAIWTFLSTIWNTIITIATTIWNLLVTVITTVFTTIMTIAMTIWNAIWTFLQTLWNTIVTVATKVWNAITTAISTALQAAWSFISNIWNTIWSFLSGILTTIWNKVVSIFTQVVSTISDKMSQAWNFIVTKGMQWVSTITSTLINFVNRVIQGFVNVVNKVSQGMTNAVNKIKSFIGDFVSAGADMIRGLIRGIGQMAGQLVDAAKNVAKKALDAAKSALGIHSPSREFMDVGMYSMLGFVKGIDNHSSKVIRNVSNVADKVVDAFQPTLNAPDISSITGNLSNLGGNINAQVQHTHSIETSPNMKTVKVEFDVNNDALTSIVNGRNAKRNSEYYL 1154 T 0.077 Nucleoporin_FG2 pdbhh T Viruses T 6v98 1 A A Q9KMN9_VIBCH Cysteine hydrolase SGNDDFLIPVVFPDYLISVADEQSFELWGVKIKTPAVKAPYLGHAGVILINGETGVTRYYEYGRYKNPKSDIPGNVRKVGVSNVTIKSGLITESSLLKVLKEVSLRSGQEGRISGVVLRGKFFSEADSWLRGKMDLNNSPDKIPYDLDSHNXMTFVIDLADAMGLDPAWKPPVVVPSAYIEQFQLSEIDLDYDYKTNKLTVSE 203 T 0.027 DUF4105 pdbhh F Bacteria T 6v9z 2 B,D C,D A3DCU2_HUNT2 CtA SNAMSEAKKLNIGRELTDEELMEMTGGSTFSIQCQKDYTYKPSLPVVKYGVVIDEPEVVIKYGVGPIVGIKYGVEPIGPIQPMYGIKPVETLK 93 T 0.017 L_biotic_typeA unphh F Bacteria T 6vb0 3 C C Synthetic peptide GLU-LEU-ARG-ALA-ARG-GLU-GLU-SER-TYR ELRAREESY 9 T 0.04 Alpha_TIF pdbhh F T 6vb2 3 C C LOXE3_HUMAN Synthetic peptide GLU-LEU-ARG-ALA-ARG-GLN-GLU-CYS-TYR ELRARQECY 9 T 1.4 SUIM_assoc pdbhh F Eukaryota T 6vb3 3 C C Synthetic peptide THR-VAL-ALA-ALA-SER-GLY-HIS-SER-TYR TVAASGHSY 9 T 2.5 BAAT_C pdbhh F T 6vb9 1 A,B,C,D A,B,C,D ACEA_MYCTU Isocitrate lyase GSHMSVVGTPKSAEQIQQEWDTNPRWKDVTRTYSAEDVVALQGSVVEEHTLARRGAEVLWEQLHDLEWVNALGALTGNMAVQQVRAGLKAIYLSGWQVAGDANLSGHTYPDQSLYPANSVPQVVRRINNALQRADQIAKIEGDTSVENWLAPIVADGEAGFGGALNVYELQKALIAAGVAGSHWEDQLASEKKXGHLGGKVLIPTQQHIRTLTSARLAADVADVPTVVIARTDAEAATLITSDVDERDQPFITGERTREGFYRTKNGIEPCIARAKAYAPFADLIWMETGTPDLEAARQFSEAVKAEYPDQMLAYNCSPSFNWKKHLDDATIAKFQKELAAMGFKFQFITLAGFHALNYSMFDLAYGYAQNQMSAYVELQEREFAAEERGYTATKHQREVGAGYFDRIATTVDPNSSTTALTGSTEEGQFH 431 T 1.8E-47 ICL pdb F Bacteria T 6vdb 2 B H ALA-PRO-ARG-PHE-GLY-GLY-VAL-MET-ARG-PRO-ASN-ARG APRFGGVMRPNRYR 14 T 5.3 MraY_sig1 pdbhh F T 6vdp 1 A A SFMD_STRLA 3-methyl-L-tyrosine peroxygenase MTAPADTVHPAGQPDYVAQVATVPFRLGRPEELPGTLDELRAAVSARAGEAVRGLNRPGARTDLAALLAATERTRAALAPVGAGPVGDDPSESEANRDNDLAFGIVRTRGPVAELLVDAALAALAGILEVAVDRGSDLEDAAWQRFIGGFDALLGWLADPHSAPRPATVPGAGPAGPPVHQDALRRWVRGHHVFMVLAQGCALATACLRDSAARGDLPGAEASAAAAEALMRGCQGALLYAGDANREQYNEQIRPTLMPPVAPPKMSGLHWRDHEVLIKELAGSRDAWEWLSAQGSERPATFRAALAETYDSHIGVCGHFVGDQSPSLLAAQGSTRSAVGVIGQFRKIRLSALPEQPATQQGEPS 365 T 0.00091 Hs1pro-1_C pdbhh F Bacteria T 6ve5 2 B B SHLD3_HUMAN REV7-INTERACTING NOVEL NHEJ REGULATOR 1,SHIELD COMPLEX SUBUNIT 3 MWFPYDGSKLPLRPKRSPPVISEEAAEDVKQYLTI 35 T 21 MauJ pdbhh F Eukaryota T 6ve7 6 W,XA W,x A0A2K3DTN6_CHLRE FAP276 MDLKQQVKNYTMTIRNTRPPTMIKEQDKSEFSHFRALQVLANGDEVPYEATLRNVIHDGARQPKLPPRQTQKHPGYIRNESGGFFT 86 T 0.092 DUF3337 unppercent F Eukaryota T 6vek 1 A A CONTACT-DEPENDENT INHIBITOR A MVENNYLSVSEKTELEIAKQKLKNSKDPAEREKAQQKYDALLEKDISSDKAVITACSNGQAASAACAGERLKVIAAKGGYETGHYNNQVSDMYPDAYGQIVNLLNITSVDAQNQQQVKDAMVNYAMVQFGVDRATAQAYVETYDGMKVVAASMAPVIGAAAASKIEVLAGKQRLSNSFEVSSLPDANGKNHITAVKGDAKIPVDKIELYMRGKASGDLDSLQAEYNSLKDARISSQKEFAKDPNNAKRMEVLEKQIHNIERSQDMARVLEQAGIVNTASNNSMIMDKLLDSAQGATSANRKTSVVVSGPNGNVRIYATWTILPDGTKRLSTVNTGTFK 338 T 0.0083 ORF6C pdb F T 6vek 2 B I A0A2A2C800_ECOLX contact-dependent immunity protein CdiI MINVNSTAKDIEGLESYLANGYVEANSFNDPEDDALECLSNLLVKDSRGGLSFCKKILNSNNIDGVFIKGSALNFLLLSEQWSYAFEYLTSNADNITLAELEKALFYFYCAKNETDPYPVPEGLFKKLMKRYEELKNDPDAKFYHLHETYDDFSKAYPLNNHHHHHH 167 T 0.2 PIG-X unp F Bacteria T 6ven 11 O O BRE2_YEAST BREFELDIN-A SENSITIVITY PROTEIN 2,COMPLEX PROTEINS ASSOCIATED WITH SET1 PROTEIN BRE2,SET1C COMPONENT BRE2 MKLGIIPYQEGTDIVYKNALQGQQEGKRPNLPQMEATHQIKSSVQGTSYEFVRTEDIPLNRRHFVYRPCSANPFFTILGYGCTEYPFDHSGMSVMDRSEGLSISRDGNDLVSVPDQYGWRTARSDVCIKEGMTYWEVEVIRGGNKKFADGVNNKENADDSVDEVQSGIYEKMHKQVNDTPHLRFGVCRREASLEAPVGFDVYGYGIRDISLESIHEGKLNCVLENGSPLKEGDKIGFLLSLPSIHTQIKQAKEFTKRRIFALNSHMDTMNEPWREDAENGPSRKKLKQETTNKEFQRALLEDIEYNDVVRDQIAIRYKNQLFFEATDYVKTTKPEYYSSDKRERQDYYQLEDSYLAIFQNGKYLGKAFENLKPLLPPFSELQYNEKFYLGYWQHGEARDESNDKNTTSAKKKKQQQKKKKGLILRNKYVNNNKLGYYPTISCFNGGTARIISEEDKLEYLDQIRSAYCVDGNSKVNTLDTLYKEQIAEDIVWDIIDELEQIALQQ 505 T 0.00011 Neuralized pdbhh F Eukaryota T 6vg7 1 A A De novo designed protein RO2_25 MGSSHHHHHHSSGLVPRGSHMTLFVLILSNDKKLIEEARKMAEKANLILITVGDEEELKKAIKKADDIAKKQNSSEAKILILLEKPVSPEYEKKLQKYADAEVRVRTVTSPDEAKRWIKEFSEE 124 T 0.0037 Regulator_TrmB pdbpercent F T 6vga 1 A A De novo designed protein RO2_1 MGSSHHHHHHSSGLVPRGSHMRLVVLIVSNDKKLIEEARKMAEKANLELITVPGSPEEAIRLAQEIAEKAPGPVKVLVLITGSADPDEKTKAKKAAEEARKWNVRVRTVTSPDEAKRWIKEFSEE 125 T 0.0014 Regulator_TrmB pdb F T 6vgb 1 A A De novo designed protein RO2_20 MGSSHHHHHHSSGLVPRGSHMGLLVLIWSNDKKLIEEARKMAEKANLYLLTLETDDKKIEDILKSLGPPVKILVLLEDTKDADKVKKEIEKKARKKNLPVRIRKVTSPDEAKRWIKEFSEE 121 T 0.068 IF3_N pdbpssm F T 6vh8 1 A A Excelsatoxin A LPRCDSPFCSLFRIGLCGDKCTCVPLPIFGLCVPDV 36 T 0.0036 Albumin_I pdbhh F T 6vhj 1 A A LAN11_PROMM Prochlorosin 1.1 FFCVQGXANRFXINVC 16 T 0.0065 Bacteriocin_IIc unppercent F Bacteria T 6vi1 3 M,N,O,P,Q,R M,N,O,P,Q,R TERL_BPP22 DNA-PACKAGING PROTEIN GP2,GENE PRODUCT 2,GP2 MELDAILDNLSDEEQIELLELLEEEENYRNTHL 33 T 0.0057 DUF3775 pdbhh T Viruses T 6vjq 1 A A Q7TUK2_PROMM Prochlorosin 2.1 CCIXGESPGXAPXNDYKCXKGRGPGGCY 28 T 0.009 NHase_alpha unphh F Bacteria T 6vjz 4 D D USA1_YEAST U1 SNP1-associating protein 1 VRAADNTSSANDNNTVENDESAWNRRVVRPLRNSFPLLLVLIRTFYLIGYNSLVPFFIILEFGSFLPWKYIILLSLLFIFRTVWNTQEVWNLWRDYLHLNEIDEVKFSQIKEFINSNSLTLNFYKKCKDTQSAIDLLMIPNLHEQRLSVYSKYDIEYDTNTPDVGQLNLLFIKVLSGEIPKDALDELFKEFFELYETTRNMNTLYPQDSLNELLLMIWKESQKKDINTLPKYRRWFQTLCSQIAEHNVLDVVLRYIIPDPVNDRVITAVIKNFVLFWVTLLPYVKEKLDDIVAQRARDREQPAPSAQQQENEDEALIIPDEEEPTATGAQPHLYIPDED 339 T 0.03 GyrB_insert pdb F Eukaryota T 6vk9 2 AA,BA,CA,DA,EA,FA,Q,R,S,T,U,V,W,X,Y,Z F,X,Z,1,3,5,D,B,P,R,T,V,H,J,L,N Q74D22_GEOSL Geopilin domain 2 protein AGKIPTTTMGGKDFTFKPSTNVSVSYFTTNGATSTAGTVNTDYAVNTKNSSGNRVFTSTNNTSNIWYIENDAWKGKAVSDSDVTALGTGDVGKSDFSGTEWKSQ 104 T 0.11 DUF1445 pdb F Bacteria T 6vl2 1 A A Stigmurin FFSLIPSLVGGLISAFKX 18 T 0.55 Endotoxin_N pdbhh F T 6vlj 1 A A Q7V447_PROMM Prochlorosin 2.8 AACHNHAPXMPPXYWEGEC 19 T 0.0038 NHase_alpha unphh F Bacteria T 6vmc 5 E C PMEL_HUMAN ME20-M,ME20M,MELANOCYTE PROTEIN PMEL 17,MELANOCYTES LINEAGE-SPECIFIC ANTIGEN GP100,MELANOMA-ASSOCIATED ME20 ANTIGEN,P1,P100,PREMELANOSOME PROTEIN,SILVER LOCUS PROTEIN HOMOLOG ILDQVPFSV 9 T 6.2 Spin-Ssty pdbhh F Eukaryota T 6vo5 2 C,D C,D H4_HUMAN Histone H4 SGRGKGGKGLGAGGAKRHRK 20 T 11 Shadoo unppercent F Eukaryota T 6vpx 1 A,B,C A,E,C B9V6B3_9HIV1 Envelope glycoprotein gp120 AEQLWVTVYYGVPVWKEATTTLFCASDARAYDTEVHNVWATHACVPTDPNPQEVVLENVTENFNMWKNNMVEQMHEDIISLWDQSLKPCVKLTPLCVTLNCTDLRNSSSGEKMEGGEIKNCSFNITTSMRDKVQKEYALFYKLDVVPIKNDNTSYRLISCNTSVITQACPKVSFEPIPIHYCAPAGFAILKCNDKKFNGTGPCTNVSTVQCTHGIRPVVSTQLLLNGSLAEEEVVIRSANFTDNAKIIIVQLNKSVEINCTRPNNNTRKSIHIGPGRAFYTTGEIIGDIRQAHCNISGTKWNDTLKQIVVKLKEQFGNKTIVFNHSSGGDPEIVMHSFNCGGEFFYCNSTQLFNSTWNDTEGSNNTKGNGTIVLPCRIKQIVNMWQEVGKAMYAPPIKGQIRCSSNITGLILIRDGGNNNESTEIFRPGGGDMRDNWRSELYKYKVVKIEPLGIAPTKAKRRVVQ 465 T 2.9999999999999995E-54 GP120 pdb T Viruses T 6vpz 3 C C POL_HV1H2 11-mer peptide KRWIILGLNKI 11 T 4.2 COX2-transmemb pdbhh T Viruses T 6vq2 3 C C POL_HV1H2 14-mer peptide KRWIILGLNKIVRM 14 T 6.2 COX2-transmemb pdbhh T Viruses T 6vq6 8 P,Q,R Q,R,S Q5ZWW6_LEGPH Effector protein SidK GMSFIKVGIKMGGLTSEQYHSQVVGKIGYIARCMQTIDPENNLKKIREDYQDVLIWAEKNYRFEEILEASKSGKCPNDLDALSRRSLILQELLRLVSSISPFKMKLDLIESQYEKMKQHVNLWKSDYHVKLNQLNQLTDYLKNAAPTPKNNFLRAMTSVLQMQIAQYGITEDNEGINQLFKLGLHLLAMANEKIDEQYHLFKGYVKDQPEESPFEGILPAEDQKILVKTMIDYAMPKLSSKVLQDKLSALSSSDVLTKTLLDSIDRIVKENEKLNALSKDYKDHDGDYKDHDIDYKDDDDK 301 T 1 DUF3276 unppssm F Bacteria T 6vqd 3 C C POL_HV1H2 8-mer peptide KRWIILGL 8 T 0.56 COX2-transmemb pdbhh T Viruses T 6vqe 3 C C POL_HV1H2 13-mer peptide KRWIILGLNKIVR 13 T 5.7 COX2-transmemb pdbhh T Viruses T 6vqp 2 B Q CalU17 His-Tagged protein MGSSHHHHHHSSGLVPRGS 19 T 9200 zf_CCCH_4 pdbhh F T 6vqv 1 A,B A,B AcrF9 MKAAYIIKEVQNINSEREGTQIEATSLSQAKRIASKEQCFHGTVMRIETVNGLWLAYKEDGKRWVDCQ 68 T 0.11 Ribosomal_L19 pdbpssm F T 6vqw 4 I A AcrF8 MARIAPNEDSTMSTAYIIFNSSVAAVVDTEIANGANVTFSTVTVKEEINANRDFNLVNAQNGKISRAKRWGNEASKCEYFGREINPTEFFIK 92 T 10 PHF12_MRG_bd pdbhh F T 6vqx 1 A A A0A6B1YCA6_PSEAI AcrF6 MKVPAFFAANILTIEQIIEAINNDGSAMTSAPEIAGYYAWDAATDALESENDLEQLTEDDFVAHLEVLEERGAKIDRDAAIAVALQFQAAAVNDLHSGDE 100 T 0.47 SUB1_ProdP9 pdb F Bacteria T 6vqy 3 E,F G,F POL_HV1H2 7-mer peptide KRWIILG 7 T 0.7 COX2-transmemb pdbhh T Viruses T 6vr4 1 A,B A,B S0A2C3_9CAUD DNA-dependent RNA polymerase MGSSHHHHHHSQDPMACKIENIKYKGKEVESKLGSQLIDIFNDLDRAKEEYDKLSSPEFIAKFGDWINDEVERNVNEDGEPLLIQDVRQDSSKHYFFILKNGERFDLLTREFDSFTSPDLTNEIKEITDQLSYYIYNKHFSSDFEQVEGAKLNIQNEISQFVKEGKAPVQAAYNKLQDPDIKDLLDYYDNIEKHSDEFESEIVKFFSEKKLIIKDAELEDVTQEGLNEGLQGGDLVQAFEKNSKDNATANVKLMLSFLPKIDNLTGEPALGDYLNKPVFRSFDSIHSELLEVLSDITTLHVQGEVLDVFSSMYNKIKELADFKKSFKPLLEILDTIDEQKKTEFVQAFYLSKINFYTTTIETLETEDQNNTLTTFKVQNVSNANNPISSKLTEYYTNFKYKILPGGKLNKGKLKDLQSTVTSLLEKTRKENNPKYKSDSDFYEVFEEGVVELMQVFEDLGVDSITFEAMDIFLKQFRFDLPENNAYKIMYQQYQGKLTNLNNLLKDIQSNKINPYKINPFKNYSNLIFNSLAEAENYFIENNNESTIFSNGKTYWNFARPSYISNRINTFKNNPGVLRQLLNTSYGQSSLWAKHLLGEEKNVTGDFVLAGNARESASENRLKSLELSIFNSLQEKDKGAEGNDNGSISIVDQLADKLNKVLRGGTKNGTSIYSTVTPGDKSTLHEIKIDHFIPETISSFSNGTMIFNDKIVNAFTDHFVSEVNRMKEAYQELETLPESKRVVHYHTDARGNVMKDGKLAGNAFKSGHILSELSFDQITQDDNEMLKLYNEDGSPINPKGAVSNEQKILIKQTINKVLNQRIKENIRYFKDQGLVIDTVNKDGNKGFHFHGLDKSIMSEYTDDIQLTEFDISHVVSDFTLNSILASIEYTKLFTGDPANYKNMVDFFKRVPATYTNGTNLRLGLEANDHLFDVAVLENIVKPSAYLKEIGESLKLSDLSEAEKKYILEAYEDVNQTDAQAWITPKRWAFLISRTGKWNSKYQSVYNKILKSESLDASEMKLAAQPLKGVYFGLVNNTPTYLKYSQAVLLPQLVAGTQLQSLADAMNKQDIGESIVLDGVKVGATTPNIVTDENGDILKSISLNPLTLSNADWKLQQDLPVKTIKPTLLGSQIQKNIYSSLTDEATYTIENEAFNGSGMFQAINDTVSAMSNLSIAGLSSELGKDSEGKIDKRKLYDMLEREMLDKGSAINLLKSIQKNLPIEAMPGIKDKLYNIVFSKINSAAVKLKTNGGSFIQLSNFGLDKQTADAKGITWLVEPSDLKPPVIEKDADGKNYIRPGQIFMSHVQIAKLVPDYAKMDSKTLSSMIDPKALRAIGYRIPNQGQSSNDPLQIVGILPEAMGDTIVAYTEIPTKTGSDFDIDKMYVMLPNFKVEHTKKSFKLAKDYIAQNEITVEEMYDELEDHGFNIDDIANGEEVTESAITEAFIKNHILNSNSELEYHNDFVKQHNIDAVNKIDFLGYSEELHKNKSEQLQNRLFDLYWAVLTNEKTYGDLITPIDFPHVKDEIKRVFGDNSKQTGENLKFHDPLYQLKLKFTYAGGKSGVGITANMLVDHNRSKGIDMQFNQYNLGVGHTQNGNTVFDKEYSEELNGTRFKIKDTISAFLNAFVDNAKDPYINDGNFNTYTSSVAFMLIRAGVHPDWIISFIGQPVLRELADFTQRYESKIIPKEDVGKSSFDIIVEKYETINQESYKDAESRAFSLDTLQESIEVGVHGIDLDVLKTFKGFQEQAKRLNESVQLSRFDTNGSGKNILDLIILKNKIKNLYVSEQTQQKGSMMNHFKKYHNNGKITSLGTQVKNTLLFTDDILNNNPSLFLLGSKPIQDLVNSISNNLVDSRGGSRGLLTNEDVGKLFYKEVYKYIMADFAPFKVGDPMAYIKDTIFDLVNYKTEDKQYDSSNFFIENMTVYENSFGITNKNKSVDFQDRLYRSAYDLMMENPELANKMFISSFLMNGFENKLIDIKEYIPYQWFLENDIRSFIESKNTGLKDSSESLRSFEEQFIKNNSDSNILAPKVSQSVIKSIKGIKSKHVFELPINDKTKRYILGATETKEEVLPNYVKVGSDLYRLKAYREKSGVYVRTNKLGFEDPKSFLSIKEYKFGTRTGGNFTGELTKQELVYTNQWVNENITLANGYISADSRTVDNPADKILEQNSLENILFSQNNVVSSDENDITKQECK 2194 T 0.038 HTH_40 pdbpercent T Viruses T 6vrb 2 B A CS13A_LISSS CRISPR-ASSOCIATED ENDORIBONUCLEASE C2C2,ENDORNASE,LSEC2C2 TMRITKVEVDRKKVLISRDKNGGKLVYENEMQDNTEQIMHHKKSSFYKSVVNKTICRPEQKQMKKLVHGLLQENSQEKIKVSDVTKLNISNFLNHRFKKSLYYFPENSPDKSEEYRIEINLSQLLEDSLKKQQGTFICWESFSKDMELYINWAENYISSKTKLIKKSIRNNRIQSTESRSGQLMDRYMKDILNKNKPFDIQSVSEKYQLEKLTSALKATFKEAKKNDKEINYKLKSTLQNHERQIIEELKENSELNQFNIEIRKHLETYFPIKKTNRKVGDIRNLEIGEIQKIVNHRLKNKIVQRILQEGKLASYEIESTVNSNSLQKIKIEEAFALKFINACLFASNNLRNMVYPVCKKDILMIGEFKNSFKEIKHKKFIRQWSQFFSQEITVDDIELASWGLRGAIAPIRNEIIHLKKHSWKKFFNNPTFKVKKSKIINGKTKDVTSEFLYKETLFKDYFYSELDSVPELIINKMESSKILDYYSSDQLNQVFTIPNFELSLLTSAVPFAPSFKRVYLKGFDYQNQDEAQPDYNLKLNIYNEKAFNSEAFQAQYSLFKMVYYQVFLPQFTTNNDLFKSSVDFILTLNKERKGYAKAFQDIRKMNKDEKPSEYMSYIQSQLMLYQKKQEEKEKINHFEKFINQVFIKGFNSFIEKNRLTYICHPTKNTVPENDNIEIPFHTDMDDSNIAFWLMCKLLDAKQLSELRNEMIKFSCSLQSTEEISTFTKAREVIGLALLNGEKGCNDWKELFDDKEAWKKNMSLYVSEELLQSLPYTQEDGQTPVINRSIDLVKKYGTETILEKLFSSSDDYKVSAKDIAKLHEYDVTEKIAQQESLHKQWIEKPGLARDSAWTKKYQNVINDISNYQWAKTKVELTQVRHLHQLTIDLLSRLAGYMSIADRDFQFSSNYILERENSEYRVTSWILLSENKNKNKYNDYELYNLKNASIKVSSKNDPQLKVDLKQLRLTLEYLELFDNRLKEKRNNISHFNYLNGQLGNSILELFDDARDVLSYDRKLKNAVSKSLKEILSSHGMEVTFKPLYQTNHHLKIDKLQPKKIHHLGEKSTVSSNQVSNEYCQLVRTLLTMK 1087 T 0.51 MbeB_N unp F Bacteria T 6vrb 3 C C AcrVIA1 MIYYIKDLKVKGKIFENLMNKEAVEGLITFLKKAEFEIYSRENYSKYNKWFEMWKSPTSSLVFWKNYSFRCHLLFVIEKDGECLGIPASVFESVLQIYLADPFAPDTKELFVEVCNLYECLADVTVVEHFEAEESAWHKLTHNETEVSKRVYSKDDDELLKYIPEFLDTIATNKKSQKYNQIQGKIQEINKEIATLYESSEDYIFTEYVSNLYRESAKLEQHSKQILKE 229 T 0.32 DUF5377 pdb F T 6vrc 1 A A CS13A_LISSS CRISPR-ASSOCIATED ENDORIBONUCLEASE C2C2,ENDORNASE,LSEC2C2 MWISIKTLIHHLGVLFFCDYMYNRREKKIIEVKTMRITKVEVDRKKVLISRDKNGGKLVYENEMQDNTEQIMHHKKSSFYKSVVNKTICRPEQKQMKKLVHGLLQENSQEKIKVSDVTKLNISNFLNHRFKKSLYYFPENSPDKSEEYRIEINLSQLLEDSLKKQQGTFICWESFSKDMELYINWAENYISSKTKLIKKSIRNNRIQSTESRSGQLMDRYMKDILNKNKPFDIQSVSEKYQLEKLTSALKATFKEAKKNDKEINYKLKSTLQNHERQIIEELKENSELNQFNIEIRKHLETYFPIKKTNRKVGDIRNLEIGEIQKIVNHRLKNKIVQRILQEGKLASYEIESTVNSNSLQKIKIEEAFALKFINACLFASNNLRNMVYPVCKKDILMIGEFKNSFKEIKHKKFIRQWSQFFSQEITVDDIELASWGLRGAIAPIRNEIIHLKKHSWKKFFNNPTFKVKKSKIINGKTKDVTSEFLYKETLFKDYFYSELDSVPELIINKMESSKILDYYSSDQLNQVFTIPNFELSLLTSAVPFAPSFKRVYLKGFDYQNQDEAQPDYNLKLNIYNEKAFNSEAFQAQYSLFKMVYYQVFLPQFTTNNDLFKSSVDFILTLNKERKGYAKAFQDIRKMNKDEKPSEYMSYIQSQLMLYQKKQEEKEKINHFEKFINQVFIKGFNSFIEKNRLTYICHPTKNTVPENDNIEIPFHTDMDDSNIAFWLMCKLLDAKQLSELRNEMIKFSCSLQSTEEISTFTKAREVIGLALLNGEKGCNDWKELFDDKEAWKKNMSLYVSEELLQSLPYTQEDGQTPVINRSIDLVKKYGTETILEKLFSSSDDYKVSAKDIAKLHEYDVTEKIAQQESLHKQWIEKPGLARDSAWTKKYQNVINDISNYQWAKTKVELTQVRHLHQLTIDLLSRLAGYMSIADRDFQFSSNYILERENSEYRVTSWILLSENKNKNKYNDYELYNLKNASIKVSSKNDPQLKVDLKQLRLTLEYLELFDNRLKEKRNNISHFNYLNGQLGNSILELFDDARDVLSYDRKLKNAVSKSLKEILSSHGMEVTFKPLYQTNHHLKIDKLQPKKIHHLGEKSTVSSNQVSNEYCQLVRTLLTMKHHHHHH 1126 T 0.12 FRB_dom pdbpercent F Bacteria T 6vro 2 B B CRBG1_HUMAN ABSENT IN MELANOMA 1 PROTEIN KRKKARMPNSPAPHFAMPPIHEDHLE 26 T 11 DUF3320 pdbhh F Eukaryota T 6vrw 1 A,C,E G,A,D A0A0N9FF17_9HIV1 Envelope glycoprotein gp120 GLWVTVYYGVPVWREAKTTLFCASDAKSYEKEVHNVWATHACVPTDPNPQELVLENVTENFNMWKNDMVDQMHEDIISLWDQSLKPCVKLTPLCVTLNCSDAKVNATYKGTREEIKNCSFNATTELRDKKRREYALFYRLDIVPLSGEGNNNSEYRLINCNTSVITQICPKVTFDPIPIHYCAPAGYAILKCNNKTFNGTGPCNNVSTVQCTHGIKPVVSTQLLLNGSLAEEEIIIRSENLTDNVKTIIVHLNESVEITCTRPNNMTRKSVRIGPGQTFYALGDIIGDIRQPHCNISEIKWEKTLQRVSEKLREHFNKTIIFNQSSGGDLEITTHSFNCGGEFFYCNTSDLFFNKTFNETYSTGSNSTNSTITLPCRIKQIINMWQEVGRAMYAPPIAGNITCKSNITGLLLTRDGGGNNSTKETFRPGGGNMRDNWRSELYKYKVVEVKPLGIAPTECNRTVVQRRRRRR 471 T 3.2999999999999996E-53 GP120 pdbpssm T Viruses T 6vtw 1 A A S4_2.45 CSVVVGENYSIKCDATKCTIEDKNRGIIKTVTGSRCEELAKAVQKAQ 47 T 6.3 DUF2997 pdbhh F T 6vu4 1 A,B C,A APP,ABPP,APPI,ALZHEIMER DISEASE AMYLOID PROTEIN,AMYLOID PRECURSOR PROTEIN,AMYLOID-BETA A4 PROTEIN,CEREBRAL VASCULAR AMYLOID PEPTIDE,CVAP,PREA4,PROTEASE NEXIN-II,PN-II HXKLVXFAEXAIIGLMV 17 T 0.0046 Beta-APP pdbhh F T 6vw9 2 B B S6FCX2_CAEEL K+/Cl-Cotransporter SKMHTAVRLNELLLQHSANSQLILLNLPKPPVHKDQQALDDYVHYLEVMTDKLNRVIFVRGTGKEVITESS 71 F F Eukaryota T 6vxy 2 B C SFTI1_HELAN SFTI1 inhibitor GLY-ARG-GLY-THR-LYS-SER-ILE-PRO-PRO-ILE-ALA-PHE-PRO-ASP GRGTKSIPPIAFPD 14 T 1 Antimicrobial23 pdbhh F Eukaryota T 6vy2 1 A,C,E A,C,E A0A0A7I3C6_9HIV1 SURFACE PROTEIN GP120, SU, GP120 MPMGSLQPLATLYLLGMLVASVLAAENLWVTVYYGVPVWKEAKTTLFCASDAKAYEKKVHNVWATHACVPTDPNPQEMVLKNVTENFNMWKNDMVDQMHEDVISLWDQSLKPCVKLTPLCVTLNCTNATASNSSIIEGMKNCSFNITTELRDKREKKNALFYKLDIVQLDGNSSQYRLINCNTSVITQACPKVSFDPIPIHYCAPAGYAILKCNNKTFTGTGPCNNVSTVQCTHGIKPVVSTQLLLNGSLAEGEIIIRSENITNNVKTIIVHLNESVKIECTRPNNKTRTSIRIGPGQWFYATGQVIGDIREAYCNINESKWNETLQRVSKKLKEYFPHKNITFQPSSGGDLEITTHSFNCGGEFFYCNTSSLFNRTYMANSTDMANSTETNSTRTITIHCRIKQIINMWQEVGRAMYAPPIAGNITCISNITGLLLTRDGGKNNTETFRPGGGNMKDNWRSELYKYKVVKIEPLGVAPTRCKRRVV 487 T 1.2E-52 GP120 pdbpssm T Viruses T 6vzi 4 D G A0A0N9FF17_9HIV1 ENV POLYPROTEIN GLWVTVYYGVPVWREAKTTLFCASDAKSYEKEVHNVWATHACVPTDPNPQELVLENVTENFNMWKNDMVDQMHEDIISLWDQSLKPCVKLTPLCVTLNCSDAKVNATYKGTREEIKNCSFNATTELRDKKRREYALFYRLDIVPLSGEGNNNSEYRLINCNTSVITQICPKVTFDPIPIHYCAPAGYAILKCNNKTFNGTGPCNNVSTVQCTHGIKPVVSTQLLLNGSLAEEEIIIRSENLTDNVKTIIVHLNESVEITCTRPNNMTRKSVRIGPGQTFYALGDIIGDIRQPHCNISEIKWEKTLQRVSEKLREHFNKTIIFNQSSGGDLEITTHSFNCGGEFFYCNTSDLFFNKTFNETYSTGSNSTNSTITLPCRIKQIINMWQEVGRAMYAPPIAGNITCKSNITGLLLTRDGGGNNSTKETFRPGGGNMRDNWRSELYKYKVVEVKPLGIAPTECRRRVVQRRRRRR 471 T 3.2999999999999996E-53 GP120 pdbpssm T Viruses T 6w03 4 D G Q2N0S6_9HIV1 ENV POLYPROTEIN AENLWVTVYYGVPVWKDAETTLFCASDAKAYETEKHNVWATHACVPTDPNPQEIHLENVTEEFNMWKNNMVEQMHTDIISLWDQSLKPCVKLTPLCVTLQCTNVTNAITDDMRGELKNCSFNMTTELRDKKQKVYSLFYRLDVVQINENQGNRSNNSNKEYRLINCNTSACTQACPKVSFEPIPIHYCAPAGFAILKCKDKKFNGTGPCPSVSTVQCTHGIKPVVSTQLLLNGSLAEEEVMIRSENITNNAKNILVQFNTPVQINCTRPNNMTRKSIRIGPGQAFYALGDIIGDIRQPHCNVSKATWNETLGKVVKQLRKHFGNNTIIRFANSSGGDLEVTTHSFNCGGEFFYCNTSGLFNSTWISNTSVQGSNSTGSNDSITLPCRIKQIINMWQRIGQCMYAPPIQGVIRCVSNITGLILTRDGGSTNSTTETFRPGGGDMRDNWRSELYKYKVVKIEPLGVAPTRCKRRVVGRRRRRR 481 T 3.4E-54 GP120 pdbpercent T Viruses T 6w0l 2 B P W_NIPAV Phosphorylated W peptide ARVSMRRMSN 10 T 1.3 Paramyxo_P_V_N unphh T Viruses T 6w0v 1 A A A0A1P8L021_PSEAI PYS8 DEPGVATGNGQPVTGNWLAGASQGDGVPIPSQIADQLRGKEFKSWRDFREQFWVAVANDPELVKYFRKTNAKGMRDGLSPFTPKAEQAGGRDKYAIHHVVQISQGGAVYDIDNLRVMTPKMHIQV 125 T 0.0052 HNH pdbpssm F Bacteria T 6w1x 6 K,L I,J C0AVY5_9GAMM anti-CRISPR AcrIF9 MKSTYIIKEVQNINSDREGVKVETTSLTSAKRIASKNQFFHGTVLRIESESGNWLAYKEDGKRWIECE 68 T 0.15 SurA_N pdb F Bacteria T 6w25 2 B B SHU9119 XXDHXRWKX 9 T 23 ACTH_domain pdbhh F T 6w2r 1 A,B,C,D A,B,C,D Junction 19 DHR54-DHR79 MGTTEDERRELEKVARKAIEAAREGNTDEVREQLQRALEIARESGTKTAVKLALDVALRVAQEAAKRGNKDAIDEAAEVVVRIAEESNNSDALEQALRVLEEIAKAVLKSEKTEDAKKAVKLVQEAYKAAQRAIEAAKRTGTPDVIKLAIKLAKLAARAALEVIKRPKSEEVNEALKKIVKAIQEAVESLREAEESGDPEKREKARERVREAVERAEEVQRDPSSGWLEHHHHHH 235 T 0.0008 SPO22 pdb F T 6w3j 3 C C CE192_HUMAN CEP192/SPD-2 IDDEMFYDDHLEAYFEQLAIPG 22 T 3.2 SUB1_ProdP9 pdbhh F Eukaryota T 6w5c 1 A A Cas12i MSNKEKNASETRKAYTTKMIPRSHDRMKLLGNFMDYLMDGTPIFFELWNQFGGGIDRDIISGTANKDKISDDLLLAVNWFKVMPINSKPQGVSPSNLANLFQQYSGSEPDIQAQEYFASNFDTEKHQWKDMRVEYERLLAELQLSRSDMHHDLKLMYKEKCIGLSLSTAHYITSVMFGTGAKNNRQTKHQFYSKVIQLLEESTQINSVEQLASIILKAGDCDSYRKLRIRCSRKGATPSILKIVQDYELGTNHDDEVNVPSLIANLKEKLGRFEYECEWKCMEKIKAFLASKVGPYYLGSYSAMLENALSPIKGMTTKNCKFVLKQIDAKNDIKYENEPFGKIVEGFFDSPYFESDTNVKWVLHPHHIGESNIKTLWEDLNAIHSKYEEDIASLSEDKKEKRIKVYQGDVCQTINTYCEEVGKEAKTPLVQLLRYLYSRKDDIAVDKIIDGITFLSKKHKVEKQKINPVIQKYPSFNFGNNSKLLGKIISPKDKLKHNLKCNRNQVDNYIWIEIKVLNTKTMRWEKHHYALSSTRFLEEVYYPATSENPPDALAARFRTKTNGYEGKPALSAEQIEQIRSAPVGLRKVKKRQMRLEAARQQNLLPRYTWGKDFNINICKRGNNFEVTLATKVKKKKEKNYKVVLGYDANIVRKNTYAAIEAHANGDGVIDYNDLPVKPIESGFVTVESQVRDKSYDQLSYNGVKLLYCKPHVESRRSFLEKYNGTMKDNRGNNIQIDFMKDFEAIADDETSLYYFNMKYCKLLQSSIRNHSSQAKEYREEIFELLRDGKLSVLKLSSLSNLSFVMFKVAKSLIGTYFGHLLKKPKNSKSDVKAPPITDEDKQKADPEMFALRLALEEKRLNKVKSKKEVIANKIVAKALELRDKYGPVLIKGENISDTTKKGKKSSTNSFLMDWLARGVANKVKEMVMMHQGLEFVEVNPNFTSHQDPFVHKNPENTFRARYSRCTPSELTEKNRKEILSFLSDKPSKRPTNAYYNEGAMAFLATYGLKKNDVLGVSLEKFKQIMANILHQRSEDQLLFPSRGGMFYLATYKLDADATSVNWNGKQFWVCNADLVAAYNVGLVDIQKDFKKK 1092 T 0.37 DUF1910 pdbpercent F T 6w62 1 A A Cas12i MSNKEKNASETRKAYTTKMIPRSHDRMKLLGNFMDYLMDGTPIFFELWNQFGGGIDRDIISGTANKDKISDDLLLAVNWFKVMPINSKPQGVSPSNLANLFQQYSGSEPDIQAQEYFASNFDTEKHQWKDMRVEYERLLAELQLSRSDMHHDLKLMYKEKCIGLSLSTAHYITSVMFGTGAKNNRQTKHQFYSKVIQLLEESTQINSVEQLASIILKAGDCDSYRKLRIRCSRKGATPSILKIVQDYELGTNHDDEVNVPSLIANLKEKLGRFEYECEWKCMEKIKAFLASKVGPYYLGSYSAMLENALSPIKGMTTKNCKFVLKQIDAKNDIKYENEPFGKIVEGFFDSPYFESDTNVKWVLHPHHIGESNIKTLWEDLNAIHSKYEEDIASLSEDKKEKRIKVYQGDVCQTINTYCEEVGKEAKTPLVQLLRYLYSRKDDIAVDKIIDGITFLSKKHKVEKQKINPVIQKYPSFNFGNNSKLLGKIISPKDKLKHNLKCNRNQVDNYIWIEIKVLNTKTMRWEKHHYALSSTRFLEEVYYPATSENPPDALAARFRTKTNGYEGKPALSAEQIEQIRSAPVGLRKVKKRQMRLEAARQQNLLPRYTWGKDFNINICKRGNNFEVTLATKVKKKKEKNYKVVLGYDANIVRKNTYAAIEAHANGDGVIDYNDLPVKPIESGFVTVESQVRDKSYDQLSYNGVKLLYCKPHVESRRSFLEKYRNGTMKDNRGNNIQIDFMKDFEAIADDETSLYYFNMKYCKLLQSSIRNHSSQAKEYREEIFELLRDGKLSVLKLSSLSNLSFVMFKVAKSLIGTYFGHLLKKPKNSKSDVKAPPITDEDKQKADPEMFALRLALEEKRLNKVKSKKEVIANKIVAKALELRDKYGPVLIKGENISDTTKKGKKSSTNSFLMDWLARGVANKVKEMVMMHQGLEFVEVNPNFTSHQDPFVHKNPENTFRARYSRCTPSELTEKNRKEILSFLSDKPSKRPTNAYYNEGAMAFLATYGLKKNDVLGVSLEKFKQIMANILHQRSEDQLLFPSRGGMFYLATYKLDADATSVNWNGKQFWVCNADLVAAYNVGLVDIQKDFKKK 1093 T 0.37 DUF1910 pdbpercent F T 6w6x 1 A,B A,B De novo designed ABLE protein SVKSEYAEAAAVGQEAVAVFNTMKAAFQNGDKEAVAQYLARLASLYTRHEELLNRILEKARREGNKEAVTLMNEFTATFQTGKSIFNAMVAAFKNGDDDSFESYLQALEKVTAKGETLADQIAKAL 126 T 0.028 ASD2 pdb F T 6w8u 1 A,AA,B,BA,C,CA,D,DA,E,EA,F,FA,G,GA,H,HA,I,IA,J,JA,K,KA,L,LA,M,MA,N,NA,O,OA,P,Q,R,S,T,U,V,W,X,Y,Z A,a,B,b,C,c,D,d,E,e,F,f,G,g,H,h,I,i,J,j,K,k,L,l,M,m,N,n,O,o,P,Q,R,S,T,U,V,W,X,Y,Z A4WH64_PYRAR pilin MTSLEIAIIVAIVLVIAIAVGWYLYTTFAAAGQQTGLTATKATIYVTKDGNVYLNVTLVPQGAAQVAISSIEVAGVSIPCTSSNLVKAPGEYVIELSSVSVSVGQVLTGRIVLASGAISPFTATVVAADHVPSTENKLCSSQ 142 T 1.8E-05 DUF973 unphh F Archaea T 6w9m 2 B B NR0B2_HUMAN ORPHAN NUCLEAR RECEPTOR SHP, SMALL HETERODIMER PARTNER RPAILYALLSS 11 T 7.3 GTA_holin_3TM pdbhh F Eukaryota T 6w9s 1 A A B1EF49_ESCAT OTU domain-containing protein EschOTU SSPQSVFSDSVSSSRLELKKQIIKALDLDYWQGSGGEIMPLVLIDFYKRHNININIYLNHCKVNNFDKKAINLINAGNHYNALTMNSRGNIERIDVPGDGNCLYHAVVKSHQITRKPKPYGNELQKDKPEWCILKESLKTHFDKDFDQFVEQVKCILISENTHEANKILDKVAQYSGVK 179 T 0.0021 OTU pdbhh F Bacteria T 6w9y 1 A,B,C,D A,B,C,D De novo designed receptor transmembrane domain proMP 1.2 EPELLFILVAILGGLFGAIVAFLLALRRLX 30 T 0.08 DUF1294 pdb F T 6w9z 1 A,B,C,D,E,F A,B,C,D,E,F De novo designed receptor transmembrane domain ProMP C2.1 EPELTVALILGIFLGTFIAFWVVYLLRRLX 30 T 0.12 YtxH pdbhh F T 6wa0 1 A,B,C,D,E,F,G,H,I A,B,C,D,E,F,G,H,I De novo designed receptor transmembrane domain proMP C3.1 EPETALLVAFVAYYTALIALIFAILATRRLX 31 T 7.1 MSA_2 pdbhh F T 6wb3 1 A,B A,B SEPT4_HUMAN APOPTOSIS-RELATED PROTEIN IN THE TGF-BETA SIGNALING PATHWAY,ARTS,BRADEION BETA,BRAIN PROTEIN H5,CE5B3 BETA,CELL DIVISION CONTROL-RELATED PROTEIN 2,HCDCREL-2,CEREBRAL PROTEIN 7,PEANUT-LIKE PROTEIN 2 XETEKLIREKDEELRRMQEMLHKIQKQMKENX 32 T 0.097 AAA_23 unp F Eukaryota T 6wb9 1 A 0 EMC10_YEAST Endoplasmic reticulum membrane protein complex subunit 10 MLVRLLRVILLASMVFCADILQLSYSDDAKDAIPLGTFEIDSTSDGNVTVTTVNIQDVEVSGEYCLNAQIEGKLDMPCFSYMKLRTPLKYDLIVDVDEDNEVKQVSLSYDETNDAITATVRYPEAGPTAPVTKLKKKTKTYADKKASKNKDGSTAQFEEDEEVKEVSWFQKNWKMLLLGLLIYNFVAGSAKKQQQGGAGADQKTE 205 T 0.039 PFU pdb F Eukaryota T 6wba 2 B,C C,B MLH1_HUMAN MUTL PROTEIN HOMOLOG 1 SSNPAKRHRED 11 T 2.9 T_cell_tran_alt pdbhh F Eukaryota T 6wbb 2 B,C C,B MLH1_HUMAN MUTL PROTEIN HOMOLOG 1 SSNPRKRHRAD 11 T 8.6 DUF5592 pdbhh F Eukaryota T 6wbc 2 B B MLH1_HUMAN MUTL PROTEIN HOMOLOG 1 SSNPRKKHRED 11 T 2.3 T_cell_tran_alt pdbhh F Eukaryota T 6wbe 1 A,B,C,D A,B,C,D SEPT1_HUMAN LARP,PEANUT-LIKE PROTEIN 3,SEROLOGICALLY DEFINED BREAST CANCER ANTIGEN NY-BR-24 DTEKLIREKDEELRRMQEMLEKMQAQMQQS 30 T 0.04 HHV-5_US34A pdb F Eukaryota T 6wc3 2 B B Q753G8_ASHGO AFR344CP MGSMPYATQLALLQDELLDMLEPRDGEGLRTADIIDKTLRFRELLGCYRLQVEKSTRQLELPRQVRTAAALRGAHAPASQAPALAQLLLWERFLADYRRRLDAAIVHEHEATAARQLQARPTAARAPRAPMTAKDRLLA 139 T 0.12 MRP-S31 unppercent F Eukaryota T 6wc6 3 C C KDM1A_HUMAN LYSINE-SPECIFIC HISTONE DEMETHYLASE 1A SEEERNAKAEKEKKL 15 T 3 NDUF_B8 pdbhh F Eukaryota T 6wcu 1 A,B A,B SEPT5_HUMAN CELL DIVISION CONTROL-RELATED PROTEIN 1,CDCREL-1,PEANUT-LIKE PROTEIN 1 ETEKLIRMKDEELRRMQEMLQRMKQQMQDQ 30 T 0.046 IL32 pdbpssm F Eukaryota T 6wdp 1 A A I12R1_HUMAN IL-12RB1,IL-12 RECEPTOR BETA COMPONENT SECCFQDPPYPDADSGSASGPRDLRCYRISSDRYECSWQYEGPTAGVSHFLRCCLSSGRCCYFAAGSATRLQFSDQAGVSVLYTVTLWVESWARNQTEKSPEVTLQLYNSVKYEPPLGDIKVSKLAGQLRMEWETPDNQVGAEVQFRHRTPSSPWKLGDCGPQDDDTESCLCPLEMNVAQEFQLRRRQLGSQGSSWSKWSSPVCVPPENPPQPHHHHHH 219 T 6.2E-05 LIFR_D2 pdbhh F Eukaryota T 6wdr 1 A A RSSA1_YEAST NUCLEIC ACID-BINDING PROTEIN NAB1A,SMALL RIBOSOMAL SUBUNIT PROTEIN US2-A SLPATFDLTPEDAQLLLAANTHLGARNVQVHQEPYVFNARPDGVHVINVGKTWEKLVLAARIIAAIPNPEDVVAISSRTFGQRAVLKFAAHTGATPIAGRFTPGSFTNYITRSFKEPRLVIVTDPRSDAQAIKEASYVNIPVIALTDLDSPSEFVDVAIPCNNRGKHSIGLIWYLLAREVLRLRGALVDRTQPWSIMPDLYFYRFP 206 T 2.9E-13 Ribosomal_S2 pdb F Eukaryota T 6weg 3 E P Q5NHR4_FRATT Peptide KRNVFSRCWINMNLYSVIKAKS 22 T 0.92 XTBD pdbhh F Bacteria T 6wes 1 A A C5IAW5_PHANO Tox3 YIKANDINFGTRSVHDCRERTGIQRDVKVRADIPFETDDGPNQVLRVTWSNALNVDRFDPLPIVTVPGNAASTTITAIHDFCLMNPTTSPPTRCLYQLRQPFTLGFDRTRMHNNIYLTPPNPQRPTMHEVCIRADECPAGRVFLECSTRTYGAIPRGE 158 T 0.077 DUF6286 unppercent F Eukaryota T 6wgg 1 A 8 CDT1_YEAST SIC1 INDISPENSABLE PROTEIN 2,TOPOISOMERASE-A HYPERSENSITIVE PROTEIN 11 MSGTANSRRKEVLRVPVIDLNRVSDEEQLLPVVRAILLQHDTFLLKNYANKAVLDALLAGLTTKDLPDTSQGFDANFTGTLPLEDDVWLEQYIFDTDPQLRFDRKCRNESLCSIYSRLFKLGLFFAQLCVKSVVSSAELQDCISTSHYATKLTRYFNDNGSTHDGADAGATVLPTGDDFQYLFERDYVTFLPTGVLTIFPCAKAIRYKPSTMATTDNSWVSIDEPDCLLFHTGTLLARWSQGMHTTSPLQIDPRANIVSLTIWPPLTTPISSKGEGTIANHLLEQQIKAFPKVAQQYYPRELSILRLQDAMKFVKELFTVCETVLSLNALSRSTGVPPELHVLLPQISSMMKRKIVQDDILKLLTIWSDAYVVELNSRGELTMNLPKRDNLTTLTNKSRTLAFVERAESWYQQVIASKDEIMTDVPAFKINKRRSSSNSKTVLSSKVQTKSSNANALNNSRYLANSKENFMYKEKMPDSQANLMDRLRERERRSAALLSQRQKRYQQFLAMKMTQVFDILFSLTRGQPYTETYLSSLIVDSLQDSNNPIGTKEASEILAGLQGILPMDISVHQVDGGLKVYRWNSLDKNRFSKLLQIHKSKQQD 604 T 1.3E-07 CDT1 pdbhh F Eukaryota T 6wgn 2 D,E,F E,F,G Cyclic Peptide KD2 GXFVNFRNFRTFRCG 15 T 9.3 Bac_DNA_binding pdbhh F T 6wh3 1 A,AA,AB,B,BA,BB,C,CA,CB,D,DA,DB,E,EA,EB,F,FA,FB,G,GA,GB,H,HA,HB,I,IA,J,JA,K,KA,L,LA,M,MA,N,NA,O,OA,P,PA,Q,QA,R,RA,S,SA,T,TA,U,UA,V,VA,W,WA,X,XA,Y,YA,Z,ZA A,8,u,B,7,v,C,6,w,D,5,x,E,4,y,F,3,z,G,a,1,H,b,2,I,c,J,d,K,e,L,f,M,g,N,h,O,i,P,j,Q,k,R,l,S,m,T,n,U,o,V,p,W,q,X,r,Y,s,Z,t Penaeus monodon metallodensovirus major capsid protein MSDEVSSSTDVVSRKRRRHDEGGKALEDIAVHGASEGDGSAPGGSVWQTTDYIALSMVVYRTAIKLRNFVNIRGLTPTEMIVIPWNVMRFYCEYNTGTYGLSGNVHHKNYSMLLACKAHRPTKVGYTLSNLILTSDELVSTGGTLGTTTTFNTSPYMIHSIDDQQCLSKVYPKTDTVWPVSSMRELDYVASTVSGDNAIIPSTIFNKNRYWKQGDDALHFSHDLDLGFWFGSDYGNAYVPQNNDSMNAVGTIPTSKHINVRGVNNRGMAGHYLSFPPIRTNDGQFKLNAQFTLETEIEFEFRLWEQGVQGINSVHTNLNPANDSLWIQSYGSLVSITESKINNIQFGPTCPRVDARNKGGKMSMLFDHH 369 T 29 DUF4752 pdbhh F T 6whn 2 D,E,F F,G,H U2M-ASN-PRO-LYS-GLN-DLY-TRP-GLY peptide macrocycle XNPKQXWG 8 T 1.7 Pox_A28 pdbhh F T 6who 2 D,E,F F,G,H U2M-ASN-PRO-GLU-GLN-DLY-TRP-GLY peptide macrocycle XNPEQXWG 8 T 3.1 DUF2340 pdbhh F T 6win 1 A A Q8Z969_SALTI Type 6 secretion amidase effector 2 APYVYANAKALQDTEKVGNHHQCVELIQHYIRVGQASTWQQGAAVFGNKNIEVGTVIATFVNGRYPNHNSGNHAAFFLGQDTGGIWVMDQWKDDIAKPRVSKRYIRKLHNGSVRSDGTYIRMSNNAEAYFIVELEHHHHH 140 T 0.95 GATA unppssm F Bacteria T 6wj2 8 H H S38A9_HUMAN SOLUTE CARRIER FAMILY 38 MEMBER 9,UP-REGULATED IN LUNG CANCER 11 GGTMANMNSDSRHLGTSEVDHERDPGPMNIQFEPSDLRSKRPFCIEPTNIVNVNHVIQRVSDHASAMNKRIHYYSRLTTPADKALIAPDHVVPAPEECYVYSPLGSAYKLQSYTEGYGKNTS 122 T 37 CoV_NSP4_C pdbhh F Eukaryota T 6wjq 2 C,D C,D PDPK1_HUMAN HPDK1 XARTTSQLYDAVPIQSX 17 T 3.7 HHA pdbhh F Eukaryota T 6wkk 2 G,H,I,J,K,L,M,N,O,P,Q,R,S,T,U,V,W,X G,H,I,J,K,L,M,N,O,P,Q,R,S,T,U,V,W,X G3MB96_9CAUD Gp26 capsid decoration protein PYVRLGYEGILNGAHDIDVAGLNGVEQLAGKFATIGANGVKLAGDNGTNAVGLFREDLGDMVNASEKASFYFRGGEYYVNISRTSLTAAGIAAGDEITCDADGKMIKFTGTGKALGVVTHVGEYRAGNMYEKATQGVTDTDTFIGFIMYV 150 T 0.082 DUF4265 pdb T Viruses T 6wkr 6 F,R B,E JARD2_HUMAN JUMONJI/ARID DOMAIN-CONTAINING PROTEIN 2 MSKERPKRNIIQKKYDDSDGIPWSEERVVRKVLYLSLKEFKNSQKRQHAEGIAGSLKTVNGLLGNDQSKGLGPASEQSENEKDDASQVSSTSNDVSSSDFEEGPSRKRPRLQAQRKFAQSQPNSPSTTPVKIVEPLLPPPATQISDLSKRKPKTEDFLTFLCLRGSPALPNSMVYFGSSQDEEEVEEEDDETEDVKTATNNASSSCQSTPRKGKTHKHVHNGHVFNGSSRSTREKEPVQKHKSKEATPAKEKHSDHRADSRREQASANHPAAAPSTGSSAKGLAATHHHPPLHRSAQDLRKQVSKVNGVTRMSSLGAGVTSAKKMREVRPSPSKTVKYTATVTKGAVTYTKAKRELVKDTKPNHHKPSSAVNHTISGKTESSNAKTRKQVLSLGGASKSTGPAVNGLKVSGRLNPKSCTKEVGGRQLREGLQLREGLRNSKRRLEEAHQA 450 T 0.11 Actin_micro pdb F Eukaryota T 6wkx 1 A,AA,AB,AC,AD,B,BA,BB,BC,C,CA,CB,CC,D,DA,DB,DC,E,EA,EB,EC,F,FA,FB,FC,G,GA,GB,GC,H,HA,HB,HC,I,IA,IB,IC,J,JA,JB,JC,K,KA,KB,KC,L,LA,LB,LC,M,MA,MB,MC,N,NA,NB,NC,O,OA,OB,OC,P,PA,PB,PC,Q,QA,QB,QC,R,RA,RB,RC,S,SA,SB,SC,T,TA,TB,TC,U,UA,UB,UC,V,VA,VB,VC,W,WA,WB,WC,X,XA,XB,XC,Y,YA,YB,YC,Z,ZA,ZB,ZC A,p,AA,VA,qA,V,q,BA,WA,W,r,CA,Q,X,s,L,XA,Y,G,DA,YA,B,t,EA,ZA,Z,u,FA,aA,a,v,GA,R,b,w,M,bA,c,H,HA,cA,C,x,IA,dA,d,y,JA,eA,e,z,KA,S,f,0,N,fA,g,I,LA,gA,D,1,MA,hA,h,2,NA,iA,i,3,OA,T,j,4,O,jA,k,J,PA,kA,E,5,QA,lA,l,6,RA,mA,m,7,SA,U,n,8,P,nA,o,K,TA,oA,F,9,UA,pA peptide 15-10-3 QAEILRAYARILEAQ 15 T 2.7 Inhibitor_I10 pdbhh F T 6wky 1 A,AA,B,BA,C,CA,D,DA,E,EA,F,FA,G,GA,H,HA,I,IA,J,JA,K,KA,L,LA,M,MA,N,NA,O,P,Q,R,S,T,U,V,W,X,Y,Z B,g,C,h,D,i,A,j,E,k,F,l,G,m,H,n,I,o,J,p,K,q,L,r,M,s,N,t,O,P,Q,R,S,T,a,b,c,d,e,f peptide 29-24-3 QAEILRAYARILEADAKILEAHAEILKAQ 29 T 20 Rad33 pdbhh F T 6wl0 1 A,AA,B,BA,C,CA,D,DA,E,EA,F,FA,G,GA,H,HA,I,IA,J,JA,K,KA,L,LA,M,MA,N,NA,O,OA,P,Q,R,S,T,U,V,W,X,Y,Z A,a,B,b,C,c,D,d,E,e,F,f,G,g,H,h,I,i,J,j,K,k,L,l,M,m,N,n,O,o,P,Q,R,S,T,U,V,W,X,Y,Z peptide 36-31-3-RD TLEELRAEARILEAKAEILKAKAEVLKAKAEILKAQ 36 T 7.2 DUF6327 pdbhh F T 6wl1 1 A,AA,B,BA,C,CA,D,DA,E,EA,F,FA,G,GA,H,HA,I,IA,J,JA,K,KA,L,LA,M,MA,N,NA,O,OA,P,PA,Q,QA,R,RA,S,SA,T,TA,U,UA,V,VA,W,WA,X,XA,Y,YA,Z,ZA A,a,B,b,C,c,D,d,E,e,F,f,G,g,H,h,I,i,J,j,K,k,L,l,M,m,N,n,O,o,P,p,Q,q,R,r,S,s,T,t,U,u,V,v,W,w,X,x,Y,y,Z,z peptide 36-31-3 QAEILRAYARILEADAEILKAQAKILEAHAEILKAQ 36 T 0.11 RnlB_antitoxin pdb F T 6wl7 1 A,AA,AB,AC,AD,AE,B,BA,BB,BC,BD,BE,C,CA,CB,CC,CD,CE,D,DA,DB,DC,DD,DE,E,EA,EB,EC,ED,EE,F,FA,FB,FC,FD,FE,G,GA,GB,GC,GD,GE,H,HA,HB,HC,HD,HE,I,IA,IB,IC,ID,IE,J,JA,JB,JC,JD,JE,K,KA,KB,KC,KD,KE,L,LA,LB,LC,LD,LE,M,MA,MB,MC,MD,ME,N,NA,NB,NC,ND,NE,O,OA,OB,OC,OD,OE,P,PA,PB,PC,PD,PE,Q,QA,QB,QC,QD,QE,R,RA,RB,RC,RD,RE,S,SA,SB,SC,SD,SE,T,TA,TB,TC,TD,TE,U,UA,UB,UC,UD,V,VA,VB,VC,VD,W,WA,WB,WC,WD,X,XA,XB,XC,XD,Y,YA,YB,YC,YD,Z,ZA,ZB,ZC,ZD A,u,GA,N,xA,JB,Z,v,HA,cA,yA,KB,a,w,J,dA,zA,W,b,x,IA,eA,0A,LB,c,F,JA,fA,S,MB,d,y,KA,gA,1A,NB,B,z,LA,O,2A,OB,e,0,MA,hA,3A,PB,f,1,K,iA,4A,X,g,2,NA,jA,5A,QB,h,G,OA,kA,T,RB,i,3,PA,lA,6A,SB,C,4,QA,P,7A,TB,j,5,RA,mA,8A,UB,k,6,L,nA,9A,Y,l,7,SA,oA,AB,VB,m,H,TA,pA,U,WB,n,8,UA,qA,BB,XB,D,9,VA,Q,CB,YB,o,AA,WA,rA,DB,ZB,p,BA,M,sA,EB,q,CA,XA,tA,FB,r,I,YA,uA,V,s,DA,ZA,vA,GB,E,EA,aA,R,HB,t,FA,bA,wA,IB peptide 29-20-2 QAEILEADARILRAYAEILKAHAEILKAQ 29 T 7.3 DUF5799 pdbhh F T 6wl8 1 A,AA,AB,AC,AD,B,BA,BB,BC,BD,C,CA,CB,CC,D,DA,DB,DC,E,EA,EB,EC,F,FA,FB,FC,G,GA,GB,GC,H,HA,HB,HC,I,IA,IB,IC,J,JA,JB,JC,K,KA,KB,KC,L,LA,LB,LC,M,MA,MB,MC,N,NA,NB,NC,O,OA,OB,OC,P,PA,PB,PC,Q,QA,QB,QC,R,RA,RB,RC,S,SA,SB,SC,T,TA,TB,TC,U,UA,UB,UC,V,VA,VB,VC,W,WA,WB,WC,X,XA,XB,XC,Y,YA,YB,YC,Z,ZA,ZB,ZC A,N,a,n,1,0,EA,RA,eA,rA,B,O,b,o,2,FA,SA,fA,C,P,c,p,3,GA,TA,gA,D,Q,d,q,4,HA,UA,hA,E,R,e,r,5,IA,VA,iA,F,S,f,s,6,JA,WA,jA,G,T,g,t,7,KA,XA,kA,H,U,h,u,8,LA,YA,lA,I,V,i,v,9,MA,ZA,mA,J,W,j,w,AA,NA,aA,nA,K,X,k,x,BA,OA,bA,oA,L,Y,l,y,CA,PA,cA,pA,M,Z,m,z,DA,QA,dA,qA Form 2 peptide QAKILEADAEILKAYAKILEAHAEILKAQ 29 T 2.4 DUF5320 pdbhh F T 6wl9 1 A,AA,AB,AC,B,BA,BB,BC,C,CA,CB,CC,D,DA,DB,DC,E,EA,EB,EC,F,FA,FB,FC,G,GA,GB,GC,H,HA,HB,HC,I,IA,IB,IC,J,JA,JB,JC,K,KA,KB,KC,L,LA,LB,LC,M,MA,MB,MC,N,NA,NB,NC,O,OA,OB,OC,P,PA,PB,PC,Q,QA,QB,QC,R,RA,RB,RC,S,SA,SB,SC,T,TA,TB,TC,U,UA,UB,UC,V,VA,VB,VC,W,WA,WB,WC,X,XA,XB,XC,Y,YA,YB,YC,Z,ZA,ZB,ZC A,N,a,n,0,DA,QA,dA,B,O,b,o,1,EA,RA,eA,C,P,c,p,2,FA,SA,fA,D,Q,d,q,3,GA,TA,gA,E,R,e,r,4,HA,UA,hA,F,S,f,s,5,IA,VA,iA,G,T,g,t,6,JA,WA,jA,H,U,h,u,7,KA,XA,kA,I,V,i,v,8,LA,YA,lA,J,W,j,w,9,MA,ZA,mA,K,X,k,x,AA,NA,aA,nA,L,Y,l,y,BA,OA,bA,oA,M,Z,m,z,CA,PA,cA,pA peptide Form2a QAEILKADAEILKAYAKILEAHAEILKAQ 29 T 4.5 DUF5320 pdbhh F T 6wlg 1 A,B A,B INT3_HUMAN INT3,SOSS COMPLEX SUBUNIT A,SENSOR OF SINGLE-STRAND DNA COMPLEX SUBUNIT A,SENSOR OF SSDNA SUBUNIT A HPIKETVVEEPVDITPYLDQLDESLRDKVLQLQKGSDTEAQCEVMQEIVDQVLEEDFDSEQLSVLASCLQELFKAHFRGEVLPEEITEESLEESVGKPLYLIFRNLCQMQEDNSSFSLLLDLLSELYQKQPKIGYHLLYYLRASKAAAGKMNLYESFAQATQLGDLHTCLMMDMKACQEDDVRLLCHLTPSIYTEFPDETLRSGELLNMIVAVIDSAQLQELVCHVMMGNLVMFRKDSVLNILIQSLDWETFEQYCAWQLFLAHNIPLETIIPILQHLKYKEHPEALSCLLLQLRREKPSEEMVKMVLSRPCHPDDQFTTSILRHWCMKHDELLAEHIKSLLIKNNSLPRKRQSLRSSSSKLAQLTLEQILEHLDNLRLNLTNTKQNFFSQTPILQALQHVQASCDEAHKMKFSDLFSLAEEY 423 T 0.0011 IFRD pdbpssm F Eukaryota T 6wlw 6 N T RNK_HUMAN RNASE KAPPA MGWLRPGPRPLCPPARASWAFSHRFPSPLAPRRSPTPFFMASLLCCGPKLAACGIVLSAWGVIMLIMLGIFFNVHSAVLIEDVPFTEKDFENGPQNIYNLYEQVSYNCFIAAGLYLLLGGFSFCQVRLNKRKEYMVR 137 T 0.0026 DUF2650 pdbpercent F Eukaryota T 6wlx 2 B B CTNB1_HUMAN BETA-CATENIN KKRLSVE 7 T 0.051 Adaptin_N unppssm F Eukaryota T 6wlz 3 G,H,I X,Y,Z Q5ZWW6_LEGPH SidK MSFIKVGIKMGGLTSEQYHSQVVGKIGYIARCMQTIDPENNLKKIREDYQDVLIWAEKNYRFEEILEASKSGKCPNDLDALSRRSLILQELLRLVSSISPFKMKLDLIESQYEKMKQHVNLWKSDYHVKLNQLNQLTDYLKNAAPTPKNNFLRAMTSVLQMQIAQYGITEDNEGINQLFKLGLHLLAMANEKIDEQYHLFKGYVKDQPEESPFEGILPAEDQKILVKTMIDYAMPKLSSKVLQDKLSALSSSDVLTKTLLDSIDRIVKENEKLNALSKVKLGKFGLDIREIEVIYSQALKISPQDALQYTAQQCDAQLLSMAFPDSQNYIIESISNKKVKTIAELIHSKEFIYQIIKTEVFKQVDPNEKIRLQAATELYQLLGRIMDKQINLFTKMNLEQINEYIQTKTKAILDKIPERVELLTFMGFEIPTFKGIETLMTDISHSQDNETLAIAQEFYTNIKNAKNQLLGDKLIEDITPQDVEKFFNQCSQYGSEAAEKLADNRPVLTKIADILTAIARWAISLIGFNTPPQFLAPTRTCVDQVSDEITKIKLKLEDTLGSLQKVQEESLSL 573 T 0.15 Chalcone pdb F Bacteria T 6wmf 2 B B KASH5_HUMAN COILED-COIL DOMAIN-CONTAINING PROTEIN 155,KASH DOMAIN-CONTAINING PROTEIN 5 GPGGSGGPSPPPTWPHLQLCYLQPPPV 27 T 1.7 B56 pdbhh F Eukaryota T 6wmk 1 A,C A,C Beta sheet heterodimer LHD29 - Chain A SGGSTWQWVLINISEEARQRIEEYVRRISKKEGTEVHFEKDDGVLHIRVKNLHEKRAREIHEYAKRVIL 69 T 0.005 TnpV pdb F T 6wmk 2 B,D B,D Beta sheet heterodimer LHD29 - Chain B SGGSSSIFLLSNVSEEARQRAEEYVRRISKKEGTEVRFEKDDGFLTIEVKNLSEERLREIAEYLWRVAV 69 T 0.013 MecA pdb F T 6wmq 2 C,D E,F NCOR1_HUMAN N-COR1 RTHRLITLADHICQIITQDFARN 23 T 40 Es2 pdbhh F Eukaryota T 6wnm 1 A,B A,B A0A509JD33_PSEAI Pf4r MSTPADRARLLIKKIGPKKVSLHGGDYERWKSVSKGAIRVSTEEIDVLVKIFPNYALWIASGSIAPEVGQTSPDYDEANLNLSNQNAGAHHHHHH 95 T 0.0022 BetR unphh F Bacteria T 6wnx 3 C,F,I C,F,I CTNB1_HUMAN BETA-CATENIN LDSGIHSGA 9 T 2.4 Peptidase_C9 pdbhh F Eukaryota T 6wo6 1 A,B A,B A0A3A6VZ03_LEGPN RavA SNATFTCDELKGLEHPYEVLGNGDALAENREELNKLTNDAALVLASRLVLECPVNELKDFAHAIEAARMPQDDSDTFHSFLFQAYQVKKRIISLLDPRNINPHSMILEKEFDGELFNNFNKLAIDVLTNNEVAIALRLAETTPAQDRSRVSQNINNIFPQSLFAAKVGHAFAVRRDIERLLLGDRPDQFFSSREFKIDSCIEFASLFNVINDKESSIAGKLALRTPAENRTDVVMKIKGFCAEDSELAIKVQSAFALRRDIERNLLGDNPEQFFSSRDFSVDLCLEFAILFPELLKGHEQAIGEKLAKLDAKVRSDISRKLEMINGAAHEQ 331 T 0.29 PORR pdbpssm F Bacteria T 6wpb 1 A A HLP1_BOAPU HSP1-NH2 GILDAIKAIAKAAGX 15 T 0.48 Antimicrobial_2 unphh F Eukaryota T 6wpv 1 A A Xanthoxycyclin D GTVAVQFL 8 T 9.9 MatB pdbhh F T 6wq2 3 C,D,E,F,G,H,I,J,K,L,M,N,O,P,Q,R,S A,B,C,D,E,F,G,H,I,J,K,L,M,N,P,R,T Y035_SIFVH Structural protein MCP2 MARRNRRLSSASVYRYYLKRISMNIGTTGHVNGLSIAGNPEIMRAIARLSEQETYNWVTDYAPSHLAKEVVKQISGKYNIPGAYQGLLMAFAEKVLANYILDYKGEPLVEIHHNFLWELMQRQSGAGLGVTSGFIYTFVRKDGKPVTVDMSKVLTEIEDALFKLVKK 167 T 0.022 DUF1581 pdb T Viruses T 6wq2 4 AA,BA,CA,DA,EA,FA,GA,HA,IA,JA,T,U,V,W,X,Y,Z h,i,j,k,l,m,n,p,r,t,a,b,c,d,e,f,g Y036_SIFVH Structural protein MCP1 MAGRQSHKKIDVRNDTSTRYKGKLYGIFVNYMGEKYAQQLVENMYSNYNDVFVEIYNKMHNALRPTLVKLAGAGATFPLWQLVNEAIYAVYLTHKETASFLVTKYVARGVPAMTVKTLLAEVGNQLKELVPAVAEQIGSVTLDHTNVVSTVDNIVTSMPALPNSYAGVLMKTKVPTVTPHYAGTGTFSSMESAYKALEDIERGL 204 T 0.019 MMS22L_C pdb T Viruses T 6wqj 1 A A A0A0A0L4Q9_CUCSA Vicilin-buried peptide-10 QKETEICRQWCQVMKPQGGEEQRRCQQECEERLRD 35 T 4.8E-05 Vicilin_N unphh F Eukaryota T 6wqr 1 A A HSTX1_HAESL HSTX-I ACKEYWECGAFLFCIEGICVPMIX 24 T 0.0053 DUF3397 unppercent F Eukaryota T 6wqu 4 D D NOTC3_HUMAN NOTCH 3 ARRKREHSTLWFPEGFSL 18 T 0.062 VPS9 unphh F Eukaryota T 6wrv 2 D,E,F D,C,F Computationally designed protein 3DS18 DEREEEQRRRLEEVKEEAKRRERSEQDLAVLYLEAVNAAVVFVADSEEEAKRVADIVKKLVPEVIIFVHDNFVVFVVDSDEAARRVYEIVERAQ 94 T 0.0011 MCM6_C pdb F T 6wrw 2 C,D C,D Computationally designed protein 2DS25.5 DEEEIQKAIEELLRKGVSEEEAAIIIVQRFNVAVVVVVQDERQGKHISEYIRRYIPEADVILFANLVVIKVETHELSTRVWEAAQKAY 88 T 0.023 NUDIX_2 pdb F T 6wrx 2 C,D C,D Computationally designed protein 2DS25.1 DEEEIQKAIEELLRKGVSEEEAAIIIVQRFNVAVVVVVQDERQAKHISEYIRRYIPEADVILFANIVVIKVETHELRKRVWEAAQKAY 88 T 0.014 NUDIX_2 pdb F T 6wt4 1 A,B A,B CAP13_FLASX Bacterial STING SEAEYSPAFALAVGYFKNFIFPAITQIKENGEVNPKICIYKPKHFDELTSTNIDMIKAELTNKKYNLSEINLSLKGARARDILTLNKKSKIHSYFDFPNTLLSLYSYVDFKIASSNNNSSELKKKKFVELLIEQFYLKLNELIQENNLTNNITFCDKNLQGL 162 T 0.00067 TMEM173 pdbhh F Bacteria T 6wt5 1 A,B,C,D A,B,C,D CAP12_CAPGB ABC-TYPE SUGAR TRANSPORT SYSTEM, PERIPLASMIC COMPONENT SNDSDINFFPSSTLAAVYYENFIKPTCSHIINNGGLLDKNGYIYKKCTIKIIIPKKLTSDVNSQFQRIKAKIETKELSFEYLGRPRNINVEIIAEDGEVMIIDFPTILSGINYAISNLLPQDFNSMSVDYEAILSRELERFVYTLKKIALRDGFDDLIKIVDEDN 165 T 0.00018 TMEM173 pdbhh F Bacteria T 6wt8 1 A A CDNE_FLASX FSCDNE SQKNYLELIKKVRERSNPDLVQMTKMYSETLSGSKLFENKSIEYSDVSIYIKESMKGVAPSYTMNSKVAANKVEAHLKKSHGNLVDFERQGSVMTNTHILKENDVDLVQITNKSSEFDHKGLEKALNNTSVLKTEEILNLKKHKENFSPYQGNQIDDLKYVRLKSELVLSSTYKTVDIEKENSIYVKVTEPERDIDVVTATYYKSVDFMKTNDKSRKGIQIYNKKTGKINDVDYPFLSIERINVKDIISNRRLKNMIRFLKNIKYDCPHIENKGSIRSFHINAICYNIDVKKYEDLHYLDLVSILYQELTNIISNKSYRDNIKSVDGCEYIFEFDCAKKLIEIEFLSQELDSIIADLHNQSLLVG 365 T 0.00046 NTP_transf_2 pdbpercent F Bacteria T 6wt9 1 A A CDNE_CAPGB CGCDNE SEKKNYSALFENLQNRSNPEKLQEITTKFFSDNPDVKYNDVLKYITLAMNGVSPEYTNKSREAGEKVKLHLQDILLDVEYQYQGSVMTNTHIKGYSDIDLLVISDKFYTLDERNIIENLEVNKFSLSQEKIQKLQQELLGKKYHSATNDLKNNRLLSEQKLSSVYEICDITHPKAIKITNKSMGRDVDIVIANWYDDAQSVINNRQIEYRGIQIYNKRSNTIENRDFPFLSIQRINKRSSETKGRLKKMIRFLKNLKADSDEKIELSSFDINAICYNIEKNKYLHSNKYQLVPILYEQLNELVSNSNKINSLKSVDGHEYIFSRNNIDKKESLKMLLQEVKIIYSNLQSYL 351 T 1.3E-05 NTP_transf_2 pdb F Bacteria T 6wuc 4 D W CENPW_YEAST CENP-W HOMOLOG,CONSTITUTIVE CENTROMERE-ASSOCIATED NETWORK PROTEIN WIP1,W-LIKE PROTEIN 1 SNAMDTEALANYLLRQLSLDAEENKLEDLLQRQNEDQESSQEYNKKLLLACGFQAILRKILLDARTRATAEGLREVYPYHIEAATQAFLDSQ 92 T 4.6E-05 CENP-W pdbhh F Eukaryota T 6wuc 5 E T CENPT_YEAST CENP-T HOMOLOG,CO-PURIFIED WITH NNF1 PROTEIN 1,CONSTITUTIVE CENTROMERE-ASSOCIATED NETWORK PROTEIN CNN1 MSTPRKAAGNNENTEVSEIRTPFRERALEEQRLKDEVLIRNTPGYRKLLSASTKSHDILNKDPNEVRSFLQDLSQVLARKSQGNDTTTNKTQARNLIDELAYEESQPEENELLRSRSEKLTDNNIGNETQPDYTSLSQTVFAKLQERDKGLKSRKIDPIIIQDVPTTGHEDELTVHSPDKANSISMEVLRTSPSIGMDQVDEPPVRDPVPISITQQEEPLSEDLPSDDKEETEEAENEDYSFENTSDENLDDIGNDPIRLNVPAVRRSSIKPLQIMDLKHLTRQFLNENRIILPKQTWSTIQEESLNIMDFLKQKIGTLQKQELVDSFIDMGIINNVDDMFELAHELLPLELQSRIESYLF 361 T 0.0019 CENP-T_C pdbhh F Eukaryota T 6wud 2 B B TMC1_MOUSE BEETHOVEN PROTEIN,DEAFNESS PROTEIN,TRANSMEMBRANE COCHLEAR-EXPRESSED PROTEIN 1 GGGDDNTFNFSWKVFCSWDYLIGNPETADNKFNSITMNFKEAIIEERAAQVEENI 55 T 4.4 NAD_binding_5 pdbhh F Eukaryota T 6wux 1 A,B A,B HTRSN_PHYTS Homotarsinin NLVSDIIGSKKHMEKLISIIKKCRX 25 T 7 Spore_IV_A unphh F Eukaryota T 6wvs 1 A A DeNovoTIM15 hyperstable de novo TIM barrel MDILIVNATDVDEMLKQVEILRRLGAKQIAVVSDDWRILQEALKKGGDILIVNATDVDEMLKQVEILRRLGAKQIAVVSDDWRILQEALKKGGDILIVNATDVDEMLKQVEILRRLGAKQIAVVSDDWRILQEALKKGGDILIVNATDVDEMLKQVEILRRLGAKQIAVVSDDWRILQEALKKGGLEHHHHHH 193 T 0.0034 DNA_photolyase pdbpercent F T 6ww7 9 I I EMC10_HUMAN HEMATOPOIETIC SIGNAL PEPTIDE-CONTAINING MEMBRANE DOMAIN-CONTAINING PROTEIN 1 MAAASAGATRLLLLLLMAVAAPSRARGSGCRAGTGARGAGAEGREGEACGTVGLLLEHSFEIDDSANFRKRGSLLWNQQDGTLSLSQRQLSEEERGRLRDVAALNGLYRVRIPRRPGALDGLEAGGYVSSFVPACSLVESHLSDQLTLHVDVAGNVVGVSVVTHPGGCRGHEVEDVDLELFNTSVQLQPPTTAPGPETAAFIERLEMEQAQKAKNPQEQKSFFAKYWMYIIPVVLFLMMSGAPDTGGQGGGGGGGGGGGSGR 262 T 0.0033 2OG-FeII_Oxy_5 pdbpssm F Eukaryota T 6ww9 2 B,D X,Y SHLD3_HUMAN REV7-INTERACTING NOVEL NHEJ REGULATOR 1,SHIELD COMPLEX SUBUNIT 3 MLSRFIPWFPYDGSKLPLRPKRSPPASREEIMATL 35 T 2.2 Sm_like pdbhh F Eukaryota T 6wxh 2 D D CEA1_ECOLX Colicin-E1 METAVAYYKDGVPYDDKGQVIITLLNGTPDGSGSGGGGGKGGSKSESSAAIHATAKWSTAQLKKTQAEQAARAKAAAEAQAKAKANRDALTQRLKDIVNEALRHNASRTPSATELAHANNAAMQAEDERLRLAKAEEKARKEAEAAEKAFQEAEQRRKEIEREKAETERQLKLAEAEEKRLAALSEEAKALEHHHHHH 198 T 0.21 GTP1_OBG unppercent F Bacteria T 6wxo 1 A,B A,B TFD-HE MGHHHHHHGGWGGSGGENLYFQGDILIVNAKDVDEMLKQVEILRRLGAKQIAVHSSDWRILQEALKKGGDILIVNGGGMTITFRGDDLEALLKAAIEMIKQALKFGATITLSLDGNDLNINITGVPEQVRKELAKEAERLAKEFGITVTRTGGGDVDEMLKQVEILRRLGAKQIAVESDDWRILQEALKKG 191 T 0.0014 YicC_N pdbpercent F T 6wy6 2 C,D C,D EDE1_YEAST BUD SITE SELECTION PROTEIN 15, EDE1 ADSESEFENVANAGSMEQFETIDHKDLX 28 T 0.063 ComC pdbpssm F Eukaryota T 6wzx 2 C,D C,D ILE-GLY-LEU-TRP-LYS peptide IGLWKS 6 T 4.5 DUF3876 pdbhh F T 6wzz 2 B B VGLWKS peptide VGLWKS 6 T 1.1 CCD48 pdbhh F T 6x1g 1 A,B A,C B3CVM3_ORITI ULP_PROTEASE domain-containing protein MERLVKKVTSNLETELKFFKGRLVQELMQIVKNENGRIDHTSKNWQESASVLLNSQEKGAVSLAEVERAVSKMTQKLRDQKVSEEEVVNIESKLKFERASLEAKLFDDNEIKELINKRIKEDALRAIPFLGSDSESFMEKISPFVKLPDDSYSLLKANDKHHPFQNILYSNALKFFADSSDIGYLNDDSLKNLTPENLNAFEQAVAADIDKLMHHHHHH 219 T 0.0088 RAB3GAP2_C pdbpssm F Bacteria T 6x1s 3 I,J,K,L G,I,J,K NM23-1-pTza peptide RNIIXGSDS 9 T 0.6 Nbs1_C pdbhh F T 6x23 2 B B KCNJ9_HUMAN GIRK-3,INWARD RECTIFIER K(+) CHANNEL KIR3.3,POTASSIUM CHANNEL,INWARDLY RECTIFYING SUBFAMILY J MEMBER 9 LPPPESESKV 10 T 22 ANAPC9 pdbhh F Eukaryota T 6x2p 4 D D MP2K1_HUMAN MKK1,ERK ACTIVATOR KINASE 1,MAPK/ERK KINASE 1,MEK 1 TNLEALQKKLEELELDE 17 T 0.07 CAMSAP_CC1 unppercent F Eukaryota T 6x2s 4 D D MP2K1_HUMAN MKK1,ERK ACTIVATOR KINASE 1,MAPK/ERK KINASE 1,MEK 1 NLEALQKKLEELELNQ 16 T 0.07 CAMSAP_CC1 unppercent F Eukaryota T 6x2x 4 D D MP2K1_HUMAN MKK1,ERK ACTIVATOR KINASE 1,MAPK/ERK KINASE 1,MEK 1 ALEALQKKLEELELDE 16 T 0.07 CAMSAP_CC1 unppercent F Eukaryota T 6x3b 1 A,B,C,D A,B,C,D RMD_PSEAE NAD-DEPENDENT EPIMERASE/DEHYDRATASE FAMILY PROTEIN MGSSHHHHHHSSENLYFQGHMTQRLFVTGLSGFVGKHLQAYLAAAHTPWALLPVPHRYDLLEPDSLGDLWPELPDAVIHLAGQTYVPEAFRDPARTLQINLLGTLNLLQALKARGFSGTFLYISSGDVYGQVAEAALPIHEELIPHPRNPYAVSKLAAESLCLQWGITEGWRVLVARPFNHIGPGQKDSFVIASAARQIARMKQGLQANRLEVGDIDVSRDFLDVQDVLSAYLRLLSHGEAGAVYNVCSGQEQKIRELIELLADIAQVELEIVQDPARMRRAEQRRVRGSHARLHDATGWKPEITIKQSLRAILSDWESRVREE 324 T 0.00018 GDP_Man_Dehyd pdbpercent F Bacteria T 6x5g 2 B B LRRC7_HUMAN DENSIN-180,DENSIN,PROTEIN LAP1 SKSRSTSSHGRRPLIRQDRIVG 22 T 3.7 Rubredoxin_C pdbhh F Eukaryota T 6x5q 2 B B GRIA1_HUMAN GLUR-1,AMPA-SELECTIVE GLUTAMATE RECEPTOR 1,GLUR-A,GLUR-K1,GLUTAMATE RECEPTOR IONOTROPIC,AMPA 1,GLUA1 SKRMKGFCLIPQQSINEAIR 20 T 0.21 PROCT pdbhh F Eukaryota T 6x6f 1 A,B,C,D,E,F A,B,C,D,F,H Pf6r MESIQSRARTLIDKAGIDRLVRHGEISHSRWQSVRYKDIRMSTEELEVLQSLFPHYRLWLISGEVMPEAGQVSPDFEEASRNLAGQNAGAHHHHHH 96 T 7E-05 BetR pdbhh F T 6x6h 2 B A2 A9ZMR8_ECOLX STX2A PECQITGDRPVIKINNTLWESNTAAAFLNRKSQFLYTTGK 40 T 7.7 CdiA_C pdbhh F Bacteria T 6x6o 1 A,B A,B SPAC_BPT4 Protein spackle MKKFIFATIFALASCAAQPAMAGYDKDLCEWSMTADQTEVETQIEADIMNIVKRDRPEMKAEVQKQLKSGGVMQYNYVLYCDKNFNNKNIIAEVVGELEHHHHHH 105 T 0.0054 Mfp-3 unphh T Viruses T 6x6s 1 A,AA,AB,AF,B,BA,BF,C,CA,CE,CF,D,DE,DF,E,ED,EE,EF,FD,FE,GC,GD,GE,HC,HD,IB,IC,ID,JB,JC,KA,KB,KC,LA,LB,M,MA,MB,N,NA,O,OA,OE,P,PE,Q,QD,QE,RD,RE,SC,SD,SE,TC,TD,UB,UC,UD,VB,VC,WA,WB,WC,XA,XB,Y,YA,YB,Z,ZA AA,CC,EE,NA,AB,CD,NB,AC,CE,LA,NC,AD,LB,ND,AE,JA,LC,NE,JB,LD,HA,JC,LE,HB,JD,FA,HC,JE,FB,HD,DA,FC,HE,DB,FD,BA,DC,FE,BB,DD,BC,DE,MA,BD,MB,BE,KA,MC,KB,MD,IA,KC,ME,IB,KD,GA,IC,KE,GB,ID,EA,GC,IE,EB,GD,CA,EC,GE,CB,ED A0A2J9KJK3_HELPX Type IV secretion system apparatus protein Cag3 MFRKLATAVSLIGLLTSNTLYAKEISEADKVIKATKETKETKKEAKRLKKEAKQRQQIPDHKKPQYVSVDDTKTQALFDIYDTLNVNDKSFGDWFGNSALKDKTYLYAMDLLDYNNYLSIENPIIKTRAMGTYADLIIITGSLEQVNGYYNILKALNKRNAKFVLKINENMPYAQATFLRVPKRSDPNAHTLDKGASIDENKLFEQQKKMYFNYANDVICRPDDEVCSPLRDEMVAMPTSDSVTQKPNIIAPYSLYRLKETNNANEAQPSPYATATAPENSKEKLIEELIANSQLVANEEEREKKLLAEKEKQEAELAKYKLKDLENQKKLKALEAELKKKNAKKPRVVEVPVSPQTSNSDETMRVVKEKENYNGLLVDKETTIKRSYEGTLISENSYSKKTPLNPNDLRSLEEEIKSYYIKSNGLCYTNGINLYVKIKNDPYKEGMLCGYESVQNLLSPLKDKLKYDKQKLQKALLKDSK 481 T 0.12 RRP36 pdbpssm F Bacteria T 6x6s 2 AE,BB,CD,DA,EC,F,FF,GB,HE,IA,JD,K,KF,LC,ME,NB,OD,PA,QC,R,SB,TE,UA,VD,W,XC,YE,ZB Km,EM,Im,CM,Gm,AM,NM,Em,LM,Cm,JM,Am,Nm,HM,Lm,FM,Jm,DM,Hm,BM,Fm,MM,Dm,KM,Bm,IM,Mm,GM A0A2J9KJL4_HELPX Type IV secretion system apparatus protein CagM MLAKIVFSSLVAFGVLSANVEQFGSFFNEIKKEQEEVAAKEDALKARKKLLNNTHDFLEDLIFRKQKIKELMDHRAKVLSDLENKYKKEKEALEKETRGKILTAKSKAYGDLEQALKDNPLYRKLLPNPYAYVLNQETFTKEDRERLSYYYPQVKTSSIFKKTTATTKDKAQALLQMGVFSLDEEQNKKASRLALSYKQAIEEYSNNVSNLLSRKELDNIDYYLQLERNKFDSKAKDIAQKATNTLIFNSERLAFSMAIDKINEKYLRGYEAFSNLLKNVKDDVELNTLTKNFTNQKLSFAQKQKLCLLVLDSFNFDTQSKKSILKKTNEYNIFVDSDPMMSDKTTMQKEHYKIFNFFKTVVSAYRNNVAKNNPFE 376 T 0.021 DUF4363 pdb F Bacteria T 6x7i 1 A A B2CL1_HUMAN BCL2-L-1,APOPTOSIS REGULATOR BCL-X GQERFNRWFLTGMTVAGVVLLGSLFSRK 28 T 0.057 DUF3094 pdb F Eukaryota T 6x7w 2 B,F G,D HIV fusion peptide 512-519 XAVGLGAVF 9 T 4.6 DUF3918 pdbhh F T 6x89 5 E A7 A0A1S3UVC7_VIGRR NDUA7 MAKSASNSLVQTLKRYIKKPWEITGPCADPEYRSAVPLATEYRLQCPATTKEKPCIPNSLPETVYDIKYFSRDQRRNRPPIRRTVLKKADVEKLAKEQTFAVSDFPPVYLNSAVEEDINAIGGGYQG 127 T 0.0011 CI-B14_5a pdb F Eukaryota T 6x89 29 CA P2 A0A1S3TGE7_VIGRR Protein At2g27730, mitochondrial MAARVAARYGSRRLFSSGSGKILSEEEKAAENAYFKKAEQDKLEKLARKGPQPEASSGGSVIDAKPSGSGHTGASAERVSTDKHRNYAVVAGTITILGALGWYLKGTAKKPEVQD 115 T 0.0082 IATP unphh F Eukaryota T 6x8n 1 A,B A,B De novo designed ABLE protein SVKSEYAEAAAVGQEAVAVFNTMKAAFQNGDKEAVAQYLARLASLYTRAEELLNRILEKARREGNKEAVTLMNEFTATFQTGKSIFNAMVAAFKNGDDDSFESYLQALEKVTAKGETLADQIAKAL 126 T 0.028 ASD2 pdbpssm F T 6x8r 1 A A SxIIIC peptide RGCCNGRGGCSSRWCRDHARCCX 23 T 0.041 Mu-conotoxin pdbpssm F T 6xa1 83 EC NC Stalled Nascent chain GLQIPAILGILGGILALLILILNPN 25 T 0.022 Phage_holin_5_2 pdb F T 6xar 2 C,D C,D SLAP2_MOUSE SRC-LIKE ADAPTER PROTEIN 2,SLAP-2 LSEGLRESLSSYISLAEDP 19 T 0.59 RHH_6 pdbhh F Eukaryota T 6xaw 2 B B UME6_YEAST NEGATIVE TRANSCRIPTIONAL REGULATOR OF IME2,REGULATOR OF INDUCER OF MEIOSIS PROTEIN 16,UNSCHEDULED MEIOTIC GENE EXPRESSION PROTEIN 6 GPRSRLLLGPNSASSSTKLDDDLGTAAAVLSNMRSSPYRTHDKPIS 46 T 4.6 ACC_epsilon pdbhh F Eukaryota T 6xf7 1 A,B B,C LMBD1_REOVL Lambda 1 protein QRHITEFISSWQNHPIVQVSADVENKKTAQLLHADTPRLVTWDAGLCTSFKIVPIVPAQVPQDVLAYTFFTSSYAIQSPFPEAAVSRIVVHTRWASNVDFDRDSSVIMAPPTENNIHLFKQLLNTETLSVRGANPLMFRANVLHMLLEFVLDNLYLNRHTGFSQDHTPFTEGANLRSLPGPDAEKWYSIMYPTRMGTPNVSKICNFVASCVRNRVGRFDRAQMMNGAMSEWVDVFETSDALTVSIRGRWMARLARMNINPTEIEWALTECAQGYVTVTSPYAPSVNRLMPYRISNAERQISQIIRIMNIGNNATVIQPVLQDISVLLQRISPLQIDPTIISNTMSTVSESTTQTLSPASSILGKLRPSNSDFSSFRVALAGWLYNGVVTTVIDDSSYPKDGGSVTSLENLWDFFILALALPLTTDPCAPVKAFMTLANMMVGFETIPMDNQIYTQSRRASAFSTPHTWPRCFMNIQLISPIDAPILRQWAEIIHRYWPNPSQIRYGAPNVFGSANLFTPPEVLLLPIDHQPANVTTPTLDFTNELTNWRARVCELMKNLVDNQRYQPGWTQSLVSSMRGTLDKLKLIKSMTPMYLQQLAPVELAVIAPMLPFPPFQVPYVRLDRDRVPTMVGVTRQSRDTITQPALSLSTTNTTVGVPLALDARAITVALLSGKYPPDLVTNVWYADAIYPMYADTEVFSNLQRDMITCEAVQTLVTLVAQISETQYPVDRYLDWIPSLRASAATAATFAEWVNTSMKTAFDLSDMLLEPLLSGDPRMTQLAIQYQQYNGRTFNVIPEMPGSVIADCVQLTAEVFNHEYNLFGIARGDIIIGRVQSTHLWSPLAPPPDLVFDRDTPGVHIFGRDCRISFGMNGAAPMIRDETGMMVPFEGNWIFPLALWQMNTRYFNQQFDAWIKTGELRIRIEMGAYPYMLHYYDPRQYANAWNLTSAWLEEITPTSIPSVPFMVPISSDHDISSAPAVQYIISTEYNDRSLFCTNSSSPQTIAGPDKHIPVERYNILTNPDAPPTQIQLPEVVDLYNVVTRYAYETPPITAVVMGVP 1059 T 0.59 DUF5810 unppercent T Viruses T 6xfk 2 B B SCTC_SALTY T3SS SECRETIN,PROTEIN INVG DDKLQKWVRVYLDRGQ 16 T 0.062 DUF3963 pdbhh F Bacteria T 6xfl 2 B B SCTC_SALTY T3SS SECRETIN,PROTEIN INVG GSHMDPLTPDASESVNNILKQSGAWSGDDKLQKWVRVYLDRGQEAIK 47 T 0.11 DUF3485 pdbpssm F Bacteria T 6xfm 1 A,B,C,D,E,F,G,H 1,2,3,4,5,6,7,8 FUS_HUMAN 75 KDA DNA-PAIRING PROTEIN,ONCOGENE FUS,ONCOGENE TLS,POMP75,TRANSLOCATED IN LIPOSARCOMA PROTEIN GSYGSSSQSSSYGQPQSGSYSQQPSYGGQQQSYGQQQSYNPPQGYGQQNQYNSSSGGGGGGGGGGNYGQDQSSMSSGGGSGGGYGNQDQSGGGGSGGYGQGDRG 104 T 920 HMMR_N pdbhh F Eukaryota T 6xi6 1 A A helical fusion design GKELEIVARLQQLNIELARKLLEAVARLQELNIDLVRKTSELTDEKTIREEIRKVKEESKRIVEEAEQEIRKAEAESLRLTAEAAADAARKAALRMGDERVRRLAAELVRLAQEAAEEATRDPNSSDQNEALRLIILAIEAAVRALDKAIEKGDPEDRERAREMVRAAVRAAELVQRYPSASAANEALKALVAAIDEGDKDAARCAEELVEQAEEALRKKNPEEARAVYEAARDVLEALQRLEEAKRRGDEEERREAEERLRQACERARKKN 272 T 0.00026 SMBP pdb F T 6xi8 1 A C TFB3_YEAST RNA POLYMERASE II TRANSCRIPTION FACTOR B 38 KDA SUBUNIT,RNA POLYMERASE II TRANSCRIPTION FACTOR B P38 SUBUNIT PFNGDREAHPPFTLKGSVYNDPFIKDLEHRKEFIASGFNTNYAYERVLTEAFMGLGCVISEEL 63 T 6.1 DUF6190 pdbhh F Eukaryota T 6xib 3 C I Peptide 30 XCKGWWDHYXCA 12 T 0.047 EPV_E5 pdbhh F T 6xic 3 C I Peptide 40 XCKXWWDHYXX 11 T 1.7 DUF5958 pdbhh F T 6xid 3 C I Peptide 51 XCKXWWPTYXCA 12 T 0.35 ZinT pdbhh F T 6xie 3 C I Peptide 77 XCAXXWQTXXC 11 T 11 DUF2754 pdbhh F T 6xke 1 A,B,C A,B,C A0A1Y9G8D0_ANOAL Albicin ANNHIRTVLKLFRTIDLDDSKKSFYLTAAKYGIQTQLREPIIRIVGGYLPSTKLSEACVKNMISEVYEIEGDFYSKFSYACEDHAPYSVECLEDARDDYLTQLVELFKETKKCLRE 116 T 0.094 Herpes_UL55 pdbpssm F Eukaryota T 6xl7 1 A A SG7.AF ARKHVQELLKTFRRIDFDETRKSVYLQSAKFGVQSQLREPLTKKVLNYWDDVKLSKTCLDRMVTKVNDVKETFYAGFSYACESHNQYSVDCLEAAKPSYLTALGEIRGETEKCLTTRLK 119 T 5.8 HECW1_helix pdbhh F T 6xli 3 G,H,I E,F,P TAU_HUMAN Tau Phosphopeptide (Ac-SR(pT)PSLP(pT)PPTRE-OH) XSRTPSLPTPPTRE 14 T 2.3 UPF0449 pdbhh F Eukaryota T 6xmb 1 A,B,C A,B,C A5HUP6_ANOST Anophensin TEATRKHVQQLMKVFRAIDFDFTKKAFYLHRAKYGVQNQLRNPLYLKAMSLPRSAKLSQPCLNKMIDEVNDLESTFYAGFSFNCHDHDQYSMDCLEAAEPTYLDGLKKLAASTEQCLVQK 120 T 0.025 PL48 pdbpssm F Eukaryota T 6xmn 2 B B CXCR1_HUMAN CXCR-1,CDW128A,HIGH AFFINITY INTERLEUKIN-8 RECEPTOR A,IL-8R A,IL-8 RECEPTOR TYPE 1 MSNITDPQMWDFDDLNFTGMPPADEDYSP 29 T 0.01 FA_desaturase unppercent F Eukaryota T 6xn9 1 A A Recifin modulatory peptide QEAFCYSDRFCQNYIGSIPDCCFGRGSYSFELQPPPWECYQC 42 T 3.5 Rubredoxin pdbhh F T 6xnj 2 B B Q8XAN6_ECO57 NleG8 peptide LATQNICTRI 10 T 5 DUF3894 pdbhh F Bacteria T 6xnr 1 A,B,C,D,E AAA,BBB,CCC,DDD,EEE Antifreeze protein MYSCRAVGVDASTVTDVQGTCHAKATGPGAVASGTSVDGSTSTATATGSGATATSTSTGTGTATTTATSNAAATSNAIGQGTATSTATGTAAARAIGSSTTSASATEPTQTKTVSGPGAQTATAIAIDTATTTVTASLEHHHHHH 145 T 4.8 Sporozoite_P67 pdbpercent F T 6xns 1 A,B,C,D,E,F A,B,C,D,E,F C3_crown-05 MGDRSDHAKKLKTFLENLRRHLDRLDKHIKQLRDILSENPEDERVKDVIDLSERSVRIVKTVIKIFEDSVRKLLKQINKEAEELAKSPDPEDLKRAVELAEAVVRADPGSNLSKKALEIILRAAAELAKLPDPDALAAAARAASKVQQEQPGSNLAKAAQEIMRQASRAAEEAARRAKETLEKAEKDGDPETALKAVETVVKVARALNQIATMAGSEEAQERAARVASEAARLAERVLELAEKQGDPEVARRARELQEKVLDILLDILEQILQTATKIIDDANKLLEKLRRSERKDPKVVETYVELLKRHERLVKQLLEIAKAHAEAVEGGSLEHHHHHH 340 T 0.0088 DUF327 pdbpercent F T 6xod 2 B B PEX22_ARATH PEROXIN-22,ATPEX22 GPAVQDVVDQFFQPVKPTLGQIVRQKLSEGRKVTCRLLGVILEETSPEELQKQATVRSSVLEVLLEITKYSDLYLMERVLDDESEAKVLQALENAGVFTSGGLVKDKVLFCSTEIGRTSFVRQLEPDWHIDTNPEISTQLARFIKYQLHVATVKPERTAPNVFTSQSIEQFFGSV 175 T 0.00058 Peroxin-22 pdbhh F Eukaryota T 6xor 1 A,B A,B SWA_DROME Protein swallow SFDRLLAENESLQQKINSLEVEAKRLQGFNEYVQERLDRITDDFVKMKDNFETLRTELSEAQQKLRRQQDN 71 T 0.00023 HALZ pdbpssm F Eukaryota T 6xp6 5 I,J C,F DQ2-glia-a2 peptide AAPQPELPYPQPGSGGSIEGRGGSGA 26 T 23 Dicty_CAD pdbhh F T 6xr1 1 A A dTor_9x57R GNLELALKALQILVNAAYVLAEIARDRGNEELLEKAARLAEEAARQAEEIARQARKEGNLELALKALQILVNAAYVLAEIARDRGNEELLEKAARLAEEAARQAEEIARQARKEGNLELALKALQILVNAAYVLAEIARDRGNEELLEKAARLAEEAARQAEEIARQARKEGNLELALKALQILVNAAYVLAEIARDRGNEELLEKAARLAEEAARQAEEIARQARKEGNLELALKALQILVNAAYVLAEIARDRGNEELLEKAARLAEEAARQAEEIARQARKEGNLELALKALQILVNAAYVLAEIARDRGNEELLEKAARLAEEAARQAEEIARQARKEGNLELALKALQILVNAAYVLAEIARDRGNEELLEKAARLAEEAARQAEEIARQARKEGNLELALKALQILVNAAYVLAEIARDRGNEELLEKAARLAEEAARQAEEIARQARKEGNLELALKALQILVNAAYVLAEIARDRGNEELLEKAARLAEEAARQAEEIARQARKEG 514 T 0.31 Ribosomal_S21 pdb F T 6xr2 1 A,B,C,D,E,F A,B,C,D,E,F dTor_3x57R GNLELALKALQILVNAAYVLAEIARDRGNEELLEKAARLAEEAARQAEEIARQARKEGNLELALKALQILVNAAYVLAEIARDRGNEELLEKAARLAEEAARQAEEIARQARKEGNLELALKALQILVNAAYVLAEIARDRGNEELLEKAARLAEEAARQAEEIARQARKEG 172 T 0.053 Ribosomal_S21 pdb F T 6xrf 2 C,F,I C,F,I TSE6_PSEAE PAAR motif family protein MDAQAAARLGDEIAHGFGVAAMVAGAVAGALIGAAVVAATAATGGLAAVILAGSIAAGGLSHHHHHH 67 T 0.14 MaoC_dehydratas pdb F Bacteria T 6xro 2 B B peptide boronate inhibitor KRFRSMQYSA 10 T 5 NmU-R2_C_term pdbhh F T 6xs5 2 B B RT-D1 XXIIDTPLGVFLSSLKR 17 T 0.27 RNA_GG_bind pdbhh F T 6xs7 2 B B 48V-DTY-THR-THR-ILE-TYR-TRP-THR-PRO-LEU-GLY-THR-PHE-PRO-ARG-ILE-ARG XXTTIYWTPLGTFPRIR 17 T 2.4 DUF3438 pdbhh F T 6xs8 2 B B 48V-DTY-GLY-TYR-ASP-PRO-LEU-GLY-LEU-LYS-TYR-PHE-ALA XXGYDPLGLKYFA 13 T 1.5 Ntox47 pdbhh F T 6xs9 3 C,D,E,F C,D,E,F 48V-TYR-ILE-LYS-THR-PRO-LEU-GLY-THR-PHE-PRO-ASN-ARG-HIS-GLY XYIKTPLGTFPNRHG 15 T 0.078 MtrB pdbhh F T 6xsa 2 B B 48V-TYR-LEU-PRO-THR-ILE-THR-GLY-VAL-GLY-HIS-LEU-TRP-HIS-PRO-LEU XYLPTITGVGHLWHPL 16 T 0.031 Exonuc_VII_L pdbhh F T 6xt4 1 A A 1BH_69 MGHHHHHHGGDSLDMLEWSLGSNDEKEKLKELLKRAEELAKSPDPEDLKEAVRLAEEVVRERPGSNLAKKALEIILRAAEELAKLPDPKALIAAVLAAIKVVREQPGSNLAKKALEIILRAAEELAKLPDPLALAAAVVAATIVVLTQPGSELAKKALEIIERAAEELKKSPDPLAQLLAIAAEALVIALKSSSEETIKEMVKLTTLALLTSLLILILILLDLKEMLERLEKNPDKDVIVKVLKVIVKAIEASVLNQAISAINQILLALSD 271 T 0.013 Dak2 pdbpercent F T 6xtd 1 A A A0A1C3HFI3_SERMA RHS1-CT MGSSHHHHHHSQDPKPRCAATKANDHNQAAFGRQWQGRGIYKGRDSWSNIMLKEGDIVYGGAPGQSGFYFNKATLDAAGGSRAKLWESLQVLPHEKFGYRSKIQAYRVKRETIAGTGKAISQDPTRFGEGGGTQFFLSNYKTVLEPIDKPFEIGL 155 T 0.28 TetR_C_18 pdb F Bacteria T 6xtd 2 B B RHSI1 MMQLDTYDGTLELAGITLGTATTREMLIKGSRLWEGWPEKSDGRTTSYRTIISTKKEKAGDIYIIADFSGAFITDAVLCSWRFAPEKLMMGIQKKVEGAITKNLRTWFYEKTHIQLPVSGSWGHIDAAYDPHNLTGTIVCNYRSAFHTEDEWRKYCKRNNIIY 163 T 1.3 Fip1 pdbhh F T 6xth 1 A A A0A5P9PRQ2_9PSEU Felipeptin A1 GSRGWGFEPGVRCLIWCD 18 T 9.2 2EXR pdbhh F Bacteria T 6xti 1 A A A0A5P9PSL4_9PSEU Felipeptin A2 GGGGRGYEYNKQCLIFC 17 T 6 MFA1_2 pdbhh F Bacteria T 6xtt 1 A B Q5ZVQ5_LEGPH NttA MAHHHHHHVDDDDKMEDTANPNEMTKDAWLNSMTPLLPDLICKGFIQDPDLKKRFDEIKMTYEQCVTLIPESTKKCQDELYASMPDKINSETAGTWGRSLGECIGKDFAEKHLIPK 116 T 0.62 PAGK unphh F Bacteria T 6xvd 2 B P upain-1-W3F CSFRGLENHAMC 12 T 1.2 LRRNT pdbhh F T 6xwd 2 B P AMPN_HUMAN Amino peptidase N 38-46 NKNANSSPV 9 T 93 DUF6446 pdbhh F Eukaryota T 6xwu 1 A A Q9VHP9_DROME RE68959p MSKPQNNDTLELDDILSQPVKDKERFAAFMMRKLAENKPAQNDNLFGNFKLDFDLDFEVPLIKKSQAKPKSKLPEVQPLGELVSKNSAATEKVNEPPVDQAPNENVPPRRSPTLSPNNRRSMRRSGNVPGSDKLRRHAIRRRSRSCGRQLLPEFEETVNLTRSISSPVNFLPEISSTPCTEKQKEEVAKNTTRVETDKPAEKPMELSQEPEPENPLQTKVTSPARNPILAAEIEQICKERQSSFHKNVLQLDYSGRAPYSRPPTPSSPSVAGLRRTYTMEKGPAPGQLLLSPSHRYDTPSKMPVVKAKRFNQELMVPDTPERQSHDPAWQSEPQPEFVVPETQPQDLGELVQTLSRSAISPIVVINTSNSNRSVRRDAVAMKSVPTSPVTALSSPPIAPSPRRSAAASPQKSIAQLPRVEENMDAIMTDDESDEHPSTVPLNLAPSGGNTTRQRRLRSSNRARATIESQESSMRLLNLHKSVNAKKSKPRKTAIPLNKAPSAPINGEQFARELTRMSNYEILDLRKRNSLNEIYPLNGHRNHRSEKLILEEEIQRELLRRNLMDEAEGLPKQQSSDDSNEDYIPVPPKTQSLRTKSNDRSQGRGRPRSTRRDLPMTTELVNYLGLSQTLETRRKSSKDGKRCLYTKGSSDHEDNDSLSPVKLPRLSKSIQIVPPPPVSLRYSQSLQNLPCSGKFDFDNVVMAAPPDFHDSVNSDAIEIAPPPPEYVVNTRGRSTSGRKSNKNDLVLPPPGYEGGQEEEHDERPSQPRCTAKELQQSTQNGRRAMENELVPPPIEYVEEENRNNEQSRRSTKNGNLVDRNTHNAVEYCEPPEPPEYDDSDHGQASILRRSGKKLQHSKQSVQKSNKEQIIAPSYENNEDYDSDEEPIYNEEYGKEESQNKNVTRRKSDKDEMASHTLECIEGPDPNWNSSCNKQNRNHQNASKSKENDKLANRSSKSQKLSNPRQNAVGTEKSVALSNRGEECTEKSSDVMESLRVNTPTPPIDQNSDDVPSRNPSPSRTLLSDDVPSTSRAALEFLQRSQNMSKSRPPDESSADVVFKKPLAPAPRAKSKKGKSEVDKLKLAKMPVEAEELNTTGIRRSKRGQVPLQMSWCHTMDPSKFNFMSGFIEPRSKNSKTKKGNLSKAKKASATKPKPTVEKNLPDNRGPLCSSTPRISEKLPGAIPHSESLGLSTLTWEETEVQAEAEKVPKKRGRPKKAVGGVQTDTEAEPEPEPEPMISSVAPLTSDQEEPDVPDEQAPYTEAALGPVVFSTPLRDEQEEASTKLMQWLRGVGDAPPSASMSDENASVSSANELIFYQVDGIDYAFYNTKEKAMLGYMRFKPYQKRSMKQAKVHPLKLLVQFGEFNVETLAVGEEKEVHSVLRVGDMIEIDRGTRYSIQNAIDKVSVLMCIRS 1411 T 0.00049 CENP-C_C unppssm F Eukaryota T 6xxc 2 B B ERR3_HUMAN Estrogen Related Receptor gamma phosphopeptide KRRRKSCQA 9 T 23 PHD20L1_u1 pdbhh F Eukaryota T 6xxf 2 B BBB RyR2 Peptide KKAVWHKLLSKQRKRAVVACF 21 T 3.2 DUF5463 pdbhh F T 6xxs 2 C,D,F,H C,D,G,H NCOR1_HUMAN N-COR1 GITTIKEMGRSIHEIPR 17 T 0.61 DUF211 pdbhh F Eukaryota T 6xxz 1 A,B A,B 2-EK-4 XGEIKQQLAEIKQQLAEIKWQLAEIKQQLAGX 32 T 0.0016 DUF5320 pdbhh F T 6xy0 1 A,B,C,D A,B,C,D 3-EK-4 XGAIQQELKAIQQELKAIQWELKAIQQELKGX 32 T 0.0037 DUF5320 pdbhh F T 6xy1 1 A,B,C,D A,B,C,D 4-KE-4 XGEIQKQLKEIQKQLKEIQWQLKEIQKQLKGX 32 T 0.0015 DUF5320 pdbhh F T 6xya 1 A B L_SFTS RNA-dependent RNA polymerase GPAQSGTLGGFSKPQKTFVRPGGGVGYKGKGVWTGVMEDTHVQILIDGDGTSNWLEEIRLSSDARLYDVIESIRRLCDDLGINNRVASAYRGHCMVRLSGFKIKPASRTDGCPVRIME 118 T 2.2 DUF5363 pdbhh T Viruses T 6xyb 1 A,B,C,D A,B,C,D Q4D6Q6_TRYCC Uncharacterized protein GSHMPNLCVSATFNPPVITMLGSALREETVKLLEQRIPTGVSTSSSPSKDPVKFLFYPNPDHWRMELSQHFCDDLHKSAVFLTIIEGLEGEGWNLRASNSIRDSESGKDTTKLFFARRN 119 T 0.034 DUF4177 unphh F Eukaryota T 6xyh 1 A A AMS3 GCKNLNSHCYRQHRECCHGLVCRRPNYGNGRGILWKCVRA 40 T 0.0092 Toxin_22 pdbhh F T 6xyi 1 A A AMS9.3.1 GCKKLNSYCTRQHRECCHGLVCRRPDYGIGRGILWKCTRARK 42 T 0.0037 Toxin_12 pdb F T 6xyw 33 GA AI Q8L7U3_ARATH DECOY MPRSSLRLLAKPLLESRRGFCTSSDKIVASVLFERLRVVIPKPDPAVYAFQEFKFNWQQQFRRRYPDEFLDIAKNRAKGEYQMDYVPAPRITEADKNNDRKSLYRALDKKLYLLIFGKPFGATSDKPVWHFPEKVYDSEPTLRKCAESALKSVVGDLTHTYFVGNAPMAHMAIQPTEEMPDLPSYKRFFFKCSVVAASKYDISNCEDFVWVTKDELLEFFPEQAEFFNKMIIS 233 T 5.6E-10 MRP-L46 pdbhh F Eukaryota T 6xyw 37 KA AM Q9C9B5_ARATH TUMOR NECROSIS FACTOR RECEPTOR FAMILY PROTEIN MWFAGGGGGLRKLCRASAIFDNEISYNSLLVRYMSRERAVNVRKINPKVPIQEAYAISNSLYDLFKLHGPLSVPNTWLRAQEAGVSGLNSKTHMKLLLKWMRGKKMLKLICNQVGSSKKFFHTVLPEDPLQEQPAAPIENKKQAVKKKRSK 151 T 0.3 HARE-HTH pdbhh F Eukaryota T 6xyw 38 LA AN Q9SD44_ARATH PROTEIN TRANSLOCASE SUBUNIT MGFGAIRSILRPLSRTLVSRAVVNYSSAPFNATIPAAKPELCSFFGGSMTHLRLPWIPMANHFHSLSLTDTRLPKRRPMTHPKRKRSKLKPPGPYAYVQYTPGQPISSNNPNEGSVKRRNAKKRIGQRRAFILSEKKKRQALVQEAKRKKRIKQVERKMAAVARDRAWAERLIELQQLEEEKKKSMSS 188 T 0.032 DUF6087 pdb F Eukaryota T 6xzh 2 B B ARG-HIS-LYS-ILE-URL-URK-URL-LEU-GLN RHKIXXXLQ 9 T 7.1 SRC-1 pdbhh F T 6xzi 2 B B ARG-HIS-LYS-ILE-LEU-URK-UIL-URL RHKILXXX 8 T 1.3 SRC-1 pdbhh F T 6xzj 2 B B ARG-HIS-LYS-ILE-LEU-URR-UIL-URL-GLN RHKILXXXQ 9 T 1.8 SRC-1 pdbhh F T 6xzz 2 B B NCOR1_HUMAN N-COR1 RERIAAASSDLYLRPGS 17 T 3.7 B3R pdbhh F Eukaryota T 6y13 1 A A bp70 XHXXYXCIRCYAX 13 T 1.4 S_tail_recep_bd pdbhh F T 6y1q 1 A A Analog 5 PCKNXFXKTFTSCK 14 T 0.00046 Somatostatin pdb F T 6y26 3 C C GLY-ARG-LEU-ASN-ALA-PRO-ILE-LYS-VAL GRLNAPIKV 9 T 16 DUF4861 pdbhh F T 6y28 3 C C GLY-ARG-LEU-ASN-GLU-PRO-ILE-LYS-VAL GRLNEPIKV 9 T 3.5 DUF863 pdbhh F T 6y2a 3 C C mQ GRLNQPIKV 9 T 2 DUF3560 pdbhh F T 6y38 2 C,D C,D MYO15_MOUSE Chains: C,D ERLTLPPSEITLL 13 T 5.4 DUF4875 pdbhh F Eukaryota T 6y3r 2 B P GAB2_HUMAN Chain P PRRNTLPAMDNS 12 T 2.6 PKI pdbhh F Eukaryota T 6y3s 2 B P GAB2_HUMAN Gab2 NARSASFSQG 10 T 20 DUF6425 pdbhh F Eukaryota T 6y4e 1 A A B4EUK6_PROMH Fimbrial adhesin SIFSYITESTGTPSNATYTYVIERWDPETSGILNPCYGWPVCYVTVNHKHTVNGTGGNPAFQIARIEKLRTLAEVRDVVLKNRSFPIEGQTTHRGPSLNSNQECVGLFYQPNSSGISPRGKLLPGSLCGAHHHHHH 136 T 1.8 PSI_8 unp F Bacteria T 6y4f 1 A A B4EUK6_PROMH Fimbrial adhesin SIFSYITESTGTPSNATYTYVIERWDPETSGILNPCYGWPVCYVTVNHKHTVNGTGGNPAFQIARIEKLRTLAEVRDVVLKNRSFPIEGQTTHRGPSLNSNQECVGLFYQPNSSGISPRGKLLPGSLCGIAPPPVHHHHHH 141 T 1.8 PSI_8 unp F Bacteria T 6y4o 2 B B RYR2_HUMAN RYR2,CARDIAC MUSCLE RYANODINE RECEPTOR,CARDIAC MUSCLE RYANODINE RECEPTOR-CALCIUM RELEASE CHANNEL,TYPE 2 RYANODINE RECEPTOR SNARSKKAVWHKLLSKQRKRAVVACFRMAP 30 T 1.7 Spc110_C pdbhh F Eukaryota T 6y4q 2 C,D C,D ACE-LEU-THR-PHE-GLY-GLU-TYR-TRP-ALA-GLN-LEU-ALA-SER XLTFGEYWAQLAS 13 T 0.42 P53_TAD pdbhh F T 6y58 2 B P ERR3_HUMAN Estrogen Related Receptor gamma phosphopeptide VYRSLSFE 8 T 1.7 AbiJ_NTD5 pdbhh F Eukaryota T 6y8k 2 B PPP BCY10916 CIEEGQYCFADPYXC 15 T 0.0042 DUF5637 pdb F T 6y9l 1 A,B,C,D B,D,A,C GP_TSWV1 Glycoprotein KVEIIRGDHPEVYDDSAENEVPTAASIQRKAILETLTNLMLESQTPGTRQIREEESTIPIFAESTTQKTISVSDLPNNCLNASSLKCEIKGISTYNVYYQVENNGVIYSCVSDSAEGLEKCDNSLNLPKRFSKVPVIPITKLDNKRHFSVGTKFFISESLTQDNYPITYNSYPTNGTVSLQTVKLSGDCKITKSNFANPYTVSITSPEKIMGYLIKKPGENVEHKVISFSGSASITFTEEMLDGEHNLLCGDKSAKIPKTNKRVRDCIIKYSKSIYKQTACINFSWIR 288 T 0.018 Bunya_G2 unphh T Viruses T 6y9m 1 A,B,C,D A,B,C,D GP_TSWV1 Glycoprotein KVEIIRGDHPEVYDDSAENEVPTAASIQRKAILETLTNLMLESQTPGTRQIREEESTIPIFAESTTQKTISVSDLPNNCLNASSLKCEIKGISTYNVYYQVENNGVIYSCVSDSAEGLEKCDNSLNLPKRFSKVPVIPITKLDNKRHFSVGTKFFISESLTQDNYPITYNSYPTNGTVSLQTVKLSGDCKITKSNFANPYTVSITSPEKIMGYLIKKPGENVEHKVISFSGSASITFTEEMLDGEHNLLCGDKSAKIPKTNKRVRDCIIKYSKSIYKQTACINF 284 T 0.016 Bunya_G2 unphh T Viruses T 6y9o 2 B C CSKP_MOUSE CALCIUM/CALMODULIN-DEPENDENT SERINE PROTEIN KINASE TAPQWVPVSWVY 12 T 1.8 DUF463 unphh F Eukaryota T 6y9p 2 C,D,F,H,J,L C,D,F,H,J,L Q6PPF3_RAT Harmonin a1 PKEYDDELTFF 11 T 7.5 DUF3601 pdbhh F Eukaryota T 6ya2 1 A,B,C A,B,C GP_TSWV1 Glycoprotein STTQKTISVSDLPNNCLNASSLKCEIKGISTYNVYYQVENNGVIYSCVSDSAEGLEKCDNSLNLPKRFSKVPVIPITKLDNKRHFSVGTKFFISESLTQDNYPITYNSYPTNGTVCLQTVKLSGDCKITKSNFANPYTVSITSPEKIMGYLIKKPGENVEHKVISFSGSASITFTEEMLDGEHNLLCGDKSAKIPKTNK 199 T 0.018 Bunya_G2 unphh T Viruses T 6ya7 3 C C MCM2_HUMAN MINICHROMOSOME MAINTENANCE PROTEIN 2 HOMOLOG,NUCLEAR PROTEIN BM28 RRTDALTXSPGRDLP 15 T 110 P53 pdbhh F Eukaryota T 6yaz 1 A,B,C,D,E,F,G,H,I,J,K,L A,B,C,D,E,F,G,H,I,J,K,L CC-Type2-(TaId)5 XGEIAQATKEIAQATKEIAKATKEIAWATKEIAQATKGX 39 T 0.00096 MCPsignal pdb F T 6yb0 1 A,B,C,D,E,F,G,H,I,J,K,L A,B,C,D,E,F,G,H,I,J,K,L CC-Type2-(TaSd)2 XGEIAQALKEIAQALKESAKATKESAWATKEIAQALKGX 39 T 0.014 MCPsignal pdb F T 6yb1 1 A,B,C,D A,B,C,D K2-CCTM-VbIc XGKKSAWATVISALATVISALATVISAWATVGX 33 T 0.059 DUF6486 pdb F T 6yb2 1 A,B,C,D,E,F A,B,C,D,E,F CC-Type2-(TaId)2 XGEIAQALKEIAKATKEIAWATKEIAQALKGX 32 T 0.013 MCPsignal pdbpssm F T 6ycr 2 B B FFIVIRDRVFR(CCS)G(NH2) FFIVIRDRVFRXGX 14 T 4 BOFC_N pdbhh F T 6ycx 3 D F A0A2I0BQX1_PLAFO Uncharacterized protein MASDMEEKFREAFILFSSCSDHIEMYKFFELMNSFGIILTNDEKAALPNDINMDYWLNFAKKHYNYEQPFKHINNVNEQNTNVQIKIDNFLGIMKALDTRLTESDLNILLQITNPENKTTLNLKTVSQKLTESI 134 T 0.024 Na_Ca_ex_C unppercent F Eukaryota T 6ycx 4 E G A0A2I0BQX1_PLAFO Uncharacterized protein MASDMEEKFREAFILFSSCSDHIEMYKFFELMNSFGIILTNDEKAALPNDINMDYWLNFAKKHYNYEQPFKHINNVNEQNTNVQIKIDNFLGIMKALDTRLTESDLNILLQITNPENKSTLNLKTVSQKLTESI 134 T 0.024 Na_Ca_ex_C pdbpercent F Eukaryota T 6yf7 1 A,AA,AB,AC,AD,AE,AF,AG,AH,AI,AJ,B,BA,BB,BC,BD,BE,BF,BG,BH,BI,BJ,C,CA,CB,CC,CD,CE,CF,CG,CH,CI,CJ,D,DA,DB,DC,DD,DE,DF,DG,DH,DI,DJ,E,EA,EB,EC,ED,EE,EF,EG,EH,EI,EJ,F,FA,FB,FC,FD,FE,FF,FG,FH,FI,FJ,G,GA,GB,GC,GD,GE,GF,GG,GH,GI,GJ,H,HA,HB,HC,HD,HE,HF,HG,HH,HI,HJ,I,IA,IB,IC,ID,IE,IF,IG,IH,II,IJ,J,JA,JB,JC,JD,JE,JF,JG,JH,JI,JJ,K,KA,KB,KC,KD,KE,KF,KG,KH,KI,L,LA,LB,LC,LD,LE,LF,LG,LH,LI,M,MA,MB,MC,MD,ME,MF,MG,MH,MI,N,NA,NB,NC,ND,NE,NF,NG,NH,NI,O,OA,OB,OC,OD,OE,OF,OG,OH,OI,P,PA,PB,PC,PD,PE,PF,PG,PH,PI,Q,QA,QB,QC,QD,QE,QF,QG,QH,QI,R,RA,RB,RC,RD,RE,RF,RG,RH,RI,S,SA,SB,SC,SD,SE,SF,SG,SH,SI,T,TA,TB,TC,TD,TE,TF,TG,TH,TI,U,UA,UB,UC,UD,UE,UF,UG,UH,UI,V,VA,VB,VC,VD,VE,VF,VG,VH,VI,W,WA,WB,WC,WD,WE,WF,WG,WH,WI,X,XA,XB,XC,XD,XE,XF,XG,XH,XI,Y,YA,YB,YC,YD,YE,YF,YG,YH,YI,Z,ZA,ZB,ZC,ZD,ZE,ZF,ZG,ZH,ZI AA,BA,CA,DA,EA,FA,GA,HA,IA,JA,KA,AB,BB,CB,DB,EB,FB,GB,HB,IB,JB,KB,AC,BC,CC,DC,EC,FC,GC,HC,IC,JC,KC,AD,BD,CD,DD,ED,FD,GD,HD,ID,JD,KD,AE,BE,CE,DE,EE,FE,GE,HE,IE,JE,KE,AF,BF,CF,DF,EF,FF,GF,HF,IF,JF,KF,AG,BG,CG,DG,EG,FG,GG,HG,IG,JG,KG,AH,BH,CH,DH,EH,FH,GH,HH,IH,JH,KH,AI,BI,CI,DI,EI,FI,GI,HI,II,JI,KI,AJ,BJ,CJ,DJ,EJ,FJ,GJ,HJ,IJ,JJ,KJ,AK,BK,CK,DK,EK,FK,GK,HK,IK,JK,AL,BL,CL,DL,EL,FL,GL,HL,IL,JL,AM,BM,CM,DM,EM,FM,GM,HM,IM,JM,AN,BN,CN,DN,EN,FN,GN,HN,IN,JN,AO,BO,CO,DO,EO,FO,GO,HO,IO,JO,AP,BP,CP,DP,EP,FP,GP,HP,IP,JP,AQ,BQ,CQ,DQ,EQ,FQ,GQ,HQ,IQ,JQ,AR,BR,CR,DR,ER,FR,GR,HR,IR,JR,AS,BS,CS,DS,ES,FS,GS,HS,IS,JS,AT,BT,CT,DT,ET,FT,GT,HT,IT,JT,AU,BU,CU,DU,EU,FU,GU,HU,IU,JU,AV,BV,CV,DV,EV,FV,GV,HV,IV,JV,AW,BW,CW,DW,EW,FW,GW,HW,IW,JW,AX,BX,CX,DX,EX,FX,GX,HX,IX,JX,AY,BY,CY,DY,EY,FY,GY,HY,IY,JY,AZ,BZ,CZ,DZ,EZ,FZ,GZ,HZ,IZ,JZ A0A068EP60_9VIRU Coat protein SITKYSESAGPIGQSIYTFTGVTVPAQYMPRLVATTTVNKAGTNIEYKIAVNYPLVSVVDGANVALNTIRANLSFTALQSVINTDEKLRVLDEIVSFITANKANIIDGNVLTVTP 115 T 0.14 Hexokinase_2 pdbpercent T Viruses T 6yf9 1 A,AA,AB,AC,AD,AE,AF,B,BA,BB,BC,BD,BE,BF,C,CA,CB,CC,CD,CE,CF,D,DA,DB,DC,DD,DE,DF,E,EA,EB,EC,ED,EE,EF,F,FA,FB,FC,FD,FE,FF,G,GA,GB,GC,GD,GE,GF,H,HA,HB,HC,HD,HE,HF,I,IA,IB,IC,ID,IE,IF,J,JA,JB,JC,JD,JE,JF,K,KA,KB,KC,KD,KE,KF,L,LA,LB,LC,LD,LE,LF,M,MA,MB,MC,MD,ME,MF,N,NA,NB,NC,ND,NE,NF,O,OA,OB,OC,OD,OE,OF,P,PA,PB,PC,PD,PE,PF,Q,QA,QB,QC,QD,QE,QF,R,RA,RB,RC,RD,RE,RF,S,SA,SB,SC,SD,SE,SF,T,TA,TB,TC,TD,TE,TF,U,UA,UB,UC,UD,UE,UF,V,VA,VB,VC,VD,VE,VF,W,WA,WB,WC,WD,WE,WF,X,XA,XB,XC,XD,XE,XF,Y,YA,YB,YC,YD,YE,Z,ZA,ZB,ZC,ZD,ZE AA,BA,CA,DA,EA,FA,GA,AB,BB,CB,DB,EB,FB,GB,AC,BC,CC,DC,EC,FC,GC,AD,BD,CD,DD,ED,FD,GD,AE,BE,CE,DE,EE,FE,GE,AF,BF,CF,DF,EF,FF,GF,AG,BG,CG,DG,EG,FG,GG,AH,BH,CH,DH,EH,FH,GH,AI,BI,CI,DI,EI,FI,GI,AJ,BJ,CJ,DJ,EJ,FJ,GJ,AK,BK,CK,DK,EK,FK,GK,AL,BL,CL,DL,EL,FL,GL,AM,BM,CM,DM,EM,FM,GM,AN,BN,CN,DN,EN,FN,GN,AO,BO,CO,DO,EO,FO,GO,AP,BP,CP,DP,EP,FP,GP,AQ,BQ,CQ,DQ,EQ,FQ,GQ,AR,BR,CR,DR,ER,FR,GR,AS,BS,CS,DS,ES,FS,GS,AT,BT,CT,DT,ET,FT,GT,AU,BU,CU,DU,EU,FU,GU,AV,BV,CV,DV,EV,FV,GV,AW,BW,CW,DW,EW,FW,GW,AX,BX,CX,DX,EX,FX,GX,AY,BY,CY,DY,EY,FY,AZ,BZ,CZ,DZ,EZ,FZ coat protein AAPSLALVGANSTLASTLVNYSLRSQNGNNVDYVCTDPDSTLSAPGLINAKFDIKAPGITGNDRIHANLRKVVLDEKTNLPSTGSVTIQVSIPRNPAWNASMTVSLLKQAADYLAGTSATVSGQTDTSGFPAKWAGLMFP 140 T 0.3 CLP1_P pdb F T 6yfa 1 A,AA,AB,B,BA,BB,C,CA,CB,D,DA,DB,E,EA,EB,F,FA,FB,G,GA,GB,H,HA,HB,I,IA,J,JA,K,KA,L,LA,M,MA,N,NA,O,OA,P,PA,Q,QA,R,RA,S,SA,T,TA,U,UA,V,VA,W,WA,X,XA,Y,YA,Z,ZA AA,BA,CA,AB,BB,CB,AC,BC,CC,AD,BD,CD,AE,BE,CE,AF,BF,CF,AG,BG,CG,AH,BH,CH,AI,BI,AJ,BJ,AK,BK,AL,BL,AM,BM,AN,BN,AO,BO,AP,BP,AQ,BQ,AR,BR,AS,BS,AT,BT,AU,BU,AV,BV,AW,BW,AX,BX,AY,BY,AZ,BZ coat protein SKILSTNNSNSNFVDTSFTLKVPVYSKDYRVTQDEPDEVVVANRQQPFGVKNTARYGIRQIADVYRNTTIDRAYQSPSKKGTSLVVQVTETWTVASTDDETYGYSLPFSAHVIVNVPQDALITEEILYDALKRLMGHFYEGNDTTSPTTTSVRLKDMLQGALVPQSL 167 T 34 DUF3626 pdbhh F T 6yfb 1 A,AA,AB,AC,AD,AE,AF,AG,AH,B,BA,BB,BC,BD,BE,BF,BG,BH,C,CA,CB,CC,CD,CE,CF,CG,D,DA,DB,DC,DD,DE,DF,DG,E,EA,EB,EC,ED,EE,EF,EG,F,FA,FB,FC,FD,FE,FF,FG,G,GA,GB,GC,GD,GE,GF,GG,H,HA,HB,HC,HD,HE,HF,HG,I,IA,IB,IC,ID,IE,IF,IG,J,JA,JB,JC,JD,JE,JF,JG,K,KA,KB,KC,KD,KE,KF,KG,L,LA,LB,LC,LD,LE,LF,LG,M,MA,MB,MC,MD,ME,MF,MG,N,NA,NB,NC,ND,NE,NF,NG,O,OA,OB,OC,OD,OE,OF,OG,P,PA,PB,PC,PD,PE,PF,PG,Q,QA,QB,QC,QD,QE,QF,QG,R,RA,RB,RC,RD,RE,RF,RG,S,SA,SB,SC,SD,SE,SF,SG,T,TA,TB,TC,TD,TE,TF,TG,U,UA,UB,UC,UD,UE,UF,UG,V,VA,VB,VC,VD,VE,VF,VG,W,WA,WB,WC,WD,WE,WF,WG,X,XA,XB,XC,XD,XE,XF,XG,Y,YA,YB,YC,YD,YE,YF,YG,Z,ZA,ZB,ZC,ZD,ZE,ZF,ZG AA,BA,CA,DA,EA,FA,GA,HA,IA,AB,BB,CB,DB,EB,FB,GB,HB,IB,AC,BC,CC,DC,EC,FC,GC,HC,AD,BD,CD,DD,ED,FD,GD,HD,AE,BE,CE,DE,EE,FE,GE,HE,AF,BF,CF,DF,EF,FF,GF,HF,AG,BG,CG,DG,EG,FG,GG,HG,AH,BH,CH,DH,EH,FH,GH,HH,AI,BI,CI,DI,EI,FI,GI,HI,AJ,BJ,CJ,DJ,EJ,FJ,GJ,HJ,AK,BK,CK,DK,EK,FK,GK,HK,AL,BL,CL,DL,EL,FL,GL,HL,AM,BM,CM,DM,EM,FM,GM,HM,AN,BN,CN,DN,EN,FN,GN,HN,AO,BO,CO,DO,EO,FO,GO,HO,AP,BP,CP,DP,EP,FP,GP,HP,AQ,BQ,CQ,DQ,EQ,FQ,GQ,HQ,AR,BR,CR,DR,ER,FR,GR,HR,AS,BS,CS,DS,ES,FS,GS,HS,AT,BT,CT,DT,ET,FT,GT,HT,AU,BU,CU,DU,EU,FU,GU,HU,AV,BV,CV,DV,EV,FV,GV,HV,AW,BW,CW,DW,EW,FW,GW,HW,AX,BX,CX,DX,EX,FX,GX,HX,AY,BY,CY,DY,EY,FY,GY,HY,AZ,BZ,CZ,DZ,EZ,FZ,GZ,HZ coat protein SYTQSFGYTIPTEKDTLEIPQYQALLAKKASYMDDSQGKNTATYMNTAAPKDQPETITFGVNKVDNVYKQSNVQNQTFYASSSKGTKIRIDGKRIWRTQSTDVNTGLPVIVDCPLWTSFTLGFADFTLVDDSARKSTIEWMISQLELLKDDGVWSKLCSGVTRIYG 166 T 7.3 DUF4325 pdbhh F T 6yfc 1 A,AA,AB,AC,AD,AE,AF,B,BA,BB,BC,BD,BE,BF,C,CA,CB,CC,CD,CE,CF,D,DA,DB,DC,DD,DE,DF,E,EA,EB,EC,ED,EE,EF,F,FA,FB,FC,FD,FE,FF,G,GA,GB,GC,GD,GE,GF,H,HA,HB,HC,HD,HE,HF,I,IA,IB,IC,ID,IE,IF,J,JA,JB,JC,JD,JE,JF,K,KA,KB,KC,KD,KE,KF,L,LA,LB,LC,LD,LE,LF,M,MA,MB,MC,MD,ME,MF,N,NA,NB,NC,ND,NE,NF,O,OA,OB,OC,OD,OE,OF,P,PA,PB,PC,PD,PE,PF,Q,QA,QB,QC,QD,QE,QF,R,RA,RB,RC,RD,RE,RF,S,SA,SB,SC,SD,SE,SF,T,TA,TB,TC,TD,TE,TF,U,UA,UB,UC,UD,UE,UF,V,VA,VB,VC,VD,VE,VF,W,WA,WB,WC,WD,WE,WF,X,XA,XB,XC,XD,XE,XF,Y,YA,YB,YC,YD,YE,Z,ZA,ZB,ZC,ZD,ZE AA,BA,CA,DA,EA,FA,GA,AB,BB,CB,DB,EB,FB,GB,AC,BC,CC,DC,EC,FC,GC,AD,BD,CD,DD,ED,FD,GD,AE,BE,CE,DE,EE,FE,GE,AF,BF,CF,DF,EF,FF,GF,AG,BG,CG,DG,EG,FG,GG,AH,BH,CH,DH,EH,FH,GH,AI,BI,CI,DI,EI,FI,GI,AJ,BJ,CJ,DJ,EJ,FJ,GJ,AK,BK,CK,DK,EK,FK,GK,AL,BL,CL,DL,EL,FL,GL,AM,BM,CM,DM,EM,FM,GM,AN,BN,CN,DN,EN,FN,GN,AO,BO,CO,DO,EO,FO,GO,AP,BP,CP,DP,EP,FP,GP,AQ,BQ,CQ,DQ,EQ,FQ,GQ,AR,BR,CR,DR,ER,FR,GR,AS,BS,CS,DS,ES,FS,GS,AT,BT,CT,DT,ET,FT,GT,AU,BU,CU,DU,EU,FU,GU,AV,BV,CV,DV,EV,FV,GV,AW,BW,CW,DW,EW,FW,GW,AX,BX,CX,DX,EX,FX,GX,AY,BY,CY,DY,EY,FY,AZ,BZ,CZ,DZ,EZ,FZ coat protein MRLTDVDLTVGEETREYAVSEQQGTLFRFVDKSGTVANNTGVFSLEQRFGAANSNRKVTMLLTDPVVVKDASGADMTIKANASVTFSLPKTYPNEHITKLRQTLIAWLGQQCVSDPVDSGLNNY 124 T 0.0063 Phage_coat pdbhh F T 6yfd 1 A,AA,AB,AC,B,BA,BB,BC,C,CA,CB,CC,D,DA,DB,DC,E,EA,EB,EC,F,FA,FB,FC,G,GA,GB,GC,H,HA,HB,HC,I,IA,IB,IC,J,JA,JB,JC,K,KA,KB,KC,L,LA,LB,LC,M,MA,MB,N,NA,NB,O,OA,OB,P,PA,PB,Q,QA,QB,R,RA,RB,S,SA,SB,T,TA,TB,U,UA,UB,V,VA,VB,W,WA,WB,X,XA,XB,Y,YA,YB,Z,ZA,ZB AA,BA,CA,DA,AB,BB,CB,DB,AC,BC,CC,DC,AD,BD,CD,DD,AE,BE,CE,DE,AF,BF,CF,DF,AG,BG,CG,DG,AH,BH,CH,DH,AI,BI,CI,DI,AJ,BJ,CJ,DJ,AK,BK,CK,DK,AL,BL,CL,DL,AM,BM,CM,AN,BN,CN,AO,BO,CO,AP,BP,CP,AQ,BQ,CQ,AR,BR,CR,AS,BS,CS,AT,BT,CT,AU,BU,CU,AV,BV,CV,AW,BW,CW,AX,BX,CX,AY,BY,CY,AZ,BZ,CZ coat protein TKRNRNNQARGQLYMGQQGPVQSSRTTFGVNPDRQANARPVYLAPAAPMENTYTYLGSIQFAAGRHIFGEPASNVLPPQNIVPGVPTKHGEYVTTNTGDRLMASSTTVTRDVSNGRTKVSIDIPYYDRNAVETLKASAIPGAVAPVGSFKVNVEVLGGGVLTGTDANAQFALDELLSNMLMDAARIAQDGPKNTARLVAASHGVMPQA 208 T 21 FeS_assembly_P pdbhh F T 6yfg 1 A,AA,AB,AC,AD,AE,AF,AG,AH,AI,AJ,AK,AL,AM,B,BA,BB,BC,BD,BE,BF,BG,BH,BI,BJ,BK,BL,BM,C,CA,CB,CC,CD,CE,CF,CG,CH,CI,CJ,CK,CL,CM,D,DA,DB,DC,DD,DE,DF,DG,DH,DI,DJ,DK,DL,DM,E,EA,EB,EC,ED,EE,EF,EG,EH,EI,EJ,EK,EL,EM,F,FA,FB,FC,FD,FE,FF,FG,FH,FI,FJ,FK,FL,FM,G,GA,GB,GC,GD,GE,GF,GG,GH,GI,GJ,GK,GL,GM,H,HA,HB,HC,HD,HE,HF,HG,HH,HI,HJ,HK,HL,HM,I,IA,IB,IC,ID,IE,IF,IG,IH,II,IJ,IK,IL,IM,J,JA,JB,JC,JD,JE,JF,JG,JH,JI,JJ,JK,JL,JM,K,KA,KB,KC,KD,KE,KF,KG,KH,KI,KJ,KK,KL,KM,L,LA,LB,LC,LD,LE,LF,LG,LH,LI,LJ,LK,LL,LM,M,MA,MB,MC,MD,ME,MF,MG,MH,MI,MJ,MK,ML,MM,N,NA,NB,NC,ND,NE,NF,NG,NH,NI,NJ,NK,NL,NM,O,OA,OB,OC,OD,OE,OF,OG,OH,OI,OJ,OK,OL,OM,P,PA,PB,PC,PD,PE,PF,PG,PH,PI,PJ,PK,PL,PM,Q,QA,QB,QC,QD,QE,QF,QG,QH,QI,QJ,QK,QL,QM,R,RA,RB,RC,RD,RE,RF,RG,RH,RI,RJ,RK,RL,RM,S,SA,SB,SC,SD,SE,SF,SG,SH,SI,SJ,SK,SL,SM,T,TA,TB,TC,TD,TE,TF,TG,TH,TI,TJ,TK,TL,TM,U,UA,UB,UC,UD,UE,UF,UG,UH,UI,UJ,UK,UL,UM,V,VA,VB,VC,VD,VE,VF,VG,VH,VI,VJ,VK,VL,VM,W,WA,WB,WC,WD,WE,WF,WG,WH,WI,WJ,WK,WL,X,XA,XB,XC,XD,XE,XF,XG,XH,XI,XJ,XK,XL,Y,YA,YB,YC,YD,YE,YF,YG,YH,YI,YJ,YK,YL,Z,ZA,ZB,ZC,ZD,ZE,ZF,ZG,ZH,ZI,ZJ,ZK,ZL AA,BA,CA,DA,EA,FA,GA,HA,IA,JA,KA,LA,MA,NA,AB,BB,CB,DB,EB,FB,GB,HB,IB,JB,KB,LB,MB,NB,AC,BC,CC,DC,EC,FC,GC,HC,IC,JC,KC,LC,MC,NC,AD,BD,CD,DD,ED,FD,GD,HD,ID,JD,KD,LD,MD,ND,AE,BE,CE,DE,EE,FE,GE,HE,IE,JE,KE,LE,ME,NE,AF,BF,CF,DF,EF,FF,GF,HF,IF,JF,KF,LF,MF,NF,AG,BG,CG,DG,EG,FG,GG,HG,IG,JG,KG,LG,MG,NG,AH,BH,CH,DH,EH,FH,GH,HH,IH,JH,KH,LH,MH,NH,AI,BI,CI,DI,EI,FI,GI,HI,II,JI,KI,LI,MI,NI,AJ,BJ,CJ,DJ,EJ,FJ,GJ,HJ,IJ,JJ,KJ,LJ,MJ,NJ,AK,BK,CK,DK,EK,FK,GK,HK,IK,JK,KK,LK,MK,NK,AL,BL,CL,DL,EL,FL,GL,HL,IL,JL,KL,LL,ML,NL,AM,BM,CM,DM,EM,FM,GM,HM,IM,JM,KM,LM,MM,NM,AN,BN,CN,DN,EN,FN,GN,HN,IN,JN,KN,LN,MN,NN,AO,BO,CO,DO,EO,FO,GO,HO,IO,JO,KO,LO,MO,NO,AP,BP,CP,DP,EP,FP,GP,HP,IP,JP,KP,LP,MP,NP,AQ,BQ,CQ,DQ,EQ,FQ,GQ,HQ,IQ,JQ,KQ,LQ,MQ,NQ,AR,BR,CR,DR,ER,FR,GR,HR,IR,JR,KR,LR,MR,NR,AS,BS,CS,DS,ES,FS,GS,HS,IS,JS,KS,LS,MS,NS,AT,BT,CT,DT,ET,FT,GT,HT,IT,JT,KT,LT,MT,NT,AU,BU,CU,DU,EU,FU,GU,HU,IU,JU,KU,LU,MU,NU,AV,BV,CV,DV,EV,FV,GV,HV,IV,JV,KV,LV,MV,NV,AW,BW,CW,DW,EW,FW,GW,HW,IW,JW,KW,LW,MW,AX,BX,CX,DX,EX,FX,GX,HX,IX,JX,KX,LX,MX,AY,BY,CY,DY,EY,FY,GY,HY,IY,JY,KY,LY,MY,AZ,BZ,CZ,DZ,EZ,FZ,GZ,HZ,IZ,JZ,KZ,LZ,MZ coat protein SKPIAIFKLRELSSDSTLFTLPGHSVTLPNTLGIVSHLPTPRKGNPGTVKTMRNLRKTILLGAGTASERAVPIVIKTETSFPVGTTEEDRAEVLKQMASFLIEEVKNNQELAYSGYVQDKYFIEDLVITE 130 T 0.077 AAA_11 pdbhh F T 6yfj 1 A,AA,AB,B,BA,BB,C,CA,CB,D,DA,DB,E,EA,EB,F,FA,FB,G,GA,GB,H,HA,HB,I,IA,J,JA,K,KA,L,LA,M,MA,N,NA,O,OA,P,PA,Q,QA,R,RA,S,SA,T,TA,U,UA,V,VA,W,WA,X,XA,Y,YA,Z,ZA AA,BA,CA,AB,BB,CB,AC,BC,CC,AD,BD,CD,AE,BE,CE,AF,BF,CF,AG,BG,CG,AH,BH,CH,AI,BI,AJ,BJ,AK,BK,AL,BL,AM,BM,AN,BN,AO,BO,AP,BP,AQ,BQ,AR,BR,AS,BS,AT,BT,AU,BU,AV,BV,AW,BW,AX,BX,AY,BY,AZ,BZ coat protein AYKLIKMAGGNSAIQTYAREDKTTQTLSTQKTISVLRNGSTSTRIIKVHINSTAPVTINTCDPTKCGPTVPMGVSFKSSMPEDADPAEVLKAAKAALALFEANLNSAFNKNVDEISVA 118 T 0.14 MRF_C2 pdb F T 6yfl 1 A,AA,AB,B,BA,BB,C,CA,CB,D,DA,DB,E,EA,EB,F,FA,FB,G,GA,GB,H,HA,HB,I,IA,J,JA,K,KA,L,LA,M,MA,N,NA,O,OA,P,PA,Q,QA,R,RA,S,SA,T,TA,U,UA,V,VA,W,WA,X,XA,Y,YA,Z,ZA AA,BA,CA,AB,BB,CB,AC,BC,CC,AD,BD,CD,AE,BE,CE,AF,BF,CF,AG,BG,CG,AH,BH,CH,AI,BI,AJ,BJ,AK,BK,AL,BL,AM,BM,AN,BN,AO,BO,AP,BP,AQ,BQ,AR,BR,AS,BS,AT,BT,AU,BU,AV,BV,AW,BW,AX,BX,AY,BY,AZ,BZ coat protein AYSPSTPVTGAAQTGFTSPTYTLTSDTAPTALGKQHAVTATGGTQTGVTTHSVSSPFTITFTRPKTMKTVGVPNSNGVITNIGRNTYGFLVRKGVIPAVNQSPQVMLVRVEISVPAGADTYDAANVKAALSAAIGVLSQQSAGIGDTALSGIL 153 T 3.2 Lin0512_fam pdbhh F T 6yfm 1 A,AA,B,BA,C,CA,D,DA,E,EA,F,FA,G,GA,H,HA,I,IA,J,JA,K,KA,L,LA,M,MA,N,NA,O,OA,P,PA,Q,QA,R,RA,S,SA,T,U,V,W,X,Y,Z AA,BA,AB,BB,AC,BC,AD,BD,AE,BE,AF,BF,AG,BG,AH,BH,AI,BI,AJ,BJ,AK,BK,AL,BL,AM,BM,AN,BN,AO,BO,AP,BP,AQ,BQ,AR,BR,AS,BS,AT,AU,AV,AW,AX,AY,AZ coat protein SSQANITVFDGAATPVSHVLVPLGVGIDENLGSVAKWRENLATVPLYANVRVTTMQKKLKSGIERVEIRVEVPVMEAVSGQNAFGYTAAPKVAFTDSGSFVGYFSERSAQSNRRLVKQILTNLLGNVSTSVAAPTTGFASELIDSGITAS 150 F F T 6yfo 1 A,AA,AB,AC,B,BA,BB,BC,C,CA,CB,CC,D,DA,DB,DC,E,EA,EB,EC,F,FA,FB,FC,G,GA,GB,GC,H,HA,HB,HC,I,IA,IB,IC,J,JA,JB,JC,K,KA,KB,KC,L,LA,LB,LC,M,MA,MB,N,NA,NB,O,OA,OB,P,PA,PB,Q,QA,QB,R,RA,RB,S,SA,SB,T,TA,TB,U,UA,UB,V,VA,VB,W,WA,WB,X,XA,XB,Y,YA,YB,Z,ZA,ZB AA,BA,CA,DA,AB,BB,CB,DB,AC,BC,CC,DC,AD,BD,CD,DD,AE,BE,CE,DE,AF,BF,CF,DF,AG,BG,CG,DG,AH,BH,CH,DH,AI,BI,CI,DI,AJ,BJ,CJ,DJ,AK,BK,CK,DK,AL,BL,CL,DL,AM,BM,CM,AN,BN,CN,AO,BO,CO,AP,BP,CP,AQ,BQ,CQ,AR,BR,CR,AS,BS,CS,AT,BT,CT,AU,BU,CU,AV,BV,CV,AW,BW,CW,AX,BX,CX,AY,BY,CY,AZ,BZ,CZ coat protein SIIGSSIKTGATSASITGGSDITFALTGQTVTNGLNVSVSEDTDYRTRRNATFKSRVPTVVNGNYSKGKNEVVFVIPMSLDSGETVFNSVRIALEIHPALASASVKDLRLIGAQLLTDADYDSFWTLGALA 131 T 0.044 RRXRR pdbpercent F T 6yfp 1 A,AA,AB,AC,B,BA,BB,BC,C,CA,CB,CC,D,DA,DB,DC,E,EA,EB,EC,F,FA,FB,FC,G,GA,GB,GC,H,HA,HB,HC,I,IA,IB,IC,J,JA,JB,JC,K,KA,KB,KC,L,LA,LB,LC,M,MA,MB,N,NA,NB,O,OA,OB,P,PA,PB,Q,QA,QB,R,RA,RB,S,SA,SB,T,TA,TB,U,UA,UB,V,VA,VB,W,WA,WB,X,XA,XB,Y,YA,YB,Z,ZA,ZB AA,BA,CA,DA,AB,BB,CB,DB,AC,BC,CC,DC,AD,BD,CD,DD,AE,BE,CE,DE,AF,BF,CF,DF,AG,BG,CG,DG,AH,BH,CH,DH,AI,BI,CI,DI,AJ,BJ,CJ,DJ,AK,BK,CK,DK,AL,BL,CL,DL,AM,BM,CM,AN,BN,CN,AO,BO,CO,AP,BP,CP,AQ,BQ,CQ,AR,BR,CR,AS,BS,CS,AT,BT,CT,AU,BU,CU,AV,BV,CV,AW,BW,CW,AX,BX,CX,AY,BY,CY,AZ,BZ,CZ coat protein SYTIDINCSTGDTQANLVLTEIPAEPYVHVSGDNKSTIEYLDTGSDNSLLVRPTQQFNCVSSQYPYRNYSKIPRSQQDPLAVRREFYTRRVEYWRKADASNVDAPEYTLPQSCSIRLASTVTKETTAADIAGIVLRTLAPIFPNGSGDWIKLQQLIDGLPRIFG 164 T 5.2 CtsR pdbhh F T 6yfr 1 A,AA,AB,B,BA,BB,C,CA,CB,D,DA,DB,E,EA,EB,F,FA,FB,G,GA,GB,H,HA,HB,I,IA,J,JA,K,KA,L,LA,M,MA,N,NA,O,OA,P,PA,Q,QA,R,RA,S,SA,T,TA,U,UA,V,VA,W,WA,X,XA,Y,YA,Z,ZA AA,BA,CA,AB,BB,CB,AC,BC,CC,AD,BD,CD,AE,BE,CE,AF,BF,CF,AG,BG,CG,AH,BH,CH,AI,BI,AJ,BJ,AK,BK,AL,BL,AM,BM,AN,BN,AO,BO,AP,BP,AQ,BQ,AR,BR,AS,BS,AT,BT,AU,BU,AV,BV,AW,BW,AX,BX,AY,BY,AZ,BZ coat protein ASLPVTQYSPPVTPLGKSTWNVTGSTNPPGLVPQVVQTESINARKSNIMSKISVYYYIPSTNSVSCCTEWDTIRCEFSLTLLQLSSNTDVAARTVDVLDTMISFLAKRRNSILAGNLLLPDNP 123 T 0.18 CHB_HEX_C pdbpercent F T 6yfs 1 A,AA,AB,AC,B,BA,BB,BC,C,CA,CB,CC,D,DA,DB,DC,E,EA,EB,EC,F,FA,FB,FC,G,GA,GB,GC,H,HA,HB,HC,I,IA,IB,IC,J,JA,JB,JC,K,KA,KB,KC,L,LA,LB,LC,M,MA,MB,N,NA,NB,O,OA,OB,P,PA,PB,Q,QA,QB,R,RA,RB,S,SA,SB,T,TA,TB,U,UA,UB,V,VA,VB,W,WA,WB,X,XA,XB,Y,YA,YB,Z,ZA,ZB AA,BA,CA,DA,AB,BB,CB,DB,AC,BC,CC,DC,AD,BD,CD,DD,AE,BE,CE,DE,AF,BF,CF,DF,AG,BG,CG,DG,AH,BH,CH,DH,AI,BI,CI,DI,AJ,BJ,CJ,DJ,AK,BK,CK,DK,AL,BL,CL,DL,AM,BM,CM,AN,BN,CN,AO,BO,CO,AP,BP,CP,AQ,BQ,CQ,AR,BR,CR,AS,BS,CS,AT,BT,CT,AU,BU,CU,AV,BV,CV,AW,BW,CW,AX,BX,CX,AY,BY,CY,AZ,BZ,CZ coat protein AQHNMRLQLTSGTSLTWVDPNDFRSTFRINLNVNQKVAGAVSVYNARSEVITNRAPLVVIEGCTDACSVNRENISIRTTISGSVENKAAVLAALLDHLHNLGLARDDLVAGLLPTTIQPVVEYTGS 126 T 0.005 YopH_N pdb F T 6yft 1 A,AA,AB,AC,AD,AE,AF,AG,AH,AI,AJ,AK,AL,AM,B,BA,BB,BC,BD,BE,BF,BG,BH,BI,BJ,BK,BL,BM,C,CA,CB,CC,CD,CE,CF,CG,CH,CI,CJ,CK,CL,CM,D,DA,DB,DC,DD,DE,DF,DG,DH,DI,DJ,DK,DL,DM,E,EA,EB,EC,ED,EE,EF,EG,EH,EI,EJ,EK,EL,EM,F,FA,FB,FC,FD,FE,FF,FG,FH,FI,FJ,FK,FL,FM,G,GA,GB,GC,GD,GE,GF,GG,GH,GI,GJ,GK,GL,GM,H,HA,HB,HC,HD,HE,HF,HG,HH,HI,HJ,HK,HL,HM,I,IA,IB,IC,ID,IE,IF,IG,IH,II,IJ,IK,IL,IM,J,JA,JB,JC,JD,JE,JF,JG,JH,JI,JJ,JK,JL,JM,K,KA,KB,KC,KD,KE,KF,KG,KH,KI,KJ,KK,KL,KM,L,LA,LB,LC,LD,LE,LF,LG,LH,LI,LJ,LK,LL,LM,M,MA,MB,MC,MD,ME,MF,MG,MH,MI,MJ,MK,ML,MM,N,NA,NB,NC,ND,NE,NF,NG,NH,NI,NJ,NK,NL,NM,O,OA,OB,OC,OD,OE,OF,OG,OH,OI,OJ,OK,OL,OM,P,PA,PB,PC,PD,PE,PF,PG,PH,PI,PJ,PK,PL,PM,Q,QA,QB,QC,QD,QE,QF,QG,QH,QI,QJ,QK,QL,QM,R,RA,RB,RC,RD,RE,RF,RG,RH,RI,RJ,RK,RL,RM,S,SA,SB,SC,SD,SE,SF,SG,SH,SI,SJ,SK,SL,SM,T,TA,TB,TC,TD,TE,TF,TG,TH,TI,TJ,TK,TL,TM,U,UA,UB,UC,UD,UE,UF,UG,UH,UI,UJ,UK,UL,UM,V,VA,VB,VC,VD,VE,VF,VG,VH,VI,VJ,VK,VL,VM,W,WA,WB,WC,WD,WE,WF,WG,WH,WI,WJ,WK,WL,X,XA,XB,XC,XD,XE,XF,XG,XH,XI,XJ,XK,XL,Y,YA,YB,YC,YD,YE,YF,YG,YH,YI,YJ,YK,YL,Z,ZA,ZB,ZC,ZD,ZE,ZF,ZG,ZH,ZI,ZJ,ZK,ZL AA,BA,CA,DA,EA,FA,GA,HA,IA,JA,KA,LA,MA,NA,AB,BB,CB,DB,EB,FB,GB,HB,IB,JB,KB,LB,MB,NB,AC,BC,CC,DC,EC,FC,GC,HC,IC,JC,KC,LC,MC,NC,AD,BD,CD,DD,ED,FD,GD,HD,ID,JD,KD,LD,MD,ND,AE,BE,CE,DE,EE,FE,GE,HE,IE,JE,KE,LE,ME,NE,AF,BF,CF,DF,EF,FF,GF,HF,IF,JF,KF,LF,MF,NF,AG,BG,CG,DG,EG,FG,GG,HG,IG,JG,KG,LG,MG,NG,AH,BH,CH,DH,EH,FH,GH,HH,IH,JH,KH,LH,MH,NH,AI,BI,CI,DI,EI,FI,GI,HI,II,JI,KI,LI,MI,NI,AJ,BJ,CJ,DJ,EJ,FJ,GJ,HJ,IJ,JJ,KJ,LJ,MJ,NJ,AK,BK,CK,DK,EK,FK,GK,HK,IK,JK,KK,LK,MK,NK,AL,BL,CL,DL,EL,FL,GL,HL,IL,JL,KL,LL,ML,NL,AM,BM,CM,DM,EM,FM,GM,HM,IM,JM,KM,LM,MM,NM,AN,BN,CN,DN,EN,FN,GN,HN,IN,JN,KN,LN,MN,NN,AO,BO,CO,DO,EO,FO,GO,HO,IO,JO,KO,LO,MO,NO,AP,BP,CP,DP,EP,FP,GP,HP,IP,JP,KP,LP,MP,NP,AQ,BQ,CQ,DQ,EQ,FQ,GQ,HQ,IQ,JQ,KQ,LQ,MQ,NQ,AR,BR,CR,DR,ER,FR,GR,HR,IR,JR,KR,LR,MR,NR,AS,BS,CS,DS,ES,FS,GS,HS,IS,JS,KS,LS,MS,NS,AT,BT,CT,DT,ET,FT,GT,HT,IT,JT,KT,LT,MT,NT,AU,BU,CU,DU,EU,FU,GU,HU,IU,JU,KU,LU,MU,NU,AV,BV,CV,DV,EV,FV,GV,HV,IV,JV,KV,LV,MV,NV,AW,BW,CW,DW,EW,FW,GW,HW,IW,JW,KW,LW,MW,AX,BX,CX,DX,EX,FX,GX,HX,IX,JX,KX,LX,MX,AY,BY,CY,DY,EY,FY,GY,HY,IY,JY,KY,LY,MY,AZ,BZ,CZ,DZ,EZ,FZ,GZ,HZ,IZ,JZ,KZ,LZ,MZ coat protein STFSSLVIGSNTFIPTAPGYYSLSTRGFSDPRNQIKISGGKFNAKTGRVTAAVSRLWETDVTVAGLPVRSAAEVAIIMTLGRGITATNADVLLSDLNTLLDPARLDQILQGGF 113 T 0.096 DUF1194 pdb F T 6yfu 1 A,AA,AB,B,BA,BB,C,CA,CB,D,DA,DB,E,EA,EB,F,FA,FB,G,GA,GB,H,HA,HB,I,IA,J,JA,K,KA,L,LA,M,MA,N,NA,O,OA,P,PA,Q,QA,R,RA,S,SA,T,TA,U,UA,V,VA,W,WA,X,XA,Y,YA,Z,ZA AA,BA,CA,AB,BB,CB,AC,BC,CC,AD,BD,CD,AE,BE,CE,AF,BF,CF,AG,BG,CG,AH,BH,CH,AI,BI,AJ,BJ,AK,BK,AL,BL,AM,BM,AN,BN,AO,BO,AP,BP,AQ,BQ,AR,BR,AS,BS,AT,BT,AU,BU,AV,BV,AW,BW,AX,BX,AY,BY,AZ,BZ coat protein SIKYIFKKTDTLPRSVIGNVLRTTGPDTTVYSLPGHTPVNPFTLTAVSRLPVPRKGNAGTTKTTLSLRREVTINKGTDQEKIVPMIARIETSVPVGVSQDDFKAMIEGLACPLLLDEIHVNDLFLSGLPIATTDVPDNEPLPPALL 146 T 0.13 AAA_11 pdbhh F T 6ygc 4 D D ARL3_YEAST ARF-LIKE GTPASE 3 MFHLVGSRRR 10 T 0.077 TniB unppercent F Eukaryota T 6yh0 2 B EEE PRO-VAL-PRO-ARG PVPRAHS 7 T 83 DUF4462 pdbhh F T 6yia 2 B P SMAD2_HUMAN SMAD2 XWPSVRCSSMS 11 T 5.3 DUF5466 pdbhh F Eukaryota T 6yib 2 B P SMAD3_HUMAN SMAD3 XWPSIRCSSVS 11 T 4.7 Peptidase_Prp pdbhh F Eukaryota T 6yj4 34 HA h Q6C1R9_YARLI subunit NUNM of protein NADH:Ubiquinone Oxidoreductase (Complex I) MLRHTVRATQTLRQARNVRFGSHGHGPELTPAVPFFQPYVLKWAGVSLGLVAFYQFNSSYEAKNGHTWVETFFHPKSREDILNEEAKIVQALNNQRELTIKMHELKREEKDYAHSYSPLFSDPVPQGGSIGKAPGSSRE 139 T 0.033 DUF5950 pdb F Eukaryota T 6ylu 2 B B BLNK_HUMAN BLNKpT152 ARLTSTLPALTA 12 T 1.8 DUF1685 pdbhh F Eukaryota T 6ymx 12 L m COX26_YEAST Cytochrome c oxidase subunit 26, mitochondrial ESWVITEGRRLIPEIFQWSAVLSVCLGWPGAVYFFSKA 38 T 0.077 Phage_holin_Dp1 unppercent F Eukaryota T 6yn0 2 B B FTSN_ECOLI Cell division protein FtsN LPPKPEERWRYIKELESRQ 19 T 0.54 TFIIA unp F Bacteria T 6yn1 5 DA,E,IA,J,NA,O,T,Y d,E,i,J,n,O,T,Y APLF_HUMAN APURINIC-APYRIMIDINIC ENDONUCLEASE APLF,PNK AND APTX-LIKE FHA DOMAIN-CONTAINING PROTEIN,XRCC1-INTERACTING PROTEIN 1 GLDEDNDNVGQPNEYDLNDSFLDDEEEDYEPTDEDSDWEPGKE 43 T 0.00014 HUN pdbpercent F Eukaryota T 6yns 2 AA,BA,CA,DA,EA,FA,GA,HA,IA,JA,M,N,O,P,Q,R,S,T,U,V,W,X,Y,Z c,d,e,f,g,h,i,j,k,l,N,O,Q,R,S,T,U,V,W,X,Y,Z,a,b CYAA_BORPE CYCLOLYSIN WGQRALQGAQAVAAAQRLVHAIAL 24 T 3.3 Neuropep_like pdbhh F Bacteria T 6ynw 4 M e I7MMW3_TETTS subunit epsilon MCIEFAFKKAGIPIVRNFLHSTEGVIYGLPQRVQRNLAINYTVKQYKEGKAVSAKTIKTLQEAFPSKGDTK 71 T 0.062 YopH_N pdbpercent F Eukaryota T 6ynx 1 A,T a,A Q951C1_TETTH subunit a MGRENVLPVHNDVYEDFVFTTPYFQPESTFKSVPKLFSDILLGGVEWVYTTSESVLAYDYKLWYLWSGVSNLDESFDMFFNQYWALSLSTSVFQLFYAVILDRYLSVLFQNTPYTNDWFRMMLHSKETALIWLYHPELSWHINGLNQFFTYFYGGILEFVYFDKSNPDMCILVHTLWIHLLILFLIFTGFVTILFSFYGNPNTEENTIDSDYLAASGTVEAEKEITSIDDYLGLVFAIAYVFGVFFYVHGWTSMLSHAVLLLSCYSIIIMFLFILGMPTLLLYDFGIFFLAYLKGAGKYISSVAEMMFDYTACLVFYIRILAQWIRVVLMVVTFISLSHYVSDFDITNSALIGSENQSDSMNELNTNFSMTYYILTVLPGKFIYWIYEILHTFFVVCSQFVAFFAIVFWLFLFLYTFFIIEKHEDFFSKKREERKKKLKELWNLKN 446 T 5.9 ETRAMP pdbpercent F Eukaryota T 6ynx 4 D,W f,F Q24I07_TETTS subunit f MSLHEKMQTDYLWVKDHSQADSWAKARTHGYNYIAHTVPNKKERYEMIWRSMGKSTDWELEKFRLGKKFPDRGNKRRWFKNLFRLIKNPMGYIFWKTYKARLAKPSLIVTSMFIGFTLGFIKLKAQSIAYSKKQYATLRAGKNIEGSGQVHFGYHDQKWGMPAIPMFQLMYYELPGNSIVVNPCRNQNYRLYFEMRKKLGILPA 204 T 4 DUF6249 pdbhh F Eukaryota T 6ynx 5 E,X i,I I7LZW2_TETTS subunit i/j MNPIQKAWLKILEPVSYVINEKMAKRTGIIGKLGRFFAIGPREYGVHPINRMFIFMNRKYMAFQAVALHRYSFVKSLTHNGFHMLRVFRHFAFVLPATVLAGLGLFVYWGDDNKCYSPDRFPYLKKRAGDMALPLNSLNQRTSAHYIEINAIYGAEMMKRYHKVWENIIEERSKATDQEKKTRYAHPSYQYSPLPVVSIPNVLNPLNLQ 209 T 11 mit_SMPDase pdbhh F Eukaryota T 6ynx 6 F,Y k,K I7LSX6_TETTS subunit k MWYKYFSKQSWNLRVWRKANLKYNQDDFGMTQPKYIARFGDFRFRLVRTEGALRGCMFFVGFGCFSIINYLYGRYGYIINESSQKRAAQDLLDNDMAADKILFKNRVGAPTRPLRSLDDMMAFLSGSATYDQLADYASYNHAMDVNQDQQAGLDSWMSEKDKNMVKYYQRSLGKKVEGI 179 T 5 RTP801_C pdbhh F Eukaryota T 6ynx 7 G,Z c,C Q950Y8_TETTH subunit 8 MITILDYLFLLDLNDDLTRKAVFEQVIIFIFIYCTMNFLAWSTVVELIWPTHFFNRRHSSSQEFIRFRTYTEVLLKISAYNDFFYVLNNYYYNQKLILKN 100 T 2.1 ATP-synt_B pdbhh F Eukaryota T 6ynx 8 AA,H G,g I7M8Q3_TETTS ATPTT3 MINRSTAFISKNLRQANLTQSSLAMKTQYNQMGFSSDNPYNKRWEYKWKHSYYTYPRDYEHTEVRKPQDSKDVPPIYFAYYKDFVDRWLPGMNMWWQRRHRIFDKFNVYFLPGMSLFFYQFADLALGFKIMAAFPLFLAYTRIRDKTLDPDFKETYLRDMIYQNPEITKYFNEETIHVLDYEFEYLPGYLCPEKFPEYQNKTWQFFNTDTAQAEGFFKFGDVESGATMTLKFKTMPIPGKFRYQVGEPFYFYDLRAEIKCDGVYKEVVLVDEKESLKKIRPFLFLI 286 T 35 EIID-AGA unphh F Eukaryota T 6ynx 9 BA,I H,h I7MCZ0_TETTS ATPTT4 MQQRKKIYLRQKRKIYIQLKNKEKKKNNQFIQKREKMGYKIRNKSIFWTRAGWKNNWHPKNFNAPRPSYGEFTMGIRCRNDHHSFLRYVQTYRNMSRHCKQYFLGDKQLEETFILGLRSLFLVPYDSQCLTDQIKHGGERRFVDQLDRDFELISYNTHPYQLFTYTVRNEHLAWKNEQYEKIQKGEKTFEQELLDYLDEQVLAEKAKLRDGQNFSIERMTEIALHVFRKARAGKVRPAQDVRGPDGNVNDFLEQRRPFEHPNPTGVTH 268 T 0.075 Staphylokinase pdb F Eukaryota T 6ynx 10 CA,J J,j Q228N4_TETTS ATPTT5 MSENKAPGQIYAYDIHNTHYPYVNIKQDSQTQLLASFRRSIASINPFSYRQVPSQDRAAFGLRWGNAWYAPNPYPNGIHFDRVFPTHYDPLAETNRTKANLQLIKYAPGNYSTLVVTSEKLPRPCIRTIQNYRRCQMVNGTEKCNSEAQDILAICPNWALDHMKEKVRFYTKALAINNQTYIRAMQVEEYNQGRTVADVAPKTWIHGTRQHLRPDTMWADDRYTNITQTEINEAIKRVEARKAREHEKKPVEQANVNANTGEQPVRVEKSLYP 273 T 4.7 CX9C pdbhh F Eukaryota T 6ynx 11 DA,K L,l I7MCQ6_TETTS ATPTT6 MPVKEGQAKLWFSTKEEADAYDDKMISNIELKSQDYEDENFSPVFNRKTQEYFLEPSEKFKSDFAELLRPLRSLSFNQVVDRYVLIPPNHTFYRNWTYEKFLGGFGLSYLILRELPLRNFYARVFVMYAFAAKVLDHLGNPFPFSGHGQIVAAADRWNHWDVRCYDNVMKALKYIRIPTVQNNIPEATRWYGRQPGHLLRADTYWIPNLVSQRFAKHQPAHWDGTQNMPIFRLADPKHKDSYMVQFR 247 T 0.011 YebG pdb F Eukaryota T 6ynx 12 EA,L M,m I7M980_TETTS ATPTT7 MDNYFTAITLLGLRDQNLPPFKDARLQRYKSIKKMIDLIETTTKLAPPMPVELFMLNPTDPEWDDDMTYPTITHATALYKSSALAGNLFLYAYNYNNFTANIRLRTMRYLFPVVSLAIFGNIYWDYRSQLVKVNLFDEYIQARAQELVKQNEYLLEHEDVKRYVWWYEDLKETLARVHRQANNHKACDFKDSEIILQDFIRRYTNPKDNLPIKFHPQGQTF 221 T 1.7 PDR_CDR pdbpssm F Eukaryota T 6ynx 13 FA,M N,n I7LVK6_TETTS ATPTT8 MEGFIQNKRKKEKEGEEEEESKEKRKQINQLNKQKQEEEKIYQQKKDQKRKKYLYQRKEMTIFAETWEASEYQYRNKANLKTLPVNHLGKLAELKFDFVEYKAHQLIACHLYERMTIHCMNQYGLFKDFYRPECLDAQYYFKTCVELNAAYGIQKKFFPEHFVGSPYARPVPQFQQLGL 179 T 0.14 Pet20 pdbpssm F Eukaryota T 6ynx 14 GA,N O,o Q24HK1_TETTS ATPTT9 MKQKINKLLKNKGVQDKYKYLSKLILLDQEIKGKIKRKNKKEKQKRKNKLILEEMQNTTNIVHVPVHMGHTHYFDYIDSFPKLKEGPTLEENHITNQKILREQLISGQQGLEQNLCLRNCFKLSQKRYIEFCLDRKCGGADFQRAATILGYTKN 154 T 3.7 Arteri_nucleo pdbpssm F Eukaryota T 6ynx 15 HA,O P,p I7LZE5_TETTS ATPTT10 MSNIFLELQDGDKTVYTHTSLIEESKQEQIQAIYDKVPQWTNGGRFLGFWLSMEAVNRVQSVAKLPIYYRAGIVATSTLLGGLVSSLVFWKSGNENQVAKLANGAPVYLKKWEVPELSKLYFFLDDDNNFKPSLNHHAVTQGRQYYKIYQHN 152 T 4.8 Glyco_transf_43 pdbhh F Eukaryota T 6ynx 17 JA,Q R,r I7M0G0_TETTS ATPTT12 MSQDPKIVNPQLWPNPNKLRFADLYKYQGVEMKKINDSIKNYKAAKFYIGGILGGCLVFKFFIDAAVDKYIFGENGNGGKFLEMQTINSNYDYYYNRQFQRMRYLTEDPAGDDPLQKTKDEHLVDLGFIPKVFGANVEVRKRAPHDKYL 149 T 0.029 Disaggr_repeat pdbpercent F Eukaryota T 6ynx 18 KA,R S,s I7MLU7_TETTS ATPTT13 MNSLSSKKANSLVFKSIRNFTLQWGSLAERPMVDRVMSTSTWPVPYYQRLFKAYPIREKKDKMSLLLSDIDIDDTNWYQAKDFLRGSFRGRQIVDYVENNIASNTYILIQQDVANMAKAYVHDICGYIDVANKENVRILSKGDLI 145 T 20 Tryp_FSAP unphh F Eukaryota T 6ynx 20 MA,NA i1,i2 I7M7C0_TETTS Inhibitor of F1 (IF1) MNRSVNIAKNLIQTYRAMSVQSRFAFSTREEEWLDKRTKSQEKVYFDQEDRKAMKRLLEKLNTTSKFVEDSEYLAPQNLEVENILKRYHINYTQALIDELVDWKTGKN 108 T 0.043 DUF5673 pdbpssm F Eukaryota T 6yny 2 B,U b,B I7MJ84_TETTS subunit b MHSTLRVFTKNNCLSFTNMNRFSTAAQVAQANYSKFRADYSASVAAFQQRIKTIEKENTGSMKKPMAKAYEHPYNSEHHPLNFSAVKIAETFHDFIGPEQVSPHYESFAMSRKFLLTFWGGFFVLNFGMATVDLNWIMKSTYIPWIFWFQLMYFYVEGKNSMFMPLLQRFYRRAAANEIFTMEAFYHENIENKLRNLMRITKGQLEYWDIHTSYGEIRADSINNFLANEYLRLQSHITSRALNILKQAQAYETMNQAALLQKLIDDATSAIDNALKGDKKAEVLARSLDSAIDGLSKGYMDYQNDPLLPLILSSIEANVKKITTLSAQEQANLIGLTAEQLKSIKENDVRARKEFLESQPKLDNNLKNIESVKKILATWGK 381 T 0.12 Tipalpha pdbpssm F Eukaryota T 6yo8 2 E,F,G,H E,F,G,H GCR_HUMAN GR,NUCLEAR RECEPTOR SUBFAMILY 3 GROUP C MEMBER 1 KTIVPATLPQLTP 13 T 5.7 DUF2064 pdbhh F Eukaryota T 6yp6 1 A A G3CFL3_9CAUD 933WP42, VB_24B_21 GVTTLLSYLASESEGSLKVQGWSASGGRAEVVSDAEGTGGKAVKLTKEAGKSSWVLEYAAGNGAALLQKGGQIRCRFKVSGALAANQYVMAFYWPVSSLPQGVALTGDGGNNLLAAFYIQTDAKDLNVMYHNAKVATNNLKLGTFGAFDNEWHTLAFRFAGNNSLQVTPVIDGQDGTPFTLTQSPVSAFAADKLHVTDITRGATYPVLIDSIAVEVNS 218 T 2.4 Sial-lect-inser unphh T Viruses T 6ypc 3 C T CENPT_YEAST CENP-T HOMOLOG,CO-PURIFIED WITH NNF1 PROTEIN 1,CONSTITUTIVE CENTROMERE-ASSOCIATED NETWORK PROTEIN CNN1 MSTPRKAAGNNENTEVSEIRTPFRERALEEQRLKDEVLIRNTPGYRKLLSASTKSHDILNKDPNEVRSFLQDLSQVLARKSQGNDTTTNKTQARNLIDELAYEESQPEENELLRSRSEKLTDNNIGNETQPDYTSLSQTVFAKLQERDKGLKSRKIDPIIIQDVPTTGHEDELTVHSPDKANSISMEVLRTSPSIGMDQVDEPPVRDPVPISITQQEEPLSEDLPSDDKEETEEAENEDYSFENTSDENLDDIGNDPIRLNVPAVRRSSIKPLQIMDLKHLTRQFLNENRIILPKQTWSTIQEESLNIMDFLKQKIGTLQKQELVDSFIDMGIINNVDDMFELAHELLPLELQSRIESYLFENLYFQ 367 T 0.0019 CENP-T_C unphh F Eukaryota T 6ypc 4 D W CENPW_YEAST CENP-W HOMOLOG,CONSTITUTIVE CENTROMERE-ASSOCIATED NETWORK PROTEIN WIP1,W-LIKE PROTEIN 1 MDTEALANYLLRQLSLDAEENKLEDLLQRQNEDQESSQEYNKKLLLACGFQAILRKILLDARTRATAEGLREVYPYHIEAATQAFLDSQ 89 T 4.4E-05 CENP-W pdbhh F Eukaryota T 6yqx 1 A A de novo designed TIM barrel DeNovoTIM13 MDVDEMLKQVEILRRLGAKQIAVRSDDWRILQEALKKGGDILIVDATDVDEMLKQVEILRRLGAKQIAVRSDDWRILQEALKKGGDILIVDATDVDEMLKQVEILRRLGAKQIAVRSDDWRILQEALKKGGDILIVDATDVDEMLKQVEILRRLGAKQIAVRSDDWRILQEALKKGGDILIVDATGLEHHHHHH 194 T 0.00012 NanE pdbhh F T 6yqy 1 A A de novo designed TIM barrel sTIM11noCys MDKDEAWKQVEQLRREGATQIAYRSDDWRDLKEAWKKGADILIVDATDKDEAWKQVEQLRREGATQIAYRSDDWRDLKEAWKKGADILIVDATDKDEAWKQVEQLRREGATQIAYRSDDWRDLKEAWKKGADILIVDATDKDEAWKQVEQLRREGATQIAYRSDDWRDLKEAWKKGADILIVDATGLEHHHHHH 194 T 0.00015 NanE pdbhh F T 6yr5 2 B,D,F,H O,P,Q,R MDM4_HUMAN DOUBLE MINUTE 4 PROTEIN,MDM2-LIKE P53-BINDING PROTEIN,PROTEIN MDMX,P53-BINDING PROTEIN MDM4 DCRRTISAPVVRPK 14 T 0.93 DUF6143 unppercent F Eukaryota T 6yr6 2 B,D,F,H B,D,F,H MDM2_HUMAN hDM2-186 QRKRHKSDSISLS 13 T 1.4 NTF3_N pdbhh F Eukaryota T 6yr7 2 C,D Q,C MDM4_HUMAN DOUBLE MINUTE 4 PROTEIN,MDM2-LIKE P53-BINDING PROTEIN,PROTEIN MDMX,P53-BINDING PROTEIN MDM4 SKLTHSLSTSDITAIPEKENEGNDVPDCRRTISAPVVRPK 40 T 0.01 Cript unp F Eukaryota T 6yro 1 A,B,C,D,E D,A,B,C,E G5DSS1_STRSU SadP MHHHHHHSSGLVPRGSHMKQQSPLIQTSNADYKSGKDQEKLRTSVSINLLKAEEGQIQWKVTFDTSEWSFNVKHGGVYFILPNGLDLTKIVDNNQHDITASFPTDINDYRNSGQEKYRFFSSKQGLDNENGFNSQWNWSAGQANPSETVNSWKSGNRLSKIYFIDQITDTTELTYTLTAKVTEPNQQSFPLLAVMKSFTYTNSKSTEVTSLGAREITLEKEKT 223 T 2.4 DUF5377 unphh F Bacteria T 6ys8 2 C,D,E,F,G G,F,E,D,C Q5EGM4_FLAJO PROTEIN INVOLVED IN GLIDING MOTILITY GLDL MALLSKKVMNFAYGMGAAVVIVGALFKITHFEIGPLTGTVMLSIGLLTEALIFALSAFEPVEDELDWTLVYPELANGQARKKEAKAETATDAQGLLSQKLDAMLKEAKVDGELMASLGNSIKNFEGAAKAISPTVDSIAGQKKYAEEMSMAAAQMESLNSLYKVQLESASRNAQANSEIAENAAKLKEQMASMTANIASLNSVYGGMLSAMSNKG 215 T 0.0059 TPR_MLP1_2 pdbpssm F Bacteria T 6yse 1 A A A9J6U1_BPLUZ GP4 MKSPYEAAHERALMVNRLQKLTRMLRVHPDPKWKQEQQELIKRLKK 46 T 0.79 CBF_beta pdbhh T Viruses T 6ysz 1 A,B,C,D,E,F A,B,C,D,E,F GP15_BPT7 GENE PRODUCT 15,GP15 MRGSHHHHHHGMASMTGGQQMGRDLYDDDDKDPSSMSKIESALQAAQPGLSRLRGGAGGMGYRAATTQAEQPRSSLLDTIGRFAKAGADMYTAKEQRARDLADERSNEIIRKLTPEQRREALNNGTLLYQDDPYAMEALRVKTGRNAAYLVDDDVMQKIKEGVFRTREEMEEYRHSRLQEGAKVYAEQFGIDPEDVDYQRGFNGDITERNISLYGAHDNFLSQQAQKGAIMNSRVELNGVLQDPDMLRRPDSADFFEKYIDNGLVTGAIPSDAQATQLISQAFSDASSRAGGADFLMRVGDKKVTLNGATTTYRELIGEEQWNALMVTAQRSQFETDAKLNEQYRLKINSALNQEDPRTAWEMLQGIKAELDKVQPDEQMTPQREWLISAQEQVQNQMNAWTKAQAKALDDSMKSMNKLDVIDKQFQKRINGEWVSTDFKDMPVNENTGEFKHSDMVNYANKKLAEIDSMDIPDGAKDAMKLKYLQADSKDGAFRTAIGTMVTDAGQEWSAAVINGKLPERTPAMDALRRIRNADPQLIAALYPDQAELFLTMDMMDKQGIDPQVILDADRLTVKRSKEQRFEDDKAFESALNASKAPEIARMPASLRESARKIYDSVKYRSGNESMAMEQMTKFLKESTYTFTGDDVDGDTVGVIPKNMMQVNSDPKSWEQGRDILEEARKGIIASNPWITNKQLTMYSQGDSIYLMDTTGQVRVRYDKELLSKVWSENQKKLEEKAREKALADVNKRAPIVAATKAREAAAKRVREKRKQTPKFIYGRKE 782 T 0.15 DUF4404 pdbpercent T Viruses T 6yvh 2 C,J,K,L C,I,E,G CWC27_HUMAN ANTIGEN NY-CO-10,PROBABLE INACTIVE PEPTIDYL-PROLYL CIS-TRANS ISOMERASE CWC27 HOMOLOG,PPIASE CWC27,SEROLOGICALLY DEFINED COLON CANCER ANTIGEN 10 RSMGTSREDQTLALLNQFKSKLTQAIAETPENDIPETEVEDDEGWMSHVLQFEDKSR 57 T 11 DUF4407 unppssm F Eukaryota T 6yw1 2 B B PHD2-SPECIFIC RaPID CYCLIC PEPTIDE 3C (14-MER) XXVWLTDTWVLSRT 14 T 0.21 DUF4571 pdbhh F T 6yw5 22 V VV Q7SHR9_NEUCR mS26 MAPTLARPSLSGVQFILSSPTTTCAATSVVTRAIAARSFSTTRSARDSVSIPPDSPNYIKVPEPPQSSEVRHPFVKGHLPIPRSIFPKKGVPEKVQSGYVNRIAPKSAAELAGLPPKSKQESWRRKMAEARRQSLEAGLQGLWQRKVKRDQKQAKESKARYLANKRAAQAPERLDEVFTRATIRESTAKNTFVPLDPEAFVKAEEARIKHAEKEAMKSEARRDAVVQLYVASKNFIVDEKELEEHVNKHFTEKIHNAGLWESGRSIWDSQKNPISMRELRNEFSGFNDRVTATTSAAVKTTVRQKNVAEELTGGKL 316 T 0.17 T2SSE_N unppercent F Eukaryota T 6yw5 23 W WW Q7RYW7_NEUCR mS27 MAGRAPQHALRVGCRAVPEALSKPAQQSRCLSSTVPRQATYPVVSFNKTSSPELKEALETLREKVILPTYLPPELRQKIFNKKYEKELAHDPVTIQIDGQPQRFSYINMLTDMPNTPKNIRAALLSMKNGGDFANLSGLLEGMHRANRKLPYWLSAQIVRKACKAGHLQLILNMVRDVKRTGFTLERHETVNELLFWIQRFAWKSDYSEPETRKALREVQEILDALEGDERHMSKDRKRQQALTRFPYHRDPQFLAARLNLTAELAARRAVTGQTSEQQLNSANDVKNLVKYAEQLVRLWPADKALLDMYTDEAYVARVDLRYLIKPQVHLRYASFTLQALKNAAKIVGQLGHGPLAAQLINRAAAVEAESQLAYAKVDDGMAGQKIYEMVVGGKK 396 T 0.52 Chloroplast_duf pdbpercent F Eukaryota T 6yw5 27 AA 11 Q7S4Y4_NEUCR 37S ribosomal protein mrp10, mitochondrial MPNKPIRLPPLKQLRVRQANKAEENPCIAVMSSVLACWASAGYNSAGCATVENALRACMDAPKPAPKPNNTINYHLSRFQERLTQGKSKK 90 T 0.00011 CHCH pdbpercent F Eukaryota T 6yw5 32 GA 77 Q7SG49_NEUCR mS46 MNRQVVTSTLGRRGVASTILNAQQQQRPFSSTTTRCAAEDDSKKPAAAPSTPRAAAPGPISASRQKSEAAVGKLTQLRGSFTSLTNDNSFHKTLPAGARDARRLAAAPIAGKGAGAGAVAPLGGGGGASGAPKVINVRSLKGTLGSRGSNNIPGAVAPGAALRPRFAAGPGAAAGRPRFGAAASPGAGPTGAARRPPFGARRARPAGDKKRSGGSGDKRPRGDDYDAPPTEEEKAFLRGLEQGKVTEYVPKLTPDTLLGYGPPVATDAALGKVESAMRTMRILGGGLPFNDQSGVTSDPTAIKHRYVHEKKPVFFSSVEEKEWVRESLDKFAVSEGPEKKTKQKILETSVLGKYEEPKYVESLTETVKMVEKYQGGTFSYAPSDADKFNKKLNQLLAAGLPRAAPAPAQAQKKA 414 T 0.072 UPF0164 pdbpercent F Eukaryota T 6yw7 7 G C ARC1A_HUMAN SOP2-LIKE PROTEIN MSLHQFLLEPITCHAWNRDRTQIALSPNNHEVHIYKKNGSQWVKAHELKEHNGHITGIDWAPKSDRIVTCGADRNAYVWSQKDGVWKPTLVILRINRAATFVKWSPLENKFAVGSGARLISVCYFESENDWWVSKHIKKPIRSTVLSLDWHPNNVLLAAGSCDFKCRVFSAYIKEVDEKPASTPWGSKMPFGQLMSEFGGSGTGGWVHGVSFSASGSRLAWVSHDSTVSVADASKSVQVSTLKTEFLPLLSVSFVSENSVVAAGHDCCPMLFNYDDRGCLTFVSKLDIPKQSIQRNMSAMERFRNMDKRATTEDRNTALETLHQNSITQVSIYEVDKQDCRKFCTTGIDGAMTIWDFKTLESSIQGLRIM 370 T 0.00016 WD40 pdb F Eukaryota T 6ywc 3 C,F C,F De novo design 4E1H_95 MKYFDCTVSGERGIIKTYGIQLPEEALKEHVREYVEKLREGSAITITCTAGDRVFKFKDKVGSWGSHHHHHH 72 T 0.31 DUF3577 pdbhh F T 6ywd 3 C C De novo designed protein 4H_01 MEVERELRNWLSEVLSKINDAPVTNDIKKAISNQVLKVAEQVWNGHSKEELQERVRKEVCSVCSNVPACWAICGGLLEVVKYQGSHHHHHH 91 T 0.0054 Glycoprotein_G pdbhh F T 6ywe 9 I f Q6M9C4_NEUCS Related to ribosomal protein YmL11, mitochondrial MSLRLSRPAVRGLGSAIKSSRISSRSAALLVPSTSTSAFSTASPQRAAAAGHLRLPDDYVPPTQPPSARPVDTRKSQLLRTYTSMLRSTPLMLIFQHNNLTAIEWAAIRRELSLALSNVPVPEGAPDITSKIHLQVVRTRIFDVALKTVEFFDPSTVEPTTATTATGTKVPATYNHDLSKHAWKAVKEATKNTEAVEKTVYGQLAPLLVGPVAILTLPSVSPAHLGAALSVLAPSPPAFPAPSRKKNPGYYDLTCQSGLQKLLLVGGRIEGKAFDYDGIKWVGGIENGIEGLRAQLVHMLQSAGMGLTSVLEGAGKSLWLTMESRRSVLEEEQNPKKEGEGEEEKKE 347 T 8.5E-09 Ribosomal_L10 pdbhh F Eukaryota T 6yxm 1 A BBB CII-C-39-CIT LPGQXGERG 9 T 4.4 DotA pdbhh F T 6yxq 1 A A ASCC3_HUMAN ASC-1 COMPLEX SUBUNIT P200,ASC1P200,HELICASE,ATP BINDING 1,TRIP4 COMPLEX SUBUNIT P200 GAEFMALPRLTGALRSFSNVTKQDNYNEEVADLKIKRSKLHEQVLDLGLTWKKIIKFLNEKLEKSKMQSINEDLKDILHAAKQIVGTDNGREAIESGAAFLFMTFHLKDSVGHKETKAIKQMFGPFPSSSATAACNATNRIISHFSQDDLTALVQMTEKEHGDRVFFGKNLAFSFDMHDLDHFDELPINGETQKTISLDYKKFLNEHLQEA 211 T 0.015 SNase pdbpercent F Eukaryota T 6yxq 2 B B ASCC2_HUMAN ASC-1 COMPLEX SUBUNIT P100,TRIP4 COMPLEX SUBUNIT P100 GAMAMPALPLDQLQITHKDPKTGKLRTSPALHPEQKADRYFVLYKPPPKDNIPALVEEYLERATFVANDLDWLLALPHDKFWCQVIFDETLQKCLDSYLRYVPRKFDEGVASAPEVVDMQKRLHRSVFLTFLRMSTHKESKDHFISPSAFGEILYNNFLFDIPKILDLCVLFGKGNSPLLQKMIGNIFTQQPSYYSDLDETLPTILQVFSNILQHCGLQGDGANTTPQKLEERGRLTPSDMPLLELKDIVLYLCDTCTTLWAFLDIFPLACQTFQKHDFCYRLASFYEAAIPEMESAIKKRRLEDSKLLGDLWQRLSHSRKKLMEIFHIILNQICLLPILESSCDNIQGFIEEFLQIFSSLLQEKRFLRDYDALFPVAEDISLLQQASSVLDETRTAYILQAVESAWEGVDRRKATDAKDPSVIEEPNGEPNGVTVTA 438 T 0.18 DUF325 pdbpssm F Eukaryota T 6yxx 2 B E1 Q57WG6_TRYB2 mt-LAF21 MWHSSLRYVSFKRLPFGRRSTSGGVNFNKGLLTDRERGDPFTEPHAYRNKKSIAAISKVAKKQDILLREEKQRKELDKIQSGYVTERELHIGCDKPLGGNANEIARVIDEQALISPTPGEKCSTALRELMENEVDRRNHMMDKFGQPVGAREFHRLFKELRHADNEAETIERHQTRLVEEYGVYPSLRLDAYMLDDDTYFPEWVNALPYSIRDRVKFGSLGLTEKDEALRVTLGRMPLDRRRREWERLKKAKEYKAAKEETLTLAELRDARQGKRRFHWLQRKRQKRASILRRLALRKPDAFELWPSRVVDYSQRIAFIAQHVENGLDTKGQWPLDPEELARARVRRSKEEAERTFLMSAEEKRAHKKLSGRSGDGSISEMLQSLEVPDKPFKRLSRKVYANRVNAIVHGDQDEYGRRYRKMETRSKRRMRPYASLGEIGLENELRKEPRINAKGLNNTDDEDWPRHTKSWGDGMPSMRYGS 482 T 0.016 Ten_N pdb F Eukaryota T 6yxx 27 AA EG Q387S8_TRYB2 mt-LAF7 MPNIKGGVGSFLMRRAAPKSIRQKYQTGPQFYKRKFFQFQKGHHRLHRRISGVQTGSPTHQREYERFHHLPGDVRTRPQFDFTFGETRADRVMFAWRKRGDLQLYQMSGRGETFVCYRCGYPVRSQLVAVKADNWDYRMCYRCYTNTVHRGMENDT 156 T 0.042 Ring_hydroxyl_B unppercent F Eukaryota T 6yxx 29 CA EH A0A1G4IEQ9_TRYEQ mt-LAF8 MKSVFDIARSHVTFPVSRDGTALRRVLKDWLDYTECQSLQAKPAFPAELCITVHPSVKSMSRVYTQGDPKMSGEVCRRPGEAASLSYTKRVRLLLWSAVLPWEVQRGLSMSVLEPPGNGGSVLGEVGVRHGLGGSGGAVDGGSAVATKGWKEVEDSLGVVTDPTTXTAADFVSGPQCSTLGASIIPGMLFVMSPESAAQGLCFWSGAIRQPIDIAFIAPVEPPAADTPSFSELRQRRLQLSLEGYDISRFFPDGELESPTVTFAVQSHSYLDPFPDCEQRDQQVGCGSSRERNKGIEDSRRYTATPEGVGRGNENVRYVLETRRNLLRDSIRSALRECRCTHGGVVWASNTGCAADGNPTGTGECDVEVTISLTLSDELKEDLREKARLYTNYVVPLEGHVRRHIKCLSGISPHPKGDATDGSDALQQKGTEAVWGEEGCVCTNGSAPFVAPPPIIKPPLPVKVGTLASTRPRSPMLADEAEGRPTRLAPSVFGRHDAPALQRAQQECNQLISASALARIPNTSPRAPEIPPIDYEIFDLCLRLGLCQSEAIYYFYGRIMREWSKELRRLRAAKSHGEGGVNDGNDMVLREEDVHRMLRLVHDPSLQVPPELSACVEAVASLRKITNEVGVPVV 634 T 0.45 DUF192 pdbhh F Eukaryota T 6yxx 38 LA EL C9ZVC0_TRYB9 mt-LAF12 MRRLFITTASTLCHSLHCTDTRTGGAGKESTPTEVQCEMTLQCSDESGCSPFLSSLLSPVETVPLHDVTRTYSTMDVVDPPARYNPMVPNVEPSSSSAGHMEQXLENEEEEGPVACAHKNGKLWGVFEGSEDNKPPAWFYRLCKDLFYRTNSEDNMDDAALVSDIEPSHYISSTENRHVDGSDTTQRSAEAGTDVSDGVDPYVWIPFNLLDEADYHVGPYRFPSTATYTHEQRTLLCLGDTRREYVHFCDSYAFPGRAQIPTSVGTCPSKLYVNPKQQQPVVYIQLSNDIPPAMWLPVKGTAASVRRVLAEFASMAALHRDWHHDEFMERHATAVRMLELQRLPAGEGDILRYMAYDARNAQFAFAPIREFPNQQEFFLGEHDDPEKLMEHVDLCPLLFAIPHMRTVVDLHAEHMIPTIAGPGVATSLYRCIYSKALLFVQVHLSSEVKLPPQDPEAFKFMWKDSQVLPKMRIPVFVRVVWPTNERMSGGGGLLRRFNRLFGTEFASDIPVDAAMALLYVMQWSGHIKDFLGVRGMRQRLADLLLASQQPEPTKLYPGTREIPNPEYTVAERLGMHVQYLAQLHDPDISLTIQRLLPVASAPVRMGCAKAALIAGDRELFRHIVSSEPPGRMQTYMTKLVRKRKTRDLVDAEPRLLEDQYEFAAPLWTKRGKRLDSNTLEGVVEAQSRLSG 691 T 12 RNF152_C unphh F Eukaryota T 6yxx 42 PA BN C9ZQF0_TRYB9 mL80 MCCLYTSDVFFWSSLVVSPLPRIVRCVSAPQIRSIPCGSGDFSVMKRSLIARWQSGVHTPHGVVYRGAKMKNWPEQRIPENFKFTEEQRFRTKAIPRDVGTIPRNFVLGVLYRHQPCEVGGLWEHCTNDPEIVLDSKRHLREVLKQAREEGFVTFERDAISNEWLCFLTRERYEEVQRIVTAKSEAVDTHSGLRGAAATETSTYAEKFREMNVEAKEAHARRLEEEVANTTRYLRRFQQREIDYLPYTDLNGKVNFMWWYETRDVQQRADEVLTDSSSSKALAGEEEHKAGSQLEATTASTS 302 T 0.023 DUF2203 pdbpssm F Eukaryota T 6yxx 43 QA EN C9ZPS0_TRYB9 mt-LAF14 MRRCGNDRAVALLRATTEAMRLVKLKLSPDRTRNEEIQDRQNAFVWSDEHIFRPHQHFTHDPCSWSRSLEQSMKKQRKLSMVERLRSLEQRQLEEKQSASATAGGSSKCANHMDGEKAEGPRFYGAVGDSEDLKEYVANEDYFYTMQQEEKPNDPPLQELVDEVQSLHVLLSSPRYEDTPLATVERLQCAYSEALRCVFDRVRNASVGKTMSCNALLFSWSLLLQGVPALLESLAEKRTEECLVRALSTVHEALNIVLQEFNRITHSKERVELLPLEGWIESLDVVTHPLTNKDYTSLKGNIRLPESSFKPQCKLDSATVEFVHSRAIQAAAIRMIENDQSDVETEPLDPYHLYILLRCMVRLAEKGVNDSHIHRAALLTGMVGERIFSSLERTVAPPRRYSLRHALLGKQLRDASKPHAIPLDVCAPPGGVKKPPTAADDVLLLTRACTLLMKVATNVLPQTKFKVLETVDTVLKTLSYAPNYDLSTADTVIFSNMVLEELHHVDEASATDRHLRVLLLLSRLRLSMCADRSALSHLFSCLCNLLPPHSIQQDKLREWKRLRGLVMRHLLYSVRGEEVEQHYTRVLKSSETWVEHLAFGQYSGGLPLSLWLEACHIYLTAGRKLTVSCAEALITLRGRCKDGGVLRSSNSAGVGPLDFVSVTLLAQLLEVVSHGCCSADDLVASPVAWDKVRQTIQGAIGEDENTIQLLRAGRLCVADRQATGSLVTTYP 731 T 1.1 DUF4048 pdbpssm F Eukaryota T 6yxx 45 SA,UA EO,EP C9ZMA6_TRYB9 mt-LAF15a MFTVSCNVAFLCHPAVHHSLLLLRALRQRHTLAIERMGANVTNIGGTVSLSQCGNHISIVPPNLHGSKCVTSGGSIGTVGESPLCVAEHGLQRVHDPQHILYLFSSASPVRQSALDGQIQSYLNAVVVSNQVLRAADDVLIALSIGEMEAVRQTHGNLIDCVAALDASLQQTTENEEGGGGNGATQEVDCLSTWPLFTTIQFLVEEGGLPLGPFPRMSRAYYRLKESTPVVAHSQLVWRTFELSRGPEGPTGELPAWPHRGFLRDIQRQIAEYTTDPPERIMAGVTGEKGPLRARVSGARLGLQRTPARIPWTMQGLHR 319 T 0.42 Hemerythrin pdbpssm F Eukaryota T 6yxx 55 DB ET Q383S4_TRYB2 mt-LAF19 MAFRRLVKRHKITNNQMLLMRRREPYKPTMKDRQEIADRAKLEEFERKNADGLMFVPEKALPPWQKSLAHNAKALGSRINFRGFRVRVADGQDEPGFPTPFR 102 T 5.2 Ribosomal_S6e pdbhh F Eukaryota T 6yxx 68 QB Ae Q383U6_TRYB2 mL41 MLRCSCARRRGVYHNAPSVYPFVKPFHDTPYDQDRGRHDSVGQRYRKNSWPEWMDNGADGTGYGIGLHRTHPLSRLRGNLKRSPSHVPRVLGMMIQGVWHKSGVKLYFRGGKPPNPSVHPYLTGEPCPLYGWKVTDESVIRQFNMPSIDGTNFRYKPYVALQERKIMGVQVPSDSVSLASGRTTESKPLAKRLFFWR 197 T 1.9 MRP-L27 pdbhh F Eukaryota T 6yxx 72 VB E7 mt-LAF27 MKRFIQSRRVWNLMDIRRRPPVLNRLRGVLQFAQQPGALSRHRQGGDYNCSMRVSIYSRPGKVSRLNNADATWHNRSRKEKPPDFDPSAFRRRYRDS 97 T 0.034 CobW_C pdb F T 6yxx 74 XB E9 A0A3L6L206_9TRYP mt-LAF29 MPRHLSSSLLQKSLTPGAARLLPQNLIPQRRPAPGRMTYRGPLLAELDRVRDTCTTEATSSGVVEESMVSNGAFDAVRPAALGSTSSVADEYLVPTPRATKLKKAAMAASRTPTFVLPNSRSATDGEGGARQANDERDGTNSSPQLFSIESYETQQRALRNIFNEAGRHCVRLRKDSKWLLEERRAFRQSHQRAPTTKEVSPHVDVALAPVGALKLSKYLSPCSASREVVEHSLLLHRTVTKSKLVQSSEKTLFRCLRCFHVYAARPRTLLRGEVAQSWLEYEAEAEKEERARQLARRPHLRKKKYSIGRQRASLANDPRCCPLCRSTKAQWMMEYVHHHTHG 343 T 0.092 Sigma70_r3 pdbpercent F Eukaryota T 6yxx 79 CC Ao mL52 MHIWRTAASNWVKRHARSNAAQWRRPEPASSSAAAKTIARLLAVGGHGCWSEATALYAEALATKSVQRYGVVTPAHRHDCIALILGAMSSSVDASGSDINRGEASLTPHYGMLAERLSLDIIVPHSDVGQEEKEAASVACIAVFRALLTSGRTHAAQRFAQQLLRSRNSREHGLWALRTIMLAVTAREEGMYRQLFLHGGLLKDLEHLTGLDAFTVVMSSCDGHEKNIGPCGGRTEGNLGRDEVHRVAARWCVDVALHMHTCPWRFCLTNEKEPRRETAFNSEVTSYCCAKELPMPAFAVTMLPFEVLRSNAVGVLRCCTAAGNGSHGNEVVTFDGTASNVSGGRSGDSTSCSTTAPSLGLPALKRLQDVVSAAIEVPLSTHEPVECLYLLLQISRDSTAPDPRRVMPSTMNILWWRRHLQSCVLESVKSAVRWKSVCDFVGLSRLVTERVPHCFPLLVAHAAESPDALRTLFNEVSRQEGAAAXPVDDIPRSCVMGLLQMCGDERSAAVIDWVVEHCXDHESIAQFVLDSLGQHQELASVVVRRLFQRSEQSXDVINALSRLLEKGTCRTLLADVCSLVPQAQQWTVALRFVSGMSAPEVGHVHTAFMAFVGNCDEQFAINMALLWSDETFTWLKADVNPPPRRVYMDDRWSDALRLVALGAERLTQRPELLGKYIRYFSKRVAVNSRLYMELEALLACSKVGGGSARNCGRSSNWNPIASTVTGGPPLETSLVGVRTEVLPKGRQWMEACAHASEVGVTRGVLEILAKRGRWEEATILLARAEAKRRVEWAPLVIRAARLSSQWRAALSVAEKIAANGMKLHYSVVAELLACCFSADVPLEVVQLWLHRRSVGESTSSHVLFGLGPGSSLRTEQRDANDKHNVFGDASRWLGSAQAELFFVLLESSSSRSSDHQWRYALQLLKEHVLLSGGVPSARVFRSTYSILHRAERWRESLQLLGLQRSVCGSPTVKCVHLVLSTLPSTAWQYALGALQCIPPGDSGSIHRVLPLLLPVSWESALGLMIDHRVMTNTAMECVVGCEDVPLALRLQAWRRLLPSLPVSMKHRAAPIYLRLAAGVADGDVGLEGTNGLDAMCVIERSIRGLRGDYINYASFVYHRALLHRHWCNDSLHPSGGVAAFRALSGIAEVDQQCEVSAAALEQLSNIVERLEQVCGTVSASTATHGSQHLERTVPVVNSGEILQGSAPTNARRDPETHCFSFVSSYWLSLFYLFRYCRCVISSSLFPFFKKDQGSGLPIFDVMRLHTVRAPIITRAAMRGYSEARSNYDGTSLPAWPAPGKKPTYPAALSELRLPQPRMRKTRTEWMYYHGHGGCPGKYGPSREIADFEYADGTPASISGRRFAFKHHQDHLLVQLIRAAATVERYDASGLLPRIPGTAEQRNWDPAIPLFLDDVDEQGRPAPLRTAGDAPGTMVSHVCSRVVDERMGTPTHTPNELANRHEGETLEANTMFATNDPSAFVSDTVKLRDDKRPYWSRRRWALTDKFLVPKSPKPKNTIKDE 1520 T 0.00017 MRPL52 pdbpssm F T 6yxx 84 IC Av D0A934_TRYB9 mL64 MLRGTRGFLAVSPGVGIAPETTPVKYTPMMLNIQNMMWWNGKRNLYRATYREKTWYEISRTGAFTKGRRPVMRQKYSREALQAALAMVPPGFEVADVPRPPQRILAQSEGIVGRWYSNYWTLHSMRYQCLLAGVEWPLGERQRPRTNYDEPFFFADFEESKAIRDYRSRWINVNRSLVGMTKRMKEAEEEARYMQFRKLQDTFWSNRKVLVNRVKSMYNQGARTSAKDMPIKTINIKAFLSE 242 T 0.022 DUF1672 unppercent F Eukaryota T 6yxy 57 GB EU mt-LAF20 MFTPSLALWSQFLKRTFVGGMGYNVKRPYRIEIRMEHDKKRRMRRRNIGCRRMMKS 56 T 0.081 MCPVI pdb F T 6yxy 76 ZB Bi Q4GZ80_TRYB2 mL101 MLVRTWMLLNRPKGPQGLRPGKEYRLTVPYRSEVTMLRLANHKAINSNIRELFKKPLVMNNIKAIPRDLGEIPRDYVLRLLFFHQPIRLVDLWTICKEHDDVPLDSAKHLRLVLKIAKLQRWVYAEKNQTNNLYYYYVHQSRMQEVQQMVRASEVRKKEQESVREIEAEKLRMEEQERRKVALDENIVALQNALVSNIAQIQEFDPGFARSKIYVTESGAVNVGWGLNDGGSAACSSDLDGQQVA 245 T 0.0025 Tfb2 pdb F Eukaryota T 6yzh 2 B D P8C9 AARLYGFKXX 10 T 1.3 B5 pdbhh F T 6z0g 1 A A TREM2_HUMAN TREM-2,TRIGGERING RECEPTOR EXPRESSED ON MONOCYTES 2 GSGRSLLEGEIPFPPTSILLLLACIFLIKILAASALWAAAWHGQKPGTH 49 T 0.00029 SIT unphh F Eukaryota T 6z0h 1 A A TREM2_HUMAN TREM-2,TRIGGERING RECEPTOR EXPRESSED ON MONOCYTES 2 GSGRSLLEGEIPFPPTSILLLLACIFLIAILAASALWAAAWHGQKPGTH 49 T 0.00029 SIT unphh F Eukaryota T 6z0l 1 A,C,E,G A,C,E,G Positive Strand XNLAALRSELQALRREGFSPERLAALESRLQALERRLAALRSRLQALRGX 50 T 0.0025 DUF5320 pdbhh F T 6z13 1 A P bicyclic peptide 3C CDIHVXWEWECFEKL 15 T 9.7 PDE6_gamma pdbhh F T 6z19 2 B C P2 ALYGFKWA 8 T 5 DUF2627 pdbhh F T 6z1p 6 F Af Q951A2_TETTH Ymf69 MLNNIFIFEKYLKKNNFKKNKIILKKKFTPLRFFLFLLSLFLTPFNCMFIISFKINNKLEINFESYLI 68 T 0.55 DUF3667 pdbpssm F Eukaryota T 6z1p 13 M Am Q951B5_TETTH Ymf74 XXXXXXXXXMKFKREFKFLIKKKNFKFKKFKILLKIYYSIKNLINFYKIIKLNNFKIKSTLLINNNYYFNYITNGLDLKYDNTFQNFELNTLSIKNYKNKNLIISNNNQLDIIKFQKFLFIIDNKYVNSLICDNLFDFFFISIILTNSLILEFYKNIILINLIKIN 166 T 0.94 Interfer-bind unppssm F Eukaryota T 6z1p 34 HA AH I7LT48_TETTS mL40 MSQFLAKAVRSDLTQLCSQTLRWNKKGKVNEATVQARKEKKRLKETNQFTGDYTGERPPHPWLSAKRLRSIMTQYQVFANKNRLNIKVDKKDAVEMKKQFEEIGAHEYYRMRTEQIINEKNNQVIDQINKEIETLPFNIYEEITKMPKDKIYNFNTDSNSPYILYFEQIARMFDEEHLTKLKVSQRLQKLAEDKLGEND 199 T 0.017 DUF5446 pdb F Eukaryota T 6z1p 39 MA AM Q22KC0_TETTS mL53 MDAISIKIIIVKKLRTINIFFELVINQNLICLEKLYYCQYTSQIVSIDRQKYSKIYQMGKKRRVQVFTGSMLQPLYRYLLSAKIGVNPLDLHGQNIAKQIFLKCKQARPKYLQNDFNATLVQDNEAPASYFHAKFVNGYEQKWYLHKSTEEEINRNLKYFNYQIEMERNLQGHDDEYEDEETQL 184 T 3.7 MRP_L53 pdbhh F Eukaryota T 6z1p 42 PA AP I7M3V9_TETTS mL101 MFQAIKSIELQTKSLFCQLQNGFKRTSASQKLRQKIKYRSSRPDKFKLVSKLGKFDFTKPNLSFPVSIPLKLSYVYQPAKHTPNLPTHDFLNFKTMTGNEILLNLENYENLRPSEICGALIELSKREGHEEINWNEHEWVAATTEHVTKMMPTYTPSVVCYLMVAFQRLRITHEKLWKNLTFAIEKTIHKFNAKSFAYTYIAYLEDTSRSSEEFRKKLVELLPIHLHQMNPNQLTRCFELTFERGYMNEYLFEQHFHVLYWRRNVWFGVNNIIKVLEIYPKLNFVDDCDFFEGAILANIPKVKTQLNEQNTKALIEAIQALESKYPDLKINSTLKFLNTHLTFCQTKLKAIENSKFYKIVLNDFEYYKIKESQRLEKEAKQAEKTN 386 T 0.36 NPV_P10 pdb F Eukaryota T 6z1p 43 QA AQ W7WYR3_TETTS mL102 MNALIFRNTNFLFNWISQSSSMLGLLGIRKNMSFQLQEETAEDKTQKKLNELLQGAFDFTNRNARPPKKANHGARPCSSVMRRLKKKYFYRRTKEAMTPEMEPKKKFEL 109 T 16 DUF5528 pdbhh F Eukaryota T 6z1p 44 RA AR I7MKV5_TETTS mL103 MLLSQSIQKAVANAFKQISRSQYKAISCFSSSDKNDNGSNNQEGDSSKNNEKKQATTNDDIYIHKTSYNLEQFKSYTQNVEKALKDLNQEKEKLENSSPFDLLPRRKRRVYDRPLHDLDISNYECWRSYDKMIFKTHKHAARVVCKINLSPRALKHAFGIGSDVSRNSDVSTREYDFEDSNLDSFLLYDYKATTEYHGNNDPNYDYQNQDQVPPKKRKQQHPTPQEFWESDEPHAFRVNCSNYADYHKFKKWIQTEIEKRASEKSYEEKIIERFGPYKIYDQYDQKYDLVKEPSVFKYGREYYLEKGKKFSEKEMEENPYLKPIKPAKQMEDKYRVAWPYQWNPKPTQ 348 T 0.14 BLI1 pdbpssm F Eukaryota T 6z1p 45 SA AS I7LTP6_TETTS mL104 MIGQVVNKGTSSVSSLFQQIRRGFKGFDKNRWATSAEVFPEKPFKYGYDKQQKIKTKQDRQWYEAPVGAMTKKRFQTADFRAYQTELEEAAQKTLEGQKIDVEEEVVKVEKDEIFGEQGAPKRYYDRILQKELYFHHRNGKHFFEQADRKISFLNQFTPQIEKVPELAETLKTILKIATESEGSPLKRAGIDQEFIIEDDFYSENEECKKLFEEIKQKVSNMNYRLLPDLCLVLSFKLRYNKDVFGIWEKIEQNFMQSIHHYPVMELVKMRYASCALSPKSLSRDCLKAIHDIVFTELHNVPSVLDLSHLLFAFRHINSLKYYNLILDEICRRPIKTLQEAIALLFVFSHSLFPNYKRKEIREKDQDLKEKHKIVDHLADALANNAKQIQGDDFVRVLIGLNNLQLTTFKDVLTHIERYIIKNIDTLDAFQTSNALYGFSKANNNAGFGSEQLYKALQKAAEKHWSQFSNADKARTFYAFAFQDLVDPVFRKKFIQPWLNENLESNLSHSELHYVAFSLMFEQNKDAEIWKKFVKNICKNQYVVPVLNYYPIKLARYYMQSIFPKWNFDIYKLACQDAEATWDASRQIDSIMENNKEWKSIGVLLQNRLEFNSIGLDNFENLLLIDWAIMPQRVAIMIQGARQTLPNGKPTPLHRLKLQLLENHKFAVFNLIYKDFEAISPDQKIPYLKKTIEELIAKQDTYIKDVEEPQQWLSFMDRMQELTYRNIMIGEATKGGVIDPEIQEVQFDWSKLQKELKERQKQEE 764 T 0.00023 RAP pdbpssm F Eukaryota T 6z1p 47 UA AU Q23Q81_TETTS mL106 MIVSKQQSIISLLWRSACGFSKFSKRQTTPKILRDHRFTRKGMVIRKQSKNYDYDLTNLAASQAHQSHLFFNKEEDFALLKKQADEKEKNMKNMHVFEDHSVPETILLEVQDKFLVQKHPEKVLNNLVELDKQFSKKGGEITELIQSALSKLIKEQILSFNLKNFGVLTTLAKKYLPNDAKLWENLANNYCRLMQKSEYNDLKTLDSARSQKYIENSVLTLTTVLKFSNKFHHENCVENISQVATSYLSTNFDVVQDVNTRFLLISNILPLIQSYKQVELIGLVKQKMDLIPKLQANTITALTHSIYKVKQQNSKNSRFPADLIDVSFLQKLEQTWLKTFDKSNTQLLAIFSYSIASLGYSGETKKFTQEYVEKNIKDITNLKDLAFFGESLKKFRALSQKYFTGAEQTLKQALSNNSQELHAELALQLLRVYSKNLFLNSEIASALIKKVDDAYYYEQFKPKASQNEMIVKTLQKYSQIVDLSNLRIYQNLIGSKKLF 499 T 1.2 MbeB_N pdbpercent F Eukaryota T 6z1p 51 YA Bc Q950X8_TETTH Ymf73 MINKKLNLFLIENKKLLKTNTEIYNLNKNFNLIKFFKLTNYKEIKALISLLKCINCLNKLNKSIFIFNKNFITVVYKTNFFKKLLTYKFINIELMLTLKLFIFFNTRIFINTSDTFIKFKSEYETYPEILFDCYHNHFSRKRVKNLSYKMFLLIMYNLI 159 T 18 Ring_hydroxyl_A pdbhh F Eukaryota T 6z1p 52 ZA Bd Q951A8_TETTH Ymf64 MKLLIFLKKINTIKKIENLNSENFTNQNYYYNLTNSLDQSEFITLKHVTFFFLKNKLMYSLSKYDRLNININENFYTFLKSLDKVDVYSLKFFKNMIFNKTYSNYNVNLFYSISIKEIKKNKEEFFKSNTNKILLIRNSFKTTNLKLLRKISIIDLILKKYLESELNDKISINFDKYNLKFMRKKRLYVKVLRRKLRRMRKMLRWAKISLRNFIRLTLIFLCTKDIDIFSKVLVKIMDSMHYKNHRRFLYYLKLFISKSMHYYFNILRFEGFFFYLSGKISGGGNSKKKNYAVKCGKYSLTNKMLKLKYKKGLIHTKTGVLGYKLMISYK 330 T 0.019 VAR1 unphh F Eukaryota T 6z1p 53 AB Be Q951C0_TETTH Ymf76 MFLVKFKIKKLRLKKKIKKFYKLINYNFSNLLNNFYHKKPNFLTLYNNTNNFFLKILFYIKYINLISKTISNKKLFKFLNNPKIRNRKKFKYKYSDKIKFILNILKSKKTKIKNLLFFIKYFSVLRKRQSRIFNLARVKSRLSKRRFFKKKLKKKKIAKYFFQMFKKLKFKHKKYINLINLDFYFIRNKRFFRLHRLYDIRKKYIRYLNNNRNIYKFYKFRIKHNFKFIKRHIKSISKLSIKDRVHFYELSLRNIAIKLKYAFTLRNANLFTKSGFIFLNGHQELNPFKYAYKGDIIELPFSKFILKLRRKMKKKMFNSMRKYKKYNWRTLKNKVNPEQRRLRISRFSENTLNFKTKLTKLFQYDYRTLSYCVVLDTNFKRDLTYLNKKLIPIYLLKLFNWKIIS 405 T 0.3 S4_2 unphh F Eukaryota T 6z1p 56 DB Bh Q951A4_TETTH Ymf63 MKQLKKIMINKTKIMSNDIIYIKRTYHQKIVNLKIYNNFKTEPKFYNLKFIEFQNLLNNVNLNKVFYTEYTSYPLEDRFATSKFHTFDSYLTSLELIDCTFLKKKFNYKYKYSMFTYFIPFLIKNGKKLSTINFILKGISTIYDNLKYNKLSQFESYSYVNQFKHYIDTADDVYNINFLIHWIINIYKPVFDVKCFNVPKVHKKKSAKTVLFKIVYLSEKNRLKTAYKHISTCIRQDNSAKLNNRITNIFLDLLLNYKKSYLYTRKMYIYEQVMDM 276 T 0.02 Ribosomal_S7 unphh F Eukaryota T 6z1p 58 FB Bj Q951A1_TETTH Ymf59 MNNKIYINIKYKMNFNPQVINSRNILSKNKSNRIYCKNFIFTILFFDFFNSTFSKNFLPYKYNLHITKKRKHVGSILRAPYKNKIAQFSLGLYRYFLNLSFFINSEFLPNINNKFEFKLLFIKFLNSYNYFESTLVTQVSRVIKIPTQIQII 152 T 37 GDYXXLXY unphh F Eukaryota T 6z1p 59 GB Bk Q950Z8_TETTH Ymf61 MVQKKFNFKKDSFFYEGYVWNHSLNIIHDIQLNYLDKNSNAIAIKYAKTLNIMSSLYRNLTFKKFDFIKIWYWYYLYYIKNIYFKNLINKNNNYVFEKPNIFVFNIKSKQIRLAVLTSKNYVYNLTVGKILSSLNIKEKSKKKSNKGERLFSEYLENFFKNKNIRFGTKKLAIIKLKYFKKGFKLHESIFKTLNKNLFIINTIYDFKVPNNFFKFKKIRSIKKRIKKKLIKDENTLNF 238 T 0.15 Pectate_lyase_3 pdb F Eukaryota T 6z1p 62 JB Bn Q951B0_TETTH Ribosomal protein S14 MLFKRRKDILKCKYKKIYIFKNKIKNIILKSIFFNRNIKNINRAYAYMILNNSKILYKKYHKICKFSGYRKNVNKFTGIGRHELNRKATLGQLQNISMNSW 101 T 2.3 Ribosomal_S14 unphh F Eukaryota T 6z1p 68 PB Bt Q22BA0_TETTS bS21m MNPIQFSQFVVNSITKHYFKEVAQGALRPGQVRREVKSVYKDYNPLKIHIFKYKHPRMLEIPMPGHQRRAERVEAQRRRNREQINFFCRYVLAKQGKTIPYN 102 T 13 DUF3755 pdbhh F Eukaryota T 6z1p 69 QB Bu Q23YQ0_TETTS mS23 MYLRKWSRMEWWDSVRYFKKYNLLDYKEEHRLLHLFPPTWQQYLPTKADLREVNFRDKQDKQLVKVLFQKYPDLRYDTSGRMYDKDGGQDNYANYVVRFILKQKEYMKRGMSANAAFIETEKIFQDRMQRKIDQNNLTRGIAINNRARSFMNFYQQMAEREARWKVQRMKRDVQQYLHEREIFEKEINDDGDMEDDFAEEENIYNRVLLKFQNAGMPEITKQDEKVSTQREFIERSENMFKVYYERAAIYDRLQGLTDSQIRSEIQNSPAKMKKRTRNLVKKLERLGXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXEKLQVVIDKLRRTKYKIDQLSMKHAQDLMFEEHEVYGKDILIDEKVGYEDLRDYLFQPSEIRRKTELDVINDNKIQEMIKISTIKEDLVNPYNNEYASKLNLQDTIEYQRSKQEKIKTLRAEKEREE 567 T 0.0021 MRP-S23 pdbhh F Eukaryota T 6z1p 70 RB Bv I7M0P0_TETTS mS26 MNFINKARSQLGLFKEFIVMPRMGWKLRKPSRKVTIYEQERRAQKQLEKEYRAKVISDYWNSQTILENEYIEKYTREELERKKKSDQNFRDSIIRIAKATQNHVEFLKKRSALDEAKERKHILEQDVKAMNKKRILNIMQQESRHWINQQNVNNINPDSIMPATIYDETDYYLKLHEQAFLFEQGRLEEMEKVSLETEEIQYKNSVLMPIYQDVISMIKHLKSTESFKLEKEFQAAKRILIEDCRNMQIEDQLEEKMAKLEKAFNTLRKAQKEKFDQPENQLEFLHEHLLILYNMLRKWGEYTNMLKIPATVVRDILHERQVMLEKKKLIRKQNLEQAKNQDRDTEENEQDSDIQSNDERETSDSEDEIDIAKLIEEEKLRQEKIKEQQRLFEERKLQMEKEQAEEEQAEQKQTIQDDLLNPEKAIENLMLQFEKKAEEELNAEFHIEKNSLYGNDKNAVDSRDFYKGIDLENVFPRALIDNSFDFTKLSGLPFQNVKEALRNEEKIELLRGEDPNNSPRKFETALIVEVFKLKSQQLRAASLTRAQQQKLNDIDTLLDLIGEIKVEEPTFLLKIWKNF 579 T 0.17 LRR_12 pdb F Eukaryota T 6z1p 72 TB Bx Q23UD3_TETTS mS31 MKIKIVYENLFRVINEGQLKSKRVKKLINFNKEELIYISKCCDMIKALKSVQRQLVFVQKAKFCSVPNQNNQGQGSNTNEPQQQAAAAATTTASQAEKPQQPAFLNANKDAQKDQKKTQNHEQKDQKQHQNQSSIYGSKINVGNSEEIKKSIQKSVNDFSSTYKLNLSDKKIKAGNKQTFAKKQDQESISKKKEKLLKGIQPTDEKLVDKHIGNVPIAQQIVVDKLKKYALRVSDTKRIHDKLKSEEGVDFSYVEPRNLEILNSFVHRKSEEMEELISTYLGVNQTLSKEEQLDDWRQQQALDHINFTPETMGKPEVFYPGSDRGPHPLDDPQNYIQWYEKHCPLPYRPLIDQMVQMTDMKITDNNMPSYIKKWIDQIQEDPQEDLDQEKEDDQEEDLDSDEEADMSEDEDVGLTDNKAQNDELEQQLCSVVGGGQGVFFYRTNPLPRLQVDNTSLWSLDQNSELPEDTSPEEQIINCEGNITFHSGLQDYEIPAFPETLSSYITHYQIPSIEKWSIFRQFPNMYHWWQKFYETNKKLSEQPLVRYQYSGLPSVYTYFYTMPEFARNNIVVQNVARCFEFNRPELNHQQKIMALNYAAKFSLPLDDLIVHAASQMIVSQKHFLTAKEEDRLKTVNQFYYEADTEAWDIHLAHEEHTIEQIENFQPPKRLGVDDIEEQLCDLPLEYYDNDDGFWNDFIKEKLNRNNAAYPATQGRAFFKH 719 T 13 Ndc1_Nup pdbpssm F Eukaryota T 6z1p 76 XB BB I7MMM6_TETTS mS37 MAPPPKYVISRKLVKRFFDKYLPRQPMDVQNESGKLMKCWQQYGIDDPRCKEYEVLYDHMYTLTRNYRAKIEGLRIKEDVMGALNRPKYHSEVKGRWRTGKTTEWDVYDGVQ 112 T 0.2 CHCH pdb F Eukaryota T 6z1p 79 AC BE Q23DN6_TETTS mS45 MIQKAIKILVNKQIQCFSYSKQSSFARMSAFRNDFDEKIKQKYIKNKTNDERFQQMNPEYIKKAINEEYEEAKEELFQNGGILTELRKSMLGKEEDEKETLGAEEFEDPVDNYLHDGLTHDEFLYRASNIKKQLFAKQIPDFFEKDEINSQYGNYSSFDKNFAKLKSMKTHLPDIEQSGFAGYKLEKWVNSLKGKKEIDDDEDLDEVREIIEENEINITEDEKHLLKWKIADIMRKENDEYVPFLEELEGENEKDPFEEDDTEDGRLSCKARDDIYELYQKGWSIKDICTRYGIVPERAKAVIWMCEKYYFQILPKADALAIHMAQEMEEEWEEENGWQDYGIDLEELAEREKGMHTLSFKRYREVDVGKPSKNILSEEDYTLVQKINTPRQEKITLKLDGGKYQRGYLIKDWKINKGRGRRDVSKMFRRIIENSHDISKLPSSVQLRVREGPRNASKGYSSKL 464 T 9E-05 Bot1p pdbhh F Eukaryota T 6z1p 81 CC BG Q24G80_TETTS mS76 MRQLVKTQLLKSNIQELRSSIDIRKVLNNKYRDSESSDFYKKNREQLVSRIKQDMDVDSYTGATTRRTQFQQYYQDQPSLGFVYPMNPGRGAFMEPDCRFTGNDFMEKINTHFATAITSSYQDLDKVVKIQVNDSKKNIEQNLEKLFREKNPNKEFNFDKEYARYLKLDSKKFKKEYGYETD 182 T 0.0099 DUF4932 pdbpercent F Eukaryota T 6z1p 84 FC BJ I7M7B1_TETTS mS78 MLSRIASKKISKQIGLNKKIARQSSSINKVIQDLLETSSKEESDLRVSQFAKNLDYLREKHEERSKIINQYLHRMRAASINSNIPLKEILRQFISSTRLGFDQKMTSETLFYLSATLNSLGPNPNHYFDYNLELLKDSWRFDDLVGDFRYGLKSGVEFSPERIAQGLRSLKKLGYSNSRITREAIQKIHRMLTKNDDQFNIDTENIIDNNPSIHKPLYYMRSPLDAKQIVENPEFQKFLKATIQKQEQELNKLNQKKNLQKDEDAQTITLRNELNEEEQEILDEIEIISKKINKSIQKSALAMKKTIESVIQIRRHYLRMIESQEKVQSVNLTPQLNLDLLNLEESMVEAGLISINEVKNPQLIIDTNQNRESIAQIVSNLIVEQNESFLPYFDNLIQSKPQLLQEIDSDRSLPEINVQFFANYTHSQFAEALLAVTEYSNSKLSEFKGQTFLNEKWADFFPQVDEADRLFFVQFLDSQKLFIEVSEVVLEQTKNEKDVNALGKYAAAFGNIGLIGVSKELVKRITSLEGMSTQTGICILRVCSEFPEEFSQLSENIIQILEKSSDITASQAIDLVYYEIALNKVSEASLNILKQQSKHILDENPRLNYVSQYLKLQGVDLGYNASGASIFENNLFQKNPIKDKLVELIGKAENLSHTGLGDYKPDFMCLERNEQGEIVQKAIFITPAEYSYFDIAKPAVEYTLLSKYLSHKVKNMVFEFIPITKFIDVNHQDHMITIKRDNQLFIELFDKAHHNIYSNLNNDLLVIGENLIDEEYNILKSKIKQIFRLSGGRRYLQLSVSDLMQVKYSLFSMAEDFNTVLSETSQQNLNQLCQKHFNQDFVSLLKSHSKLNFSVKNTQEEMEFFLKQKWVGKRLSIDLLPKGNEKYNQSDAFFDHLYISSENYYEYPEWQELLTQEYGVSNIAQINPTNFLAHQVSDVCNQTNTAYIIPKKKEVRSQIRPLDQRIVTSKQDRANYSLTWEQDYFTQGTNGELVYRGENHKISEGTKYIANLLNFKWELKKAFSSQERLDFLHKLNLTDKLIENHLHTSQNHQPKHAHFAQRDPKKYFEYLQNKTPSSLYCQEYLEFFTRKVDQKNKIIQLSKLHFTKIKEIYKSYQKGYISRQQFDQQKEGIQKDLLKTLRQYDAANNSLDSLLQNETIRYIDVQFDAESLLKDKSYVRYNLENDRDYLACKKDLNQTLNLKRIRAEEKLIKAKILSKISAEQTLTSLEQQYLEKWNSGEITLPKEPTFLTYKELDNSDVQLLNSLKFSDLVSFDKMQVNDLVVEFSSLFEQAIINNADVHLKGKNALKEWGISPEWISKSATNSMNNYLIALSETQAWKDGQKLKEHEKENVCSMLRLLQEQSYQDKIFTANENVLRKEALDVFNQSENRFSIFMKWLESREQEGFSISSDNTEHVLKLWENIFEKSYENTTQEELHLFTHNLVQRLYVLSKYPPSAFGSFLSKLLLHPKLHIIDKNILICGIDAFKFNAYLSSKELIDFSRNVRAAHTPQDLAALTRANVFASENILQKISKLVKN 1539 T 0.0033 DUF6076 pdb F Eukaryota T 6z1p 87 IC BM Q24E31_TETTS mS81 MIAKLFIRSSQRLIALNARFFSTNPANQFNNQETSSTYQNQRNNRREPSEFRRNNQERYQKREGEEQYRPRKESITWDEYFNLYATNKIHNISEAKFPYNFRGMQDFVPIKKEFDNMIFAKGENYADFCKQFDRRMVWFMKSLATNKDDDLPYSELNFLLQAKKGAELLRRNGYQINLIEDNEGFQKFGGKKGGQPFEITHIMGLNSNRTRNAGTDAYSIIRDLEEKGLIMFIGNQLKMDENGNYDFKLTRDDDKQMILRVHYKFATPYILQVTDSSGKVVSPPPESYIHTAVFENQLRLPPKFSRLDLHFLDWIKLYRIQNEWKLVDFDRFLSGNKLIYSEKEQRQLFKGEKDN 355 T 0.46 SID-1_RNA_chan pdbpercent F Eukaryota T 6z1p 88 JC BN Q22GF7_TETTS mS82 MAASQKVVNGIAKGLQEYVNPTKLAPFKKAVRDQMMEIEGLQAFEQGLYHNKDYENMIKQLVESRKAFRNSRSKTERQSIAKQQYDEWSKYVEIRKSQLTEDFQIPKNFQSQMDQVWGFVKNRKESTIHSSKMLDFHYELMNQFKFSIPIEPRLLVQMIHPHFGYLSNYPGNFTQEDILEVYKCKLVASMERVLGQDLLANEIAAYTYWKIYDKQAQGSFDLKTFGEFMKTFRFNLDGTAENFKQEFKFALQLHPGELSNDLQESDQLVRFDFYRYLFLERNL 283 T 0.0023 EF-hand_1 pdbpercent F Eukaryota T 6z1p 89 KC BO Q22N51_TETTS PARP alpha-helical domain-containing protein,mS83 DYKPIPFGVNVEDSVEYYNKQLESLEKHMPLNIFSCXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 142 T 9 DUF2102 pdbhh F Eukaryota T 6z1p 90 LC BP mS84,mS84 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXKVETLNEEDVAEKIIENLTDVKIIEKDSKRKVLKKTGVHKFKDIDSYIKETRSSRDKKK 100 T 0.00025 TraT pdbpssm F T 6z1p 91 MC BQ Q22UP3_TETTS mS85 MLRVVHNLGKSIKINLTSKNILRVSFSSDQKVSTEPESGLTFEQKAEIFERFSNSFVGIDRFKETQQTLKKIVEANYSQSAIKEELIQELKEVYGKNYEKILNLRFAVEYDGHKDGVAVGEFELFPKNLQDVNKFENYSKNGDLIKQLQQTTYISVDPKETHKYVVPKDSHFSLLLDEYIADEYVSELNDNQVCLFGFPLTCDESDITNLLNNEFKGNFTSSVIGEDILSLPAYVVLTFSSPKEAAEYKQKVNALQYTIEKRPIYATTFEDSRREHSTNRTLLVTGFKKNEYINDMLNLFSSFGSVMHFEIVEDPVHSKLPTTEQVIEYLKKTIKEGDDSVIYEITDFSDGAPSITEYPPFNPNTELMRDAKKVHEVKDEIELEKERQQQLAKRQLIPRIVLYSWDSESRVPEEYRMKNASEEQKKIIQSIENDLRTQYQNKQYLFVTYACTQQAQIAFHALNNLRNYEVTLKKSIEHYHCDTIHQTSVFKEIKQIKGDFKIKFNEQSLVKTEEEKVLAQTHQEMREKLLEQANSQQFTQELSKQLENEIIGKTKYGVEHQLHLKSDKLGRDFNSRVTQDDLNQLASQFQENQKKNLQTLEEVRKDEEELLNLYKAKVMYSDINKITHPYIEADKETIQKVEENYYQQQLKEYKLKQKEYMKEQQIREKWLEESKEMLEKKFVFGRNYKKKVIADKAGVDDVPDIKEPKEEEADQYYAPYNDYIQQKRYKKYLRYVDEMQRLYDGEYSEAMKNKIFVEGGKKTCDSDGNQFVTQNQGEVFNKILLSDEQFEMLKYYTSIADVLPNKRVQELSTMLEETPEETIYMMKQLKYPTKVFDRSKIPELDENSIPISNEDFVNDLNKYVSGLGQRYAVQKDARGDEKIVMYENTPHPVPLQALNVDEIQLLRDCLTTYGFDAEATEREIQYFIKHGDYSEEVLKIVGNEQTIDEESELEALINATGLTKAELESIMKLDLEKEGSNVLLSLQQQREELSLELSRATPQPKDLIKTNNTKLRNKDKQGRYKTSSFKLF 1032 T 0.24 Nab6_mRNP_bdg pdbhh F Eukaryota T 6z1p 93 OC BS A4VCP7_TETTS mS87 MIRSILKQVKGNLTKGNSFNAKLNEIPVRCFSSSTGEGNEGDAPKNQQEQQQQKDQQPQQQQQPLQNQGKNQKQFDNKRNFQNNQQGANADRNADKQKKNFTPFKQQSNNQNYRKREDGESDQQNQGGFRSNSQNQQNSTGFTSSQNQRNQPKKEVLSFNLKKAEDQSNRDSSNQQDNKQRPQRPQRENQNDQASEQSEGQSYQKSSTSSYGSEGMFLNQLFKSEQNKTEKQKGANQHQQLMKRIKSYEQNGNPSEQEMRMAIECYNSCGLYDKTIATFQTYKDNFVKGKQGVSLNETILNSVFESYLKNSSSKFNDVNEFFLIHFAQTKQLKLIYKDNIQNYISRVCIDPYLNLSQRIEFLSEFVNMFTNSQDADSLVTSSFNNIDLSSLSSLFGNADTQQYIQASKTLATLMNLSIQHNKGSFNQLGKNTEFNKVQFEIINTKKIFNNLLNLKQYEICREIVEALSRGNLLHSHTFTSKSSEQQKYIDYKDLIKFSLDFQVSVNYWIDVIAKKVDFESFTQLFSSAILELRMNQNPPKVFSIEQVYDFFYNISNYGALNPTQVYILMDISIYLKEYNLAIELFTHHQKYTNRRRDNFVYEKMIQVVNSINLRQQAGKSNKDHVLQAYRKLYTEAEQQTGKPIGFLNSKIYEIKSCILDQNHDSAYSIFNDRFLKESIANYQALKMKYFLLLLLEKQDERDLQYSEISINKFINGLAPKQLIQIWNDDRNDTYLAKQLANKENDYNEFVKRIEVEKYYSIIKRDINNGFYTIEDRRQNMDVMMNEYERYVIDNFEVLDPEYTLDGKLKGAVLDGILKIRKYEEVHLKKEERRKEKQSKKGEKESDSQTSQALIQHQKEMEDTQIEINKILGRPIDTPVNVPELYKKYDTEDKMNNKYIERYVKNLQQNEFNKEGRLYKMKFMNLDQYKEASQCYPELVEQAEILARPQIPSDVNLVFELMTWGAENQQPLAIQLGEIYCDLNGIPVPSSLVEKINKAIDPYNDNLDFINQITSAKRLIGDINRERVYQTYTHFSQKPKFAEQVNQLGEEDKYRDEATYVLQQEQAASRLTKYGIPKKYIMEQLKA 1086 T 2.2 FlgM unppercent F Eukaryota T 6z1p 95 QC BU Q22EB6_TETTS mS89 MMNRNLFKLLQLSTKSVSFNCTKLNYKFATFPSKEQMRFQKNMGYNGFQPNIAFKDDLYFPLDNPVQRQGLEDLINHIKVNPNLAIEGRGLCTILYIIAREGKDEPFIFKELERHLYKFKENLSPRLSFGGLYASYKSNLASPYQVSFFEDEFTRNSQQINAYEAIEILQTMFENTTKVNEHKIQYFHQSVKPIIVSNFSKQVRPYTGNLLKLFIGLRNMNIYDEELHELILKYLPYRRGLNNVKDIAEVYETLCDYKEKGILKQNIDAHIEALEKKLTTKDDCRWRYNLKEKRFYTYDELIANRDNYTIKDQLNHKYRFSNPELIEKFNLVQSDKDAIKAELEARERSRELENLVLEMFELKNRGEVAQTEDKNTLKGTYENVIFVKEGEELEEEEGAEEIDNEPAEEVDEGLDFDLKSSNKPKVKKEKGQKQKNKNN 439 T 0.95 L31 pdb F Eukaryota T 6z1p 96 RC BV Q24HL0_TETTS mS90 TNAKEYYDYLLRFTPQDERGYIKFHPGQFSKMVKIASTEEDIKSIRDAYYNFIGHKQKFTNAQVDRFLEKAAELKAAPLINEILINHNFLMYYPHSSVLHKLAEHYIQENNAEGLNELTRIYSNTHFLKLEDRTLELVSNYAIEQKNQGIILNVAQIAYRKVLTSINENTINNILTGIARQKIANPEPKEGTQNKVETKFLAHLKKCSQSYHTIIGRAYLALANXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 310 T 0.079 YBD unppssm F Eukaryota T 6z1p 98 TC BX Q951B8_TETTH Ribosomal protein S3 MGLKSLPMLNKSGISMYWHNIWDSIKLYKKYSLSFLFLNEVINHFLNENLYYYCIMKIRPTDPRLKGFRGNKSININKIKKSWNMRHFYLGKILFLKYQGWVLVLINYYCSRRNKLYINYKSFKAFKKIAKSFRQGVTSYVYKMDKYKFKF 151 T 0.058 Rib_hydrolayse unppercent F Eukaryota T 6z2d 1 A A B2UR60_AKKM8 O-glycan protease EVTVPDALKDRIALKKTARQLNIVYFLGSDTEPVPDYERRLSELLLYLQQFYGKEMQRHGYGARSFGLDIKSPGRVNIIEYKAKNPAAHYPYENGGGWKAAQELDEFFKAHPDRKKSQHTLIIMPTWNDEKNGPDNPGGVPFYGMGRNCFALDYPAFDIKHLGQKTREGRLLTKWYGGMAHELGHGLNLPHNHQTASDGKKYGTALMGSGNYTFGTSPTFLTPASCALLDACEVFSVTPSQQFYEGKPEVEVGDVAISFKGDQILVSGNYKSPQTVKALNVYIQDPPYAVNQDYDAVSFSRRLGKKSGKFSMKIDKKELEGLNNNEFRISLMFILANGLHMQKHFTFHWDALQDYRDGSKSGSGHHHHHH 370 T 0.00025 Metallopep pdbpercent F Bacteria T 6z2i 1 A A de novo designed TIM barrel DeNovoTIM6 MDKDEAWKQVEILRRLGAKQIAYRSDDWRDLQEALKKGGDILIVDATDKDEAWKQVEILRRLGAKQIAYRSDDWRDLQEALKKGGDILIVDATDKDEAWKQVEILRRLGAKQIAYRSDDWRDLQEALKKGGDILIVDATDKDEAWKQVEILRRLGAKQIAYRSDDWRDLQEALKKGGDILIVDATGLEHHHHHH 194 T 0.00021 NanE pdbhh F T 6z2o 1 A A B2UR60_AKKM8 O-glycan protease MEVTVPDALKDRIALKKTARQLNIVYFLGSDTEPVPDYERRLSELLLYLQQFYGKEMQRHGYGARSFGLDIKSPGRVNIIEYKAKNPAAHYPYENGGGWKAAQELDEFFKAHPDRKKSQHTLIIMPTWNDEKNGPDNPGGVPFYGMGRNCFALDYPAFDIKHLGQKTREGRLLTKWYGGMAHELGHGLNLPHNHQTASDGKKYGTALMGSGNYTFGTSPTFLTPASCALLDACEVFSVTPSQQFYEGKPEVEVGDVAISFKGDQILVSGNYKSPQTVKALNVYIQDPPYAVNQDYDAVSFSRRLGKKSGKFSMKIDKKELEGLNNNEFRISLMFILANGLHMQKHFTFHWDALQDYRDGSKSGSGHHHHHH 371 T 0.00025 Metallopep pdbpercent F Bacteria T 6z2p 1 A A B2UR60_AKKM8 O-glycan protease MEVTVPDALKDRIALKKTARQLNIVYFLGSDTEPVPDYERRLSELLLYLQQFYGKEMQRHGYGARSFGLDIKSPGRVNIIEYKAKNPAAHYPYENGGGWKAAQELDEFFKAHPDRKKSQHTLIIMPTWNDEKNGPDNPGGVPFYGMGRNCFALDYPAFDIKHLGQKTREGRLLTKWYGGMAAALGHGLNLPHNHQTASDGKKYGTALMGSGNYTFGTSPTFLTPASCALLDACEVFSVTPSQQFYEGKPEVEVGDVAISFKGDQILVSGNYKSPQTVKALNVYIQDPPYAVNQDYDAVSFSRRLGKKSGKFSMKIDKKELEGLNNNEFRISLMFILANGLHMQKHFTFHWDALQDYRDGSKSGSGHHHHHH 371 T 0.0033 Metallopep unppercent F Bacteria T 6z2p 2 B C DROS_DROME Glycodrosocin GKPRPYSPRPTSHPRPIRV 19 T 0.0059 DIM unppercent F Eukaryota T 6z3f 1 A P Chains: P CDIHVXWEWDCFEKLX 16 T 8.9 NIPSNAP pdbhh F T 6z3r 4 D E RENT1_HUMAN ATP-DEPENDENT HELICASE RENT1,NONSENSE MRNA REDUCING FACTOR 1,NORF1,UP-FRAMESHIFT SUPPRESSOR 1 HOMOLOG,HUPF1 QPELSQDSYLG 11 T 0.61 DUF4629 pdbhh F Eukaryota T 6z3u 3 C,F C,F G0SF48_CHATD RING-type domain-containing protein GPYDPFGGMEFVPSRYRVREELNHPSLDKYRIDQQHITGGYSFLDYISRAMFEAFAGLAVFIEDEKEAG 69 T 3.1 MotCF pdbhh F Eukaryota T 6z41 1 A A B3PJ79_CELJU Carbohydrate binding protein, putative, cpb33A MGNCISPVYVDGSSYANNALVQNNGSEYRCLVGGWCTVGGPYAPGTGWAWANAWELVRSCQAHHHHHH 68 T 0.19 P2X_receptor pdbpercent F Bacteria T 6z5s 1 A W Q6N1K3_RHOPA Light harvesting complex 1 Protein W MMLLLVLTAIAFVATAVVARVLAASAPEGKLYCQAAGAASMVVGPFITLVAAFVLGKAGIGGEVLDATAMLRVAALPAFGTLFVGPVVFWFFRRQRRTVAAA 102 T 0.54 DUF4229 pdbhh F Bacteria T 6z5y 1 A,B A,B D0N2F7_PHYIT Lytic Polysaccharide Monooxygenase HGYIAKPAPSWKASKTNNWVVEIEPQWKGGWDESKGDEGLLATFKELAPKNNFKDVRSLMDGNPVFGEECGFTDPKGKPSEPPSDGTATFSRGIVHAGPCEIWLDDKMVLQNDDCQSAYGDGTQQTIAVFKPVDYSSCAAGGCMLRFYWLALQRLKGKTVWQAYKNCIPLTGWSHPQFEK 180 T 0.0049 PA14 pdb F Eukaryota T 6z6e 1 A,B,C A,B,C Q9MBW4_BPHK7 Terminase small subunit ADKRIRSDSSAAAVQAMKNAAVDTIDPPSHAGLEKKAEPFWHDNIRSKALDSWTPADLLAAVELANNQLYITVLRKDLRKEERIRGEERDEGLIKDLRKQIVELQRTILAQRRDLQIHSHATNGESRDQKKRNQNDRDARNTKNEHQDQDDNLIAFPKHG 160 T 0.011 Terminase_4 unppercent T Viruses T 6z6j 2 B C5 LSO2_YEAST LATE-ANNOTATED SMALL OPEN READING FRAME 2 MGKRFSESAAKKAAGLARKRDQAHAKQRAQMEQLEAEEASKWEQGSRKENAKKLEEEQKRQEKARAKKERDALLTAEEEQLGKGGKGKRKMK 92 T 4 F-protein pdb F Eukaryota T 6z6k 2 B C5 LSO2_YEAST LATE-ANNOTATED SMALL OPEN READING FRAME 2 GKRFSESAAKKAAGLARKRDQAHAKQRAQMEQLEAEEASKWEQGSRKENAKKLEEEQKRQEKARAKKERDALLTAEEEQLGKGGKGKRKMK 91 T 3.8 F-protein pdb F Eukaryota T 6z8d 1 A,B A,B CAPSD_HPBVH Capsid protein precursor MKQNDTKKTTQRRNSKKYSSKTNRGTKRAPRDQEVGTGAQESTRNDVAWYARYPHILEEATRLPFAYPIGQYYDTGYSVASATEWSKYVDTSLTIPGVMCVNFTPTPGESYNKNSPINIAAQNVYTYVRHMNSGHANYEQADLMMYLLAMDSLYIFHSYVRKILAISKLYTPVNKYFPRALLVALGVDPEDVFANQAQWEYFVNMVAYRAGAFAAPASMTYYERHAWMSNGLYVDQDVTRAQIYMFKPTMLWKYENLGTTGTKLVPLMMPKAGDNRKLVDFQVLFNNLVSTMLGDEDFGIMSGDVFKAFGADGLVKLLAVDSTTMTLPTYDPLILAQIHSARAVGAPILETSTLTGFPGRQWQITQNPDVNNGAIIFHPSFGYDGQDHEELSFRAMCSNMILNLPGEAHSAEMIIEATRLATMFQVKAVPAGDTSKPVLYLPNGFGTEVVNDYTMISVDKATPHDLTIHTFFNNILVPNAKENYVANLELLNNIIQFDWAPQLYLTYGIAQESFGPFAQLNDWTILTGETLARMHEVCVTSMFDVPQMGFNK 552 T 55 DUF4549 pdbhh T Viruses T 6z8e 1 A,B A,B CAPSD_HPBVH Capsid protein precursor MRGSHHHHHHGMASMTGGQQMGRDLYDDDDKDRWGSMKQNDTKKTTQRRNSKKYSSKTNRGTKRAPRDQEVGTGAQESTRNDVAWYARYPHILEEATRLPFAYPIGQYYDTGYSVASATEWSKYVDTSLTIPGVMCVNFTPTPGESYNKNSPINIAAQNVYTYVRHMNSGHANYEQADLMMYLLAMDSLYIFHSYVRKILAISKLYTPVNKYFPRALLVALGVDPEDVFANQAQWEYFVNMVAYRAGAFAAPASMTYYERHAWMSNGLYVDQDVTRAQIYMFKPTMLWKYENLGTTGTKLVPLMMPKAGDNRKLVDFQVLFNNLVSTMLGDEDFGIMSGDVFKAFGADGLVKLLAVDSTTMTLPTYDPLILAQIHSARAVGAPILETSTLTGFPGRQWQITQNPDVNNGAIIFHPSFGYDGQDHEELSFRAMCSNMILNLPGEAHSAEMIIEATRLATMFQVKAVPAGDTSKPVLYLPNGFGTEVVNDYTMISVDKATPHDLTIHTFFNNILVPNAKENYVANLELLNNIIQFDWAPQLYLTYGIAQESFGPFAQLNDWTILTGETLARMHEVCVTSMFDVPQMGFNK 588 T 35 Protamine_3 pdbhh T Viruses T 6z8f 1 A,B A,B CAPSD_HPBVH Capsid protein precursor ESTRNDVAWYARYPHILEEATRLPFAYPIGQYYDTGYSVASATEWSKYVDTSLTIPGVMCVNFTPTPGESYNKNSPINIAAQNVYTYVRHMNSGHANYEQADLMMYLLAMDSLYIFHSYVRKILAISKLYTPVNKYFPRALLVALGVDPEDVFANQAQWEYFVNMVAYRAGAFAAPASMTYYERHAWMSNGLYVDQDVTRAQIYMFKPTMLWKYENLGTTGTKLVPLMMPKAGDNRKLVDFQVLFNNLVSTMLGDEDFGIMSGDVFKAFGADGLVKLLAVDSTTMTLPTYDPLILAQIHSARAVGAPILETSTLTGFPGRQWQITQNPDVNNGAIIFHPSFGYDGQDHEELSFRAMCSNMILNLPGEAHSAEMIIEATRLATMFQVKAVPAGDTSKPVLYLPNGFGTEVVNDYTMISVDKATPHDLTIHTFFNNILVPNAKENYVANLELLNNIIQFDWAPQLYLTYGIAQESFGPFAQLNDWTILTGETLARMHEVCVTSMFDVPQMGFNK 512 T 30 DUF4549 pdbhh T Viruses T 6z9v 3 C,F C,F ILE-ILE-GLY-TRP-MET-TRP-ILE-PRO-VAL IIGWMWIPV 9 T 0.94 Acyl-CoA_dh_C pdbhh F T 6z9x 3 C,F C,F LEU-LEU-SER-TUR-PHE-GLY-THR-PRO-THR LLSXFGTPT 9 T 1.9 DUF6120 pdbhh F T 6zbr 1 A P Chains: P CDIHVXWEWKCFEEL 15 T 2.9 Metal_hydrol pdbhh F T 6zbt 2 E,F,G,H E,F,G,H NED4L_HUMAN HECT-TYPE E3 UBIQUITIN TRANSFERASE NED4L,NEDD4.2,NEDD4-2 LRSCSVTDAV 10 T 6.9 YodL pdbhh F Eukaryota T 6zc9 2 E,F,G,H E,F,G,H NED4L_HUMAN HECT-TYPE E3 UBIQUITIN TRANSFERASE NED4L,NEDD4.2,NEDD4-2 PRSLSSPTVT 10 T 39 ASF1_hist_chap pdbhh F Eukaryota T 6zcd 1 A P Derived from V114 peptide CDIHVXWEWKCFEDL 15 T 3.5 WWE pdbhh F T 6zcj 2 B P LCP2_HUMAN SLP76pS376 FPQSASLPPY 10 F F Eukaryota T 6zd3 1 A,B B,A YTH domain containing 1 MHHHHHHSSGRENLYFQGTSKLKYVLQDARFFLIKSNNHENVSLAKAKGVWSTLPVNEKKLNLAFRSARSVILIFSVRESGKFQGFARLSSESHHGGSPIHWVLPAGMSAKALGGVFKIDWICRRELPFTKSAHLTNPWNEHKPVKIGRDGQEIELECGTQLCLLFPPDESIDLYQVIHKMRH 183 T 8.6E-31 YTH pdbhh F T 6zd4 1 A,B A,B YTH domain containing 1 MHHHHHHSSGRENLYFQGTSKLKYVLQDARFFLIKSNNHENVSLAKAKGVWATLPVNEKKLNLAFRSARSVILIFSVRESGKFQGFARLSSESHHGGSPIHWVLPAGMSAKMLGGVFKIDWICRRELPFTKSAHLTNPWNEHKPVKIGRDGQEIELECGTQLCLLFPPDESIDLYQVIHKMRH 183 T 3.8999999999999997E-31 YTH pdbhh F T 6zdw 1 A,B AAA,BBB A0A162MUP0_MUCCL DRBM domain-containing protein MTDTDTVEQFIHTIFARVTDDHGRPVDITAALPLLKQILTGYTQEVAEHKFNYIGESAVQFAMHLILADHFSKYENGCLSAIAKKYTVPLQLYKLIGKQIHLKEYVRPVYLKETLDMIVGILFRCYGITAVYKFIQEEFILLVNQDINNANSPKKPSSPSLSTNQADNPVKLLHELIQAKSGTLEAEAHETEDKKWEVKIVAKLNEKALPFSHARTNASKQKAKTEASRDILTYFTNYPDVCQHLQVPVEGEVEIHVLPISENDYCHLFAET 272 T 7.5E-05 Ribonucleas_3_3 unphh F Eukaryota T 6zef 2 C,D C,D PHAR1_HUMAN Phosphatase and actin regulator GPLGSRKILIRFSDYVEVADAQDYDRRADKPWTRLTAADKAAIRKELNEFKSTEMEVHELSRHLTRFHRP 70 T 2.3 DUF6344 pdbhh F Eukaryota T 6zh1 1 A A W5SB08_BORHE FACTOR H-BINDING PROTEIN A MAHHHHHHVDDDDKDLFNKNKKLDADLLKTLDNLLKTLDNNQKQALIYFKDKLQDKKYLNDLMEQQKSFLDNLQKKKEDPDLQDRLKKTLNSEYDESQFNKLLNELGNAKAKQFLQQLHIMLQSIKDGTLTSFSSSNFNDLQNLEQKKERALQYINGKLYVEYYFYINGISNADNFFETIMEYLKT 186 T 0.0047 FUT8_N_cat unppssm F Bacteria T 6zhx 7 K,L K,L CHD1L_HUMAN AMPLIFIED IN LIVER CANCER PROTEIN 1 EKASQEGRSLRNKGSVLIPGLVEGSTKRKRVLSPEEK 37 T 0.47 ATP-synt_E unp F Eukaryota T 6zj3 80 BC Lo Ribosomal protein eLEgr1 MAEVELVSVPECKAQTVDKHVLWSCINFGTSNVALIDPYHPAHRGARKYINQFHSGKVPKTAAKAKEAKAEE 72 T 7.9 MRP_L53 pdbhh F T 6zj3 97 SC L6 Ribosomal protein eLEgr2 MGGDDFEKKPLPDCLKELHEKQQAKLAKSKENYTPPKYNTPRKTTRERLNRRAQIKAALQRKKDKLKAE 69 T 14 NepR pdbhh F T 6zj3 98 TC L7 Ribosomal protein eLEgr3 MPLKNNCFRRVYHSNWEYLLSLEKEADAEPKQKALRYKQEKKQQFREKGLKLAAAKTAEAAKSA 64 T 17 Phage_antiter_Q pdbhh F T 6zm9 1 A A Chains: A SLMERLGGGGFSARIFVGLNVGDKPTYTIEDVVKDTIAIRKRQGILPDASFVAQRGVYTEQRSGQLVTENSVQIIIIDLEGLSKEDFTGKVQALGKELREDFKQESVIVEIQERGIVQDVYSITAEWYEEGPMRPLRVDLQPSLIS 146 T 7.1E-05 DUF3574 pdbhh F T 6zn3 1 A,D,G,J,M A,D,G,J,M Q8IJM4_PLAF7 Myosin essential light chain ELC SMASDMEEKFREAFILFSSCSDHIEMYKFFELMNSFGIILTNDEKAALPNDINMDYWLNFAKKHYNYEQPFKHINNVNEQNTNVQIKIDNFLGIMKALDTRLTESDLNILLQITNPENKSTLNLKTVSQKLTESI 135 T 0.024 Na_Ca_ex_C unppercent F Eukaryota T 6zn3 3 C,F,I,L,O C,F,I,L,O MYOA_PLAF7 PFM-A SVEWENCVSVIEAAILKHKYKQKVNKNIPSLLRVQAHIRKKMV 43 T 0.00021 IQ unppercent F Eukaryota T 6znl 10 R Y A0A4X1TB62_PIG DYNACTIN SUBUNIT 4 MASLLQSERVLYLVQGEKKVRAPLSQLYFCRYCSELRSLECVSHEVDSHYCPSCLENMPSAEAKLKKNRCANCFDCPGCMHTLSTRATSISTQLPDDPAKTAVKKAYYLACGFCRWTSRDVGMADKSVASGGWQEPDHPHTQRMNKLIEYYQQLAQKEKVERDRKKLARRRNYMPLAFSQHTIHVVDKYGLGTRLQRPRAGTTITALAGLSLKEGEDQKEIKIEPAQAVDEVEPLPEDYYTRPVNLTEVTTLQQRLLQPDFQPICASQLYPRHKHLLIKRSLRCRQCEHNLSKPEFNPTSIKFKIQLVAVNYIPEVRIMSIPNLRYMKESQVLLTLTNPVENLTHVTLLECEEGDPDDTNSTAKVSVPPTELVLAGKDAAAEYDELAEPQDFPDDPDVVAFRKANKVGVFIKVTPQREEGDVTVCFKLKHDFKNLAAPIRPVEEADPGAEVSWLTQHVELSLGPLLP 467 T 3.9000000000000004E-28 Dynactin_p62 pdbpercent F Eukaryota T 6zop 1 A A A0DFJ7_PARTE DDE_Tnp_1_7 domain-containing protein GPLGSPEFSYFAKIQPHTFIEGEEIVKCSECGNETKVFCQECTILKAEVVGLCHEKDTIKCQRFHEFMDFELDKNKEVIDKRKG 84 T 0.011 UPF0167 pdb F Eukaryota T 6zpj 1 A,B A,B E9AN40_LEIMU LEISHMANIA MEXICANA KKT4 GSTANKLTEAQRRIAELEKELQRTTQRVDQLSDVVQQQKDELQAAKDRHALEMEETRHAYNAVIHRKDEVQEEALRQLLKSRQLMVSAARYEAVVAAKKLHAQEFELGAPAGRQACGRIMLKSNRK 126 T 0.0076 SEN1_N pdb F Eukaryota T 6zpm 1 A,B A,B A0A2V2WCI2_TRYCR Trypanosoma cruzi KKT4 117-218 GSSLQRYEKLVKECRRLEEELEQKTHEASDASQRVRQLERETTRLMRRVEQLVSAVEGQKQKLDETEAKHKLELAEIENRHELEIQSKMSSHEEALRRLMDARR 104 T 0.0004 AAA_13 pdbpssm F Eukaryota T 6zpp 1 A A A0A151GCU7_9HYPO VIRULENCE FACTOR SRLSNAFVLATTASAAAVPSPALPADDILLAINQSLRLVDSRAAMLVSQVRHGAINNVGSLADSYHELIFSLRGAVRAVDDVWRPLPKDAPMRIVESLRPFQKIPASLRSALKERLDAIAERPGGCQAVDDNNRQLGLDFDRLYWEIASSSSFSAIHETVSSQQKQFETAMRELTDEFSSRCLRRAQASA 190 T 0.2 FliT pdb F Eukaryota T 6zqt 2 B,D C,D RBP1_HUMAN RALBP1,76 KDA RAL-INTERACTING PROTEIN,DINITROPHENYL S-GLUTATHIONE ATPASE,DNP-SG ATPASE,RAL-INTERACTING PROTEIN 1 GPLGSETQAGIKEEIRRQEFLLNSLHRDLQGGIKDLSKEHRLWEVLRILTALRRKLREA 59 T 0.01 G10 pdbpercent F Eukaryota T 6zrc 2 B,C P,Q MTA1_HUMAN macrocyclic peptide based on residues 659-672 of the metastasis-associated protein MTA1 XCTKRAARRPYKPCAX 16 T 0.74 MTA_R1 unp F Eukaryota T 6zrd 2 C,D P,Q MTA1_HUMAN macrocyclic peptide based on residues 659-672 of the metastasis-associated protein MTA1 XCTKRCARRPYKPCAX 16 T 1.9 Toxin_27 pdbhh F Eukaryota T 6zrn 2 C,D C,D RBP1_HUMAN RALBP1,76 KDA RAL-INTERACTING PROTEIN,DINITROPHENYL S-GLUTATHIONE ATPASE,DNP-SG ATPASE,RAL-INTERACTING PROTEIN 1 GPLGSETQAGIKEEIRRQEFLLNSLHRDLQGGIKDLSKESRMWEVLRILTALRRKLREA 59 T 0.01 G10 pdbpercent F Eukaryota T 6zrw 1 A,B,C,D,E,F A,B,C,D,E,F B3VS76_COPCI Mucin-binding lectin 1 AIFHTGSELFIITRGPGKLTLLTWGGLNNLRSVIGAIPTENTGVTKWAVSFSHNYTRFSFIWEGQGEACYQIGNGLTRSPVGRSWSSSSTIHWGSSTVITEDVTSVVPGAVNRDKVTTAYALPDNL 126 T 41 IMS_HHH pdbhh F Eukaryota T 6zsu 1 A,B A,B A0A0K1ECI7_CHOCO CGNE GMGGRRTIGIRSGEGAIMNASDFYALLRGRGMPVVVDDAEAAAVVSELGFRTVPFEAFDFDSPSEDPALVIVAQMGNVDALHGLWERSGTPLMHLALAKFDGGLSRLRAGLARVLAVDTDAALKRRAEAYEQLFSSASVEIASGEGVLRCHIGDEVEVGNCGDTLEQGFLYSVAEFLEASVVNLEGERSTFWVEGELPFDGFIHLSNSAALKERWGGMLDEFMRRSREGANLVRFADNVIDRLVVGGVDVTSALAGLSQGEERGMAATEFGLGCADAEAAEPFGVNSLLHKSAGGAYIGIGKGLRIPHIDFIARGATIRFIPAAEG 326 T 0.069 DUF2806 pdbpercent F Bacteria T 6zsv 1 A,B A,B A0A0K1EBZ5_CHOCO Uncharacterized protein GAMADIGSMDVLEYFERLKNRELAFVLDDLQLSDMVTRRGFSVIPFDDFDLAREDHPPAFVLVTRLDYHGKLMQAWETAKGISSHLSLAKFDTSPKSVEYSLDQLLSMDFAETLKRRGDYYDSVASTNRMEVVTPGAVLTCDFGNEIEIANNDVEMQKGWLYSVAEFFETSVINLEADRSSYTLNGDLCFTGLIYLCNRPDLKERASATMDELMRMSTRGRNVVSFVDNQIVRMELGGVDMTATLRELIVGKEREGSSTEFAMGCVEYPLAQDWTINSVMNEGSHGIHVGVGMGKEIPHMDFIAKGAELRIAESSDA 317 T 0.055 YgbA_NO unppssm F Bacteria T 6zsy 1 A A GRND_DROME Protein grindelwald GESRDCHGTICHPVNEFCYVATERCHPCIEVCNNQTHNYDAFLCAKECSAYK 52 T 0.22 MSSP pdbpercent F Eukaryota T 6zt1 1 A,B,C,D,E,F,G,H,I,J,K,L A,B,C,D,E,F,G,H,I,J,K,L CC-Type2-(LaIdGe)4 XGEIGQALKEIGKALKEIGXALKEIGQALKGX 32 T 0.013 ApoC-I pdbpssm F T 6zts 1 A,B A,B LMBD1_REOVL Lambda-1 VSRIVVHTRWASNVDFDRDSSVIMAPPTENNIHLFKQLLNTETLSVRGANPLMFRANVLHMLLEFVLDNLYLNRHTGFSQDHTPFTEGANLRSLPGPDAEKWYSIMYPTRMGTPNVSKICNFVASCVRNRVGRFDRAQMMNGAMSEWVDVFETSDALTVSIRGRWMARLARMNINPTEIEWALTECAQGYVTVTSPYAPSVNRLMPYRISNAERQISQIIRIMNIGNNATVIQPVLQDISVLLQRISPLQIDPTIISNTMSTVSESTTQTLSPASSILGKLRPSNSDFSSFRVALAGWLYNGVVTTVIDDSSYPKDGGSVTSLENLWDFFILALALPMTTDPCAPVKAFMTLANMMVGFETIPMDNQIYTQSRRASAFSTPHTWPRCFMNIQLISPIDAPILRQWAEIIHRYWPNPSQIRYGAPNVFGSANLFTPPEVLLLPIDHQPANVTTPTLDFTNELTNWRARVCELMKNLVDNQRYQPGWTQSLVSSMRGTLDKLKLIKSMTPMYLQQLAPVELAVIAPMLPFPPFQVPYVRLDRDRVPTMVGVTRQSRDAITQPALSLSTTNTTVGVPLALDARAITVALLSGKYPPDLVTNVWYADAIYPMYADTEVFSNLQRDMITCEAVQTLVTLVAQISDTQYPVDRYLDWIPSLRASAATAATFAEWVNTSLKTAFDLSDMLLEPLLSGDPRMTQLAIQYQQYNGRTFDVIPEMPGSVIADCVQLTAEVFNHEYNLFGIARGDIIIGRVQSTHLWSPLAPPPDLVFDRDTPGVHIFGRDCRISFGMNGAAPMIRDETGMMVPFEGNWIFPLALWQMNTRYFNQQFDAWIKTGELRIRIEMGAYPYMLHYYDPRQYANAWNLASAWLEEITPTSIPSVPFMVPISSDHDISSAPAVQYIISTEYNDRSLFCTNSSSPQTIAGPDKHIPVERYNILTNPDAPPTQIQLPEVIDLYNVVTRYAYETPPITAVVMGVP 975 T 25 Peptidase_C36 pdbhh T Viruses T 6ztz 1 A B LMBD1_REOVL LAMBDA 1 NKKTAQLLHADTPRLVTWDAGLCTSFKIVPIVPAQVPQDVLAYTFFTSSYAIQSPFPEAAVSRIVVHTRWASNVDFDRDSSVIMAPPTENNIHLFKQLLNTETLSVRGANPLMFRANVLHMLLEFVLDNLYLNRHTGFSQDHTPFTEGANLRSLPGPDAEKWYSIMYPTRMGTPNVSKICNFVASCVRNRVGRFDRAQMMNGAMSEWVDVFETSDALTVSIRGRWMARLARMNINPTEIEWALTECAQGYVTVTSPYAPSVNRLMPYRISNAERQISQIIRIMNIGNNATVIQPVLQDISVLLQRISPLQIDPTIISNTMSTVSESTTQTLSPASSILGKLRPSNSDFSSFRVALAGWLYNGVVTTVIDDSSYPKDGGSVTSLENLWDFFILALALPLTTDPCAPVKAFMTLANMMVGFETIPMDNQIYTQSRRASAFSTPHTWPRCFMNIQLISPIDAPILRQWAEIIHRYWPNPSQIRYGAPNVFGSANLFTPPEVLLLPIDHQPANVTTPTLDFTNELTNWRARVCELMKNLVDNQRYQPGWTQSLVSSMRGTLDKLKLIKSMTPMYLQQLAPVELAVIAPMLPFPPFQVPYVRLDRDRVPTMVGVTRQSRDTITQPALSLSTTNTTVGVPLALDARAITVALLSGKYPPDLVTNVWYADAIYPMYADTEVFSNLQRDMITCEAVQTLVTLVAQISETQYPVDRYLDWIPSLRASAATAATFAEWVNTSMKTAFDLSDMLLEPLLSGDPRMTQLAIQYQQYNGRTFNVIPEMPGSVIADCVQLTAEVFNHEYNLFGIARGDIIIGRVQSTHLWSPLAPPPDLVFDRDTPGVHIFGRDCRISFGMNGAAPMIRDETGMMVPFEGNWIFPLALWQMNTRYFNQQFDAWIKTGELRIRIEMGAYPYMLHYYDPRQYANAWNLTSAWLEEITPTSIPSVPFMVPISSDHDISSAPAVQYIISTEYNDRSLFCTNSSSPQTIAGPDKHIPVERYNILTNPDAPPTQIQLPEVVDLYNVVTRYAYETPPITAVVMGVP 1035 T 27 Peptidase_C36 pdbhh T Viruses T 6ztz 2 B C LMBD1_REOVL LAMBDA 1 AGLCTSFKIVPIVPAQVPQDVLAYTFFTSSYAIQSPFPEAAVSRIVVHTRWASNVDFDRDSSVIMAPPTENNIHLFKQLLNTETLSVRGANPLMFRANVLHMLLEFVLDNLYLNRHTGFSQDHTPFTEGANLRSLPGPDAEKWYSIMYPTRMGTPNVSKICNFVASCVRNRVGRFDRAQMMNGAMSEWVDVFETSDALTVSIRGRWMARLARMNINPTEIEWALTECAQGYVTVTSPYAPSVNRLMPYRISNAERQISQIIRIMNIGNNATVIQPVLQDISVLLQRISPLQIDPTIISNTMSTLSPASSILGKLRPSNSDFSSFRVALAGWLYNGVVTTVIDDSSYPKDGGSVTSLENLWDFFILALALPLTTDPCAPVKAFMTLANMMVGFETIPMDNQIYTQSRRASAFSTPHTWPRCFMNIQLISPIDAPILRQWAEIIHRYWPNPSQIRYGAPNVFGSANLFTPPEVLLLPIDHQPANVTTPTLDFTNELTNWRARVCELMKNLVDNQRYQPGWTQSLVSSMRGTLDKLKLIKSMTPMYLQQLAPVELAVIAPMLPFPPFQVPYVRLDRDRVPTMVGVTRQSRDTITQPALSLSTTNTTVGVPLALDARAITVALLSGKYPPDLVTNVWYADAIYPMYADTEVFSNLQRDMITCEAVQTLVTLVAQISETQYPVDRYLDWIPSLRASAATAATFAEWVNTSMKTAFDLSDMLLEPLLSGDPRMTQLAIQYQQYNGRTFNVIPEMPGSVIADCVQLTAEVFNHEYNLFGIARGDIIIGRVQSTHLWSPLAPPPDLVFDRDTPGVHIFGRDCRISFGMNGAAPMIRDETGMMVPFEGNWIFPLALWQMNTRYFNQQFDAWIKTGELRIRIEMGAYPYMLHYYDPRQYANAWNLTSAWLEEITPTSIPSVPFMVPISSDHDISSAPAVQYIISTEYNDRSLFCTNSSSPQTIAGPDKHIPVERYNILTNPDAPPTQIQLPEVVDLYNVVTRYAYETPPITAVVMGVP 1008 T 26 Peptidase_C36 pdbhh T Viruses T 6zuf 2 C,D C,D C2 foldamer/peptide hybrid inhibitor of histone chaperone ASF1 EKXXXXRIA 9 T 73 YcbB pdbhh F T 6zuq 1 A A A0A1P8YXI8_PASFU Extracellular protein 11-1 GHMLDCKAVALKWVHQFRIPGGDNCNFYCSYDSLYQQFNLWKKNDACQGADGFSTAIPKIQEAPCSDCPGSKTCICSVQATAWRVRNGKWFDGQQWFDCDVKPYTERVLGRRWYDESEADKDIYVGYYSRGFISNDNVHCGSQ 143 T 0.047 TSGP1 unphh F Eukaryota T 6zvb 2 B P GAB2_HUMAN phosphorylated Gab2pT391 peptide IPRRNTLPAMDNS 13 T 36 TbpB_A pdbhh F Eukaryota T 6zvf 3 C P LEG3_HUMAN GAL-3,35 KDA LECTIN,CARBOHYDRATE-BINDING PROTEIN 35,CBP 35,GALACTOSE-SPECIFIC LECTIN 3,GALACTOSIDE-BINDING PROTEIN,GALBP,IGE-BINDING PROTEIN,L-31,LAMININ-BINDING PROTEIN,LECTIN L-29,MAC-2 ANTIGEN XQAPPGAYPG 10 T 0.76 HMMR_N pdbhh F Eukaryota T 6zvh 36 JA y LYAR_HUMAN Cell growth-regulating nucleolar protein KFNWKGTIKAILKQAPDNEITIKKLRKKVLAQYYTVTDEHHRSEEELLVIFNKKISKNPTFKLLKDKVKLVK 72 T 0.0019 DEK_C pdb F Eukaryota T 6zvq 2 B B SKI_HUMAN PROTO-ONCOGENE C-SKI FQPHPGLQKTLEQFHLSSMSSLGGPAAFSARWAQE 35 T 120 DUF2520 pdbhh F Eukaryota T 6zwk 2 G,H,I,J,K,L G,H,I,J,K,L H2AX_HUMAN H2A/X,HISTONE H2A.X CKATQASQEY 10 T 13 Class_IIIsignal pdbhh F Eukaryota T 6zx9 2 B B DCAF1_HUMAN HIV-1 VPR-BINDING PROTEIN,VPRBP,SERINE/THREONINE-PROTEIN KINASE VPRBP,VPR-INTERACTING PROTEIN MAPINFTSRLNRRASFPKYGGVDGGCFDRHLIFSRFRPISVFREANEDESGFTCCAFSARERFLMLGTCTGQLKLYNVFSGQEEASYNCHNSAITHLEPSRDGSLLLTSATWSQPLSALWGMKSVFDMKHSFTEDHYVEFSKHSQDRVIGTKGDIAHIYDIQTGNKLLTLFNPDLANNYKRNCATFNPTDDLVLNDGVLWDVRSAQAIHKFDKFNMNISGVFHPNGLEVIINTEIWDLRTFHLLHTVPALDQCRVVFNHTGTVMYGAMLQADDEDDLMEERMKSPFGSSFRTFNATDYKPIATIDVKRNIFDLCTDTKDCYLAVIENQGSMDALNMDTVCRLYEVGRQRLAEELALVPRG 360 T 0.0003 ANAPC4_WD40 pdbpssm F Eukaryota T 6zyg 1 A A A0A1X9WII3_9GAMM Protealysin-associated protein MKPLPVLNQDTVIELAREGGFAFIPKLAGQRRIALADITPEQRQRLNQLLNQTLPYAQEEGQPDSPGSGDQRYFRVQISYYSQTLRSEIVLLIPETSAPQALVDLWKTGQVDE 113 T 4.8 DUF3500 unphh F Bacteria T 6zyw 17 Q Y Q22YU3_TETTS Shulin MFNFFSSANINQNIPKYSVNDFVFRLKKIEKIVVKEGLDGFLLINGVDSRENTEYVKLTNWLFLGNSGLEIEENEYLNQIYSDMIVLIKKGTTHIFIDPEALNSLQTLIYSIPNVDVFCPTEKQYEDKDEMELLKMAFFLRVMKPTKKVGILLGQKDKGKINSIEKWPLIQSYGLEELGVGFFSMNHEVVDLTLRLNAVYKNYDKFFVSKLIYVVAKRLTGHFNSAAGQLGDMKMHKRNLATESQLTEIFRDTYEIEEISKWVQIRGVNAALPKPRVLFGKNTSADCSKEPSVAPLKDLKYSETFHSFHATFETFDLRTCLRAARTYFLAKGVKEERNLITLNDDEGVPQGYELNIDENQQYKDQDFLANLYLSIIIGFNEVMQLITKDYKNMTEEFIQDYIFQKVSKVYAGFQIPESEITLDKIQIILKAYNSFGEEVKIDFKDTISFKLTPYFFMVRIEQKNIKSQILNNTVLGSLVFAESFILQEGCYLLLTKEIPYFDLWNCQNDYSEKIEKMKKRILWEPLGKQISDELPKNRIFVQTGRKSNYGFDIPIMQASYYMHELGLRIETQRLGWFILFFKEMKEIQITQKMNHTWLIFKVDSNITFNSISKDTIALEFTGDALEQSFFKIKNYFEENQIKYEYQVDIPAIFQESQIAKKQILNQQSQGQKLITMNSIQNEQFFISYIESKQLMILNQMKDLKLSAYKNLYEQMQISQAITPVENHIGVILVNGSYCSGKRKFAENLIRFGSDNNLRLHLYKFDLNEMSELTEKSYLSGLLKFASEKKIQNTDVIVASVPHFINTKILIDYFSKSEKISNAFYIRTIATKININNIYSNFNKNPVNNVFTYGVEGYSQFLLLDTYNNYDADVNALNKTLSGVLPGAKIYKIMNNILNPALAKDILTSITFISEQNNLNRLKYSVQYDLLTSNGPSSVVFIPFKLPILREKIRDLIYKKILQNGNQTLVDTIEAEQKIAEFKELNKNSKDPLMIEIIKLKEKIEIQNAQTSDQAIKIDYVKGILRYDSKLKEGLEEITITPNYFIERTVKGVDAKEFTEELNGVSFKNVKYTGITNSIINDMGFVFAGKNLNKEKLLELLYKLVKPLNKQKLRQRKDLTEEEIVDIQFRNRGEGLENGEFYDGQFWRNIQGLILPHHPKKDEFIEEYLKQEEVRINQINEQLQQEWETWKQVYDKIHLDK 1200 T 0.00092 cobW unphh F Eukaryota T 6zzx 14 N O A0A2P6THB2_CHLSO Photosystem I subunit O WGAYEEPLSLVAGFLGWFAPSNIKVPAFGNESLFGAFHASMLENLANFPQGPALTDKFWILMITWHLGLFLALTLGNIGQAARKQGY 87 T 24 YkpC pdbhh F Eukaryota T 7a00 2 C,D C,D L6F mutant of C-terminal hexapeptide from Guanylate kinase-associated protein XEAQTRF 7 T 19 GKAP pdbhh F T 7a0n 1 A,B,C,D A,B,C,D G0S5K3_CHATD Uncharacterized protein,Uncharacterized protein SMTSSGSSRDLFRALNSFIQTPTLPPPADLDAIISSYLERHDKPEEGSGDRLNDELLAIWDKAVQDHPEKYAAFVAVLRQLRPGLGAPARTFQWWDKLLDPVLDNATREKGLARSFMDFTLEILSSSEYDDPEAWGEEGFIPWLNRLLVRWMELRESRADFRPSTDLKEQVLTDALLAFGKKDPKGFMNALNAFVLRREHRNSAFSLLCAFVNSGPPHLYLILQTPLFGNILQSLQKDESTFTVNLALIALVMLLPFFPGDIVPYLPTLFNIYARLLFWDRDSYFAQQHTEMGENHGESGTDTPWDKVLLDPDYDGHSVPYLPEYFTILYGLYPINFVDYIRKPHNYLPHAGSDDDIDVHAAEIRERSERFRKQHLLHPNFYEYTIETEKTNITRWLKSEADEIIADCMALVVDRGTADESRPGVEIIEQVSLLRYQRHRLLNDLQYERFVRQQHMSHMGELRRRQ 466 T 1.9E-05 CCDC14 unphh F Eukaryota T 7a1i 2 B,D B,D A0A3L6LBE5_9TRYP FPC4 SSLSPYLRYLPSDVSGGEWDKPDVGDVLCFQAKEPQRRRVLTSPVPDELLIK 52 T 1.7 USP7_ICP0_bdg pdbhh F Eukaryota T 7a1s 2 B B TNKS2_HUMAN TANKYRASE-2 NLEVAEYLLQHGADVNAQDK 20 T 0.00021 Ank pdb F Eukaryota T 7a1t 1 A,B,C,D,E,F A,B,C,D,E,F CC-Type2-(GgLaId)4-W19BrPhe XGEIAQGLKEIAKGLKEIAXGLKEIAQGLKGX 32 T 0.091 WXG100 pdb F T 7a23 11 K S Q9SD78_ARATH B14.5a MAKSVSTAASSLVQNLRRYIKKPWQITGPCAHPEYLEAVPKATEYRLRCPATIDEEAIVPSSDPETVYNIVYHGRDQRRNRPPIRRYVLTKDNVVQMMNEKKSFDVSDFPKVYLTTTVEEDLDTRGGGYEK 131 T 0.11 CI-B14_5a pdbpercent F Eukaryota T 7a23 30 DA n UMP2_ARATH P2 MATRNALRIVSRRFSSGKVLSEEERAAENVFIKKMEQEKLQKLARQGPGEQAAGSASEAKVAGATASASAESGPKVSEDKNRNYAVVAGVVAIVGSIGWYLKAGGKKQPEVQE 113 T 0.01 IATP pdbhh F Eukaryota T 7a48 2 B B APH colied-coil XLEEELKQLEEELQAIEEQLAQLQWKAQARKEKLAQLKEKLX 42 T 0.0018 DUF5320 pdbhh F T 7a5h 59 GB G MRES1_HUMAN Mitochondrial transcription rescue factor 1 MAMASVKLLAGVLRKPDAWIGLWGVLRGTPSSYKLCTSWNRYLYFSSTKLRAPNYKTLFYNIFSLRLPGLLLSPECIFPFSVRLKSNIRSTKSTKKSLQKVDEEDSDEESHHDEMSEQEEELEDDPTVVKNYKDLEKAVQSFRYDVVLKTGLDIGRNKVEDAFYKGELRLNEEKLWKKSRTVKVGDTLDLLIGEDKEAGTETVMRILLKKVFEEKTESEKYRVVLRRWKSLKLPKKRMSK 240 T 0.002 S4 pdbpercent F Eukaryota T 7a66 1 A,B,C A,B,C Pcc2 MKIRAKVELTWEYEDEETAKAIANAVNVDNISIPEKLKKSLNLITFPDGARVVTKVKYEGEIESLVVALDDLIFAIKVAEEVLWSH 86 T 0.00032 Pcc1 pdbpercent F T 7a67 2 B B Pcc2 MKIRAKVELTWEYEDEETAKAIANAVNVDNISIPEKLKKSLNLITFPDGARVVTKVKYEGEIESLVVALDDLIFAIKVAEEVLWSHPQFEK 91 T 0.00084 Pcc1 pdbpercent F T 7a8s 1 A A sTIM11_h3 MDKDEAWKQVEQLRREGATQIAYRSDDWRDLKEAWKKGADILIVDATDKDEAWKQVEQLRREGATQIAYRSDDWRDLKEAWKKGADILIVSEEMARHAPKDEAWKQVEQLRREGATQIAYRSDDWRDLKEAWKKGADILIVDATDKDEAWKQVEQLRREGATQIAYRSDDWRDLKEAWKKGADILIVDATGLEHHHHHH 199 T 0.00077 NanE pdbhh F T 7a8w 2 C,F CCC,FFF G4MXW3_MAGO7 Uncharacterized protein METGNKYIEKRAIDLSRERDPNFFDNPGIPVPECFWFMFKNNVRQDDGTCYSSWKMDMKVGPNWVHIKSDDNCNLSGDFPPGWIVLGKKRPGF 93 T 0.086 TMEM18 unp F Eukaryota T 7a9w 1 A A RMD9_YEAST REQUIRED FOR MEIOTIC NUCLEAR DIVISION PROTEIN 9 MSHHHHHHKNVPKGVLDKKNGREQRKTEQNVFNVDPASPWRHELLSFDECVSSALKYSTTPLQNTYKRIGNNQLNKNPSFAMFWDSMGRAMELYYSLRESPDFNAYRVSRLIHLLHNGLRSTRDQLVKLSRKPDYDSQSFHKEMMNFLCNSLKDISDDILIGKVSVSGYGATHLLTSFKELSFDDDCIRIWEASKNLSDETTSQAFQEPKVVGFMLPLLYAKTRSLTEPNELYNQIIQSKEFIHPNLYSGLIKVFIKAEDYEKALSLFGQLCEKAEVRNYGYLIETHLSFIGDSKNLTLAESFFDKIINDEMPYKIILQVSTVNSFLQNIWKAQNDFDHVYRIWEKAVKFYGNTVNPGILSSLNNTFFTIFFENYINDNINGFRKLQEIITFYSGVKKIDEPFFNVMLTRASIWHERSIIDFIDKNYTLYHIPRTIISYRILLKSLGSIDNTNNEEILDRWLELVKKLNELGQQYIANADLSALRDATVVWSQSKRDEKVFSAKAKGTPATTTTTEDDIKVPKPLENLKNEDSTSNSEDRIELYLKILKRYTPYFRATKQVYRYTTGCAESYPILNEYLSGYSDLSAEDIPVPQLHSFIAKEQ 603 T 2.6E-05 MRP-S27 unphh F Eukaryota T 7aa9 2 B,D,F,H,J,L B,D,F,H,J,L pT13/PT15 SCOC LIR EDSTFTNISLAD 12 T 6.3 DUF2370 pdbhh F T 7aam 2 C C PTN22_HUMAN HEMATOPOIETIC CELL PROTEIN-TYROSINE PHOSPHATASE 70Z-PEP,LYMPHOID PHOSPHATASE,LYP,PEST-DOMAIN PHOSPHATASE,PEP GFANRFSKPKGPRNPPPTWNI 21 T 44 Ral pdbhh F Eukaryota T 7abl 1 A,B,C,D A,B,C,D CAPSD_HBVCJ CORE ANTIGEN,CORE PROTEIN,HBCAG,P21.5 MDIDPYKEFGASVELLSFLPSDFFPSIRDLLDTASALYREALESPEHCSPHHTALRQAILCWGELMNLATWVGSNLEDPASRELVVSYVNVNMGLKIRQLLWFHISCLTFGRETVLEYLVSFGVWIRTPPAYRPPNAPILSTLPETTVVRRRGRSPRRRTPSPRRRRSQSPRRRRSQSRESQC 183 T 1.1E-26 Hepatitis_core pdb T Viruses T 7acb 1 A,B,C,D A,B,C,D W5VVI0_CAPHI Capra hircus Cathelicidin-1 (dodecylphosphocholine) RICQFVLIRVCR 12 T 1.7 bpX4 pdbhh F Eukaryota T 7acv 2 C C Q9AEM2_CLODI S-LAYER PROTEIN MADIIADADSPAKITIKANKLKDLKDYVDDLKTYNNTYSNVVLEHHHHHH 50 T 0.03 DUF4458 unp F Bacteria T 7ad0 2 G,H,I,J,K,L H,I,J,K,L,M Modified p53 peptide ATSFAEYWALLXPA 14 T 0.49 P53_TAD pdbhh F T 7ad5 1 A A V5TFR9_LEPMC Avirulence protein LmJ1 GHMHDCHQVTVSRDVTLQNKERHDCNQVCASIDKETENKLNTDIIPRLTRYMSVKGNSIIARVQQSNSDPKCSCTWRAIIWRVYKAYDENSLNVALHVSHPNQQIGENPDWSLVISNPNVHCLKH 125 T 3.8 Antimicrobial_6 pdbhh F Eukaryota T 7ad6 2 C C K92 knob domain CPEGWSECGVAIYGYACGRWGCGHFLNSGPNISP 34 T 0.18 Toxin_4 pdbhh F T 7ad7 2 C C K8 peptide SVCPDGFDWGYGCAAGSSRFCTRHDWCCYDERADSHTYGFCTGNRVENLYFQ 52 T 0.098 DUF4716 pdb F T 7ad9 1 A,C,E,G,I A,C,L,E,G AB140_YEAST Lifeact MGVADLIKKFESISKEE 17 T 1.6 Antimicrobial_8 pdbhh F Eukaryota T 7adj 1 A A A0A7Z7PMS6_MYCMC Putative immunoglobulin-blocking virulence protein MGSSHHHHHHSSGLVPRGSHISFDTSSNGITDAELAPINNAINDAIVSNRDNKLKPSEEKIIKETEKKIEEKIIIPPAKKEEKIEAAKPIPKPVVRKPETKITSPKITRRKQTITIAGIEVEAEIEGPPGFVTHQRDKDRKISNPTKPYQNHTVNKILSVKVTDKLKEQVAKDALSGGNGYDEGVGLFNNSIFNVFKEEFNSGKELNDILSSLESVARQNSGAFQNTLERYKKMLDSNNVINFLKSEAQKEYPKLKSKFQTKNQEYIWLIANLDQSKFTKIASTSEKYLEKGLTISPRSAFINEAGEIDSNGWGPPDEYNTVTSRLRRDNSEYRVFDYDEYYSRSSDRIANGTYPGWVKEDVSEPYSKKYNFKASDGIRFSKLERINPNPAKGKLNSGLVLDLDVSNDEAYRRSKELIEKLQKDGEQITSYRIKNMGEKNSDQAFKDILGALPKDIQQLELFFSDKATNTASLIALENKNIKELSLYTSGNSLKKAWSYNPLALRNTTWINTIDYNVSAEYSSHDKITTRITFNTLAFDQEDFSNGSYERINDGLRMVYYARNNEPFFQGGHGPGLEPDKKLGQNSYPTGLDFSRVTGIKSLKGLRFDDDLDTSNEPRKITELTLYNNESYFEISSDELNEANLQHLSTGEGNPEKPKIHFSNGNNTTSIRISGKTLLSDEGRRNLDKYFEYNESLRNSGKQIQIPNGSDELKKQLEGWGYKVSTASDRSFT 732 T 0.02 DUF3403 pdb F Bacteria T 7ado 9 I I EMC10_HUMAN HEMATOPOIETIC SIGNAL PEPTIDE-CONTAINING MEMBRANE DOMAIN-CONTAINING PROTEIN 1 MAAAASAGATRLLLLLLMAVAAPSRARGSGCRAGTGARGAGAEGREGEACGTVGLLLEHSFEIDDSANFRKRGSLLWNQQDGTLSLSQRQLSEEERGRLRDVAALNGLYRVRIPRRPGALDGLEAGGYVSSFVPACSLVESHLSDQLTLHVDVAGNVVGVSVVTHPGGCRGHEVEDVDLELFNTSVQLQPPTTAPGPETAAFIERLEMEQAQKAKNPQEQKSFFAKYWMYIIPVVLFLMMSGAPDTGGQGGGGGGGGGGGSGR 263 T 0.002 2OG-FeII_Oxy_5 pdbpssm F Eukaryota T 7ads 1 A A A0A0R8HV90_ORFV Apoptosis inhibitor GPLGSMANRDDIDASAVMAAYLAREYAEAVEEQLTPRERDALEALRVSGEEVRSPLLQELSNAGEHRANPENSHIPAALVSALLEAPTSPGRMVTAVELCAQMGRLWTRGRQLVDFMRLVYVLLDRLPPTADEDLGAWLQAVARVHGT 148 T 0.029 VMAP-M0 pdbpssm T Viruses T 7adz 2 G,H,I,J,K,L 1A,1B,1C,1D,1E,1F A3HTC3_9BACT CAP ADAPTOR PROTEIN (ALGO2) MQVSSSFRSFLKLDILHSYFLNDGEKDFSSMNEEESKTQLKSYNWKDFLEIYPSQKTSHMMRGNKIFFKSFNDSIILAIKVESGTENQPFNELYEDESMTFLLSLKDQYFGNYTDLDLADQLLYFSNKTPVLPEAFTFKPIDRINQSGTVGEEYLYEGENKKHLLEEAHLNPGGGVLGIIQIYMKGDTPVLSLINNDGTLKNSLPHFKIHFSNRKSTWKYINLKDDFETETKKDYPLTKFGFILLDKKSDFISPPAHFEKYVFPNPDARRIKITPTKNYSEIFI 284 T 0.12 PanZ pdb F Bacteria T 7ae4 2 G,H,I,J,K,L a,b,c,d,e,f SHDD_SEDHY PHENOLIC ACID DECARBOXYLASE SUBUNIT D,PAD MKCHRCGSDNVRKMVDSPVGDAWEVYVCEKCCYSWRSTENPVVMEKFKLDDNKIANMGVIPPIPPLKK 68 T 0.00011 YjdM_Zn_Ribbon pdbhh F Bacteria T 7ae7 2 G,H,I,J,K,L a,b,c,d,e,f SHDD_SEDHY PHENOLIC ACID DECARBOXYLASE SUBUNIT D,PAD MKCHRCGSDNVRKMVDSPVGDAWEVYVCEKCCYSWRSTENPVVMEKFKLDDNKIANMG 58 T 2.5E-05 DUF1936 pdbhh F Bacteria T 7aeb 1 A,B,C,D,E,F A,B,C,D,E,F A3HTB3_9BACT BASEPLATE PROTEIN (ALGO12) MSTLNKHISIPKDMSSKDDLDFHFLREEGIRYIKELGSNFWTDYNTHDPGITMLEVLCYAISDLGNRINIPIEDLIANEEGGVKGQFYKVQEILPSAPTSELDLRKLFIDIEGIKNCWIKRERVTVFADLKNQKLSYEKTIWEDLKENQKAQFDLKGLYRILVETEDADKVLSESLEKAVFTKFHANRNLCEDLIKVEKVATEPISVCANVEVAPEADEELIHAQILIAIEDYLAPSPRHYSLKQMVDKGYTMDEIFEGPFLENGFIDTVELKASELRKEVRLSDIINIIMSIDGVKIVKEITLGNCDENDGIENNQWVICIPENKKPKLCKKTTINYFKGILPINLNPVRVDNHKSKILASRLENDLKAKDDLEPAIPQGTFADWGEYSSIQHEFPETYGISDIGLPPKLGVKRAVLARQLKGYLLFFDQILASYFEHLSKIKSLLSLDQGPSFTYFTQAIKDIKDVEELFKDPTLLENDEELTKSLIGKLDDTIERRNQLMDHLIARFAENFSSYAFLMKFLYGESTDEIVLQDKQSFLREYKEISRERGEGFNFYEQSNDNLWDTLNVSGAQKRISKLVGVKDYSRRNLSDTAVEIYRYEHVDGNWVYRWRIRDENGKVLLSATTSYPTYNSAGNEMYFAILKILETPLSDLEKLLEVNFRNENEAGSFHFHKAATSNKFSFDIINPVIDSESSSDFIVAKQYTYYPDRTQAVLGAISLLNFIKYTFTEEGIYLVEHILLRPSPLDPEYLAMQTDAGKEYIEGNFLPFCSDDYENCKMIDPYSFRVSIVLPGFTYRFANKDFRDYLENLIREELPAHIVAKICWIGYRKGEEPELFQEDVENPETPIFKENQLEIFEKAYKNYLFELTDIHKRKGFIASMNKYNQVLNEMTSSLTGLHTIYPTGRLYDCEDEEEELDGKLILGKTNLGTL 933 T 0.00032 DUF276 pdbhh F Bacteria T 7aed 1 A,B A,B D1LHF8_ENTFL PrgL SVGQRKQVNTNEKQVKVEKKEELTTSTVKKFLIAYYTKKDLGENRNRYEPLVTSAMYNELVNVEKQPVNQAYKGYVVNQVLDTYKIYIDTENNEVIVDVTYKNTQRTKRNNDEGALKNQSNQEALKLTFVKQGANFLVDKMAPVTLTNELQEEPNSYNTHVVTTEESAKESANSGEKLEVLFQGPHHHHHHHHHH 195 T 0.00049 TraE unphh F Bacteria T 7aew 2 B,C CCC,BBB AMPN_HUMAN HAPN,ALANYL AMINOPEPTIDASE,AMINOPEPTIDASE M,AP-M,MICROSOMAL AMINOPEPTIDASE,MYELOID PLASMA MEMBRANE GLYCOPROTEIN CD13,GP150 EKNKNANSSPVASTTPSASATTNPASATTLDQSKAWNR 38 T 0.069 MacB_PCD unppssm F Eukaryota T 7agw 1 A,B A,B KHTT_BACSU K(+)/H(+) antiporter subunit KhtT GSGLNIKENDLPGIGKKFEIETRSHEKMTIIIHDDGRREIYRFNDRDPDELLSNISLDDSEARQIAAILGG 71 T 0.067 Imm40 unp F Bacteria T 7ah0 1 A A 4D2 GSPELREKHRALAEQVYATGQEMLKNTSNSPELREKHRALAEQVYATGQEMLKNGSVSPSPELREKHRALAEQVYATGQEMLKNTSNSPELREKHRALAEQVYATGQEMLKN 112 T 0.055 Cluap1 pdb F T 7aih 4 D D Q4QI77_LEIMA uL10m MFSRGAAATAMAKVSRLVSPRLRIIHRDYLTRRGGRTHQRCSAVAVDYTPTYFATYKSDPGQCPRLIDAEAVHGDEQAFWSARRDFYRGGASRSYYPAWDRQAQALIMLTREVPRIPQEAAFRLFTLGLKMMLLPRLVAGVELMLPSWVTMNAESVLNEGLEGKVAEADGDGKATGAAADAAALPSASSAGANEDSGSANAEKR 204 T 1.8 DUF5783 pdbhh F Eukaryota T 7aih 10 J J Q4QCY7_LEIMA bL19m MGYTRERTNRHFFVSRANAFFSRLPISRIQRALAMEAIKKGSMKPWKHTKEQIIGSPITCNFEYNPRPVRLIGTVMDAHTEETSIKGGLKVYARSEEANMMLWIPAGNPKLKYEVTAAKGSFEHYLDERSKWDEAWLTGRARMK 144 T 0.074 Endonuc_Holl pdb F Eukaryota T 7aih 16 P P Q4QG34_LEIMA bL27m MLRITPSRYASKVTAGNAKNQAGSPRQKAKLFHVIPGTPVTPVEKLKEQRRRFGQDRYSRQPEYRPGRNVRMDPNTFTLYATTKGVMTIRTSRINPSYKWLDVEPDIQKVYRSRCMRAALQARGKASMMVAGNVHYRAELDHVTEPHWRERVMRVPKATERFQDPNYFTRGLVPSLRPLSRYSYE 185 T 0.31 Ribosomal_L27 pdb F Eukaryota T 7aih 17 Q Q Q4Q719_LEIMA bL28m MLRQSSLLCFSTFALNPETSRAPHGPPRGLINRYISMGLPPWAAWCNRVNRHALYRMSDVSPRSFLPKAPHEMDVIWMNERVRERVRTSRQVQHVYRQLKYPFVKTGIHYSDTLDHWVQVPMVEAAMFEIEKDGGFDNFILKRSGPELRSTYGERIRRHLLVRQKETQKNFVLDQQAKALAEVTQAELMKATSEEELDAVLAKYGMDAEEFKRLMAKRVMEQRKSVAAAGLRSK 234 T 0.012 Methyltransf_5 pdb F Eukaryota T 7aih 19 S S Q9U0Z7_LEIMA uL30m MRRCVMAKGEDPAHVAGWDDRQDAVEWWWTEANDSRGRQRLEAAAAVAAAAASSTVGLPLFPRFSPGRRRRRRPPAPPPPPPPLFLSRHLHSMPWLWCTCVKMQMYYTPTALTCPLSNSLAAHVGHIIVGVAALLPYSMLLFLTVMCNPRKHEPVLRAQRIRWLTFHSLMFRLLRCITASPAVAASVAVAAAQTPTSLRPAAVCRRGVHLAPSVLAASAPPPPPQQQQQPTSAAVPASTATSTTTIAAGPYRRVGNVFIVTCIDHPFKFSWEVNRMLRELRLEFMGQTTVVPDIPPVRKRIWRVRHVVRVDQLDLDEAKALIGIPEHISFRDLAGQIPPTFGRGGSVANPHMRSKMNFMRLRRMRLRDVMHRDQLEKRLLEERHHALQQQQQQQQGGGEAAAAAAATTA 409 T 0.00017 Ribosomal_L30 pdbpercent F Eukaryota T 7aih 20 T T Q4Q2W9_LEIMA bL32m MLQRTTLRCYSALVGQATPVLLGSKGGTPKRKKNPMQLRRKTYGLHFKERYLKLEEWYFCPLCAEPKKQGEWCRREDCRQIKP 83 T 0.12 Metallothio_Pro pdbpercent F Eukaryota T 7aih 21 U U Q4Q2Q8_LEIMA bL33m MFRASCTLLGHGQYKTRLKKRMVGFIPKVIPRKIRNNMVALRSEANTGHMEGYIKTEAERLDATGRKLQKTMWDPVLQRYTLMKETKVRGPFLTKSNIARKVDFPVGALHGTKLGGKK 118 T 0.0038 Ribosomal_L33 pdbpercent F Eukaryota T 7aih 22 V V Q4QCK6_LEIMA bL35m MFRISLICFPKAGCEEITRQGRRVVLKPQEYFAQHRMQVWQMRFKEMGPPFSRVWVALGGKMRRRRIGRQIDVKDMRYYWRPIEPQYQRLYMSRLRIKDHSNKRVQPMRLRATNNDIGQASSLKEWERSSDRKYGAALAPPKKRDFEFRVF 151 T 10 Gln_deamidase_2 pdbhh F Eukaryota T 7aih 23 W W Q4Q6A3_LEIMA bL36m MLQYTSSARQALRATALVLNFFPLGYTCGPKNKQVFFPPNNLDGRTTHQMKKLQGSTDKHPGLVPRDKLKLHCEFCRFHWVQDTLVVRCAAHPKEHNQREIWLEPTWTWGKQQPYQYYKYMPVNINPRTGMPLAREDAKGMNNERRSQGLPTKTRLLERERRGISRAITGLGIYNQRWQTRFPFAT 186 T 4.8E-05 Ribosomal_L36 pdbhh F Eukaryota T 7aih 25 Y Y Q4Q448_LEIMA mL40 MWTLSRPCLAAVRTAVLCQKKQTAAGYMASAGKVGNEEKWAQAAMEYIHEKNHVNDARKRQQDVDQERSIANAYDRYSAVSEAKFDERLSRLIARMSEALEEMRNLGLEEALEEAVLLNSEQPPGHYRRPSLTPPLAGYEPGFGLDVPQLRSQQAEYPPLRRPTDWLEFGEGGADDFPYVDTHKIEDLTAKHEAQLEEQHGVLREAAPLTGVEGEGWEAYVALHRKALARQHLIMDLHNDPELRDKYNADEAFRAAEWERRGMGALSIEAPLERDLELHYAQVPAYEAFRSH 292 T 0.00047 MRP-L28 pdb F Eukaryota T 7aih 26 Z Z Q4Q152_LEIMA mL41 MLWCTGPRRIVFHNAPSVYPFTKPFHDTPYDQDRGRFDKTKNILRENKWPAWMDHGADGTGFGIGLNRTHPLSKLRGNLRRNPSEIPRVLNMMIQGVWHKSGNKLYFRGGKPPNPSTHPYLTGEPCPVYGWKVTDPGVIREFNLPQPEDKTRYKPYVALQERKIMGMQAPTKEHSAASTSAASTDSKPLMKRLFFWK 197 T 3.3 MRP-L27 pdbhh F Eukaryota T 7aih 27 AA BA E9ACP5_LEIMA mL94 MAQWIPKTAWKVSNLNKRYGAPYVAKGYASLDPRCSLDAYSSFQQTVTSADMKKALLSIDSTSSGALVIDVRSEPERRLRPLLSPAIVALHPHDILSGAACPILPSNKERAEMFVVASEAQRAVNACTALRRWGFSRVTAVSVDAVSEAIAAVQKPADAATSSSTKS 167 T 0.0043 Rhodanese pdbhh F Eukaryota T 7aih 29 CA BB Q4Q4D6_LEIMA mL95 MLHRSCVLVDSFKEHYHRVHLPRRLALQRYIKREEARLSRHKGKAVAAAAAAGVQPGEVAYKYNRWWVSNDHEFVHQFAFVEDPDVTREKRNTLPLVTKENIWKEPQQTFFLPFAPFVRVVDYAKDPDTKFLKPVNIPRWKDYMQRTKPIVPRTWY 156 T 0.14 DUF4653 pdbpssm F Eukaryota T 7aih 30 DA Aw E9AD00_LEIMA mL89 MSSGAVGRGSFHSVVAGANPRRIPTYYNSAYELIQLHRAHREVTRNFLVRDKVFDNKFPGCSLANGLFKMVPNKRGNFHTRELTESIRHRTIWGQRIQQQRTINAAILEDATKVLSPAQMEDRFSYRTPDAAAYFSPQEYTAANNWPNYWQHPTEKHVVPKPRWRREPELGGITRVRDAVATPIADY 187 T 0.092 DUF3295 pdb F Eukaryota T 7aih 31 EA Bj Q4Q0E1_LEIMA bL31m MVLNKWAAVTKSAPPAAGLRPLARTVSPNPKLRPADYKVPYVLRTFIKDRHSSEMQHIENRGMYREELAIERSRFPRMQKTLTIQTDGSLNEREFEFAVPPVVMLFQDRLSAHRQRQVALAKIGKLKRVKSWETSVRGKESLNPVCNALVFPYCVPKKMLVRPRIVDPLSAKSMADNRRSRDDPS 185 T 11 RNA_pol_L pdbhh F Eukaryota T 7aih 32 FA An Q4Q0F5_LEIMA mL76 MLQCTALVLKSQHKNVLRKGRPHMQKYKELNRWQREAQGITKWEQGHSHRPQPYVERFNPEGAGLTRGTSAYAWKWWHTQYPWLPNVAPADYVPPSPRGIRPAAWDDEFADVVLSMSDEEIQSYLLDKLTEVIFAETQRDGYELRRLDFEGKPLTELPERRIIENFVFEEETLRERVLDRVVEGVFRLVPTSTDRLELKSVANIIDFVLTHVTVARKPLQHEIPEAARTVMRSHPLQPQLGFVHALPTDNRDAVVQEWERMHHLDWQFGKAVYEPRSAENERGNLTWLREVRHHEAREAFQADVDSGEARRRHMAKIKAAAQVPHTGTTSQ 331 T 2.9 LRR_1 pdbhh F Eukaryota T 7aih 33 GA Al Q4Q1C8_LEIMA mL74 MLSSAHRAAFARPTATLWASARSFGAGPTRLLLGLEQVQDVPTSTDRKPTGMHRGPGKRQTAPKEAAQYQFIKKWDLQMRETWDELEPFKGLPKPKVQFGNEAAEVIWPYALLLENVIKVHPYTKSIYVYYSQRQSTPLGELAARVAKRVSQAYLIPITFHNSHVYVEAEMLLEYSETPWVVVHCLDGTHKLIPVKPQAGQTVKEGAEEVLNGIVSACNEIGSAVKNPKEVMRLLSERPLQNQYVRVNYQWYGDTPEERMSHLVKWDYEPEEVVPQLRNRTQHVLDWMNYDGNLPTHNSVRVNIHREAARMRKPNVSAGPKTFFNSSGSRANARTARFDNSRSSQS 346 T 0.011 L51_S25_CI-B8 pdbhh F Eukaryota T 7aih 35 IA Az Q4Q4D9_LEIMA mL93 MLRFTQVIRKNPVVFKQGQGMFSHQLKRILNKKSLHKYNWDPLHMYDPRKLVHANRYVDHDTYEEKYDPHWEHNAHLVPDQQFYNIPVPKEYKDAYWWRDLQARRVQCPTEWVHFRMHTKDKLKYDFQDLAFRKKFEYSYEDVVANAKDMCS 152 T 7.6 Pox_VP8_L4R pdbhh F Eukaryota T 7aih 36 JA At Q4Q4L5_LEIMA mL86 MRPSALCLGGFTMKYKRGTGLWDEDHVNDFDANKYLSARSTMRWYYGMERLQTRNNMNARRATQSYNNNMGLHHSGRGAFERELERRGIQVDKYPLTTTTGAARVAEMVLLRRQELEAHAKKAMESQRQARRRDAPSEWYDETEGPLNPRFLASMQSNYTQVITELPSSPVTGRRELPGASFA 183 T 2.3 DUF2663 pdbhh F Eukaryota T 7aih 37 KA BC Q4Q5D8_LEIMA mL96 MNDIYARRLAQTSMFHQLMRSHGTLWAATQVTKEKLNLAFVKEEMMRVNGRRAMPLLIGAAANENLNDTHFTHLTEHCAWTESARAFAVQRQTPLTQHIASMGRMAETITQAKTASTSQLLFNEHLARIDGISEFEEEPFVDDEDDS 147 T 0.027 Chloroa_b-bind pdb F Eukaryota T 7aih 40 NA Ap Q4Q7V3_LEIMA mL80 MQRCLARLFQAGVHTPHGSRYNAARMKNWPVQEVPQNFNFTNEQRFKAKAVPRDTGKIPRDFLLSVLYRNQPCEVASLWEHCLHDPQIVLDSKRHLREVLQQARAEGFVSFEKDAVTDRWVCHLTRERFEEVRALVGARAEAQDLYSGLRGASATETSAYSESFREMNEDTKREHFRLLSEQVADTTTHLRKFQRMEMDYLPYTDLNGKVNFMWWYEMSDTRDATALPEAAAEGSPKLSE 240 T 0.15 Gluconate_2-dh3 pdbpssm F Eukaryota T 7aih 41 OA Au Q4Q8J6_LEIMA mL87 MMLQHTSLLCRKALQSYPVPPRARNYERRWSSSRTNPYNRMFWRTVLNEDFARPSFWVSDFRHKYLAKHGMDYQGRVPASPAPGMYQGFSDVHKILANHPKPQRESRHLPVMPMTPRVVFEHAQEKRIDYAKKMHRDRRLVEQLRTHEFWGWYMKLQRVRGRWCKEHGVSSRGVYGPAVDAAELWG 186 T 1.2 Crl pdbhh F Eukaryota T 7aih 42 PA Aa Q4Q183_LEIMA mL42 MLRLTQAVLRVQSHQKKRAQHPNAGTRFGRVYNRGFVRYGFGGFGMSVYSSKKDRTFKVMPVPPPPPATTAVEQRDDFADNRGLSATTRTLSPTFRMFALEDGGVLVSHPSHAQIMRWNQRVHTEEGKAANSTVMDEYVNSRIQAIIADNTIENTSLSQWRKAHMWNVIKSHGKLQRRWGTPDFVMGARSTLYNN 195 T 5.8 RHINO pdbhh F Eukaryota T 7aih 43 QA Ao Q4Q547_LEIMA mL79 MLRTTHVSWASTAKGYMNRVMVYAHRRRKARYLAPKNAHVRSPLAHKMPEEYGNTWDPRSGVEWHNRMRNRNHYRHWPWARWTDDPVRFHQDSVCHRTVSALSTVANNGAPEWDYYAEVGQAYETPSHFPLSYTAPFIYQYTAQCWSREDLQSYLERIEQSSGLRTIADAASRREALYTWWHNAGMNVIPLGVLQHLELVSRDIVAQNARKSYRIEQHERGILRTPEMERYYALPHLRGPSMPVQLAQPSGKYPSGKFTQMMEDVAIHPLQKPDARYKHNMYPA 284 T 6.4 BNR_6 pdbhh F Eukaryota T 7aih 44 RA BM Q4Q703_LEIMA mL70 MSVFPGLCGDVATTNYRVFLGTLPNLAVEERFLRQVQPVFPWYASRKHVKEQASEFLEIDLASCDPELLLRYTHVYYVRRQLYDELVDRQLTLMETGKAAKVADSALLTCLAQVNAAITPRLQYELHLLQQAKKACRVPRRRELNPDAALEAHDYLCMMRVVEEDVGGIPDAEMQARAYLPREVLEAKVKELAAMIFGDGGSATKGTGAALERKEQKLLQRMIPADYNKVGAVEKLRPVDVTALYRFTGERVCGRPADKPFARALWGHVFRKVGSHPLYLQRASLYWARHSGLDPQSATSAMPADLATAVCVQQALFPALKYRCQYLYTSPDIARQQWRTGHVVPLLRLFPLLGAPAAEDLAAQLVVEGEWAKLGIEADTNLLHDTVLRQLKDMVEQVSALYESDAGAVLKRVEDGAKVLCPSLSERESLTMRGAPEDTSREVSAAAAARVANAAPA 457 T 0.37 DUF4911 pdb F Eukaryota T 7aih 45 SA Ar Q4Q712_LEIMA mL84 MLRWSRLLREMAPELQLEYIPIIFTRTILGPQGGFAGEERLIKREVAQKYMSEGNAVTPSAEFHQGVWCYNPDSEQYDRFVERNAEFLDFAARKRQWLDVYWRVNTGYLLFGRQSWGQGWLLNCPLRKKDIAQKLWEQYKVRVDPRLIEFREKDRRTGIQDLGHNWCWLYLPGAEELAIDREVYDNKRVKVRMHIRKMSSYGALY 205 T 1.8 Ribosomal_L9_C pdbhh F Eukaryota T 7aih 46 TA Aj Q4Q728_LEIMA mL72 MCTRVFDVSQRFILLVSLPPLSLSLRPSCCSRKAATRRRRSTASVLLATDALSLFVRTHYPSPLLFLLLPNFPLPIHSPRKQMRAAVGAVLQPSSSVGALRCQARFITRLYTSYFKGELFPNQLARPLERLPRGVSLAAARKGQQAAAPSSSGSGNPATTASLDVVTWGDVDSTDLVHANEQSRAVAPQAGLAPRRPYVPLGEVAKLELQGDYLTEGGLHQEALEYYGVVAKAYELAYPKDHPQVAGIRLKLAGAFRRTGRLTSSKANCEAVLQMLDSAVQPPLELIVEALFELGLTSEAMSDAAAGTVFEEAVALVDMFHNSGQSHKMLRLLPRLGRRFNLNFEEKFVYFSPFDYDRVFALADQCLERAEVFYQARNDRAGVMRVLQQRKELIDKKFFNMRDFAGRIHTMRGHWKRRAQVLTNAPTPDELLRYSPTIHQVYRDFKYELNAPIGREKEVQPGVNRVVHDMGNPYRRSGVRSQRMFRDAEKNFEKYIRADAFEA 503 T 0.0013 TPR_12 pdb F Eukaryota T 7aih 49 WA Aq Q4QD92_LEIMA mL82 MLYGGSRVQYLVQPPFTLHKIRSENLPPPSLYAERHDLGLEMQLPRDMHVYNSINMAIQRQVGGDSSTLDGEQQQLQGGHADGFDMGAFFTEQHHPERHHNSSLPYAKHDTNNVLAMRLFPVNVGVRARTEAIRIRTDDCLQRLRDADLCAKMRLPLEHPLPLSRRSQYAAIHRVRQERCYDAPTEAAGERAAAAAEEASRTAHLRGAAAHPPPSELSIVTRPVDRLGSHSGSSAAACTADADHLSFPVHPFAAAAVSSGCHSARSGSAARLASQRWLPLQTLKPMGHNWSAATRSSGVRGPHMQLMQERLDQKGFGWKRKSRSLWQQDVATAGFRPHRYF 341 T 0.21 T3SS_ExsE pdbpercent F Eukaryota T 7aih 50 XA BE Q4QE16_LEIMA mL98 MLGGLRPLAAATRRTVGGALVSPALITPSRALSVRTEDFFSKEAVSHARRVSWAPHTTEKKVGAFAKLSRSNFNDPLPVSFQSEPYFEEEIEAYRAHHRPDVYVYKYNVSPTHLSLRE 118 T 3.7 DUF2975 pdbhh F Eukaryota T 7aih 52 ZA BP Q4QGE0_LEIMA mL52,mL52 MRRRDWCGVCLPAATLHALARRYSEYRSSYTGARSAPWAAPEAAPAYPSARSPFPLERPRFRKTHIEWMLHHGHGDRYGKYGPSREIADFEYADGTPSSISGKRFALKHHQDHLLVQLIRSAAIVERFEEEELLPRIPGTPEQRSWDPEIPLFLEDVDEFGRPPRPVAGNMVARVIEERFAQESGRTPVNLANKHAGEVLEPNTMFATYDPAAFVSDDIKKDVRRPFWSRRRWALSDNFMVPMSPKPKNTIKDE 254 T 0.0014 MRPL52 pdbhh F Eukaryota T 7aih 54 BB BF Q4QIQ1_LEIMA mL99 MRRTVRALYNSFERGWKDKTVHPLDRRGRFNLDEAAAELQLDEAYVASLYKPLHYTYSMKGQRYPAEQGRTSRPGSLAASRDRMFPLYRRNYKLNRELRVLDHRRISTD 109 T 0.52 DUF6416 pdbhh F Eukaryota T 7aih 55 CB Av Q4QIT7_LEIMA mL88 MFQRTCTPRLLACTSALLKRSGKPSDLPDYKQVYLPYDTAPTKTELDRERRKFMHAYSGRMEHRKMVEVKDVPQNMYTYGKEGMSIPISIFKDQADPVIGPEWTYPGIFENKIVAQHWYMEELFDREKSNTFESPWQRQVLDNQVKRRLGKVAWRMSMLNIKTIDIFHKERGASKRPGAGDTKAPATPAGKK 192 T 2.1 Ribosomal_L37 pdbhh F Eukaryota T 7aih 56 DB Af Q4QJB6_LEIMA mL63 MLRRSPVPRRYRTAWRELLHPLPVWARRQQWLKRDTVEMNEAILREPYYRIKTFAQPAAFVSPRVSESAAHEPDTQQSSRYGVDRQLRGPRRAVSPERLQELREQLQFVGSIGPKVPPAAGAGTAYQDEYGTRLRPRYPQSWDTVPPHQPSRSEI 155 T 28 PsbP pdbhh F Eukaryota T 7aih 57 EB As E9ABZ5_LEIMA mL85 MRRLPLFCRRPSRCCGATASGSGSSSAAVLAASAAPSVLVLAARGIATSGRVTNEDRRWWLVHLECAPDVTPGTFVSWLDCCGTHTTKKLIERNIWTIEQVAELDSDRVDELKYKEGCLKMDVVWEHARTIITPLKQREVSGGVESQLQSRILELRKKRELERQRELLARERATVSDKREETLRRLRESVAAKKAALRKKLDEQHGEATPAASESASTEAHRGTAEAAVEDEAVGNIVDRMSGGNPPRA 249 T 0.39 OmpH pdbpercent F Eukaryota T 7aih 58 FB Ae E9ACG2_LEIMA mL53 MTAPASHYTFANLKKLGLCAPQVALSRQPRLRPHVGHLNGLVYPLPYYAMWRGNHDKYTYNQATPARWGEGNTNTMYHQHYAHAKCPTDYGRGGREFQFLSVKRGKLKRKPLPTVQYVDPNSKPQWVFKSWHNPLSAPSMWEREVQYPEHTPAHTGAKRPLAVVAPKTSHKHLFLMHMEKVTVTVSPLLFGYGHTLQKAALDFYRRGLSARSPFPSDKMFLYYSIDHITPKIEVTWLDGSVYVPPLIEGVKAQDLIQMVMEQAWLAADRMSAEGRVLNPIAIDDYKWEQLIAFKQKRAKGAEAAKGGAKKK 311 T 5.7 MRP_L53 pdbhh F Eukaryota T 7aih 60 HB Ah Q4QC45_LEIMA mL68 MHLHISSIPHRNSNNSKGGVLDATGPMLSAKRGALLLQAYHRPGEVISYKAGDYHLVPKKFTVGKRIAVRSYLDRNRTELSDRTFMPQKNWFRPYDLQDGCFDRDHERLSYRFYNLETKVIWKAFDTPELIGMLLHDETVKGNSGMYAPDMLDAALHYTREARYWRCIGITKPFYDRNTLRAHCWEDNGLQVGTLVMSQAMRHALMDLERAVRRKELGLEPNYLWDRWGPIGFIDGARADYLPRFEHNPYVDPDGVDVTEIDVLPFNTHEQIRERYRDFIEPDTAPFEEVFRSPSHGSLTTLADIPNASVVALYKDLKLKAGTPVAGDAVELAPADVRTLFYLSANPEWRAVADGKASWEEVVDAMQPVQAELDEKIDAARLLQNTRHNAERVRAFFEEKCGFHDFMYTPDKTITAAVLCYLTELRRICTETAWGAALAKCLTDMERVQGMGRDAFLVYRHIEDAILDKKRRLWAGRFAGESHEESTLDYLLENFGRRAERPRNVGTTGVEFDREQEPIGRQVQRRVLDSDKANKLAEIRRSRGKMWSKKRSVFDALHEKQLQNFNYGVH 570 T 3.9 VirE_N pdbhh F Eukaryota T 7aih 61 IB BD Q4QE11_LEIMA mL97 MSNRFFQKFYLRCGNCSAIQRSAQGYQPIANPILFKSDEHCRNYHDEQRRAAGYSGMVVTCRCHRCERVHSNWKVLDAQQFLDAKLRMTPEERAQRLWVSKS 102 T 0.91 Mu-like_Com pdbpercent F Eukaryota T 7aih 62 JB Ay E9ADN7_LEIMA C2H2-type domain-containing protein MLRIGRTLLAEVTTINSTTASVSGRLIRIRKKSKWIDRRSTRVPHNGKDIWYFGDQPSCALCHIRFRYKQDYEAHKESELHVNRLRWVETMNWWRETGEPAYLKASNEQWEWFEQHVLPTKAQEMGCTLDEARRVYRQAIMTETPTWHRPLQCPTVKQEVQEPRDQRWPASPKW 174 T 0.0078 zf-met pdbpercent F Eukaryota T 7aih 63 KB Ag Q4Q829_LEIMA mL54/69 MAFRGSSARLAATPGVGIAPETTPVKYVPEMLNIQNAKWWNGRGKPVYRSTYNEKSWLEKARWGAFTKGSRPVMRQRYSAAALKEALEMVPEGFETCDVPRPPQRIRAQSEGVVGRWYTNYWTLHSVRYQCQLAGVEWQFGERQRPRTNYDEPHMYTDFEETKAIRDYRSRWINVNRSLVGMSRRMKESEEEARYLHFKKVQDTFWSNRKVLVNRIKSMHNQGTLQSAKDLPIKTINIKAFLAE 244 T 0.036 DUF1672 pdbpercent F Eukaryota T 7aih 67 OB BG A0A504WW14_LEIDO mL100 MLARYLDPSVHPLRVGQVVAYDYLHAAKTWQWTLGTVREIKDYTAVVQQWGLHTGDIDTLRSILLKEVDTENGRMKNYHDMLAIAREKLASIRRSNEDRVSHVRGHFDKAREKVELIDEVDLRKVTAQAAPSPVAVAVLKAVWAVAKCDPTAVEFYEWADVQLEYRKPAALDEIAKTDVLAKLYPSAESLQQSLEQDPKLNYKAAARDSPVVASLHAWVITALAYQQAYNLLAHDKRIQEQNDAIAAAIAGMKACRAKIAKLKDELSSKDTAALPGQVTSFTRTSVLVTIPLSAVISPVNVDTDVKRCVLTKDEVEQIPIDAKITRYAQKQKLAITGSHLLDQYAAATTTHIYVTELEDRLFFFQHYMASALRDAQTAAVDAHQRLAVSLHELEAFRQKRHDAKKARAAEPELADADGVEPSSGPTSSRSPTGRAAPRGQSAAPRGTASQQHKLLGPAYQSIDPATIANEPLYAVTIEEYKAKDAAGERAMDEAERMADEVQRLAVELEDAKAAADKLAEELAAKDEELAAHRQKRHDAQQARASDPALAAADAVAPRSGKGAASPHVGAVQRQAVDPATVPVAPAVIAEEPLYVATAEELQHVRDFADQLAEELEAFRQKRHDAKKARAAEPELADADGVEPSSGPTSSRSPTGRAAPRGQSAAPRGTASQQHKLLGPAYQSIDPATIANEPLYAVTIEEYKAKDAAGERAMDEAERMADEVQRLAVELEDAKAAADKLAEELAAKDEELAAHRQKRHDAQQARASDPALAAADAVAPRSGKGAASPHVGAVQRQAVDPATVPVAPAVIAEEPLYVATAEELQHVRDFADQLAEELEAFRQKRHDAKKARAAEPELADADGVEPSSGPTSSRSPTGRAAPRGQSAAPRGTASQQHKLLGPAYQSIDPATIANEPLYAVTIEEYKAKDAAGERAMDEAERMADEVQRLAVELEDAKAAADKLAEELAAKDEELAAHRQKRHDAQQARASDPALAAADAVAPRSGKGAASPHVGAVQRQAVDPATVPVAPAVIAEEPLYVATAEELQHVRDFADQAAHDAATREAEVAGTVENLRNELDDVREMNAKLEDEVFALKEQLSDAEDAYKKLAGALVVAEDERQELCDDLEAALDELEQKKDEYDELLGNLEEVQGLLEAADVAGRTAVEALEQRNRDMADLQGELANALDASKENENLRALLDAKEREIDRLKEYNSFWTDTVGTGKQKVTHRLTKIFDGDWTRLMRHRPEALKAAFVIDSSNACHVPGDQIFLVSNSFTRRLLTRTDHCPKCDRLSTFRFMSVSGMVGRMPYKPVDTPGPSYATLYWRKQRSGKIASQPLNEVCNKNEF 1347 T 0.012 Fez1 pdbpercent F Eukaryota T 7ako 2 C,D C,D CLSPN_HUMAN HCLASPIN MEELLNLCSGKFTSQD 16 T 0.37 RPAP1_C pdbhh F Eukaryota T 7aks 2 B,D,F,H BaB,DaD,FaF,HaH modified peptide XAKSAPAPKKG 11 T 46 NOB1_Zn_bind pdbhh F T 7al0 1 A A Heymonin APCKLGCKIKKVKQKIKQKLKAKVNAVKTVIGKISEHLG 39 T 7.9 Herpes_UL33 pdbhh F T 7al2 2 B B B9AGF7_METSM Cell division protein FtsZ QLDDFIDGIF 10 T 2.1 DUF4316 pdbhh F Archaea T 7ald 1 A A R7TSD6_CAPTE BRICHOS domain-containing protein SPRVCIRVCRNGVCYRRCWG 20 T 0.037 Toxin_25 pdbhh F Eukaryota T 7am2 21 U CA Q4QGU5_LEIMA TRUD domain-containing protein MKALGRGPITRLANTAAPGGFAAPGAVYNRDDWNAGRDVSAEERQCGILTRLCTLAATAPREAASPACGLAPLEAVIRVQSTDAHVTEVDANGGGAFLEKAPKGRWRKISRSKTLLVEDTATPFSNSDKSFSPRVQSYGEYVRRIGKLPEGRPLLRFAMFRDGYSLDSVCHRLRYEIGVPHDGVYLHEPPGGSFAAVTQFGVAVGVTREQLPHASRHYNVHALIFDDRGYHALDELPRLSVAPQAYLHRILLRCVSGDEAAVAQRLRHLSSNGFINYFGLESFGIGSNTLFDMAAFAFRREPHRSVGAYLQTLAECSPLHHQPYLSYANAEESTVAGAVAEWLRVCERAKLPRETRELLRKLHCYHLSQCHPSDATTISMEDVWKACPIMHRAEQSAAAFVWNAMASQRLLSFGSRPVKGDLVCRIGNRGAIEIAEVASDTDASHYTIDDVVLPIPCGGTPAAELRYPTHSVNEAFFTQFAKKHSLSFLFNSGVDPTPRAAATLGPYRRLVSRPRNLQAAVLQDPSSCAALKSDLFLLQEHQPTEGWSLDYRQRVREPSNFNVSERFRERMSCIRKRRAGEHSVALAFVLPAGSSPWVALREAFHMHYGTFHDFYGVS 618 T 0.069 TruD pdbpercent F Eukaryota T 7am2 25 Y BK E9AD80_LEIMA mL67 MRFDNAAPPPPPPPEAAGSTTTSSSTAASALPRVSYRTLPSSMYPKEVSSDFIPFPLPKHDASMGFGPVRLRNVPDIEEARARMAKAAQGPPRGSASTQLQDADGDNAEAASALWAELSGETTATSVGLSTTSVTGASPEDPEALSRPPDAQDEDLVGYHVHRHFPLLDVLGCDRSVNDLLAQFWNRPQREARTATVLDFAATLQRHSNEELTRVLYELSSLFEWDGNGLQFIAAKVLKYGRSYTVSSELTKAFVQLVDAMTVAFVEEQPHRLAESPALLAQVLHFLALVKIMEPNKWYTLNPNAPQNRADYTHPRGVNRTCGHVTTGRALLDFLEDMVTSGHNWTGEAAVDDQQGSLARSPVPQHATNRFTEGWSEDDILDVMAGFSGVMPDGKASSPVLYALLDELWMRWSKVGFVLSGSEQAVRLERLYMLLQVMDMQRDAVLDALLGGQLRAHSTAPSTSTLPTLFCERDDTPPLTLAQSLTQTRGPDFFSAVSRDKRAMVKAAALRLLTASLAKARDDSDAVLHQALVESGTELLQSLTSKSAALSFAQREQFDVITLRAVPHMADVAERLAEQRAEAPFFPLTASAGGLPDTAAVLAHLSSHPAPYIVLCKGRRVHPVRTLVSNLDHVAAVENVFLLHSSGVSKCVDALVAVARRLRSGKDALIVTASCLRALQAAAQYGATEKRRATADRALDIVSYELEAGRAILMPVTDELYLHDAGTYCDEDLMLWTLAAYLARDVPLVKVHTIMSSRSRARNPQHALRGEHSPLTSTDDLYNKSTPLLQALRSKELRAVTHHPVVQRPVRDPPQTLYNVNPIRARFVYRRDKALFDKYHVTARNLAPGFSQGALNSDLRALGFYTPDHPQVPYTPLSELKCHPVPANPPASQ 893 T 1.6 NRDE-2 pdbpercent F Eukaryota T 7am2 27 AA BN Q4QAP7_LEIMA mL81 MRAAPTRLAPSTVASLGRSHCGSQAYHLDAAGAAGWRRRRRLTGVSMTTTHTVAPAVSALPFSLSTARRTYYWPYPENLVPEGATTSPFQSSPVPSVRERIIREYALGPLFGSRTPCCVLGFAGTARDVAACKRDVRRWVARALGKSEADVELGALVQAKEMLLHRSGTDESPRLGDGPGSRPDAEQRRVTRYARLPVQARTLLEVYLPGEEHGEVDAAADADATILAHGYFLQEQLHRHMTTTASSGSDPTGRRDSEDCQPEEPARKMQSHCQGCSEPNDEADVKSSVAALHDVCGVIYCEVPVLDESDFAFDQLCGKEVDDETTERVTDRWTRRAMQQQPQL 344 T 17 DUF1382 pdbhh F Eukaryota T 7am2 71 TB BR A0A504XZ90_LEIDO mL78 MILITSNCARVAAFARKAACRGALCDRHRCISGGGRGDLFTRHASAFKPPAFGVLRGLTHSSQTTHPQTGQLRARKLIQHAMHDRTLSGSGHHRTAVATWSYLLPSLRQNVEQAVPDTLYEKLLTDEVPLTPAESRQLADAHRLLRFELQKRIGLLEDSLADAALPYLLQWPALFQRAWLRLPMDQGSASVTDAKRDAVVPAPPSFVHAPAALPITEWAPTLSPASPSACSNGLIGRGALVPRLAQLTRHVIRCVEEDLGRLEHGSEPREDGAPRSRRQVTLEAAWRAQWASLLSWHAGTS 301 T 0.37 DUF3945 pdbpercent F Eukaryota T 7am6 3 D P ICIC_HIRME LEU-PRO-GLU-GLY-SER-PRO-VAL-THR-ASP-LEU-ARG-TYR LPEGSPVTLDLRY 13 T 0.62 Inhibitor_I78 pdbhh F Eukaryota T 7ane 2 B h Q4QIP8_LEIMA uS14m MLSCKGVLLMRHIGQDVPRRHTHFVLESRLMYEKSFRDEWLRSLCQGLANVDEPLAKSLSGLPQQMLQRKVTCFSYNQFGLFKVPYYRLANVDRYYAVQGALGTREWVPYANVSSWTMNKMVRSGNILVHRVHYKGWGTDNALNQGGWEHRWNKVMQRNALQYNRI 166 T 12 Phage_Cox pdbhh F Eukaryota T 7ane 3 C aw Q4QHA2_LEIMA mS69 MHDANRFGGRTAYLREIGPIDHKKKGRLFKRDLPTLQFNVDVWCAQQTLRKQWKGRDWDVVEMPFEMAPKELQRVVPEKYTDVPIMTDPARHDYMNIRRKVFDREDMQDALFASGGAGQSPYPAIQRVDKAAMTLDKYL 139 T 0.15 DUF4993 pdbpercent F Eukaryota T 7ane 5 E f Q4QJG8_LEIMA uS11m MLKSSIVLLRRGKPRPRAGMFPEKYRRVPTLLKPQQGGQQYFNEFLIRSANDALEAQQQGYGSAFRVGGSGAARILADGNATAADGIHDGEISSVHPRLPQADIDGVLQRSRAETIQAELKQLVAQDGFISQRGFNERLWYEQEHHRLRTHGDGVEASPSEVAATAAASSSTAEAVSATGEGRAVPERILGDDYFQSKFGYSLLKGRSPDASAVADNVKAYAQLDLWGEMPMYSRDFVFLYLVSRRRNTYAVAYDYDGKRLLPTYTAGNRGLKGGDRGFRGDGSTDNGHQVTSMYLNDLLPKIREARAASGRPLGRGEKIDLVVRVMGFYNGRQGAVRAVQDRSADFRVRYLEDVTPFPLNGPKMPRGVFR 371 T 0.00026 Ribosomal_S11 pdbpssm F Eukaryota T 7ane 6 F s E9ACZ5_LEIMA mS33 MLRSSLRYGVHKVGYTHPHHLPVPCAQRWDLRLARARIFQEYIEEKAPGAWQLEDERHMSPEFNTFTGYPMRNLRPGYGQNLPEFIMKKRLPNNTHYELFARRDIPNEDNAMYGKLLYDMTIHGTSLPSIYRMHKDINKAQRNDRKLSGNRFKVLNSSGAKSPPSGFEAIPDAVEEEDD 179 T 0.95 Nmad3 pdbhh F Eukaryota T 7ane 8 H am mS59 MSPPALLRASGVLLDKSMFAAKRRVIVPIQPTPGYPAHFIKTSFTTDPLKEKQKARFSSGGDAMREVQDIPKRLEGQRSRAELTSRGDEDFAALIEFIQGASYDQLISGRRFRKVYEKLSENDDMFVWLCHTAMAVLNPGDMRSRLIHNHLKALAEAVASGEMTQRTAFRFFESAVRSPAYREIAARQLETGTATRLAGLAAAADVMREMGLTRRPMSSYFELYQRIVERSEAMTPWGFPPLFQFEERLALEPRLKFFSRVGQQQLERRRRGSIFSPHTILQGRRIFWIPPTWNRAGRFIGPHINLYPGLTPD 313 T 0.4 DUF2840 pdbpssm F T 7ane 9 I n Q4Q7N2_LEIMA uS19m MLRRCASAVAPAAHIPSPAAAVSGVQKRFLKIAKSTFGFYLARRGQRKFPFHRRPHIKNTQAMNLNAPYFWSYMTAKSQSFFLPEENYITGDWTGKFFVSKRQVYTLQHATSGGKVRVKSFPSVFELNSPSRWNVGKEMNTLTKPRMDLIDDQMLTKKQRLDYVKAGLLPK 171 T 4.3 LIN52 pdbhh F Eukaryota T 7ane 10 J ae Q4QFA7_LEIMA mS53 MSVIGVFSKGRATGHASVMSVLRYVPRARVPWQPSRFGRENLADDDMAQLWGTGRYRTGPGNYNSGYSTKKTHALEDSTVSIIPKHELEKFMPDISLGSKALVTPVSLMSARNGHRVTHDLIHSYDPYIGRLQKPAVVDHDNITVEDPNRVGLNAATLDCRSRIYRWLRRGPFFQEDNYFRRSTKLQRNGPVPVSVHEVPLMQRIIRLARRGHLKAACEEYRRVTSVPPVDVYRALTAACVPGAKLADAIAIFEDGHSRLFYVARDGEVLHNMLRCAIRAKHRVRVMWVYNVMVGRHYENVVVRAEVDVIWRYRIASLALEYLLDSNAGEEARTVYDYLVENELVDCDLHVRLGHVMQQALKEGKTVHVQQDALDGMALSQNVVAVAPQVAVAVYARYLETMQEGAAWTDARGLPLTDPAKTGADTNGAAAVAWLKAAFPDIDPVAVLRLARFRRSSKDLMAKDRPVYVQRAAQWVELLSSAHQTREEAPLTYLRKSRPSMANPNVRVAWLPERQRAHALLASDEGFKFAYAGPHTRFVEETFAYGENTLQSRYLAQQPVHTEVTPSVALGAAAVAAGMSSGSALPRLPGSSAAPQILHASVLLDGSLNGSSGSGTSTLRSGRNESAKAAASPTAGISSRPSSASSSAGLDDTHF 655 T 0.00058 PPR_long pdbhh F Eukaryota T 7ane 11 K ay Q4Q7W4_LEIMA mS71 MNRSFVSSADLRGLTAAFCGSLTCQKRFWAKPKKRPKVGPGFHEKAQKWRDEYLLDRHRVLADSLRAYVDFSSTKRVEPWDTRFAPFDRVEKDGVYILTRYLMDDKLQLCNYHHRPVKRMLCNVGLMGPQVTTTARWKPYRFATNPANTTRAERTFTKDKTVFTGYHHD 169 T 0.29 Tox-MPTase5 pdbpercent F Eukaryota T 7ane 13 M aj Q4Q7Q8_LEIMA mS57 MLRRTSRRLLGYTPINPDTSPMLMYSQCHWHYNLPQGMERPSSVNRSLPAPYQPHHSSVNKYRGVWISTEMHPAFLVGLAPQLKKLPHGRVVPQTPVAEVIDEFNKLSPLIDDAAARDGWLAKIFQHCAFQRSGAEAMALWDKHCAPRFMRDDSASAPPLPLVQAILFCCSKSDSAEWRPIFTKCLKDGWNYTPSFDTPQWSYLLKSLGRQGDEEGVRLVLEEMADVQADLDRVEARSLVYALNAVHDKAIYNYVKKYLFYLGERKVKFLRITYADLRGHGAEKLRVPLKENDSMFYHVCWHASIRQPRQFSPRQLYFDYAPSQLATSGHSPNAKVDGIVKDKIDKWKAEGLLPEDYVHEDRVYDRTAAFKSVARQEKWKKVPRIVKSKRFGYSGEP 397 T 0.2 RPM2 pdbhh F Eukaryota T 7ane 16 P az Q4QBM5_LEIMA mS72 MLRQTAARLNTYLTRSVATPPISVIRTGPKWWAEPERMVKHKVMYFTMGIDQLPLRRTAVIQKDLKRFHMCKPPPRVGDATGYKRSRGAQLTTWYRRIQYQEYHLQHLFVRHMWGLLRMYPGNTTKIQGKADDGYVGYDSVHFHRYNRSPLPFPAREIYERRK 163 T 39 MBDa pdbhh F Eukaryota T 7ane 17 Q ax Q4Q103_LEIMA mS70 MRRTRLVCTATPEKFSILGTTHPKPKRNGMGRNNKMRSKPSDNVAWYDKGPVEWLPRPVRLTYDQLDQLRDWMMRETIAGRTEEFNKIRHLHREWSQHPLMPVLGDVEPKFPLNLYKQNHRAKRRFLVRWHKANSPTYWMWMPRGPAVATPLHRSSPSQFPEHWKSLARTSSSSSSSGSSSAAP 184 T 11 THDPS_M pdbhh F Eukaryota T 7ane 21 U aa Q4QEP4_LEIMA mS48 MLRARARSAYTVARSSAARCSAAASIGAGSGALNGSTGLGSGVGAGAASASLHEPRRYSFKYATKRQHNEARQPSYIHDKRYGLFSNEHNIGKSRRGLPHITPIYTKHMSLWETDTDASTNRFFRHYVFGQREEHLLLGRPHGFEADQAGGRQGDSAYELNTDQRYKGVPRPAITNLHYEPVWNQTLYRNTAHGNQLTNPSSKLTAAVLGPDLMEVRDIKSVEHCKAWFDRLSFLIQQHYDAVGDIGAFRSRHAQHVHEFFVAFHDALSSFDFQDRYLYDQFAKARPKHLEDLFAIFLEMEANYVNEAYCPRCSLPYATTRYCGEGDPNTPFRKHRGRWAPHQYWGKEWYDVVMRRAEALWYRATEDPFFGTTAHTQRQAEALLSVYVKTKHRAKMVDFLHALRGSKEFLLGQLQITPAMQKAADDLLDSTPHEHLLTNAFKLESSAAKYTGDAQSVPHSPLQMRLDMEMNKYRRQQREEGVVRVPPASWKLDTSAIVPYKVDPQTKHVANWRAVKEGIEQSFLATGLPKEAYTSEEWREMLYLKERIASRGARRAQLEAERQAEEAAMAKKYGQRSPAATSSSSWCVFPDTAWYRVFETAAEALKPYGVTHAGQRVLMRSQTYANPAAVPYTDPVRNASVLLDTTPEACSVFGGFESGEVLRLTPKEPEGAAAFDVVVVGVDKSGAEAEWSLYAMHRDVASQTSKGLLNLGTDCLQLLSSYAKVETTRRRAVLTVLDAQRTLPEEHLGQVVGVRRGELYVQWHLQRGGSSELDRSVAVPLGNPETVKQLYKLQTLTDADGAPALLQEPPSWRTPFRNDFVDERLKELEQAPFKREQWASLIQGKYTPKVKKYGYAQHTTQDDFQTKEYKDRLLARQYFHSPQAFSVIPERHERSVKFMGKWEHQRVCGLPTADRDELEKGWGEAEEISDAAVGAIEQALRDISGRRPGNFVKSPTETQSLRLNESWWTPLEFGWEEHNREQMAFLDSSERSIVEGARLPFGGKRPPFGTTYGMGERISEIAADYAKGFGLGPHGHSPQHDTAHFNTLEAEQQRVKVLGLGNALVRLFHEKLGNQDIQAWSLQQCGESDANVRQLLLSVEEWRTKGRAPSLLLRKVLQRYLKEELDAFNSGLPAHVPRLAVPCADASAADSIGSSSAGCIWVDVDRNAFALEHASQFRRGDSDEPYIVGLVQRAGMSSGSGGALAASPAGSGADNTYAEYIQQDVLQRFEIGLARIVGRGIPSSIIMERTVKNSRGLERESAKMLTLVLVGELAKLLSKMRVTPDNISMVVRGLAQCPEKELLGGGDFAVPVSLIFSWNGPQSNVSAATSNSSSSNANAGASRGAGISAVQQMLERTRHGSSPGSSGSGDGEKMVGKVLEELAWSQDGVAADVLYALQQNKANPTLRQEFLNAFLPVCSNDHQKAQHMYADYTLGKFVPNITVAIEAFVKFLGNISTHPGQLTSDVEYFEVDNARRADGTEGTGQYTQVRLLPPEIGPFQYENTLIESIETAERFRRYGILAGPARVPASGFIAANCKSLTYMTHRDKEVVYVTTENDQGLANALRSSALFKSIASNPKLSYLLKGITGGAPSHPLLVDSFNRFFYRVAPMLSFYQSLLQEYSATMPSAQAEAQIANFGLARALESEASTAIEQDFRRNAERYWRNVLEGRSTEEAALSSGGRESASAQGRRPSQQGSGRAGQSGSGSRRGSGEFNLASVVGHRAAAGARSEKSRVPVSSSSSSSSSTVATAASTSTRVKGLLGSLKGGGSGAGRGKGSRPASPSGVASGSRGGRSSSSNNSGDKNGSTSGRK 1813 T 0.78 Metallothio_2 pdbpssm F Eukaryota T 7ane 22 V ab E9ADG8_LEIMA mS49 MSSAGSAAPPPPHTSSFGADVELPMSDWALRLQRELMSPVDPLGGLAHKDYYRDPATGYAPQYAPRDFVHGGSIAYPHMQGSGSAHDSYAAAAARRNWLEHDVESMAFMSQDARATARQLSSDAEREAFTQRHVPADRHRSAFPGNASLAAMDQLRTSGPQSDEKVYQQAILDRYRAAATSSSSSTAPGVSYTAATGLSGGELVDALAEDYAAAVDDGMDEELRIAHGLRAKERFDFKVMQRTSRVPFQGYDMDRFAAQREGRPHGAQQLPPVIPPSSMEEAMKNMRGGAAALLDTEAQAWQTYAQNTTSEEPKLGEALTGDVINSLHARRWSAQHAKEQARKQRFGLGRQGALVQDGGPDRRTLKKHTNDERLLDAVNFASDAYRRTITDEHVDPYVRRSTERGVGHLLTNSFDMARREDRVAHGQQDLTERNTVHYGVPIQQSIDEFVLSHRNARGERPLDYFKPFPDFRAQRLIRMYRDIEGFSLLKQRPEAFEWELFTRYRAHHQQRRELALLHGLEPVANETAAERTARRLALDELCEKTPFDPSKLHLNDDEVEIDAETLRNWFGVYVLPSPTIVESVVRAEGGALNLHLQHAADEMNTADTREHILSSRYMNRLLLFEGFQHRWNRGFTKEVAGKAPEPVIKYAQPQEVLKYFDSDERAMYQQYVQQESDAQLSEWAKVTRGRRYIAEKEQYGEVAGQGYKVPVVDVQHQETGAVLTVSSKLVEKSAAAALADKKLAGGSSSSTTSSSSMVHFDGQAYFVLPGSKRTVTPLSIRLESGESMEMTDEVFSAYPLEVSASAKYNHALNYGIGEYDYNRGNYIETQDAIWEKATADQEEGWSPATHADGLCPGLPVRARRRLAAAGEDKTGAAITGDFQRGRIVQYYRQPFFNPDPRLVTVAFYADGVVQEVPLANVMIWQRRYHGPERTVGDESRRYNPAGLRRYIDVADPNNKKLSPSSSAGAGANGAGDHFLEKYEGRLTNSVAASRYRTTKQITEIDQWNRFDTSRADNHRPLSISHRRDYVRQGYLPRYTPWEWIAIQEADQPIIHETMRTDNIGASYFFSLNRSWRYKARPHGYLRNYENEVRDMLQFVDGVTPWKQAQKIRTYWEVRQHHPMPQFNRPEVAMHRNSAGLLPSHMWEMDKKTGKVRAVKDSVRDYQTKIPVPKWVQL 1177 T 0.04 PSD5 pdb F Eukaryota T 7ane 23 W ak E9ACK8_LEIMA mS58 MSFRYTNHLVATLKHRLFLEAAHRQLVRQTFTGVCNGIEVTCTAYGSVVGIRMLDRAVWEPHYQVAADNKSEAAAATPTAPAAASPSRPSSSSSSSSTASGKTGIDLVKLSASIQAATWQAIQKVRAAKEETHSRSLRRNPQVLAEARLRDWYEQDANTLHPRPFDGLKNLEATEWMQAVRFGVPQPARYRRPNAAPKDLGEGGDGAHTDQKPRETITVLRDEDCDPANIPIGSVHPLFAPGLLQLEVDPNVATNGGSRVDEFFVLSEQRKEMRRDEEAFWERVELIRRSQLATIPKGGVKRGYADMADTVQDSIEEKVQLRFTQ 325 T 0.00057 YbaB_DNA_bd pdbhh F Eukaryota T 7ane 24 X ac Q4QBP8_LEIMA mS50 MQCHHNVLVGWANSGSSTAAFLTQQQQQPLPPSPLRFLWADQPLGSPSVLAPGCGMYRARSCGVVAASAAAPRVRTALDMVIRSYTPIYAPDPATDHLGALRSADECRTLWAQHIPVPSLTRAIELWLRFGNDPVVHTAASTASAERAEGDAAPSSTSPFAYVEDYMGSNMVTGTPEHVKESAELWSEYFETKYVRRMRQSRRTSKQYVGVLGAAGRGRGAGESGGGSASSSIANLLLDEADHPNTKWEADTFFCEVAYLSERHLKTRVTNHLQLDKLLWGGTAKPDAFVQFFEAFQQQTITRIPLPVPSIWVHESTEAKKKWAEHYLPACSAAHEFFQEKLRPHAADAAAQAKLLADVAAAYRQVHAILLERRARQVQAGVYPSTWTGGGAAATATEEAWAANEAEKEQRRMDEGVYDPEDLLDTTAEWATEHAKIQAILEQPLTSSGSNGEKSYGFSLQDFWLHTERREALETVHVLESESLARVAAAARRRLYSETPLPDVFAGLEESVAKARLDLRAAVLKPHFNSVWCRMHYVKFGAASLVQHTHTASRQLLFHYAASTQVVAATAEMYYATKPLSSQLDYASPYTFRRSLARHCTRYGVEMAHAAQQPLLLSAAYLAKAEGVIGRVARQAAAPFGARRRARYSAAQLNNQRLLNPVKSVQVTAPAPELLAAGADLLTILREERTPKAKAAGEALKVWPLGSRQTVSYDWTSPALDKLRLTDSSLTAEQAAQRDQLRQAGRLEISLWRRRTAEERQKVRAEMKKEAVDVQALVAETPVLQEVLAYASHLYRKLTREQEQEQSYSDTVPTPHAWDEASGEWVFAVMLDDDVPLSETQSTEVFLPYVDAAGRRLPNGEYRVAVRAVDRELNPTEHPTLMSAATSSPFSVVDALPQLYAQYTRHPQPADKATGTPTEAPLEGDVAGKDLMSFCAFLREAGLHISLSAEFAMGQSLDKQGNVSVAEVAAVLRGTEYHRSQCEHGITDAQRTIEPQCRLHWSLYHPGATEQEWAAARRRVLRRAMAEERDWWLPDPMLEVTDVRTDSAGAASFSFGAYPAVARYGTELCTVLPAHGSNHMEYAVLPPPPGVKATARGIGAQVQAECTVDGTGAIASLHYGAPISAADVTVEDALRAAMEAIQVAQMRHNTLSMVKLCAFEKQAQTMLFCGIQGLEFGGKHGRTYAYALEKAKREMAATAEAGQVASLQAADAEKLRLSDQEQTSAAVDRFASQTNPEQRLTRFVPRSTMSGYSMEDVGPERASTWGL 1267 T 0.76 MIase pdbpercent F Eukaryota T 7ane 25 Y ad Q4QAQ0_LEIMA mS51 MMRHTVVRHHRTGKARAFIMRDPSLKILRAGSGFQQLKRMGMPSQKTLGYRQVDNFYANNQYQHAWPLLTHDDLGNSDQSNQTKNILYSMYMPKRNKGTAPWFRGADTYSVKYCEQGRYEYQRYLMINRFPSEYKRHFLRFLGNIRGSKSRDAVPQEALHWLLRMIVDNFNPQHVHYIAAMRTLQDSGELDMARDVWKIMERQQTWPDTATICAYLDVCVEAGEKTWAVEAWNRYCTELRFLQAGEVDPKPITRTPFSLTREELLYLPKWKKHFDHDPNLDVPDLNRFNRTREVYLRMAKVMLASDDMSMFEHFFDKLQAAMLTTPTPVPEPPNPHLVRRPRWSPYEHQKSLHHSPWRMDNNGRAMALGPSRTIEGEMQSRFFSNPQFLVHAVKEAVAVVLQRHMAMFPEATDAQTAAPAFFELTETAQETLAFCDGLVQRMMERLEDKLGSLGTSSLLSTLLCIRRVVGKQSGRALLEYANQFLAKKATLSADGLRESLTAPNYFQILAAYADESAYHYDPKTRQYTYAPGFRPTETMKGLSATLNEISANQHVAWSAEMHLQVVRTLVGCGTMKANAYFVENVLRQFKWDSRFLEALYAEYRRHNTVDGWAELTKRALVWTARYNVIASERLKRLIEDDYDIIHVQTRTFRELAVFQFRDAEEKRHARDVVNELPNPWIDYVTHALPFPDRDAGYPDEYGDIGQWRAPGGPGSPVKGPGYYAPPMEGEHMRGYTAEWRDLKNPMKPPAFPEPWERKYKQYARGQHPSYDMVYAGPMPEIFPGRRDFRKPTRWDYHDVEKQGKHKISGPY 811 T 0.0012 PPR_long pdbhh F Eukaryota T 7ane 27 AA ao E9AF47_LEIMA mS61 MLRTCRVLRFRMKLGSMYVDYKIVSRNHRRSIRVEDALVDPLLPTTVVPLHWLEQLRCPSTRLLTGYHTEEAVYAKPNYGDRVSRTPALLSLPDAAAKTADNGAHANAIRAGPVVLYITGQSIPVVLNPLFVQPDEWGLTQSNGEWDLRIGMDAIEQCSLYAELRPGGLLYSKLPHASLTEAMEPVQDTLKRYGMRCALAESPLVPRPWTRMRYMFIDELQRGQKMTEFVGYNPRNGTQWRFSQHTKYFRTGIWRETIRRNEMNDGLHAHSSWQKSPQQAVPEISFLAPYP 291 T 5.8 AAA_11 pdbhh F Eukaryota T 7ane 28 BA ap Q4Q847_LEIMA mS62 MERAVDARRAIYELWSRTAAAEEHAQFSSDSTSTGEEAAAEAKAAEERSTAVAALLDKYKLDPATPREEDISRGLGDALDRLLLLCVPLSSRHGADLLVKLMQVSAQQGRQFSMRTIQHLFARTSSYAEALAVFYAMRRSNFAMSMEAYHAMLYSLQRLEEEGWAARFHEEFAASKGEAISEQALDFVLRGVDNQLMPENKPWLGRIMFAEVKDNKATQRQSMASFDAMGKLWVQRYKNGGTAPE 245 T 0.045 zf-C3HC4_5 pdbpssm F Eukaryota T 7ane 29 CA aq Q4QF40_LEIMA mS63 MLSTSQAFLASLRYRRPYWMLFLKGADNWKIYTVIQQPDHQRTEMLYQAWLGGLDRPYVRPKCMAHQPVWLSKKRHLLQKARLEGPETSVEKYVLEWYKKFHSFQGTDRPTAEDLHTAFDLVERPLDLSYACQLLNQCRNHYYIRLSDDSFEIFLEACLRVDRRDCAIYALEHAEELGFWHVSDNCRRYLAGEQTWYKRSPVDLLYYPLEENAERNTAVTSGTAASAVAASDKVKGDEKVEGAPRTSATTSATAASAENEGEAEVTDDEIARLQAELEALEREIGSEGADGTDKD 295 T 0.034 NMT1 pdb F Eukaryota T 7ane 30 DA as Q4QIF7_LEIMA mS65 MMFRGTSCALARSFRANLKYPSLVSYNKLPWEVVSHDSTKLHMHLAPNYEQLLTLAAVTDVPHLTLASHLIVPEAERLRVMPGVVYLLGGQAAHENPSSFTAYRIADPTSLQYYGRIHHNLAPIRRVDMCASADLRLLCLAMHFDGVLTNTSAGSTLDGVTTASQEGHFSLFYFFRPNRPANELTQPFEKFYRHRPSLASLDAFNAASPGKAESWTPVLQVPRRTAEKARLTPAEPYRPPQNYLMGLAERLGVRPGNAFGRRSLMWGTWF 270 T 31 PELOTA_1 pdbhh F Eukaryota T 7ane 34 HA v Q4QCC8_LEIMA mS37 MKSSDIFHACKYTPILLKSRTNDSGVNQYGLRPVNSYDYLNPTNLVNFGRGTAFDNLGVRRSERGQIDSAPSLGGSPVFTQAKLLGLSGDDQLRLCEAETTQLRMCMAKGGSACERESLLLDACLSKVGHLRRAISQAGSEFNDWFIQNVSDNHTKPFQHRPHDWRHYYAQEKLVREKQQNGHAYGRRPKEFSFGARYVKTEGYGKRPRLPYNK 214 T 8.9 CHCH pdbhh F Eukaryota T 7ane 36 JA p Q4QFH6_LEIMA mS23 MRRSSSCLYKIPKNTGVAPRFDTWNEKYEPWEHLKRMGRLAGTGFYIPPEWYNHFRMFPPINHNFQQEKTLNPHNVSEPTQDDTDYLSVERIALRDELARKSRLLASEGMRYYNVFWIRKPLDRMERHYYELERKGVAHSVAIKRVLQMFYDELTVKKRVAAIQAEEAKLSGKYISMREATVLMGVLTQLQKEQLTPHQVTLLAKEQREKAQAGEAFAATVERSTETVADAAKDAAEGDEVMSADSLAELLSTEADLDDSSPSISYKVTIHETEHDSVKQLQELALDHTGKSDWYTGASPVLHMEEAMPAKRTPASKKTES 321 T 0.051 MRP-S25 pdbhh F Eukaryota T 7ane 37 KA j Q4Q8H5_LEIMA bS16m MFSTNTVLWARALVDRKSPQLWGAPGAPIIRMRGHHVTWKFQSYDMFVEHTHRRRNSDIRLLHYLGKHCPHPQKSLWSPDTPVTQDRHLFMLTTIDVDAFKYWFGVKRCRLSVGPWNILAKSGLLPPSYKQNSKIMPKPIFDKEQLMRYYLANRKDQRQMEREDYLNYKNSMVKSPEERAAERPVAPFL 189 T 0.067 Ribosomal_S16 pdbpercent F Eukaryota T 7ane 38 LA l Q4Q3T6_LEIMA mS52 MITNPGPLRVAYSPDYLDWLYRAYRSKLKYTDERKKAEEVFNGLLLTNQTDEQGPAAGAALPGAPPPGQTLRPRHSVRRQAGEARRAAAQAKLDTLAKQQGMLDLFERQPQFPAIHIDKAARFHVVELFKEMVLDRAWKPEEVWDKALLYRAILTERQASYPASYRYILDTAQRVNLAPRESGSSDTGSSSSSADARSSNESAGGIVSESTLVIPREDNYMYFVYLVRRYYIDNAVEGHVVLRCHRQPNASELLFSHPPPKDEHEVLRSLYRPGTATATQGKDASAKAQDGAATAPQQRPSGAATIARPRPPSSYPPIEALWRCEENEALLRVLVFGELNLLVSENPFVRFPKAQAYLTRPSASTPVPGAAGGSVEGADGYGGPQQQRRGGGHRGIGSDGGISLSSVIAEKRGHLLAPLSRNVAMMIDSRANDVRRLQQRYEREDTASFQKMLRGSAQVEENPGLYSAYSDWSYFNPRAVRAEERDALSRQTVAALKTYDEASRDIYRVGFEEAEARSAVRPVEGVNNAPSYVPTLPHFVALVKKDPHVSFLSHVALPEVYNTASHAAAGVSAKHQLEKLVVQLARALYRTALEFHKEQLRRVNRQKVQVAASLLDRFVTERWRVHCVAHPSSEGVRDMARRFRAYVPFEGRILDESGFPTDARVEDYERWMAAPSV 677 T 0.066 WcbI pdb F Eukaryota T 7ane 41 OA ai Q4Q6W8_LEIMA mS56 MLAKYGDLTVVKDDLTLLEKTESYIAKWRLNRWEFRVPPLLYPAVREKVMLQQEILKALCLNRAEEHKHVLGDIQIVASITGISPESVREKNRAWLQEEASKLRWKGEVNKAKELRDAFLRLEVYGSRDHRLLERLCCIYGMGMQGTFDEAFSNIIVQDPSTGKLAVDEANPFAELQAYILSRYPQIDLIHDFLGLNVVSGYRPSLGRFLIHCLSKKNNISNPVSNGRVLLHVSTSKETLFDYGDSKNQVAHDDSIYGLPDFMYVRGNDIFLIIIAADNHWLRKRQVPHTKQLEGIARRCSFVLGIPFDKVRIRNLLLPPNYVDSSSLRRLTETVFDMSPASVKEAVPWISLYEKGLDAQDVDYCELEKTVNEEEWLTL 379 T 0.12 ApoC-I pdbpercent F Eukaryota T 7ane 44 RA g E9AE13_LEIMA bS21m MQCTSRLLGGYMMYHRKSMSTMRYSKWKGARGGLSHFYNRTAMLEKVPVNMPVSIVDRRMMAYVHRSRLRHFQLFRSYQQKSNSTECKLREGEFLRRRWHRQLQKSFIAFMQFKTMKVLEEQAKLVSQYGQASVNAALGDPQAAAGDVAHERKYAALHRRVQTLPRIQLVPKHVATMKQIHNDRFNYRWRVN 192 T 3.4 HMD pdbhh F Eukaryota T 7ane 45 SA o E9AFL9_LEIMA mS22 MLRRSALARRYPFTKRGPRERKSWKHHVLTEPPKPVEWRDPKVWTKDLSQMKSFDAPQWDLWLNRSRSQDMDEALQPFMDMPQSLKDRRYDIPWWANPFGAWYLQNVLSVELMKLPGRTNAEKIAIYRGRKRPATWDKSKEGLMDDEVLLKQIVKERWRTLEFGDRDAGYPCTFSDYIQFLNEWFKSLDEEGLQRLREHFDRKIRPLLAVMTHVDLMWLEALTQNSVQNKEQLERRIGFQTSLGTPEFFDMSKRLRYEINEDYKVRDELGPELFALWSKAPERWPPERLAKMYGLDFTLVRKILVWHHFKACYDACVEPDWTLPKRLFALEWIRDVRARKQGLFYGKLRFAEQKITFYSDKFLFRDLVNRREASYANVWEMDDPYRFLQTEQDYEDYWGDNYDVYRRMFPEMIGKSGEPVQQYSQMPVWAGPHRDHANKSEHNWMFAEIGVNVGHEPLKKLELDPTNEKRRRFVIRQPDGSLRSAKMSEMRAWYWKEEWADFRFWAPHMEWGVENTPSMEQYQEHVPDTPDADYRKQRRIQSRPVKWFYESHYSRSGSFAGFQPLRFMQRRTQREVRWPDVINAAVQIEKSKPSSYVFKAIPEI 604 T 0.00082 INCENP_N pdb F Eukaryota T 7ane 48 VA q E9AFH3_LEIMA mS26 MHSSGVARRQMRPYYNLPSKSEHGRRMTGFLTPYRHWMWKQNELWRNVHEAQFEHLRKLYKRQWLESFRVNADEYIYKYNITKSAQLAQWEHEMQGQERKRRESQQLAQGRQALKKKHLDLLREFHERQFFYWYERASERLQYMTHINYVPQASIQEHIDRELDKYTVGSKAPYPLNFVGQMPMLEDKDGNIAQVPANLMTNHATENPDGGVTMYEAPEGTAVAEEKLLQMIASAQEEELRIRPEDSDALSESMEDMDRSESARIDSRKVARTMEETDEEREVNRRAYIDRGKTGSKTIFRRPRLDSDGATPPSAGGTTPMKRRKSKMDRMHALQEQQDAVAAKATAKALKDEDSVAKATKRGEVAENRGRLRDRMIMPSLETLQQSPEMMAQNKPGGRVRTHHLMEKVYGIGKFKKGGGGDEDA 425 T 0.026 ThylakoidFormat pdbpssm F Eukaryota T 7ane 51 YA ba Q4Q6W6_LEIMA mS73 LRRSNRWCMKYANLELTTRGEFPHGMKEPGFVKKLDKNIPWYFSTYRSMYHWPVAGDGWSDLNEAEKHHDLHMYYTLAWWKLGEGIFDADDEDR 94 T 9.2 Mastoparan_2 pdbhh F Eukaryota T 7ane 52 ZA z Q4Q190_LEIMA mS47 LRRRVSATPSLAVSPAGPSSLSLTPSPADSQQRRSLKTLDVREYRPLGTPIEFRFYQRYANHPNRQSGVQFLTHYNTHQRFRVNKDFIDYMHWGKEQGQARLPHRHQRVAFDFDDSLQPTRAEGAVGAWFAGQDPTMRSHPDISASFDPNKKLFSHPEHWNKMFSKRRPGEGDIKLNVIPSNSLLGPMVTQTDTQDMAYFKTETCGPTHGRVPGINAPFKGEMDRKMMQAMSRPLNRSCTLTGNNGRFSNTIFINDPKRHQTLSATLAKELNREVDRATNGLYSKLTVLTSAQSGLTDFFCGGTDLQCIGFDLTMAQLLRKEADALTKSAASGSKKVEAKVHELLRDAERYEERADSVLRENAAVIWRAYTSPRALMTLVNGKCRGTGCGLALAAKYAGLQDASEFIVDGPNVGLTPYSGMTRLLARPETSLKYPGLAEFVMLTGASLFAGDALRLGWSDLFTSLPDMPYHIKDWFDSTEHMHNDAVAWQLGHLLEKCFQMKDRWHTSAMERCAMTPIRARWVEDAFADQSSIEEILKTLSAMEKLPLTDRHNTYDPSYATPYTLASVAEGVEKLGASRLRYTLSPWDATPPEEAVEVRQAAEIFTSYVLERRGKVNIVAHRDRHKVQAWQKQREREYVAYSNMKNAPHRRHVYVRLEGCEGTLVDFDFTIDPAGDAAAAAAEKGAGVDDRNELVHTASVERLKRAVLQAMGMPADRDVDLCWYLPTLDTCPIRNDEELVDVLHSDPGFEDPSAQLRYPPIYFLVKRNTLHLSEWAYAVKHQLLLQSPYALKATLQLLQEVRGDGSAKAVRSLADTLATEYRYAARLLKRPDFYQVGQHVDKSPEEWDVVKEERMRYVHKAHLPTRPLPDYEVVFERNVQLDGHTFQLRPRWSPRTVQEVTAESLAPLATPLDFEKDGAVEFNVVVHASKADRLAGMIEDAGGFEVVAHLGEVDKEGNAKVPPLHGDAHVPTNVNFYEMARHPWEDTPSSTRRDGFTAGSKEYFEQQYKKAEKAVYDEAGRGQRNYWPSKAAVDGVTGEESNALLEERFFAKLRDAERGVESWARQLRKKAVEGKLDNKPEIATQQEKIYDDDYYRWFIQPGHNPNPSGLLRGRKVADSGSSSVDKDLEVFLNQLLSGAAERGADGTTGDEGEALSLPEEDTDEAADST 1169 T 3.1E-05 ECH_2 pdb F Eukaryota T 7ane 53 AB bd uS3m TKKGATKILFIYKLSKLNVYNNESYKIKLLFNHLYCIDNYNSIYFNLNGILIWLNVLHINIILIKYAFLILLNNLEYLIIFKYNIISIK 89 T 150 Cytadhesin_P30 pdbhh F T 7ane 63 KB J Q4QCY7_LEIMA bL19m GYTRERTNRHFFVSRANAFFSRLPISRIQRALAMEAIKKGSMKPWKHTKEQIIGSPITCNFEYNPRPVRLIGTVMDAHTEETSIKGGLKVYARSEEANMMLWIPAGNPKLKYEVTAAKGSFEHYLDERSKWDEAWLTGRARMK 143 T 0.073 Endonuc_Holl pdb F Eukaryota T 7ane 69 QB P Q4QG34_LEIMA bL27m LRITPSRYASKVTAGNAKNQAGSPRQKAKLFHVIPGTPVTPVEKLKEQRRRFGQDRYSRQPEYRPGRNVRMDPNTFTLYATTKGVMTIRTSRINPSYKWLDVEPDIQKVYRSRCMRAALQARGKASMMVAGNVHYRAELDHVTEPHWRERVMRVPKATERFQDPNYFTRGLVPSLRPLSRYSYE 184 T 0.31 Ribosomal_L27 pdb F Eukaryota T 7ane 70 RB Q Q4Q719_LEIMA bL28m LRQSSLLCFSTFALNPETSRAPHGPPRGLINRYISMGLPPWAAWCNRVNRHALYRMSDVSPRSFLPKAPHEMDVIWMNERVRERVRTSRQVQHVYRQLKYPFVKTGIHYSDTLDHWVQVPMVEAAMFEIEKDGGFDNFILKRSGPELRSTYGERIRRHLLVRQKETQKNFVLDQQAKALAEVTQAELMKATSEEELDAVLAKYGMDAEEFKRLMAKRVMEQRKSVAAAGLRSK 233 T 0.011 Methyltransf_5 pdb F Eukaryota T 7ane 72 TB S Q9U0Z7_LEIMA uL30m RRCVMAKGEDPAHVAGWDDRQDAVEWWWTEANDSRGRQRLEAAAAVAAAAASSTVGLPLFPRFSPGRRRRRRPPAPPPPPPPLFLSRHLHSMPWLWCTCVKMQMYYTPTALTCPLSNSLAAHVGHIIVGVAALLPYSMLLFLTVMCNPRKHEPVLRAQRIRWLTFHSLMFRLLRCITASPAVAASVAVAAAQTPTSLRPAAVCRRGVHLAPSVLAASAPPPPPQQQQQPTSAAVPASTATSTTTIAAGPYRRVGNVFIVTCIDHPFKFSWEVNRMLRELRLEFMGQTTVVPDIPPVRKRIWRVRHVVRVDQLDLDEAKALIGIPEHISFRDLAGQIPPTFGRGGSVANPHMRSKMNFMRLRRMRLRDVMHRDQLEKRLLEERHHALQQQQQQQQGGGEAAAAAAATTA 408 T 0.00017 Ribosomal_L30 pdbpercent F Eukaryota T 7ane 73 UB T Q4Q2W9_LEIMA bL32m LQRTTLRCYSALVGQATPVLLGSKGGTPKRKKNPMQLRRKTYGLHFKERYLKLEEWYFCPLCAEPKKQGEWCRREDCRQIKP 82 T 0.11 Metallothio_Pro pdbpercent F Eukaryota T 7ane 74 VB U Q4Q2Q8_LEIMA bL33m FRASCTLLGHGQYKTRLKKRMVGFIPKVIPRKIRNNMVALRSEANTGHMEGYIKTEAERLDATGRKLQKTMWDPVLQRYTLMKETKVRGPFLTKSNIARKVDFPVGALHGTKLGGKK 117 T 0.0038 Ribosomal_L33 pdbpercent F Eukaryota T 7ane 75 WB V Q4QCK6_LEIMA bL35m FRISLICFPKAGCEEITRQGRRVVLKPQEYFAQHRMQVWQMRFKEMGPPFSRVWVALGGKMRRRRIGRQIDVKDMRYYWRPIEPQYQRLYMSRLRIKDHSNKRVQPMRLRATNNDIGQASSLKEWERSSDRKYGAALAPPKKRDFEFRVF 150 T 10 Gln_deamidase_2 pdbhh F Eukaryota T 7ane 76 XB W Q4Q6A3_LEIMA bL36m LQYTSSARQALRATALVLNFFPLGYTCGPKNKQVFFPPNNLDGRTTHQMKKLQGSTDKHPGLVPRDKLKLHCEFCRFHWVQDTLVVRCAAHPKEHNQREIWLEPTWTWGKQQPYQYYKYMPVNINPRTGMPLAREDAKGMNNERRSQGLPTKTRLLERERRGISRAITGLGIYNQRWQTRFPFAT 185 T 4.8E-05 Ribosomal_L36 unphh F Eukaryota T 7any 1 A,B,C,D,E,F,G,H,I,J,K,L A,B,C,D,E,F,G,H,I,J,K,L Laspartomycin C Friulimicin-like mutant XNXXDDGDGXVP 12 T 1.4 LCAT pdbhh F T 7aoi 4 D A5 A0A3L6LD92_9TRYP bL32m PKRKKNPMQLRRKVYGLHFKEKYLKMEEWYYCPLCAEPKKPGEWCRREDCRQIKP 55 T 0.036 Metallothio_Pro pdbpercent F Eukaryota T 7aoi 5 E A8 A0A3L6L070_9TRYP bL33m PKMGCEEITRKARRVQLQPTEYLAQHRMQVWQLRFKEMGPPFSRVWVALGGKMRRRRVGRQVDVKDMRYYWRPIEPQYQRLYMSRLRIRDHSNKLRQPMRLRATNADIGSGSSSIEWERASNRKYGAMLAPPKRQDFEFRVV 142 T 0.095 Cytochrom_B559a pdbpssm F Eukaryota T 7aoi 14 N AT A0A3L6LD66_9TRYP bL19m GYTRERTNRHFFVARANAFFSRLPIARVQRSLAMEAVREGRMRPWKYTKEQILGAPVTCNFEYNPRPVRLIGTVMDAHTEETSIKGGLKVYARNEETNMMLWIPPGNPKLRHEVTSTKGSFQHYLDERDKWDEAWITG 138 T 3.5 DUF2760 pdbhh F Eukaryota T 7aoi 15 O AU A0A3L6KV21_9TRYP bL20m HYRAKLELDRIRSMLRGRARLERKVGLKRLFFLMRTQTRYRVEQQAHWERAIVRKNVDSAAREHGTGWQHLRNELGRQNVMLLPRSQQLLAQYEPLAFRAVVELCASRIPPPPPPVVASVPEESYTLWPPASHDNSECASTDGSDAPHGQQQSLSHPAARVELRCGVERVLRRGPSGLGNNVNELIDAWKEFDVSP 196 T 6.2E-05 Ribosomal_L20 pdbhh F Eukaryota T 7aoi 20 T Ae A0A3L6KTI0_9TRYP mL41 KNSWPEWMDNGADGTGYGIGLHRTHPLSRLRGNLKRSPSHVPRVLGMMIQGVWHKSGVKLYFRGGKPPNPSVHPYLTGEPCPLYGWKVTDESVIRQFNMPSIDGTNFRYKPYVALQ 116 T 0.71 MRP-L27 pdbhh F Eukaryota T 7aoi 21 U Af A0A3L6KXE9_9TRYP mL42 FGGFGMSVYTPKKDRRFRVQPLPSLHANSLADDTPLVTTTRTLSPNFRAFALQDGGVFFTHPSHEQVMRVGQNILAEETKATGMTSMDTYVNSRIQSIIAENTVENVALSHWRRRHMWNLVRTHGKLQRHWGV 133 T 0.064 ATP-grasp_6 pdbpssm F Eukaryota T 7aoi 25 Y Ap A0A3L6L3K9_9TRYP mL53 TVGLLPAQLALSRKPRLRPHVGNLKGLVYPLPYYAMWRGNHNKYTYNKSTVCLWGEGDTRSMYHQHYAHAKCPTDYGRGGREFEYLTVKRGKMLQKPLPRVQYVAEGSKPVWLFKSWHTPLSSPSMWEREVQYAEHTPEHIGAKRPLAVVAPRTMHRYLFLMHMEKVTITVSPLLFGYGHTIQKAVLDFYRRAISARSPFPKDKVFLFYAIDHITPRIEVTWLDGTSYVPPVLEGASSQDLIQMVMEEAWLAADRMAAEGRVLNPLAIDDYKWDQLVVFKKVRDKEAS 288 T 5.2 MRP_L53 pdbhh F Eukaryota T 7aoi 26 Z At C9ZU82_TRYB9 mL63 RYRTAWRELLHPLPVRARKMEWLKRDAVEENEEILRRPYYTIKSYALPPAVGRQESIHNSNNIRGGMHSSHSLDLIMRQPRRVKTPEQLRALRDRLRFIGVTGPMPQATSVSTKSYTDTYGSRLRPRYPESWDTVPPHQPSRELL 145 T 21 DUF4113 unphh F Eukaryota T 7aoi 27 AA Av A0A3L6KTC7_9TRYP mL64,mL64 TPMMLNIQNMMWWNGKRNLYRATYREKTWYEISRTGAFTKGRRPVMRQKYSREALQAALAMVPPGFEVADVPRPPQRILAQSEGIVGRWYSNYWTLHSMRYQCLLAGVEWPLGERQRPRTNYDEPFFFADFEESKARRDYRSRWINVNRSLVGMTKRMKEAEEEARYMQFRKLQDTFWSNRKVLVNRVKSMYNQGAX 197 T 0.022 DUF1672 unppercent F Eukaryota T 7aoi 28 BA BA D0A5V6_TRYB9 mL67 RYRPIPESMQPKHLEDNFTPFPLPKFDESLEYGPVRLRNIPDIEAAKERRRGSRLAATEVLLQETLQEENQFATSGKGDGNMAIAITERHTEDVTTPAADSRFPSQTMSPCSHEEEMRGYVVSRDYPLIDRLHCTRSIEELVAQFEDRPQIESRVAALADMASTVSFRSDEELLRMFTAISAPFSVDGRGLNFLTVKVSKFGRPYYVPNSLLPAYVNLVDATTIALVREQPWRLSASPALFIQVLQFMALIKVFEPNKWFTFSDHAPSNRADYRHAIGVNHSTAFWGTGEELYDFMVELLRVEDDGRIPTMLDLCTREQMVDLLSGFCGVMPCGKAVGDVFKTITDAFLRRVRNDISGPWSAHDWAIVERMYLVTVLCDAGNNEILQLLLSDTASPRGPDFFAAVSRTKDTPTKKRALCLLQEAIDNASAKADKVTLLGLLESGSEFLLSLVDKGVAHTFATQNLFDYRILNSFLHCSLVADRLRVEQSVITSLIPSSLRDVQVQMLMSNERNALNPLTSSLPGNSGAIATAPKLKRPLMTMLSQLEYLNSIDSVFILHSSLMATSTDQLVSAVRRLPSGKDSLIVTMSCLRALSVKSLTSPSMKERIACARALEIVSYELEKGRAVLLPFSEEILLHDAGAYCDEDLMLWTVAAFLARELPLVKVHTLMHSNCTARTPYRFLKGGHNLLVSSRSLYDKGAPLLSSLHSKELRLVTHNVRLRTPVRDRKCTLQYYNPIRARFVYRRDKPLFDKYHVTARNLAPGFSRGALKHDWRALGVYTPDHPQVPYHPLQTWMLG 798 T 0.045 DUF5642 pdb F Eukaryota T 7aoi 29 CA BB A0A3L6KX69_9TRYP mL68,mL68,mL68 KAWFEPYTPKKFDMEHQRISHNFYNLETKLIWTAFDTPELIGILLHDETIKGAPHLYDAEFLESAVHWTRESRYWRCIGITKPFYNKTTLRAQCWHDRGLQVGTLVFSQAMRDALMDLERAVRRKELGLEPNYVWDRWGPVGFIDGARTDHLPRFAHNPYVDPDGVEVTEVDIAPFNTHEQIKERYGAFIDPDLRPFAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAALAAAAEKVEALRLMENTRHDAARVRTFYEEKCGFSDFMRTPDKVITAAVLCYLQELQRICTETDWGKPLARCLTDLERVNVMGKDAFLVYRHIEDAILDKKRRVWATRFA 371 T 1.7 DUF2339 pdb F Eukaryota T 7aoi 30 DA BD A0A3L6L6S3_9TRYP mL70 PICPGLCGELAAVPFRVFLGTLPTLAVEERFLRQLQPVFAWYSSRKRVKEQANEFIEIDLASCDAELLLRYSHIYYVRRQLFDELIERQMTLLDSGKAPKMAEPSLLQCLAGCNMTIADRLQLEIRQLGAAKRAASVPGRRELDPVARLEVYDYACMMRLVEEDAGAVGDAEMKARAYLPREVIESKLGHLTQLLLGSDARAALDKKDVKLLNRMIPPDYTRVGCVEKLRPFDVTAYFRFYGERINNVKVENYFKRALWGHVYRRFATTPSFLSGVSTYWARHSGLDASFTTTTMPQEVAVAVCDQQIQFPAIKFRAQYVYTSPETARQLWRTDAAVPLMRLFPLMGSRTAEDLAAGVLTDAFWMHLGLSEEENLLQDSLLLKVRRFVDEVGDMYETNIDSVLKRVDDNFKQVVPQL 417 T 19 Vps8 pdbhh F Eukaryota T 7aoi 31 EA BE C9ZSQ8_TRYB9 mL71 YQRGGWSPGSKHQKHMTLNPTLYLYRFPGPHGPGPYTMKYWWTLGCFPTGMEVPFRLHEFLSTYQQEHVPVEVEEWLRCYIKDPLSELVNASNDFFKAVEVYPEVESARGYKTLQPSIAPLLVPMKKFEEQLGVKISPVGLRSVLSNPVLKDRFLDDLFDYKSYVEKGGSTPHRRLARSRFEGSLSVLGECEKCLPEQHQVEISESLGTFIGATVSPAETTADDERSLILLLTTISEGCINAGNYSDAASVLADALMFCHDPDSQATTHANISFASLLNADFKGAEYNGREAALLQPQVKPTSTACARGYVGWAAAAAYQDDFEKAEAIVKDGLTLYVGNEHLEKLANKLQALREEQPSVYKQVPRSLRESRSHLPSQQSRGLLSGSGKGFSNEFDWVEFKNKLYPSKMDPRNNEMGSVFRRVGDLGSFISTSRSMER 438 T 1.7E-05 TPR_21 unphh F Eukaryota T 7aoi 33 GA BH A0A3L6KXN2_9TRYP mL74 PFKGLPKPKRQFGNEAAEVIWPYALLLERVVKVHPFTKSIYVYYAQRQSTARGKLAAEIARSFAREFLIPITFHNSQVYTEAEMLLEYSETPWVVLHSLDNGQKPRILPVAPVEGTPAHTAVEQLLAEVVQGCEALGASVADPVTATRVLNERPLQNQYVRVDYQWFGDTPDERASHLVRWEFEPEQIEPKIRHRTRHVLDWLNYDGNLPTHRAVHVNAMREKAR 225 T 0.012 L51_S25_CI-B8 pdbhh F Eukaryota T 7aoi 35 IA BJ A0A3L6KX00_9TRYP mL76 LEEETIRERVIYQVVEGVFRLSPTSADRRELRSVANIIDYVLTHVRAARPTDRERRQERPITSAALAVMQKCPIQPQLGFVHALPHDTRDALLQEWERMHHLDWQFGKAVYTPRSKENVRGNLTWLREDRHYDQRMKFMQEVESGEARAKHMKLIAEAAGN 161 T 0.17 Gluconate_2-dh3 pdbpercent F Eukaryota T 7aoi 38 LA BN C9ZQF0_TRYB9 mL80 WQSGVHTPHGVVYRGAKMKNWPEQRIPENFKFTEEQRFRTKAIPRDVGTIPRNFVLGVLYRHQPCEVGGLWEHCTNDPEIVLDSKRHLREVLKQAREEGFVTFERDAISNEWLCFLTRERYEEVQRIVTAKSEAVDTHSGLRGAAATETSTYAEKFREMNVEAKEAHARRLEEEVANTTRYLRRFQQREIDYLPYTDLNGKVNFMWWYETRDVQ 214 T 0.12 SHR3_chaperone pdb F Eukaryota T 7aoi 39 MA BO A0A3L6KS29_9TRYP mL81 YWPYNEDFVPEGAETSRFQSSGSPGTRRRVLQEYALSPLFGARVPCCVVGTLRTAKEIIVLKRDIQSLLCELMELPQGSVTLGPLQQLREMMLYRHGTPLSPRLGDERQLNMDAYARRPIAARTMMDVFLAEDMSLDEIVNFGRLSLGKLQIALNNLRKENAEKSSADCEDGSAVEPAQLLVDRMGISYCEIPSLDESDYVFAEVGGVAVTDEEAERVAQRWAERCE 227 T 0.081 MCR_gamma unppercent F Eukaryota T 7aoi 41 OA BR A0A3L6L538_9TRYP mL84 MAPELQLEYMPVLFTRTILGPQGGFAGEERLVKLEVARKYMEAGHAVTPTEELRRGLWCYNPDTDKYDCFIERNEEFLDFAARKRQWLDVYWRVNTGYLLFGRQSWGQGFLINCPLRKRDVAQKLWEQYKVRIDPRLIEFREKDRRTGIQELGHNWCWLYLPGAEELGINREVYDNKRVKVRIHVRKMNSMFALY 195 T 2.2 Ribosomal_L9_C pdbhh F Eukaryota T 7aoi 43 QA BT A0A3L6L8W0_9TRYP mL86 GFTMKYKKGTGLWDEDHVNDYKTNRYLSARATMRWYQEMERHQTRNSLNARRATQSHNNNRGLHHTGRGAFERELERRGVQVEKYPLTTTTGTMRVAELVILRRMELEKRAEEALAEQRGELQKKNPTPSEWYDESKGPLNPNFLRSMRSHYEVDIANLPDTPLIRG 167 T 2.7 DUF2663 pdbhh F Eukaryota T 7aoi 44 RA BU A0A3L6KX50_9TRYP mL87 ESRHLPVLPMTPRIVFEHANEKRIDTAKRMRRDRRRIEELKTLEFWGWYMKLQRVRGRWCREQGVSSRGVYGPAVDAAELWG 82 T 0.35 Crl pdbhh F Eukaryota T 7aoi 45 SA BW Q57WW5_TRYB2 mL89 SGAFGCGSYRSVVAGTQNVPRRMTFYPSAYELIQLHKAHREVIRHFYVRDKIFDNKFPGNALANGLFKFVPNRRENYHMRELMESIRRRSIWMHRIKQQREINAKVVENMEVKYGKKAAASMLCFTTPDSNAYFAPHRYQDVANSWPNYWQHPSVNHVVPKPRWRRHRELGGITRVEDPFAVQASDY 187 T 23 Babuvirus_MP pdbhh F Eukaryota T 7aoi 48 VA Ba D0A4T0_TRYB9 mL93 QGMFSHQLKRLLQKKSIHRYNWDPLPMYDPRKLVHASRHMDVETWREVPDPHWDERSYLVPDQMFYNIPVPPEYKDAYWWRELQARRVQCPVEWVSHRMYNKGDRQRYDFQDLAFRKKFEFSYEEVVKNAKDMRS 135 T 7.9 Pox_VP8_L4R unphh F Eukaryota T 7aoi 50 XA Bc A0A3L6L276_9TRYP mL95 DSFKEHYHRVHLPRRLALQRYARQQSLRNAAKGNVKAEEVPYKYNRWWVNEEHEFVHQYAFVEDPEVTKAKRETLPPVTRENIWKEPQQTFFLPFAPYVRVVDYPKDPDAKFLKPVNIPRWKDYMQRTKPVIPRTWY 137 T 1.4 DUF4653 pdbhh F Eukaryota T 7aoi 51 YA Bf A0A3L6L4A5_9TRYP mL98 LSVRTEDFFSKEAISHARRVSWAPHTTEKKQGAFAKLARSNFGDPLPSSFAQEPYFEEEIEAHRKHHRPDVYIYKYNVSPTHFSLR 86 T 0.053 DUF2975 unppssm F Eukaryota T 7aoi 52 ZA Bg A0A3L6LDF0_9TRYP mL99 GRFNMDEAAAALQLNPAYAAALYRPLNYTFHIRGQLYPAQKGRPSRPGSLAASQGRMFPLYQRNDRLDKELFRLNSRGLTTE 82 T 0.98 Tenui_NS4 unphh F Eukaryota T 7aoi 53 AB Bh A0A3L6L2V1_9TRYP mL100,mL100 ALFSCFRCGYMYEFAVSNSYCRKLTLRNDHCPRCDQLTLFRFMSVSGMVGNMPFKPIGVPGPSYATLWWRKTREGKEASAPLDAVCKSDRWX 92 T 0.0035 DUF1178 pdbhh F Eukaryota T 7aoi 65 MB XA Q387S8_TRYB2 mt-LAF7,mt-LAF7 NIKGGVGSFLMRRAAPKSIRQKYQTGPQFYKRKFFQFQKGHHRLHRRISGVQTGSPTHQREYERFHHLPGDVRTRPQFDFTFGETRADRVMFAWRKRGDLQLYQMSGRGETFVCYRCGYPVRSQLVAVKADNWDYRMCYRCYTNTVHRGMENDTX 155 T 0.023 Ring_hydroxyl_B pdbpercent F Eukaryota T 7aoi 69 RB XF A0A3L6LB16_9TRYP mt-LAF15_1 QSYLNAVVVSNQVLRAADDVLIALSIGEMEAVRQTHGNLIDCVAALDASLQQTTENEEGGGGNGATQEVDCLSTWPLFTTIQFLVEEGGLPLGPFPRMSRAYYRLKESTPVVAHSQLVWRTFELSRGPEGPTGELPAWPHRGFLRDIQRQIAEYTTDPPERIMAGVTGEKGPLRARVSGARLGLQRTPARIPWTMQGL 198 T 0.42 Hemerythrin unppssm F Eukaryota T 7aoi 71 TB XH mt-LAF8 KSVFDIARSHVTFPVSRDGTALRRVLKDWLDYTECQSLQAKPAFPAELCITVHPSYTKRVRLLLWSAVLPWEVQRGLSMSVLIIPGMLFVMSPESAAQGLCFWSGAIRQPIDIAFIAPVEPPAADTPSFSELRQRRLQLSLEGYDISRFFPDGELESPTVTFAVQSHSYLDPFPDCEQRDQQVGCGSSRERNKGIEDSRRYTATPEGVGRGNENVRYVLETRRNLLRDSIRSALRECRCTHGGVVWASNTGCGADGNPTGTGECDVEVTISLTLSDELKEDLREKARLYTNYVVPLEGHVRRHIKCLSGISPHPKGDATDGSDALQQKGTEAVWGEEGCVCTNGSAPFVAPPPIIKPPLPVKVGTLASTRPRSPMLADEAEGRPTRLAPSVFGRHDAPALQRAQQECNQLISASALARIPNTSPRAPEIPPIDYEIFDLCLRLGLCQSEAIYYFYGRIMREWSKELRRLRAAKSHGEGGVNDGNVMVLREEDVHRMLRLVHDPSLQVPPELSACVEAVASLRKITNEVGVPVV 533 T 0.0043 DUF192 pdbpssm F T 7aoi 72 UB XI Q57U79_TRYB2 mt-LAF14 LKLSPDRTRNEEIQDRQNAFVWSDEHIFRPHQHFTHDPCSWSRSLEQSMKKQRKLSMVERLRSLEQRQLEEKQSASATAGGSSKCANHMDGEKAEGPRFYGAVGDSEDLKEYVANEDYFYTMQQEEKPNDPPLQELVDEVQSLHVLLSSPRYEDTPLATVERLQCAYSEALRCVFDRVRNASVGKTMSCNALLFSWSLLLQGLPALLESLAEKRTEECLVRALSTVHEALNIVLQEFNRITHSKERVELLPLEGWIESLDVVTHPLTNKDYTSLKGNIRLPESSFKPQCKLDSATVEFVHSRAIQAAAIRMIENDQSDVETEPLDPYHLYILLRCMVRLAEKGVNDSHIHRAALLTGMVGERIFSSLERTVAPPRRYSLRHALLGKQLRDASKPHAIPLDVCAPPGGVKKPPTAADDVLLLTRACTLLMNVATNVLPQTKFKVLETVDTVLKTLSYAPNYDLSTADTVIFSNMVLEELHHVDEASATDRHLRVLLLLSRLRLSMCADRSALSHLLSCLCNLLPPHSIQQDKLREWKRLRGLVMRHLLYSVRGEEVEQHYTRVLKSSETWVEHLAFGQYSGGLPLSLWLEACHIYLTAGRKLTVSCAEALITLRGRCKDGGVLRSSNSAGVCPLDFVSVTLLAQLLEVVSHGCCSADDLVASPVAWDKVRQTIQGAIGEDENTIQLLRAGRLCVADRQATGSLV 703 T 1.1 DUF4048 unppssm F Eukaryota T 7aoi 76 YB XN Q57YY3_TRYB2 mt-LAF12 CSPFLSSLLSPVETVPLHDVTRTYSTMDVVDPPARYNPMVPNVEPSSSSAGHMEQMLENEEEEGPVACAHKNGKLWGVFEGSEDNKPPAWFYRLCKDLFYRTNSEDNMDDAALVSDIEPSHYISSTENLHIDGCDTTQRSAEAGTDVRDGVDPYVWIPFNLLDEADYHVGPYRFPSTATYTHEQRTLLCLGDTRREYVHFCDSYAFPGRAQIPTSVGTCPSKLYVNPKQQQPVVYIQLSNDIPPAMWLPVKGTAASVRRVLAEFASMAALHRDWHHDEFMERHATAVRMLELQRLPAGEGDILRYMAYDARNAQFAFAPIREFPNQQEFFLGEHDDPEKLMEHVDLCPLLFAIPHMRTVVDLHAEHMIPTIAGPGVATSLYRCIYSKALLFVQVHLSSEVKLPPQDPEAFKFMWKDSQVLPKMRIPVFVRVVWPTNERMSGGGGLLRRFNRLFGTEFASDIPVDAAMALLYVMQWSGHIKDFLGVRGMRQRLADLLLASQQPEPTKLYPGTREIPNPEYTVAERLGMHVQYLAQLHDPDISLTIQRLLPVASAPVRMGCAKAALIAGDRELFRHIVSSEPPGRMQTYMTKLVRKRKTRDLVDAEPRLLEDQYEFAAPLWT 620 T 11 RNF152_C unphh F Eukaryota T 7aoi 81 DC XS Q4GZ80_TRYB2 mL101 KEYRLTVPYRSEVTMLRLANHKAINSNIRELFKKPLVMNNIKAIPRDLGEIPRDYVLRLLFFHQPIRLVDLWTICKEHDDVPLDSAKHLRLVLKIAKLQRWVYAEKNQTNNLYYYYVHQSRIQEVQQMVRASEVRKKEQESVREIEAEKLRMEEQERRKVALDENIVALQNALVSNIAQIQE 182 T 0.00067 DUF2514 pdbpercent F Eukaryota T 7aoi 82 EC XT A0A3L6KWY9_9TRYP mt-LAF19 LVKRHKITNNQMLLMRRREPYKPTMKDRQEIADRAKLEEFERKNADGLMFVPEKALPPWQKSLAHNAKALGSRINFRGFRVRVADGQDEPGFPTPFR 97 T 0.44 Ribosomal_S6e pdbpercent F Eukaryota T 7aor 2 B s Q4CTU7_TRYCC mS33 MALRGVPIRLGVHRVGYTHPSTLPVPCAQRWDLRLARARIFQEYIEEKAPGAWQLEDERSMSPEFKTFTGYPMREMRPGYGQNLPDFIMKKRLPNNTHYELFARRDIPNEDNAMYGKYLYDMTVHGTSLPSTYRMHKDINKAQRNDRKLSGNRFRVLCSSGAKKPPSGWEPIPDATEEEE 180 T 0.97 Nmad3 pdbhh F Eukaryota T 7aor 3 C r Q4DW24_TRYCC mS29 MRSRLVGKSLASLYVRPPVTCYTDACEAPVAMWNGAIPLKEVRVMQKGVPVRYVTKLYSHPLEASPTRLSFNDINSMYCVGNDELMQFFPEGLGGKVMQLMPPGHPRGFLYRKEAHLLNLFIDKIQHWQAKRNVLSSLTNNRPGFIIDGPKGCGKSALMCQVVHYARSRNLLTLYVPNAKEWTHGEWCWPSTILPGFFDAPDAARFFLRYFAKANRSTLLSWRLKCTPNDLPVEQGERQPQNLYELCEWGHQVVAPASIDRQSVCVKFLMDELSAEKKLPIVIVVDGWNLFSHDTHFRYPHPDFLRTLASLNDDSTDIDLYPQELPRIPASRLGFVRGLNKMILSKDEPNKFFFTCTTRDFKPFDGISGFPDVETDRFTNSLDEYAPYDAEKDSLFHPIQLGNFDEYEFRAFTRFLVNSGELAGLGWGPLWHFSSDFERKLYKIGFLSNRNPQGVIDHYHQELVWRYEYQRTRQKQYLLHRNMELVVSKRKNRHAEPKGG 500 T 3.3E-10 DAP3 pdbpercent F Eukaryota T 7aor 4 D n Q4D583_TRYCC uS19 MLRRCCPATLHVAPSTAMVAGVFVNNQKRFLKMAKSAFGFYLARRGQRKFPFLRRPHIKNTHAMNLSAPYFWSFMTAKSQTYFLPEENYITGDWTGKFFVSKLQVYTLQHATSGSTVRVKSFPSVFELSSPSRWNIGKELNTLTKPRMDLIDEQMLTKKQRLDYVKAGLLPK 172 T 4.3 LIN52 pdbhh F Eukaryota T 7aor 5 E h Q4DRG2_TRYCC uS14m MLRLSVGFLMRHIGQDVPKRHTHFVLESRLMYEKSFRDSWLHSVCRAVSQIDEPLSKTISGTRQKMLQRKVTCFQYNQYGLFKVPYYRLANVDRYHAVQGVPGTREWVPYANVSYWTMNKMVRSGNLLVHRVHYTGWGTDPHLKKGGWEHRWNKVMQRNALQYSRI 166 T 12 Phage_Cox pdbhh F Eukaryota T 7aor 7 G az Q4DX04_TRYCC mS72 MFGTTRVWRNTFLTKSVATPPISVIRTGPKWWADPERMVRQKLMYFTLGVDQLPLRRTAVIQKDLHRFHMCKPPPRIGDTTGYKRSRAAQLTTWYRRIQYQEYHLQHLFTRHVWGLVRAYPGNTTKIQGKADDGYVGYDSVPYHRYNRTPLPFPAREIYGRRE 163 T 41 MBDa pdbhh F Eukaryota T 7aor 8 H ay Q4DX19_TRYCC mS71 MYCYYLHTSPASNENDVCHSASAKSNFIFPSFSSLVGFIVFFLWKRQMPLFHRLFVSGADLRGCHTALSSTFTQRRYWAKPKKRPKVGQGFHEKAQKWRDEFLLDRHRILADSLRAYVEFSASKRTEPWDTRFRPFDRVEKDGVYVLMRHLMEDKFQLCNYHHRPVKRLFCNVGLLGPQVTTKARWKPYRYATNPANTSKAERIFQKDKTLYTHGHND 218 T 0.3 Tox-MPTase5 pdbpercent F Eukaryota T 7aor 9 I ax Q4DLH4_TRYCC mS70 MRRTIAALTATPERFSILGTTHPKPKRTGFGRNNKMRSKPSDNVAWYDKGPVEWLPRPVRLTYDHLDQLRDWMMRETLDGKTEEFNRIRDMHREWSQHPLMPVLGDVEPKFPLNLFKQNHRAKRRFLVRWHKANTPANWLWMPRGPTVVTPLHHTNPSQYPESWRQMVRKKK 172 T 12 THDPS_M pdbhh F Eukaryota T 7aor 10 J aw Q4D0Q8_TRYCC mS69 MQRAGCGIVRPGRGCHTTPLYCSLATISTGVFDHLPFQHRRQHAFNTLPLHDANHFGGRTAYLREIGPVNIKKSGRRFKKDLRTVQFNVDIWCAQQTLRKRWKQRDWEVIEVPFRLAPAEQQRVIPEMYTDVPPMTDPERHDFSNIRNKVYDREELQGVLFGASGPLPYPPLQRIDRQAMTLDKFL 186 T 0.076 DUF4993 pdbpercent F Eukaryota T 7aor 12 L ak Q4DTE7_TRYCC mS58 MSFRYTNGLVGALKHRMMLESSHRELVRRRFTGHCRGVEVVCSGYGTVLAVRLVDKTVWEPFYRKGHPSPSGADMDSAPPSSHAEGTPVGGQSAAPLDFERIAESIKAALWDATRKIRSAKEAALNRSLSHNQQLRAQAHLEHWYDEDANTLQPLAFEALKHEAATPWMQFVQFGKYKHAAAVMHSESGGKTSEAFTGETKRAGADNEEGPCVTALDEKDVDPTSIPIGSVHPLFLPALIQFESRVDNSLNDDAIRQEQRREMSRDEQLFWERVELIRKGQVATIKGGHKRDYADEAAVASDNAVDKVQLRFTQ 314 T 0.00033 YbaB_DNA_bd pdbhh F Eukaryota T 7aor 13 M aj Q4DA51_TRYCC mS57 MLRRTSWRAVGYTPVNPDTSPMLAYSQYHWHYNLPQGMERPHGVNRTMTAPYQSAHSLVNKYRGVWIELDMHPAFRVALEPQLRKLPQGRTIPKTSVDEVISDYINTAHLIQDEMTRDLWLAKVLQHCAFQRSNEGMALWEKYCHSRFIADGATATPPLPLVKAILFYCSKIDYQGWSSIFQKCLKNDWNYTPLFDTAQWNFLLKSVGRMGDEKGVRLILEEMLDVQADLDRVEARSIVIALNAVTDNDIYEYIKKYLFNFGERKVKFLRIIYSDLRGHGAGKLRIPLKENDKMFYHVCWHSSIRAPRQFSPRQLYFDYTPSTLGASVHNPNAKIDDIVKDKIEKWKTEGLLPEDYVHEDRVYDRGTAFKNVARQEKWKKMPRIVKSKKMGYTGDP 396 T 1.4 RPM2 pdbhh F Eukaryota T 7aor 14 N ag Q4DWR5_TRYCC mS55 MLSQNVAKTTVPSYYMIRTNLPQRKPQNQWEGVYYFGGITKRQRHLILLQRKREREARMRAFSASCSNLLRLLEGDTQEQQQAKTQTIQLSSPHGPFDLAIRLAQHGLYQQASRIVDELHQQRALRMSHYGLLIDALSAPCLGQRILYGSAQCDPALTYKLLGDENGEERAQEAHRWFDMAFALLTTECRMSGSEHRLPQATAAATHLVNALMRALLTCGYTHVSAVPDAVYDRMGLMGISPTISTYELVMLALSLQGNMKEAESVFSFLRRHHNEHVTIGSFNALLLGHRECRQFDRCDAIWQELVDRRWPRASTLTAELYLRSIVDHSYTPTSGPLQRFGNINVVEKKKIPLVLAQMDDLGIPRAHLSRPLMDEVEDALRKFHIYKSRYYEWGRAVKQFNFIEFRRRNGWMYDLHLMKNTTKQVGPLRDFNQPDATQAPVATVEIPAFFNERPAWEQPPLEETLYVTESRERYDDVRSGDIYEDRTRSLHDRSPTWMNEVPETRYDHLYGVNHPDIAKIGIRRHLNAEYVNRKEVVERDAALMKKNLSTGRRLRRKVESSRTHRNAGSMSGAAPASASR 581 T 0.025 PPR_1 pdbpssm F Eukaryota T 7aor 16 P ae Q4D651_TRYCC mS53 MSVTGVFSKGRGIGHAAVTSILRYIPRARVPWQPSRFGRENLSASDLAVLWSRGRYRDGPGNYNSGYHTEKTHVLEDNTVTMIPKHELEKYMPDISIGPKALVTPVSLMSARNGHRVTHDLLHSYDPHIGRLDKPAVVDHDNITVEDPNRVGLNAATLDCRGRIYRWLRRGPFFQEDHYFRRSLRLNRDGTVPTAAHEAPLMRKIVRLAQRGHLKAACEEYRRVTTVPPVEVYRALTACCIPGGLIADAVAIFEDGNSKLFYVARDGEVLHNVMRCAIKAKNRVRVMWVYNVMRGRYYENVIVRAEIDPIWRYRIALLALEYFLDHNCAEEAGTVYSYLVEEDLLQCDVHLRVGLHMREALSKGKSVGLSDEVLRATSLVTDVATVAPEVARELYQRHVEALRENEKSNDCDMNTKNDGATGCAWSAHGLLTALDFTQKDDALPWMQQNFGDVDVASVLRWARFYHSKDLMAKDRPRYLARAVAWIELLSKRSHMMEEAPLTYMRKSKPLSLNTNSNLRVAWQTPVARPDGPPRLLAREEGYTFHHNEHSRFVTETYRHPGETLQSRFLAMQPIHTEVSAKEDFQEIYAQQQEQRALPSGVVSPTARILHHSVTSEIHGGGGGQHHRSMSRAESSTGGAVKKEKLNLVSPHVAAEKRGGSGTGNIGGTAAGGGVTPEF 678 T 0.0011 PPR_long pdbhh F Eukaryota T 7aor 17 Q l Q4E2R4_TRYCC mS52 MRRRTVISFGYVRVAASAVGVIGMGANHRHHYQQQQRPITTRGQFNPVHDFTYAMERGVRARDEKTFEKLITNPGPLRIAYSPDYLDWLYRCYKAKGKYMDARAAAEKKFNGNIISGGAVTTSSGEMIHPLPGAPPPGMFLRPPNSFRRLSGEMKRKHAQETLDEVSKAQGMLDLFERQPQFPAIHIDRCTRFHLVELFKEMVLERSLEAVAIWDKALLYRAILSERKASYPASFRYIFKAVEDTVFAHSSVNCPSLEAYYYFLYLVKKYYIDNAVEAHVVLRCHREPNATDLLFSNPPPKDEVDVRNAIEALQSAEATSHAASSATSGTAKHDDPAATDEKHTRGDNDQKDSANANAKHPCQFAPPSSYPPIEALWRCEENVPLLEILLFGEFNLIVSENPFVKFPTAHAFLTRPYSTESSKGPIDGASLANVIAEKRGHLLPSFPMNVASAIDGRAQELRRLQQKHHRDDTVSFQTLLRSTHVDDNPSTFSSYSDWSYFNPRAVRAEERDRLTRKGIDALKEYDSATEDIYRRSFEDAQASNFQRVTEAWNTFPPYLPTLPHFVSIIKKDSHISFLLHVGLPERCSSAEAAAKHKEFERRIYQLARALYHTALEFHKETVRRVNRQKVNVAASLLDNFFEQEWVAMLRESESLENSLEQGAWPDKKTDMARRLGRYIPFARRSLDENGFPTDARADDYARWMEAPARMKGAA 714 T 0.16 CM_2 pdbpercent F Eukaryota T 7aor 18 R ac Q4CW80_TRYCC mS50 MMRARRVVVALSPLAQLCVHVQWRLYTPIWQPDPAVDHVAPLRESDENRTLWASSAPIANVSDAIAAWIRFGNDPVLHTALPVIHAGQNERTRTDGSSASLSLSSLPSPSSTSPFATVEDYMGTNMVFGSPEHVKDSAAVWASYFERRYLSQLRHSRRTAANHVGLVNAPDVFTDEADRPETKWSQDTRFRERAYMAEKFLKEKVANLQQLEQALKQAKPAEYIAFHDALQQQTLTLIPLPSPSVWHYGGARRTQWAERFLPLSHEAQQFFTTVLAEDLKRAGDAPEKVLQKVAAVFAEVGKILLQRHRRCLGGREWSALAPHEKDEFCMKEVERWKQQVEVGEFDPPLDGDDDPTSTEWQSEHDAIMQLMTATIDGLSFSALEFWTHTIRCEEMETEHIHTEKRVRAISAAARRAMYDTTSYEAVLQGIVDAVAKGQLDMKAAGFKPHMNDIWCQLNYAKFGASTVTQHTTTARRQLNYFHAGLLKEVAATAALYYATKPLSSSLDYASPYKFRRSLVGLFSTYGVEMVYAVQRPLLFSAANLAKAEDLIRGVVKNVARPFGERRRAKLKQLRANHRRLATPVQGVVVSAVVSDLLESGADVSEAKKAEKMQESVTFWPLGARRVVSYDWPTPHFDALKRRVAAAGSAVTAQSTKEIQEIKRNAFVEVSLWRRVTAEETKQRRDAVEEETRRVADVVRTIPPLAQVQQYATSLYQRIEDAAPFPAATDNNAKSEQEDDESSWEFVVMLDDRVVLNANQAAELYLPYTDASGVPIPQGECRVRVRGFDVDVNPTLNPAFCSEAFSTPFQVFDAIPQLVQQFFGTAKPSVAEVSDIPSSKFIQFCAFLREAGLDVPVQCEFEAGQVLNAEGDVFMEYFLNLLRSDRFHRSCAQAGLTEMQRVIESSCRAHWEVHHPGANEAEWAEARRRVLDRAMEKEREWWFPNEMLDVTNMSPGSNHGLRLPMYPATVRYGRELCTLLAAEGQFDNNSGLSATCAVNGTGAAESITFSTGDHISSTFSMEEALAVAKGALRNAHDRQNTLAAFRLGPLSKHSQVLLFCGINATEFGGKYARTYTYAFEKAKKELAETFVSGRVVPGVDEDELLRVSDKEGVDRFASSTHPEQRKTQFVPRVGPGGAPIEDPTADQKTQWGR 1152 T 0.31 MIase pdbpercent F Eukaryota T 7aor 19 S ab Q4DNX8_TRYCC mS49 MIRRRLCFVSRPTKAASISVTLFSVQRQKGGLHTFIRDARSSSFTTPRQASHAEGEHTSSSLNSTDWATQMQRELFGETDPLGGQAHKDYYRDPARGYSPQYAPRNFAEGGAISYHHAQSPMEYAEATHRRSWLDHDVARMEAAFQEQRALLRGMESATERDELARRYAAEHHVADIVVENQSLLPSTQVHHSTSTSGSALRQQAVVDRFQIADQQSPLATSDGMGREELAHTYRMRSETVHNDWIEENLRIVHGLREKEKYDFTVLQRATRIPFQGYDMDRFLAQQKGTPYGAQSLPPNTASSTMEEAQRTLRDPTATVPSFEAISQKAFARNTVRDHPTTGEELTQEVVDTIRTSREASEWQREQERAQRFGLGRQGALVQDGGPDKRTLKKHVNDERIMDAMFFRSDAYRKTQTDEHWNPYMRQDTTHGVAHLLNNKFDIARREDRLSKGEQDLTERSVMHFGVPIQQTIDEFVFRHRNARGERPLDYFKPFPGFRDFRLNRMYRDVEGFSLMKQRPEFLEWELFTRYRAHHQQRRRIALLHGLEPVANETAQERDARREKLDEICERTPFDERELHTNDDEMQVSGETLRSWFGVYMLPSPTVVEAVVGASASVNLHLFPLADEMGTADTRENVLSSRYFNRLLLMEGFQNRISRAFMGNVSGKAPEPVVQYMQPPEVLRHFTAEERAMYEQYVKEQTSKQLGEWATAMRRRRWIPDRQQYGHVVAQGYGVSVVDLEHADTAAVLTVSAKAFERELAAAKGNTSHIIMVEGQAYKLRPDSERFVVPLSVRLESGEVLDMTDEAFGRYELELLPRNVNHALNYGIGDYAYNRGNYIETQDVIWEEQTASGEEGWSPATHADGLRAGLPVRARRHVGMNANGSRIVSSPQRAVIVAYDRQPFFNPEPRLVRVAFQSDGSVEEVPLANIMIWQRRYHGPERTVGDESRRFSPASLRRYIDVSDPFNEKKSKGEHFLDKYEAARTSEVAAGKYRTTKQITEIDQWTRFDVSRADNFRPLSISHRRDYIRLGYMHRYTPWEWIAVQEADQPLIAEQIRQDNIGTSYFFSLNRYWRYKARPHGYIRHFDNEVRDLFQFVDGVTPWKQAQKIRTYWEVRAHHPMPQFNRPEVAMHRNTVGLLPAHMWETDKKTGKVKAVKDSVRDYQTKTPLPKWVQL 1175 T 22 UL11 pdbhh F Eukaryota T 7aor 20 T ad Q4DV41_TRYCC mS51 MFLRTHVERHRSGKARAFVFRDPTLKMMRAGSGYQQLRRMGMPIQVSKGWRKVDHFHANNQYQHAWPLLSHDDLGNSDQSNNTRNIMYSMYLPKRNKGTAPWFRGADTYSVKYCEQGRYEYQRYLMINRFPSEYRKHFMNFLSNIRSSSGPATIPQEALHWLLRMIVDNFNPQHVHYIAAMKTLQNAGELDMARDVWKIMERQQTWPCTSTICAYLDVCVEAGEKTWAMEAWNRYCTELKFLQPGEVDPKPVSRVPFSLTREELLYLPKWKKHFDHDPNLDVVDLNRFNRTREVYLRMAQVMLAGGERDSFQHFYTKLEEAMLSTPTPVPEPPNPHLVRRPQWSPYEHCKSVHHSPWRVGNNGRAMALGPSLTTEDEMQSRFFSNDQFLVHMLKEILRIVLQEHRRRHPEACSRGEGEAFFDQVVDARETLNFCNELIERLFAVLGQKMHGLNTSSLLSVILELYRVMGKETGMALLRRANQFLERKAALEDGAKESLTAPNYLQVLMGFADESAYVYDSKRKGLCRYRSGFDPRTTMQQLAATVQEIAGNPHVTWAADMHLQVVCTMVGCGTMKANDYFVRNVLRQFCWDSRFLEALYMEYRRHDDVDMWAELTKRALVWTARYNVNASERLKRLIEDDYDTIQVHTRTFRELAVFQFRDVEEKRHSRDVVNELPNPWTDYVSHALPFPDRDAGYPDEYGDIGQWRAPGGPGSPVKGPGYYAPPMEGEHQRGYTAEWRDLKNPMRPPEFPTPWERKYKQYARGQHPSYDMVYAGPMPEIFPNRYDFRKPTRWDFHDIEKQGKYKTSGPY 810 T 0.0043 RPM2 pdbhh F Eukaryota T 7aor 21 U m Q4E4E0_TRYCC bS18m MNRMGGSVYANAMAQFAICRQPWNEYINLLTKQDSTPYHVEPQEKPAYRGRKRGREGWLFGQQVQLHYHRFPDEQLLTNLTRWRTGETVGDIALQQFRNAQPFDIEDKDPQGMQRPSPEVYMKLNYKNPATISRFLTRTGHMYPADILPLNPEAVAKLRVAKAQAVRIGLYPRFGNPFWFRSQKFRPKAYQENYDPTTYSTKHTMEHFAYNWVQTDRIRRYFKELEELQKNASNGARGGSATTAEQKQQNQFYAPENQPISMHRNNISYMAEVERSMKNPTVPGLMSTKGMKKKFHNLYSSTSTKRMGFSNPTLGIKKV 319 T 1.2E-05 Ribosomal_S18 pdbhh F Eukaryota T 7aor 23 W f Q4CT44_TRYCC uS11m MRQSLVLLRRGKPRPRAGMFPDKYRRVPALLKPQQGGQQFFNQFLIRFTNDRLMRRDVEDGEDKKESKIAAQLPQMDWEHMSARSSSDAIREEMHRLVEGDAVQHQRVFNERIWYEEEERRRLQTGADAAAPTEDAGGAKEHDIPPRVLGNDYFQSRFGYSLVKQSEMPQGVTDYNQLDMWGEMPKYTRDMVFLYLISRRRNTYAVAYTYEGKRILSTYTAGNRGLKGGDRGFRSDGSTDNGHQVTSMYLNDLLPKVRELRANEGRPIGRGEKIELVVRVMGFYNGRQGAVRAVQDRANEFHVRYFEDITPFPLNGPKMPRGVFK 325 T 0.00013 Ribosomal_S11 pdbpssm F Eukaryota T 7aor 26 Z ba Q4D7F8_TRYCC mS73 MITELRTKGATRTPAIRYSYPAPPPSKNAAPQQPRGTPRPRTAKTNVRKPKRSDARRRARAGSGFGVVSPVRARRRRVVGPPFADFFGALFPATRGIAFLSDLPTHIFSSFLLSFLAGRGVAMLRMCRRLAMKYADLELTTRGEFPHGMKEPGFVKKLDQNIPWYFSTYRSMYHWPITGDNWSDLNEAEKHHDLHMFYTLAWWKLGEGIFDANDEDN 217 T 19 Mastoparan_2 pdbhh F Eukaryota T 7aor 30 DA aq Q4E0X6_TRYCC mS63 MLHGSLPSLASLRYRRPYWMLFLKDVDNWKIYTVIQQPDHQRTEMLYQAWLGGLDRPYTRPKCMANQPLWLSKKRHILRKDRLDGPETPLEKYVLEWHKRFHSFQGTERPTVDDLHTALDLVERPLDLSYAFQLLNQCRNVNNIRFAKDTFLVFLEACLRVDRKDCALYATENAEALGFWHIEEDYRRYLRGEQSWYRLSPLDNMYYPLEENVKLNAGRSPPSSAAVAEGDAEGTAETTGFEAAAGAAAWSDDEGSMKAGSSRESMTVDDEIARLEAELAALEEEGTDGDNDVKGPKGH 299 T 0.087 ABC_tran_CTD pdbpercent F Eukaryota T 7aor 31 EA ap Q4D014_TRYCC mS62 MRRFCFPVATFAQAALRHRGIRWNTTMADNESHTGAKSSASSSTPSEANEISAMERASEARREIHDLWMSTEKMLDLENRVRSVASLIEKYKLDPSTPRENDVSRGLGDAFDRLLLLCVPLGKDSSKGTDDLERLMNLAGRNGREISVRTIQHLFARTDSFSEALAVFYAMRRCHVAMNMEAYYAMLYSLQRLEEEGWAQRFREECEEKGGVSEQAMDFVVKGINNALLPENKPWLGRVMFGDRDAPAQRREARDYDELSAMWTERYRDGSAFPTSP 277 T 0.095 ECSIT pdbhh F Eukaryota T 7aor 32 FA ao Q4D7Y5_TRYCC mS61 MLRSTRPWRFRMKGGEMFVEYKIMSRDHRRSIRVEDAIVDPSVARTVVPLSWLEQLRSPSLRLHTGYHMEEAVYVPPAYAAVDEKEGRRLSEKSTMTPNAILAGPVVLSITGQSVPVVLNPYFVPDDTWGIRRNRDEWDLRLGMDAIEQCTLFSELRPGGLLYNKLPSSQNVTRHEPVRATLQRYGMKCGLAESPLVPRPWTRMRYMFIDELQRGPKLTEFVGHNPRNGTQWRFSQHSKYFRIGVWRETIRRNDMNEGLHGHSSWQKSPQQAVPEVRLMAPYP 283 T 6.2 AAA_11 pdbhh F Eukaryota T 7aor 33 GA ai Q4D4C7_TRYCC mS56 MRSIRGVCCFSSLYTTQFRVHATYDVAPLSHKELFSIYQNWDKTRDELDLLEEVEERISKWKLNKWEMRIPPLLTAREKELMRQQQELLKSIFFDWGKCRDALNKDLELISSITGLPKGTVREKNRAWLQEEAAKLRWVGEVSKATRLRDAFLRLEVYGSRDHRLLERLCCIYGLGLQGSFESAFSNYIVEDPITKKIYVDEKNSFRDLLAYIIHTYPQIDIIYDFLGFNFIGGYRSSLRRYLECMVSRSTEGEKIPGRLVFGRGKPAEILFDFGNSNESLVSGECTQGFPDFVFVKGSDMTLIIIASENSWLRNRQLPHRKQMEGIARRASFVLGIPFSEVRVRNLLLPPTYLDKDSIVRINEAVLGLSKEEQRNLAPWLEMYQKELDSKDVDFCSLMKSTNEEEWLTL 410 T 2.3 RB_B pdbhh F Eukaryota T 7aor 39 MA v Q4DRC8_TRYCC mS37 MKSSDIFHAYRYTPVFLKARQHDSGVNQYGLKPVNAYDFINPTNLVNFGRGTSFDNLGVRRAGRGEIDSSPSLGGSPVFTQAKLVGLSGEEQLTMCQSETMALRVCMARGGQDTCERESRALDACLSRVGHLRRAMSEACGEFNDWFIQNVSDNHTKPFQHRPHDWRHFYAQEKLVRERQQNGHAYGRRPKQFSFGARYVKTEGYGKRPRLPYNK 215 T 0.11 Gypsy pdbpercent F Eukaryota T 7aor 40 NA bb Q4DMI0_TRYCC mS38 MRGCGVLCAGHKRAAVIATTTCSLGPPTLSPSLTLRPVACAATPVTIPIFPPPMRGSFIDRNPVWASFNEKHTAKSFRHRIVSSADVSLRPPQFYLSNEKVSAGEAAVVQKRAEATDSYGEQLDEISARWAAKFYGRVTFGPRNYPYPSSRWLARRFKMKKHRIIKRFRFRRYKLAAVANLPFAKMIRVGLLPELKSSKTKKGDDVEADLSGRLVSAVRSSGGKKKGNRGRPKSKYQI 238 T 22 DUF2996 pdbhh F Eukaryota T 7aor 42 PA al Q4DRU1_TRYCC mS59 MMRCCCVLQDKSMFAAKRRVIVPIHPTPNYPAHFIKASFTTDPLKEKQKARFSSGGEAMREVQMIPKNLEGERSRRELMSRGDTEFEALVEFIQGASYDQLISGRRFKKVYDKLSENDDTFVWLCHTAMSVLNPGDVRSRLVYNHLRTLAEAVANGEMTLRTAFRFYESAVRSPAYREIAKRQMEGGAATRLAGISAAADVMRRMGLTRRPMASYFELYQRIVERSEAMTPWGFPPLFQFEERLSLEPRLKFFSRASQQALERRRRGHIMSAYTTLQGRRIFWIPPTWNRAGRFLGPHVTLYPGMTPD 308 T 0.35 HLH pdb F Eukaryota T 7aor 43 QA q Q4DUA8_TRYCC mS26 MMRCSRGCRRQLRPYYNLPSKSDHGRKMTGFLTPYRHWMWKQNELWRNVHEAQFEHLRKVYKRQWLESFRVNADEYIYKYNITKAAQLAQWEYEMQAQEKKRLEAKQMTEGRQALKKKHLDLLREFHERQFFFWYERASERLQSMNLIQYIPQSRVQEHIERELDKYVAGKSEAYPLNFVGQMPLVEDREGNIVEVPEGLLTNHVSEHPESTVKPHQPHESTSVSVEEQLLRTMASAREESLEEWIDDSRALSETIDDISREEEQRDEDTRVARSMEETDNEREISRRMYIDRGKTGSKAIFRRPTLSETDGGSASIPVGGVAASPADTSAPMRRRKKGKLDKAHALQEQQDAMIARMSAKSLKDGESSISIVKRGEIATSRGRIRDKAAIPTQEVLMQKPELAAGSVPNARISFKDKVDQLYHRGKYKQKKEDDNNPNEDL 442 T 0.033 ThylakoidFormat pdbpercent F Eukaryota T 7aor 44 RA p Q4DRR8_TRYCC mS23 MRSTALRLYKMPKNMGVAPRFDVWNESYEPWQHMKRMGRLVGTGFYIPPEWYNHFRMFPPINHNFQQEKTLNPRNESEPTQDDTGSLSQERAALRDELARKSRLLASEGMRYYNIFWVRKPLDRMEKEYYELKRKGIAHSEAIKKVLEGFYKELAVKKRVAAVQAEEAKLSGRFITMREATVVLNVLAQLHREQLTPHQVTLLAREQHQATEKASGLTATVSRVSAPAAGIDADSTASKESGSDEALSADSLANMLEDDHASGAGTQYQVEVKHSARDSVRQLHEKSTDDTGSPDWYTGASPVYNGAA 308 T 0.049 MRP-S25 pdbhh F Eukaryota T 7aor 45 SA Ca Q4E4S6_TRYCC mS22 MFRRGLVHRRYPFNKRGPRERKSWKHHVLTDPPKPIQWRDPKVWTKDLTTMKSFDAPQWDLWQSRARSEDIDEALQPFMDMPQSLKDRRYDIPWWANPFGAWYLQNILSVELLKLPSRTNAEKVAIYRNQKHSLSSKKKGEAAQDDEILANIIKERWRTLEFGDRDAGYPCTFSDYIQFLNEWFKSLDEEGMQRLREHFDRRIRPLLAVMSPVDILWLEALTQNSPHNKEQLQRRIAFQTSLGTPEFFDMSKRLRYEINEDYKVRDELGPELFALWSKAPERWPPERLSKMYGLDFTLVRKILVWHHFKACYDACVEPDWTLPKRLFALEWIRDVRARKHGLFYGKMRFAEQKITFYSDRFLFRDLVNRREASYANVWEMDDPYRFLQTEQDYEDYWGDNYDVYRRMFPEMIGKSGEPVQQYGQMPVWTGPHRQHANKSQHNWMFAEIGVNVGHEALKKLELDPTNEKRRRFVIRQPDGTLRSAKMSEMRAWYWKEEWADFRFWAPNMEWGIENTPSQEQYQEHVPDTPDADFRKQRRIQSRPVKWFYESHYTRTGNFAGFQPLRFMQRRTEREVRWPDVINAAVQIQKRKPDAYIFKAIPEI 603 T 0.014 INCENP_N pdbpercent F Eukaryota T 7aor 46 TA g Q4DET1_TRYCC bS21m MLRITSACRGGYMMYHRKSMGTMKYSKWKGAHGGVSHFYGRTPMVEEVKRNEPITLIDRRIMHYVHRSRLRHFQLFRSYQQKSNATECKLREGEMLRRRWHRKLQKSFIAFMQFKTMKVLEDQAKLVNTYGQAAVNAALGDSWDAVSDKEKDRKYVTIRRQVKALPVVSVVPKHVATMKQIHNDRFNYRWRVN 193 T 0.072 HMG_box pdbpercent F Eukaryota T 7aor 47 UA j Q4D6Y0_TRYCC bS16m MLHFTSLFRARAIIKRRTPQLWGAPGAPIIRMRGHHVVWKFQSYDLFVEHTHKRRNSDARLLHYLGKHCPHPQKSLWSPDTPVAQDRHLFMLTTVDVDAFKYWFGVKRCRLSMRPWALLAKAGLLPPSLRQNSKIMPKPIFDKEQLMRYYLANRKEEATIEREDYLNYKNSLVKSEEERAAERPVAPYL 189 T 0.11 Ribosomal_S16 pdb F Eukaryota T 7aor 49 WA bd MAXICIRCLE UNASSIGNED READING FRAME 5 MTKKGATKILFIYKLSKLNVYNNESYKIKLLFNHLYCIDNYNSIYFNLNGILIWLNVLHINIILIKYAFLILLNNLEYLIIFKYNIISIK 90 T 150 Cytadhesin_P30 pdbhh F T 7aor 51 YA bc Q4D913_TRYCC mt-iF3 MKKMVFCKMCRPLLVFCATSCWRRSARLPLPFFFSPLNLQVAPFTSWNLQARYFSTTAGGGREGTNSSEDDYVFDPTLSVQKDAAIHVAKKSLDAIVRDLLPENAPDAATQKVRAYLQQHPMDTLITQPTVHITHVEDPESGRETKMSLSPCDLSEALEQAQEREMNLVQMGTRGDVAYCRIRREIPRILGLVGPELEALREEEKQEGSSHRGGSDQAGGKIRELVDHSFRDVVDAHFVGWKSKKIVEDIKKRHPVKITIKEFQSPEAAIGKIREMCQAMQRYAEEKLIYHHFTSIVANDREVSVSFVPSLPSEKGNSWKHIKYPGEKEWAHANKRMEEACRKSGRYGTYVKNNMLKPRSLGQTFFRVDKYGRKID 376 T 0.00073 mIF3 pdbhh F Eukaryota T 7aor 52 ZA aa Q4DVD2_TRYCC mS48 MYAHVVSWTLFLHVFFFFLSLSRTLFFFFFFFVCLFFPFLCVYLEEEMLRRLVSSHGCSNGGGTNGGRCVRPVKEEPPVFSSLVLQRRFSFKYATKLQHDEMRQPFYIHEKRHGIFSNEKNIRKSRRGLPFITPLYTRHMNLWETDTDASKNRFFRGYVFGQRELHQLLGRPHGFEANNTDGSNDISAYEMTTDQRYKGIPRPAITNLHYEPEWNYTLYRAGTHGSQLSNPRSPLTAEVLGDELMKIRDIKSFDHCKAWFDRLQYLIKLHYDAVGDIGEFKSRHTQHVHEFFVAFHDALSSFDFGDSYLFEQFHAARPSELTDLFGIFLEMEANYVHEDYCPRCSLPYSTTRYCGEGDANTPFRKHRGRWAPHQRWGREWYAVVARRAEALWYRATEDPYFGTPQHTQRQAEALLRVYVQTKQRGKAIDFMNALRGSKEFLLGSICITPEMQESYDRLLDTTPHPHLLTNGFTLESNAAKYTGEVQKVPFSPLQFRIDMEMNKYRRQQKEEGAVRVPPAMWRIDTSAIVPYKVDPKTKRVINWREVKEGIEKSFLSTGLPKEAYTGSEWREMLHLKSIIAGRAAKAAELERRLHTDKVKLLEVSSKSKIASETNTSTGSSNSSSIIFPDKDGYHVFDTSVASLRPFGVSQSGAVFQSVTHTYPSPHAVLYNDPVHGKQFILDTTNESCHLFGGFEHGDRLLIRARKIDKNDNNPNSVLAVKGNEFEVIVVGVNKEGSDSEWQLCAMHVDPALQREYGLVFLGTDCVDIHERWASVRYAAAAPHVKGRVTLLEERRTAREESLGQIVGVRDGVLFVQWRLLRGGGSEMDRSVAEPIGTVEQVRNAYQITEQGVEELMHPPSWRTPFRNDFAEERLEELRQAPFKRENWVSLIQGRYTPKLKRFGYTQHTTMDDFETKEYKDRLLSKQFFHNPQAFEVIPDRRDRAVTFGGKWEYQRTHGLPTVDRNELENGWSEVEAVTDAEMHVIEQALRDISGRRPGNFIKSPTKKNTLQLNESWWEPLEFGWEQHNKEQKALVDPTEQRLIDSASLPFGGKIPPFGTTIGIGERIREIAEDYAKGFGLGPHGHSPSHDTCQYNTLNAEEDRVRELGYKDALVRLFDEKMADKDVHQWAVEQCADGEADVRQLLLSLHEWRERGRPPSLMLLQVLSKYLEQEIAAFNEGVPSSVPKLSLQTVDGTLSPSGNSGERSGTIWADVEPTAYALQYASQANHSSLDEPFILQLLKSAQLGGRNAQFTDPFYNAYLENSVVSEFQLGLAALAGKGVSPSLLAQKISQLHRGSVRLSGNVIPFVKSRELAHLLERMGLSSENIAVVTRGLANCPEQESVGDDFAVPVSVILSWGGPGSGSSTNAADRKGNATQSRNELQRKGSAALSSAIRQLGQKRSSSKNWQNEDKMMVHVIEELALRDDGLVMDIQYIVRENRRNPVLRHEFFAALLPVFAGKHEKVAQLYDEYCEGKYVPNITLAIEAFIAFLCNVTKHADVYPGSSYFDVDTTNGPNAGQYISLKLLDPLDGPFIFDNIKAEHIETVERFKQHGIQVGPVRAPATGFIAANSKSLSYFTRRPEEVVYVSTDADQGLRRSLERSAHYKTIAASPAMQFLLHTQNGAGLVATFNRFFYRTMPMLSFYQRILKHYSDNVQPLRQKAQNSVRGLARVLENERSAAMEEFRRNSERYWRNVLEGRSVEQAMGGSGGSGGGGGGTTPPVSSSPSQSSQEAMAADVARAIGSGRTGDQKGGAARQQQQQQQRTAGDDGGVRTTFASRKGGSRSMTDLLSKLNKPKGSNTSGTAKGPTKRGNPKSHTDGGRGAKP 1827 T 11 DUF5053 pdbhh F Eukaryota T 7aor 55 CB as Q4D4G1_TRYCC mS65 MFGRSALCLAKRFRYNTKYPSLVSYNKLPWEILNHETPEFHMHVAPHYEQIMTLAASTHVPHIVGKKHLEMPPEHRLRLLPGMFYMLDGDSIPEGFTANRVLDPTALQYYGRLESLVAPVQAVRMLISDDLRIVCNSVTLQGPLQLPVASYASLASLEVVTNKASASFTLFHFVRPNRPPSELQLEKYYIHAPRAMALAEFNSTSNTSWEPKLQAPKRSKRVTPLPAYRPPQSYLMGLAERLAVVPGSSFGRRSLMWGHWF 261 T 32 PELOTA_1 pdbhh F Eukaryota T 7aqw 4 D c Q8VZT9_ARATH Transmembrane protein MGGGDHGHGAEGGDFRAKVWSMTGGPNCRPKHWRRNTAIAMFGVFLVCIPIAKLSAKLEQRPHMPVRPIPSQIWCKNFGTKDDYEKEH 88 T 1.8 Chordopox_A13L pdbhh F Eukaryota T 7aqw 9 I l NDUB8_ARATH NADH dehydrogenase [ubiquinone] 1 beta subcomplex subunit 8, mitochondrial MAGRLSGVASRIMGGNGVVARSVGSSLRQRAGMGLPVGKHIVPDKPLSVNDELMWDNGTAFPEPCIDRIADTVGKYEALAWLSGGLGFFVGLGLLAVLNDKASKVPFTPRVYPYDNLRVELGGEP 125 T 0.00028 NDUF_B8 pdbhh F Eukaryota T 7aqw 13 M p NDBAB_ARATH NADH dehydrogenase [ubiquinone] 1 beta subcomplex subunit 10-B MGRKKGLPEFEESAPDGFDPENPYKDPVAMVEMREHIVREKWIQIEKAKILREKVKWCYRVEGVNHYQKCRHLVQQYLDSTRGVGWGKDHRPISLHGPKPEAVEAE 106 T 0.00012 NDUFB10 pdb F Eukaryota T 7ar7 26 Z b Q9ZPY5_ARATH EXPRESSED PROTEIN,F11C10.23/F11C10.23,FIBER PVMEKLRMFVAQEPVVAASCLIGGVGLFLPAVVRPILDSL 40 T 0.0026 NADHdh_A3 pdb F Eukaryota T 7ar7 27 AA c Q8VZT9_ARATH Transmembrane protein GDFRAKVWSMTGGPNCRPKHWRRNTAIAMFGVFLVCIPIAKLSAKLEQRPHMPVRPIPSQIWCKNFGTKDDYEKEH 76 T 1.8 Chordopox_A13L unphh F Eukaryota T 7ar7 28 BA d Q94AL6_ARATH UNCHARACTERIZED PROTEIN AT4G20150 PISATMVGALLGLGTQMYSNALRKLPYMRHPWEHVVGMGLGAVFANQLVKWDVKLKEDLDVMLAKARAANERRYF 75 T 0.00096 NDUF_C2 unppercent F Eukaryota T 7ar7 35 IA l NDUB8_ARATH NADH dehydrogenase [ubiquinone] 1 beta subcomplex subunit 8, mitochondrial YEALAWLSGGLGFFVGLGLLAVLNDKASKVPFTPRVYPYDNLRVEL 46 T 0.00028 NDUF_B8 unphh F Eukaryota T 7ar7 36 JA m Q9SIQ8_ARATH AT2G31490,EXPRESSED PROTEIN,NEURONAL ACETYLCHOLINE RECEPTOR SUBUNIT ALPHA-5,UNCHARACTERIZED PROTEIN AT2G31490 METNKNKFIEDWGSARENLEHNFRWTRRNFALIGIFGIALPIIVYKGIVKDFHMQDEDAGRPHRKFL 67 T 0.006 NDUF_B4 pdbpercent F Eukaryota T 7ar7 41 OA r B14.5a PPIRRYVLTK 10 T 0.34 Gp45_2 pdbhh F T 7ar7 43 QA v UMP2_ARATH Uncharacterized protein At2g27730, mitochondrial PKVSEDKNRNYAVVAGVVAIVGSIGWYLKA 30 T 0.051 Gram_pos_anchor pdb F Eukaryota T 7ar7 44 RA x GCAL2_ARATH ATCAL2,GAMMA CAL2 PKSQVTPSPDRVKWDYRGQRQIIPLGQWLPKVAVDAYVAPNVVLAGQVTVWDGSSVWNGAVLRGDLNKITVGFCSNVQERCVVHAAWSSPTGLPAQTLIDRYVTVGAYSLLRSCTIEPECIIGQHSILMEGSLVETRSILEAGSVLPPGRRIPSGELWGGNPARFIRTLTNEETLEIPKLAVAINHLSGDYFSEFLPYSTIYLEVEKFKKSLGI 214 T 0.00013 Hexapep pdbpercent F Eukaryota T 7ar9 7 G N Q8LYW0_9CHLO ND2 MWENLWYLDILINVLIITIFGLISCSSSATKSYDLKGCFIISMVGGVYDIPSAILWCLASLSILNFNGFFASLFLVFTWISNLFAMQSLNFLGIYLAFEMQSLCLLVLGKITANENQRWFAYRGLLKYLVLSLIAGSIFIFHASSSYLQSGVMISDSLVTYVFLLFKLGVAPFHMYTLELFSVVSRHVAFVFSTLPKLSVLYLISNSNIGSECVWWGLISLWLGSISQYQSVFVRSILLYSSVAEIGLVLLVLQEGFSWEAFSWVSIYFLSLSGVWHANSKFVSAISVASIAGLPPFLGFIGKAQILKSLVSINLGILIFSSILAATISFIGYLRLIRLMYLVSPVKWKNNKDSSFINWSTWMLTVGTLPMVYSV 375 T 5.5E-09 Proton_antipo_M pdbpssm F Eukaryota T 7ar9 8 H O C1-FDX MALLRALAKPLRSLQAVSSVAQVSLRQFGAASHHDDHHDDHDHYTPPKTVFEDTITINVLDYDGKKHAVKALIGTPLNKALVEYGFSSTYFFPNMGYYTQHISDAHVFIPEEYWKYVENVDLKTDDAEAIKLMFKLVVQDYQRETSFFASYLTLNKEMDNMTIGFGPIKPWHITPKWSFNGHHNVKDRMFDRLETGPFIE 200 T 0.2 HEPN_DZIP3 pdbpssm F T 7ar9 15 O c KFYI MGGHDHHHPSVPEPPYAKYLANKSHYCPPDFHYSREIYAPYGGYFNDPKGWRTNTAIATLVMLAGAYAVFCFGNAREERLRAPKGWIPSQLWNDNVPTPVDYRGKVLKDE 110 T 2.3 Deltameth_res pdbhh F T 7ar9 16 P d B14.5b MGWEYAGTYGALCGMVYAIGSNVISGRAWFRRPWVHVTSVTLSYLGSKLLDEVQDTYYLEHLKRVERKGLQVTEEHKKLFSAY 83 T 0.00022 NDUF_C2 pdbhh F T 7ar9 20 T h NUOP4 MAGGNYASLKADTSMDHVFGDSTNKLNYDFQLMSSKEAFFWNYTLYPIVGFPIFLYLYQFNKLENFEAEIAAAKAAKAASE 81 T 0.058 DUF2517 pdbpercent F T 7ar9 21 U i NUOP5 MFFFEFLQGKISDSQKEVDSQAEWYAEYDKLEKARQKRRIWKWRDSDSRDEYAINAEEPVIYIRSSLFGRTEVDPTGKNTNRNHQYLYNLKVLGHKTYTRRDPNELQKAQAEVDTLSAAGRLGPLSPF 128 T 5.9 PTPlike_phytase pdbhh F T 7ar9 24 X l A0A7S0YPK2_9CHLO ASHI MSLTLGLRSLSRGALAARNALPKRAGAGGPVKLSPPVDKPLPYNYDFWMDNGIYPGQPIYDGMFGSMVGNMSLEYMAKGWLVLLPILSAPFVYEHCIADDVTRNPFVPRQYPSEVREFLALFKNGFVLNDYSQPDYEEMKRRQSGLLTPIY 151 T 0.39 NDUF_B8 pdbhh F Eukaryota T 7ar9 25 Y m A0A7S0Y945_9CHLO B15 MSALQAVKNLTTRMRPFAFQQIRKASNSSRIAGDTTGKYTPNIFSPETPMDRSFSHVPKNPFWEAWVFRRDNIQREFVWTWQTIFDLATFVGGLYVAMYATASFCSRQNDKRNGYPERNYYFSDSKSNFVIPDEREFY 138 T 0.00082 NDUF_B4 pdbhh F Eukaryota T 7ar9 28 BA p PDSW MTTATIEERRAFHKEVLDIVQSKLANKNSEWARPEEPILHNLKSEKEEPHVYYHNNNFRVTRQLIHFEKTKIFEDELDKCMRTHGEAKYRKCQEIAKRFQASCRVASNLERGPNARKRDVGFIYQNNKLRELEKDAKELGLNNPFPPSSPRTTIGY 156 T 0.00011 NDUFB10 pdbhh F T 7ar9 29 CA s A0A7S0VJV3_9CHLO NUOP7 MVKTLADYIHWRNKPSSIPPVDEYRPPVPLVNYDKLSTQFFSKLDNDPVINRVLRAPKVTVMATSLPIVNHPAFLFVAGALTGFSLTYAITSHYVGRKEIENLVKFDPRYFPEYTKSS 118 T 1.1 YtxH pdbhh F Eukaryota T 7ar9 30 DA t A0A7S0YCV2_9CHLO NUOP8 MRSALRLANATRLSTFRLTSAPAVRLASPSFFVQKEDEENTRSIHTSNSSFHDEPKHQIPGNALDNWAFLRTYAKPLPDMIHYYYYVYLFGFFFVYKVADFPEYSPRVLVMAALIGSLFYVRRDWVHREFKDSP 134 T 7.4 DUF2555 pdbhh F Eukaryota T 7arc 16 P r A0A7S0YAP9_9CHLO B14.5a MSGILKTVQSIFYSVGLKEPWKMTGIRSLPDFEYYLPFGLTYRGISPGNQPIKAVVPHDVPKLVYDIKYFARDYRRNNSYTVRSVDSKTPFDYSKVFGSAPLKPADVKTVRIPEVMPHRGC 121 T 0.012 CI-B14_5a pdbpssm F Eukaryota T 7arr 1 A,B,C,D A,B,C,D alpha/beta-peptide LSEEEIQRIFGLSSEQIKSLPEEXYKKXVEXTGYL 35 T 0.043 CSTF_C pdbhh F T 7ars 1 A,B A,B alpha/beta-peptide LSEEEIQRIFGLSSEQIKSLPEEXYKKXVEXTGYM 35 T 0.042 CSTF_C pdbhh F T 7arx 3 C C SFTI1_HELAN SFMI1 - Sunflower MASP1 inhibitor GICSRSLPPICIPD 14 T 0.052 Bowman-Birk_leg unp F Eukaryota T 7asd 2 B,D,F,H,J,L,N,P AB,BB,CB,DB,EB,FB,GB,HB APIM_APIME APISIN SUBUNIT APISIMIN,ROYAL JELLY PROTEIN RJP54 MSKIVAVVVLAAFCVAMLVSDVSAKTSISVKGESNVDVVSQINSLVSSIVSGANVSAVLLAQTLVNILQILIDANVFA 78 T 0.44 SRP54_N pdbpssm F Eukaryota T 7ase 22 V F Q4CQU0_TRYCC 40S ribosomal protein SA MTSVESGAKVLRMKEGDVQKLVAMHCHLGTKNRSNAMKKYIHSRTKEGTNIIDLHMTWEKLILAARVIAAVENPQDVTVCSTRLFGQRAIFKFSQLVGTSFLAGRFIPGTFTNQIQKKFMQPRVLLVTDPRTDHQALREASLVNIPVIAFCDTDAPLEFVDIAIPCNNRGRYSISMMYWLLAREVLRLRGTIPRSVPWDVKVDLFFYRDPEEALKHEEVNQAAAPVAEVDEGFGWVERDNNAWEQ 245 T 1.5E-13 Ribosomal_S2 pdb F Eukaryota T 7asy 1 A A TNFA_HUMAN CACHECTIN,TNF-ALPHA,TUMOR NECROSIS FACTOR LIGAND SUPERFAMILY MEMBER 2,TNF-A RRCLFLSLFSFLIVAGATTLFCLLHFGVIGPQR 33 T 6.2 PBP1_TM unphh F Eukaryota T 7at7 1 A A TNFA_HUMAN CACHECTIN,TNF-ALPHA,TUMOR NECROSIS FACTOR LIGAND SUPERFAMILY MEMBER 2,TNF-A RRCLFLPLFSFLIVAGATTLFCLLHFGVIGPQR 33 T 6.2 PBP1_TM unphh F Eukaryota T 7atb 1 A A TNFA_HUMAN CACHECTIN,TNF-ALPHA,TUMOR NECROSIS FACTOR LIGAND SUPERFAMILY MEMBER 2,TNF-A RRCLFLSLFSFLIVLLLTTLFCLLHFGVIGPQR 33 T 6.2 PBP1_TM unphh F Eukaryota T 7ath 1 A AAA A0A3Q9JIL7_9MICO UipA MGSSHHHHHHSSGENLYFQIGDEFGDDDRSSMSDDGPRHDADDHGPRGEDRGDDDRGNAPSNGRGPVTGIGTASADELIAIADAARGAADGEVTSIDAKRDGTWEVQLTTAAGAETEVRVDEALVASVTSTDAADGDDTGPALTLDDETIRALVSAALAEAEGMITDLDVDGDDVSPYDASVLTSDNRSIDIDFSADFAVVGTDID 206 T 0.0072 HPTransfase pdbpercent F Bacteria T 7atr 2 B B YEJA_ECOLI Uncharacterized protein YejA LGEPRYAFNFN 11 T 10 Cas9_C pdbhh F Bacteria T 7ax1 2 B B CNOT7_HUMAN BTG1-BINDING FACTOR 1,CCR4-ASSOCIATED FACTOR 1,CAF-1,CAF1A GPHMLEPAATVDHSQRICEVWACNLDEEMKKIRQVIRKYNYVAMDTEFPGVVARPIGEFRSNADYQYQLLRCNVDLLKIIQLGLTFMNEQGEYPPGTSTWQFNFKFNLTEDMYAQDSIELLTTSGIQFKKHEEEGIETQYFAELLMTSGVVLCEGVKWLSFHSGYDFGYLIKILTNSNLPEEELDFFEILRLFFPVIYDVKYLMKSCKNLKGGLQEVAEQLELERIGPQHQAGSDSLLTGMAFFKMREMFFEDHIDDAKYCGHLYGLGSGSSYVQNGTGNAYEEEANKQS 290 T 6.1E-32 CAF1 unphh F Eukaryota T 7ay8 1 A A Tbo-IT2 CIQRHRSCRKSSECCGCSVCQCNLFGQNCQCKSGGLIAC 39 T 0.00031 Toxin_9 pdb F T 7azx 2 C C HUWE1_HUMAN E3 ubiquitin-protein ligase HUWE1 SHDQHAVLVLQPAVEAFFLVHATERESK 28 T 0.49 DUF3652 pdbhh F Eukaryota T 7b0n 30 DA d A0A1D8NGI5_YARLL subunit NEBM of protein NADH:Ubiquinone Oxidoreductase (Complex I) [Yarrowia lipolytica] ALFTSLVGASGLGFATKFLSNKIRLKPAGYYPLGYVFSGVAWAGLGLVLHNVHQHSLEVLEKKKTALSEQRTE 73 T 0.065 DUF6404 pdbpssm F Eukaryota T 7b13 2 B P SHN3pS542 ASHSMPSAAC 10 T 6.1 Equine_IAV_S2 pdbhh F T 7b1f 2 C,D C,D BUB1_HUMAN HBUB1,BUB1A KVQPSPTVHTKEALGFIMNMFQAPTS 26 T 2.7 Feld-I_B pdbhh F Eukaryota T 7b1i 2 B C A0A219T3Y8_MAGOR AVR-PIK PROTEIN ( PIKMPROTEIN,PIKP PROTEIN ),AVRPI7 PROTEIN GPETGNKYIEKRAIDLSRERDPNFFDNADIPVPECFWFMFKNNVRQDAGTCYSSWKMDKKVGPNWVHIKSDDNCNLSGDFPPGWIVLGKKRPGF 94 T 2.8 DIM unphh F Eukaryota T 7b26 3 C C CirpA1 GPMGEDQETDFSSTDGAELIAKEPEVYPIDQFMNNTEIWVFNTTQPDPPNCKKDKSKSMTQTATSFVRSHVKNGNIIEENLVGNFTYFNDKEKVYDGIYISGESSGVYAEHLYYVSEDKKCGLFQVFAHVNDKTTIWRDVRVSGRPEEGVPLELNCTKEFDEYVKLVNATSKSPYTSECQ 180 T 0.0082 FBA_1 pdbpercent F T 7b2a 1 A A CirpA5 MGQSEKQEEPDYPINKFMNTTDEIWVFRTTQENVQKCKKDKNKYMTTSATFFTRSHEEQDQIHEQELVGKFANFYDKPDGVYDRIDITGDKTGVYEEALAYASKENTCGVVGVWAFDGETTVVWRELRVRNRPNDATKVDEMCKKKFDDYVQVVNKSWTSPYNEKCK 167 T 4.8 His_binding pdbhh F T 7b2b 1 A B A0A3D9UGN9_9GAMM PEPTIDE SYNTHETASE XPSB (MODULAR PROTEIN) MNNNELTSLPLAERKRLLELAKAAKLSRQHY 31 T 1.1 CSTF_C pdbhh F Bacteria T 7b4t 1 A B HIV-1 envelope variable loop 3 crown mimetic peptide V3-IF (BG505) KSIRIGPGQAFYAXP 15 T 0.00056 GP120 pdbhh F T 7b5h 1 A,B,C,DC,EC,FC,GA,HA,IA,MB,NB,OB,R,S,T,XA,YA,ZA AA,AB,AC,FA,FB,FC,CA,CB,CC,EA,EB,EC,BA,BB,BC,DA,DB,DC Q8YRX8_NOSS1 All3314 protein MTTTITIPNSYPIFTPNQVLTNKDLNRVVTYLDEQNRLTRVYLIGMGIVAGMEVSSIYQPGDVNIVVAPGCGITSEGYIISLAETKLTHYQSGVSVPSALFAPSEEQTAASTDQLVELFEQEGNNRLALKNLPDENAFARFLADQTLVVVYELQDQQRDSCLLDCDDTGKDRNFRLRYFLLPRSVPEKLSAEALLQQGFSREPLPQQWRDFSINDIFQAQSSFFQNFFPQVRRFGYTLETPPVIRLSNIVDYDAFLKGYQQVCLQAIDEIDRTFPNLFRLFSPFFSSFNPAPSDFTGLKTLLNQRLSDIVSGSSAENRRSPISQIEAQYALQYFYDYLSQLVSAFRELAESAFDLMDDATPDTRRFPKFLMLGLVPLPNQKPEVYALNSPYRSNFSQSPIYNGNQLRVKQVRFLYDRLVRLCAADSFYLLPFYDTPLKITPSKDRAATLSQQAIPYYLNYPQLYQYWSYDTYRKGRSQSHPAYFYPNNANITPNSDLLHRLDDYSFYRIEGHIGEANATALQRILDYQQRYNLAFDVITLKIGNLQSFQDINISGQFDDLNADFGRIKDTFAKLWQRYEESWSRNVFLYTLKRVFFDKTSLAEIKSDQLFNPIVARASVKEAYEFVKESGDSYRLYLRNAAGIRIARFETVINFSGLSGDSLTQEQERIIGDLLACLPLGKITYGVEPESANNPLSYYLRFSLADELDLPANRGTADISFISLNFFTVNFEGNSPIINQPEFQDFETLYSLLRDVPESSIRVNRLELRMGDRLAADTLNYFELKGLMTAYQQRLAQIMELQLFHKFAQNNPGMEHLGGVPKGGTFVLVYVDGRELVRNLLSADRDPTYQARTEVIKKYASLPPGSPQELATSRELLNREDIVVGDFCLPYRFSSKTPTVSYVLTQPRPIVLLDRTTFCAGDETRYEFILDPTGGTLKGEGSFFADGKYYFQPSRITDDITSETAITFTYVVESSYDTLSVTVYPLPDASFQIKTNFCSNENPVTLRATQPGGNFRAFDSETDISASVINNQEFNPSAVNLGGATEKVITLVYTITSDQGCTNELSRDITIFAVPNATFQVGQGKTRFCSNDEPVDLIARVPGGTFQVRDGAEDISADVINRLTTPPQFDPSAVNLGVAREKVITLEYSISNQGCSNKFTQELRIFAVPNANFRLSTGNRDTFTNNDPPVGLIATQLGGTFQAFDGEEDITADVISPTTPPQFNPSAVNLGDEEEKVITLRYTISNQGCSNNTERRVTIVPPPEVPVRDVEDTSNPDSGDAPTENPIPHPEVRAVNLLAISNNEVINSTNLDGDRTFNLSDFNPNNQYTFEAMTVPEKVNSVIFTYTKPNGSRQALTANTAPYRMPDDWQPSIGIHEIQAQAIREVNGDRLEGATIKVIIRVIDADTDTSPSRSTNPDNLFTRIQNLFPLNRGEIITKIKLPQLLAMSTAIFMLIVGWTYSSSKQVGSTPPSVIKPR 1476 T 0.035 Cadherin_5 pdbpercent F Bacteria T 7b5h 2 AB,D,GC,JA,PB,U DD,AD,FD,CD,ED,BD Q8YRX7_NOSS1 All3315 protein MPEYLSISKQKPDFPPYLNFQTLRDIGITHLQALSGKIWTDYNLHDPGVTILEVLCYAITDLGYRNNLDIADLLALNPQDGNSRENNFFTPDAVLTCNPVTELDVRKRLIDIPGVRNAWLQKVTSYEPNIYVNFSDKRLQYNPPTAESKTLNPRGLYTVRLDLDQDYRKNACGQIDRSWGDTLDEVKQVLCDSRNLCEDFADIVILGEEEIGICADIQLETNADAEDVLVNIYVRIQQFLSPRLKFYTLQELLDKGKSPAEIFAGRPSVFDGENRLYKSHGFIDTDELEALTLPTILHTSDLYQEILQVPGVSAIKKLSIANYINGLRQTQGHPWYLQLTDQYRPVLGVKTSKINFFKSELPIGVDEEEVERRYYEQQAAYIKTIRDRDELDIPVPKGSYYDLADHYSIHHDFPTTYGISEDGLPPTVPALRKAQALQLKAYLVFFDQLLASYLAQLSHIRDLFSWEVDVTQPQQNDYATRLQEKQRTYFTQKLDFPEIEKIIPDNYLDVLDEAPETYRDRRNRFLDHLLARFSESFSDYVLLNYQMFATRNNKATQETEIIHDKAQFLQDYPTLSRDRFRAYNYYDCHAVWDTDNVAGFKKRVLRLLGIDDVRRRHLSHYRVDKDSRNLFLSIDFSSDDLTLTSKQRYATTEQAQADQDKLLLFALHPNFYKRLSYKYYYHYSWEILDTQNQSIVRSDRFFPSTKERAAALEPLLQSLLTQLSQLDDTALQNLVITQPTDEDLYSFRLQIPLNSGVITFTGVQRYFSRTEAVDAGVISLRLIQDVQNYRNITLGQDQGTTPQKFTYYGYGLVDHQGSLLSEYTHHFPTELERELSLQRWLTHIQANQNQYKFAIETITNGYVFVINDITNSQTLLRGISSYATEYLAWQAASEFAENLRYLNRYLSPAKDHTGQTYSLGITDKTGKLLAVTTTESDRLLTFQRLNALEPFLVIEAATTPTSGYRYRLVDRQETTILQSIQIYGDETTARDRFYQDVLGTLFETGVINPTTTNKEFGFRILSRPRDTNSVAAIHTQTYTSEAERDAAIEHLLLLVRTARLRISTNSLDSLAYISQIYNPDNQLILQGTQRYTSEDIAWEQGNTLMELAQDEENFRLIDSDDGVYGWELTNEGKDEIFAAQYYNSREERTAAIAEIQKYSNDEGFHLLEHILLRPRTKLPDLTAGDGFLPILVTPEDVNTEPDDPYLLARTDPYSFWVTIVLPYWPQRFRDIPFRRFVERTLRLEAPAHIALKIAWVNVRQMRDFELAYRHWLEQLALESCENAACDLTGTLNRLLKILPQLRNVYPKATLHDCEESSADNNPAILNQTALGTAND 1335 T 0.001 DUF276 pdbhh F Bacteria T 7b5h 3 BB,CB,DB,E,F,G,HC,IC,JC,KA,LA,MA,QB,RB,SB,V,W,X DE,DF,DG,AE,AF,AG,FE,FF,FG,CE,CF,CG,EE,EF,EG,BE,BF,BG Q8YRX6_NOSS1 All3316 protein MNLRKRNELKSLFKNKSRLSETYFVELIDSTLNKRDDRFHGIWKPGQTYQKGDVVYYNHSLWEMQSENEICAKEEQTPGISTDWKSLLKELEQKVDKLQHELETLHQEFTEYQKQMEIRLQLLARFIPILFIGLGIMFFWLLGQSTVHILAGTT 154 T 0.00052 CLZ pdbpssm F Bacteria T 7b5i 2 AA,B,G,L,Q,V FB,AB,BB,CB,DB,EB Q8YRW6_NOSS1 All3326 protein MKILYKKILNLELWHDFYLGQPNTPGSLPNNYDISRTLALVPTQECLRVLANLRWVFRPQLYGASLFANVNAAPSGQFPTIFPIDRVYRLTFWLVVSDRYFANFTNLSLINSRNQIYYFSNLSGNEGHALFLTQPLSAYTTNNEYQLGQLVTHADKTLESLTYQGNATNIPNPSDWDSLPASQYVSELDHLPRQGTYRTQVITNANPDNTYNFTLVNTNEQESWAIDVIVPDTHKSGEPFSTSLNFVGQTPGHYRLLENDTQVAEFVLVDNSLPEAFALVEVILNPELVPSAFSLLQASAGQTFIQPKTYVIRFKNRATRWRYRYEQPHGCSAANLPSYFNLIDTHTYATARPIGLRQRPDSLLNDCQDRPLPAPSITLIQPETDGSQRIARIFSDIYL 399 T 0.66 Y_Y_Y pdbpercent F Bacteria T 7b9v 27 AA Z NTC20 isoform 1 MPSLRDLSLERDQELNQLRARINQLGKTGKEEANDFVGLNISNEPVYDTVIQTGQSSNATNSFVQETIQKTKQKESGQPYIIPQKNEHQRYIDKVCETSDLKAKLAPIMEVLEKKTNEKIKGIIRKRVLQEPDRDNDDSG 140 T 9.3E-05 cwf18 pdbhh F T 7bag 3 C C Compstatin CP40 XICVXQDWXAHRCX 14 T 1.9 Inhibitor_I36 pdbhh F T 7bas 1 A,B,C,D,E A,B,C,D,E CC-Type2-(TgLaId)4-W19BrPhe. XGEIAQTLKEIAKTLKEIAXTLKEIAQTLKGX 32 T 0.004 ApoC-I pdb F T 7bat 1 A,B,C A,B,C CC-Type2-(GgIaId)4 XGEIAQGIKEIAKGIKEIAWGIKEIAQGIKGX 32 T 0.05 MCPsignal pdbpssm F T 7bau 1 A,B,C,D,E,F,G,H,I,J A,B,C,D,E,F,G,H,I,J CC-Type2-(TgLaId)4-W19BrPhe. XGEIAQTIKEIAKTIKEIAXTIKEIAQTIKGX 32 T 0.0016 MCPsignal pdb F T 7baw 1 A,B,C,D,E,F,G A,B,C,D,E,F,G CC-Type2-(GgIaId)4 XGEIAQGIKEIAKGIKEIAWGIKEIAKGIKGX 32 T 0.062 MCPsignal pdbpssm F T 7bb5 1 A A G4A6K5_AGGAC AcrIF9 MGSSHHHHHHSQDPMTNVVYYFTETNNINAYATAEALKAQTLADAKREASRRQCFQGTTLKIGTIYSLNSDGLLVDEITSKEDGKKWVDRY 91 T 3.2 DVNP unphh F Bacteria T 7bcj 1 A A Q6A6F6_CUTAK Radical Oxygenase of Propionibacterium acnes MTPIDESQLPVGPQVSVTDSAQHTGPFAASSPLTITVKPGAPCVRADGYQESMVTRVLDDKGHQVWTGTFDESKLIGGTGLGTATFHVGSPAAAFNFHGSERTTYRTLSYCAYPHYVNGTRERLSQVSVKTFMVDPALNLEHHHHHH 147 T 2.2 EAGR_box pdbhh F Bacteria T 7bcy 2 C,D P,Q LANA1_HHV8P ORF 73 XCRKRNRSPERX 12 T 0.022 DUF5401 unphh T Viruses T 7bdx 1 A,B,C,D A,B,C,D HSF2B_HUMAN Heat shock factor 2-binding protein AEMGAAACTLLWGVSSSEEVVKAILGGDKALKFFSITGQTMESFVKSLDGDVQELDSDESQFVFALAGIVTNVAAIACGREFLVNSSRVLLDTILQLLGDLKPGQCTKLKVLMLMSLYNVSINLKGLKYISESPGFIPLLWWLLSDPDAEVCLHVLRLVQSVVLEPEVFSKSASEFRSSLPLQRILAMSKSRNPRLQTAAQELLEDLRTLEHNV 214 T 0.0026 KAP unphh F Eukaryota T 7bdx 2 E,F E,F BRCA2_HUMAN FANCONI ANEMIA GROUP D1 PROTEIN NEFDRIIENQEKSLKASKSTPDGTIKDRRLFMHHVSLEPITTVPFRTTKERQENLYFQG 59 T 8.8 GMP_synt_C pdbhh F Eukaryota T 7bfy 1 A A B4EH86_BURCJ Lectin MPLLSASIVSAPVVTSETYVDIPGLYLDVAKAGIRDGKLQVILNVPTPYATGNNFPGIYFAIATNQGVVADGCFTYSSKVPESTGRMPFTLVATIDVGSGVTFVKGQWKSVRGSAMHIDSYASLSAIWGTA 131 T 0.002 DUF1543 unppercent F Bacteria T 7bgh 1 A A OEP21_PEA CHLOROPLASTIC OUTER ENVELOPE PORE PROTEIN OF 21 KDA,GOEP21 METSLRYGGDSKALKIHAKEKLRIDTNTFFQVRGGLDTKTGQPSSGSALIRHFYPNFSATLGVGVRYDKQDSVGVRYAKNDKLRYTVLAKKTFPVTNDGLVNFKIKGGCDVDQDFKEWKSRGGAEFSWNVFNFQKDQDVRLRIGYEAFEQVPYLQIRENNWTFNADYKGRWNVRYDLLEHHHHHHHHHH 189 T 0.021 Fmp27_GFWDK unppssm F Eukaryota T 7bgt 2 E,F G,F peptidomimetic inhibitor PYVXAMH 7 T 55 Allatostatin pdbhh F T 7bim 1 A,AA,B,BA,C,CA,D,DA,E,EA,F,FA,G,GA,H,HA,I,IA,J,JA,K,L,M,N,O,P,Q,R,S,T,U,V,W,X,Y,Z A,a,B,b,C,c,D,d,E,e,F,f,G,g,H,h,I,i,J,j,K,L,M,N,O,P,Q,R,S,T,U,V,W,X,Y,Z Nonameric de novo coiled coil CC-Type2-(GgLaId)4 XGEIAQGLKEIAKGLKEIAWGLKEIAQGLKGX 32 T 0.048 WXG100 pdbpssm F T 7bjs 1 A,B A,B KINH_DROME Kinesin heavy chain SMSFLENNLDQLTKVHKQLVRDNADLRCELPKLEKRLRCTMERVKALETALKEAKEGAMRDRKRYQYEVDRIKEAVRQKHLGRRGPQAQ 89 T 1.2E-05 SMC_N unphh F Eukaryota T 7bkx 1 A AAA Q6SVB5_DIPPU Milk protein MRQVWFSWIVGLFLCFFNVSSAKEPCPPENLQLTPRALVGKWYLRTTSPDIFKQVSNITEFYSAHGNDYYGTVTDYSPEYGLEAHRVNLTVSGRTLKFYMNDTHEYDSEYEILAVDKDYFIFYGHPPAAPSGLALIHYRQSCPKEDIIKRVKKSLKNVCLDYKYFGNDTSVHCRYLE 177 T 0.23 CE2_N pdbpercent F Eukaryota T 7blo 3 C,F N,H NRAM2_HUMAN C-term (residues 493-54) of Wls (fitted sequence corresponds to hDMT1-II) QPELYLLNTM 10 T 4.9 DUF5081 pdbhh F Eukaryota T 7blz 15 O O M1VFJ4_CYAM1 PsaO FEVSDGEPYPLNPAVIFIALIGWSAVAAIPSNIPVLGGTGLTQAFLASIQRLLAQYPTGPKLDDPFWFYLIVYHVGLFALLIFGQIGYAGYAKGTYN 97 T 23 YbgT_YccB unphh F Eukaryota T 7bn1 2 C,D E,F MUNS_REOVL Protein mu-NS from Reovirus type 1 VDGAADLIDFSVPTDEY 17 T 5.4 HAUS-augmin3 unphh T Viruses T 7bn2 2 C,D CCC,DDD Non structured protein 3 from Eastern Equine Encephalitis Virus SDHSVDLITFDSVTDIY 17 T 2.9 DUF3343 pdbhh F T 7bnt 1 A,B A,B Predicted ancestral HMA domain of Pik-1 from Oryza spp. GPGMKQKIVIKVPMASDKCRSKAMALVASTGGVDSVALVGDLRDKIEVVGDGIDSIKLVSALRKKVGHAELLQVS 75 T 0.00011 HMA pdbpercent F T 7bnt 2 C C C4B8B8_MAGOR AVR-PIK PROTEIN ( PIKMPROTEIN,PIKP PROTEIN ),AVRPI7 PROTEIN,FRAGMENT OF MAGNAPORTHE ORYZAE AVR-PIKD GPETGNKYIEKRAIDLSRERDPNFFDHPGIPVPECFWFMFKNNVRQDAGTCYSSWKMDMKVGPNWVHIKSDDNCNLSGDFPPGWIVLGKKRPGF 94 T 0.1 TMEM18 unp F Eukaryota T 7bny 1 A,B,C,D A,B,C,D POLG_ENMGO Genome polyprotein SPNPLDVSKTYPTLHILLQFNHRGLEARIFRHGQLWAETHAEVVLRSKTKQISFLSNGSYPSMDATTPLNPWKSTYQAVLRAEPHRVTMDVYHKRIRPFRLPLVQKEWRTCEENVFGLYHVFETHYAGYFSDLLIHDVETNPGGSKHHHHHH 152 T 0.0044 LZ3wCH pdbpssm T Viruses T 7bo8 1 A,B,C,D,E,F A,B,C,D,E,F CC-Type2-(VaYd)4-Y3F-W19(BrPhe)-Y24F XGEFAQAVKEYAKAVKEYAXAVKEFAQAVKGX 32 T 0.053 IFT20 pdb F T 7bo9 1 A,B,C,D,E,F A,B,C,D,E,F CC-Type2-(VaYd)4-Y3F-W19(BrPhe) XGEFAQAVKEYAKAVKEYAXAVKEYAQAVKGX 32 T 0.08 IFT20 pdb F T 7boa 1 A,B,C,D,E,F,G,H A,B,C,D,E,F,G,H CC-Type2-(YaFd)4-W19(BrPhe) XGEFAQAYKEFAKAYKEFAXAYKEFAQAYKGX 32 T 8.6 Cas_Csy3 pdbhh F T 7boc 2 B B RIOK1_HUMAN peptide SRVVPGQFDDADSSD 15 T 0.048 COPR5 pdbhh F Eukaryota T 7bow 1 A,B A,B Hydroxynitrile lyase GSLTCDKLPKVIPPGIDAFTSHNPFEFSYVLTDDLDCTARVYVQPVHGLTNYSGTAFDIKGTHITINDFTIGADGLTAYLTNCDTGEKQVWHFQYVDLGDPQGANYCAYSCNGPQIAEYKCTTNTGYISPKQLQAVKEARSVPNGDKIHLAQVDCPPHLYCPLYY 165 T 2.6 VanY pdbhh F T 7bph 2 B B GN13 XXFESVYAIWGTLCGX 16 F F T 7bpl 1 A A NF1 GDADKIMEQAKRQDPNAQVYKVTTPDEIEEAVRRIEKYGAQVVLIIYTSSGIVILVAVRDPSQADQILKEAKKQNPSATFVRLEGVSPDDLRRQVEDVWRGSLEHHHHHH 110 T 0.0052 GGDEF_2 pdb F T 7bpm 1 A A NF2 GTEIELESKNGQREHYTATSEDEARKIIEKAVRRGIKRIELRGASEQLIRDMQEIAKQIGLQYRTDGSLEHHHHHH 76 T 0.1 DUF6506 pdb F T 7bpn 1 A A NF7 GQIQYFNVDENPEQVRKLIEQAGLDPDELREAEVIIIIISRTPEQLEKLSRQVKELGADRLLEFNVDENPEQASKLAKTAGISEKQLREADYIILILVRDEKKAKKFADSLRKKGSLEHHHHHH 124 T 0.015 AAA_12 pdb F T 7bpp 1 A A NF5 GEDDEILQRAKDILKEDPNRKILIILNPDGKIELYEVTSEEDIKRIAKKAGISEELLRRILQSFRDGQYDLFFIAKTEDDERRARELKERMGKPVEILRGSLEHHHHHH 109 T 0.002 Nucleoporin_N pdb F T 7bq9 1 A,B B,A PP62_ASFB7 60 kDa polyprotein RSPWPSNMKQFCKISVWLQQHDPDLLEIINNLCMLGNLSAAKYKHGVTFIYPKQAKIRDEIKKHAYSNDPSQAIKTLESLILPFYIPTPAEFTGEIGSYTGVKLEVEKTEANKVILKNGEAVLVPAADFKPFPDRRLAVWIMESGSMPLEGPPYKRKKEGGGNLESRGPFEGKPIPNPLLGLDSTRTGHHHHHH 194 T 0.34 Fasciclin unppssm T Viruses T 7bqa 1 A,B B,A PP62_ASFB7 CP530R,PCP530R RSPWDPPVPKHISPYTPRTRIAIEVEKAFDDCMRQNWCSVNNPYLAKSVSLLSFLSLNHPTEFIKVLPLIDFDPLVTFYLLLEPYKTHGDDFLIPETILFGPTGWNGTDLYQSAMLEFKKFFTQITRQTFMDIADSATKEVDVPICYSDPETVHSYTNHVRTEILHHNAVNKVTTPNLVVQAYNELEQTNTIRHYGPIFPESTINALRFWKKLWQDEQRFVIHGLHRTLMDQPTYETSEFAEIVRNLRFSRPGNNYINELNITSPAMYGDKHTTGDIAPNDRFAMLVAFINSTDFLYTAIPEEKVGGNLESRGPFEGKPIPNPLLGLDSTRTGHHHHHH 339 T 30 Imm74 pdbhh T Viruses T 7bqb 1 A A NF6 GKLYEVDSPDSVEKIARELGLSEEQLRRIQKEFERAERKGKLVIVYLTSDGKVEIREVTSEEELEKILKKLGVDEEIIRRIKRLRKEGQIKLVIIEGSLEHHHHHH 106 T 0.00032 HTH_23 pdb F T 7bqc 1 A A NF4 GSEEIRELVRKIYETVRKENPNVKILIFIIFTSDGTIKVIIVIIADDPNDAKRIVKKIQERFPKLTIKQSRNEEEAEKRIQKELEERNPNAEIQVVRSEDELKEILDKLDEKKGSWSLEHHHHHH 125 T 0.011 DUF6377 pdb F T 7bqd 1 A A NF8 GTILIFLDKNKEQAEKLAKEVGVTEIYESDNLEELYREIKERIERENPNATILTVTDPNELKKIQDEGKVDRIILLIKGSLEHHHHHH 88 T 0.023 Methyltransf_25 pdb F T 7bqe 1 A A NF3 GSDEEIRKKLEELAKRKGKDLQLRRYNDPNEVEKSIREALKKGRTLIIIINGVFVVVSTDEDLIREIKRLIKESNPNKKTLDVTTEEDLEEVLRRIKKGSWSLEHHHHHH 110 T 0.03 BtrH_N pdb F T 7bqm 1 A A Chantal GEEEKEIDKLVELFAQAYEDAREKKRNGTPEEWVRDAIEEAARRVGRSRSRVVEALRRYAEKHGKEELLKRAGITPEALKVIEKIEKEEGSLEHHHHHH 99 T 0.0054 HTH_28 pdb F T 7bqn 1 A A Rei GDEAEKQAERALELVRKSPDLLKKLLEAMAEELKRQGKSPDEIQKAKDEVKTKVEQAIREWKQGNEEQARKDMRKVLKSPAFKQAVKVMEEQEPNNPEVQELKKAMEEAERGSLEHHHHHH 121 T 0.024 EcoEI_R_C pdb F T 7bqq 1 A A Gogy GDERKLEEVTEEMRKMAENMDGQDPEKVKEIVRRALQQMANDNPEVSEQLRELAKRKGTSPSEVIKDLAEQVWRAMERAREGDKDTARELIRKFADDLGISPEQVKKFIKIMREVQRKEDGSLEHHHHHH 130 T 0.001 PSK_trans_fac pdbhh F T 7bqr 1 A A Mussoc GDEDKEKLKREAERALSEALSEFEKQGKITPETLKRLAEEIAEAALAQQQGDSERLEKAARRFAETLLRALKESGASAEEIEEAIERIRKALSKAPSPQLQKLANSPQWQTALQEAIKKARQEKKEKGSLEHHHHHH 137 T 0.0043 TMP_3 pdb F T 7bqs 1 A A Nomur GETKAKAAQEALRAAREQATTPEAQKALEELEKVLKTASPEQWRQAAEKIFEAFREASNGNTEKAKKLLEEAARTAGASPEIIKKLASALERLAEEGAAKEAARQAEEVRKRGSLEHHHHHH 122 T 0.0038 BTG pdb F T 7br1 1 A,B A,B A0A7I6N400_PARLM Hydroxynitrile lyase SLTCDKLPKVIPPGIDAFTSHNPFEFSYVLTDDLDCTARVYVQPVHGLTNYSGTAFDIKGTHITINDFTIGADGLTAYLTNCDTGEKQVWHFQYVDLGDPQGANYCAYSCNGPQIAEYKCTTNTGYISPKQLQAVKEARSVPNGDKIHLAQVDCPPHLYCPLYY 164 T 2.6 VanY pdbhh F Eukaryota T 7bsb 1 A,B,C,D,E,F,G,H,I,J,K,L,M,N B,G,A,D,E,F,C,H,I,J,K,L,M,N D7DTD6_METV3 lectin MAQNDNYIYSTEVGGVGGTPFTFMQESGTITSIKFNWSDQYKLLHHIEVKFINNANIYATGDPKGNHEVILEIDDDETIIGSVIGYKKGNDGRCTGVKLTTSKGKSIMAGYFEESLITTYTGKLAGIKGGAGSDIDRLGLIFLKK 145 T 0.00015 Jacalin pdbpssm F Archaea T 7bt8 1 A,B,C,D,E,F,G B,G,A,D,E,F,C D7DTD6_METV3 lectin DNYIYSTEVGGVGGTPFTFMQESGTITSIKFNWSDQYKLLHHIEVKFINNANIYATGDPKGNHEVILEIDDDETIIGSVIGYKKGNDGRCTGVKLTTSKGKSIMAGYFEESLITTYTGKLAGIKGGAGSDIDRLGLIFLK 140 T 0.00015 Jacalin unppssm F Archaea T 7bu5 2 B B SLX4_HUMAN BTB/POZ DOMAIN-CONTAINING PROTEIN 12 RKKNLPPKVPITPMPQYSIMETPVLKKELDRFGVRPLPKRQMVLKLKEIFQYTHQTLDSDS 61 T 0.067 SAP pdbpercent F Eukaryota T 7bv3 1 A,B A,B A0A346A6C4_SIRGR UGT TRANSFERASE KVELVFVPGPGIGHLSTALQIADLLLRRDHRLSVTVLSIPLPWEAKTTTQPESLFPSSTTTTTSRIRFISLPQRPLPDDAKGPFQFQAVFETQKQNVKEAVAKLSDSSILAGLVLDMFCVTMVDVAKQLGVPSYVFFTSSAGYLSFTSHLQDLSDRHGKETQQLMRSDVEIAVPGFTNPVPGKVIPGVYFNKNMAEWLHDCARRFRETNGILVNTFSELESQVMDSFSDATAASQFPAVYAVGPILSLNKNTSAASSESQSGDEILKWLDQQPPSSVVFLCFGSKGSLNPDQAREIAHALERSGHRFVWSLRQPSPKGKFEKPIEYDNIEDVLPEGFLDRTAEMGRVIGWAPQVEILGHPATGGFVSHCGWNSTLESLWYGVPIATWPMYAEQHFNAFEMGVELGLAVGISSESSIEEGVIVSAEKIEEGIRKLMGGGGGGGGGEVRKLVKAKSEESRKSVMEGGSSFTSLNRFIDEVMKSPF 483 T 1.7999999999999997E-24 UDPGT pdb F Eukaryota T 7bv4 2 B,D,F,H C,D,F,H STX17_HUMAN Syntaxin-17 NAAESWETLEADLIELSQLVTD 22 T 0.0009 Syntaxin unphh F Eukaryota T 7bv7 1 A,B A,B INT3_HUMAN INT3,SOSS COMPLEX SUBUNIT A,SENSOR OF SINGLE-STRAND DNA COMPLEX SUBUNIT A,SENSOR OF SSDNA SUBUNIT A TVVEEPVDITPYLDQLDESLRDKVLQLQKGSDTEAQCEVMQEIVDQVLEEDFDSEQLSVLASCLQELFKAHFRGEVLPEEITEESLEESVGKPLYLIFRNLCQMQEDNSSFSLLLDLLSELYQKQPKIGYHLLYYLRASKAAAGKMNLYESFAQATQLGDLHTCLMMDMKACQEDDVRLLCHLTPSIYTEFPDETLRSGELLNMIVAVIDSAQLQELVCHVMMGNLVMFRKDSVLNILIQSLDWETFEQYCAWQLFLAHNIPLETIIPILQHLKYKEHPEALSCLLLQLRREKPSEEMVKMVLSRPCHPDDQFTTSILRHWCMKHDELLAEHIKSLLIKNNSLPRKRQSLRSSSSKLAQLTLEQILEHLDNLRLNLTNTKQNFFSQTPILQALQHVQASCDEAHKMKFSDLFSLAEEYEDSSTKPPKSRRKAALSS 436 T 0.0083 IFRD pdbpssm F Eukaryota T 7bw5 1 A A A0A2M8WFL4_9SPHN lasso peptide koreensin GPKGDFPDVGDGRILAG 17 T 0.022 DUF5974 unphh F Bacteria T 7bwk 1 A,F A,F Q5ZYC6_LEGPH IcmO (DotL) GQNEPEPVEDIVEEEVEGALTIFSKLRIDPNAPPILVADKEVFSEPLLPINETRNQMITIERLAGAKDKYAGTVANELIKDFQIATSYPPEERDVIDVQELTGIIRDLSAKISAEREKANKKAAEELT 128 T 0.098 DUF1840 pdb F Bacteria T 7bwk 4 D,I D,I Q5ZY48_LEGPH Hypothetical virulence protein MADGDIEIKAGFVDTDLDDRKLTMIDDLNNPLAIVERVYLIWWHWADFHLHVISPHIDTITPAIVIEPELIPGSNDHEFVYSIHDSGSKLSTSKSQDMFSAGMSMCKLFYTIEKMVYILVERLKSGGVSMEAEVQIAFAGHEIAQRKAFESIINLPYNVVVTNFDPGIWGEKYLQNVKRLADKGYGYPPESPRKIYMHPVSSGTTARK 208 T 3.1 Herpes_TK_C pdbhh F Bacteria T 7bwk 5 E,J E,J Q5ZW60_LEGPH PNPLA domain-containing protein NSSQQQEQLKEKTMLFKSRLQSFKQGEGVKPWSQHVENAIDRLMSLKGEITKAQVDLGRTWFDIKSENADPAVRLKKFNDAFLASPLAKPSSNQQEINFSKEIRKEIDLLKGLPGLNNTSSHCTEEFNEQ 130 T 0.041 Antigen_Bd37 pdb F Bacteria T 7bxf 1 A A Q5ZTL3_LEGPH MvcA SMIVRGINMTKIKLESPGFMVHKKLKSMSQSYGVMMTGVPAEVLGQMQAERSIPSINKTGNLKQQIAKEVSKVCHMMTEPTQSCGQASNDVCELLLGKIEAEKFHFTKYEALSADGDNLKNVLENTAPSSTNLLIRFEIDREDPPIVLVKTKNENFNPETAVKNKIYLLENKLYFIDKMGNLFNLGPGKKKCTQLFNAIGDSAEYSLCDPFVLEEPEKPEDFAISEIVDIFNEQKERFDFWIGSHSFTIYIPQTLGESPRQFYPYQAYFGSHTLQDWFVSDKDEYLSRIGIDKYIEKLAVLGKTTNTKERSDIYAEFFSKRGREAFFCAHLNEKRQPLRVKFKITEINPELALKNLQETQEFIDTHPGENPSDKVENYRNRAKLAMTEHLESLLDIKPESS 401 T 0.023 AgrD pdb F Bacteria T 7bxg 1 A A Q5ZTL4_LEGPH MavC SMTTSKLEKTGLHVHEKIKHMVKNYGTMITGIPAEILGQNEAEISVGYVKKMGNMKENIAEVVRKSEMTQPTNSCGKASNEVCDLLLGTEGASEFEKSSYQVLSGDGSNLKGSLPNKNLLVRVEMDRFNAPQKYQKIKREEFNPETAEKNKIYLLEDQLVYLDIFGKVIDLGQTSDTCHRLFNAITTPFYQNYILYDEYIDPEESAEEAAMFEMGEIVKAKMKNIDCWTATHSFTIFVPESDSEDTRTLYPYQAYWTSHTLQQWFSGDKDEKLSRLGIDGYIEKLALLGTTTDSKIRSSIYGELFSPPGKEHVFCTGMNEKFSPLRVKFKVTEVNPEIALQNLEEVQEFIDTNYPGENAKDQCELYKIKAQEAMTKQLEMRLLIEP 386 T 0.035 V-ATPase_H_N unppercent F Bacteria T 7bxt 7 K,L K,L F1NSD9_CHICK CENTROMERE PROTEIN CENP-C QKIVLPSNTPNVRRTKRIRLKPLEYWRGERVTYTLKPSGRL 41 T 0.073 DUF3141 pdbpercent F Eukaryota T 7by7 1 A A GP46_BPSP1 Putative gene 46 protein MMTEDQKFKYLTKIEELEAGCFSDWTKEDITGDLKYLKKGIIEESIELIRAVNGLTYSEELHDFTQEIIEELDISPL 77 T 3.6 DUF1244 pdbhh T Viruses T 7bye 1 A,D A,D A6TJ72_KLEP7 Antitoxin MazE KAGPTLEELLGQCTAENRHHEYLCDSQGKEML 32 T 4.4 ACT_3 pdbhh F Bacteria T 7byf 2 B,F B,E NUP98_MOUSE Peptidase S59 domain-containing protein PTGTTIKFNPPTGTDTMVKAGVSTNISTKHQCITAMKEYESKSLEELRLEDYQANRK 57 T 14 PRC2_HTH_1 pdbhh F Eukaryota T 7bz7 1 A A lasso peptide LVVIVQADWNAPGWY 15 T 0.58 InlK_D3 pdbhh F T 7bz8 1 A A lasso peptide LVAIVQADWNAPGWF 15 T 0.94 DUF6446 pdbhh F T 7bz9 1 A A lasso peptide LVVAVQADWNAPGWF 15 T 1.1 DUF6446 pdbhh F T 7bza 1 A A lasso peptide LVVIVQADWNAPGWF 15 T 1.6 DUF6446 pdbhh F T 7bzh 1 A A D2PEW5_SULID Sul7s MEDVKQSVEKIIKDREWVTFNDLLKYIPYPAPEVYDALSQLIKENKVGRRGRYFYYIKR 59 T 1.8E-05 SelB-wing_1 pdbhh F Archaea T 7c06 2 B,E,H,K,N,Q,T,W,Z B,E,H,K,N,Q,T,W,Z U2AF2_SCHPO U2 AUXILIARY FACTOR 59 KDA SUBUNIT,U2AF59,U2 SNRNP AUXILIARY FACTOR LARGE SUBUNIT SSVGRSRSPPPSRERSVRSIEQELEQLRDVTPINQWKRKRSLWDIKPPGYELVTADQAKMSGVFPLPGA 69 T 9.8 Transformer unphh F Eukaryota T 7c1x 1 A,B A,B Q94AT3_ARATH PfkB-like carbohydrate kinase family protein MEPVIIGALILDVHAKPSTTPISGTTVPGQVLFAPGGVARNVADCIFKLGITPFMIGTLGLDGPANVLLKEWKLSMKGILRREDISTPIVSLVYDTNGEVAAGVAGVDAVENFLTPEWIQRFEYNISSARLLMVDANLSSLALEASCKLAAESSVPVWFEPVSVTKSQRIASIAKYVTIVSPNQDELIAMANALCAKNLFHPFRSDENKLSIEDMFRALKPAILVLLKNGVKVVIVTLGSNGALLCSKGNPKKALNIDRKFLRSGEVFKRVQSVCSPNRFSELGSNRSPSLFAMHFPTIPAKVKKLTGAGDCLVGGTVASLSDGLDLIQSLAVGIASAKAAVESDDNVPPEFKLDLISGDAELVYNGAKMLMVHQSML 378 T 1.2E-18 PfkB pdbpercent F Eukaryota T 7c4j 3 C C SNF6_YEAST SWI/SNF COMPLEX COMPONENT SNF6 MGVIKKKRSHHGKASRQQYYSGVQVGGVGSMGAINNNIPSLTSFAEENNYQYGYSGSSAGMNGRSLTYAQQQLNKQRQDFERVRLRPEQLSNIIHDESDTISFRSNLLKNFISSNDAFNMLSLTTVPCDRIEKSRLFSEKTIRYLMQKQHEMKTQAAELQEKPLTPLKYTKLIAAAEDGSRSTKDMIDAVFEQDSHLRYQPDGVVVHRDDPALVGKLRGDLREAPADYWTHAYRDVLAQYHEAKERIRQKEVTAGEAQDEASLQQQQQQDLQQQQQVVTTVASQSPHATATEKEPVPAVVDDPLENMFGDYSNEPFNTNFDDEFGDLDAVFF 332 T 0.0043 CENP-Q pdbpercent F Eukaryota T 7c4j 8 I G Unkown XXXXXGRXXXXXPXXXXXXXXXXXXXXXVXXTXXVTXLXXXXXXXXXXXXXXXXXXXXXXXXXTRXYLRFHXXXYXXXXXXX 82 T 210 YlbE pdbhh F T 7c53 1 A,B,C,D,E,F A,B,C,D,E,F SPIKE_SARS2 Spike protein S2',pan-CoVs inhibitor EK1 GVTQNVLYENQKLIANQFNSAIGKIQDSLSSTASALGKLQDVVNQNAQALNTLVKQLSSNFGAISSGGRGGSLDQINVTFLDLEYEMKKLEEAIKKLEESYIDLKEL 107 T 1.2E-07 CoV_S2 pdbpssm T Viruses T 7c5v 1 A,B A,B Q8YT18_NOSS1 iota-carbonic anhydrase GSHDSGDKITATSSLKTPIVNRAITESEVLAAQKAWGEALVAISTTYDAKGKASAKALAEKVIDDAYGYQFGPVLFKPTLAISPRTFRTTRAGALAYFVGDDKAFPEDKGFALSSWRKVEIKNAAIFITGNTATTMGNVIITDKQGKATTVDKTWQFLKDDHGKLRIITHHSSLPYEQ 178 T 0.0077 SnoaL_3 unphh F Bacteria T 7c5x 1 A,B A,B iota-carbonic anhydrase GSHDATITEAEVLNAQSKWAEAIKTISRTYLNGGDYIKTAGDAAAELYGYGKSKVLFKPTKAAEFPFRPTGEEAMSYFVGGNAVEKGYKEDAGFAINGGKGWSNVVFNNHDIDINGNTAVAMGSYVFTCATTGTETKVEYTFGYKRNDDGKVRIFLHHSSVPYSESPAPVTLKEVTECQEKWANAIQTISKTYLDGGDYIGEAGKQAGILYGYGNTNVLFKPTKATDHPFRPTGEQAMSYFVGGDVVDNGYVGEDAGFAINGGKGWSKVVFRNHQVDLNGPVAIAMGDYVFTSAADGSETRVEYTFGYKRNDDGNVRIFVHHSSVPYKEEVAPITEAEVLECQKNWANAIQTISKTYLDGGDYIGEAGKQAGILYGYGNTNVLFKPTKATDHPFRPTGEEAMSYFVGGDVVENGYVGEDAGFAINGGKGWKNVVFRNHQLDFNGPVAIAMGDYVFTSAADNSETRVEYTFGYKRNPDGKPRIFLHHSSVPYKEEPVTNTIRKRLFASA 508 T 0.032 SnoaL_3 pdbhh F T 7c5z 1 A,B A,B A7J936_SVCV Phosphoprotein SWEEESTGIDLGFGPGIVMPSVSNHEGGTYVRYNGLGNVDPNYKNLISKMMRSLIGQIGNKYGYDIDLFDYQGDFLEVFLPHKPSK 86 T 0.022 Cass2 pdbpssm T Viruses T 7c78 1 A A A0A2S2CJ39_9GAMM AcrIF9 GSMKAAYIIKEVQNINSEREGTQIEATSLSQAKRIASKEQCFHGTVMRIETVNGLWLAYKEDGKRWVDCQ 70 T 1.7 YhzD unphh F Bacteria T 7c8e 2 C,D C,D 9J10 LNRTPGRRRNSN 12 T 3.7 PSRT pdbhh F T 7cbc 1 A,B A,B De novo designed switch protein caging a hemagglutinin binder (sCageHA267_1S) MSELARKLLEASTKLQRLNIRLAEALLEAMARLQELNLELVYLAVELTDPKRIRDEIKEVKDKSKEIIRRAEKEIDDAAKESEKILEEAREAISGSGSYLAKLLLKAIAETQDLNLRAAKAFLEAAAKLQELNIRAVELLVKLYDPATIREALEHAKRRSKEIIDEAERAIRAAKRESERIIEEARRLIEKGSGSGSELARELLRAHAQLQRLNLELLRELLRALAQLQELNLDLLRLASELTDPDEARKAIARSKRESKRIVEDAERGGGTFACRIAAKIAAEFGYSEEQIKELLKNAGCSEDEARDAVEYLRSRPGL 319 T 0.012 PhoU pdbpercent F T 7cc9 1 A,B,C A,B,C A0A0M4DML1_STRPR HNHc domain-containing protein LTDTDRSEDFLRRVRGLKAARTANGPRLYQPITLLWAVGRARRGEARTLAWADTDEAIGALLKRHGARGERPRPDYPVLALHRAGLWTLEGHVGEVPTAHGDSALRNWFAEQRPVGGLAEPFHDLLHRSGHSRVSVIEALLTTYFAGLDPVPLLEDTGLYDEG 163 T 20 TMEM214 pdbhh F Bacteria T 7ccd 1 A,D A,B A0A0M4DML1_STRPR HNHc domain-containing protein PLTDTDRSEDFLRRVRGLKAARTANGPRLYQPITLLWAVGRARRGEARTLAWADTDEAIGALLKRHGARGERPRPDYPVLALHRAGLWTLEGHVGEVPTAHGDSALRNWFAEQRPVGGLAEPFHDLLHRSGHSRVSVIEALLTTYFAGLDPVPLLEDTGLYDEGHHHHH 169 T 20 DUF4014 pdbhh F Bacteria T 7ccj 1 A,D A,B A0A0M4DML1_STRPR HNHc domain-containing protein MPLTDTDRSEDFLRRVRGLKAARTANGPRLYQPITLLWAVGRARRGEARTLAWADTDEAIGALLKRHGARGERPRPDYPVLALHRAGLWTLEGHVGEVPTAHGDSALRNWFAEQRPVGGLAEPFHDLLHRSGHSRVSVIEALLTTYFAGLDPVPLLEDTGLYDEG 165 T 0.26 Hemerythrin pdbpercent F Bacteria T 7ccn 1 A A LBT3 FIDTNNDGWIEGDELLA 17 T 0.0037 EF-hand_6 pdb F T 7cdb 2 C C GBRG2_MOUSE GABA(A) RECEPTOR SUBUNIT GAMMA-2 ERDEEYGYECLDGKDCAS 18 T 11 FOLN pdbhh F Eukaryota T 7cdc 3 C C PRO-ARG-SER-PHE-LEU-VAL-ARG-ARG-PRO PRSFLVRRP 9 T 2.9 pPIWI_RE_Y pdbhh F T 7cdd 3 C C PRO-ARG-SER-PHE-LEU-VAL-ARG-ARG PRSFLVRR 8 T 9.6 HOOK pdbhh F T 7cde 3 C C PRO-ARG-SER-PHE-LEU-VAL-ARG-LYS-ARG PRSFLVRKR 9 T 1.2 HOOK pdbhh F T 7cdf 3 C C PRO-ARG-SER-PHE-LEU-VAL-ARG-ARG-LYS PRSFLVRRK 9 T 2.3 hNIFK_binding pdbhh F T 7cdg 3 C C PRO-ARG-SER-PHE-LEU-VAL-ARG-ARG-ARG PRSFLVRRR 9 T 3.5 HOOK pdbhh F T 7cfc 2 F,G,H,I F,G,H,I AGO3_DROME AGO3 NISVGRGRARLIDTLK 16 T 2.8 DUF1343 pdbhh F Eukaryota T 7cg1 1 A A A0A0L6JMH4_9FIRM Anti-sigma factor RsgI, N-terminal MVEIAINPASEITATSAFISGTVTKFEQSKGFYGSGCNISLLYWEASNPMHVKVASSISKKDFPADISATIKDLKPHTTYQFKVTVNFYFSSSLQTFKTLALESKSTSIVSTSTPTPSMPVKVTLEHHHHHH 132 T 0.0018 fn3 pdb F Bacteria T 7cg8 1 A,B,C,D A,B,C,D A0A0L6JMH4_9FIRM Anti-sigma factor RsgI, N-terminal SVSPVEIAINPASEITATSAFISGTVTKFEQSKGFYGSGCNISLLYWEASNPMHVKVASSISKKDFPADISATIKDLKPHTTYQFKVTVNFYFSSSLQTFKTLAL 105 T 0.00081 fn3 pdb F Bacteria T 7chk 2 B B Q9JGP1_9SECO VP24 protein GSDPFSFLLNYSHCGTLVESSLNKGGMWCVPVSPVNLAAYTLQGEALVFNDAFVSKTHNWLHFMASTTAYWRGTLHYQMRVTYKDRNAACRNLVAFYTTNNESLFGFNNKPVGDTGISSVMGDSFSVDITVPFLIPTCYLQTIRGKFDYLNSCNGCIYFHLPTKSATSVQLWVRPGQDFDFARFRLLKAGYT 192 T 1.4E-05 CRPV_capsid pdbhh T Viruses T 7chq 1 A A A0A125RN64_9CAUD anti-CRISPR AcrIE2 MNTYLIDPRKNNDNSGERFTVDAVDITAAAKSAAQQILGEEFEGLVYRETGESNGSGMFQAYHHLHGTNRTETTVGYPFHVMELLEHHHHHH 92 T 9.2 Baculo_E66 unphh T Viruses T 7chr 1 A A A0A2S2CJ39_9GAMM anti-CRISPR AcrIF9 MKAAYIIKEVQNINSEREGTQIEATSLSQAKRIASKEQCFHGTVMRIETVNGLWLAYKEDGKRWVDCQLEHHHHHH 76 T 1.7 YhzD unphh F Bacteria T 7ci1 1 A,B A,B AcrVA2 SMHHTIARMNAFNKAFANAKDCYKKMQAWHLLNKPKHAFFPMQNTPALDNGLAALYELRGGKEDAHILSILSRLYLYGAWRNTLGIYQLDEEIIKDCKELPDDTPTSIFLNLPDWCVYVDISSAQIATFDDGVAKHIKGFWAIYDIVEMNGINHDVLDFVVDTDTDDNVYVPQPFILSSGQSVAEVLDYGASLFDDDTSNTLIKGLLPYLLWLCVAEPDITYKGLPVSREELTRPKHSINKKTGAFVTPSEPFIYQIGERLGSEVRRYQSIIDGEQKRNRPHTKRPHIRRGHWHGYWQGTGQAKEFRVRWQPAVFVNSGRVSS 323 T 10 SeqA_N pdbhh F T 7ci2 2 C,D C,D A0A0U2B2X7_9GAMM MbCpf1 NTGKSVYQKMIYKLLPGPNKMLPKVFFAKSNLD 33 T 23 DUF5100 pdbhh F Bacteria T 7cio 2 B B CTLA4_HUMAN CYTOTOXIC T-LYMPHOCYTE-ASSOCIATED ANTIGEN 4,CTLA-4 GVXVKMPP 8 T 0.2 TMEM190 unppssm F Eukaryota T 7ciz 4 J,K,L D,H,L DNJC9_HUMAN HDJC9,DNAJ PROTEIN SB73 GPLGSKESKQKMNARKRRAQEEAKEAEMSRKELGLDEGVDSLKAAIQSRQKDRQKEMDNFLAQMEAKYSKSSKGG 75 T 0.032 CobN-Mg_chel pdbpercent F Eukaryota T 7cj0 4 D,H D,A DNJC9_HUMAN HDJC9,DNAJ PROTEIN SB73 GPLGSEVPSYNAFVKESKQKMNARKRRAQEEAKEAEMSRKELGLDEGVDSLKAAIQSRQKDRQKEMDNFLAQMEAKYSKSSKGG 84 T 0.0043 CobN-Mg_chel pdbpssm F Eukaryota T 7ck5 1 A A B1B578_P1AMV PlAMV replicase peptide from RNA-dependent RNA polymerase FEDILSGNLLQRMLRPLRSGLTQLLDFF 28 T 11 Lambda_CIII pdbhh T Viruses T 7cl0 1 A A SIR6_HUMAN REGULATORY PROTEIN SIR2 HOMOLOG 6,SIR2-LIKE PROTEIN 6 MSVNYAAGLSPYADKGKCGLPEIFDPPEELERKVWELARLVWQSSSVVFHTGAGISTASGIPDFRGPHGVWTMEERGLAPKFDTTFESARPTQTHMALVQLERVGLLRFLVSQNVDGLHVRSGFPRDKLAELHGNMFVEECAKCKTQYVRDTVVGTMGLKATGRLCTVAKARGLRACRGELRDTILDWEDSLPDRDLALADEASRNADLSITLGTSLQIRPSGNLPLATKRRGGRLVIVNLQPTKHDRHADLRIHGYVDEVMTRLMKHLGLEIPAWDGPRVLERALPPLPRPPTPKLEPKEESPTRINGSIPAGPKQEPCAQHNGSEPASPKRERPTSPAPHRPPKRVKAKAVPS 355 T 3.2E-07 SIR2 unppercent F Eukaryota T 7clv 2 C C COX4_YEAST COX4 isoform 1 MLSLRQSIRFFKPATRTLCSSRYLL 25 T 9.7 OTCace_N pdbhh F Eukaryota T 7cma 1 A,B A,C A0A2X0TC55_ASF OXYGENASE MNKKIIVMMALLHKEKLIECIYHELENGGTILLLTKNIVVSEISYIGNTYKYFTFNDNHDLISKEDLKGATSKNIAKMIYNWIIKNPQNNKIWSGEPRTQIYFENDLYHTNYNHKCIKDFWNVSTSVGPHIFNDRSIWCTKCTSFYPFTNIMSPNIFQ 158 T 0.12 ox_reductase_C pdbpssm T Viruses T 7cmx 1 A,B,C,D A,B,C,D Q81GQ9_BACCR Isocitrate lyase MKNERIEKLQESWELDERWEGITRPYSAEDVIRLRGSIDIEHTLARRGAEKLWTSLHTEDYINALGALTGNQAMQQVKAGLKAIYLSGWQVAADANLSGHMYPDQSLYPANSVPAVVKRINQTLQRADQIQHMEGSDDTDYFVPIVADAEAGFGGQLNVFELMKGMIEAGASGVHFEDQLSSEKKCGHLGGKVLLPTQTAVRNLISARLAADVMGVPTIIVARTDADAADLITSDIDPVDKAFITGERTPEGFYRTNAGLDQAIARGLAYAPYADLVWCETSEPNLEDAKRFADAIHKEHPGKLLAYNCSPSFNWKQKLDEKAIASFQKEIASYGYKFQFVTLAGFHSLNYGMFELARGYKERGMAAYSELQQAEFAAEKHGYSATRHQREVGTGYFDEVAQVITGGTSSTTALKGSTEEAQFTKLEHHHHHH 433 T 1.6E-47 ICL pdb F Bacteria T 7cmz 2 B B PHF8_HUMAN PHD FINGER PROTEIN 8,[HISTONE H3]-DIMETHYL-L-LYSINE(36) DEMETHYLASE PHF8,[HISTONE H3]-DIMETHYL-L-LYSINE(9) DEMETHYLASE PHF8 GACFKDAEYIYPSLESDDDDPA 22 T 8.7 Ph1570 pdbhh F Eukaryota T 7cn6 1 A,B,C A,B,C SPAC_BPT4 Protein spackle GYDKDLCEWSMTADQTEVETQIEADIMNIVKRDRPEMKAEVQKQLKSGGVMQYNYVLYCDKNFNNKNIIAEVVGE 75 T 0.068 Autoind_synth pdb T Viruses T 7cna 2 B,D B,E SPNDC_HUMAN SPIN1-DOCKING PROTEIN,SPIN-DOC ETFAAPAEVRHFTDGSFPAGFVLQLFSHTQ 30 T 29 DUF2852 pdbhh F Eukaryota T 7cna 4 F F ALA-ARG-THR-M3L-GLN-THR-ALA-ARG-M3L-SER-GLY ARTKQTARKSGG 12 T 0.24 Histone pdbhh F T 7cnc 2 B B DGCR8_HUMAN DIGEORGE SYNDROME CRITICAL REGION 8 PRTARHAPAVRKFSPDLKLLKDVKISVSFTE 31 T 7.7 CoV_NSP15_M pdbhh F Eukaryota T 7cnw 2 B,D B,D PSD_ECOLI Phosphatidylserine decarboxylase alpha chain XTVINLFAPGKVNLVEQLESLSVTKIGQPLAVSTGHHHHHHG 42 T 13 Herpes_UL51 pdbhh F Bacteria T 7co1 2 B,D,F B,D,F CBP_HUMAN HISTONE LYSINE ACETYLTRANSFERASE CREBBP,PROTEIN-LYSINE ACETYLTRANSFERASE CREBBP GPPPAAVEAARQIEREAQQQQHLYSDED 28 T 0.8 WPP pdbhh F Eukaryota T 7co5 1 A,C,E,G,I,K G,A,C,E,I,K decapeptide SVRDELRWVF SVRDELRWVF 10 T 9.7 Chisel pdbhh F T 7coy 6 BA,F,Q cF,aF,bF B0C7S7_ACAM1 Photosystem I protein PsaF MRRLFAVLLVMTLFLGVVPPASADIGGLVPCSESPKFQERAAKARNTTADPNSGQKRFEMYSSALCGPEDGLPRIIAGGPMRRAGDFLIPGLFFIYIAGGIGNSSRNYQIANRKKNAKNPAMGEIIIDVPLAVSSTIAGMAWPLTAFRELTSGELTVPDSDVTVSPR 167 T 3.4E-06 PSI_PsaF pdbpssm F Bacteria T 7coy 7 CA,G,R cI,aI,bI Photosystem I protein Psa27 MISDILPAIMTPLVVLIGGGAAMTAFFYYVEREG 34 T 0.0026 PSI_8 pdbpercent F T 7cp1 1 A,B A,B ACEA_MYCTU ICL,ISOCITRASE,ISOCITRATASE MSVVGTPKSAEQIQQEWDTNPRWKDVTRTYSAEDVVALQGSVVEEHTLARRGAEVLWEQLHDLEWVNALGALTGNMAVQQVRAGLKAIYLSGWQVAGDANLSGHTYPDQSLYPANSVPQVVRRINNALQRADQIAKIEGDTSVENWLAPIVADGEAGFGGALNVYELQKALIAAGVAGSHWEDQLASEKKCGHLGGKVLIPTQQHIRTLTSARLAADVADVPTVVIARTDAEAATLITSDVDERDQPFITGERTREGFYRTKNGIEPCIARAKAYAPFADLIWMETGTPDLEAARQFSEAVKAEYPDQMLAYNCSPSFNWKKHLDDATIAKFQKELAAMGFKFQFITLAGFHALNYSMFDLAYGYAQNQMSAYVELQEREFAAEERGYTATKHQREVGAGYFDRIATTVDPNSSTTALTGSTEEGQFHLEHHHHHH 436 T 1.8E-47 ICL unp F Bacteria T 7cp2 1 A,B,C A,B,C PP62_ASFB7 CP530R CDS PROTEIN,CP530R PROTEIN,PP62 PSNMKQFCKISVWLQQHDPDLLEIINNLCMLGNLSAAKYKHGVTFIYPKQAKIRDEIKKHAYSNDPSQAIKTLESLILPFYIPTPAEFTGEIGSYTGVKLEVEKTEANKVILKNGEAVLVPAADFKPFPDRRLAVWIMESGSMPLEGPPYKR 152 T 0.34 Fasciclin unppssm T Viruses T 7cpo 3 C C HIS-VAL-TYR-GLY-PRO-LEU-LYS-PRO-ILE HVYGPLKPI 9 T 0.63 DUF952 pdbhh F T 7cqh 2 B A TRP_DROME Transient receptor potential protein GPMNQTQLIEFNPNLGDVTRATRVAYVKFMRKKMAADEVSLADD 44 T 0.18 LEM pdbpssm F Eukaryota T 7cqp 2 B C TRPC4_MOUSE TRPC4,CAPACITATIVE CALCIUM ENTRY CHANNEL TRP4,RECEPTOR-ACTIVATED CATION CHANNEL TRP4 GPDKRKNLSLFDLTTLIHPRSAAIASERHN 30 T 4.1 Pox_RNA_Pol_19 pdbhh F Eukaryota T 7cqv 2 C B TRP_DROME Transient receptor potential protein GPNNNWDVPDIEKKSQGVARTTKGKVMERRILKDFQIGFVENLKQEMSESESGRDIFSSLAKVIGRKKTQKGDKDWNAIARK 82 T 5.3 DUF1331 pdbhh F Eukaryota T 7crb 1 A J ATR1_HYAAE ARABIDOPSIS THALIANA RECOGNIZED PROTEIN 1 MRVCYFVLVPSVALAVIATESSETSGTIVHVFPLRDVADHRNDALINRALRAQTALDDDEERWPFGPSAVEALIETIDRHGRVSLNDEAKMKKVVRTWKKLIERDDLIGEIGKHYFEAPGPLHDTYDEALATRLVTTYSDRGVARAILHTRPSDPLSKKAGQAHRLEEAVASLWKGRGYTSDNVVSSIATGHDVDFFAPTAFTFLVKCVESEDDANNAIFEYFGSNPSRYFSAVLHAMEKPDADSRVLESSKKWMFQCYAQKQFPTPVFERTLAAYQSEDYAIRGARNHYEKLSLSQIEELVEEYSRIYSV 311 T 0.0016 RXLR pdbhh F Eukaryota T 7cu6 1 A A lasso peptide C24_A11V2C LCVIVQADWNCPGWF 15 T 1.3 Exo_endo_phos pdbhh F T 7cui 1 A,C A,C POT1_SCHPO Protection of telomeres protein 1 SENPFIAHELKQTSVNEITAHVINEPASLKLTTISTILHAPLQNLLKPRKHRLRVQVVDFWPKSLTQFAVLSQPPSSYVWMFALLVRDVSNVTLPVIFFDSDAAELINSSKIQPCNLADHPQMTLQLKERLFLIWGNLEERIQHHISKGESPTLAAEDVETPWFDIYVKEYIPVIGNTKDHQSLTFLQKRWRGFGTKIV 199 T 7.8 CDC24_OB3 pdbhh F Eukaryota T 7cui 2 B,D B,D TPZ1_SCHPO MEIOTICALLY UP-REGULATED GENE 169 PROTEIN SQQEKPNDNTSNSRDIKNNIQFHWKNMTSLSIEECIIPKGQQLILEKESEENTTHGIYLEERKMAQGLHNSVSETPE 77 T 0.0062 TEBP_beta unphh F Eukaryota T 7cuj 1 A,C A,B CCQ1_SCHPO STRUCTURAL MAINTENANCE OF CHROMOSOMES PROTEIN CCQ1,SMC PROTEIN CCQ1 ITKSSKSSFSVLDIGLPMSALQRKMMHRLVQYFAFCIDHFCTGPSDSRIQEKIRLFIQSAHNIAKHPSLYDTEVRNPSAAESTNSHVSLDASNFSSYAENSSKFLFLQELFKNLSPSYSKTFFLFISNQFLANTLTQWLKSQNIDAELWAEEDAKTSQHPAIWICVSKKAPSASHFLQSCPDLSATIFYDIEAYMSVTSSLPSIQSLVLRLIHLGSIEHAIKCFQSSYNASFLVNIVGVVATLSSSSEENSEASNLSTLFEKSGNFEEILGSESHSSITEKTRDIAKNVATWLKNGENFSSWPLPPLMDLASLSVAE 317 T 0.00039 HDA2-3 unppssm F Eukaryota T 7cuj 2 B,D C,D TPZ1_SCHPO MEIOTICALLY UP-REGULATED GENE 169 PROTEIN QIELEYKRKPIPDYDFMKGLETTLQELYVEHQSKKRRLELFQLTN 45 T 0.071 Radical_SAM_N unp F Eukaryota T 7cun 7 G H INT8_HUMAN INT8 MSAEAADREAATSSRPCTPPQTCWFEFLLEESLLEKHLRKPCPDPAPVQLIVQFLEQASKPSVNEQNQVQPPPDNKRNRILKLLALKVAAHLKWDLDILEKSLSVPVLNMLLNELLCISKVPPGTKHVDMDLATLPPTTAMAVLLYNRWAIRTIVQSSFPVKQAKPGPPQLSVMNQMQQEKELTENILKVLKEQAADSILVLEAALKLNKDLYVHTMRTLDLLAMEPGMVNGETESSTAGLKVKTEEMQCQVCYDLGAAYFQQGSTNSAVYENAREKFFRTKELIAEIGSLSLHCTIDEKRLAGYCQACDVLVPSSDSTSQQLTPYSQVHICLRSGNYQEVIQIFIEDNLTLSLPVQFRQSVLRELFKKAQQGNEALDEICFKVCACNTVRDILEGRTISVQFNQLFLRPNKEKIDFLLEVCSRSVNLEKASESLKGNMAAFLKNVCLGLEDLQYVFMISSHELFITLLKDEERKLLVDQMRKRSPRVNLCIKPVTSFYDIPASASVNIGQLEHQLILSVDPWRIRQILIELHGMTSERQFWTVSNKWEVPSVYSGVILGIKDNLTRDLVYILMAKGLHCSTVKDFSHAKQLFAACLELVTEFSPKLRQVMLNEMLLLDIHTHEAGTGQAGERPPSDLISRVRGYLEMRLPDIPLRQVIAEECVAFMLNWRENEYLTLQVPAFLLQSNPYVKLGQLLAATCKELPGPKESRRTAKDLWEVVVQICSVSSQHKRGNDGRVSLIKQRESTLGIMYRSELLSFIKKLREPLVLTIILSLFVKLHNVREDIVNDITAEHISIWPSSIPNLQSVDFEAVAITVKELVRYTLSINPNNHSWLIIQADIYFATNQYSAALHYYLQAGAVCSDFFNKAVPPDVYTDQVIKRMIKCCSLLNCHTQVAILCQFLREIDYKTAFKSLQEQNSHDAMDSYYDYIWDVTILEYLTYLHHKRGETDKRQIAIKAIGQTELNASNPEEVLQLAAQRRKKKFLQAMAKLYF 995 T 0.0069 TPR_12 pdbpercent F Eukaryota T 7cwj 1 A,B,C,D A,B,C,D G9MQD3_HYPVG Root induced effector protein Tsp1 MAAPTPADKSMMAAVPEWTITNLKRVCNAGNTSCTWTFGVDTHLATATSCTYVVKANANASQASGGPVTCGPYTITSSWSGQFGPNNGFTTFAVTDFSKKLIVWPAYTDVQVQAGKVVSPNQSYAPANLPLEHHHHHH 138 T 5.5 DUF6520 unphh F Eukaryota T 7cwz 1 A,B,C A,B,C TYRDC_ENTFA TYROSINE DECARBOXYLASE MKNEKLAKGEMNLNALFIGDKAENGQLYKDLLIDLVDEHLGWRQNYMPQDMPVISSQERTSKSYEKTVNHMKDVLNEISSRMRTHSVPWHTAGRYWGHMNSETLMPSLLAYNFAMLWNGNNVAYESSPATSQMEEEVGHEFAHLMSYKNGWGHIVADGSLANLEGLWYARNIKSLPFAMKEVKPELVAGKSDWELLNMPTKEIMDLLESAEDEIDEIKAHSARSGKHLQAIGKWLVPQTKHYSWLKAADIIGIGLDQVIPVPVDHNYRMDINELEKIVRGLAEEQIPVLGVVGVVGSTEEGAVDSIDKIIALRDELMKDGIYYYVHVDAAYGGYGRAIFLDEDNNFIPYEDLQDVHEEYGVFKEKKEHISREVYDAYKAIELAESVTIDPHAMGYIPYSAGGIVIQDIRMRDVISYFATYVFEKGADIPALLGAYILEGSKAGATAASVWAAHHVLPLNVAGYGKLIGASIEGSHHFYNFLNDLTFKVGDKEIEVHTLTHPDFNMVDYVFKEKGNDDLVAMNKLNHDVYDYASYVKGNIYNNEFITSHTDFAIPDYGNSPLKFVNSLGFSDEEWNRAGKVTVLRAAVMTPYMNDKEEFDVYAPKIQAALQEKLEQIYDVK 620 T 3.9E-18 Pyridoxal_deC pdbpssm F Bacteria T 7cyl 2 B B FUS_HUMAN 75 KDA DNA-PAIRING PROTEIN,ONCOGENE FUS,ONCOGENE TLS,POMP75,TRANSLOCATED IN LIPOSARCOMA PROTEIN GGSRGGYDRGGYRGRGGDRGGFRGGRGGGDRGGFGPGKMDSRGEHRQDRRERLY 54 T 510 DUF6114 pdbhh F Eukaryota T 7cz6 1 A C A0A7M3VBX7_9VIRU Capsid protein IDCDSSVFGNNFNITTSPQTLTMSGPLAPGKYQTTLTVQALIGGTGVVVGTVTFAGKTVAYQVFDDSFASFDLGTVTVSASTTPSVIWTGSTGATLTMAVNIICKPITPTSVAISGQPIWTTPYAP 126 T 0.11 DUF3459 pdb T Viruses T 7czm 2 C,D C,D OPTN_HUMAN Optineurin LIR SSEDSFVEIRMAE 13 T 11 DUF5856 pdbhh F Eukaryota T 7d0e 2 B B CCPG1_HUMAN Cell cycle progression protein 1 FIR2 SDDSDIVTLEPPK 13 T 39 YodL pdbhh F Eukaryota T 7d0j 13 M O A8JCL6_CHLRE Photosystem I subunit O ASNKSFPRDWVKTDPLVPVLGFAGWTIPANIGVSAFGGQSLFGLFTQSIGENLAHFPTGPALDDKFWLYLITYHLGLFLTITLGQIGVQGRKQ 93 T 25 YkpC pdbhh F Eukaryota T 7d0k 1 A,B A,B A0A7M3VBX7_9VIRU Capsid protein PISADFSEVENAPSFLSLAENTDEVLKPYTGLEIQTIITNIVGDANPNQSRIFDQDRLRGNQYSAGGLVTQNAVSAIPFTNLIPRTIRVGNILVNSANRLQITETNVSEYYSNPIIATKLSEMISDQVKNNQFSTWRRDNTSLQGFNAFDIATINTAILPNGLSLESMLLKLSLLHSIKAMNVDAASINRSQYQVIDHNTVPTIGAPAVVGVNNSPVFGEDCGGNNPVYPFGGGTGAIAFHVTLQTVPDERKSYAIFVPPAILQATSDANEALALFALSMSEWPHALYTVTKQTTDLAGANAGQQVFIPTQSTIHIGGRRVLDLIIPRREIAPNPTTLVAANAMCMVRPQAGPDATAGAIPLAAGQLFNMNFIGAPAFEEWPMTSYLYSWAGRFDITTIRQYMGRLATMVGVKDAYWAAHELNVALSQVAPKMTTAAGGWAAQAANSAQQSDVCYSSLLTVTRSAANFPLANQPAADMRVYDTDPATWNKVALGLATAANLVPEQSMDVPFVVGDARASFWERLQAIPMCIAWTMYYHSRGITTLAWDNAYTDNTNKWLQKMVRNTFSTTQSVGTIIPARYGKIVCNLYKNMFHRAPAYVATSVGGKELHITHFERWLPGGTYANVYSGAGAVVNCFSPVLIPDIWCQYFTAKLPLFAGAFPPAQGQNSTKGFNSKQGLMIHRNQNNNLVAPYLEKFADNSSYFPVGQGPEINDMATWNGRLWMTTGNVQYLDYSGAAIVEAVPPAGELPVGKQIPLLAGENAPIELTNAATTCVPRYSNDGRRIFTYLTTAQSVIPVQACNRAANLARSCWLLSNVYAEPALQALGDEVEDAFDTLTNSSFLDVAKSVAESAGEVPATKALTDLQAVDVSSLPSTSDPSNVLSQPAPLMSPPTSSS 897 T 0.17 TPP_enzyme_C pdb T Viruses T 7d13 1 A A VGF_HUMAN Neurosecretory protein VGF SQAEATRQAAAQEERLADLASDLLLQYLLQGGARQRGLG 39 T 3.6 AcylCoA_dehyd_C pdbhh F Eukaryota T 7d2f 1 A,B A,B Q5ZWG6_LEGPH HISTIDINE ACID PHOSPHATASE DKLIFAVDIIRHGDRTPIVALPTVNYQWQEGLGQLTAEGMQQEYKMGVAFRKKYIEESHLLPEHYEYGTIYVRSTDYARTLMSAQSLLMGLYPPGTGPTIPAGTSALPHAFQPIPVFSAPSKYDEVIIQQVDRKEREKLMEQYVFSTREWQQKNNELKDKYPLWSRLTGINIDNLGDLETVGHTLYIHQIHNAPMPEGLASNDIETIINSAEWAFMAQEKPQQIANVYSSKLMTNIADYLNSGSMKKSKLKYVLLSAHATTIASVLSFLGAPLEKSPPYASNVNFSLYDNGANYYTVKITYNGNPVSIPACGGSVCELQQLINLVHDS 328 T 0.012 His_Phos_2 pdbpssm F Bacteria T 7d2l 1 A A 12i1-D647A MSNKEKNASETRKAYTTKMIPRSHDRMKLLGNFMDYLMDGTPIFFELWNQFGGGIDRDIISGTANKDKISDDLLLAVNWFKVMPINSKPQGVSPSNLANLFQQYSGSEPDIQAQEYFASNFDTEKHQWKDMRVEYERLLAELQLSRSDMHHDLKLMYKEKCIGLSLSTAHYITSVMFGTGAKNNRQTKHQFYSKVIQLLEESTQINSVEQLASIILKAGDCDSYRKLRIRCSRKGATPSILKIVQDYELGTNHDDEVNVPSLIANLKEKLGRFEYECEWKCMEKIKAFLASKVGPYYLGSYSAMLENALSPIKGMTTKNCKFVLKQIDAKNDIKYENEPFGKIVEGFFDSPYFESDTNVKWVLHPHHIGESNIKTLWEDLNAIHSKYEEDIASLSEDKKEKRIKVYQGDVCQTINTYCEEVGKEAKTPLVQLLRYLYSRKDDIAVDKIIDGITFLSKKHKVEKQKINPVIQKYPSFNFGNNSKLLGKIISPKDKLKHNLKCNRNQVDNYIWIEIKVLNTKTMRWEKHHYALSSTRFLEEVYYPATSENPPDALAARFRTKTNGYEGKPALSAEQIEQIRSAPVGLRKVKKRQMRLEAARQQNLLPRYTWGKDFNINICKRGNNFEVTLATKVKKKKEKNYKVVLGYAANIVRKNTYAAIEAHANGDGVIDYNDLPVKPIESGFVTVESQVRDKSYDQLSYNGVKLLYCKPHVESRRSFLEKYRNGTMKDNRGNNIQIDFMKDFEAIADDETSLYYFNMKYCKLLQSSIRNHSSQAKEYREEIFELLRDGKLSVLKLSSLSNLSFVMFKVAKSLIGTYFGHLLKKPKNSKSDVKAPPITDEDKQKADPEMFALRLALEEKRLNKVKSKKEVIANKIVAKALELRDKYGPVLIKGENISDTTKKGKKSSTNSFLMDWLARGVANKVKEMVMMHQGLEFVEVNPNFTSHQDPFVHKNPENTFRARYSRCTPSELTEKNRKEILSFLSDKPSKRPTNAYYNEGAMAFLATYGLKKNDVLGVSLEKFKQIMANILHQRSEDQLLFPSRGGMFYLATYKLDADATSVNWNGKQFWVCNADLVAAYNVGLVDIQKDFKKKLEHHHHHH 1101 T 25 DUF4060 pdbhh F T 7d2o 1 A A Q9BLZ2_9MAXI GLUC TGKPTENNEDFNIVAVASNFATTDLDADRGKLPGKKLPLEVLKEMEANARKAGCTRGCLICLSHIKCTPKMKKFIPGRCHTYEGDKESAQGGIGEAIVDIPAIPRFKDLEPMEQFIAQVDLCVDCTTGCLKGLANVQCSDLLKKWLPQRCATFASKIQGQVDKIKGAGGDIEGR 174 T 0.024 GASA pdbpssm F Eukaryota T 7d3d 2 B,D a,b GLU-VAL-SER-ILE-ILE-GLN-GLY-ALA-ASP-SER-THR-THR EVSIIQGADSTT 12 T 10 DUF6180 pdbhh F T 7d3j 1 A A 12i1-WT MSNKEKNASETRKAYTTKMIPRSHDRMKLLGNFMDYLMDGTPIFFELWNQFGGGIDRDIISGTANKDKISDDLLLAVNWFKVMPINSKPQGVSPSNLANLFQQYSGSEPDIQAQEYFASNFDTEKHQWKDMRVEYERLLAELQLSRSDMHHDLKLMYKEKCIGLSLSTAHYITSVMFGTGAKNNRQTKHQFYSKVIQLLEESTQINSVEQLASIILKAGDCDSYRKLRIRCSRKGATPSILKIVQDYELGTNHDDEVNVPSLIANLKEKLGRFEYECEWKCMEKIKAFLASKVGPYYLGSYSAMLENALSPIKGMTTKNCKFVLKQIDAKNDIKYENEPFGKIVEGFFDSPYFESDTNVKWVLHPHHIGESNIKTLWEDLNAIHSKYEEDIASLSEDKKEKRIKVYQGDVCQTINTYCEEVGKEAKTPLVQLLRYLYSRKDDIAVDKIIDGITFLSKKHKVEKQKINPVIQKYPSFNFGNNSKLLGKIISPKDKLKHNLKCNRNQVDNYIWIEIKVLNTKTMRWEKHHYALSSTRFLEEVYYPATSENPPDALAARFRTKTNGYEGKPALSAEQIEQIRSAPVGLRKVKKRQMRLEAARQQNLLPRYTWGKDFNINICKRGNNFEVTLATKVKKKKEKNYKVVLGYDANIVRKNTYAAIEAHANGDGVIDYNDLPVKPIESGFVTVESQVRDKSYDQLSYNGVKLLYCKPHVESRRSFLEKYRNGTMKDNRGNNIQIDFMKDFEAIADDETSLYYFNMKYCKLLQSSIRNHSSQAKEYREEIFELLRDGKLSVLKLSSLSNLSFVMFKVAKSLIGTYFGHLLKKPKNSKSDVKAPPITDEDKQKADPEMFALRLALEEKRLNKVKSKKEVIANKIVAKALELRDKYGPVLIKGENISDTTKKGKKSSTNSFLMDWLARGVANKVKEMVMMHQGLEFVEVNPNFTSHQDPFVHKNPENTFRARYSRCTPSELTEKNRKEILSFLSDKPSKRPTNAYYNEGAMAFLATYGLKKNDVLGVSLEKFKQIMANILHQRSEDQLLFPSRGGMFYLATYKLDADATSVNWNGKQFWVCNADLVAAYNVGLVDIQKDFKKKLEHHHHHH 1101 T 0.38 DUF1910 pdbpercent F T 7d55 1 A,B,C,D A,B,C,D A0A2Z6FZW5_9CAUD Putative N-acetylmuramoyl-L-alanine amidase MYCLYERPINSKTGVLEWNGDAWTVMFCNGVNCRRVSHPDEMKVIEDIYRKNNGKDIPFYSQKEWNKNAPWYNRLETVCPVVGITKKS 88 T 0.0018 Lipoprotein_15 pdbpssm T Viruses T 7d6c 2 C,D,E,F 3,4,5,F CCMN_SYNE7 CARBON DIOXIDE CONCENTRATING MECHANISM PROTEIN CCMN,ORF I FQSNMHLPPLEPPISDRYFASGEVTIAADVVIAPGVLLIAEADSRIEIASGVCIGLGSVIHARGGAIIIQAGALLAAGVLIVGQSIVGRQACLGASTTLVNTSIEAGGVTAPGSLLSAETPP 122 T 1.2E-05 Fucokinase unphh F Bacteria T 7d6f 2 B B KDIS_RAT ARMS, ANKYRIN REPEAT-RICH MEMBRANE-SPANNING PROTEIN GPGSSSESTGFGEERESIL 19 T 29 CCDC85 pdbhh F Eukaryota T 7d6v 3 C C A0QPJ4_MYCS2 Succinate dehydrogenase (Membrane anchor subunit) MSAPTADRRATGVFSPRRAQIPERTLRTDRWWQAPLLTNLGLAAFVIYATIRAFWGSAYWVADYHYLTPFYSPCVSTACAPGSSHFGQWVGDLPWFIPMAFISLPFLLAFRLTCYYYRKAYYRSVWQSPTACAVAEPHAKYTGETRFPLILQNIHRYFFYAAVLISLVNTYDAITAFHSPSGFGFGLGNVILTGNVILLWVYTLSCHSCRHVTGGRLKHFSKHPVRYWIWTQVSKLNTRHMLFAWITLGTLVLTDFYIMLVASGTISDLRFIGHHHHHHHHHH 283 T 0.044 IncE unppercent F Bacteria T 7d7c 5 F F gp55 MSETKPKYNYVNNKELLQAIIDWKTELANNKDPNKVVRQNDTIGLAIMLIAEGLSKRFNFSGYTQSWKQEMIADGIEASIKGLHNFDETKYKNPHAYITQACFNAFVQRIKKERKEVAKKYSYFVHNVYDSRDDDMVALVDETFIQDIYDKMTHYEESTYRTPGAEKKSVVDDSPSLDFLYEAND 185 T 0.0025 Sigma70_r2 pdbpssm F T 7d87 2 B E V9H1G0_HUMAN Gene for histone H3 (germline gene) ARTKQTARKSTGGKAPRKQLATKAA 25 T 0.044 PAF unp F Eukaryota T 7d8a 2 B E V9H1G0_HUMAN Gene for histone H3 (germline gene) ARTKQTARKSTGGSGSGS 18 T 0.044 PAF unp F Eukaryota T 7dbt 1 A,B A,B A0A1D1VPD8_RAMVA AMNP/g12777 GSHMGRFADFFRIETEIQRLDNPAGILANGKKCDFTGACDPVVTAFLDLESPLSPWPGSVAASKWKTIFEATDQNSPTIGRSVIRDMCGGSASNVNLRVLVNDADSLSSQDEIGKFSCLFQLDARDVAMDSLSAQWGPSTECTAEAQQGKIRLFARRRAFEIPSTSCRAPSSL 173 T 0.082 MNNL pdbpssm F Eukaryota T 7dcv 1 A A PD1L1_HUMAN HPD-L1,B7 HOMOLOG 1,B7-H1 AHPPNERTHLVILGAILLALGVALTFIFRLRKGRLLDVKKSGIQDTNSKKQSDTHLEET 59 T 0.0032 RIF5_SNase_1 pdbpssm F Eukaryota T 7dd1 1 A A SRPK1_HUMAN SRSF protein kinase 1,SRSF protein kinase 1 HHHHHHSSGLVPRGSHMDPNDYCKGGYHLVKIGDLFNGRYHVIRKLGWGHFSTVWLSWDIQGKKFVAMKVVKSAEHYTETALDEIRLLKSVRNSDPNDPNREMVVQLLDDFKISGVNGTHICMVFEVLGHHLLKWIIKSNYQGLPLPCVKKIIQQVLQGLDYLHTKCRIIHTDIKPENILLSVNEQYIRRLAAEATEWQRSGAPPPSGSAVSTAPATAGNFLVNPLEPKNAEKLKVKIADLGNACWVHKHFTEDIQTRQYRSLEVLIGSGYNTPADIWSTACMAFELATGDYLFEPHSGEEYTRDEDHIALIIELLGKVPRKLIVAGKYSKEFFTKKGDLKHITKLKPWGLFEVLVEKYEWSQEEAAGFTDFLLPMLELIPEKRATAAECLRHPWLNS 398 T 3.2 RFXA_RFXANK_bdg unppercent F Eukaryota T 7dda 1 A A A0A2U9GDM4_WSSV ENVELOPE PROTEIN VP37 ADFLLDRMTPVSEEDIEGFAASTFKEVSDSKTATVIVKADCETGDIDEVYNLAPSFGVTQEIKIYRSNNSSELDNVADSFHIYKISATDSDSGNTKKLLYGLRNKKAGYTCLCRIFAEIESDGIMANTNIGVAENNRDEIDENEEGKYGFLIPKQPAGAKLIIYFFLNCWT 171 T 12 DUF5043 pdbhh T Viruses T 7de2 1 A,B A,B NVFI_ASPN1 NvfI MGSSHHHHHHSSGLVPRGSHMVGSRTWCESEMLFVQPDAGTKEELYYRVTPKPGQTQANFNWTPHKVRFHDARPQRDSFDLNTHGFTFVEDAISPQLIERIRADDTAAVEGDYFASVAALVKRVTGADHVVCFSPYTRKENSEKGIFGQPARTVHCDHTPAAAIELTHKLCGEDAVRLLQSRFRAFSVWRPLVEPVLDWPLAVVDGRTIAPDDLHPVHWLRYEKKDTEPPFQLSFSETQKWYYLSRQRSDEVSIVKNYDSEVVPSPRSAHCAFKHPFVPKDAPPRESIDVRCLVFGGR 298 T 0.28 EF-hand_5 unppercent F Eukaryota T 7de7 1 A,B A,B PDZD7_MOUSE PDZ domain-containing protein 7 GPGSTVNEQVQAWESRRPLIQDLARRLLTDDEVLAVTRHCSRYVHEGGVEDLVRPLLAILDRPTKLLLLRDIRSVVAPTDLGRFDSMVMPVELEAFEALKSRAVG 105 T 0.005 CCM2_C pdbhh F Eukaryota T 7deg 2 B,E C,F G0LWX8_AQUAO Cytochrome oxidase subunit IIa FFPSGTIAFFIFMMVFYAVLWFMIYWVLLERG 32 T 0.0016 CoxIIa pdb F Bacteria T 7df2 1 A A A0A1D1UCW7_RAMVA C2 domain protein MGSSHHHHHHHSSENLYFQGLIDRGQDLADVAKYPLITGFFRIEMNVVRLDTQGKSHTGLPCDIFDKCDPKIIAFIDTEKPNNDFGGDSVPYSNYITLVDANNTPDVVEIDKTISRDVCGKGVRKIAMRVRAIDKDGLNDDKIDNYKCHITGERNPPAENEKVAQWSPEIACAGEDRASSKVYLRYRWYNIPESTCRPSSNGQGLFSGLFSR 212 T 0.36 UPAR_LY6_2 pdbpssm F Eukaryota T 7df9 2 B V V2R_HUMAN VASOPRESSIN V2 RECEPTOR PHOSPHOPEPTIDE RTPPSLGPQDESCTTASSSLRKD 23 T 27 DUF6352 pdbhh F Eukaryota T 7dfa 4 D V V2R_HUMAN VaRpp-4 RTPPSLGPQDESCTTASSSLAKD 23 T 110 DUF6352 pdbhh F Eukaryota T 7dfc 2 B V V2R_HUMAN V2Rpp-3 RTPPSLGPQDESCTTASSSLRK 22 T 23 DUF6352 pdbhh F Eukaryota T 7dgu 1 A A de novo designed protein H4A1R MGDEYKKYYQQAIQLIQQLKKALEGNPEMKKLADKVLALLKQAYAAFKAGRSPEEIRALLRKAIEAAKKLAKLGASLGGFDLAKRIIELLKKMYELGGLEHHHHHH 106 T 0.0022 Gp-FAR-1 pdb F T 7dgw 1 A A de novo designed protein H4A2S MGEDYLKLLEEALKIAREVLENYPLTPVMRAAARAIIEAVKMAKKYGDEELIKLVVEAARLLRQAAKQGDLELARQALAAARQALAFARRVAGLEHHHHHH 101 T 0.05 VMAP-M8 pdb F T 7dgx 1 A,B A,B Q57W63_TRYB2 Coronin EFSQLLALASLLGQQQAEVQRCREDLQKKESLVMETIAKIKALALEHHHHHH 52 T 0.079 PEARLI-4 pdbpssm F Eukaryota T 7dgy 1 A A de novo designed protein H4C2R MGHPEIVAAAVAFVRQIWEYARQGMSLDEMIAWAVKYAKKIFDLVKKMGASDEVLKKVMDAVLAAAQAYAQQLNDEAAQRLLVAAQVIVQVLQQLGLEHHHHHH 104 T 1.3 DUF2277 pdb F T 7dh4 1 A,B A,B Q57W63_TRYB2 Coronin EFSQLLALASLLGQQQAEIQRCREDLQKKESLMMETIAKIKALALEHHHH 50 T 0.11 DUF2486 pdb F Eukaryota T 7dhb 1 A,B A,B Q57W63_TRYB2 Coronin FSQLLALASLLGQQQAEVQRCREDLQKKESLMMETIAKIKALALEHHHHH 50 T 0.11 DUF3391 pdbpssm F Eukaryota T 7di3 1 A A A0A160P685_STRLU Cytochrome P450 hydroxylase MTEAVAFPQNRSCPYHPPTAYEPLREERPLSRVTLWNGRQVWFVTGHQAARALLGDQRLSTDSTREDFPLPTERSESLRRQRRGALLGWDDPEHNEQRRMLIPSFTLRRAESMRPRIQAIVDRLLDDMIAAGPSAELVGAFALPVPSMVICELLGVPYGDHEFFEEQSRRLLRGPAAEDIEKAFRSLEGYFGELIETKRTDPGEGVIDDLVARQREEGRPDDDELVQFATVLLVAGHETTANMISLATYTLLEHPARLAELRADPGLVPAAVEELLRFLSIADGLVRVAREDVPVGDQVIRAGEGVVFPTSLINRDDSVYEHPDTLDWSRSARHHVAFGFGIHQCLGQNLARIELEIALGTLLRRLPGLRLAAPADRIPFKPGDTIQGMLELPVTW 396 T 3E-05 p450 pdbpssm F Bacteria T 7dii 1 A,B A,B O67854_AQUAE LEUT MEVKREHWATRLGLILAMAGNAVGLGNFLRFPVQAAENGGGAFMIPYIIAFLLVGIPLMWIEWAMGRYGGAQGHGTTPAIFYLLWRNRFAKILGVFGLWIPLVVAIYYVYIESWTLGFAIKFLVGLVPEPPPNATDPDSILRPFKEFLYSYIGVPKGDEPILKPSLFAYIVFLITMFINVSILIRGISKGIERFAKIAMPTLFILAVFLVIRVFLLETPNGTAADGLNFLWTPDFEKLKDPGVWIAAVGQIFFTLSLGFGAIITYASYVRKDQDIVLSGLTAATLNEKAEVILGGSISIPAAVAFFGVANAVAIAKAGAFNLGFITLPAIFSQTAGGTFLGFLWFFLLFFAGLTSSIAIMQPMIAFLEDELKLSRKHAVLWTAAIVFFSAHLVMFLNKSLDEMDFWAGTIGVVFFGLTELIIFFWIFGADKAWEEINRGGIIKVPRIYYYVMRYITPAFLAVLLVVWAREYIPKIMEETHWTVWITRFYIIGLFLFLTFLVFLAERRRNHESA 513 T 7.399999999999999E-33 SNF unppercent F Bacteria T 7djq 1 A C A0A1V9TQZ2_9LACO C-Terminal peptide of ribosomal S4 Domain protein EEYREDFSI 9 T 7 Pox_A6 pdbhh F Bacteria T 7dkh 3 C,G,K C,G,K CDC73_YEAST RNA POLYMERASE-ASSOCIATED PROTEIN CDC73 NDSEVSDPVVVETMKHERILVDHNSALRGAKPINFGYLIKDAELKLVQSIKGSLRGS 57 T 0.34 CDC73_N unppercent F Eukaryota T 7dkh 4 D,H,L D,H,L RTF1_YEAST RNA polymerase-associated protein RTF1 KTRTKVYYQEIQKEENAKAKEIAQQEKLQEDKDAKDKREKELLVAQFRRLGGLERMVGELDIKFDLKF 68 T 0.14 DUF1366 pdbpercent F Eukaryota T 7dkk 1 A,B,C,D A,B,C,D De novo design protein XM2H MHSWSATVDSRSEEAVRAAARRLAERLLAAGISGKIKIEVEANGIKYEYEVEGPATEEVAKKIVEYAVAAALRAIAAGATSVTITVGLE 89 T 0.094 NAGLU_N pdb F T 7dko 1 A,B,C A,B,C de novo designed protein AM2M MASAEAEVKPDATIEEIRAAARRLAEALRKAGVSGPVTVTAEAGDVSFSYTADLDGTEEGLKRVVEAIVRAAIAALKATGGTKPVLLSAVLE 92 T 0.014 DUF1887 pdb F T 7dmf 1 A A Designed protein EXTD-3 DCQQELSLVQTVTRGSRAFLSREEAQHFVKECGLLNCEAVLELLICHLRLGMEIMKLGRQLREAVRANDVDAMLKIAKEIIKVIGETGLDEVYRQLLKAAKEFLERRAENFSHEEAVAFAQQIIQLIKQVECVQMRALGAVASLGCTDLLPQEHILLLTRPRLQELSAGSPGPVTNKATKILRHFEASC 189 T 0.074 AviRa pdb F T 7dmn 1 A A FSA2_FUSSF FUSARISETIN A BIOSYNTHESIS PROTEIN 2 GSHMSNVTVSAFTVDKSISEEHVLPSSFIPGSGNIFPKFTSAIPKTAWELWYFDGISKDDKSSIVIGVTRNAEGLKHGGFKVQVFVIWADERTWHRDLFFPESVVSINESGVTDGIWKDATSNSSISFSCAGDLSKASLVFDVPGVVQGDMHLEALPGDTGLDTDARLGPSVYYVRPIGRASVKAQLSLYSSDATAAEQFSLGTSANGGMDRVWSPLSWPQVMTESYYLRTQVGPYAMQIMRIFPPAGSEDQPSTMARLYREGQLVCVAQHVVTREDALMTHDSLILSKQDNSDSEDVVTGGYRDKNTGYTVEFVEKGNEGQRWKFQVRHERIIWNTPTSRPGPDATGNTGFVEVLCGGTIGESYEGVGTGGQCELS 377 T 0.00021 Svf1_C unphh F Eukaryota T 7dmo 1 A,B,C,D,E,F A,B,C,D,E,F PHM7_PYRSX Diels-Alderase GSHMSEPTSSSSLDITSNCIIETPLQPSDFLPKSANLFPKFPERISVDSWELWEFDTFDTNGSVAFGCSLYRDARGVEQGGFHAEVNALWPDGTHWGETLYFAVSEVVENSDGTTGGKWLSKDGGSITFHIASDYTAAALDFNVPGKVSGTMELRNHANVSPTSNLPASDAEAQLCPGVYYTFPMGPVATSVTATFSSVGANGESRELFISSGYGGMVRGWSARPWPTFMNDAYYVVAQVGPYMLQILRTLGSVFVQHKPFAVARLYLDGSLVSAANTVVGDELTAHADDVKGDAVRLTKVQPDEKSQGLSGKFRDGNVGYVLEFAKKDSEHGWTFQISHKRAVWSEPTSAPGPDGTGKSGWIEAISGGAKGENYEGHGFGGQLQIPVP 389 T 0.0068 Svf1_C unphh F Eukaryota T 7dmq 1 A A CS13A_LEPSD CRISPR/Cas system Cas13a GSMGNLFGHKRWYEVRDKKDFKIKRKVKVKRNYDGNKYILNINENNNKEKIDNNKFIRKYINYKKNDNILKEFTRKFHAGNILFKLKGKEGIIRIENNDDFLETEEVVLYIEAYGKSEKLKALGITKKKIIDEAIRQGITKDDKKIEIKRQENEEEIEIDIRDEYTNKTLNDCSIILRIIENDELETKKSIYEIFKNINMSLYKIIEKIIENETEKVFENRYYEEHLREKLLKDDKIDVILTNFMEIREKIKSNLEILGFVKFYLNVGGDKKKSKNKKMLVEKILNINVDLTVEDIADFVIKELEFWNITKRIEKVKKVNNEFLEKRRNRTYIKSYVLLDKHEKFKIERENKKDKIVKFFVENIKNNSIKEKIEKILAEFKIDELIKKLEKELKKGNCDTEIFGIFKKHYKVNFDSKKFSKKSDEEKELYKIIYRYLKGRIEKILVNEQKVRLKKMEKIEIEKILNESILSEKILKRVKQYTLEHIMYLGKLRHNDIDMTTVNTDDFSRLHAKEELDLELITFFASTNMELNKIFSRENINNDENIDFFGGDREKNYVLDKKILNSKIKIIRDLDFIDNKNNITNNFIRKFTKIGTNERNRILHAISKERDLQGTQDDYNKVINIIQNLKISDEEVSKALNLDVVFKDKKNIITKINDIKISEENNNDIKYLPSFSKVLPEILNLYRNNPKNEPFDTIETEKIVLNALIYVNKELYKKLILEDDLEENESKNIFLQELKKTLGNIDEIDENIIENYYKNAQISASKGNNKAIKKYQKKVIECYIGYLRKNYEELFDFSDFKMNIQEIKKQIKDINDNKTYERITVKTSDKTIVINDDFEYIISIFALLNSNAVINKIRNRFFATSVWLNTSEYQNIIDILDEIMQLNTLRNECITENWNLNLEEFIQKMKEIEKDFDDFKIQTKKEIFNNYYEDIKNNILTEFKDDINGCDVLEKKLEKIVIFDDETKFEIDKKSNILQDEQRKLSNINKKDLKKKVDQYIKDKDQEIKSKILCRIIFNSDFLKKYKKEIDNLIEDMESENENKFQEIYYPKERKNELYIYKKNLFLNIGNPNFDKIYGLISNDIKMADAKFLFNIDGKNIRKNKISEIDAILKNLNDKLNGYSKEYKEKYIKKLKENDDFFAKNIQNKNYKSFEKDYNRVSEYKKIRDLVEFNYLNKIESYLIDINWKLAIQMARFERDMHYIVNGLRELGIIKLSGYNTGISRAYPKRNGSDGFYTTTAYYKFFDEESYKKFEKICYGFGIDLSENSEINKPENESIRNYISHFYIVRNPFADYSIAEQIDRVSNLLSYSTRYNNSTYASVFEVFKKDVNLDYDELKKKFKLIGNNDILERLMKPKKVSVLELESYNSDYIKNLIIELLTKIENTNDTL 1391 T 0.067 PET117 pdbpercent F Bacteria T 7dms 1 A A Q665A4_YERPS Fe(II)-binding effector SMKTDNAMKKIKLAIDGINQAIDNFNEVQTFTTINQLNHFKEKLMNCEHLIQLNNIPDKSHRNLGISRIIIDQWPFDSELGCMIINAESEYKSL 94 T 0.0065 Laminin_I pdbpssm F Bacteria T 7dn2 1 A,B,C,D,E,F,G,H,I a,b,c,d,e,f,g,h,i ORF14_BPKHP Major structural protein ORF14 MLEKLNNINFNNISNNLNLGIEVGREIQNASWIKSPFFSITGTGADRGVRLFSVASQQPFRPRIKAQLSGSGVSGNTDFEANYDNLEILSQTIYPDAFGNSLRSKIKAYSELERIDFIKESVDSLTTWMNEERDKRIVASLTNDFTNYLYTQTMNVATIRKAIFHARNGLKGDNSKAFPIKPIRATMQSVGNVMVQNTSYIILLDSYQANQLKADSEFKELRKLYAFAGEDKGMLYSGLLGVIDNCPVIDAGVWNKFNVGMPNSSISDSDFMRYLNKANVSSIVTPRQFKEKLNQEKDEKKRSINKEISIGCLIGASAVLLAGSKETRFYIDETVDAGRKSLVGVDCLLGVSKARYQSTDGVVTPYDNQDYAVIGLVSDME 381 T 0.00023 DUF4043 pdbpssm T Viruses T 7dn2 2 J,K,L,M,N,O,P,Q,R 1,2,3,4,5,6,7,8,9 I7HFW5_BPKHP Cement protein gp15 MKQKVHSVSYLAKAEFKFNNGVYNLVALPSGAEVVKVSLEVVGNPIATSTTSVSVGFEDETTKNYFLTLDNLAVDDASKKHTTSAKDYTATSNKVVVAEVKNANDNNVKGVLRVLYFLPSVIEVEY 126 T 0.056 Spore_III_AF pdbpercent T Viruses T 7dne 2 C,D C,D V3-IY (MN) crown mimetic peptide KRIHIGPGRAFYTTXP 16 F F T 7dnf 2 B,E B,E V3-IY (MN) crown mimetic peptide PKRIHIGPGRAFYTTXP 17 T 0.0018 GP120 pdbhh F T 7dno 2 C,D C,D CYS-ARG-THR-LEU-PRO-PHE CRTLPFHEC 9 T 1.6 ORC3_ins pdbhh F T 7dns 1 A,B A,B de novo designed protein GGHMGEEQKEIETLVELFAEAFREAKRQKKNGTPEEWARDAVEEAARQQGRSRKDVVEALTKYAQEQGRDELLKRLGITPEIYKVIQQIRKEEGSLE 97 T 0.0081 DUF2226 pdb F T 7doq 1 A,B,C,D A,B,C,D A0A2S6F805_LEGPN HISTIDINE ACID PHOSPHATASE,HISTIDINE-TYPE PHOSPHATASE HHMEDKLIFAVDIIRHGDRTPIVALPTVNYQWQEGLGQLTAEGMQQEYKMGVAFRKKYIEESHLLPEHYEYGTIYVRSTDYARTLMSAQSLLMGLYPPGTGPTIPAGTSALPHAFQPIPVFSAPSKYDEVIIQQVDRKEREKLMEQYVFSTREWQQKNNELKDKYPLWSRLTGINIDNLGDLETVGHTLYIHQIHNAPMPEGLASNDIETIINSAEWAFMAQEKPQQIANVYSSKLMTNIADYLNSGSMKKSKLKYVLLSAHDTTIASVLSFLGAPLEKSPPYASNVNFSLYDNGANYYTVKITYNGNPVSIPACGGSVCELQQLINLVHDSKNS 335 T 0.018 His_Phos_2 pdbpssm F Bacteria T 7dou 1 A,B,C 4,5,6 I7GUT5_9CAUD Cement protein gp16 MKQKVHSVSYLAKAEFEYKNGVYDLVALPTGAEVIKISLEVVGLPTAGHVSVGFKDESKKNYSSILTLPVNETSGVVTKDYTVKSDKIVAAEVKDALAEGSDGRPVKCVLRALYFLPSVIEVEY 124 T 0.0053 Clathrin_bdg pdbpssm T Viruses T 7drm 2 E,F E,F A6GH40_9DELT PsnA214-38, Precursor peptide LFIEDLGKVTGGKGGPYTTLAIGEE 25 T 0.016 LcnG-beta unphh F Bacteria T 7drp 2 E,F F,E A6GH40_9DELT PsnA214-38, Precursor peptide, phospho-mimic LFIEDLGKVTGGKGGPYTTLAIGXE 25 T 0.016 LcnG-beta unphh F Bacteria T 7ds2 3 C C TWF1_MOUSE PROTEIN A6 GPLGSKQHAHKQSFAKPKGPAGKRGIRRLIRGPAEAEATTD 41 T 24 Cylicin_N unphh F Eukaryota T 7ds3 3 C C TWF2_MOUSE A6-RELATED PROTEIN,MA6RP,TWINFILIN-1-LIKE PROTEIN KQHAFKQAFAKPKGPGGKRGHKRLIRGPGE 30 T 17 Tristanin_u2 unphh F Eukaryota T 7ds4 3 C C TWF1_MOUSE PROTEIN A6 KQHAHKQSKAKPKGPAGKRGIRRLIRGPAE 30 T 24 Cylicin_N unphh F Eukaryota T 7ds6 3 C C CD2AP_HUMAN;TWF1_MOUSE PROTEIN A6,ADAPTER PROTEIN CMS,CAS LIGAND WITH MULTIPLE SH3 DOMAINS KQHAHKQSFAKPKMPGRRLPGRFNG 25 T 7.1 CAP-ZIP_m unphh F Eukaryota T 7ds8 3 C C CD2AP_HUMAN;TWF1_MOUSE ADAPTER PROTEIN CMS,CAS LIGAND WITH MULTIPLE SH3 DOMAINS,PROTEIN A6 NLLHLTANRPKGPAGKRGIRRLIRGPAE 28 T 7.1 CAP-ZIP_m unphh F Eukaryota T 7dsb 4 D D TWF1_MOUSE PROTEIN A6 KQHAHKQSFAKPKGPAGKRGIRRLIRGPAE 30 T 24 Cylicin_N unphh F Eukaryota T 7dsz 1 A,B,C A,B,C B2UR43_AKKM8 Amuc_1102 MGHHHHHHMQTTSNPRMQVRVSLEKLSLYMRQSPNVLTQDDPRPLPKPKKWADFEIPFKVEAAPTPKSGYIDALTFKFYIAVVNPDRSRQYLKLYKEVKYVNVPVGENTYASVYLSPSSVKRITGVEGGRGKWVKYQGVVVEYNGKIVATYSSERGKMEKWWTIQSPSIVETSYYPLLNKDETPFSVFWYDRYPEIMRPNSQQAASSSVPAPFGTPVEPPADGELEHHHHHH 232 T 0.14 OMP_b-brl unppssm F Bacteria T 7dta 1 A A S2A4R_HUMAN GLUT4 ENHANCER FACTOR,GEF,HUNTINGTON DISEASE GENE REGULATORY REGION-BINDING PROTEIN 1,HDBP-1 GDAKKCRKVYGMERRDLWCTACRWKKACQRFLD 33 T 0.59 ERG4_ERG24 pdbpssm F Eukaryota T 7dtr 1 A A AcrIF24 MNAIHIGPFSITPAARGLHYGGLPHHQWTLYYGPREMAIKTLPDSYTSSEVRDEFSDIIAEFVIDARHRYAPGSGSSGSAPPLAWITGLLPGEVLTHDAEEWRPPTSWELRHVVGEGSFTGVSGAAAAALLGMSATNFRKYTAGDSAANRQKISFAAWHYLLDRLGVKRAS 171 T 0.00017 DUF4447 pdbhh F T 7du0 1 A A A0A0R6PCL0_9CAUD AcrIF14 AMKKIEMIEISQNRQNLTAFLHISEIKAINAKLADGVDVDKKSFDEICSIVLEQYQAKQISNKQASEIFETLAKANKSFKIEKFRCSHGYNEIYKYSPDHEAYLFYCKGGQGQLNKLIAENGRFM 125 T 0.053 GatB_Yqey unp T Viruses T 7du4 2 C Q MAZE7_MYCTU peptide DEDREWEGTVGDGLG 15 T 0.23 FUSC unppercent F Bacteria T 7du5 2 C C MAZE9_MYCTU A fragment of MazE-mt1 TLEDDYANAWQEWSAAG 17 T 0.0075 ParD_antitoxin unppercent F Bacteria T 7duv 1 A,B,C A,B,C Q981B2_SACS2 SegB MSELDFLLKKKRKSEDEEKIINNNENAKKEEITNEEEKIKNDMLKYIEKDPKIGVWSYPAFLVLQYLYHTVPGFKMSRTAKEALEKGLKEMYPTLFTIAEKIAKERFKEHHHHHH 115 T 0.0063 CBFD_NFYB_HMF unppercent F Archaea T 7dv2 1 A,B,C,D A,B,C,D Q981B2_SACS2 SegB MNEEEKIKNDMLKYIEKDPKIGVWSYPAFLVLQYLYHTVPGFKMSRTAKEALEKGLKEMYPTLFTIAEKIAKERFKEHHHHHH 83 T 0.0062 CBFD_NFYB_HMF pdbpercent F Archaea T 7dxa 2 B B Q8DMP8_THEEB Tsl0063 protein MRYTTDEGGRLNNFAIEPKVYQAQPWTPQQKVRAALLVGGGLLLVAGLVAIAVGVS 56 T 0.026 DUF2157 pdbpssm F Bacteria T 7dyr 3 C,F,I A,D,G MCEA_KLEPN MCCE492 GETDPNTQLLNDLGNNMAWGAALGAPGGLGSAALGAAGGALQTVGQGLIDHGPVNVPIPVLIGPSWNGSGSGYNSATSSSGSGS 84 T 0.00094 MccV unphh F Bacteria T 7dz7 13 M O A8JCL6_CHLRE Photosystem I subunit O MAVAMRSAAMPSLASRPRVSSRRSVVVRAEASNKSFPRDWVKTDPLVPVLGFAGWTIPANIGVSAFGGQSLFGLFTQSIGENLAHFPTGPALDDKFWLYLITYHLGLFLTITLGQIGVQGRKQGYW 126 T 42 YkpC pdbhh F Eukaryota T 7dz9 3 C,F D,C E3BK13_9VIBR MbnC MEEILDRIINPLSAKPLTKKEHIYTSLVLQSSQSLILSACPSLQSQRQFCSFEYHQQFIDWCFFNKKRTDWCLALSFYQYLSYKNEQVSVEILKELIHLACSQWTYADKSTNQTVVICHTRLPSMVFGGNKSLFAQEFREVFLLETEQLKPFIQSHVPDGYFVYWILRDDSEYPSTMGEK 180 T 0.14 SEFIR unppercent F Bacteria T 7dz9 4 D E MbnA MKNDKKVVVKVKDKEMTCGAFNK 23 T 5.1 ATP-synt pdbhh F T 7dz9 5 E F MbnA MKNDKKVVVKVKDKEMTCGAFN 22 T 3.8 ATP-synt pdbhh F T 7e0b 2 B B PBM DDVQTSF 7 T 55 DUF2150 pdbhh F T 7e0c 1 A A Q8L3C7_9ACTN L-glutamate oxidase MNEMTYEQLARELLLVGPAPTNEDLKLRYLDVLIDNGLNPPGPPKRILIVGAGIAGLVAGDLLTRAGHDVTILEANANRVGGRIKTFHAKKGEPSPFADPAQYAEAGAMRLPSFHPLTLALIDKLGLKRRLFFNVDIDPQTGNQDAPVPPVFYKSFKDGKTWTNGAPSPEFKEPDKRNHTWIRTNREQVRRAQYATDPSSINEGFHLTGCETRLTVSDMVNQALEPVRDYYSVKQDDGTRVNKPFKEWLAGWADVVRDFDGYSMGRFLREYAEFSDEAVEAIGTIENMTSELHLAFFHSFLGRSDIDPRATYWEIEGGSRMLPETLAKDLRDQIVMGQRMVRLEYYDPGRDGHHGELTGPGGPAVAIQTVPEGEPYAATQTWTGDLAIVTIPFSSLRFVKVTPPFSYKKRRAVIETHYDQATKVLLEFSRRWWEFTEADWKRELDAIAPGLYDYYQQWGEDDAEAALALPQSVRNLPTGLLGAHPSVDESRIGEEQVEYYRNSELRGGVRPATNAYGGGSTTDNPNRFMYYPSHPVPGTQGGVVLAAYSWSDDAARWDSFDDAERYGYALENLQSVHGRRIEVFYTGAGQTQSWLRDPYACGEAAVYTPHQMTAFHLDVVRPEGPVYFAGEHVSLKHAWIEGAVETAVRAAIAVNEAPVGDTGVTAAAGRRGAAAATEPMREEALTS 687 T 2.2999999999999998E-32 Amino_oxidase unppercent F Bacteria T 7e15 3 C,F C,F Q5JET1_THEKO POLDP1, POL II,EXODEOXYRIBONUCLEASE SMALL SUBUNIT MLVEDLLKNNYLITPSAYYLLSDHYKKAFTLAELIKFAKNRGTFVVDSNLAREFLAEKGIISSG 64 T 0.12 DUF2492 pdbhh F Archaea T 7e2h 1 A D DISP1_HUMAN Protein dispatched homolog 1 MAMSNGNNDFVVLSNSSIATSAANPSPLTPCDGDHAAQQLTPKEATRTKVSPNGCLQLNGTVKSSFLPLDNQRMPQMLPQCCHPCPYHHPLTSHSSHQECHPEAGPAAPSALASCCMQPHSEYSASLCPNHSPVYQTTCCLQPSPSFCLHHPWPDHFQHQPVQQHIANIRPSRPFKLPKSYAALIADWPVVVLGMCTMFIVVCALVGVLVPELPDFSDPLLGFEPRGTAIGQRLVTWNNMVKNTGYKATLANYPFKYADEQASSLEVLFQ 270 T 0.76 Adeno_E3_14_5 pdbhh F Eukaryota T 7e4j 1 A,B,C,D A,B,C,D VPB12_MYCTU ANTITOXIN VAPB12,CONSERVED PROTEIN OF UNCHARACTERIZED FUNCTION,POSSIBLE ANTITOXIN VAPB12 MSAMVQIRNVPDELLHELKARAAAQRMSLSDFLLARLAEIAEEP 44 T 0.00013 ParD unppercent F Bacteria T 7e57 1 A,B A,B TNF18_MOUSE GITR LIGAND,GITRL,GLUCOCORTICOID-INDUCED TNF-RELATED LIGAND MEEMPLRESSPQRAERCKKSWLLCIVALLLMLLCSLGTLIYTSLKPTAIESCMVKFELSSSKWHMTSPKPHCVNTTSDGKLKILQSGTYLIYGQVIPVDKKYIKDNAPFVVQIYKKNDVLQTLMNDFQILPIGGVYELHAGDNIYLKFNSKDHIQKTNTYWGIILMPDLPFIS 173 T 0.00013 TNF pdb F Eukaryota T 7e5e 2 E,F,G,H E,F,G,H GD20 XXLITFRQWAFNLPCGX 17 T 3.8 DUF2433 pdbhh F T 7e5t 1 A,B,C,D,E,F,G,H A,B,C,D,E,F,G,H FSA2_FUSSF FUSARISETIN A BIOSYNTHESIS PROTEIN 2 MGSSHHHHHHMSNVTVSAFTVDKSISEEHVLPSSFIPGSGNIFPKFTSAIPKTAWELWYFDGISKDDKSSIVIGVTRNAEGLKHGGFKVQVFVIWADERTWHRDLFFPESVVSINESGVTDGIWKDATSNSSISFSCAGDLSKASLVFDVPGVVQGDMHLEALPGDTGLDTDARLGPSVYYVRPIGRASVKAQLSLYSSDATAAEQFSLGTSANGGMDRVWSPLSWPQVMTESYYLRTQVGPYAMQIMRIFPPAGSEDQPSTMARLYREGQLVCVAQHVVTREDALMTHDSLILSKQDNSDSEDVVTGGYRDKNTGYTVEFVEKGNEGQRWKFQVRHERIIWNTPTSRPGPDATGNTGFVEVLCGGTIGESYEGVGTGGQCELS 384 T 0.00021 Svf1_C unphh F Eukaryota T 7e87 3 C,D,G,H J,I,E,F DPP6_HUMAN DPPX,DIPEPTIDYL AMINOPEPTIDASE-RELATED PROTEIN,DIPEPTIDYL PEPTIDASE 6,DIPEPTIDYL PEPTIDASE IV-LIKE PROTEIN,DIPEPTIDYL PEPTIDASE VI,DPP VI WKGIAIALLVILVICSLIVTSVILLTPA 28 T 0.11 TMEM_230_134 unppercent F Eukaryota T 7e8e 3 C,I K,I DPP6_HUMAN DPPX,DIPEPTIDYL AMINOPEPTIDASE-RELATED PROTEIN,DIPEPTIDYL PEPTIDASE 6,DIPEPTIDYL PEPTIDASE IV-LIKE PROTEIN,DIPEPTIDYL PEPTIDASE VI,DPP VI AAAAKGIAIALLVILVICSLIVTSVILLTPA 31 T 0.11 TMEM_230_134 unppercent F Eukaryota T 7e8e 5 F,L L,J DPP6_HUMAN DPPX,DIPEPTIDYL AMINOPEPTIDASE-RELATED PROTEIN,DIPEPTIDYL PEPTIDASE 6,DIPEPTIDYL PEPTIDASE IV-LIKE PROTEIN,DIPEPTIDYL PEPTIDASE VI,DPP VI AKGIAIALLVILVICSLIVTSVILLTPA 28 T 0.11 TMEM_230_134 unppercent F Eukaryota T 7e9k 2 E,F C,F DAG1_HUMAN mono-mannosyl peptide (379Man long peptide) XTIRTRGAIIQTPTLGPIQPTRX 23 T 19 Ste5 pdbhh F Eukaryota T 7e9l 2 C C DAG1_HUMAN mono-mannosyl peptide (379Man short peptide) XQTPTLGPIQPTRX 14 T 5.1 Ste5 pdbhh F Eukaryota T 7e9m 2 B,D B,D SPNDC_HUMAN SPIN1-DOCKING PROTEIN,SPIN-DOC FAAPAEVRHFTDGSFPAGFVLQLFSHT 27 T 23 DUF2852 pdbhh F Eukaryota T 7e9s 2 B B a polypeptide linked to an inhibitory N-glycosylation sequon-containing peptide APYXVTASCR 10 T 7.4 SsgA pdbhh F T 7ea1 2 B,D B,D SPNDC_HUMAN SPINDOC DOCPEP2 VRKKRGRPMTKN 12 T 3 AT_hook pdbhh F Eukaryota T 7ea7 2 C C AZI2_HUMAN NAP1_LIR motif EDDICILNHEK 11 F F Eukaryota T 7eau 1 A A N4VVN8_COLOR SIN1 QEGKCTAKGECQENTSGVKLFCTSGSCAKKEGQACTRNGPGSSNSASCPK 50 T 0.011 Nodulin_late unp F Eukaryota T 7ebd 1 A,B A,B A0A133PTK7_9BACT STING SGGGLPSTVIAISYFEGFVKLAAEWIVTEMPTTEIDGKTYTSGKLYIKMPETLDTDIKKSAMLFYKKQGLNETQMSTNHRNYPIHIVSKEEGDTLEVYDMPTILSGIDKAIDMYFRVGHIGKTTEQQLAEDNEMNNFKRVLQLLINEDSFCRECVEILRQA 161 T 1.7E-05 TMEM173 pdbhh F Bacteria T 7ebl 1 A,B A,B STING GIHLGELGLLPSTVLAIGYFENLVNIICESLNMLPKLEVSGKEYKKFKFTIVIPKDLDANIKKRAKIYFKQKSLIEIEIPTSSRNYPIHIQFDENSTDDILHLYDMPTTIGGIDKAIEMFMRKGHIGKTDQQKLLEERELRNFKTTLENLIATDAFAKEMVEVIIEE 167 T 0.26 Pih1_fungal_CS pdbpercent F T 7eca 2 B B NF2L2_MOUSE LEU-ASP-GLU-GLU-THR-GLY-GLU-PHE-LEU-PRO EQEKAFFAQFQLDEETGEFLPIQPA 25 T 0.055 Radial_spoke unppercent F Eukaryota T 7ecd 1 A A R5MX27_9FIRM Phosphatidate cytidylyltransferase MHHHHHHMKDFIKEFLNERPEVVAAFGYGSGVFKQLGYDSKEKPQIDLILIVNDMKLWHKENIKKNPKDYSFIGRNFFLNSSIDEIKGITGITYQSNIEYKGHLFKYGIIEYGDFVRHMQTWDSFYVPGRFQKPILTIKSNNFIDELILQNRRNACKVGLLCLNNKDLKDLYLTICNLSYSGDTRMKVAENPKKVENIVGASYDKFNEMYNFNDLYQKNGERIEYEIDIDELPSSLEKYIKDDKTKEKVMEYLSDLNRKESSLQTMKGIKTN 272 T 4.6E-13 Tam41_Mmp37 unppssm F Bacteria T 7ecv 5 J,K I,J A0A0R6PCL0_9CAUD AcrIF14 MKKIEMIEISQNRQNLTAFLHISEIKAINAKLADGVDVDKKSFDEICSIVLEQYQAKQISNKQASEIFETLAKANKSFKIEKFRCSHGYNEIYKYSPDHEAYLFYCKGGQGQLNKLIAENGRFM 124 T 0.053 GatB_Yqey pdb T Viruses T 7edo 3 C,F C,F CYS-ASN-VAL-THR-LEU-ASN-TYR-PRO CNVTLNYP 8 T 1.2 DUF2884 pdbhh F T 7edp 1 A B DOT1L_HUMAN DOT1-LIKE PROTEIN,HISTONE H3-K79 METHYLTRANSFERASE,H3-K79-HMTASE,LYSINE N-METHYLTRANSFERASE 4 DWATLSLEKLLKEKQALKSQISEKQRHCLELQISIVELEK 40 T 0.02 HALZ pdbpercent F Eukaryota T 7edz 1 A,B,C,D D,C,B,A PPCS_HUMAN PHOSPHOPANTOTHENOYLCYSTEINE SYNTHETASE,PPC SYNTHETASE MAEMDPVAEFPQPPGAARWAEVMARFAARLGAQGRRVVLVTSGGTKVPLEARPVRFLDNFSSGRRGATSAEAFLAAGYGVLFLYRARSAFPYAHRFPPQTWLSALRPSGPALSGLLSLEAEENALPGFAEALRSYQEAAAAGTFLVVEFTTLADYLHLLQAAAQALNPLGPSAMFYLAAAVSDFYVPVSEMPEHKIQSSGGPLQITMKMVPKLLSPLVKDWAPKAFIISFKLETDPAIVINRARKALEIYQHQVVVANILESRQSFVLIVTKDSETKLLLSEEEIEKGVEIEEKIVDNLQSRHTAFIGDRN 311 T 1.7E-07 DFP pdbpercent F Eukaryota T 7eeb 14 N K CTSRZ_MOUSE CATSPER-ZETA,CATSPERZETA,PROTEIN EXPRESSED IN MALE LEPTOTENE AND ZYGOTENE SPERMATOCYTES 622,MLZ-622,TESTIS-EXPRESSED PROTEIN 40 MEESVKPVPKHANHRRSSVRSSLYGDVRDLWSTATMSTANVSVSDVCEDFDEEGKSVRNRIRKYSQTISIRDSLNLEPEEIQQQARRELELCHGRSLEHGEDHEESETSLASSTSESLIFSLWKPHRTYWTEQQNRLPLPLMELMETEVLDILKKALITYRSTIGRNHFMTKELQGYIEGIRKRRNKRLYFLDQ 194 T 18 EGL-1 pdbhh F Eukaryota T 7eel 2 H,I,J,K,L,M,N H,I,J,K,L,M,N Cement (decoration) proteins MPATNSAQARLAAPGHGFGGNVKVSYGSVAFTGTITTADAATVCNLPVGAIVLGVTLESDDLDTNATPTITLNVGDAGSATRYFSASTVAQAGTSSSAPATTGLLWTVTEGNTAVRIAVANNAATSADGSVRVAVTYYLP 140 T 57 DUF6476 pdbhh F T 7eep 1 A,B,C,D,E,F,G,H,I,J,K,L A,B,C,D,E,F,G,H,I,J,K,L Pam1 portal proteins MEDTMTMPSHAQLKAYFEEARDANEEYRKEAFIDRDYFDGHQWTEEELQKLEARKQPATYFNEVKLSIRGLVGVFEQGDSDPRAWPRNPQDEDSADIATKALRYVKDYSEWSDERSRAALNYFVEGTCAAIVGVDENGRPEIEPIRFEEFFHDPRSRELDFSDARFKGVAKWRFADEVGMEYGIKGEIDGALDGDSEGLSIGGDTFGDRPDGKISSWIDSKLRRVFVVEMYVRWNGVWIRALFWGRGILEMSVSAYLDRNGKPTCPIEARSCYIDRENRRYGEVRDLRSPQDAINKRESKLLHMLNNRQAIATNPEYAYNSDAEMVRKEMSKPDGIIPPGWQPASMTDLANGQFALLSSAREFIQRIGQNPSVLAAQSASASGRAQLARQQAGMVDSAMALNGLRRFELAVYRQAWLRCRQFWKAPDYIRVTDDEGAPQFVGINQPIKGPPQPVLNEMGQVVIAEPILGYENALAELDVDINIDAVPDTANLAQEQFLQLTELARLYGPQEVPFDDLLELSSMPEKTKLIAKRRERSEQMAQVQAQQGQMQEQIAMQGAMAEIENTQADTAYLAARAQNEMLKPQIEAFKAGFGAA 596 T 0.00011 P22_portal pdbpssm F T 7eep 2 M,N,O,P,Q,R,S,T,U,V,W,X M,N,O,P,Q,R,S,T,U,V,W,X Pam1 adaptor proteins MITCRDIITLGLQQARVVPLGREPKAKEADAGLTVLQSIYDSMFADGPLGPFTEVYATSAYTAQENERIVTNGAAITIPQTITEGNETRKPYDLTAIIVINGAAQENHVFSLGRWQTAHDLTLNSEAPLAERDKAGLAALFAMEFAEMFGAELPPRTTARGFRFKGAISQKLATKRDDPVYY 182 F F T 7eeq 2 G,H,I,J,K,L,M,N,O,P,Q,R,S,T,U,V,W,X A,B,C,D,E,F,G,H,I,J,K,L,M,N,O,P,Q,R Tailspike head-binding domain MAAELHITPSRATSSNGLNLDGAKWFFYQTGTTTPQSVYTTAALSVAHSNPVVADAAGKFPAIYFDTTLEYRGVLKTADEATTIYDIDPINSGILSVLGTSS 102 T 0.00073 Big_1 pdbpercent F T 7ees 1 A A GLY-THR-ILE-ASP-PRO-GLN-ASN-SER-GLU-GLU-HIS-PRO-VAL-LEU-SER-ARG-ARG-LEU-GLU-ASN GTIDPQNSEEHPVLSRRLEN 20 T 9.9 DUF3243 pdbhh F T 7ef0 2 C P H32_MAIZE Histone H3.2 ARTKQTARMSTGGKAPRKQ 19 T 350 Sirohm_synth_M pdbhh F Eukaryota T 7egb 23 W Q TF2AA_HUMAN GENERAL TRANSCRIPTION FACTOR IIA SUBUNIT 1,TFIIAL,TRANSCRIPTION INITIATION FACTOR TFIIA 42 KDA SUBUNIT,TFIIA-42 MANSANTNTVPKLYRSVIEDVINDVRDIFLDDGVDEQVLMELKTLWENKLMQSRAVDGFHSEEQQLLLQVQQQHQPQQQQHHHHHHHQQAQPQQTVPQQAQTQQVLIPASQQATAPQVIVPDSKLIQHMNASNMSAAATAATLALPAGVTPVQQILTNSGQLLQVVRAANGAQYIFQPQQSVVLQQQVIPQMQPGGVQAPVIQQVLAPLPGGISPQTGVIIQPQQILFTGNKTQVIPTTVAAPTPAQAQITATGQQQPQAQPAQTQAPLVLQVDGTGDTSSEEDEDEEEDYDDDEEEDKEKDGAEDGQVEEEPLNSEDDVSDEEGQELFDTENVVVCQYDKIHRSKNKWKFHLKDGIMNLNGRDYIFSKAIGDAEW 376 F F Eukaryota T 7egs 1 A A RPOB_ECOLI RNAP SUBUNIT BETA,RNA POLYMERASE SUBUNIT BETA,TRANSCRIPTASE SUBUNIT BETA AMDSPGVFFDSDKGKTHSSGKVLYNARIIPYRGSWLDFEFDPKDNLFVRIDRRRKLPATIILRALNYTTEQILDLFFEKVIFEIRDNKLQMELVPERLRGETASFDIEANGKVYVEKGRRITARHIRQLEKDDVKLIEVPVEYIAGKVVAKDYIDESTGELICAANMELSLDLLAKLSQSGHKRIETLFTNDLDHGPYISETLRVDPTNDRLSALVEIYRMMRPGEPPTREAAESLFENLFFSEDRYDLSAVGRMKFNRSLLREEIEGSGILSKDDIIDVMKKLIDIRNGKGEVD 295 T 5.6E-08 RNA_pol_Rpb2_2 pdb F Bacteria T 7egs 2 B B UVRD_ECOLI DNA helicase II AMDVSHQRMGTPMVENDSGYKLGQRVRHAKFGEGTIVNMEGSGEHSRLQVAFQGQGIKWLVAAYARLESV 70 T 0.00015 Tudor_1_RapA pdbhh F Bacteria T 7egu 2 B B macrocyclic peptide X FPLIFPRKGCGG 12 T 0.28 MOSC_N pdbhh F T 7ehz 2 B B macrocyclic peptide 2 GXPYRPXXC 9 T 1.4 LINES_C pdbhh F T 7ei2 2 B B macrocyclic peptide 8 GXPYKPXXC 9 T 1.8 LINES_C pdbhh F T 7eii 1 A,B A,B FAD dependent L-Lys oxidase MGNKNTPLNSGKHPDLKIEVAIIGAGTSGLYTAYRLVTDKKFKAHDVQIFDMNNKLGGRLESVIMPGMNFWGELGGMRYLTSQQIVTTLIEGYPLSEKDPNKRTPVLKDKMTPVPFPMGDPSKLLMYLRKERFKQNAWNEAQKKGEKLPTRYYLNENDLGFSSDQLFNKIIYDVLMADPWVAETYGSKIIKGSSVYDYSFKLTSRDWDDIKPKLVYNFPNSPYDQRKVNDIGFWNLIKDQVSQEGYEFLANAGGYYSNTINWNSAEAFPYMVGDFSAGTIYKTIEEGYDSIAYAVANSYMEHEGACIWSENKLLTFTKDHPLTNTHKYELTFLNLKTNTQWKVYANSIVLAMPRKSLELLDQNNFFFNINKNSVLNNNIRSVIMEPAFAILMGFEYPWWKELGIDSGHSITDLPMRQCYYFGTDPETNNSMLLGSYGDMETETFWKALSDDKVLFEVKAAKSASLRELHQLDDVQATKLMVGELMNQLRELHGDTVTIPEPYVTYFKDWTDEPFGAGYHAWKAGFSVENVMPYMRKPLTDEQIHICGEAYSDQQGWVEGAFCEAEKMLQEYFGLDRPYWLSPDYYLGWEHHHHHH 595 T 3.4E-17 MCRA pdbhh F T 7ekn 1 A,C,E,G B,D,F,H ipep SQIEWAKARVEKLRKRNQALKSQTSELQRQIAELEASNAELKK 43 T 0.00013 GIT_CC pdb F T 7el1 5 E E 100AA MKSVKYISNMSKQEKGYRVYVNVVNEDTDKGFLFPSVPKEVIENDKIDELFNFEHHKPYVQKAKSRYDKNGIGYKIVQLDEGFQKFIELNKEKMKENLDY 100 T 24 ATP-synt_DE_N pdbhh F T 7elh 4 D,F,G,I,J,L,M,O,P,R D,E,F,G,H,I,J,K,L,M F1ARN3_9REOV LAMBDA1 YQCHVCSAVLFSPLDLDAHVASHGLHGNMTLTSSDIQRHITEFISSWQNHPIVQVSADVENKKTAQLLHADTPRLVTWDAGLCTSFKIVPIVPAQVPQDVLAYTFFTSSYAIQSPFPEAAVSRIVVHTRWASNVDFDRDSSVIMAPPTENNIHLFKQLLNTETLSVRGANPLMFRANVLHMLLEFVLDNLYLNRHTGFSQDHTPFTEGANLRSLPGPDAEKWYSIMYPTRMGTPNVSKICNFVASCVRNRVGRFDRAQMMNGAMSEWVDVFETSDALTVSIRGRWMARLARMNINPTEIEWALTECAQGYVTVTSPYAPSVNRLMPYRISNAERQISQIIRIMNIGNNATVIQPVLQDISVLLQRISPLQIDPTIISNTMSTVSESTTQTLSPASSILGKLRPSNSDFSSFRVALAGWLYNGVVTTVIDDSSYPKDGGSVTSLENLWDFFILALALPLTTDPCAPVKAFMTLANMMVGFETIPMDNQIYTQSRRASAFSTPHTWPRCFMNIQLISPIDAPILRQWAEIIHRYWPNPSQIRYGAPNVFGSANLFTPPEVLLLPIDHQPANVTTPTLDFTNELTNWRARVCELMKNLVDNQRYQPGWTQSLVSSMRGTLDKLKLIKSMTPMYLQQLAPVELAVIAPMLPFPPFQVPYVRLDRDRVPTMVGVTRQSRDTITQPALSLSTTNTTVGVPLALDARAITVALLSGKYPPDLVTNVWYADAIYPMYADTEVFSNLQRDMITCEAVQTLVTLVAQISETQYPVDRYLDWIPSLRASAATAATFAEWVNTSMKTAFDLSDMLLEPLLSGDPRMTQLAIQYQQYNGRTFNIIPEMPGSVIADCVQLTAEVFNHEYNLFGIARGDIIIGRVQSTHLWSPLAPPPDLVFDRDTPGVHIFGRDCRISFGMNGAAPMIRDETGLMVPFEGNWIFPLALWQMNTRYFNQQFDAWIKTGELRIRIEMGAYPYMLHYYDPRQYANAWNLTSAWLEEITPTSIPSVPFMVPISSDHDISSAPAVQYIISTEYNDRSLFCTNSSSPQTIAGPDKHIPVERYNILTNPDAPPTQIQLPEVVDLYNVVTRYAYETPPITAVVMGVP 1095 T 0.0075 zf_C2H2_6 unphh T Viruses T 7elh 5 E,H,K,N,Q e,g,i,k,m F1ARN3_9REOV LAMBDA1 MKRIPRKTKGKSSGKGNDSTERADDGSSQLRDKQNNKAGPATTEPGTSNREQYKARPGIASVQRATESAEMPMKNNDEGTPDKKGNTKGDLVNEHSEAKDEADEATKKQAKDTDKSKAQVTYSDTGINNANELSRSGNVDNEGGSNQKPMSTRIAEATSAIVSKHPARVGLPPTASSGHG 180 T 0.25 zf-C2H2_3rep unphh T Viruses T 7elm 5 S,T U,V A0A8G3G219_PSEAI AcrIF24 MNAIHIGPFSITPAARGLHYGGLPHHQWTLYYGPREMAIKTLPDSYTSSEVRDEFSDIIAEFVIDARHRYAPDVLELVNSDGDAVLARVAVSRLPEALSGCIPDDRFPYWLLTASRPRLGLPVTLNEYTALAVELSAPPLAWITGLLPGEVLTHDAEEWRPPTSWELRHVVGEGSFTGVSGAAAAALLGMSATNFRKYTAGDSAANRQKISFAAWHYLLDRLGVKRAS 228 T 0.00054 DUF4447 pdbhh F Bacteria T 7ely 1 A A 16X_BCL GCMILLDTDIWCPCSHPYACPENICC 26 T 1.6 WCCH pdbhh F T 7em9 3 C C SER-LEU-ASP-GLU-TYR-SER-SER-ASP-VAL SLDEYSSDV 9 T 9.7 V-ATPase_H_C pdbhh F T 7ema 3 C C TYR-SER-SER-ASP-VAL-THR-THR-LEU-VAL YSSDVTTLV 9 T 15 Peptidase_S66 pdbhh F T 7emc 3 C,F,I C,F,I ALA-THR-GLU-ILE-ARG-GLU-LEU-LEU-VAL ATEIRELLV 9 T 0.66 DUF5908 pdbhh F T 7emd 3 C C CAPSH_ASFB7 TYR-GLY-ASP-PHE-PHE-HIS-ASP-MET-VAL YGDFFHDMV 9 T 0.7 Ebp2 pdbhh T Viruses T 7emz 1 A,B A,B NVFI_ASPN1 NvfI W199F MGSSHHHHHHSSGLVPRGSHMVGSRTWCESEMLFVQPDAGTKEELYYRVTPKPGQTQANFNWTPHKVRFHDARPQRDSFDLNTHGFTFVEDAISPQLIERIRADDTAAVEGDYFASVAALVKRVTGADHVVCFSPYTRKENSEKGIFGQPARTVHCDHTPAAAIELTHKLCGEDAVRLLQSRFRAFSVWRPLVEPVLDWPLAVVDGRTIAPDDLHPVHFLRYEKKDTEPPFQLSFSETQKWYYLSRQRSDEVSIVKNYDSEVVPSPRSAHCAFKHPFVPKDAPPRESIDVRCLVFGGR 298 T 0.28 EF-hand_5 unppercent F Eukaryota T 7ena 6 F DQ TFIIA-a MANSANTNTVPKLYRSVIEDVINDVRDIFLDDGVDEQVLMELKTLWENKLMQSRAVDGFHSEEQQLLLQVQQQHQPQQQQHHHHHHHQQAQPQQTVPQQAQTQQVLIPASQQATAPQVIVPDSKLIQHMNASNMSAAATAATLALPAGVTPVQQILTNSGQLLQVVRAANGAQYIFQPQQSVVLQQQVIPQMQPGGVQAPVIQQVLAPLPGGISPQTGVIIQPQQILFTGNKTQVIPTTVEEEPLNSEDDVSDEEGQELFDTENVVVCQYDKIHRSKNKWKFHLKDGIMNLNGRDYIFSKAIGDAEW 307 T 8.3E-42 TFIIA pdb F T 7eni 3 C C AcrIIA13 protein MNKSIEIKDQNNIVLIDSLGQFFTDIENDNNGRYNIDYVLLNEVEHDNGNTYYEVGMYRTEEVPFSDKVTQDNVELLEDKWLQIDQQGESYVESIFFENEEDAREYIKLVLKGHETFEETAKAIGVIK 128 T 0.054 EABR pdb F T 7enm 1 A A AcrIIA14 protein SMKSVKYISNMSKQEKGYRVYVNVVNEDTDKGFLFPSVPKEVIENDKIDELFNFEHHKPYVQKAKSRYDKNGIGYKIVQLDEGFQKFIELNKEKMKENLDY 101 T 25 ATP-synt_DE_N pdbhh F T 7ep0 1 A,B A,B ZY11B_HUMAN Protein zyg-11 homolog B GSTEQTAQLGTELFIVRQLLQIVKQKTNQNSVDTTLKFTLSALWNLTDESPTTCRHFIENQGLELFMRVLESFPTESSIQQKVLGLLNNIAEVQELHSELMWKDFIDHISSLLHSVEVEVSYFAAGIIAHLISRGEQAWTLSRSQRNSLLDDLHSAILKWPTPECEMVAYRSFNPFFPLLGCFTTPGVQLWAVWAMQHVCSKNPSRYCSMLIEEGGLQHLYNIKDHEHTDPHVQQIAVAILDSLEKHIVR 250 T 0.0059 DUF3361 pdb F Eukaryota T 7ep1 1 A,B A,B ZY11B_HUMAN Protein zyg-11 homolog B GFLHVGAQLGTELFIVRQLLQIVKQKTNQNSVDTTLKFTLSALWNLTDESPTTCRHFIENQGLELFMRVLESFPTESSIQQKVLGLLNNIAEVQELHSELMWKDFIDHISSLLHSVEVEVSYFAAGIIAHLISRGEQAWTLSRSQRNSLLDDLHSAILKWPTPECEMVAYRSFNPFFPLLGCFTTPGVQLWAVWAMQHVCSKNPSRYCSMLIEEGGLQHLYNIKDHEHTDPHVQQIAVAILDSLEKHIVR 250 T 0.00014 DUF4704 pdb F Eukaryota T 7ep2 1 A,B,C,D A,C,D,B ZY11B_HUMAN Protein zyg-11 homolog B GGFNRFEAAKLVMQWLCNHEDQNMQRMAVAIISILAAKLSTEQTAQLGTELFIVRQLLQIVKQKTNQNSVDTTLKFTLSALWNLTDESPTTCRHFIENQGLELFMRVLESFPTESSIQQKVLGLLNNIAEVQELHSELMWKDFIDHISSLLHSVEVEVSYFAAGIIAHLISRGEQAWTLSRSQRNSLLDDLHSAILKWPTPECEMVAYRSFNPFFPLLGCFTTPGVQLWAVWAMQHVCSKNPSRYCSMLIEEGGLQHLYNIKDHEHTDPHVQQIAVAILDSLEKHIVR 288 T 0.00019 V-ATPase_H_N pdbpercent F Eukaryota T 7ep3 1 A A ZER1_HUMAN HZYG,ZYG-11 HOMOLOG B-LIKE PROTEIN,ZYG11B-LIKE PROTEIN GAGNKMGFVVTMLKLIQKKLLDKTCDQVMEFSWSALWNITDETPDNCEMFLNFNGMKLFLDCLKEFPEKQELHRNMLGLLGNVAEVKELRPQLMTSQFISVFSNLLESKADGIEVSYNACGVLSHIMFDGPEAWGVCEPQREEVEERMWAAIQSWDINSRRNINYRSFEPILRLLPQGISPVSQHWATWALYNLVSVYPDKYCPLLIKEGGMPLLRDIIKMATARQETKEMARKVIEHCSNFKEENMDTSR 251 T 0.0015 Arm_3 pdbpercent F Eukaryota T 7ep4 1 A,B B,A ZER1_HUMAN HZYG,ZYG-11 HOMOLOG B-LIKE PROTEIN,ZYG11B-LIKE PROTEIN GFLHVGKMGFVVTMLKLIQKKLLDKTCDQVMEFSWSALWNITDETPDNCEMFLNFNGMKLFLDCLKEFPEKQELHRNMLGLLGNVAEVKELRPQLMTSQFISVFSNLLESKADGIEVSYNACGVLSHIMFDGPEAWGVCEPQREEVEERMWAAIQSWDINSRRNINYRSFEPILRLLPQGISPVSQHWATWALYNLVSVYPDKYCPLLIKEGGMPLLRDIIKMATARQETKEMARKVIEHCSNFKEENMDTSR 253 T 0.0007 Arm_3 pdbpercent F Eukaryota T 7ep5 1 A,B A,B ZER1_HUMAN HZYG,ZYG-11 HOMOLOG B-LIKE PROTEIN,ZYG11B-LIKE PROTEIN GKLHVGKMGFVVTMLKLIQKKLLDKTCDQVMEFSWSALWNITDETPDNCEMFLNFNGMKLFLDCLKEFPEKQELHRNMLGLLGNVAEVKELRPQLMTSQFISVFSNLLESKADGIEVSYNACGVLSHIMFDGPEAWGVCEPQREEVEERMWAAIQSWDINSRRNINYRSFEPILRLLPQGISPVSQHWATWALYNLVSVYPDKYCPLLIKEGGMPLLRDIIKMATARQETKEMARKVIEHCSNFKEENMDTSR 253 T 0.001 Arm_3 pdbpercent F Eukaryota T 7ep7 2 B B WHRN_HUMAN AUTOSOMAL RECESSIVE DEAFNESS TYPE 31 PROTEIN TNQHFVMVEVHRPDSEPDVNEVRALPQTRT 30 T 190 Cwf_Cwc_15 pdbhh F Eukaryota T 7eqc 1 A,B,E,G B,C,D,G Q9XUS9_CAEEL CYK-4 GAGSMKSSTSKEKVCGENSRHIFNMILNSQRPQFDIKDIGMFHLIDEIERLRKLWKDSEESKKRLNADMREAEEALAKARKKLAMFDIDVKDTQKHLRALMEENKALKLDLNVYETREKQLKDA 124 T 0.0019 BRE1 pdbpercent F Eukaryota T 7eqg 3 H,I,M,N,O J,K,P,Q,R L7P7V3_9CAUD AcrIF5 MSRPTVVTVTETPRNPGSYEVNVERDGKMVVGRARAGSDPGAAAAKAMQMAMEWGSPNYVILGSNKVLAFIPEQLRVKM 79 T 0.067 Flagellin_D3 pdb T Viruses T 7esi 2 B B Peptide P1 EPSQQVTEIYQHHA 14 T 16 Inhibitor_I34 pdbhh F T 7esi 3 C C Peptide P2 DYAPTKLLPQQP 12 T 9.5 DUF724 pdbhh F T 7esx 1 A,B A,B B3CP62_WOLPP Bacteria factor 1 MPTQKELRDTMSKKLQEAIKHPDPAVVAGRKSAIKRWVGVLQDNFMEHIKYFKGDKLKFLHNVFQDEGCWSGVRLDNAALGQRFTEEKIGGIDNPLRKYEMACSYCVVDKIHPLFQKRFESYRNKFPPGAFDGKTETEFGKYVRNSLLDSIKRKGPVFDFWIDRESGELKKYDAVEGFDSAVKFKWSEGVEYFYNHLKEEDKEKKLTEAILALSRVQSVEKDAPILDFCVNKIVDKDTLLQKLSQKDKGVYSLFAELIESCFFDTVHDLVQCWCYKEVSAGGDHSEKIFSQRDYELFLSSLSDTMLKNPELSVQARSLIMEFWECGSLYQYRKAAVNTSNYTVPTSGVFAELIVNWRREDIYKTDEEKEIEKKEILDMMSFAKDCFPEKFELFKKLIIRDLRLCGREGKRVNVDYGLFAEELFSELEKTILPPGPVGDGPCSNLRSRSKAHGSKKTTLPVDDSPQSELGTPSVSGVSSYKKKSVFTLSGNKLEHHHHHH 499 T 0.21 DUF3437 pdbpssm F Bacteria T 7esy 2 B B B3CP63_WOLPP ULP_PROTEASE domain-containing protein MSNGDGLIRSLVDGDLEGFRQGFESFLDQCPSFLYHVSAGRFLPVFFFSMFSTAHDANILNANERVYFRFDNHGVNPRNGENRNTANLKVAVYRDGQQVVRCYSISDRPNSDGLRFSTRERNALVQEIRRQNPNLREEDLNFEQYKVCMHGKGKSQGEAIATVFEVIREKDRQGRDKFAKYSASEVHFLRQLFRNHRLTIKEIEGRQLNQNQLRQLGRSVNFTRVEPGQQRIDNFMEMLASNQRQDVRDSLRGDILEYVTDTYNNYRAQIENNIEGRSQKFESHGFLLGFLANFSHRYTIGVDLDLSPRNSHVAFLVRHQVERENIPIVINLATRAPPYIALNRARSHAERLHVFSFIPIHTESRNTVCVGLNFNLNLDPFSVDTVGLQQDRFPLVQRLFECLENEGIRENIRDFLLHHLPAEIPRNAENYDRIFDCITGFAFGNSAFDRHPLELEEEDEAPITKYIFRHGDEGLRCLTMVFHAEGSDIVILHIRAHDAQQQGAINLQTLNVNGNDVHVWEVSCTLNNQLELDIDLPNDLGLYHDYQNNNANNFLAGDLVQVPNTENVHNTLNQVVNDGWKNIAQHRGLFQEISGALMPLVDTINVNSEDKFRSILHGTFYASDNPYKVLAMYKVGQTYSLKRGQEEEGERVILTRITEQRLDLLLLRQPRENDLDTHPIGYVLRLANNAEEVGQQQNDARQEIGRLKKQHRGFIPITSGNEVVLFPIVFNRDAHEAGNLILFPEGIGREEHVHRLDRHVRLEHHHHHH 769 T 3.6E-05 PDDEXK_9 pdbhh F Bacteria T 7esz 2 B,D B,D B3CP73_WOLPP BACTERIA FACTOR A MESGLDHNYNKILDILKGAIKGDDNQVKARKHLRVERWLRAYIQLIEDFDEEKLIFFSDIFSDNSCWDGIKLKNKAVGERLTEEKNKNGKENPLDLADRYYLACKYCLEDKIPGLFEQVFMRFKRSAFEEDGSDDDLRRELLENIEETSPIEAFWSFLIDKQIGKLNEYKSVEGLQKSIQINSNKNWEEGIEFFYNKLHNDSSISSQDKDDLLIEAALSAVKGYKEVDTIEFCLSKMDDEQKKKLLDRDYKENTYYAVLNVLVGQYYFDSFMELSRLCSQIECERYTTFLSSLSDQVLKNPDLSEETKKCMMNVWERIIKLKTQDRGEQSISSIFVDYSVTYTIANLIVDPSRQGVSKEEILGKILKHVKEMSGEEMIKVKDSVLSKIQLFHGGKKLQLGEQVFSKLAQEASKESILREAGDTLPQSSLSTTDTPYNIKSLSHSKLEHHHHHH 453 T 34 Ldr_toxin pdbhh F Bacteria T 7eu3 24 X 9 F2CPQ4_HORVV Photosynthetic NDH subunit of subcomplex B4 NQRDWVVTKSIWHLSDTAIKSFYTFYAMFTVWGVCFFASMKASMADPFYDSEHYRGQGGDGTVHWYYDRQEDIEATARGDLLR 83 T 0.023 CCDC142 unp F Eukaryota T 7eu3 25 Y 0 F2DWH9_HORVV Photosynthetic NDH subunit of subcomplex B5 GPLTEIEPDLQEDPIDKWRTNGVSPEDFVYGVYDGHHTYDEGQEKKGFWEDVSEWYQEAEPPQGFQALISWSFPPAVILGMAFDVPGEYLYIGAAIFIVVFCIIEMDKPDKPHNFEPEIYMMERSKRDKLIADYNSMDIWDFNEKYGELWDFTVN 155 T 0.06 DUF3098 pdb F Eukaryota T 7eus 1 A,B A,B CTB9_CERBT 2-oxoglutarate (2-OG)-dependent dioxygenase MTSTTTTTETLQEAVPFVAPPSPPEDVNNKELPEKPYYDVEFNYRLDPRDGGDEVIWGGTVGLMRRKYETRTVRINNERGNEHNFNLDTHGFAWVKHKTSVTEFADYLAIRQGPYYGEVAEMLKRVTGATKVHVIGHLHRSLNYNDTTEEEKNAPDMTMTKGQTPGRFVHVDQSYQGAVRRLYLDLPQEEARRLEKTRWAIINVWRPVRKVTNEPLAVCDARSVREDELFNTLHLVPMRWPDAAPQENQMWAVAPPKTPTQHKWHYVSGMTEDEALLIKMFDSKKDGTARRVPHSSFPTPDDFGEPRASTETRCFVFWEDQEAEALEHHHHHH 333 T 0.5 EF-hand_5 pdbpercent F Eukaryota T 7ev8 2 B B PHOSP_PI3H4 Phosphoprotein MESDAKNYQIMDSWEEEPRDKSTNISSALNIIEFILSTDPQE 42 T 1.7 SBP_bac_1 pdbhh T Viruses T 7evn 2 B C SF3B1_HUMAN PRE-MRNA-SPLICING FACTOR SF3B 155 KDA SUBUNIT,SF3B155,SPLICEOSOME-ASSOCIATED PROTEIN 155,SAP 155 MASDYKDDDDKASDEVDAGTMKSVNDQPSGNLPFLKPDDIQYFDKLLVDVDESTLSPEEQKERKIMKLLLKIKNGTPPMRKAALRQITDKAREFGAGPLFNQILPLLMSPTLEDQERHLLVKVIDRILYKLDDLVRPYVHKILVVIEPLLIDEDYYARVEGREIISNLAKAAGLATMISTMRPDIDNMDEYVRNTTARAFAVVASALGIPSLLPFLKAVCKSKKSWQARHTGIKIVQQIAILMGCAILPHLRSLVEIIEHGLVDEQQKVRTISALAIAALAEAATPYGIESFDSVLKPLWKGIRQHRGKGLAAFLKAIGYLIPLMDAEYANYYTREVMLILIREFQSPDEEMKKIVLKVVKQCCGTDGVEANYIKTEILPPFFKHFWQHRMALDRRNYRQLVDTTVELANKVGAAEIISRIVDDLKDEAEQYRKMVMETIEKIMGNLGAADIDHKLEEQLIDGILYAFQEQTTEDSVMLNGFGTVVNALGKRVKPYLPQICGTVLWRLNNKSAKVRQQAADLISRTAVVMKTCQEEKLMGHLGVVLYEYLGEEYPEVLGSILGALKAIVNVIGMHKMTPPIKDLLPRLTPILKNRHEKVQENCIDLVGRIADRGAEYVSAREWMRICFELLELLKAHKKAIRRATVNTFGYIAKAIGPHDVLATLLNNLKVQERQNRVCTTVAIAIVAETCSPFTVLPALMNEYRVPELNVQNGVLKSLSFLFEYIGEMGKDYIYAVTPLLEDALMDRDLVHRQTASAVVQHMSLGVYGFGCEDSLNHLLNYVWPNVFETSPHVIQAVMGALEGLRVAIGPCRMLQYCLQGLFHPARKVRDVYWKIYNSIYIGSQDALIAHYPRIYNDDKNTYIRYELDYIL 872 T 0.0012 Adaptin_N pdbpercent F Eukaryota T 7evp 2 B,D C,D GP168_BPTWO GP168 MLFFKEKFYNELSYYRGGHKDLESMFELALEYIEKLEEEDEQQVTDYENAMEEELRDAVDVIESQLEIIKDIVR 74 T 0.067 DUF3810 pdb T Viruses T 7evr 2 B,D B,D SETD2_HUMAN HIF-1,HUNTINGTIN YEAST PARTNER B,HUNTINGTIN-INTERACTING PROTEIN 1,HIP-1,HUNTINGTIN-INTERACTING PROTEIN B,LYSINE N-METHYLTRANSFERASE 3A,PROTEIN-LYSINE N-METHYLTRANSFERASE SETD2,SET DOMAIN-CONTAINING PROTEIN 2,HSET2,P231HBP YPPGYPMQAYVDPSNPNAGKVLLPTP 26 T 0.027 DUF3592 pdb F Eukaryota T 7evs 2 C,D C,D SETD2_HUMAN HIF-1,HUNTINGTIN YEAST PARTNER B,HUNTINGTIN-INTERACTING PROTEIN 1,HIP-1,HUNTINGTIN-INTERACTING PROTEIN B,LYSINE N-METHYLTRANSFERASE 3A,PROTEIN-LYSINE N-METHYLTRANSFERASE SETD2,SET DOMAIN-CONTAINING PROTEIN 2,HSET2,P231HBP SNPNAGKVLLPTP 13 T 40 ANAPC16 pdbhh F Eukaryota T 7ew8 1 A,B A,B A0A2S6F2G5_LEGPN ANKD MGSSHHHHHHSSGLVPRGSHMASMLTPPPDSKISTTDKSLDKLSAPLDMLKQMNESTMEQTKLDELRKKMSLQAEILNKAKADNDMFFRLLIELMSLKLQGELFKEQLSKISKESGYDSAQSALIQATNSEGQSPLQYALQKQDFSTAKYFLDNGAKAGPIEKAVFEIALDSKAAKEFGFPPLPPEKEKLHPVKNFGLVLGIKTTSVDGTPSQFGHIAPTYQLMTDSVSHFAKSHPGNKNFQEIANAFQFSNEASAFKFSTPQRNPEAGNDLARRIQGGELTTIPVSCKGHAMGLSYVPDGPGSKSGYLVYTNRGLGAKSSEHGTHIFRIEDSSKITPEFINNMTSGHSNGASHDEIMSQIKAAAGNKEPIHHIKQKGQKNDNCTIANSKSNIEGILLCQKAREVGGFDKLTESDMDSVKKEYKEFTKHMRVEKVNELAKALKENPQDPDLNNLTKEYLKQHPNADPKLKQTLETALKQASESSMTLSQPGKTI 494 T 0.00015 Shigella_OspC unphh F Bacteria T 7ewj 2 C,F,I C,F,I L7PH55_STAAU PEMI INHIBITOR KSIEDRIKNFFQSGGKYTELEVDWEERVGREI 32 T 0.00075 DtxR unppercent F Bacteria T 7exe 2 C C ADA22_MOUSE ADAM 22 RPRSNSWQGNMGGNKKKIRGKRFRPRSNSTE 31 T 61 BBP1_N pdbhh F Eukaryota T 7exx 1 A A A0A0P7R7Y1_ECOLX DNA phosphorothioation-dependent restriction protein DptG GSHMYPIATNLKVSNNQLDSYLPIRNKNNNIDWQIVTGLVLSYAVKYKIDTYSLEQFREDCKTHLQILIDEPAFLSVLERMYFSSQDIFRVSPLFLLFHAQFDGEKISAGSTADKRLGTLFANLMRDFSLNNPIQDKLNFIEKEMLNKLNKKLIRLGEGPFAKEQPYLPYLVTCFQSDLAFLAEHPQYLLQELTNTLRLYAFSWCAQLALNLDNWQDGEPQSKSLFFILDTEKASSERDKIKLFGYKWFARQSEKLFPVLSALEVLQVKGEEKRPLWQVYQDCLGYSDTSNRVLNELNNYIQKFISKEERDLPERDRATNLEDAFKQLLSVAVEQFQGKKTERAAVNRKYINELESQICTDFIQVRGRAGKVLVLNQDRLLLLTNLTVGKNKKLRLHELLRGFEQRGFYLDNQSTQMLVAFYERMGNVERMSDSGDAVYVRKTV 444 T 0.04 DUF1798 pdbpercent F Bacteria T 7ey7 3 AA,BA,CA,DA,Y,Z C,D,E,F,A,B GP14_BPT7 GENE PRODUCT 14,GP14 MCWAAAIPIAISGAQAISGQNAQAKMIAAQTAAGRRQAMEIMRQTNIQNADLSLQARSKLEEASAELTSQNMQKVQAIGSIRAAIGESMLEGSSMDRIKRVTEGQFIREANMVTENYRRDYQAIFAQQLGGTQSAASQIDEIYKSEQKQKSKLQMVLDPLAIMGSSAASAYASGAFDSKSTTKAPIVAAKGTKTGR 196 T 0.056 Cpn60_TCP1 pdbpssm T Viruses T 7eyb 2 I,J,K,L,M,N,O,P A,B,C,D,E,F,G,H GP15_BPT7 GENE PRODUCT 15,GP15 MSKIESALQAAQPGLSRLRGGAGGMGYRAATTQAEQPRSSLLDTIGRFAKAGADMYTAKEQRARDLADERSNEIIRKLTPEQRREALNNGTLLYQDDPYAMEALRVKTGRNAAYLVDDDVMQKIKEGVFRTREEMEEYRHSRLQEGAKVYAEQFGIDPEDVDYQRGFNGDITERNISLYGAHDNFLSQQAQKGAIMNSRVELNGVLQDPDMLRRPDSADFFEKYIDNGLVTGAIPSDAQATQLISQAFSDASSRAGGADFLMRVGDKKVTLNGATTTYRELIGEEQWNALMVTAQRSQFETDAKLNEQYRLKINSALNQEDPRTAWEMLQGIKAELDKVQPDEQMTPQREWLISAQEQVQNQMNAWTKAQAKALDDSMKSMNKLDVIDKQFQKRINGEWVSTDFKDMPVNENTGEFKHSDMVNYANKKLAEIDSMDIPDGAKDAMKLKYLQADSKDGAFRTAIGTMVTDAGQEWSAAVINGKLPERTPAMDALRRIRNADPQLIAALYPDQAELFLTMDMMDKQGIDPQVILDADRLTVKRSKEQRFEDDKAFESALNASKAPEIARMPASLRESARKIYDSVKYRSGNESMAMEQMTKFLKESTYTFTGDDVDGDTVGVIPKNMMQVNSDPKSWEQGRDILEEARKGIIASNPWITNKQLTMYSQGDSIYLMDTTGQVRVRYDKELLSKVWSENQKKLEEKAREKALADVNKRAPIVAATKAREAAAKRVREKRKQTPKFIYGRKE 747 T 18 DUF3135 pdbpercent T Viruses T 7ezm 1 A A GNAI1_HUMAN;GNAQ_HUMAN ADENYLATE CYCLASE-INHIBITING G ALPHA PROTEIN,GUANINE NUCLEOTIDE-BINDING PROTEIN ALPHA-Q MGCTLSAEDKAAVERSKMIDRNLREDGEKARRELKLLLLGTGESGKSTFIKQMRIIHGSGYSDEDKRGFTKLVYQNIFTAMQAMIRAMDTLKIPYKYEHNKAHAQLVREVDVEKVSAFENPYVDAIKSLWNDPGIQECYDRRREYQLSDSTKYYLNDLDRVADPAYLPTQQDVLRVRVPTTGIIEYPFDLQSVIFRMVDVGAQRSERRKWIHCFENVTSIMFLVALSEYDQVLVESDNENRMEESKALFRTIITYPWFQNSSVILFLNKKDLLEEKIMYSHLVDYFPEYDGPQRDAQAAREFILKMFVDLNPDSDKIIYSHFTCSTDTENIRFVFAAVKDTILQLNLKEYNLV 353 T 6.9E-126 G-alpha unp F Eukaryota T 7ezn 1 A A J9VTB5_CRYNH Protein-tyrosine-phosphatase KEAMGHMQEVVDGLWVGDLVAANDDDELEKNGIKNILSALRPSLKFSDKYAVYPLEIDDSADTDLLSHLPSCVAWIKEILDLRQKAAEPSSQKNGTENGESLKRSPDIDTVAQPGKPGGVLVHCQAGMSRSASIVAAYLMSQYDLDPMEAMTMIREKRPVVEPSATFWHQLGLFYTTDGKVSLKDRSTRQYYMERTTTQFINGDG 205 T 1.1E-22 DSPc pdbpssm F Eukaryota T 7ezw 2 B B ALA-CYS-GLU-MET-GLY-PHE-PHE-GLN-ASP-CYS-GLY ACEMGFFQDCGX 12 T 1.2 RNA_pol_Rbc25 pdbhh F T 7ezx 11 MC,ME D5,D8 A0A5J4YX67_PORPP CaRSPs2 MWEQQRPRRCEAPAAPSSRPAERRAAARRSRAQLRMKQDDYEQWKTEFAGGFPGGEAFYKKWIEEGAKGDVPALEEELQPRSPNKKPTIYEEQMISNRGQQKGVDPTWKTLLAGGFPGGEFFFKKWIGEGAQGEVPNLDADLQPGSGSAKKTGKKEDADKSSPGGIMTPGRIMVPSGLGEEEETVDKEAPRPELYNKYFSADRLHKAPEILFEYNKTKYDRVGVRYTEVTSKASERFFPKSRMNRAPVIEISYREGAVSTASVSLSMPEISGPPALPFPVPKGDVTTTMVTDPTTGRLKLEFKVDGAAVSAYSDPRATEAYNKMIRK 327 T 0.016 Fz pdbpssm F Eukaryota T 7ezx 12 NC,NE E5,E8 A0A5J4YJY8_PORPP CaRSPs1 MAAFVSGGCGVGGQRRAWPAKGAAVARTHACPTTMVVLPVARAGLAATAKKNQYMGTSVAPEIVLTDKGSDMSRKVKTEDKKVAADQAAAMGILANMSLYASLNPVKRMTYKAKEQAPAYVKKTGNPVEDFYPSSWRNMAPVISLSANRVAVAFEKIDAASNGVKANSNNKPFWKSGATTKNYVAPEAPAQSEPETLDDAAYQRYFPARIRNKAPAMEFRRPSFANTEDPSAYFMLQKETVPLRMALAEKLLTKLGRKGDGSEAEEEKVKPQKKAAKKDAKDDAKDDE 288 T 89 DUF6243 pdbhh F Eukaryota T 7f0l 8 GA U U5NME9_CERS4 protein-U MPEVSEFAFRLMMAAVIFVGVGIMFAFAGGHWFVGLVVGGLVAAFFAATPNSN 53 T 0.05 TrbC pdbpercent F Bacteria T 7f0r 5 F G Q9HTR9_PSEAE Transcriptional factor SutA GAMGMSEEELEQDELDGADEDDGEELAAADDGEADSGDGDEAPAPGKKAKAAVVEEELPSVEAKQKERDALAKAMEEFLSRGGKVQEIEPNVVADPPKKPDSKYGSRPI 109 T 0.014 fvmX3 unppssm F Bacteria T 7f1n 1 A,B A,B A0A256XQM7_9CREN Beta-galactosidase MPFPEKFFWGASSSGFQFEMGDPEGKSIDPNTDWFKWVHDETNIRRGVVSGDLPEHGINYWDLFRSDHELAASIGMNAYRIGIEWSRIFPKPTLDVRVGIELDPEGYITRVEVDDKAIEELDLLANKEAVSRYREIILDLRDRGLKVFVCLNHFTLPLWIHDPIACRDTKLKRGPKGWVDKTTILEFAKYSAYMAWSLGNIVDYWVTFNEPMVVTEAGYFQPEVGFPPGLRNISAFKTACLNIANAHVVAYDLIKKYDKVRADDDSPSAAYVGIVHNIVPIKPYSERKLDLKAADLMNYIHNKWILEFIVRGKIDRSLVGREKYLIDKFKDKLDWLGVNYYTRIVLKGKWVPPLISPVPVIPDIVKGYGFNCTPGGRSLDGMPVSDFGWEVYPQGLSDALDIASEYGKPLIVTENGIADSEDNIRPYFLVSHLKVLEEYVEKKKNVYGYLHWALTDNYEWAQGFKMRFGLTDVDLETKERKPRESSEVFKIIASEKTVPEELVEKYPKPIF 511 T 1.4E-35 Glyco_hydro_1 unppercent F Archaea T 7f32 1 A A SYCNCLCRRGVCRCICTI SYCNCLCRRGVCRCICTI 18 T 3.5 EB pdbhh F T 7f38 1 A,B,C,D,E,F,G,H,I A,B,C,D,E,F,G,H,I A0A191SAV5_9CAUD Putative major capsid protein MPLTNLTPTELLANKAVDYLANSFLVETPMLGLLANRVINQKQKAIEWGAKVAQGVVGGRTRTGALANDTQGTIKGASLSVPDYYIKHQFDVGKDEIVNSDATGKISAVRDPVGTAIADAFDVLSKKINSVLYTASGVADATNYGIFGLDAAAGTTVANSATGTYAGISKVTFPRWRSIIQGGAVPGTNEALTIARMTAMLRARRTAGVTYKGNQNQRLVILTSDNIENDVLRPLYGTVVDNQNVDFTRLDKDLLPYVNYMVKGIPVVSDIDCPANKMYLLNLDKLAIYSFDQSDADQSNGKITYIPLRYVDETGDTPSESTLWVRLADVSDEHPDLLKFELSVALQLVAFDLIDSISVIRDITQ 365 T 0.083 Phage_cap_P2 pdbpercent T Viruses T 7f4i 6 F U SHU9119 XDHXRWK 7 T 13 TSA pdbhh F T 7f4l 1 A,F B,A I7M8B9_TETTS Transmembrane protein, putative MKKNGKSQNQPLDFTQYAKNMRKDLSNQDICLEDGALNHSYFLTKKGQYWTPLNQKALQRGIELFGVGNWKEINYDEFSGKANIVELELRTCMILGINDITEYYGKKISEEEQEEIKKSNIAKGKKENKLKDNIYQKLQQMQ 142 T 0.00014 Myb_DNA-binding unppercent F Eukaryota T 7f4v 7 AA,G,Q cI,aI,bI PSAZ_GLOVI PSI-Z MQSYNVFPALVIITTLVVPFMAAAALLFIIERDPS 35 T 0.72 MWFE pdbhh F Bacteria T 7f69 1 A A WIPI2_HUMAN WIPI-2,WIPI49-LIKE PROTEIN 2 GPGSGQLLFANFNQDNTSLAVGSKSGYKFFSLSSVDKLEQIYECTDTEDVCIVERLFSSSLVAIVSLKAPRKLKVCHFKKGTEICNYSYSNTILAVKLNRQRLIVCLEESLYIHNIRDMKVLHTIRETPPNPAGLCALSINNDNCYLAYPGSATIGEVQVFDTINLRAANMIPAHDSPLAALAFDASGTKLATASEKGTVIRVFSIPEGQKLFEFRRGVKRCVSICSLAFSMDGMFLSASSNTETVHIFKLETVKEKPPEEPTTWTGYFGKVLMASTSYLPSQVTEMFNQGRAFATVRLPFCGHKNICSLATIQKIPRLLVGAADGYLYMYNLDPQEGGECALMKQHRLDGSLV 354 T 0.084 WD40 unppercent F Eukaryota T 7f6g 2 B L SAR1-AngII XRVYIHPF 8 T 0.9 Adeno_PVIII pdbhh F T 7f6i 2 B L KNG1_HUMAN Kallidin KRPPGFSPFR 10 T 3.2E-05 Bradykinin unphh F Eukaryota T 7f6j 2 C C PDZD8_HUMAN SARCOMA ANTIGEN NY-SAR-84/NY-SAR-104 SAMGNSTGIKLVRKEGGLDDSVFIAVKEIGRDLYRGLPTEERIQKLEFMLDKLQNEIDQELEHNNSLVREEKETTDTRKKSLLSAALAKSGERLQALTLLMIHYRAGIEDIETLESLSLDQHSKKISKYTDDT 133 T 0.011 AAA_32 pdbpercent F Eukaryota T 7f7g 2 C,D C,D UNK-ARG-ILE-ARG-ARG-ASP-GLU-TYR-LEU-LYS-ALA-ILE-GLN-UNK XRIRRDEYLKAIQX 14 T 4.4 DUF6026 pdbhh F T 7f7o 2 B B Tracer 7 XAGESLYEKX 10 T 12 RPN6_C_helix pdbhh F T 7f7p 1 A,B A,B A0A377JKY9_HAEPA anti-CRISPR protein AcrIIC4 MKITSSNFATIATSENFAKLSVLPKNHREPIKGLFKSAVEQFSSARDFFKNENYSKELAEKFNKEAVNEAVEKLQKAIDLAEKQGIQFLEHHHHHH 96 T 1.3 Nif11 unphh F Bacteria T 7f91 1 A,B A,B THRCO_CORXX Thrombocorticin MGHHHHHHMTACTTGPQTISFPAGLIVSLNASVKSSRNESVEVKDSNGNTVSRGSGSSSSGGTFTVINMEPPTFISDGNDYTVELSPQATPGILQTESSRVDNGRLIWQNYAFGANDGGCIVGDRDFNDVFVLITGLVRG 140 T 0.0024 FlgD_ig unppercent F Eukaryota T 7f9f 1 A,B,C,D A,B,C,D THRCO_CORXX Thrombocorticin TACTTGPQTISFPAGLIVSLNASVQSSRNESVEVKDSNGNTVSRGSGSSSSGGTFTVINMEPPTFISDGNDYTVELSPQATPGILQTESSRVDNGRLIWQNYAFGANDGGCIVGDRDFNDVFVLITGLVRG 131 T 0.0024 FlgD_ig pdbpercent F Eukaryota T 7f9g 1 A,B A,B THRCO_CORXX Thrombocorticin MGHHHHHHMTACTTGPQTISFPAGLIVSLNASVQSSRNESVEVKDSNGNTVSRGSGSSSSGGTFTVINMEPPTFISDGNDYTVELSPQATPGILQTESSRVDNGRLIWQNYAFGANDGGCIVGDRDFNDVFVLITGLVRG 140 T 0.0024 FlgD_ig unppercent F Eukaryota T 7f9x 1 A A Q5ZTB4_LEGPH LotA TIDNPGLGNCAFYAFAIGLVNIIQEEAKYNRRTMFDRWVGLDRSISGQYDEILKLNLEDPDKELLDRLQSSLRIVTYQYQIRELRNVCVFRNGNYNRLTGNSNFVNFAALYYGDPLDTDSRFNPFADSVPILIKMANIDRDSVHPGHENDVLVPLFLDLLYGDTTNPADITLETEPKSDSPIITAMNNITQDFFWGTHLDLNYLAEAFEVNLHVLRNNSPIQEFVDIPERHTLTLTNSNNTHWTTQITTAR 251 T 0.026 Coagulase pdb F Bacteria T 7fad 1 A,C,E A,C,E Q6DDJ4_XENLA Gamma-tubulin complex component GPLGSMSEFRIHHDVNELISLLHVFGLEGADVYIDLLQKNRTPYVTTSVSTHSAKVKIAEFSRTPDDFLKKYEELKSKNTRNLDPLVYLLSKLIEDKETLQYLQQNAKDKAELATSSVTSVSLPIAPNTSKISMQELEELRRQLETATVAVSCSHQPVEVLRKFLRDK 168 T 0.014 DUF1993 pdb F Eukaryota T 7fao 1 A,B A,C Top7 Surface mutant GSHMDIQVQVNIDDNGKNFDYTYTVTTESELQKVLNELMDYIKAAGAARVRISITARTSSEAEKFAAILRKVFAELGYNDINVTFDGDTVTVEGQLE 97 T 0.0043 Yop-YscD_ppl pdbhh F T 7fax 2 B B Q38DC5_TRYB2 TbLeo1 peptide GSTLEDLFGPLFYVDKSL 18 T 1.3 RE_LlaMI pdbhh F Eukaryota T 7fb5 2 B B RETR1_HUMAN RETICULOPHAGY RECEPTOR 1 EGDDFELLDQSELDQIESELGLTQDQ 26 T 4.4 Uds1 pdbhh F Eukaryota T 7fb8 1 A B ASP-ASP-LYS-ASP-CYS-ASP-GLU-TYR-CYS-LYS-LYS-THR-LYS-GLU-NH2 DDKDCDEYCKKTKEX 15 T 0.71 Macin pdbhh F T 7fb8 2 B A GLU-LE1-THR-GLY-HIS-ILE-GLU-GLY-PRO-THR-LE1-THR-LE1-HIS-CYS-LYS-NH2 EXTGHIEGPTXTXHCKX 17 T 79 CoV_NSP10 pdbhh F T 7fba 1 A A GLU-CYS-ARG-GLU-TYR-GLY-PRO-LE1-LYS-LE1-LE1-ALA-NH2 ECREYGPXKXXAX 13 T 3 PHA-1 pdbhh F T 7fba 2 B B ALA-LE1-CYS-GLU-CYS-GLY-PRO-THR-ARG-GLU-CYS-LYS-NH2 AXCECGPTRECKX 13 T 0.36 DUF6315 pdbhh F T 7fbh 1 A,B,C A,B,C A0A1V4D079_9ACTN;A0A2Z5X7B9_9ACTN BezA MSNLDELASSRQTVLEPQDEVRIVGQYYDDKTAKLVRKYGPGPRIHYHVGYYPSSEAPRHTRDVTPDAFRRSIRLHQEGLLRYAAKIWGAEHRLSGRILDVGCGLGGGSLFWAQEYGADVTAVTNAPEHAPIVEGFARECGVGGRVRTLVCDAMHLPLDGGPYDAAVAIESSGYFDRPVWFERLAHVLRPGGSVCIEEVFTTRPHGADVWAEYFYTKPATVLDYAEAAKAAGFELVDDVDATSETLPFWEESTAWTKAVLDSDSTLSAVDRRQLRISLMANQALGAEWQAGGLRLGFLRFERK 303 T 2.7000000000000002E-30 CMAS unppssm F Bacteria T 7fbr 1 A A MATR3_MOUSE Matrin-3 GSSGSSGQKGRVETRRVVHIMDFQRGKNLRYQLLQLVEPFGVISNHLILNKINEAFIEMATTEDAQAAVDYYTTTPALVFGKPVRVHLSQKYKRIKSGPSSG 102 F F Eukaryota T 7fc0 1 A,F A,D RrMbnA precosur peptide MKIVIVKKVEIQVAGRTGMRCASSCGAKS 29 T 2.8 DUF5522 pdbhh F T 7fc0 4 D,E F,C A0A1I4IFH0_9BURK Methanobactin biosynthesis cassette protein MbnC MNAPTTAAAGAAPGRQVKDSELLARLADPAARGDFPPGCRAHVRIDISIRAYWHTLFDICPGLLDIADPDGMAIFAPFMDWARRENLTMGWSFYIWVGRWLAQSPWRERLDEELTQALLSASAARWAVLDRSADVGVVLGRRGSDDWIIGWKPNTLAAGRRVELVSLDGQLPRPAEDVGVFHLAGYELDSFPGWLALPR 199 T 4.2 PSD5 pdbpssm F Bacteria T 7fcn 1 A,B,C,D A,B,C,D G5DBH3_9GAMM Insecticidal protein SVYSNSPVPVYKDLNAVGPLSELTISPHASVEVFRIDTPIIPESRKSLRVVNTGLANSVTAKFYWSHSFTSEWFESGSIDVGLGEDKVLNVPSNSFYYSKFVIYNNTDKVAYVTANLV 118 T 0.84 DUF916 pdbhh F Bacteria T 7fdm 2 C,D C,D MYB29_ARATH MYB-RELATED PROTEIN 29,ATMYB29,PROTEIN HIGH ALIPHATIC GLUCOSINOLATE 3,PROTEIN PRODUCTION OF METHIONINE-DERIVED GLUCOSINOLATE 2 SSKKRCFKRSSSTSKLLNKVAARASSMGTILGASIEGTLISSTPLSSCL 49 T 55 DUF2375 pdbhh F Eukaryota T 7fe0 1 A,B,C A,B,C A0A8F7LEJ5_9ACTN AvmM MGSSHHHHHHSSGLVPRGSHMTSTVSTDGPVYREYKGFRVNDNIVADFIGVPAVITPGETIEFSVFYTNRGRYAYPDTGMNLVIWFSDRDDLRREDFKLFYKVSRADWQEQDPAKCWDPQFPAEGGVHIACQMSGPDGGILSKPDGTVPLPEVESVTAHVRLAFREGITSEHAGIFALPGMLDAPGDKSIIPGLFGNVFGRLQQASFRLGEGPSSLY 217 T 0.14 HSP90 unp F Bacteria T 7fe5 1 A,B,C A,B,C A0A8F7LEJ5_9ACTN AvmM MGSSHHHHHHSSGLVPRGSHMTSTVSTDGPVYREYKGFRVNDNIVADFIGVPAVITPGETIEFSVFYTNRGRYAYPDTGLNLVIWFSDRDDLRREDFKLFYKVSRADWQEQDPAKCWDPQFPAEGGVHIACQLSGPDGGILSKPDGTVPLPEVESVTAHVRLAFREGITSEHAGIFALPGMLDAPGDKSIIPGLFGNVFGRLQQASFRLGEGPSSLY 217 T 0.14 HSP90 unp F Bacteria T 7fe6 1 A,B,C A,B,C A0A8F7LEJ5_9ACTN AvmM MTSTVSTDGPVYREYKGFRVNDNIVADFIGVPAVITPGETIEFSVFYTNRGRYAYPDTGLNLVIWFSDRDDLRREDFKLFYKVSRADWQEQDPAKCWDPQFPAEGGVHIACQLSGPDGGILSKPDGTVPLPEVESVTAHVRLAFREGITSEHAGIFALPGMLDAPGDKSIIPGLFGNVFGRLQQASFRLGEGPSSLY 197 T 0.14 HSP90 pdb F Bacteria T 7fgj 3 C C VQILNK VQILNK 6 T 56 pKID pdbhh F T 7fgm 2 B B FAF1_HUMAN HFAF1,UBX DOMAIN-CONTAINING PROTEIN 12,UBX DOMAIN-CONTAINING PROTEIN 3A RQIVERQPRMLDFRVEYRDRNVDVVLEDTCTVGEIKQILENELQIPVSKMLLKGWKTGDVEDSTVLKSLHLPKNNSLYVLT 81 T 3.2E-05 YukD pdbhh F Eukaryota T 7fgn 1 A A FAF1_HUMAN HFAF1,UBX DOMAIN-CONTAINING PROTEIN 12,UBX DOMAIN-CONTAINING PROTEIN 3A GSHMLDFRVEYRDRNVDVVLEDTCTVGEIKQILENELQIPVSKMLLKGWKTGDVEDSTVLKSLHLPKNNSLYVLTPDL 78 T 3.1E-05 YukD pdbhh F Eukaryota T 7fgr 3 C C VQIFNK VQIFNK 6 T 49 ArlS_N pdbhh F T 7fi4 1 A,B,C,D,E,F,G,H A,B,C,D,E,F,G,H A0A3A9QXE8_MORCA AcrIF13 AGSMKLLNIKINEFAVTANTEAGDELYLQLPHTPDSQHSINHEPLDDDDFVKEVQEICDEYFGKGDRTMARLSYAGGQAYDSYTEEDGVYTTNTGDQFVEHSYADYYNVEVYCKADLV 118 T 0.019 DUF1882 unppssm F Bacteria T 7fia 1 A A A0A8F9PCN6_PSEAI AcrIF23 GSMTNFQTWLDSADIPVQQNGQWIDLETGIAYDPSYNYAANTRRASLSPRGIDARAVAKTFGGRALTGTARQKEWAEKIRAEKVQQMNQDQAEMACDPSGLLTAAKFWIENRNDSAQEIAGFVMQQKALLAQHRSAKAAGQADKVAKIAAEYNALTARWGF 161 T 8.3 DUF6440 pdbhh F T 7fik 10 J J A0A1L8H1I9_XENLA NUCLEAR PORE COMPLEX PROTEIN NUP133-LIKE XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXYFREVSQMEIIFECLVDKEEADLESTSIDSVEWANIVVNVNTILKDMLHVACQYRQSKNSLYKNESGIQEPEHVPWTASSGTAGIRSVVTRQHGIILKVYPQADSGLRTILIEQLAALLNYLLDDYVTQLKSIDKLANEERYNILEMEYAQKRSELLSPLLILGQYAWASNLAEKYCDFDILVQICEMTDNQSRLQRYMTLFAEQNFSDFLFRWYLEKGKRGKLLSQPASQHGQLAAFLQAHDHLSWLHELNSQEFEKAHRTLQTLANMETRYFCKKKTLLGLSKLAALASDFQEDVLQEKVEEIAEQEHFLLHQETLPKKLLEEKQLDLNAMPVLAPFQLIQLYVCEENKRANENDFMKALDLLEYIGDDSEVDVEELKLEILCKAIKRDEWSATDGKDDPIEATKDSIFVKVLQNLLNKGIELKGYLPKAETLLQSEELNSLKTNSYFEFSLKANYECYMKMQS 1140 T 9.7 Nucleoporin_C pdbpercent F Eukaryota T 7fit 1 A A Q73HD5_WOLPM bacteria factor 1 MPIETKRQAEVLKKLQDVIKHTDRDIAAGRKLAIKRWVETYIEYIKLFKDDKLEFLYNVFRDEGCWLGTRLNNTVLGQKLTEEKIGEIDNPLPRYGMASRYCITGKIGDFFNKQFVLSRGQFTSEEVDSQGNPISDQYVRNILLSSMKRNGPVFDFWIDRESGELKKYDAVEGFDSTVKLKWSEGVEYFYNQLEEKDKEKKLTEAIVALSRPQSVKRDAPILDFCVRNIGDKDTLLQKLLQKDKGVYFLLAELIESCFFDTVHDLVQCWCYKGVSAGGDCSDKIFSQQDYELFLYSLSNVMLKNPELSVQARSLIMEIWKCERFAEYRETSVNTSNYTVPIKSVLGGLIINWKREDVCKPDREIEKEEILDMISFAKGCFPEKFDLFKEVMIENLRICGREGKRKGVDYGKFAEELFLQLEKVTLPSVGDGPWNNLRSQSKVSLPLDGSGDGPQSEFEAPSVSGISGSHKKRRILEHHHHHH 482 T 0.5 PMAIP1 unppssm F Bacteria T 7fiv 1 A A A0A2K9VS01_9RICK CidA_I gamma/2 protein MPTQKELRDTMSKKLQEAIKHPDPAVVAGRKSAIKRWVGVLQDNFMEHIKYFKGDKLKFLHNVFQDEGCWSGVRLDNAALGQRFTEEKIGGIDNPLRKYEMACSYCVVDKIHPLFQKRFESYRNKFPPGAFDGKTETEFGKYVRNSLLDSIKRKGPVFDFWIDRESGELKKYDAVEGFDSAVKFKWSEGVEYFYNHLKEEDKEKKLTEAILALSRVQSVEKDAPILDFCVNKIVDKDTLLQKLSQKDKGVYSLFVELIESCFFDTVHDLVQCWCYKEVSAGGDHSEKIFSQRDYELFLSSLSDTMLKNPELSVQARSLIMEFWECGSLYQYRKAAVNTSNYTVPTSGVFAELIVNWRREDIYKTDEEKEIEKKEILDMMSFAKDCFPEKFELFKKLIIRDLRLCGREGKRVNVDYGLFAEELFSELEKTILPPGPVGDGPCSNLRSRSKAHGSKKTTLPVDDSPQSELGTPSVSGVSSYKKKSVFTLSGNKLEHHHHHH 499 T 0.012 DUF3437 pdbpssm F Bacteria T 7fiv 2 B B A0A2K9VS18_9RICK CidB_I b/2 protein MSNGDGLIRSLVDGDLEGFRQGFESFLDQCPSFLYHVSAGRFLPVFFFSMFSTAHDANILNANERVYFRFDNHGVNPRNGENRNTANLKVAVYRDGQQVVRCYSISDRPNSDGLRFSTRERNALVQEIRRQNPNLREEDLNFEQYKVCMHGKGKSQGEAIATVFEVIREKDRQGRDKFAKYSASEINLIRRLLGDHRLTIKEIEGRQLNQNQLRQLGRLVNFAQVAQGQQGIDNFMEMLASDRRQDVRDRIRREILPYITDIYNNYRQVLENNIENRNQRFEGHGFLLGFLANFSHRYTIGVDLDLSPRNSHVAFLVRHQVERENIPIVINLATRAPPYIALNRARSHAERLHVFSFIPIHTESRNTVCVGLNFNLNLDPFSVDTVGLQQDRFPLVQRLFECLENEGIRENIRDFLLHHLPAEIPRNAENYDRIFDCITGFAFGNSAFDRHPLELEEEDEAPITKYIFRHGDEGLRCLTMVFHAEGSDIVILHIRAHDAQQQGAINLQTLNVNGNDVHVWEVSCTLNNQLELDIDLPNDLGLYHDYQNNNANNFLAGDLVQVPNTENVHNTLNQVVNDGWKNIAQHRGLFQEISGALMPLVDTINVNSEDKFRSILHGTFYASDNPYKVLAMYKVGQTYSLKRGQEEEGERVILTRITEQRLDLLLLRQPRENDLDTHPIGYVLRLANNAEEVGQQQNDARQEIGRLKKQHRGFIPITSGNEVVLFPIVFNRDAHEAGNLILFPEGIGREEHVHRLDRHVRLEHHHHHH 769 T 3.8E-05 PDDEXK_9 pdbhh F Bacteria T 7fiw 2 B B A0A5B8WHG9_9RICK;Q73HD5_WOLPM bacteria factor 4,CidA I(Zeta/1) protein MPIETKKQAEVLKKLQDVIKHTDRDIAAGRKLAIKRWVETYIEYIKYFKDDKLEFLYNVFRDEGCWLGTRLNNTVLGQKLTEEKIGEIDNPLRRYGMASRYCITGKIHPLFQKRFESYRNKFPPGAFDGKTETEFGKYVRNSLLDSIKRKGPVFDFWIDRESGELKKYDAVEGFDSTVKLKWSEGVEYFYNQLEEKDKEKKLTEAIVALSRPQSVKRDAPILDFCVRNIGDKDTLLQKLLQKDKGVYFLLAELIESCFFDTVHDLVQCWCYKGVSAGGDCSDKIFSQRDYELFLSSLSDVMLKNPELSVQARSLIMEIWKCERFAEYRETSVNTSNYTVPIKSVLGELIINWKREDVCKPDREIEKEEILDMISFAKGCFPEKFDLFKEVMIRNLRLCGREGKRKGVDYGKFAEELFLQLEKVTLPSVGDGPWNNLRSQSKVSLPLDGSGDGPQSEFEAPSVSGISGSHKKRRILEHHHHHH 482 T 0.28 EFG_III pdbpercent F Bacteria T 7fix 6 F,FA,S F1,F3,F2 PSAF_THEVB PSI-F HHHHHHHHHHMRRFLALLLVLTLWLGFTPLASADVAGLVPCKDSPAFQKRAAAAVNTTADPASGQKRFERYSQALCGEDGLPHLVVDGRLSRAGDFLIPSVLFLYIAGWIGWVGRAYLIAVRNSGEANEKEIIIDVPLAIKCMLTGFAWPLAALKELASGELTAKDNEITVSPR 174 T 2.5E-07 PSI_PsaF unppercent F Bacteria T 7fj1 7 AA,Z Z,Y G3G8Y0_9ALPH VP1/2 RVVESDTLINRRYMRATGLGALALLIAACRLIARRLRETRTTLKGSARRFNVDLFQVRLILG 62 T 7.1 Alpha_GJ pdbpssm T Viruses T 7jfo 3 Q,R,S,T,U,V,W,X q,r,s,t,u,v,w,x Q94ET8_CHLRE LCI5 TNRVSPTRSVLPANWRQELESLRN 24 T 0.047 LigXa_C unp F Eukaryota T 7jgx 1 A A neuroVAL derived peptide ILE-PHE-TRP-LEU-PHE-ARG-GLY-LYS-ALA-ASP-VAL-ALA-LEU-NH2 IFWLFRGKADVALX 14 T 0.94 FRG pdbhh F T 7jh6 1 A,B,C,D A,B,C,D Two-domain di-Zn(II) and porphyrin-binding protein DYLRELLKLELQAIKQYEKLRQTGDELVQAFQRLREIFDKGDDDSLEQVLEEIEELIQKHRQLASELPKLELQAIKQYREALEYVKLPVLAKILEDEEKHIEWLKEAAKQGDQWVQLFQRFREAIDKGDKDSLEQLLEELEQALQKIRELTEKTGRKILEDEEKHIEWLETILG 174 T 0.00053 DUF5667 pdbpercent F T 7jhf 1 A A Protonectin-F derived peptide ILE-PHE-GLY-THR-ILE-LEU-GLY-PHE-LEU-LYS-GLY-LEU-NH2 IFGTILGFLKGLX 13 T 0.13 DUF445 pdbhh F T 7jhx 2 C,D C,D EPG5_HUMAN Ectopic P granules protein 5 homolog DEDPETSWILLN 12 T 0.36 TRI9 pdbhh F Eukaryota T 7jhy 3 H,I,J,K,L h,g,j,k,i L0JA79_9MYCO Csf4 (Cas11) MTTPTPTQVWRATVPELPPLVDEAGDTGSATARAADTAERLLLLLHYSIDWESSWVADPKHRKTYWDELLPGRVRRAAYRADTLDRWWSEVAGQLGAPAPRHRDRRLELATLLREPALPVITVLRDSLPALLLRVRIIAEAVAAQRGNNSAATSSADPNEPA 162 T 17 RBDV_coat pdbhh F Bacteria T 7ji2 3 E,F C,F OVA mutant peptide SIIQFEHL 8 T 9 KCTD4_C pdbhh F T 7jic 1 A A CD19_HUMAN B-LYMPHOCYTE SURFACE ANTIGEN B4,DIFFERENTIATION ANTIGEN CD19,T-CELL SURFACE ANTIGEN LEU-12 DYKDDDDLEVLFQGPPEEPLVVKVEEGDNAVLQCLKGTSDGPTQQLTWSRESPLKPFLKLSLGLPGLGIHMRPLAIWLFIFNVSQQMGGFYLCQPGPPSEKAWQPGWTVNVEGSGELFRWNVSDLGGLGCGLKNRSSEGPSSPSGKLMSPKLYVWAKDRPEIWEGEPPCLPPRDSLNQSLSQDLTMAPGSTLWLSCGVPPDSVSRGPLSWTHVHPKGPKSLLSLELKDDRPARDMWVMETGLLLPRATAQDAGKYYCHRGNLTMSFHLEITARPVLWHWLLRTGGWKVSAVTLAYLIFCLCSLVGILHLQRALVLRRKRKRMT 323 T 0.00011 G6B unphh F Eukaryota T 7jil 2 B C A0A1M5L9Q4_FLAJO 50S ribosomal protein L3 MSGLIGKKIGMTSIFDENGKNIPCTVIEAGPCVVTQVRTNEVDGYEALQLGFDDKNEKHSTKAALGHFKKAGTVAKKKVVEFQDFAAAQALGDLIDVSIFEEGEFVDVQGVSKGKGFQGVVKRHGFGGVGQATHGQHQRLRAPGSVGASSYPSRVFKGMRMAGRMGGDNVKVQNLRVLKVVAEKNLLVVKGCIPGHKNSYVIIQK 205 T 0.28 T2SS-T3SS_pil_N pdbpssm F Bacteria T 7jil 28 BA 5 A0A4V2PMH1_FLAJO 30S ribosomal protein S22 MPSGKKRKRHKVATHKRKKRARANRHKKKK 30 T 5.9 DUF1713 pdbpercent F Bacteria T 7jiy 1 A A A8E5C4_DANRE Granulin 1 CEGNFYCPAEKFCCKTRTGQWGCC 24 T 0.00024 Granulin unppercent F Eukaryota T 7jjl 2 B B KDM1A_HUMAN BRAF35-HDAC COMPLEX PROTEIN BHC110,FLAVIN-CONTAINING AMINE OXIDASE DOMAIN-CONTAINING PROTEIN 2,[HISTONE H3]-DIMETHYL-L-LYSINE(4) FAD-DEPENDENT DEMETHYLASE 1A TPEGRRTSRRKRAKVEYREMDESLAN 26 T 48 EFG_IV pdbhh F Eukaryota T 7jjv 1 A,B A,B GrAFP antifreeze protein MQCDGLDGADGTSNGQAGASGLAGGPNCNGGKGGKGAPGVGTAGGAGGVGGAGGTGNTNGGAGGSGGNSDVAAGGAGAAGGAAGGAGTGGTGGNGGAGKPGGAPGAGGAGTPAGSAGSPGQTTVLEHHHHHH 132 T 1100 NIP_1 pdbhh F T 7jk7 1 A A KDM1A_HUMAN BRAF35-HDAC COMPLEX PROTEIN BHC110,FLAVIN-CONTAINING AMINE OXIDASE DOMAIN-CONTAINING PROTEIN 2,[HISTONE H3]-DIMETHYL-L-LYSINE(4) FAD-DEPENDENT DEMETHYLASE 1A TPEGRRTERRKRAKVEYREMDESLAN 26 T 24 EFG_IV pdbhh F Eukaryota T 7jk9 1 A,AA,B,BA,C,CA,D,DA,E,EA,F,FA,G,GA,H,HA,I,IA,J,JA,K,KA,L,LA,M,MA,N,NA,O,P,Q,R,S,T,U,V,W,X,Y,Z A,BA,B,CA,C,DA,D,EA,E,FA,F,GA,G,HA,H,IA,I,JA,J,KA,K,LA,L,MA,M,NA,N,OA,O,P,Q,R,S,T,V,W,X,Y,Z,AA PORB_ARATH PCR B,NADPH-PROTOCHLOROPHYLLIDE OXIDOREDUCTASE B,POR B,LIGHT-DEPENDENT PROTOCHLOROPHYLLIDE OXIDOREDUCTASE MALQAASLVSSAFSVRKDAKLNASSSSFKDSSLFGASITDQIKSEHGSSSLRFKREQSLRNLAIRAQTAATSSPTVTKSVDGKKTLRKGNVVVTGASSGLGLATAKALAETGKWNVIMACRDFLKAERAAKSVGMPKDSYTVMHLDLASLDSVRQFVDNFRRTETPLDVLVCNAAVYFPTAKEPTYSAEGFELSVATNHLGHFLLARLLLDDLKKSDYPSKRLIIVGSITGNTNTLAGNVPPKANLGDLRGLAGGLNGLNSSAMIDGGDFDGAKAYKDSKVCNMLTMQEFHRRFHEETGVTFASLYPGCIASTGLFREHIPLFRALFPPFQKYITKGYVSETESGKRLAQVVSDPSLTKSGVYWSWNNASASFENQLSEEASDVEKARKVWEISEKLVGLA 401 T 0.07 adh_short pdbpercent F Eukaryota T 7jl6 1 A,B A,B SRRB_STAAU STAPHYLOCOCCAL RESPIRATORY RESPONSE PROTEIN B AMGRDSLINSMVEGVLGINESRQIILSNKMANDIMDNIDEDAKAFLLRQIEDTFKSKQTEMRDLEMNTRFFVVTTSYIDKIEQGGKSGVVVTVRDMTNEHNLDQ 104 T 0.0008 PAS_4 pdbhh F Bacteria T 7jmn 1 A E G0SGD2_CHATD MEDIATOR COMPLEX SUBUNIT 5 MVTVTDPLTARLEAAIKAWSDFFSDAEHERLDPAIFADQSQTLFANHPLAPVPLADLLLRPTPSNRECVDQRTLQYLQVLQKQGRITTAAVLRALYKYSTAHTRAQTPDGKPKHGAGDSSTNDADVGGSSKADLTSRMVRWRNSYMVEEDVLWRLARAVNHGTGIKTSHDVTEVAKVLARWTALFAEVSAAISRDAFNSMNGLQVKDESEDARNAFVLFYFAFCENQIVNETLSQPVCKDICRKLLDSLDAFLPTLMHLTADITGRLEHFRSEVLARYAPQEKKSMDMPSFMNDLSMSLESFQVPELPVVNTRAGLYIYLGAALVGRPMIDDEALFSYLHNRYQGDLQAMAVHLILASFDLLANAVFRNEGAKTGHLLKSFLINKVPLILVQLVAYAATTMYPFNAEMCITEALNQVDINMFPTLSGMFDMPNNNSFNDSVRQDFCFACQLHGLLSQAAIETLLGDITYQSLPPEGRYVKEQLVQACLQEPDRTLKLIGELDNMNGNVGAAAQAIVEICRDLASKPLSLDVLLLFDKPHKILHPLCELLDNWAGYEEDHGEYQPVYEEFGSVLLLLLAFVYRYNLSTADLGIRSSGSFVAKLLNGVDRCQPLEQLSEQEKSHLGGWIHGLFDTEAGGLGDELMSSCPPQDFYLLAPTLFHQIVNALSAGYLTDEMLKGGLEYLVDVLLLPALVPALLYLSNLLWADNQPIQNAVIKILQPILKPTSISNEASTMLSSVLNIVAKPLEHALKSYQRQDPECQKIEPLLLAIADNLAVSRRTGGADHTELESWCSAQITNPATGALIHGGLAAAVRTTIQQLVQWAQNPTLNSMNGMPAPYTHRQTLAAQQILGPHRLLGIILDELKSSPEPGIAYDVVTTMICAPDVRNSTISSPSTQSNNSSDHNDQAQNHQSQDAKHKHPHRLTLRDALRLEAHDFRAHLRADPVLAETVVRLYRRVEAQLTPLALPLPPAAAAAPAVGVNVGVDAATAAAAAAAAAMMPDALGLGVVGGVELGGMEGAIAAAVAAANGSGTGGAGGDGTQGGAGDAGMGLDGQQQGQGGSSAGDMGLGGGTADDIFSGLSGPDDFGADFGSWSMDLS 1099 T 7.6E-85 Med5 unppercent F Eukaryota T 7jn6 1 A A DF204_ARATH Defensin-like protein 204 AHCDHFLGEAPVYPCKEKACKSVCKEHYHHACKGECEYHGREVHCHCYGDYH 52 T 0.0013 SLR1-BP unp F Eukaryota T 7jqd 2 B B MAXA_LUTLO Peptide-43 CDATCQFRKAIDDCARQAYHSSVFKACMKQKKKEWKAGX 39 T 0.74 Clavanin unphh F Eukaryota T 7jqr 1 A A Abeta 16-36 beta-hairpin mimic VAL-ORN-LYS-LEU-VAL-MEA-PHE-ALA-GLY-ORN-ALA-ILE-ILE-GLY-LEU-MET VXKLVXFAGXAIIGLM 16 T 3.2 Beta-APP pdbhh F T 7jqs 1 A A Abeta 16-36 beta-hairpin mimic VAL-ORN-LYS-LEU-VAL-MEA-PHE-ALA-ASP-ORN-ALA-ILE-ILE-GLY-LEU-MET VXKLVXFADXAIIGLM 16 T 0.11 Beta-APP pdbhh F T 7jqt 1 A A Abeta 16-36 beta-hairpin mimic VAL-ORT-LYS-LEU-VAL-MEA-PHE-ALA-LYS-ORT-ALA-ILE-ILE-GLY-LEU-MET VXKLVXFAKXAIIGLM 16 T 0.79 Beta-APP pdbhh F T 7jqu 1 A,B,C A,B,C Abeta 16-36 beta-hairpin mimic VAL-ORN-LYS-LEU-VAL-MEA-PHE-ALA-GLN-ORN-ALA-ILE-ILE-GLY-LEU-MET VXKLVXFAQXAIIGLM 16 T 3.6 Beta-APP pdbhh F T 7jrg 10 J,T K,W QCR10 MAGLPARLRIQPADVKAAAMWGVAAATGGLYLVQVSILVLPPVKVVFHFYLVSGFRICLDVKDLRTMPCSAPRIWRLYILI 81 T 0.063 QCR10 pdbhh F T 7jrh 1 A A Cyclic peptide ASP-GLN-TRP-MLE-GLN-VAL-ASP-ORD-GLU-VAL-THR-GLY-ILE-ILE-THR-ORD DQWXQVDXEVTGIITX 16 T 6.3 GPHH pdbhh F T 7jro 8 H h A0A1S3V319_VIGRR Cytochrome c oxidase subunit 5C MAGPRIAHATLKGPSVVKEIIIGITLGLAAGSVWKMHHWNEQRKIRTFYDLLEKGEIGVVVDEQ 64 T 0.001 COX6C pdbhh F Eukaryota T 7js6 1 A A des-citrulassin F LLGRSGNDRLILSKN 15 T 0.95 hemP pdbhh F T 7jsd 1 A,B,C,D A,B,C,D Lysine hydroxylase GPHMDVHEIDETLEKFLAENYTPERVQQLADRFQRTGFVKFDSHMRIVPEELITAVRAEADRLVREHKERRDLVLGTTGGTPRNLSVVKSQDVEQSDLIRAVTRSEVLLTFLAGITRERIIPEVSDDERYLITHQEFASDTHGWHWDDYSFAFNWALRMPPIASGGMVQAVPHTHWDKNAPRINETLCERQIDTYGLVSGDLYLLRSDTTMHRTVPLTEDGAVRTMLVVSWSAERDLGKVLTGNDRWWENPEAGAAQPVHRAG 263 T 0.0012 2OG-FeII_Oxy_3 pdbpercent F T 7jsq 1 A A DNJB6_HUMAN HHDJ1,HEAT SHOCK PROTEIN J2,HSJ-2,MRJ,MSJ-1 MGNFKSISTSTKMVNGRKITTKRIVENGQERVEVEEDGQLKSLTINGKEQLLRLDNK 57 T 28 DUF1408 pdbhh F Eukaryota T 7jsx 3 Q,R,S,T,U,V,W,X q,r,s,t,u,v,w,x A0A2K3DA85_CHLRE EPYC1 RSSSASKKAVTPSRSALPSNWKQELESLRS 30 T 3.1 3-alpha pdbhh F Eukaryota T 7jta 1 A,B A,B A0A5B9TEE9_9BACT NTF2-like nuclease/anti-CRISPR GSSMGMVVEETRDLAETADCVVIEAILVDDGLRYRQLSVGIKDENGDIIRIVPISTVLI 59 T 0.029 Urease_alpha pdbpercent F Bacteria T 7jtk 18 IA s A0A2K3D359_CHLRE Uncharacterized protein MSDPEAEQGEQGYEESPEEPGPGSEAPSPSRIDNGLDTIIDIDPQTQHAEEGSNTAYESEQPDVISSYTGGQQEEDGEQAGNGAIDETTEEAAGEADDGGKASGFAVEVDAGTDAAAEGDLEPEPEPERPASASGEPQPTASTSRPASGAAARPASARPTSARPGSAAPRQPSASGGSRPGSGHPVNLAPDSVGLAQQQQQKSQIEVGAQAYEARGSSRPQSGGDAYGQAEEASAAAAAGRPSTSQSGSRPPPSREGVAVVPSIPEDQPLAVPIHIERYIAPGLKAIEVEVAQGPGMPHRLVRVLLDYTQCDAKPYLGGFRNKRTGAVYHHGATQTPRAPKYSEADRKLSRETQTVKIKQHSQQTVREQATQMARPGVLLDNDYDKEVTPGRYQTADERDEIVLRSTLRIQRWVRGWLGRKRAAYLRGKKMEREAFLRDQEARAQSEAEEHRRREIQRRMHPRTAADFEVLYNELEAWRLQETRKIKEAGLAKEQEQQVLQQLLHKETKLLQTIDRLKINANQENKEARIQHTLNEMSKPKKFALRNGGKVDVHTPFTTRAKELQQLYNGLNLPLLTVDERLDVLLHVKWTVKEFDCDLTRELVDLIDREADLLNRGRNPKMLEGLRKRISSLFLNFIETPEFNPEAVRFQIVPMDFEAYLYEQVGKATAKAGTSVGTRTLS 682 T 0.00029 IQ pdbpercent F Eukaryota T 7jtv 2 C,D E,H GLU-ALA-PRO-SER-ALA GAEAEAPSAVPDAAG 15 T 68 DUF6412 pdbhh F T 7ju9 1 A A Q7V450_PROMM PCN2.11 GRIDXCPAGGGXXEQXGXCC 20 T 0.03 Bacteriocin_IIc unppssm F Bacteria T 7jvf 1 A A Q7V449_PROMM Prochlorosin 2.10 AGGXIPXLMXGCGWLXGLCVR 21 T 0.00033 L_biotic_typeA unphh F Bacteria T 7jvs 2 B C RL27_STAA8 L27 ribosomal peptide XKLNLQFFASKKGX 14 T 13 mit_SMPDase pdbhh F Bacteria T 7jxt 1 A,B A,B PGH1_SHEEP CYCLOOXYGENASE-1,COX-1,PROSTAGLANDIN H2 SYNTHASE 1,PHS 1,PROSTAGLANDIN-ENDOPEROXIDE SYNTHASE 1 PVNPCCYYPCQHQGICVRFGLDRYQCDCTRTGYSGPNCTIPEIWTWLRTTLRPSPSFIHFLLTHGRWLWDFVNATFIRDTLMRLVLTVRSNLIPSPPTYNIAHDYISWESFSNVSYYTRILPSVPRDCPTPMDTKGKKQLPDAEFLSRRFLLRRKFIPDPQSTNLMFAFFAQHFTHQFFKTSGKMGPGFTKALGHGVDLGHIYGDNLERQYQLRLFKDGKLKYQMLNGEVYPPSVEEAPVLMHYPRGIPPQSQMAVGQEVFGLLPGLMLYATIWLREHNRVCDLLKAEHPTWGDEQLFQTARLILIGETIKIVIEEYVQQLSGYFLQLKFDPELLFGAQFQYRNRIAMEFNQLYHWHPLMPDSFRVGPQDYSYEQFLFNTSMLVDYGVEALVDAFSRQPAGRIGGGRNIDHHILHVAVDVIKESRVLRLQPFNEYRKRFGMKPYTSFQELTGEKEMAAELEELYGDIDALEFYPGLLLEKCHPNSIFGESMIEMGAPFSLKGLLGNPICSPEYWKASTFGGEVGFNLVKTATLKKLVCLNTKTCPYVSFHVPD 553 T 2.5E-05 An_peroxidase pdb F Eukaryota T 7jyn 2 B B NSD3_HUMAN NUCLEAR SET DOMAIN-CONTAINING PROTEIN 3,PROTEIN WHISTLE,WHSC1-LIKE 1 ISOFORM 9 WITH METHYLTRANSFERASE ACTIVITY TO LYSINE,WOLF-HIRSCHHORN SYNDROME CANDIDATE 1-LIKE PROTEIN 1,WHSC1-LIKE PROTEIN 1 EFTGSPEIKLKITKTIQNGRELFESSLCGDLLNEVQASE 39 T 47 DUF3587 pdbhh F Eukaryota T 7jzl 2 B,D,F E,G,F LCB1 DKEWILQKIYEIMRLLDELGHAEASMRVSDLIYEFMKKGDERLLEEAERLLEEVE 55 T 0.29 ER pdbhh F T 7jzo 2 C,D C,D LyCALTPP peptide core ANSRLPTSKI 10 T 10 DUF3697 pdbhh F T 7jzu 1 A A LCB1 GGSDKEWILQKIYEIMRLLDELGHAEASMRVSDLIYEFMKKGDERLLEEAERLLEEVERGS 61 T 0.5 ER pdbhh F T 7jzw 5 J J L7P7U3_9CAUD Type I-F anti-CRISPR protein MMTISKTDIDCYLQTYVVIDPVSNGWQWGIDENGVGGALHHGRVEMVEGENGYFGLRGATHPTEKEAMAAALGYLWKCRQDLVAIARNDAIEAEKYRAKA 100 T 1.8 TAL_effector pdbhh T Viruses T 7jzx 6 K J B3G1L5_PSEAI AcrF7 MSHASHNGEAPKRIEAMTTFTSIVTTNPDFGGFEFYVEAGQQFDDSAYEEAYGVSVPSAVVEEMNAKAAQLKDGEWLNVSHEA 83 T 0.15 Ribosomal_S19 pdbpercent F Bacteria T 7k04 1 A E RAD33_YEAST DNA repair protein RAD33 MSKSTNVSYERVELFENPKVPIEVEDEILEKYAESSLDHDMTVNELPRFFKDLQLEPTIWKLVRNEDVIIEGTDVIDFTKLVRCTCQLLILMNNLTVIDDLWSMLIRNCGRDVDFPQVALRDHVLSVKDLQKISNLIGADQSSGTIEMISCATDGKRLFMTYLDFGCVLGKLGYLKM 177 T 8.4E-13 Rad33 pdbpssm F Eukaryota T 7k1m 1 A A GLY-CYS-HIS-TYR-THR-PRO-PHE-GLY-LEU-ILE-CYS-PHE peptide GCHYTPFGLICF 12 T 2.3 DUF3951 pdbhh F T 7k28 2 C P NF2L2_HUMAN Nrf2 peptide,ADEETGEFL XADEETGEFLX 11 T 3.2 DUF4585 pdbhh F Eukaryota T 7k29 2 C P NF2L2_HUMAN ACE-LEU-ASP-GLU-GLU-THR-GLY-GLU-ALA-LEU-NH2 XLDEETGEALX 11 T 4.9 Adeno_100 pdbhh F Eukaryota T 7k2a 2 C P NF2L2_HUMAN ACE-LEU-ASP-GLU-GLU-THR-GLY-GLU-PHE-ALA-NH2 XLDEETGEFAX 11 T 1.3 MBF1 pdbhh F Eukaryota T 7k2b 2 C P NF2L2_HUMAN ACE-ALA-ASP-GLU-GLU-THR-GLY-GLU-PHE-ALA-NH2 XADEETGEFAX 11 T 10 DUF4585 pdbhh F Eukaryota T 7k2c 2 C P NF2L2_HUMAN Nrf2 peptide,ADEETGEAA XADEETGEAAX 11 T 41 Phi29_Phage_SSB pdbhh F Eukaryota T 7k2l 3 C P Nrf2 cyclic peptide,c[BAL-NPETGE] XNPETGE 7 T 1.7 RBDV_coat pdbhh F T 7k2n 2 C P (BAL)DPETGE XDPETGE 7 T 0.37 DUF4585 pdbhh F T 7k3h 1 A,B A,B Network hallucinated protein 0217 MGSSHHHHHHSSGLVPRGSHMSPIARQALDIAKSVLEHSKGMFDYWEGMLEQYEKTGDPDQANKLRQTLNRVKNSVGRLESALKRAERAYDTGNPDAAVGAVVELIGNVHEIMSTFHELFG 121 T 0.015 DUF2379 pdb F T 7k3j 2 B,D,F,H B,D,F,H PANX_DROME PROTEIN SILENCIO STLYKNAATQTERRTATRDAGTQVRLE 27 T 7.3 CFAP91 pdbhh F Eukaryota T 7k3k 2 B B PANX_DROME PROTEIN SILENCIO STLYKNAATQTERR 14 T 1 CFAP91 pdbhh F Eukaryota T 7k3l 2 B B PANX_DROME PROTEIN SILENCIO STATRDAGTQVRLE 14 T 7.8 NifQ unppercent F Eukaryota T 7k3s 1 A A BRCA1_MOUSE RING-TYPE E3 UBIQUITIN TRANSFERASE BRCA1 MNLSEDCSQSDILTTQQRATMKYNLIKLQQEMAHLEAVLEQRGNQPSGHSPSLEHHHHHH 60 T 0.0056 HrpB7 pdbpssm F Eukaryota T 7k58 11 K E Q23FU1_TETTS Flagellar outer dynein arm intermediate protein, putative KEFNNPINFQDTETRYGGIQNQVVNINQYVQRNPNFIDLDNIAELSEHSVNTERVKTGDRGMSHKEGGWPGNVDPNEAQETGRFKKRIEKDTSFPQAVKDLKEGVEKCIYQNNQIDLLEEYFEGETSEHVVENLSSKTLMLFKDEKEICKRSVSEISWHPEGPTKVAVSYAIMRFQQMPEKMPTQAYVWDLLNPNSPEIKLMSPSAVTNISYNQKIPDQIGGGCYNGLLAVWDGRKGENPIMISPVENSHYEPVTHFHWLMSKTGSECVTTSTDGKVMWWDTRKFEAGPVEKLNIIEGLGENEEIIGGTALEYNVEAGPSKFLIGTESGSILTANKKLKKPVEITTRYGLDQGRHLGPVYSINRSNQNPKYFLSVGDWSCKIWVEDLKTPIIRTKYHGSYLSDGCWSPTRSGAFFLVRRDGWMDVWDYYYRQNEIAFSHKVSDSPLTCIKINQTGGAYHNSGKLCAIGDQDGTVTILELCDSLYTMQPKEKDIINEMFEREYRKEKNLETIKKQQELAKRQVQKDMGSQKEKWEKKKLEMIETAEASFHENLAKNPV 557 T 0.13 DUF2247 pdb F Eukaryota T 7k58 12 L D I7M008_TETTS Dynein intermediate chain 2 LTAQELNEDMPSKMLEPKNPQAPKNITVYDYYTRKFKTDELVDQMIVHFSMDGDYIWKESNEYKTQEEIRDTKKALIKEAMRKQESEEPGANHDEEAIKQTLRNKFNYNTRECQTINPSIRERGVSTEPPPSDTICGNITQWEIFDAYYAEIMKDHQIENKKKKEVDQDKKQDQSMYSTSFKRCCKIMERMVVQNDQEDKYHDYRYYWSQGDNLEAGKNEGHLLPIWRFSNEKQRKKNVTSICWNPLYPDLFAVSLGSYDFTKQRMGLICLYSLKNTTHPEYAFNCEAGVMCLDFHPKSAALLAVGLYDGTVLVYDIRNKHKKPIYQSTVRNQKHTDPVWQVKWNPDTSKNYNFYSISSDGRVMNWILMKNKLEPEEVILLRLVGKNEEESTLIGLACGLCFDFNKFEPHIFLVGTEEGKIHKCSRAYSGQYQETYNGHLLAVYKVKWNNFHPRTFISASADWTVRIWDSKYTSQIICFDLSMMVVDAVWAPYSSTVFACATMDKVQVYDLNVDKLNKLAEQKIVKQPKLTNLSFNYKDPILLVGDSHGGVTLVKLSPNLCKSGPEIKQTEDKKAMEEFKNVKIEDYEREKMENL 595 T 0.004 WD40 pdb F Eukaryota T 7k5m 1 A,B,C,D,E,F A,B,C,D,E,F CAPSD_HBVD1 CORE ANTIGEN,CORE PROTEIN,HBCAG,P21.5 SMDIDPYKEFGATVELLSFLPSDFFPSVRDLLDTAAALYRDALESPEHCSPHHTALRQAILCWGDLMTLATWVGTNLEDPASRDLVVSYVNTNVGLKFRQLLWFHISCLTFGRETVLEYLVSFGVWIRTPPAARPPNAPILSTLPETTVVKLENLYFQ 158 T 3.9E-25 Hepatitis_core unp T Viruses T 7k76 3 E P PfCSP N-terminal peptide P17 KLRKPKHKKLKQPAD 15 T 7.6 P120R pdbhh F T 7k7a 1 A,B,C A,B,C TNR1A_HUMAN TUMOR NECROSIS FACTOR RECEPTOR 1,TNF-R1,TUMOR NECROSIS FACTOR RECEPTOR TYPE I,TNFR-I,P55,P60 GTTVLLPLVIFFGLALLSLLFIGLAYRYQR 30 T 0.13 Papilloma_E5A pdbhh F Eukaryota T 7k7h 4 H G A0A4Z0MXD9_SALET PERTUSSIS-LIKE TOXIN SUBUNIT ARTA FYDARPVIELILSK 14 T 3.5 DUF4334 pdbhh F Bacteria T 7k7r 3 C,F C,F EBNA1_EBVB9 EBNA1 peptide AA386-405 SQSSSSGSPPRRPPPGRRPF 20 T 26 ODV-E18 pdbhh T Viruses T 7k9b 1 A,B A,B Q9KGD7_BACHD OAPB GPSPEIGQIVKIVKGRDRDQFSVIIKRVDDRFVYIADGDKRKVDRAKRKNMNHLKLIDHISPEVRHSFEETGKVTNGKLRFALKKFLEEHADLLKEGE 98 T 0.023 FERM_F2 pdbpssm F Bacteria T 7kbb 3 C C UL128_HCMVA UL128 EECCEFINVNHPPERCYDFKMCNRFTVALRCPDGEVCYSPEKTAEIRGIVTTMTHSLTRQVVHNKLTSCNYNPLYLEADGRIRCGKVNDKAQYLLGAAGSVPYRWINLEYDKITRIVGLDQYLESVKKHKRLDVCRAKMGYMLQ 144 T 12 SH3_19 pdbhh T Viruses T 7kbb 5 E E U131A_HCMVM UL131A QCQRETAEKNDYYRVPHYWDACSRALPDQTRYKYVEQLVDLTLNYHYDASHGLDNFDVLKRINVTEVSLLISDFRRQNRRGGTNKRTTFNAAGSLAPHARSLEFSVRLFAN 111 T 0.064 Prion pdbpercent T Viruses T 7kbj 1 A,E G,I GANAB_MOUSE ALPHA-GLUCOSIDASE 2,GLUCOSIDASE II SUBUNIT ALPHA MGILPSPGMPALLSLVSLLSVLLMGCVAETGVDRSNFKTCDESSFCKRQRSIRPGLSPYRALLDTLQLGPDALTVHLIHEVTKVLLVLELQGLQKDMTRIRIDELEPRRPRYRVPDVLVADPPTARLSVSGRDDNSVELTVAEGPYKIILTAQPFRLDLLEDRSLLLSVNARGLMAFEHQRAPR 184 T 0.0016 NtCtMGAM_N pdbpercent F Eukaryota T 7kbq 1 A A DE NOVO DESIGNED OR689 MRIIVIIVTDEQKIEDMWEILKEIGVDRIVIITSNKQLAERAKELGVDRIFLLTDDELIAEIVKKLGADIVFSENRDIAKKIIRKLKNIIILSNDEQLVKELQKEASDARVFNVQTKQDFKDLIEKILEHHHHHH 135 T 0.00077 ADH_zinc_N pdb F T 7kbr 1 A,E G,I GANAB_MOUSE ALPHA-GLUCOSIDASE 2,GLUCOSIDASE II SUBUNIT ALPHA MGILPSPGMPALLSLVSLLSVLLMGCVAETGVDRSNFKTCDESSFCKRQRSIRPGLSPYRALLDTLQLGPDALTVHLIHEVTKVLLVLELQGLQKDMTRIRIDELEPRRPRYRVPDVLVADPPTARLSVSGRDDNSVELTVAEGPYKIILTAQPFRLDLLEDRSLLLSVNARGLMAFEHQRAP 183 T 0.0089 NtCtMGAM_N pdbpercent F Eukaryota T 7kdf 5 E E STU2_YEAST Y55_G0035590.MRNA.1.CDS.1 EESYKRAAAVTSTLKARIEKMKAKSRREGTTRT 33 T 0.1 SSP160 pdbpssm F Eukaryota T 7kdq 1 A A NDB4S_TITST Stigmurin analog StigA15 FFSLIPKLVGGLIKAFKX 18 T 0.48 Endotoxin_N pdbhh F Eukaryota T 7kei 3 C C HA peptide from 2009 H1N1 pandemic flu virus. AMERNAGSGIIISDGGGGSLVPRGS 25 T 2.9 Cuticle_1 pdbhh F T 7kev 3 C C cyclic peptide LDLR disruptor XFVSTXXXDRPCGX 14 T 33 Sod_Ni pdbhh F T 7kfa 3 C D 1-[2,6,10.14-TETRAMETHYL-HEXADECAN-16-YL]-2-[2,10,14-TRIMETHYLHEXADECAN-16-YL]GLYCEROL XFVPTTXXEAPCX 13 T 26 Chromadorea_ALT pdbhh F T 7kgb 51 YA v A0A3E0UTA6_MYCTX 50S ribosomal protein L37 MAKRGRKKRDRKYSKANHGKRPNS 24 T 21 Protamine_3 pdbhh F Bacteria T 7kgv 1 A,B A,B S38A9_DANRE SOLUTE CARRIER FAMILY 38 MEMBER 9 MDEDSKPLLGSVPTGDYYTDSLDPKQRRPFHVEPRNIVGEDVQERVSAEAAVLSSRVHYYSRLTGSSDRLLAPPDHVIPSHEDIYIYSPLGTAFKVQGGDSPIKNPSIVTIFAIWNTMMGTSILSIPWGIKQAGFTLGIIIIVLMGLLTLYCCYRVLKSTKSIPYVDTSDWEFPDVCKYYFGGFGKWSSLVFSLVSLIGAMVVYWVLMSNFLFNTGKFIFNYVHNVQTSDAFGTQGTERVICPYPDVDPHGQSSTSLYSGSDQSTGLEFDHWWSKTNTIPFYLILLLLPLLNFRSASFFARFTFLGTISVIYLIFLVTYKAIQLGFHLEFHWFDSSMFFVPEFRTLFPQLSGVLTLAFFIHNCIITLMKNNKHQENNVRDLSLAYLLVGLTYLYVGVLIFAAFPSPPLSKECIEPNFLDNFPSSDILVFVARTFLLFQMTTVYPLLGYLVRVQLMGQIFGNHYPGFLHVFVLNVFVVGAGVLMARFYPNIGSIIRYSGALCGLALVFVLPSLIHMVSLKRRGELRWTSTLFHGFLILLGVANLLGQFFM 549 T 1.3E-22 Aa_trans unppercent F Eukaryota T 7kh0 3 C A GNAI3_HUMAN;GNAS2_HUMAN G(I) ALPHA-3,ADENYLATE CYCLASE-STIMULATING G ALPHA PROTEIN GCTLSAEDKAAVERSKMIEKQLQKDKQVYRATHRLLLLGAGESGKSTIVKQMRILHVNGFNGDSEKATKVQDIKNNLKEAIETIVAAMSNLVPPVELANPENQFRVDYILSVMNVPDFDFPPEFYEHAKALWEDEGVRACYERSNEYQLIDCAQYFLDKIDVIKQDDYVPSDQDLLRCRVLTSGIFETKFQVDKVNFHMFDVGGQRDERRKWIQCFNDVTAIIFVVASSSYNMVIREDNQTNRLQEALNLFKSIWNNRWLRTISVILFLNKQDLLAEKVLAGKSKIEDYFPEFARYTTPEDATPEPGEDPRVTRAKYFIRDEFLRISTASGDGRHYCYPHFTCAVDTENIRRVFNDCRDIIQRMHLRQYELL 372 T 2.3E-123 G-alpha unp F Eukaryota T 7kh1 3 M,N,O,P,Q,R A3,B3,C3,D3,E3,F3 baseplate organization protein, gp11 MSLVNGMVESLNNTKSETEIGIGGYRLFARVRETVNYRNIVPTDTLEDGSSSTDDIINEPITVSIEGVVSNLFVEERQYPQLVSRDFSAVGEITALLPAKSQQQIQRISQIDSQIRDAVLAAERAERLAGKPYEFFGNSGNSAKTEQEKFIDFMEALYFSRRPTEVSVNFRDYKNMALVSFIPVRDNNTKDTRFTADFQQINYSTLVYTPVSSPSKSVSGKVSDASNKGGQNPESNETGERSLLSSLVGG 250 T 64 T2SSM_b pdbhh F T 7kh1 4 S,T,U,V,W,X A4,B4,C4,D4,E4,F4 baseplate stabilizing protein, gp12 MNLIENITSEYIQTHALEFSRGFAVLTLIYEQAVQMWKMNVVYTRAGDEEPQPPIYGVKLALSTTHIKHRNWPFDFTVIDTTNNGMDPYRADDFETGRCQLYFITPEEMIQVRGVDVQ 118 T 21 Frankia_peptide pdbhh F T 7kh1 7 QA,RA,SA,TA,UA,VA A7,B7,C7,D7,E7,F7 tail sheath initiator protein, gp15 MRVRTLDDNGDWTFGRGKADYITSKKAIAQTVSTRIKSWANDNPLAMNANIDWKDLLGRKGTEDTILREIERVVVQTDGVIRVTELEVIKTEKRVQSILLSYDTIYDDSETLEINDL 117 T 0.0097 DUF2634 pdbpssm F T 7kix 1 A A A0A125RN64_9CAUD Anti-CRISPR protein AcrIE2 MNTYLIDPRKNNDNSGERFTVDAVDITAAAKSAAQQILGEEFEGLVYRETGESNGSGMFQAYHHLHGTNRTETTVGYPFHVMELEHHHHHH 91 T 8.5 Baculo_E66 pdbhh T Viruses T 7kiy 2 B B Q8I060_PLAFA RHOPH2 MIKVTIFLLLSIFSFNLYGLELNEKVSIKYGAEQGVGSADSNTKLCSDILKYLYMDEYLSEGDKATFEKKCHNVIGNIRNTFSNKNTIKEGNEFLMSILHMKSLYGNNNNNNAGSESDVTLKSLYLSLKGSQNTEGESEVPSDDEINKTIMNFVKFNKYLLDNSNDIKKVHDFLVLTSQSNENLLPNKEKLFEQIVDQIKYFDEYFFASGGKIKVKKGYLKYNFLDIYKQPVCSAYLHLCSRYYESVSIYIRLKKVFNGIPAFLDKNCRKVKGEEFKKLMDMELKHNHIVERFDKYIISDDLYYVNMKVFDLKNVDKIQVSKIDDINNLNIYEHKETMHLSAKNLSRYIDIKKELNDEKAYKQLMSAIRKYVTTLTKADSDITYFVKQLDDEEIERFLIDLNFFLYNGFLRITEDKHLINADDVSPSYINLYRSNNIVALYILKTQYEENKLSEYRAHKFYRRKRVSNITNDMIKKDFTQTNALTNLPNLDNKKTTEYYLKEYENFVENFQPDLHDIMKLQLFFTMAFKDCNVNQNFTETSKKLWFDLLYAYDKFGWFYIHPNEVINSINKTDFVRHVLVSRNFLLKNNDQLTFLETQVAKIVEIINLSLEVDKSPDSLDFSIPMNFFNHKNGYHVMNDDKLKLLTSYEYIDSIANNYFFLSEYKNDVFRTGNNFKLYFNLPNIYSLAYQLFNELAININVITNVPLKKYLKYNASYAYFTLMNMIGKNHDIYSKGSRFVYASYILGLVFFIESHIDIARLKPKDFFFMKQSLPIIDHVYHKDLKTLKKNCTLLTDFMKINKNSQNYSLTHTEEMIKILGLLTVTLWAKEGKKSVYYDDDVSLYRKLMVSCVFNGGETIQEKLANNIEKSCDISQYGIKSKNLKDMIDINLSIHKWNPAEIEKLAYSFVLSCKMQKLMYKPMNVEKLPLEDYYKLPLAPDMVKTYHCYKLGKQAAKLLESIILKKKFVRFRVTDAIDVYDFFYIKKVLSSHIKKEYNEFLQDKRAFEKKELETILNNSPFSEEQTMKLINSYECHWFTSYENFRILWMHASSNLGTGTYLKNFFSELWQNIRFLFKSKLKIRDMEYFSGDISQMNLLDYYSPMVHSESHCQEKMQVLFITLRDSKEENRSEIAQKVKSAYYQCKLDYYKNHHSDFIHRIHPNDFLNNKVYVLKQPYYLMSNVPLNNPKKVSRLFVTEGTLEYLLLDKINIPECFGPCTKLHFNKVVIKESKQRIYDMTINNALVPEIQPYNRRKYMTIYINEAYIKNIVSDALTSEEIKRHDIQKGNIKICMGKSTYLTEPILTEEHFNLTHKPVYDFSSVKHNLKVFHMKNEHLVSEDPNDDCFINYPLATINLDISDPYKEISEDLIKNLYILKSS 1378 T 1.3 Crystall_2 pdbpercent F Eukaryota T 7kiy 3 C C A0A024X9S2_PLAFC RHOPH3 MRSKHLVTLFIITFLSFSTVKVWGKDVFAGFVTKKLKTLLDCNFALYYNFKGNGPDAGSFLDFVDEPEQFYWFVEHFLSVKFRVPKHLKDKNIHNFTPCLNRSWVSEFLKEYEEPFVNPVMKFLDKEQRLFFTYNFGDVEPQGKYTYFPVKEFHKYCILPPLIKTNIKDGESGEFLKYQLNKEEYKVFLSSVGSQMTAIKNLYSTVEDEQRKQLLKVIIENESTNDISVQCPTYNIKLHYTKECANSNNILKCIDEFLRKTCEKKTESKHPSADLCEHLQFLFESLKNPYLDNFKKFMTNSDFTLIKPQSVWNVPIFDIYKPKNYLDSVQNLDTECFKKLNSKNLIFLSFHDDIPNNPYYNVELQEIVKLSTYTYSIFDKLYNFFFVFKKSGAPISPVSVKELSHNITDFSFKEDNSEIQCQNVRKSLDLEVDVETMKGIAAEKLCKIIEKFILTKDDASKPEKSDIHRGFRILCILISTHVEAYNIVRQLLNMESMISLTRYTSLYIHKFFKSVTLLKGNFLYKNNKAIRYSRACSKASLHVPSVLYRRNIYIPETFLSLYLGLSNLVSSNPSSPFFEYAIIEFLVTYYNKGSEKFVLYFISIISVLYINEYYYEQLSCFYPKEFELIKSRMIHPNIVDRILKGIDNLMKSTRYDKMRTMYLDFESSDIFSREKVFTALYNFDSFIKTNEQLKKKNLEEISEIPVQLETSNDGIGYRKQDVLYETDKPQTMDEASYEETVDEDAHHVNEKQHSAHFLDAIAEKDILEEKTKDQDLEIELYKYMGPLKEQSKSTSAASTSDELAGSEGPSTESTSTGNQGEDKTTDNTYKEMEELEEAEGTSNLKKGLEFYKSSLKLDQLDKEKPKKKKSKRKKKRDSSSDRILLEESKTFTSENEL 897 T 11 Phage_TAC_10 pdbhh F Eukaryota T 7kj6 1 A,B A,B Q5ZSR1_LEGPH Ankyrin repeat-containing protein SNALTPPPDSKISTTDKSLDKLSAPLDMLKQMNESTMEQTKLDELRKKMSLQAEILNKAKADNDMFFRLLIELMSLKLQGELFKEQLSKISKESGYDSAQSALIQATNSEGQSPLQYALQKQDFSTAKYFLDNGAKAGPIEKAVFEIALDSKAAKEFGFPPLPPEKEKLHPVKNFGLVLGIKTTSVDGTPSQFGHIAPTYQLMTDSVSHFAKSHPGNKNFQEIANAFQFSNEASAFKFSTPQRNPEAGNDLARRIQGGELTTIPVSCKGHAMGLSYVPDGPGSKSGYLVYTNRGLGAKSSEHGTHIFRIEDSSKITPEFINNMTSGHSNGASHDEIMSQIKAAAGNKEPIHHIKQKGQKNDNCTIANSKSNIEGILLCQKAREVGGFDKLTESDMDSVKKEYKEFTKHMRVEKVNELAKALKENPQDPDLNNLTKEYLKQHPNADPKLKQTLETALKQASESSMTLSQPGKTI 473 T 0.00015 Shigella_OspC pdbhh F Bacteria T 7kjk 1 A,B,C,D,E,F A5,B5,C5,D5,E5,F5 Tail terminator protein MSQSIINVARYIRDLLDYDENLIQFDRKNTQQSDTVTGYIVVNGSGVQNVLSHGSSYDGDAEIMEYSKSESRLITLEFYGSDAYENAELFSLLNQSQKAKEVSRGLGLTIYNVSQATDVKQLLGYQYGNRVHVDFNIQYCPSVYVETLRVDASEFEILVDD 161 T 33 Collectrin pdbhh F T 7kjk 3 AA,BA,CA,DA,Y,Z C4,D4,E4,F4,A4,B4 Head completion protein MLPNMRSALKMFEQSVLLKSVETIRVDFVDDIIITATPIRAVVQVADKKKLNLDSLDWSKQYIWVHSGSKMEIGQFIEWHGKDFKLVAAGDDYSDYGYNAWYGEETLKPVLVSS 114 T 0.014 Hepatitis_core pdbpssm F T 7kkm 1 A,B,C,D A,B,C,D TNKS1_HUMAN PARP GSQGTILLDLAPEDKEYQSVEEEMQSTIREHRDGGNAGGIFNRYNVIRIQKVVNKKLRERFCHRQKEVSEENHNHHNERMLFHGSPFINAIIHKGFDERHAYIGGMFGAGIYFAENSSKSNQYVYGIGGGTGCPTHKDRSCYICHRQMLFCRVTLGKSFLQFSTMKMAHAPPGHHSVIGRPSVNGLAYAEYVIYRGEQAYPEYLITYQIMKPE 213 T 0.011 PARP pdb F Eukaryota T 7kl5 2 B B RYR2_HUMAN RYR-2,RYR2,HRYR-2,CARDIAC MUSCLE RYANODINE RECEPTOR,CARDIAC MUSCLE RYANODINE RECEPTOR-CALCIUM RELEASE CHANNEL,TYPE 2 RYANODINE RECEPTOR FALRYNILTLMRMLSLKSLKKQMKKVKKMT 30 T 5.8 SRP_SPB pdbhh F Eukaryota T 7klc 5 E A Q2N0S5_9HIV1 HIV-1 clade A BG505 gp120,HIV-1 clade A BG505 gp120 VWKDAETTLFCASDAKAYETEKHNVWATHACVPTDPNPQEIHLENVTEEFNMWKNNMVEQMHTDIISLWDQSLKPCVKLTPLCVGAGNCNTSAITQACPKVSFEPIPIHYCAPAGFAILKCKDKKFNGTGPCPSVSTVQCTHGIKPVVSTQLLLNGSLAEEEVMIRSENITNNAKNILVQFNTPVQINCTRPNNNTRKSIRIGPGQAFYATGDIIGDIRQAHCNVSKATWNETLGKVVKQLRKHFGNNTIIRFANSSGGDLEVTTHSFNCGGEFFYCNTSGLFNSTWISNTSVQGSNSTGSNDSITLPCRIKQIINMWQRIGQAMYAPPIQGVIRCVSNITGLILTRDGGSTNSTTETFRPGGGDMRDNWRSELYKYKVVKIE 383 T 1.5999999999999998E-49 GP120 unp T Viruses T 7kld 2 C,E,F C,E,Q LYS-LEU-ASN-LEU-GLN-PHE-PCS KLNLQFX 7 T 2.2 Aquarius_N pdbhh F T 7klz 2 C,D C,D GEMI_HUMAN Geminin peptide AEGTVSSSTDALPCI 15 T 23 BCAS2 pdbhh F Eukaryota T 7kmx 1 A,B,C,D,E,F,G a,b,c,d,e,f,g A0A7D7FKF5_9CAUD Minor capsid protein MAFNNAVLQEVSDLPAGEVIKASPHNVSAFEVFQNGLIEGRFVKFDAGSIDILDASATPTIAGIAKRKVTGEIGPGVYSTSGIEIDQVAEVINFGFATVTVQDAAAPSKYDPVYAINLDSAEAGKATENSGATGALAVADCVFWEQKAANVWLVRMNKFL 160 T 13 DUF2292 pdbhh T Viruses T 7knf 2 C,D C,D DTY-ASP-TYR-PRO-GLY-ASP-HIS-CYS-TYR-LEU-TYR-GLY-THR XXDYPGDHCYLYGTX 15 T 7.9 Gln_deamidase_2 pdbhh F T 7kng 2 C,D C,D DTY-ASP-TYR-PRO-GLY-ASP-PHE-CYS-TYR-LEU-TYR-GLY-THR-CYS XXDYPGDFCYLYGTCX 16 T 0.3 DUF5714 pdbhh F T 7kpk 2 B B PDX1_HUMAN Pdx1 peptide LSASPQPSSVAPRRPQEPR 19 T 25 Pim pdbhh F Eukaryota T 7kpo 1 A A Q9KUA1_VIBCH Response regulator GSSKQDLMRAVLVEAMTSALNYWERVSGQSKFTFAEQSGLWRVYLDRSTLQTRTLDKYLRIETLPKTPRWRTVLNSLDYILEHCKEAGPERTHIEMQRDKLQKLLTSE 108 T 0.079 DUF3024 pdb F Bacteria T 7kpq 1 A A FAD-dependent monooxygenase CtdE MTKTPEAPVPRTMEKDHTQQINVIIVGLGIAGLTAAIECHRKGHKVIAFEKTPKMMHIGDIFSIGPNAESVIRQWKDGAISRALNEARCAIDEIKVFDETGKLQNVNTMEGYREGEGYVINRAEAVDIFFEYAQSLGIDIRFNSNVTEYWETPHNAGIIVDGLKIEADCVIATDGIHSKARNAICGAVVQPKKTGSAIYRSGYAMEELRGHSGAVWLTEGKEDVDQLYHFIGKDITVLVGTGRRGKDVYWGCMHKSLHDVSESWIQVSDVRRAIELISDWNVRDRLEPIMACTPQGKCFDHLVMTMDQLPSWVSPKHRMIVLGDAAHPFLPNTGQGANQAIEDGATVAICLELAGKNQVTKGVQVAERLRYQRVAKIQELGHRMLKTLQNADWDGEKDEDAPTMITRPAWIYSHDCQQYAYNEFQTVAQLVSERRDFHHHHHH 443 T 3.6E-12 FAD_binding_3 pdbpercent F T 7kpr 1 A,B A,B PPM1H_HUMAN Protein phosphatase 1H GSHMSDLPLRFPYGRPEFLGLSQDEVECSADHIARPILILKETRRLPWATGYAEVINAGKSTHNEDQASCEVLTVKKKAGAVTSTPNRNSSKRRSSLPNGEGLQLKENSESEGVSCHYWSLFDGHAGSGAAVVASRLLQHHITEQLQDIVDILKNSAVLPPTCLGEEPENTPANSRTLTRAASLRGGVGAPGSPSTPPTRFFTEKKIPHECLVIGALESAFKEMDLQIERERSSYNISGGCTALIVICLLGKLYVANAGASRAIIIRNGEIIPMSSEFTPETERQRLQYLAFMQPHLLGNEFTHLEFPRRVQRKELGKKMLYRDFNMTGWAYKTIEDEDLKFPLIYGEGKKARVMATIGVTRGLGDHDLKVHDSNIYIKPFLSSAPEVRIYDLSKYDHGSDDVLILATDGLWDVLSNEEVAEAITQFLPNCDPDDPHRYTLAAQDLVMRARGVLKDRGWRISNDRLGSGDDISVYVIPLIHGNKLS 486 T 8.5E-20 PP2C pdbpercent F Eukaryota T 7kq0 2 B,D,F B,D,F CDN1A_HUMAN LYS-ARG-ARG-GLN-THR-SER-MET-THR-ASP-TYR-TYR-HIS-SER-LYS-ARG KRRQTSMTDYYHSKR 15 T 1.7 CDC27 pdbhh F Eukaryota T 7kq1 2 B,D,F B,D,F CDN1A_HUMAN LYS-ARG-ARG-GLN-THR-SER-MET-THR-ASP-PHE-TYR-HIS-SER-LYS-ARG KRRQTSMTDFYHSKR 15 T 0.37 CDC27 pdbhh F Eukaryota T 7kqk 3 C,F C,P TAU_HUMAN pTau peptide KKVAVVRTPP 10 T 2 Sulfotransfer_2 pdbhh F Eukaryota T 7kqr 1 A,B A,B Heme-dependent L-tyrosine hydroxylase GHMNTGTGTVLTELPDHGRWDFGDFPYGLEPLTLPEPGSLEAADSGSVPAEFTLTCRHIAAIAAGGGPAERVQPADSSDRLYWFRWITGHQVTFILWQLLSRELARLPEEGPERDAALKAMTRYVRGYCAMLLYTGSMPRTVYGDVIRPSMFLQHPGFSGTWAPDHKPVQALFRGKKLPCVRDSADLAQAVHVYQVIHAGIAARMVPSGRSLLQEASVPSGVQHPDVLGVVYDNYFLTLRSRPSSRDVVAQLLRRLTAIALDVKDNALYPDGREAGSELPEELTRPEVTGHERDFLAILSEVAEEATGSPALASDR 316 T 3.1 Hs1pro-1_C pdbhh F T 7ksq 18 R O A0A2K1JDE1_PHYPA PsaO NRDWLRRDLSVIGFGLIGWLAPSSLPVINGNSLTGLFLGSIGPELAHFPTGPALTSPFWLWMVTWHVGLFIVLTFGQIGFKGRQDGYW 88 T 0.1 Plasmid_RAQPRD unppercent F Eukaryota T 7ktr 3 C C SP20H_HUMAN P38-INTERACTING PROTEIN,P38IP, SUPT20H MQQALELALDRAEYVIESARQRPPKRKYLSSGRKSVFQKLYDLYIEECEKEPEVKKLRRNVNLLEKLVMQETLSCLVVNLYPGNEGYSLMLRGKNGSDSETIRLPYEEGELLEYLDAEELPPILVDLLEKSQVNIFHCGCVIAEIRDYRQSSNMKSPGYQSRHILLRPTMQTLICDVHSITSDNHKWTQEDKLLLESQLILATAEPLCLDPSIAVTCTANRLLYNKQKMNTRPMKRCFKRYSRSSLNRQQDLSHCPPPPQLRLLDFLQKRKERKAGQHYDLKISKAGNCVDMWKRSPCNLAIPSEVDVEKYAKVEKSIKSDDSQPTVWPAHDVKDDYVFECEAGTQYQKTKLTILQSLGDPLYYGKIQPCKADEESDSQMSPSHSSTDDHSNWFIIGSKTDAERVVNQYQELVQNEAKCPVKMSHSSSGSASLSQVSPGKETDQTETVSVQSSVLGKGVKHRPPPIKLPSSSGNSSSGNYFTPQQTSSFLKSPTPPPSSKPSSIPRKSSVDLNQVSMLSPAALSPASSSQRSGTPKPSTPTPTPSSTPHPPDAQSSTPSTPSATPTPQDSGFTPQPTLLTQFAQQQRSLSQAMPVTTIPLSTMVTSITPGTTATQVMANSAGLNFINVVGSVCGAQALMSGSNPMLGCNTGAITPAGINLSGLLPSGGLLPNALPSAMQAASQAGVPFGLKNTSSLRPLNLLQLPGGSLIFNTLQQQQQQLSQFTPQQPQQPTTCSPQQPGEQGSEQGSTSQEQALSAQQAAVINLTGVGSFMQSQAAAVAILAASNGYGSSSSTNSSATSSSAYRQPVKK 811 T 6.5E-20 Spt20 unp F Eukaryota T 7ktr 10 J J TADA1_HUMAN SPT3-ASSOCIATED FACTOR 42,STAF42,TRANSCRIPTIONAL ADAPTER 1-LIKE PROTEIN MATFVSELEAAKKNLSEALGDNVKQYWANLKLWFKQKISKEEFDLEAHRLLTQDNVHSHNDFLLAILTRCQILVSTPDGAGSLPWPGGSAAKPGKPKGKKKLSSVRQKFDHRFQPQNPLSGAQQFVAKDPQDDDDLKLCSHTMMLPTRGQLEGRMIVTAYEHGLDNVTEEAVSAVVYAVENHLKDILTSVVSRRKAYRLRDGHFKYAFGSNVTPQPYLKNSVVAYNNLIESPPAFTAPCAGQNPASHPPPDDAEQQAALLLACSGDTLPASLPPVNMYDLFEALQVHREVIPTHTVYALNIERIITKLWHPNHEELQQDKVHRQRLAAKEGLLLC 335 T 4.4E-17 SAGA-Tad1 pdbpercent F Eukaryota T 7ktt 1 A A VINC_HUMAN VINCULIN, MV MPVFHTRTIESILEPVAQQISHLVIMHEEGEVDGKAIPDLTAPVAAVQAAVSNLVRVGKETVQTTEDQILKRDMPPAFIKVENACTKLVQAAQMLQSDPYSVPARDYLIDGSRGILSGTSDLLLTFDEAEVRKIIRVCKGILEYLTVAEVVETMEDLVTYTKNLGPGMTKMAKMIDERQQELTHQEHRVMLVNSMNTVKELLPVLISAMKIFVTTKNSKNQGIEEALKNRNFTVEKMSAEINEIIRVLQLTSWDEDAWASKDTEAMKRALASIDSKLNQAKGWLRDPSASPGDAGEQAIRQILDEAGKVGELCAGKERREILGTCKMLGQMTDQVADLRARGQGSSPVAMQKAQQVSQGLDVLTAKVENAARKLEAMTNSKQSIAKKIDAAQNWLADPNGGPEGEEQIRGALAEARKIAELCDDPKERDDILRSLGEISALTSKLADLRRQGKGDSPEARALAKQVATALQNLQTKTNRAVANSRPAKAAVHLEGKIEQAQRWIDNPTVDDRGVGQAAIRGLVAEGHRLANVMMGPYRQDLLAKCDRVDQLTAQLADLAARGEGESPQARALASQLQDSLKDLKARMQEAMTQEVSDVFSDTTTPIKLLAVAATAPPDAPNREEVFDERAANFENHSGKLGATAEKAAAVGTANKSTVEGIQASVKTARELTPQVVSAARILLRNPGNQAAYEHFETMKNQWIDNVEKMTGLVDEAIDTKSLLDASEEAIKKDLDKCKVAMANIQPQMLVAGATSIARRANRILLVAKREVENSEDPKFREAVKAASDELSKTISPMVMDAKAVAGNISDPGLQKSFLDSGYRILGAVAKVREAFQPQEPDFPPPPPDLEQLRLTDELAPPKPPLPEGEVPPPRPPPPEEKDEEFPEQKAGEVINQPMMMAARQLHDEARKWSSKPGIPAAEVGIGVVAEADAADAAGFPVPPDMEDDYEPELLLMPSNQPVNQPILAAAQSLHREATKWSSKGNDIIAAAKRMALLMAEMSRLVRGGSGTKRALIQCAKDIAKASDEVTRLAKEVAKQCTDKRIRTNLLQVCERIPTISTQLKILSTVKATMLGRTNISDEESEQATEMLVHNAQNLMQSVKETVREAEAASIKIRTDAGFTLRWVRKTPWYQHHHHHHHH 1142 T 1.9E-200 Vinculin pdb F Eukaryota T 7kuw 1 A A Sequence-Based Designed Protein nmt_0994_guided_02 DEREIARKVASELQKFSEWVKKLKEVIKKASPEQQTKIAQWVAKLAGVRPEDVKKIIKAFND 62 T 0.004 FliG_N pdb F T 7kw6 1 A A Q65JI8_BACLD PROCESSIVE CELLULASE FROM GLYCOSIDE HYDROLASE FAMILY 48 MDNKTRFMQLYEQIKNPNNGYFSPEGIPYHSVETLICEAPDYGHMTTSEAYSYWLWLEAMYGRYTQDWSKLEAAWDNMEKYIIPVNEGDNNEEQPTMNYYNPSSPATYAAEHPYPDLYPSALTGQYPAGNDPLDAELKATYGSNETYLMHWLLDVDNWYGFGNLLNPSHTAVYVNTYQRGEQESVWETVPHPSQDNQTFGKPNEGFMSLFTKENQAPAPQWRYTNATDADARAVQAMFWARQWGYSNTNYLEKAKKMGDFLRYGMYDKYFQEIGSAADGSPSRGAGKNACHYLMAWYTAWGGGLGQYANWAWRIGASHVHQGYQNPVASYALSTAEGGLIPNSSTARSDWEKALKRQLELYTWLLSSEGAVAGGATNSWNGNYSAYPQNVSTFYEMAYTEAPVYHDPPSNNWFGMQVWPLERVAELYYIFAEKGDKSSESFHMAKHVIEKWIAYSLDYVFVGERPVTDEEGYYLNDAGERVLGGQNPQIAVQSDPGEFWIPANLEWSGQPDPWKGFDSFTGNPGLHVTTKNPSQDVGVLGSYIKTLVFFAAGTKAETGGFTALGNKAKNLAKELLDAAWSKNDGIGIAAEEEHEDYIRYFTKEIYFPNGWSGRNGQGNTIPGPNTVPSDPAKGGNGVYISHAELRPKIKNDPMWPYLENKYQTSWNPNTGKWENGLPTFVYHRFWSQVDMATAYAEYDRLIGNA 704 T 7.1E-66 Glyco_hydro_48 pdbpercent F Bacteria T 7kwt 1 A,B A,B Q7Y3F3_9CAUD PlyCB SKINVNVENVSGVQGFLFHTDGKESYGHRAFINGVEIGIKDIETVQGFQQIIPSINISKSDVEAIRKAMKK 71 T 0.087 DUF3201 pdb T Viruses T 7kww 1 A,B A,B Q7Y3F3_9CAUD PlyCB MSKINVNVENVSGVQGFLFHTDGKESYGYRAFINGVEIGIKDIETVQGFQQIIPSINISHSDVEAIRKAMKK 72 T 2.6 DUF3213 pdbhh T Viruses T 7kwy 1 A,B A,B Q7Y3F3_9CAUD PlyCB SKINVNVENVSGVQGFLFHTDGKESYGYRAFINGVEIGIKDIETVQGFQQIIPSINISKSDVEAIKKAMKK 71 T 2.5 DUF3213 pdbhh T Viruses T 7kwz 1 A,B,C,D,E A,B,C,D,E TADBP_HUMAN TDP-43 NRQLERSGRFGGNPGGFGNQGGFGNSRGGGAGLGNNQGSNMGGGMNFGAFSINPAMMAAAQAALQSSWGMMGMLASQQNQSGPSGNNQNQGNMQREPNQAFGSGNNSYSGSNSGAAIGWGSASNAGSGSGFNGGFGSSMDSKSSGWGM 148 T 0.043 Glucosaminidase pdbpssm F Eukaryota T 7kzp 2 B,J B,O FANCB_HUMAN PROTEIN FACB,FANCONI ANEMIA-ASSOCIATED POLYPEPTIDE OF 95 KDA,FAAP95 MDYKDDDDKENLYFQGGGRKLGTGSMTSKQAMSSNEQERLLCYNGEVLVFQLSKGNFADKEPTKTPILHVRRMVFDRGTKVFVQKSTGFFTIKEENSHLKIMCCNCVSDFRTGINLPYIVIEKNKKNNVFEYFLLILHSTNKFEMRLSFKLGYEMKDGLRVLNGPLILWRHVKAFFFISSQTGKVVSVSGNFSSIQWAGEIENLGMVLLGLKECCLSEEECTQEPSKSDYAIWNTKFCVYSLESQEVLSDIYIIPPAYSSVVTYVHICATEIIKNQLRISLIALTRKNQLISFQNGTPKNVCQLPFGDPCAVQLMDSGGGNLFFVVSFISNNACAVWKESFQVAAKWEKLSLVLIDDFIGSGTEQVLLLFKDSLNSDCLTSFKITDLGKINYSSEPSDCNEDDLFEDKQENRYLVVPPLETGLKVCFSSFRELRQHLLLKEKIISKSYKALINLVQGKDDNTSSAEEKECLVPLCGEEENSVHILDEKLSDNFQDSEQLVEKIWYRVIDDSLVVGVKTTSSLKLSLNDVTLSLLMDQAHDSRFRLLKCQNRVIKLSTNPFPAPYLMPCEIGLEAKRVTLTPDSKKEESFVCEHPSKKECVQIITAVTSLSPLLTFSKFCCTVLLQIMERESGNCPKDRYVVCGRVFLSLEDLSTGKYLLTFPKKKPIEHMEDLFALLAAFHKSCFQITSPGYALNSMKVWLLEHMKCEIIKEFPEVYFCERPGSFYGTLFTWKQRTPFEGILIIYSRNQTVMFQCLHNLIRILPINCFLKNLKSGSENFLIDNMAFTLEKELVTLSSLSSAIAKHESNFMQRCEVSKGKSSVVAAALSDRRENIHPYRKELQREKKKMLQTNLKVSGALYREITLKVAEVQLKSDFAAQKLSNL 884 T 0.46 DP unp F Eukaryota T 7kzw 1 A A Q5NEJ0_FRATT FTT_1639c DKYQARELPLLKHGYSKKNMTAYNMFGFCCDNTPSGIFNIMDKKPTEFLVNIYVGDNQGCKFIYAADTKGKQGEITQTGSFTAYLSGRNELLKLECKGKDSNIDYKVIAYANAIEYDRVGNLSYLVESGGL 131 T 0.11 DUF4972 unphh F Bacteria T 7l04 1 A C C0LA97_9VIRU VP1 YPKKKKARIE 10 T 27 MARCKS pdbhh T Viruses T 7l0u 1 A,AA,AB,B,BA,BB,C,CA,CB,D,DA,DB,E,EA,EB,F,FA,FB,G,GA,GB,H,HA,HB,I,IA,J,JA,K,KA,L,LA,M,MA,N,NA,O,OA,P,PA,Q,QA,R,RA,S,SA,T,TA,U,UA,V,VA,W,WA,X,XA,Y,YA,Z,ZA A,a,1,B,b,2,C,c,3,D,d,4,E,e,5,F,f,6,G,g,7,H,h,8,I,i,J,j,K,k,L,l,M,m,N,n,O,o,P,p,Q,q,R,r,S,s,T,t,U,u,V,v,W,w,X,x,Y,y,Z,z B9UYL6_HBOC2 VP2 PROTEIN GSGVGISTGGWVGGSYFTDSYVITKNTRQFLVKIQNDHKYRTENIIPSNAGGKSQRCVSTPWSYFNFNQYSSHFSPQDWQRLTNEYKRFKPRKMHVKIYNLQIKQILSNGADTTYNNDLTAGVHIFCDGEHAYPNATHPWDEDVMPELPYETWYLFQYGYIPVIHELAEMEDANAVEKAIALQIPFFMLENSDHEVLRTGESTEFTFDFDCEWINNERAYIPPGLMFNPKVPTRRAQYIRQHGNTASSNTRIQPYAKPTSWMTGPGLLSAQRVGPAGSDTASWMVVVNPDGTAVNSGMAGVGSGFDPPSGSLRPTDLEYKIQWYQTPEGTNSDGNIISNPPLSMLRDQALYRGNQTTYNLCSDVWMFPNQIWDRYPITRENPIWCKKPRSDKNTIIDPFDGTLAMDHPPGTIFIKMAKIPVPSNNNADSYLNIYCTGQVSCEIVWEVERYATKNWRPERRHTALGLGIGGEENINPTYHVDKNGKYIQPTTWDMCYPIKTNINKVL 506 T 4.2E-12 Parvo_coat pdbpssm T Viruses T 7l0w 1 A,AA,AB,B,BA,BB,C,CA,CB,D,DA,DB,E,EA,EB,F,FA,FB,G,GA,GB,H,HA,HB,I,IA,J,JA,K,KA,L,LA,M,MA,N,NA,O,OA,P,PA,Q,QA,R,RA,S,SA,T,TA,U,UA,V,VA,W,WA,X,XA,Y,YA,Z,ZA A,a,1,B,b,2,C,c,3,D,d,4,E,e,5,F,f,6,G,g,7,H,h,8,I,i,J,j,K,k,L,l,M,m,N,n,O,o,P,p,Q,q,R,r,S,s,T,t,U,u,V,v,W,w,X,x,Y,y,Z,z H9C5X6_HBOC1 VP2 GSGVGISTGGWVGGSHFSDKYVVTKNTRQFITTIQNGHLYKTEAIETTNQSGKSQRCVTTPWTYFNFNQYSCHFSPQDWQRLTNEYKRFRPKAMQVKIYNLQIKQILSNGADTTYNNDLTAGVHIFCDGEHAYPNASHPWDEDVMPDLPYKTWKLFQYGYIPIENELADLDGNAAGGNATEKALLYQMPFFLLENSDHQVLRTGESTEFTFNFDCEWVNNERAYIPPGLMFNPKVPTRRVQYIRQNGSTAASTGRIQPYSKPTSWMTGPGLLSAQRVGPQSSDTAPFMVCTNPEGTHINTGAAGFGSGFDPPSGCLAPTNLEYKLQWYQTPEGTGNNGNIIANPSLSMLRDQLLYKGNQTTYNLVGDIWMFPNQVWDRFPITRENPIWCKKPRADKHTIMDPFDGSIAMDHPPGTIFIKMAKIPVPTASNADSYLNIYCTGQVSCEIVWEVERYATKNWRPERRHTALGMSLGGESNYTPTYHVDPTGAYIQPTSYDQCMPVKTNINKVL 510 T 5.8E-14 Parvo_coat unppercent T Viruses T 7l0x 1 A,AA,AB,B,BA,BB,C,CA,CB,D,DA,DB,E,EA,EB,F,FA,FB,G,GA,GB,H,HA,HB,I,IA,J,JA,K,KA,L,LA,M,MA,N,NA,O,OA,P,PA,Q,QA,R,RA,S,SA,T,TA,U,UA,V,VA,W,WA,X,XA,Y,YA,Z,ZA A,a,1,B,b,2,C,c,3,D,d,4,E,e,5,F,f,6,G,g,7,H,h,8,I,i,J,j,K,k,L,l,M,m,N,n,O,o,P,p,Q,q,R,r,S,s,T,t,U,u,V,v,W,w,X,x,Y,y,Z,z B9UYL6_HBOC2 VP2 PROTEIN GATGSVGGGKGSGVGISTGGWVGGSYFTDSYVITKNTRQFLVKIQNDHKYRTENIIPSNAGGKSQRCVSTPWSYFNFNQYSSHFSPQDWQRLTNEYKRFKPRKMHVKIYNLQIKQILSNGADTTYNNDLTAGVHIFCDGEHAYPNATHPWDEDVMPELPYETWYLFQYGYIPVIHELAEMEDANAVEKAIALQIPFFMLENSDHEVLRTGESTEFTFDFDCEWINNERAYIPPGLMFNPKVPTRRAQYIRQHGNTASSNTRIQPYAKPTSWMTGPGLLSAQRVGPAGSDTASWMVVVNPDGTAVNSGMAGVGSGFDPPSGSLRPTDLEYKIQWYQTPEGTNSDGNIISNPPLSMLRDQALYRGNQTTYNLCSDVWMFPNQIWDRYPITRENPIWCKKPRSDKNTIIDPFDGTLAMDHPPGTIFIKMAKIPVPSNNNADSYLNIYCTGQVSCEIVWEVERYATKNWRPERRHTALGLGIGGEENINPTYHVDKNGKYIQPTTWDMCYPIKTNINKVL 516 T 8.4E-11 Parvo_coat unppercent T Viruses T 7l0y 1 A,AA,AB,B,BA,BB,C,CA,CB,D,DA,DB,E,EA,EB,F,FA,FB,G,GA,GB,H,HA,HB,I,IA,J,JA,K,KA,L,LA,M,MA,N,NA,O,OA,P,PA,Q,QA,R,RA,S,SA,T,TA,U,UA,V,VA,W,WA,X,XA,Y,YA,Z,ZA A,a,1,B,b,2,C,c,3,D,d,4,E,e,5,F,f,6,G,g,7,H,h,8,I,i,J,j,K,k,L,l,M,m,N,n,O,o,P,p,Q,q,R,r,S,s,T,t,U,u,V,v,W,w,X,x,Y,y,Z,z H9C5X6_HBOC1 VP2 GTGSIGGGKGSGVGISTGGWVGGSHFSDKYVVTKNTRQFITTIQNGHLYKTEAIETTNQSGKSQRCVTTPWTYFNFNQYSCHFSPQDWQRLTNEYKRFRPKAMQVKIYNLQIKQILSNGADTTYNNDLTAGVHIFCDGEHAYPNASHPWDEDVMPDLPYKTWKLFQYGYIPIENELADLDGNAAGGNATEKALLYQMPFFLLENSDHQVLRTGESTEFTFNFDCEWVNNERAYIPPGLMFNPKVPTRRVQYIRQNGSTAASTGRIQPYSKPTSWMTGPGLLSAQRVGPQSSDTAPFMVCTNPEGTHINTGAAGFGSGFDPPSGCLAPTNLEYKLQWYQTPEGTGNNGNIIANPSLSMLRDQLLYKGNQTTYNLVGDIWMFPNQVWDRFPITRENPIWCKKPRADKHTIMDPFDGSIAMDHPPGTIFIKMAKIPVPTASNADSYLNIYCTGQVSCEIVWEVERYATKNWRPERRHTALGMSLGGESNYTPTYHVDPTGAYIQPTSYDQCMPVKTNINKVL 519 T 8.2E-15 Parvo_coat pdbpercent T Viruses T 7l1b 3 C C PK3CA_HUMAN PHOSPHATIDYLINOSITOL 4,5-BISPHOSPHATE 3-KINASE CATALYTIC SUBUNIT ALPHA ISOFORM,PTDINS-3-KINASE SUBUNIT ALPHA,PHOSPHATIDYLINOSITOL 4,5-BISPHOSPHATE 3-KINASE 110 KDA CATALYTIC SUBUNIT ALPHA,P110ALPHA,PHOSPHOINOSITIDE-3-KINASE CATALYTIC ALPHA POLYPEPTIDE,SERINE/THREONINE PROTEIN KINASE PIK3CA AHHGGWTTK 9 T 0.029 Prion_octapep pdbhh F Eukaryota T 7l1c 3 C C PK3CA_HUMAN PHOSPHATIDYLINOSITOL 4,5-BISPHOSPHATE 3-KINASE CATALYTIC SUBUNIT ALPHA ISOFORM,PTDINS-3-KINASE SUBUNIT ALPHA,PHOSPHATIDYLINOSITOL 4,5-BISPHOSPHATE 3-KINASE 110 KDA CATALYTIC SUBUNIT ALPHA,P110ALPHA,PHOSPHOINOSITIDE-3-KINASE CATALYTIC ALPHA POLYPEPTIDE,SERINE/THREONINE PROTEIN KINASE PIK3CA ALHGGWTTK 9 T 3.1 CFC pdbhh F Eukaryota T 7l2m 2 E,F E,F DKTX_CYRSC TAU-TRTX-HS1A,DOUBLE-KNOT TOXIN,DKTX MDCAKEGEVCSWGKKCCDLDNFYCPMEFIPHCKKYKPYVPVTTNCAKEGEVCGWGSKCCHGLDCPLAFIPYCEKYR 76 T 0.079 Conotoxin_I2 unp F Eukaryota T 7l33 1 A,B,C A,B,C Cu-3SCC XGIAAIKQEHAAIKQEIAAIKQEIAAIKWEGX 32 T 0.016 DivIC pdbpssm F T 7l4z 2 F,G,H,I,J S,T,R,U,V ACE-DTY-LYS-ALA-GLY-VAL-VAL-TYR-GLY-TYR-ASN-ALA-TRP-ILE-ARG-CYS-NH2 XXKAGVVYGYNAWIRCX 17 T 2.1 DUF3212 pdbhh F T 7l51 1 A,B A,B Cyclic plant protein PDP-23 GFCWHHSCVPSGTCADFPWPLGHQCFPD 28 T 0.46 DUF5763 pdbhh F T 7l6g 1 A,B,C,D,E,F A,B,C,D,E,F A0A2D2CY67_METTR Metallo-mystery pair system four-Cys motif protein AGVKTQPVAVRFALVADGKEVGCGAPLANLGSGRLAGKLHEARLYVYGFELVDAKGKHTPIALTQNDWQYADVALLDFKDARGGNAACTPGNPAKNTTVVGAAPQGAYVGLAFSVGAPVESLVDGKPVFVNHSNVEAAPPPLDISGMAXNWQAGRRFVTIEVIPPAAVIKPDGSKSRTWMVHVGSTGCKGNPATGEIVACAHENRFPVVFDRFDPKTQRVELDLTTLFESSDISVDKGGAVGCMSALDDPDCPAVFRALGLNLADSAPGANDAGKPSRPGVSPIFSVGAAASKVAGGKQ 299 T 0.79 DUF4382 pdbhh F Bacteria T 7l6o 1 A,C,E a,c,e A0A1W6IPB2_9HIV1 ENVELOPE GLYCOPROTEIN GP120 AENLWVTVYYGVPVWKEAKTTLFCASDARAYEKEVHNVWATHACVPTDPSPQELVLGNVTENFNMWKNDMVDQMHEDIISLWDQSLKPCVKLTPLCVTLICSNATVKNGTVEEMKNCSFNTTTEIRDKEKKEYALFYKPDIVPLSETNNTSEYRLINCNTSACTQACPKVTFEPIPIHYCAPAGYAILKCNDETFNGTGPCSNVSTVQCTHGIRPVVSTQLLLNGSLAEKEIVIRSENLTNNAKIIIVHLHTPVEIVCTRPNNNTRKSVRIGPGQTFYATGDIIGDIKQAHCNISEEKWNDTLQKVGIELQKHFPNKTIKYNQSAGGDMEITTHSFNCGGEFFYCNTSNLFNGTYNGTYISTNSSANSTSTITLQCRIKQIINMWQGVGRCMYAPPIAGNITCRSNITGLLLTRDGGTNSNETETFRPAGGDMRDNWRSELYKYKVVKIEPLGVAPTRCKRRVVGR 466 T 3.5E-53 GP120 pdbpssm T Viruses T 7l7a 1 A A NuxVA GCCPAPLTCHCVIY 14 T 3 US10 pdbhh F T 7l7t 1 A,C,E A,C,E Q2N0S6_9HIV1 BG505 SOSIP.v5.2(7S) - gp120 MKRGLCCVLLLCGAVFVSPSQEIHARFRRGARAENLWVTVYYGVPVWKDAETTLFCASDAKAYETKKHNVWATHCCVPTDPNPQEIHLENVTEEFNMWKNNMVEQMHTDIISLWDQSLKPCVKLTPLCVTLQCTNVTNNITDDMRGELKNCSFNMTTELRDKKQKVYSLFYRLDVVQINENQGNRSNNSNKEYRLINCNTSAITQACPKVSFEPIPIHYCAPAGFAILKCKDKKFNGTGPCTNVSTVQCTHGIKPVVSTQLLLNGSLAEEEVIIRSENITNNAKNILVQLNESVQINCTRPNNNTVKSIRIGPGQWFYYTGDIIGDIRQAHCNVSKATWNETLGKVVKQLRKHFGNNTIIRFANSSGGDLEVTTHSFNCGGEFFYCNTSGLFNSTWISNTSVQGSNSTGSNDSITLPCRIKQIINMWQRIGQAMYAPPIQGVIRCVSNITGLILTRDGGSTNSTTETFRPGGGDMRDNWRSELYKYKVVKIEPLGVAPTRCKRRV 505 T 4.1E-54 GP120 pdbpercent T Viruses T 7l86 3 C,E,G E,C,A Q2N0S6_9HIV1 BG505 SOSIP MD39 - gp120 MGILPSPGMPALLSLVSLLMGCVAETGAENLWVTVYYGVPVWKDAETTLFCASDAKAYETKKHNVWATHCCVPTDPNPQEIHLENVTEEFNMWKNNMVEQMHEDIISLWDQSLKPCVKLTPLCVTLQCTNVTNNITDDMRGELKNCSFNMTTELRDKKQKVYSLFYRLDVVQINENQGNRSNNSNKEYRLINCNTSAITQACPKVSFEPIPIHYCAPAGFAILKCKDKKFNGTGPCPSVSTVQCTHGIKPVVSTQLLLNGSLAEEEVIIRSENITNNAKNILVQLNTPVQINCTRPNNNTVKSIRIGPGQWFYYTGDIIGDIRQAHCNVSKATWNETLGKVVKQLRKHFGNNTIIRFAQSSGGDLEVTTHSFNCGGEFFYCNTSGLFNSTWISNTSVQGSNSTGSNDSITLPCRIKQIINMWQRIGQAMYAPPIQGVIRCVSNITGLILTRDGGSTNSTTETFRPGGGDMRDNWRSELYKYKVVKIEPLGVAPTRCKRRV 500 T 3.9999999999999995E-54 GP120 pdbpercent T Viruses T 7l87 3 C,E,G C,A,D Q2N0S6_9HIV1 BG505 SOSIP MD39 - gp120 MGILPSPGMPALLSLVSLLMGCVAETGAENLWVTVYYGVPVWKDAETTLFCASDAKAYETKKHNVWATHCCVPTDPNPQEIHLENVTEEFNMWKNNMVEQMHEDIISLWDQSLKPCVKLTPLCVTLQCTNVTNNITDDMRGELKNCSFNMTTELRDKKQKVYSLFYRLDVVQINENQGNRSNNSNKEYRLINCNTSAITQACPKVSFEPIPIHYCAPAGFAILKCKDKKFNGTGPCPSVSTVQCTHGIKPVVSTQLLLNGSLAEEEVIIRSENITNNAKNILVQLNTPVQINCTRPNNNTVKSIRIGPGQWFYYTGDIIGDIRQAHCNVSKATWNETLGKVVKQLRKHFGNNTIIRFAQSSGGDLEVTTHSFNCGGEFFYCNTSGLFNSTWISNTSVQGSNSTGSNDSITLPCRIKQIINMWQRIGQAMYAPPIQGVIRCVSNITGLILTRDGGSTNSTTETFRPGGGDMRDNWRSELYKYKVVKIEPLGVAPTRCKR 498 T 3.9E-54 GP120 pdbpercent T Viruses T 7l8a 1 A,E,G E,A,C Q2N0S6_9HIV1 BG505 SOSIP MD39 - gp120 NLWVTVYYGVPVWKDAETTLFCASDAKAYETEKHNVWATHACVPTDPNPQEIHLENVTEEFNMWKNNMVEQMHEDIISLWDQSLKPCVKLTPLCVTLQCTNVTNNITDDMRGELKNCSFNMTTELRDKKQKVYSLFYRLDVVQINENQGNRSNNSNKEYRLINCNTSAITQACPKVSFEPIPIHYCAPAGFAILKCKDKKFNGTGPCPSVSTVQCTHGIKPVVSTQLLLNGSLAEEEVIIRSENITNNAKNILVQLNTPVQINCTRPNNNTVKSIRIGPGQAFYYTGDIIGDIRQAHCNVSKATWNETLGKVVKQLRKHFGNNTIIRFAQSSGGDLEVTTHSFNCGGEFFYCNTSGLFNSTWISNTSVQGSNSTGSNDSITLPCRIKQIINMWQRIGQAMYAPPIQGVIRCVSNITGLILTRDGGSTNSTTETFRPGGGDMRDNWRSELYKYKVVKIEPLGVAPTRCKR 469 T 3.5E-54 GP120 pdbpercent T Viruses T 7l8t 1 A,C,E A,C,E Q2N0S6_9HIV1 BG505 SOSIP.v5.2 N241/N289 - gp120 MKRGLCCVLLLCGAVFVSPSQEIHARFRRGARAENLWVTVYYGVPVWKDAETTLFCASDAKAYETKKHNVWATHCCVPTDPNPQEIHLENVTEEFNMWKNNMVEQMHTDIISLWDQSLKPCVKLTPLCVTLQCTNVTNNITDDMRGELKNCSFNMTTELRDKKQKVYSLFYRLDVVQINENQGNRSNNSNKEYRLINCNTSAITQACPKVSFEPIPIHYCAPAGFAILKCKDKKFNGTGPCTNVSTVQCTHGIKPVVSTQLLLNGSLAEEEVIIRSENITNNAKNILVQLNESVQINCTRPNNNTRKSIRIGPGQWFYATGDIIGDIRQAHCNVSKATWNETLGKVVKQLRKHFGNNTIIRFANSSGGDLEVTTHSFNCGGEFFYCNTSGLFNSTWISNTSVQGSNSTGSNDSITLPCRIKQIINMWQRIGQAMYAPPIQGVIRCVSNITGLILTRDGGSTNSTTETFRPGGGDMRDNWRSELYKYKVVKIEPLGVAPTRCKR 503 T 1.5999999999999998E-49 GP120 unp T Viruses T 7lc0 1 A,B,C,D,E,F,G,H A,B,C,D,E,F,G,H Q8ZL27_SALTY Putative selenocysteine synthase (L-seryl-tRNA(Ser) selenium transferase) MTPNIYQQLGLKKVINACGKMTILGVSSVAPEVMQATARAASAFVEIDALVEKTGELVSRYTGAEDSYITSCASAGIAIAVAAAITHGDRARVALMPDSSGMANEVVMLRGHNVDYGAPVTSAIRLGGGRIVEVGSSNLATRWQLESAINEKTAALLYVKSHHCVQKGMLSIDDFVQVAQANHLPLIVDAAAEEDLRGWVASGADMVIYSGAKAFNAPTSGFITGRKTWIAACKAQHQGIARAMKIGKENMVGLVYALENYHQGQTTVTAAQLQPVAEAISAIHGLYADIEQDEAGRAIWRIRVRVNASELGLNAQDVEAQLRGGEIAIYARKYQLHQGVFSLDPRTVAEGEMALIVARLREIAEHAAD 369 T 1.4 SelA pdbpssm F Bacteria T 7lc2 2 C,D D,E SIN1_HUMAN TORC2 SUBUNIT MAPKAP1,MITOGEN-ACTIVATED PROTEIN KINASE 2-ASSOCIATED PROTEIN 1,STRESS-ACTIVATED MAP KINASE-INTERACTING PROTEIN 1,MSIN1 GSKESLFVRINAAHGFSLIQVDNTKVTMKEILLKAVKRRKGSQKVSGPQYRLEKQSEPNVAVDLDSTLESQSAWEFCLVRENSSRADG 88 T 0.051 E3_UbLigase_RBR unp F Eukaryota T 7lcw 1 A A A0A1H8GYX0_9BACL Lasso Peptide Lihuanodin GSKYSDTADESSYRW 15 T 0.015 DUF5972 unppercent F Bacteria T 7ldf 1 A A Minimal thioredoxin fold protein, ems_thioM_802 MDEVKVHVGDDQFEEVSREIKKAGWKVEVHKHPSNTSQVTVTKGNKQWTFKDPKQAVEFVQKSLEHHHHHH 71 T 0.077 AvrPtoB-E3_ubiq pdb F T 7ldg 1 A,C A,C HSF2B_HUMAN MEILB2 SARLETVQADNIREKKEKLALRQQLNEAKQQLLQQAEYCTEMGAAACTLLWGVSSSEEVVKAILGGDKALKFFSITGQTMESFVKSLDGDVQELDSDESQFVFALAGIVTNVAAIACGREFLVNSSRVLLDTILQLLGDLKPGQCTKLKVLMLMSLYNVSINLKGLKYISESPGFIPLLWWLLSDPDAEVCLHVLRLVQSVVLEPEVFSKSASEFRSSLPLQRILAMSKSRNPRLQTAAQELLEDLRTLEHNV 253 T 0.0026 KAP unphh F Eukaryota T 7ldg 2 B,D B,D BRCA2_HUMAN FANCONI ANEMIA GROUP D1 PROTEIN MKRRGEPLILVGEPSIKRNLLNEFDRIIENQEKSLKASKSTPDGTIKDRRLFMHHVSLEPITCVPF 66 T 5.3 TFIIA_gamma_N pdbhh F Eukaryota T 7leq 2 B B NFKB1_HUMAN DNA-BINDING FACTOR KBF1,EBP-1,NUCLEAR FACTOR OF KAPPA LIGHT POLYPEPTIDE GENE ENHANCER IN B-CELLS 1 DKEEVQRKRQKLMP 14 T 0.63 ETAA1 pdbhh F Eukaryota T 7let 3 C C TF65_HUMAN NUCLEAR FACTOR NF-KAPPA-B P65 SUBUNIT,NUCLEAR FACTOR OF KAPPA LIGHT POLYPEPTIDE GENE ENHANCER IN B-CELLS 3 DRHRIEEKRKRTYETFKSIMKK 22 T 0.32 Fimbrial_CS1 unp F Eukaryota T 7lfz 3 C C R1AB_SARS2 ORF1ab IPRRNVATL 9 T 0.79 TbpB_A pdbhh T Viruses T 7lhy 2 B B H3_CAEEL H3(7-20)K14ac ARKSTGGXAPRKQL 14 T 0.12 Sirohm_synth_M pdbpercent F Eukaryota T 7lki 3 C,F CCC,FFF Epitope III peptide GLY-ALA-PRO-THR-TYR-SER-TRP-GLY DRSGAPTYSWGANDK 15 T 5.7E-05 HCV_NS1 pdbhh F T 7lky 2 I,J,K,L,M,N,O,P J,I,K,L,M,N,O,P Peptidomimetic inhibitor UNC6641 GGVXKPLR 8 T 24 DUF104 pdbhh F T 7ll7 2 B B VAL-CYS-ILE-GLY-THR-PRO-ILE-SER-PHE-TYR-CYS VCIGTPISFYC 11 T 0.41 KSHV_K1 pdbhh F T 7llk 1 A,E,I E,A,I O55774_9HIV1 Envelope glycoprotein gp120 AENLWVTVYYGVPVWRDADTTLFCASDAKAYETEKHNVWATHACVPTDPNPQEIHLDNVTEKFNMWKNNMVEQMHTDIISLWDQSLKPCVKLTPLCVTLHCTNVTSVNTTGDREGLKNCSFNMTTELRDKRQKVYSLFYRLDIVPINENQGSEYRLINCNTSACTQACPKVSFEPIPIHYCAPAGFAILKCKDEGFNGTGLCKNVSTVQCTHGIKPVVSTQLLLNGSLAEKNITIRSENITNNAKIIIVQLVQPVTIKCIRPNNNTRKSIRIGPGQAFYATGDIIGDIRQAHCNVSRSRWNKTLQEVAEKLRTYFGNKTIIFANSSGGDLEITTHSFNCGGEFFYCNTSGLFNSTWYVNSTWNDTDSTQESNDTITLPCRIKQIINMWQRAGQCMYAPPIPGVIKCESNITGLLLTRDGGKDNNVNETFRPGGGDMRDNWRSELYKYKVVKIEPLGVAPTRCKRRVVERRRRRR 474 T 9.7E-54 GP120 pdbpssm T Viruses T 7lma 6 F G TAP50_TETTS P50 MKLLLQNQNIFQKLKNTLNGCIKKFYDTYQDLEQMQKFEMIVEDKLLFRYSCSQSEMFSAQIQAHYLEKRVLQLTDGNVKYIVNFRDKGVLDKANFFDTPNNSLVIIRQWSYEIYYTKNTFQINLVIDEMRCIDIITTIFYCKLELDFTQGIKGISKSSSFSNQIYEYSAQYYKAIQLLKKLLINDSYISELYNSTKSKQQPRLFIFQSFKPKMNLAEQNLSRQFEQCQQDDFGDGCLLQIVNYTHQSLKQIENKNNSNQIVNGQNEISKKKRVLKSNEDLYKISLQKQLKIFQEEEIELHSQSTIRNQTNQQLETFESDTSKRNSEKILHSINELNTSKQKVNQMNSSQHQIQKLENNNLNKNILNQINENDIKNELEERQQQHLTQSFNSKAQLKKIITLKKNQDILLFKPQEQEGSKKY 422 T 0.093 DUF6235 pdb F Eukaryota T 7lmk 2 E,F,G,H F,G,H,I H4_HUMAN Histone H4 GAKRHRKVLRDNY 13 T 0.27 UPF0137 unp F Eukaryota T 7lmv 1 A,B,C,D,E,F,G,H,I,J,K,L A,B,C,D,E,F,G,H,I,J,K,L Integrin inhibitor AVVRFVFRGDLAELMLRAVKDHLKKEGPHWNITSRGNELVVRGIHESDAKRIQKEFPSVQSTIQAAAAAA 70 T 0.0011 NIR_SIR_ferr pdb F T 7lmx 1 A,B,C A,B,C Integrin inhibitor STKCVVRFVFRGDLATLMLRAVKDHLKKEGPHWNITSTNNGAELVVRGIHESDAKRIAKWVEKRFPGVHTETQCD 75 T 0.17 Pox_H7 pdb F T 7lo0 2 I,J,K,L,M,N,O,P,Q,R,S,T,U,V,W,X I,J,K,L,M,N,O,P,Q,R,T,U,V,W,X,Y TLK2_HUMAN HSHPK,PKU-ALPHA,TOUSLED-LIKE KINASE 2 EELHSLDPRRQELLEARFTGV 21 T 47 Plasmid_stab_B pdbhh F Eukaryota T 7lp2 2 B,D,F B,D,F AMOT_HUMAN Angiomotin MEHRGPPPEYPFKGM 15 T 1.2 GvpL_GvpF pdbhh F Eukaryota T 7lp3 2 B,D B,D AMOT_HUMAN Angiomotin EHRGPPPEYPFKGM 14 T 0.95 GvpL_GvpF pdbhh F Eukaryota T 7lq2 1 A,B,C,D,E,F B,C,F,E,D,A A0A023X3Z4_9ACTN RR RsiG GSHMRESAEEVWGGTEDLTSLSVEELKGLMARFDEEEKRISYRRRVMQGRIDVIRAEIVRRGGAVLSPEELARVLMGDVGDESE 84 T 0.019 DUF1192 pdb F Bacteria T 7lq3 1 A,B,C B,A,D A0A023X3Z4_9ACTN RsiG GSHMARESAEEVWGGTEDLTSLSVEELKGLMARFDEEEKRISYRRRVMQGRIDVIRAEIVRRGGAVLSPEELARVLMGDVGDESE 85 T 0.02 DUF1192 pdb F Bacteria T 7lq4 1 A,B D,T A0A023X3Z4_9ACTN RsiG MGEETYEGTSGREGGRHEEEVETRAARESAEEVWGGTEDLTSLSVEELKGLLARFDEEEKRISYRRRVIQGRIDVIRAEIVRRGGAVLSPEELARVLMGDVGDESEGGASGDRRGDGA 118 T 0.021 KfrA_N pdbpercent F Bacteria T 7lqs 1 A A Alpha-conotoxin CIC CCSNPACQVQHSDLC 15 T 0.00012 Toxin_8 pdbhh F T 7lrw 1 A A Hact-2 CAPECRSFCPDQKCLKDCGCI 21 T 0.59 C_tripleX pdbhh F T 7lso 1 A A A0A0U1U1Y3_BOAPU L-Phenylseptin peptide FFFDTLKNLAGKVIGALT 18 T 0.091 TRM13 unp F Eukaryota T 7lsp 1 A A A0A0U1U1Y3_BOAPU D-Phenylseptin FXFDTLKNLAGKVIGALT 18 T 0.091 TRM13 unp F Eukaryota T 7lsv 1 A,B A,B Q5ZTM4_LEGPH Calmodulin-dependent protein kinase SNAELESEALGLQAYKNQMSKQQLLGEIQGFKENYWNMKDLLTLTNRHHLRVFLEYLDNICSAFKDDKTDEKSARAAYDFLNAQINKLFEDNSKNSKPSFESFSEDVQRFLIHIDTYLMKNPSACSNSIASTIQLLKQLDNKKSFNPEQSFKDFCSYKEITIQLLLKPFETPVAEMAS 178 T 0.016 Radial_spoke pdbpssm F Bacteria T 7lt7 1 A A Hact-3 FNPVGVAFKGNNGKYLSRIHRSGIDYTEFAKDNTD 35 T 0.093 Agglutinin pdbhh F T 7lu9 6 N,P,R d,e,f M4M097_9HIV1 CH505 GP120 ENLWVTVYYGVPVWKEAKTTLFCASDAKAYEKEVHNVWATHACVPTDPNPQEMVLKNVTENFNMWKNDMVDQMHEDVISLWDQSLKPCVKLTPLCVTLNCTNATASNSSIIEGMKNCSFNITTELRDKREKKNALFYKLDIVQLDGNSSQYRLINCNTSVITQACPKVSFDPIPIHYCAPAGYAILKCNNKTFTGTGPCNNVSTVQCTHGIKPVVSTQLLLNGSLAEGEIIIRSENITNNVKTIIVHLNESVKIECTRPNNKTRTSIRIGPGQWFYATGQVIGDIREAYCNINESKWNETLQRVSKKLKEYFPHKNITFQPSSGGDLEITTHSFNCGGEFFYCNTSSLFNRTYMANSTDMANSTETNSTRTITIHCRIKQIINMWQEVGRAMYAPPIAGNITCISNITGLLLTRDGGKNNTETFRPGGGNMKDNWRSELYKYKVVKIEPLGVAPTRCKRRV 461 T 8.4E-53 GP120 pdbpssm T Viruses T 7luu 1 A A A0A1L5BQA7_SPHIB Subclass B3 metallo-beta-lactamase MIATMTIAASLAISPAAAATGPEPEAMAAMDRAGGARASDDPLTRPMAVERAKEWLAPLPPERVFGNSYLVGFAGLSVALIDTGAGLVLIDGALPQAAPMILSNVRKLGFDPRDIKFILSTEPHYDHAGGIAALARDTGATVVASRRGAEGLRAGAHAKDDPQFDYGGAWPAVSRLRVMKDGEVLRIGRASITAHATPGHTMGSMTWSWNACEGKRCKAIVFASSLNPVSADRYRFTAPSSAPIVKGFEASYRRMGALKCDILISAHPDNAGAGRYGSGSGACRSYAERSRRLLAKRLAEERRETSK 307 T 0.53 Lactamase_B pdbpssm F Bacteria T 7lw7 1 A A EXO5_HUMAN EXO V, HEXO5, DEFECTS IN MORPHOLOGY PROTEIN 1 HOMOLOG SNALEDAQESKALVNMPGPSSESLGKDDKPISLQNWKRGLDILSPMERFHLKYLYVTDLATQNWCELQTAYGKELPGFLAPEKAAVLDTGASIHLARELELHDLVTVPVTTKEDAWAIKFLNILLLIPTLQSEGHIREFPVFGEVEGVLLVGVIDELHYTAKGELELAELKTRRRPMLPLEAQKKKDCFQVSLYKYIFDAMVQGKVTPASLIHHTKLCLEKPLGPSVLRHAQQGGFSVKSLGDLMELVFLSLTLSDLPVIDILKIEYIHQETATVLGTEIVAFKEKEVRAKVQHYMAYWMGHREPQGVDVEEAWKCRTCTYADICEWRKGSGVLSSTLAPQVKKAK 346 T 2.4999999999999997E-43 Exo5 pdbpercent F Eukaryota T 7lwa 1 A A EXO5_HUMAN EXO V, HEXO5, DEFECTS IN MORPHOLOGY PROTEIN 1 HOMOLOG SNALEDAQESKALVNMPGPSSESLGKDDKPISLQNWKRGLDILSPMERFHLKYLYVTDLAEQNWCELQTAYGKELPGFLAPEKAAVLDTGASIHLARELELHDLVTVPVTTKEDAWAIKFLNILLLIPTLQSEGHIREFPVFGEVEGVLLVGVIDELHYTAKGELELAELKTRRRPMLPLEAQKKKDCFQVSLYKYIFDAMVQGKVTPASLIHHTKLCLEKPLGPSVLRHAQQGGFSVKSLGDLMELVFLSLTLSDLPVIDILKIEYIHQETATVLGTEIVAFKEKEVRAKVQHYMAYWMGHREPQGVDVEEAWKCRTCTYADICEWRKGSGVLSSTLAPQVKKAK 346 T 1.6999999999999998E-43 Exo5 pdbpercent F Eukaryota T 7lwh 2 B B LATS1_HUMAN LARGE TUMOR SUPPRESSOR HOMOLOG 1,WARTS PROTEIN KINASE,H-WARTS PKFGTHHKALQEIRNSLLPFANE 23 T 0.049 EcoEI_R_C unp F Eukaryota T 7lwy 1 A,B B,A Q9JE95_9VIRU Capsid protein SNPRLTKVLDEMSKKPCVNINEIRKMIRNFQPQFIQPRNGNRPNAQPRTVDSFEWVVRIQSTVETQLLGATNTVPQQTLNLDISFTDDSTTITPASIPGSISMLDNSRHIPAIQSMIQNFKARYLGSLQDTAQLQSPQYPQLLAYLFGQLIAIKDRLDLFRPSNPLSLADALFGFTLAQNARPRYDDHRHAKACQGPLVIPAATNSDCGPCGFVQINANQGLTLPLGACLFVNPETVNDQSFQDFLWLIFATHHRMPNQMQNNWPFSLNIVSTCAAPGRQAPHAGELTDERVRLALDTGHRILLSMFNDDEETLRYYQRKGIETMFRPCCFYTEGGLLRKATRYVSMVPLNGLYYYNGATSYVVSPIHTDAHPGITAAIESFVDIMVLQAVFSFSGPKVVAAKVNASQIDAAMVFGPAVAEGDGFVYDPLRPAPPLSAFYTEFIHRPAEQRIFQMAMSQIYGSHAPLIIANVINSIHNCKTKIVNNKLRATFVRRPPGAPHLKADTAIINRFHDPELAYALGILADGIAPLDGSHEYNVLDELDYLFNGGDIRNCFGLNALNTRGLGQIVHIRPKREPGKRPRRGFYTTLDGQVHPVTQDAPLDEIYHWRDHGNLTRPYSCHILDSQGLEFADVSNGRSRGKILVVVNSPLKTCAAYQGPSFAPK 665 T 0.0092 HEPN_Swt1 pdbpercent T Viruses T 7lx4 1 A A Hact-SCRiP1 QSEFCGHDVGECVPPKLVCRPPTHECLHFPCPGYLKCCCYP 41 T 2.5 CLIP_SPH_mas pdbhh F T 7lxk 1 A A ALL12_ARAHY ALLERGEN ARA H I GKSSPYQKKTENPCAQRCLQSCQQEPDDLKQKACESRCTKLEYDPRCVYDPRGHTGTTN 59 T 0.27 PAN_2 pdbpercent F Eukaryota T 7lxm 1 A,E,I A,C,E HIV-1 Env glycoprotein gp120 MDAMKRGLCCVLLLCGAVFVSPSQEIHARFRRGARAENLWVTVYYGVPVWKDAETTLFCASDAKAYDTEKRNVWATHCCVPTDPNPQEIVLENVTENFNMWKNNMVEQMHTDIISLWDQSLKPCVKLTPLCVTLNCTDVNATNNTTNNEEIKNCSFNITTELRDKKKKVYALFYKLDVVPIDDNNSYRLINCNTSAITQACPKVSFEPIPIHYCAPAGFAILKCNDKKFNGTGPCKNVSTVQCTHGIKPVVSTQLLLNGSLAEEEIIIRSENITNNAKTIIVQLNESVEINCTRPNNNTRKSIRIGPGQWFYATGDIIGDIRQAHCNISRTKWNKTLQQVAKKLREHFNKTIIFNPSSGGDLEITTHSFNCGGEFFYCNTSELFNSTWNGTNNTITLPCRIKQIINMWQRVGQAMYAPPIEGKIRCTSNITGLLLTRDGGNNNTETFRPGGGDMRDNWRSELYKYKVVKIEPLGVAPTRCKRRVVERRRRRRA 493 T 9.1E-54 GP120 pdbpssm F T 7lxn 1 A,E,I A,C,E HIV-1 Env glycoprotein gp120 MDAMKRGLCCVLLLCGAVFVSPSQEIHARFRRGARAENLWVTVYYGVPVWKDAETTLFCASDAKAYDTEKRNVWATHCCVPTDPNPQEIVLENVTENFNMWKNNMVEQMHTDIISLWDQSLKPCVKLTPLCVTLNCTDVNATNNTTNNEEIKNCSFNITTELRDKKKKVYALFYKLDVVPIDDNNSYRLINCNTSAITQACPKVSFEPIPIHYCAPAGFAILKCNDKKFNGTGPCKNVSTVQCTHGIKPVVSTQLLLNGSLAEEEIIIRSENITNNAKTIIVQLNESVEINCTRPNNNTRKSIRIGPGQWFYATGDIIGDIRQAHCNISRTKWNKTLQQVAKKLREHFNKTIIFNPSSGGDLEITTHSFNCGGEFFYCNTSELFNSTWNGTNNTITLPCRIKQIINMWQRVGQAMYAPPIEGKIRCTSNITGLLLTRDGGNNNTETFRPGGGDMRDNWRSELYKYKVVKIEPLGVAPTRCKRRVVERRRRRR 492 T 9.1E-54 GP120 pdbpssm F T 7ly9 3 C,G,K G,D,K A0A0N9FF17_9HIV1 Envelope glycoprotein gp120 GLWVTVYYGVPVWREAKTTLFCASDAKSYEKEVHNVWATHACVPTDPNPQELVLENVTENFNMWKNDMVDQMHEDIISLWDQSLKPCVKLTPLCVTLNCSDAKVNATYKGTREEIKNCSFKATTELRDKKRREYALFYRLDIVPLSGEGNNNSEYRLINCNTSVITQICPKVTFDPIPIHYCAPAGYAILKCNNKTFNGTGPCNNVSTVQCTHGIKPVVSTQLLLNGSLAEEEIIIRSENLTDNVKTIIVHLNESVEITCTRPNNMTRKSVRIGPGQTFYALGDIIGDIRQPHCNISEIKWEKTLQRVSEKLREHFNKTIIFNQSSGGDLEITTHSFNCGGEFFYCNTSDLFFNKTFNETYSTGSNSTNSTITLPCRIKQIINMWQEVGRAMYAPPIAGNITCKSNITGLLLTRDGGGNNSTKETFRPGGGNMRDNWRSELYKYKVVEVKPLGIAPTECNRTVVQRRRRRR 471 T 3.2999999999999996E-53 GP120 pdbpssm T Viruses T 7lzj 1 A,B,C A,B,C A0A3S7W7I3_9CAUD Depolymerase MALYREGKAAMAADGTVTGTGTKWQSSLSLIRPGATIMFLSSPIQMAVVNKVVSDTEIKAITTSGAVVASTDYAILLSDSLTVDGLAQDVAETLRHYQSQETVIADAVEFFKSFDFDSLQNLANQIKADSESAESSAAAAAASESKAKTSEDNAKSSENAAKNSEVAAETTRDQIQQIIDNAGDQSTLVVLAQPDGFDSIGRVSSFAALRNLKPKKSGQHVLLTSYYDGWAAENKMPTGGGEFISSIGTATDDGGYIAAGPGYYWTRVVNNNSFTAEDFGCKTTATPPPNFNVLPAELFDNTAMMQAAFNLAISKSFKLNLSTGTYYFESSDTLRITGPIHIEGRPGTVFYHNPSNKANPKTDAFMNISGCSAGRISSINCFSNSYLGKGINFDRSVGDNRKLVLEHVYVDTFRWGFYVGEPECINQIEFHSCRAQSNYFQGIFIESFKEGQEYGHSAPVHFFNTICNGNGPTSFALGATYKTTKNEYIKVMDSVNDVGCQAYFQGLSNVQYIGGQLSGHGSPRNTSLATITQCNSFIIYGTDLEDINGFTTDGTAITADNIDAIESNYLKDISGAAIVVSSCPGFKIDSPHIFKIKTLSTIKLMNNTYNYEIGGFTPDEALKYNVWDANGLATNRISGVIHPRLVNSRLGINSVAFDNMSNKLDVSSLIHNETSQIVGLTPSTGSNVPHTRKMWSNGAMYSSTDLNNGFRLNYLSNHNEPLTPMHLYNEFSVSEFGGSVTESNALDEIKYIFIQTTYANSGDGRFIIQALDASGSVLSSNWYSPQSFNSTFPISGFVRFDVPTGAKKIRYGFVNSANYTGSLRSHFMSGFAYNKRFFLKIYAVYNDLGRYGQFEPPYSVAIDRFRVGDNTTQMPSIPASSATDVAGVNEVINSLLASLKANGFMSS 907 T 0.0099 Pectate_lyase_3 pdbhh T Viruses T 7m0q 1 A,B A,B Network hallucinated protein 0738_mod MGSSHHHHHHSSGLVPRGSHMNIQVSLQWEDPKKGKVFSHTVNIPPGGTAEQIADNILDMARSLQDEGWDKLTVQVTVNPGFPKETAMRVAAALKEAFEDRGLRLTSIETSGNSIHLKFRY 121 T 0.012 Pilus_CpaD pdb F T 7m10 2 B B VRK1_HUMAN VACCINIA-RELATED KINASE 1 PRVKAAQAGRQS 12 T 34 KGG pdbhh F Eukaryota T 7m25 1 A A PawL-Derived Peptide PLP-13 TFGVVIAD 8 T 0.47 DUF3905 pdbhh F T 7m27 1 A A PawL-Derived Peptide PLP-16 GLFPYGPD 8 T 0.0096 LINES_C pdbhh F T 7m28 1 A A PawL-Derived Peptide PLP-22 GLPPYVD 7 T 2.5 DUF2119 pdbhh F T 7m29 1 A A PawL-Derived Peptide PLP-29 GYFPVGVD 8 T 1.7 PhnH pdbhh F T 7m2b 1 A A PawL-Derived Peptide PLP-42 TFFNPVID 8 T 0.53 PsbK pdbhh F T 7m2c 1 A A PawL-Derived Peptide PLP-46 GYITPLD 7 T 3.1 Caleosin pdbhh F T 7m3t 2 G G COAT_STMV Coat protein MGRGKVKPNRKSTGDNSNVVTMIRAGSYPKVNPTPTWVRAIPFEVSVQSGIAFKVPVGSLFSANFRTDSFTSVTVMSVRAWTQLTPPVNEYSFVRLKPLFKTGDSTEEFEGRASNINTRASVGYRIPSNLRQNTVAADNVCEVRSNCRQVALVISCCFN 159 T 24 Csm4_C pdbhh T Viruses T 7m3t 3 N N COAT_STMV Coat protein MGRGKVKPNRKSTGDNSNVVTMIRAGSYPKVNPTPTWVRAIPFEVSVQSGIAFKVPVGSLFSANFRTDSFTSVTVMSVRAWTQLTPPVNEYSFVRLKPLFKTGDSTEEFEGRASNINTRASVGYRIPTNLRQNTVAVDNVCEVRSNCRQVALVISCCFN 159 T 24 Csm4_C unphh T Viruses T 7m3u 1 A A PawS-Derived Peptide PDP-24 GFCWQHTCLPSGCADFPWPVGHQCFPD 27 T 0.098 DUF5763 pdbhh F T 7m4n 1 A,B A,B RN216_HUMAN RING FINGER PROTEIN 216,RING-TYPE E3 UBIQUITIN TRANSFERASE RNF216,TRIAD DOMAIN-CONTAINING PROTEIN 3,UBIQUITIN-CONJUGATING ENZYME 7-INTERACTING PROTEIN 1,ZINC FINGER PROTEIN INHIBITING NF-KAPPA-B GPEELAEKDDIKYRTSIEEKMTAARIRKCHKCGTGLIKSEGANRMSCRCGAQMCYLCRVSINGYDHFCQHPRSPGAPCQECSRCSLWTDPTEDDEKLIEEIQKEAEEEQKRKNGENTFKRIGPPLEKPVEKVQRVEAL 138 T 0.0044 Rhodanese_C pdbpercent F Eukaryota T 7m5f 1 A A CdiI MKEIKLMADYHCYPLWGTTPDDFGDISPDELPISLGLKNSLEAWAKRYDAILNTDDPALSGFKSVEEEKLFIDDGYKLAELLQEELGSAYKVIYHADY 98 T 0.052 RHH_5 pdbpercent F T 7m5l 2 D,E,F E,D,F Peptide mimetic (ACE)RQCSMTCFYHSK(NH2) with linker XRQCSMTCFYHSKX 14 T 0.56 Fer4_12 pdbhh F T 7m5t 1 A A De novo designed protein 0515 MDFTERLDRLVKYAKEIAKWYKESGDPDFANSVDNVLGHLENIRKAFKHGDPARAMDHVSNVVGSLDSIQTSFKQTGNPEIATRWQELTQEVRELYAYLG 100 T 0.0097 PEX11 pdb F T 7m60 1 A,B C,B MLH1_HUMAN MUTL PROTEIN HOMOLOG 1 SANPRKRHRED 11 T 0.57 T_cell_tran_alt pdbhh F Eukaryota T 7m67 1 A A A0A5K4F6V0_SCHMA Schistocin-1 antimicrobial peptide GILDIKNKVSNLFKKIKGEKX 21 T 2.3 KxDL pdbhh F Eukaryota T 7m6t 4 D D Non-canonical peptide F3 ALQHLMDKWMAM 12 T 2 ATG16 pdbhh F T 7m73 1 A A A0A5K4F6V0_SCHMA Schistocin-2 antimicrobial peptide GILDIKNKVSNLFKKIKX 18 T 2 C_Hendra pdbhh F Eukaryota T 7m77 1 A A G4VEE0_SCHMA Schistocin-3 antimicrobial peptide GILDIKNKVSNLFX 14 T 6 DUF3484 pdbhh F Eukaryota T 7m79 1 A A G4VEE0_SCHMA Schistocin-4 antimicrobial peptide GILDILNKVSNLFX 14 T 0.25 Antimicrobial20 pdbhh F Eukaryota T 7m7a 1 A,B,C,D A,B,C,D Q5ZRA8_LEGPH Phosphoinositide 3-kinase MavQ SEFELRRQASMGLPKKALKESQLQFLTAGTAVSDSSHQTYKVSFIENGVIKNAFYKKLDPKNHYPELLAKISVAVSLFKRIFQGRRSAEERLVFDDEERLVGTLSISVDGFKGFNFHKESVPQESSAKEQVIPSTRTLIEKSFMEILLGRWFLDDDDGHPHNLSLAGDIDFDMFFYWFTIYMKEPRPAIGIPKTRVNLTVRDWEGFPNVKDSKPFHWPTYKNPGQETLPTVLPVQDKLVNLILEKTYPDPGQFEQLAHEPVAQEQKFAAALKILLTYQPEMIRKRLTELFGEMTLNYTSLDETDVALRNQYEKTFPHLCNENTNIKPFVDFIMNLYQMHYDNLYRVVVFYMGCENNGYGVPLPATNSALYHKPSFYKDIVEWARTQNITIFSKDDSSIKFDEDELRRRYHQVWRDAYAPTFRDLLHDSYSLTNKLLQQVSTFHVVLDEVEGKKPTDDTLTNAWELFGTMPELSLEKITPLISVDKDSKLRTALILLVEFTTQFHAVAKTYYQKDRKDLTEEDNLEFSEQLVQLYTNYNLKIRQSLAHTSTLAGEFNRIAVGLKQYTERANFQLHLTTTDEQMKEATVATT 590 T 0.12 SYF2 pdbpssm F Bacteria T 7m7x 1 A A CsrA-binding peptide XVCSELCWX 9 T 0.65 RPAP2_Rtr1 pdbhh F T 7m98 2 B B H4_HUMAN Histone H4 SGRGXGGKGLGKGGA 15 T 11 Shadoo unppercent F Eukaryota T 7mb9 2 C,D C,D R1AB_SARS2 ARG-GLU-PRO-MET-LEU-GLN REPMLQ 6 T 74 Phageshock_PspD pdbhh T Viruses T 7md4 1 A,B N,M INSR_HUMAN IR AAAKELEESSFRKTFEDYLHNVVFVPSPSR 30 T 0.00017 DUF4998 unphh F Eukaryota T 7mdt 1 A,E,F A,C,E Q2N0S6_9HIV1 SU, GLYCOPROTEIN 120, GP120 MKRGLCCVLLLCGAVFVSPSQEIHARFRRGARAENLWVTVYYGVPVWKDAETTLFCASDAKAYETKKHNVWATHCCVPTDPNPQEIHLENVTEEFNMWKNNMVEQMHTDIISLWDQSLKPCVKLTPLCVTLQCTNVTNNITDDMRGELKNCSFNMTTELRDKKQKVYSLFYRLDVVQINENQGNRSNNSNKEYRLINCNTSAITQACPKVSFEPIPIHYCAPAGFAILKCKDKKFNGTGPCPSVSTVQCTHGIKPVVSTQLLLNGSLAEEEVMIRSENITNNAKNILVQFNTPVQINCTRPNNNTRKSIRIGPGQWFYATGDIIGDIRQAHCNVSKATWNETLGKVVKQLRKHFGNNTIIRFANSSGGDLEVTTHSFNCGGEFFYCNTSGLFNSTWISNTSVQGSNSTGSNDSITLPCRIKQIINMWQRIGQAMYAPPIQGVIRCVSNITGLILTRDGGSTNSTTETFRPGGGDMRDNWRSELYKYKVVKIEPLGVAPTRCKRRVVGRRRRRR 513 T 3.8E-54 GP120 pdbpercent T Viruses T 7mdu 2 B A Q2N0S6_9HIV1 SU, GLYCOPROTEIN 120, GP120 MKRGLCCVLLLCGAVFVSPSQEIHARFRRGARAENLWVTVYYGVPVWKDAETTLFCASDAKAYETEKHNVWATHACVPTDPNPQEIHLENVTEEFNMWKNNMVEQMHEDIISLWDQSLKPCVKLTPLCVTLQCTNVTNNITDDMRGELKNCSFNMTTELRDKKQKVYSLFYRLDVVQINENQGNRSNNSNKEYRLINCNTSAITQACPKVSFEPIPIHYCAPAGFAILKCKDKKFNGTGPCPSVSTVQCTHGIKPVVSTQLLLNGSLAEEEVIIRSENITNNAKNILVQLNTPVQINCTRPNNNTVKSIRIGPGQAFYYTGDIIGDIRQAHCNVSKATWNETLGKVVKQLRKHFGNNTIIRFAQSSGGDLEVTTHSFNCGGEFFYCNTSGLFNSTWISNTSVQGSNSTGSNDSITLPCRIKQIINMWQRIGQAMYAPPIQGVIRCVSNITGLILTRDGGSTNSTTETFRPGGGDMRDNWRSELYKYKVVKIEPLGVAPTRCKRRVVGRRRRRR 513 T 3.8E-54 GP120 pdbpercent T Viruses T 7mex 3 C D N-degron RHGSGSGAWLLPVSLVKRKTTLAPNTQTASPPSYRALADSLMQ 43 T 19 EZH2_N pdbhh F T 7mgr 2 B B R1AB_SARS2 ALA-VAL-LYS-LEU-GLN-ASN-ASN-GLU AVKLQNNEL 9 T 5.2 Phospho_p8 pdbhh T Viruses T 7mgs 2 B B R1AB_SARS2 SER-ALA-VAL-LEU-GLN-SER-GLY-PHE SAVLQSGFR 9 T 12 GPAT_N pdbhh T Viruses T 7mgv 2 B,C V,U CdnA3 Leader peptide KEPFFAAFLEKQ 12 T 0.028 Inhibitor_I10 pdb F T 7mgv 3 E T CdnA3 Core peptide TLKYPSDSDEG 11 T 1.1E-05 Inhibitor_I10 pdbhh F T 7mir 1 A A SIDJ_LEGPH Calmodulin-dependent glutamylase SidJ SGHVKQYYFARRGETSTHDTSLPPPVKVLSGRSIPLKEIPFEATRNELVQIYLTSIDKLIKSNKLNSIPSQQIASHYLFLRSLANSETDGIKKNQILSLAKPLGTYLASKEPHVWKMINELIEKSEYPIIHYLKNNRAHSNFMLALIHEYHKEPLTKNQSAFVQKFRDSSVFLFPNPIYTAWLAHSYDEDSSFNPMFRERLSTNFYHSTLTDNLLLRTEPKEVTLSSEHHYKKEKGPIDSSFRYQMSSDRLLRIQGRTLLFSTPQNDVVAVKVQKKGEPKSTLEEEFEMADYLLKHQRRLDVHSKLPQPLGQYSVKKSEILEISRGSLDFERFKTLIDDSKDLEVYVYKAPQSYFTYLHDKNQDLEDLTASVKTNVHDLFVLLREGIVFPQLADIFHTHFGEDEREDKGRYQALVQLLNVLQFQLGRIDKWQKAVEYVNLRSSGLADLGDSLPITSLFTSSDFTKHYFSELLTGGYHPTFFDKSSGTANSLFTGKRRLFGNYLYLNTIAEYLLVIQLTLGSYGDKVTRDMMDKPKKEAVWRELANVMFTSCAEAIHIMTGIPQSRALTLLKQRANIEKHFRQTQFWMTPDYSKLDEDTLQMEQYSIYSGEPEYEFTDKLVSGVGLSVDGVHQDLGGYNRESPLRELEKLLYATVTLIEGTMQLDKEFFKQLEQVEKILSGEIKTDANSCFEAVAQLLDLARPGCHFQKRLVLSYYEEAKLKYPSAPTDAYDSRFQVVARTNAAITIQRFWREARKN 756 T 0.29 IQ pdb F Bacteria T 7mj3 1 A A Hact-4 QPRSHVDCPALHGQCQSLPCTYPLVFVGPDPFHCGPYPQFGCCA 44 T 5.3 Toxin_14 pdbhh F T 7mj5 1 A,N,O,P,Q,R A,N,O,P,Q,R A2IAB2_9NEOP Putative secreted salivary protein KPVEAEVAQPKLYQRGEGGNGMEPIPEDVLNEALNA 36 T 3.3 FixH unphh F Eukaryota T 7mja 3 C C PHX2B_HUMAN NEUROBLASTOMA PHOX,NBPHOX,PHOX2B HOMEODOMAIN PROTEIN,PAIRED-LIKE HOMEOBOX 2B QYNPIRTTF 9 T 4 Myb_DNA-bind_3 unp F Eukaryota T 7mjb 1 A,B A,B Nanoluc Luciferase MGSSHHHHHHSSGMVFTLEDFVGDWRQTAGYNLDQVLEQGGVSSLFQNLGVSVTPIQRIVLSGENGLKIDIHVIIPYEGLSGDQMGQIEKIFKVVYPVDDHHFKVILHYGTLVIDGVTPNMIDYFGRPYEGIAVFDGKKITVTGTLWNGNKIIDERLINPDGSLLFRVTINGVTGWQLCERILA 184 T 0.021 Lipocalin_7 pdbhh F T 7mkk 1 A,C,E,G B,A,E,G Q9W3W6_DROME SMALL OVARY,ISOFORM B,ISOFORM C MDTSMKEKVKAKLVEIRKFVPFIRRVRIDFQDTLSKVQGHRLDALVNLLDREDVSMSSLNKIEVIIDKLRTRFNPRIE 78 T 0.007 Merozoite_SPAM pdbpssm F Eukaryota T 7mkk 2 B,D,F,H C,D,F,H PANX_DROME PROTEIN SILENCIO SMEPKIKEDADNAMLDSLLADPFENNSP 28 T 15 DBB pdbhh F Eukaryota T 7mmm 1 A A LYT1_LYCER Toxin LyeTx 1 IWLTALKFLGKNLGKHLAKQQLAKLX 26 T 0.019 Cu pdbpercent F Eukaryota T 7mnj 1 A,B,C A,B,C RBP2_HUMAN 358 KDA NUCLEOPORIN,NUCLEAR PORE COMPLEX PROTEIN NUP358,NUCLEOPORIN NUP358,RAN-BINDING PROTEIN 2,RANBP2,P270 DGWNKLFDLIQSELYVRPDDVHVNIRLVEVYRSTKRLKDAVAHCHEAERNIALRSSLEWNSCVVQTLKEYLESLQCLESDKSDWRATNTDLLLAYANLMLLTLSTRDVQESRELLQSFDSALQSVKSLGGNDELSATFLEMKGHFYMHAGSLLLKMGQHSSNVQWRALSELAALCYLIAFQVPRPKIKLIKGEAGQNLLEMMACDRLSQSGHMLLNLSRGKQDFLKEIVETFANKSGQSALYDALFSSQSPKDTSFLGSDDIGNIDVREPELEDLTRYDVGAIRAHNGSLQHLTWLGLQWNSLPALPGIRKWLKQLFHHLPHETSRLETNAPESICILDLEVFLLGVVYTSHLQLKEKCNSHHSSYQPLCLPLPVCKQLCTERQKSWWDAVCTLIHRKAVPGNVAKLRLLVQHEINTLRAQEKHGLQPALLVHWAECLQKTGSGLNSFYDQREYIGRSVHYWKKVLPLLKIIKKKNSIPEPIDPLFKHFHSVDIQASEIVEYEEDAHITFAILDAVNGNIEDAVTAFES 529 T 0.015 TPR_19 pdbpercent F Eukaryota T 7mnk 1 A,B,C,D A,B,C,D RBP2_HUMAN 358 KDA NUCLEOPORIN,NUCLEAR PORE COMPLEX PROTEIN NUP358,NUCLEOPORIN NUP358,RAN-BINDING PROTEIN 2,RANBP2,P270 SYEDQNSLLKMICQQVEAIKKEMQELKLNS 30 T 0.4 DUF5320 pdbhh F Eukaryota T 7moq 21 X X Q22T00_TETTS Docking complex 1 protein MRASSATKSQKDLKTLEEEYIHQSKKSNLLENDRKIFHKNAEETKNNNMQIIESLKKENKQLKTLRDELIANKRASTPGMSKTQGSLVSWSGDIKDENYWRRKFDEARHATKNKKSQLLQLQDKLNEVSDAKFGAVEESPLMRQIRILENRLDKVMIKFNEAQSIRKTYEQIVKRLKEERVGYDNQLAAIERSLKGKEHDFEELLLLAHDATHAKELAAAELKKYEHKKAAVRELRKTYIAEKRKAIEQREAVISRMEKKDKDNDDRNLEKSQANNLNELNNPQIEPQNHQDATFQRQKLNDYDEAFRKLYEATGVTDVNEIIQKFTTQDETSKSLKDLQREYQDTIDDKKKQRDDLKAGLNALKYEGNENPNRKQLDEIEKNVNNAVNKCDKAKLKYERVSKILVDVKAGIEHLYEKLEFYKLEGKPNIVITDETLVEGLSQIVEKMKLIFQPVKNDPSYNPEDFKQTAKGVSNYINLNLRDKSGRIESISKNIRVKLPEKDEEEVSNDEIEDDIDIETTTKLKQKYQAQAKQEKAARNKQKKQLGSTQQGRKV 555 T 0.051 CALCOCO1 pdbhh F Eukaryota T 7moq 22 Z Z Q233H6_TETTS Outer dynein arm docking complex protein oda protein MNENLEKKKKMEELEEYQRKFRNLESDRKAYAEETVALIKKQRGIVDKLKNENQQLKDIISKMNAQKIQQSNTMYGKPSSDSLVEELKQKIEVERRQQMEIEKHVVDFQKKIIEKRSNIGGYNAGAENDSSLAKQIKILENRLDKANQKFNEAIAVNKQLRQQIDSLRRERVIFDNLYKKLEKELHEKRKQMANIIETANTAYEERDRANDQIQNLKMLAKKESENFEKDLRELSHIMEKNKKALDYIKLTEKNRDDNKLNNDLLDSDKFARTTSQKLYKDRNVNQTQSEKIQRYEEDFAKIQAATKVNDFEKLVNTFIENEEKNFQTFKFVNELSNEIEELEKQIGELRSELDQYKGGSNMDIQYKRKIKEFEEVMTRAENKSESYEFKRHDAQKLINSLTNWIETLFNTIECDKKVAKELAGSHSVTDGNMMIFLAIIENKVNQIVQAFSAIDAQGANENYHTLLQNVSNLSTALMANKQRQDAPDNDEFEEEEGEGDRILNIEDFRKKALEKLDDRKQTQQSKKLPKAITNRRKR 538 T 0.0012 Imm30 pdbpssm F Eukaryota T 7mp3 2 E,F L,N bicyclic peptide B8 KFEGYDNEFP 10 T 4.6 DUF1284 pdbhh F T 7mpa 1 A A DWORF_HUMAN SERCA REGULATOR DWORF,DWARF OPEN READING FRAME,DWORF,SMALL TRANSMEMBRANE REGULATOR OF ION TRANSPORT 1 AMAEKAGSTFSHLLVPILLLIGWIVGCIIMIYVVFS 36 T 0.23 GNVR pdbpssm F Eukaryota T 7mq8 65 RB ST Nucleolar protein 14 MAKAKKVGARRKASGAPAGARGGPAKANSNPFEVKVNRQKFQILGRKTRHDVGLPGVSRARALRKRTQTLLKEYKERDKSNVFRDKXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXTEAELAKEEQEHLRKLEAERLRRMLXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXAEGNKAKLEKLFGFLLEYVGDLATDDPPDLTVIDKLVVHLYHLCQMFPESASDAIKFVLRDAMHEMEEMIETKGRAALPGLDVLIYLKITGLLFPTSDFWHPVVTPALVCLSQLLTKCPILSLQDVVKGLFVCCLFLEYVALSQRFIPELINFLLGILYIATPNKASQGSTLVHPFRALGKNSELLVVSAREDVATWQQSSLSLRWASRLRAPTSTEANHIRLSCLAVGLALLKRCVLMYGSLPSFHAIMGPLQALLTDHLADCSHPQELQELCQSTLTEMESQKQLCRPLTCEKSKPVPLKLFTPRLVKVLEFGRKQGSSKEEQERKRLIHKHKREFKGAVREIRKDNQFLARMQLSEIMERDAERKRKVKQLFNSLATQEGEWKALKRKKFKK 632 T 7.999999999999999E-26 Nop14 pdbpssm F T 7mqq 1 A A A0A5B0MRS6_PUCGR Stem rust effector protein AvrSr50 GPMARSLIKTDWSGSEYTILGANHYEEPNTGAAAQFPGTMAEDDGRSPYIVRKLRNSSGKRFYVFTDHPQQPIIWNPHEEIEIQFSRKYLIAVLTEFEADSKVFTHFARRQHRS 114 T 6.3 SelB-wing_3 pdbhh F Eukaryota T 7mrw 2 B B A0A2I0BSI4_PLAFO High molecular weight rhoptry protein 2 MIKVTIFLLLSIFSFNLYGLELNEKVSIKYGAEQGVGSADSNTKLCSDILKYLYMDEYLSEGDKATFEKKCHNVIGNIRNTFSNKNTIKEGNEFLMSILHMKSLYGNNNNNNAGSESDVTLKSLYLSLKGSQNTEGESEVPSDDEINKTIMNFVKFNKYLLDNSNDIKKVHDFLVLTSQSNENLLPNKEKLFEQIVDQIKYFDEYFFASGGKIKVKKGYLKYNFLDIYKQPVCSAYLHLCSRYYESVSIYIRLKKVFNGIPAFLDKNCRKVKGEEFKKLMDMELKHNHIVERFDKYIISDDLYYVNMKVFDLKNVDKIQVSKIDDINNLNIYEHKETMHLSAKNLSRYIDIKKELNDEKAYKQLMSAIRKYVTTLTKADSDITYFVKQLDDEEIERFLIDLNFFLYNGFLRITEDKHLINADDVSPSYINLYRSNNIVALYILKTQYEENKLSEYRAHKFYRRKRVSNITNDMIKKDFTQTNALTNLPNLDNKKTTEYYLKEYENFVENFQPDLHDIMKLQLFFTMAFKDCNVNQNFTETSKKLWFDLLYAYDKFGWFYIHPNEVINSINKTDFVRHVLVSRNFLLKNNDQLTFLETQVAKIVEIINLSLEVDKSPDSLDFSIPMNFFNHKNGYHVMNDDKLKLLTSYEYIDSIANNYFFLSEYKNDVFRTGNNFKLYFNLPNIYSLAYQLFNELAININVITNVPLKKYLKYNASYAYFTLMNMIGKNHDIYSKGSRFVYASYILGLVFFIESHIDIARLKPKDFFFMKQSLPIIDHVYHKDLKTLKKNCTLLTDFMKINKNSQNYSLTHTEEMIKILGLLTVTLWAKEGKKSVYYDDDVSLYRKLMVSCVFNGGETIQEKLANNIEKSCDISQYGIKSKNLKDMIDINLSIHKWNPAEIEKLAYSFVLSCKMQKLMYKPMNVEKLPLEDYYKLSLAPDMVKTYHCYKLGKQAAELLESIILKKKFVRFRVTDAIDVYDFFYIKKVLSSRIKKEYNEFLQDKRAFEKKELETILNNSPFSEEQTMKLINSYECHWFTSYENFRILWMHASSNLGTGTYLKNFFSELWQNIRFLFKSKLKIRDMEYFSGDISQMNLLDYYSPMVHSESHCQEKMQVLFITLRDSKEENRSEIAQKVKSAYYQCKLDYYKNHHSDFIHRIHPNDFLNNKVYVLKQPYYLMSNVPLNNPKKVSRLFVTEGTLEYLLLDKINIPECFGPCTKLHFNKVVIKESKQRIYDMTINNALVPEIQPYNRRKYMTIYINEAYIKNIVSDALTSEEIKRHDIQKGNIKICMGKSTYLTEPILTEEHFNLTHKPVYDFSSVKHNLKVFHMKNEHLVSEDPNDDCFINYPLATINLDISDPYKEISEDLIKNLYILKSS 1378 T 1.3 Crystall_2 pdbpercent F Eukaryota T 7mrw 3 C C W7JUX6_PLAFO High molecular weight rhoptry protein 3 MRSKHLVTLFIITFLSFSTVKVWGKDVFAGFVTKKLKTLLDCNFALYYNFKGNGPDAGSFLDFVDEPEQFYWFVEHFLSVKFRVPKHLKDKNIHNFTPCLNRSWVSEFLKEYEEPFVNPVMKFLDKEQRLFFTYNFGDVEPQGKYTYFPVKEFHKYCILPPLIKTNIKDGESGEFLKYQLNKEEYKVFLSSVGSQMTAIKNLYSTVEDEQRKQLLKVIIENESTNDISVQCPTYNIKLHYTKECANSNNILKCIDEFLRKTCEKKTESKHPSADLCEHLQFLFESLKNPYLDNFKKFMTNSDFTLIKPQSVWNVPIFDIYKPKNYLDSVQNLDTECFKKLNSKNLIFLSFHDDIPNNPYYNVELQEIVKLSTYTYSIFDKLYNFFFVFKKSGAPISPVSVKELSHNITDFSFKEDNSEIQCQNVRKSLDLEVDVETMKGIAAEKLCKIIEKFILTKDDASKPEKSDIHRGFRILCILISTHVEAYNIVRQLLNMESMISLTRYTSLYIHKFFKSVTLLKGNFLYKNNKAIRYSRACSKASLHVPSVLYRRNIYIPETFLSLYLGLSNLVSSNPSSPFFEYAIIEFLVTYYNKGSEKFVLYFISIISVLYINEYYYEQLSCFYPKEFELIKSRMIHPNIVDRILKGIDNLMKSTRYDKMRTMYLDFESSDIFSREKVFTALYNFDSFIKTNEQLKKKNLEEISEIPVQLETSNDGIGYRKQDVLYETDKPQTMDEASYEETVDEDAHHVNEKQHSAHFLDAIAEKDILEEKTKDQDLEIELYKYMGPLKEQSKSTSAASTSDEISGSEGPSTESTSTGNQGEDKTTDNTYKEMEELEEAEGTSNLKKGLEFYKSSLKLDQLDKEKPKKKKSKRKKKRDSSSDRILLEESKTFTSENEL 897 T 11 Phage_TAC_10 pdbhh F Eukaryota T 7msl 1 A A A0A377JKY9_HAEPA AcrIIC4 GSMAMKITSSNFATIATSENFAKLSVLPKNHREPIKGLFKSAVEQFSSARDFFKNENYSKELAEKFNKEAVNEAVEKLQKAIDLAEKQGIQF 92 T 1.3 Nif11 unphh F Bacteria T 7mto 1 A A SET_HUMAN HLA-DR-ASSOCIATED PROTEIN II,INHIBITOR OF GRANZYME A-ACTIVATED DNASE,IGAAD,PHAPII,PHOSPHATASE 2A INHIBITOR I2PP2A,I-2PP2A,TEMPLATE-ACTIVATING FACTOR I,TAF-I GTSEKEQQEAIEHIDEVQNEIDRLNEQASEEILKVEQKYNKLRQPFFQKRSELIAKIPNFWVTTFVNHPQVSALLGEEDEEALHYLTRVEVTEFEDIKSGYRIDFYFDENPYFENKVLSKEFHLNESGDPSSKSTEIKWKSGKDLTKRSSQTQNKASRKRQHEEPESFFTWFTDHSDAGADELGEVIKDDIWPNPLQYYLVPDM 204 T 6E-06 NAP pdb F Eukaryota T 7mu2 1 A,C A,C WIPI2_HUMAN WIPI2 MAGQLLFANFNQDNTSLAVGSKSGYKFFSLSSVDKLEQIYECTDTEDVCIVERLFSSSLVAIVSLKAPRKLKVCHFKKGTEICNYSYSNTILAVKLNRQRLIVCLEESLYIHNIRDMKVLHTIRETPPNPAGLCALSINNDNCYLAYPGSATIGEVQVFDTINLRAANMIPAHDSPLAALAFDASGTKLATASEKGTVIRVFSIPEGQKLFEFRRGVKRCVSICSLAFSMDGMFLSASSNTETVHIFKLETVGSGSFNQGRAFATVRLPFCGHKNICSLATIQKIPRLLVGAADGYLYMYNLDPQEGGECALMKQHRLDGSLET 324 T 0.00048 WD40 pdbpercent F Eukaryota T 7mu2 2 B,D B,D E7EVC7_HUMAN Autophagy-related protein 16-1 YAENEKDSRRRQARLQKELAEAAKE 25 T 0.0039 Macoilin unphh F Eukaryota T 7mu9 1 A A Q8PJC6_XANAC CARBOXYPEPTIDASE, VIPCD GSHMSDPRHPDNAMYNGAVSKLEALGERGGFANRKELEQAAGQIVFESKVSGLQRIDHVVPNKSGDGFFAVQGELTDPAMQRVFVDRNQAQNQPLENSSRQAAEE 105 T 0.068 DUF4369 pdb F Bacteria T 7muc 9 I,IA,IB,IC,ID,IE,IF,V,VA,VB,VC,VD,VE AN,CN,EN,GN,IN,KN,MN,BN,DN,FN,HN,JN,LN A0A2S6FAR3_LEGPN Neurogenic locus notch MLFLKIKTNQRTTMNILKPKAFLLASVFVLSISPAFAADGCCSKMGGINYCDSSAGRLVCNNGFYSTCYCTRHAVMDLQFLMGCCLWHGGVYPQLNSSGLVVCNDGYVSEECSLQKPVEQISVY 124 T 0.067 PAN_2 pdbpercent F Bacteria T 7mvt 1 A B NIC96_CHATD NUCLEAR PORE PROTEIN NIC96 SEPGKAHYFLAASGVDPGAAVRDLGALGLQAKTERTAASVGPAAGPSGVSTTGFGTGLGEVDVDTYLSNLQTKTTLSMIADGLERSARDFDAFLEENVTLEWEAQRKRIYQHFGIK 116 T 2.6 UPF0172 pdbhh F Eukaryota T 7mvu 2 B B NIC96_CHATD NUCLEAR PORE PROTEIN NIC96 SGTGLGEVDVDTYLSNLQTKTTLSMIADGLERSARDFDAFLEENVTLEWEAQRKRIYQHFGIK 63 T 0.19 UPF0172 pdbpssm F Eukaryota T 7mvv 3 C D NUP53_CHATD NUCLEAR PORE PROTEIN NUP53 SQQDGSLRSRKANLETGAFGKSTRRTRSKAATPAKRED 38 T 33 DUF6374 pdbhh F Eukaryota T 7mvv 4 D C NU145_CHATD N-NUP145 SATPLSGKAKVKSRSILPMYKLSPANASRLVTTPQKRAYGFSFSAYGSPTSPSSSASSTPGAFGQSILS 69 T 68 DUF3591 pdbhh F Eukaryota T 7mvw 1 A A NU188_CHATD NUCLEAR PORE PROTEIN NUP188 GPHNMATLTDRTYLPPLEDCLTGRTVILSWRLVASALEDADLARLTSPALSTFLRDGFVHELLKHPARVFEPKDLKQEFETKTSSIQTVAPGVDTIKKDALWLADAVAINQVAALRIVLIEYQTRAHSHLVLPLSTQDVANIQEAAGVGDAHASSILSLLNPASAVDAETMWCDFETEARRRERILATYLSERRSFTAAVDALVTFLLHSAPGQHKDLDSLRRALLKDAFAFDEDLDVPDRSKLLTMAPTYMNLVEDCIARAQALPAKLGESFKTEAFELDWLRTAITEAVHSLSIAFQALDLDTPYFAPHELLSEWFELMNSSLFLESILGFEVVADLAMPARSLVSAICLKMLNIDRTIQFLHDFDYPDGEEPYLLSSQTLNKIHTAVTNAVNSGVAASLPVAFAWSLIVHQMHLGYQERAERRDLLVNQRAQAGFELEFQPSASTPNRRRRNSAGSIVSLEASPYDDFLREQRLDNDIAPVEQIAMLATSRGQVYQVMSEMALCLGTTHEAAFRPAVGARARLVFQDLLKRSAYLIPYQDEPVFSLLAILATGRQYWDVTDALSASSLNQVYTDMLDDETLFTQFTMQAINRFPYEFNPFSVLCRVLAAALITNKDKADVVTGWLWRTPTLTVDWNPAWDRSYELCFEDENTNSFRLTRDVDLFGSASPARPRHLAAEERFIIPEGTLGRFVTDVGRTARLEFEHSALALLGKRLEVKAAEEICDSGMAPLDVDEQAEAVAMLATVLRAESLKSTAKGGDPEAPLKFLKEASRLLPHNKDILTVISDTIDGLVEKELLELDGPQIAVLASCLQFLHAALAVCPGRVWAYMSRCALIAGDARPGRLSRITGSLDMYAERFDLLSSAVKLFAALIDSAACSAVQRRAGSTALVSVRSAVENPWLGTSEKILSRVALAIAQAALDVYESTTTWRFRSELDRSILVRDVVGLMHKLVVHAHTLSSHLTSTLSPAAAHIISSFLTPPPSASSLRFQPLLGTLLVALITPRATLYPGQSRILAERVTSVLAFCTSLLRAADFLGQTHIPLQTHLFQSACLLARLPAANAVYRAPVLELLRALVEVAGRAANGSGEPPSLLGYLGSHAARSFISLVEGIDKPFGRVEHAVVTWRFFAAVIRN 1138 T 8.4E-19 Nup188 pdbpercent F Eukaryota T 7mvz 3 C C NU145_CHATD N-NUP145 SNASRLVTTPQKRAYGFSFSAYGSPTSPSSSASSTPGAFGQSILSSSINRGLNKSISASNLRRSLNVEDSILQPGAFSANSSMRLLGGPGSHKK 94 T 85 WSK pdbhh F Eukaryota T 7mw1 2 C,D D,C NUP35_HUMAN 35 KDA NUCLEOPORIN,MITOTIC PHOSPHOPROTEIN 44,MP-44,NUCLEAR PORE COMPLEX PROTEIN NUP53,NUCLEOPORIN NUP53 SDKSGAPPVRSIYDDISSPGLGSTPLTSRRQPNISVMQSPLVGVTSTPGTGQSMFSPASIGQPRKTTL 68 T 0.27 DUF4712 pdbpercent F Eukaryota T 7mwq 1 A,C A,C LHD29A53 SSIFLLSNVSEDAAQLAEELVREISKKEGTEVRFEKDDGFLTIEVKNLSEERLREIAKALQLIVDVANAERVVRERPGSNLAKKALEIILRAAEELAKLDLKASLKAAVRAAEKVVREQPGSNLAKKALEIILRAAEELAKLPDPEALKEAVKAAEKVVREQPGSELAKKALEIIERAAEELKKSPDPEAQKEAKKAEQKVREERPGS 208 T 0.0026 Activator-TraM pdb F T 7mwq 2 B,D B,D LHD29B53 TWQWVLINISEEARQLIEKAVRAISKKEGTEVHFEKDDGVLHIRVKNLHEKRAREIHKVAKLILEVAAAERIVRERPGSNLAKKALEIILRAAEELAKADVDAALEAAVRAAEKVVREQPGSNLAKKALEIILRAAEELAKLPDPEALKEAVKAAEKVVREQPGSELAKKALEIIERAAEELKKSPDPEAQKEAKKAEQKVREERPGS 208 T 0.0089 YfiO pdbpssm F T 7mwr 1 A A LHD101A54 GSNDEKEKLKELLKRAEELAKSPDPEDLKEAVRLAEEVVRERPGSNLAKKALEIILRAAEELAKLPDPEALKEAVKAAEKVVREQPGSNLAKKALEIILRAAAALANLPDPESRKEADKAADKVRREQPGSELAVVAAIISAVARMGVKMELHPSGNEVKVVIKGLHIKQQRQLYRDVREAAKKAGVEVEIEVEGDTVTIVVRG 204 T 0.034 YfiO pdbpssm F T 7mwr 2 B B LHD101B4 YEDECEEKARRVAEKVERLKRSGTSEDEIAEEVAREISEVIRTLKESGSSYEVICECVARIVAEIVEALKRSGTSEDEIAEIVARVISEVIRTLKESGSSYEVICECVARIVAEIVEALKRSGTSEEEIAEIVARVIQEVIRTLKESGSSYEVIRECLRRILEEVIEALKRSGVDSSEIVLIIIKIAVAVMGVTMEEHRSGNEVKVVIKGLHESQQEELLELVLRAAELAGVRVRIRFKGDTVTIVVRG 249 T 0.00012 mTERF pdb F T 7mx1 2 C,D C,E ACE-PRO-LEU-ALA-SER-TPO XPLAST 6 T 280 TruB_N pdbhh F T 7mzv 1 A A PUS7_YEAST RNA PSEUDOURIDYLATE SYNTHASE 7,RNA-URIDINE ISOMERASE 7,TRNA PSEUDOURIDINE(13) SYNTHASE MSDSSEATVKRPLDAHVGPSENAAKKLKIEQRTQADGIHEADVGITLFLSPELPGFRGQIKQRYTDFLVNEIDQEGKVIHLTDKGFKMPKKPQRSKEEVNAEKESEAARRQEFNVDPELRNQLVEIFGEEDVLKIESVYRTANKMETAKNFEDKSVRTKIHQLLREAFKNELESVTTDTNTFKIARSNRNSRTNKQEKINQTRDANGVENWGYGPSKDFIHFTLHKENKDTMEAVNVITKLLRVPSRVIRYAGTKDRRAVTCQRVSISKIGLDRLNALNRTLKGMIIGNYNFSDASLNLGDLKGNEFVVVIRDVTTGNSEVSLEEIVSNGCKSLSENGFINYFGMQRFGTFSISTHTIGRELLLSNWKKAAELILSDQDNVLPKSKEARKIWAETKDAALALKQMPRQCLAENALLYSLSNQRKEEDGTYSENAYYTAIMKIPRNLRTMYVHAYQSYVWNSIASKRIELHGLKLVVGDLVIDTSEKSPLISGIDDEDFDEDVREAQFIRAKAVTQEDIDSVKYTMEDVVLPSPGFDVLYPSNEELKQLYVDILKADNMDPFNMRRKVRDFSLAGSYRTVIQKPKSLEYRIIHYDDPSQQLVNTDLDILNNTRAKESGQKYMKAKLDRYMPDKGGEKTAVVLKFQLGTSAYATMALRELMKLETSRRGDMCDVKENI 676 T 0.038 TruD pdbhh F Eukaryota T 7n0w 2 F G CA1A_CONAV Ribbon alpha-conotoxin AusIA SCCARNPACRHNHPCV 16 T 0.0034 Toxin_8 pdbhh F Eukaryota T 7n19 3 C,F,I,L C,F,I,L HST4 peptide GGIGSDNKVTRRGG 14 T 9.2 DUF3976 pdbhh F T 7n1j 2 B,D B,D Binder GGGDRRKEMDKVYRTAFKRITSTPDKEKRKEVVKEATEQLRRIAKDEEEKKKAAYMILFLKTLG 64 T 0.055 UQCC3 pdb F T 7n20 1 A A CA1B_CONAN Alpha-conotoxin AnIB GGCCSHPACAANNQDYCX 18 T 0.0004 Toxin_8 pdbpssm F Eukaryota T 7n21 1 A A CA1B_CONAN Alpha-conotoxin AnIB GGCCSHPACAANNQDYC 17 T 2E-05 Toxin_8 pdbhh F Eukaryota T 7n2d 1 A A ZN292_HUMAN zinc finger protein 292 FRNWQAYMQ 9 T 0.046 zinc_ribbon_11 unp F Eukaryota T 7n2e 1 A C CPEB3_HUMAN CPEB3 QIGLAQTQ 8 T 41 DUF5315 pdbhh F Eukaryota T 7n2p 3 C C A0A2R8Y7R8_HUMAN Ribonuclease H2 subunit B GQVMVVAPR 9 T 7.1 DUF45 pdbhh F Eukaryota T 7n2y 1 A A Apo-(GRAND CoilSerL16CL23C)3 EWEALEKKLAALESKCQALEKKCQALEKKLEALEHG 36 T 0.0003 Lebercilin pdb F T 7n3o 1 A A Cas12k MSQITIQARLISFESNRQQLWKLMADLNTPLINELLCQLGQHPDFEKWQQKGKLPSTVVSQLCQPLKTDPRFAGQPSRLYMSAIHIVDYIYKSWLAIQKRLQQQLDGKTRWLEMLNSDAELVELSGDTLEAIRVKAAEILAIAMPASESDSASPKGKKGKKEKKPSSSSPKRSLSKTLFDAYQETEDIKSRSAISYLLKNGCKLTDKEEDSEKFAKRRRQVEIQIQRLTEKLISRMPKGRDLTNAKWLETLLTATTTVAEDNAQAKRWQDILLTRSSSLPFPLVFETNEDMVWSKNQKGRLCVHFNGLSDLIFEVYCGNRQLHWFQRFLEDQQTKRKSKNQHSSGLFTLRNGHLVWLEGEGKGEPWNLHHLTLYCCVDNRLWTEEGTEIVRQEKADEITKFITNMKKKSDLSDTQQALIQRKQSTLTRINNSFERPSQPLYQGQSHILVGVSLGLEKPATVAVVDAIANKVLAYRSIKQLLGDNYELLNRQRRQQQYLSHERHKAQKNFSPNQFGASELGQHIDRLLAKAIVALARTYKAGSIVLPKLGDMREVVQSEIQAIAEQKFPGYIEGQQKYAKQYRVNVHRWSYGRLIQSIQSKAAQTGIVIEEGKQPIRGSPHDKAKELALSAYNLRLTRRS 639 T 0.0027 RuvC_1 pdbhh F T 7n3t 2 C,D C,D Designed TrkA-binding miniprotein MSHHHHHHHHSENLYFQSGGGRDEIKERIFKAVVRAIVTGNPEQLKEAKKLLEKLKKLGRLDQDAKKFEKAIRQVEKRLRS 81 T 0.012 LCD1 pdb F T 7n50 1 A A Gasdermin SNXSRDTGDELMAALLAEGINLILPPRDNIAPGDLIIADPQGGARLGGWHEVFNLQLSPEVATDPGFKSFQFRASSILQVGVAASVMGRVLQALGLGSGSFSSAFSSSNADTIQLSIVAPANKELTNFDAVLVQMNEAKAEPAQGYTDRNFFVVTKVWRARGIRISVADKSKKQVDLSAKAVEELTAKAKMELKREDTGSYAFLAASQLIFGLTLREVTYKDGAIVDVAPTGPLKFRGKGPGDPFAFIGDDAFVDLPES 259 T 0.0034 Gasdermin pdbhh F T 7n51 1 A A A0A2T4VDM4_9DELT Gasdermin SGLXSDPAITYLKRLGYNVVRLPREGIQPLHLLGQQRGTVEYLGSLEKLITQPPSEPPAITRDQAAAGINGQKTENLSFSIGINILKSVLAQFGAGAGIEAQYNQARKVRFEFSNVLADSVEPLAVGQFLKMAEVDADNPVLKQYVLGNGRLYVITQVIKSNEFTVAAEKSGGGSIQLDVPEIQKVVGGKLKVEASVSSQSTVTYKGEKQLVFGFKCFEIGVKNGEITLFASQPGAIAMALDAAGGVMPSDSALLDEGGLLDLEGF 266 T 0.00083 Gasdermin pdbpercent F Bacteria T 7n52 1 A,B,C,D A,B,C,D Gasdermin SEXNDPFVVALKDKGYSLVAYPKTSIRPLHIYEHTIKNAFKRIWIQSEAQPTSGFIKSLFSDKIHGAIGLSDGQGIDIDLRKTNSLSSAVAAKILESYFQDSAPSFDLAFENSSSVIFHIEEIITTDADEISLRNWLNDNQNELREIYKEEIKKGNFFVATSLLRAKKMRMQFERKNKGELGVDVSKIKNLPVDAKLESKIEGSTYDRLVFETPDEGIVFGVKLVRLFFSDNGILTIDKKQDFNRVLGENMALNLFTEIQDAGFIEVT 268 T 0.13 AKAP95 pdb F T 7n5c 3 C C PA_I97A1 RNA-DIRECTED RNA POLYMERASE SUBUNIT P2 SSLCNFRAYV 10 T 3.9 P34-Arc pdbhh T Viruses T 7n61 4 K,L 0K,0L A0A2K3DV98_CHLRE FAP239 MPPQLGREVQERVKVYGPLNELTYEGRLLTQTLQDELNRSISAPAGPRSPWYEGDPELESMRERVRQQRAIREAQRRRDHAALTASIQKRNLQEEQRRDAMLGSLLGDVIGGLTDPNSPLAEAEAALSHADKVRRKKKESLHNEWSTQVFDTIQGRLQAAVDARDPAAIESRLKTQYDQYLHTTNTKVAVFRDVIIEQDYNPLAAADAAIRVPTGDIRDPLKRDVLKGEYERRLMTGGRGGGGASPTGRGGAAAAGAGSIYGPLGKETLGTQQWGELAVKATPYGHCTDGQGGYVARPLSGSAVALRASRVPMDHYDYPVGNAAAAAEVPPGKRIVPGPEQRRGRQDLFDVVQHTVHLKPQGYTGGDQWLEHKGKGNAPGPEQRRGRRDLADVLQQKAVADGPRGTSAPARGDQLQHKEQGDAWLDAKGKRRVEGPEMRRGRQGLYETLQQTSNPYQGGNKVGDAWLEHKGRKVQPRPEPEAAAALSAVPPLPTVRPPRVGDDKKYAVNIEAAMGQMTVKDGAKVTGW 528 T 0.42 Histone pdbpssm F Eukaryota T 7n61 6 O,P 0O,0P FLAGELLAR ASSOCIATED PROTEIN MEGAAGPSGFRNVEPLSRQERAAARDKDLLEKSRLQARNRGGPLKQPENVVGNPVMPARNAPAFCDEYDRFNRDVAGEMNAKKQQNLQKKEEVYAVKRAEQYHRERSNWETQAQAAAREAARLEASRTTGTGAKRNQGSESYNIISLNYNNSSGGQQLAAKDTAVKEARQARAVNLYSKSHSVSHNIITGEPIKFPTAGKE 201 T 0.36 VIR_N pdbpssm F T 7n65 1 A,E,I A,E,I A0A6H1VCM1_9PLVG ENV POLYPROTEIN MDAMKRGLCCVLLLCGAVFVSPSQEIHARFRRGARAENLWVTVYYGVPVWKDAETTLFCASDAKAYETEKHNVWATHACVPTDPNPQEIHLENVTEEFNMWKNNMVEQMHTDIISLWDQSLKPCVKLTPLCVTLQCTNVTNNITDDMRGELKNCSFNMTTELRDKKQKVYSLFYRLDVVQINENQGNRSNNSNKEYRLINCNTSAITQACPKVSFEPIPIHYCAPAGFAILKCKDKKFNGTGPCPSVSTVQCTHGIKPVVSTQLLLNGSLAEEEVMIRSENITNNAKNILVQFNTPVQINCTRPNNNTRKSIRIGPGQAFYATGDIIGDIRQAHCNVSKATWNETLGKVVKQLRKHFGNNTIIRFANSSGGDLEVTTHSFNCGGEFFYCNTSGLFNSTWISNTSVQGSNSTGSNDSITLPCRIKQIINMWQRIGQAMYAPPIQGVIRCVSNITGLILTRDGGSTNSTTETFRPGGGDMRDNWRSELYKYKVVKIEPLGVAPTRCKRRV 508 T 4.1E-54 GP120 pdbpercent T Viruses T 7n6g 9 LA 1Q A0A2K3E6N2_CHLRE FAP297 MPPLPTLWDPAGAVDKLPQPFRMIDKILADIVEQVVDMIGTRESQRRAEDASRVDLTFAPHAVMEVAPETCCFLPIGIAGIAAVAMPDGEVQVRSARDPSICFSDRTHTSPVTAMEAGMAVCTSGRLLAAASRTALTLHEVDIKTCEISLLATVPLPADPDDTSPPVRLHWSDNLGHLAVCRRSGALALFTLSLPPPNVSLESVGAFVALSLKAFGETRVEELLRVPAATVRAFCLGSAAVGPSNLVWQLQQRPDKSPHSDRYHKSARGAYVWWEGANRLLLLDFEAAAGAAAAAGGAGGAVVPPPEVEAAVKAGAAGTSKPSTPATKAASSKSVAPPGEASSLLTASASAAAAAYAGPERPPEVPQIAPMARDWLLPHDVTAAATTSDHKTMAWGLADGSVVIWDDRSCCSTKVLPRLKGGITALSWVNGVAHKLVCASAGGHIFIADVIKPEDSSQKPYEFPQAIHEVHTLPNEPFALCICRGHTSSDGEIASTHRGGRGGPSILSTMSGHGPGGGGHDVMRVPPQRPRVFWYNVLEEKPVAELMGPRAEQGFGLACCVPPPPSLAPPPPRPDTAATDAGAADGKSAAASAAATPAPGAAPKGKGGAAAPAPPSGGGGGGAAPAASQEAPSEPAMSEAQRALIKAALDAMGANSTTPVLVPVKLHVPAGQVVSYPACVFRDTYLLAGGDVVDKVVRSNFADDDAVPQRVTQLYMYKVDALLRHLLPEDESTSRLGKVVLDRLLADLEAPKMNRKKGKRVKMDVEDPDAPKLSSAMRKPDPFDTTGSRPGSRAANRHITFGGGDADGLFDDVLEGGRAKPKKETKVFPKETKKGLKTGPGDVKMIDKNASGRPRLAPLDLEKAKEAPLPFSERSQSPPWHHTNPLARIHPDWEEAPVLVRIMDRIGSKGGGRKRRDKRLEALTTELMTKYSKEAGAKPNLLVPT 945 T 0.0003 Lgl_C pdbhh F Eukaryota T 7n6g 10 MA,NA 1R,1S A0A2K3DQN7_CHLRE FAP108 MPLYFEEVAPDPKAKKERDAKQQRPAILVERKGPPPAPMHLESQVIPTLIRKVGDWKTGRISQAMCEAYLDRHTLVFDRELLTKLFKEADYQKEGSLDTRALTIAIAGRFPKREHTPEWRLLTALLLGLPELVLTTDAEVTTLRTTHERPVGGGTYNSGNFWDSPPPPLPPVRRRTGSGRSTVGKVTAHEPSPEWLDTLNRTAAAASMSAGGSPSASMAGSFAGAASLNASMLRTGSVGAMDPGGAGVVGTTGGLKQTTQIADEARLNAALMGGAASTFATQREFADWSRGLEVMPRLAADTAGPGPGSEFGGGVRTATHLGSPKAPVRVWAAPLPPSAISLPSSALRTLRETVRSTASTKPDFVKGVKPLDSHELDLKKTLGEPLDVGMSLARVEPVRDTKVLPNADYVTWGDYAANCRTGPTGWYSKHPTAQAQDTGEHKYPWC 446 T 0.61 EF-hand_8 pdbhh F Eukaryota T 7n6g 14 XA,YA 2F,2G A8J870_CHLRE FLAGELLAR ASSOCIATED PROTEIN MAAKGKQQWDFLKADANTPASPAHYYEPLNAKKEGEFKPGWNTKRRGPAWEAERQAAIMTKEQKNIGCVALRSERLNNAQQQSGFNPIAHTERAADGSWVPATNAWMHQKVGVKQQDPRAAAADTLKHQAEGASRAAAIAEMRKERIAAGGASRPAAGGGVKDALTWG 168 T 0.17 Nop25 pdbpssm F Eukaryota T 7n6g 24 KB,LB,MB,NB 2S,2T,2U,2V A8JCZ9_CHLRE FLAGELLAR ASSOCIATED COILED-COIL PROTEIN MALTVEPLSPNLLHTDLLTIKHTSDPALLFPSHAYGRGGVGHFLCTGHSRAGIHRLPQAASKAASLSLTARGRASSSRASDYADGMDPAATGNGSSISGFSLISSDPAAIGATRAWEAGTVAHASNFVSACRSVRPTDVPRGSKWAAVSFFENDPKEVSRHMQVLDFIQAKHDMAKAAERQTEQARHNIWAEQQRETFRRQRLEQASRYTRSGIRPRSAYAELGAVAAAEQAHNGYGSGSVLGDSQDGSRFGGGGGGGRLGGIAPPSGDPASRYYGALGRDGTFGSRISGTGSAQGGGGSGSMSYYPGGKPRPSTALPAGSVYTGRGGPLTAAQAATANPFNATLSATAATAAAGGGGPIPLPKKTYIHVYDRMAAEAAETPAARAAAQAAAQAAAEDERRQAVLDGELAELDSFEARMRSLQRARSRAAHSERRRGSNAFAEDSDA 447 T 7.7 PDDEXK_1 pdbpssm F Eukaryota T 7n6g 29 DC 3E A0A2K3CZ11_CHLRE FAP92 MAPKAKGPDPAEAAAPVGPSQELHLVLKVRMRLKPPAPERPPQEPPAQPSTPPSAEPSAATGADTASAAKDAKGRAKSPPKKPVTPGSARKAAAAAAAAEAAAAAAAAAAAALAKPVDPAALVLTVKYTPLGGAEAVVGPVKPLKPAVAAAAAAAAAAAAAEAAAAAAAAEAAAASKSGTAAGGKKPPPAKPGASAAPSPPPPATPSPPLTPPPPPPAAASAAAPAGPEPYAEVCVEHHVKVVVDEAVVRSLAAANALLPVSVSLASPSDEEGADGKGAKKPTPAAGKSKAALAAAAAAPPTPVRSYGAVLMLDVSGLLVGDTSARAVWPDKAKGLPSALEEVAEGVEAELQLLSLPRPENPTTPAAGKGRPAAAATAAKPVSAKPGGKKGEAPPEPELKDLPGEPIGLLPPELITQLNPIVINIRKAKELPAAPATRAQLDNNCASPALFLRWPPGVPPREQPGLASPWQLPASAATSVTGLTHNGSSGGAEVLVLAGPAGAAMSRVPPSAGRRYLLFGQPEVFFAGDLPGGGEEALRLCRECPLLVEVHDRTAIPEPPEFPLEPLPAADGGAAGGAAGGQAAPPESEGYVCGLARVPMLDLARGYTRFRFHTSLAPHTTVRGAASLDWTKRPGNYAEAGSILKGEVRCACPLPSTRGADPSSRVFARALFIMDYRDSDLFHLLEDTVRKNNAWRLGLAEPQDKVSPDQLPPELKAMRAAGGGVPGVGAGVGGHHSHSHPAHDDDDDDGRERRPSTDERQRRGSASSGSSAQAAGGGGAGGGLGRRSSSMYSYTEPNSPTRTGGLVPDRGRPGGGGGVPPLPPAGVAGGLQRRESTSPGALAAMASGLGGVDGRRHVIWDSESEEDDEPDEPPTAAELALDALCVDIDRISLELRELQALSTAQLTPEQAADRQLDLLTGWHLVDGRERVIVVEGLAEGAMKIVKGISDWALEDPKPEEGWRRRSVLLSTSPSLRSAWRLYSPLGVDLWIVKLRAPLPKLLGEPGSFATGRVRPDCVQGIRRLGALRSVAAWSRQAHDLSLWPSPQQLQLVDKKFGGELLAADVLGVEANEPDSEDEDGGPRALGGLLDADARSAKSSKSNRSRRSSRSRRSGKSGRSGKSGKTGKSGKRRQRLRQSRVPPLDTHNDGYLALRRQARQRRLQRDWLSLNRDSLRELERRTSHIKDTWREWNPQRTAREQLDAAIRAGTIPPEELEALSTARKRSPAPLPGTVPRQDALARGWYPHPSPFKWPSPRVPADFRQLPARPTDFRVQQLEEPWEEGALHRGVDLRGGPGTAGVGSAKDEFLTAVRGDQTGLFGTDPEYWRTVHLGGAGREAELISTRRAEAEEWRRRLVVEDPVMRTVLPQGPPVPAQADRLKPLLKDEPSKKGFKVAAVPPAPLNSQLAYPWTDPSSAAAQAATVGRGRDDKTKFIDPGHEFRNVTGKPKTAVHKLSYNQSWSASTHDKYYNQ 1471 T 28 BLUF pdbhh F Eukaryota T 7n6g 30 EC,FC 3F,3G CFA99_CHLRE CILIA- AND FLAGELLA-ASSOCIATED PROTEIN 99 MESRPGSASSYHSHAAGNTQSRPGSATSVESSNRLSGKPLGAEKLMAACEKIYKSFNPAQVTLDTHVDNCIGQLSVHNSFDDSFIRQVVYGTVRYRRLLGALMDSFYHYNGGAASREDVDMYKLYSYLTIFRLEELNFSNFQRLVDAMTPQKMYVLLKYMFNPQYMREVVREDWLKLYDKEFVDEIIDRLLSWKSESKEMLGRLEYKVTLSRKEDDEKETLGTGYAAARSTTVPQPFNLTQPKPRPLPVEDPPPPPIRAKPAPRPREGPTKEEVALAAAREANRAAAERKAAKAAPFKLRVLERPTNIDKIREELEAERTRELTFKGIRAAPPPPVPNAQVRLNAAAILREDALYRRKQQEEADALKRYEAELRDASSFKAWQNAMLEQDEAARAASVERRRQEMAAAQENAIRARMAAQEANAELARAAKEEAKRIEEDLKRDREEQARLNALRRDAVVEARQNVQAAVEKMSQERRLAAEEERRKQQEDARARAEVAAREMAERRDIILQLKALEKVPKQRVKEFDPTETGPDHGLLETMSLVELRERLNVAKRRQREEEERQRAEILRQKQERESALLEKAANIQRVRRVAAAQAAQRRATSAETIQRKNTEVSKAREADVLQLADKLDAKRAALAAERARLAAEQKRTRFEQMQAAAGAAVVEETKFRELRAGAQREAKTRQENALASATVYEATKARQQNVRLKNVRQELKAKDDFMRAYDEKLAALRGQAGAESAADLARRTQMAQTQRAAEATVRNRTTTTAYRPYEGGSTSMQARLAALGQGMELED 795 T 0.038 DUF1948 pdbhh F Eukaryota T 7n6g 34 HD,ID,LC,MC,NC,OC,PC,QC,RC,SC,TC,UC,VC,WC,XC,YC 4A,4B,3M,3N,3O,3P,3Q,3R,3S,3T,3U,3V,3W,3X,3Y,3Z A8IVW2_CHLRE FLAGELLA ASSOCIATED PROTEIN MPSPKRSGLGSGRLIGGQASTTSLGSPGAGTSQFQINHENTLRKNRNHFQGQIEQYSIDYHSSHNKALAELPVLQEQHALEIEEYQNAIEETVTRHLGRIAQLRQDYSLLIKEASRRHMELQQRRFAGARVDIPQVAQQPIAGLPALSAQPQIVAPSAVPNGSLAASRSGNVSTSSEQAGSHAMGQVNPNAYLRTLNNDAAAAMSQYNTRPLPATPGGIAALSISTPASPLTARLATPSEATARNSDFARLEAMHEKYMGRTKSQHEQGIAARINEVSNWQQTFAACMAALAQHQSSALAHLSSCVEEAEAWHAQQRGALSAAHDEVMTEAERLSRQLEAAQTAAAAKLNGLLASFLERVLPSGEVGLAEAQASYTRSAANLRNEHVAALEAAEAALRALVPRHAAMRQHMSASYSSGLASHEAALAAAGGHYDSRGIPELRRQYQDAESRHRDTLAAIRAEHLKGLAGSRDGWMGEAAALLEEYRARMQELKQQYMLAYDVNLTEV 507 T 0.022 AAA_13 pdb F Eukaryota T 7n6g 35 JD 4C A0A2K3DQM4_CHLRE FAP81 MSSAHILSTFQSTFPGLYQAPKKGEDEPPPEAPAPEPVTQHDDEPDQYSTRIAGITSKFERMRASADEMEQYLRSAAEDAKEAEARALAKADEDFTPAWRNVGLPLKPSHLHLDHGAMAGARLVNPKAIVQEYQAIKGREVLNPPRIAEYADTSAKPNYLKSTHAMEERKMRTMSPDRTARIQALSARHLAWQTLTPEEVAAKMEEAEQRRRQLGLKMPRAQFELEQKIIQSMHHKLTFLRNPRHPLPPAVKTLMELRPDANRWVGPRTTVLEGVKPPIKADMTSRPDQVFVVEPAEVTFTNYAVGRAYEQVVRVRNVTAVSRSLRIFPPASQYFHASLPRFPGEVGVLAPGMAAEVTLRFCPDSLGDYEDAIAVDATHSRQTVPLRARRPPPSLTLPEEIDMGQVVIGNVKTEQVTFKNMGGAGRFRIVPEAHWPDFAMDAPTDRAVVGQFKIWPLYFEMAAGEQLGLNVSYEPTEWGNTEERLVLVCDNCQVKTFSLSGNAVGVDVLLHSVDGRMLEPRELDLPLWFGECAPGAGFSKTVSVRNTTKLPFAFEWGLTKFPQVQNRRRANEPLQTEAQYDEEQDDEGHVLLVDNKSLRGTSPLRLGTGGGGAAAAPPAVGGGAGGGAGGGAGGVSSSAAGVNGSVAQAPGGGVAGAKPPGPMKALAGAENAGTPWGVHCGNEAHGPDPLALGAVVEDLFRVVPRSGVLQPGEVMEFLVTFTPPGQARYERWAQLRVDRKPISVPSGASPMVRGSGSGTGRSHAAIAASATCDVLVAEVGLEGLGCPVQLSAAPRLVSLPGKLMPAEGTTRHVTLRNPTRAQVVVRATVDNPAIAVSPSEFRMPSLGAISLAVTVRAPPDAAPGPLSGRVLLEVEHGPPVPIEVRAAVGSSYARLITPRINFGDVPLSGSSEQRLVIRNMSATCPTPWSIRELTPALVAAEKSRLLRAQLLQSSRMLDPQAAAAAIDALVQEEEEAEAAAAAREEEEAVGYRGASVTGHHARFAEAQRSPAASTSGALVALPASAAERRAVFAAGRHTDSSLAQALPPPDTTHVTFEPSSGVLEPNQELTVRVTCHALTDGRHRSIIQLRSGAPHAGGGPDGGLHMECLEAFACVVTPACVVDRPVMDLGVTFVGVQVRQTLYLTNLSQLPVLYRWTAEAEDEGSQTAGLAELKIKPDHGELEPGEDVEIQVRYTPRYPGPCVMYGVCELEGAPEPLGFRVSSAIHGLDVTYDLLTQEQYDDYMAHDQAAAALGSTGPKGAAAMAVAAAAAGGANTGSGGYYDSESGEVAGAGGEGGGPSRAEEIAAALDKVNFMRVGRHGKAEVDVEAFSGLTHLPPDVLSRVASHRHASTSAAGSKAASRRASARPGAQARPRSGAAAGGGVAAWPPPTPQRHLVADFGHNVPLGETRQMYLVVTNRTAMHTSIRTWLERFGVADASRFVRGTESGAAPPGGGADKAGGAEGGAGKGGASRRQTKDEDAHHPQGPKLSRYSKYTPIKLAGTDAEHRAPFRADKGNEMMATRRLQEEADEALGNKGLAVSVTPPESTLEPWSRLVLTVSCFNDMCGAYMDMMHVKVGDLPARDIPVLVGVSGTPLVVQRERVLVRGLRARSWRTDLEWGQVPQGVEQTRTFYVFNTGSLDMHLAWEARRYHDYVDLERLPPTTDVGAGGTGQHDSLWGGAPGTLRDTRAGMKLFDVKLQPDERAGCVRLATERHADPTDDVPFRVEPEEQVIKGNTTAKFTVTFCASESRRHGGYLHGTQRVFSPESPLELRVWTAGENADRVGALLSGTFHPYAGAPPTPLQPLRVDLGAQAQACRLEPDGQTDLSWVVTSIQQPGSHAAFVRSVTLSNTAHCPQVFSLDVEGPWDMVAASPSVPQDPVAYRGTSTLLGPAAASGRLGTSAADGGLTFLPPGESVDVTLRFSPGKGDMEALPVRFAAMQAKQRVIETVNDYKNTGALCITFANGDSQSLPLVAEMLHPRLEVKPRKLDFKKVHLQSPKEMFVMLSNPTNVDAAWAVTVEGHKPRFPTLPGAGAAASAAAAKEAKEEAAAAAAAAGGGGSASAQPTPRSASGGNLAGEASAPDSRAASAVSGASRPATVDGGAAGAAAPAGGVPPPPKLPGAGGPGTLPGVTGIIAEARIGPYVVKPASGVLSGRGLGMPRSQRISITFAPTEAEAYEGELIFAVLRGKQCSVDVDGEGSIEETDETKGNLFVI 2215 T 0.0019 PapD-like pdbhh F Eukaryota T 7n6g 46 NF 5Z Unknown protein XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXWAMHVVFARXXXXXXXXXXXXXXXXVHAAVVMXXXXXXXXXXXVAAHAVAVVAVAAAVXXXXXXXXXMAAAAALLXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 181 T 410 HD_3 pdbhh F T 7n6g 52 JG,KG,LG 6O,6P,6Q Unknown protein XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGYSVYESXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 89 T 730 DUF1939 pdbhh F T 7n6j 2 B B PPB_ECOLI APASE RKQSTIALALLPLLFTPRR 19 T 6.7 ASTN_1_2_N pdbhh F Bacteria T 7n84 3 D,K a,l NU120_YEAST NUCLEAR PORE PROTEIN NUP120 MACLSRIDANLLQYYEKPEPNNTVDLYVSNNSNNNGLKEGDKSISTPVPQPYGSEYSNCLLLSNSEYICYHFSSRSTLLTFYPLSDAYHGKTINIHLPNASMNQRYTLTIQEVEQQLLVNVILKDGSFLTLQLPLSFLFSSANTLNGEWFHLQNPYDFTVRVPHFLFYVSPQFSVVFLEDGGLLGLKKVDGVHYEPLLFNDNSYLKSLTRFFSRSSKSDYDSVISCKLFHERYLIVLTQNCHLKIWDLTSFTLIQDYDMVSQSDSDPSHFRKVEAVGEYLSLYNNTLVTLLPLENGLFQMGTLLVDSSGILTYTFQNNIPTNLSASAIWSIVDLVLTRPLELNVEASYLNLIVLWKSGTASKLQILNVNDESFKNYEWIESVNKSLVDLQSEHDLDIVTKTGDVERGFCNLKSRYGTQIFERAQQILSENKIIMAHNEDEEYLANLETILRDVKTAFNEASSITLYGDEIILVNCFQPYNHSLYKLNTTVENWFYNMHSETDGSELFKYLRTLNGFASTLSNDVLRSISKKFLDIITGELPDSMTTVEKFTDIFKNCLENQFEITNLKILFDELNSFDIPVVLNDLINNQMKPGIFWKKDFISAIKFDGFTSIISLESLHQLLSIHYRITLQVLLTFVLFDLDTEIFGQHISTLLDLHYKQFLLLNLYRQDKCLLAEVLLKDSSEFSFGVKFFNYGQLIAYIDSLNSNVYNASITENSFFMTFFRSYIIENTSHKNIRFFLENVECPFYLRHNEVQEFMFAMTLFSCGNFDQSYEIFQLHDYPEAINDKLPTFLEDLKSENYHGDSIWKDLLCTFTVPYRHSAFYYQLSLLFDRNNSQEFALKCISKSAEYSLKEIQIEELQDFKEKQHIHYLNLLIHFRMFEEVLDVLRLGHECLSDTVRTNFLQLLLQEDIYSRDFFSTLLRLCNAHSDNGELYLRTVDIKIVDSILSQNLRSGDWECFKKLYCFRMLNKSERAAAEVLYQYILMQADLDVIRKRKCYLMVINVLSSFDSAYDQWILNGSKVVTLTDLRDELRGL 1037 T 0.086 TPR_1 pdbpssm F Eukaryota T 7n89 2 C,D C,D R1AB_SARS2 ACE-SER-ALA-VAL-LEU-GLN-SER-GLY-PHE-NH2 XSAVLQSGFX 10 T 14 GPAT_N pdbhh T Viruses T 7n9f 12 EA,FA u,v NUP82_YEAST NUCLEAR PORE PROTEIN NUP82 MSQSSRLSALPIFQASLSASQSPRYIFSSQNGTRIVFIQDNIIRWYNVLTDSLYHSLNFSRHLVLDDTFHVISSTSGDLLCLFNDNEIFVMEVPWGYSNVEDVSIQDAFQIFHYSIDEEEVGPKSSIKKVLFHPKSYRDSCIVVLKEDDTITMFDILNSQEKPIVLNKPNNSFGLDARVNDITDLEFSKDGLTLYCLNTTEGGDIFAFYPFLPSVLLLNEKDLNLILNKSLVMYESLDSTTDVIVKRNVIKQLQFVSKLHENWNSRFGKVDIQKEYRLAKVQGPFTINPFPGELYDYTATNIATILIDNGQNEIVCVSFDDGSLILLFKDLEMSMSWDVDNYVYNNSLVLIERVKLQREIKSLITLPEQLGKLYVISDNIIQQVNFMSWASTLSKCINESDLNPLAGLKFESKLEDIATIERIPNLAYINWNDQSNLALMSNKTLTFQNISSDMKPQSTAAETSISTEKSDTVGDGFKMSFTQPINEILILNDNFQKACISPCERIIPSADRQIPLKNEASENQLEIFTDISKEFLQRIVKAQTLGVSIHNRIHEQQFELTRQLQSTCKIISKDDDLRRKFEAQNKKWDAQLSRQSELMERFSKLSKKLSQIAESNKFKEKKISHGEMKWFKEIRNQILQFNSFVHSQKSLQQDLSYLKSELTRIEAETIKVDKKSQNEWDELRKMLEIDSKIIKECNEELLQVSQEFTTKTQ 713 T 2.4E-12 Nup88 pdbpercent F Eukaryota T 7n9h 2 B A TADBP_HUMAN TDP-43 KDNKRKMDETDASSAVKVKRAVQK 24 T 1.2E-05 DUF4523 unphh F Eukaryota T 7nac 3 C 7 LOC1_YEAST LOCALIZATION OF ASH1 MRNA PROTEIN 1 MAPKKPSKRQNLRREVAPEVFQDSQARNQLANVPHLTEKSAQRKPSKTKVKKEQSLARLYGAKKDKKGKYSEKDLNIPTLNRAIVPGVKIRRGKKGKKFIADNDTLTLNRLITTIGDKYDDIAESKLEKARRLEEIRELKRKEIERKEALKQDKLEEKKDEIKKKSSVARTIRRKNKRDMLKSEAKASESKTEGRKVKKVSFAQ 204 T 2 PIN_6 pdb F Eukaryota T 7naf 3 C 8 NOC2_YEAST Nucleolar complex protein 2 KVSKSTKKFQSKHLKHTLDQRRKEKIQKKRIQGRRGNKT 39 T 1.5 Hid1 unppssm F Eukaryota T 7nbv 1 A A POLG_TMEVG CAPSID PROTEIN VP1,CAPSID PROTEIN VP2,CAPSID PROTEIN VP3,CAPSID PROTEIN VP4,GENOME POLYPROTEIN,LEADER PROTEIN,P1A,P1B,P1C,P1D,PICORNAIN 3C,PROTEASE 3C,PROTEIN 2C,PROTEIN 3A,RNA-DIRECTED RNA POLYMERASE,VP4-VP2,VPG,VIRION PROTEIN 1,VIRION PROTEIN 2,VIRION PROTEIN 3,VIRION PROTEIN 4,PROTEIN 2B, 2A PROTEIN (DERIVED FROM GENOME POLYPROTEIN) GPLGSNPASLYRIDLFITFTDELITFDYKVHGRPVLTFRIPGFGLTPAGRMLVCMGEKPAHSPFTSSKSLYHVIFTSTCNSFSFTIYKGRYRSWKKPIHDELVDRGYTTFREFFKAVRGYHADYYKQRLIHDVEMNPG 138 T 0.11 SpoVAD pdbpssm T Viruses T 7ncr 1 A,B A,B F8VBM8_9VIRU PUTATIVE COAT PROTEIN MGDRVNAQDDDTVVPHQAPLQPAALQQDLTRSADYLLDNVRIGNHRQRYDKYRRYVLLRSSEIFTSLVAIYAHIFSSYWQHFRRFTDQFQAPTGVQLPTFVARVYISTWLHDLYCSIREATRSISPLAFNERYSYELLPYSTEYDPFLAFLSMSIKPTHIQHTPENTLWIPILCENYDWDRNEANHNPFGITNFTLNSNLFYGLLAILKERKEFKLSTLTTNTIGRPCWLFDWHDNVQVCAWFPREANFNSQDVTAAYIIGVACTPKLGPSDDDAWKYYASLNSVPTFTPTEPRLTNRRSYGAYEVRTRETENNYFLPDSLLNIIEDFTATGTTQRRKIRRPSATSASTGAAIIIRDTPGTASTATTSTTETEVTFPPVIRTKIRDWYYHSRVILELEDNSRTAALRMFIIA 412 T 13 Peptidase_C62 pdbhh T Viruses T 7nff 1 A,B A,B CC-Type2-(LaId)4-I24A XGEIAQALKEIAKALKEIAWALKEAAQALKGX 32 T 0.002 WXG100 pdbpssm F T 7nfg 1 A,B,C,D,E,F A,B,C,D,E,F CC-Type2-(LaId)4-L14A XGEIAQALKEIAKAAKEIAWALKEIAQALKGX 32 T 0.015 MCPsignal pdbpssm F T 7nfh 1 A,B,C,D,E,F,G A,B,C,D,E,F,G CC-Type2-(MaId)4 XGEIAQAMKEIAKAMKEIAWAMKEIAQAMKGX 32 T 0.0046 WXG100 pdbpssm F T 7nfi 1 A,B A,B CC-Type2-(LaId)4-L7Y XGEIAQAYKEIAKALKEIAWALKEIAQALKGX 32 T 0.0046 WXG100 pdbpssm F T 7nfj 1 A,B,C,D,E,F,G A,B,C,D,E,F,G CC-Type2-(LaId)4-L28Y XGEIAQALKEIAKALKEIAWALKEIAQAYKGX 32 T 0.0009 WXG100 pdbpssm F T 7nfk 1 A,B A,B CC-Type2-(LaId)4-I24S XGEIAQALKEIAKALKEIAWALKESAQALKGX 32 T 0.0037 WXG100 pdbpssm F T 7nfl 1 A,B A,B CC-Type2-(LaId)4-I24N XGEIAQALKEIAKALKEIAWALKENAQALKGX 32 T 0.0048 WXG100 pdbpssm F T 7nfm 1 A,B,C,D,E,F,G,H,I,J,K,L,M,N,O,P A,B,C,D,E,F,G,H,I,J,K,L,M,N,O,P CC-Type2-(LaId)4-L21K XGEIAQALKEIAKALKEIAWAKKEIAQALKGX 32 T 0.042 MCPsignal pdbpssm F T 7nfn 1 A,B,C,D,E,F,G,H,I,J,K,L,M,N A,B,C,D,E,F,G,H,I,J,K,L,M,N CC-Type2-(LaId)4-L21N-I24N XGEIAKALREIAKALREIAWANRENAKALRGX 32 T 0.098 Ada3 pdbpssm F T 7nfo 1 A,B,C A,B,C CC-Type2-(LaId)4-I17C XGEIAQALKEIAKALKECAWALKEIAQALKGX 32 T 0.026 WXG100 pdbpssm F T 7nfp 1 A,B,C,D,E,F,G A,B,C,D,E,F,G CC-Type2-(LaId)4-I17K XGEIAKALREIAKALREKAXALREIAKALRG 31 T 0.13 HTH_38 pdbpercent F T 7ng2 1 A A S8F6K2_TOXGM CPSF4 GDPFGHVASPQSTKRFFIIKSNRMSNIYTSIQHGVWATSKGNSRKLSNAFTSTDHVLLLFSANESGGFQGFGRMMSLPDPQLFPGIWGPVQLRLGSNFRVMWLKQCKIEFEELGKVTNPWNDDLPLRKSRDGTEVPPALGSLLCTWMSQRPSEDLLAGTGIDPATR 166 T 5.6E-31 YTH pdbhh F Eukaryota T 7ng8 2 D D Klebicin C activity MADNQPVPLTPAPPGMVSLGVNENGEEEMTVIGGDGSGTGFSGNEAPIIPGSGSLQADLGKKSLTRLQAESSAAIHATAKWTTENLAKTQAAQAERAKAAMLSQQAAKAKQAKLTQHLKDVVDRALQNNKTRPTVIDLAHQNNQQMAAMAEFIGRQKAIEEARKKAEREAKRAEEAYQAALRAQEEEQRKQAEIERKLQEARKQEAAAKAKAEADRIAAEKAEAEARAKAEAERRKAEEARKALFAKAGIKDTPGCLEHHHHHH 264 T 0.043 DUF2612 pdb F T 7nht 15 O,P c,d AKIR2_HUMAN Akirin-2 MACGATLKRTLDFDPLLSPASPKRRRCAPLSAPTSAAASPLSAAAATAASFSAAAASPQKYLRMEPSPFGDVSSRLTTEQILYNIKQEYKRMQKRRHLETSFQQTDPCCTSDAQPHAFLLSGPASPGTSSAASSPLKKEQPLFTLRQVGMICERLLKEREEKVREEYEEILNTKLAEQYDAFVKFTHDQIMRRYGEQPASYVS 203 T 4.2 ATP-synt_E_2 pdbhh F Eukaryota T 7nix 2 B B TBCD4_HUMAN AKT SUBSTRATE OF 160 KDA,AS160 RRRAHTFSHPP 11 T 9.3 THEG4 pdbhh F Eukaryota T 7nkv 1 A A Q8XAD6_ECO57 Phage repressor protein CI MQKKEIRRLRLKEWFKDKTLPPKEKSYLSQLMSGRASFGEKAARRIEQTYGMPEGYLDAEYAEQPGSSHHHHHH 74 T 0.0032 HTH_3 unppssm F Bacteria T 7nlj 2 B B APikL2A MQNEYLDAKKHGIDLSRERAPNFVDHPGIPPSDCFWFLYKNYVRQDAGVCQSDWSFDMKIGQYWVTIHTDEGCRLSGIIPAGWLILGIKRLGF 93 T 2.9 OPA1_C pdbhh F T 7nma 2 B P AMOT_HUMAN ANGIOMOTIN GHVRSLSERLMQM 13 T 2.8E-05 Macoilin unphh F Eukaryota T 7nmm 2 I,J,K,L,M,N,O,P I,J,K,L,M,N,O,P APikL2F GPMQNEYIDAKKHGIDLSRERAPNFVDHPGIPPSDCFWFLYKNYVRQNAGVCQSDWSFDMKIGQYWVTIHTDEGCRLSGIIPAGWLILGMKRPGF 95 T 2.7 OPA1_C pdbhh F T 7nn3 1 A,B,C,D A,B,C,D E4S6E9_CALKI Beta-xylanase MGSSHHHHHHSSENLYFQGHIETLPDSFTFYDGTKVQRLSDWPKRAQELKDLYQFYMYGYKPDTSVEDVTYSVNGNTLTITVKVGDKQASFNATVRLPQANSGYQPPYPVIISLGYLAGFNWQTWQFIDYSTNAVNRGYAVISFMPNDVARDDSSYTGAFYTLYPHSNKVENDTGVLMAWAWGASKILDALEKGAIPEIDAKKAIVTGFSRYGKAALVAGAFDERFAVVNPHASGQGGAASFRYSFAGKQYSWGVAGNAEAFSNLQGNTEGHWFNAVFREFKDPRQLPFDQHELIALCAPRTVLITGGYSDWGTNPEGTWVSFVGARKVYEFLGVADRIGFALRDGSHAITEEDVNNLLDFCDWQLRGIQPTKDFSTSRFAIDPAWDTISVPTLYRNAD 399 T 0.00012 AXE1 pdbpercent F Bacteria T 7nn6 1 A A TOXR_VIBCH ToxR MRGSHHHHHHGSPSQTSFKPLTVVDGVAVNMPNNHPDLSNWLPSIELCVKKYNEKHTGGLKPIEVIATGGQNNQLTLNYIHSPEVSGENITLRIVANPNDAIKVCE 106 T 0.051 Oest_recep pdb F Bacteria T 7nna 1 A A Q5V9K0_KLEPN KLEBC TOL BINDING DOMAIN MSGSLQADLGKKSLTRLQAESSAAIHATAKWTTENLAKTQAAQAERAKAAMLSQQAAKAKQAKLTQHLKDVVDRALQNNKTRPTVIDLAHQNNQQMAAMAEFIGRQKAIEEARKKAEREAKRAEEAYQAALRAQEEEQRKQAEIERKLQEARKQEAAAKAKAEADRIAAEKAEAEARAKAEAERRKAEEARKALFAKAGIKDTPLEHHHHHH 212 T 0.027 DUF2612 pdb F Bacteria T 7np2 2 B P AMOT_HUMAN ANGIOMOTIN GHVRSLSERLMQMSLATSGV 20 T 2.8E-05 Macoilin unphh F Eukaryota T 7nqc 2 B B RALA_HUMAN PRO-ASN-GLY-LYS-LYS-LYS-ARG-LYS-SER-LEU-ALA-LYS-ARG-ILE-ARG-GLU-ARG-CMF PNGKKKRKSLAKRIRERC 18 T 12 Protamine_3 pdbhh F Eukaryota T 7nqd 1 A,B A,B H2J4R1_MARPK TPR_REGION domain-containing protein KAPSQNAIKRFMTLFSGREDVFSIQYEGGYRPIRRPLNFQDIKNHFSGKKTLGIYLLKKNDTVKFAAYDIDIKKHYLNREDKFVYEENSKKVAKRLSRELNLENITHYFEFTGNRGYHIWIFFDIPVSAYKIKYIMEKILDRIELEEGIDVEIFPKQTSLNGGLGNLIKVPLGVHKKTGKKCLFVDNDFNVIENQIEFLNNIKENKATEINKLFREIFNEND 222 T 0.024 DNA_primase_S pdbpssm F Bacteria T 7nqf 1 A,B A,B H2J4R1_MARPK TPR_REGION domain-containing protein GPSQNAIKRFMTLFSGREDVFSIQYEGGYRPIRRPLNFQDIKNHFSGKKTLGIYLLKKNDTVKFAAYDIDIKKHYLNREDKFVYEENSKKVAKRLSRELNLENITHYFEFTGNRGYHIWIFFDIPVSAYKIKYIMEKILDRIELEEGIDVEIFPKQTSLNGGLGNLIKVPLGVHKKTGKKCLFVDNDFNVIENQIEFLNNIKENKATEINKLFREIFNE 219 T 0.019 DUF1882 pdbhh F Bacteria T 7nrc 86 HC A GCN1 AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAGDYLGILMEKLLNPTVASSMRKGAAWGIAGLVKGYGISALSEFDIIRNLIEAAEDKKEPKRRESVGFCFQYLSESLGKFFEPYVIEILPNILKNLGDAVPEVRDATARATKAIMAHTTGYGVKKLIPVAVSNLDEIAWRTKRGSVQLLGNMAYLDPTQLSASVSTIVPEIVGVLNDSHKEVRKAADESLKRFGEVIRNAAIQKLVPVLLQAIGDPTKYTEEALDSLIQTQFVHYIDGPSLALIIHIIHRGMHDRSANIKRKACKIVGNMAILVDTKDLIPYLQQLLDEVEIAMVDPVPNTRATAARALGALVERLAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 1209 T 0.00065 CLASP_N pdbpssm F T 7nre 2 B C Darobactin WNXSKSF 7 T 5.7 TMP pdbhh F T 7nrn 1 A A GIPC1_MOUSE GAIP C-TERMINUS-INTERACTING PROTEIN,RGS-GAIP-INTERACTING PROTEIN,RGS19-INTERACTING PROTEIN 1,SEMAF CYTOPLASMIC DOMAIN-ASSOCIATED PROTEIN 1,SEMCAP-1,SYNECTIN GAMGDLPSAFEEKAIEKVDDLLESYMGIRDTELAATMVELGKDKRNPDELAEALDERLGDFAFPDEFVFDVWGAIGDAKVGRY 83 T 0.0092 PWI unphh F Eukaryota T 7ns0 1 A,B,C A1,B2,C3 A0A0B6VL42_9VIRU Capsid protein VP2 MVRRRGGKTAGSKRPKMSSKNFGANRKRDFRRPARKSKAKKARSMAPAKTVRKSTTAGAHSKHFSVIGNPFSKATQQPQIPDGRMLESLPRRCQLVTEIRNNVTVGSNPTYILVAPSLGLAFQAYQDTNVPGGLDSSVYGLQNRGCTVRANLSATSIENYNDIAKWRIVSQGINLKLNNVEDENDGWYEACRFQHDWTPDELCLRSTENDASTISQDEDLVMGVISSSFMNGALNTIGNNMVEQRGYESGLLKNIHKRMFQLHNNTSAIRPKTLQGQFNYGSEITFSGTESEARFTDVPSNRQLVDSLWHNDYDCILIKLYPRENTGAAGQTGSALIVNAIQNLELQYSPTSDLSTYHIANKRARMVEAKLDKKNNTDAAGEPFVPGSSR 390 T 0.082 Peptidase_A6 pdbhh T Viruses T 7ns3 1 A 5 VID28_YEAST GLUCOSE-INDUCED DEGRADATION PROTEIN 5 MTVAYSLENLKKISNSLVGDQLAKVDYFLAPKCQIFQCLLSIEQSDGVELKNAKLDLLYTLLHLEPQQRDIVGTYYFDIVSAIYKSMSLASSFTKNNSSTNYKYIKLLNLCAGVYPNCGFPDLQYLQNGFIQLVNHKFLRSKCKIDEVVTIIELLKLFLLVDEKNCSDFNKSKFMEEEREVTETSHYQDFKMAESLEHIIVKISSKYLDQISLKYIVRLKVSRPASPSSVKNDPFDNKGVDCTRAIPKKINISNMYDSSLLSLALLLYLRYHYMIPGDRKLRNDATFKMFVLGLLKSNDVNIRCVALKFLLQPYFTEDKKWEDTRTLEKILPYLVKSFNYDPLPWWFDPFDMLDSLIVLYNEITPMNNPVLTTLAHTNVIFCILSRFAQCLSLPQHNEATLKTTTKFIKICASFAASDEKYRLLLLNDTLLLNHLEYGLESHITLIQDFISLKDEIKETTTESHSMCLPPIYDHDFVAAWLLLLKSFSRSVSALRTTLKRNKIAQLLLQILSKTYTLTKECYFAGQDFMKPEIMIMGITLGSICNFVVEFSNLQSFMLRNGIIDIIEKMLTDPLFNSKKAWDDNEDERRIALQGIPVHEVKANSLWVLRHLMYNCQNEEKFQLLAKIPMNLILDFINDPCWAVQAQCFQLLRNLTCNSRKIVNILLEKFKDVEYKIDPQTGNKISIGSTYLFEFLAKKMRLLNPLDTQQKKAMEGILYIIVNLAAVNENKKQLVIEQDEILNIMSEILVETTTDSSSNGNDSNLKLACLWVLNNLLWNSSVSHYTQYAIENGLEPGHSPSDSENPQSTVTIGYNESVAGGYSRGKYYDEPDGDDSSSNANDDEDDDNDEGDDEGDEFVRTPAAKGSTSNVQVTRATVERCRKLVEVGLYDLVRKNITDESLSVREKARTLLYHMDLLLKVK 921 T 0.0017 HEAT_2 pdbpercent F Eukaryota T 7nso 33 GA 7 ermDL MTHSMRL 7 T 0.0092 Ery_res_leader2 pdb F T 7nus 2 D,E,F D,E,F p53/MDM2 macrocyclic peptide inhibitor FSDXSSVPNXXRNXX 15 T 2.8 DUF1244 pdbhh F T 7nuv 1 A,B A,B E9RJ22_BACNA Aux2pLS20 GPMAKVKKHLTFSGPTESPYGIAYIEKEMKAKNCSKMNETIELIFAEHDEMKARLSEQDALVEKIFQRFKKTLDVIRVRAGHTDKNAQINLELWNAFLMANPLPVTVLTDQHTSESVSMAKEKVSNDIATFKQRKDEQKAKQEMQKGEK 149 T 0.0015 DUF1433 unppercent F Bacteria T 7nw1 2 C,D CCC,FFF UBA5_HUMAN UBIQUITIN-ACTIVATING ENZYME 5,THIFP1,UFM1-ACTIVATING ENZYME,UBIQUITIN-ACTIVATING ENZYME E1 DOMAIN-CONTAINING PROTEIN 1,UBA5 DSGESLEDLMAKMKNM 16 T 0.39 DUF5786 pdbhh F Eukaryota T 7nx5 1 A,B,E,F A,B,E,F BZLF1_EBVB9 EB1,ZEBRA MLEIKRYKNRVASRKSRAKFKQLLQHYREVAAAKSSENDRLRLLLKQMCPSLDVDSIIPRTPD 63 T 0.0012 bZIP_2 pdb T Viruses T 7nyi 1 A A BS222_STAPS Bacteriocin BacSp222 MAGLLRFLLSKGRALYNWAXSHVGKVWEWLKSGATYEQIKEWIENALGWR 50 F F Bacteria T 7nyj 1 A A A0A7M7KVA6_VARDE Odorant Binding Protein 1 from Varoa destructor, form P3<2>21 APQAPASATPAKVPVIEWGKCEQLKPSESERTSKAAVVDKCLQSLPLPDPEKATQQEIDKHRESVTTCALKAEGWFDDEGVYKFDRARNEIKNKKLDSEVEEAVLLKHDACQKEATEKHDDYINQVQLYQACMDYNISQICGIKVMV 147 T 0.0003 PBP_GOBP pdbpssm F Eukaryota T 7nzf 3 C CCC mutant human collagen type II,259-273 AGFAGEQGPAGEP 13 T 2.8 FokI_C pdbhh F T 7nzh 3 E,F EEE,FFF citrullinated cartilage intermediate layer protein (CILP) peptide 982-996 GKLYGIXDVXSTRD 14 T 4.7 DUF6489 pdbhh F T 7o07 2 B P YAP1_HUMAN Transcriptional coactivator YAP1 XRAHSSPAXLQX 12 T 0.00014 FAM181 unp F Eukaryota T 7o0u 3 WA C A0A143BHR6_9BACT MULTIHEME_CYTC domain-containing protein MVPVSLLTLGACGDAATDTVQVGYRGTAMEQNYDHGDLKTKFAQVKLPQSPPPAGESPPGPLPWKNVQVLNDISIAEFNRTMIAMSTWVAGTGNCAYCHNVAAFQDDTLPNGKPLYTKIVARRMLQMTRNINGNYSQHVKNTGVTCYTCHMGKPLPNGLWFYSSQTDYLRHYLDRDGARVITQGVAPSNANRSSTKQAEWTYALMISQSRSLGVNCTYCHNTRQFASWREAPPARVTAYHGILMLRDVNQNYLAPLQPVYPAVRLGAMGDAPKAQCVTCHNGAYKPLYGAQMAKDFPAMWGRADWNGVPFPGIMRVAADSTKTDSTVVAAPAAAPAQRTSARPGSVTTPVGGVN 354 T 2.7999999999999997E-68 CytoC_RC pdbpercent F Bacteria T 7o0u 4 XA C1 RC-S MPASPSPLPRSSRVRNAAVVVALVAVGLAARGRDAQGTQPPVAPPAAPTATAAPDLAVQDSTKADSTAVADTLMDLSMVMAAEAAAATVTTAPVAVAPTAWPVDPTTGQTLINGRPVVGRVFIMRKTDGTVKYPNVADVVAHEALAPLPPVVGSSYQQAPITNQRRMRGIMIQSTLWDMDRKRSATRQRYYPASTPANQLGQ 202 T 0.42 DUF126 pdbhh F T 7o0w 5 YA C2 A0A143BK87_9BACT RC-U MNMHSSDATVSIPDDIDLILVDSVPVNDGIWAWYGIDDDRPMAAWSRFHATRCVEQLAINRARVGAAEWALADVQARGIVPCIAKAAAHLARARAELADWEAQGHRLEAARKVTPGAWTTPVIES 125 T 8 DUF5563 pdbhh F Bacteria T 7o1f 2 G,H,I,J J,K,M,O G0RZ52_CHATD Peptide fragment from PolD4 KHQSTLNFKHRVTKP 15 T 0.068 DUF4643 unppercent F Eukaryota T 7o3j 3 AA,C,DA,F,GA,I,JA,L,MA,O,PA,R,U,X a,C,d,F,g,I,j,L,m,O,p,R,U,X O50334_ECOLX TrwH protein MKTIIFAILMTGLLSACASAPKPKQPSDFNREPVNKTVPVEIQRGAL 47 T 0.0089 LPAM_1 pdbpssm F Bacteria T 7o4i 24 X Q T2FA_YEAST TFIIF-ALPHA,TFIIF LARGE SUBUNIT,TRANSCRIPTION FACTOR G 105 KDA SUBUNIT,P105 GPGMSRRNPPGSRNGGGPTNASPFIKRDRMRRNFLRMRMGQNGSNSSSPGVPNGDNSRGSLVKKDDPEYAEEREKMLLQIGVEADAGRSNVKVKDEDPNEYNEFPLRAIPKEDLENMRTHLLKFQSKKKINPVTDFHLPVRLHRKDTRNLQFQLTRAEIVQRQKEISEYKKKAEQERSTPNSGGMNKSGTVSLNNTVKDGSQTPTVDSVTKDNTANGVNSSIPTVTGSSVPPASPTTVSAVESNGLSNGSTSAANGLDGNASTANLANGRPLVTKLEDAGPAEDPTKVGMVKYDGKEVTNEPEFEEGTMDPLADVAPDGGGRAKRGNLRRKTRQLKVLDENAKKLRFEEFYPWVMEDFDGYNTWVGSYEAGNSDSYVLLSVEDDGSFTMIPADKVYKFTARNKYATLTIDEAEKRMDKKSGEVPRWLMKHLDNIGTTTTRYDRTRRKLKAVADQQAMDEDDRDDNSEVELDYDEEFADDEEAPIIDGNEQENKESEQRIKKEMLQANAMGLRDEEAPSENEEDELFGEKKIDEDGERIKKALQKTELAALYSSDENEINPYLSESDIENKENESPVKKEEDSDTLSKSKRSSPKKQQKKATNAHVHKEPTLRVKSIKNCVIILKGDKKILKSFPEGEWNPQTTKAVDSSNNASNTVPSPIKQEEGLNSTVAEREETPAPTITEKDIIEAIGDGKVNIKEFGKFIRRKYPGAENKKLMFAIVKKLCRKVGNDHMELKKE 738 T 9.4E-36 TFIIF_alpha pdbhh F Eukaryota T 7o54 2 B B CYNT_SYNE7 CARBONATE DEHYDRATASE GWLAPEQQQRIYRGNAS 17 T 0.42 NPA pdbhh F Bacteria T 7o5b 5 E h MifM-stalling construct GPGTMFVESINDVLFLVDFFTIILPALTAIGIAFLLRECRAGEQWKSKRTDGPGKPIPNPLLGLDSTDFLIIIYHRITTWIRKVFRMNSPVNDEED 96 T 0.03 DUF4231 pdbpercent F T 7o5y 1 A,B,C,D B,C,D,A A0A0B7GNW3_STRSA Type IV pilus biogenesis protein PilA MDHHHHHHDTGQSQTQRMYNYLKAKYTATSGTQLAWGAYLDPVDGNPSSVYAEFDERAHNVDPSTEPIKSTHTFKDGSVAEIEMNGQLVDGLTGPENYNITIKSKSKLAGSNDYYEHIVTFNFDTKGIRSEEGHLRSAQK 140 T 0.0049 DUF3377 unppercent F Bacteria T 7o6n 2 C,D C,D PID3_CAEEL PIRNA BIOGENESIS AND CHROMOSOME SEGREGATION PROTEIN 1,PIRNA-INDUCED SILENCING DEFECTIVE PROTEIN 3 GPDSMWTFDKVLFNSEDIKDSVFKVLHAEEEPRGADQEN 39 T 0.031 Nup35_RRM unphh F Eukaryota T 7o6t 1 A A VIN3_ARATH Protein VERNALIZATION INSENSITIVE 3 KDLGHIVKTIRCLEEEGHIDKSFREDFLTWYSLRATHREVRVVKDFVETFMEDLSSLGQQLVDTFSESILSKK 73 T 3 DUF6429 pdbhh F Eukaryota T 7o6u 1 A A VIN3_ARATH Protein VERNALIZATION INSENSITIVE 3 DLGHIVKTIRCLEEEGHIDKSFAEDFLTWYSLRATHREVRVVKIFVETFMEDLSSLGQQLVDTFSESILS 70 T 2.7 DUF6429 pdbhh F Eukaryota T 7o6v 1 A,B,C,D A,B,C,D VIL2_ARATH VERNALIZATION5/VIN3-LIKE PROTEIN 1 GGTESGLEHCVKIIRQLECSGHIDKNFAQDFLTWYSLRATSQEIRVVKDFIDTFIDDPMALAEQLIDTFDDRVSIKR 77 T 0.23 DUF6184 pdb F Eukaryota T 7o6w 1 A,B A,B VIL2_ARATH VERNALIZATION5/VIN3-LIKE PROTEIN 1 GLEHCVKIIRQLECSGHIDKNFRQKFLTWYSLRATSQEIRVVKDFIDTFIDDPMALAEQLIDTFDDRVS 69 T 2.9 DUF6429 pdbhh F Eukaryota T 7o9o 1 A A AWP3b MGSSHHHHHHSSGLVPRGSHMASMTGGQQMGRGSDQTVRSVAGDQRVTDPVIVGDNSILDYYGGSNYDFSNNFEIGRGTLYIGKESYFSSFQSAPTDVPNSFHLLIKNTNNLQNNGQFIIENIKRHANQCSNSSIQVFPINFQNDGEFEIISGGVEGRCCLPTSVIAPQNFLNNGKFYYKVLTDTGSIYSGSCMQNVDIGASTTTTVNNNLWEFTGSINAQINGAVSGAAQINLDGSNMFVNANTFSGQVVNLINGGSFLQTSDPLSNIVVINGLGTSDTGVTSIAVKGKGKSFTYNPSSGIVKLTTVEGKTYAYQIGCGYNTKKFITNNDSGASYESADNFFVLTYSEPYSPQTCQLEN 360 T 2.9 Put_Phosphatase pdbpercent F T 7o9q 1 A,B A,B Q6FPN0_CANGA Awp1A MGSSHHHHHHSSGLVPRGSHMASMTGGQQMGRGSLDILTPTTLTGDQTFNEDVSVVSSLTLNDGSQYLFNNLLQIAPSSASVTANALAAVSVFTFSLPPSSSLSNSGTLIISNSNTGPSTEQHIVITPNVMANTGTITLSLAHTNTDSSSTLIIDPVTFYNTGTINYESIGSETNDPSLTGNILSIGSSGRTLQNLGTINLNAANSYYLLGTITENSGSINVQKGFLYVNALDFIGNTINLSTTTALAFISPVSQVVRVRGVFFGNIIASVGSSGTFSYNTQTGILTVTTNGVYSYDIGCGYNPALMSGQQETLSFQGNLYDTFLVLVNQPIPSDLTCAAV 341 T 1.8 Cadherin_4 pdbpssm F Eukaryota T 7o9t 1 A A MEN1_HUMAN Menin AMGLKAAQKTLFPLRSIDDVVRLFAAELGREEPDLVLLSLVLGFVEHFLAVNRVGLTYFPVADLSIIAALYARFTAQIRGAVDLSLYPREGGVSSRELVKKVSDVIWNSLSRSYFKDRAHIQSLFSFITGTKLDSSGVAFAVVGACQALGLRDVHLALSEDHAWVVFGPNGEQTAEVTWHGKGNEDRRGQTVNAGVAERSWLYLKGSYMRCDRKMEVAFMVCAINPSIDLHTDSLELLQLQQKLLWLLYDLGHLERYPMALGNLADLEELEPTPGRPDPLTLYHKGIASAKTYYRDEHIYPYMYLAGYHCRNRNVREALQAWADTATVIQDYNYCREDEEIYKEFFEVANDVIPNLLKEAASLLEAGQSQGSALQDPECFAHLLRFYDGICKWEEGSPTPVLHVGWATFLVQSLGRFEGQVRQKVRIVSGTVAGTARGPEGGSTAQVPAPAASPPPEGPVLTFQSEKMKGMKELLVATKINSSAIKLQLTAQSQ 494 T 1.8E-23 Menin unp F Eukaryota T 7oa7 1 A A A0A0B7GRV8_STRSA TYPE IV PILUS PROTEIN MDHHHHHHMKVTIPSGKRYYYAGMGITTPGGKVDIADSKKKSKTRIYTESGWFLSDRAIGQGVSGIVPVGTIGQKGDGTISQTLFPEMPTDFKQLSKLETGIHITDDMRGKYLTFAARAINSYGRVGNYQEADRIWIMGLPVTQNVRLHTDADLALLKNGNTTSLIPTDNQLHTNTEVRDYFNDVVYGATIPVLNYKEPAINQTRQLIALDGRTMQFSNHNFNNGYTTSVLIGNRQQTGPLLTYKLDDTLTWGINLENDGRIAIKTVDTTTANNGGQEYIQNVKLDYSNDNSIQVRSAAKNGSLGIEIFINGQSVYNKTVSLTRNRTTHNISSGQIIFGGNTYINEFAVYTESLNNSNIQKLAEYFRDKYKAS 373 T 0.11 SlpA unppercent F Bacteria T 7oa9 1 A A MEN1_HUMAN Isoform 2 of Menin AMGLKAAQKTLFPLRSIDDVVRLFAAELGREEPDLVLLSLVLGFVEHFLAVNRVGLTYFPVADLSIIAALYARFTAQIRGAVDLSLYPREGGVSSRELVKKVSDVIWNSLSRSYFKDRAHIQSLFSFITGTKLDSSGVAFAVVGACQALGLRDVHLALSEDHAWVVFGPNGEQTAEVTWHGKGNEDRRGQTVNAGVAERSWLYLKGSYMRCDRKMEVAFMVCAINPSIDLHTDSLELLQLQQKLLWLLYDLGHLERYPMALGNLADLEELEPTPGRPDPLTLYHKGIASAKTYYRDEHIYPYMYLAGYHCRNRNVREALQAWADTATVIQDYNYCREDEEIYKEFFEVANDVIPNLLKEAASLLEAGEERPGEQSQGTQSQGSALQDPECFAHLLRFYDGICKWEEGSPTPVLHVGWATFLVQSLGRFEGQVRQKVRIVSGTVAGTARGPEGGSTAQVPAPAASPPPEGPVLTFQSEKMKGMKELLVATKINSSAIKLQLTAQSQ 505 T 2.1999999999999996E-24 Menin unppssm F Eukaryota T 7ob2 1 A A RiLK1 RLKWVRIWRR 10 T 7 SNX17_FERM_C pdbhh F T 7ob5 2 B P LDB1_HUMAN LIM DOMAIN-BINDING PROTEIN 1,LDB-1,CARBOXYL-TERMINAL LIM DOMAIN-BINDING PROTEIN 2,CLIM-2,LIM DOMAIN-BINDING FACTOR CLIM2,HLDB1,NUCLEAR LIM INTERACTOR KSENPTSQASQ 11 T 13 DUF999 pdbhh F Eukaryota T 7ob6 1 A,B A,B CPR-C4 TMITHHHHHHGSMHYKAQLQKLLTTEEKKILARLSTPQKIQDFLDTIKNKDLAEGEHTMWSPRAVLKHKHAHCMEGAMLAALALAYHGHSPLLMDLQTTDEDEDHVVALFKIDGHWGAISKTNHPVLRYRDPIYKSVRELAMSYFHEYFIWWTKKNGGKKTLRAYSNPFDLTRYKPERWVIATGDLDWLAEALDDSKHFPILNKKMQKQLRPASRIETKAASLSEWPKRKTNS 233 T 0.00031 DUF553 pdb F T 7obc 2 B B RND3_HUMAN PROTEIN MEMB,RHO FAMILY GTPASE 3,RHO-RELATED GTP-BINDING PROTEIN RHO8,RND3 TDLRKDKAKSC 11 T 34 ALC pdbhh F Eukaryota T 7obk 2 B B E2AK2_HUMAN INTERFERON-INDUCED, DOUBLE-STRANDED RNA-ACTIVATED PROTEIN KINASE,EUKARYOTIC TRANSLATION INITIATION FACTOR 2-ALPHA KINASE 2,EIF-2A PROTEIN KINASE 2,INTERFERON-INDUCIBLE RNA-DEPENDENT PROTEIN KINASE,P1/EIF-2A PROTEIN KINASE,PROTEIN KINASE RNA-ACTIVATED,PKR,PROTEIN KINASE R,TYROSINE-PROTEIN KINASE EIF2AK2,P68 KINASE KSPEKNERHTC 11 T 1.3 VEK-30 pdbhh F Eukaryota T 7obr 48 VA s DAP2_YEAST DPAP B,YSCV LDKLIRVGIILVLLIWGTVLLLKSIPHHSNTPDYQEPNSNYTNDGKLKVSFSVVRNNTFHPKYHELH 67 T 0.00011 DPPIV_rep unppercent F Eukaryota T 7obs 2 B B RIPK2_HUMAN CARD-CONTAINING INTERLEUKIN-1 BETA-CONVERTING ENZYME-ASSOCIATED KINASE,CARD-CONTAINING IL-1 BETA ICE-KINASE,RIP-LIKE-INTERACTING CLARP KINASE,RECEPTOR-INTERACTING PROTEIN 2,RIP-2,TYROSINE-PROTEIN KINASE RIPK2 PSLNLLQNKSM 11 T 16 FtsK_alpha pdbhh F Eukaryota T 7obx 2 B B SSBP4_HUMAN SINGLE-STRANDED DNA-BINDING PROTEIN 4 ESYSPGMTMSV 11 T 23 SCAB-PH pdbhh F Eukaryota T 7oc4 1 A,B A,B ASR6_SARSH XENOVULENE A BIOSYNTHESIS CLUSTER PROTEIN R6 GAMPVTTPTKMATLTTKQMWQTIKDYFGDGFVTGSAPISYNVHTCDMQLQPDSGIHAASDGIHYGVQISEDSMPLFSIMGDTAAPPCTCHRVDEIVKHIDEFLERAPEALPDDGAITSGKPCDTNPDQVSLYAMRDSLSWWVHWGGNLRPEHYWKQIYIGFAAIPDDVQISPREFLDGTYRYLGHTWDDCLSGLEEEGVSPDEIEFANMCMWRQMLTQWLEKADPELLPLLKGKISLMLQYRVLTANTLGCLALFMNATADPKDGPIHYADSSYEMEIASVAQCVTLDMAKEAMGILQGERTEVVAGDRAQRKRELRWIYVRCMQILESQPHAHMLRRYGSAGLHYVPMMDRYLERVSGHTRFPIRDGAARILERFINRAELPKESEDINPNGRSLKVSAKMNGNGQLHHEVNGNAKLHLEAERPDVTTAVG 432 T 19 NETI unphh F Eukaryota T 7oc6 1 A A ASR6_SARSH XENOVULENE A BIOSYNTHESIS CLUSTER PROTEIN R6 MRGSHHHHHHGMASMTGGQQMGRDLYDDDDKDHPFTMPVTTPTKMATLTTKQMWQTIKDYFGDGFVTGSAPISYNVHTCDMQLQPDSGIHAASDGIHYGVQISEDSMPLFSIMGDTAAPPCTCHRVDEIVKHIDEFLERAPEALPDDGAITSGKPCDTNPDQVSLYAMRDSLSWWVHWGGNLRPEHYWKQIYIGFAAIPDDVQISPREFLDGTYRYLGHTWDDCLSGLEEEGVSPDEIEFANMCMWRQMLTQWLEKADPELLPLLKGKISLMLQYRVLTANTLGCLALFMNATADPKDGPIHYADSSYEMEIASVAQCVTLDMAKEAMGILQGERTEVVAGDRAQRKRELRWIYVRCMQILESQPHAHMLRRYGSAGLHYVPMMDRYLERVSGHTRFPIRDGAARILERFINRAELPKESEDINPNGRSLKVSAKMNGNGQLHHEVNGNAKLHLEAERPDVTTAVG 466 T 19 NETI unphh F Eukaryota T 7oc9 1 A A Q6MQ12_BDEBA Bd0675 GGNDFVSRLKALDGREGKIVSSYDDENTGRCRLELQKYELEDGSQGLAVYLQDTGMYFTPSAGLDKETKLKDANTAVVSTSSERPGGDACGDFGGALGYKKVLVLKDNQVTIRETFRCVMDGFKKYDLSTTCQF 134 T 8.1 Fimbrial_PilY2 unphh F Bacteria T 7oca 3 C,H G,E CNIH2_RAT CNIH-2,CORNICHON FAMILY AMPA RECEPTOR AUXILIARY PROTEIN 2,CORNICHON-LIKE PROTEIN MAFTFAAFCYMLTLVLCASLIFFVIWHIIAFDELRTDFKNPIDQGNPARARERLKNIERICCLLRKLVVPEYSIHGLFCLMFLCAAEWVTLGLNIPLLFYHLWRYFHRPADGSEVMYDAVSIMNADILNYCQKESWCKLAFYLLSFFYYLYSMVYTLVSFENLYFQSGGSTETSQVAPAYPYDVPDYA 188 T 2E-13 Cornichon pdbpssm F Eukaryota T 7ock 2 I,J,K,L L,A,K,J ADOM_BPT3 SAM hydrolase MIFTKEPANVFYVLVSAFRSNLCDEVNMSRHRHMVSTLRAAPGLYGSVESTDLTGCYREAISSAPTEEKTVRVRCKDKAQALNVARLACNEWEQDCVLVYKSQTHTAGLVYAKGIDGYKAERLPGSFQEVPKGAPLQGCFTIDEFGRRWQVQHHHHHH 158 T 0.0035 DUF3293 unphh T Viruses T 7oco 1 A,B,C,D B,A,C,D CAPSD_HBVD3 CORE ANTIGEN,CORE PROTEIN,HBCAG,P21.5 MDIDPYKEFGATVELLSFLPSDFFPSVRDLLDTASALYREALESPEHCSPHHTALRQAIVCWGELMTLATWVGVNLEDPASRDLVVSYVNTNMGLKFRQLLWFHISCLTFGRETVIEYLVSFGVWIRTPPAYRPPNAPILSTLPETTVVRRRGRSPRRRTPSPRRRRSQSPRRRRSQSRESQC 183 T 1.5999999999999998E-25 Hepatitis_core pdb T Viruses T 7ocw 1 A,B,C,D B,A,C,D CAPSD_HBVD3 CORE ANTIGEN,CORE PROTEIN,HBCAG,P21.5 MDIDTYKEFGATVELLSFLPSDFFPSVRDLLDTASALYREALESPEHCSPHHTALRQAILCWGELMTLATWVGVNLEDPASRDLVVSYVNTNMGLKFRQLLWFHISCLTFGRETVIEYLVSFGVWIRTPPAYRPPNAPILSTLPETTVVRRRGRSPRRRTPSPRRRRSQSPRRRRSQSRESQC 183 T 1.9E-17 Hepatitis_core unp T Viruses T 7ocz 1 A,B A,B PID3_CAEEL PIRNA BIOGENESIS AND CHROMOSOME SEGREGATION PROTEIN 1,PIRNA-INDUCED SILENCING DEFECTIVE PROTEIN 3 GPDSMPRGADQENMLKISGYPGMLNTFGIAQLLTPYRVNGITITGAQSAVVALENKFQVYQAVQDFNGKKLDRNHKLQVSSLVV 84 T 0.0014 RRM_1 pdbpercent F Eukaryota T 7od2 1 A A K1A_ANEER KAPPA-AITX-AER3A,ANERK,POTASSIUM CHANNEL TOXIN AETX K ACKDYLPKSECTQFRCRTSMKYKYTNCKKTCGTC 34 T 0.0073 ShK pdbpercent F Eukaryota T 7od6 2 E,F F,E Inhibitory Peptide P2 (GSLLGRMKGA) XXXXXXGSLLGRMKGA 16 T 6.6 Aconitase_B_N pdbhh F T 7od7 2 E E SLLRGM XXXXXXSLLRGM 12 T 39 Leu_leader pdbhh F T 7od8 2 E,F E,F peptide GSLLGRMKGA XXXXXXXXXXGSLLGRMKGA 20 T 11 Aconitase_B_N pdbhh F T 7odx 1 A A PURZ_BPS2L Succinoaminodeoxyadenylate synthetase (PurZ) GTGDGSMLSIPPYYRVKNCNLIVDCQYGSTGKGLLAGYLGALEAPQVLCMAPSPNAGHTLVEEDGTARVHKMLPLGITSPSLERIYLGPGSVIDMDRLLEEYLALPRQVELWVHQNAAVVLQEHRDEEAAGGLAPGSTRSGAGSAFIAKIRRRPGTLLFGEAVRDHPLHGVVRVVDTRTAQDMLFRTRSIQAEGCQGYSLSVHHGAYPYCTARDVTTAQLIADCGLPYDVARIARVVGSMRTYPIRVANRPEAGEWSGPCYPDSVECQFADLGLEQEYTTVTKLPRRIFTFSAIQAHEAIAQNGVDEVFLNFAQYPPSLGALEDILDAIEARAEVTYVGFGPKVTDVYHTPTRAELEGLYARYRR 365 T 1.9999999999999999E-56 Adenylsucc_synt pdbpssm T Viruses T 7oe2 2 F,G,H,I,J 1,2,3,4,5 D0LZ73_HALO1 Haliangium ochraceum Encapsulated ferritin localisation sequence MSSEQLHEPAELLSEETKNMHRALVTLIEELEAVDWYQQRADACSEPGLHDVLIHNKNEEVEHAMMTLEWIRRRSPVFDAHMRTYLFTERPILELEEEDTGSSSSVAASPTSAPSHGSLGIGSLRQEGKED 131 T 0.00024 Rubrerythrin pdbpercent F Bacteria T 7oec 1 A A DP2L_PYRHO POL II,EXODEOXYRIBONUCLEASE LARGE SUBUNIT SGNAFPGDTRILVQINGTPQRVTLKELYELFDEEHYESMVYVRKKPKVDIKVYSFNPEEGKVVLTDIEEVIKAPATDHLIRFELELGSSFETTVDHPVLVYENGKFVEKRAFEVREGNIIIIIDESTLEPLKVAVKKIEFIEPPEDFVFSLNAKKYHTVIINENIVTHQ 169 T 1.4E-08 Intein_splicing pdbhh F Archaea T 7oew 2 E,F E,F MHRSLLGRMKGA XXXXXXXXXXXXMHRSLLGRMKGA 24 T 19 Aconitase_B_N pdbhh F T 7ofm 1 A A BAK_HUMAN APOPTOSIS REGULATOR BAK,BCL-2-LIKE PROTEIN 7,BCL2-L-7 SLGNGPILNVLVVLGVVLLGQFVVRRFFKS 30 T 3.5 FeoB_associated pdbhh F Eukaryota T 7og1 5 E,H GGG,DDD FCHO2_HUMAN F-BAR domain only protein 2 GSPEFNIPDVDEEGYSIKPETNQNDTKENHFYSSSDSDSEDEEPKKYRIEIKPMHPNNSHHTMASLDELKVSIGNITLSPAISRHSPVQMNRNLSNEELTKSKPSAPPNEKGTSDLLAWDPLFGPSLDSSSSSSLTEFPGRPHHHHHHHHHH 152 T 94 Nas2_N pdbhh F Eukaryota T 7og2 1 A,B A,B A0A166WMK8_9GAMM Amine oxidoreductase MTHYTFGKEITDKQLPSQVKVAIVGAGMSGLYSAWRLQQEANCQDLAIFERSDRTGGRLDSDLIEFKNLRSDEPKTITVKEEQGGMRFLFDGMDDLMALFLKLNLQDDIVPFPMNSGGNNRLFFRGESFSVSDAQQDDYAIWSHLYNLDQSEQGVNPKDIVNVVFNRILEANPQFQQRPKVRGPQFWQDFRLECQWKGQGLNQWTLWDLYTDMGYSQECITMLYRVLGFNGTFLSQMNAGVAYQLLEDFPAGVKFKTFKDGFSTLPNKLVEEVGTNNIHLQTTIEEIDFNEESGLYELSYAHIDAHGKIHKGLVKAEKVILGLPRLALEKLFVRSNVINRLDQDRSELLWNTLQSASNQPLLKINLYYDSAWWGRGTTGRPAVEFGPNFADLPTGSVYPFYAVNEELAAALMYEERTTHPSDAVEAKLERIGNDKYERPAALTIYCDYLNINFWSNLQNIGETYHNPKQDHYVENVPDDIYPASTAVVEQATRFFKDIFNTHYVPAPVLTSARIWEGSVKFDIPANRQFGYGVHQWAVGANDKEVMATLSEPLPNLFTCGEAFSDYQGWVEGALRSTDLALEKGFGLKPLSQAYFESTHISSSDAIKAVYEENSSKLINQYIETNFAASAAPIEKADDEQSVIGVNLSYFDVK 653 T 9.2E-21 Amino_oxidase pdbhh F Bacteria T 7ogo 3 C,F CCC,FFF IDL1_ARATH Protein IDA-LIKE 1 YVLVPPSGPSMRHN 14 T 0.021 Sperm_Ag_HE2 unp F Eukaryota T 7ogp 2 B B Q8SD94_BPDPK PHIKZ068 MEIIVTGVQGTGFTEVATEHNGKRLTWTTTAYSKIRVQDQQRVFQEINDYWSGLSAEAQQHIWNCYVEIRKIMDMAMHPMRIAMSLSYYIKEMYKAMPMNSFRRWLLTIGKLYIPVDIEEVITDDSRYNRPDQTYLKHDYINLASVSLALRPLVPIWGEFIDQGTSQEMHKECEVISLISDCEVNHWPVDEISIDGTPVETAYDKLSAYVKFCVEDEAPTLANLYRGMSSAEVPDILQAKVMVRRLTILPLNDATSHSIVSNMFRYVKSNLNPAERSTADRVNDKRPDKGGIDDDDKTSFIESHKTKQRVTPGDIVAYNLDALDVVKLVHKIDDTVPVELIQECLDCVAVTATKDIYPHQILLAQWVMHKAFPARAFSHINKNAVNHLLAAAQSLMWHWGFQQVAVFMQVELYYSGEHAMSIQPRNSTRIQIKYKDVMDELYPHQRQQRAINGVPVAPVNIAGIAVQSAHASIRSSNWIYHGPDRLFKEAEQVTQNKVLVVPATIKSVITELVIHLGKLNQ 521 T 0.16 FF pdbpercent T Viruses T 7ogp 5 E E Q8SD39_BPDPK PHIKZ123 MPDPFLIEKIRENTPCMNPTLANGITVEHTMTRDPNTGVNMTRRYIDSLFDISSVLFPDGFKYEGNRACTPLKHFEEITREYNAKRIANIAPTDMYMIDLMFSYKGEMLYPRPMLLPAFKRGNMVTINGAKYIGSPVLTDVGFSVLNDSIFIPFRRTKLTFKQTDHHYMCNGQRKIMYVIWSQIHNEMAKRTKRDLGNRPHIESCLAHYFFCQFGVTQTFKQWANVDVKCGLLSDFPEEEYPREKWNIYSSATLKGKHPTGEMVLVIPRHQESIFATRLIAGFWYVVDAFPMRFTRPEYVDSTNLWRVILGHMVFGDFEHQGKVEENIDSHLHSFCNSLDEMTIEELKTVGVNVSTIWELLYEIMTSLAHHLYATDIDETSMYGKRLTVLHYLMSEFNYAVSMFGYMFQSRRDREWTVQELNEGLKRSFKLQTAIKRLTVDHGELDTMSNPNSSMLIKGTSILVTQDRAKTAKAHNKSLINDSSRIIHASIAEVGQYKNQPKNNPDGRGRLNMYTKVGPTGLVERREEVREIIDNAQLMFRAK 543 T 0.4 OGG_N unp T Viruses T 7ogq 3 C CCC IDL2_ARATH Protein IDA-LIKE 2 YVPVPASGPSRKHN 14 T 0.033 RSN1_TM unp F Eukaryota T 7ogz 3 C,F CCC,FFF IDL3_ARATH PEPTIDE FROM PROTEIN IDA-LIKE 3 PVPTSGPSRKHN 12 T 2.3 Disulph_isomer pdbhh F Eukaryota T 7ohi 2 B B FCHO1_HUMAN F-BAR domain only protein 1 QSEEQVSKNLFGPPLESAFDHED 23 T 4.4 CDI pdbhh F Eukaryota T 7oiq 2 C,D CCC,DDD FCHO2_HUMAN F-BAR domain only protein 2 SDLLAWDPLFG 11 T 1.1 DUF1871 pdbhh F Eukaryota T 7oj1 1 A A IMDH_BACSU Inosine-5'-monophosphate dehydrogenase WESKFSKEGLTFDDVLLVPAKSEVLPRDVDLSVELTKTLKLNIPVISAGMDTVTESAMAIAMARQGGLGIIHKNMSIEQQAEQVDKVKRSERGITNPFFLTPDHQVFDAEHLMGKRISGVPIEEDLVGIITNRDLRFISMKISDVMTKEELVTASVGTTLDEAEKILQKHKIEKLPLVGLITIKDIEKVIEFPNSSKDIHGRLIVGAAVGVTGDTMTRVKKLVEANVDVIVIDTAHGHSQGVLNTVTKIRETYPELNIIAGNVATAEATRALIEAGADVVKVGIGPICTTRVVAGVGVPQITAIYDCATEARKHGKTIIADGGIKFSGDITKALAAGGHAVMLGSLLAGTSESPGETPYKGPVEETVYQLVGGLRSGMGYCGSKDLRALREEAQFIRMTGA 401 T 1.7E-11 IMPDH pdb F Bacteria T 7oj9 1 A B POLN_EEEV1 EEEV nsP3 peptide AERLIPRRPAPPVPVPARIPSPR 23 T 31 EspF pdbhh T Viruses T 7oko 2 AB,BA,C,F,FB,GA,K,KB,LA,QA,R,VA,W n,O,AC,2,s,T,7,x,Y,d,E,i,J TraB PGMMDSQEFS 10 T 2.1 NADPH_Ox pdbhh F T 7old 31 EA LW G0S1P9_CHATD 60S ribosomal protein L24-like protein MRTYEDTFSGQRIYPGKVRFPISHEGDNGDISHPEEIRTGRRKIAPATRQLRAEVQKTSMKGKLYVRGDSKIFRFQNGKSESLFLQRKNPRRIAWTVLYRRQHRKGISEEVAKKRTRRTIKSQRAIVGASLEVIKERRSMRPEARNAARLAAIKESKEKKAAAQAAKKAEKAKNAAAAAKGQPQGRVTSKQGAKGAPVKVAAKSR 205 T 0.03 Ribosomal_L24e pdbpssm F Eukaryota T 7ole 3 G H TTI1_HUMAN PROTEIN SMG10 MAVFDTPEEAFGVLRPVCVQLTKTQTVENVEHLQTRLQAVSDSALQELQQYILFPLRFTLKTPGPKRERLIQSVVECLTFVLSSTCVKEQELLQELFSELSACLYSPSSQKPAAVSEELKLAVIQGLSTLMHSAYGDIILTFYEPSILPRLGFAVSLLLGLAEQEKSKQIKIAALKCLQVLLLQCDCQDHPRSLDELEQKQLGDLFASFLPGISTALTRLITGDFKQGHSIVVSSLKIFYKTVSFIMADEQLKRISKVQAKPAVEHRVAELMVYREADWVKKTGDKLTILIKKIIECVSVHPHWKVRLELVELVEDLLLKCSQSLVECAGPLLKALVGLVNDESPEIQAQCNKVLRHFADQKVVVGNKALADILSESLHSLATSLPRLMNSQDDQGKFSTLSLLLGYLKLLGPKINFVLNSVAHLQRLSKALIQVLELDVADIKIVEERRWNSDDLNASPKTSATQPWNRIQRRYFRFFTDERIFMLLRQVCQLLGYYGNLYLLVDHFMELYHQSVVYRKQAAMILNELVTGAAGLEVEDLHEKHIKTNPEELREIVTSILEEYTSQENWYLVTCLETEEMGEELMMEHPGLQAITSGEHTCQVTSFLAFSKPSPTICSMNSNIWQICIQLEGIGQFAYALGKDFCLLLMSALYPVLEKAGDQTLLISQVATSTMMDVCRACGYDSLQHLINQNSDYLVNGISLNLRHLALHPHTPKVLEVMLRNSDANLLPLVADVVQDVLATLDQFYDKRAASFVSVLHALMAALAQWFPDTGNLGHLQEQSLGEEGSHLNQRPAALEKSTTTAEDIEQFLLNYLKEKDVADGNVSDFDNEEEEQSVPPKVDENDTRPDVEPPLPLQIQIAMDVMERCIHLLSDKNLQIRLKVLDVLDLCVVVLQSHKNQLLPLAHQAWPSLVHRLTRDAPLAVLRAFKVLRTLGSKCGDFLRSRFCKDVLPKLAGSLVTQAPISARAGPVYSHTLAFKLQLAVLQGLGPLCERLDLGEGDLNKVADACLIYLSVKQPVKLQEAARSVFLHLMKVDPDSTWFLLNELYCPVQFTPPHPSLHPVQLHGASGQQNPYTTNVLQLLKELQXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGXGXXXXXXXXXXXXXXXXXXXXXXXXXGXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 1733 T 0.00097 Proteasom_PSMB pdbhh F Eukaryota T 7ole 4 H J TTI2_HUMAN TELO2-interacting protein 2,TELO2-interacting protein 2,TTI2 MELDSALEAPSQEDSNLSEELSHSAFGQAFSKILHCLARPEARRGNVKDAVLKDLGDLIEATEFDRLFEGTGARLRGMPETLGQVAKALEKYAAPSKEEEGGGDGHSEAAEKAAQVGLLFLKLLGKVETAKNSLVGPAWQTGLHHLAGPVYIFAITHSLEQPWTTPRSREVAREVLTSLLQVTECGSVAGFLHGENEDEKGRLSVILGLLKPDLYKESWKNNPAIKHVFSWTLQQVTRPWLSQHLERVLPASLVISDDYQTENKILGVHCLHHIVLNVPAADLLQYNRAQVLYHAISNHLYTPEHHLIQAVLLCLLDLFPILEKTLHWKGDGARPTTHCDEVLRLILTHMEPEHRLLLRRTYARNLPAFVNRLGILTVRHLKRLERVIIGYLEVYDGPEEEARLKILETLKLLMQHTWPRVSCRLVVLLKALLKLICDVARDPNLTPESVKSALLQEATDCLILLDRCSQGRVKGLLAKIPQSCEDRKVVNYIRKVQQVSEGAPYNGTXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGXGXXXXXXXXXXXXXXXXXXXXXXXXXGXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGXXGXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 964 T 2.1 Tti2 pdbpssm F Eukaryota T 7ole 5 I K TELO2_HUMAN PROTEIN CLK-2 HOMOLOG,HCLK2,PROTEIN CLK-2 HOMOLOG,HCLK2,PROTEIN CLK-2 HOMOLOG,HCLK2,PROTEIN CLK-2 HOMOLOG,HCLK2 MEPAPSEVRLAVREAIHALSSSEDGGHIFCTLESLKRYLGEMEPPALPREXXXXXXXXXXKEEFASAHFSPVLRCLASRLSPAWLELLPHGRLEXXXXXXXXELWASFFLEGPADQAFLVLMETIEGAAGPSFRLMKMARLLARFLREGRLAVLMEAQCRQQTQPGFILLRETLLGKVVXXXXXXXXXXXXXXXXXXALPDHLGNRLQQENLAEFFPQNYFRLLGEEVVRVLQAVVDSLQGGLDSSVSFVSQVLGKACVHGRQQEILGVLVPRLAALTQGSYLHQRVCWRLVEQVPDRAMEAVLTGLVEAALGPEVLSRLLGNLVVKNKKAQFVMTQKLLFLQSRLTTPMLQSLLGHLAMDSQRRPLLLQVLKELLETWGSSSAIRHTPLPQQRHVSKAVLICLAQLGEPELRDSRDELLASMMAGVKCRLDSSLPPVRRLGMIVAEVVSARIHPEGPPLKFQYEEDELSLELLALASPQPAGDGASEAGT 491 T 0.0069 Ribosomal_60s pdbpssm F Eukaryota T 7olg 2 C,D C,D DYST_HUMAN 11MACF KPSKIPTPQRK 11 T 4.6 DUF3697 pdbhh F Eukaryota T 7op0 3 C C K92chemFE TCPEGWSECGVAIYGYACGRWGCGHFLNSGPNISP 35 T 0.14 Toxin_4 pdbhh F T 7opb 2 D,E,F D,E,F IL7R binder SVIEKLRKLEKQARKQGDEVLVMLARMVLEYLEKGWVSEEDADESADRIEEVLKK 55 T 0.89 TyeA pdbhh F T 7opm 2 B B P28 MQLXLDSSNLARRRRRRR 18 T 6.9 UCMA pdbhh F T 7opm 3 C C ORF45_HHV8P Protein ORF45 RPPVKFIFPPPPLS 14 T 0.98 AIM3 pdbhh T Viruses T 7opo 2 B,D,F,H,J,L B,D,F,H,J,L ORF45_HHV8P Protein ORF45 GSRMLPIEGAPRRRPPVKFIFPPPPLSSLPGFGRPRGYAGPTVIDMSAPDDVFAEDTPSPPAT 63 T 43 Corona_NS1 pdbhh T Viruses T 7oq4 14 N Z RIP_ATV RIP MKNMLHPQKYETHVLDDLMEFYEGVIGYPEIDLRLAGEEAWLKGVNPELAEAVKKIIKTIRRYLEGSPYDGSEKPIPRYIIAEIFSQIAPEVQLLVNALDTEGKYGFLKHIKKLNLNSLAMLSKNYNENDKLWKELENEGYVYLELVPR 149 T 0.11 MnmE_helical pdbpssm T Viruses T 7oqe 17 Q D PRP39_YEAST Pre-mRNA-processing factor 39 MPDETNFTIEDIEPRPDALRGLDTQFLQDNTALVQAYRGLDWSDISSLTQMVDVIEQTVVKYGNPNDSIKLALETILWQILRKYPLLFGFWKRFATIEYQLFGLKKSIAVLATSVKWFPTSLELWCDYLNVLCVNNPNETDFIRNNFEIAKDLIGKQFLSHPFWDKFIEFEVGQKNWHNVQRIYEYIIEVPLHQYARFFTSYKKFLNEKNLKTTRNIDIVLRKTQTTVNEIWQFESKIKQPFFNLGQVLNDDLENWSRYLKFVTDPSKSLDKEFVMSVFDRCLIPCLYHENTWMMYIKWLTKKNISDEVVVDIYQKANTFLPLDFKTLRYDFLRFLKRKYRSNNTLFNNIFNETVSRYLKIWPNDILLMTEYLCMLKRHSFKNSLDQSPKEILEKQTSFTKILETSITNYINNQIDAKVHLQTLINDKNLSIVVVELIKTTWLVLKNNMQTRKYFNLYQKNILIKNSVPFWLTYYKFEKSNVNFTKLNKFIRELGVEIYLPTTVMNDILTDYKTFYLTHSNIVTYESSIIDSNTFDPILYPELKMSNPKYDPVLNTTANVDWHKKTEWKEAGHIGITTERPQISNSIIECNSGTLIQKPISLPNFRNLEKINQVKINDLYTEEFLKEGK 629 F F Eukaryota T 7oqv 1 A,B,C,D AAA,BBB,CCC,DDD VIN3_ARATH Protein VERNALIZATION INSENSITIVE 3 GDKDLGHIVKTIRCLEEEGHIDKSFRERFLTWYSLRATHREVRVVKDFVETFMEDLSSLGQQLVDTFSESILSKR 75 T 3.3 DUF6495 pdbhh F Eukaryota T 7or3 2 B B NOTC4_HUMAN NOTCH 4,HNOTCH4 RGRRFSAGMRG 11 T 7.3 RNA_polI_A14 pdbhh F Eukaryota T 7or8 2 B P CDN1B_HUMAN CYCLIN-DEPENDENT KINASE INHIBITOR P27,P27KIP1 TPKKPGLRRRQT 12 T 10 MepB pdbhh F Eukaryota T 7os0 1 A,B A,C D5AUW0_RHOCB Cas13a MQIGKVQGRTISEFGDPAGGLKRKISTDGKNRKELPAHLSSDPKALIGQWISGIDKIYRKPDSRKSDGKAIHSPTPSKMQFDARDDLGEAFWKLVSEAGLAQDSDYDQFKRRLHPYGDKFQPADSGAKLKFEADPPEPQAFHGRWYGAMSKRGNDAKELAAALYEHLHVDEKRIDGQPKRNPKTDKFAPGLVVARALGIESSVLPRGMARLARNWGEEEIQTYFVVDVAASVKEVAKAAVSAAQAFDPPRQVSGRSLSPKVGFALAEHLERVTGSKRCSFDPAAGPSVLALHDEVKKTYKRLCARGKNAARAFPADKTELLALMRHTHENRVRNQMVRMGRVSEYRGQQAGDLAQSHYWTSAGQTEIKESEIFVRLWVGAFALAGRSMKAWIDPMGKIVNTEKNDRDLTAAVNIRQVISNKEMVAEAMARRGIYFGETPELDRLGAEGNEGFVFALLRYLRGCRNQTFHLGARAGFLKEIRKELEKTRWGKAKEAEHVVLTDKTVAAIRAIIDNDAKALGARLLADLSGAFVAHYASKEHFSTLYSEIVKAVKDAPEVSSGLPRLKLLLKRADGVRGYVHGLRDTRKHAFATKLPPPPAPRELDDPATKARYIALLRLYDGPFRAYASGITGTALAGPAARAKEAATALAQSVNVTKAYSDVMEGRTSRLRPPNDGETLREYLSALTGETATEFRVQIGYESDSENARKQAEFIENYRRDMLAFMFEDYIRAKGFDWILKIEPGATAMTRAPVLPEPIDTRGQYEHWQAALYLVMHFVPASDVSNLLHQLRKWEALQGKYELVQDGDATDQADARREALDLVKRFRDVLVLFLKTGEARFEGRAAPFDLKPFRALFANPATFDRLFMATPTTARPAEDDPEGDGASEPELRVARTLRGLRQIARYNHMAVLSDLFAKHKVRDEEVARLAEIEDETQEKSQIVAAQELRTDLHDKVMKCHPKTISPEERQSYAAAIKTIEEHRFLVGRVYLGDHLRLHRLMMDVIGRLIDYAGAYERDTGTFLINASKQLGAGADWAVTIAGAANTDARTQTRKDLAHFNVLDRADGTPDLTALVNRAREMMAYDRKRKNAVPRSILDMLARLGLTLKWQMKDHLLQDATITQAAIKHLDKVRLTVGGPAAVTEARFSQDYLQMVAAVFNGSVQNPKPRRRDDGDAWHKPPKPATAQSQPDQKPPNKAPSAGSRLPPPQVGEVYEGVVVKVIDTGSLGFLAVEGVAGNIGLHISRLRRIREDAIIVGRRYRFRVEIYVPPKSNTSKLNAADLVRIDENLYFQKLAAALEHHHHHH 1304 T 0.54 D5_N unppercent F Bacteria T 7os1 2 B F WBP4_HUMAN WBP-4,FORMIN-BINDING PROTEIN 21,WW DOMAIN-CONTAINING-BINDING PROTEIN 4 GAMAFNPHTSDLPSSKVNENSLGTLDESKSSDSHSDSDGEQEAEEGGVSTETEKPKIKFKEKNKNSDGGSDPETQKEKSIQKQNSLGSNEEKSKTLKKSNPYGEWQEIKQEVESHEEVDLELPSTENEYVSTSEADGGGEPKVVFKEKTVTSLGVMADGVAPVFKKRRTENGKSRNLRQRGDDQ 184 T 0.91 CDC45 pdbpssm F Eukaryota T 7os8 1 A A TPL_RANTE PHE-VAL-PRO-TRP-PHE-SER-LYS-PHE-DLE-GLY-ARG-ILE-LEU-NH2 FVPWFSKFXGRILX 14 T 0.013 Mim2 pdbhh F Eukaryota T 7osc 1 A A A0A2Y9FJE4_PHYMC cathelicidin-1-like QICRIIVVRVCRPICRITVIRVCS 24 T 15 IF3_N pdbhh F Eukaryota T 7osd 1 A A TPL_RANTE PHE-VAL-PRO-TRP-PHE-LYS-LYS-PHE-DLE-GLU-ARG-ILE-LEU-NH2 FVPWFKKFXERILX 14 T 0.063 MOSC_N unphh F Eukaryota T 7osu 1 A A sTIM11noCys-SB MDKDEAWKQVEQLRREGATRIAYRSDDWRDLKEAWKKGADILIVDATDKDEAWKQVEQLRREGATEIAYRSDDWRDLKEAWKKGADILIVDATDKDEAWKQVEQLRREGATRIAYRSDDWRDLKEAWKKGADILIVDATDKDEAWKQVEQLRREGATEIAYRSDDWRDLKEAWKKGADILIVDATGLEHHHHHH 194 T 0.00014 NanE pdbhh F T 7osv 1 A A DeNovoTIM6-SB MDKDEAWKQVEILRRLGAKRIAYRSDDWRDLQEALKKGADILIVDATDKDEAWKQVEILRRLGAKEIAYRSDDWRDLQEALKKGADILIVDATDKDEAWKQVEILRRLGAKRIAYRSDDWRDLQEALKKGADILIVDATDKDEAWKQVEILRRLGAKEIAYRSDDWRDLQEALKKGADILIVDATGLEHHHHHH 194 T 1.2E-05 NanE pdbhh F T 7oui 23 Z,ZA U,u PST2_ARATH PsbTn EPKRGTEAAKKKYAQVCVTMPTAKICRY 28 T 0.47 Surface_antigen pdbhh F Eukaryota T 7oun 2 B B macrocyclic peptide AFLFVIRDRVFRCG 14 T 3.9 BOFC_N pdbhh F T 7ovc 2 B B UBA5_HUMAN UBIQUITIN-ACTIVATING ENZYME 5,THIFP1,UFM1-ACTIVATING ENZYME,UBIQUITIN-ACTIVATING ENZYME E1 DOMAIN-CONTAINING PROTEIN 1 GMSVTELTVEDSGESLEDLMAKMKNMW 27 T 0.43 DUF5786 pdbhh F Eukaryota T 7ovx 2 B Q GLYG_HUMAN Peptide G DNIKRKLDTYLQ 12 T 3.1 NTS_2 pdbhh F Eukaryota T 7owm 2 C C HPCA_HUMAN CALCIUM-BINDING PROTEIN BDR-2 GKQNSKLR 8 T 0.07 EF-hand_7 unppercent F Eukaryota T 7own 2 C D ALA-LYS-SER-PHE-SER-LYS-PRO-ARG AKSFSKPR 8 T 19 Crystall_4 pdbhh F T 7owo 2 C,D D,F N-Acetyl-LYS-SER-PHE-SER-LYS-PRO-ARG XKSFSKPR 8 T 19 Crystall_4 pdbhh F T 7owp 2 C,D E,D ACE-GLY-ORN-SER-PHE-SER-LYS-PRO-ARG XGXSFSKPR 9 T 7.7 cIII pdbhh F T 7owr 2 C E GLY-GLY-LYS-SER-PHE-SER-LYS-PRO-ARG GGKSFSKPR 9 T 3 zf_C2H2_6 pdbhh F T 7owu 2 C,D C,D ALA-ASN-CYS-PHE-SER-LYS-PRO-ARG ANCFSKPR 8 T 4 Flexi_CP_N pdbhh F T 7ox1 3 G,J,K,L G,X,Y,Z IL9_HUMAN IL-9,CYTOKINE P40,T-CELL GROWTH FACTOR P40 GSHMQGCPTLAGILDINFLINKMQEDPASKCHCSANVTSCLCLGIPSDNCTRPCFSERLSQMTNTTMQTRYPLIFSRVKKSVEVLKNNKCPYFSCEQPCNQTTAGNALTFLKSLLEIFQKEKMRGMRGKI 130 T 0.0044 Dynamin_M unppercent F Eukaryota T 7ox4 3 C C IL9_MOUSE IL-9,CYTOKINE P40,T-CELL GROWTH FACTOR P40 GSHMQRCSTTWGIRDTNYLIENLKDDPPSKCSCSGNVTSCLCLSVPTDDCTTPCYREGLLQLTNATQKSRLLPVFHRVKRIVEVLKNITCPSFSCEKPCNQTMAGNTLSFLKSLLGTFQKTEMQRQKSRP 130 T 0.0041 Dynamin_M unppssm F Eukaryota T 7oxe 2 B B THR-ALA-GLU-HIS-ASP-GLU-PHE TAEHDEF 7 T 110 Histidinol_dh pdbhh F T 7oxp 1 A,B,C,D,E,F,G,H,I,J A,B,C,D,E,F,G,H,I,J A0A6A5PUS7_YEASX HLJ1_G0030540.MRNA.1.CDS.1,SEIPIN,Y55_G0030470.MRNA.1.CDS.1 MKINVSRPLQFLQWSSYIVVAFLIQLLIILPLSILIYHDFYLRLLPADSSNVVPLNTFNILNGVQFGTKFFQSIKSIPVGTDLPQTIDNGLSQLIPMRDNMEYKLDLNLQLYCQSKTDHLNLDNLLIDVYRGPGPLLGAPGGSNSKDEKIFHTSRPIVCLALTDSMSPQEIEQLGPSRLDVYDEEWLNTIRIEDKISLESSYETISVFLKTEIAQRNLIIHPESGIKFRMNFEQGLRNLMLRKRFLSYIIGISIFHCIICVLFFITGCTAFIFVRKGQEKSKKHSGRRIPGLINGGGGGGDYKDHDGDYKDHDIDYKDDDDK 322 T 0.23 Seipin pdbpercent F Eukaryota T 7oxr 1 A,B,C,D,E,F,G,H,I,J A,B,C,D,E,F,G,H,I,J A0A6A5PUS7_YEASX HLJ1_G0030540.MRNA.1.CDS.1,SEIPIN,Y55_G0030470.MRNA.1.CDS.1 MKINVSRPLQFLQWSSYIVVAFLIQLLIILPLSILIYHDFYLRLLPADSSNVVPLNTFNILNGVQFGTKFFQSIKSIPVGTDLPQTIDNGLSQLIPMRDNMEYKLDLNLQLYCQSKTDHLNLDNLLIDVYRGPGPLLGAPGGSNSKDEKIFHTSRPIVCLALTDSMSPQEIEQLGPSRLDVYDEEWLNTIRIEDKISLESSYETISVFLKTEIAQRNLIIHPESGIKFRMGGSGGSRFLSYIIGISIFHCIICVLFFITGCTAFIFVRKGQEKSKKHSGRRIPGLINGGGGGGDYKDHDGDYKDHDIDYKDDDDK 315 T 0.047 Telomerase_RBD pdbpercent F Eukaryota T 7oye 2 B B THR-ALA-GLU-HIS-ASP-GLU-LEU TAEHDEL 7 T 150 FAS_I_H pdbhh F T 7oym 2 B B Hit2 (MH65) XHPYKAHA 8 T 25 RRN9 pdbhh F T 7oyn 2 B B Hit3 (MH57) XSLPFTVYX 9 T 8.2 Soc pdbhh F T 7p02 4 D A GNAI1_HUMAN;GNAS2_HUMAN ADENYLATE CYCLASE-INHIBITING G ALPHA PROTEIN,ADENYLATE CYCLASE-STIMULATING G ALPHA PROTEIN MGCTLSAEDKAAVERSKMIEKQLQKDKQVYRATHRLLLLGADNSGKSTIVKQMRILHGGSGGSGGTSGIFETKFQVDKVNFHMFDVGGQRDERRKWIQCFNDVTAIIFVVDSSDYNRLQEALNLFKSIWNNRWLRTISVILFLNKQDLLAEKVLAGKSKIEDYFPEFARYTTPEDATPEPGEDPRVTRAKYFIRDEFLRISTASGDGRHYCYPHFTCAVDTENARRIFNDCRDIIQRMHLRQYELL 246 T 9E-10 G-alpha pdb F Eukaryota T 7p12 1 A A DeNovoTIM13-SB MDVDEMLKQVEILRRLGAKRIAVRSDDWRILQEALKKGGDILIVDATDVDEMLKQVEILRRLGAKEIAVRSDDWRILQEALKKGGDILIVDATDVDEMLKQVEILRRLGAKRIAVRSDDWRILQEALKKGGDILIVDATDVDEMLKQVEILRRLGAKEIAVRSDDWRILQEALKKGGDILIVDATLEHHHHHH 193 T 0.00032 NanE pdbhh F T 7p1c 2 B B TRP-ASN-UX8-THR-LYS-ARG-PHE WNXTKRF 7 T 11 TMP pdbhh F T 7p3h 1 A,B,C A,B,C Peptide HC02 XEWEAIEKKIAANESKDQAIEKKIQAIEKKIEAIEHGX 38 T 0.028 FlaC_arch pdb F T 7p46 1 A,B,C,D,E,F,G,H A,B,C,D,E,F,G,H T23O_XANCP TDO,TRYPTAMIN 2,3-DIOXYGENASE,TRYPTOPHAN OXYGENASE,TO,TRPO,TRYPTOPHAN PYRROLASE,TRYPTOPHANASE KNLRDLEPGIHTDLEGRLTYGGYLRLDQLLSAQQPLSEPAHHDEMLFIIQSQTSELWLKLLAHELRAAIVHLQRDEVWQCRKVLARSKQVLRQLTEQWSVLETLTPSEYMGFRDVLGPSSGFQSLQYRYIEFLLGNKNPQMLQVFAYDPAGQARLREVLEAPSLYEEFLRYLARFGHAIPQQYQARDWTAAHVADDTLRPVFERIYENTDRYWREYSLCEDLVDVETQFQLWRFRHMRTVMRVIGFKRGTGGSSGVGFLQQALALTFFPELFDVRTSVGVDN 282 T 6.3E-41 Trp_dioxygenase unp F Bacteria T 7p47 1 A B SMC5_YEAST Structural maintenance of chromosomes protein 5 MGTDEFLKAKEKINEIFEKLNTIRDEVIKKKNQNEYYRGRTGTRKDVSQKIKDIDDQIQQLLLKQRHLLSKMASSMKSLKNCQK 84 T 0.00014 Phe_tRNA-synt_N unp F Eukaryota T 7p4a 2 B,D E,D A0A659I9D5_STAAU Sri MVTKEFLKIKLECSDMYAQKLIDEAQGDENKLYDLFIQKLAERHTRPAIVEY 52 T 0.42 DUF3173 pdbhh F Bacteria T 7p4n 1 A A VWF_HUMAN VWF GSMATACTIQLRGGQIMTLKRDETLQDGCDTHFCKVNERGEYFWEKRVTGCPPFDEHKCLAEGGKIMKIPGTCCDTCE 78 T 0.099 zf_CCCH_5 pdb F Eukaryota T 7p5u 2 C,D CCC,EEE MGC0122 DRAATPHHRPQPR 13 T 15 Holin_2-3 pdbhh F T 7p5z 7 M 1 CDC7_YEAST Cell division control protein 7 MTSKTKNIDDIPPEIKEEMIQLYHDLPGIENEYKLIDKIGEGTFSSVYKAKDITGKITKKFASHFWNYGSNYVALKKIYVTSSPQRIYNELNLLYIMTGSSRVAPLCDAKRVRDQVIAVLPYYPHEEFRTFYRDLPIKGIKKYIWELLRALKFVHSKGIIHRDIKPTNFLFNLELGRGVLVDFGLAEAQMDYKSMISSQNDYDNYANTNHDGGYSMRNHEQFCPCIMRNQYSPNSHNQTPPMVTIQNGKVVHLNNVNGVDLTKGYPKNETRRIKRANRAGTRGFRAPEVLMKCGAQSTKIDIWSVGVILLSLLGRRFPMFQSLDDADSLLELCTIFGWKELRKCAALHGLGFEASGLIWDKPNGYSNGLKEFVYDLLNKECTIGTFPEYSVAFETFGFLQQELHDRMSIEPQLPDPKTNMDAVDAYELKKYQEEIWSDHYWCFQVLEQCFEMDPQKRSSAEDLLKTPFFNELNENTYLLDGESTDEDDVVSSSEADLLDKDVLLISE 507 T 7.6E-21 Pkinase pdbpssm F Eukaryota T 7p6r 1 A,B A,B GP2_HUMAN PANCREATIC ZYMOGEN GRANULE MEMBRANE PROTEIN GP-2,ZAP75 VQRGYGNPIEASSYGLDLDCGAPGTPEAHVCFDPCQNYTLLDEPFRSTENSAGSQGCDKNMSGWYRFVGEGGVRMSETCVQVHRCQTDAPMWLNGTHPALGDGITNHTACAHWSGNCCFWKTEVLVKACPGGYHVYRLEGTPWCNLRYCTDPSHHHHHHHH 161 T 140 Diphtheria_R pdbhh F Eukaryota T 7p70 1 A C VE6_HPV35 Protein E6 TDDSKPTRRETEV 13 T 0.38 E6 unphh T Viruses T 7p73 2 B B TAX_HTL1A PROTEIN X-LOR,PROTEIN PX,TRANS-ACTIVATING TRANSCRIPTIONAL REGULATORY PROTEIN OF HTLV-1 TDDSEKHFRETEV 13 T 6 VGLL4 pdbhh T Viruses T 7p74 2 B B KS6A1_HUMAN S6K-ALPHA-1,90 KDA RIBOSOMAL PROTEIN S6 KINASE 1,P90-RSK 1,P90RSK1,P90S6K,MAP KINASE-ACTIVATED PROTEIN KINASE 1A,MAPK-ACTIVATED PROTEIN KINASE 1A,MAPKAP KINASE 1A,MAPKAPK-1A,RIBOSOMAL S6 KINASE 1,RSK-1 TDDSRRVRKLPSTTL 15 T 9.6 DUF1088 pdbhh F Eukaryota T 7p7q 15 O H RL3_ENTFA 50S ribosomal protein L3 MTKGILGKKVGMTQIFTESGELIPVTVVEATPNVVLQVKTVETDGYEAIQVGYQDKREVLSNKPAKGHVAKANTAPKRFIKEFKNVELGEYEVGKEIKVDVFQAGDVVDVTGTTKGKGFQGAIKRHGQSRGPMSHGSRYHRRPGSMGPVAPNRVFKNKRLAGRMGGDRVTIQNLEVVKVDVERNVILIKGNIPGAKKSLITIKSAVKAK 209 F F Bacteria T 7p93 2 B,C B,M ACKR1_HUMAN DUFFY ANTIGEN/CHEMOKINE RECEPTOR,FY GLYCOPROTEIN,GPFY,GLYCOPROTEIN D,PLASMODIUM VIVAX RECEPTOR XDSFPDGDYGANLE 14 T 0.67 DUF2716 pdbhh F Eukaryota T 7pc3 2 B C TAX_HTL1A PROTEIN X-LOR,PROTEIN PX,TRANS-ACTIVATING TRANSCRIPTIONAL REGULATORY PROTEIN OF HTLV-1 SEKHFRETEV 10 T 6.9 DUF6428 pdbhh T Viruses T 7pc5 2 B B EXOC4_HUMAN EXOCYST COMPLEX COMPONENT SEC8 ATKDKKITTV 10 T 78 FAM76 pdbhh F Eukaryota T 7pc7 2 C,D E,F PTEN_HUMAN MUTATED IN MULTIPLE ADVANCED CANCERS 1,PHOSPHATASE AND TENSIN HOMOLOG EDQHTQITXV 10 T 63 Pas_Saposin pdbhh F Eukaryota T 7pc8 2 C,D C,D KS6A1_HUMAN S6K-ALPHA-1,90 KDA RIBOSOMAL PROTEIN S6 KINASE 1,P90-RSK 1,P90RSK1,P90S6K,MAP KINASE-ACTIVATED PROTEIN KINASE 1A,MAPK-ACTIVATED PROTEIN KINASE 1A,MAPKAP KINASE 1A,MAPKAPK-1A,RIBOSOMAL S6 KINASE 1,RSK-1 RVRKLPETTL 10 T 6.2 CITED pdbhh F Eukaryota T 7pfo 13 M C CDC45_HUMAN PORC-PI-1,PORC-PI-1 MFVSDFRKEFYEVVQSQRVLLFVASDVDALCACKILQALFQCDHVQYTLVPVSGWQELETAFLEHKEQFHYFILINCGANVDLLDILQPDEDTIFFVCDTHRPVNVVNVYNDTQIKLLIKQDDDLEVPAYEDIFRDEEEDEEHSGNDSDGSEPSEKRTRLDYKDDDEEEIVEQTMRRRQRREWEARRRDILFDYEQYEYHGTSSAMVMFELAWMLSKDLNDMLWWAIVGLTDQWVQDKITQMKYVTDVGVLQRHVSRHNHRNEDEENTLSVDCTRISFEYDLRLVLYQHWSLHDSLCNTSYTAARFKLWSVHGQKRLQEFLADMGLPLKQVKQKFQAMDISLKENLREMIEESANKFGMKDMRVQTFSIHFGFKHKFLASDVVFATMSLMESPEKDGSGTDHFIQALDSLSRSNLDKLYHGLELAKKQLRATQQTIASCLCTNLVISQGPFLYCSLMEGTPDVMLFSRPASLSLLSKHLLKSFVCSTKNRRCKLLPLVMAAPLSMEHGTVTVVGIPPETDSSDRKNFFGRAFEKAAESTSSRMLHNHFDLSVIELKAEDRSKFLDALISLLS 572 T 2.1E-35 CDC45 unppssm F Eukaryota T 7pfo 19 U Q CLSPN_HUMAN HCLASPIN LEVLFQGPDYKDDDDKDYKDDDDKDYKDDDDKMTGEVGSEVHLEINDPNVISQEEADSPSDSGQGSYETIGPLSEGDSDEEIFVSKKLKNRKVLQDSDSETEDTNASPEKTTYDSAEEENKENLYAGKNTKIKRIYKTVADSDESYMEKSLYQENLEAQVKPCLELSLQSGNSTDFTTDRKSSKKHIHDKEGTAGKAKVKSKRRLEKEERKMEKIRQLKKKETKNQEDDVEQPFNDSGCLLVDKDLFETGLEDENNSPLEDEESLESIRAAVKNKVKKHKKKEPSLESGVHSFEEGSELSKGTTRKERKAARLSKEALKQLHSETQRLIRESALNLPYHMPENKTIHDFFKRKPRPTCHGNAMALLKSSKYQSSHHKEIIDTANTTEMNSDHHSKGSEQTTGAENEVETNALPVVSKETQIITGSDESCRKDLVKNEELEIQEKQKQSDIRPSPGDSSVLQQESNFLGNNHSEECQVGGLVAFEPHALEGEGPQNPEETDEKVEEPEQQNKSSAVGPPEKVRRFTLDRLKQLGVDVSIKPRLGADEDSFVILEPETNRELEALKQRFWKHANPAAKPRAGQTVNVNVIVKDMGTDGKEELKADVVPVTLAPKKLDGASHTKPGEKLQVLKAKLQEAMKLRRFEERQKRQALFKLDNEDGFEEEEEEEEEMTDESEEDGEEKVEKEEKEEELEEEEEKEEEEEEEGNQETAEFLLSSEEIETKDEKEMDKENNDGSSEIGKAVGFLSVPKSLSSDSTLLLFKDSSSKMGYFPTEEKSETDENSGKQPSKLDEDDSCSLLTKESSHNSSFELIGSTIPSYQPCNRQTGRGTSFFPTAGGFRSPSPGLFRASLVSSASKSSGKLSEPSLPIEDSQDLYNASPEPKTLFLGAGDFQFCLEDDTQSQLLDADGFLNVRNHRNQYQALKPRLPLASMDENAMDANMDELLDLCTGKFTSQAEKHLPRKSDKKENMEELLNLCSGKFTSQDASTPASSELNKQEKESSMGDPMEEALALCSGSFPTDKEEEDEEEEFGDFRLVSNDNEFDSDEDEHSDSGNDLALEDHEDDDEEELLKRSEKLKRQMRLRKYLEDEAEVSGSDVGSEDEYDGEEIDEYEEDVIDEVLPSDEELQSQIKKIHMKTMLDDDKRQLRLYQERYLADGDLHSDGPGRMRKFRWKNIDDASQMDLFHRDSDDDQTEEQLDESEARWRKERIEREQWLRDMAQQGKITAEEEEEIGEDSQFMILAKKVTAKALQKNASRPMVIQESKSLLRNPFEAIRPGSAQQVKTGSLLNQPKAVLQKLAALSDHNPSAPRNSRNFVFHTLSPVKAEAAKESSKSQVKKRGPSFMTSPSPKHLKTDDSTSGLTRSIFKYLESLEVLFQGPDYKDDDDKDYKDDDDKDYKDDDDK 1403 T 0.00057 BUD22 pdbpercent F Eukaryota T 7pg8 3 Q,R,S,T,U,V,W,X C,E,H,K,O,Q,T,W F7IVA8_RUEPO;Q0ABW0_ALKEH Ion transport protein,Voltage-gated sodium channel GPSSPSLLRAIPGIAWIALLLLVIFYVFAVMGTKLFAQSFPEWFGTLGASMYTLFQVMTLESWSMGIARPVIEAYPWAWIYFVSFILVSSFTVLNLFIGIIIESMQSAHHAEDGERTDAYRDEVLARLEQIDQRLNALGETKK 143 T 1.3E-51 Ion_trans unppssm F Bacteria T 7ph8 1 A A IGF1R_HUMAN INSULIN-LIKE GROWTH FACTOR I RECEPTOR,IGF-I RECEPTOR NFIHLIIALPVAVLLIVGGLVIMLYVFHRKR 31 T 0.00041 Insulin_TMD pdbhh F Eukaryota T 7phx 3 C I TTI_GLOMM Tsetse thrombin inhibitor GEPGAPIDXDEXGDSSEEVGGTPLHEIPGIRL 32 T 0.048 Cytochrom_B558a unppssm F Eukaryota T 7pi0 25 XA,Y u,U PsbU QRVRTVLDMDDPAKEETVKELRKDINN 27 T 0.018 LMSTEN pdb F T 7pil 6 FA UU U5NME9_CERS4 RC-Y EVSEFAFRLMMAAVIFVGVGIMFAFAGGHWFVGLVVGGLVAAFFAATPN 49 T 0.054 DUF3487 pdbhh F Bacteria T 7pjo 1 A,B AAA,BBB CPR-C4 GHMASMTGGQQMGRGSMHYKAQLQKLLTTEEKKILARLSTPQKIQDFLDTIKNKDLAEGEHTMWSPRAVLKHKHAHCMEGAMLAALALAYHGHSPLLMDLQTTDEDEDHVVALFKIDGHWGAISKTNHPVLRYRDPIYKSVRELAMSYFHEYFIWWTKKNGGKKTLRAYSNPFDLTRYKPERWVIATGDLDWLAEALDDSKHFPILNKKMQKQLRPASRIETKAASLSEWPKRKTNS 237 T 0.00035 DUF553 pdb F T 7pkq 1 A B A0A2K3CST5_CHLRE mS35 MTLSTQRRALLAGFGKARGGQWTQEGIAALHSTTTTNAQVETQPDGELAESSADDASRLFQELSRRNRPNSAGTEPGPSPRPVLAQLPAAAELMERAAGTPSSQALYDLPTYLSVTHPHARVEPRNPAYDWRRSPQLEAGGPRRAALLLAVDAHMAAPESREALLRMAQLELAHLWYYQQHAAELPAAASPSASAASAASTATPDAAAAGQRRGGVAKQREAEPAAASTSAAAGDKAQAGAGTGTGAGAGAEAEAGDDDIFAAADRKARERAAAVEAATAAAASSAAKSLGRRGGRLPAELEARVRDMQLRYGMASRDAASVLRLLADSFSRRLPGGGRRLEGEAEAEAAVGLGEAEAEAGGDPTAALTWALSGGGRGGSGGTISGLSRHLAKQAAQAKAADRAIITAMQSLADAVNTASSSSSSATASSSTASSSGSSAYWAAHPALAYDQTLAARAHRQGLAWRAAAEAGPDGAASLSAAITALSASLRRPTSSPTSSASASSPPATPVLDLVLDYLQAKYADLLTAWETAARLREATERVSAAVARARATVPMSAVPPLPPALAAELERSWQNAAAAFRPALLQPLLQPAGSGAAARNSLAAAAASGSPALLQAQSSLTRPLDWAKAKALIEQHYAKQQQAAALAGATAAAEATAAAAAAASAAAAAGSPRGAVEALLQPLLQRAADRHRALLAGGDAAAAEAAAAAAAAAAHGSSTAAAGGAAPSEGAASVVCFRETLTLDASSAYANSSFDSRVTLEFNVDRLAAAEPGLGGAWGARFLERLLATPALPGSGRRIRGGGGSSGMFGRARASPAARAAANAAAYPVDVEHSFCRRTRTAALSTARYGSREANRKLLLEHYNELLRAAALAAAAGASAAV 883 T 31 4HPAD_g_N pdbhh F Eukaryota T 7pkq 6 H O A0A2K3DZV7_CHLRE mS106 MDLIGRGGAPSASALGTDALVLLCGPEAQAGISTCSAAAESAASPQSSCSWVGTAQHSSATRAEAAGPSAPCGPAPLLRNLRTPSQSGRLSGAPPTWISAAAASLGNSPRFAASARCSTADVDVSRSGLEDASALGDCVDRWSTPAATVSGIALLHQYHNHHHNHHHIGHSSSSCGSVASSTSSVPGSAAGSPFPPRPSLPSSATNSSRFLMQSQSQQLRSVSFTAATAAPKAAAKGASAKAGSSSSSGAAAPSPAAAALRTPEWARVPGHLHELQDLYRKRQKRMAALRHGAQVELEAREAVAWKTGGRGRAVAAAAKAALDALPLPPPEDGGKAELAAALAADTTFAAAHEALTGLLRRGVVLDAAEHLAPLLRKAGDAGQLASALGVAAANHLSQAARQLGAHSPRHGPLHAVLVQECRRLKAAAPLVTLWESLHEHGLAPEAADAAAAVRAAVELGDGGAAVRLLMLACMYGEAPLAGAAEAGAVLQLLQQGNPDQAAQLRELLPKLGLRGA 516 T 1.2 PPR_long pdbhh F Eukaryota T 7pkq 7 I P A0A2K3CRX4_CHLRE mS107 MRKRELLNEARALVPEGSGWLEAYTRNISPRQLTWRLGKRDSLAAMTEGWQLYQGKFDTVAMAALLRRLRHAQLQDPGFDPLAAQRLLDDLVPRLRSVGLRFGKLRDITAYLHALAKLRSPAPSASSSAASPRAGAAAASLLTQPDALVLDLAVFATRNRTELLHASPQRLATLLWALMRLLPPQLYGSEQLQVVLDRMALASLGRLQNFAPLDLRWAALAFATFGPHGPSSKATATTTAAAAGTRAAAGAGSGPALPEWPRVVRGEAAAAATAGAQGAEDVTARRQSRNARVIKALCDELAARSGNLALPQPEPRDLALAAHALGLVAAASSSAGGAVAPPALLVKAAGVAARSLPALSGEEAVGLVETLAVWGLRQPALLEALRDAAGRWGEGEQAEALRGRLQAAYTRLGVEL 416 T 0.8 Med3 pdb F Eukaryota T 7pkq 14 P d A0A2K3E198_CHLRE uS4m MQRTLRSLATRCAGAITGSTAQTGASGCIKAEAGVSTSALSGLISQDAIHGLCPRGAAFASLASSGVSAAAGAGPASQLRAGAPLSCLWLLALAPSRTGLAATASSSSSCGACGSHSSTSSPFAPLPASAAGLRHYAKAAKGGAAAAPAAPSGPKRPRTSTTKLYSCRNDRLRHTHEQIWPTLQLTEYEQAMFKRNSRLFVVDMGRSLSLRDKFRMGAYEPATAASTGAAAEEAGGAGAGDALVPAAGGASYRRVPYWQARSLLHESNLHLDALGENPRYLRLRRVGSLFATKLQNVRKLRLLLGFQRRGFVQKLYEHSLLARGSDRMWKMVCAMEATLPMTVTRMGLAEDVVGAATAIRNDKIYVNGKQPVMPRKGLLEPGDVVGPAAGGAAYLRKRVARSMEPLASVVTRDYV 415 T 0.0089 S4 pdbpercent F Eukaryota T 7pkq 31 GA h A0A2K3CNM3_CHLRE uS8m MAAPLLDPLVSKLRQTTATAARAAEVMRAAFPGATHETAGRNTIAVQLPRKDVPTYVMANQRPQPWELLPMKAAAMTQYPNFFNNSCTFFGSIKRDVVNGVPFCLLRPSRLALDMAKVVRNLGIVDGFEVVQRRSRLGAHDFVWLPEQQPQEPEHLYDTSLFRQRLIRLHLRTDLFSRLPGAPGSGAGGPQPASAQLAPAVGLLPLSVKNISKASQPVLMYPRQLEEAAARLPAGVFMCYHPQLGLITDAMAQQYDVPALVAAHVGLPLSQAAAIRGALRVKAAEEAGKELRHVTQLKDWNMMELLRQRMVERRAALEAGMGVGGEVAARLQELREAGLRLRDEASDRVTSALNVAQDLEDGALAWQLVHSRALGAAPGAAAAGVDEGAGGEEQAGAGEGRTSPRGQPRRRR 412 T 1.6 Ribosomal_S8 unppercent F Eukaryota T 7pkq 40 PA x A0A2K3DXG4_CHLRE mS29 MTSALLVASRRARQAQGLPRCLLHAIGIHAGTRAEFASVALQEAGTTPSTSGQEQPSSAQLTPAHLRSYYPLNLALLPEAARGSAGAFYTPRDPGHERRGGCKALQQEMEATGRASILYRPIMAALNGAVAAGQQPRLLLTGPAGCGKSLALLGLVEWARQQGWLVVYVPSCLALVRGGYFARRGRGAAGGWDTLTSAQQLLKGVMDAHGPLLQSLPVLPVPGRAARRQQQQQHEPRQADKPAKVEEGQGQGQGQAGGSGLLEEGSGSASGAGGGGGRTLQDVALRGLSSDDNAQLAVDSALQLIRQLQLLGSGAAQPPDSQPGQPPRVLFALDDYNYLYGPTDYGVQPPSASPLQGRRRVLDAGELILARGLRLLESELGTNPVAAAAAGGGAGGVGGAVVVAATTATPALPAPRSLALEVPHTVVEVPGFDEAETAAALAHYAATGAATRAASAAEARHLFALTGGNGRELRAKAGALGVRVG 485 T 0.0048 DAP3 pdbpercent F Eukaryota T 7pkt 4 D d A0A2K3DYN2_CHLRE uL5m MLKLQPRSWDALPRLTAIEVSIPAIETQLERDVVDKSELLLYALALEVLAGKPAGFTAPANKALGTRATGVAVRLDAVTEPEAAHLFMEKLVHVLLPNQVGFEGVPPPMLVPPPRRSKAAEAAQARKAALDHRKAPAKAHFTEIKVGNLLTYPDFEQNFSLFEPLRGMRVRLVMEGASAADCAALLGGMSLPVLSGAAAEAALAEITAEVARRARG 216 T 0.0091 Ribosomal_L5_C pdbpssm F Eukaryota T 7pkt 28 BA E A8J2J1_CHLRE mL41 MTVRSIVVSLIRGANKASRQHQGDIGREAVVDLIQQSAAKQSGIRKGWQVKAATWVKRVHVDRGDVKVGRLEGGEFQVLPHLRPRYFVPADLDKFQLKPYVEVEKKVEAAKQ 112 T 0.00012 MRP-L27 pdbhh F Eukaryota T 7pkt 31 EA I A8IS96_CHLRE mL63/57/60 MFFSRCVMVVFKTTGGRSWNPPSGLRPLSPAQRRNRTKNLALTMKNMSILKLAEANQPEVPVRLYKPLNFSRMQWMKKKLEETRAALGWDMEARALQEQARALRVGGGRQGAAGSLLPPAARAALQGSVGDK 132 T 0.12 L31 pdbhh F Eukaryota T 7pkt 33 GA K A0A2K3DBX4_CHLRE mL80 MHASVSGSLDTAPSSSSGTALATASTSSAPLDLREQRHLYLDGTRTADPGEPRYTAPYWVPPSARAGIPNILFSEPWPSHEEPQLRRQHAAMCLEALKRADRPLTAEQVHEAVNSTAGYSASASAGDAGDSGAGADKPVLSTLAYTKKLLEHLRRTRFVYGRKNPDSMLSPGHPDHPRLYEALPFQAARYGKPETLAAADEAARAAAIAKAQKRLRNGKAPYPQHRRRARFSIWQHELAQEALRELQAK 249 T 0.0046 DUF4777 pdbpercent F Eukaryota T 7pkt 34 HA L A8J535_CHLRE mL87 MLALLAVRARSPSLPSITLPARLLSTQTSASVSETYSNRPTSSAESTEAVSSSGQSASKWDWKWVLGKASGRKPAITRPRRHQWHYCNPEYDPAAPLPEVLRSPFGPPGAERSHDWATYARHLQLQPENRRDLKRYRARFVRFMQLRELDWREAFQRGVAEDSRVSNKVARAKAEAQRQDAWSDYKQAMWQRAQLADSHQSHGTGR 206 T 4.2 Statherin pdbpssm F Eukaryota T 7pkt 36 JA M A0A2K3DVD3_CHLRE mL113 MRQLSLSLLARLGRGSRGSLQPASSAINSGVDPGVLSGEECSTSAPAGMPSWLRHSRRYAHQYNLIQPVDTNHINALLSSATALEHAAVPVLRYSAWFDPEQVTRTMQRVPRMLQYQRRKGRRGAAYASSPSSSADLARSLLDALGSRLAALAPACSDQQLARALWALGAARHPHPQALAAACEVLPQRLKGASGAAAAAGAGSGAGGMAMTDLATAAWGLAAAASAGPQSVREPVRRALQEVARHLVASRPADLSATPALPQPSSPSSISSPSSGAVAPAADEVAAAASAAALAADRPWLDPRSAVKLAWAFASCEVKDAAALDVVAEAAEARIASQLQAHDPTTGPLTPRATYMYQTIRGWQAWPRPRPRVIRSAASAARGGRSRYLYDDRPRVVLRDFTAGSLAQLLAALAAAGHRHEGLMQAAAAHLTASSGRSLRVDPHDLKRLAAAFARLDLAAPAAASGGAATAAALTALLSAAQLSSLPAPLLARLAILAAESGVRRRSVYDRLVRQLMARAWVPXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 876 T 0.94 CIS_TMP unppercent F Eukaryota T 7pkt 37 KA N A0A2K3DR53_CHLRE mL114 MRRAVARCTAQIAAEACSVSSACSTSGRQVQEQLDNCLRWMTYARPRSRIDYYDPTRNVYKEFQRKWDRANAAAAAAASQAEAAAARPSHSAPPSAAPQQQPYGGAGSYPFGSSLTTPCSSYGSGYGAGYGGTAAAGADWPPGYEALLQTYLEQELPLSAEEASTLVAAAKKGLLPGSRSLIRTRFMHIKDLEPRFPGFDARAAVLGEPRLLRHAADKVMRAMLVFQDHWPSHPVGPLMGRIGCPVIRDPAGVGHRLYALTRALKTDLHYELDPHRLTPESEGFLASGVSPFELEARVSALVTIFGREGAGRLLDVSLDVLTYAPRDLDRAVLALREVFSAAGDRGYGRHSLSTPEGAAAAAADRGYVTDLAVAWPGVLALPGRLGGADGVARLLARVRRAGGARYRGAVGRRALLSEVLERPELLQAAAEAAMRGEEEDEEEDVEGRAELEGKV 455 T 5.1 NpwBP pdbhh F Eukaryota T 7pkt 39 MA P A0A2K3DXQ3_CHLRE mL116 MRQQAGMLLGEAVASTSGRASPALQLIIQRTLSLVASGIPSPRADVALQLSTPHGGHINRMINTSESIVELDSILYRFRKRLRPANIGAAAMRLEHLNRLERRTPYALRVQRVAAELQKYVATYTDRLALTQAANVLRGLSAVRHRLPPELVLRLAAGAVADGGAALRLAPDVDVRDLCFGLAGQGFNNTAFWARLCAAVLPRLRSFDPNTLPALVTALQAAQQLPAPASASASSGSAGSAVAAAAGGSTPQAAVAAEALRLLSRSETLAALAPARLADAASLLAGLGPALGVAVDARLVEAVQTATARALPSLSPNQLPGLLLAVAALRRAAAPAEAAAAAAATAPQQQLPAALLATALPHLSAGAVTMDLTAVMRAARLLAPHAAEPAAADTLVRLARRTLLLLPAPGSSTGGPTGSASGSSSSNGEGLVTLSRVPRGGQAAGAVLAAAAPAGQLQGRTAGAVEGVARAFAAAAPAVAPQPALVGELAARLAAAGEAAAARGLLDEAQLASLGRSVEVLAAAGAAKSG 530 T 0.41 HrpB1_HrpK pdbpssm F Eukaryota T 7pkt 40 NA Q A0A2K3CXJ4_CHLRE mL117 MLQALAGGALGGLQTNGPANLVGALGLLQRAAAAVVTGVPSSSSPVPPHADRSLASLSAGAQSAAESACSHGGCGHDEAPCCSARSSSNSSSDAGAPRGLQQQLRSQQLQQHQHQQRRGIATSAGSALAYKFQSNVSPASSRGSGRGSKVATRDNYQRWRESGGDVRVAQDILREAEGSGRGGGGGGGAGAARRGDSRLRRGADRPGGSGAGSGGGTGAGAGVVDVQDELRAMVLGCRDLSELQVVVCECGADLNPFLVCAAAARLHKLKQATPPGASPAALARRVGESLMVLLQDRAAEAPLSQLAGAAHGLAEAGLAPGAALLEALAARCEAASPRGGXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXAAASELGAGMLRPLCDALTPRVPALSCADVASLATGLAAALGAAGGEAAAAAADGAPPLLSPSHFGSLPRLLSDLLLLRGPGQFGGRNFASVALALALVTGGPAGAGGGGAAAAGSLPPAFWSKLAAVALPEVPAMDAGSLSRLAGAFCXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 820 T 0.19 DUF4799 pdb F Eukaryota T 7pkt 41 OA R A0A2K3DTE5_CHLRE mL118 MAFLVAARRQLKGGLSLAEVALPVSVLGTCTRSNAVFTEALAGCSGASLHCSAEQPLSRATSYSDAGAGAAGVPACGSASPRQPSPEPCSSSGSHATFNSLGRYRSQHVGISRGASALSALAEASASPSPASNLPRGQHQHQQHHHQQLRTYHAWYYGSKLRNRAISQAESLEELGEMLVREGHRLDHVNLTALLAQLKRVARAAEEEAVAEATGGSSSSSSNSSSSGSSAVTAAAAAAAARAVRVRVAELAAVAARLVRRRAKWYDPRHAALAVAHTAALRHTDGRLLHDMTGRALARLDEAYSRDVLLLLRGLCAHQHMQQLAAASSPPAVAAAVAPAVPAVAGAGKPYGGAPAVLLGGVKVFLTAKVPTGRMPPENLAGLLRHWRALAPPGRRLGPAVCGVVAADLQTRTAIYAPEPLAGVLATLSAERHALPPPLLDAAAEQFAAHALTHGSGAAAARFLAAVGAQLRLQQQAXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 653 T 15 Tachystatin_B pdbhh F Eukaryota T 7pkt 42 PA S A0A2K3D424_CHLRE mL119 MATGAVAASAADTASTSAPAPPSSSPFMPRWLRNLFPGGEHLPASPAMERQTSGASASSSGGSGPGSEADDIKQLEEMRNMDMQGYVEYCKKMRGGAPPPRPRRPSVSPDHYDYRTMQDQRRIAFLRMQQHEHIGSLVTKEESDLILAKREDVVKNRALLQAIADRTGVYIDLEVKDCIEQFLETRENAGQMHRYATEFGMPLPKGSQEQREMRRFMKRVEAEEKLAVALEKRDLTSCSLRHKLSWAGPTALCDQTTLRYHECCXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXKREVVDKDRERQVHGMFKSRAEGILRKAAMDGVKPRLRDY 334 T 0.1 VipB_2 pdbpercent F Eukaryota T 7pl4 1 A A INSRR_HUMAN Insulin receptor-related protein beta chain GGLHVLLTATPVGLTLLIVLAALGFFYGKKR 31 T 0.0071 DUF6203 pdbhh F Eukaryota T 7pla 1 A A A0A8X6EH11_9CYAN ShCas12k SNASQITIQARLISFESNRQQLWKLMADLNTPLINELLCQLGQHPDFEKWQQKGKLPSTVVSQLCQPLKTDPRFAGQPSRLYMSAIHIVDYIYKSWLAIQKRLQQQLDGKTRWLEMLNSDAELVELSGDTLEAIRVKAAEILAIAMPASESDSASPKGKKGKKEKKPSSSSPKRSLSKTLFDAYQETEDIKSRSAISYLLKNGCKLTDKEEDSEKFAKRRRQVEIQIQRLTEKLISRMPKGRDLTNAKWLETLLTATTTVAEDNAQAKRWQDILLTRSSSLPFPLVFETNEDMVWSKNQKGRLCVHFNGLSDLIFEVYCGNRQLHWFQRFLEDQQTKRKSKNQHSSGLFTLRNGHLVWLEGEGKGEPWNLHHLTLYCCVDNRLWTEEGTEIVRQEKADEITKFITNMKKKSDLSDTQQALIQRKQSTLTRINNSFERPSQPLYQGQSHILVGVSLGLEKPATVAVVDAIANKVLAYRSIKQLLGDNYELLNRQRRQQQYLSHERHKAQKNFSPNQFGASELGQHIDRLLAKAIVALARTYKAGSIVLPKLGDMREVVQSEIQAIAEQKFPGYIEGQQKYAKQYRVNVHRWSYGRLIQSIQSKAAQTGIVIEEGKQPIRGSPHDKAKELALSAYNLRLTRRS 641 T 0.0027 RuvC_1 pdbhh F Bacteria T 7pll 2 B B FAK2_HUMAN Pyk2-PRR2 peptide TAFQEPPPKPSRPKYRPPP 19 T 61 NinE pdbhh F Eukaryota T 7pln 1 A,B A,B A0A1G4H7E1_PLAVI Sporozoite micronemal protein essential for cell traversal, putative ETGKDIVKILTASTTVTKTGPPPISAECPHNMVVLFGFVVKQNFWDHTNKLQSYEMEICESGASSCTSKQGNTNKYDVSYTYIECGPQALPFTEQVVSVSGTTYNSVKCPNDYSVLFGFGMATSSGRHQSALYSYFTPCRPGLKSCSLNMNEHDDKSYIYLVCVDATIWTGLNALSMIAKDDLHSAVNRYQQFNDGELVVTCPSEGTILTGFYGETHTSSPYVTVPFGKCAKSLKACSVHGSGQAIGIHNYRTLFTVALCKNNKTHHHHHH 271 T 9.2 DERM pdbhh F Eukaryota T 7plo 1 A Q CLSPN_HUMAN HCLASPIN MTGEVGSEVHLEINDPNVISQEEADSPSDSGQGSYETIGPLSEGDSDEEIFVSKKLKNRKVLQDSDSETEDTNASPEKTTYDSAEEENKENLYAGKNTKIKRIYKTVADSDESYMEKSLYQENLEAQVKPCLELSLQSGNSTDFTTDRKSSKKHIHDKEGTAGKAKVKSKRRLEKEERKMEKIRQLKKKETKNQEDDVEQPFNDSGCLLVDKDLFETGLEDENNSPLEDEESLESIRAAVKNKVKKHKKKEPSLESGVHSFEEGSELSKGTTRKERKAARLSKEALKQLHSETQRLIRESALNLPYHMPENKTIHDFFKRKPRPTCHGNAMALLKSSKYQSSHHKEIIDTANTTEMNSDHHSKGSEQTTGAENEVETNALPVVSKETQIITGSDESCRKDLVKNEELEIQEKQKQSDIRPSPGDSSVLQQESNFLGNNHSEECQVGGLVAFEPHALEGEGPQNPEETDEKVEEPEQQNKSSAVGPPEKVRRFTLDRLKQLGVDVSIKPRLGADEDSFVILEPETNRELEALKQRFWKHANPAAKPRAGQTVNVNVIVKDMGTDGKEELKADVVPVTLAPKKLDGASHTKPGEKLQVLKAKLQEAMKLRRFEERQKRQALFKLDNEDGFEEEEEEEEEMTDESEEDGEEKVEKEEKEEELEEEEEKEEEEEEEGNQETAEFLLSSEEIETKDEKEMDKENNDGSSEIGKAVGFLSVPKSLSSDSTLLLFKDSSSKMGYFPTEEKSETDENSGKQPSKLDEDDSCSLLTKESSHNSSFELIGSTIPSYQPCNRQTGRGTSFFPTAGGFRSPSPGLFRASLVSSASKSSGKLSEPSLPIEDSQDLYNASPEPKTLFLGAGDFQFCLEDDTQSQLLDADGFLNVRNHRNQYQALKPRLPLASMDENAMDANMDELLDLCTGKFTSQAEKHLPRKSDKKENMEELLNLCSGKFTSQDASTPASSELNKQEKESSMGDPMEEALALCSGSFPTDKEEEDEEEEFGDFRLVSNDNEFDSDEDEHSDSGNDLALEDHEDDDEEELLKRSEKLKRQMRLRKYLEDEAEVSGSDVGSEDEYDGEEIDEYEEDVIDEVLPSDEELQSQIKKIHMKTMLDDDKRQLRLYQERYLADGDLHSDGPGRMRKFRWKNIDDASQMDLFHRDSDDDQTEEQLDESEARWRKERIEREQWLRDMAQQGKITAEEEEEIGEDSQFMILAKKVTAKALQKNASRPMVIQESKSLLRNPFEAIRPGSAQQVKTGSLLNQPKAVLQKLAALSDHNPSAPRNSRNFVFHTLSPVKAEAAKESSKSQVKKRGPSFMTSPSPKHLKTDDSTSGLTRSIFKYLESLEVLFQGPDYKDDDDKDYKDDDDKDYKDDDDK 1371 T 0.0022 BUD22 pdbpercent F Eukaryota T 7plp 1 A,B A,B TEN4_HUMAN TEN-4,PROTEIN ODD OZ/TEN-M HOMOLOG 4,TENASCIN-M4,TEN-M4,TENEURIN TRANSMEMBRANE PROTEIN 4 HHHHHHGSMETACGDSKDNDGDGLVDCMDPDCCLQPLCHINPLCLGAAA 49 T 0.017 DUF6085 pdb F Eukaryota T 7pmp 1 A A Q5ZVW8_LEGPH Type II protein secretion LspD MAHHHHHHVDDDDKMGSKLWNLRNADIRAVIAEVSRITGKNFVIDPRVQGKVSIVSSTPLSSRELYQVFLSVLQVSGYAAIPNGEIIKIIPNIDAKTQSPDLLSGMKSPPR 111 T 0.26 DUF3738 pdbhh F Bacteria T 7pnb 1 A,B,C,D,E,F,G,H,I A,B,C,D,E,F,G,H,I Q4JBK8_SULAC Sulfolobus acidocaldarius 0406 filament. MNKKRLLLLSVFLVSVFVPVLVADVIYYYQGQITVGNVAPPMYFAIQPNGNAKIGNNSNVPSYINAQPSSGGSGFTAQVNITNATYNYYFNFMGLAVSKTGYIYLAKVAYSYTATNNPIQNATLYIMNQQGQIVYKYKLIVNGVVNSTLPSTPLQINSGSYIVSLLIVPYQGTLPKTPSNDLATITVNFGFSPMTASPPPIPLPSP 206 T 0.024 PPC pdbpssm F Archaea T 7pnt 2 B B RT02_MOUSE MRP-S2,S2MT MAPAPAVLTRLLCAGVRRWPGFLQKAIPGPAEQNGRKVTGAPVPAVSEPQDGDDFQSRILDTPLQHSDFFNVKELFSVKSLFEARVHLGHKAGCRHRFMEPYIFGNRLGQDIIDLDQTALNLQLALNFTAHVAYRKGIILFVSRNRQFSHLIETTAQACGEYAHTRYFKGGLLTNAQLLFGPSVRLPDLIIFLHTLNNVFEPHVAVRDAAKMNIPTVGIVDTNCNPCLITYPIPGNDDSPQAIQLFCKLFRTTINRAKEKRRQMEALHRLQSPKGSEGSGTSPVPDKSHSP 291 T 1.0000000000000001E-29 Ribosomal_S2 pdbpercent F Eukaryota T 7po6 2 B,C,D B,A,C YTDC1_HUMAN SPLICING FACTOR YT521,YT521-B GGGGTSKLKYVLQDARFFLIKSNNHENVSLAKAKGVWSTLPVNEKKLNLAFRSARSVILIFSVRESGKFQGFARLSSESHHGGSPIHWVLPAGMSAKMLGGVFKIDWICRRELPFTKSAHLTNPWNEHKPVKIGRDGQEIELECGTQLCLLFPPDESIDLYQVIHKMRH 169 T 7.399999999999999E-32 YTH pdbhh F Eukaryota T 7poh 1 A,B A,B SRYD_DROME Serendipity locus protein delta PEFMDTCFFCGAVDLSDTGSSSSMRYETLSAKVPSSQKTVSLVLTHLANCIQTQLDLKPGARLCPRCFQELSDYDTIMVNLMTTQKRLTTQLKLDK 96 T 0.0082 zf-AD pdbpercent F Eukaryota T 7pp2 2 B B C4B8B7_MAGOR AVR-Pii protein LPTPASLNGNTEVATISDVKLEARSDTTYHKCSKCGYGSDDSDAYFNHKCN 51 T 0.0029 zf_C2H2_6 pdbhh F Eukaryota T 7ppl 2 B B IRS1_HUMAN IRS-1 GRKGSGDXMPMSPKS 15 T 0.7 STAT1_TAZ2bind pdbhh F Eukaryota T 7ppm 2 B B IRS1_HUMAN IRS-1 EPKSPGEXVNIEF 13 T 0.29 DUF4834 pdbhh F Eukaryota T 7ppn 2 B B CD28_HUMAN TP44 RSRLLHSDXMNMTPRR 16 T 0.0099 WBP-1 unppssm F Eukaryota T 7ppo 2 B C SIDJ_LEGPH Calmodulin-dependent glutamylase SidJ HHHHHHSAGLEVLFQGPMVKQYYFARRGETSTHDTSLPPPVKVLSGRSIPLKEIPFETTRNELVQIYLTSVDQLIKSNKLNSIPSQQIASHYLFLRSLANSETDGIKKNQILSLAKPLGIYLASKEPHVWKTINELIEKSEYPIIHYLKNNRAHSNFMLALIHEYHKEPLTKNQSAFVQKFRDSSVFLFPNPIYTAWLAHSYDEDSSFNPMFRERLSTNFYHSTLTDNLLLRTEPKEVTLSSEHHYKKEKGPIDSSFRYQMSSDRLLRIQGRTLLFSTPQNDVVAVKVQKRGEPKSTLEEEFQMADYLLKHQSRLDVYSKLPQPLGQYSVKKSEILEISRGSLDFERFKTLIGDSKDLEVYVYKAPLTYFTYLHDKNQDLEDLTASVKTNVHDLFVLLREGIMFPQLADIFHTHFGEDEREDKGRYQALVQLLNVLQFQLGRIDKWQKAVEYVNLRSSGLADLGDSLPITSLFTSSDFTKHYFSALLTGGYHPTFFDKSSGTANSLFTGKRRLFGNYLYLNTIAEYLLVIQLTLGSYGDKVTRDMMDKPKKEAVWRELANVMFTSCAEAIHIMTGIPQSRALTLLKQRANIEKHFRQTQFWMTPDYSKLDEDTLQMEQYSIYSGEPEYEFTDKLVSGVGLSVDGTHQDLGGYNRESPLRELEKLLYATVTLIEGTMQLDKEFFKQLQQVEKILSGEIKTDANSCFEAVAQLLDLARPRCHFQKRLVLSYYEEAKLKYPSAPTDAYDSRFQVVAKTNAAITIQRFWRETRKNLSENSDIESEKPESERTTDKRLK 794 T 0.28 DUF5415 pdbpercent F Bacteria T 7pqd 6 FA,GA,OB,PB UA,UB,ua,ub PufZ MAYMFGIIVFLAMLAVCWFGFMAAERQAGRL 31 T 0.031 Orai-1 pdbpercent F T 7pqw 1 A A BCR4 DFDPTEFKGPFPTIEICSKYCAVVCNYTSRPCYCVEAAKERDQWFPYCYD 50 T 6.2 Ragweed_pollen pdbhh F T 7pr7 2 B B HPSE_HUMAN Heparanase 8 kDa subunit QDVVDLDFFTQEPLHLVSPSFLSVTIDANLATDPRFLILLGSPKLRTLARGLSPAYLRFGGTKTDFLIFDPKKE 74 T 3.5 Glyco_hydro_79n unppercent F Eukaryota T 7prd 1 A A NAB3_YEAST;NRD1_YEAST Protein NRD1,HLJ1_G0022400.mRNA.1.CDS.1 TANTASQQLSLDPKQRSKQILSNLKKSPPLNLNISLPTDLTSTDPAKQQAALFQVIAALQKHFKTNMENVNYDLLQKQVKYIMDSNMLNLPQFQHLPQEEKMSAILAMLNSNSDTALSVPPHDST 125 T 0.066 OSK pdbpercent F Eukaryota T 7pre 1 A A NAB3_YEAST HLJ1_G0022400.mRNA.1.CDS.1 GSWGSMENVNYDLLQKQVKYIMDSNMLNLPQFQHLPQEEKMSAILAMLNSNSD 53 T 0.34 DUF5452 pdbhh F Eukaryota T 7prv 4 E F PRGC1_HUMAN PGC-1-ALPHA,PPAR-GAMMA COACTIVATOR 1-ALPHA,PPARGC-1-ALPHA,LIGAND EFFECT MODULATOR 6, PGC1A COACTIVATOR FRAGMENT PPQEAEEPSLLKKLLLAPANT 21 T 100 Neurokinin_B pdbhh F Eukaryota T 7pua 8 H CJ C9ZPU0_TRYB9 LysM domain-containing protein MVRRSHVVAYYWSRYRMPTQMPKFDGPAPVAAPQSMNSTKTNEFIDPIDDKFPMSIRGPLVRPDVPEDQYVDSWYICTSMTHHMGDYRPWSASAPPNAFRFRPFNEFDAKGREYVQYMREFARFDPRKSRGNGQKGFPFRDAYLTKMNEANQKTPPPTLETIMDRAVREHHQHARILSPLEVQRDVGRLEPIPSYAGKINADRSVFPFQWKTEDWYEYEVAKVRNRRFVFENTEEDGIRGSEVTYKIVLEGFWDHHVMKLAEDVCMFLKDVGRQIVEEKLVAVRRLLQGGAVDPELLAAFNCARAGPFGGLDEYDKEEVANFLRSDLRRLEEQCLSVINRCNVPVPGATNIYDPHTSWPHVEKLEPWVRMAEFWTSSSDTSFTELEMSTAHYEFRKFFRVIICKLPFQSTEFEKRMYDIRHWLHRQTSCEFHTIYRRNVIHDSAVFPTEHDPATPTTHEHHRMFSFALDWQSAPVNRLSTDTVHEGESWDAVAQRLGCSVGELKDANAERETIEAGVVINVPVTATRRLTSFGATPLVLPLKTTSAKDGERIRTWEEAAAILDCTVEELQQCNGHAALTYQKKESEAGEFDSSVTELVAPLSCWTSTSESEFSPVERVHANDTLVAIARRLQCSEEALRAVNDGITDVSGLDFVRVPPEARRPRRLVEPQLRPQAATDALLARTIAEEETFKLKSIPHLPQNAERFPHEYHTPTSRFPPTPSETPATQDWMAYTAKYLDKQFTISAEPAPVYNVNKLWPMQQIPGKVDQTPFEEDQTWLLHSIPVQQLEMHHHEKDLQDLPFINHEQFPRSLEWNAP 817 F F Eukaryota T 7pua 19 S Cb C9ZNU0_TRYB9 mS23 MRQCVVRRYKMPKNMGVAPRFDTWNEKYEPWEHMKRMGRLVGTGFYIPPEWYNHFRMFPPINHNFQQEKTLNPHNASEPTQDDTSTLSPERVALRDELARKSRLVASEGMRYYNIFWVRKPLDTMEKEYYELKRRGVDHGEAIRKVLQGFYSGLAVKKRVAAIQAEEAKLTGRFITMREATVVLGVLAKLHKEQLTPHQVSLLAKEQGETTQSGAKLTAIVSRTQPHVNKEASSPATSEAVGSSTEESLSADALASMLSEDGEQSAVGTRYQVEVKETANDSVRQLREKAEDQTGFPDWYTGESPTYSGTSXCSRDGFALMKANK 325 T 0.035 CHAD pdb F Eukaryota T 7pua 20 T Cd Q38DK6_TRYB2 mS26 MMRSSSFCRRQIRPYYNLPSKSEHGRKMTGFLTPYRHWMWKQNELWRNVHEAQFEHLRRVYKRQWLESFRVNADEYIYKYNITKAAQLAQWECEMKEQEKKRIEARQMMDGRQALKKKHLDLLREFHERQFFFWYERASERLQNMNLINYVPHAQLREHIDKELDKYVAGKNEPYPLNFVGQMPFLEDGDGNIVEVPESLLSNHMAEHPDSTAKPHEPHTSSSISEAAAFEERMLRAMVSAKEEDLKEWLGDDSRALSETIDDISREEEEREADIRVARSMEETDAEREVSRRAYIERXKTGSRSIFRPPTVSEGAGGTPSAPAGDANTPMRRRKKGKLDRVHALQAHQDELLAKLSSQGLKEXVDASSVPERGKIVQSRGRIRDKAVIPTHEVLMQKPELAAGSTPGARIQTKDMVDKMYHRGKYKKSGSGDKSDGEDL 440 T 0.028 ThylakoidFormat pdbpssm F Eukaryota T 7pua 23 W Cj Q57UK0_TRYB2 mS34 MLRCARVALRADPLNGGSSMTLGSKGSKLSPEPHRRRMPWTAAKEYVPGVVLNARDKMVLDGVQLLDIESIDRASQLDPLEVLRAVVATREYNISTGKNIFQLASQATYNGRGQRFYRKEWQEGTYDKYVTLSAIDFDRDGNKGTAYGYITFHGETTTRPVQVDFADVPGWYMDFVEERAVPFTGIVPPPPSIGTDVPVDPHSYRLKAYPYYDAPNPPEFVERLLKDRGVLPDTPTETADVDKDPTTSDGSVHYDGK 257 F F Eukaryota T 7pua 30 DA DD D0A752_TRYB9 mS51 XXXXXXXXXXXXXXXXXFVFRDPSLXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXHANTQYQHAWPLLSHDDLGNSDQSNNTKNIMYSMYMPKRNKGTAPWFRGADTYSVKYCEQGRYEYQRYLMINRFPSEYKKHFLSFLSNIRMSSGSATIPQEALHWLLRMIVDNFNPQHVHYIAAMKTLQSAGELDMARDVWKIMERQQTWPCTATICAYLDVCVEAGEKTWAMEAWNRYCTELKFLEPGEVDPKPISRVPFSLTREELLYLPKWKKHFDHDPNLDVMDLNRFNRTREVYLRMAQVMLAGGERNAFQHFFTKLEEAMLNKPTPVPEPPNPHLVRRPRWAPYEHCKSVHHSPWRLQNNGRALALGPPVTIEDEMQSRFFSNDQFLVHSVKEVLRIVLQEHKRAHPTECTRCKTEAFFYKTKDADETLKFCDDLIERLFASLGVRLSNLNTSSLLSTILEVFRVVGKESGAALLQRANEFLERKASLGDAEGSRENLTASNYLQVLSGFADESAFVYNTKKDGTCQYKTGFDPRTTMRHLADVVQEIAGNPHVTWAADMHLQVVETMVGCGTMKANDYFVRNVLRQFSWDSRFLEALYVEYRRQDDVDMWAELTKRALVWTARYNAPASERLRRLIEDDYDTIRVQTRTFRELAVFQFRDVEERRHSRDVVNELPNPWYDYVAHALPFPDRDAGYPDEYGDLGQWRAPGGPGSPVRGPGYYAPPMEGEHQRGYTAEWRDLRNPMRPPEFPTPWERKYRQYARGQHPSYDMVYAGPMPEIFPMRRDFRKPTRWDFHDIEKQGKYRTSGPY 812 T 0.0012 PPR_long pdbhh F Eukaryota T 7pua 31 EA DE Q386Q7_TRYB2 mS52 MRRCSFASTCRYRSADDVPLGLGSAYVWTCRNITSRGQFNPIHNFSYAMERGVRARDVKAFEKLITNPGPLRVAYTPDYLDWLHRCYKAKGTYMDARAVAEKKFNGNIVSSELSAAVNRREGKRGDTRGDTVDNDHHNLPGAPPPGMFLRPAHSFRRLAGELKRRRAQSILDEVARAQGMLDLFERQPHFPAIHIDRCSRFHLVELFKEMVLERSLDSNMIWEKALLYRAILSERKPSYPTSFHYIFTAVEDTVFAPTISTPMEMAGSGGEGHDRSSHPLAAKCPTLEAYYYYVYLVKKYYIDNAVEAHVVLRCHREPNAADLLFSNPPPKDDTEIMKAVELLRNADIQRGVAAAAAVSDPTLPPGGEGSVIGNSDNXKNSEKXSEGSRGRPARPPVLPGAYPPIDMLWRCEENLPLLKVLLFGEFNLIVSENPFVKFPSAHGFLTRPYSTDSSRTLADGMSLANVMAEKRGHLLPSLPRNTATSIDARAQDIRRLQQKHHRDDIVSFQKLLRSTHAEDSPSAFSSYSDWSYFNPRAVRAEERDRLTRKAVEALKLYDSATNDIYRHSFEDVQACHTQRVTERDRTMPPYLPTLPHFVAIIKKDPHISFLLHIGLPDRNSSEEGSAKHKELEKRIYYLARALYHTALEYHNETVRRVNRQKVNVAASLLDNFVEQEWTTILRDKHDVTDVTKTLNDTQNDKKQLARRLGRYMLFANRSLDDTGFPTDARADDYTRWMAPPSVGKVSL 747 T 7.7 Complex1_LYR_2 unppssm F Eukaryota T 7pua 33 GA DG C9ZNY4_TRYB9 mS54 MFRRAIPLLSANIPRSVWDPAQHNPNWSDSYGHDITNRRAWPARKWTVGLEPCTPREWLQFSHRNLAYAYNGALRACHSLPSMLLLYKEMKQRGVKVDVDTMNVLLTRAARHEHIQVDDVFLLFDELVALGARPDLAAAETLHTVLSHSASMPEEWREARRLQLVELYNNLAMEEVERLAPHRADRLLKEQMKRFRGNLQQLGSGLRPTVYCRYLHTTHTAAVLLEEVHNFLWELVPNDHPAMEIPALQLRVPFVASVLRRPSVNPGVSLASVSRAEFGDTDVCAVFLAAAERMVDADFDDQRPVSERRLFLSLLTMISYSGVLYTSDLMAQLMEMVKYSNNDETRDSDAQRVLRYALRGSSAAQDSASRTLWHSVEKVADCRVVGRYIGARNPWNPIRVCFDEQGVFKAYPISTTTTTREVSPPEGNGAVTQEQRASCVEGRTLEALNMRWDDVRRLIECTGVLVTPPSERCPQQQKMEVFTGMAVYLRTVATGRRYEGGEDVLSDGAVATSSCEQRRRGTLFAEGYDFDVWVRLFSLVQEVRHDMEKFMADHTLQCVEPEFECWEALLVTLRCALDFCVVQMQGGGARGTEREVVERLFRDVVALREELIEESRTRFGGRMRVLWLQEA 631 F F Eukaryota T 7pua 34 HA DH A0A3L6LGC8_9TRYP mS55 MLSQNVAKTTVPSYYMIRTNLPHRKPQNQWEGVYYYSGITKRQRHLILLHRKREREAHMRSFNISRASVLQRLEQLSGDRKQESLPPHVRLDLAVRLAQHGLYQQATPIVDELHHQKALHAGHYALLINALACPRLGQRILHCDAQCDPALTYKLLGDENGEERAQEAYRWFDLALTSLAVDCGGRTQPSHFVPYLPQGTAAASHITNALMRTLLTCGYTHVAAIPDSVYDRMGSMGISPTISTYELVMLALSLQGNMVEAESILSFLRSHHSEHITVESFNALLLGHREARQFDCCDAIWQELVDRRWPRASPLTAELYLRSIMDHANTPTSEPLQSFANINVVEKKKVPLVLAQMDELGVPRTHLSRVLMDEVEDSLRKFQTYRSRFYEWGRAVKQFDFIEFRRRNGWLYDLHLMKCTTKQVGPLRDFNDPDAVQGAVATAEIPAFFNERPAWERPPLEETLYVTTNKERYDDVRGGDIYYDDTRGLHDRSPTWMNEVPETRYDRLYGVNHPDIAKIGIRRHLNVEYVNRKEVVERDAALMKKTLSSGRRLRHRVESSRTHRNAGSLSGISSTAGGGSR 581 F F Eukaryota T 7pua 37 KA DK A0A3L6L3U6_9TRYP mS58 MSFRYTNNLIGALKHRLLLESSYREIASRKFIGNCRGVEVVCSGYGTVLAVQLTDKAVWESFYRKGGRPTVSGGGDVSGDAETGTQGSATTTGATGDLDLDKLAESIKTALWDATRKIRSAKEAALHRSLSHNTRMRASADLKHWYEEDANTLRPLAFEALKHEAATPWMQLVQHGKKEEAAALLKEFEQKGDAAEATPTRVKDDRVKGTRTELSNREQPPLASTLKAEDSNPATIPIGSVHPLFTPALVQIEEAGGGSVSNEAVCRAQLWELSRDEQLFWERVELIRKGQVASIGSSHKRGYADEAAFAKDDTEEKVQLRFTQ 324 T 0.00027 YbaB_DNA_bd pdbhh F Eukaryota T 7pua 55 CB F9 C9ZSL5_TRYB9 mt-SAF9 MTGPTRALFLSSGINLGRLRLAEQFSSMNGWQSKEDPAFDAYVKERRRKENYEAFDQRVERGYAAAAKLHKAEIQNAVKRRLKSSGAKFTAETLREMSSAVTERLAWLRDVWAQIDADYRSGDSARQETAAQEISAALRGEPNDYMRWVYETKRELRFAGPVGRRAIQEELQAAELPEVLDEEVNRYHDLKLNMMEIEREVKAKYGVAGQQHWAELQAAKDEEYIQKLDEAAEVYKQLLDQSARLDESRRSELQRSYVERVHQAQVRFKAAMELEGQREQLIEAHQAMKEERMRTEREKRRQLLREAAELRAQGKKSADVLTALKERQLDANAKRQAEYELKECEDILKRKSEMLDMIAHFKHDVEEREGREMLQRQKSDERQVNVFGFYEEVGVEDGLSISSEGTTSQGGSSGLGTVSTSTSCAKSADSNSSAQPSQKLRKEELWKVINADTYEDPFRTVHQARLDAVKTYDPAYARTFPLNLVLGRKYSRQGAGEMAAGNETDKQILQKGNNILYSFQWGLNNGTVHDLDADGGTDYFMDGAFHVRDKETGDIDWRYEKKRGGPVFRGPKFYRLGAQREAADPGERAMDPTPYTSTPREHKWRSS 607 T 0.27 HMGL-like unp F Eukaryota T 7pua 67 PB Fh Q38A63_TRYB2 mt-SAF37 MWQRLRFDRLSSSVRRTNLNPLKPCAALTEQRAELRNLHQYPTARHKSLVKDRLRFARNWWLTGGNNYELVHEVGHEREATECFAEYAQDSSRDVYLMSTNRLSDLPPGDRLKAIVGLMRSRWEVKDANRGYDKAKLLLQALECFSEMKASGQIGDFNSLPEPDQDTFLQYVEGCSRFAQACSHSHPDAVRVLLRAAQICEEMRCVEKRDEMIQVTEAAANRMDRAYAFSRPHDTLRAAPPSLHENEDCVRLKNTEELRRRFGNTAPHVLEKPKRVDCLRIHRNRPLLLHPMKDNNKLLELSKLPARPEFDSWTSHQT 318 T 0.077 ATG8 pdbpercent F Eukaryota T 7pua 68 QB Fi C9ZNX5_TRYB9 mt-SAF38 MRGSLPLLFNPVLPPSTARLRLLTYPMALAQPHATVPLIQPTIDGTHDGRNGATVSLRTQARMHGTADGTMATAGDSSQNNSVMDSPRWLRNPDELCVAALRRSRDVNKINSYVATYKFDDPQWAPLLLPEVTLRPQVNSTGDKPNGGNEAAADVVSVGPSVSATPESTPPPPPSSSSSSPYSCPADCVSISHNKMIMLECMSRHVNFSLRHIVQKGHGIYLIYHAQHSILQPKGLVEQSFVTCSFGIRGERLRTDIVHVGPIDAADVMELQPSEGHDHPRCCFNLYQKSDVRRGVIAVSQVEGYGTWFQRKPMLWQRSRRIGALQSQLGAFAYDLVDPHEVGKWRDCEVSLLAPHMRFFRNGLNGAEAVGIIASSQVAQQRRLYLGEFEAPAITALDAVQQLAHASALRCKLVTPVVDPNGVGGTGSGSLGDENMDKHIDMETLLPLSWATRTPPPYVPLEADLPFKLQMSRPTVFAESHQQNQAYPTGGTVGSPFVRGAPMMMFEYNMHQGVDHYVYDDAPSARPMKWWSQKSNMPYSGYMYFARSGLVDRFTPSEDIPNPLEPTSKRKPLHAVVPPTKVVQERLRKYRRKQQEGHKQRRRASSGSGVSNEPDAVNRQESVSRGTCE 629 T 51 RCDG1 pdbhh F Eukaryota T 7pua 70 SB IB D0A0V4_TRYB9 mt-SAF39 MRRSGRGSAVRWSSLCKCQCCLYRTPLGGTYFEQALPRSLGARQGKGVLSTVNTALSRKALKRRQSLPRKKLNVPLTAEGLKERLKQLSAEERELSIKNNTEECDEPSPNEFTTTHEARVALARVLHHGENAGERKEVAMRIPSFCRSPAVSETQSIVVDDKEGDITNAAVHVGCSVLGSDLDHLERDMIRDYHQRGKKLPTFDNIYRTLGCGRKGTSVSDTEPEDENSSGAIQSECGLGDAGRRGTVVVAPSHLHHSTPPTKGRSGEEEEEGGCFDTNTLPADANPHFPPGACDNEVLAPLSGGCAASEQTEITDTASFIPSNSRLSTAVYDAYRQRPADDRLVVLRGTDFWDNEENRARLQELTDYAEEDFAREMLMEGAMDTSEVGYSTNKVRKETLLYFQAHPINEMIQEPFARVRSILPSDGGPEVHFPADDPDTDVDIPTAQARTMARELGLDLIRVGTLYTPINDRRVVAVCTIADHREHMRDMIRFKIKKLGVQRPPTKEGIEVPFRGGTHPHAVRFKSIGIAKHLLLGHVVRINLTDFGTVREGFPVFGSILDEVARQALQLHAYHTAGVVRANYNEVYCYLYPSTGRSPKSTVLHPTQEQLATVRDRCLLEREREVYFDGLYDKKTPRERLTYMRKLQDGTAWADRDDGLSLQRQRDMKVMLGYLPKGNHELYAARGDVNVPAPFRASHPTSVDRWTHPQESNLEQAARGSAVLAKRLSMTVSEMHDRQETAENPATLDRFYYRIQGPALEAGELKEALGLKGNRKRLPRRAPGWATLGMEKVSPQEPGHAAK 803 T 0.00057 mIF3 pdbhh F Eukaryota T 7pua 75 XB UG UnkG AAAAAAETDWKVIAA 15 T 14 Mastoparan pdbhh F T 7pub 30 DA DA Q57UJ2_TRYB2 mS48 MMRRVASTGSNGGHCYVGSSLWADSVYMEVSSITLQKRFSFKYATKLQHDEMRQPYYIHEKRYGIFSNERNIAKARRGLPFITPLYTKHMNLWDTDTDASNNRFFRGYYYGQRELHQLLGRPHSLEATNADGSNDLSTYEANTSQLYKGIPRPAITNLHYEPAWRYTLYQAGAHGAQLSNPRSPFTAKVLGDELMQVRDIKSVEHCKAWFDRLQYLINLHYEAVGDIGEFKSRHTRHVHEFFVAFHDALSSLDFRDTYLFDQFKAVRPPELSDLFGIFLEMEANYVHEDYCPRCSLPYSTTRYCGEGDVDTPFRKHRGRWAPHQKWGREWYDVVARRAEALWYRATEDPYFGTPQHTQRQAEALLKVYVQAKQRGKAIDFMNKLRGSVEYLTGDITITTVMQESYDALLDTTPHPHLLTNGFTLESDAAKYTGEVNKAPLSPLQFRIDMEMNKYRRQQKEEGVVRVPPALWRLDTSAIVPYKVDSKTRRIVNWREVKEGIEKSFLSVGLPKEAYTGNEWREMLYLHDVIANREAKAAVLEQQQKLDREKLKARERASLKGEKSSNSASNGIGIIFPDKDGYQVFFVDEASLRPFGIGASGTLFKAVARVYPSPAAVPYDDPVHGKQFLIDTTDETCHLFGGFEHGDRLLLRAKSSGGDVDGGNNSNCRNTGSFTDEEEEVIVMGVSTEGSDAERMLCAMHVDTGKQRERGIVFLGTDCIDIRERWESVRYAASAYVKGRVTLLESQRTAREEFMGQVVGIRDGVLFVQWRLLKGGGSVLDRSVAVPIGDSTQVRELFVETKVLGVEPLQSPPSWRTPFRNDFAEERLKELQRAPFKREKYVSLIQGKYTPKVKKFGYTQHTTVDDFETKEYKDRLLSKQFFQNPQAFEVVPDRNERSVQFGGKWEYQRTHGLPTVDRNELENGWSEVEPISDGEMKVIEQALRDISGPRPGNFVKPPSKTKSLQLSESWWEPLGYGWEQHNEEQKALCDVTEQRLIDGSNLPFGGKLPPFGTSYGMGERMRAIIEDYSKGFGLGPHGHSPTHDTIHYNTLNAEGERVRDLGYTDALGRLFSEKLGDRDVHQWAVESCADGEADVRQLLLSLHEWRERGRPPSLLLANVLSKYLEEEIAAFNKGIPENAPKLQLETSDGTLAHSGSGGSRSGTMWADVDPTTFALHQASQSTRSTRCDEPFILHLVKRAKLGECVTNFTDTAYIAHLESSVINEFHLALTKLVGKGISPTLLAQKTGQLHRGSVRVGGNVVPFVKSRELSRLLERMGLTSESIAVITRGLANCPEQESVGDDFAVPVSLILSWDGPGSSGGSGSGDRRLNTTAARNEQQRKGSAALSSAIRQLSQHQKQRGSASGRGWQNEDKMVVHVLEELSLRNDGLVMDIQYAIRENKKNRRLRWEFVTTLLPVFGGNEAKVEQLYSDYCDGKYVPNITVAVEAFIAFLHNATQHPETYSASDYFDIDDGKSSTASGTGDQPSAGQYTTLKLLDPLEGPFVFDNVKVEYIQTVERFRRHGIRAGPVMAPATGFIAANCKSLNYFTRRQEEVVYVTNDSDQGLRRSLENSAYHKTIAANPALQYLLKARRGAALVETFNRFFYRTMPMLNFYQNVLKHYSETIQPLRQAAKSSTRGLARALESERSAAMEEFRRNSERYWRNIIEGRSVEQVVLANGESNRRTVGGGAGEQLQKPQGNRDAVRTNFTGEATDGSRKGQRQHQQPPLSQQQQHSRGEGERSAVHTSPRQGGSRGMADLLSKLNTPKTGSGVKGSTAKNTGNNRRGPKS 1788 T 10 DUF5053 pdbhh F Eukaryota T 7pvm 1 A A G0S058_CHATD 5'-3' exoribonuclease GGEAKARLCKLCGQKGHDERSCKGEAKQKQG 31 T 0.0018 zf-CCHC pdbpercent F Eukaryota T 7pwf 10 J A RSSA_GIAIC 40S ribosomal protein SA MSTEKTSQASKEYQLKEADVKKMLVATTHLGVRNIDRRMQFYIFDRQKDGTFVFNLQKVWAKIVFAARILVTIDDPAEIAVVANRPDAQRAILKFCKYTHATAFPGRFIPGNFTNRMNPNYCEPRLLLVNDPVVDRQAILEASYVNIPTISLCNSDANLKFIDVAIPCNNKTPMSIGLIYWLLAREVLRLKGSISRTEEWDVKPDLFVALPEEIPDEEESEDFYDDDEEEDEEFSAGNGNLFDEY 245 T 1.1E-11 Ribosomal_S2 pdb F Eukaryota T 7px1 1 A,B A,B Conus mucronatus GENSDNLTHCRLFEFRLCLLECMSLTLDHCYARCTTVITQIHGSDTNRFDCTIFKTCYYRCYVLGKTEDHCWKGTATSVTGDVGDLEFC 89 T 6.1 Toxin_25 pdbhh F T 7pzn 2 E M SLLGRM, modelled as poly-A,SLLGRM, modelled as poly-A XXXXSLLGRM 10 T 16 Cas_CT1975 pdbhh F T 7pzt 1 A,B A,B A0A4P8JK46_ALCFA Urea amidohydrolase MNLTEKGTKTAKLSASDRIIYADNHLIHGPDDITAYMKGVCYDAAAYMRYLYNAKISFDQLTSISAQNWLPVFKFAEGRMWDGRNSLPGGKAIGFCRVKGMEFFHAAVAVGGTEIRAINGGLLGAGWLHPVDLRKVLTQKNPDGSFKYDGTDIFVYISNL 160 T 10 AAA_assoc_C pdbhh F Bacteria T 7q02 1 A A Q6SVB5_DIPPU Milk protein IAAILVANAKEPCPPENLQLTPRALVGKWYLRTTSPDIFKQVSNITEFYSAHGNDYYGTVTDYSPEYGLEAHRVNLTVSGRTLKFYMNDTHEYDSEYEILAVDKDYFIFYGHPPAAPSGLALIHYRQSCPKEDIIKRVKKSLKNVCLDYKYFGNDTSVHCRYLE 164 T 4.6 Transglut_N pdbhh F Eukaryota T 7q1e 4 E P CENPJ_HUMAN CENP-J,CENTROSOMAL P4.1-ASSOCIATED PROTEIN,LAG-3-ASSOCIATED PROTEIN,LYST-INTERACTING PROTEIN 1 MVNIEERPIKAAIGERKQTFEDYMEEQIQLEEQELKQKQLKEAEGPLPIKAKPKQPFLKRGEGLARFTNAKSKFQKGKE 79 T 0.11 DRAT pdb F Eukaryota T 7q1f 4 E,J P,V CENPJ_HUMAN Centromere protein J MVNIEERPIKAAIGERKQTFEDYLEEQIQLEEQELKQKQLKEGGGGGKPKQPFLKRGEGLARFTNAKSKFQKGKE 75 T 0.64 DUF1654 unppssm F Eukaryota T 7q1r 1 A,B A,B apCC-Di XGQLEQELAALDQQIAALKQRRAALKWQIQGX 32 T 0.0002 DivIC pdb F T 7q1s 1 A,C,F,G,J,K,N F,L,J,D,N,H,B apCC-Di-B_var XGQLKQRLAALDQRIAALKQRRAALKWQIQGX 32 T 0.0029 DivIC pdb F T 7q1s 2 B,D,E,H,I,L,M E,K,I,C,M,G,A apCC-Di-A_var XGQLEQELAALDQEIAALEQERAALEWQIQGX 32 T 0.00059 ABC_tran_CTD pdbpssm F T 7q1t 1 A A apCC-Di-A GQLEQELAALDQEIAAAEQELAALDWQIQG 30 T 0.0029 ABC_tran_CTD pdb F T 7q1t 2 B B apCC-Di-B GQLKQRRAALKQRIAALKQRRAALKWQIQG 30 T 0.0019 DivIC pdb F T 7q21 13 W,X V,v Q8NS61_CORGL Actinobacterial supercomplex, subunit C (AscC) MFPEFERMYDMANVEKKHFVDPAWPEHNPADGHVVTELISKVAGASSPWGDDKEFPVSAEETGYVHPYTRINR 73 T 10 PHYHIP_C pdbhh F Bacteria T 7q21 14 Y,Z K,k Q8NSJ8_CORGL Hypothetical membrane protein MYMGKSFALLVLGAIILAGGVWYTIEVGYSVMAIVAALIMAAGGGIITWGLAVAADVNSPTSHKI 65 T 0.00051 DsbD_2 pdbpssm F Bacteria T 7q3u 1 A,B,C,D,E A,B,C,D,E TADBP_HUMAN TDP-43 NPGGFGNQGGFGNSRGGGAGLGNNQGSNMGGGMNFGAFSINPAMMAAAQAALQSSWGMMGMLASQQNQSGPSGNNQNQGNMQ 82 T 0.024 Glucosaminidase pdbpssm F Eukaryota T 7q42 2 D,E,F D,B,F BAZ2B_HUMAN HWALP4 EDDDDKDQDESDSDT 15 T 0.0019 SDA1 unppssm F Eukaryota T 7q44 2 D,E,F D,B,F UBP35_HUMAN DEUBIQUITINATING ENZYME 35,UBIQUITIN THIOESTERASE 35,UBIQUITIN-SPECIFIC-PROCESSING PROTEASE 35, UBIQUITIN CARBOXYL-TERMINAL HYDROLASE 35 GFDEDKDEDEGSPGG 15 T 12 DUF3245 pdbhh F Eukaryota T 7q45 2 D,E,F B,D,F MYT1_HUMAN MYT1,MYELIN TRANSCRIPTION FACTOR I,MYTI,PLPB1,PROTEOLIPID PROTEIN-BINDING PROTEIN RSDDDKDEDTHSRK 14 T 1.3 PTN13_u3 pdbhh F Eukaryota T 7q47 1 A,B A,B H6WYJ5_9CAUD Endolysin GAKTSLPRGIRNNNPGNIEWGSPWQGLQARTAASDPRFCQFIDPASGIRALAVILTTYFDKRKAADGSKIDTIREVIERWAPPKKNGVVENNTTAYANQIARVLNMQPDDETLNLHDYETMRKMVEGIIRHENGSPEDYDRAPYNNINQWYSDEQIAEGLRRAGLVKPKT 170 T 0.044 MPAB_Lcp_cat unppssm T Viruses T 7q4i 2 C,D G,F MUC1_HUMAN MUC-1,BREAST CARCINOMA-ASSOCIATED ANTIGEN DF3,CANCER ANTIGEN 15-3,CA 15-3,CARCINOMA-ASSOCIATED MUCIN,EPISIALIN,H23AG,KREBS VON DEN LUNGEN-6,KL-6,PEMT,PEANUT-REACTIVE URINARY MUCIN,PUM,POLYMORPHIC EPITHELIAL MUCIN,PEM,TUMOR-ASSOCIATED EPITHELIAL MEMBRANE ANTIGEN,EMA,TUMOR-ASSOCIATED MUCIN APDTRX 6 T 170 DDE_Tnp_1_assoc pdbhh F Eukaryota T 7q4q 3 E,F E,F A2GL_HUMAN LRG1 epitope GNKLQVLGKDLLLPQ 15 T 1.9 DUF3719 pdbhh F Eukaryota T 7q50 2 B B FDVSWFMG peptide FDVSWFM 7 T 1.7 DUF5724 pdbhh F T 7q51 2 B B FWLPANLW peptide FWLPANLW 8 T 1.7 Pinin_SDK_memA pdbhh F T 7q5a 1 A,B,C,D,E,F,G,H A,B,C,D,E,F,G,H Lanreotide XCYXKVCTX 9 T 0.021 Urotensin_II pdbhh F T 7q5w 2 G,H,I,J,K,L GGG,HHH,III,JJJ,KKK,LLL TYOBP_HUMAN DNAX-ACTIVATION PROTEIN 12,KILLER-ACTIVATING RECEPTOR-ASSOCIATED PROTEIN,KAR-ASSOCIATED PROTEIN ESPXQELQGQRSDVXSDLNT 20 T 0.0049 ITAM unphh F Eukaryota T 7q64 1 A,AA,B,BA,C,CA,D,DA,E,F,G,H,I,J,K,L,M,N,O,P,Q,R,S,T,U,V,W,X,Y,Z A,W,a,X,b,Y,c,Z,d,B,C,D,E,F,G,H,I,J,K,L,M,N,O,P,Q,R,S,T,U,V NUP98_HUMAN Nuclear pore complex protein Nup98 TGTANTLFGTASTGTSLFSSQNNAFAQNKPTGFGNFGTST 40 T 0.43 Nucleoporin_FG unp F Eukaryota T 7q6i 2 Q,R X,Y Cell division protein FtsN (polyAla model) MANRDYVRRGKGTSRRPAKKKTSGKKPWRXXXXXXXX 37 T 8.1 MRP-S33 pdbhh F T 7q72 2 B,D C,D RED1_SCHPO NURS complex subunit red1 GAMGISLPLLKQDDWLSSSKPFGSSTPNVVIEFDSDDDGDDFSNSKIEQSNLEKPPSNSENGGSHHHHHH 70 T 5.2 DnaA_N pdbhh F Eukaryota T 7q8d 2 C,D PA,PB ASP-LEU-GLU(AMI) XTRESEDLEX 10 T 64 AbiEii pdbhh F T 7q8f 2 C,D PB,PA GNYKEAKK Peptide XGNYKEAKKX 10 T 1.5 TPR_3 pdbhh F T 7q8l 2 C,D PA,PB VPCGTAHE Peptide XVPCGTAHEX 10 T 2.8 Sod_Ni pdbhh F T 7q8n 2 C,D PA,PB KKYDAFLA Peptide XKKYDAFLAX 10 T 4.3 Pollen_allerg_2 pdbhh F T 7q8q 2 C,D PA,PB RLSAKP Peptide RLSAKP 6 T 1.7 HMG14_17 pdbhh F T 7q98 3 C,F,I,L,O C,F,I,L,O ASN-LEU-SER-ALA-LEU-GLY-ILE-PHE-SER-THR NLSALGIFST 10 T 13 NPH-II pdbhh F T 7q9c 2 C,D,E PAA,PAC,PBA RLSAKP Peptide XRLSAKPX 8 T 3.9 HMG14_17 pdbhh F T 7q9h 2 C,D,E PAA,PAC,PB LLKAVAEKQ Peptide XLLKAVAEKQX 11 T 23 YebO pdbhh F T 7q9s 2 C,D CCC,DDD KRas DGKKKKKKSKTKC 13 T 4.5 TMEMspv1-c74-12 pdbhh F T 7qal 1 A A PGAM5_HUMAN BCL-XL-BINDING PROTEIN V68,PHOSPHOGLYCERATE MUTASE FAMILY MEMBER 5 AFRQALQLAALGLAGGSAAVLFSAVAVGKPRAGGD 35 T 290 TMEM210 pdbhh F Eukaryota T 7qam 1 A A PGAM5_HUMAN BCL-XL-BINDING PROTEIN V68,PHOSPHOGLYCERATE MUTASE FAMILY MEMBER 5 AFRQALQLAACGLAGGSAAVLFSAVAVGKPRAGGD 35 T 370 TMEM210 pdbhh F Eukaryota T 7qan 1 A,B AAA,BBB F4F6Q5_MICM1 Cytochrome P450 MAHHHHHHSSGLEVLFQGPMIEIPSAATASPQYPQRRACPYRPAGGYERPVTRVRLYDGRPAWLVTGHETARQVLLDAATFSSDRQHPAFPALAARFEAARAVRNFIGMDPPEHTAQRRMLISGFTAKRVATLRPAITEIVDSLLDEVVRRGPGVDLVATFTLPVPSVVICRLLGVPYADHEFFEHQSRRIAAGTSTAAESADAFGQLKRYLLGLIETKGRGGEDMLDVLVDEQVATGTVTTPDLVDLALLLLVAGHETTASTLALGVALLLEQDGGAVAADPTRVGAVVEEILRHTAVADGVARFATRDTEVAGVRIAAGDAVVVALSAANRDPGPFPDPDRFDPRRGGRQHVTFGHGPHQCIGANLARAELEIALSRLFTRLPTLALAVPVEELGGKEAGGVQGVQRLPVTW 414 T 1.3E-33 p450 unppercent F Bacteria T 7qao 1 A A PGAM5_HUMAN BCL-XL-BINDING PROTEIN V68,PHOSPHOGLYCERATE MUTASE FAMILY MEMBER 5 AFRQALQLAASGLAGGSAAVLFSAVAVGKPRAGGD 35 T 300 TMEM210 pdbhh F Eukaryota T 7qap 1 A A PGAM5_HUMAN BCL-XL-BINDING PROTEIN V68,PHOSPHOGLYCERATE MUTASE FAMILY MEMBER 5 AFRQALQLAACGLAGLSAAVLFSAVAVGKPRAGGD 35 T 290 TMEM210 pdbhh F Eukaryota T 7qby 1 A A DNJB6_HUMAN HHDJ1,HEAT SHOCK PROTEIN J2,HSJ-2,MRJ,MSJ-1 MGNFKSISASTKMVNGRKITTKRIVENGQERVEVEEDGQLKSLTINGKEQLLRLDNK 57 T 5.2 AGA2 pdbpercent F Eukaryota T 7qca 24 X LM0 S7XVN9_SPRLO eL14 LM0 KYFSYPLMYFIKPGCLIKKKFSIYTSIVISVIDNNSVVIQSYDKSDNGDVISIDREVINVSKIVPIGNIDIKNKSKKEIDGILVEENRNLKNKDDVLLMNDFERFKEQLKKEVEDMVIEEMA 122 T 0.00025 Ribosomal_L14e pdbhh F Eukaryota T 7qdj 1 A A PK-10+PK-11 XGELXXLKELXXLKXXXWKGX 21 T 13 Beta_protein pdbhh F T 7qdk 1 A,B,C A,B,C CC-TypeN-LaLd XGELAALKQELAALKWELAALKEELAALKXGX 32 T 0.0005 DUF5320 pdbhh F T 7qdw 1 A A Q8I2Y4_PLAF7 Zinc finger protein, putative GPHMDYDMLTEEQKKKLKEDHTLKILLKNNYVREVFKQFTLSNDKIGYLSHYINDPTIVQVIDHIMKTIDDT 72 T 0.00019 STI1 pdbhh F Eukaryota T 7qdw 2 B B Q8IK99_PLAF7 NUFIP1 domain-containing protein DIYTYEKKLIKSIEYITKNKFFDDS 25 T 1 Sec34 pdbhh F Eukaryota T 7qec 1 A A E4SK47_LACAR S-layer SVSFYEIANGNEVHTGSLNMTANPTSHELNVSAVLAAAKAKYAAHQLENGASNGASVAVTTDVKDLTDQLTKAGIKVDPLGNFQAQASFSFNLAAKSAQNAATATLPITVSVAN 114 T 0.039 DUF6074 pdb F Bacteria T 7qep 38 LA M4 I7L8J2_ENCCU ECU06_1215 protein MRTVRLGRIVTPALKERRHTYAIIVGIIDVTFVLLQRKDGEREICSVANLHLEDEAFDIKGLSAEEIGKLIPEDTYIEDTTNDFDRFKLKLRKRVEEELLKEKGLA 106 T 0.0021 Ribosomal_L14e pdbpercent F Eukaryota T 7qep 45 SA MS I7IV41_ENCCU ECU06_1135 protein MSKTYLKSWKEKKEKMPNAALSFKQRLRIKQQKRVERSALLSKIKILKTRKRNFLRERQKQREMKKQENMAKS 73 T 13 Cgr1 pdbhh F Eukaryota T 7qep 68 PB S0 RSSA_ENCCU 40S ribosomal protein S0 MPQDNTRISDSIKIPDEFVKLLIVSQSHLGGTSTNKSFARYLYGTRPRDRINIIDINATWEKLIIAARAFCGIKHPSSIAVVSTKTFGRKPVVKFCEAVGATPITGRFIPGSFTNSEVKRVYDPRVLIVSDTYADKQAILESQYCNLPTIAFVNTDNSLVGVDIAIPMNNRSPSAIAAGFFILSRLINYMKTGAELVRDMKEVELFLFRDSVELEQLVEEQLLETTDSILNVGKEGILSGIGTGNADEWNSF 252 T 2.5E-12 Ribosomal_S2 pdbpercent F Eukaryota T 7qf9 2 C EEE HRas peptide SGPGCMSCKC 10 T 0.85 DUF4536 pdbhh F T 7qfb 2 B B PPR3C_HUMAN PROTEIN PHOSPHATASE 1 REGULATORY SUBUNIT 5,PP1 SUBUNIT R5,PROTEIN TARGETING TO GLYCOGEN,PTG AKKRVVFADSKGLSLTAIHVFSDLPEE 27 T 0.079 PBCV_basic_adap pdb F Eukaryota T 7qff 2 C,E PA,PB ACE-VAL-ALA-CYS-LYS XVACKSSQPX 10 T 13 KI67R pdbhh F T 7qfh 2 C,D PA,PB LYS-VAL-LEU-AMI XAYFKKVL 8 T 4.7 DUF5339 pdbhh F T 7qfi 1 A,B A,B Q5FLN0_LACAC SlpX MGDTAVNVGSAAGTGANTTNTTTQAPQNKPYFTYNNEIIGEATQSNPLGNVVRTTISFKSDDKVSDLISTISKAVQFHKNNSASGENVTINENDFINQLKANGVTVKTVQPSNKNEKAYEAIDKVPSTSFNITLSATGDNNQTATIQIPMVPQGLEHHHHHH 162 T 0.0028 T2SSC pdbpercent F Bacteria T 7qfj 1 A,B,C,D,E,F A,B,C,D,E,F Q5FLN0_LACAC SlpX MGSTPTDTTQNPQINWTKGGQAQSSSLNGQVFQVAVGSNFNPLNFTNSNGENIIVSAQQSKNNTTFASIEATSNPVNTSEAGRYYNVTLTATGNTGKKTTATYTVLITSSQKQTLYGNGESTISTYSIYGNNVLSNSTTFKDGDQVYVSDQTKTVGGVSYSQVSPKSKNDANSSNIWVKTSLEHHHHHH 189 T 0.00023 DUF5011 pdbpssm F Bacteria T 7qfk 1 A,B,C,D A,B,C,D Q5FLN0_LACAC SlpX MGSTPTDTTQNPQINWTKGGQAQSSSLNGQVFQVAVGSNFNPLNFTNSNGENIIVSAQQSKNNTTFASIEATSNPVNTSEAGRYYNVTLTATGNTGKKTTATYTVLITSSQKQTLYGNGESTISTYSIYGNNVLCNSTTFKDGDQVYVSDQTKTVGGVSYSQVSPKSKNDANSSNIWVKTSLEHHHHHH 189 T 0.00023 DUF5011 pdbpssm F Bacteria T 7qfl 1 A A SLAP_LACAC SURFACE LAYER PROTEIN,SA-PROTEIN MGHHHHHHHHHHSSGHIEGRMGNVNFYDVTSGATVTNGAVSVNADNQGQVNVANVVAAINSKYFAAQYADKKLNTRTANTEDAIKAALKDQKIDVNSVGYFKAPHTFTVNVKATSNTNGKSATLPVVVTVPN 132 T 0.067 TrmE_N pdb F Bacteria T 7qg9 1 A,D,E,G,K,L,O,P,S,T,X,Y R,Q,S,N,M,O,J,L,U,K,P,T FIBL2_BPT5 L-shaped tail fiber protein p132 MSTENRVIDLVVDENVPYGLLMQFMDVDDSVYPSTSKPVDLTDFSLRGSIKSSLEDGAETVASFTTAIVDAAQGVASISLPVSAVTTIASKASKERDRYNPRQRLAGYYDVIITRTAVGSAASSFRIMEGKVYISDGVTQ 140 T 3.3E-05 BppU_N pdbhh T Viruses T 7qg9 2 B,H,Q I,H,G TAIL1_BPT5 TAIL PROTEIN P140 MFYSLMRESKIVIEYDGRGYHFDALSNYDASTSFQEFKTLRRTIHNRTNYADSIINAQDPSSISLAINFSTTLIESNFFDWMGFTREGNSLFLPRNTPNIEPIMFNMYIINHNNSCIYFENCYVSTVDFSLDKSIPILNVGIESGKFSEVSTFRDGYTITQGEVLPYSAPAVYTNSSPLPALISASMSFQQQCSWREDRNIFDINKIYTNKRAYVNEMNASATLAFYYVKRLVGDKFLNLDPETRTPLIIKNKYVSITFPLARISKRLNFSDLYQVEYDVIPTADSDPVEINFFGERK 298 T 0.023 DUF4965 pdbpssm T Viruses T 7qh3 1 A,B,C,D A,B,C,D I2N5H0_STRT9 RsfG MNDTTAAAPGTAADPGPDAAVRALDRLIGTWRVSGGAEGTVSYRGLEGGHFLLQDIALEQFGQPVTGVEVIGRLKEFGAEEPGEDIRSRYYDSRGNTFDYVYELDGDTLTIWGGEKGSPAYYRATFSADGNTLSGAWVYPGGGGYDSVMTRVAV 154 T 0.0017 DUF1579 pdbpssm F Bacteria T 7qh7 5 E I RM10_HUMAN L10MT,MRP-L10,39S RIBOSOMAL PROTEIN L8,MITOCHONDRIAL,L8MT,MRP-L8,MITOCHONDRIAL LARGE RIBOSOMAL SUBUNIT PROTEIN UL10M RRVMHFQRQKLMAVTEYIPPKPAIHPSCLP 30 T 0.19 UL42 unppssm F Eukaryota T 7qh7 17 Q V RM24_HUMAN L24MT,MRP-L24,MITOCHONDRIAL LARGE RIBOSOMAL SUBUNIT PROTEIN UL24M TWIDGPKDTSVEDALERTYVPCLKTLQEEVMEAMGIKETRKYKKVYWY 48 T 0.1 DUF3848 pdbpssm F Eukaryota T 7qh7 34 HA f RM48_HUMAN 39S ribosomal protein L48, mitochondrial YKTKPTHGIGKYKHLIK 17 T 0.14 Lysozyme_like unp F Eukaryota T 7qhm 10 J,W J,W Q8NTD4_CORGL Hypothetical membrane protein MNTMSSAKKKPAPERMHYIKGYVPVAYSSPHSSLERSATWLGMGFLLTALAGVGAVLFAVGANSVGQQQEHWVLYSIIGVVFAVVCTVLGTVLIIKGRAPYNRYVKETGRTQ 112 T 0.00033 Phage_holin_3_6 pdbpssm F Bacteria T 7qi5 80 BC l RM54_HUMAN L54MT,MRP-L54,MITOCHONDRIAL LARGE RIBOSOMAL SUBUNIT PROTEIN ML54 MATKRLFGATRTWAGWGAWELLNPATSGRLLARDYAKKPVMKGAKSGKGAVTSEALKDPDVCTDPVQLTTYAMGVNIYKEGQDVPLKPDAEYPEWLFEMNLGPPKTLEELDPESREYWRRLRKQNIWRHNRLSKNKRL 138 F F Eukaryota T 7qik 2 C,D E,F NCAP_SARS2 SER-SER-ARG-ASN-SEP-THR-PRO-GLY SSRNSTPG 8 T 20 FTCD_C pdbhh T Viruses T 7qil 1 A A DnaE intein SGGALSYDTEILTTEYGLLPIGDIVESETECTVYSVDSDGSTYTQGVAEWHDRGEQEVFEYCLEDGSTIRATKDHKFMTTDGEMLPIDEIFESELDLMRVDSSGDTKIATREYTGSEDVYDIGVESDHNFALSDGFIASN 140 T 1.1E-06 Intein_splicing pdbpercent F T 7qip 2 C,D C,D NCAP_SARS2 ARG-GLY-TPO-SER-PRO-ALA-ARG-MET SSRGTSPARM 10 T 6.8 RNA_pol_Rpa2_4 pdbhh T Viruses T 7qix 3 C E A0A3Q7H1U4_SOLLC 40S ribosomal protein SA MATQDVRTLSTKEADIQMMLAAEVHLGTKNCDFQMERYAFKRRNDGIYIINLGKTWEKLQMAARVIVAIENPQDIIVQSARPYGQRAVLKFAQYTGAHAIAGRHTPGTFTNQLQTSYSEPRLLILTDPRTDHQPIKEAALGNIPTIAFCDTDSPMRYVDIGIPANNKGKHSIGVLFWILARMVLQMRGAINQGPKWDVMVDLFFYREPEEAKEQEEEVPAIADYADYSASAALGGDWTSSQIPEAQWTADAAAPAVGGGWAGDGAADGGWDAAAAPAPVPLPVPDVAPTSGATGWE 296 T 1.9E-12 Ribosomal_S2 pdb F Eukaryota T 7qjf 1 A A LLP_BPT5 Lytic conversion lipoprotein GSTFGPKDIKCEAYYMQDHVKYKANVFDRKGDMFLVSPIMAYGSFWAPVSYFTEGNTCEGVF 62 T 0.0032 Mfp-3 unphh T Viruses T 7qjh 24 TC,X KM0,LM0 S7XVN9_SPRLO Transposase MYFIKPGCLIKKKFSIYTSIVISVIDNNSVVIQSYDKSDNGDVISIDREVINVSKIVPIGNIDIKNKSKKEIDGILVEENRNLKNKDDVLLMNDFERFKEQLKKEVEDMVIEEMA 115 T 0.00024 Ribosomal_L14e pdbhh F Eukaryota T 7qke 1 A A CP125_MYCTU CHOLEST-4-EN-3-ONE 26-MONOOXYGENASE,CHOLEST-4-EN-3-ONE C26-MONOOXYGENASE [(25S)-3-OXOCHOLEST-4-EN-26-OATE FORMING],CHOLESTEROL C26-MONOOXYGENASE,CHOLESTEROL C26-MONOOXYGENASE [(25S)-3BETA-HYDROXYCHOLEST-5-EN-26-OATE FORMING],CYTOCHROME P450 125,STEROID C27-MONOOXYGENASE NGPSPNLPPGFDFTDPAIYAERLPVAEFAELRSAAPIWWNGQDPGKGGGFHDGGFWAITKLNDVKEISRHSDVFSSYENGVIPRFKNDIAREDIEVQRFVMLNMDAPHHTRLRKIISRGFTPRAVGRLHDELQERAQAIAAEAAAAGSGDFVEQVSCELPLQAIAGLLGVPQEDRGKLFHWSNEMTGNEDPEYAHIDPKASSAELIGYAMKMAEEKAKNPADDIVTQLIQADIDGEKLSDDEFGFFVVMLAVAGNETTRNSITQGMMAFAEHPDQWELYKKVRPETAADEIVRWATPVTAFQRTALRDYELSGVQIAAGQRVVMFYRSANFDEEVFQDPFTFNILRNPNPHVGFGGTGAHYCIGANLARMTINLIFNAVADHMPDLKPISAPERLRSGWLNGIKHWQVDYTGRCPVAH 418 T 3.4000000000000003E-22 p450 unppssm F Bacteria T 7qla 2 B C G0SD94_CHATD Ccz1 MTTPVSPSPSGIIPAQLGFLAIYNPALGTTDETLEDQIVYYATASTLSQARRRHRRPRRRDRQRAQSVVKDSRPNAAGATGDSEAVAEDKDPVSKEERHERLRQIGLAQGMVEFAKSFSDGEPVDTIDTEKARVILVEVEEGWWILASIDLTRLPLPQIKTPTSSSAPPPAPNLNPLPPEPAYEYSSREVKPPSLLRADLLRAYDLFLLHHGSSLSSLLASQGRAQLVASLTRFWDHFLATWNVLLHGNPACDVFGGIKLAASGELGIGVGEEERGSGEREVLEGLVERVEGLVDVVVGRYGGPPSEKGPEEEQWLGLGGEVGEEDGAVFLGVGALDRKSLRGVVQWMEEVYVWGENAFGKPRRDLSTGHFLLGLSECSEEELTSSQANPKAIFVELKPSYQHPSRKIPPEDPQPLGKVGPELPRDHTARLRPVIYVSQPFIYILLFSEITPSPSTWPTLAESLHAQLSPLQKPLLHSTSYRPERPVVETTSSSGTTTQHQIFDLVYDTETLTLQSTIPNIPDPFPYSATTPTGHSTGQQHHQQSIWTRVEALQTHAQILAILSSGRAIPTDPSSFTHLPWEEGERTCKTARGWWIVWTRVVEHSPPDAVSLHHARDDDDNDDDASCSVLGHLRSVSSSHAAGSTSSSSGSGFGLGAIPGLGGLGGWAADGATRLAQGIGIDTRRYVEGLLTSLGR 696 T 0.045 Intu_longin_1 pdbpssm F Eukaryota T 7qld 1 A,B B,A SLAP_LACAC SURFACE LAYER PROTEIN,SA-PROTEIN RHMATTINASSSAINTNTNAKYDVDVTPSVSAVAANTANNTPAIAGNLTGTISASYNGKTYTANLKADTENATITAAGSTTAVKPAELAAGVAYTVTVNDVSFNFGSENAGKTVTLGCANSNVKFTGTNSDNQTETNVSTLKVKLDQNGVASLTNVSIANVYAINTTDNS 170 T 0.0049 Cadherin-like pdb F Bacteria T 7qle 1 A,B A,B SLAP_LACAC SURFACE LAYER PROTEIN,SA-PROTEIN MGHHHHHHHHHHSSGHIEGRHMATTINASSSAINTNTNAKYDVDVTPSVSAVAANTANNTPAIAGNLTGTISASYNGKTYTANLKADTENATITAAGSTTAVKPAELAAGVAYTVTVNDVSFNFGSENAGKTVTLGSANSNVKFTGTNSDNQTETNVSTLKVKLDQNGVASLTNVSIANVYAINTTDNS 189 T 0.0057 Cadherin-like pdb F Bacteria T 7qlh 1 A,B A,B E4SK47_LACAR S-layer MGKGDVNVTSNVQAITSPQTTTIDNQTGAVTYSNWDGKVNGTVTATYNGQSYTATLNETAGKENSRVTPWYTQDGGKTWNVLKKDGGVYRLEPAGKYQLSVNNVSFNFGTANANKKNITLTSSNGVQFRENGQWKDSIKVSTDQNGAVSQPLTLLIPITPVDVTNAKSHHHHHH 174 T 0.052 BNR pdb F Bacteria T 7qlr 1 A,B,C,D A,B,C,D A0A1J1J928_9CAUD CDHS1_22 Putative tail fiber protein MSWAETYKVNSDLQGEPLNFLSYLQDIKLNGLDSYVLFIGNARIWEELYLNSLYLFSDRGIRETVYTAFSETDIDNLFNKSTKLGEQLNAFYRTDIFSLGNADNVVKEMTIEHYNSLEEKFKAGYDRYVTREQEKSTIGAWFNSTFSLDNTDLENLTTIEEILANVEATNAILNNSNAIVALTMCKSSMDAVVASSNAMDLLGQYILRVTTESPVIRAILKNNVIRDAIINSDEAMTQISSNENSVMEIFNDLEATKVLVQNQNSINKILTNNVTVEKIIPNLLEMKYNLQTSLNYINTIKSNIASGKGQIMAITYNEEIFPILKNAVKNYDGMETTRNISQRDIEEKIKISDAILESSIAMATFANNSIIVNKVGDRVGIIESIFSKTVSLNAFMKSTTAINILVNKTTAFTKIANNSTAFNAMLTISENNVTIANNTTAMGIIANNAQAMSTVANNDTSISVFVNNTTAMGIIANSSTAMTKITLTGLALNRMVKSNTAKSILISKNSTLQTYKNNIQNTIQGSTAYFRTITGFADADDNPPQTINSTYVGITYCYGYKGNSYYGIVYHGYNTSIEAGRGNGYKDETKKFITLGGARYDQSGDGYFTYAMYQAI 616 T 0.00067 HC2 pdbpssm T Viruses T 7qof 1 A,B,C,D,E,F,G,H,I A,B,C,D,E,F,G,H,I A0A385DVU6_9CAUD Major capsid protein gp32 MAGKLGKFQMLGFQHWKGLTSDNHLGAIFQQAPQKATNLMVQLLAFYRGKSLDTFLNSFPTREFEDDNEYYWDVIGSSRRNIPLVEARDENGVVVAANAANVGVGTSPFYLVFPEDWFADGEVIVGNLNQVYPFRILGDARMEGTNAVYKVELMGGNTQGVPAERLQQGERFSIEFAPVEKELSRKVGDVRFTSPVSMRNEWTTIRIQHKVAGNKLNKKLAMGIPMVRNLESGKQVKDTANMWMHYVDWEVELQFDEYKNNAMAWGTSNRNLNGEYMNFGKSGNAIKTGAGIFEQTEVANTMYYNTFSLKLLEDALYELSASKLAMDDRLFVIKTGERGAIQFHKEVLKTVSGWTTFVLDNNSTRVVEKVQSRLHSNALSAGFQFVEYKAPNGVRVRLDVDPFYDDPVRNKILHPMGGVAFSYRYDIWYIGTMDQPNIFKCKIKGDNEYRGYQWGIRNPFTGQKGNPYMSFDEDSAVIHRMATLGVCVLDPTRTMSLIPAILQG 504 T 0.75 DUF5309 pdbhh T Viruses T 7qof 2 J,K,L,M,N,O,P,Q,R a,b,c,d,e,f,g,h,i A0A385DVS7_9CAUD Auxiliary capsid protein gp36 MVISINQVRQLYVAKALKANTAALTTAGDIVPKADTAKTTLYFQSMSPAGIVASDKINLKHVLYAKATPSEALAHKLVRYSVTLDADVSATPVAGQNYILRLAFRQYIGLSEEDQYFKYGEVIARSGMTASDFYKKMAISLAKNLENKTESTPLVNIYLISAAAASTDVPVTSATKESDLTATDYNQIIIEETEQPWVLGMMPQAFIPFTPQFLTITVDGEDRLWGVATVVTPTKTVPDGHLIADLEYFCMGARGDIYRGMGYPNIIKTTYLVDPGAVYDVLDIHYFYTGSNESVQKSEKTITLVAVDDGSHTAMNALIGAINTASGLTIATL 333 T 3.4 FTP pdbpercent T Viruses T 7qof 3 S,T j,k A0A385DVL5_9CAUD Head fiber trimer protein gp21 MKRVLNLGNLSRIVEGDPNEITDDEILVIKDKIIEGKIIDIQKRVDGKLVSLITEKYTYTINPTPADAIVVINGSTTKSIRAAKGHTVTWSVSKTGFVTQSGSDVISGDVSKDVTLVANPAS 122 T 0.0062 PEGA unppssm T Viruses T 7qog 1 A A A0A385DT68_9CAUD Portal protein gp20 MADFLNFPRQMLPFSKKTKQWRKDCLLWANQKTFFNYSLVRKSVIHKKINYDLLNGRLHMSDLELVLNPDGIKAAYIPDRLQHYPIMNSKLNVLRGEESKRVFDFKVVVTNPNAISEIEDNKKNELLQRLQEMITDTSISEDEYNIKLEKLNDYYTYEWQDIREVRANELLNHYIKEYDIPLIFNNGFMDAMTCGEEIYQCDIVGGEPVIERVNPLKIRIFKSGYSNKVEDADMIILEDYWSPGRVIDTYYDVLSPKDIKYIETMPDYIGQGAVDQMDNIDERYGFVNQNMIGDEITVRDGTYFFDPANLFTEGIANSLLPYDLAGNLRVLRLYWKSKRKILKVKSYDPETGEEEWNFYPENYVVNKEAGEEVQSFWVNEAWEGTMIGNEIFVNMRPRLIQYNRLNNPSRCHFGIVGSIYNLNDSRPFSLVDMMKPYNYLYDAIHDRLNKAIASNWGSILELDLSKVPKGWDVGKWMYYARVNHIAVIDSFKEGTIGASTGKLAGALNNAGKGMIETNIGNYIQQQINLLEFIKMEMADVAGISKQREGQISQRETVGGVERATLQSSHITEWLFTIHDDVKKRALECFLETAKVALKGRNKKFQYILSDTSTRVMEIDGDEFAEADYGLVVDNSNGTQELQQKLDTLAQAALQTQTLSFSTITKLYTSSSLAEKQRLIEKDEKQIRERQAQAQKEQLEAQQQIAAMQQQQKEAELLQKEEANIRDNQTKIIIAQIQSEGGPDEEDGIMIDDYSPEAKANLAEKIREFDEKLKLDKDKLKLDKKKAETDASIKRQALRKKSSTTNK 806 T 0.21 RbsD_FucU pdbpssm T Viruses T 7qog 2 B B A0A385DT91_9CAUD Ring protein 1 gp43 MVNNINWVKLPVILDRLLRHPLLTDLNLETAIQYTLDFISAMGLPNVYVDKIETIDIKEYRGELPCDLISINQVRLHKNGIALRAMTDNFNAYPTHDHKEGDWYERGEPSFKTQGRVIFTSIKHEKVDISYKAIMLDDEGLPLIPDNPIFLKTLELYIKKEWFTILFDMGKISPAVLNNTQQEYAFKAGQCNNEFVIPSVSEMEAITNMWNQLIPRVTEFRRGFKNLGDKEYIRVH 236 T 3.5 PriX pdbhh T Viruses T 7qog 3 C C A0A385DT87_9CAUD Ring protein 2 gp40 MTYNELIYMVLDELKLSSDDSYYTPDHVIFLLVKYRSFLLKQRYSDIKKQIPDSDYQSICLDLIEVPAISGEPCEGSSYLRSKNKVPTTMMIGNPRVYPMDFYQGEITYISRDRMRYVGYNKFLRNIIYCSKAPDGYLYFKSWNPQFLHLEKVSFNAIFEDAKEASEMACPEENGTICKLEDKEFPIEDALVPPLIELVVKELRGPEYSPKDEDNNAKDDLPDAR 225 T 0.18 DUF547 pdbpssm T Viruses T 7qog 4 D,E M,N A0A385DV85_9CAUD Cargo protein 1 gp45 MAKKKIKRRGKMPPNIFDTGGQSWGQQSSGQFSNAFKGENLGNSIGSIGGAVGGIAQAGISNAQIADTSGIEAQNKAQKNMVVGASSNDDLMSEWGSWNKVKDDYSWKDVRGGSTGQRVTNTIGAAGQGAAAGASVGGPIGAIVGGVVGLGSAIGGWLGGNRKAKRKAKKLNKEAKEANERALTSFETRADNIDTQNDFNMLANFSAYGGPLEFGSGAIGYEFDNRYLNNQEMSAVAKQRLTSLPNSFQALPEMNTYNAFAEGGGLSREKNYGSKKKPYPSVPSGDFAGPHRSYPIPTKADARDALRLAGLHGNESVRRKVLAKYPSLKAFGGSLFDSVVGNNFNQSFTQGIQGMFQQEPEQTVQAANIAKDGGDIKIKEKNKGKFTAYCGGKVTEACIRKGKNSSNPTTRKRATFAQNARNWNAFGGWLNTQGGDFTNGVTFINEGGSHEENPYQGIQIGVDPEGAPNLVEQGEVVYDDYVFSDRMEIPDDIRKEYKLRGKTFAKAAKSAQRESEERPNDPLSTKGLQAAMERIATAQEEARQRKEAHREGNEYPSMFAYGGDTNPYGLALEDPMSVEELEALMVQSGETGEIAPEGNNGNRQTWTRYAPIIGSGLASLSDLFSKPDYDSADLISGVDLGAEAVGYAPIGNYLSYRPLDRDFYINKMNQQAAATRRGLMNTSGGNRLNAQAGILAADYNYGQNMGNLARQAEEYNQQLRERVEAFNRGTNMFNTETGLKASMFNAESRNAAKRARLGQATTVAQLRQGIKDQDAARRSANITNFLQGLGDMGWENEQANWLDTLAKSGVLKMNTKGEYTGGTKKAKGGKVRTKKKKGLTYG 842 T 0.0068 RTX pdb T Viruses T 7qoh 3 L,M g,h A0A385DTA3_9CAUD Portal vertex capsid protein gp57 MAGQQGIYCAPDNIVPNRDRVDVGCAPDGAMQLWVMEYEVTGIGKGCAMCKAINPQQAEMLLKSNGIYNGSSYLYKVTRIEQVIVPPCNGLMAEQVVTYKDVVS 104 T 0.092 DUF2931 pdb T Viruses T 7qoj 4 D D A0A385DV73_9CAUD Ring protein 3 gp35 MTNKEFSDGFSTLLNSFGITPNITLDEYEKSTFLTNAQEQLIIDIYSGRNIIYGKSFEQTEEIRRYLSNLVETYETSTKVTGKLGLSKDSVFFEIPQDTWFITYEVAFLKDSRLGCLDGIEASVVPLPQDDLYRAKDNPFRGPSKDRVLRLDIKSDLAELISKYNVDKYLMRYISQPTPIILVDLPDGLSINGVSTESECELNPVVHRAILERAVQLAIISKTQLTGNKE 230 T 0.22 LAGLIDADG_3 pdb T Viruses T 7qoj 5 E,F E,F A0A385DVC3_9CAUD Ring protein 4/5 gp34 MNVNEFSNEFDVLYNNIMSNAAPGLNEYEKSVLLTKAQEEIVKNYFEPAGNKYGKGLDDSPKRQIDFSELIKVGEGVLNTSAPTITFDKRAKVYDLPADLFLVINEAVDTNAGTKQIVPISYSDYTRLMSRPYKEPVKYQAWRIITTSINNISVELIVNSNETITDYKVRYIRRPAPIITTNLSSEYGDVTINGVSTVSECELNPIIHSEILQRAVELAKAAYQGDLQASVELGQRSE 238 T 16 DUF3206 pdbhh T Viruses T 7qoj 6 G,H G,H A0A385DTH1_9CAUD Tail hub protein A gp38 MHFNELRISQDNRFLIIDVSVDNQDYFEDVLLDSIVIDTQDTFVMNGPSDNPLYIYNVEDAYDLTYSLPEQCNCNPVRVEEDESYCFTYGTQQMKNVRLELNIQDLKVSPCSTMFFVYVKSKGTPSTDTPCGFDKDQILGTVINLQPIYKQTLKYLKEVECDCNIPKGFIDMILKLKAIELCVRTGNYPQAIKYWNKFFIKNNCKSPTSNCGCYG 215 T 3.4 SPOB_a pdbhh T Viruses T 7qoj 7 I I A0A385DVM6_9CAUD Tail hub protein B gp39 MDKMLEISEEAITRYFTTLSQFGYKKYSDVDKIIVLFFMEEMLAGEMSYYVTQDDYRNIVNALYCLAGSTCMIDFPMFESYDTLVHSNNRTFVPRITEDSILRSTEDDNFRVEA 114 T 0.077 DUF5854 unppssm T Viruses T 7qok 1 A A A0A385DVD6_9CAUD MUZZLE PROTEIN MALKKEQHFFKGMQRDLSVSKFNPEYAFDAQNIRITAREHDTLLSVSNEKGNKEIPLQSPSGDPVVIDGVLLGQNVLNNYVTLFTKGTNDNIYRLENKGTYFETLILFSGNLNFSTDYPIESISVYENNNIQKVYWVDGLNQARVINITKDDYNNADDFDFVGTIHTSSKIEVSKVNGSGAFGQGVIQYAFTYYNKYGKETNIFRTSPLLYIAYSDRGASPEETVSCSFQINFTELDSSYDFIRVYSIHRTSIDATPTVRKVADLATDTKLYVDTGTTGEIVDPTLLLYVGGEEIAPYTMTQKDNTLFLGNYTLKRSLISTELKNQIKSDSIVTTILGGLDDAIESEWNVNTQYNSNYDLNYDSRIKGFQKGEIYRLGIQFQDNKGKWSEVVFIGDYECTERFKYTQYDTYGITLIPRFKVVISNSTTIQAIKNLGYINARGVVVFPTLEDRNILCQGILCPTVANYKDRLDNSPFVQSSWFSRPKQATETWKTEYSGTNHLSEFGEVPYFQHNEPIGSASLSEITRWEIQTSLGLVPYYNPSTTNAKDFVDGSPSEFLVDENIVTMHSPDVEFDDRLQNITNGKFKLRIIGTTHLTNTLSDISVITSTPTYGNYATGFYKGKVANMNISTSYYGGRQLSAGLFWSDNVKFQDPSPQDKLERLWMVYPWHRNGSLMNMGVPTEGTRAAALQRKIISNLKFASQNNYLPNQSVWEAEISGDANHTGITPVNSWTEGLVRIPAQANSNLGSLNYYANIDKVLTFNRSEQISEIYKNGYLIYTTKDWITDGKIADLFNNAISQTISVDQVQDWLTRIADTDKYGTEPVSMKYKSNPHLVFAFNYTESGKQLILPMKNNNNGYLAPSANSKPFWNPTAPEGAVYQDSINFTNENRAFFWLAELYRDSVVNRFGGDTEEAILNNTWLPSGDSVIIGDSINIEYTEGDTYYQRYDCLRTFAYTNEDQNSIVDIVSFMCESKVNIDGRYDKNRGQVNNLAVSPTNFNLFNPVYSQKNNFFTFRTIDYERFSINYFPNSITVTKEKSLGEDIDTWTNITLATTLDLDGDKGEIVSLNTYNNEIFCFQRRGLSNILFNSRVQIPTSDGMPIEITNGLKVSGKRYISNTIGCANKWSIAESPSGLYFIDNETNSLYLFNGEIVSLSDKLGFRQWISTHNVHVNWEPVGYNNYRSFYDKNNNDVYFTYKDHCLCYSELINQFTSFMSYEGVPAMFNVSSEFYAFKDGKMWEQFAGDYNMFFGEYKPFSITFVANAEEPNDKIFNTVEFRADSWDSDNLISNKTFDTLDVWNEYQHGTTPLTNLLGHPSPLKKKFRIWRANIPRAIANNRDRIRNTWAYIKLGMNTPNTYRTEFHDAIIHYFA 1371 T 0.014 Phage_stabilise pdbhh T Viruses T 7qot 2 C,D C,D KNG1_HUMAN Kininogen-1 light chain SDDDWIPDIQIDPNGLSFNPISDFPDTTSPK 31 T 5.6 NUC153 pdbhh F Eukaryota T 7qox 2 C,D D,C KNG1_HUMAN Kininogen-1 light chain TQSDDDWIPDIQIDPNGLSFNPISDFPDT 29 T 4.6 NUC153 pdbhh F Eukaryota T 7qpd 5 E C CALR_HUMAN CRP55,CALREGULIN,ENDOPLASMIC RETICULUM RESIDENT PROTEIN 60,ERP60,HACBP,GRP60 EPAVYFKEQFLDGDGWTSRWIESKHKSDFGKFVLSSGKFYGDEEKDKGLQTSQDARFYALSASFEPFSNKGQTLVVQFTVKHEQNIDCGGGYVKLFPNSLDQTDMHGDSEYNIMFGPDICGPGTKKVHVIFNYKGKNVLINKDIRCKDDEFTHLYTLIVRPDNTYEVKIDNSQVESGSLEDDWDFLPPKKIKDPDASKPEDWDERAKIDDPTDSKPEDWDKPEHIPDPDAKKPEDWDEEMDGEWEPPVIQNPEYKGEWKPRQIDNPDYKGTWIHPEIDNPEYSPDPSIYAYDNFGVLGLDLWQVKSGTIFDNFLITNDEAYAEEFGNETWGVTKAAEKQMKDKQDEEQRLKEEEEDKKRKEEEEAEDKEDDEDKDEDEEDEEDKEEDEEEDVPGQAKDEL 400 T 2.1E-21 Calreticulin pdb F Eukaryota T 7qpk 1 A,B A,B A0A0W0CDE7_CANGB A-region of Awp14 MGSSHHHHHHSSGLVPRGSHMASMTGGQQMGRGSKTYENQKIVIDGVALGTTTFEDDELLVLKNSTLTLNNFMNIKLPAGISLTDNSVLNINTPPDDTPPSDSYDVKRPQYSMVINGKVSIDNGSQFVFDGSSLVYSLGPYASEKFLFDINTGMDGIFISKDSTMRITLPKYLDWGFSHATTKFSGIHIGGTYKAPYNSPLVILGTLEVLRSDSRTDDGYFDDNLFRIDLGPDKIDENGVFTMKNDLSGNIHCQGILSFFADIFKGTDNVFIRTIGFQAISPISPITVDLAEGPVQGNGYLRYNVIISQGQGNGLKLLNLQARLDIGLPIIYIYNSDNYKDLTAKAHDNVIDIIDHSSNKSFSIIGDRKYNITYWYQQYTEIYPSYQYGGYFKVPLFKKSLQLDFIPIIEPDY 413 T 0.19 Hyphal_reg_CWP pdbhh F Eukaryota T 7qql 2 B,E,F F,D,E KS6A1_HUMAN S6K-ALPHA-1,90 KDA RIBOSOMAL PROTEIN S6 KINASE 1,P90-RSK 1,P90RSK1,P90S6K,MAP KINASE-ACTIVATED PROTEIN KINASE 1A,MAPK-ACTIVATED PROTEIN KINASE 1A,MAPKAP KINASE 1A,MAPKAPK-1A,RIBOSOMAL S6 KINASE 1,RSK-1 RRVRKLPSTTL 11 T 13 DUF3549 pdbhh F Eukaryota T 7qqn 2 C,D B,D TRPV3_HUMAN TRPV3,VANILLOID RECEPTOR-LIKE 3,VRL-3 EVEEFPETSV 10 T 6 Gemini_V2 pdbhh F Eukaryota T 7qqy 2 B B ECM21_YEAST ECM21 PFITSRPW 8 T 3.8 DUF2183 pdbhh F Eukaryota T 7qrj 1 A,B,C,D,E,F A,B,C,D,E,F A0A2P1EHJ0_9VIRU Zav_19 protein MSISSLLEKNIYNVHNKSNTLTNVPANPTGNTNTVWSNSNFTPPHLMYGASDITQAIGNISLTTGSFSLSLSGPWASPLVQNVAYTKINNLVNLTFPPFQANATSSAVINSAIGALPADLRPTTNIQVDFEIFVIDDGNRPVNPGLITLLSNGQIVVYKDNNLGQFTTGIGGSGFNPFSITYMV 184 T 0.11 CDC45 pdb T Viruses T 7qrr 1 A,B,C,D,E,F,G,H,I,J,K,L F,A,B,C,D,E,G,H,I,J,K,L A0A1Q1PNC6_9VIRU NMV_189 protein MSVYGPVPTVTTRAFLPRLATAADSITSTTTTIALDPQTEQSYWTRVGDTATIHIHLVGAALPAAAPSTRIYGNFPPLRITPSSALAAQHGVIVPMQYYVAPTLPVGSSAAARIETGFIELGSLLNGAFTPLAANLIGTVGYEFAIDATYAAQ 153 T 29 FANCL_d3 pdbhh T Viruses T 7qrs 2 C,D C,D TAX_HTL1C PROTEIN X-LOR,PROTEIN PX,TRANS-ACTIVATING TRANSCRIPTIONAL REGULATORY PROTEIN OF HTLV-1 KHFRETEV 8 T 41 FeThRed_A pdbhh T Viruses T 7qs6 2 B B THAN_PODMA Thanatin-like derivative XPITYXNRXTXKCXRY 16 T 2.6 YihI unphh F Eukaryota T 7qsj 1 A,B A,B A0A3P4A4D3_MYCHD Methylmannose polysaccharide hydrolase (MmpH) MVLRDDLDAVPGVPGVLTPEQCRQTAQAIADAQEPSGALPWFEGGHTDPWDHVENAMALTVAGLLEPARAAFDWCRTTQRPDGSWPIQIRNGVVEDANSDSNFCAYVATGVWHHVLITGDRRFAETMWPVVAKAIDFVIDMQLPGGEIAWARSPSGLYEEALLTGCASIYHSIRCALALADYMGEPQPEWEVAVGRLGHAIAEHPEAFVTKDRWSMEWYYPVLGGALRGEAARARINRRWNDFVVPGLGIRCVDDRPWVTGAETCELVLALDAIGDLTRAHEQFAAMHHLREEDGSYWTGLVYDDGKRWPIERTTWTGAAMILAADALSRTTPGNGIFRGVDLPRGLEGEYDCACATSERKLAAALEHHHHHH 373 T 0.0028 Bac_rhamnosid6H unppercent F Bacteria T 7qto 2 C,D C,D Q6DP93_9INFA NS1 ARTIESEV 8 T 34 TEX12 pdbhh T Viruses T 7qtp 2 B B Q6B3P2_9INFA NS1 KMARTIESKV 10 T 10 Tipalpha pdbhh T Viruses T 7quu 1 A A YFPE_SCHPO Uncharacterized protein C7D4.14c MGWSHPQFEKSSDAVEPSVEKEYKKIISFRDTVFEGKHQQFLVPNNVRLKFLRDR 55 T 9.5 NPCC pdbhh F Eukaryota T 7quu 2 B B RED1_SCHPO NURS complex subunit red1 GAMGTTNQKEAEKAVSQLFEVGVRFNDFIAEGIEPSVVHTLFLKLGLDS 49 T 0.11 T4bSS_IcmS pdb F Eukaryota T 7quv 3 C C Peptide 3 RSPESVAFPMFQSHWYSG 18 T 3.4 Pneumo_matrix pdbhh F T 7qux 2 B D P7C8 CRLYGFKW 8 T 0.33 DUF1281 pdbhh F T 7qvb 1 A,B A,B DNA damage response protein C GAMGMKNAPLTLNFGSVRLPVSADGLLHAPTAQQQLGLTQSWEAALVEHGLPETYRDFGAGPEAAVSVPDFVALAFALDTPEARRWQKRARELLARAMQGDVRVAAQIAERNPEPDARRWLAARLESTGARRELLATVARHGGEGRVYGQLGSISNRTVLGKDSASVRQERGVKATRDGLTSAELLRLAYIDTVTARAIQESEARGNAAILTLHEQVARSERQSWERAGQVQRVG 235 T 32 DUF2789 pdbhh F T 7qwa 1 A,B,C,D,E,F A,B,C,D,E,F CC-Type2-(UgUe)4 XGEIXQXLKEIXKXLKEIXXXLKEIXQXLKGX 32 T 0.011 WXG100 pdbpssm F T 7qwb 1 A,B,C,D,E,F A,B,C,D,E,F CC-Type2-(Ue)4 XGEIXQALKEIXKALKEIXXALKEIXQALKGX 32 T 0.011 WXG100 pdbpssm F T 7qwc 1 A,B,C B,C,E CC-Type1-(UbUc)4 XGELXXIKQELXXIKKELXXIKXELXXIKQGX 32 T 0.0032 DUF5320 pdbhh F T 7qwd 1 A,B,C,D,E A,B,C,D,E CC-Type2-(Ug)4 XGEIAQXLKEIAKXLKEIAXXLKEIAQXLKGX 32 T 0.011 WXG100 pdbpssm F T 7qwn 1 A,B,C A,B,C CP125_MYCTU CHOLEST-4-EN-3-ONE 26-MONOOXYGENASE,CHOLEST-4-EN-3-ONE C26-MONOOXYGENASE [(25S)-3-OXOCHOLEST-4-EN-26-OATE FORMING],CHOLESTEROL C26-MONOOXYGENASE,CHOLESTEROL C26-MONOOXYGENASE [(25S)-3BETA-HYDROXYCHOLEST-5-EN-26-OATE FORMING],CYTOCHROME P450 125,STEROID C27-MONOOXYGENASE NGPSPNLPPGFDFTDPAIYAERLPVAEFAELRSAAPIWWNGQDPGKGGGFHDGGFWAITKLNDVKEISRHSDVFSSYENGVIPRFKNDIAREDIEVQRFVMLNMDAPHHTRLRKIISRGFTPRAVGRLHDELQERAQKIAAEAAAAGSGDFVEQVSCELPLQAIAGLLGVPQEDRGKLFHWSNEMTGNEDPEYAHIDPKASSAELIGYAMKMAEEKAKNPADDIVTQLIQADIDGEKLSDDEFGFFVVMLAVAGNETTRNSITQGMMAFAEHPDQWELYKKVRPETAADEIVRWATPVTAFQRTALRDYELSGVQIKKGQRVVMFYRSANFDEEVFQDPFTFNILRNPNPHVGFGGTGAHYCIGANLARMTINLIFNAVADHMPDLKPISAPERLRSGWLNGIKHWQVDYTGRCPVA 417 T 3.4000000000000003E-22 p450 unppssm F Bacteria T 7qy5 2 C,D F,G RED1_SCHPO RNA elimination defective protein Red1 KNEEDESNDSDKEDGEISEDD 21 T 6.2 Vpu pdbhh F Eukaryota T 7qzd 2 C,F C,G A0A219T3Y8_MAGOR AVR-PIK PROTEIN METGNKYIEKRAIDLSRERDPNFFDNADIPVPECFWFMFKNNVRQDAGTCYSSWKMDKKVGPNWVHIKSDDNCNLSGDFPPGWIVLGKKRPGF 93 T 2.8 DIM unphh F Eukaryota T 7qzr 4 E,F E,F Q2G0X2_STAA8 MYELOPEROXIDASE INHIBITOR SPIN,PEROXIDASE INHIBITOR MKFKKVLVATAMVGVLATGVVGYGNQADAKVYSQNGLVLHDDANFLEHELSYIDVLLDKNADQATKDNLRSYFADKGLHSIKDIINKAKQDGFDVSKYEHVK 102 T 0.0051 CompInhib_SCIN unphh F Bacteria T 7qzv 1 A A Hm-AMP2 EKRWRRLIFNYFX 13 T 1.8 MCRS_N pdbhh F T 7qzw 1 A A Hm-AMP8 RAVIYKIPYNAIASRWIIAPKKCX 24 T 1.9 TGF_beta pdbhh F T 7r0j 1 A A V2R_HUMAN V2R Cter ESCTTASSSLAKD 13 T 57 DPM3 pdbhh F Eukaryota T 7r0w 5 E,M M,E A0A6P1VG96_9SYNC Cytochrome B6 MAAGVGIFIGYIAVFTGVTLGLLYGLRFVKLI 32 T 0.00039 PetL pdbpercent F Bacteria T 7r1i 1 A,B,C A,B,C CP125_MYCTU CHOLEST-4-EN-3-ONE 26-MONOOXYGENASE,CHOLEST-4-EN-3-ONE C26-MONOOXYGENASE [(25S)-3-OXOCHOLEST-4-EN-26-OATE FORMING],CHOLESTEROL C26-MONOOXYGENASE,CHOLESTEROL C26-MONOOXYGENASE [(25S)-3BETA-HYDROXYCHOLEST-5-EN-26-OATE FORMING],CYTOCHROME P450 125,STEROID C27-MONOOXYGENASE NGPSPNLPPGFDFTDPAIYAERLPVAEFAELRSAAPIWWNGQDPGKGGGFHDGGFWAITKLNDVKEISRHSDVFSSYENGVIPRFKNDIAREDIEVQRFVMLNMDAPHHTRLRKIISRGFTPRAVGRLHDELQERAQKIAAEAAAAGSGDFVEQVSCELPLQAIAGLLGVPQEDRGKLFHWSNEMTGNEDPEYAHIDPKASSAELIGYAMKMAEEKAKNPADDIVTQLIQADIDGEKLSDDEFGFFVVMLAVAGNETTRNSITQGMMAFAEHPDQWELYKKVRPETAADEIVRWATPVTAFQRTALRDYELSGVQIKKGQRVVMFYRSANFDEEVFQDPFTFNILRNPNPHVGFGGTGAHYCIGANLARMTINLIFNAVADHMPDLKPISAPERLRSGWLNGIKHWQVDYTGRCPVAH 418 T 3.4000000000000003E-22 p450 unppssm F Bacteria T 7r1v 2 B B Dynobactin A WNSNVHSYRF 10 T 1.3 DUF5504 pdbhh F T 7r2m 2 C,D B,E Vangl2 peptide GSGSGGMRLQSETSVMRLQSETSV 24 T 3.6 Strabismus pdbhh F T 7r2t 2 B B Vangle2 peptide binding motif with the P-1 phosphrylated MRLQSETSV 9 T 0.42 Strabismus pdbhh F T 7r31 1 A,B,C A,B,C Y1513_SYNY3 Membrane-associated protein slr1513 MAKPANKLVIVTEKILLKKIAKIIDESGAKGYTVMNTGGKGSRNVRSSGQPNTSDIEANIKFEILTETREMAEEIADRVAVKYFNDYAGIIYICSAEVLYGHTFAGPEGASAWSHPQFEK 120 T 0.0021 DUF3240 pdbpercent F Bacteria T 7r32 1 A,B,C A,B,C Y1513_SYNY3 Membrane-associated protein slr1513 MAKPANKLVIVTEKILLKKIAKIIDESGAKGYTVMNTGGKGSRNVRSSGQPNTSDIEANIKFEILTETREMAEEIADRVAVKYFNDYAGIIYICSAEVLYGHTFSAWSHPQFEK 114 T 0.011 DUF3240 unppercent F Bacteria T 7r4h 4 E L STING_HUMAN HSTING,ENDOPLASMIC RETICULUM INTERFERON STIMULATOR,ERIS,MEDIATOR OF IRF3 ACTIVATION,HMITA,TRANSMEMBRANE PROTEIN 173 QEPELLISG 9 T 3 SurE pdbhh F Eukaryota T 7r6k 16 P 5 RRP17_YEAST RRP17 isoform 1 YLTKNERR 8 T 0.53 RyR unp F Eukaryota T 7r6q 14 N h RL35A_YEAST 60S ribosomal protein L35 GKKYQPKVTEKQRKKQIA 18 T 0.52 AgrD pdbhh F Eukaryota T 7r73 1 A G ENV_HV1H2;H6VWK7_9HIV1 Glycoprotein 120 VWKEANTTLFCASDAKAYDTEAHNVWATHACVPTDPNPQEVVLENVTENFNMWKNHMVEQMHEDIISLWDQSLKPCVKLTGGSVITQACPKISFEPIPIHFCAPAGFAILKCNDKKFNGTGPCTNVSTVQCTHGIRPVVSTQLLLNGSLAEEEVVIRSENFTNNVKNIIVQLNESVQINCTRHNNGGSGSGGDIRQAHCNISREKWQNTLKQIVKKLREQFKNKTIAFAPSSGGDPEIVMHSFNCNGEFFYCNTTKLFTSTWNSTWNSTWNNTEGSNSTVITLPCRIRQIINMWQEVGKAMYAPPIQGQIKCSSNITGLLLTRDGGVDTTKETFRPGGGNMKDNWRSELYKYKVVRIE 358 T 4.8E-52 GP120 unp T Viruses T 7r8x 2 B C EPOR pY454 phosphopeptide KXLYLVVS 8 T 0.81 Glyco_hydro_47 pdbhh F T 7rae 2 B D NCOA2_HUMAN TIF2 QALLRYLLDKDD 12 T 0.0028 DUF4927 pdb F Eukaryota T 7rap 1 A A W5IDB3_LASLA Heterogeneous-backbone analogue of lasiocepsin GLXRKXLCAXAKXKXXCXXAXKLXCKCX 28 T 1.2 Antimicrobial_1 pdbhh F Eukaryota T 7raw 1 A A A0A2N0URA4_9FIRM Dockerin domain-containing protein HHHHHHENLYFQGEETDTKIYFDASNLPAEWGTTKTVYCHLYAVAGDDLPETSWQGKAEKCKKDTATGLYYFDTAKLKSADGTNHGGLKDNADYAVIFSTIDTKSQSHQTCNVTLGKPCLGDTIYLTGGTVENTEDSSKRDFAATWKNNSDNYGPKAAITSLGHVTEGRFPIYLSRAEMVAQAIFNWAVKNPKNYTPETVADICAQVEAEPMDVYNAYAEMYATELADPAAYPDCAPLTTVATLLGVDPSG 251 T 0.0015 CBM26 pdbpercent F Bacteria T 7rbx 1 A,B,C,D A,B,C,D Q2YQA0_BRUA2 ISOCITRATASE,ISOCITRATE LYASE MAHHHHHHMGTLEAQTQGPGSMTDFYSLIPSAPKGRFDGIERAHTAEDVKRLRGSVEIKYSLAEMGANRLWKLIHEEDFVNALGALSGNQAMQMVRAGLKAIYLSGWQVAADANTASAMYPDQSLYPANAGPELAKRINRTLQRADQIETAEGKGLSVDTWFAPIVADAEAGFGGPLDAFEIMKAYIEAGAAGVHFEDQLASEKKCGHLGGKVLIPTAAHIRNLNAARLAADVMGTPTLIVARTDAEAAKLLTSDIDERDQPFVDYEAGRTAEGFYQVKNGIEPCIARAIAYAPYCDLIWMETSKPDLAQARRFAEAVHKAHPGKLLAYNCSPSFNWKKNLDDATIAKFQRELGAMGYKFQFITLAGFHQLNYGMFELARGYKDRQMAAYSELQQAEFAAEADGYTATKHQREVGTGYFDAVSLAITGGQSSTTAMKESTETAQFKPAAE 450 T 2.3E-50 ICL unp F Bacteria T 7rc6 2 B C AERA-DL XAXAXVXYXGXAXVXGXAXGXVXAXAXAXTXAXA 34 T 440 TAA-Trp-ring pdbhh F T 7rdh 3 G,H G,H De novo designed protein H3mb MSHHHHHHHHSENLYFQSGGSQHEKFLEWMLRKIEEAIKRGNKISAEFLINLAKNFIHVLGDDEIRRRLERLERQLH 77 T 5.2 YqaH pdbhh F T 7rdr 1 A,B,C A,B,C Circular tendon repeat protein SELAARCLIILFQQLVELARLAIESGDEELLRRVSEWLEEVIKDMRRVVEQALREGNSELAARILIILFQQLVELARLAIESGDEELLRRVSEWLEEVIKDMRRVVEQALREGNSELAARILIILFQQLVELARLAIESGDEELLRRVSEWLEEVIKDMRRVVEQALREGNSELAARILIILFQQLVELARLAIESGDEELLRRVSEWLEEVIKDMRRVVEQALREGNSELAARILIILFQQLVELARLAIESGDEELLRRVSEWLEEVIKDMRRVVEQALREGNSELAARILIILFQQLVELARLAIESGDEELLRRVSEWLEEVIKDMRRVVEQALREGNSELAARILIILFQQLVELARLAIESGDEELLRRVSEWLEEVIKDMRRVVEQALREGNSELACRILIILFQQLVELARLAIESGDEELLRRVSEWLEEVIKDMRRVVEQALREGN 456 T 0.0073 Glyco_hydro_88 pdbpssm F T 7rdv 5 E H PGCA_HUMAN Aggrecan core peptide EGRVRVNSAYQS 12 T 7.3 Peptidase_M15_3 pdbhh F Eukaryota T 7rdw 1 A,B,G,H C,D,M,N ENV_HV1H2 Glycoprotein 120 VWKEATTTLFCASDAKAYDTECHNVWATHACVPTDPNPQEVVLVNVTENFNMWKNDMVEQMHEDIISLWDQCLKPCVKLTNTSVITQACPKVSFEPIPIHYCAPAGFAILKCNNKTFNGTGPCTNVSTVQCTHGIRPVVSTQLLLNGSLAEEEVVIRSVNFTDNAKTIIVQLNTSVEINCTRPNNGGSGSGGNMRQAHCNISRAKWNNTLKQIASKLREQFGNNKTIIFKQSSGGDPEIVTHSFNCGGEFFYCNSTQLFNSTWFNSTWSTEGSNNTEGSDTITLPCRIKQIINMWQKVGKAMYAPPISGQIRCSSNITGLLLTRDGGNSNNESEIFRPGGGDMRDNWRSELYKYKVVKIEPLGSGSGHHHHHH 373 T 2.7999999999999998E-36 GP120 pdb T Viruses T 7re7 5 I,J C,F FETA_HUMAN PHE-MET-ASN-LYS-PHE-ILE-TYR-GLU-ILE FMNKFIYEI 9 T 3 Serum_albumin pdbhh F Eukaryota T 7rft 1 A,B A,B A0A2N0URA4_9FIRM Dockerin domain-containing protein GEETDTKIYFDASNLPAEWGTTKTVYCHLYAVAGDDLPETSWQGKAEKCKKDTATGLYYFDTAKLKSADGTNHGGLKDNADYAVIFSTIDTKSQSHQTCNVTLGKPCLGDTIYLTGGTVENTEDSSKRDFAATWKNNSDNYGPKAAITSLGHVTEGRFPIYLSRAEMVAQAIFNWAVKNPKNYTPETVADICAQVEAEPMDVYNAYAEMYATELADPAAYPDCAPLTTVATLLGVDPSG 239 T 0.0014 CBM26 pdbpercent F Bacteria T 7rfv 1 A A G3M192_9CAUD Tailspike protein MANKPTQPLFPLGLETSESSNIKGFNNSGTIEHSPGAVMTFPEDTEVTGLPSSVRYNPDSDEFEGYYENGGWLSLGGGGIRWETLPHAPSSNLLEGRGYLINNTTGTSTVVLPSPTRIGDSVTICDAYGKFATYPLTVSPSGNNLYGSTEDMAITTDNVSATFTWSGPEQGWVITSGVGLGQGRVYSREIFTQILASETSAVTLNTPPTIVDVYADGKRLAESKYSLDGNVITFSPSLPASTELQVIEYT 250 T 0.00013 T4_gp9_10 pdbhh T Viruses T 7rfx 2 B B ORF4B_MERS1 ORF4b RKARKRSHSPTKKLRYVKRRF 21 T 0.081 SUIM_assoc unppercent T Viruses T 7rg0 2 B B ORF4B_MERS1 ORF4b RKARKASHSPTKKLRYVKRRF 21 T 0.081 SUIM_assoc unppercent T Viruses T 7rg1 2 B B ORF4B_MERS1 ORF4b RKARKRSASPTKKLRYVKRRF 21 T 0.16 SUIM_assoc unp T Viruses T 7rg2 2 B B ORF4B_MERS1 ORF4b RKARKRSHSPTKKLAYVKRRF 21 T 0.081 SUIM_assoc unppercent T Viruses T 7rg3 2 B,C B,C ORF4B_MERS1 ORF4b RKARKRSHSPTKKLRYVKARF 21 T 0.081 SUIM_assoc unppercent T Viruses T 7rg6 2 B,C C,D ORF4B_BCHK5 ORF4B STRKRRRHPMNKRRYAKRRF 20 T 1.6 DUF1713 pdbhh T Viruses T 7rg8 2 B B HPSE_HUMAN Heparanase 8 kDa subunit MGSSHHHHHHSQDPNSSSQDVVDLDFFTQEPLHLVSPSFLSVTIDANLATDPRFLILLGSPKLRTLARGLSPAYLRFGGTKTDFLIFDPKKE 92 T 3.5 Glyco_hydro_79n unppercent F Eukaryota T 7ri4 1 A G ESPP_ECOLX EspPbeta9-12 DWKVTARACLXXXXXXXXXXXXXXXXXXXXXXXXXXXXXRMLMSVGLNAEIRDNVRFGLEFEKSAFGKYNVDNAVNANFRYSF 83 T 0.012 OMP_b-brl pdbpercent F Bacteria T 7rj1 1 A,B,C,D A,B,C,D Q59TS4_CANAL Chorismate mutase SMDFMKPETVLDLANIRQALVRMEDTIVFDLIERSQFFSSPSVYEKNKYNIPNFDGTFLEWALLQLEVAHSQIRRYEAPDETPFFPDQLKTPILPPINYPKILAKYSDEINVNSEIMKFYVDEIVPQVSCGQGDQKENLGSASTCDIECLQAISRRIHFGKFVAEAKYQSDKPLYIKLILDKDVKGIENSITNSAVEQKILERLIVKAESYGVDPSLKFGQNVQSKVKPEVIAKLYKDWIIPLTKKVEIDYLLRRLEDEDVELVEKYKK 269 T 0.18 IL10 pdb F Eukaryota T 7rjf 1 A,B A,B [L47W]MOPD-1 IQIREYKRCGQDEERVRRECKERGERQNCHYVIHKEGNCYVCGIICW 47 T 3.9 DUF2175 pdbhh F T 7rkc 1 A,B A,B D_3_633 SGSGSPELDELWKRVKKLVTELLEQAERAGDPEEIFKLLEVAQQLLWLAEMFLRLAAIQEKATDPEIQELAERVLRLIKRLLEEAERAGDPRRIKELVEVALALAKLLEMFYRLKEIQERATDPEIQELAERVLRLIKKLLKAAEEAGDPRKIYKLVFVALVLLHLLQTFYRLKEIQEKATDPEIQRKAQEVLEKIKRLLEAAERAGDPAKILLYVIRAQLLAMELKFAYRKR 233 T 0.007 PMC2NT pdb F T 7rle 2 B,D B,D CBP_HUMAN HISTONE LYSINE ACETYLTRANSFERASE CREBBP,PROTEIN-LYSINE ACETYLTRANSFERASE CREBBP GNLVPDAASKHKQLSELLRGGSGS 24 T 3.1 SRC-1 pdbhh F Eukaryota T 7rlv 2 B,E,H P,R,Q CSP_PLAVS CS GDRADGQPAGDRADGQPA 18 T 90 RNaseH_C pdbhh F Eukaryota T 7rlw 1 A,F P,R CSP_PLAVS CS GDRAAGQPAGDRAAGQPA 18 T 0.088 X unppercent F Eukaryota T 7rlx 3 C P CSP_PLAVS CS GDRADGQPAGDRAAGQPA 18 T 85 RNaseH_C pdbhh F Eukaryota T 7rly 1 A,H,I P,R,Q CSP_PLAVS CS DRAAGQPAGDRADGQPA 17 T 110 DUF5632 pdbhh F Eukaryota T 7rlz 1 A,F P,R CSP_PLAVB CS GDRAAGQPAGNGAGGQAA 18 T 0.01 DUF2000 unppercent F Eukaryota T 7rm1 3 E P Q2TM01_PLAVI peptide from Circumsporozoite protein variant VK247 EDGAGNQPGANGAGNQPGANGAGNQPG 27 T 0.33 Collagen unppssm F Eukaryota T 7rma 2 B C AN13D_HUMAN Ankyrin repeat domain-containing protein 13D RGQQEEEDLQRILQLSLTEH 20 T 0.00092 UIM pdbhh F Eukaryota T 7rmi 3 C S Substance P 6-11 QFFGLMX 7 T 0.00044 Tachykinin pdbhh F T 7rmx 1 A A Tunable symmetric protein, D_3_212 SGSGSTEEEEALLRWFQTLLAKFDELVKQLGDPRLLEEARRLQERLEEAKKRGDKRTIKQLAALLQMFVLIAQIFQLVEELGDPKLLEQAKRLLERLKEAVERGDEETIKELLDLAHMTYLIAQIFQLVEQLGDPRLLELAKELLKRLKEAQERGDRRTIERLLRLVQMTYLIAQIFQLVRQLGDPRLLETAKTLLTLLKLAFEEGDELLIKSLLTLVAETYRQAAAEQ 229 T 0.012 Tim44 pdb F T 7rmy 1 A A De Novo designed tunable homodimer, D_3-337 SGSGSSEELKKVQKMVSQILATAEAVLKLAKVLGDPKAVELAERILEDAKELAKRAESGDEETLRRAQTLLKVLKMVLEILLLAIKVELAAKELGDPKAVEAAQRILKQALRLLAEIKSGDEETLKRAQELLKVLKMVLRIIYLAIEVEKAAKELGDPTAVEAAQRILELALRLLQKVESGDEDTLRKALELLEVLYMVLRIIRLAIEVEKLAKKAGDPSAVEEAQRILKQALRLLKEISSGDEQTLDEAAKTLSFLAAELEAIAFAIRVKW 272 T 0.016 FANCI_S3 pdb F T 7rne 3 E,F F,G Ac-YKPVD-CHO XYKPVD 6 T 96 DUF932 pdbhh F T 7rng 3 E,F F,G Ac-ITAKD-CHO XITAKD 6 T 120 Alpha_Helical pdbhh F T 7rnw 2 E,F,G,H X,Y,Z,W ACE-DTY-LEU-GLN-TYR-ALA-VAL-LEU-ARG-HIS-LYS-ARG-ARG-GLU-SEC XXLQYAVLRHKRREX 15 T 9.8 Potass_KdpF pdbhh F T 7ro2 1 A A W0UUV5_9VIRU major capsid protein MSVSVVVPSAKATGAGGKKAKTFKVIKVSTPKVNNVHVPKIKKATRIHDPGAISGSLAKVTFGTNGFDIPTIAIALLIVGVIIGLSGLILSIFATATASAISNPSPGTLAYNLTHPLINGMVSFFSFFPTLYVLLGVTGIVLIAAGIISIIMEKFKT 157 T 0.0039 DUF4064 pdbpercent T Viruses T 7roa 1 A A Q836L4_ENTFA EntV SDQLEDSEVEAVAKGLEEMYANGVTEDNFKNYVKNNFAQQEISSVEEELNVNISDASTVVQARFNWNALGSCVANKIKDEFFAMISISAIVKAAQKKAWKELAVTVLRFAKANGLKTNAIIVAGQLALWAVQCGLS 136 T 0.00033 Potyvirid-P3 pdb F Bacteria T 7rol 1 A,B A,B Alpha-crystallin B chain peptide KVKVXWDVIEV 11 T 0.074 Ycf34 pdbhh F T 7rov 2 C,D E,F Cyclic peptide MP-9903 AXPLYISYDPVCRA 14 T 0.83 Ribosomal_L33 pdbhh F T 7roy 2 E,F G,H FNIP1_MOUSE Folliculin-interacting protein 1 GRNKSSLLFKESEETRTPNCNCKYCSHPVLG 31 T 0.092 zf_C2H2_13 pdbhh F Eukaryota T 7rps 1 A A W5SFE3_9SPIR Fibronectin-binding lipoprotein FbpB GSTGSDNQYKFKLKNITDSVEQALKIAKQIKDDLDIIEFHRIKLSNHYGIRAEEHEKQTAREELSKFSKDKLEADLKKLLSEIEKSLNAATILITYNDYGGNLQSDLSAKTTLEALKTEVSSLITKIQDFNNKDHQAYPTSYYNDYQTYQALRNPYSKLTLVKDLLTRT 169 T 0.036 Hormone_1 pdbpssm F Bacteria T 7rpy 1 A A A0A412DXQ2_9FIRM Cohesin-containing protein HHHHHHENLYFQGAADTTYVVAGTTNLTGYEWVGTPDAAPENVMTADGSVFTKTFSAVPAGKNYQLKVVANTGDEQKWIGLDGTDNNVTFDVETACDVTVTFDPATNKITVTGDGVKMVTDLEVNSITVVGNGEDNWLNGVAWGVDAEVNHMTQVSDKVYQIKYENIESADDAYQFKFAANDDWAASWGLPEQSATPIGEEFDLTFNGQNMLLNTVSAGFEEDSLVDVTITLDITNFDYSTRSGAKATVKVEPSTP 256 T 0.0024 DUF5121 pdbhh F Bacteria T 7rr3 1 A A M5AAG8_9CAUD Primase MIMEIPAIKALSRYAQWVIWKKERDTKIPYNPNNGKKASSTDPLAWGDIDEAQAGLVRYGANGLGFVLTKSDPFVFIDLDHVLDENKRVKCEWARQLLKEIKSYTEISPSGDGLHVVVSGKLPDYIKHKTKFDDGSALEVYESGRYMTITGEVFDGRDDIKELDLSILGEFAEHKIETKNAPVQIESATTLDDEAIIDLMKRKGQWPDAPADGDDWSSLDMSFANRLAFWCGKDIERMDRIFRQSPLMRQKWDRPTAGSTYGRITLKKACDFVDSVYDPALRNESDCPF 289 T 0.0044 VirE_N pdbhh T Viruses T 7rr4 1 A A M5AAG8_9CAUD Primase MIMEIPAIKALSRYAQWVIWKKAADTKIPYNPNNGKKASSTDPLAWGDIDEAQAGLVRYGANGLGFVLTKSDPFVFIDLDHVLDENKRVKCEWARQLLKEIKSYTEISPSGDGLHVVVSGKLPDYIKHKTKFDDGSALEVYESGRYMTITGEVFDGRDDIKELDLSILGEFAEHKIETKNAPVQIESATTLDDEAIIDLMKRKGQWPDAPADGDDWSSLDMSFANRLAFWCGKDIERMDRIFRQSPLMRQKWDRPTAGSTYGRITLKKACDFVDSVYDPALRNESDCP 288 T 0.0037 VirE_N pdbhh T Viruses T 7rro 5 I,J 8,9 CF107_BOVIN Uncharacterized protein C1orf158 homolog MQFLTAVSPQSSSTPSWKIETKYSTRVLTGNWTEERRKFIKATEKTPQTIYRKEYVPFPGHRPDQISRWYSKRTVEGLPYKYLITHHQEPSQRYLISTYDDHYNRHNYHPGLPELRTWNRHKLLWLPEKADFPLLGPPTNYGLYEQLKQKWLPPPEATLRESIYTSSYPRPPAGAMSRREHAIPVPPPRLQPVPHF 196 T 0.031 DUF1143 pdbpercent F Eukaryota T 7rro 13 ZB D SPAG8_BOVIN Sperm associated antigen 8 METSESTDRSQSRCLDLQPSSDGLGSSSDPFSSWDGRHRSALVAATAAASAAATAASTARAAALWTKSPAPYSHGNLLTEPSSDSLTERYTGPRFTHKISHGRLGFQPAYFSHIAWNPYTTNDLSSSRGPIPGSSSGPVPGSSSSPGPDSSSDPGPSSSSGPGGSPGGSGRGPGHGPGPGGGSGQGPGGGSGQGTDLGPAIDSRHSPGHGHGPRFNFSAPVGFRNPRGDLIPNYTGCKHHCHWEPQKQSWKFLKVSEPGARGLWKPPEVEGKSTVLSETLPRGQCLLYNWEEERATNYLDQVPVMQDGSESFFFRHGHRGLLTLQPQSPTSSCTTQKDSYQPPKSHCQPIRGKREAILEMLLRQQICKEVQAEQEPTRKDSEVESVTHHDYKKELVQAGPPAPTKIHDYHTEQPETFWLERAPQLPGVSNIRTLDTPFRKNCSFSTPVPLSLEQPLPFEPESYSQHGEISSLACQGGGQGGGGG 484 T 1.3 DUF1143 pdbhh F Eukaryota T 7rro 15 PD,XC F,E CF161_BOVIN CFAP161 MAQNLYGPRVRIGNWNEDVYLEEEIMKDFLAKRDKGQLLIQRNRRLKENLLRPMQLSVSEDGYIHYGDKVMLVSPDHPETEADLFLPGDLSLCMTPDEIKAHLSNELEVPCGLSAAQTKIPVGRNTFTILCAAGEVIGQVLRYGQNFRLGITGGFDDRMLYLSSDHRTLLKSSKRSWLQEVFLTHEDSYLNCWQAAFPHPQLRLEYEGSPVPANTKILITHCHTNRGLVAHRHLFLRTYFGQEAEVAAHTYLDSHRVEKPKNHWMLVTGAPRKDLSTMLDLPKPPAEDTRALEQEREQVSDPGARSTPDARGCVPQCTLPM 321 T 0.18 DUF1143 pdbhh F Eukaryota T 7rro 19 XE,YE,ZE H1,H2,H3 ODAD1_BOVIN Coiled-coil domain containing 114 MPFGLSAGSTRSEDGSEAFLEGMVDWELSRLQRQCKVMEDERRAYSKEVHQRINKQLEEIQRLEGVRHKLRVQISIAQSQVRRLRDSERLESMGHLLKCQVRVQAEVKELQAQNQALDREIQEWESRNSAHSKNARSPGCVQHDKVKSQRRIKSLENQLDKVICRFDIQLAQNATLREELDLLRIERNRYLNVDRKLQKEIQLLKDSVRNLMVSSTSAYTVREEAKAKLGMLRERAEKEVAQNETEVQILQRQIAHLEQLHHFLKLKNGDRQPDSAIVEKREQRAREVAEGLRKTSQEKLVLRYEDALNKLSQMTGESDPDLLVEKYLELEERNFAEFNFINEQNSELEHLQEEIKEMQEALVSGRRSEEDRRAQQEQQRAELQQRVDDVHSEADDLEARYHNFREQLEKLKTNIQHLFTRAQCDSTLINDLLGIKTHMRDRDISLFLSLIEKRLVQLLTVQAFLETQVVVMFNAALMVLGQSSEDFPKKVAPPQPPDNLEDPPGFEAKDDYPLSKEELLSSVMKAEQHLKELVESIKVESTPSMTSSTQKVSSSSRLVTQRPSQVPGSIMSHRTSGILVSSGGRATSSNVGHVTFGDSSATTGGLMSSRGSIPGRVTFRSPNSSSYLGSTGYVGSSRDHDSFEASKGPGSESSGGLGSSPGPASSPGPASSTGQASSTSKDSQSNY 687 T 2.4E-05 CCDC73 pdbhh F Eukaryota T 7rro 20 AF,BF,CF H4,H5,H6 ODAD3_BOVIN COILED-COIL DOMAIN-CONTAINING PROTEIN 151 MTSPLCWAAASNAMPSQDQISTPSKVKATQVQLKPYRSRGKGLVPVWHSLHSKAGPLHASEGKSAVNMQVAELQRKIQLLEGDRKAFYESTQWNIKKNQETINQLREETRVLQLQLTALLQGDEKVVQAVIREWKSEKPYLKNRTGQQALEHLDYRLNEKVKQLNALRHQLGLRQKWLEELQLQHSLRELEIAEAQDSNTEVAKTMRNLENRLEKARMKAEEAEHITSVYLQLKAYLQEESLHLGNRLDFMEAEVVRTKHELEELHLVNQEALNARDIAKNQLQYLEETVFRERKKRERYLTECKKRAEEKKLQNERMERKTQREHVLLQSDDTLQDSMYSKEEELKRRWSMYQMEVLFGKVKDATGVAETHAVVRRFLAQGDTFTQLEMLKSENEQTLLRLKQEKQRLQQELEDLKYSGEALLVSEQKRQAELQGRLKMEEQRRADAQNQLDRTMRALQITKEGLEHLAGKLNHIVVAGPTYEEGSPGASLDTKGSATPQPQETGRSVGKMDPKVDDYLPNLLGLVEEKLLKLHSQLENHNVPEMLRHIVDLEFYATLEGKLPSYNTRIALPVAGHKDKFFDEEESEEDDSDVVTRAALKMRSQKLIESRSKRRGRSRRS 621 T 0.0014 CCDC73 pdbhh F Eukaryota T 7rro 32 CP,DP,EP l,m,n FLTOP_BOVIN CILIA- AND FLAGELLA-ASSOCIATED PROTEIN 126 MATNYSANQYEKPFSPKYLQNWSLAKPTKERISSHEGYTQIIANDRGHLLPSVPRSKASPWGSFMGTWQMPLKVPPARATLTSRTAAGAASLTRWIQKNPDLLKASNGLRPEIFGKPHDPDSQKKLRKSITKTVQQAPSPTIIPSSPASNLSSPDQLQSSHPSAGHTPGPQSPLNSPKCPPGSPCLPHAGRNLAEV 196 T 0.34 DUF4248 pdbpssm F Eukaryota T 7rsl 1 A,B,C,D,E,F,G,H,I,J A,B,G,H,E,F,I,J,C,D SEI1_YEAST FEW LIPID DROPLETS PROTEIN 1 MKINVSRPLQFLQWSSYIVVAFLIQLLIILPLSILIYHDFYLRLLPADSSNVVPLNTFNILNGVQFGTKFFQSIKSIPVGTDLPQTIDNGLSQLIPMRDNMEYKLDLNLQLYCQSKTDHLNLDNLLIDVYRGPGPLLGAPGGSNSKDEKIFHTSRPIVCLALTDSMSPQEIEQLGPSRLDVYDEEWLNTIRIEDKISLESSYETISVFLKTEIAQRNLIIHPESGIKFRMNFEQGLRNLMLRKRFLSYIIGISIFHCIICVLFFITGCTAFIFVRKGQEKSKKHS 285 T 0.11 Seipin pdbpercent F Eukaryota T 7rsw 1 A,B A,B H0USR8_ROTGA HEMAGGLUTININ MSLGQSDLHIDPTQFIMYSGTISNGISYVNQAPSCGTVLSLKFTPGNSSLIENLHIEPYKVEVLKIEHVGDVSRATLLSDIVSLSTAQKKLLLYGFTQPGVQGLTGDVVSVETKRIPTPTQTNLLTIEDSIQCFTWDMNCANARSTNQDSRLIIYEQEDGRSHHHHHH 168 T 2.3E-05 Rota_VP4_MID unphh T Viruses T 7rsx 1 A,B,C,D C,B,A,D ENVELOPE GLYCOPROTEIN GP120 VWKEAKTTLFCASDAKAYEKEVHNVWATHACVPTDPNPQEMVLANVTENFNMWKNDMVEQMHEDIISLWDESLKPCVKLTGGSAITQACPKVSFDPIPLHYCAPAGFAILKCNNKTFNGTGPCRNVSTVQCTHGIKPVVSTQLLLNGSLAEEEIIIRSENLTNNAKTIIVHLNESVNIVCTRPNNGGSGSGGNIRQAHCNINESKWNNTLQKVGEELAKHFPSKTIKFEPSSGGDLEITTHSFNCRGEFFYCNTSDLFNGTYRNGTYNHTGRSSNGTITLQCKIKQIINMWQEVGRAIYAPPIEGEITCNSNITGLLLLRDGGNDDNDTETFRPGGGDMRDNWRSELYKYKVVEIKHHHHHH 362 T 7.9E-36 GP120 pdb F T 7rt7 1 A,C,E,G,I,K A,B,E,G,I,K A0A0H2Z8A2_PSEAB RhsP2 MGSSHHHHHHSQDPAGPIVELDAQGNEIYYRTLSEQHLEILRNNFEVPPTSETFISPLQSYSQEYDGKLVRLTASPGTMNELSKIGVTANSGTGLLLPDLPPARKGWKQNNALFKLEALKKPTINEGGGVINTGLGDGKALEIFNKNLIDFEVID 155 T 25 STAT1_TAZ2bind pdbhh F Bacteria T 7rt7 2 B,D,F,H,J,L D,C,F,H,J,L A0A367GXM0_PSEAI RhsI2 MKTIYNFKQRIKEDPEYIRKAHELTLNTTKPKAGLKGTYGLLGSKEWWDNLENGSIPQKEISGTIKKVYLTGQDNTEDFNTIDIETENKTLCTEGTYTNKNTDRKHYEAGKKITIKYAFDPLKKPKPNGDIDYSKIVVEILISE 144 T 0.95 DUF6152 pdbhh F Bacteria T 7rte 4 D D LMBL3_HUMAN H-L(3)MBT-LIKE PROTEIN 3,L(3)MBT-LIKE PROTEIN 3,L3MBT-LIKE 3,MBT-1 KKATATTTWMVPTA 14 T 40 RNF180_C pdbhh F Eukaryota T 7rti 4 D D LMBL3_HUMAN H-L(3)MBT-LIKE PROTEIN 3,L(3)MBT-LIKE PROTEIN 3,L3MBT-LIKE 3,MBT-1 KKATATTWMVPTA 13 T 1.9 ATP-synt_Z pdbhh F Eukaryota T 7ru9 3 D,G D,G BAG6_HUMAN BAG FAMILY MOLECULAR CHAPERONE REGULATOR 6,BCL2-ASSOCIATED ATHANOGENE 6,BAG-6,HLA-B-ASSOCIATED TRANSCRIPT 3,PROTEIN G3,PROTEIN SCYTHE GPGSAETEPWAAAVPPEWVPIIQQDIQSQRKVKPQPPLSDAYLSGMPAKRRKTMQGEGPQLLLSEAVSRAAKAAGARPLTSPESLSRDLEAPEVQESYRQQLRSDIQKRLQEDPNYSPQRFPNAQRAFADDP 132 T 0.02 DUF2939 pdb F Eukaryota T 7rx4 1 A,B A,a AS2 peptide QARILEADARILQAYANILSAHAEILRAE 29 T 4.1 Proho_convert pdbhh F T 7rx5 1 A A F1-N2 nanotube QAEILKADAENNRAYARILEAHAEILKAQ 29 T 35 DUF167 pdbhh F T 7rxq 2 B B CAC1S_RABIT CALCIUM CHANNEL,L TYPE,ALPHA-1 POLYPEPTIDE,ISOFORM 3,SKELETAL MUSCLE,DIHYDROPYRIDINE RECEPTOR ALPHA-1S SUBUNIT,DHPR,VOLTAGE-GATED CALCIUM CHANNEL SUBUNIT ALPHA CAV1.1 EERIFRRTGGLFGQVD 16 T 0.12 CAC1F_C unppercent F Eukaryota T 7ryf 43 QA m RS13_ACIB5 30S ribosomal protein S13 MARIAGVNIPDNKHAVISLTYIFGIGRHTAKNILAAVGITETTKIRELDDAQLDAIRAEVAKVPTEGDLRREISMNIKRLMDLGCYRGLRHRRSLPVRGQRTKTNARTRKGPRKPIKK 118 F F Bacteria T 7rz3 1 A A Xt3a SAVLWNQQYDKTVCNQEGEFCSKSGVDCCAGLSCRKYNLMGYGVCAAQTCSEEGTFCSLSDSDCCSGLKCKRRGHGYGECSK 82 T 0.0031 Toxin_12 pdbpssm F T 7rzd 3 C C KMT2A_HUMAN N-TERMINAL CLEAVAGE PRODUCT OF 320 KDA,P320 EPRSPSHSM 9 T 2.3 DUF5069 pdbhh F Eukaryota T 7rzz 3 C C P91820_CAEEL Lateral Signaling Target GSNSSTIAYSKSQHEAPKQLLQLRSEIKPLIPLNQP 36 T 5.7 AgrD pdbhh F Eukaryota T 7s00 1 A,E c,e A0A172JI16_BPPB1 DNA-directed RNA polymerase beta subunit MISNFRKFHGNKNQEKFNENLILNKENESILNYLDPICKTLEIIPEITYLGSSVEPINKVYKFNKEEKTSDIERSELQLIKMSFLIEKDDKKEEINKFIYFPKLIDSQYFIINGNRYYPIYQLLDSGTYRTNKALTLKTLLMPIVLREKKETFDDINGETHTMLNVDLDLFKSKVPFLIYFFSKFGFEGTLEYFGLQDLIHVLMKEDLDQLDEDEINDNVIFMITKNISLVVDKNFFSNKNNQIIIATLLNCFNTRIKIDKIYEKDYWVKKLGGYFTTNNSNKQEKGEGIILSFERILDEWTKKILRTEEKNKEDIYSVVRWMINNYLALVKQDNMNLANKRIRLYEYLLHPLLIKFSKGTYRVLNNRNSNKFEKIKTIFSNIQEGFLVKKIINNELLRYDNSVNSISLFTLILRYTQSGPQSPFSSNSTNNKLRGLHPSYLGRLGLTSTSAGDPGASGSLTPFLELPENSYMHFTEEPEINLNIDDISIDEVIES 496 T 0.0014 RNA_pol_Rpb2_3 pdbpssm T Viruses T 7s01 1 A A A0A172JIC8_BPPB1 DNA-directed RNA polymerase subunit MDILENYVSFDEQARDINIAFDKLFGRDDISHMNNFSINKRSYYNCLDQISDDLNLVLNKYNDLAYSLLEIRYNMATKENYTHMEFYSDIERLFIKNEKLLNVISDIVEEEYDLDLNQASKGKKINIELQVTDNLNKIYLKSSVLMRILIPILCDFNCDDDINEVLVYDIFKEVIKSFDDGKKNALNKLYKIIYSRVFETKYSDVVIWTYLKNMSTDLMIIVKDYFKVIIKKIFPKLKHNSSVISYLDVVIKQKLKYLFTFKYPISYKPLKAETTDDEELSEQERMEINLLRNDQGNSIINECSIKQEIAKIKKKYNVTDEVMKEFINGRELNSIQIYLVKIYYSNKFKVNSNKNDIFYLLYGMTRELGEMNFSIIPEILSCAIAPNVRKMNNRKKLVDKIIHSDKYSYLLKSYLPIKNILDKNNVILQLMTIKNAKFMNKENKEVDFSTDHLAEEVLDMLLCI 464 T 23 GvpK pdbhh T Viruses T 7s07 3 C C GP42_EBVB9 Soluble gp42 KPNVEVWPVDPPPPVNFNKTAEQEYGDKEVKLP 33 T 5.4 MarB unphh T Viruses T 7s1o 1 A,B A,B POTE1_HUMAN HPOT1,POT1-LIKE TELOMERE END-BINDING PROTEIN SNIEVERCQQLSATILTDHQYLERTPLCAILKQKAPQQYRIRAKLRSYKPRRLFQSVKLHCPKCHLLQEVPHEGDLDIIFQDGATKTPDVKLQNTSLYDSKIWTTKNQKGRKVAVHFVKNNGILPLSNECLLLIEGGTLSEICKLSNKFNSVIPVRSGHEDLELLDLSAPFLIQGTIHHYGCKQCSSLRSIQNLNSLVDKTSWIPSSVAEALGIVPLQYVFVMTFTLDDGTGVLEAYLMDSDKFFQIPASEVLMDDDLQKSVDMIMDMFCPPGIKIDAYPWLECFIKSYNVTNGTDNQICYQIFDTTVAEDVI 313 T 0.032 CDC24_OB3 pdbhh F Eukaryota T 7s1u 1 A,B A,B POTE1_HUMAN HPOT1,POT1-LIKE TELOMERE END-BINDING PROTEIN QLSATILTDHQYLERTPLCAILKQKAPQQYRIRAKLRSYKPRRLFQSVKLHCPKCHLLQEVPHEGDLDIIFQDGATKTPDVKLQNTSLYDSKIWTTKNQKGRKVAVHFVKNNGILPLSNECLLLIEGGTLSEICKLSNKFNSVIPVRSGHEDLELLDLSAPFLIQGTIHHYGCKQCSSLRSIQNLNSLVDKTSWIPSSVAEALGIVPLQYVFVMTFTLDDGTGVLEAYLMDSDKFFQIPASEVLMDDDLQKSVDMIMDMFCPPGIKIDAYPWLECFIKSYNVTNGTDNQICYQIFDTTVAEDVI 304 T 0.022 Zn_Tnp_IS1 pdbpssm F Eukaryota T 7s2t 2 D,E,F F,G,H ENCB_MYXXD EncB targeting peptide ESHPLTVGSLRR 12 T 0.11 DUF2076 unppercent F Bacteria T 7s3d 6 DA,F,R f,F,Q B4WP24_SYNS7 PHOTOSYSTEM I REACTION CENTER SUBUNIT III, PSAF2 MHKTIRKFFSLLLAAFVWLSVVSPAVAASEGYTDTHLVPCASSPAFNERMQNAPEGYYFDTPYQSYAANLLCGAEGLPHQQLRFDRAIDVLIPFGIFFYVAGFIGWSGRAYLISSNRNSKPEETEIFIDVALAIKSFVQGLLWPLLAVKELTTGELTAPVSEVSVSPR 168 T 0.0004 PSI_PsaF pdbpercent F Bacteria T 7s3d 7 EA,G,S i,I,R B4WP23_SYNS7 PsaI2 MVDATQLEGAYAAAWLPWIMIPMITYILPFPIFAIAFLWIEREGGEGGLDIDVMGSNAMSNEAMGRDISS 70 T 0.46 PSI_8 pdb F Bacteria T 7s4a 2 B,D B,D PALB2_HUMAN Partner and localizer of BRCA2 MHHHHHHSSGVDLGTENLYFQSNMLSLKQLLSFLSITDFQLPDEDFGPLKLEKVKSC 57 T 5 Eaf7 pdbhh F Eukaryota T 7s4g 3 G,H,I,J G,I,J,K LY66D_HUMAN PROTEIN LY6-D,MEGAKARYOCYTE-ENHANCED GENE TRANSCRIPT 1 PROTEIN DCYLGDLCN 9 T 1.1E-05 PLA2_inh unphh F Eukaryota T 7s4o 2 C,D C,D LEU-PRO-ALA-THR-SER-GLY XLPATSGKX 9 T 5.9 Pas_Saposin pdbhh F T 7s4q 2 D,E,F F,G,H ENCC_MYXXD EncC targeting peptide PEKRLTVGSLRR 12 T 4.6 DUF6225 pdbhh F Bacteria T 7s51 2 C,D C,D LEU-PRO-ALA-THR-ALA XLPATAGKX 9 T 29 Pas_Saposin pdbhh F T 7s59 2 C,D 4,2 CCL7_HUMAN;CCL8_HUMAN HC14,MONOCYTE CHEMOATTRACTANT PROTEIN 2,MONOCYTE CHEMOTACTIC PROTEIN 2,MCP-2,SMALL-INDUCIBLE CYTOKINE A8,MONOCYTE CHEMOATTRACTANT PROTEIN 3,MONOCYTE CHEMOTACTIC PROTEIN 3,MCP-3,NC28,SMALL-INDUCIBLE CYTOKINE A7 QPDSVSIPITCCFNVINKKIPKQRLESYRRTTSSHCPREAVIFKTKLDKEICADPTQKWVQDFMKHLDKKTQTPKL 76 T 7.5E-26 IL8 unppssm F Eukaryota T 7s5g 3 C C 8VH-Z02-ALA-DAL-PHE-FTR-PRO-THR-0A1-3WX XXAXFWPTXXX 11 T 0.6 Gla pdbhh F T 7s5j 2 B B A3DCU2_ACET2 CtA peptide LNIGRELTDEELMEMTGGSTFSIQ 24 T 0.012 L_biotic_typeA pdbhh F Bacteria T 7s5u 1 A,B,C,D A,B,D,C KL61_DROME BIPOLAR KINESIN KRP-130 MAQSLQDQTNLHNKLIGEVMKISDQHSQAFVAKLMEQMQQQQLLMSKEIQTNLQVIEENNQRHKAMLDSMQEKFATIIDSSLQSVEEHAKQMHKKLEQLGAMSLPDAEELQNLQEELANERALAQQEDALLESMMMQMEQIKNLRSKNSISMSVHLNKMEESRLTRNHRIDDIKSGIQDYQKLGIEASQSAQAELTSQMEAGMLCLDQGVANCSMLQVHMKNLNQKYEKETNENVGSVRVSGHHHHHH 248 T 0.00078 DUF1340 pdb F Eukaryota T 7s76 1 A,B,C,D,E,F A,B,C,D,E,F CAPSD_HBVD1 CORE ANTIGEN,CORE PROTEIN,HBCAG,P21.5 MGSMDIDPYKEFGATVELLSFLPSDFFPSVRDLLDTAAALYRDALESPEHCSPHHTALRQAILCWGDLMTLATWVGTNLEDPASRDLVVSYVNTNVGLKFRQLLWFHISCLTFGRETVLEYLVSFGVWIRTPPAARPPNAPILSTLPETTVVKLENLYFQ 160 T 3.9E-25 Hepatitis_core unp T Viruses T 7s79 3 C E KMT2A_HUMAN N-TERMINAL CLEAVAGE PRODUCT OF 320 KDA,P320 EPRXPSHSM 9 T 23 Ribosomal_S12 pdbhh F Eukaryota T 7s7e 3 C C DOT1L_HUMAN DOT1-LIKE PROTEIN,HISTONE H3-K79 METHYLTRANSFERASE,H3-K79-HMTASE,LYSINE N-METHYLTRANSFERASE 4 LPASPAHQL 9 T 18 YkpC pdbhh F Eukaryota T 7sag 1 A A A0A384E130_9METZ Barrettide C NVVPCFCVEDETSGAKTCIPDNCDASRGTNP 31 T 5.1 IL4Ra_N pdbhh F Eukaryota T 7sau 2 C,D,E,F,G C,D,E,F,G A0A085L0W4_9FLAO Gliding motility protein GldL MPLIDVNGKKFKNFLAKLYGFGASIVILGAMFKILHWTGADLMLIIGLSTEAVIFFFSAFEKPAPEYDWTLVYPELAGVEDLDSKNNALVPQGGTSLTQELDNMLKEASIDEELIKSLGDGLRKFGDAALKLNETIDAAEGTQKYTEQITLAAKHMESLNALYAVQLEGTASQMELQNALIEKLGSSIENTEKLSTELSELVTNMSALNKVYGGMLSAMGVSK 223 T 0.0034 DUF489 unppssm F Bacteria T 7sax 2 C,D,E,F,G C,D,E,F,G A0A1I6R6J4_9SPHI GldL MAKKTKFKFGINTLINWGATVVIIGLMFKILHLKGGEWMIGVGLAVEALLFFIMGFMQAEQEPDWTRVYPELDEDYNGELPTRSVRAVAQPVATGNTAALDKLLQDAKIDENLIGNLGDGLRTFSDKVASISKVADTAVATNQFADKLNAASTGAAQLSNAFERAASDLQTFNESAADMQQFKEQVSTFNKNLSSLNAIYGNMLSAMNTNRS 212 T 0.0057 DASH_Dam1 pdb F Bacteria T 7saz 2 C,D,E,F,G C,D,E,F,G F9YQB6_CAPCC GldL MAQSNKTTKKIFQMAYGIGASIVILGALFKILHWEIDFGGFKLGGGFLLAFGLITEAIIFFISAFEPVEEGYDWSLVYPELVGGEARQNQLVGRGVVSQLSEEDKAIKESLSEKLDNLLAEAQIDANLMHSLSASIQNFAGAAKEIAPVTDAMVSTHKYGEELSMAAAHLESLNSLYKLQLERTENQVSAQAGVVDNLNSLNEQMMSFKDNLKSLNSVYGGMLSAMGK 228 T 0.0067 DASH_Dad4 pdbpssm F Bacteria T 7sba 2 H H Q6ZEI5_SYNY3 Cas5d MTKIYRCKLTLHDNVFFASREMGILYETEKYFHNWALSYAFFKGTIIPHPYGLVGQNAQTPAYLDRDREQNLLHLNDSGIYVFPAQPIHWSYQINTFKAAQSAYYGRSVQFGGKGATKNYPINYGRAKELAVGSEFLTYIVSQKELDLPVWIRLGKWSSKIRVEVEAIAPDQIKTASGVYVCNHPLNPLDCPANQQILLYNRVVMPPSSLFSQSQLQGDYWQIDRNTFLPQGFHYGATTAIAQDSPQLSLLDTN 254 T 27 PNISR pdbhh F Bacteria T 7sba 3 I I Q6ZEI7_SYNY3 Cas10d MTTLLQTLLIRTLSEQKDYILLEYFQTILPALEEHFGNTSGLGGSFISHQKHFGTQGYDTEKAKKMAQGFAKKGDQTLAAHILNALLTTWNVMQELEFPLNDIERRLLCLGITLHDYDKHCHAQDMAAPEPDNIQEIINICLELGKRLNFDEFWADWRDYIAEISYLAQNTHGKQHTNLISSNWSNAGYPFTIKERKLDHPLRHLLTFGDVAVHLSSPHDLVSSTMGDRLRDLLNRLGIEKRFVYHHLRDTTGILSNAIHNVILRTVQKLDWKPLLFFAQGVIYFAPQDTEIPERNEIKQIVWQGISQELGKKMSAGDVGFKRDGKGLKVSPQTSELLAAADIVRILPQVISVKVNNAKSPATPKRLEKLELGDAEREKLYEVADLRCDRLAELLGLVQKEIFLLPEPFIEWVLKDLELTSVIMPEETQVQSGGVNYGWYRVAAHYVANHATWDLEEFQEFLQGFGDRLATWAEEEGYFAEHQSPTRQIFEDYLDRYLEIQGWESDHQAFIQELENYVNAKTKKSKQPICSLSSGEFPSEDQMDSVVLFKPQQYSNKNPLGGGQIKRGISKIWSLEMLLRQAFWSVPSGKFEDQQPIFIYLYPAYVYAPQVVEAIRELVYGIASVNLWDVRKHWVNNKMDLTSLKSLPWLNEEVEAGTNAQLKYTKEDLPFLATVYTTTREKTDTDAWVKPAFLALLLPYLLGVKAIATRSMVPLYRSDQDFRESIHLDGVAGFWSLLGIPTDLRVEDITPALNKLLAIYTLHLAARSSPPKARWQDLPKTVQEVMTDVLNVFALAEQGLRREKRDRPYESEVTEYWQFAELFSQGNIVMTEKLKLTKRLVEEYRRFYQVELSKKPSTHAILLPLSKALEQILSVPDDWDEEELILQGSGQLQAALDRQEVYTRPIIKDKSVAYETRQLQELEAIQIFMTTCVRDLFGEMCKGDRAILQEQRNRIKSGAEFAYRLLALEAQQNQN 975 T 0.002 HD pdbpssm F Bacteria T 7sba 4 J,K J,K Q6ZEI7_SYNY3 Cas11d MTEKLKLTKRLVEEYRRFYQVELSKKPSTHAILLPLSKALEQILSVPDDWDEEELILQGSGQLQAALDRQEVYTRPIIKDKSVAYETRQLQELEAIQIFMTTCVRDLFGEMCKGDRAILQEQRNRIKSGAEFAYRLLALEAQQNQN 146 T 0.018 RE_TaqI pdbpssm F Bacteria T 7sc5 1 A,C,E A,C,E Q2N0S6_9HIV1 ENVELOPE GLYCOPROTEIN GP160,GLYCOPROTEIN 120,SURFACE PROTEIN GP120,TRANSMEMBRANE PROTEIN GP41 AENLWVTVYYGVPVWKDAETTLFCASDARAYETEKHNVWATHACVPTDPNPQEIHLENVTEEFNMWKNNMVEQMHTDIISLWDQSLKPCVKLTPLCVTLQCTNVTNNITDDMRGELKNCSFNMTTELRDKKQKVYSLFYRLDVVQINENQGNRSNNSNKEYRLINCNTSAITQACPKVSFEPIPIHYCAPAGFAILKCKDKKFNGTGPCPSVSTVQCTHGIKPVVSTQLLLNGSLAEEEVMIRSENITNNAKNILVQFNTPVQINCTRPNNNTRKSIRIGPGQAFYATGDIIGDIRQAHCNVSKATWNETLGKVVKQLRKHFGNNTIIRFANSSGGDLEVTTHSFNCGGEFFYCNTSGLFNSTWISNTSVQGSNSTGSNDSITLPCRIKQIINMWQRIGQAMYAPPIQGVIRCVSNITGLILTRDGGSTNSTTETFRPGGGDMRDNWRSELYKYKVVKIEPLGVAPTRCKRRVV 474 T 3.9E-50 GP120 pdb T Viruses T 7sc7 8 BA,DB BG,CO P74135_SYNY3 Sll1873 protein MLKKLFGAKKEFYVQLDESQAPAQVEEADVAIVKSEVAPVEKPAPTTSKKTSIKKKSATKAAAPVETPASAPVAPAPKAKVDPSQVAFASGDPIPQNVARRTPGPSLNRFKEMARQVKVKR 121 T 0.28 LZ_Tnp_IS66 pdbpercent F Bacteria T 7sfr 51 YA v A0A3E0UTA6_MYCTX peptide AKRGRKKRDRKYSKANHGKRPN 22 T 0.2 DUF6254 pdb F Bacteria T 7sfy 1 A,B,D,E A,B,D,E MS18A_HUMAN FAPP1-ASSOCIATED PROTEIN 1 SNAELFNLESRVEIEKSLTQMEDVLKALQMKLWEAESKLSFATCKS 46 T 1.1 Trimer_CC unppercent F Eukaryota T 7sfy 2 C,F C,F MS18B_HUMAN CANCER/TESTIS ANTIGEN 86,CT86,OPA-INTERACTING PROTEIN 5,OIP-5 QNVPLSEKIAELKEKIVLTHNRLKSLMKILSEVTPDQSKPEN 42 T 0.071 SAND unppssm F Eukaryota T 7sg1 5 G,H C,H DQ2-glia-alpha1a peptide LQPFPQPELPYGSGGS 16 T 5.7 Sod_Fe_N pdbhh F T 7sg2 5 G,H C,H DQ2-glia-omega1 peptide QPFPQPEQPFPGS 13 T 4.7 Statherin pdbhh F T 7sgz 8 H H DDC1_YEAST DNA Damage Checkpoint protein DDC1 MSFKATITESGKQNIWFRAIYVLSTIQDDIKITVTTNELIAWSMNETDTTLCQVRFQKSFFEEYEFKPHEIVFGENGVQVIEDTYGNSHKLYSFRVNGRHLTTISRKPDGDGIKSFTIAVNNTSTCPESLANRLIVVIEMDSLIVKEYCPQFQPIKYDPIIINLKYKRRFLDVFGTAASDRNPQEPLDPKLLDVFTNTERELTSALFNEEVESDIRKRNQLTAADEINYICCNSTLLKNFLDNCNVNVTDEVKLEINVHRLSITAFTKAVYGKNNDLLRNALSMSNTISTLDLEHYCLFTKSIIFKLKDFKNFITIGPSWKTTQDGNDNISLWFCHPGDPILMQMQKPGVKLELVEVTDSNINDDILEGKFIKTAISGSKEEAGLKDNKESCESPLKSKTALKRENLPHSVAGTRNSPLKVSYLTPDNGSTVAKTYRNNTARKLFVEEQSQSTNYEQDKRFRQASSVHMNMNREQSFDIGTTHEVACPRNESNSLKRSIADICNETEDPTQQSTFAKRADTTVTWGKALPAADDEVSCSNIDRKGMLKKEKLKHMQGLLNSQNDTSNHKKQDNKEMEDGLGLTQVEKPRGIFD 593 T 6.4E-09 Rad9 pdbpercent F Eukaryota T 7sh2 8 H H DDC1_YEAST DNA damage checkpoint protein DDC1 MSFKATITESGKQNIWFRAIYVLSTIQDDIKITVTTNELIAWSMNETDTTLCQVRFQKSFFEEYEFKPHEIVFGENGVQVIEDTYGNSHKLYSFRVNGRHLTTISRKPDGDGIKSFTIAVNNTSTCPESLANRLIVVIEMDSLIVKEYCPQFQPIKYDPIIINLKYKRRFLDVFGTAASDRNPQEPLDPKLLDVFTNTERELTSALFNEEVESDIRKRNQLTAADEINYICCNSTLLKNFLDNCNVNVTDEVKLEINVHRLSITAFTKAVYGKNNDLLRNALSMSNTISTLDLEHYCLFTTIEDEKQDKRSHSKRREHMKSIIFKLKDFKNFITIGPSWKTTQDGNDNISLWFCHPGDPILMQMQKPGVKLELVEVTDSNINDDILEGKFIKTAISGSKEEAGLKDNKESCESPLKSKTALKRENLPHSVAGTRNSPLKVSYLTPDNGSTVAKTYRNNTARKLFVEEQSQSTNYEQDKRFRQASSVHMNMNREQSFDIGTTHEVACPRNESNSLKRSIADICNETEDPTQQSTFAKRADTTVTWGKALPAADDEVSCSNIDRKGMLKKEKLKHMQGLLNSQNDTSNHKKQDNKEMEDGLGLTQVEKPRGIFD 612 T 6.8E-09 Rad9 pdbpercent F Eukaryota T 7sh3 1 A A Synthetic VirB8 Miniprotein Binder SGGNAEEITEKATLVGIEAWLLAKDEEQKKKVRTLNRQVKKLLQQNDLDQAKRVLDQLKSVLEDLKS 67 T 0.0067 DUF3375 pdb F T 7sjp 1 A E HTRA1_HUMAN HtrA1-LoopA peptide RKLPFSKREVP 11 T 3.7 Integrin_alpha pdbhh F Eukaryota T 7ska 1 A,C,E A,N,Y Q6TAN8_9HIV1 ENV POLYPROTEIN KLWVTVYYGVPVWKEATTTLFCASDAKAYDTEVHNVWATHACVPTDPNPQEVVLENVTENFNMWKNNMVEQMHEDIISLWDQSLKPCVKLTPLCVTLNCTDLRNVTNINNSSEGMRGEIKNCSFNITTSIRDKVKKDYALFYRLDVVPIDNDNTSYRLINCNTSTITQACPKVSFEPIPIHYCTPAGFAILKCKDKKFNGTGPCKNVSTVQCTHGIRPVVSTQLLLNGSLAEEEVVIRSSNFTDNAKNIIVQLKESVEINCTRPNNNTRKSIHIGPGRAFYTTGEIIGDIRQAHCNISRTKWNNTLNQIATKLKEQFGNNKTIVFNQSSGGDPEIVMHSFNCGGEFFYCNSTQLFNSTWNFNGTWNLTQSNGTEGNDTITLPCRIKQIINMWQEVGKAMYAPPIRGQIRCSSNITGLILTRDGGTNSSGSEIFRPGGGDMRDNWRSELYKYKVVKIEPLGVAPTK 465 T 1.9E-55 GP120 pdbpercent T Viruses T 7skn 1 A,B,C,D A,B,C,D De novo synthetic protein DIG8-CC GHMRIEVRVDNGRVRVRNGTDRPCRVRVTAGGETREYTVNPGTELEVELSPEQQNNAEVEVECGNEKYRFQLG 73 T 0.00047 DUF756 pdb F T 7skz 1 A A SPIKE_SARS2 PRO-SER-LYS-ARG-SER-PHE-ILE-GLU-ASP-LEU-LEU-PHE-ASN PSKRSFIEDLLFN 13 T 0.00014 CoV_S2 pdbhh T Viruses T 7sl5 3 C,F,I,L C,F,I,L SPIKE_SARS2 PRO-SER-LYS-ARG-SER-PHE-ILE-GLU-ASP-LEU-LEU-PHE-ASN KPSKRSFIEDLLFNK 15 T 0.0004 CoV_S2 pdbhh T Viruses T 7smc 2 B,D B,D ARI4A_HUMAN ARID DOMAIN-CONTAINING PROTEIN 4A,RETINOBLASTOMA-BINDING PROTEIN 1,RBBP-1 GPETLVCHEVDLDDL 15 T 48 DUF126 pdbhh F Eukaryota T 7smd 2 B B EID1_HUMAN 21 KDA PRB-ASSOCIATED PROTEIN,CREBBP/EP300 INHIBITORY PROTEIN 1,E1A-LIKE INHIBITOR OF DIFFERENTIATION 1,EID-1 LTEELGCDEIIDRE 14 T 0.071 Nse4-Nse3_bdg unphh F Eukaryota T 7sme 2 B B HDAC1_HUMAN HD1,PROTEIN DEACETYLASE HDAC1,PROTEIN DECROTONYLASE HDAC1 RIACEEEFSD 10 T 2.6 RAM pdbhh F Eukaryota T 7smf 2 B,D B,D Histone deacetylase 1 DIYCYEEFSD 10 T 3.9 End_beta_propel pdbhh F T 7smj 1 A A AI-designed TIM-barrel F2N HHHHHHENLYFQSDIAIVDADNPADAIQQVKDLRKYGAKLIAYKSKSSEELKLALKAGADIAIVDADNPADAIQQVKDLRKYGAKLIAYKSKSSEELKLALKAGADIAIVDADNPADAIQQVKDLRKYGAKLIAYKSKSSEELKLALKAGADIAIVDADNPADAIQQVKDLRKYGAKLIAYKSKSSEELKLALKAG 196 T 0.0034 TrkA_N pdb F T 7smk 3 C C CSOCA_HALNC CSOSCA,CARBONIC ANHYDRASE,CA,CARBOXYSOME SHELL PROTEIN CSOS3 MNTRNTRSKQRAPFGVSSSVKPRLDLIEQAPNPAYDRHPACITLPERTCR 50 T 3.3 zf-LYAR pdbhh F Bacteria T 7smu 1 A,B,C,D,E,F D,E,C,B,F,A Consomatin-Ro1 EGYKCVXKTCMPA 13 T 0.6 Urotensin_II pdbhh F T 7snx 1 A A LUCI_OPLGR 19KOLASE SDNMVFTLEDFVGDWEQTAAYNLDQVLEQGGVSSLLQNLAVSVTPIQRIVRSGENALKIDIHVIIPYEGLSADQMAQIEEVFKVVYPVDDHHFKVILPYGTLVIDGVTPNMLNYFGRPYEGIAVFDGKKITVTGTLWNGNKIIDERLITPDGSMLFRVTINSA 163 T 0.026 DUF4950 unp F Eukaryota T 7snx 2 B B LUCI_OPLGR 19KOLASE XVTGYRLFEEIL 12 T 0.24 Lipocalin_7 unphh F Eukaryota T 7sny 1 A A LUCI_OPLGR 19KOLASE MVFTLEDFVGDWEQTAAYNLDQVLEQGGVSSLLQNLAVSVTPIQRIVRSGENALKIDIHVIIPYEGLSADQMAQIEEVFKVVYPVDDHHFKVILPYGTLVIDGVTPNMLNYFGRPYEGIAVFDGKKITVTGTLWNGNKIIDERLITPDGSMLFRVTINSHHHHHH 165 T 0.026 DUF4950 unp F Eukaryota T 7som 7 DF,EF,FF m,n,o A0A2K3DLJ2_CHLRE FAP65 MSERPHQSGPRSWAEDCNQYRTTRGSKSYSTLEDAGRVPERYSRTNYVPFLAERHPLYSYNDLGEDGKGKVRLDPATQADRFSRHGWGDVSLLKQEGLAGQPHPRSSEAGPTRSGRLRPGSPRGADGRNGLYGVLQMTEAGGTDSWVGHPQIDPTKGKRAVAPPPDPKGRRDLFDVLHARSPGMPADDSWLGHQKIDPARGKAHPPGPEQSRGRRDLTELFTMNILHDPRRLELLQKGADKHGDAWCGNILIDPARGKKPVEDVAAAGQNLHGATFKPLPAGTPLPDAPRRHTRPAPAPASDAYAAEVIRGEAAGDDWGPRTKRSVPDMPKPNAFDGRTDLYAHMQYRPLSNGEQGKYAKAFDDRGTRGRRQLHTPGDADPAKEALLTWKPEMRVGQFVKNGGLAQENRVRGHTLRATAGR 421 T 12 zf-C2H2_7 pdbhh F Eukaryota T 7som 8 GF,HF,IF p,q,r A0A2K3DKW3_CHLRE FAP70 MFRQEEQPKTGVRQFGTHTTGKVDHMLGTHATVRPEYKDPPPKRTVPTSQLEAVRNIETQYIKARKAAEDARERQGTSHLYAAGKGWGH 89 T 14 DUF6422 pdbhh F Eukaryota T 7sq1 2 B,D,H C,E,G Q2N0S6_9HIV1 ENV POLYPROTEIN ENLWVTVYYGVPVWKDAETTLFCASDAKAYETEKHNVWATHACVPTDPNPQEIHLENVTEEFNMWKNNMVEQMHEDIISLWDQSLKPCVKLTPLCVTLQCTNVTNNITDDMRGELKNCSFNMTTELRDKKQKVYSLFYRLDVVQINENQGNRSNNSNKEYRLINCNTSAITQACPKVSFEPIPIHYCAPAGFAILKCKDKKFNGTGPCPSVSTVQCTHGIKPVVSTQLLLNGSLAEEEVIIRSENITNNAKNILVQLNTPVQINCTRPNNNTVKSIRIGPGQAFYYTGDIIGDIRQAHCNVSKATWNETLGKVVKQLRKHFGNNTIIRFAQSSGGDLEVTTHSFNCGGEFFYCNTSGLFNSTWISNTSVQGSNSTGSNDSITLPCRIKQIINMWQRIGQAMYAPPIQGVIRCVSNITGLILTRDGGSTNSTTETFRPGGGDMRDNWRSELYKYKVVKIEPLGVAPTRCKRRV 472 T 3.6999999999999997E-54 GP120 pdbpercent T Viruses T 7sq3 1 A A Designed trefoil knot protein, variant 1 GSSMGSDEQRRELEEKIKFKLAELASKSEEERKEIKLRVIAYVLVQLEDLQKNLSDEQRRELEEKIKFKLAELASKSEEERKEIKLRVIAYVLVQLEDLQKNLSDEQRRELEEKIKFKLAELASKSEEERKEIKLRVIAYVLVQLEDLQKNLS 153 T 0.071 CID pdbpercent F T 7sq4 1 A A Designed trefoil knot protein, variant 2 GSSMGSDEQRRELEEKIKWKLAELASKSEEERKEIKLRVIAYVLVQLEDLQKNLSDEQRRELEEKIKWKLAELASKSEEERKEIKLRVIAYVLVQLEDLQKNLSDEQRRELEEKIKWKLAELASKSEEERKEIKLRVIAYVLVQLEDLQKNLS 153 T 0.19 Colicin pdbpssm F T 7sq5 1 A A Designed trefoil knot protein, variant 3 GSSMGSDEQRRELEEKIKLKLEELKTKSEEERKEIKLRVIAYVLVQLEDLQKNLSDEQRRELEEKIKAKLEELKTKSEEERKEIKLRVIAYVLVQLEDLQKNLSDEQRRELEEKIKWKLEELKTKSEEERKEIKLRVIAYVLVQLEDLQKNLS 153 T 0.039 DUF4854 pdbpercent F T 7sqc 14 NC,OC,PC,QC 8A,8B,8C,8D A8IKV8_CHLRE FAP105 MSGVRACLQPNEGPACPIGQTYGEVGGSSPNFRGNFCDAGRKHVSGPASELATQGFSRWEQSSGNRDPPLRRHAEQPASTSYAEPSVMGGHASYYGRRAVGEGEDGTAYRRTMKAVPQPVRQDRPEGKKAIPEPYGAPPAHPRGTRPPPDDVRFREAESAPEPNLPRLGRPDGISGLRESGDQQYQFESSLGRKIRVGQDSYRGAGRAGDRSLVYGAPARTEDDPTYFRSMKDSPTFTRFCNSLPAKPAVSPHQRREEGMRRAAEEERRREAALVSTLDIQGVPDDD 287 T 0.41 BRD4_CDT pdb F Eukaryota T 7sqc 16 AD,BD,XC,YC,ZC A3,A4,A0,A1,A2 FAP219 MDDTGSVIDDLPPPSPNRSLVATPTPAIPGRTGKLDYGLHESAMRVPVLDKVKEARKATIESKPDILGVRGPVWNETVALNPGKHHGKFSHNLLSNTLTPELINSTDVRKLTGTTAARGDPAAATLDRSLSPAAGGPSGWNTSTTLPSSNDRQRQLEAGLNASLAATARRRASPTPHYVDPVARQTAYSETIRAIKANSGADMGELTARYGPDGAEAMAAILAMPAKESRPRIRTTRADLQAVAALDAFSAGRDDAEEEEQGEAVPLSSPMPGAER 276 T 0.075 Lipase_chap pdbpssm F T 7sqk 1 A A HAUS1_HUMAN COILED-COIL DOMAIN-CONTAINING PROTEIN 5,ENHANCER OF INVASION-CLUSTER,HEI-C MEPQEERETQVAAWLKKIFGDHPIPQYEVNPRTTEILHHLSERNRVRDRDVYLVIEDLKQKASEYESEAKYLQDLLMESVNFSPANLSSTGSRYLNALVDSAVALETKDTSLASFIPAVNDLTSDLFRTKSKSEEIKIELEKLEKNLTATLVLEKCLQEDVKKAELHLSTERAKVDNRRQNMDFLKAKSEEFRFGIKAAEEQLSARGMDASLSHQSLVALSEKLARLKQQTIPLKKKLESYLDLMPNPSLAQVKIEEAKRELDSIEAELTRRVDMMEL 278 T 0.00056 DUF3496 pdb F Eukaryota T 7sqq 1 A,B,C,D,E,F,G,H,I,J,K,L,M,N,O,P,Q,R,S,T,U,V,W,X A,B,C,D,E,F,G,H,I,J,K,L,M,N,O,P,Q,R,S,T,V,W,X,Y GP105_BP201 Chimallin SNAMIRDTATNTTQTQAAPQQAPAQQFTQAPQEKPMQSTQSQPTPSYAGTGGINSQFTRSGNVQGGDARASEALTVFTRLKEQAVAQQDLADDFSILRFDRDQHQVGWSSLVIAKQISLNGQPVIAVRPLILPNNSIELPKRKTNIVNGMQTDVIESDIDVGTVFSAQYFNRLSTYVQNTLGKPGAKVVLAGPFPIPADLVLKDSELQLRNLLIKSVNACDDILALHSGERPFTIAGLKGQQGETLAAKVDIRTQPLHDTVGNPIRADIVVTTQRVRRNGQQENEFYETDVKLNQVAMFTNLERTPQAQAQTLFPNQQQVATPAPWVASVVITDVRNADGIQANTPEMYWFALSNAFRSTHGHAWARPFLPMTGVAKDMKDIGALGWMSALRNRIDTKAANFDDAQFGQLMLSQVQPNPVFQIDLNRMGETAQMDSLQLDAAGGPNAQKAAATIIRQINNLGGGGFERFFDHTTQPILERTGQVIDLGNWFDGDEKRDRRDLDNLAALNAAEGNENEFWGFYGAQLNPNLHPDLRNRQSRNYDRQYLGSTVTYTGKAERCTYNAKFIEALDRYLAEAGLQITMDNTSVLNSGQRFMGNSVIGNNMVSGQAQVHSAYAGTQGFNTQYQTGPSSFY 634 T 6.2 TGBp3 pdbhh T Viruses T 7sqt 1 A,B,C,D,E,F,G,H,I,J,K,L,M,N,O,P,Q,R,S,T,U,V,W,X A,B,C,D,E,F,G,H,I,J,K,L,M,N,O,P,Q,R,S,T,V,W,X,Y A0A482GDX1_9CAUD CHIMALLIN SNAMGLDVRNNGNDNVEIRAAETRTAQRADEALETAADFAGQPKVTHTMRTINRTLSRRISRNTGSEQVLNLRRLMEKYLEDTRFKDDFIFVAVDPNQYSVPYPTLVVMSGAKVGDHNHFFGYVLPLVAGLAPLPRREEQGPHGNILVPRTWVDNLNGTFINEVMAAMYAAIGGKSNGTARIAGLAVVTNEITAESAHLATTLLSAADNAIQTAIEIRLGDKLGLPQFNLGMMASDQPISSVQYNTSGMQDSDIVGNPVRSDITVTISNRIRQAMSDYDSQQRLVATTGYIDLTYSPQNPTFNQGPVLVNGYPVPPTVQYQPRYVMTSAYPLELDAFTPNTFVLGLIGTIATLNSGMAWAQSLISNAARGIGPHNPGALAMVLDPEVTAPLDLSTQTNEQIYKFLQQVLYPSLLISIDVPEEGEYSWLLRMIPAAEKIYTGKVEGEVREISEGYKALYRAFDDVTLGCFSKKYQYGLPLVYATGNRIPLGHYNHQDGHRHDIRDMDDLYMMNITNPDTVEAWEDSFDRTDMTMSQRVVARHEIIDRVLSGSWEQTGWAMRYDFDPLALQALIEAAADAGFTIRPENIQHLAGTAVRGNMAARARGLGNISGNIYARSDRPNVGVNNMGGAFNLF 634 T 0.16 PSII_Pbs31 pdbpercent T Viruses T 7st8 3 C S ASTL_HUMAN SAS1B, OOCYTE ASTACIN,OVASTACIN,ZP2-PROTEINASE MGSSHHHHHHSSGLVPRGSHMASGPRPRGRGSHAHSTGRSPAPASLSLQRLLEALSAESRSPDPSGSSAGGQPVPAGPGESPHGWESPALKKLSAEASARQPQTLASSPRSRPGAGAPGVAQEQSWLAGVSTKPTVPSSEAGIQPVPVQGSPALPGGCVPRNHFKGMSED 170 T 240 GP63 pdbhh F Eukaryota T 7st9 7 G G DDC1_YEAST DNA damage checkpoint protein 1 MDYKDDDDKDYKDDDDKDYKDDDDKLEVLFQGPGMSFKATITESGKQNIWFRAIYVLSTIQDDIKITVTTNELIAWSMNETDTTLCQVRFQKSFFEEYEFKPHEIVFGENGVQVIEDTYGNSHKLYSFRVNGRHLTTISRKPDGDGIKSFTIAVNNTSTCPESLANRLIVVIEMDSLIVKEYCPQFQPIKYDPIIINLKYKRRFLDVFGTAASDRNPQEPLDPKLLDVFTNTERELTSALFNEEVESDIRKRNQLTAADEINYICCNSTLLKNFLDNCNVNVTDEVKLEINVHRLSITAFTKAVYGKNNDLLRNALSMSNTISTLDLEHYCLFTTIEDEKQDKRSHSKRREHMKSIIFKLKDFKNFITIGPSWKTTQDGNDNISLWFCHPGDPILMQMQKPGVKLELVEVTDSNINDDILEGKFIKTAISGSKEEAGLKDNKESCESPLKSKTALKRENLPHSVAGTRNSPLKVSYLTPDNGSTVAKTYRNNTARKLFVEEQSQSTNYEQDKRFRQASSVHMNMNREQSFDIGTTHEVACPRNESNSLKRSIADICNETEDPTQQSTFAKRADTTVTWGKALPAADDEVSCSNIDRKGMLKKEKLKHMQGLLNSQNDTSNHKKQDNKEMEDGLGLTQVEKPRGIFD 646 T 2.8E-09 Rad9 pdbpssm F Eukaryota T 7su9 5 E C KRAS-G12D-9mer with A18L substitution GADGVGKSL 9 T 4.8E-05 Thymidylate_kin pdbhh F T 7sua 1 A A A0A237U7Y1_ACIBA DUF4175 domain-containing protein SNARAIARPTRSEYLARINEENRLKHEIQELTQALALEKQNTVTLVAQAQQQAKAKPIVRSQPEKSLESTDQNTLALNIQFYDPKQLLSSVNQSVSVPYFKLCQLFLNKSIELCTKHYHLKATDIDVVDEFHAEGATLAISTSHPHAVECLLMVGTVFQLLSDVLYKRYREDKRFALQTRSAVCNAVEAMQIDAKEAAQRLAQHLHAKESALYLDNEQLKAIQDSYQLVAMPNPSNVMTRHAFMINGMNAECAELAQNIRTEILMGKKSIPQNDSPSSAAS 281 T 0.0013 DUF4175 unp F Bacteria T 7suk 29 CA LV NOL10_YEAST ESSENTIAL NUCLEAR PROTEIN 2 VLKSTSANDVSVYQVSGTNVSRSLPDWIAKKRKRQLKNDLEYQNRVELIQDFEFSEASNKIKVSRDGQYCMATGTYKPQIHVYDFANLSLKFDRHTDAENVDFTILSDDWTKSVHLQNDRSIQFQNKGGLHYTTRIPKFGRSLVYNKVNCDLYVGASGNELYRLNLEKGRFLNPFKLDTEGVNHVSINEVNGLLAAGTETNVVEFWDPRSRSRVSKLYLENNIDNRPFQVTTTSFRNDGLTFACGTSNGYSYIYDLRTSEPSIIKDQGYGFDIKKIIWLDNVGTENKIVTCDKRIAKIWDRLDGKAYASMEPSVDINDIEHVPGTGMFFTANESIPMHTYYIPSLGPSPRWCSFLDSITEEL 362 T 0.00022 ANAPC4_WD40 unppercent F Eukaryota T 7suk 46 WA SS UTP14_YEAST U3 small nucleolar RNA-associated protein 14 QRIQQRHDRKAAYEISRQEVSKWNDIVQQNRRADHLIFPLNKPTEHNHASAFTRTQDVPQTELQEKVDQVLQESNXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXNVIINEKVNKKNLKYQSSAVPFPFENREQYERSLRMPIGQEWTSRASHQELIKPRIMTKPGQVIDPLKAP 197 T 2.1E-40 Utp14 pdbpercent F Eukaryota T 7suk 47 XA ST NOP14_YEAST Nucleolar complex protein 14 MAGSQLKNLKAALKARGLTGQTNVKSKNKKNSKRQAKEYDREEKKKAIAEIREEFNPFEIKAARNKRRDGLPSKTADRIAVGKPGISKQIGEEQRKRAFEARKMMKNKRGGVIDKRFGERDKLLTEEEKMLERFTRERQSQSKRNANLFNLEDDEDDGDMFGDGLTHLGQSLSLEDELANDEEDFLASKRFNEDDAELQQPQRKKTKAEVMKEVIAKSKFYKQERQKAQGIMEDQIDNLDDNFEDVMSELMMTQPKKNPMEPKTDLDKEYDIKVKEXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXKEKLGKFTAVLLRHIIFLSNQNYLKNVQSFKRTQNALISILKSLSEKYNRELSEECRDYINEMQARYKKNHFDALSNGDLVFFSIIGILFSTSDQYHLVITPALILMSQFLEQIKFNSLKRIAFGAVLVRIVSQYQRISKRYIPEVVYFFQKILLTFIVEKENQEKPLDFENIRLDSYELGLPLDVDFTKKRSTIIPLHTLSTMDTEAHPVDQCVSVLLNVMESLDATISTVWKSLPAFNEIILPIQQLLSAYTSKYSDFEKPRNILNKVEKLTKFTEHIPLALQNHKPVSIPTHAPKYEENFNPDKKSYDPDRTRSEINKMKAQLKKERKFTMKEIRKDAKFEARQRIEEKNKESSDYHAKMAHIVNTINTEEGAEKNKYERERKLR 806 T 1.7E-134 Nop14 pdbpssm F Eukaryota T 7suo 2 C,D C,D NCAP_SARS2 Nucleoprotein APRITFGGPSD 11 T 7 Tymo_coat pdbhh T Viruses T 7svu 4 D,F,H,J,L,N,Q,T,V a,b,c,d,e,f,h,j,k A0A979HMQ2_9CYAN TnsB-CTD IEVWDYEQLREEYGF 15 T 1.1 ODC_AZ pdbhh F Bacteria T 7sx3 2 B B NALF1_HUMAN Transmembrane protein FAM155A MTRGAWMCRQYDDGLKIWLAAPRENEKPFIDSERAQKWRLSLASLLFFTVLLSDHLWFCAEAKLTRARDKEHQQQQRQQQQQQQQQRQRQQQQQQRRQQEPSWPALLASMGESSPAAQAHRLLSASSSPTLPPSPGDGGGGGGKGNRGKDDRGKALFLGNSAKPVWRLETCYPQGASSGQCFTVENADAVCARNWSRGAAGGDGQEVRSKHPTPLWNLSDFYLSFCNSYTLWELFSGLSSPNTLNCSLDVVLKEGGEMTTCRQCVEAYQDYDHHAQEKYEEFESVLHKYLQSEEYSVKSCPEDCKIVYKAWLCSQYFEVTQFNCRKTIPCKQYCLEVQTRCPFILPDNDEVIYGGLSSFICTGLYETFLTNDEPECCDVRREEKSNNPSKGTVEKSGSCHRTSLTVSSATRLCNSRLKLCVLVLILLHTVLTASAAQNTAGLSFGGINTLEENSTNEEGGSGGSDYKDDDDKGNSDYKDDDDK 483 F F Eukaryota T 7sxb 1 A A A0A2D1LW19_HELBK Transforming growth factor mimic GSGTGCPPLPDDGIVFYEYYGYAGDRHTVGPVVTKDSSGNYPSPTHARRRCRALSQEADPGEFVAICYKSGTTGESHWEYYKNIGKCPDP 90 T 32 DUF5678 pdbhh F Eukaryota T 7sxf 2 B B Axin peptide LLPQKFAEELIHRLEAV 17 T 4.5 CAP_N pdbhh F T 7sxg 2 B B Axin peptide LLPQKFAEELIHRLEAVQ 18 T 4.4 CAP_N pdbhh F T 7sxh 2 B B axin peptide PQKFAEELIHRLEAVQ 16 T 3 CAP_N pdbhh F T 7sxi 1 A A SDS3_MOUSE SUPPRESSOR OF DEFECTIVE SILENCING 3 PROTEIN HOMOLOG SNAQRFEARIEDGKLYYDKRWYHKSQAIYLESKDNQKLSCVISSVGANEIWVRKTSDSTKMRIYVGQLQRGLFVIRRRS 79 T 0.044 Fascin pdb F Eukaryota T 7sxj 2 B B axin peptide EPQKFAEELIHRLEAVQ 17 T 6.9 DUF1690 pdbhh F T 7sxk 1 A,B,C,D,E,F,G,H,I,J,K,L b,a,l,k,j,i,h,c,d,e,f,g Q8H9R8_9CAUD Portal protein MAKRRKIKPMDDEQVLRHLDQLVNDALDFNSSELSKQRSEALKYYFGEPFGNERPGKSGIVSRDVQETVDWIMPSLMKVFTSGGQVVKYEPDTAEDVEQAEQETEYVNYLFMRKNEGFKVMFDWFQDTLMMKTGVVKVYVEEVLKPTFERFSGLSEDMVADILSDPDTSILAQSVDDDGTYTIKIRKDKKKREIKVLCVKPENFLVDRLATCIDDARFLCHREKYTVSDLRLLGVPEDVIEELPYDEYEFSDSQPERLVRDNFDMTGQLQYNSGDDAEANREVWASECYTLLDVDGDGISELRRILYVGDYIISNEPWDCRPFADLNAYRIAHKFHGMSVYDKIRDIQEIRSVLMRNIMDNIYRTNQGRSVVLDGQVNLEDLLTNEAAGIVRVKSMNSITPLETPQLSGEVYGMLDRLEADRGKRTGITDRTRGLDQNTLHSNQAAMSVNQLMTAAEQQIDLIARMFAETGVKRLFQLLHDHAIKYQNQEEVFQLRGKWVAVNPANWRERSDLTVTVGIGNMNKDQQMLHLMRIWEMAQAVVGGGGLGVLVSEQNLYNILKEVTENAGYKDPDRFWTNPNSPEALQAKAIREQKEAQPKPEDIKAQADAQRAQSDALAKQAEAQMKQVEAQIRLAEIELKKQEAVLQQREMALKEAELQLERDRFTWERARNEAEYHLEATQARAAYIGDGKVPETKKPTKAVRR 705 T 0.044 Sema4F_C pdbpercent T Viruses T 7sxn 1 A AAA Orb2A residues 1-9 MYNKFVNFI MYNKFVNFI 9 T 0.12 DUF5505 pdbhh F T 7szi 2 D D A0A2X1SF68_KLEPN TraN CSGGQNTHC 9 T 1.9 DUF220 pdbhh F Bacteria T 7t0l 3 C,F C,F PHE-ARG-TYR-ASN-GLY-LEU-ILE-HIS-ARG peptide FRYNGLIHR 9 T 0.8 Ribosomal_L28e pdbhh F T 7t0y 2 B,D B,D RRP1B_HUMAN RRP1-LIKE PROTEIN B GKKVTFGLNRNMTAEFKKTDKSILVSPTGPSRVAFDPEQKPLHGVLK 47 T 30 PP1_bind pdbhh F Eukaryota T 7t1n 2 B B HEXIM Arginine Rich Motif GISYGRQLGKKKHRRRAHQ 19 T 0.02 Tat pdb F T 7t26 1 A A ACB1_BPFBB Acb1 SGLYVAAKFSESTLDALEELQRSLKLPNPVPRDKLHTTIVYSRVNVPYKVASGSFEIADKGKLTVFETQSGNRALVLEMDSDYLSARHSYAKALGASYDYPDYRPHITLSYNIGVLNFSGEYKVPVVLDREYSEELDLEWSDKD 144 T 0.00041 2_5_RNA_ligase2 pdbhh T Viruses T 7t2f 1 A,B A,B HEEH mini protein HEEH_TK_rd5_0341 SGLVPRGSHMDLEELEEDLKQALREGRKVNILGIEVTTEEQARRLIEFLRRFI 53 T 0.015 SepF pdb F T 7t2r 4 D,I D,I I4BYB2_ACEMN COENZYME F420-REDUCING HYDROGENASE, ALPHA SUBUNIT MTEVFKLEINPVTRIEGHGKITVMLDESGHVRETRFHVTQYRGFEVFTHGRDFREMPVITPRICGICPVSHHLASAKACDEILGVTITPAAHKLRELMHMGQIVQSHALSFFHLSSPDILWGFDAPVKIRNVAGLVDRYPELAKKGIMLRKFGQEIIKTLGGKKIHPWHSIPGGVNRSLTPQERDAIAAQLPEMKSIAMEAIKLIKDYLQEGGEELKEFATLDTAYMGLVRDGYLELYDGEVRIKAPRGRILDQFDPKDYLDHIGEHVEPWSYLKFPFYKALGFPHGSYRVGPLARLNAADAVSTPEASKEFALYKEMGEDGIVPYTLYYHYARLIEALYGLERIEQLLADPDITSSDLRVTSKEINPEGIGVIEAPRGTLIHHYQVNESGVITKVNLIVATGHNNFAMNKGVEMVAKKYITGTNVPEGVFNRLEHVIRAYDPCLSCSTHAVGKMPLKLELVGPTGEILKEVTRD 475 T 6.4E-19 NiFeSe_Hases pdb F Bacteria T 7t2u 2 C,E E,F NEMO_HUMAN NF-KAPPA-B ESSENTIAL MODULATOR,NEMO,FIP-3,IKB KINASE-ASSOCIATED PROTEIN 1,IKKAP1,INHIBITOR OF NUCLEAR FACTOR KAPPA-B KINASE SUBUNIT GAMMA,I-KAPPA-B KINASE SUBUNIT GAMMA,IKK-GAMMA,IKKG,IKB KINASE SUBUNIT GAMMA,NF-KAPPA-B ESSENTIAL MODIFIER KLAQLQVAYH 10 T 0.00027 Tropomyosin unppercent F Eukaryota T 7t4q 3 C C UL128_HCMVM UL128 MSPKNLTPFLTALWLLLGHSRVPRVRAEECCEFINVNHPPERCYDFKMCNRFTVALRCPDGEVCYSPEKTAEIRGIVTTMTHSLTRQVVHNKLTSCNYNPLYLEADGRIRCGKVNDKAQYLLGAAGSVPYRWINLEYDKITRIVGLDQYLESVKKHKRLDVCRAKMGYMLQ 171 T 11 SH3_19 pdbhh T Viruses T 7t5m 3 E,F E,F I27RA_HUMAN IL-27 RECEPTOR SUBUNIT ALPHA,IL-27R SUBUNIT ALPHA,IL-27R-ALPHA,IL-27RA,CYTOKINE RECEPTOR WSX-1,CYTOKINE RECEPTOR-LIKE 1,TYPE I T-CELL CYTOKINE RECEPTOR,TCCR,ZCYTOR1 FLPTPEELGLLGPPRPQVLA 20 T 1.9 COX5A pdbhh F Eukaryota T 7t5p 1 A A SIMC1_HUMAN PLATFORM ELEMENT FOR INHIBITION OF AUTOLYTIC DEGRADATION AYLQDMPRSPGDVPQSPSDVSPSPDAPQSPGGMPHLPGDVLHSPGDMPHSSGDVTHSPRDIPHLPGDRPDFTQNDVQNRDMPMDISALSSPSCSPRPQSETPLEKVPWLSVMETPARKEISLSEPAKPGSAHVQSRTPQGGLYNRPCLHRLKYFLRPPVHHLFFQTLIPDKDTRENKGQRLEPIPHRRLRMVTNTIEENFPLGTVQFLMDFVSPQHYPPREIVAHIIQKILLSGSETVDVLKEAYMLLMKIQQLHPANAKTVEWDWKLLTYVMEEEGQTLPGRVLFLRYVVQTLEDDFQQTLRRQRQHLQQSIANMVLSCDKQPHNVRDVIKWLVKAVTEDGLTQPPNGNQTSSGTGILKASSSHPSSQPNLTKNTNQLIVCQLQRMLSIAVEVDRTPTCSSNKIAEMMFGFVLDIPERSQREMFFTTMESHLLRCKVLEIIFLHSCETPTRLPLSLAQALYFLNNSTSLLKCQSDKSQWQTWDELVERLQFLLSSYQHVLREHLRSSVIDRKDLIIKRIKPKPQQGDDITVVDVEKQIEAFRSRLIQMLGEPLVPQLQDKVHLLKLLLFYAADLNPDAEPFQKGWSGS 589 T 0.06 Anticodon_2 unppssm F Eukaryota T 7t5v 1 A A A0A1X1LKI5_ECOLX HELIX-TURN-HELIX TRANSCRIPTIONAL REGULATOR,TRANSCRIPTIONAL REGULATOR,XRE FAMILY TRANSCRIPTIONAL REGULATOR SNADDLREPEERHLDDAFFRGYKNLEPEAKAQLRKMLDTFKKDF 44 T 0.012 Metal_resist pdbpssm F Bacteria T 7t5w 1 A,B,C,D A,B,C,D A0A1X1LKI5_ECOLX HELIX-TURN-HELIX TRANSCRIPTIONAL REGULATOR,TRANSCRIPTIONAL REGULATOR,XRE FAMILY TRANSCRIPTIONAL REGULATOR SNADDLREPEERHLDDAFFRGYKNLEPEAKAQLRKILDTFKKDF 44 T 0.013 Metal_resist pdbpssm F Bacteria T 7t69 1 A A Q709D8_FUSOX SECRETED IN XYLEM 1 PROTEIN GPMQEAAVREPQIFFNLTYTEYLDKVAASHGSPPDKSDLPWNDTMGSFPGNETDDGVQTETGSSLSRRGHIVNLRKREPFGEESRNDRVTQDMLQALHDLCVERFGTGYRAVSGLCYTDRRATRKIECNKPSVRERDRSVTRACPKGQECTTFNAYNFRNRHHQVTFPVCGPRIEVKDRHDIGIHTEWQGTWYPESPKSPGTYDYFAQMAGTLNGYFGYDGVYSDGYKTSSHGYGHSWSCINCPRGKVTITNTYRATWAFGYTSPHS 267 T 0.28 DAP_epimerase pdb F Eukaryota T 7t6a 1 A,B A,B Q2A0P0_FUSOX SECRETED IN XYLEM 4 PROTEIN SAHTESVCVHAGTATGADLHWLNAICTGKSTYTVNCAPAGNKNAGSTHTGTCPAGQDCFQLEQVGNFWGDREPDATCSPSNTVFDAVDDKEATHVNGKVVTRAGKPGIGRKLIRLKAQVYRRDGHYGQTSRMGFFRNGKEVYHIDNVASMEPTWNFDPSSDQSFSFFFTPGPNAFRIQGTLNLAS 185 T 0.32 Phage_CI_repr unppercent F Eukaryota T 7t6a 2 C,D C,D Q2A0P0_FUSOX SECRETED IN XYLEM 4 PROTEIN GPMLPKGEEGDIIGTFNFSSSDSQPLKIHWVDTPDSSGSNLVPR 44 T 0.32 Phage_CI_repr unppercent F Eukaryota T 7t6g 1 A A B1Q143_ANCCA Truncated Ac-AIP-2 TPEEHDLLMDLMGDPKKAEE 20 T 1.1 B3GALT2_N pdbhh F Eukaryota T 7t6u 5 E L Synthetic peptide QKFTSWFX 8 T 1.7 DUF4518 pdbhh F T 7t70 2 C,D C,D R1AB_SARS2 Nonstructural protein 4/5 TSAVLQSGFRKM 12 T 8.5 IQ pdbhh T Viruses T 7t73 3 C,E,F A,C,E HIV Envelope ApexGT2.2MUT gp120 MGILPSPGMPALLSLVSLLSVLLMGCVAETGAENLWVTVYYGVPVWKDAETTLFCASDAKAYETEKHNVWATHACVPTDPNPQEIHLENVTEEFNMWKNNMVEQMHEDIISLWDQSLKPCVKLTPLCVTLQCTNVTNNITDDMRGELKNCSFNATTELRNKRQKVYSLFYRLDIVPMGENSANYRLIDCNTSAITQACPKVSFEPIPIHYCAPAGFAILKCKDKKFNGTGPCPSVSTVQCTHGIKPVVSTQLLLNGSLAEEEVIIRSENITNNAKNILVQLNTPVQINCTRPNNNTVKSIRIGPGQAFYYTGDIIGDIRQAHCNVSKATWNETLGKVVKQLRKHFGNNTIIRFAQSSGGDLEVTTHSFNCGGEFFYCNTSGLFNSTWISNTSVQGSNSTGSNDSITLPCRIKQIINMWQRIGQAMYAPPIQGVIRCVSNITGLILTRDGGSTNSTTETFRPGGGDMRDNWRSELYKYKVVKIEPLGVAPTRCKRRVVGRRRRRR 504 T 3.9999999999999995E-54 GP120 pdbpercent F T 7t74 1 A,G,K A,C,E HIV Envelope ApexGT2 gp120 MGILPSPGMPALLSLVSLLSVLLMGCVAETGAENLWVTVYYGVPVWKDAETTLFCASDAKAYETEKHNVWATHACVPTDPNPQEIHLENVTEEFNMWKNNMVEQMHEDIISLWDQSLKPCVKLTPLCVTLQCTNVTNNITDDMRGELKNCSFNATTELRNKRQKVYSLFYRLDIVPMGENSTNYRLINCNTSAITQACPKVSFEPIPIHYCAPAGFAILKCKDKKFNGTGPCPSVSTVQCTHGIKPVVSTQLLLNGSLAEEEVIIRSENITNNAKNILVQLNTPVQINCTRPNNNTVKSIRIGPGQAFYYTGDIIGDIRQAHCNVSKATWNETLGKVVKQLRKHFGNNTIIRFAQSSGGDLEVTTHSFNCGGEFFYCNTSGLFNSTWISNTSVQGSNSTGSNDSITLPCRIKQIINMWQRIGQAMYAPPIQGVIRCVSNITGLILTRDGGSTNSTTETFRPGGGDMRDNWRSELYKYKVVKIEPLGVAPTRCKRRVVGRRRRRR 504 T 3.9E-54 GP120 pdbpercent F T 7t76 1 A,B,D A,C,E HIV Envelope ApexGT3 gp120 MGILPSPGMPALLSLVSLLSVLLMGCVAETGAENLWVTVYYGVPVWKDAETTLFCASDAKAYETEKHNVWATHACVPTDPNPQEIHLENVTEEFNMWKNNMVEQMHEDIISLWDQSLKPCVKLTPLCVTLQCTNVTNNITDDMRGELKNCSFNATTELRNKRVKRYSLFYRLDIVQIDSNRAKSHYRLINCNTSAITQACPKVSFEPIPIHYCAPAGFAILKCKDKKFNGTGPCPSVSTVQCTHGIKPVVSTQLLLNGSLAEEEVIIRSENITNNAKNILVQLNTPVQINCTRPNNNTVKSIRIGPGQAFYYTGDIIGDIRQAHCNVSKATWNETLGKVVKQLRKHFGNNTIIRFAQSSGGDLEVTTHSFNCGGEFFYCNTSGLFNSTWISNTSVQGSNSTGSNDSITLPCRIKQIINMWQRIGQAMYAPPIQGVIRCVSNITGLILTRDGGSTNSTTETFRPGGGDMRDNWRSELYKYKVVKIEPLGVAPTRCKRRVVGRRRRRR 506 T 3.9E-54 GP120 pdbpercent F T 7t77 1 A,C,E A,C,E HIV Envelope ApexGT3.N130 gp120 MGILPSPGMPALLSLVSLLSVLLMGCVAETGAENLWVTVYYGVPVWKDAETTLFCASDAKAYETEKHNVWATHACVPTDPNPQEIHLENVTEEFNMWKNNMVEQMHEDIISLWDQSLKPCVKLTPLCVTLNCTNVTNNITDDMRGELKNCSFNATTELRNKRVKRYSLFYRLDIVQIDSNRTKSHYRLINCNTSAITQACPKVSFEPIPIHYCAPAGFAILKCKDKKFNGTGPCPSVSTVQCTHGIKPVVSTQLLLNGSLAEEEVIIRSENITNNAKNILVQLNTPVQINCTRPNNNTVKSIRIGPGQAFYYTGDIIGDIRQAHCNVSKATWNETLGKVVKQLRKHFGNNTIIRFAQSSGGDLEVTTHSFNCGGEFFYCNTSGLFNSTWISNTSVQGSNSTGSNDSITLPCRIKQIINMWQRIGQAMYAPPIQGVIRCVSNITGLILTRDGGSTNSTTETFRPGGGDMRDNWRSELYKYKVVKIEPLGVAPTRCKRRVVGRRRRRR 506 T 3.9E-54 GP120 pdbpercent F T 7t8m 2 C,D C,D R1AB_SARS2 Nonstructural protein 5/6 GVTFQSAVKR 10 T 6.9 PhnI pdbhh T Viruses T 7t8n 1 A,B AAA,BBB PGAA_ECOLI PGA EXPORT PROTEIN,POLY-BETA-1,6-GLCNAC EXPORT PROTEIN DANLTPDIRADIHAELVRLSFMPTRSESERYAIADRALAQYAALEILWHDNPDRTAQYQRIQVDHLGALLTRDRYKDVISHYQRLKKTGQIIPPWGQYWVASAYLKDHQPKKAQSIMTELFYHKETIAPDLSDEELADLFYSHLESEN 148 T 0.029 TPR_19 unppercent F Bacteria T 7t8r 2 B B R1AB_SARS2 Nonstructural protein 7/8 NRATLQAI 8 T 23 CDC4_D pdbhh T Viruses T 7t9a 1 A,E,I A,C,E HIV Envelope ApexGT2 gp120 AENLWVTVYYGVPVWKDAETTLFCASDAKAYETEKHNVWATHACVPTDPNPQEIHLENVTEEFNMWKNNMVEQMHEDIISLWDQSLKPCVKLTPLCVTLQCTNVTNNITDDMRGELKNCSFNATTELRNKRQKVYSLFYRLDIVPMGENSTNYRLINCNTSAITQACPKVSFEPIPIHYCAPAGFAILKCKDKKFNGTGPCPSVSTVQCTHGIKPVVSTQLLLNGSLAEEEVIIRSENITNNAKNILVQLNTPVQINCTRPNNNTVKSIRIGPGQAFYYTGDIIGDIRQAHCNVSKATWNETLGKVVKQLRKHFGNNTIIRFAQSSGGDLEVTTHSFNCGGEFFYCNTSGLFNSTWISNTSVQGSNSTGSNDSITLPCRIKQIINMWQRIGQAMYAPPIQGVIRCVSNITGLILTRDGGSTNSTTETFRPGGGDMRDNWRSELYKYKVVKIEPLGVAPTRCKRRVVGRRRRRR 473 T 3.5E-54 GP120 pdbpercent F T 7t9b 1 A,E,I A,C,E HIV-1 Envelope ApexGT5 gp120 AENLWVTVYYGVPVWKDAETTLFCASDAKAYETEKHNVWATHACVPTDPNPQEIHLENVTEEFNMWKNNMVEQMHEDIISLWDQSLKPCVKLTPLCVTLQCTNVTNNITDDMRGELKNCSFNATTELRNKRQKVYSLFYRLDIVPMVDLWTNYRLINCNTSAITQACPKVSFEPIPIHYCAPAGFAILKCKDKKFNGTGPCPSVSTVQCTHGIKPVVSTQLLLNGSLAEEEVIIRSENITNNAKNILVQLNTPVQINCTRPNNNTVKSIRIGPGQAFYYTGDIIGDIRQAHCNVSKATWNETLGKVVKQLRKHFGNNTIIRFAQSSGGDLEVTTHSFNCGGEFFYCNTSGLFNSTWISNTSVQGSNSTGSNDSITLPCRIKQIINMWQRIGQAMYAPPIQGVIRCVSNITGLILTRDGGSTNSTTETFRPGGGDMRDNWRSELYKYKVVKIEPLGVAPTRCKRRVVGRRRRRR 473 T 3.5E-54 GP120 pdbpercent F T 7t9i 5 E X GNAS2_HUMAN ADENYLATE CYCLASE-STIMULATING G ALPHA PROTEIN GGSLEVLFQGPSGNSKTEDQRNEEKAQREANKKIEKQLQKDKQVYRATHRLLLLGADNSGKSTIVKQMRILHGGSGGSGGTSGIFETKFQVDKVNFHMFDVGGQRDERRKWIQCFNDVTAIIFVVDSSDYNRLQEALNLFKSIWNNRWLRTISVILFLNKQDLLAEKVLAGKSKLEDYFPEFARYTTPEDATPEPGEDPRVTRAKYFIRDEFLRISTASGDGRHYCYPHFTCAVDTENARRIFNDCRDIIQRMHLRQYELL 261 T 5E-10 G-alpha pdb F Eukaryota T 7t9w 1 A,B,C,D,E,F,G,H,I,J,K,L,M,N,O,P A,B,C,D,E,F,G,H,I,J,K,L,M,N,O,P R1AB_SARS2 NON-STRUCTURAL PROTEIN 3,NSP3,PL2-PRO,PAPAIN-LIKE PROTEINASE,PL-PRO SEEVVENPTIQKDVLECNVKTTEVVGDIILKPANNSLKITEEVGHTDLMAAYVDNSSLTIKKPNELSRVLGLKTLATHGLAAVNSVPWDTIANYAKPFLNKVVST 105 T 33 PHB_acc_N pdbhh T Viruses T 7t9y 2 C,D C,D R1AB_SARS2 Nonstructural protein 8/9 SAVKLQNNEL 10 T 7.9 Phospho_p8 pdbhh T Viruses T 7ta3 1 A,C A,C Alpha-peptide-3 ECGWRIGEAGTDPNLNHQQFRAKILSIWEECX 32 T 3 CTK3 pdbhh F T 7ta4 2 C,D C,D R1AB_SARS2 Nonstructural protein 9/10 ATVRLQAGNA 10 T 1.4 CoV_NSP9 pdbhh T Viruses T 7ta6 2 I,J,K,L,M,N,O,P I,J,K,L,M,N,O,P Alpha/Beta-peptide-1 XLGWCIGEXGTDPNLNHXQFRXKILXCWX 29 T 1.5 Poty_coat pdbhh F T 7ta7 2 C,D C,D R1AB_SARS2 Nonstructural protein 10/11 REPMLQSADAQ 11 T 60 LD_cluster3 pdbhh T Viruses T 7tb1 2 C,D C,D ALA-CYS-SER-SER-ILE-TRP-CYS-PRO-ASP-GLY ACSSIWCPDG 10 T 0.52 CENP-U pdbhh F T 7tb2 2 B B R1AB_SARS2 Nonstructural protein 12/13 PHTVLQAV 8 T 11 ATXN-1_C pdbhh T Viruses T 7tb9 1 A A CEMP1_HUMAN CEMP1-p1 MGTSSTDSQQAGHRRCSTSN 20 T 12 DUF983 pdbhh F Eukaryota T 7tbi 2 D,E,F,G B1,B2,B3,B4 Nup53/Nup59 R3 RKAKLLPMEEALLP 14 T 9.2 POX pdbhh F T 7tbi 3 H,I,TB,UB C2,C3,C1,C4 Nup145N/Nup100/Nup116 R3 KLVINKDMRTDLFSPPN 17 T 4.6 DUF4616 pdbhh F T 7tbi 5 N,O,P,Q E1,E2,E3,E4 Nup53/Nup59 R2 DPTIAAADKIFSNWLASQ 18 T 1.5 DUF3986 pdbhh F T 7tbi 7 T,U G1,G2 Nic96 R2 DVDTYLSNLQTKTTLSMIADGLERSARDFDAFLEENVTLEWEAQRKRIYQHFG 53 T 0.16 X pdbpssm F T 7tbi 8 V,W H1,H2 Nup145N/Nup100/Nup116 R2 EDSILQPGAFSAN 13 T 1 Hydrolase_2 pdbhh F T 7tbi 11 BA,CA K1,K2 Nup145N/Nup100/Nup116 R1 ILPMYKLSP 9 T 2.6 AIMP2_LysRS_bd pdbhh F T 7tbi 14 JA,KA,LA,MA N1,N2,N3,N4 Nup57 SEALQQEIAKIDEEIQKCIRDKEAVDAFLPAHGEQLAAIPTDVNFVTRKSEGAHNALSSDILAIDQLRELVKQDADNARLSFKAIDNLKLPMQYHQAGLWSKQMGGAGTAGASGASADADGQSNADLISYFSKTADEMEEMMKKFEKTITEIEAHLTGVEAHAMAMQNVAAQSRNAAQGGVDERVYELAAVLREFEESILKVAGVVGGVKEGVTELQLRDFM 222 T 0.0012 AAA_13 pdbpssm F T 7tbi 16 RA,SA,TA,UA P1,P2,P3,P4 Nic96 R1 ALFDSLLARNKKQAEGETALGELPSLQLGLADLRQRLRKL 40 T 7.9 TACI-CRD2 pdbhh F T 7tbi 26 LB,MB Y1,Y2 Nup159 YAESGIQTDLSESSKENEVQTDAIPVKHNSTQTVKKEAVDNGLQTEPVETCNFSVQTFEGDENYLAEQCKPKQLKEYYTSAKVSNIPFVSQNSTLRLIESTFQTVEAEFTVLMENIRNMDTFFTDQSSIPLVKRTVRSINNLYTWRIPEAEILLNIQNNIKCEQMQITNANIQDLKEKVTDYVRKDIAQITEDVANAKEEYLFLMHFDDASSGYVKDLSTHQFRMQKTLRQKLFDVSAKINHTEELLNILKLFTVKNKRLDDNPLVAKLAKESLARDGLLKEIKLLREQVSRLQLEEKGKKASSFDASSSITKDMKGFKVVEVGLAMNTKKQIGDFFKNL 340 T 0.025 Lzipper-MIP1 pdbpercent F T 7tbj 5 M,N,O,P,Q,R C1,C2,C3,C4,C5,C6 NUP98 R3 HKKLVINKDMRTDLFSPPN 19 T 5.9 MethyTransf_Reg pdbhh F T 7tbj 7 AA,BA,CA,DA,EA,FA,Z E2,E3,E4,E5,E6,E7,E1 NUP53 R2 APPVRSIY 8 T 5.2 DUF502 pdbhh F T 7tbj 18 SB,TB,UB,VB P1,P2,P3,P4 NUP54 Ferrodoxin-like domain DEDGLISLIFNKKESDIRGQQQQLVESLHKVLGGHQTLTVNVEGVKTKADNQTEVIIYVVERSPNGTSRRVGASALFSYFEQAHIKANMQSLGVTGAMAQTELSPVQIKQLIQNPL 116 T 0.018 Glutaminase pdb F T 7tbj 30 OD,PD,QD,RD b1,b2,b3,b4 NUP37 SNQYQLPLNVRPYTTTWCSQSPSCSNLLAIGHDTGITIYCASEEQTPGSTGLTLQELFTIQTGLPTLHLSFSSSCSYSENLHDGDGNVNSSPVYSLFLACVCQDNTVRLIITKNETIITQHVLGGKSGHHNFVNDIDIADVYSADNRLAEQVIASVGDDCTLIIWRLTDEGPILAGYPLSSPGISVQFRPSNPNQLIVGERNGNIRIFDWTLNLSAEENSQTELVKNPWLLTLNTLPLVNTCHSSGIASSLANVRWIGSDGSGILAMCKSGAWLRWNLFANNDYNEISDSTMKLGPKNLLPNVQGISLFPSLLGACPHPRYMDYFATAHSQHGLIQLINTYEKDSNSIPIQLGMPIVDFCWHQDGSHLAIATEGSVLLTRLMGFT 385 T 0.00016 WD40 pdbpssm F T 7tbl 34 AE f NUP42 IIATDNVLFTPRDKLTVEELEQFQSKKFTLGKIPLKPPPLELLNV 45 T 1.8 Arcadin_1 pdbhh F T 7tbl 40 GE k2 NUP62 CCS2 QHADEEREKTYKLAENIDAQLKRMAQDLKEVIEHLNT 37 T 0.006 Tektin pdbpercent F T 7tbt 2 B B R1AB_SARS2 Nonstructural protein 13/14 NVATLQAENV 10 T 34 Shugoshin_N pdbhh T Viruses T 7tc4 2 C,D C,D R1AB_SARS2 Nonstructural protein 15/16 FYPKLQSSQ 9 T 9.2 HSR pdbhh T Viruses T 7tcr 2 C,D C,D A0A2D2CY73_METTR Methanobactin biosynthesis cassette protein MbnC MSLLPTAPVRIDADLYDDLANPARQSLYPRDSRGFIRIDISLRAYWHTLFDTCPRLLELSGPSGGAIFLPFMAWARENNLAFDWSFFLWVYVWLQQSEFRERLDEDQLLPVMTASATRWLMIDRDIDACQIVLGSRSLAGAAVVGAKIDSIHCRLEQVQQVAFAAPLPLPDGEFGYFLTPGFEIDHFPGWRPLPR 195 T 34 IgG_binding_B pdbhh F Bacteria T 7tdr 1 A A Q5MIU2_AEDAL 34k2 salivary protein GNPTPKSCTVSEEDLTTIRNAIQKASRASLDDVNLDEDLIAKCPLLKTITASLKSVASEIATLKDTGISEEQVDELKQSYEQQVNEIVKSRDIFEKQSGGDVMKEQGAMINRMTELQVQVAQLQQQIGEQTSRMYDDMAELIFQRLAMNSTDSIRNYTAHMMEQKLHTLMTKLETNYRIFLGALRYLDHLGDQPLIDKVFDGILKRLDEMSLETNKERENGKYVLVNLLCWTVNNRFLTEKYRKKQLELFRIALKFYPKTGNKEANEADIRGRQFCDANFPVNVITWFAVSRAAEGWGLRGTLAAA 306 T 0.00044 TINF2_N pdbpssm F Eukaryota T 7tdz 3 D,DA s,S Q9PVZ2_XENLA NUCLEOPORIN CAN MEDDTDLPPERETKDFQFRQLKKVRLFDYPADLPKQRSNLLVISNKYGLLFVGGFMGLKVFHTKDILVTVKPKENANKTVVGPQGIHVPMNSPIHHLALSSDNLTLSVCMTSAEQGSSVSFYDVRTLLNESKQNKMPFASCKLLRDPSSSVTDLQWNPTLPSMVAVCLSDGSISVLQVTDTVSVFANLPATLGVTSVCWSPKGKQLAVGKQNGTVVQYLPSLQEKKVIPCPSFYDSDNPVKVLDVLWLSTYVFTVVYAAADGSLEASPQLVIVTLPKKEDKRAERFLNFTETCYSICSERQHHFFLNYIEDWEILLAASAASVDVGVIARPPDQVGWEQWLLEDSSRAEMPMTENNDDTLPMGVALDYTCQLEVFISESQILPPVPVLLLLSTDGVLCPFHVVNLNQGVKPLTTSPEQLSLDGEREMKVVGGTAVSTPPAPLTSVSAPAPPASAAPRSAAPPPYPFGLSTASSGAPTPVLNPPASLAPAATPTKTTSQPAAAATSIFQPAGPAAGSLQPPSLPAFSFSSANNAANASAPSSFPFGAAMVSSNTAKVSAPPAMSFQPAMGTRPFSLATPVTVQAATAPGFTPTPSTVKVNLKDKFNASDTPPPATISSAAALSFTPTSKPNATVPVKSQPTVIPSQASVQPNRPFAVEAPQAPSSVSIASVQKTVRVNPPATKITPQPQRSVALENQAKVTKESDSILNGIREEIAHFQKELDDLKARTSRACFQVGSEEEKRQLRTESDGLHSFFLEIKETTESLRGEFSAMKIKNLEGFASIEDVQQRNKLKQDPKYLQLLYKKPLDPKSETQMQEIRRLNQYVKNAVQDVNDVLDLEWDQYLEEKQKKKGIIIPERETLFNSLANHQEIINQQRPKLEQLVENLQKLRLYNQISQWNVPDSSTKSFDVELENMQKTLSQTAIDTQTKPQAKLPAKISPVKQSQLRNFLSKRKTPPVRSLAPANLSRSAFLAPSFFEDLDDVSSTSSLSDMADNDNRNPPPKEIERQETPPPESTPVRVPKHAPVARTTSVQPGLGTASLPFQSGLHPATSTPVAPSQSIRVIPQGADSTMLATKTVKHGAPNITAAQKAAVAAMRRQTASQIPAASLTESTLQTVPQVVNVKELKNNGPGPTIPTVIGPTVPQSAAQVIHQVLATVGSVSARQAAPAAPLKNPPASASSIAPQTWQGSAPNKPAAQAIPKSDPSASQAPAPSVSQVNKPVSFSPAAGGFSFSNVTSAPVTSALGSSSAGCAATARDSNQASSYMFGGTGKSLGSEGSFSFASLKPASSSSSSSVVEPTMSKPSVVTAASTTATVTSTTAASSKPGEGLFQGFSGGETLGSFSGLRVGQADEASKVEVAKTPTAAQPVKLPSNPVLFSFAGAPQPAKVGEAPSTTSSTSASLFGNVQLASAGSTASAFTQSGSKPAFTFGIPQSTSTTAGASSAIPASFQSLLVSAAPATTTPSAPINSGLDVKQPIKPLSEPADSSSSQQQTLTTQSAAEQVPTVTPAATTATALPPPVPTIPSTAEAKIEGAAAPAIPASVISSQTVPFTSTVLASQTPLASTPAGGPTSQVPVLVTTAPPVTTESAQTVSLTGQPVAGSSAFAQSTVTAASTPVFGQALASGAAPSPFAQPTSSSVSTSANSSTGFGTSAFGATGGNGGFGQPSFGQAPLWKGPATSQSTLPFSQPTFGTQPAFGQPAASTATSSAGSLFGCTSSASSFSFGQASNTSGTSTSGVLFGQSSAPVFGQSAAFPQAAPAFGSASVSTTTTASFGFGQPAGFASGTSGSLFNPSQSGSTSVFGQQPASSSGGLFGAGSGGASTVGLFSGLGAKPSQEAANKNPFGSPGSSGFGSAGASNSSNLFGNSGAKAFGFGGTSFGDKPSATFSAGGSVASQGFSFNSPTKTGGFGAAPVFGSPPTFGGSPGFGGSPAFGTAAAFSNTLGSTGGKVFGEGTSAATTGGFGFGSNSSTAAFGSLATQNTPTFGSISQQSPGFGGQSSGFSGFGAGPGAAAGNTGGFGFGVSNPTSPGFGCWRS 2037 T 0.00071 NUP214 pdbpercent F Eukaryota T 7tem 1 A,B A,B A0A5P8YGV9_YERPE Putative exported protein YPO2471 SNAMKLLNTLVCIIGLTSFSSSAKLVNAEHLDALYQKVTVANKTELGLIHIYSEFPDYRWVKDPIEGVSAIDDVARAAIFYQRQYQATGSAADLEKVKSLVEFILYQRADNGYFYNFIYPDHSINKEYKTSVAEPNWWTWRALWALTQVYPTLVKTDNALAQRTRETIFATIDVIYKDFNFKQTRGEKEGVAVPEWLPHTAGDQASVLLMALSDAQALEAKPEIEKMMRSLAAGIMLMQVKDTSSPVNGAFLSWQNLWHGYGNSQAYALLVAGNRLGDRDMIKAAFNELDHFHPWLISNGLLNEFTVRQQGEKVTLIEQKKFSQIAYIIRPMVFANIKAWEISRDAVYLERAVDLSLWFFKNNPAQAQMYYPVTGIAFDGIDSATTVNKNSGAESTIEALLTLQLIESIPDAKRMLESALEKRNIKQ 427 T 0.15 Glyco_hydro_127 unppercent F Bacteria T 7tf6 2 D,E,I,J,K,M,P,Q,U,V,W,X D,E,I,J,K,M,P,Q,U,V,W,X Q53687_STAAU GLUTAMINE SYNTHETASE,GLUTAMINE SYNTHETASE REPRESSOR,HTH-TYPE TRANSCRIPTIONAL REGULATOR GLNR,MERR FAMILY TRANSCRIPTIONAL REGULATOR,GLUTAMINE SYNTHETASE REPRESSOR,TRANSCRIPTIONAL REGULATOR (NITROGEN METABOLISM),TRANSCRIPTIONAL REGULATOR,MERR FAMILY,MERR FAMILY PROTEIN, FEMC, FACTOR INVOLVED IN METHICILLIN RESISTANCE PINRGDLSRFI 11 T 0.75 EST1 pdbhh F Bacteria T 7tfa 1 A,C,D,G,H,I,M,N,O,P,U,V A,C,D,G,H,I,M,N,O,P,U,V GlnR C-tail peptide LIQGELSRFF 10 T 3.5 DUF3146 pdbhh F T 7tfc 2 AA,BA,D,E,J,K,L,M,U,V,W,X,Y,Z a,b,D,E,J,K,L,M,U,V,W,X,Y,Z GLNR_BACSU GlnR C-tail peptide TFRQGDMSRF 10 T 0.086 Dfp1_Him1_M unp F Bacteria T 7tfn 1 A,B,C A,B,C Q2N0S6_9HIV1 Envelope glycoprotein BG505 SOSIP.664 - gp120 AENLWVTVYYGVPVWKDAETTLFCASDAKAYETKKHNVWATHACVPTDPNPQEIHLENVTEEFNMWKNNMVEQMHTDIISLWDQSLKPCVKLTPLCVTLQCTNVTNNITDDMRGELKNCSFNMTTELRDKKQKVYSLFYRLDVVQINENQGNRSNNSNKEYRLINCNTSAITQACPKVSFEPIPIHYCAPAGFAILKCKDKKFNGTGPCPSVSTVQCTHGIKPVVSTQLLLNGSLAEEEVMIRSENITNNAKNILVQFNTPVQINCTRPNNNTRKSIRIGPGQAFYATGDIIGDIRQAHCNVSKATWNETLGKVVKQLRKHFGNNTIIRFANSSGGDLEVTTHSFNCGGEFFYCNTSGLFNSTWISNTSVQGSNSTGSNDSITLPCRIKQIINMWQRIGQAMYAPPIQGVIRCVSNITGLILTRDGGSTNSTTETFRPGGGDMRDNWRSELYKYKVVKIEPLGVAPTRCKRRVVGRRRRRR 481 T 3.4E-54 GP120 pdbpercent T Viruses T 7tgg 2 B a Q74D22_GEOSL Geopilin domain 2 protein MKKIITIVAMLLAMQGIAIAAGKIPTTTMGGKDFTFKPSTNVSVSYFTTNGATSTAGTVNTDYAVNTKNSSGNRVFTSTNNTSNIWYIENDAWKGKAVSDSDVTALGTGDVGKSDFSGTEWKSQ 124 T 0.02 Mfp-3 pdbhh F Bacteria T 7tgh 12 L,W 3G,3g Q23F81_TETTS UQCRB MVRLEKILWEQLVNVKAFSRQRVIGAPSKWYNENRTEWFKVAQHNAFNTGFSGVILRALEPLLAKFIYRWRLDIAHQRGLTLEDSLLFMDRELRRCYFFETVARQNLHPYTVLFMKKRRARYYKVERGLRGFYVPDWVRKEAEERQLSETVDNIFNWENFVYREYMSDMTPIGRWTSLSKITPLDMFQYYGLFRNEAWDRFFYNEAFYESYSEKEKQEANGNPFGKFNLQTADGRAQFEKEVNTFIERYPFAVTKPGQKFDFTRFYALEDLANKRDTSKYDPALLESVKNELKQSAALPADNGANKTKKSKPILPDWLQPKFGKAFQA 328 T 0.69 DUF6322 pdb F Eukaryota T 7tgh 13 M,X 3H,3h I7M484_TETTS Transmembrane protein, putative MNVTGAGLTHVKDFHSDEMRVFRGGLRHIADKQGNLIYGSVNSSVRYYHDKMSYERGFIQHSRSPSNQFINFHFMLGGFRTYVLERFFKQVWYRRNIRTFWFPVLISYTSGCITMRMYDNNCYDYFYFSD 130 T 1.6 DUF5320 pdbpssm F Eukaryota T 7tgh 14 N,Y 3I,3i I7MM45_TETTS Transmembrane protein, putative MVYGKLIFNNIKEYTPSWIKTIPYSQVTKPILRKQPQIVGKINADPKVKKFWVFLRENVQYYPFLWQFFILGTSFVWFHVCYDPWLAIYQANNAHRSLETALTKEKAHKKKLAEQEESE 119 T 2 Selenoprotein_S pdbhh F Eukaryota T 7tgh 15 O,Z 3J,3j I7MFL6_TETTS Transmembrane protein, putative MYLPTFYKLFHETNAFRLKRYVGYGPLLLTWSIWTLYPALYNMIYSDFIPPERGVPKRIVDA 62 T 1.5 DUF5621 pdbhh F Eukaryota T 7tgh 16 P 3M UNK1 MESRSYMFSLAKKRSTLAA 19 T 5.9 Gryzun-like pdbhh F T 7tgh 18 BA 3m UNK3 AAARAYKFALAKARAAA 17 T 25 GspH pdbhh F T 7tgh 20 DA 4L Q950Z5_TETTH Ymf58 MLTWISFWSLIFWLILIILVLKPKNFISILFMSELTWLALYCLSLLFGAIYCDITLLSISFFILGVAGLEFSFGILIAILYKNLNESLNTDLNNNNNNQNIFDKNFKTPLEKINWQ 116 T 0.0017 Oxidored_q2 pdbhh F Eukaryota T 7tgh 22 FA 5B Q951C2_TETTH Ymf57 MLKNKLIKFKFFRFVQSGFYVDFIFKKFSEMFIRNIFIYSSIFFGEKFMIEYLTKKTIDSFIFNNNRFNFINLVESKYFLQILTLILYLFFITIFILFYI 100 T 29 HEPN_AbiV pdbhh F Eukaryota T 7tgh 23 GA 6 Q950Y2_TETTH Ymf62 MFLITITSYFSNIIEFNSYIINLIDFITPLFFIENFVIQFFILYLFYLLIVNNNLYYILLYIFLEIVFFGLFLCLYQLELFTGFLWVAEFAIVFIAVVLLFYLNIDGLHLKYNHNINNVLYYTPSLVLFLIFFNIDYFSELELFLPLELSFIDIYDDYYEGFNNSIMNDFTPLTLSYYSINSAEFIIIGLLLLLGSVACVNLYKSNKNYTIVKQSNLLTMFDFFKDFINFSFIRKQDLNNQTNFNPSLRSIKKKY 255 T 8.3 DUF2070 pdbpssm F Eukaryota T 7tgh 26 JA A6 I7M2Y3_TETTS NADH dehydrogenase, putative MNHYWGSSNTIPASSTQNNNYFSGGGNNVTIRGNEIMERLPSQTPSQNMVQASMKTLRFYRKFCRLIPFILRIHNIGTKFTAQQAMINFGNYIRERNHYRDPGLIDHRIQLGYELLYEAEMHFSQHTILMQYLSPYNTPLSDRGYSYLEKVKYGNKSKFLQGFYKGNKPTEF 172 T 0.00098 Complex1_LYR_1 pdbhh F Eukaryota T 7tgh 27 KA A7 I7MIJ7_TETTS NDUA7 MRKALERFNEIIFNPAIRWYQLPKPTVRRTRYPAPGSEPINREVHQIDYKTAFRDSPHNIRYHHEIHTSDQTYHSSYDPVGETTTERLVRYGYLNKDQVNNAEAVAAAAKEFQEKEKRSPSNNIIIDEISNSDKPITKENRESVAHHVRQQFEFFREVNAEEVWSVSIEEKYNPELYIYKTYDMAADDPVWRQVKLDLEWTFENIAERRESLGYMPTFKGDPNFWQALDNSFSPENIAQVQSSIGDKVTNIDTKALALNHQTEEYHKTSKLVYPIRTNLVVE 282 T 1.1 Synaptobrevin pdb F Eukaryota T 7tgh 32 PA AM I7M2U4_TETTS NDUA13 MQFFRPDFIATQVLRRADMAHSPFHKAIHDLEDKRSKLFPDRRRIPGRKAKLLLAASLLLQMWGVGKIIEIKKFMKRRDIELKGLQRKAAPFMQSMNDVRHLALRERNDMLYNELLSVHGEEYAQKMQKRFHQTDIWAPFRHRYAYMYNSSNKNVKDYKQVTLSRYINGFDKFNV 175 T 0.00014 GRIM-19 pdbpercent F Eukaryota T 7tgh 34 RA B8 I7M855_TETTS NDUB8 MALRRVLKNQFNLIHKGQAQAVRGGHGWDRPDVPLSFNPLYVHKRELSIFDTNMWMYDQVYPEYVISYNEIHLVDQWKGLKESFSQSAYWWAMMAMVFGFYFINTTPRQLGIDTNDLKGFLGEYYGQYKKRSGIRSNFLGLDVTGENSIIQPNYDRKNGIRDVIDSLNADAGKRKLINLEAKNFIERVEKECEQRILKKGGATQSHH 207 T 0.23 Spore_YhaL pdbpercent F Eukaryota T 7tgh 35 SA BL Q23KG0_TETTS NDUB10 MAFGGFRQTDNSLIIDDRRKIILNTRSLNDFQQKIYLRNFFTNYRPDLSSYDYFAFKEKLRIGELFLNEYRKRINNEVRRAAILTPTSSLREKMNHKIADQILDLSSPHVRGAHFQAVRSWTDASKIVNYVEEKQTKINKYGLQFPLLGNMTEEQCASKEDEVYQRLLKEMQKPPKKASEPVEESSDE 188 T 0.12 CMV_1a pdb F Eukaryota T 7tgh 40 XA TD Q22DC2_TETTS Transmembrane protein, putative MNLPWFVRWGTDVALFFIPAYTFANYPTTFFVFAAEKRRQRRRKDFSDVKLRDDAAFSVDQVKQLQTKLHLKQ 73 T 1.6 HIND pdbhh F Eukaryota T 7tgh 45 CB A1 I7MI60_TETTS NDUA1 MVNTAYPTPLKTILKTTPAFVVYFVFGLGFSTVIYDVVYHPKDRIERFYFRSSKFERLSRKRDEKLRHYFKPAIEWQPWYNTSTNNNTRPLLRY 94 T 0.15 KCT2 pdbhh F Eukaryota T 7tgh 48 FB T5 I7LT77_TETTS Transmembrane protein, putative MFLYKKILSIYKQSFSFFLSFNFSFFLYALLAIFLLINFCQHIHKFLYYCKEKIQKEMQNAYPEITDQHREFLKKQGLKVYEPKPLPDQINPFSKTYWITNAFIIGVSFLARRHALKVGAPRIFWSGCIVGVPLAAIISRGKSDQLDELVGARKTLEQKLEYAPITRRAWERALATNQEYQNEIKTQIQDLQAEIAAKKVAAKLE 205 T 0.02 FlxA pdb F Eukaryota T 7tgh 49 GB R UNK5 MNIAWKELENDAFKAKDIAKFSFSNASNLANFVAESQALATNHFNTALNNGFNVFFAVAGLLVVGVLVYIFFNSVGGMIIRSRIKAAQPNPNQVKVLVMPFVALGVSLVISRAGINGDDFGYKG 124 T 0.012 DUF3611 pdb F T 7tgh 57 OB B9 Q233X7_TETTS NDUB9 MSKAYYFVKNFSWAEVSNLLCYGTKYPTVLNHQQKVTRLYRATLRRVYAHQVEGYKTDFKQYNENITDIGKDFNKMLALKPESLELQAYFKKYEDLQEELFDPAMIIDESRPYAASSGRYYIFDDYLLKFDPFGFYSPKLLSENRPEEAMPFYEDYPQNDSHWNLWEQFPEDFEDSNAEREAILKSNKH 189 T 0.005 Complex1_LYR pdbpssm F Eukaryota T 7tgh 59 QB A8 I7MMF4_TETTS NDUA8 MSILNLIKNVLNMLINIYIFVQYFKQLKYNNKVGLIYQNDIYECLYVYMNESNIFIQCQEYVSVFFNAIRKEKEKEKDQFDRQIDKQRKKQRLLKKANQILKRISKMNTKSFEVLIHSQYAFDVCREQVYNFEDCRQTDTPLPKDPIHCKAQAKEVLSCYKEAEKMDPICLSSFNDSRECMFKSDGNLYNCKTWINQYVTCQKNPAAFAEFLEASTAEQLKSKKFDFVKNRGHSDKYL 238 T 0.002 Cmc1 pdbpercent F Eukaryota T 7tgh 60 RB TB Q22T55_TETTS Transmembrane protein, putative MFWRNVVRGLNCQQALRRQNFAKNITTTDIPKDSHHFAAKRSGFTQTEQAPFAYNDVYQYPKDYKPWNYNYKGNGVLLALFLGSAFSLVAYERSYASKTGRYQRKVQQNYYQI 113 T 0.055 Ncstrn_small pdbpercent F Eukaryota T 7tgh 63 UB B6 Q231G0_TETTS NDUB6 MLLIEMAFNAMKMKIFSLRKIKVKSKEQYLYNYQQKLLILGQGKEKNNKQYKKDIEMGGFQKYPIPRYLHVGQWIVNKNWKWNTFHMFFPTAILCFMVWRNSMISTAKPPNYGEYVDPQSPVAPKAIKY 129 T 0.0021 TMEM117 pdb F Eukaryota T 7tgh 64 VB BM Q22Z32_TETTS Transmembrane protein, putative MNPRNIFNLAKKVQNFNSITQKAFKRFGGAAAHHDDHHDDHHGHGGHGYEVHLVKDKNLIGNKSFKDDLVAVYGFTDVNDHHHHDETDPYHHLRGVPTLSFERMYFADAYYHDDTHEGLMNEPHGYLTMDDPMDLRPNYEKSALELLFLVSGGAILALMLGYQGLNLANPAESLFSLNTAAEEIEDKIRQIRIDNDKLLQRKAQLEEELASLNN 214 T 0.022 MctB pdbhh F Eukaryota T 7tgh 65 WB C4 Q22W63_TETTS NDUC2 MSSMLIWGACFGLFTRAAACKASMIPLTTSPWKYPKYMIVSAVTFYYFDWYRRMALEQLCYNEEKLERYQIRAKLQSLKIGEELSDAYRESFFEHAVQKNNI 102 T 0.0018 NDUF_C2 pdbhh F Eukaryota T 7tgh 66 XB AN Q24F24_TETTS Transmembrane protein, putative MELNSSAKEDSHYVGVLGYPSQHDPHTLHPKKHDSTFTKVYACRDMLWDHHWEVRNTLYAGFKGALLGVAYASGFGLISKTVPSIVLKKMFRFVRNNNFGHIRIMQDLLTPYALTGFGLGSVYYLYQHNVWENRSNKWLAEVLSNALFFQVATAVCVNPGFHIYGMVGGILFGTLKYAFYNSSFFQEKESIGSYTTFGDLSEEERKKQEYKDYIQFLGNYHKVRNGQLVDL 231 T 9.3 ENOD93 pdbhh F Eukaryota T 7tgh 67 YB B4 NDUB4 AAPPAAFFLYFFVPDNFPSAQSGFRTASRNPFQVQFVFAYDNWEYKYCGQWWSMGSLAVNVLFFVVPLFLWLILQTQSDQSSNRDDNSLTFYFSNAGFFFFQIYTNTG 108 T 0.021 DUF1579 pdbpercent F T 7tgh 68 ZB T4 I7MIE0_TETTS NDUTT4 MGGDHHHEDSHHKSNVDQHELKAEMIKELSHYYDHHDLSLFGKVQHFVEHLLEEKHHAKINTSNFDQKKLENFSESKQISRTVFALKKIKTFNHDFFTSEEEMILEPLPLGILTYGLKYAFAGVDAALLTYFWRNWNFNVRTIGLLGGLVGIQMATLHIPNLVNEVVIQTPRRRALAKKYISAYGPQFFHDIVNPKYDIEHLRHLQNKLNPY 212 T 0.17 PfUIS3 pdb F Eukaryota T 7tgh 69 AC T8 Q22SC4_TETTS NDUTT8 MTHQFENVLLSNRKNLTPQESVQKVINYALLQDAKQRSRTLRHIKASWVIPALLFTYPAWYLAKGAVNGVWSNIHPTDKVTLSFANIGRPFRLIYRPEIFLRDQQAKFIQLEKEHIEKSKKGEFVETTSPLVLWN 135 T 4 DUF1852 pdbhh F Eukaryota T 7tgh 70 BC B2 I7MG29_TETTS NDUB2 MSLRKGTSIFSRQFKKAFNDAKYQNLTAAQGETYSHLGWISNVDLRLGRAIFTFGVVGIAFCIYLEPSYFHETFGHMSQPPKYDLIDSNINGVEKKLNKQILHREHNEHKLDGFVSMFKGSDVAKN 126 T 6 Biopterin_H pdbhh F Eukaryota T 7tgh 71 CC T3 I7LUQ4_TETTS NDUTT3 MSGLLRNFEKLVCQSQLSKAGHKLLLRSPNSTLHPTAFYYKRNSSQRLANEMDVFQLGLAAAALTRQANNYAQLLDQVDKEAVREEVQERITQNHSDLNVYFGEILSLFKIGKKECPVQTVADISYVLAFGPIQVPNAAAIITENLLPVLKEKLDYASIHNLQDILSAFVKLNYVSDKELLKRLITALSQKDFPNQLQPVTNHAWNIDQYEYSDCNSWNIVSCGDNTFEKYIHEGGCENSLAKAKFAVHELLDHISFNFVNPFLFRENRINHRFAKRNADLDHEVLMQTLSKLQEIVPETSEAIATIKARL 311 T 0.034 Baculo_F pdb F Eukaryota T 7tgh 72 DC P1 Q24C39_TETTS Transmembrane protein, putative MIARRLFKRSLYYIPRAGFGGGDIRHKFSNEITDDDYDYQRAMHVKPPKEESLFQLTNILSSVPVFKTRFFLDFIARNLDTNSAVSTSDFVAPPRVHENSFFVYHSRELGNVIRKYRSLESIVLPGALLTFTYPLFAAFVAIPSYYFMFNAKIYEMSRRFVVRMDVLPHLEMISVQRIGAFGILYTKLHRIQDLEYVPFDQVKEQENYLWAIGGHGVDNQLIFKDRSTGEFFYFERQGVWDAKGLNHPLLN 251 T 0.7 TMEM70 pdbhh F Eukaryota T 7tgh 73 EC B3 A4VD20_TETTS Transmembrane protein, putative MNSPQKVAQGAGRKLFKHYINENIKSNNEQKLFFYRVNRWRWNTKDNTTAPKFLRLKYPLLVTGVCLFAYDWTYGFTQVDAHH 83 T 0.086 RGS pdbpssm F Eukaryota T 7tgh 74 FC TA I7MAF0_TETTS NDUTT10 SPAPPTIGQVELEPFKFNHERDQLIYGYTMEELYGKKFGLKHSATVLREIKKDTIMMILFIIGGFTYCYHMRETRFQLDDDFNEYVNTNKQTFRPIPDHVKL 102 T 0.028 Viral_Beta_CD pdbpssm F Eukaryota T 7tgh 76 HC TC Q22E95_TETTS NDUTT12 MVYQGFKVLRRNPTFYNPRSAGMVALSYFAYSYYVNKYYKPQNSNFEEYNSSHPHNHDEKVRQYHEKTNQAIRDAVLEKRAEHDQRLREEAKL 93 T 0.015 Hrs_helical pdbhh F Eukaryota T 7tgh 77 IC P2 Q23KE0_TETTS NDUPH2 MFNILKGAQLSFRSITNKSVNNYYNIMRQVSLDSNPIVLYQSSTFTGNGLQEFYENADALTKYLKLVPFFLEKNLYDHPKQFVIKMEFHPQNKVLSLDCLTHQGVLKKTVNLENLIPVPYEDYVQFCRRKLFNAPLFLDTEMIYFNTFQNEFYVFDKNAKWNEEGINHPELDISKLYNEKAWFDSLRII 189 T 0.26 TMEM70 pdbhh F Eukaryota T 7tgh 78 JC A3 I7M9B3_TETTS Transmembrane protein, putative MSNNNQGDFFVDKYNFSRRVVDHRQPYDLNFSINNPVGSRVWFKAWKQKAIGNFLNLVGVHYAFYGAGFCLLFVLADAWGREKYAQPYKSQILHGRQPFGHTFVQNYRNQATDLGRWNHNFACYEKQPGCGRDFD 135 T 8.3 DUF983 pdbhh F Eukaryota T 7tgh 79 KC T9 Q23B10_TETTS NDUTT9 MFLNPVKDSEFDDEVKGFVPSEGEVRFVANKNKECGYYLQGIEQCRRKMVQLAGDSSSQFHSLGFLPCKRLVDAHYRCMTDDKFGSTIEEVPEIGLDSAQKFFDCTFQQLKPMQSCRRFFDQVVRDVYRANGSQL 135 T 0.22 COX17 pdb F Eukaryota T 7tgh 80 LC TE I7MIK1_TETTS Transmembrane protein, putative MDKYIQQAKCAYNFSLKAVRFVGPLNIVFAGVAFLMFYENNYKKLYLNPRYSYTMPYLQSAKITKNLYEKL 71 T 0.21 YpmT pdbhh F Eukaryota T 7tgh 81 MC T7 Q22HE4_TETTS Transmembrane protein, putative MTNFGSPFRNTDSGIVIRDPENEKRLKLAFQNFWKSKQEDKEFQAQIKTAVSKDTVNFMFYASPLFGALLGKTYIDMFCNPRYFYFRAFTLSMFALAGYCVGNGFRNRYEHSLYTRNYHLFPKDLQDALVNGDARYCISWWKQ 143 T 0.053 AHH pdbpssm F Eukaryota T 7th0 1 A A RPNA_ECOLI Recombination-promoting nuclease RpnA MTIAERLRQEGEQSKALHIAKIMLESGVPLADIMRFTGLSEEELAAASQ 49 T 0.00037 DUF2802 pdbhh F Bacteria T 7tjl 1 A A De novo designed protein, SEWN0.1 AGPEEHKARVEEYMRRALQATTEPEKKYWEEEAKKEIEQAMYADALINPIRFTEKAAKYIKTYGFRGQEAYDQVKKEMFEKLYKYFMEKLKSE 93 T 0.088 DUF5759 pdb F T 7tl6 1 A,B A,B METRN_MOUSE HYPOXIA/REOXYGENATION REGULATORY FACTOR GYSEDRCSWRGSGLTQEPGSVGQLTLDCTEGAIEWLYPAGALRLTLGGPDPGTRPSIVCLRPERPFAGAQVFAERMTGNLELLLAEGPDLAGGRCMRWGPRERRALFLQATPHRDISRRVAAFRFELHEDQRAE 134 T 32 MORN_2 pdbhh F Eukaryota T 7tl7 2 E,F,G,H a,b,c,d peptide Sa-D2 XXRYEXYKXECPKCX 15 T 0.045 DUF983 pdbhh F T 7tl8 2 B B Peptide Sa-D3 XXQVTVWWAXPWEDC 15 T 1.8 HRCT1 pdbhh F T 7tlh 1 A,B,C,D B,C,A,D METRN_MOUSE HYPOXIA/REOXYGENATION REGULATORY FACTOR MSPQAQGLGVDGACRPCSDAELLLAACTSDFVIHGTIHGVAHDTELQESVITVVVARVIRQTLPLFKEGSSEGQGRASIRTLLRCGVRPGPGSFLFMGWSRFGEAWLGCAPRFQEFSRVYSAALTTHLNPCEMALD 136 T 8.8E-05 TIMP pdbhh F Eukaryota T 7tlj 4 D,H D,H 14KD_CERSP 14 kDa peptide of ubiquinol-cytochrome c2 oxidoreductase complex MFSFIDDIPSFEQIKARVRDDLRKHGWEKRWNDSRLVQKSRELLNDEELKIDPATWIWKRMPSREEVAARRQRDFETVWKYRYRLGGFASGALLALALAGIFSTGNFGGSSDAGNRPSVVYPIE 124 T 0.052 MtrF pdbpercent F Bacteria T 7tlw 1 A A METRL_MOUSE SUBFATIN LMSGQRGLDLHVLSAPCRPCSDTEVLLAICTSDFVVRGFIEDVTHVPEQQVSVIYLRVNRLHRQKSRVFQPAPEDSGHWLGHVTTLLQCGVRPGHGEFLFTGHVHFGEAQLGCAPRFSDFQRMYRKAEEMGINPCEINME 140 T 7.8E-05 TIMP pdbhh F Eukaryota T 7tm9 1 A,B,C,D,E,F,G,H A,B,C,D,E,F,G,H A0A0H3GK94_KLEPH Bacterial alkaline phosphatase MAHHHHHHSPVIHAETTAAPVLENRAAQGDITTPGGARRLTGDQTEALRASLINKPAKNVILLIGDGMGDSEITAARNYAEGAGGFFKGIDALPLTGQYTHYSLDKKTGKPDYVTDSAASATAWTTGVKTYNGALGVDIHENAHQTILELAKAAGLATGNVSTAELQDATPAALVAHVTSRKCYGPTVTSEKCPSNALEKGGKGSITEQLLNARPDVTLGGGAKTFAETATAGEWQGKTLREQAQARGYQIVTDAASLAAATEASQDKPLLGLFADGNMPVRWEGPKASYHGNIDKPPVTCTPNPKRDASVPTLAQMTEKAIDLLSRNEKGFFLQVEGASIDKQDHAANPCGQIGETVDLDEAVQKALEFARKDGNTLVIVTADHAHASQIIPADSKAPGLTQALNTHDGAVMVMSYGNSEEESMEHTGTQLRIAAYGPHAANVVGLTDQTDLFTTMKAALSLK 464 T 2.1E-09 Alk_phosphatase unppssm F Bacteria T 7to7 2 C,F C,F 1xAcK.4xE (monoAcK.4xE) EEALLLAXLYHFGEE 15 T 7.1 Gal_mutarotas_2 pdbhh F T 7to8 2 C C 2xAcK.1 (diAcK.1) AQRSLXLLXHLYHG 14 T 6.2 WhiA_N pdbhh F T 7to9 2 C C 2xAcK.4xE (diAcK.4xE) EEAQRSLXLLXHLYHGEE 18 T 10 WhiA_N pdbhh F T 7tod 1 A A A0A2S6F4N3_LEGPN SETA SDEKIKTAHDLIDEIIQDVIQLDGKLGLLGGNTRQLEDGRVINIPNGAAMIFDDYKKYKQGELTAESALESMIKIAKLSNQLNRHTFFNQRQPETGQFYKKVAAIDLQ 108 T 0.36 CUB_2 unppercent F Bacteria T 7tok 1 A,B A,B A0A5P6A8B9_FLAJO Acetylxylan esterase I PPEPGLAQNTLRQIIKVSLGGKQIRMRFSNLFSDQPAVLKSVSVANVTEAPAVDIKTQKILSFKGSPQVTLGADEVMYSDAFDFELQPGQLLAITIHYGEISSNVSGHPGSRTTSYILQGDHINNESFAGAVKTDHWYSIMGVDISSVKN 150 T 2.3 PCuAC pdbhh F Bacteria T 7toq 47 UA ALP0 60S acidic ribosomal protein P0 RATWKSNYFLKIIQLLDTMMRKAIRGH 27 T 2.7 Holin_SPP1 pdbhh F T 7tpp 3 C C FA5_HUMAN ACTIVATED PROTEIN C COFACTOR,PROACCELERIN,LABILE FACTOR AQLRQFYVAAQGISWSYRPEPTNSSLNLSVTSFKKIVYREYEPYFKKEKPQSTISGLLGPTLYAEVGDIIKVHFKNKADKPLSIHPQGIRYSKLSEGASYLDHTFPAEKMDDAVAPGREYTYEWSISEDSGPTHDDPPCLTHIYYSHENLIEDFNSGLIGPLLICKKGTLTEGGTQKTFDKQIVLLFAVFDESKSWSQSSSLMYTVNGYVNGTMPDITVCAHDHISWHLLGMSSGPELFSIHFNGQVLEQNHHKVSAITLVSATSTTANMTVGPEGKWIISSLTPKHLQAGMQAYIDIKNCPKKTRNLKKITREQRRHMKRWEYFIAAEEVIWDYAPVIPANMDKKYRSQHLDNFSNQIGKHYKKVMYTQYEDESFTKHTVNPNMKEDGILGPIIRAQVRDTLKIVFKNMASRPYSIYPHGVTFSPYEDEVNSSFTSGRNNTMIRAVQPGETYTYKWNILEFDEPTENDAQCLTRPYYSDVDIMRDIASGLIGLLLICKSRSLDRRGIQRAADIEQQAVFAVFDENKSWYLEDNINKFCENPDEVKRDDPKFYESNIMSTINGYVPESITTLGFCFDDTVQWHFCSVGTQNEILTIHFTGHSFIYGKRHEDTLTLFPMRGESVTVTMDNVGTWMLTSMNSSPRSKKLRLKFRDVKCIPDDDEDSYEIFEPPESTVMATRKMHDRLEPEDEESDADYDYQNRLAAALGIR 709 F F Eukaryota T 7tr6 2 B,C,D,E,F D,E,F,G,H Q8U332_PYRFU Cas11a GGWIRNIGRYLSYLVDDTFEEYAYDVVDGIAKARTQEELLEGVYKALRLAPKLKKKAESKGCPPPRIPSPEDIEALEEKVEQLSNPKDLRKLAVSLALWAFASWNNCP 108 T 0.00011 Cas_Csa5 pdbhh F Archaea T 7tsq 2 C,D C,D CDND2_ENTCL CGAS/DNCV-LIKE NUCLEOTIDYLTRANSFERASE,CD-NTASE038 KPAEPQKTGRFA 12 T 16 Rrp44_CSD1 pdbhh F Bacteria T 7tta 1 A A A0A2P9IBF7_9ACTN Putative cytochrome P450 hydroxylase EHRVLHLRDRLDLAAELKLLCERGPLVRIPLEDGSAVHWFALGYDVVREVLGSEKFDKRVIGTHFNHQEMALPGNLLQLDPPEHTRLRRMVAPAYSVRRMQALEPRVQAIVDDHLDTMASTGPPVEFLREVAGPMAARVACEFLGIPLDDRGELIRLTAHRGGKRRRVLNGHAYLAYMRELAARLRRDPGDGMLGMVARDHGADISDEELAGLCAVVMNSSVEQTESCLAAGTLLLLEHPEQFALLRERPELGEQAVEEIVRYLSVFEGLDPRTATEDVEIGGQVIKKGEAVFCSLLAANRADPALDGFDITRKESRHVAFGHGIHHCLGAPLARMELRIAFTTLVSRFPSLRTAVPAEEIRFRPPSSNVFTLLELPLTW 380 T 1.4E-27 p450 unppssm F Bacteria T 7ttb 1 A A A0A2P9IBF7_9ACTN Putative cytochrome P450 hydroxylase MVAPEHRVLHLRDRLDLAAELKLLCERGPLVRIPLEDGSAVHWFALGYDVVREVLGSEKFDKRVIGTHFNHQEMALPGNLLQLDPPEHTRLRRMVAPAFYSVRRMQALEPRVQAIVDDHLDTMASTGPPVEFLREVAGPMAARVACEFLGIPLDDRGELIRLTAHRGGKRRRVLNGHAYLAYMRELAARLRRDPGDGMLGMVARDHGADISDEELAGLCAVVMNSSVEQTESCLAAGTLLLLEHPEQFALLRERPELGEQAVEEIVRYLSVFEGLDPRTATEDVEIGGQVIKKGEAVFCSLLAANRADPALDGFDITRKESRHVAFGHGIHHCLGAPLARMELRIAFTTLVSRFPSLRTAVPAEEIRFRPPSSNVFTLLELPLTW 385 T 1.4E-27 p450 unppssm F Bacteria T 7tto 1 A A A0A2P9IBF7_9ACTN cytochrome P450 hydroxylase MVAPEHRVLHLRDRLDLAAELKLLCERGPLVRIPLEDGSAVHWFALGYDVVREVLGSEKFDKRVIGTHFNHQEMALPGNLLQLDPPEHTRLRRMVAPAYSVRRMQALEPRVQAIVDDHLDTMASTGPPVEFLREVAGPMAARVACEFLGIPLDDRGELIRLTAHRGGKRRRVLNGHAYLAYMRELAARLRRDPGDGMLGMVARDHGADISDEELAGLCAVVMNSSVEQTESCLAAGTLLLLEHPEQFALLRERPELGEQAVEEIVRYLSVFEGLDPRTATEDVEIGGQVIKKGEAVFCSLLAANRADPALDGFDITRKESRHVAFGHGIHHCLGAPLARMELRIAFTTLVSRFPSLRTAVPAEEIRFRPPSSNVFTLLELPLTW 384 T 1.4E-27 p450 unppssm F Bacteria T 7tuj 1 A A SLX4_HUMAN BTB/POZ DOMAIN-CONTAINING PROTEIN 12 AQMPSAGGAQKPEGLETPKGANRKKNLPPKVPITPMPQYSIMETPVLKKELDRFGVRPLPKRQMVLKLKEIFQYTHQTLDSDSEDE 86 T 1.9 Endonuc-dimeris pdbhh F Eukaryota T 7tv0 2 E,F E,G VEMP_SARS2 Envelope small membrane protein XPSFYVYSRVXN 12 T 0.56 CoV_E pdbhh T Viruses T 7tv5 1 A A W5IDB3_LASLA Lasiocepsin GLPRKILCAIAKKKGKCKGALKLVCKCX 28 T 0.035 Defensin_2 pdbpssm F Eukaryota T 7tv6 1 A A W5IDB3_LASLA Lasiocepsin heterogeneous-backbone proteomimetic analogue GLXRKXLCAXAKXKGKCKGAXKLXCKCX 28 T 1.3 Antimicrobial_1 unphh F Eukaryota T 7tv7 1 A A W5IDB3_LASLA Lasiocepsin heterogeneous-backbone proteomimetic analogue GLXRKXLCAXAKXKGXCXGAXKLXCKCX 28 T 1.3 Antimicrobial_1 unphh F Eukaryota T 7tv8 1 A A W5IDB3_LASLA Lasiocepsin heterogeneous-backbone proteomimetic analogue GLXRKXLCAXAKXKXKCKXAXKLXCKCX 28 T 1.2 Antimicrobial_1 pdbhh F Eukaryota T 7tvh 1 A,B,C,D A,B,C,D TSE1_PSEAE TYPE VI SECRETION EXPORTED 1 MDSLDQCIVNACKNSWDKSYLAGTPNKDNCSGFVQSVAAELGVPMPRGNANAMVDGLEQSWTKLASGAEAAQKAAQGFLVIAGLKGRTYGHVAVVISGPLYRQKYPMCWSGSIAGAVGQSQGLKSVGQVWNRTDRDRLNYYVYSLASCSLPRASLEHHHHHH 162 T 3.2E-05 Amidase_6 unphh F Bacteria T 7txf 2 F,G,H F,G,H CDKB_CONVX VX20.2,VXXIIB TRMCGSMSCPRNGCTCVYHWRRGHGCSCPG 30 T 6.1 IGFL pdbhh F Eukaryota T 7txj 2 C A A7WKI9_9VIRU MCP1 MAGKKRRLSQASVLRYYAKRFTMNVGTTAHVLGKEVAGNPWVAKAIDKLSYQETYNWISDYQASHLAKQVAKQVAEKYGIPPTFQGLLMAYAEKVVANYILDYKGESLTQMHDNYLYELMQKMPIAPTGTSSGYIYVFIGKDGKTHTVDMSKVLTDIEDALLKRA 165 T 0.2 RP1-2 pdb T Viruses T 7txj 3 D a A7WKJ0_9VIRU MCP2 MAGRQAHRKFDVRNDTSTRWKGKLYGIFVNYMGEDYAKEFVEQAYSNYEKVFVNIYTKIHNQLRTTLTSSAGAGATFPLWQIINEAIYAVYLTHKETASFLYAKYVARGIQPNVVKKILAETGNALKGIVPAVAQELGETVLDESNVISVVDDIVRKNPALPNSYAGIILQEARISTTPHYEGTEGFSSMESAYSALEEIEKGL 204 T 0.0031 Dynein_light pdb T Viruses T 7tzk 2 C,D C,D B7ULW4_ECO27 T3SS secreted effector NleH homolog PPELPSVDYNSL 12 T 6.1 Se-cys_synth_N pdbhh F Bacteria T 7u09 3 C A SARS-CoV-2 S fusion peptide PSKRSFIEDLLFNK 14 T 0.00028 CoV_S2 pdbhh F T 7u0e 3 C,D C,D SARS-CoV-2 S fusion peptide PKRSFIEDLLFNK 13 T 5.4E-05 CoV_S2 pdbhh F T 7u5d 4 D A Cas8/5 MVTIMHIEELLDIEDHGERDRQLRRYLAPYSAEIGVDGAEKMALVVLLNLTLKRDRVESLCDEGLARQLLSDEGHITNCLHTVRWLHTHNLKYPDARVSGERLIINAPPLIPGVISSAGLPMRMGWAHDSSDINLAKLFGTSFRYRDDSTNLALQLVARSKTWEQALIGLGLTQQQLDIWCQLLASNLENNTFPTVVSPFSKQVRFLYQGNYCVVTPVVSHALLAQLQNVVHEKKLQCTYIHHDHPASVGSLVGALGGKVAVLDYPPPVSPDKARSFSQARKHRLANGQSLFDRSVFNDHVFIDALKHVISRPGLTRKQQRQLRLSALRYLRRQLAIWLGPIIEWRDEIVSSGRGEPGNLPSGGLELELITQPKKMLPELMLQVAGRFHLELQNHSAGRRFAFHPALMAPIKSQILWLLRQLADDEEKDEPHPPTSCYYLHLSGLTVYDASALANPYLCGIPSLSALAGFCHDYERRLQSLIGQSVYFRGLAWYLGRYSLVTGKHLPEPSKSADPKSVSAIRRPGLLDGRYCDLGMDLIIEVHIPTGGSLPFTTCLDLLRVALPARFAGGCLHPPSLYEEYNWCTVYQDKSTLFTVLSRLPRYGCWIYPSDADLRSFEELSEALALDRRLRPVATGFVFLEEPVERAGSIEGQHVYAESAIGTALCINPVEMRLAGKKRFFGAGFWQLNDAKGAILMNGSANTG 704 T 0.00011 Cas_Csy2 pdb F T 7u6d 1 A A IM459 SLEQEWXKIECEVYGKCPPKKAXYDWFERQLK 32 T 0.3 DUF2161 pdbhh F T 7u6e 4 E,F G,H IM462 XSLEEEWAQIECEVYGRCPPSES 23 T 1.5 DUF6058 pdbhh F T 7u7n 4 D D IL27A_HUMAN IL-27 SUBUNIT ALPHA,IL-27-A,IL27-A,INTERLEUKIN-30,P28 FPRPPGRPQLSLQELRREFTVSLHLARKLLSEVRGQAHRFAESHLPGVNLYLLPLGEQLPDVSLTFQAWRRLSDPERLCFISTTLQPFHALLGGLGTQGRWTNMERMQLWAMRLDLRDLQRHLRFQVLAAGFNLPEEEEEEEEEEEEERKGLLPGALGSALQGPAQVSWPQLLSTYRLLHSLELVLSRAVRELLLLSKAGHSVWPLGFPTLSPQP 215 F F Eukaryota T 7u8o 8 P,Q,R Q,R,S Q5ZWW6_LEGPH Bacterial effector protein SidK MSFIKVGIKMGGLTSEQYHSQVVGKIGYIARCMQTIDPENNLKKIREDYQDVLIWAEKNYRFEEILEASKSGKCPNDLDALSRRSLILQELLRLVSSISPFKMKLDLIESQYEKMKQHVNLWKSDYHVKLNQLNQLTDYLKNAAPTPKNNFLRAMTSVLQMQIAQYGITEDNEGINQLFKLGLHLLAMANEKIDEQYHLFKGYVKDQPEESPFEGILPAEDQKILVKTMIDYAMPKLSSKVLQDKLSALSSSDVLTKTLLDSIDRIVKENEKLNALSKVKLGKFGLDIREIEVIYSQALKISPQDALQYTAQQCDAQLLSMAFPDSQNYIIESISNK 337 T 0.15 Chalcone unp F Bacteria T 7u8o 16 Z f A0A480L8C4_PIG Ribonuclease kappa MASLLCCGPKLAACGIVLSAWGVIMLIMLGIFFNVHSAVLIEDVPFTEKDFENGPQDIYKLYEQVSYNCFIAAGLYLLLGGFSFCQVRLNKRKEYMVR 98 T 0.0082 DUF2650 unp F Eukaryota T 7u9e 1 A A P230_PLAF7 Gametocyte surface protein P230 SVLQSGALPSVGVDELDKIDLSYETTESGDTAVSEDSYDKYASQNTNKEYVCDFTDQLKPTESGPKVKKCEVKVNEPLIKVKIICPLKGSVEKLYDNIEYVPKKSPYVVLTKEETKLKEKLLSKLIYGLLISPTVNEKENNFKEGVIEFTLPPVVHKATVFYFICDNSKTEDDNKKGNRGIVEVYVEPYGN 191 T 1.1 s48_45 unppercent F Eukaryota T 7uav 1 A,B,C,D,E A,B,C,D,E TAD1_CLOBO TAD1 SMKELSTIQKREKLNTVERIGSEGPGGAYHEYVIKSNSMDSQGNYDVYETIKFQKGARKEEKSQHGVIDSDLLEIVRDRLKSFQAGPFSSRENACALTHVEEALMWMNRRVEDRIERNVLGTNTK 125 T 0.023 HlyU pdb F Bacteria T 7uba 1 A A A7TRP1_VANPO HORMA domain-containing protein QSPDIECECDLLCPITSTRIKQCKNCRKFVHSLCYGNKPGPKVDKCISCVYGPMFDPSSSEFKDLMMLRKCYRFLSRNKGFPPSIKEFTNSIMEEGQVTLENIERINFCISTLSSDGILNFSQCNKQRDASQDGSASKATRIQGNKVSIDEEGIFVPKIGELLKGREYMCCFIYNSDNSHACYLDVSPESKRQIENWIDQVKSIRNDFEPNSS 213 T 0.06 PHD_2 pdb F Eukaryota T 7ubu 2 B,E P,Q H32_MAIZE Histone H3.2 SARTKQTARXSTGGKAPRKQLATKAARKSAPAT 33 T 0.41 PAF pdbpercent F Eukaryota T 7ucf 4 D G Q2N0S6_9HIV1 ENV POLYPROTEIN MDAMKRGLCCVLLLCGAVFVSPAGAGENLWVTVYYGVPVWKDAETTLFCASDAKAYETEKHNVWATHACVPTDPNPQEIHLENVTEEFNMWKNNMVEQMHTDIISLWDQSLKPCVKLTPLCVTLQCTNVTNNITDDMRGELKNCSFNMTTELRDKKQKVYSLFYRLDVVQINENQGNRSNNSNKEYRLINCNTSAITQACPKVSFEPIPIHYCAPAGFAILKCKDKKFNGTGPCPSVSTVQCTHGIKPVVSTQLLLNGSLAEEEVMIRSENITNNAKNILVQFNTPVQINCTRPNNNTRKSIRIGPGQAFYATGDIIGDIRQAHCNVSKATWNETLGKVVKQLRKHFGNNTIIRFANSSGGDLEVTTHSFNCGGEFFYCNTSGLFNSTWISNTSVQGSNSTGSNDSITLPCRIKQIINMWQRIGQAMYAPPIQGVIRCVSNITGLILTRDGGSTNSTTETFRPGGGDMRDNWRSELYKYKVVKIEPLGVAPTRCKRRVVGR 501 T 3.4E-54 GP120 pdbpercent T Viruses T 7udi 1 A,B A,B DNA damage response protein DdrC MKNAPLTLNFGSVRLPVSADGLLHAPTAQQQLGLTQSWEAALVEHGLPETYRDFGAGPEAAVSVPDFVALAFALDTPEARRWQKRARELLARAMQGDVRVAAQIAERNPEPDARRWLAARLESTGARRELMATVARHGGEGRVYGQLGSISNRTVLGKDSASVRQERGVKATRDGLTSAELLRMAYIDTVTARAIQESEARGNAAILTLHEQVARSERQSWERAGQVQRVG 231 T 21 KilA-N pdbhh F T 7udj 2 B H De novo designed helical repeat protein RPB_PEW3_R4 KKEAEEVAAHVEQIAFIAKEQGNEEVAKLAKRLAETIKRLNEGTEEEVKRLLEAAEVAAHVLQIAFIAHEQGNEEVAKLALELAESILRLIEGTEEEVKRLLEAAEVAAHVLQIAFIAHEQGNEEVAKLALELAESILRLIEGTEEEVKELLERAEEAAHVLQHAFIATEQGNEEDAKEALRKAEEILRRNA 192 T 0.097 HemY_N pdb F T 7udk 1 A A Designed helical repeat protein (DHR) RPB_LRP2_R4 DREEAAFLAASILIQHAHEQGKDDRELEKILEIAIRILEKNGVDREEAAFLAASILIQHAHEQGKDDRELEKILEIAIRILEKNGVDREEAAFLAASILIQHAHEQGKDDRELEKILEIAIRILEKNGVDREEAAFLAASILIQHAHEQGKDDRELEKILEIAIRILEKNGV 172 T 0.032 Hormone_recep pdb F T 7udl 1 A A Designed helical repeat protein (DHR) RPB_PLP1_R6 PEEERIKYVITVVEQIAKDAHRNGQEELAKLAERTAEEAKKATERGEEETLRIVYVIVVVLQIALEAHRNGQEELAKLALRTAEEAIKATERGEEETLRIVYVIVVVLQIALEAHRNGQEELAKLALRTAEEAIKATERGEEETLRIVYVIVVVLQIALEAHRNGQEELAKLALRTAEEAIKATERGEEETLRIVYVIVVVLQIALEAHRNGQEELAKLALRTAEEAIKATERGEEETERIVYDIVVVLQEALEAHRNGEEERAKKALDEARRRIEATERGE 282 T 0.019 SMBP pdb F T 7udm 1 A,B A,B Designed helical repeat protein (DHR) RPB_PLP1_R6 APEEERIKYVITVVEQIAKDAHRNGQEELAKLAERTAEEAKKATERGEEETLRIVYVIVVVLQIALEAHRNGQEELAKLALRTAEEAIKATERGEEETLRIVYVIVVVLQIALEAHRNGQEELAKLALRTAEEAIKATERGEEETLRIVYVIVVVLQIALEAHRNGQEELAKLALRTAEEAIKATERGEEETLRIVYVIVVVLQIALEAHRNGQEELAKLALRTAEEAIKATERGEEETERIVYDIVVVLQEALEAHRNGEEERAKKALDEARRRIEATERGE 283 T 0.019 SMBP pdb F T 7udv 1 A,B,C,D,E,F,G,H,I,J A,B,C,D,E,F,G,H,I,J De novo designed proton channel LLQL DSLKWIVFLLFLIVLLQLAIVFLLRG 26 T 0.0062 RCR pdbhh F T 7udw 1 A,B,C,D,E,F,G,H,I,J,K,L,M,N,O,P,Q,R,S,T A,B,C,D,E,F,G,H,I,J,K,L,M,N,O,P,Q,R,S,T De novo designed pentameric proton channel QQLL DSQKWIVFLQFLIVLLLLAIVFLLRG 26 T 0.0078 RCR pdbhh F T 7udx 1 A,B,C,D,E,F,G,H,I,J,K,L,M,N,O A,B,C,D,E,F,G,H,I,J,K,L,M,N,O De novo designed pentameric proton channel QLQL DSQKWIVFLLFLIVLLQLAIVFLLRG 26 T 0.0068 RCR pdbhh F T 7udy 1 A,B,C,D,E,F,G,H,I,J,K,L,M,N,O A,B,C,D,E,F,G,H,I,J,K,L,M,N,O Designed channel QLLL DSQKWIVFLLFLIVLLLLAIVFLLRG 26 T 0.011 RCR pdbhh F T 7udz 1 A,B,C,D,E,F,G,H,I,J A,B,C,D,E,F,G,H,I,J De novo designed pentameric proton channel LQLL DSLKWIVFLQFLIVLLLLAIVFLLRG 26 T 0.0065 RCR pdbhh F T 7ue2 1 A A RPB_PLP3_R6 MDEEREKLKEKLKEVLRRAKEAKKKGDKEKLIELAYEAAALAAWIIHKDSNDDEIVELAKEALKLVLEAAKEAKKNGDKEKLIKLAYLAAAVAAWIIHTDGDDDEIVELAKEALKLVLEAAKEAKKNGDKEKLIKLAYLAAAVAAWIITTDGDDDEIVELAKEALKLVLEAAKEAKKNGDKEKLIKLAYLAAAVAAWIIHTDGDDDEIVELAKEALKLVLEAAKEAKKNGDKEKLIKLAYLAAAVAAWIITTDGDDEEIVELAKEALKLVKEAAEEAEKQGDEELREKLRYLSEAVREWIERND 304 T 0.002 SMBP pdb F T 7ueg 1 A,B,C,D,E,F A,E,B,C,D,F A3MUL8_PYRCJ Pilin MARKKNYRPLIALAALAVAALAMATLTFTNLTYWLINATLPPAMKYPGTDTTITRSDSSGYNRYVYVSYYYDPSTGYNVTRISIVGFTGDPTNYTNVLQLCNKYYSGTLYAKLVAVGTVGTTNYESYIKDFRVYFVNPTTTPNYVQFQGTSVTQSATGSVSIGPGQCATVGAYVLVDPSLPTSARDGKTVIATYQVNVVFSTSP 204 T 0.19 DUF3254 pdb F Archaea T 7uek 1 A A OT3 MHHHHHHENLYFQSDAICIYLDESATWKDMKKAMEILYKLGVKKIVVLFKYDEKLIKVAAKVLHDLGAEEAIIILIFDIDDEDEFKKQVKKALELMKKLGVDHRIIALRMTDEEKFKKLAKIAAELGADAICIYLDESATWKDMKKAMEILYKLGVKKIVVLFKYDEKLIKVAAKVLHDLGAEEAIIILIFDIDDEDEFKKQVKKALELMKKLGVDHRIIALRMTDEEKFKKLAKIAAELGA 242 T 0.00023 DeoC pdb F T 7ug2 1 A A TRI75_MOUSE Tripartite motif-containing protein 75 GPGGVTLREQAEAQRSQLTSECEKLMRFLDQEERAAFSRLEDEEMRLEKRLLDNIAALE 59 T 0.00063 DUF3583 unphh F Eukaryota T 7ugb 2 B I ISG20_HUMAN ESTROGEN-REGULATED TRANSCRIPT 45 PROTEIN,PROMYELOCYTIC LEUKEMIA NUCLEAR BODY-ASSOCIATED PROTEIN ISG20 XIRARRGLPRLAVSD 15 T 0.00026 DNA_pol_B_exo2 unphh F Eukaryota T 7ugc 1 A A A0A827X9M7_ECOLX VWA DOMAIN PROTEIN INTERACTING WITH AAA ATPASE MLTLDTLNVMLAVSEEGLIEEMIIALLASPQLAVFFEKFPRLKAAITDDVPRWREALRSRLKDARVPPELTEEVMCYQQSQLLSTPQFIVQLPQILDLLHRLNSPWAEQARQLVDANSTITSALHTLFLQRWRLSLIVQATTLNQQLLEEEREQLLSEVQERMTLSGQLEPILADNNTAAGRLWDMSAGQL 191 T 0.35 RHH_3 unppssm F Bacteria T 7ugn 1 A,B,C A,B,C Q2N0S5_9HIV1 ENV POLYPROTEIN ENLWVTVYYGVPVWKDAETTLFCASDAKAYKKHNVWATHACVPTDPNPQEIHLENVTEEFNMWKNNMVEQMHTDIISLWDQSLKPCVKLTPLCVTLQCTNVTNNIDDMRGELKNCSFNMTTELRDKRQKVHALFYKLDIVPINENQNTSYRLINCNTAAITQACPKVSFEPIPIHYCAPAGFAILKCKDKKFNGTGPCPSVSTVQCTHGIKPVVSTQLLLNGSLAEEEVMIRSEDIRNNAKNILVQFNTPVQINCTRPNNNTRKSIRIGPGQWFYATGDIIGDIRQAHCNVSKATWNETLGKVVKQLRKHFGNNTIIRFANSSGGDLEVTTHSFNCGGEFFYCDTSGLFNSTWISNDSITLPCRIKQIINMWQRIGQAMYAPPIQGVIRCVSNITGLILTRDGGSTDSTTETFRPSGGDMRDNWRSELYKYKVVKIEPLGVAPTRCKRRVV 451 T 2.5E-53 GP120 pdbpercent T Viruses T 7ugo 1 A,B,C A,B,C Q2N0S5_9HIV1 ENV POLYPROTEIN ENLWVTVYYGVPVWKDAETTLFCASDAKAYETKKHNVWATHACVPTDPNPQEIHLENVTEEFNMWKNNMVEQMHTDIISLWDQSLKPCVKLTPLCVTLQCTNVTNNITDDMRGELKNCSFNMTTELRDKRQKVHALFYKLDIVPINENQNTSYRLINCNTAAITQACPKVSFEPIPIHYCAPAGFAILKCKDKKFNGTGPCPSVSTVQCTHGIKPVVSTQLLLNGSLAEEEVMIRSEDIRNNAKNILVQFNTPVQINCTRPNNNTRKSIRIGPGQWFYATGDIIGDIRQAHCNVSKATWNETLGKVVKQLRKHFGNNTIIRFANSSGGDLEVTTHSFNCGGEFFYCDTSGLFNSTWISNTSVQGSNSTGSNDSITLPCRIKQIINMWQRIGQAMYAPPIQGVIRCVSNITGLILTRDGGSTDSTTETFRPSGGDMRDNWRSELYKYKVVKIEPLGVAPTRCKRRVV 466 T 3.5E-54 GP120 pdbpercent T Viruses T 7ugp 1 A,B,C A,B,C Q2N0S5_9HIV1 ENV POLYPROTEIN ENLWVTVYYGVPVWKDAETTLFCASDKKHNVWATHACVPTDPNPQEIHLENVTEEFNMWKNNMVEQMHTDIISLWDQSLKPCVKLTPLCVTLQCTNVTNNIGELKNCSFNMTTELRDKRQKVHALFYKLDIVPINENQNTSYRLINCNTAAITQACPKVSFEPIPIHYCAPAGFAILKCKDKKFNGTGPCPSVSTVQCTHGIKPVVSTQLLLNGSLAEEEVMIRSENITNNAKNILVQFNTPVQINCTRPNNNTRKSIRIGPGQWFYATGDIIGDIRQAHCNVSKATWNETLGKVVKQLRKHFGNNTIIRFANSSGGDLEVTTHSFNCGGEFFYCDTSGLFNSTWISNDSITLPCRIKQIINMWQRIGQAMYAPPIQGVIRCVSNITGLILTRDGGSTDSTTETFRPSGGDMRDNWRSELYKYKVVKIEPLGVAPTRCKRRVV 443 T 1.5999999999999998E-49 GP120 unp T Viruses T 7ugw 4 F E evybactin XTXTHXXFGXSX 12 T 9 RNA_pol pdbhh F T 7uhb 2 B K Multivalent miniprotein inhibitor AHB2-2GS-SB175 ELEEQVMHVLDQVSELAHELLHKLTGEELERAAYFNWWATEMMLELIKSDDEREIREIEEEARRILEHLEELARKGGSEALEELEKALRELKKSTDELERSTEELEKNPSEDALVENNRLIVENNKIIVEVLRIIAKVLKLEHHHHHH 148 T 0.0012 Syntaxin-6_N pdbpssm F T 7uhe 2 B,D B,D TAF2_YEAST TAFII-150,TBP-ASSOCIATED FACTOR 150 KDA,TBP-ASSOCIATED FACTOR 2,TSM-1 SRSFMVKIRTKN 12 T 2 DUF3970 pdbhh F Eukaryota T 7ui9 5 E a MED1_YEAST MEDIATOR COMPLEX SUBUNIT 1 MVEGDSYVETLDSMIELFKDYKPGSITLENITRLCQTLGLESFTEELSNELSRLSTASKIIVIDVDYNKKQDRIQDVKLVLASNFDNFDYFNQRDGEHEKSNILLNSLTKYPDLKAFHNNLKFLYLLDAYSHIESDSTSHNNGSSDKSLDSSNASFNNQGKLDLFKYFTELSHYIRQCFQDNCCDFKVRTNLNDKFGIYILTQGINGKEVPLAKIYLEENKSDSQYRFYEYIYSQETKSWINESAENFSNGISLVMEIVANAKESNYTDLIWFPEDFISPELIIDKVTCSSNSSSSPPIIDLFSNNNYNSRIQLMNDFTTKLINIKKFDISNDNLDLISEILKWVQWSRIVLQNVFKLVSTPSSNSNSSELEPDYQAPFSTSTKDKNSSTSNTEPIPRSNRHGSVVEASRRRRSSTNKSKRPSITEAMMLKEEGLQQFNLHEILSEPAIEEENGDSIKEHSTTMDGANDLGFTASVSNQENAGTDIVMEDHGVLQGTSQNYGTATADDADIEMKDVSSKPSKPESSVLQLIVSEDHIILDTISECNLYDDVKCWSKFIEKFQDIVS 566 T 7.5E-16 Med1 pdbhh F Eukaryota T 7uii 1 A a CanA MTTQSPLNSFYATGTAQAVSEPIDVESHLGSITPAAGAQGSDDIGYAIVWIKDQVNDVKLKVTLANAEQLKPYFKYLQIQITSGYETNSTALGNFSETKAVISLDNPSAVIVLDKEDIAVLYPDKTGYTNTSIWVPGEPDKIIVYNETKPVAILNFKAFYEAKEGMLFDSLPVIFNFQVLQVG 183 T 6.4 Hormone_3 pdbhh F T 7uik 7 H n MED14_YEAST GLUCOSE REPRESSION REGULATORY PROTEIN 1,MEDIATOR COMPLEX SUBUNIT 14 MQLVVLTDVVERLHKNFESENFKIIALQPNEISFKYLSNNDEDDKDCTIKISTNDDSIKNLTVQLSPSNPQHIIQPFLDNSKMDYHFIFSYLQFTSSLFKALKVILNERGGKFHESGSQYSTMVNIGLHNLNEYQIVYYNPQAGTKITICIELKTVLHNGRDKIQFHIHFADVAHITTKSPAYPMMHQVRNQVFMLDTKRLGTPESVKPANASHAIRLGNGVACDPSEIEPILMEIHNILK 241 T 0.092 CDT1 unp F Eukaryota T 7uiq 2 C,D C,D TIAM1_MOUSE TIAM-1 RTLDSHASRMTQLKKQAAL 19 T 0.67 Gas_vesicle_C pdbhh F Eukaryota T 7uit 1 A,AA,AB,B,BA,BB,C,CA,CB,D,DA,DB,E,EA,EB,F,FA,FB,G,GA,GB,H,HA,HB,I,IA,IB,J,JA,JB,K,KA,KB,L,LA,LB,M,MA,MB,N,NA,NB,O,OA,OB,P,PA,PB,Q,QA,QB,R,RA,RB,S,SA,SB,T,TA,TB,U,UA,V,VA,W,WA,X,XA,Y,YA,Z,ZA B,b,1,C,c,2,D,d,3,E,e,4,F,f,5,G,g,6,H,h,7,I,i,8,J,j,9,K,k,AA,L,l,BA,M,m,CA,N,n,DA,O,o,EA,P,p,FA,Q,q,A,R,r,JA,S,s,GA,T,t,HA,U,u,IA,V,v,W,w,X,x,Y,y,Z,z,a,0 Peptide 2 XLKAIAQEFKAIAKKFKAIAXEFKAIAQKX 30 T 12 DUF5741 pdbhh F T 7uj4 1 A,B A,B MEN1_HUMAN Isoform 2 of Menin MGLKTAQKTLFPLRSIDDVVRLFAAELGREEPDLVLLSLVLGFVEHFLAVNRVIPTNVPELTFQPSPAPDPPGGLTYFPVADLSIIAALYARFTAQIRGAVDLSLYPREGGVSSRELVKKVSDVIWNSLSRSYFKDRAHIQSLFSFITGTKLDSSGVAFAVVGACQALGLRDVHLALSEDHAWVVFGPNGEQTAEVTWHGKGNEDRRGQTVNAGVAERSWLYLKGSYMRCDRKMEVAFMVCAINPSIDLHTDSLELLQLQQKLLWLLYDLGHLERYPMALGNLADLEELEPTPGRPDPLTLYHKGIASAKTYYRDEHIYPYMYLAGYHCRNRNVREALQAWADTATVIQDYNYCREDEEIYKEFFEVANDVIPNLLKEAASLLEAGEERPGEQSQGTQSQGSALQDPECFAHLLRFYDGICKWEEGSPTPVLHVGWATFLVQSLGRFEGQVRQKVRITFQSEKMKGMKELLVATKINSSAIKLQLTAQ 488 T 2.4E-11 Menin pdb F Eukaryota T 7ujd 6 F Z ACY-PHE-PRO-ASP-VAL-SAR-LEU-HIS-ARG-TYR-TRP-GLY-TRP-ASP-CYS-GLY-NH2 GFPDVXLHRYWGWDCGX 17 T 1.3 DUF6172 pdbhh F T 7ukn 2 B B UL145_HCMVM H-Box Motif of pUL145 NAVQLLCARTRDG 13 T 0.71 DUF5500 unphh T Viruses T 7um2 3 C C SARS-CoV-2 Spike-derived peptide S417-425 K417T mutant (TIADYNYKL) TIADYNYKL 9 T 0.22 bCoV_S1_RBD pdbhh F T 7ung 5 I,J 8,9 CF107_HUMAN CILIA- AND FLAGELLA-ASSOCIATED PROTEIN 107 MFLTAVNPQPLSTPSWQIETKYSTKVLTGNWMEERRKFTRDTDKTPQSIYRKEYIPFPDHRPDQISRWYGKRKVEGLPYKHLITHHQEPPHRYLISTYDDHYNRHGYNPGLPPLRTWNGQKLLWLPEKSDFPLLAPPTNYGLYEQLKQRQLTPKAGLKQSTYTSSYPRPPLCAMSWREHAVPVPPHRLHPFPHF 194 T 0.14 DUF1143 pdbpercent F Eukaryota T 7ung 13 UB D SPAG8_HUMAN HSD-1,SPERM MEMBRANE PROTEIN 1,SMP-1,SPERM MEMBRANE PROTEIN BS-84 METNESTEGSRSRSRSLDIQPSSEGLGPTSEPFPSSDDSPRSALAAATAAAAAAASAAAATAAFTTAKAAALSTKTPAPCSEFMEPSSDPSLLGEPCAGPGFTHNIAHGSLGFEPVYVSCIAQDTCTTTDHSSNPGPVPGSSSGPVLGSSSGAGHGSGSGSGPGCGSVPGSGSGPGPGSGPGSGPGHGSGSHPGPASGPGPDTGPDSELSPCIPPGFRNLVADRVPNYTSWSQHCPWEPQKQPPWEFLQVLEPGARGLWKPPDIKGKLMVCYETLPRGQCLLYNWEEERATNHLDQVPSMQDGSESFFFRHGHRGLLTMQLKSPMPSSTTQKDSYQPPGNVYWPLRGKREAMLEMLLQHQICKEVQAEQEPTRKLFEVESVTHHDYRMELAQAGTPAPTKPHDYRQEQPETFWIQRAPQLPGVSNIRTLDTPFRKNCSFSTPVPLSLGKLLPYEPENYPYQLGEISSLPCPGGRLGGGGGRMTPF 485 T 0.027 PIP49_C pdbpssm F Eukaryota T 7ung 15 ED,QC F,E CF161_HUMAN Cilia- and flagella-associated protein 161 MAQNVYGPGVRIGNWNEDVYLEEELMKDFLEKRDKGKLLIQRSRRLKQNLLRPMQLSVTEDGYIHYGDKVMLVNPDDPDTEADVFLRGDLSLCMTPDEIQSHLKDELEVPCGLSAVQAKTPIGRNTFIILSVHRDATGQVLRYGQDFCLGITGGFDNKMLYLSSDHRTLLKSSKRSWLQEVYLTDEVSHVNCWQAAFPDPQLRLEYEGFPVPANAKILINHCHTNRGLAAHRHLFLSTYFGKEAEVVAHTYLDSHRVEKPRNHWMLVTGNPRDASSSMLDLPKPPTEDTRAMEQAMGLDTQ 301 T 0.028 zf-RING_5 pdbpssm F Eukaryota T 7ung 32 IP,JP,KP l,m,n FLTOP_HUMAN CILIA- AND FLAGELLA-ASSOCIATED PROTEIN 126 MATNYSANQYEKAFSSKYLQNWSPTKPTKESISSHEGYTQIIANDRGHLLPSVPRSKANPWGSFMGTWQMPLKIPPARVTLTSRTTAGAASLTKWIQKNPDLLKASNGLCPEILGKPHDPDSQKKLRKKSITKTVQQARSPTIIPSSPAANLNSPDELQSSHPSAGHTPGPQRPAKS 177 T 38 Scm3 pdbhh F Eukaryota T 7unh 1 A,B A,B SP2 designed chlorophyll dimer protein SSDEEFKFLATEAKMLITAAERLAGTDPELQEMVALIKKELEQAERTFRNGDKSEAQRQLEFVLTAARAVMNVAAAANAAGTDPELIEMVLRILKQLKEAIRTFQNGDQEEAETQLRFVLRAAIAVAVVAAALVLAGTDPELQEMVKQILEELKQAIETFARGDKEKALTQLLFVAWAAHAVAMIAAAANLAGTDPRLQQQVKEILEKLKEAIETFQKGDEEQAFRQLAEVLAEAALVALRAALTN 246 T 0.018 Cas_DxTHG pdb F T 7uni 1 A,B,C,D A,C,B,D SP2-ZnPPaM designed chlorophyll dimer protein SGSGSSDEEFKFLATEAKMLITAAERLAGTDPELQEMVALIKKELEQAERTFRNGDKSEAQRQLEFVLTAARAVMNVAAAANAAGTDPELIEMVLRILKQLKEAIRTFQNGDQEEAETQLRFVLRAAIAVAVVAAALVLAGTDPELQEMVKQILEELKQAIETFARGDKEKALTQLLFVAWAAHAVAMIAAAANLAGTDPRLQQQVKEILEKLKEAIETFQKGDEEQAFRQLAEVLAEAALVALRAALTN 250 T 0.015 Vps35 pdb F T 7unx 1 A A A0A8E4SKK8_MYXXA Xanthusin-1 NAPEFTQSVCERNSDCDHFCGEGFGHCIRGMYCACM 36 T 0.2 Gamma-thionin pdbhh F Bacteria T 7uny 1 A,B A,D Q8IM47_PLAF7 Cysteine-rich small secreted protein CSS GTQDEKSVKNICVCDFTDKLNFLPLEKTKILCELKPQYGEDIKIIANKEYEINCMNNSKVFCPLKDTFINNTNIKLYSPKLHFEIKDITHKGKNAALYYLKIDEEASDIFFSCSIKPKQVSGLLEGEVRVNLKKHINEEYSIFNEEEDVHVCDFSKGNLDITPSAGFYLKNSRNVSCIYRVIPNKLFLIKLPKLDIVTEKLLPSIVNCLSEFSFINFTLKHVQEGDNYISFNVIFGEFKKHFNLACSLDLSDFQQEPCNLGKTANITFIFSKLENLYFQGDYKDDDDKH 289 T 2 RnlA_toxin_N unppssm F Eukaryota T 7unz 1 A,B B,D Q8IM47_PLAF7 Cysteine-rich small secreted protein CSS, putative GTQDEKSVKNICVCDFTDKLNFLPLEKTKILCELKPQYGEDIKIIANKEYEINCMNNSKVFCPLKDTFINNTNIKLYSPKLHFEIKDITHKGKNAALYYLKIDEEASDIFFSCSIKPKQVSGLLEGEVRVNLKKHINEEYSIFNEEEDVHVCDFSKGNLDITPSAGFYLKNSRNVSCIYRVIPNKLFLIKLPKLDIVTEKLLPSIVNCLSEFSFINFTLKHVQEGDNYISFNVIFGEFKKHFNLACSLDLSDFQQEPCNLGKTANITFIFSKLENLYFQ 279 T 1.9 RnlA_toxin_N pdbpssm F Eukaryota T 7uoa 2 B B MTP-1 YIRLYDYHNC 10 T 2.7 TTR-52 pdbhh F T 7upo 1 A A DHT03 protein A GSSPEEEKLKELLKELKKVLDRLKKILERNDEEIKKSDELDDESLLEDIVELLKEIIKLWKILVELSDILLKLIS 75 T 0.01 DUF713 pdb F T 7upo 2 B B DHT03 protein B SSPVDEIDKEVKKLEEEAKKSQEEVERLKQEVEKASKAGLDHEGDSRIFKKIHDVVTKQIKVIIRLIEVYVRLVEIIL 78 T 0.0017 GAS pdb F T 7upo 3 C C DHT03 protein C GSKQKEAIKVYLELLEVHSRVLKALIEQIKLFIELIKRPDEDLADKVRKSSEELKKIIKEVEKILRKVDDILYKVKS 77 T 0.00071 ALIX_LYPXL_bnd pdb F T 7upp 1 A A DHT03 protein A SEKEKVEELAQRIREQLPDTELAREAQELADEARKSDDSEALKVVYLALRIVQQLPDTELAREALELAKEAVKSTDSEALKVVELALKIVQQLPDTELAKEALKLAKEAVKSTDSEALKVVELALEIVQQLPDTELAKEALELAEEAVKSTDSEALKVVKLALEIVQQLPDTELAREALELAKEAVKSTDSEALKVVYLALRIVQQLPDTELARLALELAKKAVEMTAQEVLEIARAALKAAQAFPNTELAELMLRLAEVAARVMKELERNDEEIKKSDELDDESLLEDIVELLKEIIKLWKILVEVSDVMLKLIS 316 T 0.0023 DCB pdb F T 7upp 2 B B DHT03 protein B SEKEKVEELAQRIREQLPDTELAREAQELADEARKSDDSEALKVVYLALRIVQQLPDTELAREALELAKEAVKSTDSEALKVVELALKIVQQLPDTELAKEALELAKEAVKSTDSEALKVVELALEIVQQLPDTELAKEALKLAKEAVKSTDSEALKVVYLALRIVQQLPDTELAREALELAKEAVKSTDSEQLEVVRLALEIVQLAPDTRLARAALKLAKEAVKSTDQEELKKVKAILRVASEVLKLEEEAKKSQEEVERLKQEVEKASKAGLDHEGDSRIFKKIHDVVTKQIKVILRLIAVYAELVAIIG 312 T 0.0023 DCB pdb F T 7upp 3 C C DHT03 protein C GSKQKEAIKVYLELLEVHSRVLKALIEQIKLFIELIMEPDEDLADKVRKSSEELKKIIKEVEKILRKVDDILEKVKS 77 T 0.0002 Anticodon_2 pdb F T 7upq 1 A,B,C,D A,D,G,J DHT03 protein A SEKEKVEELAQRIREQLPDTELAREAQELADEARKSDDSEALKVVYLALRIVQQLPDTELAREALELAKEAVKSTDSEALKVVYLALRIVQQLPDTELARLALELAKKAVEMTAQEVLEIARAALKAAQAFPNTELAELMLRLAEVAARVMKELERNDEEIKKSDELDDESLLEDIVELLKEIIKLWKILVEVSDVMLKLIS 202 T 0.0022 DCB pdb F T 7upq 2 E,F,G,H B,E,H,K DHT03 protein B GPVDEIDKEVKKLEEEAKKSQEEVERLKQEVEKASKAGLDHEGDSRIFKKIHDVVTKQIKVILRLIAVYAELVAIIG 77 T 0.0035 GAS pdb F T 7ups 1 A,B,C,D A,B,C,D DOTY_LEGPH DotY (Lpg0294) SNATRDALLKAMQVGETSIEAAEYMATRFEQILTKAKLLPECNDMLEKIKEYAQFVKFKLLSSAQVWSGQERPTSDYQNTQENKAEFLASHLEGLPSGLKLEVAIGDDAKILRGFSSNGKMVEGDQLKTMDGLLEGWLAKNSLAISGGAVVKIDNTGNQTKVDPQEIRQLINDSEKGVAKYFADKGVGMEVAQRTYQEPKALETKREEIRQEIES 215 T 0.019 GPW_gp25 unppercent F Bacteria T 7uq2 1 A,B,C,D,E,F A,B,C,D,E,F Y06G_BPT4 Vs.4 SMIEDIKGYKPHTEEKIGKVNAIKDAEVRLGLIFDALYDEFWEALDNCEDCEFAKNYAESLDQLTIAKTKLKEASMWACRAVFQPEEKY 89 T 0.057 DUF1631 unppssm T Viruses T 7ur1 3 C C SARS-CoV-2 Spike-derived peptide S1215-1224 (YIWLGFIAGL) YIWLGFIAGL 10 T 0.24 MtrB pdbhh F T 7ur6 1 A,E,I G,A,F C6G0D7_9HIV1 ENV POLYPROTEIN GPAENLWVTVYYGVPVWKEAKTTLFCASDAKAYEKEVHNVWATHACVPTDPNPQEMVLENVTENFNMWKNDMVDQMHEDVISLWDQSLKPCVKLTPLCVTLNCTNTTVSNGSSNSNANFEEMKNCSFNATTEIKDKKKNEYALFYKLDIVPLNNSSGKYRLINCNTSACTQICPKVTFEPIPIHYCAPAGYAILKCNNKTFNGTGPCNNVSTVQCTHGIKPVVSTQLLLNGSLAEKEIIIRSENLTNNAKTIIVHLNESVGIVCTRPSNMTRKSIRIGPGQTFYALGDIIGDIRQPHCNISKQNWNRTLQQVGRKLAEHFPNRNITFNHSSGGDLEITTHSFNCRGEFFYCNTSGLFNGTYHPNGTYNETAVNSSDTITLQCRIKQIINMWQEVGRCMYAPPIAGNITCNSNITGLLLTRDGGINQTGEEIFRPGGGDMRDNWRSELYKYKVVEIKPLGIAPTKCKRRVVERRRRRR 477 T 1.6E-54 GP120 pdbpssm T Viruses T 7ur7 1 A A 17_bp_sh3 MSEVKELLEEFLKRNKPVRIHHKNGEEIKVRITHIGEDTVEFELNGRTHRINIKDILDVKEWLEHHHHHH 70 T 0.016 DUF2642 pdbhh F T 7ur8 1 A A 170_h_ob MSGDRTRELKVIDYREYDNTVYFILRDGDKIYTIEVSPEEAKKLKPGDWVIVNEDGKLLHVQGSLEHHHHHH 72 T 0.0032 Prot_ATP_OB_N pdbhh F T 7urg 1 A,B A,B M1PRZ0_9CAUD Ribonucleotide reductase MSKPPKELIARTGRVQSWIDDPTSRLPVSCTVFVVEDTMEGENGIEASWRFVSHALRYGAGVAVHLSKLRPKGAENGKGLVASGPVSFAKIYSTLNEILRRGGVYKNGAVVCHLDLSHPDVLEFITASRSELPWVKRCVNINDHWWKEATPTVKNALLEGIKRGDIWLNKTKVDRNGNRIRGNVCLEVYLPSRGTCLLQHVNLGGCELDEIRGAFAQGMSELCELHGKTNVGESGEYLPSETDRQVGLGMLGLANLLRTQGVTYNDFGRALEALNSGRPYPSTPGYVIAQELKAGIQAAAEIAKANKMERAFAIAPTASCSYRYTDLDGYTTCPEIAPPIARQVDRDSGTFGVQSFDYGPVEIASEVGWESYKRVVDGIIRLLDSTGLLHGYSFNSWSDVVTYDEQFIEDWLASPQTSLYYSLQVMGDVQDKSDAYAALDDGDVTAYLESLLNDPVGASPPLAPDCNCGE 470 T 1.5E-35 Ribonuc_red_lgC pdbpercent T Viruses T 7urp 1 A A A0A2N0UYJ0_9FIRM Ribonucleases G and E AVDNLTINATSNICQANGSGTFNVGDKVSVYYLLDTKDAQLEEVQWALTYDKNLLTLDSLTMPEIADGMVNMDDVSGNASNLALYDFAGGKKLVEAVFTVNGTGTTNVDLNVVDLTLGKLNPATGTVDADSEYEAVVNGDMANDLFDHINSDAKVEAYVE 160 T 0.015 Cohesin unppssm F Bacteria T 7uso 3 E,F F,G Peptide Inhibitor AcITVKD-CHO XITVKD 6 T 91 Ribosomal_TL5_C pdbhh F T 7ust 2 B A P230_PLAF7 Gametocyte surface protein P230 GASTNKEYVCDFTDQLKPTESGPKVKKCEVKVNEPLIKVKIICPLKGSVEKLYDNIEYVPKKSPYVVLTKEETKLKEKLLSKLIYGLLISPTVNEKENNFKEGVIEFTLPPVVHKATVFYFICDNSKTEDDNKKGNRGIVEVYVEPYG 148 T 0.71 DUF2129 pdbpercent F Eukaryota T 7utd 1 A,C,E,G,I,K,M,O A,C,E,G,I,K,M,O A0QUM7_MYCS2 Hydrogenase-2, large subunit LDLFVSPLGRVEGDLDVRVTINDGVVTSAWTEAAMFRGFEIILRGKDPQAGLIVCPRICGICGGSHLYKSAYALDTAWRTHMPPNATLIRNICQACETLQSIPRYFYALFAIDLTNKNYAKSKLYDEAVRRFAPYVGTSYQPGVVLSAKPVEVYAIFGGQWPXSSFMVPGGVMSAPTLSDVTRAIAILEHWNDNWLEKQWLGCSVDRWLENKTWNDVLAWVDENESQYNSDCGFFIRYCLDVGLDKYGQGVGNYLATGTYFEPSLYENPTIEGRNAALIGRSGVFADGRYFEFDQANVTEDVTHSFYEGNRPLHPFEGETIPVNPEDGRRQGKYSWAKSPRYAVPGLGNVPLETGPLARRMAASAPDAETHQDDDPLFADIYNAIGPSVMVRQLARMHEGPKYYKWVRQWLDDLELKESFYTKPVEYAEGKGFGSTEAARGALSDWIVIEDSKIKNYQVVTPTAWNIGPRDASEVLGPIEQALVGSPIVDAEDPVELGHVARSFDSCLVCTVH 513 T 4.1E-19 NiFeSe_Hases pdb F Bacteria T 7utd 3 Q,R,S,T Q,R,S,T A0QUM5_MYCS2 Type 2 [NiFe]-hydrogenase Huc membrane adapter subunit SPVDGIRRRLDDPQVAEALNSLLDHADLLAVLVKGLDGFVRRGDDIANNLTSAIGELKAL 60 T 0.12 CompInhib_SCIN pdb F Bacteria T 7utj 2 G,H,I,J,K,L G,H,I,K,L,Z CTNA1_HUMAN ALPHA E-CATENIN,CADHERIN-ASSOCIATED PROTEIN,RENAL CARCINOMA ANTIGEN NY-REN-13 GPHMTLAVERLLEPLVTQVTTLVNTNSKGPSNKKRGRSKKAHVLAASVEQATENFLEKGDKIAKESQFLKEELVAAVEDVRKQGDLMKAAAGEFADDPCSSVKRGNMVRAARALLSAVTRLLILADMADVYKLLVQLKVVEDGILKLRNAGNEQDLGIQYKALKPEVDKLNIMAAKRQQELKDVGHRDQMAAARGILQKNVPILYTASQACLQHPDVAAYKANRDLIYKQLQQAVTGISNAAQATASDDASQHQGGGGGELAYALNNFDKQIIVDPLSFSEERFRPSLEERLESIISGAALMADSSCTRDDRRERIVAECNAVRQALQDLLSEYMGNAGRKERSDALNSAIDKMTKKTRDLRRQLRKAVMDHVSDSFLETNVPLLVLIEAAKNGNEKEVKEYAQVFREHANKLIEVANLACSISNNEEGVKLVRMSASQLEALCPQVINAALALAAKPQSKLAQENMDLFKEQWEKQVRVLTDAVDDITSIDDFLAVSENHILEDVNKCVIALQEKDVDGLDRTAGAIRGRAARVIHVVTSEMDNYEPGVYTEKVLEATKLLSNTVMPRFTEQVEAAVEALSSDPAQPMDENEFIDASRLVYDGIRDIRKAVLMIRTPEELDDSDFETEDFDVRSRTSVQTEDDQLIAGQSARAIMAQLPQEQKAKIAEQVASFQEEKSKLDAEVSKWDDSGNDIIVLAKQMCMIMMEMTDFTRGKGPLKNTSDVISAAKKIAEAGSRMDKLGRTIADHCPDSACKQDLLAYLQRIALYCHQLNICSKVKAEVQNLGGELVVSGVDSAMSLIQAAKNLMNAVVQTVKASYVASTKYQKSQGMASLNLPAVSWKMKAPEKKPLVKREKQDETQTKIKRASQKKHVNPVQALSEFKAMDSI 889 T 2.9E-97 Vinculin unp F Eukaryota T 7uus 1 A,C,E,G,I,K,M,O A,C,E,G,I,K,M,O A0QUM7_MYCS2 Hydrogenase-2, large subunit TELDLFVSPLGRVEGDLDVRVTINDGVVTSAWTEAAMFRGFEIILRGKDPQAGLIVCPRICGICGGSHLYKSAYALDTAWRTHMPPNATLIRNICQACETLQSIPRYFYALFAIDLTNKNYAKSKLYDEAVRRFAPYVGTSYQPGVVLSAKPVEVYAIFGGQWPXSSFMVPGGVMSAPTLSDVTRAIAILEHWNDNWLEKQWLGCSVDRWLENKTWNDVLAWVDENESQYNSDCGFFIRYCLDVGLDKYGQGVGNYLATGTYFEPSLYENPTIEGRNAALIGRSGVFADGRYFEFDQANVTEDVTHSFYEGNRPLHPFEGETIPVNPEDGRRQGKYSWAKSPRYAVPGLGNVPLETGPLARRMAASAPDAETHQDDDPLFADIYNAIGPSVMVRQLARMHEGPKYYKWVRQWLDDLELKESFYTKPVEYAEGKGFGSTEAARGALSDWIVIEDSKIKNYQVVTPTAWNIGPRDASEVLGPIEQALVGSPIVDAEDPVELGHVARSFDSCLVCTVH 515 T 4.1E-19 NiFeSe_Hases pdb F Bacteria T 7uv1 1 A A Q8L5L6_ANAOC Vicilin-like protein GLGFALAKIDPELKQCKHQCKVQRQYDEQQKEQCVKECEKYYKEKKGREREHEHRD 56 T 0.00024 Vicilin_N pdb F Eukaryota T 7uv2 1 A A Q8L5L5_ANAOC Vicilin-like protein GVDEPSTHEPAEKHLSQCMRQCERQEGGQQKQLCRFRCQERYKKERGQHNYKREDD 56 T 0.0011 Vicilin_N pdb F Eukaryota T 7uv3 1 A A VCL_PISVE 7S GLOBULIN,7S SEED STORAGE PROTEIN,7S VICILIN-LIKE PROTEIN PIS V 3,VICILIN PIS V 3 GKTDPELKQCKHQCKVQRQYDEEQKEQCAKGCEKYYKEKKGREQEELE 48 T 0.0024 Vicilin_N pdb F Eukaryota T 7uva 2 B,E B,E KDM2A_MOUSE F-BOX AND LEUCINE-RICH REPEAT PROTEIN 11,F-BOX/LRR-REPEAT PROTEIN 11,JMJC DOMAIN-CONTAINING HISTONE DEMETHYLATION PROTEIN 1A,[HISTONE-H3]-LYSINE-36 DEMETHYLASE 1A MQVHLTHFELEGLRCLVDKLESLPLHKKCVPTGIEDEDALIADVKILLEELASSDPKLALTGVPIVQWP 69 T 0.0031 JHD pdbhh F Eukaryota T 7uve 2 B B B4I1C5_DROSE peptide H3K9me2K14ac TKQTARKSTGGXAPRKQ 17 T 230 WW pdbhh F Eukaryota T 7uvg 1 A A Coh5 HHHHHHENLYFQGVTATSNLFPEKQVTLSADKKTVKVTYMFQSKDKDMLDFQWDMNYDANVLKPTANTTRAKSFEYPKIGSYVWNSLPGVIKANGNTLSLYDTTSKEIVFASAEFEVIDPEATATTVNLDVQVLRLSKVDPATDMEIGDEEVSVADKSIVDQEVFDKYVVANNTVTDPDGSEE 183 T 0.0095 Cohesin pdbpercent F T 7uvh 3 C,F C,F P230_PLAF7 Gametocyte surface protein P230 VGVDELDKIDLSYETTESGDTAVSEDSYDKYASQNTNKEYVCDFTDQLKPTESGPKVKKCEVKVNEPLIKVKIICPLKGSVEKLYDNIEYVPKKSPYVVLTKEETKLKEKLLSKLIYGLLISPTVNEKENNFKEGVIEFTLPPVVHKATVFYFICDNSKTEDDNKKGNRGIVEVYVEPYGGSLKENLYFQGWSHPQFEK 199 T 0.12 tRNA_edit pdbpercent F Eukaryota T 7uwi 2 B F Helicon Polypeptide FP01567 ATHRCEWAALHCELVX 16 T 11 Stonin2_N pdbhh F T 7uwo 2 B B Helicon Polypeptide FP05874 PAVMECYEAAFICHYV 16 T 3.1 DUF6117 pdbhh F T 7uwy 1 A A De novo designed small beta-barrel protein 29_bp_sh3 SEVETVLRKAAERNKTVDIHTKSGTTVRVNVKRVDSKSVKVERNGQDLEISLDQITHVDGW 61 T 0.0011 ROF pdbhh F T 7uwz 1 A A De novo designed small beta-barrel protein 33_bp_sh3 MDGFDRGADVTYTDSDGSKKTYKVLSYSGDKVTVQDSDGRTLTFDARLLRVKKWLEHHHHHH 62 T 0.012 DUF2835 pdb F T 7ux5 2 C,D,F,H,J,L B,D,F,H,J,L Helicon FP28136 DPALWQCVFAARYCYEE 17 T 0.27 Zea_mays_MuDR pdbhh F T 7uxe 1 A,B,C,D,E,F,G,H,I,J A,B,C,D,E,F,G,H,I,J A0A2K8I4H6_9CAUD Small terminase MTKFYSPDDLVTPQEFADPHFAAINQKRFDLYIDLRVQGYSSWRVFRAIWGEEHMDGPAQARIFAMESNPYYRKQFKAKLNATKTSDLWNPKTALHELLQMVRDPTVKDSSRLSAIKELNVLAEITFVDESGKTRIGRGLADFYASEAEAQTATVAAAAEANSYVPEGEEGDFPSPTPEPTEEDRANPI 189 T 0.047 DUF2992 pdbpssm T Viruses T 7uxi 2 B B FP19711 DPAWWVCAIAAIECSDV 17 T 3 Ytca pdbhh F T 7uxj 2 E,F,G,H E,F,G,H FP29102 XPECHIEAYWCI 12 T 2.6 DUF6390 pdbhh F T 7uxk 2 B B FP24322 XFECLDAFFSC 11 T 1.9 Mif2_N pdbhh F T 7uxm 2 D,E,F D,E,F FP29092 XDPANQDCHVAAWHCWQR 18 T 5.3 Phage_Cox pdbhh F T 7uxn 2 B B FP29103 XPDCHIRAYVCH 12 T 2.8 DUF3051 pdbhh F T 7uxo 2 B B FP30790 XDPAAADCQWAAFLCRVYX 19 T 9.6 Poty_PP pdbhh F T 7uxp 2 C,D C,D FP28132 XDPALWQCVFAARSCYEE 18 T 0.37 Zea_mays_MuDR pdbhh F T 7uxq 2 C,D C,D FP28135 XDPALWMCVFAARQCYESX 19 T 2.2 TMEM220 pdbhh F T 7uy2 2 C,D C,D Helicon FP06649 FTDCQLAAAVCMTY 14 T 12 ODAPH pdbhh F T 7uy5 9 I I TAP75_TETTS P75 MEIEEDLNLKILEDVKKLYLQSFDYIKNGISSSLPSDKKFLADDDIDLSRITFLYKFISVNPTLLLINEKTQAKRRIFQGEYLYGKKKIQFNIIAKNLEIERELIQFFKKPYQCYIMHNVQVFQMLNKNKNNNVVEFMDSEDLQSSVDCQLYYLIDESSHVLEDDSMDFISTLTRLSDSFNSNEFVFETNYSIQISQMPKPLNTTHFKLLQPKVVNSFEGVILQVQEGKNILQIEELIDQVYLNSRRDRFYILKVANGKNYMDFIEVYLVYDNEDQEAKQQLQFYLKPFQRILIFQSLKHFTKNLKLFMISFFYSSGVQPNNSNVKNFLVSHKGVEFFSRFDIQKNELLCKDLIKSYNKLPLSNISKLLEDEGVMIRSNMKFQVRVKKVKYFKIRLNCLNCKQEWTVGLKNCINCKGQQSYISYNIQVLVQDQHFLEQQAYIYLYDDLAAQFFNITESEKKELHLHLTKNETFIQLYYSFNKDYPLSIIKFKDKIFNKDITNCIVAYPFADIDNKIFNSQQQIIQDENLRIESEKFIQNFTEDNNLQESKLYYEKFKSKNKQQIFVNGTYISTNYSQGQKICLKPIPCLKVMYVFPQEDIKLSALKIIEEINQLKIQIDQLN 622 T 0.08 CDC24_OB3 pdbhh F Eukaryota T 7uy5 10 J K TAP19_TETTS P19 MQQPKRNFDLYKLITDKQIDFQVADLIQDEQSSFVSVRIYGQFKCFVPKSTIQEQLDKIKNLSSKELAKNKIFKFLSEYNKNNQKQDELSHDYYGYFKVQQHQFILNLENAQREASLAVDDFYFINGRIYKTNHDILILQAHHVYQMQKPTLQLLQAASEINQN 164 T 0.53 TMF_DNA_bd pdb F Eukaryota T 7uy5 11 K J TAP45_TETTS P45 MEDNFELVFLKELPSLPDFSKVCFTGLILSFSNFPSSEQNQQKDVPHKIAIIQDSTGEAELFLDMYKFCQEEISVFKAITGIGVLKKKNIGAGQVCKIIVERFRIIHSADEEMLQYLLIQKYKLSKTLNEQQQIKQKEQQINQQKIDKVVQDKESKEHLLWKQQQIPQIKSNQENINTLKYKELIAGELMRITHKLLIQKLQQQQPANNNKQINEMDVESNELAEKKEVIIKIQEIAKDQQLYDTLSIQYQVDQKEQYYAKIAQSLEDFVSISALKMVSYIYPNISYQVSIGFFQNILDIATKTVKDRGALGCNYKYLKDKLTKALNLQQISYPLISESYISYLVHLFQDFNIIEIENEHKFYYKQAFQYDDS 373 T 12 Ten1 pdbhh F Eukaryota T 7uyg 1 A A Q5ZTB4_LEGPH LotA GPMAKTIKATGDGACLFNAVSIGLSVEILSGRLDSQLDTPGYQALLDEFAKHHPQFNPKSWKTLKEWLAYYNDTRDIELILAPVLFNLNQKYQDHLDEEILNELTNLVWKNKANIENGQAWFQLQNTGDLGEALFPKLENLDLKKDRAPLLDKLREILKDYKLELTRENVKQFLTEKAKELLSALKKKISSDPHAFQRGYSCDELKGMTDALAISLVENREEDITDNRIKIRLENQEEHWNVLCNEEDSERFLDSTPSRLKMTSLEAYRGDKQVSAPTSEQLDLIEEPGVFLRERT 296 T 0.066 OTU pdbhh F Bacteria T 7uyh 2 B A Q5ZTB4_LEGPH LotA GPMAKTIKATGDGAALFNAVSIGLSVEILSGRLDSQLDTPGYQALLDEFAKHHPQFNPKSWKTLKEWLAYYNDTRDIELILAPVLFNLNQKYQDHLDEEILNELTNLVWKNKANIENGQAWFQLQNTGDLGEALFPKLENLDLKKDRAPLLDKLREILKDYKLELTRENVKQFLTEKAKELLSALKKKISSDPHAFQRGYSCDELKGMTDALAISLVENREEDITDNRIKIRLENQEEHWNVLCNEEDSERFLDSTPSRLKMTSLEAYRGDKQVSAPT 278 T 0.093 OTU pdbhh F Bacteria T 7uyj 2 C,D C,D Helicon FP06652 DPAIVQCAWAALYCDMQ 17 T 1.6 UPF0139 pdbhh F T 7uyk 2 C,D D,C Helicon FP06655 DPAMQRCFSAAVYCAIS 17 T 1.3 CCSAP pdbhh F T 7uyx 1 A,B,C,D A,B,C,D A0A4D6BFJ2_9CAUD Bacteriophage PA1C gp2 SNAMTAVNYPFVDTMDKFDKITKGLIFEHQAEGESETMISHELSILDNDGVVHSLHFSQITSLIDTITGKHPSLELPPQLFLITQYLLEDLKEVGEKGFVITEYFIDVLPTGNKAIFRGTLAHKSTVDGHPDFDPSSTISKKEFEFSLNQFSILQQIALSHCIANLHEECAGFRGTFDVEYTFHWTPFAFNVKFSE 196 T 0.03 HTH_5 unppssm T Viruses T 7uz1 1 A,B A,B A0A0E3K5E4_SACSO GLYCOSIDE HYDROLASE FAMILY 1 PROTEIN MYSFPNSFRFGWSQAGFQSEMGTPGSEDPNTDWYKWVHDPENMAAGLVSGDLPENGPGYWGNYKTFHDNAQKMGLKIARLNVEWSRIFPNPLPRPQNFDESKQDVTEVEINENELKRLDEYANKDALNHYREIFKDLKSRGLYFILNMYHWPLPLWLHDPIRVRRGDFTGPSGWLSTRTVYEFARFSAYIAWKFDDLVDEYSTMNEPNVVGGLGYVGVKSGFPPGYLSFELSRRHMYNIIQAHARAYDGIKSVSKKPVGIIYANSSFQPLTDKDMEAVEMAENDNRWWFFDAIIRGEITRGNEKIVRDDLKGRLDWIGVNYYTRTVVKRTEKGYVSLGGYGHGCERNSVSLAGLPTSDFGWEFFPEGLYDVLTKYWNRYHLYMYVTENGIADDADYQRPYYLVSHVYQVHRAINSGADVRGYLHWSLADNYEWASGFSMRFGLLKVDYNTKRLYWRPSALVYREIATNGAITDEIEHLNSVPPVKPLRH 489 T 1.3E-42 Glyco_hydro_1 unppercent F Archaea T 7v0e 1 A,B,C,D,E,F,G,H A,B,C,D,E,F,G,H DNM3B_HUMAN DNMT3B,DNA METHYLTRANSFERASE HSAIIIB,DNA MTASE HSAIIIB,M.HSAIIIB AARRRPIRVLSLFDGIATGYLVLKELGIKVGKYVASEVCEESIAVGTVKHEGNIKYVNDVRNITKKNIEEWGPFDLVIGGSPCNDLSNVNPARKGLYEGTGRLFFEFYHLLNYSRPKEGDDRPFFWMFENVVAMKVGDKRDISRFLECNPVMIDAIKVSAAHRARYFWGNLPGMNRPVIASKNDKLELQDCLEYNRIAKLKKVQTITTKSNSIKQGKNQLFPVVMNGKEDVLWCTELERIFGFPVHYTDVSNMGRGARQKLLGRSWSVPVIRHLFAPLKDYFACE 285 T 6.1E-16 DNA_methylase unppercent F Eukaryota T 7v1a 2 B B ASP-ILE-ASP-GLN-MET-PHE-SER-THR-LEU-LEU-GLY-GLU-MK8-ASP-LEU-LEU-MK8-GLN-SER DIDQMFSTLLGEXDLLXQS 19 T 4.8 Caskin-tail pdbhh F T 7v4w 3 C C MUC1_HUMAN MUC1-NT,MUC1-ALPHA RPAPGSTAPPAHG 13 T 43 TOH_N pdbhh F Eukaryota T 7v5e 1 A A ILE-GLN-CYS-CYS-ARG-CYS-GLN-SER-TRP-PRO-TYR-MET-CYS-SER-VAL-PHE-CYS-CYS IQCCRCQSWPYMCSVFCC 18 T 0.32 zf-CW pdbhh F T 7v5f 1 A A Wisotide ADCTEYCSNSCPFCNGQPLYQLCCINNCCPS 31 T 0.11 Radical_SAM_2 pdbhh F T 7v8o 1 A,B A,B A0A1L1QK40_9PSEU Cyclohexanone Monooxygenase from Thermocrispum municipale MSTTQTPDLDAIVIGAGFGGIYMLHKLRNDLGLSVRVFEKGGGVGGTWYWNKYPGAKSDTEGFVYRYSFDKELLREYDWTTRYLDQPDVLAYLEHVVERYDLARDIQLNTEVTDAIFDEETELWRVTTAGGETLTARFLVTALGLLSRSNIPDIPGRDSFAGRLVHTNAWPEDLDITGKRVGVIGTGSTGTQFIVAAAKMAEQLTVFQRTPQYCVPSGNGPMDPDEVARIKQNFDSIWDQVRSSTVAFGFEESTVEAMSVSESERQRVFQQAWDKGNGFRFMFGTFCDIATNPEANAAAAAFIRSKIAEIVKDPETARKLTPTDLYAKRPLCNEGYYETYNRDNVSLVSLKETPIEEIVPQGVRTSDGVVHELDVLVFATGFDAVDGNYRAMNLRGRDGRHINEHWTEGPTSYLGVTKAGFPNMFMILGPNGPFTNTPPSIEAQVEWISDLIDKATREGLTTVEPTADAEREWTETCAEIANMTLFPKADSWIFGANIPGKRHAVMFYLGGLGNYRRQLADVADGGYRGFQLRGERAQAVA 541 T 0.11 Pyr_redox_2 unppercent F Bacteria T 7v93 1 A A cas12c2 MKIEEGKGHHHHHHMTKHSIPLHAFRNSGADARKWKGRIALLAKRGKETMRTLQFPLEMSEPEAAAINTTPFAVAYNAIEGTGKGTLFDYWAKLHLAGFRFFPSGGAATIFRQQAVFEDASWNAAFCQQSGKDWPWLVPSKLYERFTKAPREVAKKDGSKKSIEFTQENVANESHVSLVGASITDKTPEDQKEFFLKMAGALAEKFDSWKSANEDRIVAMKVIDEFLKSEGLHLPSLENIAVKCSVETKPDNATVAWHDAPMSGVQNLAIGVFATCASRIDNIYDLNGGKLSKLIQESATTPNVTALSWLFGKGLEYFRTTDIDTIMQDFNIPASAKESIKPLVESAQAIPTMTVLGKKNYAPFRPNFGGKIDSWIANYASRLMLLNDILEQIEPGFELPQALLDNETLMSGIDMTGDELKELIEAVYAWVDAAKQGLATLLGRGGNVDDAVQTFEQFSAMMDTLNGTLNTISARYVRAVEMAGKDEARLEKLIECKFDIPKWCKSVPKLVGISGGLPKVEEEIKVMNAAFKDVRARMFVRFEEIAAYVASKGAGMDVYDALEKRELEQIKKLKSAVPERAHIQAYRAVLHRIGRAVQNCSEKTKQLFSSKVIEMGVFKNPSHLNNFIFNQKGAIYRSPFDRSRHAPYQLHADKLLKNDWLELLAEISATLMASESTEQMEDALRLERTRLQLQLSGLPDWEYPASLAKPDIEVEIQTALKMQLAKDTVTSDVLQRAFNLYSSVLSGLTFKLLRRSFSLKMRFSVADTTQLIYVPKVCDWAIPKQYLQAEGEIGIAARVVTESSPAKMVTEVEMKEPKALGHFMQQAPHDWYFDASLGGTQVAGRIVEKGKEVGKERKLVGYRMRGNSAYKTVLDKSLVGNTELSQCSMIIEIPYTQTVDADFRAQVQAGLPKVSINLPVKETITASNKDEQMLFDRFVAIDLGERGLGYAVFDAKTLELQESGHRPIKAITNLLNRTHHYEQRPNQRQKFQAKFNVNLSELRENTVGDVCHQINRICAYYNAFPVLEYMVPDRLDKQLKSVYESVTNRYIWSSTDAHKSARVQFWLGGETWEHPYLKSAKDKKPLVLSPGRGASGKGTSQTCSCCGRNPFDLIKDMKPRAKIAVVDGKAKLENSELKLFERNLESKDDMLARRHRNERAGMEQPLTPGNYTVDEIKALLRANLRRAPKNRRTKDTTVSEYHCVFSDCGKTMHADENAAVNIGGKFIADIEK 1232 T 3.3E-05 RuvC_1 pdbhh F T 7v9b 2 B B FOXO3_HUMAN ARG-ARG-ARG-ALA-VAL-SEP-MET-ASP-ASN-SER-ASN RRRAVSMDNSN 11 T 18 Carla_C4 pdbhh F Eukaryota T 7v9x 2 B C RIB86_ECOLX retron St85 family effector protein MNKKFTDEQQQQLIGHLTKKGFYRGANIKITIFLCGGDVANHQSWRHQLSQFLAKFSDVDIFYPEDLFDDLLAGQGQHSLLSLENILAEAVDVIILFPESPGSFTELGAFSNNENLRRKLICIQDAKFKSKRSFINYGPVRLLRKFNSKSVLRCSSNELKEMCDSSIDVARKLRLYKKLMASIKKVRKENKVSKDIGNILYAERFLLPCIYLLDSVNYRTLCELAFKAIKQDDVLSKIIVRSVVSRLINERKILQMTDGYQVTALGASYVRSVFDRKTLDRLRLEIMNFENRRKSTFNYDKIPYAHP 307 T 0.05 Stork_head pdb F Bacteria T 7vbl 1 A Q F1S1A8_PIG NADH-UBIQUINONE OXIDOREDUCTASE 49 KDA SUBUNIT,COMPLEX I-49KD ARQWQPDVEWAEQFGGAVMYPTKETAHWKPPPWNDVDPPKDTLV 44 T 13 eIF_4G1 pdbhh F Eukaryota T 7vbp 1 A Q A0A4X1VKC6_PIG NADH-UBIQUINONE OXIDOREDUCTASE 49 KDA SUBUNIT,COMPLEX I-49KD ARQWQPDVEWAEQFGGAVMYPTKETAHWKPPPWNDVDPPK 40 T 14 eIF_4G1 pdbhh F Eukaryota T 7vcf 1 A A YCF78_CHLRE UNCHARACTERIZED MEMBRANE PROTEIN YCF78 MITFTFMSLVTSVKDYVEITHKLIEIEPLKNYTEFGAVFTYFIFSIGEFFKNFFSFSFLNNIWSIPIIIPDIASAMISEVSVLDGYFHNAFTFLETSVNTTTNPSLVIFEKFVIGIINSLFLILPTSTSHLITLRRFVMQGLEAGYMAGLGTLAGNFLWLASIILGWRFFVIPWLSLDIFRYLLGFVLLVKYIWDSSKERRMALEDLSKWKIFLLNFLLALTEQSCIYPFISNLSFGPDASILEGFPVDNYPQFLLIHGAYLLGILFGSFSLLQFTCWFWENPAFSIYLWITTKSSLKISTSSYYKILNFTFLYATMLCAIASIPYYGLDYTITNPIGLVPQDRILNQKKSQSDPDKLITETAFLNLNPTDKNSRIRDGVHARRERWKQRLIKYQAFDASTYDQGVYDFLTIEDLNYGFDRFWLRRKMRNHQIRFRLFPGPWMRSLKKQLNNPANPSLETSTKAASGPRVEFFRILFEQFYHPNFHDRAAMQTNPAEARNKFISTSPLASTESKKALNSTFSLGNINNSSTGIEGLVLTNTQATLLPTDLQTKRTIKPGLIYTNSALRKFVRNVNTRLNLKLLNSKETNLTTKYKSQFIYSKRWKSIFSKIQPLQNGTTRKSYQLFRNVAKQILVTPDAKSLKLITINQKLSLKERKLLELRTQYNNNSTLTTTAPLTLVRPLNVYLQKEEAFKRKLRYYGTMPMRKLTVGNQAPYFKALMKRGFYYYKPTLRWRKTLYVASLRRGFRKKSRKQRILVMPSNQQNFNNTLDNTKTNINQNNLANPLGGNEVPMYGADGENSLITKPTHSYTVLGKRASRYRHQIYKDVLQHWYYTPFNRLLMKFDVDAFINRQPKSHFLTKNEERALHIRRFLLSEHYDTLRWYTYMQHYKTMKTNIGGTKSFANRAYNQQFQGTFKKIRHLFAITPKQGDFYTLKFDQPLYNDNKLKDNLYFHEELLTDYYNGTNLQTNQTSNISVNSTTTFIDNSLRTTQLPVPSSSFDIVNQSSTLIGLTTMQNALRKNVVESTLTSLNSDGEAATSQPKLNFVYSELFVKLIKECKKRIHDQTFLKNYITHRIEKREQLNQEQTKELNKRLEKLKVWLNSDKGSISKLQNTPVQDPNISSPDKVLTTAMQKAVNESISLSGIMPSDKIKTTYGNLTNAYTIKTENAILTKLNVINQLTNNETTTQKNTLIKSIGVNKIQTVLQTIITNFKSSLYNQTQLLRVKTDKDLQWWRTKQRVITKRKSARKRDRFKKQIAVVNKKLAALSKKVETEKSNLYQTLYGNYEISDYLLRNVPTGSSAVIDSTVLRKKQDNQAYLPKETNNVQFNSFVDSNNNVWQTFFAKKLRKKISSKGRRYRSLSLARYLTATRKPRLVGLDNLTKIDNITTLQGAFITKEEKQDSLNLTIQRKQELTNSLKKSQIKKRSRHSWKKRSRHQFSRNHYKYRKRHTHGNGKLRVMNKKLKKFKATNELRQWWWNSFLPRYLSNLQVNNSTLTNKNVSFKPLSNTNSVPSTNMASPTTSRNLLDNLNSSNQISTSASMNQNIVTESVKVETNQVYLPEGEKSFDITSMTTTLPFYAGWDESLKKFVVTNRLLSRRDAGLSVNNNPQEINFTNPPIQGLNEGSFLYWQTEMPFNSYNIDQFITTNQSFYAPLGWRRFEFRHSILKTWVNNTKAGNNNIKKKTLIISLKNLQPLKSSQQKQNQIKTKKLVARRIKKRYKLLKQMPNQLMYSPTGPLLTEVLPSHYISVFDQQYRLPRNRYLKRNPLKTLKKTTLLALMDSSKQTNGVNKEFTLRKRVKPRRKYHRKRFIKKDGLIFPRRTKFNTNTTLTGNALITNNVNSIEEDDLRWRPSSRTKQKRKDNTRSSAASKTKSNKRVKTNPLRLRQLRRREFQQVLKPLQRYIPQNGGFTWPGDYLRLEIVEMPKLKSINIKKTSLKQKINVQPVGIMPRKYLIEKHNIKVLKKKLSQAYSTQQLTKVVQEYKNLIQNSPPAI 1995 T 2E-05 Ycf1 pdbhh F Eukaryota T 7vcf 3 C C A0A2K3D4W3_CHLRE Toc52 MADGPSPIRIVLWNDGGESLAAGVEDEEQQQVLHSFADLVGSAIDAVLELPQFRHVEAVTAEAEEDEPGLSIGFDAGSGDGEVDIDNLKGRLDIAGLLLGSAQLPEELAEVAAVEVTDEEEGTTELQFTDEGLVQQLQAVVKRAKLEKRYNDWVAGVAESLGPALDAAAGGVEVTEMPVDPYDVLQAVVAQLIRVAGVSPPAPSLFSRTGALVGGVLGAPRSAVRQVTKRLGRAQRLWWRLEDVVVDGSKLALRLAVKAARPVLVGFVLHRVLKTLDRSRQLEYRLARMGPEEAREAYYEAVLGKDWKQQLQADWDKALEDVDAGLVTDEINHEKRLMTAAQLRRLEVEEWDKQRMKNFYLASFGGLRWFDQMEQALHNPLFIESRGWTDPVQNWVGQNRTYMDDLPAGQYMAGVGNAAIRIKEAELKRKLTDVERAHVLARGGAVAGGLLPQQPTDPATLAVAVGGAFVPSVAGKR 477 T 0.46 AXH pdbpssm F Eukaryota T 7vcf 4 D D A8J5D4_CHLRE Tic13 MSSDVQAKLSGLLGDIGVKCTLAFAGTVAAGAAIVVPSGKQVEAASLDIYGRPPSQLLPNERRAAEFAAGHRRWKGFVDNSIYSWTRTLPGHDNPIVNPYKGPRRPQRPQQKLEEEVEAAAKQE 124 T 3.9 DUF6460 pdbhh F Eukaryota T 7vcf 8 H I A8J6H7_CHLRE Toc39 MGASQESELDFVPRLSFLPIEWRSIGSAFGLKDKSGAAANGRATFTVRQGVDAAELTSTGRVIDGQADVGASLKLNTLAIGVSASNITFHSGLDDPTAAAAQRSSLIPSLKLTAAKQFKRDNYIAVSYDLKHQKPELSACWTGEAGADRATLLVNVDPVMRSVKLAAAVRTPGPEWRKVLYNDETDLLEYPADDGARHTLYVQHEVRGRDLLHATRLGCRLDLGRLVNYVVDFVDYRIEENIPSFVWNVPLLPQLYSLLVPADNDEQVRHRITGWELDVSHDFARSGLLPVVAISKTSKKLLGGGTLTASYDAAAREAGVSLSRKGVSVGARVARAEGAAGGLSAGWGRPSIHVAVEPLGLLQ 363 T 1 Thyroglobulin_1 pdbpssm F Eukaryota T 7vcf 9 I K A0A2K3E4D9_CHLRE Toc10 MKLVKTVSKLAGAAVGMLPAGQAGLAVKVALGVAFAFWWTSGPGADEEMDAKAQQEPDRRSQYTRHYAFKGRGRKEFLRSDMKNDANELVPTRGAAGL 98 T 0.13 DUF6479 pdbpercent F Eukaryota T 7vcf 10 J M A8J1J3_CHLRE Tic12 MDEEPPFNLALNVYKGPASIPHASAEVFGAFFLATNTALLAHMFPGKLFGSELHVRKWDPDYLASCCNEQGMRREALSGKKPNLWLLGGGPRLVNDSWERMWWNNLHWKRWKVPRTGPAFPQDMYWQ 127 T 17 TetR_C_18 pdbhh F Eukaryota T 7vcf 11 K N Unknown fragment KFIFWAAMVYATLYGNYE 18 T 2.7 FixP_N pdbhh F T 7vcf 12 L O A0A2K3DWN5_CHLRE Tic35 MQLGQLRQPLRACQDQRLTRGVPLARRQLVVVSNWNPLGGKGGGNSKDKEDAARRALEQSLGQKKFGADASKKTPAAKPAEPSKPAGEDASKNPLQNLFGGGGPKPPAGGGGGGGGDGGGGFFSGGNAEQPGGEEPIQDELLKLLRGGWVLLSNLALFLVFSSFLHRSLNWFVQTELLVAVGAPQQAGERVVGKFFEAIEWVERNILGWKLPGDEEAEDATSKVYEVLQNYTPAEAAYSFAQLKYKDLTHKERELFHKAYALRHFERRDGRPGDVDAAELQAVKDRLDPLEADRRAYAAAKAAGRLDEYWAAPGREATYQRIVGAPRIA 329 T 0.016 DUF2878 pdbpercent F Eukaryota T 7vcl 2 B B BKRF4_EBVB9 Tegument protein BKRF4 GLPGSDTDESDYSDEDEEIDLEEEYPSDEDPSEGSDSDPSWHPSDSDESDYSESDEDEATPGSQASRSSR 70 T 0.033 Nop14 unppercent T Viruses T 7vcn 1 A,C C,D D4FSQ3_STROR PITA EPQTTLHKTITPISGQDDKYELSLDITSKL 30 T 0.13 PA-IL unppssm F Bacteria T 7vcq 4 J K BKRF4_EBVB9 Tegument protein BKRF4 GPLGSDTDESDYSDEDEEIDLEEEYPSDEDPSEGSDSDPSWHPSDSDESDYSESDEDEATPGSQASRSSR 70 T 0.048 Nop14 unp T Viruses T 7vd5 21 RA,U w,W A0A679C6E8_9STRA Photosystem II reaction center protein W TEGTNEWFGVDDLRLLAVLFLGHWAILSLWLGSYGDSNEDEDFFGEIDYSAR 52 T 0.0021 PsbW unppssm F Eukaryota T 7vec 2 M,N,O,P,Q,R,S,T,U,V,W M,N,O,P,Q,R,S,T,U,V,X TX264_HUMAN TEX264 phospho-LIR SSFEELDLY 9 T 2.9 DRMBL pdbhh F Eukaryota T 7veh 1 A,B,C,D,E,F,G,H A,B,C,D,E,F,G,H A0A3A9QXE8_MORCA AcrIF13 AGSMKLLNIKINEFAVTANTEAGDELYLQLPHTPDSQHSINHEPLDDDDFVKEVQEICDEYFGKGDRTLARLSYAGGQAYDSYTEEDGVYTTNTGDQFVEHSYADYYNVEVYCKADLV 118 T 0.019 DUF1882 unppssm F Bacteria T 7vf2 2 B B ZC3HD_HUMAN Zinc finger CCCH domain-containing protein 13 PVATATATTVPATLAATTAAAATSFSTSAITISTSATPTNTTNNTFANEDSHRKCHRTRVEKVETPHVTIEDAQHRKPMDQKRSSSLGSNRSNRSHTSGRLRSPSNDSAHRSGDDQSGRKRVLHSGSRDREKTKSLEITGERKSRIDQLKRGEPSRSTSSDRQDSRSHSSRRSSPESDRQVHSRSGSFDSRDRLQERDRYEHDRERERERRDTRQREWDRDADKDWPRNRDRDRLRERERERERDKRRDLDRERERLISDSVERDRDRDRDRTFESSQIESVKRCEAKLEGEHERDLESTSRDSLALDKERMDKDLGSVQGFEETNKSERTESLEGDDESKLDDAHSLGSGAGEGYEPISDDELDEILAGDAEKREDQQDEEKMPDPLDVIDVDWSGLMPKHPKEPREPGAALLKFTPGAVMLRVGISKKLAGSELFAKVKETCQRLLEKPKDADNLFEHELGALNMAALLRKEERASLLSNLGPCCKALCFRRDSAIRKQLVKNEKGTIKQAYTSAPMVDNELLRLSLRLFKRKTTCHAPGHEKTEDNKLSQSSIQQELCVS 563 T 0.35 Fimbrillin_C pdbpercent F Eukaryota T 7vf6 1 A,B A,B A0A7L7SI10_9CAUD PurA-like adenylosuccinate synthetase MGSAIDVIVGGQFGSEAKGRVTLERVQHWADNGHAVASMRVAGPNAGHVVWDQGHRFAMRSLPVGFVDPGTDLYIAAGSEVDIEVLQQEVDLVESYGYEVRDRLYIHPQATWLEPVHRDREASSTLTAKVGSTSKGIGAARSDRIWRVANLVGDNPAFQELGRVSDFTEDLRSELVDGSLALVIEGTQGYGLGLHAGHYPQCTSSDARAIDFLAMAGINPWDLSREDLAAHGFRIHVVIRPFPIRVAGNSGELSGETSWDELGLEAERTTVTNKIRRVGQFDPELVRRAVLANGVNNVKIHLSMADQLIPQLAGLEDLPEGWRESEYAGRLREFIDQIPFNERLVSLGTGPHTRIELFKENLYFQLE 367 T 1.6E-54 Adenylsucc_synt unppssm T Viruses T 7vfi 2 C C H31_MOUSE ARG-ARG-TYR-GLN-LYS-SER-THR-GLU-LEU RRYQKSTEL 9 T 13 SAP30_Sin3_bdg pdbhh F Eukaryota T 7vgm 1 A A Q818B4_BACCR Phenylalanine-4-hydroxylase MTKKTEIPSHLKPFVSTQHYDQYTPVNHAVWRYIMRQNHSFLKDVAHPAYVNGLQSSGINIDAIPKVEEMNECLAPSGWGAVTIDGLIPGVAFFDFQGHGLLPIATDIRKVENIEYTPAPDIVHEAAGHAPILLDPTYAKYVKRFGQIGAKAFSTKEEHDAFEAVRTLTIVKESPTSTPDEVKAAENAVIEKQNLVSGLSEAEQISRLFWWTVEYGLIGNIDDPKIYGAGLLSSVGESKHCLTDAVEKVPFSIEACIGTTYDVTKMQPQLFVCESFEELTDALETFSKTMAFKTGGKEGLEKAIRSENYATAELNSGLQITGTFSETIENDAGELIYMRTNSPTALALHNKQLANHSTSVHSDGFGTPIGLLTENIALENCTDEQLQSLGITIGTIAEFTFASGIHVKGTVTDIVKNDKKIALISFIDCTVTYNARVLFDASWGAFDMAVGSQITSVFPGAADAAAFFPMDEEVHEIPAPLVLNELERMYQTVRDIRSEGILHDAHIDQLIAIQEVLNKFYAKEWLLRLEVLELLLEHNKGHETSAALLHQLSTFTTDEAVTRLINNGLALLPVKDVKNDAKINLEHHHHHH 592 T 7.7E-39 Biopterin_H unppssm F Bacteria T 7vi4 1 A A TIA1_HUMAN RNA-BINDING PROTEIN TIA-1,T-CELL-RESTRICTED INTRACELLULAR ANTIGEN-1,TIA-1,P40-TIA-1 GYRVTGYETQ 10 T 1.1 DUF3520 pdbhh F Eukaryota T 7vi5 1 A A TIA1_HUMAN RNA-BINDING PROTEIN TIA-1,T-CELL-RESTRICTED INTRACELLULAR ANTIGEN-1,TIA-1,P40-TIA-1 GYRVAGYETQ 10 T 1.3 DUF3520 pdbhh F Eukaryota T 7viv 1 A,B A,B A0A2X0RU36_ASF I73R CDS PROTEIN,I73R PROTEIN METQKLISMVKEALEKYQYPLTAKNIKVVIQKEHNVVLPTGSINSILYSNSELFEKIDKTNTIYPPLWIRKN 72 T 0.012 HARE-HTH pdbpercent T Viruses T 7vlm 1 A A A0A2G3NPZ8_STRMC H2C7 MKIDTTVTEVKENGKTYLRLLKGNEQLKAVSDKAVAGVNLFPGAKIGSFLVRQDNIVVFPDNKGEFDLDFFNLLNDNFETLVEYAKMADCLDIAFDINEKSYFNMIMWLMKNIDENWSQSPYGESFYSSKDIDWGYKPEGSLRVSDHWNFGQDGEHCPTAEPVDGWAVCKFENGKYHLIKKF 182 T 4.2 PhoU_div unppercent F Bacteria T 7vmb 2 B B IQEC1_HUMAN ADP-RIBOSYLATION FACTORS GUANINE NUCLEOTIDE-EXCHANGE PROTEIN 100,ADP-RIBOSYLATION FACTORS GUANINE NUCLEOTIDE-EXCHANGE PROTEIN 2,BREFELDIN-RESISTANT ARF-GEF 2 PROTEIN,BRAG2 GPGSEFLSESYELSSDLQDKQVEMLERKYGGRLVTRHAARTIQTAFRQYQMNKNFERLRSSMSENRMSRRIVLS 74 T 0.0085 Protamine_P2 pdbpssm F Eukaryota T 7vmc 3 C C A0A2A3ULE6_ECOLX Contact-dependent inhibitor I MKLTVDSVINEPRSVAITIDGYIPVDIKIIDSKKLPPLYWRGGDGKKNLLELAVLPENGFLSSITLVMIASDSIHKTDSLSVSLPSSECGVPVVNTKLWSHSESDDFSRRFVDDFSLDIEVIISSESMLLTIGENKKVTSWIKCSDNFYLGIDAGRNVVHLYLDKLTPSEVESFFEAVG 179 T 1.4 DUF2283 pdbhh F Bacteria T 7vmt 1 A,B,C,D,E,F A,B,C,D,E,F MGT4A_MOUSE Alpha-1,3-mannosyl-glycoprotein 4-beta-N-acetylglucosaminyltransferase A soluble form GPLGNPPAEVSTSLKVYQGHTLEKTYMGEDFFWAITPTAGDYILFKFDKPVNVESYLFHSGNQEHPGAILLNTTVDVLPLKSDSLEISKETKDKRLEDGYFRIGKFEYGVAEGIVDPGLNPISAFRLSVIQNSAVWAILNEIHIKKVTS 149 T 0.37 NADase_NGA pdbhh F Eukaryota T 7vmw 2 C C substrate peptide XGAHTIX 7 T 160 DUF4399 pdbhh F T 7vpg 1 A,C,E,G A,C,E,G RAE1L_HUMAN RAE1 PROTEIN HOMOLOG,MRNA-ASSOCIATED PROTEIN MRNP 41 MSLFGTTSGFGTSGTSMFGSATTDNHNPMKDIEVTSSPDDSIGCLSFSPPTLPGNFLIAGSWANDVRCWEVQDSGQTIPKAQQMHTGPVLDVCWSDDGSKVFTASCDKTAKMWDLSSNQAIQIAQHDAPVKTIHWIKAPNYSCVMTGSWDKTLKFWDTRSSNPMMVLQLPERCYCADVIYPMAVVATAERGLIVYQLENQPSEFRRIESPLKHQHRCVAIFKDKQNKPTGFALGSIEGRVAIHYINPPNPAKDNFTFKCHRSNGTNTSAPQDIYAVNGIAFHPVHGTLATVGSDGRFSFWDKDARTKLKTSEQLDQPISACCFNHNGNIFAYASSYDWSKGHEFYNPQKKNYIFLRNAAEELKPRNKKHHHHHHHHHH 378 T 0.00011 WD40 unppercent F Eukaryota T 7vpg 2 B,D,F,H B,D,F,H NUP98_HUMAN Isoform 3 of Nuclear pore complex protein Nup98-Nup96 MHHHHHHHHHHTGTTIKFNPPTGTDTMVKAGVSTNISTKHQCITAMKEYESKSLEELRLEDYQANRK 67 T 0.045 DUF5023 pdbpssm F Eukaryota T 7vpw 1 A B BAP1_HUMAN BRCA1-associated protein 1 (BAP1) IGRLHKQRKPDRRKRSRPY 19 T 0.74 DUF3734 unppercent F Eukaryota T 7vrc 1 A,C A,C SNF11_YEAST SWI/SNF COMPLEX COMPONENT SNF11 MSSEIAYSNTNTNTENENRNTGAGVDVNTNANANANATANATANATANATAELNLPTVDEQRQYKVQLLLHINSILLARVIQMNNSLQNNLQNNINNSNNNNIIRIQQLISQFLKRVHANLQCISQINQGVPSAKPLILTPPQLANQQQPPQDILSKLYLLLARVFEIW 169 T 0.045 SSXT pdbhh F Eukaryota T 7vsx 1 A A LUCI_OPLGR QLnK EFFTLEDFVGDWRQTAGYNQDQVLEQGGLSSLFQNLGVSVTPIQRIVLSGENGLKIDIHVIIPYEGLSGDQMGQIEKIFKVVYPVDDHHFKVILHYGTLVIDGVTPNMIDYFGRPYEGIAVFDGKKITVTGTLWNGNKIIDERLINPDGSLLFRVTINGVTGWRLCERILA 171 T 0.023 Lipocalin_7 pdbhh F Eukaryota T 7vt4 1 A,B A,B A0A399DY85_9DEIN Endoglucanase H MGCQSTQLQTPAPDTGGIVELNRQLGRGVNLGNALEAPWEGAWGVRLEEGFFELIREAGFKTIRLPVSWTHHAGRAAPYTIDPAFFSRVDWAVTQATRRGLNIVVNVHHYDELNANPQAEEARYLSIWRQIAERYRNQPGSVYFELLNEPHGRFNDNPQLWNDLLAKALRVVRESNPSRAVIVGPVGWNSLWRLSELRLPDDPNLIVTFHYYDPLEFTHQGAEWLNPVPPTGVVWRENQGAFAAGWQNWSWGSRVGFVGEALEITYQEGWAGFYLHSDAGVEGYDRLAFRTSAPVSLQVSCRRDAPAKAVTTSGGVETVVNLSECGNPSRLTDLILQNNSPNARAAFRLERLELRGPGSPLALLTHQQNAIAQAMEFAQRWAEQNRRPIFVGQFGAYEKGDLDSRVRWTGAVRSELEKRNFSWAYWEFAAGFGIYDRTTRQWRTPLLKALVPEQPKLAAALEHHHHHH 468 T 6.3E-06 Cellulase pdbpercent F Bacteria T 7vt8 1 A A A0A399DY85_9DEIN Endoglucanase H MGCQSTQLQTPAPDTGGIVELNRQLGRGVNLGNALEAPWEGAWGVRLEEGFFELIREAGFKTIRLPVSWTHHAGRAAPYTIDPAFFSRVDWAVTQATRRGLNIVVNVHHYDELNANPQAEEARYLSIWRQIAERYRNQPGSVYFELLNEPHGRFNDNPQLWNDLLAKALRVVRESNPSRAVIVGPVGWNSLWRLSELRLPDDPNLIVTFHYYDPLEFTHQGAEWLNPVPPTGVVWRENQGAFAAGWQNWSWGSRVGFVGEALEITYQEGWAGFYLHSDAGVEGYDRLAFRTSAPVSLQVSCRRDAPAKAVTTSGGVETVVNLSECGNPSRLTDLILQNNSPNARAAFRLERLELRGPGSPLALLTHQQNAIAQAMEFAQRWAEQNRRPIFVGEFGAYEKGDLDSRVRWTGAVRSELEKRNFSWAYWEFAAGFGIYDRTTRQWRTPLLKALVPEQPKLAAALEHHHHHH 468 T 7.6E-06 Cellulase pdb F Bacteria T 7vti 1 A A A0A660UUL5_9BACT Cas13bt3 GGMAQVSKQTSKKRELSIDEYQGARKWCFTIAFNKALVNRDKNDGLFVESLLRHEKYSKHDWYDEDTRALIKCSTQAANAKAEALANYFSAYRHSPGCLTFTAEDELRTIMERAYERAIFECRRRETEVIIEFPSLFEGDRITTAGVVFFVSFFVERRVLDRLYGAVSGLKKNEGQYKLTRKALSMYCLKDSRFTKAWDKRVLLFRDILAQLGRIPAEAYEYYHGEQGDKKRANDNEGTNPKRHKDKFIEFALHYLEAQHSEICFGRRHIVREEAGAGDEHKKHRTKGKVVVDFSKKDEDQSYYISKNNVIVRIDKNAGPRSYRMGLNELKYLVLLSLQGKGDDAIAKLYRYRQHVENILDVVKVTDKDNHVFLPRFVLEQHGIGRKAFKQRIDGRVKHVRGVWEKKKAATNEMTLHEKARDILQYVNENCTRSFNPGEYNRLLVCLVGKDVENFQAGLKRLQLAERIDGRVYSIFAQTSTINEMHQVVCDQILNRLCRIGDQKLYDYVGLGKKDEIDYKQKVAWFKEHISIRRGFLRKKFWYDSKKGFAKLVEEHLESGGGQRDVGLDKKYYHIDAIGRFEGANPALYETLARDRLCLMMAQYFLGSVRKELGNKIVWSNDSIELPVEGSVGNEKSIVFSVSDYGKLYVLDDAEFLGRICEYFMPHEKGKIRYHTVYEKGFRAYNDLQKKCVEAVLAFEEKVVKAKKMSEKEGAHYIDFREILAQTMCKEAEKTAVNKVARAFFAHHLKFVIDEFGLFSDVMKKYGIEKEWKFPVK 777 T 0.026 Perilipin unppercent F Bacteria T 7vu5 1 A,B A,B CD28_HUMAN TP44 GPSKPFWVLVVVGGVLAFYSLLVTVAFIIFWVRSKRSRLLH 41 T 0.0099 WBP-1 unppssm F Eukaryota T 7vu7 1 A,B A,B A0A4Y2M0V6_ARAVE Flagelliform fibroin GGQPSGGVLPGGSYTPAAGGSSRLPSLINGIMSSMQGGGFNYQNFGNVLSQFATGTGTCNSNDLNLLMDALLSALHTLSYQGMGTVPSYPSPSAMSAYSQSVRRCFGY 108 T 0.0068 Spidroin_MaSp pdb F Eukaryota T 7vul 1 A,B,C A,B,C A0A7S6TZU4_9CAUD P560 DEPOLYMERASE MGSSHHHHHHSSGLVPRGSHMLNNLNQPKGSTIGVLKDGRTIQQAIDGLENPVHYVKDVSITPSALLAVAVEAARLGRTVAFGPGHYTNQGQPFEVDFPLNLDVPVGTFLDFPIIIRGKTVKMVRSVTTNLTAAQCPAGTTVIAGDFSAFPVGSVVGVKLGDNTNGSASYNNEAGWDFTTVAASSNTSITLSTGLRWAFDKPEVFTPEYAVRYSGQLSRSSYFIPGDYTSGLNVGDIIRVENIDGTDGVHGNKEYFEMLKVSSIDSSGITVETRLRYTHVNPWIVKTGLVKGSSVTGGGRLKRLEVRGVDTPKVNSVDVDRLIVGLCYNIDVGEITSRGVGEPSSVNFTFCFGRGFLYNVRASGSVSTTDNSALKLMSCPGLIINNCSPHNSTSTGSQGDYGFYVDAYYSPYWCWNDGMSINGIVTETPRSAVTRALWLFGLRGCSVSNLSGAQVFLQGCAKSVFSNIVTPDNLLELRDLSGCIVSGMANNALVLGCWNSTFDLTLFGIGSGSNLNIALRAGAGVTHPETGVPTTLGKNNTFNVKSFSPSSLAVTLSIAQQERPIFGAGCVDVDSANKSVALGSNVTVPTMLPLALTKGIDSGSGWVGGRTKGGIWFDGNYRDAAVRWNGQYVWVADNGSLKAAPTKPDSDSPSNGVVIGPLE 663 T 0.11 E1_FCCH unppssm T Viruses T 7vuo 1 A A KCNQ1_HUMAN Kv7.1 FNRQIPAAASLIQTAWRCYAAENPDSSTWKIYIRISQLREHHRATIKVIRRMQYFVAKKKFQQARIGSG 69 T 0.014 DUF5546 pdbpssm F Eukaryota T 7vv8 2 B C SIN1_HUMAN TORC2 SUBUNIT MAPKAP1,MITOGEN-ACTIVATED PROTEIN KINASE 2-ASSOCIATED PROTEIN 1,STRESS-ACTIVATED MAP KINASE-INTERACTING PROTEIN 1,SAPK-INTERACTING PROTEIN 1,MSIN1 TSKESLFVRINAAHGFSLIQVDNTKVTMKEILLKAVKRRKGSQKVSGPQYRLEKQSEPNVAVDLDSTLESQSAWEFCLVRENSSRAD 87 T 0.051 E3_UbLigase_RBR unp F Eukaryota T 7vv9 2 B C SIN1_HUMAN TORC2 SUBUNIT MAPKAP1,MITOGEN-ACTIVATED PROTEIN KINASE 2-ASSOCIATED PROTEIN 1,STRESS-ACTIVATED MAP KINASE-INTERACTING PROTEIN 1,SAPK-INTERACTING PROTEIN 1,MSIN1 SKESLFVRINAAHGFSLIQVDNTKVTMKEILLKAVKRRKGSQKVSGPQYRLEKQSEPNVAVDLDSTLESQSAWEFCLVRENSSRAD 86 T 0.051 E3_UbLigase_RBR unp F Eukaryota T 7vwo 2 G,H,I H,D,J VPB43_MYCTU Antitoxin VapB43 YRVQPSGKGGLRPGVDLSSNAALAEAMN 28 T 0.81 TetR_C_6 unp F Bacteria T 7vwv 1 A,B A,B A0A2X0RU36_ASF I73R CDS PROTEIN,I73R PROTEIN MAHHHHHHEFMETQKLISMVKEALEKYQYPLTAKNIKVVIQKEHNVVLPTGSINSILYSNSELFEKIDKTNTIYPPLWIRKN 82 T 0.012 HARE-HTH unppercent T Viruses T 7vyx 1 A,B A,B Selenomethionine (SeMet)-labeled Cas12c1 D969A mutant MQTKKTHLHLISAKASRKYRRTIACLSDTAKKDLERRKQSGAADPAQELSCLKTIKFKLEVPEGSKLPSFDRISQIYNALETIEKGSLSYLLFALILSGFRIFPNSSAAKTFASSSCYKNDQFASQIKEIFGEMVKNFIPSELESILKKGRRKNNKDWTEENIKRVLNSEFGRKNSEGSSALFDSFLSKFSQELFRKFDSWNEVNKKYLEAAELLDSMLASYGPFDSVCKMIGDSDSRNSLPDKSTIAFTNNAEITVDIESSVMPYMAIAALLREYRQSKSKAAPVAYVQSHLTTTNGNGLSWFFKFGLDLIRKAPVSSKQSTSDGSKSLQELFSVPDDKLDGLKFIKEACEALPEASLLCGEKGELLGYQDFRTSFAGHIDSWVANYVNRLFELIELVNQLPESIKLPSILTQKNHNLVASLGLQEAEVSHSLELFEGLVKNVRQTLKKLAGIDISSSPNEQDIKEFYAFSDVLNRLGSIRNQIENAVQTAKKDKIDLESAIEWKEWKKLKKLPKLNGLGGGVPKQQELLDKALESVKQIRHYQRIDFERVIQWAVNEHCLETVPKFLVDAEKKKINKESSTDFAAKENAVRFLLEGIGAAARGKTDSVSKAAYNWFVVNNFLAKKDLNRYFINCQGCIYKPPYSKRRSLAFALRSDNKDTIEVVWEKFETFYKEISKEIEKFNIFSQEFQTFLHLENLRMKLLLRRIQKPIPAEIAFFSLPQEYYDSLPPNVAFLALNQEITPSEYITQFNLYSSFLNGNLILLRRSRSYLRAKFSWVGNSKLIYAAKEARLWKIPNAYWKSDEWKMILDSNVLVFDKAGNVLPAPTLKKVCEREGDLRLFYPLLRQLPHDWCYRNPFVKSVGREKNVIEVNKEGEPKVASALPGSLFRLIGPAPFKSLLDDCFFNPLDKDLRECMLIVDQEISQKVEAQKVEASLESCTYSIAVPIRYHLEEPKVSNQFENVLAIAQGEAGLAYAVFSLKSIGEAETKPIAVGTIRIPSIRRLIHSVSTYRKKKQRLQNFKQNYDSTAFIMRENVTGDVCAKIVGLMKEFNAFPVLEYDVKNLESGSRQLSAVYKAVNSHFLYFKEPGRDALRKQLWYGGDSWTIDGIEIVTRERKEDGKEGVEKIVPLKVFPGRSVSARFTSKTCSCCGRNVFDWLFTEKKAKTNKKFNVNSKGELTTADGVIQLFEADRSKGPKFYARRKERTPLTKPIAKGSYSLEEIERRVRTNLRRAPKSKQSRDTSQSQYFCVYKDCALHFSGMQADENAAINIGRRFLTALRKNRRSDFPSNVKISDRLLDNLEHHHHHH 1310 T 8.6E-05 RuvC_1 pdbhh F T 7vzg 3 C,I E,e G2LK98_CHLTF PscE MTAILLACLFVLGGYAALWGIIKFVVANTKDIAAN 35 T 3.1 Maff2 pdbhh F Bacteria T 7vzg 4 D,J F,f G2LEN5_CHLTF PscF MWNVVGQIISVLCFFILTVGTLFGIVYVSHLLSRG 35 T 1.1 TssO pdbhh F Bacteria T 7vzg 5 E,K G,g G2LJ20_CHLTF PscG DISKVAWAWFGVLLAICLIGAFGNYVPKLFVKMLMFLN 38 T 0.32 DUF5383 pdbhh F Bacteria T 7vzg 9 N D G2LHG2_CHLTF PscD' MARTPEEIVKRYKEANIWLRHWKQQIGLAKDEEQREMFTQYYEERVQEIAALEEPYRAALKILNQQESQR 70 T 0.15 Elongin_A pdb F Bacteria T 7vzm 1 A A A0A1A9KGY0_9PSED AcrIE4-F7 GMSTQYTYQQIAEDFRLWSEYVDTAGEMSKDEFNSLSTEDKVRLQVEAFGEEKSPKFSTKVTTKPDFDGFQFYIEAGRDFDGDAYTEAYGVAVPTNIAARIQAQAAELNAGEWLLVEHEA 120 T 0.02 G2F pdbpssm F Bacteria T 7vzr 5 E,K G,g G2LJ20_CHLTF PscG MEGVAMEDISKVAWAWFGVLLAICLIGAFGNYVPKLFVKMLMFLN 45 T 0.46 DUF5383 pdbhh F Bacteria T 7w0n 6 G,H D,E ELA_HUMAN PROTEIN ELABELA,ELA,PROTEIN TODDLER QRPVNLTMRRKLRKHNCLQRRCMPLHSRVPFP 32 T 0.031 DUF5527 unppssm F Eukaryota T 7w0q 2 B B POLG_CXB3N peptide VGTTLEALFQ 10 T 0.39 E2F_TDP unppercent T Viruses T 7w3s 1 A,B,C,D A,B,C,D A0A2S6F197_LEGPN Type IV secretion protein Dot MKAIPPKIWFETQLKGSGLDKKFQIDELIETQSSVRVFANKKYLPDTETINEALTKVTAVNVSGDKSGYFQNGLPFPNEAGYFEKIPVGHPELLSPIERLTGSKKIVSSHSLVTASGGYPLTNPLLPYRKPIRVSIFSLAGPSFENNYLHYRLFLLDSVQKIIDSPLFSHLHDGLPIQFDEAKKELGEYDTNKLMARIRLGFPYLARFSSGGFYPSFSKSNAIIFLSEAYFRYQLEDVSLLLASVNQTGKETGKAALLKATAVGMGFFAKIDCGYDIQHIIFPYYLRAYKKLLSEHKFPWIAKIEFPIFNEIQQEQFDSIFEDYDGPTKVYRSTRDVLEFREEEIEKYLPAAINPSDAFALTGNEWGYGSVESMIGNNSSIRFDQVHHMNPLILDPSHHVEAQINKDHGVELT 413 T 0.0071 DUF4804 unppssm F Bacteria T 7w40 6 F E Bombesin XQWAVXHFX 9 T 0.14 Bombesin pdbhh F T 7w43 1 A,B,C,D,E,F,G,H,I,J,K,L A,B,C,D,E,F,G,H,I,J,K,L YJOB_BACSU Uncharacterized ATPase YjoB GSMTNIPFIYQYEEKENERAAAGYGTFGYLITRIEETLYDQYGVFYELYASDDPNTEYWELLVEDVRSGSLEPEHVAYIFEKLEKKTFAYDEDEKEPDYTVHKSIRNSVYAYPEKGVAFARIPYFQDGSIMSFDCLFAVNDEKMRAFLEGVRPRLWEKSKR 161 T 0.049 zf-NOSIP pdb F Bacteria T 7w54 1 A,B A,B Q5ZTB4_LEGPH Lpg2248 MAKTIKATGDGACLFNAVSIGLSVEILSGRLDSQLDTPGYQALLDEFAKHHPQFNPKSWKTLKEWLAYYNDTRDIELILAPVLFNLNQKYQDHLDEEILNELTNLVWKNKANIENGQAWFQLQNTGDLGEALFPKLENLDLKKDRAPLLDKLREILKDYKLELTRENVKQFLTEKAKELLSALKKKISSDPHAFQRGYSCDELKGMTDALAISLVENREEDITDNRIKIRLENQEEHWNVLCNEEDSERFLDSTPSRLKMTSLEAYRGDKQVSAPTSEQLDLIEEPGVFLRERTIDNPGLGNCAFYAFAIGLVNIIQEEAKYNRRTMFDRWVGLDRSISGQYDEILKLNLEDPDKELLDRLQSSLRIVTYQYQIRELRNVCVFRNGNYNRLTGNSNFVNFAALYYGDPLDTDSRFNPFADSVPILIKMANIDRDSVHPGHENDVLVPLFLDLLYGDTTNPADITLETEPKSDSPIITAMNNITQDFFWGTHLDLNYLAEAFEVNLHVLRNNSPIQEFVDIP 521 T 0.18 DUF2754 pdbhh F Bacteria T 7w56 3 C C NMS_HUMAN Neuromedin-S ILQRGSGTAAVDFTKKDHTATWGRPFFLFRPRNQ 34 T 0.00047 NMU unphh F Eukaryota T 7w5z 9 I,OB C3,c3 Q950Y6_TETTH Ymf68 MLICNFLMYSNFSRIYWFDFNGTVNENLPLNYNVLKICRNEINKLEKLNENNLGTQKNPIKLNLSFEDKHYNTNNLVLDLNSYETFNSKNFISSIFDKTFESLNTVLMAPIYSFLEFKLKLSSTKINTNHYYVINGKLYITYNDSFKLFTTINDYFNDLNELSNTKLFFLYRSFNIYNIKLNSLVDFVFLKLILFIHLLYLKSTNYNRFDYRLKQTDWGFYINNNSNYIQNIFSGLKYIWRGLRFWIIGLLLGLSSIYYLMYVRLLPFNKIIFAWILVAMFLYWLLSGFVFFVKKYQYSKFTAAIQRFWKRTYIIFWVIEAGTFSVFFYLTLNASSEPVYMYDQIKIYKTHLFSWRWFLIKLLPSVSIILLGYYLQLTLKWNLFNKQNTIVLLITLLLLYILWLEFYQFYHILSFYGNINWAFDYDEYIWTLELDTRRTRLANNYIAICLFAKFWHFVFIFLFWVFFVLRINELGRIRYPLLVANVQNFIIIYIMSWAYMYPWLKFIFRKYLDVPYYWFYLNGRELGIRVFFTDLKLFFYGITNRLFDFNPSSIKFEKYPFYYWINSSQLTEFNQYRKFVIRDSIIYSLNNYII 594 T 0.29 DUF3408 pdbpssm F Eukaryota T 7w5z 11 K,QB 6A,6a W7XCY5_TETTS Transmembrane protein, putative MIWKYLQRTNRGNIIQAGLQHRKFENLPFKQNFDNLTKAYDLRMWYISNSPHEAKNLEYVNELEALHNELNYQNSRQFLFRTVSFLLGWALFYQFYELPKTYDWQDTQEPKHQVPAYGDLEEGGDEGGDD 130 T 0.025 SpoU_methylas_C pdbpssm F Eukaryota T 7w5z 12 L,RB 6B,6b Q24I72_TETTS Cytochrome c oxidase subunit 6B MSSAVEKKDLPADYGKMPAGYNFLTRGKDWREYDKDFILRTDAVWEKFQLEHFFRNYMKCFFFDHGLKKYQMFEPEDMYTVVFEGWALDDLITFPGFTPTGRTNSYQIGLSPRQRTVVPTQTFYQMQDYYMLCGLRFERWFRCDLVYHDQRHTKFDQVKNQKNYKTYPCYREYYEAQYACQDDMFDFLMELAYARRAADNFESDFASHELTTLPTFYDTPKAAERKTYTY 230 T 35 SNN_linker pdbhh F Eukaryota T 7w5z 13 M,SB 6L,6l I7LVX0_TETTS Cytochrome c oxidase subunit 6B-like MEVKYRGPSDDKLECEFLENNLLSCLREKSVQDNVAKMTCRPEFLVWFFLECPTKAAVYHDPKGLRNIFIQDKIKQKGSDDGVLSKDD 88 T 1.3 Defensin_4 pdbhh F Eukaryota T 7w5z 14 N,TB 6C,6c Q23DS4_TETTS Transmembrane protein, putative MGSVWFRNRYWWYRSLYDDYVAREAKLAFGIAAFIWLPHYYWGIHLNRAFEVNFSHRNYAHEWGPRRNRLAHSLEFEQFDMILENWQDLEDEYAQRGDGMLKK 103 T 7.9 NIPSNAP pdbhh F Eukaryota T 7w5z 15 O,UB 7A,7a I7MGF9_TETTS Transmembrane protein, putative MNRFFKVSSKYQYYKYLEQYDAAFLRKYQSETHWYLGRRGAWKNLVIKYAGDHISLEEEHNVKYKTHLSFVYLSYRLAWVLFAYVLIYNHFLLGDIGKTFNVGEWDHRLKPSAERDYPTRYESLYILDRTQKW 133 T 0.0061 IRK pdbpercent F Eukaryota T 7w5z 16 P,VB 7C,7c W7X287_TETTS Cytochrome c oxidase subunit 7C MISKYRYLHCARKLVKQSVQAFGGGHHHHEYDWRDDPKVNKDIEEDIRDRGWHPETYDFPYTKKHDDWVFDVTMPSQNYQTDLTVNIHPENKKMHVMKQVMRQSYWDAEHDMAHEYDYESEDLDFQCESFKSQHFRKKGPISQYLILGLLPILYFGTEFFYNHYPDEDYWRVAHPPPLDYPDTDDTDDTETFKDYKSFTGRRMVDTGIVDPLWYDIREGKKVYYDWAGVNQPMEDI 236 T 0.14 Tctex-1 pdb F Eukaryota T 7w5z 22 BC,V t2,T2 W7XDM6_TETTS Cytochrome c oxidase small TIM subunit 2 MSDKRKIQHEGIIALINYSTLCAQKCDVLKGHDDKITDTEEQCLRVCAEKIRQTFEFTNDIYLKNPNLTKPN 72 T 0.00018 zf-Tim10_DDP pdb F Eukaryota T 7w5z 23 CC,W t3,T3 Q231A8_TETTS Cytochrome c oxidase small TIM subunit 3 MSTRKIFDSEEQSFIRLVDKFYLGLSLTKLCAQSCNLLRNDISGSALTQKEKDCLSICYNNIEKTQSAFYAKVKTTMNLPAVEDDGEEGGDDE 93 T 0.0022 zf-Tim10_DDP pdb F Eukaryota T 7w5z 24 DC,X t4,T4 Q22A35_TETTS Cytochrome c oxidase small TIM subunit 4 MEQNTTQVFSDLAYKVCFKVINDKNKPFVLHDEQRLANCLTRYVEAFNVTSEYFFRERAGETKVTEKQ 68 T 0.0086 zf-Tim10_DDP pdb F Eukaryota T 7w5z 25 EC,Y t5,T5 Q22N23_TETTS Cytochrome c oxidase small TIM subunit 5 MEDNYAADVQRQFNRTAFDSLYKICYNSLVQKNGSTIDFQKQIDCHQRLIQVFAKIAPIVVKVEQDAASSGGAAAGGEDEE 81 T 11 MetOD2 pdbhh F Eukaryota T 7w5z 26 FC,Z t6,T6 Q233U0_TETTS Cytochrome c oxidase small TIM subunit 6 MDPVLGDVIATRIYKACFKHVYGKNMKAYSEKDEAKFDQCLTSYVESYKSVTNHFITYLGQLPKKGLSLDGS 72 T 0.088 zf-Tim10_DDP pdb F Eukaryota T 7w5z 29 CA,IC AC,ac Q24C97_TETTS Cytochrome c oxidase acyl carrier-like subunit MMQNLKKFMSKTIQVQPVSFNQIPKAFYNFPEYRTGGVQANPGITAKRIIKCIGERLRKYDPARWENVPITFKTHFRDENGYSDVATSIQIHDALEREFGIDIKDRLALVTDVETAFYIVMSHHDPL 127 T 0.16 DUF1493 pdbhh F Eukaryota T 7w5z 30 DA,JC Y7,y7 Q950Y7_TETTH Ymf67 MTALFLHILWSISYIIINILYIFLSLLLSNNNEKIKQYNSNYFIKILLVLFYNKNLSFYKNLLSEDEISKIEFERLKNYPTLVLIHSNLNKLEKRNKIINSFINFKTKYRFYKFISTNFNLQTIIKNCNDKIIFSTLLYIVNLNYSFFYKTIKNTDLIVYLLANKFSILNDNIIVSKFNISKFNDYIKYINNTNSIDTYLENQIILGLNNNTNSNITKNINTKLLNSYSNLKNLVNITNNTFYLKKINDNYNTVINSEFLTYLKSNYKISFSASNIVKYLSDKSVNNSVILYLRKNKIFNKSRYSRNRQTYRTGAYWCLYVNIIAVVAFYFWFYKFTMNFGYLWWLLYSLILSFFFSRALKHRFYNPLNVMTEFKNGFMWFIIILINIFKPLLKLLENNYINLYNHLVIKYYQSFICNTLINKKKLEFNYILSSFKFIKELNNIIIISLNKLF 453 T 0.0058 NUFIP1 pdbpercent F Eukaryota T 7w5z 31 EA,KC Y0,y0 Q950Y0_TETTH Ymf70 MFRWLFLYWYNSTDTPSAIAKVNLWSYINLRLFKARLSSSIAYYILGLNNLELKKLKIFYKNTYFDYIYLKSIPCLFLIIFFTNLYLFL 89 T 37 DUF5784 pdbhh F Eukaryota T 7w5z 32 FA,LC Y5,y5 Q951A7_TETTH Ymf75 MFLGIFKDVIKLLNKKVVPVYFWFFLYCFLSTMDTNIFVSSCSFLKVEVFGKDENTTLVLLFYVFYSLFNFYLSRIKNKNNYLVRKHLYTTELLIELILFKYKLIILKFSSIKYILNFNVRKFILFNLFLINNYKAYKINTFFLYIYIYLNNLNIIWYPIFKAYSIFGYYKSTRLNFIDTKNENIKRIKY 190 T 1.8 PDH_E1_M pdb F Eukaryota T 7w5z 33 GA,MC A,a Q22PJ5_TETTS Transmembrane protein, putative MLSKVTRRFLNYNQIYCFASQHGAEHHKLTASDEAYLNEVRQRYVTPDMEKWAYLDYKKHPSTTLSHYDHKSKDYVESERDDYNADVATNSHNKLIDDFKRNLQMQRKVHDILQKMDRPYLRGVPGVTKNISAGLQDYSAPVSKKSQSDPNDFYRDAYRNENRWIDQSVFTPKTSKMTHYDVEWPKELASRPVTKKFHHDKGYKYDVTTPYDQRYNYVADRLGHPEILGNPFERLMRLEGDIYHPNYLDQPFVKVPNANPNASLNFEEGEVLYENTRLLEWAKFWNYSVVVGYLWCAYFVPYNIFFKTHMPLEHAYDNLFFPYFQHTHFLWDNNALHIPTVGGVAIYATYIALSYINNIWKDYVVRAQFSKDKELLFVTRVSPFGTTEEEVYEVAHLEHLPPSVRSGVKDLSAQDADGLVDVTCMSSQRSLVFYKGDQYWNPKVYNDFINQTSNLWTRNYTGYNRLEVQNSVEQVKIGFSHSSQPKLEKK 490 T 0.015 TMEM70 pdbhh F Eukaryota T 7w5z 34 HA,NC B,b Q22FX8_TETTS Protein phosphatase 2C, putative MFRRIISNGALLSTQTQRWQDLSKFACLRASLNKESEKAFQELAKKNNVSPQELVELSKIVSMNLDVLKQNINSEQFLLEKESTLKRYRQSSIGTRGHLQTVNEAVNTKYPTLAEGLGQVAGYKEAYQALREIFVHPSISVNNLRQGSYGQQFAVDFRTRADEYVKALLKDHSSNPQAVQTIQEIQHTLHQIIKNYEQNPASIYARILTVLQTRGVNTLPVSKTADQKAVATIQKTSTPSLTIDQLTVPVQERVQTQTVFDAELAFIKEANEMIQQNTGNLPWDGGKKKIFQGQANKYLETPYYLLAALSGLGLLYFLYSGDAKYKTLVLTPVVGIAAFVLLRRNQILNRVPTLTELFLHKDGKFVDAVVSVNGQLISKNDIPVSTLKLYRGDHTVKVNLNDFEDASAKKFLAQQSGQEGVINVHFSKLRNLAARNGQVLNLGDTEVVVPFENQANRIILKQIFKGVEVLPSS 473 T 0.011 Rh5 pdb F Eukaryota T 7w5z 38 LA,QC F,f Q23DG8_TETTS Transmembrane protein, putative MRYLKIEKEKLVSCKKQEQEVQRIRRRKGNQKLNSIAKQQRVKRRDYQQNIKQNKEVKNPKKLIKQQIINKVKKRKKMFRGLTKFNKVFALNSFKNSLVAVPKANLNHVQNMLEENLKYDAQKYNDEVAVIQKTSRIYKPTYTIEFNREGEVLVYSADPIKNSVVYFKYPYVLYEAAIPLFIWAWIYNPLELSKNAVNSLLIYPNIAWIPRMWYWRSLQYKIQKMYLLRGGKVAKIETQSLAGDRFTSWVETYQFHPLTQDQKNFDNQDNAEFLEDEGQLKYELGVQLDNLQEMGTTSQDIVINFMKEGTVHHPELFEAIVKGYNIDTSDYVINTANNLRAREGNHNH 348 T 0.023 TMEM70 pdbhh F Eukaryota T 7w5z 39 MA,RC G,g Q23DZ5_TETTS Cytochrome c oxidase subunit TT7 MFLNRLVKETSKAKRLFSMAQNNFARAGPYNPNRYKDYYIPRTLPKNEEIVEFVQSQHSVPASPIRNQRHINPVRESGPLPSYDGTYTMEDIRAVFYNTTVGRDYCYCQMDPEEIMRRVPGITRKEAEFITKLGLSPQEQVDFAYIAYNIGLDIFYFTNQMFVARQVVTNSKGEKVEVLWNAQCYEDIAQLNVGFAPVLESVDYHWEIFLWADPPIKPNNDFDLNVPCTWFEYEQEWWMESCIQEDQFNLPEDERPYNTPRNPHCRKELWRSQDALQEEELMVNENWYPKNTQYNIYNQPDFIKPKSGSGAAADDIRI 318 T 5.7 IMS_HHH pdbhh F Eukaryota T 7w5z 41 OA,TC I,i I7LY65_TETTS Cytochrome c oxidase subunit TT9 MVYHLFERICNPDNFKLSGEAARVRTLIAAGFSKEEAEQVAWLQNHQVNGKILGLFTGGFALYCCNNYFHYFERYFPRLRYQPFTKFLAQAATVYFFFKIGDYYFTSRRYGSNDARMNGLMYSNTYYSTNKEALIQNFEPLNRKFTEEEVEQFLRNEGRSQEEKRNWIYNPHIHGSTEGEWKADIHEKFDSGKAPWEREHVKAKILETNKAKIDAGEEIQLKPFKTLNHLDKTGLLHRLHPFIWTNNWTLLG 252 T 0.19 Bac_luciferase pdbpssm F Eukaryota T 7w5z 42 PA,UC J,j I7MD70_TETTS Cytochrome c oxidase subunit TT10 MSSFIQYEFLKIYQGNQKIKNYYKRKRLIFQQKKVLKKKQKEIQMSTNNLRLKPWFHWTDEERSHAIFSAYEKRILKSEDLPSFLRANRINNVSTWVFPLIALPLFNQSIFKLGFAQRILLTRPAIEWHCFKIATVAASWLAWLNFSPFYRKLENEKEYLLDTLESRIGINVLDLNDALPRWTTSQEYNRRTQQLYNQRNGFFAGLLYPQEESSRPLVDIASFPKNLHKEKLTK 234 T 2.9 TFA2_Winged_2 pdbhh F Eukaryota T 7w5z 43 QA,VC K,k W7X4J9_TETTS Cytochrome c oxidase subunit TT11 MFGRLVLKQTRRTLFNPVLKNTFCIYQAYQNPLRHINTGHNPNNVYEDIVMLGDYPVQNRTHDKVISQTYVPAIANIAFTHLSKKYPQAGLKVDQLNTLKEKTWNDLGVNIEHEKQEILVELSEQIFVKESKLRWVHEQRQRLAHTTYVFSGLEFQNVKVGFFIDSYNFLLQELAHRSNLYQSKDIVGEKSFHEKHLEQQTAPYSGVKSLEEPVSQNKSFINSLMRAIHNH 231 T 27 FliD_C pdbhh F Eukaryota T 7w5z 44 RA,WC L,l I7M3P9_TETTS Cytochrome c oxidase subunit TT12 IKGNQKKQKGKNQSNNNNNIREEGKQIKEMILPHNNRQLARQYFDSLPENDINRKYYEGLKYETPKTFFGRFLNQFNIDAKLDTLSKFYTYQKTIRATQAELQEDRKSYLTNSLLFTAVSWFSIYQFARKGAVLPVLREYGRYFGTHRLFRQYLHTLVLPLLYTEYALNQKYYTHMEHLWTVHVNRLNQKILEDPLYTFYPQELNVPKHNIIVPTIFRDTPQ 222 T 18 Plk4_PB2 pdbhh F Eukaryota T 7w5z 45 SA,XC M,m W7WZP1_TETTS Transmembrane protein, putative MSLSLFGVKNNWHKNGIWWFSKILNKTVGEERYDALRVQRRIWSMRFYYARQQCLYELFVDHPDLAQWTGTYPKVDSSHGFPFYSTYEMYRDFQENTLNSDGSFAQWITLVCGIYVIHVIYNYMIPYYWVSTPLKNDEFTRLRMKDYIASTVLEEVYGISYAEWGWLPHDFAYNRMRGLAGYMHPDDPRAMCTSTFHRKHKYIEHEVEKVGDYHHMTYPK 220 T 61 Spore_YabQ pdbhh F Eukaryota T 7w5z 46 TA,YC N,n I7LZX8_TETTS Transmembrane protein, putative MKKGTASEEELKKLYDPNTFYEHGDNPAFKQFMNIAVENLREGKLTDHRTYVVDTYKKWMYARNWDDFLQRDCKAITFPRAFALWIVGTLGMATASKWCRQILPVGSHGITKISQTQFFHQFGPLGTLGAVGFYGLTAYLYYKTTIFTVKKFYSHCILQEREWIFEQERQNPGYGEYFFKDVPLSAEEHFNDLARGEMAKKKFEKPNHEF 210 T 8.8 DUF4500 pdbhh F Eukaryota T 7w5z 47 UA,ZC O,o Q23F08_TETTS Cytochrome c oxidase subunit TT15 MKEKIFNELTRKMKRKEISAKIQREENKQILIRQRNNKKYIQSIQGIQQERKKGKLYLVEMATQNVEEMDTIQKMNYEATVNMGRQDLITREYTFYSDYEFIPIQEDRKQQMEDALNNLHKIIHPTVTQLKKKANVQEIQDRVFRKLQGWEGELNTCVFSAKNVRDSNFCADRFTNRINTEGVEFVKQILREY 193 T 0.13 DUF3221 pdbpssm F Eukaryota T 7w5z 48 AD,VA p,P I7M8Y9_TETTS Cytochrome c oxidase subunit TT16 MNNTFKFLHQVISKLTLKAQVPNYGQYSHSLKRPINPKVVVFGNSSRAYELISSQFRNFNHVNGLELKGQEDNIQANKVAQSVLSINDGFQDGYYITDFPQNSKQAERLDLITDGVNLALYIKDPSDKVTVTRQQEAIDYYRKTGALVEFEVDPRGDLEEQVKQLSNQVLNGYKH 175 T 0.11 PRORP pdbhh F Eukaryota T 7w5z 49 BD,WA q,Q Q23D87_TETTS Transmembrane protein, putative MDNNYHFWGNGDRQDVSLSYEDYYSILDCLLDEKLSPQGLMKFKNLHEVSMYGVSYVPLYCFPVAYGISHMLTGKVRRGHSGYRNLFSLMSVVLPFTCWYAYTTPIPRRLYTEIICSNNADGAYVRNRIKQQKPGIWRKLSQQLYNKNFRFPELNQDLTATEFPLDYVAPHKF 173 T 0.16 DUF2206 pdb F Eukaryota T 7w5z 50 CD,XA r,R I7MKT6_TETTS Cytochrome c oxidase subunit TT18 MVFEFLFYNQQHKTRNGYFINHDNLMLASLEERKKLIFYFIANQVPEKLDPVDRVKFNEELSDNLSTKARLIGSLTGLIGLVGFPYISTRIYSRPVLNIGLSLLICPFLYYVGNQLTYSVWEPKFIANNNTVCELSKKYNFTVFDFAQAKKEAHLKALRTELVSDNLLYSPGI 173 T 0.048 DUF1689 pdbhh F Eukaryota T 7w5z 51 DD,YA s,S Q230X6_TETTS Cytochrome c oxidase subunit TT19 MAIRNFVFKISNQIQNLAAKRSLAYLNQIDSQSVPSRATINMKDQVTQMQREIDNMANVIRAQIPDEDRAEFEILKKYYVTGQHDSLVDPQDVLLQLDRIQVLKNLKMIELNEEAYDPELVRLEKLKARVLLEEEGALLEYAHFISKRPYNKPYEKWGVSEEHVKQQILG 170 T 0.35 APG6_N pdb F Eukaryota T 7w5z 52 ED,ZA t,T Q23VY4_TETTS Transmembrane protein, putative MGFETVVPAPPTRDDELRMIKATEEQFLQQPRYKLYMNEAHRIAKMNHGDRHNNIRAHFWSNFALGLLITGPIFIIPFGKAFRNLRSGVPYYFRPKYVFTQKNQYNQDRNWGAMKKQIPLWLGLSTAYAYWFTDFSINDDEWLEKGKVIYPHQTIKVL 158 T 0.091 MASE1 pdbpercent F Eukaryota T 7w5z 53 AB,FD U,u Q22DP8_TETTS Transmembrane protein, putative MSCTTRRFIDEKEKLEYSRGYNQQELEASKLRKDFVKKYIVDFDTTLYKTQVERDWAYIAKREYRYEVQLKSIGYGGALANAVLLWRIYANKKMVFWPIPIVGALGYLYFQPVFFQKSNKRFFDMCNVGEEYYLGRERNKILRECNKILNVEDF 154 T 0.1 DUF559 pdb F Eukaryota T 7w5z 54 BB,GD V,v I7MFV5_TETTS Cytochrome c oxidase subunit TT22 MGKDQLDFSHFDKAFENKYDIVAPEFGDLHQKRAEFIAKNQGTYRPVPLVPNNIKGLIPKTCRLPATRNWYRRTSSFERNGFFNIHTPVLNTKMIPWLLFIVLTWGWSSFQIGGYNYERFDDNGERRNTLYWKLSPVEFPQSKLWNRPS 149 T 0.056 TOM6p pdb F Eukaryota T 7w5z 55 CB,HD W,w Q23TE5_TETTS Transmembrane protein, putative MVFHYTNFVQETNAWWLRRVRPVYCTVLAYYGWWLYDRYYLFGKNATQDIRKDTTEVWEKRAALNKRNWGYNAHYKPELERSMKKVLYADPNYKFPIEWPERYMAETKTLEQVMDEEENWEYYK 124 T 4.1 GTA_holin_3TM pdbhh F Eukaryota T 7w5z 56 DB,ID X,x Q22W32_TETTS Transmembrane protein, putative MEPFGTDERNWTHEEKDIITRFLKYDKHVNLKTAEMVYSAEVESAYFGKAGALAGGVISALFFNFPIVRNLPIIRRSVIGVLPFLYCYTWGKNTQEELRWLKTFAAYQRFVVYHGQHCKLWV 122 T 3.1 Pepsin-I3 pdbhh F Eukaryota T 7w5z 57 EB,JD Y,y I7M9E7_TETTS Cytochrome c oxidase subunit TT25 MAQTAHQNRYQGGLCYAQCNELFSFWNPSIQQCWKGCDFGVGRVNDPEGRIEAQQMCKRWAAELYWTYKGELDTIKDLRVHADMYPTTPQNVYRACLAGVRRQKF 105 T 0.59 BSMAP pdbhh F Eukaryota T 7w5z 58 FB,KD Z,z I7LTF1_TETTS Cytochrome c oxidase subunit TT26 MSSDPFKKVERDYHNERSVHKHFASYPLKFWWGLNKFETIQGIHSILGNAADLVVSTLSFIPGVQGRNNASYIENSIRVTRFRGFDDKTQ 90 T 0.14 DUF5493 pdbpssm F Eukaryota T 7w63 1 A,B,C A,B,C TCPB_VIBCH Toxin-coregulated pilus biosynthesis protein B GGELMIKSSNAFDVIELSSQIQRYASLSKINNRTNPILKDNKAKEFKDADLKWLKLENCPTAGDVPTTGNNNDLQDQFIACDADYRKGDLSYFGSQFEFSTYVHPSNPEIQRQIKQVVSYFQYRGMERAFIGDAAGYVISEAKKKGFSAQDYRIVLIEPDRVGYFESNAISYEEFIENPSARENFLLKATKDRTLALAVSLAQTGEIAMQRDGSVAFLEDSELCWDTAAGSAKSCLSVRYDTVGNKTELDLKQIDVVSAKGLSFESDGKTKTPVVSTYETFQDGGRAKTINAIECPTGLNNRFAAVVSSFSTAGQNANFSSESAKDSQGTTQKDGSKGPHALLSGISLNWTLTNKVWDVTASIGIESGILPTSGIDSGSLLRNPKSLSFIAFQWCEN 397 T 0.0013 DUF1494 unphh F Bacteria T 7w75 2 C,D,E,F C,D,E,F BRE1_KLULC RING-TYPE E3 UBIQUITIN TRANSFERASE BRE1 MNDHFVKRPKLELSDPSEPLTQKDVIAFQKEALFRCLNKWRVKANQLVEENEVLAAGLSKTTESVSGCCSSIVVLARSVVEDCSDEQDKRFLQQLINTEDEHTLTQIISNNSARICELILKTSGSNISDNIGRLQELESLTLTLQKLLKSSENKLKKATEYYENIIAQYDRQDSESVSRVFNTADDDSNVKKEKQSSTGASSVNDE 206 T 7.8E-05 HCR unphh F Eukaryota T 7w7g 4 D D NALF1_MOUSE Transmembrane protein FAM155A MTRGAWMCRQYDDGLKIWLAAPRENEKPFIDSERAQKWRLSLASLLFFTVLLSDHLWFCAEAKLTRTRDKEHHQQQQQQQQQQQQQQQQQQQQQQRQQQRQRQQQRQRQQEPSWPALLASMGESSPAAQAHRLLSASSSPTLPPSPGGGGGSKGNRGKNNRSRALFLGNSAKPVWRLETCYPQGASSGQCFTVESADAVCARNWSRGAAAGEEQSSRGSRPTPLWNLSDFYLSFCNSYTLWELFSGLSSPSTLNCSLDVVLTEGGEMTTCRQCIEAYQDYDHHAQEKYEEFESVLHKYLQSDEYSVKSCPEDCKIVYKAWLCSQYFEVTQFNCRKTIPCKQYCLEVQTRCPFILPDNDEVIYGGLSSFICTGLYETFLTNDEPECCDIRSEEQTAPRPKGTVDRRDSCPRTSLTVSSATRLCPGRLKLCVLVLILLHTVLTASAAQNSTGLGLGGLPTLEDNSTRED 467 F F Eukaryota T 7w8k 1 A A drp1 CPPCHGRPTCDSFTNCWELLTCPPC 25 T 0.23 CCAP pdbhh F T 7w8o 1 A A drp2-a GCPPCESCHSGESTFWCYWEALCPPC 26 T 0.32 FlpD pdbhh F T 7w8t 1 A A DRP3 GCPPCASGCSPETGEFCWREDDCPPC 26 T 1.3 DNA_ligase_ZBD pdbhh F T 7w8z 1 A A drp4 SCPPCHGRPTCTKPGDNATPEKLAKYQACWELLTCPPC 38 T 0.14 Hormone_3 pdb F T 7w96 1 A A drp6 SCPPCMEVSSCDEETGECEIGSRCPPC 27 T 0.083 zinc_ribbon_4 pdbhh F T 7wa4 1 A A GIGAN_ARATH Protein GIGANTEA GSMASSSSSERWIDGLQFSSLLWPPPRDPQQHKDQVVAYVEYFGQFTSEQFPDDIAELVRHQYPSTEKRLLDDVLAMFVLHHPEHGHAVILPIISCLIDGSLVYSKEAHPFASFISLVCPSSENDYSEQWALACGEILRILTHYNRPIYKTEQQNGDTERNCLSKATTSGSPTSEPKAGSPTQHERKPLRPLSPWISDILLAAPLGIRSDYFRWCSGVMGKYAAGELKPPTIASRGSGKHPQLMPSTPRWAVANGAGVILSVCDDEVARYETATLTAVAVPALLLPPPTTSLDEHLVAGLPALEPYARLFHRYYAIATPSATQRLLLGLLEAPPSWAPDALDAAVQLVELLRAAEDYASGVRLPRNWMHLHFLRAIGIAMSMRAGVAADAAAALLFRILSQPALLFPPLSQVEGVEIQHAPIGGYSSNYRKQIEVPAAEATIEATAQGIASMLCAHGPEVEWRICTIWEAAYGLIPLNSSAVDLPEIIVATPLQPPILSWNLYIPLLKVLEYLPRGSPSEACLMKIFVATVETILSRTFPPESSRELTRKARSSFTTRSATKNLAMSELRAMVHALFLESCAGVELASRLLFVVLTVCVSHEAQSSGSKRPRSEYASTTENIEANQPVSNNQTANRKSRNVKGQGPVAAFDSYVLAAVCALACEVQLYPMISGGGNFSNSAVAGTITKPVKINGSSKEYGAGIDSAISHTRRILAILEALFSLKPSSVGTPWSYSSSEIVAAAMVAAHISELFRRSKALTHALSGLMRCKWDKEIHKRASSLYNLIDVHSKVVASIVDKAEPLEAYLKNTPVQKD 815 T 16 AvrL567-A pdbhh F Eukaryota T 7wap 1 A,B,C,D,E,F,G,H,I,J,K,L,M,N B,G,A,D,E,F,C,H,I,J,K,L,M,N D7DTD6_METV3 Mevo lectin MAQNDNYIYSTEVGGVGGTPFTFMQESGTITSIKFNWSDQYKLLHHIEVKFINNANIYATGDPKGNHEVILEIDDDETIIGSVIGYKKGNDGRCTGVKLTTSKGKSIMAGYFEESLITTYTGKLAGIKGGAGSAIDRLGLIFLKK 145 T 0.00015 Jacalin unppssm F Archaea T 7wbk 1 A,B A,B Q5ZZD0_LEGPH Lpg0081 MKAIPPKIWFETQLKGSGLDKKFQIDELIETQSSVRVFANKKYLPDTETINEALTKVTAVNVSGDKSGYFQNGLPFPNEAGYFEKIPVGHPELLSPIERLTGSKKIVSSHSLVTASGGYPLTNPLLPYRKPIRVSIFSLAGPSFENNYLHYRLFLLDSVQKIIDSPLFSHLHDGLPIQFDEAKKELGEYDTNKLMARIRLGFPYLARFSSGGFYPSFSKSNAIIFLSEAYFRYQLEDVSLLLASVNQTGKETGKAALLKATAVGMGFFAKIDCGYDIQHIIFPYYLRAYKKLLSEHKFPWIAKIEFPIFNEIQQEQFDSIFEDYDGPTKVYRSTRDVLEFREEEIEKYLPAAINPSDAFALTGNEWGYGSVESMIGNNSSIRFDQVHHMNPLILDPSHHVEAQINKDHGVELTVN 415 T 0.0071 DUF4804 unppssm F Bacteria T 7wcy 3 C,F C,F Q8MWF9_CRYPV SER-VAL-PHE-ALA-ILE-PHE-ALA-ALA-LEU SVFAIFAAL 9 T 3.2 DUF2547 pdbhh F Eukaryota T 7we3 1 A A DRP8II GCPPCPREHELVAVPCEGLNNCWFVEACPPC 31 T 0.94 Late_protein_L1 pdbhh F T 7weg 2 C,D C,D FCSD2_MOUSE FCHSD2 GPGSTEKMEDVEITLV 16 T 6.9 ChW pdbhh F Eukaryota T 7wek 2 C,D C,D CAMP3_MOUSE MARSHALIN,PROTEIN NEZHA NSEVKMTSFAERKKQLVKAEAESGLGSPTS 30 T 16 CEP44 pdbhh F Eukaryota T 7wex 1 A A Q82A10_STRAW Cytochrome P450 hydroxylase MGSSHHHHHHSSGLVPRGSHMDPAEGLLADPYAVYDRLRDTAPVHRIAGTDGKPAWLVTRYDDVREGLANPLLSLDKKHALPGNYRGLALPPALDANLLNMDAPDHTRIRRLVGRAFTLRRVEQLREPVRETAHRLLDALGTHGSTDLIASYAAPLPITVICDLLGVPDEHRRDFRAWTDPLVTPDPARPDVARESVVSLLGFFTGLLADKRKNPADDLLSDLIAVQEEGDRLTEDELMSLAFLILFAGYENTVHLIGNAVLALLRHPEQLAALREDPARLPDAVGEFARYEGPALLAIRRFPVRDVTIGGVTVPAGETVLLSLSAANRDPSRFPDPDRLDLGRDAAGHLALGHGVHYCLGAPLARLETEVALAALLERFPDLALAETEPRRRPSLRARGLLALPVTY 408 T 2.1E-33 p450 pdbpercent F Bacteria T 7wff 9 I b PNSB2_ARATH PROTEIN PNSB2,NAD(P)H DEHYDROGENASE SUBUNIT 45,NDH-DEPENDENT CYCLIC ELECTRON FLOW 2 MASLISFSLLPKPKAVRSSISAPQTQTINTEKLEDKFGRKGIKFSESNNIPMVELKVRNGSSLKLSLSDAHVLSYKPKVYWKDEGFEEVLYTVDGDESRGGVGVVIVNGEEPKGGSSVISGCDWSVKDTDSDAIDALQIELSCTAGVLDITYIVSLYPVSMATALVVKNNGRKPVTLKPGIMSYLRFKKRSGAGIQGLKGCSYCPNPPLSSPFELLSPSEAMKAESSGWFGSEEGEKPGIWAVEDSVITLLEKKMSRIYGAPPAERLKAVYNTPPSKFETIDQGRGLFFRMIRIGFEEMYVGSPGSMWDKYGKQHYFVCTGPTSMLVPVDVASGETWRGAMVIEHDNL 348 T 0.33 Aldose_epim pdbpssm F Eukaryota T 7wff 11 K d B3H6Z4_ARATH NDH dependent flow 6 MAEAFTSFTFTNLHIPSSYNHSPKQNSGPNHGYWLSNVNEKRERNLMRGSLCVRKALPHDLPLMAVMVQQIEGMRDIITEKHVWHLSDKAIKNVYMFYIMFTCWGCLYFGSAKDPFYDSEEYRGDGGDGTGYWVYETVCISPFLILLGKKEKNLEMHTNYN 161 T 7.1 NAD_kinase pdbhh F Eukaryota T 7wff 12 L e PNSB5_ARATH PROTEIN PNSB5,NAD(P)H DEHYDROGENASE 18 MATVTILSPKSIPKVTDSKFGARVSDQIVNVVKCGKSGRRLKLAKLVSAAGLSQIEPDINEDPIGQFETNSIEMEDFKYGYYDGAHTYYEGEVQKGTFWGAIADDIAAVDQTNGFQGLISCMFLPAIALGMYFDAPGEYLFIGAALFTVVFCIIEMDKPDQPHNFEPQIYKLERGARDKLINDYNTMSIWDFNDKYGDVWDFTIEKDDIATR 212 T 0.028 DUF3098 pdbpssm F Eukaryota T 7wj2 3 C C GAG_HV1H2 8-mer peptide LYNTVATL 8 T 0.15 Gag_p17 pdbhh T Viruses T 7wjt 1 A,B,C,D A,B,C,D TM266_MOUSE Isoform 2 of Transmembrane protein 266 GGSVKLEMEMVTQQYEKAKAIQDEQLERLTQICQEQGFEIRQLRAHLAQQDLDLAAEREAALQA 64 T 0.00011 VGPC1_C unphh F Eukaryota T 7wko 1 A A M1R2X3_9CAUD Csy1 MGSSHHHHHHSSGRENLYFQGMIKEMIEDFISKGGLIFTHSGRYTNTNNSCFIFNKNDIGVDTKVDMYTPKSAGIKNEEGENLWQVLNKANMFYRIYSGELGEELQYLLKSCCTAKEDVTTLPQIYFKNGEGYDILVPIGNAHNLISGTEYLWEHKYYNTFTQKLGGSNPQNCTHACNKMRGGFKQFNCTPPQVEDNYNA 200 T 0.0033 Cas_Csy1 unppercent T Viruses T 7wko 2 B B M1QWL5_9CAUD Csy2 MGSSHHHHHHSSGRENLYFQGMRKFIIVKNVKVDGINAKSSDITVGMPPATTFCGLGETMSIKTGIVVKAVSYGSVKFEVRGSRFNTSVTKFAWQDRGNGGKANNNSPIQPKPLADGVFTLCFEVEWEDCAEVLVDKVTNFINTARIAGGTIASFNKPFVKVAKDAEELASVKNAMMPCYVVVDCGVEVNIFEDAVNRKLQPMVNGYKKLEKIVDNKHMRDKFTPAYLATPTYTMIGYKMVSNVDNFDQALWQYGENTKVKTIGGIYND 269 T 6.1999999999999986E-24 Cas_Csy2 unppssm T Viruses T 7wlp 2 B,C,D B,D,C BKRF4_EBVG Tegument protein BKRF4 MRRLLSDEEEETSQSSSYTLGSQASQSIQEEDVSDTDESDYSDEDEEIDLEEEYPSDEDPSEGSDSDPSWHPSDSDESDYSESDEDEA 88 T 0.033 Nop14 unppercent T Viruses T 7wmc 2 B,C,E C,E,D Peptide1 GFXRGXWPCG 10 T 0.22 DUF1677 pdbhh F T 7wmp 1 A,B,C,D,E,F,G,H,I,J,K,L a,b,c,d,e,f,g,h,i,j,k,l PORTL_BPKHP HEAD-TO-TAIL CONNECTOR GP8,PUTATIVE PORTAL PROTEIN ORF17 MDFTTLQNDFTNDYQKALIANNEFLEAKKYYNGNQLPQDVLNIILERGQTPIIENMFKVIVNKILGYKIESISEIRLSPKQEEDRALSDLLNSLLQVFIQQENYDKSMIERDKNLLIGGLGVIQLWVSQDKDKNVEIEIKAIKPESFVIDYFSTDKNALDARRFHKMLEVSEQEALLLFGDSVIVNYSNVNHERIASVIESWYKEYNEETQSYEWNRYLWNRNTGIYKSEKKPFKNGACPFIVSKLYTDELNNYYGLFRDIKPMQDFINYAENRMGNMMGSFKAMFEEDAVVDVAEFVETMSLDNAIAKVRPNALKDHKIQFMNNQADLSALSQKAEQKRQLLRLLAGLNDESLGMAVNRQSGVAIAQRKESGLMGLQTFLKATDDMDRLIFRLAVSFICEYFTKEQVFKIVDKKLGDRYFKINSNDDNKIRPLKFDLILKSQLKTESRDEKWYNWNELLKILAPIRPDLVPSLVPLMLNDMDSPITNDVLEAIQNANALQQQNAEANAPYNQQIQALQIQKLQAEIMELQAKAHKYAEQGALSQTTNESEKINQAVAITEMQQQNANNANNEESNNKPKKKLKTSDKTTWRKYPSAQNLDY 602 T 0.056 LUC7 pdb T Viruses T 7wmp 2 M,N,O,P,Q,R,S,T,U,V,W,X m,n,o,p,q,r,s,t,u,v,w,x I7HHN3_BPKHP Adaptor protein gp12 MIEVSEVIAKVRERLNDNEVGNYEILDSVLVENINQALLKICLEFRLKKAITRSLITEEERFLTLNNLLGIESVKLDKKEIESRNTIEKDTGELELLILSDRISVTPFKIGELEVVYYTYEEIRNILETIKLPKICLDVLVYSVLCNLLEIPNNETNFSVLANYKQLLKLAKDNLTNYLSLMYSKNIHFSKVVRV 195 T 0.0012 GST_C_6 pdbpssm T Viruses T 7wmp 3 AA,BA,CA,DA,EA,FA,GA,HA,IA,JA,Y,Z C,D,E,F,G,H,I,J,K,L,A,B I7HFX1_BPKHP Nozzle protein gp25 MDTTRFIRNFILFKDALQKQNFNNKDLNTTSMQAALQSEQLALSEESQYLQSEQVRAKMQIDFLGMQANLQNAKAETLNKLIQCQAMLKSLKDNAMINRANALVSLLQVQANAANGITTSNFEAAFKIIAQIGSEYNQITLNNGNVSVQEKEQTNELKTILNSLSKELEKLNQQSEVNSIQIFSDKLEVLKDAPARLWGFSTLSNAKEGFYNEANEQIASGSVCLFRSDKVRKHTITFKAINTKTSLSKNITISVIANKLKERMS 265 T 0.019 PKD_3 pdbhh T Viruses T 7wn0 1 A A Q8IDM6_PLAF7 Equilibrative nucleoside/nucleobase transporter MSTGKESSKAYADIESRGDYKDDGKKGSTLSSKQHFMLSLTFILIGLSSLNVWNTALGLNINFKYNTFQITGLVCSSIVALFVEIPKIMLPFLLGGLSILCAGFQISHSFFTDTQFDTYCLVAFIVIGVVAGLAQTIAFNIGSTMEDNMGGYMSAGIGISGVFIFVINLLLDQFVSPEKHYGVNKAKLLALYIICELCLILAIVFCVCNLDLTNKNNKKDEENKENNATLSYMELFKDSYKAILTMFLVNWLTLQLFPGVGHKKWQESHNISDYNVTIIVGMFQVFDFLSRYPPNLTHIKIFKNFTFSLNKLLVANSLRLLFIPWFILNACVDHPFFKNIVQQCVCMAMLAFTNGWFNTVPFLVFVKELKKAKKKKEIEIISTFLVIAMFVGLFCGIWTTYIYNLFNIVLPKPDLPPIDVTQ 422 T 1.5000000000000002E-22 Nucleoside_tran unppercent F Eukaryota T 7wrk 1 A A Q5SH57_THET8 hypothetical protein TTHA1873 MGNYLEDCATVDVQARPTAYALAISSLGEFNSLTGGTSTDPVAEGNDYYYRFEIRAWEGSSGPQTNVTLNVTRTLGNSTFAGSGTKGVDFEVELDPDGPFGPASYAPVLSADVQVLAWGPTGVQLRYLPSLAPGATLRFSLRANAVNGTNTTVQADATSTEAPGPYTVFETTTIIP 176 T 0.021 CRISPR_assoc pdb F Bacteria T 7wrw 1 A,B,C,D,E,F B,C,F,D,E,A Q9RW32_DEIRA HerA MTGNDVQGAEKADAIGMVLGTEDVTPTVFWFAVSHGASVGLDDLVVVETRKPDGTPVRFYGLVDNVRKRHEGVTFESDVEDVVAGLLPASVSYAARVLVTRVDPENFIPPQPGDHVRHAAGRELAMALSADKMEEAAFPGGLLADGQPLPLNFRFINGESGGHINISGISGVATKTSYALFLLHSIFRSGVMDRTAQGSGGRQSGTAGGRALIFNVKGEDLLFLDKPNARMVEKEDKVVRAKGLSADRYALLGLPAEPFRDVQLLAPPRAGAAGTAIVPQTDQRSEGVTPFVFTIREFCARRMLPYVFSDASASLNLGFVIGNIEEKLFRLAAAQTGKGTGLIVHDWQFEDSETPPENLDFSELGGVNLQTFEQLISYLEYKLLEEREGEGDPKWVLKQSPGTLRAFTRRLRGVQKYLSPLIRGDLTPEQAEGYRPDPLRRGIQLTVVDIHALSAHAQMFVVGVLLREVFEYKERVGRQDTVFVVLDELNKYAPREGDSPIKDVLLDIAERGRSLGIILIGAQQTASEVERRIVSNAAIRVVGRLDLAEAERPEYRFLPQSFRGRAGILQPGTMLVSQPDVPNPVLVNYPFPAWATRRDEVDDLGGKAAAEVGAGLLR 618 F F Bacteria T 7wt5 3 C,F C,F RDRP_I97A1 8-mer model peptide RAGFVANF 8 T 6.1 Thiol_cytolys_C pdbhh T Viruses T 7wu2 1 A A GNAS2_HUMAN Guanine nucleotide-binding protein G(s) subunit alpha isoforms short MGHHHHHHENLYFQGIEKQLQKDKQVYRATHRLLLLGADNSGKSTIVKQMRILHGGSGGSGGTSGIFETKFQVDKVNFHMFDVGGQRDERRKWIQCFNDVTAIIFVVDSSDYNRLQEALNDFKSIWNNRWLRTISVILFLNKQDLLAEKVLAGKSKIEDYFPEFARYTTPEDATPEPGEDPRVTRAKYFIRDEFLRISTASGDGRHYCYPHFTCAVDTENARRIFNDCRDIIQRMHLRQYELL 243 T 4.6E-09 G-alpha pdb F Eukaryota T 7wug 5 E Y YD176_YEAST UNCHARACTERIZED PROTEIN YDL176W,Y55_G0042020.MRNA.1.CDS.1 MATGRIQFAVSTPCNTKGKPSGYRLFEFKNDRLALVPSERGCTKVDVNANIQAFCYLRPNGRDTSISPDATHILDSCDYMVLAKSNGFIEIISNYQYKIKNGLRLAPSYILRCTPEDFESNFFSDYMIAGLEYSQGLLYCCMCSGRIYVFVMNLPTDYIQYKNMYNPMFPDCFFKVHHDNNTTHSSEEEKLFEGSTRYTGRSCSKHICYFLLPIEPSHLRSSPVVSSFCNMYQGLPIYRPSMYLHIERGISTFHINPLDRFCFMTVSPRSPLFIRKIILPLTYVTFLSTFISLKNSIQGDTCGEILSWDNVAQQNGFGSLFSWISNKFTFDTDIINSTIWDDIVKYSGTGMLDSGIVWKQRQGHAKDDIYELFHTQDMLGSSRRNSSFSTASSEPRPLSRRRRESFQALTRDAFRERMDVPCSTKWELDSFIRGLRRNTFMVDFEIVEKISHRNGNDGVNEDDNTTDESDETMTSFLTDNYKKMDIVCIDHFVTLSAFRPRYYDEPIIKIDSLSNKNGSENGTNEEEWAESQMKVDGQVIDDETAQFKQALGNLCSFKKLFMLDDSLCFILDTHGVLLINRFEIKNTKNLLRNSKDTIRIIPHDFGLINDTIVIINDIDVGTDNVCALTFHLVVTSMAGEITVLKGEFFKNCRLGRIKLCDSLKLNRKDRFVDKLALIDYDGLNAQKRRLDYDEKDLYTFIVKKVKRD 708 T 14 BBS1 pdbhh F Eukaryota T 7wuw 1 A,B A,B B4XYC0_STREG AZI28 MASWSHPQFEKGGTHVAETSAPTRSEPDTRVLTLPGTASAPEFRLIDIDGLLNNRATTDVRDLGSGRLNAWGNSFPAAELPAPGSLITVAGIPFTWANAHARGDNIRCEGQVVDIPPGQYDWIYLLAASERRSEDTIWAHYDDGHADPLRVGISDFLDGTPAFGELSAFRTSRMHYPHHVQEGLPTTMWLTRVGMPRHGVARSLRLPRSVAMHVFALTLRTAAAVRLAEGATT 233 T 0.11 Rib unppercent F Bacteria T 7wwq 2 B B UFD1_HUMAN UBIQUITIN FUSION DEGRADATION PROTEIN 1,UB FUSION PROTEIN 1 IPNYEFKLGKITFIRN 16 T 8.1 Ribonuc_2-5A pdbhh F Eukaryota T 7wyg 1 A,B A,B CYPC_BACSU Cytochrome P450 152A1 HMDEQIPHDKSLDNSLTLLKEGYLFIKNRTERYNSDLFQARLLGKNFICMTGAEAAKVFYDTDRFQRQNALPKRVQKSIFGVNAIHGMDGSAHIHRKMLFLSLMTPPHQKRLAELMTEEWKAAVTRWEKADEVVLFEEAKEILCRVACYWAGVPLKETEVKERADDFIDMVDAFGAVGPRHWKGRRARPRAEEWIEVMIEDARAGLLKTTSGTALHEMAFHTQEDGSQLDSRMAAIELINVLRPIVAISYFLVFSALALHEHPKYKEWLRSGNSREREMFVQEVRRYYPFIPFLGALVKKDFVWNNCEFKKGTSVLLDLYGTNHDPRLWDHPDEFRPERFAEREENLFDMIPQGGGHAEKGHRCPGEGITIEVMKASLDFLVHQIEYDVPEQSLHYSLARMPSLPESGFVMSGIRRKS 418 T 0.34 p450 pdb F Bacteria T 7wzz 3 C C LYS-ALA-GLY-GLN-VAL-VAL-THR-ILE-TRP KAGQVVTIW 9 T 0.18 MepB pdbhh F T 7x00 3 C C VAL-SER-PHE-ILE-GLU-PHE-VAL-GLY-TRP VSFIEFVGW 9 T 0.037 EBV-NA3 pdbhh F T 7x14 2 B B MIGA2_MOUSE MIGA2 phospho FFAT motif SEDSFFSATE 10 T 0.91 Miga pdbhh F Eukaryota T 7x1b 3 C C LYS-ALA-GLY-GLN-VAL-VAL-THR-ILE KAGQVVTI 8 T 2.8 DUF1989 pdbhh F T 7x1t 3 C B mini-G alpha q protein MGSTVSAEDKAAAERSKMIDKNLREDGEKARRTLRLLLLGADNSGKSTIVKQMRILHGGSGGSGGTSGIFETKFQVDKVNFHMFDVGGERDERRKWIQCFNDVTAIIFVVDSSDYNRLQEALNDFKSIWNNRWLRTISVILFLNKQDLLAEKVLAGKSKIEDYFPEFARYTTPEDATPEPGEDPRVTRAKYFIRKEFVDISTASGDGRHICYPHFTCAVDTENARRIFNDCKDIILQMNLREYNLV 246 T 2.6E-10 G-alpha pdb F T 7x2e 2 B B CDHR2_MOUSE Cadherin-related family member 2 QQKKNLSFTNPGLDTTDL 18 T 5.2 ARF7EP_C pdbhh F Eukaryota T 7x3k 2 B B SSZ1_YEAST DNAK-RELATED PROTEIN SSZ1,HEAT SHOCK PROTEIN 70 HOMOLOG SSZ1,PLEIOTROPIC DRUG RESISTANCE PROTEIN 13 MSSPVIGITFGNTSSSIAYINPKNDVDVIANPDGERAIPSALSYVGEDEYHGGQALQQLIRNPKNTIINFRDFIGLPFDKCDVSKCANGAPAVEVDGKVGFVISRGEGKEEKLTVDEVVSRHLNRLKLAAEDYIGSAVKEAVLTVPTNFSEEQKTALKASAAKIGLQIVQFINEPSAALLAHAEQFPFEKDVNVVVADFGGIRSDAAVIAVRNGIFTILATAHDLSLGGDNLDTELVEYFASEFQKKYQANPRKNARSLAKLKANSSITKKTLSNATSATISIDSLADGFDYHASINRMRYELVANKVFAQFSSFVDSVIAKAELDPLDIDAVLLTGGVSFTPKLTTNLEYTLPESVEILGPQNKNASNNPNELAASGAALQARLISDYDADELAEALQPVIVNTPHLKKPIGLIGAKGEFHPVLLAETSFPVQKKLTLKQAKGDFLIGVYEGDHHIEEKTLEPIPKEENAEEDDESEWSDDEPEVVREKLYTLGTKLMELGIKNANGVEIIFNINKDGALRVTARDLKTGNAVKGEL 538 T 1.1E-15 HSP70 pdbpssm F Eukaryota T 7x45 1 A,B A,B C0LEE1_CTEID Interferon gamma MDSWLNMMLLCGLLLIASLQTTNAFRFRRSKSEMTHLETNIHSLQEHYKTRGTEWVSKSVFVPHLNQLNSKASCTCQALLLERMLNIYEELFQDMKSEHKEGRKDLDHLMDEVKKLRGNYKEEHKVWKELQEMNSVKVKNGTIRGGALNDFLMVFDRASTEKHKKVQ 167 T 0.00078 IFN-gamma pdbpercent F Eukaryota T 7x5c 1 A A TAP75_TETTS Telomerase-associated protein p75OB1 MEIEEDLNLKILEDVKKLYLQSFDYIKNGISSGGSGGSIDLSRITFLYKFISVNPTLLLINEKTQAKRRIFQGEYLYGKKKIQFNIIAKNLEIERELIQFFKKPYQCYIMHNVQVFQMLNKNKNNNVVEFMDSEDLQSSVDSQLYYLIDESSHVLEDDSMDFISTLTRLSDS 172 T 0.1 Clusterin pdb F Eukaryota T 7x5c 2 B B TAP50_TETTS Telomerase associated protein p50PBM QDDFGDGCLLQIVN 14 T 2.2 FAM47 pdbhh F Eukaryota T 7x5v 2 E E R1DBK9_EMIHU ion channel MIAAIHNARRKKREAAA 17 T 6 RNF111_N pdbhh F Eukaryota T 7x6z 2 B B R1AB_SARS2 peptide PHTVLQ 6 T 3.8 ATXN-1_C pdbhh T Viruses T 7x7n 2 D,E,F,G,H,I D,E,H,I,F,G Synthetic peptide SIH-5 DKEWILQKIYEIMRLLDELXDXEASMRVSDLIYEFMKK 38 T 0.11 ER pdbhh F T 7x8u 1 A A Q941Q8_SOLLC SW-5B NLR IMMUNE RECEPTOR SMAENEIEEMLEHLRRIKSGGDLDWLDILRIEELEMVLRVFRTFTKYNDVLLPDSLVELTKRAKLIGEILHRLFGRIPHKCKTNLNLERLESHLLEFFQGNTASLSHNYELNNFDLSKYMDCLENFLNDVLMMFLQKDRFFHSREQLAKHRSIKELKIVQKKIRFLKYIYATEINGYVDYEKQECLENRIQFMTNTVGQYCLAVLDYVTEGKLNEENDNFSKPPYLLSLIVLVELEMKKIFHGEVK 246 T 0.07 Dna2 pdb F Eukaryota T 7x9d 2 B,C B,C DNM3L_HUMAN DNA (cytosine-5)-methyltransferase 3-like GHMFETVPVWRRQPVRVLSLFEDIKKELTSLGFLESGSDPGQLKHVVDVTDTVRKDVEEWGPFDLVYGATPPLGHTCDRPPSWYLFQFHRLLQYARPKPGSPRPFFWMFVDNLVLNKEDLDVASRFLEMEPVTIPDVHGGSLQNAVRVWSNIPAIRSRHWALVSEEELSLLAQNKQSSKLAAKWPTKLVKNCFLPLREYFKYFS 204 F F Eukaryota T 7xbj 1 A,B A,B Q306L3_XENNE 40kDa insecticidal toxin MVIKPVTTPSVIQLTPDDRVTPDDKGEYQPVEKQIAGDIIRVLEFKQTNESHTGLYGIAYRAKKVIIAYALAVSGIHNVSQLPEDYYKNKDNTGRIYQEYMSNLLSALLGENGDQISKDMANDFTQNELEFGGQRLKNTWDIPDLENKLLEDYSDEDKLLALYFFASQELPMEANQQSNAANFFKVIDFLLILSAVTSLGKRIFSKNFYNGLETKSLENYIERKKLSKPFFRPPQKLPDGRTGYLAGPTKAPKLPTTSSTATTSTAASSNWRVSLQKLRDNPSRNTFMKMDDAAKRKYSSFIKEVQKGNDPRAAAASIGTKSGSNFEKLQGRDLYSIRLSQEHRVTFSINNTDQIMEIQSVGTHYQNI 368 T 0.00062 HigB-like_toxin pdbhh F Bacteria T 7xbm 1 A,B A,B PIKC_STRVZ CYTOCHROME P450 MONOOXYGENASE PICK,NARBOMYCIN C-12 HYDROXYLASE,PIKROMYCIN SYNTHASE CYP107L1 MGSSHHHHHHSSGLVPRGSHMRRTQQGTTASPPVLDLGALGQDFAADPYPTYARLRAEGPAHRVRTPEGDEVWLVVGYDRARAVLADPRFSKDWRNSTTPLTEAEAALNHNMLESDPPRHTRLRKLVAREFTMRRVELLRPRVQEIVDGLVDAMLAAPDGRADLMESLAWPLPITVISELLGVPEPDRAAFRVWTDAFVFPDDPAQAQTAMAEMSGYLSRLIDSKRGQDGEDLLSALVRTSDEDGSRLTSEELLGMAXILLVAGHETTVNLIANGMYALLSHPDQLAALRADMTLLDGAVEEMLRYEGPVESATYRFPVEPVDLDGTVIPAGDTVLVVLADAHRTPERFPDPHRFDIRRDTAGHLAFGHGIHFCIGAPLARLEARIAVRALLERCPDLALDVSPGELVWYPNPMIRGLKALPIRWRRGREAGRRTGLEHHHHHH 444 T 4.6999999999999995E-33 p450 unppssm F Bacteria T 7xc2 2 B,D,F,H,J D,B,F,H,J A0A5B0N367_PUCGR Avirulence factor VNFPFPKKMITESNSKDIREYLASTFPFEQQSTILDSVKSIAKVQIDDRKAFDLQLKFRQENLAELKDQIILSLGANNGNQNWQKLLDYTNKLDELSNTKISPEEFIEEIQKVLYKVKLESTSTSKLYSQFNLSIQDFALQIIHSKYKSNQISQNDLLKLITEDEMLKILAKTKVLTYKMKYFDSASKMGINKYISTEMMDLDWQFSHYKTFNDALKKNKASDSSYLGWLTHGYSIKYGLSPNNERSMFFQDGRKYAELYAFSKSPHRKIIPGEHLKDLLAKINKSKGIFLDQNALLDKRIYAFHELNTLETHFPGITSSFTDDLKSNYRKKMESVSLTCQVLQEIGNIHRFIESKVPYHSSTEYGLFSIPKIFSIPIDYKHGEKENLVSYVDFLYSTAHERILQDNSINQLCLDPLQESLNRIKSNIPVFFNL 434 T 0.31 Type2_restr_D3 pdbpssm F Eukaryota T 7xcb 1 A A IL9_MOUSE IL-9,CYTOKINE P40,T-CELL GROWTH FACTOR P40 CSTTWGIRDTNYLIENLKDDPPSKCSCSGNVTSCLCLSVPTDDCTTPCYREGLLQLTNATQKSRLLPVFHRVKRIVEVLKNITCPSFSCEKPCNQTMAGNTLSFLKSLLGTFQKTEMQRQ 120 T 0.0041 Dynamin_M unppssm F Eukaryota T 7xdi 2 D D A0A5Q0V0G9_9VIRU C131 MGTKIIINVIFFDIILALLMMSFASIQPPSIANPPTVAQAQAQANITWNLTVGSINWEWLWPVFYFVDWLIWIVTTIFAVVAFIFNVFTTSLSLLASVPIVGPFLLMFAVIINFVLIWEVVKLIRGYDNPG 131 T 8.1E-12 C166 pdbpercent T Viruses T 7xdi 3 E E A0A5Q0V0F6_9VIRU B210 MKWPLLLFTVLLIIGFTLIARAGTISLLSTPPVNPPAYSYFYIEFQFLPTNNTPQPYAIFVGPNPNNLTEVAEGYTLSNGTGYARVPVINAQTEYVDIVWVNQNYTMFEIFPQIQNATTTVTLSANNNQGFSFSLPTWVSWVIGAVLMLIFMGVGWKFMGPAGLAIFGIFGLFIAMFFGLLPSYLMYVILFIVAIVGARILTKQLGGGEE 210 T 0.01 DUF5489 pdb T Viruses T 7xdi 4 F F A0A5Q0V0A2_9VIRU VP4 MKRVFLLYIIGILLTLFLPLIQTQSAVSLPPLYVEDAVNAEIQQLWSKSPTGVYAFHEAPSVNNSFWPDDNAKFLESIAPWWQSYSSYVNSTLQFLQQSDVNGLFIKRFEYPLNPLQSITIGNLSGYTNGFYDIVGNPLLNSMRIATYYNPTLAVTYLFGNVVQYPNGILVNIEQGLENPITDGGFGGTGGQNPPWESLNSSSLVNDSIVSIVNNAKTYLNLTGPTFFGTPSEELQYNFPIVNVLPHYLAFQNVNGILGQYNYQGKFIPFNVTLVLQSSSINRIYLEFIWENSTSGTYVLTDIPVYFTANGQWQQVVVTVPASAWPKYWNLGALSAVPLLIGIGLDLPGSSPSQTGPTGVYVGDIATNYPTTFGPQFNVTNKGSYVVFNESWKSDSLGATFWIAYVLGQGNAIEVLASAPVNQSWIYVGYNGLATIGTGYTILETPSGILKNYQNSGNISWTYLGPNFGKWMLLSTNYAPNWIGDFQMLFIFPMAGTSNPYMDTLNNAVYMGDPTEVRNTLYFGNYTTLPGYFQWVQIAYQNDGNTSGVFGFFLIPSVDYLVNPSVIVNDMFPSSLTAYSPSSIPNYWWEAVWGENYYEGEIIYALALLGKYGNSQALQMAQQAWLSYYNQLKAYNGATYTSSLARFIMATILLYNITGNTQYSNAYTQLANWLLQYQNQSKYAYVYIPMWYHKDVDVPSVNGFATYGYIINRTAQMDVGTVISGTSIGLNFFEDIPLNTSYGIYLLTNGTGKLPFTYQNVLNVSGTFITYLYMNGGGTATTANITITVQIAYNGNVLQTIGTAAVDNVPIQPGGISGSPPFYPVKIVVPVLTTVNAPPGSTLIIGWNIKAPQTVYVLIDSTNGPSNVTIPLSWPNPFYGLFTIPKIYNPNPGVHNYPQPYFLDISAMAGQAMMALYAVTKNITYLLDAQLVMNAIHYGPVPMPTYGILGVPNPPVEPRLWVYANYSTVDADYYTYKSELVSEFGDAIGNNTLASLAISRVWQRTSYTYPTSYIYYVARYGSGLQMNSETQPWGDVATQFYVNTWSPSNLDLFWASLPNNNYITNQTWNGTALFIHLYAYQQSQVQLIFLTTTVNFNVLVNGNYTNYEANHQIMQIAPTLEPGPNTIIIIPNPKNQVSQNTNISTTTTTSPLSNAISGLGITLTQNELMLLGFVIYFVIIMVTYGVSRNKTITVLSSIVAVAIVYALALWPTYMAFILGAVGFFMLFYSISRREEE 1236 T 0.0037 VKOR pdbpercent T Viruses T 7xds 1 A,B A,B A0A5B0N367_PUCGR AvrSr35 HHHHHHSSGVDLGTENLYFQSNAMRNFAADRVHGVESVISGSKSSSNPMALSKSMDKPDTSDLVDSNVQAKNDGSRYEEDFTAKYSEQVDHVSKILKEIEEQEPGTIIIDHKAFPIQDKSPKQVVNFPFPKKMITESNSKDIREYLASTFPFEQQSTILDSVKSIAKVQIDDRKAFDLQLKFRQENLAELKDQIILSLGANNGNQNWQKLLDYTNKLDELSNTKISPEEFIEEIQKVLYKVKLESTSTSKLYSQFNLSIQDFALQIIHSKYKSNQISQNDLLKLITEDEMLKILAKTKVLTYKMKYFDSASKMGINKYISTEMMDLDWQFSHYKTFNDALKKNKASDSSYLGWLTHGYSIKYGLSPNNERSMFFQDGRKYAELYAFSKSPHRKIIPGEHLKDLLAKINKSKGIFLDQNALLDKRIYAFHELNTLETHFPGITSSFTDDLKSNYRKKMESVSLTCQVLQEIGNIHRFIESKVPYHSSTEYGLFSIPKIFSIPIDYKHGEKENLVSYVDFLYSTAHERILQDNSINQLCLDPLQESLNRIKSNIPVFFNLASHSSPIKPSNVHEGKL 575 T 0.44 Type2_restr_D3 pdbpssm F Eukaryota T 7xf2 1 A,B A,B K7WK08_9VIRU VP51 MSASLILDEYLKKTASAVLDVADSFEKIKGEIQSPEEAAALSVALYGAPPKPSASAVASIITGERTSLNDKYLSDNVLLKMSVARVGQENNRKRADQAADEIRTIMEDITGSLSGAYRQYSPLEEENKVHIGIMNNKTPSIVCGYYTMDTSISSEPLSLTDFQNPTVIANVTKRMESIFSKVDSARSTRFDAFVNGVANNMDIKSSIDWANMVENVIKLPDSTPNPCSVDTIVSRDASVVKTAVNDIYASVGKSYCRPATQLTFMSEIEKLRKAAVVCFEALMSDTRERAFVEFLFYVSFKEDASNTNSKLFVQNKLSSMSGNPRQPIKLVRRSAEETLFGLCFMFKVMPPEFMNCIFNFPTIPHSTQYHGLYGTCLTPLLRKYGSSFEKSWAHFEEILSERANAVKKFGVNDTRIDCLDAVANLTGPVYVLILDLVRTLSAQRSCSTKFLREIKENYLLWNRFVS 466 T 0.068 7TMR-HDED pdbpercent T Viruses T 7xfr 1 A,C A,C WIPI2_HUMAN WIPI-2,WIPI49-LIKE PROTEIN 2 GPGSGQLLFANFNQDNTSLAVGSKSGYKFFSLSSVDKLEQIYECTDTEDVCIVERLFSSSLVAIVSLKAPRKLKVCHFKKGTEICNYSYSNTILAVKLNRQRLIVCLEESLYIHNIRDMKVLHTIRETPPNPAGLCALSINNDNCYLAYPGSATIGEVQVFDTINLRAANMIPAHDSPLAALAFDASGTKLATASEKGTVIRVFSIPEGQKLFEFRRGVKRCVSICSLAFSMDGMFLSASSNTETVHIFKLETVKEQGRAFATVRLPFCGHKNICSLATIQKIPRLLVGAADGYLYMYNLDPQEGGECALMKQHRLDGSLE 321 T 0.00025 WD40 pdbpercent F Eukaryota T 7xhf 2 C,D C,D USP10/6-21 PQYIFGDFSPDEFNQF 16 T 1.2 Methyltrans_RNA pdbhh F T 7xhg 2 E,F,G F,G,E Caprin-1(369-378) PYNFIQDSML 10 T 4E-05 Caprin-1_C pdbhh F T 7xhn 1 A,H o,O CENPO_HUMAN CENP-O,INTERPHASE CENTROMERE COMPLEX PROTEIN 36 MEQANPLRPDGESKGGVLAHLERLETQVSRSRKQSEELQSVQAQEGALGTKIHKLRRLRDELRAVVRHRRASVKACIANVEPNQTVEINEQEALEEKLENVKAILQAYHFTGLSGKLTSRGVCVCISTAFEGNLLDSYFVDLVIQKPLRIHHHSVPVFIPLEEIAAKYLQTNIQHFLFSLCEYLNAYSGRKYQADRLQSDFAALLTGPLQRNPLCNLLSFTYKLDPGGQSFPFCARLLYKDLTATLPTDVTVTCQGVEVLSTSWEEQRASHETLFCTKPLHQVFASFTRKGEKLDMSLVS 300 F F Eukaryota T 7xhs 1 A A A0A2S8QTL8_PHOLU Cro/Cl family transcriptional regulator MINDMHPSLIKDKDIVDDVMLRSCKIIAMKVMPDKVMQVMVTVLMHDGVCEEMLLKWNLLDNRGMAIYKVLMEALCAKKDVKISTVGKVGPLGCDYINCVEISM 104 T 7.9 HU-CCDC81_bac_1 pdbhh F Bacteria T 7xi1 1 A A anti-CRISPR protein AcrIF24 MNAIHIGPFSITPAARGLHYGGLPHHQWTLYYGPREMAIKTLPDSYTSSEVRDEFSDIIAEFVIDARHRYAPDVLELVNSDGDAVLARVAVSRLPEALSGCIPDDRFPYWLLTASRPRLGLPVTLNEYTALAVELSAPPLAWITGLLPGEVLTHDAEEWRPPTSWELRHVVGEGSFTGVSGAAAAALLGMSATNFRKYTAGDSAANRQKISFAAWHYLLDRLGVKRASLEHHHHHH 236 T 0.00065 DUF4447 pdbhh F T 7xjj 1 A A A0A1W2PP38_HUMAN;GNAO_HUMAN G protein subunit alpha o1,Guanine nucleotide-binding protein G(o) subunit alpha MGCTLSAEERAALERSKAIEKNLKEDGISAAKDVKLLLLGADNSGKSTIVKQMKIIHGGSGGSGGTTGIVETHFTFKNLHFRLFDVGGQRSERKKWIHCFEDVTAIIFCVDLSDYNRMHESLMLFDSICNNKFFIDTSIILFLNKKDLFGEKIKKSPLTICFPEYTGPNTYEDAAAYIQAQFESKNRSPNKEIYCHMTCATDTNNAQVIFDAVTDIIIANNLRGCGLY 228 T 1.2E-123 G-alpha unp F Eukaryota T 7xjk 2 B B Guanine nucleotide-binding protein G(q) MGSTVSAEDKAAAERSKMIDKNLREDGEKARRTLRLLLLGADNSGKSTIVKQMRILHGGSGGSGGTSGIFETKFQVDKVNFHMFDVGGQRDERRKWIQCFNDVTAIIFVVDSSDYNRLQEALNDFKSIWNNRWLRTISVILFLNKQDLLAEKVLAGKSKIEDYFPEFARYTTPEAATPEPGDDPRVTRAKYFIRKEFVDISTASGDGRHICYPHFTCAVDTENARRIFNDCKDIILQMNLREYNLV 246 T 2.5E-10 G-alpha pdb F T 7xjl 2 B B Guanine nucleotide-binding protein G(q) subunit alpha GPMGSTVSAEDKAAAERSKMIDKNLREDGEKARRTLRLLLLGADNSGKSTIVKQMRILHGGSGGSGGTSGIFETKFQVDKVNFHMFDVGGQRDERRKWIQCFNDVTAIIFVVDSSDYNRLQEALNDFKSIWNNRWLRTISVILFLNKQDLLAEKVLAGKSKIEDYFPEFARYTTPEAATPEPGDDPRVTRAKYFIRKEFVDISTASGDGRHICYPHFTCAVDTENARRIFNDCKDIILQMNLREYNLV 248 T 2.5E-10 G-alpha pdb F T 7xl7 1 A,B A,B A0A2J0R8J6_SALTM Uncharacterized protein MKEGFYWIQHNGRVQVAYYTHGVTEDLETGQTIIGVWHLTQGDDICHNGEAEILAGPLEPPI 62 T 4.9 Cuticle_2 pdbhh F Bacteria T 7xml 2 B,D C,D GP60_BPSP1 PEIP MLNQVEVLREEYVEGYVVQMWRRNPSNAPVIEVFTEDNLEEGIIPEYVTANDDTFDRIVDAVEFGYLEELELV 73 T 0.12 NtrY_N pdb T Viruses T 7xmw 1 A,B A,B U2Q5N5_LEPWF AcrVIA2 HHHHHAMWKCKKCGCDRFYQDITGGISEVLEMDKDGEVLDEIDDVEYGDFSCAKCDNSSSKIQEIAYWDEIN 72 T 0.011 CpXC unphh F Bacteria T 7xnj 1 A,B,C,D A,B,C,D A6V4P9_PSEA7 Stress Response Facilitator A, SrfA MAESQDKYTRRTGRTWADDQATYNRLREEADAARQKLRESGYSGAEYDQLRQAAFDLNRKANQYWEQMLSDLRQED 76 T 0.051 Flg_hook pdb F Bacteria T 7xnm 2 C,D C,D ILE-LEU-ALA-PRO-PRO-GLU-ARG ILAPPER 7 T 43 DUF5543 pdbhh F T 7xno 4 D,H,L B,F,J SAIA_LATSK Sakacin-A immunity factor MKHHHHHHHGAAGTSLYKKAGENLYFQGSMKADYKKINSILTYTSTALKNPKIIKDKDLVVLLTIIQEEAKQNRIFYDYKRKFRPAVTRFTIDNNFEIPDCLVKLLSAVETPKAWSGFS 119 T 0.06 DUF5112 pdb F Bacteria T 7xox 6 F L cck-8 FDMWGMYDEAYGWMDF 16 T 0.0021 Gastrin pdbhh F T 7xp4 1 A A GNAS2_HUMAN Guanine nucleotide-binding protein G(t) subunit alpha-3 MDYKDDDDKENLYFQSNSKTEDQRNEEKAQREANKKIEKQLQKDKQVYRATHRLLLLGADNSGKSTIVKQMRILHGGSGGSGGTSGIFETKFQVDKVNFHMFDVGGQRDERRKWIQCFNDVTAIIFVVDSSDYNRLQEALNDFKSIWNNRWLRTISVILFLNKQDLLAEKVLAGKSKIEDYFPEFARYTTPEDATPEPGEDPRVTRAKYFIRDEFLRISTASGDGRHYCYPHFTCAVDTQNVKFVFDAVTDIIIKENLKDCGLF 264 T 5.1E-10 G-alpha pdb F Eukaryota T 7xpk 2 B,D B,D A0A0E0IIA9_ORYNI Alpha-aminoacylpeptide hydrolase PPKRRAISAIRKFPRDCG 18 T 5.8 DUF3343 pdbhh F Eukaryota T 7xql 1 A,B B,A Q5ZSR1_LEGPH ANK_REP_REGION DOMAIN-CONTAINING PROTEIN MLTPPPDSKISTTDKSLDKLSAPLDMLKQMNESTMEQTKLDELRKKMSLQAEILNKAKADNDMFFRLLIELMSLKLQGELFKEQLSKISKESGYDSAQSALIQATNSEGQSPLQYALQKQDFSTAKYFLDNGAKAGPIEKAVFEIALDSKAAKEFGFPPLPPEKEKLHPVKNFGLVLGIKTTSVDGTPSQFGHIAPTYQLMTDSVSHFAKSHPGNKNFQEIANAFQFSNEASAFKFSTPQRNPEAGNDLARRIQGGELTTIPVSCKGHAMGLSYVPDGPGSKSGYLVYTNRGLGAKSSEHGTHIFRIEDSSKITPEFINNMTSGHSNGASHDEIMSQIKAAAGNKEPIHHIKQKGQKNDNCTIANSKSNIEGILLCQKAREVGGFDKLTESDMDSVKKEYKEFTKHMRVEKVNELAKALKENPQDPDLNNLTKEYLKQHPNADPKLKQTLETALKQASESSMTLSQPGKTI 471 T 0.00015 Shigella_OspC pdbhh F Bacteria T 7xqp 14 N O A0A2K1JDE1_PHYPA PsaO FNRDWLRKDLSVIGFGLIGWLAPSSLPVINGNSLTGLFLGSIGPELAHFPTGPALTSPFWLWMVTWHVGLFIVLTLGQIGFKGRQDGYW 89 T 0.1 Plasmid_RAQPRD unppercent F Eukaryota T 7xr2 1 A,B A,B E9LEU6_9REOV VP3 MASTTRLVNDRKQLEQQVKDDARILADARGLNITTVANDSATGGQAIRNVGPNDEATIKALDNVIKQIEALSVIVNRSEKADDAQILGPNTYKQLLEHLFSPEENVYILLPIQAYTGGVIDRRDASFSNFAYSIASKLMMELSAATHNKIFTDYTRIAASALGPEISTEGMPLFSLIESLELTEAETSRLPVIQDSMVIQKSTATVGNAQQGISTINIKRVPFVGSAFQQVIDQLLWEYSTTSLTTKEQRRQRITEMVNDRRIMIQKLTLAEKPQVMRHVTTEINNDLFFKMSPVAQLYIYHLDRAFLDGVGFTPLAEKQQQLQLQLKTNILTANLIRSAINGMNTESNLEVAIKMMQAAQLHRASIEIAFPMNVSLSPEIIVQCFIVWMSIPEQLLSDRSNFIIAAVIWAGFSADDSYADIMRRSARASDRQNYDIIKAALSSRKFKLPRASTTLFDENEPVVRRYQIGRVYAPFPVDRYGSPVYSNCTKVELASDYNAEGFTIRKDDFRALQAVLRIDEDRAADMFTTLRIMISSIPAVWYDAEVVHYPHTAVELEQLAAYGLTGAYPRTNHSVDTIVKTVNNISATYSTIAQMLSTIDLDPTRYGTSESIDKFKIAWENVESVLNMEGNDFVKTIMYAYEDNFPKKDFYMMLKQIASDGQGAHPIAAAIDQLRTIVYREPERFGYIDSVILTHNPDVDTAYNRFFHLHPIVTNQPSNTIKNAQLWNEMRLEQQVEHIKAGPVRIIGPFHVTYNYLSEEEDMPATSHIIMKDNMILNDHLTFNFVKRERRNNKKRVSSFRYKAVEMYVAVRISRFQLEVLRDLHDLVRSRTYLDVSKSPLATTPIRVVEYVR 854 T 6.7 DUF4982 pdbhh T Viruses T 7xr2 2 C,D 1,2 G9BDA7_9REOV VP11 MNWSKAINFQPFMLETRPPLTTIPIMDQLVEIGERSNQKWSMTDRLFFAIRKINPIFVTSSQIPSKFDYTILQMPTQLIASLKETLLFLAFSYYLREYQDKVGQMKFYPVAMKNMIPIVNYLKDRVHNNFDTTLEQAYRQNVVHTLSASDAFDLLSGMIATTRLDLIQRTRICPELLNVLNKMSFILIYAPNRPSILSWKNQS 203 T 0.11 MgtE_N unppercent T Viruses T 7xr2 3 E,F,G,H,I,J,K,L,M,N,O,P,Q a,b,c,d,e,f,g,h,i,j,k,l,m G9BDA8_9REOV VP12 MNLEINNFAPAISSIGSQLCSLSAQKLLTCRKQYGNGAKSFEEFYAEIGGIIGMMGINSQTPSGIREAIYRLYQSAFLFGDIFPESFGIQNTQNIKPPPGFTAPAKKLEVVLPQGGAFDLIYNNGEIRVTTTRNVQAGDLVCTVTFPIQGSVIATRNCHVNEIGGQLTTTRPEIIASVPMPARTVIVASFDAIEIGYGEGDDLFAIGIAILSNRFNGQITPMSRHNYMTQMFANLPANMSERDSSAVLHFAQAAPVVLGMMERLTGAPKWVLDY 274 T 0.082 SopE_GEF pdb T Viruses T 7xr3 2 K Z G9BD97_9REOV VP1 MRIMAQRLKELQREIDKKKKERIAEAYLSSVEVTNSSPSLSKQDDALTLPKVSPFLDSTPFTTLHNSLYGQQIHSIDDELAQICKLEYELQTQIADEQITALKHFLTIRTGSPQEIQYVDKEWMKSNQHVPSFLGDVKLMFGDTAGKFRSTSKSVDSIHSITSDVQVTRKKQTRSQIRNSYRVQKKHKVQQPLKPNTLYVYKYKGLPRVVLRFVPKVDTTSNSNSSSASDSKKDKDAFSCDDLSPTWKYILTEAKRAFPDRSYSDCIHPMTWEEWLEENQDHVKVLTQYAHQLDYVTLLQDFNLYVSGGASRVRNIDMSTLPTSINVLDHFELYGDASMKEYVRSGEWYGLLREIEQEGMTVNESEKVFANPDTYVLNVKKYFLRRFQQEIASTGMTPLTDELLNIMFVHWNIIVTAEPKLQVIKDDLLKYYSRYGVDATFDYNMKRSEMTVVTRGHLLAHKVLECALRIVETIYTYDIQDETFKDILIDLGRLIMRDPIYGTTTVRDATTVMKQLMYTQGTQFRRIMFKKYDYSNFNEKLVLKGEQMTNEPPTLLATTHYEEMDKKRIDALIKANQRAGNILSQSSIERCRYTDSLDLVGDANRYFSALTTLEAVAGFASSDLLSGFIDSNESIEFTGTAHLRKLLYHSVREQITTLNTSTVPRPSLPKVLLSSAKDTASASIEPLTFRIYKTTPEYDGESLNLVESTVEMSTRQKKPNLMKAAEILRSTVTTNQEMIISGGTRAVQGGKGARAVYPTKQPYHIAGSLLFHKVDTIVNANKKYRGVSNKYGQGISNAIPHIGVPEIIAVSSDGMAICLALDVSAFDVAQKYTEADIELAMRDGFLDSEISMISGETVLERMNPADLANNLLTNTPPRYKYQTALGDIIILQHDNRSGVPWTGTQNDLVNVSNHHMAYDEYKKRVAELQRQGKISIDVNDKHHIVRVFGDDSTFIMTYDEPPSAEEVHLMCATFVESYQDTAGTLGFAINARKGMIGRYGSEYLKNSAIYGNIKSVNQVKFRGSEKSASYHFGVSEKVSMIRDITDLTITRGCDETRKWKYNLMMLPVDLTTRAGAFRMHNLCSIMTGVGKMYLGGTLNNKLIASYHGSSFGWNFDDNLIKTANSIGAISDSSYDAISTKITNLADFKDSQQRITRDIITSGRLPQHLNRYGKSNILRHILASAAMGPLSQIEKNVNAYNVVMGILNGKLEAPTVLERLNMGFKYVVMSDLKQDDYSPYSCQGLQYRRMLVHWGLNDSRITSFDPKGKLQHLLAKNSQILPIHFDIEFVYRLYLQAGTMGFLQVMSYYQLPDTLTHEMLAAVVALELQLGNDKYAVDMGVYSSQAGQIRINDALMDSIIQHRRGPPLPIIDRTLNRLLLHTYMLMFGLMGKSIDSTKIDPTLSWRAILESNDQRIAQLSELLTAV 1425 T 0.021 FAM220 pdb T Viruses T 7xrw 1 A A C1JEX5_TRYBB Repressor activator protein 1 GPGSEATEEIAALDQPFEKCFIPTEALGSDREGLDRTQLERQLPFRNYPIKLNVSKSGIFCQFPTVSDAKRFYEEGTVEILNRSLPIKPVFEKRNETVAPAERKRRRSVSPGGVHPQTAAVSALSRR 127 T 0.012 VIR_N unp F Eukaryota T 7xsj 3 C C APBA1_RAT ADAPTER PROTEIN X11ALPHA,NEURON-SPECIFIC X11 PROTEIN,NEURONAL MUNC18-1-INTERACTING PROTEIN 1,MINT-1 GPGSEFYRQEALGARLHHYDERSDGESDSPEKEAEFAPYPRMDSYEQEEDIDQIVAEVKQSMSSQSLDKAAEDMPEAEQDLER 83 T 0.1 ACP_PD pdb F Eukaryota T 7xtg 2 B,F,J B,F,J SAIA_LATSK Sakacin-A immunity factor ADYKKINSILTYTSTALKNPKIIKDKDLVVLLTIIQEEAKQNRIFYDYKRKFRPAVTRFTIDNNFEIPDCLVKLLSAVETPKAWSGFS 88 F F Bacteria T 7xtl 1 A,B A,B MGT4A_HUMAN MGAT4A MGSSHHHHHHSSGLVPRGSHMASKIHVNPPAEVSTSLKVYQGHTLEKTYMGEDFFWAITPIAGDYILFKFDKPVNVESYLFHSGNQEHPGDILLNTTVEVLPFKSEGLEISKETKDKRLEDGYFRIGKFENGVAEGMVDPSLNPISAFRLSVIQNSAVWAILNEIHIKKATN 172 T 0.58 NADase_NGA pdbhh F Eukaryota T 7xuv 2 B B RMI1_HUMAN BLM-ASSOCIATED PROTEIN OF 75 KDA,BLAP75,FAAP75 SGSDEELLASLDENDELTANND 22 T 23 DUF4293 pdbhh F Eukaryota T 7xv3 3 C A Engineered G protein subunit S (mini-Gs) MGCTLSAEDKAAVERSKMIEKQLQKDKQVYRATHRLLLLGADNSGKSTIVKQMRIYHVNGYSEEECKQYKAVVYSNTIQSIIAIIRAMGRLKIDFGDSARADDARQLFVLAGAAEEGFMTAELAGVIKRLWKDSGVQACFNRSREYQLNDSAAYYLNDLDRIAQPNYIPTQQDVLRTRVKTSGIFETKFQVDKVNFHMFDVGAQRDERRKWIQCFNDVTAIIFVVDSSDYNRLQEALNDFKSIWNNRWLRTISVILFLNKQDLLAEKVLAGKSKIEDYFPEFARYTTPEDATPEPGEDPRVTRAKYFIRDEFLRISTASGDGRHYCYPHFTCSVDTENARRIFNDCRDIIQRMHLRQYELL 361 F F T 7xv4 2 B B ATRIP_HUMAN ATM AND RAD3-RELATED-INTERACTING PROTEIN GDFTADDLEELDTLASQ 17 T 1.6 Med21 unppercent F Eukaryota T 7xv7 1 A,B B,A ZY11B_HUMAN Protein zyg-11 homolog B GYINVGAQLGTELFIVRQLLQIVKQKTNQNSVDTTLKFTLSALWNLTDESPTTCRHFIENQGLELFMRVLESFPTESSIQQKVLGLLNNIAEVQELHSELMWKDFIDHISSLLHSVEVEVSYFAAGIIAHLISRGEQAWTLSRSQRNSLLDDLHSAILKWPTPECEMVAYRSFNPFFPLLGCFTTPGVQLWAVWAMQHVCSKNPSRYCSMLIEEGGLQHLYNIKDHEHTDPHVQQIAVAILDSLEKHIVR 250 T 0.00014 DUF4704 pdb F Eukaryota T 7xva 2 B B C9JGY8_HUMAN Juxtaposed with another zinc finger protein 1 QQPTYVALSYINRFMTDAARREQES 25 T 9.1 Herpes_U34 pdbhh F Eukaryota T 7xvg 2 B B A0A5B0N367_PUCGR AvrSr35 HHHHHHSSGVDLGTENLYFQSNANSKDIREYLASTFPFEQQSTILDSVKSIAKVQIDDRKAFDLQLKFRQENLAELKDQIILSLGANNGNQNWQKLLDYTNKLDELSNTKISPEEFIEEIQKVLYKVKLESTSTSKLYSQFNLSIQDFALQIIHSKYKSNQISQNDLLKLITEDEMLKILAKTKVLTYKMKYFDSASKMGINKYISTEMMDLDWQFSHYKTFNDALKKNKASDSSYLGWLTHGYSIKYGLSPNNERSMFFQDGRKYAELYAFSKSPHRKIIPGEHLKDLLAKINKSKGIFLDQNALLDKRIYAFHELNTLETHFPGITSSFTDDLKSNYRKKMESVSLTCQVLQEIGNIHRFIESKVPYHSSTEYGLFSIPKIFSIPIDYKHGEKENLVSYVDFLYSTAHERILQDNSINQLCLDPLQESLNRIKSNIPVF 441 T 0.32 Type2_restr_D3 pdbpssm F Eukaryota T 7xwo 6 F D HIS-LYS-THR-ASP-SER-PHE-VAL-GLY-LEU-MET-NH2 HKTDSFVGLMA 11 T 0.046 Tachykinin pdbhh F T 7xx2 2 B B A0A5B0N367_PUCGR AvrSr35 HHHHHHSSGVDLGTENLYFQSNAMRNFAADRVHGVESVISGSKSSSNPMALSKSMDKPDTSDLVDSNVQAKNDGSRYEEDFTAKYSEQVDHVSKILKEIEEQEPGTIIIDHKAFPIQDKSPKQVVNFPFPKKMITESNSKDIREYLASTFPFEQQSTILDSVKSIAKVQIDDRKAFDLQLKFRQENLAELKDQIILSLGANNGNQNWQKLLDYTNKLDELSNTKISPEEFIEEIQKVLYKVKLESTSTSKLYSQFNLSIQDFALQIIHSKYKSNQISQNDLLKLITEDEMLKILAKTKVLTYKMKYFDSASKMGINKYISTEMMDLDWQFSHYKTFNDALKKNKASDSSYLGWLTHGYSIKYGLSPNNERSMFFQDGAKYAELYAFSKSPHRKIIPGEHLKDLLAKINKSKGIFLDQNALLDKRIYAFHELNTLETHFPGITSSFTDDLKSNYRKKMESVSLTCQVLQEIGNIHRFIESKVPYHSSTEYGLFSIPKIFSIPIDYKHGEKENLVSYVDFLYSTAHERILQDNSINQLCLDPLQESLNRIKSNIPVFFNLASHSSPIKPSNVHEGKL 575 T 0.44 Type2_restr_D3 unppssm F Eukaryota T 7xxf 7 KA,LA,MA,NA,OA,PA,QA,RA,SA,TA,UA a,b,c,d,e,f,g,h,i,j,k Light-harvesting protein LH1 Gamma-like MAMVWMWILIAPAIGIVLLSRQ 22 T 0.59 DUF1514 pdbhh F T 7xya 9 J G A0A2R3ITY7_PSEAI AlpA MFQSTEQALAVAYWMFEQQPGPRSSTAMVIDSLRERFDRRFIERLPSGLSPHEWQAQAVMTVRFAQRQLAAHPLELAVVRAEFARGRDFVLGLAALRDWLKPAAGPIEQRAALALLMRMFRRPPSSIREIERLSGLSKSTLHRWDKEWRERVAALLRQALLRLEEPMAQVGIVCEH 176 T 0.00097 HTH_IclR pdbpercent F Bacteria T 7xys 1 A,B,C,D A,B,C,D ZER1_HUMAN HZYG,ZYG-11 HOMOLOG B-LIKE PROTEIN,ZYG11B-LIKE PROTEIN SFLHVGKMGFVVTMLKLIQKKLLDKTCDQVMEFSWSALWNITDETPDNCEMFLNFNGMKLFLDCLKEFPEKQELHRNMLGLLGNVAEVKELRPQLMTSQFISVFSNLLESKADGIEVSYNACGVLSHIMFDGPEAWGVCEPQREEVEERMWAAIQSWDINSRRNINYRSFEPILRLLPQGISPVSQHWATWALYNLVSVYPDKYCPLLIKEGGMPLLRDIIKMATARQETKEMARKVIEHCSNFKEENMDTSR 253 T 0.0027 Arm_3 pdbpercent F Eukaryota T 7xyt 1 A,B,C,D A,B,D,C ZER1_HUMAN HZYG,ZYG-11 HOMOLOG B-LIKE PROTEIN,ZYG11B-LIKE PROTEIN AFLHVGKMGFVVTMLKLIQKKLLDKTCDQVMEFSWSALWNITDETPDNCEMFLNFNGMKLFLDCLKEFPEKQELHRNMLGLLGNVAEVKELRPQLMTSQFISVFSNLLESKADGIEVSYNACGVLSHIMFDGPEAWGVCEPQREEVEERMWAAIQSWDINSRRNINYRSFEPILRLLPQGISPVSQHWATWALYNLVSVYPDKYCPLLIKEGGMPLLRDIIKMATARQETKEMARKVIEHCSNFKEENMDTSR 253 T 0.0027 Arm_3 pdbpercent F Eukaryota T 7xyu 1 A,B,C,D B,A,C,D ZER1_HUMAN HZYG,ZYG-11 HOMOLOG B-LIKE PROTEIN,ZYG11B-LIKE PROTEIN TFLHKMGFVVTMLKLIQKKLLDKTCDQVMEFSWSALWNITDETPDNCEMFLNFNGMKLFLDCLKEFPEKQELHRNMLGLLGNVAEVKELRPQLMTSQFISVFSNLLESKADGIEVSYNACGVLSHIMFDGPEAWGVCEPQREEVEERMWAAIQSWDINSRRNINYRSFEPILRLLPQGISPVSQHWATWALYNLVSVYPDKYCPLLIKEGGMPLLRDIIKMATARQETKEMARKVIEHCSNFKEENMDTSR 251 T 0.0027 Arm_3 pdbpercent F Eukaryota T 7xyv 1 A,B A,B ZY11B_HUMAN Protein zyg-11 homolog B SFLHVGAQLGTELFIVRQLLQIVKQKTNQNSVDTTLKFTLSALWNLTDESPTTCRHFIENQGLELFMRVLESFPTESSIQQKVLGLLNNIAEVQELHSELMWKDFIDHISSLLHSVEVEVSYFAAGIIAHLISRGEQAWTLSRSQRNSLLDDLHSAILKWPTPECEMVAYRSFNPFFPLLGCFTTPGVQLWAVWAMQHVCSKNPSRYCSMLIEEGGLQHLYNIKDHEHTDPHVQQIAVAILDSLEKHIVR 250 T 0.00014 DUF4704 pdb F Eukaryota T 7xyw 1 A,B B,A ZY11B_HUMAN Protein zyg-11 homolog B AFLHVGAQLGTELFIVRQLLQIVKQKTNQNSVDTTLKFTLSALWNLTDESPTTCRHFIENQGLELFMRVLESFPTESSIQQKVLGLLNNIAEVQELHSELMWKDFIDHISSLLHSVEVEVSYFAAGIIAHLISRGEQAWTLSRSQRNSLLDDLHSAILKWPTPECEMVAYRSFNPFFPLLGCFTTPGVQLWAVWAMQHVCSKNPSRYCSMLIEEGGLQHLYNIKDHEHTDPHVQQIAVAILDSLEKHIVR 250 T 0.00014 DUF4704 pdb F Eukaryota T 7xyx 1 A,B B,A ZY11B_HUMAN Protein zyg-11 homolog B CFLHVGAQLGTELFIVRQLLQIVKQKTNQNSVDTTLKFTLSALWNLTDESPTTCRHFIENQGLELFMRVLESFPTESSIQQKVLGLLNNIAEVQELHSELMWKDFIDHISSLLHSVEVEVSYFAAGIIAHLISRGEQAWTLSRSQRNSLLDDLHSAILKWPTPECEMVAYRSFNPFFPLLGCFTTPGVQLWAVWAMQHVCSKNPSRYCSMLIEEGGLQHLYNIKDHEHTDPHVQQIAVAILDSLEKHIVR 250 T 0.00014 DUF4704 pdb F Eukaryota T 7xzi 3 C 5 Ctap5 MQLGQLRQPLRACQDQRLTRGVPLARRQLVVVSNWNPLGGKGGGNAEQPGGEEPIQDELLKLLRGGWVLLSNLALFLVFSSFLHRSLNWFVQTELLVAVGAPQQAGERVVGKFFEAIEWVERNILGWKLPGDEEAEDATSKVYEVLQNYTPAEAAYSFAQLKYKDLTHKERELFHKAYALRHFERRDGRPGDVDAAELQAVKDRLDPLEADRRAYAAAKAAGRLDEYWAAPGREATYQRIVGAPRIAARQCEMASMLKGLQAVLPAMELLAQLQVAQFVYAASKASKSRQQDDFKLQLQTFYGNVLDEQCQLRCMLLNVQLPMALVTVFVPQYCFCLLDRVVLPRRECGSASTLGSLLRACGRPGSACHAGGCSAQRRDPLTI 383 T 0.023 DUF2878 pdbpercent F T 7xzq 2 B B thiopeptide TP1 XWGFIYKTLKXXGXXXXX 18 T 2 FeoC pdbhh F T 7xzr 2 C,D C,D thiopeptide TP15 XWTIRTRGRIATXXXXXX 18 T 0.72 SWIM pdbhh F T 7y1c 3 E Y phage tail tubular protein B MALVSQSIKNLKGGISQQPEILRYPEQGTLQVNGWSSETEGLQKRPPMVFIKSLGPRGYLGEDPYIHLINRDEYEQYYAVFTGNDVRVFDLSGYEYQVRGDRSYVTVNNPKDNLRMVTVADYTFIVNRTRQVRENQNRTNGGTFRDNVDAIINVRGGQYGRKLEVNINGVWVSHQLPPGDNAKEDPPKVDAQAIAEAIATLLRTAHPTWTFNVGTGFIHCIAPADTTIDILETKDGYADQLINPVTHYVQSFSKLPLNAPDGYMVKIVGDTSKTADQYYVKYDKSQKVWKETVGWNISVGLEYHTMPWTLVRAADGNFDLGYHEWKDRRAGDDDTNPQPSFVNSTITDVFFFRNRLGFISGENIVMSRTSKYFEFYPPSVANYTDDDPLDVAVSHNRVSVLKYAVSFAEELLLWSDEAQFVLSANGVLSAKTAQLDLTTQFDVSDRARPYGIGRNIYYASPRSSFTSIMRYYAVQDVSSVKNAEDMTAHVPNYIPNGVYSINGSGTENFACVLTKGAPSKVFIYKFLYMDENIRQQSWSHWDFGDGVEVMAANCINSTMYMLMRNGYNVWIAAVDFKKESTDFPFEPYRFHVDAKRSYHISETAYDIETNQTVVNVKDIYGASFAKGTVAICESDGKITEYEPTGNSWDSTPDIRISGDVSGKNIVIGFLYDFQYVFSRFLIKQEQNDGTTSTEDSGRLQLRRAWVNYQNTGAFTVSVDNGSREFNYLVNARVGSTGLRLGQKATTTGQYRFPVTGNALYQKVSLSSFNASPVSIIGCGWEGNYSRRANGI 791 T 0.041 Phage_stabilise pdbhh F T 7y22 3 E Y phage tail tubular protein B MEVQGSLGRQIQGISQQPASVRLPGQCTDAINCSMDVVEGTKSRPGTVHIARLGDLGLIQDNTNIHHYRRGDDVEEYWMITNPLGIPDIFDKQGRKCTVTETEGAASYFNSNNPRVDYKFFTVGDTTFVVNRTKIVRARADKTPAVGGTALVFSAYGQYGTNYQIIINGVKAAEYKTASGGSASDVETIRTEVIAEQLYTNLLTWAGASDYSISRMGTTIVISSLSGASFTVDTEDGSKGKDLVAIQYKVTSTDLLPSKAPVGYLVQVWPTGSKPESRYWLKAEAADGNLVTWQETLGADEVLGFDGSTMPYIIERTNIVGGIAQFTIKQGYWDDRAVGDELTNPMPSFVDQSLSDIFMVQNRLCLAAGESCIMSRTSYFFQFFRQTVLSAVDTDPIDVFADASEVYALKHAKVLDGDTVLFSDNAQFILPGDKPLTKATALLRPTTTFEVDTNVAPVVTGEAVMFATKDGAYSNIREFYTDSYSDTKKAQPVTSHVNKLIRGGIYHMASSTNFNRLFALSEDNRSRVFVYDWLWQGTDKVQSAWHKWEFYGATIGGLYYSGETLYLIIKRNDGVFLEAMYMGDPLLSGSDQVRMDRTVTVSLTWDEATLSWKSSPLPWVPTQVEMLEAVLTNGDPAYLGGAFLFEYDANTRILSTKYGLGDTSQIWAAKVGQMYKVEFVPTDVIIRDSQDRVSYQDVPVIGLVHLNLDRYPDFTVEITNRKSGAVRVAKASNRVGGARNNVVGYVKPTSGTFSFPLRALSTDVEYRIISISPHTFQLRDIEWSGSYNPTRKRV 794 T 0.017 Phage_stabilise pdbhh F T 7y26 2 B B Engineered Guanine nucleotide-binding protein G(q) subunit alpha TVSAEDKAAAERSKMIDKNLREDGEKARRTLRLLLLGADNSGKSTIVKQMRILHGGSGGSGGTSGIFETKFQVDKVNFHMFDVGGQRDERRKWIQCFNDVTAIIFVVDSSDYNRLQEALNDFKSIWNNRWLRTISVILFLNKQDLLAEKVLAGKSKIEDYFPEFARYTTPEDATPEPGEDPRVTRAKYFIRKEFVDISTASGDGRHICYPHFTCAVDTENARRIFNDCKDIILQMNLREYNLV 243 T 2.5E-10 G-alpha pdb F T 7y39 1 A,B A,B ZFAN1_HUMAN ZINC FINGER AN1-TYPE-CONTAINING PROTEIN 1 GGSGAKNSETAAKVALMKLKMHADGDKSLPQTERIYFQVFLPKGSKEKSKPMFFCHRWSIGKAIDFAASLARLKNDNNKFTAKKLRLCHITSGEALPLDHTLETWIAKEDCPLYNGGNIILEYLNDEEQFCKNVESYLE 139 T 0.036 EndoU_bacteria pdbpercent F Eukaryota T 7y3f 14 O,P 4,5 ISIA_NOSS1 CP43' MQTYDNPNIKYDWWAGNARFANLSGLFIGAHVAQAALTTLWAGAFTWFEISRYKPEIPMGEQGLILLPHLATLGFGVGVSGQVVNTYPYFVIGALHLISSAVLGAGALFHTFKGPRNLKNTTGSARKFHFEWNDPKQLGLILGHHLLFLGMAALLLVGKAMFWGGLYDATTQVVRVVNHPTLNPFVIYGYQTHFASVNNLEDLVGGHIYVGLILIGGGIWHIVKEPLPWAKKLLIFSGEAILSYSLGGIALAGFVAAYFCAVNTLAYPVEFYGAPLELKFGVTPYFADTVKLADGGYSARAWLANAHFFLAFFFLQGHLWHALRAIGVDFRQIEKSLNAISSAE 344 T 1.4E-06 PSII pdb F Bacteria T 7y3j 3 C A A4_HUMAN ALA-LEU-VAL-PHE-PHE-ALA-PRO-ALA-VAL-GLY-SER KLVFFAPDVGS 11 T 0.01 Beta-APP pdbhh F Eukaryota T 7y3t 1 A,B,C,D,E,F,G G,A,B,C,D,E,F phage major capsid protein MSTPNVLTNVAVSHSGEVDSLLIEKFNGKVREQYLKGENLLSHFQVETVTGTNTVSNKYLGETEIQVLAPGQSPAATPTKADKNQVVIDTTVIARNTVAMLHDVQGDIDSLKPKIAVNQAKQLKRLEDEMVVQQLMLGGISNTKAQRTNPRVPGHGFSINVNITADTAETSPQYLAAAIEYALEQQLEQEVDISDLVILMPWKFFNALRDMDRIVDRSYTLADESTVQGFALKSFNVPVVPSNRFPKFSQGAAHHKLSNADNGFRYDTTAPMAGAVAVIFSMDALLVGRTIELTGDIFWEKKEKTFYIDTYLAEGAIPDRWEAVSVVTTARNATTGDPDGTGADDTVVTKRANRKVILTKAVS 363 F F T 7y43 1 A A KAT6A_HUMAN MOZ,YBF2/SAS3,SAS2 AND TIP60 PROTEIN 3,MYST-3,MONOCYTIC LEUKEMIA ZINC FINGER PROTEIN,RUNT-RELATED TRANSCRIPTION FACTOR-BINDING PROTEIN 2,ZINC FINGER PROTEIN 220 SMVKLANPLYTEWILEAIKKVKKQKQRPSEERICNAVSSSHGLDRKTVLEQLELSVKDGTILKVSNKGLNSYKDPDNPGRIALPKP 86 T 0.0018 Linker_histone pdb F Eukaryota T 7y4a 2 B,D,F,H B,D,F,H ELMO1_HUMAN PROTEIN CED-12 HOMOLOG GMPPPADIVKVAIEWPGAYPKLMEIDQKKPLSAIIKEVCDGWSLANHEYFALQHADSSNFYITEKNRNEIKNGTILRLTTSPA 83 T 0.0014 FERM_N pdb F Eukaryota T 7y4h 1 A A AcvX MRAKGISYDTGFVKNGATSRKRFDPDVVERELRIIRDDLHCTAVRVMGGDPERIEVAAAHAADLGLEVWFSPYPLELTAEEMLSLFADCAERAERLRRRGAEVVFVVGAELSLMNPGFLPGDSTDERVALLRRPDRVREQLGEVSARVNAFLGKAVQLVRERFDGKVTYASVPFERVDWAPFDIVSMDLYRSAEIADRFTDGVRDLVAQGKPVAITEFGAAGYQGAGDRGALALEIVEYGKDGPVRLKGDHARDEPGQAAYVRELLEAFDAGGVDGAFVFTFALYDHVHRPDGDPRDDLDLASYGIVKVYEDRLGATYPDMPWEPKAAFTTLAEYYRG 338 T 0.42 Cellulase pdb F T 7y4l 1 A,E AA,EA A0A5J4YXP2_PORPP Linker4 MAFVTGGLVGSSSAPALRTVCNASQSKLRMAASAADVVNAAYPKNIKNKAPVISFDGKKGVKLEMVTLQTFAGDDSEDTLFDYSSGKFMPQKPADMGIAWPSGDGRQAEMKGGKGSFNQPDLRKYGPFPDFLKRSMDL 138 T 26 GRP pdbhh F Eukaryota T 7y4l 4 K,U KA,UA A0A5J4YJY8_PORPP CaRSPs1 MAAFVSGGCGVGGQRRAWPAKGAAVARTHACPTTMVVLPVARAGLAATAKKNQYMGTSVAPEIVLTDKGSDMSRKVKTEDKKVAADQAAAMGILANMSLYASLNPVKRMTYKAKEQAPAYVKKTGNPVEDFYPSSWRNMAPVISLSANRVAVAFEKIDAASNGVKANSNNKPFWKSGATTKNYVAPEAPAQSEPETLDDAAYQRYFPARIRNKAPAMEFRRPSFANTEDPSAYFMLQKETVPLRMALAEKLLTKLGRKGDG 261 T 85 DUF6243 pdbhh F Eukaryota T 7y4l 5 SW,W cA,WA A0A5J4YX67_PORPP CaRSPs2 MWEQQRPRRCEAPAAPSSRPAERRAAARRSRAQLRMKQDDYEQWKTEFAGGFPGGEAFYKKWIEEGAKGDVPALEEELQPRSPNKKPTIYEEQMISNRGQQKGVDPTWKTLLAGGFPGGEFFFKKWIGEGAQGEVPNLDADLQPGSGSAKKTGKKEDADKSSPGGIMTPGRIMVPSGLGEEEETVDKEAPRPELYNKYFSADRLHKAPEILFEYNKTKYDRVGVRYTEVTSKASERFFPKSRMNRAPVIEISYREGAVSTASVSLSMPEISGPPALPFPVPKGDVTTTMVTDPTTGRLKLEFKVDGAAVSAYSDPRA 317 T 10 DUF5767 pdbhh F Eukaryota T 7y4l 30 MAA,NAA Y3,53 A0A5J4Z365_PORPP LPP2 MVVVCASRKSGRAVVPAMVPALRVELVVSKQMVMGLAALFFQVQRTALMAKLELPKFSMPSMPSMPKLTVPKLQMPKLGGGEKKDKAAKPSPSAPKTTIRPSGGVKVRAAVGNKSNVDAPSFKGSNMELADSGADYKAFPKRRMPGANMQGFLDMAKGMKPK 162 T 0.14 CCDC71L unppercent F Eukaryota T 7y5e 30 CX,DW,JFA,OJ q6,Q6,qL,QL A0A5J4Z679_PORPP PsbQ' MAFVSGFAGAQIASGSSAQVCRAPAAVRMSAAGEEMSRRDLMAGVATAAAGLLVIPGAAMAGDAPKQSFFGGSSASSPFVYNMKQTGEILYKPLNDEDLQFHKNVLEKSRGELDRTSEQIARKSWDDMRGVIRNQMYNMRHSQLRLIESVESAEKQKAAKKNYNDLKKSLEEMDLAARNKKQEDARKFRASALKAFDSFTTSVGI 205 T 0.00097 TAT_signal unppercent F Eukaryota T 7y5e 53 LL,UQ ON,O2 A0A5J4YUC8_PORPP Photosystem I subunit O YEISEGSSFDANPLVIGLALIGWVVPSSVPSNIPLLDGKGLTPAFVASISDNLSRWPQGPQLADPFWLLMGMWHVGLFATLIFGTVGYNLRK 92 T 35 Mif2_N pdbhh F Eukaryota T 7y5e 54 ML,VQ RN,R2 A0A5J4YR43_PORPP PsaR DNYPSSEVLGLGKNIPSALYVLISIACFAIGVTSVAKSNLITPLTPESINPQYVVGSLLLPISWGAHTAAFIQKVNKK 78 T 0.022 Antimicrobial22 pdbpercent F Eukaryota T 7y5x 2 B B PSN2_HUMAN PS-2,AD3LP,AD5,E5-1,STM-2 MLTFMASDSEEEVCDERTSLMSAESPTPRSCQEGRQGPEDGENTAQWRSQENEEDGEEDPDRYVCSGVPGRPPGLEEELTLKYGAKHVIMLFVPVTLCMIVVVATIKSVRFYTEKNGQLIYTPFTEDTPSVGQRLLNSVLNTLIMISVIVVMTIFLVVLYKYRCYKFIHGWLIMSSLMLLFLFTYIYLGEVLKTYNVAMDYPTLLLTVWNFGAVGMVCIHWKGPLVLQQAYLIMISALMALVFIKYLPEWSAWVILGAISVYDLVAVLCPKGPLRMLVETAQERNEPIFPALIYSSAMVWTVGMAKLDPSSQGALQLPYDPEMEEDSYDSFGEPSYPEVFEPPLTGYPGEELEEEEERGVKLGLGDFIFYSVLVGKAAATGSGDWNTTLACFVAILIGLCLTLLLLAVFKKALPALPISITFGLIFYFSTDNLVRPFMDTLASHQLYI 448 T 1.5E-46 Presenilin pdb F Eukaryota T 7y66 6 F E BM213 peptide XFKPLAAXR 9 T 20 T3SS_HrpK1 pdbhh F T 7y7a 36 JI,JOB R7,Ro A0A5J4YR43_PORPP PsaR MAFINGAALGGGAKVAFSGKAVASRRVVAASTANKRSVVVKMADNYPSSEVLGLGKNIPSALYVLISIACFAIGVTSVAKSNLITPLTPESINPQYVVGSLLLPISWGAHTAAFIQKVNKK 121 T 0.093 Antimicrobial22 pdbpssm F Eukaryota T 7y7a 64 BM,DQA,XL,ZPA EA,EW,AA,AW A0A5J4YXP2_PORPP Linker4 DVVNAAYPKNIKNKAPVISFDGKKGVKLEMVTLQTFAGDDSEDTLFDYSSGKFMPQKPADMGIAWPSGDGRQAEMKGGKGSFNQPDLRKYGPFPDFLKRSMDL 103 T 23 ISAV_HA pdbhh F Eukaryota T 7y7a 65 HM,JQA,RM,TQA KA,KW,UA,UW A0A5J4YJY8_PORPP CaRSP1 VVLPVARAGLAATAKKNQYMGTSVAPEIVLTDKGSDMSRKVKTEDKKVAADQAAAMGILANMSLYASLNPVKRMTYKAKEQAPAYVKKTGNPVEDFYPSSWRNMAPVISLSANRVAVAFEKIDAASNGVKANSNNKPFWKSGATTKNYVAPEAPAQSEPETLDDAAYQRYFPARIRNKAPAMEFRRPSFANTEDPSAYFMLQKETVPLRMALAEKLLTKLGRKGDG 226 T 100 TBC1D23_C pdbhh F Eukaryota T 7y7a 66 BRA,TM,VQA,ZM cW,WA,WW,cA A0A5J4YX67_PORPP CaRSP2 GEEEETVDKEAPRPELYNKYFSADRLHKAPEILFEYNKTKYDRVGVRYTEVTSKASERFFPKSRMNRAPVIEISYREGAVSTASVSLSMPEISGPPALPFPVPKGDVTTTMVTDPTTGRLKLEFKVDGAAVSAYSDPRA 139 T 2.8 DUF5767 pdbhh F Eukaryota T 7y7b 21 U O PsaO FEISDGVEFDLNPLVLAISFLGWSLPGLLPSNIPLYGGKGLTTALFAEIGEHLQTFPAPPPIGDPFWVILFIWHSGLFATMIFGTIGYNGYGPKSTTKY 99 T 34 G2F pdbhh F T 7y7b 22 V R PsaR MVRALCFLALIASAAAFSTAPGLALRSSVRPATSTKTPMKMAGYSPVPSPDNTKETYWETKAPSSQVLGIGKDVSSGNYIVASVVAAVVGAACTGQCIPLTVSPNPVFILGSFLLPYSWALHVAAWIQRNNGK 133 T 0.042 MRP_L53 pdb F T 7y7b 24 X Z ACPI-S MAQAAPPSKLPENMAKKNALKVNKEQWGIEEAIKVDAKAAAPAPKAAAPAPKKAAPKKGAAPAEATVGFSGVPSDFCRPAPTAFPAEPAGMTVFGARGPRAEGHRDKFGSRHAAVSLACAALLWQPISQAGMYSIDSGSLAKKSFSEMEVPGFGDAKKVPTIESFFPFTKNGFDASPALFGKDSMIVFENPLGKCGAYASSCHTFLDEMGDMLKATPQEMPRSKAAPTYSFPWMYDHAAWKK 242 T 0.53 KAR9 pdbpssm F T 7y7i 7 K,L K,L A0A3Q3AQL2_CHICK Myb-like domain-containing protein SSNGIYTRSGRLVKPPLSFWCGEREFVDRELNVTIQKGGTDYLS 44 T 0.8 DUF4764 pdbhh F T 7y8a 21 U O PsaO MKVAFLVLLAAATANAFAPTAAFLPKAHGIAASKPAMALRAAPRAVAKPLAVQAKFEISDGVEFDLNPLVLAISFLGWSLPGLLPSNIPLYGGKGLTTALFAEIGEHLQTFPAPPPIGDPFWVILFIWHSGLFATMIFGTIGYNGYGPKSTTKY 154 T 43 DMP1 pdbhh F T 7y8w 2 E,F,K,L,O,P,S,T E,F,K,L,O,P,S,T SAO1_CAEEL Isoform b of Suppressor of aph-1 GPGSEFMQHANVATDQVVMKSVECQTEPVE 30 T 14 XRN_N pdbhh F Eukaryota T 7y9x 1 A B A0A401FT52_9DELT CHAT domain-containing protein MSNPIRDIQDRLKTAKFDNKDDMMNLASSLYKYEKQLMDSSEATLCQQGLSNRPNSFSQLSQFRDSDIQSKAGGQTGKFWQNEYEACKNFQTHKERRETLEQIIRFLQNGAEEKDADDLLLKTLARAYFHRGLLYRPKGFSVPARKVEAMKKAIAYCEIILDKNEEESEALRIWLYAAMELRRCGEEYPENFAEKLFYLANDGFISELYDIRLFLEYTEREEDNNFLDMILQENQDRERLFELCLYKARACFHLNQLNDVRIYGESAIDNAPGAFADPFWDELVEFIRMLRNKKSELWKEIAIKAWDKCREKEMKVGNNIYLSWYWARQRELYDLAFMAQDGIEKKTRIADSLKSRTTLRIQELNELRKDAHRKQNRRLEDKLDRIIEQENEARDGAYLRRNPPCFTGGKREEIPFARLPQNWIAVHFYLNELESHEGGKGGHALIYDPQKAEKDQWQDKSFDYKELHRKFLEWQENYILNEEGSADFLVTLCREIEKAMPFLFKSEVIPEDRPVLWIPHGFLHRLPLHAAMKSGNNSNIEIFWERHASRYLPAWHLFDPAPYSREESSTLLKNFEEYDFQNLENGEIEVYAPSSPKKVKEAIRENPAILLLLCHGEADMTNPFRSCLKLKNKDMTIFDLLTVEDVRLSGSRILLGACESDMVPPLEFSVDEHLSVSGAFLSHKAGEIVAGLWTVDSEKVDECYSYLVEEKDFLRNLQEWQMAETENFRSENDSSLFYKIAPFRIIGFPAE 751 T 1.7E-07 CHAT pdbpercent F Bacteria T 7yca 21 U O A0A096P9N0_OSTTA PsaO MLALAPINVQRRSPLGVSARGRKTQSKARFAVKVNAANADLKFDDDWKKSNVAVHLASLFGWVIPSASPCPAFPDNASLFKVFSDRISENLAHFPTGPSADDPIWLYMLTWHMGLFACMMFGQIGVQARKQGYFGN 136 T 48 YkpC pdbhh F Eukaryota T 7yd4 1 A A P95206_MYCTU SECRETORY PROTEIN EPTGALPPMTSSGSGPVIGDGDAALRQRISQQLFSFGDPTVQEVDGSDAAQFITAAAAVADRDVASVFLPLQRVLGCQQNTAGSGAGFGARAYRRTDGQWGGAMLVVAKSTVSDVDALKACVKSGWRKATAGTPTSMCNNGWTYPPFADTRRGEEGYFVLLAGTASDFCSAPNANYRTTASSWPG 185 T 14 DUF983 pdbhh F Bacteria T 7yed 1 A,B,C,D,E,F,G,H,I,J,U,V,W,X,Y 1,2,3,4,5,A,B,C,D,E,a,b,c,d,e C9E874_9VIRU LAMBDA 1 MKRIPRKTKGKSSGKGNDSTSRSDDGSSQLRDKQSNKANPATAEPGTSNCEHYKARPGIASVQKATESAELPMKNNDEGTPDKRGNTKGALVNEHVEARDEADDATKKQAKDTEKAKAQVTYSDTGINNANELSRSGNVDNEGGSNQKPMSTRIAEATSAIVSKHPARVGLPPTASSGHGYQCHVCSAVLFSPLDLDAHVASHGLHGNMTLTSSEIQRHITEFISSWQNHPIVQVSADVENRKTAQLLHADTPRLVTWDAGLCTSFKIVPIVPAQVPQDVLAYTFFTSSYAIQSPFPEAAVSRIVVHTRWASNVDFDRDSSVIMAPPTENNIHLFKQLLNTETLSVRGANPLMFRANVLHMLLEFVLDNLYLNRHTGFSQDHTPFTEGANLRSLPGPDAEKWYSIMYPTRMGTPNVSKICNFVASCVRNRVGRFDRAQMMNGAMSEWVDVFETSDALTVSIRGRWMARLARMNINPTEIEWALTECAQGYVTVTSPYAPSVNRLMPYRISNAERQISQIIRVMNIGNNATVIQPVLQDISVLLQRISPLQIDPTIISNTMSTVSESTTQTLSPASSILGKLRPSNSDFSSFRVALAGWLYNGVVTTVIDDSSYPKDGGSVTSLENLWDFFILALALPLTTDPCAPVKAFMTLANMMVGFETIPMDNQIYTQSRRASAFSTPHTWPRCFMNIQLISPIDAPILRQWAEIIHRYWPNPSQIRYGTPNVFGSANLFTPPEVLLLPIDHQPANVTTPTLDFTNELTNWRARVCELMKNLVDNQRYQPGWTQSLVSSMRGTLGKLKLIKSMTPMYLQQLAPVELAVIAPMLPFPPFQVPYVRLDRDRVPTMVGVTRQSRDTITQPALSLSTTNTTVGVPLALDARAITVALLSGKYPPDLVTNVWYADAIYPMYADTEVFSNLQRDVITCEAVQTLVTLVAQISETQYPVDRYLDWIPSLRASAATAATFAEWVNTSMKTAFDLSDMLLEPLLSGDPRMTQLAIQYQQYNGRTFNVIPEMPGSVIADCVQLTAEVFNHEYNLFGIARGDIIIGRVQSTHLWSPLAPPPDLVFDRDTPGVHIFGRDCRISFGMNGAAPMIRDETGMMVPFEGNWIFPLALWQMNTRYFNQQFDAWIKTGELRIRIEMGAYPYMLHYYDPRQYANAWNLTSAWLEEITPTSIPSVPFMVPISSDHDISSAPAVQYIISTEYNDRSLFCTNSSSPQTIAGPDKHIPVERYNILTNPDAPPTQIQLPEVVDLYNVVTRYAYETPPITAVVMGVP 1275 T 0.0075 zf_C2H2_6 pdbhh T Viruses T 7yeq 1 A A A0A2X0RVH4_ASF CP312R CDS PROTEIN,CP312R PROTEIN MTTHIFHADDLLQALQQAKAEKNFSSVFSLDWDKLRTAKRNTTVKYVTVNVIVKGKKAPLMFNFQNEKHVGTIPPSTDEEVIRMNAENPKFLVKKRDRDPCLQFNKYKISPPLEDDGLTVKKNEQGEEIYPGDEEKSKLFQIIELLEEAFEDAVQKGPEAMKTKHVIKLIQRKISNSAVKNADKPLPNPIARIRIKINPATSILTPILLDKNKPITLQNGKTSFEELKDEDGVKANPDNIHKLIESHSIHDGIINARSICISNMGISFPLCLEMGVVKVFEKNNGIDVNSIYGSDDISTLVNQIAIA 307 T 61 DUF5721 pdbhh T Viruses T 7yf6 2 C C Macrocyclic Peptide FVPVLWLXX 9 T 0.47 BshC pdbhh F T 7yfs 1 A A noursin APSNVLSTLLHGRACV 16 T 0.85 DUF765 pdbhh F T 7yfu 1 A,B A,B FTM_MOUSE NEPHROCYSTIN-8,RPGR-INTERACTING PROTEIN 1-LIKE PROTEIN,RPGRIP1-LIKE PROTEIN GPGSRDVEMEEMIEQLQEKVHELERQNEVLKNRLISAKQQLQVQG 45 T 0.00078 EzrA unphh F Eukaryota T 7yfv 1 A,B,C,D A,B,C,D FTM_MOUSE NEPHROCYSTIN-8,RPGR-INTERACTING PROTEIN 1-LIKE PROTEIN,RPGRIP1-LIKE PROTEIN GPGSVSRVSREELEDRFLRLHDENILLKQHARKQEDKIKRMATKLIRLVNDKKRYERVGG 60 T 0.00014 WEMBL unphh F Eukaryota T 7yfw 1 A,B,C a,b,c Pam3 fiber proreins MASINLPFSLSGSKRIPTSEELADGYQCGPLDVELDNWLMWWLTGQVDGVIEGAGLTTDDTDLARLYKAIQSMTSGNLRTVVLTAASGNLPIPSDVSVLNWVRAVGGGGAGGNSNTGNSKASGGGGGAGFDRFNVAVTPGSNVPYTVGAAGAVNGLGAGYNGGAGGSTAILGTTAGGGAGGLGVNNNATAVQVNGGTTSGTTPEISYPGGLGTEGIVGTGGGSVLSQPTQRAFTNAGNNNPANSWGGGGPGGSDFGGAWQPGGVGKQGIIIVQYFSRFAP 280 T 180 DUF777 pdbhh F T 7yfz 1 A,B,E,F,H,I,K,L,N,O,P,Q A,B,E,F,H,I,K,L,O,P,Q,R Pam3 baseplate wedge gp22 MTYGVQPTGYVKKPLAVHLAEIEASMVDLFGPGVIQTEQSPLGQLNGLYADLSYDLDERGEDLYQSFDPEQAEGSRLDILARYRLLSRRAGESDESFRRAITNVDRARIDLSDLSTALSAINGVSWSRVYVNEDATTDADGIPPNTVSVAVIGGDDDEVAQLVRRYVVPGVGMYGNTTIETTIGGFCRRIRVIRPVLIPTSVEIDVQSRPLKNGCPPPSVNAMAAGLYTELTGPDRPGNGEDGTVYLFRKIMERLYPNVEVVDVRLSQAPAAPTTPPLVMSFFQMMSFNADDILVEIVP 299 F F T 7yfz 4 T,U,V,W,X,Y a,b,c,d,e,f Pam3 tube initiator gp17 MIIAFSSAIGPVPLTVVISEKHTSKVELTTNPIESGADVTDHAYVKGKEIELEVADRNAAATWAALVAFQESRVPFVLMTGLSMYRNMIITEIDATRNAQHSKILKGTVRLREVKIVETGTAEDSSGKDGTDKNKSSNPSKDKAADAKTADKANSGVNAGDKGGTTVAAPRAQSLLKGVFGGSSASGGAAP 191 T 2.4E-05 Phage_P2_GpU pdbhh F T 7yfz 5 AA,BA,CA,DA,EA,Z i,j,k,l,m,h Pam3 plug gp18 MIELEVLDESKQKFSVILNDRRVTIELWYNTTNDRWSFSLALDGDNVVTGRRLVTGVDLLAPFGLGIGALFLLSENGEPPTRANLPLGLVKLYHATQEEIDAAISA 106 T 3.8 RC-P840_PscD pdbhh F T 7yg3 3 C B ARG-GLN-ASP-ILE-LEU-ASP-LEU-TRP-ILE RQDILDLWI 9 T 0.0037 F-protein pdbhh F T 7yg4 1 A A VIR_HUMAN Protein virilizer homolog MASVKLTELLDLYREDRGAKWVTALEEIPSLIIKGLSYLQLKNTKQDSLGQLVDWTMQALNLQVALRQPIALNVRQLKAGTKLVSSLAECGAQGVTGLLQAGVISGLFELLFADHVSSSLKLNAFKALDSVISMTEGMEAFLRGRQNEKSGYQKLLELILLDQTVRVVTAGSAILQKCHFYEVLSEIKRLGDHLAEKTSSLPNHSEPDHDTDAGLERTNPEYENEVEASMDMDLLESSNISEGEIERLINLLEEVFHLMETAPHTMIQQPVKSFPTMARITGPPERDDPYPVLFRYLHSHHFLELVTLLLSIPVTSAHPGVLQATKDVLKFLAQSQKGLLFFMSEYEATNLLIRALCHFYDQDEEEGLQSDGVIDDAFALWLQDSTQTLQCITELFSHFQRCTASEETDHSDLLGTLHNLYLITFNPVGRSAVGHVFSLEKNLQSLITLMEYYSKEALGDSKSKKSVAYNYACILILVVVQSSSDVQMLEQHAASLLKLCKADENNAKLQELGKWLEPLKNLRFEINCIPNLIEYVKQNIDNLMTPEGVGLTTALRVLCNVACPPPPVEGQQKDLKWNLAVIQLFSAEGMDTFIRVLQKLNSILTQPWRLHVNMGTTLHRVTTISMARCTLTLLKTMLTELLRGGSFEFKDMRVPSALVTLHMLLCSIPLSGRLDSDEQKIQNDIIDILLTFTQGVNEKLTISEETLANNTWSLMLKEVLSSILKVPEGFFSGLILLSELLPLPLPMQTTQVIEPHDISVALNTRKLWSMHLHVQAKLLQEIVRSFSGTTCQPIQHMLRRICVQLCDLASPTALLIMRTVLDLIVEDLQSTSEDKEKQYTSQTTRLLALLDALASHKACKLAILHLINGTIKGDERYAEIFQDLLALVRSPGDSVIRQQCVEYVTSILQSLCDQDIALILPSSSEGSISELEQLSNSLPNKELMTSICDCLLATLANSESSYNCLLTCVRTMMFLAEHDYGLFHLKSSLRKNSSALHSLLKRVVSTFSKDTGELASSFLEFMRQILNSDTIGCCGDDNGLMEVEGAHTSRTMSINAAELKQLLQSKEESPENLFLELEKLVLEHSKDDDNLDSLLDSVVGLKQMLES 1107 T 3 T3SS_ATPase_C pdb F Eukaryota T 7yh8 1 A,C A,C L-19437 LPVEKIIREAKKILDELLKRGLIDPELARIAREVLERARKLGNEEAARFVLELIERLRRELS 62 T 0.05 DUF2095 pdbhh F T 7yhr 1 A A A0A239N0M2_9PSED Anti-CRISPR protein Type I-C5 MSKVTLNGQQIDFDAAVNLMDAELREELHSAQEWTNDQEFLDAYVQAHAAKFDGEEFQVA 60 T 0.1 TubC_N pdb F Bacteria T 7yhs 4 I J AcrIF4 MMTISKTDIDCYLQTYVVIDPVSNGWQWGIDENGVGGALHHGRVEMVEGENGYFGLRGATHPTEKEAMAAALGYLWRCRQDLVAIARNDAIEAEKYRAKA 100 T 1.7 TAL_effector pdbhh F T 7yil 1 A,B A,B Y248_METJA GINS FVITMYESLKNYFFEEIKNDKLLKLPDDFYDDIREYIKNIKDDIELERVKYYFKELRKLRIYKALYLDNERENLLPEELNIIHAIENIVVELKIE 95 T 0.041 Peptidase_M3_N unppercent F Archaea T 7yiu 2 B E SPTC1_HUMAN LONG CHAIN BASE BIOSYNTHESIS PROTEIN 1,LCB 1,SERINE-PALMITOYL-COA TRANSFERASE 1,SPT 1,SPT1 MATATEQWVLVEMVQALYEAPAYHLILEGILILWIIRLLFSKTYKLQERS 50 T 0.55 Mucin15 pdbhh F Eukaryota T 7yji 1 A A A0A2S6F9N6_LEGPN T4SS effector Lpg1083 MGHHHHHHMALIDQITTINKNEFTDDFLRKYFELGFGSLSKHDIDLLVYYLVKEHSDLFNGKTNYEISSLLTITERKLQSIQMESYLRYENNSISKNLEELSVKITKGEIKPEVEGDKIRVLIDSPVLRRDLEYSITSLGHIVDYSFNKNILSLRLSNFFEVFGNLNIENGKELKTQVIDFFREQNKWDKEILIEIENKSWWIKQFNTLQAAVKKEAAALIFHSIISMVKSHIGI 235 T 0.017 Sigma70_r4_2 unppercent F Bacteria T 7yjm 4 D E LCB1_ARATH atLCB1 MASNLVEMFNAALNWVTMILESPSARVVLFGVPIRGHFFVEGLLGVVIIILLTRKSYKPPKR 62 T 0.0009 RELT pdb F Eukaryota T 7yk3 2 B,D B,D DARG_MYCTU DARG,ANTITOXIN DARG,MACRO DOMAIN-CONTAINING PROTEIN MTWGRAVILEAMRRYLQQRRAMEPWEDPAGISHLEIQKLMYFANEADPDLALDFTPGRYGPYSERVRHLLQGMEGAFTVGLGDGTARVLANQPISLTTKGTDAITDYLATDAAADRVSAAVDTVLRVIEGFEGPYGVELLASTHWVATREGAKEPATAAAAVRKWTKRKGRIYSDDRIGVALDRILMTA 189 T 1.7E-05 DUF4065 pdbhh F Bacteria T 7yk5 3 Q,R,S,T i,k,j,l PYCO1 SSU binding motif KWSPRGGS 8 T 5.2 DUF2415 pdbhh F T 7yk5 4 AA,BA,U,V,W,X,Y,Z b,a,h,g,f,e,d,c PYCO1 LSU binding motif AAEWGSMNQ 9 T 0.025 Intimin_C pdbhh F T 7ykd 1 A L RARR2_HUMAN CHEMERIN,RAR-RESPONSIVE PROTEIN TIG2,TAZAROTENE-INDUCED GENE 2 PROTEIN YFPGQFAFS 9 T 1.6 BTK pdbhh F Eukaryota T 7ym8 1 A B miniGsq MGHHHHHHHHLEVLFQGPIEKQLQKDKQVYRATHRLLLLGADNSGKSTIVKQMRILHGGSGGSGGTSGIFETKFQVDKVNFHMFDVGGQRDERRKWIQCFNDVTAIIFVVDSSDYNRLQEALNDFKSIWNNRWLRTISVILFLNKQDLLAEKVLAGKSKIEDYFPEFARYTTPEDATPEPGEDPRVTRAKYFIRDEFLRISTASGDGRHYCYPHFTCAVDTENARRIFNDCKDIILQMNLREYNLV 246 T 5.9E-09 G-alpha pdb F T 7yoj 1 A A A0A399WQY8_9BACT REVERSE TRANSCRIPTASE DOMAIN-CONTAINING PROTEIN MAKATKEVKSKRVEALRQVAYQRLERLERKAQKIGAHLRKPGKAADLQSLHYLLHKVEVEYHDIARNLEKDPTWTPKPKMRREKRAIVPESGPAAPLPTTAKGEPGRPANRHIPPPVPLDSARIPEDQQSMGQGSGGRSWCSAPFVEVKLPPTQWSNVREKLLKFRIEDDADIVRRWAEAKFGSIETARDGLRASAEIGTSPDVWRSFISRAISNGKKDFEPLLSLDDDELTADATAERVVRRWHQIDWVGRMLDSILETVPSGVSKDTFRSRVESRLKTFHSSVNSFELKKRKDGTVERKRKHTNPQFPYLSPSAVSIDPDVVTMEAVELLQMQPEERFAKDPNDANGRMRLRVLQAELGKARREALGRRGEKAPPWSGRKVFRGTTTRKREACLVWDKEAQADGLYFALVMSGGPKIDDKRFVYMDGQPLQSDWQLHNGVAGKAKSCRAMPLILKHDFLRWYHRHIKNHDVNAPLEKRCVHTTTQFVFVEPDEKKGLQPRLFIRPVFKFYDPVYEVPDSHSIDKKPDCRYLIGIARGVNYPYRAAVYDCETNSIIADKFVDGRKADWERIRNELAYHQRRRDLLRNSRASSAAIQREIRAIARIRKRERGLNKVETVESIARLVDWAEENLGKCNYCFVLADLSSNLNLGRNNRVKHIAAIKEALINQMRKRGYRFKKSGKVDGVREESAWYTSAVAPSGWWAKKEEVDGAWKADKTRPLARKIGSYYCCEEIDGLHLRGVLKGLGRAKRLVLQSDDPSAPTRRRGFGSELFWDPYCTELCGHAFPQGVVLDADFIGAFNIALRPLVREELGKKAKAVDLADRHQTLNPTVALRCGVTAYEFVEVGGDPRGGLRKILLNPAEAVI 867 T 0.0053 RuvC_1 unphh F Bacteria T 7ypx 1 A,B,C A,B,C A0A9E7DT93_9CAUD Pam3 tail fiber proreins HHHHHHSSGMASINLPFSLSGSKRIPTSEELADGYQCGPLDVELDNWLMWWLTGQVDGVIEGAGLTTDDTDLARLYKAIQSMTSGNLRTVVLTAASGNLPIPSDVSVLNWVRAVGGGGAGGNSNTGNSKASGGGGGAGFDRFNVAVTPGSNVPYTVGAAGAVNGLGAGYNGGAGGSTAILGTTAGGGAGGLGVNNNATAVQVNGGTTSGTTPEISYPGGLGTEGIVGTGGGSVLSQPTQRAFTNAGNNNPANSWGGGGPGGSDFGGAWQPGGVGKQGIIIVQYFSRFAP 289 T 180 DUF777 unphh T Viruses T 7ypx 2 D,E,F a,b,c A0A9E7J192_9CAUD tail fiber chaperone MTDKHYARVVDGLVVETKTLPADFNLDDLFGPDHGWVEAPLEVEQGWRKVGAKFAPAPPPERDPASILAGLKAEASRHIFATISATAQSNLLLAVGLASAKAPSARTPEERDLLNVADEGRAWIDAVRARVHALAEHDGVTPKGEDRWPAPSEAVLEMAAKF 162 T 0.48 DUF6276 unppercent T Viruses T 7yqk 8 L K TP53B_HUMAN UDR motif of 53BP1 KAADISLDNLVEGKRKRR 18 T 4.2 FYTT pdbhh F Eukaryota T 7yxb 3 E,F G,H CLIP peptide AFAPVSKMRMATPLLMQAGN 20 T 21 DUF3440 pdbhh F T 7yxd 2 B,D,F,H C,F,J,N NR0B2_HUMAN ORPHAN NUCLEAR RECEPTOR SHP,SMALL HETERODIMER PARTNER,NUCLEAR RECEPTOR SUBFAMILY 0 GROUP B MEMBER 2 RPAILYALLSSS 12 T 6.2 NR_Repeat pdbhh F Eukaryota T 7yxp 2 B B NR0B2_HUMAN SHP NR Box 1 Peptide SRPAILYALLSSS 13 T 8.2 NR_Repeat pdbhh F Eukaryota T 7z0f 4 D P CENPJ_HUMAN CENP-J,CENTROSOMAL P4.1-ASSOCIATED PROTEIN,LAG-3-ASSOCIATED PROTEIN,LYST-INTERACTING PROTEIN 1 MVNIEERPIKAAIGERKQTFEDYLEEQIQLEEQELKQKQLKEAEGPLPIKAKPKQPFLKRGEGLARFTNAKSKFQKGKE 79 T 0.11 DUF4122 pdbpercent F Eukaryota T 7z0o 3 D D RRN5_YEAST RNA polymerase I-specific transcription initiation factor RRN5 SMEHQQLRKYVELYNKEVEEFYNGAASGRPAEFHPSKVHVKSIHEKAGTANAGVEISSVGVDWDSEEKNTFFWCLSRYSIHRVDEWRSLLPRKSAMEILGYYRLLRRASASARSRKAGDDGAPIAYEMSAEWVALETKLSETVMAITEGAAEVADEEGHCEGLIDYESWKRRWVAIYSHSRIAEIRPLPRHALPLSRSATQTLERCVSRYTRTLLWCTALAGMASRSVSARAAESRGHKSLPTVVTRRQVERALCTEARSRDLHVLPRRIVLTLRKWELDYPREGKLFRTKEMAHLFLQSQLSRRDAPPVHQDENQENQENQENQEQDNTASEGESEAERDEIDEADLFRSALHENQLLKWLSK 364 T 0.0022 Myb_DNA-binding unppercent F Eukaryota T 7z0q 3 C G CLIP peptide PVSKMRMATPLLMQAGN 17 T 55 DUF3440 pdbhh F T 7z14 5 F,G F,G Consensus short-chain short-chain alpha-neurotoxin ScNtx MICYNQQSSQPPTTKTCSETSCYKKTWRDHRGTIIERGCGCPKVKPGIKLHCCRTDKCNN 60 F F T 7z36 2 C,E C,S SMRCD_HUMAN SMARCAD1 CUE1 domain MGSSHHHHHHSQDPNSSSENLYFQGLSELEDLKDAKLQTLKELFPQRSDNDLLKLIESTSTMDGAIAAALLMF 73 T 0.00029 CUE pdbpssm F Eukaryota T 7z3n 8 H D G0RZX9_CHATD Putative heat shock protein MAESASKAAPGERVVIGITFGNSNSSIAHTVDDKAEVIANEDGDRQIPTILSYVDGDEYYGQQAKNFLVRNPKNTVAYFRDILGQDFKSVDPTHNHASAHPQEAGDNVVFTIKDKAEEDAEPSTLTVSEIATRYLRRLVGAASEYLGKKVTSAVITIPTNFTEKQKAALIAAAAAADLEVLQLISEPAAAVLAYDARPEATISDKIIVVADLGGSRSDVTVLASRSGMYTILATVHDYEYHGIALDKVLIDHFSKEFLKKNPGAKDPRENPRSLAKLRLEAESTKRALSRSTNASFSVESLIDGLDFASTINRLRYETIARTVFEGFNRLVESAVKKAGLDPLDVDEVIMSGGTSNTPRIAANFRYIFPESTRILAPSTDPSALNPSELQARGAALQASLIQEFETEDIEQSTHAAVTTMPHVTNAIGVVSVSESGEEKFVPIIAPETAVPARRTVHLDAPKEGGDVLVKVVEGSTHINVIKPEPKAKEDGETKEKTEDADDDGDFDDDDEEEEEEEEEEEKREKVWKIGSTLAEAAVRGVKKGAKVEVTINVNTDLTVIVTAREVGGKGGVRGTLSA 578 T 0.06 DUF3221 pdb F Eukaryota T 7z44 1 A A A0A0B4N229_9CAUD Portal protein MAKQKYSEEVLDELRVDLQRRFNYAQGYVDMAVKGYAREAWEYFYGNLPAPVTAGSSSWVDRTVWESVNGTLQDIINVFCSGDEAVTFVADNQQDSDAADVATKLVNQILLRDNPGYNIISSAAQECLVTRNSFIKYYWDEQTSTQTEEAEGVPPEALAAYVQGLEAGGLKNLEVFTEENEDGTVDVKVTYEQTVKRVKVEYVPSEQIFVDEHATSFADAQYFCHRVRRSKEDLVAMGFPKDEIEAFNDWTDTMDTTQSTVAWSRTDWRQDIDADIGTDTEDIASMVWVYEHYIRTGVLDKNKESKLYQVIQAGEHILHTEEVTHIPFVTFCPYPIPGSFYGQSVYDITKDIQDLRTALVRGYIDNVNNANYGRYKALVGAYDRRSLLDNRPGGVVEMERQDAIDLFPYHNLPQGIDGLLGMSEELKETRTGVTKLGMGINPDVFKNDNAYATVGLMMNAAQNRLRMVCRNIAHNGMVELMRGIYNLIRENGEVPIEVQTPRGMIQVNPKQLPARHNLQVVVAISPNEKAERAQKLISLKQLIAADAQLAPLFGLEQDRYMTAQIFELMGIKDTHKYLLPLEQYQPPEPSPMEILQLEMTKAQVENVQASSQKMIADAFDQRERTTFEQQKAADELSLRQEELQFKQENAADAMTLENRKEDNNATLEQAKHKLALMQQQVRQYESVLKELQMVMSHQVDQEKIVQQARVQDKTLELQKKEANVTKKEQQASLKDSRIPGKRLGSKK 747 T 18 GAGA_bind pdb T Viruses T 7z47 1 A,B A,B A0A0B4N231_9CAUD Adaptor protein MAMPDVQYPINTYGWLKKAVALWADRDDDEFVNQIPNFINFAEKEIYRNLRIPPLEKEVYLDIKDGVAYIPPDYLEAQWMMRAKDGTIFQVTSPEEISYRRQHGTINPSHWNNQPVNFARFGSRFIFYPSIEADTPYYPDDGSPLIPAENSVILSYYADPPEFHEDTDTSTILTIAPELLLYFTLRHACLFVQDDNGVQKWSALGKAILDEMVEQNKKQEYSGSPIAIPNNMTRLQSSLPDIYGIRTSRV 250 T 0.025 TraD pdbpercent T Viruses T 7z47 3 D,E,F D,E,F A0A0B4N0B9_9CAUD Putative tail fiber MIVYNNQAPDAVNNVGQFGATEGSIGAYKQAAEYAADSKYWALLAESKFGTIDDLIAEVERLYQQGVLMKQDIEDLKQDFKDQDARLMSLIAQTNAAVSDANNAVALINQKLIEVQNQLDVLLGMSVDVTTLPPGTPATGSFNPNTGVISLGIPEGEPGKDGSVKDLDTAPTGVPELGDLGFYVDKDDNTVHKTTLENIANLTPSVRSVSVNGGPALDGEVALTINKETVGLGNVLNVAQYSRQEINDKFDKTTKTYQSKAEAYADAQYRQVGEKVLVWEATKYEFYTVAANKTLTPVKTEGRILTVNSRSPDSSGNIDITIPTGNPSLYLGEMVMFPYDPSKNISYPGVLPADGRLVSKESASDLGPSLVSGQLPVVSETEWQSGAKQYFSWGKLADGITDADSTNFINIRLPDWTGGEAIRAPDSDKDSQYNGSVQAQKPYVVTVNNQAPDEITGNVNISRSILGAASSGANSDITSLSGLTTPLSISQGGTGAKDAASARSNLGLGSVSTLDNVPIASGGTGAGDAAGARFNLGLGNSATMNTGTNSDNVLKVGDFGIGRPDGALVFDTTSQDQLLAGLDTYGLCVFRNNQQIAAPWDIWNYSSNLFFRAGDTYSMISIPFESAGKIKVFGGASGSGWKTSRTVYDTVNTTVDVNGFIKAASPIVKVFHDGSFETNEQSDGVSVKKISTGVYLISGCLGLNSDAGWGGVDGGFEIPIDRNKQPRVWLDYEVKEDGSLLIKTYHRTHSTSPAFARNELEGFSDGDPVDIPKDAFISVRVEMPSK 786 T 0.92 Laminin_II pdb T Viruses T 7z47 4 G,H,I I,H,C A0A0B4N235_9CAUD Putative structural protein MAIETNAVVITDLNPLYPRDRDYIYEGAAQIRLIKQTLQNTFPNVTEPVDIDSDTFKIMSEKLKFTGDAMDVGGLMIKNVTPGTGDKDVVTKGQMEAFMKNWMENKLYRIGSYYITEEDINPGDSISLGFGSWAKVTGVIMGTGVVNPDGSVPNAQRVEFQAGGTGGRVFNTIRTENVPLMTVNGSSFSLSSNTHSHNMVFGRGDASGHNSSPNWYSPGGGYSQRTDNDTHTHTISGSVSLGRDDISRQPINTLPPFRAAHIWRRIS 267 T 0.16 YadA_stalk pdb T Viruses T 7z4s 2 C,D C,D Macrocyclic peptide inhibitor XXFHXLNLGYRPGCX 15 T 2.5 DUF3228 pdbhh F T 7z50 5 I,J T,W Hybrid insulin peptide LQTLALEVEDDPCGG 15 T 0.9 DUF2405 pdbhh F T 7z53 3 E,F,K,L,Q,R,W,X E,F,K,L,Q,R,W,X Q2G0X2_STAA8 Myeloperoxidase inhibitor SPIN ANFLEHELSYIDVLLDKNADQATKDNLRSYFADKGLHSIKDIINKAKQDGFDVSKYEHV 59 T 0.046 Drf_FH3 unppssm F Bacteria T 7z6m 1 A A A0A0H3LM39_BORBR Putative membrane protein MGSSHHHHHHSSGLVPRGSHMNQPSSLAADLRGAWHAQAQSHPLITLGLAASAAGVVLLLVAGIVNALTGENRVHVGYAVLGGAAGFAATALGALMALGLRAISARTQDAMLGFAAGMMLAASAFSLILPGLDAAGTIVGPGPAAAAVVALGLGLGVLLMLGLDYFTPHEHERTGHQGPEAARVNRVWLFVLTIILHNLPEGMAIGVSFATGDLRIGLPLTSAIAIQDVPEGLAVALALRAVGLPIGRAVLVAVASGLMEPLGALVGVGISSGFALAYPISMGLAAGAMIFVVSHEVIPETHRNGHETTATVGLMAGFALMMFLDTALG 329 T 1.1E-26 Zip unp F Bacteria T 7z6q 1 A,K A,a Q8KAY0_CHLTE Photosystem P840 reaction center, large subunit MAEQVKPAGVKPKGTVPPPKGNAPAPKANGAPGGASVIKEQDAAKMRRFLFQRTETRSTKWYQIFDTEKLDDEQVVGGHLALLGVLGFIMGIYYISGIQVFPWGAPGFHDNWFYLTIKPRMVSLGIDTYSTKTADLEAAGARLLGWAAFHFLVGSVLIFGGWRHWTHNLTNPFTGRCGNFRDFRFLGKFGDVVFNGTSAKSYKEALGPHAVYMSLLFLGWGIVMWAILGFAPIPDFQTINSETFMSFVFAVIFFALGIYWWNNPPNAAIHLNDDMKAAFSVHLTAIGYINIALGCIAFVAFQQPSFAPYYKELDKLVFYLYGEPFNRVSFNFVEQGGKVISGAKEFADFPAYAILPKSGEAFGMARVVTNLIVFNHIICGVLYVFAGVYHGGQYLLKIQLNGMYNQIKSIWITKGRDQEVQVKILGTVMALCFATMLSVYAVIVWNTICELNIFGTNITMSFYWLKPLPIFQWMFADPSINDWVMAHVITAGSLFSLIALVRIAFFAHTSPLWDDLGLKKNSYSFPCLGPVYGGTCGVSIQDQLWFAMLWGIKGLSAVCWYIDGAWIASMMYGVPAADAKAWDSIAHLHHHYTSGIFYYFWTETVTIFSSSHLSTILMIGHLVWFISFAVWFEDRGSRLEGADIQTRTIRWLGKKFLNRDVNFRFPVLTISDSKLAGTFLYFGGTFMLVFLFLANGFYQTNSPLPPPVSHAAVSGQQMLAQLVDTLMKMIA 731 F F Bacteria T 7z8o 2 B B Stapled peptide XCPYVAGXXTCLXX 14 T 0.61 Paired_CXXCH_1 pdbhh F T 7zak 3 C C Synthetic peptide KNLEKYKGKFVREID 15 T 0.42 DUF5678 pdbhh F T 7zb0 2 E,F,G,H E,F,G,H 15mer GFPWXIXXXXXXVIG 15 T 0.038 DUF2897 pdbhh F T 7zb1 2 E,F,G,H E,F,G,H 18mer WXIXXXXXXVXXSXMSTE 18 T 0.31 DUF2897 pdbhh F T 7zbq 1 A A Q8GF97_PHOLU TccC3 MSTTSTNLQKKSFTLYRADNRSFEEMQSKFPEGFKAWTPLDTKMARQFASIFIGQKDTSNLPKETVKNISTWGAKPKLKDLSNYIKYTKDKSTVWVSTAINTEAGGQSSGAPLHKIDMDLYEFAIDGQKLNPLPEGRTKNMVPSLLLDTPQIETSSIIALNHGPVNDAEISFLTTIPLKNVKPHKRGTLEVLFQ 194 T 0.087 UFC1 pdb F Bacteria T 7zcu 3 C S Q6N9P5_RHOPA LIGHT-HARVESTING PROTEIN B-800-850 SUBUNIT GAMMA MSEEYKGHSGHPLILKQEGEYKGYSGEPLILKQEGEYKGYSGTPLILEQKGEYQSFSGTPLILKQEGEYRGFSGAPLILKQDGEYKSFSGYPLLLNI 97 T 0.18 DUF3823 pdb F Bacteria T 7zcx 1 A AAA SLAA_SULAC SURFACE LAYER LARGE PROTEIN MNKLVGLLVSSLFLASILIGIAPAITTTALTPPVSAGGIQAYLLTGSGAPASGLVLFVVNVSNIQVSSSNVTNVISTVVSNIQINAKTENAQTGATTGSVTVRFPTSGYNAYYDSVDKVVFVVVSFLYPYTTTSVNIPLSYLSKYLPGLLTAQPYDETGAQVTSVSSTPFGSLIDTSTGQQILGTNPVLTSYNSYTTQANTNMQEGVVSGTLTSFTLGGQSFSGSTVPVILYAPFIFSNSPYQAGLYNPMQVNGNLGSLSSEAYYHPVIWGRALINTTLIDTYASGSVPFTFQLNYSVPGPLTINMAQLAWIASINNLPTSFTYLSYKFSNGYESFLGIISNSTQLTAGALTINPSGNFTINGKKFYVYLLVVGSTNSTTPVEYVTKLVVEYPSSTNFLPQGVTVTTSSNKYTLPVYEIGGPAGTTITLTGNWYSTPYTVQITVGSTPTLTNYVSQILLKAVAYEGINVSTTQSPYYSTAILSTPPSEISITGSSTITAQGKLTATSASATVNLLTNATLTYENIPLTQYSFNGIIVTPGYAAINGTTAMAYVIGALYNKTSDYVLSFAGSQEPMQVMNNNLTEVTTLAPFGLTLLAPSVPATETGTSPLQLEFFTVPSTSYIALVDFGLWGNLTSVTVSAYDTVNNKLSVNLGYFYGIVIPPSISTAPYNYQNFICPNNYVTVTIYDPDAVLDPYPSGSFTTSSLPLKYGNMNITGAVIFPGSSVYNPSGVFGYSNFNKGAAVTTFTYTAQSGPFSPVALTGNTNYLSQYADNNPTDNYYFIQTVNGMPVLMGGLSIVASPVSASLPSSTSSPGFMYLLPSAAQVPSPLPGMATPNYNLNIYITYKIDGATVGNNMINGLYVASQNTLIYVVPNGSFVGSNIKLTYTTTDYAVLHYFYSTGQYKVFKTVSVPNVTANLYFPSSTTPLYQLSVPLYLSEPYYGSPLPTYIGLGTNGTSLWNSPNYVLFGVSAVQQYLGFIKSISVTLSNGTTVVIPLTTSNMQTLFPQLVGQELQACNGTFQFGISITGLEKLLNLNVQQLNNSILSVTYHDYVTGETLTATTKLVALSTLSLVAKGAGVVEFLLTAYPYTGNITFAPPWFIAENVVKQPFMTYSDLQFAKTNPSAILSLSTVNITVVGLGGKASVYYNSTSGQTVITNIYGQTVATLSGNVLPTLTELAAGNGTFTGSLQFTIVPNNTVVQIPSSLTKTSFAVYTNGSLAIVLNGKAYSLGPAGLFLLPFVTYTGSAIGANATAIITVSDGVGTSTTQVPITAENFTPIRLAPFQVPAQVPLPNAPKLKYEYNGSIVITPQQQVLKIYVTSILPYPQEFQIQAFVYEASQFNVHTGSPTAAPVYFSYSAVRAYPALGIGTSVPNLLVYVQLQGISNLPAGKYVIVLSAVPFAGGPVLSEYPAQLIFTNVTLTQ 1424 T 1.2 Tcp10_C pdbpssm F Archaea T 7zfr 3 C C Synthetic peptide IEFVFKNKAKEL 12 T 6.8 DUF4566 pdbhh F T 7zg0 1 A,B A,B IL27A_MOUSE IL-27 SUBUNIT ALPHA,IL-27-A,IL27-A,P28 MGILPSPGMPALLSLVSLLSVLLMGCVAETGFPTDPLSLQELRREFTVSLYLARKLLSEVQGYVHSFAESRLPGVNLDLLPLGYHLPNVSLTFQAWHHLSDSERLCFLATTLRPFPAMLGGLGTQGTWTSSEREQLWAMRLDLRDLHRHLRFQVLAAGFKCSKEEEDKEEEEEEEEEEKKLPLGALGGPNQVSSQVSWPQLLYTYQLLHSLELVLSRAVRDLLLLSLPRRPGSAWDSGTKHHHHHH 246 T 0.077 XK-related unp F Eukaryota T 7zgv 1 A A A0A2I5TBB8_SERS3 Serratia NucC KEEKLTMTNQAKKLSRINGREFLKQSFNLQQQLLASQLNLSRTITHDGTMGEVNESYFLSIIRQYLPERYSVDRGVVVDSEGQTSDQIDAVIFDRHYTPTLLDQQGHRFIPAEAVYAVLEVKPTINKTYLEYAADKAASVRKLYRTSTVIKNIYGTAKPVEHFPIVAGIVAIDVEWQDGLGKAFTENLQAVSSDENRKLDCGLAVSGACFDSYDEEIKIRSGENALIFFLFRLLGKLQSLGTVPAIDWRVYIDSLE 256 T 0.1 UPF0102 unppercent F Bacteria T 7zhj 5 AA,BA,Z g,e,f Q7Y5E2_BPT5Z Pore-forming tail tip protein pb2 MTDKLIRELLIDVKQKGATRTAKSIENVSDALENAAAASELTNEQLGKMPRTLYSIERAADRAAKSLTKMQASRGMAGITKSIDGIGDKLDYLAIQLIEVTDKLEIGFDGVSRSVKAMGNDVAAATEKVQDRLYDTNRALGGTSKGFNDTAGAAGRASRALGNTSGSARGATRDFAAMAKIGGRLPIMYAALASNVFVLQTAFESLKVGDQLNRLEQFGTIVGTMTGTPVQTLALSLQNATNGAISFEEAMRQASSASAYGFDSEQLEQFGLVARRAAAVLGVDMTDALNRVIKGVSKQEIELLDELGVTIRLNDAYENYVKQLNATSTGIKYTVDSLTTYQKQQAYANEVIAESTRRFGYLDDALKATSWEQFAANANSALRSLQQSAATYLNPVMDTLNTFLYQTKSSQMRVSAMARSASAKTTPAENVTALIENAVGAREDLDTYLKESEERVKKAQELKQQLDDLKAKQAATAPIANALTAGGIGGDESNKLVVQLTNELARQNKEIEERTKTEKVLRQAVQDTGEALLRNGKLAEQLGAKMKYADTAVPGDKGVFEVDPNNLKAVSEIQKNFDFLKKSSSDTANNIRMAASSITNAKKASSDLNSVVKAVEDTSKVTGQSADTLVKNLNLGFSSLDQMKAAQKGLSEYVTAMDKSEQNALEVAKRKDEVYNQTKDKAKAEAAAREVLLRQQQEQLTAAKALLAINPNDPEALKQVAKIETEILNTKAQGFENAKKTKDYTDKILGVDREIALLNDRTMTSTQYRLAQLRLELQLEQEKTELYSKQADGQAKVEQSRRAQAQISREIWEAEKQGTASHVSALMDALEVSQTQRNVTGQSQILTERLSILQQQLELSKGNTEEELKYRNEIYKTSAALEQLKKQRESQMQQQVGSSVGATYTSTTGLIGEDKDFADMQNRMASYDQAISKLSELNSEATAVAQSMGNLTNAMIQFSQGSLDTTSMIASGMQTVASMIQYSTSQQVSAIDQAIAAEQKRDGKSEASKAKLKKLEAEKLKIQQDAAKKQIIIQTAVAVMQAATAVPYPFSIPLMVAAGLAGALALAQASSASGMSSIADSGADTTQYLTLGERQKNVDVSMQASSGELSYLRGDKGIGNANSFVPRAEGGMMYPGVSYQMGEHGTEVVTPMVPMKATPNDQLSDGSKTTSGRPIILNISTMDAASFRDFASNNSTAFRDAVELALNENGTTLKSLGNS 1219 T 0.11 Asp-Al_Ex unppssm T Viruses T 7zhl 1 A,B A,B Q8ZRL0_SALTY RHS repeat protein GASTATVGRWMGPAEYQQMLDTGTVVQSSTGTTHVAYPADIDAFGKQAKNGAMYVEFDVPEKSLVPTNEGWAKIVGPDSIEGRLAKRKGLPVPEMPTAENITVRGEKINGEVEAKC 116 T 0.69 Ntox47 unphh F Bacteria T 7zhm 1 A,B A,B Q8ZRL0_SALTY TYPE IV SECRETION PROTEIN RHS MTATVGRWMGPAEYQQMLDTGTVVQSSTGTTHVAYPADIDAFGKQAKNGAMYVEFDVPEKSLVPTNEGWAKIVGPDSIEGRLAKRKGLPVPEMPTAENITVRGEKINGEVEAKC 114 T 0.69 Ntox47 unphh F Bacteria T 7zhm 2 C,D C,D A0A0H3TET1_SALTM Immunity protein TriTu MLNKFKLWVSKHTDYTVIHNENDLSYSIIIDFEDDRYISRFTVWDDLSCMSEVMDVDTGLYKLNKRNEFSTFDELLDIFDDFMISIK 87 T 0.21 Spindle_Spc25 pdb F Bacteria T 7zjj 1 A A CspZ GAMGRLNQRNINELKIFVEKAKYYSIKLDAIYSEYTGAYNDIMTYIMTYSEGTSSDKSKVNQAISILKKDNKIVNKFKELEKIIEEYKPMFLSKLIDDFAIELDQAVDNDVSNARHVADSYEKLRKSVALAYIESFDVISSKFVDSKFVEASKKFVNKAKEFVEENDLIALKCIVKTIGDMVNDREINSRSRYNNFYKKEADFLGAAVELEGAYKAIKQTLL 222 T 3.4 Pepsin-I3 pdbhh F T 7zjk 1 A,B A,B CspZ GAMGRLNQRNINELKIFFEKAKYYSIKLDAIYNEYTEAYNDIMTYSEVNNVTDSDKSKVNQAISILKKDNKIVNKFKELEKIIEEYKPIFLSKLIDDFAIELDQAVDNDVSNARHVADSYKKLRKSVVLAYIESFDVISSKFVDSKFVEASKKFVNKAKEFVEENDLIALECIVKTIGDMVNDREINSRSRYDNFYKKEADFLGAAVELEGAYKAIKQTLL 221 T 0.0086 LEF-9 pdbpercent F T 7zjs 3 E,F E,F SGO1_HUMAN SEROLOGICALLY DEFINED BREAST CANCER ANTIGEN NY-BR-85,SHUGOSHIN-LIKE 1 SNDAYNFNLEE 11 T 0.26 Menin unp F Eukaryota T 7zjy 1 A A L7IQQ2_MAGOP MAX effector protein GPHMADCTLGCKYLENNRWVSVSKSANIGDTLYIMGHSTKIGRGCKPETTEWSDAEIYSW 60 T 0.031 TGBp3 unppssm F Eukaryota T 7zk0 1 A A L7JNQ8_MAGOP MAX effector protein GPHMGKHGRDDYDCTVIFRNNHAPERQPIVVHTYYSRDLPIELDGVRHTIQLSGCTPEQSQIPQGYSVEHMTYKNYLRQEILNERPFWP 89 T 1.4 Sec-ASP3 pdbhh F Eukaryota T 7zkd 1 A A L7ISI9_MAGOP MAX effector protein GPHMNNVMASSSSDTDSDSSPDRGLSRMCCVYKIHPGGNIWSTKKGEQAWFRRRFSKYEVMAYDRCNLEWGFSGKPRGLTFEFLWDKEAAADGTC 95 T 5.1 PetN unphh F Eukaryota T 7zkp 13 M C Q6CG31_YARLI assembly factor CIA84 MPKNALLRSARQVAISRVFATSRASHVVSHAPILASVRPRSNPAPYRRNFSSSRALRNDYGLDTAERSLKESLVPFNGAPVDRKVVRDQLMELISVSPGQVFPISVIPVVKSAYYELFRENERVLSAGDTKTLFGAVAGNNPEDVQDLPFVLAVYHQAEQAAETNRDSRDNILLLGKYFLFQDRLDNFWKLLEAQIKTHDDVDAGFVKQLLELISVDPHLTLGNVARVLQLKTDNHVSSSDELRNALSATLEQLYYKENEGSEFFLSLVENHILDSKDFTPSDSVVAMILNTCVNEGREDLGQSVLRNVVSRVGNLSPGQEDPQNCWGFWSSVAMDLHGSKTDVKAFISRLEALPHRTKATWDILIRYAVFKADLAGRNDLLQVRALLAEMQKVGFEPDAETYFDAYRSSKSIKPDVVHLFEAELDIEKDTSIFAIEMDKALKNHDTLEALSIFYESFEQGAQWENKRLHMEAMTELLIQYAGLNDTSVADILQLVQRIEPICAQGRIPYSAETAIAQNVLQRHSDTANFYTFMNRQYGNTADKVTKQDPQIRPHTYQVIHDYIYSCESERADLAWEMYGLLHKFYVVPFADYYKAIKFFAQDVKRQDYALLTFQQIRKNHDLHGQPAATSEMVAFLFHEFAKTKYKRGIKRLHEVVALETSFDVNRDVLNEMMAAYVSVEDLNRVQDCWAQLQQLPPSIGANNRSVDVLLSYFKDNIHYTERTWQGIPEFGLLPTLENYEQYLINNCRTGNYRRALEITKNMEIDSGLKPTAKIIAAVYNYTFTEQRKLEVEQWAEKAHPEMWLELKEGDKLKSLCLPANSDNDNVESLLKQASADMDEEMSGGIVKVESV 852 T 0.00062 PPR_2 unppssm F Eukaryota T 7zkr 2 B B Pen3-ortho XDAXYTWECLAWPX 14 T 3.9 Stealth_CR1 pdbhh F T 7zkx 1 A A SRPK2_HUMAN SFRS PROTEIN KINASE 2,SERINE/ARGININE-RICH PROTEIN-SPECIFIC KINASE 2,SR-PROTEIN-SPECIFIC KINASE 2 PVKIGDLFNGRYHVIRKLGWGHFSTVWLCWDMQGKRFVAMKVVKSAQHYTETALDEIKLLKCVRESDPSDPNKDMVVQLIDDFKISGMNGIHVCMVFEVLGHHLLKWIIKSNYQGLPVRCVKSIIRQVLQGLDYLHSKCKIIHTDIKPENILMCVDDAYVRRMAAEATEWQKAGAPPPSGSAVSTAPQQKPIGKISKNKKKKLKKKQKRQAELLEKRLQEIEELEREAERKIIEENITSAAPSNDQDGEYCPEVKLKTTGLEEAAEAETAKDNGEAEDQEEKEDAEKENIEKDEDDVDQELANIDPTWIESPKTNGHIENGPFSLEQQLDDEDDDEEDCPNPEEYNLDEPNAESDYTYSSSYEQFNGELPNGRHKIPESQFPEFSTSLFSGSLEPVACGSVLSEGSPLTEQEESSPSHDRSRTVSASSTGDLPKAKTRAADLLVNPLDPRNADKIRVKIADLGNACWVHKHFTEDIQTRQYRSIEVLIGAGYSTPADIWSTACMAFELATGDYLFEPHSGEDYSRDEDHIAHIIELLGSIPRHFALSGKYSREFFNRRGELRHITKLKPWSLFDVLVEKYGWPHEDAAQFTDFLIPMLEMVPEKRASAGECLRHPWLNS 619 T 5.4E-20 Pkinase unppercent F Eukaryota T 7zl7 2 B,D D,B Pen8-ortho XDACYTWEXLAWPX 14 T 0.32 DUF1666 pdbhh F T 7zlv 1 A,B,C i,h,j FIBC_BPT5 TAIL PROTEIN PB4 MISNNAPAKMVLNSVLTGYTLAYIQHSIYSDYDVIGRSFWLKEGSNVTRRDFTGIDTFSVTINNLKPTTTYEVQGAFYDSIIDSELLNAQIGINLSDKQTFKMKSAPRITGARCESEPVDVGVGAPIVYIDTTGEADYCTIELKDNSNANNPWVKYYVGALMPTIMFGGVPIGSYKVRISGQISLPDGVTIDSSGYYEYPNVFEVRYNFVPPAAPINIVFKAARIADGKERYDLRVQWDWNRGAGANVREFVLSYIDSAEFVRTGWTKAQKINVGAAQSATIISFPWKVEHKFKVSSIAWGPDAQDVTDSAVQTFILNESTPLDNSFVNETGIEVNYAYIKGKIKDGSTWKQTFLIDAATGAINIGLLDAEGKAPISFDPVKKIVNVDGSVITKTINAANFVMTNLTGQDNPAIYTQGKTWGDTKSGIWMGMDNVTAKPKLDIGNATQYIRYDGNILRISSEVVIGTPNGDIDIQTGIQGKQTVFIYIIGTSLPAKPTSPAYPPSGWSKTPPNRTSNTQNIYCSTGTLDPVTNQLVSGTSWSDVVQWSGTEGVDGRPGATGQRGPGMYSLAIANLTAWNDSQANSFFTSNFGSGPVKYDVLTEYKSGAPGTAFTRQWNGSAWTSPAMVLHGDMIVNGTVTASKIVANNAFLSQIGVNIIYDRAAALSSNPEGSYKMKIDLQNGYIHIR 688 T 0.27 fn3 pdb T Viruses T 7zm7 8 H 9 G0SG48_CHATD SUBUNIT NDUFS5 OF NADH-UBIQUINONE OXIDOREDUCTASE (COMPLEX I) MASGYGLNGGPSRCFPFWQELLACYVTNSSEDNPDGKNKCIPVMEDYYECLHHRKEAARVRALQAAYREAEAKKLQENPPTAGQIRNLGLLNKEEDTKKVHCATATQQWMKMCFALSAKMPCLRGIEVFLTTRPDNEKIPEYPHPEGTSARVICGQHLIRSPSLASNLSNSAITPPSSFSPRKANPLVSVYLPSVPGTPFTINYKISQVPPEPCKYLFFRLYINARPMVSWGIDPHSRPYGKVIKSLWLPSDDRYRGLVGFEKRSFVFLPGEEFKSVAEDGGLIEVQVFRAKDRRARTPKLETFRFRDNYGIAAPSIGLLDKPQNAFFYDWLLIDPKDEPFAKFRMHYRSWRNLKSLNLIPSSEWELLLAVSPKALRTAASTGKIEKPTSPAFSDSDSDDSLCSATDSDECVFDDHSKKTKSNRSKESPFAFLNSPPERFRAMAPSSEKLPQPSKLLRDSQRAPYQSRPLPELPVEAGVNSTGLNPPSVKVAADLRRKPSATSMESNAVSITPSLLRCMEEGTLDLEKAEVGIAKLVKVAASGSEPSSSSSSATQLTVSAVREVELRPPQPERKGGLPMDYSFSDYEKSSQSSFGDDERMSNISDMEEKPFPAPPTCYLPTTGSGLERELAMFDSPSPSPVPYSAMEPPVTPSPSSTPYKTKLGRKPLLFSRRLGLFSPRKSLPSDFLLGQAKKLVVEEETTNSNVSLAFGELTVTDMRAESPSPAPKGKPLRRFSTIRVEEISIKEKRPLFNSLRRIASASPRKLAGRVLSMDLGKKGGEEKEG 785 T 0.0017 COX6B unppssm F Eukaryota T 7zm7 30 EA Z G0SEF0_CHATD SUBUNIT NDUFA7 OF NADH-UBIQUINONE OXIDOREDUCTASE (COMPLEX I) MASKAAAAAASNAVSITKKYTVQSTGIWERIRRALVIDPNRSNGVPLNPYNRNPSPGDNPPLEYTDPVTIPAGDIADNPYWKRDFRRNYPRPSVIAQAQQVALLSVGSAAQPRVELIGEEGTKALVAAEEEGKEKGVAKYLEEKGAEEAKRVLALTGGLPPTPSGQTMVTGQWDVHKYGLAEEQSYGGSYPCRSFV 196 T 0.004 CI-B14_5a pdbhh F Eukaryota T 7zm7 34 IA d G0SEZ1_CHATD SUBUNIT NDUFB10 OF NADH-UBIQUINONE OXIDOREDUCTASE (COMPLEX I) MPTPESEAFLAKKPQVPPTFDGVDYEDNKRLKQAQDAIIREQWVQVMMGRLVREELSKCYYREGVNHLEKCGKLRERYLQLLANAKVKGYLFEQQNYWSKENQQQ 105 T 0.00014 NDUFB10 unppercent F Eukaryota T 7zm7 35 JA e SUBUNIT NDUFB2 OF NADH-UBIQUINONE OXIDOREDUCTASE (COMPLEX I) MAGGGQHVSRVHRFLATGLGASMWFWIFYRAKKDGPVLLGWKHPWD 46 T 0.00013 NADH_B2 pdb F T 7zm7 37 LA g G0RZZ2_CHATD SUBUNIT NDUFA3 OF NADH-UBIQUINONE OXIDOREDUCTASE (COMPLEX I) MSATTPRFWSTPLKYCRWAARERPALFFSVVIGALGPVTLATVPPLRRLIGDVDAAPIPLTYPIPPGPRKQLKGYDDDTEDN 82 T 0.11 NADHdh_A3 pdbpercent F Eukaryota T 7zm7 39 NA i G0S569_CHATD SUBUNIT NDUFB6 OF NADH-UBIQUINONE OXIDOREDUCTASE (COMPLEX I) MGGGPKIPYPKHVWSPAGGWYAQPANWKQNTAIFGLVIFGITAMVWKYSAEHEVRHKMPEPDRFYPSRYWVKQIKDYERAQKEKQQNNTEASS 93 T 0.0065 Elongin_A pdbpercent F Eukaryota T 7zm7 41 PA n G0S086_CHATD SUBUNIT NDUFB5 OF NADH-UBIQUINONE OXIDOREDUCTASE (COMPLEX I) MLALRQRAALLARRVRPTVVVPRNARTYASSHDHDHHDHHHDHGHNVEEPLGAAFYIAVGGIASSFVIYNISRPGPNGEPSSLHKWFSKISDYKDEWETRNTLMAAALEQAAHDKHLLLTAERSRHIELKYPEVFSHGSPFNVPAGFYPNLDHVIEHYRKQHLEEEERKAKKLAAAAAAASEAR 184 T 0.061 Glyco_transf_36 unppercent F Eukaryota T 7znz 1 A A B2UR61_AKKM8 FucOB, a GH95 family alpha-1,2-fucosidase KPSASNLIWSDEPAVVVYPQEDKNSEGSFGKYRKPASVWEAEGYPIGNGRVGAMIFSAPGRERLALNEISLWSGGANPGGGYGYGPDAGTNQFGNYLPFGDLFVDFKKGDQPASLSVEDFTRSLDLRDGIHKVNYKADGVTYDREAFSSTPANVLVLNYKASKPGQFSADFSVNSQLGADISAKGSVITWKGMLKNGMNYEGRVLIRPKGGTLSASGDKISVKNADSCMVVIAMETDYLMDYKKDWKGESPSRKLDRYAAKAASADYAALKQAHISQYKSMFDRVKVNFGKTEEDVAKLPTPKRLEAYKKNPADPDLEETMFQFGRYLLLSSSRPGTLPANLQGLWNDYVKPPWACDYHNNINVQMAYWGAEPANLSECHEALVNYVEAMAPGCRDASQANKGFNTKDGKPVRGWTVRTSQNIFGGNGWQWNIPGAAWYALHIWEHYAFTGDRKYLEKQAYPLMKEICHFWEDHLKELGAGGEGFKTNGKDPSEEEKKDLADVKAGTLVAPNGWSPEHGPREDGVMHDQQLIAELFSNTIKAARILGKDAAWAKSLEGKLKRLAGNKIGKEGNLQEWMIDRIPKTDHRHTSHLFAVFPGNQISKLKTPKLAEAARLSLEWRGTTGDSRRSWTWPWRTALWARLGEGNKAHEMVQGLLKFNTLPNMLTTHPPMQMDGNFGIVGGICEMLVQSHAGGLDIMPSPVEAWPEGSVKGLKARGNVTVDFSWKDGKVSNVKLYSAQPKVLPVRVNGKMTRMKTLPLK 761 T 4.8E-18 Glyco_hyd_65N_2 pdbpercent F Bacteria T 7zo0 1 A A B2UR61_AKKM8 GH95 family alpha-1,2-fucosidase MHHHHHHENLYFQGSGADKPSASNLIWSDEPAVVVYPQEDKNSEGSFGKYRKPASVWEAEGYPIGNGRVGAMIFSAPGRERLALNEISLWSGGANPGGGYGYGPDAGTNQFGNYLPFGDLFVDFKKGDQPASLSVEDFTRSLDLRDGIHKVNYKADGVTYDREAFSSTPANVLVLNYKASKPGQFSADFSVNSQLGADISAKGSVITWKGMLKNGMNYEGRVLIRPKGGTLSASGDKISVKNADSCMVVIAMETDYLMDYKKDWKGESPSRKLDRYAAKAASADYAALKQAHISQYKSMFDRVKVNFGKTEEDVAKLPTPKRLEAYKKNPADPDLEETMFQFGRYLLLSSSRPGTLPANLQGLWNDYVKPPWACDYHNNINVQMAYWGAEPANLSECHEALVNYVEAMAPGCRDASQANKGFNTKDGKPVRGWTVRTSQNIFGGNGWQWNIPGAAWYALHIWEHYAFTGDRKYLEKQAYPLMKEICHFWEDHLKELGAGGEGFKTNGKDPSEEEKKDLADVKAGTLVAPNGWSPAHGPREDGVMHDQQLIAELFSNTIKAARILGKDAAWAKSLEGKLKRLAGNKIGKEGNLQEWMIDRIPKTDHRHTSHLFAVFPGNQISKLKTPKLAEAARLSLEWRGTTGDSRRSWTWPWRTALWARLGEGNKAHEMVQGLLKFNTLPNMLTTHPPMQMDGNFGIVGGICEMLVQSHAGGLDIMPSPVEAWPEGSVKGLKARGNVTVDFSWKDGKVSNVKLYSAQPKVLPVRVNGKMTRMKTLPLKSGAGSSQPAAR 790 T 1.3E-18 Glyco_hyd_65N_2 pdbpercent F Bacteria T 7zol 3 C A A0A975BRS1_9BACT TPR-CHAT MSSAFSGLKIPELSVDPAEVFKSDNPQLVSVLLDEFELQEQRPFFSGLIPEKQINIALKKSPQLKKLACHLLEAYEINGRRWKHADRRRVLEKAIRLLEKVSNELKGDIQKLENNVKESGKDSEELNKTREKHGEILADMGRAYLHRAKII 151 T 0.0032 TFR_dimer pdb F Bacteria T 7zpq 81 CC CB CUE3_YEAST CUE DOMAIN-CONTAINING PROTEIN 3,COUPLING OF UBIQUITIN CONJUGATION TO ER DEGRADATION PROTEIN 3 MLSRYNRVIEINGGNADISLPIVKFPPFKLRAQLIEKDPVVWLHLIETYVTYFEYLMQGANVELLDESTLDHLRLFLRTYLHEIADEEGKLLSLGINHDVSEQLYLLKGWIFSLIKKCGLLHLQIFGDSLWNLIKVYVRRNPDSIRGLIDGSLKPRINTQRVQLDKSYQVQQHLKQLIESGKFKRIDLRCVEDLLSAKSMQPNKFAENFFTANWIEILEALWAKGQGRGHKEARELIIISLFSVSADRLLKITKELGISNFETLALYPLLGTMLINEGVHKRLPDLKSKLLFLNLGG 297 T 0.21 DUF4919 pdbpercent F Eukaryota T 7zpy 2 B B Peptide inhibitor (ASP-TYR-ASN-PRO-TYR-LEU-LEU-TYR-LEU-LYS) DYNPYLLYLK 10 T 3.2 Flu_PB1 pdbhh F T 7zqp 2 B,C,D j,h,i TMP_BPT5 TAIL PROTEIN PB2 MTDKLIRELLIDVKQKGATRTAKSIENVSDALENAAAASELTNEQLGKMPRTLYSIERAADRAAKSLTKMQASRGMAGITKSIDGIGDKLDYLAIQLIEVTDKLEIGFDGVSRSVKAMGNDVAAATEKVQDRLYDTNRALGGTSKGFNDTAGAAGRASRALGNTSGSARGATRDFAAMAKIGGRLPIMYAALASNVFVLQTAFESLKVGDQLNRLEQFGTIVGTMTGTPVQTLALSLQNATNGAISFEEAMRQASSASAYGFDSEQLEQFGLVARRAAAVLGVDMTDALNRVIKGVSKQEIELLDELGVTIRLNDAYENYVKQLNATSTGIKYTVDSLTTYQKQQAYANEVIAESTRRFGYLDDALKATSWEQFAANANSALRSLQQSAATYLNPVMDTLNTFLYQTKSSQMRVSAMARSASAKTTPAENVTALIENAVGAREDLDTYLKESEERVKKAQELKQQLDDLKAKQAATAPIANALTAGGIGGDESNKLVVQLTNELARQNKEIEERTKTEKVLRQAVQDTGEALLRNGKLAEQLGAKMKYADTAVPGDKGVFEVDPNNLKAVSEIQKNFDFLKKSSSDTANNIRMAASSITNAKKASSDLNSVVKAVEDTSKVTGQSADTLVKNLNLGFSSLDQMKAAQKGLSEYVTAMDKSEQNALEVAKRKDEVYNQTKDKAKAEAAAREVLLRQQQEQLTAAKALLAINPNDPEALKQVAKIETEILNTKAQGFENAKKTKDYTDKILGVDREIALLNDRTMTSTQYRLAQLRLELQLEQEKTELYSKQADGQAKVEQSRRAQAQISREIWEAEKQGTASHVSALMDALEVSQTQRNVTGQSQILTERLSILQQQLELSKGNTEEELKYRNEIYKTSAALEQLKKQRESQMQQQVGSSVGATYTPTTGLIGEDKDFADMQNRMASYDQAISKLSELNSEATAVAQSMGNLTNAMIQFSQGSLDTTSMIASGMQTVASMIQYSTSQQVSAIDQAIAAEQKRDGKSEASKAKLKKLEAEKLKIQQDAAKKQIIIQTAVAVMQAATAVPYPFSIPLMVAAGLAGALALAQASSASGMSSIADSGADTTQYLTLGERQKNVDVSMQASSGELSYLRGDKGIGNANSFVPRAEGGMMYPGVSYQMGEHGTEVVTPMVPMKATPNDQLSDGSKTTSGRPIILNISTMDAASFRDFASNNSTAFRDAVELALNENGTTLKSLGNS 1219 T 0.083 Asp-Al_Ex pdbpssm T Viruses T 7zrp 2 B,D B,D KCC2D_HUMAN CAM KINASE II SUBUNIT DELTA,CAMK-II SUBUNIT DELTA FNARRKLKGAILTTMLATRNFS 22 T 0.11 SBP_bac_3 unppercent F Eukaryota T 7zru 1 A A KKX29_PANIM POTASSIUM CHANNEL-BLOCKING TOXIN 6,PI6 VDACYEACMHHHMNSDDCIEACKNPVPP 28 T 0.048 Thionin pdb F Eukaryota T 7zrv 2 D,E E,F de novo designed binder ETGASSTNMLEALQQRLQFYHGQVARAALENNSGKARRFGRIVKQYEDAIKLYKAGKPVPYDELPVPPGFGGSENLYFQ 79 T 3.7 DUF5327 pdbhh F T 7zs9 19 S U TOA1_YEAST Transcription initiation factor IIA large subunit MSNAEASRVYEIIVESVVNEVREDFENAGIDEQTLQDLKNIWQKKLTETKVTTFSWDNQFNEGNINGVQNDLNFNLATPGVNSSEFNIKEENTGSALLDTDEVGSELDDSDDDYLISEGEEDGPDENLMLCLYDKVTRTKARWKCSLKDGVVTINRNDYTFQKAQVEAEWV 171 F F Eukaryota T 7zud 2 B M CPSF6_HUMAN CLEAVAGE AND POLYADENYLATION SPECIFICITY FACTOR 68 KDA SUBUNIT,CPSF 68 KDA SUBUNIT,CLEAVAGE FACTOR IM COMPLEX 68 KDA SUBUNIT,CFIM68,PRE-MRNA CLEAVAGE FACTOR IM 68 KDA SUBUNIT,PROTEIN HPBRII-4/7 PVLFPGQPFGQPP 13 T 1.4 MF_alpha pdbhh F Eukaryota T 7zv1 1 A,B,C A,B,C POLG_AIVA8 P2A GAASATPDVDPDDRVYIVRAQRPTYVHWAIRKVAPDGSAKQISLSRSGIQALVALEPPEGEPYMEILPSHWTLAELQLGNKWEYSATNNCTHFVSSITGESLPNTGFSMALGIGALTAIAASAAVAVKALPGIRRQ 136 T 0.0007 Calici_PP_N pdbhh T Viruses T 7zv6 1 A,B,C B,A,C POLG_AIVA8 P2A GPGGAASATPDVDPDDRVYIVRAQRPTYVHWAIRKVAPDGSAKQISLSRSGIQALVALEPPEGEPYLEILPSHWTLAELQLGNKWEYSATNNCTHFVSSITGESLPNTGFSLALGIGALTAIAASAAVAVKALPGIRRQ 139 T 0.00087 Calici_PP_N pdbhh T Viruses T 7zvi 2 B E A4ZF88_9CAUD Sri GAMDPMVTKEFLKIKLECSDMYAQKLIDEAQGDENKLYDLFIQKLAERHTRPAIVEY 57 T 0.42 DUF3173 unphh T Viruses T 7zvo 1 A A Q8A921_BACTN Beta-galactosidase MGSSHHHHHHSSGPQQGLRYEAETATLKGKFRKKEHRKQTGVFFDKGKGNSIEWNISTGLAQVYALRFKYMNTTGKPMPVLMKFIDSKGVVLKEDILTFPETPDKWKMMSTTTGTFINAGHYKVLLSAENMDGLAFDALDI 141 T 0.023 GH115_C pdbhh F Bacteria T 7zw0 37 KA sj YIQ1_YEAST Uncharacterized protein YIL161W MDTKLSVTGAKKSQGKASGLGNEGTPIGNEESTNKAKNGNKKRNKNRNRNKKTETKEQNEPKPVTGGEEVRVEKSQAKNRRRKNNNGANKKNTLHYSKEINVEERKQIAKRQEEIEQCIHTLSDFKLFKKGKHVTSYGYRISPMTDSGKISLKILFNIPLDYPKAPIKLTMKSNEEVSSYMDTVIANFNWKARQLVKEDWRILSQINYLVSELEILKMENYKQIDKLRNSFYKTI 235 T 0.0044 RWD pdbpercent F Eukaryota T 7zwj 1 A A Triculamin SKKSKPGDGIRGKGVRG 17 T 5.7 CP_ATPgrasp_1 pdbhh F T 7zx4 2 C,D,E C,D,E DLGP5_HUMAN DAP-5,DISCS LARGE HOMOLOG 7,DISKS LARGE-ASSOCIATED PROTEIN DLG7,HEPATOMA UP-REGULATED PROTEIN,HURP YRHISFGGNLITFSPLQPGEF 21 T 17 DUF4722 pdbhh F Eukaryota T 7zy4 2 C,D C,D FIP1_HUMAN hFip1 SNAMSAGEVERLVSELSGGTGGDEEEEWLYGDENEVER 38 T 0.002 DUF5404 pdbhh F Eukaryota T 7zzz 1 A,B,C,D,E,F,G,J,K,L J,D,E,F,G,H,I,A,B,C A0A9E7D7B0_9VIRU Major capsid protein P5 MKIATITGVTKSPELQVTKAIGALILSSDVALSALTTEKISIYIERGNGSNVILANKVLLKDFILASTYGTENTQSDADNAMIALCELADEGSIYLADKESIKITLEDLISDKRYDLHGIEEPQQTNNLFFFEQKSVASEEFNKKIDVQGFDLAIMTVDDSVSDLSYQYSNGQVVKYLPFELQTLSRDIDPIQAVLSDGKVVQGLTDRLTLPLVAVVGIEINKSQGSIINFVVRCLKTV 239 T 34 DUF5053 pdbhh T Viruses. T 7zzz 2 H P A0A9E7A4L7_9VIRU Spike protein P13 N-terminal, capsid internal domain MNFIQYIDDSYAVKVKEINSSEGFYINGIQTPFFILSVFIGNKRVTGVEFNNYDSLPMLSVINDLGNIDLNVIPQNYFATAFTEIYFNIPF 91 T 15 GAPES2 pdbhh T Viruses. T 8a06 1 A E A0A222NP85_9VIRU Penton protein P12 DFSTIPIDYVKAKDPNTIDFCLSYLELYHTTKAVKACTPFSFILGSDAGMQRATETTESLYWGKVILDINPNLSPLVNTTIVLEIESMLSSNSINRSENKRITRYIEKENFVNESSERFEFFKSMELSHLSTAYDVYVTFIGFKIDL 147 T 0.028 McrBC pdbpssm T Viruses T 8a09 1 A,B A,B A hexameric barrel state of a de novo coiled-coil assembly: CC-Type2-(QgLaId)4 XGEIAQQLKEIAKQLKEIAWQLKEIAQQLKGX 32 T 0.0058 WXG100 pdbpssm F T 8a0j 1 A,B A,B G0V1V5_TRYCI Uncharacterized protein TCIL3000_11_11110 KAFLALPRGEEQRMRFVDEFLSGAWVRFYSFTTDDVVAMYYSLQPGRYGAFFATEQGVGTAVVDVHSKLVLYVPCMDKDSMNRIQPHPHVLTYFEEDVQLLNISDAQKVLGSVLTGIMNFVQEIARQRGEGLPPPAVHAAYLHERDKTAVPSNTKFAYVRKVFPDPSGSFVLFRLSNLRSQVICNVLMDIRWQSDRQNNVGQRYYVLADGTAEPFTVDHTGILFEVDQVVRNNFRR 236 T 0.31 PH_BEACH pdb F Eukaryota T 8a0k 1 A,B,C,D A,B,C,D Q38DT1_TRYB2 Protein kinase, putative SSVPPTPEERHMLLNGDWIRYYHFYPMEEEGGDSVAVTYHIQPGRTGVTFFNHSFSVHSAVLSVLEHIVYVVDRVDIEEDNDVARILSLAQALNEEKKIYDVLQLVETHDTHMLKQRRSPGIMSVYCPPQTAFQCNGDPFVFVRWYRFHMENSMSGFMLSNGAVQVFVGGKYELRWLDDNRKFIVRSNGVCEVLDEEKFPLSEELNQMLYGGV 213 T 0.024 DUF4704 pdb F Eukaryota T 8a22 21 U Am uL18m PLKPLVFRPLKLFFWATNRHVHAKVVRFSSPFNEGVPVVDISTFDAFNKLKGSAPVVAPRSLECYKEVAKMVKEETARQNINEVTLHLNSHASDRGVREVVRELKNLGLLVKKV 114 T 0.0038 Ribosomal_L18p pdbhh F T 8a22 33 GA Ay bL31m LVRPLTKAMTVVLSNGATLRLPTVYARAKPWFPVMDLHSHNVWKHKIKTDFQLESEKNITPDFSNFYNKFGK 72 T 0.002 Ribosomal_L31 pdbhh F T 8a22 39 MA AE mL40 EGNTRLQKVVSFFVPEVEKKEEEEKLATQYKRWKVAQVHAWNHDIAVKHRLQTEAIASLPQRLKEQALKPDYSPIPLNRKLLFHTPPESYRD 92 T 7.3E-05 MRP-L28 pdbhh F T 8a22 43 QA AI mL63 VVFKTTGGKAWNPPGGLKPLTNTQKRSRKENLQILLRNLSVLKLAAENQPEVTVNLFSPLKFMH 64 T 0.0048 L31 pdbhh F T 8a22 45 SA AK mL87 RPIMHKNWDWEFVVGAKAGRKPAIQRPKPHQWYYCNPKYSAEDPLPTKIFPPHAPPTAESLDDWAKFRKLCPKDPVEAKKFRKHFVRFLNQRNYDWRTAFERGLAKEVAVAKAAQRAEDETKRQEAWHAYRTAVFESAL 139 T 6 Serglycin pdbhh F T 8a22 46 TA,UA,VA AL,AM,AN mL116 GTIFNTGVPGPRPEVAQKLSTEYQGHILRMISLAESASELDEVLWSSKKHLRPVHIARSCLKLEYLRTKEKGREVSEPIKNLASELENYVELYSTKFTIGQVSQLVRGLSSIRRNIQPDLLLKLAAVVVADDGRQVQLANEMDCRDLFFGFFSQGFDNELFWKRLSESVLPRLPYFNADVVSTVLRVVSGLRFLHNTEFAHATMTALVPKVGDLSPARLADAFFSASLLDPTDVSGLNAKLEERFLREFTSFPIKDTVTMFQTVTVRRHSTPELAAQVAPLVAAQAHQLPVRHLRRALEGMVTAGWKDTAEIPLYAILAKQAARLVLGKQSAATSAILGKHVDNQGYQRTPVQLLRQLARIFANTGLKAGPGANQPLAPYFAALQRELEGRLAELDEQVTDDFAESFKKVGIAEGARVQI 420 T 9.8 FAST_1 pdbhh F T 8a22 47 WA AO mL118 YTWHFLSRQRVEAVNKATDILELEDIMRLEGNKYDYIAIRAFLKRVCILLQERADALGLPPSNEGLLVRFDEPERARYEALVSQVCDVVSARAKWFDPSNAAAVAYCLTRWLGRAEAPLIEQLLRRVVARLPEAKSKDVQYALDATLESAAAPHLEHLREPMLRAAGAFLGAKLPTGRVPPEVVAKITRLLVNHWDQPDEELLEAIVTDIAVRLEIYSPTALGRTLLALSKVPALTGAAFKRSRSSFLPEGVNVPSGADVAVPLADACLAHVAAHAAEHANEHDLIKFLGAISKLASPGRAATAGADAGAEATESGAAWAKRNSASLAWFALEQRLAPSTRGSFEGNQFPFVIKLVSAAARPPPAVTKFISSTVAKE 377 T 3.4 TAN pdbpercent F T 8a22 48 XA Xa mL120 IEEYYVVPPACPPPPHNPKTLKYVPKLNRTQIIARVYEAKTPTELENSALGKKFFSEFAAVAKLVRLSQLRRANVYNSRDDAMCVSIYNTSVRVADKLLHLSSDEELCGLIWALSQLPYPEYENLVDRSLQILLEEDKPLKTGSSLAVSRAAAGLASLGRWDASTWEVLVPLLRKNVQEGKEVELSNLALGLYDARETV 199 T 8.9 4HB pdbhh F T 8a22 49 YA Xb mL121 ITTLEFYEQNKEKLSFLVGNDVYSEFEEKWKPKPKVFDPEIPIAEAMWPEGRSEKLQSVIVKKLSFSTESAGSVVAGMKHGYCPDSVDVIVSRVEQLKAFEGYFPGFKTEEFVSVNPRLLNFSTDRIYWAMLTFQDMFSSAEVGPLMANIGHFIIENPIRCAKNLFCLATELESQLSMKLDVTAVKQDSWFTLSSEETIRERVAALATIFGSKTAGDILLRDINYLRLDSKDVNRDAVKIREHF 244 T 0.054 DUF732 pdbpssm F T 8a22 50 ZA Xc mL122 RKNIVWPEQLEEQKRNKQYEYQWVISNGKKFIARRESTSTKWEIWKSIEKVTPGQKP 57 T 0.0053 HTH_56 pdb F T 8a22 52 BB Xe mL124 FHTGVNLVQPIDTSKLTRQIKKLTLLHEAALTVLQYSNYCNPEQATEILRRLPFLMRHEESRVLKGQTLDPKLPPMFHGLLHVMGDRFVQVFSDCNLRQIERGAWALAAARHQHDGVALALSEKLKQLTQELLDLNAKPFNTRVTKPTPEQLNSGIFASRVLVPESVNQLPVKAVLPEFNALAGIAWALATVAGEHSAAAAKAALEQLAEKFGALQVDPKPLPDADSLCRLAWAFAKAGVHNPAAVDKLFHLAEERLKSQLQAHDPASGPLRPRCTYRYKTVRGWVDQHFPRKPRDSSYLGDTAPKIIPRDFEIDSLGSLLSAAALLRDQVPVERLQTILNLAAQHTAASSVAGGALQPLMVTYEEVTRVLAACEQLGFRSSTLVTPLLHGLPMAALSAEALSQLAAAATLHHVRSRTVYLRIVRAFNAKLSVSPTLVAGAGIGAEGKKEGEAAAALGAQLLLAVTKAGLPANASVSRIASLV 483 T 11 GYR pdb F T 8a22 53 CB Xf mL125 RPALTPSFSRVSDPWTGEKEAKYAAPYRIPEEVWKNSGAPKILFQDPWNSPDYDEVRKKHAVLVHDYLKQQSQPINVQTILEGVNKTHGLVLGTIEYVTSLLENMLWHDMAYVVKPVFSSPRKAKLSKIPLLYGANKYQQVFRGTPKEVAERYEAARAKHIKVAFTRLRTSKTPQPFRRRTDEYSHVQASQSALGLAAAAA 201 T 1.6 HARE-HTH pdbhh F T 8a22 54 DB Xg mL126 SVKYIPNHAATPNKYKDAQQKVLWDRAKKLGKKPEYKVPNIKDTQTVFEIGKLTKLCLEHWKPMHFAAALGHVINVWTTQALKSGRYGGKSFTVRELLGFRSLPYGVNSITAVLPLQSPEDFLSQPLAKQPFSFKPVSVREEVKKIIASNPGLLIHNWSLKIEGQPNHPITDEDRAAAVIAICTSSFRARFNEAGDVAVALVLSRLARCGYWLPPLYELIAPFAAFQGARIDHSSPAVIANVLLVLARAKGQAEMGQPTALQIRAIAPALEQKCLQRLGELLPSLEALVISDTLAATALLSSPEARALLAQIKAEVLARNFLGFESRDIIACFKELVANVYQPLQLSADLPAPGELRDELPGGEKVLDEQLLAALSGAVVEGGALXXXXXXXXXXXXXXXXXXXXXXXXX 410 T 15 hDGE_amylase pdbhh F T 8a22 56 FB Xi mL128 KFQSRAEKKYRIMDEKVGKPRFQA 24 T 11 HJURP_C pdbhh F T 8a22 57 GB Xj mL129 AKRLLREAPPFTEQVDEKNFDDKDYLGAGAMSNEQLRKELEKVEPGEAKWDEESPLIPPPQPRQYRNKGHQ 71 T 1.5 La_HTH_kDCL pdbhh F T 8a22 62 LB Ba bS1m VSLPMTKAELRASLLYKMRKDTLTKVIDSSLSTSVLTIEEKEKIDRTLYAHIETENNHIDPGLHCLIRALRRSNLTMPILKGQQIQAKVIQKTDEVMLLDPGFYNLSEVPVNYLTTAHIVRKVDDSPRENLYDVRPGDVVKVLVDDVYTPYGDMQLDVPQQDPRLILNQVWDELHLKMKKKELVRGRILNECKSGYAVGVAGFVALLPYANTSREVANRVGEAQSFQIKSMSEPHRRRIVLQ 242 T 0.00036 MRP-S35 pdbhh F T 8a22 69 SB Bh uS8m AAQAYFDLRYHVKKQGLLTVNRAASIINSIFPEFSHESHRNQLAVPLPRKEIPTYIMQNAKVQPWALLPTKAAAYAQYPNFFRSSSLFFGSLNREIVNRRPYSLLPADKLSMDLAQVCTNLGILNGWDIVQKREKLKDLDFVWPANELPRDHHEVKLFKHLHLRLALKWEQHKPLWEDGSMVKDQREYRDQQQVQQQQPLPHLPLAPLFGPLPLTVRNLSKASQPVLLYPLQLRELAQRMPSGLFLLYHHELGVITDAQAFLFDVPVVALAHVGLPVSMAAAVNGAVNRTFRAELGKPLREVTKLKDWSLSATIAAQVRERRQQLLERAEQTKRERKQIQDLVTVRVGKFKAEVDKEDSSLALQDELLAWQLKE 374 T 59 OPA1_C pdbhh F T 8a22 82 FC Bu mS23 AFSRYFVQKFKQSYTRKYMRDMESGAFSFPKCHDILGKYRPDVLFAAAAAPLKLELPEQAVYKKLYRDFPELRKDAVDLSSLEAPLAKQFALKHLVLSAEIAANSPRTRHILRRDLEADPAYERLKEEFMPRIAELRKQQEQTASLQQLQADEEEHLKLALTYVAAQ 167 T 64 UIM pdbhh F T 8a22 85 IC Bx mS31 SSAARWRAAIAQRLGVEAAAAAQALAALLGQGDLALTVLAAASEADVLNITELLENNSVDEAVTNARKVAIVSGHGLFLATATSEDLAALSDVEAGELAALMGKVHVVGLPLADALLGSDSLTHDQLLTLTRSEKQALLWRLASVGKLREGRAKAVAALRKAALDRAAAAAEASEGLLSAAAMMKLEHDIAEFDLVRERYLPGPGLPEGVQEAFAPSGLPSAFSRDEQALYDAYFGLRSHAASAQPEPLEGPSAAQLHSSFLDGFQCREEDSQMEELPESFGQWVANIKGLIVKAPVPLLGLLAKFVTAKIDGADARDASETQSRLRLLAAEIATDIARRREARLAVSPWWQRASAPIDALAISSIDHPSSDPLVQLLEVLLGHSGADEFGSWISAVAMRPVSPYEILADEHRLMDLERYLSMTSASELHLELAATPLPWASPAVHVPPAAFLEEMRAKFNNYLLATGLSPLSAAEWSAYKDWALEEFAEKRALGEEALLQEGHSGFFNPKADEIYLRALLEATIPPEAPLREQAVRYLETVNMNKTWTFLKKKHMVQRLAELSRHLTEHPPVEEQGSPFAALFAVGPGAKPTPLVPKLSKRLPAHGPESLDLPELPEIFR 621 T 43 MRP-S31 pdbhh F T 8a22 91 OC BD mS45 PSVNDLASLLSLSEQYRGADVLAEGAALPGTGFANARGTFLPHELPTAIEYLKELDPEAEMKLEQMEAMYKLLYSRNESEREVGRQMMYDLLKLSGHPFRELELCNWDYMAAFLDARVAGRVFHRGSGERLVHRTATFPAFEGYPLAEVDQTTEGEVSKLNREESKRQDNAMFQDFRKKLLFNLGMVGEQLWEPVQGVLSANLRSALDRPLVVYDITAATGETVYPPKFVAEVDGTRRALNEQERAYQAKRKPGPRLPYYMRRIARKEEL 270 T 0.19 TOM6p pdbpssm F T 8a22 92 PC BE mS106 IAPSQLDKLEKFVHVRPPKTDYEEDIKQAISSVTDNEGLKKCLDLFLTNHAAQTWVGKHEYANAGQVSDFIKVCEKVGSSAPLVTLWQSAYRYGVDPTVPLLRSSAAACAALGEGADAALVLLYGSCYIVNVPEDLAAAVRTALEAHEKAQEGNAEALAKVQKYREALDRL 171 T 0.019 PfaD_N pdb F T 8a22 93 QC BF mS107 MATIPKGLDIDPESPMLYHYFKSIHPHQVSFRIKKRKQLQHLWELCKLYENKMDTLASAAMLGQLFRLQKRNNPDYSVELANQIFEHCVKRLSFTIRFATYQEIVPVLFTLARMNVSIVPSDTLLLDPTHRVSREFVHLFLKRAVRNHVHIRVVNPRQMARVLWATAKLFPEDQRMDPRVQDAVDKLARSSVKRLSELHPGSLSIYASAFAKLSPAPTSQEGPLKDVDVSSWDATITGVKSSLLDLDSKELAFVARARTLKVFQGISREILLRVGDLNHEQFTVRNVFHVLGAYIRAQIQDPLVAKVLAENITGRIQDVYAEELIALVRAAERLDGFKNPDLTAAVLRRAREVDLPEETQKDYAKRLQSA 370 T 43 TYA pdbhh F T 8a22 94 RC Ya uS4m-2 (fragment) ADLVRHLQSSGSKLQKLANLTSASCSYRDISVSLFGLQARQLGCTKPFVYTFSASEQAQSKSKPFDGKLRLPDVSQLSDTITVSGDAAPATLNLDQKYMDKISCWTQTIPSHLEMSYQNLTSVKLFPPVDASYPTYVDFEHATRLLDHFTSRKRLNYQRKMIKKRDKFQIKSWDHHAGEA 180 T 23 KN_motif pdbhh F T 8a22 95 SC Yb mS108 HRFRNNKFLRLEPDLDPKVYGQTETLQKQVDDNFSLLLAKHRLDMKAAAA 50 T 6 SMAP pdbhh F T 8a22 97 UC Yd mS110 TTRRRKLLGSRYGARLAKKNRQQFERARVILDCYSDEELRPDSPPVAVKEVTINTLQTMRFLFPQTSKEHLDIKQNIASYKIFSLNRELLSLLPK 95 T 1.3 DUF3135 pdbhh F T 8a22 98 VC Ye uS3m-2 (fragment) GRHLLPATAIRVRLNRGFESAWYTDVSYREMIKKDFLLAKLASSFVNRSSRASLRQIFPGGKDFPNFRTSRIFMQHLPYKSYASTFSYVAPKDGPQAKYGLFQSKL 106 T 20 YebO pdbhh F T 8a22 99 WC Yf mS111 PFNVSDANPKDVEFLQVLLSKFLPDADKATVYRTGQEPRRLRLGDLPATSQFMESFVSEKLPKEPLYDMPSWLANNMPQYDAQPKSPHYHWSSWMRQHLSLDLQRLYAAFAEYMASEPHRLGIVRQANFELARLWDWQHRRVAAGLSPDL 150 T 8.8 DUF2497 pdbhh F T 8a22 100 XC Yg mS112 SADVYKEFFKMARVAVRTMKEPTKSIMKDLQRNARRSENIQRDNNLDKGVYMRFLRQRAGLSVPPIK 67 T 9.2 CCDC53 pdbhh F T 8a22 101 YC Yh mS113 GADGVLRRHNEVRGALRFFLDSWYKNKTSGTIADKNQVMFDYLKYKGVTEIAQHLHAPAPQVFRK 65 T 0.12 Nudix_N pdbpercent F T 8a22 103 AD Yj mS115 EYNGQGYVFSLLQRPPAPTLELLAEYLTVKYQDVIAQRDFVTHILGRMSVLERGGELPAADAAASGTWTGGAKRRLSPQEIRDINGELNRLFDADLNEYVSLAQRLATENVLSPADLATCLQAARSKAQTSSFASLAAPGSSNVDRNILAQVLQGKQDVSALAAAAAAAAASGPEGARVAWDEALQVGKYGAWATKAKAWAADDIAARREKGQQISPEQEAALVCLWDNPLSYDAAAGLWHQYAEKAGAVSAPSLADVISADQAIQAAKAAAAADPASLPAVKATAEKAAQVQEAVKKLYLGFAARQGSTSGAVTVDGVPLPFADVVKANAELDVASPAALAAAFQPLELGELLACHWEAVSRTFMWEDMYQLMLETAKEIEVNGA 386 T 0.15 Ykof pdbpercent F T 8a22 104 BD Yk mS116 GPLPEDVFLVAPKVAAAVQQTQAQLIDLLAPYGYSFDAFSEAVLEDLSKTKELCVKARFVLWEARVLEALEAVRPFVSGPVFRTESEAAALT 92 T 5 POTRA_TamA_1 pdbhh F T 8a22 105 CD Yl uS7m-2 (fragment) TVVLAPSKYDSQLKIPLKPTEMDEFEELRSFVDISIEKEADYVMNKFVGRLIKGGEKATAQQVLLRTLLHTRRLMQEGNITSLK 84 T 0.13 Ribosomal_S7 pdb F T 8a22 107 ED Ub mL105 RDEIIKLLESRKDMDVNGYVMYCREELGKLTVPRPRAPPVSPKHEDYKTFVDEERVTYMRMKQHEKISLFLTEEEKNTVTTKGKDILDDKRFIQTIASRTGFYIAEEVRDCLSEFFNFRDSSRRLLTYYAD 131 T 0.025 DUF5863 pdb F T 8a3g 1 A,B A,B apCC-Tet* XGQLEEIAQQLEEIAKQLKKIAWQLKKIAQGX 32 T 0.01 WXG100 pdb F T 8a3i 1 A,B A,B apCC-Tet*3 XGQLEEIAKQLQQIAWQLKKIAQGX 25 T 0.68 DUF5320 pdbhh F T 8a3j 1 A,C,E,G A,C,E,G apCC-Tet*3-A XGQLEEIAKQLEEIAWQLEEIAQGX 25 T 0.063 DUF5320 pdbhh F T 8a3j 2 B,D,F,H B,D,F,H apCC-Tet*3-B XGQLKKIAKQLKKIAYQLKKIAQGX 25 T 1.2 DUF5320 pdbhh F T 8a3t 7 J S HSL1_YEAST HSL1 isoform 1 MTGHVSKTSHVPKGRPSSLAKKAAKRAMAKVNSNPKRASGHLERVVQSVNDATKRLSQPDSTVSVATKSSKRKSRDTVGPWKLGKTLGKGSSGRVRLAKNMETGQLAAIKIVPKKKAFVHCSNNGTVPNSYSSSMVTSNVSSPSIASREHSNHSQTNPYGIEREIVIMKLISHTNVMALFEVWENKSELYLVLEYVDGGELFDYLVSKGKLPEREAIHYFKQIVEGVSYCHSFNICHRDLKPENLLLDKKNRRIKIADFGMAALELPNKLLKTSCGSPHYASPEIVMGRPYHGGPSDVWSCGIVLFALLTGHLPFNDDNIKKLLLKVQSGKYQMPSNLSSEARDLISKILVIDPEKRITTQEILKHPLIKKYDDLPVNKVLRKMRKDNMARGKSNSDLHLLNNVSPSIVTLHSKGEIDESILRSLQILWHGVSRELITAKLLQKPMSEEKLFYSLLLQYKQRHSISLSSSSENKKSATESSVNEPRIEYASKTANNTGLRSENNDVKTLHSLEIHSEDTSTVNQNNAITGVNTEINAPVLAQKSQFSINTLSQPESDKAEAEAVTLPPAIPIFNASSSRIFRNSYTSISSRSRRSLRLSNSRLSLSASTSRETVHDNEMPLPQLPKSPSRYSLSRRAIHASPSTKSIHKSLSRKNIAATVAARRTLQNSASKRSLYSLQSISKRSLNLNDLLVFDDPLPSKKPASENVNKSEPHSLESDSDFEILCDQILFGNALDRILEEEEDNEKERDTQRQRQNDTKSSADTFTISGVSTNKENEGPEYPTKIEKNQFNMSYKPSENMSGLSSFPIFEKENTLSSSYLEEQKPKRAALSDITNSFNKMNKQEGMRIEKKIQREQLQKKNDRPSPLKPIQHQELRVNSLPNDQGKPSLSLDPRRNISQPVNSKVESLLQGLKFKKEPASHWTHERGSLFMSEHVEDEKPVKASDVSIESSYVPLTTVATSSRDPSVLAESSTIQKPMLSLPSSFLNTSMTFKNLSQILADDGDDKHLSVPQNQSRSVAMSHPLRKQSAKISLTPRSNLNANLSVKRNQGSPGSYLSNDLDGISDMTFAMEIPTNTFTAQAIQLMNNDTDNNKINTSPKASSFTKEKVIKSAAYISKEKEPDNSDTNYIPDYTIPNTYDEKAINIFEDAPSDEGSLNTSSSESDSRASVHRKAVSIDTMATTNVLTPATNVRVSLYWNNNSSGIPRETTEEILSKLRLSPENPSNTHMQKRFSSTRGSRDSNALGISQSLQSMFKDLEEDQDGHTSQADILESSMSYSKRRPSEESVNPKQRVTMLFDEEEEESKKVGGGKIKEEHTKLDNKISEESSQLVLPVVEKKENANNTENNYSKIPKPSTIKVTKDTAMESNTQTHTKKPILKSVQNVEVEEAPSSDKKNWFVKLFQNFSSHNNATKASKNHVTNISFDDAHMLTLNEFNKNSIDYQLKNLDHKFGRKVVEYDCKFVKGNFKFKIKITSTPNASTVITVKKRSKHSNTSSNKAFEKFNDDVERVIRNAGRS 1518 T 3.1E-06 Pkinase pdbpssm F Eukaryota T 8a44 2 B B A0A1P8P1S7_HUMAN DUFFY ANTIGEN/CHEMOKINE RECEPTOR MGNALHRAELSPSTENSSQLDFEDVWNSSYGVNDSFPDGDYDANLEAAAPAHSANLLDDS 60 T 0.099 DUF4120 pdbpssm F Eukaryota T 8a49 2 B,C C,D ENDOS_STRP1 Secreted endoglycosidase EndoS MIPEKIPMKPLHGPLYGGYFRTWHDKTSDPTEKDKVNSMGELPKEVDLAFIFHDWTKDYSLFWKELATKHVPKLNKQGTRVIRTIPWRFLAGGDNSGIAEDTSKYPNTPEGNKALAKAIVDEYVYKYNLDGLDVAVLHDSIPKVDKKEDTAGVERSIQVFEEIGKLIGPKGVDKSRLFIMDSTYMADKNPLIERGAPYINLLLVQVYGSQGEKGGWEPVSNRPEKTMEERWQGYSKYIRPEQYMIGFSFYEENAQEGNLWYDINSRKDEDKANGINTDITGTRAERYARWQPKTGGVKGGIFSYAIDRDGVAHQPKKYAKQKEFKDATDNIFHSDYSVSKALKTVMLKDKSYDLIDEKDFPDKALREAVMAQVGTRKGDLERFNGTLRLDNPAIQSLEGLNKFKKLAQLDLIGLSRITKLDRSVLPANMKPGKDTLETVLETYKKDNKEEPATIPPVSLKVSGLTGLKELDLSGFDRETLAGLDAATLTSLEKVDISGNKLDLAPGTENRQIFDTMLSTISNHVGSNEQTVKFDKQKPTGHYPDTYGKTSLRLPVANEKVDLQSQLLFGTVTNQGTLINSEADYKAYQNHKIAGRSFVDSNYHYNNFKVSYENYTVKVTDSTLGTTTDKTLATDKEETYKVDFFSPADKTKAVHTAKVIVGDEKTMMVNLAEGATVIGGSADPVNARKVFDGQLGSETDNISLGWDSKQSIIFKLKEDGLIKHWRFFNDSARNPETTNKPIQEASLQIFNIKDYNLDNLLENPNKFDDEKYWITVDTYSAQGERATAFSNTLNNITSKYWRVVFDTKGDRYSSPVVPELQILGYPLPNADTIMKTVTTAKELSQQKDKFSQKMLDELKIKEMALETSLNSKIFDVTAINANAGVLKDCIEKRQLLKKLLEHHHHHH 906 T 0.00021 LRR_4 pdbpssm F Bacteria T 8a4o 1 A D I2G262_USTHO Effector protein Uvi2 MGHHHHHHHSMDITFTADKFARRAEEAAPVAVKPPRNPEFGIFLNNRYLLHNGEGLPKPKDVKETYPECKWRKYGQWAWLDENNVQCYLGPSYKYHAYSPAKNFDPVPSIQRGACADTANPQDFPQGIPRYTISVPYLYFNNFYDRRCKVRALVKVPQTDKEKEHWIQAWVVEHNGGNWSTKSGDLGPNGPQEGIMLDTKLYPKFLNSGDKDIGVLPNKVEWFFLDINTIG 231 T 4 DUF3868 unphh F Eukaryota T 8a50 1 A,B A,B HSF2B_HUMAN Heat shock factor 2-binding protein EFVKVRKKDLERLTTEVMQIRDFLPRILNGEV 32 T 0.021 Exonuc_VII_L unp F Eukaryota T 8a57 13 M H RL3_LISMO 50S ribosomal protein L3 MTKGILGRKVGMTQVFTENGELIPVTVIEAAQNVVLQKKTVETDGYEAVQIGFEDKRAILSNKPEQGHVAKANTTPKRFIREFRDVNLDEYEIGAEVKVDVFAEGDIIDATGVSKGKGFQGVIKRHGQSRGPMAHGSRYHRRPGSMGPVAPNRVFKNKLLPGRMGGEQITIQNLEIVKVDVEKNVLLVKGNVPGAKKALVQIKTATKAK 209 F F Bacteria T 8a5a 5 E X IES4_YEAST Ino eighty subunit 4 MSQESSVLSESQEQLANNPKIEDTSPPSANSRDNSKPVLPWDYKNKAIEIKSFSGYKVNFTGWIRRDVREERQRGSEFTASDVKGSDDKATRKKEPADEDPEVKQLEKEGEDGLDS 116 T 0.29 INO80_Ies4 pdbhh F Eukaryota T 8a5l 2 B B POLG_HE71 2BC peptide TIEALFQ TIEALFQ 7 T 29 BBP1_N pdbhh T Viruses T 8a60 2 B B LLP_BPT5 Lytic conversion lipoprotein MKKLFLAMAVVLLSACSTFGPKDIKCEAYYMQDHVKYKANVFDRKGDMFLVSPIMAYGSFWAPVSYFTEGNTCEGVFHHHHHH 83 T 0.00082 LPAM_1 unphh T Viruses T 8a62 2 B B FOXO1_HUMAN FORKHEAD BOX PROTEIN O1A,FORKHEAD IN RHABDOMYOSARCOMA PRSCTWPLPRX 11 T 12 FOXP-CC pdbhh F Eukaryota T 8a65 2 B B FOXO1_HUMAN FORKHEAD BOX PROTEIN O1A,FORKHEAD IN RHABDOMYOSARCOMA RSCTWPLP 8 T 8.6 PCSK9_C1 pdbhh F Eukaryota T 8a68 2 B B RAF1_HUMAN PROTO-ONCOGENE C-RAF,CRAF,RAF-1 QRSTSTPNV 9 T 58 NB pdbhh F Eukaryota T 8a6f 2 B P RAF1_HUMAN PROTO-ONCOGENE C-RAF,CRAF,RAF-1 QRSTSTPNVHX 11 T 68 ALC pdbhh F Eukaryota T 8a6i 1 A A TADBP_HUMAN TDP-43 GGMNFGAFSINPAMMAAAQAALQSSWGMMGMLASQQNQSGPS 42 T 0.0067 Glucosaminidase pdb F Eukaryota T 8a82 2 B B K7XCU4_SERPL OocQ MSEYLINSGEFNMIVCPADKAYYILNDDRASTETLQEFLDGEKVQYHRLKPLWFKYRADESWQDLNKKEYRLGKELSEAELIDRFVLKAFNFGSLVAVRDSQTGAVKIFKRDKLKMSVR 119 T 0.0091 NOGCT unppssm F Bacteria T 8a8c 2 B B RBP5_BPT5 RBP-PB5,TAIL PROTEIN PB5 MSFFAGKLNNKSILSLRRGSGGDTNQHINPDSQTIFHSDMSHVIITETHSTGLRLDQGAGDYYWSEMPSRVTQLHNNDPNRVVLTEIEFSDGSRHMLSGMSMGVGAKAYGIINPQIMSQGGLKTQITASADLSLDVGYFNTGTSGTIPQKLRDGTGCQHMFGAFSGRRGFASSAMYLGGAALYKSAWSGSGYVVADAGTLTIPSDYVRHPGARNFGFNAIYVRGRSCNRVLYGMEGPNYTTGGAVQGASSSGALNFTYNPSNPESPKYSVGFARADPTNYAYWESMGDPNDSANGPIGIYSEHLGIYPSKITWYVTNLVYNGSGYNIDGGLFNGNDIKLSPREFIIKGVNVNNTSWKFINFIEKNFNVGNRADFRDVGCNLSKDSPSTGISGIATFGLPTTESNNAPSIKGGNVGGLHANVVSIYNFLPSASWYVSSNPPKIGNNYGDVWSENLLPLRLLGGSGSTILSGNIVFQGNGSVHVGTVGLDLNSSRNGAIVCTMEFIDDTWLSAGGIGCFNPTEMLSQGAEYGDSRFRIGGNTINKKLHQILSLPAGEYVPFFTIKGTVVNACKLQAAAYNPTPYWVSGLPGSVGQTGYYTLTYYMRNDGNNNISIWLDSSMSNIIGMKACLPNIKLIIQRLTHHHHHH 646 T 0.18 NPM1-C unppercent T Viruses T 8a8x 2 B,D B,D Q80J95_9CALI MNV1-NS3 C term peptide HDDFGLQ 7 T 4 DUF4175 pdbhh T Viruses T 8a9a 1 A,B B,A Y213_MYCPN UNCHARACTERIZED PROTEIN MG075 HOMOLOG NKTHQVEHESEQSDFQDIRFGLNSVKLPKAQPAAATRITVENGTDKLVNYKSSPQQLFLAKNALKDKLQGEFDKFLSDAKAFPALTADLQEWVDQQLFNPNQSFFDLSAPRSNFTLSSDKKASLDFIFRFTNFTESVQLLKLPEGVSVVVDSKQSFDYYVNASAQKLLVLPLSLPDYTLGLNYMFDHITLNGKVVNKFSFNPFKTNLNLAFSNVYNGVDVFEAQKNLVGKGKYLNTHVKAEDVKKDVNANIKNQFDIAKIIAELMGKALKEFGNQQEGQPLSFLKVMDKVKEDFEKLFNLVRPGLGKFVKDLIQSSSQAENKITVYKLIFDNKKTILNLLKELSIPELNSSLGLVDVLFDGITDSDGLYERLQSFKDLIVPAVKTNEKTAALSPLIEELLTQKDTYVFDLIQKHKGILTNLLKNFLADFQKSTPFMADQVAIFTELFDNEGAFDLFGEADFVDKIAELFLTKRTVKNGEKIETKDSLLVTSLKSLLGEKVAALGDLLDSYIFKNELLNRSVEVAKAEAKDTKGATDYKKEQAKALKKLFKHIGENTLSKTNLDKITLKEVKNTENVELEETETTLKVKKLDVEYKVELGNFEIKNGLIKAMLEFLPDTKDLETTLDKLLFKGESYKAMKDKYIKEGFPGYGWAKGVVPGAFESIENTFKSAIDKTKSIRDLFGDMLFGNDLSSVKETDSFITLGGSFDIKYGGENLNVLPAYYSLINSEIGYQIIGVDTTIDATKVKVELKNKEYKGKSPAINGQVKLSQSFFNVWTNMFDSITKQIFQKKYEFKDNIQVFARNEDNTSRLELDISDPEQRVIPFAFVDGFGIQLKAVDKNITKEAGNTEPKSPVIQLYEALNKEKDQKQQSKQSPKQLDTKTQLGYLLKLGDNWSKDDYKSLIDDTIINNNYLEASFNSKITVDRLGIPIDLWLFKIWPKFNLEIPMQGSLQLYSSSVIFPYGIYDTSVQDAAKIVKRLNFTDMGFKLNDPKPNFWFVGFKHHHHH 1007 T 0.0075 IFN-gamma pdbpercent F Bacteria T 8aaa 2 B B Stapled peptide ACMFVPCAVRHALGLCAX 18 T 3.4 DUF22 pdbhh F T 8aac 1 A,AA,AB,AC,AD,AE,AF,B,BA,BB,BC,BD,BE,BF,C,CA,CB,CC,CD,CE,CF,D,DA,DB,DC,DD,DE,DF,E,EA,EB,EC,ED,EE,EF,F,FA,FB,FC,FD,FE,FF,G,GA,GB,GC,GD,GE,GF,H,HA,HB,HC,HD,HE,HF,I,IA,IB,IC,ID,IE,IF,J,JA,JB,JC,JD,JE,JF,K,KA,KB,KC,KD,KE,KF,L,LA,LB,LC,LD,LE,LF,M,MA,MB,MC,MD,ME,MF,N,NA,NB,NC,ND,NE,NF,O,OA,OB,OC,OD,OE,OF,P,PA,PB,PC,PD,PE,PF,Q,QA,QB,QC,QD,QE,QF,R,RA,RB,RC,RD,RE,RF,S,SA,SB,SC,SD,SE,SF,T,TA,TB,TC,TD,TE,TF,U,UA,UB,UC,UD,UE,UF,V,VA,VB,VC,VD,VE,VF,W,WA,WB,WC,WD,WE,WF,X,XA,XB,XC,XD,XE,XF,Y,YA,YB,YC,YD,YE,Z,ZA,ZB,ZC,ZD,ZE 1A,JA,RB,ZC,iA,qB,yC,AC,1B,RC,aA,iB,qC,zA,BA,JB,1C,aB,iC,rA,zB,BB,JC,SA,2A,jA,rB,zC,BC,KA,SB,aC,2B,rC,3A,CA,KB,SC,bA,jB,2C,3B,CB,KC,TA,bB,jC,sA,3C,CC,LA,TB,bC,kA,sB,4A,DA,LB,TC,cA,kB,sC,4B,DB,LC,UA,cB,kC,tA,4C,DC,MA,UB,cC,lA,tB,5A,EA,MB,UC,dA,lB,tC,5B,EB,MC,VA,dB,lC,uA,5C,EC,NA,VB,dC,mA,uB,6A,FA,NB,VC,eA,mB,uC,6B,FB,NC,WA,eB,mC,vA,6C,FC,OA,WB,eC,nA,vB,7A,GA,OB,WC,fA,nB,vC,7B,GB,OC,XA,fB,nC,wA,7C,GC,PA,XB,fC,oA,wB,8A,HA,PB,XC,gA,oB,wC,8B,HB,PC,YA,gB,oC,xA,8C,HC,QA,YB,gC,pA,xB,AA,IA,QB,YC,hA,pB,xC,AB,IB,QC,ZA,hB,pC,yA,IC,RA,ZB,hC,qA,yB A0A3S9H6T3_9VIRU C protein MGTFIELVKNMKGYKELLLPMEMVPLPAVVLKHVKLILTSQKEHQPWMTEMALKADQCLIHKATLDLAGKATSNEAKPLIEAMQQIILAMTRELWGQIQRHHYGIVQVEHYVKQITLWQDTPQAFRGDQPKPPSFRSDGPTRGQGSFRPFFRGRGRGRGRGRGSQSPARKGPLPK 175 T 0.0012 API5 unphh T Viruses T 8aca 1 A,B,C A,B,C Q9RWM2_DEIRA DR_0644, only-Cu Superoxide Dismutase MKKLALIALPLVLASCTMAGPTEGTYTLAPQAVVKPAGPVYAPAGTAKISETLGVTRTTITLTGMAPYAIYVAHYHKMGTAAPMGSAPATNTNMAMSSTDATATTTASTSTTSTDTTVAASTDMTTTVTMAPVTAAPNPCNSDGPAIMESRMIAQASADGKVTLTGIVPTALIRDAAYINVHHGRDFSGALADSGVICTPITMTMR 206 T 0.028 LPAM_1 pdbhh F Bacteria T 8ack 2 C,D P,C PCP ERWGHDFIK 9 T 0.19 RPN1_RPN2_N pdbhh F T 8ada 1 A,B A,B Y2667_MYCTU Uncharacterized protein Rv2667 MPEPTPTAYPVRLDELINAIKRVHSDVLDQLSDAVLAAEHLGEIADHLIGHFVDQARRSGASWSDIGKSMGV 72 T 0.047 HTH_AsnC-type unppercent F Bacteria T 8adb 1 A A D6YWY5_WADCW Wc-VDT1 KMPEEEQDSLAAFSRIEANITQYDPLLDNAGKSACTCICLKAAEMLLEASPDQVNAGLIDDILVEGVADYNRFKVGGVVEHTSVENYELNTFELKRLEFRDVDNPFSAEGNPYAGTLDSFAKMMEKASDSKDLPKPVALVMTKSNMTITIVIRPDGKYWLFDPHGTNGKGAYIESCNTDELIKKIKEIFPKTSYPGMTEDENLGFNSFEAYAVRR 215 T 0.00012 Herpes_teg_N pdb F Bacteria T 8adg 6 F F Darobactin 22 WNXTKRW 7 T 33 RNA_capsid pdbhh F T 8adm 2 B P UBP8_HUMAN DEUBIQUITINATING ENZYME 8,UBIQUITIN ISOPEPTIDASE Y,HUBPY,UBIQUITIN THIOESTERASE 8,UBIQUITIN-SPECIFIC-PROCESSING PROTEASE 8 RSYSSPDI 8 T 5.8 DUF3912 pdbhh F Eukaryota T 8adn 1 A,B 3,4 Proteasome Inhibitor 31-Like MDFQDYIQSLKKDFKLVKINDHTYILHKNKKTLELTPDKMYNIQEINDILNVSYPDISYDRNLEDLGSKNKGILNGFGNVGEDDLHPQIGRRKGKKKGAIFSPEEFKEEEDSDGIDKTDIFPLKKKRDPDSDHFKKTGGDDDNPFLY 147 T 0.047 Cap4_SAVED pdbpssm F T 8ae5 2 B,D C,D MCP1A_MACPC Macrocypin-1a MGFEDGFYTILHLAEGQHPNSKIPGGMYASSKDGKDVPVTAEPLGPQSKIRWWIARDPQAGDDMYTITEFRIDNSIPGQWSRSPVETEVPVYLYDRIKAEETGYTCAWRIQPADHGADGVYHIVGNVRIGSTDWADLREEYGEPQVYMKPVPVIPNVYIPRWFILGYEELEHHHHHH 177 T 0.04 Inhibitor_I48 unppercent F Eukaryota T 8af9 1 A,B,C,D,E,F,G,H,I,J,K,L A,B,C,D,E,F,G,H,I,J,K,L Q2YPZ9_BRUA2 NyxB, T4SS effector protein from Brucella MNTQATIDTAAVAPLNFDPNAWHHSQMTTLEAIELSRSGGHPYSSPNVPKGFNTVVGFFFDTYDWYPAAYDDEEGNAMKDRELIQYEDWCAKYARTLGLEVKEVEAPAALKVHGIMALKAYPEALLEIRLIEMP 134 T 0.0056 Glyco_hydro_114 pdb F Bacteria T 8afi 2 C,D,E,F,G,H,I,J F,H,B,J,N,P,D,L C9JNW8_HUMAN Ubiquitin-like-conjugating enzyme ATG3 YSDELEAIIEEDDGDGGWVDTYHG 24 T 0.24 SDA1 unppercent F Eukaryota T 8afz 3 C C MPRI_HUMAN CI MAN-6-P RECEPTOR,CI-MPR,M6PR,300 KDA MANNOSE 6-PHOSPHATE RECEPTOR,MPR 300,INSULIN-LIKE GROWTH FACTOR 2 RECEPTOR,INSULIN-LIKE GROWTH FACTOR II RECEPTOR,IGF-II RECEPTOR,M6P/IGF2 RECEPTOR,M6P/IGF2R SNVSYKYSKVNKEEETDENETEWLMEEIQLP 31 T 12 TMEM154 unphh F Eukaryota T 8aif 1 A,B,C A,B,C A0A164X7F2_BACIU YqxM protein required for localization of TasA to extracellular matrix DKRWDQSDLHISDQTDTKGTVCSPFALFAVLENTGEKLKKSKWKWELHKLENARKPLKDGNVIEKGFVSNQIGDSLYKIETKKKMKPGIYAFKVYKPAGYPANGSTFEWSEPMRLAKCDE 120 T 0.002 Herpes_PAP unp F Bacteria T 8ail 2 E,F,G,H,I,J,K,L O,E,F,J,N,C,K,D A0A0N9SK00_9CAUD Bacillus phage VMY22 p56 MEGFKDSYTLIYVTRDEEGKMFDIKLENQTKEECEIIYGMITDEILIWNMILEGMF 56 T 0.011 DUF5406 unppercent T Viruses T 8aiw 2 B B O25273_HELPY Cag pathogenicity island protein (Cag19) MKCFLSIFSFLTFCGLSLNGTEVVITLEPALKAIQADAQAKQKTAQAELKAIEAQSSAKEKAIQAQIEGELRTQLATMSAMLKGANGVINGVNGMTGGFFAGSDILLGVMEGYSSALSALGGNVKMIVEKQKINTQTEIQNMQIALQKNNEIIKLKMNQQNALLEALKNSFEPSVTLKTQMEMLSQALGSSSDNAQYIAYNTIGIKAFEETLKGFETWLKVAMQKATLIDYNSLTGQALFQSAIYAPALSFFSSMGAPFGIIETFTLAPTKCPYLDGLKISACLMEQVIQNYRMIVALIQNKLSDADFQNIAYLNGINGEIKTLKGSVDLNALIEVAILNAENHLNYIENLEKKADLWEEQLKLERETTARNIASSKVIVK 381 T 0.15 Bin3 pdbpssm F Bacteria T 8aj8 2 B,D,F,H B,D,F,H PI3R6_MOUSE PHOSPHOINOSITIDE 3-KINASE GAMMA ADAPTER PROTEIN OF 87 KDA,P84 PI3K ADAPTER PROTEIN,P84 PIKAP,P87 PI3K ADAPTER PROTEIN,P87PIKAP MESSDVELDFQRSVQAVLRELNTPNPALQSNQGMWRWSLHKKVERNPGKSSILVRILLRELEKAESEDGRRVIIPLLLTLMSVLTKATGIPEDLYHRAYTFCTRLLTLPAPYSTVALDCAIRLKTETAVPGTLYQRTVIAEQNLISELYPYQERVFLFVDPELVSASVCSALLLEIQAAQEQQTPEACMRHVVSHALQAALGEACHTGALNRKLQASSRRVLEYYFHAVVAAIEQVASEDSPSRLGHLEKMEEIYCSLLGPATTRRHCVGDLLQDRLPSIPLPSPYITFHLWTDQEQLWKELVLFLRPRSQLRLSADLDALDLQGFRLDRDLARVSTDSGIERDLPLGSDELPDPSSSEMERAALQRKGGIKKRVWPPDFFMPGSWDGPPGLHRRTGRPSGDGELLPGVSRVHTARVLVLGDDRMLGRLAQAYYRLRKRETKKFCLTPRLSLQLYYIPVLAPQVTGQDPEASRKPELGELASFLGRVDPWYESTVNTLCPAILKLAEMPPYLDTSRTVDPFILDVITYYVRMGTQPIYFQLYKVKIFTSLSHDPTEDIFLTELKVKIQDSKSPKEGSSPRRRGAAEGTGAELSMCYQKALLSHRPREVTVSLRATGLVLKAIPAGDTEVSGFFHCTSPNAASATDCSCLHVSVTEVVKSSNLAGRSFTTSTNTFRTSSIQVQSQDQRLLTLWLDKDGRRTFRDVVRFEVSPCPEPCSRTQKSKTSALNSHGQETEKNMAKPNSLLMPINTFSGIIQ 756 T 1.3999999999999997E-73 PI3K_1B_p101 pdbpssm F Eukaryota T 8ajm 2 B B DCA12_HUMAN CENTROSOME-RELATED PROTEIN TCC52,TESTIS CANCER CENTROSOME-RELATED PROTEIN,WD REPEAT-CONTAINING PROTEIN 40A MDWSHPQFEKSAVDENLYFQGGGRMARKVVSRKRKAPASPGAGSDAQGPQFGWDHSLHKRKRLPPVKRSLVYYLKNREVRLQNETSYSRVLHGYAAQQLPSLLKEREFHLGTLNKVFASQWLNHRQVVCGTKCNTLFVVDVQTSQITKIPILKDREPGGVTQQGCGIHAIELNPSRTLLATGGDNPNSLAIYRLPTLDPVCVGDDGHKDWIFSIAWISDTMAVSGSRDGSMGLWEVTDDVLTKSDARHNVSRVPVYAHITHKALKDIPKEDTNPDNCKVRALAFNNKNKELGAVSLDGYFHLWKAENTLSKLLSTKLPYCRENVCLAYGSEWSVYAVGSQAHVSFLDPRQPSYNVKSVCSRERGSGIRSVSFYEHIITVGTGQGSLLFYDIRAQRFLEERLSACYGSKPRLAGENLKLTTGKGWLNHDETWRNYFSDIDFFPNAVYTHCYDSSGTKLFVAGGPLPSGLHGNYAGLWS 477 T 0.0099 WD40_like unppssm F Eukaryota T 8ajy 1 A,C A,C A0AEF6_RUMFL Cell-wall anchoring protein MLTDRGMTYDLDPKDGSSAATKPVLEVTKKVFDTAADAAGQTVTVEFKVSGAEGKYATTGYHIYWDERLEVVATKTGAYAKKGAALEDSSLAKAENNGNGVFVASGADDDFGADGVMWTVELKVPADAKAGDVYPIDVAYQWDPSKGDLFTDNKDSAQGKLMQAYFFTQGIKSSSNPSTDEYLVKANATYADGYIAIKAGEPE 203 T 0.00027 Cohesin pdb F Bacteria T 8ako 2 B B ESPK_MYCTU ESX-1 secretion-associated protein EspK GDALRLARRIAAALNASDNNAGDYGFFWITAVTTDGSIVVANSYGLAYIPDGMELPNKVYLASADHAIPVDEIARCATYPVLAVQAWAAFHDMTLRAVIGTAEQLASSDPGVAKIVLEPDDIPESGKMTGRSRLEVVDPSAAAQLADTTDQRLLDLLPPAPVDVNPPGDERHMLWFELMKPMTSTATGREAAHLRAFRAYAAHSQEIALHQAHTATDAAVQRVAVADWLYWQYVTGLLDRALAAAC 246 T 0.0019 DUF5632 pdbpssm F Bacteria T 8akp 1 A A catalytic domain of G7048 AVSKGFNYGATKADGSSKYQADFKKDFAAAKALVEGGSGFTSARLYTMIQGGTTNTPIEAIPAAIEEKTELLLGLWASGGNMDNEIAALKSAISQYGDDFANLVVGISVGSEDMYRNSVTGSKSNAGPGVEPEELVSYIQQVRSTIAGTGLSDASIGHVDTWDSWTNSSNSDVVNHLDWLGFDGYPYYQLTMENGIENAKKLFDESVEKTKSVANGKEVWITETGWPVTGPQEGDATASPANAKTYWDEVGCPLFGNTNTWWYMLEDEGASPSFGVVKSDLKTPQFDLSC 290 T 0.0016 Glyco_hydro_17 pdb F T 8amo 1 A A CP143_MYCTU Putative cytochrome P450 143 MHHHHHHTTPGEDHAGSFYLPRLEYSTLPMAVDRGVGWKTLRDAGPVVFMNGWYYLTRREDVLAALRNPKVFSSRKALQPPGNPLPVVPLAFDPPEHTRYRRILQPYFSPAALSKALPSLRRHTVAMIDAIAGRGECEAMADLANLFPFQLFLVLYGLPLEDRDRLIGWKDAVIAMSDRPHPTEADVAAARELLEYLTAMVAERRRNPGPDVLSQVQIGEDPLSEIEVLGLSHLLILAGLDTVTAAVGFSLLELARRPQLRAMLRDNPKQIRVFIEEIVRLEPSAPVAPRVTTEPVTVGGMTLPAGSPVRLCMAAVNRDGSDAMSTDELVMDGKVHRHWGFGGGPHRCLGSHLARLELTLLVGEWLNQIPDFELAPDYAPEIRFPSKSFALKNLPLRWS 399 T 5.8E-32 p450 unppercent F Bacteria T 8aop 1 A,B,C A,B,C A4TVL0_9PROT CULT DOMAIN-CONTAINING PROTEIN MPLDAGGQNSTQMVLAPGASIFRCRQCGQTISRRDWLLPMGGDHEHVVFNPAGMIFRVWCFSLAQGLRLIGAPSGEFSWFKGYDWTIALCGQCGSHLGWHYEGGSQPQTFFGLIKDRLAEGPAD 124 F F Bacteria T 8ap6 1 A,LB A,a ATP6_TRYBB F-ATPASE PROTEIN 6 MFLFFFCDLFWLRLLLCMYYCVWSRLCFIVYFNCLMLIFDFLLFCLFDLYLFVGLCLFLLLWFMLFNLYSLILYYCITYLNLYLLFCIVFLLYIAFLFLFCFLCDFFLFNNLLVGDSFMDVFFIRFLLCFLECFSLLCRCLSTFLRLFCNLLSSHFLLLMFFDFFYFIFVFFFYGVFCYWFILFIFVFCFCLLFYVFLYLLDLFAAILQLFIFCNMILQLIMDFLLFLLFV 231 T 0.43 ATP-synt_A unppssm F Eukaryota T 8ap6 3 F,MB C,c Q585K5_TRYB2 subunit-8 MLRRLGANVSNMARPMNKYAVTVSPRRHLEPMSTWYLASWAMVWYYAFFFWMPMVWTDIMVPSFVYNKLPVIHFLQEKRAEQKLRRVLDETYTEWTEELDQAHVTDAITRSLNI 114 T 10 bVLRF1 pdbhh F Eukaryota T 8ap6 4 I,NB D,d Q57ZW9_TRYB2 subunit-d MRRVSSPNITIQSVRWISGVSPLLYFPPTTTSTTNREDQINKNTNIAIQMIKRYKGEVPPHYTRKSSATIEQVEKEIDALLGGAEKLRKTSTDDQPMDKLTLMERCLRHALWSYHKEEGRYDFDQIGRWVVYTPEDEVKLAQLKREVEAKEKLAALRKRREEEGLPGGPVPRINWPQEYSSFIDREPVVAKRIRYDTLASTTLERDEKQIESTLQQYRRASQDKRLDDLVDLLERFKPVLAREAIMQRLTIKHLEGQLGVWRYMDWCPEVRDRAELEVDITGWQWWSPLEERRLLPVRLRSVNEVREIMSKTQAKKSAEAAERNPIVTQTSTGDNARDRLLKEVLALQARINQRDEVEPSQTEQKKKAHH 370 T 0.0093 SAM_Exu pdbpercent F Eukaryota T 8ap6 6 L,OB E,e Q38CI8_TRYB2 ATPTB1 XQGSWSVLKKNCSNFFPGLLAFAQQTQEAYGIWLRIYNRQQKYGPTDFVEQSETFSPDYHKRFHSQDKNMWVDKELCTEVSQKEVARLMTYKLDMWRMAHCAGALLATGGYAIPFGLFWLANDTWVPSSFNLTGEELRAWREAQDLYRYRSAPSYLTDTKWHFDFHAYPWNETQERAWDDLFEKNDVRRDPKVVRPAAEMYDGFIKFELIRRKSLRHLCRSMNIPTFPMLARLCNGTRVRDYWNLAWCEDYMVITQRLHESMTDEELYDYAWRRYLAPYDKNLNREQLMERVEDYFEFLGPDFVAHGKAPNLVILTNYVLGYYNDPAYLEGDISELDKNDYDHLASWGKDAFLRRLEFENGPLRDQVEAHTQRLLAERAAIAKGDNAAAVEGRHTA 396 F F Eukaryota T 8ap6 7 O,PB F,f Q57ZE2_TRYB2 subunit-f MVLFSTYRSSRLVSKEFLHGPVMRFRALGEYYFQRAWNGTLNWALPGEYRLYAVMIPFIYFYHRWHNDHTLDRDHVEKAMIMRWGGTLEDVRKLSAKDQLRVRCFTDIEKLYSAYGPKDTYLQPPGDTLPGKDFYRKAGGAQAHH 145 T 0.39 DUF3094 pdbhh F Eukaryota T 8ap6 8 QB,R g,G A0A3L6KRX7_9TRYP ATPTB3 MSKQLTFISAGATAAVLQSASAIVSKVAGGRVQTKTAKEAGRHAVVVGPETPIGVHTAVTEVPKSAQDPLFSGVSTVVVRAVLPRAAPDSVQLRDALDVYASAGIDTKEEVRSATEAFKKSAEVAVGKAKAKGVKRIVLVVKQASKHNCINELFKKISTETIESAGLTTEVVGTAAVANQLIVNPESLGVVLLNDVAATEQIELAFAGVVGGVSRVYHTVEGGKISAGHSFKSVALAVAQELRELGLSSEADKVEAAASKNPRAVVSAL 269 F F Eukaryota T 8ap6 10 RB,U h,H Q389Z3_TRYB2 ATPTB4 MRRTFISFSAASAAAAAPVTSTKMQTLHKLLTGEVSFKNKAPVKDCNIVHQFGENWATELSAYAKTLPAEQQKIIVRQIARVKLTRYTVAELAAYCGDGPALLDETARAANIEQGVAFVKAKGVEAFEKYVAEESTNANWKPEEAKKFIEDVKAKAK 157 T 0.042 TAT_signal pdb F Eukaryota T 8ap6 12 SB,X i,I Q57ZM4_TRYB2 subunit-i/j MVYTRWKCDRLPVFQLKLFTQEYPMHAAVGIFTIIFLWKHMSHCSEETERKYGWWAGYPYWRDPIARRNETKYKQMIINNDVDITHPKWTGCSVEQLEELSRVV 104 T 0.18 FPN1 pdb F Eukaryota T 8ap6 14 AA,TB J,j D0A5R7_TRYB9 ATPTB6 MTKYELKMQYFDEWMIRWRKFQTESDWEIEKGRQWWRRFNMAVSGALFCGLVLYTSGTATLKRQYGLPHFFDIGVDGQAKETMLKTLTSRWRYTPQGYGRVLITGVPTYILFVTLEHYRERRRMQQYLQQNTVFGEQMRRLLSTGKIEEYLPVNIKATLPASQQAIYNY 169 T 0.14 GT87 pdbpssm F Eukaryota T 8ap6 15 BA,CA,EA,FA,HA,IA J1,J2,K1,K2,L1,L2 ATP18_TRYBB ATP SYNTHASE F1 SUBUNIT P18 MMRRVYSPVFCSVAAARFAATSAAKKYDLFGYEVDTNTAPWIEKIKKCKYYDEAGEVLVNMNVSNCPPDIATYNATLQCIYQSPSKQSTPVDNESKFCAMMDLLEEMQHRNRLKPNEESWTWVMKECVKSGQFRLGYCIQQVMETECKGCPADLVKANEANAQKAKTEGKEHPGHLSQQAGLFDVKVE 188 T 1.1E-05 RPM2 pdbhh F Eukaryota T 8ap6 16 DA,UB K,k Q57VT0_TRYB2 subunit-k MLRRSSAALIRRTPVRHSGGELFVRPKLEEIPPADQCRGFFGPLNDSLKFLRLLDIKWMMNRAVAMRREYLIATPTLFTFIWMFTWKGAVIYFWGDRAPPRRMDWNTEETGRLPLGFKPTPAPL 124 T 0.28 DUF1048 pdbpercent F Eukaryota T 8ap6 17 GA,VB L,l Q387J1_TRYB2 subunit-e MSAKAAPKTLHQVRNVAYFFAAWLGVQKGYIEKSANDRLWVEHQRKVRQQNVERQQALDSIKLMQQGVRATTPGQLEGVPAELQQLAEAFTK 92 T 0.051 DUF3708 pdb F Eukaryota T 8ap6 18 JA,WB M,m C9ZJA0_TRYB9 subunit-g MSSTKCAVACKIMTPLCNAASKVQARSAKKLAALTDAGIQKTISEHNANGTDAAVSSTKRYLAEQRQLFHYRVVRFFDECHYIISGEYFAQYTKVNLIWDLRFLTKLVVLFLIGTVLGRQSIFPPIDPDSPLVEALVTKVNPNY 144 T 0.076 CbtA pdbpercent F Eukaryota T 8ap6 20 MA,XB N,n Q582T1_TRYB2 ATPTB11 MLRKTPLFAMATTRKALVGNGPTFSTGGECMNTCDIQNAFPMNDRGVRSSSPFQEPNTAIYDSYLAWTYFQPMDVHIEKLPAPEAKYYQRHTKKPWDVSSTELTEIQSRKKYFQTLGYLVAFIYLYFLMPKEKSFSGLSGPDGHWIMLPKGRPELF 156 T 0.091 FCP1_C pdb F Eukaryota T 8ap6 21 NA,YB O,o Q57Z84_TRYB2 ATPTB12 MSSGFHFHDVSNDAIKGMPPSEALHKHLENAQLAHRICLAKALKAGEPPVEKCALTWGEVLIRYQAWSEYRPPFQDSVAQAKYKKYWSKKRQEEDDKNPFK 101 T 6.5 DUF1539 pdbhh F Eukaryota T 8ap6 23 QA,ZB P,p C9ZLR9_TRYB9 subunit-b MLRRLVPRVMMAPMGGATALCTSRGYNMLVFRDPKRRPQLSEEERAKVVVNQAEWPEEFKDFDPDDPYKNSPEIIKGMSSWNLFLWGVECAFIYQFYELVFPKSI 105 T 0.25 Mus7 unp F Eukaryota T 8ap6 24 AC,TA q,Q Q583U4_TRYB2 ATPEG3 MTENIEAVMSDFWSNPADHFRPNLKALTLYAERQHYVDRWLHVKERWLAPWYLPWWSPLFQLGTWYSQRSRNLFLVENHLSYRPYKFRRNDEDRNNPY 98 T 32 CbiN pdbhh F Eukaryota T 8ap6 25 BC,WA r,R ATPEG4 MLLGGFVPRRFSQFNRDPCWMFFIFSVGFWLGEYPAMMIKYNARDLVYDPHRYVWSHHDDHH 62 T 0.029 DUF5357 pdb F T 8apn 46 TA AL mL116 NTGVPGPRPEVAQKLSTEYQGHILRMISLAESASELDEVLWSSKKHLRPVHIARSCLKLEYLRTKEKGREVSEPIKNLASELENYVELYSTKFTIGQVSQLVRGLSSIRRNIQPDLLLKLAAVVVADDGRQVQLANEMDCRDLFFGFFSQGFDNELFWKRLSESVLPRLPYFNADVVSTVLRVVSGLRFLHNTEFAHATMTALVPKVGDLSPARLADAFFSASLLDPTDVSGLNAKLEERFLREFTSFPIKDTVTMFQTVTVRRHSTPELAAQVAPLVAAQAHQLPVRHLRRALEGMVTAGWKDTAEIPLYAILAKQAARLVLTPVQLLRQLARIFANTGLKAGPGANQPLAPYFAALQRELEGRLAELDEQVTDDFAESFKKVGIAEGARVQI 394 T 0.24 TadB_TadC_N pdb F T 8apn 47 UA AM mL116 TIFNTGVPGPRPEVAQKLSTEYQGHILRMISLAESASELDEVLWSSKKHLRPVHIARSCLKLEYLRTKEKGREVSEPIKNLASELENYVELYSTKFTIGQVSQLVRGLSSIRRNIQPDLLLKLAAVVVADDGRQVQLANEMDCRDLFFGFFSQGFDNELFWKRLSESVLPRLPYFNADVVSTVLRVVSGLRFLHNTEFAHATMTALVPKVGDLSPARLADAFFSASLLDPTDVSGLNAKLEERFLREFTSFPIKDTVTMFQTVTVRRHSTPELAAQVAPLVAAQAHQLPVRHLRRALEGMVTAGWKDTAEIPLYAILAKQAARLVLGKQSAATSAILGKHVDNQGYQRTPVQLLRQLARIFANTGLKAGPGANQPLAPYFAALQRELEGRLAELDEQVTDDFAESFKKVGIAEGARVQI 419 T 9.8 FAST_1 pdbhh F T 8apn 110 FD Ub mL105 RDEIIKLLESRKDMDVNGYVMYCREELGKLTVPRPRAPPVSPKHEDYKTFVDEERVTYMRMKQHEKISLFLTEEEKNTVTTKGKDILDDKRFIQTIASRTGFYIAEEVRDCLSEFFNFRDSSRRLLTYYA 130 T 0.025 DUF5863 pdb F T 8aqm 2 C,D C,D NCOR2_HUMAN N-COR2,CTG REPEAT PROTEIN 26,SMAP270,SILENCING MEDIATOR OF RETINOIC ACID AND THYROID HORMONE RECEPTOR,SMRT,T3 RECEPTOR-ASSOCIATING FACTOR,TRAC,THYROID-,RETINOIC-ACID-RECEPTOR-ASSOCIATED COREPRESSOR HASTNMGLEAIIRKALMGKYDQW 23 T 15 PPS_PS pdbhh F Eukaryota T 8ar0 1 A A TLR2_HUMAN TOLL/INTERLEUKIN-1 RECEPTOR-LIKE PROTEIN 4 MSVSECHRTALVSGMCCALFLLILLTGVLCHRFHGLWYMKMMWAWLQAKR 50 T 0.00085 LRRCT unp F Eukaryota T 8ar2 1 A A TLR5_HUMAN TOLL/INTERLEUKIN-1 RECEPTOR-LIKE PROTEIN 3 MEEVLKSLKFSLFIVCTVTLTLFLMTILTVTKFRGFCFICYKTAQRLVFK 50 T 0.13 CoV_E pdb F Eukaryota T 8ar3 1 A A TLR9_HUMAN Toll-like receptor 9 MEALSWDCFALSLLAVALGLGVPMLHHLCGWDLWYCFHLCLAWLPWRGRQ 50 T 0.0096 Tlr3_TMD pdbhh F Eukaryota T 8as2 2 B B C-C chemokine receptor type 5 APERASSVYTRSTGEQEISVGL 22 T 150 Pardaxin pdbhh F T 8as3 2 B B C-C chemokine receptor type 5 APERASSVYTRSTGEQEISVG 21 T 120 Pardaxin pdbhh F T 8at2 1 A B A0A8J1L9M8_XENLA LOC495502 PROTEIN MDEKSTKIIMWLKKMFGDKPLPPYEVNTRTMEILYQLAEWNEARDKDLSLVTEDLKLKSAEVKAEAKYLQDLLTEGLGPSYTNLSRMGNNYLNQIVDSCLALELKNSSLSSYIPAVNDLSSELVAIELNNQEMEAELTSLRKKLTEALVLEKSLERDLKKAEEQCNFEKAKVEIRSQNMKKLKDKSEEYKYKIHAAKDQLSSAGMEEPLTHRSLVSLSETLTELKAQSMAAKEKLNSYLDLAPNPSLVKVKIEEAKRELKATEVELTTKVNMMEFVVPEPSKRRLK 286 T 0.0091 ATG16 pdbpssm F Eukaryota T 8at3 7 G G B1H1T5_XENLA LOC100158301 PROTEIN MTGGKELGAAVELYERLQMLSCPCLEGVYLTDPQSIYELLCTPSSHRLDILQWLCSRIYPPVQEQLSSLKESQTDTKVKEIAKLCFDLMLCHFDDLDLIRGHASPFKQISFIGQLLDVIQYPDTISSNVILESLSHSTEKNVVTCIRENEELLKELFSSPHFQATLSPECNPWPADFKPLLNAEESLQKRATQSSKGKDMSNSVEALLEISSSLKALKEECVDLCSSVTDGDKVIQSLRLALTDFHQLTIAFNQIYANEFQEHCGHPAPHMSPMGPFFQFVHQSLSTCFKELESIAQFTETSENIVDVVRERHQSKEKWAGSTISTLCEKMKELRQSYEAFQQSSLQD 348 T 0.1 L27 pdbpercent F Eukaryota T 8au0 1 A,B,C A,B,C SUN1_HUMAN PROTEIN UNC-84 HOMOLOG A,SAD1/UNC-84 PROTEIN-LIKE 1 GSMSGVEQQVASLSGQCHHHGENLRELTTLLQKLQARVDQME 42 T 0.00057 Trimer_CC pdb F Eukaryota T 8au1 1 A,B,C,D,E,F,G,H,I,J,K,L,M,N,O,P,Q,R A,B,C,D,E,F,G,H,I,J,K,L,M,N,O,P,Q,R A0A7U0GB71_9CAUD Putative tail sheath protein MSEQITGSTPRIYYRGTKDSSVTRSTGSTTTLPLHRPLIMFFGQKGPTVPTWIDPVKFEDIYGSETTNLSGVYCTHSTPFIKEAIAAGNQFMALRLEPSDIPDVATLGLSVDWVKTKIDDYERNDDGTYKLDTNGDKIPLATQIDGIKFRFVLEKIETNESGVSQYKKRTAKAGTIGTEATPSTITPLADFRCRFKSSLGANTALRIWAPTINSAQAADADLQARIKSFLYRFQILTRADKASSPTIFETIYNEPSLSVGFGENLVDPQTEVVYDFVERIDSRYNDEDPSTYLMSPLDTPYLYQANIDSVLTAIQELEAPFDTVSADEDDLYQINLFGAQTVEGVPYHAVQILGVLDGGVTLTETATNYLQGGGDGTLGNDSFNAAAYAVLSNLSNNAAFNITNYARYPFNAFWDSGFDLKTKQTIPQLIGLRADTWIALSTQDISSDFNSNEEEESIALSLMSRVSAFPDSSDFGTPAFRGMIVGGAGYYTETTRKLPVPLTLDRFRAYCRYAGASDGVLKPEYAVDEGDARKVQVVKSINNLDKSWRVRRAQWNNNLVYVEDYDTNSQFYPGQQSFYSEQGSVLKAAIVGLCVANLNRFAFEAWRDLTGTQKLTDDQLIERSDDAVSTRGTGAFDDRLIFTPHSEITQADKERGYSWSMRIDFGANAFRTVMDMSSVAYTREELANG 689 T 0.17 XFP_N pdbpercent T Viruses T 8auv 28 BA Z A0A1S3Y4M0_TOBAC 40S RIBOSOMAL PROTEIN S11-LIKE MAAEGRTLSTKEADIQMMLAAEVHLGTKNCDFQMERYVFKRRNDGIYIINLGKTWEKLQMAARVIVAIENPKDIIVQSARPYGQRAVLKFAQYTGANAIAGRHTPGTFTNQLQTSYSEPRLLILTDPRTDHQPIKEAALGNIPTIAFCDTDSPMRYVDIGIPANNKGKHSIGVLFWLLARMVLQMRGSINQGHKWDVMVDLFFYREPEEAKEQQEEEAPAIDYADYSAGGDWSSSQIPEAQWTGDAAPSGPVVASGWSGEGVAEGGGWDTAAAPVPVPVSDAAPTAGGGWDTAAAPVPVPVSDAAPTAGGGWDTAAAPVPVPVSDAAPTAGATGWE 336 T 3E-13 Ribosomal_S2 pdb F Eukaryota T 8av0 2 B P RAF1_HUMAN PROTO-ONCOGENE C-RAF,CRAF,RAF-1 RSTSTPNVA 9 T 56 NB pdbhh F Eukaryota T 8b0f 7 G G CD59_HUMAN 1F5 ANTIGEN,20 KDA HOMOLOGOUS RESTRICTION FACTOR,HRF-20,HRF20,MAC-INHIBITORY PROTEIN,MAC-IP,MEM43 ANTIGEN,MEMBRANE ATTACK COMPLEX INHIBITION FACTOR,MACIF,MEMBRANE INHIBITOR OF REACTIVE LYSIS,MIRL,PROTECTIN MGIQGGSVLFGLLLVLAVFCHSGHSLQCYNCPNPTADCKTAVNCSSDFDACLITKAGLQVYNKCWKFEHCNFNDVTTRLRENELTYYCCKKDLCNFNEQLENGGTSLSEKTVLLLVTPFL 120 T 0.00021 UPAR_LY6 pdb F Eukaryota T 8b0h 1 A G CD59_HUMAN 1F5 ANTIGEN,20 KDA HOMOLOGOUS RESTRICTION FACTOR,HRF-20,HRF20,MAC-INHIBITORY PROTEIN,MAC-IP,MEM43 ANTIGEN,MEMBRANE ATTACK COMPLEX INHIBITION FACTOR,MACIF,MEMBRANE INHIBITOR OF REACTIVE LYSIS,MIRL,PROTECTIN MGIQGGSVLFGLLLVLAVFCHSGHMLQCYNCPNPTADCKTAVNCSSDFDACLITKAGLQVYNKCWKFEHCNFNDVTTRLRENELTYYCCKKDLCNFNEQLENGGTSLSEKTVLLLVTPFLAAAWSLHP 128 T 0.00025 UPAR_LY6 pdb F Eukaryota T 8b0u 2 B,D D,C B2V8L8_SULSY CalpT10 STSQKATYTDDFVLYRGDDFIEIIIDEKYLNKKVKILLDNDTIFNGILKDTSIFIPVKEQIDLEELAKHISILPEG 76 T 0.0095 LSM pdb F Bacteria T 8b14 2 B B RBP5_BPT5 pb5 bacteriophage T5 receptor binding protein MSFFAGKLNNKSILSLRRGSGGDTNQHINPDSQTIFHSDMSHVIITETHSTGLRLDQGAGDYYWSEMPSRVTQLHNNDPNRVVLTEIEFSDGSRHMLSGMSMGVGAKAYGIINPQIMSQGGLKTQITASADLSLDVGYFNTGTSGTIPQKLRDGTGCQHMFGAFSGRRGFASSAMYLGGAALYKSAWSGSGYVVADAGTLTIPSDYVRHPGARNFGFNAIYVRGRSCNRVLYGMEGPNYTTGGAVQGASSSGALNFTYNPSNPESPKYSVGFARADPTNYAYWESMGDPNDSANGPIGIYSEHLGIYPSKITWYVTNLVYNGSGYNIDGGLFNGNDIKLSPREFIIKGVNVNNTSWKFINFIEKNFNVGNRADFRDVGCNLSKDSPSTGISGIATFGLPTTESNNAPSIKGGNVGGLHANVVSIYNFLPSASWYVSSNPPKIGNNYGDVWSENLLPLRLLGGSGSTILSGNIVFQGNGSVHVGTVGLDLNSSRNGAIVCTMEFIDDTWLSAGGIGCFNPTEMLSQGAEYGDSRFRIGGNTINKKLHQILSLPAGEYVPFFTIKGTVVNACKLQAAAYNPTPYWVSGLPGSVGQTGYYTLTYYMRNDGNNNISIWLDSSMSNIIGMKACLPNIKLIIQRLT 640 T 0.18 NPM1-C unppercent T Viruses T 8b1r 4 D,E P,Q GP59_BPT7 GENE PRODUCT 5.9,GP5.9 MSRDLVTIPRDVWNDIQGYIDSLERENDSLKNQLMEADEYVAELEEKLNGTS 52 T 0.0021 Phage_GP20 pdbpssm T Viruses T 8b1x 1 A A P3-7_2 KKPGASLAALQALQALQAAQAAKKY 25 T 5.4 Asr pdbhh F T 8b2e 1 A A Muramidase LVLPGLDALQTRNALAIIAEAKKENVGPHGCQAAITTGLTESSLRILANNAVPPSLQYPHDGLGSDHDSIGIFQQRASIYKDIRCDMDAACSASQFFKVMKGVSGWQTLDVATLCQRVQKSAYPAAYQKFTALAVGVCKAGGL 143 T 66 Lys pdbhh F T 8b2k 2 B B RND3_HUMAN PROTEIN MEMB,RHO FAMILY GTPASE 3,RHO-RELATED GTP-BINDING PROTEIN RHO8,RND3 TDLRKDKAKSCTVM 14 T 25 DUF3012 pdbhh F Eukaryota T 8b2m 1 A A A0A1D3UV35_TANFO Tannerella forsythia Potempin A (PotA) MKQQIILWIGVLLLLIGGVGCKKDQSSCCDKEIIKDVSELTGIISYNTEVKRWYISVSDANSYDNVTLYFPCNLDSKYMKEKEKVIFSGQISKSTLKITLPAGTTSYCINLMSINKIN 118 T 0.00053 DUF4971 pdbhh F Bacteria T 8b2n 2 B,D B,D G8UM88_TANFA Tannerella forsythia potempin A (PotA) DQSSCCDKEIIKDVSELTGIISYNTEVKRWYISVSDANSYDNVTLYFPCNLDSKYMKEKEKVIFSGQISKSTLKITLPAGTTSYCINLMSINKIN 95 T 0.0008 DUF4971 unphh F Bacteria T 8b2q 2 B I G8UM88_TANFA Tannerella forsythia potempin A (PotA) QSSCCDKEIIKDVSELTGIISYNTEVKRWYISVSDANSYDNVTLYFPCNLDSKYMKEKEKVIFSGQISKSTLKITLPAGTTSYCINLMSINKIN 94 T 0.002 DUF4999 unphh F Bacteria T 8b2r 2 B B A0A219T3Y8_MAGOR AVR-PIK PROTEIN ( PIKMPROTEIN,PIKP PROTEIN ),AVRPI7 PROTEIN GPMRAIDLSRERDPNFFDNADIPVPECFWFMFKNNVRQDAGTCYSSWKMDKKVGPNWVHIKSDDNCNLSGDFPPGWIVLGKKRPGF 86 T 0.1 TMEM18 unp F Eukaryota T 8b2x 1 A M B8GLG4_THISH Type I-G CRISPR Cascade large subunit CSX17 MDKDMHINEIVLRGCAPTPLAAYLKALGVLRLVCEQVDATAKGWWQDECFMLRTRLDDNDLRRFFIEDYRPTPMLSPWNGGSGFYRKGNETAWSTLEKIITTQAERWRPFRDTAEVMADALEHLKLTEKPAELDKRALLARLRATLDDEFLPWLDAAVLLTDDKPDYPPLLGTGGNDGRLDFTSNYMQRLLEMFDPVTGKAQGDVGNKLESALFARPVPGMTALAIGQFSPGAAGGPNSSTGFDSGAQVNIWDYVLMLEGALLFAATATRRLESADPSALSYPFTVRPSGGGSGAVALGDERPARAEIWMPLWERPASLPELRVLLGEGRVTLNGRLPRDGLDFARAVAKLGTDRGVRAFQRYAFMMRSGKAYLATPLNRFHVHRNPKADLIDQLERGDWLRRFRRAARSTHAPARLQGLAHRLDDALFDLVRVADPRRVQEVLKVLGEVQFYLALSPSLREQVRPVPRLDAHWVEAARDDSHEFRVAAALAGLDDGLPMGVHLAPIDPVKRNVWAPESRLAVWGQGNLSDNLAQVLQRRLLTASRTDLNDKPLSGRCPADEGAVAAFLAGDADERRIAELMAGLACARLPARLPLRQRGASEASSLPMIYALLKPLFVPDAQLREAAVLTPDGCLPLPPALPRLLRAGPAGVGRAVDLARRRRRASGLADAGWRLTPPYPDGGRLLAALMIPVEIRVIKGFIKRLADHKSDEPATQDAS 720 T 0.16 DUF2795 pdbpercent F Bacteria T 8b3p 2 B,IA,M,TA,X FFF,III,GGG,JJJ,HHH G9P_BPF1 COAT PROTEIN C,POLYPEPTIDE II,G9P MSVLVYSFASFVLGWCLRSGITYFTRLMETSS 32 T 0.35 LapA_dom pdbhh T Viruses T 8b48 1 A,B,C,D A,B,C,D A0A6G1IIU9_9PLEO Carbohydrate esterase family 15 protein QAPSCPNLPASINYAANPKLPDPFLALSGTRLSKKDQWPCRKEEIRQLFQRYSYGTFPPRPESVTAAMSGNALKITVSEGSKSMSFSVNIKLPSSGAAPYPAIIAYGSASLPIPNTVATITYQNFEMAADNGRGKGKFYEFYGSNHNAGGMIAAAWGVDRIIDALEMTPAAKIDPKRVGVTGCSRNGKGSMIAGAFVDRIALALPQEGGQSAAGCWRIADEIQKNGTKVETAHQIVNGDSWFSTDFSKYVDTVPTLPWDNHMLHALYAYPPRGLLIIENTAIDYLGPTSNYHCATAGRKVHEALGVKDYFGFSQNSHSDHCGFPKAQQPELTAFIERFLLAKDTKTDVWKTDGKFTIDERRWIDWAVPSLSGLEQKLISEEDLNSAVDHHHHHH 394 F F Eukaryota T 8b4z 1 A,B A,B D0FZL0_RMBV1 Major capsid protein A MSGDNGVYSGSAAYNTATAPKVPVSRATFFQNTKSKDFDFKFADGADAIANVLQQMEHGVAQHQLGDMNVRTDGLATVSAVLNGRKRKIANQYMMHFDLFGRAARSTVRMESRIQSFGEGKDVDNFMAKFHNQLSGVYERRSEGVANFGRILATDTDLGGTSGLSVVFNGLLRGLHHVSTVPTPNVANLPIRNNRDGAGAVVGRGDMPGREFMDSSRILPPRSSRWYGAPGQPIVPPAPNNPPAHVAPMETVMAGLQKTVMNELNRVIVSIADVPKLPAHRIRNLIAVLAAVSKPNLGFDANRLEDHSCFTKGWLGFNDILLFPLTVDLFDRVVANEAGVNDAGFIVPNAAPPQFLQNTNQQVIDFRGVGVGQAGDIPALRLAQSWSDAIGFLLDTIGGEAQLAMGLNDMVAQCFHMHGAQTTMLSTPIISRADFGVYHNVVTNMYRRLAYMYTRLIRTNAAAGGGAMLDRQHYQWPTHAKVGFHDDTAVNAAAAAARIHDGLRQPLLDEAFGAGVVQPGNMDLVGAGIDFTRDLTSSLGKAYPEHRPIGADDNKRDLGDFTAGTVDAAASGYEWDNYVYRLFGNMSAMRSKAEFDRLLATFPSSTLSELFIWMGNVGFADTWEERWGYDAAPLCSIPIPAGHDRSMLRNWSWVNVHNVHSVTGTSENVVLAGYVGLSRTHDYIMDTRSTPATSQGRRLAAMFYYTNADKMLSLTFGLAGQLRAAADTTVAKFQICPHTIARAQGYIMTDNDPLSDELKGTDFVTEQFSLAGLTNLYLGYFDGLATRLGIYDLRYTYSEYAECRVELHGIQRNFLTDRLDAFVSYKCLHPIMFEYYMCGANISGGILNGDKAYEQVEMGNIRAYDAMFDTSAARDFNFVGVRGASQQIAAVGGFHIQYKMEVEIQRPGDGTEASRFNVYERYLNNYLRMSDCAPTSVLNAVSPLFWMAGTTRVVLCEAANGYKPMAYDISQTSFWNRENGLWAFTWGESEKTHRPNAIPHGTRRLGNSEVLMNSRFSKILDKKGITKLETRVGGRKRGDNNDDFVAADTRMFIIQDVAGGEHAAYSSLRDPGFALVRAAHTWDTFVQNPRMLLLERGYGNTGFTDTYSAAGIRRTNGHISLRLSALTDDFEFTMHPLARAEYKETSRVSLTSMIYVGTAGKDLSLPTGTVEDIIGAVDGMRRVVRTIGGQTIKTAPVVPPTEQRDMVQEERVGTPVKNAGNANPAADSDNATEGVVEPKN 1240 T 0.31 NTP_transf_3 pdb T Viruses T 8b59 1 A,B,C D,E,C D0FZL2_RMBV1 RnMBV1 Crown protein MGITYRDAQIFSACVEALSARNNRITLTSFPLTAGQGQAPTATPAWYPVDLFVADATAVYGRRQLFAWTVDKVRPTRNVAFVTDRVAMDFSAALLSLMAELEAVAPDVYAAIHGGATPGADLGDRITQLENRRVGCLAYVMATVVRAPITHNVRSFSAMLASDPQAHAALLAYLTPNSAGQLDGAPIYFRRSDVDLRNNHLALHAEVVPGLPNMVPLTKAMVEVALANVEWWSDPLGYDSLTSFGGLELLSLCDALAVCELSVAYGLKESGYCYLRFAGGCPLAEVILARLGYNPPLGVAVGWALYNGIKLDWYSKVISVGHNMRLHVCDTAGEANACLIDVLTGEYDGMPVGGVDTVSCWVEQLDLLAAAAGVGRNLSNLHCGVQTPPRTINTTRRRLLASLVRTLIADPTLTDEELLHGAVRGTLNGLPRDRALWRCLQVVNTTVREFLAQDLDAMVRDRRECTTYASRAAFAERCAMSGNASGLVGRQYSDMPAALEGEARACGLSAIDAIEIVRVVASGEPIRVLLDQHGRPATRPNGRLTADELRRCRPLVVGQGGQVGFLPFVPFIVGGVGATVAAASGLALATFATVTGAGAVALGGLGLGAGVAALSITVGQLSYQVTRRALTTILPGGREFGLDDLTRVLGGMVGRYISFVDTWFAHGRGDVFQETDAVPAGTVVFVLPNIEYEVLELRERALGRWSTLLVTTPNGVIAMRANGALPLRVVDAREAGQTFEWSTAARRRFTRAQANAINMMVTASKRVPGLKGSIDAAPSQGTGGSGTDLAGILQRLSALEQTSVPRAEFDALQGRVAACEAKITELEADRVPRIDFTELRDRVHHIDGIGLSCLAHLARDLGITVPHNVRTFRQMRANVGEVIWARFVDAVAESFSPMGGRPIFVRTDPAQPRNNHVSLVDEPTTTGFNGTVTPAMRRLTVADLTGDLVDTEWFSWTPYDASGPLGGTIEGIEAYLTDFTSKLKAELEATPTRTELGVAVGTRAPPLSDRLAAVERVIGMQEGNQVWRSNELRELWVAIDSIVTGRGQREFTTATIKWPAAFPSAVATAGRSFGQPGLAGYGELCTLARQLNALVAGVRNGVVSGMTRNGAGVLQLSTISSATGNLTSDQQAVLRACFFPATPRVGEYQIVYPVGGTMGLTRVDPSTNSSIGQYTRESLVAARNAMPRFAVHTTTPDTVGVAWDNQSAAGLPMGAAPVLTVSVNQLSGVPVTEADKQRWDAKQDKFKIVNTDDRVAALSWVDSVDGFAAPGSDMLLDYQAPAGTGSLPFGSKYAMAVAIGGSLGSQLSEAQVSAARVVLGNGVWRDAVIDVLRKLHNVMYGGKYGRIDDIAAMRSYLNDGTGLLPGSEPIVDVGGAEGNACARATILLRGFSSTMVGVDLKIQMLVELYGAEPATAALLYRGWTMQ 1426 T 0.13 DUF3375 pdb T Viruses T 8b5a 2 B B H4K20ApmTri HRXVLRDNY 9 T 7.8 Ribosomal_S13_N pdbhh F T 8b5b 2 D,E D,E H4K5acK8ApmTri XSGRGXGGXGLGK 13 T 7.5 DUF6272 pdbhh F T 8b6e 1 A,B A,B sCTP-23166 MGHHHHHHHHHHMAGYSRAVRCVETGVEYPSLSAAAKAMDLFGPQNIYKAIRLGKLAGGYHWVYVD 66 T 0.0024 NUMOD1 pdbhh F T 8b6f 1 A A0 Q22E24_TETTS Lipid-A-disaccharide synthase MLTHISRRYFSFTGRKTIFVAAGSPSHDLQAANFMRDLKKKSNNNYDFVGIGGPLMQAEGLNQSYADINKFIDKPFFPLKNFIRFHVARCYHPYMAPLHFFNKQVLNQVDKSSLLKDQVELSIPSAIITFGNEFFMKKLYVRLCDQYELHNKIRPPTFFYDRSHINQRFEFQDYLDHFFYTIPMKQINFQSFTYPSTCVGHEGVGRAIQYLFQNSKQYANVKSLVTANGLKIASNPKQHREIIEKLVEEQRGIQRARLGINESKNVFLLAPGNTKAEINFAVNLLSRSLEEFFKKPQLTNVSRDHFTIIITADNAQNAEFVNQAVSNTKYLKTLQTIVTTGEKEKFGAMCAADVGIPLNGELVSECAALQLPSVIISNMNLFYAYITQLYNNFYSDINFAIQGEAYHELVSTAANPYKLSDEIFDLYSDPKLRYHFAERYQNVVHEMIPQANSQDNIVTTDVATLHGVEVQERAFTYETIAAKVLKAARAYESLDKNIPNHQIDQHRKEKLIKAAF 516 T 8.5E-42 LpxB pdbpssm F Eukaryota T 8b6f 9 I A8 NDUTT15 QRGRDYTPSNKKYLQPWELERKEYVELSLAIQSAYSCKMLSEILKDNLYMLTDYQLSFAMFHLWNHEIPIDNYFYNVISPILKEYITRFDRECNKSLAEIATFLGRMNVQDDAALWKVIETKLVQERLYRYIPLNDLIDLAHGMATANRGSQEFYNIVENVIIKHRLRLIPDKIAVAKDCFTARKIGSPLLYQVLENPQAEAHELAGLKEHEQLKIS 217 T 0.32 DUF6386 pdbpssm F T 8b6f 39 MA B2 W7X4R4_TETTS GRAM domain protein MSYSGYSLNGGVHPCLPFYERMLQCAKSEALPIKMCTAQTEDYLECHHRKKQYALNYAIKKELNNIRIVALPRYDEENDTFVPFSQATADHIFQ 94 F F Eukaryota T 8b6f 51 YA BH Q951B2_TETTH NADH dehydrogenase subunit 2 MSIFSNIWINNDLNSYGLSILLLNIINYLIVFMLILSVILLTNLSKFKSLNQFKEFNSYNFILYSLIFSLLSMAGIPPLLGFTGKFLAILYSSFKSQYLLILFMTILNIFGMYFYIQNLRFVVKKNKSSILNYKNYYVNINYSITLNIILLNFFNFFGILFLSDLIIILNYISSYIYI 178 F F Eukaryota T 8b6f 58 FB BO Q23B10_TETTS NADH dehydrogenase [ubiquinone] 1 alpha subcomplex subunit 8 MFLNPVKDSEFDDEVKGFVPSEGEVRFVANKNKECGYYLQGIEQCRRKMVQLAGDSSSQFHSLGFLPCKRLVDAHYRCMTDDKFGSTIEEVPEIGLDSAQKFFDCTFQQLKPMQSCRRFFDQVVRDVYRANGSQLI 136 T 0.23 COX17 pdb F Eukaryota T 8b6f 60 HB BQ I7MAF0_TETTS NADH dehydrogenase [ubiquinone] 1 beta subcomplex subunit 4 MLARTLKNYMRVQQNLRFSRANIEKSPAPPTIGQVELEPFKFNHERDQLIYGYTMEELYGKKFGLKHSATVLREIKKDTIMMILFIIGGFTYCYHMRETRFQLDDDFNEYVNTNKQTFRPIPDHVKL 127 T 0.1 Viral_Beta_CD pdbpssm F Eukaryota T 8b6f 63 KB BT NDUB15 NYHYDFCGRAYMGNPAVQSPPKEFFNYHYVPDNYPDALSGFRIAYRDPFEVQHVFAYENWEYQYDGQWWSMGSLACNVLFFCTPLFLYLILQVEELNSEKRGGTSNKFYHNNAGFFHFQIYDNKQ 125 T 1.5 DUF3930 pdbhh F T 8b6f 64 LB BU NDUTT16 ASQLQREQKLVQSLQQESLQPHLFKIIVDSQSDLVCEADRREYIKHYTRANEKSSTSQLLQVGALLGYIYAVGRYVSNPSTRKFSYGLAALLGSFSLLNPSKNLHHNHSLREIYSKYNISTNPQALEILKSRIY 134 T 21 ApoO pdbhh F T 8b6f 65 MB BV NDUTT17 MNISYTGLKLEDYSDEVIRKYKFPNSNELERFLNREQTLTVQQHKSAIKLAQQDFFAVAGLLSVGSLSYIFYNSVGGKVIRDRIRASRTKAQKVIVLALAFVANAAALIIARNAITAHNFGWQAA 125 T 0.32 P_C10 pdbhh F T 8b6g 1 A CH I7LX66_TETTS Diphthamide synthesis protein MLTQRFYMIQFTKEEQSSEEKYLKTREREEDRKKELMHPQKVLNKKENKRKALLSKNQQNKKLIKYLNLNKRQEKLININQEEMSILPPLQYTYSNEESLELLIHSIKGNKDCNSERKAFNLCRSTVLGKHVEPEKCLDKALVFVNCFQKVRRDESAACQSAFNSTLECGKKYSESTISLGSSCQSQLDAYLNCK 195 T 0.0012 CHCH pdbpercent F Eukaryota T 8b6g 2 B CM Q22HD6_TETTS Transmembrane protein, putative MARLWWTLDPSKYYLKQISSGGRNEILFTVLGVTAAYWYFGNKRCEHYWRRQIDNCQSWSRAQNINGNNLTVKQYF 76 T 0.0044 PriCT_1 pdb F Eukaryota T 8b6g 3 C CL W7XBF5_TETTS Transposase MKLDQIISYYITPVRRFDKNLTAEQIYEQYQQAAQFNEIDAFTNIRFHRKFKEYIQTQEQSDYLYEKAKQISTLAQKMFEKKFPEYYTQ 89 T 0.13 Cdc6_C pdb F Eukaryota T 8b6g 5 E CI Q22YL0_TETTS DUF4885 domain-containing protein MFSDFNMYEAKVFLKAVADAQNTFRQTAQQENQLARYESQSQSLLNGSTSGAISITGDNIQQGRNFKALKEVKLFQYSNEIFKKYLAGFDSFSGDYTAFKKFLNESVKKIEQDA 114 T 0.0045 gp37_C pdb F Eukaryota T 8b6g 7 G CF Q248F8_TETTS Transmembrane protein, putative MIKYLLHQLFIYIYVAEVLLGCIFAFAETVFFHSDQDEDYFLQIKQIQIKNQKRFRNNQKKSRSFKKKIINQQLVSKMVRLNLKSNVDQNEYPFLAKWDKDMRQNYEEYQNRIDATTYHLQRSQRGIAVFGEWMYPRYFQKDILELEVLRRKQQLGKIYPEEVSSYTQINPDIANDLNLTFNAKLLWPVRGMTVGAGFFAFAHLFNLPYSFRLGLFVLPTAVELAFTWGNKTSQFKSIEFMDYLLQYRVSKALLEKNAKHFAEKKAAYQKEINSSQSVQDLYNQLITLVSEQAPSE 296 T 16 NADH_dh_m_C1 pdbhh F Eukaryota T 8b6g 8 H CG I7MEX7_TETTS SDHTT3 MSLVSLFKNTFLKSRVIGLSFQAQRVMAQMAKTDFENPDEHFLLNDAMKYNELVFYGRLAENWSINPELFGKAELAKYNEAKQTLIDFNQYHALVQNLHEFYWELKTIYLELSRGVATSNFHNKREVTHSIIESDIKNSIHKYIQLIDDLKDYPEWQHKVREEIGYYAHMIYTSVNHDGNFPEIFKEFNKVDSLYYFK 198 T 0.0012 MiaE_2 pdb F Eukaryota T 8b6g 9 I CK Q24CW6_TETTS Transmembrane protein, putative MLDDTKYIQMAQKFPRNVSVQLNKKLFVTRTWFRNYYFVGVFGIFAYFIYNQPKIFAPFSGYPTTVAYKAQPDFLNDQVIFYSQQRQNTLKNF 93 T 9.1 DUF108 pdbhh F Eukaryota T 8b6g 11 K CJ Q23S01_TETTS Transmembrane protein, putative MNHSCQKVFEGFVSALYDTSYFFRNFGPFKATIHYATYANYLAQNWAPRVSYIETSTPAYTLAKNKYAVYIVYGLIGGALIHNYMLDNKAAQKSQQYYLKHRD 103 T 11 Fzo_mitofusin pdbhh F Eukaryota T 8b6g 12 L CN W7XF00_TETTS Transmembrane protein, putative MRRIFWNFKTAFVGLPMFSLAPKNILVYPIVVGVPLYTFIVLQNSVRGFAYFDEYDSDVKEN 62 T 6 PPI_Ypi1 pdbhh F Eukaryota T 8b6g 13 M CC Q23RH8_TETTS Cytochrome b-c1 complex subunit 8 MRTKLYNAAYFLLNNNESFGHSFGIRLKIVGLNTWIVGYAVSRYYFSSLRVKAAQDERFE 60 T 4.8 eIF3_p135 pdbhh F Eukaryota T 8b6g 14 N CO SDHTT11 LPIRNIQFARYHYLAAVTVFTYFATRCCLLDYKKYYPLASVKK 43 T 12 AcylCoA_DH_N pdbhh F T 8b6g 15 O CD SDHD MFKELIHIFRTYYITFRYLKKSNINFLKNLSYTLIAYYLIINFQ 44 T 17 DUF1869 pdbhh F T 8b6h 4 D,DB DD,Dd Q23FF5_TETTS Cytochrome C oxidase subunit Vb protein MKKQKRTQGKQNTKQIKQEKLSSKRKANNQKEGKKKVKQEDYKEIKQKGKRMLSKIVKASFSSKGFNLANAVNTVKSTLNAPIKHIKRNIEPTGSNYSRMTNTTEEAFDEVSHEWQALVTSNPFDLNVFNYLENTQTSNFGTVDNPLVVFTSETPFRYVGCTGQMNEDDYEGHELLFFLLREGSLQRCMGCGQVFKLVRLRNEYSPEMDYYLSNFHPYEMQEMGESDTTVLMSPYKYASHYEYTQFETPSNMVYSMVNPDEHDRLLVDPAYRMERTKALEEKYKVYTSSLREVEKQFEERYGRAGQINISKVTYSTLIDVEKAVLKMDRLFRKVAKFENRAFIDRANHSRREKRMLERAQQRWDSNYSFFTGSLTEEEQKYRDYYETELEAYPEDEGIEQQLDQQEVLLSGRYDPKLYDFQEGYTKNPEDDQTSLIEKKAFKFRYRLANETSETFQRRNNRMVERQIKRFQQPQYKHAFEQLQKNIAISSNSGNALHSEYGYLELLSNESVQLYKDYYESDAEEDFKVFENLSSKEKLVMIANFENNLLPKYDRSEVHLIPKRQWEPAFGVWENFLYDITEYASFIAPRGKEIAADYQIQSAIPLTKEELIEAGLYKETIEKKVEPKLEAKKQTKSE 637 F F Eukaryota T 8b6h 8 H,HB DH,Dh I7MGF9_TETTS Transmembrane protein, putative XNRFFKVSSKYQYYKYLEQYDAAFLRKYQSETHWYLGRRGAWKNLVIKYAGDHISLEEEHNVKYKTHLSFVYLSYRLAWVLFAYVLIYNHFLLGDIGKTFNVGEWDHRLKPSAERDYPTRYESLYILDRTQKW 133 T 0.0061 IRK pdbpercent F Eukaryota T 8b6h 26 Z,ZB EA,Ea I7M3P9_TETTS COXTT12,Transmembrane protein,Transmembrane protein MYVLFVCLIDSMNVEEGKQIKEMILPHNNRQLARQYFDSLPENDINRKYYEGLKYETPKTFFGRFLNQFNIDAKLDTLSKFYTYQKTIRATQAELQEDRKSYLTNSLLFTAVSWFSIYQFARKGAVLPVLREYGRYFGTHRLFRQYLHTLVLPLLYTEYALNQKYYTHMEHLWTVHVNRLNQKILEDPLYTFYPQELNVPKHNIIVPTIFRDTPQ 215 T 17 Plk4_PB2 pdbhh F Eukaryota T 8b6h 28 BA,BC EC,Ec COXTT27 MSALLKEILALTVKSEAALWKGAEQKVLSGLNNLAKTELVQITHHFGVNKQGSEALWSQLDKAAVGAFPELSVDETLQLIDGFGECPDSYTLSHDLNQRLLVSWEQLGKLNFQKLKETNPYFASDIVNQLDAAAAEFIKVRPAAESEAGGFLNSLGVSSSFNTTKNDIYVVQSASGKKLNNKEQREAYVLEKAQKYLKEDPQSKILDIIAQK 212 T 1.3 A_thal_3526 pdb F T 8b6h 32 FA,FC EG,Eg COXTT28 MAARDFEYNNQDVNQLNGAFISLVEDEKIGFWVGVGGFAYSQFIMRKFVKSTNIFASVTSLFAGAALANLYTHQSRASYARVAARANRNASLALNKLMEY 100 T 0.02 DUF1689 pdbhh F T 8b6h 33 GA,GC EH,Eh Q23D87_TETTS Transmembrane protein, putative XDNNYHFWGNGDRQDVSLSYEDYYSILDCLLDEKLSPQGLMKFKNLHEVSMYGVSYVPLYCFPVAYGISHMLTGKVRRGHSGYRNLFSLMSVVLPFTCWYAYTTPIPRRLYTEIICSNNADGAYVRNRIKQQKPGIWRKLSQQLYNKNFRFPELNQDLTATEFPLDYVAPHKF 173 T 0.16 DUF2206 pdb F Eukaryota T 8b6h 35 IA,IC EV,Ev I7LVX0_TETTS Decapping nuclease XEVKYRGPSDDKLECEFLENNLLSCLREKSVQDNVAKMTCRPEFLVWFFLECPTKAAVYHDPKGLRNIFIQDKIKQKGSDDGVLSKDD 88 T 1.3 Defensin_4 pdbhh F Eukaryota T 8b6h 42 PA,PC EQ,Eq Q22W32_TETTS Transmembrane protein, putative XEPFGTDERNWTHEEKDIITRFLKYDKHVNLKTAEMVYSAEVESAYFGKAGALAGGVISALFFNFPIVRNLPIIRRSVIGVLPFLYCYTWGKNTQEELRWLKTFAAYQRFVVYHGQHCKLWV 122 T 3.1 Pepsin-I3 pdbhh F Eukaryota T 8b6j 12 W,X l,L UQCRTT2 MAPVFLKALRYVIYSYPLYVCYLIKQAQINAQGSEKEEEHH 41 T 2.8 DUF5392 pdbhh F T 8b8f 1 A A B0D650_LACBS N-terminal beta-trefoil domain of the lectin LBL from Laccaria bicolor MSNEYNPPLGIAFRLCGLASDRVLFSRVSPSPEVFHHPKSEVYPDQWFVAIPGSGQNAGCYAIKSKNTGKVLFSRMSPDPRVGHIDGDGKYPDNWFKFEAGSGKYAGYFRLRAVASDTVLVSRTSTGTDTQVINYPATSAKYDDQYFTILFD 152 T 0.001 RicinB_lectin_2 pdb F Eukaryota T 8b9z 26 AA b Q9W380_DROME UNCHARACTERIZED PROTEIN,ISOFORM A,ISOFORM B SLLKRAWNEIPDIVGGSALALAGIVMATIGVANYYAKDGDNRRYKLGYVVYRHDDPRALKVRNDED 66 T 0.038 NADHdh_A3 pdbhh F Eukaryota T 8bat 1 A A B3E6E9_TRIL1 Geobacter lovleyi NADAR MGSSHHHHHHSSGLVPRGSHMAERPVYIPNISGTNLVKTQYVDFKWFPGMAIVQKQKSIESLHEAAKKLLNITNLLEISSKSKTTLGVDLSAFNLMITTIKYNKTFSVESAFQSSKVFEKGGPYLDLLDKTSREAKKDGRLQTSGRLKCFKFFGIEWGLEPQTAFYDWLYINALKKNSDYAEQVMEYSAFTDIEFNPERSINCQAYSAALYVSLCHRDLLEYATSSQTAFLEVVTGAPISNARQDDIVQGALKF 254 T 9.4 Phage_30_3 pdbhh F Bacteria T 8bbt 1 A A A0A0B4VFQ3_9VIRU MOBP MGYLCNDYGYEPNVDYPNASHAGLYDRSKQPYVDTAIGPKTTIQFDHVFIKSDFKTWLAHNQDEAILLIRLYELGLLLQGRSDSFLEFYNNTTYITRTDSKQPFLNKYGKLVDTTSVTCLDIFLSVVLFALNQIDSLICDFKNTPWINLSKEHKKIYELVRGIFGICYGEKDYNRFEYCPFDANSTASALNVNATLNAKKTIELITCGLIRALIAYANLVTAFSADKTALLHEILLTKVCC 241 T 0.51 DUF3746 pdb T Viruses T 8bc5 1 A A A0A0B4VFQ3_9VIRU MOBP MGYLCNDYGYEPNVDYPNASHAGLYDRSKQPYVDTAIGPKTTIQFDHVFIKSDFKTWLAHNQDEAILLIRLYELGLLLQGRSDSFLEFYNNTTYITRTDSKQPMMNKYGKLVDTTSVTCLDIFLSVVLFALNQIDSMICDFKNTPWINLSKEHKKIYELVRGIFGICYGEKDYNRFEYCPFDANSTASALNVNATLNAKKTIELITCGLIRALIAYANLVTAFSADKTALLHEILLTKVCC 241 T 0.51 DUF3746 pdb T Viruses T 8bc7 2 D D PHE-PHE-GLU-GLN-MET-GLN-QCI KFFEQMQX 8 T 3.5 FAM110_C pdbhh F T 8bcs 1 A A CC-HP1.0 XGELEALAKKLKALAWKLKALSKEPSAQELEALAQELEALAKKLKALAQGX 51 T 0.0097 LIN9_C pdb F T 8bct 1 A,C,E,H D,B,H,F 26alpha XGELEALGKKFKALAWKVKALSKEPSAQELEALTQEAEALGKKIKALAQGX 51 T 0.021 Seryl_tRNA_N pdb F T 8bct 2 B,D,F,G G,E,A,C 26beta XGELEALAKKTKALTWKFKALSKEPSAQELEALTQECEALGKKLKALAQGX 51 T 0.026 FlgN pdb F T 8bd1 2 B B A0A0L8UU71_VIBPH RHSPI MISLSDIENLIQHIWEEPIFSDVTSKKVVVSLYGTLSKKIPDKFIIIEEVFPKDELEDIWSNYEEYLDEYLIFPFLGTLGEAVICIGYGNDNKGKIFYFDFDFGACELDGDNLEAFLEKLLESGSTENLYFQ 132 T 0.00035 SUKH_6 unppercent F Bacteria T 8bd5 1 A A A0A8X6EH11_9CYAN ShCas12k MGSSHHHHHHSGGGSGGSAWSHPQFEKGGGSGGGSGGSAWSHPQFEKSGGGENLYFQSNASQITIQARLISFESNRQQLWKLMADLNTPLINELLCQLGQHPDFEKWQQKGKLPSTVVSQLCQPLKTDPRFAGQPSRLYMSAIHIVDYIYKSWLAIQKRLQQQLDGKTRWLEMLNSDAELVELSGDTLEAIRVKAAEILAIAMPASESDSASPKGKKGKKEKKPSSSSPKRSLSKTLFDAYQETEDIKSRSAISYLLKNGCKLTDKEEDSEKFAKRRRQVEIQIQRLTEKLISRMPKGRDLTNAKWLETLLTATTTVAEDNAQAKRWQDILLTRSSSLPFPLVFETNEDMVWSKNQKGRLCVHFNGLSDLIFEVYCGNRQLHWFQRFLEDQQTKRKSKNQHSSGLFTLRNGHLVWLEGEGKGEPWNLHHLTLYCCVDNRLWTEEGTEIVRQEKADEITKFITNMKKKSDLSDTQQALIQRKQSTLTRINNSFERPSQPLYQGQSHILVGVSLGLEKPATVAVVDAIANKVLAYRSIKQLLGDNYELLNRQRRQQQYLSHERHKAQKNFSPNQFGASELGQHIDRLLAKAIVALARTYKAGSIVLPKLGDMREVVQSEIQAIAEQKFPGYIEGQQKYAKQYRVNVHRWSYGRLIQSIQSKAAQTGIVIEEGKQPIRGSPHDKAKELALSAYNLRLTRRS 698 T 0.0027 RuvC_1 unphh F Bacteria T 8be6 3 C P SOS1-HRas-peptidomimetic2 XHPWSVAX 8 T 0.079 DUF3019 pdbhh F T 8be7 3 C P SOS1-HRas-peptidomimetic3 KXHPWSVAX 9 T 0.03 DUF3019 pdbhh F T 8bea 3 C P SOS1-HRas-peptidomimetic10 XXHPWSV 7 T 1.6 DUF3019 pdbhh F T 8bef 18 R u Q8VZ65_ARATH Uncharacterized protein At1g67785 MVKVLTYFGMTLAAFAFWQSMDKVHVWIALHQDEKQERMEKEAEVRRVRAELLRKAREEDPLA 63 T 0.01 DUF6082 pdb F Eukaryota T 8bel 7 G,N J,T UCRY_ARATH COMPLEX III SUBUNIT 10,COMPLEX III SUBUNIT XI,UBIQUINOL-CYTOCHROME C OXIDOREDUCTASE SUBUNIT 10 MAGTSGLLNAVKPKIQTIDIQAAAGWGIAAAAGAIWVVQPFGWIKKTFIDPPPTEEK 57 T 0.0016 QCR10 unppssm F Eukaryota T 8bfe 1 A,B,C,D,E,F,G,H,I,J,K,L,M,N,O,P A,B,C,D,E,F,G,H,I,J,K,L,M,N,O,P CC-TypeN-LaUbUcLd XGELXXLKQELXXLKWELXXLKEELXXLKYGX 32 T 0.0005 DUF5320 pdbhh F T 8bfj 2 B B GGNB2_HUMAN LARYNGEAL CARCINOMA-RELATED PROTEIN 1,PROTEIN ZNF403 DEEIFISQDEIQSFMANNQSFYSNREQYRQHLKEKF 36 T 0.012 Clr5 pdbpssm F Eukaryota T 8bfk 1 A,B,C,D,E,F,G,H,I,J,K,L,M,N,O,P,Q,R A,B,C,D,E,F,G,H,I,J,K,L,M,N,O,P,Q,R A0A7U0GBC4_9CAUD TAIL INNER TUBE AKKDPNTIMSANSSYANGANGSVTNLEIPAAFGYTPDFRYYHAAADYTRRPTIAFLMELPNCFKDTDDAAKWGGSLKALIEMHSRTIDGLDYTLEVEHVETPFGGGGEMMQTLSKVRRARSVPVFTWVEKIGMPVSRFWNNYILYFMGEPNSNVAGIIGKGGITPAATYPDYNTFSVLFVEPDPTERYALRSTLITNMQPTGQGPEMRMSKDQTSSPEQLQISQTFTGLQMVGRGVDKLGQMMLDRASQTGIDLNAQPAFLSDREADVAARTDGYIDQLVSSLSKPGVAI 290 T 0.48 Phage_T4_gp19 pdbhh T Viruses T 8bfl 1 A,AA,B,BA,C,CA,D,DA,E,EA,F,FA,G,GA,H,HA,I,IA,J,JA,K,KA,L,LA,M,MA,N,NA,O,OA,P,PA,Q,R,S,T,U,V,W,X,Y,Z A,a,B,b,C,c,D,d,E,e,F,f,G,g,H,h,I,i,J,j,K,k,L,l,M,m,N,n,O,o,P,p,Q,R,S,T,U,V,W,X,Y,Z A0A7U0GBA8_9CAUD Major head protein HEFAELFYRTYIVTPDQAGFQLSIRRNLVWEGWTGEGLSGEKQEISKRNILQGLLDYTTLETNSTELIPVIQSGENDEQFIDPSVLPAQTVKQGKDTFDTNFLKFSENGEGFNLLMLAQTPSRLKKGSMTFTDSLDSRIALKQLLISVTKGGTTELFALDVNRDQYAAYTATREYNFRLMQLKFHTSLGLGEESTTVAGAESALLKDLFDLGYRIELDVKVDGEMNVENGNGDTSLRALRLARVFDKEGKEIALTDSRVSAALSGLTVTGVGYSLEARLTNINQLEMGLLIDSDVQKQGFMIPTLPPLVIVKPAMVEDDKTYPRLEALTTAYRIQQMRNNAVTTLLNRADTLKSYLGVGVPHPIESNLGLEGVGQYYVRPYYNEATIDVLNDLNNLTSAAKQTDIQGLIVSKINEMVYTADQLTGYTAALEAAFSGRSPKPHVAIGTDMRLPQYIQINGDDRTVGIGYDYTIARISDLRMKDKIVMTFILPNESEPHPLQHGVLGFIPEYLVDFNMIRNQRIGREIRLTPRYRYFNFLPIMLVINVINLEEAIAQRTALDVNETQVTPAS 570 T 0.26 NigD_N unppssm T Viruses T 8bft 2 B B OBG_ECOLI GTP-BINDING PROTEIN OBG LEEIAEEDDEDWDDDWDEDD 20 T 0.38 DUF1967 unppercent F Bacteria T 8bgm 1 A,C A,C A0A5P3XKM0_PARBF ORFX1 MHHHHHHENLYFQGNREFPFHFNDGNVSMNGLFCLKKIKTQYHPNYDYFKIKFCEGFLSIKNKVKDDLCEYDLKNIESVIALKREYSKENNLKNKESAIFMNIGNKGIHNKYDLYVVNVDINNILDENYMLKGILNDKLKILFLGNERKLLRIKN 155 T 0.013 RTBV_P12 unp F Bacteria T 8bi8 1 A,B A,B NOS1_HUMAN CONSTITUTIVE NOS,NC-NOS,NOS TYPE I,NEURONAL NOS,N-NOS,NNOS,PEPTIDYL-CYSTEINE S-NITROSYLASE NOS1,BNOS QTHLETTFTGDGTPKTIRVTQXG 23 T 13 DUF5377 pdbhh F Eukaryota T 8bi9 1 A,B,C,D B,A,C,D NOS1_HUMAN CONSTITUTIVE NOS,NC-NOS,NOS TYPE I,NEURONAL NOS,N-NOS,NNOS,PEPTIDYL-CYSTEINE S-NITROSYLASE NOS1,BNOS QTHLETTFWGDGEPKTIRVTQXG 23 T 9.5 hemP pdbhh F Eukaryota T 8bmw 1 A,B,C,D,E A,B,C,D,E A0A157T170_SACSO CRISPR-associated small subunit protein (Type III-D) MRVKHYIQREFNYSVSSQDLLDIATRIAISAIKPKPKSNKPEPYVDSSTINSLLSFLQSRRNVNELLLYIMRQAGRDEIDEETGKLLLASLKDRELKDAVNLLGYVKWVYDTLTGLKVNYNNVKGVKTFKELVNILSKV 139 T 0.081 HTH_33 pdb F Archaea T 8bon 2 B,C,F E,D,F Macrocyclic peptide S1B3inL1 YRRPREQIIIGSLWVFXGX 19 T 3.4 RIC1 pdbhh F T 8boz 2 C,E,F,H,J,L,N,P B,D,F,H,J,L,N,P A0A2G9AAX8_ECOLX Lipoprotein MLKEWMIFTCSLLTLAGASLPLSGCISRGQESISEGAAFGAGILREPGATKKADTKDLNVPPPVYGPPQVIFRIDDNRYFTLENYTHCENGQTFYNNKAKNIHVKILDASGYLFKGRLFWLSTRDDFLAFPATLNTRHASCMGSNKGCMNAVIVTTDGGKRRSGVPYGSYTQNPTGATRDYDMLVMNDGFYLLRYRGGQGRFSPVILRWILSTEDSSGVVRSEDAYELFRPGEEVPSTGFYKIDLSRFYPKNNVMEMQCDRTLEPVQPSESKIQ 274 T 0.012 BNR unphh F Bacteria T 8bpn 1 A,B,C,D A,B,C,D W0DP94_9GAMM Twin-arginine translocation signal domain-containing protein MSYYHHHHHHDYDIPTTENLYFQGAMGKYVKVQDFYDQLGKYVLVAPGKFSGTVAATDLSTGWTMAWLAAWNYGDTCPIMHHMAAFPSPDPYKEFEFVVNTQGGKNLFIYGVPVTVEDPGEGMKIYRIKYDGTRMNLQRDAAEVSGLGLGVHVTITPEADGYAVGDGQKDICAEFDRETDMVRYAWAFDWDPNVKDLKRAWLDGGTMTIKRLKPTLPGGRYDLQGSKGNKIDWELVPGGELAIEDGKVSGDRPLHSVANDALVFDPRGKWAVASMRLPGVCVVFDRENQVPVAVLAGPKGTPSQFQLVKVDDDTWTVDIPEVISAGHQAGFSPDGQSFLFMNSLRQNNIMVWDSSNHDDPTTWEKKAVVESPDWRGAYPNTQHMVFTPDAKKIYVTMWWPSPTPNGIAVIDAVNWEVLKEVDLGPDMHTLAITYDGKFVVGTLSGYQNTASAIVVMETETDEVLGFLPSPMGHHDNVIVPRTLEDLRISRSTTT 494 T 0.0037 Cytochrom_D1 unppercent F Bacteria T 8bss 2 B B THAN_PODMA Thanatin-like derivative VPIIYXNRXTXKCXXY 16 T 2.1 Fuz_longin_3 pdbhh F Eukaryota T 8bt6 1 A A A0A151HKA5_TOXGO Putative anonymous antigen-1 GMAIGKVKRNPDAGVATAVQSVIQHQTFKRMLLFGLRSLADFCSPSNQLYQENALDALDRGVLSAIQTAVTTFSDDDDLMLCASRVLWAMSVAIKEEMDPAHIARVHSEGSPVIVAVVNSSPTDPQTIEDSMNFVDNLKRAGAPVDGASLAGGMLSIFTKTALDMKTAKRVTAALAIAAETAEGSTALYNAGGTSVLLTYCLDQGDLSDAGVEMVEGAFDTVRYMAGYQCTDATTLPQCIALMDKYRGRKSASAKGSSALAAMIGPEQLQKCLNTLKTAEAGSAEYDEALVTLGSMSYISSFTDEIVRAGGVPLLIELINSGLPQMEGNPEKIASMISGAAKMLARIASNPVNVDAIVQAGGVATLCTAVSYCTESMEALGALCMALVPLASRESLAHEIVQYQTFATVLPILYQNVESPEIAALAMELVATGSQHEEIQEHMLQNQAAEICSLCCQYHTADASYQQHAISALNRLVPRLTTLHGVSEYGGIQGVIASLNANVNNEQVALLAVQLLDNFSEVSDAKTYMSDGTCVDAVLAAMLEHEGNDLLISAGVHCLARIATEDDCARHLNVLDTAIQTARGNPDGVYRVLAAISGLSRVPSLRQIFEEKNASDTILAGISSWIECSRFEGQNRIIKAALKTVKNMKISGDGDLTSCFAAMCDVACLPQVKRVVELEEPDNNILVADTAAFRDLAATMRITGAENLERCIESVLRVMRKYPDSRRAQLNCLETLNYLAQCDGGEGVAILSRTGGLNAVVQYLTRAPMYLDAQIAGFTVLATSAKIDSNVGETLRKCNCLQALKVAMRTHAKSKELKRTIAPLVALLMPTDALETEIQELLNECASACEKNNFPHLHENLAALNELLISSEGAKIAARLGIGAHMCKYQEYISAHEQDALAVTDYDILGKDLFDATVSECAHAMEQVASTRSGRNALIKAGNVATLISLYESLKAPQSQYSEEAAIHCLEALRILLKSDKRSAELAFERNFVSTLCVGIDSFPHSAPVLGATCACLAAMATTPERVQMLTAQPAFESLLQKLVFVIQNDPSKDNKLVAMRALQELVEITNDATMANKIAEAGAVTALFRIIDEYGDDEQLTVQAAEVLALLGAFEDLRRFYDNDVRFPAQVLTAALTKQKNNETAVVHLLDVLNKLATSEDRAVLRELGVMEQVADAMRVHSESEAVTRLGGELFAKMGADEQIKSLMLQIIETVESGAEDTAQTVDILCGRLAVFLAAPLEDPRDALQHTEKCLGSLVATLQTYPGSERLEGNVALVCRRLCDRCFDDADDPYGAWAVAASGMLAQFAGMVAGETVLANKKFLGPAYRTFTACCANAYCMPTMVEVAPSFLPQTYTLLEMHKNDAETVARVLEFLRYFAEDPTACGLIVQNMSGSSGDVVALTVLLMQQHQNNDAVVCAGMEFLGALAYTLSQAGYEPLPTLADGSVLRDCDALMGSNSSSARQLAHMHMIEKMLLSKAYNDALIQEQALKKLTMSLKAEDDKKRFSDEERAGLYAAMACVLLAAGGAGLTGEMEKFNGFEVVLQAIEEFGENPTVIKEVNRALQGLSMADVNMTARTVKEAVPKLCTEATTAIQTDAECADTFCDLMLQLVSQEGNGRQLLQVYGLEETLQGVENLAAYYGEDFGTQLSEKVAMIRQAMEDDQPREKTCKDVYDLLNSRVQQGLSVAISEVAILQEEVEFLVSQMGMYNQEQLDHQTAMGADHQYGNMAFELLAATSANVKLLQANEFSKMELALIKGQADPEIVLYAVKALTAFCKFPPAAQDTARIQGCPALVTEACSKINKSGLPNERKEEHLCARYFLVERTAINRNLYNKTPIMTELINSWNDYDKGAYTTTLLRFVFRAMRRVVSDAHVEELLKANVLQRLIGIISDVNADMALLPDVLFLLGSLAVVPEIKTKIGELNGIAACTDLLQRALPKPNTAPVVTNVCLAFANICIGHKKNTEIFSKLGGPALNVKVLNDRGHEYDVCNAASVLLCNLLYKNESMKKLLGTNGAPAALVKGLSNYDGSEEKTAIRCLESVFKAISNLSLYTPNIQPFLDAGIENAYSTWLSNLSETFPDAQLETGCRTLVNLVMENEENNMRKFGVCLLPCMAVAKQGRTDTKALLLLLDIEASLCRLKENAEAFAANGGIETTIRLIHQFDYDVGLLTLGIHLLGIQSAVKDSIQRMMDADVFSILVGCVEVDAEGNEVTDLVVGGLRCTRRIVRSEELAFEYCNAGGIATIANVICKSINQPMVMLEACRVLLGLLFYTTRSQADRQAAVEALHAQCQQRAEQMHAQAQADYEAGVVSEPPPEEMEVPEPDPDELANAAYGGWYQMGMDEVMIDAILQAVCACAAVEAHAKQLRLQRVCLGLAAYFASEQMGTSSLVGSGIEQVLTQIMTNFAGEGTTMQLSCVIINSIAMTSGDMYEEIKTSALLSALKTSVGKMATKKPEEKALKETCAATLEAASSGEDPFDAFSKTVTELDFKFTEWNVDPYPNGVHDLPSNVKEALRKGGKLKVFLPEKEKEEIRWRSSQDLNVFEWCMGNDQDYNNRIPIVRIRNVAKGLVHPALKAAAKKEPRKVAAKFTMCLFGPPNDDFPEGVELPMVAKSQKERDAFVEMMVQWRDAATYNFHHHHHH 2646 T 0.00073 Arm_2 unppercent F Eukaryota T 8bv1 2 G,H,I,J,K,L N,G,K,H,L,I P4 peptide inhibitor of histone chaperone ASF1 XEKXARLARRIAX 13 T 3.5 LsmAD pdbhh F T 8bvp 1 A A Q5ZSL3_LEGPH Restriction endonuclease GGAMSIPCKWLKKDKGDYSIPFPTGTTSIPEETIPSAIVLQPVANENTVISGYKLKDTVSSPEKAQEVNNKTVSPRTPKIIVKHDNSLQSLTIMDIYSQKPIQFDESKVDEIIHSLETKKVNLEKAIEDNNAELSKIKKQKSKLAYLTRLYKENKENIQDYCTLNEYIEAHLFNPKFLSRHEKALNNFKALKSQFTGPVNLKELEKLTDKLTGIKEYSYDFHSNSLPYDLEHDKSFRNFYDFDGLKESIESIIKELEVLNSIRQAVSDKYPNSFKALNETEEHDDKLKFINIIFNDGFSTTYDQQTFIKALSALDIEKAIDAYTNVKNKLENTQDIIANKEGCRNKLISELQTLIANKQEPYLSANEKLGGFYSKRKLSASEGFHLAYQANRRDPIKPEVIENIITKMKPIDEDTHLDIHIRPPDCGVFITPEDIKKFQEAGIKVNITIHEYKQNYTRRYLQQYTHDLMRQANSVQFFNAEDRENAIIAATYGDCDKRNTTEPTGVAKKIREVGEDFDLDKYPVQKYDLKGKSGLTVASQKL 542 T 0.42 Epimerase_2 unphh F Bacteria T 8bx8 4 D D I7M008_TETTS Dynein intermediate chain 2 MPPKQTKVVASRKTVMPISRAGRAQIRRKDSNTQNNMNDQGMEDEEIDQQREGMKNQYEQLTAQELNEDMPSKMLEPKNPQAPKNITVYDYYTRKFKTDELVDQMIVHFSMDGDYIWKESNEYKTQEEIRDTKKALIKEAMRKQESEEPGANHDEEAIKQTLRNKFNYNTRECQTINPSIRERGVSTEPPPSDTICGNITQWEIFDAYYAEIMKDHQIENKKKKEVDQDKKQDQSMYSTSFKRCCKIMERMVVQNDQEDKYHDYRYYWSQGDNLEAGKNEGHLLPIWRFSNEKQRKKNVTSICWNPLYPDLFAVSLGSYDFTKQRMGLICLYSLKNTTHPEYAFNCEAGVMCLDFHPKSAALLAVGLYDGTVLVYDIRNKHKKPIYQSTVRNQKHTDPVWQVKWNPDTSKNYNFYSISSDGRVMNWILMKNKLEPEEVILLRLVGKNEEESTLIGLACGLCFDFNKFEPHIFLVGTEEGKIHKCSRAYSGQYQETYNGHLLAVYKVKWNNFHPRTFISASADWTVRIWDSKYTSQIICFDLSMMVVDAVWAPYSSTVFACATMDKVQVYDLNVDKLNKLAEQKIVKQPKLTNLSFNYKDPILLVGDSHGGVTLVKLSPNLCKSGPEIKQTEDKKAMEEFKNVKIEDYEREKMENLLA 657 T 0.0046 WD40 pdb F Eukaryota T 8c17 2 B B Stapled peptide GFSPXDFHXDIXCDVXRGX 19 T 23 DUF5510 pdbhh F T 8c29 17 OA,R u,U A9NJW3_PICSI Photosystem II 5 kDa protein, chloroplastic MASLSLCAPCNISSASSLAAGYNKVPCKSVRGGAQVGQVFMVNKPFKASQDWAVHDENVTMKKKEDDQERMQRRRMMFTAAAAAVSAAASQGMMAMAAGEKPTGPEPKRGTPEAKKLYARVCVTMPTASVCHN 133 T 0.0024 PsbQ pdb F Eukaryota T 8c2d 2 B PPP Pyrin pS208 peptide RLRRNASSAGRLQGLAGGA 19 T 55 SfsA_N pdbhh F T 8c2p 2 B B POLG_FMDVS P3B-3,GENOME-LINKED PROTEIN VPG3 GPYEGPVKKPVALKVKAKNLIVTE 24 T 44 DUF2111 pdbhh T Viruses T 8c3e 1 A A Engineered protein LCB2 GSSDDEDSVRYLLYMAELRYEQGNPEKAKKILEMAEFIAKRNNNEELERLVREVKKRL 58 T 0.0011 TPR_6 pdb F T 8c3l 1 A,C A,C Q8XAD6_ECO57 Phage repressor protein CI MQKKEIRRLRLKEWFKDKTLPPKEKSYLSQLMSGRASFGEKAARRIEQTYGMPEGYLGSSHHHHHH 66 T 0.0032 HTH_3 unppssm F Bacteria T 8c3w 1 A A dnHEM1 MVSLDQAILILVVAAKLGTTVEEAVKRALWLKTKLGVSLDQALRILSAAANTGTTVEEAVKRALKLKTKLGVSLEAALAILSAAAQLGTTVEEAVKRALKLKTKLGVDLETAALALLTAAKLGTTVEEAVKRALKLKTKLGVSLIEALHILLTAAVLGTTVEEAVYRALKLKTKLGVSLLQAAAILILAARLGTTVEEAVKRALKLKTKLGGGSGGSHHWGSGSHHHHHH 230 T 0.0037 RuvA_C pdb F T 8c4a 1 A A A0A7J6JYP1_TOXGO Putative anonymous antigen-1 KRNPDAGVATAVQSVIQHQTFKRMLLFGLRSLADFCSPSNQLYQENALDALDRGVLSAIQTAVTTFSDDDDLMLCASRVLWAMSVAIKEEMDPAHIARVHSEGSPVIVAVVNSSPTDPQTIEDSMNFVDNLKRAGAPVDGASLAGGMLSIFTKTALDMKTAKRVTAALAIAAETAEGSTALYNAGGTSVLLTYCLDQGDLSDAGVEMVEGAFDTVRYMAGYQCTDATTLPQCIALMDKYRGRKSASAKGSSALAAMIGPEQLQKCLNTLKTAEAGSAEYDEALVTLGSMSYISSFTDEIVRAGGVPLLIELINSGLPQMEGNPEKIASMISGAAKMLARIASNPVNVDAIVQAGGVATLCTAVSYCTESMEALGALCMALVPLASRESLAHEIVQYQTFATVLPILYQNVESPEIAALAMELVATGSQHEEIQEHMLQNQAAEICSLCCQYHTADASYQQHAISALNRLVPRLTTLHGVSEYGGIQGVIASLNANVNNEQVALLAVQLLDNFSEVSDAKTYMSDGTCVDAVLAAMLEHEGNDLLISAGVHCLARIATEDDCARHLNVLDTAIQTARGNPDGVYRVLAAISGLSRVPSLRQIFEEKNASDTILAGISSWIECSRFEGQNRIIKAALKTVKNMKISGDGDLTSCFAAMCDVACLPQVKRVVELEEPDNNILVADTAAFRDLAATMRITGAENLERCIESVLRVMRKYPDSRRAQLNCLETLNYLAQCDGGEGVAILSRTGGLNAVVQYLTRAPMYLDAQIAGFTVLATSAKIDSNVGETLRKCNCLQALKVAMRTHAKSKELKRTIAPLVALLMPTDALETEIQELLNECASACEKNNFPHLHENLAALNELLISSEGAKIAARLGIGAHMCKYQEYISAHEQDALAVTDYDILGKDLFDATVSECAHAMEQVASTRSGRNALIKAGNVATLISLYESLKAPQSQYSEEAAIHCLEALRILLKSDKRSAELAFERNFVSTLCVGIDSFPHSAPVLGATCACLAAMATTPERVQMLTAQPAFESLLQKLVFVIQNDPSKDNKLVAMRALQELVEITNDATMANKIAEAGAVTALFRIIDEYGDDEQLTVQAAEVLALLGAFEDLRRFYDNDVRFPAQVLTAALTKQKNNETAVVHLLDVLNKLATSEDRAVLRELGVMEQVADAMRVHSESEAVTRLGGELFAKMGADEQIKSLMLQIIETVESGAEDTAQTVDILCGRLAVFLAAPLEDPRDALQHTEKCLGSLVATLQTYPGSERLEGNVALVCRRLCDRCFDDADDPYGAWAVAASGMLAQFAGMVAGETVLANKKFLGPAYRTFTACCANAYCMPTMVEVAPSFLPQTYTLLEMHKNDAETVARVLEFLRYFAEDPTACGLIVQNMSGSSGDVVALTVLLMQQHQNNDAVVCAGMEFLGALAYTLSQAGYEPLPTLADGSVLRDCDALMGSNSSSARQLAHMHMIEKMLLSKAYNDALIQEQALKKLTMSLKAEDDKKRFSDEERAGLYAAMACVLLAAGGAGLTGEMEKFNGFEVVLQAIEEFGENPTVIKEVNRALQGLSMADVNMTARTVKEAVPKLCTEATTAIQTDAECADTFCDLMLQLVSQEGNGRQLLQVYGLEETLQGVENLAAYYGEDFGTQLSEKVAMIRQAMEDDQPREKTCKDVYDLLNSRVQQGLSVAISEVAILQEEVEFLVSQMGMYNQEQLDHQTAMGADHQYGNMAFELLAATSANVKLLQANEFSKMELALIKGQADPEIVLYAVKALTAFCKFPPAAQDTARIQGCPALVTEACSKINKSGLPNERKEEHLCARYFLVERTAINRNLYNKTPIMTELINSWNDYDKGAYTTTLLRFVFRAMRRVVSDAHVEELLKANVLQRLIGIISDVNADMALLPDVLFLLGSLAVVPEIKTKIGELNGIAACTDLLQRALPKPNTAPVVTNVCLAFANICIGHKKNTEIFSKLGGPALNVKVLNDRGHEYDVCNAASVLLCNLLYKNESMKKLLGTNGAPAALVKGLSNYDGSEEKTAIRCLESVFKAISNLSLYTPNIQPFLDAGIENAYSTWLSNLSETFPDAQLETGCRTLVNLVMENEENNMRKFGVCLLPCMAVAKQGRTDTKALLLLLDIEASLCRLKENAEAFAANGGIETTIRLIHQFDYDVGLLTLGIHLLGIQSAVKDSIQRMMDADVFSILVGCVEVDAEGNEVTDLVVGGLRCTRRIVRSEELAFEYCNAGGIATIANVICKSINQPMVMLEACRVLLGLLFYTTRSQADRQAAVEALHAQCQQRAEQMHAQAQADYEAGVVSEPPPEEMEVPEPDPDELANAAYGGWYQMGMDEVMIDAILQAVCACAAVEAHAKQLRLQRVCLGLAAYFASEQMGTSSLVGSGIEQVLTQIMTNFAGEGTTMQLSCVIINSIAMTSGDMYEEIKTSALLSALKTSVGKMATKKPEEKALKETCAATLEAASSGEDPFDAFSKTVTELDFKFTEWNVDPYP 2498 T 0.0018 Arm unppercent F Eukaryota T 8c6j 48 VA j STEEP_HUMAN STEEP MPKVVSRSVVCSDTRDREEYDDGEKPLHVYYCLCGQMVLVLDCQLEKLPMRPRDRSRVIDAAKHAHKFCNTEDEETMYLRRPEGIERQYRKKCAKCGLPLFYQSQPKNAPVTFIVDGAVVKFGQGFGKTNIYTQKQEPPKKVMMTKRTKDMGKFSSVTVSTIDEEEEEIEAREVADSYAQNAKVIEKQLERKGMSKRRLQELAELEAKKAKMKGTLIDNQFK 222 T 0.1 R_equi_Vir pdbpercent F Eukaryota T 8c8q 9 I I COX9_SCHPO CYTOCHROME C OXIDASE POLYPEPTIDE VIIA MAVGPVTGMFKRRIVTDFSVTMILGTLGACYWWFGYHKPAARQREEFYVKLAAEKNAE 58 T 0.00028 COX6C pdbpercent F Eukaryota T 8c8t 1 A,D,E A,F,G Q2N0S5_9HIV1 ENV POLYPROTEIN LWVTVYYGVPVWKDAETTLFCASDAKAYETEKHNVWATHACVPTDPNPQEIHLENVTEEFNMWKNNMVEQMHTDIISLWDQSLKPCVKLTPLCVTLQCTNVTNNITDDMRGELKNCSFNMTTELRDKKQKVYSLFYRLDVVQINENQGNRSNNSNKEYRLINCNTSAITQACPKVSFEPIPIHYCAPAGFAILKCKDKKFNGTGPCPSVSTVQCTHGIKPVVSTQLLLNGSLAEEEVMIRSENITNNAKNILVQFNTPVQINCTRPNNNTRKSIRIGPGQAFYATGDIIGDIRQAHCNVSKATWNETLGKVVKQLRKHFGNNTIIRFANSSGGDLEVTTHSFNCGGEFFYCNTSGLFNSTWISNTSVQGSNSTGSNDSITLPCRIKQIINMWQRIGQAMYAPPIQGVIRCVSNITGLILTRDGGSTNSTTETFRPGGGDMRDNWRSELYKYKVVKIEPLGVAPTRCKRR 469 T 5.6E-50 GP120 pdb T Viruses T 8cav 2 C,D D,G D1A2F9_THECD CuvA MSALLEPRDAGATNLDALAAIKWEAPAHQAGTCTVCHWGYTILCDDFSTRS 51 T 0.091 DUF5973 pdb F Bacteria T 8cdp 1 A A Q57XL7_TRYB2 Guide_RNA_associated_protein_-_putative MKHHHHHHSAGLEVLFQGPDSMQNQGSVSQGALNMRDQQAAAAENVTPERVWALWNEGNLFSLSLAQLQGFLSRCGVRTDPAAKKAAVVRQVEEYLHSKDTTVKGGGQGAASPQQHQQHGQQGGYGRWNQASVMQPETLLDLSQAGFYEGAANMVPKAFQLLVSDTAPDVVVSRVNTTAFPGFPSNTECYTLGASEKDVAIRSRYSKVLQWCCLNMSNLQMDGELYVDFGKLLLKPSVMRKNRRIVSSYTLQQRLQVNHPYTWVPTLPESCLSKIQEQFLQPEGFAPIGKGVQLTYSGTIKRSKDQLHVDLDNKGKVLAVNSAWVNLQTAWCTHAKGPDVRLLLRSRPPIRRQDVELFASTPIIKLADDDVADVLPPEHGQLVYLSEDETRLFERVSDRGVTITVREVKRQPLIILRDEEEDPRVEYSLSAHIPANAAKATDVRAVGLTAFELAGRLAGLVAEDFVREYGCEAKL 475 T 0.012 HeH pdbpercent F Eukaryota T 8cdp 2 B B Q586X1_TRYB2 Mitochondrial guide RNA binding complex subunit 2 MQSFSAAAPAASGDFSHITRNTVWGLWNEGNLFSLSVPELAFFLQEHCRVANVDPRAKKSALVRQVEEILSAEQQASATVPQEDNPHAIVVTDYDRAEDALEEADEYGDWGAEPGFEDRRELDFMELSPGRMGERYDPLSPRAFQLLHSETATDVGIASIDPSKLPGQSKVKNALAAIHVAPNDANKMRFRMAFEWCLMNIWNMNMPGELNIGAGKALYYRSVAKQNRNVMPLWTVQKHLYAQHPYAWFAIASESNVAAMESLAAALNMSIQQERTTSYKVTIRRMAEFFDCELNGQLKCTMMNKPWDRFFVSHYIRSKMPDLRYVVRARHPIKKRIADAYLEADILRSTRDSVQSVLSPELGDVVYCCERVVRKWAKKTATGVTLQLVETKRTPLIITKAGDEGERLEYEWIVPLPQQAERIDIAALTDELWEYGNKLAAALEEGMEELMVHTMTAVSAY 461 T 0.54 ARMET_C pdbhh F Eukaryota T 8cei 1 A,B,C,D A,B,C,D SUCD_CLOK5 Succinate-semialdehyde dehydrogenase (acetylating) MSNEVSIKELIEKAKVAQKKLEAYSQEQVDVLVKALGKVVYDNAEMFAKEAVEETEMGVYEDKVAKCHLKSGAIWNHIKDKKTVGIIKEEPERALVYVAKPKGVVAATTPITNPVVTPMCNAMAAIKGRNTIIVAPHPKAKKVSAHTVELMNAELKKLGAPENIIQIVEAPSREAAKELMESADVVIATGGAGRVKAAYSSGRPAYGVGPGNSQVIVDKGYDYNKAAQDIITGRKYDNGIICSSEQSVIAPAEDYDKVIAAFVENGAFYVEDEETVEKFRSTLFKDGKINSKIIGKSVQIIADLAGVKVPEGTKVIVLKGKGAGEKDVLCKEKMCPVLVALKYDTFEEAVEIAMANYMYEGAGHTAGIHSDNDENIRYAGTVLPISRLVVNQPATTAGGSFNNGFNPTTTLGCGSWGRNSISENLTYEHLINVSRIGYFNKEAKVPSYEEIWG 453 T 0.001 Aldedh pdb F Bacteria T 8cen 27 AA U Transcription initiation factor IIA large subunit MSNAEASRVYEIIVESVVNEVREDFENAGIDEQTLQDLKNIWQKKLTETKVTTFSWDNQFNEGNINGVQNDLNFNLATPGVNSSEFNIKEENTGNEGLILPNINSNNNIPHSGETNINTNTVEATNNSGATLNTNTSGNTNADVTSQPKIEVKPEIELTINNANITTVENIDDESEKKDDEEKEEDVEKTRKEKEQIEQVKLQAKKEKRSALLDTDEVGSELDDSDDDYLISEGEEDGPDENLMLCLYDKVTRTKARWKCSLKDGVVTINRNDYTFQKAQVEAEWV 286 F F T 8cir 2 C,D C,D C3P EDGGSWKYPDAFELSG 16 T 0.28 DUF817 pdbhh F T 8cis 3 C C C3S1 EDGGSSWEYIWTLPSG 16 T 0.91 BNR pdbhh F T 8cj2 2 E,F,G,H E,F,G,H c3u_5 chimera inhibitor of histone chaperone ASF1 XEKXAXXXRIX 11 T 69 YcbB pdbhh F T 8cj3 2 B B c3u_7 chimera inhibitor of histone chaperone ASF1 XEKXARLXXXAX 12 T 20 SEEK1 pdbhh F T 8cjd 1 A,B A,B A0A861B9Z9_9CYAN AetF GASGSGSGMLEVCIIGFGFSAIPLVRELARTQTEFQIISAESGSVWDRLSESGRLDFSLVSSFQTSFYSFDLVRDYEKDYYPTAKQFYEMHERWRSVYEEKIIRDFVTKIENFKDYSLISTRSGKTYEAKHVVLATGFDRLMNTFLSNFDNHVSNKTFVFDTMGDSANLLIAKLIPNNNKIILRTNGFTALDQEVQVLGKPFTLDQLESPNFRYVSSELYDRLMMSPVYPRTVNPAVSYNQFPLIRRDFSWVDSKSSPPNGLIAIKYWPIDQYYYHFNDDLENYISKGYLLNDIAMWLHTGKVILVPSDTPINFDKKTITYAGIERSFHQYVKGDAEQPRLPTILINGETPFEYLYRDTFMGVIPQRLNNIYFLGYTRPFTGGLANITEMQSLFIHKLITQPQFHQKIHQNLSKRITAYNQHYYGAAKPRKHDHTVPFGFYTEDIARLIGIHYQPNECRSVRDLLFYYAFPNNAFKYRLKGEYAVDGVDELIQKVNDKHDHYAQVFVQALSIRNMNSDEAAEWDHSARRFSFNDMRHKEGYRAFLDTYLKAYRQVENISVDDTVVDEEWNFMVKEACQVRDKVAPNIEEKTHYSKDEDVNKGIRLILSILDSDISSLPDSNGSRGSGNLKEGDRLCKFEAQSIEFIRRLLQPKNYELLFIRES 663 T 0.0036 Pyr_redox_2 pdb F Bacteria T 8cjz 2 H h Spike Base Protein MAIGDIQTSVAFDRQVGRFPPRAEVVTPSNSEEFTSGVSVFSNDGGDISVVPLLPYGSAAIVVTVAAGGFVPFMVRKVNATGTTSTSIVAVW 92 T 0.05 CarboxypepD_reg pdb F T 8cjz 3 I,J,K,L,M,N,O B,C,c,F,D,A,E Capsid Decoration Protein MIMDKENTFSYKQAITGTAVSTNVIDLGVSRDIGKGVPVPIIIQVVEDFADATSLTATLQTSETENFSSATTLATSGAVPVADLTAGKQLAVQYMPLGTQRYLRVNYTVSGTATAGAVTAGVVMSHQQND 130 T 0.0098 SSURE pdbpercent F T 8ck1 1 A A Tail Nozzle MASNNYQPASSYIQPSFAGGELAPSLQGRVDLARYAISLKTCRNFVVQPYGGASNRPGFRFNTACKYKNYATRLIPFSFNTEQTYVIEIGHQYMRFHRDGAPVLDGGEPVEVATSWHRDDIFEIKYVQSADVLTLVHPDYKPRQLKRYSETDWVLDFFDNEFGPLQDQNVDESITIISNGVVDLVELTASEAIFSEAMVGTTIKLQQVSSGEVAAWQNRSAVEQGDLAYVDERTYKATSLSGGVDNTLTGDNTPAHTEGEQWDGPRTTIQGVTETLGVKWAYLHSGFGYVRITEHRDDTHIVGRVIGRLPEEIRTEGTYRWSFAAWDSDRGYPGTASYYQQRLVFANSRAEPQAFWMSETGIFNGFKVSFPIEADDAITFTLASRQVNEIRHLIPLGSLLALTSGAEWMISDNDQGLAPDTVSADVQGYRGASDVTPLLIGSSALYVQARGTVIRDLAYSFELDGYTGDDLTIFSNHLLKDYTIKDWAYAQEPDSVVWLVRSDGALLSMTYQREQQVVAWARHDTVDGEFESVAVIAEGSRDVPYAIVKRQVGGETVRYIEYLDSRRFSHVEDFFCVDSGLTYDGRSSTGALLTIGGGTNWTTDEDLTLTASASSFSPSDVGRRVRVYTGDKFADVDVDAYVSATSVAVSAVRIVPEELRGVQGDRWGFMAKTLTGLDHLEGKTVSILADGNVHAPEVVTGGQVTLDYSAAVVHVGLPIESDIETLPISSSGATVRDSHKAIVGVGIQLEKSRGVFAARSRRDFTSSDLIELKQRDAEDWGEATGLETGLVELGIPTSWDKDGSLFIRQSDPLPLTILSIIPRVVMGGKG 830 T 0.0073 Phage_stabilise pdbhh F T 8ck1 3 E,F E,F Connector Protein MPSKVDICNRALSNTGTDITIASLTEKSKEARLCQQWYDATLASLLRTYQWAFAQRRVTLALIGVGPAGWRHKYRYPTDAITIHDVFTADTYPDGASEFTDGRYRQIFQIASDGEGGRLVLANCEDAMCRYTSDIEDPNLMPPDFSTALEMMLAKNIAMPMTGNPGLMTVLAQQAASLVSDAIARDQNEGYRNPLPYASWTRANIGDSYPDDDHLPHRGGRR 222 F F T 8cka 1 A,B A,B HPI_DEIRA Hexagonally packed intermediate-layer surface protein MKKNIALMALTGILTLASCGQNGTGTTPTADACATANTCSVTVNISGVSSADFDVTMDGKTTSMTLSNGQKLPVAKTGTVTLTPKAKDGYTTPAAQSTTISSTNLTPSVNFAYTTVPSTGNGNGNGGTTPTQPFTLNITSPTNGAAATTGTPIRVVFTSSVALSSATCKIGNSAAVNAQVSSTGGYCDVTPTTAGGGLITVTGTANGQTVSSTVTVDVKAPVVDNRYGTVTPAGDQELTLTNEGIVKDADNGWRRLGQGVSTPSDPNGNVDIYVKGTVNFSVNAAAGSKVEVFLARTTGSDVPTNDDVQAGDVLRSVASTSGTETFSLDSRRLAEFDGVRKWIVVRINGTQVTYQPVIADNKGPQQPDPELNGVQNAYSNILNNYNNSGLTYVRGDVNVFTGNPSLQDREFGQAPLGSSFVQRRPSGFESIRYYLVPETAFGNKALQESDEMLRAKAIKSVATVVSAPVLEPGTVKATSFSRVIGSGATSTVTPKAQDNVTYRVYAISRDQLGNETASATYELVRFDNVGPTITGSVIRDTSDLPFASQEPERCLSDIATITLGGITDNAGGVGLNPGQGLTFTLGGRQIQAGQFDTNQLADGEYTIGFNSLTDALGNPVVSAPTNAKVYIDNTDPTVNFNRAVMQGTFASGERVSVESDASDGGCGVYETRLFWDTDNGVVDDATTTPAIGHPVQFARQRVTDGAKADSLNAGWNALQLPNGAGAVYLRALVVDRAGNATISTTPIVVNAKITNQARPLLGGFDAFKRNASAQFMSNSNAISGVNGTAVTPNTTANSALDNILSLDSVGTLTTNAYLPRGATETAITEKIRNVGAYGRFDATQWNRIRDYQLNTDPTLRSAYVNAGNLANQRGNNWRIRTPWVELGSSDTANTQQKFDFNSDLLNDFYFGRTFGNNDNVNLFSYDQFNGIVSGTAGAYSFYGETVQK 948 T 0.0072 Big_7 pdbpssm F Bacteria T 8cli 3 C C TF3C2_HUMAN TF3C-BETA,TRANSCRIPTION FACTOR IIIC 110 KDA SUBUNIT,TFIIIC 110 KDA SUBUNIT,TFIIIC110,TRANSCRIPTION FACTOR IIIC SUBUNIT BETA MHHHHHHENLYFQGMDTCGVGYVALGEAGPVGNMTVVDSPGQEVLNQLDVKTSSEMTSAEASVEMSLPTPLPGFEDSPDQRRLPPEQESLSRLEQPDLSSEMSKVSKPRASKPGRKRGGRTRKGPKRPQQPNPPSAPLVPGLLDQSNPLSTPMPKKRGRKSKAELLLLKLSKDLDRPESQSPKRPPEDFETPSGERPRRRAAQVALLYLQELAEELSTALPAPVSCPEGPKVSSPTKPKKIRQPAACPGGEEVDGAPRDEDFFLQVEAEDVEESEGPSESSSEPEPVVPRSTPRGSTSGKQKPHCRGMAPNGLPNHIMAPVWKCLHLTKDFREQKHSYWEFAEWIPLAWKWHLLSELEAAPYLPQEEKSPLFSVQREGLPEDGTLYRINRFSSITAHPERWDVSFFTGGPLWALDWCPVPEGAGASQYVALFSSPDMNETHPLSQLHSGPGLLQLWGLGTLQQESCPGNRAHFVYGIACDNGCIWDLKFCPSGAWELPGTPRKAPLLPRLGLLALACSDGKVLLFSLPHPEALLAQQPPDAVKPAIYKVQCVATLQVGSMQATDPSECGQCLSLAWMPTRPHQHLAAGYYNGMVVFWNLPTNSPLQRIRLSDGSLKLYPFQCFLAHDQAVRTLQWCKANSHFLVSAGSDRKIKFWDLRRPYEPINSIKRFLSTELAWLLPYNGVTVAQDNCYASYGLCGIHYIDAGYLGFKAYFTAPRKGTVWSLSGSDWLGTIAAGDISGELIAAILPDMALNPINVKRPVERRFPIYKADLIPYQDSPEGPDHSSASSGVPNPPKARTYTETVNHHYLLFQDTDLGSFHDLLRREPMLRMQEGEGHSQLCLDRLQLEAIHKVRFSPNLDSYGWLVSGGQSGLVRIHFVRGLASPLGHRMQLESRAHFNAMFQPSSPTRRPGFSPTSHRLLPTP 925 T 0.0008 WD40 unppercent F Eukaryota T 8cob 2 B,D,F B,D,F DNA excision repair protein ERCC-6-like 2 SSPGQLTLLQCGFSK 15 T 8.7 TAL_effector pdbhh F T 8coy 2 C,D C,D peptido-mimetic inhibitor XITAXDE 7 T 250 Big_3_4 pdbhh F T 8cpn 1 A A PolB16 intein MKTEFSGDTDAVHGKTHVFIRSIKNGSHMQEAKIDIKSLYDSLAKKYDVQHKNSYEVIYPKGYEIKVLGNKYVKLVAMSRHKTQKHLVKIVVKSEKTIDSLDPIRQKSLLKKQDEVVVTTDHICMVYNDDHFFENVNAKNLKVGNYVSVYDEASDKEVIGEIASIEDLGMTDDYVYDCEVDDDSHAFYASNILVHASQFCNGTKLGG 207 T 0.0063 CathepsinC_exc pdb F T 8cpo 1 A A PolB16 Intein Cys-less MKTEFSGDTDAVHGKTHVFIRSIKNGSHMQEAKIDIKSLYDSLAKKYDVQHKNSYEVIYPKGYEIKVLGNKYVKLVAMSRHKTQKHLVKIVVKSEKTIDSLDPIRQKSLLKKQDEVVVTTDHIAMVYNDDHFFENVNAKNLKVGNYVSVYDEASDKEVIGEIASIEDLGMTDDYVYDAEVDDDSHAFYASNILVHASQFCNGTKLGG 207 T 0.0059 CathepsinC_exc pdb F T 8cqy 2 B B ADA17_HUMAN ADAM 17,SNAKE VENOM-LIKE PROTEASE,TNF-ALPHA CONVERTASE,TNF-ALPHA-CONVERTING ENZYME RQNRVDSKETEC 12 T 10 Spp-24 pdbhh F Eukaryota T 8crx 51 YA V 50S ribosomal protein bL37 MGKTGRKRRARRKKGANHGKRPNA 24 T 21 Protamine_3 pdbhh F T 8ct8 1 A,B A,B UEX_DROME PUTATIVE METAL TRANSPORTER UEX GPLGSVNIISGALELRKKTVADVMTHINDAFMLSLDALLDFETVSEIMNSGYSRIPVYDGDRKNIVTLLYIKDLAFVDTDDNTPLKTLCEFYQNPVHFVFEDYTLDIMFNQFKEGTIGHIAFVHRVNNEGDGDPFYETVGLVTLEDVIEELIQAEIVDELE 161 T 0.0059 CBS pdbpercent F Eukaryota T 8cuk 1 A,B,C A,B,C PEP5_YEAST CARBOXYPEPTIDASE Y-DEFICIENT PROTEIN 5,HISTONE E3 LIGASE PEP5,RING-TYPE E3 UBIQUITIN TRANSFERASE PEP5,VACUOLAR BIOGENESIS PROTEIN END1,VACUOLAR MORPHOGENESIS PROTEIN 1,VACUOLAR PROTEIN SORTING-ASSOCIATED PROTEIN 11,VACUOLAR PROTEIN-TARGETING PROTEIN 11 MKHHHHHHHGAAGTSLYKKAGENLYFQGSMSLSSWRQFQLFENIPIRDPNFGGDSLLYSDPTLCAATIVDPQTLIIAVNSNIIKVVKLNQSQVIHEFQSFPHDFQITFLKVINGEFLVALAESIGKPSLIRVYKLEKLPNREQLYHSQVELKNGNNTYPISVVSISNDLSCIVVGFINGKIILIRGDISRDRGSQQRIIYEDPSKEPITALFLNNDATACFAATTSRILLFNTTGRNRGRPSLVLNSKNGLDLNCGSFNPATNEFICCLSNFIEFFSSSGKKHQFAFDLSLRKRIFCVDKDHILIVTEETGVPTTSISVNELSPTIINRIFIIDAKNKIISLNFVVSSAIIDIFSTSQSGKNITYLLTSEGVMHRITPK 379 T 3.2E-05 WD40_like pdbhh F Eukaryota T 8cv4 2 C,D,E C,D,E Peptide 4.2F WXYWRXYVLKIC 12 T 1.7 Plk4_PB1 pdbhh F T 8cv5 2 B B Peptide 4.2E WYDVFLTRXYGXXKVAC 17 T 2.1 DUF756 pdbhh F T 8cv7 2 C,D C,D Peptide 2.2E WYGYWXVPXRKC 12 T 0.61 DUF3793 pdbhh F T 8cvl 56 DB,ID 1z,2z P-site Peptidyl-tRNA fMTHSMRC-NH-tRNAmet Peptide-part MTHSMRC 7 T 0.00014 Ery_res_leader2 pdbhh F T 8cww 1 A P Meiosis-specific protein HOP1 SNASNNPVTGICSCECGLEVPKAATVLKTCKSCRKTLHGICYGNFLHSSIEKCFTCIFGPSLDTKWSKFQDLMMIRKVFRFLVRKKKGFPASITELIDSFINVEDQNNEVKERVAFALFVFFLDETLCLDNGGKPSQTIRYVTSSVLVDVKGIVIPNTRKQLNVNHEYKWHFTTSSPKAESFYQEVLPNSRKQVESWLQDITNLRKVYSEALS 213 T 0.091 DUF928 pdb F T 8cwx 1 A A A0A7Y7E8Q0_STRMO Lanthipeptide Natural Product mSmoAc FAADAWAAQDMAXGNPLXXXFCCXVQCG 28 T 3.7 Baculo_LEF5_C pdbhh F Bacteria T 8cxp 1 A A A0A649YC68_9PICO Capsid protein VP1 STDNAETGVIEAGNTDTDFSGELAAPGSNHTNVKFLFDRSRLLNVIKVLEKDAVFPRPFPTQEGAQQDDGYFCLLTPRPTVASRPATRFGLYANPSGSGVLANTSLDFNFYSLACFTYFRSDLEVTVVSLEPDLEFAVGWFPSGSEYQASSFVYDQLHVPFHFTGRTPRAFASKGGKVSFVLPWNSVSSVLPVRWGGASKLSSATRGLPAHADWGTIYAFVPRPNEKKSTAVKHVAVYIRYKNARAWCPSMLPFRSYKQKMLM 263 F T Viruses T 8cyk 1 A,B B,A HALC1_878 MSGMKKLYEYTVTTLDEFLEKLKEFILNTSKDKIYKLTITNPKLIKDIGKAIAKAAEIADVDPKEIEEMIKAVEENELTKLVITIEQTDDKYVIKVELENEDGLVHSFEIYFKNKEEMEKFLELLEKLISKLSGS 135 T 0.0082 SUKH_5 pdb F T 8cz9 2 B C JAK2 pY813 phosphopeptide PDXELLTE 8 T 1.6 LIN9_C pdbhh F T 8czf 2 B B DF2 peptide XSYIDKIADLIRKVAEEINSKLEX 24 T 1.8 ZapA pdbhh F T 8czg 2 E,F,G,H E,F,G,H dF3 peptide XSLLEKLAEELRQLADELNKKFEKX 25 T 0.11 Bclx_interact pdb F T 8czh 2 B B DM2 peptide XAPYLEQVARTLRKIGEEINEALRX 25 T 0.023 Bclx_interact pdb F T 8czk 2 C,D C,D Deb-Erk peptide GFLXEY 6 T 1.6 DUF5918 pdbhh F T 8d02 1 A A Q81TN4_BACAN EXOSPORIUM PROTEIN MFSSDCEFTKIDCEAKPASTLPAFGFAFNASAPQFASLFTPLLLPSVSPNPNITVPVINDTVSVGDGIRILRAGIYQISYTLTISLDNSPVAPEAGRFFLSLGTPANIIPGSGTAVRSNVIGTGEVDVSSGVILINLNPGDLIRIVPVELIGTVDIRAAALTVAQIS 167 T 0.0004 BclA_C pdbpssm F Bacteria T 8d03 1 A A HALC2_068 MSGMIKVPEDLERIGRELRARGLDTKRLLEEGPKLYPELSIPDLMAIALYDHLNLDPEFLYRLLQQSRGS 70 T 3.9 Mut7-C pdbhh F T 8d04 1 A,B,C,D,E,F B,A,E,F,C,D HALC2_062 MSGMARVEYSYEKLNDTHYKLKLKVTYEYRKSPEARRLAEDLVQAFVDALSSLPFITVEYEVEEVEVEGS 70 T 10 DUF1307 pdbhh F T 8d05 1 A A HALC2_065 MSGSEEEKPIVIDLNKTIERDGRKVKLVRATITVDPETNTITIDIEYEGGPITKEDLLEAFKLAASKLGS 70 T 0.0011 RNase_PH_C pdb F T 8d06 1 A,B,C,D,E,F,G,H,I,J,K,L A,B,C,D,E,F,H,G,I,L,K,J HALC3_104 MSGKRIDEIESKLKHLEEFTTHLIKLMETMLELLKLVSDGKSDSEEYKELLEKAEEYLKQATEAAKKIGS 70 T 0.012 ISG65-75 pdb F T 8d07 1 A,B,C,D,E,F D,A,B,C,E,F HALC3_109 MSGREEIEEAVKEAELKVLAIVLVALRSVSHYEPLSRLYESFLDALKKALSEEELKEVEKEAERIEKKGS 70 T 0.02 Ku_PK_bind pdb F T 8d08 1 A,B,C,D A,B,C,D HALC4_135 MSGMEKFKEQLLEEVKKIVLETMTKVMEHLEKWFVTLAEIIITKSEEKLEELKETMEKSIEELRKEAEGS 70 T 0.0057 DUF3884 pdb F T 8d09 1 A,B A,B HALC4_136 MSGMSPYKKAIEITKRLLELLLSNPELAKKNLGGIATLISLLALISALDGTLDEKDIEPYIKKLEESLGS 70 T 0.14 SRP54_N pdb F T 8d0y 5 E G BG505SOSIPv8 gp120 ENLWVTVYYGVPVWKDAETTLFCASDAKAYETKKHNVWATHCCVPTDPNPQEIHLENVTEEFNMWKNNMVEQMHEDIISLWDQSLKPCVKLTPLCVTLQCTNVTNNITDDMRGELKNCSFNMTTELRDKKQKVYSLFYRLDVVQINSNKEYRLINCNTSAITQACPKVSFEPIPIHYCAPAGFAILKCKDKKFNGTGPCPNVSTVQCTHGIKPVVSTQLLLNGSLAEEEVIIRSENITNNAKNILVQLNTPVQINCTRPNNNTVKSIRIGPGQWFYYTGDIIGDIRQAHCNVSKATWNETLGKVVKQLRKHFGNNTIIRFANSSGGDLEVTTHSFNCGGEFFYCNTSGLFNSTWISGSNDSITLPCRIKQIINMWQRIGQAMYAPPIQGVIRCVSNITGLILTRDGGSTNSTTETFRPGGGDMRDNWRSELYKYKVVKIEPLGVAPTRCKRRVVG 455 T 3.3E-54 GP120 pdbpercent F T 8d35 2 C C TRM1_HUMAN TRNA 2,2-DIMETHYLGUANOSINE-26 METHYLTRANSFERASE,TRNA(GUANINE-26,N(2)-N(2)) METHYLTRANSFERASE,TRNA(M(2,2)G26)DIMETHYLTRANSFERASE EPRLQANFTIR 11 T 18 Chisel pdbhh F Eukaryota T 8d3r 2 B,C B,C DPOG2_HUMAN DNA POLYMERASE GAMMA ACCESSORY 55 KDA SUBUNIT,P55,MITOCHONDRIAL DNA POLYMERASE ACCESSORY SUBUNIT,MTPOLB,POLG-BETA MRSRVAVRACHKVCRCLLSGFGGRVDAGQPELLTERSSPKGGHVKSHAELEGNGEHPEAPGSGEGSEALLEICQRRHFLSGSKQQLSRDSLLSGCHPGFGPLGVELRKNLAAEWWTSVVVFREQVFPVDALHHKPGPLLPGDSAFRLVSAETLREILQDKELSKEQLVAFLENVLKTSGKLRENLLHGALEHYVNCLDLVNKRLPYGLAQIGVCFHPVFDTKQIRNGVKSIGEKTEASLVWFTPPRTSNQWLDFWLRHRLQWWRKFAMSPSNFSSSDCQDEEGRKGNKLYYNFPWGKELIETLWNLGDHELLHMYPGNVSKLHGRDGRKNVVPCVLSVNGDLDRGMLAYLYDSFQLTENSFTRKKNLHRKVLKLHPCLAPIKVALDVGRGPTLELRQVCQGLFNELLENGISVWPGYLETMQSSLEQLYSKYDEMSILFTVLVTETTLENGLIHLRSRDTTMKEMMHISKLKDFLIKYISSAKNV 485 F F Eukaryota T 8d5e 3 C P TF2B_MOUSE GENERAL TRANSCRIPTION FACTOR TFIIB,RNA POLYMERASE II ALPHA INITIATION FACTOR TGAASFDEF 9 T 4.8 DUF2852 pdbhh F Eukaryota T 8d5f 3 C P TF2B_MOUSE GENERAL TRANSCRIPTION FACTOR TFIIB,RNA POLYMERASE II ALPHA INITIATION FACTOR TGAARFDEF 9 T 13 DUF4295 pdbhh F Eukaryota T 8d5j 3 C C PRP19_MOUSE NUCLEAR MATRIX PROTEIN 200,PRP19/PSO4 HOMOLOG,RING-TYPE E3 UBIQUITIN TRANSFERASE PRP19,SENESCENCE EVASION FACTOR KYLQVASHV 9 T 2.1 DUF2894 unppercent F Eukaryota T 8d5k 3 C C PRP19_MOUSE NUCLEAR MATRIX PROTEIN 200,PRP19/PSO4 HOMOLOG,RING-TYPE E3 UBIQUITIN TRANSFERASE PRP19,SENESCENCE EVASION FACTOR KYRQVASHV 9 T 2.1 DUF2894 unppercent F Eukaryota T 8d5n 2 B,D E,B Dense granule protein 6, HF10 peptide HPGSVNEFDFGCGGSG 16 T 1 Polysacc_synt_4 pdbhh F T 8d5q 4 D E Dense granule protein 6, HF10 peptide HPGSVNEFDF 10 T 1.4 CITED pdbhh F T 8d7m 2 C,D C,D Period circadian protein homolog 2 peptide GKAESVASLT 10 T 80 Tmemb_14 pdbhh F T 8d7n 2 C,D C,D Period circadian protein homolog 2 peptide GKAESVASLTSQ 12 T 13 BRX_N pdbhh F T 8d7o 2 C,D C,D Period circadian protein homolog 2 peptide GKAESVASLTSQCSYA 16 T 18 Rota_NSP4 pdbhh F T 8d7p 2 C,D D,C Period circadian protein peptide VAERDSVMLGEIAPHHDY 18 T 12 DUF2718 pdbhh F T 8d85 2 B D IL27A_HUMAN IL-27 SUBUNIT ALPHA,IL-27-A,IL27-A,INTERLEUKIN-30,P28 FPRPPGRPQLSLQELRREFTVSLHLARKLLSEVRGQAHRFAESHLPGVNLYLLPLGEQLPDVSLTFQAWRRLSDPERLCFISTTLQPFHALLGGLGTQGRWTNMERMQLWAMRLDLRDLQRHLRFQVLAAGFNLPEEEEEEEEEEEEERKGLLPGALGSALQGPAQVSWPQLLSTYRLLHSLELVLSRAVRELLLLSKAGHSVWPLGFPTLSPQPEQKLISEEDLGGEQKLISEEDLHHHHHH 243 T 5.7 Myc-LZ pdb F Eukaryota T 8d8i 2 B B NCOR1_HUMAN N-COR,N-COR1 THRLITLADHIAQIITQDFA 20 T 25 Es2 pdbhh F Eukaryota T 8d8j 1 A 0 RT22_YEAST Probable S-adenosyl-L-methionine-dependent RNA methyltransferase RSM22, mitochondrial MMKRCFSILPQNVRFSSKFTSLNLPKLDLADFIDSNKRGINVLPSYRDETASTTQATNSKELRLLSKTLQGQSYRDQLELNPDVSKAINNNIMAVHIPNNLRRVATNYYKEIQEPNSLHRPCRTKMEVDAHIASIFLQNYGSIFQSLKELQKRVGPDNFKPQRILDVGYGPATGIVALNDILGPNYRPDLKDAVILGNAEMQERAKIILSRQLNEVVDTVEENVSTEKEQETDRRNKNFQEDEHIGEVMTKKINIMTNLRSSIPASKEYDLIILTHQLLHDGNQFPIQVDENIEHYLNILAPGGHIVIIERGNPMGFEIIARARQITLRPENFPDEFGKIPRPWSRGVTVRGKKDAELGNISSNYFLKVIAPCPHQRKCPLQVGNPNFYTHKEGKDLKFCNFQKSIKRPKFSIELKKGKLLATSWDGSQGNASRLKGTGRRNGRDYEILNYSYLIFERSHKDENTLKEIKKLRNENVNGKYDIGSLGDDTQNSWPRIINDPVKRKGHVMMDLCAPSGELEKWTVSRSFSKQIYHDARKSKKGDLWASAAKTQIKGLGDLNVKKFHKLEKERIKQLKKEERQKARKAMESYNELEDSLQFDDHQFSNFEVMKKLSTFHGNDFLQHVNRK 628 T 2.3E-26 Rsm22 pdbpercent F Eukaryota T 8d8j 8 H V RTPT_YEAST MITOCHONDRIAL SMALL RIBOSOMAL SUBUNIT PROTEIN MS26 MGKGAAKYGFKSGVFPTTRSILKSPTTKQTDIINKVKSPKPKGVLGIGYAKGVKHPKGSHRLSPKVNFIDVDNLIAKTVAEPQSIKSSNGSAQKVRLQKAELRRKFLIEAFRKEEARLLHKHEYLQKRTKELEKAKELELEKLNKEKSSDLTIMTLDKMMSQPLLRNRSPEESELLKLKRNYNRSLLNFQAHKKKLNELLNLYHVANEFIVTESQLLKKIDKVFNDETEEFTDAYDVTSNFTQFGNRKLLLSGNTTLQTQINNAIMGSLSNEKFFDISLVDSYLNKDLKNISNKIDSKLNPTSNGAGNNGNNNNTTNL 318 T 0.003 MRP-S26 unppssm F Eukaryota T 8dc2 1 A A CasLambda MASHKKTESNQIIKTFSFKIKNANGLSLDVLNDAITEYQNYYNICSDWIKDHLTMKISELYKYIPNEKKNSGYALTLISDEWKDKPMYMMFKKGYPANNRDNAIYETLNTCNTEHYTGNILNFSDTYYRRFGYVASAISNYVTKISKMSTGSRSKNISNDSDVDTIMEQVIYEMEHNGWTSVKDWENQMEYLESKTDSNPNFVYRMTTLYEFYKSHIDEVNSKMETMSIDSLIKFGGCRRKDSKKSMYIMGGSNTPFDITQIGGNSLNIKFSKNLNVDVFGRYDVIKDNTLLVDIINGHGASFVLKIINDEIYIDINVSVPFDKKIATTNKVVGIDVNIKHMLLATNILDDGNVKGYVNIYKEVINDSDFKKVCNSTVMQYFTDFSKFVTFCPLEFDFLFSRVCNQKGIYNDNSAMEKSFSDVLNKLKWNFIETGDNTKRIYIENVMKLRSQMKAYAIVKNAYYKQQSEYDFGKSEEFIQEHPFSNTDKGIEILNKLDNISKKILGCRNNIIQYSYNLFEINGYDMVSLEKLTSSQFKKKPFPTVNSLLKYHKILGCTQEEMEKKDIYSVIKKGYYDIIFDNDVVTDAKLSAKGELSKFKDDFFNLMIKSIHFADIKDYFITLSNNGTAGVSLVPSYFTSQMDSIDHKIYFVQDNKSGKLKLANKHKVRSSQEKHINGLNADYNAARNIAYIMENTDCRNMFMKQSRTDKSLYNKPSYETFIKTQGSAVAKLKKEGFVKILDEASVGSSGHHHHHH 756 T 12 OrfB_IS605 pdbhh F T 8dcn 3 C,F C,F A8DS70_CLODI ADP-RIBOSYLTRANSFERASE BINDING COMPONENT,CDTB MKIPTDQEIMDAHKIYFADLNFNPSTGNTYINGMYFAPTQTNKEALDYIQKYRVEATLQYSGFKDIGTKDKEMRNYLGDPNQPKTNYVNLRSYFTGGENIMTYKKLRIYAITPDDRELLVLSVDHHHHHH 130 T 2.1E-05 Fve unphh F Bacteria T 8ddc 1 A A IL2RG_MOUSE INTERLEUKIN-2 RECEPTOR SUBUNIT GAMMA,IL-2 RECEPTOR SUBUNIT GAMMA,IL-2R SUBUNIT GAMMA,IL-2RG,GAMMAC,P64 EENPSLFALEAVLIPVGTVGLIITLIFVYFWLER 34 T 0.29 TMEM154 pdbhh F Eukaryota T 8ddc 2 B B IL7RA_MOUSE IL-7 RECEPTOR SUBUNIT ALPHA,IL-7R SUBUNIT ALPHA,IL-7R-ALPHA,IL-7RA GGWDPVLPSVTILSLFSVFLLVILAHVLWKK 31 T 0.0029 IFNGR1 unphh F Eukaryota T 8ddd 2 B B IL9R_MOUSE IL-9 RECEPTOR,IL-9R QWSASILVVVPIFLLLTGFVHLLFKLSPRLK 31 T 0.021 DUF2207 unppercent F Eukaryota T 8des 2 B E A0A6G9L8Z7_9VIRU Putative DNA binding protein MARSRRRMSKRSSRRSFRKYAKTHKRNFKARSMRGGIRL 39 T 31 ELFV_dehydrog_N pdbhh T Viruses T 8deu 2 D D CASP peptide CTNEDGKPC 9 T 0.49 DUF6440 pdbhh F T 8dfo 6 M M AcrIC4 MDNKITPADEEKIREWLNCEEASVDNDGDVWVAVPMTGHWLSDEQKAKYIEWRGDET 57 T 0.15 LEA_3 pdbpercent F T 8dfs 6 M M ACR30_BPD31 GENE PRODUCT 30,GP30, ACRIF2 MIAQQHKDTVAACEAAEAIAIAKDQVWDGEGYTKYTFDDNSVLIQSGTTQYAMDADDADSIKGYADWLDDEARSAEASEIERLLESVEEE 90 T 0.13 Transglycosylas pdb T Viruses T 8dft 1 A,B,C,D,E,F,G,H,I,J,K,L,M,N,O,P,Q,R,S,T A,B,C,D,E,F,G,H,I,J,K,L,M,N,O,P,Q,R,S,T A3MU74_PYRCJ Pilin protein TSVEFWQNIASGVGKWLRAIFAIAFWSSLILLTFYAIMTQVAPSKVFRLGALVDLIESVKTVLLGIFVFTASVTGIIAGVAAIANAFGASFAVSPIDVVNALIFQPIVDMVK 112 T 0.0039 TrbC unppssm F Archaea T 8dfu 1 A,AA,B,BA,C,CA,D,DA,E,F,G,H,I,J,K,L,M,N,O,P,Q,R,S,T,U,V,W,X,Y,Z B,0,A,1,C,2,D,3,E,F,G,H,I,J,K,L,M,N,O,P,Q,R,S,T,U,V,W,X,Y,Z A0A401HBH5_AERPX Pilin protein AADIVQMVEDLTGKLTALAWALFLLSWSIGWTLRGSPIPSSRIKRVGNSLIEDSMWAALWLALGTTVFAVIVRLAGIVNEVLLG 84 T 0.036 DUF6010 unppercent F Archaea T 8dgh 2 B B CNGB1_HUMAN CYCLIC NUCLEOTIDE-GATED CATION CHANNEL 4,CNG CHANNEL 4,CNG-4,CNG4,CYCLIC NUCLEOTIDE-GATED CATION CHANNEL GAMMA,CYCLIC NUCLEOTIDE-GATED CATION CHANNEL MODULATORY SUBUNIT,CYCLIC NUCLEOTIDE-GATED CHANNEL BETA-1,CNG CHANNEL BETA-1,GLUTAMIC ACID-RICH PROTEIN,GARP KLAHLRARLKEL 12 T 0.68 DUF5320 pdbhh F Eukaryota T 8dgk 2 B B CNGB1_HUMAN CYCLIC NUCLEOTIDE-GATED CATION CHANNEL 4,CNG CHANNEL 4,CNG-4,CNG4,CYCLIC NUCLEOTIDE-GATED CATION CHANNEL GAMMA,CYCLIC NUCLEOTIDE-GATED CATION CHANNEL MODULATORY SUBUNIT,CYCLIC NUCLEOTIDE-GATED CHANNEL BETA-1,CNG CHANNEL BETA-1,GLUTAMIC ACID-RICH PROTEIN,GARP NDRLQELVKLFK 12 T 1.8 UBA_6 pdbhh F Eukaryota T 8dgm 2 B B PEAK1_HUMAN PSEUDOPODIUM-ENRICHED ATYPICAL KINASE 1,SUGEN KINASE 269,TYROSINE-PROTEIN KINASE SGK269 PPPLPKKMIIRANTEPISKD 20 T 13 DUF4628 pdbhh F Eukaryota T 8dgn 2 B B Phosphorylated PEAK2 (pS826) peptide PPPLPQKKIVSRAASSPDGF 20 T 100 Fmp27_WPPW pdbhh F T 8dgo 2 C C Phosphorylated PEAK3 (pY24) peptide XXSNLGQ 7 T 0.43 LIME1 pdbhh F T 8dgp 2 E,F,G,H E,F,G,H Phosphorylated PEAK3 (pS69) peptide PLPPPLPKKILTRTQSLPTRR 21 T 3.7 NINJA_B pdbhh F T 8di2 1 A A Site 2 binding peptide IM459N21 XSLEQEWXKIECEVYGKCPPKX 22 T 2.5 DUF5385 pdbhh F T 8dj6 2 E,F,G,H E,F,G,H Imub-peptide XQLPLWGX 8 T 6 Plug_translocon pdbhh F T 8dk9 2 C,D,G,H E,F,G,H DPO41_MYCTU POL IV 1 QESLFA 6 T 20 DUF6248 pdbhh F Bacteria T 8dkn 2 B C NCOR1_HUMAN N-COR,N-COR1 NLGLEDIIRKALM 13 T 6.9 DUF1244 pdbhh F Eukaryota T 8dkv 2 B C NCOR1_HUMAN N-COR,N-COR1 SNLGLEDIIRKALM 14 T 9.2 DUF1244 pdbhh F Eukaryota T 8dnq 2 C,D C,D Cyclic peptide 2.2B XWYSXKYAXWWTVYPCX 17 T 2.2 Trp_leader1 pdbhh F T 8do8 2 B,D B,D ATG13_HUMAN Autophagy-related protein 13 METDLNSQDRKDLDKFIKFFALKTVQVIVQARLGEKICTRSSSSPTGSDWFNLAIKDIPEVTHEAKKALAGQLPAVGRSMCVEISLKTSEGDSMELEIWCLEMNEKCDKEIKVSYTVYNRLSLLLKSLLAITRVTPAYRLSRKQGHEYVILYRIYFGEVQLSGLGEGFQTVRVGTVGTPVGTITLSCAYRINLAFMS 197 T 7.3E-08 ATG13 pdb F Eukaryota T 8doa 1 A A HEEH mini-protein TK_rd5_0958 MGSSHHHHHHSSGLVPRGSHMDIEEIEKKARKILEKGDSIEIAGFEVRDEEDLKKILEWLRRHG 64 T 0.027 TFIIE_beta pdbpssm F T 8dpk 1 A,B A,D RESC5 MGSSHHHHHHSSGLVPRGSHMNNALHAFVRSPHYRTIPSAGPNGIVVNRDMLVHQFRDFYKTLQHCSLVDKVHLMSERPSVEALRVADQMVSIGATFLEMPLTGMEHRATEFMESMRYVRGAGGPSTLASYLQDTENCRCNSGDVVCLPNGIAVGHGPRTNAVAHTTLKQLFEVKDDQFSFDVFTLEQEGDAPPLGDYFGFAGSNVLLTWKDEHGLLAVDQYQQKQPHTEMNVVYLEPGCHFLSFYGVDHTIDVLVQKGYERSMDSIAAAGLNPIPVQWSEMDKLGISMRAAVLPLKFFK 300 T 0.27 ADI pdb F T 8dpy 1 A,B A,B beta sheet-forming peptide with flexible linker TYRVXTWETX 10 T 46 DUF3768 pdbhh F T 8dsx 1 A,B A,B EA22_LAMBD Protein ea22 HHHHHHMSNEVREDGNQFLVVRHPGKTPVIKHCTGDLEEFLRQLIEQDPLVTIDIITHRYYGVGGQWVQDAGEYLHMMSDAGIRIKGEIETAV 93 T 0.11 SNAD4 pdb T Viruses T 8dt0 1 A,B A,B Scaffolding protein functional sites EEEELEELAKELEKILRDEEGHLRKLKEALAEGLGDAEEAAELFRAESIDEMKHAEELAKLLKKGGLDPELRELLEELAELELVAINQYREAAEAAAEAAENGSEEARAAAREALEEALALELDGAKLARAALEAVEKLL 140 T 0.037 Ferritin pdb F T 8dtl 1 A,D C,D Insulin mimetic peptide S597 SLEEEWAQIECEVYGRGCPSESFYDWFERQL 31 T 0.78 BioT2 pdbhh F T 8dtm 2 C C Insulin mimetic peptide S597 component 2 SLEEEWAQIECEVYGRGCPS 20 T 1.9 DUF5385 pdbhh F T 8dto 1 A,E,I E,A,I CH848.3.D0949.10.17chim.6R.DS.SOSIP.664_N133D_N138T gp120 MGSLQPLATLYLLGMLVASVLAAENLWVTVYYGVPVWKEAKTTLFCASDARAYEKEVHNVWATHACVPTDPSPQELVLGNVTENFNMWKNDMVDQMHEDIISLWDQSLKPCVKLTPLCVTLICSDATVKTGTVEEMKNCSFNTTTEIRDKEKKEYALFYKPDIVPLSETNNTSEYRLINCNTSACTQACPKVTFEPIPIHYCAPAGYAILKCNDETFNGTGPCSNVSTVQCTHGIRPVVSTQLLLNGSLAEKEIVIRSENLTNNAKIIIVHLHTPVEIVCTRPNNNTRKSVRIGPGQTFYATGDIIGDIKQAHCNISEEKWNDTLQKVGIELQKHFPNKTIKYNQSAGGDMEITTHSFNCGGEFFYCNTSNLFNGTYNGTYISTNSSANSTSTITLQCRIKQIINMWQGVGRCMYAPPIAGNITCRSNITGLLLTRDGGTNSNETETFRPAGGDMRDNWRSELYKYKVVKIEPLGVAPTRCKRRVVG 487 T 5E-50 GP120 pdb F T 8duz 3 E,F E,F Mimetic peptide GPIPVLDENGLFAPGPC 17 T 0.58 FAD-oxidase_C pdbhh F T 8dwc 1 A A PENK_BOVIN Proenkephalin-A VGRPEWWMDYQKRYG 15 T 7.2 DUF1694 pdbhh F Eukaryota T 8dyn 1 A A Cloacaenodin GHSVDRIPEYFGPPGLPGPVLFYS 24 T 6.7 DREPP pdbhh F T 8dz8 1 A,B,C,D,E,F,G,H A,B,C,D,E,F,G,H neoleukin-4 GPKKKIQIMAEEALKDALSILNIVKTNSPPAEEQLERFAKRFERNLWGIARLFESGDQKDEAEKAKRMIEWMKRIKTTASEDEQEEMANAIITILQSWFFS 101 T 0.044 DUF2264 pdb F T 8e0l 1 A,B,C A,B,C BGL06 EGSDDLLLKLLELLVEQARVSAEFARRQGDEKMLEEVARKAEEVARKAEEIARKARKEGNLELALKALEILVRAAHVLAEIARERGNEELLKKAWKLAKEALRQVKEIAEQAQKEGNLELAIIALHISVRIAEVLLETRPDDREEIRKQQEEFEELIKRLEKQVG 165 T 0.058 PLU-1 pdb F T 8e0m 1 A,B,C,D,E,F,G,H,I,J,K,L A,B,C,D,E,F,G,H,I,J,K,L BGL15 SLKEKIEKLVEELIRHTEELRELLEKLVKHGGASEEYLLELLENLVRLAHVIAEVAREQGNEELLEEAARLAEEAARQAEELAREARREGNLELALKALQILVNAAYVLAEIARDRGNEELLEKAERLAREALRQVREISKRLQKEGNIELALKANRLLIDALRVLVRIMRHR 173 T 0.02 Ferritin pdbpssm F T 8e0n 1 A,B,C,D,E,F A,B,C,D,E,F BGL18 EGSPRLVLRALENMVRAAHTLAEIARDNGNEEWLERAARLAEEVARRAEELAREARKEGNLELALKALQILVNAAYVLAEIARDRGNEELLEKAARLAEEAARQAEEIARQARKEGNFELALEALEILNEAARVLARIAHHRGNQELLEKAWRLTHRSAKWSREIAEQARKEGE 174 T 0.039 PLU-1 pdbpercent F T 8e0o 1 A A hetBGL03-15-18a GSLELELQNLELLVHIAEVLARLARRTGNEEALEHAARVAEEVAKQAEEIAREARYRGDLRLALEALRIMVEAARVLAEIARERGNEELLQKAEELAREALRQVREISKRLQEEGNIELALKANRLLIDALEVLVRIMRHR 141 T 0.0019 HisKA_3 pdb F T 8e0o 2 B B hetBGL03-15-18b SSLEEKIEELVKELIKHTEELRRLLEKLVKEGGASEEYLLELLENLVRLARVIAEVAREQGNEELLEEAARLAEEAARQAEELAREARYEGDLELALKALQILVNAARVLAEIARDRGNEELLQKAAELAKEAARQAEEIAKEARERGNFELALEALEILNEAARVLARIAHHRGNQELLEEAWRLTHRSAKWSREIAEQARKEGE 206 T 0.027 DUF3584 pdb F T 8e0o 3 C C hetBGL03-15-18c SPRLVLRALENMVRAAHTLAEIARDNGNEEWLERAARLAEEVARRAEELAREAREKGDLELALKALQILVNAAYVLAEIARDRGNEELLKKAHELARKAAEEAQKIAEQARYEGNLELFNKALRILLEAIRVLIEHDDSEEAARELIRRLEELLEQSRRSMKG 163 T 0.014 Adaptin_N pdb F T 8e12 1 A,B,C A,B,C BGL14 SAEEELKKLLEENIKLIEELLEEVKHNDPELLLSVLEVLVRSVHVIAEVAREQGNEELLERAARLAEEAAYQAEEVAREARKRGNLELALKALQILVNAAYVLAEIARDRGNEELLQKAHELAREALRQVKEILEQARKEGNLELVIIALRLHTEIMRVLVEIWRHR 167 T 0.0031 Abdominal-A pdb F T 8e13 3 C C PHE-ALA-LYS-LYS-LYS-TYR-CYS-LEU FAKKKYCL 8 T 1 MIER1_beta_C pdbhh F T 8e1d 1 A B MITF_HUMAN CLASS E BASIC HELIX-LOOP-HELIX PROTEIN 32,BHLHE32 GSRASCMQMDDVIDDIISLESSYNEEILGLMDPA 34 T 0.61 MITF_TFEB_C_3_N pdbhh F Eukaryota T 8e1p 5 I,J,L E,G,M BG505-SOSIP.v4.1-GT1.2gp120 AENLWVTVYYGVPVWKDAETTLFCASDAKAYETKKHNVWATHACVPTDPNPQEIHLENVTEEFNMWKNNMVEQMHTDIISLWDQSLKPCVKLTPLCVTLQCTNVTNNITDDMRGELKNCSFNMTTELRDKRQKVHALFYKLDIVPINENQNTSYRLINCNTAAITQACPKVSFEPIPIHYCAPAGFAILKCKDKKFNGTGPCPSVSTVQCTHGIKPVVSTQLLLNGSLAEEEVMIRSEDIRDNAKNILVQFNTPVQINCTRPNNNTRKSIRIGPGQWFYATGDIIGDIRQAHCNVSKATWNETLGKVVKQLRKHFGNNTIIRFANSSGGDLEVTTHSFNCGGEFFYCDTSGLFNSTWISNTSVQGSNSTGSNDSITLPCRIKQIINMWQRIGQAMYAPPIQGVIRCVSNITGLILTRDGGSTDSTTETFRPSGGDMRDNWRSELYKYKVVKIEPLGVAPTRCKRRVVGRRRRRR 474 T 5.1E-50 GP120 pdb F T 8e1u 1 A,B,C,D,E,F,G,H A,B,C,D,E,F,G,H A0A160VN62_PROFR PPi-dependent PEPCK MSVVERRQINAAINLRLSLLGLPHPDSNAESPDAILVEPLLARQRELSRRLKDRLSAPDLRIQRFLDDYLADCDEHPQLPRTTLVLDEPGLARGLSLPVDGDEFHSDIVASYRLVNGVLHNPKHDRRTTAGVFHISTGGLPIPQDKVEVDKNVYARILARAFQAPDEELALPYTANLPEQAHCWASLLMRPTVLPAVPGRTTEKSYEVHFIVPGGLMCNLDFVEGIFGNAGDPYLPENDASLDPDSWTGHTGCVILAPHLTTMTKKSLGMPHYDDATERQRRDGQCWRHEDDLYNDGKAFKVCARDERGVIVTVIADNYFGYCKKEVKTQISYSANLLGGAEEEHSGGAEVYPAWNLNQDFTDRTPDDFTLADVISTNRELLDVRPEGYAVYKPEPNIVFIPEHSHYSMRTQTISWTAHGAEQTIKLLAGKHYLSPDGYRIHAKHREMDATQWHLIGTSSRAVTCHKPATVSGGGKSEISKSISDAFVFGNAFSHDIDSAMDQVQALFDTDFTNRFADASRNGTDHRPVLSIDRSLGSVIKLLTPSIQYNDEYNAFLEGIEPDVKELAFTVKRYYLPEWGEDWRSHFTVGIMNGRHGNMVRLDGKKIITNMLRVGFREDGSWRLFTLRPDYSPAVKVQTEDDITASTVTPPWEDAEGLPRKYVTNCEHLLFQRPDDAIHRGYDKQAEFDLASGTDTFISNFEPLTHEQARDLLTDVQAYSEFTKPVRKLIERVAAMPDDQSPEFWVCSDDPRHLPDGGRSKNPRYLQVRPTDSNPELTTVADVAGKLARKLPLAGHAPQPIDVVAAGRRNNPPEDKVPALCAYNPLHYMELPELFMEYISSMTGKSPSTTGAGSEGALTKGPFNALPAVYDLNAAVLSYALTDYDGWLSSAGYIGPNARVDHDISMLIPELFSHMGPNDRNTKRLISEGYLEKMQDFDFDGHRVLASRLGYRINDRFVTHYFGRIFLHPDVVFSEEMLRPELQDEKIFADSIDVIVKTHQRVAQMYFDDGTVSLACPPIRALLEIMAHGASAEGWTLDSPEFRKLFERESVLASDWYAARLDAKQAEDVKQTEEGVERLKEYIERPDSGSVSARLHLADRLRELEAQLTYERSPEYRRSLVGTLGRQPRFV 1131 T 2.7 DUF5788 pdbpercent F Bacteria T 8e55 1 A,B,C,D A,B,C,D SG135 DREIKEEARKLIREAIELLQKGDPRAKEILRQAILILLAIRLLEEMEENIEKAEKLGNEELSELAKRAIKLVREALELLKEGDPRAEEILKLALKIIKAILLLLEMYENIKQAEELGDEDLSELAKIAIRLVRQALKLLQEGDPRAEEILEIALRIIKLILQLLFLKQRIEEAKKKGDQQFVFEAEEKIRRIVEELFKLLEG 202 T 0.019 IF-2B pdbpercent F T 8e5f 1 A A A3MW92_PYRCJ c-type cytochrome MKKFPALITTLLLLAVFVAATYGPPYSYNHPTNCISCHSNSTGTANSQALSGLTSGPAAGACDPSQQECVWSHQVLKGTDVWKKCINCHVAIWNSINSGPGNVHSGLLNSYGCACHAVAHVGYGNPTDGYTACIYFYVPRLSTATPGYFGAKPTLDFRNVYICFKGTPEGTYTFSGNAPTSLMQLLESKGEVTVKALLVGYDKYANGTVKAKSSAADFLETDFFSALEQAGIFRYEWGTASGAVLKNPSVRTHPLTEEAPNGETIVMGVFDIHTGDFILVAPYAPYSRAPYYLPVAVNPGVAACFNCHFVYQGQLGTAKVMEVGGVWKIGIPADVLNSLTDPHKIVMPAAQAAGGGVAPNLSLVALLATATLLGGAFLALRRRAQ 385 T 0.0015 Cytochrom_NNT pdbhh F Archaea T 8e73 34 QA B4 A0A1S3ULL3_VIGRR NDUB4 MGGGMEANKNKFIEDWGTARENLEFNFRWTRRNLALVGIFGIAIPVLVYKGIVREFHMQDEDNGRPYRKFM 71 T 0.0032 NDUF_B4 unppercent F Eukaryota T 8e73 36 SA B8 A0A1S3UJ95_VIGRR NDUB8 MAGRLTNAASRILGGNGVVYRSVASSLRLRSGMGLPVGKHYIPDKPLPMNEELLWDNGTPFPEPCIDRIADTVGKYEALAWLCGGLSFFASLGLLAVWNDKASKIPFTPKVYPYDNLRVELGGEP 125 T 0.00063 NDUF_B8 pdbhh F Eukaryota T 8e73 41 XA C2 NDUC2 MVLSATTIGALLGLGTQMYSNALRKLPYMRHPWEHVVGMGLGAVFVNQLLKWEAQVEQDLDKMLEKAKAANERRYIDGDDDI 82 T 0.036 NDUF_C2 pdbpercent F T 8e73 55 LB P4 A0A1S3UND4_VIGRR NDUP4 MVRVASYFAMTLGAFVFWQSMDKVHVWIALHQDEKQERLEKEAEIRRVREELLKQQANQKG 61 T 0.63 DUF3935 pdbhh F Eukaryota T 8e73 56 MB C1 A0A1S3TTD7_VIGRR NDUB6 MGGGGGDHGHGNGDFRTKVWSMTGGPYCRPKHWKRNTAIAMFGVVLVCIPIAMKSAELEQRPHHPVRPIPSQLWCKNFGTKDYEQSE 87 T 1.9 Chordopox_A13L pdbhh F Eukaryota T 8e8i 3 C C PHE-VAL-LYS-LYS-LYS-TYR-CYS-LEU FVKKKYCL 8 T 4.1 MIER1_beta_C pdbhh F T 8e90 1 A,B A,B MEN1_HUMAN Isoform 2 of Menin MGLKTAQKTLFPLRSIDDVVRLFAAELGREEPDLVLLSLVLGFVEHFLAVNRVIPTNVPELTFQPSPAPDPPGGLTYFPVADLSIIAALYARFTAQIRGAVDLSLYPREGGVSSRELVKKVSDVIWNSLSRSYFKDRAHIQSLFSFITGTKLDSSGVAFAVVGACQALGLRDVHLALSEDHAWVVFGPNGEQTAEVTWHGKGNEDRRGQTVNAGVAERSWLYLKGSYMRCDRKMEVAFMVCAINPSIDLHTDSLELLQLQQKLLWLLYDLGHLERYPMALGNLADLEELEPTPGRPDPLTLYHKGIASAKTYYRDEHIYPYIYLAGYHCRNRNVREALQAWADTATVIQDYNYCREDEEIYKEFFEVANDVIPNLLKEAASLLEAGEERPGEQSQGTQSQGSALQDPECFAHLLRFYDGICKWEEGSPTPVLHVGWATFLVQSLGRFEGQVRQKVRITFQSEKMKGMKELLVATKINSSAIKLQLTAQS 489 T 4.5E-12 Menin pdb F Eukaryota T 8eas 1 A A VMA22_YEAST Vacuolar ATPase assembly protein VMA22 MSETRMAQNMDTTDEQYLRLIELLSNYDSTLEQLQKGFQDGYIQLSRSNYYNKDSLRGNYGEDYWDETYIGQLMATVEEKNSKVVVEIVKRKAQDKQEKKEEEDNKLTQRKKGTKPEKQKTQSHKLKQDYDPILMFGGVLSVPSSLRQSQTSFKGCIPLIAQLINYKNEILTLVETLSEQE 181 T 0.00073 ATP-synt_D unppercent F Eukaryota T 8eb0 1 A A RN216_HUMAN RING FINGER PROTEIN 216,RING-TYPE E3 UBIQUITIN TRANSFERASE RNF216,TRIAD DOMAIN-CONTAINING PROTEIN 3,UBIQUITIN-CONJUGATING ENZYME 7-INTERACTING PROTEIN 1,ZINC FINGER PROTEIN INHIBITING NF-KAPPA-B GPGQLIECRCCYGEFPFEELTQCADAHLFCKECLIRYAQEAVFGSGKLELSCMEGSCTCSFPTSELEKVLPQTILYKYYERKAEEEVAAAYADELVRCPSCSFPALLDSDVKRFSCPNPHCRKETCRKCQGLWKEHNGLTCEELAEKDDIKYRTSIEEKMTAARIRKCHKCGTGLIKSEGANRMSCRCGAQMCYLCRVSINGYDHFCQHPRSPGAPCQECSRCSLWTDPTEDDEKLIEEIQKEAEEEQKRKNGENTFKRIGPPLEKPVEKVQRVEAL 277 T 0.00058 zf-RING_2 unppercent F Eukaryota T 8eb9 1 A A C9WMH0_FUSOX Secreted in xylem Six8 GSDTSGILLASITGAGSAFQAYAGCYLTAFRNDPRTLTLRMDKTRGERISNVLVILSGGALSHAVEEVVQIAPGAVRNLATLGASTVQFLHNFRS 95 T 4.3 LppA unphh F Eukaryota T 8ebb 1 A,B A,B C9WMG8_FUSOX Secreted in xylem Six6 SDTLPVSTCPAGQKYDRSVCYKADKIRSFCVANPRSNREKITDTPCQPREICVQRNLSNGKSFAKCIPIVDLVEWKTSANGNKEGCTTTSVNPAGYHHLGTIVYDINKNPIEVDKISYFGEPGNVNEGIGGSTSYFSSDNFQFSKSRYMKTCIFSGGYGNLNAYTWSWES 170 T 0.16 Agglutinin unppssm F Eukaryota T 8ebb 2 C C C9WMG8_FUSOX Secreted in xylem Six6 GPMGPLAQTESESADVAEHTINYIDIAPEEFEPPKANLSSLVENLYFQ 48 T 10 Mfp-3 unphh F Eukaryota T 8ebl 2 C,D D,C GLU-ASP-SER-HIS-LYS-GLU-SER-ASN-ASP-CYS-SER-CYS-GLY-GLY EDSHKESNDCSCGG 14 T 0.68 zf-CSL pdbhh F T 8ebm 2 C,D C,D ASN-GLN-ARG-PHE-GLY-SER-ASN-ASN-THR-SER-GLY-SER NQRFGSNNTSGS 12 T 14 Ribosomal_S24e pdbhh F T 8ec3 1 A A G9BXS5_BORHE Fibronectin-binding protein GSTGSEYYDQLKKAEKDIDSAFKILEKLKKDRDQVELQGTMRMSGHSTSEDRATAQAKLNQFSKAKLVQELKDLLEKIDKNAKLTIDNAVEDFSKFSSETPQSNYVTEADKSLYLAKDKLYDLIKAVESSANTYDAYAKRTGIGHGSKFSEVENHLKDAKSLIKKALK 168 T 0.013 Zw10 unppercent F Bacteria T 8ec5 3 C C peptide RARARARARARAFVKKKYCL RARARARARARAFVKKKYCL 20 T 0.23 TCP pdbhh F T 8ec9 1 A,B,C,D,E,F A,B,C,D,E,F ORN-LYS-LEU-VAL-PHI-PHE-ALA-GLU-ORN-CYS-ILE-ILE-SAR-CYS-MET-VAL XKLVXFAEXCIIXCMV 16 T 1.5 Beta-APP pdbhh F T 8eca 1 A,B,C,D,E,F A,B,C,D,E,F ORN-LYS-LEU-VAL-PHE-PHE-ALA-GLU-ORN-CYS-ILE-ILE-SAR-CYS-MET XKLVFFAEXCIIXCMV 16 T 1.5 Beta-APP pdbhh F T 8eci 1 A,B 1,2 A0A3G2KE53_9CAUD GP7 MATGRTTEGRTVTVTTASNTTITGAAGTFVASDVGRTITRAGIPAGTTITAVASGTSATISAAATTSATSAATLGSLNGQSQGLVGWSPETDTEAGAYSVAATNAGTVTPDRLTNAFTPVSQRGRG 126 T 14 DUF5979 unppssm T Viruses T 8edv 1 A,B,C,D A,B,C,D Q21096_CAEEL MITOGUARDIN 2 (MIGA2) SSMNPIEADITSGQELCTELRKTIEKVHHNLEMVKNRSSKDMERSIKIEGILKGLKQVEDEIVLLVPQMEDFRDDNMEFYSVSGGSGYAGSVRTGRSRTLSVLSDDSFRSAVEEFACDIDDIDFVSDAANLDKNELRFLDEGMQAALNGEVKYRKSRMEFCKCDSETDFAAKLYCVRQALTNALKDEHKRVWLAKCGRTLLADFIRHTKQDPVKFFNAYDEMLEYVSNDRNEEQLRQDVEGRGVCETGFYDVAIDFIILDAFEDLKSPPSAVYSVTKNYFMSMSMKYSTLNTIIWSIIKSKRQRLKNPDGFIAKFYNISETVMPAITLGFLGTDERLGELCQYFKEQVVQFVLDVFNTQKVCYRSLEEMSEDVWIVMRNRLEAVQTRMSNEL 392 T 3.9E-06 Miga unppercent F Eukaryota T 8eey 1 A E A0A401FT41_9DELT Csx30 MNTTTYNTTTDALLEWGKVYFQKEDFSEFLDNLEAYISDAGDSLKDELESGVEKLVLGIKSAEAVIFGEAVIGTTPENEAWYDAEESFLTLDCAVWLSQALDRVVRRQDASLADSLIARLDEAINRVAEKLYADNLSPLRFSSLNEIRRSALEATDEKYHYLFPWHGAACDVDENILLILTEEYHLIGADKAGANLSEELRGDLPFIFAELERDEVLRAYVEKENALSLALENTMREHWAFGLLEAARDEGYNHPYPADVGMRIHQVARAVFSQTNLSPAERLAVAIAGACFTPEISEDRRLEILLDCEERVCEIEAPTGDDTSVRVIKDLKALADHRVRHEIPAESLVSLWFEQIEAAGTDFDTKTPMDELVLRMLSDNVITLSVDRKAASQTETDDVKPQKGKIIPFPVPDIANDEVEYQKAVGGGSANDSKVKFPGLLEIQGCRDGDKAILLEDTDDAAANHRKLFSILKAGKLNSAFFIQSDDGEWVESESKPTMEDNRIILHDSHHSSFVWILDTGSMQLRQSVKCVKDALNKKTGSAKKLKPKTMIVWVTIPQEG 561 T 12 TFIIA_gamma_N pdbhh F Bacteria T 8ef4 1 A A HIRV2_HIRME Bivalirudin XPRPGGGGNGDFEEIPEEYL 20 T 0.00018 Hirudin pdbhh F Eukaryota T 8egr 3 I,J J,I A0A1S6L1H8_9CAUD gp16, tail stem protein LSKHTTTLYEIIESELQRLGLNEFVNNDRIHFNDSKHAFMQKMLYFDDDVKQIVDHMFFKGFMFNDERIDRYFKESFTLRFLYREIGRQTVESFASQVLYITMTHEDYIYRVYGSDMYKYIEQVTDTQSQDLGKAIENAIEQGQTKDRQQDKGHEEYKDYEDTITKSFDDNRTAESTLPQSKVNIDVDNTVLDYADTNTISRDKNTSETVSEKTGTKDNTFDSLRNGESDTKRNTQSQNEMNRTGLTKQYLIDNLQKLYSMRDTIFKTYDKECFLHIW 278 T 0.076 MreB_Mbl unp T Viruses T 8egr 5 M,N,O,P,Q,R,S,T,U,V,W,X M,N,O,P,Q,R,S,T,U,V,W,X A0A1S6L1H7_9CAUD gp20, portal-proximal core protein MAEEEKIIKEEPTNEETEQPEKIESAEDVVTEPEKEVTEEKSEAFVQLEQRISSLEQRLNNLESQPQPTQESSDPNFEDKTVPTEVDDNQETDGIESSEEIKQMLNL 107 T 0.11 PspB pdbhh T Viruses T 8egs 1 A,B J,I A0A1S6L1H8_9CAUD Lower collar protein MQKMLYFDDDVKQIVDHMFFKGFMFNDERIDRYFKESFTLRFLYREIGRQTVESFASQVLYITMTHEDYIYRVYGSDMYKYIEQVTDTQSQDLGKAIENAIEQGQTKDRQQDKGHEEYKDYEDTITKSFDDNRTAESTLPQSKVNIDVDNTVLDYADTNTISRDKNTSETVSEKTGTKDNTFDSLRNGESDTKRNTQSQNEMNRTGLTKQYLIDNLQKLYSMRDTIFKTYDKECFLHIW 239 T 0.076 MreB_Mbl pdb T Viruses T 8egt 1 A,B,G,H H,G,F,E A0A1S6L1I6_9CAUD gp19, capsid lining protein MANFDGNEMRGMTHANYEDSRLNKSRELNANMSIGTSKSEDEYGRQVHSLTKQSYSDDSVQEA 63 T 11 Dehydrin pdbhh T Viruses T 8egt 2 C,D,E,F A,B,C,D A0A1S6L1I0_9CAUD Major capsid protein MADKKTDIPTLIADSTKASLQDFNHDYGKQWTFGENWSNVNTMFETYVNKYLFPKINETLLIDIALGNRFNWLAKEQDFIGQYSEEYVIMDTIPIEMNLSKSEELMLKRNYPQMATRLYGSGIVKKQKFTLNNNDVRFNFQTLGDATNYALGVLRKKISDINVQEEKEIRAMMVDYAINQLQDSNRRTASSKEDLTERVFEAILNMQNNSAKYNEVHKASGGSVGQYTTVSKLSDIAILTTDSLKSYLLDTKIANTFQMAGIDFTDHIISFDDLGGVYKTTKDVTLANEDTINYLRAFGDYQAMIGDVIPTGSVFTFNVSDLKEFKGNIEEIKPQGELFAFIFDINALKYKRNTKGMLKEPFYNGEFDEVTHWIHYYSFKAMSPFFNKILITEAPKEQPDAGATE 405 T 0.085 Ribosomal_L22 pdb T Viruses T 8ehb 1 A,B,C,D,E,F A,B,C,D,E,F G8ULV2_TANFA Putative lipoprotein GPLGSPEFAEKESHASCSCECVEEKIPIVTLKNENAHFRYMKRRNDFALEIENKELVRGLYLIPRGCDIPKKYKEDGLPVIISGEVFDCSEYIKPWIKRDPVYFIKLSTIKKK 113 T 0.00065 DUF4971 unphh F Bacteria T 8ehc 1 A,B A,B A0A1D3UL35_TANFO Potempin E (PotE) ANPEQAILGKWELINSGGRPIIPTGYREFLPSGIVHKYDYTKEQYTSFQCEYSILNDTVLLMCNYRYKYLFYRDKMQLFPLDLIAIRDLTEIYQRKK 97 T 0.13 DUF5640 pdbhh F Bacteria T 8ehd 1 A A G8UII1_TANFA Potempin E (PotE) MKQQIILWIGVLLLLIGGVGCENGQLHSPPANPEQAILGKWELINSGGRPIIPTGYREFLPSGIVHKYDYTKEQYTSFQCEYSILNDTVLLMCNYRYKYLFYRDKMQLFPLDLIAIRDLTEIYQRKK 127 T 0.00069 DUF4971 pdbhh F Bacteria T 8ehe 2 B B A0A1D3UUC0_TANFO Potempin C (PotC) MKQKIILWISTLLLLTAGAGCKKETLPPNQAKGKVLGPTGPCQGYALYIEVENPKGIGLEGKGIPAGSGRTWNYRNAISVPLFNRIGLPVELMEEGTWLHFEYREMTEEEKNRKLFQPDEPVICLMNQIPPPANTYMITKIIAHKPLKINPS 152 T 0.0004 DUF4969 pdbhh F Bacteria T 8eit 1 A A A modified Guanine nucleotide-binding protein G(q) subunit alpha MGSTLSAEDKAAVERSKMIDRNLREDGEKARRTLRLLLLGADNSGKSTIVKQMRILHTSGIFETKFQVDKVNFHMFDVGGQRDERRKWIQCFNDVTAIIFVVDSSDYNRLQEALNDFKSIWNNRWLRTISVILFLNKQDLLAEKVLAGKSKIEDYFPEFARYTTPEDATPEPGEDPRVTRAKYFIRKEFVDISTASGDGRHICYPHFTCAVDTENARRIFNDCKDIILQMNLREYNLV 238 T 2.8E-10 G-alpha pdb F T 8ej5 2 B,C,D B,C,D A0A1S6L1I2_9CAUD gp1, tail tip protein VTNEKGQAYTEMLQLFNLLQQWNDFYTAENANNLLVACQQLLINYNEPVIKFINDENEDKSLLQYLAGDDGLAQWQFYKGFYNNYNVHIF 90 T 0.012 GSG-1 unppssm T Viruses T 8elg 3 C C NQK-OC43 peptide NQKLIANAF 9 T 0.78 CoV_S2 pdbhh F T 8em1 1 A,B A,B A0A2N8KYF9_9BURK PaqCI, DNA Unbound MPYDHNAEADFAASEVARMLVADPGLCYDAASLPASISASASYEPSAAGWPKADGLVSVLEGGTSTQRAIALEYKRPQEGIHGLLTAIGQAHGYLHKGYSGAAIVIPGRYSSHPTPAEYVRDVLNAISGSRAIAVFSYSPPDTTSPTPFAGRIQCVRPLVFDAGRVHLRPANQGPKTQWVHMREGSTTRDAFFRFLQVAKRLSADPTAPRPTLRSELVAAIGRLAPGRDPIEYITNTADNKFLTKVWQFFWLEWLATPAVLTPWKLEAGVYSAPGARTRILREDGTDFSQLWEGRVNSLKETIAGMLNRGEISEAQGWEAFVGGISATGGGQDKQGVRARAHSYREDIDSALAQLRWIEDDGLPTDQGYRFMTICERYGGANSRAAIDYMGATLIQTGRYASFLHYINRLSERKFAENPLAYTKPGPGGMPVFTEESYWEYLQDLETKLTDELRVMRKVSGRARPRVRTTFQVELTLLRNYGFVSSTRHRLGVGIPIDWEQVVQALNVDL 510 T 0.25 DUF5343 unppercent F Bacteria T 8emb 1 A,B,C,D,E,F A,B,C,D,E,F RPOC2_THEVB RNAP SUBUNIT BETA',RNA POLYMERASE SUBUNIT BETA',TRANSCRIPTASE SUBUNIT BETA' GSHMATEKVTKDVASDLAGQVKFVNLDAEEKRDRQGTTTRIAPKGGLIWVLSGEVYNLPPGAEPVVKNGDRIEAGAVMAETTVKTEHGGVVRLPEQQDSKGGREVEIITASVMLDKAKVLKETQQGREHYIIETATGQRFSLKAAPGTKVANGQVVAELIDDRYHTTTGGILKYADIEVAKKGKAKQGYEVLKGGTLLWIPEETHEVNKDISLLMVEDNQYVEAGTEVVKDIFCQNSGVVEVIQKNDILREIIIKPGELHLVDDPEAARLKHGTLARPGEEVLPGLVVDTLSQVDYLEDTPEGPAILMRPVQEFSVPDEPSVPSQDSSDGSGQSIRLRAVQRLPYKHDERVKSVDGVDLLRTQLVLEIGSEAPQLAADIEIVTDEVDPEAQRLQLVILESLIIRRDIAADQTQGSTFTSLLVKDGDHIGPGAVIARTDIKAKQAGEVQGIVRSGESVRRILVVTDSDRLRVETNGAKPTVKVGDLVRPGDEMAKGVTAPETAAVMAVADDHVILRLARPYLVSPGAVLQIEEGDLVQRGDNLALLVFERAKTG 553 T 0.017 RNA_pol_Rpb1_5 unppercent F Bacteria T 8eno 3 E E C1DH13_AZOVD nitrogenase-associated factor T MSWRILLCHKHPVSARLRFLIPTGGGVVLPQTLPRLAVIAEDQEAPVQCHPASALRALQETMALGWQLELIGEFRLNMEVPGQIMPIYLAALAGHELPPPPEGTRWIELTQSIGMPWLDRELLRRVYEELIG 132 T 0.034 RcnB pdb F Bacteria T 8env 2 AA,FA,G,L,Q,V a,f,G,L,Q,V A0A6G9LFR0_9CAUD Structural protein gp33 KIPLTAVPNQAISFNAGSSYWKIRLYQNMDMMNADISRDGVIVCHGVRCFGGIPLLQYSHQYRPDYGNFVFDRDADWTLFGDGINLFYLDGAEFAEYQALAT 102 T 2 FAIM1 pdbhh T Viruses T 8env 4 CA,HA,I,N,S,X c,h,I,N,S,X A0A5C1KAX6_9CAUD Ripcord gp36 MINVSGFGTGIVIVSASSFPMGFSLSKFADDESPISSKELEPFGYEMLYDGGLFAFDKAAPLEVSVSVIAGSEDDINLRILLNSKKGSFRFLPGIIPDMTTLVATLPDGGRTVLSNGTILKGPAIDTIQNTGRRKGNTYTFVFGSYLGAQTA 152 T 0.72 DUF3277 pdbhh T Viruses T 8eoi 8 H I EMC10_HUMAN HEMATOPOIETIC SIGNAL PEPTIDE-CONTAINING MEMBRANE DOMAIN-CONTAINING PROTEIN 1 GTVGLLLEHSFEIDDSANFRKRGSLLWNQQDGTLSLSQRQLSEEERGRLRDVAALNGLYRVRIPRRPGALDGLEAGGYVSSFVPACSLVESHLSDQLTLHVDVAGNVVGVSVVTHPGGCRGHEVEDVDLELFNTSVQLQPPTTAPGPETAAFIERLE 157 T 0.0033 2OG-FeII_Oxy_5 unppssm F Eukaryota T 8eon 6 EA,FA,GA A,B,C A0A2K8HNS1_9CAUD Baseplate component gp37 MLGIFTSLLSSRSFSIVDQNTNQLVAADLRISRVNTRFSSVGQRHMLEDGTTKMDSRTIHPMEIIVEVFCPSIDVVDQINQLLLDRDTLYKVITRGMVFERMMCTSEALNQTPDMISATPARLTFSQVLVQNPKPIMFRNAGDSSMIDRGLALAEDVVGSAGDLFDYAVNGV 172 T 0.023 GGGtGRT pdbpercent T Viruses T 8eon 7 HA,KA,OA D,k,o A0A2K8I4C0_9CAUD Baseplate component gp38 MNSFLKSILNTPTLTIRDDVTKLPVWKSLQVKKVEIYSPASVVSKPLATKDQTEAQVYTEALDIDVKNGKIIQPVRLRINAICPDLSTVESIMNAFNDNTSTFAITSKSILADKMAIMTLDVDQSPDMLNAAEINMEFEQVEPPVLNKFDPAFPQDSPTYGVQIQSLSDANLLDLGAIGDSISSAAKSLYNRV 193 T 47 Apo-CII pdbhh T Viruses T 8eon 8 IA,LA,MA E,l,m A0A2K8IA76_9CAUD Baseplate hub gp41 MKKRILRVTFNMPYGPEVIREDLDVRVRIMKAALRIQNRATMEIFGLTTQLRESLLSQFTAWKHRQRQVGREDELMIKVSVEAGYSDQGREQVSRVFVGEVAIVDIISPPPDIGIRIQCYTRQIDRTKTIRNMPPANTTFVKFVEWGANEMGLNFICDTSYNDQVLKNPGRSITVASAILASIQDMYMPDVAAFVDDDILIVKDRDKVIRPDEVTNVNSFVGIPSWSEWGVEFQCLFEPSIRVAGGVAVESLMNPSVNGNYVITALEYDLASRDRPFYIKVMGSPAA 287 T 0.0001 Phage_GPD pdbhh T Viruses T 8eq5 2 B B SPRE2_HUMAN SPRED-2 STIHNEAELGDDDVFTTATDSSSNSSQKRE 30 T 150 Senescence_reg pdbhh F Eukaryota T 8er8 1 A A Acheta domesticus segmented densovirus major capsid protein TKEGYGKHITSMHVRNIFNQGNQVIRNIVKQQRYELLDFTGTEAGTTNLPKIIPYQCIWWRGLQNAANVNQTINNMIALNTISYGVRFLKAKLCIEVYAVTRKRLIQTGATSYYTDDFEQGQNLFIGWADRKAESIPITTPADLDETKLTVANTTLFDANNDNITKEEVPTREKWCHTWDLDVLNHNYLWEPNNLDSQWTLIPGAQAVQPTATPIGPTYQEIVIATKAIGANESALVTTIQDRRSYPRLMLSQPQIKDETDTMKFKYQIRISTELEMEHHIKPDIANPWLTRQTLPLPALSGDGTTRYVPCVPYETHVSQRNWNHVGEYL 330 T 51 YppF pdbhh F T 8erk 1 A A Acheta domesticus segmented densovirus major capsid protein EGYGKHITSMHVRNIFNQGNQVIRNIVKQQRYELLDFTGTEAGTTNLPKIIPYQCIWWRGLQNAANVNQTINNMIALNTISYGVRFLKAKLCIEVYAVTRKRLIQTGATSYYTDDFEQGQNLFIGWADRKAESIPITTPADLDETKLTVANTTLFDANNDNITKEEVPTREKWCHTWDLDVLNHNYLWEPNNLDSQWTLIPGAQAVQPTATPIGPTYQEIVIATKAIGANESALVTTIQDRRSYPRLMLSQPQIKDETDTMKFKYQIRISTELEMEHHIKPDIANPWLTRQTLPLPALSGDGTTRYVPCVPYETHVSQ 318 T 38 Pox_Rif unphh F T 8ese 1 A X B3KT69_HUMAN VPS35 endosomal protein-sorting factor-like EFASCRLEAVPLEFGDYHPLKPI 23 T 0.65 CytochromB561_N unppercent F Eukaryota T 8esw 40 NA A3 Q9W380_DROME UNCHARACTERIZED PROTEIN,ISOFORM A,ISOFORM B MSASAARGSTSLLKRAWNEIPDIVGGSALALAGIVMATIGVANYYAKDGDNRRYKLGYVVYRHDDPRALKVRNDEDD 77 T 0.08 NADHdh_A3 pdbhh F Eukaryota T 8eth 5 E 5 NSA1_SCHPO Ribosome biogenesis protein nsa1 MKLLLGDEIGQLKFIEIKKGTDTSNPESEAPVIQKFGELDREKGVLFMLKHEMNVFVARKNGTIECWNVNQEPPILSSLWQLDSSLLETASIVSMKYSNGWLMLALSDGNLLFRHIESSKLRKLQLHGPLSAVELHPRIPGIIAAGGKENDVCLYSCNPTCKSNIDELELWRTENVVKVFQGKNVKNDSLNLRVRVWITGIVFTEDIINVIDGKSEDDESLCFHFATITHYGQLRFYDTKHGRRPVSTFDVSTSPLSHVGLLPSIKLLYFADKRAQISIFDHSKKKVIGRFQGVKGAPSSIHCLGNVVAITGLDRNVRIFDADRKPLANAYIKALPTSIIVINERDAEIIKKEEELEAAKEEEEEIWRNMEQLEDTEDKKPSKRIKL 387 T 0.13 SopA pdbpssm F Eukaryota T 8ets 2 B R ARP5_YEAST ACTIN-LIKE PROTEIN ARP5 KAVVIDDPPLRQTPEPFDEQSAYNPQSPIAIDFGSSKLRAGFVNHATPTHIFPNALTKFRDRKLNKNFTFVGNDTLLDQAVRSQSRSPFDGPFVTNWNLTEEILDYTFHHLGVVPDNGIPNPILLTERLATVQSQRTNWYQILFETYNVPGVTFGIDSLFSFYNYNPSGNKTGLVISCGHEDTNVIPVVDGAGILTDAKRINWGGHQAVDYLNDLMALKYPYFPTKMSYLQYETMYKDYCYVSRNYDEDIEKILTLENLDTNDVVVEAPFTEVLQPQKTEEELRIQAEKRKETGKRLQEQARLKRMEKLVQKQEEFEYFSKVRDQLIDEPKKKVLSVLQNAGFDDERDFKKYLHSLEQSLKKAQMVEAEDDSHLDEMNEDKTAQKFDLLDIADEDLNEDQIKEKRKQRFLKASQDARQKAKEEKERVAKEEEEKKLKEQQWRETDLNGWIKDKRLKLNKLIKRRKEKLKLRDEMKDRKSQVSQNRMKNLASLAEDNVKQGAKRNRHQATIDNDPNDTFGANDEDWLIYTDITQNPEAFEEALEYEYKDIVELERLLLEHDPNFTEEDTLEAQYDWRNSILHLFLRGPRPHDSENIHEQHQMHLNVERIRVPEVIFQPTMGGQDQAGICELSETILLKKFGSQPGKLSQTSIDMVNNVLITGGNAKVPGLKERIVKEFTGFLPTGTNITVNMSSDPSLDAWKGMAALARNEEQYRKTVISKKEYEEYGPEYIKEHKLGNTKYFED 744 T 9.499999999999999E-43 Actin pdbpercent F Eukaryota T 8eu9 2 B R ARP5_YEAST ACTIN-LIKE PROTEIN ARP5 MSSRDASLTPLKAVVIDDPPLRQTPEPFDEQSAYNPQSPIAIDFGSSKLRAGFVNHATPTHIFPNALTKFRDRKLNKNFTFVGNDTLLDQAVRSQSRSPFDGPFVTNWNLTEEILDYTFHHLGVVPDNGIPNPILLTERLATVQSQRTNWYQILFETYNVPGVTFGIDSLFSFYNYNPSGNKTGLVISCGHEDTNVIPVVDGAGILTDAKRINWGGHQAVDYLNDLMALKYPYFPTKMSYLQYETMYKDYCYVSRNYDEDIEKILTLENLDTNDVVVEAPFTEVLQPQKTEEELRIQAEKRKETGKRLQEQARLKRMEKLVQKQEEFEYFSKVRDQLIDEPKKKVLSVLQNAGFDDERDFKKYLHSLEQSLKKAQMVEAEDDSHLDEMNEDKTAQKFDLLDIADEDLNEDQIKEKRKQRFLKASQDARQKAKEEKERVAKEEEEKKLKEQQWRETDLNGWIKDKRLKLNKLIKRRKEKLKLRDEMKDRKSQVSQNRMKNLASLAEDNVKQGAKRNRHQATIDNDPNDTFGANDEDWLIYTDITQNPEAFEEALEYEYKDIVELERLLLEHDPNFTEEDTLEAQYDWRNSILHLFLRGPRPHDSENIHEQHQMHLNVERIRVPEVIFQPTMGGQDQAGICELSETILLKKFGSQPGKLSQTSIDMVNNVLITGGNAKVPGLKERIVKEFTGFLPTGTNITVNMSSDPSLDAWKGMAALARNEEQYRKTVISKKEYEEYGPEYIKEHKLGNTKYFED 755 T 2.8E-42 Actin pdbpercent F Eukaryota T 8ev3 41 OA T RL21A_SCHPO RPL21 KAHFVSTENNEPVTLHPVA 19 T 0.16 MIase unppercent F Eukaryota T 8ew5 1 A A A0A6J3L7M6_9HYME Flightin MWADEEPAPWDIEETPAEQAPEASAESAAPAAEAATGEKPKIKLEKIEPPHYNHHWVRPLFLNYAYYLYEYRKNYYNDVIDYLNQREKGIFREPPRAQEWAERAMRTYDEKNTDKSFKRSADMKYIINMRHEPRYYSYHTRAYYSLKYQKIL 152 T 33 DUF3579 pdbhh F Eukaryota T 8ewy 2 C,D C,D INLR1_MOUSE IFN-LAMBDA R1,CYTOKINE RECEPTOR CLASS-II MEMBER 12,CYTOKINE RECEPTOR FAMILY 2 MEMBER 12,CRF2-12,INTERLEUKIN-28 RECEPTOR SUBUNIT ALPHA,IL-28 RECEPTOR SUBUNIT ALPHA,IL-28R-ALPHA,IL-28RA GPRMKQLEDKVEELLSKNYHLENEVARLKKLVGERKIMKGNPWFQGVKTPRALDFSEYRYPVATFQPSGPEFSDDLILCPQKELT 85 F F Eukaryota T 8f0l 3 E,F P,Q CD3E_HUMAN T-CELL SURFACE ANTIGEN T3/LEU-4 EPSILON CHAIN QDGNEEMGGITQT 13 T 0.0084 Ig_3 unp F Eukaryota T 8f24 1 A,B,C,D,E,F C,A,E,D,B,F Mirror-image RNA 0G-XEC-0G-0U-0A-0C-0A-0C 0GX0G0U0A0C0A0C 15 T 0.36 CXCXC pdbhh F T 8f2f 1 A A CLML_MESEU CALCIUM CHANNEL TOXIN-LIKE PEPTIDE-1 GCNRLNKKCNSDADCCRYGERCISTGVNYYCRPDVGPX 38 T 2.4E-05 Toxin_12 unphh F Eukaryota T 8f3a 1 A,B,C A,B,C IQN17 RMKQIEDKIEEIESKQKKIENEIARIKKLLQLTVWGIKQLQARILX 46 T 0.00044 GP41 pdbhh F T 8f3b 1 A,B,C A,B,C IQN22 GRMKQIEDKIEEIESKQKKIENEIARIKKLLQLTVWGIKQLQARILAVERYX 52 T 5.9E-05 GP41 pdbhh F T 8f3k 1 A A A0A220S190_9NEIS ACRIIC5Nch MTIKEDGMSETQYFVSHDGNRHDLFDTLEQAEHYILKKNGWTDGEIAEKWAFVKKEARKYGGDPFSSNGRHSLWFITELKLSDGVIMEVDGQLFDDYVESISAERGTEEFAETKRRLVGYYLGW 124 T 0.088 DUF4761 pdbhh F Bacteria T 8f4b 2 B B Cyclic peptide inhibitor 1 (CPI1) XFWGNLHWYYEQFDSTCX 18 T 2.5 UreE_C pdbhh F T 8f4x 1 A,AA,AB,B,BA,BB,C,CA,CB,D,DA,DB,E,EA,EB,F,FA,FB,G,GA,GB,H,HA,HB,I,IA,J,JA,K,KA,L,LA,M,MA,N,NA,O,OA,P,PA,Q,QA,R,RA,S,SA,T,TA,U,UA,V,VA,W,WA,X,XA,Y,YA,Z,ZA 0,Q,q,1,R,r,2,S,s,3,T,t,4,U,u,5,V,v,6,W,w,7,X,x,8,Y,9,Z,A,a,B,b,C,c,D,d,E,e,F,f,G,g,H,h,I,i,J,j,K,k,L,l,M,m,N,n,O,o,P,p RC_I_1-H11 MEEERRRHLAAAEARFLLELGRPDEVLRLLERLLEEGDPALFAALRELLESGDPLARLIAETVFRRL 67 T 0.00063 TPR_14 pdb F T 8f53 1 A,AA,AB,B,BA,BB,C,CA,CB,D,DA,DB,E,EA,EB,F,FA,FB,G,GA,GB,H,HA,HB,I,IA,J,JA,K,KA,L,LA,M,MA,N,NA,O,OA,P,PA,Q,QA,R,RA,S,SA,T,TA,U,UA,V,VA,W,WA,X,XA,Y,YA,Z,ZA A,Wq,R,F,Ae,Ex,K,D,S,U,I,Dy,J1,N,T,Od,X,Cz,Tc,Bb,0,Ba,Gf,2,B,Lj,G,Qn,L,Vr,V,Zt,Z,E,Id,J,Nh,O,Sl,Y,Xp,Dc,Yb,Fg,C,Kk,H,Po,M,Us,W,Hu,Ca,P,He,Gv,Mi,Q,Rm,Fw RC_I_2 MMEAMVKYLAEKAGISEVEAAEIVLKAVKISGGDVVKSIELVDLFIEILNKGRE 54 T 0.16 DUF3606 pdbhh F T 8f54 1 A,AA,AB,B,BA,BB,C,CA,CB,D,DA,DB,E,EA,EB,F,FA,FB,G,GA,GB,H,HA,HB,I,IA,J,JA,K,KA,L,LA,M,MA,N,NA,O,OA,P,PA,Q,QA,R,RA,S,SA,T,TA,U,UA,V,VA,W,WA,X,XA,Y,YA,Z,ZA L,l,3,o,G,d,p,n,4,A,h,e,q,R,5,S,J,f,r,K,7,C,H,6,D,M,T,I,E,u,B,v,s,w,i,W,k,x,U,X,m,y,g,Y,t,z,N,Z,O,0,V,a,P,1,F,b,Q,2,j,c RC_I_1 PDEDLKAELAATEAIWLLRQGRPEEVWKLMQRLYEKGDPALWAVLRALLRSGDEIAILIAWNFMQRI 67 T 0.91 DUF1841 pdbhh F T 8f7n 1 A A A0A8B3MS64_RHIML Methyl-accepting chemotaxis protein GLLQGRMEISNSVLKTLSGFKDVYAQMNNFLQQTTDESRRMLKDAIVTQKEVLAETAAQVAGGNGEDELAAAIAATSDIETRIDGLWTLHEGEQKLRAETRADLERLAAEQAKINEEANRLQYAVRKDENAAKTMLRNAEKLMRASRFYAEFATEVSGAITVEEKLKVAEGHFPAIGRTQRDIFVLLPKGEKSLAETVNSASGAIGALIKTPPGPETLAGLSKYVDRFRTASFRLEAASVGKMREATQIFSELDGKIAGTESVLTATRRLSTSLTDIQIAAAAFLGTTSEESRKKLLDRFLAVQSNLTTLRGIASGMSFFDQAAGALLPIIDGMKKDGLALVEITDKRTVEFEAAGAAINEIWSDLTGFAEQQKVAAGSERAEANQ 386 T 0.0028 Lipoprotein_6 unppercent F Bacteria T 8f7s 2 B,G P,Q DA2D_PHYBI [D-ALA2]-DELTORPHIN II YXFEVVGX 8 T 2.2 DapB_C unppercent F Eukaryota T 8f7w 2 B P PDYN_HUMAN Dynorphin YGGFLRRI 8 T 0.53 Op_neuropeptide pdbhh F Eukaryota T 8f7x 2 B P PNOC_HUMAN ORPHANIN FQ,PPNOC FGGFTGARKSARKL 14 T 1.3 Lem_TRP pdbhh F Eukaryota T 8f86 7 K K SIR6_HUMAN NAD-DEPENDENT PROTEIN DEACETYLASE SIRTUIN-6,PROTEIN MONO-ADP-RIBOSYLTRANSFERASE SIRTUIN-6,REGULATORY PROTEIN SIR2 HOMOLOG 6,HSIRT6,SIR2-LIKE PROTEIN 6 CSVNYAAGLSPYADKGKCGLPEIFDPPEELERKVWELARLVWQSSSVVFHTGAGISTASGIPDFRGPHGVWTMEERGLAPKFDTTFESARPTQTHMALVQLERVGLLRFLVSQNVDGLHVRSGFPRDKLAELHGNMFVEECAKCKTQYVRDTVVGTMGLKATGRLCTVAKARGLRACRGELRDTILDWEDSLPDRDLALADEASRNADLSITLGTSLQIRPSGNLPLATKRRGGRLVIVNLQPTKHDRHADLRIHGYVDEVMTRLMKHLGLEIPAWDGPRVLERALPPLPRPPTPKLEPKEESPTRINGSIPAGPKQEPCAQHNGSEPASPKRERPTSPAPHRPPKRVKAKAVPS 355 T 3.2E-07 SIR2 unppercent F Eukaryota T 8f8e 1 A,B A,B DCAF1_HUMAN HIV-1 VPR-BINDING PROTEIN,VPRBP,SERINE/THREONINE-PROTEIN KINASE VPRBP,VPR-INTERACTING PROTEIN GASAFRPISVFREANEDESGFTCCAFSARERFLMLGTCTGQLKLYNVFSGQEEASYNCHNSAITHLEPSRDGSLLLTSATWSQPLSALWGMKSVFDMKHSFTEDHYVEFSKHSQDRVIGTKGDIAHIYDIQTGNKLLTLFNPDLANNYKRNCATFNPTDDLVLNDGVLWDVRSAQAIHKFDKFNMNISGVFHPNGLEVIINTEIWDLRTFHLLHTVPALDQCRVVFNHTGTVMYGAMLQADDEDDLMEERMKSPFGSSFRTFNATDYKPIATIDVKRNIFDLCTDTKDCYLAVIENQGSMDALNMDTVCRLYEVG 315 F F Eukaryota T 8fae 2 B,D,E C,E,A O40222_9HIV1 Envelope glycoprotein gp120 EKLWVTVYYGVPVWKEATTTLFCASDAKAYDTEVHNVWATHACVPTDPNPQEVVLENVTENFNMWKNNMVEQMHEDIISLWDESLKPCVKLTPLCVTLNCTDLRNVTNINNSSEGMRGEIKNCSFNITTSIKDKVKKDYALFYKLDVVPIDNDNTSYRLINCNTSTITQACPKVSFEPIPIHYCTPAGFAILKCKDKKFNGTGPCKNVSTVQCTHGIKPVVSTQLLLNGSLAEEEVVIRSSNFTDNAKNIIVQLKESVEINCTRPNNNTRKSIHIGPGKAFYTTGDIIGDIRQAHCNISRTKWNNTLNQIATKLKEQFGNNKTIVFNQSSGGDPEIVMHSFNCGGEFFYCNSTQLFNSTWNFNGTWNLTQSNGTEGNDTITLPCKIKQIINMWQEVGKAMYAPPIRGQIRCSSNITGLILTRDGGNNHNNDTETFRPGGGDMRDNWRSELYKYKVVKIEPLGVAPTKAKRRV 472 T 1.9E-55 GP120 pdbpercent T Viruses T 8fbc 1 A,B A,B A0A3C0KFZ6_9BURK Cytochrome P450 MGLGSFHFDPYSPAIDADPFPSYKRLRDEFPCFWSEEAQMWILSRYSDIVTAGQDWQTYSSASGNLMTELPGRAGATLGSSDPPKHDRLRGLIQHAFMKRNLMALEEPIRDVAKQVFAQVKGVKEFDFKDVSSQFTVKVLMAALGLPMGEDALVPEHEVRENAVLMVQSDARTRAKGPEHIAAYNWMQDYASKVIAMRRASPQNDLISNFALAEIDGDRLDDREVLLTTTTLIMAGVESLGGFMMMFAYNLATFDEARRAVVANPALLPDAIEESLRFNTSAQRFRRRLMKDVTLHGQTMKEGDFVCLAYGSGNRDERQYPNPDVYDIARKPRGHLGFGGGVHACLGTAIARLAVKIAFEEFHQVVPDYRRVADQLPWMPSSTFRSPLVLQLKAQ 395 T 9.7E-32 p450 unppercent F Bacteria T 8fbd 1 A,C A,C A0A126JJ68_CLOBO Neurotoxin complex component Orf-X1 MELKQAFVFEFDENLSSSSGSIHLEKVKQNCSPNYDYFKITFIDGYLYIKNKSGVILDKYDLKNVISLVALKRDYLSLSLSNNKQIKKFKNIKNKHLKNKFNLYVINEDIEKRITKNGILEEVILNKMLLSILLGNEENLLQIS 144 T 0.021 Glyco_hydro_39 unppercent F Bacteria T 8fbe 1 A,B A,B O52975_CLOBO Neurotoxin complex component Orf-X1 SELKQAFVFEFDENLSSSSGSIHLEKVKQNSSPNYDYFKITFIDGYLYIKNKSGVILDKYDLKNVISLVALKRDYLSLSLSNNKQIKKFKNIKNKHLKNKFNLYVINEDIEKRITKNGILEEVILNKMLLSILLGNEENLLQIS 144 T 31 DUF3161 unphh F Bacteria T 8fbi 1 A A KWOCA_39 MPETFEAIARAIEVAREVEKVAQRAEEEGNPDLRDSAKELARAVDEAIEEAKKQGNPELVEWVARAAKVAAEVIKVAIQAEKEGNRDLFRAALELVRAVIEAIEEAVKQGNPELVEWVARAAKVAAEVIKVAIQAEKEGARDLFRLALELVRAVIEAIEFAVKLGDPEMVERAARIAKTAAELIKRAIRAKKEGDKDQEREAKKRVTRLIIELTLMVLKASLDLLRRILEELKEMLERLEKNPDKDVIVKVLKVIVKAIEASVDNQRVSADNQKMLAELAGSWSGGGSEQKLISEEDLGGS 301 T 0.0025 EST1_DNA_bind pdbpercent F T 8fbn 1 A,B,C,D,E,F A,B,C,D,E,F KWOCA_73 ALEKDRRALEALKRAQEAEKKGDVEEAVRAAQEAVRAAKESGASWILRLVAEQALRIAKEAEKQGNVEVAVKAARVAVEAAKQAGDNDVLRKVAEQALRIAKEAEKQGNVDVAAKAAQVAAEAAKQAGDKDMLEKVAKVAEQIAKAAEKEGDKKVSIDATRIALEASLAALEIILEELKEMLERLEKNPDKDVIVKVLKVIVKAIEASVKNQKISAKNQKALAELA 226 T 0.00058 Syntaxin-6_N pdbpercent F T 8fbo 1 A,B,C A,B,C KWOCA_102 MTEEKIEEARQSIKEAERSLREGNPEKALDAVARALSLVNELERLARKTGSTEVLIEAARLAIEVARVALKVGSPEMAQLAVELALRLVQELERQARKTGSTEVLIEAARLAIEVARVAFKVGSPETAREAARTALELVEELERQARKTGSEEVLERAARLAEEVARVAEEIGDPELARKAMKVAIRLTEELLKKSLRELRRILEELKEMLERLEKNPDKDVIVKVLKVIVKAIEASVENQRISADNQRALARLAGSWSGGGSEQKLISEEDLGGS 276 T 0.024 Fmp27_WPPW pdb F T 8fbw 3 C,F E,F SIV V2 peptide LKSDKKIEYNETWYSRD 17 T 13 YlaC pdbhh F T 8fck 1 A A A0A8J1L9M8_XENLA HAUS augmin-like complex subunit 1 MDEKSTKIIMWLKKMFGDKPLPPYEVNTRTMEILYQLAEWNEARDKDLSLVTEDLKLKSAEVKAEAKYLQDLLTEGLGPSYTNLSRMGNNYLNQIVDSCLALELKNSSLSSYIPAVNDLSSELVAIELNNQEMEAELTSLRKKLTEALVLEKSLEQDLKKAEEQCNFEKAKVEIRSQNMKKLKDKSEEYKYKIHAAKDQLSSAGMEEPLTHRSLVSLSETLTELKAQSMAAKEKLNSYLDLAPNPSLVKVKIEEAKRELKATEVELTTKVNMMEFVVPEPSKRRLK 286 T 0.23 DUF16 pdb F Eukaryota T 8fck 8 H H HAUS8_XENLA HEC1/NDC80-INTERACTING CENTROSOME-ASSOCIATED PROTEIN 1,SARCOMA ANTIGEN NY-SAR-48 HOMOLOG MSEAGVAPIEDGSQNSSGGSSGDAALKKSKGGAKVVKSRYMQIGRSKVSKNSLANTTVCSGGKVPERGSGGTPTRRSLAPHKAKITAAVPLPALDGSIFTKEDLQSTLLDGHRIARPDLDLSVINDRTLQKITPRPVVTSEQKKPKRDTTPVNLVPEDMVEMIESQTLLLTYLTIKMQKNLFRLEEKAERNLLLVNDQKDQLQETIHMMKRDLTLLQREERLRDLIEKQDEVLTPVVTSKDPFKDNYTTFATALDSTRHQLAIKNIHITGNRHRYLEELQKHLAITKSLLEEIMPSHASENAESFDTIKDLENIVLKTDEELARSFRQILDLSFKVNKEISLQSQKAVEETCESALVRQWYFDGSLP 367 F F Eukaryota T 8fed 10 K K A0QWR2_MYCS2 Transmembrane protein MSKWLLRGVVFATAMVIVRLLQGALVNASPGNAIWFSTGLLVLYAIGVAVWGVLDGRGDARSNPDPDRRADLAMTWLLAGLAAGILSGAVSWFIGLFYKSIYTESLLNEITTFAAFTALLTFLVAVAGVTIGRWTIDRKAPPVTRTRHGLAADDDRADTDVFAAVSANGAQEHTDTTQTTPLENPDQPRQS 191 T 0.0021 DUF3611 pdbpssm F Bacteria T 8fg0 3 E,F P,Q CSP_PLAF7 CS,PFCSP QGHNMPNDPNRNVD 14 T 0.26 DUF3533 unppercent F Eukaryota T 8fis 2 C,E,F C,F,G Q2N0S6_9HIV1 ENV POLYPROTEIN AENLWVTVYYGVPVWKDAETTLFCASDAKAYETEKHNVWATHACVPTDPNPQEIHLENVTEEFNMWKNNMVEQMHTDIISLWDQSLKPCVKLTPLCVTLQCTNVTNNITDDMRGELKNCSFNMTTELRDKKQKVYSLFYRLDVVQINENQGNRSNNSNKEYRLINCNTSACTQACPKVSFEPIPIHYCAPAGFAILKCKDKKFNGTGPCPSVSTVQCTHGIKPVVSTQLLLNGSLAEEEVMIRSENITNNAKNILVQFNTPVQINCTRPNNNTRKSIRIGPGQAFYATGDIIGDIRQAHCNVSKATWNETLGKVVKQLRKHFGNNTIIRFANSSGGDLEVTTHSFNCGGEFFYCNTSGLFNSTWISNTSVQGSNSTGSNDSITLPCRIKQIINMWQRIGQCMYAPPIQGVIRCVSNITGLILTRDGGSTNSTTETFRPGGGDMRDNWRSELYKYKVVKIEPLGVAPTRCKRRVVGRRRRRR 481 T 3.5E-54 GP120 pdbpercent T Viruses T 8fjk 4 M,N,OA,PA,QA M,N,k,l,m CAPSD_AQRVC ATP-DEPENDENT DNA HELICASE VP3 TASPADTNVVPAKDAPTTNSPPSTTSPNQAAADANQQQAGIVSSQSGPNAVGDSAPSTSVNNDGDIITRPTSDSIAAVANATKPAAVVSDPQSM 94 T 0.068 DUF5888 pdb T Viruses T 8fjo 1 A A B2HHT9_MYCMM Cytochrome P450 124A1, Cyp124A1 MDLSTNLNTGLLPRVNGTPPPEVPLADIELGSLEFWGRDDDFRDGAFATLRREAPISFWPPIELAGLTAGKGHWALTKHDDIHFASRHPEIFHSSPNIVIHDQTPELAEYFGSMIVLDDPRHQRLRSIVSRAFTPKVVARIEASVRERAHRLVAAMIENHPDGQADLVSELAGPLPLQIICDMMGIPEEDHEQIFHWTNVILGFGDPDLTTDFDEFLQVSMAIGGYATALADDRRVNHHGDLTTSLVEAEVDGERLSSSEIAMFFILLVVAGNETTRNAISHGMLALSRYPDERAKWWSDFDGLAATAVEEIVRWASPVVYMRRTLSQDVDLRGTKMAAGDKVTLWYCSANRDEEKFADPWTFDVTRNPNPQVGFGGGGAHFCLGANLARREIRVVFDELRRQMPDVVATEEPARLLSQFIHGIKRLPVAWSRHHHHHH 439 T 1.3E-22 p450 unppssm F Bacteria T 8fk7 1 A,B,C,D,E,F,G,H,I,J,K,L,M,N,O,P,Q,R,S,T A,B,C,D,E,F,G,H,I,J,K,L,M,N,O,P,Q,R,S,T A3MVU7_PYRCJ Flagellin MPMKTKGLEPIVAAVLLIVVAVIGAVLVYLWFSGYVTRATSQAEQLSAAEQLKIEAVSKTGTTVSVNVRNVGEVPVKIASAYVLNATTLTMICGGSLTSPQQIDPGTIQTINVPGTCNLIAGARYIVKVVTARGTEAAATFISP 144 T 0.00034 Pilin_N unppssm F Archaea T 8fkp 47 UA SZ NOP16_HUMAN HBV PRE-S2 TRANS-REGULATED PROTEIN 3 MPKAKGKTRRQKFGYSVNRKRLNRNARRKAAPRIECSHIRHAWDHAKSVRQNLAEMGLAVDPNRAVPLRKRKVKAMEVDIEERPKELVRKPYVLNDLEAEASLPEKKGNTLSRDLIDYVRYMVENHGEDYKAMARDEKNYYQDTPKQIRSKINVYKRFYPAEWQDFLDSLQKRKMEVE 178 T 2.5E-17 Nop16 unppssm F Eukaryota T 8flp 1 A A Alpha-conotoxin LvIC analogue GCCANPVCNGKHCX 14 T 0.016 Toxin_8 pdbhh F T 8flt 5 E P M-PTH(1-14) XVXEIQLMHQXAKW 14 T 0.0055 Parathyroid pdbhh F T 8flx 1 A A LK031 PELFLQDLRSLVEAARILARLARQRGDEHALERAARWAEQAARQAEKLARQARKEGNLELALKALQILVNAAYVLAEIARDRGNEELLEYAARLAEEAARQAAEIWAEAARRGNQQLRTKAAHILLRAAEVLLEIARDRGNQELLEKAQRIVEAVAAAQQVAALALRLAEELDSEEAKKAVRAIAEAAAAALLAALQGKDEVAKLALKVLKEAIELAKENRSEEALKVVLEIARAAAAAARAAEEGKTEVAKLALKVLEEAIELAKENRSEEALKVVLEIARAALAAAQAAEEGKSDEARDALRRLEEAIEEAKENRSKESLEKVREEAKEAEQQAEDAREGKGWSENLYFQ 352 T 0.014 Phage-MuB_C pdbpercent F T 8fn4 1 A 1 Q57XL7_TRYB2 RNA-editing substrate-binding complex protein 1 (RESC1) MLRLLRRSIVGSTFNIMVRRQNQGSVSQGALNMRDQQAAAAENVTPERVWALWNEGNLFSLSLAQLQGFLSRCGVRTDPAAKKAAVVRQVEEYLHSKDTTVKGGGQGAASPQQHQQHGQQGGYGRWNQASVMQPETLLDLSQAGFYEGAANMVPKAFQLLVSDTAPDVVVSRVNTTAFPGFPSNTECYTLGASEKDVAIRSRYSKVLQWCCLNMSNLQMDGELYVDFGKLLLKPSVMRKNRRIVSSYTLQQRLQVNHPYTWVPTLPESCLSKIQEQFLQPEGFAPIGKGVQLTYSGTIKRSKDQLHVDLDNKGKVLAVNSAWVNLQTAWCTHAKGPDVRLLLRSRPPIRRQDVELFASTPIIKLADDDVADVLPPEHGQLVYLSEDETRLFERVSDRGVTITVREVKRQPLIILRDEEEDPRVEYSLSAHIPANAAKATDVRAVGLTAFELAGRLAGLVAEDFVREYGCEAKL 473 T 0.058 HeH unppercent F Eukaryota T 8fn4 2 B 2 B6SBL9_9TRYP RNA-editing substrate-binding complex protein 2 (RESC2) MLRARLKIFSALNGATSAFSRAVAPLQIATRQQSFSAAAPAASGDFSHITRNTVWGLWNEGNLFSLSVPELAFFLQEHCRVANVDPRAKKSALVRQVEEILSAEQQASATVPQEDNPHAIVVTDYDRAEDALEEADEYGDWGAEPGFEDRRELDFMELSPGRMGERYDPLSPRAFQLLHSETATDVGIASIDPSKLPGQSKVKNALAAIHVAPNDANKMRFRMAFEWCLMNIWNMNMPGELNIGAGKALYYRSVAKQNRNVMPLWTVQKHLYAQHPYAWFAIASESNVAAMESLAAALNMSIQQERTTSYKVTIRRMAEFFDCELNGQLKCTMMNKPWDRFFVSHYIRSKMPDLRYVVRARHPIKKRIADAYLEADILRSTRDSVQSVLSPELGDVVYCCERVVRKWAKKTATGVTLQLVETKRTPLIITKAGDEGERLEYEWIVPLPQQAERIDIAALTDELWEYGNKLAAALEEGMEELMVHTMTAVSAY 492 T 0.7 ARMET_C pdbhh F Eukaryota T 8fn4 3 C 3 Q381A0_TRYB2 RNA-editing substrate-binding complex protein 3 (RESC3) MSNPFEKVARGIAFKMRSKVHKQGYSNTVMAQQARRLSPTGLLAMERLTELTALQQRHQCTFDPALRSKATQILRTLPLLSIDEDPYFTHTQRALRLAAYFGAVDLPVTYALINQHTKNAFMLDAFSMASFFYTLAKLKHPQTKEIVGILLPRLREVAPELIAREAVHILRLLCSIQMADAQLVKVVTETVVATAADVPLRDARQCAFILSETFPEEAQRILGAVEHRLCDDIDMNADANEVKTTILDVCRVVSATCKGPRRLLNSVARRSMELLPQLTPLDVAFVLKAFHLSSYRHLRLLRVLSSSLAASFPTSNVTKEHGLAASIVVQSLAHFYLSGCEEVVVTLVNASVNVLEGLNLALTLLACVRLRCVSPGVDPAVDALCSGAPMRRYVHNAHSMQVTSRILYGLAHAGRCRSDEEVAIVLPLLKSVVRTPGALRDDCRGFLLDAVTALGADGECSNDALQEQVRKVYERLSQDGGK 482 T 11 DUF6489 pdbhh F Eukaryota T 8fn4 4 D 4 Q384R6_TRYB2 RNA-editing substrate-binding complex protein 4 (RESC4) MNGRLYCLIRRITSPPVATRLIKEELCLSMAAIARLPLRRDQLAHVTNTEAITTRAQRISHLCTPTELGMIAEGAEALSCNRFDLADALIDGAYESVRRAASSTRLSHVSAIARYSASIKTYGNETITTLLKAGASLLQKNDSVPVLKSFLGVAQSHLTDGEMRVLIDEMCAKATEEQRLCINSIGTQSLAKDAAKCGEETLTKGNEDGDETAVDDEETQAWDMLRARQWMLQLVRCGKPPTAAEAVQAMELYAHFAVRDFVLHEKIEDLVLLVLPTGNKFHLNEMHKIVLRSPNLFPRVRNTLGQDHSGVSDVHRADRGVEWSDDPASSLTTTYTTSRAYSMLLLGQRLSEDIMFDVVQEQSETIPVDVAAQAACLFAEKGDIPEGVILRLSAELEHISPQGVTAFVRAARRDSSGALLPHYAAVLNRFTERDLCDTPLETLLQMCEVFALPAPRGTSEGDNDSINESQSKFQKALIVRLFSVIQGSRDVPFLCKVAKAVRAFDANDELIQFVCSSICAQGALSECEALIAFDMIRCCDFVYEPLLDAMEPVFRRLVESVSAMLEGKSTINDVEVRRCACFATLQSEFDCPDFETLASLLVHTVEKNVTGCPVELIPSVGLLCVRTRRTSALYIVGNKLEGNMQQLSDDAIGELARLLVGTENLATKELAVEFQSVVVSRLLRQQSLPPDVVALSAVVWLRQGDKVGTIDERSVDYIIKWMYAIGSSVYTDLCLAVHLSASVESLSNALIDDLPRRLELLTTNEMANAIFGLGEVSDMGARLSHQLVAERCSDYVVDHSQEFWSGKVIARLLYGFSRMHCTKRSLYNVFATRLAHRPVFSLLDQEAISFAIAAFGRVKYLDKKLFDRFTRWILDHSKDLNAAELLLTIRGVSRVMLLNDQLYDDLGSKAAEKVKEFPIESQCVLLSSFGSLGVEHERLASRMVSSIAENREELTDATKAVDVITSLWSMNYDVEDDKHVAQLADWVVQRAEELTDESIGKLCLVLSDTNWRHVPLVRAIAEQSVRLQGQQSISPKCCREVLDVLGTFMIHHQGARENLSALGRSISKERIQLSEEEEQHLQLLLRR 1087 T 0.017 MOR2-PAG1_C pdb F Eukaryota T 8fn4 5 E 5 Q389F5_TRYB2 RNA-editing substrate-binding complex protein 5 (RESC5) MLRHTSRNNALHAFVRSPHYRTIPSAGPNGIVVNRDMLVHQFRDFYKTLQHCSLVDKVHLMSERPSVEALRVADQMVSIGATFLEMPLTGMEHRATEFMESMRYVRGAGGPSTLASYLQDTENCRCNSGDVVCLPNGIAVGHGPRTNAVAHTTLKQLFEVKDDQFSFDVFTLEQEGDAPPLGDYFGFAGSNVLLTWKDEHGLLAVDQYQQKQPHTEMNVVYLEPGCHFLSFYGVDHTIDVLVQKGYERSMDSIAAAGLNPIPVQWSEMDKLGISMRAAVLPLKFFKANVGGMLSRNKSRGARWQTHQLQKGSGSGSASSGASAAGSSGASASSGASAAGSSGASAGHHHHHHHHHHSGSEDQVDPRLIDGKASAWSHPQFEKGGGSGGGSGGSAWSHPQFEK 402 T 0.29 ADI unp F Eukaryota T 8fnc 3 C 5 Q389F5_TRYB2 Mitochondrial RNA binding protein MLRHTSRNNALHAFVRSPHYRTIPSAGPNGIVVNRDMLVHQFRDFYKTLQHCSLVDKVHLMSERPSVEALRVADQMVSIGATFLEMPLTGMEHRATEFMESMRYVRGAGGPSTLASYLQDTENCRCNSGDVVCLPNGIAVGHGPRTNAVAHTTLKQLFEVKDDQFSFDVFTLEQEGDAPPLGDYFGFAGSNVLLTWKDEHGLLAVDQYQQKQPHTEMNVVYLEPGCHFLSFYGVDHTIDVLVQKGYERSMDSIAAAGLNPIPVQWSEMDKLGISMRAAVLPLKFFKANVGGMLSRNKSRGARWQTHQLQK 310 T 0.29 ADI pdb F Eukaryota T 8fnc 5 E 7 Q384B4_TRYB2 RxLR effector protein MRSSRGILFLSGAFAIRGMSAYHSYQRLDTVSHTSKVYSLQMQRQTVHFTPITRLGVEATANPTTATNATGQTGDGDGATALDVAMRVNKLKRLHQTGGGPSGKKQVELDAWRDLNNLTEAQINSAEGKAVSLLLNSWAYFAKYWEKGAEGPSASLSEVTPSNDSSSAGEHGTQ 174 T 0.14 LppC unppssm F Eukaryota T 8fnc 7 G 10 Q57VS6_TRYB2 RAP domain-containing protein MRRRVVLCCQDVGSLLSSKHSVHSGIGYHERVFSRNLLYRRYPVVTVLPKAGFTVLDTKRWIASSGPPVTGSPLSPVTNPSLNVGTGGGEAVAMEGPLPVSYSPGSGVNGSLPVTSTAITAHCDVLSECVAKADELAVQLKAQNALSASAEILTQEGMEEFVEELKTSATNEMTALVKQMQTTPLLQRAGMHELRRTLYYTTSLKERDWLEEKQYTAAMRMLTVEVLRRDGDGVLSADDVLYVTTHVVTANFYNRHLWNRMEKSLLKFSNYENIDMSSVKAFSTRLFKTRRGCAKETLDIRRKVLLAMSRRVGVLANDFDLPSLLGVLQCYTVHDLTPFHLEPLAIRATNHVGDFTPHECATLAHVLRKWRTMRLEVCERLVERICTSDQLTHHMANAAMIAIRTCFNQVSDGGRNAMNAEPTRQKLRAMGEQIGCRLDEVEYPALPVILSILDVVVTLKIYVPKKCLQVIFSQANDMVAIVMEQKDDLVDPKTGKRVRPITAEEGRQLQALLSHYGNDLAPELSQRMKEAFREGVLPDEASL 543 T 0.13 DUF3646 pdb F Eukaryota T 8fni 7 G 9 Q585T1_TRYB2 RNA-editing substrate-binding complex protein 9 (RESC9) MLLPTLERLLERCGRPIFSNVEDVRMVMASLLDISAYVDRASTKVIAKPLRRFCHKDPDTVASVMEAVPIDAAEPTHGRRAAMLLRCLPKHSCDEVIWERAVAATLAGLKSRKWDLHDYRVAMAHAGRGGRHAPALAAAAEEFVSSSARTASQSELPALLVILTSLPELKRSPCLQVAADRIVQLSEILSPAAIGQICASVNKVSFRHTAMAIALQEEAIRFAEESDLFSAVQLFSFICQQEKEAISPDAVKCLAERVIEGKDLDQETVSVLCRALRSIPRPHRPELLREIGEMMEFLGGEVKELLELPVAKGGLKGDVSAGDIQSFISKFLSLDGLLPADHDRPGTYMAAIVACVDYITERLEDIVSDENPPFSIIPHLLNINMEETRRCGQAIIREAAEQGIHFPTLQVFRFLLALGDHNMRDQRVYRHLRNEFAKTASDIPMIQLCAALKCFVRGLMQNVETQSLDEQVEHELEKEDMDAFLRFCVENLRRGFADGMEVKCVMAATESLYQLGYTSTEFYEQVARYLGSKCSSASASVNSSETATAVCLALGEDILDRHPDVHTFLLEVEKSGLKGEASLSPTEWMNKNDPANFITPLTEIQQEGWNIINRMVETRAADTEKLTALANEYVAILKSTRVDDLKYFFGVFEEKVFKQDRILKQCLDYLVESNAAVKLSATSIGAMLNSLAAIRFTYHRSVKQFMIAISTEQWSEMDASPLVKIVSAMAKLSLRLPQVLVHVGDRLLDVYTFLSPLDTALVINSLQSIGYGNDEVLMMLMRHAASSARRWDEVSLTLLFGASGVHRLLRNVEVAAPLLEQAAGKTSSPHLRQRIAASLRRSALPRALVQSSTSLLTGGAHEVVNNPPLQLV 872 T 0.028 AAA_assoc pdb F Eukaryota T 8fni 9 I 11 Q57WL2_TRYB2 RNA-editing substrate-binding complex protein 11 (RESC11) MYRLYRRTVGYQSLHQRLSACHVMCRHVSTDNSDGTTPPKPRRSGIRRVVPSDEEMAELHDLEQEVASTTSSRSKQSALSGVMVEPMRFSTSGGSGMEGDGDDLGELEAEGDEEGVGTNSLAEAENVYKRHNDGGALEKQGLAIPPSGKPTDPLLANRDDEGEGGAVPLSQAEEMTVSRSTLERQACVRSLSLEELVEAVTLYLRATKNPRLVSADEEHIFFPVLMERLNEFHVSQLLDVVECHWARSTLVRYGTTFKDMVRDRIALIATAAAKSASKRPAAAGKSGNDNRDGGAVEEEADDYDEQGDAVYVHEAEEKTSDLIILRAAEEMSPETVLRCIIVMGMSAGRRKRDLQFFQAMGMFLVHHINHYKDPHELVRVLTAFARAKIVPPKRFLALLGRRFAVLNKRKKLGSLPSYRAFVNLYKMGHDQMNTFRFLADCILETIDSNIKAEKKRLRLAQLQSSSNITAATTNENGATNESGCGGSTSSSNPTVTNITGAGDLKATHTSEGASDVAFIGDLDPHLLQNLRARERFKRLTELKPSMFTKLLLVLARFGAPHQQYLRPTTVPLILPTLRAFPPPSFTRLLRAMSLFRTTDLDLIEPVIDFMADSLGPTNVVPADVLQMVRLVAPPDVPVPRNLVKLISLCEAVYSSSASFSHSDGKSSDSADAAACAMTTLSPIRPGDMCAVAVVLLKIQMKDDVPLEALDPLTRLMEFFAERMYLLMKLHIVSLTHVDVFTDLCRQQQHPDVSGHIERLCAERRRVNDAEGDDEYYSQLDIDVRETLHRILIVNDYNTYGQYRPTPGVLQVDFKQALTEVSAFDVLEAADLFAQAFSNALKPAVERHLSRSIIAKLDGGGEEVITEGNSIVLRPPRELLLTREDLGKFVCLLQRTPLRRVRASPVVWRFVEEKAKKLGMDDVLRVVENKLATAV 934 T 0.27 DUF440 pdbpssm F Eukaryota T 8frs 2 G,H,I f,g,h A0A2K8HLV9_9CAUD Structural protein gp24 MFQKQVYRQYTPGFPGDLIEDGPKRARPGRIMSLSAVNPAATATGPNRASRAFGYAGDVSALGEGQPKTIAARASEVVIGGANFFGVLGHPKHYALFGSAGDSLAPSYDLPDGAEGEFFDMATGLVVEIFNGAAAALDLDYGDLVAYVPNNLATADDALGLPAGALVGFKTGSMPTGLVQIPNARIVNAISLPAQSAGNLVAGVTIVQLTQ 211 T 69 CBFB_NFYA pdbhh T Viruses T 8fuv 2 B,C,D,E,F,G T,A,B,C,D,E A0A2K8HPF4_9CAUD Tail fiber protein gp32 FGSICAFTASRTFPNGFTVTEEFADADPIDSPPFAAADTGAGLNGDMVVWNRANILEVVVNVIPNTEGERNLAVLLDANRTGKDKSGARDVVGLVVAMPDGSKITCTNGTPIDGVLINAVASVGRLKTKPYRFRFEKVIKAGTS 144 T 0.06 Arch_flagellin unp T Viruses T 8fvh 1 A,B,C,D,E,F C,A,B,D,E,F A0A2K8I4A6_9CAUD E217 collar protein gp28 IPGANLLRMAFGVIGTQIVRYRKFEQRVKNDQAQYVSMFGEPFDLAASVQRVRRDQYAQFNLEFQRNYVMIFANFDMVDLDRNMAGDQFLWTGRVFQLESQGSWFYQDGWGVCLAVDIGAAKA 123 T 0.00015 Phage_H_T_join pdbhh T Viruses T 8fvh 2 G,H,I,J,K,L M,P,S,V,Y,b A0A2K8HWZ4_9CAUD E217 gateway protein gp29 MFDGELIAKLVVELNAAMTSAQEALQFPDFEVVQKAQPTQQGTSTRPTIFFQKLFDIPRGWPATDWHLDNTARKYVEITRQHVETTFQISSLHWQNPEITHVVTASDIANYVRAYFQARSTIERVKELDFLILRVSQISNEAFENDNHQFEFHPSFDMVVTYNQYIRLYENAAYSADGVLIG 182 T 0.11 HAD_2 pdb T Viruses T 8fw5 2 B B NPRL2_HUMAN GENE 21 PROTEIN,G21 PROTEIN,NITROGEN PERMEASE REGULATOR 2-LIKE PROTEIN,NPR2-LIKE PROTEIN,TUMOR SUPPRESSOR CANDIDATE 4 MGYPYDVPDYADLNGGGGGSTMGSGCRIECIFFSEFHPTLGPKITYQVPEDFISRELFDTVQVYIITKPELQNKLITVTAMEKKLIGCPVCIEHKKYSRNALLFNLGFVCDAQAKTCALEPIVKKLAGYLTTLELESSFVSMEESKQKLVPIMTILLEELNASGRCTLPIDESNTIHLKVIEQRPDPPVAQEYDVPVFTKDKEDFFNSQWDLTTQQILPYIDGFRHIQKISAEADVELNLVRIAIQNLLYYGVVTLVSILQYSNVYCPTPKVQDLVDDKSLQEACLSYVTKQGHKRASLRDVFQLYCSLSPGTTVRDLIGRHPQQLQHVDERKLIQFGLMKNLIRRLQKYPVRVTREEQSHPARLYTGCHSYDEICCKTGMSYHELDERLENDPNIIICWK 401 T 2.3E-23 NPR2 unppercent F Eukaryota T 8fw5 3 C C NPRL3_HUMAN -14 GENE PROTEIN,ALPHA-GLOBIN REGULATORY ELEMENT-CONTAINING GENE PROTEIN,NITROGEN PERMEASE REGULATOR 3-LIKE PROTEIN,PROTEIN CGTHBA MGYPYDVPDYADLNGGGGGSTMRDNTSPISVILVSSGSRGNKLLFRYPFQRSQEHPASQTSKPRSRYAASNTGDHADEQDGDSRFSDVILATILATKSEMCGQKFELKIDNVRFVGHPTLLQHALGQISKTDPSPKREAPTMILFNVVFALRANADPSVINCLHNLSRRIATVLQHEERRCQYLTREAKLILALQDEVSAMADGNEGPQSPFHHILPKCKLARDLKEAYDSLCTSGVVRLHINSWLEVSFCLPHKIHYAASSLIPPEAIERSLKAIRPYHALLLLSDEKSLLGELPIDCSPALVRVIKTTSAVKNLQQLAQDADLALLQVFQLAAHLVYWGKAIIIYPLCENNVYMLSPNASVCLYSPLAEQFSHQFPSHDLPSVLAKFSLPVSLSEFRNPLAPAVQETQLIQMVVWMLQRRLLIQLHTYVCLMASPSEEEPRPREDDVPFTARVGGRSLSTPNALSFGSPTSSDDMTLTSPSMDNSSAELLPSGDSPLNQRMTENLLASLSEHERAAILSVPAAQNPEDLRMFARLLHYFRGRHHLEEIMYNENTRRSQLLMLFDKFRSVLVVTTHEDPVIAVFQALLP 590 T 6.3E-10 NPR3 unppssm F Eukaryota T 8fw5 7 G G Schizosaccharomyces pombe LAM2, Human LAMTOR2 ortholog MGSSHHHHHHSLEVLFQGPGSMIKPKKLSSLMKQAVEETVPSIMVFTTTGSLLAYVSFEDPKDGLKRLDLAKRVRSIAALAGNMYSLYTATNPSPLVAESTDDVIAHQRDVLFETIIEFERGKLLIAAISIDGAEDKLYSKDPLLLGIVGTENAKEGMMQIKSELLKECITNELSTLGKPV 181 T 0.86 Robl_LC7 pdbpercent F T 8fw5 9 I I Schizosaccharomyces pombe LAM4, Human LAMTOR5 ortholog MDSQLSENLLKCVNETYRGAMLVRNGLPIATAGDVNAEEQRVICEWNSNAVSEVLHLHDSNTKILIATKESCVLGLIYRNT 81 T 0.016 LAMTOR5 pdbpssm F T 8fy9 2 B,C,E,F B,C,E,F Cas1 MAGPIIAGKSESSELPRVEDRATFIYIEHAKINRVDSAVTVAEAKGVVRIPAAMIGVLLLGPGTDISHRAVELLGDTGTALVWVGEQGVRYYASGRALARSTRFLVKQAELVTNERSRLRVARRMYQMRFPTEDVSKLTMQQLRSHEGARVRRKYRELSKKYNVPWKKRVYNPDDFAGGDPINQALSAAHVALYGLVHSVVAALGLSPGLGFVHTGHDRSFIYDVADLYKAEITVPIAFAVAAEAEEGQDIGQLARLRTRDAFVDGKILKRMVKDLQTLLEIPEEGQIEAEPLSLWDDKEKLVPYGVNYSEVTSCP 316 T 5.3E-07 Cas_Cas1 pdb F T 8fyc 4 E,F,G,H B,C,E,F Cas1 MAGPIIAGKSESSELPRVEDRATFIYIEHAKINRVDSAVTVAEAKGVVRIPAAMIGVLLLGPGTDISHRAVELLGDTGTALVWVGEQGVRYYASGRALARSTRFLVKQAELVTNERSRLRVARRMYQMRFPTEDVSKLTMQQLRSHEGARVRRKYRELSKKYNVPWKKRVYNPDDFAGGDPINQALSAAHVALYGLVHSVVAALGLSPGLGFVHTGHDRSFIYDVADLYKAEITVPIAFAVAAEAEEGQDIGQLARLRTRDAFVDGKILKRMVKDLQTLLEIPEEGQIEAEPLSLWDDKEKLVPYGVNYSE 311 T 1.7E-08 Cas_Cas1 pdbpssm F T 8fzm 2 B,D B,D Bimax2 SRRRRRRKRKREWDDDDDPPKKRRRLD 27 T 0.68 Med24_N pdbhh F T 8g21 1 A A RELN_HUMAN Reelin STRKQNYMMNFSRQHGLRHFYNRRRRSLRRYP 32 T 33 MazG_C pdbhh F Eukaryota T 8g2z 1 A,KB,NA,P 0A,3A,2A,1A I7LUL4_TETTS RIB27A MSQYAYDFNPAKTQSLQSRPIEHNKFSAWTGDQLYRTSYGTHWTSKPQEPKTHVPPGYAGYIPGLKPNNHYGASYGEIAKNCLSNPKVAQNPFKLASTGFNYQRHDFRDPSLTATTHKFGAQTLLKNHPSIDQKSNQWQSQTHDSFRNPLHKPNPTYRETDKDLQTQKYFTKTSGFQQNHTTFDRTGWVPEKVLHADRTTSEYRIHFNKQVPFHRDTVLFKERRLPPKEYNYKYMG 236 T 0.00015 DUF2475 pdbpercent F Eukaryota T 8g2z 7 G,V 0G,1G Q237T1_TETTS CFAP107 MNAQTIQDNVNKYRFGVLIGNFAEEKFGMDMAQRQIDERLPNSTMKDSYGLKNSALNCEPSKLTPIDKEFNQHVIFNTQGVQQHILFGHGVKQTDYNKREYGTSYDLSFNQKIKPQTQIYSKYTPDALSASRTFHKDTIFEKDYQKHIPELGSKPTVPKDKARPYNEFTKTYDSTHMKIPLRK 183 T 0.058 DUF1143 pdbhh F Eukaryota T 8g2z 18 X 1I A4VD56_TETTS CFAP143 MELNQTTLESYNKQLQKGQGTLIGNWWEERELRDVTGIGRSAHNYAKLKSTISQTGAQSLYKESPQTESVNDTNERTMGKKFNHIPNTTNSEYGKGFNKADQLPRTGPVHLRTQQQMINHIKQELNEIENQKEIQRNIRYFQTTTQTEFGPKEHAMAGCTVGRRVMRTQNGQPITPDNRDEDILVDHGFLERQPFLTDEELKNQLPQGESYLTQQPITYWTEKTTDGFGCVYQSKSNPNDPKSTFKLNNQFLKTFHDYSHVRK 263 T 0.044 DA1-like pdb F Eukaryota T 8g2z 19 Y 1J I7MLS4_TETTS CFAP21A MAANRTKDLFPGFKTAGASTKLPDESAMNCIKPKENLNPGTPEHIKKYRKSYKNQPGSTILHYGIYDDQKPPETFVYGKKIEGSDHVQQVMDSGKTDGIKQMINEIKEAKYHSRQVEPLGKRMERNYEFPQEVHQDNFKFGVATINSENTAKEVMFPQKPAVNDQAAHDQYVRSHGNYEAGEQKNRNYNWKVDPQQHRFGKTDKIASNEVVYCLNQEALQDQFPKTTIVQKKQEDFRNFKEDHLGLPKNLGQTNAKNNPDMVFGARLGGPDEWNAGKCISGEATLKEVQTDKDLGKTNRFGFRNITKDGDENRVFGVPTIRDDIQKPLMKSIADPNNYGDEKPAVNLLFPKKYDYMGVEQDDFKIKRHKYEIKDIFEKIGYKYKIGKFEGIFKRAQEIENSTDNKVSCSSFLQAIQEMDHIE 422 T 0.21 DUF4483 unphh F Eukaryota T 8g2z 31 QB,VA 3I,2I I7M2G0_TETTS STPG2 MADKVEEEQAPPKIYTYIDLLKFENPEIYSSFGKMRLSHKTTAPKYGFGTADRNKQAKVFQNKELAKTNFAGKSSPGPAYDVRDTDYFTYQKAPKWKIGSEVRNTLNTGSKHDFYLRKDVDFDPLEADIFRRPKAPTVRIGLELRFPNDPKRHKGTPGPQYNPTLRHEIPNPPKFSFGFRREIQGFSPLVANSSTPQLVGPGSYIQKNVPNTSKIRNEPKWSFPNAERFSGFQADLSNAHYTKPRAIGTQYDSRKQTLPMFSFGKSTRESKRGTFKDMMSTQEVRIRISMPKF 293 T 0.039 SHIPPO-rpt unppercent F Eukaryota T 8g2z 33 BC 4F I7M9I4_TETTS CFAP129 MISKVSGYQGFSSYGDKPYPSLKPGRQVLSPNDVWEQTQRTRDDASMYENHQHYKTVYKVDVSNAVQPKVYQNHTHVKKGLQTQYQLQATGKSVLSYGSDRKDQTFDPNNETSKLKTGVEHWKSNYNANIKDPYSYSKASRPEWSYHLKPHQVDSKIGPTEYKTTFGEFGTKPTDKLNEYGNEIINKHEDPLKMGTTKSTFHIPNYTGFIPAARTVGKSLEHAAALNSRIDKSRATIVDNYHTKIPGYAGHQPKAPVNQRGFLREHCFSTVSSLKL 276 T 0.0059 ZinT unp F Eukaryota T 8g2z 36 HC,IC,JC,KC 5A,5B,5C,5D Q236L2_TETTS OJ2 MPPIDTKKSQITDFSQSTRLQYLGDKKSQKSAAFFRDTRVSSCSYISLKGGVPRAIPYYGSPTKTYADTMSKSSYISSLNHDTYRVRPYQHVGMSQKLLEPYHPHSYRNRLPVPDAPPQFSNASQIEVGDRSEVNHRRFVSQSKNVYGNFGKFDPVSNPGILASKTKWHHHLQSK 175 T 31 DUF6014 pdbhh F Eukaryota T 8g2z 39 VC 6F Q231B2_TETTS SB1 MIRDFVLQQTTEPEKKFNSTVLIGNWYEERCDPNREQSKFYNERKFADNNYQKYTLSENKFQDTSNSWLNFQENKPEVKNDQFITMNMQEYKKPSEQKRNSELKPFIVKKSHFDKNPHELEEYREKWTKSAHTFDRMYLGTQKPN 145 T 0.00073 DUF1143 pdbhh F Eukaryota T 8g2z 40 WC 6G Q24GM1_TETTS STPG1A MQCLLLVFQFNQNYTVKKIAEKEILSNFYLQSRLIDQQNSSYLNMSQSLPSVQKSASMTLMPIMEMYNISTRQHAAWGLDGYEVPKKYFDHLKVVQDRHFEEISKSGKATKNNKIITKRGSYLEDEIKFRGQNPGPQKYDVTYKWVSDAEIEKGKKLPKNTKKNTFIEQIFLEQQRRGIPGPGKYNILKTDEQVKAEAEKMNKKLKYGERSNYLQEYEYLSSTLPGPGNYNPRPILPKIHKDNMSPDKWIAFHKAKLSKTAKSSLPDVGTYKMNYPLDYATFGKMLVKTQEEGGNKKSSVRYMGTEERFKDPKKTKSKTSQIVPGPGQYPLVAKWQGKEQKKDSKDKNWMDSITTGISKSIYYS 364 T 0.13 SHIPPO-rpt pdbhh F Eukaryota T 8g2z 41 XC 6H Q231B6_TETTS Nebulin MTDNPQQPHKSKQEIQREQRKELARELRKAHFDLGFKEGFDDETRYREFYKWYDLEQSEKTKQEMLKLRNDLRSTHYILGTDDPNKLFVSTATQSFVKPVNPQVSQLSVETKNDLRSHHFNLGHYNDKVLSDYKLNYDQKQIDPETLKDRKEQINFLRKHNHDFGDKNNYHSSMYNENFNKSYDPHFLKQGKSKEEIHQQIVDLRKTNLVMGNNNPQFTSEAMSEFNNKPQAFRTQVDLGLKKSHFKLGEDPSLYETTTAKTYQGKQMFQHDPEKIKALSKDLRAEHFKLGNDPQSYTSEAAAKFKEFDKNSLTQQPDMSYLYRSHFNLEGFGGSNPVQHYVSNYKQNYEPKAAQKSEATRNDRADRGSHIVFGSDKIDDQFKSEAQKNFVNFGRQAPSALEKEVQADLRRHHYQFGTDQPEMISEMKKTFNDKTKESSQSKLDPNLIKDLRSNHFEYGTMGNEYTTTMQDIGRYQCQPSRLNPELAKDLRSHHFRPGDLEKYYDTTYRLAFIDFKAV 518 T 130 Malate_DH unphh F Eukaryota T 8g3d 7 G,V 0G,1G Q237T1_TETTS CFAP107 MNAQTIQDNVNKYRFGVLIGNFAEEKFGMDMAQRQIDERLPNSTMKDSYGLKNSALNCEPSKLTPIDKEFNQHVIFNTQGVQQHILFGHGVKQTDYNKREYGTSYDLSFNQKIKPQTQIYSKYTPDLSASARTFHKDTIYEKDYQKHIPELGSKPTVPKDKARPYNEFTKTYDSTHMKIPLRK 183 T 0.058 DUF1143 pdbhh F Eukaryota T 8g3d 19 Y 1J I7MLS4_TETTS CFAP21A MAANRTKDLFPGFKTAGASTKLPDESAMNCIKPKENLNPGTPEHIKKYRKSYKNQPGSTILHYGIYDDQKPPETFVYGKKIEGSDHVQQVMDSGKTDGIKQMINEIKEAKYHSRQVEPLGKRMERNYEFPQEVHQDNFKFGVATINSENTAKEVMFPQKPAVNDQAAHDQYVRSHGNYEAGEQKNRNYNWKVDPQQHRFGKTDKIASNEVVYCLNQEALQDQFPKTTIVQKKQEDFRNFKEDHLGLPKNLGQTNAKNNPDMVFGARLGGPDEWNAGKCISGEATLKEVQTDKDLGKTNRFGFRNITKDGDENRVFGVPTIRNDIQKPLMKSIADPNNYGDEKPAVNLLFPKKYDYMGVEQDDFKIKRHKYEIKDIFEKIGYKYKIGKFEGIFKRAQEIENSTDNKVSCSSFLQAIQEMDHIE 422 T 0.21 DUF4483 pdbhh F Eukaryota T 8g3d 33 BC 4F I7M9I4_TETTS CFAP129 MISKVSGYQGFSSYGDKPYPSLKPGRQVLSPNDVWEQTQRTRDDASMYENHQHYKTVYKVDVSNAVQPKDYQNHTHVKKGLQTQYQLQATGKSVLSYGSDRKDQTFDPNNETSKLKTGVEHWKSNYNANIKDPYSYSKASRPEWSYHLKPHQVDSKIGPTEYKTTFGEFGTKPTDKLNEYGNEIINKHEDPLKMGTTKSTFHIPNYTGFIPAARTVGKSLEHAAALNSRIDKSRATIVDNYHTKIPGYAGHQPKAPVNQRGFLREHCFSTVSSLKL 276 T 0.0059 ZinT pdb F Eukaryota T 8g3d 41 XC 6H Q231B6_TETTS Nebulin MTDNPQQPHKSKQEIQREQRKELARELRKAHFDLGFKEGFDDETRYREFYKWYDLEQSEKTKQEMLKLRNDLRSTHYILGTDDPNKLFVSTATQSFVKPVNPQVSQLSVETKNDLRSHHFNLGHYNDKVLSDYKLNYDQKQIDPETLKDRKEQINFLRKHNHDFGDKNNYHSSMYNENFNKSYDPQFLKQGKSKEEIHMQIVDLRKTNLVMGNNNPQFTSEAMSEFNNKPQAFRTQVDLGLKKSHFKLGEDPSLYETTTAKTYQGKQMFQHDPEKIKALSKDLRAEHFKLGNDPQSYTSEAAAKFKEFDKNSLTQQPDMSYLYRSHFNLEGFGGSNPVQHYVSNYKQNYEPKAAQKSEATRNDRADRGSHIVFGSDKIDDQFKSEAQKNFVNFGRQAPSALEKEVQADLRRHHYQFGTDQPEMISEMKKTFNDKTKESSQSKLDPNLIKDLRSNHFEYGTMGNEYTTTMQDIGRYQCQPSRLNPELAKDLRSHHFRPGDLEKYYDTTYRLAFIDFKAV 518 T 130 Malate_DH pdbhh F Eukaryota T 8g57 1 A K SIR6_HUMAN NAD-DEPENDENT PROTEIN DEACETYLASE SIRTUIN-6,PROTEIN MONO-ADP-RIBOSYLTRANSFERASE SIRTUIN-6,REGULATORY PROTEIN SIR2 HOMOLOG 6,HSIRT6,SIR2-LIKE PROTEIN 6 GSMSVNYAAGLSPYADKGKCGLPEIFDPPEELERKVWELARLVWQSSNVVFHTGAGISTASGIPDFRGPHGVWTMEERGLAPKFDTTFESARPTQTHMALVQLERVGLLRFLVSQNVDGLHVRSGFPRDKLAELHGNMFVEECAKCKTQYVRDTVVGTMGLKATGRLCTVAKARGLRACRGELRDTILDWEDSLPDRDLALADEASRNADLSITLGTSLQIRPSGNLPLATKRRGGRLVIVNLQPTKHDRHADLRIHGYVDEVMTRLMKHLGLEIPAWDGPRVLERALPPLPRPPTPKLEPKEESPTRINGSIPAGPKQEPCAQHNGSEPASPKRERPTSPAPHRPPKRVKAKAVPSKLN 360 T 3.2E-07 SIR2 unppercent F Eukaryota T 8g59 4 D A GNAI1_HUMAN;GNAQ_HUMAN ADENYLATE CYCLASE-INHIBITING G ALPHA PROTEIN,GUANINE NUCLEOTIDE-BINDING PROTEIN ALPHA-Q MGCTLSAEDKAAVERSKMIDRNLREDGEKAAREVKLLLLGAGESGKSTIVKQMKIIHEAGYSEEECKQYKAVVYSNTIQSIIAIIRAMGRLKIDFGDSARADDARQLFVLAGAAEEGFMTAELAGVIKRLWKDSGVQACFNRSREYQLNDSAAYYLNDLDRIAQPNYIPTQQDVLRTRVKTTGIVETHFTFKDLHFKMFDVGAQRSERKKWIHCFEGVTAIIFCVALSDYDLVLAEDEEMNRMHESMKLFDSICNNKWFTDTSIILFLNKKDLFEEKIKKSPLTICYPEYAGSNTYEEAAAYIQCQFEDLNKRKDTKEIYTHFTCSTDTENIRFVFAAVKDTILQLNLKEYNLV 354 T 6.200000000000001E-119 G-alpha unp F Eukaryota T 8g8k 1 A A ACEAB_MYCTU ICL,ISOCITRASE,ISOCITRATASE EVPRKLLEEWLAMWSGHYQLKDKLRVQLRPQRAGSEVLELGIHGESDDKLANVIFQPIQDRRGRTILLVRDQNTFGAELRQKRLMTLIHLWLVHRFKAQAVHYVTPTDDNLYQTSKMKSHGIFTEVNQEVGEIIVAEVNHPRIAELLTPDRVALRKLITKEA 162 T 9.4 Lentiviral_Tat pdbhh F Bacteria T 8ga8 3 C J SAP30_YEAST Transcriptional regulatory protein SAP30 MARPVNTNAETESRGRPTQGGGYASNNNGSCNNNNGSNNNNNNNNNNNNNSNNSNNNNGPTSSGRTNGKQRLTAAQQQYIKNLIETHITDNHPDLRPKSHPMDFEEYTDAFLRRYKDHFQLDVPDNLTLQGYLLGSKLGAKTYSYKRNTQGQHDKRIHKRDLANVVRRHFDEHSIKETDCIPQFIYKVKNQKKKFKMEFRG 201 T 0.041 NAM-associated pdb F Eukaryota T 8gai 2 B,D B,D Bimax2 QSGSRRRRRRKRKREWDDDDDPPKKRRRLD 30 T 0.85 Med24_N pdbhh F T 8gak 2 B,D B,D Thanatin GSKPVPIIACNRKTGKCTRI 20 T 4.5 Fuz_longin_3 pdbhh F T 8gal 2 B,D B,D Thanatin GSKPVPIIACNRKTGKCRRI 20 T 4.7 Fuz_longin_3 pdbhh F T 8gdi 1 A A B2HHT9_MYCMM CYP124A1 GLLPRVNGTPPPEVPLADIELGSLEFWGRDDDFRDGAFATLRREAPISFWPPIELAGLTAGKGHWALTKHDDIHFASRHPEIFHSSPNIVIHDQTPELAEYFGSMIVLDDPRHQRLRSIVSRAFTPKVVARIEASVRERAHRLVAAMIENHPDGQADLVSELAGPLPLQIICDMMGIPEEDHEQIFHWTNVILGFGDPDLTTDFDEFLQVSMAIGGYATALADDRRVNHHGDLTTSLVEAEVDGERLSSSEIAMFFILLVVAGNETTRNAISHGMLALSRYPDERAKWWSDFDGLAATAVEEIVRWASPVVYMRRTLSQDVDLRGTKMAAGDKVTLWYCSANRDEEKFADPWTFDVTRNPNPQVGFGGGGAHFCLGANLARREIRVVFDELRRQMPDVVATEEPARLLSQFIHGIKRLPVAWS 423 T 1.3E-22 p450 unppssm F Bacteria T 8ght 1 A,B A,B A0A0H3LM39_BORBR Putative membrane protein GSHMNQPSSLAADLRGAWHAQAQSHPLITLGLAASAAGVVLLLVAGIVNALTGENRVHVGYAVLGGAAGFAATALGALMALGLRAISARTQDAMLGFAAGMMLAASAFSLILPGLDAAGTIVGPGPAAAAVVALGLGLGVLLMLGLDYFTPHEHERTGHQGPEAARVNRVWLFVLTIILHNLPEGMAIGVSFATGDLRIGLPLTSAIAIQDVPEGLAVALALRAVGLPIGRAVLVAVASGLMEPLGALVGVGISSGFALAYPISMGLAAGAMIFVVSHEVIPETHRNGHETTATVGLMAGFALMMFLDTALG 312 T 1.1E-26 Zip pdb F Bacteria T 8gi1 1 A,B,C,D,E,F,G,H A,B,C,D,E,F,G,H E6K399_9BACT Accessory protein Csx28 MDYMELAKEAFSIICTFIAAYVAYYYAIKQLHQKSVENIEYAKYQAVLQAHKSLYKLLRFTTNTENEDSILIWEKTKDGKQEATYYFRKENIRKFIKELSKEIYNEGCGIFMSKEALSLISEYRNIVYGFMLSAQNNPQETIRITNRESVERMKKIHQNLSIEIRQAINLKKRDLRFENLYFQ 183 T 0.0028 DUF6019 pdbpercent F Bacteria T 8giu 1 A,B Z,Y G1JWB5_9CAUD gp_4 (capsid accessory protein) MANRTVSPSTQGVRPAMRQMYNGRNVATRPIPLIVDTSEIRAIMAAAADARPKTSAVNFPQSGPRPAGAAVVFGTKVSGAPGNVVSNNAATFAPLTGTQNFE 102 T 64 CLP1_N pdbhh T Viruses T 8giu 2 C,D,E,F,G,H,I E,F,A,B,C,D,G G1JWD4_9CAUD Capsid protein MATKELKIGGVPVFPIFGGTAPVRQEGIMTQGDLVTVTSDGIDLNALWNSFAESIAIYNEAMDNLIQLLTYPVTVPVEPVVQIGETTFEEATELGVPRGAGLPIEVFQMGYDLRHYDKRNAYSWMFLADADGRQVEAIHDAVLWADKRLVFRKVMEALFDNRTRRANIRNQAYNVYPLYNGDGVPPPRFKNNVFDETHSHYVISHNSVVDSSDLEDLMELLAEHGYSPQAGTQFLLLANKAETDAIRQFRRGVVNNNGATAGYDFIPSPTQPAMMLPNAEGLLGNQPAPTFGGLAVIGSYGFWNIVEEDYIPPGYLVGVGYGGAFNLGNPVGLRQHANPAMQGLRIIAGNYQRYPLVDGFYARSFGTGVRQRGGAAIMQIKASGAYECPPIYKKGGGFLV 400 T 0.78 DUF5309 pdbhh T Viruses T 8giu 3 J,K,L,M,N,O,P 1,2,3,4,5,6,7 G1JWD3_9CAUD gp_22 (Minor Capsid Protein) MALKTKPRWDKYDGYVGNYRGVLGEDIDLDTEANRVLAVGTNSNGAIVVGAGQTGIKGLMIVAVGADIHGAMLDGGINNHAGDPQDVGKHGEITNFQPTVFGRTFGVAISATEGNVKLAVNGVDTGNIAYDTSAANLKSGIVAVDDGFTADDFTVTGTAPNFTIVTTRTDVTITASGEGVTVTEATSVAAAGTNYYGHADGTVNAVKGSDGVYVGHTQEADRLIVNVKDEED 232 T 14 DUF5114 pdbhh T Viruses T 8gjs 2 B B ACE-LEU-THR-PHE-ALA-GLU-TYR-TRP-ALA-GLN-LEU-DAL-ALA-ALA-ALA-ALA-ALA-DAL XLTFXEYWAQLXAAAAAX 18 T 0.61 Abi_alpha pdbhh F T 8glv 33 RF,RZ,SF,SNA,SZ 8B,IG,8C,Pi,IH A8ICC1_CHLRE Flagellar associated protein MNNNKLDEAAILAGCKGVFSKTSYITHTGQEGKAEEYEKKGGHRSAFAGKQLATAPLKEGKTVDVYFTKKHDWISDKDPYVDRIRYKDSNQEKKKGFYTSDFSKRDEFTNTIRTEQWREQLKGENTHAKKALDMFAEATGLEASQLRTSRKMEPEVFMYDQVFEKEDPGFDGASRTHRDTKNKTMLSRDRANGEMMTTTALAFQAPDEHHKPEHARKPLVRETFFRKTNVFFPEGCAADPST 242 F F Eukaryota T 8glv 38 PG AE A0A2K3CYR5_CHLRE IC97/Casc1 N-terminal domain-containing protein MAPKDAAKGKKKKKTKEELEEERRQAEEAARLAEEERLRAEEAERQRLAELERQRLELLGQFLDAEKARLDAQLSELDPLLRQFEHERSRSRAAAREAAEWERFLRCVDVPHPRQRVPLAEFLRRMHEAATKDVTGSPDGRDLRAAFLAVEACRTVILEARQELLAARHSSELAAPPAAAAAAAGGGGGASAEEAQAAAGSAARGAAGGAAVAEWAEALESDLRTLYGIVNARIDRLTAAVLHHCDEYANDKNEIQLGHVAPMPAWWPTAPGSGSGSAGSQQQQQQGGGGEQQQLAEGAGEGPGGAFKWGVWVNTAKNPRLKAVEMPQLGVTLEIPKQIALANIALRVQQRSGPGVDEYFSRCANAWMAVGGLLAVDLLAMPPGAKKVRGWTLRQVTPLALNVQRVPYPIPPAGADPATWASEEEPPPLGVTAPLPPDVVLLEDPLQVAWWDEAHSIWNTDGISDVAFDGASKTFSFHTTHLAPLALVMRRTRLLPYAGWAVRPTGGRNGNGAAISLDVGLDAPVVIEVSKGAAWLSSPAWPQLASLIGQPMPPLDLLQALSDRGLHLLPEDRDAEAAGVTAKARDTEEAMCRDLALLGGVFLMASSRWNQTAGPEEALARLSEVTDWEEGGRTAPHHLARIFDKEKEDGERRVLVVMRRGAKGVAFSDALNRRPEYPALPGVGSVEAVKECELSIWGEVHASVLTLLRGQFSAPGAADSPLALRLAAAPESLELCRTTSPLFTATLADTMLALRLFSFS 760 T 0.69 Casc1_N pdb F Eukaryota T 8glv 40 RG AG A8J3B6_CHLRE IC140 MEDASAGPPPVDDGEVPAAPADSSPLDDAPASSGAEPGDGGYDEGEPLDNEQAGPADVEGLDDAGEPGAAPEDGEEGTGGEGEAGAGAEAPGDGESPEAAAEAEAAAAAPPPPVELEPLPEDYVPVRDLPPIPEPFARNEEGKPVPLDGVESLFLTGTTIELVGLKELGAANVMGEVSRDELLKDIQFRGAISDFHAYKAKIQAADYEPLLVRFNEDDVYGDGNNFELAVTAAAAAVWRGIGEEVARRAALLELEAAHAAAEKAKPRSKRVRKVKPWQSMGSEVDIEEASVRPRRDPIRLVVQRRRREFNQPNAKLADKDAHELWNSSQMECRPFKDPNFDMRRMEQDVAVQAVAPLRDAATQSTGTVPPRPAVTQTEPLDLPPEAKQDLVRRPRNAPGSVADFLERVRDQCEVALVQNEITNIFRDDLSSLNDEADGGGGGSRKETLVSEAQSFTHLTYSKNKVVSAIQWLPHRKGVVAVACTEAQSHAERVARMGRTAPAHILLWNFRDPIHPELVLQSPWEVFSFQFNPLQPDLLTGGCYNGQVVLWDLSSEADRLSRRAGGGAGAGAAKSSDGAAAGAGGKGADSTPPSTALPGGGGGGGGVDSTSGSSADGDAHIPVIKHRFMTDTQFSHHQVVTDLQWLPGVEISHRGKVTKLGEGSKECNFFATIAADGKVLFWDVRVEKLLKKGKKADELLDLVWKPIHSVHLISLIGMDLGGTKLAFDFRKLEQGMFYAGSFDGELVYADFVKPEGEENPDYAKSCLQAHVGPVIALERSPFFDDIVLTCGDWQWQIWQEGQSTPLFQSGYAQDYYTAACWSPTRPAVLYLADQSGSLEVWDLLDRSHEPSIRVTLAATPIMSLSFNPMPTSASAAQQAAQQLLAVGDATGVLRIMELPRNLRRPVHNEKKLMGTWLERQQARLADVGARQPVRTSARKEAEERKKEAESAALAEAAAKEAAAKDAAAAAAAGMPLPTANERKKDKGPPPPEFDEKAEQEYLKLEARFKAQLGLMPAEANGGPGH 1024 T 0.02 WD40 pdbpssm F Eukaryota T 8glv 50 EIA,LH,MH,XT,YFA Lp,Aa,Ab,Fo,Kt DYI2_CHLRE IC78 MPALSPAKKGTDKGKTGKKTGKQEQNAQDYIPPPPPMPGDEAFAMPIREIVKPDNQLWLSEADLNEEVAKMLTANNPAAPKNIVRFNMKDKVFKLEPMVEQTVVHYATDGWLLHKSSDEAKRQMDMEKMEQEASARFQADIDRASHEHKDHGDVEPPDDSRQLRNQFNFSERAAQTLNYPLRDRETFTEPPPTATVSGACTQWEIYDEYIKDLERQRIDEAMKSKGGKKAAAAARAAGAAHRQRNEHVPTLQSPTLMHSLGTLDRMVNQNMYEEVAMDFKYWDDASDAFRPGEGSLLPLWRFVSDKSKRRQVTSVCWNPLYDDMFAVGYGSYEFLKQASGLINIYSLKNPSHPEYTFHTESGVMCVHFHPEFANLLAVGCYDGSVLVYDVRLKKDEPIYQASVRTGKLNDPVWQIYWQPDDAQKSLQFVSISSDGAVNLWTLTKSELIPECLMKLRVVRAGETREEEDPNASGPAGGCCMDFCKMPGQESIYLVGTEEGAIHRCSKAYSSQYLSTYVSHHLAVYAVHWNNIHPSMFLSASCRLDHQAVGLCHDPKRAVMNFDLNDSIGDVSWAALQPTVFAAVTDDGRVHVFDLAQNKLLPLCSQKVVKKAKLTKLVFNPKHPIVLVGDDKGCVTSLKLSPNLRITSKPEKGQKFEDLEVAKLDGVVEIARKSDADLAKNAAH 683 T 0.035 HTH_8 pdb F Eukaryota T 8glv 51 BI,FIA,IU,NH,ZFA Aq,Lq,Fz,Ac,Ku A8IJZ3_CHLRE WD_REPEATS_REGION domain-containing protein MEIYHQYIKLRKQFGRFPKFGDEGSEMLADIRPNEDHGKEYIPRNPVTTVTQCVPEMSEHEANTNAVILVNKAMSHVEGGWPKDVDYTEAEHTIRYRKKVEKDEDYIRTVVQLGSSVEDLIKQNNAVDIYQEYFTNVTMDHTSEAPHVKTVTVFKDPNNIKRSASYVNWHPDGSVPKVVVAYSILQFQQQPAGMPLSSYIWDVNNPNTPEYEMVPTSQICCAKFNLKDNNLVGAGQYNGQLAYFDVRKGNGPVEATPIDISHRDPIYDFAWLQSKTGTECMTVSTDGNVLWWDLRKMNECVENMPLKEKNSETTVGGVCLEYDTNAGPTNFMVGTEQGQIFSCNRKAKNPVDRVKYVLSGHHGPIYGLRRNPFNSKYFLSIGDWTARVWVEDTAVKTPILTTKYHPTYLTGGTWSPSRPGVFFTIKMDGAMDVWDLYYKHNEPTLTVQVSDLALTAFAVQESGGTVAVGTSDGCTSVLQLSTGLSEASPAEKANINAMFERETTREKNLEKAIKEAKVKARKEQARRDEVKDNVTEEQLKALEDEFFKTTDPAVGGGYGAGEGAAAE 567 T 0.35 Stap_Strp_toxin pdb F Eukaryota T 8glv 56 AKA,DW,HM,IM,MJA,WJA,YJA Mb,Gk,Cc,Cd,MN,MX,MZ ODA1_CHLRE DOCKING COMPLEX COMPONENT 2 MPSADATRGGGSAGSMGKGTLGAGDTLGHKSVLDKQRAAIEKLRAQNEQLKTELLLENKFSVRPGDPFAQALINRLQDEGDMLARKIVLEMRKTKMLDQQLSEMGSTLTTTRNNMGGIFSAKEQSTAVQKRIKLLENRLEKAYVKYNQSITHNKQLRESINNLRRERIMFESIQSNLERELAKLKRDMADMIQQANGAFEAREKAIGEMNALKAQADKEQQGFEEEWRQLTTIIEEDKKERERARAQELAMRERETQELLKMGTLSSAEKKKRITKGSWNVGYNKAMAQNVAAEKVEMYGQAFKRIQDATGIEDIDQLVNTFLAAEDQNYTLFNYVNEVNQEIEKLEDQINIMRGEINKYRETGRELDMTKSRELTEEEARLAASEAQSQLYEKRTDSALSMTTALKAGINDLFERIGCNTPAVRDLLGEEGVTEANLTAYLGIIEQRTNEILQIYAKRKAQQGTDGLAEALLAQPLTQPGNRIIIEPPSTTQEEEVEGLEPEPVEEDRPLTREHLESKVQRTLPRKLETAIKVRPAGADATGGKRGSPTRR 552 F F Eukaryota T 8glv 65 DP,OJA Do,MP A0A2K3DCF8_CHLRE FAP44 MAEPGEDSLPVDGLAEVNEQPASNPEQQAIVDVAAPAESAPDSDGDGVAETSEPADADEPAQSGSGEEAAIADETGSKPPAEAVATGTPETMPEEQPAEEQEPELRTDAPAADAEATDAPEEQPQEASATEAAPGAEAVEDVGAAAASNQDEKCTPEGPCSAVPDGEPRQADAEAPVPTPAAAAAAAAAASAAADQQLAGKPTTDAADGLTAEPTDRVPVPDPQAVGAQDGPAGEQADAEGAASGGPLAASAEEAANAGHAAGPEDRAALEAAAVAELDAASAGAGDAAAADGAACAPATPDSQAEDQPQPHAVAEAVTAPAAAPPAAPGSRTSSARSAPVAATVAAAEPAPRPPSATPPAEPRPQSGSSRTLPPPAPPPSLPPASAASSGVAQLLSVHGLDTHRRNNLVLLDEDTAASCIAGQLVLLSLSTGARRYLPGRDGGGVGAVAVHPSRTLLAVGEKARPGPASAGPAVYIYSYPGLEVVKVLRGGTERAYSALAFDGERGDTLASVGHFPDFLLTLWDWRQEAIVLRAKAFSQDVYGVAFSPYFEGQLTTSGQGHIRFWRMASTFTGLKLQGAIGKFGNVELSDVAAFVELPDGKVLSSTETGELLLWDGGLIKVVLTRPGSRPCHDGPIEALLLDRPAGRVLSAGADGRVRMWDFGAVNDAEPREDSHSLELSPLDEVVVAEGAALSALLADSGGRRWVVADKAGNVYTVALPPAGPVGKGAVVTRVASHPAGAVAGLQLSARTHTALVASADGCLRALDYVSGAVLAEAATPQRITAFTPLPAASPACPGGAMTAATGYRDGVVRLHARCAEGLALVGVAKAHKGAVAALAVSADGGRLVSAGEDGSVFFFDLTAQPQPGTGVPGMPACGLLAPRAFIKLPSGSGTVTCGVWEAAEGGGVLLGTNRGTILSVPLPPPDLNTHHSYEWAAGTSAVSSYQLVVPKPKRPKKKKGKNDGEEGDKEGGEQDEGQGGEDKGGEQADGEGGSKEGGEEGRAAEEEEEEEADDEADGGAGGPSSTTGELISLTLAPNEPGALLVTAGGVGHAARKAWRVRMGEPLAAPLLEGFASAPVTCLAHAGPEGRLALLGSGDGLVRLQALEEPFGSAAPGALPLWEAPLHDMQSGRVSGLGLSHDGAYLVTAAADGALHLLALALPPELAPPPTTQPGDEPLPGPAALPLRPPDVLAAAAYTLEEEKQQAERDQQVREAEEKKLSVRQRLGLIRAEFEALLAENEAAPEALRLPRADLEVDPGLRALMEAEALRREEVARLELAWESERQRLGLAKLRRYFLDGLESERVVLHSLRGSSTVTTFRVAKLSDETRAELAAMRQAARAAAAASAAAGGEGGAGGRDTDARGKGSDTGGGPGGDAAARARLAEATAALEEGTASGKLNKADLRRLARKRREAEWAAFNGTRPDDTYDSPADLAAIEEARRTIGDFKLKSDPNYVVPEEERLTPQRKRLAMLELEEALHDIAAAFNAKFFALRDVKRKVLADVRVKLAALAELAAAAGAATGGADPDATAAAYLAPFSGLPSGLLPEEEPAEAREAVTDADLAAFAARKAEDERKAAAAAAGGLGGFAGAAAGPKKPAAGGAAPAGGALAGGAAGSGSVAHGAGGPSAGGQQGLTAAEEALAKMMAAVPQSELEKGLAAYNRRRVEHMRSKLSEEITAMLDAFDDAHSALKAEKLGLEADVKAGQMRLLVGLQELQLLREFDKRESVLLAKRQAKLDDKQEIVDKIAECTDKLETKRLELEGLVARRAAVVAELDAVVPESDPFREALVRVFHRRIKRSKKKAGGGGGEDDYDSEEDEEDEDMGDDEVDDDDDGGEEVCPPGCDQSVYERVCDLREKRLDEEDMIAEFTKTIEVLRKEKEALAKKQRLVEQGLAAVNADMAEFQKEKQGRLNQVEVIVALRMHQIEYLLDGCLPDDLSACLVFSASQLRRLQARVDELEEEKAGLRAAHKELRRQHAALLRDKADKEARVAELEARAHDVQMLKFGQVIDLELLDRVSSSRGTEELREDLKKQELAYARELAEWDAKINARMDELVVLTRENTACLNAVSELTAAQRRLESGLTATRKGLFADPVQQRRAEVEERDALVALVNAQAAELDRLKGQLLALRRKDTSMYA 2141 T 0.026 Macoilin pdbhh F Eukaryota T 8glv 69 LQ EM DRC6_CHLRE FLAGELLAR ASSOCIATED PROTEIN 169 MAPKKKGGGKKKKKDDGAEPPHDGSWERAVESGTWEKPVTDLPDANTWPTWGALRERVLTACREIKINNTASLRDAFANELVKLSPPELTLIDLRGSSNLHNFNLSPMTTCPKLTDLDLSECAGLDYVLLQSQTVRSVNLRKNPAITKALIHCPRLNKLSITDCPALETLMLWTDELTELDLTGCNNLSVVKLQCPNLLDSKIPPLKVAPQHVKPSHPPIASLLKENLTTAAHKAAADKEALAGVKDTSDSIIPHVFRPF 260 T 1.4E-05 FBXL18_C pdbhh F Eukaryota T 8glv 114 DHA LO A0A2K3D574_CHLRE Flagellar associated protein MDGNFVADVRLDDSDEVLQLPIVKSKVKKLLQGAVKKIALLGAPVIPSSDDSLEQFLQSAGRFFGKDPAKWDQVGETKVDVVEEAGKRTTKLTGVFTGTELALMVENPYYDERLPAREDKPELNFSQKRTPIRTDDEWQELIAEQPWTASKRRQLLTAYLHAKVEAEEPVLATEGAQGFWELAINKDHHADFRLDRLAALLNRLSSPSLEVATTTAAAIWGLATTGLSRKNLADLDIVSLLLSNIKRSFKMPVIPDEATLAAAAAAAKAAGKDGDAAAAAAAAAAAGGDEGGGVGGKPAAGALPEAQRNKYQSFLLGALSVLLIDRNCRRAYLQQEPEFGTLFVLARNLDGYEPGHAAARREAAAKLLTTMVQRDADARRSLIASGALRNVISLLNPKGPGENMIQFCAASLLATLVLDDDAMELIRDRGEAPLMFEACIVLLQSTLGKLKREVQRFYGQLTPEEAASTPPFDVELGVRLGEAASQAMWGSAHYCVMMDPIQVKMDHIQQLGVMGNDCYTTVALPLSRIAHCITASLATLAANPDAALLIMTSPNDVALVFLMSMLDCVETENFEQAGHVKASACAGVAFLACHPIGAEGDECMFGPFRQKLLGLGAFGALLRAALSSVLESDCDRIIQQAAAIGLMYLSTMAGAVDAAELAMYAALLTDSDNSEMIEFLMAGMWILLRDGNNRKVLGTSFNPSPANALAKNMINKLNDAITLHEINDEVAAKTAKLKQTMAGRGGEDEGGEDSGAVTAEPSALESMAAASASAPSPPPPAVEGEASATAAAGAAEGGEPGAAPGEGVGAGAEGGAGGEGGLPGEVGDAGAGPAPGEDGGEEGLLSPEASGVNVAMEPEPAAAAPPPPAAAAPADEGEEQLERDPDEAFAMPPDTNMPDGEKFYENKDTAFPSPMLMKREESADRVRRKAEAVKGRMKQLEKRFDKQLKDNWGLETLVSVGESWLPAMLEQDEVGEATDVPVLKLFEFLVASICMFMVDDDGVPERRELDVFRLSAPEGARNKTWWTVDVRAPEADGTVDSDTERALRILLQILGMHLSAAWKSMQLGVLTLWNACCRHPNMERHVVERGVALKLLMVVNNPMWPPSLREISAGCLEFFQERWSNLATFGAGATLLPGGLPAEVSGVSAGGVPPEGVVPYIAAMVGLVNTGVPLMEYRGCHGLARMTYTAPYACPEPKPFLKEAKAVAAALGGVEALVALMKRLNRRYQDLCGPAGLGGSLRATGGAAGGSGGAPPPGGGGGAAAGGGGGGNSNRGPTGQGEEPENPAMFERDMQNLEAVQDIYFVCMAALLNLSVLRGNQVPIAKRGLLVLLGTNTVFYNRVVVLRANLNATRPGGAHAPSAADALAREEQLLHLCSAIIQNIAQHPQNRTRMYKAELKGSVALDKVIEAATDVDEETRTAASFLPTIPSTRSMSPSAVPSAASLGRGGRSAGAAGGSLHASASAARVGAGGGGSTSPTRAATTGRLAQNAKMTQNGQVINGGVDTALAGSVRPKVVFPPICERGAGGDAITLQRYGPGGAGSPGVMRMGSQHSGRSGSPGTADTSGMDYNEGGGAGGAGMSHEAITDSRYRFLTWIDNTFHDLEAGANVPFSKKGGADDRSLASGSGRTYRKALWDEHGDWLPNEPESAKALNKLLARPMSHLWQDMPEHRARQGRQRWEPTVSEYRELQGAKPLTRPAAKLLSTLPPRDQEDLMVAASQMIGLMPEDYDSDEDEPLGPRGDGDGGFDGGAGGAAGPSGVEASGASGSGGHGAHHRKAGGAAGAAGAGGAAGGAAGGAAGGKKAGPPPPSNMAAMLAGEATMEIAKPTVSASLNWNDGPPNRPRTAERDNGRVGLTVLAAPPEALQATAARKAAEAAAASGTHFGAVGMDISDEALAAAAGSSTAAKADAVPLKVCLGPKRPRQIITFEDRIVIDNDNRPTLTLFEHVEGSRVSDGLFPSYILPNGKRAHMYYNGGTLLDEVGVEAVIPPPRPSTVPQALQQTMPLANVLNLIAKPPGSAPPFIPYKPVPRLVPLPPEHTLTVKRPDIHAAEAFGDLREDNLQLVIQAKKIIKTQTTTRVENIEVKQQEEREPWTLPSSIFKNRVKECDARAFFDSHTVEEKMFERDWQRACAKEKFTSMMSRENKANKEGKDEKVAIKEVHDVLLKYWPQVVGAFVYYATGGSSDPYHMSLNAFTTFLDECCIADSESQYCKRSDCDTVFIVCNFQPDKKSAEAQVNMENAMMRYEFLEAIVRLAISKYGKGQATDDLPTAVTMLLEKNIIPNLVPGAVIQSNTFRSERLYHEEVDLVFKKHSVLLKALYSRYRLKPVGGGLRPKVLKLDGWQQFMNDASLVDSQFTLQDASLAYLWARMYTIDEIKDYARYTCLSFTDFLEALGRVADMKALPAASDLDLAGYDSVLEWALDKERMEGGPDKGGQGQGGDGGEGGAGGGGATLDIFRPRPSAGFSAPKTRPLYVKLEMFLDLVFRRLYWDPSQPEVPFNYDGLLKLVKKIDKELGP 2520 T 4.7E-05 KAP pdbhh F Eukaryota T 8glv 119 THA Le A4PET3_CHLRE Subunit of axonemal inner dynein MATLTYTVFSLGEAQLHQLHTSNGKLFVMGEVAVELFQESPTAFLQELRKNKLPKLQSANRDVLHTVAELHLPVESSANSQGVCLLPAATVETLLVDKRRMELVQPFKLALLKLASQEAARLMAAGEYELALPVALDAVQQGQALFKPAPALQLFPLYLLAAQANLGLRRAKQCEDFLALASWLAMKEPGLTTSIMKSQLSRLYGQLYAFQSKHAEALHAFAEDVYYCSLEYGPEDVRTSLGYYNMGKVFQSSAELDKAASCNDQVVAIWAAALNAVVLGLADGGGAAQPAALPVGRLQLMEVVDMLTDIARSRAAALGSGHVTVGEAHLVTALACIQLEERGRAGEELEAAAATFGEDDVERLRLVEMARVMLNALTGG 380 T 0.01 TPR_12 pdb F Eukaryota T 8glv 127 ASA,BSA,CSA,DSA,ESA,FSA,GSA zt,zu,zv,zw,zx,zy,zz A8J0T8_CHLRE FAP1 MSGPIYPSTLRYKDRLDCGKDDAFTYNRLYTATQGSDVWARLTVDASVRQSSARSRGSFQEGQAMVRHSFKNSGFDSNTCPAVLTHSTFKAGLYSYGVPIEEQHPITARRFKQKQPLEFMTQIRPHTETTNQALRMLGTYVSDQPHADRLGMFIPAGCPGGKPAYHPDVTTGGFGLLPTLPRRGMGATLTDGRNLK 196 T 17 DBB pdbhh F Eukaryota T 8gnn 4 D D RAD17_HUMAN HRAD17,RF-C/ACTIVATOR 1 HOMOLOG TDWVDPSFDDF 11 T 0.29 DUF4088 pdbhh F Eukaryota T 8go8 2 B,D V,U C5AR1_HUMAN C5A ANAPHYLATOXIN CHEMOTACTIC RECEPTOR,C5A-R,C5AR ESKSFTRSTVDTMAQKTQAV 20 T 24 DUF4355 pdbhh F Eukaryota T 8gok 1 A A Q5ZTB4_LEGPH Legionella OTU-deubiquitinase A OTU1 domain GIPATGDGACLFNAVSIGLSVEILSGRLDSQLDTPGYQALLDEFAKHHPQFNPKSWKTLKEWLAYYNDTRDIELILAPVLFNLNQKYQDHLDEEILNELTNLVWKNKANIENGQAWFQLQNTGDLGEALFPKLENLDLKKDRAPLLDKLREILKDYKLELTRENVKQFLTEKAKELLSALKKKISSDPHAFQRGYSCDELKGMTDALAISLVENREEDITDNRIKIRLENQEEHWNVLCNEEDSERFLDSTPSRLKMTSLEAYRGDKQVSAPT 273 T 0.07 OTU pdbhh F Bacteria T 8gp3 2 B,D V,U CXCR4_HUMAN CXC-R4,CXCR-4,FB22,FUSIN,HM89,LCR1,LEUKOCYTE-DERIVED SEVEN TRANSMEMBRANE DOMAIN RECEPTOR,LESTR,LIPOPOLYSACCHARIDE-ASSOCIATED PROTEIN 3,LAP-3,LPS-ASSOCIATED PROTEIN 3,NPYRL,STROMAL CELL-DERIVED FACTOR 1 RECEPTOR,SDF-1 RECEPTOR GHSSVSTESESSSFHSS 17 T 55 DUF5582 pdbhh F Eukaryota T 8gpn 7 K K MEN1_HUMAN Isoform 2 of Menin SMGLKAAQKTLFPLRSIDDVVRLFAAELGREEPDLVLLSLVLGFVEHFLAVNRVIPTNVPELTFQPSPAPDPPGGLTYFPVADLSIIAALYARFTAQIRGAVDLSLYPREGGVSSRELVKKVSDVIWNSLSRSYFKDRAHIQSLFSFITGTKLDSSGVAFAVVGACQALGLRDVHLALSEDHAWVVFGPNGEQTAEVTWHGKGNEDRRGQTVNAGVAERSWLYLKGSYMRCDRKMEVAFMVCAINPSIDLHTDSLELLQLQQKLLWLLYDLGHLERYPMALGNLADLEELEPTPGRPDPLTLYHKGIASAKTYYRDEHIYPYMYLAGYHCRNRNVREALQAWADTATVIQDYNYCREDEEIYKEFFEVANDVIPNLLKEAASLLEAGEERPGEQSQGTQSQGSALQDPECFAHLLRFYDGICKWEEGSPTPVLHVGWATFLVQSLGRFEGQVRQKVRIVSREAEAAEAEEPWGEEAREGRRRGPRRESKPEEPPPPKKPALDKGLGTGQGAVSGPPRKPPGTVAGTARGPEGGSTAQVPAPTASPPPEGPVLTFQSEKMKGMKELLVATKINSSAIKLQLTAQSQVQMKKQKVSTPSDYTLSFLKRQRKGL 611 T 2.2E-24 Menin unppssm F Eukaryota T 8gqa 2 B B precursor peptide analog MslAdeltaW21 CLGVGSCNDFAGCGYAIVCF 20 T 0.37 CCAP pdbhh F T 8gqv 3 C,F C,F As64 ALLRSATYY 9 T 9.3 BetaGal_dom3 pdbhh F T 8gre 2 C C F-box protein UCC1 MNQSDSSLMDLPLEIHLSLLEYVPNELRAVNKYFYVLHNHSYKEKSLAWIAEDNYIWAVVKHSLCLYVKSLDPLRQHAREIIQETKEPGFNVPLCMTKYIADSWYIVYNALQYPGKIINMGWDKYTKSQDLNGSDSTSNFNSRPKERTLMQSLTALPVNFWSRKKDEPTPVNVWFYVKNAHVARYIPKIITEIGICNYGPKQIVASAGYINELITSEGIYCVNLGHLPRLYDEQIFEGTGTTHLPLELKAIDRTDSDVCINSDLVLLGYDFIPYQISKPWLLFRIEPVNSIEAIFNYSECSFSYQFAWSLACLQSEEKISFPRDTIIGHGLPYKPSKLIRIFVYKHPEQKQDLGQEIALPNWNTPYLRR 369 T 0.058 Elongin_A pdbhh F T 8grq 2 B,F B,F A0A0P9AXL3_DROAN Histone H4 LRDNIQGITKPAIRRLARRGGVKRISGLIYEETRGVLKVFLENVIRDAVTYTEHAKRKTVTAMDVVYALKRQGRTLYGFG 80 F F Eukaryota T 8gtb 1 A,B,C,D,E,F,G,H,I,J,K,L,M,N,O,P,Q,R B,A,C,D,E,F,I,G,K,M,O,Q,J,H,L,N,P,R A0A4Y6EGR9_9CAUD Major tail protein MLKGKDGVVKNASTGDSIGHLQSWALDTQRDEVSGWGMGDDAERAFTTVGRASGNFEVYLDPADPSDDLEPGDLVDLELYPGGESTGSGYRSVAGALILSTAESASKDGIPMLTVNWRTSGALPQKATVS 130 T 0.11 tRNA_anti-codon pdbpercent T Viruses T 8gtd 2 B,D,F,H,J,L,N,P,R,T,V,X M,N,O,P,Q,R,S,T,U,V,W,X A0A4Y6E755_9CAUD Head-to-tail joining protein MTVSIHPPATLVAGDSWAWEAGAVFEDHPDPWAASYVLRPEAGGDPVTVSGGLEVLAPVFRLPASVTADLPPGEWTWFAVAVDATTDARAVLAQGRVTVIPDPLAGTEDRRTPARRILAAIEATLEGRATKDADTYSIEGRSITRTPLPDLLRLRAVYAEQVARETGRSPYRQRRVSF 178 T 0.057 DUF6148 unppercent T Viruses T 8gtf 1 A,B,C,D,E,F M,N,O,P,Q,X A0A4Y6E757_9CAUD Head-to-tail joining protein MIESLADWSIFTDPDVFGEPVTWTTPPLPDPVPAIFTDASEDRPATLGPGVLTIAPTLTLGAAQLPFSPARNHRCTVRGITYRVAEVQPDGSGGLRLLLERV 102 T 0.0008 Phage_attach unppercent T Viruses T 8gtf 3 M,N,O,P,Q,R k,l,m,n,o,p A0A4Y6E8T3_9CAUD Terminator protein MSEAIIAAARGRLISPPFSDATGDVYRTPEAALPAIIVELDYTDAERISMGGGFIASAELRVEILAKRDDWSLLTPTPANTAEGMARLAALVRTAILAPPSDLSGLAWSIAPAGYEFETERGETPLARATQSFALQILQP 140 F T Viruses T 8gue 1 A,B A,B PIKC_STRVZ CYTOCHROME P450 MONOOXYGENASE PICK,NARBOMYCIN C-12 HYDROXYLASE,PIKROMYCIN SYNTHASE CYP107L1 MRRTQQGTTASPPVLDLGALGQDFAADPYPTYARLRAEGPAHRVRTPEGDEVWLVVGYDRARAVLADPRFSKDWRNSTTPLTEAEAALNHNMLESDPPRHTRLRKLVAREFTMRRVELLRPRVQEIVDGLVDAMLAAPDGRADLMESLAWPLPITVISELLGVPEPDRAAFRVWTDAFVFPDDPAQAQTAMAEMSGYLSRLIDSKRGQDGEDLLSALVRTSDEDGSRLTSEELLGMAXILLVAGHETTVNLIANGMYALLSHPDQLAALRADMTLLDGAVEEMLRYEGPVESATYRFPVEPVDLDGTVIPAGDTVLVVLADAHRTPERFPDPHRFDIRRDTAGHLAFGHGIHFCIGAPLARLEARIAVRALLERCPDLALDVSPGELVWYPNPMIRGLKALPIRWRRGREAGRRTGLEHHHHHH 424 T 3.9E-36 p450 pdbpssm F Bacteria T 8gvn 1 A A TRP-LEU-ARG-ARG-ILE-LYS-ALA-TRP-LEU-ARG-ARG-ILE-LYS-ALA WLRRIKAWLRRIKA 14 T 1.6 DUF3349 pdbhh F T 8gym 12 L,PF 2o,2O SDHTT11 MGLPIRNIQFARYHYLAAVTVFTYFATRCCLLDYKKYYPLASVKKI 46 T 15 DUF4519 pdbhh F T 8gym 55 CB,GH sd,SD SDHD MFKELIHIFRTYFITFRYLKKSNINFLKNLSYTLIAYYLIINFM 44 T 17 DUF1869 pdbhh F T 8gym 87 IC,MI b4,B4 NDUB4 MALRRILANQAKLNQKLAQTNRNYHYDFCGRAYMGNPAVQSPPKEFFNYHYVPDNYPDALSGFRIAYRDPFEVQHVFAYENWEYQYDGQWWSMGSLACNVLFFCTPLFLYLILQVEELNSEKRGGTSNKFYHNNAGFFHFQIYDNKQ 147 T 1.9 DUF3930 pdbhh F T 8gym 142 AL,XE TF,tf NDUTT15 MNNLKGSNCLVQNVAFNFSQRGRDYTPSNKKYLQPWELERKEYVELSLAIQSAYSCKMLSEILKDNLYMLTDYQLSFAMFHLWNHEIPIDNYFYNVISPILKEYITRFDRECNKSLAEIATFLGRMNVQDDALWKVIETKLVQERLYRYIPLNDLIDLAHGMATANRGSQEFYNIVENVIIKHRLRLIPDKIAVAKDCFTARKIGSPLLYQVLENPQAEAHELAGLKEHEQLKISG 236 T 0.83 FAST_1 pdbhh F T 8gym 143 BL,YE TG,tg NDUTT16 MASQLQREQKLVQSLQQESLQPHLFKIIVDSQSDLVCEADRREYIKHYTRANEKSSTSQLLQVGALLGYIYAVGRYVSNPSTRKFSYGLAALLGSFSLLNPSKNLHHNHSLREIYSKYNISTNPQALEILKSRIY 135 T 22 ApoO pdbhh F T 8gym 144 CL,ZE TH,th NDUTT17 MNISYTGLKLEDYSDEVIRKYKFPNSNELERFLNREQTLTVQQHKSAIKLAQQDFFAVAGLLSVGSLSYIFYNSVGGKVIRDRIRASMPFPKRVLVQVLPFVALGTALIISRRGIEGHNHGYKQ 124 T 0.12 EMC3_TMCO1 pdbpercent F T 8h0g 1 A A VGF_HUMAN VGF-DERIVED PEPTIDE SAQEEAEAEERRLQEQEELENYIEHVLLRRP 31 T 9.3 TSKS pdbhh F Eukaryota T 8h0i 2 C,E,G,I C,E,G,I Viral infectivity factor MGSSHHHHHHSQDPMENRWQVMIVWQVDRMRINTWKRLVKHHMYISRKAKDWFYRHHYESTNPKISSEVHIPLGDAKLVITTYWGLHTGERDWHLGQGVSIEWRKKRYSTQVDPDLADQLIHLHYFDEASEGSQIKPPLPSVRKLTEDRWNK 152 T 0.052 Vif pdbpercent F T 8h2i 17 HD ch Q98534_PBCV1 ENTRY/FUSION COMPLEX COMPONENT MWLFFFALAVIYMIYKRDVFKKIAVNLKMNGVSIPFVDKYSKQYPTYTKNALFHVTRFNNAYQKTFEYKNISIDTINNLFSIRDDVLYNISEIKLRLPNDLTQEKEINYMYEKTDQRLMEYITDVKSRFHINIYPGTMSSAFEARNYRASNDIVF 155 T 0.19 CortBP2 unppercent T Viruses T 8h2i 18 ID ci Q98530_PBCV1 GLYCINE-RICH PROTEIN MQGGLFGTIKLMIMLFSYFAAYQLGKMQERPQSQWPKAKAGQNKYMVGDWAAWKPIYMGVLGVAVLLTLLGPGGVGGGMGGMFGGGGGYGGYY 93 T 0.00098 DUF2062 pdb T Viruses T 8h2i 19 JD,KD,LD,MD cj,ck,cl,cm Q84629_PBCV1 P17 MGAFTSFVLMLLFTGIILIATNELTYNRPREIQYRYLPRDLDSFIRTQEMPSAIFSSMWDVDTRRGGDGGPNPPGIRQSN 80 T 0.13 Ac110_PIF pdbhh T Viruses T 8h2i 20 ND,OD cn,co Q84533_PBCV1 TRANSMEMBRANE PROTEIN MTTITYDTDLLPPPELKVPSLDQALAPESVKNDDPFLDLSYFPVPKGFDNVGSLELNNLSTAEDVATLQNQLNKLAEEKHKRSTWKGLTFRIAVQDMWEALTGIPTDIYQNSGRVSLKELLTRDDRLRGLGLIFFLVAVVSIFFLAAG 148 T 0.19 BMP2K_C pdbpercent T Viruses T 8h2i 21 AE,BE,CE,DE,EE,FE,GE,HE,IE,JE,KE,LE,ME,NE,PD,QD,RD,SD,TD,UD,VD,WD,XD,YD,ZD cA,cB,cC,cD,cE,cF,cG,cH,cI,cJ,cK,cL,cM,cN,cp,cq,cr,cs,ct,cu,cv,cw,cx,cy,cz Q84602_PBCV1 A2M DOMAIN-CONTAINING PROTEIN MFLYYKMNEVLNVNNGSSINVIKILKTPGPDIIRPGKTYKKKDVFTPKFFKNGNVMYTCNTFSLNVPVNNSIATIDFAEHVNGAVFKIEYNRVNFIAPSLYPISGLGTAVVFDLQKGEKASQRKITEFGNKDIRIADEISDIAADDHSVLITTKLMSESTPGELSRDVVLNGEIATGRINMNTGFVSDIIHTKKISIIDDGIVDYYVKINVPAKYSHGIVEVVSGTFLNDIMIHLVRNKNKWALMDSKMYIVNNTNGFVIAKNPDTAMGVSLLEWPRGAVVFPPYYNFKKFKNVNKWSISQQIASCVNTSVKIPGGEYSWKIRLFFGPIWQVQESINSVKTVPHGNDYMYLYKEEIVKYKYQRNTKRNFHEYDYDYKF 378 T 0.11 Fer4_16 pdbpssm T Viruses T 8h2i 23 BF,HF,PE,TF db,dh,cP,dt Q89349_PBCV1 P21 MSWFDPNWNPFFNRSIIVSGNLVVAGGVIEGNGGGLTNSAFSNPDTIRTDLTGNVTSNGTVRSSLIQTNSIVASGDAYANVWNGNAAFITGISANTTLVGNVRISISGNLTGNFAVSTTTVSNTMTANVLIGTTLTVNNLSATRANVQVLTGNVTGNVRLIANVVNVAAYTVGNLTTSSNVSTTNLNVTSNINGNVRSSTANVQGRLSATTPTTGSIISSSFIANSFTSSGNVQIGDFFGNVRASVGTFDMIASNVFSNVANIIEIRSDIISNIANVGNGTFVFVNASNITTTNITGNITSNVASFSNVDSIFGNVGNLRANTLTVTYANITGTLASGNVAVGNLISADNTSEFANITSFTTNGLVVNGNIIAAIGNGNVFLANVITANSLTLQSSSSTAASNVTTTRWIQSTTSTSNVMTIGNITSGNVISSNAAYFANVISNTITVMSAMTVSGNVSFGNVSFGNVSYGNITANVTTSTGNVSVDNLTSNVVNVVFCNASSVTANTLTSIALTGNTYVYSANTGVLTANVGNVFGTLTVGVANVTSMVSNALFANTAVVSNLSVSNLTVTISSDISNISITGNTSGNVFAANIATIGTLNTANVVANLVASNVMNAWVTSNIRTLITGNANVSTTTSNVITTGGFTITGNITSGLLTSNVIAGNITMYNTSNTTLFTSNTSNIANFFAGNMTAVNTIVSNLEIQGNSVVITQTTPFKVTSVLLANVLFANTIISNTFTTTANVVGNISTDIANIGIANVNFLNTTDFAVNTANIVNYTPTSNINVTGNLTLGNANIVNFYATSANIVGNITANNAVITFLTTPFYKGSGTVSAGFTSIISANALSNVIVFGNLSANIVVSNTLLSNSINVATVFSNVVNIGQVTSTGTTLVNSINFNISNVDVSGNVLVTRDVYTTNISVTTANIANVTFFSNVAIGNLISTRNLTAANLISSSDLFNSGPYTSSQNVTIDTLNFTAGNLGTVAARTTITTPTLFATDVNFTQDALVAGELITQNFYGNITSANIGSRITIGNANVTQTNITGNVVIPRTSNAAGFNSRLLISNNTTVTSNIVANSLISTGNIITNLLQTTGQVSFASLAVEYIDVANLAVRNVVTIGGNLTVSNVANLFSITANTVNMSTVTTNTLSANTITGTSNVLVAGNIIGNCFGNVLVNRGVVTGNVFADTITVPLNAVGVLTGNALIVPNTALHSVAITGSSAFTSNIRTLNTPNAFIWNTSVPSSGLLDARRLRFSQYITSNRIVEMRYFYITNALPQKYDSGGLNARYDLSFGTTLPTSQGQTWHHFLNTLYIGAQYNTNFSINCLIINATTTGLEVIIYNPFGIAASLSSSTPELYFYITSIATSTQ 1369 T 47 GMP_PDE_delta pdbhh T Viruses T 8h2j 1 A,B,C,D,E,F A,B,C,D,E,F Q6PVL0_9CAUD p26 MDNQHKKIKGYRDLSQEEIDMMNRVKELGSQFEKLIQDVSDHLRGQYNASLHNRDEITRIANAEPGRWLAIGKTDIQTGMMAIIRAIAQPDSF 93 T 1.6 S36_mt pdbhh T Viruses T 8h7g 4 D D SP20H_HUMAN P38-INTERACTING PROTEIN,P38IP MQQALELALDRAEYVIESARQRPPKRKYLSSGRKSVFQKLYDLYIEECEKEPEVKKLRRNVNLLEKLVMQETLSCLVVNLYPGNEGYSLMLRGKNGSDSETIRLPYEEGELLEYLDAEELPPILVDLLEKSQVNIFHCGCVIAEIRDYRQSSNMKSPGYQSRHILLRPTMQTLICDVHSITSDNHKWTQEDKLLLESQLILATAEPLCLDPSIAVTCTANRLLYNKQKMNTRPMKRCFKRYSRSSLNRQQDLSHCPPPPQLRLLDFLQKRKERKAGQHYDLKISKAGNCVDMWKRSPCNLAIPSEVDVEKYAKVEKSIKSDDSQPTVWPAHDVKDDYVFECEAGTQYQKTKLTILQSLGDPLYYGKIQPCKADEESDSQMSPSHSSTDDHSNWFIIGSKTDAERVVNQYQELVQNEAKCPVKMSHSSSGSASLSQVSPGKETDQTETVSVQSSVLGKGVKHRPPPIKLPSSSGNSSSGNYFTPQQTSSFLKSPTPPPSSKPSSIPRKSSVDLNQVSMLSPAALSPASSSQRTTATQVMANSAGLNFINVVGSVCGAQALMSGSNPMLGCNTGAITPAGINLSGLLPSGGLLPNALPSAMQAASQAGVPFGLKNTSSLRPLNLLQLPGGSLIFNTLQQQQQQLSQFTPQQPQQPTTCSPQQPGEQGSEQGSTSQEQALSAQQAAVINLTGVGSFMQSQAAVLSQLGSAENRPEQSLPQQRFQLSSAFQQQQQQIQQLRFLQHQMAMAAAAAQTAQLHHHRHTGSQSKSKMKRGTPTTPKF 779 T 6.5E-20 Spt20 pdb F Eukaryota T 8h7g 6 F G TADA1_HUMAN SPT3-ASSOCIATED FACTOR 42,STAF42,TRANSCRIPTIONAL ADAPTER 1-LIKE PROTEIN MDYKDHDGDYKDHDIDYKDDDDKGGSGGSLEVLFQGPLDMATFVSELEAAKKNLSEALGDNVKQYWANLKLWFKQKISKEEFDLEAHRLLTQDNVHSHNDFLLAILTRCQILVSTPDGAGSLPWPGGSAAKPGKPKGKKKLSSVRQKFDHRFQPQNPLSGAQQFVAKDPQDDDDLKLCSHTMMLPTRGQLEGRMIVTAYEHGLDNVTEEAVSAVVYAVENHLKDILTSVVSRRKAYRLRDGHFKYAFGSNVTPQPYLKNSVVAYNNLIESPPAFTAPCAGQNPASHPPPDDAEQQAALLLACSGDTLPASLPPVNMYDLFEALQVHREVIPTHTVYALNIERIITKLWHPNHEELQQDKVHRQRLAAKEGLLLC 374 T 3.7E-15 SAGA-Tad1 pdb F Eukaryota T 8h89 2 J,K,L,M,N,O,P,Q,R J,K,L,M,N,O,P,Q,R A0A345GTT2_9CAUD GP1 MALIQSDFAQGIRMTPVPDCAGDVTACRFDITLKNAPAAGDIIELGVLPGNAVPVEAILDVDDLDTGGAPTITLDVGIMSGPVGKNDPARTCGNELFAASTVGQAGGVVRATASSAFRIQKAEDHRSVGVKVAAGPATGAAGKTIALILFYVQGTSQ 157 T 81 DUF6476 pdbhh T Viruses T 8h8h 1 A,B,C,D,E,F,G,H A,B,C,D,E,F,G,H Q88GF9_PSEPK H-NS family protein MvaT MSRLAEFRAAEKALQEQMAQLEALKKDAGLKREIEFEQKLVGLMKSYDKSLEHHHHHH 58 T 0.00012 Histone_HNS unppercent F Bacteria T 8hbr 1 A,B,C,D,E,F A,B,C,D,E,F A0A2K1IUB4_PHYPA TOG domain-containing protein GSHMLWSQAMESVRASDFDLAYADILGSNDELLLVRLMSRTGPVLEQLSDATLTHLMGNLKHFLQQQSFLECVIPWIQQVADLVLSNGPNALGLTGDSKKDLVFALQEAASMDHAQSWMAAKIVELAEQLRSAWL 135 T 0.084 Dna2 pdb F Eukaryota T 8hcr 8 H,Q I,U CYTOCHROME AA3 SUBUNIT CtaJ MSAMEIHLFFVGIPLLLVVVLSVLIWSRKGPHPATYKLSEPWTHPPILWAATDVVGSAHGGHGHDASEFTVGGGASGTW 79 T 0.2 ASFV_J13L pdbhh F T 8hdj 1 A,C,E,G A,C,E,G Periplasmic domain of RsgI2 SSQEYAYIDVDIN 13 T 2.1 Peptidase_M15_4 pdbhh F T 8hdj 2 B,D,F,H B,D,F,H RSGI2_ACET2 Anti-sigma-I factor RsgI2 PSIGLVIDKKEKVIDAKPLNNDAKPILDEAAPKDMPLYDALSKILDISKKNGYINSADNIVLFSASINSGRNNVSESDKGIQEIISTLKDVAKDAGVKFEIIPSTEEDRQKALDQNLSMGRYAIYVKAVEEGVNLNLEDARNLSVSEILGKVNIGKFAISDT 162 T 0.0038 Spore_III_AB pdbpercent F Bacteria T 8hdr 2 C,D,E,F,G,H A,B,C,D,E,F Pam3 connector protein MIDVAIAIDAESVEVTWRNRSGGSYDSRGNATGASWADTQIRAAIQPVSGRELQDLPEGVRSKVTLVAWTRSEVAENDQIIYLGDAYRVYAARPRPMDGFTRIALGKVSP 110 T 0.00037 Phage_H_T_join pdbhh F T 8hdr 3 I,J,K,L,M,N G,H,I,J,K,L Pam3 terminator protein MRRITGITVIKDHQSEDRPALPYGVVELANFRDLHQQVRTIHYEDIEDSDNGEGFPEVQATPEVEQEWVFLVQVYGPGGLDYLRKVAAAFHVNQVNDLPGSLVIHEVAQINSIPEFLGERWEKRAQTNITLRGMSTDGFKVDVIEQHVINVTGERA 156 T 7.9 DUF3168 pdbhh F T 8hdr 4 O,P,Q,R,S,T,U,V,W,X,Y,Z M,N,O,P,Q,R,S,T,U,V,W,X Pam3 sheath protein MAKLPYSRVTNVTLTRTDNFPTRRGFGTQLILTHTAVSGQVDATKRTKLYASLAEVEADYPANTSVYKAALSAFSQNPRPIRLKVGYAATPTGGDDAAKKADFITSLGAILNYDQAFYQITLDAALRDQPYLDGLVEWVEAQPKIAMIDSNAAGHEDPANTTVIAARHKGTVERTAVFYHTDSTEYLAASMAAYMSTRVFDDANSAYTLKFKKAPGVRAIDKGSAVVTAITGFVEQTGQSESAGHCANTLIDIGDQEFLVEGSTLTQNVFLDEIHATDWIIARTEEEMLSLFLNNDRVPFTDQGMQQLASVPRAIMQLAARAGIVALDLNPLTGAYEPAYTITVPSVFDIPESQRKARIAPAIQVRFRYAGAVHYSVINYTMTF 384 T 3.3E-20 DUF3383 pdbpssm F T 8hdt 2 G,H,I,J,K,L,M G,H,I,J,K,L,M Cement MAPYNETYASDYAFAYEGMVSDIAPADIISRTVETSAGIGFGKIVAQGTSDRGCKADVSAVSPTAPPLGITVRSQATENLTLDKYPRYDGAAIMRKGVIWVLVTDAGGVVAGDPVWLKKSDGTFSNADVGSSGGLRLAGCRWDTSAANGALARMRVDFDVPPVAGA 166 T 5.7E-05 DUF2190 pdbhh F T 8hdu 1 A,B,C,D,E A,B,C,D,E De novo design cavitated protein MGSSHHHHHHSSGLVPRGSHMASMTGGQQMGRGSEDEKKKIEELLKKAKEMLKKYASNIDKFIAALRRVVQALYDAGAYQVVIRMYQAALAGQIDREHLRFLIETLQRIMANAPSEMTRMAALLLRLLALLALLTGDLLLVILLAAMIILLFAGYGEVVVKIFKIIREMPDKEEALKKAVELAIKMVEEFRKKQGLEHHHHHH 203 T 0.06 DUF5344 pdb F T 8hdv 1 A,B A,B De novo design cavitated protein MGSSHHHHHHSSGLVPRGSHMASMTGGQQMGRGSEDVIKQALKRVQQYIQQAPNGYRDVIQQILQTVLKILKLMGMPEVEAVLIVAYVAEMLVLAAKYGYIDELLKLAKEALEADDVDKMIEIFLKMLKIMFLALALDPEGLKKLKELKKNGSEEVRKLIEEVIKQLKQQRQQQALEHHHHHH 183 T 0.059 DUF3103 pdb F T 8he0 2 B B HIF1A_HUMAN HIF-1-ALPHA,HIF1-ALPHA GSQRKRKMEHDGSLFQAVGIGTLLQQPDDHAATTSLSWKRVKG 43 T 33 PheRS_DBD2 pdbhh F Eukaryota T 8he3 2 B B Hypoxia-inducible factor 1-alpha GSQRKRKMEWKRVKG 15 T 3 Nucleos_tra2_N pdbhh F T 8hep 1 A A H6SHX8_ACETH Anti-sigma factor MYGYICVDIN 10 T 0.014 Pox_F11 unppssm F Bacteria T 8hep 2 B B H6SHX8_ACETH Anti-sigma factor PSVELVIDETCRVLEVRPQNKDGEQLISGLELLDKNVEDVVYELINRSISFGFVKADDNRKIVLISGALNDKRNELKTKKENDEAELTELLDNIKARVDRIDNIKVRTITATSRERKDALKYGLSMGKYCLYLEAQELNGSITIDEVHDMSISDMIEKLEHHHHHH 166 T 0.0087 SesA pdb F Bacteria T 8heq 1 A A RSGI2_ACET2 Anti-sigma-I factor RsgI2 MYAYIDVDIN 10 T 2 DUF4179 unppssm F Bacteria T 8heq 2 B B RSGI2_ACET2 Anti-sigma-I factor RsgI2 PSIGLVIDKKEKVIDAKPLNNDAKPILDEAAPKDMPLYDALSKILDISKKNGYINSADNIVLFSASINSGRNNVSESDKGIQEIISTLKDVAKDAGVKFEIIPSTEEDRQKALDQNLSMGRYAIYVKAVEEGVNLNLEDARNLSVSEILGKLEHHHHHH 159 T 0.0077 Spore_III_AB unppercent F Bacteria T 8her 1 A A H6SHY0_ACETH Anti-sigma factor MYAYVGIDIN 10 T 0.86 Peptidase_M23_N pdbhh F Bacteria T 8her 2 B B H6SHY0_ACETH Anti-sigma factor PSIELWINYNNKIAEAKALNGDAETVLEGLELKEKTVAEAVNEIVQKSMELGFISREKENIILISTACDLKAGEGSENKDVQNKIGQLFDDVNKAVSDLKNSGITTRILNLTLEERESSKEENISMGRYAVYLKAKEQNVNLTIDEIKDADLLELIAKLEHHHHHH 166 T 0.0064 Gypsy pdb F Bacteria T 8hf2 1 A,B,C,D,E A,B,C,D,E A0A654EJS8_ARATH WEITSING METVSAVNQTLPISGGEPVKFTTYSAAVHKVLVMVNAGILGLLQLVSQQSSVLETHKAAFLCFCVFILFYAVLRVREAMDVRLQPGLVPRLIGHGSHLFGGLAALVLVSVVSTAFSIVLFLLWFIWLSAVVYLETNKPSACPPQLPPV 148 T 0.0024 Frag1 pdb F Eukaryota T 8hif 3 GE,ZD y1,y2 Q5YFC8_9VIRU VP137 MDCATYATRKDKGWELNENRCVWAASVKPTSGAIMTNVGVHGKSGNAVLMTPKRRPHAQNHAGYKIKYCKQVPLIPLHGGDYILNHWETRGVDRMRIPGIQHAPPPPAPSGMQNAYSTHPDAYRTPLLADSHALSRMPVVQVHGPQVAPKNSHFTVAPEKHGPVEDMNAIINALPTKVDAVKLEYSASKTNRTNKRPGDGGAPPPKNLSKCHQNKLKTFARTANSGANPFRPATAAPQGLSKQPVRKPFASARNANSGANPFRPPLAHQGLSKAHVVKTAVSVANRSAGAEPFVTRNDPRALAMELANNKTISVTLGLRHWKTVSAAPPEKMSKSGVCKIATNVYNRDGGANPFLVKYEPDSLAVCPMETVEIAAVPSKRPWEGSANPRRPEQISFGMSDKPKFVNDKIGIVLRGPTLAPTLDRTATHTVTRPRALGSFHSTAGPAKHAASIMAECKDESR 461 T 87 TAL_effector pdbhh T Viruses T 8hif 5 BE y5 Q5YFK6_9VIRU VP59 MDSQGFWAILAFTPVLMILSLKGEGLLAMVGLLVLTVTLLASREKNDRPRLSCRGKIGRKVSGFENAGHVRDSHHVIYKRPPVNEYCAETREDNSLYVPEYCGQNWKNGVLSGMGTHHDAYRNLAVNMMTLRRESAVSAGWAHSYL 146 T 4.2 DAG1 pdbhh T Viruses T 8hif 6 CE y6 Q5YFC6_9VIRU VP139 MNAYQNDKLHLCAPRPDLVRAAMSAMVRETGCTPNVNIREMAISAGVMLTKIRANPGMLRYGMTATQTVIYNLKELFAAHAARGVVFKTPAIHPAHPSQWKGF 103 T 2.7 DUF3285 pdbhh T Viruses T 8hif 7 HE y3 Q5YFQ1_9VIRU Penton protein (VP14) MYRGFSLKLPNNYRSGQVTTEHRLPASNSHARWPVEVSFYSAVLHVPAKHQHKFPPVLELKLHNMTTGSMAAHRGSGHHFTFMFHAQSSPTEAVYSCVPVPIVFSDYQSNVIASVDMGEHDTAEKLHFYGSIRNCDNGCTY 141 T 5.7 DUF1848 pdbhh T Viruses T 8hj4 3 C C V9H5N5_9NEIS Phage protein SMNNSIKFHVSYDGTARALFNTKEQAEKYCLVEEINDEMNGYKRKSWEEKLREENCASVQDWVEKNYTSSYSDLFNICEIEVSSAGQLVKIDNTEVDDFVENCYGFTLEDDLEEFNKAKQYLQKFYAECEN 131 T 0.028 LZ3wCH pdb F Bacteria T 8hjc 1 A A Bidentatide CLESGTSCIPGAQHNCCSGVCVPIVTIFYGVCY 33 T 0.0013 Tryp_inh pdb F T 8hlo 2 B C MICA1_HUMAN MOLECULE INTERACTING WITH CASL PROTEIN 1,MICAL-1,NEDD9-INTERACTING PROTEIN WITH CALPONIN HOMOLOGY AND LIM DOMAINS GPGSEPPPKPPRS 13 T 2.9 Dscam_C pdbhh F Eukaryota T 8hn9 2 C C CCNE2 peptide SPVKLKTFKXIPM 13 T 0.36 DUF3754 pdbhh F T 8hns 1 A,B A,B anti-CRISPR protein AcrIIC4 SMKITSSNFATIATSENFAKLSVLPKNHREPIKGLFKSAVEQFSSARDFFKNENYSKELAEKFNKEAVNEAVEKLQKAIDLAEKQGIQF 89 T 1.5 Nif11 pdbhh F T 8hsv 2 B,D E,F D3ZVH5_RAT peptide from E3 ubiquitin-protein ligase Mdm2 DLDDGVSDHSADCLDQDS 18 T 27 Raf1_N pdbhh F Eukaryota T 8hvp 2 C I INHIBITOR VAL-SER-GLN-ASN-LEU-PSI(CH(OH)-CH2)-VAL-ILE-VAL (U-85548E) VSQNXIV 7 T 120 Diphtheria_R pdbhh F T 8hzw 1 A A noursinH11W APSNVLSXLLWGRACV 16 T 0.91 DUF765 pdbhh F T 8i03 8 I I RXT3_SCHPO Transcriptional regulatory protein rxt3 MEEKTPENEQSKKTFDPKDSMKIEETSTNGSSQPSQPSNIKLSIGSILESSNDNGDPEYSENGMGNMNMNTLPMATSTPMSYTKQPSEAKYPNSVWERKGVSDQEENTSSVKRQKTLPTQSSGEEEAKYSHPGAPTATSADSISMESRPSNLSTSLSKTTSYPQFQVRQFVSPIISIDNSALEPFLNRYPASESLFPVTEYEYTPWLEFPLLYSSIGKFVRVTIDIKWLNAAINPRLCRREIWGTDVYTDDSDIATILAHCGCFSLLKPVRKIAVVDLYILPPLVHYKGTRKNQIESRSWSSRQDGISLKIKEVTWKPACASIFENSIHTLTLEERLQARLELSRSSTFKI 351 T 8.9E-16 Rxt3 unppssm F Eukaryota T 8i60 2 C,D A,B ALA-ARG-KCR-SER-ALA-PRO ATKAARXSAPATG 13 T 95 OxoGdeHyase_C pdbhh F T 8i87 2 B,D,E,H B,D,F,T A0A316E3U6_9FLAO Piwi domain-containing protein MKELIYIEEPKILFAHGQKCTDARDGLALFGPLNNLYGIKSGVIGTKQGLKIFRDYLDHIQKPIYNSNSITRPMFPGFEAVFDCKWESTGITFKEVTNEDIGKFLYNSSTHKRTYDLVSLFIDKIISANKNEDENVDVWFVIVPDEIYKYCRPNSVLPKEMVQTKALMSKSKAKSFRYEPSLFPDINIELKEQEKEAETYNYDAQFHDQFKARLLKHTIPTQIFRESTLAWRDFKNAFGLPIRDFSKIEGHLAWTISTAAFYKAGGKPWKLSDVRNGVCYLGLVYKKVEKSKNPRNACCAAQMFLDNGDGTVFKGEVGPWYNPKNGQYHLEPKEAKALLSQSLQSYKEQIGEYPKEVFIHAKTRFNHQEWDAFLEVTPKETNLVGVTISKTKPLKLYKTEGDYTILRGNAYVVNERSAFLWTVGYVPKIQTALSMEVPNPLFIEINKGEADIKQVLKDILSLTKLNYNACIFADGEPVTLRFADKIGEILTASTDIKTPPLAFKYYI 507 T 0.28 TPR_10 pdbpercent F Bacteria T 8i9r 48 WA CX G0SGJ0_CHATD 60S ribosomal subunit-like protein MGKTRTIKNKHAEPSKKKAKKAGDGGVKKTKDRAGSKSKAKVPAIEVKGKPNLGQDKKKQKRVYSEKELGIPQLNTVTPVGVTKPKGKKKGKVFVDDRESMNTILAMVEAEKQGQIESKIMRARQLEEIREARRIEAEKKEAERKALLENTKEQLRKKRKKNKKGGDQKSSEDEGPSLKELTSTGSKAVKSKKKKKVSFATPE 203 T 0.32 LptF_LptG pdbpercent F Eukaryota T 8ia5 2 B B 9-mer peptide DLENLYFQG 9 T 5.7 DUF1563 pdbhh F T 8ia8 1 A L ALA-SER-LYS-LEU-GLY-LEU-ALA-ARG WWGKKYRASKLGLAR 15 T 13 GXWXG pdbhh F T 8ibl 1 A,B A,B W0TJ64_9PSEU CUTINASE GPNPYERGPDPTEDSIEAIRGPFSVATERVSSFASGFGGGTIYYPRETDEGTFGAVAVAPGFTASQGSMSWYGERVASHGFIVFTIDTNTRLDAPGQRGRQLLAALDYLVERSDRKVRERLDPNRLAVMGHAMGGGGSLEATVMRPSLKASIPLTPWHLDKTWGQVQVPTFIIGAELDTIAPVSTHAKPFYESLPSSLPKAYMELCGATHFAPNIPNTTIAKYVISWLKRFVDEDTRYSQFLCPNPTDRAICEYRSTCPYKLN 263 F F Bacteria T 8ig0 1 A,B,C,D A,B,C,D MEN1_HUMAN Menin MGLKAAQKTLFPLRSIDDVVRLFAAELGREEPDLVLLSLVLGFVEHFLAVNRVIPTNVPELTFQPSPAPDPPGGLTYFPVADLSIIAALYARFTAQIRGAVDLSLYPREGGVSSRELVKKVSDVIWNSLSRSYFKDRAHIQSLFSFITGTKLDSSGVAFAVVGACQALGLRDVHLALSEDHAWVVFGPNGEQTAEVTWHGKGNEDRRGQTVNAGVAERSWLYLKGSYMRCDRKMEVAFMVCAINPSIDLHTDSLELLQLQQKLLWLLYDLGHLERYPMALGNLADLEELEPTPGRPDPLTLYHKGIASAKTYYRDEHIYPYMYLAGYHCRNRNVREALQAWADTATVIQDYNYCREDEEIYKEFFEVANDVIPNLLKEAASLLEAGEERPGEQSQGTQSQGSALQDPECFAHLLRFYDGICKWEEGSPTPVLHVGWATFLVQSLGRFEGQVRQKVRIVSGTVAGTARGPEGGSTAQVPAPTASPPPEGPVLTFQSEKMKGMKELLVATKINSSAIKLQLTAQSQVQMKKQKVSTPSDYTLSFLKRQRKGL 550 T 2.2E-24 Menin unppssm F Eukaryota T 8igg 1 A,B,C,D E,A,B,C CHMA_BP201 CHMA,PHAGE NUCLEUS ENCLOSURE PROTEIN,PHUN,GENE PRODUCT 105,GP105 MIRDTATNTTQTQAAPQQAPAQQFTQAPQEKPMQSTQSQPTPSYAGTGGINSQFTRSGNVQGGDARASEALTVFTRLKEQAVAQQDLADDFSILRFDRDQHQVGWSSLVIAKQISLNGQPVIAVRPLILPNNSIELPKRKTNIVNGMQTDVIESDIDVGTVFSAQYFNRLSTYVQNTLGKPGAKVVLAGPFPIPADLVLKDSELQLRNLLIKSVNACDDILALHSGERPFTIAGLKGQQGETLAAKVDIRTQPLHDTVGNPIRADIVVTTQRVRRNGQQENEFYETDVKLNQVAMFTNLERTPQAQAQTLFPNQQQVATPAPWVASVVITDVRNADGIQANTPEMYWFALSNAFRSTHGHAWARPFLPMTGVAKDMKDIGALGWMSALRNRIDTKAANFDDAQFGQLMLSQVQPNPVFQIDLNRMGETAQMDSLQLDAAGGPNAQKAAATIIRQINNLGGGGFERFFDHTTQPILERTGQVIDLGNWFDGDEKRDRRDLDNLAALNAAEGNENEFWGFYGAQLNPNLHPDLRNRQSRNYDRQYLGSTVTYTGKAERCTYNAKFIEALDRYLAEAGLQITMDNTSVLNSGQRFMGNSVIGNNMVSGQAQVHSAYAGTQGFNTQYQTGPSSFYALEHHHHHH 640 T 6.2 TGBp3 pdbhh T Viruses T 8igl 1 A,B A,B A0A2X0THU5_ASF CP2475L CDS PROTEIN,CP2475L PROTEIN,POLYPROTEIN PP220,PROTEIN CP2475L MHHHHHHHHHHGSDYKDHDGDYKDHDIDYKDDDDKELENLYFQGAGSMKIFLFHETVITGLNLLSAIYVLLNNFRNNIKGLDLDTIQKSIIEWLRETQAANVNRANLIDWLGRKHGAISEIRNPGLVIKEINMRLSMVYPDPTTEAAAAAQDRNLTTETLFAWIVPYVGIPAGGGVRPEQELAARYLVDNQRIMQLLLTNIFEMTSSFNKMVQVRFPETSTAQVHLDFTGLISLIDSLMADTKYFLDLLRPHIDKNIIQYYENRSNPGSFYWLEEHLIDKLIKPPTDAGGRPLPGGELGLEGVNQIINKTYTLLTKPYNVLQLRGGAQRRDAANIQINNNPQSSERFEQYGRVFSRLVFYDALENNSGLRVEQVALGDFRLSNLIRTNNAQEENTLSYWDNIALRTYANVNDAANNLRRYRLYGSDYGIQNNRSMMMVFNQLIASYITRFYDAPSGKIYLNLINAFANGNFSQAVMEMGYAHPDLARNNNVFGHRGDPTEQSVLLLSLGLILQRLIKDTNRQGLSQHLISTLTEIPIYLKENYRANLPLFNKMFNILISQGELLKQFIQYTNVQLARPNLTALLGANNDSVIYYNNNNVPATGLSVGQAALRGIGGVFRPNVTLMPLGDAQNNTSDVVRKRLVAVIDGIIRGSHTLADSAMEVLHELTDHPIYLETEEHFIQNYMSRYNKEPLMPFSLSLYYLHDLRIENNEVYDPLLYPNLESGSPEFKLLYGTRKLLGNDPVQLSDMPGVQLIMKNYNETVVAREQITPTRFEHFYTHAIQALRFIINIRSFKTVMMYNENTFGGVNLISENRDDKPIITAGIGMNAVYSLRKTLQDVISFVESSYQEEQINHIHKIVSPKGQTRTLGSNRERERIFNLFD 883 T 9.3 DUF3888 unp T Viruses T 8im8 1 A,B,C,D A,B,C,D AMY1_ECOLI 1,4-ALPHA-D-GLUCAN GLUCANOHYDROLASE MKLAACFLTLLPGFAVAASWTSPGFPAFSEQGTGTFVSHAQLPKGTRPLTLNFDQQCWQPADAIKLNQMLSLQPCSNTPPQWRLFRDGEYTLQIDTRSGTPTLMISIQNAAEPVASLVRECPKWDGLPLTVDVSATFPEGAAVRDYYSQQIAIVKNGQIMLQPAATSNGLLLLERAETDTSAPFDWHNATVYFVLTDRFENGDPSNDQSYGRHKDGMAEIGTFHGGDLRGLTNKLDYLQQLGVNALWISAPFEQIHGWVGGGTKGDFPHYAYHGYYTQDWTNLDANMGNEADLRTLVDSAHQRGIRILFDVVMNHTGYATLADMQEYQFGALYLSGDEVKKSLGERWSDWKPAAGQTWHSFNDYINFSDKTGWDKWWGKNWIRTDIGDYDNPGFDDLTMSLAFLPDIKTESTTASGLPVFYKNKMDTHAKAIDGYTPRDYLTHWLSQWVRDYGIDGFRVDTAKHVELPAWQQLKTEASAALREWKKANPDKALDDKPFWMTGEAWGHGVMQSDYYRHGFDAMINFDYQEQAAKAVDCLAQMDTTWQQMAEKLQGFNVLSYLSSHDTRLFREGGDKAAELLLLAPGAVQIFYGDESSRPFGPTGSDPLQGTRSDMNWQDVSGKSAASVAHWQKISQFRARHPAIGAGKQTTLLLKQGYGFVREHGDDKVLVVWAGQQ 676 T 6.800000000000001E-27 Alpha-amylase pdb F Bacteria T 8iqb 1 A,B A,B A0A0C5B022_ASF ASFVPRIMPOL GSMREESWEEHDTIQLTAQRKYLAEVQALETLLARELSVFLTEPGSKKTNIINRITGKTYALPSTELLRFYEHLEQCRKQGALMYFLERQGTYSGLMLDYDLKLNTNAAPSLESSVLSRLCHRIFVHIKNSSVLPEGSHKIHFFFTLKPEAVQGKYGFHVLIPGLKMAASTKKSIIASLQHDATVQKILHEQGVANPESCLDPHSASVPSLLYGSSKLNHRPYQLKTGFELVFDSSDPDYIPIHQIKNIESYNLVSELSLTNEQGSLVRPVYCAADIAAEKEEEIPA 287 T 0.0044 PPL4 unppercent T Viruses T 8iqc 1 A,B A,B A0A0C5B022_ASF Putative primase C962R GSLAEVQALETLLARELSVFLTEPGSKKTNIINRITGKTYALPSTELLRFYEHLEQCRKQGALMYFLERQGTYSGLMLDYDLKLNTNAAPSLESSVLSRLCHRIFVHIKNSSVLPEGSHKIHFFFTLKPEAVQGKYGFHVLIPGLKMAASTKKSIIASLQHDATVQKILHEQGVANPESCLDPHSASVPSLLYGSSKLNHRPYQLKTGFELVFDSSDPDYIPIHQIKNIESYNLVSELSLTNEQGSLVRPVYCA 254 T 0.0044 PPL4 unppercent T Viruses T 8itg 2 B B A0A385ZG42_9ACTN Tricyclic peptide MS-271 MSAVYEPPMLQEVGDFDELTKCLGVGSCNDFAGCGYAIVCFG 42 T 0.12 DUF5972 pdb F Bacteria T 8iyj 5 I 8 CF107_MOUSE Cilia- and flagella-associated protein 107 MAMLSTSVVPEAFSTPGWQIEKKYSTKVLLGNWVEERGKFTKAIDHTPQCIYRKEYVPMPDHRPDFVSRWYSKSKMEGLPYKHLITHHQEPSHRYLISTYDDHYNRHNYNPGLPALRTWNGQKLLWLPEKSDFPLVAPPTNYGLLEQLQQKWLASKTSLKESIYTTSYPRLPVCAMSRREHAIPVPHPRLQPIPRF 196 T 0.025 DUF1143 pdbpssm F Eukaryota T 8iyj 13 AC,BH D,K3 SPAG8_MOUSE SPERM MEMBRANE PROTEIN 1,SMP-1,SPERM MEMBRANE PROTEIN BS-84 METTESTEGSLSRSCDVQPSSERLDTPSEPVPSSSSSPRSTAPAEAPAQYSVLTEPSSDSLYGAPCPPAHHRGHGFGFQPFYVSCIPQDPCNMADLSSRADPTSSYPCHSSVHGSGSGTCGLGQSSEPSQGSGPTSGPAPASVPSLVSGPDSASGPDSSASGPALASGPGPADPGQGPKFSTCIPQGYRCIPVDLAPDYNAWCQHLHWKPQRSWEPLQVSEPGVRGPYKPPEPGALGPCEPCEPCEPPEAESEETLCKARPRGQCLLYNWEEERATNQLDQIPPLQDGSESYFFRHGHQGLLTTQPQSPMSSSTTQRDSYQLPRHICQPLRGKREAMLEMLLRHQICKEVQAEQEPARKLFETESVTHHDYRVELVRAAPPASTKPHDYRQEQPETFWIQRAARLPGVSDIRTLDTPFRKNCSFSTPVPLSLGQPLPYELESGPHQVGVISSLACQGGGQGCGRTKTTPI 470 T 2.3 DUF1143 pdbhh F Eukaryota T 8iyj 15 HJ,ND,XC N2,F,E CF161_MOUSE Cilia- and flagella-associated protein 161 MAQNVYGPGVRMGNWNEDVYLEEERMRHFLEKREKGELLIQRNRRVKKNILRPMQLSVSEDGYVHYGDKVIIVNPDQVLGEEAGKFMRGDLSLCMSPDEVKAQLSDDLEIPCGVSAVQTIAPMGRNTFTILSDGANSCEMGQVVVYGQNFCLGIAAGLEGKMLYLTSDHRTLLKSSLKSGLQEVTLTDEVTHLNCWQAAFLDPQLRLEYEGFPVRANEKIVIYHRHTNRALAVHRNLFLRTYFGKEMEVVAHTYLDSHKVEKPKNQWMLVTGNPRNKSNTMLDISKPITEDTRALEQAMGINT 303 T 0.066 DUF1143 pdbhh F Eukaryota T 8iyj 38 MT Z1 TEPP_MOUSE Testis, prostate and placenta-expressed protein MAQIIDLVPWDECSAHLYASPAVLLPLERVRHPLAGVKHQLYHPALPSLRRMDMDTVKGCLSDEHCQSSTYFSKDDFNKAHFTLLGVPNKPLQCLDFTATGQKLCHKYRGGKMIPIAPGINRVDWPCFTRAIEDWSKFVSRSEEFKLPCANKRVEGFSGYAVRYLKPEVTQNWRYCLNQNPSLDRYGQKPLPFDSLNAFRRFGSHYSRINYLTPWH 216 T 0.012 PAN_4 unppssm F Eukaryota T 8iyj 40 ST,TT,UT,VT a1,a2,a3,a4 CJ082_MOUSE Uncharacterized protein C10orf82 homolog MESPKTFMRKLPITPGYCGFIPWLSCQESSSEDRMNPCVKAFQERTQRYKEDQQGLNCSVANTPPLKPICSEDTVLWVLHEYAKKYHPLTLECKNEKKPLQEPPIPGWAGYLPRARVTEFGYATRYTIMAKKCYKDFLDLVEQAKRAQLKPYEQTYDVRAAQPLSPSSKILQLQGLSPAFPEFSGPGQTPPSEDPQAPRPCGCAQWSSQSCSRNVYGEPPSLAKAFAES 229 T 0.0034 DUF2475 pdbhh F Eukaryota T 8iyj 45 OU,VU,WU,XU i1,l,m,n FLTOP_MOUSE CILIA- AND FLAGELLA-ASSOCIATED PROTEIN 126 MATNYSANQYEKAYLPTYLQNWSPARPTKEKIAAHEGYTQIIANDRGHLLPSVPRSKASPWGSFMGTWQMPLKIPPAKVTLTARTTTAADNLTKWIHKNPDLLNACNGLRPEISGKPFDPDSQTKQKKSVTKTVQQAPNPTIIPSSPVIQGDNPDEPQSSHPSAGHTPGPQTPVNSPNNPPPSPCKSTK 189 T 0.2 DUF4248 pdb F Eukaryota T 8j07 23 AC,BC,CC,DC,EC,FC,GC,HC,IC,JC,KC,LC,MC 4A,4B,4C,4D,4E,4F,4G,4H,4I,4J,4K,4L,4M F166B_HUMAN Protein FAM166B MAVASTFIPGLNPQNPHYIPGYTGHCPLLRFSVGQTYGQVTGQLLRGPPGLAWPPVHRTLLPPIRPPRSPEVPRESLPVRRGQERLSSSMIPGYTGFVPRAQFIFAKNCSQVWAEALSDFTHLHEKQGSEELPKEAKGRKDTEKDQVPEPEGQLEEPTLEVVEQASPYSMDDRDPRKFFMSGFTGYVPCARFLFGSSFPVLTNQALQEFGQKHSPGSAQDPKHLPPLPRTYPQNLGLLPNYGGYVPGYKFQFGHTFGHLTHDALGLSTFQKQLLA 275 F F Eukaryota T 8j07 35 FE 7 DRC7_HUMAN COILED-COIL DOMAIN-CONTAINING PROTEIN 135,COILED-COIL DOMAIN-CONTAINING PROTEIN LOBO HOMOLOG MEVLREKVEEEEEAEREEAAEWAEWARMEKMMRPVEVRKEEITLKQETLRDLEKKLSEIQITVSAELPAFTKDTIDISKLPISYKTNTPKEEHLLQVADNFSRQYSHLCPDRVPLFLHPLNECEVPKFVSTTLRPTLMPYPELYNWDSCAQFVSDFLTMVPLPDPLKPPSHLYSSTTVLKYQKGNCFDFSTLLCSMLIGSGYDAYCVNGYGSLDLCHMDLTREVCPLTVKPKETIKKEEKVLPKKYTIKPPRDLCSRFEQEQEVKKQQEIRAQEKKRLREEEERLMEAEKAKPDALHGLRVHSWVLVLSGKREVPENFFIDPFTGHSYSTQDEHFLGIESLWNHKNYWINMQDCWNCCKDLIFDLGDPVRWEYMLLGTDKSQLSLTEEDDSGINDEDDVENLGKEDEDKSFDMPHSWVEQIEISPEAFETRCPNGKKVIQYKRAKLEKWAPYLNSNGLVSRLTTYEDLQCTNILEIKEWYQNREDMLELKHINKTTDLKTDYFKPGHPQALRVHSYKSMQPEMDRVIEFYETARVDGLMKREETPRTMTEYYQGRPDFLSYRHASFGPRVKKLTLSSAESNPRPIVKITERFFRNPAKPAEEDVAERVFLVAEERIQLRYHCREDHITASKREFLRRTEVDSKGNKIIMTPDMCISFEVEPMEHTKKLLYQYEAMMHLKREEKLSRHQVWESELEVLEILKLREEEEAAHTLTISIYDTKRNEKSKEYREAMERMMHEEHLRQVETQLDYLAPFLAQLPPGEKLTCWQAVRLKDECLSDFKQRLINKANLIQARFEKETQELQKKQQWYQENQVTLTPEDEDLYLSYCSQAMFRIRILEQRLNRHKELAPLKYLALEEKLYKDPRLGELQKIFA 874 T 0.00016 Peptidase_C93 pdbhh F Eukaryota T 8j07 48 CJA,IV,IW,RIA r,Q,R,q ROP1L_HUMAN ROPN1-LIKE PROTEIN,AKAP-ASSOCIATED SPERM PROTEIN MPLPDTMFCAQQIHIPPELPDILKQFTKAAIRTQPADVLRWSAGYFSALSRGDPLPVKDRMEMPTATQKTDTGLTQGLLKVLHKQCHHKRYVELTDLEQKWKNLCLPKEKFKALLQLDPCENKIKWINFLALGCSMLGGSLNTALKHLCEILTDDPEGGPARIPFKTFSYVYRYLARLDSDVSPLETESYLASLKENIDARKNGMIGLSDFFFPKRKLLESIENSEDVGH 230 T 0.0089 RIIa unppercent F Eukaryota T 8j07 87 NGA k2 DNAI4_HUMAN WD REPEAT-CONTAINING PROTEIN 78 MTPGKHSGASARAANGGAWGYRDFRGGQKKGWCTTPQLVATMPVSPAGSHKQQNFGLNNATQPKKSISFFATMKATSVKGYTGANQSRMAVSKTVLIPPELKTVEKPNPNIKTTQVFDINGTDVTPRPLYHPDPLTGTAKPSKLLTSQEGSLGSEFISSYSLYQNTINPSTLGQFTRSVLGSSTVSKSSVSASESIAEDLEEPSYKRERLTSFTDLQVIRAAPEKIVTKEDLEKNIEIILTETETLRFFDLPTVMVSVESEEAEKVTQRNKNYEVLCRNRLGNDLYVERMMQTFNGAPKNKDVQCDKIIMEDKGIMSTAWDLYDSYNAMELVSLSVKQSVVESSSKANVLPKDQDQRLPGSTTEKNSETSSLMDIENVILAKIHEDEEDHSDAILKSDKFHQDLFFMERVLMENIFQPKLAAYRQLPVLKEPEPEEPEDVLESAKHEEVEEESKKEEEEEIHAEESTIPANLERLWSFSCDLTKGLNVSSLAWNKTNPDLLAVGYGHFGFKEQKRGLACCWSIKNPMWPERIYQSPYGVTAVDFSIGAPNLLAVGYHNGTIAIYNVRSNSNVPVLDSSESPQKHLGPVWQLQWIEQDRGTTGDGKREILVSISADGRISKWVIRKGLDCYDLMRLKRTTAASNKKGGEKEKKDEALISRQAPGMCFAFHPKDTNIYLAGTEEGHIHKCSCSYNEQYLDTYRGHKGPVYKVTWNPFCHDVFLSCSADWGVIIWQQENVKPSLSFYPATSVVYDVAWSPKSSYIFAAANENRVEIWDLHISTLDPLIVNTANPGIKFTTILFAKQTDCLLVGDSDGQVSVYELRNMPTVLETGRGDIMDTLLGSKSNQSA 848 T 0.23 WD40 pdb F Eukaryota T 8j07 96 EHA,PJA,TIA,ZHA m1,s1,q1,o1 DNAI1_HUMAN AXONEMAL DYNEIN INTERMEDIATE CHAIN 1 MIPASAKAPHKQPHKQSISIGRGTRKRDEDSGTEVGEGTDEWAQSKATVRPPDQLELTDAELKEEFTRILTANNPHAPQNIVRYSFKEGTYKPIGFVNQLAVHYTQVGNLIPKDSDEGRRQHYRDELVAGSQESVKVISETGNLEEDEEPKELETEPGSQTDVPAAGAAEKVTEEELMTPKQPKERKLTNQFNFSERASQTYNNPVRDRECQTEPPPRTNFSATANQWEIYDAYVEELEKQEKTKEKEKAKTPVAKKSGKMAMRKLTSMESQTDDLIKLSQAAKIMERMVNQNTYDDIAQDFKYYDDAADEYRDQVGTLLPLWKFQNDKAKRLSVTALCWNPKYRDLFAVGYGSYDFMKQSRGMLLLYSLKNPSFPEYMFSSNSGVMCLDIHVDHPYLVAVGHYDGNVAIYNLKKPHSQPSFCSSAKSGKHSDPVWQVKWQKDDMDQNLNFFSVSSDGRIVSWTLVKRKLVHIDVIKLKVEGSTTEVPEGLQLHPVGCGTAFDFHKEIDYMFLVGTEEGKIYKCSKSYSSQFLDTYDAHNMSVDTVSWNPYHTKVFMSCSSDWTVKIWDHTIKTPMFIYDLNSAVGDVAWAPYSSTVFAAVTTDGKAHIFDLAINKYEAICNQPVAAKKNRLTHVQFNLIHPIIIVGDDRGHIISLKLSPNLRKMPKEKKGQEVQKGPAVEIAKLDKLLNLVREVKIKT 699 T 0.0027 WD40 pdb F Eukaryota T 8j07 97 AIA,FHA,QJA,UIA o2,m2,s2,q2 DNAI2_HUMAN AXONEMAL DYNEIN INTERMEDIATE CHAIN 2 MEIVYVYVKKRSEFGKQCNFSDRQAELNIDIMPNPELAEQFVERNPVDTGIQCSISMSEHEANSERFEMETRGVNHVEGGWPKDVNPLELEQTIRFRKKVEKDENYVNAIMQLGSIMEHCIKQNNAIDIYEEYFNDEEAMEVMEEDPSAKTINVFRDPQEIKRAATHLSWHPDGNRKLAVAYSCLDFQRAPVGMSSDSYIWDLENPNKPELALKPSSPLVTLEFNPKDSHVLLGGCYNGQIACWDTRKGSLVAELSTIESSHRDPVYGTIWLQSKTGTECFSASTDGQVMWWDIRKMSEPTEVVILDITKKEQLENALGAISLEFESTLPTKFMVGTEQGIVISCNRKAKTSAEKIVCTFPGHHGPIYALQRNPFYPKNFLTVGDWTARIWSEDSRESSIMWTKYHMAYLTDAAWSPVRPTVFFTTRMDGTLDIWDFMFEQCDPTLSLKVCDEALFCLRVQDNGCLIACGSQLGTTTLLEVSPGLSTLQRNEKNVASSMFERETRREKILEARHREMRLKEKGKAEGRDEEQTDEELAVDLEALVSKAEEEFFDIIFAELKKKEADAIKLTPVPQQPSPEEDQVVEEGEEAAGEEGDEEVEEDLA 605 T 0.088 DUF4795 pdb F Eukaryota T 8j07 99 AKA,EIA,JHA,RJA,YIA u6,o6,m6,s6,q6 ODAD1_HUMAN COILED-COIL DOMAIN-CONTAINING PROTEIN 114 MEGERRAYSKEVHQRINKQLEEIRRLEEVRGDLQVQISAAQNQVKRLRDSQRLENMDRLLKGRAQVQAEIEELQEQTRALDKQIQEWETRIFTHSKNVRSPGFILDQKVKIRRRIRILENQLDRVTCHFDNQLVRNAALREELDLLRIDRNRYLNVDRKLKKEIHHLHHLVSTLILSSTSAYAVREEAKAKMGLLRERAEKEEAQSEMEAQVLQRQILHLEQLHHFLKLKNNDRQPDPDVLEKREKQAGEVAEGVWKTSQERLVLCYEDALNKLSQLMGESDPDLLVQKYLEIEERNFAEFNFINEQNLELEHVQEEIKEMQEALVSARASKDDQHLLQEQQQKVLQQRMDKVHSEAERLEARFQDVRGQLEKLKADIQLLFTKAHCDSSMIDDLLGVKTSMGDRDMGLFLSLIEKRLVELLTVQAFLHAQSFTSLADAALLVLGQSLEDLPKKMAPLQPPDTLEDPPGFEASDDYPMSREELLSQVEKLVELQEQAEAQRQKDLAAAAAKLDGTLSVDLASTQRAGSSTVLVPTRHPHAIPGSILSHKTSRDRGSLGHVTFGGLSSSTGHLPSHITHGDPNTGHVTFGSTSASSGGHVTFRPVSASSYLGSTGYVGSSRGGENTEGGVESGGTASDSSGGLGSSRDHVSSTGPASSTGPGSSTSKDSRG 670 T 2.4E-05 CCDC73 pdbhh F Eukaryota T 8j07 101 AJA,CKA,FIA,LHA,TJA q8,u8,o8,m8,s8 ODAD3_HUMAN COILED-COIL DOMAIN-CONTAINING PROTEIN 151 MTSPLCRAASANALPPQDQASTPSSRVKGREASGKPSHLRGKGTAQAWTPGRSKGGSFHRGAGKPSVHSQVAELHKKIQLLEGDRKAFFESSQWNIKKNQETISQLRKETKALELKLLDLLKGDEKVVQAVIREWKWEKPYLKNRTGQALEHLDHRLREKVKQQNALRHQVVLRQRRLEELQLQHSLRLLEMAEAQNRHTEVAKTMRNLENRLEKAQMKAQEAEHITSVYLQLKAYLMDESLNLENRLDSMEAEVVRTKHELEALHVVNQEALNARDIAKNQLQYLEETLVRERKKRERYISECKKRAEEKKLENERMERKTHREHLLLQSDDTIQDSLHAKEEELRQRWSMYQMEVIFGKVKDATGTDETHSLVRRFLAQGDTFAQLETLKSENEQTLVRLKQEKQQLQRELEDLKYSGEATLVSQQKLQAEAQERLKKEERRHAEAKDQLERALRAMQVAKDSLEHLASKLIHITVEDGRFAGKELDPQADNYVPNLLGLVEEKLLKLQAQLQGHDVQEMLCHIANREFLASLEGRLPEYNTRIALPLATSKDKFFDEESEEEDNEVVTRASLKIRSQKLIESHKKHRRSRRS 595 T 0.0023 CCDC73 pdbhh F Eukaryota T 8j07 106 IKA w LRC34_HUMAN Leucine-rich repeat-containing protein 34 MAAQPPRPVGERSMGSSREAARAPARSPAWASTQASTPGAALAVQRESPESGLQKHYSNLCMEKSQKINPFILHILQEVDEEIKKGLAAGITLNIAGNNRLVPVERVTGEDFWILSKILKNCLYINGLDVGYNLLCDVGAYYAAKLLQKQLNLIYLNLMFNDIGPEGGELIAKVLHKNRTLKYLRMTGNKIENKGGMFFAAMLQINSSLEKLDLGDCDLGMQSVIAFATVLTQNQAIKAINLNRPILYSEQEESTVHVGRMLKENHCLVALHMCKHDIKNSGIQQLCDALYLNSSLRYLDVSCNKITHDGMVYLADVLKSNTTLEVIDLSFNRIENAGANYLSETLTSHNRSLKALSVVSNNIEGEGLVALSQSMKTNLTFSHIYIWGNKFDEATCIAYSDLIQMGCLKPDNTDVEPFVVDGRVYLAEVSNGLKKHYYWTSTYGESYDHSSNAGFALVPVGQQP 464 T 7.1E-08 FBXL18_C pdbhh F Eukaryota T 8j62 2 B,D,H,J C,E,G,I Viral infectivity factor MGHHHHHHSQDPMENRWQVMIVWQVDRMRINTWKRLVKHHMYISRKAKDWFYRHHYESTNPKISSEVHIPLGDAKLVITTYWGLHTGERDWHLGQGVSIEWRKKRYSTQVDPDLADQLIHLHYFDEASEGSQIKPPLPSVRKLTEDRWNK 150 T 0.21 Vif pdb F T 8j8p 2 B A A0A0L8RF82_SACEU CDC73-like protein SGSAGNGLVPSDPVLAETMKNERVVQDHNSALRGARPINFGYLIKDAELKLVQSIKGSLRGSKLPPGHKGAHGRVSKTNGS 81 T 5.6 CDC73_N unppercent F Eukaryota T 8j8p 4 D R A0A0L8RIY1_SACEU RTF1-like protein SKSDPFSRLKTRTKVYYQEIQKEENAKAKEMAQQEKLQEDRETKERREKELLLAQFRRLGGLERMIGELDIKFDFKF 77 T 0.095 Tom37_C pdbpercent F Eukaryota T 8jaj 1 A,B,C,D,E,F,G,H,I,J,K,L a,b,c,d,e,f,g,h,i,j,k,l Q71TB2_BPP1 THE TAIL SHEATH PROTEIN MSQYSIQQSLGNASGVAVSPINADATLSTGVALNSSLWAGIGVFARGKPFTVLAVTESNYEDVLGEPLKPSSGSQFEPIRHVYEAIQQTSGYVVRAVPDDAKFPIIMFDESGEPAYSALPYGSEIELDSGEAFAIYVDDGDPCISPTRELTIETATADSAGNERFLLKLTQTTSLGVVTTLETHTVSLAEEAKDDMGRLCYLPTALEARSKYLRAVVNEELISTAKVTNKKSLAFTGGTNGDQSKISTAAYLRAVKVLNNAPYMYTAVLGLGCYDNAAITALGKICADRLIDGFFDVKPTLTYAEALPAVEDTGLLGTDYVSCSVYHYPFSCKDKWTQSRVVFGLSGVAYAAKARGVKKNSDVGGWHYSPAGEERAVIARASIQPLYPEDTPDEEAMVKGRLNKVSVGTSGQMIIDDALTCCTQDNYLHFQHVPSLMNAISRFFVQLARQMKHSPDGITAAGLTKGMTKLLDRFVASGALVAPRDPDADGTEPYVLKVTQAEFDKWEVVWACCPTGVARRIQGVPLLIK 529 T 0.034 Phage_sheath_1 pdbpercent T Viruses T 8jan 1 A,B,C,D,E,F,G,H,I,J,K,L a,b,c,d,e,f,g,h,i,j,k,l Q71TM5_BPP1 TAIL TUBE PROTEIN MGHNNTKGNRKFIKGRYTANAAKGERLVSSEFLLTFAGHEDISVLVRTSQIPEMTREDVEDYGPNGVKFNQHGPIRNSGEIQVQCVETIEGDILQFIKDRIAAKDYVDITMAATPESKSSGVNAVTKAATTIEMLDCKIYSDAIDFSTEDVTAAVRPSLRIVYNWIEWD 169 T 0.0002 Phage_T4_gp19 pdbhh T Viruses T 8jan 3 AA,BA,CA,DA,Y,Z A,B,C,D,y,z Q71T90_BPP1 TAIL TERMINATOR PROTEIN MILNNQEWLLAIFKKKGLTPTGKLEFATIDGIDSALAQALNEAFDSQVVSFNDRINQSFREFLKRTPRDRITLGTFSDVKEWLSSFEADRAGRKDTASAGPVNKLAMPLVNLSRSPAFSIYEGELCRDNYDEGHVTNENDEIEALVSTIPFSLEYSLWIASDEKESLGMVTTALAFWLRMYASLGQASFTHIANVGGYEIPVTCYIEGQKSIAFQDLTTGTADNRLFAVGLNLTVVAELPILAYMQQTTGTITVKAKILEE 261 T 0.87 T4-gp15_tss pdbhh T Viruses T 8jtk 2 B A Q2NK94_AYWBP Sequence-variable mosaic (SVM) signal sequence domain-containing protein GPAPNEEFVGDMRIVNVNLSNIDILKKHETFKKYFDFTLTGPRYNGNIAEFAMIWKIKNPPLNLLGVFFDDGTRDDEDDKYILEELKQIGNGAKNMYIFWQYEQK 105 T 8.7 Sigma_reg_N unppercent F Bacteria T 8jtl 2 C,D D,C Q6YQ57_ONYPE Sequence-variable mosaic (SVM) signal sequence domain-containing protein GPAPHEERVGDMRIVNITFSDINSIKNFQPFSQYFDFTLTGPRYNGNIAQFAMIWKIKNPPHNLLGVFFDNNTRDDEDDKYTLEELKQMGNGAKNMYIFWQYEQK 105 T 4.5 DUF5454 pdbhh F Bacteria T 8ju8 1 A A de novo designed protein MWGKVVVIGSGEYGKRAAQRVADLLDPRIDVYLIFDAKSTDEIRKMIKDHGADAVIVIGAPLGTAFAIAKAAAELGAAVIVIIPRRPGVREAARRFGEEARKYGGRVEVLLGATVEEAVAFARRVVQQFFALEHHHHHH 139 T 0.0022 GFO_IDH_MocA pdb F T 8oep 2 B,D B,D VE6_HPV18 Protein E6 RQERLQRRRETQV 13 T 0.19 Mu-like_Com unphh T Viruses T 8ofg 2 C C GLU-ARG-LEU-LEU-GLY-GLY-TRP-LYS ERLLGGWK 8 T 0.66 hSac2 pdbhh F T 8og5 1 A A DCAF1_HUMAN HIV-1 VPR-BINDING PROTEIN,VPRBP,SERINE/THREONINE-PROTEIN KINASE VPRBP,VPR-INTERACTING PROTEIN GGGREPKQRRQAPINFTSRLNRRASFPKYGGVDGGCFDRHLIFSRFRPISVFREANEDESGFTCCAFSARERFLMLGTCTGQLKLYNVFSGQEEASYNCHNSAITHLEPSRDGSLLLTSATWSQPLSALWGMKSVFDMKHSFTEDHYVEFSKHSQDRVIGTKGDIAHIYDIQTGNKLLTLFNPDLANNYKRNCATFNPTDDLVLNDGVLWDVRSALAIHKFDKFNMNISGVFHPNGLEVIINTEIWDLRTFHLLHTVPALDQCRVVFNHTGTVMYGAMLQADDEDDLMEERMKSPFGSSFRTFNATDYKPIATIDVKRNIFDLCTDTKDCYLAVIENQGSMDALNMDTVCRLYEVGRQRLAEDEDEE 367 T 0.0019 ANAPC4_WD40 pdbpercent F Eukaryota T 8oga 1 A P DCAF1_HUMAN HIV-1 VPR-BINDING PROTEIN,VPRBP,SERINE/THREONINE-PROTEIN KINASE VPRBP,VPR-INTERACTING PROTEIN GGGRFRPISVFREANEDESGFTCCAFSARERFLMLGTCTGQLKLYNVFSGQEEASYNCHNSAITHLEPSRDGSLLLTSATWSQPLSALWGMKSVFDMKHSFTEDHYVEFSKHSQDRVIGTKGDIAHIYDIQTGNKLLTLFNPDLANNYKRNCATFNPTDDLVLNDGVLWDVRSALAIHKFDKFNMNISGVFHPNGLEVIINTEIWDLRTFHLLHTVPALDQCRVVFNHTGTVMYGAMLQADDEDDLMEERMKSPFGSSFRTFNATDYKPIATIDVKRNIFDLCTDTKDCYLAVIENQGSMDALNMDTVCRLYEVGRQR 318 T 0.069 ANAPC4_WD40 pdb F Eukaryota T 8oij 1 A,B A,B SMG_DROME Protein Smaug GGSGGSGGSGGSLFCEQVTTVTNLFEKWNDCERTVVMYALLKRLRYPSLKFLQYSIDSNLTQNLGTSQTNLSSVVIDINANNPVYLQNLLNAYKTARKEDILHEVLNMLPLLKPGNEEAKLIYLTLIPVAVKDTMQQIVPTELVQQIFSYLLIHPAITSEDRRSLNIWLRHLEDHIQ 177 T 0.067 DUF6179 pdbpssm F Eukaryota T 8oij 2 C,D C,D SMO_DROME DSMO,SMOH,SMOOTH SVPSYGEDELQQAMRLLNAASRQRTEAANEDFGGT 35 T 1.1 DUF1635 pdbhh F Eukaryota T 8oik 1 A,B,C A,B,C SMAG1_HUMAN SMAUG 1,HSMAUG1,STERILE ALPHA MOTIF DOMAIN-CONTAINING PROTEIN 4A,SAM DOMAIN-CONTAINING PROTEIN 4A MKHHHHHHPMSDYDIPTTENLYFQSMFRDQVGVLAGWFKGWNECEQTVALLSLLKRVSQTQARFLQLCLEHSLADCAELHVLEREANSPGIINQWQQESKDKVISLLLTHLPLLKPGNLDAKVEYMKLLPKILAHSIEHNQHIEESRQLLSYALIHPATSLEDRSALAMWLNHLEDRTST 180 T 0.22 MerR-DNA-bind pdbpercent F Eukaryota T 8oin 33 LA Bc I3LN63_PIG mL54 MAARRLFGAARSWAAWRAWELSDAAVSGRLHVRNYAKRPVIKGGKGGKGAVVGEALKDPEVCTDPFRLTTHAMGVNIYKEGQDVVLKPDSEYPEWLFEMNVGPPKKLEELDPETREYWRLLRKHNIWRHNRLSKNRKF 138 F F Eukaryota T 8opz 1 A A Tailspike depolymerase (APK16_gp47) from Acinetobacter phage APK16 GSEVAAAQTQYYLKYFNPDIVYPKNARIMLDTGVVVMSMVDGNSTNPNSNMTGWVRVNSASLIFDQSGKTQQEINDSQKQKLPSLKDYGAVSGQDSTAAIKAAIAAEDFLYFGDIGDNFIVSEQIDLRDGCYYVSNGAKFTAALGIEGSQPYTPKSIINASGKVGINISGLVRTHIDHNIFSALGDANSKPTISGFLADAAIDCDFGKWESVGSVNYYYTPNFKEYGIVDLRNSIDCYIEADVNGRWTEETTASTPSTVGIMGSNNKGCYLKGRAKNCYWSGILWEGEDCVVDGPHVRNTKGSNLNLAGKNTAAYNVDLYGSEQGNISIGEGATQAENCNVVGGVAGNAKFANCHLHSVTKNCHVKLFHYGWGQTASAVSDATSGIRCQGTGNTIDSEFDVTYGGLTVKGDAVNVYCSTLTNPEATNIKVNVVGIGARVQIRAPYTIVNAKITGATGDAVVLGERCKGSIVEEVTAIKCGRPLQYAPKTTDANDYAGVIIGRINDVECTNRSVFYGQKIVHSQRKIERIYAQETAFVLDQVLEAIEVYTNDSGVTGANKLASAIRHISADSFGTSYGLDLVASTISKNNLANSKTKVRAGHIEVEPAVAGAASHIVLYAANGTKWKLEPTGSASAANWVAV 641 T 0.39 YmcE_antitoxin pdbpssm F T 8oq0 1 A A Tailspike protein GSEEAAQVARSADKVIDASGLTQQDINDRLAITYPTAVGLVGKPNLKDADVIYVQCYSNIFDGGDGYYRVSADTTTVADGAYVIRINPNLIATMLNTTGSVDVARFGAVMNADVSPFIEKAFKYFRDVCLTKPYKLNTVVGIPDQNNYSKNVYYLRGLGDPEITVDCPSAVFTSASAKLDPTSTVNKFTAKIDVSNISFIGTTVANSVVFNGDRLYNINVHHNNFKGNITIFKAYVKREVGRQYTQSVSINHNHLTGVYRVIESDKSYNLDFSYNMCEACIGGIYVGVDAPWDPNNISLTIHRNLWEGSGMLLKTNGGIIGGTISANYFENNTFNDAGIEKCLISINRTGTGAGYASGLVISGNTFSGNGAIPDFVDVRYVNQSTESSSTSKTANVKPVVFIGNWSNSYLMTNFAGALLINNRCSNRNTMFNAYSPQEGRVTFASGYLDKPLSSMLSGNLLNLITLDTRPCFTAGYINTNFKTTFDVNVLFKTSGGINTASCSFKLDVFVYTPLGAGTPPKSNLKAVMSAFMQSDTNDIISTGVNETMKSVIGATPTMAVVNNGDGTYGIRLSPFTNASSPNWGAITSARIEYTYQGTLIASHTSTYSTANLLTIT 616 T 9.1E-05 Pectate_lyase_3 pdbhh F T 8ou0 2 B D A0A3S5ZPV0_BOVIN Stabilizer of axonemal microtubules 1 MAPTKGKCVCELCSCGRHHCPHLPTKIYDKTEKPCLLSEYTENYPVYHSYLPRESFKPKMDYQRACTPMEGLTTSRRDFGPHKVLPVKIHQPNPFVPSEENMDLQTTYKQDYNPYPLCRVDPFKPRDSKYPCGDKMESLPTYKADYLPWNQPRRELLRPPHHYRPASTKFDSRTTQQDDYSMKGLVNTRSCKPPAVPKLCNVPLEDLTNYKMSYVAHPLEKRFVHESEKFRPCEIPFESLTTHKESYRGLMGEPAKSLKPPARPYGLDTPFSNTTEFRDKYQAWPTPQVFSKPPSMYVPPEEKMDLLTTVQTHYTYPKGAPAESCRPALSVKKGGRFEGSTTTKEDYKQWASTRTEPAKPIPQLNLPTEPLDCLTTARAHYVPHLPMMTKSCKPVWSGPQGNIPVEGQTTYTISFTPKEMSRCLASYPEPPGYIFEEIDALGHRIYRPVSQTGSRRSSRFSVGDSENPNQQELTVSA 477 T 0.0011 STOP pdb F Eukaryota T 8owi 1 A,B A,B CENPE_HUMAN CENTROMERE PROTEIN E,CENP-E,KINESIN-7,KINESIN-RELATED PROTEIN CENPE GPSPYKEEIEDLKMKLVKIDLEKMKNAKEFEKEISATKATVEYQKEVIRLLRENLRRSQQAQDTSVISEHTDPQPSNKPLTCGGGSGIVQNTKALILKSEHIRLEKEISKLKQQNEQLIKQKNELLSNNQHLSNEVKTWKERTLKREAHK 150 T 0.0014 ZapB pdb F Eukaryota T 8p26 1 A,B,C,D,E,F,G,H,I,J A,B,C,D,E,F,G,H,I,J F4KC77_ARATH U2 small nuclear ribonucleoprotein auxiliary factor-like protein GSMASFEKFEPIFGEVVPERSDPGSGLLRRCLFHVYASDSYNLTVHVTDFISGVWTTILSVSQLDDMRDTVGIGGSWSEFVDYTVASLKSDNVKLLLGETSVSNGVKTARLVSQKAKGMPRINVPLTKMVESSASEAMANLSLELFRAFKSKQHLQGEVSFSAAATDEKDKRDATYNQLERYSRKLDVMAPSTNNRQDSPANQSAREANTKNPVKRVPAHRRTRKRGALLQDSEEEDG 238 T 0.003 PAXX unphh F Eukaryota T 8p3l 1 A,B,C,D A,D,G,J W0DP94_9GAMM THIOCYANATE DEHYDROGENASE MSYYHHHHHHDYDIPTTENLYFQGAMGKYVKVQDFYDQLGKYVLVAPGKFSGTVAATDLSTGWTMAWLAAWNYGDTCPIMHHMAAFPSPDPYKEFEFVVNTQGGKNLFIYGVPVAVEDPGEGMKIYRIKYDGTRMNLQRDAAEVSGLGLGVHVTITPEADGYAVGDGQKDICAEFDRETDMVRYAWAFDWDPNVKDLKRAWLDGGTMTIKRLKPTLPGGRYDLQGSKGNKIDWELVPGGELAIEDGKVSGDRPLHSVANDALVFDPRGKWAVASMRLPGVCVVFDRENQVPVAVLAGPKGTPSQFQLVKVDDDTWTVDIPEVISAGHQAGFSPDGQSFLFMNSLRQNNIMVWDSSNHDDPTTWEKKAVVESPDWRGAYPNTFHMVFTPDAKKIYVTMWWPSPTPNGIAVIDAVNWEVLKEVDLGPDMHTLAITYDGKFVVGTLSGYQNTASAIVVMETETDEVLGFLPSPMGHHDNVIVPRTLEDLRISRSTTT 494 T 0.0037 Cytochrom_D1 unppercent F Bacteria T 8p3m 1 A,B,C,D,E,F,G,H,I,J,K,L,M,N,O,P A,D,G,J,M,P,S,V,Y,2,5,8,x,e,h,k W0DP94_9GAMM THIOCYANATE DEHYDROGENASE MSYYHHHHHHDYDIPTTENLYFQGAMGKYVKVQDFYDQLGKYVLVAPGKFSGTVAATDLSTGWTMAWLAAWNYGDTCPIMHHMAAFPSPDPYKEFEFVVNTQGGKNLFIYGVPVTVEDPGEGMKIYRIKYDGTRMNLQRDAAEVSGLGLGVHVTITPEADGYAVGDGQKDICAEFDRETDMVRYAWAFDWDPNVKDLKRAWLDGGTMTIKRLKPTLPGGRYDLQGSAGNKIDWELVPGGELAIEDGKVSGDRPLHSVANDALVFDPRGKWAVASMRLPGVCVVFDRENQVPVAVLAGPKGTPSQFQLVKVDDDTWTVDIPEVISAGHQAGFSPDGQSFLFMNSLRQNNIMVWDSSNHDDPTTWEKKAVVESPDWRGAYPNTFHMVFTPDAKKIYVTMWWPSPTPNGIAVIDAVNWEVLKEVDLGPDMHTLAITYDGKFVVGTLSGYQNTASAIVVMETETDEVLGFLPSPMGHHDNVIVPRTLEDLRISRSTTT 494 T 0.0037 Cytochrom_D1 unppercent F Bacteria T 8pe9 1 A A DDR1_HUMAN EPITHELIAL DISCOIDIN DOMAIN RECEPTOR 1,CD167 ANTIGEN-LIKE FAMILY MEMBER A,CELL ADHESION KINASE,DISCOIDIN RECEPTOR TYROSINE KINASE,HGK2,MAMMARY CARCINOMA KINASE 10,MCK-10,PROTEIN-TYROSINE KINASE 3A,PROTEIN-TYROSINE KINASE RTK-6,TRK E,TYROSINE KINASE DDR,TYROSINE-PROTEIN KINASE CAK RDGLLSYTAPVGQTMYLSEAVYLNDSTYDGHTVGGLQYGGLGQLADGVVGLDDFRKSQELRVWPGYDYVGWSNHSFSSGYVEMEFEFDRLRAFQAMQVHCNNMHTLGARLPGGVECRFRRGPAMAWEGEPMRHNLGGNLGDPRARAVSVPLGGRVARFLQCRFLFAGPWLLFSEISFISDVVN 183 T 0.011 Lamprin pdb F Eukaryota T 8pfm 2 S S Q84626_PBCV1 Paramecium bursaria chlorella virus 1 (PBCV-1) penton protein. VETTQHFVSIESSNRPDPANTTPANYSIQLPQRYRNIWSAMLVNIALPAVSPPQKYVYLDIDKLNSIDSTSPSGGVNFALAKIPLSIAGTGNVFFADTMTSSFPNVPLQNPVATMDKLNIKLKDANGNVLTIPAGNEHSFMIQLTCGDYIPRGGGSTITQNGRVLGG 167 T 1.8 DUF2433 unphh T Viruses T 8phq 1 A,B,BA,C,CA,CB,DA,DB,EB,J,K,KA,L,LA,LB,MA,MB,NB,S,T,TA,U,UA,UB,VA,VB,WB AA,AB,BB,AC,BC,CC,BD,CD,CE,AJ,AK,BK,AL,BL,CL,BM,CM,CN,AS,AT,BT,AU,BU,CU,BV,CV,CW Major capsid protein MELFDENYYAKAVANIIGEVKDPIMYKWFSPDQIEDVDLQMGYQKTVKWDAFLNANPTTIANEVNTISTIGFSSEVVRLNYLKLQYKFRHLKQTSEKFYTSDSYIGDINNNLLPFAQAYKLASSEIIKLINHFVLTGTVSIQKDGKNQKRLLPNMYGLLNMPEQIKEEVASGDKDKMDKIFEKIEAGLSKLELGDEFSTPMMVIVDPATSLKLVKPYAAAQGAASSCEKWEDVLIQTIKAINNREDVYIETSNLLKHKILIYPLNSELIKFKPSKYMLPTPNEQVDKDSTDVAHSYIDFVLGGLLATRKTILQVNIKQS 319 T 0.12 DUF6260 pdbhh F T 8s9i 2 B G SSB_BPT4 SSB PROTEIN,GP32,HELIX-DESTABILIZING PROTEIN AATAAKKADKVADDLDAFNVDDF 23 T 0.34 Dehydrin unppercent T Viruses T 8sah 1 A A HD_HUMAN HUNTINGTON DISEASE PROTEIN,HD PROTEIN MVSPDKDWYVHLVKSQCWTRSDSALLEGAELVNRIPAEDMNAFMMNSEFNLSLLAPCLSLGMSEISGGQKSALFEAAREVTLARVSGTVQQLPAVHHVFQPELPAEPAAYWSKLNDLFGDAALYQSLPTLARALAQYLVVVSKLPSHLHLPPEKEKDIVKFVVATLEALSWHLIHEQIPLSLDLQAGLDCCCLALQLPGLWSVVSSTEFVTHACSLIHCVHFILEAVAVQPGEQLLSPERRTNTPKAISEEEEEVDPNTQNPKYITAACEMVAEMVESLQSVLALGHKRNSGVPAFLTPLLRNIIISLARLPLVNSYTRVPPLVWKLGWSPKPGGDFGTAFPEIPVEFLQEKEVFKEFIYRINTLGWTSRTQFEETWATLLGVLVTQPLVMEQEESPPEEDTERTQINVLAVQAITSLVLSAMTVPVAGNPAVSCLEQQPRNKPLKALDTRFGRKLSIIRGIVEQEIQAMVSKRENIATHHLYQAWDPVPSLSPATTGALISHEKLLLQINPERELGSMSYKLGQVSIHSVWLGNSITPLREEEWDEEEEEEADAPAPSSPPTSPVNSRKHRAGVDIHSCSQFLLELYSRWILPSSSARRTPAILISEVVRSLLVVSDLFTERNQFELMYVTLTELRRVHPSEDEILAQYLVPATCKAAAVLGMDKAVAEPVSRLLESTLRSSHLPSRVGALHGILYVLECDLLDDTAKQLIPVISDYLLSNLKGIAHCVNIHSQQHVLVMCATAFYLIENYPLDVGPEFSASIIQMCGVMLSGSEESTPSIIYHCALRGLERLLLSEQLSRLDAESLVKLSVDRVNVHSPHRAMAALGLMLTCMYTGKEKVSPGRTSDPNPAAPDSESVIVAMERVSVLFDRIRKGFPCEARVVARILPQFLDDFFPPQDIMNKVIGEFLSNQQPYPQFMATVVYKVFQTLHSTGQSSMVRDWVMLSLSNFTQRAPVAMATWSLSCFFVSASTSPWVAAILPHVISRMGKLEQVDVNLFCLVATDFYRHQIEEELDRRAFQSVLEVVAAPGSPYHRLLTCLRNVGGSGDYKDDDDK 1057 T 0.06 Spidroin_MaSp pdbpercent F Eukaryota T 8san 1 A,E,I A,E,I A0A1W6IM54_9HIV1 CH848.0836.10 gp120 AENLWVTVYYGVPVWKEAKTTLFCASDAKAYKKEVHNVWATHACVPTDPSPQELFLKNVTENFNMWKNDMVDQMHEDIISLWDQSLKPCVKLTPLCVTLICSNATVKNSTVEEMKNCSFNTTTEIRDKEKKEYALFYRPDIVPLNNETSNTSEYRLINCNTSACTQACPKVTFEPIPIHYCAPAGYAILKCNDETFNGTGPCSNVSTVQCTHGIRPVVSTQLLLNGSLAEKGIVIRSENLTNNAKIIIVHLHTPVEIVCTRPNNNTRKSVRIGPGQTFYATGDIIGDIRQAHCNISESKWNETLQKVGKELQKHFPNKTIKYAQSAGGDMEITTHSFNCGGEFFYCNTAKLFNGTYNGTDISTNSSTNSNPTITLQCRIKQIINMWQGVGRCMYAPPIAGNITCKSNITGLLLTRDGGTNSSGKEEIFRPAGGDMRDNWRSELYKYKVVKIEPLGVAPTRCKRR 464 T 3.5E-53 GP120 pdbpssm T Viruses T 8sar 1 A,E,I A,F,K A0A1W6IPB2_9HIV1 CH848.10.17 gp120 AENLWVTVYYGVPVWKEAKTTLFCASDARAYEKEVHNVWATHACVPTDPSPQELVLGNVTENFNMWKNDMVDQMHEDIISLWDQSLKPCVKLTPLCVTLICSNATVKNGTVEEMKNCSFNTTTEIRDKEKKEYALFYKPDIVPLSETNNTSEYRLINCNTSACTQACPKVTFEPIPIHYCAPAGYAILKCNDETFNGTGPCSNVSTVQCTHGIRPVVSTQLLLNGSLAEKEIVIRSENLTNNAKIIIVHLHTPVEIVCTRPNNNTRKSVRIGPGQTFYATGDIIGDIKQAHCNISEEKWNDTLQKVGIELQKHFPNKTIKYNQSAGGDMEITTHSFNCGGEFFYCNTSNLFNGTYNGTYISTNSSANSTSTITLQCRIKQIINMWQGVGRCMYAPPIAGNITCRSNITGLLLTRDGGTNSNETETFRPAGGDMRDNWRSELYKYKVVKIEPLGVAPTRCKRRV 463 T 2.5E-52 GP120 pdbpssm T Viruses T 8sat 1 A,E,I A,E,I A0A1W6IPB2_9HIV1 CH848.10.17 gp120 AENLWVTVYYGVPVWKEAKTTLFCASDARAYEKEVHNVWATHACVPTDPSPQELVLGNVTENFNMWKNDMVDQMHEDIISLWDQSLKPCVKLTPLCVTLICSNATVKNGTVEEMKNCSFNTTTEIRDKEKKEYALFYKPDIVPLSETNNTSEYRLINCNTSACTQACPKVTFEPIPIHYCAPAGYAILKCNDETFNGTGPCSNVSTVQCTHGIRPVVSTQLLLNGSLAEKEIVIRSENLTNNAKIIIVHLHTPVEIVCTRPNNNTRKSVRIGPGQTFYATGDIIGDIKQAHCNISEEKWNDTLQKVGIELQKHFPNKTIKYNQSAGGDMEITTHSFNCGGEFFYCNTSNLFNGTYNGTYISTNSSANSTSTITLQCRIKQIINMWQGVGRCMYAPPIAGNITCRSNITGLLLTRDGGTNSNETETFRPAGGDMRDNWRSELYKYKVVKIEPLGVAPTRCKRRVVGRRRRRR 471 T 2.2E-52 GP120 pdbpssm T Viruses T 8sax 1 A,E,I A,E,I A0A1W6IPB2_9HIV1 CH848.10.17.SOSIP gp120 AENLWVTVYYGVPVWKEAKTTLFCASDARAYEKEVHNVWATHACVPTDPSPQELVLGNVTENFNMWKNDMVDQMHEDIISLWDQSLKPCVKLTPLCVTLICSDATVKTGTVEEMKNCSFNTTTEIRDKEKKEYALFYKPDIVPLSETNNTSEYRLINCNTSACTQACPKVTFEPIPIHYCAPAGYAILKCNDETFNGTGPCSNVSTVQCTHGIRPVVSTQLLLNGSLAEKEIVIRSENLTNNAKIIIVHLHTPVEIVCTRPNNNTRKSVRIGPGQTFYATGDIIGDIKQAHCNISEEKWNDTLQKVGIELQKHFPNKTIKYNQSAGGDMEITTHSFNCGGEFFYCNTSNLFNGTYNGTYISTNSSANSTSTITLQCRIKQIINMWQGVGRCMYAPPIAGNITCRSNITGLLLTRDGGTNSNETETFRPAGGDMRDNWRSELYKYKVVKIEPLGVAPTRCKRRVVGRRRRRR 471 T 3.3999999999999995E-53 GP120 pdbpssm T Viruses T 8sk7 3 C,F,I X,Y,Z HA_20 minibinder (RFdiffusion-designed) MEKEKELKEYAEKIKKEIGDIESVEVKDGKILVKAKKITDKTVDAIMKLTVKAARLGFKVEVELV 65 T 7.2 DUF5320 pdbhh F T 8sl0 1 A A GSDM_VITXG BGSDM,BACTERIAL GASDERMIN SGLCSDPAITYLKRLGYNVVRLPREGIQPLHLLGQQRGTVEYLGSLEKLITQPPSEPPAITRDQAAAGINGQKTENLSFSIGINILKSVLAQFGAGAGIEAQYNQARKVRFEFSNVLADSVEPLAVGQFLKMAEVDADNPVLKQYVLGNGRLYVITQVIKSNEFTVAAEKSGGGSIQLDVPEIQKVVGGKLKVEASVSSQSTVTYKGEKQLVFGFKCFEIGVKNGEITLFASQLVPR 237 T 0.00027 Gasdermin pdbpercent F Bacteria T 8smq 1 A,B,C,D A,B,C,D Q182N1_CLOD6 hypothetical protein CD630_25440 SNADKILDLSFKKIETDLSSKITYEDTGVKIETDSSKSDKERYLYIYQNIKENWSMYNNFYIEIQNKNKSSQKINLSIQSKNMFEFRLKEGSEVFLEGKNIIYSDKIKEGCIEVPGEFEGKIYVNFNSLINEESNVVLDSNMLSNIVSWGITFIPSDEEHNIVIIKKISLLSE 173 T 1.6E-05 Agarase_CBM pdbhh F Bacteria T 8snb 11 X,Y 1l,1m A0A7M7GGC2_STRPU Tex26(LOC100888047) MPNSGFQGQINQSFYGTHKAHLVPLGDQYVSGNLPNPAFWRRFIRDPVSSIENPSGTRVLTTTEVLAPVWQSNNCAANLSKNDRATSNTLPRLHTSSTWTTNEGSDHAPLRPQVPSSARGRFRDAHIPLVALNSLAPFATTYKPTCGYFFSRSTNNKKKQMGIPATDLVKYRYYVK 176 T 18 DUF983 pdbhh F Eukaryota T 8snb 12 AA,BA,CA,Z 1p,1q,1r,1o A0A7M7NFX5_STRPU Meiosis-specific nuclear structural protein 1 MSQQQYHWEALRKQRVIDRRLAAMKKMTDEERQMEELSTEGMARDELQSRQMDHAKRRELAEQIQHRDRRGIRNSYKEREQTIQKLVSERTWHDNLIAKMDQSEKDDLLKDLLRDHAQKSKTLRDRGVYRGDAKFFIDPQYN 142 T 0.19 Rubella_Capsid unppercent F Eukaryota T 8snb 14 MA 2G A0A7M7RF95_STRPU CFAP107 MAHGDPQKWNLPGWRIEQRYAGNVLIGNWSEERQKFGRGGEKHTSTHRMDYLNNRNFAPDVMTRRAAKMRNEGLDQTLLFAHHNKNLKNNLISWYDEQFNKRERSGGDQLPELRHWDGQKLAWEPEKTDHPVKGAPTNFGLKDRLQEKWKTEEADKKLSDYSTTYGLDYKNKPRAALVTEHFAPQRAQSSRMHPVNKINKDTNLRSTSILQTPQQIHMRTRNEAVSRSGPAPVSV 235 T 0.0066 DUF1143 unppssm F Eukaryota T 8snb 15 NA,OA,PA 2J,2K,2L A0A7M7NA77_STRPU Cilia- and flagella-associated protein 126 MSSHFSANQYKQAFDSRRLQNSQIPQTYKERPSSYEGFTQIIANDRGHLKQGVPRSKDSPWGGFVGTWEMPKKIPGNVTTYMSRGDPAIDNIQKTRAEHNEYMRQAVSPDKTLAMEPKPQVTKVAEEDRPGNPSPNDAIPA 141 T 48 Scm3 pdbhh F Eukaryota T 8snb 18 TA,UA 2V,2W A0A7M7RAY9_STRPU Cilia- and flagella-associated protein 161 MSVRSYNPSVRVGNWNEDTCLEEDMVKDFLEKKEKGELLIQKASNLRSTILKPSDLSVTVDGFVHFGDTVVIMNEAAADQVRTQPGVEPRQANVLSVNMSETKMHETMRFEGTCTASASKSLNPCVRNTFVIAPAQDGIPPGSPLTYGQHFRLCTLPGVGGNLVLQSDRVSFHASAEKSRKQLISFVDEVQSPYLTEWRILCFNPQIRMESEGLPVPANQRIIFNHCKSNEDLCVVSGMSVRTPFGREYEVVAHTDLDSHRAEKDVNHWIIKTGEPAQPTTLAKTLPVGDQQ 292 T 0.1 DUF1143 pdbhh F Eukaryota T 8snb 29 CC 4Y A0A7M7RHW6_STRPU HeLo_like_N(LOC577943) MTDTVPVPAVRPPVDKPMRLVGVSHSNNSYSLVDHASANDQLHYIFLVVKQHIAGILADVQYYKIAELKNEIIALGTDVAALVRDRSFEESRDKYHSHFWRADDEAENKRLIVKLSEVAYALLDYKQNCCTQSRGPYALDEAKLEILCKKHLYELQNFRRELAQFVNTARDRNAQVTRLSAPLQSQLEWYQYKSPLDDPSIRRPLPYECTLTSRETIRPRTEPPEVDGHISGVKHVPSQWDVPNLSGKPGAANSGTHGNLTSQGRYGGRHMYERNINERRKPCEQEIQHVSSRYKHYHGLCE 302 T 0.18 DUF4208 pdb F Eukaryota T 8snb 34 AD,BD 6Q,6R A0A7M7THB5_STRPU RIB35 MSVTSANPATPYTRAEFFNACTSVLEGLNCLQRQQSVIDERSWEVLNRLGQMVSDLRPEVKGNKEYWVDGPLVKFLTANVLKAKSVLQEMKRTCSQKSAATNGPEYIERHLLQHVEDIRTSLEELETYHQQTLYRTDTTGIDAGVGQRMHTITQGPTEMTAGGLTQTPPAEISLEPKGSGTFMKDLNNAKIATNDSYPGTSSQPMTGMWSSGLPHHPETSKPGVFEHIQEMTNIYPEHDPNWPKREQKPKLPFERTQPPLVYPGIRDMHNYSNLPQFTGKDLPMPKIPNGEMGLRAPHVPHWDSTNHYSY 310 T 0.032 CCCAP pdbpercent F Eukaryota T 8snb 38 ND,OD 7M,7N A0A7M7THD0_STRPU SAXO3(LOC115918676) MTGADRRFDLHQTSSGRGLDYRPEYYFPASDFKTTINNPLPPQLAKQDEIIKPFQTTTGGAHDYKYHGGLMANPQHHKAPGHWNMHYNKDLREKLQQRGWRKPLTMGNQESEVQAQYKGDQMQMGVDFDNRLSGNPQPSDLQTHHQNCPAPVRDSVPKYKPTLVRDDGALQLLDIYVPTSHHVHKRFTRHELDDYPKKDAATYWRCEDYTQAWGHGTKHNPLPKGAEIHQRAPMVDEMVFKTAIKEPARWPERFKRVPHAGMKTTMTSSYKTPSDPKMTELFSCPVDTPWVIPEAGPIQTFSVPNMYTTEYKTYASGKPITV 322 T 27 DUF4632 pdbhh F Eukaryota T 8snb 39 PD,QD 7Q,7R A0A7M7N5A5_STRPU TEPP protein MPTVEVPYYVPQYPTFRRAQLAAVKEGLYHPSLPTFRRMDMDTAAHRLPDEHCRTTTGVGPADFQNATATYFQPPANTYNGANITDTGRLLRETMKDDVKSLRLDWAKAKDIKELPQIKNTGQLRFSGYAVRYQKPAISGSWRYTFTQEPRLDQYGQRPVPANIYSRYRDTFPQYSRNMSTDAFR 185 T 1.7 Cofac_haem_bdg unppercent F Eukaryota T 8snb 41 TD,UD 7Y,7Z A0A7M7N7W6_STRPU Sperm-associated antigen 8 MATLNPARTLNNSGGRCLMENWVEERQVFQTGLDSAGVNSTESYTSNSSLPYKDGHKGILTRELDTAVEKESNSMGSYQRPAQCGVRTVGRKKELMERALYAKVSQELQEEINEPSPVEEYKSVTQKDFYDDEFESELPAPLYEHNVNTEQPITFWSHHKEKIPGVSQIKTLDTPFKKNSAFSKPIAEGTDQPQPYEQETHPFL 204 T 0.00018 Apis_Csd pdb F Eukaryota T 8snb 50 QE 9J A0A7M7RFU3_STRPU Tex49_homologue(LOC580808) MTAQCMGNFGAHKWRYPEFMADRYGRSNGSVDPESRHKQYDAGVSDQTVWVNKRYIPTDLRSTAPVPRRKLLASARLPHVERDWVPLGEASCRQISRAMEERQLYVSQPSTRSEDWSTLRQILPSKGLPLRDSPPNWGTGNAYAPPMLGARQRRFPHINSPMTRYTDNMHTTHKLFKLH 179 T 0.7 ARL6IP6 unppssm F Eukaryota T 8snb 52 VE 9T A0A7M7RHE9_STRPU SPATA45 MDPQKNYEMNNQRESWCAVELSPLQDWCKSERKHHGENFKSSVFNAKQGQPESEARCTFEVNDKTHREKRHFPNKTSYSHLAI 83 T 0.048 ESF1 unppssm F Eukaryota T 8soi 3 D D ATG13_HUMAN Autophagy-related protein 13 DLGTFYREFQNPPQLSSLSIDIGAQSMAEDLDSLPEKLAVHEKNVREFDAFVETLQGSDEA 61 T 12 DUF2315 pdbhh F Eukaryota T 8sqz 3 E,F E,F ATG13_HUMAN Autophagy-related protein 13 HDVLETIFVRKVGAFVNKPINQVTLTSLDIPFAMFAPKNLELEDTDPMVNPPDSPETESPLQGSLHSDGSSGGSSGNTHDDFVMIDFKPAFSKDDILPMDLGTFYREFQNPPQLSSLSIDIGAQSMAEDLDSLPEKLAVHEKNVREFDAFVETLQ 155 T 22 BLI1 unphh F Eukaryota T 8srm 3 E,F E,F ATG13_HUMAN Autophagy-related protein 13 KPAFSKDDILPMDLGTFYREFQNPPQLSSLSIDIGAQSMAEDLDSLPEKLAVHEKNVREFDAFVETLQGSDEA 73 T 11 DUF1244 pdbhh F Eukaryota T 8srz 1 A,B A,B PROT2_LYSEN TRYPSIN-LIKE PROTEASE 2 SVQADYSRAEALAAWTRLSDEFIGNCYVSVRPRHAPAWEVVVASAAGSLRLEAFKRAHDHDFLDRLAVAIGNWEQKAQRPDHEIAQMLDQVG 92 T 3.3 Anillin_N pdbhh F Bacteria T 8ss1 1 A,B A,B A0A2U1VUZ9_9PROT Serine protease SNAERLAAWTRLPWEGLRYSYNRERRGTAARSCPQLEADVALKAETQPSEIPLERQLILEACREAERFGFLHELSIAIVEMERLNKRPEAEVEEIAKLWQ 100 T 0.85 EAD7 unphh F Bacteria T 8sv0 2 B,C C,E protein VII PGGFKRRRL 9 T 5.3 Nha1_C pdbhh F T 8sw7 1 A,C,F A,C,F BG505 Boost 2 gp120 MDAMKRGLCCVLLLCGAVFVSPSQEIHARFRRGARAENLWVTVYYGVPVWKDAETTLFCASDAKAYETKKHNVWATHCCVPTDPNPQEIHLENVTEEFNMWKNNMVEQMHTDIISLWDQSLKPCVKLTPLCVTLQCTNVTNNITDDMRGELKNCSFNMTTELRDKKQKVYSLFYRLDVVQINENQGNRSNNSNKEYRLINCNTSAITQACPKVSFEPIPIHYCAPAGFAILKCKDKKFNGTGPCTNVSTVQCTHGIKPVVSTQLLLNGSLAEEEVIIRSENITNNAKNILVQLNESVQINCTRPNNNTRKSIRIGPGQWFYATGDIIGDIRQAHCNVSKATWNETLGKVVKQLRKHFGNNTIIRFANSSGGDLEVTTHSFNCGGEFFYCNTSGLFNSTWISNTSVQGSNSTGSNDSITLPCRIKQIINMWQRIGQAMYAPPIQGVIRCVSNITGLILTRDGGSTNSTTETFRPGGGDMRDNWRSELYKYKVVKIEPLGVAPTRCKRRVVGRRRRRR 516 T 2.8E-49 GP120 pdb F T 8swd 1 A,B,C,D A,B,C,D Q0PAA2_CAMJE CIAD MHHHHHHSSGVDLWSHPQFEKGTENLYFQSNIMNLEDLAKKTISEVSSIMEEQRRQNEILKEQELNRKTEIKDELPPMEFVCEELDTPQDLEDKISMAKFEEEQKIQNNIEISTQENKEFKKEEPFLQNEILNPSVMTEVQTLNEDIFLKHLRERILVLFEGLNSIKKDDLENRLNLTINFLEFLLANIEDKLKK 195 T 0.31 YebO unp F Bacteria T 8t61 1 A A Designed peptide BH33 RHYYKFNSTGRHYHYY 16 T 0.091 Phage_fiber pdb F T 8t62 1 A A Designed peptide BH21 TMIEDPEAGHFHTSSA 16 T 5.2 MPLKIP pdbhh F T 8t63 1 A A Designed peptide PH1 WHMWNTVPNAKQVIAA 16 T 7.7 DUF5820 pdbhh F T