PdbID EntityID AsymChainIDs AuthorChainIDs UnpCode Name Sequence SeqLength HasWeakHits BestWeakEvalue BestWeakPfamID Source IsVirus Category IsValid 148l 2 B S SUBSTRATE CLEAVED FROM CELL WALL OF ESCHERICHIA COLI AXXX 4 T 380 NSF pdbhh F F 173d 2 C,D C,D DACTINOMYCIN TXPXXXTXPXX 11 T 61 DUF5572 pdbhh F F 185d 1 A A TRIOSTIN A XAXXXAXX 8 T 190 RSF pdbhh F F 193d 2 C C QUINOMYCIN A, UK-63052 XAXXXAXX 8 F F F 1a07 2 B,D C,D ACE-MALONYL TYR-GLU-(N,N-DIPENTYL AMINE) XXEX 4 T 830 DUF5972 pdbhh F F 1a08 2 B,D C,D ACE-DIFLUORO PHOSPHOTYR-GLU-(N,N-DIPENTYL AMINE) XXEX 4 T 830 DUF5972 pdbhh F F 1a09 2 B,D C,D ACE-FORMYL PHOSPHOTYR-GLU-(N,N-DIPENTYL AMINE) XYEX 4 T 830 DUF5972 pdbhh F F 1a0n 1 A A P85A_HUMAN P2L PPRPLPVAPGSSKT 14 T 0.9 AAA_11 pdbhh F Eukaryota T 1a1a 2 B,D C,D ACE-FORMYL PHOSPHOTYR-GLU-(N,N-DIPENTYL AMINE) XYEX 4 T 830 DUF5972 pdbhh F F 1a1b 2 B,D C,D ACE-PHOSPHOTYR-GLU-(N,N-DIPENTYL AMINE) XXEX 4 T 830 DUF5972 pdbhh F F 1a1c 2 C,D C,D ACE-PHOSPHOTYR-GLU-(N-ME(-(CH2)3-CYCLOPENTYL)) XXEX 4 T 830 DUF5972 pdbhh F F 1a1e 2 C,D C,D ACE-PHOSPHOTYR-GLU-(3-BUTYLPIPERIDINE) XXEX 4 T 830 DUF5972 pdbhh F F 1a1m 3 C C PEPTIDE TPYDINQML TPYDINQML 9 T 12 Connexin40_C pdbhh F T 1a1n 3 C C PEPTIDE VPLRPMTY VPLRPMTY 8 T 8.1E-05 F-protein pdbhh F T 1a1o 3 C C PEPTIDE LS6 (KPIVQYDNF) KPIVQYDNF 9 T 5 NitrOD1 pdbhh F T 1a1p 1 A _ COMPSTATIN ICVVQDWGHHRCTX 14 T 2.2 RPN1_RPN2_N pdbhh F T 1a2c 4 D J Aeruginosin 298-A XLXX 4 T 1400 RNA_polI_A14 pdbhh F F 1a30 2 C C TRIPEPTIDE GLU-ASP-LEU EDL 3 T 290 GoLoco pdbhh F F 1a34 1 A,A10,A11,A12,A13,A14,A15,A2,A3,A4,A5,A6,A7,A8,A9 A,A,A,A,A,A,A,A,A,A,A,A,A,A,A COAT_STMV STMV MGRGKVKPNRKSTGDNSNVVTMIRAGSYPKVNPTPTWVRAIPFEVSVQSGIAFKVPVGSLFSANFRTDSFTSVTVMSVRAWTQLTPPVNEYSFVRLKPLFKTGDSTEEFEGRASNINTRASVGYRIPTNLRQNTVAADNVCEVRSNCRQVALVISCCFN 159 T 24 Csm4_C pdbhh T Viruses T 1a37 2 B,B2,D,D2 P,P,Q,P PS-RAF259 PEPTIDE LSQRQRST(SEP)TPNVHM KSQRQRSTSTPNVHM 15 T 26 PSRT pdbhh F T 1a38 2 B,D P,Q R18 PEPTIDE (PHCVPRDLSWLDLEANMCLP) FHCVPRDLSWLDLEANMCLP 20 T 0.33 PP_kinase_N pdbhh F T 1a3i 1 A A COLLAGEN-LIKE PEPTIDE PPGPPGPPG 9 T 0.46 EKLF_TAD1 pdbhh F F 1a3i 2 B,C B,C COLLAGEN-LIKE PEPTIDE PPGPPG 6 T 4.3 CbtA pdbhh F F 1a3j 1 A A COLLAGEN-LIKE PEPTIDE PPGPPGPPG 9 T 0.46 EKLF_TAD1 pdbhh F F 1a3j 2 B,C B,C COLLAGEN-LIKE PEPTIDE PPGPPG 6 T 4.3 CbtA pdbhh F F 1a4t 2 B B REGN_BPP22 20-MER BASIC PEPTIDE NAKTRRHERRRKLAIERDT 19 T 1.9 N36 unphh T Viruses T 1a7c 2 B,C B,C PENTAPEPTIDE XTVASSX 7 T 770 AsiA pdbhh F F 1a7y 1 A,B,C A,B,C DACTINOMYCIN TXPXXXTXPXX 11 T 61 DUF5572 pdbhh F F 1a7z 1 A,B A,B ACTINOMYCIN Z3 XXXXXXTXXXX 11 T 21 Cm_res_leader pdbhh F F 1a9b 3 C,F C,F PEPTIDE LPPLDITPY LPPLDITPY 9 T 0.94 PINIT pdbhh F T 1a9e 3 C C PEPTIDE LPPLDITPY LPPLDITPY 9 T 0.94 PINIT pdbhh F T 1aa5 1 A,B A,B VANCOMYCIN XXNXXXX 7 T 95 P53_C pdbhh F F 1ab9 1 A A CTRA_BOVIN GAMMA-CHYMOTRYPSIN CGVPAIQPVLSGL 13 T 2 CaM_bind pdbhh F Eukaryota T 1ab9 4 D D PENTAPEPTIDE (TPGVY) TPGVY 5 T 22 RsgA_GTPase pdbhh F F 1abo 2 C,D C,D 3BP1_MOUSE 3BP-1 SYNTHETIC PEPTIDE, 10 RESIDUES APTMPPPLPP 10 T 1.4 Cytochrom_B558a pdbhh F Eukaryota F 1abz 1 A _ ATA XDWLKARVEQELQALEARGTDSNAELRAMEAKLKAEIQKX 40 T 0.03 DUF4148 pdb F T 1afq 1 A A CTRA_BOVIN BOVINE GAMMA-CHYMOTRYPSIN CGVPAIQPVLSGL 13 T 2 CaM_bind pdbhh F Eukaryota T 1aft 1 A _ RIR2_MOUSE RIBONUCLEOSIDE-DIPHOSPHATE REDUCTASE XFTLDADF 8 T 15 GDE_N_bis pdbhh F Eukaryota T 1agb 3 C C PEPTIDE GGRKKYKL 8 T 0.29 Gag_p17 pdbhh F F 1agc 3 C C PEPTIDE GGKKKYQL 8 T 0.085 Gag_p17 pdbhh F F 1agd 3 C C PEPTIDE GGKKKYKL 8 T 0.055 Gag_p17 pdbhh F F 1age 3 C C PEPTIDE GGKKKYRL 8 T 0.015 Gag_p17 pdbhh F F 1agf 3 C C PEPTIDE GGKKRYKL 8 T 0.11 Gag_p17 pdbhh F F 1aj1 1 A A LANA_ACTGA LANTIBIOTIC ACTAGARDINE XSGWVCXLXIECGXVICAC 19 T 7.3E-05 L_biotic_typeA pdbhh F Bacteria T 1aja 1 A,B A,B PPB_ECOLI ALKALINE PHOSPHATASE TPEMPVLENRAAQGDITAPGGARRLTGDQTAALRDSLSDKPAKNIILLIGDGMGDSEITAARNYAEGAGGFFKGIDALPLTGQYTHYALNKKTGKPDYVTDSAASATAWSTGVKTYNGALGVDIHEKDHPTILEMAKAAGLATGNVSTAELQGATPAALVAHVTSRKCYGPSATSEKCPGNALEKGGKGSITEQLLNARADVTLGGGAKTFAETATAGEWQGKTLREQAEARGYQLVSDAASLNSVTEANQQKPLLGLFADGNMPVRWLGPKATYHGNIDKPAVTCTPNPQRNDSVPTLAQMTDKAIELLSKNEKGFFLQVEGASIDKQDHAANPCGQIGETVDLDEAVQRALEFAKKEGNTLVIVTADHAHASQIVAPDTKAPGLTQALNTKDGAVMVMSYGNSEEDSQEHTGSQLRIAAYGPHAANVVGLTDQTDLFYTMKAALGLK 449 T 4.2E-10 Alk_phosphatase pdbpercent F Bacteria T 1ajb 1 A,B A,B PPB_ECOLI ALKALINE PHOSPHATASE TPEMPVLENRAAQGDITAPGGARRLTGDQTAALRDSLSDKPAKNIILLIGDGMGDSEITAARNYAEGAGGFFKGIDALPLTGQYTHYALNKKTGKPDYVTDSAASATAWSTGVKTYNGALGVDIHEKDHPTILEMAKAAGLATGNVSTAELQGATPAALVAHVTSRKCYGPSATSEKCPGNALEKGGKGSITEQLLNARADVTLGGGAKTFAETATAGEWQGKTLREQAEARGYQLVSDAASLNSVTEANQQKPLLGLFADGNMPVRWLGPKATYHGNIDKPAVTCTPNPQRNDSVPTLAQMTDKAIELLSKNEKGFFLQVEGASIDKQDHAANPCGQIGETVDLDEAVQRALEFAKKEGNTLVIVTADHAHASQIVAPDTKAPGLTQALNTKDGAVMVMSYGNSEEDSQEHTGSQLRIAAYGPHAANVVGLTDQTDLFYTMKAALGLK 449 T 4.2E-10 Alk_phosphatase pdbpercent F Bacteria T 1ajc 1 A,B A,B PPB_ECOLI ALKALINE PHOSPHATASE TPEMPVLENRAAQGDITAPGGARRLTGDQTAALRDSLSDKPAKNIILLIGDGMGDSEITAARNYAEGAGGFFKGIDALPLTGQYTHYALNKKTGKPDYVTDSAASATAWSTGVKTYNGALGVDIHEKDHPTILEMAKAAGLATGNVSTAELQGATPAALVAHVTSRKCYGPSATSEKCPGNALEKGGKGSITEQLLNARADVTLGGGAKTFAETATAGEWQGKTLREQAEARGYQLVSDAASLNSVTEANQQKPLLGLFADGNMPVRWLGPKATYHGNIDKPAVTCTPNPQRNDSVPTLAQMTDKAIELLSKNEKGFFLQVEGASIDKQDHAANPCGQIGETVDLDEAVQRALEFAKKEGNTLVIVTADHAHASQIVAPDTKAPGLTQALNTKDGAVMVMSYGNSEEDSQEHTGSQLRIAAYGPHAANVVGLTDQTDLFYTMKAALGLK 449 T 4.2E-10 Alk_phosphatase pdbpercent F Bacteria T 1ajd 1 A,B A,B PPB_ECOLI ALKALINE PHOSPHATASE INTERMEDIATE II OF HOLO ENZYME TPEMPVLENRAAQGDITAPGGARRLTGDQTAALRDSLSDKPAKNIILLIGDGMGDSEITAARNYAEGAGGFFKGIDALPLTGQYTHYALNKKTGKPDYVTDSAASATAWSTGVKTYNGALGVDIHEKDHPTILEMAKAAGLATGNVSTAELQGATPAALVAHVTSRKCYGPSATSEKCPGNALEKGGKGSITEQLLNARADVTLGGGAKTFAETATAGEWQGKTLREQAEARGYQLVSDAASLNSVTEANQQKPLLGLFADGNMPVRWLGPKATYHGNIDKPAVTCTPNPQRNDSVPTLAQMTDKAIELLSKNEKGFFLQVEGASIDKQDHAANPCGQIGETVDLDEAVQRALEFAKKEGNTLVIVTADHAHASQIVAPDTKAPGLTQALNTKDGAVMVMSYGNSEEDSQEHTGSQLRIAAYGPHAANVVGLTDQTDLFYTMKAALGLK 449 T 4.2E-10 Alk_phosphatase pdbpercent F Bacteria T 1akj 3 C C HIV REVERSE TRANSCRIPTASE EPITOPE ILKEPVHGV 9 T 0.56 DUF2115 pdbhh F T 1al1 1 A A ALPHA HELIX PEPTIDE: ELLKKLLEELKG XELLKKLLEELKG 13 T 11 NABP pdbhh F F 1al2 1 A 0 P1/MAHONEY POLIOVIRUS GSSST 5 T 190 DltD pdbhh F F 1al4 1 A,B A,B GRAMICIDIN D XXGAXAXVXWXWXWXWX 17 T 4.6 MAP17 pdbhh F F 1ali 1 A,B A,B PPB_ECOLI ALKALINE PHOSPHATASE TPEMPVLENRAAQGDITAPGGARRLTGDQTAALRDSLSDKPAKNIILLIGDGMGDSEITAARNYAEGAGGFFKGIDALPLTGQYTHYALNKKTGKPDYVTDSAASATAWSTGVKTYNGALGVDIHEKDHPTILEMAKAAGLATGNVSTAELQDATPAALVAHVTSRKCYGPSATSEKCPGNALEKGGKGSITEQLLNARADVTLGGGAKTFAETATAGEWQGKTLREQAQARGYQLVSDAASLNSVTEANQQKPLLGLFADGNMPVRWLGPKATYHGNIDKPAVTCTPNPQRNDSVPTLAQMTDKAIELLSKNEKGFFLQVEGASIDKQDHAANPCGQIGETVDLDEAVQRALEFAKKEGNTLVIVTADHAHASQIVAPDTKAPGLTQALNTKDGAVMVMSYGNSEEDSQENTGSQLRIAAYGPHAANVVGLTDQTDLFYTMKAALGLK 449 T 6.8E-10 Alk_phosphatase unppssm F Bacteria T 1alj 1 A,B A,B PPB_ECOLI ALKALINE PHOSPHATASE TPEMPVLENRAAQGDITAPGGARRLTGDQTAALRDSLSDKPAKNIILLIGDGMGDSEITAARNYAEGAGGFFKGIDALPLTGQYTHYALNKKTGKPDYVTDSAASATAWSTGVKTYNGALGVDIHEKDHPTILEMAKAAGLATGNVSTAELQDATPAALVAHVTSRKCYGPSATSEKCPGNALEKGGKGSITEQLLNARADVTLGGGAKTFAETATAGEWQGKTLREQAQARGYQLVSDAASLNSVTEANQQKPLLGLFADGNMPVRWLGPKATYHGNIDKPAVTCTPNPQRNDSVPTLAQMTDKAIELLSKNEKGFFLQVEGASIDKQDHAANPCGQIGETVDLDEAVQRALEFAKKEGNTLVIVTADHAHASQIVAPDTKAPGLTQALNTKDGAVMVMSYGNSEEDSQENTGSQLRIAAYGPHAANVVGLTDQTDLFYTMKAALGLK 449 T 6.8E-10 Alk_phosphatase unppssm F Bacteria T 1alx 1 A A VALYL GRAMICIDIN XGAXAXVXWXYWXWXWX 17 T 3.1 MAP17 pdbhh F T 1alx 2 B B VALYL GRAMICIDIN XGAXAXVXWXWXWXWX 16 T 4.6 MAP17 pdbhh F F 1alz 1 A A ILE-GRAMICIDIN C XXGAXAXVXWXYWXWXWX 18 T 3.3 DUF5848 pdbhh F T 1alz 2 B B VAL-GRAMICIDIN A XGAXAXVXWXWXWXWX 16 T 4.6 MAP17 pdbhh F F 1amt 1 A,B,C A,B,C ALAMETHICIN XXPXAXAQXVXGLXPVXXEQX 21 T 23 RRT14 pdbhh F T 1ani 1 A,B A,B PPB_ECOLI ALKALINE PHOSPHATASE MPVLENRAAQGDITAPGGARRLTGDQTAALRDSLSDKPAKNIILLIGDGMGDSEITAARNYAEGAGGFFKGIDALPLTGQYTHYALNKKTGKPDYVTDSAASATAWSTGVKTYNGALGVDIHEKDHPTILEMAKAAGLATGNVSTAELQHATPAALVAHVTSRKCYGPSATSEKCPGNALEKGGKGSITEQLLNARADVTLGGGAKTFAETATAGEWQGKTLREQAQARGYQLVSDAASLNSVTEANQQKPLLGLFADGNMPVRWLGPKATYHGNIDKPAVTCTPNPQRNDSVPTLAQMTDKAIELLSKNEKGFFLQVEGASIDHQDHAANPCGQIGETVDLDEAVQRALEFAKKEGNTLVIVTADHAHASQIVAPDTKAPGLTQALNTKDGAVMVMSYGNSEEDSQEHTGSQLRIAAYGPHAANVVGLTDQTDLFYTMKAALGLK 446 T 6.8E-10 Alk_phosphatase unppssm F Bacteria T 1anj 1 A,B A,B PPB_ECOLI ALKALINE PHOSPHATASE MPVLENRAAQGDITAPGGARRLTGDQTAALRDSLSDKPAKNIILLIGDGMGDSEITAARNYAEGAGGFFKGIDALPLTGQYTHYALNKKTGKPDYVTDSAASATAWSTGVKTYNGALGVDIHEKDHPTILEMAKAAGLATGNVSTAELQDATPAALVAHVTSRKCYGPSATSEKCPGNALEKGGKGSITEQLLNARADVTLGGGAKTFAETATAGEWQGKTLREQAQARGYQLVSDAASLNSVTEANQQKPLLGLFADGNMPVRWLGPKATYHGNIDKPAVTCTPNPQRNDSVPTLAQMTDKAIELLSKNEKGFFLQVEGASIDHQDHAANPCGQIGETVDLDEAVQRALEFAKKEGNTLVIVTADHAHASQIVAPDTKAPGLTQALNTKDGAVMVMSYGNSEEDSQEHTGSQLRIAAYGPHAANVVGLTDQTDLFYTMKAALGLK 446 T 6.8E-10 Alk_phosphatase unppssm F Bacteria T 1aot 2 B P MT_POVHA PHOSPHOTYROSYL PEPTIDE EPQXEEIPIYL 11 T 3.2 Imm15 pdbhh T Viruses T 1aou 2 B P MT_POVHA PHOSPHOTYROSYL PEPTIDE EPQXEEIPIYL 11 T 3.2 Imm15 pdbhh T Viruses T 1apt 2 B I INHIBITOR ISOVALERYL (IVA)-VAL-VAL-LYSTA-O-ET (LYSTA IS A LYSYL SIDE CHAIN ANALOGUE OF STATIN XVVX 4 T 2200 RPN6_C_helix pdbhh F F 1apu 2 B I PEPSTATIN ANALOGUE ISOVALERYL-VAL-VAL-STA-O-ET XVVX 4 T 2200 RPN6_C_helix pdbhh F F 1apv 2 B I INHIBITOR ISOVALERYL (IVA)-VAL-VAL-HYDRATED DIFLUOROSTATONE-N-METHYLAMINE XVVXX 5 T 2500 Toxin_8 pdbhh F F 1apw 2 B I INHIBITOR ISOVALERYL (IVA)-VAL-VAL-DIFLUOROSTATINE-N-METHYLAMINE XVVXX 5 T 2500 Toxin_8 pdbhh F F 1aq7 2 B B AERUGINOSIN 98-B XXXX 4 T 520 LysM pdbhh F F 1aqg 1 A _ GNAT1_BOVIN GT(ALPHA)(340-350) IKENLKDCGLF 11 T 2.5 Peptidase_C48 pdbhh F Eukaryota T 1aqz 1 A,B A,B RNMG_ASPRE RESTRICTOCIN ATWTCINQQLNPKTNKWEDKRLLYSQAKAESNSHHAPLSDGKTGSSYPHWFTNGYDGNGKLIKGRTPIKFGKADCDRPPKHSQNGMGKDDHYLLEFPTFPDGHDYKFDSKKPKENPGPARVIYTYPNKVFCGIVAHQRGNQGDLRLCSH 149 T 46 Cuticle_2 pdbhh F Eukaryota T 1ar6 1 A 0 P1/MAHONEY POLIOVIRUS GSSST 5 T 190 DltD pdbhh F F 1ar7 1 A 0 P1/MAHONEY POLIOVIRUS GSSST 5 T 190 DltD pdbhh F F 1ar8 1 A 0 P1/MAHONEY POLIOVIRUS AAAASSST 8 T 47 DUF2600 pdbhh F F 1ar9 1 A 0 P1/MAHONEY POLIOVIRUS GSSST 5 T 190 DltD pdbhh F F 1asj 1 A 0 P1/MAHONEY POLIOVIRUS GSSST 5 T 190 DltD pdbhh F F 1ati 2 C C GLYCYL-tRNA SYNTHETASE XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 37 F F F 1ati 3 D D GLYCYL-tRNA SYNTHETASE XXXXXXXXXXXXXXXX 16 F F F 1av2 1 A,B,C,D A,B,C,D VALYL GRAMICIDIN XGAXAXVXWXWXWXWX 16 T 4.6 MAP17 pdbhh F F 1awi 2 C P L-PRO10 PPPPPPPPPP 10 T 23 IL11 pdbhh F F 1awq 2 B B PEPTIDE FROM THE HIV-1 CAPSID PROTEIN HAGPIA 6 T 24 FTHFS pdbhh F F 1awr 2 G,H,I,J,K,L G,H,I,J,K,L PEPTIDE FROM THE HIV-1 CAPSID PROTEIN HAGPIA 6 T 24 FTHFS pdbhh F F 1aws 2 B B PEPTIDE FROM THE HIV-1 CAPSID PROTEIN HAGPIA 6 T 24 FTHFS pdbhh F F 1awt 2 G,H,I,J,K,L G,H,I,J,K,L PEPTIDE FROM THE HIV-1 CAPSID PROTEIN HAGPIA 6 T 24 FTHFS pdbhh F F 1awu 2 B B PEPTIDE FROM THE HIV-1 CAPSID PROTEIN HVGPIA 6 T 100 LEM pdbhh F T 1awv 2 G,H,I,J,K,L G,H,I,J,K,L PEPTIDE FROM THE HIV-1 CAPSID PROTEIN HVGPIA 6 T 100 LEM pdbhh F T 1axc 2 B,D,F B,D,F CDN1A_HUMAN P21/WAF1 GRKRRQTSMTDFYHSKRRLIFS 22 T 0.85 CDC27 pdbhh F Eukaryota T 1axd 2 C,D C,D LACTOYLGLUTATHIONE XXG 3 T 790 Flp_Fap pdbhh F F 1ay3 1 A A PEPTIDIC TOXIN NODULARIN XRXXX 5 T 700 DUF2777 pdbhh F F 1aya 2 B,D P,Q PGFRB_MOUSE PEPTIDE PDGFR-1009 SVLXTAVQPNE 11 T 38 Phage_holin_2_2 pdbhh F Eukaryota T 1ayb 2 B P IRS1_MOUSE PEPTIDE IRS-1-895 SPGEXVNIEFGS 12 T 0.7 CBM32 pdbhh F Eukaryota T 1ayc 2 B P PGFRB_MOUSE PEPTIDE PDGFR-740 DGGXMDMSKGS 11 F F Eukaryota T 1aze 2 B B SOS_DROME SOS VPPPVPPRRR 10 T 2.5 Dscam_C pdbhh F Eukaryota F 1azg 1 A A P85A_HUMAN P2L PPRPLPVAPGSSKT 14 T 0.9 AAA_11 pdbhh F Eukaryota T 1b05 2 B B PEPTIDE LYS-CYS-LYS KCK 3 T 62 B3GALT2_N pdbhh F F 1b07 2 B C SOS1_MOUSE PROTEIN (SH3 PEPTOID INHIBITOR) YEVPGPVPPRRR 12 T 11 Duffy_binding pdbhh F Eukaryota T 1b0g 3 C,F C,F EMC7_HUMAN PEPTIDE P1049 (ALWGFFPVL) ALWGFFPVL 9 T 0.51 MRP-L47 pdbhh F Eukaryota T 1b0h 2 B B LYS-ALN-LYS KXK 3 T 500 VGCC_beta4Aa_N pdbhh F F 1b0q 1 A A MSH XCEHXRWCKPVX 12 F F T 1b0r 3 C C PROTEIN (INFLUENZA MATRIX PEPTIDE) GILGFVFTX 9 T 1.7 Flu_M1 pdbhh F T 1b1h 2 B B PROTEIN (LYS HPE LYS) KXK 3 T 220 YopX pdbhh F F 1b2h 2 B B LYS-ORN-LYS KXK 3 T 500 VGCC_beta4Aa_N pdbhh F F 1b32 2 B B PROTEIN (LYS-MET-LYS) KMK 3 T 260 DUF1598 pdbhh F F 1b3f 2 B B PROTEIN (LYS-HIS-LYS) KHK 3 T 370 DUF1153 pdbhh F F 1b3g 2 B B PROTEIN (LYS-ILE-LYS) KIK 3 T 390 DUF1030 pdbhh F F 1b3h 2 B B LYS-ALC-LYS KXK 3 T 500 VGCC_beta4Aa_N pdbhh F F 1b3l 2 B,D B,D PROTEIN (LYS-GLY-LYS) KGK 3 T 250 zf-CCHC pdbhh F F 1b40 2 B B PROTEIN (LYS-PHE-LYS) KFK 3 T 220 YopX pdbhh F F 1b46 2 B B PROTEIN (LYS-PRO-LYS) KPK 3 T 280 CitT pdbhh F F 1b4h 2 B B LYS-DAB-LYS PEPTIDE KXK 3 T 500 VGCC_beta4Aa_N pdbhh F F 1b4z 2 B B PROTEIN (PEPTIDE LYS-ASP-LYS) KDK 3 T 540 E7R pdbhh F F 1b51 2 B B PROTEIN (LYS-SER-LYS) KSK 3 T 460 DUF2569 pdbhh F F 1b52 2 B B PROTEIN (LYS-THR-LYS) KTK 3 T 630 PAS_12 pdbhh F F 1b58 2 B B PROTEIN (LYS-TYR-LYS) KYK 3 T 180 DUF2039 pdbhh F F 1b5h 2 B B LYS-DPP-LYS PEPTIDE KXK 3 T 500 VGCC_beta4Aa_N pdbhh F F 1b5i 2 B B PROTEIN (LYS-ASN-LYS) KNK 3 T 560 DUF496 pdbhh F F 1b5j 2 B B PROTEIN (LYS-GLN-LYS) KQK 3 T 460 MEF2_binding pdbhh F F 1b6h 2 B B LYS-NVA-LYS PEPTIDE KXK 3 T 500 Whi5 pdbhh F F 1b6j 2 C C CYCLIC PEPTIDE INHIBITOR NXPIVX 6 T 230 DUF4224 pdbhh F F 1b7h 2 B B LYS-NLE-LYS PEPTIDE KXK 3 T 590 zf-C2H2_4 pdbhh F F 1b8d 3 E G PROTEIN (RHODOPHYTAN PHYCOERYTHRIN (GAMMA CHAIN)) GYXXYX 6 T 150 SLBP_RNA_bind pdbhh F F 1b8h 2 D D DPOL_BPR69 GP43 KKASLFDMFDF 11 T 0.82 Radial_spoke_3 pdbhh T Viruses T 1b8q 2 B B PROTEIN (HEPTAPEPTIDE) VVKVDSV 7 T 60 Gyro_capsid pdbhh F F 1b9j 2 B B PROTEIN (LYS-LEU-LYS) KLK 3 T 590 zf-C2H2_4 pdbhh F F 1b9p 1 A A COEA1_CHICK ALPHA 1 TYPE XIV COLLAGEN CAVELRSPGISRFRRKIAKRSIKTLEHKRENAKE 34 T 1.8 Mrx7 pdbhh F Eukaryota T 1b9q 1 A A COEA1_CHICK ALPHA 1 TYPE XIV COLLAGEN CAVELRSPGISRFRRKIAKRSIKTLEHKRENAKE 34 T 1.8 Mrx7 pdbhh F Eukaryota T 1bbr 4 D,G,J F,G,I FIBA_HUMAN FIBRINOGEN ALPHA/ALPHA-E CHAIN PRECURSOR XDFLAEGGGVR 11 T 1.4 DUF4715 unphh F Eukaryota T 1bbz 2 B,D,F,H B,D,F,H PEPTIDE P41 XAPSYSPPPPP 11 T 1.8 N1221 pdbhh F F 1bc5 2 B T TAR XNWETF 6 T 37 FDF pdbhh F T 1bcc 9 I,I2 I,I CYTOCHROME BC1 COMPLEX, COMPLEX III XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 33 F F F 1bck 2 B C CYCLOSPORINE, CICLOSPORIN, CICLOSPORINE XXXXXTXXVXA 11 T 11 DNA_pol_D_N pdbhh F F 1bcr 3 C C ANTIPAIN XRVX 4 T 41 Receptor_IA-2 pdbhh F F 1bcs 3 C C CHYMOSTATIN A FXLX 4 T 160 HycA_repressor pdbhh F F 1bcv 1 A _ POLG_FMDVA PEPTIDE CORRESPONDING TO THE MAJOR IMMUNOGEN SITE OF FMD VIRUS XGSGVRGDFGSLAPRVARQL 20 T 0.00016 Rhv unppercent T Viruses T 1bdk 1 A _ bradykinin antagonist B-9340 XRPPGXSXXR 10 T 1 Bradykinin pdbhh F F 1bdw 1 A,B A,B VALYL GRAMICIDIN XGAXAXVXWXWXWXWX 16 T 4.6 MAP17 pdbhh F F 1be9 2 B B CRIPT KQTSV 5 T 360 DIE2_ALG10 pdbhh F F 1bei 1 A _ K1A_STIHL SHK-DNP22 RSCIDTIPKSRCTAFQCKHSMXYRLSFCRKTCGTC 35 T 0.0045 ShK unp F Eukaryota T 1bfw 1 A _ VP1 PROTEIN XXXXXXXXXXGXXGXXGXGX 20 T 9.7 Rhodanese_C pdbhh F F 1bfz 1 A _ HCMV PROTEASE R-SITE N-TERMINAL CLEAVAGE PRODUCT XSYVKA 6 T 150 DUF632 pdbhh F T 1bhf 2 B I INHIBITOR ACE-IPA-GLU-GLU-ILE XXEEI 5 T 94 eIF3h_C pdbhh F F 1bhx 4 D E ALPHA THROMBIN DFEEI 5 T 27 TSLP pdbhh F F 1bi6 1 A L IBRO_ANACO BROMELAIN INHIBITOR VI TACSECVCPLR 11 T 0.014 CID_GANP unp F Eukaryota T 1bjr 2 B I LACTOFERRIN VAQGGAAGLA 10 T 7.5 MCRA pdbhh F F 1bk6 2 B,E C,E LARGE T ANTIGEN KKKRKV 6 T 76 LCD1 pdbhh F F 1bk6 3 C,F D,F LARGE T ANTIGEN AKKAA 5 T 260 DUF3726 pdbhh F F 1bll 2 B I AMASTATIN XVVD 4 T 400 Fer4 pdbhh F F 1bm2 2 B L CYCLO-[N-ALPHA-ACETYL-L-THIALYSYL-O-PHOSPHOTYROSYL -VALYL-ASPARAGYL-VALYL- PROLYL] XXXVNVP 7 T 2.8 LAX pdbhh F F 1bmb 2 B I PROTEIN (PKF270-974) KPFXVNVEF 9 T 0.61 SH3-WW_linker pdbhh F T 1bog 3 C C PEPTIDE GATPEDLNQKL 11 T 8.6 DUF4605 pdbhh F T 1br8 2 C P PROTEIN (PEPTIDE) SEAAASTAVVIA 12 T 28 ACC_epsilon pdbhh F T 1bs6 2 D,E,F D,E,F PROTEIN (MET-ALA-SER) MAS 3 T 280 zf-C2H2_4 pdbhh F F 1bs8 2 D,E,F D,E,F PROTEIN (MET-ALA-SER) MAS 3 T 280 zf-C2H2_4 pdbhh F F 1bt6 2 C,D C,D ANXA2_CHICK ANNEXIN II XSTVHEILSKLSLE 14 T 8 DUF4581 pdbhh F Eukaryota T 1bw8 2 B P A8IP97_RAT PROTEIN (INTERNALIZATION SIGNAL FROM EGFR) FYRALM 6 T 0.2 GcnA_N pdbhh F Eukaryota T 1bx9 2 B B FOE-4053-glutathione conjugate GGL-FOE-GLY XXG 3 T 24 DUF1936 pdbhh F F 1bxp 2 B B PEPTIDE MET-ARG-TYR-TYR-GLU-SER-SER-LEU-LYS-SER-TYR-PRO-ASP MRYYESSLKSYPD 13 T 3.3 Prion pdbhh F T 1bxx 2 B P PROTEIN (TGN38 PEPTIDE) DYQRLN 6 T 30 Fer4_24 pdbhh F T 1by5 2 B B FERRICHROME XXXGGG 6 T 190 IncE pdbhh F F 1byz 1 A,B,C,D A,B,C,D PROTEIN (SYNTHETIC DESIGNED PEPTIDE "ALPHA-1") XELLKKLLEELKG 13 T 11 NABP pdbhh F F 1bz9 3 C C PROTEIN (PEPTIDE P1027 (FAPGVFPYM)) FAPGVFPYM 9 T 0.35 CT_C_D pdbhh F T 1bzh 2 B I PROTEIN (PROTEIN-TYROSINE-PHOSPHATASE 1B INHIBITOR) DADEXLX 7 T 0.29 Glyco_transf_92 pdbhh F F 1c0q 1 A,B A,B VANCOMYCIN XXNXXXX 7 T 95 P53_C pdbhh F F 1c0r 1 A,B A,B VANCOMYCIN XXNXXXX 7 T 95 P53_C pdbhh F F 1c2u 1 A A K1A_STIHL SYNTHETIC PEPTIDE ANALOGUE OF SHK TOXIN RSXIDTIPKSRCTAFQCKHSAKYRLSFCRKTCGTX 35 T 0.0045 ShK unp F Eukaryota T 1c4b 1 A A PROTEIN (CYCLO(RD-262)) CXXXXXGXXXXXX 13 T 2.7 DUF4793 pdbhh F F 1c4d 1 A,B,C,D A,B,C,D VALYL GRAMICIDIN XGAXAXVXWXWXWXWX 16 T 4.6 MAP17 pdbhh F F 1c4e 1 A A GUR_GYMSY PROTEIN (GURMARIN) QQCVKKDELCIPYYLDCCEPLECKKVNWWDHKCIG 35 T 0.00036 Toxin_7 pdb F Eukaryota T 1c4v 3 C 3 HIRUGEN ACENEDFEEIPGEYL 15 T 0.033 Hirudin pdbhh F T 1c4y 3 C 3 HIRUGEN ENEDFEGIPGEYL 13 T 0.3 Hirudin pdbhh F T 1c51 1 A A PROTEIN (PHOTOSYSTEM I: SUBUNIT PSAA) XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 597 F F F 1c51 2 B B PROTEIN (PHOTOSYSTEM I: SUBUNIT PSAB ) XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 619 F F F 1c51 3 C C PROTEIN (PHOTOSYSTEM I: SUBUNIT PSAC) XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 77 F F F 1c51 4 D D PROTEIN (PHOTOSYSTEM I: SUBUNIT PSAD) XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 125 F F F 1c51 5 E E PROTEIN (PHOTOSYSTEM I: SUBUNIT PSAE) XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 75 F F F 1c51 6 F F PROTEIN (PHOTOSYSTEM I: SUBUNIT PSAF) XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 153 F F F 1c51 7 G K PROTEIN (PHOTOSYSTEM I: SUBUNIT PSAK) XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 78 F F F 1c51 8 H L PROTEIN (PHOTOSYSTEM I: SUBUNIT PSAL) XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 120 F F F 1c5f 2 B,D,F,H,J,L,N,P B,D,F,H,J,L,N,P CYCLOSPORINE, CICLOSPORIN, CICLOSPORINE XXXXXXXXVXA 11 T 9.7 DUF6090 pdbhh F F 1c9i 2 C,D C,D B-ADAPTIN 3 AVSLLDLDA 9 T 9.6 AP3B1_C pdbhh F F 1c9l 2 C,D C,D B-ADAPTIN 3 DTNLIEFE 8 T 55 DUF247 pdbhh F T 1ca0 1 A,E A,F CTRA_BOVIN BOVINE CHYMOTRYPSIN CGVPAIQPVLSGL 13 T 2 CaM_bind pdbhh F Eukaryota T 1ca9 2 G,H G,H TNR1B_HUMAN PROTEIN (TNF-R2) GQVPFSKEEC 10 T 3.2 Bac_export_2 pdbhh F Eukaryota T 1cbw 1 A,E A,F CTRA_BOVIN BOVINE CHYMOTRYPSIN CGVPAIQPVLSGL 13 T 2 CaM_bind pdbhh F Eukaryota T 1cdl 2 B,D,F,H E,F,G,H MYLK_CHICK CALCIUM/CALMODULIN-DEPENDENT PROTEIN KINASE TYPE II ALPHA CHAIN ARRKWQKTGHAVRAIGRLSS 20 T 7.3 PACT_coil_coil pdbhh F Eukaryota T 1cdm 2 B B KCC2A_RAT CALMODULIN LKKFNARRKLKGAILTTMLATRNFS 25 T 13 PACT_coil_coil pdbhh F Eukaryota T 1ce1 3 C P PROTEIN (PEPTIDE ANTIGEN) GTSSPSAD 8 T 9.5 Phage_T4_gp36 pdbhh F T 1cf0 2 C C PROTEIN (L-PRO10-IODOTYROSINE) PPPPPPPPX 9 T 0.21 Mtp pdbhh F F 1cfa 2 B B SYNTHETIC N-TERMINAL TAIL CLGX 4 T 53 SNAD3 pdbhh F F 1cfn 3 C C PROTEIN (BOUND PEPTIDE) GATPQDLNTX 10 T 3.2 DNA_Packaging_2 pdbhh F T 1cfs 3 C C PROTEIN (ANTIGEN BOUND PEPTIDE) GLYEWGGARIT 11 T 3.4 DUF4873 pdbhh F T 1cft 3 C C PROTEIN (ANTIGEN BOUND PEPTIDE) LKGPL 5 T 71 PufQ pdbhh F F 1cg9 3 C C EBNA6_EBVB9 PROTEIN (EBNA-6 NUCLEAR PROTEIN (EBNA-3C) (EBNA-4B)) LPPLDITPY 9 T 0.94 PINIT pdbhh T Viruses T 1cho 1 A E CTRA_BOVIN ALPHA-CHYMOTRYPSIN A CGVPAIQPVLSGL 13 T 2 CaM_bind pdbhh F Eukaryota T 1cjf 2 B,D C,D PROTEIN (PROLINE PEPTIDE) PPPPPPPPPPPPPPP 15 T 24 Caskin-Pro-rich pdbhh F F 1cka 2 B B C3G PEPTIDE (PRO-PRO-PRO-ALA-LEU-PRO-PRO-LYS-LYS-ARG) PPPALPPKKR 10 T 1 PTN13_u3 pdbhh F F 1ckb 2 B B SOS PEPTIDE (PRO-PRO-PRO-VAL-PRO-PRO-ARG-ARG-ARG-ARG) PPPVPPRRRR 10 T 1.2 HCV_NS5a_C pdbhh F F 1ckk 2 B B KKCC1_RAT CAMKK 1,CAM-KINASE IV KINASE,CALCIUM/CALMODULIN-DEPENDENT PROTEIN KINASE KINASE ALPHA,CAMKK ALPHA VKLIPSWTTVILVKSMLRKRSFGNPF 26 T 14 DUF4326 pdbhh F Eukaryota T 1clv 2 B I IAAI_AMAHP PROTEIN (ALPHA-AMYLASE INHIBITOR) CIPKWNRCGPKMDGVPCCEPYTCTSDYYGNCS 32 T 0.022 Toxin_12 pdbpssm F Eukaryota T 1cm1 2 B B KCC2A_RAT CALMODULIN-DEPENDENT PROTEIN KINASE II-ALPHA LKKFNARRKLKGAILTTMLATRNFS 25 T 13 PACT_coil_coil pdbhh F Eukaryota T 1cm4 2 B B KCC2A_RAT CALMODULIN-DEPENDENT PROTEIN KINASE II-ALPHA LKKFNARRKLKGAILTTMLATRNFS 25 T 13 PACT_coil_coil pdbhh F Eukaryota T 1cmi 2 C,D C,D NOS1_RAT BNOS, CONSTITUTIVE NOS, NC-NOS, NOS TYPE I, NEURONAL NOS, N-NOS, NNOS KAEMKDTGIQVDR 13 T 2.4 Exog_C pdbhh F Eukaryota T 1cmj 1 A A NOR_FUSOX P450NOR ASGAPSFPFSRASGPEPPAEFAKLRATNPVSQVKLFDGSLAWLVTKHKDVCFVATSEKLSKVRTRQGFPELSASGKQAAKAKPTFVDMDPPEHMHQRSMVEPTFTPEAVKNLQPYIQRTVDDLLEQMKQKGCANGPVDLVKEFALPVPSYIIYTLLGVPFNDLEYLTQQNAIRTNGSSTAREASAANQELLDYLAILVEQRLVEPKDDIISKLCTEQVKPGNIDKSDAVQIAFLLLVAGNATMVNMIALGVATLAQHPDQLAQLKANPSLAPQFVEELCRYHTATALAIKRTAKEDVMIGDKLVRANEGIIASNQSANRDEEVFENPDEFNMNRKWPPQDPLGFGFGDHRCIAEHLAKAELTTVFSTLYQKFPDLKVAVPLGKINYTPLNRDVGIVDLPVIF 402 T 2.3E-36 p450 unppercent F Eukaryota T 1cmn 1 A A NOR_FUSOX P450NOR ASGAPSFPFSRASGPEPPAEFAKLRATNPVSQVKLFDGSLAWLVTKHKDVCFVATSEKLSKVRTRQGFPELSASGKQAAKAKPTFVDMDPPEHMHQRSMVEPTFTPEAVKNLQPYIQRTVDDLLEQMKQKGCANGPVDLVKEFALPVPSYIIYTLLGVPFNDLEYLTQQNAIRTNGSSTAREASAANQELLDYLAILVEQRLVEPKDDIISKLCTEQVKPGNIDKSDAVQIAFLLLVAGNATMVNMIALGVATLAQHPDQLAQLKANPSLAPQFVEELCRYHTAVALAIKRTAKEDVMIGDKLVRANEGIIASNQSANRDEEVFENPDEFNMNRKWPPQDPLGFGFGDHRCIAEHLAKAELTTVFSTLYQKFPDLKVAVPLGKINYTPLNRDVGIVDLPVIF 402 T 2.3E-36 p450 pdbpercent F Eukaryota T 1cnl 1 A A CA1_CONIM PROTEIN (ALPHA-CONOTOXIN IMI) GCCSDPRCAWRCX 13 T 0.0098 Toxin_8 unphh F Eukaryota T 1cp3 2 B,D C,D ACETYL-ASP-VAL-ALA-ASP-FLUOROMETHYLKETONE XDVADX 6 T 670 GIT_SHD pdbhh F F 1cpi 2 C C CYCLIC PEPTIDE INHIBITOR NXPIVX 6 T 230 DUF4224 pdbhh F F 1csa 1 A A CYCLOSPORINE, CICLOSPORIN, CICLOSPORINE XXXXXXXXVXA 11 T 9.7 DUF6090 pdbhh F F 1csy 2 B B ACETYL-THR-PTR-GLU-THR-LEU-NH2 XTXETLX 7 T 230 DUF3921 pdbhh F F 1csz 2 B B ACETYL-THR-PTR-GLU-THR-LEU-NH2 XTXETLX 7 T 230 DUF3921 pdbhh F F 1cu4 3 C P RECOGNITION PEPTIDE APKTNMKHMA 10 T 22 MPP6 pdbhh F T 1cvq 1 A A HISTONE H3 XXGXXGGCX 9 T 12 DUF4223 pdbhh F F 1cvu 2 C F PROTEIN (9-MER) TKTATINAS 9 T 100 Snu56_snRNP pdbhh F T 1cw8 1 A A HISTONE H3 XXGXXGGCX 9 T 12 DUF4223 pdbhh F F 1cwa 2 B C CYCLOSPORINE, CICLOSPORIN, CICLOSPORINE XXXXXXXXVXA 11 T 9.7 DUF6090 pdbhh F F 1cwb 2 B C CYCLOSPORINE, CICLOSPORIN, CICLOSPORINE XXXXXXXXVXA 11 T 8 IncD pdbhh F F 1cwc 2 B C CYCLOSPORINE, CICLOSPORIN, CICLOSPORINE XXXXXXXXVXA 11 T 9.7 DUF6090 pdbhh F F 1cwd 2 B P (PHOSPHONOMETHYL)PHENYLALANINE-CONTAINING PEPTIDE PRO-GLU-GLY-ASP-PM3-GLU-GLU-VAL-LEU PEGDXEEVL 9 T 1.9 Ykof pdbhh F T 1cwe 2 B,D B,D PHOSPHOPEPTIDE ACQ-PMP-GLU-GLU-ILE-PRO XQXEEIP 7 T 1.4 Imm15 pdbhh F F 1cwf 2 B C CYCLOSPORINE, CICLOSPORIN, CICLOSPORINE XXXXXVXXVXA 11 T 4.4 PV-1 pdbhh F F 1cwh 2 B C CYCLOSPORINE, CICLOSPORIN, CICLOSPORINE XXXXXXXXVXA 11 T 6 YycH pdbhh F F 1cwi 2 B C CYCLOSPORINE, CICLOSPORIN, CICLOSPORINE XXXXXVXXVXA 11 T 45 IL13 pdbhh F F 1cwj 2 B C CYCLOSPORINE, CICLOSPORIN, CICLOSPORINE XXXXXVXXVXA 11 T 4.4 PV-1 pdbhh F F 1cwk 2 B C CYCLOSPORINE, CICLOSPORIN, CICLOSPORINE XXXXXVXXVXA 11 T 4.4 PV-1 pdbhh F F 1cwl 2 B C CYCLOSPORINE, CICLOSPORIN, CICLOSPORINE XXXXXXXXVXA 11 T 9.7 DUF6090 pdbhh F F 1cwm 2 B C CYCLOSPORINE, CICLOSPORIN, CICLOSPORINE XXXXXXXXVXA 11 T 0.96 DUF6090 pdbhh F F 1cwo 2 B C CYCLOSPORINE, CICLOSPORIN, CICLOSPORINE XXLXXTXXLXA 11 T 1.2 DUF6090 pdbhh F F 1cwu 1 A,B A,B FABI_BRANA ENOYL ACP REDUCTASE LPIDLRGKRAFIAGIADDNGYGWAVAKSLAAAGAEILVGTWVPALNIFETSLRRGKFDQSRVLPDGSLMEIKKVYPLDAVFDNPEDVPEDVKANKRYAGSSNWTVQEAAECVRQDFGSIDILVHSLGNGPEVSKPLLETSRKGYLAAISASSYSFVSLLSHFLPIMNPGGASISLTYIASERIIPGYGGGMSSAKAALESDTRVLAFEAGRKQNIRVNTISAGPLGSRAAKAIGFIDTMIEYSYNNAPIQKTLTADEVGNAAAFLVSPLASAITGATIYVDNGLNSMGVALDSPVF 296 T 3.1E-05 adh_short_C2 unppssm F Eukaryota T 1cwz 1 A A HISTONE H3 XXGXXGGCX 9 T 12 DUF4223 pdbhh F F 1cya 1 A A CYCLOSPORINE, CICLOSPORIN, CICLOSPORINE XXXXXXXXVXA 11 T 9.7 DUF6090 pdbhh F F 1cyb 1 A A CYCLOSPORINE, CICLOSPORIN, CICLOSPORINE XXXXXXXXVXA 11 T 9.7 DUF6090 pdbhh F F 1cyn 2 B C CYCLOSPORINE, CICLOSPORIN, CICLOSPORINE XXXXXXXXVXA 11 T 11 DUF6090 pdbhh F F 1cz6 1 A A ANDT_ANDAU PROTEIN (ANDROCTONIN) RSVCRQIKICRRRGGCYYKCTNRPY 25 T 0.35 DUF4528 pdbhh F Eukaryota T 1czi 2 B P PFIZER INHIBITOR PXCX 4 T 26 DUF166 pdbhh F F 1czq 2 B D D10-P1 XGXXXXXXXXXXXXXXX 17 T 8.3 Harakiri pdbhh F F 1czz 2 D,E D,E TNR5_HUMAN CD40 XPVQETLHGC 10 T 1.8 Ripply pdbhh F Eukaryota T 1d00 2 I,J,K,L,M,N,O,P I,J,K,L,M,N,O,P TNR5_HUMAN B-CELL SURFACE ANTIGEN CD40 XPVQETX 7 T 11 DUF3827 unphh F Eukaryota T 1d01 2 G,H,I G,H,I TNR8_HUMAN CD30 PEPTIDE XMLSVEEEG 9 T 64 DDRGK pdbhh F Eukaryota T 1d0a 2 G,H,I,J,K,L G,H,I,J,K,L TNR4_HUMAN OX40L RECEPTOR PEPTIDE XPIQEE 6 T 0.0025 TMEM154 unphh F Eukaryota F 1d0j 2 G,H,I,J,K G,H,I,J,K TNR9_MOUSE 4-1BB LIGAND RECEPTOR XGAAQEE 7 T 0.011 TMEM154 unphh F Eukaryota F 1d0w 1 A A C-TERMINAL ANALOGUE OF NEUROPEPTIDE Y, A POTENT Y2 RECEPTOR AGONIST ARHYKNLLERQRYX 14 T 1.1 Hormone_3 pdbhh F T 1d1e 1 A A C-TERMINAL ANALOGUE OF NEUROPEPTIDE Y, A POTENT Y2 RECEPTOR AGONIST XRHYKNLIERQRYX 14 T 0.00024 Hormone_3 pdbhh F T 1d4t 2 B B SLAF1_HUMAN SLAM KSLTIYAQVQK 11 T 0.1 MFS_1 unppssm F Eukaryota T 1d4w 2 C,D C,D SLAF1_HUMAN SLAM KSLTIXAQVQK 11 T 0.1 MFS_1 unppssm F Eukaryota T 1d5g 2 B B PEPTIDE FADSEADENEQVSAV FADSEADENEQVSAV 15 T 25 DUF1660 pdbhh F T 1d5m 4 D D INHIBITOR XXRAMXSLX 9 T 57 DUF3725 pdbhh F T 1d5q 1 A A CHIMERIC MINI-PROTEIN CNLARCQLSCKSLGLKGGCQGSFCTCG 27 T 0.027 Toxin_2 pdbhh F T 1d5x 4 D D DIPEPTIDE MIMETIC INHIBITOR XXRXXX 6 T 790 DALR_1 pdbhh F F 1d5z 4 D D PROTEIN (PEPTIDOMIMETIC INHIBITOR) XXRAXSLX 8 T 480 Etmic-2 pdbhh F F 1d6e 4 D D PEPTIDOMIMETIC INHIBITOR XXRXMASXX 9 T 48 DUF2556 pdbhh F F 1d6x 1 A A ANTIMICROBIAL PEPTIDE, TRITRPTICIN VRRFPWWWPFLRR 13 T 1.5 DUF2841 pdbhh F T 1d7q 1 A B PROTEIN (N-TERMINAL HISTIDINE TAG) MRGSHHHHHHTDPM 14 T 8300 zf_CCCH_4 pdbhh F T 1d7t 1 A A YNK-CONTRYPHAN GCPXNPKX 8 T 0.038 zf-U11-48K pdbhh F T 1d8e 3 C P RASK_HUMAN K-RAS4B PEPTIDE SUBSTRATE KKKSKTKCVIM 11 T 0.035 Fer4_22 unppssm F Eukaryota T 1d8s 1 A,B,C A,B,C F1 ATPASE (ALPHA SUBUNIT) XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 492 F F F 1d8s 2 D,E,F D,E,F F1 ATPASE (BETA SUBUNIT) XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 467 F F F 1d8s 3 G G F1 ATPASE (GAMMA SUBUNIT) XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 214 F F F 1d8t 2 C,D C,D THCL_PLARO GE2270A SXNXVXGXXXXXSPX 15 T 1.2 CCER1 unphh F Bacteria T 1ddm 2 B B NAK GFSNMSFEDFP 11 T 1.9 Dodecin pdbhh F T 1de3 1 A A RNAS_ASPGI RIBONUCLEASE ALPHA-SARCIN AVTWTCLNDQKNPKTNKYETKRLLYNQNKAESNSHHAPLSDGKTGSSYPHWFTNGYDGDGKLPKGRTPIKFGKSDCDRPPKHSKDGNGKTDHYLLEFPTFPDGHDYKFDSKKPKENPGPARVIYTYPNKVFCGIIAHTKENQGELKLCSH 150 T 23 MtrE unphh F Eukaryota T 1de7 3 E,F A,B FACTOR XIII ACTIVATION PEPTIDE (28-37) TVELQGVVPXX 11 T 0.7 DUF4075 pdbhh F T 1deq 4 G,N M,Z FIBRINOGEN XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 90 F F F 1dfy 1 A A COW_CONSE CONTRYPHAN-SM GCPXQPWX 8 T 0.45 EndIII_4Fe-2S pdbhh F Eukaryota T 1dfz 1 A A COW_CONSE CONTRYPHAN-SM GCPXQPWX 8 T 0.45 EndIII_4Fe-2S pdbhh F Eukaryota T 1dg0 1 A A COW_CONRA DES[GLY1]-CONTRYPHAN-R CPXQPWX 7 F F Eukaryota F 1dit 3 C P PEPTIDE INHIBITOR CVS995 XDPXGGGGGNGDFEEIPEYL 20 T 0.16 Hirudin pdbhh F T 1dkd 2 B,D,F,H E,F,G,H 12-MER PEPTIDE SWMTTPWGFLHP 12 T 1.1 DUF6163 pdbhh F T 1dkx 2 B B SUBSTRATE PEPTIDE (7 RESIDUES) NRLLLTG 7 T 10 PgaPase_1 pdbhh F F 1dky 2 C,D C,D PEPTIDE SUBSTRATE NRLLLTG 7 T 10 PgaPase_1 pdbhh F F 1dkz 2 B B SUBSTRATE PEPTIDE (7 RESIDUES) NRLLLTG 7 T 10 PgaPase_1 pdbhh F F 1dlk 1 A,C A,C CTRA_BOVIN Thrombin light chain CGVPAIQPVLSGL 13 T 2 CaM_bind pdbhh F Eukaryota T 1dlk 3 E,F E,F peptidic inhibitor XGGXX 5 T 220 DUF4045 pdbhh F F 1dlz 1 A A ZERVAMICIN IIB XWIQXITXLXPQXPXPX 17 T 25 bpX0 pdbhh F T 1dm4 3 C C FIBA_HUMAN PROTEIN (FIBRINOPEPTIDE) XDFLAEGGGVR 11 T 1.4 DUF4715 unphh F Eukaryota T 1dmc 1 A _ MT1_CALSI CD6 METALLOTHIONEIN-1 SPCQKCTSGCKCATKEECSKTCTKPCSCCPK 31 T 1.5 Metallothio_5 pdbhh F Eukaryota T 1dmd 1 A _ MT1_CALSI CD6 METALLOTHIONEIN-1 SPCQKCTSGCKCATKEECSKTCTKPCSCCPK 31 T 1.5 Metallothio_5 pdbhh F Eukaryota T 1dme 1 A _ MT1_CALSI CD6 METALLOTHIONEIN-1 PGPCCNDKCVCQEGGCKAGCQCTSCRCS 28 T 0.53 Metallothio_5 pdbhh F Eukaryota T 1dmf 1 A _ MT1_CALSI CD6 METALLOTHIONEIN-1 PGPCCNDKCVCQEGGCKAGCQCTSCRCS 28 T 0.53 Metallothio_5 pdbhh F Eukaryota T 1dn2 2 B,D E,F ENGINEERED PEPTIDE DCAWHLGELVWCTX 14 T 7.4 FAT pdbhh F T 1dng 1 A A HUMAN PLATELET FACTOR 4, SEGMENT 59-73 QAPAYEEAAEELAKS 15 T 0.91 Comm pdbhh F T 1dpu 2 B B UNG_HUMAN URACIL DNA GLYCOSYLASE (UNG2) RIQRNKAAALLRLAAR 16 T 3 ARL6IP6 unppssm F Eukaryota T 1dsc 2 C C DACTINOMYCIN TXPXXXTXPXX 11 T 61 DUF5572 pdbhh F F 1dsd 2 C C DACTINOMYCIN TXPXXXTXPXX 11 T 61 DUF5572 pdbhh F F 1dsr 1 A A A-166686,MDL 62,198) NXXXXXXXFXXXXGLXX 17 T 110 DUF3482 pdbhh F F 1dt7 2 C,D X,Y P53_HUMAN CELLULAR TUMOR ANTIGEN P53 SHLKSKKGQSTSRHKKLMFKTE 22 T 56 Class_IIIsignal pdbhh F Eukaryota T 1dtd 2 B B MCPI_HIRME METALLOCARBOXYPEPTIDASE INHIBITOR DESFLCYQPDQVCCFICRGAAPLPSEGECNPHPTAPWCREGAVEWVPYSTGQCRTTCIPYV 61 T 0.019 Inhibitor_I68 unp F Eukaryota T 1dtv 1 A A MCPI_HIRME LCI GSHTPDESFLCYQPDQVCCFICRGAAPLPSEGECNPHPTAPWCREGAVEWVPYSTGQCRTTCIPYVE 67 T 0.0093 Inhibitor_I68 pdb F Eukaryota T 1du1 1 A A CAC1S_RABIT SKELETAL DIHYDROPYRIDINE RECEPTOR TSAQKAKAEERKRRKMSRGL 20 T 0.59 DUF1682 pdbhh F Eukaryota T 1dum 1 A,B A,B MAGA_XENLA MAGAININ 2 GIGKYLHSAKKFGKAWVGEIMNS 23 T 1.6 TAFII28 pdbhh F Eukaryota T 1duy 3 C,F C,F HTLV-1 OCTAMERIC TAX PEPTIDE LFGYPVYV 8 T 0.076 Pecanex_C pdbhh F T 1dva 3 C,F X,Y PEPTIDE E-76 XALCDDPRVDRWYCQFVEGX 20 T 0.97 HTH_48 pdbhh F T 1dzi 2 B,C,D B,C,D COLLAGEN GPPGPPGFPGERGPPGPPGPPX 22 T 0.0013 Collagen pdbpssm F T 1e4w 3 C P CYCLIC PEPTIDE SHFNEYE 7 T 21 Phospho_p8 pdbhh F T 1e4x 4 E,F P,Q CYCLIC PEPTIDE VVSHFND 7 T 3.7 TnpW pdbhh F T 1e54 2 B B OMP32 DNWQNGTS 8 T 4.8 DUF1842 pdbhh F T 1e6i 2 B P H4_YEAST HISTONE H4 AXRHRKILRNSIQGI 15 T 4.2 Shadoo unppercent F Eukaryota T 1e74 1 A A CA1_CONIM ALPHA-CONOTOXIN IM1(R11E) GCCSDPRCAWECX 13 T 0.0098 Toxin_8 unphh F Eukaryota T 1e75 1 A A CA1_CONIM ALPHA-CONOTOXIN IM1(R7L) GCCSDPLCAWRCX 13 T 0.0098 Toxin_8 unphh F Eukaryota T 1e76 1 A A CA1_CONIM ALPHA-CONOTOXIN IM1(D5N) GCCSNPRCAWRCX 13 T 0.0098 Toxin_8 unphh F Eukaryota T 1e8n 2 B I PEPTIDE INHIBITOR XGFGPFGFA 9 T 0.062 PgaPase_1 pdbhh F F 1e91 2 B B MAD1_HUMAN MAD PROTEIN (MAX DIMERIZER) NIQMLLEAADYLE 13 T 3.9 Rad10 pdbhh F Eukaryota T 1e9w 1 A A THCL_STRAJ ALANINAMIDE, BRYAMYCIN, THIACTIN XIAXASXTXXXXTXXXXXX 19 T 0.93 CCER1 pdbhh F Bacteria F 1eak 2 E,F P,R INHIBITOR PEPTIDE GPAGPPGA 8 T 21 DUF6053 pdbhh F F 1eb1 1 A A PEPTIDE INHIBITOR DYEPIPEEAF 10 T 0.018 Hirudin pdbhh F T 1eb1 4 D B 3-CYCLOHEXYL-D-ALANYL-L-PROLYL-N~2~-METHYL-L-ARGININE XPX 3 T 300 Periviscerokin pdbhh F F 1ee5 2 B B NUPL_XENLA NUCLEOPLASMIN AVKRPAATKKAGQAKKKKL 19 T 0.0016 BSP_II unppercent F Eukaryota T 1ee7 1 A A CHRYSOSPERMIN C XFXSXXLQGXXAAXPXXXQX 20 T 21 DUF4141 pdbhh F T 1een 2 B B ALA-ASP-PBF-PTR-LEU-ILE-PRO ADXXLIP 7 T 0.67 SPOC pdbhh F T 1eeo 2 B B ACETYL-E-L-E-F-PTYR-M-D-Y-E-NH2 PEPTIDE XELEFXMDYEX 11 T 5.1 ATP1G1_PLM_MAT8 pdbhh F T 1eey 3 C,F C,F GP2 PEPTIDE ILSALVGIV 9 T 0.7 H2O2_YaaD pdbhh F T 1eez 3 C,F C,F GP2 PEPTIDE ILSALVGIL 9 T 0.63 H2O2_YaaD pdbhh F T 1efg 2 B B ELONGATION FACTOR G XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 36 F F F 1efg 3 C C ELONGATION FACTOR G XXXXXXXXXXXXXXXXXXXXXXXXXXXX 28 F F F 1efr 4 H Q ATPG_BOVIN EFRAPEPTIN C XXXXXXLXGXXXXGLXX 17 T 49 Transport_MerF unphh F Eukaryota F 1eg0 12 L H PROTEIN (S20 RIBOSOMAL PROTEIN) XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 100 F F F 1ehf 1 A A NOR_FUSOX NITRIC-OXIDE REDUCTASE B MASGAPSFPFSRASGPEPPAEFAKLRATNPVSQVKLFDGSLAWLVTKHKDVCFVATSEKLSKVRTRQGFPELSASGKQAAKAKPTFVDMDPPEHMHQRSMVEPTFTPEAVKNLQPYIQRTVDDLLEQMKQKGCANGPVDLVKEFALPVPSYIIYTLLGVPFNDLEYLTQQNAIRTNGSSTAREASAANQELLDYLAILVEQRLVEPKDDIISKLCTEQVKPGNIDKSDAVQIAFLLLVAGNATMVNMIALGVATLAQHPDQLAQLKANPSLAPQFVEELCRYHTATALAIKRTAKEDVMIGDKLVRANEGIIASNQSANRDEEVFENPDEFNMNRKWPPQDPLGFGFGDHRCIAEHLAKAELTTVFSTLYQKFPDLKVAVPLGKINYTPLNRDVGIVDLPVIF 403 T 2.3E-36 p450 unppercent F Eukaryota T 1ehg 1 A A NOR_FUSOX NITRIC-OXIDE REDUCTASE B MASGAPSFPFSRASGPEPPAEFAKLRATNPVSQVKLFDGSLAWLVTKHKDVCFVATSEKLSKVRTRQGFPELSASGKQAAKAKPTFVDMDPPEHMHQRSMVEPTFTPEAVKNLQPYIQRTVDDLLEQMKQKGCANGPVDLVKEFALPVPSYIIYTLLGVPFNDLEYLTQQNAIRTNGSSTAREASAANQELLDYLAILVEQRLVEPKDDIISKLCTEQVKPGNIDKSDAVQIAFLLLVAGNATMVNMIALGVATLAQHPDQLAQLKANPSLAPQFVEELCRYHTAVALAIKRTAKEDVMIGDKLVRANEGIIASNQSANRDEEVFENPDEFNMNRKWPPQDPLGFGFGDHRCIAEHLAKAELTTVFSTLYQKFPDLKVAVPLGKINYTPLNRDVGIVDLPVIF 403 T 2.3E-36 p450 unppercent F Eukaryota T 1ei8 1 A,B,C,D,E,F A,B,C,D,E,F Q8T018_DROME COLLAGEN-LIKE PEPTIDE (PRO-HYP-GLY)4-PG-(PRO-HYP-GLY)5 PPGPPGPPGPPGPGPPGPPGPPGPPGPPG 29 T 0.0046 Collagen pdbpercent F Eukaryota F 1ejl 1 A,B A,B LT_SV40 SV40 LARGE T ANTIGEN NLS PEPTIDE PKKKRKV 7 T 0.28 FAM60A unppercent T Viruses F 1ejo 3 C P POLG_FMDVT FMDV PEPTIDE YTTSTRGDLAHVTTT 15 T 0.0013 Waikav_capsid_1 unphh T Viruses T 1ejy 1 A N NUPL_XENLA NUCLEOPLASMIN NLS PEPTIDE KRPAATKKAGQAKKKK 16 T 0.0016 BSP_II unppercent F Eukaryota T 1ekb 3 C C VAL-ASP-ASP-ASP-ASP-LYK PEPTIDE VDDDDXX 7 T 200 DUF3510 pdbhh F F 1elr 2 B B Q9H2A1_HUMAN HSP90-PEPTIDE MEEVD XMEEVD 6 T 13 TBP unphh F Eukaryota F 1elw 2 C,D C,D HSC70-PEPTIDE GPTIEEVD 8 T 8.1 DUF4028 pdbhh F T 1elx 1 A,B A,B PPB_ECOLI ALKALINE PHOSPHATASE TPEMPVLENRAAQGDITAPGGARRLTGDQTAALRDSLSDKPAKNIILLIGDGMGDSEITAARNYAEGAGGFFKGIDALPLTGQYTHYALNKKTGKPDYVTDAAASATAWSTGVKTYNGALGVDIHEKDHPTILEMAKAAGLATGNVSTAELQDATPAALVAHVTSRKCYGPSATSEKCPGNALEKGGKGSITEQLLNARADVTLGGGAKTFAETATAGEWQGKTLREQAQARGYQLVSDAASLNSVTEANQQKPLLGLFADGNMPVRWLGPKATYHGNIDKPAVTCTPNPQRNDSVPTLAQMTDKAIELLSKNEKGFFLQVEGASIDKQDHAANPCGQIGETVDLDEAVQRALEFAKKEGNTLVIVTADHAHASQIVAPDTKAPGLTQALNTKDGAVMVMSYGNSEEDSQEHTGSQLRIAAYGPHAANVVGLTDQTDLFYTMKAALGLK 449 T 6.8E-10 Alk_phosphatase unppssm F Bacteria T 1ely 1 A,B A,B PPB_ECOLI ALKALINE PHOSPHATASE TPEMPVLENRAAQGDITAPGGARRLTGDQTAALRDSLSDKPAKNIILLIGDGMGDSEITAARNYAEGAGGFFKGIDALPLTGQYTHYALNKKTGKPDYVTDCAASATAWSTGVKTYNGALGVDIHEKDHPTILEMAKAAGLATGNVSTAELQDATPAALVAHVTSRKCYGPSATSEKCPGNALEKGGKGSITEQLLNARADVTLGGGAKTFAETATAGEWQGKTLREQAQARGYQLVSDAASLNSVTEANQQKPLLGLFADGNMPVRWLGPKATYHGNIDKPAVTCTPNPQRNDSVPTLAQMTDKAIELLSKNEKGFFLQVEGASIDKQDHAANPCGQIGETVDLDEAVQRALEFAKKEGNTLVIVTADHAHASQIVAPDTKAPGLTQALNTKDGAVMVMSYGNSEEDSQEHTGSQLRIAAYGPHAANVVGLTDQTDLFYTMKAALGLK 449 T 6.5E-11 Alk_phosphatase pdbpssm F Bacteria T 1elz 1 A,B A,B PPB_ECOLI ALKALINE PHOSPHATASE TPEMPVLENRAAQGDITAPGGARRLTGDQTAALRDSLSDKPAKNIILLIGDGMGDSEITAARNYAEGAGGFFKGIDALPLTGQYTHYALNKKTGKPDYVTDGAASATAWSTGVKTYNGALGVDIHEKDHPTILEMAKAAGLATGNVSTAELQDATPAALVAHVTSRKCYGPSATSEKCPGNALEKGGKGSITEQLLNARADVTLGGGAKTFAETATAGEWQGKTLREQAQARGYQLVSDAASLNSVTEANQQKPLLGLFADGNMPVRWLGPKATYHGNIDKPAVTCTPNPQRNDSVPTLAQMTDKAIELLSKNEKGFFLQVEGASIDKQDHAANPCGQIGETVDLDEAVQRALEFAKKEGNTLVIVTADHAHASQIVAPDTKAPGLTQALNTKDGAVMVMSYGNSEEDSQEHTGSQLRIAAYGPHAANVVGLTDQTDLFYTMKAALGLK 449 T 6.8E-10 Alk_phosphatase unppssm F Bacteria T 1eoj 2 B B THROMBIN INHIBITOR P798 XRXXXDYEPIPEEA 14 T 0.12 Hirudin pdbhh F T 1eol 2 B B THROMBIN INHIBITOR P628 XRXXXDYEPIPEEAA 15 T 0.16 Hirudin pdbhh F T 1epl 2 B I PS1, PRO-LEU-GLU-PSA-ARG-LEU PLEXRL 6 T 12 DUF1923 pdbhh F F 1epm 2 B I PS2, THR-PHE-GLN-ALA-PSA-LEU-ARG-GLU TFQAXLRE 8 T 0.26 SAC3 pdbhh F T 1eqx 1 A A UBE3A_HUMAN PAPILLOMAVIRUS E6-ASSOCIATED PROTEIN IPESSELTLQELLGEERR 18 T 3.5 DUF1413 pdbhh F Eukaryota T 1er8 2 B I ANGT_HORSE H-77 XPFHLLVY 8 T 0.86 Nairo_nucleo unphh F Eukaryota T 1eva 1 A _ MICROCYSTIN-LR XLXRXXX 7 T 49 Phage_TAC_2 pdbhh F F 1evb 1 A _ MICROCYSTIN-LR XLXRXXX 7 T 49 Phage_TAC_2 pdbhh F F 1evc 1 A _ NODULARIN-V XVXXX 5 T 840 PAS_11 pdbhh F F 1evd 1 A _ NODULARIN-V XVXXX 5 T 840 PAS_11 pdbhh F F 1evh 2 B B Peptide ACTA XFPPPPT 7 T 0.38 AIM3 pdbhh F F 1eww 1 A A Q9GTP0_CHOFU ANTIFREEZE PROTEIN DGSCTNTNSQLSANSKCEKSTLTNCYVDKSEVYGTTCTGSRFDGVTITTSTSTGSRISGPGCKISTCIITGGVPAPSAACKISGCTFSAN 90 T 6.2E-25 CfAFP unppssm F Eukaryota T 1exy 2 B B REX_HTL1C PROTEIN X (HTLV-1), P27 PROTEIN (HTLV-1) MPKTRRRPRRSQRKRP 16 T 5.9 DUF1639 pdbhh T Viruses T 1eyx 3 E,F G,H R-PHYCOERYTHRIN AAFRAA 6 T 110 DUF5302 pdbhh F F 1ezg 1 A,B A,B ANPY1_TENMO THERMAL HYSTERESIS PROTEIN ISOFORM YL-1 QCTGGADCTSCTGACTGCGNCPNAVTCTNSQHCVKANTCTGSTDCNTAQTCTNSKDCFEANTCTDSTNCYKATACTNSSGCPGH 84 T 0.0023 AFP pdb F Eukaryota T 1f1j 2 C,D C,D ACE-ASP-GLU-VAL-ASP-CHO XDEVX 5 T 570 Helicase_RecD pdbhh F F 1f1w 2 B B S(PTR)VNVQN PHOSPHOPEPTIDE SXVNVQN 7 T 84 Fringe pdbhh F F 1f24 1 A A NOR_FUSOX NITRIC OXIDE REDUCTASE ASGAPSFPFSRASGPEPPAEFAKLRATNPVSQVKLFDGSLAWLVTKHKDVCFVATSEKLSKVRTRQGFPELSASGKQAAKAKPTFVDMDPPEHMHQRSMVEPTFTPEAVKNLQPYIQRTVDDLLEQMKQKGCANGPVDLVKEFALPVPSYIIYTLLGVPFNDLEYLTQQNAIRTNGSSTAREASAANQELLDYLAILVEQRLVEPKDDIISKLCTEQVKPGNIDKSDAVQIAFLLLVAGNAAMVNMIALGVATLAQHPDQLAQLKANPSLAPQFVEELCRYHTASALAIKRTAKEDVMIGDKLVRANEGIIASNQSANRDEEVFENPDEFNMNRKWPPQDPLGFGFGDHRCIAEHLAKAELTTVFSTLYQKFPDLKVAVPLGKINYTPLNRDVGIVDLPVIF 402 T 2.3E-36 p450 unppercent F Eukaryota T 1f25 1 A A NOR_FUSOX NITRIC OXIDE REDUCTASE ASGAPSFPFSRASGPEPPAEFAKLRATNPVSQVKLFDGSLAWLVTKHKDVCFVATSEKLSKVRTRQGFPELSASGKQAAKAKPTFVDMDPPEHMHQRSMVEPTFTPEAVKNLQPYIQRTVDDLLEQMKQKGCANGPVDLVKEFALPVPSYIIYTLLGVPFNDLEYLTQQNAIRTNGSSTAREASAANQELLDYLAILVEQRLVEPKDDIISKLCTEQVKPGNIDKSDAVQIAFLLLVAGNANMVNMIALGVATLAQHPDQLAQLKANPSLAPQFVEELCRYHTASALAIKRTAKEDVMIGDKLVRANEGIIASNQSANRDEEVFENPDEFNMNRKWPPQDPLGFGFGDHRCIAEHLAKAELTTVFSTLYQKFPDLKVAVPLGKINYTPLNRDVGIVDLPVIF 402 T 1.4999999999999999E-36 p450 pdbpssm F Eukaryota T 1f3r 1 A A ACETYLCHOLINE RECEPTOR ALPHA WNPGDYGGIX 10 T 0.45 CBM32 pdbhh F T 1f47 2 B A FTSZ_ECOLI CELL DIVISION PROTEIN FTSZ KEPDYLDIPAFLRKQAD 17 T 0.99 Drc1-Sld2 pdbhh F Bacteria T 1f4v 2 D,E,F D,E,F FLIM_ECOLI FLIM MGDSILSQAEIDALLN 16 T 0.027 CitT pdbhh F Bacteria T 1f59 2 C,D C,D NSP1P XDDSKPAFSFGXXXXXXXXXXXAFSFGX 28 T 16 SHIPPO-rpt pdbhh F T 1f7a 2 C P Q9YX54_9HIV1 CA-P2 SUBSTRATE KARVLAEAMS 10 T 13 GREB1 pdbhh T Viruses T 1f8a 2 B C Y(SEP)PT(SEP)S PEPTIDE YSPTSPS 7 T 0.03 RNA_pol_Rpb1_R pdbhh F F 1f8h 2 B B PTGSSSTNPFR PTGSSSTNPFR 11 T 1.8 Yuri_gagarin pdbhh F T 1f8i 1 A,B,C,D A,B,C,D ACEA_MYCTU ICL MASVVGTPKSAEQIQQEWDTNPRWKDVTRTYSAEDVVALQGSVVEEHTLARRGAEVLWEQLHDLEWVNALGALTGNMAVQQVRAGLKAIYLSGWQVAGDANLSGHTYPDQSLYPANSVPQVVRRINNALQRADQIAKIEGDTSVENWLAPIVADGEAGFGGALNVYELQKALIAAGVAGSHWEDQLASEKKSGHLGGKVLIPTQQHIRTLTSARLAADVADVPTVVIARTDAEAATLITSDVDERDQPFITGERTREGFYRTKNGIEPCIARAKAYAPFADLIWMETGTPDLEAARQFSEAVKAEYPDQMLAYNCSPSFNWKKHLDDATIAKFQKELAAMGFKFQFITLAGFHALNYSMFDLAYGYAQNQMSAYVELQEREFAAEERGYTATKHQREVGAGYFDRIATTVDPNSSTTALTGSTEEGQFH 429 T 1.8E-47 ICL pdb F Bacteria T 1f90 3 C E ANTIGENIC NONAPEPTIDE KPLEEVLNL 9 T 5.2 IL2 pdbhh F T 1f95 2 C,D C,D B2L11_HUMAN BCL2-LIKE 11 (APOPTOSIS FACILITATOR) MSCDKSTQT 9 T 0.17 FAM117 pdbhh F Eukaryota T 1f96 2 C,D C,D PROTEIN (NNOS, NEURONAL NITRIC OXIDE SYNTHASE) MKDTGIQVDRDLDGKSHK 18 T 8.5 APOBEC1 pdbhh F T 1f9e 3 C,F,I,L,O,R Q,R,S,T,U,V (PHQ)DEVD XDEVX 5 T 140 zf-NPL4 pdbhh F F 1fbv 2 B B ZAP70_HUMAN ZAP-70 PEPTIDE SDGXTPEPA 9 T 1.5 FSIP1 pdbhh F Eukaryota T 1fch 2 C,D C,D PTS1-CONTAINING PEPTIDE YQSKL 5 T 85 DUF678 pdbhh F F 1fe1 1 A,J A,J PROTEIN (PHOTOSYSTEM II: SUBUNIT PSBA) XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 169 F F F 1fe1 2 B,K B,K PROTEIN (PHOTOSYSTEM II: SUBUNIT PSBD) XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 174 F F F 1fe1 3 C,L C,L PROTEIN (PHOTOSYSTEM II: SUBUNIT PSBC) XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 156 F F F 1fe1 4 D,M D,M PROTEIN (PHOTOSYSTEM II: SUBUNIT PSBB) XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 155 F F F 1fe1 5 E,N E,N PROTEIN (PHOTOSYSTEM II: SUBUNIT PSBE) XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 40 F F F 1fe1 6 F,O F,O PROTEIN (PHOTOSYSTEM II: SUBUNIT PSBF) XXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 30 F F F 1fe1 7 G,P G,P PROTEIN (PHOTOSYSTEM II: SUBUNIT UNKNOWN) XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 312 F F F 1fe1 8 H,Q H,Q PROTEIN (PHOTOSYSTEM II: SUBUNIT PSBO) XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 115 F F F 1fe1 9 I,R I,R PROTEIN (PHOTOSYSTEM II: SUBUNIT PSBV) XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 87 F F F 1ff1 2 B B PTGSSSTNPFL PEPTIDE PTGSSSTNPFL 11 T 1.6 Yuri_gagarin pdbhh F T 1ffo 3 C,F C,F PEPTIDE WITH SEQUENCE ALA-ALA-VAL-TYR-ASN-PHE-ALA-THR-MET AAVYNFATM 9 T 5.9 DUF5607 pdbhh F T 1ffp 3 C,F C,F SYNTHETIC PEPTIDE WITH SEQUENCE SER-ALA-VAL-TYR-ASN-PHE-ALA-THR-MET SAVYNFATM 9 T 6.2 DUF5607 pdbhh F T 1fft 4 D,H D,I UBIQUINOL OXIDASE XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 109 F F F 1ffx 3 E E RB3 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 91 F F F 1fg2 3 C,F,I,L C,F,I,L LCMV PEPTIDIC EPITOPE GP33 KAVYNFATC 9 T 0.97 TOM6p pdbhh F T 1fip 2 C,D C,D UNKNOWN PEPTIDE, POSSIBLY PART OF THE UNOBSERVED RESIDUES IN ENTITY 1 XXXX 4 F F F 1fiv 2 B B FIV PROTEASE INHIBITOR LP-149 XXVXEXX 7 T 650 RhoGEF67_u2 pdbhh F F 1fiw 2 B L ACRO_SHEEP BETA-ACROSIN LIGHT CHAIN DNTTCDGPCGVRFRQNRQGGVR 22 T 130 Peptidase_C3 unp F Eukaryota T 1fiz 2 B L ACRO_PIG BETA-ACROSIN LIGHT CHAIN RDNATCDGPCGLRFRQKLESGMR 23 F F Eukaryota T 1fja 2 C,D C,D DACTINOMYCIN TXPXXXTXPXX 11 T 61 DUF5572 pdbhh F F 1fjm 2 C,D M,N microcystin LR XLXRXXX 7 T 49 Phage_TAC_2 pdbhh F F 1fka 2 B B 30S RIBOSOMAL PROTEIN S2 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 111 F F F 1fka 3 C C 30S RIBOSOMAL PROTEIN S3 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 176 F F F 1fka 9 I I 30S RIBOSOMAL PROTEIN S9 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 89 F F F 1fka 10 J J 30S RIBOSOMAL PROTEIN S10 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 71 F F F 1fka 11 K K 30S RIBOSOMAL PROTEIN S11 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 70 F F F 1fka 12 L L 30S RIBOSOMAL PROTEIN S12 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 103 F F F 1fka 13 M M 30S RIBOSOMAL PROTEIN S13 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 77 F F F 1fka 14 N N 30S RIBOSOMAL PROTEIN S14 XXXXXXXXXXXXXXXXXXXXXXXXXX 26 F F F 1fka 16 P P 30S RIBOSOMAL PROTEIN S16 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 73 F F F 1fka 17 Q Q 30S RIBOSOMAL PROTEIN S17 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 84 F F F 1fka 20 T T 30S RIBOSOMAL PROTEIN S20 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 95 F F F 1fkn 2 C,D C,D inhibitor EVNXAEF 7 T 200 DUF1480 pdbhh F T 1fll 2 B,D X,Y TNR5_HUMAN B-CELL SURFACE ANTIGEN CD40 KTAAPVQETLHGSQPVTQEDG 21 T 11 DUF3827 unphh F Eukaryota T 1flt 2 C,D X,Y VGFR1_HUMAN FLT-1, VGR1 GRPFVEMYSEIPEIIHMTEGRELVIPCRVTSPNITVTLKKFPLDTLIPDGKRIIWDSRKGFIISNATYKEIGLLTCEATVNGHLYKTNYLTHRQT 95 T 0.00015 Ig_2 pdbpssm F Eukaryota T 1fme 1 A A FSD-EY PEPTIDE EQYTAKYKGRTFRNEKELRDFIEKFKGR 28 T 0.77 DUF4121 pdbhh F T 1fn8 2 B B GLY-ALA-ARG GAR 3 T 360 AglB_L1 pdbhh F F 1foz 1 A A SYNTHETIC CYCLIC PEPTIDE XFELDKDF 8 T 10 Sda pdbhh F T 1fph 4 D F FIBA_HUMAN FIBRINOPEPTIDE A XDFLAEGGGVXX 12 T 1.4 DUF4715 unphh F Eukaryota T 1fpr 2 B B PEPTIDE PY469 EDTLTXADLD 10 T 2 G6B pdbhh F T 1fry 1 A A SC51_SHEEP SMAP29, SMAP-29 GENE PRODUCT RGLRRLGRKIAHGVKKYGPTVLRIIRIAG 29 T 0.095 CAP18_C unppercent F Eukaryota T 1fsd 1 A _ FULL SEQUENCE DESIGN 1 OF BETA BETA ALPHA MOTIF QQYTAKIKGRTFRNEKELRDFIEKFKGR 28 T 0.091 SpoVIF pdb F T 1fsv 1 A _ FULL SEQUENCE DESIGN 1 OF BETA BETA ALPHA MOTIF QQYTAKIKGRTFRNEKELRDFIEKFKGR 28 T 0.091 SpoVIF pdb F T 1fu5 2 B B MT_POVMA MT PEPTIDE EEEXMPMEDLXLDIL 15 T 3.6 DUF402 pdbhh T Viruses T 1fu9 1 A A USH_DROME U-SHAPED TRANSCRIPTIONAL COFACTOR GSAAEVMKKYCSTCDISFNYVKTYLAHKQFYCKNKP 36 T 0.0003 zf-met pdb F Eukaryota T 1ful 1 A A RGD PEPTIDE ISOMER-B ACDCRGDCFCG 11 T 0.48 Squash pdbhh F T 1fuv 1 A A RGD PEPTIDE ISOMER-A ACDCRGDCFCG 11 T 0.48 Squash pdbhh F T 1fvm 1 A,B,C,D,E,F A,B,C,D,E,F VANCOMYCIN XXNXXXX 7 T 95 P53_C pdbhh F F 1fvm 2 G,H,I,J,K,L G,H,I,J,K,L DI-ACETYL-LYS-D-ALA-D-ALA XXX 3 T 530 Sel1 pdbhh F F 1fy4 2 B B GLY-ALA-ARG GAR 3 T 360 AglB_L1 pdbhh F F 1fy5 2 B B GLY-ALA-LYS GAK 3 T 430 ASFV_360 pdbhh F F 1fyn 2 B B 3BP-2 PPAYPPPPVP 10 T 1.5 Med7 pdbhh F F 1fyr 2 E,F,G,H I,J,K,L MET_HUMAN HEPATOCYTE GROWTH FACTOR RECEPTOR PEPTIDE XXVNV 5 T 89 Peptidase_M43 pdbhh F Eukaryota F 1fzb 4 G,H G,H PEPTIDE LIGAND GPRG GPRP 4 T 65 SRCR_2 pdbhh F F 1fzc 4 G,H G,H FIBRIN GPRP 4 T 65 SRCR_2 pdbhh F F 1fzc 5 I,J I,J FIBRIN GHRP 4 T 14 VPS38 unphh F F 1fzf 4 G,H,I,J S,T,M,N FIBB_HUMAN FIBRINOGEN GHRP 4 T 14 VPS38 unphh F Eukaryota F 1fzg 4 G,H,I,J S,T,M,N FIBB_HUMAN FIBRINOGEN GHRP 4 T 14 VPS38 unphh F Eukaryota F 1g0y 2 B I ANTAGONIST PEPTIDE AF10847 ETPFTWEESNAYYWQPYALPL 21 T 0.41 PilJ_C pdbhh F T 1g1e 1 A A MAD1_HUMAN MAX DIMERIZATION PROTEIN RMNIQMLLEAADYLER 16 T 1.8 DUF6117 pdbhh F Eukaryota T 1g1f 2 B B TRI-PHOSPHORYLATED PEPTIDE FROM THE INSULIN RECEPTOR KINASE RDIXETDXXRK 11 T 4.9 Glyco_hydro_108 pdbhh F T 1g1g 2 B B MONO-PHOSPHORYLATED PEPTIDE FROM THE INSULIN RECEPTOR KINASE ETDYXRKGGKGLL 13 T 1.3 LEA_3 pdbhh F T 1g1h 2 B B BI-PHOSPHORYLATED PEPTIDE FROM THE INSULIN RECEPTOR KINASE ETDXXRKGGKGLL 13 T 1.3 LEA_3 pdbhh F T 1g1p 1 A A CO6A_CONER CONOTOXIN EVIA DDCIKPYGFCSLPILKNGLCCSGACVGVCADLX 33 T 0.018 Conotoxin unp F Eukaryota T 1g1s 2 C,D C,D SELPL_HUMAN PSGL-1 QATEYEYLDYDFLPETEPPRPMMDDDDK 28 T 7.6 Coilin_N pdbhh F Eukaryota T 1g1z 1 A A CO6A_CONER CONOTOXIN EVIA DDCIKPYGFCSLPILKNGLCCSGACVGVCADLX 33 T 0.018 Conotoxin unp F Eukaryota T 1g2g 1 A A CA1_CONIM ALPHA-CONOTOXIN IMI GCCSDPRCAWRCX 13 T 0.0098 Toxin_8 unphh F Eukaryota T 1g37 2 B B THROMBIN NONAPEPTIDE INHIBITOR FEAIPAEYL 9 T 0.45 Hirudin pdbhh F T 1g3x 2 M M N(ALPHA)-(9-ACRIDINOYL)-TETRAARGININE-AMIDE XRRRR 5 T 74 Gemini_AL2 pdbhh F F 1g65 15 CA,DA 3,4 EPOXOMICIN (peptide inhibitor) XXITX 5 T 840 DUF4597 pdbhh F F 1g6g 2 C,D E,F SER-LEU-GLU-VAL-TPO-GLU-ALA-ASPALA-THR-PHE-ALA-LYS SLEVTEADATFAK 13 T 14 TBK1_CCD1 pdbhh F T 1g6m 1 A A 3S1B2_NAJKA SHORT NEUROTOXIN 1 LECHNQQSSQTPTTTGCSGGENNCYKKEWRDNRGYRTERGCGCPSVKKGIGINCCTTDRCNN 62 T 0.032 Hyr1 pdbpercent F Eukaryota T 1g6r 5 E,J P,Q SIYR PEPTIDE SIYRYYGL 8 T 8.9 LEF-9 pdbhh F T 1g70 2 B B RSG-1.2 PEPTIDE DRRRRGSRPSGAERRRRRAAAA 22 T 9.5 BRD4_CDT pdbhh F T 1g7q 3 C P MUCIN 1, TRANSMEMBRANE SAPDTRPA 8 T 32 PNPase_C pdbhh F T 1g89 1 A A CTHL4_BOVIN INDOLICIDIN ILPWKWPWWPWRRX 14 T 0.12 CoV_S2 pdbhh F Eukaryota T 1g8c 1 A A CTHL4_BOVIN INDOLICIDIN ILPWKWPWWPWRRX 14 T 0.12 CoV_S2 pdbhh F Eukaryota T 1g92 1 A A POTX_PARCV PAC-TX FLPLLILGSLLMTPPVIQAIHDAQR 25 T 0.84 Viral_Beta_CD pdbhh F Eukaryota T 1g9m 1 A G ENV_HV1H2 ENVELOPE GLYCOPROTEIN GP120 GARSEVVLVNVTENFNMWKNDMVEQMHEDIISLWDQSLKPCVKLTPLCVGAGSCNTSVITQACPKVSFEPIPIHYCAPAGFAILKCNNKTFNGTGPCTNVSTVQCTHGIRPVVSTQLLLNGSLAEEEVVIRSVNFTDNAKTIIVQLNTSVEINCTGAGHCNISRAKWNNTLKQIASKLREQFGNNKTIIFKQSSGGDPEIVTHSFNCGGEFFYCNSTQLFNSTWFNSTWSTEGSNNTEGSDTITLPCRIKQIINMWQKVGKAMYAPPISGQIRCSSNITGLLLTRDGGNSNNESEIFRPGGGDMRDNWRSELYKYKVVKIE 321 T 4.5000000000000003E-23 GP120 pdb T Viruses T 1g9w 1 A A COLLAGEN-LIKE PEPTIDE PPGPPGPPG 9 T 0.46 EKLF_TAD1 pdbhh F F 1g9w 2 B,C B,C COLLAGEN-LIKE PEPTIDE PPGPPG 6 T 4.3 CbtA pdbhh F F 1ga1 2 B I FRAGMENT OF IODOTYROSTATIN XXX 3 T 530 zf-C2H2_11 pdbhh F F 1ga4 2 B I PSEUDOIODOTYROSTATIN XXX 3 T 170 GM130_C pdbhh F F 1ga6 2 B I FRAGMENT OF TYROSTATIN XYX 3 T 890 WW pdbhh F F 1gac 1 A,B A,B CELL WALL PENTAPEPTIDE AXKXX 5 T 230 OAM_dimer pdbhh F F 1gac 2 C,D C,D A82846B, A82846 FACTOR B, CHLOROEREMOMYCIN XXNXXXX 7 T 95 P53_C pdbhh F F 1gag 2 B B BISUBSTRATE PEPTIDE INHIBITOR PATGDFMNMSPVG 13 T 0.69 Glycoprot_B_PH1 pdbhh F T 1gbb 2 B P METHOXYSUCCINYL-ALA-ALA-PRO-ALANINE BORONIC ACID INHIBITOR XAAPX 5 T 730 Trp_leader1 pdbhh F F 1gbc 2 B P METHOXYSUCCINYL-ALA-ALA-PRO-LEUCINE BORONIC ACID INHIBITOR XAAPX 5 T 500 Suv3_C_1 pdbhh F F 1gbd 2 B P METHOXYSUCCINYL-ALA-ALA-PRO-PHENYLALANINE BORONIC ACID INHIBITOR XAAPX 5 T 170 DUF3054 pdbhh F F 1gbf 2 B P METHOXYSUCCINYL-ALA-ALA-PRO-ALANINE BORONIC ACID INHIBITOR XAAPX 5 T 730 Trp_leader1 pdbhh F F 1gbh 2 B P METHOXYSUCCINYL-ALA-ALA-PRO-LEUCINE BORONIC ACID INHIBITOR XAAPX 5 T 500 Suv3_C_1 pdbhh F F 1gbi 2 B P METHOXYSUCCINYL-ALA-ALA-PRO-PHENYLALANINE BORONIC ACID INHIBITOR XAAPX 5 T 170 DUF3054 pdbhh F F 1gbk 2 B P METHOXYSUCCINYL-ALA-ALA-PRO-ALANINE BORONIC ACID INHIBITOR XAAPX 5 T 730 Trp_leader1 pdbhh F F 1gbl 2 B P METHOXYSUCCINYL-ALA-ALA-PRO-LEUCINE BORONIC ACID INHIBITOR XAAPX 5 T 500 Suv3_C_1 pdbhh F F 1gbm 2 B P METHOXYSUCCINYL-ALA-ALA-PRO-PHENYLALANINE BORONIC ACID INHIBITOR XAAPX 5 T 170 DUF3054 pdbhh F F 1gbq 2 B B SOS1_MOUSE SOS-1 XVPPPVPPRRRX 12 T 4.2 Dscam_C pdbhh F Eukaryota F 1gbr 2 B B SOS2_MOUSE SOS-A PEPTIDE SPLLPKLPPKTYKRE 15 T 1.2 PHINT_rpt pdbhh F Eukaryota T 1gct 1 A A CTRA_BOVIN GAMMA-CHYMOTRYPSIN A CGVPAIQPVLSGL 13 T 2 CaM_bind pdbhh F Eukaryota T 1gct 4 D D TETRAPEPTIDE ADDUCT XPGAY 5 T 56 BsuPI pdbhh F F 1gdn 2 B B GLY-ALA-LYS GAK 3 T 430 ASFV_360 pdbhh F F 1gdq 2 B B GLY-ALA-ARG GAR 3 T 360 AglB_L1 pdbhh F F 1gdu 2 B B GLY-ALA-ARG GAR 3 T 360 AglB_L1 pdbhh F F 1geb 1 A A CPXA_PSEPU CYTOCHROME P450-CAM MTTETIQSNANLAPLPPHVPEHLVFDFDMYNPSNLSAGVQEAWAVLQESNVPDLVWTRCNGGHWIATRGQLIREAYEDYRHFSSECPFIPREAGEAYDFIPTSMDPPEQRQFRALANQVVGMPVVDKLENRIQELACSLIESLRPQGQCNFTEDYAEPFPIRIFMLLAGLPEEDIPHLKYLTDQMTRPDGSMTFAEAKEALYDYLIPIIEQRRQKPGTDAISIVANGQVNGRPITSDEAKRMCGLLLVGGLDIVVNFLSFSMEFLAKSPEHRQELIERPERIPAACEELLRRFSLVADGRILTSDYEFHGVQLKKGDQILLPQMLSGLDERENACPMHVDFSRQKVSHTTFGHGSHLCLGQHLARREIIVTLKEWLTRIPDFSIAPGAQIQHKSGIVSGVQALPLVWDPATTKAV 415 T 1.6E-05 p450 unppercent F Bacteria T 1gec 2 B I BENZYLOXYCARBONYL-LEUCINE-VALINE-GLYCINE-METHYLENE INHIBITOR XLVGX 5 T 550 MciZ pdbhh F F 1gff 1 A 1 VGF_BPG4 BACTERIOPHAGE G4 CAPSID PROTEINS GPF, GPG, GPJ SNVQTSADRVPHDLSHLVFEAGKIGRLKTISWTPVVAGDSFECDMVGAIRLSPLRRGLAVDSRVDIFSFYIPHRHIYGQQWINFMKDGVNASPLPPVTCSSGWDSAAYLGTIPSSTLKVPKFLHQGYLNIYNNYFKPPWSDDLTYANPSNMPSEDYKWGVRVANLKSIWTAPLPPDTRTSENMTTGTSTIDIMGLQAAYAKLHTEQERDYFMTRYRDIMKEFGGHTSYDGDNRPLLLMRSEFWASGYDVDGTDQSSLGQFSGRVQQTFNHKVPRFYVPEHGVIMTLAVTRFPPTHEMEMHYLVGKENLTYTDIACDPALMANLPPREVSLKEFFHSSPDSAKFKIAEGQWYRTQPDRVAFPYNALDGFPFYSALPSTDLKDRVLVNTNNYDEIFQSMQLAHWNMQTKFNINVYRHMPTTRDSIMTS 426 T 2.1E-69 Phage_F pdb T Viruses T 1gg6 1 A A CTRA_BOVIN GAMMA CHYMOTRYPSIN CGVPAIQPVL 10 T 1.7 CaM_bind pdbhh F Eukaryota T 1ggd 1 A A CTRA_BOVIN GAMMA CHYMOTRYPSIN CGVPAIQPVL 10 T 1.7 CaM_bind pdbhh F Eukaryota T 1gha 1 A E CTRA_BOVIN GAMMA-CHYMOTRYPSIN A CGVPAIQPVLSGL 13 T 2 CaM_bind pdbhh F Eukaryota T 1gha 4 D P PRO GLY VAL TYR PEPTIDE PGVY 4 T 24 DUF5625 pdbhh F F 1ghb 1 A E CTRA_BOVIN GAMMA-CHYMOTRYPSIN CGVPAIQPVLSGL 13 T 2 CaM_bind pdbhh F Eukaryota T 1ghb 4 D P PRO-GLY-ALA PGA 3 T 150 TDP43_N pdbhh F F 1ghg 1 A,B,C,D A,B,C,D VANCOMYCIN AGLYCON XXNXXXX 7 T 750 Antirestrict pdbhh F F 1gje 1 A A IGFBP-1 antagonist CRAGPLQWLCEKYFGX 16 T 2.3 DUF6497 pdbhh F T 1gjf 1 A A IGFBP-1 antagonist XRAGPLQWLAEKYQGX 16 T 9.2 Pico_P2B pdbhh F T 1gjg 1 A A IGFBP-1 antagonist XRPLQWLAEKYFQX 14 T 2.7 DUF5053 pdbhh F T 1gkt 2 B B INHIBITOR, H261 XHPFHXIH 8 T 2.4 DUF5372 pdbhh F F 1gmc 1 A E CTRA_BOVIN GAMMA-CHYMOTRYPSIN A CGVPAIQPVLSGL 13 T 2 CaM_bind pdbhh F Eukaryota T 1gmc 4 D B PRO GLY ALA TYR PEPTIDE PGAY 4 T 35 DUF5660 pdbhh F F 1gmd 1 A E CTRA_BOVIN GAMMA-CHYMOTRYPSIN A CGVPAIQPVLSGL 13 T 2 CaM_bind pdbhh F Eukaryota T 1gmd 4 D B PRO GLY ALA TYR ASP PEPTIDE PGAYD 5 T 32 DUF3288 pdbhh F F 1gmh 1 A E CTRA_BOVIN GAMMA-CHYMOTRYPSIN A CGVPAIQPVLSGL 13 T 2 CaM_bind pdbhh F Eukaryota T 1gmk 1 A,B,C,D A,B,C,D VALYL GRAMICIDIN XGAXAXVXWXWXWXWX 16 T 4.6 MAP17 pdbhh F F 1go6 1 A,C,E,G,I,J,K,L A,C,E,G,I,K,M,O BALHIMYCIN XXNXXXX 7 T 95 P53_C pdbhh F F 1go6 2 B,D,F,H B,D,F,H PEPTIDE LYS-DAL-DAL KXX 3 T 530 Sel1 pdbhh F F 1gq0 1 A A ANTIAMOEBIN I XFXXXXGLXXPQXPXPX 17 T 0.21 Pep_deformylase pdbhh F T 1grm 1 A,B A,B VALYL GRAMICIDIN XGAXAXVXWXWXWXWX 16 T 4.6 MAP17 pdbhh F F 1gtj 2 C,D 3,4 ALDEHYDE INHIBITOR XIAX 4 T 340 PD40 pdbhh F F 1gtl 2 C,D 3,4 ALDEHYDE INHIBITOR XIPX 4 T 77 G0-G1_switch_2 pdbhh F F 1gur 1 A _ GUR_GYMSY GURMARIN QQCVKKDELCIPYYLDCCEPLECKKVNWWDHKCIG 35 T 0.00036 Toxin_7 pdb F Eukaryota T 1gvk 1 A A N-AC-NPI-CO2H XNPI 4 T 110 TrmO pdbhh F F 1gvu 2 B I ANGT_BOVIN INHIBITOR, H189 PHPFHXVIHK 10 T 0.74 Ins134_P3_kin_N pdbhh F Eukaryota T 1gvx 2 B I INHIBITOR H256 PTEXRE 6 T 210 DUF5737 pdbhh F F 1gwk 1 A,B A,B Q9C171_PIREQ NCP1 MNVRATYTVIFKNASGLPNGYDNWGWGCTLSYYGGAMIINPQEGKYGAVSLKRNSGSFRGGSLRFDMKNEGKVKILVENSEADEKFEVETISPSDEYVTYILDVDFDLPFDRIDFQDAPGNGDRIWIKNLVHSTGSADDFVDPINLEHHHHHH 153 T 0.22 RNase_H pdbpssm F Eukaryota T 1gwl 1 A A Q9C171_PIREQ NCP1 MNVRATYTVIFKNASGLPNGYDNWGWGCTLSYYGGAMIINPQEGKYGAVSLKRNSGSFRGGSLRFDMKNEGKVKILVENSEADEKFEVETISPSDEYVTYILDVDFDLPFDRIDFQDAPGNGDRIWIKNLVHSTGSADDFVDPINLEHHHHHH 153 T 0.22 RNase_H pdbpssm F Eukaryota T 1gwm 1 A A Q9C171_PIREQ NCP1 MNVRATYTVIFKNASGLPNGYDNWGWGCTLSYYGGAMIINPQEGKYGAVSLKRNSGSFRGGSLRFDMKNEGKVKILVENSEADEKFEVETISPSDEYVTYILDVDFDLPFDRIDFQDAPGNGDRIWIKNLVHSTGSADDFVDPINLEHHHHHH 153 T 0.22 RNase_H pdbpssm F Eukaryota T 1gxc 2 B,D,F,H B,E,H,K SYNTHETIC PHOSPHOPEPTIDE RHFDTYLIRR 10 T 5.6 DUF4650 pdbhh F T 1gy3 3 E,F E,F SUBSTRATE PEPTIDE HHASPRK 7 T 9 DUF1324 pdbhh F T 1gyb 2 E,F,G,H E,F,G,H NUCLEOPORIN DSGFSFGSK 9 T 5.6 Peptidase_S9 pdbhh F F 1h0g 2 C,D C,D Argadin XXXHX 5 T 140 RsmF_methylt_CI pdbhh F F 1h0i 2 C,D C,D ARGIFIN XXXXX 5 T 130 DUF2015 pdbhh F F 1h24 3 E E E2F1_HUMAN E2F-1, PRB-BINDING PROTEIN E2F-1, PBR3, RETINOBLASTOMA-ASSOCIATED PROTEIN 1, RBAP-1 PVKRRLDLE 9 T 2.9 Humanin pdbhh F Eukaryota T 1h26 3 E E P53_HUMAN P53 RECRUITMENT PEPTIDE 11MER STSRHKKLMFK 11 T 29 DUF420 pdbhh F Eukaryota T 1h28 3 E,F E,F RBL1_HUMAN 107 KDA RETINOBLASTOMA-ASSOCIATED PROTEIN, PRB1, P107, P107 RECRUITMENT PEPTIDE 11MER AGSAKRRLFGE 11 T 0.74 PPV_E1_N pdbhh F Eukaryota T 1h3h 2 B B LCP2_HUMAN SLP-76 APSIDRSTKPA 11 T 39 TagF_N pdbhh F Eukaryota T 1h6e 2 B P CTLA4_HUMAN CYTOTOXIC T-LYMPHOCYTE-ASSOCIATED ANTIGEN 4, CTLA-4, CD152 ANTIGEN TTGVYVKMPPT 11 T 0.2 TMEM190 unppssm F Eukaryota T 1h9l 1 A A PEPTIDE INHIBITOR XVEPI 5 T 150 LAT2 pdbhh F F 1h9o 2 B B PGFRB_HUMAN BETA-PLATELET-DERIVED GROWTH FACTOR RECEPTOR XVPML 5 T 30 DapH_N pdbhh F Eukaryota F 1ha8 1 A A MER23_EUPRA ER-23 GECEQCFSDGGDCTTCFNNGTGPCANCLAGYPAGCSNSDCTAFLSQCYGGC 51 T 1.1 DUF3716 pdbhh F Eukaryota T 1haa 2 B B HIGH AFFINITY PEPTIDE WRYYESSLEPYPD 13 T 9.4 DUF1489 pdbhh F T 1haj 2 B B HIGH AFFINITY PEPTIDE WRYYESSLEPYPD 13 T 9.4 DUF1489 pdbhh F T 1hax 1 A A BCM7 VEPI 4 T 89 LAT2 pdbhh F F 1haz 1 A A BCM7 VEPX 4 T 320 Arr-ms pdbhh F F 1hbt 3 C I P596 Inhibitor peptide XPXGGGGDYEPIPEEAXX 18 T 0.05 Hirudin pdbhh F T 1hc9 3 C,D C,D HIGH AFFINITY PEPTIDE WRYYESSLLPYPD 13 T 4 Cys_rich_VLP pdbhh F T 1hcs 1 A A ACETYL-PYEEIE-OH XXEEIE 6 T 88 TMEM171 pdbhh F F 1hct 1 A A ACETYL-PYEEIE-OH XXEEIE 6 T 88 TMEM171 pdbhh F F 1hcw 1 A _ BBA1 XYTVPSXTFSRSDELAKLLRLHAGX 25 T 11 DUF3196 pdbhh F T 1hd9 1 A A BOWMAN-BIRK INHIBITOR DERIVED PEPTIDE XCTASIPPQCY 11 T 0.02 Bowman-Birk_leg pdb F T 1hef 2 B I SKF 108738 PEPTIDE INHIBITOR AAXVX 5 T 2000 HHH pdbhh F F 1hes 2 B P LYAM3_HUMAN P-SELECTIN PEPTIDE SHLGTYGVFTNAAFDPSP 18 T 1 YoaP pdbhh F Eukaryota T 1hff 1 A A VMI2_HHV8P VMIP-II, VMIP-1B LGASWHRPDK 10 T 5 Apc15p pdbhh T Viruses T 1hgv 1 A A CAPSD_BPH75 PH75 BACTERIOPHAGE MAJOR COAT PROTEIN MDFNPSEVASQVTNYIQAIAAAGVGVLALAIGLSAAWKYAKRFLKG 46 T 0.00018 Phage_Coat_B pdb T Viruses T 1hgz 1 A A CAPSD_BPH75 PH75 BACTERIOPHAGE MAJOR COAT PROTEIN MDFNPSEVASQVTNYIQAIAAAGVGVLALAIGLSAAWKYAKRFLKG 46 T 0.00018 Phage_Coat_B pdb T Viruses T 1hh0 1 A A CAPSD_BPH75 PH75 BACTERIOPHAGE MAJOR COAT PROTEIN MDFNPSEVASQVTNYIQAIAAAGVGVLALAIGLSAAWKYAKRFLKG 46 T 0.00018 Phage_Coat_B pdb T Viruses T 1hh3 1 A,B,C,D A,B,C,D M86-1410 XXNXXXX 7 T 95 P53_C pdbhh F F 1hh6 3 C C PEP-4 DATPEDLGARL 11 T 4.8 CdiI pdbhh F T 1hh9 3 C C PEP-2 DATPEDLNAKLX 12 T 6.6 DUF6489 pdbhh F T 1hha 1 A,B,C,D A,B,C,D M86-1410 XXNXXXX 7 T 95 P53_C pdbhh F F 1hhc 1 A,B,C,D A,B,C,D M86-1410 XXNXXXX 7 T 95 P53_C pdbhh F F 1hhf 1 A,B,C,D A,B,C,D M86-1410 XXNXXXX 7 T 95 P53_C pdbhh F F 1hhj 3 C,F C,F POL_HV1MA HIV-1 REVERSE TRANSCRIPTASE (RESIDUES 309-317) ILKEPVHGV 9 T 0.56 DUF2115 pdbhh T Viruses T 1hhn 1 A A CALR_RAT CALRETICULIN SKKIKDPDAAKPEDWDERAKIDDPTDSKPEDWDKPEHIPDPDAKKPEDWDEEMDGEWEPPVIQNPEYKGEWKPRQIDNPDYKGTWIHPEIDNPEYSPDANI 101 T 2.1E-21 Calreticulin unp F Eukaryota T 1hhu 1 A,B,C,D A,B,C,D BALHIMYCIN XXNXXXX 7 T 95 P53_C pdbhh F F 1hhy 1 A,B A,B DEGLUCOBALHIMYCIN XXNXXXX 7 T 95 P53_C pdbhh F F 1hhz 1 A,B,C A,B,C DEGLUCOBALHIMYCIN XXNXXXX 7 T 95 P53_C pdbhh F F 1hhz 2 D,E,F D,E,F CELL WALL PEPTIDE XXKXX 5 T 230 OAM_dimer pdbhh F F 1hi6 3 C C PEPTIDE 5 DATPEWLGARLX 12 T 4.1 Birna_VP3 pdbhh F T 1hin 3 C P INFLUENZA HEMAGGLUTININ HA1 (STRAIN X47) (RESIDUES 100-107) YDVPDYAS 8 T 7.1 DUF4535 pdbhh F T 1hja 1 A A CTRA_BOVIN ALPHA-CHYMOTRYPSIN CGVPAIQPVLSGL 13 T 2 CaM_bind pdbhh F Eukaryota T 1hjk 1 A,B A,B PPB_ECOLI ALKALINE PHOSPHATASE TPEMPVLENRAAQGDITAPGGARRLTGDQTAALRDSLSDKPAKNIILLIGDGMGDSEITAARNYAEGAGGFFKGIDALPLTGQYTHYALNKKTGKPDYVTDSAASATAWSTGVKTYNGALGVDIHEKDHPTILEMAKAAGLATGNVSTAELQDATPAALVAHVTSRKCYGPSATSEKCPGNALEKGGKGSITEQLLNARADVTLGGGAKTFAETATAGEWQGKTLREQAQARGYQLVSDAASLNSVTEANQQKPLLGLFADGNMPVRWLGPKATYHGNIDKPAVTCTPNPQRNDSVPTLAQMTDKAIELLSKNEKGFFLQVEGASIDKQDQAANPCGQIGETVDLDEAVQRALEFAKKEGNTLVIVTADHAHASQIVAPDTKAPGLTQALNTKDGAVMVMSYGNSEEDSQEHTGSQLRIAAYGPHAANVVGLTDQTDLFYTMKAALGLK 449 T 1.8E-10 Alk_phosphatase pdbpssm F Bacteria T 1hl3 2 B B PRO-ILE-ASP-LEU-SER-LYS-LYS PEPTIDE PIDLSKK 7 T 2.3 NRIP1_repr_2 pdbhh F T 1hne 2 B I METHOXYSUCCINYL-ALA-ALA-PRO-ALA CHLOROMETHYL KETONE INHIBITOR XAAPXX 6 T 950 A_amylase_inhib pdbhh F F 1hoy 2 B B MIMOTOPE OF THE NICOTINIC ACETYLCHOLINE RECEPTOR HRYYESSLEPWYPD 14 T 4.1 NDUF_C2 pdbhh F T 1hpg 2 B B BOC-ALA-ALA-PRO-GLU PEPTIDE XAAPE 5 T 670 Stanniocalcin pdbhh F F 1hqa 1 A,B A,B PPB_ECOLI ALKALINE PHOSPHATASE TPEMPVLENRAAQGDITAPGGARRLTGDQTAALRDSLSDKPAKNIILLIGDGMGDSEITAARNYAEGAGGFFKGIDALPLTGQYTHYALNKKTGKPDYVTDSAASATAWSTGVKTYNGALGVDIHEKDHPTILEMAKAAGLATGNVSTAELQDATPAALVAHVTSRKCYGPSATSEKCPGNALEKGGKGSITEQLLNARADVTLGGGAKTFAETATAGEWQGKTLREQAQARGYQLVSDAASLNSVTEANQQKPLLGLFADGNMPVRWLGPKATYHGNIDKPAVTCTPNPQRNDSVPTLAQMTDKAIELLSKNEKGFFLQVEGASIDKQDHAANPCGQIGETVDLDEAVQRALEFAKKEGNTLVIVTADHAHASQIVAPDTKAPGLTQALNTKDGAVMVMSYGNSEEDSQEQTGSQLRIAAYGPHAANVVGLTDQTDLFYTMKAALGLK 449 T 5.4E-10 Alk_phosphatase pdbpercent F Bacteria T 1hqj 1 A,B,C,D,E,F,G,H,I,J,K,L A,B,C,D,E,F,G,H,I,J,K,L SIN-ASP-GLU-LEU-GLU-ALA-ARG-ILE-ARG-GLU-LEU-GLU-ALA-ARG-ILE-LYS-NH2 DELERRIRELEARIK 15 T 0.024 DUF1192 pdbhh F T 1hqq 2 B,D,F,H E,F,G,H MINI-PROTEIN 2 RCCHPQCGAVEECR 14 T 0.74 Enterotoxin_ST pdbhh F T 1hqw 2 B B YPY YPY 3 T 42 LINES_C pdbhh F F 1hr1 1 A A CTHL4_BOVIN INDOLICIDIN ILAWKWAWWAWRRX 14 T 0.21 SAG unp F Eukaryota T 1hr3 1 A,B,C A,B,C HEMERYTHRIN XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 118 F F F 1hr8 3 I,J,K,L O,P,Q,R COX4_YEAST COX4 LSLRQSIRFFKPATRTLCSSRYLL 24 T 9.1 OTCace_N pdbhh F Eukaryota T 1hr9 3 I,J,K,L O,P,Q,R MDHM_YEAST MDH1 LSRVAKRA 8 T 20 Peptidase_S58 pdbhh F Eukaryota T 1hsa 3 C,F C,F MODEL PEPTIDE SEQUENCE - ARAAAAAAA ARAAAAAAA 9 T 270 DUF6324 pdbhh F F 1hsb 3 C C BOUND PEPTIDE FRAGMENT AVA 3 T 860 zf-CCHC pdbhh F F 1hte 2 C C HIV-1 PROTEASE LQES 4 T 210 DUF1118 pdbhh F F 1htx 1 A A IAAI_AMAHP ALPHA-AMYLASE INHIBITOR AAI CIPKWNRCGPKMDGVPCCEPYTCTSDYYGNCS 32 T 0.022 Toxin_12 pdbpssm F Eukaryota T 1hu5 1 A A OVISPIRIN-1 KNLRRIIRKIIHIIKKYG 18 T 1.1 Lambda_CIII pdbhh F T 1hu6 1 A A G10 NOVISPIRIN KNLRRIIRKGIHIIKKYG 18 T 1.6 YabA pdbhh F T 1hu7 1 A A T7 NOVISPIRIN KNLRRITRKIIHIIKKYG 18 T 2.3 Lambda_CIII pdbhh F T 1hvz 1 A A RTD-1 GFCRCLCRRGVCRCICTR 18 T 0.63 DUF5354 pdbhh F T 1hxl 2 C,D C,D MINI-PROTEIN 2 RCCHPQCGMAEECR 14 T 0.56 Cys_rich_CWC pdbhh F T 1hxz 2 C,D C,D MINI-PROTEIN 2 RCCHPQCGMVEECR 14 T 0.64 Cys_rich_CWC pdbhh F T 1hy2 2 E,F,G,H E,F,G,H MINI-PROTEIN 1 CCHPQCGAAYSC 12 T 0.074 Enterotoxin_ST pdbhh F T 1i1f 3 C,F C,F I1F FLKEPVHGV 9 T 6.9 DUF2115 pdbhh F T 1i1y 3 C,F C,F I1Y YLKEPVHGV 9 T 8.3 DUF2115 pdbhh F T 1i2v 1 A A DEFN_HELVI DEFENSIN HELIOMICIN DKLIGSCVWGAVNYTSDCNGECLLRGYKGGHCGSFANVNCWCET 44 T 0.00019 Toxin_3 unppssm F Eukaryota T 1i31 2 B P A8IP97_RAT EGFR FYRALM 6 T 0.2 GcnA_N pdbhh F Eukaryota T 1i3w 2 E,F,G,H E,F,G,H DACTINOMYCIN TXPXXXTXPXX 11 T 61 DUF5572 pdbhh F F 1i3z 2 B B SLAF1_HUMAN SIGNALING LYMPHOCYTIC ACTIVATION MOLECULE VEKKSLTIXAQVQK 14 T 0.1 MFS_1 unppssm F Eukaryota T 1i5h 2 B B SCNNB_RAT RENAL BP2 PEPTIDE GSTLPIPGTPPPNYDSL 17 T 0.14 Myc_target_1 pdbhh F Eukaryota T 1i6y 1 A A ION-SELECTIVE LIGAND A1 XCRVVRGDYLDCX 13 T 0.96 YedD pdbhh F T 1i73 2 B B THREE RESIDUE PEPTIDE INHIBITOR PLX 3 T 23 Thyroglob_assoc pdbhh F F 1i7a 2 E E PHE-ALA-PHE FAF 3 T 180 CathepsinC_exc pdbhh F F 1i7r 3 C,F C,F 9 RESIDUE PEPTIDE FAPGFFPYL 9 T 2.7 LINES_C pdbhh F T 1i7t 3 C,F C,F 9 RESIDUE PEPTIDE ALWGVFPVL 9 T 0.95 PK_C pdbhh F T 1i7u 3 C,F C,F 9 RESIDUE PEPTIDE ALWGFVPVL 9 T 0.23 Tom7 pdbhh F T 1i8e 1 A A ION-SELECTIVE LIGAND A22 XCYCSLRGDCYCX 13 T 3.3 CRAM_rpt pdbhh F T 1i8g 1 A A MPIP3_XENLA M-PHASE INDUCER PHOSPHATASE 3 EQPLTPVTDL 10 T 12 DUF4636 pdbhh F Eukaryota T 1i8h 1 A A TAU_HUMAN PHF-TAU KVSVVRTPPKSPS 13 T 13 Disulph_isomer pdbhh F Eukaryota T 1i8i 3 C C EPIDERMAL GROWTH FACTOR RECEPTOR, EGFRVIII PEPTIDE ANTIGEN EEKKGNYVVTDH 12 T 1.1 MFA1_2 pdbhh F T 1i8k 3 C C EPIDERMAL GROWTH FACTOR RECEPTOR, EGFRVIII PEPTIDE ANTIGEN EEKKGNYVVTDH 12 T 1.1 MFA1_2 pdbhh F T 1i8n 1 A,B,C A,B,C LAPP_HAEOF ANTI-PLATELET PROTEIN QDEDAGGAGDETSEGEDTTGSDETPSTGGGGDGGNEETITAGNEDCWSKRPGWKLPDNLLTKTEFTSVDECRKMCEESAVEPSCYILQINTETNECYRNNEGDVTWSSLQYDQPNVVQWHLHACSK 126 T 0.0023 PAN_1 pdbpssm F Eukaryota T 1i93 1 A A ION-SELECTIVE LIGAND D16 XCHWLRGDMRRCX 13 T 3.5 DNA_photolyase pdbhh F T 1i98 1 A A ION-SELECTIVE LIGAND D18 XCRWLRGDWRQCX 13 T 1.8 PTN_MK_C pdbhh F T 1i9f 2 B B RSG-1.2 PEPTIDE RRGSRPSGAERRRRRAAAA 19 T 7.6 BRD4_CDT pdbhh F T 1iau 2 B B ACE-ILE-GLU-PRO-ASP-CHO XIEPX 5 T 430 DUF4035 pdbhh F F 1ibc 3 C C PEPTIDE ACE-TRP-GLU-HIS-ASA XWEHX 5 T 74 NPCC pdbhh F F 1ic9 1 A A TH10AOX SKYEYTIXSYTFRGPGCPTLKPXITVRCE 29 T 1.1 DUF4360 pdbhh F T 1ice 2 B T TETRAPEPTIDE ALDEHYDE XYVAX 5 T 220 MRJP pdbhh F F 1icl 1 A A TH1OX SKYEYTVXSYTFRGPGCPTVKPXISLRCE 29 T 2.2 DUF4360 pdbhh F T 1ico 1 A A TH10BOX SKYEYTIXSYTFRGPGCPTVKPXVTIRCE 29 T 1.4 DUF4360 pdbhh F T 1id6 1 A A SYR6 SVQARWEAAFDLDLY 15 T 2.5 DUF3841 pdbhh F T 1id7 1 A A SYR6 SVQARWEAAFDLDLY 15 T 2.5 DUF3841 pdbhh F T 1ieo 1 A A CT1B_CONMR PROTEIN MRIB-NH2 VGVCCGYKLCHPCX 14 T 0.47 Oxidored-like unphh F Eukaryota T 1ifh 3 C P INFLUENZA HEMAGGLUTININ HA1 (STRAIN X47) (RESIDUES 101-107) XDVPDYAS 8 T 10 AbiTii pdbhh F T 1igw 1 A,B,C,D A,B,C,D ACEA_ECOLI ISOCITRASE, ISOCITRATASE, ICL MKTRTQQIEELQKEWTQPRWEGITRPYSAEDVVKLRGSVNPECTLAQLGAAKMWRLLHGESKKGYINSLGALTGGQALQQAKAGIEAVYLSGWQVAADANLAASMYPDQSLYPANSVPAVVERINNTFRRADQIQWSAGIEPGDPRYVDYFLPIVADAEAGFGGVLNAFELMKAMIEAGAAAVHFEDQLASVKKCGHMGGKVLVPTQEAIQKLVAARLCADVTGVPTLLVARTDADAADLITSDCDPYDSEFITGERTSEGFFRTHAGIEQAISRGLAYAPYADLVWCETSTPDLELARRFAQAIHAKYPGKLLAYNCSPSFNWQKNLDDKTIASFQQQLSDMGYKFQFITLAGIHSMWFNMFDLANAYAQGEGMKHYVEKVQQPEFAAAKDGYTFVSHQQEVGTGYFDKVTTIIQGGTSSVTALTGSTEESQF 434 T 2.3E-49 ICL pdb F Bacteria T 1ih9 1 A A ZERVAMICIN IIB XWIQXITXLXPQXPXPX 17 T 25 bpX0 pdbhh F T 1ihj 2 C,D C,D NORPA GKTEFCA 7 T 0.054 cobW pdbhh F T 1iid 2 B O Octapeptide GLYASKLA GLYASKLA 8 T 8 RLAN pdbhh F T 1iij 1 A A ERBB2_RAT ERBB-2 RECEPTOR PROTEIN-TYROSINE KINASE EQRASPVTFIIATVVGVLLFLILVVVVGILIKRRR 35 T 0.0014 Mucin15 pdbhh F Eukaryota T 1ikf 3 C C CYCLOSPORINE, CICLOSPORIN, CICLOSPORINE XXXXXXXXVXA 11 T 9.7 DUF6090 pdbhh F F 1ilp 2 C C CXCR1_HUMAN CXCR-1,CDW128A,HIGH AFFINITY INTERLEUKIN-8 RECEPTOR A,IL-8R A,IL-8 RECEPTOR TYPE 1 XMWDFDDXMPPADEDYSPX 19 T 0.01 FA_desaturase unppercent F Eukaryota T 1ilq 2 C C CXCR1_HUMAN IL8-RA XMWDFDDXMPPADEDYSPX 19 T 0.01 FA_desaturase unppercent F Eukaryota T 1ilx 1 A,J A,J PHOTOSYSTEM II: SUBUNIT PSBA XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 169 F F F 1ilx 2 B,K B,K PHOTOSYSTEM II: SUBUNIT PSBD XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 174 F F F 1ilx 3 C,L C,L PHOTOSYSTEM II: SUBUNIT PSBC XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 156 F F F 1ilx 4 D,M D,M PHOTOSYSTEM II: SUBUNIT PSBB XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 155 F F F 1ilx 5 E,N E,N PHOTOSYSTEM II: SUBUNIT PSBE XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 40 F F F 1ilx 6 F,O F,O PHOTOSYSTEM II: SUBUNIT PSBF XXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 30 F F F 1ilx 7 G,P G,P PHOTOSYSTEM II: SUBUNIT UNKNOWN XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 312 F F F 1ilx 8 H,Q H,Q PHOTOSYSTEM II: SUBUNIT PSBO XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 115 F F F 1ilx 9 I,R I,R PHOTOSYSTEM II: SUBUNIT PSBV XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 87 F F F 1im1 1 A _ CA1_CONIM ALPHA-CONOTOXIN IM1 GCCSDPRCAWRC 12 T 0.0098 Toxin_8 unphh F Eukaryota T 1im9 3 C,G C,G HLA-Cw4-specific peptide QYDDAVYKL 9 T 22 Cas_Cas02710 pdbhh F T 1imi 1 A A CA1_CONIM PROTEIN (ALPHA-CONOTOXIN IMI) GCCSDPRCAWRCX 13 T 0.0098 Toxin_8 unphh F Eukaryota T 1imw 1 A A IGFBP-1 antagonist CRAGPLQWLCEKYFGX 16 T 2.3 DUF6497 pdbhh F T 1in2 1 A A IGFBP-1 antagonist XRAGPLQWLAEKYQGX 16 T 9.2 Pico_P2B pdbhh F T 1in3 1 A A IGFBP-1 antagonist XRPLQWLAEKYFQX 14 T 2.7 DUF5053 pdbhh F T 1io6 2 B B A LIGAND PEPTIDE RHYRPLPPLP 10 T 1.6 DUF5323 pdbhh F F 1iq5 2 B B KKCC_CAEEL CA2+/CALMODULIN DEPENDENT KINASE KINASE VRVIPRLDTLILVKAMGHRKRFGNPFR 27 T 5.7 HCNGP pdbhh F Eukaryota T 1ir3 2 B B PEPTIDE SUBSTRATE KKKLPATGDYMNMSPVGD 18 T 0.064 Gram_pos_anchor pdb F T 1irs 2 B B IL4RA_HUMAN IL-4 RECEPTOR PHOSPHOPEPTIDE LVIAGNPAXRS 11 T 0.91 DUF1890 pdbhh F Eukaryota T 1is0 2 C,D C,D AY0 GLU GLU ILE peptide XEEI 4 T 340 B5 pdbhh F F 1isq 2 B B RFCL_PYRFU replication factor C large subunit XKQATLFDFLKK 12 T 0.018 Peptidase_C37 unppercent F Archaea T 1itt 1 A A COLLAGEN TRIPLE HELIX GPPGPPG 7 T 6.8 Milton pdbhh F F 1itt 2 B B COLLAGEN TRIPLE HELIX PGPPGPP 7 T 0.67 EKLF_TAD1 pdbhh F F 1itt 3 C C COLLAGEN TRIPLE HELIX PPGPPGP 7 T 0.48 DUF374 pdbhh F F 1ivi 1 A,B,C,D,E A,D,B,E,C dihydrolipoamide dehydrogenase XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 478 F F F 1iw4 1 A A ITRP_HALRO trypsin inhibitor AHMDCTEFNPLCRCNKMLGDLICAVIGDAKEEHRNMCALCCEHPGGFEYSNGPCE 55 T 5.1 DUF5913 pdbhh F Eukaryota T 1iwk 1 A A CPXA_PSEPU CYTOCHROME P450-CAM MTTETIQSNANLAPLPPHVPEHLVFDFDMYNPSNLSAGVQEAWAVLQESNVPDLVWTRCNGGHWIATRGQLIREAYEDYRHFSSECPFIPREAGEAYDFIPTSMDPPEQRQFKALANQVVGMPVVDKLENRIQELACSLIESLRPQGQCNFTEDYAEPFPIRIFMLLAGLPEEDIPHLKYLTDQMTRPDGSMTFAEAKEALYDYLIPIIEQRRQKPGTDAISIVANGQVNGRPITSDEAKRMCGLLLVGGLDTVVNFLSFSMEFLAKSPEHRQELIERPERIPAACEELLRRFSLVADGRILTSDYEFHGVQLKKGDQILLPQMLSGLDERENACPMHVDFSRQKVSHTTFGHGSHLCLGQHLARREIIVTLKEWLTRIPDFSIAPGAQIQHKSGIVSGVQALPLVWDPATTKAV 415 T 1.6E-05 p450 unppercent F Bacteria T 1iyc 1 A A SCAB_ORYRH scarabaecin ELPKLPDDKVLIRSRSNCPKGKVWNGFDCKSPFAFS 36 T 0.083 DUF5615 unp F Eukaryota T 1ize 2 B B Pepstatin XVVXAX 6 T 1700 FAM60A pdbhh F F 1izl 7 G,U G,R Photosystem II: Subunit PsbG XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 220 F F F 1izl 8 H,V H,S Photosystem II: Subunit PsbH XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 33 F F F 1izl 9 I,W I,T Photosystem II: Subunit PsbI XXXXXXXXXXXXXXXXXXXXXXXXXX 26 F F F 1izl 11 K,Y O,Y Photosystem II: Subunit PsbO XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 205 F F F 1izl 12 L,Z U,Z Photosystem II: Subunit PsbU XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 97 F F F 1izl 14 BA,N 1,X Photosystem II: Subunit PsbX XXXXXXXXXXXXXXXXXXXXXXXXX 25 F F F 1j19 2 B B ICAM2_MOUSE ICAM-2 CYTOPLASMIC PEPTIDE, ICAM-2 CYTOPLASMIC TAIL RRRTGTYGVLAAWRRL 16 T 0.28 DUF4231 unppercent F Eukaryota T 1j4l 2 B P RAD9_YEAST DNA REPAIR PROTEIN RAD9 EVELTQELP 9 T 8.5 SidE_DUB pdbhh F Eukaryota T 1j4m 1 A A MBH12 RGKWTYNGITYEGR 14 T 3.8 DUF4923 pdbhh F T 1j4p 2 B B RAD9_YEAST DNA REPAIR PROTEIN RAD9 KKMTFQTPTDPLE 13 T 11 YugN pdbhh F Eukaryota T 1j4q 2 B B RAD9_YEAST DNA REPAIR PROTEIN RAD9 SLEVTEADATFVQ 13 T 41 Myticin-prepro pdbhh F Eukaryota T 1j4x 2 B D DDE(AHP)(TPO)G(PTR)VATR DDEXTGXVATR 11 T 2.8 BioW pdbhh F T 1j51 1 A,B,C,D A,B,C,D CPXA_PSEPU CAMPHOR 5-MONOOXYGENASE TTETIQSNANLAPLPPHVPEHLVFDFDMYNPSNLSAGVQEAWAVLQESNVPDLVWTRCNGGHWIATRGQLIREAYEDYRHFSSECPWIPREAGEAFDFIPTSMDPPEQRQFRALANQVVGMPVVDKLENRIQELACSLIESLRPQGQCNFTEDYAEPFPIRIFMLLAGLPEEDIPHLKYLTDQMTRPDGSMTFAEAKEALYDYLIPIIEQRRQKPGTDAISIVANGQVNGRPITSDEAKRMCGLLLLGGLDTVVNFLSFSMEFLAKSPEHRQELIERPERIPAACEELLRRFSLVADGRILTSDYEFHGVQLKKGDQILLPQMLSGLDERENAAPMHVDFSRQKVSHTTFGHGSHLCLGQHLARREIIVTLKEWLTRIPDFSIAPGAQIQHKSGIVSGVQALPLVWDPATTKAV 414 T 1.6E-05 p450 unppercent F Bacteria T 1j5b 1 A A ANPA_PSEAM Antifreeze protein type 1 analogue DVASDAKAAAELVAANAKAAAELVAANAKAAAEAVARX 38 T 9.3 DUF3157 unppssm F Eukaryota T 1j5l 1 A A MT1_HOMAM CUMT-1 PCEKCTSGCKCPSKDECAKTCSKPCSCCPT 30 T 1.5 Metallothio_5 pdbhh F Eukaryota T 1j5m 1 A A MT1_HOMAM CUMT-1 PGPCCKDKCECAEGGCKTGCKCTSCRCA 28 T 0.53 Metallothio_5 pdbhh F Eukaryota T 1j71 2 B B Tetrapeptide Thr-Ile-Thr-Ser TITS 4 T 330 DUF1308 pdbhh F F 1j9n 3 C C peptide ACE-LYS-TRP-LYS-HSE-ALA XKWKXA 6 T 46 Surface_antigen pdbhh F F 1jac 2 B,D,F,H B,D,F,H LECB1_ARTIN JACKFRUIT AGGLUTININ NEQSGKSQTVIVGSWGAKVS 20 T 2.9 DUF3842 pdbhh F Eukaryota T 1jan 2 B I PRO-LEU-GLY-HYDROXYLAMINE INHIBITOR PLGX 4 T 170 Prefoldin_3 pdbhh F F 1jap 2 B I PRO-LEU-GLY-HYDROXYLAMINE PLGX 4 T 170 Prefoldin_3 pdbhh F F 1jbd 2 B B MIMOTOPE OF THE NICOTINIC ACETYLCHOLINE RECEPTOR HRYYESSLEPWYPD 14 T 4.1 NDUF_C2 pdbhh F T 1jbf 1 A A IGE06 XNLPRCTEGPWGWVCM 16 T 2.2 Mss4 pdbhh F T 1jbl 1 A A SFTI1_HELAN SFTI-1 GRCTKSIPPICFPD 14 T 0.0029 Bowman-Birk_leg pdb F Eukaryota T 1jbn 1 A A SFTI1_HELAN SFTI-1 GRCTKSIPPICFPD 14 T 0.0029 Bowman-Birk_leg pdb F Eukaryota T 1jbr 4 D,E A,B RNMG_ASPRE RIBONUCLEASE MITOGILLIN ATWTCINQQLNPKTNKWEDKRLLYSQAKAESNSHHAPLSDGKTGSSYPHWFTNGYDGNGKLIKGRTPIKFGKADCDRPPKHSQNGMGKDDHYLLEFPTFPDGHDYKFDSKKPKEDPGPARVIYTYPNKVFCGIVAHQRGNQGDLRLCSH 149 T 46 Cuticle_2 pdbhh F Eukaryota T 1jbs 2 C,D A,B RNMG_ASPRE RIBONUCLEASE MITOGILLIN ATWTCINQQLNPKTNKWEDKRLLYSQAKAESNSHHAPLSDGKTGSSYPHWFTNGYDGNGKLIKGRTPIKFGKADCDRPPKHSQNGMGKDDHYLLEFPTFPDGHDYKFDSKKPKEDPGPARVIYTYPNKVFCGIVAHQRGNQGDLRLCSH 149 T 46 Cuticle_2 pdbhh F Eukaryota T 1jbt 2 C,D A,B RNMG_ASPRE RIBONUCLEASE MITOGILLIN ATWTCINQQLNPKTNKWEDKRLLYSQAKAESNSHHAPLSDGKTGSSYPHWFTNGYDGNGKLIKGRTPIKFGKADCDRPPKHSQNGMGKDDHYLLEFPTFPDGHDYKFDSKKPKEDPGPARVIYTYPNKVFCGIVAHQRGNQGDLRLCSH 149 T 46 Cuticle_2 pdbhh F Eukaryota T 1jbu 3 C X Peptide exosite inhibitor A-183 EEWEVLCWTWETCER 15 T 0.65 Rad10 pdbhh F T 1jcr 3 C C SYNTHETIC TETRAPEPTIDE CVFM CVFM 4 T 53 DUF5841 pdbhh F F 1jcs 3 C C SYNTHETIC HEXAPEPTIDE TKCVFM TKCVFM 6 T 2.4 Plk4_PB2 pdbhh F T 1jd2 15 CA,DA 8,9 TMC-95A inhibitor XYNXX 5 T 48 Whirly pdbhh F F 1jd5 2 B B GRIM_DROME cell death protein GRIM AIAYFIPDQA 10 T 0.61 DUF5521 unppercent F Eukaryota T 1jd6 2 B B HID_DROME head involution defective protein AVPFYLPEGG 10 T 0.62 DUF4367 pdbhh F Eukaryota T 1jdk 1 A A ACETYL GROUP XIWGESGKLIXTTA 14 T 0.038 GP41 pdbhh F T 1je9 1 A A 3S1C_NAJKA SHORT NEUROTOXIN 1 - MONOCLED COBRA LECHNQQSSQAPTTKTCSGETNCYKKWWSDHRGTIIERGCGCPKVKPGVNLNCCRTDRCNN 61 T 0.038 Toxin_TOLIP pdb F Eukaryota T 1jeg 2 B B PTN22_MOUSE HEMATOPOIETIC CELL PROTEIN-TYROSINE PHOSPHATASE 70Z-PEP SRRTDDEIPPPLPERTPESFIVVEE 25 T 6 DUF6436 pdbhh F Eukaryota T 1jet 2 B B PEPTIDE LYS ALA LYS KAK 3 T 500 VGCC_beta4Aa_N pdbhh F F 1jeu 2 B B PEPTIDE LYS GLU LYS KEK 3 T 660 CCT pdbhh F F 1jev 2 B B PEPTIDE LYS TRP LYS KWK 3 T 25 CHDNT pdbhh F F 1jg3 2 C,D C,D VYP(L-iso-ASP)HA VYPXHA 6 T 3.1 DUF4140 pdbhh F T 1jgd 3 C C peptide s10R RRLLRGHNQY 10 T 11 DUF2570 pdbhh F T 1jge 3 C C peptide m9 GRFAAAIAK 9 T 6.5 Ribosomal_L13 pdbhh F T 1jjg 1 A A Q9Q8E9_MYXVL M156R MTVIKPSSRPRPRKNKNIKVNTYRTSAMDLSPGSVHEGIVYFKDGIFKVRLLGYEGHECILLDYLNYRQDTLDRLKERLVGRVIKTRVVRADGLYVDLRRFF 102 T 1.6 RNase_II_C_S1 pdbhh T Viruses T 1jky 2 B B MP2K2_HUMAN MAPKK2, MEK2 MLARRKPVLPALTINP 16 T 0.4 DHHA2 pdbhh F Eukaryota T 1jlp 1 A A CM3F_CONPU PSI-CONOTOXIN PIIIF GPPCCLYGSCRPFPGCYNALCCRKX 25 T 0.11 Toxin_7 unphh F Eukaryota T 1jlz 1 A A KA131_TITOB Tityustoxin alpha-KTx ACGSCRKKCKGSGKCINGRCKCY 23 T 0.0072 Toxin_2 pdb F Eukaryota T 1jmt 2 B B U2AF2_HUMAN U2 SNRNP AUXILIARY FACTOR LARGE SUBUNIT KKKVRKYWDVPPPGFEHITPMQYKAMQA 28 T 0.0013 Transformer unp F Eukaryota T 1jn5 3 C C FG-repeat GQSPGFGQGGSV 12 T 1.4 PGF-CTERM pdbhh F T 1jn7 1 A A USH_DROME U-shaped TRANSCRIPTIONAL COFACTOR GSAAEVMKKYCSTCDISFNYVKTYLAHKQFYHKNKP 36 T 0.0003 zf-met pdb F Eukaryota T 1jno 1 A,B A,B VALYL GRAMICIDIN XGAXAXVXWXWXWXWX 16 T 4.6 MAP17 pdbhh F F 1jnv 1 A,B,C A,B,C ATP SYNTHASE ALPHA CHAIN XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 492 F F F 1jnv 2 D,E,F D,E,F ATP SYNTHASE BETA CHAIN XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 467 F F F 1jnv 3 G Y ATP SYNTHASE EPSILON CHAIN XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 135 F F F 1jnv 4 H Z ATP SYNTHASE GAMMA CHAIN XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 287 F F F 1jo3 1 A,B A,B VAL-GRAMICIDIN B XGAXAXVXWXFXWXWX 16 T 0.53 MAP17 pdbhh F T 1jo4 1 A,B A,B GRAMICIDIN D XGAXAXVXWXYXWXWX 16 T 3.1 MAP17 pdbhh F T 1joh 1 A,B A,B ANTIAMOEBIN I XFXXXXGLXXPQXPXPX 17 T 0.21 Pep_deformylase pdbhh F T 1joj 2 E,F,G,H P,Q,R,S HEXAPEPTIDE XMYWYPYX 8 T 6.4 Rax2 pdbhh F F 1jot 2 B B LECB2_MACPO AGGLUTININ GRNGKSQSIIVGPWGDRVTN 20 T 0.9 DUF3842 pdbhh F Eukaryota T 1jp5 2 C,D C,D epitope peptide corresponding to N-terminus of HIV-1 protease PQITLWQRR 9 T 0.5 Tfb2_C pdbhh F T 1jpf 3 C C LCMV peptidic epitope gp276 SGVENPGGYCL 11 T 9.3E-05 Arena_glycoprot pdbhh F T 1jpg 3 C C LCMV peptidic epitope np396 FQPQNGQFI 9 T 1.1 Arena_ncap_C pdbhh F T 1jpl 2 B,D,F,H E,F,G,H MPRI_HUMAN Cation-Independent Mannose 6-phosphate receptor FHDDSDEDLLHI 12 T 8 NTF-like pdbhh F Eukaryota T 1jq8 2 C P Peptide inhibitor LAIYS 5 T 86 KH_7 pdbhh F F 1jq9 2 C P Peptide inhibitor FLSYK 5 T 67 Cyanate_lyase pdbhh F F 1jrs 2 B B Leupeptin XLLX 4 T 590 DNTTIP1_dimer pdbhh F F 1jrt 2 B B LEUPEPTIN XLLX 4 T 590 DNTTIP1_dimer pdbhh F F 1jsp 1 A A P53_HUMAN tumor protein p53 SHLKSKKGQSTSRHKXLMFK 20 T 0.081 Zn_Tnp_IS1 pdbpssm F Eukaryota T 1ju5 2 B B CRK_MOUSE PROTO-ONCOGENE C-CRK, ADAPTER MOLECULE CRK, P38 EPGPXAQPSVNTK 13 T 1.6 Shugoshin_C pdbhh F Eukaryota T 1jui 2 E,F,G,H P,Q,R,S 10-mer Peptide MYWYPYASGS 10 T 3.9 DUF3263 pdbhh F T 1jvq 2 C C P14-P8 reactive loop peptide XSEAAAST 8 T 290 DUF5424 pdbhh F F 1jvq 3 D D exogenous Cholecystokinin tetrapeptide WMDFX 5 T 25 DUF1725 pdbhh F F 1jw6 2 B B PROTEIN (6-MER) MYWYPY 6 T 4.2 Glyco_hydro_70 pdbhh F F 1jwg 2 B,D C,D MPRI_HUMAN M6PR SFHDDSDEDLLHI 13 T 10 NTF-like pdbhh F Eukaryota T 1jxq 2 E,F E,F benzoxycarbonyl-Val-Ala-Asp-fluoromethyl ketone Inhibitor XEVDX 5 T 640 DUF5952 pdbhh F F 1jy4 1 A,B A,B B4DIMER RGECKFTVXGRTALNTXAVQKWHFVLXGYKCEILA 35 T 2.6 NAAA-beta pdbhh F T 1jy6 1 A,B A,B B4DIMER RGECKFTVXGRTALNTXAVQKWHFVLXGYKCEILA 35 T 2.6 NAAA-beta pdbhh F T 1jy9 1 A A DP-TT2 TTTTRYVEVXGKKILQTTTT 20 T 16 DUF6450 pdbhh F T 1jyc 2 E,F,G,H P,Q,R,S 15-mer peptide RVWYPYGSYLTASGS 15 T 2.2 DUF6375 pdbhh F T 1jyi 2 E,F,G,H P,Q,R,S 12-mer peptide DVFYPYPYASGS 12 T 1.8 XRN_M pdbhh F T 1jyq 2 C,D L,H mAZ-pY-(alpha Me)pY-N-NH2 peptide inhibitor XXXN 4 T 110 TFA2_Winged_2 pdbhh F F 1jyr 2 B L peptide: PSpYVNVQN APSXVNVQN 9 T 0.8 SH3-WW_linker pdbhh F T 1jzp 1 A A CAC1S_RABIT Skeletal Dihydropydrine Receptor TSAQKAKAEERKRRKMSRGLX 21 T 0.66 DUF1682 pdbhh F Eukaryota T 1k2d 3 C P MBP_HUMAN MBP PEPTIDE HSRGGASQYRPSQRHGTGSGSGS 23 T 2.2 Selenoprotein_S pdbhh F Eukaryota T 1k2n 2 B P RAD9_YEAST DNA repair protein Rad9 EVELTQELP 9 T 8.5 SidE_DUB pdbhh F Eukaryota T 1k3a 2 B B IRS1_HUMAN IRS-1 KKKSPGEYVNIEFG 14 T 0.41 DUF4834 pdbhh F Eukaryota T 1k3n 2 B B RAD9_YEAST DNA repair protein Rad9 KKMTFQTPTDPLE 13 T 11 YugN pdbhh F Eukaryota T 1k3q 2 B B RAD9_YEAST DNA repair protein Rad9 SLEVTEADATFVQ 13 T 41 Myticin-prepro pdbhh F Eukaryota T 1k43 1 A A MBH12 RGKWTYNGITYEGR 14 T 3.8 DUF4923 pdbhh F T 1k5n 3 C C nonameric model peptide m9 GRFAAAIAK 9 T 6.5 Ribosomal_L13 pdbhh F T 1k83 11 K M AAMAT_AMAPH AMATOXIN XXGIGCNP 8 T 0.85 DUF3085 pdbhh F Eukaryota T 1k91 1 A A CALR_RAT CRP55; CALREGULIN; HACBP; ERP60; CALBP; CALCIUM-BINDING PROTEIN 3; CABP3 GKPEHIPDPDAKKPEDWDEEMDGEWEPPVIQNPEYKG 37 T 2.1E-21 Calreticulin unp F Eukaryota T 1k9c 1 A A CALR_RAT CRP55; CALREGULIN; HACBP; ERP60; CALBP; CALCIUM-BINDING PROTEIN 3; CABP3 SKKIKDPDAAKPEDWDERAKIDDPTDSKPEDWDKPEHIPDPDAKKPEDWDEEMDGEWEPPVIQNPEYKGEWKPR 74 T 2.1E-21 Calreticulin unp F Eukaryota T 1k9q 2 B B WBP-1 GPPPYX 6 T 30 MT-A70 pdbhh F F 1k9r 2 B B WBP-1 XPLPPY 6 T 35 SCIMP pdbhh F F 1ka6 2 B B SLAF1_HUMAN SIGNALING LYMPHOCYTIC ACTIVATION MOLECULE RKSLTIXAX 9 T 0.1 MFS_1 unppssm F Eukaryota T 1ka7 2 B B SLAF1_HUMAN SIGNALING LYMPHOCYTIC ACTIVATION MOLECULE RKSLTIYAQVQK 12 T 0.1 MFS_1 unppssm F Eukaryota T 1kap 2 B I TETRAPEPTIDE (GLY SER ASN SER) GSNS 4 T 220 Penicil_amidase pdbhh F F 1kat 2 C,D X,Y V107 GGNECDIARMWEWECFERL 19 T 6 ARF7EP_C pdbhh F T 1kc2 2 B B PQpYEEIPI peptide PQXEEIPI 8 T 5.1 Imm15 pdbhh F F 1kcn 1 A A e109 zeta peptide ALCPAVCYVGGKALCPDVCYVX 22 T 3.8 MVL pdbhh F T 1kco 1 A A e131 Zeta Peptide VQCPHFCYELDYELCPDVCYVX 22 T 1.7 Prot_inhib_II pdbhh F T 1kfp 1 A A GOME_ACAGO GOMESIN QCRRLCYKQRCVTYCRGRX 19 T 0.0046 PanZ unp F Eukaryota T 1kga 1 A,A1,A2 A,A,A 2-KETO-3-DEOXY-6-PHOSPHOGLUCONATE ALDOLASE XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 219 F F F 1kh4 1 A,B A,B PPB_ECOLI ALKALINE PHOSPHATASE TPEMPVLENRAAQGNITAPGGARRLTGDQTAALRNSLSDKPAKNIILLIGDGMGDSEITAARNYAEGAGGFFKGIDALPLTGQYTHYALNKKTGKPDYVTDSAASATAWSTGVKTYNGALGVDIHEKDHPTILEMAKAAGLATGNVSTAELQDATPAALVAHVTSRKCYGPSATSQKCPGNALEKGGKGSITEQLLNARADVTLGGGAKTFAETATAGEWQGKTLREEAEARGYQLVSDAASLNSVTEANQQKPLLGLFADGNMPVRWLGPKATYHGNIDKPAVTCTPNPQRNDSVPTLAQMTDKAIELLSKNEKGFFLQVEGASIDKQNHAANPCGQIGETVDLDEAVQRALEFAKKEGNTLVIVTADHAHASQIVAPDTKAPGLTQALNTKDGAVMVMSYGNSEEDSQEHTGSQLRIAAYGPHAANVVGLTDQTDLFYTMKAALGLK 449 T 6.8E-10 Alk_phosphatase unppssm F Bacteria T 1kh5 1 A,B A,B PPB_ECOLI ALKALINE PHOSPHATASE TPEMPVLENRAAQGNITAPGGARRLTGDQTAALRNSLSDKPAKNIILLIGDGMGDSEITAARNYAEGAGGFFKGIDALPLTGQYTHYALNKKTGKPDYVTDSAASATAWSTGVKTYNGALGVDIHEKDHPTILEMAKAAGLATGNVSTAELQDATPAALVAHVTSRKCYGPSATSQKCPGNALEKGGKGSITEQLLNARADVTLGGGAKTFAETATAGEWQGKTLREEAEARGYQLVSDAASLNSVTEANQQKPLLGLFADGNMPVRWLGPKATYHGNIDKPAVTCTPNPQRNDSVPTLAQMTDKAIELLSKNEKGFFLQVEGASIDKQNHAANPCGQIGETVDLDEAVQRALEFAKKEGNTLVIVTADHAHASQIVAPDTKAPGLTQALNTKDGAVMVMSYGNSEEDSQEHTGSQLRIAAYGPHAANVVGLTDQTDLFYTMKAALGLK 449 T 6.8E-10 Alk_phosphatase unppssm F Bacteria T 1kh7 1 A,B A,B PPB_ECOLI alkaline phosphatase TPEMPVLENRAAQGNITAPGGARRLTGDQTAALRNSLSDKPAKNIILLIGDGMGDSEITAARNYAEGAGGFFKGIDALPLTGQYTHYALNKKTGKPDYVTDSAASATAWSTGVKTYNGALGVDIHEKDHPTILEMAKAAGLATGNVSTAELQGATPAALVAHVTSRKCYGPSKTSQKCPGNALEKGGKGSITEQLLNARADVTLGGGAKTFAETATAGEWQGKTLREEAEARGYQLVSDAASLNSVTEANQQKPLLGLFADGNMPVRWLGPKATYHGNIDKPAVTCTPNPQRNDSVPTLAQMTDKAIELLSKNEKGFFLQVEGASIDKQNHAANPCGQIGETVDLDEAVQRALEFAKKEGNTLVIVTADHAHASQIVAPDTKAPGLTQALNTKDGAVMVMSYGNSEEDSQEHTGSQLRIAAYGPHAANVVGLTDQTDLFYTMKAALGLK 449 T 6.8E-10 Alk_phosphatase unppssm F Bacteria T 1kh9 1 A,B A,B PPB_ECOLI Alkaline phosphatase TPEMPVLENRAAQGNITAPGGARRLTGDQTAALRNSLSDKPAKNIILLIGDGMGDSEITAARNYAEGAGGFFKGIDALPLTGQYTHYALNKKTGKPDYVTDSAASATAWSTGVKTYNGALGVDIHEKDHPTILEMAKAAGLATGNVSTAELQDATPAALVAHVTSRKCYGPSATSQKCPGNALEKGGKGSITEQLLNARADVTLGGGAKTFAETATAGEWQGKTLREEAEARGYQLVSDAASLNSVTEANQQKPLLGLFADGNMPVRWLGPKATYHGNIDKPAVTCTPNPQRNDSVPTLAQMTDKAIELLSKNEKGFFLQVEGASIDKQNHAANPCGQIGETVDLDEAVQRALEFAKKEGNTLVIVTADHAHASQIVAPDTKAPGLTQALNTKDGAVMVMSYGNSEEDSQEHTGSQLRIAAYGPHAANVVGLTDQTDLFYTMKAALGLK 449 T 6.8E-10 Alk_phosphatase unppssm F Bacteria T 1khj 1 A,B A,B PPB_ECOLI Alkaline phosphatase TPEMPVLENRAAQGNITAPGGARRLTGDQTAALRNSLSDKPAKNIILLIGDGMGDSEITAARNYAEGAGGFFKGIDALPLTGQYTHYALNKKTGKPDYVTDSAASATAWSTGVKTYNGALGVDIHEKDHPTILEMAKAAGLATGNVSTAELQHATPAALVAHVTSRKCYGPSATSQKCPGNALEKGGKGSITEQLLNARADVTLGGGAKTFAETATAGEWQGKTLREEAEARGYQLVSDAASLNSVTEANQQKPLLGLFADGNMPVRWLGPKATYHGNIDKPAVTCTPNPQRNDSVPTLAQMTDKAIELLSKNEKGFFLQVEGASIDKQNHAANPCGQIGETVDLDEAVQRALEFAKKEGNTLVIVTADHAHASQIVAPDTKAPGLTQALNTKDGAVMVMSYGNSEEDSQEHTGSQLRIAAYGPHAANVVGLTDQTDLFYTMKAALGLK 449 T 6.8E-10 Alk_phosphatase unppssm F Bacteria T 1khk 1 A,B A,B PPB_ECOLI Alkaline Phosphatase TPEMPVLENRAAQGNITAPGGARRLTGDQTAALRNSLSDKPAKNIILLIGDGMGDSEITAARNYAEGAGGFFKGIDALPLTGQYTHYALNKKTGKPDYVTDSAASATAWSTGVKTYNGALGVDIHEKDHPTILEMAKAAGLATGNVSTAELQHATPAALVAHVTSRKCYGPSATSQKCPGNALEKGGKGSITEQLLNARADVTLGGGAKTFAETATAGEWQGKTLREEAEARGYQLVSDAASLNSVTEANQQKPLLGLFADGNMPVRWLGPKATYHGNIDKPAVTCTPNPQRNDSVPTLAQMTDKAIELLSKNEKGFFLQVEGASIDKQNHAANPCGQIGETVDLDEAVQRALEFAKKEGNTLVIVTADHAHASQIVAPDTKAPGLTQALNTKDGAVMVMSYGNSEEDSQEHTGSQLRIAAYGPHAANVVGLTDQTDLFYTMKAALGLK 449 T 6.8E-10 Alk_phosphatase unppssm F Bacteria T 1khl 1 A,B A,B PPB_ECOLI Alkaline Phosphatase TPEMPVLENRAAQGNITAPGGARRLTGDQTAALRNSLSDKPAKNIILLIGDGMGDSEITAARNYAEGAGGFFKGIDALPLTGQYTHYALNKKTGKPDYVTDSAASATAWSTGVKTYNGALGVDIHEKDHPTILEMAKAAGLATGNVSTAELQHATPAALVAHVTSRKCYGPSATSQKCPGNALEKGGKGSITEQLLNARADVTLGGGAKTFAETATAGEWQGKTLREEAEARGYQLVSDAASLNSVTEANQQKPLLGLFADGNMPVRWLGPKATYHGNIDKPAVTCTPNPQRNDSVPTLAQMTDKAIELLSKNEKGFFLQVEGASIDKQNHAANPCGQIGETVDLDEAVQRALEFAKKEGNTLVIVTADHAHASQIVAPDTKAPGLTQALNTKDGAVMVMSYGNSEEDSQEHTGSQLRIAAYGPHAANVVGLTDQTDLFYTMKAALGLK 449 T 6.8E-10 Alk_phosphatase unppssm F Bacteria T 1khn 1 A,B A,B PPB_ECOLI Alkaline phosphatase TPEMPVLENRAAQGNITAPGGARRLTGDQTAALRNSLSDKPAKNIILLIGDGMGDSEITAARNYAEGAGGFFKGIDALPLTGQYTHYALNKKTGKPDYVTDSAASATAWSTGVKTYNGALGVDIHEKDHPTILEMAKAAGLATGNVSTAELQHATPAALVAHVTSRKCYGPSATSQKCPGNALEKGGKGSITEQLLNARADVTLGGGAKTFAETATAGEWQGKTLREEAEARGYQLVSDAASLNSVTEANQQKPLLGLFADGNMPVRWLGPKATYHGNIDKPAVTCTPNPQRNDSVPTLAQMTDKAIELLSKNEKGFFLQVEGASIDKQNHAANPCGQIGETVDLDEAVQRALEFAKKEGNTLVIVTADHAHASQIVAPDTKAPGLTQALNTKDGAVMVMSYGNSEEDSQEHTGSQLRIAAYGPHAANVVGLTDQTDLFYTMKAALGLK 449 T 6.8E-10 Alk_phosphatase unppssm F Bacteria T 1khp 2 B I peptidic inhibitor XLFGX 5 T 180 MRP-L51 pdbhh F F 1khq 2 B I peptidic inhibitor XLFGX 5 T 180 MRP-L51 pdbhh F F 1kj7 2 C P POL_HV1JR gag polyprotein PATIMMQRGN 10 T 0.6 HypA unp T Viruses T 1kjh 2 C P POL_HV1B1 POL POLYPROTEIN IRKILFLDGI 10 T 0.011 Spermine_synt_N pdbhh T Viruses T 1kjm 3 C P B6 Peptide AQFSASASR 9 T 9 PAGK pdbhh F F 1kjv 3 C P UBQL1_RAT peptide NPR NPRAMQALL 9 T 1.1 STI1 unp F Eukaryota T 1kkq 2 E,F,G,H E,F,G,H NCOR2_HUMAN NUCLEAR COREPRESSOR SMRT C-TERMINAL RECEPTOR INTERACTING MOTIF NMGLEAIIRKALMGKYDQW 19 T 4.2 RHH_7 pdbhh F Eukaryota T 1kl3 2 E,F,G,H E,F,G,H strep-tag II peptide NWSHPQFEK 9 T 1.3 CreD pdbhh F T 1kl5 2 E,F,G,H E,F,G,H strep-tag II NWSHPQFEK 9 T 1.3 CreD pdbhh F T 1klq 2 B B MBP1 SWYSYPPPQRAV 12 T 8.6 NADHdh_A3 pdbhh F T 1kmr 1 A A PSPB_HUMAN SP-B, PULMONARY SURFACTANT-ASSOCIATED PROTEOLIPID SPL(PHE), 18 KDA PULMONARY-SURFACTANT PROTEIN CRALIKRIQAMIPKG 15 T 0.013 SapB_1 unphh F Eukaryota T 1ko6 2 B,D B,D NUP98_HUMAN NUCLEOPORIN NUP98, 98KDA NUCLEOPORIN SKYGLQDSDEEEEEHPSKTSTKKLKTAPLPPASQTTPLQMALNGKPAPPPQVEKKGQLEHHHHH 64 T 74 SIN1 pdbhh F Eukaryota T 1kp6 1 A A KP6T_UMV6 PROTEIN (TOXIN) NNAFCAGFGLSCKWECWCTAHGTGNELRYATAAGCGDHLSKSYYDARAGHCLFSDDLRNQFYSHCSSLNNNMSCRSLSK 79 T 0.3 YobH unp T Viruses T 1kpr 3 E,F P,Q Peptide VMAPRTVLL VMAPRTVLL 9 T 0.0013 UL40 pdbhh F T 1kqe 1 A,C A,D MINI-GRAMICIDIN A XXVXWXWXWXWX 12 T 4.3 DUF2826 pdbhh F F 1kqe 2 B,D B,E MINI-GRAMICIDIN A AXVXWXWXWXWX 12 T 4.3 DUF2826 pdbhh F F 1ktl 3 E,F P,Q PEPTIDE B27 VTAPRTLLL 9 T 0.24 UL40 pdbhh F T 1ktr 2 B P Oligohistidine peptide Antigen HHHHHH 6 T 6100 zf_CCCH_4 pdbhh F F 1ku8 2 B,D,F,H B,D,F,H LECB1_ARTIN AGGLUTININ BETA CHAIN NEQSGISQTVIVGPWGAK 18 T 2.3 DUF3842 pdbhh F Eukaryota T 1kug 2 B B ENW QNW 3 T 27 DUF6275 pdbhh F F 1kui 2 B B EQW QQW 3 T 34 SARS_3b pdbhh F F 1kuj 2 B,D,F,H B,D,F,H LECB1_ARTIN AGGLUTININ BETA CHAIN NEQSGISQTVIVGPWGAK 18 T 2.3 DUF3842 pdbhh F Eukaryota T 1kuk 2 B B EKW QKW 3 T 38 Herpes_DNAp_acc pdbhh F F 1kvd 1 A,C A,C TOXK_MILFA SMK TOXIN WSLRWRMQKSTTIAAIAGCSGAATFGGLAGGIVGCIAAGILAILQGFEVNWHNGGGGDRSNPV 63 T 0.63 DUF1056 pdbhh F Eukaryota T 1kvd 2 B,D B,D TOXK_MILFA SMK TOXIN GEATTIWGVGADEAIDKGTPSKNDLQNMSADLAKNGFKGHQGVACSTVKDGNKDVYMIKFSLAGGSNDPGGSPCSDD 77 T 11 IMS_HHH pdbhh F Eukaryota T 1kve 1 A,C A,C TOXK_MILFA SMK TOXIN WSLRWRMQKSTTIAAIAGCSGAATFGGLAGGIVGCIAAGILAILQGFEVNWHNGGGGDRSNPV 63 T 0.63 DUF1056 pdbhh F Eukaryota T 1kve 2 B,D B,D TOXK_MILFA SMK TOXIN GEATTIWGVGADEAIDKGTPSKNDLQNMSADLAKNGFKGHQGVACSTVKDGNKDVYMIKFSLAGGSNDPGGSPCSDD 77 T 11 IMS_HHH pdbhh F Eukaryota T 1kvf 1 A A PROTEIN: EMP-18 Receptor Agonist TYSCHFGPLTWVCKPQX 17 F F T 1kvg 1 A A Protein: EPO-3 Receptor Agonist SCHFGPLGWVCKX 13 F F T 1ky6 2 B P EPN1_RAT EPSIN 1 FSDPWGG 7 T 0.33 Imm32 pdbhh F Eukaryota T 1ky7 2 B P AMPH_HUMAN AMPHIPHYSIN SFFEDNFVPE 10 T 0.058 CCDC32 pdbhh F Eukaryota T 1kyd 2 B P EPN1_HUMAN EPSIN 1 GSDPWK 6 T 0.58 DUF5054 pdbhh F Eukaryota T 1kyf 2 B P EPS15_MOUSE PROTEIN EPS15, AF-1P PROTEIN GSDPFK 6 T 2.9 DUF5054 pdbhh F Eukaryota T 1kyj 1 A A PENTAPEPTIDE FRAGMENT OF SIALOPHORIN XSTTAV 6 T 600 RPN2_C pdbhh F F 1kyu 2 B P EPS15_MOUSE PROTEIN EPS15, AF-1P PROTEIN GSDPFK 6 T 2.9 DUF5054 pdbhh F Eukaryota T 1kzo 3 C C RASK_HUMAN TRANSFORMING PROTEIN P21B KKKSKTKCVIM 11 T 0.035 Fer4_22 unppssm F Eukaryota T 1kzp 3 C C RASK_HUMAN TRANSFORMING PROTEIN P21B KKKSKTKCVIM 11 T 0.035 Fer4_22 unppssm F Eukaryota T 1l0s 1 A,B,C,D A,B,C,D Q9GTP0_CHOFU ANTIFREEZE PROTEIN ISOFORM 337 DGSCTNTNSQLSANSKCEKSTLTNCYVDKSEVFGTTCTGSRFDGVTITTSTSTGSRISGPGCKISTCIITGGVPAPSAACKISGCTFSAN 90 T 6.2E-25 CfAFP unppssm F Eukaryota T 1l1i 1 A A ANPY1_TENMO Thermal hysteresis protein isoform YL-1 (2-14) QCTGGADCTSCTGACTGCGNCPNAVTCTNSQHCVKANTCTGSTDCNTAQTCTNSKDCFEANTCTDSTNCYKATACTNSSGCPGH 84 T 0.0023 AFP pdb F Eukaryota T 1l1v 2 B B DACTINOMYCIN TXPXXXTXPXX 11 T 61 DUF5572 pdbhh F F 1l2y 1 A A TC5b NLYIQWLKDGGPSSGRPPPS 20 T 2.5 Mastoparan_2 pdbhh F T 1l2z 2 B B CD2_HUMAN T-CELL SURFACE ANTIGEN T11/LEU-5, LFA-2, LFA-3 RECEPTOR, ERYTHROCYTE RECEPTOR, ROSETTE RECEPTOR SHRPPPPGHRV 11 T 8.9 Peptidase_C21 pdbhh F Eukaryota T 1l3q 1 A A ARAGONITE-ASSOCIATED PROTEIN FPGKNVNCTSGE 12 T 7.7 DUF5736 pdbhh F T 1l4x 1 A,B,C,D,E,F,G,H A,B,C,D,E,F,G,H SIN-ASP-GLU-LEU-GLU-ARG-ALA-ILE-ARG-GLU-LEU-ALA-ALA-ARG-ILE-LYS-NH2 XDELERAIRELAARIKX 17 T 1.6 DUF5320 pdbhh F T 1l5g 3 C C CYCLIC ARG-GLY-ASP PEPTIDE RGDFX 5 T 23 DUF5414 pdbhh F F 1l6o 2 D,E,F D,E,F DISHEVELLED INTERACTING ANTAGONIST, DPR1 SLKLMTTV 8 T 0.0032 Dapper pdbhh F T 1l7z 2 B B ACIDIC MEMBRANE PROTEIN GGKLSKKKK 9 T 0.07 BASP1 pdbhh F F 1lb5 2 B B TNR11_HUMAN receptor activator of nuclear factor-kappa B QMPTEDEY 8 T 0.24 KIX unp F Eukaryota T 1lb6 2 B B TNR5_HUMAN CD40 antigen KQEPQEIDF 9 T 0.027 DUF2207 unppercent F Eukaryota T 1lb7 1 A A IGF-1 ANTAGONIST F1-1 RNCFESVAALRRCMYG 16 T 2.6 DUF4695 pdbhh F T 1lcj 2 B B MT_POVHA PHOSPHOPEPTIDE EPQ(PHOSPHO)YEEIPIYL EPQXEEIPIYL 11 T 3.2 Imm15 pdbhh T Viruses T 1lck 2 B B LCK_HUMAN TAIL PHOSPHOPEPTIDE TEGQ(PHOSPHO)YQPQPA EGQXQPQPA 9 T 8.1 Sa_NUDIX pdbhh F Eukaryota T 1lcm 1 A _ MICROCYSTIN-LR XLXRXXX 7 T 49 Phage_TAC_2 pdbhh F F 1ld9 3 C,F C,F NANO-PEPTIDE YPNVNIHNF 9 T 1.1 DUF5454 pdbhh F T 1ldp 3 C P PEPTIDE APAAAAAAM 9 T 210 DUF5994 pdbhh F F 1le0 1 A _ Tryptophan Zipper 1 SWTWEGNKWTWKX 13 T 2.6 WXXGXW pdbhh F T 1le1 1 A _ Tryptophan Zipper 2 SWTWENGKWTWKX 13 T 0.64 Chibby pdbhh F T 1lew 2 B B MEF2A_HUMAN SERUM RESPONSE FACTOR-LIKE PROTEIN 1 RKPDLRVVIPPS 12 T 5.8 PDDEXK_7 pdbhh F Eukaryota T 1lez 2 B B MP2K3_MOUSE MKK3B SKGKSKRKKDLRISCNSK 18 T 11 Paramyxo_NS_C pdbhh F Eukaryota T 1lf8 2 E,F,G,H E,F,G,H MPRI_HUMAN CI MAN-6-P RECEPTOR, CI-MPR, INSULIN-LIKE GROWTH FACTOR II RECEPTOR FHDDSDEDLLHI 12 T 8 NTF-like pdbhh F Eukaryota T 1lgc 2 B,E,H H,I,J DIPEPTIDE NQ 2 T 370 FcoT pdbhh F F 1lj2 2 C,D C,D IF4G1_HUMAN EIF4GI APKRERKTIRIRDPNQGGKDITEEIMSG 28 T 0.036 PHB_acc_N pdbhh F Eukaryota T 1lk6 2 C C antithrombin P14-P9 peptide XSEAAAS 7 T 580 CAP_N pdbhh F F 1lk6 3 D D exogenous tripeptide formyl-MLF MLF 3 T 140 DUF3719 pdbhh F F 1lkk 2 B B PHOSPHOTYROSYL PEPTIDE AC-PTYR-GLU-GLU-ILE XXEEI 5 T 92 Phage_XkdX pdbhh F F 1lkl 2 B B PHOSPHOTYROSYL PEPTIDE AC-PTYR-GLU-GLU-GLY XXEEG 5 T 180 zf-DNL pdbhh F F 1loc 3 C,F,I,L 1,2,3,4 MURAMYL-DIPEPTIDE D-ALA-D-IGLN XXX 3 T 2700 Proteasome_A_N pdbhh F F 1loi 1 A _ RNPDE4A1A, RAT TYPE IV CYCLIC AMP SPECIFIC PHOSPHODIESTERASE, ISOFORM SUBFAMILY A, SPLICE VARIANT 1 MPLVDFFCETCSKPWLVGWWDQFKRX 26 T 0.28 Rad50_zn_hook pdbhh F T 1lop 2 B B SUCCINYL-ALA-PRO-ALA-P-NITROANILIDE XAPAX 5 T 1100 UPF0547 pdbhh F F 1lp9 3 C,H C,J EMC7_HUMAN self-peptide P1049 ALWGFFPVL 9 T 0.51 MRP-L47 pdbhh F Eukaryota T 1lq7 1 A A Alpha3W GSRVKALEEKVKALEEKVKALGGGGRIEELKKKWEELKKKIEELGGGGEVKKVEEEVKKLEEEIKKL 67 T 0.00012 ZapB pdb F T 1ls5 2 C,D C,D Pepstatin XVVXAX 6 T 1700 FAM60A pdbhh F F 1ltj 4 G,I G,I Gly-Pro-Arg-Pro GPRP 4 T 65 SRCR_2 pdbhh F F 1ltj 5 H,J H,J Gly-His-Arg-Pro GHRP 4 T 14 VPS38 unphh F F 1ltx 3 C R RAE1_RAT RAB PROTEINS GERANYLGERANYLTRANSFERASE COMPONENT A 1 MADNLPSDFDVIVIGTGLPESIIAAACSRSGQRVLHVDSRSYYGGNWASFSFSGLLSWLKEYQENNDVVTENSMWQEQILENEEAIPLSSKDKTIQHVEVFCYASQDLHKDVEEAGALQKNHASVTSAQSAEAAEAAETSCLPTAVEPLSMGSCEIPAEQSQCPGPESSPEVNDAEATGKKENSDAKSSTEEPSENVPKVQDNTETPKKNRITYSQIIKEGRRFNIDLVSKLLYSRGLLIDLLIKSNVSRYAEFKNITRILAFREGTVEQVPCSRADVFNSKQLTMVEKRMLMKFLTFCVEYEEHPDEYRAYEGTTFSEYLKTQKLTPNLQYFVLHSIAMTSETTSCTVDGLKATKKFLQCLGRYGNTPFLFPLYGQGELPQCFCRMCAVFGGIYCLRHSVQCLVVDKESRKCKAVIDQFGQRIISKHFIIEDSYLSENTCSRVQYRQISRAVLITDGSVLKTDADQQVSILTVPAEEPGSFAVRVIELCSSTMTCMKGTYLVHLTCMSSKTAREDLERVVQKLFTPYTEIEAENEQVEKPRLLWALYFNMRDSSDISRDCYNDLPSNVYVCSGPDSGLGNDNAVKQAETLFQQICPNEDFCPAPPNPEDIVLDGDSSQQEVPESSVTPETNSETPKESTVLGNPEEPSE 650 T 4.3E-15 GDI pdbpssm F Eukaryota T 1ltx 4 D P AAAA AAAA 4 T 900 Cyclin_C pdbhh F F 1lup 1 A A MTX2_GRARO GsMTx2 YCQKWMWTCDEERKCCEGLVCRLWCKRIINM 31 T 0.00052 Toxin_12 pdb F Eukaryota T 1lvb 2 B,D C,D POLG_TEV OLIGOPEPTIDE SUBSTRATE FOR THE PROTEASE TENLYFQSGT 10 T 6.2 CX pdbhh T Viruses T 1lvm 2 C,D C,D POLG_TEV OLIGOPEPTIDE SUBSTRATE FOR THE PROTEASE XENLYFQSGT 10 T 5.7 CX pdbhh T Viruses T 1lvm 3 E E POLG_TEV CATALYTIC DOMAIN OF THE NUCLEAR INCLUSION PROTEIN A (NIA) EATQLMN 7 T 8.7 DUF3460 pdbhh T Viruses T 1lvz 1 A A GNAT1_BOVIN TRANSDUCIN ALPHA-1 CHAIN IRENLKDSGLF 11 T 0.38 ssDNA_TraI_N pdbhh F Eukaryota T 1lwu 4 M,N,O,P M,N,O,P Peptide Ligand Gly-His-Arg-Pro-NH2 GHRPX 5 T 100 zf-CCHC pdbhh F F 1lyb 3 C,F I,J PEPSTATIN XVVXAX 6 T 1700 FAM60A pdbhh F F 1m02 1 A A PW2 HPLKQYWWRPSI 12 T 0.31 Leader_Trp pdbhh F T 1m08 1 A,B A,B CEA7_ECOLX Colicin E7 MRNKPGKATGKGKPVNNKWLNNAGKDLGSPVPDRIANKLRDKEFKSFDDFRKKFWEEVSKDPELSKQFSRNNNDRMKVGKAPKTRTQDVSGKRTSFELHHEKPISQNGGVYDMDNISVVTPKRHIDIHRGK 131 T 0.021 HNH pdbpercent F Bacteria T 1m0v 2 B B SKAP2_MOUSE SKAP-HOM PEPTIDE XDEXDDPFX 9 T 2.7 DUF2555 pdbhh F Eukaryota F 1m1j 4 G,H G,H GLY-PRO-ARG-PRO peptide GPRP 4 T 65 SRCR_2 pdbhh F F 1m1j 5 I,J I,J GLY-HIS-ARG-PRO peptide GHRP 4 T 14 VPS38 unphh F F 1m21 2 C,D C,D CHYMOSTATIN FXLX 4 T 52 FAA_hydrolase_N pdbhh F F 1m24 1 A,B A,B TRICHOTOXIN_A50E XXGXLXQXXXAAXPLXXQX 19 T 22 FAD_oxidored pdbhh F T 1m26 2 B,D,F,H B,D,F,H LECB3_ARTIN AGGLUTININ BETA CHAIN SGISQTVIVGPWGAKSA 17 T 0.51 DUF3842 pdbhh F Eukaryota T 1m27 2 B B SLAF1_HUMAN SLAM, IPO-3, CD150 ANTIGEN, CDW150 KSLTIYAQVQK 11 T 0.1 MFS_1 unppssm F Eukaryota T 1m2e 1 A A KAIA_SYNE7 KaiA MLSQIAICIWVESTAILQDCQRALSADRYQLQVCESGEMLLEYAQTHRDQIDCLILVAANPSFRAVVQQLCFEGVVVPAIVVGDRDSEDPDEPAKEQLYHSAELHLGIHQLEQLPYQVDAALAEFLRLAPVETMA 135 T 0.066 CHAT pdb F Bacteria T 1m2f 1 A A KAIA_SYNE7 KaiA MLSQIAICIWVESTAILQDCQRALSADRYQLQVCESGEMLLEYAQTHRDQIDCLILVAANPSFRAVVQQLCFEGVVVPAIVVGDRDSEDPDEPAKEQLYHSAELHLGIHQLEQLPYQVDAALAEFLRLAPVETMA 135 T 0.066 CHAT pdb F Bacteria T 1m3w 1 A,B,C,D A,B,C,D H10H24 CGGGEIWKLHEEFLKKFEELLKLHEERLKKMX 32 T 4.6 DUF761 pdbhh F T 1m43 2 C,D C,D Pepstatin XVVXAX 6 T 1700 FAM60A pdbhh F F 1m46 2 B B MYO2_YEAST IQ4 SVLRTITNLQKKIRKELKQRQLKQE 25 T 0.00018 IQ unppssm F Eukaryota T 1m4h 2 C,D C,D Inhibitor OM00-3 ELDXVEF 7 T 3.3 Endotoxin_C pdbhh F T 1m63 4 D,H D,H CYCLOSPORINE, CICLOSPORIN, CICLOSPORINE XXXXXXXXVXA 11 T 9.7 DUF6090 pdbhh F F 1m72 2 B,D,F D,E,F Ace-Asp-Glu-Val-Asp-chloromethylketone XDEVDX 6 T 200 ResIII pdbhh F F 1m7t 1 A A THIO_ECOLI;THIO_HUMAN Chimera of Human and E. coli thioredoxin MVKQIESKTAFQEALDAAGDKLVVVDFSATWCGPCKMIKPFFHSLSEKYSNVIFLEVDVDDAQDVAPKYGIRGIPTLLLFKNGEVAATKVGALSKGQLKEFLDANLV 107 T 8.3E-39 Thioredoxin unppssm F Eukaryota T 1ma2 1 A A TAC1_TACTR Tachyplesin I KWCFRVCYRGICYRRCR 17 T 0.021 Myticin-prepro unp F Eukaryota T 1ma3 2 B B P53_HUMAN TUMOR SUPPRESSOR P53, PHOSPHOPROTEIN, P53, ANTIGEN NY-CO-13 KKGQSTSRHKXLMFKTEG 18 T 30 Class_IIIsignal pdbhh F Eukaryota T 1ma4 1 A A TAC1_TACTR Tachyplesin 1 KWYFRVYYRGIYYRRYR 17 T 0.021 Myticin-prepro unp F Eukaryota T 1ma5 1 A A TAC1_TACTR Tachyplesin 1 KWCFRVCYRGICYRRCR 17 T 0.021 Myticin-prepro unp F Eukaryota T 1ma6 1 A A TAC1_TACTR Tachyplesin I KWYFRVYYRGIYYRRYR 17 T 0.021 Myticin-prepro unp F Eukaryota T 1mag 1 A,B A,B VALYL GRAMICIDIN XGAXAXVXWXWXWXWX 16 T 4.6 MAP17 pdbhh F F 1mcb 2 C P PEPTIDE N-ACETYL-L-GLN-D-PHE-L-HIS-D-PRO-OH XQXXX 5 T 29 PMI_typeI pdbhh F F 1mcc 2 C P PEPTIDE N-ACETYL-L-GLN-D-PHE-L-HIS-D-PRO-NH2 XQXHXX 6 T 0.13 Peptidase_C26 pdbhh F F 1mcd 2 C P PEPTIDE N-ACETYL-D-PHE-B-ALA-L-HIS-D-PRO-NH2 XXXHXX 6 T 50 Flagellar_rod pdbhh F F 1mce 2 C P PEPTIDE N-ACETYL-L-GLN-D-PHE-L-HIS-D-PRO-B-ALA-OH XQXHXX 6 T 0.13 Peptidase_C26 pdbhh F F 1mcf 2 C P PEPTIDE N-ACETYL-L-GLN-D-PHE-L-HIS-D-PRO-B-ALA-B-ALA-OH XQXHXXX 7 T 0.22 Peptidase_C26 pdbhh F F 1mch 2 C P PEPTIDE N-ACETYL-L-GLN-D-PHE-L-HIS-D-PRO-B-ALA-B-ALA-OH XQFHPXX 7 T 0.22 Peptidase_C26 pdbhh F F 1mci 2 C P PEPTIDE N-ACETYL-D-PHE-L-HIS-D-PRO-OH XXHX 4 T 40 GATase_5 pdbhh F F 1mcj 2 C P PEPTIDE N-ACETYL-D-PHE-L-HIS-D-PRO-NH2 XFHPX 5 T 34 GATase pdbhh F F 1mck 2 C P PEPTIDE N-ACETYL-D-GLU-L-HIS-D-PRO-NH2 XXHXX 5 T 260 ADD_DNMT3 pdbhh F F 1mcl 2 C P N-ACETYL-D-HIS-L-PRO-OH XXP 3 T 190 DUF883_C pdbhh F F 1mcn 2 C P PEPTIDE N-ACETYL-D-HIS-L-PRO-NH2 XXPX 4 T 320 BTK pdbhh F F 1mcq 2 C P PEPTIDE N-ACETYL-L-HIS-D-PRO-NH2 XHXX 4 T 320 BTK pdbhh F F 1mcr 2 C P PEPTIDE N-ACETYL-L-HIS-D-PRO-OH XHX 3 T 190 DUF883_C pdbhh F F 1mcs 2 C P PEPTIDE N-ACETYL-L-GLN-D-PHE-L-HIS-D-PRO-OH XQXHX 5 T 29 PMI_typeI pdbhh F F 1mf4 2 B B VAL-ALA-PHE-ARG-SER VAFRS 5 T 83 IML1 pdbhh F F 1mf8 4 D D CYCLOSPORINE, CICLOSPORIN, CICLOSPORINE XXXXXXXXVXA 11 T 9.7 DUF6090 pdbhh F F 1mfg 2 B B ERBB2_HUMAN Erb-B2 carboxyl-terminal fragment EYLGLDVPV 9 T 8.1 FAM110_C pdbhh F Eukaryota T 1mfl 2 B B ERBB2_HUMAN PHOSPHORYLATED Erb-B2 carboxyl-terminal fragment. EXLGLDVPV 9 T 8.1 FAM110_C pdbhh F Eukaryota T 1mhe 3 E,F P,Q PEPTIDE (VMAPRTVLL) VMAPRTVLL 9 T 0.0013 UL40 pdbhh F T 1mhw 3 E,F,G,H E,F,G,H 4-biphenylacetyl-Cys-(D)Arg-Tyr-N-(2-phenylethyl) amide XCXYX 5 T 52 YobH pdbhh F F 1mic 1 A,B A,B VALYL GRAMICIDIN XGAXAXVXWXWXWXWX 16 T 4.6 MAP17 pdbhh F F 1mik 2 B B CYCLOSPORINE, CICLOSPORIN, CICLOSPORINE XXXXXXXXVXA 11 T 9.7 DUF6090 pdbhh F F 1mnf 2 B,BA,D,F,H,J,L,N,P,R,T,V,X,Z O,2,P,Q,R,S,T,U,V,W,X,Y,Z,1 SBP, STRONG BINDING PEPTIDE SWMTTPWGFLHP 12 T 1.1 DUF6163 pdbhh F T 1mnv 2 C,D C,D DACTINOMYCIN TXPXXXTXPXX 11 T 61 DUF5572 pdbhh F F 1mpw 1 A,B A,B CPXA_PSEPU CAMPHOR 5-MONOOXYGENASE TTETIQSNANLAPLPPHVPEHLVFDFDMYNPSNLSAGVQEAWAVLQESNVPDLVWTRCNGGHWIATRGQLIREAYEDYRHFSSECPWIPREAGEAFDFIPTSMDPPEQRQFRALANQVVGMPVVDKLENRIQELACSLIESLRPQGQCNFTEDYAEPFPIRIFMLLAGLPEEDIPHLKYLTDQMTRPDGSMTFAEAKEALYDYLIPIIEQRRQKPGTDAISIVANGQVNGRPITSDEAKRMCGLLLLGGLDTVVNFLSFSMEFLAKSPEHRQELIERPERIPAACEELLRRFSLVADGRILTSDYEFHGVQLKKGDQILLPQMLSGLDERENAAPMHVDFSRQKVSHTTFGHGSHLCLGQHLARREIIVTLKEWLTRIPDFSIAPGAQIQHKSGIVSGVQALPLVWDPATTKAV 414 T 1.6E-05 p450 unppercent F Bacteria T 1mt8 2 C P Q9YX54_9HIV1 Capsid-p2 substrate peptide of HIV-1 Gag polyprotein KARVLAEAMS 10 T 13 GREB1 pdbhh T Viruses T 1mtn 1 A,E A,E CTRA_BOVIN A-CHT CGVPAIQPVLSGL 13 T 2 CaM_bind pdbhh F Eukaryota T 1mw4 2 B B ERBB2_HUMAN PY1139, NEU PROTO-ONCOGENE, C-ERBB-2, TYROSINE KINASE-TYPE CELL SURFACE RECEPTOR HER2, MLN 19 PQPEXVNQPD 10 T 0.38 CYSTM pdbhh F Eukaryota T 1mw5 1 A,B A,B Y1480_HAEIN HYPOTHETICAL PROTEIN HI1480 GSHMSETDLLMKMVRQPVKLYSVATLFHEFSEVITKLEHSVQKEPTSLLSEENWHKQFLKFAQALPAHGSASWLNLDDALQAVVGNSRSAFLHQLIAKLKSRHLQVLELNKIGSEPLDLSNLPAPFYVLLPESFAARITLLVQDKALPYVRVSMEYWHALEYKGELNDPAANKARKEAELAAATAEQ 187 T 11 DUF4211 unphh F Bacteria T 1mxe 2 B,D E,F KCC1A_RAT CAM KINASE I IKKNFAKSKWKQAFNATAVVRHMRK 25 T 0.12 Tyrosinase unp F Eukaryota T 1mxq 1 A A TKN_ELECI Eledoisin QPSKDAFIGLM 11 T 0.0001 Tachykinin pdbhh F Eukaryota T 1n09 1 A A bhpW, disulfide cyclized beta-hairpin peptide XCTWEGNKLTCX 12 T 0.74 Lipocalin_7 pdbhh F T 1n0a 1 A A bhpw_pdg, beta-hairpin peptide XCTWEPDGKLTCX 13 T 0.64 PD40 pdbhh F T 1n0c 1 A A bhp_HWLV, disulfide cyclized beta-hairpin peptide XCHWEGNKLVCX 12 T 0.37 Lipocalin_7 pdbhh F T 1n0d 1 A A bhp_VWLH, disulfide cyclized beta-hairpin peptide XCVWEGNKLHCX 12 T 2.6 PHA_gran_rgn pdbhh F T 1n0w 3 C L peptide linker TGSTGSTGSTGSMG 14 T 7.9 CCDC85 pdbhh F F 1n0w 4 D C ARTIFICIAL GLY-SER-MSE-GLY PEPTIDE GSMG 4 T 33 Ssl1 pdbhh F F 1n0x 3 E,F P,R B2.1 peptide HERSYMFSDLENRCIAAEXKK 21 T 0.58 TEP1_N pdbhh F T 1n3n 3 I,J,K,L I,J,K,L mycobacterial hsp60 decameric epitope SALQNAASIA 10 T 9.7 dsRBD2 pdbhh F T 1n4i 1 A A Q9GTP0_CHOFU thermal hysteresis protein DGSCTNTNSQLSANSKCEKSTLTNCYVDKSEVYGTTCTGSRFDGVTITTSTSTGSRISGPGCKISTCIITGGVPAPSAACKISGCTFSAN 90 T 6.2E-25 CfAFP unppssm F Eukaryota T 1n4m 2 C,D,E C,D,E E2F2_HUMAN E2F-2 DDYLWGLEAGEGISDLFD 18 T 5.2 Carcinustatin pdbhh F Eukaryota T 1n4p 3 M,N M,N RASK_HUMAN KKKSKTKCVIL PEPTIDE KKKSKTKCVIL 11 T 0.035 Fer4_22 unppssm F Eukaryota T 1n4q 3 M,N,O,P,Q,R M,N,O,P,Q,R RASK_HUMAN KKKSKTKCVIL PEPTIDE KKKSKTKCVIL 11 T 0.035 Fer4_22 unppssm F Eukaryota T 1n4r 3 M,N,O,P,Q,R M,N,O,P,Q,R RASK_HUMAN KKKSKTKCVIL PEPTIDE PRODUCT KKKSKTKCVIL 11 T 0.035 Fer4_22 unppssm F Eukaryota T 1n4s 3 M,N,O,P,Q,R M,N,O,P,Q,R RASK_HUMAN KKKSKTKCVIL PEPTIDE PRODUCT KKKSKTKCVIL 11 T 0.035 Fer4_22 unppssm F Eukaryota T 1n51 2 B B apstatin XPPAX 5 T 680 SEC-C pdbhh F F 1n5z 2 B,D P,Q PEX14_YEAST PEROXIN-14 EAMPPTLPHRDWKD 14 T 3.4E-05 DUF1664 unphh F Eukaryota T 1n6d 2 B,D,F,H,J,L G,H,I,J,K,L RVRK RVRK 4 T 110 DUF5560 pdbhh F F 1n6e 2 B,D,F,H,J,L B,D,F,H,J,L DQTQKAAAELTFF DQTQKAAAELTFF 13 T 50 NEMP pdbhh F T 1n73 4 G,H,I,J G,H,I,J FIBB_HUMAN peptide ligand: Gly-his-Arg-Pro-amide GHRP 4 T 14 VPS38 unphh F Eukaryota F 1n7f 2 C,D C,D LIPA1_HUMAN 8-mer peptide from interacting protein (liprin) ATVRTYSC 8 T 2.4 BLIP pdbhh F Eukaryota T 1n7t 2 B B phage-derived peptide TGWETWV 7 T 0.59 ATG101 pdbhh F F 1n86 4 G,H G,H FIBA_HUMAN fibrin alpha chain peptide ligand fragment Gly-Pro-Arg GPR 3 T 1.4 DUF4715 unphh F Eukaryota F 1n86 5 I,J I,J FIBB_HUMAN fibrin beta chain peptide ligand fragment Gly-His-Arg-Pro-Leu-Asp-Lys GHRPLDK 7 T 5.8 DUF1824 pdbhh F Eukaryota T 1n8o 1 A A CTRA_BOVIN Chymotrypsin A, A chain CGVPAIQPVLSGL 13 T 2 CaM_bind pdbhh F Eukaryota T 1n9u 1 A A ANGT_HUMAN ANG I DRVYIHPFHL 10 T 0.39 Nairo_nucleo pdbhh F Eukaryota T 1n9v 1 A A ANGT_HUMAN ANG II DRVYIHPF 8 T 0.67 Nairo_nucleo pdbhh F Eukaryota T 1nb3 2 B,E,H,K P,R,S,T CATH_PIG CATHEPSIN H MINI CHAIN EPQNCSAT 8 T 0.4 SCAN unp F Eukaryota T 1nb5 2 B,E,H,K P,R,S,T CATH_PIG Cathepsin H MINI CHAIN EPQNCSAT 8 T 0.4 SCAN unp F Eukaryota T 1nes 2 B,C I,J ACETYL-ALA-PRO-ALA XAPA 4 T 800 DUF1467 pdbhh F F 1nex 3 E,F E,F GLL(TPO)PPQSG GLLTPPQSG 9 T 6.5 FTZ pdbhh F T 1ng8 1 A,B A,B VALYL GRAMICIDIN XGAXAXVXWXWXWXGX 16 T 0.88 DUF6186 pdbhh F F 1nh0 2 C,D I,J peptidomimetic inhibitor KI2-PHE-GLU-GLU-NH2 XFEEX 5 T 350 DUF4210 pdbhh F F 1nhg 2 C,D C,D Q9BH77_PLAFA ENOYL-ACP-REDUCTASE YTFIDYAIEYSEKYAPLRQKLLSTDIGSVASFLLSRESRAITGQTIYVDNGLNIMFLPDD 60 F F Eukaryota T 1nhw 2 C,D C,D Q9BH77_PLAFA ENOYL-ACP-REDUCTASE YTFIDYAIEYSEKYAPLRQKLLSTDIGSVASFLLSRESRAITGQTIYVDNGLNIMFLPDD 60 F F Eukaryota T 1nik 4 D D DNA-directed RNA polymerase II, chain RPB4 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 161 F F F 1nik 7 G G DNA-directed RNA polymerase II, chain RPB7 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 170 F F F 1niw 2 B,D,F,H B,D,F,H NOS3_HUMAN EC-NOS, NOS (TYPE III), NOSIII, ENDOTHELIAL NOS, ENOS, CONSTITUTIVE NOS, CNOS RKKTFKEVANAVKISASLMG 20 T 2.3 DUF2774 pdbhh F Eukaryota T 1njt 2 E,F,G,H E,F,G,H Peptidomimetic Inhibitor XVXXAX 6 T 280 B_solenoid_ydck pdbhh F F 1nkk 2 E,F,G,H E,F,G,H Peptidomimetic inhibitor XVXXAX 6 T 280 B_solenoid_ydck pdbhh F F 1nlo 2 B N NL1 (MN7-MN2-MN1-PLPPLP) XXXXPLPPLPX 11 T 22 SCIMP pdbhh F F 1nlp 2 B N NL2 (MN8-MN1-PLPPLP) XXXPLPPLPX 10 T 17 SCIMP pdbhh F F 1nlt 2 B B Seven residue peptide GWLYEIS 7 T 2.2 DUF4907 pdbhh F T 1nlu 2 B,C B,C PSEUDO-IODOTYROSTATIN XXX 3 T 170 GM130_C pdbhh F F 1nnu 2 C,D C,D Q9BH77_PLAFA ENOYL-ACP-REDUCTASE YTFIDYAIEYSEKYAPLRQKLLSTDIGSVASFLLSRESRAITGQTIYVDNGLNIMFLPDD 60 F F Eukaryota T 1nop 3 E C topoisomerase I-derived peptide KLNYLDPR 8 T 0.0021 Topo_C_assoc pdbhh F T 1not 1 A A CAIA_CONGE GI ALPHA CONOTOXIN ECCNPACGRHYSCX 14 T 0.039 Enterotoxin_ST pdbhh F Eukaryota T 1nrm 1 A,B A,B VALYL GRAMICIDIN XGAXAXVXWXWXWXWX 16 T 4.6 MAP17 pdbhh F F 1nrn 3 C R PAR1_HUMAN RECEPTOR BASED PEPTIDE NRS LDPRSFLLRNPNDKYEPFWEDEE 23 T 2.1 DUF4710 pdbhh F Eukaryota T 1nro 3 C R PAR1_HUMAN RECEPTOR BASED PEPTIDE NRP LDPRPFLLRNPNDKYEPFWEDEEKNES 27 T 2.6 SYCP2_SLD pdbhh F Eukaryota T 1nrp 3 C R PAR1_HUMAN RECEPTOR BASED PEPTIDE NR'S LDPXSFLLRNPNDKYEPFWEDEE 23 T 2.1 DUF4710 pdbhh F Eukaryota T 1nrq 3 C R PAR1_HUMAN RECEPTOR BASED PEPTIDE D-FPR'S XPXSXLLRNPNDKYEPFWEDEE 22 T 1.7 DUF4710 pdbhh F Eukaryota T 1nrr 3 C R PAR1_HUMAN THROMBIN RECEPTOR, COAGULATION FACTOR II RECEPTOR FLLRNPNDKYEPFWEDEE 18 T 0.93 SYCP2_SLD pdbhh F Eukaryota T 1nrs 3 C R RECEPTOR BASED PEPTIDE NRP LDPR 4 T 49 EF-hand_14 pdbhh F F 1nru 1 A,B A,B VALYL GRAMICIDIN XGAXAXVXWXWXWXWX 16 T 4.6 MAP17 pdbhh F F 1nt5 1 A,B A,B VALYL GRAMICIDIN XGAXAXVXWXWXWXWX 16 T 4.9 RCR pdbhh F F 1nt6 1 A,B A,B GRAMICIDIN C XGAXAXVXWXYXWXWX 16 T 3.7 DUF5848 pdbhh F T 1ntv 2 B B Apolipoprotein E Receptor-2 peptide NFDNPVYRKT 10 T 3.3 DUF3498 pdbhh F T 1ntx 1 A _ 3S11_DENPO ALPHA-NEUROTOXIN RICYNHQSTTRATTKSCEENSCYKKYWRDHRGTIIERGCGCPKVKPGVGIHCCQSDKCNY 60 T 0.051 Lentiviral_Tat pdb F Eukaryota T 1nu2 2 B B peptide derived from murine Apolipoprotein E Receptor-2 NFDNPVYRKT 10 T 3.3 DUF3498 pdbhh F T 1nu8 2 C D 3-mer peptide IPI 3 T 95 CBM46 pdbhh F F 1nvq 2 B B Peptide ASVSA ASVSA 5 T 490 DUF2121 pdbhh F F 1nvr 2 B B Peptide ASVSA ASVSA 5 T 490 DUF2121 pdbhh F F 1nvs 2 B B Peptide ASVSA ASVSA 5 T 490 DUF2121 pdbhh F F 1nwd 2 B,C B,C DCE_PETHY GAD GSHKKTDSEVQLEMITAWKKFVEEKKKK 28 T 0.013 DUF4951 pdb F Eukaryota T 1nx0 3 E E Small molecule inhibitor AKAIA 5 T 140 DUF688 pdbhh F F 1nxn 1 A A CONTRYPHAN-VN, MAJOR FORM (CIS CONFORMER) GDCPXKPWCX 10 T 0.53 Thioredoxin_4 pdbhh F T 1ny2 4 D 4 KNG1_HUMAN Inhibitor peptide RPPGF RPPGF 5 T 3.2E-05 Bradykinin unphh F Eukaryota F 1nyb 2 B A REGN_BPPH3 Probable regulatory protein N ESKGTAKSRYKARRAELIAERR 22 T 0.17 N36 unphh T Viruses T 1nzl 2 C C Doubly phosphorylated peptide ligand (PQpYEpYIPI) PQXEXIPA 8 T 0.29 Ac110_PIF pdbhh F T 1nzq 3 C D Decapeptide Hirudin Analogue XYEPIPEEXXQ 11 T 0.91 Hirudin pdbhh F T 1nzs 1 A A OPSD_BOVIN 19-mer peptide fragment of RHODOPSIN DDEASTTVSKTETSQVAPA 19 T 110 DUF5840 pdbhh F Eukaryota T 1nzv 2 C C Doubly phosphorylated peptide PQpYIpYVPA PQXIXVPA 8 T 1.7 DUF3300 pdbhh F T 1o06 1 A A VPS27_YEAST Vacuolar protein sorting-associated protein VPS27 EEDPDLKAAIQESLREAEEA 20 T 2.1E-05 UIM pdbhh F Eukaryota T 1o0d 3 C D Decapeptide Hirudin Analogue XYEPIPEEXXQ 11 T 0.91 Hirudin pdbhh F T 1o20 1 A A PROA_THEMA GPR, GLUTAMATE-5-SEMIALDEHYDE DEHYDROGENASE, GLUTAMYL-GAMMA-SEMIALDEHYDE DEHYDROGENASE, GSA DEHYDROGENASE MGSDKIHHHHHHMDELLEKAKKVREAWDVLRNATTREKNKAIKKIAEKLDERRKEILEANRIDVEKARERGVKESLVDRLALNDKRIDEMIKACETVIGLKDPVGEVIDSWVREDGLRIARVRVPIGPIGIIYESRPNVTVETTILALKSGNTILLRGGSDALNSNKAIVSAIREALKETEIPESSVEFIENTDRSLVLEMIRLREYLSLVIPRGGYGLISFVRDNATVPVLETGVGNCHIFVDESADLKKAVPVIINAKTQRPGTCNAAEKLLVHEKIAKEFLPVIVEELRKHGVEVRGCEKTREIVPDVVPATEDDWPTEYLDLIIAIKVVKNVDEAIEHIKKYSTGHSESILTENYSNAKKFVSEIDAAAVYVNASTRFTDGGQFGFGAEIGISTQRFHARGPVGLRELTTYKFVVLGEYHVRE 427 T 2.7E-07 Aldedh unppercent F Bacteria T 1o53 1 A A PTGA_SALTI EIIA-GLC, GLUCOSE-PERMEASE IIA COMPONENT, PHOSPHOTRANSFERASE ENZYME II, A COMPONENT, EIII-GLC GLFDKLKSLVSDDKK 15 T 1.1 Antimicrobial20 pdbhh F Bacteria T 1o6k 2 B C GSK3B_HUMAN GSK-3 BETA GRPRTTSFAE 10 T 6.3 DUF3084 pdbhh F Eukaryota T 1o6l 2 B C GSK3B_HUMAN GSK-3 BETA GRPRTTSFAE 10 T 6.3 DUF3084 pdbhh F Eukaryota T 1o6o 2 D,E,F D,E,F NSP1_YEAST NUCLEAR PORE PROTEIN NSP1, NUCLEOSKELETAL-LIKE PROTEIN, P110, NSP1, YJL041W, J1207 MGSSTKSNEKKDSGSSKPAFSFGAKPDEKKNDEVSKPAFSFGAKANEKKESDESKSAFSFGSKPTGKEEGDGAKAAISFGAKPEEQKSSDTSKPAFTFGAQKDNEKKTEESSTGKSMQA 119 T 0.26 SHIPPO-rpt pdbpercent F Eukaryota T 1o8y 1 A A SFTI1_HELAN SFTI-1 GRCTKSIPPICFPD 14 T 0.0029 Bowman-Birk_leg pdb F Eukaryota T 1o8z 1 A A SFTI1_HELAN SFTI-1 GRCTKSIPPICFPD 14 T 0.0029 Bowman-Birk_leg pdb F Eukaryota T 1o9d 2 B P Q40409_NICPL PLASMA MEMBRANE H+ ATPASE QSYTV 5 T 130 DUF4642 pdbhh F Eukaryota F 1o9f 2 B P Q40409_NICPL PLASMA MEMBRANE H+ ATPASE QSYTV 5 T 130 DUF4642 pdbhh F Eukaryota F 1o9k 3 I,J,K,L P,Q,R,S E2F1_HUMAN PBR3, PRB-BINDING PROTEIN E2F-1, RETINOBLASTOMA-ASSOCIATED PROTEIN 1 LDYHFGLEEGEGIRDLFD 18 T 7.5 Guanylin pdbhh F Eukaryota T 1o9u 2 B B AXIN1_HUMAN AXIS INHIBITION PROTEIN 1, HAXIN, AXIN1, AXIN VEPQKFAEELIHRLEAVQ 18 T 0.19 Cwf_Cwc_15 unppssm F Eukaryota T 1oai 2 B B FXFG NUCLEOPORIN PEPTIDE DSGFSFGSK 9 T 5.6 Peptidase_S9 pdbhh F F 1ob4 1 A A CEPHAIBOL A XFXXXXGLXXPQXPXPX 17 T 7.7 Pep_deformylase pdbhh F T 1ob6 1 A,B A,B CEPHAIBOL B XFXXXXGLXXPQXPXPX 17 T 0.28 Pep_deformylase pdbhh F T 1ob7 1 A A CEPHAIBOL C XFXXXXGLXXPQXXPXPX 18 T 7.7 Pep_deformylase pdbhh F T 1obx 2 B B IL5RA_HUMAN IL-5R-ALPHA, CD125 ANTIGEN ETLEDSVF 8 T 15 DUF5588 pdbhh F Eukaryota T 1oby 2 C,D P,Q SDC4_HUMAN AMPHIGLYCAN, SYND4, RYUDOCAN CORE PROTEIN TNEFYA 6 T 3.1 Herpes_gE unphh F Eukaryota T 1obz 2 C P IL5RA_HUMAN IL-5R-ALPHA, CD125 ANTIGEN ETLEDSVF 8 T 15 DUF5588 pdbhh F Eukaryota T 1odf 1 A A TDA10_YEAST YGR205W MCDKSKTVLDYTIEFLDKYIPEWFETGNKCPLFIFFSGPQGSGKSFTSIQIYNHLMEKYGGEKSIGYASIDDFYLTHEDQLKLNEQFKNNKLLQGRGLPGTHDMKLLQEVLNTIFNNNEHPDQDTVVLPKYDKSQFKGEGDRCPTGQKIKLPVDIFILEGWFLGFNPILQGIENNDLLTGDMVDVNAKLFFYSDLLWRNPEIKSLGIVFTTDNINNVYGWRLQQEHELISKVGKGMTDEQVHAFVDRYMPSYKLYLNDFVRSESLGSIATLTLGIDSNRNVYSTKTRCIE 290 T 0.00012 AAA_16 pdbpercent F Eukaryota T 1oeb 2 C,D C,D LCP2_MOUSE SH2 DOMAIN-CONTAINING LEUCOCYTE PROTEIN OF 76 KDA, SLP-76 TYROSINE PHOSPHOPROTEIN, SLP76 PAPSIDRSTKPPL 13 T 51 Chi-conotoxin pdbhh F Eukaryota T 1oeh 1 A A PRIO_HUMAN PRP, PRION PROTEIN, PRP27-30, PRP33-35C HGGGWGQP 8 T 0.095 Prion_octapep pdb F Eukaryota F 1oei 1 A A PRIO_HUMAN PRP, PRION PROTEIN, PRP27-30, PRP33-35C HGGGWGQPHGGGWGQPHGGGWGQP 24 T 0.0082 Prion_octapep pdbhh F Eukaryota F 1oex 2 B B INHIBITOR H261 XHPFAXIH 8 T 13 SoDot-IcmSS pdbhh F T 1of2 3 C C VIPR1_HUMAN PEPTIDE VIPR RRKWRRWHL 9 F F Eukaryota F 1of5 1 A A MEX67_YEAST MEX67, YPL169C, P2520 QQFFFENDALGQSSTDFATNFLNLWDNNREQLLNLYSPQSQFSVSVDSTIPPSTVTDSDQTPAFGYYMSSSRNISKVSSEKSIQQRLSIGQESINSIFKTLPKTKHHLQEQPNEYSMETISYPQINGFVITLHGFFEETGKPELESNKKTGKNNYQKNRRYNHGYNSTSNNKLSKKSFDRTWVIVPMNNSVIIASDLLTVRAYSTGAWKTASIAIAQAAGS 221 T 9.9E-08 NTF2 unppssm F Eukaryota T 1ogt 3 C C VIPR1_HUMAN VPAC1, VIP-R-1, PITUITARY ADENYLATE CYCLASE ACTIVATING POLYPEPTIDE TYPE II RECEPTOR, PACAP TYPE II RECEPTOR, PACAP-R-2 RRKWRRWHL 9 F F Eukaryota F 1ohe 2 B B PEPTIDE LIGAND XASP 4 T 530 LSPR pdbhh F F 1okv 3 E,F E,F H-ARG-ARG-LEU-ILE-PHE-NH2 RRLIFX 6 T 83 VirDNA-topo-I_N pdbhh F F 1okw 3 E,F E,F ACE-ARG-ARG-LEU-ASN-FCL-NH2 XRRLNXX 7 T 39 CENP_C_N pdbhh F F 1okx 2 C,D C,D SCYPTOLIN A XATTLXXV 8 T 1400 40S_S4_C pdbhh F F 1ol1 3 E,F F,H CIR-CIR-LEU-ILE-PFF-NH2 XXLIXX 6 T 83 VirDNA-topo-I_N pdbhh F F 1ol2 3 E,F E,F ARG-ARG-LEU-ASN-PFF-NH2 RRLNXX 6 T 26 CENP_C_N pdbhh F F 1ola 2 B B PEPTIDE VAL-LYS-PRO-GLY VKPG 4 T 41 DUF3617 pdbhh F F 1olc 2 B B LYS-LYS-LYS-ALA KKKA 4 T 600 Insulin_TMD pdbhh F F 1oln 2 B B THCL_STRAJ ALANINAMIDE, BRYAMYCIN, THIACTIN XIAXASXTXXXXTXXXXXX 19 T 0.93 CCER1 pdbhh F Bacteria F 1om2 2 B B ALDH2_RAT ALDH GPRLSRLLSYA 11 T 9.7 TFIID_30kDa pdbhh F Eukaryota T 1om9 2 B,D P,Q CCD91_HUMAN 15-mer peptide fragment of p56 DDDDFGGFEAAETFD 15 T 0.086 DUF5102 pdbhh F Eukaryota T 1oo4 2 B B 8-mer peptide from PDGFr SVDXVPML 8 T 0.57 Frem_N pdbhh F T 1oqp 2 B B KAR1_YEAST Cell division control protein KAR1 KKRELIESKWHRLLFHDKK 19 T 6.1 TnpW pdbhh F Eukaryota T 1or8 2 B,C,D,E B,C,D,E Substrate peptide GGRGGFGGRGGFGGRGGFG 19 T 49 Sde2_N_Ubi pdbhh F F 1orh 2 B B Substrate peptide GGFGGRGGFG 10 T 0.19 Sde2_N_Ubi pdbhh F F 1osg 2 G,H,I,J,K,L G,J,H,I,K,L BR3 derived PEPTIDE CHWDLLVRHWVC 12 T 0.38 TetM_leader pdbhh F T 1osz 3 C C VSV-8 RGYLYQGL 8 T 1.1 Cap4_nuclease pdbhh F F 1ot5 2 C,D C,D Ac-Ala-Lys-boroArg N-acetylated boronic acid peptide inhibitor XAKX 4 T 540 Mtf2_C pdbhh F F 1ou8 2 C,D C,D synthetic ssrA peptide GRHGAANDENY 11 T 10 Tox-HDC pdbhh F T 1ov3 2 C,D C,D P22 PHAGOCYTE B-CYTOCHROME, NEUTROPHIL CYTOCHROME B, 22 KDA POLYPEPTIDE, P22-PHOX, P22PHOX, CYTOCHROME B558, ALPHA CHAIN, CYTOCHROME B-245 ALPHA-SUBUNIT LIGHT CHAIN, SUPEROXIDE- GENERATING NADPH OXIDASE LIGHT CHAIN SUBUNIT KQPPSNPPPRPPAEARKK 18 T 0.00018 Cytochrom_B558a pdbhh F T 1ovf 2 B B DACTINOMYCIN TXPXXXTXPXX 11 T 61 DUF5572 pdbhh F F 1ow6 2 D,E D,F PAXI_HUMAN Paxillin ATRELDELMASLS 13 T 0.99 SAM_LFY pdbhh F Eukaryota T 1ow7 2 D,E,F D,E,F PAXI_HUMAN Paxillin ATRELDELMASLS 13 T 0.99 SAM_LFY pdbhh F Eukaryota T 1ox1 2 B B SYNTHETIC PEPTIDE INHIBITOR SCTRSIPPQCY 11 T 0.0036 Bowman-Birk_leg pdb F T 1ox9 2 B,D,F,H,J,L,N,P I,J,K,L,M,N,O,P ssrA AANDENYA 8 T 17 DUF6231 pdbhh F F 1oxn 2 F F AEAVPWKSE peptide AEAVPWKSE 9 T 13 LodA_C pdbhh F T 1oy7 2 F F AEVVAVKSE peptide AEVVAVKSE 9 T 82 DUF6068 pdbhh F F 1ozz 1 A A DEFN_ARCDE defensin ARD1 DKLIGSCVWGAVNYTSNCNAECKRRGYKGGHCGSFANVNCWCET 44 T 0.00026 Toxin_3 pdbpssm F Eukaryota T 1p00 1 A A DEFN_ARCDE defensin ARD1 DKLIGSCVWGAVNYTSNCRAECKRRGYKGGHCGSFANVNCWCET 44 T 0.00019 Toxin_3 pdbpssm F Eukaryota T 1p02 2 B P METHOXYSUCCINYL-ALA-ALA-PRO-ALANINE BORONIC ACID INHIBITOR XAAPX 5 T 730 Trp_leader1 pdbhh F F 1p03 2 B P METHOXYSUCCINYL-ALA-ALA-PRO-VALINE BORONIC ACID INHIBITOR XAAPX 5 T 450 DUF3458 pdbhh F F 1p04 2 B P METHOXYSUCCINYL-ALA-ALA-PRO-ISOLEUCINE BORONIC ACID INHIBITOR XAAPX 5 T 330 Fst_toxin pdbhh F F 1p05 2 B P METHOXYSUCCINYL-ALA-ALA-PRO-NORLEUCINE BORONIC ACID INHIBITOR XAAPX 5 T 970 Pep_deformylase pdbhh F F 1p06 2 B P METHOXYSUCCINYL-ALA-ALA-PRO-PHENYLALANINE BORONIC ACID INHIBITOR XAAPX 5 T 170 DUF3054 pdbhh F F 1p0a 1 A A DEFN_ARCDE DEFENSIN ARD1 DKLIGSCVWGAVNYTSNCNAECKRRGYKGGHCGSFLNVNCWCET 44 T 0.00019 Toxin_3 pdbpercent F Eukaryota T 1p0g 1 A A RL1_HELPY RIBOSOMAL PROTEIN L1 AKKVFKRLEKLFSKIQNDKX 20 T 0.042 Tower pdb F Bacteria T 1p0l 1 A A RL1_HELPY RIBOSOMAL PROTEIN L1 AKKVFKRLEKLFSKIWNDKX 20 T 0.088 Antimicrobial_7 pdbhh F Bacteria T 1p0o 1 A A RL1_HELPY RIBOSOMAL PROTEIN L1 AKKVFKRLEKLFSKIWNWKX 20 T 0.37 Antimicrobial_7 pdbhh F Bacteria T 1p10 2 B P METHOXYSUCCINYL-ALA-ALA-PRO-VALINE BORONIC ACID INHIBITOR XAAPX 5 T 450 DUF3458 pdbhh F F 1p11 2 B P PHOSPHONATE ESTER INHIBITOR A XAAPXXA 7 T 1300 DUF4478 pdbhh F F 1p11 3 C I PHOSPHONATE ESTER INHIBITOR B(TRANSITION STATE) XAAPX 5 T 970 Pep_deformylase pdbhh F F 1p12 2 B I PHOSPHONATE ESTER INHIBITOR XAAPXXA 7 T 1300 DUF4478 pdbhh F F 1p13 2 C,D C,D Peptide CDXANFK 7 T 1.1 Whi5 pdbhh F T 1p22 3 C C CTNB1_HUMAN PRO2286 KAAVSHWQQQSYLDSGIHSGATTTAP 26 T 13 AvrPto pdbhh F Eukaryota T 1p4b 3 C P GCN4(7P-14P) peptide AHLENEVARLKK 12 T 1.1 WD40_alt pdbhh F T 1p4n 2 B B UDP-MurNAc-pentapeptide XXKXX 5 T 230 OAM_dimer pdbhh F F 1p5k 1 A A RL1_HELPY RIBOSOMAL PROTEIN L1 AKKVFKRLEKSFSKIQNDKX 20 T 3.5 PRTRC_E unppssm F Bacteria T 1p5l 1 A A RL1_HELPY RIBOSOMAL PROTEIN L1 AKKVSKRLEKLFSKIQNDKX 20 T 0.025 Tower pdb F Bacteria T 1p7q 3 C C POL_HV1Z2 POL polyprotein ILKEPVHGV 9 T 0.56 DUF2115 pdbhh T Viruses T 1p7v 2 B B inhibitor peptide PAPFAAA 7 T 140 PapC_C pdbhh F F 1p7w 2 B B inhibitor peptide PAPFASA 7 T 91 DUF5358 pdbhh F F 1p8j 2 I,J,K,L,M,N,O,P J,K,L,M,N,P,Q,R DECANOYL-ARG-VAL-LYS-ARG-CHLOROMETHYLKETONE INHIBITOR XRVKXX 6 T 280 MIB_HERC2 pdbhh F F 1p9f 1 A A TKNK_HUMAN NKB, NEUROMEDIN K, ZNEUROK1 DMHDFFVGLM 10 T 0.0032 Tachykinin pdbhh F Eukaryota T 1p9u 2 G,H G,H PHQ-VNSTLQ-CHLOROMETHYLKETONE INHIBITOR XVNSTLQX 8 T 11 Peptidase_C98 pdbhh F T 1pad 2 B I ACAAPACK XAAFAX 6 T 500 UPA pdbhh F F 1pau 3 C C ACE-ASP-GLU-VAL-ASJ XDEVX 5 T 570 Helicase_RecD pdbhh F F 1pbz 1 A,B A,B De novo designed cyclic peptide XCGAEAAKAHAKAAEAGCX 19 T 14 DUF3721 pdbhh F T 1pcg 2 C,D E,F peptide inhibitor KXILCRLLQ 9 T 0.67 TMEM95 pdbhh F T 1pd1 2 B B DxE cargo sorting signal peptide of yeast Sys1 protein QLKDLESQI 9 T 2.3 NuA4 pdbhh F T 1pd7 2 B B MAD1_HUMAN Mad1 VRMNIQMLLEAADYLERREREAEH 24 T 3 LMBR1 unp F Eukaryota T 1pef 1 A A PEPTIDE F (EQLLKALEFLLKELLEKL) EQLLKALEFLLKELLEKL 18 T 1.4 RnlA-toxin_DBD pdbhh F T 1peh 1 A _ PCY1A_RAT CYTIDYLYLTRANSFERASE MEMBRANE BINDING DOMAIN PEPTIDE XNEKKYHLQERVDKVKKKVKDVEEKSKEWVQKVEX 35 T 0.02 AKNA pdbpercent F Eukaryota T 1pek 2 B C PEPTIDE PRO-ALA-PRO-PHE PAPF 4 T 69 DUF2316 pdbhh F F 1pek 3 C D D-DAL-ALA-NH2 XAX 3 T 2500 zf-met pdbhh F F 1pfe 2 B B QUINOMYCIN A XAXXXXAXXX 10 T 190 RSF pdbhh F F 1pfg 2 B B N-Ac-PAPFAAAA-NH2 XPAPFAAAAX 10 T 130 MAD pdbhh F F 1pg1 1 A _ PG1_PIG PROTEGRIN-1 RGGRLCYCRRRFCVCVGRX 19 T 0.16 Defensin_1 pdbhh F Eukaryota T 1pgi 1 A A D-GLUCOSE 6-PHOSPHATE ISOMERASE XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 514 F F F 1pgv 1 A A TMOD_CAEEL TMD-1; TROPOMODULIN PROTEIN 1, ISOFORM A GSHGTTFNGIMQSYVPRIVPDEPDNDTDVESCINRLREDDTDLKEVNINNMKRVSKERIRSLIEAACNSKHIEKFSLANTAISDSEARGLIELIETSPSLRVLNVESNFLTPELLARLLRSTLVTQSIVEFKADNQRQSVLGNQVEMDMMMAIEENESLLRVGISFASMEARHRVSEALERNYERVRLRRLGKDPNV 197 T 0.018 LRR_6 pdbpercent F Eukaryota T 1pic 2 B B BETA-PLATELET-DERIVED GROWTH FACTOR RECEPTOR XXVPML 6 T 27 Frem_N pdbhh F F 1pip 2 B B SUCCINYL-GLN-VAL-VAL-ALA-ALA-P-NITROANILIDE XQVVAAX 7 T 390 SNAD1 pdbhh F F 1piv 1 A,A10,A11,A12,A13,A14,A15,A2,A3,A4,A5,A6,A7,A8,A9 0,0,0,0,0,0,0,0,0,0,0,0,0,0,0 POLIOVIRUS TYPE 3 (SUBUNIT VP1) ISEV 4 T 250 Tnp_22_trimer pdbhh F F 1pj8 2 B I 6-residue peptide (N-Ac-PAPFPA-NH2) PAPFPAX 7 T 130 RcsF pdbhh F F 1pjn 1 A A HIBN_XENLA Histone-binding protein N1/N2 RKKRKTEEESPLKDKAKKSKG 21 T 19 CMS1 pdbhh F Eukaryota T 1pjp 2 B I SUCCINYL-ALA-ALA-PRO-PHE-CHLOROMETHYLKETONE INHIBITOR XAAPXX 6 T 230 zf-C2H2_jaz pdbhh F F 1plw 1 A A PENK_HUMAN Met-enkephalin 1 YGGFM 5 T 1.5 Op_neuropeptide pdb F Eukaryota F 1plx 1 A A PENK_HUMAN Met-enkephalin 1 YGGFM 5 T 1.5 Op_neuropeptide pdb F Eukaryota F 1pmx 2 B B IGF-1 ANTAGONIST F1-1 RNCFESVAALRRCMYG 16 T 2.6 DUF4695 pdbhh F T 1pn3 2 C,D C,D DESVANCOSAMINYL VANCOMYCIN XXNXXXX 7 T 95 P53_C pdbhh F F 1pnv 2 C C VANCOMYCIN XXNXXXX 7 T 95 P53_C pdbhh F F 1po1 1 A 0 P1/MAHONEY GSSST 5 T 190 DltD pdbhh F F 1po2 1 A 0 P1/MAHONEY GSSST 5 T 190 DltD pdbhh F F 1pop 2 B B LEUPEPTIN XLLX 4 T 590 DNTTIP1_dimer pdbhh F F 1pp5 1 A A MCJA_ECOLX microcin J25 GGAGHVPEYFVGIGTPISFYG 21 T 0.13 Endonuc-BglII unp F Bacteria T 1ppg 2 B I MEO-SUCCINYL-ALA-ALA-PRO-VAL CHLOROMETHYLKETONE XAAPXX 6 T 1300 DUF726 pdbhh F F 1pq8 2 B C GLY-GLY-ARG PEPTIDE GGR 3 T 140 PriCT_1 pdbhh F F 1prl 2 B A PROLINE-RICH LIGAND PLR1 (AFAPPLPRR) AFAPPLPRR 9 T 2.9 FAA_hydro_N_2 pdbhh F F 1prm 2 B A PROLINE-RICH LIGAND PLR1 (AFAPPLPRR) AFAPPLPRR 9 T 2.9 FAA_hydro_N_2 pdbhh F F 1psb 2 C,D C,D STK38_HUMAN Ndr Ser/Thr kinase-like protein KRLRRSAHARKETEFLRLKRTRLGLE 26 T 3.9 Pam17 pdbhh F Eukaryota T 1psm 1 A _ Q9NIG6_PLAFA SPAM-H1 EAYKKAKQASQDAEQAAKDAENASKEAEEAAKEAVNLK 38 T 0.0019 Alanine_zipper pdbpercent F Eukaryota T 1pso 2 B I PEPSTATIN XVVXAX 6 T 1700 FAM60A pdbhh F F 1psv 1 A _ PDA8D KPYTARIKGRTFSNEKELRDFLETFTGR 28 T 0.56 Glyco_transf_61 pdbhh F T 1pts 2 C P PEPTIDE (FSHPQNT) FSHPQNT 7 T 14 PmoA pdbhh F T 1ptt 2 B B AC-DEPYL-NH2 XDEXL 5 T 110 DNase_NucA_NucB pdbhh F F 1ptu 2 B B DADEPYL-NH2 DADEXLX 7 T 0.61 Glyco_transf_92 pdbhh F F 1pvc 1 A,A10,A11,A12,A13,A14,A15,A2,A3,A4,A5,A6,A7,A8,A9 0,0,0,0,0,0,0,0,0,0,0,0,0,0,0 POLIOVIRUS TYPE 3, SABIN STRAIN ISEV 4 T 250 Tnp_22_trimer pdbhh F F 1pwv 2 C,D C,D LF20 MLARRKKVYPYPMEPTIAEG 20 T 5.8 DHHA2 pdbhh F T 1pww 2 C,D C,D LF20 MLARRKKVYPYPMEPTIAEG 20 T 5.8 DHHA2 pdbhh F T 1pxd 2 B B LECB3_ARTIN JACALIN BETA-3 CHAIN DEQSGISQTVIVGPWGAKVS 20 T 3.2 DUF3842 pdbhh F Eukaryota T 1py1 2 E,F,G,H E,F,G,H BACE1_HUMAN BETA-SITE APP CLEAVING ENZYME, BETA-SITE AMYLOID PRECURSOR PROTEIN CLEAVING ENZYME, ASPARTYL PROTEASE 2, ASP 2, ASP2, MEMBRANE-ASSOCIATED ASPARTIC PROTEASE 2, MEMAPSIN-2 ADDISLLK 8 T 0.72 CD34_antigen unphh F Eukaryota T 1pyh 1 A A PHOTOSYNTHETIC REACTION CENTER L SUBUNIT XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 281 F F F 1pyh 2 B B PHOTOSYNTHETIC REACTION CENTER M SUBUNIT XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 302 F F F 1pyh 3 C C PHOTOSYNTHETIC REACTION CENTER H SUBUNIT XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 241 F F F 1pyh 4 AA,CA,D,EA,F,GA,H,J,L,N,P,R,T,V,X,Z 1,3,D,5,F,7,H,J,L,N,P,R,T,V,X,Z ANTENNA PIGMENT PROTEIN, ALPHA CHAIN XXXXXXXXXXXXXXXXXXXXXXXXXX 26 F F F 1pyh 5 BA,DA,E,FA,G,HA,I,K,M,O,Q,S,U,W,Y 2,4,E,6,G,8,I,K,M,O,Q,S,U,W,Y ANTENNA PIGMENT PROTEIN, BETA CHAIN XXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 30 F F F 1pyk 1 A A PYRUVATE KINASE XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 432 F F F 1pyo 3 C,F E,F YKR18_YEAST ACETYL-LEU-ASP-GLU-SER-ASJ XLDESX 6 F F Eukaryota F 1pyz 1 A,B A,B MIMOCHROME IV, MINIATURIZED METALLOPROTEIN XESQLHSNKRX 11 T 11 DUF3949 pdbhh F T 1pz5 3 C C Octapeptide (MDWNMHAA) MDWNMHAA 8 T 6.4 DUF2969 pdbhh F T 1q1a 2 B B H4_YEAST Histone H4 KGGAXRHRKI 10 T 4.2 Shadoo unppercent F Eukaryota T 1q1s 1 A,B A,B LT_SV40 Large T antigen PGSDDEAAADAQHAAPPKKKRKVE 24 T 0.28 FAM60A unppercent T Viruses T 1q1t 1 A,B A,B LT_SV40 Large T antigen PGSDDEAAADAQHAAPPKKKRKVEY 25 T 0.28 FAM60A unppercent T Viruses T 1q2c 2 B B Histone H4 peptide SGRGKGGKGLGKGGAKRHR 19 T 180 DUF1884 pdbhh F T 1q2d 2 B B 19-mer peptide fragment from p53 Tumor Suppressor NTSSSPQPKKKPLDGEYFT 19 T 0.3 P53_tetramer pdbhh F T 1q3m 1 A A OSTCN_BOVIN GAMMA-CARBOXYGLUTAMIC ACID-CONTAINING PROTEIN, BONE GLA-PROTEIN, BGP YLDHWLGAPAPYPDPLEPKREVCELNPDCDELADHIGFQEAYRRFYGPV 49 T 0.14 Toxin_23 unppercent F Eukaryota T 1q3p 2 C,D C,D C-TERMINAL HEXAPEPTIDE FROM GKAP EAQTRL 6 T 1.8 GKAP pdbhh F T 1q40 2 B,D B,D MEX67_CANAL MEX67 MSPETMFFQDEDSRNLATNFIANYLKLWDANRSELMILYQNESQFSMQVDSSHPHLIESGNSGYSGSTDFGYYLNNSRNLTRVSSIKARMAKLSIGQEQIYKSFQQLPKTRHDIIATPELFSMEVYKFPTLNGIMITLHGSFDEVAQPEVDGSASSAPSGPRGGSRYHSGPKHKRIPLSKKSFDRTFVVIPGPNGSMIVASDTLLIRPYTSDFPWKVQK 219 T 2E-07 NTF2 pdbpssm F Eukaryota T 1q4k 2 D,E,F D,E,F Phospho-peptide sequence Met.Gln.Ser.pThr.Pro.Leu MQSTPL 6 T 180 DUF5540 pdbhh F T 1q4q 2 K,L,M,N,O,P,Q,R,S,T K,L,M,N,O,P,Q,R,S,T DRONC_DROME DRONC SRPPFISLNERR 12 T 0.38 MMgT pdbhh F Eukaryota T 1q5l 2 B B ASN-ARG-LEU-LEU-LEU-THR-GLY PEPTIDE NRLLLTG 7 T 10 PgaPase_1 pdbhh F F 1q5z 1 A A SIPA_SALTY SIPA GPVDKAGTTDNDNSQTDKTGPFSGLKFKQNSFLSTVPSVTNMHSMHFDARETFLGVIRKALEPDTSTPFPVRRAFDGLRAEILPNDTIKSAALKAQCSDIDKHPELKAKMETLKEVITHHPQKEKLAEIALQFAREAGLTRLKGETDYVLSNVLDGLIGDGSWRAGPAYESYLNKPG 177 T 0.0058 DUF3288 pdbpercent F Bacteria T 1q68 2 B B LCK_HUMAN P56-LCK, LSK, T CELL-SPECIFIC PROTEIN-TYROSINE KINASE SHPEDDWLENIDVCENCHYPIVPLDGKGT 29 T 1.3 zf-ACC pdbhh F Eukaryota T 1q69 1 A A CD8A_HUMAN T-LYMPHOCYTE DIFFERENTIATION ANTIGEN T8/LEU-2 RNRRRVCKCPRPVVKSGDK 19 T 0.0033 RCR unphh F Eukaryota T 1q69 2 B B LCK_HUMAN P56-LCK, LSK, T CELL-SPECIFIC PROTEIN-TYROSINE KINASE SHPEDDWLENIDVCENCHYPIVPLDGKGT 29 T 1.3 zf-ACC pdbhh F Eukaryota T 1q71 1 A A MCJA_ECOLX microcin J25 GGAGHVPEYFVGIGTPISFYG 21 T 0.13 Endonuc-BglII unp F Bacteria T 1q7o 1 A A chemotactic peptide MLX 3 T 120 MIase pdbhh F F 1q8h 1 A A OSTCN_PIG BONE GLA PROTEIN,BGP,GAMMA-CARBOXYGLUTAMIC ACID-CONTAINING PROTEIN YLDHGLGAPAPYPDPLEPRREVCELNPDCDELADHIGFQEAYRRFYGIA 49 T 6.7 Cytomega_UL84 pdbhh F Eukaryota T 1q90 5 E R UCRIA_CHLRE RIESKE IRON-SULFUR PROTEIN, RISP AASSEVPDMNKRNIMNLILAGGAGLPITTLALGYGAFFVPPSSGGGGGG 49 T 0.00019 UCR_Fe-S_N pdbhh F Eukaryota T 1qbq 3 C P ACETYL-CYS-VAL-ILE-SELENOMET-COOH PEPTIDE XCVIM 5 T 140 CAF1-p150_C2 pdbhh F F 1qc6 2 B,D C,D PHE-GLU-PHE-PRO-PRO-PRO-PRO-THR-ASP-GLU-GLU FEFPPPPTDEE 11 T 0.14 ActA pdbhh F F 1qd7 10 J J S20 RIBOSOMAL PROTEIN XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 100 F F F 1qd8 1 A,B A,B VANCOMYCIN XXNXXXX 7 T 95 P53_C pdbhh F F 1qdu 3 C,F,I,L,O,R T,U,V,W,X,Y Z-EVD-DCBMK XEVDX 5 T 640 DUF5952 pdbhh F F 1qfd 1 A A IAAI_AMAHP PROTEIN (ALPHA-AMYLASE INHIBITOR) CIPKWNRCGPKMDGVPCCEPYTCTSDYYGNCS 32 T 0.022 Toxin_12 pdbpssm F Eukaryota T 1qfi 1 A,B,C A,B,C ACTINOMYCIN V TXXXXXTXPXX 11 T 61 DUF5572 pdbhh F F 1qfn 2 B B RIR1_ECOLI RIBONUCLEOTIDE REDUCTASE, B1 PROTEIN, R1 PROTEIN GAEDAQDDLVPSIQDDGSESGACKI 25 T 0.68 JmjN pdbhh F Bacteria T 1qg1 2 B I SHC1_HUMAN PROTEIN (SHC-DERIVED PEPTIDE) DDPSXVNVQNLDK 13 T 0.23 SH3-WW_linker pdbhh F Eukaryota T 1qhf 1 A,B A,B PMG1_YEAST PROTEIN (PHOSPHOGLYCERATE MUTASE) PKLVLVRHGQSEWNEKNLFTGWVDVKLSAKGQQEAARAGELLKEKKVYPDVLYTSKLSRAIQTANIALEKADRLWIPVNRSWRLNERHYGDLQGKDKAETLKKFGEEKFNTYRRSFDVPPPPIDASSPFSQKGDERYKYVDPNVLPETESLALVIDRLLPYWQDVIAKDLLSGKTVMIAAHGNSLRGLVKHLEGISDADIAKLNIPTGIPLVFELDENLKPSKPSYYLDPEAAAAGAAAV 240 T 7.6E-05 His_Phos_1 pdb F Eukaryota T 1qix 1 A A BETA-CASOMORPHIN-7 YPFVEPI 7 T 21 PCP pdbhh F T 1qja 2 C,D Q,R PHOSPHOPEPTIDE RLYHSLPA 8 T 2.9 DUF668 pdbhh F T 1qjb 2 C,D Q,S MT_POVBG PHOSPHOPEPTIDE ARSHSYPA 8 T 24 DUF3637 pdbhh T Viruses T 1qjj 2 B B PRO-LEU-GLY-HYDROXAMIC ACID PLGX 4 T 170 Prefoldin_3 pdbhh F F 1qka 2 B B LYS-ARG-LYS KRK 3 T 240 MazG_C pdbhh F F 1qkb 2 B B PEPTIDE LYS-VAL-LYS KVK 3 T 500 Whi5 pdbhh F F 1qls 2 B D ANXA1_HUMAN ANNEXIN I XAMVSAFLKQAW 12 T 3.3 DUF5680 pdbhh F Eukaryota T 1qmz 3 E,F E,F SUBSTRATE PEPTIDE HHASPRK 7 T 9 DUF1324 pdbhh F T 1qng 2 B D CYCLOSPORINE, CICLOSPORIN, CICLOSPORINE XXXXXXXXVXA 11 T 9.7 DUF6090 pdbhh F F 1qnh 2 C,D C,D CYCLOSPORINE, CICLOSPORIN, CICLOSPORINE XXXXXXXXVXA 11 T 9.7 DUF6090 pdbhh F F 1qp6 1 A,B A,B PROTEIN (ALPHA2D) GEVEELEKKFKELWKGPRRGEIEELHKKFHELIKG 35 T 0.0098 CZB pdb F T 1qqd 3 C C HLA-CW4 SPECIFIC PEPTIDE QYDDAVYKL 9 T 22 Cas_Cas02710 pdbhh F T 1qr1 3 C,F C,F ERBB2_HUMAN GP2 PEPTIDE IISAVVGIL 9 T 0.014 RIFIN unppercent F Eukaryota T 1qr3 2 B I FR901277 Inhibitor XXTXXFXV 8 T 1.1 Syntaxin-5_N pdbhh F F 1qrn 3 C C TAX PEPTIDE P6A LLFGYAVYV 9 T 0.96 CLPTM1 pdbhh F T 1qs3 1 A A CAIA_CONGE DES-GLU1-[CYS3ALA]-DES-CYS13-ALPHA CONOTOXIN GI CANPACGRHYSX 12 T 0.042 Enterotoxin_ST unphh F Eukaryota T 1qs7 2 B,D B,D MYLK_CHICK RS20 RRKWQKTGHAVRAIGRLSSSX 21 T 4.3 PACT_coil_coil pdbhh F Eukaryota T 1qs8 2 C,D C,D PEPSTATIN A XVVXAX 6 T 1700 FAM60A pdbhh F F 1qsc 2 D,E,F D,E,F CD40 RECEPTOR XYPIQET 7 T 3.3 stn_TNFRSF12A pdbhh F T 1qse 3 C C Tax Peptide V7R LLFGYPRYV 9 T 2.3 DUF5759 pdbhh F T 1qsf 3 C C TAX PEPTIDE Y8A LLFGYPVAV 9 T 0.21 DUF4504 pdbhh F T 1qsv 1 A A VGFR1_HUMAN FLT-1 SDTGRPFVEMYSEIPEIIHMTEGRELVIPCRVTSPNITVTLKKFPLDTLIPDGKRIIWDSRKGFIISNATYKEIGLLTCEATVNGHLYKTNYLTHRQTNTI 101 T 0.00047 Ig_2 pdbpercent F Eukaryota T 1qsz 1 A A VGFR1_HUMAN FLT-1 SDTGRPFVEMYSEIPEIIHMTEGRELVIPCRVTSPNITVTLKKFPLDTLIPDGKRIIWDSRKGFIISNATYKEIGLLTCEATVNGHLYKTNYLTHRQTNTI 101 T 0.00047 Ig_2 pdbpercent F Eukaryota T 1qtj 1 A,B A,B SAP XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 217 F F F 1qtn 3 C C ACETYL-ILE-GLU-THR-ASP-ALDEHYDE XIETX 5 T 890 Imm53 pdbhh F F 1qtx 2 B B MYLK_CHICK PROTEIN (RS20) RRKWQKTGHAVRAIGRLSSSX 21 T 4.3 PACT_coil_coil pdbhh F Eukaryota T 1qty 2 E,F,G,H X,Y,T,U VGFR1_HUMAN FLT-1 SDTGRPFVEMYSEIPEIIHMTEGRELVIPCRVTSPNITVTLKKFPLDTLIPDGKRIIWDSRKGFIISNATYKEIGLLTCEATVNGHLYKTNYLTHRQTNTI 101 T 0.00047 Ig_2 pdbpercent F Eukaryota T 1qur 3 C I BIVALENT INHIBITOR (BZA-2 HIRULOG) XXGGGGNGDYEPIPEEAXX 19 T 0.047 Hirudin pdbhh F T 1qvk 1 A A c-RW RRWWRF 6 T 0.14 VPS13 pdbhh F F 1qvl 1 A A c-RW RRWWRF 6 T 0.14 VPS13 pdbhh F F 1qwe 2 B B APP12 APPLPPRNRPRL 12 T 0.33 SCIMP pdbhh F F 1qwf 2 B B VSL12 VSLARRPLPPLP 12 T 0.73 DUF4522 pdbhh F T 1qx9 1 A A CTHL4_BOVIN CYCLOCP-11 ICLKKWPWWPWRRCKX 16 T 0.071 CoV_S2 pdbhh F Eukaryota T 1qxa 2 B B peptide GLY-GLY-GLY GGG 3 T 79 FTCD_C pdbhh F F 1qxq 1 A A CTHL4_BOVIN CP-11 ILKKWPWWPWRRKX 14 T 0.055 CoV_S2 pdbhh F Eukaryota T 1qz0 2 C,D,E,F C,D,E,F ASP-ALA-ASP-GLU-FTY-LEU-NH2 DADEXLX 7 T 0.61 Glyco_transf_92 pdbhh F F 1qz2 2 D,E G,H HS90B_HUMAN HSP90 MEEVD 5 T 120 NUSAP pdbhh F Eukaryota F 1qzv 1 A,Q A,P PLANT PHOTOSYSTEM I: SUBUNIT PSAA XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 726 F F F 1qzv 2 B,R B,Q PLANT PHOTOSYSTEM I: SUBUNIT PSAB XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 732 F F F 1qzv 3 C,S C,R PLANT PHOTOSYSTEM I: SUBUNIT PSAC XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 80 F F F 1qzv 4 D,T D,S PLANT PHOTOSYSTEM I: SUBUNIT PSAD XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 154 F F F 1qzv 5 E,U E,T PLANT PHOTOSYSTEM I: SUBUNIT PSAE XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 64 F F F 1qzv 7 G,W G,V PLANT PHOTOSYSTEM I: SUBUNIT PSAG XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 74 F F F 1qzv 8 H,X H,W PLANT PHOTOSYSTEM I: SUBUNIT PSAH XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 52 F F F 1qzv 9 I,Y I,Y PLANT PHOTOSYSTEM I: SUBUNIT PSAI XXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 30 F F F 1qzv 10 J,Z J,Z PLANT PHOTOSYSTEM I: SUBUNIT PSAJ XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 41 F F F 1qzv 11 AA,K 5,K PLANT PHOTOSYSTEM I: SUBUNIT PSAK XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 42 F F F 1qzv 12 BA,L 6,L PLANT PHOTOSYSTEM I: SUBUNIT PSAL XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 135 F F F 1qzv 13 CA,M 7,1 PLANT LIGHT HARVESTING COMPLEX I(LHCI): SUBUNIT LHCA1 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 109 F F F 1qzv 14 DA,N 8,2 PLANT LIGHT HARVESTING COMPLEX I(LHCI): SUBUNIT LHCA2 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 115 F F F 1qzv 15 EA,O 9,3 PLANT LIGHT HARVESTING COMPLEX I(LHCI): SUBUNIT LHCA3 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 117 F F F 1qzv 16 FA,P 0,4 PLANT LIGHT HARVESTING COMPLEX I(LHCI): SUBUNIT LHCA4 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 119 F F F 1r17 2 C,D C,D fibrinopeptide B NEEGFFFSARGHRPLD 16 T 0.26 PyrBI_leader pdbhh F T 1r1l 2 C C Antithrombin P14-P9 peptide XSEAAAS 7 T 580 CAP_N pdbhh F F 1r1l 3 D D EXOGENOUS TRIPEPTIDE formyl-(NLE)LF XLF 3 T 330 Bac_small_YrzI pdbhh F F 1r1p 2 E,F,G,H E,F,G,H LAT pY171 peptide XDDXVNV 7 T 0.21 DUF4692 pdbhh F F 1r1q 2 C,D C,D LAT pY191 peptide XREXVNV 7 T 0.42 LAT pdbhh F F 1r1r 2 B,D,F,G D,E,F,P RIR2_ECOLI RIBONUCLEOTIDE REDUCTASE R2 PROTEIN YLVGQIDSEVDTDDLSNFQL 20 T 12 UL11 pdbhh F Bacteria T 1r1s 2 B,D,F,H B,D,F,H LAT pY226 peptide XPDXENL 7 T 0.48 LAT pdbhh F T 1r2b 2 C,D C,D NCOR2_HUMAN N-COR2, SILENCING MEDIATOR OF RETINOIC ACID AND THYROID HORMONE RECEPTOR, SMRT, SMRTE, THYROID-, RETINOIC-ACID-RECEPTOR-ASSOCIATED CO-REPRESSOR, T3 RECEPTOR- ASSOCIATING FACTOR, TRAC, CTG REPEAT PROTEIN 26 GSLVATVKEAGRSIHEIPR 19 T 7.6 DUF211 pdbhh F Eukaryota T 1r42 2 B B disordered segment of collectrin homology domain XXXXXX 6 F F F 1r42 3 C C disordered segment of collectrin homology domain XXXXXXXXXXXXXXXXXXXX 20 F F F 1r42 4 D D disordered segment of collectrin homology domain XXXXXXXXXXXXXXXXXX 18 F F F 1r42 5 E E disordered segment of collectrin homology domain XXXXXXXXXXXXXX 14 F F F 1r4l 2 B B disordered segment of collectrin homology domain XXXXXX 6 F F F 1r4l 3 C C disordered segment of collectrin homology domain XXXXXXXXXXXXXXXXXXXX 20 F F F 1r4l 4 D D disordered segment of collectrin homology domain XXXXXXXXXXXXXXXXXX 18 F F F 1r4l 5 E E disordered segment of collectrin homology domain XXXXXXXXXXXXXX 14 F F F 1r4y 1 A A RNAS_ASPGI RRNA ENDONUCLEASE AVTWTCGGLLYNQNKAESNSHHAPLSDGKTGSSYPHWFTNGYDGDGKLPKGRTPIKFGKSDCDRPPKHSKDGNGKTDHYLLEFPTFPDGHDYKFDSKKPKENPGPARVIYTYPNKVFCGIIAHTKENQGELKLCSH 136 T 23 MtrE unphh F Eukaryota T 1r5u 11 K M TRANSCRIPTION FACTOR II B (TFIIB) XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 86 F F F 1r5v 2 B,E E,F artificial peptide ADLIAYPKAATKF 13 T 9.8 DUF3800 pdbhh F T 1r5w 3 E,F E,F artificial peptide ADLIAYFKAATKF 13 T 8 TMEM192 pdbhh F T 1r64 2 C,D C,D Ac-Arg-Glu-Lys-boroArg peptide inhibitor XREKX 5 T 180 YlaC pdbhh F F 1r8t 1 A A MP1 RCCHPQCGAAYSCRK 15 T 0.14 Enterotoxin_ST pdbhh F T 1r9u 1 A A ZERVAMICIN IIB XWIQXITXLXPQXPXPX 17 T 25 bpX0 pdbhh F T 1r9v 1 A A BOC-(D-NLE-L-NLE)4-D-NLE(METHYL)-L-NLE-D-NLE-L-NLE METHYL ESTER XXXXXXXXXXXXX 13 T 23 Mfp-3 pdbhh F F 1rdt 4 D E CBP_HUMAN LxxLL motif coactivator NLVPDAASKHKQLSELLRGGSGS 23 T 0.95 SRC-1 pdbhh F Eukaryota T 1re3 4 G,H G,H GHRP peptide GHRP 4 T 14 VPS38 unphh F F 1rf1 4 G,H,I,J G,H,I,J GHRP peptide GHRP 4 T 14 VPS38 unphh F F 1rf3 2 B B TNR3_HUMAN 24-residue peptide from Lymphotoxin-B Receptor PYPIPEEGDPGPPGLSTPHQEDGK 24 T 5.3 LAX unphh F Eukaryota T 1rff 3 E,F C,E Topoisomerase I-Derived Peptide KLNYYDPR 8 T 0.037 Topo_C_assoc pdbhh F T 1rfi 3 E,F C,E Topoisomerase I-Derived Peptide KLNYK 5 T 160 FAST_1 pdbhh F F 1rgj 2 B B MIMOTOPE OF THE NICOTINIC ACETYLCHOLINE RECEPTOR FRYYESSLEPWDD 13 T 1.8 LicD pdbhh F T 1rgr 2 B B postsynaptic protein CRIPT peptide YKKTEV 6 T 200 DivIVA pdbhh F F 1rh4 1 A A RIGHT-HANDED COILED COIL TETRAMER XAALAQXKKEIAYLLAKXKAEILAALKKXKQEIAX 35 T 2.6 Phe_tRNA-synt_N pdbhh F T 1rhk 3 C C acetyl-asp-glu-val-fpr XDEVX 5 T 570 Helicase_RecD pdbhh F F 1rij 1 A A E6apn1 peptide ALQELLGQWLKDGGPSSGRPPPS 23 T 1.5 RE_NgoBV pdbhh F T 1rjk 2 B C MED1_HUMAN PBP, PPAR BINDING PROTEIN, THYROID HORMONE RECEPTOR-ASSOCIATED PROTEIN COMPLEX 220 KDA COMPONENT, TRAP220, THYROID RECEPTOR INTERACTING PROTEIN 2, TRIP2, P53 REGULATORY PROTEIN RB18A, VITAMIN D RECEPTOR-INTERACTING PROTEIN COMPLEX COMPONENT DRIP205, ACTIVATOR-RECRUITED COFACTOR 205 KDA COMPONENT, ARC205 KNHPMLMNLLKDN 13 T 3.8 Gb3_synth pdbhh F Eukaryota T 1rjl 4 D D Outer surface protein B XXXXX 5 F F F 1rjq 1 A A Q9AGH8_ALCFA D-aminoacylase MRGSHHHHHHGSMSQPDATPFDYILSGGTVIDGTNAPGRLADVGVRGDRIAAVGDLSASSARRRIDVAGKVVSPGFIDSHTHDDNYLLKHRDMTPKISQGVTTVVTGNCGISLAPLAHANPPAPLDLLDEGGSFRFARFSDYLEALRAAPPAVNAACMVGHSTLRAAVMPDLRREATADEIQAMQALADDALASGAIGISTGAFYPPAAHASTEEIIEVCRPLITHGGVYATHMRDEGEHIVQALEETFRIGRELDVPVVISHHKVMGKLNFGRSKETLALIEAAMASQDVSLDAYPYVAGSTMLKQDRVLLAGRTLITWCKPYPELSGRDLEEIAAERGKSKYDVVPELQPAGAIYFMMDEPDVQRILAFGPTMIGSAGLPHDERPHPRLWGTFPRVLGHYSRDLGLFPLETAVWKMTGLTAAKFGLAERGQVQPGYYADLVVFDPATVADSATFEHPTERAAGIHSVYVNGAAVWEDQSFTGQHAGRVLNRAGA 496 T 5.9E-16 Amidohydro_3 unphh F Bacteria T 1rjr 1 A A Q9AGH8_ALCFA D-aminoacylase MRGSHHHHHHGSMSQPDATPFDYILSGGTVIDGTNAPGRLADVGVRGDRIAAVGDLSASSARRRIDVAGKVVSPGFIDSHTHDDNYLLKHRDMTPKISQGVTTVVTGNCGISLAPLAHANPPAPLDLLDEGGSFRFARFSDYLEALRAAPPAVNAACMVGHSTLRAAVMPDLRREATADEIQAMQALADDALASGAIGISTGAFYPPAAHASTEEIIEVCRPLITHGGVYATHMRDEGEHIVQALEETFRIGRELDVPVVISHHKVMGKLNFGRSKETLALIEAAMASQDVSLDAYPYVAGSTMLKQDRVLLAGRTLITWCKPYPELSGRDLEEIAAERGKSKYDVVPELQPAGAIYFMMDEPDVQRILAFGPTMIGSAGLPHDERPHPRLWGTFPRVLGHYSRDLGLFPLETAVWKMTGLTAAKFGLAERGQVQPGYYADLVVFDPATVADSATFEHPTERAAGIHSVYVNGAAVWEDQSFTGQHAGRVLNRAGA 496 T 5.9E-16 Amidohydro_3 unphh F Bacteria T 1rk3 2 B C MED1_HUMAN PBP, PPAR BINDING PROTEIN, THYROID HORMONE RECEPTOR-ASSOCIATED PROTEIN COMPLEX 220 KDA COMPONENT, TRAP220, THYROID RECEPTOR INTERACTING PROTEIN 2, TRIP2, P53 REGULATORY PROTEIN RB18A, VITAMIN D RECEPTOR-INTERACTING PROTEIN COMPLEX COMPONENT DRIP205, ACTIVATOR-RECRUITED COFACTOR 205 KDA COMPONENT, ARC205 KNHPMLMNLLKDN 13 T 3.8 Gb3_synth pdbhh F Eukaryota T 1rk5 1 A A Q9AGH8_ALCFA D-aminoacylase MRGSHHHHHHGSMSQPDATPFDYILSGGTVIDGTNAPGRLADVGVRGDRIAAVGDLSASSARRRIDVAGKVVSPGFIDSHTHDDNYLLKHRDMTPKISQGVTTVVTGNCGISLAPLAHANPPAPLDLLDEGGSFRFARFSDYLEALRAAPPAVNAACMVGHSTLRAAVMPDLRREATADEIQAMQALADDALASGAIGISTGAFYPPAAHASTEEIIEVCRPLITHGGVYATHMRDEGEHIVQALEETFRIGRELDVPVVISHHKVMGKLNFGRSKETLALIEAAMASQDVSLDAYPYVAGSTMLKQDRVLLAGRTLITWCKPYPELSGRDLEEIAAERGKSKYDVVPELQPAGAIYFMMDEPDVQRILAFGPTMIGSAGLPHDERPHPRLWGTFPRVLGHYSRDLGLFPLETAVWKMTGLTAAKFGLAERGQVQPGYYADLVVFDPATVADSATFEHPTERAAGIHSVYVNGAAVWEDQSFTGQHAGRVLNRAGA 496 T 5.9E-16 Amidohydro_3 unphh F Bacteria T 1rkg 2 B C MED1_HUMAN PBP, PPAR BINDING PROTEIN, THYROID HORMONE RECEPTOR-ASSOCIATED PROTEIN COMPLEX 220 KDA COMPONENT, TRAP220, THYROID RECEPTOR INTERACTING PROTEIN 2, TRIP2, P53 REGULATORY PROTEIN RB18A, VITAMIN D RECEPTOR-INTERACTING PROTEIN COMPLEX COMPONENT DRIP205, ACTIVATOR-RECRUITED COFACTOR 205 KDA COMPONENT, ARC205 KNHPMLMNLLKDN 13 T 3.8 Gb3_synth pdbhh F Eukaryota T 1rkh 2 B C MED1_HUMAN PBP, PPAR BINDING PROTEIN, THYROID HORMONE RECEPTOR-ASSOCIATED PROTEIN COMPLEX 220 KDA COMPONENT, TRAP220, THYROID RECEPTOR INTERACTING PROTEIN 2, TRIP2, P53 REGULATORY PROTEIN RB18A, VITAMIN D RECEPTOR-INTERACTING PROTEIN COMPLEX COMPONENT DRIP205, ACTIVATOR-RECRUITED COFACTOR 205 KDA COMPONENT, ARC205 KNHPMLMNLLKDN 13 T 3.8 Gb3_synth pdbhh F Eukaryota T 1rkk 1 A A PPM1_LIMPO POLYPHEMUSIN I RRWCFRVCYRGFCYRKCRX 19 T 1.9 ADAM_CR_2 pdbhh F Eukaryota T 1rlp 2 B R PROLINE-RICH LIGAND RLP2 (RALPPLPRY) RALPPLPRY 9 T 5.2 FAA_hydro_N_2 pdbhh F F 1rlq 2 B R PROLINE-RICH LIGAND RLP2 (RALPPLPRY) RALPPLPRY 9 T 5.2 FAA_hydro_N_2 pdbhh F F 1rmh 2 C,D C,D AAPF PEPTIDE SUBSTRATE XAAPFX 6 T 230 zf-C2H2_jaz pdbhh F F 1rpb 1 A _ 3CP1_STRS9 Tricyclic peptide RP 71955 CLGIGSCNDFAGCGYAVVCFW 21 T 0.31 CCAP unphh F Bacteria T 1rpc 1 A _ 3CP1_STRS9 Tricyclic peptide RP 71955 CLGIGSCNDFAGCGYAVVCFW 21 T 0.31 CCAP unphh F Bacteria T 1rpq 2 E,F,G,H W,X,Y,Z Peptide E131 VQCPHFCYELDYELCPDVCYV 21 T 1.6 Prot_inhib_II pdbhh F T 1rqf 2 I,J,N M,N,T P21WAF1 XXXX 4 F F F 1rqf 3 K O P21WAF1 XXXXXXXX 8 F F F 1rqf 4 L,M P,S P21WAF1 XXX 3 F F F 1rqq 3 E,F E,F BISUBSTRATE INHIBITOR KKKLPATGDFMNMSPVGD 18 T 0.3 TagF_N pdbhh F T 1rrv 2 C,D C,D DESVANCOSAMINYL VANCOMYCIN XXNXXXX 7 T 95 P53_C pdbhh F F 1rst 2 B P STREP-TAG PEPTIDE AWRHPQFGG 9 T 1.8 TIMELESS pdbhh F T 1rsu 2 B P STREP-TAG II PEPTIDE SNWSHPQFEK 10 T 0.22 CreD pdbhh F T 1rtf 1 A A (TC)-T-PA SYQSTCGLRQYSQRQRR 17 T 9.5 Abp2 pdbhh F T 1rv6 2 C,D X,Y VGFR1_HUMAN FLT1 protein DTGRPFVEMYSEIPEIIHMTEGRELVIPCRVTSPNITVTLKKFPLDTLIPDGKRIIWDSRKGFIISNATYKEIGLLTCEATVNGHLYKTNYLTHRQTNTI 100 T 0.00097 Ig_2 pdbpssm F Eukaryota T 1rxm 2 B B consensus FEN-1 peptide KTTQSTLDSFFK 12 T 0.86 CitT pdbhh F T 1rxz 2 B B FEN_ARCFU Flap structure-specific endonuclease KSTQATLERWF 11 T 0.3 DUF494 unppercent F Archaea T 1rzj 1 A G ENV_HV1H2 ENVELOPE GLYCOPROTEIN GP120 GARSEVVLVNVTENFNMWKNDMVEQMHEDIISLWDQSLKPCVKLTPLCVGAGSCNTSVITQACPKVSFEPIPIHYCAPAGFAILKCNNKTFNGTGPCTNVSTVQCTHGIRPVVSTQLLLNGSLAEEEVVIRSVNFTDNAKTIIVQLNTSVEINCTGAGHCNISRAKWNNTLKQIASKLREQFGNNKTIIFKQSSGGDPEIVTHSFNCGGEFFYCNSTQLFNSTWFNSTWSTEGSNNTEGSDTITLPCRIKQIINMWQKVGKAMYAPPISGQIRCSSNITGLLLTRDGGNSNNESEIFRPGGGDMRDNWRSELYKYKVVKIE 321 T 4.5000000000000003E-23 GP120 pdb T Viruses T 1rzx 2 B B Acetylated VKESLV Peptide XVKESLV 7 T 200 DUF5627 pdbhh F T 1s1o 1 A,B A,B BOC-L-NLE-(D-NLE-L-NLE)5-D-NLE(METHYL)-L-NLE-D-NLE-L-NLE METHYL ESTER XXXXXXXXXXXXXXXX 16 T 23 ASTN_1_2_N pdbhh F F 1s2k 2 B B Ala-Ile-His tripeptide AIH 3 T 250 DUF5709 pdbhh F F 1s4a 1 A,B A,B HCO-(D-Nle-L-Nle)3-D-MeNle-L-Nle-D-Nle-L-Nle-OMe XXXXXXXXXX 10 T 43 TMEM51 pdbhh F F 1s4v 2 C,D C,D DVA-LEU-LYS-0QE peptide XLKX 4 T 710 GSP_synth pdbhh F F 1s4z 2 C C CAF1A_MOUSE CAF-1 SUBUNIT A, CHROMATIN ASSEMBLY FACTOR I P150 SUBUNIT, CAF-I 150 KDA SUBUNIT, CAF-IP150 GSKAGDLLFIEKVPVVVLEDILATKPSIAS 30 T 0.84 DUF411 pdbhh F Eukaryota T 1s5l 18 KA,R n,N Photosystem II PsbN protein XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 37 F F F 1s5p 2 B B HISTONE H4 (RESIDUES 12-19) KGGAXRHR 8 T 130 DUF2476 pdbhh F T 1s5q 1 A A MAD1_HUMAN MAX DIMERIZER RMNIQMLLEAADYLER 16 T 1.8 DUF6117 pdbhh F Eukaryota T 1s5r 1 A A HBP1_MOUSE high mobility group box transcription factor 1 DFTPMDSSAVYVLSSMARQRRAS 23 T 1.5 PAXIP1_C pdbhh F Eukaryota T 1s7p 1 A B MCJA_ECOLX microcin J25 VGIGTPISFYG 11 T 0.13 Endonuc-BglII unp F Bacteria T 1s7p 2 B A MCJA_ECOLX microcin J25 GGAGHVPEYF 10 T 0.13 Endonuc-BglII unp F Bacteria T 1s9v 3 C,F C,F alpha-I gliadin LQPFPQPELPY 11 T 3.7 Sod_Fe_N pdbhh F T 1s9x 3 C C NY-ESO-1 peptide analogue S9A SLLMWITQA 9 T 0.7 DUF6405 pdbhh F T 1s9y 3 C C NY-ESO-1 peptide analogue S9S SLLMWITQS 9 T 1.6 DUF6405 pdbhh F T 1sbu 1 A A delta-conotoxin EVIA GFASLXILKNG 11 T 0.56 Pneumo_NS1 pdbhh F T 1sdx 2 B E TRFL_BOVIN Lactotransferrin LEACA 5 T 91 DUF3986 pdbhh F Eukaryota F 1sdz 2 B B Reaper AVAFYIPDQA 10 T 2.3 Insulin_TMD pdbhh F T 1se0 2 B B GRIM_DROME Cell death protein Grim AIAYFIPDQA 10 T 0.61 DUF5521 unppercent F Eukaryota T 1seb 3 C,G C,G ENDOGENOUS PEPTIDE MODEL, POLY-ALA XXXXXXXXXXXXX 13 F F F 1sem 2 C,D C,D 10-RESIDUE PROLINE-RICH PEPTIDE FROM MSOS (ACE-PRO-PRO-PRO-VAL-PRO-PRO-ARG-ARG-ARG) XPPPVPPRRR 10 T 2.5 Dscam_C pdbhh F F 1sfi 2 B I SFTI1_HELAN SFTI-1 GRCTKSIPPICFPD 14 T 0.0029 Bowman-Birk_leg pdb F Eukaryota T 1sgc 2 B B CHYMOSTATIN A FXLX 4 T 52 FAA_hydrolase_N pdbhh F F 1sha 2 B B PGFRB_HUMAN PHOSPHOPEPTIDE A XVPML 5 T 30 DapH_N pdbhh F Eukaryota F 1shb 2 B B PHOSPHOPEPTIDE B XLRVA 5 T 61 THDPS_N_2 pdbhh F F 1shc 2 B B NTRK1_HUMAN TRKA RECEPTOR PHOSPHOPEPTIDE HIIENPQXFSDA 12 T 1.6 DUF2399 pdbhh F Eukaryota T 1shd 2 B B TRKA RECEPTOR XXEEIE 6 T 88 TMEM171 pdbhh F F 1sho 1 A,B A,B VANCOMYCIN XXNXXXX 7 T 95 P53_C pdbhh F F 1sio 2 D,E,F D,E,F Ace-ILE-PRO-PHL peptide inhibitor XIPX 4 T 77 G0-G1_switch_2 pdbhh F F 1skg 2 B B VAFRS VAFRS 5 T 83 IML1 pdbhh F F 1ski 1 A A cyclic hexapeptide RRYYRF RRYYRF 6 T 15 Spore_III_AF pdbhh F F 1skk 1 A A cyclic hexapeptide KKWWKF KKWWKF 6 T 0.057 VPS13 pdbhh F F 1skl 1 A A cyclic hexapeptide RR(NAL)(NAL)RF RRXXRF 6 T 5.3 RET_CLD4 pdbhh F F 1skv 1 A,B,C,D A,B,C,D D63_SSV1 ORF D-63 MSKEVLEKELFEMLDEDVRELLSLIHEIKIDRITGNMDKQKLGKAYFQVQKIEAELYQLIKVSHHHHHH 69 T 0.093 Oxidored_nitro unppssm T Viruses T 1sld 2 B P CYCLO-AC-CHPQFC-NH2 XCHPQFCX 8 T 1.4 Cytochrom_C_2 pdbhh F T 1sle 2 B,D M,P AC-CHPQGPPC-NH2 XCHPQGPPCX 10 T 2.8 Defensin_int pdbhh F T 1slg 2 B,D M,P FCHPQNT FSHPQNT 7 T 14 PmoA pdbhh F T 1sm1 6 F 5 PRISTINAMYCIN IA, RP 57669 XTXPXXXX 8 T 1500 zf-met pdbhh F F 1sm3 3 C P MUC1_HUMAN PEPTIDE EPITOPE TSAPDTRPAPGST 13 T 21 DUF3235 pdbhh F Eukaryota T 1sme 2 C,D C,D Pepstatin XVVXAX 6 T 1700 FAM60A pdbhh F F 1smr 2 B,D,F,H B,D,F,H ANGT_RAT INHIBITOR CH-66 XHPFHXYYS 9 T 0.31 DUF5372 pdbhh F Eukaryota T 1sn9 1 A,B,C,D A,B,C,D BBAT XYRIXSYDFXDELAKLLRQAXGX 23 T 8 DUF5813 pdbhh F T 1sna 1 A,B,C,D A,B,C,D BBAT XYRIXSYDFXDELXKLLRQAXGX 23 T 8.6 DUF1949 pdbhh F T 1sne 1 A,B A,B BBAT XYRIXSYDFXDELAKLLRXAXGX 23 T 8.1 PelD_GGDEF pdbhh F T 1soc 1 A _ OCTREOTIDE XCFXKTCX 8 T 0.0019 Urotensin_II pdbhh F F 1sol 1 A _ GELS_HUMAN GELSOLIN (150-169) KHVVPNEVVVQRLFQVKGRR 20 T 1.4 Sua5_yciO_yrdC pdbhh F Eukaryota T 1soz 2 D,E D,E activating peptide DNRLGLVYQF 10 T 1.2 POLO_box pdbhh F T 1sps 2 B,D,F D,E,F MT_POVHA PEPTIDE YEEI EPQXEEIPIYL 11 T 3.2 Imm15 pdbhh T Viruses T 1sqz 2 B B synthetic peptide XIARS 5 T 210 GP67 pdbhh F F 1sse 1 A A YAP1_YEAST PHENANTHROLINE RESISTANCE PROTEIN PAR1, PLEIOTROPIC DRUG RESISTANCE PROTEIN PDR4 NLDSNMFSNDFNFENQFDEQVSEFCSKMNQVCGTR 35 T 0.94 PAP1 pdbhh F Eukaryota T 1ssh 2 B B SLA1_YEAST 12-RESIDUE PEPTIDE FROM SLA1 EGPPPAMPARPT 12 T 21 p47_phox_C pdbhh F Eukaryota T 1str 2 C,D M,P AC-CHPQNT-NH2 XCHPQNTX 8 T 9.2 DHOR pdbhh F T 1sts 2 C,D M,P FCHPQNT-NH2 FCHPQNTX 8 T 1.8 DUF2799 pdbhh F T 1sua 2 B C TETRAPEPTIDE ALA-LEU-ALA-LEU ALAL 4 T 410 Phage_mat-A pdbhh F F 1suy 2 C,D C,D KAIC_THEEB CIIABD AMAGIISGTPTRISVDEKTELARIAKGMQDLESE 34 T 5.4 Cep57_MT_bd pdbhh F Bacteria T 1sv1 2 C,D C,D KAIC_THEEB CIIABD AMAGIISGTPTRISVDEKTELARIAKGMQDLESE 34 T 5.4 Cep57_MT_bd pdbhh F Bacteria T 1svz 2 C,D C,D epitope peptide corresponding to N-terminus of HIV-2 protease PQFSLWKR 8 T 0.42 MTP_lip_bd pdbhh F T 1sy9 2 B B CNGA2_BOVIN CYCLIC-NUCLEOTIDE-GATED CATION CHANNEL 2, CNG CHANNEL 2, CNG-2, CNG2 QQRRGGFRRIARLVGVLREWAYRNFR 26 T 5.6 Adeno_E4 pdbhh F Eukaryota T 1szc 2 B B H4_YEAST Histone H4 peptide KGGAXRHRKI 10 T 4.2 Shadoo unppercent F Eukaryota T 1szd 2 B B H4_YEAST Histone H4 peptide KGGAXRHRKI 10 T 4.2 Shadoo unppercent F Eukaryota T 1t0j 3 C C CAC1C_HUMAN CALCIUM CHANNEL, L TYPE, ALPHA-1 POLYPEPTIDE, ISOFORM 1, CARDIAC MUSCLE GAQQLEEDLKGYLDWITQAE 20 T 1.8 Antimicrobial14 pdbhh F Eukaryota T 1t15 2 B B FANCJ_HUMAN BRCA1 interacting protein C-terminal helicase 1 STSPTFNK 8 T 4.7 DUF4675 pdbhh F Eukaryota T 1t1x 3 C C GAG PEPTIDE SLYLTVATL 9 T 5.1 Gag_p17 pdbhh F T 1t1y 3 C C GAG PEPTIDE SLYNVVATL 9 T 0.31 Gag_p17 pdbhh F T 1t1z 3 C C GAG PEPTIDE ALYNTAAAL 9 T 3.6 Gag_p17 pdbhh F F 1t29 2 B B FANCJ_HUMAN BACH1 phosphorylated peptide ISRSTSPTFNKQTK 14 T 5.4 DUF782 pdbhh F Eukaryota T 1t2v 2 F,G,H,I,J F,G,H,I,J BRCTide-7PS GAAYDISQVFPFAKKK 16 T 1.9 Thump_like pdbhh F T 1t2w 2 D D Peptide LEU-PRO-GLU-THR-GLY LPETG 5 T 83 ArAE_1_C pdbhh F F 1t2y 1 A A MT_NEUCR MT GDCGCSGASSCNCGSGCSCSNCGSK 25 T 0.003 Metallothio unphh F Eukaryota T 1t37 2 B P Synthetic peptide LAIYS 5 T 86 KH_7 pdbhh F F 1t3l 2 B B CAC1S_RABIT CALCIUM CHANNEL, L TYPE, ALPHA-1 POLYPEPTIDE, ISOFORM 3, SKELETAL MUSCLE QQLEEDLRGYMSWITQGE 18 T 1.8 Antimicrobial14 pdbhh F Eukaryota T 1t4f 2 B P optimized p53 peptide RFMDYWEGL 9 T 0.51 Usg pdbhh F T 1t51 1 A A NDB41_OPIMA ISCT ILGKIWEGIKSLFX 14 T 0.005 SecG unppssm F Eukaryota T 1t52 1 A A NDB41_OPIMA ISCT ILGKIWKGIKSLFX 14 T 0.005 SecG unppssm F Eukaryota T 1t54 1 A A NDB41_OPIMA ISCT ILGKIAEGIKSLFX 14 T 0.005 SecG unppssm F Eukaryota T 1t55 1 A A NDB41_OPIMA ISCT ILGKIWKPIKKLFX 14 T 0.005 SecG unppssm F Eukaryota T 1t5m 1 A A A21978C, CUBICIN WXDTGXDXDGXXX 13 T 31 Imm64 pdbhh F F 1t5n 1 A A A21978C, CUBICIN WXDTGXDXDGXXX 13 T 31 Imm64 pdbhh F F 1t5w 3 C,F C,F MIG1_YEAST REGULATORY PROTEIN CAT4 AAYSDQATPLLLSPR 15 T 19 DUF5888 pdbhh F Eukaryota T 1t5x 3 C C MIG1_YEAST REGULATORY PROTEIN CAT4 AAYSDQATPLLLSPR 15 T 19 DUF5888 pdbhh F Eukaryota T 1t5z 2 B B NCOA4_HUMAN NCOA-4, 70 KDA ANDROGEN RECEPTOR COACTIVATOR, 70 KDA AR-ACTIVATOR, RET-ACTIVATING PROTEIN ELE1 RETSEKFKLLFQSYN 15 T 4 DUF1279 pdbhh F Eukaryota T 1t6o 2 B L linker GSGSGSGS 8 T 2.1 Glypican pdbhh F F 1t73 2 B B FxxFF motif peptide SRFADFFRNEGLGSRSGSGK 20 T 1.7 HATPase_c_4 pdbhh F T 1t74 2 B B WxxLF motif peptide SRWQALFDDGTDTSR 15 T 3.4 NUC153 pdbhh F T 1t76 2 B B WxxVW motif peptide SRWAEVWDDNSKVSR 15 T 3 ODC_AZ pdbhh F T 1t79 2 B B FxxLW motif peptide SSKFAALWDPPKLSRSGSGK 20 T 3.7 MOSP_N pdbhh F T 1t7d 2 C,D C,D ARYLOMYCIN A2 XXGXAY 6 T 230 MMACHC pdbhh F F 1t7f 2 B B LxxLL motif peptide SSRGLLWDLLTKDSRSGSGK 20 T 4.2 TPK_B1_binding pdbhh F T 1t7r 2 B B FxxLF motif peptide SSRFESLFAGEKESR 15 T 5.5 CoV_NSP15_C pdbhh F T 1t85 1 A A CPXA_PSEPU CAMPHOR 5-MONOOXYGENASE, P450CAM TTETIQSNANLAPLPPHVPEHLVFDFDMYNPSNLSAGVQEAWAVLQESNVPDLVWTRCNGGHWIATRGQLIREAYEDYRHFSSECPFIPREAGEAYDFIPTSMDPPEQRQFRALANQVVGMPVVDKLENRIQELACSLIESLRPQGQCNFTEDYAEPFPIRIFMLLAGLPEEDIPHLKYLTDQMTRPDGSMTFAEAKEALYDYLIPIIEQRRQKPGTDAISIVANGQVNGRPITSDEAKRMCGLLLVGGLDTVVNFLSFSMEFLAKSPEHRQELIERPERIPAACEELLRRFSLVADGRILTSDYEFHGVQLKKGDQILLPQMLSGLDERENAAPMHVDFSRQKVSHTTFGHGSHLCPGQHLARREIIVTLKEWLTRIPDFSIAPGAQIQHKSGIVSGVQALPLVWDPATTKAV 414 T 1.6E-05 p450 unppercent F Bacteria T 1t86 1 A,B A,B CPXA_PSEPU CAMPHOR 5-MONOOXYGENASE, P450CAM TTETIQSNANLAPLPPHVPEHLVFDFDMYNPSNLSAGVQEAWAVLQESNVPDLVWTRCNGGHWIATRGQLIREAYEDYRHFSSECPFIPREAGEAYDFIPTSMDPPEQRQFRALANQVVGMPVVDKLENRIQELACSLIESLRPQGQCNFTEDYAEPFPIRIFMLLAGLPEEDIPHLKYLTDQMTRPDGSMTFAEAKEALYDYLIPIIEQRRQKPGTDAISIVANGQVNGRPITSDEAKRMCGLLLVGGLDTVVNFLSFSMEFLAKSPEHRQELIERPERIPAACEELLRRFSLVADGRILTSDYEFHGVQLKKGDQILLPQMLSGLDERENAAPMHVDFSRQKVSHTTFGHGSHLCPGQHLARREIIVTLKEWLTRIPDFSIAPGAQIQHKSGIVSGVQALPLVWDPATTKAV 414 T 1.6E-05 p450 unppercent F Bacteria T 1t8j 1 A A BBA5 XYRVXSYDFSRSDELAKLLRQHAGX 25 T 8.3 EZH2_N pdbhh F T 1t9e 1 A A SFTI1_HELAN SFTI-1 GRXTKSIPPIXFPD 14 T 0.052 Bowman-Birk_leg unp F Eukaryota T 1tdv 2 B B YWAAAA YWAAAA 6 T 7.4 DUF5953 pdbhh F F 1tfs 1 A _ 3SL2_DENPO TOXIN FS2 RICYSHKASLPRATKTCVENTCYKMFIRTHREYISERGCGCPTAMWPYQTECCKGDRCNK 60 T 5.2 Activin_recp pdb F Eukaryota T 1tg1 2 B C peptide inhibitor XLVRY 5 T 120 DUF6027 pdbhh F F 1tg4 2 B I FLAYK peptide FLAYK 5 T 71 Dynein_light pdbhh F F 1tgg 1 A,B,C A,B,C right-handed coiled coil trimer XAEXEQXKKEIAYLXKKXKEEILEEXKKXKQEIA 34 T 1.2 YojJ pdbhh F T 1ths 3 C I SYNTHETIC INHIBITOR XYEPIPEEAXE 11 T 0.65 Hirudin pdbhh F T 1tj9 2 B B VARS peptide VARS 4 T 220 PPP1R32 pdbhh F F 1tjb 1 A,B A,B Lanthanide-Binding Peptide YIDTNNDGWYEGDELLAX 18 T 0.53 DUF5057 pdbhh F T 1tjk 2 B I synthetic peptide FLSTK 5 T 130 Pinin_SDK_memA pdbhh F F 1tk2 2 B B GRAMICIDIN SOVIET VXLXPVXLXP 10 T 7.1 CemA pdbhh F F 1tk4 2 B B Tetrapeptide Ala-Ile-Arg-Ser AIRS 4 T 220 AvrM-A pdbhh F F 1tkq 1 A A MINI-GRAMICIDIN A AXVXWXWXWXW 11 T 3.4 DUF2826 pdbhh F F 1tkq 2 B B VALYL GRAMICIDIN VGAXAXVXWXWXWXW 15 T 3.9 MAP17 pdbhh F F 1tl9 2 B B leupeptin inhibitor XLLX 4 T 590 DNTTIP1_dimer pdbhh F F 1tmb 4 D T cyclotheonamide A XXXXP 5 T 240 Aurora-A_bind pdbhh F F 1tmc 3 C C DECAMERIC PEPTIDE (EVAPPEYHRK) EVAPPEYHRK 10 T 16 DUF6328 pdbhh F T 1tn6 3 C C peptide derived from the C-terminus of Rap2a DDPTASACNIQ 11 T 22 MTCP1 pdbhh F T 1tn7 3 C C Fusion protein KKSKTKCVIF 10 T 2.4 Acetyltransf_14 pdbhh F T 1tn8 3 C C peptide derived from the C-terminus of H-Ras GCVLS 5 T 37 DUF5675 pdbhh F F 1tnb 3 M,N,O,P,Q,R M,N,O,P,Q,R Fusion protein KKSKTKCVIF 10 T 2.4 Acetyltransf_14 pdbhh F T 1tno 3 M,N,O,P,Q,R M,N,O,P,Q,R c-K-ras2 protein isoform b KKKSKTKCVIM 11 T 0.035 Fer4_22 unppssm F T 1tnu 3 M,N,O,P,Q,R M,N,O,P,Q,R Transforming protein RhoB GCINCCKVL 9 T 0.72 Gal_GalNac_35kD pdbhh F T 1tnv 1 A,A2,A3,A4,A5,B,B2,B3,B4,B5 A,A,A,A,A,B,B,B,B,B TOBACCO NECROSIS VIRUS (SUBUNIT VP1) XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 186 F F F 1tnv 2 C,C2,C3,C4,C5 C,C,C,C,C TOBACCO NECROSIS VIRUS (SUBUNIT VP3) XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 210 F F F 1tny 3 M,N,O,P,Q,R M,N,O,P,Q,R guanine nucleotide-binding protein G(I)/G(S)/G(O) gamma-2 subunit FREKKFFCAIL 11 T 11 ITAM_Cys-rich pdbhh F T 1tnz 3 M,N,O,P,Q,R M,N,O,P,Q,R Cell division control protein 42 homolog (Cdc42) RRCVLL 6 T 2.2 DUF5883 pdbhh F F 1toq 2 B,D,F,H B,D,F,H LECB3_ARTIN AGGLUTININ BETA CHAIN DENSGKSQTVIVGPWGAKVS 20 T 2.7 DUF3842 pdbhh F Eukaryota T 1tp3 2 B B KKETPV peptide ligand KKETPV 6 T 2.3 DapH_N pdbhh F F 1tp5 2 B B LYS-LYS-GLU-THR-TRP-VAL peptide ligand KKETWV 6 T 45 Mit_proteolip pdbhh F F 1tp8 2 B,D,F,H B,D,F,H LECB3_ARTIN AGGLUTININ BETA CHAIN DENSGKSQTVIVGPWGAKVS 20 T 2.7 DUF3842 pdbhh F Eukaryota T 1tps 2 B B INHIBITOR A90720A XXTRELXV 8 T 5.4 Endotoxin_M pdbhh F T 1tsq 2 C P GAG_HV1H2 AP2V NC-P1 SUBSTRATE PEPTIDE RQVNFLGKIN 10 T 0.61 zf-CCHC_5 unphh T Viruses T 1tsu 2 C P GAG_HV1H2 NC-P1 SUBSTRATE PEPTIDE RQANFLGK 8 T 0.61 zf-CCHC_5 unphh T Viruses T 1tt5 3 E,F E,F UBC12_HUMAN UBC12N26, UBIQUITIN-PROTEIN LIGASE M, UBIQUITIN CARRIER PROTEIN M, NEDD8-CONJUGATING ENZYME UBC12 MIKLFSLKQQKKEEESAGGTKGSSKK 26 T 7.8E-11 UFC1 unphh F Eukaryota T 1tvb 3 C,F C,F PMEL_HUMAN epitope of Melanocyte protein Pmel 17 ITDQVPFSV 9 T 4.8 PatG_C pdbhh F Eukaryota T 1tvh 3 C,F C,F PMEL_HUMAN epitope of Melanocyte protein Pmel 17 IMDQVPFSV 9 T 4.3 DUF1422 pdbhh F Eukaryota T 1twb 2 C,D C,D ssrA peptide ACNDENYA 8 T 19 DUF6231 pdbhh F T 1twq 2 B P muramyl tripeptide XAXKX 5 T 1200 zf-met pdbhh F F 1txp 1 A,B,C,D A,B,C,D HNRPC_HUMAN HNRNP C IQAIKKELTQIKQKVDSLLENLEKIEKE 28 T 0.037 IES5 unppercent F Eukaryota T 1tze 2 B I PHOSPHOTYROSYL HEPTAPEPTIDE LYS-PRO-PHE-PTYR-VAL-ASN-VAL-NH2 KPFXVNVX 8 T 0.35 SH3-WW_linker pdbhh F T 1tzg 3 E,F P,Q GP41 KGWNWFDITNWGK 13 T 0.029 GP41 pdbhh F T 1tzs 3 C X 23-mer peptide from PelB-IgG kappa light chain fusion protein MKYLLPTAAAGLLLLAAQPAMAM 23 T 0.036 DUF6488 pdbhh F T 1u00 2 B P IscU recognition peptide ELPPVKIHC 9 T 7.1 DUF4528 pdbhh F T 1u0i 1 A A IAAL-E3 EIAALEKEIAALEKEIAALEK 21 T 0.0011 DUF3138 pdbhh F F 1u0i 2 B B IAAL-K3 KIAALKEKIAALKEKIAALKE 21 T 0.012 ZapB pdb F F 1u38 2 B B PVYI PVYI 4 T 23 Apidaecin pdbhh F F 1u3h 5 E,J P,I MBP_MOUSE Myelin basic protein (MBP)-peptide SRGGASQYRPSQ 12 T 10 Tsg pdbhh F Eukaryota T 1u67 1 A A PGH1_SHEEP CYCLOOXYGENASE-1, COX-1, PROSTAGLANDIN-ENDOPEROXIDE SYNTHASE 1, PROSTAGLANDIN H2 SYNTHASE 1, PGH SYNTHASE 1, PGHS-1, PHS 1 MSRQSISLRFPLLLLLLSPSPVFSADPGAPAPVNPCCYYPCQHQGICVRFGLDRYQCDCTRTGYSGPNCTIPEIWTWLRTTLRPSPSFIHFLLTHGRWLWDFVNATFIRDTLMRLVLTVRSNLIPSPPTYNIAHDYISWESFSNVSYYTRILPSVPRDCPTPMGTKGKKQLPDAEFLSRRFLLRRKFIPDPQGTNLMFAFFAQHFTHQFFKTSGKMGPGFTKALGHGVDLGHIYGDNLERQYQLRLFKDGKLKYQMLNGEVYPPSVEEAPVLMHYPRGIPPQSQMAVGQEVFGLLPGLMLYATIWLREHNRVCDLLKAEHPTWGDEQLFQTARLILIGETIKIVIEEYAQQLSGYFLQLKFDPELLFGAQFQYRNRIAMEFNQLYHFHPLMPDSFRVGPQDYSYEQFLFNTSMLVDYGVEALVDAFSRQPAGRIGGGRNIDHHILHVAVDVIKESRVLRLQPFNEYRKRFGMKPYTSFQELTGEKEMAAELEELYGDIDALEFYPGLLLEKCHPNSIFGESMIEMGAPFSLKGLLGNPICSPEYWKASTFGGEVGFNLVKTATLKKLVCLNTKTCPYVSFHVPDPRQEDRPGVERPPTEL 600 T 2.7E-05 An_peroxidase unp F Eukaryota T 1u7b 2 B B FEN1_HUMAN SRQGSTQGRLDDFFKVTGSL peptide of Flap endonuclease-1 SRQGSTQGRLDDFFKVTGSL 20 T 0.036 LRV_FeS pdbhh F Eukaryota T 1u7j 1 A,B A,B Four-helix bundle model MDYLRELYKLEQQAMKLYREASERVGDPVLAKILEDEEKHIEWLETING 49 T 0.00018 Rubrerythrin pdbhh F T 1u8g 2 C I peptidomimetic inhibitor KI2-PHE-GLU-GLU-NH2 XFEEX 5 T 350 DUF4210 pdbhh F F 1u8h 3 C C GP41 PEPTIDE ALDKWAS 7 T 4.1 DUF148 pdbhh F T 1u8i 3 C C GP41 PEPTIDE ELDKWAN 7 T 0.46 TMEM154 pdbhh F T 1u8j 3 C C GP41 PEPTIDE ELDKWAG 7 T 2.4 TMEM154 pdbhh F T 1u8k 3 C C GP41 PEPTIDE LELDKWASL 9 T 3.6 Kri1_C pdbhh F T 1u8l 3 C C GP41 PEPTIDE DLDRWAS 7 T 1.2 YacG pdbhh F T 1u8m 3 C C GP41 PEPTIDE ELDKYAS 7 T 6.1 DUF3283 pdbhh F T 1u8n 3 C C GP41 PEPTIDE ELDKFAS 7 T 3.8 Gag_p17 pdbhh F T 1u8o 3 C C GP41 PEPTIDE ELDKHAS 7 T 2.2 DUF3283 pdbhh F T 1u8p 3 C C GP41 PEPTIDE ECDKWCS 7 T 0.62 Sex_peptide pdbhh F T 1u8q 3 C C GP41 PEPTIDE ELEKWAS 7 T 5.7 DUF1186 pdbhh F T 1u8t 2 E,F E,F FLIM_ECOLI Flagellar motor switch protein fliM MGDSILSQAEIDALLN 16 T 0.027 CitT pdbhh F Bacteria T 1u91 3 C C GP41 PEPTIDE ANALOG ENDKWAS 7 T 3.2 Sin3_corepress pdbhh F T 1u92 3 C C GP41 PEPTIDE ANALOG EADKWQS 7 T 1.7 DEC1 pdbhh F T 1u93 3 C C GP41 PEPTIDE ANALOG EQDKWAS 7 T 28 Tmemb_9 pdbhh F T 1u95 3 C C GP41 PEPTIDE ELDHWAS 7 T 7.4 DUF3606 pdbhh F T 1u9e 2 C,D C,D STEROID RECEPTOR COACTIVATOR-1 KLVQLLTTT 9 T 0.24 SRC-1 pdbhh F F 1u9f 1 A,B,C,D A,B,C,D AMINO ACID BIOSYNTHESIS REGULATORY PROTEIN XRMKQIEDKLEEILSXYHIENELARIKKLLGER 33 T 0.0068 VGPC1_C pdbhh F T 1u9g 1 A,B A,B AMINO ACID BIOSYNTHESIS REGULATORY PROTEIN RMKQIEDXEEILSKLYHIENELARIKKLLGER 32 T 0.0046 DUF1192 pdbpercent F T 1u9h 1 A,B A,B AMINO ACID BIOSYNTHESIS REGULATORY PROTEIN XRMKQIEDKLEEILSKLYHIENXARIKKLLGER 33 T 0.048 VGPC1_C pdbhh F T 1u9l 2 C C Lambda N NRPILSL 7 T 1.8 CedA pdbhh F T 1uao 1 A A Chignolin GYDPETGTWG 10 T 0.046 DUF4585 pdbhh F T 1ucy 4 D,G,J F,G,I FIBA_MACFU FIBRINOPEPTIDE A-ALPHA XDFLAEGGGVRPR 13 T 4 ThuA unphh F Eukaryota T 1uef 2 C,D C,D RET_MOUSE POLYPEPTIDE CONTAINING A PHOSPHORYLATED TYROSINE STWIENKLXGMSD 13 T 2.7 DUF3541 pdbhh F Eukaryota T 1ugw 2 B,D,F,H B,D,F,H LECB3_ARTIN JACALIN BETA-3 CHAIN DEQSGISQTVIVGPWGAKSS 20 T 2.5 DUF6409 pdbhh F Eukaryota T 1ugx 2 B B LECB3_ARTIN JACALIN BETA-3 CHAIN DEQSGISQTVIVGPWGAKVS 20 T 3.2 DUF3842 pdbhh F Eukaryota T 1ugy 2 B,D,F,H B,D,F,H LECB3_ARTIN JACALIN BETA-3 CHAIN DEQSGISQTVIVGPWGAKSS 20 T 2.5 DUF6409 pdbhh F Eukaryota T 1uh0 2 B,D,F,H B,D,F,H LECB3_ARTIN JACALIN BETA-3 CHAIN DEQSGKSQTVIVGPWGAKVS 20 T 3 DUF3842 pdbhh F Eukaryota T 1uh1 2 B,D,F,H B,D,F,H LECB3_ARTIN JACALIN BETA-3 CHAIN DEQSGKSQTVIVGPWGAKVS 20 T 3 DUF3842 pdbhh F Eukaryota T 1uj0 2 B B UBP8_MOUSE UBPY-DERIVED PEPTIDE TPMVNRENKPP 11 T 1.1 DUF6440 pdbhh F Eukaryota T 1ujj 2 C C BACE1_HUMAN C-TERMINAL PEPTIDE FROM BACE HDDFADDISLLK 12 T 0.72 CD34_antigen unphh F Eukaryota T 1ujk 2 C,D C,D BACE1_HUMAN C-TERMINAL PEPTIDE FROM BACE HDDFADDISLLK 12 T 0.72 CD34_antigen unphh F Eukaryota T 1ujz 2 B B CEA7_ECOLX DC DNASE KRNKPGKATGKGKPVNNKWLNNAGKDLGSPVPDRIANKLRDKEFKSFDDFRKKFWEEVSKDPELSKQFSRNNNDRMKVGKAPQTRTQDVSGKRRSFELHHEKPISQNGGVYDMDNISVVTPKRAIDIH 128 T 0.0091 HNH pdbpercent F Bacteria T 1uk4 2 C,D,E G,H,K 5-mer peptide of inhibitor NSTLQ 5 T 130 LRR_6 pdbhh F F 1ukh 2 B B JIP1_MOUSE JIP1 RPKRPTTLNLF 11 T 12 CTP-dep_RFKase pdbhh F Eukaryota T 1uki 2 B B JIP1_MOUSE JIP1 RPKRPTTLNLF 11 T 12 CTP-dep_RFKase pdbhh F Eukaryota T 1umw 2 C,D E,F PEPTIDE PMQSTPL 7 T 20 KRE9 pdbhh F T 1unj 2 E,F,K,L,Q,R,W,X E,F,K,L,Q,R,W,X DACTINOMYCIN TXPXXXTXPXX 11 T 61 DUF5572 pdbhh F F 1unm 2 E,F E,F 7-AMINOACTINOMYCIN D TXPXXXTXPXX 11 T 61 DUF5572 pdbhh F F 1uno 1 A,B A,B H-(L-TYR-D-TYR)4-LYS-OH YXYXYXYXK 9 T 170 DUF4234 pdbhh F F 1uoc 1 A,B A,B POP2_YEAST CCR4-ASSOCIATED FACTOR 1 GAMPPIFLPPPNYLFVRDVWKSNLYSEFAVIRQLVSQYNHVSISTEFVGTLARPIGTFRSKVDYHYQTMRANVDFLNPIQLGLSLSDANGNKPDNGPSTWQFNFEFDPKKEIMSTESLELLRKSGINFEKHENLGIDVFEFSQLLMDSGLMMDDSVTWITYHAAYDLGFLINILMNDSMPNNKEDFEWWVHQYMPNFYDLNLVYKIIQEFKNPQLQQSSQQQQQQQYSLTTLADELGLPRFSIFTTTGGQSLLMLLSFCQLSKLSMHKFPNGTDFAKYQGVIYGIDGDQ 289 T 3.5E-30 CAF1 pdbhh F Eukaryota T 1uoo 2 B B PEPTIDE LIGAND GLY-PHE-ARG-PRO GFRP 4 T 22 K_oxygenase pdbhh F F 1uop 2 B B PEPTIDE LIGAND GLY-PHE-GLU-PRO GFEP 4 T 23 SSV1_ORF_D-335 pdbhh F F 1uoq 2 B B PEPTIDE LIGAND GLU-PHE-SER-PRO EFSP 4 T 59 DUF2418 pdbhh F F 1upk 2 B B STRAA_HUMAN STRAD ALPHA NLEELEVDDWEF 12 T 6.3 ANAPC16 pdbhh F Eukaryota T 1ura 1 A,B A,B PPB_ECOLI ALKALINE PHOSPHATASE MPVLENRAAQGDITAPGGARRLTGDQTAALRDSLSDKPAKNIILLIGNGMGDSEITAARNYAEGAGGFFKGIDALPLTGQYTHYALNKKTGKPDYVTDSAASATAWSTGVKTYNGALGVDIHEKDHPTILEMAKAAGLATGNVSTAELQDATPAALVAHVTSRKCYGPSATSEKCPGNALEKGGKGSITEQLLNARADVTLGGGAKTFAETATAGEWQGKTLREQAQARGYQLVSDAASLNSVTEANQQKPLLGLFADGNMPVRWLGPKATYHGNIDKPAVTCTPNPQRNDSVPTLAQMTDKAIELLSKNEKGFFLQVEGASIDKQDHAANPCGQIGETVDLDEAVQRALEFAKKEGNTLVIVTADHAHASQIVAPDTKAPGLTQALNTKDGAVMVMSYGNSEEDSQEHTGSQLRIAAYGPHAANVVGLTDQTDLFYTMKAALGLK 446 T 6.8E-10 Alk_phosphatase unppssm F Bacteria T 1urb 1 A,B A,B PPB_ECOLI ALKALINE PHOSPHATASE MPVLENRAAQGDITAPGGARRLTGDQTAALRDSLSDKPAKNIILLIGNGMGDSEITAARNYAEGAGGFFKGIDALPLTGQYTHYALNKKTGKPDYVTDSAASATAWSTGVKTYNGALGVDIHEKDHPTILEMAKAAGLATGNVSTAELQDATPAALVAHVTSRKCYGPSATSEKCPGNALEKGGKGSITEQLLNARADVTLGGGAKTFAETATAGEWQGKTLREQAQARGYQLVSDAASLNSVTEANQQKPLLGLFADGNMPVRWLGPKATYHGNIDKPAVTCTPNPQRNDSVPTLAQMTDKAIELLSKNEKGFFLQVEGASIDKQDHAANPCGQIGETVDLDEAVQRALEFAKKEGNTLVIVTADHAHASQIVAPDTKAPGLTQALNTKDGAVMVMSYGNSEEDSQEHTGSQLRIAAYGPHAANVVGLTDQTDLFYTMKAALGLK 446 T 6.8E-10 Alk_phosphatase unppssm F Bacteria T 1urc 3 E,F E,F PEPTIDE INHIBITOR XRKLFG 6 T 29 Mog1 pdbhh F T 1url 2 B B ALA-GLY-HIS-THR-TRP-GLY-HIA AGHTWGX 7 T 1.4 HSNSD pdbhh F T 1utc 2 C,D P,Q AMPH_HUMAN AMPHIPHYSIN TLPWDLWTT 9 T 5.5 GldC-like pdbhh F Eukaryota F 1uti 2 B D M4K1_MOUSE HEMATOPOETIC PRGENITOR KINASE I, MAPK/ERK KINASE KINASE KINASE 1, MEK KINASE KINASE 1, MEKKK 1, HPK GQPPLVPPRKEKMRGK 16 T 5.1 NapB pdbhh F Eukaryota T 1uw1 1 A A ARTIFICIAL NUCLEOTIDE BINDING PROTEIN (ANBP) GAMDYKDDDDKKTNWLKRIYRVRPCVKCKVAPRNWKVKNKHLRIYNMCKTCFNNSIDIGDDTYHGHDDWLMYADSKEISN 80 T 0.013 ZZ pdbpssm F T 1uwi 1 A,B,C,D A,B,C,D BGAL_SULSO GLYCOSIDASE, LACTASE MYSFPNSFRFGWSQAGFQSEMGTPGSEDLNTDWYKWVHDPENMAAGLVSGDLPENGPGYWGNYKTFHNNAQKMGLKIARLNSEWSRQFPNPLPRPQNFDESKQDVTEVEINENELKRLDEYANKDALNHYREIFKDLKSRGLYFIQNMYHWPLPLWLHDPIRVRRGDFTGPSGWLSTRTVYEFARFSAYTAWKFDDLVDEYSTMNEPNVVGGLGYVGVKSGFPPGYLSFELSRRAMYNIIQAHARAYDGIKSVSKKPVGIIYANSSFQPLTDKDMEAVEMAENDNRWWFFDAIIRGEITRGNEKIVRDDLKGRLDWIGVNYYTRTVVKRTGKGYVSLGGYGHGCERNSVSLAGLPTSDFGWEFFPEGLYDVLTKYWNRYHLYMYVTENGIADDADYQRPYYLVSHVYQVHRAINSGADVRGYLHWSLADNYEWASGFSMRFGLLKVDYNTKRLYWRPSSLVYREIATNGAITDEIEHLNSVPPVKPLRH 489 T 1.3E-42 Glyco_hydro_1 unppercent F Archaea T 1v13 1 A,B A,B CEA9_ECOLX COLICIN E9 MESKRNKPGKATGKGKPVGDKWLDDAGKDSGAPIPDRIADKLRDKEFKSFDDFRKAVWEEVSKDPELSKNLNPSNKSSVSKGYSPFTPKNQQVGGRKVYELHADKPISQGGEVYDMDNIRVTTPKRHIDIHRGK 134 T 0.016 LHH unppercent F Bacteria T 1v14 1 A,B,C,D A,B,C,D CEA9_ECOLX COLICIN E9 MESKRNKPGKATGKGKPVGDKWLDDAGKDSGAPIPDRIADKLRDKEFKSFDDFRKAVWEEVSKDPELSKNLNPSNKSSVSKGYSPFTPKNQQVGGRKVYELHADKPISQGGEVYDMDNIRVTTPKRHIDIHRGK 134 T 0.016 LHH unppercent F Bacteria T 1v15 1 A,B,C,D A,B,C,D CEA9_ECOLX COLICIN E9 MESKRNKPGKATGKGKPVGDKWLDDAGKDSGAPIPDRIADKLRDKEFKSFDDFRKAVWEEVSKDPELSKNLNPSNKSSVSKGYSPFTPKNQQVGGRKVYELHADKPISQGGEVYDMDNIRVTTPKRHIDIHRGK 134 T 0.016 LHH unppercent F Bacteria T 1v1t 2 C,D S,T TNEYKV PEPTIDE TNEYKV 6 T 120 DUF1230 pdbhh F T 1v4f 1 A A collagen like peptide GPPGPPG 7 T 6.8 Milton pdbhh F F 1v4f 2 B B collagen like peptide PGPPGPP 7 T 0.67 EKLF_TAD1 pdbhh F F 1v4f 3 C C collagen like peptide PPGPPGP 7 T 0.48 DUF374 pdbhh F F 1v4q 1 A A CO7C_CONMA omega-conotoxin MVIIC CKGKGAPCRKTMYDCCKGRCGRRGRCX 27 T 0.0018 Conotoxin unphh F Eukaryota T 1v4y 1 A A Q9AGH8_ALCFA D-aminoacylase MRGSHHHHHHGSMSQPDATPFDYILSGGTVIDGTNAPGRLADVGVRGDRIAAVGDLSASSARRRIDVAGKVVSPGFIDSHTHDDNYLLKHRDMTPKISQGVTTVVTGNCGISLAPLAHANPPAPLDLLDEGGSFRFARFSDYLEALRAAPPAVNAACMVGHSTLRAAVMPDLRREATADEIQAMQALADDALASGAIGISTGAFYPPAAHASTEEIIEVCRPLITHGGVYATAMRDEGEHIVQALEETFRIGRELDVPVVISHHKVMGKLNFGRSKETLALIEAAMASQDVSLDAYPYVAGSTMLKQDRVLLAGRTLITWCKPYPELSGRDLEEIAAERGKSKYDVVPELQPAGAIYFMMDEPDVQRILAFGPTMIGSDGLPHDERPHPRLWGTFPRVLGHYSRDLGLFPLETAVWKMTGLTAAKFGLAERGQVQPGYYADLVVFDPATVADSATFEHPTERAAGIHSVYVNGAAVWEDQSFTGQHAGRVLNRAGA 496 T 2.1E-16 Amidohydro_3 pdbhh F Bacteria T 1v5a 1 A A Covalitoxin-I RCLPSGKACAGVTQKIPCCGSCVRGKCS 28 T 0.046 Conotoxin pdbhh F T 1v66 1 A A PIAS1_HUMAN PROTEIN INHIBITOR OF ACTIVATED STAT-1, PIAS-1, GU BINDING PROTEIN, GBP, RNA HELICASE II BINDING PROTEIN, DEAD/H BOX-BINDING PROTEIN 1 MADSAELKQMVMSLRVSELQVLLGYAGRNKHGRKHELLTKALHLLKAGCSPAVQMKIKELYRRRF 65 T 0.003 SAP_new25 pdbhh F Eukaryota T 1v6d 2 B B PD(AIB)L(AIB)LA PDXLXLA 7 T 5 DUF151 pdbhh F F 1v6q 1 A A Collagen like peptide GPPGPPG 7 T 6.8 Milton pdbhh F F 1v6q 2 B B Collagen like peptide PGPPGPP 7 T 0.67 EKLF_TAD1 pdbhh F F 1v6q 3 C C Collagen like peptide PPGPPGP 7 T 0.48 DUF374 pdbhh F F 1v7h 1 A A Collagen like peptide GPPGPPG 7 T 6.8 Milton pdbhh F F 1v7h 2 B B Collagen like peptide PGPPGPP 7 T 0.67 EKLF_TAD1 pdbhh F F 1v7h 3 C C Collagen like peptide PPGPPGP 7 T 0.48 DUF374 pdbhh F F 1v9t 2 C C (SIN)APA(NIT) XAPAX 5 T 1100 UPF0547 pdbhh F F 1vai 2 C C (ACE)AAPA(MCM) XAAPAX 6 T 950 A_amylase_inhib pdbhh F F 1vba 1 A,A10,A11,A12,A13,A14,A15,A2,A3,A4,A5,A6,A7,A8,A9 0,0,0,0,0,0,0,0,0,0,0,0,0,0,0 POLIOVIRUS TYPE 3 ISEV 4 T 250 Tnp_22_trimer pdbhh F F 1vbb 1 A,A10,A11,A12,A13,A14,A15,A2,A3,A4,A5,A6,A7,A8,A9 0,0,0,0,0,0,0,0,0,0,0,0,0,0,0 POLIOVIRUS TYPE 3 ISEV 4 T 250 Tnp_22_trimer pdbhh F F 1vbc 1 A,A10,A11,A12,A13,A14,A15,A2,A3,A4,A5,A6,A7,A8,A9 0,0,0,0,0,0,0,0,0,0,0,0,0,0,0 POLIOVIRUS TYPE 3 ISEV 4 T 250 Tnp_22_trimer pdbhh F F 1vbd 1 A 0 POLIOVIRUS TYPE 1 MAHONEY GSSST 5 T 190 DltD pdbhh F F 1vbe 1 A,A10,A11,A12,A13,A14,A15,A2,A3,A4,A5,A6,A7,A8,A9 0,0,0,0,0,0,0,0,0,0,0,0,0,0,0 POLIOVIRUS TYPE 3 ISEV 4 T 250 Tnp_22_trimer pdbhh F F 1vbs 2 B B TETRAPEPTIDE AXPFX 5 T 180 DUF3054 pdbhh F F 1vbt 2 C,D C,D SULFUR-SUBSTITUTED TETRAPEPTIDE AXPFX 5 T 180 DUF3054 pdbhh F F 1vd7 1 A A Q5FBS0_BOMMO FMBP-1 ETSEERAARLAKMSAYAAQRLAN 23 T 0.31 Lipase_chap unppssm F Eukaryota T 1vd8 1 A A Q5FBS0_BOMMO FMBP-1 ESPEQRATRLKRMSEYAAKRLSS 23 T 0.095 EF-1_beta_acid pdbpercent F Eukaryota T 1vd9 1 A A Q5FBS0_BOMMO FMBP-1 ETREQRAIRLARMSAYAARRLAN 23 T 0.15 DUF6366 unppercent F Eukaryota T 1vda 1 A A Q5FBS0_BOMMO FMBP-1 ETPAQRQARLLRMSAYAAKRQAS 23 T 0.15 DUF6366 unppercent F Eukaryota T 1vdb 1 A A Q5FBS0_BOMMO FMBP-1 ETSEERAARLAKMSAYAAQRLAN 23 T 0.31 Lipase_chap unppssm F Eukaryota T 1vdn 2 B B (ACE)AAPA(MCM) XAAPAX 6 T 950 A_amylase_inhib pdbhh F F 1vg0 1 A A RAE1_RAT RAB ESCORT PROTEIN 1, REP-1 MADNLPSDFDVIVIGTGLPESIIAAACSRSGQRVLHVDSRSYYGGNWASFSFSGLLSWLKEYQENNDVVTENSMWQEQILENEEAIPLSSKDKTIQHVEVFCYASQDLHKDVEEAGALQKNHASVTSAQSAEAAEAAETSCLPTAVEPLSMGSCEIPAEQSQCPGPESSPEVNDAEATGKKENSDAKSSTEEPSENVPKVQDNTETPKKNRITYSQIIKEGRRFNIDLVSKLLYSRGLLIDLLIKSNVSRYAEFKNITRILAFREGTVEQVPCSRADVFNSKQLTMVEKRMLMKFLTFCVEYEEHPDEYRAYEGTTFSEYLKTQKLTPNLQYFVLHSIAMTSETTSCTVDGLKATKKFLQCLGRYGNTPFLFPLYGQGELPQCFCRMCAVFGGIYCLRHSVQCLVVDKESRKCKAVIDQFGQRIISKHFIIEDSYLSENTCSRVQYRQISRAVLITDGSVLRTDADQQVSILTVPAEEPGSFAVRVIELCSSTMTCMKGTYLVHLTCMSSKTAREDLERVVQKLFTPYTEIEAENEQVEKPRLLWALYFNMRDSSDISRDCYNDLPSNVYVCSGPDSGLGNDNAVKQAETLFQQICPNEDFCPAPPNPEDIVLDGDSSQQEVPESSVTPETNSETPKESTVLGNPEEPSE 650 T 9.7E-16 GDI pdbpssm F Eukaryota T 1vg9 1 A,C,E,G A,C,E,G RAE1_RAT RAB ESCORT PROTEIN 1, REP-1 MADNLPSDFDVIVIGTGLPESIIAAACSRSGQRVLHVDSRSYYGGNWASFSFSGLLSWLKEYQENNDVVTENSMWQEQILENEEAIPLSSKDKTIQHVEVFCYASQDLHKDVEEAGALQKNHASVTSAQSAEAAEAAETSCLPTAVEPLSMGSCEIPAEQSQCPGPESSPEVNDAEATGKKENSDAKSSTEEPSENVPKVQDNTETPKKNRITYSQIIKEGRRFNIDLVSKLLYSRGLLIDLLIKSNVSRYAEFKNITRILAFREGTVEQVPCSRADVFNSKQLTMVEKRMLMKFLTFCVEYEEHPDEYRAYEGTTFSEYLKTQKLTPNLQYFVLHSIAMTSETTSCTVDGLKATKKFLQCLGRYGNTPFLFPLYGQGELPQCFCRMCAVFGGIYCLRHSVQCLVVDKESRKCKAVIDQFGQRIISKHFIIEDSYLSENTCSRVQYRQISRAVLITDGSVLRTDADQQVSILTVPAEEPGSFAVRVIELCSSTMTCMKGTYLVHLTCMSSKTAREDLERVVQKLFTPYTEIEAENEQVEKPRLLWALYFNMRDSSDISRDCYNDLPSNVYVCSGPDSGLGNDNAVKQAETLFQQICPNEDFCPAPPNPEDIVLDGDSSQQEVPESSVTPETNSETPKESTVLGNPEEPSE 650 T 9.7E-16 GDI pdbpssm F Eukaryota T 1vgc 1 A A CTRA_BOVIN GAMMA CHYMOTRYPSIN CGVPAIQPVLSGL 13 T 2 CaM_bind pdbhh F Eukaryota T 1vgk 3 C C syvntnmgl SYVNTNMGL 9 T 4.3 Colipase_C pdbhh F T 1vl3 1 A,B A,B GLU-SER-GLN-LEU-HIS-SER-ASN-LYS-ARG XESQLHSNKRX 11 T 11 DUF3949 pdbhh F T 1vlu 1 A,B A,B PROA_YEAST GPR, GLUTAMATE-5-SEMIALDEHYDE DEHYDROGENASE, GLUTAMYL-GAMMA-SEMIALDEHYDE DEHYDROGENASE, GSA DEHYDROGENASE MGSDKIHHHHHHMSSSQQIAKNARKAGNILKTISNEGRSDILYKIHDALKANAHAIEEANKIDLAVAKETGLADSLLKRLDLFKGDKFEVMLQGIKDVAELEDPVGKVKMARELDDGLTLYQVTAPVGVLLVIFESRPEVIANITALSIKSGNAAILKGGKESVNTFREMAKIVNDTIAQFQSETGVPVGSVQLIETRQDVSDLLDQDEYIDLVVPRGSNALVRKIKDTTKIPVLGHADGICSIYLDEDADLIKAKRISLDAKTNYPAGCNAMETLLINPKFSKWWEVLENLTLEGGVTIHATKDLKTAYFDKLNELGKLTEAIQCKTVDADEEQDFDKEFLSLDLAAKFVTSTESAIQHINTHSSRHTDAIVTENKANAEKFMKGVDSSGVYWNASTRFADGFRYGFGAEVGISTSKIHARGPVGLDGLVSYQYQIRGDGQVASDYLGAGGNKAFVHKDLDIKTVTL 468 T 9E-07 Aldedh pdbpercent F Eukaryota T 1vm2 1 A A peptide A2 GLFDKLKSLVSDFX 14 T 0.74 Antimicrobial20 pdbhh F T 1vpp 2 C,D X,Y PROTEIN (PEPTIDE V108) RGWVEICAADDYGRCLTEAQ 20 T 2.1 zf-LYAR pdbhh F T 1vqx 1 A A OPSD_BOVIN RHODOPSIN DDEASTTVSKTETSQVAPA 19 T 110 DUF5840 pdbhh F Eukaryota T 1vrk 2 B B MYLK_CHICK RS20 RRKWQKTGHAVRAIGRLSSSX 21 T 4.3 PACT_coil_coil pdbhh F Eukaryota T 1vrz 1 A A DE NOVO DESIGNED 21 RESIDUE PEPTIDE XGXAXXAXXAGGGGXALXALXAX 23 T 5.8 PRCH pdbhh F F 1vs2 2 B B TRIOSTIN A XAXXXAXX 8 T 190 RSF pdbhh F F 1vtg 2 B B triostin A XAXXXAXX 8 T 190 RSF pdbhh F F 1vtp 1 A _ Q40378_NICAL NA-PROPI SEYASKVDEYVGEVENDLQKSKVAVS 26 T 3 RasGEF_N_2 unp F Eukaryota T 1vwa 2 C,D M,P PEPTIDE LIGAND CONTAINING HPQ FSHPQNT 7 T 14 PmoA pdbhh F T 1vwb 2 B P PEPTIDE LIGAND CONTAINING HPQ XCHPQFCX 8 T 1.4 Cytochrom_C_2 pdbhh F T 1vwc 2 B P PEPTIDE LIGAND CONTAINING HPQ XCHPQFCX 8 T 1.4 Cytochrom_C_2 pdbhh F T 1vwd 2 B P PEPTIDE LIGAND CONTAINING HPQ XCHPQFCX 8 T 1.4 Cytochrom_C_2 pdbhh F T 1vwe 2 B P PEPTIDE LIGAND CONTAINING HPQ XCHPQFCX 8 T 1.4 Cytochrom_C_2 pdbhh F T 1vwf 2 B P PEPTIDE LIGAND CONTAINING HPQ XCHPQGPPCX 10 T 2.8 Defensin_int pdbhh F T 1vwg 2 B P PEPTIDE LIGAND CONTAINING HPQ XCHPQGPPCX 10 T 2.8 Defensin_int pdbhh F T 1vwh 2 B P PEPTIDE LIGAND CONTAINING HPQ XCHPQGPPCX 10 T 2.8 Defensin_int pdbhh F T 1vwi 2 C,D M,P PEPTIDE LIGAND CONTAINING HPQ HPQGPPCKX 9 T 3.4 TMEM135_C_rich pdbhh F T 1vwj 2 C,D M,P PEPTIDE LIGAND CONTAINING HPQ HPQGPPCKX 9 T 3.4 TMEM135_C_rich pdbhh F T 1vwk 2 C,D M,P PEPTIDE LIGAND CONTAINING HPQ HPQGPPCKX 9 T 3.4 TMEM135_C_rich pdbhh F T 1vwl 2 C,D M,P PEPTIDE LIGAND CONTAINING HPQ HPQGPPCKX 9 T 3.4 TMEM135_C_rich pdbhh F T 1vwm 2 B P PEPTIDE LIGAND CONTAINING HPQ XCHPQFCX 8 T 1.4 Cytochrom_C_2 pdbhh F T 1vwn 2 B P PEPTIDE LIGAND CONTAINING HPQ XCHPQFCX 8 T 1.4 Cytochrom_C_2 pdbhh F T 1vwo 2 B P PEPTIDE LIGAND CONTAINING HPQ XCHPQGPPCX 10 T 2.8 Defensin_int pdbhh F T 1vwp 2 B P PEPTIDE LIGAND CONTAINING HPQ XCHPQGPPCX 10 T 2.8 Defensin_int pdbhh F T 1vwq 2 B P PEPTIDE LIGAND CONTAINING HPQ HPQGPPCKX 9 T 3.4 TMEM135_C_rich pdbhh F T 1vwr 2 B P PEPTIDE LIGAND CONTAINING HPQ HPQGPPCKX 9 T 3.4 TMEM135_C_rich pdbhh F T 1vyj 2 B,D,F,H,J,L B,D,F,H,J,L SMALL PEPTIDE SAVLQKKITDYFHPKK SAVLQKKITDYFHPKK 16 T 2.4 Lactococcin_972 pdbhh F T 1vyq 1 A,B,C A,B,C Q8II92_PLAF7 DUTP PYROPHOSPHATASE MHLKIVCLSDEVREMYKNHKTHHEGDSGLDLFIVKDEVLKPKSTTFVKLGIKAIALQYKSNYYYKCEKSENKKKDDDKSNIVNTSFLLFPRSSISKTPLRLANSIGLIDAGYRGEIIAALDNTSDQEYHIKKNDKLVQLVSFTGEPLSFELVEELDETSRGEGGFGSTSNNKY 173 T 1.7E-06 dUTPase unppercent F Eukaryota T 1vyt 2 C,D E,F CAC1C_RAT CALCIUM CHANNEL L TYPE ALPHA-1 POLYPEPTIDE ISOFORM 1 FROM CARDIAC MUSCLE, RAT BRAIN CLASS C QKLREKQQLEEDLKGYLDWITQAED 25 T 3.1 Antimicrobial14 pdbhh F Eukaryota T 1vzj 2 I,J I,J COLQ_HUMAN COLQ, ACETYLCHOLINESTERASE-ASSOCIATED COLLAGEN, ACHE Q SUBUNIT LLTPPPPPLFPPPFF 15 T 9.3 Dicty_CAD pdbhh F Eukaryota F 1vzm 1 A,B,C A,B,C OSTCN_ARGRE OSTEOCALCIN AAKELTLAQTESLREVCETNMACDEMADAQGIVAAYQAFYGPIPF 45 T 0.5 UCMA unphh F Eukaryota T 1w0v 3 C C TISD_HUMAN EGF-RESPONSE FACTOR 2, ERF-2, TIS11D PROTEIN RRLPIFSRL 9 T 11 Imm15 pdbhh F Eukaryota T 1w0w 3 C C TISD_HUMAN EGF-RESPONSE FACTOR 2, ERF-2, TIS11D PROTEIN RRLPIFSRL 9 T 11 Imm15 pdbhh F Eukaryota T 1w3m 1 A,B,C,D,E,F,G,H,I,J,K,L A,B,C,D,E,F,G,H,I,J,K,L FRIULIMICIN, AMPHOMYCIN, A1437 B DXXXDGDGXVP 11 T 1 LCAT pdbhh F F 1w5c 7 M,N O,P PSII SUBUNIT PSBO, MSP XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 179 F F F 1w5c 8 O,Q S,U PSII SUBUNIT PSBU, PS II COMPLEX 12KDA EXTRINSIC PROTEIN, PSII-U XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 100 F F F 1w5c 10 S,T X,Y UNASSIGNED SUBUNITS XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 359 F F F 1w5u 1 A,B,C,D A,B,C,D VAL-GRAMICIDIN A XGAXAXVXWXWXWXWX 16 T 4.6 MAP17 pdbhh F F 1w6i 2 C,D B,D PEPSTATIN XVVXAX 6 T 1700 FAM60A pdbhh F F 1w7q 1 A,B,C,D,E,F A,B,C,D,E,F FEGLYMYCIN XXVXXXXXVXXFD 13 T 42 Tubulin_2 pdbhh F F 1w7r 1 A,B,C,D,E,F,G,H A,B,C,D,E,F,G,H FEGLYMYCIN XXVXXXXXVXXFD 13 T 42 Tubulin_2 pdbhh F F 1w7v 2 E,F,G,H E,F,G,H PEPTIDE VAL-PRO-LEU VPL 3 T 230 YebF pdbhh F F 1w80 2 B P SYNJ1_HUMAN SYNAPTIC INOSITOL-1,4,5-TRISPHOSPHATE 5-PHOSPHATASE 1, SYJ-P3 NPKGWVTFEEEE 12 T 0.37 Stonin2_N pdbhh F Eukaryota T 1w80 3 C Q SYNJ1_HUMAN SYNAPTIC INOSITOL-1,4,5-TRISPHOSPHATE 5-PHOSPHATASE 1, SYJ-P3 LDGFKDSFDLQG 12 T 3.8 Glycoamylase pdbhh F Eukaryota T 1w8t 1 A A Q9C171_PIREQ NON CATALYTIC PROTEIN 1 SNVRATYTVIFKNASGLPNGYDNWGWGCTLSYYGGAMIINPQEGKYGAVSLKRNSGSFRGGSLRFDMKNEGKVAILVENSEADEKFEVETISPSDEYVTYILDVDFDLPFDRIDFQDAPGNGDRIWIKNLVHSTGSADDFVDPINLEHH 149 T 0.025 ShlB pdbpercent F Eukaryota T 1w8w 1 A,B A,B Q9C171_PIREQ NON-CATALYTIC PROTEIN 1 SNVRATYTVIFKNASGLPNGYDNWGWGCTLSYYGGAMIINPQEGKAGAVSLKRNSGSFRGGSLRFDMKNEGKVKILVENSEADEKFEVETISPSDEYVTYILDVDFDLPFDRIDFQDAPGNGDRIWIKNLVHSTGSADDFVDPINLEHH 149 T 0.17 RNase_H pdbpssm F Eukaryota T 1w8z 1 A,B A,B Q9C171_PIREQ NON CATALYTIC PROTEIN 1 SNVRATYTVIFKNASGLPNGYDNWGWGCTLSYYGGAMIINPQEGKYGAVSLKRNSGSFRGGSLRFDMKNEGKVKILVENSEADEAFEVETISPSDEYVTYILDVDFDLPFDRIDFQDAPGNGDRIWIKNLVHSTGSADDFVDPINLHHH 149 T 0.23 RNase_H pdbpssm F Eukaryota T 1w90 1 A,B A,B Q9C171_PIREQ NON-CATALYTIC PROTEIN 1 SNVRATYTVIFKNASGLPNGYDNWGWGCTLSYYGGAMIINPQEGKYGAVSLKRNSGSFRGGSLRFDMKNEGKVKILVENSEADEKFEVETISPSDEYVTYILDVDFDLPFDRIAFQDAPGNGDRIWIKNLVHSTGSADDFVDPINLEHHHHHH 153 T 0.1 RNase_H pdbpssm F Eukaryota T 1w94 1 A,B A,B BRIX_METTH MIL HMLLTTSRKPSQRTRSFSQRLSRIMGWRYINRGKMSLRDVLIEARGPVAVVSERHGNPARITFLDERGGERGYILFNPSFEMKKPELADKAVRVSSCPPGSEGLCNLMGLEVDESSSRDAWSIRTDEEYAWVMELMDARGTPAGFKLLIRDFRVGE 156 T 6.4E-05 Brix unphh F Archaea T 1w9e 2 C,D,E R,S,T TNEFYF PEPTIDE TNEFYF 6 T 37 GA-like pdbhh F F 1w9f 1 A,B A,B Q9C171_PIREQ NON CATALYTIC PROTEIN 1 SNVRATYTVIFKNASGLPNGYDNWGWGCTLSYYGGAMIINPQEGKYGAVSLKRNSGSFRGGSLRFDMKNEGKVKILVENSEADEKFEVETISPSDEYVTYILDVDFDLPFDAIDFQDAPGNGDRIWIKNLVHSTGSADDFVDPINLHHH 149 T 0.26 RNase_H pdbpssm F Eukaryota T 1w9o 2 C,D S,T TNEYYV PEPTIDE TNEYYV 6 T 67 YLP pdbhh F F 1w9q 2 C S TNEFAF PEPTIDE TNEFAF 6 T 82 DUF5871 pdbhh F F 1w9r 1 A A A0A0H2US50_STRPN CBPA-R2 GSHMPEKKVAEAEKKVEEAKKKAEDQKEEDRRNYPTNTYKTLELEIAESDVEVKKAELELVKEEAKEPRNEEKVKQAKAEVESKKAEATRLEKIKTDRKKAEEEAKRKAAEEDKVKEKP 119 T 3.3 UPF0231 pdb F Bacteria T 1w9u 2 C,D C,D ARGADIN XXXHX 5 T 140 RsmF_methylt_CI pdbhh F F 1w9v 2 C,D C,D ARGIFIN XXXXX 5 T 130 DUF2015 pdbhh F F 1wa7 2 B B TIP_SHV2C TIP WDPGMPTPPLPPRPANLGERQA 22 T 0.061 EVI2A unp T Viruses T 1wak 1 A A SRPK1_HUMAN SRPK1, SRPK1A PROTEIN KINASE, SERINE/ARGININE-RICH PROTEIN SPECIFIC KINASE 1, SR-PROTEIN-SPECIFIC KINASE 1, SFRS PROTEIN KINASE 1 PEQEEEILGSDDDEQEDPNDYCKGGYHLVKIGDLFNGRYHVIRKLGWGHFSTVWLSWDIQGKKFVAMKVVKSAEHYTETALDEIRLLKSVRNSDPNDPNREMVVQLLDDFKISGVNGTHICMVFEVLGHHLLKWIIKSNYQGLPLPCVKKIIQQVLQGLDYLHTKCRIIHTDIKPENILLSVNEQYIRRLAAEATEWQRSGAPPPSGSAVSTAPATAGNFLVNPLEPKNAEKLKVKIADLGNACWVHKHFTEDIQTRQYRSLEVLIGSGYNTPADIWSTACMAFELATGDYLFEPHSGEEYTRDEDHIALIIELLGKVPRKLIVAGKYSKEFFTKKGDLKHITKLKPWGLFEVLVEKYEWSQEEAAGFTDFLLPMLELIPEKRATAAECLRHPWLNS 397 T 3.2 RFXA_RFXANK_bdg unppercent F Eukaryota T 1waw 2 B B ARGADIN XXXHX 5 T 140 RsmF_methylt_CI pdbhh F F 1wb0 2 B B ARGIFIN XXXXX 5 T 130 DUF2015 pdbhh F F 1wbp 1 A A SRPK1_HUMAN SRPK1, SRPK1A PROTEIN KINASE, SERINE/ARGININE- RICH PROTEIN SPECIFIC KINASE 1, SR-PROTEIN-SPECIFIC KINASE 1, SFRS PROTEIN KINASE 1 PEQEEEILGSDDDEQEDPNDYCKGGYHLVKIGDLFNGRYHVIRKLGWGHFSTVWLSWDIQGKKFVAMKVVKSAEHYTETALDEIRLLKSVRNSDPNDPNREMVVQLLDDFKISGVNGTHICMVFEVLGHHLLKWIIKSNYQGLPLPCVKKIIQQVLQGLDYLHTKCRIIHTDIKPENILLSVNEQYIRRLAAEATEWQRSGAPPPSGSAVSTAPATAGNFLVNPLEPKNAEKLKVKIADLGNACWVHKHFTEDIQTRQYRSLEVLIGSGYNTPADIWSTACMAFELATGDYLFEPHSGEEYTRDEDHIALIIELLGKVPRKLIVAGKYSKEFFTKKGDLKHITKLKPWGLFEVLVEKYEWSQEEAAGFTDFLLPMLELIPEKRATAAECLRHPWLNS 397 T 3.2 RFXA_RFXANK_bdg unppercent F Eukaryota T 1wbp 2 B B MAGI1_HUMAN ATROPHIN-1-INTERACTING PROTEIN 3,AIP-3,BAI1-ASSOCIATED PROTEIN 1,BAP-1,MEMBRANE-ASSOCIATED GUANYLATE KINASE INVERTED 1,MAGI-1,TRINUCLEOTIDE REPEAT-CONTAINING GENE 19 PROTEIN, WW DOMAIN-CONTAINING PROTEIN 3,WWP3,9-MER PEPTIDE RRRERSPTR 9 T 4 DUF4775 pdbhh F Eukaryota F 1wc2 1 A A GUN_MYTED CMCASE, ENDO-1,4-BETA-GLUCANASE, CELLULASE NQKCSGNPRRYNGKSCASTTNYHDSHKGACGCGPASGDAQFGWNAGSFVAAASQMYFDSGNKGWCGQHCGQCIKLTTTGGYVPGQGGPVREGLSKTFMITNLCPNIYPNQDWCNQGSQYGGHNKYGYELHLDLENGRSQVTGMGWNNPETTWEVVNCDSEHNHDHRTPSNSMYGQCQCAHQ 181 T 0.00088 DPBB_1 pdbpssm F Eukaryota T 1wco 1 A L ALA-FGA-LYS-DAL-DAL PEPTIDE AXKXX 5 T 230 OAM_dimer pdbhh F F 1wcu 1 A A Q9C171_PIREQ CBM29_1 MVSATYSVVYETGKKLNSGFDNWGWDSKMSFKDNSLVLTADPDEYGAISLKNLNSNYYGKGGCIYLQVKTETEGLVKVQGVRGYDETEAFNVGSFRSSSDFTEYKFEVDDEYQFDRIIVQDGPASNIPIYMRYIIYSTGSCDDHILEHHHHHH 153 T 0.071 YegS_C unppercent F Eukaryota T 1wcy 2 C,D C,D Diprotin A IPI 3 T 95 CBM46 pdbhh F F 1weq 1 A A PHF7_MOUSE PHD finger protein 7 GSSGSSGELEPGAFSELYQRYRHCDAPICLYEQGRDSFEDEGRWRLILCATCGSHGTHRDCSSLRPNSKKWECNECLPASGPSSG 85 T 0.00095 PHD pdb F Eukaryota T 1wfa 1 A,B A,B ANPA_PSEAM ANTIFREEZE PROTEIN ISOFORM HPLC6 DTASDAAAAAALTAANAKAAAELTAANAAAAAAATARX 38 T 9.3 DUF3157 unppssm F Eukaryota T 1wfb 1 A,B A,B ANPA_PSEAM ANTIFREEZE PROTEIN ISOFORM HPLC6 DTASDAAAAAALTAANAKAAAELTAANAAAAAAATARX 38 T 9.3 DUF3157 unppssm F Eukaryota T 1wh5 1 A A ZHD1_ARATH ZINC FINGER HOMEOBOX FAMILY PROTEIN GSSGSSGSSAEAGGGIRKRHRTKFTAEQKERMLALAERIGWRIQRQDDEVIQRFCQETGVPRQVLKVWLHNNKHSGPSSG 80 T 0.0036 Homeodomain pdbhh F Eukaryota T 1wh7 1 A A ZHD2_ARATH HYPOTHETICAL PROTEIN F22K18.140, AT4G24660/F22K18_140, ZINC FINGER HOMEOBOX FAMILY PROTEIN GSSGSSGSNPSSSGGTTKRFRTKFTAEQKEKMLAFAERLGWRIQKHDDVAVEQFCAETGVRRQVLKIWMHNNKNSGPSSG 80 T 0.0038 Homeodomain pdbhh F Eukaryota T 1wjj 1 A A Y4844_ARATH hypothetical protein F20O9.120 GSSGSSGSTVKRKPVFVKVEQLKPGTTGHTLTVKVIEANIVVPVTRKTRPASSLSRPSQPSRIVECLIGDETGCILFTARNDQVDLMKPGATVILRNSRIDMFKGTMRLGVDKWGRIEATGAASFTVKEDNNLSLVEYESGPSSG 145 T 0.13 DUF3253 unppercent F Eukaryota T 1wkr 2 B I pepstatin XVVXAX 6 T 1700 FAM60A pdbhh F F 1wnk 1 A A Q5FBS0_BOMMO FMBP-1 ETREQRAIRLARMSAYAARRLAN 23 T 0.15 DUF6366 unppercent F Eukaryota T 1wnm 1 A A Q5FBS0_BOMMO FMBP-1 ESPEQRATRLKRMSEYAAKRLSS 23 T 0.095 EF-1_beta_acid pdbpercent F Eukaryota T 1wnn 1 A A Q5FBS0_BOMMO FMBP-1 ETPAQRQARLLRMSAYAAKRQAS 23 T 0.15 DUF6366 unppercent F Eukaryota T 1wo0 1 A A TAC1_TACTR Tachyplesin I KWCFRVCYRGICYRRCRX 18 T 0.021 Myticin-prepro unp F Eukaryota T 1wo1 1 A A TAC1_TACTR Tachyplesin I KWCFRVCYRGICYRRCRX 18 T 0.021 Myticin-prepro unp F Eukaryota T 1wqb 1 A A TXP7_APOSC PARALYTIC PEPTIDE VII, PP VII WLGCARVKEACGPWEWPCCSGLKCDGSECHPQ 32 T 0.027 Toxin_7 pdbhh F Eukaryota T 1wqc 1 A A KKX21_OPIMA OmTx1 DPCYEVCLQQHGNVKECEEACKHPVE 26 T 0.023 Thionin pdb F Eukaryota T 1wqd 1 A A KKX21_OPIMA OmTx2 DPCYEVCLQQHGNVKECEEACKHPVEY 27 T 0.026 Thionin pdb F Eukaryota T 1wqe 1 A A KKX23_OPIMA OmTx3 NDPCEEVCIQHTGDVKACEEACQ 23 T 0.55 DUF1289 unphh F Eukaryota T 1wrz 2 B B DAPK2_HUMAN DAP KINASE 2, DAP- KINASE RELATED PROTEIN 1, DRP-1 RRRWKLSFSIVSLCNHLTR 19 T 4.9 AAA_lid_8 pdbhh F Eukaryota T 1ws4 2 B,D,F,H B,D,F,H LECB3_ARTIN JACALIN BETA-3 CHAIN DEQSGISQTVIVGPWGAKSA 20 T 2.8 DUF3842 pdbhh F Eukaryota T 1ws5 2 B,D,F,H B,D,F,H LECB3_ARTIN JACALIN BETA-3 CHAIN DEQSGISQTVIVGPWGAKSA 20 T 2.8 DUF3842 pdbhh F Eukaryota T 1wvm 2 C,D C,D CHYMOSTATIN FXLX 4 T 52 FAA_hydrolase_N pdbhh F F 1wzc 1 A,B A,B MPGP_PYRHO MPGP MIRLIFLDIDKTLIPGYEPDPAKPIIEELKDMGFEIIFNSSKTRAEQEYYRKELEVETPFISENGSAIFIPKGYFPFDVKGKEVGNYIVIELGIRVEKIREELKKLENIYGLKYYGNSTKEEIEKFTGMPPELVPLAMEREYSETIFEWSRDGWEEVLVEGGFKVTMGSRFYTVHGNSDKGKAAKILLDFYKRLGQIESYAVGDSYNDFPMFEVVDKVFIVGSLKHKKAQNVSSIIDVLEVIKHHHHHH 249 T 1.5E-10 Hydrolase_3 pdbpercent F Archaea T 1x2r 2 B B NF2L2_MOUSE NF-E2 RELATED FACTOR 2, NFE2-RELATED FACTOR 2, NUCLEAR FACTOR, ERYTHROID DERIVED 2, LIKE 2 LDEETGEFL 9 T 0.055 Radial_spoke unppercent F Eukaryota T 1x3c 1 A A ZN292_HUMAN Zinc finger protein 292 GSSGSSGRKKPVSQSLEFPTRYSPYRPYRCVHQGCFAAFTIQQNLILHYQAVHKSDLPAFSAEVEEESGPSSG 73 T 0.0055 zf-C2H2 unppercent F Eukaryota T 1x3z 3 C I peptide PHQ-Val-Ala-Asp-CF0 XVADX 5 T 1100 RE_HindIII pdbhh F F 1x5v 1 A A TXFK1_PSACA PcFK1 ACGILHDNCVYVPAQNPCCRGLQCRYGKCLVQVX 34 T 8.7E-05 Conotoxin unphh F Eukaryota T 1x7k 1 A A PPM1_LIMPO PV5 RRWCFRVCYRGRFCYRKCR 19 T 0.19 zf-CCHH pdbhh F Eukaryota T 1x8s 2 B B Pals1 peptide YPKHREMAVDCP 12 T 3.5 GSH_synthase pdbhh F T 1x9t 2 B,B10,B11,B12,B13,B14,B15,B16,B17,B18,B19,B2,B20,B21,B22,B23,B24,B25,B26,B27,B28,B29,B3,B30,B31,B32,B33,B34,B35,B36,B37,B38,B39,B4,B40,B41,B42,B43,B44,B45,B46,B47,B48,B49,B5,B50,B51,B52,B53,B54,B55,B56,B57,B58,B59,B6,B60,B7,B8,B9 B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B SPIKE_ADE02 N-terminal peptide of Fiber protein MKRARPSEDTFNPVYPYDTEC 21 T 0.34 DUF5449 pdbhh T Viruses T 1xb7 2 B P PRGC1_HUMAN PPAR GAMMA COACTIVATOR-1 ALPHA, PPARGC-1 ALPHA, PGC-1 ALPHA, LIGAND EFFECT MODULATOR-6 RPASELLKYLTT 12 T 4.4 MTBP_mid pdbhh F Eukaryota T 1xbh 1 A A PROTEIN (CYCLO(L-262)) CIYYKDGEALKYX 13 T 18 MORN_2 pdbhh F T 1xdh 2 C,D C,D Pepstatin XVVXAX 6 T 1700 FAM60A pdbhh F F 1xdk 3 C,D,G,H C,D,G,H MED1_MOUSE TRAP220 NHPMLMNLLKDNPA 14 T 7.6 DnaI_N pdbhh F Eukaryota T 1xe4 1 A A FEMX_WEIVI FemX PVLNLNDPQAVERYEEFMRQSPYGQVTQDLGWAKVMNNWEPVDVYLEDDQGAIIAAMSMLLGDTPTDKKFAYASKGPVMDVTDVDLLDRLVDEAVKALDGRAYVLRFDPEVAYSDEFNTTLQDHGYVTRNRNVADAGMHATIQPRLNMVLDLTKFPDAKTTLDLYPSKTKSKIKRPFRDGVEVHSGNSATELDEFFKTYTTMAERHGITHRPIEYFQRMQAAFDADTMRIFVAEREGKLLSTGIALKYGRKIWYMYAGSMDGNTYYAPYAVQSEMIQWALDTNTDLYDLGGIESESTDDSLYVFKHVFVKDAPREYIGEIDKVLDPEVYAELVKD 335 T 2.6E-25 FemAB unppssm F Bacteria T 1xf8 1 A A FEMX_WEIVI FemX PVLNLNDPQAVERYEEFMRQSPYGQVTQDLGWAKVKNNWEPVDVYLEDDQGAIIAAMSMLLGDTPTDKKFAYASKGPVMDVTDVDLLDRLVDEAVKALDGRAYVLRFDPEVAYSDEFNTTLQDHGYVTRNRNVADAGMHATIQPRLNMVLDLTKFPDAKTTLDLYPSKTKSKIKRPFRDGVEVHSGNSATELDEFFKTYTTMAERHGITHRPIEYFQRMQAAFDADTMRIFVAEREGKLLSTGIALKYGRKIWFMYAGSMDGNTYYAPYAVQSEMIQWALDTNTDLYDLGGIESESTDDSLYVFKHVFVKDAPREYIGEIDKVLDPEVYAELVKD 335 T 2.6E-25 FemAB unppssm F Bacteria T 1xga 1 A _ CAIA_CONGE GI(2-7,3-13) ECCNPACGRHYSCX 14 T 0.039 Enterotoxin_ST pdbhh F Eukaryota T 1xgb 1 A _ CAIA_CONGE GI(2-13,3-7) ECCNPACGRHYSCX 14 T 0.039 Enterotoxin_ST pdbhh F Eukaryota T 1xgc 1 A _ CAIA_CONGE ALPHA-CONOTOXIN GI ECCNPACGRHYSCX 14 T 0.039 Enterotoxin_ST pdbhh F Eukaryota T 1xgf 1 A,B A,B cocosin XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 382 F F F 1xgy 3 E,F P,Q Rhodopsin Epitope Mimetic Peptide TGALQERSK 9 T 30 BshC pdbhh F T 1xh3 3 C C aa 4-17 (LPAVVGLSPGEQEY) of alternative reading frame of M-CSF LPAVVGLSPGEQEY 14 T 6.3 DUF1127 pdbhh F T 1xhm 3 C C SIGK Peptide SIGKAFKILGYPDYD 15 T 1.1 UPF0175 pdbhh F T 1xia 1 A,B A,B D-XYLOSE ISOMERASE XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 393 F F F 1xj7 2 B B RAC3 derived peptide HKKLLQLLT 9 T 0.0026 SRC-1 pdbhh F F 1xkh 2 D,E,F I,J,K Pyoverdin C-E XRXXKXTT 8 T 12 DapH_N pdbhh F F 1xkm 1 A,C A,C Distinctin chain A ENREVPPGFTALIKTLRKCKII 22 T 4 Bradykinin pdbhh F T 1xkm 2 B,D B,D Distinctin chain B NLVSGLIEARKYLEQLHRKLKNCKV 25 T 0.35 hGDE_central pdbhh F T 1xn2 2 E,F,G,H E,F,G,H OM03-4 REWWSEVNXAEF 12 T 2.2 PHTB1_N pdbhh F T 1xn3 2 E I Peptidic inhibitor KTEEISEVNXVAEF 14 T 13 DUF1805 pdbhh F T 1xoc 2 B B Nonapeptide VDSKNTSSW VDSKNTSSW 9 T 8.4 Ac76 pdbhh F T 1xof 1 A A BBAhetT1 XYRIXSYDFXDEAEKLLRDAXG 22 T 5.7 DUF4952 pdbhh F T 1xof 2 B B BBAhetT1 XYRIXSYDFXDKFKKLLRKAXG 22 T 11 DUF1949 pdbhh F T 1xq7 2 D,E,F D,E,F CYCLOSPORINE, CICLOSPORIN, CICLOSPORINE XXXXXXXXVXA 11 T 9.7 DUF6090 pdbhh F F 1xqd 1 A A NOR_FUSOX P450NOR MASGAPSFPFSRASGPEPPAEFAKLRATNPVSQVKLFDGSLAWLVTKHKDVCFVATSEKLSKVRTRQGFPELGAGGKQAAKAKPTFVDMDPPEHMHQRSMVEPTFTPEAVKNLQPYIQRTVDDLLEQMKQKGCANGPVDLVKEFALPVPSYIIYTLLGVPFNDLEYLTQQNAIRTNGSSTARQASAANQELLDYLAILVEQRLVEPKDDIISKLCTEQVKPGNIDKSDAVQIAFLLLVAGNATMVNMIALGVATLAQHPDQLAQLKANPSLAPQFVEELCRYHTASALAIKRTAKEDVMIGDKLVRANEGIIASNQSANRDEEVFENPDEFNMNRKWPPQDPLGFGFGDHRCIAEHLAKAELTTVFSTLYQKFPDLKVAVPLGKINYTPLNRDVGIVDLPVIF 403 T 6.1E-07 p450 unp F Eukaryota T 1xqh 2 B,D B,F P53_HUMAN 9-mer peptide from tumor protein p53 LKSKKGQSTY 10 T 14 Flp_Fap pdbhh F Eukaryota T 1xqy 2 B Q PLGG PLGG 4 T 80 Prefoldin_3 pdbhh F F 1xr0 1 A A FGFR1_HUMAN FGFR-1, BFGF-R, FMS-LIKE TYROSINE KINASE-2, C-FGR HSQMAVHKLAKSIPLRRQVTVS 22 T 1.3 DUF1823 pdbhh F Eukaryota T 1xr8 3 C C EBNA3_EBVB9 EBNA-3A LEKARGSTY 9 T 4.5 DUF4872 pdbhh T Viruses T 1xrp 2 B Q PLGG PLGG 4 T 80 Prefoldin_3 pdbhh F F 1xt7 1 A A A21978C, CUBICIN WXDTGXDXDGXXX 13 T 31 Imm64 pdbhh F F 1xu6 1 A A VSM2_TRYBB VSG 221, MITAT1.2 C-TERMINAL DOMAIN GSHMLEVLTQKHKPAESQQQAAETEGSCNKKDQNECKSPCKWHNDAENKKCTLDKEEAKKVADETAKDGKTGNTNTTGSS 80 T 0.00099 Trypan_glycop_C pdbpercent F Eukaryota T 1xvk 2 B B QUINOMYCIN A XAXXXXAXXX 10 T 190 RSF pdbhh F F 1xvm 2 B B substrate tripeptide GLY-ALA-ARG GAR 3 T 360 AglB_L1 pdbhh F F 1xvn 2 B B QUINOMYCIN A XAXXXXAXXX 10 T 190 RSF pdbhh F F 1xvr 2 C,D D,E QUINOMYCIN A XAXXXAXX 8 T 190 RSF pdbhh F F 1xxp 2 C,D,E,F C,D,E,F Hexapeptide ASP-ALA-ASP-GLU-PTR-CLE XDADEXLX 8 T 1.1 Glyco_transf_92 pdbhh F F 1xxv 2 C,D,E,F C,D,E,F Epidermal growth factor receptor derived peptide XDADEXLX 8 T 1.1 Glyco_transf_92 pdbhh F F 1xxz 1 A A SRIF CKFFXXTXTSC 11 F F T 1xy4 1 A A SRIF YCKEFXXTFKSC 12 T 0.64 CRM1_repeat pdbhh F T 1xy5 1 A A SRIF YCKFEXXTFXSC 12 T 0.36 Laterosporulin pdbhh F T 1xy6 1 A A SRIF YCKFEXXTFKSC 12 T 0.41 Laterosporulin pdbhh F T 1xy8 1 A A SRIF YCKFEXXTFXSC 12 T 12 Peptidase_M24_C pdbhh F T 1xy9 1 A A SRIF CKFAXXTXTSC 11 T 0.41 DUF2195 pdbhh F T 1xyr 4 D 5 POLG_POL1M Genome polyprotein, Coat protein VP3 GLPVMNTPGSNQ 12 T 2 GSH-S_N pdbhh T Viruses T 1xyr 7 G 8 POLG_POL1M Genome polyprotein, Coat protein VP1 PALTAVETGAT 11 T 8.9 DUF6047 pdbhh T Viruses T 1y03 1 A A ANP3_MYOSC RSS3 GSMNAPARAAAKTAADALAAAKKTAADAAAAAAAA 35 T 160 DUF2443 pdbhh F Eukaryota T 1y04 1 A A ANP3_MYOSC RSS3 GSMNAPARAAAKTAADALAAAKKTAADAAAAAAAA 35 T 160 DUF2443 pdbhh F Eukaryota T 1y0y 2 B B AMASTATIN XVVD 4 T 400 Fer4 pdbhh F F 1y19 1 A,C,E,G,I,K A,C,E,G,I,K PI51C_MOUSE Phosphatidylinositol-4-phosphate 5-kinase, type 1 gamma DERSWVYSPLHYSA 14 T 2.2 Invas_SpaK pdbhh F Eukaryota T 1y29 1 A A TXH10_HAPSC huwentoxin-x KCLPPGKPCYGATQKIPCCGVCSHNKCT 28 T 0.0049 Conotoxin unp F Eukaryota T 1y3a 2 E,F,G,H E,F,G,H KB752 peptide SRVTWYDFLMEDTKSR 16 T 2.1 DUF2760 pdbhh F T 1y5c 1 A A Q0PGA5_BUBBU LACTOFERRIN, LACTOFERRICIN B, LFCIN B RRWQWRMKKLG 11 T 0.00046 Transferrin unppercent F Eukaryota T 1y7a 1 A,B A,B PPB_ECOLI APASE TPEMPVLENRAAQGDITAPGGARRLTGDQTAALRDSLSDKPAKNIILLIGDGMGDSEITAARNYAEGAGGFFKGIDALPLTGQYTHYALNKKTGKPDYVTDSAASATAWSTGVKTYNGALGVDIHEKDHPTILEMAKAAGLATGNVSTAELQHATPAALVAHVTSRKCYGPSATSEKCPGNALEKGGKGSITEQLLNARADVTLGGGAKTFAETATAGEWQGKTLREQAQARGYQLVSDAASLNSVTEANQQKPLLGLFADGNMPVRWLGPKATYHGNIDKPAVTCTPNPQRNDSVPTLAQMTDKAIELLSKNEKGFFLQVEGASIDWQDHAANPCGQIGETVDLDEAVQRALEFAKKEGNTLVIVTADHAHASQIVAPDTKAPGLTQALNTKDGAVMVMSYGNSEEDSQEHTGSQLRIAAYGPHAANVVGLTDQTDLFYTMKAALGLK 449 T 6.8E-10 Alk_phosphatase unppssm F Bacteria T 1y7l 2 B P CYSE_HAEIN SAT FRAGMENT; E.C.2.3.1.30 GIDDGMNLNI 10 T 18 DUF2523 pdbhh F Bacteria T 1y98 2 B B CTIP_HUMAN CtIP PHOSPHORYLATED PEPTIDE PTRVSSPVFGAT 12 T 4.3 Pardaxin pdbhh F Eukaryota T 1yc5 2 B B P53_HUMAN TUMOR SUPPRESSOR P53, PHOSPHOPROTEIN P53, ANTIGEN NY-CO-13 KKGQSTSRHKXLMFKTEG 18 T 30 Class_IIIsignal pdbhh F Eukaryota T 1ycp 3 C,G F,N FIBA_HUMAN FIBRINOPEPTIDE A-ALPHA ADSGEGDYLAEGGGVRGPRVVER 23 T 1.1 DUF2388 pdbhh F Eukaryota T 1ygu 2 C,D C,D MT_POVMA Polyoma Middle T antigen PTXS 4 T 110 LigXa_C pdbhh T Viruses F 1yit 5 E 8 VIRGINIAMYCIN FACTOR S1, VIRGINIAMYCIN S XTXPXXX 7 T 260 zf-C2H2_jaz pdbhh F F 1yjm 2 D,E,F E,F,G X-RAY REPAIR CROSS-COMPLEMENTING PROTEIN 4 XYDESTDEESEKK 13 T 0.6 XRCC4 pdbhh F T 1yjo 1 A A ERF3_YEAST ERF2, TRANSLATION RELEASE FACTOR 3, ERF3, ERF-3, OMNIPOTENT SUPPRESSOR PROTEIN 2, G1 TO S PHASE TRANSITION PROTEIN 1 NNQQNY 6 T 1.3 TFIIA unppssm F Eukaryota F 1yjp 1 A A ERF3_YEAST ERF2, TRANSLATION RELEASE FACTOR 3, ERF3, ERF-3, OMNIPOTENT SUPPRESSOR PROTEIN 2, G1 TO S PHASE TRANSITION PROTEIN 1 GNNQQNY 7 T 1.3 TFIIA unppssm F Eukaryota F 1yjw 5 E 4 QUINUPRISTIN XTXPXXXX 8 T 290 zf-C2H2_jaz pdbhh F F 1yl8 1 A A [Tyr3]Octreotate peptide XCYXKTCT 8 T 0.027 Urotensin_II pdbhh F F 1yl9 1 A A [Tyr3]Octreotate XCYXKTCT 8 T 0.027 Urotensin_II pdbhh F F 1ym0 2 B B fibrinotic enzyme component B QPPVWYPGGQCGVSQYSDAGDMELPPG 27 T 2.3 PHM7_ext pdbhh F T 1ym2 2 D,E,F X,Y,Z NVP-AUR200 INHIBITOR XLMXVX 6 T 1000 XAP5 pdbhh F F 1ym4 2 D,E,F X,Y,Z NVP-AMK640 INHIBITOR EVNXA 5 T 200 GSH-S_ATP pdbhh F F 1ymt 2 B B NR0B2_MOUSE ORPHAN NUCLEAR RECEPTOR SHP, SMALL HETERODIMER PARTNER RPTILYALLSPSPR 14 T 0.086 NR_Repeat unphh F Eukaryota T 1yow 2 B B TIF2 peptide AQALAALLAKA 11 T 17 DUF4699 pdbhh F F 1yp0 2 B B NR0B2_RAT Nuclear receptor subfamily 0, group B, member 2 HPTILYTLLSPG 12 T 0.02 NR_Repeat unphh F Eukaryota T 1yp1 2 B B KNL KNL 3 T 390 PH_6 pdbhh F F 1yph 1 A,D A,B CTRA_BOVIN ALPHA-CHYMOTRYPSIN, CHYMOTRYPSINOGEN A CGVPAIQPVLSGL 13 T 2 CaM_bind pdbhh F Eukaryota T 1yr5 2 B B DAPK1_HUMAN DAP KINASE 1 RKKWKQSVRLISLCQRLSR 19 T 11 AAA_lid_8 pdbhh F Eukaryota T 1yrk 2 B B 13-residue peptide MALYSIXQPYVFA 13 T 1.5 Monellin pdbhh F T 1yt6 1 A A peptide SD ACLPWSDGPC 10 T 0.31 VERL pdbhh F T 1ytg 2 C I PEPTIDE PRODUCT PIVX 4 T 290 DUF1996 pdbhh F F 1yth 2 C I PEPTIDE PRODUCT XSLNF 5 T 240 DBINO pdbhh F F 1yti 2 B I PEPTIDE PRODUCT FLEK 4 T 97 mIF3 pdbhh F F 1ytj 2 B I PEPTIDE PRODUCT XEAXS 5 T 69 HET pdbhh F F 1ytr 1 A A PLNA_LACPL Bacteriocin plantaricin A KSSAYSLQMGATAIKQVKKLFKKWGW 26 T 0.045 Bacteriocin_IIc unp F Bacteria T 1yuc 2 C,D C,D NR0B2_HUMAN ORPHAN NUCLEAR RECEPTOR SHP; SMALL HETERODIMER PARTNER ASRPAILYALLSSS 14 T 9.2 NR_Repeat unphh F Eukaryota T 1yvh 2 B B SH2B2_RAT APS ADAPTER PROTEIN GRARAVENQXSFY 13 T 4 UPF0542 pdbhh F Eukaryota T 1yvl 2 C,D C,D 5-residue peptide XDKPH 5 T 84 fvmJAB_N pdbhh F F 1ywh 2 B,D,F,H,J,L,N,P B,D,F,H,J,L,N,P antagonist peptide KSDXFXXYLWSSK 13 T 3.5 EF-hand_like pdbhh F T 1ywi 2 B B Formin APPTPPPLPP 10 T 11 SCIMP pdbhh F F 1ywo 2 B P LCP2_HUMAN SH2 DOMAIN-CONTAINING LEUCOCYTE PROTEIN OF 76 KDA, SLP-76 TYROSINE PHOSPHOPROTEIN, SLP76 QPPVPPQRPM 10 T 11 HCV_NS5a_C pdbhh F Eukaryota F 1ywt 2 C,D C,D synthetic optimal phosphopeptide (mode-1) MARSHSYPAGKK 12 T 8.4 Ribosomal_S13_N pdbhh F T 1yxn 1 A,B,C A,B,C LATE PROTEIN GP8 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 360 F F F 1yy2 1 A A Leuprolide QHWSYXLRPX 10 T 0.00076 GnRH pdb F T 1yy6 2 B B EBNA1_EBVB9 EBNA1 DPGEGPSTGP 10 T 0.18 Herpes_IE1 pdbhh T Viruses T 1yyp 2 B B DPOL_HCMVA POL LPRRLHLEPAFLPYSVKAHECC 22 T 2.8 TP53IP5 pdbhh T Viruses T 1z56 3 D D Ligase interacting factor 1 XXXXXXXX 8 F F F 1z56 4 E,H E,H Ligase interacting factor 1 XXXXXXX 7 F F F 1z56 5 F F Ligase interacting factor 1 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 45 F F F 1z56 6 G G Ligase interacting factor 1 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 37 F F F 1z56 7 I I Ligase interacting factor 1 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 30 F F F 1z56 8 J J Ligase interacting factor 1 XXXXXXXXXXXXXXXXXXXX 20 F F F 1z56 9 K K Ligase interacting factor 1 XXXXXXXXXXXXX 13 F F F 1z7k 3 C C IOVO_MELGA Ovomucoid TNEE 4 T 0.029 Kazal_1 unphh F Eukaryota F 1z7z 4 D 4 POLG_CXA21 human coxsackievirus A21 LPLTKVDSITTF 12 T 7 Flot pdbhh T Viruses T 1z7z 5 E 5 human coxsackievirus A21 LIGRTQ 6 T 33 Corona_7 pdbhh F T 1z8g 2 B L ACE-LYS-GLN-LEU-ARG-Chloromethylketone XKQLXX 6 T 240 DEK_C pdbhh F F 1z9o 2 G,H,I,J,K,L G,H,I,J,K,L OSBL1_RAT ORP-1 SEDEFYDALS 10 T 1.2 AAA_assoc_2 pdbhh F Eukaryota T 1zb5 2 B,C B,C PEPTIDE TRP-PRO-TRP WPW 3 T 22 Sex_peptide pdbhh F F 1zbc 2 B C 3 mer peptide WPW 3 T 22 Sex_peptide pdbhh F F 1zbk 2 B C PEPTIDE TRP-PRO-TRP WPW 3 T 22 Sex_peptide pdbhh F F 1zbv 2 B B WPW WPW 3 T 22 Sex_peptide pdbhh F F 1zbw 2 B D WPW WPW 3 T 22 Sex_peptide pdbhh F F 1zea 3 C A short synthetic D-amino acid peptide D2 XXGXXXXXX 9 T 0.0094 Enterotoxin_b pdbhh F F 1zfi 1 A A MCPI_HIRME LEECH CARBOXYPEPTIDASE INHIBITOR, LCI, INHIBITOR OF A/B METALLOCARBOXYPEPTIDASES GSHTPDESFLCYQPDQVCCFICRGAAPLPSEGECNPHPTAPWCREGAVEWVPYSTGQCRTTCIPYVE 67 T 0.0093 Inhibitor_I68 pdb F Eukaryota T 1zfl 1 A A MCPI_HIRME LEECH CARBOXYPEPTIDASE INHIBITOR, LCI, INHIBITOR OF A/B METALLOCARBOXYPEPTIDASES GSHTPDESFLCYQPDQVCCFICRGAAPLPSEGECNPHPTAPWCREGAVEWVPYSTGQCRTTCIPYVE 67 T 0.0093 Inhibitor_I68 pdb F Eukaryota T 1zfp 2 B I 2-ABZ-GLU-TYR(PO3H2)-ILE-ASN-GLN-NH2, WITH 2-ABZ BEING 2-AMINO-BENZOYL XEXINQX 7 T 10 GAPES1 pdbhh F F 1zh7 2 C,D C,D NR0B2_RAT nuclear receptor subfamily 0, group B, member 2 SHPTILYTLLS 11 T 0.02 NR_Repeat unphh F Eukaryota T 1zhb 3 C,F,I,L C,F,I,L DOPO_RAT DOPAMINE BETA- HYDROXYLASE, DBH KALYNYAPI 9 T 0.55 Ntox47 pdbhh F Eukaryota T 1zhk 3 C C EBV-peptide LPEPLPQGQLTAY LPEPLPQGQLTAY 13 T 14 AP-5_subunit_s1 pdbhh F T 1zhl 3 C C EBV-peptide LPEPLPQGQLTAY LPEPLPQGQLTAY 13 T 14 AP-5_subunit_s1 pdbhh F T 1zi7 1 A,B,C A,B,C KES1_YEAST OXYSTEROL-BINDING PROTEIN HOMOLOG 4 GAMDPPFILSPISLTEFSQYWAEHPELFLEPSFINDDNYKEHCLIDPEVESPELARMLAVTKWFISTLKSQYCSRNESLGSEKKPLNPFLGELFVGKWENKEHPEFGETVLLSEQVSHHPPVTAFSIFNDKNKVKLQGYNQIKASFTKSLMLTVKQFGHTMLDIKDESYLVTPPPLHIEGILVASPFVELEGKSYIQSSTGLLCVIEFSGVDGKKNSFKARIYKDSKDSKDKEKALYTISGQWSGSSKIIKANKKEESRLFYDAARIPAEHLNVKPLEEQHPLESRKAWYDVAGAIKLGDFNLIAKTKTELEETQRELRKEEEAKGISWQRRWFKDFDYSVTPEEGALVPEKDDTFLKLASALNLSTKNAPSGTLVGDKEDRKEDLSSIHWRFQRELWDEEKEIVL 406 T 2.8E-13 Oxysterol_BP unppssm F Eukaryota T 1zkf 2 C,D C,D Suc-ALA-GLY-PRO-PHE-pNA XAGPFX 6 T 4.4 DUF4387 pdbhh F F 1zkk 2 E,F,G,H E,F,G,H H4_HUMAN Peptide corresponding to residues 15-24 of histone H4 AKRHRKVLRD 10 T 0.27 UPF0137 unp F Eukaryota T 1zl1 2 B C TRP-HIS-TRP peptide WHW 3 T 22 Molybdopterin_N pdbhh F F 1zla 6 K K Q9DUM3_HHV8 latent nuclear antigen MAPPGMRLRSGRSTGAPLTRGS 22 T 7.1 PolC_DP2 pdbhh T Viruses T 1zm6 2 B P designed penta peptide Leu-Ala-Ile-Tyr-Ser LAIYS 5 T 86 KH_7 pdbhh F F 1zns 2 C A CEA7_ECOLX Colicin E7 MESKRNKPGKATGKGKPVNNKWLNNAGKDLGSPVPDRIANKLRDKEFKSFDDFRKKFWEEVSKDPELSKQFSRNNNDRMKVGKAPKTRTQDVSGKRTSFELHEEKPISQNGGVYDMDNISVVTPKRHIDIHRGK 134 T 0.026 HNH pdbpercent F Bacteria T 1znv 2 B,D B,D CEA7_ECOLX Colicin E7 MESKRNKPGKATGKGKPVNNKWLNNAGKDLGSPVPDRIANKLRDKEFKSFDDFRKKFWEEVSKDPELSKQFSRNNNDRMKVGKAPKTRTQDVSGKRTSFELHEEKPISQNGGVYDMDNISVVTPKRHIDIHRGK 134 T 0.026 HNH pdbpercent F Bacteria T 1zpx 1 A A Q00484_9CNID mini-collagen XPCGSYCPSVCAPACAPVCCYPX 23 T 0.047 C_tripleX pdbhh F Eukaryota T 1zrv 1 A A SPING_PSEUS spinigerin HVDKKVADKVLLLKQLRIMRLLTRL 25 T 5.4 YfbU pdbhh F Eukaryota T 1zrw 1 A A SPING_PSEUS spinigerin HVDKKVADKVLLLKQLRIMRLLTRL 25 T 5.4 YfbU pdbhh F Eukaryota T 1zsd 3 C C BZLF1_EBVB9 EB1, ZEBRA EPLPQGQLTAY 11 T 7.4 AP-5_subunit_s1 pdbhh T Viruses T 1zsg 2 B B PAK1_HUMAN P21-ACTIVATED KINASE 1, PAK-1, P65-PAK, ALPHA-PAK, PAK PEPTIDE DATPPPVIAPRPEHTKSVYTRS 22 T 1.4 TFIIA unppercent F Eukaryota T 1zt1 3 C P Influenza virus epitope, FEANGNLI FEANGNLI 8 T 0.046 TTc_toxin_rep pdbhh F T 1zt7 3 E,F P,Q SV40 epitope, SEFLLEKRI SEFLLEKRI 9 T 20 OMS28_porin pdbhh F T 1ztz 2 C P autoproteolytic tetrapeptide AGAA 4 T 450 DUF3824 pdbhh F F 1zub 2 B B RB6I2_RAT ERC PROTEIN 1, ERC1, CAZ-ASSOCIATED STRUCTURAL PROTEIN 2, CAST2, RAB6 INTERACTING PROTEIN 2, C-TERMINAL PEPTIDE CDQDEEEGIWA 11 T 1.8 MgrB pdbhh F Eukaryota T 1zuk 2 C C LAS17_YEAST Proline-rich protein LAS17 RGPAPPPPPHR 11 T 2.2 Dscam_C pdbhh F Eukaryota F 1zuz 2 B B DAPK2_HUMAN DRP-1 kinase RRRWKLDFSIVSLCNHLTR 19 T 5 AAA_lid_8 pdbhh F Eukaryota T 1zvs 3 C,F C,F Tat-Tl8 TTPESANL 8 T 70 SepA pdbhh F T 1zx3 1 A A Q82XL7_NITEU hypothetical protein NE0241 MGSSHHHHHHSSGRENLYFQGHMGKKKNKKTEVQQPDPMRKNWIMENMDSGVIYLLESWLKAKSQETGKEISDIFANAVEFNIVLKDWGKEKLEETNTEYQNQQRKLRKTYIEYYDREMKGS 122 T 0.0013 Ets pdbpssm F Bacteria T 1zy1 2 C,D D,E tripeptide fragment MAS 3 T 280 zf-C2H2_4 pdbhh F F 1zy6 1 A,B A,B PG1_PIG PG-1, NEUTROPHIL PEPTIDE 1 RGGRLCYCRRRFCVCVGRX 19 T 0.16 Defensin_1 pdbhh F Eukaryota T 1zys 2 B B pentapeptide fragment ASVSA 5 T 490 DUF2121 pdbhh F F 1zzd 2 B B RIR4_YEAST RIBONUCLEOTIDE REDUCTASE SMALL SUBUNIT 2 KEINFDDDF 9 T 8.7 Etd1 pdbhh F Eukaryota T 209d 2 C C N8-ACTINOMYCIN D TXPXXXTXPXX 11 T 61 DUF5572 pdbhh F F 2a1m 1 A,B A,B CPXA_PSEPU CAMPHOR 5-MONOOXYGENASE, P450CAM MTTETIQSNANLAPLPPHVPEHLVFDFDMYNPSNLSAGVQEAWAVLQESNVPDLVWTRCNGGHWIATRGQLIREAYEDYRHFSSECPFIPREAGEAYDFIPTSMDPPEQRQFRALANQVVGMPVVDKLENRIQELACSLIESLRPQGQCNFTEDYAEPFPIRIFMLLAGLPEEDIPHLKYLTDQMTRPDGSMTFAEAKEALYDYLIPIIEQRRQKPGTDAISIVANGQVNGRPITSDEAKRMCGLLLVGGLDTVVNFLSFSMEFLAKSPEHRQELIERPERIPAACEELLRRFSLVADGRILTSDYEFHGVQLKKGDQILLPQMLSGLDERENAAPMHVDFSRQKVSHTTFGHGSHLCLGQHLARREIIVTLKEWLTRIPDFSIAPGAQIQHKSGIVSGVQALPLVWDPATTKAV 415 T 1.6E-05 p450 unppercent F Bacteria T 2a2x 3 C P synthetic peptide XYEPIPEEXXQ 11 T 0.91 Hirudin pdbhh F T 2a3d 1 A A PROTEIN (DE NOVO THREE-HELIX BUNDLE) MGSWAEFKQRLAAIKTRLQALGGSEAELAAFEKEIAAFESELQAYKGKGNPEVEALRKEAAAIRDELQAYRHN 73 T 0.0085 DUF1202 pdb F T 2a3i 2 B B NCOA1_HUMAN NCOA-1, STEROID RECEPTOR COACTIVATOR-1, SRC-1, RIP160, HIN-2 PROTEIN QQKSLLQQLLTE 12 T 3.8 GFD1 pdbhh F Eukaryota T 2a4j 2 B B XPC_HUMAN XERODERMA PIGMENTOSUM GROUP C COMPLEMENTING PROTEIN P125 NWKLLAKGLLIRERLKR 17 T 9.9 MazG_C pdbhh F Eukaryota T 2a5z 1 A,B,C A,B,C Q8ED25_SHEON hypothetical protein SO2946 MGGSFGKKGASSATAAQVPLATETTPGLMSPSEKLKLSTLTTSIATSDFYASYDFMMHSIGLTSANNISLLSTGNISLQNILSEGNHFGVQPIVSSTTANASFLAGMLMAIFPKESELEVTVYFKTPSAFNPAQLTVIGSTSIGLGISDRSGLIIENGNAFGGIVKASAATETGSTYALSTSTWYICKFKMLTDDRFKVTLYSDSGTQLYSYTSTAAMFRADNATAHIGFKTQCKTATAGISLISIDLIEFKAKVSATRAKV 262 T 33 DUF1652 pdbhh F Bacteria T 2a6d 3 E P Dodecapeptide, RLLIADPPSPRE RLLIADPPSPRE 12 T 2.3 DUF4666 pdbhh F T 2a6i 3 C P Dodecapeptide: KLASIPTHTSPL KLASIPTHTSPL 12 T 18 IML1 pdbhh F T 2a6k 3 E P DODECAPEPTIDE: SLGDNLTNHNLR SLGDNLTNHNLR 12 T 8.5 DUF764 pdbhh F T 2a79 3 C C poly-unknown chain XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 52 F F F 2a79 4 D D poly-unknown chain XXXXXXXXXXXXXXXXXXXXX 21 F F F 2a7u 1 A A ATPA_ECO57 F-ATPASE ALPHA CHAIN MQLNSTEISELIKQRIAQFNVV 22 T 0.5 GnsAB_toxin pdbhh F Bacteria T 2a83 3 C C GLR_HUMAN THE GLUCAGON RECEPTOR (GR) PEPTIDE RRRWHRWRL 9 T 1 DUF3019 pdbhh F Eukaryota F 2a9x 2 B 1 BIV-2 cyclic peptide RVRTRGKRRIRVPP 14 T 1.6 YhdX pdbhh F T 2ab9 1 A A SFTI1_HELAN pro-SFTI-1 GYKTSISTITIEDNGRCTKSIPPICFPDGRP 31 T 0.015 Bowman-Birk_leg pdb F Eukaryota T 2abz 2 C,D,E,F C,D,E,F MCPI_HIRME LEECH CARBOXYPEPTIDASE INHIBITOR, LCI, INHIBITOR OF A/B METALLOCARBOXYPEPTIDASES GSHTPDESFLCYQPDQVCAFICRGAAPLPSEGECNPHPTAPWAREGAVEWVPYSTGQCRTTCIPYVE 67 T 0.0093 Inhibitor_I68 pdbpssm F Eukaryota T 2ad9 2 B A PTBP1_HUMAN PTB, HETEROGENEOUS NUCLEAR RIBONUCLEOPROTEIN I, HNRNP I, 57 KDA RNA-BINDING PROTEIN PPTB-1 MGSSHHHHHHSSGLVPRGSHMGDSRSAGVPSRVIHIRKLPIDVTEGEVISLGLPFGKVTNLLMLKGKNQAFIEMNTEEAANTMVNYYTSVTPVLRGQPIYIQFSNHKELKTDSSPNQAR 119 T 0.00041 RRM_1 pdbpssm F Eukaryota T 2adw 2 E,F,G,H H,I,J,K QUINOMYCIN A XAXXXXAXXX 10 T 190 RSF pdbhh F F 2ag3 1 A A GCN4-pLI RMKQIEDKLEEILSXYHIENELARIKKLLGER 32 T 0.0053 VGPC1_C pdbhh F T 2age 2 B A succinyl-Ala-Ala-Pro-Arg XAAPR 5 T 570 SAMP pdbhh F F 2agg 2 B A succinyl-Ala-Ala-Pro-Lys XAAPK 5 T 450 Ldr_toxin pdbhh F F 2agh 3 C C KMT2A_HUMAN ALL-1, TRITHORAX-LIKE PROTEIN SDDGNILPSDIMDFVLKNTPSMQALGESPES 31 T 9.5 ComFB pdbhh F Eukaryota T 2agi 2 B A leupeptin XLLX 4 T 590 DNTTIP1_dimer pdbhh F F 2ain 2 B B BCR_HUMAN 6-mer peptide from Breakpoint cluster region protein LFSTEV 6 T 79 DUF3916 pdbhh F Eukaryota T 2aiz 2 B U L-alanyl-D-glutamyl-meso-2,6-diaminopimeloyl-D-alanyl-D-alanine AXXXX 5 T 230 OAM_dimer pdbhh F F 2ajj 1 A A POLG_BVDVC NS5A SGNYVLDLIYSLHKQINRGLKKIVLGWA 28 T 3.9 DUF5103 pdbhh T Viruses T 2ajm 1 A A POLG_BVDVC NS5A SGNYVLDLIYSLHKQINRGLKKIVLGWA 28 T 3.9 DUF5103 pdbhh T Viruses T 2ajn 1 A A POLG_BVDVC NS5A SGNYVLDLIYSLHKQINRGLKKIVLGWA 28 T 3.9 DUF5103 pdbhh T Viruses T 2ajo 1 A A POLG_BVDVC NS5A SGNYVLDLIYSLHKQINRGLKKIVLGWA 28 T 3.9 DUF5103 pdbhh T Viruses T 2ak4 3 C,H,M,R C,H,M,S BZLF1_EBVB9 EBV peptide LPEPLPQGQLTAY LPEPLPQGQLTAY 13 T 14 AP-5_subunit_s1 pdbhh T Viruses T 2ak5 2 C D CBLB_HUMAN 8-residue peptide from a signal transduction protein CBL-B RPPKPRPR 8 T 5.1 Hap4_Hap_bind pdbhh F Eukaryota F 2aka 2 B L LINKER TRLVPRGSELALE 13 T 7.8 SsgA pdbhh F T 2amn 1 A A CTHL1_CHICK cathelicidin RVKRVWPLVIRTVIAGYNLYRAIKKK 26 T 2.2 Phage_coatGP8 pdbhh F Eukaryota T 2amq 2 C,D C,D N-[(5-METHYLISOXAZOL-3-YL)CARBONYL]ALANYL-L-VALYL-N~1~-((1R,2Z)-4-(BENZYLOXY)-4-OXO-1-{[(3R)-2-OXOPYRROLIDIN-3-YL]METHYL}BUT-2-ENYL)-L-LEUCINAMIDE XAVLXX 6 T 1700 zf-H2C2_5 pdbhh F F 2an6 2 E,F,G,H E,F,G,H peptide from Phyllopod LQQERTKLRPVAMVRPTVRVQPQL 24 T 5.9 PRR20 pdbhh F T 2anh 1 A,B A,B PPB_ECOLI ALKALINE PHOSPHATASE MPVLENRAAQGDITAPGGARRLTGDQTAALRDSLSDKPAKNIILLIGDGMGDSEITAARNYAEGAGGFFKGIDALPLTGQYTHYALNKKTGKPDYVTDSAASATAWSTGVKTYNGALGVDIHEKDHPTILEMAKAAGLATGNVSTAELQHATPAALVAHVTSRKCYGPSATSEKCPGNALEKGGKGSITEQLLNARADVTLGGGAKTFAETATAGEWQGKTLREQAQARGYQLVSDAASLNSVTEANQQKPLLGLFADGNMPVRWLGPKATYHGNIDKPAVTCTPNPQRNDSVPTLAQMTDKAIELLSKNEKGFFLQVEGASIDKQDHAANPCGQIGETVDLDEAVQRALEFAKKEGNTLVIVTADHAHASQIVAPDTKAPGLTQALNTKDGAVMVMSYGNSEEDSQEHTGSQLRIAAYGPHAANVVGLTDQTDLFYTMKAALGLK 446 T 6.8E-10 Alk_phosphatase unppssm F Bacteria T 2ank 3 C P synthetic peptide XYEPIPEEXXQ 11 T 0.91 Hirudin pdbhh F T 2aof 2 C C PEPTIDE INHIBITOR RPGNXLQSRPX 11 T 46 IBV_3A pdbhh F T 2aoh 2 C C PEPTIDE INHIBITOR VSFNXPQITA 10 T 9.6 DUF3912 pdbhh F T 2aoi 2 C C PEPTIDE INHIBITOR RPGNXLQSRPX 11 T 46 IBV_3A pdbhh F T 2aoj 2 C C PEPTIDE INHIBITOR VSFNXPQITAAX 12 T 16 DUF3912 pdbhh F T 2aos 2 B D Trp-Pro-Trp tripeptide WPW 3 T 22 Sex_peptide pdbhh F F 2ap2 2 C,D P,Q MDR1_CRIGR EPITOPE PEPTIDE VVQEALDKAREGRT 14 T 10 Dodecin pdbhh F Eukaryota T 2ap7 1 A A BMNH5_BOMVA Bombinin H2 IIGPVLGLVGSALGGLLKKI 20 T 0.00098 Bombinin pdb F Eukaryota T 2ap8 1 A A BMNH5_BOMVA bombinin H4 IXGPVLGLVGSALGGLLKKI 20 T 0.00098 Bombinin pdb F Eukaryota T 2aph 2 C,D C,D muramyl pentapeptide XAXKXXX 7 T 320 DUF2175 pdbhh F F 2aq9 2 B X peptide inhibitor SSGWMLDPIAGKWSR 15 T 0.11 Kelch_1 pdb F T 2asq 2 B B PIAS2_HUMAN PROTEIN INHIBITOR OF ACTIVATED STAT X, MSX-INTERACTING ZINC FINGER PROTEIN, MIZ1, DAB2-INTERACTING PROTEIN, DIP, ANDROGEN RECEPTOR-INTERACTING PROTEIN 3, ARIP3, PIAS-NY PROTEIN KVDVIDLTIESSSDEEEDPPAKRQM 25 T 0.026 EF-1_beta_acid pdb F Eukaryota T 2ast 4 D D CDN1B_HUMAN CYCLIN-DEPENDENT KINASE INHIBITOR P27, P27KIP1 AGSVEQTPKK 10 T 52 DUF1850 pdbhh F Eukaryota T 2asu 1 A A HGFL_HUMAN MACROPHAGE STIMULATORY PROTEIN, MSP, MACROPHAGE STIMULATING PROTEIN FEKCGKRVDRLDQRRSKLR 19 F F Eukaryota T 2atg 1 A A Retrocyclin-2 RRICRCICGRGICRCICG 18 T 0.24 HECA pdbhh F F 2atp 2 B,E E,F artifact linker AGSADDARKDAARKDDARKDDARKDGSSA 29 T 78 Oberon_cc pdbhh F T 2auc 2 D D MYOA_PLAYO MYOA XLMRVQAHIRKRMVA 15 T 0.063 BORCS8 pdbhh F Eukaryota T 2aw6 2 C,D E,F peptide LVTLVFV 7 T 37 TMEM156 pdbhh F F 2awu 2 C C AHH AHH 3 T 210 Cu2_monooxygen pdbhh F F 2aww 2 C C GRIA1_RAT 18-RESIDUE C-TERMINAL PEPTIDE FROM GLUR-A SIPCMSHSSGMPLGATGL 18 T 4.1 Glyco_hydr_116N pdbhh F Eukaryota T 2ax3 2 B B unknown peptide XXWXFHXX 8 T 29 AnfG_VnfG pdbhh F F 2axf 3 C C BZLF1_EBVB9 EBV, EB1, ZEBRA APQPAPENAY 10 T 0.25 Mucin-like unp T Viruses T 2axg 3 C C BZLF1_EBVB9 EBV, EB1, ZEBRA APQPAPENAY 10 T 0.25 Mucin-like unp T Viruses T 2axi 2 B B cyclic 8-mer peptide PFEXLDWEFX 10 T 0.48 Pico_P2B pdbhh F T 2axt 17 IA,Q x,X Unassigned subunits XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 129 F F F 2axz 2 E,F,G E,F,H LVTLVFV peptide LVTLVFV 7 T 37 TMEM156 pdbhh F F 2axz 3 H I TPPKEVT(MSE) peptide TPPKEVTM 8 T 21 TrwC pdbhh F T 2azm 2 C,D C,D H2AX_HUMAN HISTONE H2AFX KKATQASQEY 10 T 16 Class_IIIsignal pdbhh F Eukaryota T 2b05 2 B,D,F,H,J,L G,H,I,J,K,L peptide RAISLP 6 T 54 Tryp_alpha_amyl pdbhh F T 2b0f 2 B B Ace-LEALFQ-ethylpropionate XLEALFX 7 T 6.7 DUF3674 pdbhh F F 2b19 1 A A TKN1_HUMAN NPK DADSSIEKQVALLKALYGHGQISHKRHKTDSFVGLM 36 T 0.0027 Tachykinin pdbhh F Eukaryota T 2b1j 2 C,D C,D FLIM_ECOLI Flagellar motor switch protein fliM MGDSILSQAEIDALLN 16 T 0.027 CitT pdbhh F Bacteria T 2b1n 2 B B peptide (LYS)(ALA)(SER)(VAL)(GLY) KASVG 5 T 280 zf-CCHC pdbhh F F 2b26 2 D D HSP7B_DROME HEAT SHOCK 70 KDA PROTEIN 87D PTVEEVD 7 T 2.6 DUF2368 pdbhh F Eukaryota F 2b5b 1 A A DBTEW_CARCR Defensin EKKCPGRCTLKCGKHERPTLPYNCGKYICCVPVKVK 36 F F Eukaryota T 2b5k 1 A A PPM1_LIMPO PV5; POLYPHEMUSIN I RRWCFRVCYRGRFCYRKCRX 20 T 0.22 zf-CCHH pdbhh F Eukaryota T 2b5p 1 A A CT6A_CONMR Lambda-conotoxin CMrVIA VCCGYKLCHPC 11 T 0.33 Oxidored-like pdbhh F Eukaryota T 2b5q 1 A A CT6A_CONMR Lambda-conotoxin CMrVIA VCCGYKLCHPC 11 T 0.33 Oxidored-like pdbhh F Eukaryota T 2b6n 2 B B TRIPEPTIDE APT 3 T 390 GM_CSF pdbhh F F 2b7f 2 C,F,I I,J,K (ACE)APQV(STA)VMHP peptide XAPQVXVMHP 10 T 19 OTCace pdbhh F T 2b9h 2 B C STE7 RRNLKGLNLNLHPD 14 T 3.6 DUF3626 pdbhh F T 2b9i 2 B C MSG5 PRSLQNRNTKNLSLDIAALHP 21 T 28 DUF2000 pdbhh F T 2b9j 2 B C CKI, FAR1, FACTOR ARREST PROTEIN SKRGNIPKPLNLS 13 T 3.8 DUF5361 pdbhh F T 2bb4 1 A P beta-casomorphin-7 YPFVEPI 7 T 21 PCP pdbhh F T 2bba 2 B P Agonist peptide TNYLFSPNGPIARAW 15 T 0.036 PufQ pdbhh F T 2bbl 1 A A POLG_POL1M Genome linked protein VPg GAYTGLPNKKPNVPTIRTAKVQ 22 T 11 DUF2111 pdbhh T Viruses T 2bbm 2 B B MYLK2_RABIT MYOSIN LIGHT CHAIN KINASE KRRWKKNFIAVSAANRFKKISSSGAL 26 T 0.024 PACT_coil_coil unppssm F Eukaryota T 2bbn 2 B B MYLK2_RABIT MYOSIN LIGHT CHAIN KINASE KRRWKKNFIAVSAANRFKKISSSGAL 26 T 0.024 PACT_coil_coil unppssm F Eukaryota T 2bbp 1 A A POLG_POL1M Genome linked protein VPg GAYTGLPNKKPNVPTIRTAKVQ 22 T 11 DUF2111 pdbhh T Viruses T 2bbu 2 B B IL6RB_MOUSE GP130 PHOSPHOPEPTIDE STASTVEXSTVVHSG 15 T 10 DUF4244 pdbhh F Eukaryota T 2bc7 1 A A CA1_CONIM Alpha-conotoxin ImI GXCSDPRXAWRC 12 T 0.0098 Toxin_8 unphh F Eukaryota T 2bc8 1 A A CA1_CONIM Alpha-conotoxin ImI GXXSDPRXAWRX 12 T 0.0098 Toxin_8 unphh F Eukaryota T 2bcc 9 I,I1 I,I CYTOCHROME BC1 COMPLEX, COMPLEX III XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 33 F F F 2bcd 2 B B MOTUPORIN XVXXX 5 T 840 PAS_11 pdbhh F F 2bcx 2 B B RYR1_RABIT SKELETAL MUSCLE-TYPE RYANODINE RECEPTOR, RYR1, RYR-1, SKELETAL MUSCLE CALCIUM RELEASE CHANNEL KSKKAVWHKLLSKQRRRAVVACFRMTPLYN 30 T 2.1 Spc110_C pdbhh F Eukaryota T 2bd2 1 A P beta-casomorphin-7 YPFVEPI 7 T 21 PCP pdbhh F T 2bd3 1 A P beta-casomorphin-7 YPFVEPI 7 T 21 PCP pdbhh F T 2bd4 1 A P beta-casomorphin-7 YPFVEPI 7 T 21 PCP pdbhh F T 2bd5 1 A P beta-casomorphin-7 YPFVEPI 7 T 21 PCP pdbhh F T 2bd7 1 A P beta-casomorphin-7 YPFVEPI 7 T 21 PCP pdbhh F T 2bd8 1 A P beta-casomorphin-7 YPFVEPI 7 T 21 PCP pdbhh F T 2bda 1 A P NPI NPI 3 T 51 Sigma54_AID pdbhh F F 2bdx 2 B B DIHYDROMICROCYSTIN-LA XLXAXXX 7 T 350 AIG2_2 pdbhh F F 2be1 2 C D peptide VVVVVVVV 8 T 170 FeoB_associated pdbhh F F 2bec 2 B B SL9A1_HUMAN NA(+)/H(+) EXCHANGER 1, NHE-1, SOLUTE CARRIER FAMILY 9 MEMBER 1, NA(+)/H(+) ANTIPORTER, AMILORIDE-SENSITIVE, APNH VDLLAVKKKQETKRSINEEIHTQFLDHLLTGIEDICGHYGHHH 43 T 0.99 Herpes_TK_C pdbpercent F Eukaryota T 2beu 3 C C PEPTIDE ALA-TYR-ARG AYR 3 T 110 Arv1 pdbhh F F 2bev 3 C C PEPTIDE ALA-TYR-ARG AYR 3 T 110 Arv1 pdbhh F F 2bew 3 C C PEPTIDE ALA-TYR-ARG AYR 3 T 110 Arv1 pdbhh F F 2bey 1 A A BIKK CTKSIPPICTKSIPPI 16 T 0.016 Bowman-Birk_leg pdb F T 2bfi 1 A A SYNTHETIC PEPTIDE KFFEAAAKKFFE 12 T 3.5 DASH_Ask1 pdbhh F F 2bha 2 B B VALINE-PROLINE-LEUCINE VPL 3 T 230 YebF pdbhh F F 2bhd 2 B B VALINE-PROLINE-LEUCINE TRIPEPTIDE VPL 3 T 230 YebF pdbhh F F 2bi6 1 A L IBRO_ANACO BROMELAIN INHIBITOR VI TACSECVCPLR 11 T 0.014 CID_GANP unp F Eukaryota T 2bil 1 A A CONSENSUS PIM1 PEPTIDE PIMTIDE ARKRRRHPSGPPTA 14 T 1.5 DUF3019 pdbhh F T 2bj4 3 C,D C,D PEPTIDE ANTAGONIST LTSRDFGSWYA 11 T 0.6 HNH_repeat pdbhh F T 2bn5 2 B B RU17_DROME U1 SNRNP 70 KDA, SNRNP70, U1-70K PROTEIN RPPPAHHNMFSVPPPPILGRG 21 T 17 DUF1851 pdbhh F Eukaryota T 2bp3 2 C,D S,T GP1BA_HUMAN GLYCOPROTEIN B ALPHA, GLYCOPROTEIN IBALPHA, GP-IB ALPHA, GPIBA, GPIB-ALPHA, CD42B-ALPHA, CD42B LRGSLPTFRSSLFLWVRPNGRV 22 T 0.091 GGN unphh F Eukaryota T 2bp5 2 B P P2RX4_RAT ATP RECEPTOR, P2X4, PURINERGIC RECEPTOR VEDYEQGLSG 10 T 4.9 GM_CSF pdbhh F Eukaryota T 2bqz 2 B,D B,F SMYD5_HUMAN HISTONE H4 RHRKVLRDNY 10 T 3.9 Phage_X pdbhh F Eukaryota T 2br8 2 F,G,H,I,J F,G,H,I,J CA1A_CONPE ALPHA-PNIA GCCSLPPCALNNPKYCX 17 T 0.0013 Toxin_8 pdbpssm F Eukaryota T 2br9 2 B P CONSENSUS PEPTIDE FOR 14-3-3 PROTEINS RRQRSAP 7 T 100 tRNA-synt_2_TM pdbhh F F 2bss 3 C C Q98Y46_9HIV1 HIV PEPTIDE KRWIILGLNK 10 T 1 COX2-transmemb pdbhh T Viruses T 2bta 1 A _ B3AT_HUMAN B3P MEELQDDYEDMMEENX 16 T 1.1 DUF1265 pdbhh F Eukaryota T 2btb 1 A _ B3AT_HUMAN B3P MEELQDDYEDMMEENX 16 T 1.1 DUF1265 pdbhh F Eukaryota T 2btp 2 C,D P,Q CONSENSUS PEPTIDE FOR 14-3-3 PROTEINS RQRSAP 6 T 130 DUF1840 pdbhh F F 2btx 2 B B LIBRARY DERIVED PEPTIDE MRYYESSLKSYPD 13 T 3.3 Prion pdbhh F T 2bug 2 B B HS90A_HUMAN DSCR1 XMEEVD 6 T 13 TBP unphh F Eukaryota F 2buo 2 B T INHIBITOR OF CAPSID ASSEMBLY ITFEDLLDYYGP 12 T 0.92 DUF2610 pdbhh F T 2bvo 3 C C Q70A61_9HIV1 HIV-P24 KAFSPEVIPMF 11 T 9.1E-05 Gag_p24 unphh T Viruses T 2bvq 3 C C Q70A61_9HIV1 HIV-P24 KAFSPEVIPMF 11 T 9.1E-05 Gag_p24 unphh T Viruses T 2byp 2 F,G,H,I,J F,G,H,I,J CA1_CONIM ALPHA-CONOTOXIN IMI GCCSDPRCAWRX 12 T 0.0098 Toxin_8 unphh F Eukaryota T 2bz8 2 C C CBLB_HUMAN SIGNAL TRANSDUCTION PROTEIN CBL-B SH3-BINDING PROTEIN CBL-B, RING FINGER PROTEIN 56, CBL-B PARPPKPRPRR 11 T 2.3 Hap4_Hap_bind pdbhh F Eukaryota F 2bzk 1 A A PIMTIDE ARKRRRHPSGPPTAX 15 T 1.8 DUF3019 pdbhh F T 2c1d 1 A,C,E,G A,C,E,G SOXA_PARPN SOXA DPVEDGLVIETDSGPVEIVTKTAPPAFLADTFDTIYSGWHFRDDSTRDLERDDFDNPAMVFVDRGLDKWNAAMGVNGESCASCHQGPESMAGLRAVMPRVDEHTGKLMIMEDYVNACVTERMGLEKWGVTSDNMKDMLSLISLQSRGMAVNVKIDGPAAPYWEHGKEIYYTRYGQLEMSCANCHEDNAGNMIRADHLSQGQINGFPTYRLKDSGMVTAQHRFVGCVRDTRAETFKAGSDDFKALELYVASRGNGLSVEGVSVRH 264 T 6E-05 DUF1924 unphh F Bacteria T 2c1e 3 C C AZA-PEPTIDE INHIBITOR (5S, 8R, 11S)-8-(2-CARBOXYETHYL)-5-(CARBOXYMETHYL)-14-(4-ETHOXY-4-OXOBUTANOYL)-11-(1-METHYLETHYL)-3,6,9,12-TETRAOXO-1-PHENYL-2-OXA-4,7,10,13,14-PENTAAZAHEXADECAN -16-OIC ACID XDEVX 5 T 570 Helicase_RecD pdbhh F F 2c2k 3 C C AZA-PEPTIDE INHIBITOR (5S, 8R, 11S)-8-(2-CARBOXYETHYL)-5-(CARBOXYMETHYL)-14-(4-ETHOXY-4-OXOBUTANOYL)-11-(1-METHYLETHYL)-3,6,9,12-TETRAOXO-1-PHENYL-2-OXA-4,7,10,13,14-PENTAAZAHEXADECAN -16-OIC ACID XDEVX 5 T 570 Helicase_RecD pdbhh F F 2c2l 2 E,F,G,H E,F,G,H HS90A_HUMAN HSP90 DTSRMEEVD 9 T 6.5 Clathrin_lg_ch pdbhh F Eukaryota T 2c2m 3 C C AZA-PEPTIDE INHIBITOR (5S, 8R, 11S)-14-[4-(BENZYLOXY)-4-OXOBUTANOYL]-8-(2-CARBOXYETHYL)-5-(CARBOXYMETHYL)-11-(1-METHYLETHYL)-3,6,9,12-TETRAOXO-1-PHENYL-2-OXA-4,7,10,13,14 -PENTAAZAHEXADECAN-16-OIC ACID XDEVX 5 T 570 Helicase_RecD pdbhh F F 2c2o 3 C C AZA-PEPTIDE INHIBITOR (5S, 8R, 11S)-14-{4-[BENZYL(METHYL) AMINO]-4-OXOBUTANOYL}-8-(2-CARBOXYETHYL)-5-(CARBOXYMETHYL)-11-(1-METHYLETHYL)-3,6,9,12-TETRAOXO-1-PHENYL-2-OXA-4,7,10,13,14-PENTAAZAHEXADECAN-16-OIC ACID XDEVX 5 T 570 Helicase_RecD pdbhh F F 2c2z 3 C C AZA-PEPTIDE INHIBITOR (5S, 8R, 11S)-8-(2-CARBOXYETHYL) -14-[4-(3,4-DIHYDROQUINOLIN-1(2H)-YL)-4-OXOBUTANOYL] -11-[(1R)-1-HYDROXYETHYL]-5-(2-METHYLPROPYL)-3,6,9,12-TETRAOXO -1-PHENYL-2-OXA-4,7,10,13,14-PENTAAZAHEXADECAN-16-OIC ACID XLETX 5 T 1200 Peptidase_S68 pdbhh F F 2c3i 1 A A PIMTIDE KRRRHPSG 8 T 3.5 RNA_GG_bind pdbhh F T 2c5i 1 A P VPS51_YEAST VPS51, APICAL BUD GROWTH PROTEIN 3 AEQISHKKSLRVSSLNKDRRLLLREFYNL 29 T 0.028 rRNA_processing unppercent F Eukaryota T 2c5k 1 A P VPS51_YEAST VPS51, APICAL BUD GROWTH PROTEIN 3 KSLRVSSLNKDRRLLLREFYNLEN 24 T 0.028 rRNA_processing unppercent F Eukaryota T 2c5v 3 E,F F,H ALA-ALA-ABA-ARG-SER-LEU-ILE-PFF-NH2 AAXRSLIXX 9 T 0.68 SDA1 pdbhh F T 2c63 2 E,F,G,H P,Q,R,S CONSENSUS PEPTIDE FOR 14-3-3 PROTEINS RAISLP 6 T 54 Tryp_alpha_amyl pdbhh F T 2c74 2 C,D P,Q CONSENSUS PEPTIDE MODE 1 FOR 14-3-3 PROTEINS RRQRSAP 7 T 100 tRNA-synt_2_TM pdbhh F F 2c77 2 B B THCL_PLARO GE22700A SXNXVXGXXXXXSPX 15 T 1.2 CCER1 unphh F Bacteria T 2c9f 2 F,G,H,I,J S,T,U,V,W SPIKE_ADE02 N-TERMINAL PEPTIDE OF THE FIBER MKRARPSGDTFNPVYPYDT 19 T 0.44 DUF5449 pdbhh T Viruses T 2c9l 3 C,D Y,Z BZLF1_EBVB9 EB1, ZEBRA MLEIKRYKNRVAARKSRAKFKQLLQHYREVAAAKSSENDRLRLLLKQMCPSLDVDSIIPRTPD 63 T 0.00058 bZIP_2 pdb T Viruses T 2c9n 3 C,D Y,Z BZLF1_EBVB9 EB1, ZEBRA MLEIKRYKNRVASRKCRAKFKQLLQHYREVAAAKSSENDRLRLLLKQMCPSLDVDSIIPRTPD 63 T 0.0016 bZIP_2 pdb T Viruses T 2c9t 2 K,L,M,N,O,P,Q,R K,M,O,P,Q,R,S,T CA1_CONIM ALPHA-CTX IMI GCCSDPRCAWRCX 13 T 0.0098 Toxin_8 unphh F Eukaryota T 2cbl 2 B B ZAP70_HUMAN ZAP-70 TLNSDGXTPEPA 12 T 0.99 Galanin pdbhh F Eukaryota T 2cch 3 E,F E,F CDC6_HUMAN CDC6-BIS, CDC6-RELATED PROTEIN, P62, CDC6, HSCDC6, HSCDC18 HTLKGRRLVFDN 12 T 1 Med26_C pdbhh F Eukaryota T 2cci 3 E,F F,I CDC6_HUMAN CDC6-RELATED PROTEIN,CDC18-RELATED PROTEIN,HSCDC18,P62(CDC6),HSCDC6 HHASPRKQGKKENGPPHSHTLKGRRLVFDN 30 T 9.8 Rhodanese_C pdbhh F Eukaryota T 2cdr 3 C I AZA-PEPTIDE EXPOXIDE XDEVX 5 T 570 Helicase_RecD pdbhh F F 2ce8 2 E,F X,Y EH1 PEPTIDE MFSIDNILA 9 T 0.28 TerC pdbhh F T 2ce9 2 E,F X,Y WRPW PEPTIDE MWRPW 5 T 22 Trp_leader2 pdbhh F F 2cef 1 A A TF_HUMAN TFCD, TF, COAGULATION FACTOR III, THROMBOPLASTIN, CD142 ANTIGEN, TFPP CRKAGVGQSWKENSPLNVS 19 T 0.0002 Shisa unppssm F Eukaryota T 2ceh 1 A A TF_HUMAN TFCD, TF, COAGULATION FACTOR III, THROMBOPLASTIN, CD142 ANTIGEN CRKAGVGQSWKENSPLNVS 19 T 0.0002 Shisa unppssm F Eukaryota T 2cez 1 A A TF_HUMAN TFSP 253, TF, COAGULATION FACTOR III, THROMBOPLASTIN, CD142 ANTIGEN CRKAGVGQSWKENSPLNVS 19 T 0.0002 Shisa unppssm F Eukaryota T 2cfj 1 A A TF_HUMAN TFSP 258, TF, COAGULATION FACTOR III, THROMBOPLASTIN, CD142 ANTIGEN CRKAGVGQSWKENSPLNVS 19 T 0.0002 Shisa unppssm F Eukaryota T 2cha 1 A,D A,E CTRA_BOVIN ALPHA-CHYMOTRYPSIN A CGVPAIQPVLSGL 13 T 2 CaM_bind pdbhh F Eukaryota T 2chn 2 C,D C,D HEXOSAMINIASE XXXXXXXXXXXX 12 F F F 2cho 2 C,D C,D HEXOSAMINIASE XXXXXXXXXXXXX 13 F F F 2cjx 3 C I Z-DEVD-CMK XDEVDX 6 T 200 ResIII pdbhh F F 2cjy 3 C I Z-DEVD-CMK XDEVDX 6 T 200 ResIII pdbhh F F 2ck0 3 C P PROTEIN (11-MER; CYCLIC PEPTIDE) CKEWLSTAPCG 11 T 0.75 PSI_PsaJ pdbhh F T 2clr 3 C,F C,F CALR_HUMAN DECAMERIC PEPTIDE FROM CALRETICULIN MLLSVPLLLG 10 T 1.6 DUF4634 pdbhh F Eukaryota T 2cm4 1 A A Q5YD59_ORNMO OMCI DSESDCTGSEPVDAFQAFSEGKEAYVLVRSTDPKARDCLKGEPAGEKQDNTLPVMMTFKQGTDWASTDWTFTLDGAKVTATLGQLTQNREVVYDSQSHHCHVDKVEKEVPDYEMWMLDAGGLEVEVECCRQKLEELASGRNQMYPHLKDC 150 T 7.8E-05 His_binding pdbhh F Eukaryota T 2cm9 1 A A Q5YD59_ORNMO OMCI DSESDCTGSEPVDAFQAFSEGKEAYVLVRSTDPKARDCLKGEPAGEKQDNTLPVMMTFKQGTDWASTDWTFTLDGAKVTATLGQLTQNREVVYDSQSHHCHVDKVEKEVPDYEMWMLDAGGLEVEVECCRQKLEELASGRNQMYPHLKDC 150 T 7.8E-05 His_binding pdbhh F Eukaryota T 2cmy 2 B B TI_VERHE VERONICA HEDERIFOLIA TRYPSIN INHIBITOR NTDPEQCKVMCYAQRHSSPELLRRCLDNCEKEHD 34 T 0.0098 DUF842 pdb F Eukaryota T 2cnk 3 C I AZA-PEPTIDE EXPOXIDE XDEVX 5 T 570 Helicase_RecD pdbhh F F 2cnl 3 C I AZA-PEPTIDE EPOXIDE XDEVX 5 T 570 Helicase_RecD pdbhh F F 2cnm 2 D,E,F D,E,F RS18_SALTY CTERM-ARG-ARG-PHE-TYR-ARG-ALA-N-ALPHA-ACETYL XARYFRR 7 T 1.7 Toxin_37 pdbhh F Bacteria F 2cnn 3 C I AZA-PEPTIDE EXPOXIDE XIETX 5 T 890 Imm53 pdbhh F F 2cp8 1 A A NBR1_HUMAN KIAA0049 PROTEIN, NEIGHBOR OF BRCA1 GENE 1 PROTEIN GSSGSSGQTAALMAHLFEMGFCDRQLNLRLLKKHNYNILQVVTELLQLSGPSSG 54 T 0.0021 UBA pdbpssm F Eukaryota T 2cp9 1 A A EFTS_HUMAN EF-TS, EF-TSMT GSSGSSGSSKELLMKLRRKTGYSFVNCKKALETCGGDLKQAEIWLHKEAQKEGWSKAASGPSSG 64 T 0.00034 EF_TS unphh F Eukaryota T 2csp 1 A A RIMB2_HUMAN RIM-BP2 GSSGSSGVEFSTLPAGPPAPPQDVTVQAGVTPATIRVSWRPPVLTPTGLSNGANVTGYGVYAKGQRVAEVIFPTADSTAVELVRLRSLEAKGVTVRTLSAQGESVDSAVAAVPPELLVPPTPHPSGPSSG 130 T 0.00022 DUF4998 unphh F Eukaryota T 2ctd 1 A A ZN512_HUMAN Zinc finger protein 512 GSSGSSGRIRKEPPVYAAGSLEEQWYLEIVDKGSVSCPTCQAVGRKTIEGLKKHMENCKQEMFTCHHCGKQLRSLAGMKYHVMANHNSLPSGPSSG 96 T 0.00024 zf-C2H2_4 pdbpercent F Eukaryota T 2cvy 2 B B RIR2_YEAST RNR2 C-TERMINAL 9 MER PEPTIDE GAFTFNEDF 9 T 2.1 DUF4295 pdbhh F Eukaryota T 2cwg 2 B,D D,E T5 SIALOGLYCOPEPTIDE OF GLYCOPHORIN A DTYAATPR 8 T 34 DUF2024 pdbhh F T 2czs 1 A,B A,B Q748S4_GEOSL DHC2 MVSGEVRTKKVPLDTNHKRFYDAFAQGAGKLDLDRQCVECHHEKPGGIPFPKNHPVKPADGPMRCLFCHKFKLEHHHHHH 80 T 3.6E-05 Cytochrom_NNT unphh F Bacteria T 2czy 2 B B REST_HUMAN NRSF/REST APQLIMLANVALTGE 15 T 0.93 zf-C2H2 unppssm F Eukaryota T 2d0n 2 B,D B,D SLP-76 binding peptide PSIDRSTKP 9 T 36 Protein_K pdbhh F T 2d1x 2 E,F P,Q ASAP1_HUMAN proline rich region from development and differentiation enhancing factor 1 SKKRPPPPPPGHKRT 15 T 3 DUF6059 pdbhh F Eukaryota T 2d3g 2 C P HGS_HUMAN ubiquitin interacting motif from hepatocyte growth factor-regulated tyrosine kinase substrate LQEEEELQLALALSQSEAEEK 21 T 6.3E-05 UIM pdbhh F Eukaryota T 2d4o 1 A A Q72J89_THET2 hypothetical protein TTHA1254 MRFRPFTEEDLDRLNRLAGKRPVSLGALRFFARTGHSFLAEEGEEPMGFALAQAVWQGEATTVLVTRMEGRSVEALRGLLRAVVKSAYDAGVYEVALHLDPERKELEEALKAEGFALGPLVLAVRVLGSRGARGETRGVLE 141 T 0.067 DUF1999 unppssm F Bacteria T 2d55 2 C C ACTINOMYCIN D TXPXXXTXPXX 11 T 61 DUF5572 pdbhh F F 2d5w 2 C C pentapeptide A ASKPK 5 T 310 AvrPtoB-E3_ubiq pdbhh F F 2d5w 3 D D pentapeptide B ASKTK 5 T 460 CBFNT pdbhh F F 2d7s 2 B B Q9QCE4_9PICO VPg1 protein GPYAGPLERQRPLKVRAKLPRQE 23 T 5.1 RNase_HII pdbhh T Viruses T 2d8v 1 A A ANCHR_MOUSE Zinc finger FYVE domain-containing protein 19 GSSGSSGLPWCCICNEDATLRCAGCDGDLYCARCFREGHDNFDLKEHQTSPYHPRRPCQEHSGPSSG 67 T 0.00043 zf-B_box pdbpssm F Eukaryota T 2da8 1 A A TRIOSTIN A XAXVXAXV 8 T 190 RSF pdbhh F F 2db2 1 A A DHX30_HUMAN KIAA0890 protein GSSGSSGASRDLLKEFPQPKNLLNSVIGRALGISHAKDKLVYVHTNGPKKKKVTLHIKWPKSVEVEGYGSKKIDAERQAAAAACQLFKGWGLLGPRNELFDAAKYRVLADRFGSGPSSG 119 T 0.00018 Dicer_dimer pdbhh F Eukaryota T 2dcx 1 A A DMS4_PHYSA DS IV ALWKTLLKKVLKAX 14 T 0.056 DD_K pdb F Eukaryota T 2dd6 1 A A DMS4_PHYSA DS IV ALWKTLLKKVLKAX 14 T 0.056 DD_K pdb F Eukaryota T 2dew 2 B A 10-mer peptide from histone H3 LQTARKSTGG 10 T 23 DUF5915 pdbhh F T 2dex 2 B A 10-mer peptide from histone H3 LAPRKQLATK 10 T 24 DUF3597 pdbhh F T 2dey 2 B A 10-mer peptide from histone H4 XSGRGKGGKGL 11 T 5.5 G3P_acyltransf pdbhh F T 2df6 2 C,D C,D PAK2_RAT 18-mer from PAK2 PPVIAPRPEHTKSIYTRS 18 T 2.2 TFIIA unppercent F Eukaryota T 2dhx 1 A A PAR10_HUMAN poly (ADP-ribose) polymerase family, member 10 variant GSSGSSGGVAVEVRGLPPAVPDELLTLYFENRRRSGGGPVLSWQRLGCGGVLTFREPADAERVLAQADHELHGAQLSLRPAPPRAPARLLLQGLPPGTSGPSSG 104 T 0.00023 NID pdbhh F Eukaryota T 2dhz 1 A A RPGFL_HUMAN LINK GUANINE NUCLEOTIDE EXCHANGE FACTOR II GSSGSSGDEIFCRVYMPDHSYVTIRSRLSASVQDILGSVTEKLQYSEEPAGREDSLILVAVSSSGEKVLLQPTEDCVFTALGINSHLFACTRDSYEALVPLPEEIQVSPGDTEISGPSSG 120 T 0.01 RA pdbpssm F Eukaryota T 2djy 2 B B SMAD7_HUMAN SMAD 7, MOTHERS AGAINST DPP HOMOLOG 7, SMAD7, HSMAD7 GPLGSELESPPPPYSRYPMD 20 T 0.051 WBP-1 pdbhh F Eukaryota T 2dko 3 C I Z-DEVD-CMK XDEVDX 6 T 200 ResIII pdbhh F F 2do2 2 B P Ala-Leu-Ala-Ser-Lys ALASK 5 T 350 DIMCO_N pdbhh F F 2dqk 2 B P VLLH VLLH 4 T 82 Ser_hydrolase pdbhh F F 2drm 2 E,F E,G 18-mer peptide from Acan125 AKPVPPPRGAKPAPPPRT 18 T 30 HCV_NS5a_C pdbhh F T 2ds8 2 B,D P,Q XB APALRVVK 8 T 9.8 ACC_epsilon pdbhh F T 2duj 2 B P LLFND LLFND 5 T 42 CDC50 pdbhh F F 2dun 1 A A DPOLM_HUMAN POL MU GSSGSSGSTRFPGVAIYLVEPRMGRSRRAFLTGLARSKGFRVLDACSSEATHVVMEETSAEEAVSWQERRMAAAPPGCTPPALLDISWLTESLGAGQPVPVECRHRLEVAGPRKGPLSPAWMPAYACSGPSSG 133 T 0.00019 BRCT pdbpercent F Eukaryota T 2dvq 2 D,E P,Q H4_YEAST histone H4 SGRGKGGKGLGXGGA 15 T 11 Shadoo unppercent F Eukaryota T 2dvr 2 D,E P,Q H4_YEAST histone H4 SGRGXGGKGLGXGGA 15 T 11 Shadoo unppercent F Eukaryota T 2dvs 2 D,E P,Q histone H4 LGXGGAKRHRKV 12 T 35 DUF4196 pdbhh F T 2dwf 1 A A PSPB_HUMAN SP-B, 6 KDA PROTEIN, PULMONARY SURFACTANT-ASSOCIATED PROTEOLIPID SPLPHE, 18 KDA PULMONARY-SURFACTANT PROTEIN CWLCRALIKRIQAMIPKGGRMLPQLVCRLVLRCS 34 T 4.2E-12 SapB_2 unppssm F Eukaryota T 2dwx 2 E,F P,Q GGA1_HUMAN hinge peptide from ADP-ribosylation factor binding protein GGA1 SLDGTGWNSFQSS 13 T 3.8 DpnII pdbhh F Eukaryota T 2dx2 1 A A Target Peptide INYWLAHAKAG 11 T 2.4 DUF3717 pdbhh F T 2dx3 1 A A DP5_conformation1 INYWLAHAKAGYIVHWTA 18 T 2 XkdW pdbhh F T 2dx4 1 A A DP5_conformation2 INYWLAHAKAGYIVHWTA 18 T 2 XkdW pdbhh F T 2dxp 2 B B A(PTR)R AXR 3 T 110 Arv1 pdbhh F F 2dyf 2 B B BBC1_YEAST PROTEIN BBC1 GSTAPPLPR 9 T 12 FAA_hydro_N_2 pdbhh F Eukaryota T 2dyh 2 B B NF2L2_MOUSE NF-E2-RELATED FACTOR 2, NFE2-RELATED FACTOR 2, NUCLEAR FACTOR, ERYTHROID DERIVED 2, LIKE 2 ILWRQDIDLGVSREV 15 T 0.19 LicD pdbhh F Eukaryota T 2dzm 1 A A FAF1_HUMAN PROTEIN FAF1, HFAF1 GSSGSSGRMLDFRVEYRDRNVDVVLEDTCTVGEIKQILENELQIPVSKMLLKGWKTGDVEDSTVLKSLHLPKNNSLYVLTPDLPPPSSSSHAGALQESLN 100 T 2.5E-05 YukD pdbhh F Eukaryota T 2e30 2 B B SL9A1_HUMAN NA+, /H+, EXCHANGER 1, NHE-1, SOLUTE CARRIER FAMILY 9 MEMBER 1, NA+, /H+, ANTIPORTER, AMILORIDE- SENSITIVE, APNH VDLLAVKKKQETKRSINEEIHTQFLDHLLTGIEDICGHYGHHH 43 T 0.99 Herpes_TK_C pdbpercent F Eukaryota T 2e3k 2 E,F Q,R H4_YEAST 15-mer peptide from Histone H4 SGRGXGGKGLGXGGA 15 T 11 Shadoo unppercent F Eukaryota T 2e4e 1 A A CHIGNOLIN GYDPATGTFG 10 T 0.06 BA14K pdbhh F T 2e4h 2 B B TBA1B_HUMAN ALPHA-TUBULIN UBIQUITOUS, TUBULIN K-ALPHA-1, ALPHA-TUBULIN 3 GEFSEAREDMAALEKDYEEVGVDSVEGEGEEEGEEY 36 T 1.8 Hrs_helical pdbhh F Eukaryota T 2e50 1 A,B,C,D A,B,P,Q SET_HUMAN SET/TAF-1BETA, PHOSPHATASE 2A INHIBITOR I2PP2A, I-2PP2A, TEMPLATE-ACTIVATING FACTOR I, TAF-I, HLA-DR ASSOCIATED PROTEIN II, PHAPII, INHIBITOR OF GRANZYME A-ACTIVATED DNASE, IGAAD MSAQAAKVSKKELNSNHDGADETSEKEQQEAIEHIDEVQNEIDRLNEQASEEILKVEQKYNKLRQPFFQKRSELIAKIPNFWVTTFVNHPQVSALLGEEDEEAMHYLTRVEVTEFEDIKSGYRIDFYFDENPYFENKVLSKEFHMNESGDPSSKSTEIKWKSGKDMTKRSSQTQNKASRKRQHEEPESFFTWFTDHSDAGADELGEVIKDDIWPNPLQYYLVPDM 225 T 7.5E-06 NAP pdb F Eukaryota T 2e6z 1 A A SPT5H_HUMAN HSPT5, DRB SENSITIVITY-INDUCING FACTOR LARGE SUBUNIT, DSIF LARGE SUBUNIT, DSIF P160, TAT-COTRANSACTIVATOR 1 PROTEIN, TAT-CT1 PROTEIN GSSGSSGFQPGDNVEVCEGELINLQGKILSVDGNKITIMPKHEDLKDMLEFPAQELRKY 59 T 0.00013 KOW pdbpercent F Eukaryota T 2e72 1 A A POGZ_HUMAN Pogo transposable element with ZNF domain GSSGSSGQDGGRKICPRCNAQFRVTEALRGHMCYCCPEMVEYQSGPSSG 49 T 4.8E-05 zf_C2H2_6 pdbhh F Eukaryota T 2e7m 1 A A K0319_HUMAN Protein KIAA0319 GSSGSSGPRTVKELTVSAGDNLIITLPDNEVELKAFVAPAPPVETTYNYEWNLISHPTDYQGEIKQGHKQTLNLSQLSVGLYVFKVTVSSENAFGEGFVNVTVKPARSGPSSG 113 T 0.00052 PKD unppercent F Eukaryota T 2eax 2 D L GLYCOSAMYL MURAMYL PENTAPEPTIDE AXKXX 5 T 230 OAM_dimer pdbhh F F 2efg 2 B B EF-G XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 88 F F F 2egn 2 B B mGluR5 C-terminal peptide SSSSL 5 T 200 Cryptochrome_C pdbhh F F 2ehp 1 A,B A,B Y1627_AQUAE aq_1627 protein MPAIFTHEGKVEGVPGNYPLTAENLFRIGLALCTLWILDKEIEEPTLSIPETNFVTLALSVGFMNAGGSVNVGKGGDIKLFLQKGEIYVLEFQPLSETDIKKLESILFGRAPIPKKTGEDIGSFKC 126 T 0.071 POB3_N pdbpercent F Bacteria T 2eiu 1 A,B,C,D,E,F A,C,D,E,F,G Y1627_AQUAE Hypothetical protein aq_1627 MPAIFTHEGKVEGVPGNYPLTAENLFRIGLALCTLWILDKEIEEPTLSIPETNFVTLALSVGFMNAGGSVNVGKGGDIKLFLQKGEIYVLEFQPLSETDIKKLESILFGRAPIPKKTGEDIGSFKC 126 T 0.071 POB3_N pdbpercent F Bacteria T 2ejy 2 B B GLPC_HUMAN PAS-2', GLYCOPROTEIN BETA, GLPC, GLYCOCONNECTIN, SIALOGLYCOPROTEIN D, GLYCOPHORIN D, GPD, CD236 ANTIGEN DAGDSSRKEYCI 12 T 0.043 Herpes_gE unppercent F Eukaryota T 2eph 2 E H P90573_PLABE PbTRAP EDNDWN 6 T 27 DUF4878 pdbhh F Eukaryota F 2er0 2 B I L364,099 XHPFHXLF 8 T 0.028 DUF5372 pdbhh F F 2er6 2 B I H-256 peptide PTEXRE 6 T 210 DUF5737 pdbhh F F 2er7 2 B I TRANSITION-STATE ISOSTERE INHIBITOR OF RENIN XHPFHXIH 8 T 2.4 DUF5372 pdbhh F F 2er9 2 B I L363,564 XHPFHXLF 8 T 0.028 DUF5372 pdbhh F F 2era 1 A A 3S1EA_LATSE ERABUTOXIN A RICFNHQGSQPQTTKTCSPGESSCYNKQWSDFRGTIIERGCGCPTVKPGIKLSCCESEVCNN 62 T 0.0034 Toxin_TOLIP pdb F Eukaryota T 2erh 2 B B CEA7_ECOLX Colicin E7 RNKPGKATGKGKPVNNKWLNNAGKDLGSPVPDRIANKLRDKEFKSFDDFRKKFWEEVSKDPELSKQFSRTQNDRMKVGRAPQTRTQDVSGKRQSFELHHEKPISQNGGVYDMDNISVVTPKRHIDIH 127 T 0.0087 HNH pdbpercent F Bacteria T 2esl 2 G,H,I,J,K,L I,J,K,L,M,N CYCLOSPORIN A XXXXXXXXVXA 11 T 9.7 DUF6090 pdbhh F F 2esx 1 A A O36236_9HIV1 Envelope polyprotein GP160 TRKSIHIGPGRAFYTTGEI 19 F T Viruses T 2esz 1 A A O36236_9HIV1 Envelope polyprotein GP160 TRKSIHIGPGRAFYTTGEI 19 F T Viruses T 2etz 2 B B LCP2_MOUSE SH2 DOMAIN-CONTAINING LEUCOCYTE PROTEIN OF 76 KDA, SLP-76 TYROSINE PHOSPHOPROTEIN, SLP76 XADXEPPX 8 T 0.0057 SDA1 unppercent F Eukaryota F 2eu0 2 B B LCP2_MOUSE SH2 DOMAIN-CONTAINING LEUCOCYTE PROTEIN OF 76 KDA, SLP-76 TYROSINE PHOSPHOPROTEIN, SLP76 XADXEPPX 8 T 0.0057 SDA1 unppercent F Eukaryota F 2evq 1 A A HP7 KTWNPATGKWTE 12 T 0.52 Collagen_bind_2 pdbhh F T 2ewr 1 A A Q9X0A5_THEMA hypothetical protein TM1012 MGSDKIHHHHHHMIRPEYLRVLRKIYDRLKNEKVNWVVTGSLSFALQGVPVEVHDIDIQTDEEGAYEIERIFSEFVSKKVRFSSTEKICSHFGELIIDGIKVEIMGDIRKRLEDGTWEDPVDLNKYKRFVETHGMKIPVLSLEYEYQAYLKLGRVEKAETLRKWLNERKG 170 T 0.00088 NTP_transf_5 unppercent F Bacteria T 2ezd 3 C A HMGA1_HUMAN HIGH MOBILITY GROUP PROTEIN HMG VPTPKRPRGRPKGSKNKGAAKTRKT 25 T 0.029 AT_hook pdbhh F Eukaryota T 2eze 3 C A HMGA1_HUMAN HIGH MOBILITY GROUP PROTEIN HMG VPTPKRPRGRPKGSKNKGAAKTRKT 25 T 0.029 AT_hook pdbhh F Eukaryota T 2ezf 3 C A HMGA1_HUMAN HIGH MOBILITY GROUP PROTEIN HMG GRKPRGRPKK 10 T 0.0031 AT_hook pdbhh F Eukaryota F 2ezg 3 C A HMGA1_HUMAN HIGH MOBILITY GROUP PROTEIN HMG GRKPRGRPKK 10 T 0.0031 AT_hook pdbhh F Eukaryota F 2f3a 1 A A aurein 1.2 analog RLFDKIRQVIRKFX 14 T 7.5 DUF6200 pdbhh F T 2f4l 1 A,B,C,D A,B,C,D Q9WXX3_THEMA acetamidase, putative MGSDKIHHHHHHMKVVPAQRCVYSFSANMAPVEEVYPGEQVVFETLDALGGSYDKIDFSKVNPATGPVFVNGVKPGDTLKVRIKRIELPRRGMIVTGKGFGVLGDEVEGFHTKELEIEKWAVLFDGVRIPIHPMVGVIGVAPQEGEYPTGTAHRHGGNMDTKEITENVTVHLPVFQEGALLALGDVHATMGDGEVCVSACEVPAKVVVEIDVSKEEIKWPVVETNDAYYIIVSLPDIEEALKEVTRETVWFIQRRKTIPFTDAYMLASLSVDVGISQLVNPAKTAKARIPKYIFTGV 297 T 3.4E-14 FmdA_AmdA unppercent F Bacteria T 2f4o 3 C I PHQ-VAL-ALA-ASP-CF0 XVADX 5 T 1100 RE_HindIII pdbhh F F 2f58 3 C P V3 LOOP HIGPGRAFGGG 11 T 0.065 GP120 pdbhh F T 2f69 2 B B TAF10_HUMAN TAF10 peptide, Acetyl-Ser-Lys-Ser-Mlz-Asp-Arg-Lys-Tyr-Thr-Leu XSKSKDRKYTL 11 T 0.0084 TFIID-31kDa unphh F Eukaryota T 2f8e 2 B A Q9DS05_9PICO VPg protein GPYAGPLERQRPLKVKAKLPQAE 23 T 3.7 RNase_HII pdbhh T Viruses T 2f9n 2 E,F,G,H E,F,G,H Leupeptin XLLX 4 T 590 DNTTIP1_dimer pdbhh F F 2f9p 2 B,D,F,H E,F,G,H Leupeptin XLLX 4 T 590 DNTTIP1_dimer pdbhh F F 2fci 1 A B Q32PK0_BOVIN Doubly phosphorylated peptide derived from Syk kinase comprising residues 338-350 XDTEVXESPXADPX 14 T 27 Holin_2-3 pdbhh F Eukaryota T 2fcl 1 A A Q9X0A5_THEMA hypothetical protein TM1012 MGSDKIHHHHHHMIRPEYLRVLRKIYDRLKNEKVNWVVTGSLSFALQGVPVEVHDIDIQTDEEGAYEIERIFSEFVSKKVRFSSTEKICSHFGELIIDGIKVEIMGDIRKRLEDGTWEDPVDLNKYKRFVETHGMKIPVLSLEYEYQAYLKLGRVEKAETLRKWLNERK 169 T 0.00088 NTP_transf_5 unppercent F Bacteria T 2fdm 2 B P Tripeptide WPW 3 T 22 Sex_peptide pdbhh F F 2feq 3 C D Decapeptide Hirudin Analogue XYEPIPEEXXQ 11 T 0.91 Hirudin pdbhh F T 2fes 3 C D Decapeptide Hirudin Analogue XYEPIPEEXXQ 11 T 0.91 Hirudin pdbhh F T 2ff3 2 B C WASL_HUMAN N-WASP ADGQESTPPTPAPTS 15 T 0.17 WH2 unppssm F Eukaryota T 2ff4 2 C,D E,F RAD9_YEAST DNA repair protein RAD9 SLEVTEADT 9 T 43 AglB_L1 pdbhh F Eukaryota T 2ffd 4 G,H,I,J G,H,I,J GLY-PRO-ARG-VAL-VAL-GLU peptide GPRVVE 6 T 14 DUF4605 pdbhh F F 2ffu 2 B P Q63549_RAT 13-Peptide EA2, PTTDSTTPAPTTK PTTDSTTPAPTTK 13 T 39 DUF1263 pdbhh F Eukaryota T 2ffw 1 A A TRI18_HUMAN TRIPARTITE MOTIF PROTEIN 18, PUTATIVE TRANSCRIPTION FACTOR XPRF, MIDIN, RING FINGER PROTEIN 59 QKASVSGPNSPSETRRERAFDANTMTSAEKVLCQFCDQDPAQDAVKTCVTCEVSYCDECLKATHPNKKPFTGHRLIEP 78 T 0.0015 Siva pdbpssm F Eukaryota T 2fge 2 C,D D,E nonspecific peptide AALTRA AALTRA 6 T 94 DUF4712 pdbhh F F 2fgr 2 B B PAP DNWQNGTS 8 T 4.8 DUF1842 pdbhh F T 2fib 2 B B GLY-PRO-ARG-PRO GPRP 4 T 65 SRCR_2 pdbhh F F 2fiv 2 C,D I,J FIV PROTEASE INHIBITOR LP-149 XXVXEXX 7 T 650 RhoGEF67_u2 pdbhh F F 2flu 2 B P NF2L2_HUMAN Nrf2 AFFAQLQLDEETGEFL 16 T 0.18 DUF4585 pdbhh F Eukaryota T 2fmc 1 A A RODL_NEUCR RODLET PROTEIN, CLOCK-CONTROLLED GENE PROTEIN 2, BLUE LIGHT-INDUCED PROTEIN 7, EAS ATTIGPNTCSIDDYKPYCCQSMSGPAGSPGLLNLIPVDLSASLGCVVGVIGSQCGASVKCCKDDVTNTGNSFLIINAANCVA 82 T 0.05 Hydrophobin pdbhh F Eukaryota T 2fns 2 C P NC-P1 SUBSTRATE PEPTIDE RQANFLGKIN 10 T 9.8 Phage_30_3 pdbhh F T 2fnt 2 C P NC-p1 substrate PEPTIDE RQVNFLGKIN 10 T 0.61 zf-CCHC_5 unphh F T 2fnx 2 B P Inhibitor peptide VIAK 4 T 290 LRR_4 pdbhh F F 2fo4 3 C P MUC1_HUMAN MUCIN 1, TRANSMEMBRANE, MUC-1, POLYMORPHIC EPITHELIAL MUCIN, PEM, PEMT, EPISIALIN, TUMOR-ASSOCIATED MUCIN, CARCINOMA-ASSOCIATED MUCIN, TUMOR-ASSOCIATED EPITHELIAL MEMBRANE ANTIGEN, EMA, H23AG, PEANUT- REACTIVE URINARY MUCIN, PUM, BREAST CARCINOMA-ASSOCIATED ANTIGEN DF3, CD227 ANTIGEN SAPDFRPL 8 T 2.6 DUF724 pdbhh F Eukaryota T 2fo5 2 E,F,G,H E,F,G,H ACE-LEU-LEU-argininal (leupeptin) XLLX 4 T 590 DNTTIP1_dimer pdbhh F F 2foj 2 B B P53_HUMAN p53 peptide GARAHSS 7 T 140 zf_C2HC_14 pdbhh F Eukaryota F 2foo 2 B B P53_HUMAN p53 peptide EPGGSR 6 T 9.3 PTPA pdbhh F Eukaryota F 2fop 2 B B MDM2_HUMAN mdm2 peptide EKPSSS 6 T 260 Pox_F17 pdbhh F Eukaryota F 2fot 2 B C SPTN1_HUMAN SPECTRIN, NON-ERYTHROID ALPHA CHAIN, SPECTRIN ALPHA CHAIN, FODRIN ALPHA CHAIN QQEVYGMMPRDETDSKTASASPWKSARLMVHTVATFNSIKER 42 T 0.13 Spectrin unppercent F Eukaryota T 2fp7 3 C C N-benzoyl-L-norleucyl-L-lysyl-N-[(2S)-5-carbamimidamido-1-hydroxypentan-2-yl]-L-argininamide XXKRX 5 T 220 SUZ pdbhh F F 2fq5 1 A A Peptide 2F XDWLKAFYDKVAEKLKEAFX 20 T 0.08 ApoC-I pdb F T 2fq8 1 A A 2F XDWLKAFYDKVAEKLKEAFX 20 T 0.08 ApoC-I pdb F T 2fqc 1 A A CJEA_CONPO CONOTOXIN PL14A FPRPRICNLACRAGIGHKYPFCHCRX 26 T 0.3 DUF1181 pdbhh F Eukaryota T 2fr9 1 A A Alpha-conotoxin GI ECCNPACGRHYXC 13 T 0.017 Enterotoxin_ST pdbhh F T 2frb 1 A A Alpha-conotoxin GIA ECCXPACGRHYSC 13 T 0.048 Enterotoxin_ST unphh F T 2frz 1 A,B A,B CPXA_PSEPU CAMPHOR 5-MONOOXYGENASE, P450CAM TTETIQSNANLAPLPPHVPEHLVFDFDMYNPSNLSAGVQEAWAVLQESNVPDLVWTRCNGGHWIATRGQLIREAYEDYRHFSSECPWIPREAGEAFDFIPTSMDPPEQRQFRALANQVVGMPVVDKLENRIQELACSLIESLRPQGQCNFTEDYAEPFPIRIFMLLAGLPEEDIPHLKYLTDQMTRPDGSMTFAEAKEALYDYLIPIIEQRRQKPGTDAISIVANGQVNGRPITSDEAKRMCGLLLLGGLDTVVNFLSFSMEFLAKSPEHRQELIERPERIPAACEELLRRFSLVADGRILTSDYEFHGVQLKKGDQILLPQMLSGLDERENAAPMHVDFSRQKVSHTTFGHGSHLCLGQHLARREIIVTLKEWLTRIPDFSIAPGAQIQHKSGIVSGVQALPLVWDPATTKAV 414 T 1.6E-05 p450 unppercent F Bacteria T 2fym 2 B,E B,E RNE_ECOLI RNASE E ASPELASGKVWIRYPIVR 18 T 0.18 XisI pdbhh F Bacteria T 2fyy 3 C C EBNA1_EBVB9 EBNA-1 HPVGEADYFEY 11 T 6.4 Sel_put pdbhh T Viruses T 2fz3 3 C C EBNA1_EBVB9 EBNA-1 HPVGEADYFEY 11 T 6.4 Sel_put pdbhh T Viruses T 2fzt 1 A,B A,B Q9WZF7_THEMA hypothetical protein TM0693 GMNIDEIERKIDEAIEKEDYETLLSLLNKRKELMEGLPKDKLSEILEKDRKRLEIIEKRKTALFQEINVIREARSSLQK 79 T 0.00012 FliT pdb F Bacteria T 2g01 2 B,D F,G JIP1_HUMAN JNK-INTERACTING PROTEIN 1, JIP-1, JNK MAP KINASE SCAFFOLD PROTEIN 1, ISLET-BRAIN-1, IB-1, MITOGEN-ACTIVATED PROTEIN KINASE 8-INTERACTING PROTEIN 1 RPKRPTTLNLF 11 T 12 CTP-dep_RFKase pdbhh F Eukaryota T 2g1t 2 E,F,G,H E,F,G,H ATP-Peptide Conjugate AEEEIFGEFEAKK 13 T 16 SNN_linker pdbhh F T 2g2f 2 C C ATP-Peptide Conjugate EAIFAAPFAKK 11 T 13 AmoC pdbhh F T 2g2i 2 C,D C,D ATP-Peptide Conjugate AEEEIFGEFEAKK 13 T 16 SNN_linker pdbhh F T 2g2l 2 C,D C,D M0R9A7_RAT 18-MER PEPTIDE FROM GLURA18 SIPCMSHSSGMPLGATGL 18 T 4.1 Glyco_hydr_116N pdbhh F Eukaryota T 2g30 2 B P ARH_HUMAN AUTOSOMAL RECESSIVE HYPERCHOLESTEROLEMIA PROTEIN, ARH PEPTIDE DDGLDEAFSRLAQSRT 16 T 3.5 AalphaY_MDB pdbhh F Eukaryota T 2g30 3 C S peptide sequence AAF AAF 3 T 320 EZH2_WD-Binding pdbhh F F 2g35 2 B B PI51C_HUMAN peptide SWVXSPLH 8 T 4.6 Pox_F15 pdbhh F Eukaryota T 2g3v 2 E,F,G,H E,F,G,H (UNK)(UNK)(UNK)(UNK)(UNK)(MSE)(UNK) XXXXXMX 7 T 2500 ADH_N_assoc pdbhh F F 2g42 1 A,B A,B Q9WZF7_THEMA hypothetical protein TM_0693 GMNIDEIERKIDEAIEKEDYETLLSLLNKRKELMEGLPKDKLSEILEKDRKRLEIIEKRKTALFQEINVIREARSSLQK 79 T 0.00012 FliT pdb F Bacteria T 2g46 2 B,D C,D O24165_TOBAC meK27 H3 Peptide GKAPRKQLATKAARKSAPATG 21 T 0.023 PAF unp F Eukaryota T 2g57 1 A A Q0PNE9_RABIT Beta-catenin XKAAVSHWQQQSYLDSGIHSGATTTAPX 28 T 12 AvrPto pdbhh F Eukaryota T 2g58 2 B B (PHQ)IARS XIARS 5 T 210 GP67 pdbhh F F 2g5l 2 C,D X,Y (FME)(ASP)(VAL)(GLU)(ALA)(TRP)(LEU) MDVEAWL 7 T 1.1 DUF4276 pdbhh F T 2g6u 1 A A Miniprotein MP2 RCCHPQCGMVEECRK 15 T 0.76 Cys_rich_CWC pdbhh F T 2g80 1 A,B,C,D A,B,C,D ENOPH_YEAST UNKNOWN TRANSCRIPT 4 PROTEIN MGSDKIHHHHHHMVIGQKVLLARIPKMGDNYSTYLLDIEGTVCPISFVKETLFPYFTNKVPQLVQQDTRDSPVSNILSQFHIDNKEQLQAHILELVAKDVKDPILKQLQGYVWAHGYESGQIKAPVYADAIDFIKRKKRVFIYSSGSVKAQKLLFGYVQDPNAPAHDSLDLNSYIDGYFDINTSGKKTETQSYANILRDIGAKASEVLFLSDNPLELDAAAGVGIATGLASRPGNAPVPDGQKYQVYKNFETL 253 T 0.00011 Hydrolase_like unppssm F Eukaryota T 2g83 2 C,D C,D KB-1753 phage display peptide RGYYHGIWVGE 11 T 0.089 Clr2 pdbhh F T 2g8z 2 B P (TRP)(PRO)(TRP) WPW 3 T 22 Sex_peptide pdbhh F F 2g9y 1 A,B A,B PPB_ECOLI APASE TPEMPVLENRAAQGDITAPGGARRLTGDQTAALRDSLSDKPAKNIILLIGDGMGDSEITAARNYAEGAGGFFKGIDALPLTGQYTHYALNKKTGKPDYVTDTAASATAWSTGVKTYNGALGVDIHEKDHPTILEMAKAAGLATGNVSTAELQDATPAALVAHVTSRKCYGPSATSEKCPGNALEKGGKGSITEQLLNARADVTLGGGAKTFAETATAGEWQGKTLREQAQARGYQLVSDAASLNSVTEANQQKPLLGLFADGNMPVRWLGPKATYHGNIDKPAVTCTPNPQRNDSVPTLAQMTDKAIELLSKNEKGFFLQVEGASIDKQDHAANPCGQIGETVDLDEAVQRALEFAKKEGNTLVIVTADHAHASQIVAPDTKAPGLTQALNTKDGAVMVMSYGNSEEDSQEHTGSQLRIAAYGPHAANVVGLTDQTDLFYTMKAALGLK 449 T 6.5E-10 Alk_phosphatase pdbpercent F Bacteria T 2ga3 1 A,B A,B PPB_ECOLI APASE TPEMPVLENRAAQGDITAPGGARRLTGDQTAALRDSLSDKPAKNIILLIGDGMGDSEITAARNYAEGAGGFFKGIDALPLTGQYTHYALNKKTGKPDYVTDTAASATAWSTGVKTYNGALGVDIHEKDHPTILEMAKAAGLATGNVSTAELQDATPAALVAHVTSRKCYGPSATSEKCPGNALEKGGKGSITEQLLNARADVTLGGGAKTFAETATAGEWQGKTLREQAQARGYQLVSDAASLNSVTEANQQKPLLGLFADGNMPVRWLGPKATYHGNIDKPAVTCTPNPQRNDSVPTLAQMTDKAIELLSKNEKGFFLQVEGASIDKQDHAANPCGQIGETVDLDEAVQRALEFAKKEGNTLVIVTADHAHASQIVAPDTKAPGLTQALNTKDGAVMVMSYGNSEEDSQEHTGSQLRIAAYGPHAANVVGLTDQTDLFYTMKAALGLK 449 T 6.5E-10 Alk_phosphatase pdbpercent F Bacteria T 2gbq 2 B B SOS1_MOUSE AC-VPPPVPPRRR-NH2 XVPPPVPPRRRX 12 T 4.2 Dscam_C pdbhh F Eukaryota F 2gch 1 A E CTRA_BOVIN GAMMA-CHYMOTRYPSIN A CGVPAIQPVLSGL 13 T 2 CaM_bind pdbhh F Eukaryota T 2gct 1 A A CTRA_BOVIN GAMMA-CHYMOTRYPSIN A CGVPAIQPVLSGL 13 T 2 CaM_bind pdbhh F Eukaryota T 2gct 4 D D TETRAPEPTIDE ADDUCT XPGAY 5 T 56 BsuPI pdbhh F F 2gdl 1 A A myeloid antimicrobial peptide 27 LVQRGRFGRFLRKIRRFRPKVTITIQGSARF 31 T 11 Dapper pdbhh F T 2ggm 2 C,D C,D XPC_HUMAN XERODERMA PIGMENTOSUM GROUP C COMPLEMENTING PROTEIN, P125 NWKLLAKGLLIRERLKR 17 T 9.9 MazG_C pdbhh F Eukaryota T 2ghq 2 C,D C,D RPB1_HUMAN RPB1 PSYSPTSPS 9 T 0.003 RNA_pol_Rpb1_R pdbhh F Eukaryota F 2ght 2 C,D C,D RPB1_HUMAN RPB1 SYSPTSPS 8 T 0.049 RNA_pol_Rpb1_R pdbhh F Eukaryota F 2git 3 C,F C,F Transcriptional activator TAX LLFGKPVYV 9 T 0.28 PDU_like pdbhh F T 2gj6 3 C C Modified HTLV-1 TAX (Y5K-IBA) peptide, chain C LLFGKPVYV 9 T 0.28 PDU_like pdbhh F T 2gjh 1 A,B A,B DESIGNED PROTEIN MERVRISITARTKKEAEKFAAILIKVFAELGYNDINVTWDGDTVTVEGQLEGGSLEHHHHHH 62 T 0.024 Helicase_RecD pdb F T 2gkw 2 B B BAFF RECEPTOR, B CELL-ACTIVATING FACTOR RECEPTOR, BAFF-R, BLYS RECEPTOR 3, B-CELL MATURATION DEFECT SVPVPATELGSTELVTTKTAGPEQ 24 T 24 Methyltrans_RNA pdbhh F T 2gmt 1 A A CTRA_BOVIN GAMMA-CHYMOTRYPSIN CGVPAIQPVLSGL 13 T 2 CaM_bind pdbhh F Eukaryota T 2gmx 2 B,D F,G JIP1_HUMAN JNK-INTERACTING PROTEIN 1, JIP-1, JNK MAP KINASE SCAFFOLD PROTEIN 1, ISLET-BRAIN-1, IB-1, MITOGEN-ACTIVATED PROTEIN KINASE 8-INTERACTING PROTEIN 1 RPKRPTTLNLF 11 T 12 CTP-dep_RFKase pdbhh F Eukaryota T 2gns 2 B P ALVYK ALVYK 5 T 150 DUF6436 pdbhh F F 2gph 2 B B PTN7_HUMAN PROTEIN-TYROSINE PHOSPHATASE LC-PTP, HEMATOPOIETIC PROTEIN-TYROSINE PHOSPHATASE, HEPTP RLQERRGSNVALMLDC 16 T 6.1 PA_decarbox pdbhh F Eukaryota T 2gpv 2 G,H,I G,H,I Q4RA23_TETNG N-COR2, SILENCING MEDIATOR OF RETINOIC ACID AND THYROID HORMONE RECEPTOR, SMRT, SMRTE, THYROID-, RETINOIC-ACID-RECEPTOR-ASSOCIATED COREPRESSOR, T3 RECEPTOR- ASSOCIATING FACTOR, TRAC, CTG REPEAT PROTEIN 26, SMAP270, TNMGLEAIIRKALMGKYDQWEE 22 T 3.5 RHH_7 pdbhh F Eukaryota T 2grl 2 E E peptide AITLIFI 7 T 1.7 DUF2754 pdbhh F F 2grm 2 D,E D,E peptide AITLIFI 7 T 1.7 DUF2754 pdbhh F F 2gs6 2 B B Peptide AEEEIYGEFEAKK 13 T 12 NCD2 pdbhh F T 2gv2 2 B B 8-MER P53 PEPTIDE ANALOGUE XFMXXXEXL 9 T 1.7 CNTF pdbhh F F 2h1c 2 B B FITA_NEIG1 Trafficking protein A VRLGSMLASIGQEIGGVEL 19 T 0.035 PSK_trans_fac unphh F Bacteria T 2h1p 3 C P PA1 GLQYTPSWMLVG 12 T 1.6 Polyoma_coat2 pdbhh F T 2h1u 1 A P MFLE MFLE 4 T 140 CENP_C_N pdbhh F F 2h2d 2 B B P53_HUMAN TUMOR SUPPRESSOR P53, PHOSPHOPROTEIN P53, ANTIGEN NY-CO-13 KKGQSTSRHKXLMFKTEG 18 T 30 Class_IIIsignal pdbhh F Eukaryota T 2h2f 2 B B P53_HUMAN TUMOR SUPPRESSOR P53, PHOSPHOPROTEIN P53, ANTIGEN NY-CO-13 KKGQSTSRHKKLMFKTEG 18 T 30 Class_IIIsignal pdbhh F Eukaryota T 2h43 4 G,H I,J GLY-HIS-ARG-PRO-AMIDE peptide ligand AHRPX 5 T 190 RPEL pdbhh F F 2h48 3 C C N-[(benzyloxy)carbonyl]-L-valyl-N-[(2S)-1-carboxy-4-fluoro-3-oxobutan-2-yl]-L-alaninamide XVADX 5 T 1100 RE_HindIII pdbhh F F 2h4f 2 B D P53_HUMAN TUMOR SUPPRESSOR P53, PHOSPHOPROTEIN P53, ANTIGEN NY-CO-13 KKGQSTSRHKXLMFKTEG 18 T 30 Class_IIIsignal pdbhh F Eukaryota T 2h4h 2 B B P53_HUMAN TUMOR SUPPRESSOR P53, PHOSPHOPROTEIN P53, ANTIGEN NY-CO-13 KKGQSTSRHKXLMFKTEG 18 T 30 Class_IIIsignal pdbhh F Eukaryota T 2h4j 2 B D P53_HUMAN TUMOR SUPPRESSOR P53, PHOSPHOPROTEIN P53, ANTIGEN NY-CO-13 KKGQSTSRHKKLMFKTEG 18 T 30 Class_IIIsignal pdbhh F Eukaryota T 2h4w 3 C C N-[(benzyloxy)carbonyl]-L-valyl-N-[(2S)-1-carboxy-4-fluoro-3-oxobutan-2-yl]-L-alaninamide XVADX 5 T 1100 RE_HindIII pdbhh F F 2h4y 3 C C N-[(benzyloxy)carbonyl]-L-valyl-N-[(2S)-1-carboxy-4-fluoro-3-oxobutan-2-yl]-L-alaninamide XVADX 5 T 1100 RE_HindIII pdbhh F F 2h51 3 C C N-[(benzyloxy)carbonyl]-L-valyl-N-[(2S)-1-carboxy-4-fluoro-3-oxobutan-2-yl]-L-alaninamide XVADX 5 T 1100 RE_HindIII pdbhh F F 2h54 3 C C N-[(benzyloxy)carbonyl]-L-valyl-N-[(2S)-1-carboxy-4-fluoro-3-oxobutan-2-yl]-L-alaninamide XVADX 5 T 1100 RE_HindIII pdbhh F F 2h59 2 C,D D,E P53_HUMAN TUMOR SUPPRESSOR P53, PHOSPHOPROTEIN P53, ANTIGEN NY-CO-13 KKGQSTSRHKKLMFKTEG 18 T 30 Class_IIIsignal pdbhh F Eukaryota T 2h5d 2 B B MEOSUC-ALA-ALA-PRO-ALA BORONIC ACID INHIBITOR XAAPX 5 T 450 DUF3458 pdbhh F F 2h5i 3 C C Ac-DEV(ASJ) XDEVX 5 T 570 Helicase_RecD pdbhh F F 2h5j 3 E,F E,F Ac-DMQD-Cho XDMQX 5 T 510 RamS pdbhh F F 2h5k 2 C C Shc-Derived Ligand XXVNX 5 T 260 DUF1830 pdbhh F F 2h65 3 E,F E,F Ac-VDVAD-Cho XVDVAX 6 T 460 Hemopexin pdbhh F F 2h6f 3 C P farnesylated peptide DDPTASACVLS 11 T 1.8 B pdbhh F T 2h6g 3 C P farnesylated peptide DDPTASACVLS 11 T 1.8 B pdbhh F T 2h6h 3 C P farnesylated peptide DDPTASACVLS 11 T 1.8 B pdbhh F T 2h6i 3 C P farnesylated peptide DDPTASACVLS 11 T 1.8 B pdbhh F T 2h6m 2 B I AC-LAAQMM-PMK XLAAXX 6 T 610 DUF4212 pdbhh F F 2h6t 2 B B pepstatin A XVVXAX 6 T 1700 FAM60A pdbhh F F 2h7r 1 A A CPXA_PSEPU CAMPHOR 5-MONOOXYGENASE, P450CAM TTETIQSNANLAPLPPHVPEHLVFDFDMYNPSNLSAGVQEAWAVLQESNVPDLVWTRCNGGHWIATRGQLIREAYEDYRHFSSECPFIPREAGEAYDFIPTSMDPPEQRQFRALANQVVGMPVVDKLENRIQELACSLIESLRPQGQCNFTEDYAEPFPIRIFMLLAGLPEEDIPHLKYLTDQMTRPDGSMTFAEAKEALYDYLIPIIEQRRQKPGTDAISIVANGQVNGRPITSDEAKRMCGALLVGGLDTVVNFLSFSMEFLAKSPEHRQELIERPERIPAACEELLRRFSLVADGRILTSDYEFHGVQLKKGDQILLPQMLSGLDERENAAPMHVDFSRQKVSHTTFGHGSHLCLGQHLARREIIVTLKEWLTRIPDFSIAPGAQIQHKSGIVSGVQALPLVWDPATTKAV 414 T 1.6E-05 p450 unppercent F Bacteria T 2h7s 1 A,B A,C CPXA_PSEPU CAMPHOR 5-MONOOXYGENASE, P450CAM TTETIQSNANLAPLPPHVPEHLVFDFDMYNPSNLSAGVQEAWAVLQESNVPDLVWTRCNGGHWIATRGQLIREAYEDYRHFSSECPFIPREAGEAYDFIPTSMDPPEQRQFRALANQVVGMPVVDKLENRIQELACSLIESLRPQGQCNFTEDYAEPFPIRIFMLLAGLPEEDIPHLKYLTDQMTRPDGSMTFAEAKEALYDYLIPIIEQRRQKPGTDAISIVANGQVNGRPITSDEAKRMCGALLVGGLDTVVNFLSFSMEFLAKSPEHRQELIERPERIPAACEELLRRFSLVADGRILTSDYEFHGVQLKKGDQILLPQMLSGLDERENAAPMHVDFSRQKVSHTTFGHGSHLCLGQHLARREIIVTLKEWLTRIPDFSIAPGAQIQHKSGIVSGVQALPLVWDPATTKAV 414 T 1.6E-05 p450 unppercent F Bacteria T 2h96 2 B,D F,G JIP1_HUMAN JNK-INTERACTING PROTEIN 1, JIP-1, JNK MAP KINASE SCAFFOLD PROTEIN 1, ISLET-BRAIN 1, IB-1, MITOGEN-ACTIVATED PROTEIN KINASE 8-INTERACTING PROTEIN 1 RPKRPTTLNLF 11 T 12 CTP-dep_RFKase pdbhh F Eukaryota T 2h9e 4 D S selectide inhibitor DTY-ILE-ARG-LEU-LPD peptide XIRLX 5 T 22 DUF5956 pdbhh F F 2h9h 2 B I Three residue peptide XLAAXX 6 T 610 DUF4212 pdbhh F F 2h9r 2 C C AKAP5_HUMAN AKAP79(391-412), AKAP75(391-412) LLIETASSLVKNAIQLSIEQLV 22 T 7.4 IpaB_EvcA pdbhh F Eukaryota T 2hal 2 B I AC-LFFE-FMK XLFFXEX 7 T 190 DUF2536 pdbhh F F 2hbq 3 C C N-[(benzyloxy)carbonyl]-L-valyl-N-[(2S)-1-carboxy-4-fluoro-3-oxobutan-2-yl]-L-alaninamide XVADX 5 T 1100 RE_HindIII pdbhh F F 2hbr 3 C C N-[(benzyloxy)carbonyl]-L-valyl-N-[(2S)-1-carboxy-4-fluoro-3-oxobutan-2-yl]-L-alaninamide XVADX 5 T 1100 RE_HindIII pdbhh F F 2hby 3 C C N-[(benzyloxy)carbonyl]-L-valyl-N-[(2S)-1-carboxy-4-fluoro-3-oxobutan-2-yl]-L-alaninamide XVADX 5 T 1100 RE_HindIII pdbhh F F 2hbz 3 C C Z-VAD-FMK XVADX 5 T 1100 RE_HindIII pdbhh F F 2hdx 2 G,H,I,J,K,L G,H,I,J,K,L JAK2_MOUSE Jak2 protein TPDXELLTEND 11 T 6.5 SPT6_acidic pdbhh F Eukaryota T 2hev 1 A F TNFL4_HUMAN OX40 LIGAND, OX40L, GLYCOPROTEIN GP34, TAX TRANSCRIPTIONALLY-ACTIVATED GLYCOPROTEIN 1, CD252 ANTIGEN GSHMQVSHRYPRIQSIKVQFTEYKKEKGFILTSQKEDEIMKVQDNSVIINCDGFYLISLKGYFSQEVDISLHYQKDEEPLFQLKKVRSVNSLMVASLTYKDKVYLNVTTDNTSLDDFHVNGGELILIHQNPGEFCVL 137 T 0.016 tRNA_NucTran2_2 unppercent F Eukaryota T 2hfr 1 A A CTHL3_CHICK CATHELICIDIN KRFWPLVPVAINTVAAGINLYKAIRRK 27 T 6.2 PsaX pdbhh F Eukaryota T 2hgo 1 A A CASSI_CORCC CASSIICOLIN QTCVSCVNFGNGFCGDNCGNSWACSGC 27 T 0.32 CIAPIN1 pdbhh F Eukaryota T 2hh0 3 C P PRIO_BOVIN Prion protein HGQWNKPSK 9 T 1.1 ACTH_domain pdbhh F Eukaryota T 2hjk 3 C C Q70AA1_9HIV1 Gag protein KGFNPEVIPMF 11 T 4.1E-05 Gag_p24 unphh T Viruses T 2hkf 3 C P CAH9_HUMAN CARBONIC ANHYDRASE IX, CARBONATE DEHYDRATASE IX, CA-IX, CAIX, MEMBRANE ANTIGEN MN, P54/58N, RENAL CELL CARCINOMA-ASSOCIATED ANTIGEN G250, RCC-ASSOCIATED ANTIGEN G250, PMW1 LPGEEDLPG 9 T 1.1 Octapeptide pdbhh F Eukaryota F 2hl3 2 C C MARE1_HUMAN APC-BINDING PROTEIN EB1, END-BINDING PROTEIN 1, EB1 EEQEEY 6 T 5.5 BRCC36_C pdbhh F Eukaryota F 2hlo 4 G,H G,H GLY-HYP-ARG-PRO-AMIDE PEPTIDE LIGAND GPRPX 5 T 110 SRCR_2 pdbhh F F 2hm3 1 A A Q8IT70_HYDVU Nematocyst outer wall antigen GSQITGTCPSGCSGDCYPECKPGCCGQVNLN 31 T 0.41 DUF6331 pdbhh F Eukaryota T 2hm4 1 A A Q8IT70_HYDVU Nematocyst outer wall antigen GSQITGTCPSGCSGDCYPECPPGCCGQVNLN 31 T 0.39 DUF6331 pdbhh F Eukaryota T 2hm5 1 A A Q8IT70_HYDVU Nematocyst outer wall antigen GSQITGTCPSGCSGDCYPECPPGCCGQVNLN 31 T 0.39 DUF6331 pdbhh F Eukaryota T 2hm6 1 A A Q8IT70_HYDVU Nematocyst outer wall antigen GSQITGTCPSVCSGDCYPECPPGCCGQVNLN 31 T 1.6 Tme5_EGF_like pdbhh F Eukaryota T 2hmh 2 B B IL6RB_MOUSE IL-6R-BETA, INTERLEUKIN 6 SIGNAL TRANSDUCER, MEMBRANE GLYCOPROTEIN 130, GP130, CD130 ANTIGEN STVEXSTVVHS 11 T 4.9 S19 pdbhh F Eukaryota T 2hn7 3 C C DNA polymerase PEPTIDE HOMOLOGUE AIMPARFYPK 10 T 0.013 DNA_pol_viral_N pdbhh F T 2ho2 2 B B ENAH_HUMAN HMENA PPPPPPPPPL 10 T 24 Adeno_E4 pdbhh F Eukaryota F 2hob 2 B B N-[(5-METHYLISOXAZOL-3-YL)CARBONYL]ALANYL-L-VALYL-N~1~-((1R,2Z)-4-(BENZYLOXY)-4-OXO-1-{[(3R)-2-OXOPYRROLIDIN-3-YL]METHYL}BUT-2-ENYL)-L-LEUCINAMIDE XAVLXX 6 T 1700 zf-H2C2_5 pdbhh F F 2hod 4 M,N,O,P,Q,R,S,T M,N,O,P,Q,R,S,T Gly-hydroxyPro-Arg-Pro-amide peptide ligand GPRPX 5 T 110 SRCR_2 pdbhh F F 2hpc 4 M,N,O,P,Q,R,S,T M,N,O,P,Q,R,S,T Gly-Pro-Arg-Pro-amide peptide ligand GPRPX 5 T 110 SRCR_2 pdbhh F F 2hpe 2 C S UNIDENTIFIED PEPTIDE FRAGMENT XXXXXXXXX 9 F F F 2hpf 2 C S UNIDENTIFIED PEPTIDE FRAGMENT XXXXXXXX 8 F F F 2hpl 2 B B C-terminal of mouse p97/VCP DDLYG 5 T 40 DUF1958 pdbhh F F 2hpz 2 B B 11-mer synthetic peptide KLKLLVVIRLK 11 T 40 Med14 pdbhh F F 2hqw 2 B B NMDZ1_RAT N-METHYL-D-ASPARTATE RECEPTOR SUBUNIT NR1, NR1C1 PEPTIDE KKKATFRAITSTLASSFKRRRSSK 24 T 14 Neuropeptide_S pdbhh F Eukaryota T 2hrp 3 C,F P,Q HIV-1 PROTEASE PEPTIDE MSLPGRWKPK 10 T 1.1 DUF3304 pdbhh F T 2ht9 2 C X 12-mer peptide LGTENLYFQSME 12 T 6.7 DUF6099 pdbhh F T 2htf 1 A A DPOLM_HUMAN POL MU GTPPSTRFPGVAIYLVEPRMGRSRRAFLTGLARSKGFRVLDACSSEATHVVMEETSAEEAVSWQERRMAAAPPGCTPPALLDISWLTESLGAGQPVPVECRHRLE 105 T 0.0002 BRCT pdbpercent F Eukaryota T 2hu2 2 B B ZN217_HUMAN 9-mer peptide from Zinc finger protein 217 RRTGAPPAL 9 T 48 CCDC84 pdbhh F Eukaryota T 2hug 2 B B SR54C_ARATH SRP54, 54, CHLOROPLAST PROTEIN, 54CP, FFC APPGTARRKRKADS 14 T 7.1 DapB_C pdbhh F Eukaryota T 2hwl 3 E P FIBG_HUMAN Fibrinogen gamma' peptide PAETEXDSLXPEDD 14 T 0.12 DUF3637 pdbhh F Eukaryota T 2hwn 2 E,F E,F Q4R5S0_MACFA A Kinase binding peptide QEELAWKIAKMIVSDVMQQCKK 22 T 2.6 Imm-NTF2-2 pdbhh F Eukaryota T 2hzs 3 I,J,K,L I,J,K,L MED8_YEAST RNA POLYMERASE II TRANSCRIPTIONAL REGULATION MEDIATOR 8 SKPSKPFNVDDVLKFTFTGEKHHHHHH 27 T 13 Tna_leader pdbhh F Eukaryota T 2i04 2 B,D C,D VE6_HPV18 peptide E6 RRRETQV 7 T 0.19 Mu-like_Com unphh T Viruses F 2i0i 2 B,D,F D,E,F VE6_HPV18 peptide E6 RRRETQV 7 T 0.19 Mu-like_Com unphh T Viruses F 2i0l 2 B,D C,D VE6_HPV18 peptide E6 RRRETQV 7 T 0.19 Mu-like_Com unphh T Viruses F 2i1d 1 A A PF11_PIG TRITRP1; PF-1; C6 VRRFPWWWPFLRRX 14 T 1.8 DUF2841 pdbhh F Eukaryota T 2i1e 1 A A 13-mer analogue of Prophenin-1 containing WWW VKKFPWWWPFLKKX 14 T 0.95 DUF2841 pdbhh F T 2i1f 1 A A PF11_PIG 13-mer analogue of Prophenin-1 containing WWW VRRFAWWWAFLRRX 14 T 0.7 DUF6499 pdbhh F Eukaryota T 2i1g 1 A A 13-mer analogue of Prophenin-1 containing WWW VRRYPWWWPYLRRX 14 T 0.9 DUF2841 pdbhh F T 2i1h 1 A A PF11_PIG 13-mer analogue of Prophenin-1 containing WWW VRRFAWWWPFLRRX 14 T 0.14 DUF6264 pdbhh F Eukaryota T 2i1i 1 A A PF11_PIG 13-mer analogue of Prophenin-1 containing WWW VRRFPWWWAFLRRX 14 T 0.58 DUF6499 pdbhh F Eukaryota T 2i3h 2 C,D C,D AVPW peptide AVPW 4 T 42 Cbl_N2 pdbhh F F 2i6o 2 B B NK(PTR)GN NKXGN 5 T 56 HET-s_218-289 pdbhh F F 2i7u 1 A,B A,B Four-alpha-helix bundle MKKLREEAAKLFEEWKKLAEEAAKLLEGGGGGGGGELMKLCEEAAKKAEELFKLAEERLKKL 62 T 0.00038 DUF1771 pdb F T 2i94 2 B B RK_BOVIN RK, G PROTEIN-COUPLED RECEPTOR KINASE 1 MDFGSLETVVANSAFIAARGSFDAS 25 T 3 DUF5465 pdbhh F Eukaryota T 2i9m 1 A A MHA6 SAAEAYAKRIAEAMAKG 17 T 2.7 PilA4 pdbhh F T 2i9n 1 A A MHB4A peptide RGKWTYNGITYEGGGGSAAEAYAKRIAEAMAKG 33 T 1.9 DUF4923 pdbhh F T 2i9o 1 A A MHB8A peptide RGKWTYNGITYEGGGGGGGGSAAEAYAKRIAEAMAKG 37 T 3.8 Cytidylate_kin pdbhh F T 2iae 4 G,H M,N microcystin-LR XLXRXXX 7 T 49 Phage_TAC_2 pdbhh F F 2iah 2 B I Pyoverdin C-E XRXXKXTT 8 T 12 DapH_N pdbhh F F 2id4 2 C,D C,D Ac-RERK-CMK inhibitor XRERXX 6 T 180 DUF1005 pdbhh F F 2ie3 3 C I microcystin LR XLXRXXX 7 T 49 Phage_TAC_2 pdbhh F F 2ifi 1 A A Alpha-conotoxin ImI GCCSDARCAWRCX 13 T 0.09 Toxin_8 pdbhh F T 2ifj 1 A A Alpha-conotoxin ImI GCCSDKRCAWRC 12 T 0.066 Toxin_8 pdbhh F T 2ifr 2 B B Octapeptide XFKFXALRX 9 T 53 Root_cap pdbhh F T 2ifw 2 C,D C,D Heptapeptide XFKFXLR 7 T 27 DUF3889 pdbhh F F 2ifz 1 A A Alpha-conotoxin ImI GCCSDKRCAWRCX 13 T 0.089 Toxin_8 pdbhh F T 2ig0 2 B B H4_HUMAN Dimethylated Histone H4-K20 peptide KRHRKVLRDN 10 T 0.27 UPF0137 unp F Eukaryota T 2igr 1 A A Anticancer peptide CB1a KWKVFKKIEKKWKVFKKIEKAGPKWKVFKKIEKX 34 T 0.16 Cecropin pdb F T 2igu 1 A A CA1_CONIM Alpha-conotoxin ImI GCCSDPRCAWRC 12 T 0.0098 Toxin_8 unphh F Eukaryota T 2igz 1 A A BACILLOMYCIN L-3 XNXXSEXT 8 T 30 DUF5497 pdbhh F F 2ih0 1 A A BACILLOMYCIN L-3 XDXXSQXT 8 T 110 Parecho_VpG pdbhh F F 2ih6 1 A A Lambda-conotoxin CMrVIA VCCGYPLCHPC 11 T 0.073 RPAP2_Rtr1 pdbhh F T 2ih7 1 A A Lambda-conotoxin CMrVIA VCCGYPLCHPCX 12 T 0.095 RPAP2_Rtr1 pdbhh F T 2ihs 2 C,D C,D VASA1_DROME ANTIGEN MAB46F11 DINNNNNIVEDVERKREFYI 20 T 4 CppA_N pdbhh F Eukaryota T 2ii1 1 A,B,C,D A,B,C,D Q9KGN3_BACHD Acetamidase GMIRLSNENTIFFMDKENVPIASCQSGDTVIFETKDCFSDQITNEEQALTSIDFNRVNPATGPLYVEGARRGDMLEIEILDIKVGKQGVMTAAPGLGALGESLNSPTTKLFPIEGDDVVYSTGLRLPLQPMIGVIGTAPPGEPINNGTPGPHGGNLDTKDIKPGTTVYLPVEVDGALLALGDLHAAMGDGEILICGVEIAGTVTLKVNVKKERMFPLPALKTDTHFMTIASAETLDAAAVQATKNMATFLANRTALSIEEAGMLLSGAGDLYVSQIVNPLKTARFSLALHYFEKLGVDLCN 301 T 1.5E-21 FmdA_AmdA pdbpercent F Bacteria T 2ipu 3 E,F P,Q A4_HUMAN abeta 1-8 peptide DAEFRHDS 8 T 0.0001 Beta-APP unphh F Eukaryota T 2iq6 2 B B Peptide, (Leucyl-leucyl-leucine) LLL 3 T 930 PAP_assoc pdbhh F F 2isq 2 B B SAT1_ARATH ATSAT-1, SAT-P, ATSERAT2;1 TEWSDYVI 8 T 0.23 Phage_T4_gp36 pdbhh F Eukaryota T 2itb 1 A,B A,B Q88KV1_PSEPK TRNA-(Ms(2)io(6)a)-hydroxylase, putative GMSLIPEIDAFLGCPTPDAWIEAALADQETLLIDHKNCEFKAASTALSLIAKYNTHLDLINMMSRLAREELVHHEQVLRLMKRRGVPLRPVSAGRYASGLRRLVRAHEPVKLVDTLVVGAFIEARSCERFAALVPHLDEELGRFYHGLLKSEARHYQGYLKLAHNYGDEADIARCVELVRAAEMELIQSPDQELRFHSGIPQALAA 206 T 2.1E-18 MiaE pdbpssm F Bacteria T 2itk 2 B B D-Peptide XFXXXQX 7 T 340 eIF3m_C_helix pdbhh F F 2iuh 2 B B KIT_HUMAN C-KIT PHOSPHOTYROSYL PEPTIDE TNEXMDMKPGV 11 T 15 AvrPtoB-E3_ubiq pdbhh F Eukaryota T 2iui 2 C,D C,D PGFRB_HUMAN PDGFR-BETA,BETA PLATELET-DERIVED GROWTH FACTOR RECEPTOR,BETA-TYPE PLATELET-DERIVED GROWTH FACTOR RECEPTOR,CD140 ANTIGEN-LIKE FAMILY MEMBER B,PLATELET-DERIVED GROWTH FACTOR RECEPTOR 1,PDGFR-1 SIDXVPMLDMK 11 T 2.1 DapH_N pdbhh F Eukaryota T 2iv8 2 B,C P,Q ARRB1_HUMAN B-ARRESTIN2 DDDIVFEDFARQRLKGMKDD 20 T 19 Lsm_interact pdbhh F Eukaryota T 2iv9 2 C P EPS15_HUMAN EPS15, PROTEIN EPS15, AF-1P PROTEIN SFGDGFADFSTL 12 T 1.6 Pico_P2B pdbhh F Eukaryota T 2ivf 3 C C Q5P5I2_AROAE ETHYLBENZENE DEHYDROGENASE GAMMA-SUBUNIT MKAKRVPGGKELLLDLDAPIWAGAESTTFEMFPTPLVMVKEVSPFLALSEGHGVIKRLDVAALHNGSMIALRLKWASEKHDKIVDLNSFVDGVGAMFPVARGAQAVTMGATGRPVNAWYWKANANEPMEIVAEGFSAVRRMKDKAGSDLKAVAQHRNGEWNVILCRSMATGDGLAKLQAGGSSKIAFAVWSGGNAERSGRKSYSGEFVDFEILK 214 T 5E-18 EB_dh pdbpercent F Bacteria T 2ivh 1 A A CEA7_ECOLX COLCIN-E7 KPGKATGKGKPVNNKWLNNAGKDLGSPVPDRIANKLRDKEFKSFDDFRKKFWEEVSKDPELSKQFSRNNNDRMKVGKAPKTRTQDVSGKRTSFELHQEKPISQNGGVYDMDNISVVTPKRHIDIHRGK 128 T 0.0014 HNH pdbpercent F Bacteria T 2iwb 2 B B TETRAPEPTIDE GHMS 4 T 59 Peptidase_C23 pdbhh F F 2ixp 2 E,F,G,H F,G,H,I SIN-ALA-ALA-PRO-LYS-NIT XAAPKX 6 T 540 MMR1 pdbhh F F 2iy2 1 A,B A,B DSBG_ECOLI DSBG MELPAPVKAIEKQGITIIKTFDAPGGMKGYLGKYQDMGVTIYLTPDGKHAISGYMYNEKGENLSNTLIEKEI 72 T 0.00056 DsbC_N pdbpssm F Bacteria T 2iy3 3 C C SIGNAL SEQUENCE AALALAAAAALALAAAG 17 T 16 DUF108 pdbhh F F 2izq 1 A,B,C,D A,B,C,D GRAMICIDIN D XGAXAXVXWXWYFXWXWX 18 T 4.6 MAP17 pdbhh F T 2izx 2 C C AKAP-IS QIEYLAKQIVDNAIQQAK 18 T 0.0037 RII_binding_1 pdb F T 2j04 2 B,D B,D TFC6_YEAST TAU91 MGLLKDLSSARDKIERIYGLNKEKLLLLAKVKEGFETSVFDFPFKNIQPDSPYFVCLDPPCKKESAYNKVIGDKNRTVYHEINKTEFENMIKLRTKRLKLLIGEVDAEVSTGDKIEFPVLANGKRRGFIYNVGGLVTDIAWLNIEENTDIGKDIQYLAVAVSQYMDEPLNEHLEMFDKEKHSSCIQIFKMNTSTLHCVKVQTIVHSFGEVWDLKWHEGCHAPHLVGCLSFVSQEGTINFLEIIDNATDVHVFKMCEKPSLTLSLADSLITTFDFLSPTTVVCGFKNGFVAEFDLTDPEVPSFYDQVHDSYILSVSTAYSDFEDTVVSTVAVDGYFYIFNPKDIATTKTTVSRFRGSNLVPVVYCPQIYSYIYSDGASSLRAVPSRAAFAVHPLVSRETTITAIGVSRLHPMVLAGSADGSLIITNAARRLLHGIKNSSATQKSLRLWKWDYSIKDDKYRIDSSYEVYPLTVNDVSKAKIDAHGINITCTKWNETSAGGKCYAFSNSAGLLTLEYLSLEHHHHHH 524 T 4E-09 Lgl_C unphh F Eukaryota T 2j30 2 B B AC-DEVD-CMK XDEVDX 6 T 200 ResIII pdbhh F F 2j31 2 B B AC-DEVD-CMK XDEVDX 6 T 200 ResIII pdbhh F F 2j32 2 B B AC-DEVD-CMK XDEVDX 6 T 200 ResIII pdbhh F F 2j33 2 B B AC-DEVD-CMK XDEVDX 6 T 200 ResIII pdbhh F F 2j6f 2 B C CBLB_HUMAN CAS-BR-M MURINE ECTROPIC RETROVIRAL TRANSFORMING SEQUENCE B, CBL-B, SIGNAL TRANSDUCTION PROTEIN CBL-B, SH3-BINDING PROTEIN CBL-B, CASITAS B-LINEAGE LYMPHOMA PROTO-ONCOGENE B, RING FINGER PROTEIN 56 PARPPKPRPRR 11 T 2.3 Hap4_Hap_bind pdbhh F Eukaryota F 2j6o 2 B C CD2_HUMAN T-CELL SURFACE ANTIGEN T11/LEU-5, LFA-2, LFA-3 RECEPTOR, ERYTHROCYTE RECEPTOR, ROSETTE RECEPTOR, CD2 KGPPLPRPRV 10 T 4.6 Caskin-Pro-rich pdbhh F Eukaryota T 2j7i 2 C,D C,D CD2_HUMAN T-CELL SURFACE ANTIGEN T11/LEU-5, LFA-2, LFA-3 RECEPTOR, ERYTHROCYTE RECEPTOR, ROSETTE RECEPTOR, CD2 KGPPLPRPRV 10 T 4.6 Caskin-Pro-rich pdbhh F Eukaryota T 2j7x 2 B B NCOA5_HUMAN NCOA-5, COACTIVATOR INDEPENDENT OF AF-2, CIA, NCOA5 HPPAIQSLINLLADNRY 17 T 2.5 HEAT_PBS pdbhh F Eukaryota T 2j7y 2 B B NCOA5_HUMAN NCOA-5, COACTIVATOR INDEPENDENT OF AF-2, CIA, NCOA5 HPPAIQSLINLLADNRY 17 T 2.5 HEAT_PBS pdbhh F Eukaryota T 2j8a 1 A A SET1_YEAST COMPASS COMPONENT SET1, SET DOMAIN-CONTAINING PROTEIN 1, SET1 HISTONE METHYLTRANSFERASE MSCEIVVYPAQDSTTTNIQDISIKNYFKKYGEISHFEAFNDPNSALPLHVYLIKYASSDGKINDAAKAAFSAVRKHESSGCFIMGFKFEVILNKHSILNNIISKFVEINVKKLQKLQENLKKAKEKEAENHHHHHH 136 T 0.0005 DUF4618 pdb F Eukaryota T 2j8u 3 C,H C,J EMC7_HUMAN SELF-PEPTIDE P1049 ALWGFFPVL 9 T 0.51 MRP-L47 pdbhh F Eukaryota T 2j9a 2 B D MICROGININ FR1 AXYY 4 T 110 Lipoprotein_15 pdbhh F F 2j9j 3 C C INHIBITOR MOLECULE JG365 XSLNXIX 7 T 490 Rad54_N pdbhh F F 2j9n 2 B B UNKNOWN PEPTIDE XXXXXXXXXXXXXXX 15 F F F 2j9n 3 C C UNKNOWN PEPTIDE QXX 3 T 1600 GoLoco pdbhh F F 2jam 2 C D POLYPEPTIDE GVSKFA 6 T 75 Toxin_36 pdbhh F T 2jam 3 D E POLYPEPTIDE VSKF 4 T 170 Get5_bdg pdbhh F F 2jaz 2 B,D B,D CEA7_ECOLX COLICIN E7 KRNKPGKATGKGKPVNNKWLNNAGKDLGSPVPDRIANKLRDKEFKSFDDFRKKFWEEVSKDPELSKQFSRNNNDRMKVGKAPKTRTQDVSGKRTSFELHHEKPISQNGGVYDMDDISVVTPKRHIDIHRGK 131 T 0.042 HNH pdbpercent F Bacteria T 2jb0 2 B B CEA7_ECOLX COLICIN E7 KRNKPGKATGKGKPVNNKWLNNAGKDLGSPVPDRIANKLRDKEFKSFDDFRKKFWEEVSKDPELSKQFSRNNNDRMKVGKAPKTRTQDVSGKRTSFELHHEKPISQNGGVYDMDNISVVTPKRHIDIARGK 131 T 0.021 HNH pdbpercent F Bacteria T 2jbg 2 B,D B,D CEA7_ECOLX COLICIN-E7 KRNKPGKATGKGKPVNNKWLNNAGKDLGSPVPDRIANKLRDKEFKSFDDFRKKFWEEVSKDPELSKQFSRNNNDRMKVGKAPKTRTQDVSGKRTSFELHHEKPISQNGGVYDMDAISVVTPKRHIDIHRGK 131 T 0.0015 HNH pdbpercent F Bacteria T 2jbu 2 C,D C,D CO-PURIFIED PEPTIDE AAAAAAAAAAAA 12 T 250 K_channel_TID pdbhh F F 2jcc 3 C,H C,J EMC7_HUMAN P1049 ALWGFFPVL 9 T 0.51 MRP-L47 pdbhh F Eukaryota T 2jd5 2 C C NOP3_YEAST MITOCHONDRIAL TARGETING SUPPRESSOR 1 PROTEIN, NUCLEAR POLYADENYLATED RNA-BINDING PROTEIN 1, NPL3P RERSPTR 7 T 75 DUF3220 pdbhh F Eukaryota F 2jdo 2 B C GSK3B_HUMAN GSK3-BETA PEPTIDE, GSK-3 BETA GRPRTTSFAE 10 T 6.3 DUF3084 pdbhh F Eukaryota T 2jdr 2 B C GSK3B_HUMAN GSK3-BETA PEPTIDE, GSK-3 BETA GRPRTTSFAE 10 T 6.3 DUF3084 pdbhh F Eukaryota T 2je4 2 C C INHIBITOR MOLECULE JG365 XSLNXIX 7 T 490 Rad54_N pdbhh F F 2jes 2 B,D,F,H,J,L,N,P,R,T,V,X,Z B,D,F,H,J,L,N,P,R,T,V,X,Z UNIDENTIFIED FRAGMENT OF PORTAL PROTEIN XXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 30 F F F 2jet 1 A A CTRB1_RAT CHYMOTRYPSINOGEN B CHAIN A MSTQACGVPTIQPVL 15 T 2.2 Zn_ribbon_2 pdbhh F Eukaryota T 2jf9 2 D,E,F P,Q,R AB5 PEPTIDE SPGSREWFKDMLS 13 T 0.56 Sorb pdbhh F T 2jfa 3 C,D P,Q COREPRESSOR PEPTIDE DAFQLRQLILRGLQDD 16 T 9.8 DUF5731 pdbhh F T 2jk9 2 B B PAWR_HUMAN PROSTATE APOPTOSIS RESPONSE 4 PROTEIN, PAR-4 PEPTIDE NELNNNLPGGAPAAP 15 T 15 RTT107_BRCT_6 pdbhh F Eukaryota T 2jkg 2 B P OCTAPROLINE PEPTIDE PPPPPPPP 8 T 23 TMEM171 pdbhh F F 2jld 2 C E PEPTIDE (ALA-GLY-GLY-ALA-ALA-ALA-ALA-ALA) AGGAAAAA 8 T 3.2 DUF1383 pdbhh F F 2jma 2 B B P41 peptide XAPSYSPPPPP 11 T 1.8 N1221 pdbhh F F 2jmf 2 B B Q32UW5_DROME Neurogenic locus Notch protein GPLGSPNTGAKQPPSYEDCIK 21 T 0.56 TMEM52 pdbhh F Eukaryota T 2jms 1 A A A0FKY4_EUPNO Pheromone En-6 TDPEEHFDPNTNCDYTNSQDAWDYCTNYIVNSSCGEICCNDCFDETGTGACRAQAFGNSCLNW 63 T 0.0036 Euplotes_phero unp F Eukaryota T 2jmv 1 A A SVN_SCYVA SVN GSGPTYCWNEANNPGGPNRCSNNKQCDGARTCSSSGFCQGTSRKPDPGPKGPTYCWDEAKNPGGPNRCSNSKQCDGARTCSSSGFCQGTAGHAAA 95 T 0.0034 EB pdb F Bacteria T 2jmx 2 B B ATPA_BOVIN F1-ATPASE QKTGTAEVSSILEERILGADTSVDL 25 T 76 PspB pdbhh F Eukaryota T 2jmy 1 A A CM15 KWKLFKKIGAVLKVL 15 T 0.2 Melittin pdbhh F T 2jni 1 A A ANN2_AREMA Arenicin-2 RWCVYAYVRIRGVLVRYRRCW 21 T 2.4 Toxin_25 pdbhh F Eukaryota T 2jnr 1 A A VIR165 LEAIPCSIPPCFAFNKPFVF 20 T 0.93 Serpin pdbhh F T 2jnw 2 B B XPA_HUMAN XERODERMA PIGMENTOSUM GROUP A-COMPLEMENTING PROTEIN KIIDTGGGFILEEE 14 T 1.3 SDH_beta pdbhh F Eukaryota T 2jo4 1 A,B,C,D A,B,C,D KIA7 XAKAAAAAIKAIAAIIKAGGYX 22 T 4.4 DUF1726 pdbhh F T 2jo5 1 A,B,C,D A,B,C,D KIA7F XAKAAAAAIKAIAAIIKAGGFX 22 T 4.3 DUF1726 pdbhh F T 2joa 2 B B Peptide H1-C1 DSRIWWV 7 T 0.86 DUF4894 pdbhh F T 2jof 1 A A TRP-CAGE DAYAQWLKDGGPSSGRPPPS 20 T 1.8 Pam17 pdbhh F T 2jog 2 B B NFAT GPHPVIVITGPHEELE 16 T 0.24 Sigma_reg_C pdbhh F T 2jou 1 A A PSPB_HUMAN SP-B, 6 KDA PROTEIN, PULMONARY SURFACTANT-ASSOCIATED PROTEOLIPID SPLPHE, 18 KDA PULMONARY-SURFACTANT PROTEIN CWLCRALIKRIQAMIPKGGRMLPQLVCRLVLRCS 34 T 4.2E-12 SapB_2 unppssm F Eukaryota T 2jp5 1 A A ATWLPPR peptide ATWLPPR 7 T 17 SBE2 pdbhh F T 2jp6 1 A A KA181_TITOB TOXIN TC32 GSTGPQTTCQAAMCEAGCKGLGKSMESCQGDTCKCKA 37 T 0.0073 Defensin_2 unphh F Eukaryota T 2jp8 1 A P ANGT_HUMAN SERPIN A8, ANGIOTENSINOGEN DRVYIHP 7 T 3.4 PH_RBD pdbhh F Eukaryota T 2jpy 1 A A PHYL2_PHYHY Phylloseptin-2 protein FLSLIPHAINAVSTLVHHFX 20 T 0.0063 Clavanin unp F Eukaryota T 2jq0 1 A A PHYL1_PHYHY PS-1 FLSLIPHAINAVSAIAKHNX 20 T 3.4 BESS unphh F Eukaryota T 2jq1 1 A A PHYL3_PHYHY PS-3 FLSLIPHAINAVSALANHGX 20 T 3.8 BESS unphh F Eukaryota T 2jq2 1 A A pw2 HPLKQYWWRPSI 12 T 0.31 Leader_Trp pdbhh F T 2jq7 3 C C THCL_STRAJ THIOSTREPTON XIAXASXTXXXXTXXXXXX 19 T 0.93 CCER1 pdbhh F Bacteria F 2jq9 2 B B CHM1A_HUMAN CHARGED MULTIVESICULAR BODY PROTEIN 1A, CHMP1A, VACUOLAR PROTEIN SORTING 46-1, VPS46-1, HVPS46-1 VRSQEDQLSRRLAALRN 17 T 4.2 VESA1_N pdbhh F Eukaryota T 2jqi 2 B B RAD53_YEAST SERINE-PROTEIN KINASE 1 NITQPTQQST 10 T 11 VlpA_repeat pdbhh F Eukaryota T 2jqk 2 B B CHM2B_HUMAN CHROMATIN-MODIFYING PROTEIN 2B, CHMP2B, CHMP2.5, VACUOLAR PROTEIN SORTING 2-2, VPS2-2, HVPS2- 2 KATISDEEIERQLKALGVD 19 T 0.021 LEM pdb F Eukaryota T 2jql 2 B B RAD53_YEAST SERINE-PROTEIN KINASE 1 NITQPTQQST 10 T 11 VlpA_repeat pdbhh F Eukaryota T 2jqs 1 A A ALLS_DIPPU Allatostatins DRLYSFGLX 9 T 0.094 Carcinustatin pdbhh F Eukaryota T 2jqu 1 A A ALLS_DIPPU Allatostatins GGSLYSFGLX 10 T 0.092 Carcinustatin pdbhh F Eukaryota T 2jqw 1 A A D0VWW5_ODOGR lectin-like peptide YASPKCFRYPNGVLACT 17 T 2 MORN_2 pdbhh F Eukaryota T 2jrv 1 A A PEPTIDE PEP.1 PMTLPENYFSERPYH 15 T 4.4 DUF4524 pdbhh F T 2jrw 1 A A Cyclic extended Pep.1 CAEPMTLPENYFSERPYHPPPPC 23 T 5.8 Tryp_FSAP pdbhh F T 2jsb 1 A A ANN1_AREMA Arenicin-1 RWCVYAYVRVRGVLVRYRRCW 21 T 2.4 Toxin_25 pdbhh F Eukaryota T 2jst 1 A,B A,B Four-Alpha-Helix Bundle MKKLREEAAKLFEEWKKLAEEAAKLLEGGGGGGGGELMKLCEEAAKKAEELFKLAEERLKKL 62 T 0.00038 DUF1771 pdb F T 2jt9 1 A A 5-mer immunosuppressory peptide from cyclolinopeptide X PPILL 5 T 54 DUF452 pdbhh F F 2jta 1 A A 10-mer ubiquitin peptide LEDGRTLSDY 10 T 0.011 FERM_f0 pdbhh F T 2jtd 1 A A MYOM1_MOUSE SKELEMIN GSSHHHHHHSSGLVPRGSHMEEEMKRLLALSQEHKFPTVPTKSELAVEILEKGQVRFWMQAEKLSSNAKVSYIFNEKEIFEGPKYKMHIDRNTGIIEMFMEKLQDEDEGTYTFQIQDGKATGHSTLVLIGDVYKKLQKEAEF 142 T 0.00037 V-set pdb F Eukaryota T 2jui 1 A A P71470_LACPN BACTERIOCIN PEPTIDE PLNE FNRGGYNFGKSVRHVVDAIGSVAGIRGILKSIR 33 T 4.7 DHH pdbhh F Bacteria T 2jup 2 B P FMN1_MOUSE LIMB DEFORMITY PROTEIN GPPLIPPPP 9 T 0.98 SMN pdbhh F Eukaryota F 2juq 1 A A CA1A_CONRE ALPHA-RGIA GCCSDVRCRYRCR 13 T 0.00026 Toxin_8 unp F Eukaryota T 2jur 1 A A CA1A_CONRE ALPHA-RGIA GCCSEPRCRYRCR 13 T 0.00026 Toxin_8 unp F Eukaryota T 2jus 1 A A CA1A_CONRE ALPHA-RGIA GCCSDPRCRWRCR 13 T 0.00026 Toxin_8 unp F Eukaryota T 2juy 1 A A Neopetrosiamide A FFCPFGCALVDCGPNRPCRDTGFMSCDC 28 T 4.1 Fib_alpha pdbhh F T 2jv8 1 A A Q82V59_NITEU Uncharacterized protein NE1242 MTHHTEVFEGGTIDIEDDTSLTINGKEISYVHDAVKNKWSSRYLPYTQYDSLLDLARAIIRDTVEFSGVKEGS 73 T 0.022 PFU unppercent F Bacteria T 2jve 1 A A A8D0E6_NOTVI Prod 1 MGSSHHHHHHSSGLVPRGSHMALKCFTRNGDDRTVTTCAEEQTRCLFVQLPYSEIQECKTVQQCAEVLEEVTAIGYPAKCCCEDLCNRSEQ 91 T 0.71 Toxin_TOLIP pdbpercent F Eukaryota T 2jvu 1 A A Q08JB9_ECOLX DISPERSIN GGSGWNADNVDPSQCIKQSGVQYTYNSGVSVCMQGLNEGKVRGVSVSGVFYYNDGTTSNFKGVVTPSTPVNTNQDINKTNKVGVQKYRALTEWVGSRSHHHHHH 104 T 0.076 Colicin_M unppercent F Bacteria T 2jw1 2 B B MXID_SHIFL Outer membrane protein mxiD XSETTLLEDEKSLVSYLNY 19 T 17 DUF3512 pdbhh F Bacteria T 2jx6 1 A A DDSK_PHYDS DD K GLWSKIKAAGKEAAKAAAKAAGKAALNAVSEAVX 34 T 0.00011 DD_K unp F Eukaryota T 2jy0 1 A A POLG_HCVCO Protease NS2-3 MDREMAASAGGAVFVGLVLLTLSPHYK 27 T 0.01 HCV_NS2 pdbhh T Viruses T 2jyp 1 A A Q9BP37_HALRU Aragonite protein AP7 TRHSFRRPFHECALCYSITDPGERQRCIDMYCSYTN 36 T 1.8 HMw1_D2 pdbhh F Eukaryota T 2jzi 2 B B PP2BA_HUMAN CALMODULIN-DEPENDENT CALCINEURIN A SUBUNIT ALPHA ISOFORM, CAM-PRP CATALYTIC SUBUNIT ARKEVIRNKIRAIGKMARVFSVLR 24 T 6.1 DUF2626 pdbhh F Eukaryota T 2k00 2 B B LAYN_MOUSE Layilin GRSKESGWVENEIYY 15 T 8.8 Ploopntkinase2 pdbhh F Eukaryota T 2k0f 2 B B MYLK_CHICK 19-MER PEPTIDE FROM TELOKIN; 19-MER PEPTIDE FROM KINASE-RELATED PROTEIN RRKWQKTGHAVRAIGRLSS 19 T 8.4 PACT_coil_coil pdbhh F Eukaryota T 2k13 1 A X D0VWW8_HAEOF Saratin EEREDCWTFYANRKYTDFDKSFKKSSDLDECKKTCFKTEYCYIVFEDTVNKECYYNVVDGEELDQEKFVVDENFTENYLTDCEGKDAGNAAGTGDESDEVDED 103 T 0.00083 PAN_3 pdb F Eukaryota T 2k1q 2 B B PHENETHYLAMIDE XELXX 5 T 1500 SEC-C pdbhh F F 2k20 2 B B O54857_RAT PROTEIN TYROSINE PHOSPHATASE AND TENSIN-LIKE PROTEIN DEDQHSQITKV 11 T 15 Invas_SpaK pdbhh F Eukaryota T 2k2f 1 A,B C,D RYR2_RAT Ryanodine receptor 1 peptide KKAVWHKLLSKQ 12 T 2.4 DUF3693 pdbhh F Eukaryota T 2k2r 2 B B PAXI_HUMAN Paxillin DLDALLADLE 10 T 0.94 DUF2525 pdbhh F Eukaryota F 2k3u 2 B B C5aR(P7-28S) peptide XTTPDYGHYDDKDTLDLNTPVDKX 24 T 0.16 EAGR_box pdbhh F T 2k6a 1 A A RODL_NEUCR RODLET PROTEIN, CLOCK-CONTROLLED GENE PROTEIN 2, BLUE LIGHT-INDUCED PROTEIN 7 SATTIGPNTCSIDDYKPYCCQSMSGSASLGCVVGVIGSQCGASVKCCKDDVTNTGNSFLIINAANCVA 68 T 0.083 Hydrophobin unphh F Eukaryota T 2k6q 2 B B SQSTM_RAT UBIQUITIN-BINDING PROTEIN P62, PROTEIN KINASE C-ZETA-INTERACTING PROTEIN, PKC-ZETA-INTERACTING PROTEIN MSGGDDDWTHLSSKEVD 17 T 12 EpuA pdbhh F Eukaryota T 2k6r 1 A A Full Sequence Design 1 Synthetic Superstable GQQYTAXIKGRTFRNEKELRDFIEKFXGR 29 T 0.13 SpoVIF pdb F T 2k7l 1 A B CTDP1_HUMAN centFCP1-T584PO4 peptide EDTDEDDHLIYLEEILVRV 19 T 2.5 Es2 pdbhh F Eukaryota T 2k84 1 A A GAG_EIAVY P9 LYPDLSEIKKEYNVKEKDQVEDLNLDSLWE 30 T 8.3 LSPR pdbhh T Viruses T 2k8j 1 A X POLG_HCVJA p7tm2 RLVPGAAYALYGVWPLLLLLLALPPRAYA 29 T 9.1 DUF2244 pdbhh T Viruses T 2k8q 1 A A SHQ1_YEAST SMALL NUCLEOLAR RNAS OF THE BOX H/ACA FAMILY QUANTITATIVE ACCUMULATION PROTEIN 1 GITPRFSITQDEEFIFLKIFISNIRFSAVGLEIIIQENMIIFHLSPYYLRLRFPHELIDDERSTAQYDSKDECINVKVAKLNKNEYFEDLDLPTKLLARQGDLAGADALTENTDAKKTQKPLIQEVETDGVSNN 134 T 1.7E-05 PIH1_CS pdbhh F Eukaryota T 2k9b 1 A A DDSK_PHYDS DD K GLWSKIKAAGKEAAKAAAKAAGKAALNAVSEAVX 34 T 0.00011 DD_K unp F Eukaryota T 2k9e 1 A A K1A_STIHL KAPPA-SHTX-SHE3A,POTASSIUM CHANNEL TOXIN SHK XXRSCIDTIPKSRCTAFQCKHSXKYRLSFCRKTCGTCX 38 T 0.0045 ShK unp F Eukaryota T 2k9u 2 B B FBLI1_HUMAN FBLP-1, MITOGEN-INDUCIBLE 2-INTERACTING PROTEIN, MIG2-INTERACTING PROTEIN, MIGFILIN MASKPEKRVASSVFITLAPPRRDV 24 T 11 Pox_A3L pdbhh F Eukaryota T 2ka9 2 B,C B,C cypin peptide QVVPFSSSV 9 T 10 Toxin_8 pdbhh F F 2kaa 1 A A P78696_HIRTH HTA APIVTCRPKLDGREKPFKVDVATAQAQARKAGLTTGKSGDPHRYFAGDHIRWGVNNCDKADAILWEYPIYWVGKNAEWAKDVKTSQQKGGPTPIRVVYANSRGAVQYCGVMTHSKVDKNNQGKEFFEKCD 130 T 0.02 GLEYA pdb F Eukaryota T 2kb9 1 A A JAG1_HUMAN JAGGED1, HJ1 RCQYGWQGLYCDKCIPHPGCVHGICNEPWQCLCETNWGGQLCDK 44 T 0.0036 hEGF pdbhh F Eukaryota T 2kbb 1 A A TLN1_MOUSE Talin-1 GIDPFTAPGQLECETAIAALNSCLRDLDQASLAAVSQQLAPREGISQEALHTQMLTAVQEISHLIEPLASAARAEASQLGHKVSQMAQYFEPLTLAAVGAASKTLSHPQQMALLDQTKTLAESALQLLYTAKEAGGNPKQAAHTQEALEEAVQMMTEAVEDLTTTLNEAASAAG 174 T 0.00049 I_LWEQ unppssm F Eukaryota T 2kbq 1 A A USH1C_HUMAN USHER SYNDROME TYPE-1C PROTEIN, AUTOIMMUNE ENTEROPATHY-RELATED ANTIGEN AIE-75, ANTIGEN NY-CO-38/NY-CO-37, PDZ-73 PROTEIN, RENAL CARCINOMA ANTIGEN NY-REN-3 MDRKVAREFRHKVDFLIENDAEKDYLYDVLRMYHQTMDVAVLVGDLKLVINEPSRLPLFDAIRPLIPLKHQVEYDQLTPR 80 T 0.0072 DUF3567 pdb F Eukaryota T 2kbr 1 A A USH1C_HUMAN USHER SYNDROME TYPE-1C PROTEIN, AUTOIMMUNE ENTEROPATHY-RELATED ANTIGEN AIE-75, ANTIGEN NY-CO-38/NY-CO-37, PDZ-73 PROTEIN, RENAL CARCINOMA ANTIGEN NY-REN-3 MDRKVAREFRHKVDFLIENDAEKDYLYDVLRMYHQTMDVAVLVGDLKLVINEPSRLPLFDAIRPLIPLKHQVEYDQLTPR 80 T 0.0072 DUF3567 pdb F Eukaryota T 2kbr 2 B B CAD23_HUMAN OTOCADHERIN DDDRYLREAIQEYDNIAK 18 T 27 DUF2686 pdbhh F Eukaryota T 2kbs 2 B B CAD23_HUMAN OTOCADHERIN TPLEITEL 8 T 1.4 DUF5908 pdbhh F Eukaryota F 2kc6 1 A A MEN1_EUPNO Mating pheromone En-1 NPEDWFTPDTCAYGDSNTAWTTCTTPGQTCYTCCSSCFDVVGEQACQMSAQC 52 T 41 eIF3g pdbhh F Eukaryota T 2kdq 1 A A L-22 CYCLIC PEPTIDE RVRTRKGRRIRIXP 14 T 0.24 DUF2835 pdbhh F T 2kdr 1 A X POLG_HCVH NS4B, P27 SDAAARVTAILSSLTVTQLLRRLHQWIS 28 T 14 SbcD_C pdbhh T Viruses T 2kdu 2 B B UN13A_RAT MUNC13-1 GSRAKANWLRAFNKVRMQLQEARGEGEMSKSLWFKG 36 T 18 MgrB pdbhh F Eukaryota T 2keg 1 A A P71460_LACPN BACTERIOCIN PLNK RRSRKNGIGYAIGYAFGAVERAVLGGSRDYNK 32 T 0.003 Bacteriocin_IIc unppssm F Bacteria T 2keh 1 A A P71460_LACPN BACTERIOCIN PLNK RRSRKNGIGYAIGYAFGAVERAVLGGSRDYNK 32 T 0.003 Bacteriocin_IIc unppssm F Bacteria T 2keq 1 A A B2J066_NOSP7;B2J821_NOSP7 DNA polymerase III alpha subunit, Nucleic acid binding OB-fold tRNA/helicase-type GGALSYETEILTVEYGLLPIGKIVEKRIECTVYSVDNNGNIYTQPVAQWHDRGEQEVFEYCLEDGSLIRATKDHKFMTVDGQMLPIDEIFERELDLMRVDNLPNIKIATRKYLGKQNVYDIGVERDHNFALKNGFIASN 139 T 4.8E-07 Intein_splicing pdbhh F Bacteria T 2ket 1 A A CTHL6_BOVIN ANTIBACTERIAL PEPTIDE BMAP-27, MYELOID ANTIBACTERIAL PEPTIDE 27 GRFKRFRKKFKKLFKKLSPVIPLLHLX 27 T 0.21 Stomoxyn pdb F Eukaryota T 2kfe 1 A A meucin-24 GRGREFMSNLKEKLSGVKEKMKNS 24 T 1.4 DUF5398 pdbhh F T 2kff 2 B B Rab11-FIP2 NPF peptide FNYESTNPFTAK FNYESTNPFTAK 12 T 6.1 DUF3729 pdbhh F T 2kfg 2 B B Rab11-FIP2 DPF peptide FNYESTDPFTAK FNYESTDPFTAK 12 T 4.8 SsgA pdbhh F T 2kfh 2 B B Rab11-FIP2 GPF peptide FNYESTGPFTAK FNYESTGPFTAK 12 T 9.9 DUF5973 pdbhh F T 2kft 2 B B Histone H3 ARTKQTARKSTGGKAPRKQLC 21 T 0.44 Histone pdbhh F T 2kgn 1 A A STE5_YEAST Protein STE5 PLSRGKKWTEKLARFQRSSAKKKR 24 T 41 DUF3579 pdbhh F Eukaryota T 2kgx 1 A A TLN1_MOUSE Talin-1 GIDPFTAPGQLECETAIAALNSCLRDLDQASLAAVSQQLAPREGISQEALHTQMLTAVQEISHLIEPLASAARAEASQLGHKVSQMAQYFEPLTLAAVGAASKTLSHPQQMALLDQTKTLAESALQLLYTAKEAGGNPKQAAHTQEALEEAVQMMTEAVEDLTTTLNEAASAAG 174 T 0.00049 I_LWEQ unppssm F Eukaryota T 2khf 1 A A P71461_LACPN BACTERIOCIN PLNJ, BACTERIOCIN PEPTIDE PLNJ GAWKNFWSSLRKGFYDGEAGRAIRR 25 T 0.004 ComC unphh F Bacteria T 2khg 1 A A P71461_LACPN BACTERIOCIN PLNJ, BACTERIOCIN PEPTIDE PLNJ (PUTATIVE) GAWKNFWSSLRKGFYDGEAGRAIRR 25 T 0.004 ComC unphh F Bacteria T 2khh 2 B B FxFG DSGFSFGSK 9 T 5.6 Peptidase_S9 pdbhh F F 2khv 1 A A Q2YAJ6_NITMU Phage integrase MTFSECAALYIKAHRSSWKNTKHADQWTNTIKTYCGPVIGPLSVQDVDTKLIMKVLDPIWEQKPETASRLRGRIESVLDWATVRGYREGDNPARWRGYLEHHHHHH 106 T 0.24 Toxin_5 pdbpssm F Bacteria T 2ki0 1 A A DS119 GSGQVRTIWVGGTPEELKKLKEEAKKANIRVTFWGD 36 T 0.029 Alpha-amylase pdbhh F T 2kib 1 A,B,C,D,E,F,G,H A,B,C,D,E,F,G,H NFGAIL segment from human islet amyloid polypeptide NFGAILS 7 T 3.9 SidC_N pdbhh F T 2kid 2 B C (PHQ)LPA(B27) peptide XLPAX 5 T 540 CFAP91 pdbhh F F 2kik 1 A,B A,B Artificial diiron protein XDYLRELLKGELQGIKQYREALEYTHNPVLAKILEDEEKHIEWLETILGX 50 T 0.00069 COQ7 pdbhh F T 2kj9 1 A A Q6D355_PECAS Integrase KSVQEKRNNTRAFKTVAKSWFATKTTWSEDYQRSVWTRLETYLFPDIGNKDIAELDTGDLLVPIKKIEKLGYLEIAMRVKQYATAIMRYAVQQKMIRFNPAYDLEGAVQKLEHHHHHH 118 T 0.015 Phage_int_SAM_3 pdbpercent F Bacteria T 2kjn 1 A A lah4 KKALLALALHHLAHLALHLALALKKA 26 T 10 DUF5664 pdbhh F F 2kjo 1 A A lah4 KKALLALALHHLAHLALHLALALKKA 26 T 10 DUF5664 pdbhh F F 2kjy 1 A A MYPT1_HUMAN MYOSIN PHOSPHATASE-TARGETING SUBUNIT 1, MYOSIN PHOSPHATASE TARGET SUBUNIT 1, PROTEIN PHOSPHATASE MYOSIN-BINDING SUBUNIT GPMSTTEVRERRRSYLTPVRDEESESQRKARSRQARQSRRSTQGVTLTDLQEAEKTIGRS 60 T 0.00014 DUF4695 pdb F Eukaryota T 2kk2 1 A A C4NXD5_EUPNO En-A1 YNPEDDYTPLTCPHTISVVWYECTENTANCGTACCDSCFELTGNTMCLLQAGAAGSGCDME 61 T 0.058 AWS pdb F Eukaryota T 2kke 1 A,B A,B O26567_METTH Uncharacterized protein MVGRRPGGGLKDTKPVVVRLYPDEIEALKSRVPANTSMSAYIRRIILNHLEDE 53 T 0.00052 DUF6290 pdb F Archaea T 2kl8 1 A A OR15 MEMDIRFRGDDLEAFEKALKEMIRQARKFAGTVTYTLDGNDLEIRITGVPEQVRKELAKEAERLAKEFNITVTYTIRLEHHHHHH 85 T 0.0033 DUF2067 pdb F T 2km9 1 A A omega_conotoxin-FVIA CKGTGKSCSRIAYNCCTGSCRSGKC 25 T 0.00087 Conotoxin pdbhh F T 2kmj 2 B,C B,C Pyrimidinylpeptide XXXX 4 F F F 2kna 1 A A XIAP_HUMAN E3 UBIQUITIN-PROTEIN LIGASE XIAP, INHIBITOR OF APOPTOSIS PROTEIN 3, X-LINKED INHIBITOR OF APOPTOSIS PROTEIN, X-LINKED IAP, IAP-LIKE PROTEIN, HILP GSAMADIGSEFEKTPSLTRRIDDTIFQNPMVQEAIRMGFSFKDIKKIMEEKIQISGSNYKSLEVLVADLVNAQKDSMQDESSQTSLQKEISTEEQLRRLQEEKL 104 T 0.022 Baculo_RING unphh F Eukaryota T 2knh 2 B B HTF4_HUMAN TRANSCRIPTION FACTOR HTF-4, E-BOX-BINDING PROTEIN, DNA-BINDING PROTEIN HTF4 IGTDKELSDLLDFSAMFS 18 T 9.6 HSV_VP16_C pdbhh F Eukaryota T 2knj 1 A A MPSIN_RHIMP Microplusin preprotein HHQELCTKGDDALVTELECIRLRISPETNAAFDNAVQQLNCLNRACAYRKMCATNNLEQAMSVYFTNEQIKEIHDAATACDPEAHHEHDH 90 T 0.037 zf-C2H2_aberr pdbpssm F Eukaryota T 2knp 1 A A D0VWX1_MOMCO MCoCC-1 GCEGKQCGLFRSCGGGCRCWPTVTPGVGICSSS 33 T 0.00057 Albumin_I pdbhh F Eukaryota T 2kon 1 A A Q7NW74_CHRVO Uncharacterized protein MNVAHYRGYEIEPGHQYRDDIRKYVPYALIRKVGVPDRTPIPTTYPEFYDLEADAERVSIACAKIIIDSHLDRHDQGLADLG 82 T 0.12 Sel_put pdbpssm F Bacteria T 2koz 1 A A nasonin-1 ACNDRDCSLDCIMKGYNTGSCVRGSCQCRRTSG 33 T 0.00035 Toxin_2 pdbpercent F T 2kp0 1 A A nasonin-1M ACNDRDCSLDCIMKGYNFGKCVRGSCQCRRTSG 33 T 0.00047 Toxin_2 pdbpercent F T 2kpa 1 A A ARNO(375-400) VSVDPFYEMLAARKKRISVKKKQEQP 26 T 0.87 KIF1B pdbhh F T 2kpb 1 A A ARNO-p(375-400) VSVDPFYEMLAARKKRISVKKKQEQP 26 T 0.87 KIF1B pdbhh F T 2kpl 2 B B VE6_HPV16 E6CT RSSRTRRETQV 11 T 0.34 FpoO unphh T Viruses T 2kpz 2 B B PRO_HTL1L PR76GAG-PRO, MATRIX PROTEIN P19, MA, SDPQIPPPYVEP 12 T 3.9 RAM pdbhh T Viruses T 2kq0 2 B B VP40_EBOZM MEMBRANE-ASSOCIATED PROTEIN VP40 ILPTAPPEYMEA 12 T 0.96 STAT1_TAZ2bind pdbhh T Viruses T 2kq6 1 A A PKD2_HUMAN POLYCYSTIC KIDNEY DISEASE 2 PROTEIN, AUTOSOMAL DOMINANT POLYCYSTIC KIDNEY DISEASE TYPE II PROTEIN, POLYCYSTWIN, R48321 NTVDDISESLRQGGGKLNFDELRQDLKGKGHTDAEIEAIFTKYDQDGDQELTEHEHQQMRDDLEKEREDLDLDHSSLP 78 T 0.00016 EF-hand_8 pdbpercent F Eukaryota T 2kqf 2 B B Q8JJY9_9RHAB C-terminal motif from Glycoprotein SWESHKSGGETRL 13 T 3.5 DUF5052 unphh T Viruses T 2kql 1 A A D-MAUROCALCINE GXXXXXXXXXXXXXXXXXXXXXXXGXXXXXXXX 33 F F F 2kqr 1 A A SYNC_BRUMA ASPARAGINE--TRNA LIGASE, ASNRS, POTENTIALLY PROTECTIVE 63 KDA ANTIGEN GSMTVYICPETGDDGNDGSELKPLRTLYQAMIITKSSKGDFLIRTKKDGKQVWEAASKTALKKSWKRYEQEMLKNEKVAAKMLEKDATEVGVKAALEEAKKVQIELDTSLSYI 113 T 0.00065 DUF1565 pdbpssm F Eukaryota T 2kqs 2 B B DAXX_HUMAN DAXX, HDAXX, FAS DEATH DOMAIN-ASSOCIATED PROTEIN, ETS1-ASSOCIATED PROTEIN 1, EAP1 GSKTSVATQCDPEEIIVLSDSD 22 T 14 TMEM169 pdbhh F Eukaryota T 2ksp 2 B B MILK1_HUMAN MOLECULE INTERACTING WITH RAB13, MIRAB13 LESKPYNPFEEEEED 15 T 0.0047 NPF pdbhh F Eukaryota T 2ksw 1 A A O96050_ORYRH Oryctin VPVGSDCEPKLCTMDLVPHCFLNPEKGIVVVHGGCALSKYKCQNPNHEKLGYTHECEEAIKNAPRP 66 T 1.2 DUF5437 unphh F Eukaryota T 2kub 1 A A FAP1_STRPA Fimbriae-associated protein Fap1 ENLDKMISEAEVLNDMAARKLITLDAEQQLELMKSLVATQSQLEATKNLIGDPNATVADLQIAYTTLGNNTQALGNELIKL 81 T 0.0082 FIVAR pdbpssm F Bacteria T 2kup 2 B B ALK_HUMAN HALK, ANAPLASTIC LYMPHOMA KINASE LFRLRHFPCGNVNYGYQQQ 19 T 0.4 Ntox44 pdbhh F Eukaryota T 2kv6 3 C C KWKK Tetrapeptide KWKK 4 T 30 Post_transc_reg pdbhh F F 2kvm 2 B B histone H3 peptide (residues 15-30) with dimethylated lysine 27 APRKQLATKAARKSAP 16 T 16 Rsc14 pdbhh F T 2kwf 2 B B ITF2_HUMAN TCF-4, IMMUNOGLOBULIN TRANSCRIPTION FACTOR 2, ITF-2, SL3-3 ENHANCER FACTOR 2, SEF-2, CLASS B BASIC HELIX-LOOP-HELIX PROTEIN 19, BHLHB19 GSGTDKELSDLLDFSAMFS 19 T 6.3 HSV_VP16_C pdbhh F Eukaryota T 2kwh 1 A A RBP1_HUMAN RALBP1, RAL-INTERACTING PROTEIN 1, 76 KDA RAL-INTERACTING PROTEIN, DINITROPHENYL S-GLUTATHIONE ATPASE, DNP-SG ATPASE GSETQAGIKEEIRRQEFLLNSLHRDLQGGIKDLSKEERLWEVQRILTALKRKLREA 56 T 0.0087 SAB pdbpssm F Eukaryota T 2kwi 2 B B RBP1_HUMAN RALBP1, RAL-INTERACTING PROTEIN 1, 76 KDA RAL-INTERACTING PROTEIN, DINITROPHENYL S-GLUTATHIONE ATPASE, DNP-SG ATPASE GSETQAGIKEEIRRQEFLLNSLHRDLQGGIKDLSKEERLWEVQRILTALKRKLREA 56 T 0.0087 SAB pdbpssm F Eukaryota T 2kwn 1 A B H4_HUMAN Histone peptide GLGKGGAXRHRKVLR 15 T 0.27 UPF0137 unp F Eukaryota T 2kwo 1 A B H4_HUMAN Histone peptide XGRGKGGKGLGKGGAKRHRK 20 T 11 Shadoo unppercent F Eukaryota T 2kwu 1 A A POLI_MOUSE RAD30 HOMOLOG B GSPEFDSAEEKLPFPPDIDPQVFYELPEEVQKELMAEWERAGAARPSAHR 50 T 0.0011 UBM pdbhh F Eukaryota T 2kwv 1 A A POLI_MOUSE RAD30 HOMOLOG B GSDTSDLPLQALPEGVDQEVFKQLPADIQEEILSGKSRENLKGKGSLS 48 T 0.00043 UBM pdb F Eukaryota T 2kx5 2 B B Cyclic peptide mimetic of Tat protein RVRCRQRKGRRICIRIXP 18 T 0.79 DUF3877 pdbhh F T 2kxe 1 A A DP2S_PYRHO DP1 SUBUNIT, POL II GSHMDEFVKGLMKNGYLITPSAYYLLVGHFNEGKFSLIELIKFAKSRETFIIDDEIANEFLKSIGAEVELPQEIK 75 T 0.031 Leu_Phe_trans pdb F Archaea T 2kxh 2 B B FUBP1_HUMAN FUSE-BINDING PROTEIN 1, FBP, DNA HELICASE V, HDH V GAMGYVNDAFKDALQRARQIAAKIGGDAGTS 31 T 23 DUF4312 pdbhh F Eukaryota T 2kxq 2 B B SMAD7_HUMAN Smad7 PY motif containing peptide GPLGSELESPPPPYSRYPMD 20 T 0.051 WBP-1 pdbhh F Eukaryota T 2ky5 1 A A PECA1_HUMAN PECAM-1, ENDOCAM, GPIIA', PECA1 GSSDVQYTEVQVSSAESHKDLGKKDTETVYSEVRKAVPDAVESRYSRTEGSLDGT 55 T 0.11 Shisa unp F Eukaryota T 2kyg 2 C C MTG8_HUMAN PROTEIN MTG8, PROTEIN ETO, EIGHT TWENTY ONE PROTEIN, CYCLIN-D-RELATED PROTEIN, ZINC FINGER MYND DOMAIN-CONTAINING PROTEIN 2 AMADIGSASGYVPEEIWKKAEEAVNEVKRQAMTELQKA 38 T 0.038 DUF3731 unp F Eukaryota T 2kyj 1 A A TXS2B_LIOWA LITX DFPLSKEYESCVRPRKCKPPLKCNKAQICVDPNKGW 36 T 0.6 IL8 pdbhh F Eukaryota T 2kyl 2 B B PTEN_HUMAN C-TERMINUS OF PTEN PFDEDQHTQITKV 13 T 2.1 Surface_antigen pdbhh F Eukaryota T 2kym 2 B B STE20_YEAST Peptide form Serine/threonine-protein kinase STE20 GKFIPSRPAPKPPSSA 16 T 0.00039 TFIIA unppssm F Eukaryota T 2kzu 2 B B RASF1_HUMAN RAS ASSOCIATION (RALGDS/AF-6) DOMAIN FAMILY 1, ISOFORM CRA_A GSQEDSDSELEQYFTARW 18 T 0.76 HSV_VP16_C pdbhh F Eukaryota T 2l07 1 A A BRAZZEIN DCKRKVYPNGSISDYCEY 18 T 1.1 EBA-175_VI pdbhh F T 2l0l 1 A A Oxidoreductase that catalyzes reoxidation of DsbA protein disulfide isomerase I KKLSIYERVALFGVLGAALIGAIAPKK 27 T 1.7E-05 DsbB pdbhh F T 2l0m 1 A A Oxidoreductase that catalyzes reoxidation of DsbA protein disulfide isomerase I KKLSIYERVALFGVLGAALIGAIAPKK 27 T 1.7E-05 DsbB pdbhh F T 2l0n 1 A A Oxidoreductase that catalyzes reoxidation of DsbA protein disulfide isomerase I KKRYVAMVIWLYSAFRGVQLTYEHTMLQKK 30 T 0.012 DsbB pdbpssm F T 2l0o 1 A A Oxidoreductase that catalyzes reoxidation of DsbA protein disulfide isomerase I KKRYVAMVIWLYSAFRGVQLTYEHTMLQKK 30 T 0.012 DsbB pdbpssm F T 2l2w 1 A A THCL_STRAJ ALANINAMIDE, BRYAMYCIN, GARGON, THIACTIN XIAXASXTXXXXTXXXXXX 19 T 1.5 DUF4803 pdbhh F Bacteria F 2l2x 1 A A THCL_STRAJ ALANINAMIDE, BRYAMYCIN, GARGON, THIACTIN XIAXASXTXXXXTXXXXX 18 T 1.3 DUF4803 pdbhh F Bacteria F 2l2y 1 A A THCL_STRAJ ALANINAMIDE, BRYAMYCIN, GARGON, THIACTIN XIAXASXTXCXXTXXXXX 18 T 1.3 DUF4803 pdbhh F Bacteria T 2l2z 1 A A THCL_STRAJ ALANINAMIDE, BRYAMYCIN, GARGON, THIACTIN XIAXASXTXXXXTXXXXXX 19 T 1.5 DUF4803 pdbhh F Bacteria F 2l3i 1 A A TOP4A_OXYTA AOXKI4A, antimicrobial peptide in spider venom GIRCPKSWKCKAFKQRVLKRLLAMLRQHAF 30 T 3.2 DUF2615 pdbhh F Eukaryota T 2l3n 1 A A RAP1_SCHPO;TAZ1_SCHPO DNA-binding protein rap1,Telomere length regulator taz1 SVSILRSSVNHREVDEAIDNILRYTNSTEQQFLEAMESTGGRVRIAIAKLLSKQTSGGSGGSKLGGSGGSRKDLSVKGMLYDSDSQQILNRLRERVSGSTAQSA 104 T 0.34 HYPK_UBA pdbhh F Eukaryota T 2l4k 2 B B ERBB2_HUMAN P185ERBB2, TYROSINE KINASE-TYPE CELL SURFACE RECEPTOR HER2, METASTATIC LYMPH NODE GENE 19 PROTEIN, MLN 19 PQPEXVNQPD 10 T 0.38 CYSTM pdbhh F Eukaryota T 2l4t 2 B B Glutaminase L peptide KENLESMV 8 T 22 DUF1128 pdbhh F T 2l4u 1 A A STE5_YEAST 24mer peptide from Protein STE5 PLSRGKKWTEKLARFQRSSAKKKR 24 T 41 DUF3579 pdbhh F Eukaryota T 2l56 1 A A General control protein GCN4 XNYHLENEVARLKKLVGX 18 T 0.039 VGPC1_C pdbhh F T 2l5e 2 B B GATA1_MOUSE GATA-1 KASGXGKXKRGSN 13 T 29 DUF1087 pdbhh F Eukaryota T 2l5r 1 A A H0USY4_ALYOB Antimicrobial peptide Alyteserin-1C GLKEIFKAGLGSLVKGIAAHVAS 23 T 0.094 Bombinin pdbhh F Eukaryota T 2l6e 2 B B NYAD-13 stapled peptide inhibitor ITFXDLLXYYGKKK 14 T 6.5 YaaC pdbhh F T 2l6j 2 B B HS90B_HUMAN C-terminus Hsp90 chaperone peptide MEEVD MEEVD 5 T 120 NUSAP pdbhh F Eukaryota F 2l6s 1 A A VIR-576 LEAIPCSIPPEFLFGKPFVF 20 T 3 DUF5759 pdbhh F T 2l6t 1 A A VIR-576 LEAIPCSIPPEFLFGKPFVF 20 T 3 DUF5759 pdbhh F T 2l7l 2 B B KCC1A_RAT CAM KINASE I, CAM-KI, CAM KINASE I ALPHA, CAMKI-ALPHA AKSKWKQAFNATAVVRHMRKLQ 22 T 0.12 Tyrosinase unp F Eukaryota T 2l7t 1 A A USH1G_HUMAN 11-MER PEPTIDE FROM USHER SYNDROME TYPE-1G PROTEIN, SANS EELPWDELDLG 11 T 0.61 DUF4099 pdbhh F Eukaryota T 2l87 1 A A CCR5_HUMAN CCR5, C-C CKR-5, CC-CKR-5, CCR5, CHEMR13, HIV-1 FUSION CORECEPTOR MDYQVSSPIYDINYYTSEPAQKINVKQ 27 T 6 Polysacc_syn_2C pdbhh F Eukaryota T 2l8j 2 B B NBR1_HUMAN NBR1-LIR peptide GAMGSASSEDYIIILPES 18 T 0.2 CENP-B_dimeris unppercent F Eukaryota T 2l8x 1 A,B A,B ANN2_AREMA Arenicin-2 RWCVYAYVRIRGVLVRYRRCW 21 T 2.4 Toxin_25 pdbhh F Eukaryota T 2l96 1 A X LAK160-P7 KKLKLAPAKLALLWKALALKLKKA 24 T 19 Mastoparan pdbhh F F 2l99 1 A X LAK160-P10 KKLKLALAKPALLWKALALKLKKA 24 T 1.7 Microvir_lysis pdbhh F F 2l9a 1 A X LAK160-P12 KKLKLALAKLAPLWKALALKLKKA 24 T 19 Thioredoxin_6 pdbhh F F 2l9x 1 A A Uncharacterized protein GNAACVIGCIGSCVISEGIGSLVGTAFXLG 30 T 0.95 Bacteriocin_IIc unphh F T 2la0 1 A A Uncharacterized protein GWVACVGACGTVCLASGGVGTEFAAASXFL 30 T 0.4 Herpes_US9 pdbhh F T 2laj 2 B B SMAD3_HUMAN MAD HOMOLOG 3, MAD3, MOTHERS AGAINST DPP HOMOLOG 3, HMAD-3, JV15-2, SMAD FAMILY MEMBER 3, SMAD 3, SMAD3, HSMAD3 AGSPNLSPNP 10 T 2.5 DUF1930 pdbhh F Eukaryota T 2law 2 B B SMAD1_HUMAN MAD HOMOLOG 1, MOTHERS AGAINST DPP HOMOLOG 1, JV4-1, MAD-RELATED PROTEIN 1, SMAD FAMILY MEMBER 1, SMAD 1, SMAD1, HSMAD1, TRANSFORMING GROWTH FACTOR-BETA-SIGNALING PROTEIN 1, BSP-1 TPPPAYLPPEDP 12 T 0.72 Myc_target_1 pdbhh F Eukaryota T 2lax 2 B B SMAD1_HUMAN MAD HOMOLOG 1, MOTHERS AGAINST DPP HOMOLOG 1, JV4-1, MAD-RELATED PROTEIN 1, SMAD FAMILY MEMBER 1, SMAD 1, SMAD1, HSMAD1, TRANSFORMING GROWTH FACTOR-BETA-SIGNALING PROTEIN 1, BSP-1 STYPHSPTS 9 T 58 DUF5372 pdbhh F Eukaryota F 2lay 2 B B SMAD1_HUMAN MAD HOMOLOG 1, MOTHERS AGAINST DPP HOMOLOG 1, JV4-1, MAD-RELATED PROTEIN 1, SMAD FAMILY MEMBER 1, SMAD 1, SMAD1, HSMAD1, TRANSFORMING GROWTH FACTOR-BETA-SIGNALING PROTEIN 1, BSP-1 STYPHSPTS 9 T 58 DUF5372 pdbhh F Eukaryota F 2laz 2 B B SMAD1_HUMAN MAD HOMOLOG 1, MOTHERS AGAINST DPP HOMOLOG 1, JV4-1, MAD-RELATED PROTEIN 1, SMAD FAMILY MEMBER 1, SMAD 1, SMAD1, HSMAD1, TRANSFORMING GROWTH FACTOR-BETA-SIGNALING PROTEIN 1, BSP-1 SDPGSPFQ 8 T 4.4 DUF5667 pdbhh F Eukaryota T 2lb0 2 B B SMAD1_HUMAN MAD HOMOLOG 1, MOTHERS AGAINST DPP HOMOLOG 1, JV4-1, MAD-RELATED PROTEIN 1, SMAD FAMILY MEMBER 1, SMAD 1, SMAD1, HSMAD1, TRANSFORMING GROWTH FACTOR-BETA-SIGNALING PROTEIN 1, BSP-1 TSSDPGSPFQ 10 T 2 DUF3297 pdbhh F Eukaryota T 2lb1 2 B B SMAD1_HUMAN MAD HOMOLOG 1, MOTHERS AGAINST DPP HOMOLOG 1, JV4-1, MAD-RELATED PROTEIN 1, SMAD FAMILY MEMBER 1, SMAD 1, SMAD1, HSMAD1, TRANSFORMING GROWTH FACTOR-BETA-SIGNALING PROTEIN 1, BSP-1 ADTPPPAYLPPEDPX 15 T 2.5 Myc_target_1 pdbhh F Eukaryota T 2lb2 2 B B SMAD3_HUMAN MAD HOMOLOG 3, MAD3, MOTHERS AGAINST DPP HOMOLOG 3, HMAD-3, JV15-2, SMAD FAMILY MEMBER 3, SMAD 3, SMAD3, HSMAD3 ETPPPGYLSEDG 12 T 0.85 Gsf2 pdbhh F Eukaryota T 2lb3 2 B B SMAD2_HUMAN MAD HOMOLOG 2, MOTHERS AGAINST DPP HOMOLOG 2, JV18-1, MAD-RELATED PROTEIN 2, HMAD-2, SMAD FAMILY MEMBER 2, SMAD 2, SMAD2, HSMAD2 IPETPPPG 8 T 10 DUF4677 pdbhh F Eukaryota F 2lcn 1 A A WALP19-P10 peptide XGWWLALALAPALALALWWAX 21 T 1.4 DUF4381 pdbhh F T 2lco 1 A A WALP19-P8 peptide XGWWLALAPALALALALWWAX 21 T 1.3 DUF4381 pdbhh F T 2lct 2 B B KSYK_MOUSE SPLEEN TYROSINE KINASE DTEVXESPXADPE 13 T 23 Holin_2-3 pdbhh F Eukaryota T 2lcu 1 A A H3JQU2_BABCA Bc28.1 SSGIEGCTEDEKRDSVVEGATSVEASLKEQIDWLAERYSADLTNKDTSKWNTDEKVKELLNEKAVGIESRLLAIAKEFHKLKSVLCTGVNETPAHVANRVSPGDAISMLYVLSITHRELSSLKNKIDEWKKVKASEDGTKVIQNIKDDRTNTWFVAHGFKVAELNDVTLEKLATVVNELVSHKDMIYINDAMKQNVDKWTKEESERLAMMAEQGISGAKGKKD 223 T 0.71 ERp29 pdb F Eukaryota T 2ld0 1 A A HD_HUMAN HUNTINGTON DISEASE PROTEIN, HD PROTEIN MATLEKLMKAFESLKSFX 18 T 2 Mito_fiss_reg unphh F Eukaryota T 2ld2 1 A A HD_HUMAN HUNTINGTON DISEASE PROTEIN, HD PROTEIN MATLEKLMKAFESLKSFX 18 T 2 Mito_fiss_reg unphh F Eukaryota T 2ld3 1 A A MYO6_MOUSE Myosin VI QGPGSLVKVGTLKKRLDKFNEVVSALKDGKPEVNRQIKNLEISIDALMAKIKSTMMTREQIQKEYDALVKSSEDLLSALQKKKQQEEE 88 T 0.00048 XhlA pdb F Eukaryota T 2ldj 1 A A Trp-Cage mini-protein NLYIQWLKDXGPSSGRPPPS 20 T 0.12 NDUF_B6 pdbhh F T 2lds 1 A A LAIT1_LIOAU Insecticidal toxin LaIT1 DFPLSKEYETCVRPRKCQPPLKCNKAQICVDPKKGW 36 T 0.63 IL8 pdbhh F Eukaryota T 2le2 1 A,B A,B P56_BPPH2 P56 MVQNDFVDSYDVTMLLQDDDGKQYYEYHKGLSLSDFEVLYGNTADEIIKLRLDKVL 56 T 0.55 GhoS unphh T Viruses T 2ler 1 A A CUGA_CONPB Conotoxin pc16a SCSCKRNFLCC 11 T 1.4 Argos pdbhh F Eukaryota T 2lfk 1 A A Q1EG59_RHIAP Tryptase inhibitor GDKEECTVPIGWSEPVKGLCKARFTRYYCMGNCCKVYEGCYTGGYSRMGECARNCPG 57 T 0.03 DUF3788 unppssm F Eukaryota T 2lfl 1 A A Q1EG59_RHIAP Tryptase inhibitor GDKEECTVPIGWSEPVKGLCKARFTRYYCMGNCCKVYEGCYTGGYSRMGECARNCPG 57 T 0.03 DUF3788 unppssm F Eukaryota T 2lfn 1 A A RODL_NEUCR BLUE LIGHT-INDUCED PROTEIN 7, CLOCK-CONTROLLED GENE PROTEIN 2, RODLET PROTEIN SATTIGPNTCSIDDYKPYCCQSMSGSASLGCVVGVIGSQCGASVKCCKDDVTNTGNSGLIINAANCVA 68 T 0.083 Hydrophobin unphh F Eukaryota T 2lg7 1 A A A6L9X6_PARD8 Uncharacterized protein GDDDEPGGKGAMYEVTIEQSGDFRSFIKSVVVVANGTQLKDGATGESLASPVILSDEELAVEKVTLSTTGKAIEFAVSGGVVDGEDGVVNEPMQWVVTVYKNGKEIEKKSLVFRDGKEISTDDLNLYYN 129 T 0.00015 PLCC unp F Bacteria T 2lge 1 A A A6L9X6_PARD8 Uncharacterized protein GDDDEPGGKGAMYEVTIEQSGDFRSFIKSVVVVANGTQLKDGATGESLASPVILSDEELAVEKVTLSTTGKAIEFAVSGGVVDGEDGVVNEPMQWVVTVYKNGKEIEKKSLVFRDGKEISTDDLNLYYN 129 T 0.00015 PLCC unp F Bacteria T 2lgf 2 B B LYAM1_HUMAN CD62 ANTIGEN-LIKE FAMILY MEMBER L, LEUKOCYTE ADHESION MOLECULE 1, LAM-1, LEUKOCYTE SURFACE ANTIGEN LEU-8, LEUKOCYTE-ENDOTHELIAL CELL ADHESION MOLECULE 1, LECAM1, LYMPH NODE HOMING RECEPTOR, TQ1, GP90-MEL AFIIWLARRLKKGKK 15 T 0.38 MWFE unp F Eukaryota T 2lhr 1 A A ISDH_STAAW HAPTOGLOBIN RECEPTOR A, STAPHYLOCOCCUS AUREUS SURFACE PROTEIN I SDDYVDEETYNLQKLLAPYHKAKTLERQVYELEKLQEKLPEKYKAEYKKKLDQTRVELADQVKSAVTEFENVTPTNDQ 78 T 0.0033 Tropomyosin pdbpssm F Bacteria T 2lht 1 A A A8W3P3_VENIN Cellophane-induced protein 1 ADVFDPPTQYGYDGKPLDASFCRTAGSREKDCRKDVQACDKKYDDQGRETACAKGIREKYKPAVVYGYDGKPLDLGFCTLAGIREVDCRKDAQTCDKKYESDKCLNAIKEKYKPVVDPNPPA 122 T 0.14 Brr6_like_C_C pdbpercent F Eukaryota T 2lhv 1 A A MUC2 Mucin Domain Peptide XPTTTPLKX 9 T 110 MRP-L28 pdbhh F F 2lhw 1 A A MUC2 Mucin Domain Peptide XPTTTPLKX 9 T 110 MRP-L28 pdbhh F F 2lhx 1 A A MUC2 Mucin Domain Peptide XPTTTPLKX 9 T 110 MRP-L28 pdbhh F F 2lhy 1 A A MUC2 Mucin Domain Peptide XPTTTPLKX 9 T 110 MRP-L28 pdbhh F F 2lhz 1 A A MUC2 Mucin Domain Peptide XPTTTPLKX 9 T 110 MRP-L28 pdbhh F F 2li0 1 A A MUC2 Mucin Domain Peptide XPTTTPLKX 9 T 110 MRP-L28 pdbhh F F 2li1 1 A A MUC2 Mucin Domain Peptide XPTTTPLKX 9 T 110 MRP-L28 pdbhh F F 2li2 1 A A MUC2 Mucin Domain Peptide XPTTTPLKX 9 T 110 MRP-L28 pdbhh F F 2li3 1 A A KA20X_TITTR Potassium channel toxin kappa-KTX3.1 GSGCMPEYCAGQCRGKVSQDYCLKNCRCIR 30 T 13 Yeast_MT pdbhh F Eukaryota T 2li5 2 B B ATG7_YEAST ATG7C30, ATG12-ACTIVATING ENZYME E1 ATG7, AUTOPHAGY-RELATED PROTEIN 7, CYTOPLASM TO VACUOLE TARGETING PROTEIN 2 GPHMISGLSVIKQEVERLGNDVFEWEDDESDEIA 34 T 0.12 VMAP-M18 pdb F Eukaryota T 2lid 1 A A Vitellogenin EHKHSDESTSESFESIADNNDDSYFQRKPKLTEAP 35 T 24 KIP1 pdbhh F T 2lk9 1 A A BST2_HUMAN BST-2, HM1.24 ANTIGEN, TETHERIN KRSKLLLGIGILVLLIIVILGVPLIIFTIKKKKKK 35 T 0.00067 UPF0242 unppssm F Eukaryota T 2lkq 1 A A IGLL1_HUMAN CD179 ANTIGEN-LIKE FAMILY MEMBER B, IG LAMBDA-5, IMMUNOGLOBULIN OMEGA POLYPEPTIDE, IMMUNOGLOBULIN-RELATED PROTEIN 14.1 SRSSLRSRWGRFLLQRGSWTGPRC 24 T 26 Toxin_7 pdbhh F Eukaryota T 2lkw 1 A A Q918V6_9REOV Membrane fusion protein p15 XGQRHSIVQPPAPPPNAFVEIX 22 T 2 DUF4381 unphh T Viruses T 2ll1 1 A A TX1_SELPU U1-TRTX-Sp1a DCGHLHDPCPNDRPGHRTCCIGLQCRYGKCLVR 33 T 0.0017 Tachystatin_A pdbhh F Eukaryota T 2ll2 1 A A CXA1_HUMAN CONNEXIN-43, CX43, GAP JUNCTION 43 KDA HEART PROTEIN KGVKDRVKGKSDPYHATSGALSPAKD 26 T 0.52 7tm_1 unp F Eukaryota T 2ll5 1 A A Cyclo-TC1 GDAYAQWLADGGPSSGRPPPSG 22 T 5.1 MOSC_N pdbhh F T 2ll6 2 B B NOS2_HUMAN HEPATOCYTE NOS, HEP-NOS, INDUCIBLE NO SYNTHASE, INDUCIBLE NOS, INOS, NOS TYPE II, PEPTIDYL-CYSTEINE S-NITROSYLASE NOS2 LKVLVKAVLFACMLMRK 17 T 1.2 DUF488 unppercent F Eukaryota T 2ll7 2 B B NOS3_HUMAN CONSTITUTIVE NOS, CNOS, EC-NOS, ENDOTHELIAL NOS, ENOS, NOS TYPE III, NOSIII KKTFKEVANAVKISASL 17 T 0.028 DUF2774 pdbhh F Eukaryota T 2llo 2 B B ESR1_HUMAN ER, ER-ALPHA, ESTRADIOL RECEPTOR, NUCLEAR RECEPTOR SUBFAMILY 3 GROUP A MEMBER 1 RAANLWPSPLMIKRSKKNS 19 T 4 Tom5 pdbhh F Eukaryota T 2llp 1 A,B,C A,B,C CO1A1_HUMAN ALPHA-1 TYPE I COLLAGEN PPGPQGIAGQRGVVGLPG 18 T 0.019 Collagen pdb F Eukaryota T 2llq 2 B B ESR1_HUMAN ER, ER-ALPHA, ESTRADIOL RECEPTOR, NUCLEAR RECEPTOR SUBFAMILY 3 GROUP A MEMBER 1 RAANLWPSPLMIKRSKKNS 19 T 4 Tom5 pdbhh F Eukaryota T 2llr 1 A A Alvinellacin RGCYTRCWKVGRNGRVCMRVCT 22 T 0.00044 Toxin_25 pdbhh F T 2lm8 1 A A CDT-LPS KWFRVYRGIYRRR 13 T 2.7 DUF2161 pdbhh F T 2lma 1 A A Thp5 peptide WRPYLQTEYYDVMTVISPPEFG 22 T 9.9 Fmp27_SW pdbhh F T 2lmb 1 A A RAGE_HUMAN RECEPTOR FOR ADVANCED GLYCOSYLATION END PRODUCTS MWQRRQRRGEERKAPENQEEEEERAELNQSEEPEAGESSTGGP 43 T 0.0011 TMEM154 unphh F Eukaryota T 2lmz 1 A A CANA_CONIM Conotoxin im17a IPYCGQTGAECYSWCIKQDLSKDWCCDFVKDIRMNPPADKCP 42 T 0.0032 TSGP1 unphh F Eukaryota T 2ln3 1 A A DE NOVO DESIGNED PROTEIN OR135 MGLTRTITSQNKEELLEIALKFISQGLDLEVEFDSTDDKEIEEFERDMEDLAKKTGVQIQKQWQGNKLRIRLKGSLEHHHHHH 83 T 0.19 Cas_APE2256 pdb F T 2lnd 1 A A DE NOVO DESIGNED PROTEIN, PFK fold MGKVLLVISTDTNIISSVQERAKHNYPGRYIRTATSSQDIRDIIKSMKDNGKPLVVFVNGASQNDVNEFQNEAKKEGVSYDVLKSTDPEELTQRVREFLKTAGSLEHHHHHH 112 T 0.0034 DUF3801 pdb F T 2lnw 2 B B ARAP3_HUMAN CENTAURIN-DELTA-3, CNT-D3 EEPVXEEVG 9 T 1.9 NETI pdbhh F Eukaryota F 2lny 1 A A ShB peptide MAAVAGLYGLGEDRQHRKKQ 20 T 0.53 AcrZ pdbhh F T 2lo7 1 A A KA20_TITSE TITYUSTOXIN-16 GSGCMKEYCAGQCRGKVSQDYCLKHCKCIPR 31 T 1.1 TCR pdb F Eukaryota T 2lob 2 B B CFTR_HUMAN CFTR, ATP-BINDING CASSETTE SUB-FAMILY C MEMBER 7, CHANNEL CONDUCTANCE-CONTROLLING ATPASE, CAMP-DEPENDENT CHLORIDE CHANNEL EEVQDTRL 8 T 4.7 DUF1507 pdbhh F Eukaryota T 2lox 2 B B RAD2_YEAST DNA repair protein RAD2 GSEILERESEKESSNDENKDDDLEVLSEELFEDVPTKSQISKEAEDNDSRKY 52 T 18 DUF3161 pdbhh F Eukaryota T 2loz 2 B B RHG07_HUMAN DELETED IN LIVER CANCER 1 PROTEIN, DLC-1, HP PROTEIN, RHO-TYPE GTPASE-ACTIVATING PROTEIN 7, START DOMAIN-CONTAINING PROTEIN 12, STARD12, STAR-RELATED LIPID TRANSFER PROTEIN 12 EDHKPGTFPKALTN 14 T 4.9 HAV_VP pdbhh F Eukaryota T 2lpb 2 B B GCN4_YEAST AMINO ACID BIOSYNTHESIS REGULATORY PROTEIN STDSTPMFEYENLEDNSKEWTSLFDNDIPVTTDD 34 T 0.53 Iwr1 unppssm F Eukaryota T 2lpr 2 B P METHOXYSUCCINYL-ALA-ALA-PRO-VALINE BORONIC ACID INHIBITOR XAAPX 5 T 450 DUF3458 pdbhh F F 2lq0 1 A A D0EKL2_9BASI de novo designed antifreeze peptide 1m QRSNFHPLAASFIVRCAFEHSRRFT 25 T 3.1 DUF5677 pdbhh F Eukaryota T 2lq4 1 A p Lysophosphatidic acid receptor 1 MQALEKELAQNEWELQALEKELAQLEKELQAWNCICDIENCSNMAPLYSDQALKKKLAQLKWKLQALKKKNAQLKKKLQA 80 T 0.0065 DUF489 pdb F T 2lqc 2 B B CAC1C_HUMAN CALCIUM CHANNEL, L TYPE, ALPHA-1 POLYPEPTIDE, ISOFORM 1, CARDIAC MUSCLE, VOLTAGE-GATED CALCIUM CHANNEL SUBUNIT ALPHA CAV1.2 GTGAALSWQAAIDAARQAKLMGSA 24 T 17 DUF1911 pdbhh F Eukaryota T 2lqg 1 A A TLN1_MOUSE Talin-1 GIDPFTLVQRLEHAAKQAAASATQTIAAAQHAASAPKASAGPQPLLVQSCKAVAEQIPLLVQGVRGSQAQPDSPSAQLALIAASQSFLQPGGKMVAAAKASVPTIQDQASAMQLSQCAKNLGTALAELRTAAQKAQEA 138 T 0.0032 I_LWEQ pdbpercent F Eukaryota T 2lqx 1 A A Trypsin inhibitor BWI-2c SEKPQQELEECQNVCRMKRWSTEMVHRCEKKCEEKFERQQR 41 T 0.001 Vicilin_N pdb F T 2lr0 1 A A P-loop ntpase fold MKILILINTNNDELIKKIKKEVENQGYQVRDVNDSDELKKEMKKLAEEKNFEKILIKSNDKQLLKEMLELISKLGYKVFLLLADQDENELEEFKRKIESQGYEVRKVTDDEEALKIVREFMQKAGSLEHHHHHH 134 T 0.002 NLBH pdb F T 2lr1 2 B B VGLI_HCMVA Immediate early glycoprotein CEALKKALRRHRFLWQRRQRA 21 T 0.064 AbfB unppercent T Viruses T 2lr2 1 A A Immunoglobulin G-binding protein A MGSSHHHHHHSSGVDNKFNKEQQNAFYEILHLPNLNEEQRNAFIQSLKDDSYIDTNNDGAYEGDELSGSQSANLLAEAKKLNDAQAPK 88 T 0.0071 B pdbpssm F T 2lr7 1 A A M9MMP3_LITCT Cathelicidin-PY RKCNFLCKLKEKLRTVITSHIDKVLRPQG 29 T 2.6 Importin_rep_5 pdbhh F Eukaryota T 2lrd 1 A A I3NI56_9EUKA Acanthaporin AMGKCSVLKKVACAAAIAGAVAACGGIDLPCVLAALKAAEGCASCFCEDHCHGVCKDLHLC 61 T 0.13 PNTB pdbpercent F Eukaryota T 2lre 1 A,B A,B I3NI56_9EUKA Acanthaporin AMGKCSVLKKVACAAAIAGAVAACGGIDLPCVLAALKAAEGCASCFCEDHCHGVCKDLHLC 61 T 0.13 PNTB pdbpercent F Eukaryota T 2lrh 1 A A De novo designed protein MKELILINTNNDELIKKIKKEVENQGYQVRDVNDSDELKKEMKKLAEEKNFEKILIISNDKQLLKEMLELISKLGYKVFLLLQDQDENELEEFKRKIESQGYEVRKVTDDEEALKIVREFMQKAGSLEHHHHHH 134 T 0.0021 NLBH pdb F T 2ls1 1 A A B5I0A0_9ACTN Uncharacterized protein CVWGGDCTDFLGCGTAWICV 20 T 0.36 CCAP pdbhh F Bacteria T 2lsa 1 A A MAGA_XENLA MAGAININ II GIGKFLHSAKKFGKAFVGEIMNS 23 T 0.87 TAFII28 pdbhh F Eukaryota T 2lse 1 A A Four Helix Bundle Protein MQEERKKLLEKLEKILDEVTDGAPDEARERIEKLAKDVKDELEEGDAKNMIEKFRDEMEQMYKDAPNAVMEQLLEEIEKLLKKAGSLVPRGSYLEHHHHHH 101 T 0.00029 Prominin pdb F T 2lsi 2 B B POLK_HUMAN DINB PROTEIN, DINP GSHKKSFFDKKRSERKW 17 T 12 FDF pdbhh F Eukaryota T 2lsj 2 B B POLK_MOUSE DINB PROTEIN, DINP SHMSHKKSFFDKKRSERISNCQDTS 25 T 0.0065 DUF4113 unphh F Eukaryota T 2lsk 2 B B POLH_HUMAN RAD30 HOMOLOG A, XERODERMA PIGMENTOSUM VARIANT TYPE PROTEIN QSTGTEPFFKQKSLLL 16 T 4.1 Med28 pdbhh F Eukaryota T 2lsp 1 A A TF65_HUMAN NF-kB-K310ac peptide RTYETFXSIMKKS 13 T 2.1 Pab87_oct pdbhh F Eukaryota T 2lsr 1 A A USH1C_HUMAN ANTIGEN NY-CO-38/NY-CO-37, AUTOIMMUNE ENTEROPATHY-RELATED ANTIGEN AIE-75, PROTEIN PDZ-73, RENAL CARCINOMA ANTIGEN NY-REN-3, USHER SYNDROME TYPE-1C PROTEIN MDRKVAREFRHKVDFLIENDAEKDYLYDVLRMYHQTMDVAVLVGDLKLVINEPSRLPLFDAIRPLIPLKHQVEYDQLTPR 80 T 0.0072 DUF3567 pdb F Eukaryota T 2lsr 2 B B CAD23_HUMAN peptide from Cadherin-23 GSLLKEVLEDYLRLKK 16 T 5.8 Nup54_57_C pdbhh F Eukaryota T 2lsv 2 B B HSP82_YEAST 82 KDA HEAT SHOCK PROTEIN, HEAT SHOCK PROTEIN HSP90 HEAT-INDUCIBLE ISOFORM ADTEMEEVD 9 T 19 CHZ pdbhh F Eukaryota T 2lti 1 A A E8RMD3_ASTEC ASTEXIN1 GLSQGVEPDIGQTYFEESRINQD 23 T 4.6 LSPR pdbhh F Bacteria T 2lto 2 B B RPB1_HUMAN RNA POLYMERASE II SUBUNIT B1, DNA-DIRECTED RNA POLYMERASE II SUBUNIT A, DNA-DIRECTED RNA POLYMERASE III LARGEST SUBUNIT, RNA-DIRECTED RNA POLYMERASE II SUBUNIT RPB1 YSPSSPXYTPQSP 13 T 0.00021 RNA_pol_Rpb1_R pdbhh F Eukaryota T 2ltu 1 A A AAPK2_HUMAN AMPK SUBUNIT ALPHA-2 GPHMSYDANVIDDEAVKEVCEKFECTESEVMNSLYSGDPQDQLAVAYHLIIDNRRIMNQASE 62 T 0.00043 UBA_2 unppssm F Eukaryota T 2ltv 2 B B SMAD7_HUMAN Smad7 derived peptide SPPPPYSRYPMD 12 T 0.082 WBP-1 pdbhh F Eukaryota T 2ltw 2 B B SMAD7_HUMAN Smad7 derived peptide GESPPPPYSRYPMD 14 T 7.5 SlyX pdbhh F Eukaryota T 2ltx 2 B B SMAD7_HUMAN Smad7 derived peptide ELESPPPPYSRYPMD 15 T 2.4 WBP-1 pdbhh F Eukaryota T 2lty 2 B B SMAD7_HUMAN Smad7 derived peptide ELESPPPPYSRYPMD 15 T 2.4 WBP-1 pdbhh F Eukaryota T 2ltz 2 B B SMAD7_HUMAN Smad7 derived peptide ELESPPPPYSRYPMD 15 T 2.4 WBP-1 pdbhh F Eukaryota T 2lu2 1 A A H4, PUTATIVE RTMDTQNDVESAGRQSEPMEAADRQAEHPGAPTQSEMKEFQEEIKEGVEETKHEGDPEMTRLMVTEKQESKNFSKMAKSQSFSTRIEELGGSISFLTETGVTMIELPKTVSEHDMDQLLHDILAAGGVVGLDSEVKLA 138 T 0.024 THF_DHG_CYH pdb F T 2lue 2 B B OPTN_HUMAN E3-14.7K-INTERACTING PROTEIN, FIP-2, HUNTINGTIN YEAST PARTNER L, HUNTINGTIN-INTERACTING PROTEIN 7, HIP-7, HUNTINGTIN-INTERACTING PROTEIN L, NEMO-RELATED PROTEIN, OPTIC NEUROPATHY-INDUCING PROTEIN, TRANSCRIPTION FACTOR IIIA-INTERACTING PROTEIN, TFIIIA-INTP NSSGSSEDSFVEIRMAE 17 T 5.1 Pea-VEAacid pdbhh F Eukaryota T 2luf 1 A A Retro Trp-cage peptide SPPPRGSSPGGDKLWQIYLN 20 T 5 DUF1822 pdbhh F T 2lv6 2 B B MYLK2_HUMAN MLCK2 KRRWKKNFIAVSAANRFKKISSSGAL 26 T 0.024 PACT_coil_coil unppssm F Eukaryota T 2lvb 1 A A DE NOVO DESIGNED PFK fold PROTEIN MGKVLLVISTDTNIISSVQERAKHNYPGREIRTATSSQDIRDIIKSMKDNGKPLVVFVNGASQNDVNEFQNEAKKEGVSYDVLKSTDPEELTQRVREFLKTAGSLEHHHHHH 112 T 0.0027 DUF3801 pdb F T 2lvh 1 A A Y059A_AFV1Y Putative zinc finger protein ORF59a MIEVSSMERVYQCLRCGLTFRTKKQLIRHLVNTEKVNPLSIDYYYQSFSVSLKDVNKII 59 T 0.00023 zf-C2H2 pdb T Viruses T 2lvm 2 B B H4_HUMAN Histone H4 GAKRHRKVLRDNIQ 14 T 0.27 UPF0137 unp F Eukaryota T 2lw5 1 A A L7P7M1_9CAUD ACR30-35 GMKFIKYLSTAHLNYMNIAVYENGSKIKARVENVVNGKSVGARDFDSTEQLESWFYGLPGSGLGRIENAMNEISRRENP 79 T 0.091 UXS1_N pdbpercent T Viruses T 2lw6 1 A A H2DQR0_MAGOR AvrPiz-t protein SFVQCNHHLLYNGRHWGTIRKKAGWAVRFYEEKPGQPKRLVAICKNASPVHCNYLKCTNLAAGFSAGTSTDVLSSGTVGS 80 T 12 DUF3918 unphh F Eukaryota T 2lwb 1 A A C5GR14_AJEDR Adhesin WI-1 NCDWDKSHEKYDWELWDKWC 20 T 8.6 NinD pdbhh F Eukaryota T 2lwc 1 A A PENK_HUMAN Met-enkephalin YGGFM 5 T 1.5 Op_neuropeptide pdb F Eukaryota F 2lwq 1 A A PawS derived peptide 11 (PDP-11) GCWPVPYPPFFDCKPN 16 T 0.12 Antimicrobial23 pdbhh F T 2lws 1 A A PawS Derived Peptide 4 (PDP-4) GSCFGAFCFRRD 12 T 0.43 LIX1 pdbhh F T 2lwt 1 A A PawS Derived Peptide 5 (PDP-5) GRYRRCIPGMFRAYCYMD 18 T 9.6 IGF2_C pdbhh F T 2lwu 1 A A PawS Derived Peptide 7 (PDP-7) GHCIPTTSGPICLRD 15 T 4.5 Toxin_29 pdbhh F T 2lwv 1 A A PawS Derived Peptide 6 (PDP-6) GHCIQVPPMATEICFSD 17 T 0.78 YcgL pdbhh F T 2lww 2 B B TF65_MOUSE V-REL RETICULOENDOTHELIOSIS VIRAL ONCOGENE HOMOLOG A (AVIAN) GSHMKSTQAGEGTLSEALLHLQFDADEDLGALLGNSTDPGVFTDLASVDNSEFQQLLNQGVSMSHSTAEP 70 T 0.19 HBS1_N pdb F Eukaryota T 2lx0 1 A A Membrane fusion protein p14 KKHTIWEVIAGLVALLTFLAFGFWLFKYLQKK 32 T 0.0041 GAPT pdbhh F T 2lx4 1 A A VPP2_MOUSE V-type proton ATPase 116 kDa subunit a isoform 2 MGSLFRSESMCLAQLFL 17 T 0.0014 V_ATPase_prox unphh F Eukaryota T 2lx5 1 A A ATPE_MYCTU ATP SYNTHASE F1 SECTOR EPSILON SUBUNIT, F-ATPASE EPSILON SUBUNIT DPRIAARGRARLRAVGAI 18 T 0.00011 ATP-synt_DE unppssm F Bacteria T 2lx6 1 A A D5VKJ9_CAUST CAULOSEGNIN I GAFVGQPEAVNPLGREIQG 19 T 0.044 DUF5972 unphh F Bacteria T 2lxg 1 A A CM3A_CONKI Mu-conotoxin KIIIA CCNCSSKWCRDHSRCCX 17 T 0.55 C5HCH pdbhh F Eukaryota T 2lxs 2 B B KMT2A_HUMAN ALL-1, CXXC-TYPE ZINC FINGER PROTEIN 7, LYSINE N-METHYLTRANSFERASE 2A, KMT2A, TRITHORAX-LIKE PROTEIN, ZINC FINGER PROTEIN HRX, MLL CLEAVAGE PRODUCT N320, N-TERMINAL CLEAVAGE PRODUCT OF 320 KDA, P320, MLL CLEAVAGE PRODUCT C180, C-TERMINAL CLEAVAGE PRODUCT OF 180 KDA, P180 DAGNILPSDIMDFVLKNTP 19 T 7.3 PLN_propep pdbhh F Eukaryota T 2lxt 2 B B KMT2A_HUMAN ALL-1, CXXC-TYPE ZINC FINGER PROTEIN 7, LYSINE N-METHYLTRANSFERASE 2A, KMT2A, TRITHORAX-LIKE PROTEIN, ZINC FINGER PROTEIN HRX, MLL CLEAVAGE PRODUCT N320, N-TERMINAL CLEAVAGE PRODUCT OF 320 KDA, P320, MLL CLEAVAGE PRODUCT C180, C-TERMINAL CLEAVAGE PRODUCT OF 180 KDA, P180 DAGNILPSDIMDFVLKNTP 19 T 7.3 PLN_propep pdbhh F Eukaryota T 2lye 1 A A BTD-2 GVCRCVCRRGVCRCVCRR 18 T 0.98 CXCXC pdbhh F F 2lyf 1 A A RTD-1 GFCRCLCRRGVCRCICTR 18 T 0.63 DUF5354 pdbhh F T 2lzi 1 A A HTD-2 GICRCICGRRICRCICGR 18 T 0.18 Haemadin pdbhh F F 2lzo 1 A A TX9A_URTGR UGTX ISIDPPCRFCYHRDGSGNCVYDAYGCGAV 29 T 1.3 DUF1247 pdbhh F Eukaryota T 2lzx 1 A A Asteropsin B QGCAFEGESCNVEFYPCCPGLGLTCIPGNPDGTCYYL 37 T 0.059 Tachystatin_A pdbhh F T 2lzy 1 A A ABU8-3 QDCPGEGEQCDVEFNPCCPPLTCIPGDPYGICYII 35 T 0.00043 Tachystatin_A pdbhh F T 2m0j 2 B B CNGA2_RAT CYCLIC NUCLEOTIDE-GATED CATION CHANNEL 2, CYCLIC NUCLEOTIDE-GATED CHANNEL ALPHA-2, CNG CHANNEL ALPHA-2, CNG-2, CNG2, CYCLIC NUCLEOTIDE-GATED OLFACTORY CHANNEL SUBUNIT OCNC1 TPRRGRGGFQRIVRLVGVIRDWANKNFR 28 T 0.16 CtsR pdbhh F Eukaryota T 2m0k 2 B B CNGA2_RAT CYCLIC NUCLEOTIDE-GATED CATION CHANNEL 2, CYCLIC NUCLEOTIDE-GATED CHANNEL ALPHA-2, CNG CHANNEL ALPHA-2, CNG-2, CNG2, CYCLIC NUCLEOTIDE-GATED OLFACTORY CHANNEL SUBUNIT OCNC1 TPRRGRGGFQRIVRLVGVIRDWANKNFR 28 T 0.16 CtsR pdbhh F Eukaryota T 2m0u 2 B B C-terminal CFTR peptide QDTRL 5 T 270 DUF6013 pdbhh F F 2m0v 2 B B C-terminal CFTR peptide QDTRL 5 T 270 DUF6013 pdbhh F F 2m0w 1 A A ALPS peptide DFLNSAMSSLYSGWSSFTTGASK 23 T 46 DUF4748 pdbhh F T 2m14 2 B B RAD4_YEAST DNA repair protein RAD4 GSTDDSVEEIQSSEEDYDSEEFEDVTDGNEVAGVEDISVEIK 42 T 15 UL11 pdbhh F Eukaryota T 2m1f 1 A A Antiamoebin I XFXXXXGLXXPQXPXPX 17 T 0.21 Pep_deformylase pdbhh F T 2m1p 1 A A [Aba5,14]BTD-2 GVCRXVCRRGVCRXVCRR 18 T 0.79 DUF6029 pdbhh F F 2m20 1 A,B A,B EGFR_HUMAN PROTO-ONCOGENE C-ERBB-1, RECEPTOR TYROSINE-PROTEIN KINASE ERBB-1 KIPSIATGLVGALLLLLVVALGIGLFIRRRHIVRKRTLRRLLQERELVEPLTPSGEKLWS 60 T 0.0014 GAPT pdb F Eukaryota T 2m2g 1 A A [Aba3,16]BTD-2 GVXRCVCRRGVCRCVXRR 18 T 0.78 DUF5354 pdbhh F F 2m2h 1 A A [Aba3,7,12,16]BTD-2 GVXRCVXRRGVXRCVXRR 18 T 30 DUF4235 pdbhh F F 2m2q 1 A A V5IRT8_MOMCH Inhibitor cystine knot peptide MCh-1 GCAGKSCNILGSDPCDAGCFCLPVGIVAGVCV 32 T 0.0001 Albumin_I pdbhh F Eukaryota T 2m2r 1 A A V5IRT9_MOMCH Inhibitor cystine knot peptide MCh-2 GCAGKACNLLGLTCDAGCFCRPDGVGIVAGVCV 33 T 0.00011 Albumin_I unphh F Eukaryota T 2m2s 1 A A [Aba5,7,12,14]BTD-2 GVCRXVXRRGVXRXVCRR 18 T 1.8 Cob_adeno_trans pdbhh F F 2m2x 1 A A [Aba3,5,7,12,14,16]BTD-2 GVXRXVXRRGVXRXVXRR 18 T 11 PhetRS_B1 pdbhh F F 2m2y 1 A A BTD-2[3,4] RCVCRRGVCRCVCRRGVC 18 T 0.94 CXCXC pdbhh F F 2m32 2 B,C,D B,C,D GLOGEN peptide XGPPGPPGLPGENGPPGPPGPPX 23 T 0.00073 Collagen pdbpssm F T 2m35 1 A A TXK1A_SCOMU k-Ssm1a TDDESSNKCAKTKRRENVCRVCGNRSGNDEYYSECCESDYRYHRCLDLLRNF 52 T 2.6 DUF2614 unphh F Eukaryota T 2m37 1 A A E8RMD3_ASTEC ASTEXIN-1 GLSQGVEPDIGQTYFEESR 19 T 2.8 LSPR pdbhh F Bacteria T 2m3a 1 A A KNL2_CAEEL Protein KNL-2 GPLGSVAKKITWRKQDLDRLKRVIALKKPSASDADWTEVLRLLAKEGVVEPEVVRQIAITRLKWVEP 67 T 0.012 Kdo pdb F Eukaryota T 2m3j 1 A A I1SB10_9METZ Asteropsin_E CPGEGEQCDVEFNPCCPPLTCIPGDPYGICYII 33 T 0.0004 Tachystatin_A pdbhh F Eukaryota T 2m3m 2 B B VE6_HPV51 Protein E6 QRTRQRNETQV 11 T 0.072 DUF3716 unp T Viruses T 2m3o 2 B P SCNNA_HUMAN ALPHA-NACH, EPITHELIAL NA(+) CHANNEL SUBUNIT ALPHA, ALPHA-ENAC, ENACA, NONVOLTAGE-GATED SODIUM CHANNEL 1 SUBUNIT ALPHA, SCNEA TAPPPAYATLG 11 T 5.2 Myc_target_1 pdbhh F Eukaryota T 2m41 1 A A CIC_HUMAN Protein capicua homolog VFPWHSLVPFLAPSQ 15 T 15 DUF6356 pdbhh F Eukaryota T 2m45 1 A A MCM_SULSO Minichromosome maintenance protein MCM GSHMGESGKIDIDTIMTGKPKSAREKMMKIIEIIDSLAVSSECAKVKDILKEAQQVGIEKSNIEKLLTDMRKSGIIYEAKPECYKKV 87 T 0.0016 RPA_C pdbpercent F Archaea T 2m4i 1 A A MINC_BACSU Septum site-determining protein MinC GSHMKTKKQQYVTIKGTKNGLTLHLDDACSFDELLDGLQNMLSIEQYTDGKGQKISVHVKLGNRFLYKEQEEQLTELIASKKDLFVHSIDSEVITKKEAQQIREE 105 T 0.009 AF0941-like pdbpssm F Bacteria T 2m56 1 A A CPXA_PSEPU CYTOCHROME P450-CAM, CYTOCHROME P450CAM LAPLPPHVPEHLVFDFDMYNPSNLSAGVQEAWAVLQESNVPDLVWTRCNGGHWIATRGQLIREAYEDYRHFSSECPFIPREAGEAYDFIPTSMDPPEQRQFRALANQVVGMPVVDKLENRIQELACSLIESLRPQGQCNFTEDYAEPFPIRIFMLLAGLPEEDIPHLKYLTDQMTRPDGSMTFAEAKEALYDYLIPIIEQRRQKPGTDAISIVANGQVNGRPITSDEAKRMCGLLLVGGLDTVVNFLSFSMEFLAKSPEHRQELIQRPERIPAACEELLRRFSLVADGRILTSDYEFHGVQLKKGDQILLPQMLSGLDERENACPMHVDFSRQKVSHTTFGHGSHLCLGQHLARREIIVTLKEWLTRIPDFSIAPGAQIQHKSGIVSGVQALPLVWDPATTKAV 404 T 1.6E-05 p450 unppercent F Bacteria T 2m5z 1 A A Q1A2D3_ENTFL ENTEROCIN 7A, ENTEROCIN MR10A, ENTEROCIN NA MGAIAKLVAKFGWPIVKKYYKQIMQFIGEGWAINKIIDWIKKHI 44 T 0.0087 Bacteriocin_IIi pdbhh F Bacteria T 2m60 1 A A Q1A2D2_ENTFL ENTEROCIN 7B, ENTEROCIN MR10B, ENTEROCIN NB MGAIAKLVAKFGWPFIKKFYKQIMQFIGQGWTIDQIEKWLKRH 43 T 9E-05 Bacteriocin_IIi unphh F Bacteria T 2m61 1 A A TXM46_CONAO CONOTOXIN AR1430 CCRLACGLGCHPCCX 15 T 0.24 Radical_SAM_2 pdbhh F Eukaryota T 2m62 1 A A CXT48_CONAO CONOTOXIN AR1232 GVCCGVSFCYPC 12 T 0.18 Oxidored-like unphh F Eukaryota T 2m6c 1 A A Contryphan-In GCVXYPWC 8 T 0.027 ProRS-C_1 pdbhh F T 2m6d 1 A A Contryphan-In GCVXYPWC 8 T 0.027 ProRS-C_1 pdbhh F T 2m6e 1 A A Contryphan-In GCVXYPWC 8 T 0.027 ProRS-C_1 pdbhh F T 2m6f 1 A A Contryphan-In GCVXYPWC 8 T 0.027 ProRS-C_1 pdbhh F T 2m6j 1 A A A0A023GPI4_9ARAC Toxin AbTx IKSCETFIVACDGGKACREVKCKTIX 26 T 1.4 TFIIA_gamma_C pdbhh F Eukaryota T 2m6x 1 A,B,C,D,E,F A,B,C,D,E,F POLG_HCVEV p7 GAKNVIVLNAASAAGNHGFFWGLLVVTLAWHVKGRLVPGATYLSLGVWPLLLVRLLRPHRALA 63 T 0.23 FixQ pdbpercent T Viruses T 2m77 1 A A [Asp2]RTD-1 GDCRCLCRRGVCRCICTR 18 T 0.89 DUF5354 pdbhh F T 2m78 1 A A [Asp11]RTD-1 GFCRCLCRRGDCRCICTR 18 T 0.43 Albumin_I pdbhh F T 2m79 1 A A [Asp2,11]RTD-1 GDCRCLCRRGDCRCICTR 18 T 0.48 Albumin_I pdbhh F T 2m7a 1 A A Uncharacterized protein GSMKRGVEMSIHDLCEDQEQWAMQTLMGSGVLARCRIHNDVILDSGNDASSAYKLGTYLYQKDNSCNLFNTLTEARDAIKDAYESYCGIDDCPQCSKYIDD 101 T 0.071 Phage_FRD3 unppssm F T 2m7b 1 A A Q88G17_PSEPK uncharacterized protein GSMGGIKRLMEEEDAKYSEAVYIAIEAGTLAECEVHEGTYFSDSGDISEAEELAREKFEKGEVSNFDDVEELVKKVVAVCEELGAEECFSCDFD 94 T 0.55 DUF5789 unphh F Bacteria T 2m7c 1 A A Trp-Cage mini-protein RPPPSDXAAYAQWLADXGWAS 21 T 1.1 DUF3349 pdbhh F T 2m7d 1 A A Trp-Cage mini-protein DAYAQWLADXGWASXRPPPS 20 T 3 Sec16_C pdbhh F T 2m7i 1 A A Beta-Hairpin Peptidomimetic antibiotic TWL(DAB)(ORN)(DLY)RW(ORN)(DAB)AK(DPR)P TWLXXXRWXXAKXP 14 T 1.5 Mak_N_cap pdbhh F T 2m7j 1 A A beta-Hairpin Peptidomimetic Antibiotic TWLKKRRWKKAK(DPR)P TWLKKRRWKKAKXP 14 T 0.82 Mak_N_cap pdbhh F T 2m7r 1 A A CON BK-B GEEEYSEAIX 10 T 0.19 Toxin_36 pdbhh F T 2m8f 1 A A E8RUP8_ASTEC astexin3 GPTPMVGLDSVSGQYWDQHAPLAD 24 T 2.1 Cut12 pdbhh F Bacteria T 2m8s 2 B B HBEGF_HUMAN HEPARIN-BINDING EGF-LIKE GROWTH FACTOR, HB-EGF, HBEGF, DIPHTHERIA TOXIN RECEPTOR, DT-R RYHRRGGYDVENEEKVKLGMTNSH 24 T 0.011 DAG1 unppssm F Eukaryota T 2m9p 2 B B Serine protease inhibitor XXKRX 5 T 590 COE1_HLH pdbhh F F 2m9q 2 B B Serine protease inhibitor XXKRX 5 T 590 COE1_HLH pdbhh F F 2ma3 1 A A O27798_METTH DNA replication initiator (Cdc21/Cdc54) GAMGETGKIDIDKVEGRTPKSERDKFRLLLELIKEYEDDYGGRAPTNILITEMMDRYNVSEEKVEELIRILKDKGAIFEPARGYLKIV 88 T 0.0019 Sigma70_r3 unppercent F Archaea T 2maa 1 A A TEMA_RANTE Temporin-A FLPLIGRVLSGIL 13 T 0.59 Endotoxin_N pdbhh F Eukaryota T 2mae 1 A A TACD2_HUMAN CELL SURFACE GLYCOPROTEIN TROP-2, MEMBRANE COMPONENT CHROMOSOME 1 SURFACE MARKER 1, PANCREATIC CARCINOMA MARKER PROTEIN GA733-1 TNRRKSGKYKKVEIKELGELRKEPSL 26 T 0.0044 DAG1 unppercent F Eukaryota T 2mag 1 A _ MAGA_XENLA MAGAININ 2 GIGKFLHSAKKFGKAFVGEIMNSX 24 T 0.98 TAFII28 pdbhh F Eukaryota T 2mai 1 A A Lassomycin GLRRLFANQLVGRRNX 16 T 5.1 Rod_cone_degen pdbhh F T 2mak 2 B,D B,D CRCM1_HUMAN PROTEIN ORAI-1, TRANSMEMBRANE PROTEIN 142A GSELNELAEFARLQDQLDHRGDH 23 T 0.029 DUF2207 unppssm F Eukaryota T 2mbd 1 A A W5IDB3_LASLA lasiocepsin GLPRKILCAIAKKKGKCKGPLKLVCKC 27 T 1.3 Antimicrobial_1 pdbhh F Eukaryota T 2mbl 1 A A Top7 Fold Protein Top7m13 MSGKKVEVQVKITCNGKTYERTYQLYAVRDEELKEKLKKVLNERMDPIKKLGCKRVRISIRVKHSDAAEEKKEAKKFAAILNKVFAELGYNDSNVTWDGDTVTVEGQLEGVDLEHHHHHH 120 T 0.0046 N-glycanase_N pdb F T 2mbm 1 A A Top7 Fold Protein Top7m13 MSGKKVEVQVKITCNGKTYERTYQLYAVRDEELKEKLKKVLNERMDPIKKLGCKRVRISIRVKHSDAAEEKKEAKKFAAILNKVFAELGYNDSNVTWDGDTVTVEGQLEGVDLEHHHHHH 120 T 0.0046 N-glycanase_N pdb F T 2mbz 2 B B Promothiocin A SXVGXAXAXXAX 12 T 50 Peptidase_C12 pdbhh F F 2mc0 2 B B nosiheptide SXTXXXXCXXXAX 13 T 0.06 CCER1 pdbhh F F 2mc1 2 B B KSYK_MOUSE SPLEEN TYROSINE KINASE DTEVYESPXADPE 13 T 23 Holin_2-3 pdbhh F Eukaryota T 2mc3 1 A A MUS81_HUMAN CDNA FLJ44872 FIS, CLONE BRAMY2022320, HIGHLY SIMILAR TO CROSSOVER JUNCTION ENDONUCLEASE MUS81 (EC 3.1.22.-) GPTMGSGSYWPARHSGARVILLVLYREHLNPNGHHFLTKEELLQRCAQKSPRVAPGSAPPWPALRSLLHRNLVLRTHQPARYSLTPEGLELAQKLAESEGLSLLNVGIG 109 T 0.00099 DUF6429 pdbpercent F Eukaryota T 2mc4 1 A A O52732_STRCH BLDD MEPPPKLVLDLERLATVPAEKAGPLQRYAATIQSQRGDYNGKVLSIRQDDLRTLAVIYDQSPSVLTEQLISWGVLDADARRAVASHDEL 89 T 0.053 DUF43 pdbpercent F Bacteria T 2mc5 1 A A Q8LTJ5_9CAUD RNA POLYMERASE INHIBITOR P7 MNEFTQISGYVNAFGSQRGSVLTVKVENDEGWTLVEEDFDRADYGSDPEFVAEVSSYLKRNGGIKDLTKVLTR 73 T 0.18 DUF1494 pdb T Viruses T 2mc6 1 A A Q8LTJ5_9CAUD 45L MNEFTQISGYVNAFGSQRGSVLTVKVENDEGWTLVEEDFDRADYGSDPEFVAEVSSYLKRNGGIKDLTKVLTR 73 T 0.18 DUF1494 pdb T Viruses T 2mc6 2 B B RPOC_XANOR RNAP SUBUNIT BETA', RNA POLYMERASE SUBUNIT BETA', TRANSCRIPTASE SUBUNIT BETA' MKDLLNLFNQ 10 T 6.5 CRM1_repeat_3 pdbhh F Bacteria T 2mc7 1 A A B0LJC7_SALTM Regulatory peptide MNRSPDKIIALIFLLISLLVLCLALWQIVF 30 T 0.038 DUF202 pdb F Bacteria T 2mcd 1 A A Q80J95_9CALI Murine norovirus 1 MRGSHHHHHHGSVSFGAPSPLSSESEDEINYMTPPEQEAQPGALAALHAEGPLAGLPVTRSDARVLIFNEWEERKKSEPWLRLDMSDKAIFRRYPHLR 98 T 5.3 Amidase pdbhh T Viruses T 2mce 1 A A TKN1_RABIT NPGAMMA, PROTACHYKININ-1 DAGHGQISHKRHKTDSFVGLM 21 T 0.0051 Tachykinin pdbhh F Eukaryota T 2mcf 1 A A C5A217_THEGJ TGAM_1934 MKYDVVIIPESFHRFDKHNMEHICPPMVIGDRSYDIAMEIVNGVDRVIKASFNASVEELEGEDCDVLYRKYTLEKEGKKGIVHVKLRKITENCPPVDGNRCSVLEFERDIECIVKAIEECLAKGELNSKLEGKPIPNPLLGLDSTRTG 148 T 0.088 DDE_Tnp_1 pdbpssm F Archaea T 2mch 1 A A Q80J95_9CALI Murine norovirus 1 MRGSHHHHHHGSGALAALHAEGPLAGLPVTRSDARVLIFNEWEERKKSDPWLRLDMSDKAIFRRYPHLR 69 T 2.9 DUF3539 pdbhh T Viruses T 2mck 1 A A H6WEV7_9CALI Polyprotein MRGSHHHHHHGSGALAALHADGPHAGLPVTRSDARVLIFNDWEERKRSEPWLRLDMSDKAIFRRYPHLR 69 T 2.7 DUF3539 pdbhh T Viruses T 2mdb 1 A A TAC1_TACTR TACHYPLESIN I KWCFRVCYRGICYRRCRX 18 T 0.021 Myticin-prepro unp F Eukaryota T 2mfa 1 A A 3SX2_DENPO MAMB-2, PI-DP2 LKCFQHGKVVTCHRDMKFCYHNTGMPFRNLKLILQGCSSSCSETENNKCCSTDRCNK 57 T 0.0012 Toxin_TOLIP pdb F Eukaryota T 2mfm 1 A A G7K427_MEDTR CEP11 AFRXTAPGHSXGVGH 15 T 0.22 RNA_pol_Rpb1_R unp F Eukaryota T 2mfo 1 A A G7K427_MEDTR CEP1 AFQXTTPGNSXGVGH 15 T 0.22 RNA_pol_Rpb1_R unp F Eukaryota T 2mfq 2 B B NTRK2_HUMAN GP145-TRKB, TRK-B, NEUROTROPHIC TYROSINE KINASE RECEPTOR TYPE 2, TRKB TYROSINE KINASE, TROPOMYOSIN-RELATED KINASE B GPDAVIIGMTKIPVIENPQXFGI 23 T 8.9 DUF6330 pdbhh F Eukaryota T 2mfs 1 A A Ep-AMP1 CVLIGQRCDNDRGPRCCSGQGNCVPLPFLGGVCAV 35 T 0.0023 Toxin_7 pdb F T 2mfv 1 A A F0CAT1_9XANT Xanthomonin II GGPLAGEEMGGITT 14 T 4.9 Rhabdo_M2 pdbhh F Bacteria T 2mg5 2 B B NOS3_HUMAN target peptide TFKEVANAVKISASLM 16 T 0.013 DUF2774 pdbhh F Eukaryota T 2mgw 1 A A NBR1_HUMAN CELL MIGRATION-INDUCING GENE 19 PROTEIN, MEMBRANE COMPONENT CHROMOSOME 17 SURFACE MARKER 2, NEIGHBOR OF BRCA1 GENE 1 PROTEIN, PROTEIN 1A1-3B GPLGSSEDQTAALMAHLFEMGFCDRQLNLRLLKKHNYNILQVVTELLQLNNN 52 T 0.0014 UBA_5 pdbhh F Eukaryota T 2mh0 1 A A TFE2_HUMAN CLASS B BASIC HELIX-LOOP-HELIX PROTEIN 21, BHLHB21, IMMUNOGLOBULIN ENHANCER-BINDING FACTOR E12/E47, IMMUNOGLOBULIN TRANSCRIPTION FACTOR 1, KAPPA-E2-BINDING FACTOR, TRANSCRIPTION FACTOR 3, TCF-3, TRANSCRIPTION FACTOR ITF-1 GSMNQPQRMAPVGTDKELSDLLDFSMMFPLPVTNGKGRP 39 T 32 DUF1480 pdbhh F Eukaryota T 2mh5 1 A A LAN91_MICS0 Lantibiotic 107891 VXXXXLCXPGCTXPGGGXNCXFCX 24 T 0.00093 Gallidermin unppssm F Bacteria T 2mho 2 B B 5HT2C_RAT peptide from 5-hydroxytryptamine receptor 2C VVSERISSV 9 T 6.7 Codanin-1_C pdbhh F Eukaryota F 2mhy 1 A A Q0GB44_9SALA Plethodontid modulating factor LQCNTLDGGTEECIPGIYNVCVHYKSEDEEYKSCGIQEECEDAEGATVLCCPEDLCN 57 T 0.042 Defensin_propep unppercent F Eukaryota T 2mid 1 A A CLE10_ARATH CLE10P RLVPSGPNPLHN 12 T 21 DUF502 pdbhh F Eukaryota T 2mie 1 A A CLE41_ARATH TRACHEARY ELEMENT DIFFERENTIATION INHIBITORY FACTOR-LIKE PROTEIN, TDIF-LIKE PROTEIN, CLE44P HEVPSGPNPISN 12 T 2.6 DUF502 pdbhh F Eukaryota T 2mif 1 A A CLAVATA-like encoded peptide of Meloidogyne hapla - MhCLE4 HEVPSGPNPSSN 12 T 1.1 DUF2315 pdbhh F T 2mig 1 A A CLAVATA-like encoded peptide of Meloidogyne hapla - MhCLE5 RKVPTGSNPQKN 12 T 4.6 Bradykinin pdbhh F T 2mih 1 A A CLAVATA-LIKE ENCODED PEPTIDE OF MELOIDOGYNE HAPLA - MHCLE6/7 HQVPSGPNPLHNKK 14 T 3.3 DUF3581 pdbhh F T 2mip 2 E,F,G,H E,F,G,H INHIBITOR BI-LA-398 FVFLEIX 7 T 51 HVSL pdbhh F T 2mix 1 A A T3A_TERVA venom peptide toxin TRICCGCYWNGSKDVCSQSCC 21 T 1.1 Hepcidin pdbhh F Eukaryota T 2mj5 2 B B NBR1_HUMAN CELL MIGRATION-INDUCING GENE 19 PROTEIN, MEMBRANE COMPONENT CHROMOSOME 17 SURFACE MARKER 2, NEIGHBOR OF BRCA1 GENE 1 PROTEIN, PROTEIN 1A1-3B GPLGSSEDQTAALMAHLFEMGFCDRQLNLRLLKKHNYNILQVVTELLQLNNN 52 T 0.0014 UBA_5 pdbhh F Eukaryota T 2mjf 1 A A RSA1_YEAST Ribosome assembly 1 protein GPHMFANENSQLLDFIRELGDVGLLEYELSQQEKDVLFGS 40 T 0.17 SpoIISA_toxin unppssm F Eukaryota T 2mjq 1 A A ANOP_ANOSM AS-183 GLLKRIKTLLX 11 T 4.2 PspA_IM30 unphh F Eukaryota T 2mjr 1 A A ANOP_ANOSM AS-183 GLLKWIKTLLX 11 T 0.52 DUF4653 pdbhh F Eukaryota T 2mjs 1 A A ANOP_ANOSM AS-183 GLLKKIKWLLX 11 T 4.2 PspA_IM30 unphh F Eukaryota T 2mjt 1 A A ANOP_ANOSM AS-183 GLLKFIKWLLX 11 T 0.44 Lipoprotein_10 pdbhh F Eukaryota T 2mjv 1 A A TWST1_HUMAN CLASS A BASIC HELIX-LOOP-HELIX PROTEIN 38, BHLHA38, H-TWIST SPAQGXRGXKSA 12 T 10 Parecho_VpG pdbhh F Eukaryota T 2mk0 1 A A O22015_CYLFU PLEURALIN-1, FORMERLY HEP200 SYYHHHHHHTMMPSPEPSSQPSDCGEVIEECPIDACFLPKSDSARPPDCTAVGRPDCNVLPFPNNIGCPSCCPFECSPDNPMFTPSPDGSPPNCSPTMLPSPSPSAVTVPLTPTMLPSPS 120 T 40 DUF35_N pdbhh F Eukaryota T 2mk7 1 A A DAG1_HUMAN ALPHA-DG, ADG XPPTTTTKKPX 11 T 110 DUF5852 pdbhh F Eukaryota F 2mkc 2 B B PML1_YEAST Pre-mRNA leakage protein 1 GSKSQYIDIMPDFSPSGLLELES 23 T 7.1 VirE_N pdbhh F Eukaryota T 2mkr 2 B B EBNA2_EBVB9 EBNA-2, EBV NUCLEAR ANTIGEN 2 DLDESWDYIFETT 13 T 0.75 DUF3841 pdbhh T Viruses T 2ml5 1 A A A7LT22_BACO1 Uncharacterized protein GDSELTTQDGEDFKSFLDKFTSSAAFQYTRVKFPLKTPITLLADDGETEKTFPFTKEKWPLLDSETMKEERITQEEGGIYVSKFTLNEPKHKIFEAGYEESEVDLRVEFELQADGKWYVVDCYTGWYGYDLPIGELKQTIQNVKEENAAFKEIHP 155 T 0.00055 DUF4348 unppssm F Bacteria T 2ml6 1 A A A7V0E7_BACUC Uncharacterized protein GAEEEDFKTFLQKFTSSASFQYSRIKFPLKSPIALLKDDGETEQTFPFTREKWALLDEETLKEGRTTEEEGGTYISHFTVNEPAHKEFEAGYDESEPSLRVVFELTDGKWYVTDCYNDWYNFDLPINELEETIQAVQEENKAFEELHP 148 T 0.00018 DUF4348 pdbpercent F Bacteria T 2mlj 1 A A A0A0H2UKY1_CAUSK Caulonodin V SIGDSGLRESMSSQTYWP 18 T 7.8 Herpes_UL47 pdbhh F Bacteria T 2mlp 1 A _ MCBA_ECOLX MCBA PROPEPTIDE MELKASEFGVVLSVDALKLSRQSPLGX 27 T 2.8 DUF3905 pdbhh F Bacteria T 2mlu 1 A A Q7X2B5_LACLL LsbB MKTILRFVAGYDIASHKKKTGGYPWERGKA 30 T 0.82 DUF4262 pdbhh F Bacteria T 2mlv 1 A A Q7X2B5_LACLL LsbB MKTILRFVAGYDIASHKKKTGGYPWERGKA 30 T 0.82 DUF4262 pdbhh F Bacteria T 2mm5 1 A A A0A0S0ZR47_9GENT Alpha amylase Alstotide S4 CVPQYGVCDGIINQCCDPYYCSPPIYGHCI 30 T 0.0051 Toxin_35 unp F Eukaryota T 2mm6 1 A A A0A0S0ZR07_9GENT Alpha amylase Alstotide S1 CRPYGYRCDGVINQCCDPYHCTPPLIGICL 30 T 0.0063 Conotoxin unphh F Eukaryota T 2mmj 1 A A MCU11_LITGE maculatin G15 GLFGVLAKVAXHVVGAIAEHFX 22 T 4.7E-05 Caerin_1 unphh F Eukaryota T 2mmt 1 A A MCJA_ECOLX MCCJ25(RGDF) GGAGHVPEYFVRGDFPISFYG 21 T 0.13 Endonuc-BglII unp F Bacteria T 2mmw 1 A A MCJA_ECOLX MCCJ25 GGAGHVPEYFVRGDTPISFYG 21 T 0.13 Endonuc-BglII unp F Bacteria T 2mni 1 A A Q4D059_TRYCC HP_Q4D059 GSAMGHMPAVDVEIHFPLKRIAAEGYAEDELLLNQMGKVNDTPEEEGMPLRAWVIKCAHEALEKNPKIREVYLKPRAVKNSSVQFHVIFDEE 92 T 8.9 TetR_C_1 unphh F Eukaryota T 2mnu 2 B B APT SSSPIQGSWTWENGKWTWKGIIRLEQ 26 T 0.8 WXXGXW pdbhh F T 2mnw 1 A A SHQ1_HUMAN Protein SHQ1 homolog GSTAIGMKETAAAKFERQHMDSPDLGTGGGSGDDDDKMLTPAFDLSQDPDFLTIAIRVSYARVSEFDVYFEGSDFKFYAKPYFLRLTLPGRIVENGSEQGSYDADKGIFTIRLPKETPGQHFEGLNMLTALLA 133 T 0.002 PIH1_CS pdbpercent F Eukaryota T 2moa 1 A A CA1_CONIM ALPHA-CTX IMI GXASDPRCAWRCX 13 T 0.0098 Toxin_8 unphh F Eukaryota T 2moc 1 A A TKN4_HUMAN ENDOKININ-A/B TGKASQFFGLM 11 T 0.00015 Tachykinin unphh F Eukaryota T 2mow 2 B B PAP2_YEAST TRF4P, DNA POLYMERASE KAPPA, DNA POLYMERASE SIGMA, TOPOISOMERASE 1-RELATED PROTEIN TRF4 DDDEDGYNPYTL 12 T 3.8 IreB pdbhh F Eukaryota T 2mp2 3 C C RNF4_MOUSE RING FINGER PROTEIN 4 TVGDEIVDLTCESLEPVVVDLTHND 25 T 0.012 Bac_luciferase unppercent F Eukaryota T 2mp9 1 A A AFP_CENMR CM-P1 SRSELIVHQRLFX 13 T 5.7 Nbs1_C unphh F Eukaryota T 2mpl 1 A A FOG1_MOUSE FRIEND OF GATA PROTEIN 1, FOG-1, FRIEND OF GATA 1, ZINC FINGER PROTEIN MULTITYPE 1 PWSGPEELELALQDGQRCVRARLSLTEGLSWGPFYGSIQTRALSPEREEPGPAVTLMVDESCWLRMLPQVLTEEAANSEIYRKDDALWCRVTKVVPSGGLLYVRLVTEPHGAPRHPVQEPVEPGGLA 127 T 0.069 SET unphh F Eukaryota T 2mpm 2 B B CCR3 VETFGTTSYYDDVGLL 16 T 2.3 G6PD_C pdbhh F T 2mpo 1 A A Q967S9_TOXGO MIC2-associated protein TFLELVEVPCNSVHVQGVMTPNQMVKVTGAGWDNGVLEFYVTRPTKTGGDTSRSHLASIMCYSKDIDGVPSDKAGKCFLKNFSGEDSSEIDEKEVSLPIKSHNDAFMFVCSSNDGSALQCDVFALDNTNSSDGWKVNTVDLGVSVSPDLAFGLTADGVKVKKLYASSGLTAINDDPSLGCKA 182 T 3.9E-05 Etmic-2 unphh F Eukaryota T 2mps 2 B B P73_HUMAN P53-LIKE TRANSCRIPTION FACTOR, P53-RELATED PROTEIN DGGTTFEHLWSSLEPD 16 T 0.019 P53_TAD unphh F Eukaryota T 2mpv 1 A A O30595_ECOLX Major fimbrial subunit of aggregative adherence fimbria II AafA NFCDITITPATNRDVNVDRSANIDLSFTIRQPQRCADAGMRIKAWGEANHGQLLIKPQGGNKSAGFTLASPRFSYIPNNPANIMNGFVLTNPGVYQLGMQGSITPAIPLRPGLYEVVLNAELVTNDNKQNATAVAKTATSTITVV 145 T 0.0003 SEF14_adhesin unphh F Bacteria T 2mq2 1 A A CDP-1 peptide, Cysteine Deleted Protegrin-1 RGGRLYRRRFVVGR 14 T 6.9 Sid-5 pdbhh F T 2mq4 1 A A RR11 peptide from Cysteine Deleted Protegrin-1 RLYRRRFVVGR 11 T 2.9 Sid-5 pdbhh F T 2mq5 1 A A LR10 peptide from Cysteine Deleted Protegrin-1 LYRRRFVVGR 10 T 3.7 DUF2623 pdbhh F T 2mq8 1 A A De novo designed protein LFR1 MLTVEVEVKITADDENKAEEIVKRVIDEVEREVQKQYPNATITRTLTRDDGTVELRIKVKADTEEKAKSIIKLIEERIEEELRKRDPNATITRTVRTEVGSSWSLEHHHHHH 112 T 0.00044 CinA_KH pdb F T 2mqd 1 A A A5VHK8_LACRD Uncharacterized protein GHMKFTDQQIGVLAGLAISPEWLKQNIAANQLVYGIVKPSDTVPAGVDDYSYLVAADDQDGTIIFFKAEGQTVIIKYTSQRNTKLKAKALTLSQLKKEFYQTRSQKREVDDYVAGLRTE 119 T 0.012 Imm42 pdbpssm F Bacteria T 2mr5 1 A A De novo designed Protein OR457 MGTVVIVVSNDERILEELLEVVLKSDPNVKTVRTDDKEKVKEEIEKARKQGRPIVIFIRGAYEEVVRDIVEYAQKEGLRVLVIKVAQDQELLERFYEQLKKDGVDVRVTDNEDEAKKRLKELLEKVGSLEHHHHHH 136 T 0.00094 ANF_receptor pdbpercent F T 2mra 1 A A De novo designed protein OR459 MAGKELRVEIKIDCGNDDKETTYDLYFSKAEEAKELLKKVAEKAADKIKKQGCKRVKIRFEKKGLDDDARKKAKKWALEVANKIANELGAKQSTTTTDGDTFEVEVILELEHHHHHH 117 T 0.0058 DUF4230 pdbpercent F T 2mrk 2 B B FYN_HUMAN PROTO-ONCOGENE SYN, PROTO-ONCOGENE C-FYN, SRC-LIKE KINASE, SLK, P59-FYN EPQXQPGENL 10 T 5.5 Leader_Erm pdbhh F Eukaryota T 2mrl 1 A A Q2SV23_BURTA Uncharacterized protein BTH I2711 MDRIFMTRTEALEFLLKAHQTAVDKIGHPSHKQTPADHAAIEALDRLLLDVRARRVDQFQINASAAQIIVTD 72 T 7.6 Thioredoxin_11 pdbhh F Bacteria T 2ms4 2 B B CRK_HUMAN Peptide PEPGPYAQP 9 T 8.9 HMMR_N pdbhh F Eukaryota T 2msa 1 A A CSP_PLAFA Circumsporozoite protein peptide KNSFSLGENPNANPX 15 T 3.1 DUF1930 pdbhh F Eukaryota T 2msf 1 A A KEX11_TITSE TS11 KPKCGLCRYRCCSGGCSSGKCVNGACDCS 29 T 0.85 Toxin_2 pdb F Eukaryota T 2msq 1 A A Conotoxin cBru9a SCGGSCFGGCWPGCSCYARTCFRDGLP 27 T 0.048 Cyclotide pdbhh F T 2msr 1 A A KMT2A_HUMAN LYSINE N-METHYLTRANSFERASE 2A, ALL-1, CXXC-TYPE ZINC FINGER PROTEIN 7, MYELOID/LYMPHOID OR MIXED-LINEAGE LEUKEMIA, MYELOID/LYMPHOID OR MIXED-LINEAGE LEUKEMIA PROTEIN 1, TRITHORAX-LIKE PROTEIN, ZINC FINGER PROTEIN HRX, MLL CLEAVAGE PRODUCT N320, N-TERMINAL CLEAVAGE PRODUCT OF 320 KDA, P320, MLL CLEAVAGE PRODUCT C180, C-TERMINAL CLEAVAGE PRODUCT OF 180 KDA, P180 GGSGEDEQFLGFGSDEEVRVR 21 T 4.7 EF-1_beta_acid pdbhh F Eukaryota T 2mtg 1 A A LARP6_HUMAN ACHERON, ACHN, LA RIBONUCLEOPROTEIN DOMAIN FAMILY MEMBER 6 GNENLPSKMLLVYDLYLSPKLWALATPQKNGRVQEKVMEHLLKLFGTFGVISSVRILKPGRELPPDIRRISSRYSQVGTQECAIVEFEEVEAAIKAHEFMITESQGKENMKAVLIGMKP 119 T 6.7E-05 Nup35_RRM pdbhh F Eukaryota T 2mtl 1 A A De novo designed protein FR55 OR109 MGEMDIRFRGDDLEALEKALKEMIRQARKFAGTVTYTLDGNDLEIRITGVPEQVRKELAKEAERLAKEFNITVTYTIRGSLEHHHHHH 88 T 0.0035 DUF2067 pdbpercent F T 2mtm 1 A A B9T5G6_RICCO STABLE PEPTIDE BIOMARKER RCB-1 ARCCLVMPVPPFACVKFCS 19 T 0.017 GRP unp F Eukaryota T 2mto 1 A A CA1A_CONRE Alpha-conotoxin RgIA GXCSDPRXRYRCR 13 T 0.00026 Toxin_8 unp F Eukaryota T 2mtq 1 A A Designed Peptide MGSWAEFKQRLAAIKTRCQALGGSEAECAAFEKEIAAFESELQAYKGKGNPEVEALRKEAAAIRDECQAYRHN 73 T 0.021 DUF1202 pdb F T 2mts 1 A A POLG_HCVJ4 HEPATITIS C VIRUS P7 PROTEIN ALENLVVLNAASVAGAHGILSFLVFFSAAWYIKGRLAPGAAYAFYGVWPLLLLLLALPPRAYA 63 T 0.23 FixQ pdbpercent T Viruses T 2mtw 1 A A EBA1_PLAFC EBA-175 YTNQNINISQERDLQKHGFH 20 T 0.023 DBP pdbhh F Eukaryota T 2mty 1 A A Q9U3Y8_PLAFA STARP antigen VIKHNRFLSEYQSNFLGGGY 20 T 0.83 Yos9_DD pdbhh F Eukaryota T 2mtz 2 B,C,D,E,F,G B,C,D,E,F,G intact bacterial peptidoglycan AXXX 4 T 380 NSF pdbhh F F 2mu6 1 A A Q9U3Y8_PLAFA STARP antigen KSMINAYLDKLDLETVRKIH 20 T 1.2 zf-Nse unppssm F Eukaryota T 2mu7 1 A A MSP1_PLAFW 1513 MSP-1 peptide GYSLFQKEKMVLNEGTSGTA 20 T 4.4 NOP5NT pdbhh F Eukaryota T 2mu8 1 A A MSA2_PLAF7 MSP-2 peptide KNESKYSNTFINNAYNMSIR 20 T 4.3 GatD_N pdbhh F Eukaryota T 2mu9 1 A A ABRA_PLAF7 P101/acidic basic repeat antigen KMNMLKENVDYIQKNQNLFK 20 T 0.91 ComX pdbhh F Eukaryota T 2muf 1 A A M1EUE6_PLAFA TRSP SDVRYNKSFINNRLLNEHAH 20 T 0.65 preATP-grasp_3 unp F Eukaryota T 2mug 1 A A SERA_PLAFG Serine-repeat antigen protein XNEVSERVHVYHILKHIKDGKX 22 T 8.4 Gemini_AC4_5 pdbhh F Eukaryota T 2muh 1 A A PG2_PIG PG-2 RGGRLCYCRRRFCVCV 16 T 0.075 Defensin_1 pdbhh F Eukaryota T 2muj 1 A A T1RTG8_PLAFA 111 KDA ANTIGEN, P126 YDNILVKMFKTNENNDKSELI 21 T 16 DUF4643 pdbhh F Eukaryota T 2mun 1 A A TX6A_SCOMU MU-SLPTX-SSM6A ADNKCENSLRREIACGQCRDKVKTDGYFYECCTSDSTFKKCQDLLH 46 T 4.1 Ribosomal_L32p unphh F Eukaryota T 2muz 1 A,B,C,D A,B,C,D designed rocker protein YYKEIAHALFSALXALSELYIAVRYX 26 T 9.4 DUF6332 pdbhh F T 2mv3 1 A A YME1_YEAST PROTEIN OSD1, TAT-BINDING HOMOLOG 11, YEAST MITOCHONDRIAL ESCAPE PROTEIN 1, YME1-N MVAVSHAMLATREQEANKDLTSPDAQAAFYKLLLQSNYPQYVVSRFETPGIASSPECMELYMEALQRIGRHSEADAVRQNLEHHHHHH 88 T 0.027 Imm49 pdbpssm F Eukaryota T 2mv7 2 B B DOT1L_HUMAN DOT1-LIKE PROTEIN, HISTONE H3-K79 METHYLTRANSFERASE, H3-K79-HMTASE, LYSINE N-METHYLTRANSFERASE 4 TNKLPVSIPLASVVLPSRAERARST 25 T 3.4 CCDC73 unphh F Eukaryota T 2mva 1 A A TX41A_SCOMU RhTx toxin LNNPCNGVTCPSGYRCSIVDKQCIKKE 27 T 0.018 Secretogranin_V unppssm F Eukaryota T 2mvk 1 A A TACD2_HUMAN CELL SURFACE GLYCOPROTEIN TROP-2, MEMBRANE COMPONENT CHROMOSOME 1 SURFACE MARKER 1, PANCREATIC CARCINOMA MARKER PROTEIN GA733-1 TNRRKSGKYKKVEIKELGELRKEPSL 26 T 0.0044 DAG1 unppercent F Eukaryota T 2mvl 1 A A TACD2_HUMAN CELL SURFACE GLYCOPROTEIN TROP-2, MEMBRANE COMPONENT CHROMOSOME 1 SURFACE MARKER 1, PANCREATIC CARCINOMA MARKER PROTEIN GA733-1 TNRRKSGKYKKVEIKELGELRKEPSL 26 T 0.0044 DAG1 unppercent F Eukaryota T 2mvt 1 A A TX31A_SCOSD Scoloptoxin SSD609 ADDKCEDSLRREIACTKCRDRVRTDDYFYECCTSESTFKKCQTMLHQ 47 T 2.5 DUF2614 unphh F Eukaryota T 2mw3 1 A A A0A0C2JEQ8_9ACTN Lasso peptide SLGSSPYNDILGYPALIVIYP 21 T 0.0013 DUF5972 unp F Bacteria T 2mw7 1 A A A0A0R4I952_CONMO Mo3964 DGECGDKDEPCCGRPDGAKVCNDPWVCILTSSRCENP 37 T 0.19 Sin3_corepress pdb F Eukaryota T 2mwi 1 A A TDIF1_HUMAN TERMINAL DEOXYNUCLEOTIDYLTRANSFERASE-INTERACTING FACTOR 1, TDIF1, TDT-INTERACTING FACTOR 1 GAREGPKWDPARLNESTTFVLGSRANKALGMGGTRGRIYIKHPHLFKYAADPQDKHWLAEQHHMRATGGKMAYLLIEEDIRDLAASDDYRGCLDLKLEELKSFVLPSWMVEKMRKYMETLRT 122 T 5.7E-06 CRC_subunit pdbhh F Eukaryota T 2mwl 1 A A antimicrobial peptide VARGWKRKCPLFGKGG 16 T 0.0078 Flavi_glycoprot pdbhh F T 2mwn 1 A A AB1IP_HUMAN APBB1-INTERACTING PROTEIN 1, PROLINE-RICH EVH1 LIGAND 1, PREL-1, PROLINE-RICH PROTEIN 73, RAP1-GTP-INTERACTING ADAPTER MOLECULE, RIAM, RETINOIC ACID-RESPONSIVE PROLINE-RICH PROTEIN 1, RARP-1 DIDQMFSTLLGEMDLLTQSLGVDT 24 T 1.9 Drf_DAD pdbhh F Eukaryota T 2mwo 2 B B P53_HUMAN P53K370ME2, ANTIGEN NY-CO-13, PHOSPHOPROTEIN P53, TUMOR SUPPRESSOR P53 RAHSSHLKSKKGQST 15 T 8.2 RE_NgoPII pdbhh F Eukaryota T 2mwp 2 B B P53_HUMAN P53K382ME2, ANTIGEN NY-CO-13, PHOSPHOPROTEIN P53, TUMOR SUPPRESSOR P53 STSRHKKLMFKT 12 T 34 DUF420 pdbhh F Eukaryota T 2mwt 1 A A CAMP_CRODU Cathelicidin-like peptide KRFKKFFKKVKKSVKKRLKKIFKKPMVIGVTIPF 34 T 0.0032 Sigma70_ner unp F Eukaryota T 2mx6 2 B B (PHQ)WV peptide XWV 3 T 170 Kelch_2 pdbhh F F 2mxg 1 A A antimicrobial peptide VARGWKRKCPLFGKGG 16 T 0.0078 Flavi_glycoprot pdbhh F T 2mxh 1 A A antimicrobial peptide VARGWKRKCPLFGKGG 16 T 0.0078 Flavi_glycoprot pdbhh F T 2mxx 1 A A A8AZZ3_STRGC Amylase-binding protein AbpA MADEATDAARNNDGAYYLQTQFTNADKVNEYLAQHDGEIRAEAAADPAVVAAKAALDAVEGGSHNYGEVKAAYEAAFNNAFNAVRNKYVQRFQATYNNATEQEGKTYIQGETPEQANARYLKRVGAANNQNPAAEDKGATTPASKEEAKKSEAAAKNAGKAAGKALPKTSAVKHHHHHH 179 T 0.089 DUF3752 pdbpssm F Bacteria T 2my3 2 B B PML1_YEAST Pre-mRNA leakage protein 1 GSKSQYIDIMPDFSPSGLLELES 23 T 7.1 VirE_N pdbhh F Eukaryota T 2myh 1 A A A0A0G3F8Z3_9ARAC Omega-Tbo-IT1 toxin CASKNERCGNALYGTKGPGCCNGKCICRTVPRKGVNSCRCM 41 T 0.012 Conotoxin unphh F Eukaryota T 2myv 1 A A Q8J180_MAGGR Uncharacterized protein APQDNTSMGSSHHHHHHSSGRENLYFQGHMAWKDCIIQRYKDGDVNNIYTANRNEEITIEEYKVFVNEACHPYPVILPDRSVLSGDFTSAYADDDESC 98 T 5.4 Ceramidase_alk unphh F Eukaryota T 2myw 1 A A B9WZW9_MAGOR AVR-Pia protein APQDNTSMGSSHHHHHHSSGRENLYFQGHMAAPARFCVYYDGHLPATRVLLMYVRIGTTATITARGHEFEVEAKDQNCKVILTNGKQAPDWLAAEPY 97 T 0.012 Pirin_C unppssm F Eukaryota T 2mz0 1 A A DEF32_ARATH Defensin-like protein 32 KDIDGRKPLLIGTCIEFPTEKCNKTCIESNFAGGKCVHIGQSLDFVCVCFPKYYI 55 T 0.00026 Gamma-thionin unppssm F Eukaryota T 2mz4 1 A A TX6A_SCOMU MU-SLPTX-SSM6A ADNKCENSLRREIACGQCRDKVKTDGYFYECCTSDSTFKKCQDLLH 46 T 4.1 Ribosomal_L32p unphh F Eukaryota T 2mz6 1 A,B A,B PG3_PIG PG-3 RGGGLCYCRRRFCVCVGR 18 T 0.16 Defensin_1 pdbhh F Eukaryota T 2n01 1 A A Q8PJB3_XANAC VirB7 protein XTKPAPDFGGRWKHVNHFDEAPTEX 25 T 0.056 BNR_6 pdbhh F Bacteria T 2n08 1 A A Short hydrophobic peptide with cyclic constraints HAEGTFTSDFFX 12 T 0.00035 Hormone_2 pdbhh F T 2n09 1 A A Short hydrophobic peptide with cyclic constraints HXEGTFTSDFFX 12 T 0.00035 Hormone_2 pdbhh F T 2n0i 1 A A di-sulfide 11mer peptide HXEGXFTSDFXX 12 T 0.019 Hormone_2 pdbhh F T 2n0n 1 A A lactam (5,9) 11mer peptide HXEGKFTSEFXX 12 T 0.032 Hormone_2 pdbhh F T 2n0o 1 A A ALBO1_HYPAB HY-A1 IFGAILPLALGALKNLIKX 19 T 4 SH3_7 unphh F Eukaryota T 2n0v 1 A A CN-AMP1 SVAGRAQGMX 10 T 10 Dirigent pdbhh F T 2n0y 2 B B NSS_RVFVZ Non-structural protein NS-S GGGGYDVEMESEEESDDDGFVEVD 24 T 0.5 LRR19-TM pdbhh T Viruses T 2n0z 1 A A MYO6_HUMAN Unconventional myosin-VI GPLGSPNSGTKKYDLSKWKYAELRDTINTSCDIELLAACREEFHRRLKVYH 51 T 3 Caldesmon unphh F Eukaryota T 2n10 1 A A MYO6_HUMAN UNCONVENTIONAL MYOSIN-6 GPLGSPNSGTKKYDLSKWKYAELRDTINTSCDIELLAACREEFHRRLKVYHAWKSKNKKR 60 T 3 Caldesmon unphh F Eukaryota T 2n11 1 A A MYO6_HUMAN UNCONVENTIONAL MYOSIN-6 QQQAVLEQERRDRELALRIAQSEAELISDEAQADLALRRSLDSYPVSKNDGTRPKMTPEQMAKEMSEFLSRGPA 74 T 0.00031 BUD22 unp F Eukaryota T 2n12 1 A A MYO6_HUMAN UNCONVENTIONAL MYOSIN-6 RPKMTPEQMAKEMSEFLSRGPAVLATKAAAGTKKYDLSKWKYAELRDTINTSCDIELLAACREEFHRRLKVYHAWKSKNKKR 82 T 0.027 BUD22 unppercent F Eukaryota T 2n13 1 A,D A,D MYO6_HUMAN UNCONVENTIONAL MYOSIN-6 GTKKYDLSKWKYAELRDTINTSCDIELLAACREEFHRRLKVYH 43 T 3 Caldesmon unphh F Eukaryota T 2n16 1 A A DHX36_HUMAN DEAH BOX PROTEIN 36, G4-RESOLVASE 1, G4R1, MLE-LIKE PROTEIN 1, RNA HELICASE ASSOCIATED WITH AU-RICH ELEMENT ARE SMHPGHLKGREIGMWYAKKQ 20 T 2.6 PsaL pdbhh F Eukaryota T 2n1e 1 A,B,C,D,E,F,G,H A,B,C,D,E,F,G,H MAX1 peptide VKVKVKVKVXPTKVKVKVKVX 21 T 22 Mgr1 pdbhh F F 2n1p 1 A A POLG_HCVH Non-structural protein 5B, NS5B HSVSHARPRWFWFSLLLLAAGVGIYLLPNR 30 T 0.081 BSMAP pdbhh T Viruses T 2n21 1 A A DHX36_HUMAN DEAH BOX PROTEIN 36, G4-RESOLVASE 1, G4R1, MLE-LIKE PROTEIN 1, RNA HELICASE ASSOCIATED WITH AU-RICH ELEMENT ARE SMHPGHLKGREIGMWYAKKQ 20 T 2.6 PsaL pdbhh F Eukaryota T 2n24 1 A A O2VC1_CONVC O2_contryphan_Vc1 QWCQPGYAYNPVLGICTITLSRIEHPGNYDY 31 T 1.4 ANATO pdbhh F Eukaryota T 2n2a 1 A,B A,B ERBB2_HUMAN METASTATIC LYMPH NODE GENE 19 PROTEIN, MLN 19, PROTO-ONCOGENE NEU, PROTO-ONCOGENE C-ERBB-2, TYROSINE KINASE-TYPE CELL SURFACE RECEPTOR HER2, P185ERBB2 AEQRASPLTSIISAVVGILLVVVLGVVFGILIKRRQQKIRKYTMRRLLQETELVEPLG 58 T 0.0017 Mucin15 pdbhh F Eukaryota T 2n2c 1 A A TADBP_HUMAN TDP-43 MGGGMNFGAFSINPAMMAAAQAALQSSWGMMGMLASQQNQSGP 43 T 0.0072 Glucosaminidase pdb F Eukaryota T 2n2f 1 A A PDYN_HUMAN PROENKEPHALIN-B, BETA-NEOENDORPHIN-DYNORPHIN, PREPRODYNORPHIN YGGFLRRIRPKLK 13 T 0.025 Op_neuropeptide pdbhh F Eukaryota T 2n2g 1 A A A0A1A9T938_9METZ Asteropsin_F CPGEGEECDVEFNPCCPPLTCIPGDPYGICYII 33 T 0.00039 Tachystatin_A pdbhh F Eukaryota T 2n2h 1 A A SDS3_MOUSE SUPPRESSOR OF DEFECTIVE SILENCING 3 PROTEIN HOMOLOG SNAAQLNYLLTDEQIMEDLRTLNKLKS 27 T 2.9 DUF1639 pdbhh F Eukaryota T 2n2j 1 A,B A,B EBNA2_EBVB9 EBNA-2, EBV NUCLEAR ANTIGEN 2 GAMEMPTFYLALHGGQTYHLIVDTDSLGNPSLSVIPSNPYQEQLSDTPLIPLTIFVGENTGV 62 T 2.4 Swi6_N pdbhh T Viruses T 2n2s 1 A A A0A182DV16_9SPIT pheromone Ep-1 SCGSECAPEPDCWGCCLVQCAPSICAGWCGGS 32 T 1.4 DUF3079 pdbhh F Eukaryota T 2n2t 1 A A OR303 MGQWQIKIYSENEREFRELIERLEEERPSVQYTETTRNGRRQLTIRSNDKNEVDRILEEVRRKVPNARVRETETGSLEHHHHHH 84 T 0.025 F_actin_bind pdb F T 2n2u 1 A A OR358 MVDLKIDVSDDEEAEKIIREIREQWPKATVTRTNGDIKLDAQTEKEAEKMEKAVKKVKPNATIRKTGGSLEHHHHHH 77 T 0.0075 MmoB_DmpM pdb F T 2n31 1 A A TOLIP_HUMAN Toll interacting protein variant GPLGSMATTVSTQRGPVYIGELPQDFLRITPTQQQRQVQLDAQAAQQLQYGGAVGTVG 58 T 0.11 RNA_pol_Rpb1_1 unppssm F Eukaryota T 2n37 1 A A B9WZW9_MAGOR AVR-Pia protein APARFCVYYDGHLPATRVLLMYVRIGTTATITARGHEFEVEAKDQNCKVILTNGKQAPDWLAAEPY 66 T 0.012 Pirin_C unppssm F Eukaryota T 2n3a 1 A A POGZ_HUMAN SUPPRESSOR OF HAIRY WING HOMOLOG 5, ZINC FINGER PROTEIN 280E, ZINC FINGER PROTEIN 635 EGESETESFYGFEEAD 16 T 1.1 Sororin pdbhh F Eukaryota T 2n3p 1 A A A0A1A9T940_9METZ Asteropsin_G QWCAEEGESCEVYPCCDGLICYPTFPEPICGV 32 T 0.0075 Tachystatin_A pdbhh F Eukaryota T 2n3x 1 A A TADBP_HUMAN TDP-43 MNFGAFSINPAMMAAAQAALQSSWGMMGMLASQQNQSGPSGNNQNQGNMQ 50 T 0.0075 Glucosaminidase pdbpercent F Eukaryota T 2n3z 1 A A OR446 MGRLVVVVTSEQLKEEVRKKFPQVEVRLVTTEEDAKQVIKEIQKKGVQKVVLVGVSEKLLQKIKQEANVQVYRVTSNDELEQVVKDVKGSGLEHHHHHH 99 T 0.00041 PrpR_N pdbpssm F T 2n4g 1 A A TADBP_HUMAN TDP-43 MNFGAFSINPAMMAAAQAALQSSWDMMGMLASQQNQSGPSGNNQNQGNMQ 50 T 0.0075 Glucosaminidase pdbpercent F Eukaryota T 2n4h 1 A A TADBP_HUMAN TDP-43 MNFGAFSINPAMMAAAQAALQSSWGMMGMLASRQNQSGPSGNNQNQGNMQ 50 T 0.0075 Glucosaminidase pdbpercent F Eukaryota T 2n4n 1 A A DESIGNED BETA SHEET XERFYEKXXVQKFIRVXGVTIREKX 25 T 15 DUF3692 pdbhh F T 2n4q 1 A A CBX8_HUMAN POLYCOMB 3 HOMOLOG, PC3, HPC3, RECTACHROME 1 TQGGRPSLIARIPVARILGDPEEE 24 T 86 VARLMGL pdbhh F Eukaryota T 2n5c 1 A A A0A0F7VRL1_9ACTN chaxapeptin GFGSKPLDSFGLNFF 15 T 16 CHB_HEX_C pdbhh F Bacteria T 2n5d 1 A A A4PHN0_STRVG fusion protein of two PKS domains GPGSYTGAGEPSQADLDALLSAVRDNRLSIEQAVTLLTPRRGGGSGGGSMDAKEILTRFKDGGLDRAAAQALLAGRTPAAAPRP 84 T 0.56 VbhA pdbhh F Bacteria T 2n5q 1 A A A0A0S2KUN2_9LAMI cysteine-rich peptide jS1 QLCLQCRSNSDCNIIWRICRDGCCNVI 27 T 4.4 ACI44 unphh F Eukaryota T 2n5s 1 A A EGFR_HUMAN PROTO-ONCOGENE C-ERBB-1, RECEPTOR TYROSINE-PROTEIN KINASE ERBB-1 GSCKIPSIATGMVGALLLLLVVALGIGLFMRRRHIVRKRTLRRLLQERELVEGG 54 T 0.0005 GAPT pdb F Eukaryota T 2n5w 1 A A Octyl-tridecaptin A1 XXGSWSXXFEVXA 13 T 12 DUF5626 pdbhh F T 2n5y 1 A A Octyl-tridecaptin A1 XXGXXSXXFEVXA 13 T 12 DUF5626 pdbhh F T 2n65 1 A,B A,B antimicrobial peptide VG16KRKP VARGWKRKCPLFGKGG 16 T 0.0078 Flavi_glycoprot pdbhh F T 2n67 1 A B Q81AN8_BACCR Hemolysin II DNQKALEEQMNSINSVNDKLNKGKGKLSLSMNGNQLKATSSNAGYGISYEDKNWGIFVNGEKVYTFNEKSTVGNISNDINKLNIKGMYIEIKQI 94 T 0.0097 Gal-bind_lectin pdbpercent F Bacteria T 2n68 1 A A E8RMD3_ASTEC LASSO PEPTIDE GLSQGVEPDIGQTYFEESRINQD 23 T 4.6 LSPR pdbhh F Bacteria T 2n69 1 A A DEF_PENBA BRAZZEIN QDKCKKVYENYPVSKCQLRIANQCNYDCKLDKHARSGECFYDEKRNLQCICDYCEY 56 T 0.00073 Toxin_3 pdb F Eukaryota T 2n6h 1 A A designed 2-stranded parallel beta-sheet XERFYEKXXVQKFIRX 16 T 3.8 DUF3692 pdbhh F T 2n6i 1 A A designed 2-stranded parallel beta-sheet XQKFIRVXGVTIREKX 16 T 7.6 TIP41 pdbhh F T 2n6n 1 A A TXAG4_AGEOR U4-AGTX-AO1A, MU-2AAGA_15 GYCAEKGIKCHNIHCCSGLTCKCKGSSCVCRK 32 T 0.04 Toxin_7 pdbpercent F Eukaryota T 2n6u 1 A A E8RUP9_ASTEC Astexin2-dC4 GLTQIQALDSVSGQFRDQLG 20 T 7.9 BCMA-Tall_bind unphh F Bacteria T 2n6v 1 A A E8RUP8_ASTEC ASTEXIN3 GPTPMVGLDSVSGQYWDQHAPLAD 24 T 2.1 Cut12 pdbhh F Bacteria T 2n72 1 A A GCP60_HUMAN ACYL-COA-BINDING DOMAIN-CONTAINING PROTEIN 3, GOLGI COMPLEX-ASSOCIATED PROTEIN 1, GOCAP1, GOLGI PHOSPHOPROTEIN 1, GOLPH1, PBR- AND PKA-ASSOCIATED PROTEIN 7, PERIPHERAL BENZODIAZEPINE RECEPTOR-ASSOCIATED PROTEIN PAP7 MQQKQQIMAALNSQTAVQFQQYAAQQYPGNYEQQQILIRQLQEQHYQQYMQQLYQVQLAQQQAALQKQQ 69 T 0.011 Sulfatase pdbpercent F Eukaryota T 2n73 1 A A GCP60_HUMAN ACYL-COA-BINDING DOMAIN-CONTAINING PROTEIN 3, GOLGI COMPLEX-ASSOCIATED PROTEIN 1, GOCAP1, GOLGI PHOSPHOPROTEIN 1, GOLPH1, PBR- AND PKA-ASSOCIATED PROTEIN 7, PERIPHERAL BENZODIAZEPINE RECEPTOR-ASSOCIATED PROTEIN PAP7 MQQKQQIMAALNSQTAVQFQQYAAQQYPGNYEQQQILIRQLQEQHYQQYMQQLYQVQLAQQQAALQKQQ 69 T 0.011 Sulfatase pdbpercent F Eukaryota T 2n73 2 B B PI4KB_HUMAN Phosphatidylinositol 4-kinase beta GAMVEARSLAVAMGDTVVEPAPLKPTSEPTSGPPGNNGGSLLSVITEGVGELSVIDPEVAQKACQEVLEKVKLLHGGVAV 80 T 47 Pik1 pdbhh F Eukaryota T 2n75 1 A A De novo designed protein MGRLVVVVTSEQLKEEVRKKFPQVEVRLVTTEEDAKQVIKEIQKKGVQKVVLVGVSEKLLQKIKQEANVQVYRVTSNDELEQVVKDVKGSGLEHHHHHH 99 T 0.00041 PrpR_N pdbpssm F T 2n76 1 A A De novo designed protein LFR1 MLTVEVEVKITADDENKAEEIVKRVIDEVEREVQKQYPNATITRTLTRDDGTVELRIKVKADTEEKAKSIIKLIEERIEEELRKRDPNATITRTVRTEVGSSWSLEHHHHHH 112 T 0.00044 CinA_KH pdb F T 2n77 2 B B PCP4_HUMAN BRAIN-SPECIFIC ANTIGEN PCP-4, BRAIN-SPECIFIC POLYPEPTIDE PEP-19 MAERQGAGATNGKDKTSGENDGQKKVQEEFDIDMDAPETERAAVAIQSQFRKFQKKKAGSQS 62 T 0.0063 IQ pdbhh F Eukaryota T 2n7f 1 A A muO-conotoxin MfVIA RDCQEKWEYCIVPILGFVYCCPGLICGPFVCV 32 T 0.00046 Conotoxin pdbhh F T 2n7i 1 A A PRLR_HUMAN PRL-R GSFTMNDTTVWISVAVLSAVICLIIVWAVALKGYSMV 37 T 0.00011 IFNGR1 unphh F Eukaryota T 2n7n 1 A A Peptide PG-989 XXDPPXRWKX 10 T 1.1 DUF2678 pdbhh F T 2n7o 1 A A Peptide PG-990 XXDPPXRWKX 10 T 2.8 DUF765 pdbhh F T 2n7t 1 A A Peptide PG-992 XXDWPXRWKX 10 T 1.5 Xpo1 pdbhh F T 2n85 1 A A SPN1A_OXYTA OTTX1A KFKWGKLFSTAKKLYKKGKKLSKNKNFKKALKFGKQLAKNL 41 T 0.0033 Latarcin unphh F Eukaryota T 2n86 1 A A SPN1A_OXYTA OTTX1A GTPVGNNKCWAIGTTCSDDCDCCPEHHCHCPAGKWLPGLFRCTCQVTESDKVNKCPPAE 59 T 1.6 DUF5814 pdbpssm F Eukaryota T 2n8d 1 A A antimicrobial peptide Lavracin WDPYFAGVKKLTKAILAVRAX 21 T 8.8 YceD pdbhh F T 2n8j 2 B B NOS3_HUMAN CONSTITUTIVE NOS, CNOS, EC-NOS, ENDOTHELIAL NOS, ENOS, NOS TYPE III, NOSIII TRKKTFKEVANAVKISASLMGT 22 T 4.1 DUF2774 pdbhh F Eukaryota T 2n9a 1 A A DCRLN_OREDC Decoralin SLLSLIRKLITX 12 T 3.2 BDV_P10 unphh F Eukaryota T 2n9e 1 A A UIMC1_HUMAN RECEPTOR-ASSOCIATED PROTEIN 80, RETINOID X RECEPTOR-INTERACTING PROTEIN 110, UBIQUITIN INTERACTION MOTIF-CONTAINING PROTEIN 1 XEDAFIVISDSDGEX 15 T 0.064 MLIP pdbhh F Eukaryota T 2n9m 1 A A antimicrobial peptide VARGWKRKCPLFGKGG 16 T 0.0078 Flavi_glycoprot pdbhh F T 2n9n 1 A A antimicrobial peptide VARGWKRKCPLFGKGG 16 T 0.0078 Flavi_glycoprot pdbhh F T 2n9x 2 B B FUND1_HUMAN FUN14 domain-containing protein 1 DYESDDDSYEVLDLTEY 17 T 1.2 DUF6327 unphh F Eukaryota T 2n9z 1 A A DKTX_HAPSC TAU-TRTX-HS1A, DOUBLE-KNOT TOXIN, DKTX DCAKEGEVCSWGKKCCDLDNFYCPMEFIPHCKKYKPYVPVTT 42 T 0.15 Ragweed_pollen pdbhh F Eukaryota T 2na6 1 A,B,C A,B,C TNR6_MOUSE APO-1 ANTIGEN, APOPTOSIS-MEDIATING SURFACE ANTIGEN FAS, FASLG RECEPTOR RNRLWLLTILVLLIPLVFIYRKYRKRKS 28 T 0.093 DAG1 pdbhh F Eukaryota T 2na7 1 A,B,C A,B,C TNR6_HUMAN APO-1 ANTIGEN, APOPTOSIS-MEDIATING SURFACE ANTIGEN FAS, FASLG RECEPTOR RSNLGWLSLLLLPIPLIVWVKRKEVQKT 28 T 0.027 KdpC unppercent F Eukaryota T 2na8 1 A A IL3RB_HUMAN CDW131, GM-CSF/IL-3/IL-5 RECEPTOR COMMON BETA SUBUNIT GKRSWDTESVLPMWVLALIVIFLTIAVLLALRFCGIYGYRLRRK 44 T 0.0006 Interfer-bind unppssm F Eukaryota T 2na9 1 A A IL3RB_HUMAN CDW131, GM-CSF/IL-3/IL-5 RECEPTOR COMMON BETA SUBUNIT GKRSWDTESVLAMWVLALIVIFLTIAVLLALRFCGIYGYRLRRK 44 T 0.0006 Interfer-bind unppssm F Eukaryota T 2nae 1 A A CD28_MOUSE T-cell-specific surface glycoprotein CD28 GTNSRRNRLLQSDYMNMTPRRPGLTRKPYQPYAPARDFAAYRP 43 T 0.0075 LAX unppercent F Eukaryota T 2naj 1 A A DKTX_HAPSC TAU-TRTX-HS1A, DOUBLE-KNOT TOXIN, DKTX NCAKEGEVCGWGSKCCHGLDCPLAFIPYCEKYR 33 T 0.0099 Conotoxin_I2 pdb F Eukaryota T 2nal 1 A A Retro-KR-12 RLFDKIRQVIRK 12 T 6.3 TnpW pdbhh F T 2nau 1 A A entity KYEITTIHNLARKLTHRLARRNAGATLR 28 T 2.7 CbtA_toxin pdbhh F T 2nb2 1 A A A0A1S4NYD1_NIGSA nigellin-1.1 DRYQDCLSECNSRCTYIPDYAGMRACIGLCAPACLTSR 38 T 0.006 TIL pdb F Eukaryota T 2nb5 1 A A A0A023GYK7_PERMO Preproalbumin PawS1 GDCYWTSTPPFFTCTPD 17 T 1.8 ESCRT-II pdbhh F Eukaryota T 2nb6 1 A A A0A023GYI2_GALQU Preproalbumin PawS1 GCYPVPYPPFFTCDPN 16 T 0.1 ESCRT-II pdbhh F Eukaryota T 2nbc 1 A A PON1A_ANOEM poneritoxin WCASGCRKKRHGGCSCX 17 T 0.025 Fib_alpha unphh F Eukaryota T 2nbi 1 A A O22015_CYLFU HEP200 protein QPSDLNPSSQPSECADVLEECPIDECFLPYSDASRPPSCLSFGRPDCDVLPTPQNINCPRCCATECRPDNPMFTPSPDGSPPICSPTMLPTNQPTPPEPSSAPSDCGEVIEECPLDTCFLPTSDPARPPDCTAVGRPDCDVLPFPNNLGCPACCPFECSPDNPMFTPSPDGSPPNCSPTMLPTPQPSTPTVITSPAPSSQPSQCAEVIEQCPIDECFLPYGDSSRPLDCTDPAVNRPDCDVLPTPQNINCPACCAFECRPDNPMFTPSPDGSPPICSPTMMPSPEPSSQPSDCGEVIEECPIDACFLPKSDSARPPDCTAVGRPDCNVLPFPNNIGCPSCCPFECSPDNPMFTPSPDGSPPNCSPTMLPSPSPSAVTVPLTPAPSSAPTRQPSSQPTGPQPSSQPSECADVLELCPYDTCFLPFDDSSRPPDCTDPSVNRPDCDKLSTAIDFTCPTCCPTQCRPDNPMFSPSPDGSPPVCSPTMMPSPLPSPTE 494 T 8.8 Mito_fiss_reg unphh F Eukaryota T 2nbl 1 A A Designed beta-arch XTEIRVXGVTIRMRXSHXFWVQVXXKEFKHX 31 T 2.3 RisS_PPD pdbhh F T 2nc7 1 A A PG5_PIG PG-5 RGGRLCYCRPRFCVCVGR 18 T 0.0091 Tmpp129 pdbhh F Eukaryota T 2ncz 2 B B NSD3_HUMAN NUCLEAR SET DOMAIN-CONTAINING PROTEIN 3, PROTEIN WHISTLE, WHSC1-LIKE 1 ISOFORM 9 WITH METHYLTRANSFERASE ACTIVITY TO LYSINE, WOLF-HIRSCHHORN SYNDROME CANDIDATE 1-LIKE PROTEIN 1, WHSC1-LIKE PROTEIN 1 EIKLKITKTIQN 12 T 18 GIT1_C pdbhh F Eukaryota T 2nd0 2 B B LANA1_HHV8P LANA NLQSSIVKFKKPLPLTQPG 19 T 0.00062 EBV-NA1 unphh T Viruses T 2nd1 2 B B NSD3_HUMAN NUCLEAR SET DOMAIN-CONTAINING PROTEIN 3, PROTEIN WHISTLE, WHSC1-LIKE 1 ISOFORM 9 WITH METHYLTRANSFERASE ACTIVITY TO LYSINE, WOLF-HIRSCHHORN SYNDROME CANDIDATE 1-LIKE PROTEIN 1, WHSC1-LIKE PROTEIN 1 VVPKKKIKKEQVE 13 T 7.7 S1FA pdbhh F Eukaryota T 2nd2 1 A A De novo mini protein HHH_06 APCEDLKERLKKLGMSEECRQRLEKMCKEGTSEDAERMARNCES 44 T 0.66 VMAP-M14 pdbhh F T 2nd3 1 A A De novo mini protein EEH_04 QCYTFRSECTNKEFTVCRPNPEEVEKEARRTKEEECRK 38 T 0.047 YTV pdb F T 2nd4 1 A A I1ZJ30_STRPA Amylase-binding protein AbpA GENPSASNQLIQKKYVSWRDAADEANTQVAAHEAEIKEETLRQPGVVAAQQALDKANAIVGHDHEQAVKRAQEDYNTAYNEAYNTVRNRYIQVLQQKYIEAAKAQGNYYDETAVEANRTNEQRIADDIKAQTGKDVTVTKDENGNYVVKDEKGNVVATVDKDGKTVKADAKAG 173 T 0.0016 DUF4988 pdbpssm F Bacteria T 2ndc 1 A A CTHL5_BOVIN ANTIBACTERIAL PEPTIDE BMAP-28, MYELOID ANTIBACTERIAL PEPTIDE 28 GGLRSLGRKILRAWKKYG 18 T 0.85 Fungal_KA1 pdbhh F Eukaryota T 2ndd 1 A A KKX51_HETLA HELATX1 SCKKECSGSRRTKKCMQKCNREHGHX 26 T 0.023 ETRAMP unp F Eukaryota T 2nde 1 A A CTHL5_BOVIN ANTIBACTERIAL PEPTIDE BMAP-28, MYELOID ANTIBACTERIAL PEPTIDE 28 IGLRGLGRKIALIHKKYG 18 T 2.3 IMS_HHH pdbhh F Eukaryota T 2ndi 1 A A Q4PN35_IXOSC Putative secreted salivary protein GLCSENGDCAADECCVDTVFEGDMVTRSCEKTTGNFTECPGLTPIA 46 T 0.037 Conotoxin_I2 unppercent F Eukaryota T 2ndl 1 A A PawS derived peptide GPCFPMGPWGPFCIPD 16 T 0.35 Psg1 pdbhh F T 2ndm 1 A A A0A1C7D043_9ASTR PawS derived peptide 21 GRPCYTLQSCFPD 13 T 1.3 Comm pdbhh F Eukaryota T 2ndn 1 A A A0A0A0V2B6_9ASTR PawS1a Derived Peptide 20 GICFKDPFGSTLCAPD 16 T 0.99 C_GCAxxG_C_C pdbhh F Eukaryota T 2nm1 2 B B SYT2_RAT SYTII EDMFAKLKDKFFNEINK 17 T 0.027 DUF4713 unphh F Eukaryota T 2nmb 2 B B PROTEIN (GPPY PEPTIDE) AYIGPXL 7 T 0.29 Crl pdbhh F T 2no3 2 B,D F,G JNK-INTERACTING PROTEIN 1, JIP-1, JNK MAP KINASE SCAFFOLD PROTEIN 1, ISLET-BRAIN 1, IB-1, MITOGEN-ACTIVATED PROTEIN KINASE 8-INTERACTING PROTEIN 1 RPKRPTTLNLF 11 T 12 CTP-dep_RFKase pdbhh F T 2nou 1 A A TKN1_SCYCA Scyliorhinin I AKFDKFYGLM 10 T 0.00029 Tachykinin pdbhh F Eukaryota T 2np0 2 B B SYT2_MOUSE SYNAPTOTAGMIN II, SYTII GESQEDMFAKLKEKFFNEINK 21 T 0.34 Alpha_E2_glycop unphh F Eukaryota T 2nph 2 C S tetrapeptide fragment AETF 4 T 240 DUF5380 pdbhh F F 2nph 3 D T pentapeptide fragment YVDGA 5 T 24 zf-HC2 pdbhh F F 2npm 2 B,D X,Y CONSENSUS PEPTIDE FOR 14-3-3 PROTEINS RAISLP 6 T 54 Tryp_alpha_amyl pdbhh F T 2npp 4 G,H X,Y microcystin LR XLXRXXX 7 T 49 Phage_TAC_2 pdbhh F F 2npv 1 A A ELLVDLL ELXVDXL 7 T 11 Chibby pdbhh F F 2nq8 2 C,D C,D Q9BH77_PLAFA ENOYL-ACP REDUCTASE YTFIDYAIEYSEKYAPLRQKLLSTDIGSVASFLLSRESRAITGQTIYVDNGLNIMFLPDD 60 F F Eukaryota T 2nqa 2 C,D D,E Leupeptin Inhibitor XLLX 4 T 590 DNTTIP1_dimer pdbhh F F 2nr5 1 A,B,C,D,E,F,G,H A,B,C,D,F,G,H,E Q8EDS4_SHEON Hypothetical protein SO2669 SNAMMTKKERIAIQRSMAEEALGKLKAIRQLCGAEDSSDSSDMQEVEIWTNRIKELEDWLWGESPIA 67 T 0.031 DUF2385 unp F Bacteria T 2ns4 1 A A L-22 CYCLIC PEPTIDE RVRTRKGRRIRIPP 14 T 0.24 DUF2835 pdbhh F T 2ns8 2 E,F,G,H,I H,E,F,G,Z 16 residue peptide Tip (Transcription inducing peptide) XWTWNAYAFAAPSGGGS 17 T 4.1 DUF3710 pdbhh F T 2nsv 1 A A MEN1_EUPNO Mating pheromone En-1 NPEDWFTPDTCAYGDSNTAWTTCTTPGQTCYTCCSSCFDVVGEQACQMSAQC 52 T 41 eIF3g pdbhh F Eukaryota T 2nsw 1 A A MEN2_EUPNO Mating pheromone En-2 DIEDFYTSETCPYKNDSQLAWDTCSGGTGNCGTVCCGQCFSFPVSQSCAGMADSNDCPNA 60 T 30 Inhibitor_I67 pdbhh F Eukaryota T 2nw3 3 C C EBV peptide EPLPQGQLTAY EPLPQGQLTAY 11 T 7.4 AP-5_subunit_s1 pdbhh F T 2nwn 2 B B upain-1 CSWRGLENHRMC 12 T 0.95 DUF2632 pdbhh F T 2nx5 3 C,H,M,R C,H,M,S BZLF1_EBVB9 EBV peptide, EPLPQGQLTAY EPLPQGQLTAY 11 T 7.4 AP-5_subunit_s1 pdbhh T Viruses T 2nx6 1 A A Q8IT70_HYDVU Nematocyst outer wall antigen GSSSCPQFPSCSPSCAPQCSQQCCQQP 27 T 0.0015 C_tripleX pdbhh F Eukaryota T 2nx7 1 A A Q8IT70_HYDVU Nematocyst outer wall antigen AQNPCSLQQPGCSSACAPACRLSCCSLG 28 T 0.13 C_tripleX pdbhh F Eukaryota T 2nxd 2 C P Analogue of RT-RH pol protease substrate peptide GADIFYLDGA 10 T 3.5 XPG_N pdbhh F T 2nxl 2 C P Analogue of RT-RH pol protease substrate peptide GAEVFYVDGA 10 T 1.8 MHC_II_alpha pdbhh F T 2nxm 2 C P Analogue of RT-RH pol protease substrate peptide GAQTFYVDGA 10 T 2.4 BOFC_N pdbhh F T 2nyl 4 G,H G,H microcystin LR XLXRXXX 7 T 49 Phage_TAC_2 pdbhh F F 2nym 4 G,H G,H microcystin LR XLXRXXX 7 T 49 Phage_TAC_2 pdbhh F F 2nyq 2 B B Tetrapeptide XWCF 4 T 24 Phage_term_smal pdbhh F F 2o01 11 K K Photosystem I reaction center subunit psaK, chloroplast XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 38 F F F 2o02 2 C,D P,Q ExoS (416-430) peptide GHGQGLLDALDLAS 14 T 0.58 CTP_transf_1 pdbhh F T 2o0s 1 A A YW12 YVLWKRKRMIFI 12 T 6.7 Transport_MerF pdbhh F T 2o1n 2 B P Ala-Ile-Ala-Ser peptide AIAS 4 T 270 Peptidase_C34 pdbhh F F 2o4j 2 B C MED1_HUMAN PBP, PPAR-BINDING PROTEIN, THYROID HORMONE RECEPTOR-ASSOCIATED PROTEIN COMPLEX 220 KDA COMPONENT, TRAP220, THYROID RECEPTOR-INTERACTING PROTEIN 2, TRIP-2, P53 REGULATORY PROTEIN RB18A, VITAMIN D RECEPTOR-INTERACTING PROTEIN COMPLEX COMPONENT DRIP205, ACTIVATOR- RECRUITED COFACTOR 205 KDA COMPONENT, ARC205 KNHPMLMNLLKDN 13 T 3.8 Gb3_synth pdbhh F Eukaryota T 2o4r 2 B C MED1_HUMAN PBP, PPAR-BINDING PROTEIN, THYROID HORMONE RECEPTOR-ASSOCIATED PROTEIN COMPLEX 220 KDA COMPONENT, TRAP220, THYROID RECEPTOR-INTERACTING PROTEIN 2, TRIP-2, P53 REGULATORY PROTEIN RB18A, VITAMIN D RECEPTOR-INTERACTING PROTEIN COMPLEX COMPONENT DRIP205, ACTIVATOR- RECRUITED COFACTOR 205 KDA COMPONENT, ARC205 KNHPMLMNLLKDN 13 T 3.8 Gb3_synth pdbhh F Eukaryota T 2o5g 2 B B MYLK_CHICK Smooth muscle Myosin light chain kinase peptide XARRKWQKTGHAVRAIGRLSX 21 T 13 PACT_coil_coil pdbhh F Eukaryota T 2o60 2 B B NOS1_MOUSE Peptide corresponding to calmodulin binding domain of neuronal nitric oxide synthase KRRAIGFKKLAEAVKFSAKLMGQX 24 T 0.094 EDR1 pdbpssm F Eukaryota T 2o6n 1 A A RH4B designed peptide XAEIEQAKKEIAYLIKKAKEEILEEIKKAKQEIAX 35 T 0.037 Endotoxin_C2 pdb F T 2o88 2 C,D C,D P41 peptide XAPSYSPPPPP 11 T 1.8 N1221 pdbhh F F 2o8z 1 A A cCRF(30-41) Peptide XEAHKNRKLMEIIX 14 T 0.01 CRF pdbhh F T 2o98 2 C,D P,Q PMA3_NICPL H-ATPASE PMA2 TNFNELNQLAEEAKRRAEIARQRELHTLKGHVESVVKLKGLDIETIQQSYDI 52 T 0.023 DUF4398 pdb F Eukaryota T 2obh 2 C,D C,D XPC_HUMAN XERODERMA PIGMENTOSUM GROUP C-COMPLEMENTING PROTEIN, P125 XNWKLLAKGLLIRERLKR 18 T 16 S1FA pdbhh F Eukaryota T 2od2 2 B B Acetylated H4 peptide KGGAXRHRKILTAQ 14 T 32 DUF4196 pdbhh F T 2od4 1 A,B A,B hypothetical protein GMFAGSIPMYIRVVSITAQSKLQFDMTVTYFENVWSPKVISLGAISAEFVQSNENSGMYIIHYPDKQTAISVFDKIKPEVDEVRTQNRIQITEGKRLFRVD 101 T 2.2 ABM pdbhh F T 2od6 1 A,B,C,D A,B,C,D hypothetical protein GMAEPKFTSFTTADFINDVDMELFIDAVEKTAPVWVKEMKSRGLLKFSMNRVWNKGEVFRVVMTYEYKDRASFEANIAYLEDTFGKNPVFLQLVTTAKFTTSRCLVVMEV 110 T 0.0042 DUF3906 pdbpercent F T 2od7 2 B B Acetylated histone H4 peptide KGGAXRHRKILTAQ 14 T 32 DUF4196 pdbhh F T 2od8 2 B B DNLI1_YEAST CDC9, POLYDEOXYRIBONUCLEOTIDE SYNTHASE AGKKPKQATLARFFTSMKNKPT 22 T 1.6 RXLR_WY pdbhh F Eukaryota T 2od9 2 B B H4 peptide KGGAXRHRKILTAQ 14 T 32 DUF4196 pdbhh F T 2odd 1 A B NCOR2_HUMAN SMRT GSGSTISNPPPLISSAK 17 T 28 Connexin43 pdbhh F Eukaryota T 2oei 2 B B poly-proline peptide PPPPPPLPP 9 T 3.5 IL11 pdbhh F F 2ofq 2 B B Q79SE5_SALTM TraN PPPEPDWSNTVPVNKTIPVDTQ 22 T 0.12 Cag12 unphh F Bacteria T 2oi3 2 B B artificial peptide PD1 XHSKYPLPPLPSLX 14 T 9.6 DUF5855 pdbhh F T 2oj2 2 B B artificial peptide PD1 XHSKYPLPPLPSLX 14 T 9.6 DUF5855 pdbhh F T 2oju 2 C,D C,D CICLOSPORIN, CICLOSPORINE XXXXXXXXVXA 11 T 9.7 DUF6090 pdbhh F F 2ojx 2 B E MPIP3_HUMAN Synthetic peptide LLCSTPNGL 9 T 2.1 DUF3038 pdbhh F Eukaryota T 2okr 2 B,D C,F MAPK2_HUMAN MAPK-ACTIVATED PROTEIN KINASE 2, MAPKAP KINASE 2, MAPKAPK-2, MK2 IKIKKIEDASNPLLLKRRKKARAL 24 T 0.52 DUF6278 pdbhh F Eukaryota T 2ol9 1 A A peptide from human prion SNQNNF 6 T 220 AsiA pdbhh F F 2olb 2 B B TRIPEPTIDE LYS-LYS-LYS KKK 3 T 580 Rrn6 pdbhh F F 2olx 1 A A NNQQ peptide derived from Yeast Prion Sup35 NNQQ 4 T 350 Sec3_C_2 pdbhh F F 2omm 1 A A ERF3_YEAST GNNQQNY peptide corresponding to residues 7-13 of yeast prion sup35 GNNQQNY 7 T 1.3 TFIIA unppssm F Eukaryota F 2omp 1 A,B A,B LYQLEN peptide derived from human insulin chain A, residues 13-18 LYQLEN 6 T 4.8 DUF5418 pdbhh F F 2onv 1 A A amyloid-fibril forming peptide GGVVIA derived from the Alzheimer's amyloid Abeta GGVVIA 6 T 3.5 Beta-APP pdbhh F F 2onw 1 A X fibril forming peptide from Bovine Pancreatic Ribonuclease (RNase A) SSTSAA 6 T 350 Pet127 pdbhh F F 2onx 1 A A peptide corresponding to residues 8-11 of yeast prion sup35 NNQQ 4 T 350 Sec3_C_2 pdbhh F F 2oob 1 A A CBLB_HUMAN SIGNAL TRANSDUCTION PROTEIN CBL-B, SH3-BINDING PROTEIN CBL-B, CASITAS B-LINEAGE LYMPHOMA PROTO-ONCOGENE B, RING FINGER PROTEIN 56 GSGPEAALENVDAKIAKLMGEGYAFEEVKRALEIAQNNVEVARSILREFAFP 52 T 0.00014 UBA pdbpssm F Eukaryota T 2op5 1 A,B,C,D,E,F A,B,C,D,E,F hypothetical protein GMKDTDETAFLNSLFMDFTSENELELFLKSLDEVWSEDLYSRLSAAGLIRHVISKVWNKEQHRISMVFEYDSKEGYQKCQEIIDKEFGITLKEKLKKFVFKIHNNRGVVVSEFIRST 117 T 0.052 Ion_trans_N pdbpercent F T 2opz 2 E,F,G,H E,F,G,H AVPF (Smac homologue, N-terminal tetrapeptide) AVPF 4 T 94 PriA_3primeBD pdbhh F F 2oq9 1 A A A6N8P1_HYDVU Minicollagen-5 APMQAPVQAAPACMASCAPQCCGR 24 T 0.22 C_tripleX pdbhh F Eukaryota T 2oqj 3 C,F,I,L C,F,I,L peptide 2G12.1 (ACPPSHVLDMRSGTCLAAEGK) ACPPSHVLDMRSGTCLAAEGK 21 T 1.3 Glyco_hydro_65N pdbhh F T 2oqs 2 B B C-terminal HPV-18 E6 peptide RRETQV 6 T 110 LisH_TPL pdbhh F F 2oru 1 A A xtz1-peptide KAWTWTWNPATGKWTWRKNE 20 T 0.31 LPD24 pdbhh F T 2orz 2 B B Tuftsin TKPR 4 T 200 zf-CCHC pdbhh F F 2os2 2 C,D C,D histone 3 peptide STGGVKKPHRY 11 T 7.1 UPF0715 pdbhh F T 2os6 2 B B PLXB1_HUMAN SEMAPHORIN RECEPTOR SEP VENKVTDL 8 T 0.41 TMCCDC2 pdbhh F Eukaryota T 2ot0 2 E,F,G,H E,F,G,H WASP_HUMAN WASP EDQAGDEDEDDEWDD 15 T 0.15 SMN pdbhh F Eukaryota T 2otq 1 A A cRW3 cationic antimicrobial peptide RRWFWR 6 T 3.2 CsiD pdbhh F F 2otu 3 I,J,K,L P,Q,R,S peptide antigen QQQQQQQQQQG 11 T 50 DUF3947 pdbhh F F 2otw 3 E,F E,F poly-Gln peptide antigen GQQQQQQQQQQG 12 T 71 DUF3947 pdbhh F F 2ovh 2 B B SMRT peptide TNMGLEAIIRKALMGKY 17 T 2.8 RuvA_C pdbhh F T 2ovm 2 B B NCoR GHSFADPASNLGLEDIIRKALMGSF 25 T 4.3 RuvA_C pdbhh F T 2ovq 3 C C cyclinE C-terminal degron LPSGLLTPPQSG 12 T 17 Cuticle_1 pdbhh F T 2ovr 3 C C cyclinE N-terminal degron SLIPTPDK 8 T 5 PH_18 pdbhh F T 2ox2 1 A A cRW2 peptide RRWWFR 6 T 3.9 DERM pdbhh F F 2oxw 2 B X ILE-ALA-GLY peptide IAG 3 T 180 Arg_repressor_C pdbhh F F 2oxz 2 B Y PRO-GLN-GLY peptide PQG 3 T 120 Turandot pdbhh F F 2oxz 3 C X ILE-ALA-GLY peptide IAG 3 T 180 Arg_repressor_C pdbhh F F 2oy2 2 B,D W,Y ILE-ALA-GLY peptide IAG 3 T 180 Arg_repressor_C pdbhh F F 2oyh 4 G,H,I,J G,H,I,J GHRP peptide GHRP 4 T 14 VPS38 unphh F F 2oyi 4 G,H,I,J G,H,I,J GPRP Peptide GPRP 4 T 65 SRCR_2 pdbhh F F 2p05 1 A A a non-biological ATP binding protein 1819 GSMDYKDDDDKKTNWLKRIYRVRPCVKCKVAPRNWKVKNKHLRIYNMCKTCFNNSIDIGDDTYHGHDDWLMYADSKEISNT 81 T 0.011 ZZ pdbpercent F T 2p09 1 A A a non-biological ATP binding protein with two mutations N32D and D65V GSMDYKDDDDKKTNWLKRIYRVRPCVKCKVAPRDWKVKNKHLRIYNMCKTCFNNSIDIGDDTYHGHVDWLMYADSKEISNT 81 T 0.025 Glyco_tranf_2_4 pdbpssm F T 2p0r 2 C,D D,E leupeptin XLLX 4 T 590 DNTTIP1_dimer pdbhh F F 2p0w 2 C,D P,Q Histone peptide H4 KGGKGLGKGGAKRHR 15 T 130 DUF1884 pdbhh F T 2p0x 1 A A abiotic ATP-binding, folding optimized protein GSFRVKPCVVCKVAPRDWRVKNRHLRIYNMCKTCFNNSIKSGDDTYHGHVDWLMYTDAKEFSST 64 T 0.0067 ZZ pdbpercent F T 2p4r 2 B T ITCH_HUMAN ITCH, ATROPHIN-1-INTERACTING PROTEIN 4, AIP4, NFE2-ASSOCIATED POLYPEPTIDE 1, NAPP1 GGFKPSRPPRPSRPPPPTPRRPASV 25 T 3.5 UPF0449 pdbhh F Eukaryota T 2p5b 2 C,D I,J H3_URECA Histone H3 RKSAPATGGVKKPHRYRPGTVL 22 T 1.8 YlzJ pdbhh F Eukaryota T 2p5h 1 A A pip9 VDIHVWDGV 9 T 0.45 DUF4883 pdbhh F T 2p5j 1 A A pip17 LGRVDIHVWDGVYIRGR 17 T 0.24 DUF4883 pdbhh F T 2p5t 1 A X fragment of PezA helix-turn-helix motif XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 33 F F F 2p6a 3 E E probable fragment of follistatin AAAAAAAAAA 10 T 200 FAD_oxidored pdbhh F F 2p6b 1 A E PVIVIT 14-mer Peptide GPHPVIVITGPHEEX 15 T 0.95 DUF4609 pdbhh F T 2p6j 1 A A designed engrailed homeodomain variant UVF MKQWSENVEEKLKEFVKRHQRITQEELHQYAQRLGLNEEAIRQFFEEFEQRK 52 T 0.0061 DUF72 pdb F T 2p7r 1 A A cyclo-CPFVC CPFVC 5 T 22 zf-Sec23_Sec24 pdbhh F F 2p8l 3 C C gp41 peptide ELLELDKWASLNW 13 T 4 Sex_peptide pdbhh F T 2p8o 1 A A CTRA_BOVIN Chymotrypsin A chain A CGVPAIQPVLSGL 13 T 2 CaM_bind pdbhh F Eukaryota T 2p8p 3 C C gp41 peptide LELDKWASLWX 11 T 0.72 Chrome_Resist pdbhh F T 2p9w 1 A A Q6WIF3_MALSM Mal s 1 allergenic protein ALPDQIDVKVKNLTPEDTIYDRTRQVFYQSNLYKGRIEVYNPKTQSHFNVVIDGASSNGDGEQQMSGLSLLTHDNSKRLFAVMKNAKSFNFADQSSHGASSFHSFNLPLSENSKPVWSVNFEKVQDEFEKKAGKRPFGVVQSAQDRDGNSYVAFALGMPAIARVSADGKTVSTFAWESGNGGQRPGYSGITFDPHSNKLIAFGGPRALTAFDVSKPYAWPEPVKINGDFGTLSGTEKIVTVPVGNESVLVGARAPYAISFRSWDNWKSANIKKTKRSELQNSGFTAVADYYQGSEQGLYAVSAFFDNGAHGGRSDYPLYKLDNSIQNFHHHHHH 334 T 4.3E-07 MRJP pdbhh F Eukaryota T 2pav 3 C V VASP_HUMAN VASP GAGGGPPPAPPLPAAQ 16 T 3.5 Tir_receptor_N pdbhh F Eukaryota F 2pb8 2 B P AVYS AVYS 4 T 210 YqzH pdbhh F F 2pbd 3 C V VASP_HUMAN VASP GPPPAPPLPAAQGPGGGGAGAPGLAAAIAGAKLRKVSKQEEAS 43 T 0.00045 DUF4106 unphh F Eukaryota T 2pbk 2 C,D C,D hexapeptide phosphonate inhibitor XPVYXQX 7 T 42 DUF3120 pdbhh F F 2pc4 2 E H P90573_PLABE PbTRAP EDNDWN 6 T 27 DUF4878 pdbhh F Eukaryota F 2pcu 2 B B peptide FNRPV 5 T 42 DUF5395 pdbhh F F 2pdz 2 B B PEPTIDE GVKESLV KESLV 5 T 290 E3_UbLigase_RBR pdbhh F F 2pem 2 G R RbcL EIKFEFD 7 T 2.7 DUF5370 pdbhh F F 2pff 3 C,F,I C,F,I Tail protein XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 65 F F F 2pg4 1 A,B A,B Q9YDN4_AERPE Uncharacterized protein GMDDETLRLQFGHLIRILPTLLEFEKKGYEPSLAEIVKASGVSEKTFFMGLKDRLIRAGLVKEETLSYRVKTLKLTEKGRRLAECLEKCRDVLGS 95 T 0.0017 HTH_27 unppercent F Archaea T 2pgc 1 A,B,C,D,E A,B,C,D,E uncharacterized protein GMSNINYVILTVASVDFSYRETMARLMSSYSKDLIDNAGAKGTRFGSIGTGDHAGSLIFIQFYDDLTGYQKALEIQSKSSVFKEIMDSGKANIYLRNISTSLPTKFEQSYEHPKYIVLTRAEAAMSDKDKFLNCINDTASCFKDNGALTLRFGNLLTGSNVGNYLLGVGYPSMEAIEKTYDELLAHSSYKELMTFAKVNMRNIIKIL 207 T 1.4E-05 DUF6039 pdbhh F T 2pgk 1 A A PHOSPHOGLYCERATE KINASE XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 408 F F F 2ph7 1 A,B A,B Y2093_ARCFU Uncharacterized protein AF_2093 GMDVEIVEELSKMLAGRKAVTEEEIRRKAIRCALKIMGARLVGIDAELIEDVTCSLIDCPITLKSLHFSEKVKIGDVLFYHPHVIKPEKEDFEQAYFEYKQSKKFLDAFDIMREVTDRFFEGYEAEGRYMRKYTKDGRNYYAFFSTIDDTFEDVDIHLRMVDEVDGDYVVIVPTENELNPFLKFFKQYSEDAKRAGLKIWVVNPDEKTIDPFIGYPKDFRLLKGFKNPKAAALVSAYWRVTVTDLD 246 T 7 DUF1882 unphh F Archaea T 2phk 2 B B MC-PEPTIDE RQMSFRL 7 T 20 OAM_dimer pdbhh F T 2pie 2 B F phosphopeptide ELKTERY 7 T 52 MvaI_BcnI pdbhh F T 2pkl 2 B B ARA70 peptide KLLF 4 T 190 Ric8 pdbhh F F 2pku 2 B B peptide (GLU)(SER)(VAL)(LYS)(ILE) ESVKI 5 T 200 KOW pdbhh F F 2pld 2 B B PGFRB_HUMAN PHOSPHOPEPTIDE FROM PDGF DNDXIIPLPDPK 12 T 2 PA28_alpha pdbhh F Eukaryota T 2ple 2 B B PGFRB_HUMAN PHOSPHOPEPTIDE FROM PDGF DNDXIIPLPDPK 12 T 2 PA28_alpha pdbhh F Eukaryota T 2plx 2 B B TI_VERHE Peptide Inhibitor QCKVMCYAQRHSSPELLRRCLDNCEK 26 T 0.0098 DUF842 unp F Eukaryota T 2poy 2 D,E,F T,U,V CICLOSPORIN, CICLOSPORINE XXXXXXXXVXA 11 T 9.7 DUF6090 pdbhh F F 2pps 1 A A PSI, SYSTEM I OF OXYGENIC PHOTOSYNTHESIS XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 478 F F F 2pps 2 B B PSI, SYSTEM I OF OXYGENIC PHOTOSYNTHESIS XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 503 F F F 2pps 3 C L PSI, SYSTEM I OF OXYGENIC PHOTOSYNTHESIS XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 111 F F F 2pps 4 D K PSI, SYSTEM I OF OXYGENIC PHOTOSYNTHESIS XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 64 F F F 2pps 5 E F PSI, SYSTEM I OF OXYGENIC PHOTOSYNTHESIS XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 130 F F F 2pps 6 F C PSI, SYSTEM I OF OXYGENIC PHOTOSYNTHESIS XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 80 F F F 2pq2 2 B B GALAG peptide GALAG 5 T 64 TMEM237 pdbhh F F 2pqw 2 B B Histone H4 RHRKVLRDN 9 T 2.9 Phage_X pdbhh F T 2pr9 2 B P GBRG2_RAT GABA(A) receptor subunit gamma-2 peptide DEEYGYECLD 10 T 5.1 DUF5816 pdbhh F Eukaryota T 2psx 2 B B LEUPEPTIN XLLX 4 T 590 DNTTIP1_dimer pdbhh F F 2psy 2 B B LEUPEPTIN XLLX 4 T 590 DNTTIP1_dimer pdbhh F F 2puq 4 D I TRP-TYR-THR-ARG CHLOROMETHYLKETONE INHIBITOR WYTXX 5 T 33 FAD-SLDH pdbhh F F 2pux 3 C C PAR3_MOUSE PAR-3, THROMBIN RECEPTOR-LIKE 2, COAGULATION FACTOR II RECEPTOR-LIKE 2 QNTFEEFPLSDIE 13 T 0.87 Hirudin pdbhh F Eukaryota T 2pv2 2 E,F E,F C-peptide NFTLKFWDIFRK 12 T 2.2 Fmp27_GFWDK pdbhh F T 2pv3 2 C C C-peptide NFTLKFWDIFRK 12 T 2.2 Fmp27_GFWDK pdbhh F T 2pv9 3 C C PAR4_MOUSE PAR-4, THROMBIN RECEPTOR-LIKE 3, COAGULATION FACTOR II RECEPTOR-LIKE 3 KSSDKPNPRGYPGKFCANDSDTLELP 26 T 25 Colicin_Ia pdbhh F Eukaryota T 2pw1 3 C C peptide epitope ELDKWNSL 8 T 1.9 Lar_restr_allev pdbhh F T 2pw2 3 C C peptide epitope ELDKWKSL 8 T 1.2 DUF4720 pdbhh F T 2pxj 2 C,D I,J monomethylated Histone H3K36 peptide RKSAPATGGVKKPHRYRPGTVL 22 T 1.8 YlzJ pdbhh F T 2pxy 5 E P Myelin basic protein (MBP)-peptide HSRGGASQYRPSQ 13 T 13 Tsg pdbhh F T 2q0n 2 B B Synthetic peptide RRRRRSWYFDG 11 T 0.35 CFIA_Pcf11 pdbhh F T 2q2k 2 B,C A,B O87365_STAAU Hypothetical protein MGSSHHHHHHSSGLVPGSHMDKKETKHLLKIKKEDYPQIFDFLENVPRGTKTAHIREALRRYIEEIGENP 70 T 0.051 PutA_N unppssm F Bacteria T 2q3c 2 B B DFSI inhibitory peptide DFSI 4 T 44 DUF6241 pdbhh F F 2q3i 2 B D D-peptide XGXXGXGXXXXXXXXXX 17 T 2 Rubredoxin_2 pdbhh F F 2q3y 2 B B NR0B2_HUMAN ORPHAN NUCLEAR RECEPTOR SHP, SMALL HETERODIMER PARTNER PAILYALLSS 10 T 6.6 NR_Repeat pdbhh F Eukaryota T 2q3z 2 B X Polypeptide XPXLPFX 7 T 160 Sm_like pdbhh F F 2q5a 2 B B Five residue peptide XFTXXQX 7 T 340 eIF3m_C_helix pdbhh F F 2q5y 2 B,D B,D NUP98_HUMAN Nuclear pore complex protein Nup96 SKYGLQD 7 T 15 DUF2683 pdbhh F Eukaryota T 2q6f 2 C,D D,C N-[(5-METHYLISOXAZOL-3-YL)CARBONYL]ALANYL-L-VALYL-N~1~-((1R,2Z)-4-(BENZYLOXY)-4-OXO-1-{[(3R)-2-OXOPYRROLIDIN-3-YL]METHYL}BUT-2-ENYL)-L-LEUCINAMIDE XAVLXX 6 T 1700 zf-H2C2_5 pdbhh F F 2q82 1 A A Q94M07_9VIRU Core protein P7 MDFITDMSKNQRLELQNRLAQYETSLMVMSHNGDVPVITGFNVMRVTTMLDALKVELPAVAVLGDDAQDLAYVFGARPLAVGVNIIRVVDVPGQQPSALVDAELGALHEVSMVRVLNDIADEQLVKANM 129 T 15 C_Hendra pdbhh T Viruses T 2q8d 2 C,D F,G PEPTIDE RKSAPATGGVKKPHRY 16 T 31 DUF5976 pdbhh F T 2q8e 2 C,D F,G histone 3 peptide RKSAPATGGVKKPHRY 16 T 31 DUF5976 pdbhh F T 2q9i 4 G,H S,T Fibrin B Knob (GHRPam) GHRP 4 T 14 VPS38 unphh F F 2q9i 5 I,J M,N Fibrin B Knob (MHRPYam) MHRPY 5 T 39 Muted pdbhh F F 2qa9 2 B I PRTB_STRGR 4-mer peptide DAIY DAIY 4 T 79 Exotox-A_bind pdbhh F Bacteria F 2qac 2 B T MYOA_PLAYO Myosin-A XLMRVQAHIRKRMVA 15 T 0.063 BORCS8 pdbhh F Eukaryota T 2qas 2 B B C. crescentus ssrA peptide KKGRHGAANDNFAEEFAVAA 20 T 15 KCTD4_C pdbhh F T 2qbl 1 A A CPXA_PSEPU CAMPHOR 5-MONOOXYGENASE, P450CAM MTTETIQSNANLAPLPPHVPEHLVFDFDMYNPSNLSAGVQEAWAVLQESNVPDLVWTRCNGGHWIATRGQLIREAYEDYRHFSSECPFIPREAGEAYDFIPTSMDPPEQRQFRALANQVVGMPVVDKLENRIQELACSLIESLRPQGQCNFTEDYAEPFPIRIFMLLAGLPEEDIPHLKYLTDQMTRPDGSMTFAEAKEALYDYLIPIIEQRRQKPGTDAISIVANGQVNGRPITSDEAKRMCGLLLVTGLDTVVNFLSFSMEFLAKSPEHRQELIERPERIPAACEELLRRFSLVADGRILTSDYEFHGVQLKKGDQILLPQMLSGLDERENACPMHVDFSRQKVSHTTFGHGSHLCLGQHLARREIIVTLKEWLTRIPDFSIAPGAQIQHKSGIVSGVQALPLVWDPATTKAVHHHHHH 421 T 1.6E-05 p450 unppercent F Bacteria T 2qbm 1 A A CPXA_PSEPU CAMPHOR 5-MONOOXYGENASE, P450CAM MTTETIQSNANLAPLPPHVPEHLVFDFDMYNPSNLSAGVQEAWAVLQESNVPDLVWTRCNGGHWIATRGQLIREAYEDYRHFSSECPFIPREAGEAYDFIPTSMDPPEQRQFRALANQVVGMPVVDKLENRIQELACSLIESLRPQGQCNFTEDYAEPFPIRIFMLLAGLPEEDIPHLKYLTDQMTRPDGSMTFAEAKEALYDYLIPIIEQRRQKPGTDAISIVANGQVNGRPITSDEAKRMCGLLLVTGLDTVVNFLSFSMEFLAKSPEHRQELIERPERIPAACEELLRRFSLVADGRILTSDYEFHGVQLKKGDQILLPQMLSGLDERENACPMHVDFSRQKVSHTTFGHGSHLCLGQHLARREIIVTLKEWLTRIPDFSIAPGAQIQHKSGIVSGVQALPLVWDPATTKAVHHHHHH 421 T 1.6E-05 p450 unppercent F Bacteria T 2qbn 1 A A CPXA_PSEPU CAMPHOR 5-MONOOXYGENASE, P450CAM MTTETIQSNANLAPLPPHVPEHLVFDFDMYNPSNLSAGVQEAWAVLQESNVPDLVWTRCNGGHWIATRGQLIREAYEDYRHFSSECPFIPREAGEAYDFIPTSMDPPEQRQFRALANQVVGMPVVDKLENRIQELACSLIESLRPQGQCNFTEDYAEPFPIRIFMLLAGLPEEDIPHLKYLTDQMTRPDGSMTFAEAKEALYDYLIPIIEQRRQKPGTDAISIVANGQVNGRPITSDEAKRMCGLLLVVGLDTVVNFLSFSMEFLAKSPEHRQELIERPERIPAACEELLRRFSLVADGRILTSDYEFHGVQLKKGDQILLPQMLSGLDERENACPMHVDFSRQKVSHTTFGHGSHLCLGQHLARREIIVTLKEWLTRIPDFSIAPGAQIQHKSGIVSGVQALPLVWDPATTKAVHHHHHH 421 T 1.6E-05 p450 unppercent F Bacteria T 2qbo 1 A A CPXA_PSEPU CAMPHOR 5-MONOOXYGENASE, P450CAM MTTETIQSNANLAPLPPHVPEHLVFDFDMYNPSNLSAGVQEAWAVLQESNVPDLVWTRCNGGHWIATRGQLIREAYEDYRHFSSECPFIPREAGEAYDFIPTSMDPPEQRQFRALANQVVGMPVVDKLENRIQELACSLIESLRPQGQCNFTEDYAEPFPIRIFMLLAGLPEEDIPHLKYLTDQMTRPDGSMTFAEAKEALYDYLIPIIEQRRQKPGTDAISIVANGQVNGRPITSDEAKRMCGLLLVVGLDTVVNFLSFSMEFLAKSPEHRQELIERPERIPAACEELLRRFSLVADGRILTSDYEFHGVQLKKGDQILLPQMLSGLDERENACPMHVDFSRQKVSHTTFGHGSHLCLGQHLARREIIVTLKEWLTRIPDFSIAPGAQIQHKSGIVSGVQALPLVWDPATTKAVHHHHHH 421 T 1.6E-05 p450 unppercent F Bacteria T 2qbw 2 B B Polypeptide PQPVDSWV 8 T 4 Spt4 pdbhh F T 2qbx 2 C,D D,P antagonistic peptide SNEWIQPRLPQH 12 T 13 B3_4 pdbhh F T 2qc5 1 A A O87275_9STAP Streptogramin B lactonase GSEAWMNFYLEEFNLSIPDSGPYGITSSEDGKVWFTQHKANKISSLDQSGRIKEFEVPTPDAKVMCLIVSSLGDIWFTENGANKIGKLSKKGGFTEYPLPQPDSGPYGITEGLNGDIWFTQLNGDRIGKLTADGTIYEYDLPNKGSYPAFITLGSDNALWFTENQNNSIGRITNTGKLEEYPLPTNAAAPVGITSGNDGALWFVEIMGNKIGRITTTGEISEYDIPTPNARPHAITAGKNSEIWFTEWGANQIGRITNDNTIQEYQLQTENAEPHGITFGKDGSVWFALKCKIGKLNLNE 300 T 0.0004 SGL unppssm F Bacteria T 2qhr 3 C P VGP_EBOEC Envelope glycoprotein peptide VEQHHRRTDND 11 T 0.0011 SOG2 unppercent T Viruses T 2qiy 2 B,D C,D UBP3_YEAST UBIQUITIN THIOESTERASE 3, UBIQUITIN-SPECIFIC-PROCESSING PROTEASE 3, DEUBIQUITINATING ENZYME 3 GSASVTKLKNLKENSSNLIQLPLFINTTEAEFAAASVQRYELNMKALN 48 T 0.031 Caldesmon unppssm F Eukaryota T 2qki 4 G,H G,H compstatin XICVWQDWGAHRCTX 15 T 2 DX pdbhh F T 2ql5 3 E,F E,F inhibitor XDMQX 5 T 510 RamS pdbhh F F 2ql5 4 G G peptide QGHGE 5 T 54 Raptor_N pdbhh F F 2ql7 3 E,F E,F Inhibitor AC-IEPD_CHO XIEPX 5 T 430 DUF4035 pdbhh F F 2ql7 4 G G QGHGE QGHGE 5 T 54 Raptor_N pdbhh F F 2ql9 3 E,F E,F Inhibitor AC-DQMD-CHO XDQMX 5 T 520 Tfb2_C pdbhh F F 2ql9 4 G G PEPTIDE QGHGE QGHGE 5 T 54 Raptor_N pdbhh F F 2qlb 3 E,F E,F Inhibitor AC-ESMD-CHO XESMX 5 T 570 BD_b_sandwich pdbhh F F 2qlb 4 G G Peptide QGHGE QGHGE 5 T 54 Raptor_N pdbhh F F 2qlf 3 E,F E,F Inhibitor AC-DNLD-CHO XDNLX 5 T 590 EpmC pdbhh F F 2qlf 4 G G Peptide QGHGE QGHGE 5 T 54 Raptor_N pdbhh F F 2qlj 3 E,F E,F Inhibitor AC-WEHD-CHO XWEHX 5 T 140 Vps62 pdbhh F F 2qlj 4 G G Peptide QGHGE QGHGE 5 T 54 Raptor_N pdbhh F F 2qll 2 B B protein targeting to glycogen - GL GPYY 4 T 22 DUF2939 pdbhh F F 2qn6 3 C C IF2B_SULSO EIF-2-BETA, AIF2-BETA SSEKEYVEMLDRLYSKLP 18 T 0.8 DUF6103 pdbhh F Archaea T 2qos 2 B A CO8A_HUMAN COMPLEMENT COMPONENT 8 SUBUNIT ALPHA LRYDSTAERLY 11 T 9.3 MHC_II_beta pdbhh F Eukaryota T 2qqf 2 B B H4_YEAST Histone H4 KGGAXRHRKIL 11 T 11 Shadoo unppercent F Eukaryota T 2qqg 2 B B H4_YEAST Histone H4 KGGAXRHRKIL 11 T 11 Shadoo unppercent F Eukaryota T 2qqs 2 C,D C,D H4_HUMAN METHYLATED HISTONE H4 PEPTIDE KRHRKVLRDN 10 T 0.27 UPF0137 unp F Eukaryota T 2qrv 2 B,C,F,G B,C,F,G DNM3L_HUMAN DNA (cytosine-5)-methyltransferase 3-like GSMWRSQLKAFYDRESENPLEMFETVPVWRRQPVRVLSLFEDIKKELTSLGFLESGSDPGQLKHVVDVTDTVRKDVEEWGPFDLVYGATPPLGHTCDRPPSWYLFQFHRLLQYARPKPGSPRPFFWMFVDNLVLNKEDLDVASRFLEMEPVTIPDVHGGSLQNAVRVWSNIPAIRSRHWALVSEEELSLLAQNKQSSKLAAKWPTKLVKNCFLPLREYFKYFSTELTSSL 230 T 1E-09 DNA_methylase pdbhh F Eukaryota T 2qsk 1 A A SVN_SCYVA scytovirin GSGPTYCWNEANNPGGPNRCSNNKQCDGARTCSSSGFCQGTSRKPDPGPKGPTYCWDEAKNPGGPNRCSNSKQCDGARTCSSSGFCQGTAGHAAA 95 T 0.0034 EB pdb F Bacteria T 2qt4 1 A A SVN_SCYVA scytovirin GSGPTYCWNEANNPGGPNRCSNNKQCDGARTCSSSGFCQGTSRKPDPGPKGPTYCWDEAKNPGGPNRCSNSKQCDGARTCSSSGFCQGTAGHAAA 95 T 0.0034 EB pdb F Bacteria T 2qt5 2 C,D X,Y FRAS1 NNLQDGTEV 9 T 2.3 DUF4288 pdbhh F T 2qyf 3 E,F E,F peptide SWYSYPPPQRAV 12 T 8.6 NADHdh_A3 pdbhh F T 2qzx 2 C,D C,D Pepstatin XVVXAX 6 T 1700 FAM60A pdbhh F F 2r03 2 B B GAG_EIAVY p6-Gag NLYPDLSE 8 T 0.24 LSPR pdbhh T Viruses T 2r0l 4 D B HGFA_HUMAN Hepatocyte growth factor activator VQLSPDLLATLPEPASPGRQACGRRHKKRTFLRPR 35 T 2.4E-22 DUF316 unphh F Eukaryota T 2r0w 3 C Q A4_HUMAN AMYLOID BETA A4 PROTEIN, FRAGMENT DAEFRHDS 8 T 0.0001 Beta-APP unphh F Eukaryota T 2r0y 2 B B Histone H3 peptide TARKSTGGXAPRK 13 T 0.1 Sirohm_synth_M pdbpercent F T 2r0z 3 C Q GLUTAMATE RECEPTOR INTERACTING PROTEIN 1 AKFRHD 6 T 46 PIG-Y pdbhh F T 2r1r 2 B,D,F,G D,E,F,P RIR2_ECOLI RIBONUCLEOTIDE REDUCTASE R2 PROTEIN YLVGQIDSEVDTDDLSNFQL 20 T 12 UL11 pdbhh F Bacteria T 2r28 2 B,D C,D PP2BA_HUMAN CALMODULIN-DEPENDENT CALCINEURIN A SUBUNIT ALPHA ISOFORM, CAM-PRP CATALYTIC SUBUNIT AAARKEVIRNKIRAIGKMARVFSVL 25 T 3.3 7kD_DNA_binding pdbhh F Eukaryota T 2r3c 2 C,D C,D HIV entry inhibitor PIE1 XXXGXXXXXXXXXXXXXX 18 F F F 2r3y 2 D,E,F D,E,F Synthetic peptide YWF DNRLGLVYWF 10 T 2.5 DUF3325 pdbhh F T 2r5b 2 D,E,F H,K,L HIV entry inhibitor PIE7 XXGXXXXXXXXXXXXXX 17 F F F 2r5d 2 D,E,F H,K,L HIV entry inhibitor PIE7 XXGXXXXXXXXXXXXXX 17 F F F 2r5m 2 B L peptide R(me)KS RKS 3 T 280 VP9 pdbhh F F 2r9b 2 B,D C,D peptide-based inhibitor KPFSXLQF 8 T 15 PPARgamma_N pdbhh F T 2r9q 2 E X Synthetic peptide 1 SNPACVA 7 T 0.58 MFA1_2 pdbhh F T 2r9q 3 F Y Synthetic peptide 2 VEVPLAGAV 9 T 24 BAMBI_C pdbhh F T 2rd4 3 C C pentapeptide inhibitor LVFFA 5 T 70 DUF4577 pdbhh F F 2rdl 2 C,D I,J METHOXYSUCCINYL-ALA-ALA-PRO-ALA-CHLOROMETHYLKETONE INHIBITOR XAAPXX 6 T 950 A_amylase_inhib pdbhh F F 2rem 2 D T 8 residue peptide XXXXXXXX 8 F F F 2rfi 2 C,D P,Q Histone H3 TKQTARKSTGG 11 T 2.2 Histone pdbhh F T 2rhi 2 B B H15_HUMAN Histone H1.5 KATKK 5 T 610 PC4 pdbhh F Eukaryota F 2rje 2 D,E P,Q H4_HUMAN Histone H4 AKRHRKVLRDN 11 T 0.27 UPF0137 unp F Eukaryota T 2rjf 2 B,D B,D Histone H4 YKGGAKRHRKVLRDNIQGIT 20 T 8.6 DUF1938 pdbhh F T 2rky 2 C,D B,D FNBA_STAA8 STAPHYLOCOCCUS AUREUS FIBRONECTIN BINDING PROTEIN, FNBP NEKNGPIIQNNKFEYKEDTIKET 23 T 6.9 IPU_b_solenoid pdbhh F Bacteria T 2rkz 2 G,H,I,J,K,L M,N,O,P,Q,R FNBA_STAA8 FNBPA XETLTGQYDKNLVTTVEEEYDSX 23 T 40 DUF1372 pdbhh F Bacteria T 2rl0 2 B,D,F,H,J,L G,C,E,H,J,L FNBA_STAA8 STAPHYLOCOCCUS AUREUS FIBRONECTIN BINDING PROTEIN, FNBP GQVTTESNLVEFDEESTK 18 T 0.015 Fn_bind unppssm F Bacteria T 2rlg 1 A A antimicrobial peptide RP-1 ALYKKFKKKLLKSLKRLG 18 T 4 NAC pdbhh F T 2rlh 1 A A antimicrobial peptide RP-1 ALYKKFKKKLLKSLKRLG 18 T 4 NAC pdbhh F T 2rlj 1 A A VGP_EBOZM Envelope glycoprotein GAAIGLAWIPYFGPAA 16 T 0.89 DUF4855 pdbhh T Viruses T 2rll 1 A A CCR5_HUMAN C-C CKR-5, CC-CKR-5, CCR-5, CCR5, HIV-1 FUSION CORECEPTOR, CHEMR13, CD195 ANTIGEN SPIYDINYY 9 T 3.1 Pico_P1A pdbhh F Eukaryota T 2rlw 1 A A P71469_LACPN BACTERIOCIN PEPTIDE PLNF VFHAYSARGVRNNYKSAVGPADWVISAVRGFIHG 34 T 0.0046 Bacteriocin_IIc unp F Bacteria T 2rly 2 B P FMN1_MOUSE LIMB DEFORMITY PROTEIN PTPPPLPP 8 T 4.1 SCIMP pdbhh F Eukaryota F 2rm0 2 B P FMN1_MOUSE LIMB DEFORMITY PROTEIN PPPLIPPPP 9 T 2.6 Adeno_E4 pdbhh F Eukaryota F 2rma 2 B,D,F,H,J,L,N,P,R,T B,D,F,H,J,L,N,P,R,T CICLOSPORIN, CICLOSPORINE XXXXXXXXVXA 11 T 9.7 DUF6090 pdbhh F F 2rmb 2 B,D,F,H,J,L,N,P,R,T B,D,F,H,J,L,N,P,R,T CICLOSPORIN, CICLOSPORINE XXXXXXXXVXA 11 T 8 IncD pdbhh F F 2rmc 2 B,D,F,H B,D,F,H CICLOSPORIN, CICLOSPORINE XXXXXXXXVXA 11 T 9.7 DUF6090 pdbhh F F 2rmp 2 B B PEPSTATIN XVVXAX 6 T 1700 FAM60A pdbhh F F 2rmx 2 B B NKG2A_HUMAN NKG2-A/B-ACTIVATING NK RECEPTOR, NK CELL RECEPTOR A, CD159A ANTIGEN MDNQGVIXSDLNLPP 15 T 10 MAP pdbhh F Eukaryota T 2rny 2 B B H4_HUMAN Histone H4 GGAKRHRXVLRDNIQ 15 T 0.27 UPF0137 unp F Eukaryota T 2ror 2 B B LCP2_HUMAN SH2 DOMAIN-CONTAINING LEUKOCYTE PROTEIN OF 76 KDA, SLP-76 TYROSINE PHOSPHOPROTEIN, SLP76 GEDDGDXESPNEEEE 15 T 0.087 SDA1 unppercent F Eukaryota T 2rp5 1 A,B A,B CEP1_CAEEL TRANSCRIPTION FACTOR CEP-1 GPLGSHENCQSPSMKRSRCTNYSFRTLTLSTAEYTKVVEFLAREAKVPRYTWVPTQVVSHILPTEGLERFLTAIKAGHDSVLFNANGIYTMGDMIREFEKHNDIFERIGIDSSKLSKYYEAFLSFYRIQEAMKLPK 136 T 0.0011 SAM_2 pdbpssm F Eukaryota T 2rpa 1 A A KTNA1_MOUSE KATANIN P60 SUBUNIT A1, P60 KATANIN, LIPOTRANSIN GSDHMTMSLQMIVENVKLAREYALLGNYDSAMVYYQGVLDQMNKYLYSVKDTHLRQKWQQVWQEINVEAKQVKDIMKT 78 T 0.00078 MIT pdbpercent F Eukaryota T 2rpn 2 B B ARK1_YEAST Actin-regulating kinase 1 AKKTKPTPPPKPSHLKPK 18 T 9.6 Dynein_attach_N pdbhh F Eukaryota T 2rpq 2 B B MCAF1_HUMAN ATFA-ASSOCIATED MODULATOR, HAM, ATF-INTERACTING PROTEIN, ATF-IP, MBD1-CONTAINING CHROMATIN-ASSOCIATED FACTOR 1, P621 GSPEFKTIDASVSKKAADSTSQCGKATGSDSSGVIDLTMDDEESGASQD 49 T 18 Tox-PLDMTX pdbhh F Eukaryota T 2rps 1 A A B7XBA7_MYTSE Chemokine SVQILRCPDGMQMLRSGQCVATTEPPFDPDSY 32 T 0.13 Bowman-Birk_leg unppssm F Eukaryota T 2rqo 1 A A polytheonamide B XGXGXXXXXXAGAXAXXGAGXXXXAGGXIXXXGXIXVXAXVXVXXXQXT 49 F F T 2rqw 2 B B STE20_YEAST STE20P-PRR PEPTIDE SSSANGKFIPSRPAPKPPSSASAS 24 T 0.00039 TFIIA unppssm F Eukaryota T 2rr3 2 B B OSBP1_HUMAN OSBP PLGSDHWGKGDMSDEDDENEFFDAPEIITMPENLGHKRTGSHHHHHH 47 T 10 DUF1180 pdbhh F Eukaryota T 2rs9 1 A A H4_HUMAN H4K5AC SGRGXGGKGL 10 T 4.7 G3P_acyltransf pdbhh F Eukaryota T 2rsk 2 C,D C,D PRIO_BOVIN partial binding peptide of Major prion protein GQWNKPSKPKTN 12 T 0.19 OATP unppssm F Eukaryota T 2rt4 1 A A AF.2A1 GVVRQWSGYDPRTGTWRSSIAYGGG 25 T 0.7 XPB_DRD pdbhh F T 2rt5 2 B B NCOR2_HUMAN peptide from Silencing mediator of retinoic acid and thyroid hormone receptor YETLSDSE 8 T 7.1 Lactococcin pdbhh F Eukaryota T 2rtv 1 A A TAC1_TACTR TACHYPLESIN I KWCFRVCYRGICYRRCRX 18 T 0.021 Myticin-prepro unp F Eukaryota T 2ru7 2 C,D C,D PRIO_BOVIN PRP, MAJOR SCRAPIE-ASSOCIATED FIBRIL PROTEIN 1 GQWNKPSKPKTN 12 T 0.19 OATP unppssm F Eukaryota T 2rui 2 B B Boc-LPAT* XLPAX 5 T 540 CFAP91 pdbhh F F 2rvb 1 A A XPC_HUMAN XERODERMA PIGMENTOSUM GROUP C-COMPLEMENTING PROTEIN, P125 GSHMAHHLKRGATMNEDSNEEEEESENDWEEVEELSEPVLGDVRESTAFSRS 52 T 0.043 DUF5810 pdbpercent F Eukaryota T 2rvd 1 A A CLN025 YYDPETGTWY 10 T 0.011 OCRE pdb F T 2seb 4 D E CO2A1_HUMAN PEPTIDE FROM COLLAGEN II AYMRADAAAGGA 12 T 0.022 DUF2600 unphh F Eukaryota T 2sem 2 C,D C,D PROTEIN (SH3 PEPTOID INHIBITOR) XPPPVXPRR 9 T 33 DUF6131 pdbhh F F 2soc 1 A _ OCTREOTIDE XCFXKTCX 8 T 0.0019 Urotensin_II pdbhh F F 2uud 3 E,F S,T NQ10-1.12 ANTI-PHOX ANTIBODY STSSGGGGSGGGGSGGSA 18 T 10 DUF917 pdbhh F F 2uue 3 E,F E,F GVC-TETRAPEPTIDE INHIBITOR RLIXX 5 T 110 Pellino pdbhh F F 2uux 1 A A Q1EG59_RHIAP TRYPTASE INHIBITOR TDPI AAECTVPIGWSEPVKGLCKARFTRYYCMGNCCKVYEGCYTGGYSRMGECARNCPG 55 T 0.03 DUF3788 unppssm F Eukaryota T 2uuy 2 B B Q1EG59_RHIAP TRYPTASE INHIBITOR TDPI CTVPIGWSEPVKGLCKARFTRYYCMGNCCKVYEGCYTGGYSRMGECARNCPA 52 T 0.03 DUF3788 unppssm F Eukaryota T 2uw9 2 B C GSK3B_HUMAN GSK3-BETA PEPTIDE, GSK-3 BETA GRPRTTSFAE 10 T 6.3 DUF3084 pdbhh F Eukaryota T 2uwe 3 C,H C,J EMC7_HUMAN SELF-PEPTIDE, P1049 ALWGFFPVL 9 T 0.51 MRP-L47 pdbhh F Eukaryota T 2v17 1 A A PEPTIDE FRAGMENT TDHGAE 6 T 68 Exo_endo_phos pdbhh F T 2v1r 2 C,D,E P,Q,R PEX14_YEAST PEX14 XEAMPPTLPHRDWKD 15 T 3.4E-05 DUF1664 unphh F Eukaryota T 2v1s 2 H,I,J,K,L,M,N H,I,J,K,L,M,N ALDH2_RAT ALDH CLASS 2, ALDH1, ALDH-E2 GPRLSRLLSYAGX 13 T 8.8 TFIID_30kDa pdbhh F Eukaryota T 2v1t 2 C,D C,D ALDH2_RAT ALDH CLASS 2, ALDH1, ALDH-E2 GPRLSRLLSAAGX 13 T 5.4 Atypical_Card pdbhh F Eukaryota T 2v2x 3 C,F C,F HIV P17 SLFNTVATL 9 T 0.0057 Gag_p17 pdbhh F T 2v3s 2 C,D C,D WNK4_HUMAN PROTEIN KINASE WITH NO LYSINE 4, PROTEIN KINASE LYSINE-DEFICIENT 4 GRFQVT 6 T 36 DUF3446 pdbhh F Eukaryota T 2v3x 2 B B TRIPEPTIDE (VALINE-PROLINE-LEUCINE) VPL 3 T 230 YebF pdbhh F F 2v3y 2 B B TRIPEPTIDE (VALINE-PROLINE-LEUCINE) VPL 3 T 230 YebF pdbhh F F 2v3z 2 B B TRIPEPTIDE (VALINE-PROLINE-LEUCINE) VPL 3 T 230 YebF pdbhh F F 2v5f 2 B X HEXA-HISTIDINE PEPTIDE HHHHHH 6 T 6100 zf_CCCH_4 pdbhh F F 2v5w 2 C G GLYCYL-GLYCYL-GLYCINE GGG 3 T 79 FTCD_C pdbhh F F 2v5w 3 D,E I,L P53_HUMAN PEPTIDIC SUBSTRATE XRHXXX 6 T 360 Viral_helicase1 pdbhh F Eukaryota F 2v64 2 B,G,I B,G,I MBP1 SWYSYPPPQRAV 12 T 8.6 NADHdh_A3 pdbhh F T 2v7x 1 A,B,C A,B,C FLA_STRCT 5'-FLUORO-5'-DEOXY ADENOSINE SYNTHETASE MAANSTRRPIIAFMSDLGTTDDSVAQCKGLMYSICPDVTVVDVCHSMTPWDVEEGARYIVDLPRFFPEGTVFATTTYPATGTTTRSVAVRIKQAAKGGARGQWAGSGAGFERAEGSYIYIAPNNGLLTTVLEEHGYLEAYEVTSPKVIPEQPEPTFYAREMVAIPSAHLAAGFPLSEVGRPLEDHEIVRFNRPAVEQDGEALVGVVSAIDHPFGNVWTNIHRTDLEKAGIGYGARLRLTLDGVLPFEAPLTPTFADAGEIGNIAIYLNSRGYLSIARNAASLAYPYHLKEGMSARVEAR 299 T 3.6E-42 SAM_adeno_trans pdbpercent F Bacteria T 2v8c 2 B C VASP_MOUSE VASP GPPPPPGPPPPPGPPPPPGL 20 T 11 MIC19_MIC25 unppssm F Eukaryota F 2v9k 1 A A PUS10_HUMAN PSEUDOURIDINE SYNTHASE GMFPLTEENKHVAQLLLNTGTCPRCIFRFCGVDFHAPYKLPYKELLNELQKFLETEKDELILEVMNPPPKKIRLQELEDSIDNLSQNGEGRISVSHVGSTASKNSNLNVCNVCLGILQEFCEKDFIKKVCQKVEASGFEFTSLVFSVSFPPQLSVREHAAWLLVKQEMGKQSLSLGRDDIVQLKEAYKWITHPLFSEELGVPIDGKSLFEVSVVFAHPETVEDCHFLAAICPDCFKPAKNKQSVFTRMAVMKALNKIKEEDFLKQFPCPPNSPKAVCAVLEIECAHGAVFVAGRYNKYSRNLPQTPWIIDGERKLESSVEELISDHLLAVFKAESFNFSSSGREDVDVRTLGNGRPFAIELVNPHRVHFTSQEIKELQQKINNSSNKIQVRDLQLVTREAIGHMKEGEEEKTKTYSALIWTNKAIQKKDIEFLNDIKDLKIDQKTPLRVLHRRPLAVRARVIHFMETQYVDEHHFRLHLKTQAGTYIKEFVHGDFGRTKPNIGSLMNVTADILELDVESVDVDWPPALDD 530 T 0.00012 TruB_N unphh F Eukaryota T 2vda 2 B B LAMB_ECOL6 MALTOSE-INDUCIBLE PORIN MMITLRKRRKLPLAVAVAAGVMSAQAMA 28 T 0.4 PAGK pdbhh F Bacteria T 2vdn 3 C C MPT HRG GLY ASP TRP PRO CYS NH2 XXGDWPCX 8 T 5.3 Ferlin_C pdbhh F T 2vdo 3 C C FIBG_HUMAN FIBRINOGEN, GAMMA POLYPEPTIDE HHLGGAKQAGDV 12 T 37 Tox-HNH-HHH pdbhh F Eukaryota T 2vdp 3 C C FIBG_HUMAN FIBRINOGEN LGGAKQAGDV 10 T 56 DUF5974 pdbhh F Eukaryota T 2vdq 3 C C FIBG_HUMAN FIBRINOGEN, GAMMA POLYPEPTIDE HHLGGAKQRGDV 12 T 5.1 DUF6305 pdbhh F Eukaryota T 2vdr 3 C C FIBG_HUMAN FIBRINOGEN LGGAKQRGDV 10 T 69 DUF5974 pdbhh F Eukaryota T 2ve6 3 C,F,I,L C,F,I,L SENDAI VIRUS EPITOPE RESIDUES 324-332 MODIFIED AT P7 FAPGNYXAL 9 T 0.12 Paramyxo_ncap pdbhh F T 2vf1 1 A,B A,B Q9Q1V2_9VIRU CAPSID PROTEIN DWSWYAPSELVAKQIANVPFNVLAGTPIKASVHLRYDPSLVSGLKDQLFVGNNASIMGARLLYLPSFGISTTVLDGLSMAANQLYAYVRKSNSGAKVYEAPDLMMTVLAIQEAYRVLFEIRRAITFANYWNFWNKYLPKQVFEQLLAIDFDDLMSNKANYCAQFNLMAQKINTFALPKYFKSILRMAYVSSNIFMDSDAVTGQMYAFVSSGYYRYSATTSESGTSLVYRDWPVGAAMPRKLNRLFTVLRELLDAIYGDADAQTMFGDIYKAFGSDGLYSIAEISVDETSTPVFDVDILAQIENCTILEANAGLAWTLDSCNVTQSKGQVLLWQPTGTITSSDNTEHIAGDIAVALGDRVLNSHIMEPQYSDVLEWTRLMATIEFDKASVTSSEKVTFKVTSCGAELIRNVLYFKNVWNDAAEDASQRVITYFSHFSQITVTNATDDPTSAYGLMSNTLDFTQLDWHPIIYVTETSVHNVANLNSILIGGDLKRPTVITTDVVKRINSAANYALYYSANLLSNIST 525 T 4 R2K_2 pdb T Viruses T 2vgc 1 A A CTRA_BOVIN GAMMA CHYMOTRYPSIN CGVPAIQPVLSGL 13 T 2 CaM_bind pdbhh F Eukaryota T 2vh3 1 A A RANSM_POLLE RSF-1 AXACSFPPSEIPGSKECLAEALQKHQGFKKKSYALICAYLNYKEDAENYERAAEDFDSAVKCTGCKEGVDLHEGNPELIEEGFEKFLASLKIDRKALGSLCTLFQKLXAIPHN 113 T 0.17 Oxidored-like pdbpssm F Eukaryota T 2vh3 2 B B RANSM_POLLE RSF-1 AXACSFPPXEIPGSKECLAEALQKHQGFKKKSYALICAYLNYKEDAENYERAAEDFDSAVKCTGCKEGVDLHEGNPELIEEGFEKFLASLKIDRKALGSLCTLFQKLYAIPHN 113 T 4.2 PHtD_u1 pdbhh F Eukaryota T 2vif 2 B P KIT_HUMAN SCFR, PROTO-ONCOGENE TYROSINE-PROTEIN KINASE KIT, C-KIT, CD117 ANTIGEN NGNNXVYIDPT 11 T 1 DNA_pack_C pdbhh F Eukaryota T 2vj0 2 B P SYNJ1_HUMAN SYNAPTIC INOSITOL-1,4,5-TRISPHOSPHATE 5-PHOSPHATASE 1, SYNAPTOJANIN-1 P170 NPKGWVTFEEEE 12 T 0.37 Stonin2_N pdbhh F Eukaryota T 2vj0 3 C Q AMPH_RAT AMPHIPHYSIN1 FEDNFVP 7 T 0.0069 CCDC32 pdbhh F Eukaryota T 2vkn 2 B C PBS2_YEAST POLYMYXIN B RESISTANCE PROTEIN 2, SUPPRESSOR OF FLUORIDE SENSITIVITY 4, PBS2 NKPLPPLPLAGS 12 T 0.23 DUF4554 unppercent F Eukaryota T 2vln 2 B B CEA9_ECOLX COLICIN E9 MESKRNKPGKATGKGKPVGDKWLDDAGKDSGAPIPDRIADKLRDKEFKSFDDFRKAVWEEVSKDPELSKNLNPSAKSSVSKGYSPFTPKNQQVGGRKVYELHHDKPISQGGEVYDMDNIRVTTPKRHIDIHRGK 134 T 0.016 LHH unppercent F Bacteria T 2vlo 2 B B CEA9_ECOLX COLICIN E9 MESKRNKPGKATGKGKPVGDKWLDDAGKDSGAPIPDRIADKLRDKEFKSFDDFRKAVWEEVSKDPELSKNLNPSNKSSVSKGYSPFTPKNQQVGGRAVYELHHDKPISQGGEVYDMDNIRVTTPKRHIDIHRGK 134 T 0.016 LHH unppercent F Bacteria T 2vlp 2 B B CEA9_ECOLX COLICIN E9 MESKRNKPGKATGKGKPVGDKWLDDAGKDSGAPIPDRIADKLRDKEFKSFDDFAKAVWEEVSKDPELSKNLNPSNKSSVSKGYSPFTPKNQQVGGRKVYELHHDKPISQGGEVYDMDNIRVTTPKRHIDIHRGK 134 T 0.016 LHH unppercent F Bacteria T 2vlq 2 B B CEA9_ECOLX COLICIN E9 MESKRNKPGKATGKGKPVGDKWLDDAGKDSGAPIPDRIADKLRDKEFKSFDDFRKAVWEEVSKDPELSKNLNPSNKSSVSKGYSPATPKNQQVGGRKVYELHHDKPISQGGEVYDMDNIRVTTPKRHIDIHRGK 134 T 0.00018 LHH pdbpercent F Bacteria T 2voy 5 E E AT2A1_RABIT CA2+-ATPASE, SERCA1, COPA DELTA C TAFVEPFVILLILIANAIVGVWQERNAENA 30 T 0.078 Chi-conotoxin pdbpercent F Eukaryota T 2voy 11 K K AT2A1_RABIT CA2+-ATPASE, SERCA1, COPA DELTA C EGRAIYNNMKQFIRYLISSNVGEVVCIFLTAA 32 T 0.018 PhoLip_ATPase_C unphh F Eukaryota T 2vp7 2 B B BCL9_HUMAN B-CELL LYMPHOMA 9 PROTEIN, BCL-9, PROTEIN LEGLESS HOMOLOG, BCL9 AAKVVYVFSTEMANKAAEAVLKGQVETIVSFHI 33 T 0.24 Ribosomal_L23eN pdb F Eukaryota T 2vpb 2 B B BCL9_HUMAN B-CELL LYMPHOMA 9 PROTEIN, BCL-9, PROTEIN LEGLESS HOMOLOG, BCL9 AMAAKVVYVFSTEMANKAAEAVLKGQVETIVSFHI 35 T 0.26 Ribosomal_L23eN pdb F Eukaryota T 2vpd 2 B,D B,D BCL9_HUMAN BCL9, B-CELL LYMPHOMA 9 PROTEIN, PROTEIN LEGLESS HOMOLOG AMAAKVVYVFSTEMANKAAEAVLKGQVETIVSFHI 35 T 0.26 Ribosomal_L23eN pdb F Eukaryota T 2vpe 2 B,D B,D BCL9_HUMAN B-CELL LYMPHOMA 9 PROTEIN, BCL-9, PROTEIN LEGLESS HOMOLOG, BCL9 GAMVYVFSTEMANKAAEAVLKGQVETIVSFHI 32 T 0.21 Ribosomal_L23eN pdb F Eukaryota T 2vpg 2 B,D B,D BCL9_HUMAN B-CELL LYMPHOMA 9 PROTEIN, BCL-9, PROTEIN LEGLESS HOMOLOG, BCL9 GAMVYVFSTEMANKAAEAVLKGQVETIVSFHI 32 T 0.21 Ribosomal_L23eN pdb F Eukaryota T 2vr3 2 C,D C,D FIBG_HUMAN FIBRINOGEN GAMMA-CHAIN QHHLGGAKQAGAV 13 T 17 Tox-HNH-HHH pdbhh F Eukaryota T 2vsl 2 B B PEPTIDE (MAA-LYS-PRO-PHE) XKPF 4 T 69 Cas6_N pdbhh F F 2vum 13 M M AAMAT_AMAPH ALPHA AMANITIN, GAMMA-AMANITIN NPXXGIGC 8 T 0.55 Wzy_C pdbhh F Eukaryota T 2vvd 1 A A SPIKE_BPPM2 P1-RECEPTOR BINDING PROTEIN VNYWVSDEEIRVFKEYSARAKYAQNEGRTALEANNVPFFDIDVPPELDGVPFSLKARVRHKSKGVDGLGDYTSISVKPAFYITEGDETTDTLIKYTSYGSTGSHSGYDFDDNTLDVMVTLSAGVHRVFPVETELDYDAVQEVQHDWYDESFTTFIEVYSDDPLLTVKGYAQILMERT 177 T 0.58 PNPase_C pdbpercent T Viruses T 2vve 1 A,B A,B SPIKE_BPPM2 P1-RECEPTOR BINDING PROTEIN SFQEQTTKSRDVNSFQIPLRDGVRELLPEDASRNRASIKSPVDIWIGGENMTALNGIVDGGRKFEAGQEFQINTFGSVNYWVSDEEIRVFKEYSARAKYAQNEGRTALEANNVPFFDIDVPPELDGVPFSLKARVRHKSKGVDGLGDYTSISVKPAFYITEGDETTDTLIKYTSYGSTGSHSGYDFDDNTLDVMVTLSAGVHRVFPVETELDYDAVQEVQHDWYDESFTTFIEVYSDDPLLTVKGYAQILMERT 254 T 3.3 Phage_T4_Ndd pdbpssm T Viruses T 2vwf 2 B B GAB2_HUMAN GROWTH FACTOR RECEPTOR BOUND PROTEIN 2-ASSOCIATED PROTEIN 2, GRB2-ASSOCIATED BINDER 2, PP100, GAB2 IQPPPVNRNLKPDRK 15 T 12 DUF5898 pdbhh F Eukaryota T 2vxc 2 C C H2A1 PEPTIDE SQEL 4 T 200 DUF4205 pdbhh F F 2vxg 1 A,B A,B EDC4_DROME LD41624, GE-1 GAMGDSIKQLLMAGQINKAFHQALLANDLGLVEFTLRHTDSNQAFAPEGCRLEQKVLLSLIQQISADMTNHNELKQRYLNEALLAINMADPITREHAPKVLTELYRNCQQFIKNSPKNSQFSNVRLLMKAIITYRDQLK 139 T 0.0069 Csm2_III-A pdb F Eukaryota T 2vzd 2 C,D C,D PAXI_HUMAN PAXILLIN MDDLDALLADLESTTSHISK 20 T 0.036 DUF883 pdb F Eukaryota T 2vzi 1 A A PAXI_HUMAN Paxillin,Paxillin ATRELDELMASLSDFKFMAQ 20 T 1.2 SAM_LFY pdbhh F Eukaryota T 2w0c 2 K,K10,K11,K12,K13,K14,K15,K16,K17,K18,K19,K2,K20,K21,K22,K23,K24,K25,K26,K27,K28,K29,K3,K30,K31,K32,K33,K34,K35,K36,K37,K38,K39,K4,K40,K41,K42,K43,K44,K45,K46,K47,K48,K49,K5,K50,K51,K52,K53,K54,K55,K56,K57,K58,K59,K6,K60,K7,K8,K9 L,L,L,L,L,L,L,L,L,L,L,L,L,L,L,L,L,L,L,L,L,L,L,L,L,L,L,L,L,L,L,L,L,L,L,L,L,L,L,L,L,L,L,L,L,L,L,L,L,L,L,L,L,L,L,L,L,L,L,L SPIKE_BPPM2 PROTEIN 2 MIVKKKLAAGEFAETFKNGNNITIIKAVGELVLRAYGADGGEGLRTIVRQGVSIKGMNYTSVMLHTEYAQEIEYWVGDLDYSFQEQTTKSRDVNSFQIPLRDGVRELLPEDASRNRASIKSPVDIWIGGENMTALNGIVDGGRKFEAGQEFQINTFGSVNYWVSDEEIRVFKEYSARAKYAQNEGRTALEANNVPFFDIDVPPELDGVPFSLKARVRHKSKGVDGLGDYTSISVKPAFYITEGDETTDTLIKYTSYGSTGSHSGYDFDDNTLDVMVTLSAGVHRVFPVETELDYDAVQEVQHDWYDESFTTFIEVYSDDPLLTVKGYAQILMERT 335 T 6 DUF4115 pdbhh T Viruses T 2w0c 3 L,L10,L11,L12,L13,L14,L15,L16,L17,L18,L19,L2,L20,L21,L22,L23,L24,L25,L26,L27,L28,L29,L3,L30,L31,L32,L33,L34,L35,L36,L37,L38,L39,L4,L40,L41,L42,L43,L44,L45,L46,L47,L48,L49,L5,L50,L51,L52,L53,L54,L55,L56,L57,L58,L59,L6,L60,L7,L8,L9,M,M10,M11,M12,M13,M14,M15,M16,M17,M18,M19,M2,M20,M21,M22,M23,M24,M25,M26,M27,M28,M29,M3,M30,M31,M32,M33,M34,M35,M36,M37,M38,M39,M4,M40,M41,M42,M43,M44,M45,M46,M47,M48,M49,M5,M50,M51,M52,M53,M54,M55,M56,M57,M58,M59,M6,M60,M7,M8,M9,N,N10,N11,N12,N13,N14,N15,N16,N17,N18,N19,N2,N20,N21,N22,N23,N24,N25,N26,N27,N28,N29,N3,N30,N31,N32,N33,N34,N35,N36,N37,N38,N39,N4,N40,N41,N42,N43,N44,N45,N46,N47,N48,N49,N5,N50,N51,N52,N53,N54,N55,N56,N57,N58,N59,N6,N60,N7,N8,N9,O,O10,O11,O12,O13,O14,O15,O16,O17,O18,O19,O2,O20,O21,O22,O23,O24,O25,O26,O27,O28,O29,O3,O30,O31,O32,O33,O34,O35,O36,O37,O38,O39,O4,O40,O41,O42,O43,O44,O45,O46,O47,O48,O49,O5,O50,O51,O52,O53,O54,O55,O56,O57,O58,O59,O6,O60,O7,O8,O9 P,P,P,P,P,P,P,P,P,P,P,P,P,P,P,P,P,P,P,P,P,P,P,P,P,P,P,P,P,P,P,P,P,P,P,P,P,P,P,P,P,P,P,P,P,P,P,P,P,P,P,P,P,P,P,P,P,P,P,P,Q,Q,Q,Q,Q,Q,Q,Q,Q,Q,Q,Q,Q,Q,Q,Q,Q,Q,Q,Q,Q,Q,Q,Q,Q,Q,Q,Q,Q,Q,Q,Q,Q,Q,Q,Q,Q,Q,Q,Q,Q,Q,Q,Q,Q,Q,Q,Q,Q,Q,Q,Q,Q,Q,Q,Q,Q,Q,Q,Q,R,R,R,R,R,R,R,R,R,R,R,R,R,R,R,R,R,R,R,R,R,R,R,R,R,R,R,R,R,R,R,R,R,R,R,R,R,R,R,R,R,R,R,R,R,R,R,R,R,R,R,R,R,R,R,R,R,R,R,R,S,S,S,S,S,S,S,S,S,S,S,S,S,S,S,S,S,S,S,S,S,S,S,S,S,S,S,S,S,S,S,S,S,S,S,S,S,S,S,S,S,S,S,S,S,S,S,S,S,S,S,S,S,S,S,S,S,S,S,S P3_BPPM2 PROTEIN III MNTSVPTSVPTNQSVWGNVSTGLDALISGWARVEQIKAAKASTGQGRVEQAMTPELDNGAAVVVEAPKKAAQPSETLVFGVPQKTLLLGFGGLLVLGLVMRGNK 104 T 0.069 RseC_MucC pdb T Viruses T 2w0c 4 P,P10,P11,P12,P13,P14,P15,P16,P17,P18,P19,P2,P20,P21,P22,P23,P24,P25,P26,P27,P28,P29,P3,P30,P31,P32,P33,P34,P35,P36,P37,P38,P39,P4,P40,P41,P42,P43,P44,P45,P46,P47,P48,P49,P5,P50,P51,P52,P53,P54,P55,P56,P57,P58,P59,P6,P60,P7,P8,P9 T,T,T,T,T,T,T,T,T,T,T,T,T,T,T,T,T,T,T,T,T,T,T,T,T,T,T,T,T,T,T,T,T,T,T,T,T,T,T,T,T,T,T,T,T,T,T,T,T,T,T,T,T,T,T,T,T,T,T,T P6_BPPM2 PROTEIN VI MANFLTKNFVWILAAGVGVWFYQKADNAAKTATKPIADFLAELQFLVNGSNYVKFPNAGFVLTRDALQDDFIAYDDRIKAWLGTHDRHKDFLAEILDHERRVKPVYRKLIGNIIDASTIRAASGVEL 127 T 0.06 Phageshock_PspG pdbpssm T Viruses T 2w0p 2 C C FBLI1_HUMAN MITOGEN-INDUCIBLE 2-INTERACTING PROTEIN, MIGFILIN PEKRVASSVFITLAP 15 T 12 Ycf15 pdbhh F Eukaryota T 2w0t 1 A A LMBL2_HUMAN L(3)MBT-LIKE 2 PROTEIN, H-L(3)MBT-LIKE PROTEIN GSGSEPAVCEMCGIVGTREAFFSKTKRFCSVSCSRSYSSNSKK 43 T 0.0088 zf-FCS pdbhh F Eukaryota T 2w0z 2 B B GAB2_HUMAN GROWTH FACTOR RECEPTOR BOUND PROTEIN 2-ASSOCIATED PROTEIN 2, GRB2-ASSOCIATED BINDER 2, PP100, GAB2 APPPRPPKP 9 T 0.45 Apidaecin pdbhh F Eukaryota F 2w10 2 C,D C,D PTN23_MOUSE HD-PTP PPPRPTAPKPLL 12 T 7.3 UPF0449 pdbhh F Eukaryota T 2w16 2 C C DSN-ARG-DSN-FHO-LYS-FHO-THR-THR XRXXKXTT 8 T 12 DapH_N pdbhh F F 2w3o 2 C,D C,D XRCC1_HUMAN X-RAY REPAIR CROSS-COMPLEMENTING PROTEIN 1, XRCC1-DERIVED PHOSPHOPEPTIDE YAGSTDEN 8 T 12 M3 pdbhh F Eukaryota T 2w5v 1 A,B A,B Q9KWY4_9BACT TAB5 ALKALINE PHOSPHATASE MUTANT MKLKKIVFTLIALGLFSCKTTSVLVKNEPQLKTPKNVILLISDGAGLSQISSTFYFKSGTPNYTQFKNIGLIKTSSSREDVTDSASGATAFSCGIKTYNAAIGVADDSTAVKSIVEIAALNNIKTGVVATSSITDATPASFYAHALNRGLEEEIAMDMTESDLDFFAGGGLNYFTKRKDKKDVLAILKGNQFTINTTALTDFSSIASNRKMGFLLADEAMPTMEKGRGNFLSAATDLAIQFLSKDNSAFFIMSEGSQIDWGGHANNASYLISEINDFDDAIGTALAFAKKDGNTLVIVTSDHETGGFTLAAKKNKREDGSEYSDYTEIGPTFSTGGHSATLIPVFAYGPGSEEFIGIYENNEIFHKILKVTKWNQ 375 T 2.1E-11 Alk_phosphatase pdbpssm F Bacteria. T 2w5w 1 A,B A,B Q9KWY4_9BACT TAB5 ALKALINE PHOSPHATASE MUTANT MKLKKIVFTLIALGLFSCKTTSVLVKNEPQLKTPKNVILLISDGAGLSQISSTFYFKSGTPNYTQFKNIGLIKTSSSREDVTDSASGATAFSCGIKTYNAAIGVADDSTAVKSIVEIAALNNIKTGVVATSSITDATPASFYAHALNRGLEEEIAMDMTESDLDFFAGGGLNYFTKRKDKKDVLAILKGNQFTINTTALTDFSSIASNRKMGFLLADEAMPTMEKGRGNFLSAATDLAIQFLSKDNSAFFIMSEGSQIDWGGHANNASYLISEINDFDDAIGTALAFAKKDGNTLVIVTSDHETGGFTLAAKKNKREDGSEYSDYTEIGPTFSTGGHSATLIPVFAYGPGSEEFIGIYENNEIFHKILKVTKWNQ 375 T 2.1E-11 Alk_phosphatase pdbpssm F Bacteria. T 2w5x 1 A,B A,B Q9KWY4_9BACT TAB5 ALKALINE PHOSPHATASE MUTANT MKLKKIVFTLIALGLFSCKTTSVLVKNEPQLKTPKNVILLISDGAGLSQISSTFYFKSGTPNYTQFKNIGLIKTSSSREDVTDSASGATAFSCGIKTYNAAIGVADDSTAVKSIVEIAALNNIKTGVVATSSITEATPASFYAHALNRGLEEEIAMDMTESDLDFFAGGGLNYFTKRKDKKDVLAILKGNQFTINTTALTDFSSIASNRKMGFLLADEAMPTMEKGRGNFLSAATDLAIQFLSKDNSAFFIMSEGSQIDWGGHANNASYLISEINDFDDAIGTALAFAKKDGNTLVIVTSDHETGGFTLAAKKNKREDGSEYSDYTEIGPTFSTGGHSATLIPVFAYGPGSEEFIGIYENNEIFHKILKVTKWNQ 375 T 2.4E-11 Alk_phosphatase unppssm F Bacteria. T 2w65 3 E,F E,F COLLAGEN DERIVED PEPTIDE PCII-CIT1 AXGLTGRPG 9 T 4.2 Glyco_hydro_15 pdbhh F T 2w6t 2 C C DSN-LYS-GLY-FHO-SER-DSN-GLY-ORN-FHO-SER XKGXSXGXXS 10 T 31 APC_u9 pdbhh F F 2w6u 2 C C PYOVERDIN G173 XAXXXXS 7 T 530 Allene_ox_cyc pdbhh F F 2w73 2 E,F,G,H K,L,M,O PP2BA_HUMAN CALMODULIN-DEPENDENT CALCINEURIN A SUBUNIT ALPHA ISOFORM, CAM-PRP CATALYTIC SUBUNIT VIRNKIRAIGKMARVFS 17 T 26 DUF5435 pdbhh F Eukaryota T 2w76 2 C C PYOVERDIN R XXXQXXG 7 T 15 DUF4691 pdbhh F F 2w77 2 C C PYOVERDIN 18-1 XKGXSXGKXS 10 T 45 Surfac_D-trimer pdbhh F F 2w78 2 C C SER-LYS-GLY-FHO-LYS-FH7-SER SKGXKXS 7 T 0.7 Microvir_J pdbhh F F 2w84 2 B B PEX5_HUMAN PEROXISOME RECEPTOR 1, PEROXISOMAL C-TERMINAL TARGETING SIGNAL IMPORT RECEPTOR, PTS1-BP, PEROXIN-5, PTS1 RECEPTOR, PTS1R, PEX5 GVADLALSENWAQEFLAAGD 20 T 11 Drf_GBD pdbhh F Eukaryota T 2w97 3 C,D E,F IF4G1_HUMAN P220, EIF-4-GAMMA 1, EIF-4G 1, EIF-4G1, EIF4GI KKRYDREFLLGFQF 14 T 0.00025 eIF_4G1 unphh F Eukaryota T 2w9r 2 B B DPS_ECOLI PEXB, VTM LVKSKATNLLY 11 T 0.1 7TMR-HDED unppercent F Bacteria T 2wa8 2 B,D B,D N-END RULE PEPTIDE FRSKGEELFT 10 T 14 DNAP_B_exo_N pdbhh F T 2wa9 2 H,I,J,K,L,M,N H,I,J,K,L,M,N TRP PEPTIDE LLT 3 T 940 Conotoxin pdbhh F F 2wb6 1 A A Y102_AFV1Y AFV1-102 MSYYHHHHHHLESTSLYKKAGSENLYFQGIVDKNKIVIPMSEFLDSMFLVIEKLGVHAEKKGSMIFLSSERVKLADWKQLGAMCSDCYHCKLPLSSFIEIVTRKAKDKFLVMYNEKEVTLVARGVQTIQK 130 T 0.069 DYW_deaminase pdbpercent T Viruses T 2wb7 1 A,B A,B PT26-6P MNATINDDDIDDVKKALDHATQAAHKAAAELTAKLRSDFVEYGNGGTAGQVLIHIYGPGLIYGFSAFPVQIRLEIPNQPVPFNKVHITEVTAYVIDENNRTYWTRVWNSSTFRQGGYIADTLDLVTVMKAPDPLVYQIRDAIVTGQISRELYDKIWNTSTTHFEIRVIVKGYQEAWKTDSSVSNQSSCPSDGHWYEDACWVHDKDIDFTLKAETTTAWGHVTGTNDVATIDGGMLGSLPIKFLQSLDLSGKWVLYQNKYAGALSDFIIITAASPVHVLNSTAMYKFLITPNPGYFQPANPKISDEYRFVTLRVIEGGRMELADTTTGHIGDLTEPTFFGLTAHYTDAPGTLDYHALGLVYAYVERDDGVKIPIWLAAEPMISVLSNTYTVMKDQDVKNLIDLYKKKDREKINATTKAMINSLQEKIDEAEQLLAKAKGMNNENAIEYAQGAIDEYKAAINDLQKAAQQDDYQMFLNYLNAAKKHEMAGDYYVNAARKALNGDLEQAKIDAEKAKEYSNLAKEYEPG 526 T 0.00057 TPR_12 pdbpssm F T 2wev 3 E,F E,F ARG-ARG-B3L-MEA XRRXXX 6 T 200 Rod_cone_degen pdbhh F F 2wfj 2 B B CYCLOSPORINE, CICLOSPORIN, CICLOSPORINE XXXXXXXXVXA 11 T 9.7 DUF6090 pdbhh F F 2wfy 3 E,F E,F ARG-ARG-B3L-PHE XRRXFX 6 T 200 Rod_cone_degen pdbhh F F 2wgo 1 A A B5DCK2_9NEOB RSN-2 GSLILDGDLLKDKLKLPVIDNLFGKELLDKFQDDIKDKYGVDTKDLKILKTSEDKRFYYVSVDAGDGEKCKFKIRKDVDVPKMVGRKCRKDDDDDDGY 98 T 8.4 CDI unphh F Eukaryota T 2wh0 2 E,F Q,R KPCE_HUMAN PKCEV3 DRSKSAPTSPCDQEIKELENNIRKALSFDNR 31 T 54 Arc_MA pdbhh F Eukaryota T 2whb 3 E,F E,F ARG-ARG-L3O-PFF RRXXX 5 T 60 TC1 pdbhh F F 2why 2 B B CORYNEBACTIN, 9-GLN-BETA-LIPOTROPIN XGTXGTXGT 9 T 22 Trp_dioxygenase pdbhh F F 2wjg 2 C C POLYALANINE AAAAAAA 7 T 270 DUF4179 pdbhh F F 2wma 3 E E CYCLIC RKLFN-NH2 RKLFN 5 T 48 Stomoxyn pdbhh F F 2wmb 3 E I LINEAR RKLFD RKLFD 5 T 40 Oscp1 pdbhh F F 2wo6 2 C C CRUM2_HUMAN ARTIFICIAL CONSENSUS SEQUENCE ARPGTPAL 8 T 6.1 Microvir_J pdbhh F Eukaryota T 2wp2 2 C,D P,Q H4_MOUSE HISTONE H4 SGRGXGGXGLGKGGAKRHRK 20 T 11 Shadoo unppercent F Eukaryota T 2wpm 2 B L GLU-GLY-ARG-CHLOROMETHYL KETONE EGR 3 T 210 zf-CCHC pdbhh F F 2wpt 2 B B CEA9_ECOLX E9 DNASE MESKRNKPGKATGKGKPVGDKWLDDAGKDSGAPIPDRIADKLRDKEFKSFDDFRKAVWEEVSKDPELSKNLNPSNKSSVSKGYSPFTPKNQQVGCRKVYELHHDKPISQGGEVYDMDNIRVTTPKRHIDIHRGK 134 T 0.016 LHH unppercent F Bacteria T 2wq4 1 A,B,C A,B,C B4EH86_BURCJ BC2L-C N-TERMINAL DOMAIN MPLLSASIVSAPVVTSETYVDIPGLYLDVAKAGIRDGKLQVILNVPTPYATGNNFPGIYFAIATNQGVVADGCFTYSSKVPESTGRMPFTLVATIDVGSGVTFVKGQWKSVRGSAMHIDSYASLSAIWGTAAPSSQGSGNQGAETGGTGAGNIGGG 156 T 0.002 DUF1543 unppercent F Bacteria T 2wsc 18 R R PHOTOSYSTEM I-N SUBUNIT XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 53 F F F 2wse 18 R R PHOTOSYSTEM I-N SUBUNIT XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 53 F F F 2wsf 18 R R PHOTOSYSTEM I-N SUBUNIT XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 53 F F F 2wub 2 B,D B,D HGFA_HUMAN HEPATOCYTE GROWTH FACTOR ACTIVATOR SHORT CHAIN VQLSPDLLATLPEPASPGRQACGRRHKKRTFLRPR 35 T 2.4E-22 DUF316 unphh F Eukaryota T 2wuc 4 D I ACE-KQLR-CHLOROMETHYLKETONE INHIBITOR XKQLXX 6 T 240 DEK_C pdbhh F F 2wv4 2 C,D C,D POLG_FMDV1 FOOT AND MOUTH DISEASE VIRUS (SEROTYPE A) VARIANT VP1 CAPSID PROTEIN XAPAKQLLNFD 11 T 8.3 Fn3-like pdbhh T Viruses T 2wv5 2 E,F,G,H E,F,G,H POLG_FMDV1 FOOT AND MOUTH DISEASE VIRUS (SEROTYPE A) VARIANT VP1 CAPSID PROTEIN XAPAKELLNFD 11 T 16 Fn3-like pdbhh T Viruses T 2wwx 2 B B DRRA_LEGPH SIDM MPYSDAKAMLDEVAKIRELGVQRVTRIENLENAKKLWDNANSMLEKGNISGYLKAANELHKFMKEKNLKEDDLRPELSDKTISPKGYAILQSLWGAASDYSRAAATLTESTVEPGLVSAVNKMSAFFMDCKLSPNERATPDPDFKVGKSKILVGIMQFIKDVADPTSKIWMHNTKALMNHKIAAIQKLERSNNVNDETLESVLSSKGENLSEYLSYK 217 T 0.036 GlnD_UR_UTase unppssm F Bacteria T 2x0x 2 D,E,F,G D,E,F,P RIR2_ECOLI RIBONUCLEOTIDE REDUCTASE 1, PROTEIN B2, PROTEIN R2 YLVGQIDSEVDTDDLSNFQL 20 T 12 UL11 pdbhh F Bacteria T 2x1n 3 E H ACE-LEU-ASN-PFF-NH2 XLNXX 5 T 390 DUF4402 pdbhh F F 2x2c 1 A,B,D,G,I B,F,L,P,R CICLOSPORIN, CICLOSPORINE XXXXXXXXVXA 11 T 9.7 DUF6090 pdbhh F F 2x39 2 B C GSK3B_HUMAN GSK-3 BETA, GSK-3 BETA GRPRTTSFAE 10 T 6.3 DUF3084 pdbhh F Eukaryota T 2x3g 1 A A Q5TJA9_SIRV1 SIRV1 HYPOTHETICAL PROTEIN ORF119 GDLKKVLNFHFSYIYTYFITITTNYKYGDTEKIFRKFRSYIYNHDKNSHVFSIKETTKNSNGLHYHILVFTNKKLDYSRVHKHMPSHSDIRIELVPKSISDIKNVYKYMLKTKKDIKMS 119 T 0.0041 Phage_GPA pdbhh T Viruses T 2x3m 1 A A Q6ZYJ2_PSVY HYPOTHETICAL PROTEIN ORF239 GMSAFDEFNEGFGLDVSDTPEELAFETESAIEEIESETSPGDQPKGSEPEEIRVWAEEKARKAVEEGREVTNWADWIMGWRTPNASEKKMEFMYWYTRTYLEEAKDIRPDIADALARGMAGLAFGRTDWVASMLDPQIMRHIYTDPEVARIYSETRDMLRRVSDYYISLTTMELGKVADIIAEAKAKGENPEVVAREIAEAVPRLSPKSLYFNLYYIGRSIGDNYVLEVARVLSKMRRR 239 T 0.52 HD_assoc pdb T Viruses T 2x3t 2 E,F E,M GLYCOPEPTIDE KXXXXXXEX 9 T 100 DUF3913 pdbhh F F 2x4n 3 C,F C,F HLA-A2.1-RESTRICTED INFLUENZA A MATRIX EPITOPE KILGXVFXV 9 T 28 COPIIcoated_ERV pdbhh F T 2x4p 3 C,F C,F HLA-A2.1-RESTRICTED INFLUENZA A MATRIX EPITOPE MILGXVFXV 9 T 0.53 COPIIcoated_ERV pdbhh F T 2x4q 3 C,F C,F HLA-A2.1-RESTRICTED INFLUENZA A MATRIX EPITOPE MILGXVFXV 9 T 0.53 COPIIcoated_ERV pdbhh F T 2x4r 3 C,F C,F PP65_HCMVA LOWER MATRIX PROTEIN PP65,64 KDA MATRIX PHOSPHOPROTEIN NLVPMVATV 9 T 15 GDH_N pdbhh T Viruses T 2x4t 3 C,F C,F PP65_HCMVA LOWER MATRIX PROTEIN PP65,64 KDA MATRIX PHOSPHOPROTEIN NLVXMVATV 9 T 18 Pilus_CpaD pdbhh T Viruses T 2x4u 3 C,F C,F POL_HV1B1 P66 RT ILKEPVHGV 9 T 0.56 DUF2115 pdbhh T Viruses T 2x5c 1 A,B A,B Q6ZYH1_PSVY HYPOTHETICAL PROTEIN ORF131 GMGETPEGPMPNKKGKSEGGQIRTIPLKYYKQEYDMAADLVRMLRGLGVFMHAKCPRCGAEGSVSIVETKNGYKYLVIRHPDGGTHTVPKTDISAILKELCEVKKDLEYVLKRYKEYEEEGGVKFCAEGRK 131 T 0.038 zf-ISL3 unp T Viruses T 2x5g 1 A A Y131_SIRV1 UNCHARACTERIZED PROTEIN 131, CAG38830 GASLKEIIDELGKQAKEQNKIASRILKIKGIKRIVVQLNAVPQDGKIRYSMTIHSQNNFRKQIGITPQDAEDLKLIAEFLEKYSDFLNEYVKFTPR 96 T 0.0019 ANTH unp T Viruses T 2x5h 1 A,B,C,D A,B,C,D Y131_SIRV1 UNCHARACTERIZED PROTEIN 131, CAG38830 GMASLKEIIDELGKQAKEQNKIASRIMKIKGIKRIVVQLNAVPQDGKIRYSMTIHSQNNFRKQIGITPQDAEDLKLIAEFLEKYSDFLNEYVKFTPR 97 T 0.0019 ANTH unp T Viruses T 2x5r 1 A A Q6ZYF6_PSVY HYPOTHETICAL PROTEIN ORF126 GAMARVGPKIEITHGGKKYTVFSKVTHLVPRTENGEEAEYVVFGPEKEGVISVVVLAPKDLNEEALALRVKWFNDTKPRCVKCGAAYNGKNHFRVVAIRNGTYYLDAVCDKCEPRITWLSAIVIGRS 127 T 0.025 DUF2321 unp T Viruses T 2x5t 1 A A Y131_SIRV1 UNCHARACTERIZED PROTEIN 131, CAG38830 GASLKEIIDELGKQAKEQNKIASRILKIKGIKRIVVQLNAVPQDGKIRYSLTIHSQNNFRKQIGITPQDAEDLKLIAEFLEKYSDFLNEYVKFTPR 96 T 0.0019 ANTH unp T Viruses T 2x6m 2 B B SYUA_HUMAN ALPHA-SYNUCLEIN PEPTIDE GYQDYEPEA 9 T 4.6 DUF3270 pdbhh F Eukaryota T 2x6p 1 A,B,C A,B,C COIL SER L19C XEWEALEKKLAALESKLQACEKKLEALEHG 30 T 0.00043 DUF5320 pdbhh F T 2x70 3 C,F C,F HLA-A2.1-RESTRICTED INFLUENZA A MATRIX EPITOPE KILGXVFXV 9 T 0.41 COPIIcoated_ERV pdbhh F T 2x72 2 B B GNAT1_BOVIN GACT PEPTIDE, TRANSDUCIN ALPHA-1 CHAIN ILENLKDCGLF 11 T 0.75 Phage_holin_4_1 pdbhh F Eukaryota T 2x7k 2 B B CICLOSPORIN, CICLOSPORINE XXXXXXXXVXA 11 T 9.7 DUF6090 pdbhh F F 2xa7 4 D P TGN38 CARGO PEPTIDE DYQRLN 6 T 30 Fer4_24 pdbhh F T 2xac 2 C,D C,X VGFR1_HUMAN VEGFR-1, VASCULAR PERMEABILITY FACTOR RECEPTOR, TYROSINE-PROTEIN KINASE RECEPTOR FLT, TYROSINE-PROTEIN KINASE FRT, FLT-1, FMS-LIKE TYROSINE KINASE 1 SDTGRPFVEMYSEIPEIIHMTEGRELVIPCRVTSPNITVTLKKFPLDTLIPDGKRIIWDSRKGFIISNATYKEIGLLTCEATVNGHLYKTNYLTHRQT 98 T 0.0013 Ig_2 pdb F Eukaryota T 2xad 2 E,F,G,H E,F,G,H TEICOPLANIN XXXXXXX 7 T 1.5 Defensin_2 pdbhh F F 2xak 2 D,E,F,G D,E,F,P RIR2_ECOLI RIBONUCLEOTIDE REDUCTASE 1, PROTEIN B2, PROTEIN R2 YLVGQIDSEVDTDDLSNFQL 20 T 12 UL11 pdbhh F Bacteria T 2xap 2 D,E,F,G D,E,F,P RIR2_ECOLI RIBONUCLEOTIDE REDUCTASE 1, PROTEIN B2, PROTEIN R2 YLVGQIDSEVDTDDLSNFQL 20 T 12 UL11 pdbhh F Bacteria T 2xav 2 D,E,F,G D,E,F,P RIR2_ECOLI RIBONUCLEOTIDE REDUCTASE 1, PROTEIN B2, PROTEIN R2 YLVGQIDSEVDTDDLSNFQL 20 T 12 UL11 pdbhh F Bacteria T 2xaw 2 D,E,F,G D,E,F,P RIR2_ECOLI RIBONUCLEOTIDE REDUCTASE 1, PROTEIN B2, PROTEIN R2 YLVGQIDSEVDTDDLSNFQL 20 T 12 UL11 pdbhh F Bacteria T 2xax 2 D,E,F,G D,E,F,P RIR2_ECOLI RIBONUCLEOTIDE REDUCTASE 1, PROTEIN B2, PROTEIN R2 YLVGQIDSEVDTDDLSNFQL 20 T 12 UL11 pdbhh F Bacteria T 2xay 2 D,E,F,G D,E,F,P RIR2_ECOLI RIBONUCLEOTIDE REDUCTASE 1, PROTEIN B2, PROTEIN R2 YLVGQIDSEVDTDDLSNFQL 20 T 12 UL11 pdbhh F Bacteria T 2xaz 2 D,E,F,G D,E,F,P RIR2_ECOLI RIBONUCLEOTIDE REDUCTASE 1, PROTEIN B2, PROTEIN R2 YLVGQIDSEVDTDDLSNFQL 20 T 12 UL11 pdbhh F Bacteria T 2xb2 5 E F REN3B_HUMAN PUTATIVE REGULATOR OF NONSENSE TRANSCRIPTS 3B AER 3 T 360 QRPTase_N pdbhh F Eukaryota F 2xb2 6 F,J G,U REN3B_HUMAN UPF3B, NONSENSE MRNA REDUCING FACTOR 3B, UP-FRAMESHIFT SUPPRESSOR 3 HOMOLOG B, UP-FRAMESHIFT SUPPRESSOR 3 HOMOLOG ON CHROMOSOME X, HUPF3P-X EVVKRDRIRNKDRPAMQLYQPGARSRNRLCPPDDSTKSGDSAAERKQESGISHRKEGGEE 60 T 10 UPF0561 pdbhh F Eukaryota T 2xc8 1 A,B,C A,B,C O48465_BPSPP BACTERIOPHAGE SPP1 COMPLETE NUCLEOTIDE SEQUENCE GIEIVNRKAVWYLTSEIKETETGIEVSAGELHKGDEEVFPVEEVSFDLTPDDTYPVEYMLYLHMNVQTKKVSWSLCKAYLDGEGYCDYQGNERLIMYPVSVTVFPNGTREGTIFLYEKEDREPDRKPPVIVEPQPVGEIGTPDIDE 146 T 71 Gpi16 pdbhh T Viruses T 2xdc 1 A,B,C,D,E,F A,B,C,D,E,F VALYL GRAMICIDIN XGAXAXVXWXWXWXWX 16 T 4.6 MAP17 pdbhh F F 2xdw 2 B P SYNTHETIC PEPTIDE PHQ-PRO-YCP XPX 3 T 1100 zinc_ribbon_2 pdbhh F F 2xe4 2 B B ANTIPAIN XRVX 4 T 41 Receptor_IA-2 pdbhh F F 2xfx 3 C C Q4MYJ2_THEPA UNCHARACTERIZED PROTEIN VGYPKVKEEML 11 T 2.6 HrpB_C pdbhh F Eukaryota T 2xh5 2 B C GSK3B_HUMAN GSK-3 BETA, GSK-3 BETA GRPRTTSFAE 10 T 6.3 DUF3084 pdbhh F Eukaryota T 2xjh 1 A,B A,B MBCTN_METTR COPPER-BINDING COMPOUND, CBC, HYDROGEN PEROXIDE REDUCTASE, SUPEROXIDE DISMUTASE, MINUS-MET METHANOBACTIN XXGSCYPXSCM 11 T 0.0043 QueC pdbhh F Bacteria T 2xji 1 A,B,C,D,E,F A,B,C,D,E,F MBCTN_METTR COPPER-BINDING COMPOUND, CBC, HYDROGEN PEROXIDE REDUCTASE, SUPEROXIDE DISMUTASE, MINUS-MET METHANOBACTIN XXGSCYPXSCM 11 T 0.0043 QueC pdbhh F Bacteria T 2xl2 2 C,D C,D RBBP5_MOUSE RBBP5, RBBP-5 YAAEDEEVDVTSVD 14 T 0.0099 DUF2457 unppercent F Eukaryota T 2xl3 2 C,E C,E RBBP5_MOUSE RBBP5, RBBP-5 YAAEDEEVDVTSVD 14 T 0.0099 DUF2457 unppercent F Eukaryota T 2xl4 1 A A LNTA_LISMO Listeria nuclear targeted protein A GSMGEDEGEQTKTKKDSNKVVKTASRPKLSTKDLALIKADLAEFEARELSSEKILKDTIKEESWSDLDFANDNINQMIGTMKRYQQEILSIDAIKRASEASADTEAFKKIFKEWSEFKIERIQVTIDLLNGKKDSEAVFKKTYPNQIIFKKVRTNKLQTALNNLKVGYELLDSQK 175 T 0.012 T4SS_pilin unppssm F Bacteria T 2xnx 4 M,N M,N M1-BC1 MVWDRQRLEKELEEKKEALELAIDQASRDYHRATALEKELEEKKKALELAIDQASQDYNRANVLEKELEAITREQEINRNLLGNAKLELDQLSSEKEQLTIEKAKLEEEKQISDASRQSLRRDLDASREAKKQVEKDLLEHHHHHH 146 T 0.0088 M pdbpercent F T 2xny 4 G,H M,N M PROTEIN MVNGDGNPREVIEDLAANNPAIQNIRLRHENKDLKARLENAMEVAGRDFKRAEELEKAKQALEDQRKDLETKLKELQQDYDLAKESTSWDRQRLEKHHHHHH 102 T 0.016 ATG14 pdb F T 2xo4 2 D,E,F,G D,E,F,P RIR2_ECOLI RIBONUCLEOTIDE REDUCTASE 1, PROTEIN B2, PROTEIN R2 YLVGQIDSEVDTDDLSNFQL 20 T 12 UL11 pdbhh F Bacteria T 2xo5 2 D,E,F,G D,E,F,P RIR2_ECOLI R2 PEPTIDE, RIBONUCLEOTIDE REDUCTASE 1, PROTEIN B2, PROTEIN R2 YLVGQIDSEVDTDDLSNFQL 20 T 12 UL11 pdbhh F Bacteria T 2xpn 2 B B Q8SRG7_ENCCU SPT6 GSHMFFEIFGTGEEYRYVLESDP 23 T 8.5 STE3 pdbhh F Eukaryota T 2xpo 2 B,D B,D Q8SRG7_ENCCU SPT6 GSHMFFEIFGTGEEYRYVLESDP 23 T 8.5 STE3 pdbhh F Eukaryota T 2xpp 2 B B Q8SRG7_ENCCU SPT6 GSHMREISEESISSIDYGDRDSLFFEIFGTGEEYRYVLESDP 42 T 18 DUF2887 pdbhh F Eukaryota T 2xqq 2 E,F,G,H E,F,G,H SAC-ARG-GLY-THR-GLN-THR-GLU XRGTQTE 7 T 61 Toxin_27 pdbhh F T 2xrw 2 B B NFAC3_HUMAN PEPNFAT4, NF-ATC3, NFATC3, T-CELL TRANSCRIPTION FACTOR NFAT4, NF-AT4, NFATX LERPSRDHLYLPLE 14 T 2.8 DUF1101 pdbhh F Eukaryota T 2xs0 2 B B NFAC3_HUMAN NF-ATC3, T-CELL TRANSCRIPTION FACTOR NFAT4, NFATX LERPSRDHLYLPLE 14 T 2.8 DUF1101 pdbhh F Eukaryota T 2xs3 2 B,D C,D PEPTIDE ALA-PHE-THR-SER AFTS 4 T 140 Toxin_8 pdbhh F F 2xs4 2 B B PEPTIDE ALA-PHE-THR AFT 3 T 290 Thioredoxin_10 pdbhh F F 2xsm 1 A A CCT XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 489 F F F 2xsm 2 B B CCT XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 478 F F F 2xsm 3 C C CCT XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 455 F F F 2xsm 4 D D CCT XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 471 F F F 2xsm 5 E E CCT XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 472 F F F 2xsm 6 F F CCT XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 466 F F F 2xsm 7 G G CCT XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 485 F F F 2xsm 8 H H CCT XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 474 F F F 2xsm 9 I I CCT XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 293 F F F 2xsm 10 J J CCT XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 299 F F F 2xsm 11 K K CCT XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 394 F F F 2xsm 12 L,O L,O CCT XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 297 F F F 2xsm 13 M M CCT XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 298 F F F 2xsm 14 N N CCT XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 289 F F F 2xsm 15 P P CCT XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 481 F F F 2xu6 1 A,B A,B MDV1_YEAST MDV1 COILED COIL GPQTLVNSLEFLNIQKNSTMSEIRDIEVEVENLRQKKEKLLGKIANIEQNQLMLEDNLKQIDDRLDFLEEYG 72 T 0.00065 Pil1 pdb F Eukaryota T 2xu7 2 C,D C,D FOG1_HUMAN FRIEND OF GATA PROTEIN 1, FOG-1, FRIEND OF GATA 1, ZINC FINGER PROTEIN MULTITYPE 1 MSRRKQSNPRQIKRS 15 T 78 DHHA2 pdbhh F Eukaryota T 2xvc 2 B B Q97ZJ5_SULSO SSO0911 SEIPLPIPVKVINTL 15 T 3 FpoO pdbhh F Archaea T 2xvo 1 A,B,C,D A,B,C,D CMR7B_SULSO SSO1725 GAMGSPGGSQQVEWVFIPVIKDVTYEFKVDNNDNITELYVNGNKLGPASSLEMDFYFDVDVSNNQVRKFNNVFVLFGVIATKDSNKIKMQLTLNPCDFVRGFVFPSQDPSQLNNIFASNNKVSVSEKAFAILNRKKEGAVSSTINVYITQNTYTGNTKIEKIQQNTIIIEKNTGIVFKIPNDMLNIFRYSTT 192 T 0.034 OstA_2 unp F Archaea T 2xxm 3 C T INHIBITOR OF CAPSID ASSEMBLY ITFEDLLDYYG 11 T 0.69 DUF2610 pdbhh F T 2xxn 2 B B VIRF4_HHV8P VIRF-4 SVWIPVNEGASTSGM 15 T 7 Calponin pdbhh T Viruses T 2xzq 3 C P PHAGE DISPLAY DERIVED ANTIGEN YQLRPNAETLRF 12 T 5.2 7TM_GPCR_Srb pdbhh F T 2y06 3 C P PHAGE DISPLAY DERIVED ANTIGEN GDPRPSYISHLL 12 T 1.7 Tom7 pdbhh F T 2y07 3 C P PHAGE DISPLAY DERIVED ANTIGEN PPYPAWHAPGNI 12 T 1.3 DUF3612 pdbhh F T 2y1l 4 G,H G,H AC-IETD-CHO IETD 4 T 130 TSP9 pdbhh F F 2y1n 2 B,D B,D ZAP70_HUMAN TYROSINE-PROTEIN KINASE ZAP-70 ZAP-70,70 KDA ZETA-ASSOCIATED PROTEIN, SYK-RELATED TYROSINE KINASE TLNSDGXTPEPA 12 T 0.99 Galanin pdbhh F Eukaryota T 2y36 3 C P DODECAPEPTIDE (DLWTTAIPTIPS) DLWTTAIPTIPS 12 T 12 IER pdbhh F T 2y3y 2 E Q UNDECAPEPTIDE-GSSSGSASGAG GSSSGSASGAG 11 T 3 DUF1168 pdbhh F F 2y48 3 C C SNAI1_HUMAN TRANSCRIPTION FACTOR SNAIL, PROTEIN SNAIL HOMOLOG 1, PROTEIN SNA PRSFLVRKPSDPNRKPNYSE 20 T 0.29 bCoV_NS6 unppssm F Eukaryota T 2y4v 2 B B DAPK1_HUMAN DAP KINASE 1 RKKYKQSVRLISLCQRLSR 19 T 11 AAA_lid_8 pdbhh F Eukaryota T 2y5m 1 A,B,C,D,E,F A,B,C,D,E,F GRAMICIDIN D XGAXAXVXWXWXWXWX 16 T 4.6 MAP17 pdbhh F F 2y65 2 E,F,G W,X,Y KINH_DROME KINESIN GSGPQAQIAKPIRSGQGATS 20 T 0.23 FPP unphh F Eukaryota T 2y6i 2 B B ISOAMYLPHOSPHONYL-GLY-PRO-ALA XGPA 4 T 310 ACTL7A_N pdbhh F F 2y6n 1 A,B,C,D,E,F A,B,C,D,E,F GRAMICIDIN D XGAXAXVXWXWXWXWX 16 T 4.6 MAP17 pdbhh F F 2y6s 3 E,F P,Q VGP_EBOZ5 GP, GP1 GKLGLITNTIAGVAGLI 17 T 5.5 DUF4731 pdbhh T Viruses T 2y7l 2 B B FIBG_HUMAN FIBRINOGEN GAMMA CHAIN, ISOFORM CRA_A GEGQQHHLGGAKQAGDV 17 T 23 Rhodopsin_N pdbhh F Eukaryota T 2y8c 1 A,B,C A,B,C Q8II92_PLAF7 DEOXYURIDINE 5'-TRIPHOSPHATE NUCLEOTIDE-HYDROLASE MHLKIVCLSDEVREMYKNHKTHHEGDSGLDLFIVKDEVLKPKSTTFVKLGIKAIALQYKSNYYYKCEKSENKKKDDDKSNIVNTSFLLFPRSSISKTPLRLANSIGLIDAGYRGEIIAALDNTSDQEYHIKKNDKLVQLVSFTGEPLSFELVEELDETSRGEGGFGSTSNNKY 173 T 1.7E-06 dUTPase unppercent F Eukaryota T 2y8o 2 B B MP2K6_HUMAN MAPK/ERK KINASE 6, SAPKK3 SKGKKRNPGLKIPK 14 T 3 GHL15 pdbhh F Eukaryota T 2y8s 2 B,D B,E RON2_TOXGM RON2 DIVQHMEDIGGAPPVSCVTNEILGVTCAPQAIAKATT 37 T 7.1 LisH_TPL pdbhh F Eukaryota T 2y8t 2 B,D B,E RON2_TOXGM RON2 DIVQHMEDIGGAPPVSCVTNEILGVTCAPQAIAKATX 37 T 7.5 LisH_TPL pdbhh F Eukaryota T 2y9q 2 B B MKNK1_HUMAN MAP KINASE SIGNAL-INTEGRATING KINASE 1, MNK1 MKLSPPSKSRLARRRALA 18 T 0.15 HNOBA unphh F Eukaryota T 2y9w 2 C,D C,D G1K3P4_AGABI LECTIN-LIKE FOLD PROTEIN MAQARKIPLDLPGTRILNGANWANNSATENLATNSGTLIIFDQSTPGQDADRWLIHNYLDGYKIFNMGSNNWASVSRGNTVLGVSEFDGQTCKWSIEYSGNGEEFWIRVPREGGGGAVWTIKPASSQGPTTVFLDLLKETDPNQRIKFAV 150 T 0.002 Inhibitor_I66 pdbhh F Eukaryota T 2y9x 2 E,F,G,H E,F,G,H G1K3P4_AGABI LECTIN-LIKE FOLD PROTEIN MAQARKIPLDLPGTRILNGANWANNSATENLATNSGTLIIFDQSTPGQDADRWLIHNYLDGYKIFNMGSNNWASVSRGNTVLGVSEFDGQTCKWSIEYSGNGEEFWIRVPREGGGGAVWTIKPASSQGPTTVFLDLLKETDPNQRIKFAV 150 T 0.002 Inhibitor_I66 pdbhh F Eukaryota T 2yb8 1 A A SUZ12_DROME SUPPRESSOR 12 OF ZESTE PROTEIN, SUZ12 NPIFLNRTLSYMK 13 T 0.082 DUF4085 unp F Eukaryota T 2ybb 34 SA m NADH\: UBIQUINONE OXIDOREDUCTASE, MEMBRANE SUBUNIT L, XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 474 F F F 2ybb 35 TA n SUBUNIT OF NADH\: UBIQUINONE OXIDOREDUCTASE I XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 391 F F F 2ybb 36 UA o NADH DEHYDROGENASE I SUBUNIT N, NDH-1 SUBUNIT N XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 378 F F F 2ybb 37 VA p NADH DEHYDROGENASE I SUBUNIT K, NDH-1 SUBUNIT K XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 281 F F F 2ybf 2 B B RAD18_HUMAN POSTREPLICATION REPAIR PROTEIN RAD18, HHR18, HRAD18, RING FINGER PROTEIN 73 SKYRKKHKSEFQLLVDQARKGYKKIAG 27 T 0.012 PhoU_div unp F Eukaryota T 2ych 2 B B Q72IW7_THET2 COMPETENCE PROTEIN PILN MIRLNLLPKNLRRRV 15 T 0.0034 RskA unppercent F Bacteria T 2ydq 2 B T OGA_HUMAN MENINGIOMA-EXPRESSED ANTIGEN 5, NUCLEAR CYTOPLASMIC O-GLCNACASE AND ACETYLTRANSFERASE, PROTEIN O-GLCNACASE, GLYCOSIDE HYDROLASE O-GLCNACASE, HEXOSAMINIDASE C, N-ACETYL-BETA-D-GLUCOSAMINIDASE, N-ACETYL-BETA-GLUCOSAMINIDASE, O-GLCNACASE, OGA VAHSGAK 7 T 69 HEPN_AbiA_CTD pdbhh F Eukaryota T 2yds 2 B T TAB1_HUMAN MITOGEN-ACTIVATED PROTEIN KINASE KINASE KINASE 7-INTERACTING PROTEIN 1, TGF-BETA-ACTIVATED KINASE 1-BINDING PROTEIN 1, TAK1-BINDING PROTEIN 1 VPYSSAQ 7 T 18 UPF0160 pdbhh F Eukaryota T 2yen 1 A A GM3C_CONCN Mu-conotoxin CnIIIC QGCCNGPKGCSSKWCRDHARCCX 23 T 0.089 Mu-conotoxin pdbpssm F Eukaryota T 2yev 3 C,F C,F Q5SH67_THET8 UNCHARACTERIZED PROTEIN TTHA1863 MVYIALFALGAALVTLFFYLILNPRVLTTEGETFDLRFVLFMLLLILLAAGTVALMLLIGKAHHLL 66 T 0.039 7tm_3 pdbpercent F Bacteria T 2ygi 1 A,B,C,D A,B,C,D METHANOBACTIN HM1 XASXAA 6 T 1000 Ribosomal_L13 pdbhh F F 2ygj 1 A A METHANOBACTIN MB4 XASXAM 6 T 460 NCBP3 pdbhh F F 2ygu 1 A,B,C,D,E,F,G,H A,B,C,D,E,F,G,H VA2_SOLIN ALLERGEN SOL I II, VENOM ALLERGEN II, ALLERGEN=SOL I 2, SOL I 2 DNKELKIIRKDVAECLRTLPKCGNQPDDPLARVDVWHCAMAKRGVYDNPDPAVIKERSMKMCTKIITDPANVENCKKVASRCVDRETQGPKSNRQKAVNIIGCALRAGVAETTVLARKKHHHHHH 125 T 0.03 UPAR_LY6 pdb F Eukaryota T 2ygv 2 E,F,G,H E,F,G,H RAD53_YEAST RAD53, CHK2 HOMOLOG, SERINE-PROTEIN KINASE 1 SKKVKRAKLDQTSKGPENLQFS 22 T 19 GGA_N-GAT pdbhh F Eukaryota T 2yjv 2 M,N M,N RHLB XXXXXXXXX 9 F F F 2yka 2 B B ICP27_SHV21 ORF57 PROTEIN GPLGSSCKTSWADRVREAAAQRR 23 T 0.021 RE_BsaWI pdbhh T Viruses T 2yle 2 B B FMN2_HUMAN FMN2 PROTEIN VCRQKKGKSLYKIKPRHDSGIKAKISMKT 29 T 350 YL1_C pdbhh F Eukaryota T 2ymb 2 E,F F,H CHM1A_HUMAN CHMP1A, CHROMATIN-MODIFYING PROTEIN 1A, CHMP1A, VACUOLAR PROTEIN SORTING-ASSOCIATED PROTEIN 46-1, VPS46-1, HVPS46-1 MEDQLSRRLAALRN 14 T 3.9 DUF4549 pdbhh F Eukaryota T 2ymt 2 B B PHAGE DISPLAY DERIVED GAMMA 2 ADAPTIN EAR DOMAIN BINDING PEPTIDE GEEWGPWVX 9 T 0.69 Phage_antitermQ pdbhh F T 2ynn 2 B P KTKTN MOTIF CTFKTKTN 8 T 19 CBP_BcsO pdbhh F F 2yno 2 C P POLY ALA AAAAA 5 T 440 HCV_NS4a pdbhh F F 2ynp 2 B P KTKTN MOTIF CTFKTKTN 8 T 19 CBP_BcsO pdbhh F F 2ynr 2 B,C B,C B54NLS SVLGKRKRHPKV 12 T 6.3 DUF4668 pdbhh F T 2yns 2 C,D C,D B54NLS SVLGKRKRHPKV 12 T 6.3 DUF4668 pdbhh F T 2ypk 3 C C Q70XD7_9HIV1 KF11 P24 GAG PEPTIDE KAFSPEVIPMF 11 T 9.1E-05 Gag_p24 unphh T Viruses T 2ypl 3 C C Q70XD7_9HIV1 KF11 P24 GAG PEPTIDE KAFSPEVIPMF 11 T 9.1E-05 Gag_p24 unphh T Viruses T 2ypt 2 E,F,G,H F,G,H,I LMNA_HUMAN 70 KDA LAMIN, RENAL CARCINOMA ANTIGEN NY-REN-32, PRELAMIN A CSIM 4 T 46 PLN_propep pdbhh F Eukaryota F 2ypy 1 A,B,C,D,E,F,G,H,I,J A,B,C,D,E,F,G,H,I,J Q76SB0_HHV8 KSHV LANA GSRYQQPPVPYRQIDDCPAKARPQHIFYRRFLGKDGRRDPKCQWKFAVIFWGNDPYGLKKLSQAFQFGGVKAGPVSCLPHPGPDQSPITYCVYVYCQNKDTSKKVQMARLAWEASHPLAGNLQSSIVKFKKPLPLTQPG 139 T 0.00011 EBV-NA1 unphh T Viruses T 2ypz 1 A,B,C,D,E,F,G,H,I,J A,B,C,D,E,F,G,H,I,J Q76SB0_HHV8 KSHV LANA GSRYQQPPVPYRQIDDCPAKARPQHIFYRRFLGKDGRRDPKCQWKFAVIFWGNDPYGLKKLSQAFQFGGVKAGPVSCLPHPGPDQSPITYCVYVYCQNKDTSKKVQMARLAWEASHPLAGNLQSSIVKFKKPLPLTQPG 139 T 0.00011 EBV-NA1 unphh T Viruses T 2yq1 1 A,B,C,D A,B,C,D O41974_MHV68 MHV-68 LANA, IMMEDIATE-EARLY PROTEIN GSKRYSRYQKPHNPSDPLPKKYQGMRRHLQVTAPRLFDPEGHPPTHFKSAVMFSSTHPYTLNKLHKCIQSKHVLSTPVSCLPLVPGTTQQCVTYYLLSFVEDKKQAKKLKRVVLAYCEKYHSSVEGTIVKAKPYFPLPE 139 T 6.9E-05 EBV-NA1 pdbhh T Viruses T 2yrk 1 A A ZFHX4_MOUSE ZINC FINGER HOMEODOMAIN PROTEIN 4, ZFH-4 GSSGSSGGTDGTKPECTLCGVKYSARLSIRDHIFSKQHISKVRETVGSQLDREKD 55 T 0.00021 zf_C2H2_6 pdbhh F Eukaryota T 2ys5 2 B B ALK_HUMAN ANAPLASTIC LYMPHOMA KINASE, CD246 ANTIGEN LFRLRHFPCGNVNYGYQQQ 19 T 0.4 Ntox44 pdbhh F Eukaryota T 2yu7 2 B B NKG2A_HUMAN NKG2A ATEQEITXAELNLQK 15 T 0.02 Fez1 unppercent F Eukaryota T 2yvc 2 D,E,F D,E,F NEP_MOUSE NEUTRAL ENDOPEPTIDASE 24.11, NEUTRAL ENDOPEPTIDASE, NEP, ENKEPHALINASE, ATRIOPEPTIDASE, CD10 ANTIGEN GRSESQMDITDINAPKPKKKQR 22 T 7 Asp4 unphh F Eukaryota T 2z23 2 B B peptide (LYS)(LYS)(LYS) KKK 3 T 580 Rrn6 pdbhh F F 2z2p 1 A,B A,B VGB_STAAU STREPTOGRAMIN B LYASE MEFKLQELNLTNQDTGPYGITVSDKGKVWITQHKANMISCINLDGKITEYELPNKGAKVMCLTISSDGEVWFTENAANKIGRITKKGIIKEYTLPNPDSAPYGITEGPNGDIWFTEMNGNRIGRITDDGKIREYELPNKGSYPSFITLGSDNALWFTENQNNAIGRITESGDITEFKIPTPASGPVGITKGNDDALWFVEIIGNKIGRITTSGEITEFKIPTPNARPHAITAGAGIDLWFTEWGANKIGRLTSNNIIEEYPIQIKSAEPAGICFDGETIWFAMECDKIGKLTLIKDNME 299 T 0.00037 SGL pdbpssm F Bacteria T 2z2p 2 C,D C,D Quinupristin XTXPXXXX 8 T 290 zf-C2H2_jaz pdbhh F F 2z31 5 E P Myelin basic protein (MBP)-peptide RGGASQYRPSQ 11 T 7 Tsg pdbhh F T 2z3c 2 B B inhibitor XVXLXX 6 T 2100 zf-C2H2 pdbhh F F 2z3d 2 B I Inhibitor XLAAXX 6 T 1400 Pep_deformylase pdbhh F F 2z3e 2 B I ACE VAL Z3E LEU KCQ peptide XVXLX 5 T 2100 zf-C2H2 pdbhh F F 2z3f 2 B,D,F,H,J,L,N,P,Q,R,S I,J,K,L,M,N,O,P,Q,R,T YEG3_SCHPO CAC2 RKVESSKVSKKRIAPTPVYP 20 T 6.3E-18 PALB2_WD40 unphh F Eukaryota T 2z3l 2 C,D C,D peptide (PHE)(ARG)(TYR)(LEU)(GLY) FRYLG 5 T 23 GcnA_N pdbhh F F 2z3n 2 C,D C,D peptide (PHE)(ARG)(TYR)(LEU)(GLY) FRYLG 5 T 23 GcnA_N pdbhh F F 2z41 1 A A putative ski2-type helicase XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 648 F F F 2z4e 4 G,H S,T Fibrin B knob (tetrapeptide) GHRP 4 T 14 VPS38 unphh F F 2z4e 5 I,J I,J FIBB_BOVIN Fibrin B knob (pentapeptide) GHRPY 5 T 12 VPS38 unphh F Eukaryota F 2z5k 2 B B NXF1_HUMAN TIP-ASSOCIATING PROTEIN, TIP-ASSOCIATED PROTEIN, MRNA EXPORT FACTOR TAP, TAP NLS EEDDGDVAMSDAQDGPRVRYNPYTTRPNRR 30 T 1.6 DUF4687 pdbhh F Eukaryota T 2z5m 2 B B NXF1_HUMAN TIP-ASSOCIATING PROTEIN, TIP-ASSOCIATED PROTEIN, MRNA EXPORT FACTOR TAP, TAP NLS EEDDGDVAMSDAQDGPRVRYNPYTTRPNRR 30 T 1.6 DUF4687 pdbhh F Eukaryota T 2z5o 2 B B Heterogeneous nuclear ribonucleoprotein D-like XXXXXXXXXX 10 F F F 2z6w 2 C,D M,N CICLOSPORIN, CICLOSPORINE XXXXXXXXVXA 11 T 9.7 DUF6090 pdbhh F F 2z7x 3 C C Pam3CSK4 SKKKK 5 T 230 BCCIP pdbhh F F 2z8c 2 B B IRS1_HUMAN IRS-1 DYMNMS 6 T 7.2 LAX pdbhh F Eukaryota F 2z8p 2 B B (GLY)(GLU)(ALA)(TPO)(VAL)(PTR)(ALA) GEATVXA 7 T 34 B_solenoid_ydck pdbhh F T 2z9i 2 D,E,F D,E,F SVEQV SVEQV 5 T 150 DUF4476 pdbhh F F 2z9i 3 G,H,I G,H,I GATV GATV 4 T 240 DUF3574 pdbhh F F 2zck 2 B S KGISSQY KGISSQY 7 T 23 DUF4133 pdbhh F T 2zd7 2 C C EVDLPLSDEEPSS EVDLPLSDEEPSS 13 T 11 DUF1375 pdbhh F T 2zdj 1 A,B,C,D A,B,C,D D0VWQ2_9ZZZZ hypothetical protein TTMA177 MKMRKLVKDFGDDYTLIQDSQEVKAILEYIGSEEEPHALFVKVGDGDYEEVWGIDSFVPYNFLEAYRLK 69 T 2.8 Ribosomal_L30 pdbhh F unclassified sequences. T 2zfx 2 B C MED1_HUMAN PEROXISOME PROLIFERATOR-ACTIVATED RECEPTOR-BINDING PROTEIN, VITAMIN D RECEPTOR-INTERACTING PROTEIN COMPLEX COMPONENT DRIP205, MEDIATOR OF RNA POLYMERASE II TRANSCRIPTION SUBUNIT 1 KNHPMLMNLLKDN 13 T 3.8 Gb3_synth pdbhh F Eukaryota T 2zgh 2 B B SSGKVPL SSGKVPL 7 T 7.6 DUF1859 pdbhh F T 2zgj 2 B B SSGKVPLS SSGKVPLS 8 T 14 La_HTH_kDCL pdbhh F T 2zhr 2 C,D C,D inhibitor OM99-2 EVNXAEF 7 T 200 DUF1480 pdbhh F T 2zjd 2 B,D B,D SQSTM_MOUSE UBIQUITIN-BINDING PROTEIN P62, STONE14 SGGDDDWTHLS 11 T 7.6 DUF5888 pdbhh F Eukaryota T 2zjp 5 E 5 NOSM_STRAS NOSIHEPTIDE SXTXXXXCXXXXX 13 T 0.14 CCER1 pdbhh F Bacteria F 2zks 2 B C hGzmM inhibitor XKVPLX 6 T 250 T4-gp15_tss pdbhh F F 2zl2 2 O,P,Q,T,U,V,W,X O,P,Q,T,U,V,W,X A peptide substrate-NVLGFTQ NVLGFTQ 7 T 3.4 GRAB pdbhh F T 2zl2 3 R,S R,S A peptide substrate-NVLGFTQ for Chain R and S XXXXXXX 7 F F F 2zl4 2 B,BA,D,F,H,J,L,N,P,R,T,V,X,Z O,2,P,Q,R,S,T,U,V,W,X,Y,Z,1 Peptide substrate AAAA AAAA 4 T 900 Cyclin_C pdbhh F F 2zl9 2 B C Coactivator peptide DRIP KNHPMLMNLLKDN 13 T 3.8 Gb3_synth pdbhh F T 2zla 2 B C Coactivator peptide DRIP KNHPMLMNLLKDN 13 T 3.8 Gb3_synth pdbhh F T 2zlc 2 B C Coactivator peptide DRIP KNHPMLMNLLKDN 13 T 3.8 Gb3_synth pdbhh F T 2zld 2 C,D C,D COLICIN-E3 A CHAIN, RIBONUCLEASE XXXXXXX 7 F F F 2zlf 2 B B FTLDADF FTLDADF 7 T 18 Ad_cyc_g-alpha pdbhh F F 2zmh 2 B C MED1_XENTR MEDIATOR COMPLEX SUBUNIT 1, PEROXISOME PROLIFERATOR-ACTIVATED RECEPTOR KNHPMLMNLLKDN 13 T 3.8 Gb3_synth pdbhh F Eukaryota T 2zmi 2 B C MED1_XENTR MEDIATOR COMPLEX SUBUNIT 1, PEROXISOME PROLIFERATOR-ACTIVATED RECEPTOR KNHPMLMNLLKDN 13 T 3.8 Gb3_synth pdbhh F Eukaryota T 2zmj 2 B C MED1_XENTR MEDIATOR COMPLEX SUBUNIT 1, PEROXISOME PROLIFERATOR-ACTIVATED RECEPTOR KNHPMLMNLLKDN 13 T 3.8 Gb3_synth pdbhh F Eukaryota T 2zne 2 C,D C,D PDC6I_HUMAN PDCD6-INTERACTING PROTEIN, ALG-2-INTERACTING PROTEIN 1, HP95 QGPPYPTYPGYPGYSQ 16 T 5.1 CYYR1 pdbhh F Eukaryota T 2zok 3 I,J,K,L I,L,J,K SPIKE_CVMJC PEPTIDIC EPITOPE S510 XSLWNGPHL 9 T 5.1 Mut7-C pdbhh T Viruses T 2zol 3 E,F F,E SPIKE_CVMJC PEPTIDIC EPITOPE S510 XSLSNGPHL 9 T 6.1 WEF-hand pdbhh T Viruses T 2zos 1 A,B A,B MPGP_PYRHO MPGP MIRLIFLDIDKTLIPGYEPDPAKPIIEELKDMGFEIIFNSSKTRAEQEYYRKELEVETPFISENGSAIFIPKGYFPFDVKGKEVGNYIVIELGIRVEKIREELKKLENIYGLKYYGNSTKEEIEKFTGMPPELVPLAMEREYSETIFEWSRDGWEEVLVEGGFKVTMGSRFYTVHGNSDKGKAAKILLDFYKRLGQIESYAVGDSYNDFPMFEVVDKVFIVGSLKHKKAQNVSSIIDVLEVIKHHHHHH 249 T 1.5E-10 Hydrolase_3 pdbpercent F Archaea T 2zpk 3 C,F P,Q PAR4_HUMAN PAR-4, THROMBIN RECEPTOR-LIKE 3, COAGULATION FACTOR II RECEPTOR-LIKE 3 PRGYPGQV 8 T 0.16 Gag_p19 pdbhh F Eukaryota T 2zpy 2 B B CD44_MOUSE CD44 antigen SRRRCGQKKKLVINGGNGTV 20 T 0.046 RCR unphh F Eukaryota T 2zui 1 A A CPXA_PSEPU CYTOCHROME P450-CAM, P450CAM MTTETIQSNANLAPLPPHVPEHLVFDFDMYNPSNLSAGVQEAWAVLQESNVPDLVWTRCNGGHWIATRGQLIREAYEDYRHFSSECPFIPREAGEAYDFIPTSMDPPEQRQFRALANQVVGMPVVDKLENRIQELACSLIESLRPQGQCNFTEDYAEPFPIRIFMLLAGLPEEDIPHLKYLTDQMTRPDGSMTFAEAKEALYDYLIPIIEQRRQKPGTDAISIVANGQVNGRPITSDEAKRMCGLLLVGGLDTVVNFLSFSMEFLAKSPEHRQELIERPERIPAACEELLRRFSLVANGRILTSDYEFHGVQLKKGDQILLPQMLSGLDERENACPMHVDFSRQKVSHTTFGHGSHLCLGQHLARREIIVTLKEWLTRIPDFSIAPGAQIQHKSGIVSGVQALPLVWDPATTKAV 415 T 1.6E-05 p450 unppercent F Bacteria T 2zvk 2 B,D,F U,V,W DNA polymerase eta CKRPRPEGMQTLESFFKPLTH 21 T 2.5 RC-P840_PscD pdbhh F T 2zvl 2 B,D,F,H,J,L U,V,W,X,Y,Z DNA polymerase kappa PKHTLDIFFKPLTH 14 T 1.5 DUF4387 pdbhh F T 2zvm 2 B,D,F U,V,W DNA polymerase iota ALNTAKKGLIDYYLMPSLSTTSR 23 T 3 DUF2620 pdbhh F T 2zvv 2 C,D Y,X CDN1A_HUMAN P21, CDK-INTERACTING PROTEIN 1, MELANOMA DIFFERENTIATION-ASSOCIATED PROTEIN 6, MDA-6 GRKRRQTSMTDFYHSKRRLIFS 22 T 0.85 CDC27 pdbhh F Eukaryota T 2zvw 2 I,J,K,L,M,N,O,P I,J,K,L,M,N,O,P CDN1A_HUMAN P21, CDK-INTERACTING PROTEIN 1, MELANOMA DIFFERENTIATION-ASSOCIATED PROTEIN 6, MDA-6 GRKRRQTSMTDFYHSKRRLIFS 22 T 0.85 CDC27 pdbhh F Eukaryota T 2zxm 2 B C MED1_XENTR MEDIATOR COMPLEX SUBUNIT 1 KNHPMLMNLLKDN 13 T 3.8 Gb3_synth pdbhh F Eukaryota T 2zxn 2 B C MED1_XENTR MEDIATOR COMPLEX SUBUNIT 1 KNHPMLMNLLKDN 13 T 3.8 Gb3_synth pdbhh F Eukaryota T 316d 2 C C DACTINOMYCIN TXPXXXTXPXX 11 T 61 DUF5572 pdbhh F F 3a0b 19 MA,S n,N Photosystem II reaction center protein Y XXXXXXXXXXXXXXXXXXXXXXXX 24 F F F 3a0h 19 MA,S n,N Photosystem II reaction center protein Y XXXXXXXXXXXXXXXXXXXXXXX 23 F F F 3a0m 1 A,B,C,D,E,F A,B,C,D,E,F collagen-like peptide PPGPPGPPGPPGPVGPPGPPGPPGPPG 27 T 0.00017 Collagen pdbpssm F F 3a2h 2 B B MED1_HUMAN DRIP 205 NR2 BOX PEPTIDE, MEDIATOR COMPLEX SUBUNIT 1, PEROXISOME PROLIFERATOR-ACTIVATED RECEPTOR-BINDING PROTEIN, VITAMIN D RECEPTOR-INTERACTING PROTEIN COMPLEX COMPONENT DRIP205, ACTIVATOR-RECRUITED COFACTOR 205 KDA COMPONENT KNHPMLMNLLKDN 13 T 3.8 Gb3_synth pdbhh F Eukaryota T 3a6p 2 B,G B,G 13-mer peptide XXXXXXXXXXXXX 13 F F F 3a79 3 C C Pam2CSK4 CSKKKK 6 T 9.5 NOZZLE pdbhh F F 3a7p 1 A,B A,B ATG16_YEAST ATG16, CYTOPLASM TO VACUOLE TARGETING PROTEIN 11, SAP18 HOMOLOG GPMGNFIITERKKAKEERSNPQTDSMDDLLIRRLTDRNDKEAHLNELFQDNSGAIGGNIVSHDDALLNTLAILQKELKSKEQEIRRLKEVIALKNKNTERLNAALISGTIENNVLQQKLSDLKKEHSQLVARWLKKTEKETEAMNSEIDGTK 152 T 0.044 ATG16 unphh F Eukaryota T 3a7q 1 A A RELN_MOUSE REELER PROTEIN GRDGNNLNNPVLLLDTFDFGPREDNWFFYPGGNIGLYCPYSSKGAPEEDSAMVFVSNEVGEHSITTRDLSVNENTIIQFEINVGCSTDSSSADPVRLEFSRDFGATWHLLLPLCYHSSSLVSSLCSTEHHPSSTYYAGTTQGWRREVVHFGKLHLAGSVRFRWYQGFYPAGSQPVTWAIDNVYIGPQCEEMCYGHGSCINGTKCICDPGYSGPTCKISTKNPDFLKDDFEGQLESDRFLLMSGGKPSRKCGILSSGNNLFFNEDGLRMLVTRDLDLSHARFVQFFMRLGCGKGVPDPRSQPVLLQYSLNGGLSWSLLQEFLFSNSSNVGRYIALEMPLKARSGSTRLRWWQPSENGHFYSPWVIDQILIGGNISGNTVLEDDFSTLDSRKWLLHPGGTKMPVCGSTGDALVFIEKASTRYVVTTDIAVNEDSFLQIDFAASCSVTDSCYAIELEYSVDLGLSWHPLVRDCLPTNVECSRYHLQRILVSDTFNKWTRITLPLPSYTRSQATRFRWHQPAPFDKQQTWAIDNVYIGDGCLDMCSGHGRCVQGSCVCDEQWGGLYCDEPETSLPTQLKDNFNRAPSNQNWLTVSGGKLSTVCGAVASGLALHFSGGCSRLLVTVDLNLTNAEFIQFYFMYGCLITPSNRNQGVLLEYSVNGGITWNLLMEIFYDQYSKPGFVNILLPPDAKEIATRFRWWQPRHDGLDQNDWAIDNVLISRLENLYFQ 725 T 0.00014 EGF_2 pdb F Eukaryota T 3aa1 3 C C PKHO1_HUMAN CKIP-1, CASEIN KINASE 2-INTERACTING PROTEIN 1, C-JUN-BINDING PROTEIN, OSTEOCLAST MATURATION ASSOCIATED GENE 120 PROTEIN SYLAHPTRDRAKIQHSRRPPTRG 23 T 0.0038 CAP-ZIP_m pdbhh F Eukaryota T 3aa6 3 C C CD2AP_HUMAN CD2AP, CAS LIGAND WITH MULTIPLE SH3 DOMAINS, ADAPTER PROTEIN CMS NLLHLTANRPKMPGRRLPGRFNG 23 T 0.018 CARMIL_C pdbhh F Eukaryota T 3abd 2 B,D X,Y REV3L_HUMAN REV3, HREV3 MLTPTPDSSPRSTSSPSQSKNGSFTPRTANILKPLMSPPSREEIMATLLDHD 52 T 7 CaM_bind pdbhh F Eukaryota T 3abe 2 B Z REV3L_HUMAN REV3, HREV3 MLTPTPDSSPRSTSSPSQSKNGSFTPRTANILKPLMSPPSREEIMATLLDHD 52 T 7 CaM_bind pdbhh F Eukaryota T 3abn 1 A,B,C A,B,C collagen-like peptide PPGPPGPPGPPGPDGPPGPPGPPGPPG 27 T 0.00012 Collagen pdbpercent F F 3ade 2 B B SQSTM_MOUSE Sequestosome-1 KEVDPSTGELQSLQ 14 T 1.2 DUF2396 pdbhh F Eukaryota T 3afr 2 B C MED1_XENTR MEDIATOR COMPLEX SUBUNIT 1, PEROXISOME PROLIFERATOR-ACTIVATED RECEPTOR KNHPMLMNLLKDN 13 T 3.8 Gb3_synth pdbhh F Eukaryota T 3agm 2 B B N~2~-{8-OXO-8-[4-(9H-PURIN-6-YL)PIPERAZIN-1-YL]OCTANOYL}-D-ARGINYL-D-ARGINYL-D-ARGINYL-D-ARGINYL-D-ARGINYL-D-ARGININAMIDE XXXXXXXXX 9 T 27 CDC27 pdbhh F F 3agy 2 C,D,E C,D,F HSP7C_HUMAN HSP70, HSPA1A, HEAT SHOCK 70 KDA PROTEIN 8 GPTIEEVD 8 T 8.1 DUF4028 pdbhh F Eukaryota T 3agz 2 C,D,E,F C,D,E,F HSP7C_HUMAN HSP70, HSPA1A, HEAT SHOCK 70 KDA PROTEIN 8 GPTIEEVD 8 T 8.1 DUF4028 pdbhh F Eukaryota T 3ah8 4 D Y YM-254890 XXXXXXXAX 9 T 1100 Pro-rich pdbhh F F 3al1 1 A,B A,B PROTEIN (D, L-ALPHA-1) XELLKKLLEELKG 13 T 11 NABP pdbhh F F 3al3 2 B B FANCJ_HUMAN PROTEIN FANCJ, ATP-DEPENDENT RNA HELICASE BRIP1, BRCA1-INTERACTING PROTEIN C-TERMINAL HELICASE 1, BRCA1-INTERACTING PROTEIN 1, BRCA1-ASSOCIATED C-TERMINAL HELICASE 1 SIYFTPELYD 10 T 2 NpwBP pdbhh F Eukaryota T 3alo 2 B E p38 peptide DDEMTGYA 8 T 0.16 YicC_N pdbhh F T 3ap1 2 C,D S,T C4 peptide EDFEDYEFD 9 T 1.6 UL11 pdbhh F F 3ap2 2 C,D S,T C4 peptide EDFEDYEFD 9 T 1.6 UL11 pdbhh F F 3apr 2 B I REDUCED PEPTIDE INHIBITOR XPFHXVY 7 T 0.78 DUF5372 pdbhh F T 3asw 2 B B K1C10_HUMAN 15-MER FROM KERATIN, TYPE I CYTOSKELETAL 10 YGGGSSGGGSSGGGH 15 T 2.7 DUF3246 pdbhh F Eukaryota F 3at0 2 B B FIBA_HUMAN 16-MER FROM FIBRINOGEN ALPHA CHAIN, FIBRINOPEPTIDE A GSWNSGSSGTGSTGNQ 16 T 1.8 DUF4603 pdbhh F Eukaryota T 3atw 2 C,D C,D peptide ACE-THR-VAL-ALC-HIS-H XTVXX 5 T 250 DUF3810 pdbhh F F 3aun 2 B B MED1_HUMAN PEROXISOME PROLIFERATOR-ACTIVATED RECEPTOR-BINDING PROTEIN, VITAMIN D RECEPTOR-INTERACTING PROTEIN COMPLEX COMPONENT DRIP205, MEDIATOR OF RNA POLYMERASE II TRANSCRIPTION SUBUNIT 1 KNHPMLMNLLKDN 13 T 3.8 Gb3_synth pdbhh F Eukaryota T 3av9 2 C,D X,Y LENS EPITHELIAL DERIVED GROWTH FACTOR PEPTIDE SAKIDNLD 8 T 8.4 DUF5399 pdbhh F T 3ava 2 C,D X,Y LENS EPITHELIAL DERIVED GROWTH FACTOR PEPTIDE ALKIDNLD 8 T 14 DUF5399 pdbhh F T 3avb 2 C,D X,Y LENS EPITHELIAL DERIVED GROWTH FACTOR PEPTIDE SLKIDNLD 8 T 11 DUF3389 pdbhh F T 3avc 2 C,D D,F LENS EPITHELIAL DERIVED GROWTH FACTOR SDKIDNLD 8 T 15 Pal1 pdbhh F T 3avf 2 C,D D,F LENS EPITHELIAL DERIVED GROWTH FACTOR DLKIDNLD 8 T 16 DUF5399 pdbhh F F 3avg 2 C,D D,F LENS EPITHELIAL DERIVED GROWTH FACTOR PEPTIDE ADKIDNLD 8 T 6.9 tRNA_synt_2f pdbhh F T 3avh 2 C,D E,F LENS EPITHELIAL DERIVED GROWTH FACTOR PEPTIDE ARKIDNLD 8 T 1.1 DUF3663 pdbhh F T 3avi 2 C,D D,F LENS EPITHELIAL DERIVED GROWTH FACTOR PEPTIDE SLKIDNMD 8 T 12 DUF3389 pdbhh F T 3avj 2 C,D D,F LENS EPITHELIAL DERIVED GROWTH FACTOR PEPTIDE ALKIDNMD 8 T 15 Dak1_2 pdbhh F T 3avk 2 C,D D,F LENS EPITHELIAL DERIVED GROWTH FACTOR PEPTIDE SLKIDNED 8 T 1.5 DUF3389 pdbhh F T 3avl 2 C,D F,E LENS EPITHELIAL DERIVED GROWTH FACTOR PEPTIDE ATKIDNLD 8 T 12 DUF5399 pdbhh F T 3avm 2 C,D D,F LENS EPITHELIAL DERIVED GROWTH FACTOR PEPTIDE SRKIDNLD 8 T 5.9 DUF5399 pdbhh F T 3avn 2 C,D G,H LENS EPITHELIAL DERIVED GROWTH FACTOR PEPTIDE SHKIDNLD 8 T 15 DUF5399 pdbhh F T 3avz 2 B B peptide ACE-SER-ALA-VAL-ALC-HIS-H XSAVXX 6 T 210 HEPN_Apea pdbhh F F 3aw0 2 B B peptide ACE-SER-ALA-VAL-LEU-HIS-H XSAVLX 6 T 200 DUF4770 pdbhh F F 3awr 2 C,D C,D ALDH2_RAT ALDH CLASS 2, ALDH-E2, ALDH1 GPRLSRLLSSAGC 13 T 5.5 Trp_DMAT pdbhh F Eukaryota T 3ax2 2 B,D,F,H B,D,F,H ALDH2_RAT ALDH CLASS 2, ALDH-E2, ALDH1 GPRLSRLLSYAGSGCX 16 T 0.92 DUF4360 pdbhh F Eukaryota T 3ax3 2 B,D,F,H B,D,F,H ALDH2_RAT ALDH CLASS 2, ALDH-E2, ALDH1 GXRLCRLLSYAX 12 T 0.43 Lentiviral_Tat pdbhh F Eukaryota T 3ax5 2 B,D B,D ALDH2_RAT Aldehyde dehydrogenase, mitochondrial GXRLCRLLSYA 11 T 0.33 Lentiviral_Tat pdbhh F Eukaryota T 3axy 3 E,F,K,L E,F,K,L Rice FD homolog OsFD1 LQRVLSAPF 9 T 8.7 ArgoL2 pdbhh F T 3ayu 2 B B A4_HUMAN ABPP, APPI, APP, ALZHEIMER DISEASE AMYLOID PROTEIN, CEREBRAL VASCULAR AMYLOID PEPTIDE, CVAP, PREA4, PROTEASE NEXIN-II, PN-II ISYGNDALMP 10 T 5.1 ESAG1 pdbhh F Eukaryota T 3azq 2 C,D C,D tripeptide PGG PGG 3 T 40 P5CR_dimer pdbhh F F 3b1m 2 B B PRGC1_HUMAN PGC-1-ALPHA, PPAR-GAMMA COACTIVATOR 1-ALPHA, PPARGC-1-ALPHA, LIGAND EFFECT MODULATOR 6 QEAEEPSLLKKLLLAPANT 19 T 3.3 Apo-CIII pdbhh F Eukaryota T 3b21 1 A A Q8VSD5_SHIFL OSPI GPLGSPEFMINGVSLQGTAGYEAHTEEGNVNVKKLLESLNSKSLGDMDKDSELAATLQKMINPSGGDGNCSGCALHACMAMLGYGVREAPVPNEISEYMTGFFHRHLEQIDSEGIVSHPNETYSKFRERIAENILQNTSKGSVVMISIEQATHWIAGFNDGEKIMFLDVQTGKGFNLYDPVEKSPDAFVDENSSVQVIHVSDQEFDHYANSSSWKSKRLC 220 T 0.13 Gln_amidase unppercent F Bacteria T 3b23 3 C C VARI_AMBVA Variegin SDQGDVAEPKMHKTAPPFDFEAIPEEYLDDES 32 T 0.038 Hirudin pdbhh F Eukaryota T 3b3i 3 C C VIPR1_HUMAN VIP-R-1, PITUITARY ADENYLATE CYCLASE-ACTIVATING POLYPEPTIDE TYPE II RECEPTOR, PACAP TYPE II RECEPTOR, PACAP-R-2 RRKWXRWHL 9 F F Eukaryota T 3b6s 3 C C VIPR1_HUMAN VASOACTIVE INTESTINAL POLYPEPTIDE RECEPTOR 1 RRKWXRWHL 9 F F Eukaryota T 3b7f 1 A A Q46R99_CUPNJ Glycosyl hydrolase, BNR repeat GMTASTAPQTEPHKTSAPESGPVMLLVATIKGAWFLASDPARRTWELRGPVFLGHTIHHIVQDPREPERMLMAARTGHLGPTVFRSDDGGGNWTEATRPPAFNKAPEGETGRVVDHVFWLTPGHASEPGTWYAGTSPQGLFRSTDHGASWEPVAGFNDHPMRRAWTGGEQDGTPDGPKMHSILVDPRDPKHLYIGMSSGGVFESTDAGTDWKPLNRGCAANFLPDPNVEFGHDPHCVVQHPAAPDILYQQNHCGIYRMDRREGVWKRIGDAMPREVGDIGFPIVVHQRDPRTVWVFPMDGSDVWPRVSPGGKPAVYVTRDAGESWQRQDRGLPTDQAWLTVKRQAMTADAHAPVGVYFGTTGGEIWASADEGEHWQCIASHLPHIYAVQSARPV 394 T 0.0005 Sortilin-Vps10 pdb F Bacteria T 3b7s 2 B B RSR peptide RSR 3 T 240 zf-CCHC pdbhh F F 3b7t 2 B B RAR peptide RAR 3 T 270 BUD22 pdbhh F F 3b7v 2 C C peptide NLXQI 5 T 160 DarT pdbhh F F 3b80 2 C C peptide NLXQI 5 T 160 DarT pdbhh F F 3b9t 1 A,B,C,D A,B,C,D Q1GZG6_METFK Twin-arginine translocation pathway signal protein GMSDHVCQEGCRHHSHGEDSPEIQQEFQEGRRDFMRDFAVGGVLASAASLGISSSAFGQTMPKTGLTSGHATHYYIPASDKTVSWGFFSKSLKPVVELESGDFATIETLTHHSNDDASLMVKGDPGAESVFYWDSKRKNVDRRGMGPMDHKLGAGGGMGVHILTGPVAIKGAEPGDVLEVRIVDVALRPSANPEFKGKTFGSNVAANWGFHYNELIEEPKKREVVTIYELDATGERNWARAFYNYRWTPQKDPFGVVHPIVDYPGVPVDHSTISKNYNVLKNIRVPVRPHFGTMGLAPKEADLVNSVPPSHFGGNIDNWRIGKGATMYYPVSVAGGLFSVGDPHASQGDSEMCGTAIECSLTGTFQFILHKKADLPGTPLADLQYPLLETQDEWVLHGFSYANYLAELGPDAQNSIFSKSSLDLALKDAFRKMRHFLMQTQNLTEDEAVSLMSIGVDFGITQVVDGNWGVHAVVKKGIFPGRDV 484 T 0.34 TAT_signal unppssm F Bacteria T 3bcc 9 I,I2 I,I CYTOCHROME BC1 COMPLEX, COMPLEX III XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 33 F F F 3bdf 1 A,B A,B PPB_ECOLI APASE RTPEMPVLENRAAQGDITAPGGARRLTGDQTAALRDSLSDKPAKNIILLIGDGMGDSEITAARNYAEGAGGFFKGIDALPLTGQYTHYALNKKTGKPDYVTDSAASATAWSTGVKTYNGALGVDIHEKDHPTILEMAKAAGLATGNVSTAELQDAVPAALVAHVTSRKCYGPSATSEKCPGNALEKGGKGSITEQLLNARADVTLGGGAKTFAETATAGEWQGKTLREQAQARGYQLVSDAASLNSVTEANQQKPLLGLFADGNMPVRWLGPKATYHGNIDKPAVTCTPNPQRNDSVPTLAQMTDKAIELLSKNEKGFFLQVEGASIDKQDHAANPCGQIGETVDLDEAVQRALEFAKKEGNTLVIVTADHAHASQIVAPDTKAPGLTQALNTKDGAVMVMSYGNSEEDSQEHTGSQLRIAAYGPHAANVVGLTDQTDLFYTMKAALGLKLEHHHHHH 458 T 1.3E-10 Alk_phosphatase pdbpssm F Bacteria T 3bdg 1 A A PPB_ECOLI APASE RTPEMPVLENRAAQGDITAPGGARRLTGDQTAALRDSLSDKPAKNIILLIGDGMGDSEITAARNYAEGAGGFFKGIDALPLTGQYTHYALNKKTGKPDYVTDSAASATAWSTGVKTYNGALGVDIHEKDHPTILEMAKAAGLATGNVSTAELQDAVPAALVAHVTSRKCYGPSATSEKCPGNALEKGGKGSITEQLLNARADVTLGGGAKTFAETATAGEWQGKTLREQAQARGYQLVSDAASLNSVTEANQQKPLLGLFADGNMPVRWLGPKATYHGNIDKPAVTCTPNPQRNDSVPTLAQMTDKAIELLSKNEKGFFLQVEGASIDKQDHAANPCGQIGETVDLDEAVQRALEFAKKEGNTLVIVTADHAHASQIVAPDTKAPGLTQALNTKDGAVMVMSYGNSEEDSQEHTGSQLRIAAYGPHAANVVGLTDQTDLFYTMKAALGLKLEHHHHHH 458 T 1.3E-10 Alk_phosphatase pdbpssm F Bacteria T 3bdg 2 B B PPB_ECOLI APASE RTPEMPVLENRAAQGDITAPGGARRLTGDQTAALRDSLSDKPAKNIILLIGDGMGDSEITAARNYAEGAGGFFKGIDALPLTGQYTHYALNKKTGKPDYVTDSAASATAWSTGVKTYNGALGVDIHEKDHPTILEMAKAAGLATGNVSTAELQDATPAALVAHVTSRKCYGPSATSEKCPGNALEKGGKGSITEQLLNARADVTLGGGAKTFAETATAGEWQGKTLREQAQARGYQLVSDAASLNSVTEANQQKPLLGLFADGNMPVRWLGPKATYHGNIDKPAVTCTPNPQRNDSVPTLAQMTDKAIELLSKNEKGFFLQVEGASIDKQDHAANPCGQIGETVDLDEAVQRALEFAKKEGNTLVIVTADHAHASQIVAPDTKAPGLTQALNTKDGAVMVMSYGNSEEDSQEHTGSQLRIAAYGPHAANVVGLTDQTDLFYTMKAALGLKLEEEEEEE 458 T 6.8E-10 Alk_phosphatase unppssm F Bacteria T 3bdz 1 A,B A,B CINA_CITBR CYTOCHROME P450CIN TSLFTTADHYHTPLGPDGTPHAFFEALRDEAETTPIGWSEAYGGHWVVAGYKEIQAVIQNTKAFSNKGVTFPRYETGEFELMMAGQDDPVHKKYRQLVAKPFSPEATDLFTEQLRQSTNDLIDARIELGEGDAATWLANEIPARLTAILLGLPPEDGDTYRRWVWAITHVENPEEGAEIFAELVAHARTLIAERRTNPGNDIMSRVIMSKIDGESLSEDDLIGFFTILLLGGIDATARFLSSVFWRLAWDIELRRRLIAHPELIPNAVDELLRFYGPAMVGRLVTQEVTVGDITMKPGQTAMLWFPIASRDRSAFDSPDNIVIERTPNRHLSLGHGIHRCLGAHLIRVEARVAITEFLKRIPEFSLDPNKECEWLMGQVAGMLHVPIIFPKGKRLSE 397 T 1.3E-34 p450 pdbpercent F Bacteria T 3be0 1 A,B A,B CINA_CITBR CYTOCHROME P450CIN TSLFTTADHYHTPLGPDGTPHAFFEALRDEAETTPIGWSEAYGGHWVVAGYKEIQAVIQNTKAFSNKGVTFPRYETGEFELMMAGQDDPVHKKYRQLVAKPFSPEATDLFTEQLRQSTNDLIDARIELGEGDAATWLANEIPARLTAILLGLPPEDGDTYRRWVWAITHVENPEEGAEIFAELVAHARTLIAERRTNPGNDIMSRVIMSKIDGESLSEDDLIGFFTILLLGGIDATARFLSSVFWRLAWDIELRRRLIAHPELIPNAVDELLRFYGPAMVGRLVTQEVTVGDITMKPGQTAMLWFPIASRDRSAFDSPDNIVIERTPNRHLSLGHGIHRCLGAHLIRVEARVAITEFLKRIPEFSLDPNKECEWLMGQVAGMLHVPIIFPKGKRLSE 397 T 1.3E-34 p450 pdbpercent F Bacteria T 3bef 3 C,F C,F PAR1_HUMAN PAR-1, THROMBIN RECEPTOR, COAGULATION FACTOR II RECEPTOR NDKYEPFWE 9 T 0.26 DUF5848 pdbhh F Eukaryota T 3bg4 1 A A CTRB_BOVIN Chymotrypsin A chain A CGVPAIQPVLSGL 13 T 2 CaM_bind pdbhh F Eukaryota T 3bgm 3 C C KPCD2_HUMAN NPKC-D2 RQASLSISV 9 T 0.0054 TCAD9 unppercent F Eukaryota T 3bh9 3 C C POF1B_HUMAN PREMATURE OVARIAN FAILURE PROTEIN 1B RTYSGPMNKV 10 T 0.062 CCDC158 unphh F Eukaryota T 3bhb 3 C C N4BP2_HUMAN N4BP2, BCL-3-BINDING PROTEIN KMDSFLDMQL 10 T 15 Rrp15p pdbhh F Eukaryota T 3bim 2 B,D,F,H,J,L,N,P I,J,K,L,M,N,O,P BCOR_HUMAN BCOR GSRSEIISTAPSSWVVPGP 19 T 0.4 GPR15L pdbhh F Eukaryota T 3bin 2 B B CADM1_HUMAN IMMUNOGLOBULIN SUPERFAMILY MEMBER 4, NECTIN-LIKE PROTEIN 2, NECL-2, TUMOR SUPPRESSOR IN LUNG CANCER 1, TSLC-1, SYNAPTIC CELL ADHESION MOLECULE, SPERMATOGENIC IMMUNOGLOBULIN SUPERFAMILY, SGIGSF ARHKGTYFTHEA 12 T 0.16 DAG1 unphh F Eukaryota T 3bk9 1 A,B,C,D,E,F,G,H A,B,C,D,E,F,G,H T23O_XANCP TDO MPVDKNLRDLEPGIHTDLEGRLTYGGYLRLDQLLSAQQPLSEPAHHDEMLFIIQAQTSELWLKLLAHELRAAIVHLQRDEVWQCRKVLARSKQVLRQLTEQWSVLETLTPSEYMGFRDVLGPSSGFQSLQYRYIEFLLGNKNPQMLQVFAYDPAGQARLREVLEAPSLYEEFLRYLARFGHAIPQQYQARDWTAAHVADDTLRPVFERIYENTDRYWREYSLCEDLVDVETQFQLWRFRHMRTVMRVIGFKRGTGGSSGVGFLQQALALTFFPELFDVRTSVGVDNRPPQGSADAGKRLEHHHHHH 306 T 6.3E-41 Trp_dioxygenase unp F Bacteria T 3bo7 2 E,F,G,H E,F,G,H CICLOSPORIN, CICLOSPORINE XXXXXXXXVXA 11 T 9.7 DUF6090 pdbhh F F 3boo 2 B B N-Ac-CRATKML inhibitory peptide XCRATKML 8 T 12 DUF3136 pdbhh F T 3bp4 3 C C PPGB_HUMAN CATHEPSIN A, CARBOXYPEPTIDASE C, PROTECTIVE PROTEIN FOR BETA-GALACTOSIDASE IRAAPPPLF 9 T 20 DUF6023 pdbhh F Eukaryota T 3bp7 3 C C PPGB_HUMAN CATHEPSIN A, CARBOXYPEPTIDASE C, PROTECTIVE PROTEIN FOR BETA-GALACTOSIDASE IRAAPPPLF 9 T 20 DUF6023 pdbhh F Eukaryota T 3bpm 2 C,D D,C Leupeptin XLLX 4 T 590 DNTTIP1_dimer pdbhh F F 3bqd 2 B B NCOA1_HUMAN NCOA-1, STEROID RECEPTOR COACTIVATOR 1, SRC-1, RIP160, PROTEIN HIN-2, RENAL CARCINOMA ANTIGEN NY-REN-52 AQQKSLLQQLLTE 13 T 1 GFD1 pdbhh F Eukaryota T 3bqo 2 B B TINF2_HUMAN TRF1-INTERACTING NUCLEAR PROTEIN 2 SHFNLAPLGRRRVQSQWASTR 21 T 1.9 COX7B pdbhh F Eukaryota T 3brd 4 D D LIN12_CAEEL ABNORMAL CELL LINEAGE PROTEIN 12 SPGNRTRKRRMINASVWMPPMENEEKNRK 29 T 0.039 OSTbeta unppercent F Eukaryota T 3brf 4 D D LIN12_CAEEL ABNORMAL CELL LINEAGE PROTEIN 12 SRMINASVWMPPME 14 T 0.039 OSTbeta unppercent F Eukaryota T 3brl 2 B C SWA_DROME Protein swallow 10-resiude peptide ATSAKATQTD 10 T 15 KxDL unp F Eukaryota T 3bs4 2 B B Unknown peptide NIF 3 T 73 OB_MalK pdbhh F F 3btb 1 A _ B3AT_HUMAN BAND 3 MEELQDDYEDMMEENX 16 T 1.1 DUF1265 pdbhh F Eukaryota T 3bts 2 C,D E,F GAL4_YEAST Regulatory protein GAL4 GMFNTTTMDDVYNYLFDDEDT 21 T 4.5 T6PP_N pdbhh F Eukaryota T 3bu3 2 B B IRS2_MOUSE IRS-2, 4PS AYNPYPEDYGDIEIG 15 T 12 STAT1_TAZ2bind pdbhh F Eukaryota T 3bu5 2 B B IRS2_MOUSE IRS-2, 4PS AYNPYPEDYGDIEIG 15 T 12 STAT1_TAZ2bind pdbhh F Eukaryota T 3bu6 2 B B IRS2_MOUSE IRS-2, 4PS AYNPYPEDXGDIEIG 15 T 12 STAT1_TAZ2bind pdbhh F Eukaryota T 3bu8 2 C,D C,D TINF2_HUMAN TRF1-INTERACTING NUCLEAR PROTEIN 2 SFNLAPLGRRRVQSQWAST 19 T 2.7 COX7B pdbhh F Eukaryota T 3bua 2 E,F,G,H E,F,G,H DCR1B_HUMAN HSNM1B SEFRGLALKYLLTPVNFFQAGYSSRRFDQQVEKYHK 36 T 7.7 Sedlin_N pdbhh F Eukaryota T 3bum 1 A A SPY2_HUMAN SPRY-2 IRNTNEXTEGPTV 13 T 3.3 KAR9 unp F Eukaryota T 3bun 1 A A SPY4_HUMAN SPRY-4, SPROUTY-4 SHVENDXIDNPSL 13 T 0.88 MRP-L51 pdbhh F Eukaryota T 3buo 1 A,C A,C EGFR_HUMAN RECEPTOR TYROSINE-PROTEIN KINASE ERBB-1, EGFR DSFLQRXSSDPTG 13 T 1.2 DUF4348 pdbhh F Eukaryota T 3buw 1 A,C A,C KSYK_HUMAN SPLEEN TYROSINE KINASE TVSFNPXEPELAP 13 T 0.13 Herpes_UL36 pdbhh F Eukaryota T 3bux 1 A,C A,C MET_HUMAN HGF RECEPTOR, SCATTER FACTOR RECEPTOR, SF RECEPTOR, HGF/SF RECEPTOR, MET PROTO-ONCOGENE TYROSINE KINASE, C-MET SNESVDXRATFPE 13 T 3.8 MtaB pdbhh F Eukaryota T 3bv9 3 C C FM19 inhibitor XXPXXX 6 T 140 CobS_N pdbhh F F 3bvh 4 G,H,I,J G,I,H,J 4-mer peptide GPRP GPRP 4 T 65 SRCR_2 pdbhh F F 3bxm 2 B I N-Acetyl-Aspartyl-Glutamate (NAAG) XDE 3 T 700 AAA_11 pdbhh F F 3bxn 3 C C PPGB_HUMAN PCATA IRAAPPPLF 9 T 20 DUF6023 pdbhh F Eukaryota T 3by7 1 A,B,C,D,E A,B,C,D,E uncharacterized protein GMKNIKIMRLVTGEDIIGNISESQGLITIKKAFVIIPMQATPGKPVQLVLSPWQPYTDDKEIVIDDSKVITITSPKDDIIKSYESHTSEIITPSGLITET 100 T 0.0012 Sm_like pdbpercent F T 3bya 2 B B NMDZ1_HUMAN N-METHYL-D-ASPARTATE RECEPTOR SUBUNIT NR1 KKKATFRAITSTLASSFKRRRSSK 24 T 14 Neuropeptide_S pdbhh F Eukaryota T 3bze 3 I,J,K,L P,Q,R,S HLAG_HUMAN HLA G ANTIGEN VMAPRTLFL 9 T 0.00093 UL40 pdbhh F Eukaryota T 3bzf 3 C,F P,Q 1C07_HUMAN MHC CLASS I ANTIGEN CW*7 VMAPRALLL 9 T 0.095 UL40 pdbhh F Eukaryota T 3bzi 2 B E MPIP3_HUMAN 9 MER PEPTIDE FROM DUAL SPECIFICITY PHOSPHATASE CDC25C LLCSTPNGL 9 T 2.1 DUF3038 pdbhh F Eukaryota T 3c0t 2 B B MED8_SCHPO MEDIATOR COMPLEX SUBUNIT 8, CELL SEPARATION PROTEIN SEP15 MEEQNANQMLTDILSFMKSGKRAAALEHHHHHH 33 T 30 YbeY pdbhh F Eukaryota T 3c2g 1 A,B A,B Q9XVI2_CAEEL Sys-1 protein MNITQAAEQAIRLWFNTPDPMQRLHMAKTIRTWIRQDKFAQVDQANMPNCVQQILNIIYDGLKPQPVQLPISYYAQLWYNLLDILRRFTFLPIISPYIHQVVQMFCPRENGPQDFRELICNLISLNWQKDPHMKHCANQVFQIFNCIIMGVKNEKLRTEFAQHLKFEKLVGTLSEYFNPQVHPGMINPAIFIIFRFIISKDTRLKDYFIWNNNPHDQPPPPTGLIIKLNAVMIGSYRLIAGQNPETLPQNPELAHLIQVIIRTFDLLGLLLHDSDAIDGFVRSDGVGAITTVVQYPNNDLIRAGCKLLLQVSDAKALAKTPLENILPFLLRLIEIHPDDEVIYSGTGFLSNVVAHKQHVKDIAIRSNAIFLLHTIISKYPRLDELTDAPKRNRVCEIICNCLRTLNNFLMMWIPTPNGETKTAGPNEKQQVCKFIEIDILKKLMSCLSCEGMDTPGLLELRSTILRSFILLLRTPFVPKDGVLNVIDENRKENLIGHICAAYSWVFRQPNNTRTQSTKQQLVERTISLLLVLMEQCGAEKEVAQYSYSIDCPLNLLNGNQVKPTFIHNVLVVCDKILEHCPTRADIWTIDRPMLEGLTNHRNSDIAKAANSLLSRFPEN 619 T 1.4E-05 Insc_C unphh F Eukaryota T 3c2g 2 C,D C,D POP1_CAEEL Pop-1 8-residue peptide GDEVKVFR 8 T 4.1 DUF5065 pdbhh F Eukaryota T 3c2h 1 A,B A,B Q9XVI2_CAEEL Sys-1 protein MNITQAAEQAIRLWFNTPDPMQRLHMAKTIRTWIRQDKFAQVDQANMPNCVQQILNIIYDGLKPQPVQLPISYYAQLWYNLLDILRRFTFLPIISPYIHQVVQMFCPRENGPQDFRELICNLISLNWQKDPHMKHCANQVFQIFNCIIMGVKNEKLRTEFAQHLKFEKLVGTLSEYFNPQVHPGMINPAIFIIFRFIISKDTRLKDYFIWNNNPHDQPPPPTGLIIKLNAVMIGSYRLIAGQNPETLPQNPELAHLIQVIIRTFDLLGLLLHDSDAIDGFVRSDGVGAITTVVQYPNNDLIRAGCKLLLQVSDAKALAKTPLENILPFLLRLIEIHPDDEVIYSGTGFLSNVVAHKQHVKDIAIRSNAIFLLHTIISKYPRLDELTDAPKRNRVCEIICNCLRTLNNFLMMWIPTPNGETKTAGPNEKQQVCKFIEIDILKKLMSCLSCEGMDTPGLLELRSTILRSFILLLRTPFVPKDGVLNVIDENRKENLIGHICAAYSWVFRQPNNTRTQSTKQQLVERTISLLLVLMEQCGAEKEVAQYSYSIDCPLNLLNGNQVKPTFIHNVLVVCDKILEHCPTRADIWTIDRPMLEGLTNHRNSDIAKAANSLLSRFPEN 619 T 1.4E-05 Insc_C unphh F Eukaryota T 3c2p 2 C,D A,B RPOLV_BPN4 Virion RNA polymerase MGGSHHHHHHRSESTVTEELKEGIDAVYPSLVGTADSKAEGIKNYFKLSFTLPEEQKSRTVGSEAPLKDVAQALSSRARYELFTEKETANPAFNGEVIKRYKELMEHGEGIADILRSRLAKFLNTKDVGKRFAQGTEANRWVGGKLLNIVEQDGDTFKYNEQLLQTAVLAGLQWRLTATSNTAIKDAKDVAAITGIDQALLPEGLVEQFDTGMTLTEAVSSLAQKIESYWGLSRNPNAPLGYTKGIPTAMAAEILAAFVESTDVVENIVDMSEIDPDNKKTIGLYTITELDSFDPINSFPTAIEEAVLVNPTEKMFFGDDIPPVANTQLRNPAVRNTPEQKAALKAEQATEFYVHTPMVQFYETLGKDRILELMGAGTLNKELLNDNHAKSLEGKNRSVEDSYNQLFSVIEQVRAQSEDISTVPIHYAYNMTRVGRMQMLGKYNPQSAKLVREAILPTKATLDLSNQNNEDFSAFQLGLAQALDIKVHTMTREVMSDELTKLLEGNLKPAIDMMVEFNTTGSLPENAVDVLNTALGDRKSFVALMALMEYSRYLVAEDKSAFVTPLYVEADGVTNGPINAMMLMTGGLFTPDWIRNIAKGGLFIGSPNKTMNEHRSTADNNDLYQASTNALMESLGKLRSNYASNMPIQSQIDSLLSLMDLFLPDINLGENGALELKRGIAKNPLTITIYGSGARGIAGKLVSSVTDAIYERMSDVLKARAKDPNISAAMAMFGKQAASEAHAEELLARFLKDMETLTSTVPVKRKGVLELQSTGTGAKGKINPKTYTIKGEQLKALQENMLHFFVEPLRNGITQTVGESLVYSTEQLQKATQIQSVVLEDMFKQRVQEKLAEKAKDPTWKKGDFLTQKELNDIQASLNNLAPMIETGSQTFYIAGSENAEVANQVLATNLDDRMRVPMSIYAPAQAGVAGIPFMTIGTGDGMMMQTLSTMKGAPKNTLKIFDGMNIGLNDITDASRKANEAVYTSWQGNPIKNVYESYAKFMKNVDFSKLSPEALEAIGKSALEYDQRENATVDDIANAASLIERNLRNIALGVDIRHKVLDKVNLSIDQMAAVGAPYQNNGKIDLSNMTPEQQADELNKLFREELEARKQKVAKA 1117 T 0.0039 RNA_pol pdbhh T Viruses T 3c3g 1 A A alpha/beta peptide with the GCN4-pLI side chain sequence on an (alpha-alpha-beta) backbone XMKXIEXKLXEIXSKXYHXENXLAXIKXLLXER 33 T 0.0016 VGPC1_C pdbhh F T 3c3h 1 A A alpha/beta-peptide based on the GCN4-pLI side chain sequence, with an (alpha-alpha-beta) backbone and cyclic beta-residues at positions 1, 4, 10, 19, 22, and 28 XXMKXIEXKLXEIXSKXYHXENXLAXIKXLLXER 34 T 0.04 DUF5082 pdbpssm F T 3c3l 2 C,D A,B RPOLV_BPN4 Virion RNA polymerase MGGSHHHHHHRSESTVTEELKEGIDAVYPSLVGTADSKAEGIKNYFKLSFTLPEEQKSRTVGSEAPLKDVAQALSSRARYELFTEKETANPAFNGEVIKRYKELMEHGEGIADILRSRLAKFLNTKDVGKRFAQGTEANRWVGGKLLNIVEQDGDTFKYNEQLLQTAVLAGLQWRLTATSNTAIKDAKDVAAITGIDQALLPEGLVEQFDTGMTLTEAVSSLAQKIESYWGLSRNPNAPLGYTKGIPTAMAAEILAAFVESTDVVENIVDMSEIDPDNKKTIGLYTITELDSFDPINSFPTAIEEAVLVNPTEKMFFGDDIPPVANTQLRNPAVRNTPEQKAALKAEQATEFYVHTPMVQFYETLGKDRILELMGAGTLNKELLNDNHAKSLEGKNRSVEDSYNQLFSVIEQVRAQSEDISTVPIHYAYNMTRVGRMQMLGKYNPQSAKLVREAILPTKATLDLSNQNNEDFSAFQLGLAQALDIKVHTMTREVMSDELTKLLEGNLKPAIDMMVEFNTTGSLPENAVDVLNTALGDRKSFVALMALMEYSRYLVAEDKSAFVTPLYVEADGVTNGPINAMMLMTGGLFTPDWIRNIAKGGLFIGSPNKTMNEHRSTADNNDLYQASTNALMESLGKLRSNYASNMPIQSQIDSLLSLMDLFLPDINLGENGALELKRGIAKNPLTITIYGSGARGIAGKLVSSVTDAIYERMSDVLKARAKDPNISAAMAMFGKQAASEAHAEELLARFLKDMETLTSTVPVKRKGVLELQSTGTGAKGKINPKTYTIKGEQLKALQENMLHFFVEPLRNGITQTVGESLVYSTEQLQKATQIQSVVLEDMFKQRVQEKLAEKAKDPTWKKGDFLTQKELNDIQASLNNLAPMIETGSQTFYIAGSENAEVANQVLATNLDDRMRVPMSIYAPAQAGVAGIPFMTIGTGDGMMMQTLSTMKGAPKNTLKIFDGMNIGLNDITDASRKANEAVYTSWQGNPIKNVYESYAKFMKNVDFSKLSPEALEAIGKSALEYDQRENATVDDIANAASLIERNLRNIALGVDIRHKVLDKVNLSIDQMAAVGAPYQNNGKIDLSNMTPEQQADELNKLFREELEARKQKVAKA 1117 T 0.0039 RNA_pol pdbhh T Viruses T 3c3o 2 B B CHM4A_HUMAN CHROMATIN-MODIFYING PROTEIN 4A, CHMP4A, VACUOLAR PROTEIN-SORTING-ASSOCIATED PROTEIN 7-1, SNF7- 1, HSNF-1, SNF7 HOMOLOG ASSOCIATED WITH ALIX-2 DEEALKQLAEWVS 13 T 2.8 DUF3884 pdbhh F Eukaryota T 3c3q 2 B B CHM4B_HUMAN CHROMATIN-MODIFYING PROTEIN 4B, CHMP4B, VACUOLAR PROTEIN-SORTING-ASSOCIATED PROTEIN 7-2, SNF7- 2, HSNF7-2, SNF7 HOMOLOG ASSOCIATED WITH ALIX 1, HVPS32 KEEEDDDMKELENWAGSM 18 T 1.4 TMEM154 pdbhh F Eukaryota T 3c3r 2 B B CHM4C_HUMAN CHROMATIN-MODIFYING PROTEIN 4C, CHMP4C, VACUOLAR PROTEIN-SORTING-ASSOCIATED PROTEIN 7-3, SNF7- 3, HSNF7-3, SNF7 HOMOLOG ASSOCIATED WITH ALIX 3 EDDDIKQLAAWAT 13 T 1.1 Ribosomal_60s unppssm F Eukaryota T 3c46 1 A,C A,B RPOLV_BPN4 Virion RNA polymerase MGGSHHHHHHRSESTVTEELKEGIDAVYPSLVGTADSKAEGIKNYFKLSFTLPEEQKSRTVGSEAPLKDVAQALSSRARYELFTEKETANPAFNGEVIKRYKELMEHGEGIADILRSRLAKFLNTKDVGKRFAQGTEANRWVGGKLLNIVEQDGDTFKYNEQLLQTAVLAGLQWRLTATSNTAIKDAKDVAAITGIDQALLPEGLVEQFDTGMTLTEAVSSLAQKIESYWGLSRNPNAPLGYTKGIPTAMAAEILAAFVESTDVVENIVDMSEIDPDNKKTIGLYTITELDSFDPINSFPTAIEEAVLVNPTEKMFFGDDIPPVANTQLRNPAVRNTPEQKAALKAEQATEFYVHTPMVQFYETLGKDRILELMGAGTLNKELLNDNHAKSLEGKNRSVEDSYNQLFSVIEQVRAQSEDISTVPIHYAYNMTRVGRMQMLGKYNPQSAKLVREAILPTKATLDLSNQNNEDFSAFQLGLAQALDIKVHTMTREVMSDELTKLLEGNLKPAIDMMVEFNTTGSLPENAVDVLNTALGDRKSFVALMALMEYSRYLVAEDKSAFVTPLYVEADGVTNGPINAMMLMTGGLFTPDWIRNIAKGGLFIGSPNKTMNEHRSTADNNDLYQASTNALMESLGKLRSNYASNMPIQSQIDSLLSLMDLFLPDINLGENGALELKRGIAKNPLTITIYGSGARGIAGKLVSSVTDAIYERMSDVLKARAKDPNISAAMAMFGKQAASEAHAEELLARFLKDMETLTSTVPVKRKGVLELQSTGTGAKGKINPKTYTIKGEQLKALQENMLHFFVEPLRNGITQTVGESLVYSTEQLQKATQIQSVVLEDMFKQRVQEKLAEKAKDPTWKKGDFLTQKELNDIQASLNNLAPMIETGSQTFYIAGSENAEVANQVLATNLDDRMRVPMSIYAPAQAGVAGIPFMTIGTGDGMMMQTLSTMKGAPKNTLKIFDGMNIGLNDITDASRKANEAVYTSWQGNPIKNVYESYAKFMKNVDFSKLSPEALEAIGKSALEYDQRENATVDDIANAASLIERNLRNIALGVDIRHKVLDKVNLSIDQMAAVGAPYQNNGKIDLSNMTPEQQADELNKLFREELEARKQKVAKA 1117 T 0.0039 RNA_pol pdbhh T Viruses T 3c5i 2 E E Cleaved fragment of N-terminal expression tag ENLYFQ 6 T 40 Phage_holin_2_4 pdbhh F T 3c5l 2 B B Peptide PPHST 5 T 110 F_actin_bind pdbhh F F 3c88 2 B B Inhibitor peptide RRGC RRGCX 5 T 40 Sdh5 pdbhh F F 3c89 2 B B Inhibitor peptide RRGM RRGMX 5 T 130 Dyp_perox pdbhh F F 3c8a 2 B B Inhibitor peptide RRGL RRGLX 5 T 100 Sda pdbhh F F 3c8b 2 B B Inhibitor peptide RRGI RRGIX 5 T 110 DUF1952 pdbhh F F 3c94 2 B,C B,C A0A0H3GL04_KLEPH Single-stranded DNA-binding C-terminal tail peptide WMDFDDDIPF 10 T 0.36 Phage_SSB pdbhh F Bacteria T 3c9c 2 B B H4_DROME Histone H4, 27-residue peptide AKRHRKVLRDNIQGITKPAIRRLARRG 27 T 8.5E-08 CENP-T_C unp F Eukaryota T 3c9n 3 C C Peptide antigen VQQESSFVM 9 T 3.5 DUF1615 pdbhh F T 3c9q 2 B L Synthetic peptide STA 3 T 720 F-112 pdbhh F F 3cal 2 B,D B,D FNBA_STAA8 FNBPA XKGIVTGAVSDHTTVEDTKX 20 T 0.015 Fn_bind unppssm F Bacteria T 3cay 1 A,B,C,D,E,F,G,H,I,J,K,L A,B,C,D,E,F,G,H,I,J,K,L LPD-12 XAXAEAAEKAAKYAAEAAEKAAKAXAX 27 T 30 DUF639 pdbhh F F 3cb8 2 B B peptide substrate VSGYAV VSGYAV 6 T 3 LAM_C pdbhh F F 3cba 1 A,B,C,D,E,F,G,H,I,J,K,L A,B,C,D,E,F,G,H,I,J,K,L LPD-12 XAXAEAAEKAAKYAAEAAEKAAKAXAX 27 T 30 DUF639 pdbhh F F 3cbl 2 B B Synthetic peptide XIYESL 6 T 89 NUC205 pdbhh F T 3cbm 2 B B ESR1_HUMAN ER, ESTRADIOL RECEPTOR, ER-ALPHA, NUCLEAR RECEPTOR SUBFAMILY 3 GROUP A MEMBER 1 IKRSKKNSLA 10 T 8.3 Chordopox_A30L pdbhh F Eukaryota T 3cbo 2 B B ESR1_HUMAN ER, ESTRADIOL RECEPTOR, ER-ALPHA, NUCLEAR RECEPTOR SUBFAMILY 3 GROUP A MEMBER 1 IKRSKKNSLA 10 T 8.3 Chordopox_A30L pdbhh F Eukaryota T 3cbp 2 B B ESR1_HUMAN ER, ESTRADIOL RECEPTOR, ER-ALPHA, NUCLEAR RECEPTOR SUBFAMILY 3 GROUP A MEMBER 1 IKRSKKNSLA 10 T 8.3 Chordopox_A30L pdbhh F Eukaryota T 3cc5 3 C,F C,F PMEL_HUMAN SILVER LOCUS PROTEIN HOMOLOG, MELANOCYTE LINEAGE-SPECIFIC ANTIGEN GP100, MELANOMA-ASSOCIATED ME20 ANTIGEN, ME20M, ME20-M KVPRNQDWL 9 T 3.2 ER pdbhh F Eukaryota T 3cch 3 C,F,I,L C,F,I,L nonameric peptide murine gp100 EGSRNQDWL 9 T 3.1 DUF5136 pdbhh F T 3cd3 2 B B Synthetic peptide XIYESL 6 T 89 NUC205 pdbhh F T 3cdg 5 I,J P,Q HLAG_HUMAN HLA G ANTIGEN VMAPRTLFL 9 T 0.00093 UL40 pdbhh F Eukaryota T 3cdw 2 B H POLG_CXB3N VPG GAYTGVPNQKPRVPTLRQAKVQ 22 T 6.8 DUF2111 pdbhh T Viruses T 3cf5 5 E 5 THCL_STRAJ ALANINAMIDE, BRYAMYCIN, THIACTIN XIAXASXTXXXXTXXXXXX 19 T 0.93 CCER1 pdbhh F Bacteria F 3cfs 2 B E H4_HUMAN Histone H4 QGITKPAIRRLARRG 15 T 2.9 Phage_Cox pdbhh F Eukaryota T 3cfv 2 B,D E,F H4_HUMAN Histone H4 peptide DNIQGITKPAIRRLARRG 18 T 8.5E-08 CENP-T_C unp F Eukaryota T 3ch1 3 I,J,K,L C,F,I,L nonameric peptide chimeric gp100 EGPRNQDWL 9 T 2.8 APOC4 pdbhh F T 3ch8 2 B P C-terminal octapeptide from protein ARVCF PQPVDSWV 8 T 4 Spt4 pdbhh F T 3che 2 C,D C,D Peptide inhibitor XXXD 4 T 530 zf-H2C2_5 pdbhh F F 3chf 2 C,D C,D Argifin XXXXD 5 T 400 zf-CCHC pdbhh F F 3chw 3 C V VASP_HUMAN VASP GAGGGPPPAPPLPAAQ 16 T 3.5 Tir_receptor_N pdbhh F Eukaryota F 3chx 4 D,I,N D,H,L 20-residue peptide XXXXXXXXXXXXXXXXXXXX 20 F F F 3chx 5 E,J,O M,N,O 26-residue peptide XXXXXXXXXXXXXXXXXXXXXXXXXX 26 F F F 3cii 3 C,F C,F HLAG_HUMAN HLA class I histocompatibility antigen peptide VMAPRTLFL 9 T 0.00093 UL40 pdbhh F Eukaryota T 3ck0 3 C P ANGT_HUMAN PROTEIN (8-MER; HUMAN ANGIOTENSIN II) DRVYIHPF 8 T 0.67 Nairo_nucleo pdbhh F Eukaryota T 3cmr 1 A,B A,B PPB_ECOLI APASE TPEMPVLENRAAQGDITAPGGARRLTGDQTAALRDSLSDKPAKNIILLIGDGMGDSEITAARNYAEGAGGFFKGIDALPLTGQYTHYALNKKTGKPDYVTDSAASATAWSTGVKTYNGALGVDIHEKDHPTILEMAKAAGLATGNVSTAELQDATPAALVAHVTSSKCYGPSATSEKCPGNALEKGGKGSITEQLLNARADVTLGGGAKTFAETATAGEWQGKTLREQAQARGYQLVSDAASLNSVTEANQQKPLLGLFADGNMPVRWLGPKATYHGNIDKPAVTCTPNPQRNDSVPTLAQMTDKAIELLSKNEKGFFLQVEGASIDKQDHAANPCGQIGETVDLDEAVQRALEFAKKEGNTLVIVTADHAHASQIVAPDTKAPGLTQALNTKDGAVMVMSYGNSEEDSQEHTGSQLRIAAYGPHAANVVGLTDQTDLFYTMKAALGLK 449 T 6.8E-10 Alk_phosphatase unppssm F Bacteria T 3cnf 1 A,B A,B CAPSD_CPVBM VP1 MHSTNNNSNKRNNEEKHKQPEIDSSANNGEGTSGTRAQTVGDTATEAGVRNETEAGASTRRQTDGTGLSGTNAKIATASSARQADVEKPADVTFTIENVDDVGIMQQKKPPTVVQSRTDVFNEQFANEALHPTTKVIFNGLDVNTEVQPLSDDFKQISDPKGYLTYSVKYEDQFTKKDKLRASEADDRIVGPTVNLFKYGAAVVNIDLNRDFFDTATGIDLTKGIPLVQDLLVPIGVTAGAEQSAEYVSGLLMVLFKVMTDNRLVIVGETTTPMSNTLSTVVNNVLRTTYHNNVGVNPALLRDFTQVNWLNRDITNMLQQAGTKYGLGLTETRLDYVRLVKTIVGHALNIDHFAASVLNINLRALMEANVTADDRIKALQAHSMISTQFHGPNQGALRPELAFDHDHIIRCLMLAAANYPRLEGIIVQINTGYVASANVIRPVSEKRYFPENLEQNQSAARLVSAVKARASEADISSIHLAIAREVSPMFNVHELKKIAESFEDPSSIVVVLEFILFALFFPTEFNRIKGDIQNVLLLFFSRWYPVEYGIFVQRGATYTINAAGEFEFSGRNEKWDQALYLSEHFPALFSDVPLAGANTIIAIMRLFTPQGFLRTDDLAIAANFPRASRNPQTYIPYTNQRGTVTNEFASRFRTIVATLANVVNERAVQDDMQKATRSCTKQWLRHLETQFDNIAVAHTDHLSVVYATMSNFMLNFTNNFSGNHATFKPDQYVITSPEGSYKPIIERQGETVDGLTIIDTSIVWPILCQCTYPLVRQSGKGVDAVSIMEEIVYPDPSTTLSQSLSVAQVLSKLTLPDAFINMILSGGDSVVMRTYQTEADDDLDEGIRMTTYDQYLSHIRERLHITNVPDPIYITGASTPDQIAASVQATHVAVVLYQSGVINGPASTYLRENEVLVVMPDYYDVVSRFANANLQMNNNRYHESVLEIADIFDQADFIQTSDAVRQLRALMPTLSTSQIRHAIERIAQITDVDSTDYGKLTLRFLGTLTRSLKMQNAQIRRIRPDGTVLRYDDQIDIEAFRWSRYFLDELQLRRLSVGLRLITNPRIARRFNGVRIMYLTDDDPDPDFVPDVPEGYVAVQYAHRLFSSSLANKRNRVTYTHPPTGMAYPSPTGRPHVHMTINERAGMSKLVADNIIASVIKSNWVVDILDIEYTAEVMTPSEGYTQHVDAESIMTAPKGKLFHLQFMDGLLRPEPSAFDPPASGEDMRLIYPLQPISVARSMRAIVNHNEVDRPRGAVAPSSYEMDTGTLSRNGDLLYSPVANGQVGIPKLEVDHISFSNVVSMMTANIRTGDDMAVERVNPDDVRAINIRNA 1333 T 130 SRP19 pdbhh T Viruses T 3cnf 2 C T Q9E957_CPVBM VP3 MWHYTSINNDTRVALDPKPNQIRTITKPNTVPQLGTDYLYTFNSQRRSHTLRLLGPFQYFNFSETDRGHPLFRLPLKYPSKAIPADELIDNLHSWMRSVHLLHVRSEDNTLRYNWMLGVYARSTNYTTPVGQLVVNAPAILNYSNPQDAFNSVFVALGIDYIDIPITNSNIFDDSSTPYNVRIWHAPTMTEVNHILALMRKSTLVSTHSSWHWDVLHTFHYRSESDMIDHFAAKILEDWRQKEKLDKGALVEADRVVQRLIPLSSSTYVQRLAAIGALYPNEFTENVLDLSRLSTALLQLSDTYYQHANDQLRRLYRRMYNDSRTLYMTQRHQELLLAQITADPNILLYPYTYIFTTAYTSMNYISNTGQGRIKHSLAVTGTTEHTIADITLGPMSEDVVTISMVEPMSIAAEDMYGYVLDTPTRDIWPADEQIEQKGDAVALYDTKTSRALGMFNNTVRIDDLLSPLLGLVYRTYIKGDTMTMTQGSLDHLTLCAAVDSDITFVGNRMIAPLAEGYIPKAMHRNNSTMKMLSLYVALKKLENFTTNSYLMAPDTSIILLGAEREPAVSILRRFNRSVSNVRIIGMGDRAVEPNIRVRVPFPIDKNISADFIICDINSYEDQSFESMFGETISVVTTCASAATRVLVKINHPSEYMINSVIERLSQLGGVFYHTALLKTASQNPYSYETYIYITPIAAAVRFPFYSNSAIINRYMTAVADDETPIIPSIHTVIKGHSNTYSPGLFCGCIDVQSAPFALSQLKSYCSEATTWRVDSDDNLVNIIARIDPARIALEFRTRSNTSAYHEYQRYVPNGLGFKGRKTREFRYIHREVTFIHKLMTYALIREQISLTENMTQVVSIGGRNLADISVVPLNMKYVVIDPATRIETLTQEKKNIEVQSRPFSFDAASMDLENNSIYLFIAVIMNEPNGAATPARTQMDKIRNVATAMLTRTNCVAYISFYEAGIITRLDQSTAHKTIRVEEGRLKVANYVPVDTLVEADVTLMLRDIGITHEIIRPSTPELINACSNYGIRLGSTGGAVLDVFNHYSPVIKLVRS 1057 T 0.32 Ig_mannosidase pdbpssm T Viruses T 3cpl 3 C,F E,F NP366 peptide ASNENAETM 9 T 26 DUF4690 pdbhh F T 3cpx 1 A,B,C A,B,C Aminopeptidase, M42 family MGSDKIHHHHHHENLYFQGMQLLKELCSIHAPSGNEEPLKDFILEYIRSNAGSWSYQPVIYADNDLQDCIVLVFGNPRTAVFAHMDSIGFTVSYNNHLHPIGSPSAKEGYRLVGKDSNGDIEGVLKIVDEEWMLETDRLIDRGTEVTFKPDFREEGDFILTPYLDDRLGVWTALELAKTLEHGIIAFTCWEEHGGGSVAYLARWIYETFHVKQSLICDITWVTEGVEAGKGVAISMRDRMIPRKKYVNRIIELARQTDIPFQLEVEGAGASDGRELQLSPYPWDWCFIGAPEKDAHTPNECVHKKDIESMVGLYKYLMEKL 321 T 1.7E-14 Peptidase_M42 pdbpssm F T 3cqu 2 B C GSK3B_HUMAN GSK-3 BETA GRPRTTSFAE 10 T 6.3 DUF3084 pdbhh F Eukaryota T 3cqw 2 B C GSK3B_HUMAN GSK-3 BETA GRPRTTSFAE 10 T 6.3 DUF3084 pdbhh F Eukaryota T 3cqz 11 K M AAMAT_AMAPH AMATOXIN, ALPHA AMANITIN XXGIGCNP 8 T 0.85 DUF3085 pdbhh F Eukaryota T 3cs0 2 B B pentapeptide XXXXX 5 F F F 3cs8 2 B B PRGC1_HUMAN PPAR-GAMMA COACTIVATOR 1-ALPHA, PPARGC-1-ALPHA, PGC-1-ALPHA, LIGAND EFFECT MODULATOR 6 PSLLKKLLLAPA 12 T 13 Neurokinin_B pdbhh F Eukaryota F 3cts 1 A A CITRATE SYNTHASE XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 437 F F F 3cu8 2 C,D P,Q RAF1_HUMAN RAF-1, C-RAF, CRAF RSTSTPNVH 9 T 46 ALC pdbhh F Eukaryota T 3cv0 2 B B G6PI_TRYBB T. brucei PGI PTS1 peptide Ac-FNELSHL FNELSHL 7 T 1.7 ATG9 pdbhh F Eukaryota T 3cvf 1 A,B,C,D A,B,C,D HOME3_HUMAN HOMER-3 GSHMAAEREETQQKVQDLETRNAELEHQLRAMERSLEEARAERERARAEVGRAAQLLDVSLFELSELREGLARLAEAAP 79 T 0.00048 Cast unppercent F Eukaryota T 3cvl 2 B B PFKA_TRYBB T. brucei PFK PTS1 peptide Ac-HEELAKL HEELAKL 7 T 59 CblD pdbhh F Eukaryota F 3cvn 2 B B G3PG_TRYBB T. brucei GAPDH PTS1 peptide Ac-DRDAAKL RDRAAKL 7 T 6.8 TAF6_C pdbhh F Eukaryota F 3cvo 1 A,B,C,D A,B,C,D Q5LRV1_RUEPO Methyltransferase-like protein of unknown function GMDDQSGDQMRPELTMPPAEAEALRMAYEEAEVILEYGSGGSTVVAAELPGKHVTSVESDRAWARMMKAWLAANPPAEGTEVNIVWTDIGPTGDWGHPVSDAKWRSYPDYPLAVWRTEGFRHPDVVLVDGRFRVGCALATAFSITRPVTLLFDDYSQRRWQHQVEEFLGAPLMIGRLAAFQVEPQPIPPGSLMQLIRTMTSP 202 T 0.0049 Methyltransf_24 pdbpssm F Bacteria T 3cvp 2 B B 10-SKL PTS1 peptide Ac-GTLSNRASKL GTLSNRASKL 10 T 12 DUF2434 pdbhh F T 3cvq 2 B B PTS1 peptide 7-SKL (Ac-SNRWSKL) XNRWSKL 7 T 3.9 PilI pdbhh F T 3cww 2 C,D D,E bradykinin N-terminal tetrapeptide analogue APPA 4 T 180 HMMR_N pdbhh F F 3cxw 2 B B Pimtide peptide ARKRRRHPSGPPTA 14 T 1.5 DUF3019 pdbhh F T 3cy2 2 B B Pimtide peptide ARKRRRHPSGPPTA 14 T 1.5 DUF3019 pdbhh F T 3cy3 2 B B Pimtide peptide ARKRRRHPSGPPTA 14 T 1.5 DUF3019 pdbhh F T 3cys 2 B B CYCLOSPORINE, CICLOSPORIN, CICLOSPORINE XXXXXXXXVXA 11 T 9.7 DUF6090 pdbhh F F 3czf 3 C C GLR_HUMAN PGR RRRWHRWRL 9 T 1 DUF3019 pdbhh F Eukaryota F 3d18 3 C C TERMINAL PROTEIN RRRWRRLTL 9 T 0.02 Herpes_LMP2 pdbhh F F 3d1e 2 C P decamer from polymerase II C-terminal TLMTGQLGLF 10 T 18 Chlorosome_CsmC pdbhh F T 3d1f 2 C,D P,Q Nonapeptide from polymerase III C-terminal SEQVELEFD 9 T 2.8 DUF4462 pdbhh F T 3d23 2 E,F,G,H H,F,E,G N-[(5-METHYLISOXAZOL-3-YL)CARBONYL]ALANYL-L-VALYL-N~1~-((1R,2Z)-4-(BENZYLOXY)-4-OXO-1-{[(3R)-2-OXOPYRROLIDIN-3-YL]METHYL}BUT-2-ENYL)-L-LEUCINAMIDE XAVLXX 6 T 1700 zf-H2C2_5 pdbhh F F 3d24 2 B,D B,D PRGC1_HUMAN PPAR-GAMMA COACTIVATOR 1-ALPHA, PPARGC-1-ALPHA, PGC-1-ALPHA, LIGAND EFFECT MODULATOR 6 QQQKPQRRPCSELLKYLTTNDD 22 T 0.78 HPIP pdbhh F Eukaryota T 3d25 3 C C HMHA1_HUMAN MINOR HISTOCOMPATIBILITY ANTIGEN HA-1, MHAG HA-1 VLHDDLLEA 9 T 7.1 Flu_M1_C pdbhh F Eukaryota T 3d2y 2 B B Anhydro-N-acetylmuramic acid-L-Ala-D-gamma-Glu-L-Lys XAXK 4 T 910 Endotoxin_C pdbhh F F 3d2z 2 B B L-Ala-D-gamma-Glu-L-Lys peptide AXK 3 T 620 DUF3392 pdbhh F F 3d32 2 C,D C,D K1 peptide DATYTWEHLAWPX 13 T 1.2 DUF4172 pdbhh F T 3d39 3 C C Modified HTLV-1 TAX (Y5(4fluoro)F) peptide LLFGXPVYV 9 T 0.35 YvrJ pdbhh F T 3d3v 3 C C Modified HTLV-1 TAX (Y5(3,4-difluoro)F) peptide LLFGXPVYV 9 T 0.35 YvrJ pdbhh F T 3d3x 2 C,D C,D SNAP-25 substrate peptide RIMEX 5 T 170 TMEM95 pdbhh F F 3d4b 2 B D Acetyl P53 peptide TSRHKXLMA 9 T 12 AHD pdbhh F T 3d6f 3 C C N-[(benzyloxy)carbonyl]-L-valyl-N-[(2S)-1-carboxy-4-fluoro-3-oxobutan-2-yl]-L-alaninamide XVADX 5 T 1100 RE_HindIII pdbhh F F 3d6h 3 C C N-[(benzyloxy)carbonyl]-L-valyl-N-[(2S)-1-carboxy-4-fluoro-3-oxobutan-2-yl]-L-alaninamide XVADX 5 T 1100 RE_HindIII pdbhh F F 3d6m 3 C C N-[(benzyloxy)carbonyl]-L-valyl-N-[(2S)-1-carboxy-4-fluoro-3-oxobutan-2-yl]-L-alaninamide XVADX 5 T 1100 RE_HindIII pdbhh F F 3d81 2 B C S-alkylamidate intermediate SRHKXLMF 8 T 8 DUF420 pdbhh F T 3d8a 2 I,J,K,L,M,N,O,P S,T,U,V,W,X,Y,Z TRAD1_ECOLI Protein traD GEDVEPGDDF 10 T 0.89 Taq-exonuc pdbhh F Bacteria T 3d9t 2 C,D C,D CASP9_HUMAN CASP-9, ICE-LIKE APOPTOTIC PROTEASE 6, ICE-LAP6, APOPTOTIC PROTEASE MCH-6, APOPTOTIC PROTEASE-ACTIVATING FACTOR 3, APAF-3, CASPASE-9 SUBUNIT P35, CASPASE-9 SUBUNIT P10 ATPFQE 6 F F Eukaryota T 3da9 3 C D Hirudin peptide DFEEIPGEX 9 T 0.014 Hirudin pdbhh F T 3dda 2 B B SNP25_HUMAN SNAP-25, SYNAPTOSOMAL-ASSOCIATED 25 KDA PROTEIN, SUPER PROTEIN, SUP QRATKMX 7 F F Eukaryota T 3ddb 2 B B SNP25_HUMAN SNAP-25, SYNAPTOSOMAL-ASSOCIATED 25 KDA PROTEIN, SUPER PROTEIN, SUP RRATKMX 7 F F Eukaryota T 3dep 2 B B LHCP L18 REGION YPGGSFDPLGLA 12 T 0.0011 Chloroa_b-bind pdbhh F T 3dfe 1 A,B,C,D,E,F A,B,C,D,E,F Q3M8P8_ANAVT Putative Pii-Like Signaling Protein GMSKRANKLVIVTEKVLLKKVAKIIEEAGATGYTVVDTGGKGSRNVRSTGKPNTSDTDSNVKFEVLTENREMAEKIADQVAIKFFTDYAGIIYICEAEVLYGRTFCGPDGC 111 T 0.0009 P-II unppercent F Bacteria T 3dgj 1 A A NNFGAIL peptide NNFGAIL 7 T 5.3 SidC_N pdbhh F T 3dgl 1 A A ATP Binding Protein-DX GSMDYKDDDDKKTNWLKRIYRVRPCVKCKVAPRDWKVKNKHLRIYNMCKTCFNNSIDIGDDTYHGHVDWLMYADSKEISNT 81 T 0.025 Glyco_tranf_2_4 pdbpssm F T 3dgm 1 A A ATP Binding Protein-DX GSMDYKDDDDKKTNWLKRIYRVRPCVKCKVAPRDWKVKNKHLRIYNMCKTCFNNSIDIGDDTYHGHVDWLMYADSKEISNT 81 T 0.025 Glyco_tranf_2_4 pdbpssm F T 3dgn 1 A A ATP Binding Protein-DX GSMDYKDDDDKKTNWLKRIYRVRPCVKCKVAPRDWKVKNKHLRIYNMCKTCFNNSIDIGDDTYHGHVDWLMYADSKEISNT 81 T 0.025 Glyco_tranf_2_4 pdbpssm F T 3dgo 1 A A ATP Binding Protein-DX GSMDYKDDDDKKTNWLKRIYRVRPCVKCKVAPRDWKVKNKHLRIFNMCKTCFNNSIDIGDDTYHGHVDWLMYADSKEISNT 81 T 0.025 Glyco_tranf_2_4 pdbpssm F T 3diw 2 C,D C,D CTNB1_HUMAN BETA-CATENIN NQLAWFDTDL 10 T 3 Tobravirus_2B pdbhh F Eukaryota T 3dks 2 E,F E,F siga peptide XPIPFLXQKD 10 T 1.1 DUF5450 pdbhh F T 3dkt 2 K,L,M,N,O,P,Q,R,S,T K,L,M,N,O,P,Q,R,S,T Q9WZP3_THEMA FERRITIN-LIKE PROTEIN GGDLGIRK 8 T 21 CDCA pdbhh F Bacteria T 3dm1 2 B,D,F,H B,D,F,H EHMT2_HUMAN HISTONE H3-K9 METHYLTRANSFERASE 3, H3-K9-HMTASE 3, EUCHROMATIC HISTONE-LYSINE N-METHYLTRANSFERASE 2, HLA-B-ASSOCIATED TRANSCRIPT 8, PROTEIN G9A, LYSINE N-METHYLTRANSFERASE 1C KVHRARKTMSKP 12 T 2 RNA_pol_Rpb5_N pdbhh F Eukaryota T 3dm7 1 A,B A,B VPS75_YEAST Vacuolar protein sorting-associated protein 75 GSMMSDQENENEHAKAFLGLAKCEEEVDAIEREVELYRLNKMKPVYEKRDAYIDEIAEFWKIVLSQHVSFANYIRASDFKYMDTIDKIKVEWLALESEMYDTRDFSITFHFHGIEGDFKEQQVTKVFQIKKGKDDQEDGILTSEPVPIEWPQSYDSINPDLMKDKRSPEGKKKYRQGMKTIFGWFRWTGLKPGKEFPHGDSLASLFSEEIYPFCVKYYAEAQRDLEDEEGESGL 234 T 0.0013 NAP pdbpercent F Eukaryota T 3dnj 2 C,D C,D synthetic N-end rule peptide YLFVQRDSKE 10 T 0.97 DUF4642 pdbhh F T 3dox 2 B P A PEPTIDE SUBSTRATE-SQNY SQNY 4 T 94 OFCC1 pdbhh F F 3dox 3 C Q A PEPTIDE SUBSTRATE-PIV PIV 3 T 170 SWC7 pdbhh F F 3dpc 1 A,B A,B PPB_ECOLI APASE TPEMPVLENRAAQGDITAPGGARRLTGDQTAALRDSLSDKPAKNIILLIGDGMGDSEITAARNYAEGAGGFFKGIDALPLTGQYTHYALNKKTGKPDYVTDLAASATAWSTGVKTYNGALGVDIHEKDHPTILEMAKAAGLATGNVSTAELQDATPAALVAHVTSRKCYGPSATSEKCPGNALEKGGKGSITEQLLNARADVTLGGGAKTFAETATAGEWQGKTLREQAQARGYQLVSDAASLNSVTEANQQKPLLGLFADGNMPVRWLGPKATYHGNIDKPAVTCTPNPQRNDSVPTLAQMTDKAIELLSKNEKGFFLQVEGASIDKQDHAANPCGQIGETVDLDEAVQRALEFAKKEGNTLVIVTADHAHASQIVAPDTKAPGLTQALNTKDGAVMVMSYGNSEEDSQEHTGSQLRIAAYGPHAANVVGLTDQTDLFYTMKAALGLKHHHHHH 455 T 1.1E-10 Alk_phosphatase pdbpssm F Bacteria T 3dpc 2 C C Phosphorylated Peptide HATPPKKEAD 10 T 33 Feld-I_B pdbhh F T 3dpo 2 C,D C,D PYRRH_PYRAP inhibitor peptide VDKLYXLPRPT 11 T 2.3 PHtD_u1 pdbhh F Eukaryota T 3dpp 2 C,D C,D PYRRH_PYRAP inhibitor peptide VDKLYXLPRPTPPRPIYNRN 20 T 2.5 Apidaecin unphh F Eukaryota T 3dpq 2 E,F,G,H C,D,G,H PYRRH_PYRAP inhibitor peptide VDKLYXLPRPTPPRPIYNRN 20 T 2.5 Apidaecin unphh F Eukaryota T 3dpy 3 C C caged substrate TKCVIM 6 T 3.1 Plk4_PB2 pdbhh F T 3dqb 2 B B GNAT1_BOVIN TRANSDUCIN ALPHA-1 CHAIN ILENLKDCGLF 11 T 0.75 Phage_holin_4_1 pdbhh F Eukaryota T 3drf 2 B B endogenous peptide ASNSIASG 8 T 14 DUF2109 pdbhh F F 3drh 2 B B peptide AAAAAA AAAAAA 6 T 340 UPF0253 pdbhh F F 3dri 2 B B peptide AASASA AASASA 6 T 680 DUF5811 pdbhh F F 3drj 2 B B pTH-related peptide AHAKA 5 T 440 SWIM pdbhh F F 3drk 2 B B Neuropeptide S SFANG 5 T 27 YkpC pdbhh F F 3ds0 2 B T Peptide inhibitor of capsid assembly ITFEDLLDYYGP 12 T 0.92 DUF2610 pdbhh F T 3ds1 2 B T Peptide Inhibitor of capsid assembly ITFEDLLDYYGP 12 T 0.92 DUF2610 pdbhh F T 3ds3 2 B,D C,D Peptide inhibitor of capsid assembly ITFEDLLDYYGP 12 T 0.92 DUF2610 pdbhh F T 3ds4 2 C T Peptide inhibitor of capsid assembly ITFEDLLDYYGP 12 T 0.92 DUF2610 pdbhh F T 3ds9 2 B B octapeptide I1 inhibitor XRWTXMLG 8 T 0.55 Lentiviral_Tat pdbhh F T 3dt5 1 A A Y924_ARCFU Uncharacterized protein AF_0924 GHSNRQVQLMARQQRLKAIEDRLEKFYIPLIKAFSSYVYTAQTEDEIETIITCRRYLAGNNLLRVLPMHFKFKADKIAGSANWTFYAKEDFEQWKEALDVLWEEFLEVLKEYYTLSGTEISLPEKPDWLIGYKGS 135 T 0.03 RE_HaeIII pdb F Archaea T 3dtx 3 C C VIPR1_HUMAN Double citrullinated vasoactive intestinal polypeptide receptor RRKWXXWHL 9 F F Eukaryota T 3dvp 2 C,D C,D PAK1_HUMAN P21 activated Kinase peptide TPTRDVATSP 10 T 1.4 TFIIA unppercent F Eukaryota T 3dw8 2 B,E B,E 2ABA_HUMAN PP2A, SUBUNIT B, B-ALPHA ISOFORM, PP2A, SUBUNIT B, B55-ALPHA ISOFORM, PP2A, SUBUNIT B, PR55-ALPHA ISOFORM, PP2A, SUBUNIT B, R2-ALPHA ISOFORM MAGAGGGNDIQWCFSQVKGAVDDDVAEADIISTVEFNHSGELLATGDKGGRVVIFQQEQENKIQSHSRGEYNVYSTFQSHEPEFDYLKSLEIEEKINKIRWLPQKNAAQFLLSTNDKTIKLWKISERDKRPEGYNLKEEDGRYRDPTTVTTLRVPVFRPMDLMVEASPRRIFANAHTYHINSISINSDYETYLSADDLRINLWHLEITDRSFNIVDIKPANMEELTEVITAAEFHPNSCNTFVYSSSKGTIRLCDMRASALCDRHSKLFEEPEDPSNRSFFSEIISSISDVKFSHSGRYMMTRDYLSVKVWDLNMENRPVETYQVHEYLRSKLCSLYENDCIFDKFECCWNGSDSVVMTGSYNNFFRMFDRNTKRDITLEASRENNKPRTVLKPRKVCASGKRKKDEISVDSLDFNKKILHTAWHPKENIIAVATTNNLYIFQDKVN 447 T 0.11 ANAPC4_WD40 unppercent F Eukaryota T 3dw8 4 G,H G,H microcystin LR XLXRXXX 7 T 49 Phage_TAC_2 pdbhh F F 3dyc 1 A,B A,B PPB_ECOLI APASE TPEMPVLENRAAQGDITAPGGARRLTGDQTAALRDSLSDKPAKNIILLIGDGMGDSEITAARNYAEGAGGFFKGIDALPLTGQYTHYALNKKTGKPDYVTDSAASATAWSTGVKTYNGALGVDIHEKDHPTILEMAKAAGLATGNVSTAELQDATPAALVAHVTSRKCYGPSATSEKCPGNALEKGGKGSITEQLLNARADVTLGGGAKTFAETATAGEWQGKTLREQAQARGYQLVSDAASLNSVTEANQQKPLLGLFADGNMPVRWLGPKATYHGNIDKPAVTCTPNPQRNDSVPTLAQMTDKAIELLSKNEKGFFLQVYGASIDKQDHAANPCGQIGETVDLDEAVQRALEFAKKEGNTLVIVTADHAHASQIVAPDTKAPGLTQALNTKDGAVMVMSYGNSEEDSQEHTGSQLRIAAYGPHAANVVGLTDQTDLFYTMKAALGLK 449 T 6.8E-10 Alk_phosphatase unppssm F Bacteria T 3dze 1 A A ATP5S_BOVIN ATP SYNTHASE-COUPLING FACTOR B, MITOCHONDRIAL ATP SYNTHASE REGULATORY COMPONENT FACTOR B SFWEWLNAVFNKVDHDRIRDVGPDRAASEWLLRCGAMVRYHGQQRWQKDYNHLPTGPLDKYKIQAIDATDSCIMSIGFDHMEGLQYVEKIRLCKCHYIEDGCLERLSQLENLQKSMLEMEIISCGNVTDKGIIALHHFRNLKYLFLSDLPGVKEKEKIVQAFKTSLPSLELKLDLK 176 T 0.0023 FBXL18_C pdbhh F Eukaryota T 3e08 1 A,B,C,D,E,F,G,H A,B,C,D,E,F,G,H T23O_XANCP Tryptophan 2,3-dioxygenase MPVDKNLRDLEPGIHTDLEGRLTYGGYLRLDQLLSAQQPLSEPAHHDEMLFIIQSQTSELWLKLLAHELRAAIVHLQRDEVWQCRKVLARSKQVLRQLTEQWSVLETLTPSEYMGFRDVLGPSSGFQSLQYRYIEFLLGNKNPQMLQVFAYDPAGQARLREVLEAPSLYEEFLRYLARFGHAIPQQYQARDWTAAHVADDTLRPVFERIYENTDRYWREYSLCEDLVDVETQFQLWRFRHMRTVMRVIGFKRGTGGSSGVGFLQQALALTFFPELFDVRTSVGVDNRPPQGSADAGKR 298 T 6.3E-41 Trp_dioxygenase unp F Bacteria T 3e0m 2 E,F,G E,F,G Short peptide SHMAEI SHMAEI 6 T 76 Nt_Gln_amidase pdbhh F T 3e0n 2 B A DPN-PHE-ARM XFX 3 T 58 FERM_C pdbhh F F 3e1i 4 G,H G,H Gly-His-Arg-Pro-amide GHRPX 5 T 100 zf-CCHC pdbhh F F 3e1k 2 B,D,F,H,J,L,N,P B,D,F,H,J,L,N,P LAC9_KLULA Lactose regulatory protein LAC9 TQQLFNTTTMDDVYNYIFDNDE 22 T 1.4 BTP pdbhh F Eukaryota T 3e1r 2 C C PDC6I_HUMAN PDCD6-INTERACTING PROTEIN, ALG-2-INTERACTING PROTEIN 1, HP95 QAQGPPYPTYPGY 13 T 1.1 Antimicrobial_5 pdbhh F Eukaryota T 3e2b 2 B C SWA_DROME Protein swallow 16-residue peptide MYHIRSATSAKATQTD 16 T 0.0069 KASH_CCD unppercent F Eukaryota T 3e2j 1 A,B,C,D A,B,C,D ATP5S_BOVIN ATP SYNTHASE-COUPLING FACTOR B, MITOCHONDRIAL ATP SYNTHASE REGULATORY COMPONENT FACTOR B SFWGWLNAVFNKVDHDRIRDVGPDRAASEWLLRCGAMVRYHGQQRWQKDYNHLPTGPLDKYKIQAIDATDSCIMSIGFDHMEGLQYVEKIRLCKCHYIEDGCLERLSQLENLQKSMLEMEIISCGNVTDKGIIALHHFRNLKYLFLSDLPGVKEKEKIVQAFKTSLPSLELKLDLK 176 T 0.0023 FBXL18_C pdbhh F Eukaryota T 3e2n 1 A A APX1_PEA;CCPR_YEAST CCP TTPLVHVASVEKGRSYEDFQKVYNAIALKIAEKKCGPVLVRLAWHTSGTWDKHDNTGGSYGGTYRFKKEFNDPSNAGLQNGFKFLEPIHKEFPWISSGDLFSLGGVTAVQEMQGPKIPWRCGRVDTPEDTTPDNGRLPDADKDADYVRTFFQRLNMNDREVVALMGAHALGKTHLKRSGYEGPFGAANNVFTNEFYLNLLNEDWKLEKNDANNEQWDSKSGYMMLPTDYSLIQDPKYLSIVKEYANDQDKFFKDFSKAFEKLLENGITFPKDAPSPFIFKTLEEQGL 287 T 3E-05 peroxidase pdbpssm F Eukaryota T 3e39 1 A,B A,B Q314Q8_DESAG Putative Nitroreductase GMLTENPVLQAIRQRRSIRRYTDEAVSDEAVRLILEAGIWAPSGLNNQPCRFLVIRADDPRCDILAAHTRYGHIVRGAKVIILVFLDREAMYNEVKDHQAAGAAVQNMLLAAHALQLGAVWLGEIINQAATLLPALALDPARLSFEAAIAAGHPAQNGSSSRRPLAELLLEEPFPQPE 178 T 5.1E-20 Nitroreductase pdbpercent F Bacteria T 3e3z 1 A A ATP5S_BOVIN ATP SYNTHASE-COUPLING FACTOR B, MITOCHONDRIAL ATP SYNTHASE REGULATORY COMPONENT FACTOR B SFWEWLNAVFNKVDHDRIRDVGPDRAASEWLLRCGAMVRYHGQQRWQKDYNHLPTGPLDKYKIQAIDATDSCIMSIGFDHMEGLQYVEKIRLCKCHYIEDGCLERLSQLENLQKSMLEMEIISCGNVTDKGIIALHHFRNLKYLFLSDLPGVKEKEKIVQAFKTSLPSLELKLDLK 176 T 0.0023 FBXL18_C pdbhh F Eukaryota T 3e4a 2 C,D F,G HYDROXAMATE PEPTIDE II1 AAA 3 T 1200 RNase_HII pdbhh F F 3e4g 1 A A ATP5S_BOVIN ATP SYNTHASE-COUPLING FACTOR B, MITOCHONDRIAL ATP SYNTHASE REGULATORY COMPONENT FACTOR B SFWEWLNAVFNKVDHDRIRDVGPDRAASEWLLRCGAMVRYHGQQRWQKDYNHLPTGPLDKYKIQAIDATDSCIMSIGFDHMEGLQYVEKIRLCKCHYIEDGCLERLSQLENLQKSMLEMEIISCGNVTDKGIIALHHFRNLKYLFLSDLPGVKEKEKIVQAFKTSLPSLELKLDLK 176 T 0.0023 FBXL18_C pdbhh F Eukaryota T 3e6y 2 C,D C,D Q40409_NICPL H+-ATPase phosphopeptide QSYpTV QSYTV 5 T 130 DUF4642 pdbhh F Eukaryota F 3e7a 2 C,D C,D nodularin R XRXXX 5 T 700 DUF2777 pdbhh F F 3e87 2 C,D C,D GSK3B_HUMAN GSK-3 BETA GRPRTTSFAE 10 T 6.3 DUF3084 pdbhh F Eukaryota T 3e88 2 C,D C,D GSK3B_HUMAN GSK-3 BETA GRPRTTSFAE 10 T 6.3 DUF3084 pdbhh F Eukaryota T 3e8d 2 C,D C,D GSK3B_HUMAN GSK-3 BETA GRPRTTSFAE 10 T 6.3 DUF3084 pdbhh F Eukaryota T 3e8u 3 C P BNP peptide epitope GVQGSGAFGRG 11 T 0.27 Mannitol_dh pdbhh F T 3ebb 2 E,F,G,H E,F,G,H TERA_HUMAN 15S MG(2+)-ATPASE P97 SUBUNIT, VALOSIN-CONTAINING PROTEIN, VCP TEDNDDDLYG 10 T 17 DUF228 pdbhh F Eukaryota T 3ech 2 C C Q9HXS2_PSEAE 25-mer fragment of protein ArmR RRDYTEQLRRAARRNAWDLYGEHFY 25 T 0.68 PLD_C pdbhh F Bacteria T 3edq 3 E,F E,F CASP-3; APOPAIN; CYSTEINE PROTEASE CPP32; YAMA PROTEIN; CPP-32; SREBP CLEAVAGE ACTIVITY 1; SCA-1 XLDESX 6 T 160 ResIII pdbhh F F 3edr 3 E,F E,F YKR18_YEAST Inhibitor Ac-ldesd-cho peptide XLDESX 6 F F Eukaryota F 3efd 3 C K KcsA SEKAAEEAYTRTTRALHERFDRLERMLDDN 30 T 1.1 PspB pdbhh F T 3efo 3 C C STX5_HUMAN Peptide DVAIDMM 7 T 0.19 COG5 unp F Eukaryota F 3eg1 2 C,D C,D p41 peptide XAPSYSPPPPP 11 T 1.8 N1221 pdbhh F F 3eg6 2 B C KMT2A_HUMAN MLL-1 peptide XGSARAEVHLRKS 13 T 1.1 N-SET unphh F Eukaryota T 3eg9 3 C C GOSR2_HUMAN peptide TTIPMDS 7 T 0.044 SLX9 unp F Eukaryota T 3egh 3 E,F E,F nodularin R XRXXX 5 T 700 DUF2777 pdbhh F F 3ejh 2 B,D E,F CO1A1_HUMAN Collagen type-I a1 chain GQRGVVGLPGQRGERGFPGLPGY 23 T 0.00026 Collagen pdb F Eukaryota T 3emh 2 B B KMT2A_HUMAN MLL1 ARAEVHLRKSAFD 13 T 1.1 N-SET unphh F Eukaryota T 3emw 2 B B VASA1_DROME Peptide (VASA) DINNNNNIVEDVERKREFYI 20 T 4 CppA_N pdbhh F Eukaryota T 3emy 2 B B Pepstatin XVVXAX 6 T 1700 FAM60A pdbhh F F 3eov 2 C,D C,D CICLOSPORIN, CICLOSPORINE XXXXXXXXVXA 11 T 9.7 DUF6090 pdbhh F F 3epd 2 B 0 Poliovirus Type3 peptide ISEV 4 T 250 Tnp_22_trimer pdbhh F F 3eqs 2 B B 12-mer peptide inhibitor TSFAEYWNLLSP 12 T 0.051 P53_TAD pdbhh F T 3eqy 2 C,D C,D 12-mer peptide inhibitor TSFAEYWNLLSP 12 T 0.051 P53_TAD pdbhh F T 3er5 2 B I ANGT_BOVIN H-189 PHPFHXVIHK 10 T 0.74 Ins134_P3_kin_N pdbhh F Eukaryota T 3era 1 A,B A,B 3S1EA_LATSE ERABUTOXIN A RICFNHQTSQPQTTKTCSPGESSCYNKQWSDFRGTIIERGCGCPTVKPGIKLSCCESEVCNN 62 T 0.003 Toxin_TOLIP pdb F Eukaryota T 3es5 1 A,A2,A3,A4,A5,B,B2,B3,B4,B5 A,A,A,A,A,B,A,B,A,B Q4G3H1_9VIRU Putative capsid protein MSFETSEGMSRPGDNPNKLNAKPRQSARPKTRNSTAQSNQTMRLGWIDPLPQVDTIFPLGLEPNVESIPAGEVELDFNLPETIAKPFADTVTSVGDRIQLVDDDKENIATSIYGLSFFKAARQLYSTMLDHEKAVNQPLKAVYYDETPIPAHMSGALGIIGHMKTKVGDVLVKDAGVLFKRGTAAGVTKFSEIDNDKTWNLDCSKLVWADHSSLSMIKRLASEKISQLVKQRYRVTDAQGHVYSVSMPQLTDQALPDYYDSIPDVAPNSDQLRVLTAALQMSLAQFRNDELPHDEDRSDLLTTLDLLYADGAYEISALRDQFELLMARYTTDFKWRVESIFKVGPPPAGTTGYGAQTVSSTGNTARWQFPLSDADINIGYLFSPSKSFSLFPKMVGYSKRAREDASASFANSDAKKFYAD 420 T 0.023 DUF5463 pdbpssm T Viruses T 3esk 2 B B HSP7C_HUMAN HEAT SHOCK 70 KDA PROTEIN 8 GASSGPTIEEVD 12 T 6.7 DUF4028 pdbhh F Eukaryota T 3etb 2 E,F,G,H J,K,L,M PAG_BACAN PA, PA-83, PA83, ANTHRAX TOXINS TRANSLOCATING PROTEIN [CONTAINS: PROTECTIVE ANTIGEN PA-20 AND PROTECTIVE ANTIGEN PA-63] RDKRFHYDRNNIAVGADESVVKEAHREVINSSTEGLLLNIDKDIRKILSGYIVEIEDTEGLKEVINDRYDMLNISSLRQDGKTFIDFKKYNDKLPLYISNPNYKVNVYAVTKENTIINPSENGDTSTNGIKKILIFSKKGYEIG 144 T 0.12 Fve pdbhh F Bacteria T 3eu7 2 B X BRCA2_HUMAN BRCA2, FANCONI ANEMIA GROUP D1 PROTEIN KADLGPISLNWFEELSSEA 19 T 0.091 DNAP_B_exo_N pdb F Eukaryota T 3ewf 2 E,F,G,H I,J,K,L PEPTIDIC SUBSTRATE XRHXX 5 T 290 Antimicrobial14 pdbhh F F 3exb 2 B B N-[3-(1H-BENZIMIDAZOL-1-YL)PROPANOYL]GLYCYL-L-ALANYL-L-ALANINAMIDE XGAAX 5 T 1100 US2 pdbhh F F 3eyf 3 E,F E,F GB_HCMVT Synthetic peptide ETIYNTTLKYX 11 T 12 HCMVantigenic_N unphh T Viruses T 3eys 3 C Q pyro-Glu3-A-Beta (3-8) peptide QFRHDS 6 T 59 Importin_rep pdbhh F T 3eyu 3 C Q ROR2(518-525) peptide REEFRHEA 8 T 14 ArAE_1_C pdbhh F F 3f2k 2 C C LYFA Peptide LYFA 4 T 72 Anoctamin pdbhh F F 3f2o 2 C,D C,D VASA1_DROME 20-mer peptide from ATP-dependent RNA helicase vasa DINNNNNIVEDVERKREFYI 20 T 4 CppA_N pdbhh F Eukaryota T 3f58 3 C P V3 LOOP SIGPGRAFGGG 11 T 0.24 CRISPR_assoc pdbhh F T 3f7d 2 B B PRGC1_MOUSE PPAR-GAMMA COACTIVATOR 1-ALPHA, PPARGC-1-ALPHA, PGC-1-ALPHA EEPSLLKKLLLAPA 14 T 7 Cnl2_NKP2 pdbhh F Eukaryota T 3f7f 1 A,B,C,D A,B,C,D NU120_YEAST NUCLEAR PORE PROTEIN NUP120 MACLSRIDANLLQYYEKPEPNNTVDLYVSNNSNNNGLKEGDKSISTPVPQPYGSEYSNCLLLSNSEYICYHFSSRSTLLTFYPLSDAYHGKTINIHLPNASMNQRYTLTIQEVEQQLLVNVILKDGSFLTLQLPLSFLFSSANTLNGEWFHLQNPYDFTVRVPHFLFYVSPQFSVVFLEDGGLLGLKKVDGVHYEPLLFNDNSYLKCLTRFFSRSSKSDYDSVISCKLFHERYLIVLTQNCHLKIWDLTSFTLIQDYDMVSQSDSDPSHFRKVEAVGEYLSLYNNTLVTLLPLENGLFQMGTLLVDSSGILTYTFQNNIPTNLSASAIWSIVDLVLTRPLELNVEASYLNLIVLWKSGTASKLQILNVNDESFKNYEWIESVNKSLVDLQSEHDLDIVTKTGDVERGFCNLKSRYGTQIFERAQQILSENKIIMAHNEDEEYLANLETILRDVKTAFNEASSITLYGDEIILVNCFQPYNHSLYKLNTTVENWFYNMHSETDGSELFKYLRTLNGFASTLSNDVLRSISKKFLDIITGELPDSMTTVEKFTDIFKNCLENQFEITNLKILFDELNSFDIPVVLNDLINNQMKPGIFWKKDFISAIKFDGFTSIISLESLHQLLSIHYRITLQVLLTFVLFDLDTEIFGQHISTLLDLHYKQFLLLNLYRQDKCLLAEVLLKDSSEFSFGVKFFNYGQLIAYIDSLNSNVYNASITENSFFMTFFRSYII 729 T 0.21 Nup160 pdbpssm F Eukaryota T 3f7o 2 C,D X,Y (ALA)(ALA)(PRO)(VAL) AAPV 4 T 280 DUF3458 pdbhh F F 3f9w 2 E,F,G,H E,F,G,H H4_HUMAN Histone H4 AKRHRKVLRD 10 T 0.27 UPF0137 unp F Eukaryota T 3f9x 2 E,F,G,H E,F,G,H H4_HUMAN Histone H4 AKRHRKVLRD 10 T 0.27 UPF0137 unp F Eukaryota T 3f9y 2 C,D E,F H4_HUMAN Histone H4 AKRHRKVLRD 10 T 0.27 UPF0137 unp F Eukaryota T 3f9z 2 E,F,G,H E,F,G,H H4_HUMAN Histone H4 AKRHRKVLRD 10 T 0.27 UPF0137 unp F Eukaryota T 3faj 1 A A Y131_ATV ORF131 MGSSHHHHHHSSGLVPRGSHMAKYEPKKGDYAGGAVKILDMFENGQLGYPEVTLKLAGEEANARRAGDERTKEAIHAIVKMISDAMKPYRNKGSGFQSQPIPGEVIAQVTSNPEYQQAKAFLASPATQVRNIEREEVLSKGAKKLAQAMAS 151 T 0.15 Mononeg_RNA_pol unppssm T Viruses T 3fbd 1 A,D A,D CEA7_ECOLX Colicin-E7 SKRNKPGKATGKGKPVNNKWLNNAGKDLGSPVPDRIANKLRDKEFKSFQDFRKKFWEEVSKDPELSKQFSRNNNDRMKVGKAPKTRTQDVSGKRTSFELHHEKPISQNGGVYDMDNISVVTPKRHIDIHRGK 132 T 0.038 HNH pdbpercent F Bacteria T 3fbr 2 B B peptide of EF-Tu XXXXXXXXX 9 F F F 3fdm 2 D,E,F D,E,F alpha/beta-peptide foldamer XXAXRXLXKXGDAFNRX 17 T 7.9 Bclx_interact pdbhh F T 3fdo 2 B B Synthetic high affinity peptide LTFEHYWAQLTS 12 T 1.4 ASXH pdbhh F T 3fe7 2 B L p53-peptidomimetic Ac-Phe-Met-Aib-Pmp-Trp-Glu-Ac3c-Leu-NH2 XFMXXWEXLX 10 T 2.2 CNTF pdbhh F T 3fea 2 B,C L,M p53-peptidomimetic Ac-Phe-Met-Aib-Pmp-6-Cl-Trp-Glu-Ac3c-Leu-NH2 XFMXXXEXLX 10 T 2.2 CNTF pdbhh F F 3fg5 2 B C pentapeptide FLSYK FLSYK 5 T 67 Cyanate_lyase pdbhh F F 3fga 5 E E MICROCYSTIN-LR XLXRXXX 7 T 49 Phage_TAC_2 pdbhh F F 3fhv 2 C C cftr peptide SXDXDNKEXX 10 T 0.75 DUF3243 pdbhh F T 3fif 2 I Y Uncharacterized ligand GGGGGGG 7 T 24 DUF6054 pdbhh F F 3fiv 2 C,D I,J FIV PROTEASE INHIBITOR LP-149 XXVLAEXX 8 T 140 RbpA pdbhh F F 3flo 2 B,E,H,K I,J,K,L DNA polymerase alpha catalytic subunit A XXX 3 F F F 3fma 2 F,G,H,I,J L,M,N,O,P BBP_YEAST SPLICING FACTOR 1, ZINC FINGER PROTEIN BBP, MUD SYNTHETIC-LETHAL 5 PROTEIN SSIAPPPGLSG 11 T 1.1 HMMR_N pdbhh F Eukaryota T 3fn0 3 C P Envelope polyprotein gp160 WNWFDITNK 9 T 0.22 Tna_leader pdbhh F T 3fn2 1 A,B A,B Putative sensor histidine kinase domain SNANGYTMQRDNQKTLAVYMFEEINRDVEYLSGRLSEKELKDKYRYYGRGYVRITDKDGQVITYEDGSVQDKTVFLTNEGANKLGWKLEFLIDEKMFEEEILEKQN 106 T 0.19 MucB_RseB pdbpssm F T 3fnt 2 B I Inhibitor, (IVA)VV(STA)A(STA) XVVXAX 6 T 1700 FAM60A pdbhh F F 3fod 1 A,B,C,D,E,F,G,H A,B,C,D,E,F,G,H AILSST hexapeptide segment from Islet Amyloid Polypeptide AILSST 6 T 130 BH3 pdbhh F F 3fol 3 C P 8 residue synthetic peptide VNDIFERI 8 T 0.58 WASH_WAHD pdbhh F T 3fom 3 C P 8 residue synthetic peptide IQQSIERI 8 T 4.9 DUF5791 pdbhh F F 3fon 3 C,F P,E Peptide VNDIFEAI 8 T 2.1 PRC2_HTH_1 pdbhh F T 3fp2 2 B Q HSP82_YEAST HEAT SHOCK PROTEIN HSP90 HEAT-INDUCIBLE ISOFORM, 82 KDA HEAT SHOCK PROTEIN EVPADTEMEEVD 12 T 25 UbiD pdbhh F Eukaryota T 3fp4 2 B Q HSP71_SCHPO Ssa1 GADNGPTVEEVD 12 T 5.8 CBP_CCPA pdbhh F Eukaryota T 3fqn 3 C C CTNB1_HUMAN peptide 30-39 from beta-Catenin: YLDSGIHSGA YLDSGIHSGA 10 T 2.3 DUF3094 pdbhh F Eukaryota T 3fqr 3 C C CTNB1_HUMAN phospho-peptide 30-39 from beta-Catenin: YLD(Sep)GIHSGA YLDSGIHSGA 10 T 2.3 DUF3094 pdbhh F Eukaryota T 3fqt 3 C C MPIP2_HUMAN peptide 38-46 from cell division cycle 25b (CDC25b): GLLGSPVRA GLLGSPVRA 9 T 7.7 Lep_receptor_Ig pdbhh F Eukaryota T 3fqu 3 C C MPIP2_HUMAN phospho-peptide 38-46 from cell division cycle 25b (CDC25b): GLLG(Sep)PVRA GLLGSPVRA 9 T 7.7 Lep_receptor_Ig pdbhh F Eukaryota T 3fqw 3 C C IRS2_HUMAN peptide 1097-1105 from insulin receptor substrate 2 (IRS2): RVASPTSGV RVASPTSGV 9 T 3.2 Frataxin_Cyay pdbhh F Eukaryota T 3fqx 3 C C IRS2_HUMAN phospho-peptide 1097-1105 from insulin receptor substrate 2 (IRS2): RVA(Sep)PTSGV RVASPTSGV 9 T 3.2 Frataxin_Cyay pdbhh F Eukaryota T 3fs1 2 B B PPARgamma Coactivator-1a (PGC-1a) AALAALLAA 9 T 17 DUF4699 pdbhh F F 3ft2 3 C P citrulline variant HA-1 peptide VLXDDLLEA 9 T 13 Trypco2 pdbhh F T 3ft3 3 C P HMHA1_HUMAN histidine variant HA-1 peptide VLHDDLLEA 9 T 7.1 Flu_M1_C pdbhh F Eukaryota T 3ft4 3 C P arginine variant HA-1 peptide VLRDDLLEA 9 T 13 Trypco2 pdbhh F T 3ftg 3 C C NP366-N3A variant peptide from influenza virus ASAENMETM 9 T 26 DUF1128 pdbhh F T 3fv3 2 B,D,F,H,J,L,N,P I,J,K,L,M,N,O,P pepstatin A XVVXAX 6 T 1700 FAM60A pdbhh F F 3fvh 2 B B Acetyl-Leu-His-Ser-phosphoThr-Ala-NH2 peptide XLHSTAX 7 T 500 HEPN_AbiA_CTD pdbhh F T 3fwg 1 A,B A,B CPXA_PSEPU CYTOCHROME P450-CAM, P450CAM NLAPLPPHVPEHLVFDFDMYNPSNLSAGVQEAWAVLQESNVPDLVWTRCNGGHWIATRGQLIREAYEDYRHFSSECPFIPREAGEAYDFIPTSMDPPEQRQFRALANQVVGMPVVDKLENRIQELACSLIESLRPQGQCNFTEDYAEPFPIRIFMLLAGLPEEDIPHLKYLTDQMTRPDGSMTFAEAKEALYDYLIPIIEQRRQKPGTDAISIVANGQVNGRPITSDEAKRMCGLLLVGGLDTVVNFLSFSMEFLAKSPEHRQELIQRPERIPAACEELLRRFSLVADGRILTSDYEFHGVQLKKGDQILLPQMLSGLDERENACPMHVDFSRQKVSHTTFGHGSHLCLGQHLARLQIIVTLKEWLTRIPDFSIAPGAQIQHKSGIVSGVQALPLVWDPATTKAV 405 T 1.6E-05 p450 unppercent F Bacteria T 3fwv 2 C,D C,D HS90B_HUMAN HSP 90, HSP 84 XMEEVF 6 T 3.3 SWI-SNF_Ssr4 pdbhh F Eukaryota F 3fxd 2 B,D B,D Q5ZYC9_LEGPH Protein IcmR EIGEPDVTDATLGSVYSEIISPVKDCILTVAKAVSFNPGGKDNTDAVEVLTELNTKVERAALNQPILTTKTER 73 T 2.8 MOSC_N pdbhh F Bacteria T 3fxe 2 B B Q5ZYC9_LEGPH Protein IcmR EIGEPDVTDATLGSVYSEIISPVKDCILTVAKAVSFNPGGKDNTDAVEVLTELNTKVERAAMNQPILTTKTER 73 T 2.8 MOSC_N pdbhh F Bacteria T 3fxh 1 A A B0BHE4_9BACT Integron gene cassette protein HFX_CASS2 MGSSHHHHHHSSGRENLYFQGMNNKHATSAVHEIIREICRLVDSGHSMTRDQFHELSEQERFIAFLAEKYSSTIKLYYLADSSPLFEKDTSSFIENAFGRHANTVVMEDFGLKSNALLLAINICLAILREINGEV 135 T 0.073 WASH-7_C unppercent F Bacteria T 3fxx 2 B B peptide substrate KQWDNYEXIW 10 T 0.12 DUF3896 pdbhh F T 3fy2 2 B B peptide substrate KQWDNYEFIW 10 T 3 DUF3896 pdbhh F T 3fy6 1 A,B,C,D A,B,C,D M1E1E6_VIBCL Integron cassette protein RENLYFQGMTEVNLNIYSPRWGRHETYIVELHKDYMEISMGAVTIKATYSENQDPEWSEETLQDIMNNDSVYPPEITQNLFQHAWLEWRKGALDNDEVTRELELVAQWVNKVTEAKPNSDFWRKYF 126 T 0.33 DUF768 pdbhh F Bacteria T 3fzx 1 A A Q5LD59_BACFN Putative exported protein GAQNQDCAFFFPNQEGEQITRNCYTADGKLTNILVYRVDQAYEYPSGMEVVANYTFADAAGKTLNSGQMVARCSDGNFSMSMGDVATFPTALNMMNADVYMMGDLMNYPDAFSNPMNPGDDDEFDDGTLRLYQKGNKNNRAEISVFDREFVTTETVNTPAGAFYCTKVKYEMNIWTPKETIKGYGYEWYAPNIGIVRSEQYNNKKELQSYSVLERIKK 218 T 0.0026 DUF3108 unppercent F Bacteria T 3g03 2 B,D B,D High affinity synthetic peptide LTFEHYWAQLTS 12 T 1.4 ASXH pdbhh F T 3g19 2 B C LLL tripeptide LLL 3 T 930 PAP_assoc pdbhh F F 3g1b 2 C,D C,D 10-residue peptide WLFVQRDSKE 10 T 1 DUF4642 pdbhh F T 3g2s 2 C,D C,D SORL_HUMAN SORTING PROTEIN-RELATED RECEPTOR CONTAINING LDLR CLASS A REPEATS, SORLA, SORLA-1, LOW-DENSITY LIPOPROTEIN RECEPTOR RELATIVE WITH 11 LIGAND-BINDING REPEATS, LDLR RELATIVE WITH 11 LIGAND-BINDING REPEATS, LR11 ITGFSDDVPMVIA 13 T 0.18 TMEM154 unphh F Eukaryota T 3g2t 2 C,D C,D SORL_HUMAN SORTING PROTEIN-RELATED RECEPTOR CONTAINING LDLR CLASS A REPEATS, SORLA, SORLA-1, LOW-DENSITY LIPOPROTEIN RECEPTOR RELATIVE WITH 11 LIGAND-BINDING REPEATS, LDLR RELATIVE WITH 11 LIGAND-BINDING REPEATS, LR11 ITGFSDDVPMVIA 13 T 0.18 TMEM154 unphh F Eukaryota T 3g2u 2 C,D C,D SORT_HUMAN NEUROTENSIN RECEPTOR 3, NTS3, NTR3, NT3 SGYHDDSDEDLLE 13 T 5.8 SiaC pdbhh F Eukaryota T 3g2v 2 B,D C,D SORT_HUMAN NEUROTENSIN RECEPTOR 3, NTS3, NTR3, NT3 SGYHDDSDEDLLE 13 T 5.8 SiaC pdbhh F Eukaryota T 3g2w 2 C,D C,D GGA1_HUMAN Internal peptide of the Hinge domain of ADP-ribosylation factor-binding protein GGA1 SASVSLLDDELMSL 14 T 6 EGL-1 pdbhh F Eukaryota T 3g3p 2 C D Peptide (NLE)LFVQRDSKE XLFVQRDSKE 10 T 1.1 DUF4642 pdbhh F T 3g6n 2 C,D F,G peptide (Met)(Ala)(Ser) MAS 3 T 280 zf-C2H2_4 pdbhh F F 3g7m 1 A A Q0WX48_WHEAT Xylanase inhibitor TL-XI APLTITNRCHFTVWPAVALVLAQGGGGTELHPGASWSLDTPVIGSQYIWGRTGCSFDRAGKGRCQTGDCGGSSLTCGGNPAVPTTMAEVSVLQGNYTYGVTSTLKGFNVPMNLKCSSGDALPCRKAGCDVVQPYAKSCSAAGSRLQIVFCP 151 T 6.1 Thaumatin unphh F Eukaryota T 3g8f 2 B B PHQ VAL ALA ARG SER peptide XVARS 5 T 330 UPA_2 pdbhh F F 3gbq 2 B B SOS1_MOUSE AC-VPPPVPPRRR-NH2 XVPPPVPPRRRX 12 T 4.2 Dscam_C pdbhh F Eukaryota F 3gch 1 A A CTRA_BOVIN GAMMA-CHYMOTRYPSIN CGVPAIQPVLSGL 13 T 2 CaM_bind pdbhh F Eukaryota T 3gcn 2 B B YQF peptide YQF 3 T 56 WD40_alt pdbhh F F 3gco 2 B B DNRDGNVYQF peptide DNRDGNVYQF 10 T 4.6 DUF4651 pdbhh F T 3gct 1 A E CTRA_BOVIN GAMMA-CHYMOTRYPSIN A CGVPAIQPVLSGL 13 T 2 CaM_bind pdbhh F Eukaryota T 3gct 4 D B UNK PRO GLY ALA TYR PEPTIDE XPGAY 5 T 56 BsuPI pdbhh F F 3gd1 3 D Z clathrin TNLIELDA 8 T 2.6 LRR_3 pdbhh F T 3gd2 2 B B activator peptide AHQLLRYLLDA 11 T 0.00089 DUF4927 pdbhh F T 3gd3 2 E,F E,F Apoptosis-inducing factor 1, mitochondrial XXXXXXXXX 9 F F F 3gds 2 B B DNRDGNVYYF peptide DNRDGNVYYF 10 T 0.29 Ribosomal_L24e pdbhh F T 3gdu 2 D,E,F D,E,F YRF peptide YRF 3 T 27 GAAD pdbhh F F 3gdv 2 D,E,F D,E,F YQF peptide YQF 3 T 56 WD40_alt pdbhh F F 3ge5 1 A,B A,B Q7MX99_PORGI NITROREDUCTASE FAMILY PROTEIN MGSDKIHHHHHHENLYFQGMKQIPQDFRLIEDFFRTRRSVRKFIDRPVEEEKLMAILEAGRIAPSAHNYQPWHFLVVREEEGRKRLAPCSQQPWFPGAPIYIITLGDHQRAWKRGAGDSVDIDTSIAMTYMMLEAHSLGLGCTWVCAFDQALCSEIFDIPSHMTPVSILALGYGDPTVPPREAFNRKTIEEVVSFEKL 198 T 2.5E-17 Nitroreductase unp F Bacteria T 3ggw 3 E,F E,F PEPTIDE B1 YLEDWIKYNNQK 12 T 0.0059 DUF3439 pdbhh F T 3ghb 3 C,F P,Q P88213_9HIV1 Envelope glycoprotein KGVRIGPGQA 10 T 0.085 GP120 pdbhh T Viruses T 3ghe 3 C P P88403_9HIV1 Envelope glycoprotein RKRIHIGPGRAFYAT 15 T 0.00016 GP120 pdbhh T Viruses T 3ghg 4 M,N,Q,R M,N,Q,R A knob GPRP 4 T 65 SRCR_2 pdbhh F F 3ghg 5 O,P,S,T O,P,S,T B knob GHRP 4 T 14 VPS38 unphh F F 3gi0 2 C,D C,D JG-365 inhibitor XSLNXIX 7 T 490 Rad54_N pdbhh F F 3gj9 2 C,D C,D KCNJ4_HUMAN C-TERMINAL PEPTIDE OF INWARD RECTIFIER K(+) CHANNEL KIR2.3 NISYRRESAI 10 T 1.3 Glyco_tran_10_N pdbhh F Eukaryota T 3gjn 2 B,C B,C CEA7_ECOLX Colicin-E7 MHHHHHHSMGKRNKPGKATGKGKPVNNKWLNNAGKDLGSPVPDRIANKLRDKEFKSFDDFRKKFWEEVSKDPELSKQFSRNNNDRMKVGKAPKTRTQDVSGKRTSFELHAEKPISQNGGVYDMDNISVVTPKRHIDIHRGK 141 T 0.011 HNH pdbpssm F Bacteria T 3gjo 2 E,F,G,H E,F,G,H DYST_HUMAN Dystonin GSRPSTAKPSKIPTPQRKSPASKLDKSSKR 30 T 13 DUF3697 pdbhh F Eukaryota T 3gjq 3 E,F E,F peptide inhibitor XWEHD 5 T 74 NPCC pdbhh F F 3gjs 3 E,F E,F Ac-YVAD-Cho inhibitor XYVAX 5 T 550 zf-CCHC pdbhh F F 3gjt 3 E,F E,F peptide inhibitor XIEPD 5 T 180 BIV_Env pdbhh F F 3gkl 1 A,B A,B CEA7_ECOLX Colicin-E9 immunity protein MHHHHHHSMGKRNKPGKATGKGKPVNNKWLNNAGKDLGSPVPDRIANKLRDKEFKSFDDFRKKFWEEVSKDPELSKQFSRNNNDRMKVGKAPKTRTQDVSGKRTSFELHAEKPISQNGGVYDMDNISVVTPKRHIDIHRGK 141 T 0.011 HNH pdbpssm F Bacteria T 3gkr 2 B B UDP-MurNAc-peptide XXKXX 5 T 230 OAM_dimer pdbhh F F 3gm1 2 B,C,E,F E,F,C,D PAXI_HUMAN Paxillin ATRELDELMASLS 13 T 0.99 SAM_LFY pdbhh F Eukaryota T 3gn6 1 A,B,C,D A,B,C,D Q8KDY2_CHLTE CT0912, ORFan protein with a ferredoxin-like domain repeat GMTGLSQSQASPMQIQPGNAAFNPWTDAALDTIRDVNQALTLYAEMRVVPAHHDAFLAAIDTVSAKLRVLPGFLSLALKQMSGDSTMVKNYPETYKGVLATAYLDGVAAGTQPYFYNLFVRFADGRAARAAGFEALFETHIHPLLHAMAPRGGDGPELLAYRAVLQSVVAGDRHAIYRGAEEIRSFLRRPVELPERETVTVENHVMVPEDKHAAWEPQVAILLQVAQDTFEPQDEPSGVGLPGARDNRYYRKALSTEILRNAHADGGLRAYIMHGVWESVWDHENSHLDPRFLAAAGPVGAAAVVGPVEPFYLTRRLVVAD 321 T 0.0014 ABM pdbpercent F Bacteria T 3gnn 2 C,D D,E Unknown Peptide XXXX 4 F F F 3go0 1 A,B A,B HNP-1, HP-1, HP1, DEFENSIN, ALPHA 1, HP 1-56, NEUTROPHIL DEFENSIN 2, HNP-2, HP-2, HP2 XXXXXXXXXXXGXXXXGXXXXXGXXXXXXX 30 F F F 3go3 2 C,D C,D QUINOMYCIN A XAXXXXAXXX 10 T 190 RSF pdbhh F F 3gof 2 C,D C,D NOS2_MOUSE INDUCIBLE NO SYNTHASE, INDUCIBLE NOS, INOS, NOS TYPE II, MACROPHAGE NOS, MAC-NOS RRREIRFRVLVKVVFF 16 T 5.9 RNA_pol_Rbc25 pdbhh F Eukaryota T 3gq1 2 C,D C,D WLFVQRDSKE peptide WLFVQRDSKE 10 T 1 DUF4642 pdbhh F T 3gsn 2 B P HCMV pp65 fragment 495-503 (NLVPMVATV) NLVPMVATV 9 T 15 GDH_N pdbhh F T 3gso 3 C P PP65_HCMVA HCMV pp65 fragment 495-503 (NLVPMVATV) NLVPMVATV 9 T 15 GDH_N pdbhh T Viruses T 3gsq 3 C P HCMV pp65 fragment 495-503, variant M5S (NLVPSVATV) NLVPSVATV 9 T 7 CtnDOT_TraJ pdbhh F T 3gsr 3 C P HCMV pp65 fragment 495-503, variant M5V (NLVPVVATV) NLVPVVATV 9 T 1.6 GDH_N pdbhh F T 3gsu 3 C P HCMV pp65 fragment 495-503, variant M5T (NLVPTVATV) NLVPTVATV 9 T 13 CtnDOT_TraJ pdbhh F T 3gsv 3 C P HCMV pp65 fragment 495-503, variant M5Q (NLVPQVATV) NLVPQVATV 9 T 23 DUF5464 pdbhh F T 3gsw 3 C P HCMV pp65 fragment 495-503, variant T8A (NLVPMVAAV) NLVPMVAAV 9 T 29 DUF2714 pdbhh F T 3gsx 3 C P HCMV pp65 fragment 495-503, variant T8V (NLVPMVAVV) NLVPMVAVV 9 T 6.9 ExbD pdbhh F T 3gt8 2 E X Unknown peptide XXXXXXXXXXX 11 F F F 3gv4 2 B H ubiquitin C-terminal peptide RLRGG RLRGG 5 T 100 DUF3589 pdbhh F F 3gw1 2 C,D C,D FGG peptide FGG 3 T 43 BppU_IgG pdbhh F F 3gxq 1 A,B A,B A0A0H2XIU6_STAA3 Putative regulator of transfer genes ArtA ENSVFFGKKKKVSLHLLVDPDMKDEIIKYAQEKDFDNVSQAGREILKKGLEQIA 54 T 0.033 DUF108 pdbpercent F Bacteria T 3gxv 2 C C DNAB_HELPY Replicative DNA helicase IKNASIKRKLFGLANTIREQAL 22 T 5.9 DNA_ligase_A_N unppssm F Bacteria T 3gyt 2 B B NCOA1_HUMAN SRC1 AQQKSLLQQLLTE 13 T 1 GFD1 pdbhh F Eukaryota T 3gyu 2 B B NCOA1_HUMAN SRC1 AQQKSLLQQLLTE 13 T 1 GFD1 pdbhh F Eukaryota T 3gz1 2 C,D P,Q IPAB_SHIFL 62 KDA ANTIGEN INTTNAHSTSNILIPELKAPKS 22 T 18 Phage_Treg pdbhh F Bacteria T 3gz2 2 C P IPAB_SHIFL 62 KDA ANTIGEN MGSSHHHHHHSSGLVPRGSHMILTSTELGDNTIQAANDAANKLFSLTIADLTANQNINTTNAHSTSNILIPELKAPKS 78 T 5 Aft1_HRR pdbhh F Bacteria T 3gze 2 E,F X,Y GP1_CHLRE Peptide substrate (Ser-Pro)5 SPSPSPSPSP 10 T 1.9 STIL_N pdbhh F Eukaryota F 3h11 3 C C IETD aldehyde inhibitor AIETX 5 T 140 WASH-7_mid pdbhh F F 3h1p 2 C,D C,D N-ACETYL-L-ALPHA-ASPARTYL-L-ALPHA-GLUTAMYL-N-[(2S)-1-CARBOXY-3-HYDROXYPROPAN-2-YL]-L-VALINAMIDE XDEVX 5 T 570 Helicase_RecD pdbhh F F 3h1z 2 B P PI51C_HUMAN PHOSPHATIDYLINOSITOL-4-PHOSPHATE 5-KINASE TYPE I GAMMA, PTDINS(4)P-5-KINASE GAMMA, PTDINSPKIGAMMA, PIP5KIGAMMA YFPTDERSWVYSPLH 15 T 0.63 PIG-S pdbhh F Eukaryota T 3h2h 1 A A Q5H5J0_XANOR PUTATIVE UNCHARACTERIZED PROTEIN APARGTLLTSNFLTSYTRDAISAMLASGSQPASGSQPEQAKCNVRVAEFTYATIGVEGEPATASGVLLIPGGERCSGPYPLLGWGHPTEALRAQEQAKEIRDAKGDDPLVTRLASQGYVVVGSDYLGLGKSNYAYHPYLHSASEASATIDAMRAARSVLQHLKTPLSGKVMLSGYSQGGHTAMATQREIEAHLSKEFHLVASAPISGPYALEQTFLDSWSGSNAVGENTFFILLGSYAIVAMQHTYKNIYLEPGQVFQDPWAAKVEPLFPGKQSLTDMFLNDTLPSIDKVKSYFQPGFYSDFPSNPANPFRQDLARNNLLEWAPQTPTLLCGSSNDATVPLKNAQTAIASFQQRGSNQVALVDTGTGNASDNSAFAHMLTKESCIVVVRDQLLDKQR 397 T 2.8E-13 LIP unppercent F Bacteria T 3h2i 1 A A Q5H5J0_XANOR PUTATIVE UNCHARACTERIZED PROTEIN APARGTLLTSNFLTSYTRDAISAMLASGSQPASGSQPEQAKCNVRVAEFTYATIGVEGEPATASGVLLIPGGERCSGPYPLLGWGHPTEALRAQEQAKEIRDAKGDDPLVTRLASQGYVVVGSDYLGLGKSNYAYHPYLHSASEASATIDAMRAARSVLQHLKTPLSGKVMLSGYSQGGHTAMATQREIEAHLSKEFHLVASAPISGPYALEQTFLDSWSGSNAVGEWTFGILLGSYAIVAMQHTYKNIYLEPGQVFQDPWAAKVEPLFPGKQSLTDMFLNDTLPSIDKVKSYFQPGFYSDFPSNPANPFRQDLARNNLLEWAPQTPTLLCGSSNDATVPLKNAQTAIASFQQRGSNQVALVDTGTGNASDNSAFAHMLTKESCIVVVRDQLLDKQR 397 T 2.8E-13 LIP unppercent F Bacteria T 3h32 4 G,H M,N FIBB_BOVIN Fibrin B knob pentapeptide GHRPY 5 T 12 VPS38 unphh F Eukaryota F 3h3p 3 E,F S,T 4E10_S0_1TJLC_004_N HHHHHHTNEAYLAHERRELEAKRNQLRDEVDRTKTHMQDEAANDPNWFDITAQLWEFSQELRNRDREEKLIKKIEQTLKKVENED 85 T 0.053 Ran-binding pdbpercent F T 3h52 2 E,F N,M NCOR1_HUMAN N-COR1, N-COR ASNLGLEDIIRKALMGSFD 19 T 3.6 RHH_7 pdbhh F Eukaryota T 3h5f 1 A,B,C A,B,C COIL SER L16L-Pen XEWEALEKKLAALESKXQALEKKLEALEHGX 31 T 0.00045 DUF5320 pdbhh F T 3h5r 2 E,F,G,H E,F,G,H MCCC7, MICROCIN C51, MCCC51, MICROCIN C, MCC MRTGNAD 7 T 110 FARP pdbhh F T 3h6z 2 B,D L,M 'HR(MLZ)VLR HRKVLR 6 T 37 DUF1609 pdbhh F F 3h7b 3 C,F C,F ATM_YEAST Tel1p peptide MLWGYLQYV 9 T 1 Rgp1 pdbhh F Eukaryota T 3h7z 1 A A YADA2_YEREN Adhesin yadA HTLKTANSYTDVTVSNSTKKAIRESNQYTDHKFHQLDNRLDKLDTRLLKLLASSAALNSLL 61 T 0.0018 CLZ pdb F Bacteria T 3h85 2 B P PI51C_HUMAN PHOSPHATIDYLINOSITOL-4-PHOSPHATE 5-KINASE TYPE I GAMMA, PTDINS(4)P-5-KINASE GAMMA, PTDINSPKIGAMMA, PIP5KIGAMMA SWVYSPLH 8 T 4.6 Pox_F15 pdbhh F Eukaryota T 3h8a 2 E,F E,F RNE_ECOLI RNase E QSPMPLTVAAASPELASGKVWIRYPIVR 28 T 1 XisI pdbhh F Bacteria T 3h8d 2 E,F,G,H E,F,G,H DAB2_RAT DOC-2, MITOGEN-RESPONSIVE PHOSPHOPROTEIN, C9 GSSSGGGSSSSGTSSAFSSYFNNKVGIPQEHVDHDDFDANQLLNKINE 48 T 2.4 EGL-1 pdbhh F Eukaryota T 3h9g 2 E,F,G,H E,F,G,H Microcin C7 analog MRTGNAX 7 T 110 FARP pdbhh F T 3h9h 3 C,F C,F ATM_YEAST Tel1p peptide MLWGYLQYV 9 T 1 Rgp1 pdbhh F Eukaryota T 3h9j 2 E,F,G,H E,F,G,H MCCC7, MICROCIN C51, MCCC51, MICROCIN C, MCC MRTGNAN 7 T 110 FARP pdbhh F T 3h9q 2 E,F,G,H E,F,G,H MCCC7, MICROCIN C51, MCCC51, MICROCIN C, MCC MRTGNAN 7 T 110 FARP pdbhh F T 3h9s 3 C C ATM_YEAST Tel1p peptide MLWGYLQYV 9 T 1 Rgp1 pdbhh F Eukaryota T 3hat 4 D T FPAM (FIBRINOPEPTIDE A MIMIC) XXXGVR 6 T 590 DUF4803 pdbhh F F 3hbu 2 B Z peptide AKAA 4 T 610 WVELL pdbhh F F 3hbv 2 B Z Uncharacterized peptide AKASQAA 7 T 460 Rop-like pdbhh F F 3hcv 3 C C VIPR1_HUMAN DOUBLE CITRULLINATED VASOACTIVE INTESTINAL POLYPEPTIDE RECEPTOR RRKWXXWHL 9 F F Eukaryota T 3hda 2 B Z Uncharacterized peptide AEAAQA 6 T 340 DUF773 pdbhh F F 3hdb 2 B L KNL KNL 3 T 390 PH_6 pdbhh F F 3hdi 2 C,D C,D Synthetic peptide AAAAAAAAAAAAAAAAA 17 T 260 Adeno_PIX pdbhh F F 3hds 2 E,F E,F short peptide ASWSA ASWSA 5 T 82 Equine_IAV_S2 pdbhh F F 3het 1 A,B A,B alpha/beta-peptide based on the GCN4-pLI side chain sequence with an (alpha-alpha-beta) backbone and a cyclic beta-residue at position 10 XXMKXIEXKLXEIXSKXYHXENXLAXIKXLLXER 34 T 0.002 VGPC1_C pdbhh F T 3heu 1 A A alpha/beta-peptide based on the GCN4-pLI side chain sequence with an (alpha-alpha-beta) backbone and a cyclic beta-residue at position 13 XXMKXIEXKLXEIXSKXYHXENXLAXIKXLLXER 34 T 0.002 VGPC1_C pdbhh F T 3hev 1 A A alpha/beta-peptide based on the GCN4-pLI side chain sequence with an (alpha-alpha-beta) backbone and a cyclic beta-residue at position 19 XXMKXIEXKLXEIXSKXYHXENXLAXIKXLLXER 34 T 0.002 VGPC1_C pdbhh F T 3hew 1 A A alpha/beta-peptide based on the GCN4-pLI side chain sequence with an (alpha-alpha-beta) backbone and a cyclic beta-residue at position 22 XXMKXIEXKLXEIXSKXYHXENXLAXIKXLLXER 34 T 0.0021 VGPC1_C pdbhh F T 3hex 1 A A alpha/beta-peptide based on the GCN4-pLI side chain sequence with an (alpha-alpha-beta) backbone and cyclic beta-residues at positions 1, 4, 19 and 28 XXMKXIEXKLXEIXSKXYHXENXLAXIKXLLXER 34 T 0.0016 VGPC1_C pdbhh F T 3hey 1 A A alpha/beta-peptide based on the GCN4-pLI side chain sequence with an (alpha-alpha-beta) backbone and cyclic beta-residues at positions 1, 4, 10, 19 and 28 XXMKXIEXKLXEIXSKXYHXENXLAXIKXLLXER 34 T 0.0016 VGPC1_C pdbhh F T 3hf0 1 A A GCN4-pLI side chain sequence on an (alpha-alpha-beta-alpha-beta-alpha-beta) backbone with cyclic beta-residues XRMXQXEXKLXEXLXKLXHXEXELXRXKXLLXEX 34 T 1 ATP-synt_DE pdbpssm F T 3hgk 2 E,F,G,H E,F,G,H HPAB2_PSESM AVRPTOB, AVIRULENCE PROTEIN AVRPTOB, E3 UBIQUITIN-PROTEIN LIGASE PRRGAVAHANSIVQQLVSEGADISHTRNMLRNAMNGDAVAFSRVEQNIFRQHFPNMPMHGISRDSELAIELRGALRRAVHQQAAS 85 T 0.11 Peptidase_C58 pdbpssm F Bacteria T 3hgl 1 A A HPAB2_PSESM AVRPTOB, AVIRULENCE PROTEIN AVRPTOB, E3 UBIQUITIN-PROTEIN LIGASE PRRGAVAHANSIVQQLVSEGADISHTRNMLRNAMNGDAVAFSRVEQNIFRQHFPNMPMHGISRDSELAIELRGALRRAVHQQAAS 85 T 0.11 Peptidase_C58 pdbpssm F Bacteria T 3hik 2 B B Pentamer phosphopeptide XPLHST 6 T 200 zf-C2H2_4 pdbhh F T 3hki 3 C,F C,F PAR1_HUMAN PAR-1, THROMBIN RECEPTOR, COAGULATION FACTOR II RECEPTOR SFLLRNPNDKYEPFWEDEEKN 21 T 1.3 SYCP2_SLD pdbhh F Eukaryota T 3hkj 3 C,F C,F PAR1_HUMAN PAR-1, THROMBIN RECEPTOR, COAGULATION FACTOR II RECEPTOR SFLLRNPNDKYEPFWEDEEKN 21 T 1.3 SYCP2_SLD pdbhh F Eukaryota T 3hm6 2 B C Unknown peptide XXXXXXXXXXXXXXXXXXXXXXXXXXX 27 F F F 3hqh 2 B M H2AY_HUMAN MacroH2A KAASADSTTEGTPAD 15 T 0.047 DUF1764 unp F Eukaryota T 3hqi 2 C,D C,D PucSBC1 DEVTSTT 7 T 100 Pox_P4B pdbhh F F 3hql 2 C,D C,D Q9VHV8_DROME SD08157P ENLACDEVTSTTSSST 16 T 1.2 DUF1163 pdbhh F Eukaryota T 3hqm 2 C,D C,D CI_DROME Protein cubitus interruptus NTLFPDVSSSTH 12 T 1.7 LSPR pdbhh F Eukaryota T 3hr5 3 I,J,K,L R,S,T,V M1prime-derived peptide SAQSQRAPDRVLCHSGQQQGLPRAAGGSVPHPRCH 35 T 15 HupF_HypC pdbhh F T 3hsv 2 C M H2AY_HUMAN MH2A1,HISTONE H2A.Y,H2A/Y,MEDULLOBLASTOMA ANTIGEN MU-MB-50.205 XDSTTEGTPADGFTVL 16 T 0.047 DUF1764 unp F Eukaryota T 3hu6 2 C,D C,D Q9VHV8_DROME SD08157P DEVTSTT 7 T 100 Pox_P4B pdbhh F Eukaryota F 3huf 2 D E COM1_SCHPO NBS1-INTERACTING PROTEIN 1, MEIOTICALLY UP-REGULATED GENE 38 PROTEIN IQELDSTTDEDEI 13 T 0.0085 CCDC144C unppercent F Eukaryota T 3hus 4 G,H,I,J G,H,I,J Peptide Ligand Gly-Pro-Arg-Pro-amide GPRP 4 T 65 SRCR_2 pdbhh F F 3i1g 1 A A GCN4_YEAST AMINO ACID BIOSYNTHESIS REGULATORY PROTEIN RMAQLEAKVEELLSKNWNLENEVARLKKLVGER 33 T 0.00052 bZIP_1 pdbpercent F Eukaryota T 3i5r 2 B B Peptide ligand HSKRPLPPLPSL 12 T 0.69 Herpes_LAMP2 pdbhh F T 3i74 2 C,D C,D ACE-PHE-GLU-LYS-ALA chloromethylketone INHIBITOR XFEKXX 6 T 170 DUF759 pdbhh F F 3i7l 2 B B DDB2_HUMAN DAMAGE-SPECIFIC DNA-BINDING PROTEIN 2, DDB P48 SUBUNIT, DDBB, UV-DAMAGED DNA-BINDING PROTEIN 2, UV-DDB 2 SIVRTLHQHKLGRA 14 T 2.6 YrzK pdbhh F Eukaryota T 3i7n 2 B B WDTC1_HUMAN WD and tetratricopeptide repeats protein 1 NITRDLIRRQIKE 13 T 0.77 DUF6483 pdbhh F Eukaryota T 3i7o 2 B B DCAF6_HUMAN NRIP, NUCLEAR RECEPTOR INTERACTION PROTEIN, ANDROGEN RECEPTOR COMPLEX-ASSOCIATED PROTEIN, ARCAP HLLWDVRKRSLGL 13 T 1.8 SCAPER_N pdbhh F Eukaryota T 3i7p 2 B B DCA12_HUMAN CENTROSOME-RELATED PROTEIN TCC52, TESTIS CANCER CENTROSOME-RELATED PROTEIN SLVYYLKNREVRL 13 T 2.9 DUF760 pdbhh F Eukaryota T 3i7z 2 B B EGFR receptor fragment DADEYL 6 T 0.37 Glyco_transf_92 pdbhh F F 3i89 2 B B DCAF5_HUMAN BREAKPOINT CLUSTER REGION PROTEIN 2, BCRP2 SVVGFLSQRGLHG 13 T 2.9 Blt1_C pdbhh F Eukaryota T 3i8c 2 B B DCAF4_HUMAN WD repeat-containing protein 21A NASSMLRKSQLGF 13 T 4 MRP-S35 pdbhh F Eukaryota T 3i8e 2 C,D C,D DCAF8_HUMAN WD repeat-containing protein 42A QALPALRERELGS 13 T 1.7 DUF4661 unp F Eukaryota T 3i91 2 C C H3K9 peptide QTARKSTG 8 T 290 Ribosomal_S8e pdbhh F T 3iax 2 B B CEA_CITFR Colicin-A MPGFNYGGKGDGTGWSSERGSGPEPGGGSHGNSGGHDRGDSSNVGNESVTVMKPGDSYNTPWGKVIINAAGQPTMNGTVMTADNSSMVPYGRGFTRVLNSLVNNPVSLEHHHHHH 115 T 0.32 Cloacin pdbpercent F Bacteria T 3ibc 3 E,F E,F Acetyl-YVAD-CHO XYVAD 5 T 220 MRJP pdbhh F F 3idg 3 C C gp41 MPER peptide ALDKWD 6 T 0.63 FAA_hydro_N_2 pdbhh F F 3idi 3 C C gp41 MPER peptide ALDKWQN 7 T 2.6 AATF-Che1 pdbhh F T 3idj 3 C C gp41 MPER peptide analog ELDXWAS 7 T 7.3 Med13_C pdbhh F T 3idm 3 C C gp41 MPER peptide analog ELDXWAS 7 T 11 DUF5623 pdbhh F T 3idn 3 C C gp41 MPER peptide analog ELDXWAS 7 T 11 DUF5623 pdbhh F T 3iee 1 A A Q5LA60_BACFN Putative exported protein GASCSGGDKSKAPVVSTADIENAAEVIKYYNTSLGVLKDMVKEKDVNAVLDYMEQKGKTPALSAIVPPAVVSKDSAIVLNPGNCFNEETRRNLKQNYTGLFQARTEFYANFDTYLSYLKKKDVTNAKKLLDVNYQLSTQMSEYKQNIFDILSPFTEQAELVLLVDNPLKAQIMSVRKMSSTMQSILNLYARKHRMDGPRIDLKVAELTKQLDAAKKLPVVNGHEGEMKSYQAFLSQVETFIKQVKKVREKGEYSDADYDMLTSAFETSII 270 T 0.029 LPAM_1 unphh F Bacteria T 3if2 1 A,B A,B Q4FPU3_PSYA2 Aminotransferase GMKFSKFGQKFTQPTGISQLMDDLGDALKSDQPVNMLGGGNPAKIDAVNELFLETYKALGNDNDTGKANSSAIISMANYSNPQGDSAFIDALVGFFNRHYDWNLTSENIALTNGSQNAFFYLFNLFGGAFVNEHSQDKESKSVDKSILLPLTPEYIGYSDVHVEGQHFAAVLPHIDEVTHDGEEGFFKYRVDFEALENLPALKEGRIGAICCSRPTNPTGNVLTDEEMAHLAEIAKRYDIPLIIDNAYGMPFPNIIYSDAHLNWDNNTILCFSLSKIGLPGMRTGIIVADAKVIEAVSAMNAVVNLAPTRFGAAIATPLVANDRIKQLSDNEIKPFYQKQATLAVKLLKQALGDYPLMIHKPEGAIFLWLWFKDLPISTLDLYERLKAKGTLIVPSEYFFPGVDVSDYQHAHECIRMSIAADEQTLIDGIKVIGEVVRELYDNK 444 T 0.00056 Aminotran_1_2 pdbpercent F Bacteria T 3ifl 3 C P A4_HUMAN Amyloid beta A4 protein DAEFRHD 7 T 1 DUF5973 pdbhh F Eukaryota T 3ifo 3 C,F P,Q A4_HUMAN Amyloid beta A4 protein DAEFRHD 7 T 1 DUF5973 pdbhh F Eukaryota T 3ifp 3 C,F,I,L P,Q,R,S A4_HUMAN Amyloid beta A4 protein DAEFRHD 7 T 1 DUF5973 pdbhh F Eukaryota T 3iiq 2 C,D C,D ARYLOMYCIN A2 XXGXAY 6 T 230 MMACHC pdbhh F F 3iiy 2 B B Histone H1K26 peptide KKKARKSAGAA 11 T 18 DUF3906 pdbhh F F 3ij1 2 B B H4_HUMAN Histone H4K20 peptide AKRHRKVLRDN 11 T 0.27 UPF0137 unp F Eukaryota T 3im4 2 C C AKA10_HUMAN KINASE ANCHOR PROTEIN 10, PROTEIN KINASE A-ANCHORING PROTEIN 10, PRKA10, D-AKAP-2 GSPEFVQGNTDEAQEELAWKIAKMIVSDVMQQAQYDQPLEKSTKL 45 T 0.073 TnpW pdbpssm F Eukaryota T 3ino 1 A,B A,B PAG_BACAN PA63 GSRFHYDRNNIAVGADESVVKEAHREVINSSTEGLLLNIDKDIRKILSGYIVEIEDTEGLKEVINDRYDMLNISSLRQDGKTFIDFKKYNDKLPLYISNPNYKVNVYAVTKENTIINPSENGDTSTNGIKKILIFSKKGYEIG 143 T 0.11 Fve pdbhh F Bacteria T 3ipn 1 A,B,C,D,E,F A,B,C,D,E,F Non-natural Collagen XXGXXGXXGXXGXXGXXGXXG 21 T 0.0033 Collagen pdbpssm F F 3iqg 2 B P MNWNI MNWNI 5 T 22 Kisspeptin pdbhh F F 3iqh 2 B P MNYDI MNYDI 5 T 42 DUF4917 pdbhh F F 3iqi 2 B P MNENI MNENI 5 T 67 Pinin_SDK_N pdbhh F F 3iqj 2 B P RAF1_HUMAN C-RAF, CRAF, RAF-1 QRSTSTPNVH 10 T 58 ALC pdbhh F Eukaryota T 3iqu 2 B P RAF1_HUMAN C-RAF, CRAF, RAF-1 QRSTST 6 T 160 HN pdbhh F Eukaryota F 3iqv 2 B P RAF1_HUMAN C-RAF, CRAF, RAF-1 QRSTST 6 T 160 HN pdbhh F Eukaryota F 3isw 2 C C CFTR_HUMAN CFTR, CHANNEL CONDUCTANCE-CONTROLLING ATPASE, CAMP-DEPENDENT CHLORIDE CHANNEL, ATP-BINDING CASSETTE TRANSPORTER SUB-FAMILY C MEMBER 7 PLEKASVVSKLFFSWTAP 18 T 1.4 ACTH_domain pdbhh F Eukaryota T 3it3 1 A,B A,B HISTIDINE ACID PHOSPHATASE MVGYSSKLIFVSMITRHGDRAPFANIENANYSWGTELSELTPIGMNQEYNLGLQLRKRYIDKFGLLPEHYVDQSIYVLSSHTNRTVVSAQSLLMGLYPAGTGPLIGDGDPAIKDRFQPIPIMTLSADSRLIQFPYEQYLAVLKKYVYNSPEWQNKTKEAAPNFAKWQQILGNRISGLNDVITVGDVLIVAQAHGKPLPKGLSQEDADQIIALTDWGLAQQFKSQKVSYIMGGKLTNRMIEDLNNAVNGKSKYKMTYYSGHALTLLEVMGTLGVPLDTAPGYASNLEMELYKDGDIYTVKLRYNGKYVKLPIMDKNNSCSLDALNKYMQSINEKFQKHHHHHH 342 T 0.012 His_Phos_2 pdbpercent F T 3it8 2 D,E,F,J,K,L D,E,F,J,K,L Q9DHW0_YLDV 2L protein ITLKYNYTVTLKDDGLYDGVFYDHYNDQLVTKISYNHETRHGNVNFRADWFNISRSPHTPGNDYNFNFWYSLMKETLEEINKNDSTKTTSLSLITGCYETGLLFGSYGYVETANGPLARYHTGDKRFTKMTHKGFPKVGMLTVKNTLWKDVKAYLGGFEYMGCSLAILDYQKMAKGKIPKDTTPTVKVTGNELEDGNMTLECTVNSFYPPDVITKWIESEHFKGEYKYVNGRYYPEWGRKSNYEPGEPGFPWNIKKDKDANTYSLTDLVRTTSKMSSQPVCVVFHDTLEAQVYTCSEGCNGELYDHLYRKTEEGEGGSHHHHHH 324 T 0.0046 C1-set unppercent T Viruses T 3itb 2 E L Peptidoglycan substrate (AMV)A(FGA)K(DAL)(DAL) XAXKXX 6 T 260 DUF2175 pdbhh F F 3itn 2 B B ACETYL-ASP-GLU-VAL-ASP-CHLOROMETHYL KETONE inhibitor XDEVDX 6 T 200 ResIII pdbhh F F 3iux 2 B,D B,D miniature protein inhibitor CNCKAPETFLCYWRCLQX 18 T 0.00097 Bee_toxin pdbhh F T 3ivb 2 B M H2AY_HUMAN HISTONE MACROH2A1, MH2A1, H2A.Y, H2A/Y, MEDULLOBLASTOMA ANTIGEN MU-MB-50.205 KAASADSTTEGTPAD 15 T 0.047 DUF1764 unp F Eukaryota T 3ivq 2 B,D C,D CiSBC2 NTLFPDVSSSTH 12 T 1.7 LSPR pdbhh F T 3ivv 2 B D PucSBC1 DEVTSTTSSS 10 T 86 AcetDehyd-dimer pdbhh F F 3iwm 2 E,F,G,H H,F,G,E N-[(5-METHYLISOXAZOL-3-YL)CARBONYL]ALANYL-L-VALYL-N~1~-((1R,2Z)-4-(BENZYLOXY)-4-OXO-1-{[(3R)-2-OXOPYRROLIDIN-3-YL]METHYL}BUT-2-ENYL)-L-LEUCINAMIDE XAVLXX 6 T 1700 zf-H2C2_5 pdbhh F F 3iwy 1 A,B B,D D-peptide inhibitor XXXXXXXXXXXX 12 T 2.6 VPS13 pdbhh F F 3iyb 5 E 7 VP1 core XXXXXXXXXXX 11 F F F 3iyc 5 E 7 VP1 core XXXXXXXXXXX 11 F F F 3iym 1 A,B A,B Q6YDQ6_9VIRU Capsid protein MSSIAPTDSVSSSGKRSKPGKRERQQARSAVGSAGGKPASASKAAAFAQGGSSDPVPMPGKYPVVFSTGAGEPTRDQEFALPVHKAFPLFGSVSDKYRRNPRYAEFRAHSEFTDGVFGTHLAVSSLLRLAQQLVHAHVNMGLPLGDFAPLASSDVRIPSALASVVNQFGEFSSPSIGTRFLLRDFEHAVSRVVFLADQLWTNGNSHHIFARSWLPMSNNDGNFKTIVASRLLEFISAGDLSILPTVLEDAVLSGEVPEAWEQVKDLLGDAPGVGQVDRRDRFDFLFKSYADVGQFTTAFTTQAASDVLTELGLPWNSPSAGHLNWQYSTKQRFTFLADTWAKLSAAYSQFFELSSGLATRQSATGSHAQMVDLTSVEGVTVLKAALALSAPEFSLAACFPPSCIFVGGLTRRVVVTTSLSVSQRATEFCQMDWR 434 T 0.13 BON unppercent T Viruses T 3iz3 1 A A Q914N6_CPVBM Structural protein VP3 MWHYTSINNDTRVALDPKPNQIRTITKPNTVPQLGTDYLYTFNSQRRSHTLRLLGPFQYFNFSETDRGHPLFRLPLKYPSKAIPADELIDNLHSWMRSVHLLHVRSEDNTLRYNWMLGVYARSTNYTTPVGQLVVNAPAILNYSNPQDAFNSVFVALGIDYIDIPITNSNIFDDSSTPYNVRIWHAPTMTEVNHILALMRKSTLVSTHSSWHWNVLHTFHYRSESDMIDHFAAKILEDWRQKEKLDKGALVEADRVIQRLIPLSSSTYVQRLAAIGALYPNEFTENVLDLSRLSTALLQLSDTYYQHANDQLRRLYRRMYNDSRTLYMTQRHQELLLAQITADPNILLYPYTYIFTTIPTSMNYISNTGQGRIKHSLTVTGATEHDTVADIVLGQTGEDVITISMVEPMSIAVEDMYGYVLDTPTRDIWPADEQIEQKGDAVALYDTKTSRALGMFNNTVRIDDLLSPLLSLVYRTYIKGDTMTMTQGSLDHLTLCAAVDSDITFVGNRMIAPLPEGYIPKPMHRNNSTMKMLSLYVALKKLENFATNSYLMAPDTSIILLGAEREPAVNILRRFNRNVSNVRIIGMGDRAVEPNIRVRVPFPIDKNISADFIICDINSYEDQSFESMFSETISVVTTCASAATRALVKINHPSEYMINSVIERLSQLGGVFYHTALLKTASQNPYSYETYIYITPIAAAVRFPFYSNSAMINRYMTAVADDEMPIIPSIHTVIKGHSNTYSPGLFCGCVDVQSAPLALSQLKSYCSEATTWRVDSDDNLVNIIARIDPARIALEFRTRSNTSAYHEYQRYVPNGLGFKVRKTREFRYMHREVTFIHKLMMYALIREQISLTENMTQVVSIGGRNLADISVVPLNMKYVVIDPATRIETLTQEKKNIEVQSRPFQFDAANMDLENNSIYLFIAVIMNEPNGAATPARMQMDKIRNVATAMLTRTNCVAYISFYEAGIITRLDQSTAHKTIRVEEGRLKVANYVPVDTLVEADVTLMLRDIGITHEIIRPSTPELIDACSNYGIRLGSTGGAVLDVFNHYSPVIKLVRS 1058 T 0.52 DUF5705 pdbpssm T Viruses T 3iz3 2 B,C B,C CAPSD_CPVBM Structural protein VP1 MHSTNNNSNKRNNEEKHKQPEIDSSANNGEGTSGTRAQTVGDTATEAGVRNETKAGASTRRQTDGTGLSGTNAKIATASSARQTDVEKPADVTFTIENVDDVGIMQQKKPPTVVQSRTDVFNEQFANEALHPMTKVIFNGLDVNTEVQPLSDDFKQISDPKGYLTYSVKYEDQFTKKDKLRASEADDRIVGPTVNLFKYGAAVVNIDLNRDFFDTATGIDLTKGIPLVQDLLVPIGVTAGAEQSAEYVSGLLMVLFKVMTDNRLVIVGETTTPMSNTLSTVVNNVLRTTYHNNVGVNPALLRDFTQVNWLNRDITNMLQQAGTKYGLGLTETRLDYVRLVKTIVGHALNIDHFAASVLNINLRALMEANVTADDRIKALQAHSMISTQFHGPNQGALRPELAFDHDHIIRCLMLAAANYPRLEGIIVQINTGYVASANVIRPVSEKRYFPENLEQNQSAARLVSAVKARASEADISSIHLAIAREVSPMFNVHELKKIAESFEDPSSIVVVLEFILFALFFPTEFNRIKGDIQNVLLLFFSRWYPVEYGIFIQRGATYTINAAGEFEFSGRNEKWDQSLYLSEHFPALFSDVPLAGANTIIAIMRLFTPQGFLRTDDLAIAANFPRASRNPQTYIPYTNQRGTVTNEFASRFRTIVATLANVVNERAVQDDMQKATRSCTKQWLRHLETQFDNIAVAHTDHLSVVYATMSNFMLNFTNNFSGNHATFKPDQYVITSPEGSYKPIIERQGETVDGLTIIDTSIVWPILCQCTYPLVRQSGKGVDAVSIMEEIVYPDPSTTLSQSLSVAQVLSKLTLPDAFINMILSGGDSVVMRTYQTEADDDLDEGIRMTTYDQYLSHIRERLHITNVPDPIYITGASTPDQIAASVQATHVAVVLYQSGVINGSASTYLRENEVLVVMPDYYDVVSRFANANLQMNNNRYHESVLEIADIFDQADFIQTSDAVRQLRALMPTLSTSQIRHAIERIAQITDVDSTDYGKLTLRFLGTLTRSLKMQNAQIRRIRPDGTVLRYDDQIDIEAFRWSRYFLDELRLRRLSVGLRLITNPRIARRFDGVRIMYLTDDDPDPDFVPDVPEGYVAVQYAHRLFSSSLANKRNRVTYTHPPTGMAYPSPTGRPHVHMTINERAGMSKLVADNIIASVIKSNWVVDIHDIEYTAEVMTPSEGYTQHVDAESIMTAPKGKLFHLQFMDGLLRPEPSAFDPPASGEDMRLIYPLQPISVARSMRAIVNHNEVDRPRGAVAPSSYEMDTGTLSRNGDLLYSPVANGQVGIPKLEVDHISFSNVVSMMTANIRTGDDMAVERVNPDDVRAINIRNA 1333 T 0.96 DUF2717 pdbpercent T Viruses T 3iz3 3 D,E D,E C6K2M8_CPVBM Viral structural protein 5 MLQQPTGGYTTLEQFAFTIRNDGTNATPTQFLQLLSYEATENELVKKTIPTPETHLPSARNVPGNVYIEDAITQALFGISAQNVNAHGYFSRLSALALPNTSARLGLDGVIYNSETINIPFYDPAAVANFAATYAKLGNASTPRYRADMIDIYAHVGLELAGTDAERAAGVMPVKRAKFDSWEGSLISLSRDVVNWKILAFLIDLCSLEGEALRAFKTRNRDVFRMMLFIMSTAVAANVVNRKVTKRVDRVLEYIGVNSMRTAGRTATITYDLSRHEFAAKFLQLTFTRWN 291 T 13 HAD_SAK_2 pdbhh T Viruses T 3izx 1 A A Q914N6_CPVBM Structural protein VP3 MWHYTSINNDTRVALDPKPNQIRTITKPNTVPQLGTDYLYTFNSQRRSHTLRLLGPFQYFNFSETDRGHPLFRLPLKYPSKAIPADELIDNLHSWMRSVHLLHVRSEDNTLRYNWMLGVYARSTNYTTPVGQLVVNAPAILNYSNPQDAFNSVFVALGIDYIDIPITNSNIFDDSSTPYNVRIWHAPTMTEVNHILALMRKSTLVSTHSSWHWNVLHTFHYRSESDMIDHFAAKILEDWRQKEKLDKGALVEADRVIQRLIPLSSSTYVQRLAAIGALYPNEFTENVLDLSRLSTALLQLSDTYYQHANDQLRRLYRRMYNDSRTLYMTQRHQELLLAQITADPNILLYPYTYIFTTIPTSMNYISNTGQGRIKHSLTVTGATEHDTVADIVLGQTGEDVITISMVEPMSIAVEDMYGYVLDTPTRDIWPADEQIEQKGDAVALYDTKTSRALGMFNNTVRIDDLLSPLLSLVYRTYIKGDTMTMTQGSLDHLTLCAAVDSDITFVGNRMIAPLPEGYIPKPMHRNNSTMKMLSLYVALKKLENFATNSYLMAPDTSIILLGAEREPAVNILRRFNRNVSNVRIIGMGDRAVEPNIRVRVPFPIDKNISADFIICDINSYEDQSFESMFSETISVVTTCASAATRALVKINHPSEYMINSVIERLSQLGGVFYHTALLKTASQNPYSYETYIYITPIAAAVRFPFYSNSAMINRYMTAVADDEMPIIPSIHTVIKGHSNTYSPGLFCGCVDVQSAPLALSQLKSYCSEATTWRVDSDDNLVNIIARIDPARIALEFRTRSNTSAYHEYQRYVPNGLGFKVRKTREFRYMHREVTFIHKLMMYALIREQISLTENMTQVVSIGGRNLADISVVPLNMKYVVIDPATRIETLTQEKKNIEVQSRPFQFDAANMDLENNSIYLFIAVIMNEPNGAATPARMQMDKIRNVATAMLTRTNCVAYISFYEAGIITRLDQSTAHKTIRVEEGRLKVANYVPVDTLVEADVTLMLRDIGITHEIIRPSTPELIDACSNYGIRLGSTGGAVLDVFNHYSPVIKLVRS 1058 T 0.52 DUF5705 pdbpssm T Viruses T 3izx 2 B,C B,C CAPSD_CPVBM Capsid protein VP1 MHSTNNNSNKRNNEEKHKQPEIDSSANNGEGTSGTRAQTVGDTATEAGVRNETEAGASTRRQTDGTGLSGTNAKIATASSARQADVEKPADVTFTIENVDDVGIMQQKKPPTVVQSRTDVFNEQFANEALHPTTKVIFNGLDVNTEVQPLSDDFKQISDPKGYLTYSVKYEDQFTKKDKLRASEADDRIVGPTVNLFKYGAAVVNIDLNRDFFDTATGIDLTKGIPLVQDLLVPIGVTAGAEQSAEYVSGLLMVLFKVMTDNRLVIVGETTTPMSNTLSTVVNNVLRTTYHNNVGVNPALLRDFTQVNWLNRDITNMLQQAGTKYGLGLTETRLDYVRLVKTIVGHALNIDHFAASVLNINLRALMEANVTADDRIKALQAHSMISTQFHGPNQGALRPELAFDHDHIIRCLMLAAANYPRLEGIIVQINTGYVASANVIRPVSEKRYFPENLEQNQSAARLVSAVKARASEADISSIHLAIAREVSPMFNVHELKKIAESFEDPSSIVVVLEFILFALFFPTEFNRIKGDIQNVLLLFFSRWYPVEYGIFVQRGATYTINAAGEFEFSGRNEKWDQALYLSEHFPALFSDVPLAGANTIIAIMRLFTPQGFLRTDDLAIAANFPRASRNPQTYIPYTNQRGTVTNEFASRFRTIVATLANVVNERAVQDDMQKATRSCTKQWLRHLETQFDNIAVAHTDHLSVVYATMSNFMLNFTNNFSGNHATFKPDQYVITSPEGSYKPIIERQGETVDGLTIIDTSIVWPILCQCTYPLVRQSGKGVDAVSIMEEIVYPDPSTTLSQSLSVAQVLSKLTLPDAFINMILSGGDSVVMRTYQTEADDDLDEGIRMTTYDQYLSHIRERLHITNVPDPIYITGASTPDQIAASVQATHVAVVLYQSGVINGPASTYLRENEVLVVMPDYYDVVSRFANANLQMNNNRYHESVLEIADIFDQADFIQTSDAVRQLRALMPTLSTSQIRHAIERIAQITDVDSTDYGKLTLRFLGTLTRSLKMQNAQIRRIRPDGTVLRYDDQIDIEAFRWSRYFLDELQLRRLSVGLRLITNPRIARRFNGVRIMYLTDDDPDPDFVPDVPEGYVAVQYAHRLFSSSLANKRNRVTYTHPPTGMAYPSPTGRPHVHMTINERAGMSKLVADNIIASVIKSNWVVDILDIEYTAEVMTPSEGYTQHVDAESIMTAPKGKLFHLQFMDGLLRPEPSAFDPPASGEDMRLIYPLQPISVARSMRAIVNHNEVDRPRGAVAPSSYEMDTGTLSRNGDLLYSPVANGQVGIPKLEVDHISFSNVVSMMTANIRTGDDMAVERVNPDDVRAINIRNA 1333 T 130 SRP19 pdbhh T Viruses T 3izx 3 D,E D,E C6K2M8_CPVBM Viral structural protein 5 MLQQPTGGYTTLEQFAFTIRNDGTNATPTQFLQLLSWEATENELVKKTIPTPETHLPSARNVPGNVYIEDAITQALFGISAQNVNAHGYFSRLSALALPNTSARLGLDGVIYNSETINIPFYDPAAVANFAATYAKLGNASTPRYRADMIDIYAHVGLELAGTDAERAAGVMPVKRAKFDSWEGSLISLSRDVVNWKILAFLIDLCSLEGEALRAFKTRNRDVFRMMLFIMSTAVAANVVNRKVTKRVDRVLEYIGVNSMRTAGRTATITYDLSRHEFAAKFLQLTFTRWNAASAMIRSMPDMHTPRTSITPAGENALVRHNRYMTENFKGLSPIALAQKKHEMMLHTHEIHSMDIDGSIKNMVERETVNKMNEIDAMNTAPWTEEFAEVEPTTVYERHQIGTDPEQTQLISQDAAVIVHQASSDVDENEYGNSVSELTIDTQSDSVL 448 T 0.022 CI-B14_5a unppssm T Viruses T 3j0h 1 A,B,C,D,E,F A,B,C,D,E,F Q8SDD3_BPDPK PHIKZ029 LRPEDAANPSRLIVAIEIVEDEIPLTIRRLSGFNYPNSVRDIGNAPVPTTDKVDGLKARIILIEDNTSEVGTQRVLPGTLVSDKDGSQSLVYPLFEAPVSFFGKLGDSNGMRVWSTTTADIEEFDEAAMAKFKTRQFRIQLIEKPEVGTSPVIVKTADQQDYLNITFDKGVYSDMYNADLYVGDVLVDSYSDDGVVSGLSPLYSPFSQFYVYHENIDLVRQMIYDTEMRVNPAAAAHTTAPGEIDFLTFLAVDGDPYQGIQVLGPLDGGITLGKDGNIYASGGTDGTTDLEEYAK 295 T 0.21 N-Term_TEN pdb T Viruses T 3j0i 1 A,B,C,D,E,F A,B,C,D,E,F Q8SDD3_BPDPK PHIKZ029 LRPEDAANPSRLIVAIEIVEDEIPLTIRRLSGFNYPNSVRDIGNAPVPTTDKVDGLKARIILIEDNTSEVGTQRVLPGTLVSDKDGSQSLVYPLFEAPVSFFGKLGDSNGMRVWSTTTADIEEFDEAAMAKFKTRQFRIQLIEKPEVGTSPVIVKTADQQDYLNITFDKGVYSDMYNADLYVGDVLVDSYSDDGVVSGLSPLYSPFSQFYVYHENIDLVRQMIYDTEMRVNPAAAAHTTAPGEIDFLTFLAVDGDPYQGIQVLGPLDGGITLGKDGNIYASGGTDGTTDLEEYAK 295 T 0.21 N-Term_TEN pdb T Viruses T 3j17 1 A A Q914N6_CPVBM Structural protein VP3 MWHYTSINNDTRVALDPKPNQIRTITKPNTVPQLGTDYLYTFNSQRRSHTLRLLGPFQYFNFSETDRGHPLFRLPLKYPSKAIPADELIDNLHSWMRSVHLLHVRSEDNTLRYNWMLGVYARSTNYTTPVGQLVVNAPAILNYSNPQDAFNSVFVALGIDYIDIPITNSNIFDDSSTPYNVRIWHAPTMTEVNHILALMRKSTLVSTHSSWHWNVLHTFHYRSESDMIDHFAAXILEDWRQKEKLDKGALVEADRVIQRLIPLSSSTYVQRLAAIGALYPNEFTENVLDLSRLSTALLQLSDTYYQHANDQLRRLYRRMYNDSRTLYMTQRHQELLLAQITADPNILLYPYTYIFTTIPTSMNYISNTGQGRIKHSLTVTGATEHDTVADIVLGQTGEDVITISMVEPMSIAVEDMYGYVLDTPTRDIWPADEQIEQKGDAVALYDTKTSRALGMFNNTVRIDDLLSPLLSLVYRTYIKGDTMTMTQGSLDHLTLCAAVDSDITFVGNRMIAPLPEGYIPKPMHRNNSTMKMLSLYVALKKLENFATNSYLMAPDTSIILLGAEREPAVNILRRFNRNVSNVRIIGMGDRAVEPNIRVRVPFPIDKNISADFIICDINSYEDQSFESMFSETISVVTTCASAATRALVKINHPSEYMINSVIERLSQLGGVFYHTALLKTASQNPYSYETYIYITPIAAAVRFPFYSNSAMINRYMTAVADDEMPIIPSIHTVIKGHSNTYSPGLFCGCVDVQSAPLALSQLKSYCSEATTWRVDSDDNLVNIIARIDPARIALEFRTRSNTSAYHEYQRYVPNGLGFKVRKTREFRYMHREVTFIHKLMMYALIREQISLTENMTQVVSIGGRNLADISVVPLNMKYVVIDPATRIETLTQEKKNIEVQSRPFQFDAANMDLENNSIYLFIAVIMNEPNGAATPARMQMDKIRNVATAMLTRTNCVAYISFYEAGIITRLDQSTAHKTIRVEEGRLKVANYVPVDTLVEADVTLMLRDIGITHEIIRPSTPELIDACSNYGIRLGSTGGAVLDVFNHYSPVIKLVRS 1058 T 0.52 DUF5705 pdbpssm T Viruses T 3j17 2 B,C B,C CAPSD_CPVBM VP1 MHSTNNNSNKRNNEEKHKQPEIDSSANNGEGTSGTRAQTVGDTATEAGVRNETKAGASTRRQTDGTGLSGTNAKIATASSARQTDVEKPADVTFTIENVDDVGIMQQKKPPTVVQSRTDVFNEQFANEALHPMTKVIFNGLDVNTEVQPLSDDFKQISDPKGYLTYSVKYEDQFTKKDKLRASEADDRIVGPTVNLFKYGAAVVNIDLNRDFFDTATGIDLTKGIPLVQDLLVPIGVTAGAEQSAEYVSGLLMVLFKVMTDNRLVIVGETTTPMSNTLSTVVNNVLRTTYHNNVGVNPALLRDFTQVNWLNRDITNMLQQAGTKYGLGLTETRLDYVRLVKTIVGHALNIDHFAASVLNINLRALMEANVTADDRIKALQAHSMISTQFHGPNQGALRPELAFDHDHIIRCLMLAAANYPRLEGIIVQINTGYVASANVIRPVSEKRYFPENLEQNQSAARLVSAVKARASEADISSIHLAIAREVSPMFNVHELKKIAESFEDPSSIVVVLEFILFALFFPTEFNRIKGDIQNVLLLFFSRWYPVEYGIFIQRGATYTINAAGEFEFSGRNEKWDQSLYLSEHFPALFSDVPLAGANTIIAIMRLFTPQGFLRTDDLAIAANFPRASRNPQTYIPYTNQRGTVTNEFASRFRTIVATLANVVNERAVQDDMQKATRSCTKQWLRHLETQFDNIAVAHTDHLSVVYATMSNFMLNFTNNFSGNHATFKPDQYVITSPEGSYKPIIERQGETVDGLTIIDTSIVWPILCQCTYPLVRQSGKGVDAVSIMEEIVYPDPSTTLSQSLSVAQVLSKLTLPDAFINMILSGGDSVVMRTYQTEADDDLDEGIRMTTYDQYLSHIRERLHITNVPDPIYITGASTPDQIAASVQATHVAVVLYQSGVINGSASTYLRENEVLVVMPDYYDVVSRFANANLQMNNNRYHESVLEIADIFDQADFIQTSDAVRQLRALMPTLSTSQIRHAIERIAQITDVDSTDYGKLTLRFLGTLTRSLKMQNAQIRRIRPDGTVLRYDDQIDIEAFRWSRYFLDELRLRRLSVGLRLITNPRIARRFDGVRIMYLTDDDPDPDFVPDVPEGYVAVQYAHRLFSSSLANKRNRVTYTHPPTGMAYPSPTGRPHVHMTINERAGMSKLVADNIIASVIKSNWVVDIHDIEYTAEVMTPSEGYTQHVDAESIMTAPKGKLFHLQFMDGLLRPEPSAFDPPASGEDMRLIYPLQPISVARSMRAIVNHNEVDRPRGAVAPSSYEMDTGTLSRNGDLLYSPVANGQVGIPKLEVDHISFSNVVSMMTANIRTGDDMAVERVNPDDVRAINIRNA 1333 T 0.96 DUF2717 pdbpercent T Viruses T 3j17 3 D,E D,E C6K2M8_CPVBM Structural protein VP5 MLQQPTGGYTTLEQFAFTIRNDGTNATPTQFLQLLSYEATENELVKKTIPTPETHLPSARNVPGNVYIEDAITQALFGISAQNVNAHGYFSRLSALALPNTSARLGLDGVIYNSETINIPFYDPAAVANFAATYAKLGNASTPRYRADMIDIYAHVGLELAGTDAERAAGVMPVKRAKFDSWEGSLISLSRDVVNWKILAFLIDLCSLEGEALRAFKTRNRDVFRMMLFIMSTAVAANVVNRKVTKRVDRVLEYIGVNSMRTAGRTATITYDLSRHEFAAKFLQLTFTRWNAASAMIRSMPDMHTPRTSITPAGENALVRHNRYMTENFKGLSPIALAQKKHEMMLHTHEIHSMDIDGSIKNMVERETVNKMNEIDAMNTAPWTEEFAEVEPTTVYERHQIGTDPEQTQLISQDAAVIVHQASSDVDENEYGNSVSELTIDTQSDSVL 448 T 0.022 CI-B14_5a pdbpssm T Viruses T 3j26 1 A,B,C,D,E,F,G,H,I,J,K,L,M A,B,C,D,E,F,G,H,I,J,K,L,M CAPSD_SPTNK MAJOR CAPSID PROTEIN MSNSAIPLNVVAVQEPRLELNNERTWVVVKGGQQVTYYPFPSTSFSSNQFNFICNPPSAQTVLDRLVFIQVPYDITFTANPSHAGITENLLQPGRDAFRAFPISSITNTLNATINGFPVNIELAQIIHALSRYHTPLKVKNGWMSMQPSFEDNYQSYRDADGANNNPLGVFTSAAGLSELPRGSYTMNVVTNTTTTARITGVLYEQVFLPPFLWDGEQAGGLANLTSLTFNWVLNNNLARIWSHSDITNDVSGNSTIGSMNISFQQPSMYLGFVTPRLNIPIPPRITYPYFKLSRYTTQFQNTLAPNASSTFKSNVVQLDSIPRKLYLFVKQSDNVIYQNLNNQITTPDVFLQINNLNLTWNNQQGILSGASSQNLYDFSVQNGYNKTWSEFNGVTQQFNGVSGQPTKVIGLEGGIVCLELGKDVGLRDDEAEGVIGNFNLQVQMTVTNTNQYVTVTPDMYIVAVYDGTLVISNTSAMASIGVASKEEVLNARITHGVSYNELQRIYG 508 T 0.11 IU_nuc_hydro pdb T Viruses T 3j26 2 N N I0CES9_9VIRU PENTON PROTEIN MSYSHSIKDCQEPDTVYYDILIPFKPNDQGFSPAIFQAQLTQPIVHNPSEYFLSVVRFSIPTQNIPLTIPQIQPYPNTNVNNTIYSVSIGYNGTYSSQNFVQFDPSLTSPNIPAPNAPTVTSPNVEVTPYYYIYDYSTFLQMINTALENAFNEISAPVGADAPFFFYDSNTEKISLIAQAAYYDRTLTTPIEIYCNVNLFTFFDSIKHIGLGYNTPTGRDILFDVRFLGNNYYQDPETAPSYPPEFIQMQQEYPTLSNWNAVKTIQLVSNLLPINKESIPSFRNSNVGIINAQGILADFVPLVTNGPEARISIDFVATGPWRLIDMFGSVPIYMVDLYVYWTDQTGGQYLINIPPGRILTCKLVFIKKSLSKYLVSEK 378 T 0.037 ATP_bind_2 pdb T Viruses T 3j31 1 A Q Q6Q0L4_9VIRU A223 penton base MGEVFKEVKEKFERYKFDVVYVDREYPVSSNNLNVFFEIGERNSFSGLLINEGQAVIDVLLLKKSHEGLSPIPGEGTGIQLSAGQILKFYNVPIAEIIVEYDPSNVSGVSSNVKLKGTIHPLFEVPSQISIENFQPTENYLIYSGFGTSLPQTYTIPANGYLIISITNTSTGNIGQITLTIGSTTMTFNLQTGENKIPVIAGTQITNLTLTSSSAILIYEEVI 223 T 6.7 PSP1 pdbhh T Viruses T 3j31 2 B R A55 membrane protein XXXXXXXXXXXXXXX 15 F F F 3j31 4 R P Q6Q0L3_9VIRU C381 turret protein MSVTTLGQSFPANAKVKYYYKLSEKQDLDAFVNSIFVGSYKLKQISYLLYGNTKIVSAPVVPLGPNASIIIDDELQEGLYLIRIKVYNTNSFSVTVTPFFNNNNTMTYSIGANSEFEIYDIFTKEQGNIYYIQLPPGLAILEFSLERVFEKGNRINIPKIIHTSGNGYISFRLRKGTYAIKMPYSYNNTTSTTFTNFQFGTISTSVATIPLVISSIPANGSGSGTFLVYLKITGDYEDVKFSVTYGGGLGVPFTFGLEVEEINELVENTNFVTQSVTLSGSQVTQSILNVQGSGSHLRLKYASVSGLTTAVTQCQLQATNLNRSTTYSTVWDFIAGGSSTPPSWDIREINSIQLVANGGSSTSSVTITLILVYEQIAGELS 381 T 0.2 CBM_48 pdb T Viruses T 3j3i 1 A A CAPSD_PCVC CP, COAT PROTEIN MAAPVLYGGAGGTATGPGDMRRSLMHEKKQVFAELRREAQALRVAKEARGKMSVWDPSTREGARGYREKVVRFGRQIASLLQYFENMHSPALDIIACDKFLLKYQIYGDIDRDPAFGENTMTAEVPVVWDKCEVEVKLYAGPLQKLMSRAKLVGAAREGIPNRNDVAKSTGWNQDQVQKFPDNRMDSLISLLEQMQTGQSKLTRLVKGFLILLEMAERKEVDFHVGNHIHVTYAIAPVCDSYDLPGRCYVFNSKPTSEAHAAVLLAMCREYPPPQFASHVSVPADAEDVCIVSQGRQIQPGSAVTLNPGLVYSSILTYAMDTSCTDLLQEAQIIACSLQENRYFSRIGLPTVVSLYDLMVPAFIAQNSALEGARLSGDLSKAVGRVHQMLGMVAAKDIISATHMQSRTGFDPSHGIRQYLNSNSRLVTQMASKLTGIGLFDATPQMRIFSEMDTADYADMLHLTIFEGLWLVQDASVCTDNGPISFLVNGEKLLSADRAGYDVLVEELTLANIRIEHHKMPTGAFTTRWVAAKRDSALRLTPRSRTAHRVDMVRECDFNPTMNLKAAGPKARLRGSGVKSRRRVSEVPLAHVFRSPPRRESTTTTDDSPRWLTREGPQLTRRVPIIDEPPAYESGRSSSPVTSSISEGTSQHEEEMGLFDAEELPMQQTVIATEARRRLGRGTLERIQEAALEGQVAQGEVTAEKNRRIEAMLSARDPQFTGREQITKMLSDGGLGVREREEWLELVDKTVGVKGLKEVRSIDGIRRHLEEYGEREGFAVVRTLLSGNSKHVRRINQLIRESNPSAFETEASRMRRLRADWDGDAGSAPVNALHFVGNSPGWKRWLENNNIPSDIQVAGKKRMCSYLAEVLSHGNLKLSDATKLGRLVEGTSLDLFPPQLSSEEFSTCSEATLAWRNAPSSLGVRPFAQEDSRWLVMAATCGGGSFGIGKLKSLCKEFSVPKELRDALRVKYGLFGGKDSLE 982 T 0.27 TT_ORF2 unppercent T Viruses T 3j3o 3 C 0 unknown peptide GSSST 5 T 190 DltD pdbhh F F 3j40 1 A,B,D,I,L,M,N N,M,H,K,I,J,L Q858G5_BPE15 gp10 MKTVNMKTGTDSFVGEDGKPETKDQYPWGLRITLDNESLQRLGLNAKSLPAVGDSVSVMAMANVCSVSTRTTDHGEDNYVELQITDIGLAPQKRDDAKELKDAFYPDGEDD 111 T 0.045 RNA_pol_Rpb1_7 pdb T Viruses T 3j46 4 D n NC100 XAKKIWLALAGLVLAFSASCAQYEDGSSGELERQHTFALHQRSISGDGDSPHSYHSLPEGVKMTKYLQEQKLAVAAVAAQADLELFSTPVWISQAQGIRAG 101 T 0.0023 SecM pdbhh F T 3j47 6 F R RPN7_YEAST 26S proteasome regulatory subunit RPN7 NAQYHLLVKQGDGLLTKLQKYGAAVR 26 T 0.62 Paf67 unphh F Eukaryota T 3j47 8 H T RPN12_YEAST NUCLEAR INTEGRITY PROTEIN 1 KTNIIEKAMDYAISIEN 17 T 1.9 DUF4576 pdbhh F Eukaryota T 3j4k 2 F,G F,G tropomyosin XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 136 F F F 3j4u 2 H,I,J,K,L,M,N H,I,J,K,L,M,N Q775C8_BPBPP BBP16 MIIDKLLQVSDGQAVTASAASTDVIDFGQANPNTGMDDRSKMVITVDESADAAGAATVTFSVQDSADNATFADVAATGAIGKANLAAGKQVVIPMPTKLRRYCRVYYTVATGPLTAGKFSAQVVTGIQQNVAYPDSPRIA 140 T 0.019 DUF6385 pdbpercent T Viruses T 3j5m 2 B,F,J B,F,J BG505 SOSIP gp41 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 64 F F F 3j6q 1 A,B,C,D,E A,B,C,D,E Q914N6_CPVBM Structural protein VP3 MWHYTSINNDTRVALDPKPNQIRTITKPNTVPQLGTDYLYTFNSQRRSHTLRLLGPFQYFNFSETDRGHPLFRLPLKYPSKAIPADELIDNLHSWMRSVHLLHVRSEDNTLRYNWMLGVYARSTNYTTPVGQLVVNAPAILNYSNPQDAFNSVFVALGIDYIDIPITNSNIFDDSSTPYNVRIWHAPTMTEVNHILALMRKSTLVSTHSSWHWNVLHTFHYRSESDMIDHFAAXILEDWRQKEKLDKGALVEADRVIQRLIPLSSSTYVQRLAAIGALYPNEFTENVLDLSRLSTALLQLSDTYYQHANDQLRRLYRRMYNDSRTLYMTQRHQELLLAQITADPNILLYPYTYIFTTIPTSMNYISNTGQGRIKHSLTVTGATEHDTVADIVLGQTGEDVITISMVEPMSIAVEDMYGYVLDTPTRDIWPADEQIEQKGDAVALYDTKTSRALGMFNNTVRIDDLLSPLLSLVYRTYIKGDTMTMTQGSLDHLTLCAAVDSDITFVGNRMIAPLPEGYIPKPMHRNNSTMKMLSLYVALKKLENFATNSYLMAPDTSIILLGAEREPAVNILRRFNRNVSNVRIIGMGDRAVEPNIRVRVPFPIDKNISADFIICDINSYEDQSFESMFSETISVVTTCASAATRALVKINHPSEYMINSVIERLSQLGGVFYHTALLKTASQNPYSYETYIYITPIAAAVRFPFYSNSAMINRYMTAVADDEMPIIPSIHTVIKGHSNTYSPGLFCGCVDVQSAPLALSQLKSYCSEATTWRVDSDDNLVNIIARIDPARIALEFRTRSNTSAYHEYQRYVPNGLGFKVRKTREFRYMHREVTFIHKLMMYALIREQISLTENMTQVVSIGGRNLADISVVPLNMKYVVIDPATRIETLTQEKKNIEVQSRPFQFDAANMDLENNSIYLFIAVIMNEPNGAATPARMQMDKIRNVATAMLTRTNCVAYISFYEAGIITRLDQSTAHKTIRVEEGRLKVANYVPVDTLVEADVTLMLRDIGITHEIIRPSTPELIDACSNYGIRLGSTGGAVLDVFNHYSPVIKLVRS 1058 T 0.52 DUF5705 pdbpssm T Viruses T 3j70 5 E,J,O E,Q,V TRANSMEMBRANE PROTEIN GP41, TM, GLYCOPROTEIN 41, GP41 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 140 F F F 3j7q 48 VA 3 Sec61 beta subunit XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 36 F F F 3j7r 49 WA 3 Sec61 beta subunit XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 36 F F F 3j7y 51 YA t unknown protein XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 127 F F F 3j89 1 A,B,C,D,E,F,G,H,I,J,K,L,M,N,O,P,Q,R,S A,B,C,D,E,F,G,H,I,J,K,L,M,N,O,P,Q,R,S synthetic peptide QARILEADAEILRAYARILEAHAEILRAQ 29 T 3.6 DUF2563 pdbhh F T 3j8a 1 A,B F,G tropomyosin alpha-1 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 135 F F F 3j92 46 UA,VA x,y E3 UBIQUITIN-PROTEIN LIGASE XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 218 F F F 3j9b 3 C,F C,J RNA-DIRECTED RNA POLYMERASE SUBUNIT P3 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 66 F F F 3j9m 53 AB t Unknown protein/protein extension XXXXXXXXXXXXXXXXXXXXXXXXXXXX 28 F F F 3j9w 23 W AZ MIFM_BACSU MifM MTMFVESINDVLFLVDFFTIILPALTAIGIAFLLRECRAGEQWKSKRTDEHQTVFHINRTDFLIIIYHRITTWIRKVFRMNSPVNDEEDAGSLLL 95 T 0.052 PqiA pdb F Bacteria T 3jaj 49 WA 2 Nascent chain LLLLLLLLLLLLKVGPVPVLVMSLLFIASMV 31 T 0.00017 Sec61_beta pdb F T 3jap 46 TA r EIF3B_YEAST eIF3b LHQRELLKQWTEYREKIGQEMEKSMNFKIFD 31 T 0.44 Phage_TAC_8 unp F Eukaryota T 3jau 1 A A Q5DW45_9ENTO Capsid protein VP1 GYPTFGEHKQEKDLEYG 17 T 9.9E-08 Waikav_capsid_1 unphh T Viruses T 3jay 1 A A Q914N6_CPVBM TP MWHYTSINNDTRVALDPKPNQIRTITKPNTVPQLGTDYLYTFNSQRRSHTLRLLGPFQYFNFSETDRGHPLFRLPLKYPSKAIPADELIDNLHSWMRSVHLLHVRSEDNTLRYNWMLGVYARSTNYTTPVGQLVVNAPAILNYSNPQDAFNSVFVALGIDYIDIPITNSNIFDDSSTPYNVRIWHAPTMTEVNHILALMRKSTLVSTHSSWHWNVLHTFHYRSESDMIDHFAAKILEDWRQKEKLDKGALVEADRVIQRLIPLSSSTYVQRLAAIGALYPNEFTENVLDLSRLSTALLQLSDTYYQHANDQLRRLYRRMYNDSRTLYMTQRHQELLLAQITADPNILLYPYTYIFTTIPTSMNYISNTGQGRIKHSLTVTGATEHDTVADIVLGQTGEDVITISMVEPMSIAVEDMYGYVLDTPTRDIWPADEQIEQKGDAVALYDTKTSRALGMFNNTVRIDDLLSPLLSLVYRTYIKGDTMTMTQGSLDHLTLCAAVDSDITFVGNRMIAPLPEGYIPKPMHRNNSTMKMLSLYVALKKLENFATNSYLMAPDTSIILLGAEREPAVNILRRFNRNVSNVRIIGMGDRAVEPNIRVRVPFPIDKNISADFIICDINSYEDQSFESMFSETISVVTTCASAATRALVKINHPSEYMINSVIERLSQLGGVFYHTALLKTASQNPYSYETYIYITPIAAAVRFPFYSNSAMINRYMTAVADDEMPIIPSIHTVIKGHSNTYSPGLFCGCVDVQSAPLALSQLKSYCSEATTWRVDSDDNLVNIIARIDPARIALEFRTRSNTSAYHEYQRYVPNGLGFKVRKTREFRYMHREVTFIHKLMMYALIREQISLTENMTQVVSIGGRNLADISVVPLNMKYVVIDPATRIETLTQEKKNIEVQSRPFQFDAANMDLENNSIYLFIAVIMNEPNGAATPARMQMDKIRNVATAMLTRTNCVAYISFYEAGIITRLDQSTAHKTIRVEEGRLKVANYVPVDTLVEADVTLMLRDIGITHEIIRPSTPELIDACSNYGIRLGSTGGAVLDVFNHYSPVIKLVRS 1058 T 0.52 DUF5705 pdbpssm T Viruses T 3jay 2 B,C B,C CAPSD_CPVBM CSP-A AND CSP-B MHSTNNNSNKRNNEEKHKQPEIDSSANNGEGTSGTRAQTVGDTATEAGVRNETEAGASTRRQTDGTGLSGTNAKIATASSARQADVEKPADVTFTIENVDDVGIMQQKKPPTVVQSRTDVFNEQFANEALHPTTKVIFNGLDVNTEVQPLSDDFKQISDPKGYLTYSVKYEDQFTKKDKLRASEADDRIVGPTVNLFKYGAAVVNIDLNRDFFDTATGIDLTKGIPLVQDLLVPIGVTAGAEQSAEYVSGLLMVLFKVMTDNRLVIVGETTTPMSNTLSTVVNNVLRTTYHNNVGVNPALLRDFTQVNWLNRDITNMLQQAGTKYGLGLTETRLDYVRLVKTIVGHALNIDHFAASVLNINLRALMEANVTADDRIKALQAHSMISTQFHGPNQGALRPELAFDHDHIIRCLMLAAANYPRLEGIIVQINTGYVASANVIRPVSEKRYFPENLEQNQSAARLVSAVKARASEADISSIHLAIAREVSPMFNVHELKKIAESFEDPSSIVVVLEFILFALFFPTEFNRIKGDIQNVLLLFFSRWYPVEYGIFVQRGATYTINAAGEFEFSGRNEKWDQALYLSEHFPALFSDVPLAGANTIIAIMRLFTPQGFLRTDDLAIAANFPRASRNPQTYIPYTNQRGTVTNEFASRFRTIVATLANVVNERAVQDDMQKATRSCTKQWLRHLETQFDNIAVAHTDHLSVVYATMSNFMLNFTNNFSGNHATFKPDQYVITSPEGSYKPIIERQGETVDGLTIIDTSIVWPILCQCTYPLVRQSGKGVDAVSIMEEIVYPDPSTTLSQSLSVAQVLSKLTLPDAFINMILSGGDSVVMRTYQTEADDDLDEGIRMTTYDQYLSHIRERLHITNVPDPIYITGASTPDQIAASVQATHVAVVLYQSGVINGPASTYLRENEVLVVMPDYYDVVSRFANANLQMNNNRYHESVLEIADIFDQADFIQTSDAVRQLRALMPTLSTSQIRHAIERIAQITDVDSTDYGKLTLRFLGTLTRSLKMQNAQIRRIRPDGTVLRYDDQIDIEAFRWSRYFLDELQLRRLSVGLRLITNPRIARRFNGVRIMYLTDDDPDPDFVPDVPEGYVAVQYAHRLFSSSLANKRNRVTYTHPPTGMAYPSPTGRPHVHMTINERAGMSKLVADNIIASVIKSNWVVDILDIEYTAEVMTPSEGYTQHVDAESIMTAPKGKLFHLQFMDGLLRPEPSAFDPPASGEDMRLIYPLQPISVARSMRAIVNHNEVDRPRGAVAPSSYEMDTGTLSRNGDLLYSPVANGQVGIPKLEVDHISFSNVVSMMTANIRTGDDMAVERVNPDDVRAINIRNA 1333 T 130 SRP19 pdbhh T Viruses T 3jay 3 D,E D,E C6K2M8_CPVBM LPP-3 AND LPP-5 MLQQPTGGYTTLEQFAFTIRNDGTNATPTQFLQLLSYEATENELVKKTIPTPETHLPSARNVPGNVYIEDAITQALFGISAQNVNAHGYFSRLSALALPNTSARLGLDGVIYNSETINIPFYDPAAVANFAATYAKLGNASTPRYRADMIDIYAHVGLELAGTDAERAAGVMPVKRAKFDSWEGSLISLSRDVVNWKILAFLIDLCSLEGEALRAFKTRNRDVFRMMLFIMSTAVAANVVNRKVTKRVDRVLEYIGVNSMRTAGRTATITYDLSRHEFAAKFLQLTFTRWNAASAMIRSMPDMHTPRTSITPAGENALVRHNRYMTENFKGLSPIALAQKKHEMMLHTHEIHSMDIDGSIKNMVERETVNKMNEIDAMNTAPWTEEFAEVEPTTVYERHQIGTDPEQTQLISQDAAVIVHQASSDVDENEYGNSVSELTIDTQSDSVL 448 T 0.022 CI-B14_5a pdbpssm T Viruses T 3jaz 1 A A Q914N6_CPVBM TP MWHYTSINNDTRVALDPKPNQIRTITKPNTVPQLGTDYLYTFNSQRRSHTLRLLGPFQYFNFSETDRGHPLFRLPLKYPSKAIPADELIDNLHSWMRSVHLLHVRSEDNTLRYNWMLGVYARSTNYTTPVGQLVVNAPAILNYSNPQDAFNSVFVALGIDYIDIPITNSNIFDDSSTPYNVRIWHAPTMTEVNHILALMRKSTLVSTHSSWHWNVLHTFHYRSESDMIDHFAAKILEDWRQKEKLDKGALVEADRVIQRLIPLSSSTYVQRLAAIGALYPNEFTENVLDLSRLSTALLQLSDTYYQHANDQLRRLYRRMYNDSRTLYMTQRHQELLLAQITADPNILLYPYTYIFTTIPTSMNYISNTGQGRIKHSLTVTGATEHDTVADIVLGQTGEDVITISMVEPMSIAVEDMYGYVLDTPTRDIWPADEQIEQKGDAVALYDTKTSRALGMFNNTVRIDDLLSPLLSLVYRTYIKGDTMTMTQGSLDHLTLCAAVDSDITFVGNRMIAPLPEGYIPKPMHRNNSTMKMLSLYVALKKLENFATNSYLMAPDTSIILLGAEREPAVNILRRFNRNVSNVRIIGMGDRAVEPNIRVRVPFPIDKNISADFIICDINSYEDQSFESMFSETISVVTTCASAATRALVKINHPSEYMINSVIERLSQLGGVFYHTALLKTASQNPYSYETYIYITPIAAAVRFPFYSNSAMINRYMTAVADDEMPIIPSIHTVIKGHSNTYSPGLFCGCVDVQSAPLALSQLKSYCSEATTWRVDSDDNLVNIIARIDPARIALEFRTRSNTSAYHEYQRYVPNGLGFKVRKTREFRYMHREVTFIHKLMMYALIREQISLTENMTQVVSIGGRNLADISVVPLNMKYVVIDPATRIETLTQEKKNIEVQSRPFQFDAANMDLENNSIYLFIAVIMNEPNGAATPARMQMDKIRNVATAMLTRTNCVAYISFYEAGIITRLDQSTAHKTIRVEEGRLKVANYVPVDTLVEADVTLMLRDIGITHEIIRPSTPELIDACSNYGIRLGSTGGAVLDVFNHYSPVIKLVRS 1058 T 0.52 DUF5705 pdbpssm T Viruses T 3jaz 2 B,C B,C CAPSD_CPVBM CSP-A AND CSP-B MHSTNNNSNKRNNEEKHKQPEIDSSANNGEGTSGTRAQTVGDTATEAGVRNETEAGASTRRQTDGTGLSGTNAKIATASSARQADVEKPADVTFTIENVDDVGIMQQKKPPTVVQSRTDVFNEQFANEALHPTTKVIFNGLDVNTEVQPLSDDFKQISDPKGYLTYSVKYEDQFTKKDKLRASEADDRIVGPTVNLFKYGAAVVNIDLNRDFFDTATGIDLTKGIPLVQDLLVPIGVTAGAEQSAEYVSGLLMVLFKVMTDNRLVIVGETTTPMSNTLSTVVNNVLRTTYHNNVGVNPALLRDFTQVNWLNRDITNMLQQAGTKYGLGLTETRLDYVRLVKTIVGHALNIDHFAASVLNINLRALMEANVTADDRIKALQAHSMISTQFHGPNQGALRPELAFDHDHIIRCLMLAAANYPRLEGIIVQINTGYVASANVIRPVSEKRYFPENLEQNQSAARLVSAVKARASEADISSIHLAIAREVSPMFNVHELKKIAESFEDPSSIVVVLEFILFALFFPTEFNRIKGDIQNVLLLFFSRWYPVEYGIFVQRGATYTINAAGEFEFSGRNEKWDQALYLSEHFPALFSDVPLAGANTIIAIMRLFTPQGFLRTDDLAIAANFPRASRNPQTYIPYTNQRGTVTNEFASRFRTIVATLANVVNERAVQDDMQKATRSCTKQWLRHLETQFDNIAVAHTDHLSVVYATMSNFMLNFTNNFSGNHATFKPDQYVITSPEGSYKPIIERQGETVDGLTIIDTSIVWPILCQCTYPLVRQSGKGVDAVSIMEEIVYPDPSTTLSQSLSVAQVLSKLTLPDAFINMILSGGDSVVMRTYQTEADDDLDEGIRMTTYDQYLSHIRERLHITNVPDPIYITGASTPDQIAASVQATHVAVVLYQSGVINGPASTYLRENEVLVVMPDYYDVVSRFANANLQMNNNRYHESVLEIADIFDQADFIQTSDAVRQLRALMPTLSTSQIRHAIERIAQITDVDSTDYGKLTLRFLGTLTRSLKMQNAQIRRIRPDGTVLRYDDQIDIEAFRWSRYFLDELQLRRLSVGLRLITNPRIARRFNGVRIMYLTDDDPDPDFVPDVPEGYVAVQYAHRLFSSSLANKRNRVTYTHPPTGMAYPSPTGRPHVHMTINERAGMSKLVADNIIASVIKSNWVVDILDIEYTAEVMTPSEGYTQHVDAESIMTAPKGKLFHLQFMDGLLRPEPSAFDPPASGEDMRLIYPLQPISVARSMRAIVNHNEVDRPRGAVAPSSYEMDTGTLSRNGDLLYSPVANGQVGIPKLEVDHISFSNVVSMMTANIRTGDDMAVERVNPDDVRAINIRNA 1333 T 130 SRP19 pdbhh T Viruses T 3jaz 3 D,E D,E C6K2M8_CPVBM LPP-3 AND LPP-5 MLQQPTGGYTTLEQFAFTIRNDGTNATPTQFLQLLSYEATENELVKKTIPTPETHLPSARNVPGNVYIEDAITQALFGISAQNVNAHGYFSRLSALALPNTSARLGLDGVIYNSETINIPFYDPAAVANFAATYAKLGNASTPRYRADMIDIYAHVGLELAGTDAERAAGVMPVKRAKFDSWEGSLISLSRDVVNWKILAFLIDLCSLEGEALRAFKTRNRDVFRMMLFIMSTAVAANVVNRKVTKRVDRVLEYIGVNSMRTAGRTATITYDLSRHEFAAKFLQLTFTRWNAASAMIRSMPDMHTPRTSITPAGENALVRHNRYMTENFKGLSPIALAQKKHEMMLHTHEIHSMDIDGSIKNMVERETVNKMNEIDAMNTAPWTEEFAEVEPTTVYERHQIGTDPEQTQLISQDAAVIVHQASSDVDENEYGNSVSELTIDTQSDSVL 448 T 0.022 CI-B14_5a pdbpssm T Viruses T 3jb0 1 A A Q914N6_CPVBM TP MWHYTSINNDTRVALDPKPNQIRTITKPNTVPQLGTDYLYTFNSQRRSHTLRLLGPFQYFNFSETDRGHPLFRLPLKYPSKAIPADELIDNLHSWMRSVHLLHVRSEDNTLRYNWMLGVYARSTNYTTPVGQLVVNAPAILNYSNPQDAFNSVFVALGIDYIDIPITNSNIFDDSSTPYNVRIWHAPTMTEVNHILALMRKSTLVSTHSSWHWNVLHTFHYRSESDMIDHFAAKILEDWRQKEKLDKGALVEADRVIQRLIPLSSSTYVQRLAAIGALYPNEFTENVLDLSRLSTALLQLSDTYYQHANDQLRRLYRRMYNDSRTLYMTQRHQELLLAQITADPNILLYPYTYIFTTIPTSMNYISNTGQGRIKHSLTVTGATEHDTVADIVLGQTGEDVITISMVEPMSIAVEDMYGYVLDTPTRDIWPADEQIEQKGDAVALYDTKTSRALGMFNNTVRIDDLLSPLLSLVYRTYIKGDTMTMTQGSLDHLTLCAAVDSDITFVGNRMIAPLPEGYIPKPMHRNNSTMKMLSLYVALKKLENFATNSYLMAPDTSIILLGAEREPAVNILRRFNRNVSNVRIIGMGDRAVEPNIRVRVPFPIDKNISADFIICDINSYEDQSFESMFSETISVVTTCASAATRALVKINHPSEYMINSVIERLSQLGGVFYHTALLKTASQNPYSYETYIYITPIAAAVRFPFYSNSAMINRYMTAVADDEMPIIPSIHTVIKGHSNTYSPGLFCGCVDVQSAPLALSQLKSYCSEATTWRVDSDDNLVNIIARIDPARIALEFRTRSNTSAYHEYQRYVPNGLGFKVRKTREFRYMHREVTFIHKLMMYALIREQISLTENMTQVVSIGGRNLADISVVPLNMKYVVIDPATRIETLTQEKKNIEVQSRPFQFDAANMDLENNSIYLFIAVIMNEPNGAATPARMQMDKIRNVATAMLTRTNCVAYISFYEAGIITRLDQSTAHKTIRVEEGRLKVANYVPVDTLVEADVTLMLRDIGITHEIIRPSTPELIDACSNYGIRLGSTGGAVLDVFNHYSPVIKLVRS 1058 T 0.52 DUF5705 pdbpssm T Viruses T 3jb0 2 B,C B,C CAPSD_CPVBM CSP-A AND CSP-B MHSTNNNSNKRNNEEKHKQPEIDSSANNGEGTSGTRAQTVGDTATEAGVRNETEAGASTRRQTDGTGLSGTNAKIATASSARQADVEKPADVTFTIENVDDVGIMQQKKPPTVVQSRTDVFNEQFANEALHPTTKVIFNGLDVNTEVQPLSDDFKQISDPKGYLTYSVKYEDQFTKKDKLRASEADDRIVGPTVNLFKYGAAVVNIDLNRDFFDTATGIDLTKGIPLVQDLLVPIGVTAGAEQSAEYVSGLLMVLFKVMTDNRLVIVGETTTPMSNTLSTVVNNVLRTTYHNNVGVNPALLRDFTQVNWLNRDITNMLQQAGTKYGLGLTETRLDYVRLVKTIVGHALNIDHFAASVLNINLRALMEANVTADDRIKALQAHSMISTQFHGPNQGALRPELAFDHDHIIRCLMLAAANYPRLEGIIVQINTGYVASANVIRPVSEKRYFPENLEQNQSAARLVSAVKARASEADISSIHLAIAREVSPMFNVHELKKIAESFEDPSSIVVVLEFILFALFFPTEFNRIKGDIQNVLLLFFSRWYPVEYGIFVQRGATYTINAAGEFEFSGRNEKWDQALYLSEHFPALFSDVPLAGANTIIAIMRLFTPQGFLRTDDLAIAANFPRASRNPQTYIPYTNQRGTVTNEFASRFRTIVATLANVVNERAVQDDMQKATRSCTKQWLRHLETQFDNIAVAHTDHLSVVYATMSNFMLNFTNNFSGNHATFKPDQYVITSPEGSYKPIIERQGETVDGLTIIDTSIVWPILCQCTYPLVRQSGKGVDAVSIMEEIVYPDPSTTLSQSLSVAQVLSKLTLPDAFINMILSGGDSVVMRTYQTEADDDLDEGIRMTTYDQYLSHIRERLHITNVPDPIYITGASTPDQIAASVQATHVAVVLYQSGVINGPASTYLRENEVLVVMPDYYDVVSRFANANLQMNNNRYHESVLEIADIFDQADFIQTSDAVRQLRALMPTLSTSQIRHAIERIAQITDVDSTDYGKLTLRFLGTLTRSLKMQNAQIRRIRPDGTVLRYDDQIDIEAFRWSRYFLDELQLRRLSVGLRLITNPRIARRFNGVRIMYLTDDDPDPDFVPDVPEGYVAVQYAHRLFSSSLANKRNRVTYTHPPTGMAYPSPTGRPHVHMTINERAGMSKLVADNIIASVIKSNWVVDILDIEYTAEVMTPSEGYTQHVDAESIMTAPKGKLFHLQFMDGLLRPEPSAFDPPASGEDMRLIYPLQPISVARSMRAIVNHNEVDRPRGAVAPSSYEMDTGTLSRNGDLLYSPVANGQVGIPKLEVDHISFSNVVSMMTANIRTGDDMAVERVNPDDVRAINIRNA 1333 T 130 SRP19 pdbhh T Viruses T 3jb0 3 D,E D,E C6K2M8_CPVBM LPP-3 AND LPP-5 MLQQPTGGYTTLEQFAFTIRNDGTNATPTQFLQLLSYEATENELVKKTIPTPETHLPSARNVPGNVYIEDAITQALFGISAQNVNAHGYFSRLSALALPNTSARLGLDGVIYNSETINIPFYDPAAVANFAATYAKLGNASTPRYRADMIDIYAHVGLELAGTDAERAAGVMPVKRAKFDSWEGSLISLSRDVVNWKILAFLIDLCSLEGEALRAFKTRNRDVFRMMLFIMSTAVAANVVNRKVTKRVDRVLEYIGVNSMRTAGRTATITYDLSRHEFAAKFLQLTFTRWNAASAMIRSMPDMHTPRTSITPAGENALVRHNRYMTENFKGLSPIALAQKKHEMMLHTHEIHSMDIDGSIKNMVERETVNKMNEIDAMNTAPWTEEFAEVEPTTVYERHQIGTDPEQTQLISQDAAVIVHQASSDVDENEYGNSVSELTIDTQSDSVL 448 T 0.022 CI-B14_5a pdbpssm T Viruses T 3jb1 1 A A Q914N6_CPVBM TP MWHYTSINNDTRVALDPKPNQIRTITKPNTVPQLGTDYLYTFNSQRRSHTLRLLGPFQYFNFSETDRGHPLFRLPLKYPSKAIPADELIDNLHSWMRSVHLLHVRSEDNTLRYNWMLGVYARSTNYTTPVGQLVVNAPAILNYSNPQDAFNSVFVALGIDYIDIPITNSNIFDDSSTPYNVRIWHAPTMTEVNHILALMRKSTLVSTHSSWHWNVLHTFHYRSESDMIDHFAAKILEDWRQKEKLDKGALVEADRVIQRLIPLSSSTYVQRLAAIGALYPNEFTENVLDLSRLSTALLQLSDTYYQHANDQLRRLYRRMYNDSRTLYMTQRHQELLLAQITADPNILLYPYTYIFTTIPTSMNYISNTGQGRIKHSLTVTGATEHDTVADIVLGQTGEDVITISMVEPMSIAVEDMYGYVLDTPTRDIWPADEQIEQKGDAVALYDTKTSRALGMFNNTVRIDDLLSPLLSLVYRTYIKGDTMTMTQGSLDHLTLCAAVDSDITFVGNRMIAPLPEGYIPKPMHRNNSTMKMLSLYVALKKLENFATNSYLMAPDTSIILLGAEREPAVNILRRFNRNVSNVRIIGMGDRAVEPNIRVRVPFPIDKNISADFIICDINSYEDQSFESMFSETISVVTTCASAATRALVKINHPSEYMINSVIERLSQLGGVFYHTALLKTASQNPYSYETYIYITPIAAAVRFPFYSNSAMINRYMTAVADDEMPIIPSIHTVIKGHSNTYSPGLFCGCVDVQSAPLALSQLKSYCSEATTWRVDSDDNLVNIIARIDPARIALEFRTRSNTSAYHEYQRYVPNGLGFKVRKTREFRYMHREVTFIHKLMMYALIREQISLTENMTQVVSIGGRNLADISVVPLNMKYVVIDPATRIETLTQEKKNIEVQSRPFQFDAANMDLENNSIYLFIAVIMNEPNGAATPARMQMDKIRNVATAMLTRTNCVAYISFYEAGIITRLDQSTAHKTIRVEEGRLKVANYVPVDTLVEADVTLMLRDIGITHEIIRPSTPELIDACSNYGIRLGSTGGAVLDVFNHYSPVIKLVRS 1058 T 0.52 DUF5705 pdbpssm T Viruses T 3jb1 2 B,C B,C CAPSD_CPVBM CSP-A AND CSP-B MHSTNNNSNKRNNEEKHKQPEIDSSANNGEGTSGTRAQTVGDTATEAGVRNETEAGASTRRQTDGTGLSGTNAKIATASSARQADVEKPADVTFTIENVDDVGIMQQKKPPTVVQSRTDVFNEQFANEALHPTTKVIFNGLDVNTEVQPLSDDFKQISDPKGYLTYSVKYEDQFTKKDKLRASEADDRIVGPTVNLFKYGAAVVNIDLNRDFFDTATGIDLTKGIPLVQDLLVPIGVTAGAEQSAEYVSGLLMVLFKVMTDNRLVIVGETTTPMSNTLSTVVNNVLRTTYHNNVGVNPALLRDFTQVNWLNRDITNMLQQAGTKYGLGLTETRLDYVRLVKTIVGHALNIDHFAASVLNINLRALMEANVTADDRIKALQAHSMISTQFHGPNQGALRPELAFDHDHIIRCLMLAAANYPRLEGIIVQINTGYVASANVIRPVSEKRYFPENLEQNQSAARLVSAVKARASEADISSIHLAIAREVSPMFNVHELKKIAESFEDPSSIVVVLEFILFALFFPTEFNRIKGDIQNVLLLFFSRWYPVEYGIFVQRGATYTINAAGEFEFSGRNEKWDQALYLSEHFPALFSDVPLAGANTIIAIMRLFTPQGFLRTDDLAIAANFPRASRNPQTYIPYTNQRGTVTNEFASRFRTIVATLANVVNERAVQDDMQKATRSCTKQWLRHLETQFDNIAVAHTDHLSVVYATMSNFMLNFTNNFSGNHATFKPDQYVITSPEGSYKPIIERQGETVDGLTIIDTSIVWPILCQCTYPLVRQSGKGVDAVSIMEEIVYPDPSTTLSQSLSVAQVLSKLTLPDAFINMILSGGDSVVMRTYQTEADDDLDEGIRMTTYDQYLSHIRERLHITNVPDPIYITGASTPDQIAASVQATHVAVVLYQSGVINGPASTYLRENEVLVVMPDYYDVVSRFANANLQMNNNRYHESVLEIADIFDQADFIQTSDAVRQLRALMPTLSTSQIRHAIERIAQITDVDSTDYGKLTLRFLGTLTRSLKMQNAQIRRIRPDGTVLRYDDQIDIEAFRWSRYFLDELQLRRLSVGLRLITNPRIARRFNGVRIMYLTDDDPDPDFVPDVPEGYVAVQYAHRLFSSSLANKRNRVTYTHPPTGMAYPSPTGRPHVHMTINERAGMSKLVADNIIASVIKSNWVVDILDIEYTAEVMTPSEGYTQHVDAESIMTAPKGKLFHLQFMDGLLRPEPSAFDPPASGEDMRLIYPLQPISVARSMRAIVNHNEVDRPRGAVAPSSYEMDTGTLSRNGDLLYSPVANGQVGIPKLEVDHISFSNVVSMMTANIRTGDDMAVERVNPDDVRAINIRNA 1333 T 130 SRP19 pdbhh T Viruses T 3jb1 3 D,E D,E C6K2M8_CPVBM LPP-3 AND LPP-5 MLQQPTGGYTTLEQFAFTIRNDGTNATPTQFLQLLSYEATENELVKKTIPTPETHLPSARNVPGNVYIEDAITQALFGISAQNVNAHGYFSRLSALALPNTSARLGLDGVIYNSETINIPFYDPAAVANFAATYAKLGNASTPRYRADMIDIYAHVGLELAGTDAERAAGVMPVKRAKFDSWEGSLISLSRDVVNWKILAFLIDLCSLEGEALRAFKTRNRDVFRMMLFIMSTAVAANVVNRKVTKRVDRVLEYIGVNSMRTAGRTATITYDLSRHEFAAKFLQLTFTRWNAASAMIRSMPDMHTPRTSITPAGENALVRHNRYMTENFKGLSPIALAQKKHEMMLHTHEIHSMDIDGSIKNMVERETVNKMNEIDAMNTAPWTEEFAEVEPTTVYERHQIGTDPEQTQLISQDAAVIVHQASSDVDENEYGNSVSELTIDTQSDSVL 448 T 0.022 CI-B14_5a pdbpssm T Viruses T 3jb2 1 A A Q914N6_CPVBM TP MWHYTSINNDTRVALDPKPNQIRTITKPNTVPQLGTDYLYTFNSQRRSHTLRLLGPFQYFNFSETDRGHPLFRLPLKYPSKAIPADELIDNLHSWMRSVHLLHVRSEDNTLRYNWMLGVYARSTNYTTPVGQLVVNAPAILNYSNPQDAFNSVFVALGIDYIDIPITNSNIFDDSSTPYNVRIWHAPTMTEVNHILALMRKSTLVSTHSSWHWNVLHTFHYRSESDMIDHFAAKILEDWRQKEKLDKGALVEADRVIQRLIPLSSSTYVQRLAAIGALYPNEFTENVLDLSRLSTALLQLSDTYYQHANDQLRRLYRRMYNDSRTLYMTQRHQELLLAQITADPNILLYPYTYIFTTIPTSMNYISNTGQGRIKHSLTVTGATEHDTVADIVLGQTGEDVITISMVEPMSIAVEDMYGYVLDTPTRDIWPADEQIEQKGDAVALYDTKTSRALGMFNNTVRIDDLLSPLLSLVYRTYIKGDTMTMTQGSLDHLTLCAAVDSDITFVGNRMIAPLPEGYIPKPMHRNNSTMKMLSLYVALKKLENFATNSYLMAPDTSIILLGAEREPAVNILRRFNRNVSNVRIIGMGDRAVEPNIRVRVPFPIDKNISADFIICDINSYEDQSFESMFSETISVVTTCASAATRALVKINHPSEYMINSVIERLSQLGGVFYHTALLKTASQNPYSYETYIYITPIAAAVRFPFYSNSAMINRYMTAVADDEMPIIPSIHTVIKGHSNTYSPGLFCGCVDVQSAPLALSQLKSYCSEATTWRVDSDDNLVNIIARIDPARIALEFRTRSNTSAYHEYQRYVPNGLGFKVRKTREFRYMHREVTFIHKLMMYALIREQISLTENMTQVVSIGGRNLADISVVPLNMKYVVIDPATRIETLTQEKKNIEVQSRPFQFDAANMDLENNSIYLFIAVIMNEPNGAATPARMQMDKIRNVATAMLTRTNCVAYISFYEAGIITRLDQSTAHKTIRVEEGRLKVANYVPVDTLVEADVTLMLRDIGITHEIIRPSTPELIDACSNYGIRLGSTGGAVLDVFNHYSPVIKLVRS 1058 T 0.52 DUF5705 pdbpssm T Viruses T 3jb2 2 B,C B,C CAPSD_CPVBM CSP-A AND CSP-B MHSTNNNSNKRNNEEKHKQPEIDSSANNGEGTSGTRAQTVGDTATEAGVRNETEAGASTRRQTDGTGLSGTNAKIATASSARQADVEKPADVTFTIENVDDVGIMQQKKPPTVVQSRTDVFNEQFANEALHPTTKVIFNGLDVNTEVQPLSDDFKQISDPKGYLTYSVKYEDQFTKKDKLRASEADDRIVGPTVNLFKYGAAVVNIDLNRDFFDTATGIDLTKGIPLVQDLLVPIGVTAGAEQSAEYVSGLLMVLFKVMTDNRLVIVGETTTPMSNTLSTVVNNVLRTTYHNNVGVNPALLRDFTQVNWLNRDITNMLQQAGTKYGLGLTETRLDYVRLVKTIVGHALNIDHFAASVLNINLRALMEANVTADDRIKALQAHSMISTQFHGPNQGALRPELAFDHDHIIRCLMLAAANYPRLEGIIVQINTGYVASANVIRPVSEKRYFPENLEQNQSAARLVSAVKARASEADISSIHLAIAREVSPMFNVHELKKIAESFEDPSSIVVVLEFILFALFFPTEFNRIKGDIQNVLLLFFSRWYPVEYGIFVQRGATYTINAAGEFEFSGRNEKWDQALYLSEHFPALFSDVPLAGANTIIAIMRLFTPQGFLRTDDLAIAANFPRASRNPQTYIPYTNQRGTVTNEFASRFRTIVATLANVVNERAVQDDMQKATRSCTKQWLRHLETQFDNIAVAHTDHLSVVYATMSNFMLNFTNNFSGNHATFKPDQYVITSPEGSYKPIIERQGETVDGLTIIDTSIVWPILCQCTYPLVRQSGKGVDAVSIMEEIVYPDPSTTLSQSLSVAQVLSKLTLPDAFINMILSGGDSVVMRTYQTEADDDLDEGIRMTTYDQYLSHIRERLHITNVPDPIYITGASTPDQIAASVQATHVAVVLYQSGVINGPASTYLRENEVLVVMPDYYDVVSRFANANLQMNNNRYHESVLEIADIFDQADFIQTSDAVRQLRALMPTLSTSQIRHAIERIAQITDVDSTDYGKLTLRFLGTLTRSLKMQNAQIRRIRPDGTVLRYDDQIDIEAFRWSRYFLDELQLRRLSVGLRLITNPRIARRFNGVRIMYLTDDDPDPDFVPDVPEGYVAVQYAHRLFSSSLANKRNRVTYTHPPTGMAYPSPTGRPHVHMTINERAGMSKLVADNIIASVIKSNWVVDILDIEYTAEVMTPSEGYTQHVDAESIMTAPKGKLFHLQFMDGLLRPEPSAFDPPASGEDMRLIYPLQPISVARSMRAIVNHNEVDRPRGAVAPSSYEMDTGTLSRNGDLLYSPVANGQVGIPKLEVDHISFSNVVSMMTANIRTGDDMAVERVNPDDVRAINIRNA 1333 T 130 SRP19 pdbhh T Viruses T 3jb2 3 D,E D,E C6K2M8_CPVBM LPP-3 AND LPP-5 MLQQPTGGYTTLEQFAFTIRNDGTNATPTQFLQLLSYEATENELVKKTIPTPETHLPSARNVPGNVYIEDAITQALFGISAQNVNAHGYFSRLSALALPNTSARLGLDGVIYNSETINIPFYDPAAVANFAATYAKLGNASTPRYRADMIDIYAHVGLELAGTDAERAAGVMPVKRAKFDSWEGSLISLSRDVVNWKILAFLIDLCSLEGEALRAFKTRNRDVFRMMLFIMSTAVAANVVNRKVTKRVDRVLEYIGVNSMRTAGRTATITYDLSRHEFAAKFLQLTFTRWNAASAMIRSMPDMHTPRTSITPAGENALVRHNRYMTENFKGLSPIALAQKKHEMMLHTHEIHSMDIDGSIKNMVERETVNKMNEIDAMNTAPWTEEFAEVEPTTVYERHQIGTDPEQTQLISQDAAVIVHQASSDVDENEYGNSVSELTIDTQSDSVL 448 T 0.022 CI-B14_5a pdbpssm T Viruses T 3jb3 1 A A Q914N6_CPVBM TP MWHYTSINNDTRVALDPKPNQIRTITKPNTVPQLGTDYLYTFNSQRRSHTLRLLGPFQYFNFSETDRGHPLFRLPLKYPSKAIPADELIDNLHSWMRSVHLLHVRSEDNTLRYNWMLGVYARSTNYTTPVGQLVVNAPAILNYSNPQDAFNSVFVALGIDYIDIPITNSNIFDDSSTPYNVRIWHAPTMTEVNHILALMRKSTLVSTHSSWHWNVLHTFHYRSESDMIDHFAAKILEDWRQKEKLDKGALVEADRVIQRLIPLSSSTYVQRLAAIGALYPNEFTENVLDLSRLSTALLQLSDTYYQHANDQLRRLYRRMYNDSRTLYMTQRHQELLLAQITADPNILLYPYTYIFTTIPTSMNYISNTGQGRIKHSLTVTGATEHDTVADIVLGQTGEDVITISMVEPMSIAVEDMYGYVLDTPTRDIWPADEQIEQKGDAVALYDTKTSRALGMFNNTVRIDDLLSPLLSLVYRTYIKGDTMTMTQGSLDHLTLCAAVDSDITFVGNRMIAPLPEGYIPKPMHRNNSTMKMLSLYVALKKLENFATNSYLMAPDTSIILLGAEREPAVNILRRFNRNVSNVRIIGMGDRAVEPNIRVRVPFPIDKNISADFIICDINSYEDQSFESMFSETISVVTTCASAATRALVKINHPSEYMINSVIERLSQLGGVFYHTALLKTASQNPYSYETYIYITPIAAAVRFPFYSNSAMINRYMTAVADDEMPIIPSIHTVIKGHSNTYSPGLFCGCVDVQSAPLALSQLKSYCSEATTWRVDSDDNLVNIIARIDPARIALEFRTRSNTSAYHEYQRYVPNGLGFKVRKTREFRYMHREVTFIHKLMMYALIREQISLTENMTQVVSIGGRNLADISVVPLNMKYVVIDPATRIETLTQEKKNIEVQSRPFQFDAANMDLENNSIYLFIAVIMNEPNGAATPARMQMDKIRNVATAMLTRTNCVAYISFYEAGIITRLDQSTAHKTIRVEEGRLKVANYVPVDTLVEADVTLMLRDIGITHEIIRPSTPELIDACSNYGIRLGSTGGAVLDVFNHYSPVIKLVRS 1058 T 0.52 DUF5705 pdbpssm T Viruses T 3jb3 2 B,C B,C CAPSD_CPVBM CSP-A AND CSP-B MHSTNNNSNKRNNEEKHKQPEIDSSANNGEGTSGTRAQTVGDTATEAGVRNETEAGASTRRQTDGTGLSGTNAKIATASSARQADVEKPADVTFTIENVDDVGIMQQKKPPTVVQSRTDVFNEQFANEALHPTTKVIFNGLDVNTEVQPLSDDFKQISDPKGYLTYSVKYEDQFTKKDKLRASEADDRIVGPTVNLFKYGAAVVNIDLNRDFFDTATGIDLTKGIPLVQDLLVPIGVTAGAEQSAEYVSGLLMVLFKVMTDNRLVIVGETTTPMSNTLSTVVNNVLRTTYHNNVGVNPALLRDFTQVNWLNRDITNMLQQAGTKYGLGLTETRLDYVRLVKTIVGHALNIDHFAASVLNINLRALMEANVTADDRIKALQAHSMISTQFHGPNQGALRPELAFDHDHIIRCLMLAAANYPRLEGIIVQINTGYVASANVIRPVSEKRYFPENLEQNQSAARLVSAVKARASEADISSIHLAIAREVSPMFNVHELKKIAESFEDPSSIVVVLEFILFALFFPTEFNRIKGDIQNVLLLFFSRWYPVEYGIFVQRGATYTINAAGEFEFSGRNEKWDQALYLSEHFPALFSDVPLAGANTIIAIMRLFTPQGFLRTDDLAIAANFPRASRNPQTYIPYTNQRGTVTNEFASRFRTIVATLANVVNERAVQDDMQKATRSCTKQWLRHLETQFDNIAVAHTDHLSVVYATMSNFMLNFTNNFSGNHATFKPDQYVITSPEGSYKPIIERQGETVDGLTIIDTSIVWPILCQCTYPLVRQSGKGVDAVSIMEEIVYPDPSTTLSQSLSVAQVLSKLTLPDAFINMILSGGDSVVMRTYQTEADDDLDEGIRMTTYDQYLSHIRERLHITNVPDPIYITGASTPDQIAASVQATHVAVVLYQSGVINGPASTYLRENEVLVVMPDYYDVVSRFANANLQMNNNRYHESVLEIADIFDQADFIQTSDAVRQLRALMPTLSTSQIRHAIERIAQITDVDSTDYGKLTLRFLGTLTRSLKMQNAQIRRIRPDGTVLRYDDQIDIEAFRWSRYFLDELQLRRLSVGLRLITNPRIARRFNGVRIMYLTDDDPDPDFVPDVPEGYVAVQYAHRLFSSSLANKRNRVTYTHPPTGMAYPSPTGRPHVHMTINERAGMSKLVADNIIASVIKSNWVVDILDIEYTAEVMTPSEGYTQHVDAESIMTAPKGKLFHLQFMDGLLRPEPSAFDPPASGEDMRLIYPLQPISVARSMRAIVNHNEVDRPRGAVAPSSYEMDTGTLSRNGDLLYSPVANGQVGIPKLEVDHISFSNVVSMMTANIRTGDDMAVERVNPDDVRAINIRNA 1333 T 130 SRP19 pdbhh T Viruses T 3jb3 3 D,E D,E C6K2M8_CPVBM LPP-3 AND LPP-5 MLQQPTGGYTTLEQFAFTIRNDGTNATPTQFLQLLSYEATENELVKKTIPTPETHLPSARNVPGNVYIEDAITQALFGISAQNVNAHGYFSRLSALALPNTSARLGLDGVIYNSETINIPFYDPAAVANFAATYAKLGNASTPRYRADMIDIYAHVGLELAGTDAERAAGVMPVKRAKFDSWEGSLISLSRDVVNWKILAFLIDLCSLEGEALRAFKTRNRDVFRMMLFIMSTAVAANVVNRKVTKRVDRVLEYIGVNSMRTAGRTATITYDLSRHEFAAKFLQLTFTRWNAASAMIRSMPDMHTPRTSITPAGENALVRHNRYMTENFKGLSPIALAQKKHEMMLHTHEIHSMDIDGSIKNMVERETVNKMNEIDAMNTAPWTEEFAEVEPTTVYERHQIGTDPEQTQLISQDAAVIVHQASSDVDENEYGNSVSELTIDTQSDSVL 448 T 0.022 CI-B14_5a pdbpssm T Viruses T 3jb6 1 A A D0EZK6_CPVBM RNA-dependent RNA polymerase MLPNTKLHNTIFSETRKFTRESFKEIEHLTARLANDRVARHDFLFNTSIVLISDYSGEDSNGNQLQATITIPNEIINPKEYDPSDYPLAEDESFFKQGHKYDYLVTFRAGSLTNTYEPKTKMYKLHAALDKLMHVKQRKSRFADLWRELCAVIASLDVWYQTTNYPLRTYVKLLFHKGDEFPFYESPSQDRIIFNDKSVASILPTFVYTCCQVGTAIMSGILTHVESIVAMNHFLHCAKDSYIDEKLKIKGIGRSWYQEALHNVCQATVPVWSQFNEVIGHRTKSTSEPHFVSSTFISLRAKRAELLYPEFNAYINRAIQLSKTQNDVANYYAACRAMTNDGTFLATLTELSLDAAVFPRIEQHLVTRPAVLMSNTRHESLKQKYTNGVGSIAQSYLSSFTDEIAKRVNGRHHDEAWLNFLTTSSPGRKLTEIEKLEVGGDVAAWSNSRIVMQAVFAREYRTPERIFKSLKAPIKLVERQQSDRRQRAISGLDNDRLFLSFMPYTIGKQIYELNDNAAQGKQAGNAFDIGEMLYWTSQRNVLLSSIDVAGMDASVTTNTKDIYNTFVLDVASKCTVPRFGPYYAKNMEVFEAGNRQSQVRYVNAAWQACALEAANSQTSTSYESEIFGQVKNAEGTYPSGRADTSTHHTVLLQGLVRGNELKRASDGKNSCLATIKILGDDIMEIFQGSESDTYDHAVSNASILNESGFATTAELSQNSIVLLQQLVVNGTFWGFADRISLWTREDTKDIGRLNLAMMELNALIDDLVFRVRRPEGLKMLGFFCGAICLRRFTLSVDNKLYDSTYNNLSKYMTLTKYDKNPDSDSTLMSLILPLAWLFMPRGGEYPAYPFERRDGTFTEDESMFTARGAYKRRLLYDVSNIGEMIQQNSMALDDDLLHEYGFTGALLLIDLNILDLIDEVKKEDISPVKVSELATSLEQLGKLGEREKSRRAASDLKIRGHALSNDIVYGYGLQEKIQKSAMATKETTVQSKRVSSRLHDVIVAKTRDYKISTIPADALHLHEFEVEDVTVDLLPHAKHTSYSSLAYNMSFGSDGWFAFALLGGLDRSANLLRLDVASIRGNYHKFSYDDPVFKQGYKIYKSDATLLNDFFTAISAGPKEQGILLRAFAYYSLYGNVEYHYVLSPRQLFFLSDNPVSAERLVRIPPKYYVSTQCRALYNIFSYLHILRSIANNRGKRLKMVLHPGLIAYVRGTSQGAILPEADNV 1225 T 0.59 DUF2779 pdbpercent T Viruses T 3jb6 2 B B Q9IR43_CPVBM Viral structural protein 4 MFAIDPLKHSKLYEEYGLYLRPHQINQEIKPTTIKKKELAPTIRSIKYASLIHSMLAKHAARHNGTLINPRMYADMITLGNTKVTVTKGTPKAQIDTLKMNGLTVVSKSRRNNKKKPVSDTTATIDENTDDIVTYKALTEMSTLIESFRLPSGLALIIFDDEKYQSLIPNYINQLIAYTQPHIIPTWQGIADFSDTYLRSYFKRPFELTASNLAAPQKYNLSPMTRSIFNNTGREDAVIRKLYGYGEYVFIRYEGCLITWTGIYGEVTMMVNLSKRDLGLDVGDDYLKEYKKLLFYGVITDAIPSGISARSTIMKISPHKMMNPSGGALAVLSKFLEAVVSTNVINATLVVYAEKGAGKTSFLSTYAEQLSLASGQVVGHLSSDAYGRWLAKNKDVEEPSFAYDYVLSLDTDDNESYYEQKASELLISHGISEVAQYELLSVRKKIKMMDEMNEVLIAQLENADTHSERNFYYMVSTGKTTPRTLIVEGHFNAQDATIARTDTTVLLRTINDTTQAMRDRQRGGVVQLFLRDTYYRLLPALHTTVYPFEMLESIRRWKWVH 561 T 0.00013 Zeta_toxin pdbhh T Viruses T 3jb6 3 C,D C,D D3JWE6_CPVBM VP1 CSP PTVVQSRTDVFNEQFANEALHPMT 24 T 8.5 DASH_Dam1 pdbhh T Viruses T 3jb7 1 A A D0EZK6_CPVBM CPV RNA-dependent RNA polymerase MLPNTKLHNTIFSETRKFTRESFKEIEHLTARLANDRVARHDFLFNTSIVLISDYSGEDSNGNQLQATITIPNEIINPKEYDPSDYPLAEDESFFKQGHKYDYLVTFRAGSLTNTYEPKTKMYKLHAALDKLMHVKQRKSRFADLWRELCAVIASLDVWYQTTNYPLRTYVKLLFHKGDEFPFYESPSQDRIIFNDKSVASILPTFVYTCCQVGTAIMSGILTHVESIVAMNHFLHCAKDSYIDEKLKIKGIGRSWYQEALHNVCQATVPVWSQFNEVIGHRTKSTSEPHFVSSTFISLRAKRAELLYPEFNAYINRAIQLSKTQNDVANYYAACRAMTNDGTFLATLTELSLDAAVFPRIEQHLVTRPAVLMSNTRHESLKQKYTNGVGSIAQSYLSSFTDEIAKRVNGRHHDEAWLNFLTTSSPGRKLTEIEKLEVGGDVAAWSNSRIVMQAVFAREYRTPERIFKSLKAPIKLVERQQSDRRQRAISGLDNDRLFLSFMPYTIGKQIYELNDNAAQGKQAGNAFDIGEMLYWTSQRNVLLSSIDVAGMDASVTTNTKDIYNTFVLDVASKCTVPRFGPYYAKNMEVFEAGNRQSQVRYVNAAWQACALEAANSQTSTSYESEIFGQVKNAEGTYPSGRADTSTHHTVLLQGLVRGNELKRASDGKNSCLATIKILGDDIMEIFQGSESDTYDHAVSNASILNESGFATTAELSQNSIVLLQQLVVNGTFWGFADRISLWTREDTKDIGRLNLAMMELNALIDDLVFRVRRPEGLKMLGFFCGAICLRRFTLSVDNKLYDSTYNNLSKYMTLTKYDKNPDSDSTLMSLILPLAWLFMPRGGEYPAYPFERRDGTFTEDESMFTARGAYKRRLLYDVSNIGEMIQQNSMALDDDLLHEYGFTGALLLIDLNILDLIDEVKKEDISPVKVSELATSLEQLGKLGEREKSRRAASDLKIRGHALSNDIVYGYGLQEKIQKSAMATKETTVQSKRVSSRLHDVIVAKTRDYKISTIPADALHLHEFEVEDVTVDLLPHAKHTSYSSLAYNMSFGSDGWFAFALLGGLDRSANLLRLDVASIRGNYHKFSYDDPVFKQGYKIYKSDATLLNDFFTAISAGPKEQGILLRAFAYYSLYGNVEYHYVLSPRQLFFLSDNPVSAERLVRIPPKYYVSTQCRALYNIFSYLHILRSIANNRGKRLKMVLHPGLIAYVRGTSQGAILPEADNV 1225 T 0.59 DUF2779 pdbpercent T Viruses T 3jb7 2 B B Q9IR43_CPVBM Viral structural protein 4 MFAIDPLKHSKLYEEYGLYLRPHQINQEIKPTTIKKKELAPTIRSIKYASLIHSMLAKHAARHNGTLINPRMYADMITLGNTKVTVTKGTPKAQIDTLKMNGLTVVSKSRRNNKKKPVSDTTATIDENTDDIVTYKALTEMSTLIESFRLPSGLALIIFDDEKYQSLIPNYINQLIAYTQPHIIPTWQGIADFSDTYLRSYFKRPFELTASNLAAPQKYNLSPMTRSIFNNTGREDAVIRKLYGYGEYVFIRYEGCLITWTGIYGEVTMMVNLSKRDLGLDVGDDYLKEYKKLLFYGVITDAIPSGISARSTIMKISPHKMMNPSGGALAVLSKFLEAVVSTNVINATLVVYAEKGAGKTSFLSTYAEQLSLASGQVVGHLSSDAYGRWLAKNKDVEEPSFAYDYVLSLDTDDNESYYEQKASELLISHGISEVAQYELLSVRKKIKMMDEMNEVLIAQLENADTHSERNFYYMVSTGKTTPRTLIVEGHFNAQDATIARTDTTVLLRTINDTTQAMRDRQRGGVVQLFLRDTYYRLLPALHTTVYPFEMLESIRRWKWVH 561 T 0.00013 Zeta_toxin pdbhh T Viruses T 3jb7 3 C,F C,D D3JWE6_CPVBM VP1 CSP PTVVQSRTDVFNEQFANEALHPMT 24 T 8.5 DASH_Dam1 pdbhh T Viruses T 3jb9 33 QA x unknown chain XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 412 F F F 3jbu 49 WA z OMPA_ECOLI SecM-glycine MASWSHPQFEKGGGARGGSGGGSWSHPQFEKGFENLYFQGMKKTAIAIAVALAGFATVAQAEQKLISEEDLFSTPVWISQAQGIRAG 87 T 1.6999999999999998E-75 OmpA_membrane unp F Bacteria T 3jc2 4 D 3 SEC61 BETA XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 32 F F F 3jcu 20 TA,U u,U A0A0K9RHP1_SPIOL Photosystem II Reaction Center Tn protein MASITMTASFLGTTVSKQPPTHHLRRGVVMAKAMPETTTTTKEETSSKRRDLVFAVAAAAACSVARIAMAEEPKRGTPEAKKKYAPVCVTMPSARICYK 99 T 0.014 PsbQ pdbpercent F Eukaryota T 3jd5 32 FA,GA s,z unknown XXXXXXXXXXXXXXXXX 17 F F F 3jpv 2 B B Peptide (PIMTIDE) ARKRRRHPSGPPTA ARKRRRHPSGPPTA 14 T 1.5 DUF3019 pdbhh F T 3jpx 2 B B H4_HUMAN HISTONE PEPTIDE GGAKRHRKVLRDNIQ 15 T 0.27 UPF0137 unp F Eukaryota T 3jq5 2 B B A4_HUMAN Amyloid Beta Peptide DAEFRHDS 8 T 0.0001 Beta-APP unphh F Eukaryota T 3jqo 3 AA,C,DA,F,GA,I,JA,L,MA,O,PA,R,U,X a,C,d,F,g,I,j,L,m,O,p,R,U,X Q46702_ECOLX TraN protein CSSGHKPPPEPDWSNTVPVNKTIPVDTQGGRNES 34 T 0.0023 LPAM_1 unphh F Bacteria T 3jr3 2 B D Acetylated Peptide KKGQSTSRHKXLRFKTEG 18 T 21 DUF986 pdbhh F T 3jrv 2 C,D,E C,D,E DDX3X_HUMAN DEAD BOX PROTEIN 3, X-CHROMOSOMAL, HELICASE-LIKE PROTEIN 2, HLP2, DEAD BOX, X ISOFORM SFGSRSDSRGKSSFFSDRGS 20 T 26 DUF5725 pdbhh F Eukaryota T 3jv2 2 C,D C,D peptide XXX 3 F F F 3jxt 2 C,D C,D CCG2_MOUSE NEURONAL VOLTAGE-GATED CALCIUM CHANNEL GAMMA-2 SUBUNIT, STARGAZIN XXRTTPV 7 T 300 DUF3858 pdbhh F Eukaryota F 3jz9 1 A A DRRA_LEGPH Uncharacterized protein DrrA GHMVTRIENLENAKKLWDNANSMLEKGNISGYLKAANELHKFMKEKNLKEDDLRPELSDKTISPKGYAILQSLWGAASDYSRAAATLTESTVEPGLVSAVNKMSAFFMDCKLSPNERATPDPDFKVGKSKILVGIMQFIKDVADPTSKIWMHNTKALMNHKIAAIQKLERSNNVNDETLESVLSSKGENLSEYLSYK 197 T 0.88 DUF3800 unppssm F Bacteria T 3jza 2 B B DRRA_LEGPH Uncharacterized protein DrrA GHMVTRIENLENAKKLWDNANSMLEKGNISGYLKAANELHKFMKEKNLKEDDLRPELSDKTISPKGYAILQSLWGAASDYSRAAATLTESTVEPGLVSAVNKMSAFFMDCKLSPNERATPDPDFKVGKSKILVGIMQFIKDVADPTSKIWMHNTKALMNHKIAAIQKLERSNNVNDETLESVLSSKGENLSEYLSYK 197 T 0.88 DUF3800 unppssm F Bacteria T 3jzg 2 B B HISTONE PEPTIDE ARKS 4 T 250 HEAT_UF pdbhh F F 3jzh 2 B B HISTONE PEPTIDE DFKTD 5 T 99 DM13 pdbhh F F 3jzo 2 B P pDI peptide (12mer) LTFEHYWAQLTS 12 T 1.4 ASXH pdbhh F T 3jzp 2 B P pDI6W peptide (12mer) LTFEHWWAQLTS 12 T 0.19 Potyvirid-P3 pdbhh F T 3jzq 2 B,D P,Q pDIQ peptide (12mer) ETFEHWWSQLLS 12 T 0.14 Potyvirid-P3 pdbhh F T 3jzr 2 B P pDI6W peptide (12mer) LTFEHWWAQLTS 12 T 0.19 Potyvirid-P3 pdbhh F T 3jzs 2 B P pDIQ peptide (12mer) ETFEHWWSQLLS 12 T 0.14 Potyvirid-P3 pdbhh F T 3k05 2 C,D C,D phospho peptide SQEYX 5 T 220 DUF5858 pdbhh F F 3k0h 2 B B phospho peptide SPTFX 5 T 96 BNR_6 pdbhh F F 3k0k 2 B B phospho peptide pSPTF-COOH SPTF 4 T 51 Herpes_U30 pdbhh F F 3k15 2 B B phospho peptide SPTFX 5 T 96 BNR_6 pdbhh F F 3k16 2 B B phospho peptide SPTF 4 T 51 Herpes_U30 pdbhh F F 3k1q 2 B B Q9E3V8_9REOV VP3A ANGPELIIEDTGLCTSFMLLDNIPSAHLTKELIGFTWFMQMYQMTPPLPEGAVNRIVCMTNWASLGDEGRGLEVRLPPPTDSSVHAYKTVLSRGYIDNAQFNPLALRSNVLLMLLQFTLSNLKINKSSTFTSDVTTITSGRMIRAFEGRPELLALAYPGRAVLPTQTKNAQFLSTAIADRIGRLDRANLIGGEVSAMVECMELCDALTLHIRETYIMLLRSMHQDPTQIVQIVNECANNLLNSTIPISLRPTILCPWFASSEDLRLQEVMHLVNISSNTAAALPLVEALSTLLRSVTPLVLDPTVLTNAITTISESTTQTISPISEILRLLQPMGNDYAAFWKCIASWAYNGLVTTVLSEDAFPDSSQSITHLPSMWKCLFLTLAGPMTSDPHSPVKVFMALANLLAQPEPIAIGVPGMHQTTPASQFSHPGVWPPGFLNPQLINPQQAPLLRAFAEHIRANWPQPSEFGYGSTLQGSANLFIPSNRMVYPWPNQPLPRLTVAPTYDSAMSNWISTTIAFFIRVVNSVNMTATVNDLTRRTMTGVMTAMRQVKTMTPFYIQHMCPTELSVLASVTVTPPFQVPFTRLVQNDVITNVLVARVDPAQRGDAAVDIRATHATFAAALPVDPAAIVVAMLCGQTETNLIPSHHYGKAFAPLFASNAMFTRNQRAVITREAFVCARSAVAQCQDAGFLVPRPLDALRQFDVTSAAAAEIMHAVNDAFKTAFDLDGALLDGLALYGDPRIADLSAAYLQYGGNVVREHVPPGPSHIHRALQQVESTFMAEMNLFNVARGNLYLVQTATNGNWSPMAPVAAPPFVRGGPNVRVVGRFGTIVPRPNGLEPQLIDDGNVPRDIAGDWVYPSDVLQVSVAVFRDYVWPMVKAGRTRVLVELGHYVYTLHYYDPQISLDEAPILEEWLSKINPAGIPPVPFCIPIPQVYPCITARRVHYAFTSENNNDSLFSTNAASIDTAFGENAAVSPLRWPGLVDPNYRVGTNDLPNRITLYNSLYRYNFTYPTLDGIMYVRSAT 1027 T 27 Peptidase_C36 pdbhh T Viruses T 3k24 2 C,D C,D H3 peptide QLA 3 T 370 Enterotoxin_a pdbhh F F 3k26 2 B B HISTONE PEPTIDE ARTKKQTARKST 12 T 150 DUF3042 pdbhh F T 3k27 2 B B HISTONE PEPTIDE KQTARKSTG 9 T 300 Ice_nucleation pdbhh F T 3k33 3 E E Polypeptide of unknown amino acids and source XXXXXXXXXXXX 12 F F F 3k48 2 D,E,F R,S,T peptide SGWCDPRWYDPFMCEH 16 T 0.36 Yuri_gagarin pdbhh F T 3k8g 1 A,B A,B TP453_TREPA PUTATIVE UNCHARACTERIZED PROTEIN GSGAWKASVDPLGVVGSGADVYLYFPVAGNENLISRIIENHESKADIKKIVDRTTAVYGAFFARSKEFRLFGSGSYPYAFTNLIFSRSDGWASTKTEHGITYYESEHTDVSIPAPHFSCVIFGSSKRERMSKMLSRLVNPDRPQLPPRFEKECTSEGTSQTVALYIKNGGHFITKLLNFPQLNLPLGAMELYLTARRNEYLYTLSLQLGNAKINFPIQFLISRVLNAHIHVEGDRLIIEDGTISAERLASVISSLYSKKGSS 262 T 0.11 DUF5618 unppercent F Bacteria T 3k8h 1 A,B A,B TP453_TREPA PUTATIVE UNCHARACTERIZED PROTEIN GSGAWKASVDPLGVVGSGADVYLYFPVAGNENLISRIIENHESKADIKKIVDRTTAVYGAFFARSKEFRLFGSGSYPYAFTNLIFSRSDGWASTKTEHGITYYESEHTDVSIPAPHFSCVIFGSSKRERMSKMLSRLVNPDRPQLPPRFEKECTSEGTSQTVALYIKNGGHFITKLLNFPQLNLPLGAMELYLTARRNEYLYTLSLQLGNAKINFPIQFLISRVLNAHIHVEGDRLIIEDGTISAERLASVISSLYSKKGSS 262 T 0.11 DUF5618 unppercent F Bacteria T 3k8i 1 A A TP453_TREPA PUTATIVE UNCHARACTERIZED PROTEIN GSGAWKASVDPLGVVGSGADVYLYFPVAGNENLISRIIENHESKADIKKIVDRTTAVYGAFFARSKEFRLFGSGSYPYAFTNLIFSRSDGWASTKTEHGITYYESEHTDVSIPAPHFSCVIFGSSKRERMSKMLSRLVNPDRPQLPPRFEKECTSEGTSQTVALYIKNGGHFITKLLNFPQLNLPLGAMELYLTARRNEYLYTLSLQLGNAKINFPIQFLISRVLNAHIHVEGDRLIIEDGTISAERLASVISSLYSKKGSS 262 T 0.11 DUF5618 unppercent F Bacteria T 3k8j 1 A A TP453_TREPA PUTATIVE UNCHARACTERIZED PROTEIN GSGAWKASVDPLGVVGSGADVYLYFPVAGNENLISRIIENHESKADIKKIVDRTTAVYGAFFARSKEFRLFGSGSYPYAFTNLIFSRSDGWASTKTEHGITYYESEHTDVSIPAPHFSCVIFGSSKRERMSKMLSRLVNPDRPQLPPRFEKECTSEGTSQTVALYIKNGGHFITKLLNFPQLNLPLGAMELYLTARRNEYLYTLSLQLGNAKINFPIQFLISRVLNAHIHVEGDRLIIEDGTISAERLASVISSLYSKKGSS 262 T 0.11 DUF5618 unppercent F Bacteria T 3k93 1 A A Q0I4G3_HAES1 phage related exonuclease GMNNLYHLKVRCSSLHKIIGEPKSKADKEAGKLTDTAKSAVREMAKFDLFGYNAFEGNKYTQKGNELEEQAIKLSGVTRGLALKKNTERRENEFITGECDIYVPSRKLIIDTKCSWDIGSHPFFTDEAQEKAKKAGYDIQMQGYMWLWDCDQAQIDFVLFPTPLNLISAYDSDFKLIDLVEQIPQIRRITTVIIQRDNELIDKIKERVSAAQKYYDQLISEMS 223 T 0.0007 PDDEXK_1 unppercent F Bacteria T 3kd7 2 F,G,H,I,J G,H,I,J,K Q9H2A1_HUMAN Hsp90 MEEVD peptide XMEEVD 6 T 13 TBP unphh F Eukaryota F 3kf9 2 B,D B,D MYLK2_HUMAN MLCK2 KRRWKKNFIAVSAANRFKKISS 22 T 0.024 PACT_coil_coil unppssm F Eukaryota T 3kgv 1 A,B,C,D,E,F,G,H,I,J,K,L,M,N A,B,C,D,E,F,X,O,P,Q,R,S,T,Y DNA-PK CATALYTIC SUBUNIT, DNA-PKCS, DNPK1, P460 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 4128 F F F 3kl4 2 B B DAP2_YEAST DPAP B, YSCV ARSGSGSGSGSKLIRVGIILVLLIWGTVLLLKSIPHHHHHHH 42 T 0.011 Holin_BlyA pdb F Eukaryota T 3kmz 2 B,D C,D NCOR1_HUMAN N-COR1, N-COR RLITLADHICQIITQDFAR 19 T 6.3 Es2 pdbhh F Eukaryota T 3knt 1 A,B,C,D A,B,C,D OGG1_METJA 8-OXOGUANINE DNA GLYCOSYLASE, DNA-(APURINIC OR APYRIMIDINIC SITE) LYASE, AP LYASE MMLIKKIEELKNSEIKDIIDKRIQEFKSFKNKSNEEWFKELCFCILTANFTAEGGIRIQKEIGDGFLTLPREELEEKLKNLGHRFYRKRAEYIVLARRFKNIKDIVESFENEKVAREFLVRNIKGIGYQEASHFLRNVGYDDVAIIDRHILRELYENNYIDEIPKTLSRRKYLEIENILRDIGEEVNLKLSELDLYIWYLRTGKVLK 207 T 0.0002 HhH-GPD unppercent F Archaea T 3kny 1 A A Q8A1X3_BACTN hypothetical protein BT_3535 GACEQNEDWVVNEPMQSFEENPEYAPLNTIPDWVSEKVTPKEYELWRTMSSRYEINYSFLKKDISEKRKKEIYDCINNICERIEKGQINKYEGFLNIADEDGTTLSDSQYFGRIATRSPEGGAEYKTNGCTLYTHSLGPYIKAAVTYKKSDDDVTITSSSVYTGSPYLGNDPSFSGASSVSYDKDKKLIAASCSGTLSFKDGSRKVEVTVQKTGFMIP 218 T 0.2 FtsH_ext unppercent F Bacteria T 3kpl 3 C C EEYLQAFTY, self peptide from the ATP binding cassette protein ABCD3 EEYLQAFTY 9 T 1.2 DUF3921 pdbhh F T 3kpm 3 C C EEYLKAWTF, mimotope peptide EEYLKAWTF 9 T 8.9 IcmF-related pdbhh F T 3kpn 3 C C EEYLQAFTY, self peptide from the ATP binding cassette protein ABCD3 EEYLQAFTY 9 T 1.2 DUF3921 pdbhh F T 3kpo 3 C C EEYLKAWTF, mimotope peptide EEYLKAWTF 9 T 8.9 IcmF-related pdbhh F T 3kpp 3 C C EEYLQAFTY, self peptide from the ATP binding cassette protein ABCD3 EEYLQAFTY 9 T 1.2 DUF3921 pdbhh F T 3kpq 3 C C EEYLKAWTF, mimotope peptide EEYLKAWTF 9 T 8.9 IcmF-related pdbhh F T 3kpr 3 C,H C,H EEYLKAWTF, mimotope peptide EEYLKAWTF 9 T 8.9 IcmF-related pdbhh F T 3kps 3 C C EEYLQAFTY, self peptide from the ATP binding cassette protein ABCD3 EEYLQAFTY 9 T 1.2 DUF3921 pdbhh F T 3kti 2 H,I,J,K,L,M,N H,I,J,K,L,M,N ADEP1 XFSPXAX 7 T 45 Acp26Ab pdbhh F F 3ktj 2 H,I,J,K,L,M,N H,I,J,K,L,M,N ADEP2 XXSPXAX 7 T 430 GreA_GreB pdbhh F F 3ktk 2 AA,BA,H,I,J,K,L,M,N,V,W,X,Y,Z 0,1,O,P,Q,R,S,T,U,V,W,X,Y,Z ADEP2 XXSPXAX 7 T 430 GreA_GreB pdbhh F F 3kww 3 C C BZLF1_EBVB9 EB1, PROTEIN ZEBRA LPEPLPQGQLTAY 13 T 14 AP-5_subunit_s1 pdbhh T Viruses T 3kxf 5 M,N,O,T Q,R,T,S BZLF1_EBVB9 EB1, PROTEIN ZEBRA LPEPLPQGQLTAY 13 T 14 AP-5_subunit_s1 pdbhh T Viruses T 3kxs 1 A,B,C,D,E,F F,E,C,D,A,B CAPSD_HBVD1 CORE PROTEIN, CORE ANTIGEN, HBCAG, P21.5 MDIDPYKEFGATVELLSFLPSDFFPSVRDLLDTAAALYRDALESPEHCSPHHTALRQAILCWGDLMTLATWVGTNLEDPASRDLVVSYVNTNVGLKFRQLLWFHISCLTFGRETVLEYLVSFGVWIRTPPAARPPNAPILSTL 143 T 3.9E-25 Hepatitis_core unp T Viruses T 3kyn 3 C P KGPPAALTL peptide KGPPAALTL 9 T 73 Holin_BhlA pdbhh F T 3kyo 3 E,F P,Q KLPAQFYIL peptide KLPAQFYIL 9 T 0.2 RRP14 pdbhh F T 3kze 2 D,E D,E Synthetic Peptide SSRKEYYA 8 T 10 DUF4052 pdbhh F T 3l35 2 D,E,F H,K,L HIV ENTRY INHIBITOR PIE12 XXGXXXXXXXXXXXXXXX 18 T 1.5 DUF951 pdbhh F F 3l36 2 B H HIV ENTRY INHIBITOR PIE12 XXXXXXXXXXXXXXXXX 17 T 1.8 Ribosomal_L37 pdbhh F F 3l37 2 B H HIV ENTRY INHIBITOR PIE12 XXGXXXXXXXXXXXXXXX 18 T 1.5 DUF951 pdbhh F F 3l3q 2 B,C B,C pepTM GSEFESPFKKKRREA 15 T 0.65 DUF240 pdbhh F T 3l41 2 B B phosphorylated H2A tail KPSQEL 6 T 11 POX pdbhh F T 3l8l 1 A,C A,C VAL-GRAMICIDIN A XGAXAXVXWXWXWXWX 16 T 4.6 MAP17 pdbhh F F 3l8l 2 B,D B,D VAL-GRAMICIDIN A XGAXAXVXWXFWXWXWX 17 T 0.53 MAP17 pdbhh F T 3l9a 1 A X A2V8B8_STRMG uncharacterized protein NKREETDMRDFFVITNSEYTFAGVHYAKGAVLHVSPTQKRAFWVIADQENFIKQVNKNIEYVEKNASPAFLQRIVEIYQVKFEGKNVH 88 T 3.5 DUF4604 pdbhh F Bacteria T 3l9k 2 E,F,G,H W,X,Y,Z DYIN_DROME DH IC, CYTOPLASMIC DYNEIN INTERMEDIATE CHAIN, PROTEIN SHORT WING LSEEQKQMIILSENFQRFVVRAGRVIERALSENVDIYT 38 T 0.36 Mrx7 pdbhh F Eukaryota T 3lae 2 B X Unknown peptide fragment IFG 3 T 66 Mac pdbhh F F 3lca 2 B Q HSP71_YEAST HEAT SHOCK PROTEIN YG100 PEAEGPTVEEVD 12 T 12 DUF6246 pdbhh F Eukaryota T 3lfk 1 A,B,C,D A,B,C,D Q97AP8_THEVO MSCTV GSHMSAMAESKVLVKGTPFNKPVIKGKLENNYDMSQDEVSLLLFLKTHGGKIPLYRIKNETGLKDPESVLKNLMDYGFALEDKERLGEKIVLTSEGEFVAQAIRVRDEELRLKEMKQKKNVNRSSAPPQ 129 T 0.0036 MotA_activ unphh F Archaea T 3lge 2 E,F,G,H E,F,G,H SNX9_HUMAN SH3 AND PX DOMAIN-CONTAINING PROTEIN 1, PROTEIN SDP1, SH3 AND PX DOMAIN-CONTAINING PROTEIN 3A QAYQGPATGDDDDWDEDWDGPKSSSYFKDSE 31 T 0.92 DUF4594 unp F Eukaryota T 3lgf 2 B B DIMETHYLATED p53 Lysine 370 PEPTIDE SSHLKSKKGQ 10 T 37 TEX12 pdbhh F T 3lgl 2 B B DIMETHYLATED p53 LYSINE 382 PEPTIDE TSRHKKLMFKT 11 T 26 DUF420 pdbhh F T 3lh0 2 B B DIMETHYLATED p53 LYSINE 372 PEPTIDE SHLKSKKGQST 11 T 17 Flp_Fap pdbhh F T 3liy 2 C,F,I I,J,K statine-containing inhibitor XAPQVXVMHP 10 T 19 OTCace pdbhh F T 3lk4 3 AA,C,DA,F,GA,I,JA,L,O,R,U,X 0,C,3,F,6,I,9,L,O,R,U,X CD2AP_HUMAN CAS LIGAND WITH MULTIPLE SH3 DOMAINS, ADAPTER PROTEIN CMS VNFDDIASSENLLHLTANRPKMPGRRLPG 29 T 0.042 CARMIL_C pdbhh F Eukaryota T 3lkn 3 C C NP418 epitope from 1918 influenza strain LPFERATIM 9 T 2.1 Shal-type pdbhh F T 3lko 3 C C NP418 epitope from 1934 influenza strain LPFDRTTIM 9 T 0.53 DUF5775 pdbhh F T 3lkp 3 C C NP418 epitope from 1972 influenza strain LPFDKSTIM 9 T 3.7 Pas_Saposin pdbhh F T 3lkq 3 C C NP418 epitope from 1977 influenza strain LPFDKTTIM 9 T 4.2 Pas_Saposin pdbhh F T 3lkr 3 C C NP418 epitope from 2009 swine-influenza strain LPFERATVM 9 T 1.3 Shal-type pdbhh F T 3lks 3 C C NP418 epitope from 1980 influenza strain LPFEKSTVM 9 T 2.5 DUF724 pdbhh F T 3ll8 1 A E AKAP5_HUMAN AKAP79 peptide EPIAIIITDTE 11 T 3.2 Copine pdbhh F Eukaryota T 3lly 2 B B LECB2_MACPO MPA GRNGKSQSIIVGPWGD 16 T 0.7 DUF3842 pdbhh F Eukaryota T 3llz 2 B B LECB2_MACPO MPA NGKSQSIIVGPWGD 14 T 0.48 DUF3842 pdbhh F Eukaryota T 3lm1 2 B,D,F,H,J,L,N,P B,D,F,H,J,L,N,P LECB2_MACPO MPA RNGKSQSIIVGPWGD 15 T 0.57 DUF3842 pdbhh F Eukaryota T 3ln4 3 C C HNRPC_HUMAN 16-mer peptide from Heterogeneous nuclear ribonucleoproteins C1/C2 AEMYGSVTEHPSPSPL 16 T 7.6 NepR pdbhh F Eukaryota T 3lnj 2 B,D,F B,D,F D-peptide inhibitor XXXXXXXXXXXX 12 T 6.2 DUF4924 pdbhh F F 3lny 2 B B RPGF6_HUMAN PDZ DOMAIN-CONTAINING GUANINE NUCLEOTIDE EXCHANGE FACTOR 2, PDZ-GEF2, RA-GEF-2 EQVSAV 6 T 92 Adeno_52K pdbhh F Eukaryota F 3lnz 2 B,D,F,H,J,L,N,P B,D,F,H,J,L,N,P 12-mer peptide inhibitor TSFAEYWALLSP 12 T 0.31 P53_TAD pdbhh F T 3lpr 2 B P METHOXYSUCCINYL-ALA-ALA-PRO-NORLEUCINE BORONIC ACID INHIBITOR XAAPX 5 T 970 Pep_deformylase pdbhh F F 3lqa 2 B G Q1PHM6_9HIV1 Envelope glycoprotein gp160 EIVLENVIENFNMWKNDMVDQMHQDIISLWDQSLKPCVKLTPLCVGAGNCNTSTIAQACPKVSFDPIPIHYCAPAGYAILKCNDKTFNGIGPCNNVSTVQCTHGIKPVVSTQLLLNGSLAEEEVVIRSENISNNVKTIIVHLTESVNITCIGAGHCNINEKAWNETLKKVVEKLVKYFPNKTIEFAPPVGGDLEITTHSFNCGGEFFYCNTTKLFNSIHNSTDSTVNSTDSTAETGNSTNTNITLPCRIRQIINMWQEVGRAMYAPPSKGNITCISDITGLLLTRDGGENKTENNDTEIFRPGGGDMKDNWRSELYKYKVVEIKSGHHHHHH 332 T 1.1E-51 GP120 unp T Viruses T 3lrh 2 B,D,F,H,J,L,N,P B,D,F,H,J,L,N,P HD_HUMAN HUNTINGTON DISEASE PROTEIN, HD PROTEIN EKLMKAFESLKSFQ 14 T 2 Mito_fiss_reg unphh F Eukaryota T 3lt8 1 A A ATP BINDING PROTEIN-D65V GSMDYKDDDDKKTNWLKRIYRVRPCVKCKVAPRNWKVKNKHLRIYNMCKTCFNNSIDIGDDTYHGHVDWLMYADSKEISNT 81 T 0.013 ZZ pdbpssm F T 3lt9 1 A A ATP BINDING PROTEIN-D65V GSMDYKDDDDKKTNWLKRIYRVRPCVKCKVAPRNWKVKNKHLRIYNMCKTCFNNSIDIGDDTYHGHVDWLMYADSKEISNT 81 T 0.013 ZZ pdbpssm F T 3lta 1 A A ATP BINDING PROTEIN-DX GSMDYKDDDDKKTNWLKRIYRVRPCVKCKVAPRDWKVKNKHLRIFNMCKTCFNNSIDIGDDTYHGHVDWLMYADSKEISNT 81 T 0.025 Glyco_tranf_2_4 pdbpssm F T 3ltb 1 A A ATP BINDING PROTEIN-DX GSMDYKDDDDKKTNWLKRIYRVRPCVKCKVAPRDWKVKNKHLRIYNMCKTCFNNSIDIGDDTYHGHVDWLMYADSKEISNT 81 T 0.025 Glyco_tranf_2_4 pdbpssm F T 3ltc 1 A A ATP BINDING PROTEIN-DX GSMDYKDDDDKKTNWLKRIYRVRPCVKCKVAPRDWKVKNKHLRIYNMCKTCFNNSIDIGDDTYHGHVDWLMYADSKEISNT 81 T 0.025 Glyco_tranf_2_4 pdbpssm F T 3ltd 1 A A ATP BINDING PROTEIN-DX GSMDYKDDDDKKTNWLKRIYRVRPCVKCKVAPRDWKVKNKHLRIYNMCKTCFNNSIDIGDDTYHGHVDWLMYADSKEISNT 81 T 0.025 Glyco_tranf_2_4 pdbpssm F T 3lu9 3 C,F C,F PAR1_HUMAN PAR-1, THROMBIN RECEPTOR, COAGULATION FACTOR II RECEPTOR ATNATLDPRSFLLRNPNDKYEPFWE 25 T 4.4 DUF5848 pdbhh F Eukaryota T 3luo 2 B B Suc-Ala-Leu-Pro-Phe-pNA XALPAX 6 T 460 SelP_C pdbhh F F 3lv3 3 C C CAC1D_HUMAN 9-meric peptide from Voltage-dependent L-type calcium channel subunit alpha-1D SRRWRRWNR 9 T 0.58 DUF2396 pdbhh F Eukaryota F 3lw1 2 B P P53_HUMAN TUMOR SUPPRESSOR P53, PHOSPHOPROTEIN P53, ANTIGEN NY-CO-13 FKTEGPDSD 9 T 54 DoxA pdbhh F Eukaryota T 3lw5 14 N R CHAIN R XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 53 F F F 3m17 3 I,J,K,L I,J,K,L monomeric peptide inhibitor XRFXTGHFGXXYPCX 15 T 1.8 DUF5617 pdbhh F T 3m1b 3 I,J I,J DIMERIC PEPTIDE INHIBITOR XRFXTGHFGXXYPCKX 16 T 2.1 DUF5617 pdbhh F T 3m48 1 A A GCN4_YEAST AMINO ACID BIOSYNTHESIS REGULATORY PROTEIN RMAQLEAKVEELLSKNWNLENEVARLKKLVGER 33 T 0.00052 bZIP_1 pdbpercent F Eukaryota T 3m4c 2 E E HEME-PEPTIDE FRAGMENT KTTCNACHQ 9 T 1.9E-05 Cytochrom_C_2 pdbhh F T 3m50 2 B P Q42932_NICPL N.plumbaginifolia H+-translocating ATPase mRNA RRELHTLKGHVEAVVKLKGLDIETIQQSYDI 31 T 18 DUF1990 pdbhh F Eukaryota T 3m51 2 B P Q42932_NICPL N.plumbaginifolia H+-translocating ATPase mRNA RRELHTLKGHVEAVVKLKGLDIETIQQSYDI 31 T 18 DUF1990 pdbhh F Eukaryota T 3m53 2 B B TAF10_HUMAN TAF10 peptide XSKSKDRKYTL 11 T 0.0084 TFIID-31kDa unphh F Eukaryota T 3m54 2 B B TAF10_HUMAN TAF10 peptide XSKSKDRKYTL 11 T 0.0084 TFIID-31kDa unphh F Eukaryota T 3m55 2 B B TAF10_HUMAN TAF10 peptide XSKSKDRKYTL 11 T 0.0084 TFIID-31kDa unphh F Eukaryota T 3m56 2 B B TAF10_HUMAN TAF10-K189me2 PEPTIDE XSKSKDRKYTL 11 T 0.0084 TFIID-31kDa unphh F Eukaryota T 3m57 2 B B TAF10_HUMAN TAF10 peptide XSKSKDRKYTL 11 T 0.0084 TFIID-31kDa unphh F Eukaryota T 3m58 2 B B TAF10_HUMAN TAF10-K189me1 Peptide XSKSKDRKYTL 11 T 0.0084 TFIID-31kDa unphh F Eukaryota T 3m59 2 B B TAF10_HUMAN TAF10-K189me2 Peptide XSKSKDRKYTL 11 T 0.0084 TFIID-31kDa unphh F Eukaryota T 3m5a 2 B B TAF10_HUMAN TAF10-K189me3 Peptide XSKSKDRKYTL 11 T 0.0084 TFIID-31kDa unphh F Eukaryota T 3m5n 2 E,F,G F,G,H B3TKQ3_9HEPC SECTTPC peptide SECTTPC 7 T 2.4 SCIFF pdbhh T Viruses F 3m61 2 B P upain-1 W3A CSARGLENHRMC 12 T 7.9 LRRNT pdbhh F T 3m7u 2 B B SELF-PROTEOLYSIS PRODUCT (RESIDUES 184-187) LQPI 4 T 87 Phage_G pdbhh F F 3m8f 1 A,B A,B Q8KNP2_BACTI Putative DNA-binding protein MGSSHHHHHHSSGLVPRGSHMNRDHFYTLNIAEIAERIGNDDCAYQVLMAFINENGEAQMLNKTAVAEMIQLSKPTVFATVNWFYCAGYIDETRVGRSKIYTLSDLGVEIVECFKQKAMEMRNL 124 T 1.1E-05 DUF3116 pdbhh F Bacteria T 3m9c 1 A L NADH-quinone oxidoreductase subunit NuoL XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 474 F F F 3m9c 2 B M NADH-quinone oxidoreductase subunit NuoM XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 391 F F F 3m9c 3 C N NADH-quinone oxidoreductase subunit NuoN XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 378 F F F 3m9c 4 D R NADH-quinone oxidoreductase subunits NuoA,J and K XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 281 F F F 3m9s 9 I,V L,O NADH DEHYDROGENASE I CHAIN 12, NDH-1 SUBUNIT 12 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 469 F F F 3m9s 10 J,W M,P NADH-quinone oxidoreductase subunit 13 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 392 F F F 3m9s 11 K,X N,Q NADH-quinone oxidoreductase subunit 14 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 379 F F F 3m9s 12 L,Y R,S NADH-quinone oxidoreductase subunits 7, 10 and 11 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 274 F F F 3m9s 13 M,Z H,T NADH-quinone oxidoreductase subunit 8 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 181 F F F 3ma3 2 B B CONSENSUS PIM1 SUBSTRATE PEPTIDE KRRRHPS 7 T 6.5 HOCHOB pdbhh F F 3mat 2 B I bestatin-based inhibitor (3R)-amino-(2S)-hydroxyheptanoyl-l-Ala-l-Leu-l-Val-l-Phe-OMe XALVX 5 T 1400 EF-hand_5 pdbhh F F 3mbu 1 A,B,C,D A,B,C,D Bipyridine-PNA XXXXXXXXXK 10 T 4200 EF-hand_5 pdbhh F F 3md4 1 A,B A,B PRIO_HUMAN PRP GYMLGS 6 T 0.03 Pectate_lyase_3 unp F Eukaryota F 3md5 1 A,B A,B PRIO_HUMAN Major prion protein GYVLGS 6 T 0.03 Pectate_lyase_3 unp F Eukaryota F 3mg9 2 B,C B,C TEICOPLANIN AGLYCONE XXXXXXX 7 T 730 PBCV_basic_adap pdbhh F F 3mgb 2 C,D C,D TEICOPLANIN AGLYCONE XXXXXXX 7 T 730 PBCV_basic_adap pdbhh F F 3mgn 2 G,H,I,J,K,L G,H,I,J,K,L D-PEPTIDE INHIBITOR PIE71 XXGXXXXXXXXXXXXXX 17 T 0.43 Ly49 pdbhh F F 3mh7 2 B,C B,C 5-mer peptide XXXXX 5 F F F 3mhp 2 C C TIC62_PEA TIC62_peptide KTEQPLSPYTAYDDLKPPSSPSPTKP 26 T 4.1 LEA_6 pdbhh F Eukaryota T 3mhr 2 B P YAP1_HUMAN YAP phosphopeptide RAHSSPASLQ 10 T 0.00014 FAM181 unp F Eukaryota T 3mjh 2 B,D B,D EEA1_HUMAN ENDOSOME-ASSOCIATED PROTEIN P162, ZINC FINGER FYVE DOMAIN-CONTAINING PROTEIN 2 SSSEGFICPQCMKSLGSADELFKHYEAVHDAGND 34 T 0.00027 ATG14 unp F Eukaryota T 3mk7 4 M,N,O,P U,X,Y,Z 30-mer peptide XXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 30 F F F 3ml4 2 E,F,G,H E,F,G,H MUSK_MOUSE MUSCLE-SPECIFIC TYROSINE-PROTEIN KINASE RECEPTOR, MUSCLE-SPECIFIC KINASE RECEPTOR, MUSK LDRLHPNPMXQRM 13 T 3.4 ETC_C1_NDUFA5 pdbhh F Eukaryota T 3mls 3 C,F,I,L P,Q,R,S Rationally designed V3 mimotope ACQAFYASSPRKSIHIGACA 20 T 1.1 Peptidase_U57 pdbhh F T 3mlu 3 C P A0A0K0KAD3_9HIV1 HIV-1 gp120 third variable region (V3) crown NNTRKSIRIGPGQAFYATGGIIG 23 T 1.9E-05 GP120 pdbhh T Viruses T 3mmg 2 C,D C,D POLG_TVMV Nuclear inclusion protein B fragment ETVRFQSD 8 T 1.2 CzcE pdbhh T Viruses T 3mmv 2 B X Protein spire XXXXXXXXXXXXXXXXXXX 19 F F F 3mmy 2 B,D,F,H B,D,F,H NUP98_HUMAN NUCLEAR PORE COMPLEX PROTEIN NUP98, NUCLEOPORIN NUP98, 98 KDA NUCLEOPORIN TGTTIKFNPPTGTDTMVKAGVSTNISTKHQCITAMKEYESKSLEELRLEDYQANRK 56 T 0.06 DUF5023 pdbpssm F Eukaryota T 3mn6 2 D,E,F X,Y,Z Protein spire XXXXXXXXXXXXXXXXXXX 19 F F F 3mn7 2 B S SPIR_DROME Spire DDD PSPREQLMESIRKGKELKQSRPPLKKASDRQLGPPRMCEPSPREQLMESIRKGKELKQSRPPLKKASDRQLGPPRMCEPSPREQLMESIRKGKELKQA 98 T 0.0014 WH2 pdbpssm F Eukaryota T 3mn9 2 B X Protein spire XXXXXXXXXXXXXXXXXXX 19 F F F 3mpj 2 G Y Octapeptide KGHHHHHH 8 T 6800 zf_CCCH_4 pdbhh F F 3mpn 1 A A O67854_AQUAE Transporter REHWATRLGLILAMAGNAVGLGNFLRFPVQAAENGGGAFMIPYIIAFLLVGIPLMWIEWAMGRYGGAQGHGTTPAIFYLLWRNRFAKILGVFGLWIPLVVAIYYVYIESWTLGFAIKFLVGLVPEPPPNATDPDSILRPFKEFLYSYIGVPKGDEPILKPSLFAYIVFLITMCINVSILIRGISKGIERFAKIAMPTLFILAVFLVIRVFLLETPNGTAADGLNFLWTPDFEKLKDPGVWIAAVGQIFFTLSLGFGAIITYASYVRKDQDIVLSGLTAATLNEKAEVILGGSISIPAAVAFFGVANAVAIAKAGAFNLGFITLPAIFSQTAGGTFLGFLWFFLLFFAGLTSSIAIMQPMIAFLEDELKLSRKHAVLWTAAIVFFSAHLVMFLNKSLDEMDFWAGTIGVVFFGLTELIIFFWIFGADKAWEEINRGGIIKVPRIYYYVMRYITPAFLAVLLVVWAREYIPKIMEETHWTVWITRFYIIGLFLFLTFLVFLAERRRNHE 507 T 7.399999999999999E-33 SNF unppercent F Bacteria T 3mpq 1 A A O67854_AQUAE Transporter KREHWATRLGLILAMAGNAVGLGNFLRFPVQAAENGGGAFMIPYIIAFLLVGIPLMWIEWAMGRYGGAQGHGTTPAIFYLLWRNRFAKILGVFGLWIPLVVAIYYVYIESWTLGFAIKFLVGLVPEPPPNATDPDSILRPFKEFLYSYIGVPKGDEPILKPSLFAYIVFLITMFINVSILIRGISKGIERFAKIAMPTLFCLAVFLVIRVFLLETPNGTAADGLNFLWTPDFEKLKDPGVWIAAVGQIFFTLSLGFGAIITYASYVRKDQDIVLSGLTAATLNEKAEVILGGSISIPAAVAFFGVANAVAIAKAGAFNLGFITLPAIFSQTAGGTFLGFLWFFLLFFAGLTSSIAIMQPMIAFLEDELKLSRKHAVLWTAAIVFFSAHLVMFLNKSLDEMDFWAGTIGVVFFGLTELIIFFWIFGADKAWEEINRGGIIKVPRIYYYVMRYITPAFLAVLLVVWAREYIPKIMEETHWTVWITRFYIIGLFLFLTFLVFLAERRRNH 507 T 7.399999999999999E-33 SNF unppercent F Bacteria T 3mqr 2 B B MDM4_HUMAN HdmX Peptide LDLAHSSESQ 10 T 0.93 DUF6143 unppercent F Eukaryota T 3mqs 2 B D MDM2_HUMAN Hdm2 peptide YSQPSTSSSI 10 T 11 UBA2_C pdbhh F Eukaryota T 3mr9 3 C P PP65_HCMVM 9-meric peptide from Tegument protein pp65 NLVPAVATV 9 T 15 5-FTHF_cyc-lig pdbhh T Viruses T 3mrb 3 C P PP65_HCMVM 9-meric peptide from Tegument protein pp65 NLVPMVHTV 9 T 9.9 ExbD pdbhh T Viruses T 3mrc 3 C P PP65_HCMVA 9-meric peptide from Tegument protein pp65 NLVPMCATV 9 T 1.8 STAT1_TAZ2bind pdbhh T Viruses T 3mrd 3 C P PP65_HCMVA 9-meric peptide from Tegument protein pp65 NLVPMGATV 9 T 0.3 APS-reductase_C pdbhh T Viruses T 3mt6 2 AB,BB,CA,CB,DA,DB,EA,FA,GA,HA,IA,JA,KA,LA,MA,NA,OA,PA,QA,RA,SA,TA,UA,VA,WA,XA,YA,ZA z,3,1,4,2,u,c,d,e,f,g,h,i,j,k,l,m,n,o,p,q,r,s,t,v,w,x,y ACYLDEPSIPEPTIDE 1 XFSPXAX 7 T 45 Acp26Ab pdbhh F F 3mv5 2 B C GSK3B_HUMAN GSK3-beta peptide GRPRTTSFAE 10 T 6.3 DUF3084 pdbhh F Eukaryota T 3mv7 3 C C EBNA1_EBVB9 HPVG peptide from Epstein-Barr nuclear antigen 1 HPVGEADYFEY 11 T 6.4 Sel_put pdbhh T Viruses T 3mv8 3 C C EBNA1_EBVB9 HPVG peptide from Epstein-Barr nuclear antigen 1 HPVGEADYFEY 11 T 6.4 Sel_put pdbhh T Viruses T 3mv9 3 C C EBNA1_EBVB9 HPVG peptide from Epstein-Barr nuclear antigen 1 HPVGEADYFEY 11 T 6.4 Sel_put pdbhh T Viruses T 3mvh 2 B B GSK3-beta peptide GRPRTTSFAE 10 T 6.3 DUF3084 pdbhh F T 3n00 2 B B NCOR1_HUMAN N-COR1, N-COR THRLITLADHICQIITQDFAR 21 T 6.8 Es2 pdbhh F Eukaryota T 3n5e 3 C D Synthetic peptide LR LSCQLYQR 8 T 2 SgrT pdbhh F T 3n7y 2 D,E,F D,E,F 20-membered peptide-like macrocyclic ligand XVNVX 5 T 81 HopA1 pdbhh F F 3n84 2 G,H,I,J,K,L G,H,I,J,K,L 23-membered peptide-like macrocyclic ligand XVNVPX 6 T 1 LAX pdbhh F F 3n8m 2 B B PEPTIDE XXVNVX 6 T 7.1 DUF4692 pdbhh F F 3na2 1 A,B,C,D A,B,C,D Uncharacterized protein MGSSHHHHHHSSGRENLYFQGHVEPGVTDRIGQMILEMFRTGMCLFSVRSPGGVAELYGGEARKVEITGTSLTIEREDWHLHCKLETVETVVFDLSPKDNGGIRMAVVFRDKHQAPVLRAAWLPRLMPETPSPPEQFWAFTQRYIDLPMVVDARNRQLVFPGSGQGGFTEGS 172 T 2.9E-05 HemS pdbhh F T 3naz 2 E,F E,F PEPTIDE XXXXXX 6 F F F 3nco 2 C D peptide (ALA)(ASN)(GLU) ANE 3 T 460 DUF167 pdbhh F F 3nco 3 D E peptide (ALA)(ASP)(GLN) ADQ 3 T 380 Peptidase_A8 pdbhh F F 3nf3 2 B C JTH-NB72-39 inhibitor RRFXAMLA 8 T 6 DUF2052 pdbhh F T 3nfk 2 C,D C,D GLYCO_RABVE Glycoprotein G SWESHKSGGETRL 13 T 3.5 DUF5052 unphh T Viruses T 3ngy 2 E E his tag sequence SHHHHH 6 T 6100 zf_CCCH_4 pdbhh F F 3nhc 1 A,B A,B PRIO_HUMAN PRP, PRP27-30, PRP33-35C, ASCR GYMLGS 6 T 0.03 Pectate_lyase_3 unp F Eukaryota F 3nhd 1 A,B A,B PRIO_HUMAN PRP, PRP27-30, PRP33-35C, ASCR GYVLGS 6 T 0.03 Pectate_lyase_3 unp F Eukaryota F 3ni3 1 A,B,C,D,E,F,G,H,I,J,K,L A,B,C,D,E,F,G,H,I,J,K,L 54-membered ring macrocyclic beta-sheet peptide XTYFTYXSXXKX 12 T 4.5 EAV_GP5 pdbhh F T 3nih 2 B B Peptide RIAAA RIAAA 5 T 160 IFRD pdbhh F F 3nii 2 B B Peptide KIAA KIAA 4 T 360 S_layer_N pdbhh F F 3nij 2 B B Peptide HIAA HIAA 4 T 210 HypF_C pdbhh F F 3nik 2 E X Peptide REAA REAA 4 T 180 Ebola_NP pdbhh F F 3nil 2 E X Peptide RDAA RDAA 4 T 250 DUF2418 pdbhh F F 3nim 2 E X Peptide RRAA RRAA 4 T 220 DNA_alkylation pdbhh F F 3nin 2 C,D D,E Peptide RLGES RLGES 5 T 120 CsrA pdbhh F F 3njw 1 A A Bicyclic peptide BI-32169 GLPWGCPSDIPGWNTPWAC 19 T 0.94 CIMR pdbhh F T 3nk3 2 C,D C,D ZP3_CHICK Zona pellucida 3 AFAADAGKEVAADVVIGPVLLSADHHHHHH 30 T 7.2 Psg1 unphh F Eukaryota T 3nk4 2 C,D C,D ZP3_CHICK Zona pellucida 3 AFAADAGKEVAADVVIGPVLLSADHHHHHH 30 T 7.2 Psg1 unphh F Eukaryota T 3nkx 2 C,D P,Q RAF1_HUMAN PROTO-ONCOGENE C-RAF, CRAF, RAF-1 QRSTSTPNVH 10 T 58 ALC pdbhh F Eukaryota T 3nmx 2 D,E,F D,E,F ARHG4_HUMAN APC-STIMULATED GUANINE NUCLEOTIDE EXCHANGE FACTOR, ASEF SSSHHYSHPGGGGEQLAINELISDG 25 T 3.6 CDC24 unppercent F Eukaryota T 3noh 1 A A A7B039_RUMGV putative peptide binding protein GVTGATPKAKKAAQSSAQLEGSYIFCMNPLLDKLSDEDIREQLKAFVTGKTDSIRTDTELSFDIYVSETDYALIRYADSLCERLNDAGADVQIKQYSGTMLRSRAVSGKYEAFLSESDLVSTDALENADYIILDSAEMR 139 T 0.0019 SBP_bac_5 pdbhh F Bacteria T 3nqi 1 A,B,C,D A,B,C,D A0A380YR22_BACFN Putative lipoprotein GMDSGESGPQQWAGVVKVNDRMGYVTFTDAAGTELIPTNTIPVTLNARMAYIYCQVDEGQDLSTNPKSIKITLLADPTGIDATAITTPKVGESGDVTTNAPVGSLSFVSGYSTVAPFQFSENTIVLPVLYRVKNVTTTEDIKNELAKHTFTLVCYTDDIKSGDTILKLYLRYKVEDEPAAIAERATRTSSFKAYEISQILREYTLKSGQTKPAKITIVAQQNEYNNKLEDTSTIEKVYEIEYKTAE 246 T 0.00042 NigD_C pdbhh F Bacteria T 3nsw 1 A,B,C,D,E,F,G A,B,C,D,E,F,G Q6R7N7_9BILA Excretory-secretory protein 2 GSHMEYCPKMLSEIRQEDINDVETVAYVTVTGKTARSYNLQYWRLYDVPKTAPSQWPSFGTLRDDCGNIQLTADTDYVLGCKSGNQDCFVKLHDGLSQKEKDLLKE 106 T 0.098 Augurin unppssm F Eukaryota T 3nth 2 B C AUB_DROME AUB[R13(ME2S)] ARGXGR 6 T 20 Tristanin_u2 unphh F Eukaryota F 3nti 2 B C AUB_DROME AUB[R15(ME2S)] NPVIARGRGXGRK 13 T 0.023 Tristanin_u2 pdbhh F Eukaryota T 3ny3 2 B B N-degron RIFS 4 T 65 eIF3m_C_helix pdbhh F F 3nzj 15 CA,DA 3,4 TMC-95A mimic ligand 2a XXAWX 5 T 130 DUF6446 pdbhh F F 3nzw 15 CA,DA 3,4 TMC-95A mimic ligand 2b XXAXX 5 T 130 DUF6446 pdbhh F F 3nzx 15 CA,DA 3,4 TMC-95A mimic ligand 2c XXAXX 5 T 130 DUF6446 pdbhh F F 3o0e 2 G,H,I,J,K,L L,M,N,O,P,Q CEA9_ECOLX Colicin-E9 SGGDGRGHNTGAHSTSG 17 T 10 Spore_II_R pdbhh F Bacteria T 3o17 2 B,D F,G JIP1_MOUSE JNK-INTERACTING PROTEIN 1, JIP-1, JNK MAP KINASE SCAFFOLD PROTEIN 1 PKRPTTLNLF 10 T 6.8 Lipoprotein_19 pdbhh F Eukaryota T 3o2b 2 B,D B,D Phe N-end rule peptide FRSKGEELFT 10 T 14 DNAP_B_exo_N pdbhh F T 3o2h 2 B B DPS_ECOLI PEXB, VTM LVKSKATNLLY 11 T 0.1 7TMR-HDED unppercent F Bacteria T 3o2i 1 A,B,C A,B,C Uncharacterized protein MGSSHHHHHHSSGRENLYFQGMRGDDMHIYELVSRDRTHPVRIYLLHSEYWTEDEFYNLLLEAFQRSSASDWHLQILEVSKYLVTAHGFVEAGGLQEIGFPGELSKTEVRRRINAFLGKDRSDGS 125 T 0.6 EndoU_bacteria pdbpssm F T 3o2m 2 B,D F,G JIP1_MOUSE JNK-INTERACTING PROTEIN 1, JIP-1, JNK MAP KINASE SCAFFOLD PROTEIN 1 PKRPTTLNLF 10 T 6.8 Lipoprotein_19 pdbhh F Eukaryota T 3o2q 3 E F RNA polymerase II CTD Serine-5 phosphopeptide PTSPSY 6 T 3.9 Disulph_isomer pdbhh F F 3o36 2 C,D D,E H4_HUMAN H4(14-19)K16AC HISTONE PEPTIDE GAXRHR 6 T 11 Shadoo unppercent F Eukaryota F 3o3a 3 C,F C,F Peptidomimetic ELA-1 XLAXXLTV 8 T 810 Pox_M2 pdbhh F F 3o3b 3 C,F C,F Peptidomimetic ELA-1.1 ELAXXLTV 8 T 49 RasGEF pdbhh F T 3o3d 3 C,F C,F Peptidomimetic ELA-2 ELAXXLTV 8 T 49 RasGEF pdbhh F T 3o3e 3 C,F C,F Peptidomimetic ELA-2.1 XLAXXLTV 8 T 810 Pox_M2 pdbhh F F 3o8i 2 B B RAF1_HUMAN PROTO-ONCOGENE C-RAF, CRAF, RAF-1 QRSTSTPNVH 10 T 58 ALC pdbhh F Eukaryota T 3oa6 4 G G H4 peptide monomethylated at lysine 20 GLGKGGAKRHRKVLRDNIQGITKY 24 T 18 DUF1938 pdbhh F T 3oa8 1 A,C,E A,C,E SOXA_STAND SoxA MRRFAAGCLALALLVLPFVLTGARAAEDESEKEIERYRQMIEDPMANPGFLNVDRGEVLWSEPRGTRNVSLETCDLGEGPGKLEGAYAHLPRYFADTGKVMDLEQRLLWCMETIQGRDTKPLVAKPFSGPGRTSDMEDLVAFIANKSDGVKIKVALATPQEKEMYAIGEALFFRRSSINDFSCSTCHGAAGKRIRLQALPQLDVPGKDAQLTMATWPTYRVSQSALRTMQHRMWDCYRQMRMPAPDYASEAVTALTLYLTKQAEGGELKVPSIKR 275 T 1.3E-05 Dehyd-heme_bind pdbhh F Bacteria T 3oak 2 C,D C,D SPT6_YEAST CHROMATIN ELONGATION FACTOR SPT6 DPFTHMSDKIDEMYDIFGDGHDYDWALEIEN 31 T 3.3 SPT6_acidic unppssm F Eukaryota T 3ob1 1 A A SPY2_HUMAN SPRY-2 IRNTNEXTEGPT 12 T 3.3 KAR9 unp F Eukaryota T 3ob2 1 A A EGFR_HUMAN RECEPTOR TYROSINE-PROTEIN KINASE ERBB-1, PROTO-ONCOGENE C-ERBB-1 DSFLQRXSSDPT 12 T 0.91 DUF4348 pdbhh F Eukaryota T 3obq 2 B B HGS_HUMAN PROTEIN PP110, HRS PTPSAPVPL 9 T 5.4 AKAP28 pdbhh F Eukaryota T 3ocb 2 C,D C,D GSK 3 beta peptide GRPRTTSFAE 10 T 6.3 DUF3084 pdbhh F T 3od5 2 B,D C,D peptide aldehyde inhibitor AC-VEID-CHO XVEIX 5 T 100 Ig_5 pdbhh F F 3odi 2 B,D,F,H,J,L,N,P,R,T B,D,F,H,J,L,N,P,R,T E-ISA247, ISATX-247, LUVENIQ XXXXXXXXVXA 11 T 8 IncD pdbhh F F 3odl 2 B,D,F,H,J,L,N,P,R,T B,D,F,H,J,L,N,P,R,T Z-ISA247, ISATX-247, LUVENIQ XXXXXXXXVXA 11 T 8 IncD pdbhh F F 3oe0 2 B I Polyphemusin analog, CXC chemokine receptor antagonist RRXCYQKXPYRXCRGX 16 T 4.7 Cytomega_TRL10 pdbhh F T 3oiq 2 B B DPOA_YEAST DNA polymerase alpha catalytic subunit A SPLKLQSRKLRYANDVQDLLDDVENSPVVATKRQNV 36 T 0.53 Wap1 pdbhh F Eukaryota T 3oka 2 C,D C,D N-terminal His-affinity tag MGHHHHHHHHHHSSGHIEGRH 21 T 9500 zf_CCCH_4 pdbhh F T 3okr 2 E F Heptamer peptide XXXXXXX 7 F F F 3olr 2 E,F,G,H E,F,G,H SKAP2 YGEEXDDLY 9 T 12 Rox3 pdbhh F T 3omc 2 C,D C,D SYNTHETIC PEPTIDE TGXARARA 8 T 2.2 Ribosomal_L4 pdbhh F F 3omg 2 C,D C,D dimethylated arginine peptide R14me2s RGRAXGQE 8 T 37 Aim21 pdbhh F T 3omh 2 E,F,G,H E,F,G,H SKAP2_HUMAN SRC FAMILY-ASSOCIATED PHOSPHOPROTEIN 2, SRC KINASE-ASSOCIATED PHOSPHOPROTEIN 55-RELATED PROTEIN, SKAP55 HOMOLOG, SKAP-55HOM, SKAP-HOM, SRC-ASSOCIATED ADAPTER PROTEIN WITH PH AND SH3 DOMAINS, PYK2/RAFTK-ASSOCIATED PROTEIN, RETINOIC ACID-INDUCED PROTEIN 70 DGEEXDDPF 9 T 8 S100PBPR pdbhh F Eukaryota T 3oo3 1 A A Q6ZZI7_ACTTI P450 MONOOXYGENASE MALPLPHQRLRLDPVPEFEELQKAGPLHEYDTEPGMDGRKQWLVTGHDEVRAILADHERFSSMRPVDDEADRALLPGILQAYDPPDHTRLRRTVAPAYSARRMERLRPRIEEIVEECLDDFESVGAPVDFVRHAAWPIPAYIACEFLGVPRDDQAELSRMIRESRESRLPRQRTLSGLGIVNYTKRLTSGKRRDPGDGMIGVIVREHGAEISDEELAGLAEGNLIMAAEQMAAQLAVAVLLLVTHPDQMALLREKPELIDSATEEVLRHASIVEAPAPRVALADVRMAGRDIHAGDVLTCSMLATNRAPGDRFDITREKATHMAFGHGIHHCIGAPLARLQLRVALPAVVGRFPSLRLAVPEEDLRFKPGRPAPFAVEELPLEW 384 T 1.5E-26 p450 pdbpercent F Bacteria T 3op0 2 C,D C,D EGFR_HUMAN RECEPTOR TYROSINE-PROTEIN KINASE ERBB-1, PROTO-ONCOGENE C-ERBB-1 LQRXSSDPTGA 11 T 11 RNase_Y_N pdbhh F Eukaryota T 3opy 3 I,J,K,L I,J,K,L PFKA3_PICPA 6-phosphofructo-1-kinase gamma-subunit MVTKDSIIRDLERENVGPEFGEFLNTLQTDLNSEKPPIEQVKSQLETHFNLAHETQEFSRKNDNAPVDKLLTNYYNNYEVNVLEFVLQMGFSRDLSIPLNVWFVLDMISQLSTSKQDLPLDYYLVLNNSQTGKYSDFVRYLIYEAVGAEIHCFEQGSMPEQYRSSRWEDKVKGPALANRGPIRGNVGAGDRKITFHLLCKKTARMILVGDDRETDFEMSDRSFVTLLLDYYQRVGTTKKIDLLLLTNNFDTNMNNKLQQLKILESLNMLKSNCYVLDYQITVDQVTANFNSYVEGIPAFRRHEIANFLKKRKTPKNADELIFKYVGRWNICYQKKFHQGNISIHQISGYLD 351 T 0.11 DNA_III_psi pdbpercent F Eukaryota T 3oq5 2 D,E D,E P53_HUMAN Cellular tumor antigen p53 TSRHKKLMFK 10 T 21 DUF420 pdbhh F Eukaryota T 3oqg 1 A,B A,B Q9KJ88_HELPX Hpy188I MGHHHHHHEFMAKRKSDIILKSVDDLKDEIDYKDFEYKEYFNLLCELVPNNSLEKLEINAIDEKNMKNEGLVYVFVIQGKIFKIGHSITPITKRVQSYNCGKVEYRKNGTCSTTNYFVLQSLLKINKIVQVYAFFPEQPTYTLFGKTYQDSFSTSKRAENVILENFIKNHNKKPIGCTQT 180 T 0.059 MUG113 pdbpssm F Bacteria T 3or3 1 A,B A,B Q9KJ88_HELPX RESTRICTION ENDONUCLEASE HPY188I MGHHHHHHEFMAKRKSDIILKSVDDLKDEIDYKDFEYKEYFNLLCELVPNNSLEKLEINAIDEKNMKNEGLVYVFVIQGKIFKIGHSITPITKRVQSYNCGKVEYRKNGTCSTTNYFVLQSLLKINKIVQVYAFFPEQPTYTLFGKTYQDSFSTSKRAENVILENFIKNHNKKPIGCTQT 180 T 0.059 MUG113 pdbpssm F Bacteria T 3os5 2 B B Dnmt1 TPRRSKSA 8 T 1.7 DUF4808 pdbhh F T 3ots 2 C P POL_HV1Y2 MA/CA substrate peptide QNYPIVQ 7 T 43 Rep-A_N pdbhh T Viruses T 3ou0 2 B B pentapeptide XXXXX 5 F F F 3ou0 3 C C heptapeptide XXXXXXX 7 F F F 3ou1 3 C P POL_HV1Y2 RH/IN substrate peptide KVLFLDG 7 T 0.016 Spermine_synt_N pdbhh T Viruses T 3ou3 2 C C POL_HV1Y2 PR/RT substrate peptide LNFPISP 7 T 0.6 Peptidase_A2B unphh T Viruses T 3ou4 3 C C POL_HV1Y2 TF/PR substrate peptide FNFPQIT 7 T 20 DUF1810 pdbhh T Viruses T 3oua 2 C P POL_HV1Y2 p1/p6 substrate peptide GNFLQSR 7 T 0.41 HypA unp T Viruses T 3oub 2 C P POL_HV1Y2 NC/p1 substrate peptide QVNFLGK 7 T 3.8 HypA unppercent T Viruses T 3ouc 2 C P POL_HV1Y2 p2/NC substrate peptide TIMMQRG 7 T 0.41 HypA unp T Viruses T 3oud 3 C P POL_HV1Y2 CA/p2 substrate peptide RVLFEAM 7 T 0.41 HypA unp T Viruses T 3ov1 2 B B PYAC3CN XXXNX 5 T 950 SIT pdbhh F F 3ove 2 B B PYAC7CN XXXNX 5 T 950 SIT pdbhh F F 3ow4 2 C,D C,D GSK3B_HUMAN GSK 3 beta peptide GRPRTTSFAE 10 T 6.3 DUF3084 pdbhh F Eukaryota T 3owr 1 A,B,C,D A,B,C,D Q5L7M9_BACFN uncharacterized hypothetical protein GSKEDLPAYEEAEITKVGAYHRFYSGDKDAITGENIVAEKELDRTNNIDSEHGVATAVFTIPAAGGKFTEAERAKVSLSNLVVYVNVSTAARVTPLDGSPKFGVPADWTREHKYSVMAADGTKKIWTVKVTLNK 134 T 0.0012 DUF5018 pdbpercent F Bacteria T 3owt 2 C C SIR3_YEAST SILENT INFORMATION REGULATOR 3 SEKGNAKMIDFATLSKLKKKYQIILDR 27 T 0.16 IPK pdbhh F Eukaryota T 3ox7 2 B P MH027 MGSADGACSWRGLENHAMCGAAG 23 T 3.2 DUF2632 pdbhh F T 3oxi 2 B J JIP1_HUMAN Mitogen-activated protein kinase 8 interacting protein 1 PKRPTTLNLF 10 T 6.8 Lipoprotein_19 pdbhh F Eukaryota T 3oy5 2 B P MH027 MGSADGACSWRGLENHAMCGAAG 23 T 3.2 DUF2632 pdbhh F T 3oy6 2 B P MH036 MGSADGACSWRGLENHRMCGAAG 23 T 5 DUF2632 pdbhh F T 3oyp 3 E,F E,F Peptidomimetic inhibitor XXXXX 5 T 1400 zf-CCHC_2 pdbhh F F 3p1n 2 B P KCNK9_HUMAN TASK-3 PEPTIDE KRRKSV 6 T 26 DUF4739 pdbhh F Eukaryota F 3p1o 2 B P KCNK9_HUMAN TASK-3 PEPTIDE KRRKSV 6 T 26 DUF4739 pdbhh F Eukaryota F 3p1p 2 B P KCNK9_HUMAN TASK-3 PEPTIDE KRRKSV 6 T 26 DUF4739 pdbhh F Eukaryota F 3p1q 2 B P KCNK9_HUMAN TASK-3 PEPTIDE KRRKSV 6 T 26 DUF4739 pdbhh F Eukaryota F 3p1r 2 B P KCNK9_HUMAN TASK-3 PEPTIDE KRRKSV 6 T 26 DUF4739 pdbhh F Eukaryota F 3p1s 2 B P KCNK9_HUMAN TASK-3 PEPTIDE KRRKSV 6 T 26 DUF4739 pdbhh F Eukaryota F 3p2z 2 B B phosphopeptide XPLHSTAX 8 T 130 Dip pdbhh F T 3p34 2 B B phosphopeptide XMQSTPLX 8 T 160 DUF5792 pdbhh F T 3p35 2 D,E D,E phosphopeptide XMQSSPLX 8 T 49 LEA_6 pdbhh F T 3p36 2 B B phosphopeptide XDPPLHSTAX 10 T 32 IML1 pdbhh F T 3p37 2 D,E,F E,D,F phosphopeptide XFDPPLHSTAX 11 T 17 FAF pdbhh F T 3p46 1 A,B,C A,B,C Synthetic collagen peptide XGPPGPPGLPGEAGPPGPPX 20 T 0.0035 Collagen pdbpssm F T 3p4f 2 B B RBBP5_HUMAN RBBP-5, RETINOBLASTOMA-BINDING PROTEIN RBQ-3 EDEEVDVTSVY 11 T 0.014 DUF2457 unppercent F Eukaryota T 3p4f 3 C C KMT2A_HUMAN ZINC FINGER PROTEIN HRX, ALL-1, TRITHORAX-LIKE PROTEIN, LYSINE N-METHYLTRANSFERASE 2A HGAARAEVHL 10 T 1.1 N-SET unphh F Eukaryota T 3p4k 2 B P MAP kinase 14 AADLRISCNSK 11 T 7.6 mRNA_triPase pdbhh F T 3p4u 3 E,F E,F Ac-VEID-CHO inhibitor XVEIX 5 T 650 DUF72 pdbhh F F 3p69 1 A,B A,B Q5L7M9_BACFN Uncharacterized protein GSKEDLPAYEEAEITKVGAYHRFYSGDKDAITGENIVAEKELDRTNNIDSEHGVATAVFTIPAAGGKFTEAERAKVSLSNLVVYVNVSTAARVTPLDGSPKFGVPADWTREHKYSVMAADGTKKIWTVKVTLNK 134 T 0.0012 DUF5018 pdbpercent F Bacteria T 3p6z 3 C,F C,I FA5_HUMAN ACTIVATED PROTEIN C COFACTOR AHHHHHHVGTWENLYFQSIPDDDEDSYEIFEPPESTVMATRKMHDRLEPEDEESDADYDYQNRLAAALGIR 71 T 0.022 2OG-FeII_Oxy_5 pdb F Eukaryota T 3p70 3 I,J,K,L M,N,O,P FA5_HUMAN ACTIVATED PROTEIN C COFACTOR AHHHHHHVGTWENLYFQSIPDDDEDSYEIFEPPESTVMATRKMHDRLEPEDEESDADYDYQNRLAAALGIR 71 T 0.022 2OG-FeII_Oxy_5 pdb F Eukaryota T 3p72 2 B B OS1 peptide CTERMALHNLC 11 T 8.1 PHC2_SAM_assoc pdbhh F T 3p7o 2 B B active site bound peptide XXXXXXXX 8 F F F 3p7o 3 C C distal site bound peptide XXXXXXX 7 F F F 3p87 2 B,D,F,H,J,L G,H,I,J,K,L RNH2B_HUMAN RNASE H2 SUBUNIT B, AICARDI-GOUTIERES SYNDROME 2 PROTEIN, AGS2, DELETED IN LYMPHOCYTIC LEUKEMIA 8, RIBONUCLEASE HI SUBUNIT B DKSGMKSIDTFFGVKNKKKIGKV 23 T 1.4 Mif2_N pdbhh F Eukaryota T 3p8f 2 B I SFTI1_HELAN SFTI-1 GRCTKSIPPICFPD 14 T 0.0029 Bowman-Birk_leg pdb F Eukaryota T 3p9g 2 B B Q9YP46_9HIV1 Gag polyprotein XPEXTAPPEEX 11 T 130 CM2 pdbhh T Viruses F 3p9h 2 B B Q9YP46_9HIV1 Gag polyprotein XPEXTAPPEEX 11 T 130 CM2 pdbhh T Viruses F 3p9y 2 E,F,G,H E,F,G,H pSer5 CTD peptide XTSPSYX 7 T 0.015 RNA_pol_Rpb1_R pdbhh F F 3pa7 2 C D 4-mer Peptide ALPF ALPF 4 T 62 Piezo_RRas_bdg pdbhh F F 3pbj 1 A,B,C,D,E,F A,B,C,D,E,F COIL SER L9L-Pen L23H XEWEALEKKXAALESKLQALEKKHEALEHGX 31 T 0.0034 DUF5320 pdbhh F T 3pbp 1 A,D,G,J A,D,G,J NUP82_YEAST NUCLEAR PORE PROTEIN NUP82 MSQSSRLSALPIFQASLSASQSPRYIFSSQNGTRIVFIQDNIIRWYNVLTDSLYHSLNFSRHLVLDDTFHVISSTSGDLLCLFNDNEIFVMEVPWGYSNVEDVSIQDAFQIFHYSIDEEEVGPKSSIKKVLFHPKSYRDSCIVVLKEDDTITMFDILNSQEKPIVLNKPNNSFGLDARVNDITDLEFSKDGLTLYCLNTTEGGDIFAFYPFLPSVLLLNEKDLNLILNKSLVMYESLDSTTDVIVKRNVIKQLQFVSKLHENWNSRFGKVDIQKEYRLAKVQGPFTINPFPGELYDYTATNIATILIDNGQNEIVCVSFDDGSLILLFKDLEMSMSWDVDNYVYNNSLVLIERVKLQREIKSLITLPEQLGKLYVISDNIIQQVNFMSWASTLSKSINESDLNPLAGLKFESKLEDIATIERIPNLAYINWNDQSNLALMSNKTLTFQNISS 452 T 4.9E-12 Nup88 pdbpercent F Eukaryota T 3pbp 3 C,F,I,L C,F,I,L NU159_YEAST NUCLEAR PORE PROTEIN NUP159 SSITKDMKGFKVVEVGLAMNTKKQIGDFFKNLNMAK 36 T 7.2 DUF1413 pdbhh F Eukaryota T 3pcx 2 B B Inhibitor Ac-DEVD-CMK XDEVDX 6 T 200 ResIII pdbhh F F 3pd0 2 B B INHIBITOR AC-DEVD-CMK XDEVDX 6 T 200 ResIII pdbhh F F 3pd1 2 B B Inhibitor Ac-DEVD-CMK XDEVDX 6 T 200 ResIII pdbhh F F 3pdh 2 B D P53_HUMAN ANTIGEN NY-CO-13, PHOSPHOPROTEIN P53, TUMOR SUPPRESSOR P53 KKGQSTSRHKXLMFKTEG 18 T 33 Class_IIIsignal pdbhh F Eukaryota T 3pe4 2 B,D B,D CSK21_HUMAN CK II ALPHA YPGGSTPVSSANMM 14 T 13 Pr_beta_C pdbhh F Eukaryota T 3pes 1 A,B A,B A9J573_BPPYU Uncharacterized protein gp49 SNAMLAEFEDRVAGIPCLIVVTYWEPYVPAKVSGPPEYCYPAEGGCGEWEVRDRRGRPAPWLERKLTEAERERIDQAVFDRMEGR 85 T 5.2 Sulfotransfer_1 pdbhh T Viruses T 3pf6 1 A,B,C,D A,B,C,D C8ZKC7_9CAUD hypothetical protein PP-LUZ7_gp033 GMSQFQEVRPVAQALYPTHPSTKDALEEARLLFPGGTHHDFMRALMGYHNTLVKVMEEQCGS 62 T 0.006 Arabinose_Iso_C pdbpssm T Viruses T 3pfv 2 C,D C,D EGFR_HUMAN PROTO-ONCOGENE C-ERBB-1, RECEPTOR TYROSINE-PROTEIN KINASE ERBB-1 LQRXSSDPTGA 11 T 11 RNase_Y_N pdbhh F Eukaryota T 3pgm 1 A,B A,B PMG1_YEAST Phosphoglycerate mutase 1 PKLVLVRHGQSEWNEKNLFTGWVDVKLSAKGQQEAARAGELLKEKGVNVLVDYTSKLSRAIQTANIALEKADRLWIPVNRSWRLNERHYGDLQGKDKAQTLKKFGEEKFNTYRRSFDVPPPPIDASSPFSQKGDERYKYVDPNVLPETESLALVIDRLLPYWQDVIAKLVGKTSMIAAHGNSLRGLVKHLEGISDADIAKLNIPPGTILVFELDENLKPSKPSYYLDPEAAAAGAAAVANQGKK 244 T 5E-07 His_Phos_1 pdb F Eukaryota T 3pkn 2 B B LARP4_HUMAN LA RIBONUCLEOPROTEIN DOMAIN FAMILY MEMBER 4 TGLNPNAKVWQEIA 14 T 0.013 PAM2 pdbhh F Eukaryota T 3plf 1 A,C A,C MetRD peptide NESVRXDATFP 11 T 17 2-thiour_desulf pdbhh F T 3pmp 2 C,D C,D CYCLOSPORIN A XXXXXXXXVXA 11 T 9.7 DUF6090 pdbhh F F 3poa 2 B B synthetic phosphopeptide DTAPTEKIAYKK 12 T 7.1 DUF1299 pdbhh F T 3pod 1 A,B,C A,B,C MBL collagen-like peptide XGPPGPPGPPGKLGPPGPPGPPGPPX 26 T 0.00028 Collagen pdbpssm F F 3pqr 2 B B GNAT1_BOVIN GALPHA SUBUNIT OF TRANSDUCIN, TRANSDUCIN ALPHA-1 CHAIN ILENLKDVGLF 11 T 2.1 Phage_holin_4_1 pdbhh F Eukaryota T 3pqz 2 E,F L,M cyclic peptide WFEGYDNTFPX 11 T 0.77 RestrictionMunI pdbhh F T 3prk 2 B I METHOXYSUCCINYL-ALA-ALA-PRO-ALA-CHLOROMETHYL KETONE XAAPXX 6 T 950 A_amylase_inhib pdbhh F F 3psl 2 C,D C,D N-alpha acetylated form of histone H3 XARTKQ 6 T 380 SLBP_RNA_bind pdbhh F T 3ptg 2 B J JIP1_HUMAN JIP-1, JNK-INTERACTING PROTEIN 1, ISLET-BRAIN 1, IB-1, JNK MAP KINASE SCAFFOLD PROTEIN 1, MITOGEN-ACTIVATED PROTEIN KINASE 8-INTERACTING PROTEIN 1 RPKRPTTLNLF 11 T 12 CTP-dep_RFKase pdbhh F Eukaryota T 3pth 2 B B LAR4B_HUMAN LA RIBONUCLEOPROTEIN DOMAIN FAMILY MEMBER 4B, LA RIBONUCLEOPROTEIN DOMAIN FAMILY MEMBER 5, LA-RELATED PROTEIN 5 ELNPNAEVWGAPVLH 15 T 0.84 TT_ORF2 unppercent F Eukaryota T 3puj 2 B,D C,D STX4_MOUSE Syntaxin-4 N-terminal peptide MRDRTHELRQ 10 T 0.13 Syntaxin-5_N pdbhh F Eukaryota T 3puk 2 C,D C,D STX4_MOUSE Syntaxin-4 N-terminal peptide MRDRTHELRQ 10 T 0.13 Syntaxin-5_N pdbhh F Eukaryota T 3pv3 2 E,F,G,H E,F,G,H Substrate peptide (Poly-Ala) XXXXXXXXXXXXXXXXXXXX 20 F F F 3pvl 2 B B USH1G_HUMAN SCAFFOLD PROTEIN CONTAINING ANKYRIN REPEATS AND SAM DOMAIN SEVSTDSGHDSLFTRPGLGTMVFRRNYLSSGLHGLGREDGGLDGVGAPRGRLQSSPSLDDDSLGSANSLQDRSCGEELPWDELDLGLDEDLEPETS 96 T 2.1 DUF452 unphh F Eukaryota T 3pwj 3 C,F C,F HuD (G2L,I9V) peptide LLYGFVNYV 9 T 7.6 OMS28_porin pdbhh F T 3pwl 3 C,F C,F HuD peptide LGYGFVNYI 9 T 3.1 NapE pdbhh F T 3pwn 3 C,F C,F HuD (G2L) peptide LLYGFVNYI 9 T 9.1 OMS28_porin pdbhh F T 3pwp 3 C C HuD peptide LGYGFVNYI 9 T 3.1 NapE pdbhh F T 3pxe 2 E,F,G,H E,F,G,H phospho peptide SRSTSPTFNK 10 T 1.5 DUF782 pdbhh F T 3q0a 1 A,C A,B RPOLV_BPN4 Virion RNA polymerase MGGSHHHHHHRSESTVTEELKEGIDAVYPSLVGTADSKAEGIKNYFKLSFTLPEEQKSRTVGSEAPLKDVAQALSSRARYELFTEKETANPAFNGEVIKRYKELMEHGEGIADILRSRLAKFLNTKDVGKRFAQGTEANRWVGGKLLNIVEQDGDTFKYNEQLLQTAVLAGLQWRLTATSNTAIKDAKDVAAITGIDQALLPEGLVEQFDTGMTLTEAVSSLAQKIESYWGLSRNPNAPLGYTKGIPTAMAAEILAAFVESTDVVENIVDMSEIDPDNKKTIGLYTITELDSFDPINSFPTAIEEAVLVNPTEKMFFGDDIPPVANTQLRNPAVRNTPEQKAALKAEQATEFYVHTPMVQFYETLGKDRILELMGAGTLNKELLNDNHAKSLEGKNRSVEDSYNQLFSVIEQVRAQSEDISTVPIHYAYNMTRVGRMQMLGKYNPQSAKLVREAILPTKATLDLSNQNNEDFSAFQLGLAQALDIKVHTMTREVMSDELTKLLEGNLKPAIDMMVEFNTTGSLPENAVDVLNTALGDRKSFVALMALMEYSRYLVAEDKSAFVTPLYVEADGVTNGPINAMMLMTGGLFTPDWIRNIAKGGLFIGSPNKTMNEHRSTADNNDLYQASTNALMESLGKLRSNYASNMPIQSQIDSLLSLMDLFLPDINLGENGALELKRGIAKNPLTITIYGSGARGIAGKLVSSVTDAIYERMSDVLKARAKDPNISAAMAMFGKQAASEAHAEELLARFLKDMETLTSTVPVKRKGVLELQSTGTGAKGKINPKTYTIKGEQLKALQENMLHFFVEPLRNGITQTVGESLVYSTEQLQKATQIQSVVLEDMFKQRVQEKLAEKAKDPTWKKGDFLTQKELNDIQASLNNLAPMIETGSQTFYIAGSENAEVANQVLATNLDDRMRVPMSIYAPAQAGVAGIPFMTIGTGDGMMMQTLSTMKGAPKNTLKIFDGMNIGLNDITDASRKANEAVYTSWQGNPIKNVYESYAKFMKNVDFSKLSPEALEAIGKSALEYDQRENATVDDIANAASLIERNLRNIALGVDIRHKVLDKVNLSIDQMAAVGAPYQNNGKIDLSNMTPEQQADELNKLFREELEARKQKVAKAR 1118 T 0.0038 RNA_pol pdbhh T Viruses T 3q1i 2 B E TCRG1_HUMAN PEPTIDE FROM TCERG1 XFMPPPMSSMX 11 T 1.2 FAF pdbhh F Eukaryota F 3q22 1 A,C A,B RPOLV_BPN4 Virion RNA polymerase MGGSHHHHHHRSESTVTEELKEGIDAVYPSLVGTADSKAEGIKNYFKLSFTLPEEQKSRTVGSEAPLKDVAQALSSRARYELFTEKETANPAFNGEVIKRYKELMEHGEGIADILRSRLAKFLNTKDVGKRFAQGTEANRWVGGKLLNIVEQDGDTFKYNEQLLQTAVLAGLQWRLTATSNTAIKDAKDVAAITGIDQALLPEGLVEQFDTGMTLTEAVSSLAQKIESYWGLSRNPNAPLGYTKGIPTAMAAEILAAFVESTDVVENIVDMSEIDPDNKKTIGLYTITELDSFDPINSFPTAIEEAVLVNPTEKMFFGDDIPPVANTQLRNPAVRNTPEQKAALKAEQATEFYVHTPMVQFYETLGKDRILELMGAGTLNKELLNDNHAKSLEGKNRSVEDSYNQLFSVIEQVRAQSEDISTVPIHYAYNMTRVGRMQMLGKYNPQSAKLVREAILPTKATLDLSNQNNEDFSAFQLGLAQALDIKVHTMTREVMSDELTKLLEGNLKPAIDMMVEFNTTGSLPENAVDVLNTALGDRKSFVALMALMEYSRYLVAEDKSAFVTPLYVEADGVTNGPINAMMLMTGGLFTPDWIRNIAKGGLFIGSPNKTMNEHRSTADNNDLYQASTNALMESLGKLRSNYASNMPIQSQIDSLLSLMDLFLPDINLGENGALELKRGIAKNPLTITIYGSGARGIAGKLVSSVTDAIYERMSDVLKARAKDPNISAAMAMFGKQAASEAHAEELLARFLKDMETLTSTVPVKRKGVLELQSTGTGAKGKINPKTYTIKGEQLKALQENMLHFFVEPLRNGITQTVGESLVYSTEQLQKATQIQSVVLEDMFKQRVQEKLAEKAKDPTWKKGDFLTQKELNDIQASLNNLAPMIETGSQTFYIAGSENAEVANQVLATNLDDRMRVPMSIYAPAQAGVAGIPFMTIGTGDGMMMQTLSTMKGAPKNTLKIFDGMNIGLNDITDASRKANEAVYTSWQGNPIKNVYESYAKFMKNVDFSKLSPEALEAIGKSALEYDQRENATVDDIANAASLIERNLRNIALGVDIRHKVLDKVNLSIDQMAAVGAPYQNNGKIDLSNMTPEQQADELNKLFREELEARKQKVAKAR 1118 T 0.0038 RNA_pol pdbhh T Viruses T 3q23 1 A,C A,B RPOLV_BPN4 Virion RNA polymerase MGGSHHHHHHRSESTVTEELKEGIDAVYPSLVGTADSKAEGIKNYFKLSFTLPEEQKSRTVGSEAPLKDVAQALSSRARYELFTEKETANPAFNGEVIKRYKELMEHGEGIADILRSRLAKFLNTKDVGKRFAQGTEANRWVGGKLLNIVEQDGDTFKYNEQLLQTAVLAGLQWRLTATSNTAIKDAKDVAAITGIDQALLPEGLVEQFDTGMTLTEAVSSLAQKIESYWGLSRNPNAPLGYTKGIPTAMAAEILAAFVESTDVVENIVDMSEIDPDNKKTIGLYTITELDSFDPINSFPTAIEEAVLVNPTEKMFFGDDIPPVANTQLRNPAVRNTPEQKAALKAEQATEFYVHTPMVQFYETLGKDRILELMGAGTLNKELLNDNHAKSLEGKNRSVEDSYNQLFSVIEQVRAQSEDISTVPIHYAYNMTRVGRMQMLGKYNPQSAKLVREAILPTKATLDLSNQNNEDFSAFQLGLAQALDIKVHTMTREVMSDELTKLLEGNLKPAIDMMVEFNTTGSLPENAVDVLNTALGDRKSFVALMALMEYSRYLVAEDKSAFVTPLYVEADGVTNGPINAMMLMTGGLFTPDWIRNIAKGGLFIGSPNKTMNEHRSTADNNDLYQASTNALMESLGKLRSNYASNMPIQSQIDSLLSLMDLFLPDINLGENGALELKRGIAKNPLTITIYGSGARGIAGKLVSSVTDAIYERMSDVLKARAKDPNISAAMAMFGKQAASEAHAEELLARFLKDMETLTSTVPVKRKGVLELQSTGTGAKGKINPKTYTIKGEQLKALQENMLHFFVEPLRNGITQTVGESLVYSTEQLQKATQIQSVVLEDMFKQRVQEKLAEKAKDPTWKKGDFLTQKELNDIQASLNNLAPMIETGSQTFYIAGSENAEVANQVLATNLDDRMRVPMSIYAPAQAGVAGIPFMTIGTGDGMMMQTLSTMKGAPKNTLKIFDGMNIGLNDITDASRKANEAVYTSWQGNPIKNVYESYAKFMKNVDFSKLSPEALEAIGKSALEYDQRENATVDDIANAASLIERNLRNIALGVDIRHKVLDKVNLSIDQMAAVGAPYQNNGKIDLSNMTPEQQADELNKLFREELEARKQKVAKAR 1118 T 0.0038 RNA_pol pdbhh T Viruses T 3q24 1 A,B A,B RPOLV_BPN4 Virion RNA polymerase MGGSHHHHHHRSESTVTEELKEGIDAVYPSLVGTADSKAEGIKNYFKLSFTLPEEQKSRTVGSEAPLKDVAQALSSRARYELFTEKETANPAFNGEVIKRYKELMEHGEGIADILRSRLAKFLNTKDVGKRFAQGTEANRWVGGKLLNIVEQDGDTFKYNEQLLQTAVLAGLQWRLTATSNTAIKDAKDVAAITGIDQALLPEGLVEQFDTGMTLTEAVSSLAQKIESYWGLSRNPNAPLGYTKGIPTAMAAEILAAFVESTDVVENIVDMSEIDPDNKKTIGLYTITELDSFDPINSFPTAIEEAVLVNPTEKMFFGDDIPPVANTQLRNPAVRNTPEQKAALKAEQATEFYVHTPMVQFYETLGKDRILELMGAGTLNKELLNDNHAKSLEGKNRSVEDSYNQLFSVIEQVRAQSEDISTVPIHYAYNMTRVGRMQMLGKYNPQSAKLVREAILPTKATLDLSNQNNEDFSAFQLGLAQALDIKVHTMTREVMSDELTKLLEGNLKPAIDMMVEFNTTGSLPENAVDVLNTALGDRKSFVALMALMEYSRYLVAEDKSAFVTPLYVEADGVTNGPINAMMLMTGGLFTPDWIRNIAKGGLFIGSPNKTMNEHRSTADNNDLYQASTNALMESLGKLRSNYASNMPIQSQIDSLLSLMDLFLPDINLGENGALELKRGIAKNPLTITIYGSGARGIAGKLVSSVTDAIYERMSDVLKARAKDPNISAAMAMFGKQAASEAHAEELLARFLKDMETLTSTVPVKRKGVLELQSTGTGAKGKINPKTYTIKGEQLKALQENMLHFFVEPLRNGITQTVGESLVYSTEQLQKATQIQSVVLEDMFKQRVQEKLAEKAKDPTWKKGDFLTQKELNDIQASLNNLAPMIETGSQTFYIAGSENAEVANQVLATNLDDRMRVPMSIYAPAQAGVAGIPFMTIGTGDGMMMQTLSTMKGAPKNTLKIFDGMNIGLNDITDASRKANEAVYTSWQGNPIKNVYESYAKFMKNVDFSKLSPEALEAIGKSALEYDQRENATVDDIANAASLIERNLRNIALGVDIRHKVLDKVNLSIDQMAAVGAPYQNNGKIDLSNMTPEQQADELNKLFREELEARKQKVAKA 1117 T 0.0039 RNA_pol pdbhh T Viruses T 3q47 2 B C SMAD1_HUMAN Smad1 peptide SPHNPISDVD 10 T 0.84 DUF2733 pdbhh F Eukaryota T 3q49 2 B C HS71A_HUMAN Hsp70-C peptide GPTIEEVD 8 T 8.1 DUF4028 pdbhh F Eukaryota T 3q4a 2 B C SMAD1_HUMAN Smad1 peptide SPHNPISSVS 10 T 2.6 DUF4943 pdbhh F Eukaryota T 3q4j 2 G,H,I,J,K H,I,J,K,L peptide ligand XQLDLF 6 T 2.4 DUF6248 pdbhh F F 3q4k 2 C,D C,D peptide ligand XQXDLX 6 T 81 Zn_peptidase pdbhh F F 3q4l 2 C,D C,D peptide ligand XQXDLX 6 T 81 Zn_peptidase pdbhh F F 3q6s 2 E,F E,F SGO1_HUMAN HSGO1, SEROLOGICALLY DEFINED BREAST CANCER ANTIGEN NY-BR-85 TNVSLYPVVKIRRLSLSPK 19 T 64 PUB_1 pdbhh F Eukaryota T 3q75 3 C G Hexapeptide TKCVVM TKCVVM 6 T 3.2 Plk4_PB2 pdbhh F F 3q78 3 C D peptide substrate DDPTASACNIQ 11 T 22 MTCP1 pdbhh F T 3q79 3 C P isoprenylated product DDPTASACNIQ 11 T 22 MTCP1 pdbhh F T 3q8d 2 C,D E,F SSB_ECOLI Single-stranded DNA-binding protein YMDFDDDIPF 10 T 0.22 Phage_SSB pdbhh F Bacteria T 3q9g 1 A A Cyclic pseudo-peptide VQIV(4BF)(ORN)(HAO)KL(ORN) VQIVXXXKLX 10 T 40 DUF6332 pdbhh F T 3q9h 1 A,B,C,D,E,F,G,H A,B,C,D,E,F,G,H Cyclic pseudo-peptide LVFFA(ORN)(HAO)LK(ORN) LVFFAXXLKX 10 T 8.7 DUF5347 pdbhh F T 3q9i 1 A,B,C,D,E,F,G,H A,B,C,D,E,F,G,H Cyclic pseudo-peptide LV(4BF)FA(ORN)(HAO)LK(ORN) LVXFAXXLKX 10 T 10 DUF5347 pdbhh F T 3q9j 1 A,B,C,D,E,F,G,H A,B,C,D,E,F,G,H Cyclic pseudo-peptide AIIFL(ORN)(HAO)YK(ORN) AIIFLXXYKX 10 T 21 Corona_NS3b pdbhh F T 3qby 2 D H H4_HUMAN H4K20me3 Histone H4 Peptide AKRHRKVLRDN 11 T 0.27 UPF0137 unp F Eukaryota T 3qdr 2 B B CEA_CITFR Colicin-A GSKPGDSYNTPWGKVIINAAGQPTMNGTVMTADNSSMVPYGRGFTRVLNSLVNNPVSHHHHHH 63 T 0.069 DUF6162 pdb F Bacteria T 3qdz 3 E,F E,F PAR4_HUMAN PAR-4, COAGULATION FACTOR II RECEPTOR-LIKE 3, THROMBIN RECEPTOR-LIKE 3 TPSILPAPR 9 T 2.6 Abhydrolase_9_N pdbhh F Eukaryota T 3qe0 2 D,E G,F KB752 peptide SRVTWYDFLMEDTKSR 16 T 2.1 DUF2760 pdbhh F T 3qf9 2 B B pimtide ARKRRRHPSGPPTA 14 T 1.5 DUF3019 pdbhh F T 3qfj 3 C C TAX(Y5F) peptide LLFGFPVYV 9 T 0.35 YvrJ pdbhh F T 3qg6 3 E,F C,D Q9F6Z3_STAAU Agr autoinducing peptide YSTCYFIM 8 T 0.7 Ly49 pdbhh F Bacteria T 3qgj 2 B,D B,D Ac-AlaAlaPro-Alanal peptide XAAPX 5 T 970 Pep_deformylase pdbhh F F 3qgl 2 B,D,F,H,J F,G,H,I,J KCNJ9_RAT GIRK-3, INWARD RECTIFIER K(+) CHANNEL KIR3.3, POTASSIUM CHANNEL, INWARDLY RECTIFYING SUBFAMILY J MEMBER 9 ESESKV 6 T 22 HCV_NS5a_C pdbhh F Eukaryota F 3qhr 3 E,F,G,H J,K,L,M CDK2 substrate peptide: PKTPKKAKKL PKTPKKAKKL 10 T 3.7 AgrD pdbhh F F 3qhw 3 E,F,G,H J,K,L,M CDK2 substrate peptide: PKTPKKAKKL PKTPKKAKKL 10 T 3.7 AgrD pdbhh F F 3qis 2 B B SESQ1_HUMAN SES1 PFARLHECYGQEI 13 T 8 MEIOC pdbhh F Eukaryota T 3qjm 2 C,D C,D Beta-PIX DETNL 5 T 160 KID pdbhh F F 3qkk 2 B C GSK3B_HUMAN GSK-3 beta peptide GRPRTTSFAE 10 T 6.3 DUF3084 pdbhh F Eukaryota T 3qkl 2 B C GSK-3 beta peptide GRPRTTSFAE 10 T 6.3 DUF3084 pdbhh F T 3qkr 3 C C MRE11_PYRFU MRE11 NUCLEASE, PFMRE11 SDFFTEFELKIIDILGEKDFDDFDYIIKLITEGK 34 T 0.23 PufQ unppssm F Archaea T 3qks 3 C C MRE11_PYRFU MRE11 NUCLEASE, PFMRE11 SDFFTEFELKIIDILGEKDFDDFDYIIKLITEGK 34 T 0.23 PufQ unppssm F Archaea T 3qku 2 C C MRE11_PYRFU MRE11 NUCLEASE, PFMRE11 SDFFTEFELKIIDILGEKDFDDFDYIIKLITEGK 34 T 0.23 PufQ unppssm F Archaea T 3qn7 2 B B UK18 ACSRYEVDCRGRGSACG 17 T 1.5 Toxin_24 pdbhh F T 3qnj 2 C,D C,D antimicrobial peptide oncocin VDKPPYLPRPRPPRXIYNX 19 T 0.14 Apidaecin pdbhh F T 3qnw 3 I,J,K X,Y,Z Z-VAD-FMK XVADX 5 T 1100 RE_HindIII pdbhh F F 3qnz 3 C C TBB5_HUMAN TUBULIN BETA-5 CHAIN TAEEEEDFGE 10 T 20 DUF1639 pdbhh F Eukaryota T 3qo0 3 C C TBB5_HUMAN TUBULIN BETA-5 CHAIN YQQYQDATAEEEEDFGEEAE 20 T 10 Hrs_helical unphh F Eukaryota T 3qo6 2 D D peptide XXXXXXX 7 F F F 3qo6 3 E,H,I E,H,I peptide XXXX 4 F F F 3qo6 4 F F peptide XXXXX 5 F F F 3qo6 5 G G peptide XXX 3 F F F 3qq3 3 C,F C,F NRAM_I96A0 PEPTIDE OF SLA-1*0401-S-OIVNW9 NSDTVGWSW 9 T 2.2 DUF4902 pdbhh T Viruses T 3qw5 2 B B inhibitory peptide RRGF RRGFX 5 T 42 Sdh5 pdbhh F F 3qw6 2 B B inhibitory peptide RYGC RYGCX 5 T 38 zf-ISL3 pdbhh F F 3qw7 2 B B inhibitory peptide RRFC RRFCX 5 T 22 ADK_lid pdbhh F F 3qw8 2 B B inhibitory peptide CRGC CRGCX 5 T 22 Thioredoxin_4 pdbhh F F 3qxy 2 B,D P,Q TF65_HUMAN NUCLEAR FACTOR NF-KAPPA-B P65 SUBUNIT, NUCLEAR FACTOR OF KAPPA LIGHT POLYPEPTIDE GENE ENHANCER IN B-CELLS 3 RKRTYETFKSIMKKS 15 T 1.4 Dynein_attach_N pdbhh F Eukaryota T 3qz0 1 A,B A,B Q73RI0_TREDE Factor H binding protein MAHHHHHHVDDDDKTFKMNTAQKAHYEKFINALENELKTRHIPAGAVIDMLAEINTEALALDYQIVDKKPGTSIAQGTKAAALRKRFIPKKIK 93 T 0.0084 DUF4969 unphh F Bacteria T 3qzs 2 C,D C,D H4_HUMAN Histone H4 KGGAXRHRKV 10 T 11 Shadoo unppercent F Eukaryota T 3qzt 2 B B H4_HUMAN Histone H4 KGGAXRHRKV 10 T 11 Shadoo unppercent F Eukaryota T 3qzv 2 B C H4_HUMAN Histone H4 GKGLGXGGAKR 11 T 11 Shadoo unppercent F Eukaryota T 3r0h 2 B,D,F,H,J,L,N,P a,b,c,d,e,f,g,h NG2 ALRNGQYWV 9 T 0.34 FixS pdbhh F T 3r15 1 A,B A,B Q73RI0_TREDE Factor H binding protein MAHHHHHHVDDDDKTFKMNTAQKAHYEKFINALENELKTRHIPAGAVIDMLAEINTEALALDYQIVDKKPGTSIAQGTKAAALRKRFIPKKIK 93 T 0.0084 DUF4969 unphh F Bacteria T 3r1r 2 B,D,F,G D,E,F,P RIR2_ECOLI RIBONUCLEOTIDE REDUCTASE R2 PROTEIN YLVGQIDSEVDTDDLSNFQL 20 T 12 UL11 pdbhh F Bacteria T 3r29 2 C,D C,D NCOR2_HUMAN SMRT, N-COR2, CTG REPEAT PROTEIN 26, SMAP270, SILENCING MEDIATOR OF RETINOIC ACID AND THYROID HORMONE RECEPTOR, T3 RECEPTOR-ASSOCIATING FACTOR, TRAC, THYROID-RECEPTOR-ASSOCIATED COREPRESSOR, RETINOIC-ACID-RECEPTOR-ASSOCIATED COREPRESSOR TNMGLEAIIRKALMGK 16 T 3.6 FYTT pdbhh F Eukaryota T 3r2a 2 E,F E,F NCOR2_HUMAN SMRT, N-COR2, CTG REPEAT PROTEIN 26, SMAP270, SILENCING MEDIATOR OF RETINOIC ACID AND THYROID HORMONE RECEPTOR, T3 RECEPTOR-ASSOCIATING FACTOR, TRAC, THYROID-RECEPTOR-ASSOCIATED COREPRESSOR, RETINOIC-ACID-RECEPTOR-ASSOCIATED COREPRESSOR TNMGLEAIIRKALMGK 16 T 3.6 FYTT pdbhh F Eukaryota T 3r42 2 B B VPS27_YEAST VPS27, GOLGI RETENTION DEFECTIVE PROTEIN 11 QVPSDPYNY 9 T 14 DUF3460 pdbhh F Eukaryota T 3r46 1 A,B,C,D,E,F A,B,C,E,F,G coiled coil helix L24D XGELKAIAQELKAIAKELKAIAWEDKAIAQGAGYX 35 T 1.8 DUF5320 pdbhh F T 3r47 1 A,B,C,D,E,F,G,H,I,J,K,L A,B,C,E,F,G,H,I,J,K,L,M coiled coil helix L24H XGELKAIAQELKAIAKELKAIAWEHKAIAQGAGX 34 T 0.95 Rho_N pdb F T 3r48 1 A,E,F A,F,G coiled coil helix W22-L24H XGELKAIAQELKAIAKELKAIAWEHKAIAQGAGX 34 T 0.95 Rho_N pdb F T 3r48 2 B,C,D B,C,E coiled coil helix Y15-L24D XGELKAIAQELKAIAYELKAIAKEDKAIAQGX 32 T 2.1 DUF5660 pdbhh F T 3r4a 1 A,B,C,D A,B,C,D coiled coil helix CC-tet XGELAAIKQELAAIKKELAAIKWELAAIKQGAGX 34 T 0.0033 DUF5320 pdbhh F T 3r4h 1 A,B,C,D,E,F A,B,C,D,E,F coiled coil helix CC-Tet-phi22 XGELAAIKQELAAIKKELAAIKXELAAIKQGAGX 34 T 0.0067 DUF5320 pdbhh F T 3r5j 3 E,F F,E Peptide Inhibitor (ACE)ADVAD-CHO XADVAX 6 T 390 GDE_N_bis pdbhh F F 3r6g 3 E,F F,E Peptide Inhibitor (ACE)VDVAD-CHO XVDVAX 6 T 250 DUF3563 pdbhh F F 3r6l 3 E,F F,E Peptide Inhibitor (ACE)VDVAD-CHO XVDVAX 6 T 250 DUF3563 pdbhh F F 3r7b 3 E F Peptide Inhibitor (ACE)DVAD-CHO XDVAX 5 T 520 Epimerase pdbhh F F 3r7g 2 B B FMN2_HUMAN Formin-2 KSLYKIKPRHDSGIKAKISMKT 22 T 14 DUF6140 pdbhh F Eukaryota T 3r7n 3 E,F F,E Peptide Inhibitor (ACE)DVAD-CHO XDVAX 5 T 520 Epimerase pdbhh F F 3rbq 2 G,H,I,J,K,L G,H,I,J,K,L GNAT1_HUMAN TRANSDUCIN ALPHA-1 CHAIN XGAGASAEEKH 11 T 2.8 DUF4917 pdbhh F Eukaryota T 3rc0 2 B,D P,Q TF65_HUMAN NUCLEAR FACTOR NF-KAPPA-B P65 SUBUNIT, NUCLEAR FACTOR OF KAPPA LIGHT POLYPEPTIDE GENE ENHANCER IN B-CELLS 3 RKRTYETFKSIMKKS 15 T 1.4 Dynein_attach_N pdbhh F Eukaryota T 3rc4 2 B B TCAM1_HUMAN Product TRIF PPPPPPPPSSTPC 13 T 0.098 PLRV_ORF5 pdbhh F Eukaryota F 3rc5 2 B B MAVS_HUMAN Product MAVS XQEREVPC 8 T 3.6 GRA6 unphh F Eukaryota T 3rce 2 B B Substrate Mimic Peptide GDQNATXG 8 T 3.9 S-AdoMet_synt_M pdbhh F T 3rfr 2 B,C D,H peptide AAAAAAAAAAAAAAAAAAA 19 T 410 Adeno_PIX pdbhh F F 3rgv 5 E E peptide WIYVYRPMGCGGS 13 T 0.19 BAALC_N pdbhh F T 3rh3 1 A,B A,B Q8A6H6_BACTN Uncharacterized DUF3829-like protein GQTVSSESTEELDDASKVINYYHMSLAVLRHVANAKDINAVLGYMEQTGKVPEVDPIAPPEIAARDTAELLDPGDYFNPEVRQNLKQNYAGLFNVRTQFYDNFNKFLAYKKSKDTAKTAQLLDENYKLSVELSEYKQVIFDILSPLTEQAESELLADEPLKDQIMAMRKMSGTVQSIMNLYSRKHAMDGVRIDLKMAELEKELKAAEKIPAVTGYDEELKNFQSFLSTVKSFMNDMQKARSKGAYSDKEYQAMSEAYEYGLSVI 264 T 0.0075 PPR_3 pdbpercent F Bacteria T 3rj2 1 A X Q5YFA7_9VIRU ORF158L PROTEIN MGWAIVANCEFVNATGKKTTILVNENWAKYCWIWTYKFPEKYTLLRYSVDGEMFMRHRVTFFNATGRYITHTHLNHGLEDVLEGSLAVPKDAAYARIHAAINVSLTNPGDVHMHYDETEGEQIRSYDAAEFARTLAAV 138 T 0.11 Terminase_6N pdb T Viruses T 3rjm 3 E,F E,F Peptide inhibitor (ACE)VDV(3PX)D-CHO XVDVXX 6 T 920 Acetyltransf_2 pdbhh F F 3rmr 1 A,B,C A,B,C ATR1_HYAAE ATR1 AQTALDDDEERWPFGPSAVEALIETIDRHGRVSLNDEAKMKKVVRTWKKLIERDDLIGEIGKHYFEAPGPLHDTYDEALATRLVTTYSDRGVARAILHTRPSDPLSKKAGQAHRLEEAVASLWKGRGYTSDNVVSSIATGHDVDFFAPTAFTFLVKCVESEDDANNAIFEYFGSNPSRYFSAVLHAMEKPDADSRVLESSKKWMFQCYAQKQFPTPVFERTLAAYQSEDYAIRGARNHYEKLSLSQIEELVEEYSRIYSV 260 T 0.0016 RXLR unphh F Eukaryota T 3ro2 2 B B NUMA1_HUMAN NUMA PROTEIN, SP-H ANTIGEN RNSFYMGTCQDEPEQLDDWNRIAELQQR 28 T 11 FliT pdbhh F Eukaryota T 3rof 2 B B Expression tag cleaved from protein-tyrosine-phosphatase ptpA HHHHHGS 7 T 6500 zf_CCCH_4 pdbhh F F 3rq7 2 B B C6H5(CH2)8-derivatized peptide inhibitor XPLXSTX 7 T 250 Ribosomal_L19 pdbhh F F 3rqd 2 C,D C,D Largazole XGXXV 5 T 25 Tox-PLDMTX pdbhh F F 3rqe 2 E E PAXI_HUMAN Paxillin LD1 peptide DDLDALLADLESTT 14 T 2.2 DUF2525 pdbhh F Eukaryota T 3rqg 2 E E PAXI_HUMAN Paxillin LD4 peptide ATRELDELMASLS 13 T 0.99 SAM_LFY pdbhh F Eukaryota T 3rqr 2 B U (UNK)(UNK)(UNK)(UNK) XXXX 4 F F F 3rrb 2 B B peptide AWLFEA 6 T 17 DUF3950 pdbhh F F 3rre 2 B B peptide AAWLFEA 7 T 0.27 Xin pdbhh F F 3rrf 2 B B peptide AAWLFEA 7 T 0.27 Xin pdbhh F F 3rrj 2 B B peptide AWLFEA 6 T 17 DUF3950 pdbhh F F 3rs8 2 B B Unknown peptide, probably from expression host AWLFEA 6 T 17 DUF3950 pdbhh F F 3rs9 2 B B Unknown peptide, probably from expression host AWLFEA 6 T 17 DUF3950 pdbhh F F 3rse 8 H Z WASL_BOVIN CA fragment of Bos taurus N-WASP EWE 3 T 3.2 Importin_rep_6 unppercent F Eukaryota F 3rsf 2 B B Unknown peptide, probably from expression host AAWLFEA 7 T 0.27 Xin pdbhh F F 3rsg 2 B B Unknown peptide, probably from expression host AWLFEA 6 T 17 DUF3950 pdbhh F F 3rsq 2 B B Unknown peptide, probably from expression host AWLFEA 6 T 17 DUF3950 pdbhh F F 3rss 2 B B Unknown peptide, probably from expression host APAWLFEA 8 T 0.48 Xin pdbhh F T 3rsz 2 E,F E,F Glycogen [starch] synthase isoform 2 XXXXX 5 F F F 3rt7 2 B B Unknown peptide, probably from expression host AAWLFEA 7 T 0.27 Xin pdbhh F F 3rt9 2 B B Unknown peptide, probably from expression host AWLFEA 6 T 17 DUF3950 pdbhh F F 3rta 2 B B Unknown peptide, probably from expression host AAWLFEA 7 T 0.27 Xin pdbhh F F 3rtb 2 B B Unknown peptide, probably from expression host AWLFEA 6 T 17 DUF3950 pdbhh F F 3rtc 2 B B Unknown peptide, probably from expression host APAWLFEA 8 T 0.48 Xin pdbhh F T 3rtd 2 B B Unknown peptide, probably from expression host AAWLFEA 7 T 0.27 Xin pdbhh F F 3rte 2 B B Unknown peptide, probably from expression host PAWLFEA 7 T 2 Ribosomal_L37 pdbhh F T 3rtg 2 B B Unknown peptide, probably from expression host AAWLFEA 7 T 0.27 Xin pdbhh F F 3rtx 2 C C RNA Polymerase II C-terminal domain TSPSYSPTSPSYSPTSPS 18 T 0.0001 RNA_pol_Rpb1_R pdbhh F F 3ru2 2 B B Unknown peptide, probably from expression host AWLFEA 6 T 17 DUF3950 pdbhh F F 3ru3 2 B B Unknown peptide, probably from expression host AWLFEA 6 T 17 DUF3950 pdbhh F F 3ru4 3 C C CTRA_BOVIN CHYMOTRYPSIN A CHAIN A CGVPAIQPVLS 11 T 1.9 SH pdbhh F Eukaryota T 3rul 2 E,F,G,H E,F,G,H Dalbavancin XXXXXXX 7 T 95 Hairy_orange pdbhh F F 3rum 2 B,C B,C Ristocetin XXXXXXX 7 T 0.84 5-FTHF_cyc-lig pdbhh F F 3run 2 B B VANCOMYCIN XXNXXXX 7 T 95 P53_C pdbhh F F 3rya 2 B B Oligopeptide SLSQLSSQS 9 T 29 LEF-9 pdbhh F F 3ryb 2 B B Oligopeptide SLSQSLSQS 9 T 17 LSPR pdbhh F F 3ryl 1 A,B A,B Q87GE5_VIBPA ACTIN FILAMENT POINTED END-BINDING DOMAIN GHMRLLSEDLFKQSPKLSEQELDELANNLADYLFQAADIDWHQVISEKTRGLTTEEMAKSEHRYVQAFCREILKYPDCYKSADVASPESPKSGGGSVIDVALKRLQTGRERLFTTTDEKGNRELKKGDAILESAINAARMAISTEEKNTILSNNVKSATFEVFCELPCMDGFAEQNGKTAFYALRAGFYSAFKNTDTAKQDITKFMKDNLQAGFSGYSYQGLTNRVAQLEAQLAALSAKLS 241 T 0.0032 ABC_tran_CTD unppssm F Bacteria T 3rz2 2 C,D C,D Prl-1 (PTP4A1) GWWSLIPPKYIT 12 T 0.068 RRP14 pdbhh F T 3s04 2 C,D I,J Glyco-Arylomycin XXGXAY 6 T 230 MMACHC pdbhh F F 3s1b 2 B A mini-Z FNKECLLRYKEAALDPNLNLYQRIAKIVSIDDDC 34 T 2.7 TRCF pdbhh F T 3s1t 2 C C AK_MYCTU Aspartokinase EATVYAGTGRL 11 T 3.6 PC_rep pdbhh F Bacteria T 3s3h 2 C C phosphopeptide GP4 KNSFVXQKLSE 11 T 1.6 DUF244 pdbhh F T 3s3j 2 B B Peptide inhibitor XXVPL 5 T 510 FLILHELTA pdbhh F F 3s3p 2 B B Peptide inhibitor XXQPL 5 T 260 Glycoprot_B_PH2 pdbhh F F 3s3s 2 B B peptide inhibitor XXVPL 5 T 510 FLILHELTA pdbhh F F 3s63 1 A,B A,B H2L2M0_NECAM NA-SLP-1 LTPKETCDLCQIALRTVFGHFGGNIPSRRKLVHQLKHECKRHFNYRRRCLLLMKVNSDLIFREMTDGSFKPMEVCLIMRECNPHDSPLEPEMIDKSGQPEAFALVSSSDDNYDTSEE 117 T 0.41 SapB_1 pdbhh F Eukaryota T 3s70 2 B,D B,D aldehyde inhibitor Ac-VEID-CHO XVEIX 5 T 100 Ig_5 pdbhh F F 3s7d 2 B I Monomethylated p53 peptide SSHLKSKKGQSTS 13 T 29 Class_IIIsignal pdbhh F T 3s7f 2 B I p53 peptide SSHLKSKKGQSTS 13 T 29 Class_IIIsignal pdbhh F T 3s8l 2 B B pYAc4cN XXXNX 5 T 950 SIT pdbhh F F 3s8n 2 B B pYAc5cN XXXNX 5 T 950 SIT pdbhh F F 3s8o 2 B B pYAc6cN XXXNX 5 T 430 DUF3673 pdbhh F F 3s9c 2 B B FA5_HUMAN ACTIVATED PROTEIN C COFACTOR, PROACCELERIN, LABILE FACTOR SRDPDNIAAWYLRS 14 T 0.055 PPTA pdb F Eukaryota T 3sbn 1 A,B A,B Trichovirin I-4A XXNLXPAVXPXLXPX 15 T 22 Ribosomal_L11_N pdbhh F T 3sem 2 C,D C,D SH3 PEPTOID INHIBITOR PPPVXPRRR 9 T 33 DUF6131 pdbhh F F 3seo 1 A,B A,B Q87GE5_VIBPA VopL C terminal domain protein GHMRLLSEDLFKQSPKLSEQELDELANNLADYLFQAADIDWHQVISEKTRGLTTEEMAKSEHRYVQAFCREILKYPDCYKSADVASPESPKSGGGSVIDVALKRLQTGRERLFTTTDEKGNRELKKGDAILESAINAARMAISTEEKNTILSNNVKSATFEVFCELPCMDGFAEQNGKTAFYALRAGFYSAFKNTDTAKQDITKFMKDNLQAGFSGYSYQGLTNRVAQLEAQLAALSAKLS 241 T 0.0032 ABC_tran_CTD unppssm F Bacteria T 3sfj 2 B,D B,D decameric peptide iCAL36 ANSRWPTSII 10 T 3.7 C9orf72-like pdbhh F T 3sga 2 B P ACE-PRO-ALA-PRO-PHE-ALDEHYDE XPAPX 5 T 120 DUF2316 pdbhh F F 3sge 3 E,F K,M R13 peptide EEEDDDMGFGLFD 13 T 0.0078 Ribosomal_60s pdb F T 3shv 2 C,D C,D H2AX_HUMAN Histone H2A.x KKATQASQEY 10 T 16 Class_IIIsignal pdbhh F Eukaryota T 3shw 2 B B CXG1_HUMAN CONNEXIN-45, CX45, GAP JUNCTION ALPHA-7 PROTEIN SGDGKTSVWI 10 T 0.11 SKI pdbhh F Eukaryota T 3si5 2 C,D X,Y KNL1_HUMAN ALL1-FUSED GENE FROM CHROMOSOME 15Q14 PROTEIN, AF15Q14, BUB-LINKING KINETOCHORE PROTEIN, BLINKIN, CANCER SUSCEPTIBILITY CANDIDATE GENE 5 PROTEIN, CANCER/TESTIS ANTIGEN 29, CT29, KINETOCHORE-NULL PROTEIN 1, PROTEIN D40/AF15Q14 GPLGSSSENKIDFNDFIKRLKTGK 24 T 0.065 FtsH_ext pdbpssm F Eukaryota T 3sj9 2 B B FAGLRQAVTQ peptide FAGLRQAVTQ 10 T 7.8 TOH_N pdbhh F T 3sjk 2 B B KPVLRTATVQGPSLDF peptide KPVLRTA 7 T 0.92 zf_C2H2_13 pdbhh F T 3sks 1 A A A0A6H3ACK0_BACAN Putative Oligoendopeptidase F SNAMSFKDYEYKRPNIEELKEKFTVALEKFDNAKTVEEQKQVIHSINEIRNDFGTMGNLCYIRHSVDTTDAFYKEEQDFFDEFSPVVQGYGTKYYNALIHSPFREELEAYYGKQLFALAECDLKTYSDEVVKDLQLENKLSSQYTQLLASAKIDFAGEERTLSQLIPFMQGKERSERKAASEAYYGFLAENEEELDRIYDELVKVRTKIAKSLGFKNFVELGYARMYRTDYNAEMVANYRQQVLDYIVPVTTELRKRQQARIGVEKLAYYDENFEFPTGNPTPKGDADWIVNHGKTMYKELSAETDEFFNFMLDNDLLDLVAKKGKAGGGYCTYIENYKAPFIFSNFNGTSGDIDVLTHEAGHAFQVYESRKFEIPEYNWPTYEACEIHSMSMEFFTWPWMKLFFEEDADKYYFSHLSSALLFLPYGVSVDEYQHYVYENPEASPEERKTAWRNIEKKYLPHRDYEDNDYLERGGFWQRQGHIYSSPFYYIDYTLAQICALQFWKRARDNRQEAWEDYVNLCQQGGSKSFLELVEVANLTSPFAEGCVKSVITEIEAWLHAIDDTKL 567 T 0.00047 Peptidase_M3 unppercent F Bacteria T 3sm1 2 C,D J,M Pepstatin A XVVXAX 6 T 1700 FAM60A pdbhh F F 3smk 2 B P KCNK9_HUMAN TASK-3 peptide KRRKSV 6 T 26 DUF4739 pdbhh F Eukaryota F 3sml 2 B P KCNK9_HUMAN TASK-3 peptide RRKSV 5 T 73 SOXp pdbhh F Eukaryota F 3smm 2 B P KCNK9_HUMAN TASK-3 peptide KRRKSV 6 T 26 DUF4739 pdbhh F Eukaryota F 3smn 2 B P KCNK9_HUMAN TASK-3 peptide KRRKSV 6 T 26 DUF4739 pdbhh F Eukaryota F 3smo 2 B P KCNK9_HUMAN TASK-3 peptide KRRKSV 6 T 26 DUF4739 pdbhh F Eukaryota F 3sna 2 B H Peptide aldehyde inhibitor Ac-NSFSQ-H XNSFSX 6 T 95 ChlamPMP_M pdbhh F F 3snb 2 B H Peptide aldehyde inhibitor Ac-DSFDQ-H XDSFDX 6 T 160 Catalase-rel pdbhh F F 3snc 2 B H Peptide aldehyde inhibitor Ac-NSTSQ-H XNSTSX 6 T 400 DUF645 pdbhh F F 3snd 2 C,D C,D Peptide aldehyde inhibitor Ac-ESTLQ-H XESTLX 6 T 450 T3SS_ExsE pdbhh F F 3sne 2 B H Peptide aldehyde inhibitor Ac-ESTLQ-H XESTLX 6 T 450 T3SS_ExsE pdbhh F F 3so6 2 B Q LDLR_HUMAN LDL RECEPTOR NSINFDNPVYQKTT 14 T 3.4 PARM unphh F Eukaryota T 3soq 2 B Z DKK1_HUMAN DICKKOPF-1, DKK-1, HDKK-1, SK XNSNAIKNX 9 T 35 GSH_synthase pdbhh F Eukaryota T 3sp5 2 B P KCNK9_HUMAN TASK-3 peptide KRRKSV 6 T 26 DUF4739 pdbhh F Eukaryota F 3sp6 2 B B PRGC2_HUMAN PGC-1-BETA, PPAR-GAMMA COACTIVATOR 1-BETA, PPARGC-1-BETA, PGC-1-RELATED ESTROGEN RECEPTOR ALPHA COACTIVATOR LSLLQKLLLAT 11 T 18 DUF3014 pdbhh F Eukaryota T 3spa 2 B B Nonamer peptide XXXXXXXXX 9 F F F 3spe 1 A,B A,B Q8SDD3_BPDPK PHIKZ029 LRPEDAANPSRLIVAIEIVEDEIPLTIRRLSGFNYPNSVRDIGNAPVPTTDKVDGLKARIILIEDNTSEVGTQRVLPGTLVSDKDGSQSLVYPLFEAPVSFFGKLGDSNGMRVWSTTTADIEEFDEAAMAKFKTRQFRIQLIEKPEVGTSPVIVKTADQQDYLNITFDKGVYSDMYNADLYVGDVLVDSYSDDGVVSGLSPLYSPFSQFYVYHENIDLVRQMIYDTEMRVNPAAAAHTTAPGEIDFLTFLAVDGDPYQGIQVLGPLDGGITLGKDGNIYASGGTDGTTDLEEYAK 295 T 0.21 N-Term_TEN pdb T Viruses T 3spr 2 B P KCNK9_HUMAN TASK-3 peptide KRRKSV 6 T 26 DUF4739 pdbhh F Eukaryota F 3spv 3 C C BZLF1_EBVB9 EB1, ZEBRA RAKFKQLL 8 T 0.0067 bZIP_2 unppssm T Viruses T 3sqd 2 C,D C,D H2AX_HUMAN H2AX, H2A/X KKATQASQEY 10 T 16 Class_IIIsignal pdbhh F Eukaryota T 3sri 2 B B Q8IKV6_PLAF7 Rhoptry neck protein 2 KDIGAGPVASCFTTRMSPPQQICLNSVVN 29 T 15 Stealth_CR4 pdbhh F Eukaryota T 3srj 2 C,D,E,F C,D,E,F R1 peptide VFAEFLPLFSKFGSRMHILK 20 T 2.6 DUF3898 pdbhh F T 3stj 2 M,N,O,P,Q,R,S,T,U,V,W,X,Y M,N,O,P,Q,R,S,T,U,V,W,X,Z peptide (UNK) XXXXXXX 7 F F F 3sui 2 B B TRPV1_RAT TRPV1, CAPSAICIN RECEPTOR, OSM-9-LIKE TRP CHANNEL 1, OTRPC1, VANILLOID RECEPTOR 1, VANILLOID RECEPTOR TYPE 1-LIKE GPEGVKRTLSFSLRSGRVSGRNWKNFALVPLLRDAST 37 T 0.46 Cgr1 pdbhh F Eukaryota T 3svi 1 A A Type III effector HopAB2 LYTGAVPRANRIVQQLVEAGADLANIRTMFRNMLRGEEMILSRAEQNVFLQHFPDMLPCGIDRNSELAIALREALRRADSQQA 83 T 0.014 Peptidase_C58 pdbpssm F T 3svm 2 B P DNM3A_HUMAN DNMT3A, DNA METHYLTRANSFERASE HSAIIIA, DNA MTASE HSAIIIA, M.HSAIIIA YEPSTTARKVGRPGR 15 T 9.2 AT_hook pdbhh F Eukaryota T 3sw9 2 B,D P,Q DNM3A_MOUSE DNMT3A, DNA METHYLTRANSFERASE MMUIIIA, DNA MTASE MMUIIIA, M.MMUIIIA SATARKVGRPGR 12 T 6 AT_hook pdbhh F Eukaryota T 3swc 2 B,D P,Q DNM3A_MOUSE DNMT3A, DNA METHYLTRANSFERASE MMUIIIA, DNA MTASE MMUIIIA, M.MMUIIIA SATARKVGRPGR 12 T 6 AT_hook pdbhh F Eukaryota T 3sxu 3 C C SSB_ECOLI SSB peptide WDIPF 5 T 22 HTH_44 pdbhh F Bacteria F 3szm 2 I,J,K,L,M,N,O,P I,J,K,L,M,N,O,P H2AX_HUMAN H2A/X KKATQASQEY 10 T 16 Class_IIIsignal pdbhh F Eukaryota T 3t1n 2 C,D C,D CDC27_HUMAN Cdc27 peptide SDEF 4 T 71 DUF2689 pdbhh F Eukaryota F 3t1u 2 B B succinyl-Ala-Phe-Pro-Phe-p-nitroanilide XAFPFX 6 T 90 fn2 pdbhh F F 3t4f 1 A,B,C,D,E,F A,B,C,D,E,F collagen mimetic peptide XPPGPPGPPGPKGEPGPPGPPGPPGX 26 T 0.00013 Collagen pdbpssm F F 3t4g 1 A,B A,B Cyclic pseudo-peptide (ORN)AIIGLMV(ORN)KF(HAO)(4BF)K XAIIGLMVXKFXXK 14 T 0.18 Beta-APP pdbhh F T 3t4p 2 B B tetrapeptide DWSI DWSI 4 T 28 HTH_52 pdbhh F F 3t4r 1 A A VP4A_LNYV3 PROTEIN P, PROTEIN 4A MARIRHEKEKLLADLDWEIGEIAQYTPLIVDFLVPDDILAMAADGLTPELKEKIQNEIIENHIALMALEEYSSLEHHHHHH 81 T 0.11 Pox_C4_C10 pdbpercent T Viruses T 3t5i 2 E,F Q,R C-terminal Farnesylated Rheb peptide CSQQGKSS(CMT) CSQQGKSSC 9 T 8.9 Virul_Fac pdbhh F F 3t64 2 D F 5'-(BENZHYDRYLAMINO)-2',5'-DIDEOXYURIDINE AHA 3 T 350 DUF4258 pdbhh F F 3t6b 2 B,D C,D Tynorphin VVYPW 5 T 22 Self-incomp_S1 pdbhh F F 3t6j 2 B B Tynorphin VVYPW 5 T 22 Self-incomp_S1 pdbhh F F 3t6y 2 D F peptide ALA-HIS-ALA AHA 3 T 350 DUF4258 pdbhh F F 3t70 2 D F PEPTIDE GLY-HIS-GLY GHG 3 T 41 Peptidase_C13 pdbhh F F 3t7k 2 B,D C,D H2A1_YEAST Histone H2A.1 ATKASQEL 8 T 57 POX pdbhh F Eukaryota T 3t7z 1 A A Y694_METJA UNCHARACTERIZED NOP5 FAMILY PROTEIN MJ0694 MIYVTFTPYGAFGVKDNKEVSGLEDIEYKKLFNEEEIPDIMFKLKTQPNKIADELKEEWGDEIKLETLSTEPFNIGEFLRNNLFKVGKELGYFNNYDEFRKKMHYWSTELTKKVIKSYA 119 T 0.013 U3_assoc_6 unppssm F Archaea T 3tax 2 B,D B,D CSK21_HUMAN CK II ALPHA YPGGSTPVSSANMM 14 T 13 Pr_beta_C pdbhh F Eukaryota T 3tbh 2 B B E9BR69_LEIDB Serine acetyl transferase derived octapeptide LERDGSGI 8 T 2.1 DUF2551 pdbhh F Eukaryota T 3tcf 2 I,J,K,L,M,N,O,P I,J,K,L,M,N,O,P Endogenous peptide XXX 3 F F F 3tcg 2 I,J,K,L,M,N,O,P I,J,K,L,M,N,O,P KGE Peptide KGE 3 T 350 zf-C2H2_4 pdbhh F F 3td5 2 I,J,K,L,M,N,O,P I,J,K,L,M,N,O,P peptide(L-Ala-gamma-D-Glu-m-DAP-D-Ala-D-Ala) AXXXX 5 T 230 OAM_dimer pdbhh F F 3tdi 2 C,D C,D UBC12_YEAST RUB1-CONJUGATING ENZYME, RUB1-PROTEIN LIGASE, UBIQUITIN CARRIER PROTEIN 12 XMLKLRQLQKKKQKENENSSSIQPN 25 T 2.5E-10 UFC1 unphh F Eukaryota T 3tdu 3 E,F E,F UBC12_HUMAN NEDD8 CARRIER PROTEIN, NEDD8 PROTEIN LIGASE, UBIQUITIN-CONJUGATING ENZYME E2 M XMIKLFSLKQQKKEEE 16 T 3.3 YpmT pdbhh F Eukaryota T 3tdz 3 E,F E,F UBC12_HUMAN NEDD8-CONJUGATING ENZYME UBC12, NEDD8 CARRIER PROTEIN, NEDD8 PROTEIN LIGASE, UBIQUITIN-CONJUGATING ENZYME E2 M XMIKLXSLKXQKK 13 T 8 FAM53 pdbhh F Eukaryota T 3tei 2 B B KS6A1_HUMAN S6K-ALPHA-1, 90 KDA RIBOSOMAL PROTEIN S6 KINASE 1, P90-RSK 1, P90RSK1, P90S6K, MAP KINASE-ACTIVATED PROTEIN KINASE 1A, MAPK-ACTIVATED PROTEIN KINASE 1A, MAPKAP KINASE 1A, MAPKAPK-1A, RIBOSOMAL S6 KINASE 1, RSK-1 PQLKPIESSILAQRRVRKLPSTTL 24 T 20 Sbi-IV pdbhh F Eukaryota T 3tfk 2 B B p4B10 peptide QLSDVPMDL 9 T 15 KH_9 pdbhh F T 3tfy 2 B,D,F D,E,F HNRPF_HUMAN hnRNP F MLGPEGGRWG 10 F F Eukaryota T 3tg5 2 B B P53_HUMAN ANTIGEN NY-CO-13, PHOSPHOPROTEIN P53, TUMOR SUPPRESSOR P53 HSSHLKSKKGQ 11 T 51 TEX12 pdbhh F Eukaryota T 3thk 2 C,D C,D Proline-rich peptide PPPVPPYSAG 10 T 2.8 GvpL_GvpF pdbhh F T 3ths 2 E,F E,F 5-methyltetrahydrofolate pentaglutamate XXXXXX 6 T 250 Ribosomal_L22e pdbhh F F 3tiw 2 B,D C,D AMFR_HUMAN AUTOCRINE MOTILITY FACTOR RECEPTOR, ISOFORM 2, AMF RECEPTOR, ISOFORM 2, RING FINGER PROTEIN 45, GP78 VTLRRRMLAAAAERRLQKQ 19 T 0.072 SVIP pdbhh F Eukaryota T 3tix 2 B,D B,D CHP1_SCHPO Chromo domain-containing protein 1 MISESEDLSSASTLSDYFRFVLRVGKSLYYAGELSFDISKLKAETEHQQLLRSLVSCKQVDVLRFVTSQYLEVFGTCLTKVLSGSLCIRSDVDMTHFKNILNRGNGAGIVLGSNYTLLLFTEDNNALMNLYDCQGQSNSPFWMVIFEPLESILVEWSAKNLRPKKPYHKSQSYLSYLLQLGHIDLHKIGAFQATQILIVSKQPSPEAEELEDTFREAAIPTFRGLEIPESLFLSQNVFVFLNVSLEDDFDQLQFLTLAKRKSCKFFLFGLSLPLKSPNDSHVGTDFKKNNEPLDKLTYSQYLRPMFPKGGVVSVTLSALIKTPRLLELISPFLEIKKDSWILILPPSIVDMVKSYFVTNNPDKSLLEIQNLLNTLQRYLTNPALKNVTLYQDWDIVIDDSADVSLASTLQLYQKKNYDKYRRFVLIHELKNELTPVNGLDIVDYDEFKETFMRAIGLK 458 T 11 PsiB pdbhh F Eukaryota T 3tj5 2 B B B0BXR4_RICRO Antigenic heat-stable 120 kDa protein GSHMNLLNAATALSGSMQYLLNYVNAG 27 T 3.2 SipA_VBS pdbhh F Bacteria T 3tjh 2 B B p3A1 SPLDSLWWI 9 T 2.2 FWWh pdbhh F T 3tju 2 B B Ac-PTSY-CMK inhibitor XPTSXX 6 T 140 CBP_BcsG pdbhh F F 3tjv 2 B B PTSYAGDDSG PTSYAGDDSG 10 T 8.6 DUF5837 pdbhh F T 3tjw 1 A A VILI_CHICK D-Villin-1 XXXXXXXXXXGXXXXXXXXXXXXXXXXXXXXXGX 34 F F Eukaryota F 3tjy 1 A A HPAB3_PSEYM AVIRULENCE PROTEIN HOPPMAL TGAVPRANRIVQQLVEAGADLANIRTMFRNMLRGEEMILSRAEQNVFLQHFPDMLPCGIDRNSELAIALREALRRADSQQAARAPARTPPRSSV 94 T 0.012 Peptidase_C58 pdbpssm F Bacteria T 3tkn 1 A,D,G A,D,G NUP82_YEAST NUCLEAR PORE PROTEIN NUP82 MSQSSRLSALPIFQASLSASQSPRYIFSSQNGTRIVFIQDNIIRWYNVLTDSLYHSLNFSRHLVLDDTFHVISSTSGDLLCLFNDNEIFVMEVPWGYSNVEDVSIQDAFQIFHYSIDEEEVGPKSSIKKVLFHPKSYRDSCIVVLKEDDTITMFDILNSQEKPIVLNKPNNSFGLDARVNDITDLEFSKDGLTLYCLNTTEGGDIFAFYPFLPSVLLLNEKDLNLILNKSLVMYESLDSTTDVIVKRNVIKQLQFVSKLHENWNSRFGKVDIQKEYRLAKVQGPFTINPFPGELYDYTATNIATILIDNGQNEIVCVSFDDGSLILLFKDLEMSMSWDVDNYVYNNSLVLIERVKLQREIKSLITLPEQLGKLYVISDNIIQQVNFMSWASTLSKSINESDLNPLAGLKFESKLEDIATIERIPNLAYINWNDQSNLALMSNKTLTFQNISS 452 T 4.9E-12 Nup88 pdbpercent F Eukaryota T 3tkn 2 B,E,H B,E,H NU159_YEAST NUCLEAR PORE PROTEIN NUP159 GPHSSITKDMKGFKVVEVGLAMNTKKQIGDFFKNLNMAK 39 T 8.3 DUF1413 pdbhh F Eukaryota T 3tkz 2 B,C P,Q PROTEIN (RVIpYFVPLNR peptide) RVIXFVPLNR 10 T 1.1 DUF6271 pdbhh F T 3tl0 2 B B RLNpYAQLWHR peptide RLNXAQLWHR 10 T 0.44 MOSP_N pdbhh F T 3tmh 3 D,H,J D,H,L AKA10_HUMAN AKAP-10, DUAL SPECIFICITY A KINASE-ANCHORING PROTEIN 2, D-AKAP-2, PROTEIN KINASE A-ANCHORING PROTEIN 10, PRKA10 GSPEFVQGNTDEAQEELAWKIAKMIVSDVMQQAQYDQPLEKSTKL 45 T 0.073 TnpW pdbpssm F Eukaryota T 3to6 2 B B H4_YEAST K16COA BISUBSTRATE INHIBITOR GKGGAKRHRKIL 12 T 4.2 Shadoo unppercent F Eukaryota T 3tod 2 B B TRFL_BOVIN peptide, LEACAF from Lactotransferrin LEACAF 6 T 9.3 T6SS_TssF pdbhh F Eukaryota F 3tpu 4 D,H,L,P J,F,L,R p5E8 peptide FLSPFWFDI 9 T 0.26 T6SS_VasJ pdbhh F T 3tpx 2 B,D,F B,D,F D-peptide inhibitor DPMI-delta XXXXXXXXXXXX 12 F F F 3trv 2 B B VILI_CHICK D-Villin-1 XXXXXXXXXXGXXXXXXXXXXXXXXXXXXXXXGXX 35 F F Eukaryota F 3tsz 2 B B JAM1_HUMAN JAM-A, JUNCTIONAL ADHESION MOLECULE 1, JAM-1, PLATELET F11 RECEPTOR, PLATELET ADHESION MOLECULE 1, PAM-1 EGEFKQTSSFLV 12 T 17 DUF4193 pdbhh F Eukaryota T 3tuv 2 B B Peptide XXX 3 F F F 3twe 1 A,B A,B alpha4H GNADELYKELEDLQERLRKLRKKLRSG 27 T 0.0068 DUF5798 pdb F T 3twf 1 A,B A,B alpha4F3a GNADELYKEXEDLQERXRKLRKKXRSG 27 T 0.53 DUF5798 pdbhh F T 3twg 1 A,B A,B alpha4F3af3d GNADEXYKEXEDXQERXRKXRKKXRSG 27 T 5.9 PRP1_N pdbhh F T 3twr 2 E,F,G,H E,F,G,H 3BP2_HUMAN 3BP-2 LPHLQRSPPDGQSFRX 16 T 4.1 DUF3375 pdbhh F Eukaryota T 3tws 2 E,F,G,H E,F,G,H human TERF1 LPHLQRGCADGQSFRX 16 T 7.5 GSIII_N pdbhh F T 3twt 2 E,F,G,H E,F,G,H human MCL1 LPHLQRPPPIGQSFRX 16 T 3.1 DUF3375 pdbhh F T 3twu 2 B B MCL1_HUMAN BCL-2-LIKE PROTEIN 3, BCL2-L-3, BCL-2-RELATED PROTEIN EAT/MCL1, MCL1/EAT SRRVARPPPIGAEVPX 16 T 7.6 DUF4653 pdbhh F Eukaryota T 3twv 2 E,F,G,H E,F,G,H human NUMA1 LPHLQRTQPDGQSFRX 16 T 4.8 DUF3375 pdbhh F T 3tww 2 C,D C,D human LNPEP LPHLQRQSPDGQSFRX 16 T 3.1 DUF3375 pdbhh F T 3twx 2 C,D C,D human FNBP1 LPHLQRESPDGQSFRX 16 T 3.1 DUF3375 pdbhh F T 3tzd 2 B T H14_HUMAN HISTONE H1B YPVKKKARKSAGAAKRKAS 19 T 0.2 DUF5797 unp F Eukaryota T 3tzg 1 A,B A,B A6L2L1_BACV8 hypothetical protein BVU_2266 GKEKKADTYVTKVTDLTGEEEQVLKLEYDRDGKIIKYGDTPVRYEGDQITIGQMNCLNTGNKLCNVTFQIGKGKARESRARCMLKVGEEVYEADKQTVYDYKGDTIFINSDYRATSDYRFLKKVQGKYVFDQLGRLKEVMTVFTEANDSVSSCHTYYNYDNNINYQANLNLQAYVIDYDGVDSFFYFLLNLGQLRNRTALPNDIGYCMNHGLSTYNVHANYRLDDENPVRIEVLYNYTKLLSRIDLSYNPLN 252 T 0.019 UPF0257 unphh F Bacteria T 3tzw 2 B D CO-CRYSTALLIZED PEPTIDE SDKENFWGMAVA 12 T 1.1 NPFF pdbhh F T 3tzx 2 C,D C,D CO-CRYSTALLIZED PEPTIDE SDKENFWGMAVA 12 T 1.1 NPFF pdbhh F T 3tzy 2 C,D C,D CO-CRYSTALLIZED PEPTIDE SDKENFWGMAVA 12 T 1.1 NPFF pdbhh F T 3tzz 2 C C CO-CRYSTALLIZED PEPTIDE SDKENFWGMAVA 12 T 1.1 NPFF pdbhh F T 3u1i 3 E,F E,F peptide of (BEZ)(NLE)KR(OAR) XXKRX 5 T 220 SUZ pdbhh F F 3u23 2 B B RIN3_HUMAN RAS INTERACTION/INTERFERENCE PROTEIN 3 TAKQPPVPPPRKKRISX 17 T 0.0029 HCV_NS5a_C pdbhh F Eukaryota T 3u29 1 A,B,C,D,E,F A,B,C,D,E,F collagen mimetic peptide XPPGPPGPPGPKGDPGPPGPPGPPGX 26 T 0.00016 Collagen pdbpssm F F 3u2q 2 B B THCL_PLARO Thiocillin GE2270 analogue NVP-LFF571 SXNXVXGXXXXX 12 T 0.058 Radical_SAM_2 pdbhh F Bacteria F 3u3f 2 E,F,G,H,I,J E,F,G,H,I,J PAXI_HUMAN Paxillin LD2 peptide SATRELDELMASLSDFK 17 T 0.99 SAM_LFY pdbhh F Eukaryota T 3u3z 2 B B H2AX_HUMAN Histone H2A.X peptide SQEX 4 T 180 DUF4535 pdbhh F Eukaryota F 3u4w 2 B B macrocyclic inhibitor MC4B XXFXX 5 T 530 Rab3-GTPase_cat pdbhh F F 3u51 2 C,D C,D macrocyclic inhibitor MC1 XXXXXX 6 T 4000 zf-H2C2_2 pdbhh F F 3u6b 2 C,D C,D THCL_PLARO EF-TU 1, P-43 ,EF-TU SXNXVXGXXXXX 12 T 1.2 CCER1 unphh F Bacteria F 3u6k 2 C,D C,D THCL_PLARO Thiocillin GE2270 analogue NVP-LDK733 SXNXVXGXXXXX 12 T 0.058 Radical_SAM_2 pdbhh F Bacteria F 3u72 2 B B TRFL_BOVIN C-terminal peptide of Lactotransferrin LEACAF 6 T 9.3 T6SS_TssF pdbhh F Eukaryota F 3u7d 2 B,D B,D HEG1_HUMAN Protein HEG homolog 1 SRHSCIFPGQYNPSFISDESRRRDYF 26 T 3.4 LAX unphh F Eukaryota T 3u85 2 B B KMT2A_HUMAN LYSINE N-METHYLTRANSFERASE 2A,ALL-1,CXXC-TYPE ZINC FINGER PROTEIN 7,MYELOID/LYMPHOID OR MIXED-LINEAGE LEUKEMIA,MYELOID/LYMPHOID OR MIXED-LINEAGE LEUKEMIA PROTEIN 1,TRITHORAX-LIKE PROTEIN,ZINC FINGER PROTEIN HRX SRWRFPARPGTTGGGGGGGRR 21 T 10 DUF5877 pdbhh F Eukaryota T 3u88 2 C,E M,N KMT2A_HUMAN LYSINE N-METHYLTRANSFERASE 2A,ALL-1,CXXC-TYPE ZINC FINGER PROTEIN 7,MYELOID/LYMPHOID OR MIXED-LINEAGE LEUKEMIA,MYELOID/LYMPHOID OR MIXED-LINEAGE LEUKEMIA PROTEIN 1,TRITHORAX-LIKE PROTEIN,ZINC FINGER PROTEIN HRX SRWRFPARPGTGRRGLGGAPRQRVPALLRVGPGFDAALQVSAAIGTNLRRFRAVFGESGGGGGSGEDEQFLGFGS 75 T 10 DUF3467 pdbhh F Eukaryota T 3u8o 3 C I D-PHE-PRO-D-ARG-D-THR DERIVED DIRECT THROMBIN INHIBITOR XPXXX 5 T 91 DUF4995 pdbhh F F 3u8q 2 B B TRFL_BOVIN C-terminal peptide of Lactotransferrin LEACAF 6 T 9.3 T6SS_TssF pdbhh F Eukaryota F 3u8r 3 C I D-Phe-Pro-D-Arg-Ile DERIVED DIRECT THROMBIN INHIBITOR XPXIX 5 T 62 DUF2293 pdbhh F F 3u8t 3 C I D-PHE-PRO-D-ARG-CYS DIRECT THROMBIN INHIBITOR XPXCX 5 T 22 Mlh1_C pdbhh F F 3u9q 2 B B PRGC1_HUMAN PGC-1a peptide SLLKKLLLA 9 T 35 LELP1 pdbhh F Eukaryota F 3ua0 1 A,B A,B FIBH_BOMMO FIB-H, H-FIBROIN MGHHHHHHMRVKTFVILCCALQYVAYTNANINDFDEDYFGSDVTVQSSNTTDEIIRDASGAVIEEQITTKKMQRKNKNHGILGKNEKMIKTFVITTDSDGNESIVEEDVLMKTLSDGTVAQSYVAADAGAYSQS 134 T 0.053 DUF809 pdb F Eukaryota T 3uc7 1 A,B,C,D,E,F A,B,C,D,E,F Cyclo-TC1 GDAYAQWLADGGPSSGRPPPSG 22 T 5.1 MOSC_N pdbhh F T 3uc8 1 A,B,C A,B,C cyclo-TC1 GDAYAQWLADGGPSSGRPPPSG 22 T 5.1 MOSC_N pdbhh F T 3ue7 1 A A D-Crambin XXXXXXXXXXXXXXXXXXXGXXXXXXXXXXGXXXXXGXXXXGXXXX 46 F F F 3ueo 2 E,F E,F MDC1_HUMAN phospho-peptide GFIDSDTDVEEE 12 T 1.4 LAGLIDADG_1 pdbhh F Eukaryota T 3uf7 2 B,C B,C SSB_ECOLI SSB MDFDDDIPF 9 T 0.33 Phage_SSB pdbhh F Bacteria F 3ufm 2 B B SSB_DEIRA SSB, HELIX-DESTABILIZING PROTEIN DDFPPEEDDLPF 12 T 0.54 Phage_SSB pdbhh F Bacteria F 3ugw 2 B B TRFL_BOVIN C-terminal peptide from Lactotransferrin LEACAF 6 T 9.3 T6SS_TssF pdbhh F Eukaryota F 3ui2 2 B B SR54C_ARATH 54 CHLOROPLAST PROTEIN, 54CP, SRP54, CPSRP54, FFC QKAPPGTARRKRK 13 T 6.1 DUF6490 pdbhh F Eukaryota T 3uk4 2 B B TRFL_BOVIN C-terminal peptide from Lactotransferrin LEACAF 6 T 9.3 T6SS_TssF pdbhh F Eukaryota F 3ukw 2 B C Bimax1 peptide GSRRRRPRKRPLEWDEDEEPPRKRKRLW 28 T 1.5 ROKNT pdbhh F T 3ukx 2 B C Bimax2 peptide GSRRRRRRKRKREWDDDDDPPKKRRRLD 28 T 0.74 Med24_N pdbhh F T 3uky 2 B C NCBP1_YEAST CAP-BINDING PROTEIN 80, CBP80 GSMFNRKRRGDFDEDENYRDFRPRMPKRQRIP 32 T 61 DUF2970 pdbhh F Eukaryota T 3ukz 2 B C NCBP1_MOUSE CAP-BINDING PROTEIN 80, CBP80 GSMSRRRHSYENDGGQPHKRRKTSD 25 T 15 DUF4500 pdbhh F Eukaryota T 3ul0 2 B C NCBP1_MOUSE CAP-BINDING PROTEIN 80, CBP80 GSMSRRRHSDENDGGQPHKRRKTSD 25 T 23 DUF2205 pdbhh F Eukaryota T 3ul1 2 B A NUPL_XENLA Nucleoplasmin GSAVKRPAATKKAGQAKKKKLD 22 T 0.0016 BSP_II unppercent F Eukaryota T 3ulr 3 C C ABL2_HUMAN ABELSON MURINE LEUKEMIA VIRAL ONCOGENE HOMOLOG 2, ABELSON-RELATED GENE PROTEIN, TYROSINE-PROTEIN KINASE ARG SSVVPYLPRLPILPSKT 17 T 0.31 PLU-1 unppercent F Eukaryota T 3ult 1 A,B A,B B5T007_LOLPR Ice recrystallization inhibition protein-like protein MDEQPNTISGSNNTVRSGSKNVLAGNDNTVISGDNNSVSGSNNTVVSGNDNTVTGSNHVVSGTNHIVTDNNNNVSGNDNNVSGSFHTVSGGHNTVSGSNNTVSGSNHVVSGSNKVVTDAAKLAAALEHHHHHH 133 T 0.077 NSP2-B_epitope pdb F Eukaryota T 3um0 2 B B CHMP5_HUMAN CHROMATIN-MODIFYING PROTEIN 5, SNF7 DOMAIN-CONTAINING PROTEIN 2, VACUOLAR PROTEIN SORTING-ASSOCIATED PROTEIN 60, VPS60, HVPS60 TKNKDGVLVDEFGLPQIPAS 20 T 1.3 Castor1_N pdbhh F Eukaryota T 3um2 2 B,D B,E CHMP5_HUMAN CHROMATIN-MODIFYING PROTEIN 5, SNF7 DOMAIN-CONTAINING PROTEIN 2, VACUOLAR PROTEIN SORTING-ASSOCIATED PROTEIN 60, VPS60, HVPS60 TKNKDGVLVDEFGLPQIPAS 20 T 1.3 Castor1_N pdbhh F Eukaryota T 3unn 2 B B MDC1_HUMAN phospho-T4 peptide from Mediator of DNA damage checkpoint protein 1 MEDTQAID 8 T 32 HD_assoc pdbhh F Eukaryota T 3uo8 2 C,D L,M MALT1 INHIBITOR, Z-VRPR-FMK XVRPRX 6 T 140 TTD pdbhh F F 3uoa 2 C,D L,M MALT1 INHIBITOR, Z-VRPR-FMK XVRPRX 6 T 140 TTD pdbhh F F 3uot 2 C,D D,E MDC1_HUMAN NUCLEAR FACTOR WITH BRCT DOMAINS 1 MEDTQMIDWD 10 T 1.7 DUF4502 pdbhh F Eukaryota T 3up6 1 A,B A,B A7M1U4_BACO1 hypothetical protein BACOVA_04078 GDVAYELPAHTTRAQLSIDLVNNGDVEQQEKINSMRFIVFGSTPGGVRLDVNEHILLSTPETATDIDAQLLEVTSSNDILVVVIANEPQSLTSQLDGIANLLTLQEMIYDISSILNSDGQIISATGMPMTGVIRDISIAPDETKTVQMVIERAVARVDVFIEAIDGGAVTGYTAGSTSVTLHNFSHDSYFVMGNVGNGTRDNADSSKNYGKVKEDVSESNLLTHSWTAATTETWAYSSAPGAENRKLLCSFYTAERLFKSDYSDRLSISMANVLKGPSDVTGITGKVIESVTKVDGTGSPTAQPFTEIRRNNVYQVTARVGKIGIQILTISVEDWGERQDIDLDMDL 347 T 0.00017 P_gingi_FimA unp F Bacteria T 3upr 2 B,D P,Q pep-V HSITYLLPV 9 T 4 PRCC pdbhh F T 3upv 2 B B HSP74_YEAST Heat shock protein SSA4 PTVEEVD 7 T 2.6 DUF2368 pdbhh F Eukaryota F 3uq3 2 B,C B,C HS90B_HUMAN Heat shock protein MEEVD 5 T 120 NUSAP pdbhh F Eukaryota F 3uqp 2 B B METHYL (2R)-1-[(6S,9S,12S,13S,17S,20S,23R)-9-(3-AMINO-3-OXOPROPYL)-12,23-DIBENZYL-13-HYDROXY-2,2,8,20,22-PENTAMETHYL-17-(2-METHYLPROPYL)-4,7,10,15,18,21,24-HEPTAOXO-6-(PROPAN-2-YL)-3-OXA-5,8,11,16,19,22-HEXAAZATETRACOSAN-24-YL]PYRROLIDINE-2-CARBOXYLATE XVXXLAXX 8 T 320 CEND1 pdbhh F F 3uqr 2 D,E,F D,E,F METHYL (2S)-1-[(2R,5S,8S,12S,13S)-2,13-DIBENZYL-12-HYDROXY-3,5-DIMETHYL-15-(3-[METHYL(METHYLSULFONYL)AMINO]-5-{[(1R)-1-PHENYLETHYL]CARBAMOYL}PHENYL)-8-(2-METHYLPROPYL)-4,7,10,15-TETRAOXO-3,6,9,14-TETRAAZAPENTADECAN-1-OYL]PYRROLIDINE-2-CARBOXYLATE XXXLAXX 7 T 760 zf-C2H2_4 pdbhh F F 3uri 2 B B DB5 peptide HPHLSXAH 8 T 15 DUF1045 pdbhh F T 3url 2 B B DB6 peptide HSLFHXTP 8 T 1.5 Integrase_Zn pdbhh F T 3usd 2 B B TRFL_BOVIN C-terminal peptide of Lactotransferrin LEACAF 6 T 9.3 T6SS_TssF pdbhh F Eukaryota F 3ut5 4 F F Vinca tetrapeptide XVXG 4 T 70 Rap1a pdbhh F F 3utq 3 C C INS_HUMAN Insulin ALWGPDPAAA 10 T 2.8 Lipid_DES pdbhh F Eukaryota T 3uts 3 C,H C,H INS_HUMAN Insulin ALWGPDPAAA 10 T 2.8 Lipid_DES pdbhh F Eukaryota T 3utt 3 C,H C,H INS_HUMAN Insulin ALWGPDPAAA 10 T 2.8 Lipid_DES pdbhh F Eukaryota T 3uvk 2 B B KMT2D_HUMAN ALL1-RELATED PROTEIN, LYSINE N-METHYLTRANSFERASE 2B, KMT2B, MYELOID/LYMPHOID OR MIXED-LINEAGE LEUKEMIA PROTEIN 2 GCARSEPKILT 11 T 0.12 N-SET pdbhh F Eukaryota T 3uvl 2 B B KMT2C_HUMAN HOMOLOGOUS TO ALR PROTEIN, LYSINE N-METHYLTRANSFERASE 2C, KMT2C, MYELOID/LYMPHOID OR MIXED-LINEAGE LEUKEMIA PROTEIN 3 GSARAEPKMSA 11 T 17 N-SET pdbhh F Eukaryota T 3uvm 2 B B KMT2B_HUMAN LYSINE N-METHYLTRANSFERASE 2D, KMT2D, MYELOID/LYMPHOID OR MIXED-LINEAGE LEUKEMIA PROTEIN 4, TRITHORAX HOMOLOG 2, WW DOMAIN-BINDING PROTEIN 7, WBP-7 GAARAEVYLR 10 T 11 Trp_DMAT pdbhh F Eukaryota T 3uvu 1 A B FEN1_HUMAN FEN-1, DNASE IV, FLAP STRUCTURE-SPECIFIC ENDONUCLEASE 1, MATURATION FACTOR 1, MF1, HFEN-1 SAKRKEPEPKGSTKKKAKT 19 T 140 DUF4647 pdbhh F Eukaryota T 3uvw 2 B B H4_HUMAN PEPTIDE (H4K5ACK8AC) SGRGXGGXGLGY 12 T 7.6 HTH_Tnp_Mu_1 pdbhh F Eukaryota T 3uvx 2 B B H4_HUMAN PEPTIDE (H4K12ACK16AC) GXGGAXRHRKV 11 T 11 Shadoo unppercent F Eukaryota T 3uvy 2 B B H4_HUMAN PEPTIDE (H4K16ACK20AC) AXRHRXVLRDN 11 T 0.27 UPF0137 unp F Eukaryota T 3uw4 2 B Z GDC0152 XXPX 4 T 1300 NHL pdbhh F F 3uw5 2 C,D Z,Y GDC-0152 XXPX 4 T 1300 NHL pdbhh F F 3uw9 2 E,F E,F H4_HUMAN PEPTIDE (H4K8ACK12AC) GXGLGXGGAKR 11 T 11 Shadoo unppercent F Eukaryota T 3ux0 2 B P KCNK9_HUMAN TASK3 PHOSPHOPEPTIDE KRRKSV 6 T 26 DUF4739 pdbhh F Eukaryota F 3uxg 2 B B HDAC4_HUMAN HD4 LPLYTSPSLPNITLGLP 17 T 16 RepA1_leader pdbhh F Eukaryota T 3uxw 2 I,J,K,L K,L,M,N A-T hook peptide RKPRGRPKKX 10 T 0.003 AT_hook pdbhh F F 3uzd 2 B B HDAC4_HUMAN HD4 LPLYTSPSLPNITLGLP 17 T 16 RepA1_leader pdbhh F Eukaryota T 3v2o 2 B B LRP2_RAT LRP-2, GLYCOPROTEIN 330, GP330, MEGALIN HYRKTGSLLPTLPKLPSLS 19 T 0.21 Amnionless unppercent F Eukaryota T 3v2x 2 B B LRP2_RAT LRP-2, GLYCOPROTEIN 330, GP330, MEGALIN LLPTLPKLPSL 11 T 0.21 Amnionless unppercent F Eukaryota F 3v30 2 B B RFX5_HUMAN REGULATORY FACTOR X 5 KTLVSMPPLPGLDLKGS 17 T 10 XRN_M pdbhh F Eukaryota T 3v31 2 B B HDAC4_HUMAN HD4 XLPLYTSPSLPNITLGLP 18 T 21 RepA1_leader pdbhh F Eukaryota T 3v3b 2 C,D C,D SAH-p53-8 stapled-peptide QSQQTFXNLWRLLXQN 16 T 0.0013 P53_TAD pdbhh F T 3v3v 2 B B JIP1_MOUSE JIP-1, JNK-INTERACTING PROTEIN 1, ISLET-BRAIN-1, IB-1, JNK MAP KINASE SCAFFOLD PROTEIN 1, MITOGEN-ACTIVATED PROTEIN KINASE 8-INTERACTING PROTEIN 1 RPKRPTTLNLF 11 T 12 CTP-dep_RFKase pdbhh F Eukaryota T 3v4l 2 B B Z-VRPR-FMK XVRPRX 6 T 140 TTD pdbhh F F 3v4o 2 B B Z-VRPR-FMK XVRPRX 6 T 140 TTD pdbhh F F 3v5a 2 B B TRFL_BOVIN C-TERMINAL PEPTIDE OF LACTOTRANSFERRIN LEACAF 6 T 9.3 T6SS_TssF pdbhh F Eukaryota F 3v62 3 C,F C,F SRS2_YEAST ATP-dependent DNA helicase SRS2 SHNPDDTTVDNRPIISNAKFLADAAMKKTQKFSKKVKNEPASSQMDIFSQLSRAKKKSKLNNGEIIVID 69 T 0.013 AD pdbpercent F Eukaryota T 3v79 6 F R NOTC1_HUMAN RAM KRRRQHGQLWFPEGFKVSE 19 T 0.48 DUF4381 unppssm F Eukaryota T 3v7d 3 E E SIC1_YEAST CDK INHIBITOR P40 MTSPFNGLTSPQRSPFPKS 19 T 1.8 RbcS pdbhh F Eukaryota T 3v9t 2 B C PRGC1_HUMAN PGC-1-ALPHA, PPAR-GAMMA COACTIVATOR 1-ALPHA, PPARGC-1-ALPHA, LIGAND EFFECT MODULATOR 6 QEAEEPSLLKKLLLAPANT 19 T 3.3 Apo-CIII pdbhh F Eukaryota T 3v9v 2 B C PRGC1_HUMAN PGC-1-ALPHA, PPAR-GAMMA COACTIVATOR 1-ALPHA, PPARGC-1-ALPHA, LIGAND EFFECT MODULATOR 6 QEAEEPSLLKKLLLAPANT 19 T 3.3 Apo-CIII pdbhh F Eukaryota T 3va4 2 B C CHK2_HUMAN CHK2 CHECKPOINT HOMOLOG, CDS1 HOMOLOG, HUCDS1, HCDS1, CHECKPOINT KINASE 2 LETVSTQELYS 11 T 0.39 RAP1 unppercent F Eukaryota T 3vb4 2 C,D E,F B4Z inhibitor XAVLX 5 T 1500 DUF4478 pdbhh F F 3vb5 2 C,D E,F C4Z inhibitor XAVLX 5 T 1500 DUF4478 pdbhh F F 3vb6 2 C,D E,F C6Z inhibitor XTSAVLX 7 T 470 DUF5550 pdbhh F T 3vb7 2 C,D E,F M4Z inhibitor XAVLX 5 T 1500 DUF4478 pdbhh F F 3vdf 2 B B TRFL_BOVIN C-TERMINAL PEPTIDE OF LACTOTRANSFERRIN LEACAF 6 T 9.3 T6SS_TssF pdbhh F Eukaryota F 3ve6 2 B B POLS_EEVVT Venezuelan equine encephalitis virus capsid protein NLS EGPSAKKPKKEA 12 T 2.8 AKAP2_C unp T Viruses T 3vfj 2 B G MonodeChloro- Teicoplanin A2-2 XXXXXXX 7 T 1.5 Defensin_2 pdbhh F F 3vfk 2 B G MonodeChloro- Teicoplanin A2-2 XXXXXXX 7 T 1.5 Defensin_2 pdbhh F F 3vfm 3 C C LPEP peptide from EBV, LPEPLPQGQLTAY LPEPLPQGQLTAY 13 T 14 AP-5_subunit_s1 pdbhh F T 3vfn 3 C C BZLF1_EBVB9 LPEP peptide from EBV, LPEPLPQGQLTAY LPEPLPQGQLTAY 13 T 14 AP-5_subunit_s1 pdbhh T Viruses T 3vfo 3 C C BZLF1_EBVB9 LPEP peptide from EBV, LPEPLPQGQLTAY LPEPLPQGQLTAY 13 T 14 AP-5_subunit_s1 pdbhh T Viruses T 3vfp 3 C C BZLF1_EBVB9 LPEP peptide from EBV, LPEPLPQGQLTAY LPEPLPQGQLTAY 13 T 14 AP-5_subunit_s1 pdbhh T Viruses T 3vfr 3 C C BZLF1_EBVB9 LPEP peptide from EBV, P4A, LPEALPQGQLTAY LPEALPQGQLTAY 13 T 26 AP-5_subunit_s1 pdbhh T Viruses T 3vfs 3 C C LPEP peptide from EBV, P5A, LPEPAPQGQLTAY LPEPAPQGQLTAY 13 T 17 Casc1_N pdbhh F T 3vft 3 C C LPEP peptide from EBV, P6A, LPEPLAQGQLTAY LPEPLAQGQLTAY 13 T 13 DUF99 pdbhh F T 3vfu 3 C C LPEP peptide from EBV, P7A, LPEPLPAGQLTAY LPEPLPAGQLTAY 13 T 19 DUF2808 pdbhh F T 3vfv 3 C C LPEP peptide from EBV, P9A, LPEPLPQGALTAY LPEPLPQGALTAY 13 T 23 GSAP-16 pdbhh F T 3vfw 3 C C LPEP peptide from EBV, P10A, LPEPLPQGQATAY LPEPLPQGQATAY 13 T 29 GSAP-16 pdbhh F T 3vg8 1 A,B,C,D,E,F,G,H,I,J G,H,I,J,A,B,C,D,E,F Q53VW9_THET8 Hypothetical Protein TTHB210 MNVSEALKGALPNFIPGLGTLYVDPSTLPEGPFLAYDRAGNLVKVVFMVPLKKLNESHKYVDIGTKTLRALGITRIDHVNMIPSGPHPGVSEPHYHIELVLVSVDQERKVLEGEPY 116 T 0.0015 DUF5602 pdbpssm F Bacteria T 3vgc 1 A A CTRA_BOVIN GAMMA CHYMOTRYPSIN CGVPAIQPVLSGL 13 T 2 CaM_bind pdbhh F Eukaryota T 3vi1 2 C,D C,D Substance P1-6, RPKPQQ RPKPQQ 6 T 7.5 Tachykinin pdbhh F F 3vi4 5 I,J G,I RGD peptide RGDNP 5 T 70 Ornatin pdbhh F F 3viv 2 C C PSTOM_PYRHO UNCHARACTERIZED PROTEIN PH1511 NVIVLMLPME 10 T 2.9 Polysacc_deac_2 pdbhh F Archaea T 3vj6 3 C P HA1L_MOUSE Qdm peptide AMAPRTLLL 9 T 0.014 UL40 pdbhh F Eukaryota T 3vjs 2 B C MED1_HUMAN peptide from Mediator of RNA polymerase II transcription subunit 1 KNHPMLMNLLKDN 13 T 3.8 Gb3_synth pdbhh F Eukaryota T 3vjt 2 B C MED1_HUMAN peptide from Mediator of RNA polymerase II transcription subunit 1 KNHPMLMNLLKDN 13 T 3.8 Gb3_synth pdbhh F Eukaryota T 3vpj 2 C,D,G,H E,F,G,H Q9I2Q0_PSEAE Tse1-specific immunity protein MGSSHHHHHHSSGLVPRGSHMKLLAGSFAALFLSLSAQAADCTFTQLEIVPQFGSPNMFGGEDEHVRVMFSNEDPNDDNPDAFPEPPVYLADRDSGNDCRIEDGGIWSRGGVFLSQDGRRVLMHEFSGSSAELVSYDSATCKVVHREDISGQRWAVDKDGLRLGQKCSGESVDSCAKIVKRSLAPFCQTAKK 192 T 1.5 Me-amine-dh_H unphh F Bacteria T 3vqg 2 B B IGSF5_MOUSE JAM-4 YKVRNVTLV 9 T 0.79 Chisel unppercent F Eukaryota T 3vqm 2 O,P,Q,R,S,T,U,V,W O,P,Q,R,S,T,U,V,W Q970D9_SULTO C-terminal peptide from Small heat shock protein StHsp14.0 VIKIE 5 T 0.04 Frataxin_Cyay unppercent F Archaea F 3vrp 2 B B EGFR_HUMAN PROTO-ONCOGENE C-ERBB-1, RECEPTOR TYROSINE-PROTEIN KINASE ERBB-1 EDSFLQRXSSDPT 13 T 1.4 DUF4348 pdbhh F Eukaryota T 3vrr 2 B C EGFR_HUMAN PROTO-ONCOGENE C-ERBB-1, RECEPTOR TYROSINE-PROTEIN KINASE ERBB-1 EDSFLQRXSSDPT 13 T 1.4 DUF4348 pdbhh F Eukaryota T 3vrt 2 B C MED1_HUMAN 13-meric peptide from Mediator of RNA polymerase II transcription subunit 1 KNHPMLMNLLKDN 13 T 3.8 Gb3_synth pdbhh F Eukaryota T 3vru 2 B C MED1_HUMAN 13-meric peptide from Mediator of RNA polymerase II transcription subunit 1 KNHPMLMNLLKDN 13 T 3.8 Gb3_synth pdbhh F Eukaryota T 3vrv 2 B C MED1_HUMAN 13-meric peptide from Mediator of RNA polymerase II transcription subunit 1 KNHPMLMNLLKDN 13 T 3.8 Gb3_synth pdbhh F Eukaryota T 3vrw 2 B C MED1_HUMAN 13-meric peptide from Mediator of RNA polymerase II transcription subunit 1 KNHPMLMNLLKDN 13 T 3.8 Gb3_synth pdbhh F Eukaryota T 3vt3 2 B C COACTIVATOR PEPTIDE DRIP KNHPMLMNLLKDN 13 T 3.8 Gb3_synth pdbhh F T 3vt4 2 B C COACTIVATOR PEPTIDE DRIP KNHPMLMNLLKDN 13 T 3.8 Gb3_synth pdbhh F T 3vt5 2 B C COACTIVATOR PEPTIDE DRIP KNHPMLMNLLKDN 13 T 3.8 Gb3_synth pdbhh F T 3vt6 2 B C COACTIVATOR PEPTIDE DRIP KNHPMLMNLLKDN 13 T 3.8 Gb3_synth pdbhh F T 3vt7 2 B C D3ZRN2_RAT COACTIVATOR PEPTIDE DRIP KNHPMLMNLLKDN 13 T 3.8 Gb3_synth pdbhh F Eukaryota T 3vt8 2 B C COACTIVATOR PEPTIDE DRIP KNHPMLMNLLKDN 13 T 3.8 Gb3_synth pdbhh F T 3vt9 2 B C COACTIVATOR PEPTIDE DRIP KNHPMLMNLLKDN 13 T 3.8 Gb3_synth pdbhh F T 3vtb 2 B B COACTIVATOR PEPTIDE DRIP KNHPMLMNLLKDN 13 T 3.8 Gb3_synth pdbhh F T 3vtc 2 B B COACTIVATOR PEPTIDE DRIP KNHPMLMNLLKDN 13 T 3.8 Gb3_synth pdbhh F T 3vtd 2 B B COACTIVATOR PEPTIDE DRIP KNHPMLMNLLKDN 13 T 3.8 Gb3_synth pdbhh F T 3vu5 2 B B SC22 XWEEWDKKIEEYTKKIEELIKKS 23 T 0.052 GP41 pdbhh F T 3vu7 3 C Z REV3L_HUMAN PROTEIN REVERSIONLESS 3-LIKE, REV3-LIKE, HREV3 MLTPTPDSSPRSTSSPSQSKNGSFTPRTANILKPLMSPPSREEIMATLLDHD 52 T 7 CaM_bind pdbhh F Eukaryota T 3vud 2 B F JIP1_HUMAN JIP-1, JNK-INTERACTING PROTEIN 1, ISLET-BRAIN 1, IB-1, JNK MAP KINASE SCAFFOLD PROTEIN 1, MITOGEN-ACTIVATED PROTEIN KINASE 8-INTERACTING PROTEIN 1 RPKRPTTLNLF 11 T 12 CTP-dep_RFKase pdbhh F Eukaryota T 3vug 2 B F JIP1_HUMAN JIP-1, JNK-INTERACTING PROTEIN 1, ISLET-BRAIN 1, IB-1, JNK MAP KINASE SCAFFOLD PROTEIN 1, MITOGEN-ACTIVATED PROTEIN KINASE 8-INTERACTING PROTEIN 1 RPKRPTTLNLF 11 T 12 CTP-dep_RFKase pdbhh F Eukaryota T 3vuh 2 B F JIP1_HUMAN JIP-1, JNK-INTERACTING PROTEIN 1, ISLET-BRAIN 1, IB-1, JNK MAP KINASE SCAFFOLD PROTEIN 1, MITOGEN-ACTIVATED PROTEIN KINASE 8-INTERACTING PROTEIN 1 RPKRPTTLNLF 11 T 12 CTP-dep_RFKase pdbhh F Eukaryota T 3vui 2 B F JIP1_HUMAN JIP-1, JNK-INTERACTING PROTEIN 1, ISLET-BRAIN 1, IB-1, JNK MAP KINASE SCAFFOLD PROTEIN 1, MITOGEN-ACTIVATED PROTEIN KINASE 8-INTERACTING PROTEIN 1 RPKRPTTLNLF 11 T 12 CTP-dep_RFKase pdbhh F Eukaryota T 3vuk 2 B F JIP1_HUMAN JIP-1, JNK-INTERACTING PROTEIN 1, ISLET-BRAIN 1, IB-1, JNK MAP KINASE SCAFFOLD PROTEIN 1, MITOGEN-ACTIVATED PROTEIN KINASE 8-INTERACTING PROTEIN 1 RPKRPTTLNLF 11 T 12 CTP-dep_RFKase pdbhh F Eukaryota T 3vul 2 B F JIP1_HUMAN JIP-1, JNK-INTERACTING PROTEIN 1, ISLET-BRAIN 1, IB-1, JNK MAP KINASE SCAFFOLD PROTEIN 1, MITOGEN-ACTIVATED PROTEIN KINASE 8-INTERACTING PROTEIN 1 RPKRPTTLNLF 11 T 12 CTP-dep_RFKase pdbhh F Eukaryota T 3vum 2 B F JIP1_HUMAN JIP-1, JNK-INTERACTING PROTEIN 1, ISLET-BRAIN 1, IB-1, JNK MAP KINASE SCAFFOLD PROTEIN 1, MITOGEN-ACTIVATED PROTEIN KINASE 8-INTERACTING PROTEIN 1 RPKRPTTLNLF 11 T 12 CTP-dep_RFKase pdbhh F Eukaryota T 3vvi 1 A,B,C,D,E,F,G,H A,B,C,D,E,F,G,H I1RJZ4_GIBZE TRANSIENT RECEPTOR POTENTIAL CHANNEL VRKLRAEMEELKSMLSQLGKT 21 T 0.035 Vps51 pdb F Eukaryota T 3vvr 2 B B MAD5 XXVYSAVCAAAA 12 T 12 DUF711 pdbhh F T 3vvs 2 B B MAD3S XXVYSAVCLYV 11 T 6.1 TraL_transposon pdbhh F T 3vxw 2 B B ATG32_YEAST EXTRACELLULAR MUTANT PROTEIN 37 SWQAIQ 6 T 28 UDG pdbhh F Eukaryota F 3w0g 2 B C MED1_HUMAN VITAMIN D RECEPTOR-INTERACTING PROTEIN COMPLEX COMPONENT DRIP205 KNHPMLMNLLKDN 13 T 3.8 Gb3_synth pdbhh F Eukaryota T 3w0h 2 B C MED1_HUMAN VITAMIN D RECEPTOR-INTERACTING PROTEIN COMPLEX COMPONENT DRIP205 KNHPMLMNLLKDN 13 T 3.8 Gb3_synth pdbhh F Eukaryota T 3w0i 2 B C MED1_HUMAN VITAMIN D RECEPTOR-INTERACTING PROTEIN COMPLEX COMPONENT DRIP205 KNHPMLMNLLKDN 13 T 3.8 Gb3_synth pdbhh F Eukaryota T 3w0j 2 B C MED1_HUMAN VITAMIN D RECEPTOR-INTERACTING PROTEIN COMPLEX COMPONENT DRIP205 KNHPMLMNLLKDN 13 T 3.8 Gb3_synth pdbhh F Eukaryota T 3w11 6 F F INSR_HUMAN INSULIN RECEPTOR SUBUNIT ALPHA TFEDYLHNVVFVPRPS 16 T 0.00017 DUF4998 unphh F Eukaryota T 3w12 6 F F INSR_HUMAN INSULIN RECEPTOR SUBUNIT ALPHA TFEDYLHNVVFVPRPS 16 T 0.00017 DUF4998 unphh F Eukaryota T 3w13 6 F F INSR_HUMAN INSULIN RECEPTOR SUBUNIT ALPHA EESSFRKTFEDYLHNVVFVPRPS 23 T 0.00017 DUF4998 unphh F Eukaryota T 3w15 2 B B PEX21_YEAST PEROXIN-21 GSKWFDQDQSELQRIATDIVKCCTPPPSSASSSSTLSSSVESKLSESKFIQLMRNISSGDVTLKKNADGNSASELFSSNNGELVGNRHIFVKDEIHKDILD 101 T 0.61 DUF3446 pdbpercent F Eukaryota T 3w1b 2 B B DCR1C_HUMAN DNA CROSS-LINK REPAIR 1C PROTEIN, PROTEIN A-SCID, SNM1 HOMOLOG C, HSNM1C, SNM1-LIKE PROTEIN DVPQWEVFFKR 11 T 1.6 DUF4570 pdbhh F Eukaryota T 3w1g 2 B B DCR1C_HUMAN DNA CROSS-LINK REPAIR 1C PROTEIN, PROTEIN A-SCID, SNM1 HOMOLOG C, HSNM1C, SNM1-LIKE PROTEIN DVPQWEVFFKR 11 T 1.6 DUF4570 pdbhh F Eukaryota T 3w30 1 A,B A,B Q8VSD5_SHIFL ORF169b GPLGSPEFMINGVSLQGTAGYEAHTEEGNVNVKKLLESLNSKSLGDMDKDSELAATLQKMINPSGGDGNASGCALHACMAMLGYGVREAPVPNEISEYMTGFFHRHLEQIDSEGIVSHPNETYSKFRERIAENILQNTSKGSVVMISIEQATHWIAGFNDGEKIMFLDVQTGKGFNLYDPVEKSPDAFVDENSSVQVIHVSDQEFDHYANSSSWKSKRLC 220 T 0.12 Gln_amidase pdbpercent F Bacteria T 3w31 1 A A Q8VSD5_SHIFL ORF169b GPLGSPEFMINGVSLQGTAGYEAHTEEGNVNVKKLLESLNSKSLGDMDKDSELAATLQKMINPSGGDGNASGCALHACMAMLGYGVREAPVPNEISEYMTGFFHRHLEQIDSEGIVSHPNETYSKFRERIAENILQNTSKGSVVMISIEQATHWIAGFNDGEKIMFLDVQTGKGFNLYDPVEKSPDAFVDENSSVQVIHVSDQEFDHYANSSSWKSKRLC 220 T 0.12 Gln_amidase pdbpercent F Bacteria T 3w3w 2 B B STE12_YEAST Protein STE12 PRRRTVGMKSSQGNVPTGNKQSVGKSAKISKPLHIKTSAYQKQYKINLETKARPSAGDEDSAHPDKNKE 69 T 480 LAG1-DNAbind pdbhh F Eukaryota T 3w3x 2 B B PHO4_YEAST Phosphate system positive regulatory protein PHO4 SANKVTKNKSNSSPYLNKRRGKPGPDS 27 T 0.72 TonB_N unp F Eukaryota T 3w3y 2 B B NUP53_YEAST NUCLEAR PORE PROTEIN NUP53 RNAEFKVSKNSTSFKNPRRLEIKDGRSLFLRNRGKIHSGVLSSIESDL 48 T 3.4 DUF3994 pdbhh F Eukaryota T 3w5p 2 B C MED1_HUMAN Mediator of RNA polymerase II transcription subunit 1 KNHPMLMNLLKDN 13 T 3.8 Gb3_synth pdbhh F Eukaryota T 3w5q 2 B C MED1_HUMAN Mediator of RNA polymerase II transcription subunit 1 KNHPMLMNLLKDN 13 T 3.8 Gb3_synth pdbhh F Eukaryota T 3w5r 2 B C MED1_HUMAN Mediator of RNA polymerase II transcription subunit 1 KNHPMLMNLLKDN 13 T 3.8 Gb3_synth pdbhh F Eukaryota T 3w5t 2 B C MED1_HUMAN Mediator of RNA polymerase II transcription subunit 1 KNHPMLMNLLKDN 13 T 3.8 Gb3_synth pdbhh F Eukaryota T 3w6k 1 A,D A,D A0A0E0TE00_GEOS2 ScpA ERALLFTKPPSDLSAYAD 18 T 7.5 Reo_sigmaC pdbhh F Bacteria T 3wa0 2 G,H G,H DCAF1_HUMAN DDB1- AND CUL4-ASSOCIATED FACTOR 1, HIV-1 VPR-BINDING PROTEIN, VPRBP, VPR-INTERACTING PROTEIN GPLGSYDDDTDDLDELDTDQLLEAELEEDDNNENAGEDGDNDFSPSDEELANLLEEGEDGEDEDSDADEEVELILGDTDSSDNSDLEDDIILSLNE 96 T 5 Cwf_Cwc_15 unphh F Eukaryota T 3wa0 3 I I Protein VPRBP XXXXXXX 7 F F F 3wa0 4 J J Protein VPRBP XXXXXXXX 8 F F F 3wa0 5 K K Protein VPRBP XXXXXX 6 F F F 3wa4 2 B B CD28_HUMAN TP44 SDXMNMTP 8 T 0.089 DUF2207 unppercent F Eukaryota T 3wa5 1 A A Q9HYC5_PSEAE Type VI secretion exported 3 MTATSDLIESLISYSWDDWQVTRQEARRVIAAIRNDNVPDATIAALDKSGSLIKLFQRVGPPELARSLIASIAGRTTMQRYQARNALIRSLINNPLGTQTDNWIYFPTITFFDICADLADAAGRLGFAAAGATGVASQAIQGPFSGVGATGVNPTDLPSIAFGDQLKLLNKDPATVTKYSNPLGDLGAYLSQLSPQDKLNQAQTLVGQPISTLFPDAYPGNPPSRAKVMSAAARKYDLTPQLIGAIILAEQRDQTRDEDAKDYQAAVSIKSANTSIGLGQVVVSTAIKYELFTDLLGQPVRRGLSRKAVATLLASDEFNIFATARYIRYVANLASQQDLRKLPKTRGAFPSIDLRAYAGNPRNWPRDNVRALASEYTSRPWDDNLSPGWPMFVDDAYATFLDPGMRFPLEHHHHHH 416 T 0.0021 DUF1402 unphh F Bacteria T 3wa5 2 B B Q9HYC4_PSEAE Tse3-specific immunity protein MKTVALILASLALLACTAESGVDFDKTLTHPNGLVVERPVGFDARRSAEGFRFDEGGKLRNPRQLEVQRQDAPPPPDLASRRLGDGEARYKVEEDDGGSAGSEYRLWAAKPAGARWIVVSASEQSEDGEPTFALAWALLERARLQLEHHHHHH 153 T 0.0052 PsbP_2 unphh F Bacteria T 3wbn 2 B B MaL6 AFTFRYSPSLYTWFLFPCG 19 T 5.3 HXXEE pdbhh F T 3wdc 2 B B Cyclomarin A XXAXVXX 7 T 6.3 DUF446 pdbhh F F 3wdd 2 B B Cyclomarin A XXAXVXX 7 T 6.3 DUF446 pdbhh F F 3wde 2 B B Cyclomarin A XXAXVXX 7 T 6.3 DUF446 pdbhh F F 3wdz 2 B B SQSTM_MOUSE STONE14, UBIQUITIN-BINDING PROTEIN P62 KEVDPSTGELQSLQ 14 T 1.2 DUF2396 pdbhh F Eukaryota T 3wg5 2 C C PSTOM_PYRHO UNCHARACTERIZED PROTEIN PH1511 NVIVLMLPME 10 T 2.9 Polysacc_deac_2 pdbhh F Archaea T 3wim 2 B B WDFY3_HUMAN AUTOPHAGY-LINKED FYVE PROTEIN, ALFY DEKDGFIFVNYSEG 14 T 0.42 ATG8 pdbhh F Eukaryota T 3wit 1 A A Q8XBY5_ECO57 UNCHARACTERIZED PROTEIN GTIAGSVHVDAVNNGGEGNGIQAYTAIKEIMLAVEESKIALTPDGIQLQVGESTVIRLSKDGITIVGGSVFINGLEHHHHHH 82 T 0.052 DUF2345 pdbpssm F Bacteria T 3wkn 2 C,D,G,H,K,L,O,P E,F,G,H,K,L,O,P AF.P17 GPGISAFSPGRGVYDPETGTWYDAAWHLGELVWATYYDPETGTWEPDWQRMLGQ 54 T 0.00013 OCRE pdb F T 3wmg 2 B B anti-CmABCB1 peptide XXLDQIVWFNAPGDLHLCG 19 T 2.5 Endothelin pdbhh F T 3wn7 2 B,D B,M NF2L2_MOUSE NF-E2-RELATED FACTOR 2, NFE2-RELATED FACTOR 2, NUCLEAR FACTOR, ERYTHROID DERIVED 2, LIKE 2 MDLIDILWRQDIDLGVSREVFDFSQRQKDYELEKQ 35 T 0.055 Radial_spoke unppercent F Eukaryota T 3wn8 1 A,B,C A,B,C collagen-like peptide PPGPPGPPGPRGPPGPPGPPGPPG 24 T 0.00065 Collagen pdb F F 3wne 2 C,D C,D LEDGF peptide PKIDNG 6 T 11 Mak10 pdbhh F T 3wnf 2 C,D C,D CKIDNC peptide XCKIDNCX 8 T 0.55 Hormone_4 pdbhh F T 3wng 2 C,D C,D PKIDN(DPR) peptide PKIDNX 6 T 4.6 OrfA pdbhh F T 3wnh 2 C,D C,D PK(NLE)DN(DVA) peptide PKXDNX 6 T 140 COMM_domain pdbhh F F 3wod 6 G,H G,H A7XX65_9CAUD GP39 MVEGFVEPYIRLFEAIPDAETELATFYDADLDTLPPRMFLPSGDLYTPPGPVRLEEIKRKRRVRLVKVSIYRFEHVGLGLAARPYAYAYAWQGDNGILHLYHAPVVLEDVPEVLELDEVTYNESYVRLMRAMGHVDAFIDL 141 T 2.5 Abp2 pdbhh T Viruses T 3woe 2 B,D B,D A7XX65_9CAUD GP39 GPVEPYIRLFEAIPDAETELATFYDADLDTLPPRMFLPSGDLYTPPGPVRLEEIKRKRRVRLVKVSIYRFEHVGLGLAARPYAYAYAWQGDNGILHLYHAPVVLED 106 T 14 CSRNP_N pdbhh T Viruses T 3wof 2 B,D,F,H,J,L,N,P,R,T,V,X B,D,F,H,J,L,N,P,R,T,V,X A7XX65_9CAUD GP39 GPVEPYIRLFEAIPDAETELATFYDADLDTLPPRMFLPSGDLYTPPGPVRLEEIKRKRRVRLVKVSIYRFEHVGLGLAARPYAYAYAWQGDNGILHLYHAPVVLEDVPEVLELDEVTYNESYVRLMRAM 129 T 2.5 Abp2 unphh T Viruses T 3woo 2 C,D C,D ANGT_HUMAN Angiotensin II VYIHPF 6 T 0.64 Adeno_PVIII pdbhh F Eukaryota T 3wop 2 C,D C,D Angiotensin IV VYIHPF 6 T 0.64 Adeno_PVIII pdbhh F T 3woq 2 C,D C,D Angiotensin IV VYIHPF 6 T 0.64 Adeno_PVIII pdbhh F T 3wor 2 C,D C,D ANGT_HUMAN Angiotensin II DRVYIHPF 8 T 0.67 Nairo_nucleo pdbhh F Eukaryota T 3wp0 2 B B L2GL2_HUMAN HGL LSRVKSLKKSLRQSF 15 T 18 DUF3511 pdbhh F Eukaryota T 3wp1 2 B A L2GL2_HUMAN HGL LKKSLRQSFRRMRRSRV 17 T 27 Neuropeptide_S pdbhh F Eukaryota T 3ws6 3 E,F E,F Mimotope 9-mer peptide YAIENYLEL 9 T 2.6 DUF4744 pdbhh F T 3wsy 2 B C SORL_HUMAN peptide from Sortilin-related receptor LPQDRGFLVVQGDPR 15 T 0.46 Inhibitor_I69 pdbhh F Eukaryota T 3wsz 2 B C 10-mer peptide AAAAAAAAAA 10 T 200 FAD_oxidored pdbhh F F 3wt5 2 B C MED1_HUMAN Mediator of RNA polymerase II transcription subunit 1 KNHPMLMNLLKDN 13 T 3.8 Gb3_synth pdbhh F Eukaryota T 3wt6 2 B C MED1_HUMAN Mediator of RNA polymerase II transcription subunit 1 KNHPMLMNLLKDN 13 T 3.8 Gb3_synth pdbhh F Eukaryota T 3wt7 2 B C MED1_HUMAN Mediator of RNA polymerase II transcription subunit 1 KNHPMLMNLLKDN 13 T 3.8 Gb3_synth pdbhh F Eukaryota T 3wtq 2 B C MED1_HUMAN Mediator of RNA polymerase II transcription subunit 1 KNHPMLMNLLKDN 13 T 3.8 Gb3_synth pdbhh F Eukaryota T 3wut 2 C,F,I,L C,F,I,L TEX14_HUMAN PROTEIN KINASE-LIKE PROTEIN SGK307, SUGEN KINASE 307, TESTIS-EXPRESSED SEQUENCE 14, TESTIS-EXPRESSED SEQUENCE 14 PROTEIN DLAVGPPSLNYIPP 14 T 1.6 Topo_C_assoc pdbhh F Eukaryota T 3wuu 2 C,F,I,L C,F,I,L TEX14_HUMAN TEX-14 DLAVGPPSLNYPGY 14 T 13 CitX pdbhh F Eukaryota T 3wuv 2 C,F,I,L,O,R C,F,I,L,O,R PDC6I_HUMAN ALG-2-INTERACTING PROTEIN X, ALIX DQAQGPPYPTYIPP 14 T 1.5 N1221 pdbhh F Eukaryota T 3wv0 2 C,D X,Y GB_HHV1K GB, GB-1, GB1 GPATPAP 7 T 5.4 DUF765 pdbhh T Viruses F 3ww1 1 A,B A,B L0N3Y0_9CELL L-ribose isomerase HHHHHHGSTRTAISRREYDEWLSEAASLARALRYPVTPEMVNDSAGIVFGDDQYEAFAHGLWSREPYEVMVILESLNEPAVDGLPAAGAAHAEYSGLCDKLMIVHPGKFCPPHFHQRKTESYEVVLGEMEVFYAPEPVTVGDDDVLSFSPMPEGSPWPEGVALPAGREDSYAGLTSYVRLRAGDPKFVMHRKHLHAFRCPADSPVPLVVREVSTYSHEPTEHAHDKAAPLPQWRGLHDNTFVAEAANSGRLATAIA 256 T 0.00016 Cupin_2 unp F Bacteria T 3ww2 1 A,B A,B L0N3Y0_9CELL L-ribose isomerase HHHHHHGSTRTAISRREYDEWLSEAASLARALRYPVTPEMVNDSAGIVFGDDQYEAFAHGLWSREPYEVMVILESLNEPAVDGLPAAGAAHAEYSGLCDKLMIVHPGKFCPPHFHQRKTESYEVVLGEMEVFYAPEPVTVGDDDVLSFSPMPEGSPWPEGVALPAGREDSYAGLTSYVRLRAGDPKFVMHRKHLHAFRCPADSPVPLVVREVSTYSHEPTEHAHDKAAPLPQWRGLHDNTFVAEAANSGRLATAIA 256 T 0.00016 Cupin_2 unp F Bacteria T 3ww3 1 A,B A,B L0N3Y0_9CELL L-ribose isomerase HHHHHHGSTRTAISRREYDEWLSEAASLARALRYPVTPEMVNDSAGIVFGDDQYEAFAHGLWSREPYEVMVILESLNEPAVDGLPAAGAAHAEYSGLCDKLMIVHPGKFCPPHFHQRKTESYEVVLGEMEVFYAPEPVTVGDDDVLSFSPMPEGSPWPEGVALPAGREDSYAGLTSYVRLRAGDPKFVMHRKHLHAFRCPADSPVPLVVREVSTYSHEPTEHAHDKAAPLPQWRGLHDNTFVAEAANSGRLATAIA 256 T 0.00016 Cupin_2 unp F Bacteria T 3ww4 1 A,B A,B L0N3Y0_9CELL L-ribose isomerase HHHHHHGSTRTAISRREYDEWLSEAASLARALRYPVTPEMVNDSAGIVFGDDQYEAFAHGLWSREPYEVMVILESLNEPAVDGLPAAGAAHAEYSGLCDKLMIVHPGKFCPPHFHQRKTESYEVVLGEMEVFYAPEPVTVGDDDVLSFSPMPEGSPWPEGVALPAGREDSYAGLTSYVRLRAGDPKFVMHRKHLHAFRCPADSPVPLVVREVSTYSHEPTEHAHDKAAPLPQWRGLHDNTFVAEAANSGRLATAIA 256 T 0.00016 Cupin_2 unp F Bacteria T 3wx4 1 A A ARN_BPT4 ANTI-RGL NUCLEASE MIIDSQSVVQYTFKIDILEKLYKFLPNLYHSIVNELVEELHLENNDFLIGTYKDLSKAGYFYVIPAPGKNIDDVLKTIMIYVHDYEIEDYFELEHHHHHH 100 T 2.3 DUF3198 unphh T Viruses T 3wxa 2 C,D C,D SC31A_HUMAN ABP125, ABP130, SEC31-LIKE PROTEIN 1, SEC31-RELATED PROTEIN A, WEB1-LIKE PROTEIN NPPPPGFIMHGN 12 T 1.6 DUF2173 pdbhh F Eukaryota T 3wyd 1 A,B A,B A0A0A6YVN5_9ZZZZ LC-Est1C MGSSHHHHHHSSGLVPRGSHMPYRLYVPTTYDGTKAFPLVIALHGMGGDENSYFDSYQRGAFMIEAENRGYIVACPKGRQPASMYVGPAERDVMDVIAEVRRDYKIDPDRIYMTGHSMGGYGTWSIAMNHPDVFAALAPVAGGGNPLGMANIAHIPQLVVHGDNDKTVPVERSRVMVEAAKKHGTEIKYIEIPGGDHVSVAARTFKDVFDWFDSHKRKRPAAKAATNK 228 T 2E-08 Peptidase_S9 unp F unclassified sequences T 3x0t 1 A,B A,B PIRA MSNNIKHETDYSHDWTVEPNGGVTEVDSKHTPIIPEVGRSVDIENTGRGELTIQYQWGAPFMAGGWKVAKSHVVQRDETYHLQRPDNAFYHQRIVVINNGASRGFCTIYYHLEHHHHHH 119 T 3.4 DUF916 pdbhh F T 3zbe 1 A A Q8XAD5_ECO57 PAAA2 MDYKDDDDKNRALSPMVSEFETIEQENSYNEWLRAKVATSLADPRPAIPHDEVERRMAERFAKMRKERSKQ 71 T 0.059 HAGH_C pdb F Bacteria T 3zbi 3 AA,C,DA,F,GA,I,JA,L,MA,O,PA,R,U,X a,C,d,F,g,I,j,L,m,O,p,R,U,X Q46702_ECOLX TRAN PROTEIN MRSLLLMGVLLISACSSGHKPPPEPDWSNTVPVNKTIPVDTQGGRNES 48 T 0.0023 LPAM_1 pdbhh F Bacteria T 3zd0 1 A A Q9WLK8_9HEPC P7 PROTEIN GPLGSPEFAAMDYKDDDDKALENLVVLNAASVAGAHGILSFLVFFCAAWYIKGRLAPGAAYAFYGVWPLLLLLLALPPRAYAAAAS 86 T 120 GBV-C_env pdbhh T Viruses T 3zdi 2 B B AXIN1_HUMAN AXIS INHIBITION PROTEIN 1, HAXIN VEPQKFAEELIHRLEAVQ 18 T 0.19 Cwf_Cwc_15 unppssm F Eukaryota T 3zdl 2 B B AB1IP_HUMAN APBB1-INTERACTING PROTEIN 1, PROLINE-RICH EVH1 LIGAND 1, PREL-1, PROLINE-RICH PROTEIN 73, RAP1-GTP-INTERACTING ADAPTER MOLECULE, RIAM, RETINOIC ACID-RESPONSIVE PROLINE-RICH PROTEIN 1, RARP-1 MGESSEDIDQMFSTLLGEMDLLTQSLGVDTLY 32 T 0.86 Drf_DAD pdbhh F Eukaryota T 3zdy 5 H,I I,J RGD PEPTIDE GRGDSP 6 T 21 Topoisom_I_N pdbhh F F 3zdz 5 H,I I,J RGD PEPTIDE GRGDSP 6 T 21 Topoisom_I_N pdbhh F F 3ze0 5 H,I I,J RGD PEPTIDE GRGDSP 6 T 21 Topoisom_I_N pdbhh F F 3ze1 5 H,I I,J RGD PEPTIDE GRGDSP 6 T 21 Topoisom_I_N pdbhh F F 3ze2 5 H,I I,J RGD PEPTIDE GRGDSP 6 T 21 Topoisom_I_N pdbhh F F 3zfw 2 C,D X,Y PKHM2_HUMAN PH DOMAIN-CONTAINING FAMILY M MEMBER 2, SALMONELLA-INDUCED FILAMENTS A AND KINESIN-INTERACTING PROTEIN, SIFA AND KINESIN-INTERACTING PROTEIN MGSSHHHHHHSSGLVPRGSHMTNLEWDDSAITGSTGSTGSTGSHM 45 T 0.71 CLSTN_C pdbhh F Eukaryota T 3zg5 2 C,D C,D PEPTIDOGLYCAN ANALOGUE AXKXX 5 T 230 OAM_dimer pdbhh F F 3zgc 2 C C NF2L2_HUMAN NF-E2-RELATED FACTOR 2, NFE2-RELATED FACTOR 2, HEBP1, NUCLEAR FACTOR, ERYTHROID DERIVED 2, LIKE 2, NEH2-DERIVED PEPTIDE GDEETGE 7 T 12 Phi29_Phage_SSB pdbhh F Eukaryota F 3zgh 1 A A A0A0H2URK1_STRPN PNEUMOCOCCAL SERINE RICH REPEAT PROTEIN, SRRP HHHHHHSGNTIVNGAPAINASLNIAKSETKVYTGEGVDSVYRVPIYYKLKVTNDGSKLTFTYTVTYVNPKTNDLGNISSMRPGYSIYNSGTSTQTMLTLGSDLGKPSGVKNYITDKNGRQVLSYNTSTMTTQGSGYTWGNGAQMNGFFAKKGYGLTSSWTVPITGTDTSFTFTPYAARTDRIGINYFNGGGKVVESSTTSQSLSQ 205 T 0.26 FlgD_ig pdbpercent F Bacteria T 3zgi 1 A,B,C A,B,C A0A0H2URK1_STRPN CELL WALL SURFACE ANCHOR FAMILY PROTEIN HHHHHHSGNTIVNGAPAINASLNIAKSETKVYTGEGVDSVYRVPIYYKLKVTNDGSKLTFTYTVTYVNPKTNDLGNISSMRPGYSIYNSGTSTQTMLTLGSDLGKPSGVKNYITDKNGRQVLSYNTSTMTTQGSGYTWGNGAQMNGFFAKKGYGLTSSWTVPITGTDTSFTFTPYAARTDRIGINYFNGGGKVVESSTTSQSLSQ 205 T 0.26 FlgD_ig pdbpercent F Bacteria T 3zha 2 E,F,G,H,I,J,M,N,O,R,S,T E,F,G,H,I,J,M,N,O,R,S,T COLLAGEN MODEL PEPTIDE 18-T8R11 XPPGPPGPTGPRGPPGPPX 19 T 0.0098 Collagen pdbpssm F F 3zhe 1 A,C A,C G5ECF1_CAEEL PROTEIN SMG-5 MQKSDEVTEKFKRYCNQLEKYGQTENVHSPVMAMLRRKGRKQLIEIMKRDGDCTSSINKLWIVGYYHPFQFFIRDKEKNMAIAVLLTMFCGELQEMLSLPDDKYPALWNMYIGDFHRYMPDEEIQKCLAVGYYSRAIDLDPNQGRAFHVLAGLRADLNVAQKLRLMILGQLADAPYKKGTELLEYLKFPQKESTDKLMVDFVIWALNEKSKRMDYQMTGIKIVNEFKAEIEQKLEFDWSLIMSTCRLASKLAMKKFGFQQFYNCFDTISTLYITIYSRTISSKCLLAEAISWISDSAEILGHLDEQKNEPHFQKLSVFAKTKWNELNDLVMNHINSVFTSMSLTINPSISMTSFLLNGPISEPNVEFLSQLINYLVSVEFPPMEIIHDREESGPLLRRINQSEQKRLDIQIKTQNDEVNR 420 T 2.8E-05 EST1_DNA_bind unphh F Eukaryota T 3zin 2 B,C B,C DDX21_MOUSE DEAD BOX PROTEIN 21, GU-ALPHA, NUCLEOLAR RNA HELICASE GU, NUCLEOLAR RNA HELICASE II, RH II/GU SRGQKRSFSKAFGQ 14 T 2.7 DUF1413 pdbhh F Eukaryota T 3zio 2 B,C B,C A28NLS IGRKRGYSVAFG 12 T 2.2 TrbH pdbhh F T 3zip 2 B,C B,C A58NLS WAGRKRTWRDAF 12 T 3.2 DUF5419 pdbhh F T 3ziq 2 B,C B,C B6NLS SSHRKRKFSDAF 12 T 2.7 Mating_C pdbhh F T 3zir 2 B,C B,C B141NLS RQRKRKWSEAF 11 T 0.25 DUF3020 pdbhh F T 3zke 2 B,D,F,H,J,L B,D,F,H,J,L NEK9_HUMAN NEK9 VGMHSKGTQTA 11 T 0.21 KASH_CCD unppssm F Eukaryota T 3zkf 2 B,D,F,H,J,L B,D,F,H,J,L NEK9_HUMAN NEK9 PROTEIN VGMHSKGTQTA 11 T 0.21 KASH_CCD unppssm F Eukaryota T 3zkt 1 A A CT5A_CONCN TAU-CNVA ECCHRQLLCCLRFVX 15 T 1.4 DUF488 unphh F Eukaryota T 3zld 2 B B RON22_TOXGM RHOPTRY NECK PROTEIN 2 GSASDIAQFLTDSGMKAIEDCSWNPIMQQMACVVVAGSGS 40 T 0.064 DUF4040 unp F Eukaryota T 3zlj 2 C,D C,D MUTS_ECOLI DNA MISMATCH REPAIR PROTEIN MUTS PNAAATQVDGTQMSLLSVPEETSPAVEALENLDPRSLTPRQALEWIYRLKSLV 53 T 2.3 DUF5830 pdbhh F Bacteria T 3zmn 1 A,B A,B C8CHL5_9VIRU VP17 MGVFDRIRGALGRGLDVFRGDLPQVQPPAPQPAPAPAITPAAVQVGGWGFAWIDNEDFSPTGLAWRSGEYFALAQMKTPETAHFRIAAQERRLRIYLRGQKVVNGRNLSDPDSRTVNLPFLMQTPQGAPTLPSTYHPDVAVWAKVGSTWQPCVITAINYSTGDVTFTEPAGVTASDGIEIYYVHGDGQFRLRVARDAGGVDDSAATVFNQSFSTMHSVDQNNVETMIAWPQQVELVPGTRLVLEVFTTQVPMVWNERSGHYIQIAAMGRRIEVLDKGGLQRLAELEARGGL 291 T 0.063 DEC-1_N pdbpercent T Viruses T 3zmn 2 C,D C,D VP17 XXXXXXXX 8 F F F 3zmo 1 A A C8CHL4_9VIRU VP16 MQEAFNRIKALRPGARPATILRSGPEFSVYSGTQRVKVGEFVVPAGASWVLPNPVPVILKLYDTGGNQLPHTTDVFLAKRTKGFDFPEFLAKVQYASYYDLTEAQLRDAKFYQNILQTLSPLRAPQPPQGVVLREGDVLEVYVEAPAGVTVNLNDPRTRIELPIGVDNSNPTL 173 T 2.6 TraI_2B pdbhh T Viruses T 3zmp 2 C,D C,D SRC_HUMAN SRC-DERIVED PEPTIDE, PROTO-ONCOGENE C-SRC, PP60C-SRC, P60-SRC EPQXQPGENL 10 T 5.5 Leader_Erm pdbhh F Eukaryota T 3zmq 2 B C SRC_HUMAN PROTO-ONCOGENE C-SRC, PP60C-SRC, P60-SRC EAQXQPGENL 10 T 4.5 Leader_Erm pdbhh F Eukaryota T 3zmt 3 C C PEPTIDE PRSFLV 6 T 14 IBV_3A pdbhh F T 3zmu 3 C C PKSFLV PEPTIDE PKSFLV 6 T 14 IBV_3A pdbhh F T 3zmv 3 C,D C,D PKSFLV PEPTIDE PLSFLV 6 T 1.3 Ntox5 pdbhh F F 3zmz 3 C C PEPTIDE PRSFAV 6 T 34 tRNA-synt_1c_C pdbhh F T 3zn0 3 C C PEPTIDE PRSFAA 6 T 8.3 NUC205 pdbhh F F 3zn1 3 C C PEPTIDE PRLYLV 6 T 6.3 UPF0300 pdbhh F F 3zn4 1 A A C8CHL4_9VIRU VP16 MQEAFNRIKALRPGARPATILRSGPEFSVYSGTQRVKVGEFVVPAGASWVLPNPVPVILKLYDTGGNQLPHTTDVFLAKRTKGFDFPEFLAKVQYASYYDLTEAQLRDAKFYQNILQTLSPLRAPQPPQGVVLREGDVLEVYVEAPAGVTVNLNDPRTRIELPIGVDNSNPTL 173 T 2.6 TraI_2B pdbhh T Viruses T 3zn5 1 A,B,C,D,E,F,G,H A,B,C,D,E,F,G,H C8CHL4_9VIRU VP16 MQEAFNRIKALRPGARPATILRSGPEFSVYSGTQRVKVGEFVVPAGASWVLPNPVPVILKLYDTGGNQLPHTTDVFLAKRTKGFDFPEFLAKVQYASYYDLTEAQLRDAKFYQNILQTLSPLRAPQPPQGVVLREGDVLEVYVEAPAGVTVNLNDPRTRIELPIGVDNSNPTL 173 T 2.6 TraI_2B pdbhh T Viruses T 3zn6 1 A A C8CHL5_9VIRU VP17 MGVFDRIRGALGRGLDVFRGDLPQVQPPAPQPAPAPAITPAAVQVGGWGFAWIDNEDFSPTGLAWRSGEYFALAQMKTPETAHFRIAAQERRLRIYLRGQKVVNGRNLSDPDSRTVNLPFLMQTPQGAPTLPSTYHPDVAVWAKVGSTWQPCVITAINYSTGDVTFTEPAGVTASDGIEIYYVHGDGQFRLRVARDAGGVDDSAATVFNQSFSTMHSVDQNNVETMIAWPQQVELVPGTRLVLEVFTTQVPMVWNERSGHYIQIAAMGRRIEVLDKGGLQRLAELEARGGL 291 T 0.063 DEC-1_N pdbpercent T Viruses T 3zn6 2 B B C8CHL4_9VIRU VP16 MQEAFNRIKALRPGARPATILRSGPEFSVYSGTQRVKVGEFVVPAGASWVLPNPVPVILKLYDTGGNQLPHTTDVFLAKRTKGFDFPEFLAKVQYASYYDLTEAQLRDAKFYQNILQTLSPLRAPQPPQGVVLREGDVLEVYVEAPAGVTVNLNDPRTRIELPIGVDNSNPTL 173 T 2.6 TraI_2B pdbhh T Viruses T 3zn8 5 E S DAP2_YEAST DPAP B, YSCV GIILVLLIWGTVLL 14 T 2 DUF4808 pdbhh F Eukaryota T 3zni 2 B,F,J,N B,F,J,N ZAP70_HUMAN 70 KDA ZETA-CHAIN ASSOCIATED PROTEIN, SYK-RELATED TYROSINE KINASE TLNSDGXTPEPA 12 T 0.99 Galanin pdbhh F Eukaryota T 3zoq 2 B,C B,C P56_BPPH2 LEFT END OF BACTERIOPHAGE PHI-29 CODING FOR 15 POTENTIAL PROTEINS AMONG THESE ARE THE TERMINAL PROTEIN AND THE PROTEINS ENCODED BY THE GENES 1,2 (SUS), 3, AND (PROBABLY) 4 MVQNDFVDSYDVTMLLQDDDGKQYYEYHKGLSLSDFEVLYGNTADEIIKLRLDKVL 56 T 0.55 GhoS unphh T Viruses T 3zpe 1 A A Q2TLC1_9ADEN TURKEY ADENOVIRUS TYPE 3 FIBRE MGSSHHHHHHSSGLVPRGSHMASMTGGQQMGRGSEFASIGPPTTMVTGTVSPGRATNGQFVTKTAKVLRYKFVRWDALLIIQFIDNIGVMENPTFYRNKSIELRSADFLSPMLNNTYIVPLNGGVRVESPTIPVQLEVILENNSSFIQVGFVRLTVKNGNPHMIIQCNPVPGNIKMIKIKSVMLFTCLIG 190 T 67 DUF1491 pdbhh T Viruses T 3zpf 1 A A Q2TLC1_9ADEN TURKEY ADENOVIRUS TYPE 3 FIBRE MGSSHHHHHHSSGLVPRGSHMASMTGGQQMGRGSEFASIGPPTTMVTGTVSPGRATNGQFVTKTAKVLRYKFVRWDALLIIQFIDNIGVMENPTFYRNKSIELRSADFLSPMLNNTYIVPLNGGVRVESPTIPVQLEVILENNSSFIQVGFVRLTVKNGNPHMIIQCNPVPGNIKMIKIKSVMLFTCLIG 190 T 67 DUF1491 pdbhh T Viruses T 3zpv 1 A,BA,C,DA,E,FA,G,HA,I,JA,L,N,P,R,T,V,X,Z 0,R,2,T,4,V,6,X,8,Z,B,D,F,H,J,L,N,P BCL9_DROME PROTEIN LEGLESS, PROTEIN LEGLESS GAMANHIFVFSTQLANKGAESVLSGQFQTIIAYHCTQ 37 T 6.7 Csm2_III-A pdbhh F Eukaryota T 3zq8 1 A,B,C,D A,B,C,D GRAMICIDIN D XGAXAXVXWXWXWXWX 16 T 4.6 MAP17 pdbhh F F 3zqf 2 B C ANTI-INDUCER PEPTIDE TAP1 KASEGLARVAALARSR 16 T 54 Spond_N pdbhh F T 3zqg 2 B C ANTI-INDUCER PEPTIDE TAP2 TGERGRWQVWGLAKRC 16 T 3.5 DUF5691 pdbhh F T 3zqh 2 B C INDUCER PEPTIDE TIP3 KKESRVVVWRLPPLH 15 T 1.2 MHC_II_alpha pdbhh F T 3zqi 2 C,D C,D INDUCER PEPTIDE TIP2 DDSVLAARARMWMWHW 16 T 4.7 Metal_hydrol pdbhh F T 3zrj 2 C,D X,Y Q9KN57_VIBCH VIPB KKWAQGSLLDEIMAQTRCKK 20 T 0.054 DMP12 unp F Bacteria T 3zs6 2 B B OLIGOPEPTIDE DVA DVA 3 T 430 Antimicrobial20 pdbhh F F 3zwz 2 B B Q8IKV6_PLAF7 RON2 DITQQAKDIGAGPVASCFTTRMSPPQQICLNSVVNTALS 39 T 3.1 zf-XS pdbhh F Eukaryota T 3zx7 1 A A TXL_EISFE LYSENIN SAKAAEGYEQIEVDVVAVWKEGYVYENRGSTSVDQKITITKGMKNVNSETRTVTATHSIGSTISTGDAFEIGSVEVSYSHSHEESQVSMTETEVYESKVIEHTITIPPTSKFTRWQLNADVGGADIEYMYLIDEVTPIGGTQSIPQVITSRAKIIVGRQIILGKTEIRIKHAERKEYMTVVSRKSWPAATLGHSKLFKFVLYEDWGGFRIKTLNTMYSGYEYAYSSDQGGIYFDQGTDNPKQRWAINKSLPLRHGDVVTFMNKYFTRSGLCYDDGPATNVYCLDKREDKWILEVVGLVPRGSGHHHHHH 309 T 0.027 Toxin_10 unphh F Eukaryota T 3zxd 1 A,B,C,D A,B,C,D TXL_EISFE EFL1 SAKAAEGYEQIEVDVVAVWKEGYVYENRGSTSVDQKITITKGMKNVNSETRTVTATHSIGSTISTGDAFEIGSVEVSYSHSHEESQVSMTETEVYESKVIEHTITIPPTSKFTRWQLNADVGGADIEYMYLIDEVTPIGGTQSIPQVITSRAKIIVGRQIILGKTEIRIKHAERKEYMTVVSRKSWPAATLGHSKLFKFVLYEDWGGFRIKTLNTMYSGYEYAYSSDQGGIYFDQGTDNPKQRWAINKSLPLRHGDVVTFMNKYFTRSGLCYDDGPATNVYCLDKREDKWILEVVGLVPRGSGHHHHHH 309 T 0.027 Toxin_10 unphh F Eukaryota T 3zxg 1 A,B A,B TXL_EISFE EFL1 SAKAAEGYEQIEVDVVAVWKEGYVYENRGSTSVDQKITITKGMKNVNSETRTVTATHSIGSTISTGDAFEIGSVEVSYSHSHEESQVSMTETEVYESKVIEHTITIPPTSKFTRWQLNADVGGADIEYMYLIDEVTPIGGTQSIPQVITSRAKIIVGRQIILGKTEIRIKHAERKEYMTVVSRKSWPAATLGHSKLFKFVLYEDWGGFRIKTLNTMYSGYEYAYSSDQGGIYFDQGTDNPKQRWAINKSLPLRHGDVVTFMNKYFTRSGLCYDDGPATNVYCLDKREDKWILEVVGLVPRGSGHHHHHH 309 T 0.027 Toxin_10 unphh F Eukaryota T 3zxu 2 B,D B,D CTF19_KLULA CTF19 MDFTSSSGVLDSERNTGSNDSDEPSSHSDVIETEELKLIKLQEHKNNLLRQRSELLDQLSQTRVVEPRSVQLDDKLLLKLLRRNDNAVSDSSQSSNNPLPRVLPSLNIEQRKKYLDITLNDVTVTCEKDMILLRKGSFTASFRIAVENESIRSMAIDLNAFEVELQPIIQYAEDTQNVNVAMMAVVQFLRIKELHEQMISKIVEASKFIRASNNTITLNDLEVSFHCYWNLPSPYPETLILTNKVQKILDFLIYQYGIQLGVIKYGSTII 270 T 0.0022 CENP-P pdbhh F Eukaryota T 3zyb 2 I,J,K I,J,N GALA-LYS-PRO-LEUNH2 KPLX 4 T 330 EHD_N pdbhh F F 3zzy 2 C,D C,D RAVR1_MOUSE RAVER1, PROTEIN RAVER-1 GAMGPGVSLLGAPPKD 16 T 2.9 Ste5 pdbhh F Eukaryota T 3zzz 2 C,D C,D RAVR1_MOUSE RAVER1, PROTEIN RAVER-1 GAMGSSEGLLGLGPGP 16 T 0.49 DUF6027 pdbhh F Eukaryota T 4a0i 2 C,D C,D SGO1_HUMAN HSGO1, SEROLOGICALLY DEFINED BREAST CANCER ANTIGEN NY-BR-85 AKERC 5 T 67 DUF5980 pdbhh F Eukaryota F 4a1t 2 C,D C,D CP5-46-A PEPTIDE GELGRLVYLLDGPGYDPIHCD 21 T 0.79 DUF5685 pdbhh F T 4a1v 2 C,D C,D CP5-46A-4D5E GELDELVYLLDGPGYDPIHS 20 T 2.4 IreB pdbhh F T 4a1x 2 C,D C,D CP5-46-A PEPTIDE GELGRLVYLLDGPGYDPIHCD 21 T 0.79 DUF5685 pdbhh F T 4a2a 2 C,D C,D FTSZ_THEMA CELL DIVISION PROTEIN FTSZ EGDIPAIYRYGLEGLL 16 T 1.6 DUF3510 pdbhh F Bacteria T 4a3v 3 E E LINKER XXXXXXXXXX 10 F F F 4a4b 2 B B ZAP70_HUMAN 70 KDA ZETA-ASSOCIATED PROTEIN, SYK-RELATED TYROSINE KINASE, ZAP-70 TLNSDGXTPEPA 12 T 0.99 Galanin pdbhh F Eukaryota T 4a4c 2 B B ZAP70_HUMAN 70 KDA ZETA-ASSOCIATED PROTEIN, SYK-RELATED TYROSINE KINASE, ZAP-70 TLNSDGXTPEPA 12 T 0.99 Galanin pdbhh F Eukaryota T 4a4m 2 B B GNAT1_BOVIN GACT PEPTIDE, GUSTDUCIN ALPHA-3 CHAIN ILENLKDCGLF 11 T 0.75 Phage_holin_4_1 pdbhh F Eukaryota T 4a54 2 B B DCP2_SCHPO DCP2 GATTKEKNISVDVDADASSQLLSLLKSSTAPSDLATPQPSTFPQPPVESHSS 52 T 0.17 DUF1869 pdbpercent F Eukaryota T 4a5x 2 C,D C,D CHM1A_HUMAN CHROMATIN-MODIFYING PROTEIN 1A, CHMP1A, VACUOLAR PROTEIN SORTING-ASSOCIATED PROTEIN 46-1, VPS46-1, HVPS46-1 SHMEDQLSRRLAALRN 16 T 6.1 DUF4549 pdbhh F Eukaryota T 4a62 2 C,D C,D STBB_ECOLX PARR FROM PLASMID R1 EQKSDEETKKNAMKLIN 17 T 0.12 Nexin_C unppssm F Bacteria T 4a94 2 C,D C,D MCPI_NERVS CARBOXYPEPTIDASE INHIBITOR FHVPDDRPCINPGRCPLVPDATCTFVCKAADNDFGYECQHVWTFEGQRVGCYA 53 T 0.88 NPBW pdbhh F Eukaryota T 4aa1 2 B P ANGT_HUMAN ANGIOTENSIN II, ANG II DRVYIHPF 8 T 0.67 Nairo_nucleo pdbhh F Eukaryota T 4aa2 2 B P BNP_GLOBL POTENTIATOR B EGLPPRPKIPP 11 T 0.12 UPF0449 pdbhh F Eukaryota T 4aai 1 A,B A,B Q6TRU9_9VIRU ORF E73 MVESKKIAKKKTTLAFDEDVYHTLKLVSVYLNRDMTEIIEEAVVMWLIQNKEKLPNELKPKIDEISKRFFPAK 73 T 0.0004 Omega_Repress unphh T Viruses T 4abi 2 B B SFTI1_HELAN PTA-SFTI INHIBITOR GRCTKSIXICFPD 13 T 0.052 Bowman-Birk_leg unp F Eukaryota T 4abj 2 B B SFTI1_HELAN ICA-SFTI INHIBITOR, SFTI-1 GRCTKSXPICFPD 13 T 0.052 Bowman-Birk_leg unp F Eukaryota T 4aid 2 D,E,F F,G,H Q9A749_CAUCR RNASE E TAPPEKPRRGWWRR 14 T 0.24 Leader_Trp pdbhh F Bacteria T 4aif 2 C,D D,E HS90A_HUMAN HEAT SHOCK 86 KDA, HSP 86, HSP86, RENAL CARCINOMA ANTIGEN NY-REN-38 SRMEEVD 7 T 26 CAP_N pdbhh F Eukaryota T 4aim 2 B C Q9A749_CAUCR RNASE E TAPPEKPRRGWWRR 14 T 0.24 Leader_Trp pdbhh F Bacteria T 4air 2 C C FGA-FGA-FGA-FGA-FGA XXXXX 5 T 200 Myb_DNA-binding pdbhh F F 4air 3 D D FGA-FGA-FGA-FGA XXXX 4 T 250 Herpes_LP pdbhh F F 4aj5 1 A,AA,B,BA,C,CA,D,DA,Y,Z 1,W,2,X,3,Y,4,Z,U,V SKA3_HUMAN SKA3 MDPIRSFCGKLRSLASTLDCETARLQRALDGEESDFEDYPMRILYDLHSEVQTLKDDINILLDKARLENQEGIDFIKATKVLMEKNSMDIMKIREYFQKYG 101 T 0.0074 RNA_pol_RpbG pdb F Eukaryota T 4ak4 2 B,D,F,H,J,L,N,P B,D,F,H,J,L,N,P LECB4_ARTIN JACALIN BETA-4 CHAIN NEQSGISQTVIVGPWGAQVST 21 T 0.96 DUF3842 pdbhh F Eukaryota T 4akb 2 B,D,F,H B,D,F,H LECB4_ARTIN CHAMPEDAK GALACTOSE BINDING LECTIN BETA CHAIN NEQSGISQTVIVGPWGAQVST 21 T 0.96 DUF3842 pdbhh F Eukaryota T 4akc 2 B,D,F,H B,D,F,H LECB4_ARTIN JACALIN BETA-4 CHAIN, CHAMPEDAK GALACTOSE BINDING LECTIN NEQSGISQTVIVGPWGAQVST 21 T 0.96 DUF3842 pdbhh F Eukaryota T 4akt 2 C C SUBSTRATE ANALOGUE VGAPIPFPAYDG 12 T 2.8 Ibs_toxin pdbhh F T 4amq 1 A A YL544_MIMIV L544 SYYHHHHHHLESTSLYKKAGLRMLIFTYKLERYIKNKILPKILVVPDRDKYQIKGSFRRRIPYITDIDIVNNVHPEYDDTNIYQRIVDLINSFTNDNQIKLIYVICGTDDRFLLTEYSDEEIEKIKILLNPTELVELNNVLSKYQDDLNKKVFYINEIIWDLYKLRWTSSEVLAGKKILRGGIEVSFQDVVKNNSILLLQYFVKIEYYPIGFDIAVRYKPINLITAYQNAAFYQLKLANYSKEYYFMLFPLRFYFKNDPTISKQLEYIIETKFGLYKQLLVRIDSYRTIYESGNLDLDTAKSIIISIIKDIRKLNGIDMNIIDKIQEVSNNSAGQDKIIAWNTLLTQLYTNINKSVNKQSKKYFTRYINIIPKEDRKLCCLEEEHVLQSGGINFESTNFLTKKKLIY 407 T 0.42 NPV_P10 unppssm T Viruses T 4ams 1 A A G5CQN7_9VIRU MG662 GSSHHHHHHSLEVLFQGPGSLIYTYKLEKYVRTKIFPKILLIPDKNRYIIKGSFRRRVPFVTDIDVVNNVYPEISRENIYDEIIKLVNNIQSDPNIILAYLSCGTDERFKISTGSSKELSNIQSLLPDNEKNEFQLVLNKYYNDQQKKLFFLNELIWDHYKLRWKPEDVLIGSMNLANNVSVNFRETVENNSTILLQYYVKLGSYPVGIDVVINYQKIDLTPAYKNAALYQLQLANYSREYYYMLFPLRYYFKNNQDISQRLENIIEKKYGLYKQLMVRIDDYHTLYKSGNLKIDMATNIVIGILRDIEKLPGFESDTIYQIKKVATNNSPSIKIEEWDILLKVLYQEINTAVNNKSRKYFYRYIAMVPPQDRSKNYISENQDMRLKMVN 390 T 0.26 DNA_pol_B_palm unp T Viruses T 4aom 2 B T MYOA_PLAF7 PFM-A, MYOA KNIPSLLRVQAHIRKKMV 18 T 0.22 BORCS8 pdbhh F Eukaryota T 4aph 2 B P ANGT_HUMAN ANGIOTENSIN II, ANG II DRVYIHPF 8 T 0.67 Nairo_nucleo pdbhh F Eukaryota T 4apj 2 B P BNP_GLOBL POTENTIATOR B QGLPPRPKIPP 11 T 1.5 UPF0449 pdbhh F Eukaryota T 4apo 2 C,D D,E TOM20_HUMAN TOMM20 C-TERMINAL PEPTIDE, MITOCHONDRIAL 20 KDA OUTER MEMBRANE PROTEIN, OUTER MITOCHONDRIAL MEMBRANE RECEPTOR TOM20 AEDDVE 6 T 8.2 RinB pdbhh F Eukaryota F 4apr 2 B I PEPSTATIN-LIKE RENIN INHIBITOR XHPFHXLF 8 T 0.028 DUF5372 pdbhh F F 4ar2 2 B,B10,B11,B12,B13,B14,B15,B16,B17,B18,B19,B2,B20,B21,B22,B23,B24,B25,B26,B27,B28,B29,B3,B30,B31,B32,B33,B34,B35,B36,B37,B38,B39,B4,B40,B41,B42,B43,B44,B45,B46,B47,B48,B49,B5,B50,B51,B52,B53,B54,B55,B56,B57,B58,B59,B6,B60,B7,B8,B9 P,P,P,P,P,P,P,P,P,P,P,P,P,P,P,P,P,P,P,P,P,P,P,P,P,P,P,P,P,P,P,P,P,P,P,P,P,P,P,P,P,P,P,P,P,P,P,P,P,P,P,P,P,P,P,P,P,P,P,P SPIKE_ADE02 PIV MKRARPSEDTFNPVYPYDTEC 21 T 0.34 DUF5449 pdbhh T Viruses T 4ar8 2 C,D C,D ISOAMYL-PHOSPHONYL-GLY-PRO-ALA XGPA 4 T 310 ACTL7A_N pdbhh F F 4arf 2 B B ISOAMYLPHOSPHONYL-GLY-PRO-ALA XGPA 4 T 310 ACTL7A_N pdbhh F F 4art 1 A,B A,B Y273_ATV STRUCTURAL PROTEIN ORF273 MGEKITEEREFQSISEIPEEEIDATNDEEKLADIVENEIEKEIRKSKTRKCKTIENFYYYILRDGKIYPASDYDIEVEKGKRSANDIYAFVETDVTRDFDEFLFDIDYGLPSISDILKFYLEKAGFRIANEVPTPNLKYYIHAVVEFGEDRPQYLAVNIYDIDSLARALRIPQIVEQKLGNKPRTITADEFNDIERIVAEEQPILAGYTYDEALRIPYHYYVDHNNSFKDDALKIAHAYLQLFPTPYQVCYEWKARWFNKIDCLKLERLKPSSHHHHHH 279 T 3.9 Transglut_core2 unphh T Viruses T 4ats 1 A A Y273_ATV STRUCTURAL PROTEIN ORF273 MGSSHHHHHHSSGLVPRGSHMGEKITEEREFQSISEIPEEEIDATNDEEKLADIVENEIEKEIRKSKTRKCKTIENFYYYILRDGKIYPASDYDIEVEKGKRSANDIYAFVETDVTRDFDEFLFDIDYGLPSISDILKFYLEKAGFRIANEVPTPNLKYYIHAVVEFGEDRPQYLAVNIYDIDSLARALRIPQIVEQKLGNKPRTITADEFNDIERIVAEEQPILAGYTYDEALRIPYHYYVDHNNSFKDDALKIAHAYLQLFPTPYQVCYEWKARWFNKIDCLKLERLKPSS 293 T 3.9 Transglut_core2 unphh T Viruses T 4au7 2 C C H4_MOUSE HISTONE H4 PEPTIDE RHRKVLRDY 9 T 0.27 UPF0137 unp F Eukaryota T 4auc 2 B B PEPSTATIN A XVVXAX 6 T 1700 FAM60A pdbhh F F 4aui 2 D,E,F D,E,F POLY ALA AAAAAAAA 8 T 280 Androgen_recep pdbhh F F 4aw9 2 B I YVAD-CMK XYVADX 6 T 300 Rhodanese_C pdbhh F F 4awa 2 B I YVAD-CMK XYVADX 6 T 300 Rhodanese_C pdbhh F F 4awb 2 C,D I,J Z-ALA-ALA-AZAASN-CHLOROMETHYLKETONE XAAXX 5 T 3500 LisH pdbhh F F 4axg 2 C,D C,D CUP_DROME OSKAR RIBONUCLEOPROTEIN COMPLEX 147 KDA SUBUNIT STGIHKPGSLRAPKAVRPTTAPVVSSKPVKSYTRSRLMDIRNGMFNALMHRSKESFVMPRIATCDDIELEGRLRRMNIWRTSDGTRFRTRSTTANLNMNNNNNNECMPAFFKNKNKPNLISDESIIQSQP 130 T 6.9E-21 EIF4E-T unppercent F Eukaryota T 4axy 1 A,B,C A,B,C HSP47 BINDING COLLAGEN-LIKE PEPTIDE XPPGPPGPTGPRGPPGPPGX 20 T 0.0037 Collagen pdbpssm F F 4ay5 2 E,F,G,H I,J,K,L TAB1_HUMAN GTAB1TIDE PVSVPYSSAQS 11 T 9.2 YABBY pdbhh F Eukaryota T 4ay6 2 E,F,G,H E,F,G,H TAB1_HUMAN MITOGEN-ACTIVATED PROTEIN KINASE KINASE KINASE 7-INTERACTING PROTEIN 1, TGF-BETA-ACTIVATED KINASE 1-BINDING PROTEIN 1, TAK1-BINDING PROTEIN 1 PVSVPYXSAQSTS 13 T 13 DUF4128 pdbhh F Eukaryota T 4az0 2 B B PPGB_HUMAN CARBOXYPEPTIDASE C, CARBOXYPEPTIDASE L, CATHEPSIN A, PROTECTIVE PROTEIN CATHEPSIN A, PPCA, PROTECTIVE PROTEIN FOR BETA-GALACTOSIDASE MDPPCTNTTAASTYLNNPYVRKALNIPEQLPQWDMCNFLVNLQYRRLYRSMNSQYLKLLSSQKYQILLYNGDVDMACNFMGDEWFVDSLNQKMEVQRRPWLVKYGDSGEQIAGFVKEFSHIAFLTIKGAGHMVPTDKPLAAFTMFSRFLNKQPYE 155 F F Eukaryota T 4az3 2 B B PPGB_HUMAN LYSOSOMAL PROTECTIVE PROTEIN CARBOXYPEPTIDASE C, CARBOXYPEPTIDASE L, CATHEPSIN A, PROTECTIVE PROTEIN CATHEPSIN A, PPCA, PROTECTIVE PROTEIN FOR BETA-GALACTOSIDASE MDPPCTNTTAASTYLNNPYVRKALNIPEQLPQWDMCNFLVNLQYRRLYRSMNSQYLKLLSSQKYQILLYNGDVDMACNFMGDEWFVDSLNQKMEVQRRPWLVKYGDSGEQIAGFVKEFSHIAFLTIKGAGHMVPTDKPLAAFTMFSRFLNKQPYE 155 F F Eukaryota T 4aza 2 B,D B,D IF4G1_HUMAN EIF4G1_D5S PEPTIDE XKKRYSREFLLGFX 14 T 0.00025 eIF_4G1 unphh F Eukaryota T 4aze 2 D,E,F E,F,G DUAL SPECIFICITY YAK1-RELATED KINASE, HP86, PROTEIN KINASE MINIBRAIN HOMOLOG, MNBH, HMNB XSXX 4 T 2300 EF-hand_5 pdbhh F F 4b18 2 B B TERT_HUMAN HEST2, TELOMERASE CATALYTIC SUBUNIT, TELOMERASE-ASSOCIATED PROTEIN 2, TP2 RRRGGSASRSLPLPKRPRRA 20 T 19 KN_motif pdbhh F Eukaryota T 4b2u 1 A A KNO67_HEXDO S67 GTYCIELGERCPNPREGDWCCHKCVPEGKRFYCRDQ 36 T 1.2 Conotoxin pdbhh F Eukaryota T 4b2v 1 A A KNO64_HEXDO S64 SECVENGGFCPDPEKMGDWCCGRCIRNECRNG 32 T 2.5 Conotoxin pdbhh F Eukaryota T 4b3b 2 B C FHTA TETRAPEPTIDE XFHTAX 6 T 290 Archease pdbhh F F 4b45 2 B B CETZ2_HALVD CETZ2 MWHSDDLDDLLGSHHHHHH 19 T 84 Nop52 pdbhh F Archaea T 4b4n 2 B B CPSF6_HUMAN CPSF6, CLEAVAGE AND POLYADENYLATION SPECIFICITY FACTOR 68 KDA SUBUNIT, CFIM68, CPSF 68 KDA SUBUNIT, PRE-MRNA CLEAVAGE FACTOR IM 68 KDA SUBUNIT, PROTEIN HPBRII-4/7 PVLFPGQPFGQPPLG 15 T 2.2 MF_alpha pdbhh F Eukaryota T 4b4p 1 A,B A,B Q47212_ECOLX FEDF NSSASSAQVTGTLLGTGKTNTTQMPALYTWQHQIYNVNFIPSSSGTLTCQAGTILVWKNGRETQYALECRVSIHHSSGSINESQWGQQSQVGFGTACGNKKCRFTGFEISLRIPPNAQTYPLSSGDLKGSFSLTNKEVNWSASIYVPAIAK 151 T 2.5 DUF2511 pdbhh F Bacteria T 4b4q 1 A,B A,B Q47212_ECOLX FEDF NSSASSAQVTGTLLGTGKTNTTQMPALYTWQHQIYNVNFIPSSSGTLTCQAGTILVWKNGRETQYALECRVSIHHSSGSINESQWGQQSQVGFGTACGNKKCRFTGFEISLRIPPNAQTYPLSSGDLKGSFSLTNKEVNWSASIYVPAIAK 151 T 2.5 DUF2511 pdbhh F Bacteria T 4b4r 1 A,B A,B Q47212_ECOLX FEDF, FIMBRIAL ADHESIN FEDF NSSASSAQVTGTLLGTGKTNTTQMPALYTWQHQIYNVNFIPSSSGTLTCQAGTILVWKNGRETQYALECRVSIHHSSGSINESQWGQQSQVGFGTACGNKKCRFTGFEISLRIPPNAQTYPLSSGDLKGSFSLTNKEVNWSASIYVPAIAK 151 T 2.5 DUF2511 pdbhh F Bacteria T 4b60 2 C,D C,D FIBG_HUMAN FIBRINOGEN GAMMA CHAIN GEGQQHHLGGAKQAGDV 17 T 23 Rhodopsin_N pdbhh F Eukaryota T 4b7e 2 B B CONSENSUS ANKYRIN REPEAT DOMAIN-LEU EVVKLLLEHGADVLAQD 17 T 0.00035 Shigella_OspC pdbhh F T 4b7t 2 B B AXIN1_HUMAN AXIS INHIBITION PROTEIN 1, HAXIN VEPQKFAEELIHRLEAVQ 18 T 0.19 Cwf_Cwc_15 unppssm F Eukaryota T 4b8o 2 B,C B,C A7XWN5_SV40 SV40TAGNLS GSPPKKKRKVG 11 T 0.42 ACTH_domain pdbhh T Viruses T 4b8p 2 C,D C,D A89NLS VHKTVLGKRKYW 12 T 0.11 MIER1_beta_C pdbhh F T 4b8y 2 B B VIRULENCE FACTOR GGGXXX 6 T 6.5 DUF3918 pdbhh F F 4b9w 2 C,D P,S PIWL2_MOUSE MILI GRAGPAGXGLVFR 13 T 21 RNR_Alpha pdbhh F Eukaryota T 4ba3 2 B B A89NLS VHKTVLGKRKYW 12 T 0.11 MIER1_beta_C pdbhh F T 4be5 1 A,B A,B RBMA MGSSHHHHHHSSGLVPRGSHMEVDCELQPVIEANLSLNQNQLASNGGYISSQLGIRNESCETVKFKYWLSIKGPEGIYFPAKAVVGVDTAQQESDALTDGRMLNVTRGFWVPEYMADGKYTVSLQVVAENGKVFKANQEFVKGVDLNSLPELNGLTIDIKNQFGINSVESTGGFVPFTVDLNNGREGEANVEFWMTAVGPDGLIIPVNAREKWVIASGDTYSKVRGINFDKSYPAGEYTINAQVVDIVSGERVEQSMTVVKK 262 T 0.015 BsuPI pdbhh F T 4be6 1 A,B A,B RBMA MGSSHHHHHHSSGLVPRGSHMEVDCELQPVIEANLSLNQNQLASNGGYISSQLGIRNESCETVKFKYWLSIKGPEGIYFPAKAVVGVDTAQQESDALTDGRMLNVTRGFWVPEYMADGKYTVSLQVVAENGKVFKANQEFVKGVDLNSLPELNGLTIDIKNQFGINSVESTGGFVPFTVDLNNGREGEANVEFWMTAVGPDGLIIPVNAREKWVIASGDTYSKVRGINFDKSYPAGEYTINAQVVDIVSGERVEQSMTVVKK 262 T 0.015 BsuPI pdbhh F T 4bea 2 B B STAPLED EIF4E INTERACTING PEPTIDE KKRYSRXQLLXLX 13 T 2.3 BURAN pdbhh F T 4bei 1 A,B,C,D,E,F,G,H A,B,C,D,E,F,G,H RBMA MGSSHHHHHHSSGLVPRGSHMEVDCELQPVIEANLSLNQNQLASNGGYISSQLGIRNESCETVKFKYWLSIKGPEGIYFPAKAVVGVDTAQQESDALTDGRMLNVTRGFWVPEYMADGKYTVSLQVVAENGKVFKANQEFVKGVDLNSLPELNGLTIDIKNQFGINSVESTGGFVPFTVDLNNGREGEANVEFWMTAVGPDGLIIPVNAREKWVIASGDTYSKVRGINFDKSYPAGEYTINAQVVDIVSGERVEQSMTVVKK 262 T 0.015 BsuPI pdbhh F T 4bey 2 B B GNAT1_BOVIN TRANSDUCIN ALPHA-1 CHAIN ILENLKDCGLF 11 T 0.75 Phage_holin_4_1 pdbhh F Eukaryota T 4bg6 2 C,D Q,R RND3_HUMAN PROTEIN MEMB, RHO FAMILY GTPASE 3, RHO-RELATED GTP-BINDING PROTEIN RHO8, RND3 DLRKDKAKSC 10 T 36 DUF6306 pdbhh F Eukaryota T 4bh6 2 I,J,K,L,M,N,O,P I,J,K,L,M,N,O,P ACM1_YEAST APC/C-CDH1 MODULATOR 1 AQFMLYEETAEERNIAVHRHNEIYNNNNSVSNENNPSQVKENLSPAKICPYERAFLREGGRIALKDLSVD 70 T 0.12 HPD unp F Eukaryota T 4bh7 3 C P DODECAPEPTIDE ANTIGEN PPYPAWHAPGNI 12 T 1.3 DUF3612 pdbhh F T 4bh8 3 C P DODECAPEPTIDE ANTIGEN GDPRPSYISHLL 12 T 1.7 Tom7 pdbhh F T 4bj1 1 A A RIF2_YEAST RAP1-INTERACTING FACTOR 2 GGGRVDHVFYQKFKSMALQELGTNYLSISYVPSLSKFLSKNLRSMKNCIVFFDKVEHIHQYAGIDRAVSETLSLVDINVVIIEMNDYLMKEGIQSSKSKECIESMGQASYSGQLDFEASEKPSNHTSDLMMMVMRKINNDESIDHIVYFKFEQLDKLSTSTIIEPSKLTEFINVLSVLEKSNNIAFKVLIYSNNVSISSLLSTSLKKKLNTKYTVFEMPILTCAQEQEYLKKMIKFTFDSGSKLLQSYNSLVTCQLNNKESNLAIFFEFLKVFPHPFTYLFNAYTEIIVQSRTFDELLDKIRNRLTIKNYPHSAYNFKK 319 T 0.3 PHP_C pdbpercent F Eukaryota T 4bj5 1 A,B A,B RIF2_YEAST RAP1-INTERACTING FACTOR 2 GGGRMEHVDSDFAPIRRSKKVVDSDKIVKAISDDLEQKNFTVLRKLNLVPIKKSVSSPKVCKPSPVKERVDHVFYQKFKSMALQELGTNYLSISYVPSLSKFLSKNLRSMKNCIVFFDKVEHIHQYAGIDRAVSETLSLVDINVVIIEMNDYLMKEGIQSSKSKECIESMGQASYSGQLDFEASEKPSNHTSDLMMMVMRKINNDESIDHIVYFKFEQLDKLSTSTIIEPSKLTEFINVLSVLEKSNNIAFKVLIYSNNVSISSLLSTSLKKKLNTKYTVFEMPILTCAQEQEYLKKMIKFTFDSGSKLLQSYNSLVTCQLNNKESNLAIFFEFLKVFPHPFTYLFNAYTEIIVQSRTFDELLDKIRNRLTIKNYPHSAYNFKKNQRLPLKLTRKVHDR 399 T 0.4 PHP_C pdbpercent F Eukaryota T 4bj5 3 D,F D,F RIF2_YEAST REPRESSOR/ACTIVATOR SITE-BINDING PROTEIN, SBF-E, TUF, RAP1 FTVLRKLNLVPIK 13 T 0.46 DUF5771 pdbhh F Eukaryota T 4bj6 1 A,B A,B RIF2_YEAST RAP1-INTERACTING FACTOR 2 GGGRNFTVLRKLNLVPIKKSVSSPKVCKPSPVKERVDHVFYQKFKSMALQELGTNYLSISYVPSLSKFLSKNLRSMKNCIVFFDKVEHIHQYAGIDRAVSETLSLVDINVVIIEMNDYLMKEGIQSSKSKECIESMGQASYSGQLDFEASEKPSNHTSDLMMMVMRKINNDESIDHIVYFKFEQLDKLSTSTIIEPSKLTEFINVLSVLEKSNNIAFKVLIYSNNVSISSLLSTSLKKKLNTKYTVFEMPILTCAQEQEYLKKMIKFTFDSGSKLLQSYNSLVTCQLNNKESNLAIFFEFLKVFPHPFTYLFNAYTEIIVQSRTFDELLDKIRNRLTIKNYPHSAYNFKKNQRLPLKLTRKVHDR 365 T 0.36 PHP_C pdbpercent F Eukaryota T 4bj6 3 D,F D,F RIF2_YEAST RAP1-INTERACTING FACTOR 2 FTVLRKLNLVPIK 13 T 0.46 DUF5771 pdbhh F Eukaryota T 4bjs 2 D D RIF1_YEAST RAP1-INTERACTING FACTOR 1, RAP1 INTERACTING FACTOR 1 PSLKLHFFSKKSRRLVARLRGFTPGDLNGISVEERRNLRIELLDFMMRLEYYSNRDNDMN 60 T 0.027 POB3_N pdbpercent F Eukaryota T 4bjt 2 D,E,F D,E,F RIF1_YEAST RAP1-INTERACTING FACTOR 1 ADISVLPEIRIPIFNSLKMQ 20 T 9.1 FTP pdbhh F Eukaryota T 4bl0 3 C,F C,F SP105_YEAST 105 KDA SPINDLE POLE COMPONENT PROTEIN DPTSMEMTEVFPRSIRQKN 19 T 5.2 MELT_2 pdbhh F Eukaryota T 4blb 2 E,F,G,H E,F,G,H GLI1_HUMAN TRANSCRIPTIONAL ACTIVATOR GL1, GLIOMA-ASSOCIATED ONCOGENE, ONCOGENE GLI TSPGGSYGHLSIGTMSP 17 T 96 Ntox11 pdbhh F Eukaryota T 4bld 2 E,F,G,H E,F,G,H GLI3_HUMAN GLI3 FORM OF 190 KDA, GLI3-190, GLI3 FULL LENGTH PROTEIN, G LI3FL, GLI3 C-TERMINALLY TRUNCATED FORM, GLI3 FORM OF 83 KDA, GLI3-8 GLI3 SSASGSYGHLSASAISP 17 T 9.3 Sulfakinin pdbhh F Eukaryota T 4blg 1 A,B A,B O41974_MHV68 IMMEDIATE-EARLY PROTEIN GPGYQKDPPKKYQGMRRHLQVTAPRLFDPEGHPPTHFKSAVMFSSTHPYTLNKLHKCIQSKHVLSTPVSCLPLVPGTTQQCVTYYLLSFVEDKKQAKKLKRVVLAYCEKYHSSVEGTIVKAKPYFPLPEPPTEPPTDPEQP 141 T 0.0001 EBV-NA1 pdbhh T Viruses T 4bp9 2 G,H,I,J,K,L G,H,I,J,K,L ANTIPAIN XRVX 4 T 41 Receptor_IA-2 pdbhh F F 4bpl 2 B B NUPL_XENLA NUCLEOPLASMIN NLS SAVKRPAATKKAGQAKKKKLD 21 T 0.0016 BSP_II unppercent F Eukaryota T 4bpq 1 A,B A,B CITS XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 318 F F F 4bqd 2 C,D C,D PEPTIDE XFQSKPNVHVDGYFERLXAKL 21 T 0.54 Pea-VEAacid pdbhh F T 4bqk 2 C,D C,D VIRD2_AGRFC VIRD2NLS LSKRPREDDDGEPSERKRER 20 T 4.7 ROKNT pdbhh F Bacteria T 4bt9 2 C C (PRO-PRO-GLY)3 PEPTIDE PPGPPGPPG 9 T 0.46 EKLF_TAD1 pdbhh F F 4bta 2 C C 9 RESIDUE PEPTIDE- PPGPPGPRPG PPGPPGPPG 9 T 0.46 EKLF_TAD1 pdbhh F F 4btb 2 B C POLY PROLINE PEPTIDE PPPPPPPPP 9 T 29 Adeno_E3_14_5 pdbhh F F 4btg 1 A,A10,A11,A12,A13,A14,A15,A16,A17,A18,A19,A2,A20,A21,A22,A23,A24,A25,A26,A27,A28,A29,A3,A30,A31,A32,A33,A34,A35,A36,A37,A38,A39,A4,A40,A41,A42,A43,A44,A45,A46,A47,A48,A49,A5,A50,A51,A52,A53,A54,A55,A56,A57,A58,A59,A6,A60,A7,A8,A9,B,B10,B11,B12,B13,B14,B15,B16,B17,B18,B19,B2,B20,B21,B22,B23,B24,B25,B26,B27,B28,B29,B3,B30,B31,B32,B33,B34,B35,B36,B37,B38,B39,B4,B40,B41,B42,B43,B44,B45,B46,B47,B48,B49,B5,B50,B51,B52,B53,B54,B55,B56,B57,B58,B59,B6,B60,B7,B8,B9 A,A,A,A,A,A,A,A,A,A,A,A,A,A,A,A,A,A,A,A,A,A,A,A,A,A,A,A,A,A,A,A,A,A,A,A,A,A,A,A,A,A,A,A,A,A,A,A,A,A,A,A,A,A,A,A,A,A,A,A,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B P1_BPPH6 CAPSID SUBUNIT OF THE BACTERIOPHAGE PHI6 GFNLKVKDLNGSARGLTQAFAIGELKNQLSVGALQLPLQFTRTFSASMTSELLWEVGKGNIDPVMYARLFFQYAQAGGALSVDELVNQFTEYHQSTACNPEIWRKLTAYITGSSNRAIKADAVGKVPPTAILEQLRTLAPSEHELFHHITTDFVCHVLSPLGFILPDAAYVYRVGRTATYPNFYALVDCVRASDLRRMLTALSSVDSKMLQATFKAKGALAPALISQHLANAATTAFERSRGNFDANAVVSSVLTILGRLWSPSTPKELDPSARLRNTNGIDQLRSNLALFIAYQDMVKQRGRAEVIFSDEELSSTIIPWFIEAMSEVSPFKLRPINETTSYIGQTSAIDHMGQPSHVVVYEDWQFAKEITAFTPVKLANNSNQRFLDVEPGISDRMSATLAPIGNTFAVSAFVKNRTAVYEAVSQRGTVNSNGAEMTLGFPSVVERDYALDRDPMVAIAALRTGIVDESLEARASNDLKRSMFNYYAAVMHYAVAHNPEVVVSEHQGVAAEQGSLYLVWNVRTELRIPVGYNAIEGGSIRTPEPLEAIAYNKPIQPSEVLQAKVLDLANHTTSIHIWPWHEASTEFAYEDAYSVTIRNKRYTAEVKEFELLGLGQRRERVRILKPTVAHAIIQMWYSWFVEDDRTLAAARRTSRDDAEKLAIDGRRMQNAVTLLRKIEMIGTTGIGASAVHLAQSRIVDQMAGRGLIDDSSDLHVGINRHRIRIWAGLAVLQMMGLLSRSEAEALTKVLGDSNALGMVVA 761 T 0.22 STAG pdb T Viruses T 4btp 1 A,B,C,D,E,F,G,H,I,J A,B,C,D,E,F,G,H,I,J Q9MC13_9VIRU p1 MSKLDLGRVDLLSMLGNSSSAGVDTAKGVIPFSTSGATWAVPRLSEDGITSHFLRRRGYVTMTQGGSRDQNAAVRKILSLIIAYDIQTQACFFISNEESMRITMAETMGVKDRPNARTNSWAEVSDSDINRGIAKALKEGNLTLDENQKDGFMKLVHAFVADILAQSGHYKPVTSVTYFSAPIDMESDYLDPFSIAIIRDVLDDSPFSELRYDARAMSELEDRDVPITRFSRVMAQMGNAMVRNIMVLNEAAQRKLRGLAVVGEIVHGRVRAPVRYLNDSFIQTLRSNINFHLLTRTTPERWAQSWIQAFGSLKGWVDAINGIADATTEEEKKKLAMQTSMDLELLSDLTPLIRDAATSVEKFVTFAPLSFYQGLGSVTQIRALDSSTNLAAVIVRYAAKEINLIPAYQSFQVPTVDVAVKKTAIMDQRLSLQLPEFSEDQFFGMLEQRMQNMSDSEVAELVDRIAKGETPFGDVVKQLPGTSTLLVTNGYYMGGLLTNEDKIIPGDASVPALLYMQAASFASSVRFPPGEYPVFHHESSNGDVRLTDQVSADAQLSHSAVETANPLNFLVACNVSVHTPSIAIDIIEPMPDLTRRGTTEYVHKGEIKVAAIPSLPPKSADRKAQVSRETAKFERVLYKARKGGAQVAAPIDLESLFGIAVNLAVPTVKHVYSPDSKTKLALDIIKGLESDGDKAAATRLLMTLARAYTGTYSSLALRRRDEITGIAAQPSDVAMQEFALQSGVQTLKAVAKHTGIMEVATIEMVEEKVRSLDDNRFYEIAAEVVLRALKGM 792 T 12 DUF445 pdbhh T Viruses T 4btq 1 A,A10,A11,A12,A13,A14,A15,A16,A17,A18,A19,A2,A20,A21,A22,A23,A24,A25,A26,A27,A28,A29,A3,A30,A31,A32,A33,A34,A35,A36,A37,A38,A39,A4,A40,A41,A42,A43,A44,A45,A46,A47,A48,A49,A5,A50,A51,A52,A53,A54,A55,A56,A57,A58,A59,A6,A60,A7,A8,A9,B,B10,B11,B12,B13,B14,B15,B16,B17,B18,B19,B2,B20,B21,B22,B23,B24,B25,B26,B27,B28,B29,B3,B30,B31,B32,B33,B34,B35,B36,B37,B38,B39,B4,B40,B41,B42,B43,B44,B45,B46,B47,B48,B49,B5,B50,B51,B52,B53,B54,B55,B56,B57,B58,B59,B6,B60,B7,B8,B9 A,A,A,A,A,A,A,A,A,A,A,A,A,A,A,A,A,A,A,A,A,A,A,A,A,A,A,A,A,A,A,A,A,A,A,A,A,A,A,A,A,A,A,A,A,A,A,A,A,A,A,A,A,A,A,A,A,A,A,A,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B P1_BPPH6 CAPSID SUBUNIT OF THE BACTERIOPHAGE PHI6 GFNLKVKDLNGSARGLTQAFAIGELKNQLSVGALQLPLQFTRTFSASMTSELLWEVGKGNIDPVMYARLFFQYAQAGGALSVDELVNQFTEYHQSTACNPEIWRKLTAYITGSSNRAIKADAVGKVPPTAILEQLRTLAPSEHELFHHITTDFVCHVLSPLGFILPDAAYVYRVGRTATYPNFYALVDCVRASDLRRMLTALSSVDSKMLQATFKAKGALAPALISQHLANAATTAFERSRGNFDANAVVSSVLTILGRLWSPSTPKELDPSARLRNTNGIDQLRSNLALFIAYQDMVKQRGRAEVIFSDEELSSTIIPWFIEAMSEVSPFKLRPINETTSYIGQTSAIDHMGQPSHVVVYEDWQFAKEITAFTPVKLANNSNQRFLDVEPGISDRMSATLAPIGNTFAVSAFVKNRTAVYEAVSQRGTVNSNGAEMTLGFPSVVERDYALDRDPMVAIAALRTGIVDESLEARASNDLKRSMFNYYAAVMHYAVAHNPEVVVSEHQGVAAEQGSLYLVWNVRTELRIPVGYNAIEGGSIRTPEPLEAIAYNKPIQPSEVLQAKVLDLANHTTSIHIWPWHEASTEFAYEDAYSVTIRNKRYTAEVKEFELLGLGQRRERVRILKPTVAHAIIQMWYSWFVEDDRTLAAARRTSRDDAEKLAIDGRRMQNAVTLLRKIEMIGTTGIGASAVHLAQSRIVDQMAGRGLIDDSSDLHVGINRHRIRIWAGLAVLQMMGLLSRSEAEALTKVLGDSNALGMVVA 761 T 0.22 STAG pdb T Viruses T 4bu0 2 B,C B,C RHP9_SCHPO RAD9 HOMOLOG, CRB2 GYGEVLVPETVAQHRT 16 T 1.7 RTC4 pdbhh F Eukaryota T 4bu1 2 C,D C,D RHP9_SCHPO RAD9 HOMOLOG, CRB2 GYGRVESTPPAFLP 14 T 1.5 DUF2104 pdbhh F Eukaryota T 4buz 2 B P P53_HUMAN ANTIGEN NY-CO-13, PHOSPHOPROTEIN P53, TUMOR SUPPRESSOR P53, P53 RHKXLMFK 8 T 15 DUF420 pdbhh F Eukaryota T 4bv2 2 C,D E,H P53_HUMAN DEACETYLATED P53-PEPTIDE, ANTIGEN NY-CO-13, PHOSPHOPROTEIN P53, TUMOR SUPPRESSOR P53, P53 STSRHKKLMFKTE 13 T 40 DUF420 pdbhh F Eukaryota T 4bwo 1 A,B A,B Q47212_ECOLX FEDF ADHESIN, FEDF NSSASSAQVTGTLLGTGKTNTTQMPALYTWQHQIYNVNFIPSSSGTLTCQAGTILVWKNGRETQYALECRVSIHHSSGSINESQWGQQSQVGFGTACGNKKCRFTGFEISLRIPPNAQTYPLSSGDLKGSFSLTNKEVNWSASIYVPAIAK 151 T 2.5 DUF2511 pdbhh F Bacteria T 4bwq 2 B,D,F,H B,D,F,H PQBP1_HUMAN PQBP-1,38 KDA NUCLEAR PROTEIN CONTAINING A WW DOMAIN, NPW38, POLYGLUTAMINE TRACT-BINDING PROTEIN 1, PQBP-1 KRNEAKTGADTTAAGPLFQQRPYPSPGAVLRANAEASRTKQQD 43 T 5 Cortexin unphh F Eukaryota T 4bws 2 B,E B,E PQBP1_HUMAN PQBP-1,38 KDA NUCLEAR PROTEIN CONTAINING A WW DOMAIN, NPW38, POLYGLUTAMINE TRACT-BINDING PROTEIN 1, PQBP-1 TGADTTAAGPLFQQRPYPSPGAVLRANAEASRTKQQD 37 T 5.4 REV pdbhh F Eukaryota T 4bx4 1 A,B A,B Q9MC13_9VIRU P1 MSKLDLGRVDLLSMLGNSSSAGVDTAKGVIPFSTSGATWAVPRLSEDGITSHFLRRRGYVTMTQGGSRDQNAAVRKILSLIIAYDIQTQACFFISNEESMRITMAETMGVKDRPNARTNSWAEVSDSDINRGIAKALKEGNLTLDENQKDGFMKLVHAFVADILAQSGHYKPVTSVTYFSAPIDMESDYLDPFSIAIIRDVLDDSPFSELRYDARAMSELEDRDVPITRFSRVMAQMGNAMVRNIMVLNEAAQRKLRGLAVVGEIVHGRVRAPVRYLNDSFIQTLRSNINFHLLTRTTPERWAQSWIQAFGSLKGWVDAINGIADATTEEEKKKLAMQTSMDLELLSDLTPLIRDAATSVEKFVTFAPLSFYQGLGSVTQIRALDSSTNLAAVIVRYAAKEINLIPAYQSFQVPTVDVGVKKTAIMDQRLSLQLPEFSEDQFFGMLEQRMQNMSDSEVAELVDRIAKGETPFGDVVKQLPGTSTLLVTNGYYMGGLLTNEDKIIPGDASVPALLYMQAASFASSVRFPPGEYPVFHHESSNGDVRLTDQVSADAQLSHSAVETANPLNFLVACNVSVHTPSIAIDIIEPMPDLTRRGTTEYVHKGEIKVAAIPSLPPKSADRKAQVSRETAKFERVLYKARKGGAQVAAPIDLESLFGIAVNLAVPTVKHVYSPDSKTKLALDIIKGLESDGDKAAATRLLMTLARAYTGTYSSLGLRRRDEITGIAAQPSDVAMQEFALQSGVQTLKAVAKHTGIMEVATIEMVEEKVRSLDDNRFYEIAAEVVLRALKGM 792 T 14 DUF445 pdbhh T Viruses T 4bxd 2 C,D C,D PEPTIDE AXXX 4 T 380 NSF pdbhh F F 4bxe 2 C,D C,D ANHYDROMURAMIC PEPTIDE XAXXXX 6 T 260 DUF2175 pdbhh F F 4bxu 2 B B PEX5_HUMAN PTS1 RECEPTOR, PTS1R, PTS1-BP, PEROXIN-5, PEROXISOMAL C-TERMINAL TARGETING SIGNAL IMPORT RECEPTOR, PEROXISOME RECEPTOR 1, PEX5 ASEDELVAEFLQDQN 15 T 2.3 DUF5748 pdbhh F Eukaryota T 4bxw 2 C F FA5_PSETE FACTOR V A2 PEPTIDE GNEEEEEDDGDIFADIFI 18 T 5.3 CAF1-p150_C2 pdbhh F Eukaryota T 4by8 1 A A PARACELSIN-X XXAXXAXAXQXVIXGXXPVIXXQQX 25 T 18 DUF3824 pdbhh F T 4c0u 4 D,D10,D11,D12,D13,D14,D15,D16,D17,D18,D19,D2,D20,D21,D22,D23,D24,D25,D26,D27,D28,D29,D3,D30,D31,D32,D33,D34,D35,D36,D37,D38,D39,D4,D40,D41,D42,D43,D44,D45,D46,D47,D48,D49,D5,D50,D51,D52,D53,D54,D55,D56,D57,D58,D59,D6,D60,D7,D8,D9 Y,Y,Y,Y,Y,Y,Y,Y,Y,Y,Y,Y,Y,Y,Y,Y,Y,Y,Y,Y,Y,Y,Y,Y,Y,Y,Y,Y,Y,Y,Y,Y,Y,Y,Y,Y,Y,Y,Y,Y,Y,Y,Y,Y,Y,Y,Y,Y,Y,Y,Y,Y,Y,Y,Y,Y,Y,Y,Y,Y FAB EV18 4 D6-1 F1 G9 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 217 F F F 4c0u 5 E,E10,E11,E12,E13,E14,E15,E16,E17,E18,E19,E2,E20,E21,E22,E23,E24,E25,E26,E27,E28,E29,E3,E30,E31,E32,E33,E34,E35,E36,E37,E38,E39,E4,E40,E41,E42,E43,E44,E45,E46,E47,E48,E49,E5,E50,E51,E52,E53,E54,E55,E56,E57,E58,E59,E6,E60,E7,E8,E9 Z,Z,Z,Z,Z,Z,Z,Z,Z,Z,Z,Z,Z,Z,Z,Z,Z,Z,Z,Z,Z,Z,Z,Z,Z,Z,Z,Z,Z,Z,Z,Z,Z,Z,Z,Z,Z,Z,Z,Z,Z,Z,Z,Z,Z,Z,Z,Z,Z,Z,Z,Z,Z,Z,Z,Z,Z,Z,Z,Z FAB EV18 4 D6-1 F1 G9 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 220 F F F 4c0y 4 D,D10,D11,D12,D13,D14,D15,D16,D17,D18,D19,D2,D20,D21,D22,D23,D24,D25,D26,D27,D28,D29,D3,D30,D31,D32,D33,D34,D35,D36,D37,D38,D39,D4,D40,D41,D42,D43,D44,D45,D46,D47,D48,D49,D5,D50,D51,D52,D53,D54,D55,D56,D57,D58,D59,D6,D60,D7,D8,D9 X,X,X,X,X,X,X,X,X,X,X,X,X,X,X,X,X,X,X,X,X,X,X,X,X,X,X,X,X,X,X,X,X,X,X,X,X,X,X,X,X,X,X,X,X,X,X,X,X,X,X,X,X,X,X,X,X,X,X,X FAB EV18 4 D6-1 F1 G9 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 217 F F F 4c0y 5 E,E10,E11,E12,E13,E14,E15,E16,E17,E18,E19,E2,E20,E21,E22,E23,E24,E25,E26,E27,E28,E29,E3,E30,E31,E32,E33,E34,E35,E36,E37,E38,E39,E4,E40,E41,E42,E43,E44,E45,E46,E47,E48,E49,E5,E50,E51,E52,E53,E54,E55,E56,E57,E58,E59,E6,E60,E7,E8,E9 Y,Y,Y,Y,Y,Y,Y,Y,Y,Y,Y,Y,Y,Y,Y,Y,Y,Y,Y,Y,Y,Y,Y,Y,Y,Y,Y,Y,Y,Y,Y,Y,Y,Y,Y,Y,Y,Y,Y,Y,Y,Y,Y,Y,Y,Y,Y,Y,Y,Y,Y,Y,Y,Y,Y,Y,Y,Y,Y,Y FAB EV18 4 D6-1 F1 G9 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 220 F F F 4c10 1 A,A10,A11,A12,A13,A14,A15,A16,A17,A18,A19,A2,A20,A21,A22,A23,A24,A25,A26,A27,A28,A29,A3,A30,A31,A32,A33,A34,A35,A36,A37,A38,A39,A4,A40,A41,A42,A43,A44,A45,A46,A47,A48,A49,A5,A50,A51,A52,A53,A54,A55,A56,A57,A58,A59,A6,A60,A7,A8,A9 4,4,4,4,4,4,4,4,4,4,4,4,4,4,4,4,4,4,4,4,4,4,4,4,4,4,4,4,4,4,4,4,4,4,4,4,4,4,4,4,4,4,4,4,4,4,4,4,4,4,4,4,4,4,4,4,4,4,4,4 EV19 5 C1-6 F1 C11 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 217 F F F 4c10 2 B,B10,B11,B12,B13,B14,B15,B16,B17,B18,B19,B2,B20,B21,B22,B23,B24,B25,B26,B27,B28,B29,B3,B30,B31,B32,B33,B34,B35,B36,B37,B38,B39,B4,B40,B41,B42,B43,B44,B45,B46,B47,B48,B49,B5,B50,B51,B52,B53,B54,B55,B56,B57,B58,B59,B6,B60,B7,B8,B9 5,5,5,5,5,5,5,5,5,5,5,5,5,5,5,5,5,5,5,5,5,5,5,5,5,5,5,5,5,5,5,5,5,5,5,5,5,5,5,5,5,5,5,5,5,5,5,5,5,5,5,5,5,5,5,5,5,5,5,5 EV19 5 C1-6 F1 C11 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 220 F F F 4c1a 1 A,B,C,D A,B,C,D Q3LG57_DANRE ZFL2-1 ORF1P GPAMEALELELEEVESQIRALVVRRSRLRERLLAVP 36 T 0.069 ABC_tran_CTD unppercent F Eukaryota T 4c2c 2 B B PEPTIDE1 AAA 3 T 1200 RNase_HII pdbhh F F 4c2c 3 C C PEPTIDE2 AVPA 4 T 280 CobS_N pdbhh F F 4c2d 2 E,F,G,H E,F,G,H PEPTIDE1 AASLSA 6 T 460 Alveol-reg_P311 pdbhh F F 4c2d 3 I M PEPTIDE2 AAPQA 5 T 290 Acp26Ab pdbhh F F 4c2d 4 J,K,L N,O,P PEPTIDE2 PQTA 4 T 190 DUF4322 pdbhh F F 4c2f 2 B B PEPTIDE1 AAA 3 T 1200 RNase_HII pdbhh F F 4c2f 3 C C PEPTIDE2 AAAASAA 7 T 300 Peptidase_Prp pdbhh F F 4c2g 2 B B PEPTIDE1 AAAA 4 T 900 Cyclin_C pdbhh F F 4c2g 3 C C CTPB_BACSU PEPTIDE VPA, CTPB, C-TERMINAL PROCESSING PROTEASE EMDKPQTAAVPA 12 T 0.26 Phyto-Amp unppssm F Bacteria T 4c31 3 C,F,G,H C,F,X,Y NUP1_YEAST NUCLEAR PORE PROTEIN NUP1, NUP1 GSPKKDKESIVLPTVGFDFIKDNETPSKKTSPKATS 36 T 0.81 zf-C2H2_assoc2 pdbhh F Eukaryota T 4c5a 2 C C PEPTIDE ENLYFQGA 8 T 4.7 RIP pdbhh F T 4c5e 2 E,F,G,H E,F,G,H PHO_DROME PROTEIN PLEIOHOMEOTIC, TRANSCRIPTION FACTOR YY1 HOMOLOG GAMASRRWEQKLVHIKTMEGEFSVTMWASGIS 32 T 0.24 zf-H2C2_2 unppssm F Eukaryota T 4c5g 2 B B PHO_DROME PROTEIN PLEIOHOMEOTIC, TRANSCRIPTION FACTOR YY1 HOMOLOG AGMASRRWEQKLVHIKTMEGEFSVTMWASGIS 32 T 0.092 INO80_Ies4 pdbpercent F Eukaryota T 4c5h 2 B B PHO_DROME PROTEIN PLEIOHOMEOTIC, TRANSCRIPTION FACTOR YY1 HOMOLOG GAMADINTEESGVVDKNSPFLTLGTTILNSNGKSRRWEQKLVHIKTMEGEFSVTMWASGISDDEYSGSDQIVGASDLLKGKEEFGIDGFTSQQNKEYQKMESKFTNAQTLEMPHPISSVQIMDHLIKERGNLSQE 135 T 0.24 BCL_N pdb F Eukaryota T 4c5i 2 C C TYY1_HUMAN DELTA TRANSCRIPTION FACTOR, INO80 COMPLEX SUBUNIT S, NF-E1, YIN AND YANG 1, YY-1, YY1 DPGNKKWEQKQVQIKTLEGEFSVTMWSSDE 30 T 0.98 INO80_Ies4 pdbhh F Eukaryota T 4c7b 2 B B PEPTIDE RHKX 4 T 360 STOP pdbhh F F 4c93 2 D,E D,E DPOA_YEAST DNA POLYMERASE I SUBUNIT A, DNA POLYMERASE ALPHA\: PRIMASE COMPLEX P180 SUBUNIT, DNA POLYMERASE-PRIMASE COMPLEX P180 SUBUNIT, POL ALPHA-PRIMASE COMPLEX P180 SUBUNIT, DNA POLYMERASE ALPHA CATALYTIC SUBUNIT IDNFDDILGEFES 13 T 0.8 DUF4927 pdbhh F Eukaryota T 4c95 2 D,E D,E SLD5_YEAST SLD5 MDINIDDILAELDKETTAV 19 T 0.43 Bombolitin pdbhh F Eukaryota T 4cat 1 A,B A,B CATALASE XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 659 F F F 4cay 3 C C AN32E_HUMAN LANP-LIKE PROTEIN, LANP-L, ANP32E GSHMEVGLSYLMKEEIQDEEDDDDYVEEGE 30 T 0.0014 BUD22 unp F Eukaryota T 4cc2 2 B,D B,D WASL_HUMAN N-WASP, N-WASP PPPALPSSAPSG 12 T 20 Ribosomal_S12 pdbhh F Eukaryota F 4cc3 2 B,D,F,H B,D,F,H ENAH_MOUSE NPC-DERIVED PROLINE-RICH PROTEIN 1, NDPP-1, MURINE MENA PPPPLPSGPAYA 12 T 3.9 FAF pdbhh F Eukaryota T 4cc7 2 B,D,F,H,J,L,N B,D,F,H,J,L,N WASL_HUMAN N-WASP, N-WASP PPPALPSSAPSG 12 T 20 Ribosomal_S12 pdbhh F Eukaryota F 4cc8 1 A,B,C A,B,C GP41 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 31 F F F 4cc8 2 D,E,G D,E,G GP120 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 344 F F F 4cc9 3 C C SAMH1_HUMAN DNTPASE, DENDRITIC CELL-DERIVED IFNG-INDUCED PROTEIN, DCIP, MONOCYTE PROTEIN 5, MOP-5, SAM DOMAIN AND HD DOMAIN-CONTAINING PR OTEIN 1, SAMHD1 MASWSHPQFEKGALEVLFQGPGYQDPQDGDVIAPLITPQKKEWNDSTSVQNPTRLREASKSRVQLFKDDPM 71 T 20 DUF3674 pdbhh F Eukaryota T 4cdr 2 E,F,G,H E,F,G,H GOBLIN1 XVTPVXTAX 9 T 25 RB_A pdbhh F F 4ce4 33 GA o MRPL52 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 56 F F F 4ce4 35 IA,JA v,w UNASSIGNED HELICES XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 91 F F F 4ce4 36 KA x THIOREDOXIN FOLD XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 85 F F F 4ce4 37 LA z UNASSIGNED HELICES XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 426 F F F 4cfh 3 C C AAPK1_RAT AMPK SUBUNIT ALPHA-1 FQVAPRPGSHTIEFFEMCANLIKILAQ 27 T 0.00049 AdenylateSensor pdbhh F Eukaryota T 4cg6 4 D D PEPTIDE VFIVSVGSFISVLFIVI 17 T 2 DUF5383 pdbhh F T 4cgq 2 B Q HS90A_HUMAN HEAT SHOCK 86 KDA, HSP 86, HSP86, RENAL CARCINOMA ANTIGEN NY-REN-38, HSP90 SRMEEVD 7 T 26 CAP_N pdbhh F Eukaryota T 4cgu 3 C C HS90A_HUMAN HSP90, HEAT SHOCK 86 KDA, HSP 86, HSP86, RENAL CARCINOMA ANTIGEN NY-REN-38 SRMEEVD 7 T 26 CAP_N pdbhh F Eukaryota T 4cgv 2 E,F E,F HS90A_HUMAN HEAT SHOCK 86 KDA, HSP 86, HSP86, RENAL CARCINOMA ANTIGEN NY-REN-38, HSP90 SRMEEVD 7 T 26 CAP_N pdbhh F Eukaryota T 4cgw 2 C,D C,D HS90A_HUMAN HEAT SHOCK 86 KDA, HSP 86, HSP86, RENAL CARCINOMA ANTIGEN NY-REN-38, HSP90 SRMEEVD 7 T 26 CAP_N pdbhh F Eukaryota T 4ch2 3 E,F P,Q GP1BA_HUMAN GP-IB ALPHA, GPIB-ALPHA, GPIBA, GLYCOPROTEIN IBALPHA, ANTIGEN CD42B-ALPHA, GPIBALPHA PEPTIDE GDTDLXDXXPEEDT 14 T 0.34 UPF0300 pdbhh F Eukaryota T 4ch8 3 I,J,K,L P,Q,R,S GP1BA_HUMAN GP-IB ALPHA, GPIB-ALPHA, GPIBA, GLYCOPROTEIN IBALPHA, ANTIGEN CD42B-ALPHA, GPIBALPHA PEPTIDE GDTDLXDXXPEEDT 14 T 0.34 UPF0300 pdbhh F Eukaryota T 4ch9 2 C,D C,D WNK4_HUMAN PROTEIN KINASE LYSINE-DEFICIENT 4, PROTEIN KINASE WITH NO LYSINE 4 EPEEPEADQHQ 11 T 2.8 AgrD pdbhh F Eukaryota T 4cha 1 A,D A,E CTRA_BOVIN ALPHA-CHYMOTRYPSIN A CGVPAIQPVLSGL 13 T 2 CaM_bind pdbhh F Eukaryota T 4chb 2 C,D C,D WNK4_HUMAN PROTEIN KINASE LYSINE-DEFICIENT 4, PROTEIN KINASE WITH NO LYSINE 4 EPEEPEADQHQ 11 T 2.8 AgrD pdbhh F Eukaryota T 4chx 2 C C NAG-ANHMUR-PENTAPEPTIDE AXXXX 5 T 230 OAM_dimer pdbhh F F 4cic 2 C Q HEXA-ALANINE PEPTIDE AAAAAA 6 T 340 UPF0253 pdbhh F F 4cih 1 A,B,C,D A,B,C,D LNTA_LISMO LNTA RPKLSTKDLALIKADLAEFEARELSSEKILKDTIKEESWSDLDFANDNINQMIGTMKRYQQEILSIDAIKRSSEASADTEAFKKIFKEWSEFKIERIQVTIDLLNGKKDSEAVFKKTYPNQIIFDDVRTNKLQTALNNLKVGYELLDSQK 150 T 0.042 DUF5697 pdbpercent F Bacteria T 4cii 1 A A O25272_HELPY CAG PATHOGENICITY ISLAND PROTEIN 18 EDITSGLKQLDSTYQETNQQVLKNLDEIFSTTSPSANNEMGEEDALNIKKAAIALRGDLALLKANFEANELFFISEDVIFKTYMSSPELLLTYMKINPLDQNTAEQQCGISDKVLVLYCEGKLKIEQEKQNIRERLETSLKAYQSNIGGTASLITASQTLVESLKNKNFIKGIRKLMLAHNKVFLNYLEELDALERSLEQSKRQYLQERQSSKIIVKLEHHHHHH 225 T 0.0046 IDO pdbpssm F Bacteria T 4ckq 2 B C 4 HISTIDINES FROM PROTEOLYSED HIS-TAG HHHH 4 T 76 Rubella_E2 pdbhh F F 4ckt 2 C,D C,D TELO2_MOUSE TELOMERE LENGTH REGULATION PROTEIN TEL2 HOMOLOG ELDSDDEFS 9 T 9.9 INCENP_ARK-bind pdbhh F Eukaryota F 4clq 2 B B BMS1_YEAST BMS1P WNIGKLIYMDNISPEECIRRWRGEDDDSKDESDIEEDVDDDFFRKKDGTVTKEGNKDHAVDLEKFVPYFDTFEKLAKKWKSVDAIKERFL 90 T 0.1 Sigma70_ner pdbpssm F Eukaryota T 4cqo 2 B,D B,D NANO1_HUMAN NOS-1, EC_REP1A FSSWNDYLGLATLITKA 17 T 2 DUF3243 pdbhh F Eukaryota T 4cse 2 C,D C,D TELO2_MOUSE TELOMERE LENGTH REGULATION PROTEIN TEL2 HOMOLOG SELDSDDEF 9 T 10 EBV-NA3 unphh F Eukaryota F 4cu4 2 B B MCJA_ECOLX MICROCIN MCCJ25, MCCJ25 GGAGHVPEYFVGIGTPISFYG 21 T 0.13 Endonuc-BglII unp F Bacteria T 4cu5 1 A,B,C,D,E,F A,B,C,D,E,F B6SBV8_9CAUD ENDOLYSIN MYKHTIVYDGEVDKISATVVGWGYNDGKILICDIKDYVPGQTQNLYVVGGGACEKISSITKEKFIMIKGNDRFDTLYKALDFINR 85 T 0.16 DUF1161 unp T Viruses T 4cvk 2 B B ALA-FGA-API AXX 3 T 620 DUF3392 pdbhh F F 4cvo 1 A A ERCC6_HUMAN ATP-DEPENDENT HELICASE ERCC6, COCKAYNE SYNDROME PROTEIN CSB SMEPSAQALELQGLGVDVYDQDVLEQGVLQQVDNAIHEASRASQLVDVEKEYRSVLDDLTSCTTSLRQINKIIEQLSPQ 79 T 0.0069 DegQ pdbpercent F Eukaryota T 4cvz 3 C C PEPTIDE YELDEKFDRL 10 T 0.4 HJURP_C pdbhh F T 4cw1 3 C,F C,F PEPTIDE SWFRKPMTR 9 T 2.7 Tenui_NS4 pdbhh F T 4cw8 1 A A Q0GF90_9ADEN TURKEY ADENOVIRUS TYPE 3 FIBRE MGSSHHHHHHSSGLVPRGSHMASMTGGQQMGRGSEFASIGPPTTMVTGTVSPGRATNGQFVTKTAKVLRYKFVRWDALLIIQFIDNIGVIENPTFYRNKSIELRSADFLSPTLNNTYIVPLNGGVRVESPTIPVQLEVILENNSSFIQVGFVRLTVKNGNPHMIIQCNPVPGNIKMIKIKSVMLFTCLIG 190 T 51 DUF1491 unphh T Viruses T 4cy1 2 C,D C,D KANL1_HUMAN KANSL1, MLL1/MLL COMPLEX SUBUNIT KANSL1, MSL1 HOMOLOG 1, HMSL1V1, NSL COMPLEX PROTEIN NSL1, NON-SPECIFIC LETHAL 1 HOMOLOG DGTCVAARTRPVLSY 15 T 5.3 DUF436 pdbhh F Eukaryota T 4cy2 2 B C KANL2_HUMAN KANSL2, NSL COMPLEX PROTEIN NSL2, NON-SPECIFIC LETHAL 2 HOMOLOG YEFSDDLDVVGDG 13 T 3.8 Rsa3 pdbhh F Eukaryota T 4cy2 3 C D KANL1_HUMAN KANSL1, MLL1/MLL COMPLEX SUBUNIT KANSL1, MSL1 HOMOLOG 1, HMSL1V1, NSL COMPLEX PROTEIN NSL1, NON-SPECIFIC LETHAL 1 HOMOLOG DGTCVAARTRPVLSY 15 T 5.3 DUF436 pdbhh F Eukaryota T 4cy3 2 B D A4V2Z1_DROME CG4699, ISOFORM D GSDYLCSRARPLVLSE 16 T 0.66 Papilloma_E5 pdbhh F Eukaryota T 4cy5 2 B C Q9VAF4_DROME NSL2, LD12439P YRDDDEIDVVSPH 13 T 0.068 Myc_N pdbhh F Eukaryota T 4cy5 3 C D A4V2Z1_DROME NSL1 GSDYLCSRARPLVLSE 16 T 0.66 Papilloma_E5 pdbhh F Eukaryota T 4cyd 2 E,F F,H PROBABLE EXPRESSION TAG AHHHHDYDIPTTENLYFQGHM 21 T 0.72 DUF5704 pdbhh F T 4cyj 2 E,F E,F PAN2_CHATD PAN2 GSMPLSSIGLPYYREPLFSAWPADIISDVGAPPLQLEPSFVATLKQAEWGLYGKNTRNVRRNQVEDTRNTNKQSNALQAPKFLSERARESALSSGGDSSSDPQVDQEPEDPNEIESLKP 119 T 0.74 PKI unppercent F Eukaryota T 4cyk 1 A A PAN3_YEAST PAB1P-DEPENDENT POLY(A)-NUCLEASE, PAN3P MDKINPDWAKDIPCRNITIYGYCKKEKEGCPFKHSDNTTAT 41 T 0.37 zf-CCCH unppercent F Eukaryota T 4czs 2 E,F,G,H E,F,G,H MAN-WYD GWYX 4 T 23 zf-C3HC pdbhh F F 4d07 2 B B MYO5A_HUMAN MYO5A GSHMSQKEAIQPKDDKNTMTDSTILLE 27 T 0.77 DUF2046 unphh F Eukaryota T 4d0b 3 C C PEPTIDE TAGQEDYDRL 10 T 8.5 CitT pdbhh F T 4d0c 3 C C 10MER PEPTIDE TAGQSNYDRL 10 T 1.2 SASA pdbhh F T 4d0d 3 C,F,I,L C,F,I,L Q9DG07_CHICK SLP-76 ADAPTOR PROTEIN VIFPAKSL 8 T 2.9 DcpS pdbhh F Eukaryota T 4d0t 2 G,H P,Z PEPTIDE DSTTPAPT 8 T 99 Tgi2PP pdbhh F F 4d0u 1 A,B,C,D A,B,C,D SPIKE_ADES1 SPIKE, PROTEIN IV, FIBER PROTEIN OF THE ATADENOVIRUS SNAKE ADENOVIRUS 1 GSSHHHHHHSSGLVPRGSHMASMTGGQQMGRGSPSPPSKTSLDIAEELQNDKGVSFAFQAREEELGAFTKRTLFAYSGDGLTGPFKAPASAELSSFLTAHPKGRWLIAFPLGTGIVSVDEGIMTMEISRSLPEVGSGSSFYLTEK 145 T 0.1 BRK pdb T Viruses T 4d0v 1 A,B,C,D A,B,C,D SPIKE_ADES1 SPIKE, PROTEIN IV, FIBER PROTEIN OF THE ATADENOVIRUS SNAKE ADENOVIRUS 1 MGSSHHHHHHSSGLVPRGSHMASMTGGQQMGRIPSPPSKTSLDIAEELQNDKGVSFAFQAREEELGAFTKRTLFAYSGDGLTGPFKAPASAELSSFLTAHPKGRWLIAFPLGTGIVSVDEGILTLEISRSLPEVGSGSSFYLTEK 145 T 0.1 BRK pdb T Viruses T 4d0z 2 G,H X,Y Q6FI18_HUMAN PEPTIDE STCPAA 6 T 2.7 DUF6083 pdbhh F Eukaryota F 4d11 3 G,H,I,J,K L,O,P,X,Z Q6FI18_HUMAN PEPTIDE STCPAA 6 T 2.7 DUF6083 pdbhh F Eukaryota F 4d1f 1 A,B,C,D,E,F,G,H,I,J,K,L A,B,C,D,E,F,G,H,I,J,K,L SPIKE_ADES1 SPIKE, PROTEIN IV, FIBER PROTEIN OF THE ATADENOVIRUS SNAKE ADENOVIRUS 1 GSSHHHHHHSSGLVPRGSHMASMTGGQQMGRGSPSPPSKTSLDIAEELQNDKGVSFAFQAREEELGAFTKRTLFAYSGDGLTGPFKAPASAELSSFLTAHPKGRWLIAFPLGTGIVSVDEGILTLEISRSLPEVGSGSSFYLTEK 145 T 0.1 BRK pdb T Viruses T 4d1g 1 A,B,C,D,E,F,G,H,I,J,K,L A,B,C,D,E,F,G,H,I,J,K,L SPIKE_ADES1 SPIKE, PROTEIN IV, FIBER PROTEIN OF THE ATADENOVIRUS SNAKE ADENOVIRUS 1 GSSHHHHHHSSGLVPRGSHMASMTGGQQMGRGSPSPPSKTSLDIAEELQNDKGVSFAFQAREEELGAFTKRTLFAYSGDGLTGPFKAPASAELSSFLTAHPKGRWLIAFPLGTGIVSVDEGILTLEISRSLPEVGSGSSFYLTEK 145 T 0.1 BRK pdb T Viruses T 4d2d 2 B B ALANINE-TRIPEPTIDE AAA 3 T 1200 RNase_HII pdbhh F F 4d49 2 C,D,G,H C,D,G,H POLY ARG DECAPEPTIDE RRRRRRRRRR 10 T 24 Adeno_PX pdbhh F F 4d62 1 A A Q2TLC1_9ADEN TURKEY ADENOVIRUS TYPE 3 FIBRE MGSSHHHHHHSSGLVPRGSHMASMTGGQQMGRGSEFGPPTTMVTGTVSPGRATNGQFVTKTAKVLRYKFVRWDALLIIQFIDNIGVMENPTFYRNKSIELRSADFLSPMLNNTYIVPLNGGVRVESPTIPVQLEVILENNSSFIQVGFVRLTVKNGNPHMIIQCNPVPGNIKMIKIKSVMLFTCLIG 187 T 65 DUF1491 pdbhh T Viruses T 4d63 1 A A Q2TLC1_9ADEN TURKEY ADENOVIRUS TYPE 3 FIBRE MGSSHHHHHHSSGLVPRGSHMASMTGGQQMGRGSEFGPPTTMVTGTVSPGRATNGQFVTKTAKVLRYKFVRWDALLIIQFIDNIGVMENPTFYRNKSIELRSADFLSPMLNNTYIVPLNGGVRVESPTIPVQLEVILENNSSFIQVGFVRLTVKNGNPHMIIQCNPVPGNIKMIKIKSVMLFTCLIG 187 T 65 DUF1491 pdbhh T Viruses T 4d69 2 M,N,O,P,Q,R,S,T,U,V,W,X O,P,Q,R,S,T,U,V,W,X,Y,Z SHORT ANTIGEN PEPTIDE APDTR 5 T 220 DUF3796 pdbhh F F 4d8i 2 B B ACE-AEIK-CHO ALDEHYDE (BOUND FORM) XAEIX 5 T 850 Maelstrom pdbhh F F 4day 3 C C FANCM_HUMAN PROTEIN FACM, ATP-DEPENDENT RNA HELICASE FANCM, FANCONI ANEMIA-ASSOCIATED POLYPEPTIDE OF 250 KDA, FAAP250, PROTEIN HEF ORTHOLOG GHMEDIFDCSRDLFSVTFDLGFCSPDSDDEILEHTSD 37 T 9.4 EDR2_C pdbhh F Eukaryota T 4dc2 2 B Z PARD3_RAT PAR-3, PARD-3, ATYPICAL PKC ISOTYPE-SPECIFIC-INTERACTING PROTEIN, ASIP, ATYPICAL PKC-SPECIFIC-BINDING PROTEIN, ASBP DPVLAFQREGFGRQSMSEKRTKQFSNAS 28 T 0.19 LamB_YcsF pdbhh F Eukaryota T 4dcj 3 C,F C,F Caspase Inhibitor AC-DEVD-CHO XDEVX 5 T 570 Helicase_RecD pdbhh F F 4dco 3 C,F C,F Caspase Inhibitor AC-DEVD-CHO XDEVX 5 T 570 Helicase_RecD pdbhh F F 4dcp 3 C,F C,F Caspase Inhibitor AC-DEVD-CHO XDEVX 5 T 570 Helicase_RecD pdbhh F F 4dfw 2 B D Peptide XXLHSTX 7 T 600 MIR pdbhh F F 4dgb 2 B B GAG_HV2RO capsid protein PGPLPA 6 T 0.0001 Gag_p24 unphh T Viruses F 4dgc 2 F,G,H,I,J F,G,H,I,J cyclosporin A XXXXXXXXVXA 11 T 9.7 DUF6090 pdbhh F F 4dig 2 B B TRFL_BOVIN C-terminal peptide from Lactotransferrin LEACAF 6 T 9.3 T6SS_TssF pdbhh F Eukaryota F 4djc 2 B B SCN5A_HUMAN HH1, SODIUM CHANNEL PROTEIN CARDIAC MUSCLE SUBUNIT ALPHA, SODIUM CHANNEL PROTEIN TYPE V SUBUNIT ALPHA, VOLTAGE-GATED SODIUM CHANNEL SUBUNIT ALPHA NAV1.5 SNAQKKYYNAMKKLGSKKPQKPIPRPLNKYQGFIF 35 T 12 Prp18 pdbhh F Eukaryota T 4djs 2 B B stapled peptide RRWPQ(MK8)ILD(MK8)HVRRVWR RRWPQXILDXHVRRVWR 17 T 0.00013 Axin_b-cat_bind pdb F T 4dkt 2 B B N-ACETYL-L-THREONYL-L-ALPHA-ASPARTYL-N5-[(1E)-2-FLUOROETHANIMIDOYL]-L-ORNITHINAMIDE, TDFA XTDXX 5 T 2200 zf-RING_11 pdbhh F F 4dm9 2 C,D X,Y Tripeptide fluoromethyl ketone inhibitor Z-VAE(OMe)-FMK XVAXX 5 T 2600 zf-met pdbhh F F 4dmi 1 A,B,C,D,E A,B,C,D,E Capsid Protein ASQQFRIDSESIRDKLNTLLPSQSRGSIGVDLSGSTTIIPVVDLTETAEGGAQREDLQKAFTLINTIDFDVENTTTTIANTPGFYKVVGNLSSRDEASGAIAVIEVTDGITTKILANNRIVSPDGTTAVQSVPVPFDLMVKLVAGDTLQARSNNAEVRVQGIARQIADVSGNLINP 176 T 27 DUF5606 pdbhh F T 4dny 1 A A STCE_ECO57 MUCINASE, NEUTRAL ZINC METALLOPROTEASE STCE, SECRETED PROTEASE OF C1 ESTERASE INHIBITOR FROM EHEC GSHMASHLDGVPEGGIDFTPHNGTKKIINTVAEVNKLSDASGSSIHSHLTNNALVEIHTANGRWVRDIYLPQGPDLEGKMVRFVSSAGYSSTVFYGDRKVTLSVGNTLLFKYVNGQWFRSGELENN 126 T 0.32 CMV_1a_C pdb F Bacteria T 4dor 2 C,D C,D NR0B2_HUMAN ORPHAN NUCLEAR RECEPTOR SHP, SMALL HETERODIMER PARTNER ASRPAILYALLSSS 14 T 9.2 NR_Repeat unphh F Eukaryota T 4dow 2 C,D C,D H4_MOUSE Histone H4 GAKRHRKVLRDN 12 T 0.27 UPF0137 unp F Eukaryota T 4dqm 2 B,D B,D NCOA1_HUMAN NCOA-1, CLASS E BASIC HELIX-LOOP-HELIX PROTEIN 74, BHLHE74, PROTEIN HIN-2, RIP160, RENAL CARCINOMA ANTIGEN NY-REN-52, STEROID RECEPTOR COACTIVATOR 1, SRC-1 KSLLQQLLTE 10 T 7.3 E3_UbLigase_RBR pdbhh F Eukaryota T 4drw 2 E,F E,F AHNK_HUMAN DESMOYOKIN GKVTFPKMKIPKFTFSGREL 20 T 11 DUF5476 pdbhh F Eukaryota T 4ds1 2 B,D B,D NU159_YEAST NUCLEAR PORE PROTEIN NUP159 NYAESGIQTDL 11 T 4.8 PilX_N pdbhh F Eukaryota T 4dt5 1 A,B A,B E5LR38_9CUCU Antifreeze protein GYSCRAVGVDGRAVTDIQGTCHAKATGAGAMASGTSEPGSTSTATATGRGATARSTSTGRGTATTTATGTASATSNAIGQGTATTTATGSAGGRATGSATTSSSASQPTQTQTITGPGFQTAKSFARNTATTTVTASHHHHHH 143 T 0.8 Sporozoite_P67 pdb F Eukaryota T 4dv9 2 B B METHYL (2S)-1-[(2R,5S,8S,12S,13S,16S,19S,22S)-16-(3-AMINO-3-OXOPROPYL)-2,13-DIBENZYL-12,22-DIHYDROXY-3,5,17-TRIMETHYL-8-(2-METHYLPROPYL)-4,7,10,15,18,21-HEXAOXO-19-(PROPAN-2-YL)-3,6,9,14,17,20-HEXAAZATRICOSAN-1-OYL]PYRROLIDINE-2-CARBOXYLATE (NON-PREFERRED NAME) XVXXLAXX 8 T 320 CEND1 pdbhh F F 4dvf 2 C,D C,D METHYL (2S)-1-[(2R,5S,8S,12S,13S)-2,13-DIBENZYL-12-HYDROXY-3,5-DIMETHYL-8-(2-METHYLPROPYL)-15-(3-[(METHYLSULFONYL)AMINO]-5-{[(1R)-1-PHENYLETHYL]CARBAMOYL}PHENYL)-4,7,10,15-TETRAOXO-3,6,9,14-TETRAAZAPENTADECAN-1-OYL]PYRROLIDINE-2-CARBOXYLATE XXXLAXX 7 T 760 zf-C2H2_4 pdbhh F F 4dx0 2 B P Q42932_NICPL N.plumbaginifolia H+-translocating ATPase mRNA RRELHTLKGHVEAVVKLKGLDIETIQQSYDI 31 T 18 DUF1990 pdbhh F Eukaryota T 4dxu 2 B B TRFL_BOVIN C-TERMINAL PEPTIDE OF Lactotransferrin LEACAF 6 T 9.3 T6SS_TssF pdbhh F Eukaryota F 4e0e 1 A,B,C,D A,B,C,D Q8A074_BACTN Putative uncharacterized protein GAQQLTPPAGTFRLGISKGTDSHWLAPQEKVKGIAFRWKALPDTRGFILEVAVTSLQQADTLFWSFGNCQPDMDINVFSVEGQAFTCYYGESMKLRTLQAVTPTDDIRLSNGRQDKTPLLLYESGKRTDRPVLAGRCPLAANSKLYFCFYEQNARADYNYFMLPDLFAKIDESKHSKK 178 T 4.5E-08 DUF4450 unppercent F Bacteria T 4e27 1 A,B,C,D,E A,B,C,D,E Capsid Protein SQQFRIDSESIRDKLNTLLPSQSRGSIGVDLSGSTTIIPVVDLTETAEGGAQREDLQKAFTLINTIDFDVENTTTTIANTPGFYKVVGNLSSRDEASGAIAVIEVTDGITTKILANNRIVSPDGTTAVQSVPVPFDLMVKLVAGDTLQARSNNAEVRVQGIARQIADVSGNLINP 175 T 27 DUF5606 pdbhh F T 4e34 2 C,D C,D decameric peptide, iCAL36 ANSRWPTSII 10 T 3.7 C9orf72-like pdbhh F T 4e35 2 C,D C,D iCAL50 peptide ANSRWPTSIL 10 T 9.5 CBP_BcsR pdbhh F T 4e3b 2 C,D C,D iCAL50 peptide ANSRWPTSIL 10 T 9.5 CBP_BcsR pdbhh F T 4e43 2 C C Random peptide NLLQKK 6 T 150 Coat_F pdbhh F F 4e67 2 B B hydrocinnamoyl-derivatized PLHSpTA peptide XPLHSTAX 8 T 130 Dip pdbhh F T 4e73 2 B B JIP1_HUMAN JIP-1, JNK-INTERACTING PROTEIN 1, ISLET-BRAIN 1, IB-1, JNK MAP KINASE SCAFFOLD PROTEIN 1, MITOGEN-ACTIVATED PROTEIN KINASE 8-INTERACTING PROTEIN 1 KPKRPTTLNLF 11 T 6.3 Baculo_8kDa pdbhh F Eukaryota T 4e9c 2 B B LDPPLHSpTA phosphopeptide XLDPPLHSTAX 11 T 13 MethyltransfD12 pdbhh F T 4e9d 2 B E 3-(1-benzothiophen-2-yl)propanoyl-derivatized DPPLHSpTA peptide XDPPLHSTAX 10 T 32 IML1 pdbhh F T 4edn 2 K,L,M,N,O,P,Q K,L,M,N,O,P,Q PAXI_HUMAN Paxillin XMDDLDALLADLESTTSHISKX 22 T 0.021 DUF883 pdbpssm F Eukaryota T 4eec 2 C C desulfo-A47934 XXXXXXX 7 T 1400 NACHT pdbhh F F 4eha 2 B,D F,B ACE-ASP-GLU-VAL-ASP-CHLOROMETHYLKETONE INHIBITOR XDEVDX 6 T 200 ResIII pdbhh F F 4ehd 2 B B ACE-ASP-GLU-VAL-ASP-CHLOROMETHYLKETONE INHIBITOR XDEVDX 6 T 200 ResIII pdbhh F F 4ehf 2 B B ACE-ASP-GLU-VAL-ASP-CHLOROMETHYLKETONE INHIBITOR XDEVDX 6 T 200 ResIII pdbhh F F 4ehh 2 B B ACE-ASP-GLU-VAL-ASP-CHLOROMETHYLKETONE INHIBITOR XDEVDX 6 T 200 ResIII pdbhh F F 4ehk 2 C,D B,D ACE-ASP-GLU-VAL-ASP-CHLOROMETHYLKETONE INHIBITOR XDEVDX 6 T 200 ResIII pdbhh F F 4ehl 2 C,D B,D ACE-ASP-GLU-VAL-ASP-CHLOROMETHYLKETONE INHIBITOR XDEVDX 6 T 200 ResIII pdbhh F F 4ehn 2 B B ACE-ASP-GLU-VAL-ASP-CHLOROMETHYLKETONE INHIBITOR XDEVDX 6 T 200 ResIII pdbhh F F 4ehp 1 A A VINC_HUMAN METAVINCULIN MMPVFHTRTIESILEPVAQQISHLVIMHEEGEVDGKAIPDLTAPVAAVQAAVSNLVRVGKETVQTTEDQILKRDMPPAFIKVENACTKLVQAAQMLQSDPYSVPARDYLIDGSRGILSGTSDLLLTFDEAEVRKIIRVCKGILEYLTVAEVVETMEDLVTYTKNLGPGMTKMAKMIDERQQELTHQEHRVMLVNSMNTVKELLPVLISAMKIFVTTKNSKNQGIEEALKNRNFTVEKMSAEINEIIRVLQLTS 253 F F Eukaryota T 4eik 2 B B VSL12 peptide VSLARRPLPPLP 12 T 0.73 DUF4522 pdbhh F T 4ejd 2 C C pepstatin XVVXAX 6 T 1700 FAM60A pdbhh F F 4eje 2 C,D C,D VP40_EBOZM MEMBRANE-ASSOCIATED PROTEIN VP40 ILPTAPPEY 9 T 1.7 MLANA pdbhh T Viruses T 4ejf 2 E,F,G,H E,F,G,H phage-derived peptide 419 TEKEKGRLHCVEWTILER 18 T 1.4 UBA_e1_thiolCys pdbhh F T 4ejk 2 C N pepstatin XVVXAX 6 T 1700 FAM60A pdbhh F F 4ekk 2 C,D C,D GSK3B_HUMAN GSK-3 BETA, SERINE/THREONINE-PROTEIN KINASE GSK3B GRPRTTSFAE 10 T 6.3 DUF3084 pdbhh F Eukaryota T 4eo0 1 A A G3P_BPIKE GENE 3 PROTEIN, G3P, MINOR COAT PROTEIN MDNWESITKSYYTGFAISKTVESKDKDGKPVRKEVITQADLTTACNDAKASAQNVFNQIKLTLSGTWPNSQFRLVTGDTCVYNGSPGEKTESWSIRAQVEGDIQRSVPDHHHHHH 115 T 0.058 DUF1579 pdbpercent T Viruses T 4eoy 2 D,E,F D,E,F C0H519_PLAF7 Autophagy-related protein 3 NDWLLPSY 8 T 1.5 DUF1566 pdbhh F Eukaryota T 4ep3 2 B E Q9YP46_9HIV1 substrate CA-p2 KARVLAEAM 9 T 0.18 HypA unp T Viruses T 4epj 2 B D Q9YP46_9HIV1 substrate p2-NC ATIMMQRG 8 T 0.18 HypA unp T Viruses T 4eq0 2 B P Q9YP46_9HIV1 substrate p2-NC ATIMMQRG 8 T 0.18 HypA unp T Viruses T 4eqa 2 C,D C,D Q9I2Q0_PSEAE PA1845 PROTEIN ADCTFTQLEIVPQFGSPNMFGGEDEHVRVMFSNEDPNDDNPDAFPEPPVYLADRDSGNDCRIEDGGIWSRGGVFLSQDGRRVLMHEFSGSSAELVSYDSATCKVVHREDISGQRWAVDKDGLRLGQKCSGESVDSCAKIVKRSLAPFCQTAKK 153 T 1.4 Me-amine-dh_H pdbhh F Bacteria T 4eqf 2 B B HCN2_MOUSE BRAIN CYCLIC NUCLEOTIDE-GATED CHANNEL 2, BCNG-2, HYPERPOLARIZATION-ACTIVATED CATION CHANNEL 1, HAC-1 SRLSSNL 7 T 40 DUF2109 pdbhh F Eukaryota F 4eqy 2 G X HIS-HIS-HIS peptide HHH 3 T 75 Herpes_UL46 pdbhh F F 4er2 2 B I PEPSTATIN XVVXAX 6 T 1700 FAM60A pdbhh F F 4er4 2 B I H-142 PHPFHXIHK 9 T 3.8 IucA_IucC pdbhh F T 4erq 2 D,E,F D,E,F KMT2D_HUMAN ALL1-RELATED PROTEIN, LYSINE N-METHYLTRANSFERASE 2B, KMT2B, MYELOID/LYMPHOID OR MIXED-LINEAGE LEUKEMIA PROTEIN 2 INPTGCARSEPKIL 14 T 0.0018 N-SET pdbhh F Eukaryota T 4ery 2 B D KMT2C_HUMAN HOMOLOGOUS TO ALR PROTEIN, LYSINE N-METHYLTRANSFERASE 2C, KMT2C, MYELOID/LYMPHOID OR MIXED-LINEAGE LEUKEMIA PROTEIN 3 VNPTGCARSEPKMS 14 T 0.00076 N-SET pdbhh F Eukaryota T 4erz 2 D,E,F D,E,F KMT2B_HUMAN LYSINE N-METHYLTRANSFERASE 2D, KMT2D, MYELOID/LYMPHOID OR MIXED-LINEAGE LEUKEMIA PROTEIN 4, TRITHORAX HOMOLOG 2, WW DOMAIN-BINDING PROTEIN 7, WBP-7 LNPHGAARAEVYLR 14 T 0.39 N-SET pdbhh F Eukaryota T 4es8 1 A,B A,B A0A0H3BY62_STRPZ Epf GDHGPEFNGVMVVKAAEAEELPDDLMNFKGTWEVSADGSSGRFFSKGATDSYVFHLIPAKDVKKPGWREHNEVKDSYIKIDKQSIAARYKTSTTAPYSVAFKVNTKSLIKDHDYKITFEQGQIASGITVDYRIGSAFNKTTDDSFKISDESKYASNVKIEGEEQGFKQREQGDKTISFRTLKEGPMSLVLLSKVEKKPQGDLDVEFKNLKIIDVTNPSQLDKGVAYVGNKNVQLTLKSDDGRTNFEGDEISLFNSRGELLQTVTVTKDQQNPISITLSEDQAKSLKNKEKLKVSIKQKQSKKTSKDFFFEVGIDPKVEAK 320 T 7 DUF4493 pdbpssm F Bacteria T 4es9 1 A,B,C,D A,B,C,D A0A0H3BY62_STRPZ Epf GDHGPEFNGVMVVKAAEAEELPDDLMNFKGTWEVSADGSSGRFFSKGATDSYVFHLIPAKDVKKPGWREHNEVKDSYIKIDKQSIAARYKTSTTAPYSVAFKVNTKSLIKDHDYKITFEQGQIASGITVDYRIGSAFNKTTDDSFKISDESKYASNVKIEGEEQGFKQREQGDKTISFRTLKEGPMSLVLLSKVEKKPQGDLDVEFKNLKIIDVTNPSQLDKGVAYVGNKNVQLTLKSDDGRTNFEGDEISLFNSRGELLQTVTVTKDQQNPISITLSEDQAKSLKNKEKLKVSIKQKQSKKTSKDFFFEVGIDPKVEAK 320 T 7 DUF4493 pdbpssm F Bacteria T 4esg 2 C,D C,D KMT2A_HUMAN ALL-1, CXXC-TYPE ZINC FINGER PROTEIN 7, LYSINE N-METHYLTRANSFERASE 2A, KMT2A, TRITHORAX-LIKE PROTEIN, ZINC FINGER PROTEIN HRX, MLL CLEAVAGE PRODUCT N320, N-TERMINAL CLEAVAGE PRODUCT OF 320 KDA, P320, MLL CLEAVAGE PRODUCT C180, C-TERMINAL CLEAVAGE PRODUCT OF 180 KDA, P180 EPPLNPHGSARAEVHLR 17 T 1.1 N-SET unphh F Eukaryota T 4est 2 B I INHIBITOR ACE-ALA-PRO-VAI-DIFLUORO-N-PHENYLETHYLACETAMIDE XAPXXX 6 T 1700 zf-H2C2_2 pdbhh F F 4exh 2 C,D,E J,M,P ACETYL-PEPSTATIN XVVXAX 6 T 1700 FAM60A pdbhh F F 4ext 2 B B REV3L_HUMAN PROTEIN REVERSIONLESS 3-LIKE, REV3-LIKE, HREV3 RTANILKPLMSPPSREEIMATLL 23 T 2.8 CaM_bind pdbhh F Eukaryota T 4eyy 1 A R Q5ZYC9_LEGPH IcmR MGNNTDDSARNPFGFYTPPRVKEIGEPDVTDATLGSVYSEIISPVKDCILTVAKAVSFNPGGKDNTDAVEVLTELNTKVERAALNQPILTTKTERMFGAAESEKSSEPPSHDERGFKLSS 120 T 8 MOSC_N pdbhh F Bacteria T 4eyz 1 A,B A,B M9MMP4_RUMFL Cellulosome-related protein module from Ruminococcus flavefaciens that resembles papain-like cysteine peptidases MASMYNSDGWYMGEAINMASLNTCAADLGKWQNFIDDYTSNDYYKGTPYIDWVFASSPKGDRWQMNEWSVSEMLKVGGTYEEGGLNXMGFVWHAIAKGLSVESGLDISQTGQYVPFSSYFNGLGLSRKCWATPGGSGGWTVFVDYYNLHYYEFPTKEEMLSSGVLQKGDIIWCVDGSVGLGMAGLRTIADNHHIGIYTGNGTSDSWWQSGPVKADGDLVNVGTDVCPIYGAAAKNTYVVLPWAKKA 246 T 0.005 Beta-lactamase unppercent F Bacteria T 4ezn 2 C,D C,D PYRRH_PYRAP Pyrrhocoricin VDKGSYLPRPTPPRPIYNRN 20 T 2.5 Apidaecin pdbhh F Eukaryota T 4ezo 2 C,D C,D PR39_PIG Antibacterial protein PR-39 RRRPRPPYLPRPRPP 15 T 0.017 Apidaecin pdbhh F Eukaryota F 4ezp 2 C,D C,D APO-monomer XRPDKPRPYLPRPRPPRPVR 20 T 0.83 Apidaecin pdbhh F T 4ezq 2 B B PYRRH_PYRAP Pyrrhocoricin PPRPIYNRN 9 T 0.38 SPC12 pdbhh F Eukaryota F 4ezr 2 B B DROS_DROME Drosocin SHPRPIRV 8 T 3.1 Antimicrobial11 unphh F Eukaryota T 4ezt 2 B B HELN_HELVI Heliocin PRRPVIMRR 9 T 0.27 LcrG pdbhh F Eukaryota F 4ezw 2 E,F,G,H E,F,G,H synthetic peptide NRLLLTG NRLLLTG 7 T 10 PgaPase_1 pdbhh F F 4ezx 2 C,D C,D synthetic peptide NRLMLTG NRLMLTG 7 T 44 Beta-Casp pdbhh F T 4ezy 2 B B synthetic peptide NRLILTG NRLILTG 7 T 4.4 hemP pdbhh F T 4ezz 2 B B synthetic peptide ELPLVKI ELPLVKI 7 T 40 Gal_mutarotas_3 pdbhh F T 4f02 3 C,F C,F IF4G1_HUMAN EIF-4-GAMMA 1, EIF-4G 1, EIF-4G1, P220 KTIRIRDPNQGGKDITEEIMSGARTAY 27 T 4.3 TrbI_Ftype pdbhh F Eukaryota T 4f14 2 B B XIRP2_HUMAN BETA-XIN, CARDIOMYOPATHY-ASSOCIATED PROTEIN 3, XEPLIN PPPTLPKPKLPKH 13 T 18 SARG pdbhh F Eukaryota F 4f1z 2 B Q K1C10_HUMAN CYTOKERATIN-10, CK-10, KERATIN-10, K10 YGGGSSGGGSSGGG 14 T 2.2 DUF3246 pdbhh F Eukaryota F 4f20 2 B Q DMKN_HUMAN EPIDERMIS-SPECIFIC SECRETED PROTEIN SK30/SK89 GQSGSSGSGSNGD 13 T 17 Thymidylate_kin pdbhh F Eukaryota F 4f27 2 B Q FIBA_HUMAN FIBRINOPEPTIDE A, FIBRINOGEN ALPHA CHAIN ASGSSGTGSTGNQ 13 T 5.8 NAD_kinase_C pdbhh F Eukaryota T 4f32 2 C C Unknown peptide XXX 3 F F F 4f73 2 C,D C,D N terminal product of substrate CA-p2 KARVL 5 T 170 RBR pdbhh F F 4f74 2 C C N terminal product of substrate MA-CA VSQNY 5 T 67 DUF2615 pdbhh F F 4f75 2 C C N terminal product of substrate RH-IN IRKIL 5 T 38 DUF1217 pdbhh F F 4f75 3 D D C terminal product of substrate RH-IN FLDGI 5 T 23 DUF6058 pdbhh F F 4f76 2 C C N terminal product of substrate p1-p6 RPGNF 5 T 41 S_tail_recep_bd pdbhh F F 4f76 3 D D C terminal product of substrate p1-p6 LQSRP 5 T 93 DUF3708 pdbhh F F 4f87 1 A,B,C,D A,B,C,D Q7Y3F3_9CAUD PlyCB MSKINVNVENVSGVQGFLFHTDGKESYGYRAFINGVEIGIKDIETVQGFQQIIPSINISKSDVEAIRKAMKK 72 T 2.6 DUF3213 pdbhh T Viruses T 4f88 2 C,D,E,F,G,H,I,J,K,L,M,N,O,P,Q,R A,B,C,D,E,F,G,H,I,J,K,L,M,N,O,P Q7Y3F3_9CAUD PlyCB MSKINVNVENVSGVQGFLFHTDGKESYGYRAFINGVEIGIKDIETVQGFQQIIPSINISKSDVEAIRKAMKK 72 T 2.6 DUF3213 pdbhh T Viruses T 4fae 2 C D Q9YP46_9HIV1 Substrate p2/NC peptide TIMMQRG 7 T 0.41 HypA unp T Viruses T 4faf 2 C D POL_HV1BR substrate CA/p2 peptide RVLFEAM 7 T 0.41 HypA unp T Viruses T 4faj 2 B B Sex pheromone cCF10 LVTLVFV 7 T 37 TMEM156 pdbhh F F 4fas 2 D,E,F D,E,F Q82V11_NITEU NE1300 SGNLESSLAPISAKDMLDYLACKDKKPTDVVKSHTEVENGKIVRVKCGDIVALVQKAREQSGDAWQGGY 69 T 0.039 DUF6488 unphh F Bacteria T 4fbw 2 C,D C,D NBS1_SCHPO DNA repair and telomere maintenance protein nbs1 GESEDDKAFEENRRLRNLGSVEYIRIMSSEKSNANSRHTSKYYSGRKNFKKFQKKASQK 59 T 0.24 Nbs1_C pdbhh F Eukaryota T 4fbx 2 B B bisubstrate inhibitor XGDDDDD 7 T 97 Stm1_N pdbhh F F 4fby 19 MA,S k,Y Photosystem II reaction center protein Y XXXXXXXXXXXXXXXXXXXXXXXXXXXX 28 F F F 4fc9 1 A,B,C B,A,C Q3BQL2_XANC5 uncharacterized protein MGSSHHHHHHSSGRENLYFQGSATASELLLTAALERIEDTAQAMLSTVIDEERNPFLEGAPSYLPGKRPTDVTTFGQVPALRDMLAESRDLEFLQRVSDMAGPSPRIEDPSEEGLARHYTNVSNWKAQKSAHLGIVDHLGQFVYHEGSPLDVATLAKAVQMWKTRELIVHAHPQDRARFPELAVHIPEQVSDDSDSEQQTSPEPSGHQ 208 T 5.8E-05 LRR_9 unphh F Bacteria T 4fcm 2 C,D C,D Nucleoporin repeat peptide DSGFSFGSK 9 T 5.6 Peptidase_S9 pdbhh F F 4fdd 2 B B FUS_HUMAN 75 KDA DNA-PAIRING PROTEIN, ONCOGENE FUS, ONCOGENE TLS, POMP75, TRANSLOCATED IN LIPOSARCOMA PROTEIN RGGGDRGGFGPGKMDSRGEHRQDRRERPY 29 T 130 Pro-NT_NN pdbhh F Eukaryota T 4ff1 1 A,C A,B RPOLV_BPN4 Virion RNA polymerase MGGSHHHHHHRSESTVTEELKEGIDAVYPSLVGTADSKAEGIKNYFKLSFTLPEEQKSRTVGSEAPLKDVAQALSSRARYELFTEKETANPAFNGEVIKRYKELMEHGEGIADILRSRLAKFLNTKDVGKRFAQGTEANRWVGGKLLNIVEQDGDTFKYNEQLLQTAVLAGLQWRLTATSNTAIKDAKDVAAITGIDQALLPEGLVEQFDTGMTLTEAVSSLAQKIESYWGLSRNPNAPLGYTKGIPTAMAAEILAAFVESTDVVENIVDMSEIDPDNKKTIGLYTITELDSFDPINSFPTAIEEAVLVNPTEKMFFGDDIPPVANTQLRNPAVRNTPEQKAALKAEQATEFYVHTPMVQFYETLGKDRILELMGAGTLNKELLNDNHAKSLEGKNRSVEDSYNQLFSVIEQVRAQSEDISTVPIHYAYNMTRVGRMQMLGKYNPQSAKLVREAILPTKATLDLSNQNNEDFSAFQLGLAQALDIKVHTMTREVMSDELTKLLEGNLKPAIDMMVEFNTTGSLPENAVDVLNTALGDRKSFVALMALMEYSRYLVAEDKSAFVTPLYVEADGVTNGPINAMMLMTGGLFTPDWIRNIAKGGLFIGSPNKTMNEHRSTADNNDLYQASTNALMESLGKLRSNYASNMPIQSQIDSLLSLMDLFLPDINLGENGALELKRGIAKNPLTITIYGSGARGIAGKLVSSVTDAIYERMSDVLKARAKDPNISAAMAMFGKQAASEAHAEELLARFLKDMETLTSTVPVKRKGVLELQSTGTGAKGKINPKTYTIKGEQLKALQENMLHFFVEPLRNGITQTVGESLVYSTEQLQKATQIQSVVLEDMFKQRVQEKLAEKAKDPTWKKGDFLTQKELNDIQASLNNLAPMIETGSQTFYIAGSENAEVANQVLATNLDDRMRVPMSIYAPAQAGVAGIPFMTIGTGDGMMMQTLSTMKGAPKNTLKIFDGMNIGLNDITDASRKANEAVYTSWQGNPIKNVYESYAKFMKNVDFSKLSPEALEAIGKSALEYDQRENATVDDIANAASLIERNLRNIALGVDIRHKVLDKVNLSIDQMAAVGAPYQNNGKIDLSNMTPEQQADELNKLFREELEARKQKVAKAR 1118 T 0.0038 RNA_pol pdbhh T Viruses T 4ff2 1 A,C A,B RPOLV_BPN4 Virion RNA polymerase MGGSHHHHHHRSESTVTEELKEGIDAVYPSLVGTADSKAEGIKNYFKLSFTLPEEQKSRTVGSEAPLKDVAQALSSRARYELFTEKETANPAFNGEVIKRYKELMEHGEGIADILRSRLAKFLNTKDVGKRFAQGTEANRWVGGKLLNIVEQDGDTFKYNEQLLQTAVLAGLQWRLTATSNTAIKDAKDVAAITGIDQALLPEGLVEQFDTGMTLTEAVSSLAQKIESYWGLSRNPNAPLGYTKGIPTAMAAEILAAFVESTDVVENIVDMSEIDPDNKKTIGLYTITELDSFDPINSFPTAIEEAVLVNPTEKMFFGDDIPPVANTQLRNPAVRNTPEQKAALKAEQATEFYVHTPMVQFYETLGKDRILELMGAGTLNKELLNDNHAKSLEGKNRSVEDSYNQLFSVIEQVRAQSEDISTVPIHYAYNMTRVGRMQMLGKYNPQSAKLVREAILPTKATLDLSNQNNEDFSAFQLGLAQALDIKVHTMTREVMSDELTKLLEGNLKPAIDMMVEFNTTGSLPENAVDVLNTALGDRKSFVALMALMEYSRYLVAEDKSAFVTPLYVEADGVTNGPINAMMLMTGGLFTPDWIRNIAKGGLFIGSPNKTMNEHRSTADNNDLYQASTNALMESLGKLRSNYASNMPIQSQIDSLLSLMDLFLPDINLGENGALELKRGIAKNPLTITIYGSGARGIAGKLVSSVTDAIYERMSDVLKARAKDPNISAAMAMFGKQAASEAHAEELLARFLKDMETLTSTVPVKRKGVLELQSTGTGAKGKINPKTYTIKGEQLKALQENMLHFFVEPLRNGITQTVGESLVYSTEQLQKATQIQSVVLEDMFKQRVQEKLAEKAKDPTWKKGDFLTQKELNDIQASLNNLAPMIETGSQTFYIAGSENAEVANQVLATNLDDRMRVPMSIYAPAQAGVAGIPFMTIGTGDGMMMQTLSTMKGAPKNTLKIFDGMNIGLNDITDASRKANEAVYTSWQGNPIKNVYESYAKFMKNVDFSKLSPEALEAIGKSALEYDQRENATVDDIANAASLIERNLRNIALGVDIRHKVLDKVNLSIDQMAAVGAPYQNNGKIDLSNMTPEQQADELNKLFREELEARKQKVAKAR 1118 T 0.0038 RNA_pol pdbhh T Viruses T 4ff3 1 A,C A,B RPOLV_BPN4 Virion RNA polymerase MGGSHHHHHHRSESTVTEELKEGIDAVYPSLVGTADSKAEGIKNYFKLSFTLPEEQKSRTVGSEAPLKDVAQALSSRARYELFTEKETANPAFNGEVIKRYKELMEHGEGIADILRSRLAKFLNTKDVGKRFAQGTEANRWVGGKLLNIVEQDGDTFKYNEQLLQTAVLAGLQWRLTATSNTAIKDAKDVAAITGIDQALLPEGLVEQFDTGMTLTEAVSSLAQKIESYWGLSRNPNAPLGYTKGIPTAMAAEILAAFVESTDVVENIVDMSEIDPDNKKTIGLYTITELDSFDPINSFPTAIEEAVLVNPTEKMFFGDDIPPVANTQLRNPAVRNTPEQKAALKAEQATEFYVHTPMVQFYETLGKDRILELMGAGTLNKELLNDNHAKSLEGKNRSVEDSYNQLFSVIEQVRAQSEDISTVPIHYAYNMTRVGRMQMLGKYNPQSAKLVREAILPTKATLDLSNQNNEDFSAFQLGLAQALDIKVHTMTREVMSDELTKLLEGNLKPAIDMMVEFNTTGSLPENAVDVLNTALGDRKSFVALMALMEYSRYLVAEDKSAFVTPLYVEADGVTNGPINAMMLMTGGLFTPDWIRNIAKGGLFIGSPNKTMNEHRSTADNNDLYQASTNALMESLGKLRSNYASNMPIQSQIDSLLSLMDLFLPDINLGENGALELKRGIAKNPLTITIYGSGARGIAGKLVSSVTDAIYERMSDVLKARAKDPNISAAMAMFGKQAASEAHAEELLARFLKDMETLTSTVPVKRKGVLELQSTGTGAKGKINPKTYTIKGEQLKALQENMLHFFVEPLRNGITQTVGESLVYSTEQLQKATQIQSVVLEDMFKQRVQEKLAEKAKDPTWKKGDFLTQKELNDIQASLNNLAPMIETGSQTFYIAGSENAEVANQVLATNLDDRMRVPMSIYAPAQAGVAGIPFMTIGTGDGMMMQTLSTMKGAPKNTLKIFDGMNIGLNDITDASRKANEAVYTSWQGNPIKNVYESYAKFMKNVDFSKLSPEALEAIGKSALEYDQRENATVDDIANAASLIERNLRNIALGVDIRHKVLDKVNLSIDQMAAVGAPYQNNGKIDLSNMTPEQQADELNKLFREELEARKQKVAKAR 1118 T 0.0038 RNA_pol pdbhh T Viruses T 4ff4 1 A,C A,B RPOLV_BPN4 Virion RNA polymerase MGGSHHHHHHRSESTVTEELKEGIDAVYPSLVGTADSKAEGIKNYFKLSFTLPEEQKSRTVGSEAPLKDVAQALSSRARYELFTEKETANPAFNGEVIKRYKELMEHGEGIADILRSRLAKFLNTKDVGKRFAQGTEANRWVGGKLLNIVEQDGDTFKYNEQLLQTAVLAGLQWRLTATSNTAIKDAKDVAAITGIDQALLPEGLVEQFDTGMTLTEAVSSLAQKIESYWGLSRNPNAPLGYTKGIPTAMAAEILAAFVESTDVVENIVDMSEIDPDNKKTIGLYTITELDSFDPINSFPTAIEEAVLVNPTEKMFFGDDIPPVANTQLRNPAVRNTPEQKAALKAEQATEFYVHTPMVQFYETLGKDRILELMGAGTLNKELLNDNHAKSLEGKNRSVEDSYNQLFSVIEQVRAQSEDISTVPIHYAYNMTRVGRMQMLGKYNPQSAKLVREAILPTKATLDLSNQNNEDFSAFQLGLAQALDIKVHTMTREVMSDELTKLLEGNLKPAIDMMVEFNTTGSLPENAVDVLNTALGDRKSFVALMALMEYSRYLVAEDKSAFVTPLYVEADGVTNGPINAMMLMTGGLFTPDWIRNIAKGGLFIGSPNKTMNEHRSTADNNDLYQASTNALMESLGKLRSNYASNMPIQSQIDSLLSLMDLFLPDINLGENGALELKRGIAKNPLTITIYGSGARGIAGKLVSSVTDAIYERMSDVLKARAKDPNISAAMAMFGKQAASEAHAEELLARFLKDMETLTSTVPVKRKGVLELQSTGTGAKGKINPKTYTIKGEQLKALQENMLHFFVEPLRNGITQTVGESLVYSTEQLQKATQIQSVVLEDMFKQRVQEKLAEKAKDPTWKKGDFLTQKELNDIQASLNNLAPMIETGSQTFYIAGSENAEVANQVLATNLDDRMRVPMSIYAPAQAGVAGIPFMTIGTGDGMMMQTLSTMKGAPKNTLKIFDGMNIGLNDITDASRKANEAVYTSWQGNPIKNVYESYAKFMKNVDFSKLSPEALEAIGKSALEYDQRENATVDDIANAASLIERNLRNIALGVDIRHKVLDKVNLSIDQMAAVGAPYQNNGKIDLSNMTPEQQADELNKLFREELEARKQKVAKAR 1118 T 0.0038 RNA_pol pdbhh T Viruses T 4ffe 1 A,B,C X,Y,Z Q8QN43_COWPX OMCP, ORTHOPOX VIRUS MHC CLASS I-LIKE PROTEIN MGHKLAFNFNLEINGSDTHSTVDVYLDDSQIITFDGKDIRPTIPFMIGDEIFLPFYKNVFSEFFSLFRRVPTSTPYEDLTYFYECDYTDNKSTFDQFYLYNGEEYTVKTQEATNKNMWLTTSEFRLKKWFDGEDCIMHLRSLVRKMEDSKR 151 T 0.078 Thioredoxin_11 unppssm T Viruses T 4fga 2 B P AYK AYK 3 T 250 NS3 pdbhh F F 4fgi 2 B,D,F,H B,D,F,H Q9I2Q0_PSEAE Tsi1 MAFTQLEIVPQFGSPNMFGGEDEHVRVMFSNEDPNDDNPDAFPEPPVYLADRDSGNDCRIEDGGIWSRGGVFLSQDGRRVLMHEFSGSSAELVSYDSATCKVVHREDISGQRWAVDKDGLRLGQKCSGESVDSCAKIVKRSLAPFCQTAKK 151 T 1.3 Me-amine-dh_H pdbhh F Bacteria T 4fgt 2 B D CG peptide CQLY 4 T 45 Clathrin_propel pdbhh F F 4fgx 2 B B Inhibitor (2R,5S,8S,12S,13S,16S,19S,22S)-16-(3-amino-3-oxopropyl)-2,13-dibenzyl-12,22-dihydroxy-8-isobutyl-19-isopropyl-3,5,17-trimethyl-4,7,10,15,18,21-hexaoxo-3,6,9,14,17,20-hexaazatricosan-1-oic acid XVXXLAX 7 T 490 Penaeidin pdbhh F F 4fim 2 B B TRFL_BOVIN C-terminal peptide from Lactotransferrin LEACAF 6 T 9.3 T6SS_TssF pdbhh F Eukaryota F 4fj3 2 C P RAF1_HUMAN PROTO-ONCOGENE C-RAF, CRAF, RAF-1 QHRYSTPHAFTFNTSSPSSEGSLSQRQRSTSTPNVH 36 T 0.23 DUF1780 pdbpssm F Eukaryota T 4fjo 2 B B POLK_MOUSE DINB PROTEIN, DINP SFFDKKRSER 10 T 0.0065 DUF4113 unphh F Eukaryota T 4fjo 4 D D REV3L_MOUSE PROTEIN REVERSIONLESS 3-LIKE, REV3-LIKE, SEIZURE-RELATED PROTEIN 4 GSFTPRTAHILKPLMSPPSREEIVATLLDH 30 T 2.2 CaM_bind pdbhh F Eukaryota T 4fjp 2 B B TRFL_BOVIN C-terminal peptide from Lactotransferrin LEACAF 6 T 9.3 T6SS_TssF pdbhh F Eukaryota F 4fl5 2 C,D P,Q TAU_HUMAN NEUROFIBRILLARY TANGLE PROTEIN, PAIRED HELICAL FILAMENT-TAU, PHF-TAU SRTPSLPTPP 10 T 3.2 Adeno_E4 pdbhh F Eukaryota F 4flg 3 D E POL_HV1BR HIV-1 protease, fragment QII 3 T 260 Milton pdbhh T Viruses F 4fln 2 D,F D,F Unknown peptide XXWX 4 T 420 CBM_1 pdbhh F F 4fln 3 E E Unknown peptide XXXXXXXXXXXXXXDXWXXX 20 T 1400 Kelch_6 pdbhh F F 4fm6 2 C C hexapeptide YDQIII 6 T 6.4 TRSP pdbhh F F 4fmn 3 C C NTH2_YEAST NTG2 (DNA N-GLYCOSYLASE AND APURINIC OR APYRIMIDINIC LYASE) XVRSKYFKK 9 T 1.1 DUF1748 pdbhh F Eukaryota T 4fmo 3 C C EXO1_YEAST EXODEOXYRIBONUCLEASE I, EXO I, EXONUCLEASE I, PROTEIN DHS1 TRSKFFNK 8 T 1.5 Tna_leader pdbhh F Eukaryota T 4fmq 2 B B MAPK DOCKING PEPTIDE LSLSSLAASSLAKRRQQ 17 T 6 Rotavirus_VP1 pdbhh F T 4fn5 2 B B Argyrin B XXWXGXXX 8 T 2.1 Glypican pdbhh F F 4for 2 B B TRFL_BOVIN C-terminal peptide from Lactotransferrin LEACAF 6 T 9.3 T6SS_TssF pdbhh F Eukaryota F 4fq3 2 B B FUS_HUMAN Fusion (Involved in t(12;16) in malignant liposarcoma) GPLGSRGGRGGGDRGGFGPGKMDSRGEHRQDRRERPY 37 T 190 Pro-NT_NN pdbhh F Eukaryota T 4fqb 2 B,D,F,H B,D,F,H Q9I2Q0_PSEAE immune protein Tsi1 MADCTFTQLEIVPQFGSPNMFGGEDEHVRVMFSNEDPNDDNPDAFPEPPVYLADRDSGNDCRIEDGGIWSRGGVFLSQDGRRVLMHEFSGSSAELVSYDSATCKVVHREDISGQRWAVDKDGLRLGQKCSGESVDSCAKIVKRSLAPFCQTAKKLEHHHHHH 162 T 1.5 Me-amine-dh_H unphh F Bacteria T 4fqx 3 C E Synthetic peptide GKQNCLKLATK 11 T 2.5 DUF373 pdbhh F T 4fr3 2 B P KCNK9_HUMAN TASK-3 peptide KRRKSV 6 T 26 DUF4739 pdbhh F Eukaryota F 4ftg 2 C,D C,D ANXA2_HUMAN ANNEXIN II, ANNEXIN-2, CALPACTIN I HEAVY CHAIN, CALPACTIN-1 HEAVY CHAIN, CHROMOBINDIN-8, LIPOCORTIN II, PLACENTAL ANTICOAGULANT PROTEIN IV, PAP-IV, PROTEIN I, P36 XSTVHEILSKLSLEGDX 17 T 4 MJ1316 pdbhh F Eukaryota T 4ftg 3 E E AHNK_HUMAN DESMOYOKIN XGKVTFPKMKIPKFTFSGRELX 22 T 7.7 DUF5476 pdbhh F Eukaryota T 4fvd 2 B C A9XG43_9ENTO 10-mer peptide from 2A proteinase GSITTLGKFG 10 T 3 PAC3 pdbhh T Viruses T 4fvs 1 A,B,C,D,E,F A,B,C,D,E,F A6LGE3_PARD8 putative lipoprotein GQDCTFFFPQTEGTVWVRKGYDAKGNLQSVMSYQVDEVETLPSGQEVEADYVYTNPSGTIVNKGDIKAYCQNGEFFLDSKETLSYPGVVSEMNTNVDITENFINYPNPYAANFDKNNVYFDEASVKIYDKKNRKNRKDMAIKDREFIKTESITTPAGTFDCAKVKYNIATRSPKSKETITGYGYEWYSPNVGLVRTEQYDKNNVLQSYTVLEELK 215 T 0.0011 DUF3108 unppercent F Bacteria T 4fvt 2 B B Acetylated ACS2 peptide SGXVX 5 T 58 AAA_18 pdbhh F F 4fys 2 B C ANGT_HUMAN SERPIN A8, ANGIOTENSIN-1, ANGIOTENSIN I, ANG I, ANGIOTENSIN-2, ANGIOTENSIN II, ANG II, ANGIOTENSIN-3, ANGIOTENSIN III, ANG III, DES-ASP[1]-ANGIOTENSIN II VYIHPF 6 T 0.64 Adeno_PVIII pdbhh F Eukaryota T 4fyt 2 B B AMASTATIN XVVD 4 T 400 Fer4 pdbhh F F 4fz3 2 B B P53_HUMAN peptide from Cellular tumor antigen p53 XRHKXX 6 T 360 Viral_helicase1 pdbhh F Eukaryota F 4fzc 15 CA,DA,EA,FA c,d,e,f Cepafungin I XTXX 4 T 2200 zf-H2C2_5 pdbhh F F 4fzd 3 C C STK26_HUMAN C-terminal peptide from Serine/threonine-protein kinase MST4 EWSFT 5 T 25 MMM1 pdbhh F Eukaryota F 4fzg 15 CA,DA,EA,FA c,d,e,f Glidobactin XTXX 4 T 2200 zf-H2C2_5 pdbhh F F 4g13 1 A A EMERIMICIN IV, STILBELLIN I XFXXXVGLXXPQXPXX 16 T 0.13 Pep_deformylase pdbhh F T 4g14 1 A A EMERIMICIN IV, STILBELLIN I XFXXXVGLXXPQXPXX 16 T 0.13 Pep_deformylase pdbhh F T 4g1c 2 C,D D,E Succinylated IDH2 peptide XAVXCAX 7 T 300 zf-met pdbhh F F 4g1w 2 B B JIP1_HUMAN JIP-1, JNK-INTERACTING PROTEIN 1, ISLET-BRAIN 1, IB-1, JNK MAP KINASE SCAFFOLD PROTEIN 1, MITOGEN-ACTIVATED PROTEIN KINASE 8-INTERACTING PROTEIN 1 RPKRPTTLNLF 11 T 12 CTP-dep_RFKase pdbhh F Eukaryota T 4g2v 2 B B FRPD1_HUMAN FERM DOMAIN-CONTAINING PROTEIN 2 ALGLLAPLRETKSTNPASRVMEMEPETMETKSVIDSRV 38 T 71 PNP_phzG_C pdbhh F Eukaryota T 4g2z 2 B B TRFL_BOVIN C-terminal peptide from Lactotransferrin LEACAF 6 T 9.3 T6SS_TssF pdbhh F Eukaryota F 4g3b 1 A,B A,B alpha4F3d GNADEXYKELEDXQERLRKXRKKLRS 26 T 0.016 SNARE pdbpssm F T 4g4l 1 A,B A,B alpha4tbA6 GNADEXYKEXEDXQERXRKXRKKXRSG 27 T 5.9 PRP1_N pdbhh F T 4g4m 1 A,B A,B alpha4F3(6-13) GNADEXYKEXEDXQERLRKLRKKLRSG 27 T 0.9 YggL_50S_bp pdbhh F T 4g5g 2 B I thiomuracin A derivative SXNXXXYXXXXXX 13 T 0.79 CCER1 pdbhh F F 4g6d 2 B B Q4Z9Y5_9CAUD ORF067 MKLKILDKDNATLNVFHRNKEHKTIDNVPTANLVDWYPLSNAYEYKLSRNGEYLELKRLRSTLPSSYGLDDNNQDIIRDNNHRCKIGYWYNPAVRKDNLKIIEKAKQYGLPIITEEYDANTVEQGFRDIGVIFQSLKTIVVTRYLEGKTEEELRIFNMKSEESQLNEALKESDFSVDLTYSDLGQIYNMLLLMKKISK 198 T 0.073 PglD_N pdbpssm T Viruses T 4g6t 2 B B Q87UE5_PSESM Type III effector HopA1 IPALKANGQLEVDGKRYEIRAADDGTISVLRPEQQSKAKSFFKGASQLIGGSSQRAQIAQALNEKVASARTVLHQSAMTGGR 82 T 0.37 Gifsy-2 pdbhh F Bacteria T 4g6u 1 A A F2WK69_ECO57 EC869 CdiA-CT MGTNQSLTFDKELSDCRKSGGNCQDIIDKWEKISDEQSAEIDQKLKDNPLEAQVIDKEVAKGGYDMTQRPGWLGNIGVEVMTSDEAKAYVQKWNGRDLTKIDVNSPEWTKFAVFASDPENQAMLVSGGLLVKDITKAAISFMSRNTATATVNASEIGMQWGQGNMKQGMPWEDYVGKSLPADARLPKNFKIFDYYDGATKTATSVKSIDTQTMAKLANPNQVYSSIKGNIDAAAKFKEYALSGRELTSSMISNREIQLAIPADTTKTQWAEINRAIEYGKSQGVKVTVTQVK 292 T 0.13 Glyco_hydro_97 unppercent F Bacteria T 4g6v 2 B,D,F,H B,D,F,H H9T8H3_BURPE CdiI MAIDLFCYLSIDRGAAESDLNKIRSNHSELFEGKFLISPVRDADFSLKEIAAEHGLVAESFFLVSLNDKNSADLIPIVSKILVDGFNGGAILILQDNEYRRTSLEHHHHHH 111 T 1.5 T3SS_TC unphh F Bacteria T 4g8g 3 C C POL_HV1B1 P24 KRWIILGLNK 10 T 1 COX2-transmemb pdbhh T Viruses T 4g8i 3 C C POL_HV1B1 Gag protein KRWIIMGLNK 10 T 0.6 DUF5790 pdbhh T Viruses T 4g8x 2 B,D B,D Q4Z9Y5_9CAUD ORF067 MKLKILDKDNATLNVFHRNKEHKTIDNVPTANLVDWYPLSNAYEYKLSRNGEYLELKRLRSTLPSSYGLDDNNQDIIRDNNHRCKIGYWYNPAVRKDNLKIIEKAKQYGLPIITEEYDANTVEQGFRDIGVIFQSLKTIVVTRYLEGKTEEELRIFNMKSEESQLNEALKESDFSVDLTYSDLGQIYNMLLLMKKISK 198 T 0.073 PglD_N pdbpssm T Viruses T 4g94 2 B B Q4Z9Y5_9CAUD ORF067 MKLKILDKDNATLNVFHRNKEHKTIDNVPTANLVDWYPLSNAYEYKLSRNGEYLELKRLRSTLPSSYGLDDNNQDIIRDNNHRCKIGYWYNPAVRKDNLKIIEKAKQYGLPIITEEYDANTVEQGFRDIGVIFQSLKTIVVTRYLEGKTEEELRIFNMKSEESQLNEALKESDFSVDLTYSDLGQIYNMLLLMKKISK 198 T 0.073 PglD_N pdbpssm T Viruses T 4g9d 3 C C POL_HV1B1 P24 KRWIILGLNK 10 T 1 COX2-transmemb pdbhh T Viruses T 4g9f 3 C C POL_HV1B1 Gag protein KRWIIMGLNK 10 T 0.6 DUF5790 pdbhh T Viruses T 4g9j 2 C,D C,D synthetic peptide RRKRPKRKRKNARVTFAEAAEII 23 T 7.8 Consortin_C pdbhh F T 4gao 2 C,E,F,H C,E,F,H UBC12_HUMAN NEDD8-conjugating enzyme Ubc12 XIKLFSLKQQKK 12 T 4.6 DUF3637 pdbhh F Eukaryota T 4gba 2 C,D F,G UBE2F_HUMAN NEDD8 CARRIER PROTEIN UBE2F, NEDD8 PROTEIN LIGASE UBE2F, NEDD8-CONJUGATING ENZYME 2, UBIQUITIN-CONJUGATING ENZYME E2 F XLTLASKLKRDDGLKGSRTAATASD 25 T 37 Spore_YtrH pdbhh F Eukaryota T 4gbq 2 B B SOS1_MOUSE AC-VPPPVPPRRR-NH2 XVPPPVPPRRRX 12 T 4.2 Dscam_C pdbhh F Eukaryota F 4gbx 5 E E synthetic peptide GKQNCLKLAT 10 T 5.7 KRBA1 pdbhh F T 4gch 1 A E CTRA_BOVIN GAMMA-CHYMOTRYPSIN A CGVPAIQPVLSGL 13 T 2 CaM_bind pdbhh F Eukaryota T 4geq 3 E,F E,F CNN1_YEAST CO-PURIFIED WITH NNF1 PROTEIN 1 NKDPNEVRSFLQDLSQVLARKSQGN 25 T 0.24 DivIVA pdbhh F Eukaryota T 4gfu 2 B F ERBB2_HUMAN HER2-pY1248 phosphor-peptide PEXLGLD 7 T 2.2 DUF2666 pdbhh F Eukaryota T 4gfv 2 C,D E,F HER2-pY1196 phosphor-peptide PEXLTP 6 T 1.7 YLP pdbhh F F 4gfy 2 B B VIAK VIAK 4 T 290 LRR_4 pdbhh F F 4ggd 2 C,D C,D BUB1B_HUMAN MAD3/BUB1-RELATED PROTEIN KINASE, HBUBR1, MITOTIC CHECKPOINT KINASE MAD3L, PROTEIN SSK1 DEWELSKENVQPLRQGRIMSTLQ 23 T 1.8 DivIC unppssm F Eukaryota T 4ggn 2 D,E,F D,E,F MYOA_PLAYO Myosin-A SLMRVQAHIRKRMVA 15 T 0.063 BORCS8 pdbhh F Eukaryota T 4ghu 2 B B MAVS_MOUSE CARDIF, MAVS, CARD ADAPTER INDUCING INTERFERON BETA, INTERFERON BETA PROMOTER STIMULATOR PROTEIN 1, IPS-1, VIRUS-INDUCED-SIGNALING ADAPTER, VISA PSCPKPVQDTQPPESPVENSE 21 T 38 rpo132 pdbhh F Eukaryota T 4gk0 2 C,D C,D REV3L_HUMAN PROTEIN REVERSIONLESS 3-LIKE, REV3-LIKE, HREV3 MLTPTPDSSPRSTSSPSQSKNGSFTPRTANILKPLMSPPSREEIMATLLDHD 52 T 7 CaM_bind pdbhh F Eukaryota T 4gk5 2 C,D C,D REV3L_HUMAN PROTEIN REVERSIONLESS 3-LIKE, REV3-LIKE, HREV3 MLTPTPDSSPRSTSSPSQSKNGSFTPRTANILKPLMSPPSREEIMATLLDHD 52 T 7 CaM_bind pdbhh F Eukaryota T 4gk5 4 G G POLK_HUMAN DINB PROTEIN, DINP KKSFFDKKRS 10 T 4.2 FDF pdbhh F Eukaryota F 4gk7 15 CA,DA,EA,FA,GA,HA 1,2,3,4,5,6 BOUND FORM OF CYCLIC SYLA-GLBA HYBRID XTXX 4 T 2900 EF-hand_5 pdbhh F F 4gkg 1 A,B A,F DCTB_RHIME C4-dicarboxylate transport sensor protein dctB MGSSHHHHHHSSGLVPRGSHMEERLARNALEASVEERTRDLRMARDRLETEIADHRQTTEKLQAVQQ 67 T 0.91 Spectrin unp F Bacteria T 4gkn 3 C,F C,F FAT Cognate peptide FATGIGIITV 10 T 5.7 MLANA pdbhh F T 4gks 3 C,F C,F FLT Cognate peptide FLTGIGIITV 10 T 5.5 IGR pdbhh F T 4gkv 2 E P cleaved peptide fragment corresponding to the C-terminal His tag AIPNPLLGLA 10 T 3.2 Pigment_DH pdbhh F T 4gl8 2 C,D C,D Peptide of unknown sequence XXXX 4 F F F 4gl9 2 C,E,F,H I,K,J,L IL6RB_MOUSE IL-6RB,INTERLEUKIN-6 SIGNAL TRANSDUCER,MEMBRANE GLYCOPROTEIN 130,GP130,ONCOSTATIN-M RECEPTOR SUBUNIT ALPHA STASTVEXSTVVHSG 15 T 10 DUF4244 pdbhh F Eukaryota T 4gld 2 B B FLAYK FLAYK 5 T 71 Dynein_light pdbhh F F 4gln 1 A,D D,H D-RFX001 XXXXXXXGXXXXGXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 56 F F F 4glr 1 A,B A,B TAU_HUMAN phospho-peptide KKVAVVRTPPKSPSSAKC 18 T 12 DUF1067 pdbhh F Eukaryota T 4gls 1 A,B A,B D- Vascular endothelial growth factor-A GXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGGXXXXXGXXXXXXXXXXXXXXXXXXXXXXGXXXGXXXXXXXXXXXXXXXXX 102 F F F 4gls 3 D,G D,H D- RFX001 XXXXXXXGXXXXGXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 56 F F F 4glu 1 A,B,C,D,E,F A,D,B,C,E,F D- Vascular endothelial growth factor-A GXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGGXXXXXGXXXXXXXXXXXXXXXXXXXXXXGXXXGXXXXXXXXXXXXXXXXX 102 F F F 4gly 2 B B BICYCLIC PEPTIDE INHIBITOR UK504 CCLGRGCENHRCLX 14 T 1.1 Ivy pdbhh F T 4gm3 2 I,J,K,L,M,N,O,P I,J,K,L,M,N,O,P MM-101 XXRXX 5 T 2400 zf-C2H2 pdbhh F F 4gm8 2 E,F,G,H E,F,G,H MM-102 XXRXX 5 T 2400 zf-C2H2 pdbhh F F 4gm9 2 C,D E,F MM-401 XXRXX 5 T 630 RlmM_FDX pdbhh F F 4gmb 2 B E MM-402 XXRXX 5 T 630 RlmM_FDX pdbhh F F 4gnt 2 B B MLXPL_MOUSE CHREBP, MLX INTERACTOR, MLX-INTERACTING PROTEIN-LIKE, WILLIAMS-BEUREN SYNDROME CHROMOSOMAL REGION 14 PROTEIN HOMOLOG RDKIRLNNAIWRAWYIQYVQR 21 T 0.0087 DUF1752 pdb F Eukaryota T 4gpk 2 M,N,O,P,Q,R,S,T,U,V,W,X M,N,O,P,Q,R,S,T,U,V,W,X NprX peptide SSKPDIVG 8 T 2.5 Bse634I pdbhh F T 4gpl 1 A A ACE-PTR-THR-PRO-GLU-PRO, PEPTIDE INHIBITOR XXTPEPX 7 T 22 FSIP1 pdbhh F F 4gq6 2 B B KMT2A_HUMAN ALL-1, CXXC-TYPE ZINC FINGER PROTEIN 7, LYSINE N-METHYLTRANSFERASE 2A, KMT2A, TRITHORAX-LIKE PROTEIN, ZINC FINGER PROTEIN HRX, MLL CLEAVAGE PRODUCT N320, N-TERMINAL CLEAVAGE PRODUCT OF 320 KDA, P320, MLL CLEAVAGE PRODUCT C180, C-TERMINAL CLEAVAGE PRODUCT OF 180 KDA, P180 SARWRFPARPGT 12 T 2.3 Xin pdbhh F Eukaryota T 4gqb 3 C C H4_HUMAN Histone H4 peptide XSGRGKGGKGLGKGGAKRHRKV 22 T 11 Shadoo unppercent F Eukaryota T 4gqz 1 A,B,C,D A,B,C,D Q8ZL99_SALTY CUEP AMASSESAFLAQHGLAGKTVEQIVDTIDQTPQSRPLPYSASITSTELKLSDGEQIYTLPLGDKFYLSFAPYEWRTHPCFNHSLSGCQGEMPNKPFTVKVTDSKGAVIVQKEMQSYRNGFIGVWLPRNMEGTLEVSYNGKTASHAIATSDDSQTCLTELPLR 161 T 0.031 DUF3244 unppssm F Bacteria T 4grk 2 B B TRFL_BOVIN C-terminal peptide from Lactotransferrin LEACAF 6 T 9.3 T6SS_TssF pdbhh F Eukaryota F 4gs0 2 C C JANUS KINASE 1, JAK-1 XXXX 4 T 1100 zf_CCCH_4 pdbhh F F 4gur 2 B B GLYR1_HUMAN NPAC, 3-HYDROXYISOBUTYRATE DEHYDROGENASE-LIKE PROTEIN, CYTOKINE-LIKE NUCLEAR FACTOR N-PAC, GLYOXYLATE REDUCTASE 1 HOMOLOG, NUCLEAR PROTEIN NP60, NUCLEAR PROTEIN OF 60 KDA PLGSPEFSERGSKSPLKRAQEQSPRKRGRPPKDEKDLTIPESSTVKGMMAGPMAAFKWQPTASEPVKDADPHFHHFLLSQTEKPAVCYQAITKKLKICEEETGSTSIQAADSTAVNGSITPTDK 124 T 0.017 AT_hook pdb F Eukaryota T 4gus 2 B B GLYR1_HUMAN NPAC, 3-HYDROXYISOBUTYRATE DEHYDROGENASE-LIKE PROTEIN, CYTOKINE-LIKE NUCLEAR FACTOR N-PAC, GLYOXYLATE REDUCTASE 1 HOMOLOG, NUCLEAR PROTEIN NP60, NUCLEAR PROTEIN OF 60 KDA PLGSPEFSERGSKSPLKRAQEQSPRKRGRPPKDEKDLTIPESSTVKGMMAGPMAAFKWQPTASEPVKDADPHFHHFLLSQTEKPAVCYQAITKKLKICEEETGSTSIQAADSTAVNGSITPTDK 124 T 0.017 AT_hook pdb F Eukaryota T 4gut 2 B B GLYR1_HUMAN NPAC, 3-HYDROXYISOBUTYRATE DEHYDROGENASE-LIKE PROTEIN, CYTOKINE-LIKE NUCLEAR FACTOR N-PAC, GLYOXYLATE REDUCTASE 1 HOMOLOG, NUCLEAR PROTEIN NP60, NUCLEAR PROTEIN OF 60 KDA PLGSPEFSERGSKSPLKRAQEQSPRKRGRPPKDEKDLTIPESSTVKGMMAGPMAAFKWQPTASEPVKDADPHFHHFLLSQTEKPAVCYQAITKKLKICEEETGSTSIQAADSTAVNGSITPTDK 124 T 0.017 AT_hook pdb F Eukaryota T 4guu 2 B B GLYR1_HUMAN NPAC, 3-HYDROXYISOBUTYRATE DEHYDROGENASE-LIKE PROTEIN, CYTOKINE-LIKE NUCLEAR FACTOR N-PAC, GLYOXYLATE REDUCTASE 1 HOMOLOG, NUCLEAR PROTEIN NP60, NUCLEAR PROTEIN OF 60 KDA PLGSPEFSERGSKSPLKRAQEQSPRKRGRPPKDEKDLTIPESSTVKGMMAGPMAAFKWQPTASEPVKDADPHFHHFLLSQTEKPAVCYQAITKKLKICEEETGSTSIQAADSTAVNGSITPTDK 124 T 0.017 AT_hook pdb F Eukaryota T 4gvb 1 A A KP6T_UMV6 VP10 NNAFCAGFGLSCKWECWCTAHGTGNELRYATAAGCGDHLSKSYYDARAGHCLFSDDLRNQFYSHCSSLNNNMSCRSLSKR 80 T 0.3 YobH unp T Viruses T 4gvb 2 B B KP6T_UMV6 VP12.5 GKRPRPVMCQCVDTTNGGVRLDAVTRAACSIDSFIDGYYTEKDGFCRAKYSWDLFTSGQFYQACLRYSHAGTNCQPDPQYE 81 T 0.0014 DUF5948 pdb T Viruses T 4gvc 2 B B SDC1_HUMAN SYND1 TKQEEFXA 8 F F Eukaryota T 4gvd 2 C,D C,D SDC1_HUMAN SYND1 TKQEEFYA 8 F F Eukaryota T 4gvu 2 B B Lyngbyastatin 7 XQTXXFXV 8 T 80 APC_15aa pdbhh F F 4gw1 3 E,F E,F cQFD meditope CQFDLSTRRLKC 12 T 3.1 Flavi_NS1 pdbhh F T 4gw5 3 E,F E,F cQYN meditope CQYNLSSRALKC 12 T 1.9 DUF6464 pdbhh F T 4gw8 2 B B Consensus peptide (Pimtide) ARKRRRHPSGPPTA 14 T 1.5 DUF3019 pdbhh F T 4gxb 2 B B LYAM3_MOUSE P-selectin GASAGSSKRLRKKDDGKCPLNPHSHLGTYGVFTNAAYDPTP 41 T 0.58 Syndecan unppercent F Eukaryota T 4gxl 2 B B CACO2_HUMAN ANTIGEN NUCLEAR DOT 52 KDA PROTEIN, NUCLEAR DOMAIN 10 PROTEIN NDP52, NUCLEAR DOMAIN 10 PROTEIN 52, NUCLEAR DOT PROTEIN 52 ARQNPGLAYGNPYS 14 T 2.2 Bac_GH3_C pdbhh F Eukaryota T 4gye 2 C C P1F peptide RVXEAX 6 T 3.7 Adaptin_binding pdbhh F F 4gyw 2 B,D B,D CSK21_HUMAN CK II ALPHA YPGGSTPVSSANMM 14 T 13 Pr_beta_C pdbhh F Eukaryota T 4gyy 2 B,D B,D CSK21_HUMAN CK II ALPHA YPGGSTPVSSANMM 14 T 13 Pr_beta_C pdbhh F Eukaryota T 4gz3 2 B,D B,D CSK21_HUMAN CK II ALPHA YPGGSTPVSSANMM 14 T 13 Pr_beta_C pdbhh F Eukaryota T 4gzf 2 C C LrF peptide RVXEAX 6 T 3.7 Adaptin_binding pdbhh F F 4h0h 2 B D peptide DVFYPYPYASGS 12 T 1.8 XRN_M pdbhh F T 4h1l 3 C,F C,F mimotope peptide QHIRCNIPKRISA 13 T 1.4 DUF3091 pdbhh F T 4h25 3 C,F C,F peptide QHIRCNIPKRIGPSKVATLVPR 22 T 6.6 SpdB pdbhh F T 4h26 3 C,F C,F peptide QWIRVNIPKRI 11 T 0.54 DUF2096 pdbhh F T 4h36 2 B B ATF2_HUMAN CAMP-DEPENDENT TRANSCRIPTION FACTOR ATF-2, ACTIVATING TRANSCRIPTION FACTOR 2, CYCLIC AMP-RESPONSIVE ELEMENT-BINDING PROTEIN 2, CREB-2, CAMP-RESPONSIVE ELEMENT-BINDING PROTEIN 2, HB16, CAMP RESPONSE ELEMENT-BINDING PROTEIN CRE-BP1 KHEMTLKF 8 T 0.0017 zf_C2H2_6 unphh F Eukaryota T 4h39 2 B B JIP1_HUMAN JIP-1, JNK-INTERACTING PROTEIN 1, ISLET-BRAIN 1, IB-1, JNK MAP KINASE SCAFFOLD PROTEIN 1, MITOGEN-ACTIVATED PROTEIN KINASE 8-INTERACTING PROTEIN 1 PKRPTTLNLF 10 T 6.8 Lipoprotein_19 pdbhh F Eukaryota T 4h3b 2 B,D B,D 3BP5_HUMAN SH3BP-5, SH3 DOMAIN-BINDING PROTEIN THAT PREFERENTIALLY ASSOCIATES WITH BTK VVRPGSLDLP 10 T 0.44 DUF5748 pdbhh F Eukaryota T 4h3h 3 E F Pol II CTD peptide SPTSPSYSPP 10 T 0.00024 RNA_pol_Rpb1_R pdbhh F F 4h3k 3 E F Hexapeptide SYSPTSPSYS 10 T 0.0029 RNA_pol_Rpb1_R pdbhh F F 4h3p 2 C,D B,E KS6A1_HUMAN S6K-ALPHA-1, 90 KDA RIBOSOMAL PROTEIN S6 KINASE 1, P90-RSK 1, P90RSK1, P90S6K, MAP KINASE-ACTIVATED PROTEIN KINASE 1A, MAPK-ACTIVATED PROTEIN KINASE 1A, MAPKAP KINASE 1A, MAPKAPK-1A, RIBOSOMAL S6 KINASE 1, RSK-1 PQLKPIEASILAARRVRKLPSTTL 24 T 7.4 COX5A pdbhh F Eukaryota T 4h3q 2 B B MP2K2_HUMAN MAP KINASE KINASE 2, MAPKK 2, ERK ACTIVATOR KINASE 2, MAPK/ERK KINASE 2, MEK 2 RRKPVLPALTINP 13 T 1.4 DHHA2 pdbhh F Eukaryota T 4h4f 3 C Q CTRC_HUMAN CALDECRIN CGVPSFPPNL 10 T 2.9 POPLD pdbhh F Eukaryota T 4h4n 1 A A A0A6L7H4C2_BACAN hypothetical protein BA_2335 SNAMEKKPIAFKVPPNSKLKVTFFGPYNEVITNVSIINQLSTPKCQTITRYPNYTKYETEVRSLSSC 67 T 2.7 PA-IIL pdbhh F Bacteria T 4h8f 1 A,B A,B CC-Hex-II-Phi22 XGEIKAIAQEIKAIAKEIKAIAXEIKAIAQGYX 33 T 0.0015 DUF2312 pdbpssm F T 4h8l 1 A,B,C,D,E,F A,B,C,D,E,F CC-Hex-D24-A5/7C XGELKCICQELKAIAWELKAIAKEDKAIAQGAGX 34 T 2.6 DUF5741 pdbhh F T 4h8m 1 A,B A,B CC-Hex-H24-A5/7C XGELKCICQELKAIAKELKAIAWEHKAIAQGX 32 T 9.4 KELAA pdbhh F T 4h8o 1 A,B,C,D,E,F A,B,C,D,E,F CC-Hex-N24 XGELKAIAQELKAIAYELKAIAKENKAIAQGX 32 T 0.083 DUF5660 pdbpssm F T 4h9n 3 C C DAXX_HUMAN DAXX, HDAXX, ETS1-ASSOCIATED PROTEIN 1, EAP1, FAS DEATH DOMAIN-ASSOCIATED PROTEIN SPRTRGSRRQIQRLEQLLALYVAEIRRLQEKELDLSELDDPDSAYLQEARLKRKLIRLFGRLCELKDCSSLTGRVIEQRIPYRGTRYPEVNRRIERLINKPGPDTFPDYGDVLRAVEKAAARHSLGLPRQQLQLMAQDAFRDVGIRLQERRHLDLIYNFGCHLTDDYRPGVDPALSDPVLARRLRENRSLAMSRLDEVISKYAMLQDKSEEG 212 T 0.0026 Syntaxin_2 pdbpercent F Eukaryota T 4h9o 3 C C DAXX_HUMAN DAXX, HDAXX, ETS1-ASSOCIATED PROTEIN 1, EAP1, FAS DEATH DOMAIN-ASSOCIATED PROTEIN SPRTRGSRRQIQRLEQLLALYVAEIRRLQEKELDLSELDDPDSAYLQEARLKRKLIRLFGRLCELKDCSSLTGRVIEQRIPYRGTRYPEVNRRIERLINKPGPDTFPDYGDVLRAVEKAAARHSLGLPRQQLQLMAQDAFRDVGIRLQERRHLDLIYNFGCHLTDDYRPGVDPALSDPVLARRLRENRSLAMSRLDEVISKYAMLQDKSEEG 212 T 0.0026 Syntaxin_2 pdbpercent F Eukaryota T 4h9p 3 C C DAXX_HUMAN DAXX, HDAXX, ETS1-ASSOCIATED PROTEIN 1, EAP1, FAS DEATH DOMAIN-ASSOCIATED PROTEIN SPRTRGSRRQIQRLEQLLALYVAEIRRLQEKELDLSELDDPDSAYLQEARLKRKLIRLFGRLCELKDCSSLTGRVIEQRIPYRGTRYPEVNRRIERLINKPGPDTFPDYGDVLRAVEKAAARHSLGLPRQQLQLMAQDAFRDVGIRLQERRHLDLIYNFGCHLTDDYRPGVDPALSDPVLARRLRENRSLAMSRLDEVISKYAMLQDKSEEG 212 T 0.0026 Syntaxin_2 pdbpercent F Eukaryota T 4h9q 3 C C DAXX_HUMAN DAXX, HDAXX, ETS1-ASSOCIATED PROTEIN 1, EAP1, FAS DEATH DOMAIN-ASSOCIATED PROTEIN SPRTRGSRRQIQRLEQLLALYVAEIRRLQEKELDLSELDDPDSAYLQAARLKRKLIRLFGRLCELKDCSSLTGRVIEQRIPYRGTRYPEVNRRIERLINKPGPDTFPDYGDVLRAVEKAAARHSLGLPRQQLQLMAQDAFRDVGIRLQERRHLDLIYNFGCHLTDDYRPGVDPALSDPVLARRLRENRSLAMSRLDEVISKYAMLQDKSEEG 212 T 0.0026 Syntaxin_2 pdbpercent F Eukaryota T 4h9r 3 C C DAXX_HUMAN DAXX, HDAXX, ETS1-ASSOCIATED PROTEIN 1, EAP1, FAS DEATH DOMAIN-ASSOCIATED PROTEIN SPRTRGSRRQIQRLEQLLALYVAEIRRLQEKELDLSELDDPDSAYLQAARLKRKLIRLFGRLCELKDCSSLTGRVIEQRIPYRGTRYPEVNRRIERLINKPGPDTFPDYGDVLRAVEKAAARHSLGLPRQQLQLMAQDAFRDVGIRLQERRHLDLIYNFGCHLTDDYRPGVDPALSDPVLARRLRENRSLAMSRLDEVISKYAMLQDKSEEG 212 T 0.0026 Syntaxin_2 pdbpercent F Eukaryota T 4hab 2 D,E,F D,E,F PL-49 XLHSTMX 7 T 330 Thioredoxin_16 pdbhh F T 4han 2 C,D C,D CACO2_HUMAN ANTIGEN NUCLEAR DOT 52 KDA PROTEIN, NUCLEAR DOMAIN 10 PROTEIN NDP52, NUCLEAR DOMAIN 10 PROTEIN 52, NUCLEAR DOT PROTEIN 52 PGLAYGNPYSGIQE 14 T 1.9 DUF4326 pdbhh F Eukaryota T 4hd8 2 B F Fluor-de-Lys peptide RHKX 4 T 360 STOP pdbhh F F 4hda 2 C F Fluor-de-Lys peptide RHKX 4 T 360 STOP pdbhh F F 4hdq 3 C C HEG1_HUMAN HEG1 SRHSCIFPGQYNPSFISDESRRRDYF 26 T 3.4 LAX unphh F Eukaryota T 4hga 1 A A DAXX_HUMAN DAXX, HDAXX, ETS1-ASSOCIATED PROTEIN 1, EAP1, FAS DEATH DOMAIN-ASSOCIATED PROTEIN GPLQDPSRRQIQRLEQLLALYVAEIRRLQEKELDLSELDDPDSAYLQEARLKRKLIRLFGRLCELKDCSSLTGRVIEQRIPYRGTRYPEVNRRIERLINKPGPDTFPDYGDVLRAVEKAAARHSLGLPRQQLQLMAQDAFRDVGIRLQERRHLDLIYNFGCHLTDDYRPGVDPALSDPVLARRLRENRSLAMSRLDEVISKYAMLQDKSEEGE 213 T 0.017 Latarcin pdbpssm F Eukaryota T 4hgc 2 B I SFTI1_HELAN SFTI-1 GRCTXSIPPICFPD 14 T 0.022 Bowman-Birk_leg pdb F Eukaryota T 4hh6 2 B Z Peptide from EAEC T6SS Sci1 SciI protein KKWDSVYASLFEKINLKK 18 T 0.51 Rtf2 pdbhh F T 4hha 3 C P Q9YKI2_HCMV Glycoprotein B ETIYNTTLKY 10 T 5.1E-05 HCMVantigenic_N unphh T Viruses T 4hic 1 A,B A,B Q8L1C9_ENTFL TraK MKHHHHHHHSDYDIPTTENLYFQGSGSTNKNQPPVTPTATTASKESNQSETSGEATENSSQAVQGSSDHLLKLSAKERADEATEAFESWYKSFSNGDVILEINKELLKEGSGGTSPIELQTKLIDNLKAKFGDKVSDDFYTSLQASFNFNPVIVDGTKGLTISKQNDDESQWFSTWFLDTEKKEKNTKIIVRNDFPFEWVDWRNKGQHDEKVGKIFKNVDWDNDLSYEVIGIDFTEATKNIETNQILFVQMHYNEKIGKWQVTGNVGGVY 270 T 1.4 RCR unphh F Bacteria T 4hiv 2 C,D C,D ACTINOMYCIN D TXPXXXTXPXX 11 T 61 DUF5572 pdbhh F F 4hjb 1 A,B,C,D C,A,B,D GCN4pLI(alpha/beta/cyclic-gamma) XRMKQIEDKLEEILSKLYHIEXELXIKXLLGER 33 T 0.0073 VGPC1_C pdbhh F T 4hjd 1 A,B A,B GCN4pLI(alpha/beta/acyclic gamma) XRMKQIEDKLEEILXKLXIEXELARIKKLLYER 33 T 0.0071 VGPC1_C pdbhh F T 4hlb 1 A A B6WUJ7_9DELT Uncharacterized protein GAEQQADTVTENSDSEVFVDDSDRFTAFEEELLARYADKGIRSVDVAAYAKGIDIVFVAADRKMTRAEFSAIASRSIRELKERFGFDKDVPIGAVLDYKKDAATDTRTRFVLKLR 115 T 0.01 DUF4999 unp F Bacteria T 4hp2 1 A,B A,B THCL_STRAJ ALANINAMIDE, BRYAMYCIN, GARGON, THIACTIN XIAXASXTXXXXTXXXXXX 19 T 1.5 DUF4803 pdbhh F Bacteria F 4hqr 2 C,D E,F Ac-Asp-Glu-Val-Asp-Aldehyde XDEVX 5 T 570 Helicase_RecD pdbhh F F 4hr6 1 A A U3KRF6_TRIAN SGSL, A ALPHA ANLRLSEANSGTYKTFIGRVREELGSETYRLYGIPVLKHSL 41 T 0.00035 RIP pdbhh F Eukaryota T 4hre 3 E,F,K,L G,H,K,L HLTF_HUMAN DNA-BINDING PROTEIN/PLASMINOGEN ACTIVATOR INHIBITOR 1 REGULATOR, HIP116, RING FINGER PROTEIN 80, SWI/SNF-RELATED MATRIX-ASSOCIATED ACTIN-DEPENDENT REGULATOR OF CHROMATIN SUBFAMILY A MEMBER 3, SUCROSE NONFERMENTING PROTEIN 2-LIKE 3 PRLSYPTFFPRFEF 14 T 14 LegC3_N pdbhh F Eukaryota T 4hrg 2 C,D C,D AHNK_HUMAN DESMOYOKIN QKVTFPKMKIPKFTF 15 T 5 DUF5476 pdbhh F Eukaryota T 4hrh 2 C,D C,D HLTF_HUMAN DNA-BINDING PROTEIN/PLASMINOGEN ACTIVATOR INHIBITOR 1 REGULATOR, HIP116, RING FINGER PROTEIN 80, SWI/SNF-RELATED MATRIX-ASSOCIATED ACTIN-DEPENDENT REGULATOR OF CHROMATIN SUBFAMILY A MEMBER 3, SUCROSE NONFERMENTING PROTEIN 2-LIKE 3 PRLSYPTFFPRFEF 14 T 14 LegC3_N pdbhh F Eukaryota T 4hsu 2 B B GLYR1_HUMAN NPAC, 3-HYDROXYISOBUTYRATE DEHYDROGENASE-LIKE PROTEIN, CYTOKINE-LIKE NUCLEAR FACTOR N-PAC, GLYOXYLATE REDUCTASE 1 HOMOLOG, NUCLEAR PROTEIN NP60, NUCLEAR PROTEIN OF 60 KDA PLGSPEFSERGSKSPLKRAQEQSPRKRGRPPKDEKDLTIPESSTVKGMMAGPMAAFKWQPTASEPVKDADPHFHHFLLSQTEKPAVCYQAITKKLKICEEETGSTSIQAADSTAVNGSITPTDK 124 T 0.017 AT_hook pdb F Eukaryota T 4ht6 2 B,D,F B,D,F PAC11_YEAST WD repeat-containing protein PAC11 ITYDKGIQTDQ 11 T 0.55 SAICAR_synt unp F Eukaryota T 4htp 2 C,D C,E DCR1C_HUMAN DNA CROSS-LINK REPAIR 1C PROTEIN, PROTEIN A-SCID, SNM1 HOMOLOG C, HSNM1C, SNM1-LIKE PROTEIN DVPQWEVFFKR 11 T 1.6 DUF4570 pdbhh F Eukaryota T 4hva 2 C,D C,D VEID Inhibitor XVEIX 5 T 650 DUF72 pdbhh F F 4hvu 2 B B SYNTHETIC PEPTIDE Acetyl-APPLPPRNRP XAPPLPPRNRP 11 T 0.21 SCIMP pdbhh F T 4hvv 2 B B SYNTHETIC PEPTIDE Acetyl-APPLPPRNRP XAPPLPPRNRP 11 T 0.21 SCIMP pdbhh F T 4hvw 2 B B SYNTHETIC PEPTIDE Acetyl-VSLARRPLPPLP XVSLARRPLPPLP 13 T 0.95 DUF4522 pdbhh F T 4hw4 2 C,D C,D Mcl-1 BH3 peptide XALETLRRVGDGVQRNHX 18 T 14 BALF1 pdbhh F T 4hx0 1 A A Q9X0A5_THEMA Putative nucleotidyltransferase TM1012 HHHHHMIRPEYLRVLRKIYDRLKNEKVNWVVTGSLSFALQGVPVEVHDIDIQTDEEGAYEIERIFSEFVSKKVRFSSTEKICSHFGELIIDGIKVEIMGDIRKRLEDGTWEDPVDLNKYKRFVETHGMKIPVLSLEYEYQAYLKLGRVEKAETLRKWLNERKG 163 T 0.00088 NTP_transf_5 unppercent F Bacteria T 4hxj 2 C C ITB3_HUMAN C-terminal 3-mer peptide from Integrin beta-3 RGT 3 T 190 Toxin_27 pdbhh F Eukaryota F 4hy2 2 B D PL-42 XLHSTMX 7 T 330 Thioredoxin_16 pdbhh F T 4hy7 2 B B Cyclosporin A XXXXXXXXVXA 11 T 9.7 DUF6090 pdbhh F F 4hy9 2 C,D C,D PYRRH_PYRAP Pyrrhocoricin VDKLYXXPRPTT 12 T 2.5 Apidaecin unphh F Eukaryota T 4hyb 2 C,D C,D PYRRH_PYRAP Pyrrhocoricin VDKLYXIPRPP 11 T 2.5 Apidaecin unphh F Eukaryota T 4hys 2 B B JIP1_HUMAN JIP-1, JNK-INTERACTING PROTEIN 1, ISLET-BRAIN 1, IB-1, JNK MAP KINASE SCAFFOLD PROTEIN 1, MITOGEN-ACTIVATED PROTEIN KINASE 8-INTERACTING PROTEIN 1 RPKRPTTLNLF 11 T 12 CTP-dep_RFKase pdbhh F Eukaryota T 4hyu 2 B B JIP1_HUMAN JIP-1, JNK-INTERACTING PROTEIN 1, ISLET-BRAIN 1, IB-1, JNK MAP KINASE SCAFFOLD PROTEIN 1, MITOGEN-ACTIVATED PROTEIN KINASE 8-INTERACTING PROTEIN 1 RPKRPTTLNLF 11 T 12 CTP-dep_RFKase pdbhh F Eukaryota T 4i1p 2 B,D B,D tetrapeptide ALRSX 5 T 400 Triabin pdbhh F F 4i2w 2 B B HSP7A_CAEEL Heat shock 70 kDa protein A AGGPTIEEVD 10 T 0.49 EcsC unppssm F Eukaryota T 4i2z 2 B B HSP90_CAEEL ABNORMAL DAUER FORMATION PROTEIN 21 EDASRMEEVD 10 T 8.1 TEX12 pdbhh F Eukaryota T 4i4o 1 A,B A,B R4GRU5_BOLED BEL beta-trefoil VNFPNIPAEGVQFRLRARDTGYVIYSRTENPPLVWQYNGPPYDDQLFTLIYGTGPRKNLYAIKSVPNGRVLFSRTSASPYVGNIAGDGTYNDNWFQFIQDDNDPNSFRIYNLASDTVLYSRTTADPKFGNFTGAKYDDQLWHFELV 146 T 2.2E-05 RicinB_lectin_2 pdbpercent F Eukaryota T 4i4p 1 A,B A,B R4GRU5_BOLED BEL-beta trefoil VNFPNIPAEGVQFRLRARDTGYVIYSRTENPPLVWQYNGPPYDDQLFTLIYGTGPRKNLYAIKSVPNGRVLFSRTSASPYVGNIAGDGTYNDNWFQFIQDDNDPNSFRIYNLASDTVLYSRTTADPKFGNFTGAKYDDQLWHFELV 146 T 2.2E-05 RicinB_lectin_2 pdbpercent F Eukaryota T 4i4q 1 A A R4GRU4_BOLED BEL-beta trefoil VNFPNIPAEGVRFRLRARDSGYVIYSRTENDPLVWHYNGPPYDDQLFTLIHGTGSRLNLYAIKSVPNGRVLFSRNSASPTVGNIVGDGTYNDNWFQFIQDDNDANSFRIYSLASDSVLYSRTTGAPQFGNYTGPKFDDQLWHFEIV 146 T 0.00014 RicinB_lectin_2 pdb F Eukaryota T 4i4r 1 A,B,C,D A,B,C,D R4GRU5_BOLED BEL-beta trefoil VNFPNIPAEGVQFRLRARDTGYVIYSRTENPPLVWQYNGPPYDDQLFTLIYGTGPRKNLYAIKSVPNGRVLFSRTSASPYVGNIAGDGTYNDNWFQFIQDDNDPNSFRIYNLASDTVLYSRTTADPKFGNFTGAKYDDQLWHFELV 146 T 2.2E-05 RicinB_lectin_2 pdbpercent F Eukaryota T 4i4u 1 A,B,C,D A,B,C,D R4GRU5_BOLED BEL beta-trefoil VNFPNIPAEGVQFRLRARDTGYVIYSRTENPPLVWQYNGPPYDDQLFTLIYGTGPRKNLYAIKSVPNGRVLFSRTSASPYVGNIAGDGTYNDNWFQFIQDDNDPNSFRIYNLASDTVLYSRTTADPKFGNFTGAKYDDQLWHFELV 146 T 2.2E-05 RicinB_lectin_2 pdbpercent F Eukaryota T 4i4v 1 A,B,C,D A,B,C,D R4GRU5_BOLED BEL beta-trefoil VNFPNIPAEGVQFRLRARDTGYVIYSRTENPPLVWQYNGPPYDDQLFTLIYGTGPRKNLYAIKSVPNGRVLFSRTSASPYVGNIAGDGTYNDNWFQFIQDDNDPNSFRIYNLASDTVLYSRTTADPKFGNFTGAKYDDQLWHFELV 146 T 2.2E-05 RicinB_lectin_2 pdbpercent F Eukaryota T 4i4w 3 C C Immunogenic peptide ILAKFLHRL 9 T 5.7 SRC-1 pdbhh F T 4i4x 1 A,B,C,D A,B,C,D R4GRU9_BOLED BEL beta-trefoil VNFPNIPAEGVQFRLRARDTGYVIYSRTENPPLVWQYNGPPYDDQLFTLIYGTGPRKNLYAIKSVPNGRVLFSRTSASPYVGNIAGDGTYNDNWFQFIQDDNDPNSFRIYDLASDTVLYSRTTADPKFGNFTGAKYDDQLWHFELV 146 T 0.00012 RicinB_lectin_2 pdbpercent F Eukaryota T 4i51 3 E K UNKNOWN PEPTIDE XXXXXXXXXXXXX 13 F F F 4i5b 3 C,F C,F truncated hemagglutinin peptide VVKQNCLKLATK 12 T 19 Hemagglutinin pdbhh F T 4i5l 4 G,H G,H Microcystin-LR (MCLR) bound form XLXRXXX 7 T 55 Flagellar_put pdbhh F F 4i5n 4 G,H G,H Microcystin-LR (MCLR) bound form XLXRXXX 7 T 55 Flagellar_put pdbhh F F 4i79 2 C C floating chain, unknown sequence XXXXXXXXX 9 F F F 4i7b 2 B,D B,D PHYL_DROME Protein phyllopod XKLRPVXMVRPTVR 14 T 0.45 RNase_PH unppercent F Eukaryota T 4i7c 2 B,D B,D PHYL_DROME Protein phyllopod XKLRPVXMVRPWVR 14 T 0.45 RNase_PH unppercent F Eukaryota T 4i7d 2 B,D B,D PHYL_DROME Protein phyllopod XKLRPVAMVRPXVR 14 T 0.45 RNase_PH unppercent F Eukaryota T 4i80 2 B B macrocyclic peptidomimetic XRWXFPARP 9 T 4.4 DUF2842 pdbhh F T 4i9c 2 B C PhrF QRGMI 5 T 57 GWxTD_dom pdbhh F F 4ib5 2 D,E,F,G D,E,F,G CK2beta-derived cyclic peptide GCRLYGFKIHGCG 13 T 1.2 Speriolin_C pdbhh F T 4icz 2 B F HER2 NLXXW 5 T 23 Lipoprotein_15 pdbhh F F 4iea 2 B P RAF1_HUMAN PROTO-ONCOGENE C-RAF, CRAF, RAF-1 RSASEPSL 8 T 10 Baculo_LEF5_C pdbhh F Eukaryota T 4iel 2 C C Tripeptide likely a portion of the N-terminal tag XXX 3 F F F 4if6 2 B B SIR1_HUMAN NAD-dependent protein deacetylase sirtuin-1 GPHMGSQYLFLPPNRYIFHGAEVYSDSEDDV 31 T 13 Rxt3 pdbhh F Eukaryota T 4ifd 11 K K RRP6_YEAST RIBOSOMAL RNA-PROCESSING PROTEIN 6 RSMEATPIPSSETKADGILLETISVPQIRDVMERFSVLCNSNISKSRAKPVTNSSILLGKILPREEHDIAYSKDGLPNKVKTEDIRIRAQNFKSALANLEDIIFEIEKPLVVPVKLEEIKTVDPASAPNHSPEIDNLDDLVVLKKKNIQKKQPAKEKGVTEKDAVDYSKIPNILSNKPG 179 T 0.21 Laminin_N pdb F Eukaryota T 4ifi 2 B B BRAT1_HUMAN BAAT peptide RSPVFS 6 T 0.44 DUF4449 pdbhh F Eukaryota F 4ifl 2 B P NF2L2_HUMAN Nrf2 peptide AFFAQLQLDEETGEFL 16 T 0.18 DUF4585 pdbhh F Eukaryota T 4ig9 2 B,D,F,H B,D,F,H SIR1_HUMAN NAD-dependent protein deacetylase sirtuin-1 GPHMGSQYLFLPPNRYIFHGAEVYSDSEDDV 31 T 13 Rxt3 pdbhh F Eukaryota T 4igk 2 C,D C,D ATRIP_HUMAN ATM AND RAD3-RELATED-INTERACTING PROTEIN ACSPQFG 7 T 0.12 Toxin_18 pdbhh F Eukaryota T 4igq 2 B B mathylated H3K4 substrate TKQ 3 T 590 zf-C2H2_4 pdbhh F F 4ihl 2 C P RAF1_HUMAN PROTO-ONCOGENE C-RAF, CRAF, RAF-1 QHRYSTPHAFTFNTSSPSSEGSLSQRQRSTSTPNVH 36 T 0.23 DUF1780 pdbpssm F Eukaryota T 4iho 3 C,F C,F NONAMERIC PEPTIDE CHIMERIC GP100 EGPRNQDWL 9 T 2.8 APOC4 pdbhh F T 4ii9 2 B B 5-mer peptide AXCXX 5 T 130 zf-CCHC pdbhh F F 4iik 1 A A SIDD_LEGPH DE-AMPYLASE SIDD, DEAMPYLASE SIDD, ADENYLYL-[RAB1] HYDROLASE MRSIITQICNGVLHGQSYQSGSNDLDKGNSEIFASSLFVHLNEQGKEIIKHKDSDDKIVIGYTKDGMAFQIVVDGFYGCERQAVFSFIDNYVLPLIDNFSLDLTRYPDSKKVTESLIHTIYSLRSKHAPLAEFTMSLCVTYQKDEQLFCAGFGIGDTGIAIKRNEGTIEQLVCHTEVDGFKDAFDNYSSANIDLVIERNSVFNTKVMPGDELVGYTYVPPMLEMTEKEFEVETVDGKKINKRIVRHLNLDPGNFDDKDPLFSQLLQVVKSKQKQLVEQAKETGQIQRFGDDFTVGRLVIPDQLLINQLRIHALSHHHHH 319 T 0.18 Glycos_trans_3N pdbpercent F Bacteria T 4iim 2 B,D,E C,D,E peptide ligand WRDSSGYVMGPW 12 T 1 Galanin pdbhh F T 4iio 2 C C Synthetic Peptide XWRGSLSYLKGPL 13 T 0.56 CRPV_capsid pdbhh F T 4iip 1 A A SIDD_LEGPH DE-AMPYLASE SIDD, DEAMPYLASE SIDD, ADENYLYL-[RAB1] HYDROLASE MRSIITQICNGVLHGQSYQSGSNDLDKGNSEIFASSLFVHLNEQGKEIIKHKDSDDKIVIGYTKDGMAFQIVVAGFYGCERQAVFSFIDNYVLPLIDNFSLDLTRYPDSKKVTESLIHTIYSLRSKHAPLAEFTMSLCVTYQKDEQLFCAGFGIGDTGIAIKRNEGTIEQLVCHTEVDGFKDAFDNYSSANIDLVIERNSVFNTKVMPGDELVGYTYVPPMLEMTEKEFEVETVDGKKINKRIVRHLNLDPGNFDDKDPLFSQLLQVVKSKQKQLVEQAKETGQIQRFGDDFTVGRLVIPDQLLINQLRIHALSHHHHH 319 T 0.17 PP2C_2 pdbpssm F Bacteria T 4ij8 2 C I helical peptide XXXXXXXXXX 10 F F F 4ijy 1 A A Q93I65_ECOLX CofJ SPSSSEGGAFTVNMPKTSTVDDIRGCPTLETPLKLTFTEDIQPRKENGSTYFYYDGWRGVGQTVNPWSPVLDNHKYAATEHEIHIYVEFFQTPSNRFADKNGAYSYIDANGVMYTNGEYSWEHVPALGKNIYKVVISDWNKGQTKSIYLPGRDFKTVEVFHFQNNRPQWDDRNSYENVKSRINNNISKSYSKAKLNEQLSTYVHDDGTDSLFLYQKLSRASLKESQINYYQLRGKFNGVNLGYWAQEYILFGGEGAEQLKNKIPDMSNYSMEDNGSFKNALKIESLDLRLMDNNRMAYGSTGTYIASFNRTDFSMTPENLKACGLD 326 T 9.5 DUF4999 unphh F Bacteria T 4ika 2 B D B2ZUN0_9ENTO VPg GAYSGAPKQVLKKPALRTATVQ 22 T 6.7 DUF2111 pdbhh T Viruses T 4il7 1 A A Q6Q0L4_9VIRU HYPOTHETICAL PROTEIN A223 MSHEGLSPIPGEGTGIQLSAGQILKFYNVPIAEIIVEYDPSNVSGVSSNVKLKGTIHPLFEVPSQISIENFQPTENYLIYSGFGTSLPQTYTIPANGYLIISITNTSTGNIGQITLTIGSTTMTFNLQTGENKIPVIAGTQITNMTLTSSSAILIYEEVIHHHHHH 166 T 14 DUF1989 pdbhh T Viruses T 4imi 3 E F CTD SPSYSPTSPSYSPTSPSYS 19 T 1.8E-05 RNA_pol_Rpb1_R pdbhh F F 4imj 3 E F CTD SPSYSPTSPSYSPTSPSYS 19 T 1.8E-05 RNA_pol_Rpb1_R pdbhh F F 4imq 2 B B PEPTIDE INHIBITOR, syc8 XLFX 4 T 660 EF-hand_6 pdbhh F F 4imz 2 B B peptide inhibitor, syc 10 XLFX 4 T 660 EF-hand_6 pdbhh F F 4in9 2 B B Peptide SER-TRP-PHE-PRO SWFP 4 T 22 Mak_N_cap pdbhh F F 4ind 1 A,B,C,D,E,F,G,H,I,J,K,L,M,N,O,P,Q,R,S,T A,B,C,D,E,F,G,H,I,J,K,L,M,N,O,P,Q,R,S,T Q6Q0L3_9VIRU C381 turret protein MSVTTLGQSFPANAKVKYYYKLSEKQDLDAFVNSIFVGSYKLKQISYLLYGNTKIVSAPVVPLGPNASIIIDDELQEGLYLIRIKVYNTNSFSVTVTPFFNNNNTMTYSIGANSEFEIYDIFTKEQGNIYYIQLPPGLAILEFSLERVFEKGNRINIPKIIHTSGNGYISFRLRKGTYAIKMPYSYNNTTSTTFTNFQFGTISTSVATIPLVISSIPANGSGSGTFLVYLKITGDYEDVKFSVTYGGGLGVPFTFGLEVEEINELVENTNFVTQSVTLSGSQVTQSILNVQGSGSHLRLKYASVSGLTTAVTQCQLQATNLNRSTTYSTVWDFIAGGSSTPPSWDIREINSIQLVANGGSSTSSVTITLILVYEQIAGELSHHHHHH 387 T 0.2 CBM_48 pdb T Viruses T 4inh 2 I,J,K,L,M,N,O,P J,M,S,T,N,O,P,Q peptide inhibitor, syc59 XALX 4 T 1700 zf-C2H2_6 pdbhh F F 4ioi 5 E C meditope CQFDLSTRRLKC 12 T 3.1 Flavi_NS1 pdbhh F T 4iox 2 D D peptide XXXXXX 6 F F F 4ip3 1 A A Q8VSD5_SHIFL ORF169b GSMINGVSLQGTAGYEAHTEEGNVNVKKLLESLNSKSLGDMDKDSELAATLQKMINPSGGDGNASGCALHACMAMLGYGVREAPVPNEISEYMTGFFHRHLEQIDSEGIVSHPNETYSKFRERIAENILQNTSKGSVVMISIEQATHWIAGFNDGEKIMFLDVQTGKGFNLYDPVEKSPDAFVDENSSVQVIHVSDQEFDHYANSSSWKSKRLC 214 T 0.13 Gln_amidase unppercent F Bacteria T 4ipz 2 B B cyclosporine SmBz-CsA XXXXXXXXVXA 11 T 22 Neurokinin_B pdbhh F F 4iqj 5 M,N,O,P M,N,O,P A0A0M9ACL9_THEAQ DNA polymerase III subunit gamma/tau HHHHHHKAGEAQDLAEGWRAFLEALKPTLRAFVREARPHLEGKTLVLRFPESKAFHHKKAEEQKAHLLPLARAQFGVEELAFVLEKKSLSGASPPPPTKPVPPREAPPPVAAPPPEPEPPLEDPPWEAEEGEDPSEELRRLARLLGGRLLWVRKPKAPEAEEPVSEDGIGGNGIMPP 177 T 0.00029 DNA_pol3_a_NII unppssm F Bacteria T 4irv 2 E,F,G,H E,F,G,H ASPP2_HUMAN BCL2-BINDING PROTEIN, BBP, RENAL CARCINOMA ANTIGEN NY-REN-51, TUMOR SUPPRESSOR P53-BINDING PROTEIN 2, 53BP2, P53-BINDING PROTEIN 2, P53BP2 GPKLASNAPRPLKKRSSITEPEGPNGPNIQKLLYQRTTIAAMETISVPSYPSKSASVTASSE 62 T 0.16 Dermcidin pdb F Eukaryota T 4is6 3 C C PMEL_HUMAN ME20-M, ME20M, MELANOCYTE PROTEIN PMEL 17, MELANOCYTES LINEAGE-SPECIFIC ANTIGEN GP100, MELANOMA-ASSOCIATED ME20 ANTIGEN, P1, P100, PREMELANOSOME PROTEIN, SILVER LOCUS PROTEIN HOMOLOG WNRQLYPEWTEAQRLD 16 T 0.96 Rv0078B pdbhh F Eukaryota T 4isq 2 D,E,F D,E,F SYT1_HUMAN SYNAPTOTAGMIN I, SYTI, P65 GEGKEDAFSKLKEKFMNELHK 21 T 0.01 PRIMA1 unphh F Eukaryota T 4isr 2 D,E,F D,E,F SYT2_RAT SYNAPTOTAGMIN II, SYTII GESQEDMFAKLKDKFFNEINK 21 T 0.027 DUF4713 unphh F Eukaryota T 4itz 2 C C substrate peptide XALPFX 6 T 160 zf-C2H2_6 pdbhh F F 4ivh 1 A A cyclo[Gln-Lys-Leu-Val-Phe-Phe-Ala-Glu-Asp-(delta-linked-Orn)-Hao-Lys-Hao-(p-bromoPhe)-Thr-(delta-linked-Orn)] TXQKLVFFAEDXXKXX 16 T 0.47 Beta-APP pdbhh F T 4ixq 20 NA,T G,Y Photosystem II reaction center protein Y XXXXXXXXXXXXXXXXXXXXXXXXXXXX 28 F F F 4ixr 20 NA,T Y,G Photosystem II reaction center protein Y XXXXXXXXXXXXXXXXXXXXXXXXXXXX 28 F F F 4izy 2 B B JIP1_HUMAN JIP-1, JNK-INTERACTING PROTEIN 1, ISLET-BRAIN 1, IB-1, JNK MAP KINASE SCAFFOLD PROTEIN 1, MITOGEN-ACTIVATED PROTEIN KINASE 8-INTERACTING PROTEIN 1 RPKRPTTLNLF 11 T 12 CTP-dep_RFKase pdbhh F Eukaryota T 4j09 2 B B YJHB_ECOLI Putative metabolite transport protein YjhB TNLYML 6 T 3.1 UPF0697 pdbhh F Bacteria F 4j24 2 E,F,G,H K,I,J,L 19-mer peptide LTARHPLLLRHLLQNSPSD 19 T 3.5 T4_Gp59_C pdbhh F T 4j26 2 B,D I,J 12-mer Peptide HPLLMRLLHHPS 12 T 2.9 HEAT pdbhh F T 4j2c 2 B,D B,D VPS51_HUMAN ANOTHER NEW GENE 2 PROTEIN, PROTEIN FAT-FREE HOMOLOG AHGMLKLYYGLSEGEAA 17 T 1.7 Ribosomal_S4 pdbhh F Eukaryota T 4j2j 2 D,E,F D,E,F CIC_HUMAN Protein capicua homolog EPRSVAVFPWHSLVPFLAPSQ 21 T 2.6 DUF5988 pdbhh F Eukaryota T 4j2l 2 C,D C,D CIC_HUMAN Protein capicua homolog MFVWTNVEPRSVAVFPWHSLVPFLAPSQ 28 T 2.3 DUF2605 pdbhh F Eukaryota T 4j2x 2 B,D B,D FHL-1, KYOT, RBP-ASSOCIATED MOLECULE 14-1, RAM14-1, SKELETAL MUSCLE LIM-PROTEIN 1, SLIM, SLIM-1 SGLVKAPVWWPMKDNPGTTTASTAKNAP 28 T 10 QH-AmDH_gamma pdbhh F T 4j44 2 B B PEPTIDE (ALA-ILE-ALA-VAL) AIAV 4 T 310 DUF881 pdbhh F F 4j45 2 B B PEPTIDE (ALA-THR-ALA-ALA) ATAA 4 T 980 DUF5408 pdbhh F F 4j46 2 B B PEPTIDE (ALA-VAL-PRO-ILE) AVPI 4 T 110 UPF0715 pdbhh F F 4j47 2 B B PEPTIDE (SER-VAL-PRO-ILE) SVPI 4 T 110 USP7_C2 pdbhh F F 4j48 2 C B PEPTIDE (ALA-MET-ARG-VAL) AMRV 4 T 200 RbcX pdbhh F F 4j4q 2 B B GNAT1_BOVIN TRANSDUCIN ALPHA-1 CHAIN ILENLKDVGLF 11 T 2.1 Phage_holin_4_1 pdbhh F Eukaryota T 4j73 2 B B TMED9_BOVIN P25, P24 FAMILY PROTEIN ALPHA-2, P24ALPHA2 FEAKKLV 7 T 130 WSK pdbhh F Eukaryota T 4j77 1 A,D C,D DOLICHYL-DIPHOSPHOOLIGOSACCHARIDE--PROTEIN GLYCOSYLTRANSFERASE AKEKSD 6 T 410 LIX1 pdbhh F F 4j78 2 B B E7KC07_YEASA Emp47p KTKLL 5 T 180 AAA_16 pdbhh F F 4j7b 3 C,F C,F MA205_DROME 205 kDa microtubule-associated protein MGHHHHHHLDDLVAESPRKEFARINMDGIAVPDEREFDIEADMRPHELEQESDTFGAG 58 T 4 BNIP2 pdbhh F Eukaryota T 4j7f 2 B B TAF10_HUMAN STAF28, TRANSCRIPTION INITIATION FACTOR TFIID 30 KDA SUBUNIT, TAF(II)30, TAFII-30, TAFII30 XSKSKDRKYTL 11 T 0.0084 TFIID-31kDa unphh F Eukaryota T 4j7i 2 B B TAF10_HUMAN STAF28, TRANSCRIPTION INITIATION FACTOR TFIID 30 KDA SUBUNIT, TAF(II)30, TAFII-30, TAFII30 XSKSKDRKYTL 11 T 0.0084 TFIID-31kDa unphh F Eukaryota T 4j7o 1 A A SCA2_RICCN Putative surface cell antigen sca2 ASFKDLVSKTPAWEKHNSTQQQNIWKDLTPNEKIKKWQEAALVPSFTQAQNDLGIKYKETDLSSFLDNTRHKARQARAEILLYIERVKQQDFDTKKQAYINQGVVPTDIEAATNLGISYDPSKIDNNVEHDQKVRRAEKDKKAVIELYVSSINRGIKYKHYVDNDIIPEIQEVRTALNMNKDDAQSFVASIRTEIMENAKGQYIADSHIPTEKELKKKFGISRDDNRDGYIKSIRLKVMDKEKPQYIADSHIPTEKELEQKFGADKGEATNYIASIATQMMLDKKSYYIDNNIIPNADELMNEFKIGPVKATSYINQIRAGIEANQFLNNNDTTKPSTGRSQKKSGSKNDHWYMSNQSINNTGTSAR 367 T 0.11 GntR pdb F Bacteria T 4j81 2 C,D C,D INSI1_HUMAN INSIG-1 KPHSD 5 T 110 DUF951 pdbhh F Eukaryota F 4j82 2 C,D C,D INSI2_HUMAN INSULIN-INDUCED GENE 2 PROTEIN KSHQE 5 T 240 zf-C2H2_11 pdbhh F Eukaryota F 4j83 2 B B TAF10_HUMAN STAF28, TRANSCRIPTION INITIATION FACTOR TFIID 30 KDA SUBUNIT, TAF(II)30, TAFII-30, TAFII30 XSKSADRKYTL 11 T 0.0084 TFIID-31kDa unphh F Eukaryota T 4j84 2 C,D C,D SCYL1 ARKLD 5 T 140 Mto2_bdg pdbhh F F 4j86 2 C,D C,D OSTB_YEAST OLIGOSACCHARYL TRANSFERASE SUBUNIT WBP1, OLIGOSACCHARYL TRANSFERASE SUBUNIT BETA TFKKTN 6 T 79 Pox_I6 pdbhh F Eukaryota F 4j8b 2 B B Emp47p LKTKLL 6 T 16 PhrC_PhrF pdbhh F F 4j8g 2 C,D C,D membrane glycoprotein E3 gp19K AASFIDAKKMP 11 T 21 Adeno_GP19K pdbhh F T 4j8o 2 B B TAF10_HUMAN STAF28, TRANSCRIPTION INITIATION FACTOR TFIID 30 KDA SUBUNIT, TAF(II)30, TAFII-30, TAFII30 XSKSADRKYTL 11 T 0.0084 TFIID-31kDa unphh F Eukaryota T 4j8s 2 B B TTP_HUMAN TTP, G0/G1 SWITCH REGULATORY PROTEIN 24, GROWTH FACTOR-INDUCIBLE NUCLEAR PROTEIN NUP475, PROTEIN TIS11A, TIS11, ZINC FINGER PROTEIN 36 HOMOLOG, ZFP-36 APRRLPIFNRISVSE 15 T 6.2 Hormone_recep pdbhh F Eukaryota T 4j9c 2 B B P17 XAPTYSPPLPP 11 T 6.8 TAF8_C pdbhh F T 4j9d 2 B,D,F B,D,F 3BP1_HUMAN P0 XAPTYPPPLPP 11 T 1.1 HPS6 pdbhh F Eukaryota T 4j9e 2 B,D,F B,D,F P17 XAPTYSPPLPP 11 T 6.8 TAF8_C pdbhh F T 4j9f 2 B,D,F B,D,F 3BP1_HUMAN P0 XAPTYPPPLPP 11 T 1.1 HPS6 pdbhh F Eukaryota T 4j9g 2 B,D,F B,D,F P7 XAPTYPPPPPP 11 T 0.96 HPS6 pdbhh F F 4j9h 2 G,H,I,J,K,L G,H,I,J,K,L P7 XAPTYPPPPPP 11 T 0.96 HPS6 pdbhh F F 4j9i 2 B,D,F B,D,F P17 XAPTYSPPLPP 11 T 6.8 TAF8_C pdbhh F T 4jaa 2 B S CONSENSUS ANKYRIN REPEAT DOMAIN-(d)LEU HLEVVKLLLEHGADVXAQDK 20 T 0.00016 Ank pdb F T 4jbn 2 B C SAT derived tetrapeptide SPSI 4 T 140 Mur_ligase pdbhh F F 4jdh 2 B B Paktide T GGRRRRRTWYFGGGK 15 T 1.1 Microvir_J pdbhh F T 4jdi 2 B B Paktide S RRRRSWY 7 T 2.9 DUF6264 pdbhh F F 4jdj 2 B B Paktide T GGRRRRRTWYFGGGK 15 T 1.1 Microvir_J pdbhh F T 4jdk 2 B B Paktide S RRRRSWY 7 T 2.9 DUF6264 pdbhh F F 4jdt 1 A G Q0ED31_9HIV1 gp120 VWKDADTTLFCASDAKAHETECHNVWATHACVPTDPNPQEIHLENVTENFNMWKNNMVEQMQEDVISLWDQCLQPCVKLTGGSVIKQACPKISFDPIPIHYCTPAGYVILKCNDKNFNGTGPCKNVSSVQCTHGIKPVVSTQLLLNGSLAEEEIIIRSENLTNNAKTIIVHLNKSVEINCTRPSNGGSGSGGDIRKAYCEINGTKWNKVLKQVTEKLKEHFNNKTIIFQPPSGGDLEITMHHFNCRGEFFYCNTTQLFNNTCIGNETMKGCNGTITLPCKIKQIINMWQGTGQAMYAPPIDGKINCVSNITGILLTRDGGANNTSNETFRPGGGNIKDNWRSELYKYKVVQIEGSHHHHHH 361 T 1.7E-34 GP120 unp T Viruses T 4jdw 1 A A GATM_HUMAN TRANSAMIDINASE, AT38 MLRVRCLRGGSRGAEAVHYIGSRLGRTLTGWVQRTFQSTQAATASSRNSCAADDKATEPLPKDCPVSSYNEWDPLEEVIVGRAENACVPPFTIEVKANTYEKYWPFYQKQGGHYFPKDHLKKAVAEIEEMCNILKTEGVTVRRPDPIDWSLKYKTPDFESTGLYSAMPRDILIVVGNEIIEAPMAWRSRFFEYRAYRSIIKDYFHRGAKWTTAPKPTMADELYNQDYPIHSVEDRHKLAAQGKFVTTEFEPCFDAADFIRAGRDIFAQRSQVTNYLGIEWMRRHLAPDYRVHIISFKDPNPMHIDATFNIIGPGIVLSNPDRPCHQIDLFKKAGWTIITPPTPIIPDDHPLWMSSKWLSMNVLMLDEKRVMVDANEVPIQKMFEKLGITTIKVNIRNANSLGGGFHAWTCDVRRRGTLQSYLD 423 T 0.014 ADI pdbpercent F Eukaryota T 4je8 2 C,D D,E tripeptide Met-Ala-Ser MAS 3 T 280 zf-C2H2_4 pdbhh F F 4jfd 3 C C Melanoma peptide ELAAIGILTV 10 T 6 DUF3527 pdbhh F T 4jfe 3 C C Melanoma peptide L7A ELAGIGALTV 10 T 4.6 MLANA pdbhh F T 4jfo 3 C,F C,F E1A heteroclitic Melanoma peptide ALAGIGILTV 10 T 2.5 MLANA pdbhh F T 4jfp 3 C,F C,F G4A heteroclitic Melanoma peptide ELAAIGILTV 10 T 6 DUF3527 pdbhh F T 4jfq 3 C,F C,F L8A heteroclitic Melanoma peptide ELAGIGIATV 10 T 1.5 MLANA pdbhh F T 4jfx 3 E P Phosphopeptide GEKKGNYVVTXA 12 T 1.1 MFA1_2 pdbhh F T 4jfz 3 C P Phosphopeptide GEKKGNYVVTSH 12 T 0.78 MFA1_2 pdbhh F T 4jg0 3 C P Phosphopeptide GEKKGNYVVTSH 12 T 0.78 MFA1_2 pdbhh F T 4jg1 3 C P Phosphopeptide GEKKGNYVVTTH 12 T 0.64 MFA1_2 pdbhh F T 4jgj 2 C,D X,Y Unknown peptide XXXXXXXX 8 F F F 4jgl 1 A A hypothetical protein GGAKKNVQDAEGQAEAGGNAPSGYLMPAISANNFCGDFTTMTPDYGYLMPEKGLFLKMHDIRGAYGINIYTYVMDGDNIQCTPGHFVMIVPRGGDKLEITIKKSSMKNTPSFTFIPTPDCENSAYVATEKVAGKYYYLCGDAEARYKFEDLFEDERCAEFKNLVDNYGK 169 T 2.2 Surp pdbhh F T 4jhj 2 C,D C,D Q7ZU28_DANRE DEAD/H (Asp-Glu-Ala-Asp/His) box polypeptide 19 (DBP5 homolog, yeast) MATDSWAQAVDEQEAAAESISTLQISEKEEKP 32 T 0.028 FliN_N pdb F Eukaryota T 4jhk 2 B C Q7SXI8_DANRE Sb:cb157 protein PLGSMSRIKNWGDEVEEQEMRT 22 T 0.24 Meiosis_expr pdbhh F Eukaryota T 4jij 1 A,C P,Q fluorogenic peptidic substrate (8MC)PLG(PHI)(DNW)AR(NH2) XPLGXXARX 9 T 11 FYRN pdbhh F T 4jiz 2 B B phosphopeptide YHSVVRYA 8 T 2.9 BioT2 pdbhh F T 4jj7 2 B B Caspase inhibitor XXDXXX 6 T 2900 zf-met pdbhh F F 4jj8 2 C,D C,D Caspase Inhibitor XXDXXX 6 T 2900 zf-met pdbhh F F 4jje 2 B B Caspase inhibitor XXDXXX 6 T 2900 zf-met pdbhh F F 4jjm 2 C,D E,F cyclosporin A XXXXXXXXVXA 11 T 9.7 DUF6090 pdbhh F F 4jjq 2 B B UB2E1_HUMAN UBCH6, UBIQUITIN CARRIER PROTEIN E1, UBIQUITIN-PROTEIN LIGASE E1 DSRASTSSSS 10 T 0.0098 SR-25 unppssm F Eukaryota F 4jk5 2 B B bicyclic peptide UK18-D-Ser, uPA inhibitor ACSRYEVDCRGRXSACGX 18 T 7.1 Kp4 pdbhh F T 4jk6 2 B B bicyclic peptide UK18-D-Aba inhibitor of uPA ACSRYEVDCRGRXSACGX 18 T 2.6 Adhesin_E pdbhh F T 4jl0 2 C,D C,D Q840U9_PSEAI PopB TGVALTPPS 9 T 0.7 GluR_Homer-bdg pdbhh F Bacteria T 4jlq 2 B B NAB2_YEAST Nuclear polyadenylated RNA-binding protein NAB2 RFTQRGGGAVGKNRRGGRGGNRGGRNNNSTRFNPLAKA 38 T 0.0034 DDHD unp F Eukaryota T 4jlu 2 B B F175A_HUMAN COILED-COIL DOMAIN-CONTAINING PROTEIN 98, PROTEIN FAM175A GFGEYSRSPTF 11 T 0.48 PipA pdbhh F Eukaryota T 4jm1 1 A A A6LA98_PARD8 hypothetical protein GADKYIDTITGFSCEKAAVTDNGFLVIAIDADSDSGYDMLASQFLEEAKKEGVSGLKGVLIVDIKNAKFEQGAVVGKRIGKAYK 84 T 0.3 DUF6503 unphh F Bacteria T 4jmg 2 B B PTN11_HUMAN PROTEIN-TYROSINE PHOSPHATASE 1D, PTP-1D, PROTEIN-TYROSINE PHOSPHATASE 2C, PTP-2C, SH-PTP2, SHP-2, SHP2, SH-PTP3 DSARVXENVGLMQ 13 T 1.6 CSM2 pdbhh F Eukaryota T 4jmh 2 B B SHC1_HUMAN SHC-TRANSFORMING PROTEIN 3, SHC-TRANSFORMING PROTEIN A, SRC HOMOLOGY 2 DOMAIN-CONTAINING-TRANSFORMING PROTEIN C1, SH2 DOMAIN PROTEIN C1 PPDHQXXNDFPGK 13 T 4.8 Herpes_TK_C pdbhh F Eukaryota T 4jmy 3 E,F E,F substrate peptide DDIVPC 6 T 11 Herpes_UL49_1 pdbhh F F 4jna 2 C,D H,I Dimethyl FK228 XXXXV 5 T 1800 DUF6451 pdbhh F F 4jo6 2 E,F Y,Z SBP-Tag MDEKTTGWRGGHVVEGLAGELEQLRARLEHHPQGQREP 38 T 2.2 BLUF pdbhh F T 4joe 2 C,D C,D A-iCAL36 peptide ANSRAPTSII 10 T 17 PMBR pdbhh F T 4jof 2 C,D C,D L-iCAL36 peptide ANSRLPTSII 10 T 5 PMBR pdbhh F T 4jog 2 C,D C,D V-iCAL36 peptide ANSRVPTSII 10 T 5.2 DUF2570 pdbhh F T 4joh 2 C,D C,D H-iCAL36 peptide ANSRHPTSII 10 T 13 TssN pdbhh F T 4joj 2 C,D C,D F-iCAL36 peptide ANSRFPTSII 10 T 3.4 LemA pdbhh F T 4jok 2 C,D C,D Y-iCAL36 peptide ANSRYPTSII 10 T 11 C9orf72-like pdbhh F T 4jol 2 E,F,G,H E,F,G,H HTF4_HUMAN TCF-12, CLASS B BASIC HELIX-LOOP-HELIX PROTEIN 20, BHLHB20, DNA-BINDING PROTEIN HTF4, E-BOX-BINDING PROTEIN, TRANSCRIPTION FACTOR HTF-4 SPLQAKKVRKVPPGLPSSVYAPSPN 25 T 33 Mvb12 pdbhh F Eukaryota T 4jop 2 C,D C,D VE6_HPV16 Protein E6 TRRETQL 7 T 0.34 FpoO unphh T Viruses F 4jor 2 C,D C,D VE6_HPV18 Protein E6 RLQRRRETQV 10 T 0.19 Mu-like_Com unphh T Viruses T 4jqg 1 A,C P,Q fluorogenic peptidic substrate (8MC)PLG(PFF)(DNW)AR(NH2) XPLGXXARX 9 T 11 FYRN pdbhh F T 4jqi 4 D V V2R_HUMAN Vasopressin V2 receptor phosphopeptide ARGRTPPSLGPQDESCTTASSSLAKDTSS 29 T 21 DUF6352 pdbhh F Eukaryota T 4jqv 3 C B BZLF1_EBVG EB1, ZEBRA SELEIKRY 8 T 0.0044 bZIP_2 unppercent T Viruses T 4jqx 3 C B BZLF1_EBVG EB1, ZEBRA EECDSELEIKRY 12 T 0.0044 bZIP_2 unppercent T Viruses T 4jr0 2 C,D C,D Ac-DEVD-CMK XDEVXX 6 T 200 ResIII pdbhh F F 4jr1 2 C,D C,D Ac-DEVD-CMK XDEVXX 6 T 200 ResIII pdbhh F F 4jr2 2 C,D C,D Ac-DEVD-CMK XDEVDX 6 T 200 ResIII pdbhh F F 4jrx 3 C C BZLF1_EBVG LPEP PEPTIDE, EB1, ZEBRA LPEPLPQGQLTAY 13 T 14 AP-5_subunit_s1 pdbhh T Viruses T 4jry 3 C C BZLF1_EBVG LPEP PEPTIDE, EB1, ZEBRA LPEPLPQGQLTAY 13 T 14 AP-5_subunit_s1 pdbhh T Viruses T 4js0 2 B B BAIP2_HUMAN BAI-ASSOCIATED PROTEIN 2, BAI1-ASSOCIATED PROTEIN 2, PROTEIN BAP2, FAS LIGAND-ASSOCIATED FACTOR 3, FLAF3, INSULIN RECEPTOR SUBSTRATE P53/P58, IRS-58, IRSP53/58, INSULIN RECEPTOR SUBSTRATE PROTEIN OF 53 KDA, IRSP53, INSULIN RECEPTOR SUBSTRATE P53 ASKSNLVISDPIPGAKPLPVPPELAPFVGRMS 32 T 2.9 DUF6248 pdbhh F Eukaryota T 4jsq 15 CA,DA c,d TMC-95A mimic ligand yCP:4e XXXXXAXX 8 T 24 Fis1_TPR_N pdbhh F F 4jsu 15 CA,DA,EA,FA c,d,e,f TMC-95A mimic ligand yCP:3a XXXXXAXX 8 T 24 Fis1_TPR_N pdbhh F F 4jt0 15 CA c TMC-95A mimic ligand yCP:4a fragment P XAXX 4 T 78 DUF6446 pdbhh F F 4jt0 16 DA d TMC-95A mimic ligand yCP:4a fragment Q XXXXAXX 7 T 18 Fis1_TPR_N pdbhh F F 4jtm 1 A,B A,B E3PJ86_ECOH1 Type II secretion system protein D GAMATFTANFKDTDLKSFIETVGANLNKTIIMGPGVQGKVSIRTMTPLNERQYYQLFLNLLEAQGYAVVPMENDVLKVVKS 81 T 0.14 Corona_NS2A pdbpercent F Bacteria T 4jwc 2 C,D C,D CTHL3_BOVIN BACTENECIN-7, BAC7, PR-59 RRIRPRPPRLPRPRPR 16 T 0.027 TonB_N unppercent F Eukaryota F 4jwd 2 C,D C,D CTHL3_BOVIN BACTENECIN-7, BAC7, PR-59 PRPLPFPRPGPRPI 14 T 0.027 TonB_N unppercent F Eukaryota T 4jwe 2 C,D C,D CTHL3_SHEEP BACTENECIN-7, BAC7, PR-59 RRLRPRRPRLPRPRPRPRPRP 21 T 0.025 Trypan_PARP unp F Eukaryota F 4jwi 2 C,D C,D CTHL3_SHEEP BACTENECIN-7, BAC7, PR-59 PRPILLPWRX 10 T 0.025 Trypan_PARP unp F Eukaryota T 4jx7 2 B B PIM1 consensus peptide ARKRRRHPSGPPTA 14 T 1.5 DUF3019 pdbhh F T 4jxt 2 B B RPB1_HUMAN RNA POLYMERASE II SUBUNIT B1, DNA-DIRECTED RNA POLYMERASE II SUBUNIT A, DNA-DIRECTED RNA POLYMERASE III LARGEST SUBUNIT, RNA-DIRECTED RNA POLYMERASE II SUBUNIT RPB1 XSPSYSPTSPSYSPTSPSYSX 21 T 3.1E-05 RNA_pol_Rpb1_R pdbhh F Eukaryota F 4k0u 2 B B GSPD2_DICD3 T2SS PROTEIN D, GENERAL SECRETION PATHWAY PROTEIN D, PECTIC ENZYMES SECRETION PROTEIN OUTD RTFRQVQSSISDFYD 15 T 0.96 DUF643 pdbhh F Bacteria T 4k1e 2 B B SFTI1_HELAN SFTI-1 GFCQRSIPPICFPD 14 T 0.052 Bowman-Birk_leg unp F Eukaryota T 4k38 2 C,D C,D Kp18Cys peptide YYTSPMCAPARSMLLTGN 18 T 0.0024 Alk_phosphatase pdbhh F T 4k39 2 C,D D,C Cp18Cys peptide YTAVPSCIPSRASILTGM 18 T 0.00017 Alk_phosphatase pdbhh F T 4k3m 2 C E ALDLF ALDLF 5 T 41 DUF346 pdbhh F F 4k3o 2 C E (ACE)QADLF XQADLF 6 T 81 Zn_peptidase pdbhh F T 4k3p 2 C E (ACE)QLALF XQLALF 6 T 97 Glyco_hydro_97 pdbhh F F 4k3q 2 C E (ACE)QLDAF XQLDAF 6 T 61 DUF565 pdbhh F T 4k3r 2 C E (ACE)QLDLA XQLDLA 6 T 200 CaM_bdg_C0 pdbhh F F 4k45 2 B B PLCG1_RAT PHOSPHOINOSITIDE PHOSPHOLIPASE C-GAMMA-1, PHOSPHOLIPASE C-GAMMA-1, PLC-GAMMA-1 DYGALYEGRNPGFXVEAN 18 T 37 DUF4207 pdbhh F Eukaryota T 4k6y 2 C,D C,D iCAL36-Q peptide ANSRWQTSII 10 T 0.093 ENOD40 pdbhh F T 4k72 2 C,D C,D iCAL36-VQD peptide ANSRVQDSII 10 T 4.2 DUF5608 pdbhh F T 4k75 2 B B iCAL36-QDTRL peptide ANSRWQDTRL 10 T 5.9 Glyco_transf_8C pdbhh F T 4k76 2 E,F,G,H E,F,G,H iCAL36-TRL peptide ANSRWPTTRL 10 T 9.3 CBP_BcsR pdbhh F T 4k78 2 B B iCAL36-QDTRL peptide ANSRWQDTRL 10 T 5.9 Glyco_transf_8C pdbhh F T 4k7h 1 A,B,C,D,E A,B,C,D,E P1_BPPH6 Major inner protein P1 GFNLKVKDLNGSARGLTQAFAIGELKNQLSVGALQLPLQFTRTFSASMTSELLWEVGKGNIDPVMYARLFFQYAQAGGALSVDELVNQFTEYHQSTACNPEIWRKLTAYITGSSNRAIKADAVGKVPPTAILEQLRTLAPSEHELFHHITTDFVCHVLSPLGFILPDAAYVYRVGRTATYPNFYALVDCVRASDLRRMLTALSSVDSKMLQATFKAKGALAPALISQHLANAATTAFERSRGNFDANAVVSSVLTILGRLWSPSTPKELDPSARLRNTNGIDQLRSNLALFIAYQDMVKQRGRAEVIFSDEELSSTIIPWFIEAMSEVSPFKLRPINETTSYIGQTSAIDHMGQPSHVVVYEDWQFAKEITAFTPVKLANNSNQRFLDVEPGISDRMSATLAPIGNTFAVSAFVKNRTAVYEAVSQRGTVNSNGAEMTLGFPSVVERDYALDRDPMVAIAALRTGIVDESLEARASNDLKRSMFNYYAAVMHYAVAHNPEVVVSEHQGVAAEQGSLYLVWNVRTELRIPVGYNAIEGGSIRTPEPLEAIAYNKPIQPSEVLQAKVLDLANHTTSIHIWPWHEASTEFAYEDAYSVTIRNKRYTAEVKEFELLGLGQRRERVRILKPTVAHAIIQMWYSWFVEDDRTLAAARRTSRDDAEKLAIDGRRMQNAVTLLRKIEMIGTTGIGASAVHLAQSRIVDQMAGRGLIDDSSDLHVGINRHRIRIWAGLAVLQMMGLLSRSEAEALTKVLGDSNALGMVVATTDIDPSLHHHHHH 775 T 0.22 STAG pdb T Viruses T 4k7t 1 A A bacitracin A2 ICLXIKXIXHXN 12 T 1.2 DUF4092 pdbhh F T 4k8y 2 B B SFTI1_HELAN SFTI-1 GRCTKSIPPICFPD 14 T 0.0029 Bowman-Birk_leg pdb F Eukaryota T 4ka3 2 B B TAB1_HUMAN TGF-beta-activated kinase 1 and MAP3K7-binding protein 1 SSAQSTSKTSVTLSLVMPSQGLEHHHHHH 29 T 7.6 SCIFF pdbhh F Eukaryota T 4ka7 2 B C short endogenous peptide substrate AAAA 4 T 900 Cyclin_C pdbhh F F 4kbb 2 C,D C,D SYT2_MOUSE SYNAPTOTAGMIN II, SYTII EGWTENQEPNVAPATTTATMPLAPVAPADNSTESTGPGESQEDMFAKLKEKFFNEINKIVLEHHHHHH 68 T 0.0051 PRIMA1 unphh F Eukaryota T 4kdi 2 C,D C,D OTU1_YEAST OTU DOMAIN-CONTAINING PROTEIN 1 GSHMASMTGGQQMGRGSMKLKVTGAGINQVVTLKQDATLNDLIEHINVDVKTMRFGYPPQRINLQGEDASLGQTQLDELGINSGEKITIE 90 T 0.00015 UBX pdbhh F Eukaryota T 4kdl 2 B B OTU1_YEAST OTU DOMAIN-CONTAINING PROTEIN 1 GSHMASMTGGQQMGRGSMKLKVTGAGINQVVTLKQDATLNDLIEHINVDVKTMRFGYPPQRINLQGEDASLGQTQLDELGINSGEKITIE 90 T 0.00015 UBX pdbhh F Eukaryota T 4ke2 1 A,B,C A,B,C ANPM_PSEAM Type I hyperactive antifreeze protein MNIDPAARAAAAAAASKAAVTAADAAAAAATIAASAASVAAATAADDAAASIATINAASAAAKSIAAAAAMAAKDTAAAAASAAAAAVASAAKALETINVKAAYAAATTANTAAAAAAATATTAAAAAAAKATIDNAAAAKAAAVATAVSDAAATAATAAAVAAATLEAAAAKAAATAVSAAAAAAAAAIAFAAAP 196 T 72 NADH_dh_m_C1 unphh F Eukaryota T 4kel 2 B B SFTI1_HELAN SFTI-1 GFCQRSIPPICFPN 14 T 0.051 Bowman-Birk_leg pdb F Eukaryota T 4kkp 1 A,B A,B RbmA protein EVDCELQPVIEANLSLNQNQLASNGGYISSQLGIRNESCETVKFKYWLSIKGPEGIYFPAKAVVGVDTAQQESDALTDGRMLNVTRGFWVPEYMADGKYTVSLQVVAENGKVFKANQEFVKGVDLNSLPELNGLTIDIKNQFGINSVESTGGFVPFTVDLNNGREGEANVEFWMTAVGPDGLIIPVNAREKWVIASGDTYSKVRGINFDKSYPAGEYTINAQVVDIVSGERVEQSMTVVKK 241 T 0.014 BsuPI pdbhh F T 4kkq 1 A,B A,B RbmA protein EVDCELQPVIEANLSLNQNQLASNGGYISSQLGIRNESCETVKFKYWLSIKGPEGIYFPAKAVVGVDTAQQESDALTDGRMLNVTRGFWVPEYMADGKYTVSLQVVAENGKVFKANQEFVKGVDLNSLPELNGLTIDIKNQFGINSVESTGGFVPFTVDLNNGREGEANVEFWMTAVGPDGLIIPVNAREKWVIASGDTYSKVRGINFDKSYPAGEYTINAQVVDIVSGERVEQSMTVVKK 241 T 0.014 BsuPI pdbhh F T 4kkr 1 A,B A,B RbmA protein EVDCELQPVIEANLSLNQNQLASNGGYISSQLGIRNESCETVKFKYWLSIKGPEGIYFPAKAVVGVDTAQQESDALTDGRMLNVTRGFWVPEYMADGKYTVSLQVVAENGKVFKANQEFVKGVDLNSLPELNGLTIDIKNQFGINSVESTGGFVPFTVDLNNGREGEANVEFWMTAVGPDGLIIPVNAREKWVIASGDTYSKVRGINFDKSYPAGEYTINAQVVDIVSGERVEQSMTVVKK 241 T 0.014 BsuPI pdbhh F T 4kmd 2 B B GLI1_HUMAN GLIOMA-ASSOCIATED ONCOGENE, ONCOGENE GLI SRCTSPGGSYGHLSIGT 17 T 1.6 HMMR_N pdbhh F Eukaryota T 4knl 2 E,F F,G Muramyl tetrapeptide AXXXX 5 T 1100 Coronavirus_5 pdbhh F F 4kp3 3 C,F E,F MELPH_MOUSE EXOPHILIN-3, LEADEN PROTEIN, SLP HOMOLOG LACKING C2 DOMAINS A, SLAC2-A, SYNAPTOTAGMIN-LIKE PROTEIN 2A GPGSDLDTEARDQPLNSKKKKRLLSFRDVDFEEDSDHLVQPCS 43 T 5.3 GLTSCR1 pdbhh F Eukaryota T 4ks6 2 B B Peptide inhibitor MPT-DPP-DAR-G-DPN-NH2 XXXGXX 6 T 92 DUF6078 pdbhh F F 4ksn 1 A,B,C,D A,B,C,D Q5ZSX5_LEGPH SdbC GNSDGQLDTHLADLYLLKYDTGLGVYESFICKYLEDSNDYIASHPQKLSLDEMPRPLESETVSLRQLIVSVLPSRPSI 78 T 1.2 DUF3213 pdbhh F Bacteria T 4kt3 2 B B Q4KC91_PSEF5 Putative lipoprotein GSHMATDSLQPARIKDSGLTREQAEQVLRVALKHQDYQLQRPGVFIDGDLQDENGKPPHPGYYDFSLGYNDPKAGATEYWGLFSVSLNTGDTWEINSCKRLDGAELRALQRRVMARTGKSLADEKSQREGLGCEDQQ 137 T 5.2 DUF4969 unphh F Bacteria T 4ktx 2 B B Peptide inhibitor MPT-DPP-ARG-G-LEU-NH2 XXRGLX 6 T 400 ATP_Ca_trans_C pdbhh F F 4kty 2 C,D C,D Peptide-like ligand GXXXXLPWP 9 T 44 DUF6525 pdbhh F F 4kv1 2 B,D C,D TF65_HUMAN Rel peptide TFXSIMK 7 T 7.6 Adenylate_cycl pdbhh F Eukaryota T 4kv4 2 B B TF65_HUMAN Rel Peptide TFXSIMK 7 T 7.6 Adenylate_cycl pdbhh F Eukaryota T 4kvm 3 I,J,K,L I,J,K,L bisubstrate analog inhibitor SASE 4 T 470 EF-hand_5 pdbhh F F 4kvt 1 A,B,C,D,E,F A,B,C,D,E,F 6-helix coiled coil CC-Hex-L24C peptide XGELKAIAQELKAIAKELKAIAWECKAIAQGAG 33 T 0.92 Rho_N pdb F T 4kx8 2 B C amastatin XVVD 4 T 400 Fer4 pdbhh F F 4kxq 2 B B SIR1_HUMAN SIRT1, HSIRT1, REGULATORY PROTEIN SIR2 HOMOLOG 1, SIR2-LIKE PROTEIN 1, HSIR2 GPHMGSQYLFLPPNRYIFHGAEVYSDSEDV 30 T 4.1 Rxt3 pdbhh F Eukaryota T 4l0k 1 A,B,C,D A,B,C,D A0A067XG67_9DEIO DraIII MELCHKTVKSRTAYSKHFPHKCQLPLGHSGKCLEFPFLVSLSKTHPRIAAKIVRDATMTTGAAWKSSQAGPNRMPRYVAILDDDILLEKFNLDMQSLPEITRLKIREKAADYDSCIDVARKLTWLAYQLHGAPIPDSFTKNYLEEFFGPMVAGSTNCEICKLPLTIDLFSENRVGKAAVETAHKTPRLHNAENVGFAHRFCNVAQGNKSLDEFYLWMEEVLTRVKML 227 T 2.9E-05 RE_BstXI pdbhh F Bacteria T 4l1u 2 G,H,I,J G,H,I,J SPT5H_HUMAN HSPT5, DRB SENSITIVITY-INDUCING FACTOR 160 KDA SUBUNIT, DSIF P160, DRB SENSITIVITY-INDUCING FACTOR LARGE SUBUNIT, DSIF LARGE SUBUNIT, TAT-COTRANSACTIVATOR 1 PROTEIN, TAT-CT1 PROTEIN YGSGSRTPMYGSQ 13 T 0.063 CTD unphh F Eukaryota T 4l29 3 AA,C,DA,F,GA,I,JA,L,MA,O,PA,R,U,X p,m,o,i,c,k,g,f,q,l,j,h,e,n NY-ESO1 DOUBLE MUTANT (1Y, 9V) YLLMWITQV 9 T 5.7 Thyroglob_assoc pdbhh F T 4l3c 3 AA,C,DA,F,GA,I,JA,L,MA,O,PA,R,U,X p,m,o,i,c,k,g,f,q,l,j,h,e,n NY-ESO1 double mutant (1Y, 9V) YLLMWITQV 9 T 5.7 Thyroglob_assoc pdbhh F T 4l3o 2 E,F,G,H E,F,G,H cyclic peptide S2iL5 GYHTYHVXRRTNYYCX 16 T 11 UCH_C pdbhh F T 4l5n 2 C,D,E,F C,D,E,F P56_BPPZA Early protein GP1B VQNDFLDSYDVTMLLQDDNGKQYYEYHKGLSLSDFEVLYGNTVDEIIKLRVDKIS 55 T 9 DUF2603 pdbhh T Viruses T 4l8b 3 C C NP-N5H peptide ASNEHMETM 9 T 21 YgaB pdbhh F T 4l8c 3 I,J,K,L I,J,K,L NP-N3D peptide ASDENMETM 9 T 22 YpmT pdbhh F T 4l8d 3 E,F E,F NP-N5D peptide ASNEDMETM 9 T 6.4 Lsm_interact pdbhh F T 4l9p 3 C C LYS-CYS-VAL-VAL-MET (CAAX peptide) KCVVM 5 T 65 DUF508 pdbhh F F 4lcd 2 C,D C,D SNA3_YEAST Protein SNA3 AQPPAYDEDDEAGADVPLMDNAQQ 24 T 14 TMEM252 unphh F Eukaryota T 4leb 2 B B hepta-threonine TTTTTTT 7 T 480 Strep_pep pdbhh F F 4lfd 2 B,D,F,H E,F,G,H (CBZ)NPQ(B27) PEPTIDE XNPQX 5 T 310 DUF2709 pdbhh F F 4lg6 2 B B CCDC8_HUMAN Coiled-coil domain-containing protein 8 RAFWHTPRLPTLPKRVP 17 T 6.1 RGS_DHEX pdbhh F Eukaryota T 4li3 2 B A CYSE_SALTY Serine acetyltransferase TFEYGDGI 8 T 2.1 Cyanate_lyase pdbhh F Bacteria T 4lkd 2 I,J,K,L,M,N,O,P I,J,K,L,M,N,O,P peptide QRSA QRSA 4 T 240 SH pdbhh F F 4lke 2 E,F,G,H E,H,F,G peptide WRIA WRIA 4 T 29 eIF2_C pdbhh F F 4lkf 2 C,D C,D peptide WKYL WKYL 4 T 22 DUF4181 pdbhh F F 4lkl 2 B B PL-55 XPLHSTMX 8 T 140 Aminopep pdbhh F T 4lkm 2 B,D B,D PL-74 XXPLHSTMX 9 T 170 Aminopep pdbhh F T 4lkx 3 C R CemX segment LAGGSAQSQRAPDR 14 T 1.3 DUF6032 pdbhh F T 4loo 2 B B TAB1_MOUSE MITOGEN-ACTIVATED PROTEIN KINASE KINASE KINASE 7-INTERACTING PROTEIN 1, TGF-BETA-ACTIVATED KINASE 1-BINDING PROTEIN 1, TAK1-BINDING PROTEIN 1 RVYPVSVPYSSAQSTSKTSVTLSLVMPSQ 29 T 6 DUF2584 pdbhh F Eukaryota T 4lop 2 E,F,G,H K,L,M,N TAB1_MOUSE MITOGEN-ACTIVATED PROTEIN KINASE KINASE KINASE 7-INTERACTING PROTEIN 1, TGF-BETA-ACTIVATED KINASE 1-BINDING PROTEIN 1, TAK1-BINDING PROTEIN 1 RVYPVSVPYSSAQSTSKTSVTLSLVMPSQ 29 T 6 DUF2584 pdbhh F Eukaryota T 4loq 2 E,F,G,H M,L,K,N TAB1_MOUSE MITOGEN-ACTIVATED PROTEIN KINASE KINASE KINASE 7-INTERACTING PROTEIN 1, TGF-BETA-ACTIVATED KINASE 1-BINDING PROTEIN 1, TAK1-BINDING PROTEIN 1 RVYPVSVPYSSAQSTSKTSVTLSLVMPSQ 29 T 6 DUF2584 pdbhh F Eukaryota T 4lp9 2 B I Ser-Leu-Phe-His-Phenylalanyl-reduced-peptide-bond-Tyrosyl-Thr-Pro SLFHXTP 7 T 12 FCP1_C pdbhh F T 4lr4 1 A,B,C,D A,B,C,D C4ZEB7_AGARV hypothetical protein GASIDNGNKVHFNTEDNDTDLTLLQSKIATEEVTCDFTDATNDGASAYADTRRVSNKYMWSASTMEYNFSDQKWTSNTEIFSTYAKTSEGFVMSGFLLNPKGQSNYNSALREGYLNDSAYDENQGHYYQCVVSDEDCNNITFMLESNVNVFIFDNDINLIYRSSDEAGVTSYFDRYYSTTKTIAGTSNKVISLGLIDGNYYIVFKVKDATATTGYHYGYYAGQPLPIAQTTTFSDLTHYTTIKWNRSSSSQSASTQTLTINCPSGSEDEYALTGVKFSDKSKAFANNTYASSIDYYYTPATASYSKKLAQTGGWWSDLVDNNPPSGSIDGNYATSVTVHWVSGISYVNASCTTMTQMTLDYLVPFGIIVG 370 T 0.15 DUF1684 unp F Bacteria T 4lrs 3 C N Symmetric aldolase, C-terminal disordered residues AAA 3 T 1200 RNase_HII pdbhh F F 4lsj 2 B B D30 peptide HSSRLWELLMEAT 13 T 1.6 CemA pdbhh F T 4lte 2 C,D M,N Macrocyclic Inhibitor QFXXX 5 T 390 zf-CCHC pdbhh F F 4luq 1 A,B A,B Q9HYC5_PSEAE VIRULENCE EFFECTOR TSE3 MGSSHHHHHHSSGLVPRGSHMTATSDLIESLISYSWDDWQVTRQEARRVIAAIRNDNVPDATIAALDKSGSLIKLFQRVGPPELARSLIASIAGRTTMQRYQARNALIRSLINNPLGTQTDNWIYFPTITFFDICADLADAAGRLGFAAAGATGVASQAIQGPFSGVGATGVNPTDLPSIAFGDQLKLLNKDPATVTKYSNPLGDLGAYLSQLSPQDKLNQAQTLVGQPISTLFPDAYPGNPPSRAKVMSAAARKYDLTPQLIGAIILAEQRDQTRDEDAKDYQAAVSIKSANTSIGLGQVVVSTAIKYELFTDLLGQPVRRGLSRKAVATLLASDEFNIFATARYIRYVANLASQQDLRKLPKTRGAFPSIDLRAYAGNPRNWPRDNVRALASEYTSRPWDDNLSPGWPMFVDDAYATFLDPGMRFP 428 T 0.0021 DUF1402 unphh F Bacteria T 4luq 2 C,D C,D Q9HYC4_PSEAE ANTITOXIN TSI3 MGSSHHHHHHSSGLVPRGSHMDFDKTLTHPNGLVVERPVGFDARRSAEGFRFDEGGKLRNPRQLEVQRQDAPPPPDLASRRLGDGEARYKVEEDDGGSAGSEYRLWAAKPAGARWIVVSASEQSEDGEPTFALAWALLERARLQ 144 T 0.0052 PsbP_2 unphh F Bacteria T 4lx2 2 B B MELPH_MOUSE EXOPHILIN-3, LEADEN PROTEIN, SLP HOMOLOG LACKING C2 DOMAINS A, SLAC2-A, SYNAPTOTAGMIN-LIKE PROTEIN 2A RDQPLNSKKKKRLLSFRDVDFEEDSD 26 T 2.9 Phage_Treg pdbhh F Eukaryota T 4m1x 1 A,B,C,D A,B,C,D B3FK35_9CAUD uncharacterized protein 201phi2-1p060 GSHMASQDNDDIFGNDSPEVPIFRKNLEKFKFSKGDGIKFSNTTFHIYEATRNYVTIHILKKYATAELMEFMHTRHDAVYIGPILEWTDGVHLTFRRKS 99 T 16 SelB-wing_3 unphh T Viruses T 4m1z 2 C,D C,D MYCP1_MYCS2 PEPTIDASE S8 AND S53, SUBTILISIN, KEXIN, SEDOLISIN RVKEVPPPVYIPPPDRGPIT 20 T 0.33 JCAD pdbhh F Bacteria T 4m38 2 C,D E,F H4_HUMAN Histone H4 SGRGKGGKGLGKGGAKRHRKV 21 T 11 Shadoo unppercent F Eukaryota T 4m5e 1 A A Q9HYC5_PSEAE Uncharacterized protein MTATSDLIESLISYSWDDWQVTRQEARRVIAAIRNDNVPDATIAALDKSGSLIKLFQRVGPPELARSLIASIAGRTTMQRYQARNALIRSLINNPLGTQTDNWIYFPTITFFDICADLADAAGRLGFAAAGATGVASQAIQGPFSGVGATGVNPTDLPSIAFGDQLKLLNKDPATVTKYSNPLGDLGAYLSQLSPQDKLNQAQTLVGQPISTLFPDAYPGNPPSRAKVMSAAARKYDLTPQLIGAIILAEQRDQTRDEDAKDYQAAVSIKSANTSIGLGQVVVSTAIKYELFTDLLGQPVRRGLSRKAVATLLASDEFNIFATARYIRYVANLASQQDLRKLPKTRGAFPSIDLRAYAGNPRNWPRDNVRALASEYTSRPWDDNLSPGWPMFVDDAYATFLDLEHHHHHH 410 T 0.0021 DUF1402 unphh F Bacteria T 4m5f 1 A A Q9HYC5_PSEAE Uncharacterized protein MTATSDLIESLISYSWDDWQVTRQEARRVIAAIRNDNVPDATIAALDKSGSLIKLFQRVGPPELARSLIASIAGRTTMQRYQARNALIRSLINNPLGTQTDNWIYFPTITFFDICADLADAAGRLGFAAAGATGVASQAIQGPFSGVGATGVNPTDLPSIAFGDQLKLLNKDPATVTKYSNPLGDLGAYLSQLSPQDKLNQAQTLVGQPISTLFPDAYPGNPPSRAKVMSAAARKYDLTPQLIGAIILAEQRDQTRDEDAKDYQAAVSIKSANTSIGLGQVVVSTAIKYELFTDLLGQPVRRGLSRKAVATLLASDEFNIFATARYIRYVANLASQQDLRKLPKTRGAFPSIDLRAYAGNPRNWPRDNVRALASEYTSRPWDDNLSPGWPMFVDDAYATF 400 T 0.0021 DUF1402 unphh F Bacteria T 4m5f 2 B B Q9HYC4_PSEAE Uncharacterized protein SHGVDFDKTLTHPNGLVVERPVGFDARRSAEGFRFDEGGKLRNPRQLEVQRQDAPPPPDLASRRLGDGEARYKVEEDDGGSAGSEYRLWAAKPAGARWIVVSASEQSEDGEPTFALAWALLERARLQ 127 T 0.0052 PsbP_2 unphh F Bacteria T 4m63 1 A,B A,B T3SS2 effector VopL nucleation of actin polymerization GHMRLLSEDLFKQSPKLSEQELDELANNLADYLFQAADIDWHQVISEKTRGLTTEEMAKSEHRYVQAFCREILKYPDCYKSADVASPESPKSGGGSVIDVALKRLQTGRERLFTTTDEKGNRELKKGDAILESAINAARMAISTEEKNTILSNNVKSATFDVFCELPCMDGFAEQNGKTAFYALRAGFYSAFKNTDTAKQDITKFMKDNLQAGFSGYSYQGLTNRVAQLEAQLAALSAKLS 241 T 0.0017 ABC_tran_CTD pdbpssm F T 4m6b 2 B,D C,F SWR1_YEAST Helicase SWR1 GSHMDRESDDKTPSVGLSALFGKGEESDGDLDLDDSEDFTVNSSSVEGEELEKDW 55 T 14 DUF5945 pdbhh F Eukaryota T 4m6e 1 A A tyrocidine A XPFXNQYVXL 10 T 1.2 Inhibitor_I10 pdbhh F T 4m7c 2 C,D C,D SLX4_HUMAN BTB/POZ DOMAIN-CONTAINING PROTEIN 12 SRGLEVSHRLAPW 13 T 5.5 DUF5673 pdbhh F Eukaryota T 4m91 2 B B CRBN_HUMAN Protein cereblon KRKFHCANLTSW 12 T 27 Chordopox_A33R pdbhh F Eukaryota T 4m9s 2 B,D,F,H E,F,G,H CED-3 fragment PMFNFMGC 8 T 0.78 NADH_u_ox_C pdbhh F T 4m9x 2 B,D C,D CED-3 fragment PLFNFLCG 8 T 3.2 TPR_3 pdbhh F T 4m9y 2 B,D C,D CED-3 fragment PLFNFMGC 8 T 2.2 NADH_u_ox_C pdbhh F T 4m9z 2 B,D,F,H E,F,G,H CED-3 fragment PMFNFLGC 8 T 0.74 NADH_u_ox_C pdbhh F T 4mbe 4 G,H,I,J H,G,X,Y NUP1_YEAST NUCLEAR PORE PROTEIN NUP1 LKKNIEPKKDKESIVLPTVGFDFIK 25 T 0.12 DUF4519 pdbhh F Eukaryota T 4mdd 2 C,D C,D NCOR1_HUMAN N-COR, N-COR1 NLGLEDIIRKALMGS 15 T 3.8 baeRF_family3 pdbhh F Eukaryota T 4mex 6 M,N M,N Salinamide A XTXXXXSGX 9 T 170 Gemini_AC4_5_2 pdbhh F F 4mfl 2 B B Teicoplanin pseudoaglycone XXXXXXX 7 T 1.5 Defensin_2 pdbhh F F 4mfp 2 B B Teicoplanin pseudoaglycone XXXXXXX 7 T 1.5 Defensin_2 pdbhh F F 4mfq 2 B B Teicoplanin pseudoaglycone XXXXXXX 7 T 1.5 Defensin_2 pdbhh F F 4mgp 1 A A MAGA_XENLA Magainin 2 Derivative GIGKFLHAAKKFAKAFVAEIMNS 23 T 1.3 TAFII28 pdbhh F Eukaryota T 4mgx 2 B B GP1BA_HUMAN GP-IB ALPHA, GPIB-ALPHA, GPIBA, GLYCOPROTEIN IBALPHA, ANTIGEN CD42B-ALPHA, GLYCOCALICIN PTFRSSLFL 9 T 1.6 TALPID3 unphh F Eukaryota T 4mi7 1 A A H9L447_SALTY Bacteriophage encoded virulence factor GPVDEGIDIIECENLLKEMNVQKIPESSLFTNIKEALQAEVFNSTVEDDFESFISYELQNHGPLMLIRPSLGSECLHAECIVGYDSEVKKVLIYDSMNTSPEWQSNIDVYDKLTLAFNDKYKNEDCSICGLYYDGVYEPK 140 T 0.021 Peptidase_C70 unppssm F Bacteria T 4mjt 2 J,K,L,M,N,O,P,Q,R J,K,L,M,N,O,P,Q,R MONAL_PSEE4 Monalysin QPQSHSIELDEVSKEAASTRAALTSNL 27 T 17 DUF3618 pdbhh F Bacteria T 4mkq 2 C,D C,D MONAL_PSEE4 Monalysin QPQSHSIELDEVSKEAASTRAALTSNL 27 T 17 DUF3618 pdbhh F Bacteria T 4mli 2 C,D B,D SpyTag AHIVMVDAYKPTK 13 T 4.7 NAMPT_N pdbhh F T 4mls 2 B B SpyTag AHIVMVDAYKPTK 13 T 4.7 NAMPT_N pdbhh F T 4mn3 2 B B peptide XFAYKSX 7 T 12 ITAM_Cys-rich pdbhh F T 4mnv 2 B B acyl-enzyme intermediate of bicyclic peptide UK729 TCRQSMCTAR 10 T 4.6 DUF5497 pdbhh F T 4mnv 3 C C acyl-enzyme intermediate of bicyclic peptide UK729 TCPX 4 T 23 zf-ACC pdbhh F F 4mnw 2 B B bicyclic peptide UK749 QCWDRGCENRKCNX 14 T 2.6 Pox_G9-A16 pdbhh F T 4mnx 2 B B bicyclic peptide UK811 LCSDRGCENRWCKX 14 T 0.81 LRRCT_2 pdbhh F T 4mny 2 C,D C,D bicyclic peptide UK903 GCQVNYCPPVPCLX 14 T 0.4 Antimicrobial23 pdbhh F T 4mod 1 A,B A,B SPIKE_MERS1 HR1 of S protein, LINKER, HR2 of S protein MENQKLIANKFNQALGAMQTGFTTTNEAFQKVQDAVNNNAQALSKLASELSNTFGAISASIGDILVPRGSGGSGGSGGLEVLFQGPLTQINTTLLDLTYEMLSLQQVVKALNESYIDLKELLEHHHHHH 129 T 1.7E-08 CoV_S2 pdbpssm T Viruses T 4moy 2 B B PP1RA_RAT MHC CLASS I REGION PROLINE-RICH PROTEIN CAT53, PHOSPHATASE 1 NUCLEAR TARGETING SUBUNIT, PROTEIN PNUTS GAMGRKRKTVTWPEEGKLREYFYFELDETERVNVNKIKDFGEAA 44 T 2.3 GAAD pdbhh F Eukaryota T 4mp0 2 C,D B,D PP1RA_RAT MHC CLASS I REGION PROLINE-RICH PROTEIN CAT53, PHOSPHATASE 1 NUCLEAR TARGETING SUBUNIT, PROTEIN PNUTS GAMGRKRKTVTWPEEGKLREYFYFELDETERVNVNKIKDFGEAA 44 T 2.3 GAAD pdbhh F Eukaryota T 4mq9 6 G I GE23077 XXXXXXX 7 T 900 DUF3927 pdbhh F F 4mqv 2 B,D B,D SMAL1_HUMAN HEPA-RELATED PROTEIN, HHARP, SUCROSE NONFERMENTING PROTEIN 2-LIKE 1 LTEEQRKKIEENRQKALARRAEKLLA 26 T 0.08 ETAA1 pdbhh F Eukaryota T 4ms8 4 D B pCPB9 SPAEAGFFL 9 T 0.047 DUF1148 pdbhh F T 4mtm 1 A A I2GUG0_9CAUD Putative tail fiber protein RSLIANNTVNPNNGLGGAWEVYSGQGSIPTATSTTAGITKVLNVLNSNDVGSALSAAQGKVLNDKFNFQNSKNQSGYVRLGDSGLIIQWGVFTSTKTQSNLIFPLAFPNALLSITGNLNSNTPDVIGIDFDLSTATKTSIKTGAAQVGASWLSGKKISWIAIGY 164 T 0.02 UPF0164 pdb T Viruses T 4mvb 4 D B pCPB7 QPAEGGFQL 9 T 7.6 Turandot pdbhh F T 4mxq 4 D B pCPC5 SPAPRPLDL 9 T 5.6 Rhabdo_M1 pdbhh F T 4myy 1 A,B A,B F4Y428_9CYAN;F4Y429_9CYAN POLYKETIDE SYNTHASE MODULE, ZN-DEPENDENT OXIDOREDUCTASE/POLYKETIDE SYNTHASE MODULE NSALEAKLLDEIKQSSNQELESSIDQILESIINGGGSGGGSMLNKFTKKEQILSEKQQIKQLSPLQRAALALKKLETKLNNTLHE 85 T 0.055 Tubulin pdb F Bacteria T 4mz5 1 A,B A,C DUT_HUMAN DUTPASE, DUTP PYROPHOSPHATASE AISPSKRARPAEV 13 T 0.021 DSBA unppercent F Eukaryota T 4mz6 1 A,B A,C DUT_HUMAN DUTPASE, DUTP PYROPHOSPHATASE AIEPSKRARPAEV 13 T 0.021 DSBA unppercent F Eukaryota T 4mzj 2 B T MYOA_PLAF7 PFM-A XKNXPSLXRVQAHIRKKMV 19 T 0.27 BORCS8 pdbhh F Eukaryota T 4mzk 2 B T MYOA_PLAF7 PFM-A XKNIPSLLRXQAHXRKKMV 19 T 0.55 IQ unppssm F Eukaryota T 4mzl 2 C,D C,D MYOA_PLAF7 hydrogen bond surrogate (HBS) myoA helix mimetic NIXSLLRVQAHIRKKMV 17 T 0.14 BORCS8 pdbhh F Eukaryota T 4mzz 1 A,B A,B OSTCN_BOVIN BONE GLA PROTEIN, BGP, GAMMA-CARBOXYGLUTAMIC ACID-CONTAINING PROTEIN EPKREVCELNPDCDELADHIGFQEAYRRFYGPV 33 T 0.099 Toxin_23 pdbpercent F Eukaryota T 4n0c 2 B,F B,F pCPE3 MPAGRPWDL 9 T 0.35 DUF4516 pdbhh F T 4n0p 1 A,B,C,D,E,F,G,H A,B,C,D,E,F,G,H Q9A495_CAUCR Pilus assembly protein CpaE MGSDKIHHHHHHENLYFQGIPRITIHAFCARPETAALIEKAAADRRMSRAATIVRDGGLEAAVDYYQNQPTPSLVMVETLDGAQRLLHLLDSLAQVCDPGTKVVVVGQTNDIALYRELMRRGVSEYLTQPLGPLQVIRAVGALYADPAAPF 151 T 0.00037 Response_reg pdbpssm F Bacteria T 4n39 2 B B HCFC1_HUMAN Host cell factor 1 THETGTTNTATTATSN 16 T 54 Ice_nucleation pdbhh F Eukaryota T 4n3a 2 B B HCFC1_HUMAN Host cell factor 1 VRVCSNPPCATHETGTTNTATTATSN 26 T 7.8 Ntox1 pdbhh F Eukaryota T 4n3b 2 B B HCFC1_HUMAN Host cell factor 1 VRVCSNPPCQTHETGTTNTATTATSN 26 T 3.2 Ntox1 pdbhh F Eukaryota T 4n3c 2 B B HCFC1_HUMAN Host cell factor 1 VRVCSNPPCETHETGTTNTATTATSN 26 T 15 DUF1936 pdbhh F Eukaryota T 4n3w 2 B C H4_HUMAN Histone H4 peptide GGAKRHRXVLRDNIQ 15 T 0.27 UPF0137 unp F Eukaryota T 4n4f 2 B C H4_HUMAN Histone 4 Peptide KGGKGLGXGGAXRHRKVLRDN 21 T 0.27 UPF0137 unp F Eukaryota T 4n5e 4 D B pCPA12 VPYMAEFGM 9 T 0.13 UPA_2 pdbhh F T 4n5t 2 B B ATSP-7041 stapled-peptide XLTFXEYWAQXXSAA 15 T 0.74 PBP-Tp47_a pdbhh F T 4n6p 2 B B TRFL_BOVIN LACTOFERRIN, LACTOFERRICIN-B, LFCIN-B LEACAF 6 T 9.3 T6SS_TssF pdbhh F Eukaryota F 4n78 6 F P WIRS WGAERSMSTFGKEKA 15 T 3.5 YicC_N pdbhh F T 4n7h 2 B B ARRD3_HUMAN TBP-2-LIKE INDUCIBLE MEMBRANE PROTEIN, TLIMP RPEAPPSYAEVVT 13 T 0.062 TMEM252 pdbhh F Eukaryota T 4n7s 1 A,C A,C Q9HYC5_PSEAE Uncharacterized protein TATSDLIESLISYSWDDWQVTRQEARRVIAAIRNDNVPDATIAALDKSGSLIKLFQRVGPPELARSLIASIAGRTTMQRYQARNALIRSLINNPLGTQTDNWIYFPTITFFDICADLADAAGRLGFAAAGATGVASQAIQGPFSGVGATGVNPTDLPSIAFGDQLKLLNKDPATVTKYSNPLGDLGAYLSQLSPQDKLNQAQTLVGQPISTLFPDAYPGNPPSRAKVMSAAARKYDLTPQLIGAIILAEQRDQTRDEDAKDYQAAVSIKSANTSIGLGQVVVSTAIKYELFTDLLGQPVRRGLSRKAVATLLASDEFNIFATARYIRYVANLASQQDLRKLPKTRGAFPSIDLRAYAGNPRNWPRDNVRALASEYTSRPWDDNLSPGWPMFVDDAYATFLD 401 T 0.0018 DUF1402 pdbhh F Bacteria T 4n7s 2 B,D B,D Q9HYC4_PSEAE inhibitor MGSSHHHHHHSSGENLYFQSHMSMTLTHPNGLVVERPVGFDARRSAEGFRFDEGGKLRNPRQLEVQRQDAPPPPDLASRRLGDGEARYKVEEDDGGSAGSEYRLWAAKPAGARWIVVSASEQSEDGEPTFALAWALLERARLQ 143 T 0.0052 PsbP_2 unphh F Bacteria T 4n7v 2 C C CE152_HUMAN CEP152 MSLDFGSVALPVQNEDEEYDEEDYEREKELQQLLTDLPHDMLDDDLSSPELQYSDCSEDG 60 T 0.035 BING4CT pdbpssm F Eukaryota T 4n7z 2 B B CE192_HUMAN Centrosomal protein of 192 kDa EKLILPTSLEDSSDDDIDDEMFYDDHLEAYFEQLAIPGMIYEDLEGPEPPEKGFKLPT 58 T 29 BDHCT_assoc pdbhh F Eukaryota T 4n80 1 A A Q9HYC5_PSEAE Uncharacterized protein TATSDLIESLISYSWDDWQVTRQEARRVIAAIRNDNVPDATIAALDKSGSLIKLFQRVGPPELARSLIASIAGRTTMQRYQARNALIRSLINNPLGTQTDNWIYFPTITFFDICADLADAAGRLGFAAAGATGVASQAIQGPFSGVGATGVNPTDLPSIAFGDQLKLLNKDPATVTKYSNPLGDLGAYLSQLSPQDKLNQAQTLVGQPISTLFPDAYPGNPPSRAKVMSAAARKYDLTPQLIGAIILAEQRDQTRDEDAKDYQAAVSIKSANTSIGLGQVVVSTAIKYELFTDLLGQPVRRGLSRKAVATLLASDEFNIFATARYIRYVANLASQQDLRKLPKTRGAFPSIDLRAYAGNPRNWPRDNVRALASEYTSRPWDDNLSPGWPMFVDDAYATF 399 T 0.0021 DUF1402 unphh F Bacteria T 4n80 2 B B Q9HYC4_PSEAE Uncharacterized protein SHMDFDKTLTHPNGLVVERPVGFDARRSAEGFRFDEGGKLRNPRQLEVQRQDAPPPPDLASRRLGDGEARYKVEEDDGGSAGSEYRLWAAKPAGARWIVVSASEQSEDGEPTFALAWALLERARL 125 T 0.0052 PsbP_2 unphh F Bacteria T 4n88 1 A,C A,C Q9HYC5_PSEAE Uncharacterized protein TATSDLIESLISYSWDDWQVTRQEARRVIAAIRNDNVPDATIAALDKSGSLIKLFQRVGPPELARSLIASIAGRTTMQRYQARNALIRSLINNPLGTQTDNWIYFPTITFFDICADLADAAGRLGFAAAGATGVASQAIQGPFSGVGATGVNPTDLPSIAFGDQLKLLNKDPATVTKYSNPLGDLGAYLSQLSPQDKLNQAQTLVGQPISTLFPDAYPGNPPSRAKVMSAAARKYDLTPQLIGAIILAEQRDQTRDEDAKDYQAAVSIKSANTSIGLGQVVVSTAIKYELFTDLLGQPVRRGLSRKAVATLLASDEFNIFATARYIRYVANLASQQDLRKLPKTRGAFPSIDLRAYAGNPRNWPRDNVRALASEYTSRPWDDNLSPGWPMFVDDAYATFLD 401 T 0.0018 DUF1402 pdbhh F Bacteria T 4n88 2 B,D B,D Q9HYC4_PSEAE Uncharacterized protein SHMMTLTHPNGLVVERPVGFDARRSAEGFRFDEGGKLRNPRQLEVQRQDAPPPPDLASRRLGDGEARYKVEEDDGGSAGSEYRLWAAKPAGARWIVVSASEQSEDGEPTFALAWALLERARLQ 123 T 0.0052 PsbP_2 unphh F Bacteria T 4nag 1 A,B A,B F0CAT0_9XANT Xanthomonin I GGPLAGEEIGGFNVPG 16 T 6.6 Rhabdo_M2 pdbhh F Bacteria T 4naq 2 B B poly A peptide AAAAAAA 7 T 270 DUF4179 pdbhh F F 4nb3 2 C,D C,D ATRIP_HUMAN 3,4 dichlorophenylalanine ATRIP derived peptide XDFTADDLEEWXALA 15 T 0.48 TT_ORF2 pdbhh F Eukaryota T 4nco 2 B,F,J B,F,J BG505 SOSIP gp41 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 78 F F F 4nds 1 A,B A,B AGBL_LYODE Alpha-galactosyl-binding lectin ACWKANSCPGSAFESKDRLRSFALLYCRYNYKPPYGQGAFGYASAVSTHGWETEAQCINTFEQIITSCHGQSNGGTLELNSGRLSLAFGNCEEL 94 T 0.065 Fungal_lectin_2 pdb F Eukaryota T 4ndt 1 A A AGBL_LYODE Alpha-galactosyl-binding lectin ACWKANSCPGSAFESKDRLRSFALLYCRYNYKPPYGQGAFGYASAVSTHGWETEAQCINTFEQIITSCHGQSNGGTLELNSGRLSLAFGNCEEL 94 T 0.065 Fungal_lectin_2 pdb F Eukaryota T 4ndu 1 A,B A,B AGBL_LYODE Alpha-galactosyl-binding lectin ACWKANSCPGSAFESKDRLRSFALLYCRYNYKPPYGQGAFGYASAVSTHGWETEAQCINTFEQIITSCHGQSNGGTLELNSGRLSLAFGNCEEL 94 T 0.065 Fungal_lectin_2 pdb F Eukaryota T 4ndv 1 A,B A,B AGBL_LYODE Alpha-galactosyl-binding lectin ACWKANSCPGSAFESKDRLRSFALLYCRYNYKPPYGQGAFGYASAVSTHGWETEAQCINTFEQIITSCHGQSNGGTLELNSGRLSLAFGNCEEL 94 T 0.065 Fungal_lectin_2 pdb F Eukaryota T 4nec 2 B,D,F,H E,F,G,H Echinomycin XAXXXAXX 8 T 190 RSF pdbhh F F 4nf9 2 C,D C,D NSL1_HUMAN Kinetochore-associated protein NSL1 homolog LKRKQTKDCPQRKWYPLRPKKINLDT 26 T 4.7 DUF3410 pdbhh F Eukaryota T 4nft 2 E,F E,F AN32E_HUMAN ANP32E, LANP-LIKE PROTEIN, LANP-L GSHMEEEEEEEEEEDEDEDEDEDEAGSELGEGEEEVGLSYLMKEEIQDEEDD 52 T 0.0014 BUD22 unp F Eukaryota T 4nge 3 C,F C,F HPREP, PITRILYSIN METALLOPROTEINASE 1, METALLOPROTEASE 1, HMP1 XXXXXXX 7 F F F 4ngh 3 C P MODIFIED FRAGMENT OF HIV GLYCOPROTEIN (GP41) XNWFNITNXLWXIXKKK 17 T 0.029 GP41 pdbhh F T 4nhc 3 C P MODIFIED FRAGMENT OF HIV GLYCOPROTEIN (GP41) XNWFNITNXLWXIKKKK 17 T 0.034 GP41 pdbhh F T 4nia 1 A,AB,CA,E,EB,GA,I,KA,M,OA,Q,SA,U,WA,Y A,N,H,B,O,I,C,J,D,K,E,L,F,M,G COAT_STMV Coat protein MGRGKVKPNRKSTGDNSNVVTMIRAGSYPKVNPTPTWVRAIPFEVSVQSGIAFKVPVGSLFSANFRTDSFTSVTVMSVRAWTQLTPPVNEYSFVRLKPLFKTGDSTEEFEGRASNINTRASVGYRIPTNLRQNTVAADNVCEVRSNCRQVALVISCCFN 159 T 24 Csm4_C pdbhh T Viruses T 4nio 1 A A SODC_HUMAN SUPEROXIDE DISMUTASE 1, HSOD1 GVTGIAQ 7 F F Eukaryota T 4nip 1 A A SODC_HUMAN SUPEROXIDE DISMUTASE 1, HSOD1 GVIGIAQ 7 F F Eukaryota F 4nl8 1 A,C,E C,D,F A0A0H3GL04_KLEPH Single-stranded DNA-binding protein WMDFDDDIPF 10 T 0.36 Phage_SSB pdbhh F Bacteria T 4nm0 2 B B AXIN1_HUMAN AXIS INHIBITION PROTEIN 1, HAXIN GGILVEPQKFAEELIHRLEAVQRT 24 T 0.19 Cwf_Cwc_15 unppssm F Eukaryota T 4nm3 2 B B AXIN1_HUMAN AXIS INHIBITION PROTEIN 1, HAXIN GGILVEPQKFAEELIHRLEAVQRT 24 T 0.19 Cwf_Cwc_15 unppssm F Eukaryota T 4nm5 2 B B AXIN1_HUMAN AXIS INHIBITION PROTEIN 1, HAXIN GGILVEPQKFAEELIHRLEAVQRT 24 T 0.19 Cwf_Cwc_15 unppssm F Eukaryota T 4nm5 3 C C LRP6_HUMAN Phosphorylated Wnt receptor LRP6 c-motif MPPPPTPRS 9 T 0.11 DUF5848 pdbhh F Eukaryota F 4nm7 2 B B AXIN1_HUMAN AXIS INHIBITION PROTEIN 1, HAXIN GGILVEPQKFAEELIHRLEAVQRT 24 T 0.19 Cwf_Cwc_15 unppssm F Eukaryota T 4nm7 3 C C LRP6_HUMAN Phosphorylated Wnt receptor LRP6 e-motif MPPPPSPCT 9 T 5.8 Nt_Gln_amidase pdbhh F Eukaryota F 4nmo 2 C,D C,D iCAL36(Ac-K-1) peptide ANSRWPTSXI 10 T 9.5 CBP_BcsR pdbhh F T 4nmp 2 C,D C,D iCAL36(Ac-K-3) peptide ANSRWPXSII 10 T 4.9 Arc_MA pdbhh F T 4nmq 2 C,D C,D iCAL36(Ac-K-4) peptide ANSRWXTSII 10 T 1.2 Arc_MA pdbhh F T 4nmr 2 C,D C,D iCAL36(Ac-K-5) peptide ANSRXPTSII 10 T 5.3 CX pdbhh F T 4nms 2 C,D C,D iCAL36(FLB-K-1) peptide ANSRWPTSXI 10 T 9.5 CBP_BcsR pdbhh F T 4nmt 2 C,D C,D iCAL36(TFA-K-1) peptide ANSRWPTSXI 10 T 9.5 CBP_BcsR pdbhh F T 4nmv 2 C,D C,D iCAL36(BRB-K-1) peptide ANSRWPTSXI 10 T 9.5 CBP_BcsR pdbhh F T 4nmx 3 C Z peptide 2-8 XTVFTSWEEYLDWVX 15 T 0.21 DUF5575 pdbhh F T 4nnd 2 B,D,F,H F,C,E,H ERBB2_HUMAN METASTATIC LYMPH NODE GENE 19 PROTEIN, MLN 19, PROTO-ONCOGENE NEU, PROTO-ONCOGENE C-ERBB-2, TYROSINE KINASE-TYPE CELL SURFACE RECEPTOR HER2, P185ERBB2 LQRXSE 6 T 21 LELP1 pdbhh F Eukaryota T 4nnl 2 C,D C,D TIP-1 PDZ domain ANSRFPTSII 10 T 3.4 LemA pdbhh F T 4nnm 2 C,D C,D TIP-1 PDZ domain YPTSII 6 T 44 C9orf72-like pdbhh F F 4nnx 3 C C KPCD2_HUMAN NPKC-D2 RQASLSISV 9 T 0.0054 TCAD9 unppercent F Eukaryota T 4nny 3 C C KPCD2_HUMAN NPKC-D2 RQASLSISV 9 T 0.0054 TCAD9 unppercent F Eukaryota T 4no3 3 C C AMPD2_HUMAN AMP deaminase 2 RQISQDVKL 9 T 40 DUF2590 pdbhh F Eukaryota T 4no5 3 C C AMPD2_HUMAN AMP deaminase 2 RQISQDVKL 9 T 40 DUF2590 pdbhh F Eukaryota T 4nqj 1 A,B,C A,B,C TRI69_HUMAN RFP-LIKE DOMAIN-CONTAINING PROTEIN TRIMLESS, RING FINGER PROTEIN 36, TRIPARTITE MOTIF-CONTAINING PROTEIN 69 SVGQSKEFLQISDAVHFFMEELAIQQGQLETTLKELQTLRNMQKEAIAAHKENKLHLQQHVSMEFLKLHQFLHSKEKDILTELREEGKALNEEMELNLSQLQEQCLLAKDMLVSIQAKTEQQNSFDFLKDITTLLHSLEQGMKVLATRELISRKLNLGQYKGPIQYMVWREMQDTLCPG 179 T 0.00026 DUF1043 pdbpssm F Eukaryota T 4nso 2 B B Q9KN41_VIBCH Immunity protein MGENCNDTSGVHQKILVCIQNEIAKSETQIRNNISSKSIDYGFPDDFYSKQRLAIHEKCMLYINVGGQRGELLMNQCELSMLQGLDIYIQQYIEDVDNSLLEHHHHHH 108 T 0.015 Fmp27_GFWDK pdbpercent F Bacteria T 4nsr 1 A,B,C,D,E,F B,C,A,E,F,D Q9KN41_VIBCH Immunity protein MGENCNDMGENCNDTSGVHQKILVCIQNEIAKSETQIRNNISSKSIDYGFPDDFYSKQRLAIHEKCMLYINVGGQRGELLMNQCELSMLQGLDIYIQQYIEDVDNSLLEHHHHHH 115 T 0.013 Fmp27_GFWDK pdbpercent F Bacteria T 4ntp 1 A,B,C,D,E,F,G,H,I,J,K,L,M,N,O,P A,B,C,D,E,F,G,H,I,J,K,L,M,N,O,P Cyclic hexadecapeptide (ORN)LV(PHI)FAED(ORN)AII(SAR)L(ORN)V XLVXFAEDXAIIXLXV 16 T 0.012 Beta-APP pdbhh F T 4ntr 1 A,B,C,D,E,F,G,H,I,J,K,L,M,N,O,P A,B,C,D,E,F,G,H,I,J,K,L,M,N,O,P Cyclic hexadecapeptide (ORN)LVFFAED(ORN)AII(SAR)L(ORN)V XLVFFAEDXAIIXLXV 16 T 0.012 Beta-APP pdbhh F T 4nu1 2 B B AXIN1_HUMAN AXIS INHIBITION PROTEIN 1, HAXIN GGILVEPQKFAEELIHRLEAVQRT 24 T 0.19 Cwf_Cwc_15 unppssm F Eukaryota T 4nu8 2 B A CYSE_SALTY Peptide from Serine acetyltransferase TFEYGDGI 8 T 2.1 Cyanate_lyase pdbhh F Bacteria T 4nuf 2 B P EID1_MOUSE ORPHAN NUCLEAR RECEPTOR SHP, SMALL HETERODIMER PARTNER MHRVSAALEEANKVFL 16 T 8.1 DUF4646 pdbhh F Eukaryota T 4nuu 2 C C ACKR1_HUMAN ATYPICAL CHEMOKINE RECEPTOR 1, FY GLYCOPROTEIN, GPFY, GLYCOPROTEIN D, PLASMODIUM VIVAX RECEPTOR GPTGNSSQLDFEDVWNSSYGVNDSFPDGDYGA 32 T 1.4 DUF2603 pdbhh F Eukaryota T 4nuv 2 C,D C,D ACKR1_HUMAN ATYPICAL CHEMOKINE RECEPTOR 1, FY GLYCOPROTEIN, GPFY, GLYCOPROTEIN D, PLASMODIUM VIVAX RECEPTOR GPTGTENSSQLDFEDVWNSSYGVNDSFPDGDYGA 34 T 1.7 Myosin-VI_CBD pdbhh F Eukaryota T 4nw2 2 B,D B,D NS1_I72A2 Nonstructural protein 1 PKQKRKMARTARSKV 15 T 22 LPP20 pdbhh T Viruses T 4nw8 1 A,B,C,D,E,F,G,H,I,J,K,L A,B,C,D,E,F,G,H,I,J,K,L Cyclic hexadecapeptide (ORN)LV(PHI)(MEA)AED(ORN)AIIGL(ORN)V XLVXXAEDXAIIGLXV 16 T 0.012 Beta-APP pdbhh F T 4nw9 1 A,B,C,D,E,F,G,H,I,J,K,L A,B,C,D,E,F,G,H,I,J,K,L Cyclic hexadecapeptide (ORN)LVF(MEA)AED(ORN)AIIGL(ORN)V XLVFXAEDXAIIGLXV 16 T 0.012 Beta-APP pdbhh F T 4nxq 2 D,E,F D,E,F CNTP4_HUMAN CASPR4 PEPTIDE ENQKEYFF 8 T 3.8 La pdbhh F Eukaryota T 4nxr 2 B B NRX2B_HUMAN NEUREXIN II-BETA PEPTIDE NKDKEYYV 8 T 5.3 TMEM154 unphh F Eukaryota T 4ny3 2 C,D C,D PP2AA_HUMAN PP2A-ALPHA, REPLICATION PROTEIN C, RP-C TPDYFL 6 T 7.6 TDH pdbhh F Eukaryota T 4nz8 2 B,C B,C CLEAVED poly-Ala AAAAAA 6 T 340 UPF0253 pdbhh F F 4nzr 3 C M Y281_MYCGE PROTEIN MG281 MGSSHHHHHHSSGLVPRGSHMSLSLNDGSYQSEIDLSGGANFREKFRNFANELSEAITNSPKGLDRPVPKTEISGLIKTGDNFITPSFKAGYYDHVASDGSLLSYYQSTEYFNNRVLMPILQTTNGTLMANNRGYDDVFRQVPSFSGWSNTKATTVSTSNNLTYDKWTYFAAKGSPLYDSYPNHFFEDVKTLAIDAKDISALKTTIDSEKPTYLIIRGLSGNGSQLNELQLPESVKKVSLYGDYTGVNVAKQIFANVVELEFYSTSKANSFGFNPLVLGSKTNVIYDLFASKPFTHIDLTQVTLQNSDNSAIDANKLKQAVGDIYNYRRFERQFQGYFAGGYIDKYLVKNVNTNKDSDDDLVYRSLKELNLHLEEAYREGDNTYYRVNENYYPGASIYENERASRDSEFQNEILKR 416 T 0.13 RE_endonuc pdbpercent F Bacteria T 4nzt 1 A M Y281_MYCGE PROTEIN MG281 MGSSHHHHHHSSGLVPRGSHMSLSLNDGSYQSEIDLSGGANFREKFRNFANELSEAITNSPKGLDRPVPKTEISGLIKTGDNFITPSFKAGYYDHVASDGSLLSYYQSTEYFNNRVLMPILQTTNGTLMANNRGYDDVFRQVPSFSGWSNTKATTVSTSNNLTYDKWTYFAAKGSPLYDSYPNHFFEDVKTLAIDAKDISALKTTIDSEKPTYLIIRGLSGNGSQLNELQLPESVKKVSLYGDYTGVNVAKQIFANVVELEFYSTSKANSFGFNPLVLGSKTNVIYDLFASKPFTHIDLTQVTLQNSDNSAIDANKLKQAVGDIYNYRRFERQFQGYFAGGYIDKYLVKNVNTNKDSDDDLVYRSLKELNLHLEEAYREGDNTYYRVNENYYPGASIYENERASRDSEFQNEILKR 416 T 0.13 RE_endonuc pdbpercent F Bacteria T 4o1v 2 B B PTEN_HUMAN MUTATED IN MULTIPLE ADVANCED CANCERS 1, PHOSPHATASE AND TENSIN HOMOLOG PSNPEASSSTSVTPD 15 T 60 DUF3636 pdbhh F Eukaryota T 4o27 3 C C STK24_HUMAN 5-mer peptide from serine/threonine-protein kinase 24 DWIFE 5 T 22 BRCT_2 pdbhh F Eukaryota F 4o2c 3 C C DDX3X_HUMAN DEAD BOX PROTEIN 3, X-CHROMOSOMAL, DEAD BOX, X ISOFORM, HELICASE-LIKE PROTEIN 2, HLP2 XSHVAVENAL 10 T 17 HTH_SUN2 pdbhh F Eukaryota T 4o2e 3 C,F C,F DDX3X_HUMAN DEAD BOX PROTEIN 3, X-CHROMOSOMAL, DEAD BOX, X ISOFORM, HELICASE-LIKE PROTEIN 2, HLP2 SHVAVENAL 9 T 13 HTH_SUN2 pdbhh F Eukaryota T 4o2f 3 C,F C,F DDX3X_HUMAN DEAD BOX PROTEIN 3, X-CHROMOSOMAL, DEAD BOX, X ISOFORM, HELICASE-LIKE PROTEIN 2, HLP2 HVAVENAL 8 T 9.2 HTH_SUN2 pdbhh F Eukaryota T 4o3t 3 C P ZAP.14 IVGGYPWWMDV 11 T 0.13 Laps pdbhh F T 4o3u 3 C P ZAP 2.3 IIGGCPYWMDREECI 15 T 0.39 DUF779 pdbhh F T 4o42 2 B B NS1_I83A8 Nonstructural protein 1 PKQKRKMARTARSKV 15 T 22 LPP20 pdbhh T Viruses T 4o45 2 B B NS1_I72A2 Nonstructural protein 1 PKQKRKMARTARSKV 15 T 22 LPP20 pdbhh T Viruses T 4o46 2 G,H,I,J,K,L G,H,I,J,K,L Q9YP60_9INFA Nonstructural protein 1 PKQKRKMARTARSKV 15 T 22 LPP20 pdbhh T Viruses T 4o46 3 M,N,O,P M,N,O,V Unidentified polymer XXXXXXXXXXXXXXXXXXXXXXXX 24 F F F 4o4a 1 A,B,C,D A,B,C,D A0A6L8PAP6_BACAN PUTATIVE LIPOPROTEIN MHHHHHHSSGVDLGTENLYFQSNAKETTDTIYLIPEEYEGDLIVVYNVPGAELLPKEEEFSVVTFAADGTAVTSTKNMKFGTVNDLYYTVNKEGQRTKIDSSCIHFSSTGSRTENSWEFPFANLEVTRTACSQEFSANGREVPENQEHPAEKKMRDLMQRIQERYMNKVK 170 T 0.019 Lysis_col unphh F Bacteria T 4o56 2 B B synthetic peptide GPMTSTPK 8 T 3.4 RPN1_RPN2_N pdbhh F T 4o6f 2 B B ESR1_HUMAN ER, ER-ALPHA, ESTRADIOL RECEPTOR, NUCLEAR RECEPTOR SUBFAMILY 3 GROUP A MEMBER 1 GGRMLKHKRQR 11 T 11 Gemini_mov pdbhh F Eukaryota T 4o6w 2 B C Peptide-Based inhibitor XPLXSTX 7 T 680 zf-C2H2_jaz pdbhh F F 4o7j 1 A,B B,A CarG LTPVTLKNGVNQLDINQDGLKDYVVLAQFDNNTSHPNLGLTFFIHRPDGGYSIMPVTNSSEFTWFDYRLSASADFLVQDNRLFKIKKHYYLVTARKTEEDLFDVGKVSLTIYRFKVSRDDPGVPLYEWSMSKTVTAQRSYQSADEAYQEVDEAMLTRHHHHHH 163 T 0.008 FG-GAP_3 pdbpssm F T 4o7k 1 A A OSA_SHIFL ONCOGENIC SUPPRESSION ACTIVITY PROTEIN HHMLLWRRCRAWLEIRRLDKELAQSSGLPLELPQIVPNAWNEVVWRLPVPNHPDAFMTASNAAQSDFIVYVNGLAFYRAWLALGVEDSQACPLKQDMPKDRKYPSSAAHFAVGIDSPVPLADVSPTMILGHFAVCFTDGMTRSMWLLAHEVAVFPVLSRDEASAVMLAEHVGVAAPIQVSKLREQCRKIL 190 T 0.061 DUF3613 pdb F Bacteria T 4o87 1 A,B A,B Q707V3_9ASCO N-tagged Nuclease SMNPTTCLNEGAIGYMAIDILQSQNIETITINDNEYKLNKFNNIKDYISKVWGAASVYNLDLGNDYTKWQSSLDNVETDNIKNYINGHDNVYYNPGGKNKYLIIEASKELKWKGNLNNNKFNVNLKSIFSNAENLKVGHSDLLKLFSSIVNSKGSDNQKKVLNSLLDNINDRRLKKLVSTGQWTEAISDSVANEIAKNNKLTSIKAQLGSQKTQNVMIDANGHDLLKIDYDKTFVTANDLKNKIIDKNKLENAKNYFKIQNNDKILEDIKSKFSKNINENIKGSIRDHAKLIEFTENKKFNTINDNSNSDSKIKSITCKV 320 T 0.011 DWNN pdb F Eukaryota T 4o88 1 A,B A,B Q707V3_9ASCO N-tagged Nuclease PTTCLNEGAIGYMAIDILQSQNIETITINDNEYKLNKFNNIKDYISKVWGAASVYNLDLGNDYTKWQSSLDNVETDNIKNYINGHDNVYYNPGGKNKYLIIEASKELKWKGNLNNNKFNVNLKSIFSNAENLKVGHSDLLKLFSSIVNSKGSDNQKKVLNSLLDNINDRRLKKLVSTGQWTEAISDSVANEIAKNNKLTSIKAQLGSQKTQNVMIDANGHDLLKIDYDKTFVTANDLKNKIIDKNKLENAKNYFKIQNNDKILEDIKSKFSKNINENIKGSIRDHAKLIEFTENKKFNTINDNSNSDSKIKSITCKVLEHHHHHH 325 T 0.012 DWNN pdb F Eukaryota T 4o97 2 B B ST14_HUMAN Peptide CGLR CGLR 4 T 22 CIAPIN1 pdbhh F Eukaryota F 4o9v 2 B B ST14_HUMAN Peptide CGLR CGLR 4 T 22 CIAPIN1 pdbhh F Eukaryota F 4o9w 2 B B phospho peptide VAL-LEU-SER-TPO-LEU-NH2 VLSTLX 6 T 300 Ribosomal_S8 pdbhh F F 4oaj 2 B B 5HT2A_RAT 5-hydroxytryptamine receptor 2A peptide NEKVSCV 7 T 62 ELF pdbhh F Eukaryota T 4oar 2 B B NCOR2_HUMAN SMRT PEPTIDE TNMGLEAIIRKALMGKY 17 T 2.8 RuvA_C pdbhh F Eukaryota T 4od7 2 D,E,F D,E,F (ACE)PWATCDS(NH2) Peptide XPWATCDSX 9 T 5.8 Toxin_37 pdbhh F T 4odq 2 B B RS3_ECOLI 30S ribosomal protein S3 RLGIVKPWNSTWFANX 16 T 0.0042 MRP-S24 unphh F Bacteria T 4oez 2 B B co-regulator peptide SDSAFSRLYTRS 12 T 2.9 Msap1 pdbhh F T 4ofb 2 B B nonphosphopeptide inhibitor XTIDXDEYRXRKTX 14 T 2.5 UCR_TM pdbhh F T 4ofr 2 B B co-regulator peptide ANSSFRDWYTSS 12 T 0.97 DUF1122 pdbhh F T 4ofu 2 B B co-regulator peptide SDSAFSRYYTRS 12 T 1.7 DUF3486 pdbhh F T 4oga 6 F F INSR_HUMAN IR, INSULIN RECEPTOR SUBUNIT ALPHA, INSULIN RECEPTOR SUBUNIT BETA TFEDYLHNVVFVPRPS 16 T 0.00017 DUF4998 unphh F Eukaryota T 4oh4 2 C,D F,E BKI1_ARATH BRI1 kinase inhibitor 1 STMEELQAAIQAAIAHCKNSY 21 T 0.075 DUF5765 pdbpercent F Eukaryota T 4oha 2 B B co-regulator peptide SDSAFSRLYTRS 12 T 2.9 Msap1 pdbhh F T 4oih 2 B B RCC1_YEAST PRP20, PHEROMONE RESPONSE PATHWAY COMPONENT SRM1, PRE-MRNA-PROCESSING PROTEIN 20, REGULATOR OF CHROMOSOME CONDENSATION, SUPPRESSOR OF RECEPTOR MUTATIONS 1, MRNA TRANSPORT PROTEIN 1 GSMVKRTVATNGDASGAHRAKKMSKTH 27 T 0.0078 RCC1 unppssm F Eukaryota T 4oij 2 C,D C,D D-Ser-CCL1 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGXXXXXXXGXXXXXXXXXGXXXXXXXXXXXXXXXXX 74 F F F 4oik 2 C,D C,D D-Ser-CCL1 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGXXXXXXXGXXXXXXXXXGXXXXXXXXXXXXXXXXX 74 F F F 4oil 2 B B co-regulator peptide NTTDTLFSQHYR 12 T 5.7 Nairo_nucleo pdbhh F T 4oin 8 I I GE23077 XXXXXXX 7 T 900 DUF3927 pdbhh F F 4oip 8 I I GE23077 XXXXXXX 7 T 900 DUF3927 pdbhh F F 4oiq 8 I I GE23077 XXXXXXX 7 T 900 DUF3927 pdbhh F F 4oir 8 I I GE23077 XXXXXXX 7 T 900 DUF3927 pdbhh F F 4oiu 2 B B co-regulator peptide ANSSFRDWYTSS 12 T 0.97 DUF1122 pdbhh F T 4oj9 2 B B co-regulator peptide SDSAFSRYYTRS 12 T 1.7 DUF3486 pdbhh F T 4ojf 3 C A A4_HUMAN Amyloid beta A4 protein DAEFRHDS 8 T 0.0001 Beta-APP unphh F Eukaryota T 4okt 2 B B co-regulator peptide SDSAFSRLYTRS 12 T 2.9 Msap1 pdbhh F T 4okv 3 E,F E,F Q7YT37_ANOST GE RICH SALIVARY GLAND PROTEIN KYSKIKECFDSLADDVKSLVEKSETSYEECSKDKNNPHCGSEGTRELDEGLIEREQKLSDCIVEKR 66 T 0.028 DUF725 pdb F Eukaryota T 4okw 2 B B co-regulator peptide NTTDTLFSQHYR 12 T 5.7 Nairo_nucleo pdbhh F T 4okx 2 B B co-regulator peptide ANSSFRDWYTSS 12 T 0.97 DUF1122 pdbhh F T 4olm 2 B B co-regulator peptide SDSAFSRYYTRS 12 T 1.7 DUF3486 pdbhh F T 4olr 1 A,B A,B [Leu-5]-Enkephalin mutant - YVVFV YVVFV 5 T 63 ATG29_N pdbhh F F 4omc 2 G,H,I,J,K,L H,I,J,K,L,N meta-guanidinomethyl-phenylacetyl-Arg-Val-Arg-(amidomethyl)benzamidine XRVRX 5 T 450 Consortin_C pdbhh F F 4omd 2 G,H,I,J,K,L H,I,J,K,L,N phenylacetyl-Arg-Val-Arg-(amidomethyl)benzamidine XRVRX 5 T 450 Consortin_C pdbhh F F 4onf 3 C P A4_HUMAN Amyloid beta A4 protein DAEFRHD 7 T 1 DUF5973 pdbhh F Eukaryota T 4oni 2 B,D C,D NR0B2_HUMAN NUCLEAR RECEPTOR NR0B2, ORPHAN NUCLEAR RECEPTOR SHP, SMALL HETERODIMER PARTNER QGAASRPAILYALLSSSLK 19 T 9.2 NR_Repeat unphh F Eukaryota T 4onk 1 A,B A,B [Leu-5]-Enkephalin mutant - YVVFL YVVFL 5 T 44 PetL pdbhh F F 4oo6 2 B B RBM39_HUMAN HEPATOCELLULAR CARCINOMA PROTEIN 1, RNA-BINDING MOTIF PROTEIN 39, RNA-BINDING REGION-CONTAINING PROTEIN 2, SPLICING FACTOR HCC1 RSRSKERRRSRSRSRDRRFRGRYRSPY 27 T 0.67 CDC45 unp F Eukaryota T 4oq8 1 A,A10,A11,A12,A13,A14,A15,A2,A3,A4,A5,A6,A7,A8,A9 A,A,A,A,A,A,A,A,A,A,A,A,A,A,A COAT_STMV Coat protein MGRGKVKPNRKSTGDNSNVVTMIRAGSYPKVNPTPTWVRAIPFEVSVQSGIAFKVPVGSLFSANFRTDSFTSVTVMSVRAWTQLTPPVNEYSFVRLKPLFKTGDSTEEFEGRASNINTRASVGYRIPTNLRQNTVAADNVCEVRSNCRQVALVISCCFN 159 T 24 Csm4_C pdbhh T Viruses T 4oq9 1 A,B,C,D,E,F,G,H,I,J,K,L,M,N,O A,B,C,D,E,F,G,H,I,J,K,L,M,N,O COAT_STMV Coat protein MGRGKVKPNRKSTGDNSNVVTMIRAGSYPKVNPTPTWVRAIPFEVSVQSGIAFKVPVGSLFSANFRTDSFTSVTVMSVRAWTQLTPPVNEYSFVRLKPLFKTGDSTEEFEGRASNINTRASVGYRIPTNLRQNTVAADNVCEVRSNCRQVALVISCCFN 159 T 24 Csm4_C pdbhh T Viruses T 4ore 2 B A Peptide inhibitor TFEYGDGI 8 T 2.1 Cyanate_lyase pdbhh F T 4os1 2 B B bicyclic peptide UK601 (bicyclic 1) GXALGRGCENHRCLX 15 T 1.4 Ivy pdbhh F T 4os2 2 B B bicyclic peptide UK602 (bicyclic 1) GXLGRGCENHRCLX 14 T 0.96 Ivy pdbhh F T 4os4 2 B B bicyclic peptide UK603 (bicyclic 1) GXALGRGCENHRCLX 15 T 1.4 Ivy pdbhh F T 4os5 2 B B bicyclic peptide UK603 (bicyclic 2) GXALGRGCENHRCLX 15 T 1.4 Ivy pdbhh F T 4os6 2 B B bicyclic peptide UK604 (bicyclic 2) GXLGRGCENHRCLX 14 T 0.96 Ivy pdbhh F T 4os7 2 B B bicyclic peptide UK607 (bicyclic) GXALGRGCENHRCLX 15 T 1.4 Ivy pdbhh F T 4ou3 2 B B tumor-homing peptide CNGRCG 6 T 1.3 GON pdbhh F F 4ovb 1 A A OSA_SHIFL Protein osa MLLWRRCRAWLEIRRLDKELAQSSGLPLELPQIVPNAWNEVVWRLPVPNHPDAFMTASNAAQSDFIVYVNGLAFYRAWLALGVEDSQACPLKQDMPKDRKYPSSAAHFAVGIDSPVPLADVSPTMILGHFAVCFTDGMTRSMWLLAHEVAVFPVLSRDEASAVMLAEHVGVAAPIQVSKLREQCRKIL 188 T 0.06 DUF3613 pdb F Bacteria T 4owr 2 B B NUP98_HUMAN Nuclear pore complex protein Nup98-Nup96 GSPTGTTIKFNPPTGTDTMVKAGVSTNISTKHQCITAMKEYESKSLEELRLEDYQANRK 59 T 0.44 Nucleoporin_FG unp F Eukaryota T 4oxd 2 F H MUB-ALA-ZGL-LYS-DSG XAXKX 5 T 920 Proteasome_A_N pdbhh F F 4oyk 2 C,D C,D OTUL_HUMAN DEUBIQUITINATING ENZYME OTULIN, OTU DOMAIN-CONTAINING DEUBIQUITINASE WITH LINEAR LINKAGE SPECIFICITY, UBIQUITIN THIOESTERASE GUMBY, DEUBIQUITINATING ENZYME OTULIN, OTU DOMAIN-CONTAINING DEUBIQUITINASE WITH LINEAR LINKAGE SPECIFICITY, UBIQUITIN THIOESTERASE GUMBY AEHEEDMYRAADEIEKEKE 19 T 0.44 40S_SA_C pdbhh F Eukaryota T 4oz7 1 A,B A,B Methanobactin XASCSXGPNC 10 T 0.31 DUF1499 pdbhh F T 4ozf 5 E J GDA2_WHEAT deamidated Gliadin-alpha2 peptide APQPELPYPQPGS 13 T 3 FAP unp F Eukaryota T 4ozg 5 I,J I,J GDA2_WHEAT deamidated Gliadin-alpha2 peptide APQPELPYPQPGS 13 T 3 FAP unp F Eukaryota T 4ozh 5 I,J I,J GDA2_WHEAT Gliadin-alpha2 peptide APQPELPYPQPGS 13 T 3 FAP unp F Eukaryota T 4ozi 5 I,J I,J GDA2_WHEAT deamidated Gliadin-alpha1 peptide QPFPQPELPYPGS 13 T 3 FAP unp F Eukaryota T 4p00 3 C D unidentified peptide XXXXXXXXX 9 F F F 4p02 3 C D unidentified peptide XXXXXXXXX 9 F F F 4p0a 2 B,D B,D TERA_HUMAN TER ATPASE,15S MG(2+)-ATPASE P97 SUBUNIT,VALOSIN-CONTAINING PROTEIN,VCP TEDNDDDLYG 10 T 17 DUF228 pdbhh F Eukaryota T 4p0b 2 B,D B,D OTUL_HUMAN DEUBIQUITINATING ENZYME OTULIN, OTU DOMAIN-CONTAINING DEUBIQUITINASE WITH LINEAR LINKAGE SPECIFICITY, UBIQUITIN THIOESTERASE GUMBY EEDMYRAADE 10 T 10 Ribosomal_L18_c pdbhh F Eukaryota T 4p1n 2 C,D C,D W0TA43_KLUMA Atg13 MIM ETPPEDLLEFVKLLEDKKELNMKPSTILPQQDISSSLIKFQSMKPNNDTLSDNLSMSMSID 61 T 14 E2_bind pdbhh F Eukaryota T 4p1w 4 G G C5DB94_LACTC KLTH0A00704P SKYSSSFGRLRRQ 13 T 6.2 Corona_5a pdbhh F Eukaryota T 4p2o 5 E P 2A peptide ADPADPLAFFSSAIKGGGGSLV 22 T 0.37 Rib_5-P_isom_A pdbhh F T 4p2q 3 C,H,M,R C,H,M,R 5c2 peptide ADGLAYFRSSFKGG 14 T 10 DUF1338 pdbhh F T 4p2r 3 C,H,M,R C,H,M,R 5c1 peptide ANGVAFFLTPFKA 13 T 9.8 DUF5699 pdbhh F T 4p3w 2 C,D,G,H,K,L G,H,K,L,J,I FBLI1_HUMAN FBLP-1,MIGFILIN,MITOGEN-INDUCIBLE 2-INTERACTING PROTEIN,MIG2-INTERACTING PROTEIN PEKRVASSVFITLAPPRRDVAVAE 24 T 8.4 Pox_A3L pdbhh F Eukaryota T 4p4w 1 A,B,C,D,E,F,G,H,I,J,K,L A,B,C,D,E,F,G,H,I,J,K,L CYCLIC HEXADECAPEPTIDE (ORN)YLL(PHI)YTE(ORN)KVA(MVA)AVK XYLLXYTEXKVAXAVK 16 T 1.2 Peptidase_C98 pdbhh F T 4p4y 1 A,B,C,D,E,F,G A,B,C,D,E,F,G CYCLIC HEXADECAPEPTIDE (ORN)YLL(PHI)YTE(ORN)KVT(MAA)TVK XYLLXYTEXKVTXTVK 16 T 1.1 Peptidase_C98 pdbhh F T 4p4z 1 A,B,C,D,E,F,G,H,I,J,K,L A,B,C,D,E,F,G,H,I,J,K,L CYCLIC HEXADECAPEPTIDE (ORN)YLL(PHI)YTE(ORN)KVT(MVA)TVK XYLLXYTEXKVTXTVK 16 T 1.1 Peptidase_C98 pdbhh F T 4p6f 56 EB,JD Z7,Z8 T17-GLY-GLY-PRO-LYS-LYS-LYS-LYS-LYS-VAL-GLY-GLY XGGPKKKKKVGG 12 T 0.99 Rrn6 pdbhh F F 4p6j 1 A,B A,B Computationally Designed Transporter of Zn(II) and Proton YXKEIAHALFSALFALSELYIAVRYX 26 T 9.4 DUF6332 pdbhh F T 4p6k 1 A A Computationally Designed Transporter of Zn(II) and Proton YXKEIAHALFSALFALSELYIAVRYX 26 T 9.4 DUF6332 pdbhh F T 4p6l 1 A,B A,B Computationally Designed Transporter of Zn(II) and proton YYKEIAHALFSALFALSELYIAVRYX 26 T 9.4 DUF6332 pdbhh F T 4p6z 6 F T BST2_HUMAN BST-2,HM1.24 ANTIGEN,TETHERIN AGFSMASTSYDYCRVPMEDGDKRCK 25 T 0.36 UL42 unphh F Eukaryota T 4p7i 2 C,D C,D DCAF1_HUMAN DDB1- AND CUL4-ASSOCIATED FACTOR 1,HIV-1 VPR-BINDING PROTEIN,VPRBP,SERINE/THREONINE-PROTEIN KINASE VPRBP,VPR-INTERACTING PROTEIN GPGSEFGEDGDNDFSPSDEELANLLEEGEDGEDEDSDADEEVELILGDTDSSDNSDLEDDIILSLNE 67 T 8 ACC_epsilon pdbhh F Eukaryota T 4p9h 2 B G Q0ED31_9HIV1 Envelope glycoprotein gp160 VWKDADTTLFCASDAKAHETECHNVWATHACVPTDPNPQEIHLEQVTENFNMWKNNMVEQMQEDVISLWDQCLQPCVKLTGGSVIKQACPKISFDPIPIHYCTPAGYVILKCNDKNFNGTGPCKNVSSVQCTHGIKPVVSTQLLLNGSLAEEEIIIRSENLTNNAKTIIVHLQKSVEINCTRPSNGGSGSGGDIRKAYCEIQGTKWNKVLKQVTEKLKEHFNNKTIIFQPPSGGDLEITMHHFNCRGEFFYCNTTQLFQNTCIGNETMKGCNGTITLPCKIKQIINMWQGTGQAMYAPPIDGKINCVSQITGILLTRDGGANNTSNETFRPGGGNIKDNWRSELYKYKVVQIEGSHHHHHH 361 T 2.6E-48 GP120 unp T Viruses T 4p9v 2 B B PHQ-PTR-02K-ASN-NH2 XXXNX 5 T 430 DUF3673 pdbhh F F 4p9z 2 B B NMI-PTR-02K-ASN-NH2 XXXNX 5 T 430 DUF3673 pdbhh F F 4pby 2 C,D C,D MTA1_HUMAN MTA1 DVFYMATEETRKIRKLLSSSETKRAARRPYK 31 T 0.74 MTA_R1 unp F Eukaryota T 4pbz 2 B B MTA1_HUMAN Metastasis-associated protein MTA1 KLLSSSETKRAARRPYKPIALRQSQA 26 T 0.74 MTA_R1 unp F Eukaryota T 4pc0 2 C,D C,D MTA1_HUMAN HUMAN MTA1 KLLSSSETKRAARRPYKPIALRQSQALPPRPPPPAPVNDEPI 42 T 0.14 MTA_R1 pdbpssm F Eukaryota T 4pdc 2 E,F E,F Q8QN43_COWPX CPXV018 protein GHKLAFNFNLEINGSDTHSTVDVDLDDSQIITFDGKDIRPTIPFMIGDEIFLPFYKNVFSEFFSLFRRVPTSTPYEDLTYFYECDYTDNKSTFDQDYLYNGEEYTVKTQEATNKNMWLTTSEFRLKKWFDGEDCIMHLRSLVRKMEDSKR 150 T 0.078 Thioredoxin_11 unppssm T Viruses T 4pes 2 C,D C,D Ala-Ala-Ala AAA 3 T 1200 RNase_HII pdbhh F F 4pew 1 A,B A,B LAM55_STREK Putative secreted protein SFGSDVRPAAAQEVVGGGDLGPNVLVFDPSTPDIQGKVDEVFRKQESNQFGTDRYALMFKPGTYNDINAQIGFYTSIAGLGLNPDDTTFNGDVTVDAGWFDGNATQNFWRSAENLALNPVNGTNRWAVSQAAPFRRMHVKGGLNLAPDGYGWASGGYIADSKIDGEVGPYSQQQWYTRDSSVGGWGNGVWNMTFSGVEGAPAQSFPEPPYTTLETTPVSREKPFLYLDGDDYKVFVPAKRTNARGTSWGNGTPEGESLPLDQFYVVKPGATAETINAAVDQGLHLLFTPGVYHVDQPIEIDRANTVALGLGLATIIPDNGVTALKVGDVDGVKVAGLLVDAGPVNSETLVEVGSDGASGDHAANPTSLQDVFVRIGGAGPGKATTSIVVNSNDTIIDHTWVWRADHGEGVGWETNRADYGVHVKGDNVLATGLFVEHFNKYDVQWSGENGKTIFYQNEKAYDAPDQAAIQNGDIKGYAAYKVDDSVTTHEGWGMGSYCYFNVNPDIRQQHGFQAPVKPGVKFHDLLVVSLGGKGQYEHVINDIGDPTSGDTTIPSQVVSFP 561 T 0.0083 Pectate_lyase_3 unphh F Bacteria T 4pex 1 A,B A,B LAM55_STREK Putative secreted protein SFGSDVRPAAAQEVVGGGDLGPNVLVFDPSTPDIQGKVDEVFRKQESNQFGTDRYALMFKPGTYNDINAQIGFYTSIAGLGLNPDDTTFNGDVTVDAGWFDGNATQNFWRSAENLALNPVNGTNRWAVSQAAPFRRMHVKGGLNLAPDGYGWASGGYIADSKIDGEVGPYSQQQWYTRDSSVGGWGNGVWNMTFSGVEGAPAQSFPEPPYTTLETTPVSREKPFLYLDGDDYKVFVPAKRTNARGTSWGNGTPEGESLPLDQFYVVKPGATAETINAAVDQGLHLLFTPGVYHVDQPIEIDRANTVALGLGLATIIPDNGVTALKVGDVDGVKVAGLLVDAGPVNSETLVEVGSDGASGDHAANPTSLQDVFVRIGGAGPGKATTSIVVNSNDTIIDHTWVWRADHGEGVGWETNRADYGVHVKGDNVLATGLFVEHFNKYDVQWSGENGKTIFYQNAKAYDAPDQAAIQNGDIKGYAAYKVDDSVTTHEGWGMGSYCYFNVNPDIRQQHGFQAPVKPGVKFHDLLVVSLGGKGQYEHVINDIGDPTSGDTTIPSQVVSFP 561 T 0.0083 Pectate_lyase_3 unphh F Bacteria T 4pey 1 A A LAM55_STREK Putative secreted protein SFGSDVRPAAAQEVVGGGDLGPNVLVFDPSTPDIQGKVDEVFRKQESNQFGTDRYALMFKPGTYNDINAQIGFYTSIAGLGLNPDDTTFNGDVTVDAGWFDGNATQNFWRSAENLALNPVNGTNRWAVSQAAPFRRMHVKGGLNLAPDGYGWASGGYIADSKIDGEVGPYSQQQWYTRDSSVGGWGNGVWNMTFSGVEGAPAQSFPEPPYTTLETTPVSREKPFLYLDGDDYKVFVPAKRTNARGTSWGNGTPEGESLPLDQFYVVKPGATAETINAAVDQGLHLLFTPGVYHVDQPIEIDRANTVALGLGLATIIPDNGVTALKVGDVDGVKVAGLLVDAGPVNSETLVEVGSDGASGDHAANPTSLQDVFVRIGGAGPGKATTSIVVNSNDTIIDHTWVWRADHGEGVGWETNRADYGVHVKGDNVLATGLFVEHFNKYDVQWSGENGKTIFYQNAKAYDAPDQAAIQNGDIKGYAAYKVDDSVTTHEGWGMGSYCYFNVNPDIRQQHGFQAPVKPGVKFHDLLVVSLGGKGQYEHVINDIGDPTSGDTTIPSQVVSFP 561 T 0.0083 Pectate_lyase_3 unphh F Bacteria T 4pez 1 A A LAM55_STREK Putative secreted protein SFGSDVRPAAAQEVVGGGDLGPNVLVFDPSTPDIQGKVDEVFRKQESNQFGTDRYALMFKPGTYNDINAQIGFYTSIAGLGLNPDDTTFNGDVTVDAGWFDGNATQNFWRSAENLALNPVNGTNRWAVSQAAPFRRMHVKGGLNLAPDGYGWASGGYIADSKIDGEVGPYSQQQWYTRDSSVGGWGNGVWNMTFSGVEGAPAQSFPEPPYTTLETTPVSREKPFLYLDGDDYKVFVPAKRTNARGTSWGNGTPEGESLPLDQFYVVKPGATAETINAAVDQGLHLLFTPGVYHVDQPIEIDRANTVALGLGLATIIPDNGVTALKVGDVDGVKVAGLLVDAGPVNSETLVEVGSDGASGDHAANPTSLQDVFVRIGGAGPGKATTSIVVNSNDTIIDHTWVWRADHGEGVGWETNRADYGVHVKGDNVLATGLFVEHFNKYDVQWSGENGKTIFYQNAKAYDAPDQAAIQNGDIKGYAAYKVDDSVTTHEGWGMGSYCYFNVNPDIRQQHGFQAPVKPGVKFHDLLVVSLGGKGQYEHVINDIGDPTSGDTTIPSQVVSFP 561 T 0.0083 Pectate_lyase_3 unphh F Bacteria T 4pf0 1 A,B A,B LAM55_STREK Putative secreted protein SFGSDVRPAAAQEVVGGGDLGPNVLVFDPSTPDIQGKVDEVFRKQESNQFGTDRYALMFKPGTYNDINAQIGFYTSIAGLGLNPDDTTFNGDVTVDAGWFDGNATQNFWRSAENLALNPVNGTNRWAVSQAAPFRRMHVKGGLNLAPDGYGWASGGYIADSKIDGEVGPYSQQQWYTRDSSVGGWGNGVWNMTFSGVEGAPAQSFPEPPYTTLETTPVSREKPFLYLDGDDYKVFVPAKRTNARGTSWGNGTPEGESLPLDQFYVVKPGATAETINAAVDQGLHLLFTPGVYHVDQPIEIDRANTVALGLGLATIIPDNGVTALKVGDVDGVKVAGLLVDAGPVNSETLVEVGSDGASGDHAANPTSLQDVFVRIGGAGPGKATTSIVVNSNDTIIDHTWVWRADHGEGVGWETNRADYGVHVKGDNVLATGLFVEHFNKYDVQWSGENGKTIFYQNAKAYDAPDQAAIQNGDIKGYAAYKVDDSVTTHEGWGMGSYCYFNVNPDIRQQHGFQAPVKPGVKFHDLLVVSLGGKGQYEHVINDIGDPTSGDTTIPSQVVSFP 561 T 0.0083 Pectate_lyase_3 unphh F Bacteria T 4pg2 3 C D SPIKE_CVMJC CYS-SER-LEU-TRP-ASN-GLY-PRO-HIS-LEU CSLWNGPHL 9 T 3.1 RGM_N pdbhh T Viruses T 4pgc 4 G H Unknown helical fragment XKXXSXXXLXXXT 13 T 2900 zf-H2C2_5 pdbhh F F 4ph8 1 A,B A,B AGGREGATIVE ADHERENCE FIMBRIAE, TYPE I (AAF/I), MAJOR SUBUNIT, AGGA, SHIGA-TOXIN PRODUCING E.COLI ASQHHHHHHVTNDCPVTITTTPPQTVGVSSTTPIGFSAKVTTSDQCIKAGAKVWLWGTGPANKWVLQHAKVAKQKYTLNPSIDGGADFVNQGTDAKIYKKLTSGNKFLNASVSVNPKTQVLIPGEYTMILHAAVDFDNKQGGASQQTTQTIRLTVT 156 T 0.082 DNA_ligase_aden pdbpssm F T 4phz 1 A,B,D D,H,N unknown peptide XXXXXXXXXXXXXXXXXXXXXXXX 24 F F F 4pi0 1 A,B,F D,H,N unknown peptide XXXXXXXXXXXXXXXXXXXXXXXXX 25 F F F 4pi2 1 A,B,E D,H,N unknown peptide XXXXXXXXXXXXXXXXXXXXXXXXX 25 F F F 4pju 2 B B RAD21_HUMAN HHR21,NUCLEAR MATRIX PROTEIN 1,NXP-1,SCC1 HOMOLOG VDPVEPMPTMTDQTTLVPNEEEAFALEPIDITVKETKAKRKRKLIVDSVKELDSKTIRAQLSDYSDIVTTLDLAPPTKKLMMWKETGGVEKLFSLPAQPLWNNRLLKLFTRCLTPLVPEDLRKRRKGGEADNLDEFLKEF 140 T 14 Ashwin pdb F Eukaryota T 4pjw 2 B B RAD21_HUMAN HHR21,NUCLEAR MATRIX PROTEIN 1,NXP-1,SCC1 HOMOLOG VDPVEPMPTMTDQTTLVPNEEEAFALEPIDITVKETKAKRKRKLIVDSVKELDSKTIRAQLSDYSDIVTTLDLAPPTKKLMMWKETGGVEKLFSLPAQPLWNNRLLKLFTRCLTPLVPEDLRKRRKGGEADNLDEFLKEF 140 T 14 Ashwin pdb F Eukaryota T 4pjz 1 A B TEICOPLANIN-A2-2 XXXXXXX 7 T 1.5 Defensin_2 pdbhh F F 4pk0 1 A B TEICOPLANIN-A2-2 XXXXXXX 7 T 1.5 Defensin_2 pdbhh F F 4pk2 2 B B ACETYL-SER-(N-PROPANOYL-LYS)-ASP--THR-NH2 PEPTIDE XSDXTX 6 T 1200 PHP_C pdbhh F F 4pk3 2 B B ACETYL-SER-ASP-(N-ACETYL-LYS)-THR-NH2 PEPTIDE XSDXTX 6 T 420 DUF5780 pdbhh F F 4pk7 2 B B RAD21_HUMAN HHR21,NUCLEAR MATRIX PROTEIN 1,NXP-1,SCC1 HOMOLOG GPLGSGRPVDPVEPMPTMTDQTTLVPNEEEAFALEPIDITVKETKAKRKRKLIVDSVKELDSKTIRAQLSDYSDIVTTLDLAPPTKKLMMWKETGGVEKLFSLPAQPLWNNRLLKLFTRCLTPLVPEDLRKRRKGGEADNLDEFLKEF 148 T 23 CheC pdbhh F Eukaryota T 4pn8 1 A,B,C,D,E,F,G,H,I,J A,B,C,D,E,F,G,H,I,J CC-Pent XGKIEQILQKIEKILQKIEWILQKIEQILQG 31 T 0.034 DUF4298 pdb F T 4pn9 1 A,B,C,D,E,F A,B,C,D,E,F CC-Hex2 XGEIAKSLKEIAKSLKEIAWSLKEIAKSLKG 31 T 0.032 MCPsignal pdbpssm F T 4pnd 1 A,B,C,D,E A,B,C,D,E CC-Pent_Variant XGNILQKIENILKKIENILWKIENILQKIEG 31 T 1.9 Fer4_24 pdbhh F T 4po2 2 C,D C,D HEAT SHOCK 70 KDA PROTEIN 1/2, HSP70-1/HSP70-2, HSP70.1/HSP70.2 NRLLLTG 7 T 10 PgaPase_1 pdbhh F F 4pqz 1 A A SWT1_YEAST SYNTHETICALLY LETHAL WITH TREX PROTEIN 1 MGSSHHHHHHSSGENLYFQGSYAHIPGIETPPLQFDKVSQNVFEQVKETIFFAIDHTLRKEYGEDIGFIDYNPDKLTTIENASNYIYLFWVSVFSELFTCSKIKKNEWKSLPTVLKSKPTNLNDLRTFEQFWETVLHFLFSKFTNEEKQSLEKQIHEWKTSINAIST 167 T 0.036 Borrelia_REV unp F Eukaryota T 4pr5 3 C C EBNA1_EBVG EBNA-1, EBV NUCLEAR ANTIGEN 1 HPVGDADYFEY 11 T 9.6 Sel_put pdbhh T Viruses T 4pra 3 C C EBNA1_EBVB9 EBNA-1, EBV NUCLEAR ANTIGEN 1 HPVGQADYFEY 11 T 4.1 DUF2620 pdbhh T Viruses T 4prb 3 C C EBNA1_EBVA8 EBNA-1, EBV NUCLEAR ANTIGEN 1 HPVAEADYFEY 11 T 13 Fip1 pdbhh T Viruses T 4prd 3 C C EBNA1_EBVG EBNA-1, EBV NUCLEAR ANTIGEN 1 HPVGDADYFEY 11 T 9.6 Sel_put pdbhh T Viruses T 4pre 3 C C EBNA1_EBVB9 EBNA-1, EBV NUCLEAR ANTIGEN 1 HPVGQADYFEY 11 T 4.1 DUF2620 pdbhh T Viruses T 4prh 3 C C EBNA1_EBVG EBNA-1, EBV NUCLEAR ANTIGEN 1 HPVGDADYFEY 11 T 9.6 Sel_put pdbhh T Viruses T 4pri 3 C C EBNA1_EBVB9 EBNA-1, EBV NUCLEAR ANTIGEN 1 HPVGEADYFEY 11 T 6.4 Sel_put pdbhh T Viruses T 4prn 3 C C EBNA1_EBVA8 EBNA-1, EBV NUCLEAR ANTIGEN 1 HPVAEADYFEY 11 T 13 Fip1 pdbhh T Viruses T 4prp 3 C C EBNA1_EBVB9 EBNA-1, EBV NUCLEAR ANTIGEN 1 HPVGQADYFEY 11 T 4.1 DUF2620 pdbhh T Viruses T 4pry 2 B B Ac-LETD-CHO XLETX 5 T 460 RNA_pol_L pdbhh F F 4prz 2 B B (ACE)LET(1U8) PEPTIDE XLETX 5 T 1200 Peptidase_S68 pdbhh F F 4ps0 2 C,D C,D (BAL)LQ(HYP)(1U8) PEPTIDE XLQPX 5 T 540 DNTTIP1_dimer pdbhh F F 4ps1 2 E,F,G,H E,F,G,H (BAL)LQ(HYP)(1U8) PEPTIDE XLQPX 5 T 540 DNTTIP1_dimer pdbhh F F 4psi 2 C,D D,E TELO2_HUMAN Telomere length regulation protein TEL2 homolog ALDSDDEFVPY 11 T 6.1 Glyco_transf_21 pdbhh F Eukaryota T 4pv8 3 E,F E,F S598 peptide modified Q600F RXFIFANI 8 T 2.8 TBP-binding pdbhh F T 4pv9 3 E,F E,F S598 peptide modified Q600V RXVIFANI 8 T 2.7 DUF3099 pdbhh F T 4pvz 2 C,D C,D HEH2_YEAST HELIX-EXTENSION-HELIX DOMAIN-CONTAINING PROTEIN 2 GPLGSTNKRKREQISTDNEAKMQIQEEKSPKKKRKKRSSKANK 43 T 20 CMS1 pdbhh F Eukaryota T 4pw1 1 A,B A,B A7VV57_9FIRM Uncharacterized protein GYKGTIEEREQPQNFNLLYLNSGEELNLYPWNLYTGQEQELFEEEIVSFAANSVRILGGGSWTDEELYPLIKFRYSGQDLRFLKDMALTEKDGRRYLVNMALDPNGLCYFSYVNQDEREATADEMDQALGKLQEDWEKFLSDPLPADSEVDLYEEKPSGSYQLDDGELKTDNAFYMFFMRCQMLSDQMRKEQYSDYIGDNLYTIWELVLKSEFTSLSYDNHIYAMYSNDGGTSMVLIYSPIEERFVGFSLKY 252 T 0.091 Glyco_hydro_43 pdbpssm F Bacteria T 4pyw 2 B,C B,C ACE-THR-THR-ALA-ILE-NH2 XTTAIX 6 T 1000 zf-C2H2_4 pdbhh F F 4pz3 2 C C Undefined peptides modeled as AAAV AAAV 4 T 520 Nrf1_DNA-bind pdbhh F F 4pz5 2 B B O50835_BORBU Fibronectin-binding protein BBK32 XSISYTDEIEEEDYDQ 16 T 0.052 Fn_bind unppercent F Bacteria T 4q0p 1 A A Q93UQ5_9GAMM L-Ribose isomerase MRGSHHHHHHGSARTSITRREYDEWVREAAALGKALRYPITEKMVNDSAGIVFGADQYDAFKNGMWSGEPYEAMIIFESLNEPAVDGLPTGAAPYAEYSGLCDKLMIVHPGKFCPPHHHGRKTESYEVVLGEMEVFYSPTPSAESGVELLNFSGMPVGSPWPEGVALPKGRESSYEKLTSYVRLRAGDPKFVMHRKHLHAFRCPPDSDVPLVVREVSTYSHEPTEAAAGNHAPIPSWLGMHDNDFVSDAANTGRLQTAIS 260 T 0.0062 Cupin_2 unppercent F Bacteria T 4q0q 1 A A Q93UQ5_9GAMM L-Ribose isomerase MRGSHHHHHHGSARTSITRREYDEWVREAAALGKALRYPITEKMVNDSAGIVFGADQYDAFKNGMWSGEPYEAMIIFESLNEPAVDGLPTGAAPYAEYSGLCDKLMIVHPGKFCPPHHHGRKTESYEVVLGEMEVFYSPTPSAESGVELLNFSGMPVGSPWPEGVALPKGRESSYEKLTSYVRLRAGDPKFVMHRKHLHAFRCPPDSDVPLVVREVSTYSHEPTEAAAGNHAPIPSWLGMHDNDFVSDAANTGRLQTAIS 260 T 0.0062 Cupin_2 unppercent F Bacteria T 4q0s 1 A A Q93UQ5_9GAMM L-Ribose isomerase MRGSHHHHHHGSARTSITRREYDEWVREAAALGKALRYPITEKMVNDSAGIVFGADQYDAFKNGMWSGEPYEAMIIFESLNEPAVDGLPTGAAPYAEYSGLCDKLMIVHPGKFCPPHHHGRKTESYEVVLGEMEVFYSPTPSAESGVELLNFSGMPVGSPWPEGVALPKGRESSYEKLTSYVRLRAGDPKFVMHRKHLHAFRCPPDSDVPLVVREVSTYSHEPTEAAAGNHAPIPSWLGMHDNDFVSDAANTGRLQTAIS 260 T 0.0062 Cupin_2 unppercent F Bacteria T 4q0u 1 A A Q93UQ5_9GAMM L-Ribose isomerase MRGSHHHHHHGSARTSITRREYDEWVREAAALGKALRYPITEKMVNDSAGIVFGADQYDAFKNGMWSGEPYEAMIIFESLNEPAVDGLPTGAAPYAEYSGLCDKLMIVHPGKFCPPHHHGRKTESYEVVLGEMEVFYSPTPSAESGVELLNFSGMPVGSPWPEGVALPKGRESSYEKLTSYVRLRAGDPKFVMHRKHLHAFRCPPDSDVPLVVRQVSTYSHEPTEAAAGNHAPIPSWLGMHDNDFVSDAANTGRLQTAIS 260 T 0.0062 Cupin_2 unppercent F Bacteria T 4q0v 1 A A Q93UQ5_9GAMM L-Ribose isomerase MRGSHHHHHHGSARTSITRREYDEWVREAAALGKALRYPITEKMVNDSAGIVFGADQYDAFKNGMWSGEPYEAMIIFESLNEPAVDGLPTGAAPYAEYSGLCDKLMIVHPGKFCPPHHHGRKTESYEVVLGEMEVFYSPTPSAESGVELLNFSGMPVGSPWPEGVALPKGRESSYEKLTSYVRLRAGDPKFVMHRKHLHAFRCPPDSDVPLVVRQVSTYSHEPTEAAAGNHAPIPSWLGMHDNDFVSDAANTGRLQTAIS 260 T 0.0062 Cupin_2 unppercent F Bacteria T 4q0y 1 A,B,C A,B,C J7SH17_CLOS1 Uncharacterized protein GRMEISSLSSIDVFKFNSFSKFSNDKIGVIYDEEKLSKFKVIMNSLDTSEGIKKIEVPKDANIESFKYSYHIQPNLKYVEDNNVYDGYFLLYILVGDSEGKSYIIFSGTELSYVLDKNNTNILKEIFLNVKKQQ 134 T 0.31 ABC2_membrane_3 pdb F Bacteria T 4q1l 2 B D GLN-SER-TRP QSW 3 T 43 Globin pdbhh F F 4q2l 1 A A PREDICTED TRANSPORTER, YAJR MFS TRANSPORTER GSHMKEPPYVSSLRIEIPADIAANEALKVRLLETEGVKEVLIAEEEHSAYVKIDSKVTNRFEVEQAIRQA 70 T 0.024 Spore_YhcN_YlaJ pdbhh F T 4q2m 1 A A PREDICTED TRANSPORTER, YAJR MFS TRANSPORTER GSHMKEPPYVSSLRIEIPADIAANEALKVRLLETEGVKEVLIAEEEHSAYVKIDSKVTNRFEVEQAIRQA 70 T 0.024 Spore_YhcN_YlaJ pdbhh F T 4q2s 1 A A YE38_SCHPO UNCHARACTERIZED PROTEIN C20G4.08 GAMGAQGIAESLRRLKEYVKAGSVKECVAEWCNMPSVAGFDVLSEISYDRMLENCSNLLLLTFIYHISLLDSVDDDRLSKRMEYISRICLNIDVNDPKVETVVHPVLTLTREALLRQSEFFSPIFKRRLVVLLRALDGKISEI 143 T 7.9 CLTH pdbhh F Eukaryota T 4q4i 2 B C Amastatin XVVD 4 T 400 Fer4 pdbhh F F 4q5j 2 C,D F,E BKI1_ARATH BRI1 kinase inhibitor 1 STMEELQAAIQAAIAHCKNSY 21 T 0.075 DUF5765 pdbpercent F Eukaryota T 4q5u 2 B C PP2BA_HUMAN CAM-PRP CATALYTIC SUBUNIT, CALMODULIN-DEPENDENT CALCINEURIN A SUBUNIT ALPHA ISOFORM ARKEVIRNKIRAIGKMARVFSVLR 24 T 6.1 DUF2626 pdbhh F Eukaryota T 4q6h 2 B B iCAL36-VQDTRL peptide VQDTRL 6 T 200 STE3 pdbhh F T 4q6s 2 C,D C,D BT-L-iCAL36 peptide XWXFKKANSRLPTSII 16 T 2.1 PMBR pdbhh F T 4q8d 1 A,B A,B macrocyclic beta-sheet peptide incorporating residues amyloid beta 15-23 XQKLVFXAEDXQKLVXED 18 T 0.5 Beta-APP pdbhh F T 4q94 2 C,D C,D rpb1-ctd XSPSYSPTSPSYSPTSPSYSX 21 T 3.1E-05 RNA_pol_Rpb1_R pdbhh F F 4q96 2 C,F C,F RPB1-CTD XSPSYSPTSPSYSPTSPSYSX 21 T 3.1E-05 RNA_pol_Rpb1_R pdbhh F F 4q9i 2 B,C B,C (30F)A(30F)A(30F)A Peptide XAXAXA 6 T 2700 Nrf1_DNA-bind pdbhh F F 4q9j 2 B,C,D C,D,B N-{[5-AMINO-1-(5-O-PHOSPHONO-BETA-D-ARABINOFURANOSYL)-1H-IMIDAZOL-4-YL]CARBONYL}-L-ASPARTIC ACID XVXVXV 6 T 1300 IF2_assoc pdbhh F F 4q9k 2 B B (30F)L(30F)L(30F)L Peptide XLXLXL 6 T 2200 CotJB pdbhh F F 4q9l 2 B,C B,C (30F)F(30F)F(30F)F Peptide XFXFXF 6 T 220 NAD2 pdbhh F F 4qa5 2 C,D C,D tetrapeptide substrate XRHXX 5 T 290 Antimicrobial14 pdbhh F F 4qa6 2 C,D C,D tetrapeptide substrate XRHXX 5 T 290 Antimicrobial14 pdbhh F F 4qa7 2 B B tetrapeptide substrate XRHXX 5 T 290 Antimicrobial14 pdbhh F F 4qan 1 A,B A,B A7B4B4_RUMGV hypothetical protein GKKEESEVLNVTESLQKESEITSFSEEEEAVLYMLSALKKNDLDMALRGCAIDETALQINFVKTAEELPGMQLIDLPAPTSDYSYYFPLTSAEMTKAYIEQFEELSTEIPEIETLEVLEIAEKKEKEREEQLAECLAAQEVSELEIYVKCGEQSYRLGFTAVQYEKNWKIHSLKEGLLYETDIPACVQMEEMREAKKTYVLPNQLTGANYFQAMPISEKTPQRAVEQFIYAIEKGDLTRALAFATTESSQDTSPELLKKQGEYAKELKTMLYGFLGTEDARLYGKSEEQLNKLRGKLNPEYMVYLDLIKVIPIETEENTETVKQYAGLYSYNGKNYLTGYTLCRQEDGWQIQSLSAPALSLESGEVMRLSKEESRKTSEQSVLKAEKNER 390 T 0.0056 DUF4864 pdbhh F Bacteria T 4qbm 2 C,D C,D histone H4 peptide with sequence Gly-Ala-Lys(ac)-Arg-His-Arg-Lys(ac)-Val-Leu GAXRHRXVL 9 T 29 GIY_YIG_domain pdbhh F T 4qby 15 BA,DA,FA,Z 2,3,4,1 MACROPAIN SUBUNIT PRE3, MULTICATALYTIC ENDOPEPTIDASE COMPLEX SUBUNIT PRE3, PROTEASOME COMPONENT PRE3, PROTEINASE YSCE SUBUNIT PRE3 XAAX 4 T 3000 LisH pdbhh F F 4qc3 2 C C diacetylated histone 4 peptide (H4K8acK12ac) GXGLGXG 7 T 11 NAT pdbhh F F 4qh7 2 C,D,G,H C,D,G,H Q9XZ31_DROME Anastral spindle 2 NYTICAGTQTDP 12 T 0.013 Macoilin unppssm F Eukaryota T 4qh8 2 C C Q9XZ31_DROME Anastral spindle 2 NYSSTTGTQCDIA 13 T 0.088 SKA2 unp F Eukaryota T 4qh8 3 D D Q9XZ31_DROME Anastral spindle 2 NYSSTTGTQCDI 12 T 0.088 SKA2 unp F Eukaryota T 4qj8 2 E,F E,F GAG_HV1A2 p1-p6 peptide RPGNFLQSRL 10 T 9.9 DUF2851 pdbhh T Viruses T 4qja 2 C P GAG_HV1A2 p1-p6 peptide RPGNFLQSRL 10 T 9.9 DUF2851 pdbhh T Viruses T 4qjr 2 B B PRGC1_HUMAN PGC-1-ALPHA, PPAR-GAMMA COACTIVATOR 1-ALPHA, PPARGC-1-ALPHA, LIGAND EFFECT MODULATOR 6 EEPSLLKKLLLAPA 14 T 7 Cnl2_NKP2 pdbhh F Eukaryota T 4qk4 2 B B PRGC1_HUMAN PGC-1-ALPHA, PPAR-GAMMA COACTIVATOR 1-ALPHA, PPARGC-1-ALPHA, LIGAND EFFECT MODULATOR 6 EEPSLLKKLLLAPA 14 T 7 Cnl2_NKP2 pdbhh F Eukaryota T 4qlb 2 E,F,G,H G,E,H,F GYG1_CAEEL Protein GYG-1, isoform b PSTEERRAAWEAGQPDYLGRDAFVHIQEALNRALNE 36 T 0.62 CysG_dimeriser unp F Eukaryota T 4qli 2 B B SNAI1_HUMAN PROTEIN SNAIL HOMOLOG 1, PROTEIN SNA SHTLPC 6 T 1.5 zf-C2H2_8 unp F Eukaryota T 4qn8 1 A,B,C A,B,C Q5ZRR7_LEGPH VipE GHMPLTQTQRLINTYGASLKNGTISNEELIILLDPNTFTKSEGYVDPNAPVSDSNHSKMDAIKDFVLTIGPTLDSEILHQLTSRMIELSPPGDRNTFMRGSSLEKAFLAFEMAHYPTKAEEHFNSTRVRTEFPGENDIDNLKAVILNPIIAFFQS 155 T 6.1 PSD5 pdbhh F Bacteria T 4qnz 2 B B ACE-PHE-ALA-THR-ALA-0QE XFATAX 6 T 530 Inhibitor_I48 pdbhh F F 4qo0 2 B F ACE-PHE-ALA-THR-ALA-0QE XFATAX 6 T 530 Inhibitor_I48 pdbhh F F 4qo2 2 B B 6-AMINO-2-METHYL-1,7-DIHYDRO-8H-IMIDAZO[4,5-G]QUINAZOLIN-8-ONE XIATAX 6 T 810 RII_binding_1 pdbhh F F 4qpp 2 D,E,F D,E,F POLY-UNK XXXXXXX 7 F F F 4qqi 2 B X RFX7_HUMAN REGULATORY FACTOR X 7, REGULATORY FACTOR X DOMAIN-CONTAINING PROTEIN 2 KAFVHMPTLPNLDFHKT 17 T 0.5 DUF4739 pdbhh F Eukaryota T 4qs4 1 A A Q93I73_ECOLX CofB GSHMEKEADEARRQIVSNALISEIAGIVDFVAEEQITVIEQGIEKEITNPLYEQSSGIPYINRTTNKDLNSTMSTNASEFINWGAGTSTRIFFTRKYCISTGTQGNYEFSKDYIPCEEPAILSNSDLKIDRIDFVATDNTVGSAIERVDFILTFDKSNANESFYFSNYVSSLEKAAEQHSISFKDIYVVERNSSGAAGWRLTTISGKPLTFSGLSKNIGSLDKTKNYGLRLSIDPNLGKFLRADGRVGADKLCWNIDNKMSGPCLAADDSGNNLVLTKGKGAKSNEPGLCWDLNTGTSKLCLTQIEGKDNNDKDASLIKLKDDNGNPATMLANILVEEKSMTDSTKKELRTIPNTIYAAFSNSNASDLVITNPGNYIGNVTSEKGRIELNVQDCPVSPDGNKLHPRLSASIASIVADTKDSNGKYQADFSSLAGNRNSGGQLGYLSGTAIQVNQSGSKWYITATMGVFDPLTNTTYVYLNPKFLSVNITTWCSTEPQT 498 T 0.94 PilX_N unphh F Bacteria T 4qsy 2 B B GAB1_HUMAN GRB2-ASSOCIATED BINDER 1, GROWTH FACTOR RECEPTOR BOUND PROTEIN 2-ASSOCIATED PROTEIN 1 GDKQVEXLDLDLD 13 T 0.94 GHBP pdbhh F Eukaryota T 4qtx 2 B E ACE-ASP-GLU-VAL-ASP-CHLOROMETHYLKETONE INHIBITOR XDEVDX 6 T 200 ResIII pdbhh F F 4qty 2 B F ACE-ASP-GLU-VAL-ASP-CHLOROMETHYLKETONE INHIBITOR XDEVDX 6 T 200 ResIII pdbhh F F 4qu0 2 B E ACE-ASP-GLU-VAL-ASP-CHLOROMETHYLKETONE INHIBITOR XDEVDX 6 T 200 ResIII pdbhh F F 4qu5 2 B B ACE-ASP-GLU-VAL-ASP-CHLOROMETHYLKETONE INHIBITOR XDEVDX 6 T 200 ResIII pdbhh F F 4qu8 2 B F ACE-ASP-GLU-VAL-ASP-CHLOROMETHYLKETONE INHIBITOR XDEVDX 6 T 200 ResIII pdbhh F F 4qu9 2 B E ACE-ASP-GLU-VAL-ASP-CHLOROMETHYLKETONE INHIBITOR XDEVDX 6 T 200 ResIII pdbhh F F 4qua 2 B D ACE-ASP-GLU-VAL-ASP-CHLOROMETHYLKETONE INHIBITOR XDEVDX 6 T 200 ResIII pdbhh F F 4qub 2 B D ACE-ASP-GLU-VAL-ASP-CHLOROMETHYLKETONE INHIBITOR XDEVDX 6 T 200 ResIII pdbhh F F 4qud 2 C,D D,C ACE-ASP-GLU-VAL-ASP-CHLOROMETHYLKETONE INHIBITOR XDEVDX 6 T 200 ResIII pdbhh F F 4que 2 C,D H,J ACE-ASP-GLU-VAL-ASP-CHLOROMETHYLKETONE INHIBITOR XDEVDX 6 T 200 ResIII pdbhh F F 4que 3 E,F D,E SHORT PEPTIDE VDDDM 5 T 100 SiaC pdbhh F F 4qug 2 C,D D,B ACE-ASP-GLU-VAL-ASP-CHLOROMETHYLKETONE INHIBITOR XDEVDX 6 T 200 ResIII pdbhh F F 4quh 2 C D short peptde GVDDD 5 T 160 DUF1659 pdbhh F F 4quh 3 D,E I,J ACE-ASP-GLU-VAL-ASP-CHLOROMETHYLKETONE INHIBITOR XDEVDX 6 T 200 ResIII pdbhh F F 4qui 2 C,D D,C ACE-ASP-GLU-VAL-ASP-CHLOROMETHYLKETONE INHIBITOR XDEVDX 6 T 200 ResIII pdbhh F F 4quj 2 B F ACE-ASP-GLU-VAL-ASP-CHLOROMETHYLKETONE INHIBITOR XDEVDX 6 T 200 ResIII pdbhh F F 4qul 2 C,D G,F ACE-ASP-GLU-VAL-ASP-CHLOROMETHYLKETONE INHIBITOR XDEVDX 6 T 200 ResIII pdbhh F F 4qut 2 B B H4_HUMAN Histone H4 GLGXGGAY 8 T 7.5 CRISPR_assoc pdbhh F Eukaryota F 4quu 2 B B H4_HUMAN Histone H4 RGXGGXGLGXGGAY 14 T 11 Shadoo unppercent F Eukaryota T 4qwn 2 B,D B,D KDM2A_MOUSE F-BOX AND LEUCINE-RICH REPEAT PROTEIN 11, F-BOX/LRR-REPEAT PROTEIN 11, JMJC DOMAIN-CONTAINING HISTONE DEMETHYLATION PROTEIN 1A, [HISTONE-H3]-LYSINE-36 DEMETHYLASE 1A QVHLTHFELEGLRCLVDKLESLPLHKKCVPTGIEDEDALIADVKILLEELASSDPKLALTGVPIVQWP 68 T 0.0015 JHD pdbhh F Eukaryota T 4qx7 2 B,D B,D KDM2A_MOUSE F-BOX AND LEUCINE-RICH REPEAT PROTEIN 11, F-BOX/LRR-REPEAT PROTEIN 11, JMJC DOMAIN-CONTAINING HISTONE DEMETHYLATION PROTEIN 1A, [HISTONE-H3]-LYSINE-36 DEMETHYLASE 1A QVHLTHFELEGLRCLVDKLESLPLHKKCVPTGIEDEDALIADVKILLEELASSDPKLALTGVPIVQWP 68 T 0.0015 JHD pdbhh F Eukaryota T 4qx8 2 B,D B,D KDM2A_MOUSE F-BOX AND LEUCINE-RICH REPEAT PROTEIN 11, F-BOX/LRR-REPEAT PROTEIN 11, JMJC DOMAIN-CONTAINING HISTONE DEMETHYLATION PROTEIN 1A, [HISTONE-H3]-LYSINE-36 DEMETHYLASE 1A QVHLTHFELEGLRCLVDKLESLPLHKKCVPTGIEDEDALIADVKILLEELASSDPKLALTGVPIVQWP 68 T 0.0015 JHD pdbhh F Eukaryota T 4qxb 2 B,D B,D KDM2A_MOUSE F-BOX AND LEUCINE-RICH REPEAT PROTEIN 11, F-BOX/LRR-REPEAT PROTEIN 11, JMJC DOMAIN-CONTAINING HISTONE DEMETHYLATION PROTEIN 1A, [HISTONE-H3]-LYSINE-36 DEMETHYLASE 1A QVHLTHFELEGLRCLVDKLESLPLHKKCVPTGIEDEDALIADVKILLEELASSDPKLALTGVPIVQWP 68 T 0.0015 JHD pdbhh F Eukaryota T 4qxc 2 B,D B,D KDM2A_MOUSE F-BOX AND LEUCINE-RICH REPEAT PROTEIN 11, F-BOX/LRR-REPEAT PROTEIN 11, JMJC DOMAIN-CONTAINING HISTONE DEMETHYLATION PROTEIN 1A, [HISTONE-H3]-LYSINE-36 DEMETHYLASE 1A QVHLTHFELEGLRCLVDKLESLPLHKKCVPTGIEDEDALIADVKILLEELASSDPKLALTGVPIVQWP 68 T 0.0015 JHD pdbhh F Eukaryota T 4qxh 2 B,D B,D KDM2A_MOUSE F-BOX AND LEUCINE-RICH REPEAT PROTEIN 11, F-BOX/LRR-REPEAT PROTEIN 11, JMJC DOMAIN-CONTAINING HISTONE DEMETHYLATION PROTEIN 1A, [HISTONE-H3]-LYSINE-36 DEMETHYLASE 1A QVHLTHFELEGLRCLVDKLESLPLHKKCVPTGIEDEDALIADVKILLEELASSDPKLALTGVPIVQWP 68 T 0.0015 JHD pdbhh F Eukaryota T 4qxt 3 C Q MSA2_PLAFF MEROZOITE SURFACE PROTEIN 2, MSA-2 XNAYNMSIRRSMANEGSNX 19 T 1.1 DUF6494 pdbhh F Eukaryota T 4qy8 3 C Q MSA2_PLAF7 MEROZOITE SURFACE PROTEIN 2, MSA-2, 45 KDA MEROZOITE SURFACE ANTIGEN XNAYNMSIRRSMAESKPSX 19 T 0.87 DUF6494 pdbhh F Eukaryota T 4qyd 2 B B H4_HUMAN Histone H4 GKGGKGLGXGGAKR 14 T 11 Shadoo unppercent F Eukaryota T 4qyo 3 C Q MSA2_PLAF7 MEROZOITE SURFACE PROTEIN 2, MSA-2 XNAYNMSIRRX 11 T 0.68 DUF6494 pdbhh F Eukaryota T 4r0i 2 B B ST14_HUMAN MATRIPTASE, MEMBRANE-TYPE SERINE PROTEASE 1, MT-SP1, PROSTAMIN, SERINE PROTEASE 14, SERINE PROTEASE TADG-15, TUMOR-ASSOCIATED DIFFERENTIALLY-EXPRESSED GENE 15 PROTEIN CGLR 4 T 22 CIAPIN1 pdbhh F Eukaryota F 4r1d 1 A A Q9I3K2_PSEAE Uncharacterized protein MSSEPLEPNQDVIIPRSRDSLGRPVYKAQLTRTDNQSEKVALIRQTAPLPVIFIPGIMGTNLRNKADKSEVWRPPNGLWPMDDLFASIGALWTWAWRGPKARQELLKAEQVEVDDQGTIDVGQSGLSEEAARLRGWGKVMRSAYNPVMGLMERRLDNIVSRRELQAWWNDEALSPPGDQGEEQGKVGPIDEEELLRASRYQFDVWCAGYNWLQSNRQSALDVRDYIENTVLPFYQKECGLDPEQMRRMKVILVTHSMGGLVARALTQLHGYERVLGVVHGVQPATGSSTIYHHMRCGYEGIAQVVLGRNAGEVTAIVANSAGALELAPSAEYREGRPWLFLCDAQGQVLKDIDGKPRAYPQNQDPYEEIYKNTTWYGLVPEQNSQYLDMSDKKEGLRVGPRDNFEDLIDSIANFHGELSAAGYHSETYAHYGADDSRHSWRDLIWKGDPTPLETPGATLNDDENGTYNSWFRRGLPTIVQGPLETGNPLDASGSGGDETVPTDSGQAPALAGVKASFRHGSKGKGQANTKRGYEHQESYNDARAQWAALYGVIKITQLADWHPNDKGGT 569 T 2.8E-09 LCAT pdbhh F Bacteria T 4r1e 2 B B MYOA_PLAF7 PFM-A GSLLRVQAHIRKKMV 15 T 0.11 BORCS8 pdbhh F Eukaryota T 4r1r 2 B,D,F,G D,E,F,P RIR2_ECOLI RIBONUCLEOTIDE REDUCTASE R2 PROTEIN YLVGQIDSEVDTDDLSNFQL 20 T 12 UL11 pdbhh F Bacteria T 4r29 1 A,B,C,D A,B,C,D Q7DBA6_ECO57 CYSTEINE METHYLTRANSFERASE, NLEE MINPVTNTQGVSPINTKYAEHVVKNIYPEIKHDYFNESPNIYDKKYISGITRGVAELKQEEFVNEKARRFSYMKTMYSVCPEAFEPISRNEASTPEGSWLTVISGKRPMGQFSVDSLYNPDLHALCELPDICCKIFPKENNDFLYIVVVYRNDSPLGEQRANRFIELYNIKRDIMQELNYALPELKAVKSEMIIAREMGEIFSYMPGEIDSYMKYINNKLSKIE 224 T 0.27 Arm-DNA-bind_2 pdb F Bacteria T 4r3l 2 B B ALBA1_SULSO N-terminal 6-mer peptide from Alba SSGTPT 6 T 1.8 Cas_Csm6 pdbhh F Archaea F 4r3p 2 B B ERRFI_HUMAN MITOGEN-INDUCIBLE GENE 6 PROTEIN, MIG-6 THYXLLP 7 T 2.3 DUF1435 pdbhh F Eukaryota T 4r3r 2 B B ERRFI_HUMAN peptide from ERBB receptor feedback inhibitor 1' THXXLLP 7 T 2.3 DUF1435 pdbhh F Eukaryota F 4r3s 3 C Q MSA2_PLAF7 Merozoite surface protein XFINNAYNMSIRRSX 15 T 2 DUF6494 pdbhh F Eukaryota T 4r4k 1 A,B,C,D A,B,C,D A5ZF42_9BACE Uncharacterized protein GKNEIAQSGEDFKSFLDKFTSSAAFQYTRIKFPLKTPITLLADDGETEKTFPFTKEKWPLLDSETMKEERIEQEEGGIYVSKFTLNEPVHKVFEAGYEESEIDLRVEFEQAADGKWYVVDCYTGWYGYDLPIGELKQTIQQVKEENAAFKEIHP 154 T 0.00025 DUF4348 unppssm F Bacteria T 4r5i 2 B B HSP70/DnaK Substrate Peptide: NRLLLTG NRLLLTG 7 T 10 PgaPase_1 pdbhh F F 4r6n 2 B,D,F,H B,D,F,H LECB3_ARTIN JACALIN BETA-3 CHAIN EQSGISQTVIVGPWGAKVS 19 T 2.9 DUF3842 pdbhh F Eukaryota T 4r6o 2 B,D,F,H B,D,F,H LECB3_ARTIN JACALIN BETA-3 CHAIN EQSGISQTVIVGPWGAKVS 19 T 2.9 DUF3842 pdbhh F Eukaryota T 4r6p 2 B,D,F,H B,D,F,H LECB3_ARTIN JACALIN BETA-3 CHAIN EQSGISQTVIVGPWGAKVS 19 T 2.9 DUF3842 pdbhh F Eukaryota T 4r6q 2 B,D,F,H B,D,F,H LECB3_ARTIN JACALIN BETA-3 CHAIN EQSGISQTVIVGPWGAKVS 19 T 2.9 DUF3842 pdbhh F Eukaryota T 4r6r 2 B,D,F,H B,D,F,H LECB3_ARTIN JACALIN BETA-3 CHAIN EQSGISQTVIVGPWGAKVS 19 T 2.9 DUF3842 pdbhh F Eukaryota T 4r7a 1 A A PHF6_HUMAN PHD-LIKE ZINC FINGER PROTEIN KSKKKSRKGRPRKTN 15 T 0.29 AT_hook pdbhh F Eukaryota T 4r8m 2 C,D C,D BHJH-TM1 peptide PKKTG 5 T 140 DUF3924 pdbhh F F 4rav 3 E,F E,F HD_HUMAN HUNTINGTON DISEASE PROTEIN, HD PROTEIN MATLEKLMKAFESLKSF 17 T 2 Mito_fiss_reg unphh F Eukaryota T 4rcp 2 B B PL-2 XXSTX 5 T 600 zf-H2C2_5 pdbhh F F 4rfn 4 D,H M,D T-CELL SURFACE GLYCOPROTEIN CD4 mimetic M48 XNLHFCQLRCKSLGLLGRCAXTFCACVX 28 T 0.0091 Toxin_38 pdbpssm F T 4rh5 2 B B EPS15_HUMAN PROTEIN EPS15, PROTEIN AF-1P FSAXPSEED 9 T 1.5 RAP80_UIM unphh F Eukaryota T 4rh9 2 B B EPS15_HUMAN PROTEIN EPS15, PROTEIN AF-1P FSAXPSEED 9 T 1.5 RAP80_UIM unphh F Eukaryota T 4rhg 2 B B EPS15_HUMAN PROTEIN EPS15, PROTEIN AF-1P FSAXPSEED 9 T 1.5 RAP80_UIM unphh F Eukaryota T 4rhz 2 B B Q9KKG7_BACTU Cry37AA1 MTVYNATFTINFYNEGEWGGPEPYGYIKAYLTNPDHDFEIWKQDDWGKSTPERSTYTQTIKISSDTGSPINQMCFYGDVKEYDVGNADDILAYPSQKVCSTPGVTVRLDGDEKGSYVTIKYSLTPA 126 T 3 DUF4091 pdbpssm F Bacteria T 4riq 2 C,F,I,L,O,R,U,W C,F,I,L,O,R,U,X ASH2L_HUMAN ASH2-LIKE PROTEIN GAMGSVEHTLADVLYHVETEVENLYFQ 27 T 3.3 PRA-PH pdbhh F Eukaryota T 4rjf 2 B,D,F B,D,F CDN1A_HUMAN CDK-INTERACTING PROTEIN 1, MELANOMA DIFFERENTIATION-ASSOCIATED PROTEIN 6, MDA-6, P21 GRKRRQTSMTDFFHSKRRLIFS 22 T 0.88 CDC27 pdbhh F Eukaryota T 4rmh 2 B B Ac-Lys-H3 peptide TGGXAPR 7 T 4.5 Importin_rep_3 pdbhh F T 4rmi 2 B B Ac-Lys-OTC peptide EXR 3 T 290 MF_alpha_N pdbhh F F 4ro3 1 A,B A,B M1RHE3_VIBCL Hypothetical Protein SNAMSKFYQINTTLLESNEAVNKQTGEVVPLSPETKLVYAYMLNQYRMYRKYGNRRYTESWDKIFTVCCDVAAQKQKRLAKELTTLGLIEVIGNKNAYKVVHSVESIIETWEFTNSKLNT 120 T 2.4E-05 RepA_N pdbhh F Bacteria T 4rof 2 C,D C,D TXNIP_HUMAN THIOREDOXIN-BINDING PROTEIN 2, VITAMIN D3 UP-REGULATED PROTEIN 1,TXNIP PEPTIDE XTPEAPPCYMDVIX 14 T 14 MYEOV2 pdbhh F Eukaryota T 4roj 2 D,E,F D,E,F TXNIP_HUMAN THIOREDOXIN-BINDING PROTEIN 2, VITAMIN D3 UP-REGULATED PROTEIN 1 XTPEAPPCXMDVIX 14 T 14 MYEOV2 pdbhh F Eukaryota T 4rqz 2 D,E,F D,E,F activating peptide DNRLGLVYQF 10 T 1.2 POLO_box pdbhh F T 4rsp 2 B B Peptide inhibitor XSVLX 5 T 1200 Pox_EPC_I2-L1 pdbhh F F 4rt4 2 E E BRE2_YEAST BREFELDIN-A SENSITIVITY PROTEIN 2, COMPLEX PROTEINS ASSOCIATED WITH SET1 PROTEIN BRE2, SET1C COMPONENT BRE2 NTLDTLYKEQIAEDIVWDIIDELEQIALQQ 30 T 0.11 Ectoine_synth unp F Eukaryota T 4rtv 2 B B APP12 peptide XAPPLPPRNRPRL 13 T 0.42 SCIMP pdbhh F T 4rtw 2 B,D,E B,D,E APP12 peptide XAPPLPPRNRPRL 13 T 0.42 SCIMP pdbhh F T 4rty 2 B B APP12 peptide XAPPLPPRNRPRL 13 T 0.42 SCIMP pdbhh F T 4rtz 2 B B VSL12 peptide VSLARRPLPPLP 12 T 0.73 DUF4522 pdbhh F T 4ru2 2 B,D,F,H,J,L,N,P,R B,D,F,H,J,L,N,P,R U2AF2_MOUSE U2 AUXILIARY FACTOR 65 KDA SUBUNIT, U2 SNRNP AUXILIARY FACTOR LARGE SUBUNIT GKKKVRKYWDVPPPGFEHITPMQYKAMQA 29 T 0.0013 Transformer unp F Eukaryota T 4rud 1 A,B A,B U3EPL2_MICFL Three-finger toxin 3b LKCYSSRTETMTCPEGEDKCEKYAVGLMHGSFFFIYTCTSKCHEGAYNVCCSTDLCNK 58 T 0.0003 Toxin_TOLIP unppercent F Eukaryota T 4rwa 2 C,D G,H bifunctional peptide XXFFX 5 T 110 TRIQK pdbhh F F 4rwd 2 C,D G,H bifunctional peptide XXFFX 5 T 110 TRIQK pdbhh F F 4rwg 2 D,E,F D,E,F CGRP analog FVPTDVGPFAFX 12 T 1.2 Carcinustatin pdbhh F T 4rxh 2 B,C A,C LT_SV40 LT, LT-AG PPKKKRKV 8 T 0.28 FAM60A unppercent T Viruses F 4rxv 1 A A Q5ZWY9_LEGPH hypothetical protein lpg0944 GMAIAPQQIQERLKQEQYQKFVVADIGNFPHCLAQTPEGIASGQRYQKYSTNSLSRTPPFSQWGAPQLLTPKSAQEYIKFAQQRNKKSSFKIDGEAVRVSECSNFAYHSAGVLLDDPQIRTQYDVAVIGSMHSNGRYLHNITLLVPKGSRLPQPPEQLTAEVFPIGTLIVDPWAVGMGHPPEQALAIPKEQFAYNRSLFPATVNYQSALDESLTSTRTGQLTPYTGTPS 229 T 0.28 Ail_Lom pdbpssm F Bacteria T 4rxx 1 A A UBP38_HUMAN DEUBIQUITINATING ENZYME 38, HP43.8KD, UBIQUITIN THIOESTERASE 38, UBIQUITIN-SPECIFIC-PROCESSING PROTEASE 38 MDKILEGLVSSSHPLPLKRVIVRKVVESAEHWLDEAQCEAMFDLTTRLILEGQDPFQRQVGHQVLEAYARYHRPEFESFFNKTFVLGLLHQGYHSLDRKDVAILDYIHNGLKLIMSCPSVLDLFSLLQVEVLRMVCERPEPQLCARLSDLLTDFVQCIPKGKLSITFCQQLVRTIGHFQCVSTQERELREYVSQVTKVSNLLQNIWKAEPATLLPSLQEVFASISSTDASFEPSVALASLVQHIPLQMITVLIRSLTTDPNVKDASMTQALCRMIDWLSWPLAQHVDTWVIALLKGLAAVQKFTILIDVTLLKIELVFNRLWFPLVRPGALAVLSHMLLSFQHSPEAFHLIVPHVVNLVHSFKNDGLPSSTAFLVQLTELIHCMMYHYSGFPDLYEPILEAIKDFPKPSEEKIKLILNQSAWTSHHHHHH 430 T 0.073 DUF2228 pdb F Eukaryota T 4rxz 2 B,D C,D 12-MER PEPTIDE INHIBITOR TSFAEYWNLLSP 12 T 0.051 P53_TAD pdbhh F T 4ryd 2 G,H,I,J,K,L H,I,J,K,L,N para-guanidinomethyl-phenylacetyl-Arg-(3-methylvaline)-Arg-(amidomethyl)benzamidine XRXRX 5 T 450 Consortin_C pdbhh F F 4s0g 2 B B EPS15_HUMAN PROTEIN EPS15, PROTEIN AF-1P FSAXVSEED 9 T 1.5 RAP80_UIM unphh F Eukaryota T 4s0r 2 AA,BA,O,P,Q,R,S,T,U,V,W,X,Y,Z 1,2,O,P,Q,R,S,T,U,V,W,X,Y,Z TNRA_BACSU TnrA peptide KMLEGQNAHFRYKNR 15 T 0.00015 MerR-DNA-bind unppssm F Bacteria T 4s2t 2 C,D A,B apstatin XPPAX 5 T 680 SEC-C pdbhh F F 4s3h 1 A,B,C,D A,B,C,D MDB1_SCHPO Mdb1 MGSSHHHHHHSSGLEVLFQGPHMEIQFGNQRCRMVNSGGFLATDGSHLKEMETDDVLVEFLNIEHQLFIRNIRAIVKIADTTVLPSASDKKLLYYVFDETRVRINDTPVIFSKLEEDNANVNEGSK 126 T 8.1 ATG19 pdbhh F Eukaryota T 4sga 2 B P TETRAPEPTIDE ACE-PRO-ALA-PRO-PHE XPAPF 5 T 120 DUF2316 pdbhh F F 4thn 3 C I HIRUNORM IV XRXTDXGXPESHXGGDYEEIPXXYXX 26 T 0.16 Hirudin pdbhh F T 4tjx 2 B B Aleurain peptide ADSNPIRPVT 10 T 21 DUF6446 pdbhh F T 4tk1 2 C,D C,D GBRA3_RAT GABA(A) RECEPTOR SUBUNIT ALPHA-3 FNIVGTTYPIN 11 T 0.98 TraW_N pdbhh F Eukaryota T 4tk2 2 C,D C,D GBRA3_RAT GABA(A) RECEPTOR SUBUNIT ALPHA-3 FNIVGTTYPIN 11 T 0.98 TraW_N pdbhh F Eukaryota T 4tk3 2 C,D C,D GBRA3_RAT GABA(A) RECEPTOR SUBUNIT ALPHA-3 FSIVGTLYPIN 11 T 1.4 SH3_7 pdbhh F Eukaryota T 4tk4 2 C,D C,D GBRA3_RAT GABA(A) RECEPTOR SUBUNIT ALPHA-3 FSIVGTLYPIN 11 T 1.4 SH3_7 pdbhh F Eukaryota T 4tky 2 E,F,G,H F,E,G,H PRO-PHE-ALA-THR-CYS-ASP-SER PFATCDS 7 T 0.54 Hexapep_loop pdbhh F T 4tn7 2 B,D B,D KDM2A_MOUSE F-BOX AND LEUCINE-RICH REPEAT PROTEIN 11,F-BOX/LRR-REPEAT PROTEIN 11,JMJC DOMAIN-CONTAINING HISTONE DEMETHYLATION PROTEIN 1A,[HISTONE-H3]-LYSINE-36 DEMETHYLASE 1A QVHLTHFELEGLRCLVDKLESLPLHKKCVPTGIEDEDALIADVKILLEELASSDPKLALTGVPIVQWP 68 T 0.0015 JHD pdbhh F Eukaryota T 4tnh 19 MA,S G,Y Photosystem II reaction center protein Y XXXXXXXXXXXXXXXXXXXXXXXXXXXX 28 F F F 4tni 19 MA,S G,Y Photosystem II reaction center protein Y XXXXXXXXXXXXXXXXXXXXXXXXXXXX 28 F F F 4tnj 19 MA,S G,Y Photosystem II reaction center protein Y XXXXXXXXXXXXXXXXXXXXXXXXXXXX 28 F F F 4tnk 19 MA,S G,Y Photosystem II reaction center protein Y XXXXXXXXXXXXXXXXXXXXXXXXXXXX 28 F F F 4tot 2 E,F,G,H E,F,G,H nonimmunosuppressive inhibitor XXXXXXXXVXA 11 T 31 RNase_HII pdbhh F F 4tpg 2 C E Ala-L-3-Br-Tyr-Ala AXA 3 T 450 Flavi_M pdbhh F F 4tpj 2 C,D C,E ALA-ALA-ALA AAA 3 T 1200 RNase_HII pdbhh F F 4tq1 2 B B TCPR1_HUMAN Tectonin beta-propeller repeat-containing protein 1 MAQTAAWRKQIFQQLTERTKRELENFRHYEQAVEQSVWV 39 T 0.031 Unpaired pdbpssm F Eukaryota T 4tqe 3 C A TAU_HUMAN NEUROFIBRILLARY TANGLE PROTEIN,PAIRED HELICAL FILAMENT-TAU,PHF-TAU LPTPPTREPKKVAVVR 16 T 38 DUF3982 pdbhh F Eukaryota T 4tr9 4 G,H,I,J H,I,J,K Q76NM2_PLAF7 ASP-TRP-ASN DWN 3 T 23 Joubert pdbhh F Eukaryota F 4trw 2 D,E,F D,E,F L-alpha-glutamyl-L-isoleucyl-N-[(2R,3S)-1-{[(1S)-1-carboxybutyl]amino}-2-hydroxy-5-methylhexan-3-yl]-3-thiophen-2-yl-L-alaninamide EIXX 4 T 320 Bac_DnaA_C pdbhh F F 4try 2 D,E,F D,E,F SYN-HEA TYPE INHIBITOR EIXX 4 T 320 Bac_DnaA_C pdbhh F F 4trz 2 D,E,F D,E,F 2-thiophenyl HEA-type inhibitor EIXX 4 T 320 Bac_DnaA_C pdbhh F F 4tsz 2 B,BA,D,DA,F,FA,H,J,L,N,P,R,T,V,X,Z Q,4,R,5,S,6,T,U,V,X,Y,Z,0,1,2,3 ACE-GLN-ALC-ASP-LEU-ZCL peptide XQXDLX 6 T 81 Zn_peptidase pdbhh F F 4tt0 1 A,B A,B LTP_HHV11 TEGUMENT PROTEIN VP1-2,TEGUMENT PROTEIN VP1/2, HSV1 UL36 GPLGSAKQQRAEATERVTAGLREVLAARERRAQLEAEGLANLKTLLKVVAVPATVAKTLDQARSAEEIADQVEILVDQTEKARELDVQAVAWLEHAQRTFETHPLSAASGDGPGLLTRQGARLQALFDTRRRVEALRR 138 T 0.25 RNA_pol_Rpb2_4 pdb T Viruses T 4tt1 1 A,B A,B LTP_HHV11 TEGUMENT PROTEIN VP1-2,TEGUMENT PROTEIN VP1/2 GPLGSAKQQRAEATERVTAGLREVLAARERRAQLEAEGLANLKTLLKVVAVPATVAKTLDQARSAEEIADQVEILVDQTEKARELDVQAVAWLEHAQRTFETHPLSAASGDGPGLLTRQGARLQALFDTRRRVEALRR 138 T 0.25 RNA_pol_Rpb2_4 pdb T Viruses T 4tt2 2 B P Histone H4K5Ac SGRGXG 6 T 7.1 CTP-dep_RFKase pdbhh F F 4tt4 2 C P Histone H3(1-21)K4Ac STGGX 5 T 110 DUF829 pdbhh F F 4ttk 1 A A Sunflower Trypsin Inhibitor-1 (SFTI-1) (D-form) GXXXXXXXXXXXXX 14 F F F 4ttm 2 B B D-kalata B1 GXXXXGXXXXGGXXXXXGXXXXXXXXXXX 29 F F F 4ttn 2 B B D-kalata B1 GXXXXGXXXXGGXXXXXGXXXXXXXXXXX 29 F F F 4tto 2 B B D-kalata B1 GXXXXGXXXXGGXXXXXGXXXXXXXXXXX 29 F F F 4tuj 3 C,F E,F peptide1 RCNPNMEPPRCWAAEGD 17 T 0.57 DUF4683 pdbhh F T 4tuk 3 C I peptide2 VCNPLTGALLCSAAEGD 17 T 6.6 DUF1847 pdbhh F T 4tul 3 C I peptide2 VCNPLTGALLCSAAEGD 17 T 6.6 DUF1847 pdbhh F T 4tut 1 A A Prion peptide: GLY-GLY-TYR-MET-LEU-GLY GGYMLG 6 T 27 DUF2023 pdbhh F F 4tvg 2 C C Pepstatin XVVXAX 6 T 1700 FAM60A pdbhh F F 4tvq 2 E E CCM2_HUMAN MALCAVERNIN STIDFLDRAIFDGAST 16 T 0.023 PID_2 unphh F Eukaryota T 4twi 2 B B H4_YEAST Succinylated H4 Peptide (aa8-20) KGLGKGGAXRHRKW 14 T 4.2 Shadoo unppercent F Eukaryota T 4twj 2 B B H4_YEAST Histone H4 peptide KGLGKGGAXRHRKW 14 T 4.2 Shadoo unppercent F Eukaryota T 4twt 2 C,D E,F PEPTIDE M21 ACPPCLWQVLCG 12 T 0.053 Ragweed_pollen pdbhh F T 4txq 2 C,D C,D CHM1B_HUMAN CHMP1.5,CHROMATIN-MODIFYING PROTEIN 1B,CHMP1B,VACUOLAR PROTEIN SORTING-ASSOCIATED PROTEIN 46-2,HVPS46-2 SVGTSVASAEQDELSQRLARLRDQV 25 T 2.5 PHA_synth_III_E pdbhh F Eukaryota T 4txr 1 A B CHM1B_HUMAN CHMP1.5,CHROMATIN-MODIFYING PROTEIN 1B,CHMP1B,VACUOLAR PROTEIN SORTING-ASSOCIATED PROTEIN 46-2,HVPS46-2 SVGTSVASAEQDELSQRLARLRDQV 25 T 2.5 PHA_synth_III_E pdbhh F Eukaryota T 4txy 1 A,B A,B DNCV_VIBCH C-AMP-GMP SYNTHASE,DINUCLEOTIDE CYCLASE DNCV GSMTWNFHQYYTNRNDGLMGKLVLTDEEKNNLKALRKIIRLRTRDVFEEAKGIAKAVKKSALTFEIIQEKVSTTQIKHLSDSEQREVAKLIYEMDDDARDEFLGLTPRFWTQGSFQYDTLNRPFQPGQEMDIDDGTYMPMPIFESEPKIGHSLLILLVDASLKSLVAENHGWKFEAKQTCGRIKIEAEKTHIDVPMYAIPKDEFQKKQIALEANRSFVKGAIFESYVADSITDDSETYELDSENVNLALREGDRKWINSDPKIVEDWFNDSCIRIGKHLRKVCRFMKAWRDAQWDVGGPSSISLMAATVNILDSVAHDASDLGETMKIIAKHLPSEFARGVESPDSTDEKPLFPPSYKHGPREMDIMSKLERLPEILSSAESADSKSEALKKINMAFGNRVTNSELIVLAKAL 413 T 0.03 AAA pdbpssm F Bacteria T 4txz 1 A,B A,B DNCV_VIBCH C-AMP-GMP SYNTHASE,DINUCLEOTIDE CYCLASE DNCV GSMTWNFHQYYTNRNDGLMGKLVLTDEEKNNLKALRKIIRLRTRDVFEEAKGIAKAVKKSALTFEIIQEKVSTTQIKHLSDSEQREVAKLIYEMDDDARDEFLGLTPRFWTQGSFQYDTLNRPFQPGQEMDIDDGTYMPMPIFESEPKIGHSLLILLVDASLKSLVAENHGWKFEAKQTCGRIKIEAEKTHIDVPMYAIPKDEFQKKQIALEANRSFVKGAIFESYVADSITDDSETYELDSENVNLALREGDRKWINSDPKIVEDWFNDSCIRIGKHLRKVCRFMKAWRDAQWDVGGPSSISLMAATVNILDSVAHDASDLGETMKIIAKHLPSEFARGVESPDSTDEKPLFPPSYKHGPREMDIMSKLERLPEILSSAESADSKSEALKKINMAFGNRVTNSELIVLAKAL 413 T 0.03 AAA pdbpssm F Bacteria T 4ty0 1 A,B A,B DNCV_VIBCH C-AMP-GMP SYNTHASE,DINUCLEOTIDE CYCLASE DNCV GSMTWNFHQYYTNRNDGLMGKLVLTDEEKNNLKALRKIIRLRTRDVFEEAKGIAKAVKKSALTFEIIQEKVSTTQIKHLSDSEQREVAKLIYEMDDDARDEFLGLTPRFWTQGSFQYDTLNRPFQPGQEMDIDDGTYMPMPIFESEPKIGHSLLILLVDASLKSLVAENHGWKFEAKQTCGRIKIEAEKTHIDVPMYAIPKDEFQKKQIALEANRSFVKGAIFESYVADSITDDSETYELDSENVNLALREGDRKWINSDPKIVEDWFNDSCIRIGKHLRKVCRFMKAWRDAQWDVGGPSSISLMAATVNILDSVAHDASDLGETMKIIAKHLPSEFARGVESPDSTDEKPLFPPSYKHGPREMDIMSKLERLPEILSSAESADSKSEALKKINMAFGNRVTNSELIVLAKAL 413 T 0.03 AAA pdbpssm F Bacteria T 4tyv 1 A,B A,B LAM55_STREK Putative secreted protein AQEVVGGGDLGPNVLVFDPSTPDIQGKVDEVFRKQESNQFGTDRYALMFKPGTYNDINAQIGFYTSIAGLGLNPDDTTFNGDVTVDAGWFDGNATQNFWRSAENLALNPVNGTNRWAVSQAAPFRRMHVKGGLNLAPDGYGWASGGYIADSKIDGEVGPYSQQQWYTRDSSVGGWGNGVWNMTFSGVEGAPAQSFPEPPYTTLETTPVSREKPFLYLDGDDYKVFVPAKRTNARGTSWGNGTPEGESLPLDQFYVVKPGATAETINAAVDQGLHLLFTPGVYHVDQPIEIDRANTVALGLGLATIIPDNGVTALKVGDVDGVKVAGLLVDAGPVNSETLVEVGSDGASGDHAANPTSLQDVFVRIGGAGPGKATTSIVVNSNDTIIDHTWVWRADHGEGVGWETNRADYGVHVKGDNVLATGLFVEHFNKYDVQWSGENGKTIFYQNAKAYDAPDQAAIQNGDIKGYAAYKVDDSVTTHEGWGMGSYCYFNVNPDIRQQHGFQAPVKPGVKFHDLLVVSLGGKGQYEHVINDIGDPTSGDTTIPSQVVSFP 551 T 0.0083 Pectate_lyase_3 unphh F Bacteria T 4tz1 1 A A LAM55_STREK Putative secreted protein EVVGGGDLGPNVLVFDPSTPDIQGKVDEVFRKQESNQFGTDRYALMFKPGTYNDINAQIGFYTSIAGLGLNPDDTTFNGDVTVDAGWFDGNATQNFWRSAENLALNPVNGTNRWAVSQAAPFRRMHVKGGLNLAPDGYGWASGGYIADSKIDGEVGPYSQQQWYTRDSSVGGWGNGVWNMTFSGVEGAPAQSFPEPPYTTLETTPVSREKPFLYLDGDDYKVFVPAKRTNARGTSWGNGTPEGESLPLDQFYVVKPGATAETINAAVDQGLHLLFTPGVYHVDQPIEIDRANTVALGLGLATIIPDNGVTALKVGDVDGVKVAGLLVDAGPVNSETLVEVGSDGASGDHAANPTSLQDVFVRIGGAGPGKATTSIVVNSNDTIIDHTWVWRADHGEGVGWETNRADYGVHVKGDNVLATGLFVEHFNKYDVQWSGENGKTIFYQNAKAYDAPDQAAIQNGDIKGYAAYKVDDSVTTHEGWGMGSYCYFNVNPDIRQQHGFQAPVKPGVKFHDLLVVSLGGKGQYEHVINDIGDPTSGDTTIPSQVVSFP 549 T 0.0083 Pectate_lyase_3 unphh F Bacteria T 4tz3 1 A A LAM55_STREK Putative secreted protein EVVGGGDLGPNVLVFDPSTPDIQGKVDEVFRKQESNQFGTDRYALMFKPGTYNDINAQIGFYTSIAGLGLNPDDTTFNGDVTVDAGWFDGNATQNFWRSAENLALNPVNGTNRWAVSQAAPFRRMHVKGGLNLAPDGYGWASGGYIADSKIDGEVGPYSQQQWYTRDSSVGGWGNGVWNMTFSGVEGAPAQSFPEPPYTTLETTPVSREKPFLYLDGDDYKVFVPAKRTNARGTSWGNGTPEGESLPLDQFYVVKPGATAETINAAVDQGLHLLFTPGVYHVDQPIEIDRANTVALGLGLATIIPDNGVTALKVGDVDGVKVAGLLVDAGPVNSETLVEVGSDGASGDHAANPTSLQDVFVRIGGAGPGKATTSIVVNSNDTIIDHTWVWRADHGEGVGWETNRADYGVHVKGDNVLATGLFVEHFNKYDVQWSGENGKTIFYQNAKAYDAPDQAAIQNGDIKGYAAYKVDDSVTTHEGWGMGSYCYFNVNPDIRQQHGFQAPVKPGVKFHDLLVVSLGGKGQYEHVINDIGDPTSGDTTIPSQVVSFP 549 T 0.0083 Pectate_lyase_3 unphh F Bacteria T 4tz5 1 A,B A,B LAM55_STREK Putative secreted protein EVVGGGDLGPNVLVFDPSTPDIQGKVDEVFRKQESNQFGTDRYALMFKPGTYNDINAQIGFYTSIAGLGLNPDDTTFNGDVTVDAGWFDGNATQNFWRSAENLALNPVNGTNRWAVSQAAPFRRMHVKGGLNLAPDGYGWASGGYIADSKIDGEVGPYSQQQWYTRDSSVGGWGNGVWNMTFSGVEGAPAQSFPEPPYTTLETTPVSREKPFLYLDGDDYKVFVPAKRTNARGTSWGNGTPEGESLPLDQFYVVKPGATAETINAAVDQGLHLLFTPGVYHVDQPIEIDRANTVALGLGLATIIPDNGVTALKVGDVDGVKVAGLLVDAGPVNSETLVEVGSDGASGDHAANPTSLQDVFVRIGGAGPGKATTSIVVNSNDTIIDHTWVWRADHGEGVGWETNRADYGVHVKGDNVLATGLFVEHFNKYDVQWSGENGKTIFYQNAKAYDAPDQAAIQNGDIKGYAAYKVDDSVTTHEGWGMGSYCYFNVNPDIRQQHGFQAPVKPGVKFHDLLVVSLGGKGQYEHVINDIGDPTSGDTTIPSQVVSFP 549 T 0.0083 Pectate_lyase_3 unphh F Bacteria T 4tzl 2 C,D C,D G5EBG0_CAEEL C. elegans HIM-3 closure motif SNARDSPYGLSQGITKKNKD 20 T 5.1 DUF5699 pdbhh F Eukaryota T 4tzm 2 C,D C,D O01820_CAEEL C. elegans HTP-3 closure motif1 TARYGVSNTSINRKKP 16 T 9.3 DUF4090 pdbhh F Eukaryota T 4tzn 2 C,D C,D O01820_CAEEL Protein HTP-3 AMRYGQSPNMPSRRGN 16 T 6.5 zf-CDGSH pdbhh F Eukaryota T 4tzo 2 B,D,F,H B,D,F,H G5EBG0_CAEEL PROTEIN HIM-3,ISOFORM A SNARDSPYGLSQGITKKNKD 20 T 5.1 DUF5699 pdbhh F Eukaryota T 4tzq 2 B,D B,D O01820_CAEEL Protein HTP-3 STARYGVSNTSINRKKP 17 T 10 DUF4090 pdbhh F Eukaryota T 4tzs 2 C,D C,D G5EBG0_CAEEL PROTEIN HIM-3,ISOFORM A SNARDSPYGLSQGITKKNKD 20 T 5.1 DUF5699 pdbhh F Eukaryota T 4u03 1 A,B A,B DNCV_VIBCH C-AMP-GMP SYNTHASE, DINUCLEOTIDE CYCLASE DNCV MRMTWNFHQYYTNRNDGLMGKLVLTDEEKNNLKALRKIIRLRTRDVFEEAKGIAKAVKKSALTFEIIQEKVSTTQIKHLSDSEQREVAKLIYEMDDDARDEFLGLTPRFWTQGSFQYDTLNRPFQPGQEMDIDDGTYMPMPIFESEPKIGHSLLILLVDASLKSLVAENHGWKFEAKQTCGRIKIEAEKTHIDVPMYAIPKDEFQKKQIALEANRSFVKGAIFESYVADSITDDSETYELDSENVNLALREGDRKWINSDPKIVEDWFNDSCIRIGKHLRKVCRFMKAWRDAQWDVGGPSSISLMAATVNILDSVAHDASDLGETMKIIAKHLPSEFARGVESPDSTDEKPLFPPSYKHGPREMDIMSKLERLPEILSSAESADSKSEALKKINMAFGNRVTNSELIVLAKALPAFAQELEHHHHHH 427 T 0.94 ApoLp-III pdbpercent F Bacteria T 4u0a 2 B B CPSF6_HUMAN CLEAVAGE AND POLYADENYLATION SPECIFICITY FACTOR 68 KDA SUBUNIT,CPSF 68 KDA SUBUNIT,PRE-MRNA CLEAVAGE FACTOR IM 68 KDA SUBUNIT,PROTEIN HPBRII-4/7 PVLFPGQPFGQPPLG 15 T 2.2 MF_alpha pdbhh F Eukaryota T 4u0b 2 M,N,O,P,Q,R,S,T,U,V,W,X M,N,O,P,Q,R,S,T,U,V,W,X CPSF6_HUMAN CLEAVAGE AND POLYADENYLATION SPECIFICITY FACTOR 68 KDA SUBUNIT,CPSF 68 KDA SUBUNIT,PRE-MRNA CLEAVAGE FACTOR IM 68 KDA SUBUNIT,PROTEIN HPBRII-4/7 PVLFPGQPFGQPPLG 15 T 2.2 MF_alpha pdbhh F Eukaryota T 4u0g 3 CA,DA,EA,FA,GA,HA,IA,JA,KA,LA,MA,NA,OA,PA c,i,d,e,f,g,h,j,k,l,m,n,o,p ADEP-2B5Me XXTPXAP 7 T 520 zf-CCHC pdbhh F F 4u0l 1 A,B A,B DNCV_VIBCH C-AMP-GMP SYNTHASE, DINUCLEOTIDE CYCLASE DNCV MRMTWNFHQYYTNRNDGLMGKLVLTDEEKNNLKALRKIIRLRTRDVFEEAKGIAKAVKKSALTFEIIQEKVSTTQIKHLSDSEQREVAKLIYEMDDDARDEFLGLTPRFWTQGSFQYDTLNRPFQPGQEMAIADGTYMPMPIFESEPKIGHSLLILLVDASLKSLVAENHGWKFEAKQTCGRIKIEAEKTHIDVPMYAIPKDEFQKKQIALEANRSFVKGAIFESYVADSITDDSETYELDSENVNLALREGDRKWINSDPKIVEDWFNDSCIRIGKHLRKVCRFMKAWRDAQWDVGGPSSISLMAATVNILDSVAHDASDLGETMKIIAKHLPSEFARGVESPDSTDEKPLFPPSYKHGPREMDIMSKLERLPEILSSAESADSKSEALKKINMAFGNRVTNSELIVLAKALPAFAQE 419 T 0.017 WEMBL pdbpssm F Bacteria T 4u0m 1 A,B A,B DNCV_VIBCH C-AMP-GMP SYNTHASE,DINUCLEOTIDE CYCLASE DNCV MRMTWNFHQYYTNRNDGLMGKLVLTDEEKNNLKALRKIIRLRTRDVFEEAKGIAKAVKKSALTFEIIQEKVSTTQIKHLSDSEQREVAKLIYEMDDDARDEFLGLTPRFWTQGSFQYDTLNRPFQPGQEMDIDDGTYMPMPIFESEPKIGHSLLILLVDASLKSLVAENHGWKFEAKQTCGRIKIEAEKTHINVPMYAIPKDEFQKKQIALEANRSFVKGAIFESYVADSITDDSETYELDSENVNLALREGDRKWINSDPKIVEDWFNDSCIRIGKHLRKVCRFMKAWRDAQWDVGGPSSISLMAATVNILDSVAHDASDLGETMKIIAKHLPSEFARGVESPDSTDEKPLFPPSYKHGPREMDIMSKLERLPEILSSAESADSKSEALKKINMAFGNRVTNSELIVLAKALPAFAQELEHHHHHH 427 T 0.048 WEMBL pdbpssm F Bacteria T 4u0n 1 A,B A,B DNCV_VIBCH C-AMP-GMP SYNTHASE, DINUCLEOTIDE CYCLASE DNCV MRMTWNFHQYYTNRNDGLMGKLVLTDEEKNNLKALRKIIRLRTRDVFEEAKGIAKAVKKSALTFEIIQEKVSTTQIKHLSDSEQREVAKLIYEMDDDARDEFLGLTPRFWTQGSFQYDTLNRPFQPGQEMDIDDGTYMPMPIFESEPKIGHSLLILLVDASLKSLVAENHGWKFEAKQTCGRIKIEAEKTHIDVPMYAIPKDELDSENVNLALREGDRKWINSDPKIVEDWFNDSCIRIGKHLRKVCRFMKAWRDAQWDVGGPSSISLMAATVNILDSVAHDASDLGETMKIIAKHLPSEFARGVESPDSTDEKPLFPPSYKHGPREMDIMSKLERLPEILSSAESADSKSEALKKINMAFGNRVTNSELIVLAKALPAFAQELEHHHHHH 391 T 0.0023 SMODS pdbpercent F Bacteria T 4u1e 2 B B EIF3B_YEAST EIF3B,CELL CYCLE REGULATION AND TRANSLATION INITIATION PROTEIN,EUKARYOTIC TRANSLATION INITIATION FACTOR 3 90 KDA SUBUNIT,EIF3 P90,TRANSLATION INITIATION FACTOR EIF3 P90 SUBUNIT SNAEADTAMRDLILHQRELLKQWTEYREKIGQEMEKSMNFKIFDVQP 47 T 0.026 WWE pdbpssm F Eukaryota T 4u1h 3 C C POL_HV1H2 TL9 PEPTIDE TPQDLNTML 9 T 0.13 Gag_p24 unphh T Viruses T 4u1i 3 C C POL_HV1H2 TL9 PEPTIDE TPQDLNTML 9 T 0.13 Gag_p24 unphh T Viruses T 4u1j 3 C C POL_HV1H2 TL9 TPQDLNTML 9 T 0.13 Gag_p24 unphh T Viruses T 4u1p 2 B B MT_POVHA Middle T antigen EPQXEEIPIYL 11 T 3.2 Imm15 pdbhh T Viruses T 4u1u 54 BB,CD B6,D6 Quinupristin XTXPXXXX 8 T 1500 zf-met pdbhh F F 4u1v 54 BB,CD B6,D6 Linopristin XTXPXXX 7 T 1400 T4-Gluco-transf pdbhh F F 4u26 54 BB,CD B6,D6 Quinupristin XTXPXXXX 8 T 1500 zf-met pdbhh F F 4u27 54 BB,CD B6,D6 Linopristin XTXPXXX 7 T 1400 T4-Gluco-transf pdbhh F F 4u3m 82 XD m2 UNKNOWN PROTEIN m2 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 160 F F F 4u3m 84 EF p1 UNKNOWN PROTEIN p1 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 47 F F F 4u3m 85 FF p2 UNKNOWN PROTEIN p2 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 46 F F F 4u3n 83 XD m2 UNKNOWN PROTEIN m2 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 160 F F F 4u3n 85 EF p1 UNKNOWN PROTEIN p1 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 47 F F F 4u3n 86 FF p2 UNKNOWN PROTEIN p2 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 46 F F F 4u3u 82 XD m2 UNKNOWN PROTEIN m2 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 160 F F F 4u3u 84 EF p1 UNKNOWN PROTEIN p1 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 47 F F F 4u3u 85 FF p2 UNKNOWN PROTEIN p2 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 46 F F F 4u4a 2 D,E,F D,E,F F175A_HUMAN COILED-COIL DOMAIN-CONTAINING PROTEIN 98,PROTEIN FAM175A GFGEYSRSPTF 11 T 0.48 PipA pdbhh F Eukaryota T 4u4c 2 B B AIR2_YEAST;PAP2_YEAST ARGININE METHYLTRANSFERASE-INTERACTING RING FINGER PROTEIN 2,DNA POLYMERASE KAPPA,DNA POLYMERASE SIGMA,TOPOISOMERASE 1-RELATED PROTEIN TRF4 GAASMEKNTAPFVVDTAPTTPPDKLVAPSIEEVNSNPNELRALRGQGRYFGVSDDDKDAIKEAAPKHGDEKDLANNDDFISLSASSEDEQAEQEEEREKQELEIKKEKQKEILNTD 116 T 0.051 RD3 unp F Eukaryota T 4u4n 81 XD m2 Unknown protein m2 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 160 F F F 4u4n 83 EF p1 Unknown protein p1 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 47 F F F 4u4n 84 FF p2 Unknown protein p2 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 46 F F F 4u4o 82 XD m2 unknown protein chain m2 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 160 F F F 4u4o 84 EF p1 unknown protein chain p1 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 47 F F F 4u4o 85 FF p2 unknown protein chain p2 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 46 F F F 4u4q 81 XD m2 Unknown protein chain m2 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 160 F F F 4u4q 83 EF p1 Unknown protein chain p1 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 47 F F F 4u4q 84 FF p2 Unknown protein chain p2 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 46 F F F 4u4r 81 XD m2 Unknown protein m2 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 160 F F F 4u4r 83 EF p1 Unknown protein p1 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 47 F F F 4u4r 84 FF p2 Unknown protein p2 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 46 F F F 4u4u 82 XD m2 unknown protein chain m2 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 160 F F F 4u4u 84 EF p1 unknown protein chain p1 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 47 F F F 4u4u 85 FF p2 unknown protein chain p2 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 46 F F F 4u4y 81 XD m2 unknown protein chain m2 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 160 F F F 4u4y 83 EF p1 unknown protein chain p1 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 47 F F F 4u4y 84 FF p2 unknown protein chain p2 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 46 F F F 4u4z 81 XD m2 Unknown protein m2 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 160 F F F 4u4z 83 EF p1 Unknown protein p1 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 47 F F F 4u4z 84 FF p2 Unknown protein p2 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 46 F F F 4u50 82 DF m2 UNKNOWN PROTEIN m2 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 160 F F F 4u50 83 EF p1 UNKNOWN PROTEIN p1 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 47 F F F 4u50 84 FF p2 UNKNOWN PROTEIN p2 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 46 F F F 4u51 81 XD m2 unknown protein chain m2 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 160 F F F 4u51 83 EF p1 unknown protein chain p1 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 47 F F F 4u51 84 FF p2 unknown protein chain p2 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 46 F F F 4u52 81 XD m2 Unknown Protein m2 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 160 F F F 4u52 83 EF p1 Unknown Protein p1 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 47 F F F 4u52 84 FF p2 Unknown Protein p2 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 46 F F F 4u53 81 XD m2 Unknown Protein m2 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 160 F F F 4u53 83 EF p1 Unknown Protein p1 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 47 F F F 4u53 84 FF p2 Unknown Protein p2 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 46 F F F 4u55 82 DF m2 Unknown protein chain m2 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 160 F F F 4u55 83 EF p1 Unknown protein chain p1 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 47 F F F 4u55 84 FF p2 Unknown protein chain p2 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 46 F F F 4u56 82 DF m2 unknown protein chain m2 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 160 F F F 4u56 83 EF p1 unknown protein chain p1 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 47 F F F 4u56 84 FF p2 unknown protein chain p2 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 46 F F F 4u5b 2 E,F E,F CONII_CONST Con-ikot-ikot GPGSSGPADCCRMKECCTDRVNECLQRYSGREDKFVSFCYQEATVTCGSFNEIVGCCYGYQMCMIRVVKPNSLSGAHEACKTVSCGNPCA 90 T 0.032 DUF902 pdb F Eukaryota T 4u5c 2 E,F E,F CONII_CONST Con-ikot-ikot GPGSSGPADCCRMKECCTDRVNECLQRYSGREDKFVSFCYQEATVTCGSFNEIVGCCYGYQMCMIRVVKPNSLSGAHEACKTVSCGNPCA 90 T 0.032 DUF902 pdb F Eukaryota T 4u5d 2 E,F E,F CONII_CONST Con-ikot-ikot GPGSSGPADCCRMKECCTDRVNECLQRYSGREDKFVSFCYQEATVTCGSFNEIVGCCYGYQMCMIRVVKPNSLSGAHEACKTVSCGNPCA 90 T 0.032 DUF902 pdb F Eukaryota T 4u5e 2 E,F E,F CONII_CONST Con-ikot-ikot GPGSSGPADCCRMKECCTDRVNECLQRYSGREDKFVSFCYQEATVTCGSFNEIVGCCYGYQMCMIRVVKPNSLSGAHEACKTVSCGNPCA 90 T 0.032 DUF902 pdb F Eukaryota T 4u5f 2 E,F E,F CONII_CONST Con-ikot-ikot GPGSSGPADCCRMKECCTDRVNECLQRYSGREDKFVSFCYQEATVTCGSFNEIVGCCYGYQMCMIRVVKPNSLSGAHEACKTVSCGNPCA 90 T 0.032 DUF902 pdb F Eukaryota T 4u5g 1 A,B A,B CONII_CONST Con-ikot-ikot GPGSSGPADCCRMKECCTDRVNECLQRYSGREDKFVSFCYQEATVTCGSFNEIVGCCYGYQMCMIRVVKPNSLSGAHEACKTVSCGNPCA 90 T 0.032 DUF902 pdb F Eukaryota T 4u5h 1 A,B,C,D,E,F,G,H A,B,C,D,E,F,G,H CONII_CONST Con-ikot-ikot GPGSSGPADCCRMKECCTDRVNECLQRYSGREDKFVSFCYQEATVTCGSFNEIVGCCYGYQMCMIRVVKPNSLSGAHEACKTVSCGNPCA 90 T 0.032 DUF902 pdb F Eukaryota T 4u6f 82 DF m2 unknown protein chain m2 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 160 F F F 4u6f 83 EF p1 unknown protein chain p1 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 47 F F F 4u6f 84 FF p2 unknown protein chain p2 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 46 F F F 4u6x 3 C P ALQDA peptide, ALQDAGDSSRKEYFI ALQDAGDSSRKEYFI 15 T 0.34 SOTI pdbhh F T 4u6y 3 C P FLNKD peptide, FLNKDLEVDGHFVTM FLNKDLEVDGHFVTM 15 T 2.1 DUF4603 pdbhh F T 4u7e 2 B A IST1_HUMAN HIST1,PUTATIVE MAPK-ACTIVATING PROTEIN PM28 STSASEDIDFDDLSRRFEELKKKTW 25 T 1.5 INCA1 pdbhh F Eukaryota T 4u7i 2 B B IST1_HUMAN HIST1,PUTATIVE MAPK-ACTIVATING PROTEIN PM28 STSASEDIDFDDLSRRFEELKKKTW 25 T 1.5 INCA1 pdbhh F Eukaryota T 4u7y 2 B B IST1_HUMAN HIST1,PUTATIVE MAPK-ACTIVATING PROTEIN PM28 STSASEDIDFDDLSRRFEELKKKTW 25 T 1.5 INCA1 pdbhh F Eukaryota T 4u90 2 B,C D,E GBRA3_RAT GABA(A) RECEPTOR SUBUNIT ALPHA-3 FNIVGTTYPC 10 T 0.97 DUF749 pdbhh F Eukaryota T 4u91 2 B,C B,E GLYCINE RECEPTOR 58 KDA SUBUNIT FSIVGSLPRDC 11 T 0.35 MucB_RseB_C pdbhh F T 4u9w 2 E,F,G,H E,F,G,H H4_HUMAN Histone H4/H2A N-terminus SGRGKX 6 T 11 Shadoo unppercent F Eukaryota F 4ubf 2 E P KIF2C_HUMAN KINESIN-LIKE PROTEIN 6,MITOTIC CENTROMERE-ASSOCIATED KINESIN,MCAK QLEEQASRQISS 12 T 0.023 Fib_alpha unp F Eukaryota T 4uby 1 A,B,C,D A,B,C,D prion peptide GGYVLG 6 T 28 DUF883_C pdbhh F F 4ubz 1 A,B A,B prion peptide GGYLLG 6 T 2.1 BioY pdbhh F F 4uca 2 C C PHOSP_HRSVA PHOSPHOSPROTEIN EDF 3 T 78 DUF2605 pdbhh T Viruses F 4ucb 2 C,D C,D PHOSP_HRSVA PROTEIN P, PHOSPHOSPROTEIN DLSLEDF 7 T 0.094 DUF4479 unppssm T Viruses F 4ud7 2 E,F,G,H F,G,H,I YS-02 XTSFXEYWXLLPENYX 16 T 0.05 P53_TAD pdbhh F T 4uda 2 B B NCOA1_HUMAN NCOA-1, CLASS E BASIC HELIX-LOOP-HELIX PROTEIN 74, BHLHE74, PROTEIN HIN-2, RIP160, RENAL CARCINOMA ANTIGEN NY-REN-52, STEROID RECEPTOR COACTIVATOR 1, SRC-1, NCOA1 PEPTIDE PQAQQKSLLQQLLTE 15 T 1.8 GFD1 pdbhh F Eukaryota T 4udb 2 B B NCOA1_HUMAN NCOA-1, CLASS E BASIC HELIX-LOOP-HELIX PROTEIN 74, BHLHE74, PROTEIN HIN-2, RIP160, RENAL CARCINOMA ANTIGEN NY-REN-52, STEROID RECEPTOR COACTIVATOR 1, SRC-1, NCOA1 PEPTIDE PQAQQKSLLQQLLTE 15 T 1.8 GFD1 pdbhh F Eukaryota T 4ue0 1 A,B,C A,B,C Q997H2_ADEB4 FIBER GALTTSTRQGSRVVGFMDFIIALGWQIIPSNIRYIYILNCSQFMPTSDVTTIYFQADSGLESIFVMDSPFYASCTQQLPDKTIKTYGVTISKKQSIISINFSSSLEPNIMVSAWTASITRTQ 122 T 4.5 DUF2534 pdbhh T Viruses T 4ue1 2 E,F,G,H F,G,H,I YS-01 XTSFXEYWXLLPENFX 16 T 0.055 P53_TAD pdbhh F T 4ue4 2 B B FTSQ_ECOLI FTSQ SIGNAL SEQUENCE LFLLTVCTTVLVSGWVVLGWME 22 T 0.23 DUF5818 unppssm F Bacteria T 4uea 2 B,D,F B,D,F DESIGNED 4E-BP GPHMLERYSKVDLLALRYSPLSQTPPGIELEGRLRRMNIWRTGS 44 T 0.0054 EIF4E-T pdb F T 4ueb 2 B,D,F B,D,F DESIGNED 4E-BP GPHMLERYSKVDLLALRYSPLSQTPPGIELEGRLRRMNIWRTGS 44 T 0.0054 EIF4E-T pdb F T 4uec 2 B B O61380_DROME EUKARYOTIC TRANSLATION INITIATION FACTOR 4G, ISOFORM C, FI02056P, TRANSLATION INITIATION FACTOR EIF4G GHMLEPETTLNDKQDSTDLKVKVSAKISSIINYNEGQWSPNNPSGKKQYDREQLLQLREVKASRIQPEVKNVSILPQP 78 T 0.00012 eIF_4G1 pdbhh F Eukaryota T 4uhp 1 A,C,E,G A,C,E,G Q51502_PSEAI PYOCIN AP41 LARGE COMPONENT DEPGVATGNGQPVTGNWLAGASQGDGVPIPSQIADQLRGKEFKSWRDFREQFWMAVSKDPSALENLSPSNRYFVSQGLAPYAVPEEHLGSKEKFEIHHVVPLESGGALYNIDNLVIVTPKRHSEIHKELKLKRKEK 136 T 0.0013 HNH pdb F Bacteria T 4ui9 17 S T PEPTIDE AAAAAQLAAAAAAAAAAAAAA 21 T 350 DUF6520 pdbhh F F 4ui9 18 T U FBX5_HUMAN PEPTIDE MSRRPCSCALRPPAAAAAAAAAAA 24 T 2.2 Toxin_14 pdbhh F Eukaryota T 4uis 3 C C GAMMA-SECRETASE XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 196 F F F 4uis 4 D D GAMMA-SECRETASE XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 62 F F F 4uj3 2 B,E,H,K,N,Q,T,W B,E,H,K,N,Q,T,W RAB3I_HUMAN RAB3A-INTERACTING PROTEIN, RABIN-3, SSX2-INTERACTING PROTEIN, RABIN8 GAASNKSTSSAMSGSHQDLSVIQPIVKDCKEADLSLYNEFRLWKDEPTMDRTCPFLDKIYQEDIFPCLTFSKSELASAVLEAVENNTLSIEPVGLQPIRFVKASAVECGGPKKCALTGQSKSCKHRIKLGDSSNYYYISPFCRYRITSVCNFFTYIRYIQQGLVKQQDVDQMFWEVMQLRKEMSLAKLGYFKEEL 195 T 0.11 SHE3 unphh F Eukaryota T 4uj4 2 B,E,H,K B,E,H,K RAB3I_HUMAN RAB3A-INTERACTING PROTEIN,RABIN-3,SSX2-INTERACTING PROTEIN GAASNKSTSSAMSGSHQDLSVIQPIVKDCKEADLSLYNEFRLWKDEPTMDRTCPFLDKIYQEDIFPCLTFSKSELASAVLEAVENNTLSIEPVGLQPIRFVKASAVECGGPKKCALTGQSKSCKHRIKLGDSSNYYYISPFCRYRITSVCNFFTYIRYIQQGLVKQQDVDQMFWEVMQLRKEMSLAKLGYFKEEL 195 T 0.11 SHE3 unphh F Eukaryota T 4uj5 2 C,D C,D RAB3I_HUMAN RAB3A-INTERACTING PROTEIN, RABIN-3, SSX2-INTERACTING PROTEIN, RAB3A-INTERACTING PROTEIN, RABIN-3, SSX2-INTERACTING PROTEIN, RABIN8 GAASNKSTSSAMSGSHQDLSVIQPIVKDCKEADLSLYNEFRLWKDEPTMDRTCPFLDKIYQEDIFPCLTFSKSELASAVLEAVENNTLSIEPVGLQPIRFVKASAVECGGPKKCALTGQSKSCKHRIKLGDSSNYYYISPFCRYRITSVCNFFTYIRYIQQGLVKQQDVDQMFWEVMQLRKEMSLAKLGYFKEEL 195 T 0.11 SHE3 unphh F Eukaryota T 4um9 4 E,F E,F TGFB3_HUMAN LAP XHGRGDLGRLKKX 13 T 14 DUF1843 pdbhh F Eukaryota T 4umi 1 A A SPIKE_ADES1 SPIKE, PROTEIN IV GSSHHHHHHSSGLVPRGSHMASMTGGQQMGRGSLESYPLPPLVWDYSSKSLTLDIGPGLTVVNGKLQVIGATFSNQMSRMAPAPRADLQSNSIEPLPSPPSKTSLDIAEELQNDKGVSFAFQAREEELGAFTKRTLFAYSGDGLTGPFKAPASAELSSFLTAHPKGRWLIAFPLGTGIVSVDEGILTLEISRSLPEVGSGSSFYLTEK 208 T 0.0017 Adeno_shaft unppercent T Viruses T 4umn 2 C,D C,D M06 XTSFXEYWYLLXX 13 T 4.5 P53_TAD pdbhh F T 4uot 1 A,B,C,D,E A,B,C,D,E DESIGNED HELICAL BUNDLE 5H2L XTQEYLLKEIMKLLKEQIKLLKEQIKMLKELEKQ 34 T 0.023 DUF5320 pdbhh F T 4upu 2 B B IP3KA_HUMAN INOSITOL 1\,4\,5-TRISPHOSPHATE 3-KINASE A, IP3 3-KINASE A, IP3K A, INSP 3-KINASE A GEDVGQKNHWQKIRTMVNLPVISPFK 26 T 0.68 SR-25 unppercent F Eukaryota T 4uq2 3 E,F E,G AZOBENZENE-CONTAINING PEPTIDE AIMXYPK 7 T 21 FokI_C pdbhh F T 4uq3 3 E,F E,F AZOBENZENE-CONTAINING PEPTIDE GLSXXL 6 T 540 zf-CCHC pdbhh F F 4uq8 1 A A NADH UBIQUINONE OXIDOREDUCTASE CHAIN 3 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 111 F F F 4uq8 2 B B NADH DEHYDROGENASE [UBIQUINONE] IRON-SULFUR PROTEIN 7, MITOCHONDRIAL XXXXXXXXXXXXXXXXXXXXXXXXXXXCCXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXCXXXXXXXXXXXXXXXXXXXXXXXXXXXXXCXXXXXXXXXXXXXXXXXXXX 143 T 4000 Cas1_AcylT pdbhh F F 4uq8 3 C C NADH DEHYDROGENASE [UBIQUINONE] IRON-SULFUR PROTEIN 3, MITOCHONDRIAL XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 154 F F F 4uq8 4 D D NADH DEHYDROGENASE [UBIQUINONE] IRON-SULFUR PROTEIN 2, MITOCHONDRIAL XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 384 F F F 4uq8 5 E E NADH DEHYDROGENASE [UBIQUINONE] FLAVOPROTEIN 2, MITOCHONDRIAL XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXCXXXXCXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXCXXXCXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 159 T 1200 Radical_SAM_2 pdbhh F F 4uq8 6 F F NADH DEHYDROGENASE [UBIQUINONE] FLAVOPROTEIN 1, MITOCHONDRIAL XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXCXXCXXCXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXCXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 411 T 78 Fer4_2 pdbhh F F 4uq8 7 G G NADH-UBIQUINONE OXIDOREDUCTASE 75 KDA SUBUNIT, MITOCHONDRIAL XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXCXXXXXXXXXXCXXCXXXXXXXXXXXXXCXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXHXXXCXXCXXXXXCXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXCXXCXXCXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXCXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 538 T 140 Fer4_2 pdbhh F F 4uq8 8 H H NADH UBIQUINONE OXIDOREDUCTASE CHAIN 1 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 313 F F F 4uq8 9 I I NADH DEHYDROGENASE [UBIQUINONE] IRON-SULFUR PROTEIN 8, MITOCHONDRIAL XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXCXXCXXCXXXCXXXXXXXXXXXXXXXXXXXXXXXXXXXXCXXCXXCXXXCXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 162 T 0.026 Fer4_8 pdbhh F F 4uq8 10 J J NADH UBIQUINONE OXIDOREDUCTASE CHAIN 6 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 171 F F F 4uq8 11 K K NADH UBIQUINONE OXIDOREDUCTASE CHAIN 4L XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 84 F F F 4uq8 12 L L NADH UBIQUINONE OXIDOREDUCTASE CHAIN 5 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 601 F F F 4uq8 13 M M NADH UBIQUINONE OXIDOREDUCTASE CHAIN 4 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 453 F F F 4uq8 14 N N NADH UBIQUINONE OXIDOREDUCTASE CHAIN 2 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 345 F F F 4uq8 15 O O NADH DEHYDROGENASE [UBIQUINONE] 1 ALPHA SUBCOMPLEX SUBUNIT 10 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 220 F F F 4uq8 16 P P NADH DEHYDROGENASE [UBIQUINONE] 1 ALPHA SUBCOMPLEX SUBUNIT 9, MITOCHONDRIAL XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 303 F F F 4uq8 17 Q Q NADH DEHYDROGENASE [UBIQUINONE] SUBUNIT 4 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 85 F F F 4uq8 18 R R NADH DEHYDROGENASE [UBIQUINONE] SUBUNIT 6 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 47 F F F 4uq8 19 S S NADH DEHYDROGENASE [UBIQUINONE] 1 ALPHA SUBCOMPLEX SUBUNIT 2 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 80 F F F 4uq8 20 T T ACYL CARRIER PROTEIN, MITOCHONDRIAL XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 75 F F F 4uq8 21 U U NADH UBIQUINONE DEHYDROGENASE XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXA 79 T 11000 zinc_ribbon_2 pdbhh F F 4uq8 22 V V NADH DEHYDROGENASE [UBIQUINONE] 1 ALPHA SUBCOMPLEX SUBUNIT; 5 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 71 F F F 4uq8 23 W W NADH DEHYDROGENASE [UBIQUINONE] 1 ALPHA SUBCOMPLEX SUBUNIT; 6 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 72 F F F 4uq8 24 X X NADH DEHYDROGENASE [UBIQUINONE] 1 ALPHA SUBCOMPLEX SUBUNIT 8 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 79 F F F 4uq8 25 Y Y NADH DEHYDROGENASE [UBIQUINONE] 1 ALPHA SUBCOMPLEX SUBUNIT; 11 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 106 F F F 4uq8 26 Z Z NADH DEHYDROGENASE [UBIQUINONE] 1 ALPHA SUBCOMPLEX SUBUNIT 13 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 65 F F F 4uq8 27 AA a NADH UBIQUINONE OXIDOREDUCTASE XXXXXXXXXXXXXXXXXXXXXXXXXXXXX 29 F F F 4uq8 28 BA b NADH UBIQUINONE OXIDOREDUCTASE XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 42 F F F 4uq8 29 CA,WA c,w NADH UBIQUINONE OXIDOREDUCTASE XXXXXXXXXXXXXXXXXXXXXXXXXXX 27 F F F 4uq8 30 DA d NADH UBIQUINONE OXIDOREDUCTASE XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 39 F F F 4uq8 31 EA e NADH UBIQUINONE OXIDOREDUCTASE XXXXXXXXXXXXXXXXXXXX 20 F F F 4uq8 32 FA,HA,IA f,h,i NADH UBIQUINONE OXIDOREDUCTASE XXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 30 F F F 4uq8 33 GA g NADH UBIQUINONE OXIDOREDUCTASE XXXXXXXXXXXXXXXXXXXXXX 22 F F F 4uq8 34 JA,LA j,l NADH UBIQUINONE OXIDOREDUCTASE XXXXXXXXXXXXXXXXXXXXXXXX 24 F F F 4uq8 35 KA,PA,SA k,p,s NADH UBIQUINONE OXIDOREDUCTASE XXXXXXXXXXXXXXXXXXXXXXXXXXXX 28 F F F 4uq8 36 MA m NADH UBIQUINONE OXIDOREDUCTASE XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 34 F F F 4uq8 37 NA n NADH UBIQUINONE OXIDOREDUCTASE XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 59 F F F 4uq8 38 OA o NADH UBIQUINONE OXIDOREDUCTASE XXXXXXXXXXXXXXXXXXXXX 21 F F F 4uq8 39 QA q NADH UBIQUINONE OXIDOREDUCTASE XXXXXXXXXXXXXXXXXXXXXXXXX 25 F F F 4uq8 40 RA r NADH UBIQUINONE OXIDOREDUCTASE XXXXXXXXXXXXXXXXXXXXXXXXXX 26 F F F 4uq8 41 TA t NADH UBIQUINONE OXIDOREDUCTASE XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 57 F F F 4uq8 42 UA u NADH UBIQUINONE OXIDOREDUCTASE XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 43 F F F 4uq8 43 VA v NADH UBIQUINONE OXIDOREDUCTASE XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 32 F F F 4usl 2 B D SORCN_HUMAN 22 KDA PROTEIN, CP-22, CP22, V19 MAYPGHPGAGGGYYPGGYGGAPGGPAFPGQTQ 32 T 220 Antimicrobial_5 pdbhh F Eukaryota T 4utn 2 C D SUCCINYL-CPS1-PEPTIDE XGVLXEYGV 9 T 21 DUF3744 pdbhh F T 4utr 2 C C CPSM_HUMAN 3-NITRO-PROPIONYL-CPS1 PEPTIDE XGVLKEYGV 9 F F Eukaryota T 4utv 2 C C CPSM_HUMAN 3-PHENYL-SUCCINYL-CPS1 PEPTIDE XGVLKEYGV 9 F F Eukaryota T 4utx 2 C C CPSM_HUMAN 3-NITRO-PROPIONYL-CPS1 PEPTIDE XGVLKEYGV 9 F F Eukaryota T 4utz 2 C D CPSM_HUMAN ADIPOYL-CPS1 PEPTIDE XGVLKEYGV 9 F F Eukaryota T 4uu5 2 B B CRUM1_HUMAN PROTEIN CRUMBS HOMOLOG 1 RVEMWNLMPPPAMERLI 17 T 3 DUF1180 unphh F Eukaryota T 4uu7 2 C D CPSM_HUMAN CARBAMOYLPHOSPHATE SYNTHETASE I XGVLKEYGV 9 F F Eukaryota T 4uu8 2 C D CPSM_HUMAN 3,3-DIMETHYL-SUCCINYL-CPS1 PEPTIDE XGVLKEYGV 9 F F Eukaryota T 4uua 2 C D CPSM_HUMAN 3S-Z-AMINO-SUCCINYL-CPS1 PEPTIDE XGVLKEYGV 9 F F Eukaryota T 4uub 2 C D CPSM_HUMAN 2R-BUTYL-SUCCINYL-CPS1 PEPTIDE XGVLKEYGV 9 F F Eukaryota T 4uwx 2 C,D C,D LIPA3_MOUSE PROTEIN TYROSINE PHOSPHATASE RECEPTOR TYPE F POLYPEPTIDE-IN TERACTING PROTEIN ALPHA-3, PTPRF-INTERACTING PROTEIN ALPHA-3, LIPR IN-ALPHA3 TPRSARLERMAQALALQAGSP 21 T 6.5 WSN pdbhh F Eukaryota T 4ux6 1 A A NOS2_MOUSE INDUCIBLE NO SYNTHASE, INDUCIBLE NOS, INOS, MACROPHAGE NOS, MAC-NOS, NOS TYPE II, PEPTIDYL-CYSTEINE S-NITROSYLASE NOS2, INDUCIBLE NITRIC OXIDE SYNTHASE QYVRIKNWGSGEILHDTLHHKATS 24 T 5.8 EFP_N pdbhh F Eukaryota T 4ux9 2 E,F,G,H F,G,H,I MP2K7_HUMAN MAP KINASE KINASE 7, MAPKK 7, JNK-ACTIVATING KINASE 2, MAPK/ERK KINASE 7, MEK 7, STRESS-ACTIVATED PROTEIN KINASE KINASE 4, SAPK KINASE 4, SAPKK-4, SAPKK4, C-JUN N-TERMINAL KINASE KINASE 2, JNK KINASE 2, JNKK 2, MKK7 QRPRPTLQLPLA 12 T 23 Sec-ASP3 pdbhh F Eukaryota T 4uxe 1 A,B,C A,B,C FIBP_BPT4 PROXIMAL LONG TAIL FIBRE PROTEIN GP34, PROTEIN GP34 MGSSHHHHHHSQDPSQFIRRDIAQTVNGSLTLTQQTNLSAPLVSSSTGEFGGSLAANRTFTIRNTGAPTSIVFEKGPASGANPAQSMSIRVWGNQFGGGSDTTRSTVFEVGDDTSHHFYSQRNKDGNIAFNINGTVMPININASGLMNVNGTATFGRSVTANGEFISKSANAFRAINGDYGFFIRNDASNTYFLLTAAGDQTGGFNGLRPLLINNQSGQITIGEGLIIAKGVTINSGGLTVNSRIRSQGTKTSDLYTRAPTSDTVGFWSIDINDSATYNQFPGYFKMVEKTNEVTGLPYLERGEEVKSPGTLTQFGNTLDSLYQDWITYPTTPEARTTRWTRTWQKTKNSWSSFVQVFDGGNPPQPSDIGALPSDNATMGNLTIRDFLRIGNVRIVPDPVNKTVKFEWVE 410 T 5.8 Auxin_repressed pdbhh T Viruses T 4uxf 1 A,B,C A,B,C FIBP_BPT4 PROXIMAL LONG TAIL FIBRE PROTEIN GP34, PROTEIN GP34 MGSSHHHHHHSQDPSQFIRRDIAQTVNGSLTLTQQTNLSAPLVSSSTGEFGGSLAANRTFTIRNTGAPTSIVFEKGPASGANPAQSMSIRVWGNQFGGGSDTTRSTVFEVGDDTSHHFYSQRNKDGNIAFNINGTVMPININASGLMNVNGTATFGRSVTANGEFISKSANAFRAINGDYGFFIRNDASNTYFLLTAAGDQTGGFNGLRPLLINNQSGQITIGEGLIIAKGVTINSGGLTVNSRIRSQGTKTSDLYTRAPTSDTVGFWSIDINDSATYNQFPGYFKMVEKTNEVTGLPYLERGEEVKSPGTLTQFGNTLDSLYQDWITYPTTPEARTTRWTRTWQKTKNSWSSFVQVFDGGNPPQPSDIGALPSDNATMGNLTIRDFLRIGNVRIVPDPVNKTVKFEWVE 410 T 5.8 Auxin_repressed pdbhh T Viruses T 4uxg 1 A,B,C,D,E,F,G,H,I,J,K,L A,B,C,D,E,F,G,H,I,J,K,L FIBP_BPT4 PROXIMAL LONG TAIL FIBRE PROTEIN GP34, PROTEIN GP34 MGSSHHHHHHSQDPSQFIRRDIAQTVNGSLTLTQQTNLSAPLVSSSTGEFGGSLAANRTFTIRNTGAPTSIVFEKGPASGANPAQSMSIRVWGNQFGGGSDTTRSTVFEVGDDTSHHFYSQRNKDGNIAFNINGTVMPININASGLMNVNGTATFGRSVTANGEFISKSANAFRAINGDYGFFIRNDASNTYFLLTAAGDQTGGFNGLRPLLINNQSGQITIGEGLIIAKGVTINSGGLTVNSRIRSQGTKTSDLYTRAPTSDTVGFWSIDINDSATYNQFPGYFKMVEKTNEVTGLPYLERGEEVKSPGTLTQFGNTLDSLYQDWITYPTTPEARTTRWTRTWQKTKNSWSSFVQVFDGGNPPQPSDIGALPSDNATMGNLTIRDFLRIGNVRIVPDPVNKTVKFEWVE 410 T 5.8 Auxin_repressed pdbhh T Viruses T 4uyz 2 E E POLY ALA AAAAAAAAAA 10 T 200 FAD_oxidored pdbhh F F 4uzc 1 A,B,C,D A,B,C,D ORF73_HHV8P LATENCY-ASSOCIATED NUCLEAR ANTIGEN, LANA-1, KSHV LANA GSRYQQPPVPYRQIDDCPAKARPQHIFYRRFLGKDGRRDPKCQWKFAVIFWGNDPYGLKKLSQAFQFGGVKAGPVSCLPHPGPDQSPITYCVYVYCQNKDTSKKVQMARLAWEASHPLAGNLQSSIVKFKKPLPLTQPG 139 T 0.00011 EBV-NA1 unphh T Viruses T 4uzy 2 B B Q946G4_CHLRE INTRAFLAGELLAR TRANSPORT PROTEIN 52 NLIPPSFETPLPPLQPAVFPPTIREPPPPALELFDLDESFASLTNKCHGEED 52 T 24 Antimicrobial18 pdbhh F Eukaryota T 4uzz 2 B B I7LT74_TETTS INTRAFLAGELLAR TRANSPORT PROTEIN 52 GAASDEFASEKVRLAQLTNKCNNNDLDYYIKESGDILGVTDKVKNKHDAKAILRYVLEELINFKKLNN 68 T 0.018 RRM_1 pdbpercent F Eukaryota T 4v11 2 B B SV2A_HUMAN SV2A SDATEGHDED 10 T 13 Toxin_25 pdbhh F Eukaryota T 4v1a 23 W z UNASSIGNED SECONDARY STRUCTURE ELEMENTS XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 47 F F F 4v2v 2 C,D C,D H31T_HUMAN H3/T, H3T, H3/G ARKSA 5 T 200 RNR_inhib pdbhh F Eukaryota F 4v3p 18 R SW 40S WHEAT GERM RIBOSOME1 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 92 F F F 4v3p 21 U Sc Unknown 40S wheat germ ribosome protein 2 XXXXXXXXXXXXXXXXXXXXXXXXX 25 F F F 4v3p 22 V Sb Unknown 40S wheat germ ribosome protein 3 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 36 F F F 4v3p 26 Z SG Unknown 40S wheat germ ribosome protein 4 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 143 F F F 4v3p 53 AB Ld 60S ribosomal protein L29 KFLRNQRYSRKHNKKSGEAESEE 23 T 15 MAT1-1-2 pdbhh F T 4v3p 62 JB,VB Ly,Lx Unknown 60S wheat germ ribosome protein 2 XXXXXXXXXXXXXXXXXXXX 20 F F F 4v3p 63 KB Lz Unknown 60S wheat germ ribosome protein 3 XXXXXXXXXXXXXX 14 F F F 4v3p 65 MB LL Unknown 60S wheat germ ribosome protein 5 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 182 F F F 4v49 46 UA BV 50S RIBOSOMAL PROTEIN L28 XXXXXXXXXXXXXXXX 16 F F F 4v4a 44 RA BV 50S RIBOSOMAL PROTEIN L28 XXXXXXXXXXXXXXXX 16 F F F 4v4u 2 F,G,H,I,J S,T,U,V,W N-TERMINAL PEPTIDE OF FIBER PROTEIN TFNPVYPYDT 10 T 0.25 DUF3463 pdbhh F T 4v5e 44 SA,ZC BJ,DJ 50S RIBOSOMAL PROTEIN L10 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 130 F F F 4v5f 43 BD,RA DJ,BJ 50S RIBOSOMAL PROTEIN L10 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 173 F F F 4v5f 45 DD,ED,IB,JB,SD,TA,TD,UA DL,DM,Bl,Bm,Dl,BL,Dm,BM 50S RIBOSOMAL PROTEIN L12 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 125 F F F 4v5g 44 SA,ZC BJ,DJ 50S RIBOSOMAL PROTEIN L10 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 173 F F F 4v5g 45 AD,TA DK,BK 50S RIBOSOMAL PROTEIN L11 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 147 F F F 4v5h 24 X AZ POLY-ALA NASCENT CHAIN AAAAAAAAAAAAAAAAAAAA 20 T 510 Adeno_PIX pdbhh F F 4v5j 44 SA,ZC BJ,DJ 50S RIBOSOMAL PROTEIN L10 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 130 F F F 4v5k 44 SA,YC BJ,DJ 50S RIBOSOMAL PROTEIN L10 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 131 F F F 4v5l 44 SA BJ 50S RIBOSOMAL PROTEIN L10 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 130 F F F 4v5l 45 TA BK 50S RIBOSOMAL PROTEIN L11 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 140 F F F 4v5p 44 SA,ZC BJ,DJ 50S RIBOSOMAL PROTEIN L10 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 173 F F F 4v5p 45 AD,TA DK,BK 50S RIBOSOMAL PROTEIN L11 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 147 F F F 4v5q 44 SA,ZC BJ,DJ 50S RIBOSOMAL PROTEIN L10 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 173 F F F 4v5q 45 AD,TA DK,BK 50S RIBOSOMAL PROTEIN L11 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 147 F F F 4v5r 44 SA,ZC BJ,DJ 50S RIBOSOMAL PROTEIN L10 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 173 F F F 4v5r 45 AD,TA DK,BK 50S RIBOSOMAL PROTEIN L11 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 147 F F F 4v5s 44 SA,ZC BJ,DJ 50S RIBOSOMAL PROTEIN L10 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 173 F F F 4v5s 45 AD,TA DK,BK 50S RIBOSOMAL PROTEIN L11 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 147 F F F 4v5z 69 QB B7 60S Ribosomal protein L19 RMRILRRLLRRYR 13 T 1.2 CRPV_capsid pdbhh F F 4v5z 76 XB B8 60S Ribosomal protein L35 ARVLTVINQT 10 T 0.0087 Ribosomal_L29 pdbhh F T 4v62 19 MA,S BY,AY Photosystem II protein Y XXXXXXXXXXXXXXXXXXXXXXXXXXXX 28 F F F 4v6i 7 G AG 40S ribosomal protein rpS7 (S7e) XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 144 F F F 4v6i 23 W AW 40S ribosomal protein rpS26 (S26e) XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 93 F F F 4v6i 27 AA Ab Unknown 40S ribosomal protein XS1 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 37 F F F 4v6i 28 BA Ac Unknown 40S ribosomal protein XS2 XXXXXXXXXXXXXXXXXXXXXXXXXX 26 F F F 4v6i 70 RB,SB Bx,By Unknown protein XXXXXXXXXXXXXXXXXXXXX 21 F F F 4v6i 71 TB Bz Unknown protein XXXXXXXXXXXXXXX 15 F F F 4v6i 75 ZB BL 60S ribosomal protein rpL13 (L13e) XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 170 F F F 4v6u 9 I A9 unknown 30S ribosomal protein SX XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 57 F F F 4v6u 32 FA BO RL18_PYRFU PFL18, 50S RIBOSOMAL PROTEIN L18 MAHGPRYRVPFRRRREGKTNYRKRLKLLKSGKPRLVVRKSLNHHIAQIIVYDPKGDRTLVSAHTRELIRDFGWKGHCGNTPSAYLLGLLIGYKAKQAGIEEAILDIGLHPPVRGSSVFAVLKGAVDAGLNVPHSPEIFPDEYRIRGEHIAEYAKMLKEQDEEKFRRQFGGYLVKGLDPEKLPEHFEEVKARIIEKFEGEGARE 203 T 1E-08 Ribosomal_L5e pdb F Archaea T 4v7c 24 Y AY viomycin XXSSXX 6 T 1500 Sid-5 pdbhh F F 4v7d 59 HB BY viomycin XXSSXX 6 T 1500 Sid-5 pdbhh F F 4v7l 26 FC,Z CZ,AZ Viomycin XXSSXX 6 T 1500 Sid-5 pdbhh F F 4v7m 26 FC,Z CZ,AZ capreomycin IA XXXXXS 6 T 2200 zf-H2C2_2 pdbhh F F 4v7r 22 QC,RB,V Ca,Bo,Aa Unassigned secondary structure XXXXXXXXXXXXXXXXXXXX 20 F F F 4v7r 23 RC,W Cb,Ab Unassigned secondary structure XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 105 F F F 4v7r 24 SC,X Cc,Ac Unassigned secondary structure XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 93 F F F 4v7r 25 TC,Y Cd,Ad Unassigned secondary structure XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 35 F F F 4v7r 26 HE,MB,Z Dj,Bj,Ae Unassigned secondary structure XXXXXXXXXXXXXXXXXXXXX 21 F F F 4v7r 27 AA Af Unassigned secondary structure XXXXXXXXXXX 11 F F F 4v7r 28 BA,UC Ah,Ch Unassigned secondary structure XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 41 F F F 4v7r 63 FE,KB Dh,Bh Unassigned secondary structure XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 44 F F F 4v7r 64 GE,LB Di,Bi Unassigned secondary structure XXXXXXXXXXXX 12 F F F 4v7r 65 IE,NB Dk,Bk Unassigned secondary structure XXXXXXXXXXXXXXXX 16 F F F 4v7r 66 OB Bl Unassigned secondary structure XXXXXXXXXXXXXXXXXXX 19 F F F 4v7r 67 PB Bm Unassigned secondary structure XXXXXXXXX 9 F F F 4v7r 68 QB Bn Unassigned secondary structure XXXXXXXXXXXXXXXXXXXXXXXXXXX 27 F F F 4v7r 69 SB Bp Unassigned secondary structure XXXXXXXX 8 F F F 4v7r 70 TB Bq Unassigned secondary structure XXXXXXXXXXXXXXXXX 17 F F F 4v7r 71 UB Br Unassigned secondary structure XXXXXXXXXXXXXXXXXXXXXXX 23 F F F 4v82 19 MA,S BY,AY PHOTOSYSTEM II PSBX PROTEIN XXXXXXXXXXXXXXXXXXXXXXXXXXXX 28 F F F 4v85 24 X AY Viomycin XXSSXX 6 T 1500 Sid-5 pdbhh F F 4v88 82 XD DK Ribosomal protein L12 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 155 F F F 4v88 84 EF Dr Ribosomal protein P1 alpha XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 47 F F F 4v88 85 FF Ds Ribosomal protein P2 beta XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 46 F F F 4v8n 44 SA,YC BJ,DJ 50S RIBOSOMAL PROTEIN L10 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 173 F F F 4v8o 43 QA BJ 50S RIBOSOMAL PROTEIN L10 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXLXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 173 T 11000 zf-C2H2 pdbhh F F 4v8o 44 RA BK 50S RIBOSOMAL PROTEIN L11 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 147 F F F 4v8p 28 BA,CB,QE,WC BG,CG,GG,EG RPLP0 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 123 F F F 4v8q 19 S AJ 50S RIBOSOMAL PROTEIN L10 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 130 F F F 4v8q 20 T AK 50S RIBOSOMAL PROTEIN L11 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 140 F F F 4v8t 11 K K 60S RIBOSOMAL PROTEIN L12 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 155 F F F 4v8t 44 RA r RIBOSOMAL PROTEIN P1 ALPHA XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 47 F F F 4v8t 45 SA s RIBOSOMAL PROTEIN P2 BETA XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 46 F F F 4v8u 44 RA,WC BJ,DJ 50S RIBOSOMAL PROTEIN L10 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 173 F F F 4v8x 45 AD,TA DJ,BJ 50S RIBOSOMAL PROTEIN L10 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 130 F F F 4v8y 45 SA BK 60S RIBOSOMAL PROTEIN L11-A XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 174 F F F 4v8y 77 YB Br 60S ACIDIC RIBOSOMAL PROTEIN P1 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 47 F F F 4v8y 78 ZB Bs 60S ACIDIC RIBOSOMAL PROTEIN P2 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 46 F F F 4v8z 45 SA BK 60S RIBOSOMAL PROTEIN L11-A XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 174 F F F 4v8z 77 YB Br 60S ACIDIC RIBOSOMAL PROTEIN P1 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 47 F F F 4v8z 78 ZB Bs 60S ACIDIC RIBOSOMAL PROTEIN P2 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 46 F F F 4v90 43 QA BJ CHAIN J XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXLXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 130 T 10000 zf-C2H2 pdbhh F F 4v90 44 RA BK CHAIN K XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 140 F F F 4v90 45 SA BL CHAIN L XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 71 F F F 4v9f 32 FA,GA 4,5 HMAL12 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 70 F F F 4v9h 57 EB BJ 50S ribosomal protein L10 AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 130 T 16000 zf_CCCH_4 pdbhh F F 4v9h 58 FB BL 50S ribosomal protein L12 CTD XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 125 F F F 4v9i 35 IA,OC BJ,DJ 50S ribosomal protein L10 AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 130 T 16000 zf_CCCH_4 pdbhh F F 4v9j 24 GC,X CU,AU Viomycin XXSSXX 6 T 1500 Sid-5 pdbhh F F 4v9j 31 EA,NC BJ,DJ 50S ribosomal protein L10 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 170 F F F 4v9j 57 EB,FB,ND,OD Bf,Bg,Df,Dg 50S ribosomal protein L7/L12 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 31 F F F 4v9j 58 GB,PD Bh,Dh 50S ribosomal protein L7/L12 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 30 F F F 4v9k 24 GC,X CU,AU VIOMYCIN XXSSXX 6 T 1500 Sid-5 pdbhh F F 4v9k 31 EA,NC BJ,DJ 50S RIBOSOMAL PROTEIN L10 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 170 F F F 4v9k 54 BB,CB,KD,LD Bf,Bg,Df,Dg 50S RIBOSOMAL PROTEIN L7/L12 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 31 F F F 4v9k 55 DB,MD Bh,Dh 50S RIBOSOMAL PROTEIN L7/L12 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 30 F F F 4v9l 24 GC,X CU,AU VIOMYCIN XXSSXX 6 T 1500 Sid-5 pdbhh F F 4v9l 31 EA,NC BJ,DJ 50S RIBOSOMAL PROTEIN L10 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 170 F F F 4v9l 55 CB,DB,LD,MD Bf,Bg,Df,Dg 50S RIBOSOMAL PROTEIN L7/L12 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 31 F F F 4v9l 56 EB,ND Bh,Dh 50S RIBOSOMAL PROTEIN L7/L12 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 30 F F F 4v9m 30 DA,LC BJ,DJ 50S ribosomal protein L10 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 170 F F F 4v9m 54 BB,CB,JD,KD Bf,Bg,Df,Dg 50S ribosomal protein L7/L12 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 31 F F F 4v9m 55 DB,LD Bh,Dh 50S ribosomal protein L7/L12 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 30 F F F 4v9o 56 DB,GD,JF,LH BW,DW,FW,HW Viomycin XXSSXX 6 T 1500 Sid-5 pdbhh F F 4v9p 55 CB,ED,HF BW,DW,FW Viomycin XXSSXX 6 T 1500 Sid-5 pdbhh F F 4v9r 24 AC,X CW,AW Dityromycin XVXPXXPXXX 10 T 72 ELF pdbhh F F 4v9s 24 AC,X CW,AW GE82832 XVXPXXPXXX 10 T 72 ELF pdbhh F F 4vgc 1 A A CTRA_BOVIN GAMMA CHYMOTRYPSIN CGVPAIQPVLSGL 13 T 2 CaM_bind pdbhh F Eukaryota T 4w29 31 EA,NC BJ,DJ 50S ribosomal protein l10 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 170 F F F 4w29 57 EB,FB,ND,OD Bf,Bg,Df,Dg 50S ribosomal protein L7/L12 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 31 F F F 4w29 58 GB,PD Bh,Dh 50S ribosomal protein L7/L12 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 30 F F F 4w4z 2 E,F,G,H E,F,G,H APY-bAla8.am peptide APYCVYRXSWSCX 13 T 0.89 DUF1684 pdbhh F T 4w50 2 E,F,G,H E,F,G,H APY peptide APYCVYRGSWSC 12 T 1 DUF1684 pdbhh F T 4w5l 1 A,B A,B PrP peptide GGYLLGS 7 T 3.7 BioY pdbhh F F 4w5m 1 A,B A,B PrP peptide GGYMLGS 7 T 4.9 G0-G1_switch_2 pdbhh F F 4w5p 1 A,B A,B PrP peptide GGYVLGS 7 T 14 G0-G1_switch_2 pdbhh F F 4w5y 1 A,B A,B Prp peptide GYMLGSA 7 T 3 G0-G1_switch_2 pdbhh F T 4w67 1 A,B A,B PrP peptide GYVLGSA 7 T 11 DUF2148 pdbhh F T 4w6w 1 A A Q47212_ECOLX FEDF NSSASSAQVTGTLLGTGKTNTTQMPALYTWQHQIYNVNFIPSSSGTLTCQAGTILVWKNGRETQYALECRVSIHHSSGSINESQWGQQSQVGFGTACGNKKCRFTGFEISLRIPPNAQTYPLSSGDLKGSFSLTNKEVNWSASIYVPAIAK 151 T 2.5 DUF2511 pdbhh F Bacteria T 4w6x 1 A A Q47212_ECOLX FEDF NSSASSAQVTGTLLGTGKTNTTQMPALYTWQHQIYNVNFIPSSSGTLTCQAGTILVWKNGRETQYALECRVSIHHSSGSINESQWGQQSQVGFGTACGNKKCRFTGFEISLRIPPNAQTYPLSSGDLKGSFSLTNKEVNWSASIYVPAIAK 151 T 2.5 DUF2511 pdbhh F Bacteria T 4w6y 1 A A Q47212_ECOLX FEDF, LECTIN DOMAIN OF F18 FIMBRIAL ADHESIN FEDF NSSASSAQVTGTLLGTGKTNTTQMPALYTWQHQIYNVNFIPSSSGTLTCQAGTILVWKNGRETQYALECRVSIHHSSGSINESQWGQQSQVGFGTACGNKKCRFTGFEISLRIPPNAQTYPLSSGDLKGSFSLTNKEVNWSASIYVPAIAK 151 T 2.5 DUF2511 pdbhh F Bacteria T 4w71 1 A,B A,B PrP peptide GYLLGSA 7 T 4.2 G3P_acyltransf pdbhh F F 4w8h 2 B D hexa-His tag HHHHHH 6 T 6100 zf_CCCH_4 pdbhh F F 4w8p 2 B B AB1IP_MOUSE APBB1-INTERACTING PROTEIN 1,PROLINE-RICH EVH1 LIGAND 1,PREL-1,PROLINE-RICH PROTEIN 48 NEDIDQMFSTLLGEMDLLTQS 21 T 6.8 Mvb12 pdbhh F Eukaryota T 4wa0 1 A A E4SDB5_CALK2 possible adhesin TSVPSSPLDYAIFSKGALNTNKNLTVENGSVYSGGDLTIDGGAVFNIDNLISKGEMVINQDSDSRCRDNNIVVRNIIYVEKSLKANRISPRSTNIDAKTIYVGQEMQLYGAGSYKFVQLFSDSNVKLAGPGVNMEVSTLASIRGTLEVIDGATVTLKSNSAVYCNSLVVRNGSRLILENGAKLYLATTPDASTIISIQNNGGTISYSSSFSYPSPPAEIDEIRNRDYTSGLLTTPLPADSVGSNQLGSTADTSQTPPQIVIYGESYINDNEARIEISARLGSPIVDFSTLQLHLISRGNITFVGGGLTIMNGSIISLGSTFNINATGNPYAGLTLKYQMPSPPIQQDIESNTGIQPSQ 358 T 0.032 FecR pdb F Bacteria T 4wbu 1 A,B A,B PrP peptide GYMLGS 6 T 0.03 Pectate_lyase_3 unp F F 4wbv 1 A,B A,B PrP peptide GYVLGS 6 T 0.03 Pectate_lyase_3 unp F F 4wci 2 B,D,F B,D,F RIN3_HUMAN RAS INTERACTION/INTERFERENCE PROTEIN 3 AKKNLPTAPPRRRVSE 16 T 11 COX8 pdbhh F Eukaryota T 4wfd 3 C,F,I C,F,I MTR4_YEAST MRNA TRANSPORT REGULATOR MTR4 MDSTDLFDVFEETPVELPTK 20 T 8.8 eIF3h_C pdbhh F Eukaryota T 4whh 2 B B C6H5(CH2)8-DERIVATIZED PEPTIDE INHIBITOR XXSTX 5 T 600 zf-H2C2_5 pdbhh F F 4whk 2 B B C6H5(CH2)8-DERIVATIZED PEPTIDE INHIBITOR XXSTX 5 T 600 zf-H2C2_5 pdbhh F F 4whl 2 B B C6H5(CH2)8-DERIVATIZED PEPTIDE INHIBITOR XXSTX 5 T 600 zf-H2C2_5 pdbhh F F 4wj7 2 E,F,G,H W,X,Y,Z KRIT1 NPxY/F3 VDKVVINPYFGLG 13 T 0.029 MT-A70 pdbhh F T 4wjg 5 DA,E,J,O,T,Y 4,E,J,O,T,Y I7BA80_TRYBB Haptoglobin-hemoglobin receptor AEGLKTKDEVEKACHLAQQLKEVSITLGVIYRTTERHSVQVEAHKTAIDKHADAVSRAVEALTRVDVALQRLKELGKANDTKAVKIIENITSARENLALFNNETQAVLTARDHVHKHRAAALQGWSDAKEKGDAAAEDVWVLLNAAKKGNGSADVKAAAEKCSRYSSSSTSETELQKAIDAAANVGGLSAHKSKYGDVLNKFKLSNASVGAVRDTSGRGGKHMEKVNNVAKLLKDAEVSLAAAAAEIEEVKNAHETKAQEEMKRNGNPIENESETNSGGNAESQGNGDREDKNDEQQQVDEEETKVENGSSEEGSCCGNESNGPHVMKKRHGVEGPRPVDVVS 343 T 8.4E-05 GARP unphh F Eukaryota T 4wjp 2 B,D B,D Daxx GSGEAEERIIVLSDSDY 17 T 1.5 Rnk_N pdbhh F T 4wjq 2 B,D B,D Daxx GSGEAEERIIVLSDSDY 17 T 1.5 Rnk_N pdbhh F T 4wjv 3 I,J,K,L I,J,K,L NSA2_YEAST NOP7-ASSOCIATED PROTEIN 2 MDTDGDALPTYLLDREQNNTAK 22 T 5 Sec62 unppssm F Eukaryota T 4wjw 3 C P CHS3_YEAST CHITIN-UDP ACETYL-GLUCOSAMINYL TRANSFERASE 3,CLASS-IV CHITIN SYNTHASE 3 DDYYLNLNQDEESLLRSRC 19 T 3.2 DUF3305 pdbhh F Eukaryota T 4wk0 3 C C ARG-GLY-ASP RGD 3 T 170 SatD pdbhh F F 4wk2 3 C C GLY-ARG-GLY-ASP-SER-PRO GRGDSP 6 T 21 Topoisom_I_N pdbhh F F 4wk4 3 C C ALA-CYS-ARG-GLY-ASP-GLY-TRP-CYS ACRGDGWC 8 T 0.14 Peptidase_C65 pdbhh F T 4wkm 2 I,J,K,L,M,N,O,P I,J,K,L,M,N,O,P ALA-FGA-API-DAL-DAL AXXXX 5 F F F 4wlb 2 C,D D,E SRC-1 peptide SLLKKLLD 8 T 8.2 Neurokinin_B pdbhh F F 4wnd 2 B B FRPD4_HUMAN PDZ DOMAIN-CONTAINING PROTEIN 10,PSD-95-INTERACTING REGULATOR OF SPINE MORPHOGENESIS,PRESO GPLGSDLPPKVVPSKQLLHSDHMEMEPETMETKSVTDYFSKLHMGSVAYSCTS 53 T 100 EB1 pdbhh F Eukaryota T 4wne 2 B B FRPD4_HUMAN PDZ DOMAIN-CONTAINING PROTEIN 10,PSD-95-INTERACTING REGULATOR OF SPINE MORPHOGENESIS,PRESO KQLLHSDHMEMEPETMETKSVTDYF 25 T 83 SfsA pdbhh F Eukaryota T 4wnf 2 B B FRPD4_HUMAN PDZ DOMAIN-CONTAINING PROTEIN 10,PSD-95-INTERACTING REGULATOR OF SPINE MORPHOGENESIS,PRESO GPLGSDLPPKVVPSKQLLHSDHMEMEPETMETKSVTDYFSKLHMGSVAYSCTS 53 T 100 EB1 pdbhh F Eukaryota T 4wng 2 B B FRPD4_HUMAN PDZ DOMAIN-CONTAINING PROTEIN 10,PSD-95-INTERACTING REGULATOR OF SPINE MORPHOGENESIS,PRESO GPGSDLPPKVVPSKQLLHSDHMEMEPETMETKSVTDYFSKLHMGSVAYSCTSEFHHHHHH 60 T 130 EB1 pdbhh F Eukaryota T 4wnl 2 E,F,G,H E,F,G,H SHE3_YEAS6 SWI5-dependent HO expression protein 3 RSFYTASPLLSSGSIPKSASPVLPGVKRTASVR 33 T 0.00026 CCDC73 unphh F Eukaryota T 4wnn 3 I T SPT16_YEAST SPT16 GIKKTDDEASDESEEEVSEY 20 T 0.11 SAPS unppssm F Eukaryota T 4wpb 2 C,D C,D alpha/beta-VEGF-1 VXNKXNKEXCNXRAIEXALDPNLNDQQFHXKIWXIIXDCX 40 T 6.5 Vel1p pdbhh F T 4wph 2 C,D C,D ICP0_HHV11 ICP0 GPRKCARKTRH 11 T 2.8 Adeno_E4_34 pdbhh T Viruses T 4wpi 2 C,D C,D ICP0_HHV11 ICP0 GPRKCARKTRH 11 T 2.8 Adeno_E4_34 pdbhh T Viruses T 4wpx 2 B,E B,E G0SGL4_CHATD Putative SAC3 family protein GHMKPKRDLMADFTKWFVTGDGGIMEEFTEETLRHLLWDVWQRHQREEAERKRKAEEEESWRLAREHLTHRLQVKYFYRWREKARALAT 89 T 0.12 PV_NSP1 pdb F Eukaryota T 4wqu 58 GB,ND BX,DX Dityromycin XVXPXXPXXX 10 T 72 ELF pdbhh F F 4wsf 2 B B Q9VHP9_DROME FI18815P1 PDESSADVVFKKPLAPAPR 19 T 0.4 TSSC4 pdbhh F Eukaryota T 4wsi 2 C,D X,Y CRB_DROME 95F GPGSEFRNKRATRGTYSPSAQEYCNPRLEMDNVLKPPPEERLI 43 T 0.031 TMEM154 unphh F Eukaryota T 4wt8 34 CB,MC CJ,DJ ribosomal L10 protein XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 130 F F F 4wv6 2 B,C B,C TAF8_HUMAN PROTEIN TAUBE NUSS,TBP-ASSOCIATED FACTOR 43 KDA,TBP-ASSOCIATED FACTOR 8,TRANSCRIPTION INITIATION FACTOR TFIID 43 KDA SUBUNIT,HTAFII43 PVKKPKIRRKKSLS 14 T 24 Ribosomal_L29e pdbhh F Eukaryota T 4wvd 1 A,C C,D NCOR1_HUMAN N-COR1 SNLGLEDIIRKALMGSF 17 T 3.1 RuvA_C pdbhh F Eukaryota T 4wvh 2 B C substrate peptide (pep1) DHDAHA 6 T 170 Paired_CXXCH_1 pdbhh F F 4wvi 2 B D substrate peptide (pep2) GGGGAVPTAKA 11 T 24 DUF3034 pdbhh F T 4wvj 2 B D inhibitor peptide (PEP3) GGGGGAPTAKAPSK 14 T 29 DUF4023 pdbhh F T 4wvp 2 B I BTN-3V3-NLB-OMT-OIC-3V2 XXXMXX 6 T 2300 zf-C2H2_jaz pdbhh F F 4wvs 2 B B 3,11-DIFLUORO-6,8,13-TRIMETHYL-8H-QUINO[4,3,2-KL]ACRIDIN-13-IUM XXPFX 5 T 410 zf-CCHC pdbhh F F 4wvt 2 C,D C,D 3,11-DIFLUORO-6,8,13-TRIMETHYL-8H-QUINO[4,3,2-KL]ACRIDIN-13-IUM XFPFFX 6 T 4.2 Inhibitor_I10 pdbhh F F 4wvu 2 B B 3,11-DIFLUORO-6,8,13-TRIMETHYL-8H-QUINO[4,3,2-KL]ACRIDIN-13-IUM XVXFX 5 T 480 ETC_C1_NDUFA4 pdbhh F F 4wwr 2 B,D,F,H A,G,C,E BAG6_HUMAN BAG FAMILY MOLECULAR CHAPERONE REGULATOR 6,BCL2-ASSOCIATED ATHANOGENE 6,BAG6,HLA-B-ASSOCIATED TRANSCRIPT 3,PROTEIN G3,PROTEIN SCYTHE MQLLLSEAVSRAAKAAGARPLTSPESLSRDLEAPEVQESYRQQLRSDIQKRLQ 53 T 0.48 DUF2939 unp F Eukaryota T 4wx4 2 B C peptide VKSLKRRRCY 10 T 0.00019 MCPVI pdbhh F T 4wym 2 M,N,O,P,Q,R,S,T,U,V,W M,N,O,P,Q,R,S,T,U,V,W CPSF6_HUMAN CLEAVAGE AND POLYADENYLATION SPECIFICITY FACTOR 68 KDA SUBUNIT,CPSF 68 KDA SUBUNIT,PRE-MRNA CLEAVAGE FACTOR IM 68 KDA SUBUNIT,PROTEIN HPBRII-4/7 GTPVLFPGQPFGQPPLG 17 T 2.2 MlaD pdbhh F Eukaryota T 4wyq 1 A,D A,D DICER_HUMAN HELICASE WITH RNASE MOTIF,HELICASE MOI YERLLMELEEALNFINDCNISVHSKERDSTLISKQILSDCRAVLVVLGPWCADKVAGMMVRELQKYIKHEQEELHRKFLLFTDTFLRKIHALCEEHFSPASLDLKFVTPKVIKLLEILRKYKP 123 T 0.088 Tyrosinase pdbpssm F Eukaryota T 4wyq 3 C,F C,F Poly(UNK) XXXXXXXXXXX 11 F F F 4wyu 2 B,D D,C SYNTHETIC PDZ BINDING MOTIF SWFQTDL 7 T 12 DOR pdbhh F T 4wz7 10 J E 39-kDa subunit XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 195 F F F 4wz7 16 P,Q Z,D unknown subunits 1 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 57 F F F 4wz7 17 R F unknown subunits 2 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 54 F F F 4wz7 18 S J unknown subunits XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 63 F F F 4wz7 19 T M unknown subunits XXXXXXXXXXXXXXXXXXXXXXXXXXXXX 29 F F F 4wz7 20 U N unknown subunits XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 50 F F F 4wz7 21 V O unknown subunits XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 70 F F F 4wz7 22 W P unknown subunits XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 46 F F F 4wz7 23 X Q unknown subunits XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 51 F F F 4wz7 24 Y R unknown subunits XXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 30 F F F 4wz7 25 Z S unknown subunits XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 69 F F F 4wz7 26 AA,NA T,AH unknown subunits XXXXXXXXXXXXXXX 15 F F F 4wz7 27 BA U unknown subunits XXXXXXXXXXXXXXXXXXXXXXXXXX 26 F F F 4wz7 28 CA V unknown subunits XXXXXXXXXXXXXXXXXXXXXX 22 F F F 4wz7 29 DA,EB,HA,KB W,AY,AB,BE unknown subunits XXXXXXXXX 9 F F F 4wz7 30 BB,EA,NB,PA AV,X,BH,AJ unknown subunits XXXXXXXXXXXXXXXX 16 F F F 4wz7 31 FA,XA,ZA Y,AR,AT unknown subunits XXXXXXXXXXXXX 13 F F F 4wz7 32 CB,GA,HB,MB AW,AA,BB,BG unknown subunits XXXXXXXXXXXXXXXXXX 18 F F F 4wz7 33 IA,JA AC,AD unknown subunits XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 47 F F F 4wz7 34 KA AE unknown subunits XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 48 F F F 4wz7 35 LA AF unknown subunits XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 35 F F F 4wz7 36 MA AG unknown subunits XXXXXXXXXXXXXXXXXXXXXXXXX 25 F F F 4wz7 37 OA,RA AI,AL unknown subunits XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 36 F F F 4wz7 38 QA,TA AK,AN unknown subunits XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 76 F F F 4wz7 39 SA AM unknown subunits XXXXXXXXXXXXXXXXX 17 F F F 4wz7 40 UA AO unknown subunits XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 32 F F F 4wz7 41 VA,YA AP,AS unknown subunits XXXXXXXXXXX 11 F F F 4wz7 42 GB,WA BA,AQ unknown subunits XXXXXXXX 8 F F F 4wz7 43 AB AU unknown subunits XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 58 F F F 4wz7 44 DB AX unknown subunits XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 39 F F F 4wz7 45 FB AZ unknown subunits XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 40 F F F 4wz7 46 IB BC unknown subunits XXXXXXXXXXXXXXXXXXXX 20 F F F 4wz7 47 JB,LB BD,BF unknown subunits XXXXXXXXXXXXXXXXXXX 19 F F F 4wz7 48 OB BI unknown subunits XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 905 F F F 4wz9 2 C M ALA-ALA-ALA-LYS-ALA AAAKA 5 T 430 TRI9 pdbhh F F 4wz9 3 D N ALA-ALA-LYS AAK 3 T 690 NUMOD1 pdbhh F F 4wzn 1 A,B A,B POLG_HAVHM Genome polyprotein SMMSRIAAGDLESSVDDPRSEEDKRFESHIECRKPYKELRLEVGKQRLKYAQEELSNEVLPPPRKMKGLFSQAKISLFYTEEHEIMKFSWRGVTADTRALRRFGFSLAAGRSVWTLEMDAGVLTGRLIRLNDEKWTEMKDDKIVSLIEKFTSNKYWSKVNFPHGMLDLEEIAANSKDFPNMSETDLCFLLHWLNPKKINLADRMLGLSGVQEIKEQG 217 T 46 APC_15aa pdbhh T Viruses T 4wzx 2 B E IST1_HUMAN HIST1,PUTATIVE MAPK-ACTIVATING PROTEIN PM28 TSASEDIDFDDLSRRFEELKKKT 23 T 2.6 TACC_C pdbhh F Eukaryota T 4x01 1 A,B,C,D,E,F,G,H A,B,C,D,E,F,G,H COM1_SCHPO DOUBLE-STRAND BREAK REPAIR PROTEIN CTP1,MEIOTICALLY UP-REGULATED GENE 38 PROTEIN,NBS1-INTERACTING PROTEIN 1,SPORULATION IN THE ABSENCE OF SPO11 PROTEIN 2 HOMOLOG,SAE2 MEHNKSVHWSIVYRQLGNLLEQYEVEIARLKSQLVLEKKLRIQVEKEMESVKTKQIS 57 T 0.00099 Lzipper-MIP1 unppercent F Eukaryota T 4x0w 1 A P mupain-1-17 CPAYSXYLDC 10 T 1.4 DUF6438 pdbhh F T 4x1h 2 B C C-terminal derived peptide of guanine nucleotide-binding protein G(t) subunit alpha-1 VLEDLKSCGLF 11 T 2.7 Defensin_RK-1 pdbhh F T 4x1n 1 A P mupain-1-16 CPAYSAYLDC 10 T 1.4 DUF6438 pdbhh F T 4x1p 2 B P MUPAIN-1-17 CPAYSXYLDC 10 T 1.4 DUF6438 pdbhh F T 4x1q 1 A P mupain-1 CPAYSRYLDC 10 T 0.32 Hormone_2 pdbhh F T 4x1r 1 A P mupain-1-12 CPAYSAYLDC 10 T 1.4 DUF6438 pdbhh F T 4x1s 1 A P mupain-1-16 CPAYSAYLAC 10 T 2.1 DUF6438 pdbhh F T 4x1v 2 B B ARAP1_HUMAN CENTAURIN-DELTA-2,CNT-D2 RPTPRPVPMKRHIFRS 16 T 27 DUF3864 pdbhh F Eukaryota T 4x23 7 K,L,W,X V,U,X,W Q66LH7_RAT CENP-C PNVRRSNRIRLKPLEYWRGERIDYQ 25 T 0.6 CENP-C_mid pdbhh F Eukaryota T 4x2h 3 C C G0SGL4_CHATD SER-SER-VAL-PHE-GLY-ALA-PRO-ALA MMAPANNPFGAPPAQVNNPF 20 T 2.9 NpwBP pdbhh F Eukaryota T 4x2m 1 A,B A,B G0SG92_CHATD Mtr2 MLSRRYAAKSFVEWYYRQINENKPVASGYVNNNATYTKAGHPPADITINGRVVATPEEWDTMLKEQRAQHNTSSSSTLPIGRKPVRYDVDCFDVHVINADYRFAAPQRMIEQHAPTDGVRMMMALTVSGSVYFGASPRSTDDYVIKQHFNDVFILVPNWDVLEKPGARSGRKYLIASHKYRAY 183 T 0.17 NTF2 unppercent F Eukaryota T 4x2o 3 C C G0SGL4_CHATD Putative SAC3 family protein FASPAPSNQGSSVFGAPAQST 21 T 4.1 DUF765 pdbhh F Eukaryota T 4x2v 2 E E Q80J95_9CALI NS6 Protease LEFQG 5 T 45 DUF4133 pdbhh T Viruses F 4x2v 3 F F NS6 Protease XXXXXXX 7 F F F 4x34 2 C,D C,D P53_HUMAN THR-SER-ARG-HIS-ALY-MLY-LEU-MET-PHE-LYS TSRHXKLMFK 10 T 21 DUF420 pdbhh F Eukaryota T 4x3e 2 B B ALA-GLN-ARG-M3L-PHE-ALA-GLN-SER RLQAQRKFAQSQY 13 T 28 DUF4395 pdbhh F T 4x3h 2 B B CCG2_MOUSE NEURONAL VOLTAGE-GATED CALCIUM CHANNEL GAMMA-2 SUBUNIT, STARGAZIN, TRANSMEMBRANE AMPAR REGULATORY PROTEIN GAMMA-2, TARP GAMMA-2 RIPSYRYRY 9 T 0.34 TOC159_MAD pdbhh F Eukaryota F 4x3i 2 B B KCC2A_MOUSE CAM KINASE II SUBUNIT ALPHA, CAMK-II SUBUNIT ALPHA ATRNFSG 7 T 1.7 IER unppercent F Eukaryota T 4x3o 2 B C peptide PRO-LYS-LYS-THR-GLY PKKTG 5 T 140 DUF3924 pdbhh F F 4x3p 2 B C peptide PRO-LYS-LYS-THR-GLY PKKTG 5 T 140 DUF3924 pdbhh F F 4x5k 2 B B ACE-MMAS XMMAS 5 T 200 EGL-1 pdbhh F F 4x6s 2 C,D L,M Phosphotyrosine mimetic inhibitor peptide G7-TEM1 WFEGXDNTFPX 11 T 0.65 Caf4 pdbhh F T 4x6z 15 CA,DA a,e synthetic peptide (polymer) RRRPRPPYLPRFG 13 T 6.4 TAF8_C pdbhh F T 4x86 2 B B BAG6_HUMAN BAG FAMILY MOLECULAR CHAPERONE REGULATOR 6,BCL2-ASSOCIATED ATHANOGENE 6,BAG6,HLA-B-ASSOCIATED TRANSCRIPT 3,PROTEIN G3,PROTEIN SCYTHE GPLGSAKRRKTMQGEGPQLLLSEAVSRAAKAAGARPLTSPESLSRDLEAPEVQESYRQQLRSDIQKRLQEDPNYSPQRFPN 81 T 0.038 Phosducin pdbpssm F Eukaryota T 4x8n 2 B B RBBP5_HUMAN RBBP-5,RETINOBLASTOMA-BINDING PROTEIN RBQ-3 ERESEFDIED 10 T 0.014 DUF2457 unppercent F Eukaryota T 4x8p 2 B B RBBP5_HUMAN RBBP-5,RETINOBLASTOMA-BINDING PROTEIN RBQ-3 EYEERESEFDIE 12 T 0.014 DUF2457 unppercent F Eukaryota T 4x8w 2 H H Loquacious AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 48 T 7800 Porin_4 pdbhh F F 4x9r 2 B B 1-(1-METHYLETHYL)-1H-BENZIMIDAZOLE-2-SULFONIC ACID XLXSTX 6 T 500 Engrail_1_C_sig pdbhh F F 4x9v 2 B B 1-[3-(2,4-DIAMINO-6-METHYLQUINAZOLIN-7-YL)PHENYL]ETHANONE XLXSTX 6 T 500 Engrail_1_C_sig pdbhh F F 4x9w 2 B B (4AS)-5-[(2,4-DIAMINOPTERIDIN-6-YL)METHYL]-4A,5-DIHYDRO-2H-DIBENZO[B,F]AZEPIN-8-OL LXSTX 5 T 1200 PDEase_I_N pdbhh F F 4x9z 1 A,B A,B CDKA_CONGR alphaD-conotoxin GeXXA from the venom of Conus generalis DVHRPCQSVRPGRVWGKCCLTRLCSTMCCARADCTCVYHTWRGHGCSCVM 50 T 12 Tachystatin_A pdbhh F Eukaryota T 4xa9 2 B,D,F,H,J,L,N,P a,b,c,d,e,f,g,h Q5ZWY9_LEGPH Uncharacterized protein GMAIAPQQIQERLKQEQYQKFVVADIGNFPHCLAQTPEGIASGQRYQKYSTNSLSRTPPFSQWGAPQLLTPKSAQEYIKFAQQRNKKSSFKIDGEAVRVSECSNFAYHSAGVLLDDPQIRTQYDVAVIGSMHSNGRYLHNITLLVPKGSRLPQPPQQLTAEVFPIGTLIVDPWAVGMGHPPEQALAIPKEQFAYNRSLFPATVNYQSALDESLTSTRTGQLTPYTGTPSRT 231 T 0.55 eIF_4EBP pdbpssm F Bacteria T 4xal 2 B B peptide SSGVDL SSGVDL 6 T 29 NAD_synthase pdbhh F F 4xc2 2 E,F,G,H E,F,G,H KBTB6_HUMAN Kelch repeat and BTB domain-containing protein 6 SDDDFWVRVAP 11 T 0.5 BSD pdbhh F Eukaryota T 4xdn 2 B B SCC2_YEAST Sister chromatid cohesion protein 2 MKSSHHHHHHENLYFQSNAMSYPGKDKNIPGRIIEALEDLPLSYLVPKDGLAALVNAPMRVSLPFDKTIFTSADDGRDVNINVLGTANSTTSSIKNEAEKERLVFKRPSNFTSSANSVDYVPTNFLEGLSPLAQSVLSTHKGLNDSINIEKKSEIVSRPEAKHKLESVTSNAGNLSFNDNSSNKKTKTSTGVTMTQANLA 200 T 0.64 DUF3910 pdbpercent F Eukaryota T 4xef 2 B,C,E,F B,C,E,F LPXN_HUMAN 20-mer peptide containing LD1 motif of leupaxin MEELDALLEELERSTLQDSD 20 T 1.8 Paxillin unphh F Eukaryota T 4xek 2 B C LPXN_HUMAN 19-mer peptide containing Leupaxin LD4 motif KTSAAAQLDELMAHLTEMQ 19 T 0.057 GET2 unppssm F Eukaryota T 4xfn 1 A,B A,B Amyloid forming peptide AEVVFT AEVVFT 6 T 6.2 IgaA pdbhh F F 4xfo 1 A A Amyloid-forming peptide TAVVTN TAVVTN 6 T 110 Tox-REase-2 pdbhh F F 4xgc 5 E G Origin recognition complex subunit 2 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 41 F F F 4xh2 3 M,N,O,P,Q,R a,c,e,g,h,j PAXI_HUMAN paxillin LD4 WGGSATRELDELMASLSD 18 T 1.8 SAM_LFY pdbhh F Eukaryota T 4xhv 2 B B Q3KN41_DROME Neurexin 1 DSKDVKEWYV 10 T 12 DUF3929 pdbhh F Eukaryota T 4xi7 2 B C JAG1_HUMAN Jagged 1 N-box peptide NQIKNPIEKHG 11 T 0.036 SID-1_RNA_chan unppssm F Eukaryota T 4xib 2 B C DL_DROME Delta N-box peptide NIIKNTWDKSV 11 T 0.16 DAG1 unphh F Eukaryota T 4xif 2 E,F,G,H E,F,G,H K2C7_HUMAN CYTOKERATIN-7,CK-7,KERATIN-7,K7,SARCOLECTIN,TYPE-II KERATIN KB7 GPVFTSRSAAG 11 T 0.047 Keratin_2_head unppssm F Eukaryota T 4xmh 2 B B GLY-GLY-GLY GGG 3 T 79 FTCD_C pdbhh F F 4xng 1 A,B,C,D A,B,C,D Y218A_MYCGE Uncharacterized protein MG218.1 ASSFHNFSKETLQKQAKRGFLLLERCSLVGLQQLELEYVNLLGRSFDSYQQKTELLNNLKELVDEHFSDTEKIINTLEKIFDVIGGSEYTPVLNSFFNKLLSDPDPMQREIGLRQFIITLRQRFKKLSQKIDSSLKQIETEAKA 144 T 0.51 DUF1043 pdb F Bacteria T 4xoj 2 B B SFTI1_HELAN SFTI-1 GRCTKSIPPICFP 13 T 0.0023 Bowman-Birk_leg pdb F Eukaryota T 4xpm 1 A A MEH1_YEAST EGO COMPLEX SUBUNIT 1,GSE COMPLEX SUBUNIT 2 SPDSAKISKEQLKKLHSNILNEIFSQSQVNKPGPLTVPF 39 T 0.15 SDA1 unppercent F Eukaryota T 4xst 2 B F INSR_RAT IR ESSFRKTFEDYLHNVVFVPRKTS 23 T 3.7 YvbH_ext pdbhh F Eukaryota T 4xul 1 A A G5CQN7_9VIRU mg662 GSSHHHHHHSLEVLFQGPGSLIYTYKLEKYVRTKIFPKILLIPDKNRYIIKGSFRRRVPFVTDIDVVNNVYPEISRENIYDEIIKLVNNIQSDPNIILAYLSCGTDERFKISTGSSKELSNIQSLLPDNEKNEFQLVLNKYYNDQQKKLFFLNELIWDHYKLRWKPEDVLIGSMNLANNVSVNFRETVENNSTILLQYYVKLGSYPVGIDVVINYQKIDLTPAYKNAALYQLQLANYSREYYYMLFPLRYYFKNNQDISQRLENIIEKKYGLYKQLMVRIDDYHTLYKSGNLKIDMATNIVIGILRDIEKLPGFESDTIYQIKKVATNNSPSIKIEEWDILLKVLYQEINTAVNNKSRKYFYRYIAMVPPQDRSKNYISENQDMRLKMVN 390 T 0.26 DNA_pol_B_palm unp T Viruses T 4xvn 1 A,B,C,D,E,F A,B,C,D,E,F TERS_BPG20 Small terminase GSHMSVSFRDRVLKLYLLGFDPSEIAQTLSLDVKRKVTEEEVLHVLAEARELLSALPSLEDIRAEVGQALERARIFQKDLLAIYQNMLRNYNAMMEGLTEHPDGTPVIGVRPADIAAMADRIMKIDQERITALLNSLKVLGHVGSTTAGALPSATELVSVEELVAEVVDEAPKT 174 T 0.031 PLU-1 unppssm T Viruses T 4xvu 4 M,N g,a Nyv1 TMD XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 37 F F F 4xwo 4 AA,BA,Y,Z m,s,a,g Sec22 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 41 F F F 4xxc 3 C B ASP-GLU-LEU-GLU-ILE-LYS-ALA-TYR DELEIKAY 8 T 1.6 Wyosine_form pdbhh F T 4xzr 1 A A SRC1_YEAST HELIX-EXTENSION-HELIX DOMAIN-CONTAINING PROTEIN 1 SDTRKKRKDPDSDDWSESNSKENKIDNKHLNLLSSDSEIEQDYQKAKKRKTSDL 54 T 0.092 Nop14 unp F Eukaryota T 4xzx 1 A A Q8VSD5_SHIFL OSPI GPLGSPEFMINGVSLQGTAGYEAHTEEGNVNVKKLLESLNSKSLGDMDKDSELAATLQKMINPSGGDGNSSGCALHACMAMLGYGVREAPVPNEISEYMTGFFHRHLEQIDSEGIVSHPNETYSKFRERIAENILQNTSKGSVVMISIEQATHWIAGFNDGEKIMFLDVQTGKGFNLYDPVEKSPDAFVDENSSVQVIHVSDQEFDHYANSSSWKSKRLC 220 T 0.031 Gln_amidase pdbpercent F Bacteria T 4y18 2 I,J,K,L,M,N,O,P I,J,K,L,M,N,O,P F175A_HUMAN COILED-COIL DOMAIN-CONTAINING PROTEIN 98,PROTEIN FAM175A GFGEYSRSPTF 11 T 0.48 PipA pdbhh F Eukaryota T 4y1c 2 C C Cyclic hexapeptide cyc[NdPopPKID] KXDNXP 6 T 140 Pox_Rif pdbhh F F 4y1d 2 C D Cyclic hexapeptide cyc[NdPopPKID] KXDNXX 6 T 200 DUF2701 pdbhh F F 4y29 2 B B NCOA1_HUMAN NCOA-1,CLASS E BASIC HELIX-LOOP-HELIX PROTEIN 74,BHLHE74,PROTEIN HIN-2,RIP160,RENAL CARCINOMA ANTIGEN NY-REN-52,STEROID RECEPTOR COACTIVATOR 1,SRC-1 KSLLQQLLTE 10 T 7.3 E3_UbLigase_RBR pdbhh F Eukaryota T 4y2g 2 B B F175A_HUMAN COILED-COIL DOMAIN-CONTAINING PROTEIN 98,PROTEIN FAM175A YSRSPTF 7 T 0.082 PipA pdbhh F Eukaryota T 4y32 2 C,D C,D TAU_HUMAN ARG-THR-PRO-SEP-LEU-PRO-CNC(C(C)O)C(=O)N1CCCC1CCOC RTPSLPT 7 T 1.3 UPF0167 pdbhh F Eukaryota F 4y3b 2 C,D C,D TAU_HUMAN ARG-THR-PRO-SEP-LEU-PRO-THR-[H][C@@]1(C(C2=CC=CC=C2)C3=CC=CC=C3)CCCN1C RTPSLPT 7 T 1.3 UPF0167 pdbhh F Eukaryota F 4y3u 3 C C Cardiac phospholamban XXXXXXXXXXXXXXXXXXXXXXX 23 F F F 4y5i 2 C,D F,G TAU_HUMAN NEUROFIBRILLARY TANGLE PROTEIN,PAIRED HELICAL FILAMENT-TAU,PHF-TAU XRTPSLPTX 9 T 2.9 UPF0167 pdbhh F Eukaryota T 4y69 15 CA,DA c,d Ac-PAD-ep XAXX 4 T 1000 DUF333 pdbhh F F 4y6a 15 CA,DA c,d Ac-PAD-ep XAXX 4 T 1000 DUF333 pdbhh F F 4y6o 2 C,D C,D TNFA_HUMAN peptide LEU-PRO-LYS-MYK-THR-GLY-GLY LPKXTGG 7 T 15 SpoV pdbhh F Eukaryota T 4y6v 15 CA,DA c,d Ac-PAE-ep XAXX 4 T 540 DUF2760 pdbhh F F 4y6z 15 CA,DA,EA,FA,GA,HA c,d,e,f,g,h Ac-PAL-ep XPAXX 5 T 1400 zf-RING_11 pdbhh F F 4y70 15 CA,DA,EA,FA e,f,g,h Ac-LAV-ep XLAXX 5 T 2000 zf-C2H2_6 pdbhh F F 4y74 15 CA,DA,EA,FA,GA,HA c,d,e,f,g,h Ac-LAL-ep XLAXX 5 T 2000 zf-C2H2_6 pdbhh F F 4y75 15 CA,DA,EA,FA c,d,e,f Ac-PAF-ep XPAXX 5 T 280 RPOL_N pdbhh F F 4y77 15 CA,DA,EA,FA,GA,HA c,d,e,f,g,h Ac-LAF-ep XLAXX 5 T 520 DUF5604 pdbhh F F 4y78 15 CA,DA,EA,FA,GA,HA 1,2,3,4,5,6 Ac-LAD-ep XLAXX 5 T 2000 zf-C2H2_6 pdbhh F F 4y7w 15 CA,DA,EA,FA,GA,HA c,d,e,f,g,h Ac-LAE-ep XLAXX 5 T 1000 SEC-C pdbhh F F 4y7x 15 CA,DA c,d Ac-PPA-ep XPAXX 5 T 1400 zf-RING_11 pdbhh F F 4y7y 15 CA,DA,EA,FA c,d,e,f Ac-LAA-ep XLAXX 5 T 2000 zf-C2H2_6 pdbhh F F 4y80 15 CA,DA,EA,FA,GA,HA c,d,e,f,g,h Ac-LAI-ep XLAXX 5 T 2000 zf-C2H2_6 pdbhh F F 4y81 15 CA,DA,EA,FA c,d,e,f Ac-PAY-ep XPAXX 5 T 1400 zf-RING_11 pdbhh F F 4y82 15 CA,DA,EA,FA,GA,HA c,d,e,f,g,h Ac-LAY-ep XLAXX 5 T 2000 zf-C2H2_6 pdbhh F F 4y84 15 CA,DA,EA,FA,GA,HA e,f,g,h,i,j N3-A(4,4-F2P)nLL-ep XXAXXX 6 T 2600 zf-C2H2 pdbhh F F 4y8g 15 CA,DA,EA,FA,GA,HA c,d,e,f,g,h N3-APnLL-ep XXAPXX 6 T 940 zinc_ribbon_2 pdbhh F F 4y8h 15 CA,DA,EA,FA,GA,HA c,d,e,f,g,h N3-APAL-ep XXAPAX 6 T 1300 zinc_ribbon_2 pdbhh F F 4y8i 15 CA,DA,EA,FA,GA,HA c,d,e,f,g,h Ac-PLL-ep XPLXX 5 T 940 Fer4_6 pdbhh F F 4y8j 15 CA,DA,EA,FA,GA,HA c,d,e,f,g,h Ac-PLL-ep XLLXX 5 T 2000 EF-hand_1 pdbhh F F 4y8k 15 CA,DA,EA,FA c,d,e,f H-APLL-ep APLXX 5 T 690 Fer4_6 pdbhh F F 4y8l 15 CA,DA,EA,FA c,d,e,f Ac-APLL-ep XAPLX 5 T 790 zinc_ribbon_2 pdbhh F F 4y8n 15 CA,DA c,d Ac-PAE-ep XPAXX 5 T 800 cEGF pdbhh F F 4y8o 15 CA,DA,EA,FA c,d,e,f Ac-PAF-ep XPAXX 5 T 280 RPOL_N pdbhh F F 4y8p 15 CA,DA,EA,FA,GA,HA c,d,e,f,g,h Ac-PAL-ep XPAXX 5 T 1400 zf-RING_11 pdbhh F F 4y8q 15 CA,DA,EA,FA c,d,e,f Ac-PAY-ep XPAXX 5 T 1400 zf-RING_11 pdbhh F F 4y8s 15 CA,DA,EA,FA,GA,HA c,d,e,f,g,h Ac-LAE-ep XLAXX 5 T 1000 SEC-C pdbhh F F 4y8t 15 CA,DA c,d Ac-PAE-ep XPAXX 5 T 800 cEGF pdbhh F F 4y8u 15 CA,DA 1,2 Ac-PAD-ep XPAXX 5 T 1400 zf-RING_11 pdbhh F F 4y9w 2 B B PEPTIDE XVVXAX 6 T 1700 FAM60A pdbhh F F 4y9z 15 CA,DA,EA,FA,GA,HA 1,2,3,4,5,6 Ac-LAE-ep XLAXX 5 T 1000 SEC-C pdbhh F F 4ya0 15 CA,DA 1,2 Ac-PAE-ep XPAXX 5 T 800 cEGF pdbhh F F 4ya2 15 CA,DA,EA,FA,GA,HA c,d,e,f,g,h Ac-LAE-ep XLAXX 5 T 1000 SEC-C pdbhh F F 4ya3 15 CA,DA c,d Ac-PAE-ep XPAXX 5 T 800 cEGF pdbhh F F 4ya5 15 CA,DA c,d Ac-PAE-ep XPAXX 5 T 800 cEGF pdbhh F F 4ya7 15 CA,DA,EA,FA,GA,HA c,d,e,f,g,h Ac-LAE-ep XLAXX 5 T 1000 SEC-C pdbhh F F 4ya9 15 CA,DA,EA,FA,GA,HA c,d,e,f,g,h Ac-LAD-ep XLAXX 5 T 2000 zf-C2H2_6 pdbhh F F 4ycz 3 C C G2Q2S2_MYCTT Nup120 GPGSEFELMQGGSSTNHETAGLRTEMLSRLFTAATSISHFEEAHSALLSMDDEAMQKSYLRRLVEKMCETGQSSELITLPFSGLQTKVDDILVEKCRATRDVLNGVPYHQILYAWRINHNDYRGGAAILLDRLQKLRRAGEGDKVIANEHGNEDALDTQVTRQYLLLINALSCVPPQEAYILEDVLPGDGRGGDDADGDRNGGKAGDDLEADIDELEKKLDVEGGADAAKGDEMAAEEDAALIEKMKRFSTRNGQNLPARRLLMLADLRKQYQQELDRIVAIQNNQFGFGAEDDLMDLAGGSGHHHHHHHHHH 313 T 0.02 ELYS pdbpercent F Eukaryota T 4yec 3 C C Peptide inhibitor Ac-VLTK-AOMK XVLTX 5 T 1200 DUF592 pdbhh F F 4ygx 3 E E cis peptidomimetic CTD phospho-Ser5 peptide XSPYSPTXSYSX 12 T 12 Glyco_hydro_77 pdbhh F F 4ygy 2 C,D C,D peptidomimetic CTD phospho-Ser5 peptide XSPYSPTXSYSX 12 T 12 Glyco_hydro_77 pdbhh F F 4yh1 2 C,D C,D A small phosphatase 1 XSPYSPTXSYSX 12 T 12 Glyco_hydro_77 pdbhh F F 4yh8 2 B B U2AF2_SCHPO U2 AUXILIARY FACTOR 59 KDA SUBUNIT,U2AF59,U2 SNRNP AUXILIARY FACTOR LARGE SUBUNIT GGSSVGRSRSPPPSRERSVRSIEQELEQLRDVTPINQWKRKRSLWDIKPPGYELVTADQAKMSGVFPLPGA 71 T 9.8 Transformer unphh F Eukaryota T 4yiz 2 D,E,F B,D,F U6KQJ2_EIMTE Rhoptry neck protein 2, putative GSASDITQHLNDSGLGPAVECLENLVVGPVCPAAVVAPAV 40 T 8.5 LisH_TPL pdbhh F Eukaryota T 4yl6 2 B B M3K3_HUMAN MAPK/ERK KINASE KINASE 3,MEKK 3 MDEQEALNSIMNDLVALQMNRR 22 T 4.5 DUF3040 pdbhh F Eukaryota T 4yl8 2 B B CRB_DROME 95F GPGSEFRNKRATRGTYSPSAQEYCNPRLEMDNVLKPPPEERLI 43 T 0.031 TMEM154 unphh F Eukaryota T 4ym4 2 B B TIFA_HUMAN THR9 PHOSPHORYLATED N-TERMINAL PEPTIDE MTSFEDADTEET 12 T 120 Soyouz_module pdbhh F Eukaryota T 4ynh 1 A,B A,B SAS5_CAEEL SAS-5 GPLGSKIASAREVIKRDGVIPPEALTIIEQRLRSDPMFRQQIDNVLADAECDANRAAYSP 60 T 0.17 T3SS_needle_E pdbhh F Eukaryota T 4ynk 2 B C MED1_HUMAN Coactivator peptide drip from cDNA FLJ50196, highly similar to Peroxisome proliferator-activated receptor-binding protein KNHPMLMNLLKDN 13 T 3.8 Gb3_synth pdbhh F Eukaryota T 4ynl 2 C,D,G,H D,C,P,R PATS_NOSS1 Heterocyst inhibition-signaling peptide ERGSGR 6 T 3.8 Slx4 unphh F Bacteria F 4ynn 2 I I Octapeptide XXXXXXXX 8 F F F 4ynn 3 J,L J,L Hexa-peptide XXXXXX 6 F F F 4ynn 4 K,M,N,O,P K,M,N,O,P Hepta-peptide XXXXXXX 7 F F F 4ynn 5 Q,R,S,T,U,V Q,R,S,T,U,V UNK-UNK-UNK XXX 3 F F F 4yom 2 B A BRSK2_MOUSE SADA,SERINE/THREONINE-PROTEIN KINASE SAD-A MKKSWFGNFINLEKEEQIFVVIKDKPLSSIKADIVHAFLSIPSLSHSVISQTSFRAEYKATGGPAVFQKPVKFQVDITYTEGGEAQKENGIYSVTFTLLSGPSRRFKRVVETIQAQLLSTHDQPSAQHLSGIIPKSLEHHHHHH 144 T 0.12 Fungal_KA1 pdb F Eukaryota T 4yr6 3 C,F C,F GP1BA_HUMAN ACE-LYS-LEU-ARG-GLY-VAL-LEU-GLN-GLY-HIS-LEU XKLRGVLQGHL 11 T 9.8 Pinin_SDK_memA pdbhh F Eukaryota T 4ysi 2 B B VIRF1_HHV8P SER-PRO-GLY-GLU-GLY-PRO-SER-GLY SPGEGPSG 8 T 0.96 Herpes_IE1 pdbhh T Viruses F 4yuu 19 DC,HB,MA,S s2,S2,s1,S1 PEPTIDE CHAIN UNASSIGNED AAAAALALLAAALALVAVVFAVVLALFAAWAAAFAAAAFAALFLAA 46 T 12 Alph_Pro_TM pdbhh F F 4yuu 20 EC,IB,NA,T w2,W2,w1,W1 PEPTIDE CHAIN UNASSIGNED AAWFAVSAVALVVVAAVLVAVAAAA 25 T 2.2 UL42 pdbhh F T 4yv8 2 B B Lichostatinal XXSVX 5 T 530 LPD29 pdbhh F F 4yv9 2 E,F,G,H E,F,G,H Cyclosporin A XXXXXXXXVXA 11 T 9.7 DUF6090 pdbhh F F 4yvm 1 A,B A,B Q75XL3_HELPX CAGL GSHMEDITSGLKQLDSTYKETNQQVLKNLDEIFSTTSPSANDKIGKEDALNIKKAAIALRGDLALLKANFEANELFFISEDVIFKTYMSSPELLLTYMKINPLDQKTAEQQCGISDKVLVLYCEGKLKIEQEKQNIRERLETSLKAYQSNIGGTASLITASQTLVESLKNKNFIKGIRKLMLAHDKVFLNYLEKLDALEISLEQSKRQYLQERQSSKVIVK 221 T 0.0044 IDO pdbpssm F Bacteria T 4yxb 4 E E Ambiguous peptide density XXXXXX 6 F F F 4yxy 1 A,B,C,D A,B,C,D dTor_9x31L MASSHHHHHHSSGLVPRGSSMASGISVEELLKLAKAAYYSGTTVEEAYKLALKLGISVEELLKLAEAAYYSGTTVEEAYKLALKLGISVEELLKLAKAAYYSGTTVEEAYKLALKLG 117 T 0.021 T2SSF pdbpssm F T 4yy6 2 B Z H4_HUMAN Histone H4 SGRGXGGXGLG 11 T 11 Shadoo unppercent F Eukaryota F 4yyd 2 B Z H4_HUMAN Histone H4 SGRGXGGXGLG 11 T 11 Shadoo unppercent F Eukaryota F 4yyg 2 B B H4_HUMAN Histone H4 SGRGXGGXGLG 11 T 11 Shadoo unppercent F Eukaryota F 4yyh 2 C,D Z,Y H4_HUMAN Histone H4 SGRGXGGXGLG 11 T 11 Shadoo unppercent F Eukaryota F 4yyi 2 C,F C,F H4_HUMAN Histone H4 SGRGXGGXGLG 11 T 11 Shadoo unppercent F Eukaryota F 4yyj 2 C,F C,F H4_HUMAN Histone H4 SGRGXGGXGLG 11 T 11 Shadoo unppercent F Eukaryota F 4yyk 2 C,F C,F H4_HUMAN Histone H4 SGRGXGGXGLG 11 T 11 Shadoo unppercent F Eukaryota F 4yym 2 C Z H4_HUMAN Histone H4 SGRGXGGXGLG 11 T 11 Shadoo unppercent F Eukaryota F 4yyn 2 C Z H4_HUMAN Histone H4 SGRGXGGXGLG 11 T 11 Shadoo unppercent F Eukaryota F 4yyp 2 B B STIL_HUMAN TAL-1-INTERRUPTING LOCUS PROTEIN PDAYRFLTEQDRQLRLLQAQIQRLLEAQSLMP 32 T 0.091 ACT_5 pdbpercent F Eukaryota T 4yzh 2 B B CB1A_ARATH CHLOROPHYLL A-B PROTEIN 165,CAB-165,LHCII TYPE I CAB-2 RKTVAKPKGPSGSPW 15 T 2 Peptidase_S29 pdbhh F Eukaryota T 4z09 2 B C Q8IKV6_PLAF7 Rhoptry neck protein 2 CFTARMSPPQQIC 13 T 1.1 Bowman-Birk_leg pdbhh F Eukaryota T 4z0d 2 B C Q8IKV6_PLAF7 Rhoptry neck protein 2 CWTTRMSPPQQIC 13 T 0.61 Bowman-Birk_leg pdbhh F Eukaryota T 4z0e 2 B C Q8IKV6_PLAF7 Rhoptry neck protein 2 CWTTRMSPPQQIC 13 T 0.61 Bowman-Birk_leg pdbhh F Eukaryota T 4z0f 2 B C Q8IKV6_PLAF7 Rhoptry neck protein 2 CXTTRMSPPQQIC 13 T 0.61 Bowman-Birk_leg pdbhh F Eukaryota T 4z0u 2 C,D D,E SSB_ECOLI SSB-Ct Peptide WMDFDDDIPF 10 T 0.36 Phage_SSB pdbhh F Bacteria T 4z0w 1 A,B A,B PEPTAIBOL GICHIGAMIN XXPXPFXPAXXAXXLXXLXXLXG 23 T 50 DUF688 pdbhh F T 4z29 1 A,B A,B A4TUL6_9PROT Magnetotaxis protein MtxA MASWSHPQFEKGADDDDKSEPPVSMLMQVAGAVETSKGGEKWAPVTRNKFLFVGTQVRTGADGGGKLIDQNSGMAQTIGANSVVEITAAGPKAVSGSLSAPEAASGDLVAGLSNRFAEAQRYTTVRRSVKKEAADLKLRVASDITLSPTYPDLVWENMGAQYGYTLVIDGTSHAVPATSGEMVRFRVPSLTPGAHSFGVTVTEGGQAVGQTEKGGTIVWLSATEDKALVDGVARVKAASTGDEFALGNYLDSKGVTVAAMDAYRKHFASHKDDNDMRPLLIKTYNDLKLRDLRQKEALVYNEQLEGNPGFSSISAHHHHHHHHHH 325 T 3.2E-05 DUF928 unphh F Bacteria T 4z2o 2 B P HOAVI_HOEPD Hoef-peptide SVATVSESLLTE 12 T 15 Pollen_allerg_2 pdbhh F Bacteria T 4z2p 2 B,D P,C HOAVI_HOEPD Hoef-peptide (L9F) SVATVSESFLTE 12 T 12 DUF4325 pdbhh F Bacteria T 4z2v 2 B,D P,C HOAVI_HOEPD Hoef-peptide SVATVSESLLTE 12 T 15 Pollen_allerg_2 pdbhh F Bacteria T 4z33 2 C,D C,D FZD7_HUMAN LYS-GLY-GLU-THR-ALA-VAL KGETAV 6 T 190 Phage_SSB pdbhh F Eukaryota T 4z5w 2 C,D P,Q Phytosulfokine YIYTQ 5 T 56 MTHFR pdbhh F F 4z61 3 E,F P,Q PTR-ILE-PTR-THR-GLN YIYTQ 5 T 56 MTHFR pdbhh F F 4z63 2 B P Phytosulfokine YIYTQ 5 T 56 MTHFR pdbhh F F 4z64 3 C P Phytosulfokine YIYTQ 5 T 56 MTHFR pdbhh F F 4z6y 1 A,C,E,G B,G,E,A TBCD7_HUMAN CELL MIGRATION-INDUCING PROTEIN 23 GVEEKKSLEILLKDDRLDTEKLCTFSQRFPLPSMYRALVWKVLLGILPPHHESHAKVMMYRKEQYLDVLHALKVVRFVSDATPQAEVYLRMYQLESGKLPRSPSFPLEPDDEVFLAIAKAMEEMVEDSVDCYWITRRFVNQLNTKYRDSLPQLPKAFEQYLNLEDGRLLTHLRMCSAAPKLPYDLWFKRCFAGCLPESSLQRVWDKVVSGSCKILVFVAVEILLTFKIKVMALNSAEKITKFLENIPQDSSDAIVSKAIDLWHKHCGTPVHSS 273 T 1.8 RabGAP-TBC pdbpercent F Eukaryota T 4z7i 2 C,D C,D DG025 transition-state analogue enzyme inhibitor XXKHHAFSFK 10 T 18 SmaI pdbhh F T 4z7n 5 I,J G,J Tetrapeptide ALA-GLY-ASP-VAL AGDV 4 T 90 Cupin_4 pdbhh F F 4z7o 5 I,J G,I Tetrapeptide ALA-GLY-ASP-VAL AGDV 4 T 90 Cupin_4 pdbhh F F 4z7q 5 I,J G,I Tetrapeptide AGDV-NH2 AGDVX 5 T 170 PCB_OB pdbhh F F 4z80 2 B,D C,D B6KQU6_TOXGV Cytoadherence-linked asexual protein GSASQIVQNQSSLAPELSGCPPMGICMDGTIGDPIAS 37 T 0.18 MSC pdbhh F Eukaryota T 4z88 2 M,N,O,P,Q,R,S,T,U,V,W,X M,N,O,P,Q,R,S,T,U,V,W,X JIP1_DROME JIP-1,APP-LIKE-INTERACTING PROTEIN 1,APLIP1,PROTEIN EYE DEVELOPMENTAL SP512 XTRRRRKLPEIPKNKKX 17 T 17 Curto_V3 pdbhh F Eukaryota T 4z89 2 K,L,M,N,O,P,Q,R,S,T a,b,c,d,e,f,g,h,i,j CAC1A_DROME PROTEIN CACOPHONY,PROTEIN NIGHTBLIND A,PROTEIN NO-ON-TRANSIENT B,DMCA1A XIGRRLPPTPSKPSTLX 17 T 16 Oxidored-like pdbhh F Eukaryota T 4z8a 2 B B CAC1A_DROME PROTEIN CACOPHONY,PROTEIN NIGHTBLIND A,PROTEIN NO-ON-TRANSIENT B,DMCA1A XIGRRLPPTPSKPSTLX 17 T 16 Oxidored-like pdbhh F Eukaryota T 4z8c 55 CB,FD 1z,2z Oncocin VDKPPYLPRPRPPRRIYNR 19 T 0.18 Apidaecin pdbhh F T 4z8j 2 B B PTH1R_HUMAN C-terminal PDZ binding motif from parathyroid hormone receptor (PTHR) QEEWETVM 8 T 0.21 Prp19 pdbhh F Eukaryota T 4z8m 2 C,D C,D MAVS_HUMAN MAVS,CARD ADAPTER INDUCING INTERFERON BETA,CARDIF,INTERFERON BETA PROMOTER STIMULATOR PROTEIN 1,IPS-1,PUTATIVE NF-KAPPA-B-ACTIVATING PROTEIN 031N,VIRUS-INDUCED-SIGNALING ADAPTER,VISA GPCHGPEENEYKSEGTFGI 19 T 3.6 GRA6 unphh F Eukaryota T 4z8q 2 B B Q6TKR9_9XANT AvrRxo1-ORF2 MKTLTGADALEFHKKLKERNKALHASDLELALVHADAVGKERFDLEELEKICDTSDAGRLTDAKERNDIYERMYYVEYPNVMTLKEFAHIVETLFSWS 98 T 0.39 Rnk_N pdbpssm F Bacteria T 4z8t 2 B B Q6TKR9_9XANT AvrRxo1-ORF2 MKTLTGADALEFHKKLKERNKALHASDLELALVHADAVGKERFDLEELEKICDTSDAGRLTDAKERNDIYERMYYVEYPNVMTLKEFAHIVETLFSWS 98 T 0.39 Rnk_N pdbpssm F Bacteria T 4z8u 2 B B Q6TKR9_9XANT AvrRxo1-ORF2 MKTLTGADALEFHKKLKERNKALHASDLELALVHADAVGKERFDLEELEKICDTSDAGRLTDAKERNDIYERMYYVEYPNVMTLKEFAHIVETLFSWS 98 T 0.39 Rnk_N pdbpssm F Bacteria T 4z8v 2 B B Q6TKR9_9XANT AvrRxo1-ORF2 MKTLTGADALEFHKKLKERNKALHASDLELALVHADAVGKERFDLEELEKICDTSDAGRLTDAKERNDIYERMYYVEYPNVMTLKEFAHIVETLFSWS 98 T 0.39 Rnk_N pdbpssm F Bacteria T 4z96 2 B C DNMT1_HUMAN DNMT1,CXXC-TYPE ZINC FINGER PROTEIN 9,DNA METHYLTRANSFERASE HSAI,M.HSAI,MCMT SDWPNHARSPGNKGKGKGKGKGKPKSQACEPSE 33 T 37 Ribosomal_L16 pdbhh F Eukaryota T 4z97 2 B C DNMT1_HUMAN DNMT1,CXXC-TYPE ZINC FINGER PROTEIN 9,DNA METHYLTRANSFERASE HSAI,M.HSAI,MCMT SDWPNHARSPGNKGKGKGQGKGKPKSQACEPSE 33 T 38 Ribosomal_L16 pdbhh F Eukaryota T 4za1 1 A,B,C A,B,C C6FX40_STRAS NosA MTEHPAQQLYCTVVLWDLSRSAATVASLRAYLRDHAVDAYTTVPGLRQKTWISSTGPEGEQWGAVYLWDSPEAAYGRPPGVSKVVELIGYRPTERRYYSVEAATEGPAAAAAPFGKGLGLAFDPASPEPLTRPQEFVPPGADAFIPSRPPA 151 T 0.0033 ydhR pdb F Bacteria T 4zar 2 B B METHOXYSUCCINYL-ALA-ALA-PRO-PHE-CHLOROMETHYL KETONE, bound form XAAPX 5 T 170 DUF3054 pdbhh F F 4zc4 1 A,B,C,D A,B,C,D LARP1_HUMAN LA RIBONUCLEOPROTEIN DOMAIN FAMILY MEMBER 1 GHSGGGGGGHMQHPSHELLKENGFTQHVYHKYRRRCLNERKRLGIGQSQEMNTLFRFWSFFLRDHFNKKMYEEFKQLALEDAKEGYRYGLECLFRYYSYGLEKKFRLDIFKDFQEETVKDYEAGQLYGLEKFWAFLKYSKAKNLDIDPKLQEYLGKFRRLED 162 T 0.62 Frankia_peptide pdbpercent F Eukaryota T 4zdt 1 A,C A,C SLX1_SCHPO Structure-specific endonuclease subunit slx1 MEPVKCNLCYECIESDELRANCPFTDCNSINHLTCLASSFLTEECQVLPIEGMCTKCKRVLRWREFLSTVFTT 73 T 0.00013 FANCL_C pdbpercent F Eukaryota T 4zer 54 BB,DD 1y,2y Onc112 VDKPPYLPRPRPPRRIYNR 19 T 0.18 Apidaecin pdbhh F T 4zhb 2 B B 5-mer peptide VDAVN 5 T 99 OSTMP1 pdbhh F F 4zhl 2 B P mupain-1-IG CPAYSRYIGC 10 T 1.9 DUF6438 pdbhh F T 4zhm 1 A P mupain-1-16-IG CPAYSAYIGC 10 T 0.88 DUF6438 pdbhh F T 4zi1 2 B B NCOA5_HUMAN NCOA-5,COACTIVATOR INDEPENDENT OF AF-2,CIA AIESLIDLLADN 12 T 2.8 HEAT_PBS pdbhh F Eukaryota T 4zj7 2 B B CDC14_YEAST Tyrosine-protein phosphatase CDC14 TILRQLLPKNRRVTSGRRTTSAAGGIRKISGSIKK 35 T 0.074 BATS pdbpssm F Eukaryota T 4zjx 2 B B circular peptide inhibitor CXRWTKCLX 9 T 1 Brr6_like_C_C pdbhh F T 4zkn 2 B P upain-1-W3A CSARGLENHRMC 12 T 7.9 LRRNT pdbhh F T 4zko 2 B P N-terminal fragment of upain-1-W3A CSAR 4 T 54 zf-CCHC pdbhh F F 4zko 3 C Q C-terminal fragment of upain-1-W3A GLENHRMC 8 T 6.8 SARS_3b pdbhh F T 4zkq 1 A A E9M5R0_9GAMA Putative uncharacterized protein GPVGEPVASEINEASKVSSRLLTQDILFRKDRQATISLPIKLPVEDIITQTCDKITYGPLKFLDLLEKETAVLPLSTDITCPACLGRAVLVGKWECPAHVAVNESDLTVFGPNKEEHVPQFVTVQQPSDGKMQRLFFAKFLGTEESLAVLRVPGPDGHLCIQEALIHFKELSGAGVCSLWKANDSREEGLEMKQVDCLETTVLENQTCIATTLSKKIYHRLYCGERLMTGGQVSTRVLLTALGFYKRQPYTFHRVPKGMVYVHLIDSGSEDYMEYSECEEVTPGRYEDKQISYTFYTDLFQTADGEPVLASVWGTSGLKDSAYESCAFVIPTKGRRKLVPRRIMSKCYPFRLTYHPSTMTVRLDVRVEKHHGATDQGFVFLKMESGTYSEGREYYLDRVLWGEDSSTNNVLQHHHHHHHH 420 T 0.2 DUF4787 pdb T Viruses T 4zkr 2 B P upain-1-W3A CSARGLENHRMC 12 T 7.9 LRRNT pdbhh F T 4zks 2 B P upain-1-W3A CSARGLENHAAC 12 T 6.4 LRRNT pdbhh F T 4zlt 1 A,B B,A E9M5R0_9GAMA Putative uncharacterized protein GPVGEPVASEINEASKVSSRLLTQDILFRKDRQATISLPIKLPVEDIITQTCDKITYGPLKFLDLLEKETAVLPLSTDITCPACLGRAVLVGKWECPAHVAVNESDLTVFGPNKEEHVPQFVTVQQPSDGKMQRLFFAKFLGTEESLAVLRVPGPDGHLCIQEALIHFKELSGAGVCSLWKANDSREEGLEMKQVDCLETTVLENQTCIATTLSKKIYHRLYCGERLMTGGQVSTRVLLTALGFYKRQPYTFHRVPKGMVYVHLIDSGSEDYMEYSECEEVTPGRYEDKQISYTFYTDLFQTADGEPVLASVWGTSGLKDSAYESCAFVIPTDGEEDLVPRRIMSKCYPFRLTYHPSTMTVRLDVRVEKHHGATDQGFVFLKMESGTYSEGREYYLDRVLWGEDSSTNNVLQHHHHHHHH 420 T 0.2 DUF4787 pdb T Viruses T 4zmk 1 A A TAZ1_SCHPO Telomere length regulator taz1 DTFSERTLGLNSIDNTEISEVVSLGLVSSALDKITGLLSADNLSETVSQARDFSHTLSKSLKSRAKSLSQK 71 T 16 Leptin pdbhh F Eukaryota T 4znx 2 E,F,G,H E,F,G,H APP12 APPLPPRNRPRL 12 T 0.33 SCIMP pdbhh F F 4zny 2 B B POL_HTL1C T-cell leukemia virus type I, partial gag gene; HTLV1 (human T-lymphotropic virus type I) YVEPTAPQVL 10 T 19 DUF2992 pdbhh T Viruses T 4zoq 1 A,C,E,G,I,K,M,O A,B,C,D,E,F,G,H Q65DC7_BACLD LANP PROTEIN MKRIYIFLLCFAVLLPVGGKTAQAKEQAGEQYLLLEHVKDKSKLLDTAEQFHIHADVIEEIGFAKVTGEKQKLAPFTKKLAEKVGADVIEKPIANTAVNE 100 T 1 Mfp-3 unphh F Bacteria T 4zq0 2 E,F,G,H E,F,G,H A8Ap phosphopeptide ARAASAPA 8 T 520 Zein-binding pdbhh F F 4zqu 1 A A CDIA_YERPY CdiA-CT toxin, Conserved domain protein MPWEDYVGKTLPVGSRLPPNFKTYDYFDRATGAVVSAKSLDTQTMAKLSNPNQVYSSIKKNIDVTAKFEKASLSGVTVNSSMITSKEVRLAVPVNTTKAQWTEINRAIEYGKNQGVKVTVTQVK 124 T 0.068 Glyco_hydro_97 pdbpercent F Bacteria T 4zqw 2 B,D A,C CDIA4_ECO5C macrocyclic peptide SXKEYALSGRELT 13 T 0.033 Glyco_hydro_97 unppercent F Bacteria T 4zri 2 C,D C,D LATS2_HUMAN KINASE PHOSPHORYLATED DURING MITOSIS PROTEIN,LARGE TUMOR SUPPRESSOR HOMOLOG 2,SERINE/THREONINE-PROTEIN KINASE KPM,WARTS-LIKE KINASE PKFGPYQKALREIRYSLLPFANESGTSAAAEV 32 T 5.3 DUF3928 pdbhh F Eukaryota T 4zrk 2 E,F,G,H E,F,G,H LATS1_HUMAN LARGE TUMOR SUPPRESSOR HOMOLOG 1,WARTS PROTEIN KINASE,H-WARTS PKFGTHHKALQEIRNSLLPFANETNSSRSTSE 32 T 0.049 EcoEI_R_C unp F Eukaryota T 4zrl 2 B B GLD3_CAEEL GERMLINE DEVELOPMENT DEFECTIVE 3 MAHSYNPFVRSAVEYDADTRLQMAENAASARKLFVSSALKDIIVNPENFYHDFQQSAQMAEDANQRRQVSYNTKREA 77 T 0.0096 Glyco_transf_54 pdb F Eukaryota T 4zro 2 E,F,G,H E,F,G,H Bounded inhibitor of N-(tert-butoxycarbonyl)-L-seryl-L-valyl-N-{(2S)-5-ethoxy-5-oxo-1-[(3S)-2-oxopyrrolidin-3-yl]pentan-2-yl}-L-leucinamide XSVLX 5 T 1200 Pox_EPC_I2-L1 pdbhh F F 4zrt 2 B B NPHN_HUMAN GLY-PRO-LEU-PTR-ASP-GLU AWGPLXDEVQM 11 T 1.4 GIT_SHD pdbhh F Eukaryota T 4zrz 1 A,B A,B Q7Y3F3_9CAUD PlyCB SKINVNVENVSGVQGFLFHTDGKESYGYRAFINGVEIGIKDIETVQGFQQIIPSINISKSDVEAIEKAMKK 71 T 2.5 DUF3213 pdbhh T Viruses T 4ztd 2 D,E D,E TRAIP_HUMAN ALA-PHE-GLN-ALA-LYS-LEU-ASP-THR-PHE-LEU-TRP-SER AFQAKLDTFLWS 12 T 1 DNA_pol3_chi pdbhh F Eukaryota T 4ztd 3 F F ALA-GLY-ALA-GLY-ALA AGAGA 5 T 140 DUF3641 pdbhh F F 4zu1 2 B A CYSE_SALTY SAT,SERINE TRANSACETYLASE HHTFEYGDGI 10 T 3.3 DUF2023 pdbhh F Bacteria T 4zu6 2 B B CYSE_SALTY SAT,SERINE TRANSACETYLASE HHTFEYGDGI 10 T 3.3 DUF2023 pdbhh F Bacteria T 4zvo 3 E,F E,F Peptide ACE-VAL-GLU-ILE-ASJ XVEIX 5 T 650 DUF72 pdbhh F F 4zvp 3 E,F E,F Peptide ACE-ASP-GLU-VAL-ASA XDEVX 5 T 140 zf-NPL4 pdbhh F F 4zvq 3 E,F E,F Peptide ACE-VAL-GLU-ILE-ASA XVEIX 5 T 100 Ig_5 pdbhh F F 4zvr 3 E,F E,F Peptide ACE-ASP-GLU-VAL-ASJ XDEVX 5 T 570 Helicase_RecD pdbhh F F 4zvs 3 E,F E,F DEVD inhibitor XDEVX 5 T 570 Helicase_RecD pdbhh F F 4zvt 3 E,F E,F VEID inhibitor XVEIX 5 T 650 DUF72 pdbhh F F 4zvu 3 E,F E,F Tetrapeptide Inhibitor Ac-VEID-CHO XVEIX 5 T 650 DUF72 pdbhh F F 4zw2 2 B B CAC1S_MOUSE CALCIUM CHANNEL,L TYPE,ALPHA-1 POLYPEPTIDE,ISOFORM 3,SKELETAL MUSCLE,VOLTAGE-GATED CALCIUM CHANNEL SUBUNIT ALPHA CAV1.1 XQQLEEDLRGYMSWITQGEX 20 T 2.4 Antimicrobial14 pdbhh F Eukaryota T 4zxa 1 A,B,C,D A,B,C,D C1I210_PSEWB Hydroquinone dioxygenase small subunit GPGSMSNVAVNTVFASLDNFRKGTVEIISGEARHYAFSNIFEVAQNSKPYEKVVVGLNLGYVVETLRAEGQSPWFTAAHDEFAIVMDGEVRVEFLKLDAPSKHGEGTHLAGELPVGKPMGYVLLKRGHQCLLPAGSAYRFEASRPGVILQQTIKGPLSVEKWAEICLK 168 T 0.069 Ppnp unphh F Bacteria T 4zxc 1 A,B,C,D A,D,C,B C1I210_PSEWB Hydroquinone dioxygenase small subunit GPGSMSNVAVNTVFASLDNFRKGTVEIISGEARHYAFSNIFEVAQNSKPYEKVVVGLNLGYVVETLRAEGQSPWFTAAHDEFAIVMDGEVRVEFLKLDAPSKHGEGTHLAGELPVGKPMGYVLLKRGHQCLLPAGSAYRFEASRPGVILQQTIKGPLSVEKWAEICLK 168 T 0.069 Ppnp unphh F Bacteria T 4zxd 1 A,B A,B C1I210_PSEWB Hydroquinone dioxygenase small subunit GPGSMSNVAVNTVFASLDNFRKGTVEIISGEARHYAFSNIFEVAQNSKPYEKVVVGLNLGYVVETLRAEGQSPWFTAAHDEFAIVMDGEVRVEFLKLDAPSKHGEGTHLAGELPVGKPMGYVLLKRGHQCLLPAGSAYRFEASRPGVILQQTIKGPLSVEKWAEICLK 168 T 0.069 Ppnp unphh F Bacteria T 4zxl 1 A H NAG-PRO-SER-THR-ALA-Thr-O-GlcNAc containing peptide from drosophila HCF PSTA 4 T 350 DUF2254 pdbhh F F 4zya 1 A,B A,B SYNC_HUMAN ASPARAGINYL-TRNA SYNTHETASE,ASNRS GHMAELYVSDREGSDATGDGTKEKPFKTGLKALMTVGKEPFPTIYVDSQKENERWNVISKSQLKNIKKMWHREQMKS 77 T 0.086 DUF1565 pdbpssm F Eukaryota T 4zzj 2 B B P53_HUMAN Ac-p53 RHKXLXF 7 T 36 NifQ pdbhh F Eukaryota T 5a0e 2 C,D C,E JW47 XXXXXXXXVXA 11 T 8 IncD pdbhh F F 5a0n 1 A A A5MCJ6_STREE PROTEIN F2 LIKE FIBRONECTIN-BINDING PROTEIN GAMGGFPNDAKGISGNGKYYSLGQIEKLYSNQFATYNNLTVITSDTHENSDNFAFCLANGKRFPSFTDEKPKGIYTLVKDINKEQYTKLLKENHKWSSIPNLNQAWDTFSRLSYMYLKDPTDIVKRAWGTDLNTARTYFHQVIQYEIWRYTDGMRVSSDTNVYIYEKFSPQQKKALEMIRTDLYNFTVPYENLEYRFYKPDWVFGLGFQALATVRWKIEP 220 T 0.06 TED unphh F Bacteria T 5a0r 2 C,D D,E PRODUCT PEPTIDE XEVNP 5 T 27 Ins134_P3_kin pdbhh F F 5a0x 2 C,D C,D SUBSTRATE PEPTIDE XEVNPPVPX 9 T 6.7 HIF-1a_CTAD pdbhh F F 5a1q 1 A,B A,B Y1502_ARCFU AF1502 GSHMITYKKLLDELKKEIGPIAKIFLNKAMESLGYDDVDDSNYKEILSVLKMNKELREYVEIVEERLEKEG 71 T 0.0016 DUF1322 pdb F Archaea T 5a29 2 B B PECTATE LYASE HHHHHHHSSGLVPRGSHA 18 T 3000 zf-CCHC_2 pdbhh F T 5a2i 2 B P ANTIGEN TN, SER IS COVALENTLY BOUND TO GALNAC APDSRP 6 T 200 CHASE9 pdbhh F F 5a2j 2 B P THE NAKED PEPTIDE APDTRP APDTRP 6 T 170 DDE_Tnp_1_assoc pdbhh F F 5a2k 2 B P ANTIGEN TN, THR IS COVALENTLY BOUND TO GALNAC APDTRP 6 T 170 DDE_Tnp_1_assoc pdbhh F F 5a2l 2 B P MODIFIED ANTIGEN TN APDCRP 6 T 14 Mgr1 pdbhh F F 5a2q 37 KA r RL19_HUMAN RIBOSOMAL PROTEIN EL19 RRSKTKEARKRRE 13 T 4.7 SUIM_assoc pdbhh F Eukaryota T 5a2q 38 LA w RL24_HUMAN RIBOSOMAL PROTEIN EL24 XXXXXXXXXXXXXXQRAITGASLADIMAKRNQKPEVRKAQREQAIRAAKEAKKAKQASKKTA 62 T 0.15 UreF unppercent F Eukaryota T 5a31 17 S T THE ANAPHASE-PROMOTING COMPLEX CHAIN T XXXXXQLXXXXXXXXXXXXXX 21 T 4100 DUF6451 pdbhh F F 5a31 18 T U THE ANAPHASE-PROMOTING COMPLEX CHAIN U XXXXXXXXXXXXXXXXXXXXXXXX 24 F F F 5a31 19 U V THE ANAPHASE-PROMOTING COMPLEX CHAIN V AAFRIALKSVQKS 13 T 7.8 DIMCO_N pdbhh F T 5a4h 1 A A ABHD5_MOUSE ABHYDROLASE DOMAIN-CONTAINING PROTEIN 5, LIPID DROPLET-BINDING PROTEIN CGI-58, PROTEIN CGI-58, WR10_43 GAMGSVDSADAGGGSGWLTGWLPTWCPTSTSHLKEAEEK 39 T 5.8 CSN7a_helixI pdbhh F Eukaryota T 5a5u 1 A A EUKARYOTIC INITIATION FACTOR 3 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 54 F F F 5a6e 1 A A SEQUENCE LIKE A CALCIUM-ACTIVATED POTASSIUM CHANNEL SUBUNIT, SLO2.2 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 139 F F F 5a6e 4 D D SEQUENCE LIKE A CALCIUM-ACTIVATED POTASSIUM CHANNEL SUBUNIT, SLO2.2 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 43 F F F 5a6f 2 B D SEQUENCE LIKE A CALCIUM-ACTIVATED POTASSIUM CHANNEL SUBUNIT, SLO2.2 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 43 F F F 5a6g 1 A A SEQUENCE LIKE A CALCIUM-ACTIVATED POTASSIUM CHANNEL SUBUNIT, SLO2.2 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 139 F F F 5a6w 2 C C C4B8B8_MAGOR AVR-PIK PROTEIN METGNKYIEKRAIDLSRERDPNFFDHPGIPVPECFWFMFKNNVRQDAGTCYSSWKMDMKVGPNWVHIKSDDNCNLSGDFPPGWIVLGKKRPGF 93 T 0.1 TMEM18 unp F Eukaryota T 5a8l 9 I Z NASCENT CHAIN MEPLVLSAKKLSSLLTCKYIPP 22 T 3.8 BLOC1S3 pdbhh F T 5a9z 33 GA Ag Unknown peptide XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 128 F F F 5aa0 33 GA Ag Unknown peptide XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 128 F F F 5aa1 2 E E N-ACETYLGLUCOSAMINE-1,6-ANHYDRO-N-ACETYLMURAMIC ACID L-ALA-D-GLU-M-DAP-D-ALA-D-ALA XAXXXX 6 T 260 DUF2175 pdbhh F F 5aa2 4 E E N-ACETYLGLUCOSAMINE-1,6-ANHYDRO-N-ACETYLMURAMIC ACID L-ALA-D-GLU-M-DAP-D-ALA-D-ALA XAXXXX 6 T 260 DUF2175 pdbhh F F 5ab0 2 C,D E,F DG025 XXKHHAFSFX 10 T 18 SmaI pdbhh F T 5ab2 2 C,D C,D GPI GPGRAF 6 T 11 GP120 pdbhh F F 5abk 1 A A PRTV_VIBCH METALLOPROTEASE PRTV GAMAQTPIDLGVVNEDKLIEMLVRTGQIPADASDVDKRIALERYLEEKIRSGFKGDAQFGKKALEQRAKILKVIDKQKGPHKAR 84 T 0.038 SLT_2 pdbpercent F Bacteria T 5abu 2 B B MXT_DROME 4E-BINDING PROTEIN MEXTLI GPHMLESRVSYDIEHLLYYSMSPHSWTLPTDWQKMQETAPSILRNKDLQDESQRFDGDKYLASIKTAAKR 70 T 0.051 eIF_4EBP unphh F Eukaryota T 5abv 2 B,D,F,H B,D,F,H MXT_DROME 4E-BINDING PROTEIN MEXTLI GPHMLESRVSYDIEHLLYYSMSPHSWTLPTDWQKMQETAPSILRNKDLQDESQRFDGDKYLASIKTAAKR 70 T 0.051 eIF_4EBP unphh F Eukaryota T 5abx 2 B B MXT_CAEEL 4E-BINDING PROTEIN MEXTLI GPHMIRYNRDTLMTARDTKRAPIPDEMLQEINRVAPDILIA 41 T 0.072 Gal11_ABD1 pdb F Eukaryota T 5aby 2 B,D,F B,D,F MXT_CAEEL 4E-BINDING PROTEIN MEXTLI GPHMIRYNRDTLMTARDTKRAPIPDEMLQEINRVAPDILIA 41 T 0.072 Gal11_ABD1 pdb F Eukaryota T 5acz 3 C C 11MER PEPTIDE GRAEEYGADTL 11 T 4.5 DUF4156 pdbhh F T 5ad0 3 C C 11MER PEPTIDE GHAEEYGADTL 11 T 9.7 DUF4156 pdbhh F T 5adx 6 M M DYNACTIN SUBUNIT 2 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 824 F F F 5adx 7 N N DYNACTIN SUBUNIT 2 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 818 F F F 5adx 8 O,P O,P DYNACTIN SUBUNIT 3 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 88 F F F 5adx 9 Q,R Q,R DYNACTIN SUBUNIT 2 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 91 F F F 5adx 12 U Y DYNACTIN SUBUNIT 2 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 243 F F F 5adx 13 AA,V z,Z DYNACTIN SUBUNIT 2 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 419 F F F 5adx 17 Z d A0A0J9X293_PIG DYNACTIN SUBUNIT 2 MADPKYADLPGIARNEPDVY 20 T 5.8 SmAKAP pdbhh F Eukaryota T 5aei 2 D,E,F D,E,F KR5 KRKRKRKRKR 10 T 3.8 RFX5_DNA_bdg pdbhh F F 5afg 2 B B STAPLED PEPTIDE XTFAEYWAQLAS 12 T 0.071 PBP-Tp47_a pdbhh F T 5afp 2 C,D C,D RK_HUMAN RK, G PROTEIN-COUPLED RECEPTOR KINASE 1 MDFGSLETVVANSAFIAARGSFDGS 25 T 3.1 DUF5465 pdbhh F Eukaryota T 5afq 2 C,D D,E RPC32 BETA (RPC7L) XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 218 F F F 5afu 1 A 1 DYNEIN TAIL XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 361 F F F 5afu 2 B 2 DYNEIN TAIL XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 359 F F F 5afu 4 E,F 5,6 DYNEIN TAIL XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 275 F F F 5afu 10 S M DYNACTIN XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 587 F F F 5afu 11 T N DYNACTIN 6 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 616 F F F 5afu 12 U,V O,P DYNACTIN XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 65 F F F 5afu 13 W,X Q,R DYNACTIN XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 87 F F F 5afu 16 AA Y CAPZ BETA XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 243 F F F 5afu 17 BA Z F-ACTIN-CAPPING PROTEIN SUBUNIT BETA XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 52 F F F 5afu 20 EA c A0A0J9X299_PIG DYNACTIN MADPKYADLPGIARNEPDVYAAAAAAAAAAA 31 T 0.16 Dynamitin pdbhh F Eukaryota T 5afu 21 FA d A0A0J9X295_PIG DYNACTIN MADPKYADLPGIARNEPDVY 20 T 5.8 SmAKAP pdbhh F Eukaryota T 5afu 22 GA z CAPZ BETA XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 53 F F F 5afw 2 B B KDM1A_HUMAN BRAF35-HDAC COMPLEX PROTEIN BHC110, FLAVIN-CONTAINING AMINE OXIDASE DOMAIN-CONTAINING PROTEIN 2 RRTSRRKRAKVE 12 T 5.8 DUF3306 pdbhh F Eukaryota T 5agu 2 C,D C,D GRISELIMYCIN XXXXLXLXPXG 11 T 15 NOP19 pdbhh F F 5agv 2 C,D C,D CYCLOHEXYL GRISELIMYCIN XXXXLXLXXXG 11 T 15 NOP19 pdbhh F F 5ah2 2 E,F,G,H E,F,G,H GRISELIMYCIN XXXXLXLXPXG 11 T 15 NOP19 pdbhh F F 5ah4 2 C,D C,D CYCLOHEXYL GRISELIMYCIN XXXXLXLXXXG 11 T 15 NOP19 pdbhh F F 5aiw 1 A A A0A140UHJ9_ENTFL TRAH SNTNQSESEKIIKEFYKTVYNYEKSQKEISMTTVKELATDNVYQELQNEINVNNSYSPQQNTIQKSSVNENEIKILAYESKDNSQQYLVTAPIHQVFNGTKNDFEINQLIQIKNQKITQRTTIQLGEE 128 T 0.0033 Tim44 pdbpssm F Bacteria T 5aj0 85 HC By Nascent protein chain XXXXXXXXXXXXXXXXXXXXXXXX 24 F F F 5aj1 1 A A SNF5_HUMAN BRG1-ASSOCIATED FACTOR 47, BAF47, INTEGRASE INTERACTOR 1 PROTEIN, SNF5 HOMOLOG, HSNF5, SMARC B1 DOMAIN GGSMMMALSKTFGQKPVKFQLEDDGEFYMIGSEVGNYLRMFRGSLYKRYPSLWRRLATVEERKKIVASSHGKKTKPNTKDHGYTTLATSVTLLKASEVEEILDGNDEKYKAVSIS 115 T 0.00076 CRC_subunit pdbhh F Eukaryota T 5aj3 16 P T MITORIBOSOMAL PROTEIN BL19M, MRPL19 XXXXXXXXXXXXXX 14 F F F 5aj3 24 Y e MITORIBOSOMAL PROTEIN MS27, MRPS27 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 336 F F F 5aj3 35 JA s UNASSIGNED HELICES XXXXXXXXXXXXXXXX 16 F F F 5aj3 36 KA z UNASSIGNED HELICES XXXXXXXXXXXXXXXXX 17 F F F 5aj4 23 X Ae MITORIBOSOMAL PROTEIN MS27, MRPS27 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 336 F F F 5aj4 32 GA Ao K7GKS8_PIG MITORIBOSOMAL PROTEIN MS39, MRPS39 MAAVASARWLGVRSGLCLPLTGRRVGPCGRTPRSRFYSGSAAHPEVEGANVTGIEEVVIPKKKTWDKVAILQALASTVHRDSTAAPYVFQDDPYLIPTSSVESHSFLLAKKSGENAAKFIINSYPKYFQKDIAEPHIPCLMPEXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 530 T 46 RBR pdbpssm F Eukaryota T 5aj4 34 IA As UNASSIGNED HELICES XXXXXXXXXXXXXXXX 16 F F F 5aj4 35 JA Az UNASSIGNED HELICES XXXXXXXXXXXXXXXXX 17 F F F 5aj4 88 KC Bz UNASSIGNED SECONDARY STRUCTURE ELEMENTS XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 94 F F F 5ajd 2 B,D,F,H,J,L B,D,F,H,J,L NOT4_YEAST MODULATOR OF TRANSCRIPTION 2, NOT4 GPDSMDPYDALGNAVDFLDARLHSLSNYQKRPISIKSNIIDEETYKKYPSLFSWDKIEASKKSDN 65 T 0.71 DUF3140 unppercent F Eukaryota T 5ajn 2 B P MUC5A_HUMAN MUC5AC GTTPSPVPTTSTCSAA 16 T 5.4 TYW3 pdbhh F Eukaryota T 5ajo 2 B B MUC5A_HUMAN MUC5AC AGTTPSPVPTTSTTSAA 17 T 9.2 Inhibitor_I53 pdbhh F Eukaryota T 5ajp 2 B B MUC5A_HUMAN MUCIN AGTTPSPVPTTSTTSAA 17 T 9.2 Inhibitor_I53 pdbhh F Eukaryota T 5aka 7 G 6 ALA-ALA-ALA-ALA-ALA-ALA-ALA-ALA AAAAAAAA 8 T 280 Androgen_recep pdbhh F F 5amt 1 A,B A,B A0Q7H2_FRATN IGLE MGSSHHHHHHSSGLVPRGSHGSHMDGLYINNNIPKTKIVLESKPDKNIFYSDNYQSISQRIYDDNVKVLNLKTGKNEFPLDKDIKDYALYFILPENKKTENWKYLISSDSVNEFTIKNDSSIEKD 125 T 0.0061 DUF5006 unphh F Bacteria T 5amu 1 A,B A,B A0Q7H2_FRATN IGLE MGSSHHHHHHSSGLVPRGSHMAIMDGLYINNNIPKTKIVLESKPDKNIFYSDNYQSISQRAADDNVKALNLKTGKNEFPLDKDIKDYALYFILPENKKTENWKYLISSDSVNEFTIKNDSSIEKD 125 T 0.0061 DUF5006 unphh F Bacteria T 5aoq 1 A,B A,B TORSO_BOMMO RECEPTOR TYROSINE KINASE TORSO HHHHHHHHGEVVSQRYPPAPGLLKYLEQDVCYSLYYYLNWTSLADCKTNFEETGISDVPSTVKVRCQSKNSIRFETEPSEHWQLFILMEHDNFDPIPFTLIEPNNVFGELITTANKEYQIWSTYLDEYGTLQDWMEGPIVLKFDQRNQQPDDIKYNVTQEFKYIILGNDSYTINGKFVWNTTGDRDLCFDIANICQNTNMKHAKIWPTAHPSFDVENLVLNDECEIHVKGIHGTTKHKYKTPSCFELPECFLNNMEPEIPQDVAIAADQDLR 272 T 0.00078 fn3 unppercent F Eukaryota T 5aot 1 A A A0A1A9TAF4_RUMFL CBM74-RFGH5 MGAEEEDTAILYPFTISGNDRNGNFTINFKGTPNSTNNGCIGYSYNGDWEKIEWEGSCDGNGNLVVEVPMSKIPAGVTSGEIQIWWHSGDLKMTDYKALEHHHHHH 106 T 4.3 DUF5766 pdbhh F Bacteria T 5apn 47 UA z ALB1 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 85 F F F 5apo 47 UA z ALB1 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 95 F F F 5apr 2 B I PEPSTATIN-LIKE RENIN INHIBITOR HPFCXLFX 8 T 3.9 RNR_inhib pdbhh F T 5aul 2 B B CD28_HUMAN TP44 SDXMNMTP 8 T 0.089 DUF2207 unppercent F Eukaryota T 5aum 3 E,F C,D peptide RENLYFQGKDG RENLYFQGKDG 11 T 3 PNPase_C pdbhh F T 5avp 1 A,B,C,D A,B,C,D D2S5K0_GEOOG UNCHARACTERIZED PROTEIN MNHKVHHHHHHIEGRHMEGLLARTSVTRREYDEWLNEAAALGRALRYPVRPEMVNDSAGIVFGEDQYDAFENGLWSREPYEAMVIFESLNEPAVDGLPAAGAPFAEYSGLCDKLMIVHPGKFCPPHYHQRKTESYEVVLGEMELFYSPKPVQVGEEEVLSFTGMHEGSPWPDGVALPIGREESYAALTSYRRLRVGDPKFVMHRKHLHAFRCPADSDVPLVVREVSTYSHEPTEEAADKAAPLPDWAGLHDNSFVAAAANSGRLRTAIQ 269 T 0.0036 Cupin_2 unp F Bacteria T 5awj 2 B C MED1_HUMAN Mediator of RNA polymerase II transcription subunit 1 KNHPMLMNLLKDN 13 T 3.8 Gb3_synth pdbhh F Eukaryota T 5awk 2 B C MED1_HUMAN Mediator of RNA polymerase II transcription subunit 1 KNHPMLMNLLKDN 13 T 3.8 Gb3_synth pdbhh F Eukaryota T 5awl 1 A A A mutant of Chignolin, CLN025 YYDPETGTWY 10 T 0.011 OCRE pdb F T 5awt 2 B B EPS15_HUMAN EPS15 YDPFGGDPFKG 11 T 0.46 CsgF pdbhh F Eukaryota T 5awu 2 B B EPS15_HUMAN EPS15 YDPFKGSDPFA 11 T 1.5 MT-A70 pdbhh F Eukaryota T 5awv 2 E,F,G,H,I,J,K,L I,J,K,L,M,N,O,P TEICOPLANIN XXXXXXX 7 T 1.5 Defensin_2 pdbhh F F 5ax6 1 A A Q93I73_ECOLX CofB GPDEARRQIVSNALISEIAGIVDFVAEEQITVIEQGIEKEITNPLYEQSSGIPYINRTTNKDLNSTMSTNASEFINWGAGTSTRIFFTRKYCISTGTQGNYEFSKDYIPCEEPAILSNSDLKIDRIDFVATDNTVGSAIERVDFILTFDKSNANESFYFSNYVSSLEKAAEQHSISFKDIYVVERNSSGAAGWRLTTISGKPLTFSGLSKNIGSLDKTKNYGLRLSIDPNLGKFLRADGRVGADKLCWNIDNKMSGPCLAADDSGNNLVLTKGKGAKSNEPGLCWDLNTGTSKLCLTQIEGKDNNDKDASLIKLKDDNGNPATMLANILVEEKSMTDSTKKELRTIPNTIYAAFSNSNASDLVITNPGNYIGNVTSEKGRIELNVQDCPVSPDGNKLHPRLSASIASIVADTKDSNGKYQADFSSLAGNRNSGGQLGYLSGTAIQVNQSGSKWYITATMGVFDPLTNTTYVYLNPKFLSVNITTWCSTEPQT 492 T 2.1 PulG unphh F Bacteria T 5axi 2 D E IRS1_MOUSE Cblin DGXMP 5 T 22 BTRD1 pdbhh F Eukaryota F 5az8 2 B B ALDH2_RAT peptide GPRLSRLLSYAGC GPRLSRLLSYAGX 13 T 8.8 TFIID_30kDa pdbhh F Eukaryota T 5azg 2 C,D C,D UNC51_CAEEL UNC-51 AIM YQESTDFTFL 10 T 15 DUF1957 pdbhh F Eukaryota T 5b0u 1 A,B A,B LUCI_OPLGR 19 KDA PROTEIN OF OPLOPHORUS LUCIFERASE, NANOKAZ MNHKVHHHHHHMELGTLEGSEFFTLEDFVGDWRQTAGYNLDQVLEQGGVSSLFQNLGVSVTPIQRIVLSGENGLKIDIHVIIPYEGLSGDQMGQIEKIFKVVYPVDDHHFKVILHYGTLVIDGVTPNMIDYFGRPYEGIAVFDGKKITVTGTLWNGNKIIDERLINPDGSLLFRVTINGVTGWRLCERILA 191 T 0.0098 Lipocalin_7 pdbhh F Eukaryota T 5b16 2 B,C B,C DGCR8_HUMAN DIGEORGE SYNDROME CRITICAL REGION 8 MANLHILSKLQEEMKRLAEEREETRHGGSRGDMLEVLFQ 39 T 7.1 Tma16 pdbhh F Eukaryota T 5b41 2 B C MED1_HUMAN Mediator of RNA polymerase II transcription subunit 1 KNHPMLMNLLKDN 13 T 3.8 Gb3_synth pdbhh F Eukaryota T 5b4w 2 G,H,I,J,K,L G,H,I,J,K,L Synthesized cyclic peptide XXRPRVARWTGQIIYCSX 18 T 0.56 DUF3104 pdbhh F T 5b4x 1 A,C A,C RELN_MOUSE REELER PROTEIN GRDGNNLNNPVLLLDTFDFGPREDNWFFYPGGNIGLYCPYSSKGAPEEDSAMVFVSNEVGEHSITTRDLSVNENTIIQFEINVGCSTDSSSADPVRLEFSRDFGATWHLLLPLCYHSSSLVSSLCSTEHHPSSTYYAGTTQGWRREVVHFGKLHLAGSVRFRWYQGFYPAGSQPVTWAIDNVYIGPQCEEMCYGHGSCINGTKCICDPGYSGPTCKISTKNPDFLKDDFEGQLESDRFLLMSGGKPSRKCGILSSGNNLFFNEDGLRMLVTRDLDLSHARFVQFFMRLGCGKGVPDPRSQPVLLQYSLNGGLSWSLLQEFLFSNSSNVGRYIALEMPLKARSGSTRLRWWQPSENGHFYSPWVIDQILIGGNISGNTVLEDDFSTLDSRKWLLHPGGTKMPVCGSTGDALVFIEKASTRYVVTTDIAVNEDSFLQIDFAASCSVTDSCYAIELEYSVDLGLSWHPLVRDCLPTNVECSRYHLQRILVSDTFNKWTRITLPLPSYTRSQATRFRWHQPAPFDKQQTWAIDNVYIGDGCLDMCSGHGRCVQGSCVCDEQWGGLYCDEPETSLPTQLKDNFNRAPSNQNWLTVSGGKLSTVCGAVASGLALHFSGGCSRLLVTVDLNLTNAEFIQFYFMYGCLITPSNRNQGVLLEYSVNGGITWNLLMEIFYDQYSKPGFVNILLPPDAKEIATRFRWWQPRHDGLDQNDWAIDNVLISRLENLYFQ 725 T 0.00014 EGF_2 pdb F Eukaryota T 5b56 2 C,D,E,F C,D,E,F VPR_HV1B9 R ORF PROTEIN,VIRAL PROTEIN R QQRRTRNGASKS 12 T 6.8 EcoRII-C pdbhh T Viruses T 5b5b 2 B,D C,F MED1_HUMAN Mediator of RNA polymerase II transcription subunit 1 KNHPMLMNLLKDN 13 T 3.8 Gb3_synth pdbhh F Eukaryota T 5b5u 3 C,D C,E LEU-VAL-VAL-ASN LVVN 4 T 150 SCP1201-deam pdbhh F F 5b62 2 B B ASN-GLU-ALA NEA 3 T 330 ComZ pdbhh F F 5b6c 2 B B UFD1_HUMAN Peptide from Ubiquitin fusion degradation protein 1 homolog FRAFSGSGNRL 11 T 0.52 SEP pdbhh F Eukaryota T 5b6g 2 B B PHQ-ALA-GLY-GLU-ALA-XYC-TYR-GLU XAGEAXYEX 9 T 38 NYAP_N pdbhh F F 5b6i 1 A,B A,B W0W999_9ACTN Fluorinase MGSSHHHHHHSSGLVPRGSHMAANGSQRPIIAFMSDLGTTDDSVAQCKGLMHSICPGVTVVDVCHSMTPWDVEEGARYIVDLPRFFPEGTVFATTTYPATGTTTRSVAVRIRQAAKGGARGQWAGSGDGFERADGSYIYIAPNNGLLTTVLEEHGYIEAYEVTSTKVIPANPEPTFYSREMVAIPSAHLAAGFPLAEVGRRLDDSEIVRFHRPAVEISGEALSGVVTAIDHPFGNIWTNIHRTDLEKAGIGQGKHLKIILDDVLPFEAPLTPTFADAGAIGNIAFYLNSRGYLSLARNAASLAYPYNLKAGLKVRVEAR 319 T 2.6E-42 SAM_adeno_trans pdbpercent F Bacteria T 5b6l 2 B U UNK-UNK-UNK-UNK-TRP XXXXW 5 T 500 CBM_1 pdbhh F F 5b74 3 C D LEU-VAL-VAL-ASN LVVN 4 T 150 SCP1201-deam pdbhh F F 5b7i 2 B,C B,C L7P7R7_9CAUD Uncharacterized protein AcrF3 MGSSHHHHHHSQDPMSNTISDRIVARSVIEAARFIQSWEDADPDSLTEDQVLAAAGFAARLHEGLQATVLQRLVDESNHEEYREFKAWEEALLNADGRVASSPFADWGWWYRIANVMLATASQNVGVTWGSRVHGRLMAIFQDKFKQRYEEQA 153 T 0.23 Rtt102p unppssm T Viruses T 5bjt 3 I,J,K,L,M,N,O P,Q,R,S,T,U,V peptide inhibitor XRYFCTKWKHGWCEEVGTX 19 T 4.8 GnRH pdbhh F T 5bk0 3 E E CSP_PLAFA Circumsporozoite protein NANP 5-mer NANPNANPNANPNANPNANP 20 T 3.2 PT unppercent F Eukaryota F 5bkl 1 A,B,BA,C,CA,D,DA,E,EA,F,FA,G,H,I,J,K,L,M,N,O A,B,GG,C,HH,D,II,E,JJ,F,KK,G,H,I,J,K,L,M,N,O COAT_STMV Coat protein MGRGKVKPNRKSTGDNSNVVTMIRAGSYPKVNPTPTWVRAIPFEVSVQSGIAFKVPVGSLFSANFRTDSFTSVTVMSVRAWTQLTPPVNEYSFVRLKPLFKTGDSTEEFEGRASNINTRASVGYRIPTNLRQNTVAADNVCEVRSNCRQVALVISCCFN 159 T 24 Csm4_C pdbhh T Viruses T 5bkn 1 A,B,BA,C,CA,D,DA,E,EA,F,FA,G,H,I,J,K,L,M,N,O A,B,GG,C,HH,D,II,E,JJ,F,KK,G,H,I,J,K,L,M,N,O COAT_STMV Coat protein MGRGKVKPNRKSTGDNSNVVTMIRAGSYPKVNPTPTWVRAIPFEVSVQSGIAFKVPVGSLFSANFRTDSFTSVTVMSVRAWTQLTPPVNEYSFVRLKPLFKTGDSTEEFEGRASNINTRASVGYRIPTNLRQNTVAADNVCEVRSNCRQVALVISCCFN 159 T 24 Csm4_C pdbhh T Viruses T 5bkq 1 A,B,C,D,E,F,G,H,I,J,K,L,M,N,O,P,Q,R,S,T A,B,C,D,E,F,G,H,I,J,K,L,M,N,O,GG,HH,II,JJ,KK COAT_STMV Coat protein MGRGKVKPNRKSTGDNSNVVTMIRAGSYPKVNPTPTWVRAIPFEVSVQSGIAFKVPVGSLFSANFRTDSFTSVTVMSVRAWTQLTPPVNEYSFVRLKPLFKTGDSTEEFEGRASNINTRASVGYRIPTNLRQNTVAADNVCEVRSNCRQVALVISCCFN 159 T 24 Csm4_C pdbhh T Viruses T 5bmm 2 C,D C,D macrocyclic inhibitor MC25b XXXXX 5 T 210 Hanta_nucleocap pdbhh F F 5bmt 1 A,B A,B A7AJI6_9PORP Uncharacterized protein GDGGGNTQQLSSYAIVDYSSTMRTLIYPLGYYPLYVATIANDPTYRAGDCVLANFTVDFDSADNANASTNGFYVATGAASSPLAKYDLSYSPLDSMALDNELLLSGSESALLFSNNYKRIVVIPTFTSVLTDQKNTYIMSMDSNQEPETVDGTDRVYTLCLRAQKREEGKAPTISNAMDPIAVEGGTLYSMLKGKESAAGKKIVSYRVKYPLTFNADSTKIATWGYSKISQFSIEEATN 239 T 0.0074 DUF4969 unphh F Bacteria T 5bn6 1 A,C,E,G A,B,C,D Q38720_ARTHE Jacalin AEQSGKSQTVIVGPWGAQV 19 T 2.8 DUF3842 pdbhh F Eukaryota T 5bnd 1 A,B,C A,B,C Q5HN63_STAEQ ABC transporter, ATP-binding protein TYEEKLAYGLALDGSVTLNGSKDLKVPKYSLITITGENNKRYRVEMNQRRYSVSKNQVFYFNPAGLYESHTFKKLSPYIKSNYSTYVEYFNSHLHQKHDKVTETLRPDKDKKYVVPITQQPIKMIFGDNDKLSGFVIPMTNKTELKKTFNITKDVWITKSGSGYFIADMKEEKWIYIEL 179 T 0.051 DUF2393 unppercent F Bacteria T 5bnw 2 B D LMNB2_HUMAN laminB1 residues 179-191 KLSPSPSSRVTVS 13 T 0.41 CCDC85 unppercent F Eukaryota T 5boa 1 A,B,C,D,E,F A,B,C,D,E,F A4VT01_STRSY Translation initiation factor 2 (IF-2 GTPase) MGSSHHHHHHSSGLVPRGSHMKQQSPLIQTSNADYKSGKDQEKLRTSVSINLLKAEEGQIQWKVTFDTSEWSFNVKHGGVYFILPNGLDLTKIVDNNQHDITASFPTDINDYRNSGQEKYRFFSSKQGLDNENGFNSQWNWSAGQANPSETVNSWKSGNRLSKIYFINQITDTTELTYTLTAKVTEPNQQSFPLLAVMKSFTYTNSKSTEVTSLGAREITLEKEKT 226 T 4.2 DUF5377 unphh F Bacteria T 5bob 1 A,B,C,D,E A,B,C,D,E A4VT01_STRSY Translation initiation factor 2 (IF-2 GTPase) MGSSHHHHHHSSGLVPRGSHMKQQSPLIQTSNADYKSGKDQEKLRTSVSINLLKAEEGQIQWKVTFDTSEWSFNVKHGGVYFILPNGLDLTKIVDNNQHDITASFPTDINDYRNSGQEKYRFFSSKQGLDNENGFNSQWNWSAGQANPSETVNSWKSGNRLSKIYFINQITDTTELTYTLTAKVTEPNQQSFPLLAVMKSFTYTNSKSTEVTSLGAREITLEKEKT 226 T 4.2 DUF5377 unphh F Bacteria T 5bpu 2 G H (GGL)EEEEEE XEEEEEE 7 T 82 PRT_C pdbhh F F 5bpu 3 H I (GGL)EEE XEEE 4 T 250 Herpes_LP pdbhh F F 5bpz 1 A A Q0IH16_XENLA Anapc5 protein MASVHESLYFNPMMTNGVVHANVFGIKDWVTPYKISVLVLLSEMSKNTKISLVEKRRLNKQILPLLQGPDMTLSKLIKIVEECCPNVSSSVHIRIKLMAEGELKDMEQFFDDLADSFTGTEPEVHKTSVVGLFLRHMILAYNKLSFSQVYKLYTSLQQYFQSDENLYFQ 169 T 0.071 CEP19 unp F Eukaryota T 5brm 2 G,H,I,J,K,L,M,N,O G,H,I,J,K,L,M,N,O STK3_HUMAN MAMMALIAN STE20-LIKE PROTEIN KINASE 2,MST-2,STE20-LIKE KINASE MST2,SERINE/THREONINE-PROTEIN KINASE KRS-1 DEEEEDGTMKRNATSPQVQRPSFMDYFDKQD 31 T 0.28 Fib_succ_major unp F Eukaryota T 5bs0 3 C C TITIN_HUMAN CONNECTIN,RHABDOMYOSARCOMA ANTIGEN MU-RMS-40.14 ESDPIVAQY 9 T 13 DUF6497 pdbhh F Eukaryota T 5bs2 2 C R RBL_CHLRE RUBISCO LARGE SUBUNIT WKEIKF 6 T 8.5 DUF4300 pdbhh F Eukaryota F 5btr 2 D,E,F D,E,F AMC-containing peptide XRHKX 5 T 500 DEC-1_C pdbhh F F 5btv 2 B P TAU_HUMAN Microtubule-associated protein tau - peptide pS324 GSLG 4 F F Eukaryota F 5btw 1 A,B A,B Q5ZVE4_LEGPH Uncharacterized protein MVTKIIWVSNNGKPNLKIEFVSEEEKSNFFKEVKKKASELGLNFPLVQGSGNSLLIEASNYPINPCGCYISPGGKLAINFGKVELSHFILPKVGVKTEHAEIFKDHNTIFFHKHKLPGVNSELTFIPTGTPVIVPVTKLEHHHHHH 146 T 0.73 PPV_E2_C pdbhh F Bacteria T 5btx 1 A,B A,B lpg1496 MVTKIIWVSNNGKPNLKIEFVSEEEKSNFFKEVKKKASELGLNFPLVQGSGNSLLIEASNYPINPCGCYISPGGKLAINFGKVELSHFILPKVGVKTEHAEIFKDHNTIFFHKHKLPGVNSELTFIPTGTPVIVPVTKLEHHHHHH 146 T 0.73 PPV_E2_C pdbhh F T 5bty 1 A A lpg1496 SGDSSISISAIGNVDSPMIRITFQNQTEREFFLNKITDKAKSLGVNISTHPFEIKEPNMVLIKPSKYPDNKLGCYISKNKEIAINFGRTDFRDFVLSNLGVGSHLGTCPTKNETGNDTFYFHQENLSLNGPALSVNTK 138 T 0.019 SL4P pdbpssm F T 5btz 1 A A lpg1496 KSGDSSISISAIGNVDSPMIRITFQNQTEREFFLNKITDKAKSLGVNISTHPFEIKEPNMVLIKPSKYPDNKLGCYISKNKEIAINFGRTDFRDFVLSNLGVGSHLGTCPTKNETGNDTFYFHQENLSLNGPALSVNTK 139 T 0.017 SL4P pdbpssm F T 5bvl 1 A A designed TIM barrel sTIM11 MDKDEAWKCVEQLRREGATQIAYRSDDWRDLKEAWKKGADILIVDATDKDEAWKQVEQLRREGATQIAYRSDDWRDLKEAWKKGADILIVDATDKDEAWKQVEQLRREGATQIAYRSDDWRDLKEAWKKGADILIVDATDKDEAWKQVEQLRREGATQIAYRSDDWRDLKEAWKKGADILICDATGLEHHHHHH 194 T 0.00015 NanE pdbhh F T 5bw8 4 E Z Unknown Protein XXXXXXXXXXXXX 13 F F F 5bwl 2 B B Coumarin-labelled succinyl peptide LGX 3 T 800 TPR_2 pdbhh F F 5bxo 2 C,D C,D TNKS2_HUMAN TANK2,ADP-RIBOSYLTRANSFERASE DIPHTHERIA TOXIN-LIKE 6,ARTD6,POLY [ADP-RIBOSE] POLYMERASE 5B,TNKS-2,TRF1-INTERACTING ANKYRIN-RELATED ADP-RIBOSE POLYMERASE 2,TANKYRASE II,TANKYRASE-LIKE PROTEIN,TANKYRASE-RELATED PROTEIN XREAGDGAEX 10 T 7.5 DUF5840 pdbhh F Eukaryota T 5bxu 2 B B cp4n4m5 XREAGDGAX 9 T 5.5 DUF5840 pdbhh F T 5c07 3 C,H C,H Marker peptide YQFGPDFPIA 10 T 0.58 TGT_C2 pdbhh F T 5c08 3 C,H C,H Marker peptide RQWGPDPAAV 10 T 2.1 LEA_3 pdbhh F T 5c09 3 C,H C,H Marker peptide YLGGPDFPTI 10 T 6.4 GlfT2_domain3 pdbhh F T 5c0a 3 C,H C,H Marker peptide MVWGPDPLYV 10 T 0.74 Tachykinin pdbhh F T 5c0b 3 C,H C,H Marker peptide RQFGPDFPTI 10 T 8.1 Synapsin pdbhh F T 5c0c 3 C,H C,H Marker peptide RQFGPDWIVA 10 T 1.4 Rab15_effector pdbhh F T 5c0d 3 C C INS_HUMAN Marker peptide AQWGPDPAAA 10 T 3.1 LEA_3 pdbhh F Eukaryota T 5c0e 3 C C Marker peptide YQFGPDFPIA 10 T 0.58 TGT_C2 pdbhh F T 5c0f 3 C C Marker peptide RQWGPDPAAV 10 T 2.1 LEA_3 pdbhh F T 5c0g 3 C C Marker peptide YLGGPDFPTI 10 T 6.4 GlfT2_domain3 pdbhh F T 5c0i 3 C C Marker peptide RQFGPDFPTI 10 T 8.1 Synapsin pdbhh F T 5c0j 3 C C Marker peptide RQFGPDWIVA 10 T 1.4 Rab15_effector pdbhh F T 5c0v 1 A,B,C,D A,B,C,D LARP1_HUMAN LA RIBONUCLEOPROTEIN DOMAIN FAMILY MEMBER 1 GHSGGGGGGHMQHPSHELLKENGFTQHVYHKYRRRCLNERKRLGIGQSQEMNTLFRFWSFFLRDHFNKKMYEEFKQLALEDAKEGYRYGLECLFRYYSYGLEKKFRLDIFKDFQEETVKDYEAGQLYGLEKFWAFLKYSKAKNLDIDPKLQEYLGKFRRLED 162 T 0.62 Frankia_peptide pdbpercent F Eukaryota T 5c1b 2 G,H V,U UFD1_HUMAN UB FUSION PROTEIN 1 GELGFRAFSGSGNRLDGKKKG 21 T 1.6 SEP pdbhh F Eukaryota T 5c26 2 B B GLU-VAL-PTR-GLU-SER-PRO EVXESP 6 T 2.2 Dynamitin pdbhh F F 5c27 2 B D GLU-VAL-TYR-GLU-SER EVYES 5 T 68 Raf1_HTH pdbhh F F 5c3l 5 E H Part of Nup54 N-terminus with weak electron density, built as poly-alanine. XXXXXXXXXXXXXX 14 F F F 5c56 1 A B ICP0_HHV11 Ubiquitin E3 ligase ICP0 SGPRGPRKCARKTRHAETSGA 21 T 4.6 DUF6395 pdbhh T Viruses T 5c5e 2 B,D G,H KAIC_SYNE7 KaiC C-terminal peptide DEKSELSRIVRGVQEKGPES 20 T 2 Lambda_CIII pdbhh F Bacteria T 5c6d 2 C,D C,D UHRF1_HUMAN INVERTED CCAAT BOX-BINDING PROTEIN OF 90 KDA,NUCLEAR PROTEIN 95,NUCLEAR ZINC FINGER PROTEIN NP95,HNP95,RING FINGER PROTEIN 106,TRANSCRIPTION FACTOR ICBP90,UBIQUITIN-LIKE PHD AND RING FINGER DOMAIN-CONTAINING PROTEIN 1,HUHRF1,UBIQUITIN-LIKE-CONTAINING PHD AND RING FINGER DOMAINS PROTEIN 1 SEGGFASPRTGKGKWKRKSAGGGPSRAGSPRRT 33 T 43 Ribosomal_L35p pdbhh F Eukaryota T 5c6g 2 B,D B,D SCC2_ASHGO Sister chromatid cohesion protein 2 MSTFPGEDTRIPKRISEALSHQPLNHLVPKRELSRLLSKPVQISVQLESEDAFEEVPEELWQYPHPIDLDPLRLEESQPLRFRRPRGARLDYREDSSEIADLPGMGQLARACLSGTQLVDSAAIVESIESNAKKRKQTLAIGDVEMVSPDKKTKVMASVSPVSLNRVALGSQHLKTLERLMQYIGADESSAEFGDFEYWITLEDRATHILSEQCIDKL 218 T 32 GhoS pdbhh F Eukaryota T 5c6h 2 B,D,F,H,J,L,N,P,R,T,V,X B,D,F,H,J,L,N,P,R,T,V,X HUWE1_HUMAN Mule BH3 peptide from E3 ubiquitin-protein ligase HUWE1 PGVMTQEVGQLLQDMGDDVYQQYRSL 26 T 4.4 KIP1 pdbhh F Eukaryota T 5c6v 2 E,F,G,H E,F,G,H NINJA_ARATH NINJA-FAMILY PROTEIN AT4G28910,NOVEL INTERACTOR OF JAZ DNGLELSLGLS 11 T 0.00015 EAR unppercent F Eukaryota T 5c7f 2 E,F,G,H E,F,G,H IAA1_ARATH INDOLEACETIC ACID-INDUCED PROTEIN 1 KDTELRLGLPG 11 T 1.2E-05 AUX_IAA pdbhh F Eukaryota T 5c9n 1 A,B A,B GEMC1_HUMAN Geminin coiled-coil domain-containing protein 1 DSNFPLPDLCSWEEAQLSSQLYRNKQLQDTLVQKEEELARLHEENNHLRQYLNSALVKCEEEKAKKELSSDEFSKAYGKFRKGKR 85 T 0.00086 YabA pdb F Eukaryota T 5cdc 4 D,D10,D11,D12,D13,D14,D15,D16,D17,D18,D19,D2,D20,D21,D22,D23,D24,D25,D26,D27,D28,D29,D3,D30,D31,D32,D33,D34,D35,D36,D37,D38,D39,D4,D40,D41,D42,D43,D44,D45,D46,D47,D48,D49,D5,D50,D51,D52,D53,D54,D55,D56,D57,D58,D59,D6,D60,D7,D8,D9 D,D,D,D,D,D,D,D,D,D,D,D,D,D,D,D,D,D,D,D,D,D,D,D,D,D,D,D,D,D,D,D,D,D,D,D,D,D,D,D,D,D,D,D,D,D,D,D,D,D,D,D,D,D,D,D,D,D,D,D VP4 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 45 F F F 5cdw 2 AA,BA,CA,DA,EA,FA,Q,R,S,T,U,V,W,X,Y,Z b,f,h,j,I,Q,D,M,s,a,F,J,N,R,T,X SER-PTR-VAL-ASN-VAL-GLN SXVNVQ 6 T 63 DUF4246 pdbhh F F 5ceh 2 B B Unknown Peptide XXXXXXXXXX 10 F F F 5cfa 2 C,D D,C FIBA_HUMAN Peptide from Fibrinogen alpha chain SKQFTSSTSYNRGDS 15 T 1.5 DUF5326 pdbhh F Eukaryota T 5cgg 15 CA,DA g,h carfilzomib alpha-chloroacetamide 1 XXGLFX 6 T 670 zf-C2H2_4 pdbhh F F 5cgh 15 CA,DA e,f carfilzomib-alpha-chloroacetamide 5 XXXLXX 6 T 670 zf-C2H2_4 pdbhh F F 5cgn 1 A,B,C,D A,B,C,D D-Ala-Magainin Derivative GXGXXXXXXXXXXXXXXXXXXXX 23 F F F 5cgn 2 E,F,G,H E,F,G,H MAGA_XENLA L-ACPC8-Ala-Magainin GIGKFLHXAKKFAKAFVAEIMNS 23 T 1.3 TAFII28 pdbhh F Eukaryota T 5cgo 1 A,B A,B MAGA_XENLA ACPC-13 derivative of Ala-Magainin 2 GIGKFLHAAKKFXKAFVAEIMNS 23 T 1 TAFII28 pdbhh F Eukaryota T 5cgo 2 C,D C,D D-Ala-Magainin 2 GXGXXXXXXXXXXXXXXXXXXXX 23 F F F 5cha 1 A,D A,E CTRA_BOVIN ALPHA-CHYMOTRYPSIN A CGVPAIQPVLSGL 13 T 2 CaM_bind pdbhh F Eukaryota T 5cjb 2 B,C,D B,C,D collagen-like peptide GPPGPPGPPGPPGPAGFPGPPGPPG 25 T 0.00029 Collagen pdbpercent F F 5cje 1 A A Q82LM3_STRAW CYTOCHROME P450 107L2 MGNVIDLGEYGARFTEDPYPVYAELRERGPVHWVRTPPPEAFEGWLVVGHEEARAALADPRLSKDGTKKGLTSLDVELMGPYLLVVDPPEHTRLRSLVARAFTMRRVEALRPRIQEITDGLLDEMLPRGRADLVDSFAYPLPITVICELLGVPDIDRVTFRALSNEIVAPTGGDAELAAYERLAAYLDELIDDKRSTAPADDLLGDLIRTRAEDDDRLSGEELRAMAFILLVAGHETTVNLITNGVHTLLTHPDQLAALRADMTLLDGAVEEVLRFEGPVETATYRYAAESMEIGGTAIAEGDPVMIGLDAAGRDPARHPDPHVFDIHRAPQGHLAFGHGIHYCLGAPLARLEARVALRSLLERCPDLALDGPPGARPPGMLIRGVRRLPVRW 393 T 1.3E-06 p450 unppssm F Bacteria T 5ck3 2 B,D,F B,D,F G0S401_CHATD Putative signal recognition particle protein MGATTQYTTLPSVLLIGPSGAGKTALLTLFERGPLLNPDGTSVGAADLKNPYRKPIVTSPVAQTHTSQVPTSVELAVGANEDGTPTSYKVDLDAAGATARKFLLIDTPGHPKLRGTTLQHLLNPSPSLTIIPTNAPNKKTSTDSHSDPYKSKLKAVIFLLDAAALADSDGDYLSQTASYLYDVLLSLQKRFHSRKNSRAPSSIPVLIAANKQDLFTAVPASLVKSRLEHELGRIRKTRQKGLLEASVTSEDEIRADDEEGWLGAVGSKEFKFEEMMEFDMEVEVMGGNVIGDGPGAERWWRWIGERI 307 T 2.3E-10 MnmE_helical pdbhh F Eukaryota T 5ck4 1 A,B A,B G0S401_CHATD Putative signal recognition particle protein MKHHHHHHPMGATTQYTTLPSVLLIGPSGAGKTALLTLFERGPLLNPDGTSVGAADLKNPYRKPIVTSPVAQTHTSQVPTSVELAVGANEDGTPTSYKVDLDAAGATARKFLLIDTPGHPKLRGTTLQHLLNPSPSLTIIPTNAPNKKTSTDSHSDPYKSKLKAVIFLLDAAALADSDGDYLSQTASYLYDVLLSLQKRFHSRKNSRAPSSIPVLIAANKQDLFTAVPASLVKSRLEHELGRIRKTRQKGLLEASVTSEDEIRADDEEGWLGAVGSKEFKFEEMMEFDMEVEVMGGNVIGDGPGAERWWRWIGERI 316 T 4.7E-10 MnmE_helical pdbhh F Eukaryota T 5ck5 1 A,B,C,D A,B,C,D G0S401_CHATD Putative signal recognition particle protein MKHHHHHHPMGATTQYTTLPSVLLIGPSGAGKTALLTLFERGPLLNPDGTSVGAADLKNPYRKPIVTSPVAQTHTSQVPTSVELAVGANEDGTPTSYKVDLDAAGATARKFLLIDTPGHPKLRGTTLQHLLNPSPSLTIIPTNAPNKKTSTDSHSDPYKSKLKAVIFLLDAAALADSDGDYLSQTASYLYDVLLSLQKRFHSRKNSRAPSSIPVLIAANKQDLFTAVPASLVKSRLEHELGRIRKTRQKGLLEASVTSEDEIRADDEEGWLGAVGSKEFKFEEMMEFDMEVEVMGGNVIGDGPGAERWWRWIGERI 316 T 4.7E-10 MnmE_helical pdbhh F Eukaryota T 5cmu 1 A,B,C A,B,C ENV_HV1H2 Envelope glycoprotein,AP1 SSGIVQQQNNLLRAIEAQQHLLQLTVWGIKQLQARILSGGRGGWMEWDREIEELIKKSEELIKKIEEQIKKQE 73 T 0.11 GP41 pdbpssm T Viruses T 5cmz 2 B,D B,D Artificial HIV entry inhibitor AP3 XMTWEEWDKKIEELIKKSEELIKKIEEQIKKQEESIKK 38 T 0.0055 DUF1351 pdbpercent F T 5cn0 1 A A ENV_HV1H2 GP41 SSGIVQQQNNLLRAIEAQQHLLQLTVWGIKQLQARILSGGRGGWEEWDKKIEELIKKSEELIKKIEEQIKKQE 73 T 0.058 GP41 pdbpercent T Viruses T 5cod 1 A,FB,MA,RC,T,YB L1,L4,L3,L6,L2,L5 ND5 OF BOVINE COMPLEX I XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 606 F F F 5cod 2 B,GB,NA,SC,U,ZB M1,M4,M3,M6,M2,M5 ND4 OF BOVINE COMPLEX I XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 459 F F F 5cod 3 AC,C,CC,DC,E,F,HB,JB,KB,OA,QA,RA,TC,V,VC,WC,X,Y f5,f1,h5,i5,h1,i1,f4,h4,i4,f3,h3,i3,f6,f2,h6,i6,h2,i2 Unknown structure XXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 30 F F F 5cod 4 BC,D,IB,PA,UC,W g5,g1,g4,g3,g6,g2 Unknown structure XXXXXXXXXXXXXXXXXXXXXX 22 F F F 5cod 5 AA,DD,EC,ED,FA,FC,G,GA,H,KC,LB,LC,M,MB,N,RB,SA,SB,TA,XC,YA,YC,Z,ZA k2,p6,j5,s6,p2,k5,j1,s2,k1,p5,j4,s5,p1,k4,s1,p4,j3,s4,k3,j6,p3,k6,j2,s3 Unknown structure XXXXXXXXXXXXXXXXXXXXXXXXXXXX 28 F F F 5cod 6 BA,GC,I,NB,UA,ZC l2,l5,l1,l4,l3,l6 Unknown structure XXXXXXXXXXXXX 13 F F F 5cod 7 AD,CA,HC,J,OB,VA U6,U2,U5,U1,U4,U3 SDAP XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 88 F F F 5cod 8 BD,DA,IC,K,PB,WA n6,n2,n5,n1,n4,n3 Unknown structure XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 59 F F F 5cod 9 CD,EA,JC,L,QB,XA o6,o2,o5,o1,o4,o3 Unknown structure XXXXXXXXXXXXXXXXXXXXX 21 F F F 5cod 10 AB,FD,HA,MC,O,TB t3,t6,t2,t5,t1,t4 Unknown structure XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 57 F F F 5cod 11 BB,GD,IA,NC,P,UB u3,u6,u2,u5,u1,u4 Unknown structure XXXXXXXXXXXXXXX 15 F F F 5cod 12 CB,HD,JA,OC,Q,VB v3,v6,v2,v5,v1,v4 Unknown structure XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 32 F F F 5cod 13 DB,ID,KA,PC,R,WB w3,w6,w2,w5,w1,w4 Unknown structure XXXXXXXXXXXXXXXXXXXXXXXXXXX 27 F F F 5cod 14 EB,JD,LA,QC,S,XB BC,BF,BB,BE,BA,BD Unknown structure XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 146 F F F 5cow 1 A A E3M3V1_CAERE Putative uncharacterized protein KLLLEGVKEQDPVDKFTYLLLQPLTEATLSDAVNFIVEKYSAELPDEGDASLVVRSQLGCQFFFLVTRTLAHDQRELAKLVQTLIPRPVRLEVFPGLQRSVFKSSVFLGHHIIQIFMGAKKPFQDWSFVGLAQDFECPWRRLAIAELLKKFSVSVVEKVFDNPVALIPQHESDNEALIELVTNALRFALWIVEFYETETNEKSIKELAFLDHSSKTLLIESFTKFLQGKDVKDQDHLKRIIDALEKS 247 T 0.12 CbtA_toxin pdb F Eukaryota T 5coz 1 A A C4ZHW1_AGARV Uncharacterized protein GIDDGTQANTTDLNDYENVLNSLDEEQIGKLPQNIKCVVNDKLNIDSEINIWDATSYYVKSGKVKAINFSENKDKCYDLMEKLAKAINLNKDVCVQSHRSENGNEIYLWDNNYTQDSIAIRNDSALAETHDGKLAVSASKFGTYYSPFNDKDKFRTDKQLMFMSAEEAEELAVKTAKELEINVCEKNELYVLDDKNTLIFPEDDTDKQNDTYVFFMFPDVYGIPYSRCPENEALTGYANQENHLVIAMDEKGISFLDIPPLYDWVETTETGEILHPSSILSKEVDKLKKYVTSGDIEVSEISLEYMLFADKNETYDIKPVWVVYYYQNQLVTGENSYTQKMALYDVYDAYTGEEYRIQ 358 T 0.14 HTH_26 unppercent F Bacteria T 5cp0 2 B C MET-ALA-SER MAS 3 T 280 zf-C2H2_4 pdbhh F F 5cq2 2 B,C B,C TXNIP_HUMAN THIOREDOXIN-BINDING PROTEIN 2,VITAMIN D3 UP-REGULATED PROTEIN 1 XTPEAPPCYMDVIX 14 T 14 MYEOV2 pdbhh F Eukaryota T 5cq9 2 C,D C,D 11-mer peptide XXXXXXXXXXX 11 F F F 5cqc 1 A A Q5ZUV9_LEGPH putative RavZ protein GSKLIVDEFEELGEQELDKTTSIYPPETSWEVNKGMNSSRLHKLYSLFFDKSSAFYLGDDVSVLEDKPLTGAYGFQSKKNDQQIFLFRPDSDYVAGYHVDAKSDAGWVNDKLDRRLSEISEFCSKATQPATFILPFVEMPTDITKGVQHQVLLTISYDPKSKQLTPTVYDSIGRDTYSESLSSYFKGKYRTTCDEILTQSIEKAIKSTDFTLGKFTRAAYNHQNRLTEGNCGSYTFRTIKEVISSSAQGTEVKIPGSGYITSNSYLTSQHVQDIESCIKYRNLGVVDIESALTEGKTLPVQLSEFIVALEDYGKLRSQQSEKSMLNFIGYSKTAKLTAVELLIGILNDIKGKNEISESQYDKLVKEVDCLMDSSLGKLVQFHLKNLGAESLQKLVLPCVKFDDELFDVPDITGEELASKK 420 T 18 DUF438 unphh F Bacteria T 5cqx 2 C C MAZE_ECOLI Antitoxin MazE HENIDWGEPKDKEVW 15 T 1.1 DUF2389 pdbhh F Bacteria T 5cqy 2 C C MAZE_ECOLI Antitoxin MazE HENIDWGEPKDKEVW 15 T 1.1 DUF2389 pdbhh F Bacteria T 5cs2 2 B B Cyclomarin A XXAXVXX 7 T 6.3 DUF446 pdbhh F F 5csf 2 C C KS6A1_HUMAN S6K-ALPHA-1,90 KDA RIBOSOMAL PROTEIN S6 KINASE 1,P90S6K,MAP KINASE-ACTIVATED PROTEIN KINASE 1A,MAPKAPK-1A,RIBOSOMAL S6 KINASE 1,RSK-1 GSQSQLSHQDLQLVKGAMAATYSALNSSKPTPQLKPIESSILAQRRVRKLPSTTL 55 T 55 DUF4603 pdbhh F Eukaryota T 5csi 2 C C KS6A1_HUMAN S6K-ALPHA-1,90 KDA RIBOSOMAL PROTEIN S6 KINASE 1,P90S6K,MAP KINASE-ACTIVATED PROTEIN KINASE 1A,MAPKAPK-1A,RIBOSOMAL S6 KINASE 1,RSK-1 GSQDLQLVKGAMAATYSALNSSKPTPQLKPIESSILAQRRVRKLPSTTL 49 T 0.026 Vitelline_membr pdbpssm F Eukaryota T 5csj 2 C C KS6A1_HUMAN S6K-ALPHA-1,90 KDA RIBOSOMAL PROTEIN S6 KINASE 1,P90S6K,MAP KINASE-ACTIVATED PROTEIN KINASE 1A,MAPKAPK-1A,RIBOSOMAL S6 KINASE 1,RSK-1 GSGAMAATYSALNSSKPTPQLKPIESSILAQRRVRKLPSTTL 42 T 35 DUF4603 pdbhh F Eukaryota T 5csm 1 A A CHMU_YEAST CHORISMATE PYRUVATE MUTASE MDFTKPETVLNLQNIRDELVRMEDSIIFKFIERSHFATCPSVYEANHPGLEIPNFKGSFLDWALSNLEIAHSRIRRFESPDETPFFPDKIQKSFLPSINYPQILAPYAPEVNYNDKIKKVYIEKIIPLISKRDGDDKNNFGSVATRDIECLQSLSRRIHFGKFVAEAKFQSDIPLYTKLIKSKDVEGIMKNITNSAVEEKILERLTKKAEVYGVDPTERRIERRISPEYLVKIYKEIVIPITKEVEVEYLLRRLEE 256 T 0.048 CM_2 pdb F Eukaryota T 5csn 2 C C KS6A1_HUMAN S6K-ALPHA-1,90 KDA RIBOSOMAL PROTEIN S6 KINASE 1,P90S6K,MAP KINASE-ACTIVATED PROTEIN KINASE 1A,MAPKAPK-1A,RIBOSOMAL S6 KINASE 1,RSK-1 GSQSQLSHQDLQLVKGAMAATYSALNSSKPTPQLKPIESS 40 T 26 RRT14 pdbhh F Eukaryota T 5ctt 2 B B SART3_HUMAN SART-3,TAT-INTERACTING PROTEIN OF 110 KDA,TIP110,P110 NUCLEAR RNA-BINDING PROTEIN LVPRGSRKRARAEKKALKKKKKIRGPEKRGADEDDEKEWGDDEEEQPSKRRRVEN 55 T 0.31 Pox_Ag35 pdb F Eukaryota T 5ctv 2 B,C C,E fragment of peptidoglycan AXKXX 5 T 230 OAM_dimer pdbhh F F 5cv1 1 A A PGL1_CAEEL P granule abnormality protein 1 KQLMLDGPKSEPADPFISLLMDPLEESVGKVVNHIAQLFEEASKNEGDESLVLRSQLGYQLFFLIVRSLADGKREVSKKILSGIPTSVRAEVFPGLQRSVYKSAVFLGNHIIQVLLGSKKSFEDWDVVGVAKDLESAWKRRAIAELIKKFQVSILEQCFDKPVPLIPQSPLNNDAVIDNVNKALQFALWLTEFYGSENETEALGELRFLDSTSKNLLVDSFKKFVQGINSKTHVTRIVESLEK 243 T 3.6 DMA pdbhh F Eukaryota T 5cv3 1 A A E3M3V1_CAERE Putative uncharacterized protein GKLLLEGVKEQDPVDKFTYLLLQPLTEATLSDAVNFIVEKYSAELPDEGDASLVVRSQLGCQFFFLVTRTLAHDQRELAKLVQTLIPRPVRLEVFPGLQRSVFKSSVFLGHHIIQIFMGAKKPFQDWSFVGLAQDFECPWRRLAIAELLKKFSVSVVEKVFDNPVALIPQHESDNEALIELVTNALRFALWIVEFYETETNEKSIKELAFLDHSSKTLLIESFTKFLQGKDVKDQDHLKRIIDALEKS 248 T 0.13 CbtA_toxin pdb F Eukaryota T 5cve 2 C,D D,E H2B_DROME N-terminal peptide from Histone H2B XPKTSGKAA 9 T 110 DUF6143 pdbhh F Eukaryota T 5cw7 1 A,C,E,G,I,K,M,O A,C,E,G,I,K,M,O Q8XAD5_ECO57 PAAA2 MDYKDDDDKNRALSPMVSEFETIEQENSYNEWLRAKVATSLADPRPAIPHDEVERRMAERFAKMRKERSKQ 71 T 0.059 HAGH_C pdb F Bacteria T 5cw9 1 A A De novo designed ferredoxin-ferredoxin domain insertion protein MEMDIRFRGDDPEAYYKALREMIRQARKFAGTVTVTLIIRFRGDDLEALEKALKEMIRQARKFAGTVTYTLDGNDLEIRITGVPPQVILELVKEAIRLAKEFNITVTVELVIRITGVPEQVRKELAKEAERLAKEFNITVTYTIRL 146 T 0.0034 Radical_SAM pdb F T 5cwe 1 A,B A,B Q82LM3_STRAW Cytochrome P450 hydroxylase MGNVIDLGEYGARFTEDPYPVYAELRERGPVHWVRTPPPEAFEGWLVVGHEEARAALADPRLSKDGTKKGLTSLDVELMGPYLLVVDPPEHTRLRSLVARAFTMRRVEALRPRIQEITDGLLDEMLPRGRADLVDSFAYPLPITVICELLGVPDIDRVTFRALSNEIVAPTGGDAELAAYERLAAYLDELIDDKRSTAPADDLLGDLIRTRAEDDDRLSGEELRAMAFILLVAGHETTVNLITNGVHTLLTHPDQLAALRADMTLLDGAVEEVLRFEGPVETATYRYAAESMEIGGTAIAEGDPVMIGLDAAGRDPARHPDPHVFDIHRAPQGHLAFGHGIHYCLGAPLARLEARVALRSLLERCPDLALDGPPGARPPGMLIRGVRRLPVRW 393 T 1.3E-06 p450 unppssm F Bacteria T 5cws 6 F,L F,L NIC96_CHATD NUCLEAR PORE PROTEIN NIC96 SALFDSLLARNKKQAEGETALGELPSLQLGLADLRQRLRKLGPSSDRPIEPGKAHYFLAASGVDPGAAVRDLGA 74 T 0.012 TFIIA unppssm F Eukaryota T 5cww 2 B B NUP82_CHATD NUCLEAR PORE PROTEIN NUP82 MPKIKSFAPAWLNEPAPGHKLFAPAADDGTATVPLAYGKKIKPGPRRTIARRGTEIFVACGKQIRWGDLAQLKESWESRPSRSSVGPTSTKKDSSDFDDGAATAGYRIIKTPVADDIRQLVMSPNQDFLAVLTSHTVHICILPDSSHLHIQDTTPFKPKFWTLGPTTHVTSRSAVVSAVWHPLGVNGHALVTVTEDAIVRVWELSTADRWTFDAPTLAIDLKKLADATYLDQDFGVSTSATNKGFSPDAFDMEVAAACFPTRDSGGWAPMTLWLAMTSGDVYALCPLLPQRWTPPPTLIPSLSASIVAKVAAAEDNPESTPEERLVAQQQLEWMSEIDNQEPKLVEEATGEATIEVYTRPSRPGLVPKLQGPFDFDLNPEDEQDDEVELKDIYVIGEKPRVADLMRGEEEELEMMKEDQHNGLSLNIICLLSTSGQVKICLDIDGVEAQWLPPRSKNKRLFAPPPEPPSLLTFQTFDTLKPAEVTPDGWPMFSEDATSPYSFYVTHPAGITYISLTPWVFRLESELQSDSEAGTEFRIDLLAKGQGSERDRIFTQTRTQSPLAAATSIDDPDLGYFILSATQTDPIALFFETPER 595 T 3E-15 Nup88 unppercent F Eukaryota T 5cww 3 C C NU159_CHATD NUCLEAR PORE PROTEIN NUP159 LRAREAKRKATLRMLRESLARVGPNVVRLRDD 32 T 0.0026 DUF5768 unppssm F Eukaryota T 5cx3 2 E,F,G,H E,F,G,H FYCO1_HUMAN ZINC FINGER FYVE DOMAIN-CONTAINING PROTEIN 7 RPPDDAVFDIITDEELCQIQESGSSLVPRGS 31 T 2.9 ComC pdbhh F Eukaryota T 5cxt 2 B,D,F,H,J,L,N,P,R B,D,F,H,J,L,N,P,R U2AF2_MOUSE U2 AUXILIARY FACTOR 65 KDA SUBUNIT,U2 SNRNP AUXILIARY FACTOR LARGE SUBUNIT GKKKVRKYWDVPPPGFEHITPMQYKAMQA 29 T 0.0013 Transformer unp F Eukaryota T 5cxv 2 B C FLAG peptide DYKDDDD 7 T 55 FIMP pdbhh F F 5cze 1 A,C,E,G A,C,I,K A0A0F6F6Q9_ECO57 PaaA2 MDYKDDDDKNRALSPMVSEFETIEQENSYNEWLRAKVATSLADPRPAIPHDEVERRMAERFAKMRKERSKQ 71 T 0.059 HAGH_C pdb F T 5czf 1 A,B A,B Q8XAD5_ECO57 PaaA2 METIEQENSYNEWLRAKVATSLADPRPAIPHDEVERRMAERFAKMRKERSKQ 52 T 0.044 HAGH_C pdb F Bacteria T 5czh 2 B B CITRULLINE--ASPARTATE LIGASE DEEDYXEIP 9 T 12 N-SET pdbhh F T 5czi 2 B B SHC1_HUMAN SHC-TRANSFORMING PROTEIN 3, SHC-TRANSFORMING PROTEIN A, SRC HOMOLOGY 2 DOMAIN-CONTAINING-TRANSFORMING PROTEIN C1, SH2 DOMAIN PROTEIN C1 PDHQYXNDF 9 T 2.4 HAND pdbhh F Eukaryota T 5d0j 2 E,F L,M G7-TEdFP peptide SFEGYDNSC 9 T 1.2 Crr6 pdbhh F T 5d13 2 E,F,G,H G,E,H,F CFMOC-KKETEV peptide XKKETEV 7 T 150 DUF1438 pdbhh F F 5d1c 2 C,D C,D HDAC8 Fluor de Lys tetrapeptide substrate XRHXXX 6 T 360 Viral_helicase1 pdbhh F F 5d1d 2 C,D C,D HDAC8 Fluor de Lys tetrapeptide substrate XRHXXX 6 T 360 Viral_helicase1 pdbhh F F 5d23 1 A A Q5FBS0_BOMMO UNCHARACTERIZED PROTEIN MHHHHHHETSEERAARLAKMSAYAAQRLANESPEQRATRLKRMSEYAAKRLSSETREQRAIRLARMSAYAARRLANETPAQRQARLLRMSAYAAKRQASKKS 102 T 0.072 DUF3752 pdbpssm F Eukaryota T 5d2a 2 C C ZDC-ALA-PRO-ALA-LYS-PHE-CYS-ALA-PRO-ALA-PHB-GAL APAKFC 6 T 28 DUF6267 pdbhh F F 5d2a 3 D,E D,E ZDC-ALA-PRO-ALA-LYS-PHE-CYS-ALA-PRO-ALA-PHB-GAL APA 3 T 520 DUF2721 pdbhh F F 5d2l 5 E,J,O,T Q,U,T,R PP65_HCMVT ASN-LEU-VAL-PRO-MET-VAL-ALA-THR-VAL NLVPMVATV 9 T 15 GDH_N pdbhh T Viruses T 5d2m 4 G G ZN451_HUMAN COACTIVATOR FOR STEROID RECEPTORS GAMDHVEFGSGDPGSEIIESVPPAGPEASESTTDENEDDIQFVSEGPLRPVLEYIDLVSSDDEEP 65 T 57 SelB-wing_3 pdbhh F Eukaryota T 5d2n 5 I,J G,I PP65_HCMVT ASN-LEU-VAL-PRO-MET-VAL-ALA-THR-VAL NLVPMVATV 9 T 15 GDH_N pdbhh T Viruses T 5d2q 1 A A Q5FBS0_BOMMO UNCHARACTERIZED PROTEIN MHHHHHHETSEERAARLAKMSAYAAQRLANESPEQRATRLKRMSEYAAKRLSSETREQRAIRLARMSAYAARRLANETPAQRQARLLRMSAYAAKRQASKKS 102 T 0.072 DUF3752 pdbpssm F Eukaryota T 5d2s 1 A A Q5FBS0_BOMMO UNCHARACTERIZED PROTEIN MHHHHHHETSEERAARLAKMSAYAAQRLANESPEQRATRLKRMSEYAAKRLSSETREQRAIRLARMSAYAARRLANETPAQRQARLLRMSAYAAKRQASKKS 102 T 0.072 DUF3752 pdbpssm F Eukaryota T 5d50 2 E,F,K,L,M,N,O,P E,G,M,O,F,H,N,P T1SA45_9CAUD Anti-repressor protein MQRQYHHPLEEGFEERIHTPVGVRSLVEDSHLMKLLRELDKDGFNVDGPLAELVALVNYVTSSQMTMQDLQTHLDYCAEQLRKQTT 86 T 0.0051 DUF724 pdb T Viruses T 5d5k 2 B B PARP2_HUMAN HPARP-2,ADP-RIBOSYLTRANSFERASE DIPHTHERIA TOXIN-LIKE 2,ARTD2,NAD(+) ADP-RIBOSYLTRANSFERASE 2,ADPRT-2,POLY[ADP-RIBOSE] SYNTHASE 2,PADPRT-2 MGSSHHHHHHSSGLVPRGSHMAARRRRSTGGGRARALNESKRVNNGNTAPEDSSPAKKTRRCQRQESKKMPVAGGKANKDRTEDKQDESVKALLLKGK 98 T 96 DUF5757 pdbhh F Eukaryota T 5d5y 1 A,B B,A G0SB31_CHATD CTSKN7 SQQQIAALSESLQATQQQLQALQQQCYELEKTNRLLVSEVMTLQKMVKAQ 50 T 0.0033 CENP-H pdb F Eukaryota T 5d5z 1 A,B,C,D,E A,B,C,D,E G0SB31_CHATD CTSKN7 SQQQIAALSESLQATQQQLQALQQQCYELEKTNRLLVSEVMTLQKMVKAQ 50 T 0.0033 CENP-H pdb F Eukaryota T 5d60 1 A,B,C,D A,B,C,D G0SB31_CHATD CTSKN7 SQQQIAALSESLQATQQQLQALQQQCYELEKTNRLLVSEVMTLQKMVKAQNQASNEIINHL 61 T 6.9E-05 Sulfotransfer_4 unphh F Eukaryota T 5d61 2 B L Z-VAD-fmk XVADX 5 T 1100 RE_HindIII pdbhh F F 5d62 2 B L Z-VAD-fmk XVADX 5 T 1100 RE_HindIII pdbhh F F 5d63 2 B L Z-VAD-fmk XVADX 5 T 1100 RE_HindIII pdbhh F F 5d8h 4 D D THCL_STRAJ THIOSTREPTON XIAXASXTXXXXTXXXXXX 19 T 0.93 CCER1 pdbhh F Bacteria F 5d94 2 B B FYCO1_HUMAN Peptide from FYVE and coiled-coil domain-containing protein 1 DDAVFDIITDEEL 13 T 4.5 Flavi_NS2B pdbhh F Eukaryota T 5d9e 1 A A D5VKJ8_CAUST Caulosegnin II GTLTPGLPEDFLPGHYMMPG 20 T 0.087 DUF5974 unp F Bacteria T 5d9g 2 C,D C,D oligo peptide ENLYFQ 6 T 40 Phage_holin_2_4 pdbhh F T 5d9q 1 A,F,K G,A,J Q2N0S6_9HIV1 Envelope glycoprotein gp120 AENLWVTVYYGVPVWKDAETTLFCASDAKAYETEKHNVWATHACVPTDPNPQEIHLENVTEEFNMWKNNMVEQMHTDIISLWDQSLKPCVKLTPLCVTLQCTNVTNNITDDMRGELKNCSFNMTTELRDKKQKVYSLFYRLDVVQINENQGNRSNNSNKEYRLINCNTSAITQACPKVSFEPIPIHYCAPAGFAILKCKDKKFNGTGPCPSVSTVQCTHGIKPVVSTQLLLNGSLAEEEVMIRSENITNNAKNILVQFNTPVQINCTRPNNNTRKSIRIGPGQAFYATGDIIGDIRQAHCNVSKATWNETLGKVVKQLRKHFGNNTIIRFANSSGGDLEVTTHSFNCGGEFFYCNTSGLFNSTWISNTSVQGSNSTGSNDSITLPCRIKQIINMWQRIGQAMYAPPIQGVIRCVSNITGLILTRDGGANNTSTETFRPGGGDMRDNWRSELYKYKVVKIEPLGVAPTRCKRR 472 T 3.5E-54 GP120 pdbpercent T Viruses T 5da7 2 C,D B,E Q5JH72_THEKO Thermococcales inhibitor of PCNA MDRKLDEFIGDATPKKVSKEKPVRRKKRLKPTSLDSFLPEEHINYFRDLRIGSKKIRNAKIEEL 64 T 0.011 Inos-1-P_synth unppercent F Archaea T 5dai 2 B B FEN_THEKO C-terminus of FEN-1 protein KQRTLESWFGR 11 T 0.15 PaaX unppercent F Archaea T 5dat 81 XD m2 60S ribosomal protein L12-A (uL11) XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 150 F F F 5dat 84 EF,FF p1,p2 60S ribosomal protein P1 alpha/P2 beta XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 47 F F F 5dat 86 HF l1 60S ribosomal protein L1-A (uL1) XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 213 F F F 5day 1 A,B A,B NRP1_ARATH NUCLEOSOME/CHROMATIN ASSEMBLY FACTOR GROUP A6,PROTEIN SET HOMOLOG 1 SNLEQIDAELVLSIEKLQEIQDDLEKINEKASDEVLEVEQKYNVIRKPVYDKRNEVIQSIPGFWMTAFLSHPALGDLLTEEDQKIFKYLNSLEVEDAKDVKSGYSITFHFTSNPFFEDAKLTKTFTFLEEGTTKITATPIKWKEGKGLPNGVNHDDKKGNKRALPEESFFTWFTDAQHKEDAGDEIHDEVADIIKEDLWSNPLTYFNN 208 T 1.1E-10 NAP pdbpercent F Eukaryota T 5dbe 2 B A CYSE_SALTY SAT,SERINE TRANSACETYLASE HHTFEYGDGI 10 T 3.3 DUF2023 pdbhh F Bacteria T 5dbr 2 B C SCN5A_HUMAN HH1,SODIUM CHANNEL PROTEIN CARDIAC MUSCLE SUBUNIT ALPHA,SODIUM CHANNEL PROTEIN TYPE V SUBUNIT ALPHA,VOLTAGE-GATED SODIUM CHANNEL SUBUNIT ALPHA NAV1.5 GPGSQDIFMTEEQKKYYNAMKKLGSKKPQKPIPRPLNKYQGFIFDIVTKQA 51 T 7.9 NPA pdbhh F Eukaryota T 5dc3 81 XD m2 60S ribosomal protein L12-A (uL11) XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 150 F F F 5dc3 83 EF p1 60S ribosomal protein P1 alpha XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 47 F F F 5dc3 84 FF p2 60S ribosomal P2 beta XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 46 F F F 5dc6 2 C,D C,D Fluor-de-Lys tetrapeptide assay substrate XRHXXX 6 T 360 Viral_helicase1 pdbhh F F 5dc7 2 C,D C,D Fluor-de-Lys tetrapeptide assay substrate XRHXXX 6 T 360 Viral_helicase1 pdbhh F F 5dc8 2 C,D C,D Fluor-de-Lys tetrapeptide assay substrate XRHXXX 6 T 360 Viral_helicase1 pdbhh F F 5de2 2 C,D C,D NEK9_MOUSE NERCC1 KINASE,NEVER IN MITOSIS A-RELATED KINASE 9,NIMA-RELATED PROTEIN KINASE 9 GWLRKELENAEFIPMPDSP 19 T 2.2 DUF1456 pdbhh F Eukaryota T 5def 3 C C VIPR1_HUMAN peptide derived from VASOACTIVE INTESTINAL POLYPEPTIDE RECEPTOR 1 (pVIPR) RRKWRRWHL 9 F F Eukaryota F 5deg 3 C C VIPR1_HUMAN Peptide derived of VASOACTIVE INTESTINAL POLYPEPTIDE RECEPTOR 1 (pVIPR) RRKWRRWHL 9 F F Eukaryota F 5df6 2 B,C B,C TXNIP_HUMAN txnip KFMPPPTXTEVDX 13 T 0.53 Tryp_FSAP pdbhh F Eukaryota T 5dfn 1 A,B A,B Q6JXI5_TETTH Telomerase associated protein p45 GWKQQQIPQIKSNQENINTLKYKELIAGELMRITHKLLIQKLQQQQPANNNKQINEMDVESNELAEKKEVIIKIQEIAKDQQLYDTLSIQYQVDQKEQYYAKIAQSLEDFVSISALKMVSYIYPNISYQVSIGFFQNILDIATKTVKDRGALGCNYKYLKDKLTKALNLQQISYPLISESYISYLVHLFQDFNIIEIENEHKFYYKQAFQYDDS 214 T 0.014 DUF2204 pdb F Eukaryota T 5dfz 6 F G Putative N-terminal domain of S. cerevisiae Vps30 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 49 F F F 5dgo 1 A A CDC45_HUMAN PORC-PI-1 MFVSDFRKEFYEVVQSQRVLLFVASDVDALCACKILQALFQCDHVQYTLVPVSGWQELETAFLEHKEQFHYFILINCGANVDLLDILQPDEDTIFFVCDTHRPVNVVNVYNDTQIKLLIKQDDDLEVPAYEDIFRDEEEDEEHSGNDSDGSEPVEQTMRRRQRREWEARRRDILFDYEQYEYHGTSSAMVMFELAWMLSKDLNDMLWWAIVGLTDQWVQDKITQMKYVTDVGVLQRHVSRHNHRNEDEENTLSVDCTRISFEYDLRLVLYQHWSLHDSLCNTSYTAARFKLWSVHGQKRLQEFLADMGLPLKQVKQKFQAMDISLKENLREMIEESANKFGMKDMRVQTFSIHFGFKHKFLASDVVFATMSLMESPEKDGSGTDHFIQALDSLSRSNLDKLYHGLELAKKQLRATQQTIASCLCTNLVISQGPFLYCSLMEGTPDVMLFSRPASLSLLSKHLLKSFVCSTKNRRCKLLPLVMAAPLSMEHGTVTVVGIPPETDSSDRKNFFGRAFEKAAESTSSRMLHNHFDLSVIELKAEDRSKFLDALISLLS 555 T 7E-37 CDC45 pdbpercent F Eukaryota T 5dgv 81 XD m2 60S ribosomal protein L12-A (uL11) XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 160 F F F 5dgv 83 EF p1 60S ribosomal protein P1 alpha/P2 beta XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 47 F F F 5dgv 84 FF p2 60S ribosomal protein P1 alpha/P2 beta XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 46 F F F 5dha 4 D D Engineered Nuclear Export Signal Peptide (CPEB4 NES reverse mutant) GGSYRMIDILSSELSHMDFTR 21 T 5.8 SAC3 pdbhh F T 5dhf 4 D D RIOK2_HUMAN Serine/threonine-protein kinase RIO2 GGSYRSFEMTEFNQALEEI 19 T 0.9 RhoGEF67_u1 pdbhh F Eukaryota T 5di8 3 C C Fc-III peptide DCAWHLGELVWCT 13 T 6.1 FAT pdbhh F T 5di9 4 D D Engineered Nuclear Export Signal Peptide (hRio2 NES reverse mutant) GGSYGKIEELAQNFETMEFSR 21 T 1.6 OTOS pdbhh F T 5dif 4 D D CPEB4_HUMAN Cytoplasmic polyadenylation element-binding protein 4 GGSYRTFDMHSLESSLIDI 19 T 6.4 DUF1959 pdbhh F Eukaryota T 5dir 2 E,F,G,H E,F,G,H Globomycin XXSXX 5 T 380 GLF pdbhh F F 5diy 1 A,B P,Q TAB1_HUMAN MITOGEN-ACTIVATED PROTEIN KINASE KINASE KINASE 7-INTERACTING PROTEIN 1,TGF-BETA-ACTIVATED KINASE 1-BINDING PROTEIN 1,TAK1-BINDING PROTEIN 1 VPYSSAQ 7 T 18 UPF0160 pdbhh F Eukaryota T 5dj0 3 C C Fc-III peptide DCAWHLGELVWCT 13 T 6.1 FAT pdbhh F T 5dj2 3 C C Fc-III peptide DCAWHLGELVWCT 13 T 6.1 FAT pdbhh F T 5dj6 3 C C Fc-III peptide DCAWHLGELVWCT 13 T 6.1 FAT pdbhh F T 5dj8 3 C C Fc-III peptide DCAWHLGELVWCT 13 T 6.1 FAT pdbhh F T 5dja 3 C C Fc-III peptide DCAWHLGELVWCT 13 T 6.1 FAT pdbhh F T 5djc 3 C,F C,F Fc-III peptide DCAWHLGELVWCT 13 T 6.1 FAT pdbhh F T 5djd 3 C C Fc-III peptide DCAWHLGELVWCT 13 T 6.1 FAT pdbhh F T 5djn 1 A,B A,C F8VQ75_MOUSE Kinesin-like protein GPGSHAVVNEDPNAKVIRELREEVEKLREQLSQAEAMKAELKEKLEESEKLIKELTVTWEEKLRKTEAIAQERQRQLESMGISLETSGIKVGDD 94 T 2.1E-08 Kinesin_assoc pdb F Eukaryota T 5djq 4 M,N,O,P N,O,P,Q H7ESS5_PSEST Putative uncharacterized protein MFVDNVVLAGVVTVGLMVAFLAGFGYFIWRDAHKKS 36 T 0.0048 FixQ pdbhh F T 5djx 3 C,F C,F Fc-III peptide DCAWHLGELVWCT 13 T 6.1 FAT pdbhh F T 5djy 3 C C Fc-III peptide DCAWHLGELVWCT 13 T 6.1 FAT pdbhh F T 5djz 3 C C Fc-III peptide DCAWHLGELVWCT 13 T 6.1 FAT pdbhh F T 5dk0 3 C C Fc-III peptide DCAWHLGELVWCT 13 T 6.1 FAT pdbhh F T 5dk2 3 C C Fc-III peptide DCAWHLGELVWCT 13 T 6.1 FAT pdbhh F T 5dkp 2 AB,BB,CA,CB,DA,DB,EA,FA,GA,HA,IA,JA,KA,LA,MA,NA,OA,PA,QA,RA,SA,TA,UA,VA,WA,XA,YA,ZA 0,1,O,2,P,3,Q,R,S,T,U,V,W,X,Y,Z,o,p,q,r,s,t,u,v,w,x,y,z agonist ADEP A54556 XFSPXAX 7 T 45 Acp26Ab pdbhh F F 5dmg 3 G,H,I Z,P,X TAU_HUMAN Microtubule-associated protein SIDMVDSPQLATLAD 15 T 19 YkpC pdbhh F Eukaryota T 5dms 2 B,D B,D FBX43_MOUSE ENDOGENOUS MEIOTIC INHIBITOR 2 FSQHKTSTI 9 T 5.9 SVS_QK pdbhh F Eukaryota T 5dmu 1 A A D1Z0H5_METPS NHEJ Polymerase GLVPRGSHMTEVLHIEGHDIKVTNPDKVLFPEDGITKGELVDYYRRISGVMVPLVRGRPMTMQRFPDGIGKEGFFQKEASDYFPDWVHRATLELGKGGIQHQVVCDDAATLVYLASQAMITPHVFLSRIDKVHYPDRLIFDLDPPDNNFETVRSAAKTIREALDAEGYPVYLMTTGSRGLHVVVPLDRSADFDTVRAFARGFGEKLTKKYPDRFTIELSKEKRRGRLFLDYLRNSYGQTGVAPYGVRARSGAPVATPITWDELDDISGSQEYNIRNIMGRMDKRGDAWKYIDKDRTSIKNL 301 T 0.0052 S-methyl_trans unppercent F Archaea T 5dmv 2 B D FBX43_MOUSE ENDOGENOUS MEIOTIC INHIBITOR 2 SPLVTSTIKTEDVVSNSQNSRLHFSQHKTSTI 32 T 4 SVS_QK pdbhh F Eukaryota T 5dn6 1 A 1 Chain A XXXXXXXXXXXXXXXXXXXX 20 F F F 5dn6 2 B 2 Chain B XXXXXXXXXXXXXXX 15 F F F 5dn6 3 C 3 Chain C XXXXXXXXXXXXXXXXXXX 19 F F F 5dn6 10 Y V Chain V XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 78 F F F 5dn6 11 Z W Chain W XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 124 F F F 5dn6 13 BA Y Chain Y XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 54 F F F 5dnj 2 B B peptide 707-56A-SER-TPO-NH2 XXSTX 5 T 600 zf-H2C2_5 pdbhh F F 5dof 1 A,B,C,D A,B,C,D D2CVN7_TETTH P19 QQPKRNFDLYKLITDKQIDFQVADLIQDEQSSFVSVRIYGQFKCFVPKSTIQEQLDKIKNLSSKELAKNKIFKFLSEYNKNNQKQDELSHDYYGYFKVQQHQFILNLENAQREASLAVDDFYFINGRIYKTNHDILILQAHHVYQMQKPTLQLLQAASEINQN 163 T 0.53 TMF_DNA_bd pdb F Eukaryota T 5doi 1 A,B,C,D A,B,C,D D2CVN7_TETTH P19 QQPKRNFDLYKLITDKQIDFQVADLIQDEQSSFVSVRIYGQFKCFVPKSTIQEQLDKIKNLSSKELAKNKIFKFLSEYNKNNQKQDELSHDYYGYFKVQQHQFILNLENAQREASLAVDDFYFINGRIYKTNHDILILQAHHVYQMQKPTLQLLQAASEINQN 163 T 0.53 TMF_DNA_bd pdb F Eukaryota T 5doi 2 E,F,G,H E,F,G,H Q6JXI5_TETTH P45 EDNFELVFLKELPSLPDFSKVCFTGLILSFSNFPSSEQNQQKDVPHKIAIIQDSTGEAELFLDMYKFCQEEISVFKAITGIGVLKKKNIGAGQVCKIIVERFRIIHSADEEMLQYLLIQKYKLSKTLN 128 T 0.21 YkpC pdb F Eukaryota T 5dok 1 A,B A,B Q6JXI5_TETTH P45 KSNQENINSLKYKELIAGELMRITHKLLIQKLQQQQPANNNKQINEMDVESNELAEKKEVIIKIQEIAKDQQLYDTLSIQYQVDQKEQYYAKIAQSLEDFVSISALKMVSYIYPNISYQVSIGFFQNILDIATKTVKDRGALGCNYKYLKDKLTKALNLQQISYPLISESYISYLVHLFQDFNIIEIENEHKFYYKQAFQYDDS 204 T 0.013 DUF2204 pdb F Eukaryota T 5doq 3 C C A0A0Q0UXS2_9BACI Putative membrane protein MQTFLIMYAPMVVVALSVVAAFWVGLKDVHVNE 33 T 0.16 FixS pdbpssm F Bacteria T 5dow 2 B,D,F,H B,D,F,H S26A3_MOUSE SLC26A3 TRANSPORTER, DOWN-REGULATED IN ADENOMA, PROTEIN DRA, SOLUTE CARRIER FAMILY 26 MEMBER 3 KRNKALKKIRKLQKRGLIQMTX 22 T 0.82 DUF2786 pdbhh F Eukaryota T 5dpw 2 B,D,F,H,J,L,N,P B,D,F,H,J,L,N,P PKHM1_HUMAN PH DOMAIN-CONTAINING FAMILY M MEMBER 1,162 KDA ADAPTER PROTEIN,AP162 PQQEDEWVNVQYPD 14 T 2 DOR pdbhh F Eukaryota T 5dqs 2 B D EF1B_HUMAN EF-1-BETA GAMGFGDLKSPAGLQVLNDYLADKSYIEGYVPSQADVAVFEAVSSPPPADLCHALRWYNHIKSYEKEKASLPGVKKALGKYGPADVEDTT 90 T 0.00027 GST_C_4 pdbhh F Eukaryota T 5drv 2 B B POLN_SFV NSP3 LTFGDFDE 8 T 0.16 DUF5102 pdbhh T Viruses T 5ds8 3 E P Tetrapeptide GLY-HPU-GLY-ALA GXGA 4 T 140 IN_DBD_C pdbhh F F 5dsc 3 I,J,K,L P,Q,M,N Peptide: GLY-HPU-GLY-SER-GLY GXGSG 5 T 57 Tubulin_2 pdbhh F F 5dtf 3 E P Peptide: GLY-5CT-GLY-ALA GXGA 4 T 140 IN_DBD_C pdbhh F F 5dub 3 E P Peptide: GLY-5GG-GLY-ALA GXGA 4 T 140 IN_DBD_C pdbhh F F 5dvk 2 B B Ig gamma-1 chain C region DCAWHLGELVWCT 13 T 6.1 FAT pdbhh F T 5dvl 2 B B Fc-III peptide DCAWHLGELVWCT 13 T 6.1 FAT pdbhh F T 5dvm 2 B B Fc-III peptide DCAWHLGELVWCT 13 T 6.1 FAT pdbhh F T 5dvn 2 B B Fc-III peptide DCAWHLGELVWCT 13 T 6.1 FAT pdbhh F T 5dws 2 B,D,F,H B,D,F,H TXNIP_HUMAN txnip XTPEAPPCYMDVIX 14 T 14 MYEOV2 pdbhh F Eukaryota T 5dx1 2 E,F,G,H F,G,H,I PABP1_HUMAN PABP1 peptide NMPGAIRPAAPXPPFSTMX 19 T 65 MIP-T3 pdbhh F Eukaryota T 5dx8 2 E,F,G,H E,F,G,H PABP1_HUMAN POLY(A)-BINDING PROTEIN 1 NMPGAIXPAAPXPPFSTMX 19 T 65 MIP-T3 pdbhh F Eukaryota T 5dxa 2 E,F,G F,G,I PABP1_HUMAN POLY(A)-BINDING PROTEIN 1 NMPGAIXPAAPXPPFSTMX 19 T 62 Phage_ABA_S pdbhh F Eukaryota T 5dzd 2 C,D C,D TXNIP_HUMAN TXNIP, THIOREDOXIN-BINDING PROTEIN 2, VITAMIN D3 UP-REGULATED PROTEIN 1 XTPEAPPCYMDVIX 14 T 14 MYEOV2 pdbhh F Eukaryota T 5dzk 2 B,BA,BB,D,DA,DB,F,FA,H,HA,J,JA,L,LA,N,NA,P,PA,R,RA,T,TA,V,VA,X,XA,Z,ZA O,2,3,P,o,4,Q,p,R,q,S,r,T,s,U,t,V,u,W,v,X,w,Y,x,Z,y,1,z BEZ-LEU-LEU XLL 3 T 1400 EF-hand_1 pdbhh F F 5e0l 2 B C FA83D_HUMAN Protein Chica peptide SYRKAIDAATQTEE 14 T 0.054 TMEM131_like unppercent F Eukaryota T 5e0m 2 B C FA83D_HUMAN Protein Chica peptide SYWSRSTTTQTDM 13 T 0.054 TMEM131_like unppercent F Eukaryota T 5e0u 2 D,E,F D,E,F CDN1A_HUMAN CDK-INTERACTING PROTEIN 1,MELANOMA DIFFERENTIATION-ASSOCIATED PROTEIN 6,MDA-6,P21 CGRKRRQTSMTDFYHSKRRLIFS 23 T 0.94 CDC27 pdbhh F Eukaryota T 5e0v 2 C,D C,D FEN1_HUMAN FEN-1,FLAP STRUCTURE-SPECIFIC ENDONUCLEASE 1 STQGRLDDFFKVTGSL 16 T 0.15 LRV_FeS pdbhh F Eukaryota T 5e1b 2 C,D D,E RCC1_HUMAN RCC1 SPKRIA 6 T 9.3 DUF1107 pdbhh F Eukaryota T 5e1d 2 C,D D,E RCC1_HUMAN RCC1 YPKRIA 6 T 5.6 DUF1719 pdbhh F Eukaryota T 5e1m 2 C,D D,E RCC1_HUMAN RCC1 PPKRIA 6 T 68 DUF6003 pdbhh F Eukaryota F 5e1o 2 C,D D,E RCC1_HUMAN RCC1 RPKRIA 6 T 1 S_tail_recep_bd pdbhh F Eukaryota F 5e24 2 B,D B,D HLES_DROME Protein hairless GGRLQFFKDGKFILELARSKDGDKSGWVSVTRKTFRPP 38 T 0.43 Tryp_FSAP pdbhh F Eukaryota T 5e2a 2 C,D D,E RCC1_HUMAN RCC1 XPKRIA 6 T 7.2 DUF5394 pdbhh F Eukaryota T 5e2b 2 C,D D,E RCC1_HUMAN RCC1 XPKRIA 6 T 7.2 DUF5394 pdbhh F Eukaryota T 5e2q 2 B B ANGT_HUMAN SERPIN A8 DRVYIHPF 8 T 0.67 Nairo_nucleo pdbhh F Eukaryota T 5e2v 3 C P TAU_HUMAN TAU PHOSPHOPEPTIDE RSGYSSPGSPGTPGSRSR 18 T 15 Tachystatin_A pdbhh F Eukaryota T 5e2w 3 C P TAU_HUMAN TAU-PHOSPHOPEPTIDE RSGYSSPGSPGTPGSRSR 18 T 15 Tachystatin_A pdbhh F Eukaryota T 5e33 2 B B PENK_HUMAN Met-enkephalin YGGFM 5 T 1.5 Op_neuropeptide pdb F Eukaryota F 5e3a 2 B B PENK_HUMAN Leu-enkephalin YGGFL 5 T 25 FANCL_d1 pdbhh F Eukaryota F 5e3c 2 B B IVYPW IVYPW 5 T 23 Cas9_REC pdbhh F F 5e4w 3 E,F E,F ALB3_ARATH Inner membrane protein ALBINO3, chloroplastic SKRSKRKRT 9 T 0.79 CDC45 unppssm F Eukaryota F 5e5a 6 K K VIE1_HCMVT C-terminal domain of Regulatory protein IE1 GGKSTHPMVTRSKADQ 16 T 20 p53-inducible11 pdbhh T Viruses T 5e5v 1 A,B A,B NFGAILS (22-28) from islet amyloid polypeptide, synthesized NFGAILS 7 T 3.9 SidC_N pdbhh F T 5e5x 1 A A ANFLVH (residues 13-18) from islet amyloid polypeptide ANFLVH 6 T 5 DUF1160 pdbhh F T 5e5z 1 A A LVHSSN (residues 16-21) from islet amyloid polypeptide LVHSSN 6 T 36 DUF6039 pdbhh F F 5e61 1 A,B A,B FGAILSS (residues 23-29) from islet amyloid polypeptide FGAILSS 7 T 6.2 Amelotin pdbhh F T 5e6q 2 B A XRCC1_HUMAN X-RAY REPAIR CROSS-COMPLEMENTING PROTEIN 1 SPKGKRKLDLNQEEKKTPSKPPAQLSPSVPKRPKLP 36 T 0.21 XRCC1_N pdbhh F Eukaryota T 5e7t 2 B B Q9AYV4_BPTU2 Minor structural protein 5 MADKNYLHTAYANSADGTDGFTTVYPNLNLLVNSSAKNKEGFFKNFDKVENGYGEVTMKGTNAWVNKDLGEGFSIQPINYKPGDKYTMSVDVMFTSWNVPAGTTISAFWMRQRYTENSWKEICTIDLPKDPSKMLNQWIRITQTSTIPPYEDPSVGTQAILNVGFFGQQEGSFTIRVRNPKQELGSIATPYMPSASEVTTADWPKFVGTYVDTNPVSSTVSSKYDWDEMKYRVYLDGTPVGGSKLLSFDLENLKAGTSYNVQVSQINGNVESDKSESVAFKTTLPK 286 T 0.012 CBM_4_9 pdbpercent T Viruses T 5e8f 2 C,D D,E PDE6C_HUMAN CGMP PHOSPHODIESTERASE 6C KSKTC 5 T 74 DUF2501 pdbhh F Eukaryota F 5e8n 3 C,F,I,L C,F,I,L TRH4, CERS5,LAG1 LONGEVITY ASSURANCE HOMOLOG 5,TRANSLOCATING CHAIN-ASSOCIATING MEMBRANE PROTEIN HOMOLOG 4,TRAM HOMOLOG 4 MCLRMTAVM 9 T 12 Adeno_E4_ORF3 pdbhh F T 5e8o 2 B,F C,F CERS5,LAG1 LONGEVITY ASSURANCE HOMOLOG 5,TRANSLOCATING CHAIN-ASSOCIATING MEMBRANE PROTEIN HOMOLOG 4,TRAM HOMOLOG 4 MXLRMTAVM 9 T 21 HV_small_capsid pdbhh F T 5e8p 3 C,F C,F CERS5,LAG1 LONGEVITY ASSURANCE HOMOLOG 5,TRANSLOCATING CHAIN-ASSOCIATING MEMBRANE PROTEIN HOMOLOG 4,TRAM HOMOLOG 4 MCLRXTAVM 9 T 19 DUF6401 pdbhh F T 5eay 2 E,F,G,H E,F,G,H DNA2_HUMAN HDNA2,DNA REPLICATION ATP-DEPENDENT HELICASE-LIKE HOMOLOG NELELLMEKSFWE 13 T 4.5 Retinal pdbhh F Eukaryota T 5ec5 1 A,B,C,D,E,F,G,H,I,J,K,L,M,N,O,P,Q,R A,B,C,D,E,F,G,H,I,J,K,L,M,N,O,P,R,S TXL_EISFE EFL1 GMSAKAAEGYEQIEVDVVAVWKEGYVYENRGSTSVDQKITITKGMKNVNSETRTVTATHSIGSTISTGDAFEIGSVEVSYSHSHQKSQVSMTQTEVYSSKVIEHTITIPPTSKFTRWQLNADVGGAGIEYMYLIDEVTPIGGTQSIPQVITSRAKIIVGRQIILGKTEIRIKHAERKEYMTVVSRKSWPAATLGHSKLFKFVLYEDWGGFRIKTLNTMYSGYEYAYSSDQGGIYFDQGTDNPKQRWAINKSLPLRHGDVVTFMNKYFTRSGLCYDDGPATNVYCLDKREDKWILEVVG 298 T 0.027 Toxin_10 unphh F Eukaryota T 5ecg 3 E,F E,F SEP-GLN-GLU-TYR SQEY 4 T 180 DUF4535 pdbhh F F 5ed9 1 A,B,C A,B,C SUN2_MOUSE PROTEIN UNC-84 HOMOLOG B,SAD1/UNC-84 PROTEIN-LIKE 2 GPGSEFKSMTQEAFQESSVKELGRLEAQLASLRQELAALTLKQNSVADEVGLLPQKIQAARADVESQFPDWIRQFLLG 78 T 0.0003 ADIP pdbpercent F Eukaryota T 5eel 2 G,H,I,J,K,L L,M,N,P,Q,R Bicyclic Peptide Inhibitor SFEGYDNXX 9 T 0.52 DUF4911 pdbhh F T 5eeq 2 C,D L,M Bicyclic Peptide Inhibitor SFEGYDNSFPXX 12 T 0.6 Rox3 pdbhh F T 5ef5 1 A,B E,A Raptor from Chaetomium thermophilum XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 1029 F F F 5efi 3 C C p99p YEHDFHHIREKGNHWKNFLAVM 22 T 7.9 DUF1925 pdbhh F T 5efj 2 B F TOXIN I XXAX 4 F F F 5efk 2 B A alpha tubulin K40 tripeptide SDX 3 T 350 Coat_F pdbhh F F 5efn 2 B,D F,E histone H4 tripeptide RGX 3 T 500 KRAB pdbhh F F 5efv 1 A,B,C A,B,C Q8SDT4_BPPHA MINOR STRUCTURAL PROTEIN HHHHHHLVPRGSMSNKLITDLSRVFDYRYVDENEYNFKLISDMLTDFNFSLEYHRNKEVFAHDGEQIKYEHLNVTSNVSDFLTYLNGRFSNMVLGHNGDGINEVKDARVDNTGYGHKTLQDRLYHDYSTLDVFTKKVEKAVDEHYKEYRATEYRFEPKEQEPEFITDLSPYTNAVMQSFWVDPRTKIIYMTQARPGNHYMLSRLKPNGQFIDRLLVKNGGHGTHNAYRYIDGELWIYSAVLDSNKNNKFVRFQYRTGEITYGNEMQDVMPNIFNDRYTSAIYNPVENLMIFRREYKPTERQLKNSLNFVEVRSADDIDKGIDKVLYQMDIPMEYTSDTQPMQGITYDAGILYWYTGDSNTANPNYLQGFDIKTKELLFKRRIDIGGVNNNFKGDFQEAEGLDMYYDLETGRKALLIGVTIGPGNNRHHSIYSIGQRGVNQFLKNIAPQVSMTDSGGRVKPLPIQNPAYLSDITEVGHYYIYTQDTQNALDFPLPKAFRDAGWFLDVLPGHYNGALRQVLTRNSTGRNMLKFERVIDIFNKKNNGAWNFCPQNAGYWEHIPKSITKLSDLKIVGLDFYITTEESNRFTDFPKDFKGIAGWILEVKSNTPGNTTQVLRRNNFPSAHQFLVRNFGTGGVGKWSLFEGKVVE 648 T 0.1 Baculo_PEP_C pdbpercent T Viruses T 5eg2 2 B B TAF10_HUMAN STAF28,TRANSCRIPTION INITIATION FACTOR TFIID 30 KDA SUBUNIT,TAFII30 SKSKDRKYTL 10 T 0.0084 TFIID-31kDa unphh F Eukaryota T 5eha 1 A A G1K3P4_AGABI Lectin-like fold protein ARKIPLDLPGTRILNGANWANNSATENLATNSGTLIIFDQSTPGQDADRWLIHNYLDGYKIFNMGSNNWASVSRGNTVLGVSEFDGQTCKWSIEYSGNGEEFWIRVPREGGGGAVWTIKPASSQGPTTVFLDLLKETDPNQRIKFAVENLYFQ 153 T 0.002 Inhibitor_I66 unphh F Eukaryota T 5ehb 1 A,B A,B pHiosYI TDKIXDALEKLAEIQKEIAEFLRELIEAAEKT 32 T 0.1 Ribosomal_L22 pdb F T 5ehc 2 B B IF4G1_HUMAN EIF-4G1,P220 KKRYDREFLLGFQF 14 T 0.00025 eIF_4G1 unphh F Eukaryota T 5ehh 2 B B Endomorphin-2 YPFFX 5 T 32 ATP-synt_J pdbhh F F 5ei3 2 B B IF4G1_HUMAN Eukaryotic translation initiation factor 4 gamma KKRYDREFLLGFQF 14 T 0.00025 eIF_4G1 unphh F Eukaryota T 5eib 4 D F CENPJ_HUMAN CENP-J,CENTROSOMAL P4.1-ASSOCIATED PROTEIN,LAG-3-ASSOCIATED PROTEIN,LYST-INTERACTING PROTEIN 1 KQPFLKRGEGLARFTNAKSKFQK 23 T 7 DUF6200 pdbhh F Eukaryota T 5eir 2 B B IF4G1_HUMAN Eukaryotic translation initiation factor 4 gamma 1 KKRYDREFLLGFQF 14 T 0.00025 eIF_4G1 unphh F Eukaryota T 5eiv 2 C,D,E,F,G,H,I,J,K C,D,E,F,G,H,I,J,K GLY-PRO-HYP-GLY-PRO-HYP-GLY-PRO-HYP-GLY-PRO-ALA-GLY-PHE-HYP-GLY-PRO-HYP-GLY-PRO-HYP XGPPGPPGPPGPAGFPGPPGPP 22 T 0.0011 Collagen pdbpssm F F 5eiy 3 C D poly(unk) XXXXXXXXX 9 F F F 5ej1 3 C D poly(unk) XXXXXXX 7 F F F 5ejo 1 A A RLF2_YEAST CAF-1 90 KDA SUBUNIT,RAP1 LOCALIZATION FACTOR 2 GPLGSMKQKAMITDPMDLLRLFDGVQDSTFSLGTVTEIAQKNLPQYNKQTIKNTIKEYAIRSSGKGDLPRKWVIKDAQNWENLRANANMPTPSL 94 T 0.00059 CAF1-p150_C2 unphh F Eukaryota T 5ejv 2 B,D C,D EBI96 Coactivator Peptide VESEFPYLLSLLGEVSPQP 19 T 0.82 DUF4576 pdbhh F T 5ejz 3 C D poly(unk) XXXXXXXXX 9 F F F 5ekf 1 A,B B,C ERCC5_HUMAN DNA EXCISION REPAIR PROTEIN ERCC-5,XERODERMA PIGMENTOSUM GROUP G-COMPLEMENTING PROTEIN KTQKRGITNTLEESSSLKRKRLSD 24 T 7 DUF503 pdbhh F Eukaryota T 5ekg 1 A,B B,C ERCC5_HUMAN XPG2 peptide VFGKKRRKLRRARGRKRKT 19 T 6.4 AT_hook pdbhh F Eukaryota T 5elq 2 B,D P,C DGKZ_HUMAN GLU-ASP-GLN-GLU-THR-ALA-VAL REDQETAV 8 T 33 SNN_linker pdbhh F Eukaryota T 5ema 2 B B LRC3B_HUMAN ASP-ASP-ILE-SEP-THR-VAL-VAL PDDISTVV 8 T 29 B2 pdbhh F Eukaryota T 5emb 2 B B PTH1R_HUMAN GLU-GLU-TRP-SEP-THR-VAL-MET QEEWSTVM 8 T 0.54 Prp19 pdbhh F Eukaryota T 5emg 1 A,B,C,D A,B,C,D GPN-CPN-TPN-GPN-CPN-TPN-GPN-CPN XXXXXXXX 8 F F F 5en0 2 B B GNAT3_BOVIN GUSTDUCIN ALPHA-3 CHAIN ILENLKDVGLF 11 T 2.1 Phage_holin_4_1 pdbhh F Eukaryota T 5enw 3 C C Peptide G9L GLKEGIPAL 9 T 10 CENP-M pdbhh F T 5eoa 2 C,D C,D TBK1_HUMAN NF-KAPPA-B-ACTIVATING KINASE,T2K,TANK-BINDING KINASE 1 GPGSYPSSNTLVEMTLGMKKLKEEMEGVVKELAENNHILERFGSLTMDGGLRNVDCL 57 T 0.0016 DUF713 unp F Eukaryota T 5eod 2 B B LP2 EFPDFP 6 T 0.4 DNA_pol3_beta pdbhh F F 5eof 2 C,D C,D TBK1_HUMAN NF-KAPPA-B-ACTIVATING KINASE,T2K,TANK-BINDING KINASE 1 GPGSYPSSNTLVEMTLGMKKLKEEMEGVVKELAENNHILERFGSLTMDGGLRNVDCL 57 T 0.0016 DUF713 unp F Eukaryota T 5eoj 1 A,B,C A,B,C ACC-Hex-PheI XELKAIAQEFKAIAKEFKAIAXEFKAIAQKX 31 T 2.6 DUF5741 pdbhh F T 5eok 2 B K P39 HIYPDFPTD 9 T 5.5 DUF4012 pdbhh F T 5eon 1 A,B,C A,B,C ACC-Hex XELKAIAQEFKAIAKEFKAIAWEFKAIAQKX 31 T 3.9 DUF5320 pdbhh F T 5eot 3 C C Peptide G13E GLLPELPAVGG 11 T 3.5 Fapy_DNA_glyco pdbhh F T 5ep6 2 C,D B,D TBK1_HUMAN NF-KAPPA-B-ACTIVATING KINASE,T2K,TANK-BINDING KINASE 1 SGSGSYPSSNTLVEMTLGMKKLKEEMEGVVKELAENNHILERFGSLTMDGGLRNVDCL 58 T 0.0016 DUF713 unp F Eukaryota T 5epj 2 B B peptide-like inhibitor UNC3866 XFALXX 6 T 800 zf-C2H2_11 pdbhh F F 5epk 2 B B unc3866 XFALXX 6 T 800 zf-C2H2_11 pdbhh F F 5epl 2 C,D C,D unc3866 XFALXX 6 T 800 zf-C2H2_11 pdbhh F F 5epp 2 B B RHBL4_HUMAN RRP4,RHOMBOID DOMAIN-CONTAINING PROTEIN 1,RHOMBOID-LIKE PROTEIN 4 SPEEMRRQRLHRFDS 15 T 0.71 SUIM_assoc pdbhh F Eukaryota T 5eq0 2 B B unc3866 XFALXX 6 T 800 zf-C2H2_11 pdbhh F F 5eqw 1 A,B,C,D,E A,B,C,D,E A0A125SJ78_9VIRU Putative major coat protein MHHHHHHMAKYEATKGDYAGGVLAILTQYFNNMVGYPEVSLKLAGEEANMSREGMINQKEIVHQMVETIRRASEPIRQGRGFHDAYVYFASVPENAPPNSIALPPQAQSEVQAKLTELMQKLANRNPQGVAEEEQELATQGI 142 T 0.08 KfrA_N pdbpercent T Viruses T 5esq 3 E,F E,F Cyclic beta-alanine-linked meditope XQFDLSTRRLKX 12 T 10 AAA_lid_8 pdbhh F T 5et0 2 B,D B,D MYO3B_MOUSE Myosin-IIIb SQRKPRKLGQIKVLDGEDQYYKCLSPGACAPEETHSVHPFFFSSSPREDPFAQH 54 T 0.078 VHL pdb F Eukaryota T 5et1 2 C,D C,D MYO3B_MOUSE Myosin-IIIb QKQRAPRRRCQQPKMLSSPEDTMYYNQLNGTLEYQG 36 T 15 Tho2 pdbhh F Eukaryota T 5eta 2 B,D D,C B6KJB6_TOXGV Putative transmembrane protein GLLERRGVSELPPLYI 16 T 1.8 ComFB pdbhh F Eukaryota T 5etf 2 B B MP2K6_HUMAN MAPKK 6,MAPK/ERK KINASE 6,MEK 6,STRESS-ACTIVATED PROTEIN KINASE KINASE 3,SAPKK3 SKGKKRNPGLKIPKA 15 T 3.4 GHL15 pdbhh F Eukaryota T 5etu 3 E,F E,F L5E meditope variant CQFDESTRRLKC 12 T 7.7 YsaB pdbhh F T 5eu8 2 B B N-[(5-METHYLISOXAZOL-3-YL)CARBONYL]ALANYL-L-VALYL-N~1~-((1R,2Z)-4-(BENZYLOXY)-4-OXO-1-{[(3R)-2-OXOPYRROLIDIN-3-YL]METHYL}BUT-2-ENYL)-L-LEUCINAMIDE XAVLXX 6 T 1700 zf-H2C2_5 pdbhh F F 5euk 3 E,F E,F F3H meditope CQHDLSTRRLKC 12 T 11 DUF3637 pdbhh F T 5eur 1 A,B,C A,B,C SF216 MSISYRKLDIALSADKETVLVFGQELSTKYFTEIVVTTMLNSTGSDMANSNRILNDIHAAGLDAGDYGKYSRWWAQSNAQERQEAERRRKEAKAHQERMAAIHATPEEIAKAVAERKAREEALIKRFGNKGAAFGL 136 T 0.019 Topoisom_I_N pdb F T 5ev0 2 C,D C,D PRO-PRO-PRO-PRO-PRO-PRO-PRO-PRO-PRO PPPPPPPPP 9 T 29 Adeno_E3_14_5 pdbhh F F 5eve 2 B B Poly-Proline peptide PPPPPPPPPP 10 T 23 IL11 pdbhh F F 5evf 1 A A A0Q625_FRATN Francisella virulence factor GSHMETKGVYLPKYSAELPPTDPSQVRVYNLQYQSDTQGNIGQVRTSTHVSNEKDFQKLCDKNLKEAIKLAAQHGAHEIKYICLYPEGQINELSSVQLRGYAFRD 105 T 0.078 DUF2757 pdb F Bacteria T 5evg 1 A A A0Q625_FRATN Francisella virulence factor GSHMETKGVYLPKYSAELPPTDPSQVRVYNLQYQSDTQGNIGQVRTSTHVSNEKDFQKLCDKNLKEAIKLAAQHGAHEIKYICLYPEGQINELSSVQLRGYAFRD 105 T 0.078 DUF2757 pdb F Bacteria T 5ewz 2 C C GAB2_HUMAN GRB2-ASSOCIATED BINDER 2,GROWTH FACTOR RECEPTOR BOUND PROTEIN 2-ASSOCIATED PROTEIN 2,PP100 PRRNTLPAM 9 T 1.5 PKI pdbhh F Eukaryota T 5ewz 3 D D GAB2_HUMAN GRB2-ASSOCIATED BINDER 2,GROWTH FACTOR RECEPTOR BOUND PROTEIN 2-ASSOCIATED PROTEIN 2,PP100 RSASFS 6 T 100 DUF1815 pdbhh F Eukaryota F 5ex0 2 B D M3K2_HUMAN MAP3K2 peptide EKFGKGGTYP 10 T 12 Cyanate_lyase pdbhh F Eukaryota T 5ex8 1 A A Q8KLL7_STRTO STAF MHHHHHHGKPIPNPLLGLDSTENLYFQGIDPFTMFEEINVVRASQLHRRDRFDPVPELHSLMKEGGLTVLGTEDSTEGRTAWLATGIDEVRQVLGSDKFSARLLYGGTAAGITWPGFLTQYDPPEHTRLRRMVVPAFSHRRMQKFRPRVEQIVQDSLDTIESLGGPVDFVPHFGWAIATPATCDFLGIPRDDQADLARILLASRTDRSDKRRTAAGNKFMTYMKQHVAQSRRGSGDDLFGIVGRENGDAITDAELTGVAAFVMGAAADQVARLLAAGAWLMVEQPAQFALLREKPETVPEWLDETMRYLTTDEKTHPRVATQDVRIGNQLVKAGDTVTCSLLAANRPNYPSAEDEFDITREKAEHLAFGHGIHHCLGRAMAELMFKVSIPALAHRFPTLRLADPQREITLGPPPFDVEALLLDW 424 T 8.8E-29 p450 pdbpercent F Bacteria T 5ex9 1 A A Q8KLL7_STRTO STAF MHHHHHHGKPIPNPLLGLDSTENLYFQGIDPFTMFEEINVVRASQLHRRDRFDPVPELHSLMKEGGLTVLGTEDSTEGRTAWLATGIDEVRQVLGSDKFSARLLYGGTAAGITWPGFLTQYDPPEHTRLRRMVVPAFSHRRMQKFRPRVEQIVQDSLDTIESLGGPVDFVPHFGWAIATPATCDFLGIPRDDQADLARILLASRTDRSDKRRTAAGNKFMTYMKQHVAQSRRGSGDDLFGIVGRENGDAITDAELTGVAAFVMGAAADQVARLLAAGAWLMVEQPAQFALLREKPETVPEWLDETMRYLTTDEKTHPRVATQDVRIGNQLVKAGDTVTCSLLAANRPNYPSAEDEFDITREKAEHLAFGHGIHHCLGRAMAELMFKVSIPALAHRFPTLRLADPQREITLGPPPFDVEALLLDW 424 T 8.8E-29 p450 pdbpercent F Bacteria T 5exa 2 C,D C,D GAB2_HUMAN GRB2-ASSOCIATED BINDER 2,GROWTH FACTOR RECEPTOR BOUND PROTEIN 2-ASSOCIATED PROTEIN 2,PP100 PRRNTLPAMDQ 11 T 1.7 PKI pdbhh F Eukaryota T 5eyz 2 E,F,G,H E,F,G,H CYTO8-RETEV SWESHKSGRETEV 13 T 8.1 Svs_4_5_6 pdbhh F T 5ez0 2 E,F,G,H E,F,G,H B5MDL5_HUMAN MITOGEN-ACTIVATED PROTEIN KINASE 12,ISOFORM CRA_A SWARVSKETPL 11 T 0.51 CdhC pdbhh F Eukaryota T 5ez8 1 A,B,C,D,E,F,G A,B,C,D,E,F,G CC-Hept-I-C-I XGEIAQALKEIAKALKEIAWACKEIAQALKG 31 T 0.028 MCPsignal pdbpssm F T 5ez9 1 A,B,C,D,E,F,G A,B,C,D,E,F,G CC-Hept-L22H XGEIAKALREIAKALREIAWAHREIAKALRG 31 T 0.036 WXG100 pdbpssm F T 5eza 1 A,B,C,D,E,F,G A,B,C,D,E,F,G CC-Hept-C-H-I XGEIAKALREIAKALRECAWAHREIAKALRG 31 T 6.4 RecR pdbhh F T 5ezc 1 A,B,C,D,E,F,G A,B,C,D,E,F,G CC-Hept-C-H-E XGEIAKALREIAKALRECAWAHREEAKALRG 31 T 2.8 EDS1_EP pdbhh F T 5eze 1 A,B,C,D,E,F,G A,B,C,D,E,F,G CC-Hept-bMeCys-His-Glu XGEIAKALREIAKALREXAWAHREEAKALRG 31 T 2.8 EDS1_EP pdbhh F T 5f0l 4 D D NRAM2_HUMAN NRAMP 2,DIVALENT CATION TRANSPORTER 1,DIVALENT METAL TRANSPORTER 1,DMT-1,SOLUTE CARRIER FAMILY 11 MEMBER 2 HLGLTAQPELYLLNTMDADSLVSR 24 T 17 CagA pdbhh F Eukaryota T 5f0m 4 D D NRAM2_HUMAN NRAMP 2,DIVALENT CATION TRANSPORTER 1,DIVALENT METAL TRANSPORTER 1,DMT-1,SOLUTE CARRIER FAMILY 11 MEMBER 2 TAQPELYLLNTMSHHHHH 18 T 220 DUF1143 pdbhh F Eukaryota T 5f0o 2 B E C5DNF8_LACTC KLTH0G16610p LTNPSQYLLQDAVTEREVLLVP 22 T 14 AglB_L1 pdbhh F Eukaryota T 5f0p 4 D D NRAM2_HUMAN NRAMP 2,DIVALENT CATION TRANSPORTER 1,DIVALENT METAL TRANSPORTER 1,DMT-1,SOLUTE CARRIER FAMILY 11 MEMBER 2, DIVALENT CATION TRANSPORTER II TAQPELYLMNTMSHHHHH 18 T 220 DUF1143 pdbhh F Eukaryota T 5f1i 3 C,F,I,L,O,R,U,X C,F,I,L,O,R,U,X 9-mer peptide KLFSGELTK 9 T 6.4 DUF5823 pdbhh F T 5f1t 1 A,B,C,D,E,F A,B,C,D,E,F Macrocyclic peptide XAVLXVGSXVHGXATV 16 T 2.3 DUF6001 pdbhh F T 5f1w 1 A,B,C,D,E,F A,B,C,D,E,F Macrocyclic peptide XXXXXXGXXXXGXXXX 16 F F F 5f2u 2 C,D C,D Phosphatidylinositol 4,5-bisphosphate 5-phosphatase, s-farnesyl-l-cysteine methyl ester SSTIC 5 T 66 Gp_UL130 pdbhh F F 5f2y 1 A,B,C,D,E,F,G A,B,C,D,E,F,G CC-Hept-hCys-H-E XGEIAKALREIAKALREXAWAHREEAKALRG 31 T 7.8 Amphi-Trp pdbhh F T 5f3o 1 A,B A,B C4LU64_ENTHI EHRNASEIII MSSTTLHNAMQYTAFDVLSSILNLMKADPLYDLLQLNQAYSSQDQEYEKNEFYGDSYLEERASSLVLKFLRKYEQIPFEMYSGLRIHTVKNQTLGEIFDLLHLGDTKTFEKKKKGDLVESLIGGCVLLSQRENATLFLLFAHALIDYIFYHSSYIYFNANPPKLVKEEIITDIQNWFKDKLFYYRSSLEKYQTDP 195 T 0.025 Ribonuclease_3 pdbpssm F Eukaryota T 5f3p 1 A,B A,B C4LU64_ENTHI EHRNASEIII MSSTTLHNAMQYTAFDVLSSILNLMKADPLYDLLQLNQAYSSQDQEYEKNEFYGDSYLEERASSLVLKFLRKYEQIPFEMYSGLRIHTVKNQTLGEIFDLLHLGDTKTFEKKKKGDLVESLIGGCVLLSQRENATLFLLFAHALIDYIFYHSSYIYFNANPPKLVKEEIITDIQNWFKDKLFYYRSSLEKYQTDP 195 T 0.025 Ribonuclease_3 pdbpssm F Eukaryota T 5f3q 1 A,B A,B C4LU64_ENTHI EH.RNASEIII SSSTTLHNAMQYTAFDVLSSILNLMKADPLYDLLQLNQAYSSQDQEYEKNEFYGDSYLEERASSLVLKFLRKYEQIPFEMYSGLRIHTVKNQTLGEIFDLLHLGDTKTFEKKKKGDLVESLIGGCVLLSQRENATLFLLFAHALIDYIFYHSSYIYFNANPPKLVKEEIITDIQNWFKDKLFYYRSSLEKYQT 193 T 0.025 Ribonuclease_3 pdbpssm F Eukaryota T 5f3y 2 B B ANS4B_MOUSE ANKS4B GSVEEDDDVQHESILNRPGLGSIVFSRNRVLDFEDISDSKRELGFKMPSELFQRQGAAGTVEEEEEEEEEEEEEKREANGTAGDLPWDEEEVEWEEDAVDAT 102 T 0.37 PBP_sp32 pdb F Eukaryota T 5f56 3 C B G9I562_DEIRD ALA-ASP-LEU-PRO-PHE ADLPF 5 T 42 CofC pdbhh F Bacteria F 5f5b 2 B B peptidic derivative of Gurken: ACE-VAL-ARG-MET-ALA-aldehyde XVRMX 5 T 0.0014 DUF3844 unphh F F 5f5g 2 B B ACE-ARG-MET-ALA-aldehyde XRMX 4 T 830 zf-met pdbhh F F 5f5j 2 B B GRK_DROME peptidic derivative of Gurken: ACE-VAL-ARG-MET-ALA-aldehyde XVRMX 5 T 0.0014 DUF3844 unphh F Eukaryota F 5f5k 2 B B GRK_DROME Peptidic derivative of Gurken: ACE-ARG-LYS-VAL-ARG-MET-ALA-aldehyde XRKVRMX 7 T 0.0014 DUF3844 unphh F Eukaryota F 5f67 2 B,D C,D TRP_DROME TRP C terminal Tail GPGSRGKSTVTGRMISGWL 19 T 0.019 Mur_ligase_M pdbhh F Eukaryota T 5f6k 3 D,F D,F RBBP5_HUMAN RBBP-5,RETINOBLASTOMA-BINDING PROTEIN RBQ-3 SAFAPDFKELDENVEYEERESEFDIED 27 T 0.014 DUF2457 unppercent F Eukaryota T 5f6l 1 A J RBBP5_HUMAN RBBP-5,RETINOBLASTOMA-BINDING PROTEIN RBQ-3 SAFAPDFKELDENVEYEERESEFDIED 27 T 0.014 DUF2457 unppercent F Eukaryota T 5f74 2 B B MLXPL_RAT CHREBP,CLASS D BASIC HELIX-LOOP-HELIX PROTEIN 14,BHLHD14,MLX INTERACTOR,MLX-INTERACTING PROTEIN-LIKE,WS BASIC-HELIX-LOOP-HELIX LEUCINE ZIPPER PROTEIN,WS-BHLH,WILLIAMS-BEUREN SYNDROME CHROMOSOMAL REGION 14 PROTEIN MARALADLSVNLQVPRVVPSPDSDSDTDLEDPSPRRSAGGLHRSQVIHSGHFMVSSPHSDSLTRRRDQEGPVGLADFGPRSIDPTLTRLFECLSLAYSGKLVSPKWKNFKGLKLLCRDKIRLNNAIWRAWYIQYVQRRKSPVCGFVTPLQGSEADEHRKPEAVVLEGNYWKRRIEVVMREYHKWRIYYKKRLRKSS 196 T 0.17 DUF1752 pdbhh F Eukaryota T 5f7d 3 C C Peptide G11N GLKEGIPALD 10 T 14 ATPase pdbhh F T 5f88 3 E,F E,F L5Y meditope CQFDYSTRRLKC 12 T 10 Flavi_NS1 pdbhh F T 5f8k 55 CB,FD 1y,2y CTHL3_BOVIN BACTENECIN-7,BAC7,PR-59 RRIRPRPPRLPRPRPR 16 T 0.027 TonB_N unppercent F Eukaryota F 5f8t 2 B P CYS-PRO-LYS-ARG-PHE-M70-ALA-LEU-PHE-CYS CPKRFAALFC 10 T 1.1 DUF4395 pdbhh F T 5f8x 2 B B CYS-PRO-ALA-ARG-PHE-M70-ALA-LEU-TRP-CYS CPARFAALWC 10 T 1.5 BTRD1 pdbhh F T 5f8z 2 B B CYS-PRO-ALA-ARG-PHE-M70-ALA-LEU-PHE-CYS CPARFAALFC 10 T 2.3 NUC153 pdbhh F T 5f9j 3 C C Peptide Y9L YLSPIASPL 9 T 3.3 Fe_hyd_lg_C pdbhh F T 5fa3 3 C C G9V GLLPELPAV 9 T 2 Fapy_DNA_glyco pdbhh F T 5fa4 3 C C Peptide Y16R YLSPIASPLLD 11 T 6.6 Fe_hyd_lg_C pdbhh F T 5fa5 3 C C H4_HUMAN Histone H4 SGRGKGGKGLGKGGAKRHRK 20 T 11 Shadoo unppercent F Eukaryota T 5fby 2 B B cleaved peptide MGSSHHHHHHSQLEVLFQGPLGSGRP 26 T 280 Glyco_hyd_65N_2 pdbhh F T 5fbz 3 E E Autoproteolytic fragment of enzyme subtilase SubHal KPSLL 5 T 110 SDA1 pdbhh F F 5fc2 1 A A pAMK, peptide containing a phospho-serine DKSIEVGRX 9 T 11 DUF2244 pdbhh F T 5fc3 1 A A pAMK peptide DKSIEVGRX 9 T 11 DUF2244 pdbhh F T 5fcd 1 A D UNK-UNK-UNK-MSE-UNK XXXMX 5 T 1900 PPR_1 pdbhh F F 5fcd 2 B E UNK-UNK-UNK-UNK-MSE-UNK XXXXMX 6 T 2200 PPR_1 pdbhh F F 5fcf 2 C C GLY-GLY-GLY GGG 3 T 79 FTCD_C pdbhh F F 5fch 2 C C GLY-GLY-GLY GGG 3 T 79 FTCD_C pdbhh F F 5fci 81 XD m2 60S ribosomal protein L12-A (uL11) XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 150 F F F 5fci 83 EF p1 60S ribosomal protein P1 alpha XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 47 F F F 5fci 84 FF p2 60S ribosomal protein P2 beta XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 46 F F F 5fcj 80 XD m2 60S ribosomal protein L12-A (uL11) XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 150 F F F 5fcj 82 EF p1 60S ribosomal protein P1 alpha XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 47 F F F 5fcj 83 FF p2 60S ribosomal protein P2 beta XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 46 F F F 5fcm 1 A,B,C,D A,B,C,D A8ID55_CHLRE Basal body protein GSMAIDVDRTLAVLRRKLEALGYSDPLEPASLQLVQKLVEDLVHTTDSYTAVKQQCAKQAQEIAAFDTRLES 72 T 0.00048 ADIP unphh F Eukaryota T 5fdu 54 BB,DD 1y,2y Metalnikowin I VDKPDYRPRP 10 T 5.7 NinD pdbhh F T 5fdv 54 CB,DD 1y,2y PYRRH_PYRAP Pyrrhocoricin VDKGSYLPRPTPPRPI 16 T 1.4 Apidaecin pdbhh F Eukaryota T 5fdw 3 C C Peptide Y10L YLSPIASPLL 10 T 4.9 Fe_hyd_lg_C pdbhh F T 5ff6 3 E,F E,F L10Q meditope CQFDLSTRRQKC 12 T 7.5 DUF1254 pdbhh F T 5ffw 2 C C H4_HUMAN Histone H4 SGRGXGGXGL 10 T 4.7 G3P_acyltransf pdbhh F Eukaryota F 5fg0 1 A,B A,B LTN1_YEAST RING DOMAIN MUTANT KILLED BY RTF1 DELETION PROTEIN 1 SLNTDLGLGHNGVRISLNYFDGLPDPSLLNSLYSNELKLIFKSLLKRDETTKEKALMDLSNLISDFNQNEYFFNDIFLLCWSQIYAKLIISDYKVIRLQSHQITIMLVKSLRKKISKFLKDFIPLILLGTCELDYSVSKPSLNELTECFNKDPAKINALWAVFQEQLLNLVKEIVVNENEDTISDERYSSKEESEFRYHRVIASAVLLLIKLFVHNKDVSERNSSSLKVILSDESIWKLLNLKNGQNTNAYETVLRLIDVLYTRGYMPSHKNIMKLAVKKLLKSLTHITSKNILKVCPVLPSILNLLATLDDYEDGTIWSYDKSSKEKVLKFLSVSRTSPSPGFFNAVFALYSSTKRHSFLDYYLEWLPFWQKSVQRLNEKGFSARNSAEVLNEFWTNFLKFAEDSSEERVKKM 414 T 0.0085 CLASP_N pdbhh F Eukaryota T 5fg1 1 A A LTN1_YEAST RING DOMAIN MUTANT KILLED BY RTF1 DELETION PROTEIN 1 SLNTDLGLGHNGVRISLNYFDGLPDPSLLNSLYSNELKLIFKSLLKRDETTKEKALMDLSNLISDFNQNEYFFNDIFLLCWSQIYAKLIISDYKVIRLQSHQITIMLVKSLRKKISKFLKDFIPLILLGTCELDYSVSKPSLNELTECFNKDPAKINALWAVFQEQLLNLVKEIVVNENEDTISDERYSSKEESEFRYHRVIASAVLLLIKLFVHNKDVSERNSSSLKVILSDESIWKLLNLKNGQNTNAYETVLRLIDVLYTRGYMPSHKNIMKLAVKKLLKSLTHITSKNILKVCPVLPSILNLLATLDDYEDGTIWSYDKSSKEKVLKFLSVSRTSPSPGFFNAVFALYSSTKRHSFLDYYLEWLPFWQKSVQRLNEKGFSARNSAEVLNEFWTNFLKFAEDSSEERVKKM 414 T 0.0085 CLASP_N pdbhh F Eukaryota T 5fg8 2 B B KCNAE_DROME ETHER-A-GO-GO PROTEIN GVLPKAPKLQASQATLARQDTIDEGGEVDSSPPSRDSRVVIEGAAVSSATVGPS 54 T 30 DUF4491 pdbhh F Eukaryota T 5fj5 1 A,B A,B P1_BPPH6 P1 PROTEIN FROM BACTERIOPHAGE PHI6 GFNLKVKDLNGSARGLTQAFAIGELKNQLSVGALQLPLQFTRTFSASMTSELLWEVGKGNIDPVMYARLFFQYAQAGGALSVDELVNQFTEYHQSTACNPEIWRKLTAYITGSSNRAIKADAVGKVPPTAILEQLRTLAPSEHELFHHITTDFVCHVLSPLGFILPDAAYVYRVGRTATYPNFYALVDCVRASDLRRMLTALSSVDSKMLQATFKAKGALAPALISQHLANAATTAFERSRGNFDANAVVSSVLTILGRLWSPSTPKELDPSARLRNTNGIDQLRSNLALFIAYQDMVKQRGRAEVIFSDEELSSTIIPWFIEAMSEVSPFKLRPINETTSYIGQTSAIDHMGQPSHVVVYEDWQFAKEITAFTPVKLANNSNQRFLDVEPGISDRMSATLAPIGNTFAVSAFVKNRTAVYEAVSQRGTVNSNGAEMTLGFPSVVERDYALDRDPMVAIAALRTGIVDESLEARASNDLKRSMFNYYAAVMHYAVAHNPEVVVSEHQGVAAEQGSLYLVWNVRTELRIPVGYNAIEGGSIRTPEPLEAIAYNKPIQPSEVLQAKVLDLANHTTSIHIWPWHEASTEFAYEDAYSVTIRNKRYTAEVKEFELLGLGQRRERVRILKPTVAHAIIQMWYSWFVEDDRTLAAARRTSRDDAEKLAIDGRRMQNAVTLLRKIEMIGTTGIGASAVHLAQSRIVDQMAGRGLIDDSSDLHVGINRHRIRIWAGLAVLQMMGLLSRSEAEALTKVLGDSNALGMVVA 761 T 0.22 STAG pdb T Viruses T 5fj7 1 A,B A,B P1_BPPH6 P1 PROTEIN FROM BACTERIOPHAGE PHI6 GFNLKVKDLNGSARGLTQAFAIGELKNQLSVGALQLPLQFTRTFSASMTSELLWEVGKGNIDPVMYARLFFQYAQAGGALSVDELVNQFTEYHQSTACNPEIWRKLTAYITGSSNRAIKADAVGKVPPTAILEQLRTLAPSEHELFHHITTDFVCHVLSPLGFILPDAAYVYRVGRTATYPNFYALVDCVRASDLRRMLTALSSVDSKMLQATFKAKGALAPALISQHLANAATTAFERSRGNFDANAVVSSVLTILGRLWSPSTPKELDPSARLRNTNGIDQLRSNLALFIAYQDMVKQRGRAEVIFSDEELSSTIIPWFIEAMSEVSPFKLRPINETTSYIGQTSAIDHMGQPSHVVVYEDWQFAKEITAFTPVKLANNSNQRFLDVEPGISDRMSATLAPIGNTFAVSAFVKNRTAVYEAVSQRGTVNSNGAEMTLGFPSVVERDYALDRDPMVAIAALRTGIVDESLEARASNDLKRSMFNYYAAVMHYAVAHNPEVVVSEHQGVAAEQGSLYLVWNVRTELRIPVGYNAIEGGSIRTPEPLEAIAYNKPIQPSEVLQAKVLDLANHTTSIHIWPWHEASTEFAYEDAYSVTIRNKRYTAEVKEFELLGLGQRRERVRILKPTVAHAIIQMWYSWFVEDDRTLAAARRTSRDDAEKLAIDGRRMQNAVTLLRKIEMIGTTGIGASAVHLAQSRIVDQMAGRGLIDDSSDLHVGINRHRIRIWAGLAVLQMMGLLSRSEAEALTKVLGDSNALGMVVA 761 T 0.22 STAG pdb T Viruses T 5fjx 2 D,E D,E GCS1_YEAST ARF GAP GCS1 DEDKWDDF 8 T 0.34 BLM_N pdbhh F Eukaryota F 5fjy 2 D,E,F D,E,F UNKNOWN PEPTIDE XXXXX 5 F F F 5fjz 2 E,F,G,H P,Q,R,S DSL1_YEAST DEPENDENT ON SLY1-20 PROTEIN 1 DDWNWEVED 9 T 0.0042 COPI_C unp F Eukaryota F 5fkp 3 C C P99 YEHDFHHIREWGNHWKNFLAVM 22 T 3.2 Xpo1 pdbhh F T 5fl2 2 B K RAVA_ECOLI REGULATORY ATPASE VARIANT A, REGULATORY ATPASE VARIANT A, R AVA ATPASE DKTALTVIRLGGIFSRRQQYQLPVNVTASTLTLLLQKPLKLHDMEVVHISFERSALEQWLSKGGEIRGKLNGIGFAQKLNLEVDSAQHLVVRDVSLQGSTLALPGS 106 T 0.17 HTH_17 unppssm F Bacteria T 5fl7 5 I I ATP SYNTHASE EPSILON CHAIN, MITOCHONDRIAL XXXXXXXXXXXXXXXX 16 F F F 5flc 1 A,C 1,3 FK506-BINDING PROTEIN 12-RAPAMYCIN COMPLEX-ASSOCIATED PROTEIN 1, FKBP12-RAPAMYCIN COMPLEX-ASSOCIATED PROTEIN, MAMMALIAN TARGET OF RAPAMYCIN, MTOR, MECHANISTIC TARGET OF RAPAMYCIN, RAPAMYCIN AND FKBP12 TARGET 1, RAPAMYCIN TARGET PROTEIN 1 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 615 F F F 5flc 2 B,D 2,4 FK506-BINDING PROTEIN 12-RAPAMYCIN COMPLEX-ASSOCIATED PROTEIN 1, FKBP12-RAPAMYCIN COMPLEX-ASSOCIATED PROTEIN, MAMMALIAN TARGET OF RAPAMYCIN, MTOR, MECHANISTIC TARGET OF RAPAMYCIN, RAPAMYCIN AND FKBP12 TARGET 1, RAPAMYCIN TARGET PROTEIN 1 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 365 F F F 5flc 3 E,I A,E RAPTOR, P150 TARGET OF RAPAMYCIN (TOR)-SCAFFOLD PROTEIN XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 1029 F F F 5flc 5 G,K C,G FKBP XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 107 F F F 5flz 4 E,F E,F EXTRAGENIC SUPPRESSOR OF CMD1-1 MUTANT PROTEIN 1, NUCLEAR FILAMENT-RELATED PROTEIN 1, SPINDLE POLE BODY SPACER PROTEIN SPC110, SPC110 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 44 F F F 5fm1 4 E,F E,F EXTRAGENIC SUPPRESSOR OF CMD1-1 MUTANT PROTEIN 1, NUCLEAR FILAMENT-RELATED PROTEIN 1, SPINDLE POLE BODY SPACER PROTEIN SPC110, SPC110 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 44 F F F 5fn3 5 E G POLY ALA CHAIN AAAAAAAAAAAAAAAAAAAAAAAA 24 T 670 DUF4699 pdbhh F F 5fn4 5 E G POLY ALA CHAIN AAAAAAAAAAAAAAAAAAAAAAAAA 25 T 790 DUF4699 pdbhh F F 5fot 2 B C FHTU PEPTIDE XFHTXX 6 T 290 Archease pdbhh F F 5fou 2 B C FHPA PEPTIDE XFHPAX 6 T 61 GATase_3 pdbhh F F 5fov 2 C,D E,F FHTG PEPTIDE XFHTGX 6 T 25 DUF4718 pdbhh F F 5fow 2 C,D E,F WHTA PEPTIDE XWHTAX 6 T 140 Peptidase_C97 pdbhh F F 5fox 2 B C FHAA PEPTIDE XFHAAX 6 T 340 DUF2520 pdbhh F F 5fp2 2 C,D X,Z IRON TRANSPORT OUTER MEMBRANE RECEPTOR XXXXXXXXXX 10 F F F 5fpk 2 B C FHTG PEPTIDE XFATAX 6 T 530 Inhibitor_I48 pdbhh F F 5fpx 2 C,D E,F PEPTIDE GSSHHHHH 8 T 6800 zf_CCCH_4 pdbhh F F 5fq0 3 E H RESIDUAL CLEAVED HIS TAG HHHHHH 6 T 6100 zf_CCCH_4 pdbhh F F 5fq6 4 G,K,O,P G,K,O,P BT_2261 GGGGGGGGGG 10 T 22 API5 pdbhh F F 5fq7 5 I,J I,P PEPTIDE GGGGGGGGGG 10 T 22 API5 pdbhh F F 5fq8 5 H P UNCHARACTERISED PROTEIN, BOUND PEPTIDE GGGGGGGGGG 10 T 22 API5 pdbhh F F 5fq8 6 I Q UNCHARACTERISED PROTEIN, BOUND PEPTIDE GGGGGGGGG 9 T 26 DUF444 pdbhh F F 5frp 2 C,D C,D SCC1_YEAST MCD1-LIKE PROTEIN RLNTVTRVHQLMLEDAVTEREVLVTPGLEFLDDTTIPVGLMAQE 44 T 4.2 EF-hand_13 pdbhh F Eukaryota T 5frq 2 E,F G,L DNLJ_HELPY DNA LIGASE QEFIRSLF 8 T 1.2 IFRD_C pdbhh F Bacteria T 5frs 2 B C SCC1_YEAST SCC1 LMMEDAVTEREVLVTPG 17 T 0.18 RPW8 unp F Eukaryota T 5fs4 1 A,B A,B Q9AZ42_9VIRU AP205 BACTERIOPHAGE COAT PROTEIN GSMANKPMQPITSTANKIVWSDPTRLSTTFSASLLRQRVKVGIAELNNVSGQYVSVYKRPAPKPEGGADAGVIMPNENQSIRTVISGSAENLATLKAEWETHKRNVDTLFASGNAGLGFLDPTAAIVSSDTTA 133 T 15 Packaging_FI unphh T Viruses T 5ft1 2 C,D,F,H,J,L C,D,F,H,J,L Q9HZM8_PSEAE RNASE E ERPRRRSRGQRRRSNRRERQ 20 T 26 Tristanin_u2 pdbhh F Bacteria T 5fu2 1 A,B A,B A0A1A9TAF4_RUMFL CBM74-RFGH5 MGAEEEDTAILYPFTISGNDRNGNFTINFKGTPNSTNNGCIGYSYNGDWEKIEWEGSCDGNGNLVVEVPMSKIPAGVTSGEIQIWWHSGDLKMTDYKALEHHHHHH 106 T 4.3 DUF5766 pdbhh F Bacteria T 5fu3 1 A,B A,B A0A1A9TAF4_RUMFL CBM74-RFGH5 MGAEEEDTAILYPFTISGNDRNGNFTINFKGTPNSTNNGCIGYSYNGDWEKIEWEGSCDGNGNLVVEVPMSKIPAGVTSGEIQIWWHSGDLKMTDYKALEHHHHHH 106 T 4.3 DUF5766 pdbhh F Bacteria T 5fu7 4 D,H D,H A0A0B4KGY5_DROME NANOS, ISOFORM B GPHMLESHQQTDEIARSLKIFAQVTTGAAENAAGSMQDVMQEFATNGYASDDLG 54 T 0.053 Tape_meas_lam_C pdb F Eukaryota T 5fw5 2 C C POLN_SFV NON-STRUCTURAL PROTEIN 3 LTFGDFDEHEVDALASGITFGDFDD 25 T 0.21 DUF5102 pdbhh T Viruses T 5fwe 2 C,D C,D H4_HUMAN SYNTHETIC PEPTIDE SGXGKGGKGLGKGGA 15 T 11 Shadoo unppercent F Eukaryota T 5fxc 2 B P GLYCOPEPTIDE APDTRP 6 T 170 DDE_Tnp_1_assoc pdbhh F F 5fzt 2 B B RHG07_HUMAN DELETED IN LIVER CANCER 1 PROTEIN, DLC-1, HP PROTEIN, RHO-TYPE GTPASE-ACTIVATING PROTEIN 7, START DOMAIN-CONTAINING PROTEIN 12, STARD12, STAR-RELATED LIPID TRANSFER PROTEIN 12, DLC1 PELDDILYHVKGMQRIVNQWSEK 23 T 4.7 Leptin pdbhh F Eukaryota T 5fzv 1 A A A0A0A0V662_9ARAC Venom peptide U3-SYTX-Sth1a GLIESIACIQKGLPCMEHSDCCRGVCEALFCQ 32 T 9.9E-05 Toxin_18 pdbhh F Eukaryota T 5fzw 1 A A A0A0A0VBR5_9ARAC Venom peptide U3-SYTX-Sth1h GLIESIACMQKGLPCMEHVDCCHGVCDSLFCLY 33 T 0.00011 Toxin_18 pdbhh F Eukaryota T 5fzx 1 A A A0A0A0V633_9ARAC U5-SCYTOTOXIN-STH1A DETPDECVTRGNFCATPEVHGDWCCGSLKCVSNSCR 36 T 0.00036 Conotoxin unphh F Eukaryota T 5g04 15 R S HSL1_YEAST HSL1 QNSASKRSLYSLQSISKRSLNLNDLLVFDDPLPSKKPASENVNKSEPHSLESDSDFEILCDQILFGNALDRILEEEEDNEKERDTQRQRQNDTKSSADTFTISGVSTNKENEGPEYPTKIEKNQFNMSYKPSENMSGLSSFPIFEKENTLSSSYLEEQKPKRAALSDITNSFNKMNKQEGMRIEKKIQREQLQKKNDRPSPLKPIQ 206 T 0.068 CANIN pdb F Eukaryota T 5g05 14 Q T UNIDENTIFIED PEPTIDE AAAAAQLAAAAAAAA 15 T 55 DUF4699 pdbhh F F 5g4c 2 C,D E,F SIRT2 RAAXT 5 T 290 YkyB pdbhh F F 5g50 1 A,B A,B C3LTH7_VIBCM RBMA MGSSHHHHHHSSGLVPRGSHMEVDCELQPVIEANLSLNQNQLASNGGYISSQLGIRNESCETVKFKYWLSIKGPEGIYFPAKAVVGVDTAQQESDALTDGRMLNVTRGFWVPEYMADGKYTVSLQVVAENGKVFKANQEFVKGVDLNSLPELNGLTIDIKNQFGINSVESTGGFVPFTVDLNNGREGEANVEFWMTAVGPDGLIIPVNAREKWVIASGDTYSKVRGINFDKSYPAGEYTINAQVVDIVSGERVEQSMTVVKK 262 T 0.015 BsuPI pdbhh F Bacteria T 5g51 1 A A Q8B3M2_9VIRU DWV-VP3-P-DOMAIN EEYRAKTGYAPYYAGVWHSFNNSNSLVFRWGSASDQIAQWPTISVPRGELAFLRIKDGKQAAVGTQPWRTMVVWPSGHGYNIGIPTYNAERARQLAQHLYGGGSLTDEKAKQLFVPANQQGPGKVSNGNPVWEVMRAPLATQRAHIQDFEFIEAIPE 157 T 9.2 GatD_N pdbhh T Viruses T 5g5p 4 D,E,F D,E,F LEUCINE PERMEASE TRANSCRIPTIONAL REGULATOR, SAC3 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 805 F F F 5gad 36 JA k PPB_ECOLI 1A9L SS MKQSTLALLLLLLLLTPV 18 T 0.1 LPAM_1 pdb F Bacteria T 5gae 35 IA i SecG XXXXXXXXXXXXXXXXXXXXXXXXX 25 F F F 5gaf 36 JA k PPB_ECOLI 1A9L SS MKQSTLALLLLLLLLTPV 18 T 0.1 LPAM_1 pdb F Bacteria T 5gag 36 JA k PPB_ECOLI 1A9L SS MKQSTLALLLLLLLLTPV 18 T 0.1 LPAM_1 pdb F Bacteria T 5gah 36 JA k PPB_ECOLI 1A9L SS MKQSTLALLLLLLLLTPV 18 T 0.1 LPAM_1 pdb F Bacteria T 5gak 42 PA r ribosomal protein RPL1 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 210 F F F 5gak 49 WA z nascent polypeptide chain XXXXXXXXXXXXXXXXXXXXXXX 23 F F F 5gam 9 I x Unknown polypeptide XXXXXXXXXXXXXXXXXX 18 F F F 5gan 10 J x Unknown protein XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 100 F F F 5gap 4 D x unknown protein XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 82 F F F 5gaq 1 A,B,C,D,E,F,G,H,I A,B,C,D,E,F,G,H,I TXL_EISFE EFL1 MSAKAAEGYEQIEVDVVAVWKEGYVYENRGSTSVDQKITITKGMKNVNSETRTVTATHSIGSTISTGDAFEIGSVEVSYSHSHEESQVSMTETEVYESKVIEHTITIPPTSKFTRWQLNADVGGADIEYMYLIDEVTPIGGTQSIPQVITSRAKIIVGRQIILGKTEIRIKHAERKEYMTVVSRKSWPAATLGHSKLFKFVLYEDWGGFRIKTLNTMYSGYEYAYSSDQGGIYFDQGTDNPKQRWAINKSLPLRHGDVVTFMNKYFTRSGLCYDDGPATNVYCLDKREDKWILEVVGLVPRGSGHHHHHH 310 T 0.027 Toxin_10 unphh F Eukaryota T 5gch 1 A E CTRA_BOVIN GAMMA-CHYMOTRYPSIN A CGVPAIQPVLSGL 13 T 2 CaM_bind pdbhh F Eukaryota T 5gds 3 C I HIRUNORM V XVXTDXGXPESHXGGDYEEIPXXYXX 26 T 0.16 Hirudin pdbhh F T 5gg4 2 E,F,G,H E,F,G,H RN169_HUMAN RING FINGER PROTEIN 169 RGRKRHCKTKHLE 13 T 0.0061 BCOR pdbhh F Eukaryota T 5ggi 2 C,D F,G mannosyl-peptide XAAPTPVAAPX 11 T 21 Exonuc_VII_L pdbhh F F 5ggp 2 C,D C,D DAG1_HUMAN DYSTROPHIN-ASSOCIATED GLYCOPROTEIN 1 ATPTPVTAIG 10 T 1.3 NiFe_hyd_3_EhaA pdbhh F Eukaryota T 5ghr 2 B,D B,D Q5JF31_THEKO Putative uncharacterized protein MGSSHHHHHHSSGENLYFQGHMSKEVPKEAYIIQIDLPAVLGPDMKEYGPFMAGDMAIIPTVIGRALVEREAARRVRIFL 80 T 0.31 SSURE unppercent F Archaea T 5ghs 2 C,D C,D Q5JF31_THEKO Putative uncharacterized protein MGSSHHHHHHSSGENLYFQGHMSKEVPKEAYIIQIDLPAVLGPDMKEYGPFMAGDMAIIPTVIGRALVEREAARRVRIFL 80 T 0.31 SSURE unppercent F Archaea T 5gi0 1 A A MORF9_ARATH RNA EDITING-INTERACTING PROTEIN 9 MEQRETIMLPGSDYNHWLIVMEFPKDPAPSRDQMIDTYLNTLATVLGSMEEAKKNMYAFSTTTYTGFQCTIDEETSEKFKGLPGVLWVLPDSYIDVKNKDYGGDKYINGEIIPSTYPTYQPKQLEHHHHHHHH 133 T 0.27 Inhibitor_I9 pdbhh F Eukaryota T 5gic 2 B C MED1_HUMAN SRC1 NHPMLMNLLK 10 T 14 CoV_NSP8 pdbhh F Eukaryota T 5gid 2 B C MED1_HUMAN SRC1 NHPMLMNLL 9 T 14 EIIA-man pdbhh F Eukaryota F 5gie 2 B,D C,E MED1_HUMAN SRC1 NHPMLMNLLK 10 T 14 CoV_NSP8 pdbhh F Eukaryota T 5gij 2 B D CLE41_ARATH TRACHEARY ELEMENT DIFFERENTIATION INHIBITORY FACTOR-LIKE PROTEIN,TDIF-LIKE PROTEIN HEVPSGPNPISN 12 T 2.6 DUF502 pdbhh F Eukaryota T 5gim 3 C C Q6F3E8_AMBVA N-terminal peptide from Putative uncharacterized protein avahiru SGGHQTAVPK 10 T 4.2 Tsp45I pdbhh F Eukaryota T 5gim 4 D D Q6F3E8_AMBVA C-terminal peptide from Putative uncharacterized protein avahiru ISKQGLGGDFEEIPSDEIIE 20 T 0.0093 Hirudin pdbhh F Eukaryota T 5gjh 2 B,D B,D CD28_HUMAN TP44 SDXMNMTP 8 T 0.089 DUF2207 unppercent F Eukaryota T 5gji 2 B B CD28_HUMAN TP44 SDXMNMTP 8 T 0.089 DUF2207 unppercent F Eukaryota T 5glf 2 B,D,F,H B,D,F,H DERL1_HUMAN DEGRADATION IN ENDOPLASMIC RETICULUM PROTEIN 1,DERTRIN-1,DER1-LIKE PROTEIN 1 RHNWGQGFRLGD 12 T 0.31 DUF6123 pdbhh F Eukaryota T 5gmi 2 C,D C,D JAM3_MOUSE JAM-C,JAM-2,JUNCTIONAL ADHESION MOLECULE 3,JAM-3 NYIRTSEEGDFRHKSSFVI 19 T 0.2 ASTN_1_2_N unphh F Eukaryota T 5gmj 2 C,D C,D JAM2_MOUSE JAM-B,JUNCTIONAL ADHESION MOLECULE 2,JAM-2,VASCULAR ENDOTHELIAL JUNCTION-ASSOCIATED MOLECULE,VE-JAM SKVTTMSENDFKHTKSFII 19 T 3.7 RhoGEF67_u1 pdbhh F Eukaryota T 5gmv 2 B,D D,C FUND1_HUMAN FUNDC1 PEPTIDE DSYEVLDL 8 T 21 DUF6417 pdbhh F Eukaryota T 5gmy 2 B B acceptor peptide, ARG-TYR-ASN-VAL-THR-ALA-CYS RYNVTAC 7 T 0.53 DUF5735 pdbhh F T 5gnd 2 B U UNK-UNK-UNK-UNK-TRP XXXXW 5 T 500 CBM_1 pdbhh F F 5gnf 1 A,B A,B L7P7R7_9CAUD Uncharacterized protein AcrF3 SMSNTISDRIVARSVIEAARFIQSWEDADPDSLTEDQVLAAAGFAARLHEGLQATVLQRLVDESNHEEYREFKAWEEALLNADGRVASSPFADWGWWYRIANVMLATASQNVGVTWGSRVHGRLMAIFQDKFKQRYEEQA 140 T 0.23 Rtt102p unppssm T Viruses T 5gnv 2 B B MAP1A_MOUSE MAP-1A AELEGGPYSPLGKDYRKAEGEREGEG 26 T 8.1 DUF2059 pdbhh F Eukaryota T 5go3 1 A,B A,B DNCV_VIBCH C-GMP-AMP SYNTHASE,3'3'-CGAMP SYNTHASE,CYCLIC AMP-GMP SYNTHASE,C-AMP-GMP SYNTHASE,DINUCLEOTIDE CYCLASE DNCV GPLGSMRMTWNFHQYYTNRNDGLMGKLVLTDEEKNNLKALRKIIRLRTRDVFEEAKGIAKAVKKSALTFEIIQEKVSTTQIKHLSDSEQREVAKLIYEMDDDARDEFLGLTPRFWTQGSFQYDTLNRPFQPGQEMNINDGTYMPMPIFESEPKIGHSLLILLVDASLKSLVAENHGWKFEAKQTCGRIKIEAEKTHIDVPMYAIPKDEFQKKQIALEANRSFVKGAIFESYVADSITDDSETYELDSENVNLALREGDRKWINSDPKIVEDWFNDSCIRIGKHLRKVCRFMKAWRDAQWDVGGPSSISLMAATVNILDSVAHDASDLGETMKIIAKHLPSEFARGVESPDSTDEKPLFPPSYKHGPREMDIMSKLERLPEILSSAESADSKSEALKKINMAFGNRVTNSELIVLAKALPAFAQEPSSASKPEKISSTMVSGHHHHHH 447 T 0.035 zf-NF-X1 pdbpssm F Bacteria T 5go7 2 B B RL40_HUMAN D-ubiquitin XXXXXXXXXGXXXXXXXXXXXXXXXXXXXXXXXXGXXXXXXXXXXXGXXXXXGXXXXXXXXXXXXXXXXXXXXXGG 76 F F Eukaryota F 5go8 2 B B RL40_HUMAN D-ubiquitin XXXXXXXXXGXXXXXXXXXXXXXXXXXXXXXXXXGXXXXXXXXXXXGXXXXXGXXXXXXXXXXXXXXXXXXXXXGG 76 F F Eukaryota F 5gob 2 B B RL40_HUMAN D-ubiquitin XXXXXXXXXGXXXXXXXXXXXXXXXXXXXXXXXXGXXXXXXXXXXXGXXXXXGXXXXXXXXXXXXXXXXXXXXXGG 76 F F Eukaryota F 5goc 2 B D RL40_HUMAN D-ubiquitin XXXXXXXXXGXXXXXXXXXXXXXXXXXXXXXXXXGXXXXXXXXXXXGXXXXXGXXXXXXXXXXXXXXXXXXXXXGG 76 F F Eukaryota F 5god 2 C,D C,D RL40_HUMAN D-ubiquitin XXXXXXXXXGXXXXXXXXXXXXXXXXXXXXXXXXGXXXXXXXXXXXGXXXXXGXXXXXXXXXXXXXXXXXXXXXGG 76 F F Eukaryota F 5gog 2 B B RL40_HUMAN D-ubiquitin XXXXXXXXXGXXXXXXXXXXXXXXXXXXXXXXXXGXXXXXXXXXXXGXXXXXGXXXXXXXXXXXXXXXXXXXXXGG 76 F F Eukaryota F 5goh 2 B D RL40_HUMAN D-ubiquitin XXXXXXXXXGXXXXXXXXXXXXXXXXXXXXXXXXGXXXXXXXXXXXGXXXXXGXXXXXXXXXXXXXXXXXXXXXGG 76 F F Eukaryota F 5goi 2 C,D C,D RL40_HUMAN D-ubiquitin XXXXXXXXXGXXXXXXXXXXXXXXXXXXXXXXXXGXXXXXXXXXXXGXXXXXGXXXXXXXXXXXXXXXXXXXXXGG 76 F F Eukaryota F 5goj 2 B B RL40_HUMAN D-ubiquitin XXXXXXXXXGXXXXXXXXXXXXXXXXXXXXXXXXGXXXXXXXXXXXGXXXXXGXXXXXXXXXXXXXXXXXXXXXGG 76 F F Eukaryota F 5gok 2 B B RL40_HUMAN D-ubiquitin XXXXXXXXXGXXXXXXXXXXXXXXXXXXXXXXXXGXXXXXXXXXXXGXXXXXGXXXXXXXXXXXXXXXXXXXXXGG 76 F F Eukaryota F 5gow 1 A A TFDP1_HUMAN DP1 YVGEDDEEDDDFNENDEDD 19 T 5.5 DUF4820 pdbhh F Eukaryota T 5gp7 2 B B UBP25_HUMAN USP25 SLSRTPADGR 10 T 3.7 tRNA-synt_2e pdbhh F Eukaryota T 5gpk 1 A,B A,B YO48_SCHPO CCP1 MEAAQAFENLANLEQEFGKAEIEILKKQNELFQPLFEQRRDILKTINNFWVVVLEAAGDEISQYITPEDSVLLEKLENIYVERFNEKEPRDVRISLTFQPNEYLQDDNLTLVKEVRMKEEKAKDDEGLEKKITKYTSQPVDIHWKPGKSMFRKNKKLPPNFFDYFQWTGEEEDDDFDGATLTIFLAEDLFPNAVKYFTEAMTEEASDEDESVDLEEDEEEEDEEDEEGDEEKQEPPSKKSKKSNAAAENLYFQGLEDYKDDDDKHHHHHHHHHH 274 T 3.4E-06 NAP unppercent F Eukaryota T 5gpn 49 KB,QB Ab,Ah NADH dehydrogenase [ubiquinone] 1 unknown subunit fragment XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 134 F F F 5gpn 51 EC,MB,XB,YB Av,Ad,Ao,Ap NADH dehydrogenase [ubiquinone] 1 unknown subunit fragment XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 43 F F F 5gpn 60 BC,FC,ZB As,Aw,Aq NADH dehydrogenase [ubiquinone] 1 unknown subunit fragment XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 76 F F F 5gpn 61 AC Ar NADH dehydrogenase [ubiquinone] 1 unknown subunit fragment XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 34 F F F 5gpn 62 CC At NADH dehydrogenase [ubiquinone] 1 unknown subunit fragment XXXXXXXXXXXXXXXXX 17 F F F 5gpn 63 DC Au NADH dehydrogenase [ubiquinone] 1 unknown subunit fragment XXXXXXXXXXXXX 13 F F F 5gqh 2 B,C B,C L7P7R7_9CAUD ACRF3,UNCHARACTERIZED PROTEIN MSNTISDRIVARSVIEAARFIQSWEDADPDSLTEDQVLAAAGFAARLHEGLQATVLQRLVDESNHEEYREFKAWEEALLNADGRVASSPFADWGWWYRIANVMLATASQNVGVTWGSRVHGRLMAIFQDKFKQRYEEQA 139 T 0.23 Rtt102p unppssm T Viruses T 5gqr 2 B C CLE44_ARATH TDIF HEVPSGPNPISN 12 T 2.6 DUF502 pdbhh F Eukaryota T 5gr9 2 B C CLE41_ARATH TDIF/CLE41 HEVPSGPNPISN 12 T 2.6 DUF502 pdbhh F Eukaryota T 5grq 2 C,D C,D ATRX_HUMAN ATP-DEPENDENT HELICASE ATRX,X-LINKED HELICASE II,X-LINKED NUCLEAR PROTEIN,XNP,ZNF-HX VTVDDDDDDNDPENRIAKKMLLEEIKANLS 30 T 15 APC_rep pdbhh F Eukaryota T 5grs 2 B,D,F,H I,J,K,L SCAP_SCHPO SREBP CLEAVAGE-ACTIVATING PROTEIN AHMNTHSGGETQVWEVWMYSQSEKKHRSKSLKMYNSLIIADPGPSLAVSDRCVAIVLGNYVALVGYGSEIFRDFYQIRNSDEMDRILRRKRKNLQRKRSGTIG 103 T 17 BBS1 pdbhh F Eukaryota T 5gs4 2 B B ARG-IAS-ILE-LEU-DNP-ARG-LEU-LEU-GLN RXILXRLLQX 10 T 3 SRC-1 pdbhh F F 5gsf 1 A A A0A1S4NYD9_9ROSI roseltide rT1 CIPRGGICLVALSGCCNSPGCIFGICA 27 T 0.0001 DUF5637 pdbhh F Eukaryota T 5gtb 2 B B PDV2_ARATH PROTEIN PLASTID DIVISION2 LVKERVEIPFDSVVAKRDVTYGYG 24 T 1 DUF1163 pdbhh F Eukaryota T 5gtc 6 K K ORF73_HHV8P LANA peptide GMRLRSGRSTGX 12 T 1.4 RNR_inhib pdbhh T Viruses T 5gtr 2 B C ARG-IAS-ILE-0JY-DPP-ARG-0JY-0JY-GLN-NH2 RXIXXRXXQX 10 T 89 Folliculin pdbhh F F 5gvo 1 A A A0A171DJY5_9ACTN SPHAERICIN GLPIGWWIERPSGWYFPI 18 T 0.0011 DUF5972 unp F Bacteria T 5gwg 1 A,B A,B Q4JEI2_RAT RATTUSIN, PROTEIN DEFAL1 LRVRRTLQCSCRRVCRNTCSCIRLSRSTYAS 31 T 0.31 F-box pdb F Eukaryota T 5gwm 1 A A Q9BML7_DROME Metabotropic GABA-B receptor subtype 1 MDSAISKEDEERYQKLVTENEQLQRLITQKEEKIRVLRQRLVERGDAKGTELN 53 T 0.0015 Csm1_N pdbpssm F Eukaryota T 5gwm 2 B B Q9VPS7_DROME GABAB RECEPTOR 3,METABOTROPIC GABA-B RECEPTOR SUBTYPE 3,ISOFORM D,ISOFORM E,ISOFORM G GPLGSRRFVVDDRRELQYRVEVQNRVYKKEIQALDAEIRKLERLLESGLT 50 T 0.0041 CCDC14 pdbpercent F Eukaryota T 5gwy 2 C,D D,E N-[(5-METHYLISOXAZOL-3-YL)CARBONYL]ALANYL-L-VALYL-N~1~-((1R,2Z)-4-(BENZYLOXY)-4-OXO-1-{[(3R)-2-OXOPYRROLIDIN-3-YL]METHYL}BUT-2-ENYL)-L-LEUCINAMIDE XAVLXX 6 T 1700 zf-H2C2_5 pdbhh F F 5gwz 2 C,D D,E N-[(5-METHYLISOXAZOL-3-YL)CARBONYL]ALANYL-L-VALYL-N~1~-((1R,2Z)-4-(BENZYLOXY)-4-OXO-1-{[(3R)-2-OXOPYRROLIDIN-3-YL]METHYL}BUT-2-ENYL)-L-LEUCINAMIDE XAVLXX 6 T 1700 zf-H2C2_5 pdbhh F F 5gxw 2 B B NUMA1_HUMAN NUCLEAR MATRIX PROTEIN-22,NMP-22,NUCLEAR MITOTIC APPARATUS PROTEIN,NUMA PROTEIN,SP-H ANTIGEN RQQRKRVSLEPHQGPGTPESKKATSCF 27 T 0.0067 DUF4023 pdbpssm F Eukaryota T 5gzt 1 A A K7ZLW6_9BACL Chitinase MNHKVHHHHHHIEGRHMELGTLEVILDRAAAFKNEANAIAYDKAGTYGPASGTETIDGNVKVTVPGVTLRNLVIKGDLLLSEGVGSGDVTLDKVSVHGLTTVSGGGEN 108 T 0.0052 DUF5649 pdbpercent F Bacteria T 5h0r 1 A F A0A0S1LIW6_CPVBM RNA-dependent RNA polymerase MLPNTKLHNTIFSETRKFTRESFKEIEHLTARLANDSVARHDFLFNTSIALISDYSGEDSNGNQLQATITIPNEIINPKEYDPSDYPLAEDESFFKQGHKYDYLVTFRAGSLTNTYEPKTKMYKLHAALDKLMHVRQRKSRFADLWRELCAVIASLDVWYQTTNYPLRTYVKLLFHRGDEFPFYESPSQDRIIFNDKSVASILPTFVYTCCQVGTAIMSGILTHVESIVAMNHFLHCAKDSYIDEKLKIKGIGRSWYQEALHNVGQATVPVWSQFNEVIGHRRKSTSEPHFVSSTFISLRAKRAELLYPEFNAYINRAIQLSKTQNDVANYYAACRAMTNDGTFLATLTELSLDAAVFPRIEQRLVTRPAVLMSNTRHESLKQKYTNGVGSIAQSYLSSFTDEIAKRVNGIHHDEAWLNFLTTSSPGRKLTEIEKLEVGGDVAAWSNSRIVMQAVFAREYRTPERIFKSLKAPIKLVERQQSDRRQRAISGLDNDRLFLSFMPYTIGKQIYELNDNAAQGKQAGNAFDIGEMLYWTSQRNVLLSSIDVAGMDASVTTNTKDIYNTFVLDVASKCTVPRFGPYYAKNMEVFEVGKRQSQVRYVNAAWQACALEAADSQTSTSYESEIFGQVKNAEGTYPSGRADTSTHHTVLLQGLVRGNELKRASDGKNSCLATIKILGDDIMEIFQGSESDTYDHAMSNANILNESGFATTAELSQNSIVLLQQLVVNGTFWGFADRISLWTREDTKDIGRLNLAMMELNALIDDLVFRVRRPEGLKMLGFFCGAICLRRFTLSVDNKLYDSTYNNLSKYMTLIKYDKNPDFDSTLMSLILPLAWLFMPRGGEYPAYPFERRDGTFTEDESMFTARGAYKRRLLYDVSNIREMIQQNSMALDDDLLHEYGFTGALLLIDLNILDLIDEVKKEDISPVKVSELATSLEQLGKLGEREKSRRAASDLKIRGHALSNDIVYGYGLQEKIQKSAMATKETTVQSKRVSSRLHDVIVAKTRDYKISTIPADALRLHEFEVEDVTVDLLPHAKHTSYSSLAYNMSFGSDGWFAFALLGGLDRSANLLRLDVASIRGNYHKFSYDDPVFKQGYKIYKSDATLLNDFFTAISAGPKEQGILLRAFAYYSLYGNVEYHYVLSPRQLFFLSDNPVSAERLVRIPPKYYVSTQCRALYNIFSYLHILRSIANNWGKRLKMVLHPGLIAYVRGTSQGAILPEADNV 1225 T 0.48 DUF445 pdbpercent T Viruses T 5h0r 2 B G Q80A92_CPV1 VP4 protein MFAIDPLKHPKLYEEYGLYLRPHQINQEIKPTTIKKKELAPTIRSIKYASLIHSMLAKHAARHNGTLINPRMYADMITLGNTKVTVTKGTPKAQIDTLKMNGLTVVSKSRRNNKKKPVSDTTASTDETTDDVVTYKALTEMSTLVESFRLPSGLTLIVFDDEKYQSLIPDYINQLITYTQPHIIPTWQGITDFSDTYLRSYFKRPFELTASNLAVPQKHNLSPITRSIFNNTGREDAIIRKLYGYGEYVFIKYEGCLITWTGLYGAVTMMVNLPKRDLGLDVGDDFLKEYKKLLFHGVITDAIPSGISAKSTVMRISPHKMMNPSGGALAVLSKYIEAVVSTNVINATLVVYAEKGAGKTSFLSTYAQQLSLASGQIVGHLSSDAYGRWLAKNKDVEEPSFEYDYVLSLDTDDNESYYEQKASELLTSHGISELSQYELLSVRRKVKMMNEMDEILIAQLDNANTHSERNFYYMVSTGKNTPRTLIVEGHFNAQDATIARTDTTILLRTINDTTQAMRDRQRSGVVQLFLRDTYYRLLPSLHTTVYPFEMLESIKRWKWVH 561 T 6.5E-05 Zeta_toxin unphh T Viruses T 5h0s 1 A,B B,C CAPSD_CPVBM VP1 MHSTNNNSNKRNNEEKHKQPEIDSSANNGEGTSGTRAQTVGDTATEAGVRNETKAGASTRRQTDGTGLSGTNAKIATASSARQTDVEKPADVTFTIENVDDVGIMQQKKPPTVVQSRTDVFNEQFANEALHPMTKVIFNGLDVNTEVQPLSDDFKQISDPKGYLTYSVKYEDQFTKKDKLRASEADDRIVGPTVNLFKYGAAVVNIDLNRDFFDTATGIDLTKGIPLVQDLLVPIGVTAGAEQSAEYVSGLLMVLFKVMTDNRLVIVGETTTPMSNTLSTVVNNVLRTTYHNNVGVNPALLRDFTQVNWLNRDITNMLQQAGTKYGLGLTETRLDYVRLVKTIVGHALNIDHFAASVLNINLRALMEANVTADDRIKALQAHSMISTQFHGPNQGALRPELAFDHDHIIRCLMLAAANYPRLEGIIVQINTGYVASANVIRPVSEKRYFPENLEQNQSAARLVSAVKARASEADISSIHLAIAREVSPMFNVHELKKIAESFEDPSSIVVVLEFILFALFFPTEFNRIKGDIQNVLLLFFSRWYPVEYGIFIQRGATYTINAAGEFEFSGRNEKWDQSLYLSEHFPALFSDVPLAGANTIIAIMRLFTPQGFLRTDDLAIAANFPRASRNPQTYIPYTNQRGTVTNEFASRFRTIVATLANVVNERAVQDDMQKATRSCTKQWLRHLETQFDNIAVAHTDHLSVVYATMSNFMLNFTNNFSGNHATFKPDQYVITSPEGSYKPIIERQGETVDGLTIIDTSIVWPILCQCTYPLVRQSGKGVDAVSIMEEIVYPDPSTTLSQSLSVAQVLSKLTLPDAFINMILSGGDSVVMRTYQTEADDDLDEGIRMTTYDQYLSHIRERLHITNVPDPIYITGASTPDQIAASVQATHVAVVLYQSGVINGSASTYLRENEVLVVMPDYYDVVSRFANANLQMNNNRYHESVLEIADIFDQADFIQTSDAVRQLRALMPTLSTSQIRHAIERIAQITDVDSTDYGKLTLRFLGTLTRSLKMQNAQIRRIRPDGTVLRYDDQIDIEAFRWSRYFLDELRLRRLSVGLRLITNPRIARRFDGVRIMYLTDDDPDPDFVPDVPEGYVAVQYAHRLFSSSLANKRNRVTYTHPPTGMAYPSPTGRPHVHMTINERAGMSKLVADNIIASVIKSNWVVDIHDIEYTAEVMTPSEGYTQHVDAESIMTAPKGKLFHLQFMDGLLRPEPSAFDPPASGEDMRLIYPLQPISVARSMRAIVNHNEVDRPRGAVAPSSYEMDTGTLSRNGDLLYSPVANGQVGIPKLEVDHISFSNVVSMMTANIRTGDDMAVERVNPDDVRAINIRNA 1333 T 0.96 DUF2717 pdbpercent T Viruses T 5h0u 2 B B HIS-HIS-HIS-HIS-HIS-HIS HHHHHH 6 T 6100 zf_CCCH_4 pdbhh F F 5h1h 1 A A Bradykinin-trypsin inhibitor secondary loop chimera CFPDGRCKRPPGFSPL 16 T 0.0039 Bradykinin pdbhh F T 5h1i 1 A A Bradykinin-trypsin inhibitor secondary loop chimera CKRPPGFSPLCTKSIPPI 18 T 0.0047 Bradykinin pdbhh F T 5h1z 1 A A A0A1S4NYE0_9SPHN putative CYP alkane hydroxylase CYP153D17 ATLAQDVIDRFDVSRPELYRDDLWQAPFRELRATAPVHRVEHSDFGPYWSVSSYKPIITVESLPDLYSSAGGITLADFIENNPTDVRMPMFIAMDRPKHTGQRRTVAPAFTPSEMVRMSDNIRMRTAEVLDSLEWNTPFDWVDTVSVELTTQMLAILFDFPWEERRKLTFWSDWAGDIELVKNEELRLERLRHMYECGGYFQNLWNAKIGKPPTPDLISMMIHSDAMAEMDQMEFLGNLILLIVGGNDTTRNTMSAVAYGLDLFPDQRAKLEADPSMIPNTVQEIIRWQTPLAHMRRTATVDSELEGQQIKAGDKLALWYISANRDESVFENADRIIVDRPNARRHLAFGHGIHRCVGARLAELQIAVLLEEMAKRRMRVNVLGEPERVAACFVHGYRKLPVEISRY 407 T 4.5E-25 p450 pdbpssm F Bacteria T 5h2c 2 B B NVJ1_YEAST Nucleus-vacuole junction protein 1 KHYNDGERAVLQFGKNRSEPIILSYKD 27 T 2.8 Mg-por_mtran_C pdbhh F Eukaryota T 5h2v 2 B B ULP1_YEAST Ubiquitin-like-specific protease 1 MSVEVDKHRNTLQYHKKNPYSPLFSPISTYRCYPRVLNNPSESRRSASFSGIYKKRTNTSRFNYLNDRRVLSMEESMKDGSDRASKAGFIGGIRETLWNSGKYLWHTFVKNEPRNFDGSEVEASGNSDVESRSSGSRSSDVPYGLRENYS 150 T 12 DUF1412 pdbhh F Eukaryota T 5h2w 2 B,D B,D ULP1_YEAST Ubiquitin-like-specific protease 1 SSDTRKHKFDTSTWALPNKRRRIESEGVGTPSTSPISSLASQKSNCDSDNSITFSRDPFGWNKWKTSAIGSNSENNTSDQKNSYDRRQYGTAFIRKKKVAKQNINNTKLVSRAQSEEVTYLRQIFNGEYKVPKILKEERERQLKLMDMDKEKDTGLKKSIIDLTEKIKTILIENNKNRLQTRNENDDDLVF 191 T 23 DUF6203 pdbpssm F Eukaryota T 5h2x 2 B B ULP1_YEAST Ubiquitin-like-specific protease 1 SSDTRKHKFDTSTWALPNKRRRI 23 T 9.8 DUF3579 pdbhh F Eukaryota T 5h3j 2 B B GO45_MOUSE BASIC LEUCINE ZIPPER NUCLEAR FACTOR 1 GPEFHPYTRYENITFNCCNHCQGELIAL 28 T 0.23 zf_Rg pdbhh F Eukaryota T 5h43 3 C C KAT8_HUMAN LYSINE ACETYLTRANSFERASE 8,MOZ,YBF2/SAS3,SAS2 AND TIP60 PROTEIN 1,HMOF SELAEQPERKITRNQ 15 T 0.0039 Myosin_head unp F Eukaryota T 5h4d 2 B,D D,C ARG-HIS-LYS RHKX 4 T 210 Peroxin-3 pdbhh F F 5h4p 44 RA z REH1_YEAST REI1-HOMOLOG 1,PRE-60S FACTOR REH1 TITAADRRMVSGVTEKQYKKGMKKMQQLEKNAINTQIRREIKRVNFQTHYRDELLQ 56 T 0.034 Phage_Mu_F pdb F Eukaryota T 5h5m 1 A,B A,B HMP1_CAEEL PROTEIN HUMPBACK-1 GGIQGDLINEIDTFQNRIEIDPAHYRRGTDRPDLEGHCERIVSGSASIADAESTRENRKQKIVAECNNLRQALQELLTEYEKSTGRRDDNDDIPLGIAEVHKRTKDLRRHLRRAIVDHISDAFLDTRTPLILLIEAAKEGHEENTRYRSKMFQEHANEIVSVARLSCQLSSDVESVSVIQHTAAQLEKLAPQVAQAAILLCHQPTSKTAQENMETYKNAWFDKVRLLTTALDNITTLDDFLAVSEAHIVEDCERGIKGITANASTPDENAANCETVDCAAGSIRGRALRVCDVVDAEMDFLQNSEYTETVKQAVRILKTQRVDQFAERASALANRQEAHGLTWDPKTKEEEMNEFINACTLVHDAVKDIRHALLMNRSMND 381 T 4.4E-80 Vinculin unp F Eukaryota T 5h5q 2 B B GXpep-1 XCRVDLQGWRRCRRX 15 T 1.1 WYL_2 pdbhh F T 5h5r 2 B B GXpep-2 XCRAWYQNYCALRRX 15 T 0.031 LIN37 pdbhh F T 5h5s 2 B B GXpep-3 VPCPYLPLWNCAGK 14 T 1.4 DUF4708 pdbhh F T 5h5y 1 A,B A,B A0A0D7C3R7_ECOLX T3SS EFFECTOR NLEB MLSPIRTTFHNSVNIVQSSPSQTVSFAGKEYELKVIDEKTPILFQWFEPNPERYKKDEVPIVNTKQHPYLDNVTNAARIESDRMIGIFVDGDFSVNQKTAFSKLERDFENVMIIYREDVDFSMYDRKLSDIYHDIICEQRLRTEDKRDEYLLNLLEKELREISKAQDSLISMYAKKRNHAWFDFFRNLALLKAGEIFRSTYNTKNHGISFGEGCIYLDMDMILTGKLGTIYAPDGISMHVDRRNDSVNIENSAIIVNRSNHPALLEGLSFMHSKVDAHPYYDGLGKGVKKYFNFTPLHNYNHFCDFIEFNHPNIIMNTSQYTCSSW 326 T 2.5E-05 Glyco_transf_88 pdbhh F Bacteria T 5h60 1 A A Q9L9J3_SALTY Transferase MIPPLNRYVPALSKNELVKTVTNRDIQFTSFNGKDYPLCFLDEKTPLLFQWFERNPARFGKNDIPIINTEKNPYLNNIIKAATIEKERLIGIFVDGDFFPGQKDAFSKLEYDYENIKVIYRNDIDFSMYDKKLSEIYMENISKQESMPEEKRDCHLLQLLKKELSDIQEGNDSLIKSYLLDKGHGWFDFYRNMAMLKAGQLFLEADKVGCYDLSTNSGCIYLDADMIITEKLGGIYIPDGIAVHVERIDGRASMENGIIAVDRNNHPALLAGLEIMHTKFDADPYSDGVCNGIRKHFNYSLNEDYNSFCDFIEFKHDNIIMNTSQFTQSSWARHVQ 336 T 2.2E-05 Glyco_transf_88 pdbhh F Bacteria T 5h61 1 A,B A,B Q8ZNP4_SALTY Transferase MARFNAAFTRIKIMFSRIRGLISCQSNTQTIAPTLSPPSSGHVSFAGIDYPLLPLNHQTPLVFQWFERNPDRFGQNEIPIINTQKNPYLNNIINAAIIEKERIIGIFVDGDFSKGQRKALGKLEQNYRNIKVIYNSDLNYSMYDKKLTTIYLENITKLEAQSASERDEVLLNGVKKSLEDVLKNNPEETLISSHNKDKGHLWFDFYRNLFLLKGSDAFLEAGKPGCHHLQPGGGCIYLDADMLLTDKLGTLYLPDGIAIHVSRKDNHVSLENGIIAVNRSEHPALIKGLEIMHSKPYGDPYNDWLSKGLRHYFDGSHIQDYDAFCDFIEFKHENIIMNTSSLTASSWR 348 T 2.1E-05 Glyco_transf_88 pdbhh F Bacteria T 5h62 1 A,B A,B Q8ZNP4_SALTY Transferase MARFNAAFTRIKIMFSRIRGLISCQSNTQTIAPTLSPPSSGHVSFAGIDYPLLPLNHQTPLVFQWFERNPDRFGQNEIPIINTQKNPYLNNIINAAIIEKERIIGIFVDGDFSKGQRKALGKLEQNYRNIKVIYNSDLNYSMYDKKLTTIYLENITKLEAQSASERDEVLLNGVKKSLEDVLKNNPEETLISSHNKDKGHLWFDFYRNLFLLKGSDAFLEAGKPGCHHLQPGGGCIYLDADMLLTDKLGTLYLPDGIAIHVSRKDNHVSLENGIIAVNRSEHPALIKGLEIMHSKPYGDPYNDWLSKGLRHYFDGSHIQDYDAFCDFIEFKHENIIMNTSSLTASSWR 348 T 2.1E-05 Glyco_transf_88 pdbhh F Bacteria T 5h63 1 A,B,C,D A,B,C,D Q8ZNP4_SALTY Transferase MARFNAAFTRIKIMFSRIRGLISCQSNTQTIAPTLSPPSSGHVSFAGIDYPLLPLNHQTPLVFQWFERNPDRFGQNEIPIINTQKNPYLNNIINAAIIEKERIIGIFVDGDFSKGQRKALGKLEQNYRNIKVIYNSDLNYSMYDKKLTTIYLENITKLEAQSASERDEVLLNGVKKSLEDVLKNNPEETLISSHNKDKGHLWFDFYRNLFLLKGSDAFLEAGKPGCHHLQPGGGCIYLDADMLLTDKLGTLYLPDGIAIHVSRKDNHVSLENGIIAVNRSEHPALIKGLEIMHSKPYGDPYNDWLSKGLRHYFDGSHIQDYDAFCDFIEFKHENIIMNTSSLTASSWR 348 T 2.1E-05 Glyco_transf_88 pdbhh F Bacteria T 5h6y 2 B B GLY-KCR-GLY GXG 3 T 360 NAD_binding_3 pdbhh F F 5h7g 2 C,D C,D F1324 peptide LWYTDIRMSWRVP 13 T 7.8 OTT_1508_deam pdbhh F T 5h7h 2 B,C B,C F1324 peptide residues 10-13 XWRVP 5 T 24 COMM_domain pdbhh F F 5h7y 2 B B Q9I3K2_PSEAE Uncharacterized protein DDLFASIGALWTWAWRGPKARQELLKA 27 T 9.5 Lentiviral_Tat pdbhh F Bacteria T 5h9b 2 B B KCNAE_DROME ETHER-A-GO-GO PROTEIN GVLPKAPKLQASQATLARQDTIDEGGEVDSSPPSRDSRVVIEGAAVSSATVGPS 54 T 30 DUF4491 pdbhh F Eukaryota T 5hau 58 FB,LD 1x,2x CTHL3_BOVIN BACTENECIN-7,BAC7,PR-59 RRIRPRPPRLPRPRPRPLPFPRPGPRPIPRPLPFP 35 T 0.027 TonB_N unppercent F Eukaryota T 5haw 3 D,E L,K FTSZ_ECOLI FtsZ CTT DYLDIPAFLR 10 T 1.8 Duffy_binding pdbhh F Bacteria T 5hax 2 B B NUP53_CHATD NUCLEAR PORE PROTEIN NUP53 SQDDEFCRVIPTVRKAKLLPMEEALLPAPTFTQ 33 T 16 FtsX_ECD pdbhh F Eukaryota T 5hb0 2 E,F,G,H E,F,G,H NU145_CHATD NUCLEAR PORE PROTEIN NUP145 SHKKLVINKDMRTDLFSPPNKD 22 T 5.2 MethyTransf_Reg pdbhh F Eukaryota T 5hb3 2 B,D B,D NUP53_CHATD NUCLEAR PORE PROTEIN NUP53 SQQDGSLRSRKANLETGAFGKSTRRTRSKAATPAKREDPTIAAADKIFSNWLASQ 55 T 23 DUF5830 pdbhh F Eukaryota T 5hbu 3 M K FTSZ_ECOLI FtsZ CTT peptide DYLDIPAFLR 10 T 1.8 Duffy_binding pdbhh F Bacteria T 5hcc 4 D D C5I3_DERAN Dermacentor andersoni RaCI3 GPMSGESQSIQRKGQCEEVICHRKLNHLGERVTSGCPTGCLCVIREPDNVDNANGTCYALMSSTTTTTTTPDGTTTSEEEE 81 T 0.05 UPAR_LY6_2 pdbpercent F Eukaryota T 5hcd 4 D D C5I2_RHIMP Rhipicephalus microplus RaCI2 GPMEEANTTPISVKDQCANVTCRRTVDNRGKRHIDGCPPGCLCVLKGPDSKDNLDGTCYLLATTPKSTTTSTEQSFNMEE 80 T 0.061 CBM_19 pdb F Eukaryota T 5hce 4 D D C5I1_RHIAP Rhipicephalus appendiculatus RaCI1 GPMEEVKTTPIPNHQCVNATCERKLDALGNAVITKCPQGCLCVVRGASNIVPANGTCFQLATTKPPMAPGDNKDNKEEESN 81 T 0.0095 UPAR_LY6_2 unppercent F Eukaryota T 5hcp 55 CB,FD 1z,2z MK1_PALPR METALNIKOWIN I VDKPDYRPRPRPPNM 15 T 2.3 Toxin_33 pdbhh F Eukaryota T 5hcq 55 CB,FD 1z,2z Oncocin d15-19 VDKPPYLPRPRPPR 14 T 0.11 Apidaecin pdbhh F T 5hcr 55 CB,FD 1z,2z Oncocin 10wt VDKPPYLPRPRPPRRIYNR 19 T 0.18 Apidaecin pdbhh F T 5hd1 55 CB,FD 1z,2z PYRRH_PYRAP Pyrrhocoricin VDKGSYLPRPTPPRPIYNRN 20 T 2.5 Apidaecin pdbhh F Eukaryota T 5hda 2 C,D B,D EBNA2_EBVB9 EBV NUCLEAR ANTIGEN 2 SMPELSPVL 9 T 1.8 Fapy_DNA_glyco pdbhh T Viruses T 5hdt 2 C E WAPL_HUMAN FRIEND OF EBNA2 PROTEIN,WAPL COHESIN RELEASE FACTOR MTSRFGKTYSRKGGNGSSKFDEVFSNKRTTLST 33 T 0.21 BRCT_assoc pdbhh F Eukaryota T 5hf3 2 B B TAU_HUMAN modified Tau peptide XRTPSLPTX 9 T 2.9 UPF0167 pdbhh F Eukaryota T 5hgv 2 B,D B,D CSK21_HUMAN TYR-PRO-GLY-GLY-SER-THR-PRO-VAL-SER-SER-ALA-ASN-MET-MET YPGGSTPVSSANMM 14 T 13 Pr_beta_C pdbhh F Eukaryota T 5hhc 2 C,D C,D D- Vascular endothelial growth factor-A XXXXXGGXXXXXXXXGXXXXGXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGGXXX 69 F F F 5hhd 2 C,D C,D D-Peptide RFX037.D XXXXXGGXXXXXXXXGXXXXGXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGGXXX 69 F F F 5hhd 3 E,F E,F D-Vascular endothelial growth factor GXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGGXXXXXGXXXXXXXXXXXXXXXXXXXXXXGXXXGXXXXXXXXXXXXXXXXX 102 F F F 5hhm 3 C,H C,H M1-F5L, GILGLVFTL GILGLVFTL 9 T 0.72 Asp4 pdbhh F T 5hhn 3 C C M1-F5L, GILGLVFTL GILGLVFTL 9 T 0.72 Asp4 pdbhh F T 5hho 5 E C M1-G4E, GILEFVFTL GILEFVFTL 9 T 24 Cas9_PI2 pdbhh F T 5hhp 3 C C M1-G4E, GILEFVFTL GILEFVFTL 9 T 24 Cas9_PI2 pdbhh F T 5hhq 3 C C M1-L3W, GIWGFVFTL GIWGFVFTL 9 T 2 Tr-sialidase_C pdbhh F T 5hhv 3 D I IL-17A peptide inhibitor XIHVTIPADLWDWINK 16 T 0.59 VapB_antitoxin pdbhh F T 5hhx 3 D I IL-17A peptide inhibitor XIHVTIPADLWDWINK 16 T 0.59 VapB_antitoxin pdbhh F T 5hi3 4 F I synthetic IL-17A peptide inhibitor XIHVTIPADLWDWINK 16 T 0.59 VapB_antitoxin pdbhh F T 5hi4 4 F I synthetic IL-17A peptide inhibitor XIHVTIPADLWDWINK 16 T 0.59 VapB_antitoxin pdbhh F T 5hi5 4 F I synthetic IL-17A inhibitor XIHVTIPADLWDWINK 16 T 0.59 VapB_antitoxin pdbhh F T 5hi8 1 A,B A,B E3SMK9_9CAUD CPET NSMIDKFCDWFEGEFDNWTQAASNPTKWAHIIVKHEKISEYKYHTSSRYSYMDKPYREQTVDIEYVCPELIIVHNPACDIIFKWTGIYFEGESEPDCQWNGQPLDSKARLYADEYHTWDVGYWEGSEGFFHFKKNV 136 T 3E-16 CpeT pdbpercent T Viruses T 5hit 2 B B KCNH1_MOUSE Potassium voltage-gated channel subfamily H member 1 APLILPPDHPVRRLFQR 17 T 1.2 DUF4196 pdbhh F Eukaryota T 5hkh 2 B B UBA5_HUMAN ASP-ASN-GLU-TRP-GLY-ILE-GLU-LEU-VAL DNEWGIELV 9 T 2.3 LT-IIB pdbhh F Eukaryota T 5hkp 2 C,D C,D TERF1_HUMAN NIMA-INTERACTING PROTEIN 2,TTAGGG REPEAT-BINDING FACTOR 1,TELOMERIC PROTEIN PIN2/TRF1 SHMAEDVSSAAPSPRGCADGRDADPTEEQMAETERNDEEQFECQELLECQVQVGAPE 57 T 19 VCX_VCY pdbhh F Eukaryota T 5hky 2 B B SPY2_HUMAN SPRY-2 XQQVHVLSLDQIRAIRNTNEXTEGPT 26 T 3.3 KAR9 unp F Eukaryota T 5hkz 2 B B SPY2_HUMAN SPRY-2 XQQVHVLSLDQIRAIRNTNEXTEGPT 26 T 3.3 KAR9 unp F Eukaryota T 5hl0 2 B B SPY2_HUMAN Sprouty 2 (SPRY2) XEXTEGPT 8 T 3.3 KAR9 unp F Eukaryota F 5hm9 2 B C poly(UNK) XXXXX 5 F F F 5hma 2 B C Unidentified peptide XXXXX 5 F F F 5hog 2 D,E D,E DNA2_YEAST Dna2p SLRNIDDILDDIEGDLT 17 T 5.4 RMP pdbhh F Eukaryota T 5hoi 2 D,E,F D,E,F TOF2_YEAST Topoisomerase 1-associated factor 2 SHAKDVKIQETIRKLNRFKPT 21 T 2.2 DUF5611 pdbhh F Eukaryota T 5hpm 3 E,F E,F Cyclic amidated, acetylated linked meditope XQFDLSTRRLK 11 T 11 DUF4180 pdbhh F T 5hpp 1 A A ORN-THR-ILE-ALA-MAA-LEU-LEU-SER-ORN-SER-PHI-SER-THR-THR-ALA-VAL XTIAXLLSXSXSTTAV 16 T 4.6 PAGK pdbhh F T 5hq8 2 C,D I,J M3K2_HUMAN MEKK2 peptide YDNPIFEKFGKGGTYX 16 T 3.8 Thrombin_light pdbhh F Eukaryota T 5hs5 1 A,B A,B SARX_STAA8 STAPHYLOCOCCAL ACCESSORY REGULATOR X ETLLGFYKQYKALSEYIDKKYKLSLNDLAVLDLTMKHCKDEKVLMQSFLKTAMDELDLSRTKLLVSIRRLIEKERLSKVRSSKDERKIYIYLNNDDISKFNALFEDVEQFLNILEHHHHHH 121 T 0.00054 AphA_like unphh F Bacteria T 5hsv 2 E,F,G,H E,F,G,H Alisporivir XXXXVXAXXXX 11 T 19 MHC2-interact pdbhh F F 5hsz 2 C K FTSZ_ECOLI C-terminal Tail of FtsZ LDIPAFLRKQA 11 T 0.8 Drc1-Sld2 pdbhh F Bacteria T 5htb 2 B B ARC-3353 inhibitor ARKKQTAX 8 T 15 DHHA2 pdbhh F T 5htc 2 B B ARC-3372 INHIBITOR ARKKQTAX 8 T 15 DHHA2 pdbhh F T 5hu3 2 B B KCNAE_DROME ETHER-A-GO-GO PROTEIN GVLPKAPKLQASQATLARQDTIDEGGEVDSSPPSRDSRVVIEGAAVSSATVGPS 54 T 30 DUF4491 pdbhh F Eukaryota T 5hu6 4 D D Q581F2_TRYB2 Haptoglobin-hemoglobin receptor GLKTKDEVEKACHLAQQLKEVSITLGVIYRTTERHSVQVEAHKTAIDKHADAVSRAVEALTRVDVALQRLKELGKANDTKAVKIIENITSARENLALFNNETQAVLTARDHVHKHRAAALQGWSDAKEKGDAAAEDVWVLLNAAKKGNGSADAKAAAEKCSRYSSSSTSETELQKAIDAAANVGGLSAHKSKYGDVLNKFKLSNASVGAVRDTSGRGGKHMEKVNNVAKLLKDAEVSLAAAAAEIEEVKNAHETKVQEEM 260 T 8.4E-05 GARP unphh F Eukaryota T 5huw 2 B,C A,B TRM3_HHV11 HSV1 large terminase NLS GPPKKRAKVDVA 12 T 4.7 DUF4611 pdbhh T Viruses T 5huy 2 B,C B,A D3YRZ5_HCMVO HCMV small terminase VSRRVRATRKRPRRAS 16 T 5.6 DUF2569 pdbhh T Viruses T 5hvp 2 C C ACETYL-*PEPSTATIN XVVXAX 6 T 1700 FAM60A pdbhh F F 5hx2 1 A A BP07_BPT4 GENE PRODUCT 7, GP7 MTVKAPSVTSLRISKLSANQVQVRWDDVGANFYYFVEIAETKTNSGENLPSNQYRWINLGYTANNSFFFDDADPLTTYIIRVATAAQDFEQSDWIYTEEFETFATNAYTFQNMIEMQLANKFIQEKFTLNNSDYVNFNNDTIMAALMNESFQFSPSYVDVSSISNFIIGENEYHEIQGSIQQVCKDINRVYLMESEGILYLFERYQPVVKVSNDKGQTWKAVKLFNDRVGYPLSKTVYYQSANTTYVLGYDKIFYGRKSTDVRWSADDVRFSSQDITFAKLGDQLHLGFDVEIFATYATLPANVYRIAEAITCTDDYIYVVARDKVRYIKTSNALIDFDPLSPTYSERLFEPDTMTITGNPKAVCYKMDSICDKVFALIIGEVETLNANPRTSKIIDSADKGIYVLNHDEKTWKRVFGNTEEERRRIQPGYANMSTDGKLVSLSSSNFKFLSDNVVNDPETAAKYQLIGAVKYEFPREWLADKHYHMMAFIADETSDWETFTPQPMKYYAEPFFNWSKKSNTRCWINNSDRAVVVYADLKYTKVIENIPETSPDRLVHEYWDDGDCTIVMPNVKFTGFKKYASGMLFYKASGEIISYYDFNYRVRDTVEIIWKPTEVFLKAFLQNQEHETPWSPEEERGLADPDLRPLIGTMMPDSYLLQDSNFEAFCEAYIQYLSDGYGTQYNNLRNLIRNQYPREEHAWEYLWSEIYKRNIYLNADKRDAVARFFESRSYDFYSTKGIEASYKFLFKVLYNEEVEIEIESGAGTEYDIIVQSDSLTEDLVGQTIYTATGRCNVTYIERSYSNGKLQWTVTIHNLLGRLIAGQEVKAERLPSFEGEIIRGVKGKDLLQNNIDYINRSRSYYVMKIKSNLPSSRWKSDVIRFVHPVGFGFIAITLLTMFINVGLTLKHTETIINKYKNYKWDSGLPTEYADRIAKLTPTGEIEHDSVTGEAIYEPGPMAGVKYPLPDDYNAENNNSIFQGQLPSERRKLMSPLFDASGTTFAQFRDLVNKRLKDNIGNPRDPENPTQVKIDE 1032 T 0.033 fn3 pdb T Viruses T 5hyj 3 C,H C,H INS_HUMAN ALA-GLN-TRP-GLY-PRO-ASP-PRO-ALA-ALA-ALA AQWGPDPAAA 10 T 3.1 LEA_3 pdbhh F Eukaryota T 5hyn 5 E,J,O,T E,J,P,U JARD2_HUMAN JARID2 K116me3 RLQAQRKFAQSQ 12 T 25 DUF4395 pdbhh F Eukaryota T 5hyp 2 B B W0T1Y4_STRPY M28 protein GPGSAESPKSTETSANGADKLADAYNTLLTEHEKLRDEYYTLIDAKEEEPRYKALRGENQDLREKEGKYQDKIKKLEEKEKNLEKKSEDVERHYLKKLDQEHKE 104 T 0.0036 TMF_DNA_bd pdbpssm F Bacteria T 5hyq 3 E,F E,F Amidated meditope CQFDLSTRRLKX 12 T 3.1 Flavi_NS1 pdbhh F T 5hyu 1 A A M21_STRPY M protein, serotype 2.1 GPGSNSKNPVPVKKEAKLSEAELHDKIKNLEEEKAELFEKLDKVEEEHKKVEEEHKKDHEKLEKKSEDVERHYLRQLDQEYKEQQERQKNLEELERQSQREVEKR 105 T 0.0091 APG6_N pdb F Bacteria T 5hyx 2 B A RGF1_ARATH PTR-SER-ASN-PRO-GLY-HIS-HIS-PRO-HYP-ARG-HIS-ASN DXSNPGHHPXRHN 13 T 22 Pterin_4a pdbhh F Eukaryota T 5hz0 2 B A RGF2_ARATH ASP-PTR-TRP-LYS-PRO-ARG-HIS-HIS-PRO-HYP-ARG-ASN-ASN DXWKPRHHPXRNN 13 T 13 Metal_hydrol pdbhh F Eukaryota T 5hz1 2 B A RGF3_ARATH ASP-PTR-TRP-ARG-ALA-LYS-HIS-HIS-PRO-HYP-LYS-ASN-ASN DXWRAKHHPXKNN 13 T 0.16 N_formyltrans_C pdbpercent F Eukaryota T 5hz3 2 B A RGF5_ARATH ASP-PTR-PRO-LYS-PRO-SER-THR-ARG-PRO-HYP-ARG-HIS-ASN DXPKPSTRPXRHN 13 T 6.6 DUF2101 pdbhh F Eukaryota T 5hzp 1 A,B C,A M49_STRP9 M protein, serotype 49 GPGSAEKKVEAKVEVAENNVSSVARREKELYDQIADLTDKNGEYLERIGELEERQKNLEKLEHQSQVAADKHYQEQAKKHQEYKQEQEER 90 T 0.026 HCR unphh F Bacteria T 5hzy 1 A A Q5ZUV9_LEGPH Uncharacterized protein RavZ MHHHHHHENLYFQGSSIYPPETSWEVNKGMNSSRLHKLYSLFFDKSSAFYLGDDVSVLEDKPLTGAYGFQSKKNDQQIFLFRPDSDYVAGYHVDAKSDAGWVNDKLDRRLSEISEFCSKATQPATFILPFVEMPTDITKGVQHQVLLTISYDPKSKQLTPTVYDSIGRDTYSESLSSYFKGKYRTTCDEILTQSIEKAIKSTDFTLGKFTRAAYNHQNRLTEGNSGSYTFRTIKEVISSSAQGTEVKIPGSGYITSNSYLTSQHVQDIESCIKYRNLGVVDIESALTEGKTLPVQLSEFIVALEDYGKLRSQQSEKSMLNFIGYSKTAKLTAVELLIGILNDIKGKNEISESQYDKLVKEVDCLMDSSLGKLVQFHLKNLGAESLQKLVLPCVKFDDTIDDFVTIEKDELFDVPDITGEELASKKGIEQGALDKEALLKQKQIKTDLLDLREEDKTGLKKPLHGGIKVK 469 T 16 Crr6 pdbhh F Bacteria T 5i1n 2 E,F,G,H E,F,G,H D-Villin headpiece subdomain XXXXXXXXXXGXXXXXXXXXXXXXXXXXXXXXGXX 35 F F F 5i1o 2 E,F,G,H E,F,G,H D-Villin headpiece subdomain XXXXXXXXXXGXXXXXXXXXXXXXXXXXXXXXGXX 35 F F F 5i1p 2 E,F,G,H E,F,G,H D-Villin headpiece subdomain XXXXXXXXXXGXXXXXXXXXXXXXXXXXXXXXGXX 35 F F F 5i1s 2 C,D C,D D-Villin headpiece subdomain XXXXXXXXXXGXXXXXXXXXXXXXXXXXXXXXGXX 35 F F F 5i22 2 B B POLN_CHIKS CHIKV nsP3 peptide STVPVAPPRRRRGRNLT 17 T 2.4 HCV_NS5a_C pdbhh T Viruses T 5i25 2 B B KNG1_HUMAN ASN-PRO-ILE-SER-ASP-PHE-PRO-ASP NPISDFPD 8 T 1.6 Ku_PK_bind pdbhh F Eukaryota T 5i2i 3 E,F E,F Meditope GQQDLSTRRLKG 12 T 12 DUF262 pdbhh F T 5i4l 82 XD m2 60S ribosomal protein L12 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 155 F F F 5i4l 85 EF,FF p1,p2 Ribosomal protein P1 alpha, P2 beta XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 47 F F F 5i4q 1 A A CDIA_ECONC CDIA LSYLGIGKKISFDGDFYTVDGMKFSKSYYEKLWEQGRPAPFVQAREVLNSNPKIEPDPRGAPGYLRYEGAGLEMIYNPKTGQVGHIQPVKVK 92 T 2.1 SspH pdbhh F Bacteria T 5i4q 2 B B CDII_ECONC CDII MDIWPEFQRDLEMYRDVVLSIKRNLRLYEECIESLVHQIGSTNFDNAQPLFDDLFRMQSELATMLYKYEYKPGKRIQDLIYHLDRDDFYSRKYWHKKFSDGLAWPEAGHHHHHH 114 T 0.0028 DUF4041 unppercent F Bacteria T 5i4r 1 A,E A,E CDIA_ECONC CDIA LSYLGIGKKISFDGDFYTVDGMKFSKSYYEKLWEQGRPAPFVQAREVLNSNPKIEPDPRGAPGYLRYEGAGLEMIYNPKTGQVGHIQPVKVK 92 T 2.1 SspH pdbhh F Bacteria T 5i4r 4 D,H B,F CDII_ECONC CDII MDIWPEFQRDLEMYRDVVLSIKRNLRLYEECIESLVHQIGSTNFDNAQPLFDDLFRMQSELATMLYKYEYKPGKRIQDLIYHLDRDDFYSRKYWHKKFSDGLAWPEAGHHHHHH 114 T 0.0028 DUF4041 unppercent F Bacteria T 5i5a 2 B B D-ShK XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGXX 35 F F F 5i5b 2 B B D-ShK XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGXX 35 F F F 5i6a 1 A,B A,B ALA-PHE-GLY-LYD-VAL-PHE-PRO-GLN-ALA-GLY AFGXVFPQAG 10 T 5.5 TraW_N pdbhh F T 5i70 2 C,D C,D Pepstatin A XVVXAX 6 T 1700 FAM60A pdbhh F F 5i7p 1 A A FKB1A_HUMAN;SLYD_ECOLI PPIASE FKBP1A,12 KDA FK506-BINDING PROTEIN,FKBP-12,CALSTABIN-1,FK506-BINDING PROTEIN 1A,FKBP-1A,IMMUNOPHILIN FKBP12,ROTAMASE,PPIASE,HISTIDINE-RICH PROTEIN,METALLOCHAPERONE SLYD,ROTAMASE,SENSITIVITY TO LYSIS PROTEIN D,WHP,PPIASE FKBP1A,12 KDA FK506-BINDING PROTEIN,FKBP-12,CALSTABIN-1,FK506-BINDING PROTEIN 1A,FKBP-1A,IMMUNOPHILIN FKBP12,ROTAMASE GVQVETISPGDGRTFPKRGQTAVVHYTGMLEDGKKFDSSRDRNKPFKFMLGKQEVIRGWEEGVAQMSVGQRAKLTISPDYAYGQYDENLVQRVPKDVFMGVDELQVGMRFLAETDQGPVPVEITAVEDDHVVVDGNHMLAGQNLVFDVELLKLEAHHHHHH 161 T 3.9E-16 FKBP_C unppercent F Bacteria T 5i7q 1 A A FKB1A_HUMAN;FKBX_ECOLI PPIASE FKBP1A,12 KDA FK506-BINDING PROTEIN,FKBP-12,CALSTABIN-1,FK506-BINDING PROTEIN 1A,FKBP-1A,IMMUNOPHILIN FKBP12,ROTAMASE,PPIASE,ROTAMASE,PPIASE FKBP1A,12 KDA FK506-BINDING PROTEIN,FKBP-12,CALSTABIN-1,FK506-BINDING PROTEIN 1A,FKBP-1A,IMMUNOPHILIN FKBP12,ROTAMASE GVQVETISPGDGRTFPKRGQTAVVHYTGMLEDGKKFDSSRDRNKPFKFMLGKQEVIRGWEEGVAQMSVGQRAKLTISPDYAYGVPSPDLIQYFSRREFMDAGEPEIGAIMLFTAMDGSEMPGVIREINGDSITVDFNHPLAGQTLVFDVELLKLEAHHHHHH 162 T 2.3E-18 FKBP_C unppercent F Bacteria T 5i7z 2 B B CRUM3_HUMAN Crb-3 LPPEERLI 8 T 0.38 LOH1CR12 pdbhh F Eukaryota F 5i87 1 A A BT-CD domains of human acetyl-CoA carboxylase XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 723 F F F 5i87 2 B B BT-CD domains of human acetyl-CoA carboxylase XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 662 F F F 5i8c 3 C C Q2N0S7_9HIV1 HIV-1 Clade A BG505 Fusion Peptide (residue 512-520) AVGIGAVFL 9 T 2.2 OAD_gamma pdbhh T Viruses T 5i8m 2 E E DLS-LYS-CYS-LYS-LEU-CYS-LEU-LYS-NH2 XKCKLCLKX 9 T 0.71 Zn-C2H2_12 pdbhh F F 5i8x 2 E E DLS-LYS-CYS-LYS-LEU-CYS-LYS-LYS-NH2 XKCKLCLKX 9 T 0.71 Zn-C2H2_12 pdbhh F F 5i9b 2 B B ACE-ASP-GLU-VAL-ASK XDEVDX 6 T 200 ResIII pdbhh F F 5i9t 2 C,D D,E ACE-ASP-GLU-VAL-ASK XDEVDX 6 T 200 ResIII pdbhh F F 5iab 2 C,D D,E ACE-ASP-GLU-VAL-ASK XDEVDX 6 T 200 ResIII pdbhh F F 5iae 2 C,D D,E ACE-ASP-GLU-VAL-ASK XDEVDX 6 T 200 ResIII pdbhh F F 5iag 2 B B ACE-ASP-GLU-VAL-ASK XDEVDX 6 T 200 ResIII pdbhh F F 5iag 3 C D ASP-ASP-ASP-MET DDDM 4 T 140 SPACA7 pdbhh F F 5iaj 2 B B ACE-ASP-GLU-VAL-ASK XDEVDX 6 T 200 ResIII pdbhh F F 5iak 2 B B ACE-ASP-GLU-VAL-ASK XDEVDX 6 T 200 ResIII pdbhh F F 5ian 2 B E LEU-SER-SER LSS 3 T 530 CTP_transf_like pdbhh F F 5ian 3 C B ACE-ASP-GLU-VAL-ASK XDEVDX 6 T 200 ResIII pdbhh F F 5iar 2 B B ACE-ASP-GLU-VAL-ASK XDEVDX 6 T 200 ResIII pdbhh F F 5ias 2 B B ACE-ASP-GLU-VAL-ASK XDEVDX 6 T 200 ResIII pdbhh F F 5iay 2 B B UHRF1_HUMAN Spacer TGKGKWKRKSAGGGPS 16 T 6.1 Ribosomal_L35p pdbhh F Eukaryota T 5ib1 3 C C VIPR1_HUMAN VIP-R-1,PITUITARY ADENYLATE CYCLASE-ACTIVATING POLYPEPTIDE TYPE II RECEPTOR,PACAP-R2,VPAC1 RRKWRRWHL 9 F F Eukaryota F 5ib2 3 C C VIPR1_HUMAN VIP-R-1,PITUITARY ADENYLATE CYCLASE-ACTIVATING POLYPEPTIDE TYPE II RECEPTOR,PACAP-R2,VPAC1 RRKWRRWHL 9 F F Eukaryota F 5ib3 3 C C VIPR1_HUMAN VIP-R-1,PITUITARY ADENYLATE CYCLASE-ACTIVATING POLYPEPTIDE TYPE II RECEPTOR,PACAP-R2,VPAC1 RRKWRRWHL 9 F F Eukaryota F 5ib4 3 C C VIPR1_HUMAN VIP-R-1,PITUITARY ADENYLATE CYCLASE-ACTIVATING POLYPEPTIDE TYPE II RECEPTOR,PACAP-R2,VPAC1 RRKWRRWHL 9 F F Eukaryota F 5ib5 3 C,F C,F VIPR1_HUMAN VIP-R-1,PITUITARY ADENYLATE CYCLASE-ACTIVATING POLYPEPTIDE TYPE II RECEPTOR,PACAP-R2,VPAC1 RRKWRRWHL 9 F F Eukaryota F 5ibc 2 B B ACE-ASP-GLU-VAL-ASK XDEVDX 6 T 200 ResIII pdbhh F F 5ibo 1 A,B A,B LUCI_OPLGR 19KOLASE SDNMVFTLEDFVGDWRQTAGYNLDQVLEQGGVSSLFQNLGVSVTPIQRIVLSGENGLKIDIHVIIPYEGLSGDQMGQIEKIFKVVYPVDDHHFKVILHYGTLVIDGVTPNMIDYFGRPYEGIAVFDGKKITVTGTLWNGNKIIDERLINPDGSLLFRVTINGVTGWRLCERILA 174 T 0.0086 Lipocalin_7 pdbhh F Eukaryota T 5ibp 2 B B ACE-ASP-GLU-VAL-ASK XDEVDX 6 T 200 ResIII pdbhh F F 5ibr 2 B,D B,D ACE-ASP-GLU-VAL-ASK XDEVDX 6 T 200 ResIII pdbhh F F 5ic3 2 C,D C,D VE6_HPV18 HPV18E6 Peptide RLQRRRETQV 10 T 0.19 Mu-like_Com unphh T Viruses T 5ic4 3 I,J,K,L I,J,K,L DEVE peptide DEVX 4 T 420 DEAD pdbhh F F 5ic6 3 E,F E,F DEVE peptide DEVX 4 T 420 DEAD pdbhh F F 5icn 3 C C GLY-ALA-6A0-ARG-HIS LGKGGAXRH 9 T 18 PNPase_C pdbhh F T 5icv 2 C,D C,D MET-LYS-ALA-VAL-LIG MKAV 4 T 220 Lysine_decarbox pdbhh F F 5icx 3 E,F E,F Meditope XQFDLSTRRLRCGGSK 16 T 3 zf-CDGSH pdbhh F T 5icy 3 E,F E,F Meditope SQFDLSTRRLKS 12 T 12 Lambda_CIII pdbhh F T 5icz 3 E,F E,F Meditope GQFDLSTRRLKG 12 T 6.5 Pet127 pdbhh F T 5id0 3 E,F E,F Cyclic meditope QFDLSTRRLKX 11 T 7.9 AAA_lid_8 pdbhh F T 5id1 3 E,F E,F Meditope XQFDLSTRRLKC 12 T 6.5 DUF5947 pdbhh F T 5iec 1 A A C5I2_RHIMP RaCI2 GPMEEANTTPISVKDQCANVTCRRTVDNRGKRHIDGCPPGCLCVLKGPDSKDNLDGTCYLLATTPKSTTT 70 T 0.0052 CBM_19 pdbpercent F Eukaryota T 5ieh 3 C C INCE_HUMAN Inner centromere protein REFSKEPEL 9 T 37 DUF3966 pdbhh F Eukaryota T 5iek 3 C C INCE_HUMAN ARG-GLU-PHE-SER-LYS-GLU-PRO-GLU-LEU REFSKEPEL 9 T 37 DUF3966 pdbhh F Eukaryota T 5ifj 3 C,F,I,L C,F,I,L peptide PRO-LEU-GLN-PRO-GLU-GLN-PRO-PHE-PRO PLQPEQPFP 9 T 9 P120R pdbhh F F 5ig7 3 C,F,I,L C,F,I,L peptide PRO-LEU-GLN-PRO-GLN-GLN-PRO-PHE-PRO PLQPQQPFP 9 T 13 Malate_DH pdbhh F F 5igo 2 B,D,F,H U,V,W,X TRIB1_HUMAN TRB-1,G-PROTEIN-COUPLED RECEPTOR-INDUCED GENE 2 PROTEIN,GIG-2,SKIP1 SDQIVPEY 8 T 9 Cryptochrome_C pdbhh F Eukaryota T 5igq 2 B U TRIB1_HUMAN TRB-1,G-PROTEIN-COUPLED RECEPTOR-INDUCED GENE 2 PROTEIN,GIG-2,SKIP1 SDQIVPEYQED 11 T 27 DUF4851 pdbhh F Eukaryota T 5igq 3 D,F,H,J,L V,W,X,Y,Z TRIB1_HUMAN TRB-1,G-PROTEIN-COUPLED RECEPTOR-INDUCED GENE 2 PROTEIN,GIG-2,SKIP1 SDQIVPEY 8 T 9 Cryptochrome_C pdbhh F Eukaryota T 5ih2 2 C,D M,N ABL1_MOUSE Proline rich Peptide XYEKPALPRKRX 12 T 4 DUF5972 pdbhh F Eukaryota T 5ii6 1 A A ZP2_MOUSE ZONA PELLUCIDA GLYCOPROTEIN 2,ZP-2,ZONA PELLUCIDA PROTEIN A VSLPQSENPAFPGTLICDKDEVRIEFSSRFDMEKWNPSVVDTLGSEILSCTYALDLERFVLKFPYETCTIKVVGGYQVNIRVGDTTTDVRYKDDMYHFFCPAIQLEHHHHHH 112 T 4.8 DUF5374 pdbhh F Eukaryota T 5ijh 1 A,B A,B XPR1_HUMAN PROTEIN SYG1 HOMOLOG,XENOTROPIC AND POLYTROPIC MURINE LEUKEMIA VIRUS RECEPTOR X3,X-RECEPTOR, MKFAEHLSAHITPEWRKQYIQYEAFKDMLYSAQDQAPSVEVTDEDTVKRYFAKFEEKFFQTCEKELAKINTFYSEKLAEAQRRFATLQNELQSSLDAQKESTGVTTLRQRRKPVFHLSHEERVQHRNIKDLKLAFSEFYLSLILLQNYQNLNFTGFRKILKKHDKILETSRGADWRVAHVEVAPFYTCKKINQLISETEAVVTNELEHHHHHH 213 T 7.2E-16 SPX pdb F Eukaryota T 5ijk 1 A,B X,Y peptide PRO-LEU-GLN-PRO-GLU-GLN-PRO-PHE-PRO PLQPEQPFP 9 T 9 P120R pdbhh F F 5ikf 2 B B CLR1_SCHPO Cryptic loci regulator protein 1 MASMTGGQQMGPFLTPDNIASSILYSTASFSRSKPDRPRLNLSLELKLMQNELNKGQLKKQFKGDLRNLADWNNLSLVSSKFPSLPITNLRPDGSFLKHRRFNEEIAYNRQTLEKAIKQLDLSPDKVIQLREQNGVAVNGRVCYPTRNKHSEISA 155 T 4.1 Lyase_catalyt pdbhh F Eukaryota T 5ikj 2 B B CLR1_SCHPO Cryptic loci regulator protein 1 SSLLSRLTQSNQSKDKIIAALAKRNVYKSFAGLYDSKGKNDNTGYDFDSNYARVGRHGSFILPVSKSVPTPSLLIEGSIVQRKNIKIE 88 T 14 ESP pdbhh F Eukaryota T 5inz 1 A,B,C A,B,C Theta defensin-2, L-peptide GVCRCVCRRGVCRCVCRR 18 T 0.98 CXCXC pdbhh F F 5inz 2 D D Theta defensin-2, D-peptide GXXXXXXXXGXXXXXXXX 18 F F F 5io3 1 A A Q5ZUV9_LEGPH Uncharacterized protein RavZ MKGKLTGKDKLIVDEFEELGEQESDIDEFDLLEGDEKLPGDSELDKTTSIYPPETSWEVNKGMNSSRLHKLYSLFFDKSSAFYLGDDVSVLEDKPLTGAYGFQSKKNDQQIFLFRPDSDYVAGYHVDAKSDAGWVNDKLDRRLSEISEFCSKATQPATFILPFVEMPTDITKGVQHQVLLTISYDPKSKQLTPTVYDSIGRDTYSESLSSYFKGKYRTTCDEILTQSIEKAIKSTDFTLGKFTRAAYNHQNRLTEGNSGSYTFRTIKEVISSSAQGTEVKIPGSGYITSNSYLTSQHVQDIESCIKYRNLGVVDIESALTEGKTLPVQLSEFIVALEDYGKLRSQQSEKSMLNFIGYSKTAKLTAVELLIGILNDIKGKNEISESQYDKLVKEVDCLMDSSLGKLVQFHLKNLGAESLQKLVLPCVKFDDTIDDFVTIEKDELFDVPDITGEELASKKGIEQGALDKEALLKQKQIKTDLLDLREEDKTGLKKPLHGGIKVK 502 T 18 DUF438 pdbhh F Bacteria T 5ioo 1 A,B A,B A0A1L1QK08_9ARCH AvpA MEINRKQAKEFYNSDMATALESCQKYGHALFMPELIDAKILATKGSSLLSNWLTAPSIRATGRTKQGNPVVVYVHVDNYLSNPENIRNAERINGAGVMPVDEFQRLLDLGDNKNVFVIDYDKLKSSSSGVIPVERALEHPQTIPFIGGEERAQRYLEKFKQVYGNNIGIWHCDDLKDEPLGRLLFVGDYCNNGLIGNYGIGNYARFVGVRGSASAEGTAQKISAPTIEQILKVSKNFVPKATRKEYENKIKALYK 255 T 0.093 Transglut_C pdb F Archaea T 5iop 3 E,F E,F Meditope variant GQXDLSTRRLKG 12 T 6.1 Rit1_C pdbhh F T 5ip7 13 M Q T2FA_YEAST PHE-ILE-LYS-ARG-ASP-ARG-MET-ARG-ARG-ASN-PHE-LEU-ARG-MET-ARG FIKRDRMRRNFLRMR 15 T 2.2 DUF5928 pdbhh F Eukaryota T 5ip9 13 M Q T2FA_YEAST PHE-ILE-LYS-ARG-ASP-ARG-MET-ARG-ARG-ASN-PHE-LEU-ARG-MET-ARG FIKRDRMRRNFLRMR 15 T 2.2 DUF5928 pdbhh F Eukaryota T 5ipy 1 A,B A,B A3SLM3_ROSNI Flavin-containing monooxygenase MTKRVAVIGAGPSGLAQLRAFQSAADQGAEIPEIVCFEKQANWGGLWNYTWRTGLDENGEPVHCSMYRYLWSNGPKEGLEFADYSFEEHFGKQIASYPPRAVLFDYIEGRVHKADVRKWIRFNSPVRWVSYDAETAKFTVTAHNHETDSTYSAAFDHVICASGHFSTPNVPFYEGFDTFNGRIVHAHDFRDAREFEGKDVLVMGASYSAEDIGSQCWKYGAKSITSCYRSAPMGYAWPDNWEEKPALEKLTGKTAHFADGSTRDVDAIILCTGYKHFFSFLPDDLRLKTANRLATADLYKGVAYVHNPAMFYLGMQDQWFTFNMFDAQAWWVRDAILGRITLPKDKAAMLADVAERETREEASDDVKYAIRYQADYVKELVAETDYPSFDIDGACDAFFEWKKHKAKDIMAFRDNSYKSVITGTMAPVHHTPWKEALDDSMEAYLQNHHHHHH 453 T 4.1E-11 FMO-like unppercent F Bacteria T 5iq4 1 A,B A,B A3SLM3_ROSNI Flavin-containing monooxygenase MTKRVAVIGAGPSGLAQLRAFQSAADQGAEIPEIVCFEKQANWGGLWNYTWRTGLDENGEPVHCSMYRYLWSNGPKEGLEFADYSFEEHFGKQIASYPPRAVLFDYIEGRVHKADVRKWIRFNSPVRWVSYDAETAKFTVTAHNHETDSTYSEDFDHVICASGHFSTPNVPFYEGFDTFNGRIVHAHDFRDAREFEGKDVLVMGASSSAEDIGSQCWKYGAKSITSCYRSAPMGYAWPDNWEEKPALEKLTGKTAHFADGSTRDVDAIILCTGYKHFFSFLPDDLRLKTANRLATADLYKGVAYVHNPAMFYLGMQDQWFTFNMFDAQAWWVRDAILGRITLPKDKAAMLADVAERETREEASDDVKYAIRYQADYVKELVAETDYPSFDIDGACDAFFEWKKHKAKDIMAFRDNSYKSVITGTMAPVHHTPWKEALDDSMEAYLQNHHHHHH 453 T 4.1E-11 FMO-like unppercent F Bacteria T 5ir0 1 A,B A,B M1Q7T5_VIBCL Uncharacterized protein ORF19 GMYTNTIIKTEIDEKVIKAFKLDALTRSKLFFKLTTKLAVPFAGVIDGAFSADRSLVSASVASLLSQHLDQETFEETQLILFGSIVEDGEALATPEAINKWFEYNDVNPMDLFVWLVDENLVTLFKGSKQLQSLKPKFDEFYKKFEDFIPQTVISDDKAEE 161 T 0.00039 Phage_TAC_9 unphh F Bacteria T 5ir1 3 E,F E,F Meditope variant GQXDLSTRRLKG 12 T 16 CbbQ_C pdbhh F T 5ir6 3 C C A0A0Q0UXS2_9BACI Putative membrane protein MQTFLIMYAPMVVVALSVVAAFWVGLKDVHVNE 33 T 0.16 FixS pdbpssm F Bacteria T 5iri 1 A,B A,B BRSK1_MOUSE SERINE/THREONINE-PROTEIN KINASE SAD-B MKRSWFGNFISLDKEEQIFLVLKDKPLSSIKADIVHAFLSIPSLSHSVLSQTSFRAEYKASGGPSVFQKPVRFQVDISSSEGPEPSPRRDGSSGGGIYSVTFTLISGPSRRFKRVVETIQAQLLSTHDQLEHHHHHH 137 T 0.016 KA1 pdbpssm F Eukaryota T 5irx 2 E,F E,F DKTX_HAPSC TAU-TRTX-HS1A, DOUBLE-KNOT TOXIN, DKTX DCAKEGEVCSWGKKCCDLDNFYCPMEFIPHCKKYKPYVPVTTNCAKEGEVCGWGSKCCHGLDCPLAFIPYCEKYR 75 T 0.079 Conotoxin_I2 unp F Eukaryota T 5it7 47 UA KK uL11 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 147 F F F 5itf 3 E,F E,F Meditope variant GQXDLSTRRLKG 12 T 16 CbbQ_C pdbhh F T 5itz 3 C D CENPJ_HUMAN CENP-J,CENTROSOMAL P4.1-ASSOCIATED PROTEIN,LAG-3-ASSOCIATED PROTEIN,LYST-INTERACTING PROTEIN 1 MAHHHHHHGSLVPRGSAQKHDDSSEVANIEERPIKAAIGERKQTFEDYLEEQIQLEEQELKQKQLKEAEGPLPIKAKPKQPFLKRGEGLARFTNAKSKFQKGKESKLVTNQSTSEDQPLFKMDRQQLQR 129 T 0.64 DUF1654 unppssm F Eukaryota T 5iue 3 I,J,K,L K,L,M,N Peptide LEU-ILE-LEU-ARG-TRP-GLU-GLN-ASP LILRWEQD 8 T 0.94 DUF1246 pdbhh F T 5iv2 3 E,F E,F Meditope variant GQFDLSTRXLKG 12 T 6.5 Pet127 pdbhh F T 5iv5 2 C,ND,QC,TB,WA,Z C,GF,EC,BJ,w,Z BP07_BPT4 GENE PRODUCT 7,GP7 MTVKAPSVTSLRISKLSANQVQVRWDDVGANFYYFVEIAETKTNSGENLPSNQYRWINLGYTANNSFFFDDADPLTTYIIRVATAAQDFEQSDWIYTEEFETFATNAYTFQNMIEMQLANKFIQEKFTLNNSDYVNFNNDTIMAALMNESFQFSPSYVDVSSISNFIIGENEYHEIQGSIQQVCKDINRVYLMESEGILYLFERYQPVVKVSNDKGQTWKAVKLFNDRVGYPLSKTVYYQSANTTYVLGYDKIFYGRKSTDVRWSADDVRFSSQDITFAKLGDQLHLGFDVEIFATYATLPANVYRIAEAITCTDDYIYVVARDKVRYIKTSNALIDFDPLSPTYSERLFEPDTMTITGNPKAVCYKMDSICDKVFALIIGEVETLNANPRTSKIIDSADKGIYVLNHDEKTWKRVFGNTEEERRRIQPGYANMSTDGKLVSLSSSNFKFLSDNVVNDPETAAKYQLIGAVKYEFPREWLADKHYHMMAFIADETSDWETFTPQPMKYYAEPFFNWSKKSNTRCWINNSDRAVVVYADLKYTKVIENIPETSPDRLVHEYWDDGDCTIVMPNVKFTGFKKYASGMLFYKASGEIISYYDFNYRVRDTVEIIWKPTEVFLKAFLQNQEHETPWSPEEERGLADPDLRPLIGTMMPDSYLLQDSNFEAFCEAYIQYLSDGYGTQYNNLRNLIRNQYPREEHAWEYLWSEIYKRNIYLNADKRDAVARFFESRSYDFYSTKGIEASYKFLFKVLYNEEVEIEIESGAGTEYDIIVQSDSLTEDLVGQTIYTATGRCNVTYIERSYSNGKLQWTVTIHNLLGRLIAGQEVKAERLPSFEGEIIRGVKGKDLLQNNIDYINRSRSYYVMKIKSNLPSSRWKSDVIRFVHPVGFGFIAITLLTMFINVGLTLKHTETIINKYKNYKWDSGLPTEYADRIAKLTPTGEIEHDSVTGEAIYEPGPMAGVKYPLPDDYNAENNNSIFQGQLPSERRKLMSPLFDASGTTFAQFRDLVNKRLKDNIGNPRDPENPTQVKIDE 1032 T 0.033 fn3 pdb T Viruses T 5iv7 2 C,EC,IA,OB,S,YA C,EC,i,CA,S,y BP07_BPT4 GENE PRODUCT 7,GP7 MTVKAPSVTSLRISKLSANQVQVRWDDVGANFYYFVEIAETKTNSGENLPSNQYRWINLGYTANNSFFFDDADPLTTYIIRVATAAQDFEQSDWIYTEEFETFATNAYTFQNMIEMQLANKFIQEKFTLNNSDYVNFNNDTIMAALMNESFQFSPSYVDVSSISNFIIGENEYHEIQGSIQQVCKDINRVYLMESEGILYLFERYQPVVKVSNDKGQTWKAVKLFNDRVGYPLSKTVYYQSANTTYVLGYDKIFYGRKSTDVRWSADDVRFSSQDITFAKLGDQLHLGFDVEIFATYATLPANVYRIAEAITCTDDYIYVVARDKVRYIKTSNALIDFDPLSPTYSERLFEPDTMTITGNPKAVCYKMDSICDKVFALIIGEVETLNANPRTSKIIDSADKGIYVLNHDEKTWKRVFGNTEEERRRIQPGYANMSTDGKLVSLSSSNFKFLSDNVVNDPETAAKYQLIGAVKYEFPREWLADKHYHMMAFIADETSDWETFTPQPMKYYAEPFFNWSKKSNTRCWINNSDRAVVVYADLKYTKVIENIPETSPDRLVHEYWDDGDCTIVMPNVKFTGFKKYASGMLFYKASGEIISYYDFNYRVRDTVEIIWKPTEVFLKAFLQNQEHETPWSPEEERGLADPDLRPLIGTMMPDSYLLQDSNFEAFCEAYIQYLSDGYGTQYNNLRNLIRNQYPREEHAWEYLWSEIYKRNIYLNADKRDAVARFFESRSYDFYSTKGIEASYKFLFKVLYNEEVEIEIESGAGTEYDIIVQSDSLTEDLVGQTIYTATGRCNVTYIERSYSNGKLQWTVTIHNLLGRLIAGQEVKAERLPSFEGEIIRGVKGKDLLQNNIDYINRSRSYYVMKIKSNLPSSRWKSDVIRFVHPVGFGFIAITLLTMFINVGLTLKHTETIINKYKNYKWDSGLPTEYADRIAKLTPTGEIEHDSVTGEAIYEPGPMAGVKYPLPDDYNAENNNSIFQGQLPSERRKLMSPLFDASGTTFAQFRDLVNKRLKDNIGNPRDPENPTQVKIDE 1032 T 0.033 fn3 pdb T Viruses T 5ivn 2 B B G9GAG7_HUMAN Cadherin derived peptide XDRKAAVSHWQX 12 T 0.15 DUF2288 unp F Eukaryota T 5ivz 3 E,F E,F Meditope variant GQFDLSTXRLKG 12 T 6.5 Pet127 pdbhh F T 5iwb 1 A A MORF9_ARATH RNA EDITING-INTERACTING PROTEIN 9 MEQRETIMLPGSDYNHWLIVMEFPKDPAPSRDQMIDTYLNTLATVLGSMEEAKKNMYAFSTTTYTGFQCTIDEETSEKFKGLPGVLWVLPDSYIDVKNKDYGGDKYINGEIIPSTYPTYQPKQLEHHHHHHHH 133 T 0.27 Inhibitor_I9 pdbhh F Eukaryota T 5iww 1 A,B,C A,B,C MORF9_ARATH RNA EDITING-INTERACTING PROTEIN 9 MEQRETIMLPGSDYNHWLIVMEFPKDPAPSRDQMIDTYLNTLATVLGSMEEAKKNMYAFSTTTYTGFQCTIDEETSEKFKGLPGVLWVLPDSYIDVKNKDYGGDKYINGEIIPSTYPTYQPKQLEHHHHHHHH 133 T 0.27 Inhibitor_I9 pdbhh F Eukaryota T 5ix9 1 A A A1YIY2_9GAMM Antifreeze protein MSDNQFPFATLGNAIGFITKLDGSVTVQSINGQERVLKLGDPIFFGETVLTGGSGSVTIAFVDGTDVVIGGDSIVEMTDEIYNTGDNEDLVADSSSEIDALQNAILAGDDPTLIQDAPAAGNTLADQQRVDVSIERNDNSAQAGFGVDTQSSLPTYGYDTDNGNGGQATEREYSAPSLSRTLNQSPLLEHHHHHH 195 T 0.0019 FecR pdbpercent F Bacteria T 5ixf 2 B B STABP_HUMAN STAM-binding protein AKPPVVDRSLKPGA 14 T 9.4 DUF1681 pdbhh F Eukaryota T 5ixq 2 B B IDA_ARATH PROTEIN INFLORESCENCE DEFICIENT IN ABSCISSION PIPPSAPSKRHN 12 T 0.67 Disulph_isomer pdbhh F Eukaryota T 5ixt 2 B B IDA_ARATH PROTEIN INFLORESCENCE DEFICIENT IN ABSCISSION YPKGVPIPPSAPSKRHN 17 T 2.7 Disulph_isomer pdbhh F Eukaryota T 5iy4 2 B,D,F B,D,F SPRTN_HUMAN DVC1 PIP box SNSHQNVLSNYFPRVS 16 T 11 FAD_SOX pdbhh F Eukaryota T 5iyv 2 B B IDL1_ARATH Protein IDA-LIKE 1 LVPPSGPSMRHN 12 T 0.021 Sperm_Ag_HE2 unp F Eukaryota T 5iyx 2 B B IDA_ARATH PROTEIN INFLORESCENCE DEFICIENT IN ABSCISSION YVPIPPSAPSKRHN 14 T 0.34 Disulph_isomer pdbhh F Eukaryota T 5iz0 2 B,D,F,H C,E,F,H GLU-PHE-PRO-TYR-LEU-LEU-SER-LEU-LEU-GLY-GLU-VAL-SER-PRO-GLN EFPYLLSLLGEVSPQ 15 T 1.6 DUF4576 pdbhh F T 5iz2 2 C Z B5SYS5_NEPCL Major ampullate spidroin 1A (Partial C-terminus) SYG 3 T 85 Thx pdbhh F Eukaryota F 5iz6 2 B B PHQ-ALA-GLY-GLU-ALA-LEU-TYR-GLU-NH2 XAGEALYEX 9 T 30 NYAP_N pdbhh F T 5iz8 2 C,D C,D ACE-ALA-GLY-GLU-ALA-LEU-ALA-ASP-NH2 XAGEALADX 9 T 34 SWI-SNF_Ssr4 pdbhh F T 5iz9 2 B B ACE-GLY-GLY-GLU-ALA-LEU-ALA-ASP-NH2 XGGEALADX 9 T 72 DUF898 pdbhh F T 5iza 2 B B ACE-GLY-GLY-GLU-ALA-LEU-ALA-TRP-NH2 XGGEALAWX 9 T 3.2 Nmad4 pdbhh F T 5ize 1 A,B A,B L_HANTV PROTEIN L,LARGE STRUCTURAL PROTEIN,REPLICASE,TRANSCRIPTASE GMDKYREIHNKLKEFSPGTLTAVECIDYLDRLYAVRHDIVDQMIKHDWSDNKDSEEAIGKVLLFAGVPSNIITALEKKIIPNHPTGKSLKAFFKMTPDNYKISGTTIEFVEVTVTADVDKGIREKKLKYEAGLTYIEQELHKFFLKGEIPQPYKITFNVVAVRTDGSNITTQWPSRRNDG 180 T 0.075 L_protein_N pdbpssm T Viruses T 5izf 2 B E 6J9-ZEU-DAR-ACA-DAR-NH2 XXXXXX 6 F F F 5izj 2 C G 47P-AZ1-DAR-DAR XXXXX 5 F F F 5izj 3 D F 47P-AZ1-DAR-DAR-DAR XXXXXX 6 F F F 5izv 1 A,B A,B Q5ZUV9_LEGPH Uncharacterized protein RavZ MKGKLTGKDKLIVDEFEELGEQESDIDEFDLLEGDEKLPGDSELDKTTSIYPPETSWEVNKGMNSSRLHKLYSLFFDKSSAFYLGDDVSVLEDKPLTGAYGFQSKKNDQQIFLFRPDSDYVAGYHVDAKSDAGWVNDKLDRRLSEISEFCSKATQPATFILPFVEMPTDITKGVQHQVLLTISYDPKSKQLTPTVYDSIGRDTYSESLSSYFKGKYRTTCDEILTQSIEKAIKSTDFTLGKFTRAAYNHQNRLTEGNSGSYTFRTIKEVISSSAQGTEVKIPGSGYITSNSYLTSQHVQDIESCIKYRNLGVVDIESALTEGKTLPVQLSEFIVALEDYGKLRSQQSEKSMLNFIGYSKTAKLTAVELLIGILNDIKGKNEISESQYDKLVKEVDCLMDSSLGKLVQFHLKNLGAESLQKLVLPCVKFDDTIDDFVTIEKDELFDVPDITGEELASKKGIEQGALDKEALLKQKQIKTDLLDLREEDKTGLKKPLHGGIKVK 502 T 18 DUF438 pdbhh F Bacteria T 5j0m 1 A A Cyclic peptide mimetic of HIV-1 Tat RVRTRKGRRIRIXP 14 T 0.24 DUF2835 pdbhh F T 5j19 2 C,D C,D O96561_DROME PON ESCFTNAAFSSTPKK 15 T 0.26 RskA unppercent F Eukaryota T 5j1o 1 A A Cyclic peptide mimetic of Tat RVRTRKGRRIRIXP 14 T 0.24 DUF2835 pdbhh F T 5j2w 1 A A Cyclic peptide mimetic of HIV-1 Tat RVRTRKGRRIRIXP 14 T 0.24 DUF2835 pdbhh F T 5j2y 1 A,B A,B Q9X7H4_PSEAI REGULATORY PROTEIN RSAL,RSAL PROTEIN,UNCHARACTERIZED PROTEIN,VIRULENCE GENE REPRESSOR RSAL MASHERTQPQNMAFRAKATRTARRESQETFWSRFGISQSCGSRFENGENLPFPIYLLLHFYIEGQITDRQLADLRGKIRE 80 T 0.00017 DUF4447 pdbhh F Bacteria T 5j3h 1 A B Peptide S519C16 GSLDESFYDWFERQLG 16 T 0.086 YozE_SAM_like pdbhh F T 5j4a 2 B,D B,D CDII9_BURPE Immunity protein CdiI KMAGSIVISKEVRVPVSTSQFDYLVSRIGDQFHSSDMWIKDEVYLPMEEGGMSFISTESLNSSGLSIFLATVMRARAASQAEESFPLYENVWNQLVEKLRQDARLGVSGNTSLEHHHHHH 120 T 0.15 Adeno_E4_ORF3 pdbpssm F Bacteria T 5j4t 2 B,D,F,H B,D,F,H LECB3_ARTIN JACALIN BETA-3 CHAIN EQSGISQTVIVGPWGAKVS 19 T 2.9 DUF3842 pdbhh F Eukaryota T 5j4x 2 B,D,F,H B,D,F,H LECB3_ARTIN JACALIN BETA-3 CHAIN EQSGISQTVIVGPWGAKVS 19 T 2.9 DUF3842 pdbhh F Eukaryota T 5j4z 1 A A COMPLEX I ND3 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 102 F F F 5j4z 2 B B COMPLEX I PSST/NDUFS7 XXXXXXXXXXXXXXXXXXXXXXXXXXXXCCXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXCXXXXXXXXXXXXXXXXXXXXXXXXXXXXXCXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 154 T 4200 Cas1_AcylT pdbhh F F 5j4z 3 C C COMPLEX I 30KDA/NDUFS3 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 194 F F F 5j4z 4 D D COMPLEX I 49KDA/NDUFS2 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 384 F F F 5j4z 5 E E COMPLEX I 24KDA/NDUFV2 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXCXXXXCXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXCXXXCXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 189 T 1400 Radical_SAM_2 pdbhh F F 5j4z 6 F F COMPLEX I 51KDA/NDUFV1 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXCXXCXXCXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXCXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 429 T 81 Fer4_2 pdbhh F F 5j4z 7 G G COMPLEX I 75KDA/NDUFS1 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXCXXXXXXXXXXCXXCXXXXXXXXXXXXXCXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXHXXXCXXCXXXXXCXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXCXXCXXCXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXCXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 652 T 160 Fer4_2 pdbhh F F 5j4z 8 H H COMPLEX I ND1 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 297 F F F 5j4z 9 I I COMPLEX I TYKY/NDUFS8 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXCXXCXXCXXXCXXXXXXXXXXXXXXXXXXXXXXXXXXXXCXXCXXCXXXCXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 171 T 0.027 Fer4_8 pdbhh F F 5j4z 10 J J COMPLEX I ND6 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 171 F F F 5j4z 11 K K COMPLEX I ND4L XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 93 F F F 5j4z 12 L L COMPLEX I ND5 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 575 F F F 5j4z 13 M M COMPLEX I ND4 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 455 F F F 5j4z 14 N N COMPLEX I ND2 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 345 F F F 5j4z 15 O O COMPLEX I 18KDA/NDUFS6 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 104 F F F 5j4z 16 P P COMPLEX I 13KDA/NDUFS6 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 85 F F F 5j4z 17 Q Q COMPLEX I 15KDA/NDUFS5 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 66 F F F 5j4z 18 R R COMPLEX I MWFE/NDUFA1 XXXXXXXXXXXXXXXXXXXXXXXXXXXXX 29 F F F 5j4z 19 DA,S d,S COMPLEX I B8/NDUFA2 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 80 F F F 5j4z 20 T T COMPLEX I B9/NDUFA3 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 53 F F F 5j4z 21 U U COMPLEX I B13/NDUFA5 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 96 F F F 5j4z 22 V V COMPLEX I B14/NDUFA6 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 112 F F F 5j4z 23 W W COMPLEX I PGIV/NDUFA8 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 103 F F F 5j4z 24 X X COMPLEX I 39KDA/NDUFA9 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 309 F F F 5j4z 25 Y Y COMPLEX I 42KDA/NDUFA10 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 322 F F F 5j4z 26 Z Z COMPLEX I B14.7/NDUFA11 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 119 F F F 5j4z 27 AA a COMPLEX I B17.2/NDUFA12 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 111 F F F 5j4z 28 BA b COMPLEX I B16.6/NDUFA13 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 92 F F F 5j4z 29 CA c COMPLEX I SDAP/NDUFAB1 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 79 F F F 5j4z 30 EA e COMPLEX I SDAP/NDUFAB1 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 55 F F F 5j4z 31 FA f COMPLEX I SDAP/NDUFAB1 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 59 F F F 5j4z 32 GA g COMPLEX I B15/NDUFB4 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 130 F F F 5j4z 33 HA,UA,VA h,9,z COMPLEX I B18/NDUFB7 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 63 F F F 5j4z 34 IA i COMPLEX I B22/NDUFB9 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 70 F F F 5j4z 35 JA j COMPLEX I PDSW/NDUFB10 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 44 F F F 5j4z 36 KA k COMPLEX I ESSS/NDUFB11 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 83 F F F 5j4z 37 LA 0 COMPLEX I KFYI/NDUFC1 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 36 F F F 5j4z 38 MA 1 COMPLEX I B14.5B/NDUFC2 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 30 F F F 5j4z 39 NA 2 COMPLEX I UNKNOWN SUBUNIT FRAGMENT 1 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 38 F F F 5j4z 40 OA,PA 3,4 COMPLEX I UNKNOWN SUBUNIT FRAGMENT 2 XXXXXXXXXXXXXXXXXXXXXXXXXXXX 28 F F F 5j4z 41 QA 5 COMPLEX I UNKNOWN SUBUNIT FRAGMENT 3 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 34 F F F 5j4z 42 RA 6 COMPLEX I UNKNOWN SUBUNIT FRAGMENT 4 XXXXXXXXXXXXXXXXXXXXX 21 F F F 5j4z 43 SA 7 COMPLEX I UNKNOWN SUBUNIT FRAGMENT 5 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 39 F F F 5j4z 44 TA 8 COMPLEX I UNKNOWN SUBUNIT FRAGMENT 6 XXXXXXXXXXXXXXXXXXXXXXXXXXX 27 F F F 5j4z 45 WA y COMPLEX I UNKNOWN SUBUNIT FRAGMENT 11 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 46 F F F 5j4z 46 XA x COMPLEX I UNKNOWN SUBUNIT FRAGMENT 12 XXXXXXXXXXXXX 13 F F F 5j4z 47 YA w COMPLEX I UNKNOWN SUBUNIT FRAGMENT 13 XXXXXXXXXXXXXXXXXXXXXXXX 24 F F F 5j4z 48 ZA v COMPLEX I UNKNOWN SUBUNIT FRAGMENT 14 XXXXXXXXXXXXXXXXXX 18 F F F 5j4z 49 AB u COMPLEX I UNKNOWN SUBUNIT FRAGMENT 15 XXXXXXXXXXXXXXXX 16 F F F 5j4z 50 BB t COMPLEX I UNKNOWN SUBUNIT FRAGMENT 16 XXXXXXXXXXXX 12 F F F 5j50 2 B,D,F,H B,D,F,H LECB3_ARTIN JACALIN BETA-3 CHAIN EQSGISQTVIVGPWGAKVS 19 T 2.9 DUF3842 pdbhh F Eukaryota T 5j51 2 B,D,F,H B,D,F,H LECB3_ARTIN JACALIN BETA-3 CHAIN EQSGISQTVIVGPWGAKVS 19 T 2.9 DUF3842 pdbhh F Eukaryota T 5j5x 2 B B 47P-AZ1-DAL-DAR-DAR-DAR-DAR XXXXXXXX 8 F F F 5j6t 1 A A ALBO1_HYPAB HY-A1 XIFGAIWPLALGALKNLIKX 20 T 4 SH3_7 unphh F Eukaryota T 5j6v 1 A A ALBO1_HYPAB Hylin-D DIFGAIWPLALGALKNLIKX 20 T 2.7 DUF3275 pdbhh F Eukaryota T 5j6w 1 A A ALBO1_HYPAB Hylin-K KIFGAIWPLALGALKNLIKX 20 T 4 SH3_7 unphh F Eukaryota T 5j7j 2 B B DLG4_HUMAN POSTSYNAPTIC DENSITY PROTEIN 95,PSD-95,SYNAPSE-ASSOCIATED PROTEIN 90,SAP90 MDCLCIVTTKKYRYQDEDT 19 T 0.0058 MAGUK_N_PEST unp F Eukaryota T 5j7o 2 G,H,I,J,K,L G,H,I,J,K,L unknown XXXXXXXXXXXXXXXXXXXXX 21 F F F 5j7u 2 G,H,I,J,K,L,S,T,U,V,W,X M,N,O,P,Q,R,S,T,U,V,W,X unknown XXXXXXXXXXXXXXXXXXXXX 21 F F F 5j7y 1 A A COMPLEX I ND3 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 102 F F F 5j7y 2 B B COMPLEX I PSST/NDUFS7 XXXXXXXXXXXXXXXXXXXXXXXXXXXXCCXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXCXXXXXXXXXXXXXXXXXXXXXXXXXXXXXCXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 154 T 4200 Cas1_AcylT pdbhh F F 5j7y 3 C C COMPLEX I 30KDA/NDUFS3 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 194 F F F 5j7y 4 D D COMPLEX I 49KDA/NDUFS2 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 384 F F F 5j7y 5 E E COMPLEX I 24KDA/NDUFV2 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXCXXXXCXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXCXXXCXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 189 T 1400 Radical_SAM_2 pdbhh F F 5j7y 6 F F COMPLEX I 51KDA/NDUFV1 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXCXXCXXCXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXCXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 429 T 81 Fer4_2 pdbhh F F 5j7y 7 G G COMPLEX I 75KDA/NDUFS1 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXCXXXXXXXXXXCXXCXXXXXXXXXXXXXCXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXHXXXCXXCXXXXXCXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXCXXCXXCXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXCXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 652 T 160 Fer4_2 pdbhh F F 5j7y 8 H H COMPLEX I ND1 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 297 F F F 5j7y 9 I I COMPLEX I TYKY/NDUFS8 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXCXXCXXCXXXCXXXXXXXXXXXXXXXXXXXXXXXXXXXXCXXCXXCXXXCXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 171 T 0.027 Fer4_8 pdbhh F F 5j7y 10 J J COMPLEX I ND6 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 171 F F F 5j7y 11 K K COMPLEX I ND4L XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 93 F F F 5j7y 12 L L COMPLEX I ND5 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 575 F F F 5j7y 13 M M COMPLEX I ND4 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 455 F F F 5j7y 14 N N COMPLEX I ND2 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 345 F F F 5j7y 15 O O COMPLEX I 18KDA/NDUFS6 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 104 F F F 5j7y 16 P P COMPLEX I 13KDA/NDUFS6 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 85 F F F 5j7y 17 Q Q COMPLEX I 15KDA/NDUFS5 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 66 F F F 5j7y 18 R R COMPLEX I MWFE/NDUFA1 XXXXXXXXXXXXXXXXXXXXXXXXXXXXX 29 F F F 5j7y 19 DA,S d,S COMPLEX I B8/NDUFA2 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 80 F F F 5j7y 20 T T COMPLEX I B9/NDUFA3 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 53 F F F 5j7y 21 U U COMPLEX I B13/NDUFA5 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 96 F F F 5j7y 22 V V COMPLEX I B14/NDUFA6 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 112 F F F 5j7y 23 W W COMPLEX I PGIV/NDUFA8 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 103 F F F 5j7y 24 X X COMPLEX I 39KDA/NDUFA9 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 309 F F F 5j7y 25 Y Y COMPLEX I 42KDA/NDUFA10 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 322 F F F 5j7y 26 Z Z COMPLEX I B14.7/NDUFA11 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 119 F F F 5j7y 27 AA a COMPLEX I B17.2/NDUFA12 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 111 F F F 5j7y 28 BA b COMPLEX I B16.6/NDUFA13 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 92 F F F 5j7y 29 CA c COMPLEX I SDAP/NDUFAB1 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 79 F F F 5j7y 30 EA e COMPLEX I SDAP/NDUFAB1 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 55 F F F 5j7y 31 FA f COMPLEX I SDAP/NDUFAB1 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 59 F F F 5j7y 32 GA g COMPLEX I B15/NDUFB4 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 130 F F F 5j7y 33 HA,UA,VA h,9,z COMPLEX I B18/NDUFB7 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 63 F F F 5j7y 34 IA i COMPLEX I B22/NDUFB9 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 70 F F F 5j7y 35 JA j COMPLEX I PDSW/NDUFB10 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 44 F F F 5j7y 36 KA k COMPLEX I ESSS/NDUFB11 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 83 F F F 5j7y 37 LA 0 COMPLEX I KFYI/NDUFC1 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 36 F F F 5j7y 38 MA 1 COMPLEX I B14.5B/NDUFC2 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 30 F F F 5j7y 39 NA 2 COMPLEX I UNKNOWN SUBUNIT FRAGMENT 1 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 38 F F F 5j7y 40 OA,PA 3,4 COMPLEX I UNKNOWN SUBUNIT FRAGMENT 2 XXXXXXXXXXXXXXXXXXXXXXXXXXXX 28 F F F 5j7y 41 QA 5 COMPLEX I UNKNOWN SUBUNIT FRAGMENT 3 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 34 F F F 5j7y 42 RA 6 COMPLEX I UNKNOWN SUBUNIT FRAGMENT 4 XXXXXXXXXXXXXXXXXXXXX 21 F F F 5j7y 43 SA 7 COMPLEX I UNKNOWN SUBUNIT FRAGMENT 5 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 39 F F F 5j7y 44 TA 8 COMPLEX I UNKNOWN SUBUNIT FRAGMENT 6 XXXXXXXXXXXXXXXXXXXXXXXXXXX 27 F F F 5j7y 45 WA y COMPLEX I UNKNOWN SUBUNIT FRAGMENT 11 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 46 F F F 5j7y 46 XA x COMPLEX I UNKNOWN SUBUNIT FRAGMENT 12 XXXXXXXXXXXXX 13 F F F 5j7y 47 YA w COMPLEX I UNKNOWN SUBUNIT FRAGMENT 13 XXXXXXXXXXXXXXXXXXXXXXXX 24 F F F 5j7y 48 ZA v COMPLEX I UNKNOWN SUBUNIT FRAGMENT 14 XXXXXXXXXXXXXXXXXX 18 F F F 5j7y 49 AB u COMPLEX I UNKNOWN SUBUNIT FRAGMENT 15 XXXXXXXXXXXXXXXX 16 F F F 5j7y 50 BB t COMPLEX I UNKNOWN SUBUNIT FRAGMENT 16 XXXXXXXXXXXX 12 F F F 5j8h 2 B B EF2K_HUMAN EEF-2K,CALCIUM/CALMODULIN-DEPENDENT EUKARYOTIC ELONGATION FACTOR 2 KINASE SPANSFHFKEAWKHAIQKAKHMPDPWA 27 T 5.7 MPLKIP pdbhh F Eukaryota T 5j8k 1 A A COMPLEX I ND3 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 102 F F F 5j8k 2 B B COMPLEX I PSST/NDUFS7 XXXXXXXXXXXXXXXXXXXXXXXXXXXXCCXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXCXXXXXXXXXXXXXXXXXXXXXXXXXXXXXCXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 154 T 4200 Cas1_AcylT pdbhh F F 5j8k 3 C C COMPLEX I 30KDA/NDUFS3 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 194 F F F 5j8k 4 D D COMPLEX I 49KDA/NDUFS2 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 384 F F F 5j8k 5 E E COMPLEX I 24KDA/NDUFV2 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXCXXXXCXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXCXXXCXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 189 T 1400 Radical_SAM_2 pdbhh F F 5j8k 6 F F COMPLEX I 51KDA/NDUFV1 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXCXXCXXCXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXCXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 429 T 81 Fer4_2 pdbhh F F 5j8k 7 G G COMPLEX I 75KDA/NDUFS1 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXCXXXXXXXXXXCXXCXXXXXXXXXXXXXCXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXHXXXCXXCXXXXXCXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXCXXCXXCXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXCXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 652 T 160 Fer4_2 pdbhh F F 5j8k 8 H H COMPLEX I ND1 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 297 F F F 5j8k 9 I I COMPLEX I TYKY/NDUFS8 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXCXXCXXCXXXCXXXXXXXXXXXXXXXXXXXXXXXXXXXXCXXCXXCXXXCXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 171 T 0.027 Fer4_8 pdbhh F F 5j8k 10 J J COMPLEX I ND6 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 171 F F F 5j8k 11 K K COMPLEX I ND4L XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 93 F F F 5j8k 12 L L COMPLEX I ND5 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 575 F F F 5j8k 13 M M COMPLEX I ND4 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 455 F F F 5j8k 14 N N COMPLEX I ND2 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 345 F F F 5j8k 15 O O COMPLEX I 18KDA/NDUFS6 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 104 F F F 5j8k 16 P P COMPLEX I 13KDA/NDUFS6 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 85 F F F 5j8k 17 Q Q COMPLEX I 15KDA/NDUFS5 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 66 F F F 5j8k 18 R R COMPLEX I MWFE/NDUFA1 XXXXXXXXXXXXXXXXXXXXXXXXXXXXX 29 F F F 5j8k 19 DA,S d,S COMPLEX I B8/NDUFA2 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 80 F F F 5j8k 20 T T COMPLEX I B9/NDUFA3 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 53 F F F 5j8k 21 U U COMPLEX I B13/NDUFA5 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 96 F F F 5j8k 22 V V COMPLEX I B14/NDUFA6 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 112 F F F 5j8k 23 W W COMPLEX I PGIV/NDUFA8 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 103 F F F 5j8k 24 X X COMPLEX I 39KDA/NDUFA9 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 309 F F F 5j8k 25 Y Y COMPLEX I 42KDA/NDUFA10 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 322 F F F 5j8k 26 Z Z COMPLEX I B14.7/NDUFA11 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 119 F F F 5j8k 27 AA a COMPLEX I B17.2/NDUFA12 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 111 F F F 5j8k 28 BA b COMPLEX I B16.6/NDUFA13 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 92 F F F 5j8k 29 CA c COMPLEX I SDAP/NDUFAB1 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 79 F F F 5j8k 30 EA e COMPLEX I B15/NDUFB4 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 55 F F F 5j8k 31 FA f COMPLEX I B18/NDUFB7 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 59 F F F 5j8k 32 GA g COMPLEX I B22/NDUFB9 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 130 F F F 5j8k 33 HA,UA,VA h,9,z COMPLEX I PDSW/NDUFB10 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 63 F F F 5j8k 34 IA i COMPLEX I ESSS/NDUFB11 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 70 F F F 5j8k 35 JA j COMPLEX I KFYI/NDUFC1 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 44 F F F 5j8k 36 KA k COMPLEX I B14.5B/NDUFC2 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 83 F F F 5j8k 37 LA 0 COMPLEX I UNKNOWN SUBUNIT FRAGMENT 1 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 36 F F F 5j8k 38 MA 1 COMPLEX I UNKNOWN SUBUNIT FRAGMENT 2 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 30 F F F 5j8k 39 NA 2 COMPLEX I UNKNOWN SUBUNIT FRAGMENT 3 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 38 F F F 5j8k 40 OA,PA 3,4 COMPLEX I UNKNOWN SUBUNIT FRAGMENT 4 XXXXXXXXXXXXXXXXXXXXXXXXXXXX 28 F F F 5j8k 41 QA 5 COMPLEX I UNKNOWN SUBUNIT FRAGMENT 5 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 34 F F F 5j8k 42 RA 6 COMPLEX I UNKNOWN SUBUNIT FRAGMENT 6 XXXXXXXXXXXXXXXXXXXXX 21 F F F 5j8k 43 SA 7 COMPLEX I UNKNOWN SUBUNIT FRAGMENT 7 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 39 F F F 5j8k 44 TA 8 COMPLEX I UNKNOWN SUBUNIT FRAGMENT 8 XXXXXXXXXXXXXXXXXXXXXXXXXXX 27 F F F 5j8k 45 WA y COMPLEX I UNKNOWN SUBUNIT FRAGMENT 11 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 46 F F F 5j8k 46 XA x COMPLEX I UNKNOWN SUBUNIT FRAGMENT 12 XXXXXXXXXXXXX 13 F F F 5j8k 47 YA w COMPLEX I UNKNOWN SUBUNIT FRAGMENT 13 XXXXXXXXXXXXXXXXXXXXXXXX 24 F F F 5j8k 48 ZA v COMPLEX I UNKNOWN SUBUNIT FRAGMENT 14 XXXXXXXXXXXXXXXXXX 18 F F F 5j8k 49 AB u COMPLEX I UNKNOWN SUBUNIT FRAGMENT 15 XXXXXXXXXXXXXXXX 16 F F F 5j8k 50 BB t COMPLEX I UNKNOWN SUBUNIT FRAGMENT 16 XXXXXXXXXXXX 12 F F F 5j8p 2 B B RL40_HUMAN D-ubiquitin XXXXXXXXXGXXXXXXXXXXXXXXXXXXXXXXXXGXXXXXXXXXXXGXXXXXGXXXXXXXXXXXXXXXXXXXXXGX 76 F F Eukaryota F 5j9q 5 M,N,O L,M,O H2AZ_YEAST Htz1 SGAKDSGSLR 10 T 0.0022 Histone unppercent F Eukaryota T 5jb3 17 Q C 30S ribosomal protein SX XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 57 F F F 5jbh 28 BA C 30S ribosomal protein SX XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 57 F F F 5jbq 2 B B THIOMURACIN ANALOG SXNXXXYXXXXXX 13 T 0.79 CCER1 pdbhh F F 5jbv 2 B B RL40_HUMAN D-ubiquitin XXXXXXXXXGXXXXXXXXXXXXXXXXXXXXXXXXGXXXXXXXXXXXGXXXXXGXXXXXXXXXXXXXXXXXXXXXGX 76 F F Eukaryota F 5jby 2 B,D,F B,D,F RL40_HUMAN D-ubiquitin XXXXXXXXXGXXXXXXXXXXXXXXXXXXXXXXXXGXXXXXXXXXXXGXXXXXGXXXXXXXXXXXXXXXXXXXXXGX 76 F F Eukaryota F 5jcy 2 B B SPIR2_HUMAN SPIR-2 QRPRPRVLLKAPTLAEMEEMNTSEEEE 27 T 14 DUF5395 pdbhh F Eukaryota T 5jej 1 A,B,C C,D,E STING_HUMAN HSTING,ENDOPLASMIC RETICULUM INTERFERON STIMULATOR,ERIS,MEDIATOR OF IRF3 ACTIVATION,HMITA,TRANSMEMBRANE PROTEIN 173 STVGSLKTSAVPSTSTMSQEPELLISGMEKPLPLRTDWS 39 T 10 Herpes_IE68 pdbhh F Eukaryota T 5jek 2 C,D C,D MAVS_HUMAN MAVS peptide SGCFEDLAISASTSLGWG 18 T 3.6 GRA6 unphh F Eukaryota T 5jel 2 B B TCAM1_HUMAN Phosphorylated TRIF peptide SPASLASNLEISQSPTMPFWS 21 T 17 DUF4675 pdbhh F Eukaryota T 5jez 2 B D Met-Ala-Ser MAS 3 T 280 zf-C2H2_4 pdbhh F F 5jf0 2 B D MET-ALA-ARG MAR 3 T 160 Ferredoxin_N pdbhh F F 5jfg 2 B B PEPTIDE FHTA XFHTAX 6 T 290 Archease pdbhh F F 5jfi 2 B,D C,D CLE41_ARATH CLE41 HEVPSGPNPISN 12 T 2.6 DUF502 pdbhh F Eukaryota T 5jft 2 C,D F,C ACE-ASP-GLU-VAL-ASK XDEVDX 6 T 200 ResIII pdbhh F F 5jg9 1 A,B,C A,B,C de novo design, hyper stable, disulfide-rich mini protein GSEERRYKRCGQDEERVRRECKERGERQNCQYQIRKEGNCYVCEIRC 47 T 18 Rad50_zn_hook pdbhh F T 5jge 1 A,B,D,E A,B,D,E ATG19_YEAST CYTOPLASM-TO-VACUOLE TARGETING PROTEIN 19 GPHMLDNFMKQLLKLEESLNKLELEQKVTNKE 32 T 2.7 NCKAP5 pdbhh F Eukaryota T 5jge 2 C,F C,F AMPL_YEAST Ape1 propeptide GPMEEQREILEQLKKTLQMLTVY 23 T 0.74 PKHD_C pdbhh F Eukaryota T 5jhc 1 A,AA,B,BA,C,CA,D,DA,E,EA,F,FA,G,GA,H,HA,I,IA,J,JA,K,KA,L,LA,M,MA,N,O,P,Q,R,S,T,U,V,W,X,Y,Z A,g,C,P,E,h,G,R,I,i,K,T,M,j,O,V,Q,k,S,X,U,l,W,Z,Y,m,B,a,b,D,F,c,H,d,J,e,L,f,N AMPL_YEAST AMINOPEPTIDASE YSCI,LEUCINE AMINOPEPTIDASE IV,LAPIV,LYSOSOMAL AMINOPEPTIDASE III,POLYPEPTIDASE,VACUOLAR AMINOPEPTIDASE I GPMEEQREILEQLKKTLQMLTVEL 24 T 0.86 PKHD_C pdbhh F Eukaryota T 5jhf 4 G,H G,H C5DB94_LACTC Atg13 17BR SKYSSSFGRLRRQ 13 T 6.2 Corona_5a pdbhh F Eukaryota T 5jhf 5 I,J I,J C5DB94_LACTC Atg13 17LR LQPFKAGSVGSGS 13 T 0.6 DUF565 pdbhh F Eukaryota T 5jhi 1 A A DE NOVO MINIPROTEIN EHE_06 CKQRRRYRGSEEECRKYAEELSRRTGCEVEVECET 35 T 0.048 Ribosomal_S4 pdb F T 5jhj 1 A A R9RX08_MAGOR Antivirulence protein AVR-Pia APQDNTSMGSSHHHHHHSSGRENLYFQGHMAAPARSCVYYDGHLPATRVLLMYVRIGNTATITARGHEFEVEAKDQNCKVILTNGKQAPDWLAAEPY 97 T 0.012 Pirin_C unppssm F Eukaryota T 5jhq 2 E,F,G,H,I,J,K,L E,F,G,H,I,J,K,L LCAP_HUMAN Peptide derived from insulin-responsive aminopeptidase (IRAP) ATGYRQSPDGACSVPS 16 T 0.0072 GBP_PSP pdb F Eukaryota T 5ji4 1 A A DE NOVO MINIPROTEIN EEHE_02 APCECDVNGETYTVSSSEECERLCRKLGVTNCRVHCG 37 T 0.82 DUF6482 pdbhh F T 5jie 1 A,B,C,D,E B,C,E,D,A E9KNV6_9VIRU Protein delta MPSEDYAIWYARATIAALQAAEYRLAMPSASYTAWFTDAVSDKLDKISESLNTLVECVIDKRLAVS 66 T 0.048 DNMT1-RFD pdb T Viruses T 5jiu 2 C,D C,D DDX4_MOUSE DEAD BOX PROTEIN 4,MVH,VASA HOMOLOG KSETEGGESSDSQGPKVTYI 20 T 26 Polo_box_3 pdbhh F Eukaryota T 5jja 2 C,D C,D BUB1B_HUMAN MAD3/BUB1-RELATED PROTEIN KINASE,HBUBR1,MITOTIC CHECKPOINT KINASE MAD3L,PROTEIN SSK1 GKTSEDQQTACGTIYSQTLSIKKLDPIIEDDREADHSSGFSGSSASVASTSSIKCLQIPEKLELTNETSENPTQS 75 T 0.066 NifU_N pdbpssm F Eukaryota T 5jjm 2 B,D,F,H M,F,K,L Unknown peptide MSD 3 T 140 CBFNT pdbhh F F 5jjz 2 B B H14_HUMAN LYS-LYS-LYS-ALA-ARG-MLY-SER-ALA-GLY-ALA-ALA-LYS-TYR KKKARKSAGAAKY 13 T 0.2 DUF5797 unp F Eukaryota T 5jkq 1 A,B,C,D A,B,D,C C6KSR6_PLAF7 PfVFT1 GSMGVEEVVNNKAKRLIDIYHAAVKELIQNEELIDLIDKHNVDYSVIESIENLPNLADINVKDDIDDVLSEIIKKKEVKIGALKNKNWGIIGNYEQNPPVGFWPDVMYIIWETISKHIFNDEDAINIAYNYYDNVFVALNDKDIHMTDNYFLSNSRLVDQSGNNLPKLTSGLPIIKHSNKIMILKEYNINNLEDLKSYISKNEGLKIACLTEANCNALKNIFLDKVTYDYKSFSSYIDLSKSVLSKSHIIGVISGIPFNFNEHKINVFDSFLKTGHSAYFKAAA 284 T 8.4E-05 SBP_bac_3 unphh F Eukaryota T 5jlf 2 F,G F,G Tropomyosin Alpha-1 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 135 F F F 5jlh 3 H,I,J,K H,I,J,K Tropomyosin alpha-3 chain XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 135 F F F 5jm1 2 B,D,F,H B,D,F,H LECB3_ARTIN JACALIN BETA-3 CHAIN EQSGISQTVIVGPWGAKVS 19 T 2.9 DUF3842 pdbhh F Eukaryota T 5jm4 2 C,D D,E GLN-GLY-MKD-ANG-ASP-MKD-LEU-ASP-LEU-ALA-CLU QGXXDXLDLAX 11 T 120 DUF1797 pdbhh F T 5jmb 1 A,B A,B B3JI28_9BACE Uncharacterized protein SNAVTVDDLVEGIAFSITHDSENPNIVYLKSLMPSSYQVCWQHPQGRSQEREVTLQMPFEGKYEVTFGVQTRGGIVYGNPATFTIDSFCADFVN 94 T 0.0055 ARL6IP6 pdbpercent F Bacteria T 5jmo 3 E,F G,H CMK-inhibitor XRVKXX 6 T 280 MIB_HERC2 pdbhh F F 5jnb 2 E,F,G,H E,F,G,H O61711_CAEEL RNP (RRM RNA binding domain) containing TLFDNHPVQQYSGFNPIDFRFDDYVEGAKRFDNLANLIRSSTPTDPFANYQKPCESTSTSRSRTNSAKDQKHGP 74 T 0.056 Toxin_YhaV pdbpercent F Eukaryota T 5jp2 1 A,C E,F EPS15_HUMAN PROTEIN EPS15,PROTEIN AF-1P TNLDFFQSDPFVGSDPFKDDPFGGAGA 27 T 8.5 Taeniidae_ag pdbhh F Eukaryota T 5jpf 2 B M Microcystin-LR XLXRXXX 7 T 49 Phage_TAC_2 pdbhh F F 5jpl 1 A A J7LF03_NOCAA Uncharacterized protein GRPNWGFENDWSCVRVC 17 T 5.9 DUF4710 pdbhh F Bacteria T 5jpo 2 E E EF1D_HUMAN EF-1-DELTA,ANTIGEN NY-CO-4 GAMATNFLAHEKIWFDKFKYDDAERRFYEQMN 32 T 5.1 PRR18 pdbhh F Eukaryota T 5jpq 1 A,B,C,D,E,F,J,K,L,LA,N,NA,P A,B,C,D,E,F,J,K,L,l,N,n,P WD40 domain proteins XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 1290 F F F 5jpq 3 H H UTP-A oligomerization domain XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 920 F F F 5jpq 5 M,MA,O M,m,O WD40 domain proteins XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 870 F F F 5jpq 6 Q Q UTP6 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 456 F F F 5jpq 7 R R UTP-B oligomerisation domain XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 560 F F F 5jqf 1 A,B A,B A0A1D5B387_SPHAL Sphingopyxin I GIEPLGPVDEDQGEHYLFAGG 21 T 11 SpecificRecomb pdbhh F Bacteria T 5jqz 1 A,B A,B De novo designed homotetramer GSHMGTAIEANSRMLKALIEIAKAIWKALWANSLLLEATSRGDTERMRQWAEEARKIYKEAEKIIDRADEIVEEAKKRHD 80 T 0.19 Dec-1 pdb F T 5jr2 2 E,F,G,H E,F,G,H APYd3 peptide XPYCVYRXSWSCX 13 T 0.7 DUF1684 pdbhh F T 5jr6 2 C F Apstatin XPPAX 5 T 680 SEC-C pdbhh F F 5jte 57 EB B5 ErmBL AVFQMRNVD 9 T 1.4E-05 ErmC pdbhh F T 5jtm 2 E,F,G,H E,F,G,H PPB_ECOLI APASE MKQSTIALALLPLLFTPVTKARTPE 25 T 3.6 Mfp-3 pdbhh F Bacteria T 5jts 1 A A A0A1L1QK12_STRSQ beta-1,4-mannanase GPLGSSACPSGATCGSYTVGGLGSRKQQVRNAGGSSLDLAVAMLETERMDTAYPYGDNKSGDAANFGIFKQNWLMLRSACAQFGGQGAGQYDNGAALNSSLGQDVSCLHQSQSHYGLDAWFAGHRNGASGLSSPNTADIAAYKAAVYWIKAQLDADSANLGNDTRFWVQVPAI 173 T 0.16 Lys pdb F Bacteria T 5ju8 56 DB B5 ErmBL AVFQMRNVD 9 T 1.4E-05 ErmC pdbhh F T 5ju9 1 A A A0A1L1QK13_9ACTN beta-1,4-mannanase SACPSGATCGSYTVGGLGSRKQQVRNAGGSSLDLAVAMLETERMDTAYPYGDNKSGDAANFGIFKQNWLMLRSACAQFGGQGAGQYDNGAALNSSLGQDVSCLHQSQSHYGLDAWFAGHRNGASGLSSPNTADIAAYKAAVYWIKAQLDADSANLGNDTRFWVQVPAI 168 T 0.15 Lys pdb F Bacteria T 5jub 2 C,D C,D ComS LPYFAGCL 8 T 1.4 IL17R_fnIII_D2 pdbhh F T 5jug 1 A A A0A1L1QK16_9ACTN beta-1,4-mannanase SACPSGATCGSYTVGGLGSRKQQVRNAGGSSLDLAVAMLQTERMDTAYPYGDNKSGDAANFGIFKQNWLMLRSACAQFGGQGAGQYDNGAALNSSLGQDVSCLHQSQSHYGLDAWFAGHRNGASGLSSPNTADIAAYKAAVYWIKAQLDADSANLGNDTRFWVQVPAI 168 T 0.14 Lys pdb F Bacteria T 5jui 1 A,B,C A,B,C A0A0H2URK1_STRPN Cell wall surface anchor family protein HHHHHHSGNTIVNGAPAINASLNIAKSETKVYTGEGVDSVYRVPIYYKLKVTNDGSKLTFTYTVTYVNPKTNDLGNISSMRPGYSIYNSGTSTQTMLTLGSDLGKPSGVKNYITDKNGRQVLSYNTSTMTTQGSGYTWGNGAQMNGFFAKKGYGLTSSWTVPITGTDTSFTFTPYAARTDRIGINYFNGGGKVVESST 198 T 0.41 FlgD_ig pdbpercent F Bacteria T 5jxh 2 B H 2UC-ARG-VAL-ARG-00S XRVRX 5 T 450 Consortin_C pdbhh F F 5jxj 2 B H 2UC-ARG-VAL-ARG-00S XRVRX 5 T 450 Consortin_C pdbhh F F 5jxt 2 I,J,K,T,U,V,W R,S,U,Q,T,V,W A0A0E9NAT8_9ASCO Histone H4 SGRGKGGKGLGKGGAKRHRKI 21 T 84 DUF4196 pdbhh F Eukaryota T 5jy0 2 B B LEU-ASP-VAL LDV 3 T 300 DUF2249 pdbhh F F 5jzr 1 A,B A,B Q9AZ42_9VIRU Coat protein MANKPMQPITSTANKIVWSDPTRLSTTFSASLLRQRVKVGIAELNNVSGQYVSVYKRPAPKPEGCADACVIMPNENQSIRTVISGSAENLATLKAEWETHKRNVDTLFASGNAGLGFLDPTAAIVSSDTTA 131 T 0.19 Glycoprot_B_PH2 pdbpssm T Viruses T 5k0y 26 Z d IF2B_SACS2 eukaryotic initiation factor 2 subunit Beta (eIF2-Beta) SEKEYVEMLDRLYSKLP 17 T 0.77 DUF6103 pdbhh F Archaea T 5k18 3 E,F F,E Bisubstrate inhibitor XMDSEVAALVID 12 T 0.89 Trm56 pdbhh F T 5k1h 2 B A eIF3a C-terminal tail XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 54 F F F 5k2e 1 A A ERF3_YEAST SUP35, ERF-3, ERF3, ERF2, G1 TO S PHASE TRANSITION PROTEIN 1, OMNIPOTENT SUPPRESSOR PROTEIN 2, PSI NO MORE PROTEIN 2, POLYPEPTIDE RELEASE FACTOR 3, TRANSLATION RELEASE FACTOR 3 NNQQNY 6 T 1.3 TFIIA unppssm F Eukaryota F 5k2f 1 A A ERF3_YEAST SUP35, ERF-3, ERF3, ERF2, G1 TO S PHASE TRANSITION PROTEIN 1, OMNIPOTENT SUPPRESSOR PROTEIN 2, PSI NO MORE PROTEIN 2, POLYPEPTIDE RELEASE FACTOR 3, TRANSLATION RELEASE FACTOR 3 NNQQNY 6 T 1.3 TFIIA unppssm F Eukaryota F 5k2g 1 A A ERF3_YEAST SUP35, ERF-3, ERF3, ERF2, G1 TO S PHASE TRANSITION PROTEIN 1, OMNIPOTENT SUPPRESSOR PROTEIN 2, PSI NO MORE PROTEIN 2, POLYPEPTIDE RELEASE FACTOR 3, TRANSLATION RELEASE FACTOR 3 GNNQQNY 7 T 1.3 TFIIA unppssm F Eukaryota F 5k2h 1 A A ERF3_YEAST SUP35, ERF-3, ERF3, ERF2, G1 TO S PHASE TRANSITION PROTEIN 1, OMNIPOTENT SUPPRESSOR PROTEIN 2, PSI NO MORE PROTEIN 2, POLYPEPTIDE RELEASE FACTOR 3, TRANSLATION RELEASE FACTOR 3 GNNQQNY 7 T 1.3 TFIIA unppssm F Eukaryota F 5k4f 2 B,D C,D VE6_HPV18 HPV18E6 peptide RLQRRRETQV 10 T 0.19 Mu-like_Com unphh T Viruses T 5k4l 2 C,D F,G Unknown Peptide XXXXXXXXXX 10 F F F 5k57 1 A A DDI2_HUMAN Protein DDI1 homolog 2 SQQSHSSPGEITSSPQGLDNPALLRDMLLANPHELSLLKERNPPLAEALLSGDLEKFSRVLVEQQQDRARREQERIRLFSADPFDLEAQAKIEEDIRQ 98 T 0.00065 XPC-binding pdbpssm F Eukaryota T 5k58 3 G,H,I,J L,K,N,M FTSZ_ECOLI Octapeptide LDIPAFLR 8 T 4.6 DUF1848 pdbhh F Bacteria T 5k6s 2 B B BUB1B_HUMAN BubR1 TLSIKKLSPIIEDDREADH 19 T 8.3 YwhD pdbhh F Eukaryota T 5k86 1 A,B,C A,B,C Aza-glycine containing collagen peptide PPGPPGPPGPRXPPGPPGPPGPPGX 25 T 0.00022 Collagen pdbpercent F F 5k99 2 B,D C,D Microcin C MRTGNAXX 8 T 75 RGM_C pdbhh F T 5kdg 1 A A Q9XC73_SALTM UNCHARACTERIZED PROTEIN,VIRULENCE PROTEIN MTATPQGQIIHHRNFQSLYNNSWVCSLNVIKSRDGNNYSALEDITSDNQAFNNILEGIDIIECENLLKEMNVQKIPESSLFTNIKEALQAEVFNSTVEDNFESFISYELQNHGPLMLIRPSLGSECLHAECIVGYDSEVKKVLIYDSMNTSPEWQSNIDVYDKLTLAFNDKYKNEDCSICGLYYDGVYEPKLEHHHHHH 199 T 0.002 YsaB unppssm F Bacteria T 5kdm 3 C C DAXX_HUMAN DAXX,HDAXX,ETS1-ASSOCIATED PROTEIN 1,EAP1,FAS DEATH DOMAIN-ASSOCIATED PROTEIN SPRTRGSRRQIQRLEQLLALYVAEIRRLQEKELDLSELDDPDSAYLQEARLKRKLIRLFGRLCELKDCSSLTGRVIEQRIPYRGTRYPEVNRRIERLINKPGPDTFPDYGDVLRAVEKAAARHSLGLPRQQLQLMAQDAFRDVGIRLQERRHLDLIYNFGCHLTDDYRPGVDPALSDPVLARRLRENRSLAMSRLDEVISKYAMLQDKSEEG 212 T 0.0026 Syntaxin_2 pdbpercent F Eukaryota T 5kds 1 A G BSM FRAGMENT TAPGG 5 T 86 P5CR_dimer pdbhh F F 5kev 1 A A Q87GI4_VIBPA VtrA Protein MTAKDDYPSLSFQQDYVYIFSSDFQLSEELGVALINALSAKEIVPERLYVMLNDKTISFSFISKNKKSKNRVLSTEKKLNYKHISEYIVNEIEY 94 T 0.014 NUP214 pdb F Bacteria T 5kev 2 B B Q87GI3_VIBPA VtrC Protein MGSSHHHHHHSQDPVHFYETSYKYQAADSTYMHDVAINVSIKGNHFTSDIIIRELVKSENKNYYNVIGHGDIIQKNTHQYYLNFDNIDVYTGTNKANMKPYKEPTSISSLINKSNNIRVVYLSEEYVVVEFFFYDGQIITLHRY 144 T 0.17 Gp13-like unppercent F Bacteria T 5kew 1 A,C,E A,C,E Q87GI4_VIBPA VtrA Protein MTAKDDYPSLSFQQDYVYIFSSDFQLSEELGVALINALSAKEIVPERLYVMLNDKTISFSFISKNKKSKNRVLSTEKKLNYKHISEYIVNEIEY 94 T 0.014 NUP214 pdb F Bacteria T 5kew 2 B,D,F B,D,F Q87GI3_VIBPA VtrC Protein MGSSHHHHHHSQDPVHFYETSYKYQAADSTYMHDVAINVSIKGNHFTSDIIIRELVKSENKNYYNVIGHGDIIQKNTHQYYLNFDNIDVYTGTNKANMKPYKEPTSISSLINKSNNIRVVYLSEEYVVVEFFFYDGQIITLHRY 144 T 0.17 Gp13-like unppercent F Bacteria T 5kez 2 B B ACE-DTY-PRO-TYR-SER-CYS-TRP-VAL-ARG-HIS-NH2 XXPYSCWVRHX 11 T 1.5 DUF6006 pdbhh F T 5kgf 7 K,N L,K TP53B_HUMAN Tumor suppressor p53-binding protein 1 LTKAADISLDNLVEGKRKRRS 21 T 11 CHZ pdbhh F Eukaryota T 5kgn 2 C,D C,D macrocyclic peptide inhibitor XXDYPGDYCYLYX 13 T 3.8 Gln_deamidase_2 pdbhh F T 5kgq 1 A A Q4DY78_TRYCC Uncharacterized protein GSAMGHMVKISHEDTQRIKTAFLSYAQGQDKVTEAMIDQLICGAFPGLSWEQLQEKKKGRAAANGYDRSAFFSLVASDEQYVRFIAQHFPCAPEEEKPPEIDALELKTQKGF 112 T 0.13 DICT unppercent F Eukaryota T 5khr 15 R S HSL1_YEAST HSL1 peptide NKENEGPEYPTKIEXYLEEQKPKRAALSDITNS 33 T 13 NTS_2 pdbhh F Eukaryota T 5khu 16 T U unknown AARAALAAA 9 T 21 DUF6052 pdbhh F F 5ki0 1 A A K2C6A_HUMAN Antimicrobial peptide KAMP-19 RAIGGGLSSVGGGSSTIKY 19 T 14 DUF4244 pdbhh F Eukaryota T 5kko 1 A,B,C,D,E,F A,B,C,D,E,F Uncharacterised protein SNAMKYFQIDELTLNAMLRITTIESLTPEQRLELIKAHLLNIKTPSDDNEPWDEF 55 T 0.16 DUF6291 unppssm F T 5kkv 1 A A GCN4-p2L XGMKQIEDKIEEILSKIYHIENEIARIKKLIGEGHH 36 T 0.0018 VGPC1_C pdbhh F T 5klc 1 A A A0A0R5P8X1_9BACT Carbohydrate binding module E1 GSHMSASCGSGNFNKTAAKGVEFSAVAGDCIKYNKSSGTLQIGSWTGVASSYNITSGPQGITNTGNGWTTVANAANGDLYIKIVSASRSFNVKFDNW 97 T 0.41 Pox_T4_N unp F Bacteria T 5kle 1 A A A0A0R5P8X1_9BACT Carbohydrate binding module E1 GSHMSASCGSGNFNKTAAKGVEFSAVAGDCIKYNKSSGTLQIGSWTGVASSYNITSGPQGITNTGNGWTTVANAANGDLYIKIVSASRSFNVKFDNW 97 T 0.41 Pox_T4_N unp F Bacteria T 5klf 1 A A A0A0R5P8X1_9BACT Carbohydrate binding module E1 GSHMSASCGSGNFNKTAAKGVEFSAVAGDCIKYNKSSGTLQIGSWTGVASSYNITSGPQGITNTGNGWTTVANAANGDLYIKIVSASRSFNVKFDNW 97 T 0.41 Pox_T4_N unp F Bacteria T 5klh 1 A,B A,B Q26806_9TRYP Surface glycoprotein GSAMGSSDDPRDNFKKAVSAFDPKPLESWTGTFSDVKATVRRQSLSVAGLGSIPSVYTEATVPVSGNTDGSQLVVKVNINTVAPFTRRSPLHATRERWFSCSSSQCSGYSRKCDCQEKHEQFRNKCYSQGGQYSTQSSKCRLGEKCGYCKQEVYLSKLYLVAASDGKGEYRESTQYQSALYSFGHLSQGYEAVPQDKVQVQLYSEGDPFIALERETMGEGEFGVPNRTAAA 231 T 0.0019 Shisa unphh F Eukaryota T 5klr 2 B A Prototypical P4[R]cNLS SKKAGFPAKKRKVEAA 16 T 20 IGR pdbhh F T 5klt 2 B A Prototypical P4[M]cNLS SKKAGFPAKKMKVEAA 16 T 9.7 DUF4543 pdbhh F T 5kmx 1 A,B,C,D A,B,C,D G0UXP9_TRYCI Putative uncharacterized protein TCIL3000_10_9440 GSAMGSSDEPRDDFKEAVNAFNPNPIEKWTGRFNTENASVRRRTLNVPGFKSIPTVYTEATLPLNKDVTDGRLTVVVNINTVQPFTRRTPLRVKREKWYTCSSSQCSGSSSKCDCHRKHDEFRNKCISEGGRYTTESSKCRLGEKCGYCKQNVYLATLYLVAGSVGGGMYRESDKYQSALYPFYDISQGYEPRQPSSVNVRLYSEGDPFIAFQQLTEGREEFGIPNRTVGAAA 233 T 0.001 DUF4106 unphh F Eukaryota T 5kmx 2 E U Putative uncharacterized protein TCIL3000_10_9440 XXXXXXXXXXXXXX 14 F F F 5knm 4 D N Peptide ILE-LEU-ARG-TRP-GLU-GLN ILRWEQ 6 T 0.72 DUF1216 pdbhh F T 5koa 2 C D FTSZ_ECOLI C-terminal tail of FtsZ DYLDIPAFLRKQ 12 T 1.4 F-box-like_2 pdbhh F Bacteria T 5kpe 1 A A De novo Beta Sheet Design Protein OR664 MQDIVEAAKQAAIAIFQLWKNPTDPEAQELLNKILSPDVLDQVREHARELQKQGIHFEVKRVEVTTDGNTVNVTVELEETTGGTTTNTTYELRFEVDGDTIRRVTVTQNGGSLEHHHHHH 120 T 0.0013 SnoaL_2 pdb F T 5kph 1 A A De novo Beta Sheet Design Protein OR485 MPSEEEEKRQVKQVAKEKLLEQSPNSKVQVRRVQKQGNTIRVELELRTNGKKENYTVEVERQGNTWTVKRITRTVGSLEHHHHHH 85 T 0.00085 DUF3828 pdb F T 5ks5 1 A A EF2K_HUMAN EEF-2K,CALCIUM/CALMODULIN-DEPENDENT EUKARYOTIC ELONGATION FACTOR 2 KINASE GSHMSPDRCQDWLEALHWYNTALEMTDCDEGGEYDGMQDEPRYMMLAREAEMLFTGGYGLEKDPQRSGDLYTQAAEAAMEAMKGRLANQYYQKAEEAWAQMEE 103 T 0.00017 Sel1 unphh F Eukaryota T 5ksa 5 E J GDB0_WHEAT DQ8.5-glia-gamma1 peptide QPQQSFPEQEA 11 T 1.4 DUF3067 pdbhh F Eukaryota T 5ksb 5 I,J I,J GDB0_WHEAT DQ8.5-glia-gamma1 peptide GPQQSFPEQEA 11 T 1.3 DUF3067 pdbhh F Eukaryota T 5kvn 1 A A Designed peptide NC_HEE_D1 NDKCKELKKRYPNCEVRCDXPRYEVHC 27 T 0.31 LPD29 pdbhh F T 5kwn 2 B U HY5_ARATH peptide 16-mer IESDEEIRRVPEFGGEAVG 19 T 0.2 Macoilin unppercent F Eukaryota T 5kwo 1 A A Designed peptide NC_EHE_D1 CQTWRXVSPEECRKYKEEYXCVRCTE 26 T 0.2 zf-CW pdbpssm F T 5kwp 1 A A Designed peptide NC_EEH_D2 TCVECXXVKVCRPDPEEARREAEERCX 27 T 2 Herpes_IE1 pdbhh F T 5kwx 1 A A Designed peptide NC_EEH_D1 CSYTCXPQTYTFPTCEEAKKMKKRC 25 T 0.8 Fer4_5 pdbhh F T 5kwz 1 A A Designed peptide NC_cHH_D1 HDPEKRKECEKKYTDPKKREECKRKA 26 T 0.24 Antimicrobial21 pdb F T 5kx0 1 A A Designed peptide NC_cHh_DL_D1 NPELQRKCKELXTRXXXXXXXXXXSD 26 T 11 DUF3511 pdbhh F T 5kx1 1 A A Designed peptide NC_cHHH_D1 NPEDCRQDPEANKSPEECKKLK 22 T 3.1 DUF1388 pdbhh F T 5kx2 1 A A Designed peptide NC_cEE_D1 PVTWCVRIXPTVRCTVRX 18 T 6.5 SapA pdbhh F T 5kyn 2 C C MIA3_HUMAN Melanoma inhibitory activity protein 3 GPRPLPPP 8 T 4.3 OGFr_III pdbhh F Eukaryota F 5kyu 3 C C MIA3_HUMAN TANGO1 peptide2 GPRPLPPP 8 T 4.3 OGFr_III pdbhh F Eukaryota F 5kyw 3 C C MIA3_HUMAN TANGO1 peptide3 LPPPFGPGM 9 T 1.6 dCMP_cyt_deam_2 pdbhh F Eukaryota F 5kzt 2 C C Hexamer peptide: SER-ASP-GLU-SER-LYS-GLY SDESKG 6 T 230 DUF3510 pdbhh F F 5kzt 3 D D Hexamer peptide: SER-ASP-GLU-SER-SER-GLY SDESSG 6 T 270 DUF1684 pdbhh F F 5l0l 1 A,B A,B Q5ZYD3_LEGPH Uncharacterized protein GKKEFLKHEYSPGHWSIDYTRAGTSIAVITVRNKYHYSVILNPTDCRGYRIIIRYLNEGDSTLSSAFNRPYTVSEQRGLNDVASLMTQVYEKLGLIVQFSQLGNNSQSFDKGTGVTLIGSEEEPSMLHLHMWGRGDPDMEYIAGVPLRGPEPGLMFDLIAKNKTHPINQHAIKWNEEELKACLAMFKLKLAEYVNSPEFTEEFGDTLKVTIHDKK 215 T 0.001 DUF3762 pdbpercent F Bacteria T 5l0y 2 I,J,K,L,M I,J,K,L,M G0RYP6_CHATD PRO-THR-VAL-GLU-GLU-VAL-ASP PTVEEVD 7 T 2.6 DUF2368 pdbhh F Eukaryota F 5l20 3 C C Peptide Inhibitor BTN-VLTK-AOMK XVLTX 5 T 1200 DUF592 pdbhh F F 5l23 2 B B RPGF1_HUMAN C3G derived peptide XDNSPPPALPKKRQSYX 17 T 8.5 Ribosomal_L32p pdbhh F Eukaryota T 5l3f 3 C C Polmyxin B XXXXXXFLXXX 11 T 81 Gas_vesicle_C pdbhh F F 5l3g 3 C C COLISTIN XXXXXXLLXXX 11 T 48 DUF2525 pdbhh F F 5l3t 4 D,E D,E Sac3polyAla XXXXXXXXXXXXXXXXXXXXXXXX 24 F F F 5l3t 5 F F Sac3polyAla XXXXXXXXXXXX 12 F F F 5l3x 1 A A NELFA_HUMAN NELF-A,WOLF-HIRSCHHORN SYNDROME CANDIDATE 2 PROTEIN ESDTGLWLHNKLGATDELWAPPSIASLLTAAVIDNIRLCFHGLSSAVKLKLLLGTLHLPRRTVDEMKGALMEIIQLASLDSDPWVLMVADILKSFPDTGSLNLELEEQNPNVQDILGELREKVGECEASAMLPLECQYLNKNALTTLAGPLTPPVKHFQLKRKPKSATLRAELLQKS 177 T 0.00076 Adaptin_N pdbpssm F Eukaryota T 5l7e 2 B B NCOA1_HUMAN NCOA1 peptide KSLLQQLLTE 10 T 7.3 E3_UbLigase_RBR pdbhh F Eukaryota T 5l7g 2 B B NCOA1_HUMAN NCOA1 peptide KSLLQQLLTE 10 T 7.3 E3_UbLigase_RBR pdbhh F Eukaryota T 5l7h 2 B B NCOA1_HUMAN NCOA1 peptide KSLLQQLLTE 10 T 7.3 E3_UbLigase_RBR pdbhh F Eukaryota T 5l7k 2 B B NPHP3_HUMAN GLY-THR-ALA-SER-SER-LEU GTASSL 6 T 160 DUF5536 pdbhh F Eukaryota F 5l82 1 A A Enterococcin K1 MKFKFNPTGTIVKKLTQYEIAWFKNKHGYYPWEIPRC 37 T 0.016 Psg1 pdb F T 5l83 1 A,B C,D ASP-TRP-GLU-ILE-VAL DWEIV 5 T 24 MciZ pdbhh F F 5l85 1 A A ZNHI3_HUMAN HNF-4A COACTIVATOR,THYROID HORMONE RECEPTOR INTERACTOR 3,THYROID RECEPTOR-INTERACTING PROTEIN 3,TRIP-3 GPHMDRVSLQNLKNLGESATLRSLLLNPHLRQLMVNLDQGEDKAKLMRAYMQEPLFVEFADCCLGIVEPSQNEES 75 T 0.00049 STI1 unphh F Eukaryota T 5l85 2 B B NUFP1_HUMAN NUCLEAR FMRP-INTERACTING PROTEIN 1 DIRHERNVILQCVRYIIKKDFFGLDTNSAKSKDV 34 T 0.18 Nup188 unppssm F Eukaryota T 5l8e 2 C C Unknown XXXXX 5 F F F 5l9v 2 C,D C,D HIF1A_HUMAN HIF1-ALPHA,ARNT-INTERACTING PROTEIN,BASIC-HELIX-LOOP-HELIX-PAS PROTEIN MOP1,CLASS E BASIC HELIX-LOOP-HELIX PROTEIN 78,BHLHE78,MEMBER OF PAS PROTEIN 1,PAS DOMAIN-CONTAINING PROTEIN 8 DACTLLAPAAGDTIISLCF 19 T 13 DUF5913 pdbhh F Eukaryota T 5la9 2 C,D C,D HIF1A_HUMAN HIF1-ALPHA,ARNT-INTERACTING PROTEIN,BASIC-HELIX-LOOP-HELIX-PAS PROTEIN MOP1,CLASS E BASIC HELIX-LOOP-HELIX PROTEIN 78,BHLHE78,MEMBER OF PAS PROTEIN 1,PAS DOMAIN-CONTAINING PROTEIN 8 DACTLLAPAAGDTIISLCF 19 T 13 DUF5913 pdbhh F Eukaryota T 5lah 1 A A TX121_URTEQ tau-AnmTx Ueq 12-1 CYPGQPGCGHCSRPNYCEGARCESGFHDCGSDHWCDASGDRCCCA 45 T 1.4 TerY_C pdbhh F Eukaryota T 5lak 2 E,F,G I,J,K BEZ-TYR-TYR-ASN-ECC Peptide inhibitor XYYNX 5 T 62 AZUL pdbhh F F 5las 2 C,D C,D HIF1A_HUMAN HIF1-ALPHA,ARNT-INTERACTING PROTEIN,BASIC-HELIX-LOOP-HELIX-PAS PROTEIN MOP1,CLASS E BASIC HELIX-LOOP-HELIX PROTEIN 78,BHLHE78,MEMBER OF PAS PROTEIN 1,PAS DOMAIN-CONTAINING PROTEIN 8 DACTLLAPAAGDTIISLCF 19 T 13 DUF5913 pdbhh F Eukaryota T 5lb7 3 C C ASPM_MOUSE CALMODULIN-BINDING PROTEIN SHA1,CALMODULIN-BINDING PROTEIN 1,SPINDLE AND HYDROXYUREA CHECKPOINT ABNORMAL PROTEIN LSPDSFLND 9 T 0.89 Cmyb_C pdbhh F Eukaryota T 5lc5 16 P P COMPLEX I-39KD,CI-39KD,NADH-UBIQUINONE OXIDOREDUCTASE 39 KDA SUBUNIT XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 335 F F F 5lc5 17 Q Q COMPLEX I-18 KDA,CI-18 KDA,COMPLEX I-AQDQ,CI-AQDQ,NADH-UBIQUINONE OXIDOREDUCTASE 18 KDA SUBUNIT XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 113 F F F 5lc5 36 JA j NADH dehydrogenase [ubiquinone] 1 beta subcomplex subunit 2, NDUFB2 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 52 F F F 5lc5 37 KA k NADH dehydrogenase [ubiquinone] 1 beta subcomplex subunit 3, NDUFB3 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 74 F F F 5lc5 38 LA l NADH dehydrogenase [ubiquinone] 1 beta subcomplex subunit 8, NDUFB8 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 118 F F F 5lc5 44 RA r NADH dehydrogenase [ubiquinone] 1 alpha subcomplex subunit 7, NDUFA7 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 87 F F F 5lc5 45 SA s NADH dehydrogenase [ubiquinone] flavoprotein 3, NDUFV3 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 35 F F F 5ldw 16 P P NADH dehydrogenase [ubiquinone] 1 alpha subcomplex subunit 9, NDUFA9 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 300 F F F 5ldw 17 Q Q NADH dehydrogenase [ubiquinone] iron-sulfur protein 4, NDUFS4 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 113 F F F 5ldw 35 JA j NADH dehydrogenase [ubiquinone] 1 beta subcomplex subunit 2, NDUFB2 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 52 F F F 5ldw 36 KA k NADH dehydrogenase [ubiquinone] 1 beta subcomplex subunit 3, NDUFB3 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 74 F F F 5ldw 37 LA l NADH dehydrogenase [ubiquinone] 1 beta subcomplex subunit 8, NDUFB8 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 118 F F F 5ldw 43 RA r NADH dehydrogenase [ubiquinone] 1 alpha subcomplex subunit 7, NDUFA7 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 87 F F F 5ldw 44 SA s NADH dehydrogenase [ubiquinone] flavoprotein 3, NDUFV3 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 35 F F F 5ldx 16 P P NADH dehydrogenase [ubiquinone] 1 alpha subcomplex subunit 9, NDUFA9 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 283 F F F 5ldx 17 Q Q NADH dehydrogenase [ubiquinone] iron-sulfur protein 4, NDUFS4 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 113 F F F 5ldx 35 JA j NADH dehydrogenase [ubiquinone] 1 beta subcomplex subunit 2, NDUFB2 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 52 F F F 5ldx 36 KA k NADH dehydrogenase [ubiquinone] 1 beta subcomplex subunit 3, NDUFB3 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 74 F F F 5ldx 37 LA l NADH dehydrogenase [ubiquinone] 1 beta subcomplex subunit 8, NDUFB8 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 118 F F F 5ldx 43 RA r NADH dehydrogenase [ubiquinone] 1 alpha subcomplex subunit 7, NDUFA7 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 87 F F F 5ldx 44 SA s NADH dehydrogenase [ubiquinone] flavoprotein 3, NDUFV3 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 35 F F F 5leo 2 C,D G,H GLY-GLY-GLY-GLY-GLY GGGGG 5 T 56 Parvo_coat pdbhh F F 5ley 15 CA,DA c,d bound Oprozomib XXXX 4 T 1700 GoLoco pdbhh F F 5lez 15 CA,DA,EA,FA c,d,e,f bound Oprozomib XXXX 4 T 1700 GoLoco pdbhh F F 5lf0 15 CA,DA,EA,FA,GA,HA c,d,e,f,g,h EPOXOMICIN (peptide inhibitor) XXITX 5 T 840 DUF4597 pdbhh F F 5lf6 11 K,Z c,d LLY-ketoaldehyde peptide LLX 3 T 1400 EF-hand_1 pdbhh F F 5lff 1 A A ARG-ALA-CYS-ARG-PHE-PHE-CYS RACRFFC 7 T 0.24 DUF5730 pdbhh F F 5lfh 1 A A ACE-ARG-ALA-DCY-ARG-PHE-PHE-CYS XRAXRFFC 8 T 0.43 DUF5730 pdbhh F F 5lgm 1 A A V5557_BPT7 Fusion protein 5.5/5.7 MSDYLKVLQAIKSCPKTFQSNYVRNNASLVAEAASRGHISCATTSGRNGGAWEITASGTRFLKRMGGCV 69 T 0.012 DUF3116 pdbhh T Viruses T 5lgp 2 E,F,G,H E,F,G,H PABP1_HUMAN POLY(A)-BINDING PROTEIN 1 XFQNMPGAIRPAA 13 T 5.8 DUF1992 pdbhh F Eukaryota T 5lgq 2 E,F,G,H F,E,G,H PABP1_HUMAN POLY(A)-BINDING PROTEIN 1 PAAPRPPFSTM 11 T 19 Spore_YtrH pdbhh F Eukaryota T 5lgr 2 E,F,G,H E,F,G,H PABP1_HUMAN POLY(A)-BINDING PROTEIN 1 FQNMPGAIRPAA 12 T 4.4 DUF1992 pdbhh F Eukaryota T 5lgs 2 E,F,G,H E,F,G,H PABP1_HUMAN POLY(A)-BINDING PROTEIN 1 PAAPRPPFS 9 T 18 MIP-T3 pdbhh F Eukaryota F 5lhw 1 A A STIL_HUMAN TAL-1-INTERRUPTING LOCUS PROTEIN GGSLTEQDRQLRLLQAQIQRLLEAQSLM 28 T 1.8 SlyX pdbhh F Eukaryota T 5lhz 2 D,E,F D,E,F STIL_HUMAN TAL-1-INTERRUPTING LOCUS PROTEIN GGSLTEQDRQLRLLQAQIQRLLEAQSLM 28 T 1.8 SlyX pdbhh F Eukaryota T 5li1 2 B B Q28E03_XENTR UNCHARACTERIZED PROTEIN LAFQREGFGRQSMSEKRTKQ 20 T 0.088 LamB_YcsF pdbhh F Eukaryota T 5lih 2 C,D F,G KPCE_HUMAN PKC Epsilon pseudo substrate sequence ERMRPFKRQGSVRRRV 16 T 20 NumbF pdbhh F Eukaryota T 5lij 1 A P polyalanine chain built in bacteriophage phi812K1-420 cement protein density map AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 152 T 16000 zf_CCCH_4 pdbhh F F 5lj3 31 LA x unknown XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 132 F F F 5lj5 35 SA x unknown XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 188 F F F 5ljn 2 C,D C,D SPAT2_HUMAN SPERMATOGENESIS-ASSOCIATED PROTEIN PD1 DVDLYTDS 8 T 19 Noda_Vmethyltr pdbhh F Eukaryota T 5lm1 2 B B UBAP1_HUMAN UBAP-1 SNIKSLSFPKLDSDDSNQKT 20 T 2 UPF0728 pdbhh F Eukaryota T 5lm5 2 C,D C,D DCP2_YEAST DCP2 DECAPPING FACTOR SSSPGQLLDILNSK 14 T 3.7 TaqI_C pdbhh F Eukaryota T 5lmf 2 C,D C,D DCP2_YEAST DCP2 DECAPPING FACTOR TAHSNSQALLDLLKKPT 17 T 3.6 RMMBL pdbhh F Eukaryota T 5lmg 2 C,D C,D DCP2_YEAST DCP2 DECAPPING FACTOR TSGSNELLSILHRK 14 T 16 RRM_9 pdbhh F Eukaryota T 5lmz 1 A,B A,B W0W999_9ACTN Fluorinase GAMVAANGSQRPIIAFMSDLGTTDDSVAQCKGLMHSICPGVTVVDVCHSMTPWDVEEGARYIVDLPRFFPEGTVFATTTYPATGTTTRSVAVRIRQAAKGGARGQWAGSGDGFERADGSYIYIAPNNGLLTTVLEEHGYIEAYEVTSTKVIPANPEPTFYSREMVAIPSAHLAAGFPLAEVGRRLDDSEIVRFHRPAVEISGEALSGVVTAIDHPFGNIWTNIHRTDLEKAGIGQGKHLKIILDDVLPFEAPLTPTFADAGAIGNIAFYLNSRGYLSLARNAASLAYPYNLKAGLKVRVEAR 302 T 3.8E-42 SAM_adeno_trans unppercent F Bacteria T 5ln4 1 A,B,C A,B,C PSAA_YERPE ADHESIN,ANTIGEN 4,ADHESIN,ANTIGEN 4 TFHVDFAPNTGEIFAGKQPGDVTMFTLTMGDTAPHGGWRLIPTGDSKGGYMISADGDYVGLYSYMMSWVGIDNNWYINDDSPKDIKDHLYVKAGTVLKPTTYKFTGRVEEYVFNDKQSTVINSKDVSGEVTVK 133 T 0.018 SEF14_adhesin unp F Bacteria T 5lnd 1 A,B A,B MYFA_YEREN C-AG,MYF ANTIGEN,C-AG,MYF ANTIGEN,C-AG,MYF ANTIGEN SFSVEFKATENEIVSGKLDADTPAFHLVMSDSGEHKGWNVRPTGASEGGQMVSADGTRVDLHTNELSWDNDHWWIDDGSERVEATFFLAAGDEVKAGEYQFTGRVEEYVEDNKQEPTVINSKDISATKTVKE 132 T 7.3 DUF3836 unppssm F Bacteria T 5lnk 44 SA V Mitochondrial complex I, B14.7 subunit XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 119 F F F 5los 1 A A G4TKU4_SERID PIIN_05872 GPGSAPLPNPPMTPAQHYAQAIHHEGLARHHTTVAEDHRQTANLHDNRIKAAKARYNAGLDPNGLTSAQKHQIERDHHLSLAAQAERHAATHNREAAYHRLHSQTPAPGTKRSIDELD 118 T 0.13 DSBA pdb F Eukaryota T 5lpc 1 A A B0C4R0_ACAM1 Vanadium-dependent bromoperoxidase MGSSHHPHHHHHSSGLEVLFQGPLGSHMNTRRQQAQNIRNNAAELAANRPHPQHINNKEEYEYRRPKKDGNEPSHIANFTKGLPHDEHTGLLLNSADYDQFVLGIQSGDTTDFARTPLGPAELPKVHGCLSKQKIDCDDDHRSGFWKSQIAQGAAGGDGAKLRAWESAGAGLVFDLEGPDAQAVTMPPAPRLESPELTSEIAEVYSQALLRDIHFSQLRDPGLGDQVNACDSCPTQLSIYEAIDILNTVQIEGQNWFSANCCDLTDDEQARQRPLVTRQNIFRGIAPGDDVGPYLSQFLLIGNNALGGGVFGQEAGHIGYGAIRIDQRVRKATPCKDFMTNFETWLDVQNGADLRGLETYVDADPGKCREFPAYRVITTPRDLATYVHYDALYEAYLNACLILLGMGAPFDPGIPFQKPDVEDKQQGFAHFGGPQILTLVCEAATRGLKAVRFQKFNVHRRLRPEALGGLVDRYKHGKGAGDELKPVAALVEALENVGLLSKVVAHNQLQNQNLDRSGDPSSAGDNYFLPMAFPEGSPMHPSYGAGHATVAGACVTMLKAFFDHGWQLNLGMANGKYISYEPNQDGSSLQQVLLDCPLTVEGELNKIAANISIGRDWAGVHYFTDYIESLRLGEKIAIGILEEQKLTYGENFTMTVPLYDGGSIQI 666 T 0.0014 PAP2 unppssm F Bacteria T 5lpr 2 B P METHOXYSUCCINYL-ALA-ALA-PRO-ALANINE BORONIC ACID INHIBITOR XAAPX 5 T 730 Trp_leader1 pdbhh F F 5lqp 1 A,AA,AB,AC,AD,AE,AF,B,BA,BB,BC,BD,BE,BF,C,CA,CB,CC,CD,CE,CF,D,DA,DB,DC,DD,DE,DF,E,EA,EB,EC,ED,EE,EF,F,FA,FB,FC,FD,FE,FF,G,GA,GB,GC,GD,GE,GF,H,HA,HB,HC,HD,HE,HF,I,IA,IB,IC,ID,IE,IF,J,JA,JB,JC,JD,JE,JF,K,KA,KB,KC,KD,KE,KF,L,LA,LB,LC,LD,LE,LF,M,MA,MB,MC,MD,ME,MF,N,NA,NB,NC,ND,NE,NF,O,OA,OB,OC,OD,OE,OF,P,PA,PB,PC,PD,PE,PF,Q,QA,QB,QC,QD,QE,QF,R,RA,RB,RC,RD,RE,RF,S,SA,SB,SC,SD,SE,SF,T,TA,TB,TC,TD,TE,TF,U,UA,UB,UC,UD,UE,UF,V,VA,VB,VC,VD,VE,VF,W,WA,WB,WC,WD,WE,WF,X,XA,XB,XC,XD,XE,XF,Y,YA,YB,YC,YD,YE,Z,ZA,ZB,ZC,ZD,ZE AB,BB,CB,DB,EB,FB,GB,AC,BC,CC,DC,EC,FC,GC,AD,BD,CD,DD,ED,FD,GD,AE,BE,CE,DE,EE,FE,GE,AF,BF,CF,DF,EF,FF,GF,AG,BG,CG,DG,EG,FG,GG,AH,BH,CH,DH,EH,FH,GH,AI,BI,CI,DI,EI,FI,GI,AJ,BJ,CJ,DJ,EJ,FJ,GJ,AK,BK,CK,DK,EK,FK,GK,AL,BL,CL,DL,EL,FL,GL,AM,BM,CM,DM,EM,FM,GM,AN,BN,CN,DN,EN,FN,GN,AO,BO,CO,DO,EO,FO,GO,AP,BP,CP,DP,EP,FP,GP,AQ,BQ,CQ,DQ,EQ,FQ,GQ,AR,BR,CR,DR,ER,FR,GR,AS,BS,CS,DS,ES,FS,GS,AT,BT,CT,DT,ET,FT,GT,AU,BU,CU,DU,EU,FU,GU,AV,BV,CV,DV,EV,FV,GV,AW,BW,CW,DW,EW,FW,GW,AX,BX,CX,DX,EX,FX,GX,AY,BY,CY,DY,EY,FY,GY,AZ,BZ,CZ,DZ,EZ,FZ,BA,CA,DA,EA,FA,GA Q9AZ42_9VIRU Coat protein ANKPMQPITSTANKIVWSDPTRLSTTFSASLLRQRVKVGIAELNNVSGQYVSVYKRPAPKPEGCADACVIMPNENQSIRTVISGSAENLATLKAEWETHKRNVDTLFASGNAGLGFLDPTAAIVSSDTT 129 T 15 Packaging_FI pdbhh T Viruses T 5lqx 1 A 1 ATP synthase subunit f XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 31 F F F 5lqx 2 B 2 ATP synthase AAP1 subunit XXXXXXXXXXXXXXXXXXXXXXXXX 25 F F F 5lqx 3 C 3 ATP synthase subunit a XXXXXXXXXXXXXXXXX 17 F F F 5lqx 4 D 4 ATP synthase subunit b XXXXXXXXXXXXXXXXXXXXXXXXXXX 27 F F F 5lqx 15 BA X ATP synthase subunit h XXXXXXXXXXXXXXXXXXXXX 21 F F F 5lqx 17 DA Z ATP synthase subunit a XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 44 F F F 5lqy 1 A 1 ATP synthase subunit f XXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 30 F F F 5lqy 2 B 2 ATP synthase subunit AAP1 XXXXXXXXXXXXXXXXXXXXXXXXX 25 F F F 5lqy 3 C 3 ATP synthase subunit a XXXXXXXXXXXXXXXXX 17 F F F 5lqy 4 D 4 ATP synthase subunit b XXXXXXXXXXXXXXXXXXXXXXXXXXX 27 F F F 5lqy 15 BA X ATP synthase subunit h XXXXXXXXXXXXXXXXXXXXX 21 F F F 5lqy 17 DA Z ATP synthase subunit a XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 44 F F F 5lqz 1 A 1 ATP synthase subunit f XXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 30 F F F 5lqz 2 B 2 ATP synthase subunit AAP1 XXXXXXXXXXXXXXXXXXXXXXXXX 25 F F F 5lqz 3 C 3 ATP synthase subunit a XXXXXXXXXXXXXXXXX 17 F F F 5lqz 4 D 4 ATP synthase subunit b XXXXXXXXXXXXXXXXXXXXXXXXXXX 27 F F F 5lqz 15 BA X ATP synthase subunit h XXXXXXXXXXXXXXXXXXXXX 21 F F F 5lqz 17 DA Z ATP synthase subunit a XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 44 F F F 5lrg 2 D,E,F F,G,H Anabaenopeptin B XXVXXF 6 T 39 MCD pdbhh F F 5lrj 2 D,E,F F,G,H Anabaenopeptin C XXVXXF 6 T 44 Glyco_hydro_99 pdbhh F F 5lrk 2 D,E,F F,G,H Anabaenopeptin F XXXXXF 6 T 9.4 DUF5736 pdbhh F F 5ls6 4 M,N,O,P Q,R,S,T JARD2_HUMAN Jarid2 K116me3 RLQAQRKFAQS 11 T 21 DUF4395 pdbhh F Eukaryota T 5lsf 4 D,D10,D11,D12,D13,D14,D15,D16,D17,D18,D19,D2,D20,D21,D22,D23,D24,D25,D26,D27,D28,D29,D3,D30,D31,D32,D33,D34,D35,D36,D37,D38,D39,D4,D40,D41,D42,D43,D44,D45,D46,D47,D48,D49,D5,D50,D51,D52,D53,D54,D55,D56,D57,D58,D59,D6,D60,D7,D8,D9 D,D,D,D,D,D,D,D,D,D,D,D,D,D,D,D,D,D,D,D,D,D,D,D,D,D,D,D,D,D,D,D,D,D,D,D,D,D,D,D,D,D,D,D,D,D,D,D,D,D,D,D,D,D,D,D,D,D,D,D A0A2S0CUG6_9VIRU VP4 DNPHRFLPANVSNRWNEYSSAYLPRV 26 T 2.7 YLP pdbhh T Viruses T 5lso 2 B,D C,D LYS-SER-ARG-TRP-ASP-GLU KSRWDE 6 T 0.034 SF3b1 pdbhh F T 5lsp 4 G,H X,Y MET_HUMAN HGF RECEPTOR,HGF/SF RECEPTOR,PROTO-ONCOGENE C-MET,SCATTER FACTOR RECEPTOR,SF RECEPTOR,TYROSINE-PROTEIN KINASE MET ETRECKEALAKSEM 14 T 10 YedD pdbhh F Eukaryota T 5lsw 2 B,D B,D Q9VV48_DROME ROQUIN,ISOFORM A,ISOFORM B,ISOFORM C EGGIDSGMMLQLEKNLVDIVD 21 T 0.18 QueC pdbhh F Eukaryota T 5lu8 1 A C AC-TYR-VAL-ALA-ASP-CHLOROMETHYLKETONE XYVADX 6 T 300 Rhodanese_C pdbhh F F 5lu9 1 A C AC-TYR-VAL-ALA-ASP-CHLOROMETHYLKETONE XYVADX 6 T 300 Rhodanese_C pdbhh F F 5luf 25 JA A complex I XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 111 F F F 5luf 26 KA B complex I XXXXXXXXXXXXXXXXXXXXXXXXXXXCCXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXCXXXXXXXXXXXXXXXXXXXXXXXXXXXXXCXXXXXXXXXXXXXXXXXXXX 143 T 4000 Cas1_AcylT pdbhh F F 5luf 27 LA C complex I XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 154 F F F 5luf 28 MA D complex I XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 384 F F F 5luf 29 NA E complex I XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXCXXXXCXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXCXXXCXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 159 T 1200 Radical_SAM_2 pdbhh F F 5luf 30 OA F complex I XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXCXXCXXCXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXCXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 411 T 78 Fer4_2 pdbhh F F 5luf 31 PA G complex I XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXCXXXXXXXXXXCXXCXXXXXXXXXXXXXCXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXHXXXCXXCXXXXXCXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXCXXCXXCXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXCXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 538 T 140 Fer4_2 pdbhh F F 5luf 32 QA H complex I XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 313 F F F 5luf 33 RA I complex I XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXCXXCXXCXXXCXXXXXXXXXXXXXXXXXXXXXXXXXXXXCXXCXXCXXXCXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 162 T 0.026 Fer4_8 pdbhh F F 5luf 34 SA J complex I XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 171 F F F 5luf 35 TA K complex I XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 84 F F F 5luf 36 UA L complex I XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 601 F F F 5luf 37 VA M complex I XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 453 F F F 5luf 38 WA N complex I XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 345 F F F 5luf 39 XA O complex I XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 220 F F F 5luf 40 YA P complex I XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 303 F F F 5luf 41 ZA Q complex I XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 85 F F F 5luf 42 AB R complex I XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 47 F F F 5luf 43 BB S complex I XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 80 F F F 5luf 44 CB T complex I XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 75 F F F 5luf 45 DB V complex I XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 71 F F F 5luf 46 EB W complex I XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 72 F F F 5luf 47 FB X complex I XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 330 F F F 5luf 48 GB Y complex I XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 106 F F F 5luf 49 HB a complex I XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 142 F F F 5luf 50 IB U complex I XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 828 F F F 5luf 51 JB Z complex I XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 625 F F F 5luq 2 B,D K,S C-terminal fragment of KU80 (KU80ct194) XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXMX 194 T 12000 zf-H2C2_2 pdbhh F F 5lvp 2 E,F,G,H E,F,G,H hydrophobic-motif peptide of PKB/Akt KGAGGGGFPQFSYSA 15 T 6.9 DUF4172 pdbhh F T 5lvy 1 A A C9K5V2_ECOLX Adhesin protein NSCSLSISSPDPVTYTIPTDKGDKYINFKLDVPDPRCKALGGTVYFWGADTRDGKLVMKKGQDKYTLMTTYGGAVQQQLGGGYGYYHVSQKTPPQTISGVVSKNVGYKPGQYTVELTGFFSLNDNKQANPTPSSLTSKAAGKNIVSSTGTITIS 154 T 0.00036 SEF14_adhesin unphh F Bacteria T 5lw1 3 C,F,I C,F,I JIP1_HUMAN JNK-INTERACTING PROTEIN 1,ISLET-BRAIN 1,IB-1,JNK MAP KINASE SCAFFOLD PROTEIN 1,MITOGEN-ACTIVATED PROTEIN KINASE 8-INTERACTING PROTEIN 1 RPKRPTTLNLF 11 T 12 CTP-dep_RFKase pdbhh F Eukaryota T 5lw1 4 J L Pepstatin XVVXAX 6 T 1700 FAM60A pdbhh F F 5lwc 1 A A BS222_STAPS Bacteriocin BacSp222 MAGLLRFLLSKGRALYNWAKSHVGKVWEWLKSGATYEQIKEWIENALGWR 50 F F Bacteria T 5lx2 2 B B PI4KB_HUMAN Phosphatidylinositol 4-kinase beta TASNPK 6 T 13 EndIII_4Fe-2S pdbhh F Eukaryota T 5lxh 2 D,E,F E,F,G ATG4B_HUMAN AUT-LIKE 1 CYSTEINE ENDOPEPTIDASE,AUTOPHAGIN-1,AUTOPHAGY-RELATED CYSTEINE ENDOPEPTIDASE 1,AUTOPHAGY-RELATED PROTEIN 4 HOMOLOG B,HAPG4B EDEDFEILSL 10 T 1.2 Vault_3 pdbhh F Eukaryota T 5lxi 2 C,D C,E ATG4B_HUMAN AUT-LIKE 1 CYSTEINE ENDOPEPTIDASE,AUTOPHAGIN-1,AUTOPHAGY-RELATED CYSTEINE ENDOPEPTIDASE 1,AUTOPHAGY-RELATED PROTEIN 4 HOMOLOG B,HAPG4B EDEDFEILSL 10 T 1.2 Vault_3 pdbhh F Eukaryota T 5lxl 1 A A DECO_BPT5 CAPSID PROTEIN PB10 MGIDYSGLRTIFGEKLPESHIFFATVAAHKYVPSYAFLRRELGLSSAHTNRKVWKKFVEAYGKAIPPAPPAPPLTLSKLEHHHHHH 86 T 0.13 DUF2063 pdbpssm T Viruses T 5ly1 2 E E CP2 XXVYNTRSGWRWYT 14 T 0.14 TraF pdbhh F T 5ly2 2 E,F,G,H E,F,G,H CP2_R6Kme3 XXVYNTKSGWRWYT 14 T 0.11 TraF pdbhh F T 5ly3 2 B B RKD2_PYRCJ ARCADIN-2 GGIGENEWVKILRSKR 16 T 2.4 DUF4287 pdbhh F Archaea T 5lyb 86 XD m2 60S Ribosomal Protein L12 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 150 F F F 5lyb 89 EF,FF p1,p2 60S Ribosomal Protein P1/2 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 47 F F F 5lyn 2 C,D C,D HSP71_YEAST PRO-THR-VAL-GLU-GLU-VAL-ASP PTVEEVD 7 T 2.6 DUF2368 pdbhh F Eukaryota F 5lz3 2 B B POLG_AIVA8 3A GNRVIDAEPREIPLEYADDLLEAMAHHRPVPCSLGLSQAIANNTPIQQISETFWKYRK 58 T 0.061 RsdA_SigD_bd pdbpssm T Viruses T 5lz6 2 B B Q8BES6_9PICO 3A GAHSERTFETAPSEIDADEVLEILSKSKPAPTHLTLER 38 T 0.19 Nitroreductase pdb T Viruses T 5lzx 45 SA 1 Nascent chain DSPGLKV 7 T 4.1 Peptidase_M18 pdbhh F T 5lzz 45 SA 1 Nascent chain DSPGLKV 7 T 4.1 Peptidase_M18 pdbhh F T 5m0i 3 G,H,I H,J,I SHE3_YEAST SWI5-dependent HO expression protein 3 KTNVTHNNDPSTSPTISVPPGVTR 24 T 10 B277 pdbhh F Eukaryota T 5m21 1 A,C,E,G A,C,E,G F8TW82_9SPHN Hydroquinone dioxygenase small subunit MADVVTEFGALTDYRKGGVEIIDDDPRNYVFSNVFEVAANAAPYERVAVGKNFEYVIESARAEGTSGWFSCAHDEFVLAMDGQIEVHLLKLDNSDAYVDPDSEGAVAIGEALPEGRKMGRIVLRRGHMALLPVGAAYRFYAEQPAAMLFQSIEGAVTVQKWGEICQTEAA 170 T 0.18 Lyx_isomer pdbhh F Bacteria T 5m22 1 A,C,E,G A,C,E,G F8TW82_9SPHN Hydroquinone dioxygenase small subunit MADVVTEFGALTDYRKGGVEIIDDDPRNYVFSNVFEVAANAAPYERVAVGKNFEYVIESARAEGTSGWFSCAHDEFVLAMDGQIEVHLLKLDNSDAYVDPDSEGAVAIGEALPEGRKMGRIVLRRGHMALLPVGAAYRFYAEQPAAMLFQSIEGAVTVQKWGEICQTEAA 170 T 0.18 Lyx_isomer pdbhh F Bacteria T 5m26 1 A,C,E,G A,C,E,G F8TW82_9SPHN Hydroquinone dioxygenase small subunit MADVVTEFGALTDYRKGGVEIIDDDPRNYVFSNVFEVAANAAPYERVAVGKNFEYVIESARAEGTSGWFSCAHDEFVLAMDGQIEVHLLKLDNSDAYVDPDSEGAVAIGEALPEGRKMGRIVLRRGHMALLPVGAAYRFYAEQPAAMLFQSIEGAVTVQKWGEICQTEAA 170 T 0.18 Lyx_isomer pdbhh F Bacteria T 5m2h 1 A,B A,B Vancomycin XXNXXXX 7 T 95 P53_C pdbhh F F 5m2k 1 A,B A,B vancomycin XXNXXXX 7 T 95 P53_C pdbhh F F 5m32 34 TA,UA t,u bound Oprozomib XXXX 4 F F F 5m3h 6 F,G X,Y RPB1_HUMAN TYR-SER-PRO-THR-SEP-PRO YSPTSPSYSPTSPSYSPTSPSYSPTSPS 28 T 3.8E-05 RNA_pol_Rpb1_R pdbhh F Eukaryota F 5m3j 6 F X RPB1_HUMAN DNA-directed RNA polymerase subunit YSPTSPSYSPTSPSYSPTSPSYSPTSPS 28 T 3.8E-05 RNA_pol_Rpb1_R pdbhh F Eukaryota F 5m41 1 A A Nigritoxine MSLPSNPTPVIPANLDLGGINHSAVANRYRNLTKEAQQNLYQFAIIEVLSQIREERPDKNLDAYNALIGIDKVTTVDIYTYGATNMFFMPDARGSKTGILVNLNSPDKPYTNIQQPSDFNNINDESFRQNFTSWEKRDGTTYSGVDTALDGLQEGQGGWNLGYFNQKTPRTINISELSKILVERLDYHVSQENNDDQILSTLLLDVLPRSAKGAAREPLGVSASGIPFQLEFTFEGFTSPTDELRAIQSPFSHLAKYFDLLVASTNGSDLQDVEYSQEQAENIGAWIDSGTQLLMSASGIGAAVSVIQGAAGLTADAIEGKEIDPLDVISLSLAAIPGGKIVAKLSKVSKNLGQVVRGGISIAETGVDIVGSSRDLIEGFKKGNFTDIINGLVSVASSSASGRPGKSKIGNAIKKGNPDAPLPTRPTYRNHEGEVRPIPTAQTKSFFERVAIVRREGLSGRGAIGLDLTAAQKRGAELSGMGGTISKSNPNGNVSQVYINEAEGIEKNITYRKVPVPNEPGNFENRLQESFLDNNGQTKWRDFPYAGEEFDFRLQHKDDFNNIGDLGVGKQGIIAVNNPYSFVHHSHTFEQKGISNNHLTLESNAFLTYIEGKKTGDFENKYGNEMEWLVRKFKTKKNDFDLKDIPDNIHFRTDREKGDHSLTTYTLQDFITVVENAPTKMRKVKNDEFALNNIVESMRATAKNMGASPDTLFLDVASTNYMTQLMGQVLTNGRQELNLQGLSNAAQKLRNGASSSV 757 T 0.26 Ago_PAZ pdb F T 5m4o 1 A,C,E,G A,C,E,G F8TW82_9SPHN Hydroquinone dioxygenase small subunit MADVVTEFGALTDYRKGGVEIIDDDPRNYVFSNVFEVAANAAPYERVAVGKNFEYVIESARAEGTSGWFSCAHDEFVLAMDGQIEVHLLKLDNSDAYVDPDSEGAVAIGEALPEGRKMGRIVLRRGHMALLPVGAAYRFYAEQPAAMLFQSIEGAVTVQKWGEICQTEAA 170 T 0.18 Lyx_isomer pdbhh F Bacteria T 5m4t 1 A A VSM1_TRYBB VSG GSKKQQTESAENKEKICNAAKDNQKACENLKEKGCVFNTESNKCELKKDVKEKLEKESKETEGKDEKANTTGS 73 T 0.00018 Trypan_glycop_C unphh F Eukaryota T 5m5g 4 D E G0RYC6_CHATD Fragment from molecular 2 (region containing putative polycomb protein Suz12) VMLPGRGVP 9 T 1.2 SSPI pdbhh F Eukaryota T 5m5r 2 B,C C,D AP2B1_HUMAN AP105B,ADAPTOR PROTEIN COMPLEX AP-2 SUBUNIT BETA,ADAPTOR-RELATED PROTEIN COMPLEX 2 SUBUNIT BETA,BETA-2-ADAPTIN,BETA-ADAPTIN,CLATHRIN ASSEMBLY PROTEIN COMPLEX 2 BETA LARGE CHAIN,PLASMA MEMBRANE ADAPTOR HA2/AP2 ADAPTIN BETA SUBUNIT CGDLLNLDLG 10 T 4 DUF4952 pdbhh F Eukaryota F 5m5s 2 C,D,E,F E,F,G,H AMPH_HUMAN Amphiphysin ETLLDLDFDP 10 T 1.7 DUF5331 pdbhh F Eukaryota T 5m5t 2 C,D,E,F,G,H E,F,G,H,I,J AMPH_RAT Amphiphysin ETLLDLDFLE 10 T 2.1 KN_motif pdbhh F Eukaryota F 5m5u 2 C,D,E,F E,F,G,H LHDAG_HDVIT L-HDAG,P27 SDILFPADS 9 T 2.6 Pyocin_S pdbhh T Viruses T 5m5v 2 C,D,E,F,G,H E,F,G,H,I,J A4ZNG7_HDV Large delta antigen SPRLPLLES 9 T 1.5 Ribosomal_S12 pdbhh T Viruses F 5m61 2 C,D,E,F E,F,G,H AMPH_HUMAN Amphiphysin ETLLDLDFDPFK 12 T 0.12 UPF0489 pdbhh F Eukaryota T 5m9d 2 B B RPB1_YEAST THR-SER-PRO-SEP-TYR-SEP-PRO-THR-SER-PRO-SEP-TYR-SEP-PRO-THR-SER TSPSYSPTSPSYSPTS 16 T 0.00011 RNA_pol_Rpb1_R pdbhh F Eukaryota F 5m9e 2 E,F,G,H E,F,G,H DIS1_SCHPO Phosphoprotein p93 RRSLAGSMLQKPTQFSRPSF 20 T 0.076 AF-4 pdbhh F Eukaryota T 5m9f 1 A,B,C A,B,C Q6Y7P9_BPPGK PUTATIVE RECEPTOR BINDING PROTEIN GSSHHHHHHSSGLVPRGSHMASMTGGQQMGRGPKDMEKYLLSSIRDDGSASFPLLVYTSDSKTFQQAIIDHIDRTGQTTFTFYVQGGVSGSPMSNSCRGLFMSDTPNTSSLHGVYNAIGTDGRNVTGSVVGSNWTSPKTSPSHKELWTGAQSFLSTGTTKNLSDDISNYSYVEVYTTHKTTEKTKGNDNTGTICHKFYLDGSGTYVCSGTFVSGDRTDTKPPITEFYRVGVSFKGSTWTLVDSAVQNSKTQYVTRIIGINMP 262 T 0.024 Ig_4 pdb T Viruses T 5m9n 2 C C E2F1_HUMAN E2F peptide SSGPARGXGXHPGKGVK 17 T 0.11 TP1 unp F Eukaryota T 5m9o 2 B B E2F1_HUMAN E2F peptide ARGXGXHPG 9 T 0.11 TP1 unp F Eukaryota T 5m9u 1 A A ANN1_AREMA Arenicin-1 RWCVYAYRRVRGVLVRYRRCW 21 T 1.8 CBP_BcsN pdbhh F Eukaryota T 5m9y 1 A A Harzianin HK-VI XXNIIXPLLXPX 12 T 5 DUF389 pdbhh F F 5mao 1 A,B A,B Q72GF3_THET2 Heat resistant RNA dependent ATPase GGMAERSLLTGEEGWRTYKATGPRLSLPRLVALLKGQGLEVGKVAEAEGGFYVDLRPEARPEVAGLRLEPA 71 T 0.00048 GUCT pdbpercent F Bacteria T 5mas 1 A A A0A1U7Q1Y9_9HYPO Bergofungin A XVXXXVGLXXPQXPXX 16 T 0.12 Pep_deformylase pdbhh F Eukaryota T 5mav 2 G,H,I,J,K,L G,H,K,L,N,M Q0MQR4_HUMAN Poly (ADP-ribose) glycohydrolase QHGKKDSKITDHFMRLPKA 19 T 0.32 DUF4334 pdbhh F Eukaryota T 5mb9 1 A,B A,B G0RZX9_CHATD Putative heat shock protein SAMGWSHPQFEKMAESASKAAPGERVVIGITFGNSNSSIAHTVDDKAEVIANEDGDRQIPTILSYVDGDEYYGQQAKNFLVRNPKNTVAYFRDILGQDFKSVDPTHNHASAHPQEAGDNVVFTIKDKAEEDAEPSTLTVSEIATRYLRRLVGAASEYLGKKVTSAVITIPTNFTEKQKAALIAAAAAADLEVLQLISEPAAAVLAYDARPEATISDKIIVVADLGGSRSDVTVLASRSGMYTILATVHDYEYHGIALDKVLIDHFSKEFLKKNPGAKDPRENPRSLAKLRLEAESTKRALSRSTNASFSVESLIDGLDFASTINRLRYETIARTVFEGFNRLVESAVKKAGLDPLDVDEVIMSGGTSNTPRIAANFRYIFPESTRILAPSTDPSALNPSELQARGAALQASLIQEFETEDIEQSTHAAVTTMPHVTNAIGVVSVSESGEEKFVPIIAPETAVPARRTVHLDAPKEGGDVLVKVVEGSTHINVIKPEPKAKEDGETKEKTEDADDDGDFDDDDEEEEEEEEEEEKREKVWKIGSTLAEAAVRGVKKGAKVEVTINVNTDLTVIVTAREVGGKGGVRGTLSA 590 T 0.053 DUF3221 pdbpercent F Eukaryota T 5mb9 2 C,D C,D G0RYD6_CHATD Putative ribosome associated protein GAMAEKDFKAIGKLTQEGSSMRTLEPVGPHFLAHARRVRHKRTFS 45 T 4.2 Suv3_N pdbhh F Eukaryota T 5mbw 2 B B BACE1 INHIBITOR PEPTIDE Pep#3 EVNXVAEXKX 10 T 18 Musclin pdbhh F T 5mc6 66 OB AZ uL1 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 210 F F F 5mco 2 B B BACE-1 EXOSITE PEPTIDE XALYPYFLPISAK 13 T 0.96 Suv3_N pdbhh F T 5mcq 2 B D BACE-1 ACTIVE AND EXOSITE BINDING INHIBITOR GGGYPYFIPXGXGEVNXVAEXX 22 T 3.3 Img2 pdbhh F T 5me3 2 C X unassigned sequence of Scc2 XXXXXXXXXXXXXXXXX 17 F F F 5me3 3 D Y Scc2 unassigned sequence XXXXXXXXXXXXXXXXXXXXXXXXXXX 27 F F F 5me3 4 E,F Z,W Scc2 unassigned sequence XXXXXXXXXXX 11 F F F 5me5 2 B B A0A1S3C4H6_CUCME eIF4G NEAIKEDAGALSKAEPDDWEDAADIATPDLESANGDGVGTSMLDSGDRTGDMAKKYSRDFLLKFAEQFLDLPHNFEVTSDIESLMSTHTN 90 T 0.00032 eIF_4G1 pdbhh F Eukaryota T 5mf3 1 A A A0A3B6UE78_9HYPO Harzianin HK-VI XXNIIXPLLXPX 12 T 5 DUF389 pdbhh F Eukaryota F 5mf8 1 A A A0A3B6UE78_9HYPO Harzianin HK-VI XXNIIXPLLXPX 12 T 5 DUF389 pdbhh F Eukaryota F 5mf9 2 B B COMPONENT OF GEMS 1,GEMIN-1 GMRPPPPGIRG 11 T 8.4 DUF4810 pdbhh F F 5mfe 2 E E (RR)4 RRRRRRRR 8 T 23 DUF6203 pdbhh F F 5mff 2 E,F E,F (RR)5 RRRRRRRRRR 10 T 24 Adeno_PX pdbhh F F 5mfg 2 E E (RR)4 RRRRRRRRRR 10 T 24 Adeno_PX pdbhh F F 5mfh 2 E,F E,F (RR)5 RRRRRRRRRR 10 T 24 Adeno_PX pdbhh F F 5mfi 2 C,D C,D (KR)4 KRKRKRKR 8 T 13 CDC27 pdbhh F F 5mfj 2 C,D C,D (KR)5 KRKRKRKRKR 10 T 3.8 RFX5_DNA_bdg pdbhh F F 5mfk 2 C,D C,D (KR)4 KRKRKRKR 8 T 13 CDC27 pdbhh F F 5mgx 1 A,B,C,D A,B,C,D HSP82_YEAST yeast HSP90 C-terminus DTEMEEVD 8 T 15 CHZ pdbhh F Eukaryota F 5mhc 2 B P P53_HUMAN LYS-LEU-MET-PHE-LYS-TPO-GLU-GLY-PRO-ASP-SER-ASP KLMFKTEGPDSD 12 T 44 CopB pdbhh F Eukaryota T 5mhk 4 I J ICP4_HHV11 ICP4 DNA BINDING DOMAIN FTAG 4 T 160 FANCA_interact pdbhh T Viruses F 5mhl 2 C,D K,O inhibitor Mi0621 XFXVKVIGX 9 T 9.3 DUF5580 pdbhh F T 5mhm 2 C,D H,O inhibitor ZED1630 XXXILPWX 8 T 14 TraV pdbhh F F 5mhn 2 C,D H,I inhibitor ZED2360 XXLILPWP 8 T 19 DUF4893 pdbhh F F 5mho 2 C,D G,H inhibitor ZED2369 XXLILPWPX 9 T 25 TraV pdbhh F F 5miy 1 A,B,C A,B,C E3 ubiquitin ligase RavN GAMGSMPTYFDPIMQEDTVLDENTIVYLVKIGDNKFSIKAISSGLEHLPSDPTTHAEKYWPIPAKSLIDHSSNKLLFEEDKLTNQPISKDQVIELFAVDPDKTEPKQFSDSVKRELTENWAREVLQDQ 128 T 0.74 GET2 unppssm F T 5mjy 2 E,F E,F ZFYV9_HUMAN MOTHERS AGAINST DECAPENTAPLEGIC HOMOLOG-INTERACTING PROTEIN,MADH-INTERACTING PROTEIN,NOVEL SERINE PROTEASE,NSP,RECEPTOR ACTIVATION ANCHOR,HSARA,SMAD ANCHOR FOR RECEPTOR ACTIVATION MENYFQAEAYNLDKVLDEFEQN 22 T 6.4 OmpH pdbhh F Eukaryota T 5mk0 2 B,D B,D ZFY16_HUMAN ENDOFIN,ENDOSOME-ASSOCIATED FYVE DOMAIN PROTEIN MDSYFKAAVSDLDKLLDDFEQN 22 T 1.2 OmpH pdbhh F Eukaryota T 5mk1 2 E,F,G,H E,F,H,K CHM4A_HUMAN CHROMATIN-MODIFYING PROTEIN 4A,CHMP4A,SNF7 HOMOLOG ASSOCIATED WITH ALIX-2,SNF7-1,HSNF-1,VACUOLAR PROTEIN SORTING-ASSOCIATED PROTEIN 32-1,HVPS32-1 PKVDEDEEALKQLAEWVS 18 T 1.8 ZapA pdbhh F Eukaryota T 5mk2 2 C C CHM4B_HUMAN CHROMATIN-MODIFYING PROTEIN 4B,CHMP4B,SNF7 HOMOLOG ASSOCIATED WITH ALIX 1,SNF7-2,HSNF7-2,VACUOLAR PROTEIN SORTING-ASSOCIATED PROTEIN 32-2,HVPS32-2 KKKEEEDDDMKELENWAGSM 20 T 1.7 TMEM154 pdbhh F Eukaryota T 5mk3 2 E,F,G,H E,F,G,H CHM4C_HUMAN CHROMATIN-MODIFYING PROTEIN 4C,CHMP4C,SNF7 HOMOLOG ASSOCIATED WITH ALIX 3,SNF7-3,HSNF7-3,VACUOLAR PROTEIN SORTING-ASSOCIATED PROTEIN 32-3,HVPS32-3 QRAEEEDDDIKQLAAWAT 18 T 1.1 Ribosomal_60s unppssm F Eukaryota T 5mku 2 B B HIS-LYS-ILE-LEU-HIS-ARG-LEU-LEU-GLN-ASP-SER HKILHRLLQDS 11 T 0.0023 SRC-1 pdb F T 5mlo 2 B,D,F B,D,F ZRAB3_HUMAN ZRANB3 PIP box peptide EKEKQHDIRSFFVPQ 15 T 0.95 DUF4651 pdbhh F Eukaryota T 5mlw 2 B,D,F B,D,F ZRAB3_HUMAN APIM motif peptide GSDITRFLVKK 11 T 0.15 DUF3460 pdbhh F Eukaryota T 5mm2 1 A A CAPS4_NORAV capsid protein VP4C SLPENAPNAVSNPQQFITPATALSAEEYNVHEALGETEELELDEFPVLVFKGNVPVDSVTSIPLDLATIYDFAWDGEQNAISQKFQRFAHLIPKSAGGFGPVIGNYTITANLPTGVAGRILHNCLPGDCVDLAVSRIFGLKSLLGVAGTAVSAIGGPLLNGLVNTAAPILSGAAHAIGGNVVGGLADAVIDIGSNLLTPKEKEQPSANSSAISGDIPISRFVEMLKYVKENYQDNPVFPTLLVEPQNFISNAMTALKTIPIEVFANMRNVKVERNLFDRTVVPTVKEATLADIVIPNHMYGYILRDFLQNKRAFQSGTKQNVYFQQFLTVLSQRNIRTHITLNDITSCSIDSESIANKIERVKHYLSTNSSGETTEEFSRTDTGLLPITTRKIVLGESKRRTERYVAETVFPSVRQ 416 T 0.096 Iso_dh pdb T Viruses T 5mm2 2 B B CAPS4_NORAV capsid protein VP4B ADNEVTAEGGKLVQELVYDHSAIPVAPVVETQAEQPEVPVSLVATRKNDTGHLATKWYDFAKISLSNPANMNWTTLTIDPYNNVTLSRDGESMVLPWRRNVWTTGSKSIGYIRTMVAQINIPRPPQISGVLEVKDSINNSSISLVEFGGKVEIPIIPKVMNGLATTASLPRHRLNPWMRTAESKVELQYRIIAFNRTSDIADLNVSVLLRPGDSQFQLPMKPDNNVDTRHFELVEALMYHYDSLRIRGEEQ 251 T 0.24 Waikav_capsid_1 pdbhh T Viruses T 5mm2 3 C C D2WFA0_NORAV Capsid protein VP4A MQNPTQTMHIYDMPLRVIAGLSTLAKTTEEDDNTSTGIVVSEVGEPQVVNHPAWIDPFVAYQLRAPRKNITPDFIFGRADIGNAFSAFLPRRFSAPAVGTRLVVDPVFTYQQRTVLGLYNYFHADFYYIVHVPAPLGTGIYLKIYAPEFDTTTVTRGIRFKPSASPTIALSVPWSNDLSTVETSVGRVGQSGGSIVIETIEDNSNETVNTPLSITVWCCMANIKATGYRHADTSAYNEKGMNFIPVPVPKPPVPPTKPITGEEQ 264 T 0.0028 Waikav_capsid_1 pdbhh T Viruses T 5mmi 7 G 6 PSRP5_SPIOL plastid ribosomal protein cL37, PSRP5 MALLSPLLSLSSVPPITSIAVSSSSFPIKLQNVSVALLPTLGQRLMTHGPVIAQKRGTVVAMVSAAADETAGEDGDQSKVEEANISVQNLPLESKLQLKLEQKMKMKMAKKIRLRRNRLMRKRKLRKRGAWPPSKMKKLKNV 142 T 0.084 DUF3381 pdbpssm F Eukaryota T 5mmj 2 B 8 plastid ribosomal protein bS1c XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 174 F F F 5mmk 1 A A GLY-ILE-LEU-SER-SER-LEU-TRP-LYS-LYS-LEU-LYS-LYS-ILE-ILE-ALA-LYS GILSSLWKKLKKIIAKX 17 T 3.6 TnpW pdbhh F T 5mml 1 A A GLY-ILE-LEU-SER-SER-LEU-TRP-LYS-LYS-LEU-LYS-LYS-ILE-ILE-ALA-LYS GILSSLWKKLKKIIAKX 17 T 3.6 TnpW pdbhh F T 5mmm 7 G 6 PSRP5_SPIOL plastid ribosomal protein cL37, PSRP5 MALLSPLLSLSSVPPITSIAVSSSSFPIKLQNVSVALLPTLGQRLMTHGPVIAQKRGTVVAMVSAAADETAGEDGDQSKVEEANISVQNLPLESKLQLKLEQKMKMKMAKKIRLRRNRLMRKRKLRKRGAWPPSKMKKLKNV 142 T 0.084 DUF3381 pdbpssm F Eukaryota T 5mmm 36 JA 8 plastid ribosomal protein bS1c XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 174 F F F 5mn3 1 A A domain-swapped metallothionein from Littorina Littorea GSMSSVFGAGCTDTCKQTPCGCGSGCNCKEDCRCQSCKYGAGCTDVCKQTPCGCATSGCNCTDDCKCQSCSTACKCAAGSCKCGKGCTGPDSCKCDRSCSCK 102 T 0.00054 Metallothio_Euk pdbpercent F T 5mn9 2 C C MINY1_HUMAN DEUBIQUITINATING ENZYME MINDY-1,PROTEIN FAM63A GPLGSQVDQDYLIALSLQQQQPRGPLGLTDLELAQQLQQEEYQQ 44 T 0.22 CCDC50_N pdbhh F Eukaryota T 5moc 2 B P P53_HUMAN p53 C-terminal domain KLMFKTEGPDSD 12 T 44 CopB pdbhh F Eukaryota T 5mps 22 V X Unknown XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 68 F F F 5mq0 22 V X UNKNOWN PROTEIN XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 68 F F F 5mrc 71 SB 55 RT13_YEAST 37S RIBOSOMAL PROTEIN MRP13, MITOCHONDRIAL,YMS-A MGTITVVINEGPILLIRALHRATTNKKMFRSTVWRRFASTGEIAKAKLDEFLIYHKTDAKLKPFIYRPKNAQILLTKDIRDPKTREPLQPRPPVKPLSKQTLNDFIYSVEPNSTELLDWFKEWTGTSIRKRAIWTYISPIHVQKMLTASFFKIGKYAHMVGLLYGIEHKFLKAQNPSVFDIEHFFNTNIMCALHRNRLKDYKDAEIAQRKLQVAWKKVLNRKNNTGLANILVATLGRQIGFTPELTGLQPVDISLPDIPNSSSGAELKDLLSKYEGIYLIARTLLDIDQHNAQYLELQEFIRQYQNALSESSDPYDTHLKALGLLETPPPQESTEKEEK 339 T 32 NMU unphh F Eukaryota T 5mrc 73 UB 77 RSM28_YEAST 37S RIBOSOMAL PROTEIN RSM28, MITOCHONDRIAL SSEYVLEEPTPLSLLEYTPQVFPTKESRLVNFTLDSLKKSNYPIYRSPNLGILKVHDFTLNTPNFGKYTPGSSLIFAKEPQLQNLLIEEDPEDFHRQVTGEYQLLKPYVKKDFEKLTKSKDTVSKLVQNSQVVRLSLQSVVMGSEEKKLVYDVCSGMKPISELQQ 165 T 0.33 bCoV_NS6 pdbpercent F Eukaryota T 5mrc 77 YB cc unknown protein sequence 1 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 94 F F F 5mrc 78 ZB dd unknown protein sequence 2 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 151 F F F 5mre 71 SB 55 RT13_YEAST 37S RIBOSOMAL PROTEIN MRP13, MITOCHONDRIAL,YMS-A MGTITVVINEGPILLIRALHRATTNKKMFRSTVWRRFASTGEIAKAKLDEFLIYHKTDAKLKPFIYRPKNAQILLTKDIRDPKTREPLQPRPPVKPLSKQTLNDFIYSVEPNSTELLDWFKEWTGTSIRKRAIWTYISPIHVQKMLTASFFKIGKYAHMVGLLYGIEHKFLKAQNPSVFDIEHFFNTNIMCALHRNRLKDYKDAEIAQRKLQVAWKKVLNRKNNTGLANILVATLGRQIGFTPELTGLQPVDISLPDIPNSSSGAELKDLLSKYEGIYLIARTLLDIDQHNAQYLELQEFIRQYQNALSESSDPYDTHLKALGLLETPPPQESTEKEEK 339 T 32 NMU unphh F Eukaryota T 5mre 73 UB 77 RSM28_YEAST 37S RIBOSOMAL PROTEIN RSM28, MITOCHONDRIAL SSEYVLEEPTPLSLLEYTPQVFPTKESRLVNFTLDSLKKSNYPIYRSPNLGILKVHDFTLNTPNFGKYTPGSSLIFAKEPQLQNLLIEEDPEDFHRQVTGEYQLLKPYVKKDFEKLTKSKDTVSKLVQNSQVVRLSLQSVVMGSEEKKLVYDVCSGMKPISELQQ 165 T 0.33 bCoV_NS6 pdbpercent F Eukaryota T 5mre 77 YB cc unknown protein sequence 1 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 94 F F F 5mre 78 ZB dd unknown protein sequence 2 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 151 F F F 5mrf 71 SB 55 RT13_YEAST 37S RIBOSOMAL PROTEIN MRP13, MITOCHONDRIAL,YMS-A MGTITVVINEGPILLIRALHRATTNKKMFRSTVWRRFASTGEIAKAKLDEFLIYHKTDAKLKPFIYRPKNAQILLTKDIRDPKTREPLQPRPPVKPLSKQTLNDFIYSVEPNSTELLDWFKEWTGTSIRKRAIWTYISPIHVQKMLTASFFKIGKYAHMVGLLYGIEHKFLKAQNPSVFDIEHFFNTNIMCALHRNRLKDYKDAEIAQRKLQVAWKKVLNRKNNTGLANILVATLGRQIGFTPELTGLQPVDISLPDIPNSSSGAELKDLLSKYEGIYLIARTLLDIDQHNAQYLELQEFIRQYQNALSESSDPYDTHLKALGLLETPPPQESTEKEEK 339 T 32 NMU unphh F Eukaryota T 5mrf 73 UB 77 RSM28_YEAST 37S RIBOSOMAL PROTEIN RSM28, MITOCHONDRIAL SSEYVLEEPTPLSLLEYTPQVFPTKESRLVNFTLDSLKKSNYPIYRSPNLGILKVHDFTLNTPNFGKYTPGSSLIFAKEPQLQNLLIEEDPEDFHRQVTGEYQLLKPYVKKDFEKLTKSKDTVSKLVQNSQVVRLSLQSVVMGSEEKKLVYDVCSGMKPISELQQ 165 T 0.33 bCoV_NS6 pdbpercent F Eukaryota T 5mrf 77 YB cc unknown protein sequence 1 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 94 F F F 5mrf 78 ZB dd unknown protein sequence 2 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 151 F F F 5mrv 2 C C MCPI_NERVS MCPI,CARBOXYPEPTIDASE INHIBITOR,NVCI FHVPDDRPCINPGRCPLVPDATCTFVCKAADNDFGYECQHVWTFEGQRVGCYA 53 T 0.88 NPBW pdbhh F Eukaryota T 5ms2 1 A A Q5ZUV9_LEGPH Legionella pneumophila effector protein RavZ GPMKGKLTGKDKLIVDEFEELGEQESDIDEFDLLEGDEKLPGDSELDKTTSIYPPETSWEVNKGMNSSRLHKLYSLFFDKSSAFYLGDDVSVLEDKPLTGAYGFQSKKNDQQIFLFRPDSDYVAGYHVDAKSDAGWVNDKLDRRLSEISEFCSKATQPATFILPFVEMPTDITKGVQHQVLLTISYDPKSKQLTPTVYDSIGRDTYSESLSSYFKGKYRTTCDEILTQSIEKAIKSTDFTLGKFTRAAYNHQNRLTEGNCGSYTFRTIKEVISSSAQGTEVKIPGSGYITSNSYLTSQHVQDIESCIKYRNLGVVDIESALTEGKTLPVQLSEFIVALEDYGKLRSQQSEKSMLNFIGYSKTAKLTAVELLIGILNDIKGKNEISESQYDKLVKEVDCLMDSSLGKLVQFHLKNLGAESLQKLVLPCVKFDDT 433 T 8.9 DUF438 pdbhh F Bacteria T 5ms4 2 E,F,G,H E,F,G,H LEUPEPTIN XLLX 4 T 590 DNTTIP1_dimer pdbhh F F 5ms7 1 A A Q5ZUV9_LEGPH Legionella pneumophila effector protein RavZ GPGEQESDIDEFDLLEGDEKLPGDSELDKTTSIYPPETSWEVNKGMNSSRLHKLYSLFFDKSSAFYLGDDVSVLEDKPLTGAYGFQSKKNDQQIFLFRPDSDYVAGYHVDAKSDAGWVNDKLDRRLSEISEFCSKATQPATFILPFVEMPTDITKGVQHQVLLTISYDPKSKQLTPTVYDSIGRDTYSESLSSYFKGKYRTTCDEILTQSIEKAIKSTDFTLGKFTRAAYNHQNRLTEGNCGSYTFRTIKEVISSSAQGTEVKIPGSGYITSNSYLTSQHVQDIESCIKYRNLGVVDIESALTEGKTLPVQLSEFIVALEDYGKLRSQQSEKSMLNFIGYSKTAKLTAVELLIGILNDIKGKNEISESQYDKLVKEVDCLMDSSLGKLVQFHLKNLGAESLQKLVLPCVKFDDTIDDFVTIEKDELFDVPDITGEELASKKGIEQGALDKEALLKQKQIKTDLLDLREEDKTGLKKPLHGGIKVK 485 T 15 DUF438 pdbhh F Bacteria T 5ms8 1 A A Q5ZUV9_LEGPH Legionella pneumophila effector protein RavZ GPMKGKLTGKDKLIVDEFEELGEQESDIDEFDLLEGDEKLPGDSELDKTTSIYPPETSWEVNKGMNSSRLHKLYSLFFDKSSAFYLGDDVSVLEDKPLTGAYGFQSKKNDQQIFLFRPDSDYVAGYHVDAKSDAGWVNDKLDRRLSEISEFCSKATQPATFILPFVEMPTDITKGVQHQVLLTISYDPKSKQLTPTVYDSIGRDTYSESLSSYFKGKYRTTCDEILTQSIEKAIKSTDFTLGKFTRAAYNHQNRLTEGNCGSYTFRTIKEVISSSAQGTEVKIPGSGYITSNSYLTSQHVQDIESCIKYRNLGVVDIESALTEGKTLPVQLSEFIVALEDYGKLRSQQSEKSMLNFIGYSKTAKLTAVELLIGILNDIKGKNEISESQYDKLVKEVDCLMDSSLGKLVQFHLKNLGAESLQKLVLPCVKFDDTIDDFVTIEKDELFDVPDITGEELASKKGIEQGALDKEALLKQKQIKTDLLDLREED 489 T 17 DUF438 pdbhh F Bacteria T 5msm 3 C,F C,F CTF18_YEAST Chromosome transmission fidelity protein 18 SGKVKTGLNSSSSTIDFFKNQYGLLKQTQELEETQKTIGSDETNQADDCNQTVKIWVKYNEGFSNAVRKNVTWNNLWE 78 T 2 Jnk-SapK_ap_N unppssm F Eukaryota T 5mt6 2 B B ACE-ARG-VAL-ARG-HIS-ALA-V9C XRVRHAX 7 T 120 Cecropin pdbhh F F 5mt7 2 B B ACE-VAL-ARG-HIS-ALA-0QE XVRHAX 6 T 330 NnrU pdbhh F F 5mt8 2 B B ACE-ARG-VAL-ARG-HIS-ALA-0QE XRVRHAX 7 T 120 Cecropin pdbhh F F 5mtf 2 B C inhibitor XRVRHX 6 T 170 VPS38 pdbhh F F 5mtw 2 E,F,G E,F,G HIGA1_MYCTU Antitoxin HigA1 EVPTWHRLSSYRG 13 T 2.6 WW_like pdbhh F Bacteria T 5mu0 3 Q,R,S,T,U,V,W,X Q,R,S,T,U,V,W,X CO2A1_MOUSE ALPHA-1 TYPE II COLLAGEN GPPGARGLTGXPGDAGPP 18 T 0.0042 Collagen unppercent F Eukaryota T 5mu2 3 Q,R,S,T,U,V,W,X X,Q,R,S,T,U,V,W synthetic peptide containing the CII583-591 epitope of collagen type II GPPGPPGPPGPPGGRGLTGPIGPPGPPGPP 30 T 0.00036 Collagen pdb F T 5mu3 2 B B CENPP_KLULA CHROMOSOME TRANSMISSION FIDELITY PROTEIN 19 MNIEQRKKYLDITLNDVTVTCEKDMILLRKGSFTASFRIAVENESIRSMAIDLNAFEVELQPIIQYAEDTQNVNVAMMAVVQFLRIKELHEQMISKIVEASKFIRASNNTITLNDLEVSFHCYWNLPSPYPETLILTNKVQKILDFLIYQYGIQLGVIKYGSTII 165 T 0.0022 CENP-P unphh F Eukaryota T 5mu3 4 E E CENPP_KLULA CHROMOSOME TRANSMISSION FIDELITY PROTEIN 19 MNIEQRKKYLDITLNDVTVTCEKDMILLRKGSFTASFRIAVENESIRSMAIDLNAFEVELQPIIQYAEDTQNVNVAMMAVVQFLRIKELHEQMISKIVEASKFIRASNNTITLNDLEVSFHCYWNLPSPYPETLILTNKVQKILDFLIYQYGIQLGVIKYGSTII 165 T 0.0022 CENP-P unphh F Eukaryota T 5muu 1 A,B A,B P1_BPPH6 Major inner protein P1 MFNLKVKDLNGSARGLTQAFAIGELKNQLSVGALQLPLQFTRTFSASMTSELLWEVGKGNIDPVMYARLFFQYAQAGGALSVDELVNQFTEYHQSTACNPEIWRKLTAYITGSSNRAIKADAVGKVPPTAILEQLRTLAPSEHELFHHITTDFVCHVLSPLGFILPDAAYVYRVGRTATYPNFYALVDCVRASDLRRMLTALSSVDSKMLQATFKAKGALAPALISQHLANAATTAFERSRGNFDANAVVSSVLTILGRLWSPSTPKELDPSARLRNTNGIDQLRSNLALFIAYQDMVKQRGRAEVIFSDEELSSTIIPWFIEAMSEVSPFKLRPINETTSYIGQTSAIDHMGQPSHVVVYEDWQFAKEITAFTPVKLANNSNQRFLDVEPGISDRMSATLAPIGNTFAVSAFVKNRTAVYEAVSQRGTVNSNGAEMTLGFPSVVERDYALDRDPMVAIAALRTGIVDESLEARASNDLKRSMFNYYAAVMHYAVAHNPEVVVSEHQGVAAEQGSLYLVWNVRTELRIPVGYNAIEGGSIRTPEPLEAIAYNKPIQPSEVLQAKVLDLANHTTSIHIWPWHEASTEFAYEDAYSVTIRNKRYTAEVKEFELLGLGQRRERVRILKPTVAHAIIQMWYSWFVEDDRTLAAARRTSRDDAEKLAIDGRRMQNAVTLLRKIEMIGTTGIGASAVHLAQSRIVDQMAGRGLIDDSSDLHVGINRHRIRIWAGLAVLQMMGLLSRSEAEALTKVLGDSNALGMVVATTDIDPSL 769 T 0.22 STAG pdb T Viruses T 5muu 3 D,E,F,G,H,I,J,K,L,M D,E,F,G,H,I,J,K,L,M CAPSD_BPPH6 PROTEIN P8 MLLPVVARAAVPAIESAIAATPGLVSRIAAAIGSKVSPSAILAAVKSNPVVAGLTLAQIGSTGYDAYQQLLENHPEVAEMLKDLSFKADEIQPDFIGNLGQYREELELVEDAARFVGGMSNLIRLRQALELDIKYYGLKMQLNDMGYRS 149 T 2.7 DnaI_N pdbhh T Viruses T 5muz 1 A,B A,B J7HBG8_9VIRU L protein GPHADGDQNLFDYQFTGTPEEPIKGYWTTTISYRDSKPKISLTIRQEFVEGGVESQAVLATVVGRPHLQDFLLLKRKHLEYSDYPESIDLIEFGDVKVIEKTV 103 T 1.8 Viral_Rep pdbhh T Viruses T 5mv3 3 C,F,I,L,O,R,U,X X,E,H,K,N,Q,T,W CO2A1_MOUSE ALPHA-1 TYPE II COLLAGEN,ALPHA-1 TYPE II COLLAGEN,ALPHA-1 TYPE II COLLAGEN GPPGPPGPPGPPGGRGLTGPIGPPGPPGPP 30 T 0.00036 Collagen pdb F Eukaryota T 5mwe 2 B,C C,D CNN_DROME PROTEIN ARROW GPMDQQNSAVIGQLRLELQQARTEVETADKWRLECIDVCSVLTNRLEELAGFLNSLLKHKDVLGVLAADRRNAMRKAVDRS 81 T 0.0034 FPP unphh F Eukaryota T 5mwp 2 B B NCOA1_HUMAN NCOA1 peptide PQAQQKSLLQQLLTE 15 T 1.8 GFD1 pdbhh F Eukaryota T 5mwy 2 B B NCOA1_HUMAN NCOA1 PQAQQKSLLQQLLTE 15 T 1.8 GFD1 pdbhh F Eukaryota T 5mxo 2 B P p53 C-terminal 12 amino acids FKT 3 T 240 EF-hand_1 pdbhh F F 5mxs 1 A A TRP-TYR-HIS-ARG-LEU-SER-HIS-LEU-HIS-SER-ARG-LEU-GLN-ASP-NH2 WYHRLSHLHSRLQDX 15 T 8.1 FA_hydroxylase pdbhh F T 5mxt 1 A A TRP-TYR-HIS-ARG-LEU-SER-HIS-ILE-HIS-SER-ARG-LEU-GLN-ASP-NH2 WYHRLSHIHSRLQDX 15 T 3.1 Endotoxin_M pdbhh F T 5my9 2 B P LRRK2_HUMAN DARDARIN LQRHSNSLGPIFD 13 T 15 DCP1 pdbhh F Eukaryota T 5myc 2 B P LRRK2_HUMAN DARDARIN VKKKSNSISVGEFYRDAVLQRCSPNLQRHSNSLGPIFD 38 T 48 US30 pdbhh F Eukaryota T 5mz6 2 B B IFY1_CAEEL Interactor of FizzY protein MEDLNFEERGSTQIPASLQQHFSAKLGRQNELEKTPSRGGLGLVVNSSKTPGGKSLQSLASACKVPPSTKKNTIPIAFECYEDETDDQIADVATIKKTEKHPCSPIDTANRCETFDSLAADIEDDMLNLEDQDVVLSEDRPYGDVIDPAESEAEALAELGVEEWDSYPPIDPASRIGDDFNYVLRTEDFAEEGDVKLEETRHRTVIADIDEVKMSKAERNELFSMLADDLDSYDLLAEEANLPL 244 T 35 Sgf11_N pdbhh F Eukaryota T 5mzl 2 B B Pimtide ARKRRRHPSGPPTA 14 T 1.5 DUF3019 pdbhh F T 5mzm 3 C,F C,F Ceramide synthase 5 derived peptide Trh4 p3P MCPRMTAVM 9 T 6.8 Mob1_phocein pdbhh F T 5n1y 3 C C MVWGPDPLYV MVWGPDPLYV 10 T 0.74 Tachykinin pdbhh F T 5n4b 2 C,D D,C AAMA1_GALM3 Alpha-amanitin proprotein IWGIGCNPWTAEHVDQTLASGNDIC 25 T 1.1 Sld7_N unphh F Eukaryota T 5n4c 2 B,F,G,H E,F,G,H AAMA1_GALM3 Alpha-amanitin proprotein MFDTNATRLPIWGIGCNPWTAEHVDQTLASGNDIC 35 T 1.1 Sld7_N pdbhh F Eukaryota T 5n4d 2 C,D C,D AAMA1_GALM3 Alpha-amanitin proprotein IWGIGCNPWTAEHVDQTLASGNDIC 25 T 1.1 Sld7_N unphh F Eukaryota T 5n4e 2 C,D C,D AAMA1_GALM3 Alpha-amanitin proprotein MFDTNATRLPIWGIGCNPWTAEHVDQTLASGNDIC 35 T 1.1 Sld7_N pdbhh F Eukaryota T 5n4n 2 B B Pimtide ARKRRRHPSGPPTA 14 T 1.5 DUF3019 pdbhh F T 5n4o 2 B B Pimtide ARKRRRHPSGPPTA 14 T 1.5 DUF3019 pdbhh F T 5n4r 2 B E Pimtide ARKRRRHPSGPPTA 14 T 1.5 DUF3019 pdbhh F T 5n4u 2 B B Pimtide ARKRRRHPSGPPTA 14 T 1.5 DUF3019 pdbhh F T 5n4v 2 B D Pimtide ARKRRRHPSGPPTA 14 T 1.5 DUF3019 pdbhh F T 5n4x 2 B B Pimtide ARKRRRHPSGPPTA 14 T 1.5 DUF3019 pdbhh F T 5n4y 2 B B Pimtide ARKRRRHPSGPPTA 14 T 1.5 DUF3019 pdbhh F T 5n4z 2 B B Pimtide ARKRRRHPSGPPTA 14 T 1.5 DUF3019 pdbhh F T 5n50 2 B B Pimtide ARKRRRHPSGPPTA 14 T 1.5 DUF3019 pdbhh F T 5n51 2 B B Pimtide ARKRRRHPSGPPTA 14 T 1.5 DUF3019 pdbhh F T 5n52 2 B B Pimtide ARKRRRHPSGPPTA 14 T 1.5 DUF3019 pdbhh F T 5n5l 2 B B Pimtide ARKRRRHPSGPPTA 14 T 1.5 DUF3019 pdbhh F T 5n5m 2 B B Pimtide ARKRRRHPSGPPTA 14 T 1.5 DUF3019 pdbhh F T 5n5r 2 B P WWTR1_HUMAN TAZ pS89 peptide SHSSPASLQ 9 T 0.13 TFIIA unppercent F Eukaryota T 5n5t 2 B P WWTR1_HUMAN TAZ pS89 peptide SHSSPASL 8 T 0.13 TFIIA unppercent F Eukaryota F 5n5w 2 B P WWTR1_HUMAN TAZ pS89 peptide RSHSSPASLQ 10 T 0.13 TFIIA unppercent F Eukaryota T 5n75 2 B P WWTR1_HUMAN TAZ PS89 PEPTIDE RSHSSPASLQ 10 T 0.13 TFIIA unppercent F Eukaryota T 5n7b 2 B I APD(CG6)RP(NH2) peptide APDXRPX 7 T 19 EIAV_GP90 pdbhh F F 5n7d 2 C C KS6A1_HUMAN S6K-ALPHA-1,90 KDA RIBOSOMAL PROTEIN S6 KINASE 1,P90S6K,MAP KINASE-ACTIVATED PROTEIN KINASE 1A,MAPKAPK-1A,RIBOSOMAL S6 KINASE 1,RSK-1 GSQDLQLVKGAMAATYSALNSSKPTPQLKPIESSILAQRRVRKLPSTTL 49 T 0.026 Vitelline_membr pdbpssm F Eukaryota T 5n7f 2 C C KS6A1_HUMAN S6K-ALPHA-1,90 KDA RIBOSOMAL PROTEIN S6 KINASE 1,P90S6K,MAP KINASE-ACTIVATED PROTEIN KINASE 1A,MAPKAPK-1A,RIBOSOMAL S6 KINASE 1,RSK-1 GSQDLQLVKGAMAATYSALNSSKPTPQLKPIESSILAQRRVRKLPSTTL 49 T 0.026 Vitelline_membr pdbpssm F Eukaryota T 5n7g 2 C C KS6A1_HUMAN S6K-ALPHA-1,90 KDA RIBOSOMAL PROTEIN S6 KINASE 1,P90S6K,MAP KINASE-ACTIVATED PROTEIN KINASE 1A,MAPKAPK-1A,RIBOSOMAL S6 KINASE 1,RSK-1 KLPSTTL 7 T 68 Baculo_8kDa pdbhh F Eukaryota F 5n7q 2 C,D I,J PEPSTATIN A XVVXAX 6 T 1700 FAM60A pdbhh F F 5n7x 2 C,D,G,I,J,L,N,P C,D,G,I,J,L,N,P GLU-TRP-VAL-HIS-PRO-GLN-PHE-GLU-GLN-LYS-ALA-LYS Peptide EWVHPQFEQKAK 12 T 0.052 RHINO pdbhh F T 5n85 2 B B PRIPO_HUMAN HPRIMPOL1,COILED-COIL DOMAIN-CONTAINING PROTEIN 111 DNGIDDAYFLEATED 15 T 25 DUF5302 pdbhh F Eukaryota T 5n89 2 C,E,G,I,J,L,N,P C,E,G,I,J,L,N,P GLY-ASN-SER-PHE-ASP-ASP-TRP-LEU-ALA-SER-LYS-GLY-NH2 GNSFDDWLASKGX 13 T 0.86 CnrY pdbhh F T 5n8a 1 A X PRIPO_HUMAN HPRIMPOL1,COILED-COIL DOMAIN-CONTAINING PROTEIN 111 MGSSHHHHHHSSGLVPRGSHMTTDEADETRSNETQNPHKPSPSRLSTGASADAVWDNGIDDAYFLEATEDAELAEAAENSLLSYNSEVDEIPDELIIEVLQE 102 T 31 Hydin_ADK pdbhh F Eukaryota T 5n8b 2 B,D,F,H E,C,F,H ALA-PHE-PRO-ASP-TYR-LEU-ALA-GLU-TYR-HIS-GLY-GLY-NH2 AFPDYLAEYHGGX 13 T 1.6 Not1 pdbhh F T 5n8e 2 E,F,G E,F,H ARG-ASP-PRO-ALA-PRO-ALA-TRP-ALA-HIS-GLY-GLY-GLY-NH2 RDPAPAWAHGGGX 13 T 5.7 DUF2591 pdbhh F T 5n8j 2 E,F,G P,O,E GLY-DTY-GLY-DLE-DAL-DSG-DVA-DAS-DGL-DSN-DSN-GLY GXGXXXXXXXXG 12 F F F 5n8t 2 B B DLE-DTR-DGN-DHI-DGL-DAL-DTH-DTR-DLY XXXXXXXXX 9 F F F 5n8w 2 C,D C,D GLY-GLY-DTR-DHI-DAS-DGL-DAL-DTH-DTR-DLY GGXXXXXXXXXGLY 14 F F F 5n91 2 C,D,E E,F,G Ac-[2-Cl-F]-PPPP-OH XXPPPP 6 T 100 PDCD7 pdbhh F F 5n99 2 C,E,F,H,J,L,N,P,R,T,V,W C,E,F,H,J,L,N,P,R,T,V,X ASN-GLN-DPR-TRP-GLN NQXWQ 5 T 22 Eco57I pdbhh F F 5n9c 2 C,D,E,F F,G,H,M Ac-[2-Cl-F]-PP-[ProM-1]-OH XXPPX 5 T 1000 MT-A70 pdbhh F F 5n9h 1 A,B,C,D A,B,C,M Porin LGNY 4 T 77 DUF5613 pdbhh F F 5n9i 1 A,B,C,D A,B,C,D E3U904_PROST Porin 1 GVVTSE 6 T 140 DUF5487 pdbhh F Bacteria F 5n9p 2 C,D,E C,D,E Ac-[2-Cl-F]-PP-[ProM-1]-NH2 XXPPXX 6 T 1300 Fer4_6 pdbhh F F 5naf 2 E,F E,F MECP2_MOUSE MECP2 KKAVKESSIRSVHETVLPIKKRKTR 25 T 0.57 Humanin pdbhh F Eukaryota T 5nam 1 A A TLR4_HUMAN HTOLL MNITSQMNKTIIGVSVLSVLVVSVVAVLVYKFYFHLMLLAGCIKYGRG 48 T 0.082 Serinc pdbpssm F Eukaryota T 5nao 1 A A TLR4_HUMAN HTOLL MNITSQMNKTIIGVSVLSVLVVSVVAVLVYKFYFH 35 T 0.014 Phageshock_PspG pdb F Eukaryota T 5nas 2 C,D C,D PI4KB_HUMAN PTDINS 4-KINASE BETA,NPIK,PI4K92 LKRTASNPK 9 T 1.4 RE_HindIII pdbhh F Eukaryota T 5nbl 4 G,H G,H Unknown peptide XXXXXXXXXXXXXXXXXXXX 20 F F F 5nbm 4 G,H G,H Unknown peptide XXXXXXXXXXXXXXXXX 17 F F F 5nbx 2 C,D C,D Ac-[2-Cl-F]-PP-[ProM-9]-OH XPPX 4 T 740 Eco57I pdbhh F F 5ncl 3 C D SSD1_YEAST PROTEIN SRK1 TTEQSDFKFP 10 T 23 Interferon pdbhh F Eukaryota T 5ncm 2 B B CBK1_YEAST CELL WALL BIOSYNTHESIS KINASE GSASSPVQSGFNNGTISNYMYFERRPDLLTKGTQDKAAAVKLKIENFYQSSVKYAIERNERRVELETELTSHNWSEERKSRQLSSLGKKESQFLRLRRTRLSLED 105 T 0.076 DNA_pol_D_N pdb F Eukaryota T 5ncn 2 B B DBF2_YEAST DUMBBELL FORMING PROTEIN 2 GSASKKLPPKFYERATSNKTQRVVSVCKMYFLEHYCDMFDYVISRRQRTKQVLEYLQQQSQLPNSDQIKLNEEWSSYLQREHQVLRKRRLKPK 93 T 0.0039 PPPI_inhib pdbpercent F Eukaryota T 5nco 39 MA k Signal sequence (1A9L) KQSTLALLLLLLLLTPVAAAAAA 23 T 1.5 Mfp-3 pdbhh F T 5nd0 2 C,D C,F 5,6-DIHYDRO-BENZO[H]CINNOLIN-3-YLAMINE XXPPXTEDEX 10 T 10 Vasculin pdbhh F F 5nd1 1 A A M1VMJ0_RNQV1 Capsid protein MATQMQQRDNGEETLANIKHSARSELKDMNVSGLAGINIQAGGYDLTGLSLTDELIAQGLSNVGLLPFEHSRVVLELAEAITVTANNTDMGGSGQYCEQGAWRAWLHVGLNMAKHHVRIRSSAAMDFGTHRMMACDPASLDASTISAMSRNVTTATMNAVSREMKAMKALAAGRSRMSQSDIADNNDCRGFAFGVLSRMVMHNSARRHGVVNGRLQELGENDASTADTYLTWELACAHGKGEVAITPVPAAWLDPEAQLTGRERVFSEALARLVDPDVGCVHVKIDGVTQNAAENARVHYATRPDPMSWLDDNTGLSADSNAGRISGEHYTLWKGRHSKVHLTIQLKQLYHRMSTTAATAEPRADSIVYYLKGFEGLGACAEFLLANSRFGHHSFLPGVFGVTADVHEAAYAQNQALFLAGIGDRMPPATFTKAQLATATYALMRRYDISERTCHFAITTIGHMVAQTAVRDLNNGSLSPLPFRVNLSPFLVQGVQFWDMNDTEGSVSVHDMGIGKELLTATYALGAMASLAHVCEQGGTGEEMSAIEVSRFTDVHTVATDLFRKVVMTELGDLKLRGSEVTHSSQEALFAQKMKAVWSSMAEGSTRLYNLNQAYGPFVDVQLARIRSSFVRDIGASRQMMDASALIKHAQNVTYDWPQNESGCPVQFIALPVPSTITHYATPAIGTERWFATTRLNAAGSKVISEIRWTNGLNSDDRAGVHVFAYGRSTIVSSPLGCAEAMAAMVIAGEHKVVRRHTSIARAQSARTANIVAGAVLGARNGDMMTIIRPSTSVASSAVHLRGYIPMAAMNMLPITDGDCDLVVADTRTRPGRMSTSPEAHRRGVLASDYHVDIATDGNIRHVAREVYTVADIPTVSERVSGLALRPYERSCVRDASTLHSMLCGAVPLLYGGGEPMKLGDNTPVTNRQALRPPEYNRNPALRMPARFQMGTTACAFTKALGDVRAQMELREGDVVTEEVTEPDTTIVPQGSITERVVVGEMTEALIMADQPMFDQDVVQNIMYNSPGIRGTERANIQAELMAAKDWPSILQATAKSSHGDIASAKTPYDLIKACKVDWSKVKGETQLKLIMNKIHPEYMTLSRAISAQVDARIVPNVPKSAMSTLLFWACATDMGLHTAVMANLAGLQRTTGFKGGIVYDLEGGANWPATDVRARIIEGWNAHARRIAQSGLISRDLTVMKIQHDMNADDIMALPAHIGDSWVLTIGELTANIATDEQSVAYAKDAQSAYYAIDRLRRLVAGREEGVDEILSKAEVMARVLAENKGLADDQALYHREWNRRTMAYVYMTTYLGSLDVEARLGVVSDDAYAERRAEWKAERAKLAVPAVSGQGQGRR 1357 T 4.8 CmlA_N pdbhh T Viruses T 5nd1 2 B B M1VHN2_RNQV1 Capsid protein MSAPSDQSQETRSPTSVGNTVAADVQTSVHDKPTGELKGSDGTGIHEATGLPIDKRGEVPTVQLERTAESIAKMMDLLRSEKFTAAAADAKLMLQQEFQNIVACAKNAPQMTVNAGRFYLGCNSTTAIIAGDTADGYEIEYSGKRIEGQCVVALEPLTITLSGSTSSTQDNSDSAKLFALAVSQVWGGASTVGIVAPMLQTVAQEQTFRARVERDSGFQHHAALTTVVTTIVGWLMHVGDSAAKRSRDGWLDHQTDFAVKGMLTPHIASGMDWAGVQTYSASAMETTTDRVRADYAGRMVVHSTLRKQTLRSRGTGDTTETENSGRYLLALPKCDAGVAAAALALTWGKPKLGGAGHANLTAVMSEAGVGYITGVNGTRATPHADTVFGREELVYLLGFALRHMADAQEQVIRNVLAQVASLFRPAACSAHEWMNVHGALMPKVSRPMNEPAFREVWNVANSSSDLQMIDRDKLNGEHFLRQLAQQITVNCTGTAMAIYQAVLAGPTGITDGDTTRLQKDLYHHLFQYATTTYADGVQVMQANTRMANKMVPPVNALAAWGLGSSMDSFTGPHCAYYFGLADAADGCFYSTTTGRTLSVYAVDVNHTSSDSYLAMAQLEPGLIATATGTGSTITTNVEAAGVVDGGLVTEGHVSLYTTISAQWNGLQREVYNWLLWHACKTEDSSHADIVGAEEVKSAVEWLSSNSVEAHRFRSSAGLGATEAAGSPGRRAWRLHHYDGQIFSNVIADTERHPYMRRLYTPSELRDARNDLFVVDRIWKIVMAMRAQLMLISVQEDGGRHQHSKHYFGEAAAIGVMGHGFTNLFAYCASTVHGGREARLISNCTDTPMYKKEANDLVPPMMKVAQLSTLLAHGGAWCNAVNMGGNSTSIGLSILGDGTMPLQTVPWTVNEITYLSEEGARHGIEAIIDTNGSVSVKVKMTMLEPRQRFCLYDDNKTSSYITAQESRTATYVTLKLGGTKNANTISGLVAHDYKLATTILASTYDKGRKTGLTLEDLQKVGGITGGQGMTGRGGGSSSGRGGRGRGGSSTGGAETIGDSE 1059 T 24 Prion_octapep pdbhh T Viruses T 5ndt 2 B B Pimtide ARKRRRHPSGPPTA 14 T 1.5 DUF3019 pdbhh F T 5nes 2 B E CYD-TRP-TRD-LYS-LYD-LYS-LYD-LYS-TRD-TRP-CYD XWXKXKXKXWX 11 T 0.73 DUF5989 pdbhh F F 5ney 2 E E CYS-TRD-TRP-LYD-LYS-LYD-LYS-LYD-TRP-TRD-CYS-ALA CXWXKXKXWXCA 12 T 1 DUF5989 pdbhh F F 5nf0 2 E,G,H E,G,H CYD-TRP-TRD-LYS-LYD-LYS-LYD-LYS-TRD-TRP-CYD-GLY XWXKXKXKXWXX 12 T 0.88 DUF5989 pdbhh F F 5nf0 3 F F Fragment of ligand GGGGG 5 T 56 Parvo_coat pdbhh F F 5nfu 2 B C Ac-LEU-HIS-SER-(TPO)-ALA XHSTA 5 T 320 APP_Cu_bd pdbhh F F 5ngn 1 A,B,C A,B,C A0A2D0TC93_LYCBA lybatide 2 DSCSEYCSNRCPSCDGQTQTQYTLCCINICCPS 33 T 0.12 Radical_SAM_2 pdbhh F Eukaryota T 5ngq 2 E E DLS-PRO-ALD-CYS-TYD-ALA-CYD-LYS-ALA XPXCXAXKA 9 T 0.024 OTT_1508_deam pdbhh F F 5ngq 3 F,G,H F,G,H Fragment of bicycle KKKK 4 T 280 RFXA_RFXANK_bdg pdbhh F F 5nif 15 CA,DA 3,4 TRP-ARG-SER-TYR-TYR-ALA KYFTGSKLWRSYYA 14 T 3.4 DUF4130 pdbhh F T 5nj9 3 E,F E,F ANGT_HUMAN ASP-ARG-VAL-TYR DRVY 4 T 41 DUF3667 unphh F Eukaryota F 5nja 3 E,F E,F ANGT_HUMAN HIS-PRO-PHE HPF 3 T 23 DUF1974 pdbhh F Eukaryota F 5njc 3 E,F E,F VAL-LEU-GLU-ASP-ARG-ILE VLEDRI 6 T 63 DUF5805 pdbhh F T 5njf 3 E,F E,F ALA-ALA-ALA-ALA-ALA AAAAA 5 T 440 HCV_NS4a pdbhh F F 5njj 2 E,F,G,H E,F,G,H ALA-TYR-ILE-GLY-PRO-PTR-LEU AYIGPXL 7 T 0.29 Crl pdbhh F T 5njk 2 G,H,I,J,K,L H,G,I,J,K,M ALA-TYR-ILE-GLY-PRO-PTR-LEU AYIGPXL 7 T 0.29 Crl pdbhh F T 5njx 2 B B Q96HX7_HUMAN HSP90AA1 protein HHHHHHDDTSRMEEVD 16 T 4900 NHL pdbhh F Eukaryota T 5nkp 2 C,D D,C WNK3_HUMAN PROTEIN KINASE LYSINE-DEFICIENT 3,PROTEIN KINASE WITH NO LYSINE 3 ECEETEVDQHV 11 T 17 GP79 pdbhh F Eukaryota T 5nne 2 B C TOP2A_HUMAN GKA(ALY)GK(ALY)TQMY GKAXGKXTQMY 11 T 17 TAT_ubiq pdbhh F Eukaryota T 5nnf 2 B B BAZ1B_HUMAN FLPH(ALY)YDVKL FLPHXYDVKL 10 T 1.9 HMMR_N pdbhh F Eukaryota T 5nnp 4 G,H I,L Ser-Glu-Ser-Ser SESS 4 T 520 cEGF pdbhh F F 5nny 1 A,B A,B Q5GA15_LEGPN WipB GPMTDISMGDLHANALLFLNILVRQGIIAISPENYAKFAEIYTLPELQADYWGTEAPVFSAENKQERLEEIKKQYNALIAQIKIINTKKLIRLIGDELVDRGVIDYFILKLLQALYDQGADFEILLSNHGIEFVEACELFKENGNKLVAKRLGNIQHGNSFHALQEAIAAGAISNEEVLNIYHQVYKKHLKIISYSLDPDANEIKVFSHAGIGLNHIRGLARKFKVPYSEESAVDLAKTIDAINKKFAEKASSGEIHTLYTHDMMYRGYAGEHLNSTDEVVAATVWGREYGDLIRTSKKFKITFIHGHDSYDPEKVEHVTLN 322 T 6.4E-05 Metallophos pdbpercent F Bacteria T 5no2 10 J L RS12_ECOLI SMALL RIBOSOMAL SUBUNIT PROTEIN US12 ATVNQLVRKPRARK 14 T 1.6 MobC pdbhh F Bacteria T 5no3 11 K L RS12_ECOLI SMALL RIBOSOMAL SUBUNIT PROTEIN US12 ATVNQLVRKPRARK 14 T 1.6 MobC pdbhh F Bacteria T 5no4 11 K L RS12_ECOLI SMALL RIBOSOMAL SUBUNIT PROTEIN US12 ATVNQLVRKPRARK 14 T 1.6 MobC pdbhh F Bacteria T 5no7 1 A,B A,B A0A060SRI5_PYCCI Lytic polysaccharide monooxygenase HIAFWHNSMYGFNVTEQTFPYDNRPVVPLQYMTFQEWWFHNHLDYPPHPGDFFDFPAGKAATAELACNKGATTWFNSSEGGNIQNGNDPCPGSPPSEYHTTGIDDVKGCAMAIAYESDVRKIKPEDFTVFSVNQTCVWYRFTDFQVPERMPPCPPGGCHCAWFWIHSPDSGGEQIYMNGFQCNITGSTSHVPLAKPKVARRCGADPDHGKPDAVPGNCTYGAKQPLYWLQKEGNNEFDDYIAPPFYNDLYNFKDGAQNDIFVDSYPDGIPLEQKLISEEDLNSAVDHHHHHH 292 T 6.9 Tetradecapep pdbhh F Eukaryota T 5nog 2 F,G F,G cardiac alpha tropomyosin XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 135 F F F 5noj 2 F,G F,H cardiac alpha tropomyosin XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 136 F F F 5nol 2 F,G F,G cardiac alpha tropomyosin XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 135 F F F 5npj 2 C,D D,E POLG_HCVJF Epitope peptide WGENETDVFLLN 12 T 0.00057 HCV_NS1 pdbhh T Viruses T 5npr 2 B E bisubstrate inhibitor XVTPVCTAX 9 T 0.17 hNIFK_binding pdbhh F T 5nps 2 B D 5,6-DIHYDRO-BENZO[H]CINNOLIN-3-YLAMINE XVTPVSTAX 9 T 25 RB_A pdbhh F T 5nqa 2 C G Monoglycopeptide 3 GATGAGAGAGTTPGPG 16 T 45 MSP1_C pdbhh F F 5nqf 2 B B A5K3N8_PLAVS Rhoptry neck protein 2 MDISQHATDIGMGPATSCYTSTIPPPKQVCIQQAVKATL 39 T 3.5 zf-XS pdbhh F Eukaryota T 5nqg 2 B B A5K3N8_PLAVS RON2 MDISQHATDIGMGPATSCYTSTIPPPKQVCIQQAVKATL 39 T 3.5 zf-XS pdbhh F Eukaryota T 5nr5 1 A A Q54HW9_DICDI MatA protein GSHMASMDPLDKIINDIKKEANDSGVTLAPLSVPKPKLEELSEQQKIILAEYIAEVGLQNITAITLSKKLNITVEKAKNYIKNSNRLGRTNNLKTIGILQEEVSSMEAKSMTW 113 T 0.014 P4Ha_N pdbpercent F Eukaryota T 5nrl 31 EA X Unknown AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 111 T 16000 EF-hand_5 pdbhh F F 5nsc 3 C C Fc-III peptide DCAWHLGELVWCT 13 T 6.1 FAT pdbhh F T 5nui 2 C M SER-GLN-ILE-LYS-ARG-LEU-LEU-SER XXXXXXXXXXXXXXXX 16 F F F 5nvk 2 B,D,F,H B,D,F,H GGYF1_HUMAN PERQ AMINO ACID-RICH WITH GYF DOMAIN-CONTAINING PROTEIN 1 GPHMKYKLADYRYGREEMLALYVKENKVPEELQDKEFAAVLQDEPLQPLALEPLTEEEQRNFSLSVNSVAVLRLM 75 T 2.6 T4SS_TraI pdbhh F Eukaryota T 5nvl 2 B,D B,D GGYF2_HUMAN PERQ AMINO ACID-RICH WITH GYF DOMAIN-CONTAINING PROTEIN 2,TRINUCLEOTIDE REPEAT-CONTAINING GENE 15 PROTEIN GPHMKYKLADYRYGREEMLALFLKDNKIPSDLLDKEFLPILQEEPLPPLALVPFTEEEQRNFSMSVNSAAVLRLT 75 T 3.1 T4SS_TraI pdbhh F Eukaryota T 5nvm 2 B,D B,D GGYF2_HUMAN PERQ AMINO ACID-RICH WITH GYF DOMAIN-CONTAINING PROTEIN 2,TRINUCLEOTIDE REPEAT-CONTAINING GENE 15 PROTEIN GPHMKYKLADYRYGREEMLALFLKDNKIPSDLLDKEFLPILQ 42 T 14 TTRAP pdbhh F Eukaryota T 5nvs 1 A A dynein heavy chain XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 932 F F F 5nvs 2 B,C D,C dynein intermediate chain XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 350 F F F 5nvs 3 D B dynein heavy chain XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 893 F F F 5nvs 4 E F dynein light intermediate chain XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 298 F F F 5nvs 5 F E dynein light intermediate chain XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 295 F F F 5nvs 6 G 2 N-terminal dimerization domain XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 125 F F F 5nvs 7 H 1 N-terminal dimerization domain XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 124 F F F 5nvs 8 I,J I,J LC8 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 85 F F F 5nvs 9 K K Tctex XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 103 F F F 5nvs 10 L L Tctex XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 104 F F F 5nvs 11 M M intermediate chain XXXXXXXXXXXXXXXXXXXXXXXXXXX 27 F F F 5nvs 12 N N intermediate chain XXXXXXXXXXXXXXXXXXXXXXXXXXXXX 29 F F F 5nvs 13 O,P R,S Robl XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 120 F F F 5nvu 1 A,B A,B Dynein motor domain XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 3169 F F F 5nvu 2 C C Dynein tail heavy chain XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 932 F F F 5nvu 3 D,E D,E Dynein intermediate chain XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 350 F F F 5nvu 4 F F Dynein tail heavy chain XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 893 F F F 5nvu 5 G G Dynein light intermediate chain XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 298 F F F 5nvu 6 H H Dynein light intermediate chain XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 295 F F F 5nvu 7 I I N-terminal dimerization domain XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 125 F F F 5nvu 8 J J N-terminal dimerization domain XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 124 F F F 5nvu 9 K,L K,L LC8 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 85 F F F 5nvu 10 M M Tctex XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 103 F F F 5nvu 11 N N Tctex XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 104 F F F 5nvu 12 O O Intermediate chain N-terminus peptides XXXXXXXXXXXXXXXXXXXXXXXXXXX 27 F F F 5nvu 13 P P Intermediate chain N-terminus peptides XXXXXXXXXXXXXXXXXXXXXXXXXXXXX 29 F F F 5nvu 14 Q,R Q,R Robl XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 120 F F F 5nw4 1 A B dynein heavy chain XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 889 F F F 5nw4 2 B D dynein intermediate chain XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 341 F F F 5nw4 3 C C dynein intermediate chain XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 350 F F F 5nw4 4 D A dynein heavy chain XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 922 F F F 5nw4 5 E 1 dynein N-terminal dimerization domain XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 125 F F F 5nw4 6 F 2 dynein N-terminal dimerization domain XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 124 F F F 5nw4 7 G,H R,S Robl XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 120 F F F 5nw4 8 I E dynein light intermediate chain XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 295 F F F 5nw4 9 J F dynein light intermediate chain XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 298 F F F 5nw4 15 W a dynactin shoulder complex XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 571 F F F 5nw4 16 X b dynactin shoulder complex XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 602 F F F 5nw4 17 Y,Z c,d dynactin shoulder complex XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 130 F F F 5nw4 18 AA,BA e,f dynactin shoulder complex XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 174 F F F 5nw4 21 EA i dynactin pointed end p62 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 243 F F F 5nw4 22 FA j p150 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 52 F F F 5nw4 25 IA m A0A0J9X299_PIG Dynactin MADPKYADLPGIARNEPDVYAAAAAAAAAAA 31 T 0.16 Dynamitin pdbhh F Eukaryota T 5nw4 26 JA n A0A0J9X293_PIG DYNACTIN SUBUNIT 2 MADPKYADLPGIARNEPDVY 20 T 5.8 SmAKAP pdbhh F Eukaryota T 5nw4 27 KA o dynactin p150 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 53 F F F 5nw4 28 LA,MA 5,6 BICD2N XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 275 F F F 5nwi 2 B P KAT1_ARATH Potassium channel KAT1 YFSSN 5 T 37 SelB-wing_1 pdbhh F Eukaryota F 5nwj 2 B P KAT1_ARATH Potassium channel KAT1 HLYFSSN 7 F F Eukaryota T 5nwk 2 B,D,F,H,J,L,N,P P,Q,R,S,T,U,V,W KAT1_ARATH Potassium channel KAT1 YFSSN 5 T 37 SelB-wing_1 pdbhh F Eukaryota F 5nwy 1 A s A0A0P7EF65_VIBAL VemP nascent chain MHHHHHHHHHHGDYKDDDDKENLYFQGSAQIDQKAHVPHFSKLQPFVAVSVSPNSSVDFSEASEESSQSPVSEGHASLDSVALFNSQRWTSYLREGLDDEHVDFVGDLTTPFYADAGYAYSLMDINWRHNQSTFYHFTSDHRISGWKETNAMYVALNSQFSALEVLFQGPYPYDVPDYA 179 T 6.3 DUF4022 unphh F Bacteria T 5nx2 2 B B truncated peptide agonist XXGTXTSDXX 10 T 120 Jagunal pdbhh F F 5nxf 1 A,B,C A,B,C FIBP_BPT4 GENE PRODUCT 34,GP34 STEAQEGVIKVATQSETVTGTSANTAVSPKNLKWIAQSEPTWAATTAIRGFVKTSSGSITFVGNDTVGSTQDLELYEKNSYAVSPYELNRVLANYLPLKAKAADTNLLDGLDSSQFIRRDIAQTVNGSLTLTQQTNLSAPLVSSSTGEFGGSLAANRTFTIRNTGAPTSIVFEKGPASGANPAQSMSIRVWGNQFGGGSDTTRSTVFEVGDDTSHHFYSQRNKDGNIAFNINGTVMPININASGLMNVNGTATFGRSVTANGEFISKSANAFRAINGDYGFFIRNDASNTYFLLTAAGDQTGGFNGLRPLLINNQSGQITIGEGLIIAKGVTINSGGLTVNSRIRSQGTKTSDLYTRAPTSDTVGFWSIDINDSATYNQFPGYFKMVEKTNEVTGLPYLERGEEVKSPGTLTQFGNTLDSLYQDWITYPTTPEARTTRWTRTWQKTKNSWSSFVQVFDGGNPPQPSDIGALPSDNATMGNLTIRDFLRIGNVRIVPDPVNKTVKFEWVE 509 T 0.16 gp12-short_mid unphh T Viruses T 5nxh 1 A,B,C A,B,C FIBP_BPT4 GENE PRODUCT 34,GP34 SGLVESGTLWDHYTLNILEANETQRGTLRVATQVEAAAGTLDNVLITPKKLLGTKSTEAQEGVIKVATQSETVTGTSANTAVSPKNLKWIAQSEPTWAATTAIRGFVKTSSGSITFVGNDTVGSTQDLELYEKNSYAVSPYELNRVLANYLPLKAKAADTNLLDGLDSSQFIRRDIAQTVNGSLTLTQQTNLSAPLVSSSTGEFGGSLAANRTFTIRNTGAPTSIVFEKGPASGANPAQSMSIRVWGNQFGGGSDTTRSTVFEVGDDTSHHFYSQRNKDGNIAFNINGTVMPININASGLMNVNGTATFGRSVTANGEFISKSANAFRAINGDYGFFIRNDASNTYFLLTAAGDQTGGFNGLRPLLINNQSGQITIGEGLIIAKGVTINSGGLTVNSRIRSQGTKTSDLYTRAPTSDTVGFWSIDINDSATYNQFPGYFKMVEKTNEVTGLPYLERGEEVKSPGTLTQFGNTLDSLYQDWITYPTTPEARTTRWTRTWQKTKNSWSSFVQVFDGGNPPQPSDIGALPSDNATMGNLTIRDFLRIGNVRIVPDPVNKTVKFEWVE 564 T 0.16 gp12-short_mid unphh T Viruses T 5nxk 1 A,B,C A,B,C Serine-rich secreted cell wall anchored (LPXTG-motif ) protein IEEVSNEEELKAALRDASITTIKLKNNITLNNAITINNGNRNITIIGDGHYINALNSDGGIILNNRGGSAKIDLTIENATLYNTSKYGFVNMSSNGVDTVTYKDVTAYGGTLVWSKTGAGVKTLNLVGNTTLNSVKSYEVDGQSCGTEAFSHRTPDGDKTTALYVSNAINIAENANVVLNNSATDIDMWLLTAVPSTSGISTVTVGNNASLTMENIGNTEYNIKLDGGRENHFIVNENAAVKMSAKVDNVRIIPQLENIFTRGNIELAKGSNVHLEVITGSNFRVAGTVANRIDFNGTATLIKQEGASGP 310 T 0.18 Cas5fv_helical pdb F T 5nxq 2 D,E D,E SLD5_YEAST MET-ASP-ILE-UA1-ILE-ASP-ASP-ILE-LEU-UA2-GLU-LEU-ASP-LYS-GLU MDIXIDDILXELDKETTAV 19 T 0.5 Bombolitin pdbhh F Eukaryota T 5ny0 1 A A A0A384E0N5_LACR1 L. reuteris SRRP binding region EDIQADATAANASELKKALQDTSVHTIKLTDNITLTSAIELTNVSRDVTIYGNGKYINATDGNGGIFIHNTKSYTVNLTIEKATLYNQSQYGFVHMNDEGTDNITYKNITAYGGTLVWSQTHVGTKTLSLEGTVNFYSVPSYTVGGQTYSTDAFKIGTHYPNGENKDTTPAIYVSNEINIADNANIALENSATKIDIWMIADIGIHPHTTALTIGNNATLTMENGNNSALNIKLDGDTSNSFTVGEGSTVKLSAKVDNVRILPYEDSNTANVSFAKGSDVTLHAGTGSNLRMGASISNQIDFNGKATFIKDSGAYANTAYADQTRGNIEFDYYWNDQQKTGSTGVANFNPGSNVLFQAGPGASNVNTY 368 T 0.0055 MAP pdb F Bacteria T 5o0z 1 A,B A,B Laspartomycin C XDXXGDGDGXIP 12 T 0.067 LCAT pdbhh F F 5o31 15 O P NADH dehydrogenase [ubiquinone] 1 alpha subcomplex subunit 9 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 297 F F F 5o31 28 CA j NADH dehydrogenase [ubiquinone] 1 beta subcomplex subunit 2 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 52 F F F 5o31 29 DA k NADH dehydrogenase [ubiquinone] 1 beta subcomplex subunit 3 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 74 F F F 5o31 30 EA l NADH dehydrogenase [ubiquinone] 1 beta subcomplex subunit 8 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 120 F F F 5o3u 2 E,F,G,H L,M,N,O F6LNM3_9CARY Putative presegetalin F1 MATSFQFDGLKPSFSASYSSKPIQTQVSNGMDNASAPV 38 T 0.02 MAGI_u1 pdb F Eukaryota T 5o3v 2 C,D D,C F6LNL6_9CARY Putative presegetalin B1 MSPILAHDVVKPQGVAWAFQAKDVENASAPV 31 T 16 DPRP pdbhh F Eukaryota T 5o3w 2 E,F,G,H W,X,Y,Z F6LNL5_9CARY Presegetalin A1 MSPILAHDVVKPQGVPVWAFQAKDVENASAPV 32 T 10 Choline_sulf_C pdbhh F Eukaryota T 5o45 2 B B PHE-MEA-9KK-SAR-ASP-VAL-MEA-TYR-SAR-TRP-TYR-LEU-CCS-GLY-NH2 FXXXDVXYXWYLXGX 15 T 0.79 Selenoprotein_S pdbhh F T 5o4y 2 D,E,F A,D,F PHE-MAA-ASN-PRO-HIS-LEU-SER-TRP-SER-TRP-9KK-9KK-ARG-CCS-GLY-NH2 FXNPHLSWSWXXRXGX 16 T 2.9 DUF4462 pdbhh F T 5o60 1 A 3 A0QTP4_MYCS2 BL37 MAKRGRKKRDRKHSKANHGKRPNA 24 T 0.18 DUF6254 pdb F Bacteria T 5o61 1 A 3 A0QTP4_MYCS2 BL37 MAKRGRKKRDRKHSKANHGKRPNA 24 T 0.18 DUF6254 pdb F Bacteria T 5o6u 3 C,D,E C,D,E A4Y6G1_SHEPC Uncharacterized protein MQKVTGIKSVDFKIKALGHGVVNWNGPTTLTGDDGKTVDNHTLPKLRGYTNLTGKVKDETGYKYKKQATDINFKETPLYISQNCIRHHLFREQAFDLHYASDKNLKNVLASITGLIRGYVVPSSQCKRTSPLLLEDFVDQLGNGNFEQYGQAGARDSTSFFSKTTFGDTEYISYGSISIEQLQFISLDKKFDRAAMVIKEGEGEVIAAELQNYIQSLNPSLNPQAIFHSNYVRRGTIFEEGECGILLNDDAVKALVAETLERLANLSIRQAKGYMYVDDITVDYNDSHKMMRIKRDESEIINEQHAPFAQYFYAK 315 T 0.025 MecA_N pdbpssm F Bacteria T 5o74 1 A,C,E,G,I,K A,C,E,G,I,K DRRA_LEGPN DEFECTS IN RAB1 RECRUITMENT PROTEIN A GHMVTRIENLENAKKLWDNANSMLEKGNISGYLKAANELHKFMKEKNLKEDDLRPELSDKTISPKGYAILQSLWGAASDYSRAAATLTESTVEPGLVSAVNKMSAFFMDCKLSPNERATPDPDFKVGKSKILVGIMQFIKDVADPTSKIWMHNTKALMNHKIAAIQKLERSNNVNCETLESVLSSKGENLSEYLSYK 197 T 0.099 LuxQ-periplasm unppercent F Bacteria T 5o76 2 B,D B,D ZAP70_HUMAN Tyrosine protein kinase ZAP70 peptide TLNSDGXTPEPA 12 T 0.99 Galanin pdbhh F Eukaryota T 5o7h 3 C,D,E C,D,E A4Y6G1_SHEPC Cas7fv MQKVTGIKSVDFKIKALGHGVVNWNGPTTLTGDDGKTVDNHTLPKLRGYTNLTGKVKDETGYKYKKQATDINFKETPLYISQNCIRHHLFREQAFDLHYASDKNLKNVLASITGLIRGYVVPSSQCKRTSPLLLEDFVDQLGNGNFEQYGQAGARDSTSFFSKTTFGDTEYISYGSISIEQLQFISLDKKFDRAAMVIKEGEGEVIAAELQNYIQSLNPSLNPQAIFHSNYVRRGTIFEEGECGILLNDDAVKALVAETLERLANLSIRQAKGYMYVDDITVDYNDSHKMMRIKRDESEIINEQHAPFAQYFYAK 315 T 0.025 MecA_N pdbpssm F Bacteria T 5o8k 2 B B REV3L_HUMAN PROTEIN REVERSIONLESS 3-LIKE,HREV3 MGRTANILKPLMSPPSREEIMATLLDHD 28 T 3.6 FAM110_C pdbhh F Eukaryota T 5o90 2 B B TAB1_HUMAN MITOGEN-ACTIVATED PROTEIN KINASE KINASE KINASE 7-INTERACTING PROTEIN 1,TGF-BETA-ACTIVATED KINASE 1-BINDING PROTEIN 1,TAK1-BINDING PROTEIN 1 RVYPVSVPYSSAQSTSKTSVTLSLVMPSQ 29 T 6 DUF2584 pdbhh F Eukaryota T 5o9s 2 C,D C,D NCS1_HUMAN NCS-1,FREQUENIN HOMOLOG,FREQUENIN-LIKE PROTEIN,FREQUENIN-LIKE UBIQUITOUS PROTEIN GKSNSKLK 8 T 0.19 EF-hand_7 unppssm F Eukaryota F 5o9t 2 C,D C,D 1IP-CYS-PHE-SER-LYS-PRO-ARG XNCFSKPR 8 T 5.6 DUF1244 pdbhh F T 5o9u 2 C,D C,D CNBL4_ARATH PROTEIN SALT OVERLY SENSITIVE 3 GCSVSKKK 8 T 2.3 Antimicrobial_1 pdbhh F Eukaryota F 5o9v 2 C,D C,D AIFM3_HUMAN APOPTOSIS-INDUCING FACTOR-LIKE PROTEIN GGCFSKPK 8 T 0.062 NifU unphh F Eukaryota T 5oa1 21 U 1 ALA-ALA-ALA-ALA-ALA-ALA-ALA-ALA-ALA-ALA-ALA-ALA-ALA AAAAAAAAAAAAA 13 T 220 K_channel_TID pdbhh F F 5oa1 22 V 2 ALA-ALA-ALA-ALA-ALA-ALA-ALA-ALA-ALA-ALA-ALA-ALA-ALA-ALA-ALA-ALA-ALA-ALA-ALA AAAAAAAAAAAAAAAAAAA 19 T 410 Adeno_PIX pdbhh F F 5oa1 23 CA,W 9,3 ALA-ALA-ALA-ALA-ALA-ALA-ALA-ALA-ALA AAAAAAAAA 9 T 160 FAD_oxidored pdbhh F F 5oa1 24 X 4 ALA-ALA-ALA-ALA-ALA-ALA-ALA-ALA-ALA-ALA-ALA-ALA-ALA-ALA-ALA-ALA-ALA-ALA-ALA-ALA-ALA-ALA-ALA-ALA AAAAAAAAAAAAAAAAAAAAAAAA 24 T 670 DUF4699 pdbhh F F 5oa1 25 DA,Y Q,5 ALA-ALA-ALA-ALA-ALA-ALA-ALA-ALA-ALA-ALA-ALA-ALA-ALA-ALA-ALA-ALA AAAAAAAAAAAAAAAA 16 T 240 Campylo_MOMP pdbhh F F 5oa1 26 BA,Z 8,6 ALA-ALA-ALA-ALA-ALA-ALA-ALA-ALA-ALA-ALA AAAAAAAAAA 10 T 200 FAD_oxidored pdbhh F F 5oa1 27 AA 7 ALA-ALA-ALA-ALA-ALA-ALA-ALA-ALA-ALA-ALA-ALA-ALA-ALA-ALA-ALA-ALA-ALA-ALA-ALA-ALA-ALA-ALA-ALA-ALA-ALA AAAAAAAAAAAAAAAAAAAAAAAAA 25 T 790 DUF4699 pdbhh F F 5oa1 28 EA P ALA-ALA-ALA-ALA-ALA-ALA-ALA-ALA-ALA-ALA-ALA-ALA-ALA-ALA-ALA-ALA-ALA-ALA-ALA-ALA-ALA-ALA AAAAAAAAAAAAAAAAAAAAAA 22 T 460 DUF4699 pdbhh F F 5oa1 29 FA Z ALA-ALA-ALA-ALA-ALA-ALA-ALA-ALA-ALA-ALA-ALA-ALA-ALA-ALA-ALA AAAAAAAAAAAAAAA 15 T 200 Campylo_MOMP pdbhh F F 5oa1 30 GA Y ALA-ALA-ALA-ALA-ALA-ALA-ALA-ALA-ALA-ALA-ALA-ALA AAAAAAAAAAAA 12 T 250 K_channel_TID pdbhh F F 5oa1 31 HA R ALA-ALA-ALA-ALA-ALA-ALA-ALA-ALA-ALA-ALA-ALA-ALA-ALA-ALA-ALA-ALA-ALA-ALA-ALA-ALA-ALA AAAAAAAAAAAAAAAAAAAAAAAAAAA 27 T 1100 DUF4699 pdbhh F F 5oac 1 A,B,C,D,E,F,G,H,I,J A,B,C,D,E,F,G,H,I,J A0A2D0TC94_9VIRU Major capsid protein MTIKYLSSETEKLMNQTVSGIDVCFTLIGVDDDSFASGSKNDYISDTPKFLDPSNVHIKATLKRGGKDYVLFSENLALLAKYSTITQGRDQWEEGVKLAAKEMVHLVYIPFSGNTNWPAHINLKDNDVLEVYVNVVRGAYGAELDANACICDVRTSPSIGVEKFIPFMTSYSIRANQATDLVNLGNDVTRIALLSMTNDVSNIPNAFTDVTLSSDRLDKNFNSNQLILEHSKCIEDSVRSHANEVDSYLIHEDIEIDSAKVHLKMNPAKIRENTIYLVRSHFQTSLEILQKAVAMEEKHQSADIAKVPAT 310 T 0.19 AcetylCoA_hydro pdbpercent T Viruses T 5oap 1 A A DRE2A_ARATH DREB2A SSDMFDVDELLRDLNGDD 18 T 6.5 DUF2525 pdbhh F Eukaryota T 5oav 2 B,D B,D APP12 XAPPLPPRNRPRL 13 T 0.42 SCIMP pdbhh F T 5ob0 2 B B APP12 XAPPLPPRNRPRL 13 T 0.42 SCIMP pdbhh F T 5ob1 2 B B APP12 XAPPLPPRNRPRL 13 T 0.42 SCIMP pdbhh F T 5ob2 2 B,D B,D APP12 XAPPLPPRNRPRL 13 T 0.42 SCIMP pdbhh F T 5obm 80 OC m2 60S ribosomal protein L12 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 165 F F F 5obm 83 RD p1 Ribosomal protein P1 alpha XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 47 F F F 5obt 3 C,F C,D Ac-YVAD-CMK XYVADX 6 T 300 Rhodanese_C pdbhh F F 5od4 1 A A A0A0C4DI32_FUSO4 SECRETED IN XYLEM 3 PROTEIN,AVR2 GPPYCVFPGRRTSSTSFTTSFSTEPLGYARMLHRDPPYERAGNSGLNHRIYERSRVGGLRTVIDVAPPDGHQAIANYEIEVRRIPVATPNAAGDCFHTARLSTGSRGPATISWDADASYTYYLTISED 128 T 11 DUF4377 pdbhh F Eukaryota T 5ods 2 E,F,G,H E,F,G,H TACC3_HUMAN LYS-GLU-SER-ALA-LEU-ARG-LYS-GLN-SEP-LEU-TYR-LEU-LYS-PHE-ASP-PRO-LEU-LEU KESALRKQSLYLKFDPLL 18 T 2 DUF4293 pdbhh F Eukaryota T 5odt 2 B B TACC3_HUMAN ERIC-1 MELKEESFRDPAEVLGTGAEVDYLEQFGTSSFKESALRKQSLYLKF 46 T 0.99 DUF2095 pdbhh F Eukaryota T 5oec 1 A A Q6EAT3_SALER VIRULENCE PROTEIN GHMQGQIIHHRNFQSQFDTTGNTLYNNAWVCSLNVIKSRDGNNYSALEDITSDNQAFNNILEGIDIIECENLLKEMNVQKIPESSLFTNIKEALQAEVFNSTVEDDFESFISYELQNHGPLMLIRPSLGSECLHAECIVGYDSEVKKVLIYDSMNTSPEWQSNIDVYDKLTLAFNDKYKNEDCSICGLYYDGVYEPK 197 T 0.002 YsaB unppssm F Bacteria T 5oed 1 A A Q6EAT3_SALER VIRULENCE PROTEIN GHMLRHIQNSLGSVYRSNTATPQGQIIHHRNFQSQFDTTGNTLYNNAWVCSLNVIKSRDGNNYSALEDITSDNQAFNNILEGIDIIECENLLKEMNVQKIPESSLFTNIKEALQAEVFNSTVEDDFESFISYELQNHGPLMLIRPSLGSECLHAECIVGYDSEVKKVLIYDSMNTSPEWQSNIDVYDKLTLAFNDKYKNEDCSICGLYYDGVYEPKPLHSSSWKDWCTIL 230 T 0.002 YsaB pdbpssm F Bacteria T 5oek 1 A,B A,B GHR_HUMAN GH RECEPTOR,SOMATOTROPIN RECEPTOR GSMSQFTCEEDFYFPWLLIIIFGIFGLTVMLFVFLFSKQQRIK 43 T 8.8E-05 IFNGR1 unphh F Eukaryota T 5oeo 2 B C TRPV5_HUMAN TRPV5,CALCIUM TRANSPORT PROTEIN 2,CAT2,EPITHELIAL CALCIUM CHANNEL 1,ECAC1,OSM-9-LIKE TRP CHANNEL 3,OTRPC3 GADKEDDQEHPSEKQPSGAESGTLARASLALPTSSLSRTASQSSSHRGWEILRQNTLGHLNLGLNLSEGDGEE 73 T 0.16 Lipase3_N pdbpssm F Eukaryota T 5of4 7 G H MAT1 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 124 F F F 5of4 8 H Z Unassigned secondary structure elements. XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 270 F F F 5of4 9 I Y Unassigned secondary structure elements (p52 region) XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 232 F F F 5of4 10 J X Unassigned secondary structure elements (XPB NTE region) XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 78 F F F 5ogl 2 B B Substrate mimicking peptide GDQNATXG 8 T 3.9 S-AdoMet_synt_M pdbhh F T 5oh5 1 A A RidL GPSILEEYIRMAKNKEFFDALEEIAESAKNDETLRNELAKVLDDILKTDPSDPEAFRKIVAEHQEFWDEHDPSLMEFNEGRFFGKSRKQYLKSDDFLNSTDPTYNFQKLHQFAAEQRVKLGLEKSDTDTLVAILKNNPEECRAYIESKKPGLGNFSEGNVHGWLKEEYTPTIPPKAINKSTGVLSDEAIKRIKEQARDLLLLKLINSSGNTQLLKDLRDAMSKPEAERAANALGFPTEGNGVLFLSREVVDALEERVEKLEQEAAKRGFDSYVQSLSHNALLA 283 T 0.026 SE pdb F T 5oh6 1 A,B A,B Interaptin GPSAKNKEFFDALEEIAESAKNDETLRNELAKVLDDILKTDPSDPEAFRKIVAEHQEFWDEHDPSLMEFNEGRFFGKSRKQYLKSDDFLNSTDPTYNFQKLHQFAAEQRVKLGLEKSDTDTLVAILKNNPEECRAYIESKKPGLGNFSEGNVHGGSGGVLSDEAIKRIKEQARDLLLLKLINSSGNTQLLKDLRDAMSKPEAERAANALGFPTEGNGVLFLSREVVDALEERVEKLA 237 T 0.14 HEAT_2 pdbpercent F T 5ohd 1 A,B A,B GHR_HUMAN GH RECEPTOR,SOMATOTROPIN RECEPTOR GSMSQFTCEEDFYFPWLLIIIFGIFGLTVMLFVFLFSKQQRIK 43 T 8.8E-05 IFNGR1 unphh F Eukaryota T 5ohg 2 C C RNE_ECOLI RNASE E RRYRDERYPTQSPMPLTVACASPELASGKVWIRYPI 36 T 1.2 XisI pdbhh F Bacteria T 5ohg 3 F J RNE_ECOLI RNASE E RDERYPTQSPMPLTVACASPELASGKVWIRYPIVR 35 T 0.87 XisI pdbhh F Bacteria T 5oj5 2 B B PSBA1_THEEB PHE-PRO-LEU-ASP-LEU-ALA NAHNFPLDLA 10 T 0.49 IL34 pdbhh F Bacteria T 5ojo 2 C C CPSM_HUMAN CARBAMOYL-PHOSPHATE SYNTHETASE I,CPSASE I XVLKEYGV 8 F F Eukaryota T 5ojr 2 E,F E,F PSBA3_THEVB PSII D1 PROTEIN 3,PHOTOSYSTEM II Q(B) PROTEIN 3 NAHNFPLDLASAESAPVA 18 T 3.3 IL34 pdbhh F Bacteria T 5ojt 1 A A ACE-ARG-ALA-(D)CYS-ARG-BNA-HIS-PEN XRAXRXHX 8 T 9 Antimicrobial_6 pdbhh F F 5ok6 2 C,D C,D ALA-GLU-GLY-GLU-PHE-TYR-LYS-LEU-LYS-ILE-ARG-THR-PRO-AAR AEGEFYKLKIRTPR 14 T 1.3 RsgI_N pdbhh F T 5okc 3 D,F G,I CTF18_YEAST Chromosome transmission fidelity protein 18 TVKIWVKYNEGFSNAVRKNVTWNNLWE 27 T 6.5 BRCA2 pdbhh F Eukaryota T 5oki 4 E,H E,I CTF18_YEAST Chromosome transmission fidelity protein 18 TVKIWVKYNEGFSNAVRKNVTWNNLW 26 T 6.6 Hairy_orange pdbhh F Eukaryota T 5ol0 2 C,D C,D P53_HUMAN ANTIGEN NY-CO-13,PHOSPHOPROTEIN P53,TUMOR SUPPRESSOR P53 KKGQSTSRHKXLMFKTEG 18 T 30 Class_IIIsignal pdbhh F Eukaryota T 5olf 1 A A GBA-ALA-CYS-ARG-PHE-PHE-CYS XACRFFC 7 T 2.1 FKTN_N pdbhh F F 5oll 1 A A GUR_GYMSY SWEET TASTE-SUPPRESSING PEPTIDE EQCVKKDELCIPYYLDCCEPLECKKVNWWDHKCIG 35 T 0.00036 Toxin_7 pdb F Eukaryota T 5oma 2 E H Undetermined peptide XXXX 4 F F F 5on6 47 HB m2 60S ribosomal protein L12 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 150 F F F 5onj 2 B D (2R,3S,4R,5R,6R)-6-((1R,2R,3S,4R,6S)-4,6-DIAMINO-2,3-DIHYDROXYCYCLOHEXYLOXY)-5-AMINO-2-(AMINOMETHYL)-TETRAHYDRO-2H-PYRAN-3,4-DIOL EXXXXX 6 T 110 Herpes_LP pdbhh F F 5onr 2 B B A4_HUMAN ABPP,APPI,APP,ALZHEIMER DISEASE AMYLOID PROTEIN,AMYLOID PRECURSOR PROTEIN,AMYLOID-BETA PRECURSOR PROTEIN,CEREBRAL VASCULAR AMYLOID PEPTIDE,CVAP,PREA4,PROTEASE NEXIN-II,PN-II IIG 3 T 0.0001 Beta-APP unphh F Eukaryota F 5ons 2 B B DENR_HUMAN DRP,PROTEIN DRP1,SMOOTH MUSCLE CELL-ASSOCIATED PROTEIN 3,SMAP-3 MHHHHHHDADYPLRVLYCGVCSLPTEYCEYMPDVA 35 T 0.012 PHM7_cyt unppssm F Eukaryota T 5oob 3 E,J,K E,K,Z L9KL62_TUPCH NELF-E,RNA-BINDING PROTEIN RD DKRTQIVYSDDVYKENLVDGF 21 T 0.85 DUF5820 pdbhh F Eukaryota T 5ool 53 AB t Unknown protein or protein extension XXXXXXXXXXXXXXXXXXXXXXXXXXXX 28 F F F 5oom 52 ZA t Unknown protein or protein extension XXXXXXXXXXXXXXXXXXXXXXXXXXXX 28 F F F 5oqj 31 EA Z Unknown protein XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 43 F F F 5oqm 31 EA Z Unknown protein XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 43 F F F 5oqt 2 B C MGTS_ECOLI Uncharacterized protein YneM MLGNMNVFMAVLGIILFSGFLAAYFSHKWDD 31 T 0.23 Gram_pos_anchor pdb F Bacteria T 5osh 3 C,F,I,L C,F,I,L Interaptin MALEEYIRMAKNKEFFDALEEIAESAKNDETLRNELAKVLDDILKTDPSDPEAFRKIVAEHQEFWDEHDPSLMEFNEGRFFGKSRKQYLKSDDFLNSTDPTYNFQKLHQFAAEQRVKLGLEKSDTDTLVAILKNNPEECRAYIESKKPGLGNFSEGNVHGWLKEEYTPTIPPKAINKSTGVLSDEAIKRIKEQARDLLLLKLINSSGNTQLLKDLRDAMSKPE 223 T 0.016 SE pdb F T 5osi 3 C,F,I C,F,I Q5ZT54_LEGPH Interaptin KEEYTPTIPPKAIN 14 T 0.25 SE unp F Bacteria T 5ot7 45 SA l Ribosomal protein uL10 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXLXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 130 T 10000 zf-C2H2 pdbhh F F 5ot7 46 TA m Ribosomal protein uL11 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 140 F F F 5ot7 47 UA n Ribosomal protein bL12 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 71 F F F 5ouo 1 A A V5BCL0_TOXGV Perforin-like protein 1 ETGRNLPKQLTQATQVAWSGPPPGFAKCPGGQVVILGFAMHLNFKEPGTDNFRIISCPPGREKCDGVGTASSETDEGRIYILCGEEPINEIQQVVAESPAHAGASVLEASCPDETVVVGGFGISVRGGSDGLDSFSIESCTTGQTICTKAPTRGSEKNFLWMMCVDKQYPGLRELVNVAELGSHGNANKRAVNSDGNVDVKCPANSSIVLGYVMEAHTNMQFVRDKFLQCPENASECKMTGKGVDHGMLWLFDRHALFGWIICKTVNEGTKHHHHHH 277 T 0.23 HZS_alpha pdb F Eukaryota T 5ov3 2 C C RBBP5_MOUSE RBBP-5 EPKQTG 6 T 0.0099 DUF2457 unppercent F Eukaryota T 5ovt 2 H,I,J,K,L,M,N a,b,c,d,e,f,g Epoxomicin XXITX 5 T 840 DUF4597 pdbhh F F 5ovv 2 B B LZTS3_RAT PROSAP-INTERACTING PROTEIN 1,PROSAPIP1 XIESTEI 7 T 260 Mdv1 pdbhh F Eukaryota F 5ow5 3 E,F E,F CAMP3_MOUSE Calmodulin-regulated spectrin-associated protein IEEALQIIHS 10 T 3.2 BsuBI_PstI_RE_N pdbhh F Eukaryota T 5owo 1 A,B,C,D A,B,C,D DYHC1_HUMAN CYTOPLASMIC DYNEIN HEAVY CHAIN 1,DYNEIN HEAVY CHAIN,CYTOSOLIC MSEPGGGGGEDGSAGLEVSAVQNVADVSVLQKHLRKLVPLLLEDGGEAPAALEAALEEKSALEQMRKFLSDPQVHTVLVERSTLKEDVGDEGEEEKEFISYNINIDIHYGVKSNSLAFIKRTPVIDADKPVSSQLRVLTLSEDSPYETLHSFISNAVAPFFKSYIRESGKADRDGDKMAPSVEKKIAELEMGLLHLQQNIE 201 T 0.13 DUF4042 pdbpercent F Eukaryota T 5owp 2 B D 5,6-DIHYDRO-BENZO[H]CINNOLIN-3-YLAMINE SAXDTRPA 8 T 32 PNPase_C pdbhh F T 5oxe 1 A A D4QF72_APBV1 Major virion protein MAPKATLVKKFKGLAVGVGALLAAPPIMGLASYAVNGISSYLSITINSTTYDFAPLAQAVMVFGGIGLVAYGLHRILGRGL 81 T 0.3 MSP1b pdb T Viruses T 5oxw 1 A,B,C,D A,B,C,D Q74N74_NANEQ NEQ068 SIMDTEIEVIENGIKKKEKLSDLFNKYYAGFQIGEKHYAFPPDLYVYDGERWVKVYSIIKHETETDLYEINGITLSANHLVLSKGNWVKAKEYENKNN 98 T 3.6E-44 DNA_pol_B unp F Archaea T 5oxw 2 E,F,G,H E,F,G,H ALA-SER-GLY-SER-PHE-LYS-VAL-ILE-TYR-GLY-ASP ASGSFKVIYGD 11 T 5.2 DapH_N pdbhh F T 5oyp 4 D D Q9IGK7_9VIRU minor capsid protein MiCP DNPHRFLPANVSNRWNEYSSAYLPRV 26 T 2.7 YLP pdbhh T Viruses T 5pad 2 B I ZGPGCK XGFGX 5 T 120 YicC_N pdbhh F F 5q0i 2 B B PRGC1_HUMAN COACTIVATOR PEPTIDE PGC-1A PPAR GAMMA COACTIVATOR PSLLKKLLLAPA 12 T 13 Neurokinin_B pdbhh F Eukaryota F 5qsm 1 A,B A,B STAG1_HUMAN SCC3 HOMOLOG 1,STROMAL ANTIGEN 1 SMSPNGNLIRMLVLFFLESELHEHAAYLVDSLWESSQELLKDWECMTELLLEEPVQGEEAMSDRQESALIELMVCTIRQAAEAHPPVGRGTGKRVLTAKERKTQIDDRNKLTEHFIITLPMLLSKYSADAEKVANLLQIPQYFDLEIYSTGRMEKHLDALLKQIKFVVEKHVESDVLEACSKTYSILCSEEYTIQNRVDIARSQLIDEFVDRFNHSVEDLLQEGEEADDDDIYNVLSTLKRLTSFHNAHDLTKWDLFGNCYRLLKTGIEHGAMPEQIVVQALQCSHYSILWQLVKITDGSPSKEDLLVLRKTVKSFLAVCQQCLSNVNTPVKEQAFMLLCDLLMIFSHQLMTGGREGLQPLVFNPDTGLQSELLSFVMDHVFIDQDEENQSMEGDEEDEANKIEALHKRRNLLAAFSKLIIYDIVDMHAAADIFKHYMKYYNDYGDIIKETLSKTRQID 459 T 0.0089 Cnd3 pdb F Eukaryota T 5qsn 1 A,B A,B STAG1_HUMAN SCC3 HOMOLOG 1,STROMAL ANTIGEN 1 SMSPNGNLIRMLVLFFLESELHEHAAYLVDSLWESSQELLKDWECMTELLLEEPVQGEEAMSDRQESALIELMVCTIRQAAEAHPPVGRGTGKRVLTAKERKTQIDDRNKLTEHFIITLPMLLSKYSADAEKVANLLQIPQYFDLEIYSTGRMEKHLDALLKQIKFVVEKHVESDVLEACSKTYSILCSEEYTIQNRVDIARSQLIDEFVDRFNHSVEDLLQEGEEADDDDIYNVLSTLKRLTSFHNAHDLTKWDLFGNCYRLLKTGIEHGAMPEQIVVQALQCSHYSILWQLVKITDGSPSKEDLLVLRKTVKSFLAVCQQCLSNVNTPVKEQAFMLLCDLLMIFSHQLMTGGREGLQPLVFNPDTGLQSELLSFVMDHVFIDQDEENQSMEGDEEDEANKIEALHKRRNLLAAFSKLIIYDIVDMHAAADIFKHYMKYYNDYGDIIKETLSKTRQID 459 T 0.0089 Cnd3 pdb F Eukaryota T 5qso 1 A,B A,B STAG1_HUMAN SCC3 HOMOLOG 1,STROMAL ANTIGEN 1 SMSPNGNLIRMLVLFFLESELHEHAAYLVDSLWESSQELLKDWECMTELLLEEPVQGEEAMSDRQESALIELMVCTIRQAAEAHPPVGRGTGKRVLTAKERKTQIDDRNKLTEHFIITLPMLLSKYSADAEKVANLLQIPQYFDLEIYSTGRMEKHLDALLKQIKFVVEKHVESDVLEACSKTYSILCSEEYTIQNRVDIARSQLIDEFVDRFNHSVEDLLQEGEEADDDDIYNVLSTLKRLTSFHNAHDLTKWDLFGNCYRLLKTGIEHGAMPEQIVVQALQCSHYSILWQLVKITDGSPSKEDLLVLRKTVKSFLAVCQQCLSNVNTPVKEQAFMLLCDLLMIFSHQLMTGGREGLQPLVFNPDTGLQSELLSFVMDHVFIDQDEENQSMEGDEEDEANKIEALHKRRNLLAAFSKLIIYDIVDMHAAADIFKHYMKYYNDYGDIIKETLSKTRQID 459 T 0.0089 Cnd3 pdb F Eukaryota T 5qsp 1 A,B A,B STAG1_HUMAN SCC3 HOMOLOG 1,STROMAL ANTIGEN 1 SMSPNGNLIRMLVLFFLESELHEHAAYLVDSLWESSQELLKDWECMTELLLEEPVQGEEAMSDRQESALIELMVCTIRQAAEAHPPVGRGTGKRVLTAKERKTQIDDRNKLTEHFIITLPMLLSKYSADAEKVANLLQIPQYFDLEIYSTGRMEKHLDALLKQIKFVVEKHVESDVLEACSKTYSILCSEEYTIQNRVDIARSQLIDEFVDRFNHSVEDLLQEGEEADDDDIYNVLSTLKRLTSFHNAHDLTKWDLFGNCYRLLKTGIEHGAMPEQIVVQALQCSHYSILWQLVKITDGSPSKEDLLVLRKTVKSFLAVCQQCLSNVNTPVKEQAFMLLCDLLMIFSHQLMTGGREGLQPLVFNPDTGLQSELLSFVMDHVFIDQDEENQSMEGDEEDEANKIEALHKRRNLLAAFSKLIIYDIVDMHAAADIFKHYMKYYNDYGDIIKETLSKTRQID 459 T 0.0089 Cnd3 pdb F Eukaryota T 5qsq 1 A,B A,B STAG1_HUMAN SCC3 HOMOLOG 1,STROMAL ANTIGEN 1 SMSPNGNLIRMLVLFFLESELHEHAAYLVDSLWESSQELLKDWECMTELLLEEPVQGEEAMSDRQESALIELMVCTIRQAAEAHPPVGRGTGKRVLTAKERKTQIDDRNKLTEHFIITLPMLLSKYSADAEKVANLLQIPQYFDLEIYSTGRMEKHLDALLKQIKFVVEKHVESDVLEACSKTYSILCSEEYTIQNRVDIARSQLIDEFVDRFNHSVEDLLQEGEEADDDDIYNVLSTLKRLTSFHNAHDLTKWDLFGNCYRLLKTGIEHGAMPEQIVVQALQCSHYSILWQLVKITDGSPSKEDLLVLRKTVKSFLAVCQQCLSNVNTPVKEQAFMLLCDLLMIFSHQLMTGGREGLQPLVFNPDTGLQSELLSFVMDHVFIDQDEENQSMEGDEEDEANKIEALHKRRNLLAAFSKLIIYDIVDMHAAADIFKHYMKYYNDYGDIIKETLSKTRQID 459 T 0.0089 Cnd3 pdb F Eukaryota T 5qsr 1 A,B A,B STAG1_HUMAN SCC3 HOMOLOG 1,STROMAL ANTIGEN 1 SMSPNGNLIRMLVLFFLESELHEHAAYLVDSLWESSQELLKDWECMTELLLEEPVQGEEAMSDRQESALIELMVCTIRQAAEAHPPVGRGTGKRVLTAKERKTQIDDRNKLTEHFIITLPMLLSKYSADAEKVANLLQIPQYFDLEIYSTGRMEKHLDALLKQIKFVVEKHVESDVLEACSKTYSILCSEEYTIQNRVDIARSQLIDEFVDRFNHSVEDLLQEGEEADDDDIYNVLSTLKRLTSFHNAHDLTKWDLFGNCYRLLKTGIEHGAMPEQIVVQALQCSHYSILWQLVKITDGSPSKEDLLVLRKTVKSFLAVCQQCLSNVNTPVKEQAFMLLCDLLMIFSHQLMTGGREGLQPLVFNPDTGLQSELLSFVMDHVFIDQDEENQSMEGDEEDEANKIEALHKRRNLLAAFSKLIIYDIVDMHAAADIFKHYMKYYNDYGDIIKETLSKTRQID 459 T 0.0089 Cnd3 pdb F Eukaryota T 5qss 1 A,B A,B STAG1_HUMAN SCC3 HOMOLOG 1,STROMAL ANTIGEN 1 SMSPNGNLIRMLVLFFLESELHEHAAYLVDSLWESSQELLKDWECMTELLLEEPVQGEEAMSDRQESALIELMVCTIRQAAEAHPPVGRGTGKRVLTAKERKTQIDDRNKLTEHFIITLPMLLSKYSADAEKVANLLQIPQYFDLEIYSTGRMEKHLDALLKQIKFVVEKHVESDVLEACSKTYSILCSEEYTIQNRVDIARSQLIDEFVDRFNHSVEDLLQEGEEADDDDIYNVLSTLKRLTSFHNAHDLTKWDLFGNCYRLLKTGIEHGAMPEQIVVQALQCSHYSILWQLVKITDGSPSKEDLLVLRKTVKSFLAVCQQCLSNVNTPVKEQAFMLLCDLLMIFSHQLMTGGREGLQPLVFNPDTGLQSELLSFVMDHVFIDQDEENQSMEGDEEDEANKIEALHKRRNLLAAFSKLIIYDIVDMHAAADIFKHYMKYYNDYGDIIKETLSKTRQID 459 T 0.0089 Cnd3 pdb F Eukaryota T 5qsy 1 A,B A,B STAG1_HUMAN SCC3 HOMOLOG 1,STROMAL ANTIGEN 1 SMSPNGNLIRMLVLFFLESELHEHAAYLVDSLWESSQELLKDWECMTELLLEEPVQGEEAMSDRQESALIELMVCTIRQAAEAHPPVGRGTGKRVLTAKERKTQIDDRNKLTEHFIITLPMLLSKYSADAEKVANLLQIPQYFDLEIYSTGRMEKHLDALLKQIKFVVEKHVESDVLEACSKTYSILCSEEYTIQNRVDIARSQLIDEFVDRFNHSVEDLLQEGEEADDDDIYNVLSTLKRLTSFHNAHDLTKWDLFGNCYRLLKTGIEHGAMPEQIVVQALQCSHYSILWQLVKITDGSPSKEDLLVLRKTVKSFLAVCQQCLSNVNTPVKEQAFMLLCDLLMIFSHQLMTGGREGLQPLVFNPDTGLQSELLSFVMDHVFIDQDEENQSMEGDEEDEANKIEALHKRRNLLAAFSKLIIYDIVDMHAAADIFKHYMKYYNDYGDIIKETLSKTRQID 459 T 0.0089 Cnd3 pdb F Eukaryota T 5qsz 1 A,B A,B STAG1_HUMAN SCC3 HOMOLOG 1,STROMAL ANTIGEN 1 SMSPNGNLIRMLVLFFLESELHEHAAYLVDSLWESSQELLKDWECMTELLLEEPVQGEEAMSDRQESALIELMVCTIRQAAEAHPPVGRGTGKRVLTAKERKTQIDDRNKLTEHFIITLPMLLSKYSADAEKVANLLQIPQYFDLEIYSTGRMEKHLDALLKQIKFVVEKHVESDVLEACSKTYSILCSEEYTIQNRVDIARSQLIDEFVDRFNHSVEDLLQEGEEADDDDIYNVLSTLKRLTSFHNAHDLTKWDLFGNCYRLLKTGIEHGAMPEQIVVQALQCSHYSILWQLVKITDGSPSKEDLLVLRKTVKSFLAVCQQCLSNVNTPVKEQAFMLLCDLLMIFSHQLMTGGREGLQPLVFNPDTGLQSELLSFVMDHVFIDQDEENQSMEGDEEDEANKIEALHKRRNLLAAFSKLIIYDIVDMHAAADIFKHYMKYYNDYGDIIKETLSKTRQID 459 T 0.0089 Cnd3 pdb F Eukaryota T 5r1r 2 B,D,F,G D,E,F,P RIR2_ECOLI RIBONUCLEOTIDE REDUCTASE R2 PROTEIN YLVGQIDSEVDTDDLSNFQL 20 T 12 UL11 pdbhh F Bacteria T 5r42 1 A A CTRA_BOVIN gamma-Chymotrypsin CGVPAIQPVLSGL 13 T 2 CaM_bind pdbhh F Eukaryota T 5r42 4 D D peptide SWPW SWPW 4 T 22 SLS pdbhh F F 5r42 5 E E peptide TPGVY TPGVY 5 T 22 RsgA_GTPase pdbhh F F 5r43 1 A A CTRA_BOVIN Chymotrypsinogen A CGVPAIQPVLSGL 13 T 2 CaM_bind pdbhh F Eukaryota T 5r43 4 D D peptide SWPW SWPW 4 T 22 SLS pdbhh F F 5r43 5 E E peptide TPGVY TPGVY 5 T 22 RsgA_GTPase pdbhh F F 5r44 1 A A CTRA_BOVIN Chymotrypsinogen A CGVPAIQPVLSGL 13 T 2 CaM_bind pdbhh F Eukaryota T 5r44 4 D D peptide SWPW SWPW 4 T 22 SLS pdbhh F F 5r44 5 E E peptide TPGVY TPGVY 5 T 22 RsgA_GTPase pdbhh F F 5r45 1 A A CTRA_BOVIN Chymotrypsinogen A CGVPAIQPVLSGL 13 T 2 CaM_bind pdbhh F Eukaryota T 5r45 4 D D peptide SWPW SWPW 4 T 22 SLS pdbhh F F 5r45 5 E E peptide TPGVY TPGVY 5 T 22 RsgA_GTPase pdbhh F F 5r46 1 A A CTRA_BOVIN gamma-chymotrypsin CGVPAIQPVLSGL 13 T 2 CaM_bind pdbhh F Eukaryota T 5r46 4 D D peptide SWPW SWPW 4 T 22 SLS pdbhh F F 5r46 5 E E peptide TPGVY TPGVY 5 T 22 RsgA_GTPase pdbhh F F 5r47 1 A A CTRA_BOVIN gamma-chymotrypsin CGVPAIQPVLSGL 13 T 2 CaM_bind pdbhh F Eukaryota T 5r47 4 D D peptide SWPW SWPW 4 T 22 SLS pdbhh F F 5r47 5 E E peptide TPGVY TPGVY 5 T 22 RsgA_GTPase pdbhh F F 5r48 1 A A CTRA_BOVIN gamma-chymotrypsin CGVPAIQPVLSGL 13 T 2 CaM_bind pdbhh F Eukaryota T 5r48 4 D D peptide SWPW SWPW 4 T 22 SLS pdbhh F F 5r48 5 E E peptide TPGVY TPGVY 5 T 22 RsgA_GTPase pdbhh F F 5r49 1 A A CTRA_BOVIN gamma-chymotrypsin CGVPAIQPVLSGL 13 T 2 CaM_bind pdbhh F Eukaryota T 5r49 4 D D peptide SWPW SWPW 4 T 22 SLS pdbhh F F 5r49 5 E E peptide TPGVY TPGVY 5 T 22 RsgA_GTPase pdbhh F F 5r4a 1 A A CTRA_BOVIN gamma-chymotrypsin CGVPAIQPVLSGL 13 T 2 CaM_bind pdbhh F Eukaryota T 5r4a 4 D D peptide SWPW SWPW 4 T 22 SLS pdbhh F F 5r4a 5 E E peptide TPGVY TPGVY 5 T 22 RsgA_GTPase pdbhh F F 5r4b 1 A A CTRA_BOVIN gamma-chymotrypsin CGVPAIQPVLSGL 13 T 2 CaM_bind pdbhh F Eukaryota T 5r4b 4 D D peptide SWPW SWPW 4 T 22 SLS pdbhh F F 5r4b 5 E E peptide TPGVY TPGVY 5 T 22 RsgA_GTPase pdbhh F F 5r4c 1 A A CTRA_BOVIN gamma-chymotrypsin CGVPAIQPVLSGL 13 T 2 CaM_bind pdbhh F Eukaryota T 5r4c 4 D D peptide SWPW SWPW 4 T 22 SLS pdbhh F F 5r4c 5 E E peptide TPGVY TPGVY 5 T 22 RsgA_GTPase pdbhh F F 5r4d 1 A A CTRA_BOVIN gamma-chymotrypsin CGVPAIQPVLSGL 13 T 2 CaM_bind pdbhh F Eukaryota T 5r4d 4 D D peptide GSWPW GSWPW 5 T 24 DUF1493 pdbhh F F 5r4d 5 E E peptide TPGVY TPGVY 5 T 22 RsgA_GTPase pdbhh F F 5sbg 1 A C METP, miniaturized rubredoxin XYCSDCGADXSQVRGGYCTNCGASXDRIRX 30 T 0.0005 OrfB_Zn_ribbon pdbpssm F T 5sbi 1 A C METP, miniaturized rubredoxin XYCSDCGADXSQVRGGYCTNCGASXDRIRX 30 T 0.0005 OrfB_Zn_ribbon pdbpssm F T 5sbj 1 A C METP, miniaturized rubredoxin XYCSDCGADXSQVRGGYCTNCGASXDRIRX 30 T 0.0005 OrfB_Zn_ribbon pdbpssm F T 5sga 2 B P TETRAPEPTIDE ACE-PRO-ALA-PRO-TYR XPAPY 5 T 120 DUF5617 pdbhh F F 5suj 1 A,B A,B Q5ZTL3_LEGPH Uncharacterized protein MIVRGINMTKIKLESPGFMVHKKLKSMSQSYGVMMTGVPAEVLGQMQAERSIPSINKTGNLKQQIAKEVSKVCHMMTEPTQSCGQASNDVCELLLGKIEAEKFHFTKYEALSADGDNLKNVLENTAPSSTNLLIRFEIDREDPPIVLVKTKNENFNPETAVKNKIYLLENKLYFIDKMGNLFNLGPGKKKCTQLFNAIGDSAEYSLCDPFVLEEPEKPEDFAISEIVDIFNEQKERFDFWIGSHSFTIYIPQTLGESPRQFYPYQAYFGSHTLQDWFVSDKDEYLSRIGIDKYIEKLAVLGKTTNTKERSDIYAEFFSKRGREAFFCAHLNEKRQPLRVKFKITEINPELALKNLQETQEFIDTHPGENPSDKVENYRNRAKLAMTEHLESLLDIKPESS 400 T 0.023 AgrD pdb F Bacteria T 5suq 2 B,D B,D Tex1 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 400 F F F 5suq 3 E,F M,N Tho2, Hpr1, Mft1, and Thp2 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 2300 F F F 5sur 1 A,B,C,D A,B,C,D 16mer A-beta peptide: ORN-CYS-VAL-PHE-MEA-CYS-GLU-ASP-ORN-ALA-ILE-ILE-GLY-LEU-ORN-VAL XCVFXCEDXAIIGLXV 16 T 0.079 Beta-APP pdbhh F T 5sus 1 A,B,C,D A,B,C,D 16mer A-beta peptide: ORN-CYS-VAL-PHE-MEA-CYS-GLU-ASP-ORN-ALA-ILE-ILE-GLY-LEU-ORN-VAL XCVFXCEDXAIIGLXV 16 T 0.079 Beta-APP pdbhh F T 5sut 1 A,B A,B 16mer A-beta peptide: ORN-CYS-VAL-PHE-PHE-CYS-GLU-ASP-ORN-ALA-ILE-ILE-SAR-LEU-ORN-VAL XCVFFCEDXAIIXLXV 16 T 0.079 Beta-APP pdbhh F T 5suu 1 A,B A,B 16mer A-beta peptide: ORN-CYS-VAL-PHE-PHE-CYS-GLU-ASP-ORN-ALA-ILE-ILE-SAR-LEU-ORN-VAL XCVFFCEDXAIIXLXV 16 T 0.079 Beta-APP pdbhh F T 5sve 3 C C NFAC1_HUMAN NFATc1 LxVP peptide DDQYLAVPQHPYQWAKPK 18 T 0.13 IucA_IucC pdb F Eukaryota T 5sw9 2 B B CDCA2_HUMAN RepoMan RDIASKKPLLSPIPELPEVPE 21 T 7.1 Fapy_DNA_glyco pdbhh F Eukaryota T 5swf 2 B B BUB1B_HUMAN Double phosphorylated BubR1 KLSPIIEDS 9 T 6.6 TBCC pdbhh F Eukaryota T 5sxm 2 C,D D,C ACE-ALA-ARG-THR-GLU-VAL-TYR-NH2 XARTEVYX 8 T 14 TcdB_toxin_midC pdbhh F T 5sxp 2 E,F F,G ITCH_HUMAN ITCH,ATROPHIN-1-INTERACTING PROTEIN 4,AIP4,NFE2-ASSOCIATED POLYPEPTIDE 1,NAPP1 GSGGGKPSRPPRPSRPPPPTPRRPASY 27 T 3.7 UPF0449 pdbhh F Eukaryota T 5syq 1 A A Y1974_AQUAE Uncharacterized protein aq_1974 GSEEKEEKKVRELTPQELELFKRAMGITPHNYWQWASRTNNFKLLTDGEWVWVEGYEEHIGKQLPLNQARAWSWEFIKNRLKELNL 86 T 0.042 DUF3621 unp F Bacteria T 5szx 3 C,D A,B BZLF1_EBVB9 EB1,ZEBRA LEIKRYKNRVASRKSRAKFKQLLQHYREVAAAKSSENDRLRLLLKQMCPSLDVDSIIPRTPD 62 T 0.0011 bZIP_2 pdb T Viruses T 5t0f 2 B B TIF9_ARATH JASMONATE ZIM DOMAIN-CONTAINING PROTEIN 10,PROTEIN JASMONATE-ASSOCIATED 1,PROTEIN JAZ10 KQTNNAPKPKFQKFLDRRRSFRDIQGAISKIDPEIIKSLLAST 43 T 2.8 DUF3666 pdbhh F Eukaryota T 5t0x 2 B,C B,C ESR1_HUMAN ER,ER-ALPHA,ESTRADIOL RECEPTOR,NUCLEAR RECEPTOR SUBFAMILY 3 GROUP A MEMBER 1 RAANLWPSPLMIKRSKKNS 19 T 4 Tom5 pdbhh F Eukaryota T 5t1k 4 E,F E,F CQFDA(PH)2STRRLKC PEPTIDE CQFDXSTRRLKC 12 T 7.2 YsaB pdbhh F T 5t1l 4 E,F E,F CYCLIC MEDITOPE CQA(Ph)2DLSTRRLKC CQXDLSTRRLKC 12 T 3.3 DUF2089 pdbhh F T 5t1m 3 E,F,G E,F,H CYCLIC PEPTIDE CQYDLSTRRLKC CQYDLSTRRLKC 12 T 7.6 DUF1254 pdbhh F T 5t2s 2 B,D B,D CDC7_YEAST ASP-GLY-GLU-SER-TPO-ASP-GLU-ASP-ASP DGESTDEDDVVS 12 T 2.2 CRF1 pdbhh F Eukaryota T 5t47 2 B,D B,D O61380_DROME EUKARYOTIC TRANSLATION INITIATION FACTOR 4G,ISOFORM C,FI02056P,TRANSLATION INITIATION FACTOR EIF4G GPHMSIINYNEGQWSPNNPSGKKQYDREQLLQLREVKASRIQPEVKNVSILPQPNLMPSFIRNN 64 T 0.00012 eIF_4G1 pdbhh F Eukaryota T 5t48 2 B B O61380_DROME EUKARYOTIC TRANSLATION INITIATION FACTOR 4G,ISOFORM C,FI02056P,TRANSLATION INITIATION FACTOR EIF4G GPHMSIINYNEGQWSPNNPSGKKQYDREQLLQLREVKASRIQPEVKNVSILPQPNLMPSFIRNN 64 T 0.00012 eIF_4G1 pdbhh F Eukaryota T 5t56 1 A,C A,C MCJA_ECOLX MCCJ25 GGAGHVPEYFVR 12 T 0.13 Endonuc-BglII unp F Bacteria T 5t56 2 B,D B,D MCJA_ECOLX MCCJ25 CGTPISFYC 9 T 0.24 KSHV_K1 pdbhh F Bacteria T 5t5j 2 C,D a,b TN ANTIGEN ACA-SER-SER-VAL-GLY XSSVG 5 T 330 Antimicrobial24 pdbhh F F 5t5l 2 C,D a,b TN ANTIGEN ACE-SER-SER-VAL-GLY XSSVG 5 T 330 Antimicrobial24 pdbhh F F 5t5o 2 B,D,F,H,J,L,N,P,R,T a,b,c,d,e,f,g,h,i,j TN-peptide ACE-GLY-VAL-THR-SER-ALA XGVTSA 6 T 180 GIDA_C pdbhh F T 5t5p 2 C,D a,b TN ANTIGEN ACE-SER-THR-VAL-GLY XSTVG 5 T 410 RDD pdbhh F F 5t62 45 SA S Ribosomal Protein uL1 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 217 F F F 5t63 2 B C ALA-ALA-ALA-ALA AAAA 4 T 900 Cyclin_C pdbhh F F 5t6p 3 E,F E,F MUC1_HUMAN MUC1 Peptide Fragment APDTRPAP 8 T 8.6 Antimicrobial10 pdbhh F Eukaryota F 5t6r 44 RA S Ribosomal Protein uL1 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 210 F F F 5t6x 3 C C TTLL4_HUMAN Decapeptide: THR-SER-THR-THR-SER-VAL-ALA-SER-SER-TRP TSTTSVASSW 10 T 33 CCSMST1 pdbhh F Eukaryota F 5t6y 3 C C THADA_HUMAN Decapeptide: THR-SER-THR-PHE-GLU-ASP-VAL-LYS-ILE-LEU-ALA-PHE TSTFEDVKILAF 12 T 7.3 Peroxin-3 pdbhh F Eukaryota T 5t6z 3 C C POL_HV1B1 Decapeptide: THR-SER-THR-LEU-GLN-GLU-GLN-ILE-GLY-TRP TSTLQEQIGW 10 T 3.2 Red1 pdbhh T Viruses T 5t70 4 D C POL_HV1B1 Decapeptide: THR-SER-ASN-LEU-GLN-GLU-GLN-ILE-GLY-TRP TSNLQEQIGW 10 T 3.3 Red1 pdbhh T Viruses T 5t78 3 C,F F,E MUC1_HUMAN MUC1 Glycopeptide APDTRPAP 8 T 8.6 Antimicrobial10 pdbhh F Eukaryota F 5t7a 1 A,B A,B Q9KG76_BACHD BH0236 protein MGSSHHHHHHSSGLVPRGSHMASQGNGDSHTHPDYTAGIRGITGNEVTIFFAPTTEARYVDVHLKVNNGQQLNYRMTERNGEWERVVENLSSGDVLEYSFTYEKLGPQYTTEWFTYSR 118 T 0.0019 CBM_48 pdb F Bacteria T 5t7q 1 A A TIRAP_HUMAN TIR DOMAIN-CONTAINING ADAPTER PROTEIN,ADAPTOR PROTEIN WYATT,MYD88 ADAPTER-LIKE PROTEIN,MYD88-2 KKPLGKMADWFRQTLLKKPKK 21 T 1 Hfx_Cass5 pdbhh F Eukaryota T 5t86 1 A A Q1RPM1_ECOLX CdiA toxin IEQILKPEKNWETARNKALDLVGNLGADSKPVIGRLEVSAGNGKVIGRQSSDGKVGWRVDYDPEKGTHINIWDYSQGKGPGKAVKQVIPFEGNEKSFETILKQLNR 106 T 25 DUF1818 pdbhh F Bacteria T 5t86 2 B I A0A0B0W5A7_ECOLX CdiI immunity protein MTLFDECREALSADFNIVEGLAQQEALGILNKYPLAKGSVTWSEIRHSDYESFDELLSANSVKNDDMFVFADDASIPVFRSNLRLIAENIYDVTALSPKLFIFNDEVIIQPLFPTDMFRLGIKKHHHHHH 130 T 0.0023 DUF2947 unphh F Bacteria T 5t87 1 A,B,C,D A,B,C,D B3R1C2_CUPTR CdiI immunity protein MTMRYQEPARIPNAEIDHVLASGNPEAIADACLSIAYYEDDWEWAFKRLKSVAFDLNRPDSLRSLAVTCVGHLARRIHDLDVAMAEEFLLSLGGDQAVASAASDALDDLRIFRMSD 116 T 0.0029 HEAT_2 pdbpercent F Bacteria T 5t87 2 E,F,G,H E,F,G,H B3R1C1_CUPTR CdiA toxin SRGPSNGQSVLENSVQVKETSPRRVSVDPQTGEFVVFDRTLGDVYHGHVRAWKDLTSDMQNALVRGGYVDRKGNPK 76 T 0.02 DUF3945 pdbhh F Bacteria T 5tce 1 A A PYRD_HUMAN DHODEHASE,DIHYDROOROTATE OXIDASE GDERFYAEHLMPTLQGLLDPESAHRLAVRFTSLGX 35 T 1.1 DUF2240 pdbhh F Eukaryota T 5tda 2 B B ARG-LEU-TRP-SER peptide RLWS 4 T 22 WD40 pdbhh F F 5tdb 2 B B DA2-ILE-PHE-SER peptide XIFS 4 T 65 eIF3m_C_helix pdbhh F F 5tdc 2 C,D B,D NMM-ILE-PHE-SER peptide XIFS 4 T 65 eIF3m_C_helix pdbhh F F 5tdd 2 B B HIS-ILE-PHE-SER peptide HIFS 4 T 77 DUF1524 pdbhh F F 5ted 2 C H His Tag peptide GAYGAGLAH 9 T 3 Biopterin_H pdbhh F F 5teg 2 C,D D,E H4_HUMAN Histone H4 mutant peptide with H4K20norleucine KRHRXVLR 8 T 0.27 UPF0137 unp F Eukaryota T 5tfp 1 A,B A,B SETB2_HUMAN CHRONIC LYMPHOCYTIC LEUKEMIA DELETION REGION GENE 8 PROTEIN,LYSINE N-METHYLTRANSFERASE 1F,SET DOMAIN BIFURCATED 2 MGEKNGDAKTFWMELEDDGKVDFIFEQVQNVLQSLKQKIKDGSATNKEYIQAMILVNEATIINS 64 T 0.01 Pectinesterase pdb F Eukaryota T 5tga 83 XD m2 60S Ribosomal Protein L12 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 150 F F F 5tga 85 EF,FF p1,p2 60S Ribosomal Protein P1/2 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 47 F F F 5tgq 1 A A A0A1S4NYF7_STAWA R.SwaI protein MNFKKYEENLVASIEEVIQRIIDDKHRPNIIGKTRVGAEVSDYLEDEFVKYISSGKSSSLYDAQGAPKEKTKNPWDARCKFKFMDREEEIWIDFKAFKITNMDSNPDIGTPNKIVKFIHEGNFYLVFVLVYYESKQDGVEFVKYNNDYKKVYLLKDVNESFRINPKPQMQVNIAAEPTYRTREEFIHFFVKKWKESFERQIKSLEKKEIMLKDLEDKLKNSNDNSI 226 T 0.013 CK2S unppercent F Bacteria T 5tgx 1 A,B,C,D A,B,C,D A0A1S4NYF7_STAWA R-SwaI protein MNFKKYEENLVASIEEVIQRIIDDKHRPNIIGKTRVGAEVSDYLEDEFVKYISSGKSSSLYDAQGAPKEKTKNPWDARCKFKFMDREEEIWIDFKAFKITNMDSNPDIGTPNKIVKFIHEGNFYLVFVLVYYESKQDGVEFVKYNNDYKKVYLLKDVNESFRINPKPQMQVNIAAEPTYRTREEFIHFFVKKWKESFERQIKSLEKKEIMLKDLEDKLKNSNDNSI 226 T 0.013 CK2S unppercent F Bacteria T 5th2 3 E,F E,F L5Q meditope CQFDQSTRRLKC 12 T 7.1 PriA_CRR pdbhh F T 5th3 1 A,B,C,D A,B,C,D A0A1S4NYF7_STAWA R-SwaI protein MNFKKYEENLVASIEEVIQRIIDDKHRPNIIGKTRVGAEVSDYLEDEFVKYISSGKSSSLYDAQGAPKEKTKNPWDARCKFKFMDREEEIWIDFKAFKITNMDSNPDIGTPNKIVKFIHEGNFYLVFVLVYYESKQDGVEFVKYNNDYKKVYLLKDVNESFRINPKPQMQVNIAAEPTYRTREEFIHFFVKKWKESFERQIKSLEKKEIMLKDLEDKLKNSNDNSI 226 T 0.013 CK2S unppercent F Bacteria T 5tj1 1 A A V4RMX4_9CAUL Benenodin-1 GVGFGRPDSILTQEQAKPM 19 T 0.15 DUF5974 unphh F Bacteria T 5tj5 6 M O V-type proton ATPase subunit f XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 54 F F F 5tja 1 A A MCLN1_HUMAN MG-2,MUCOLIPIDIN GSGLSNQLAVTFREENTIAFRHLFLLGYSDGADDTFAAYTREQLYQAIFHAVDQYLALPDVSLGRYAYVRGGGDPWTNGSGLALCQRYYHRGHVDPANDTFDIDPMVVTDCIQVDPPERPPPPPSDDLTLLESSSSYKNLTLKFHKLVNVTIHFRLKTINLQSLINNEIPDCYTFSVLITFDNKAHSGRIPISLETQAHIQECKHPSVFQHGDNSLEHHHHHH 223 T 0.08 DUF1866 pdb F Eukaryota T 5tjb 1 A A MCLN1_HUMAN MG-2,MUCOLIPIDIN GSGLSNQLAVTFREENTIAFRHLFLLGYSDGADDTFAAYTREQLYQAIFHAVDQYLALPDVSLGRYAYVRGGGDPWTNGSGLALCQRYYHRGHVDPANDTFDIDPMVVTDCIQVDPPERPPPPPSDDLTLLESSSSYKNLTLKFHKLVNVTIHFRLKTINLQSLINNEIPDCYTFSVLITFDNKAHSGRIPISLETQAHIQECKHPSVFQHGDNSLEHHHHHH 223 T 0.08 DUF1866 pdb F Eukaryota T 5tjc 1 A A MCLN1_HUMAN MG-2,MUCOLIPIDIN GSGLSNQLAVTFREENTIAFRHLFLLGYSDGADDTFAAYTREQLYQAIFHAVDQYLALPDVSLGRYAYVRGGGDPWTNGSGLALCQRYYHRGHVDPANDTFDIDPMVVTDCIQVDPPERPPPPPSDDLTLLESSSSYKNLTLKFHKLVNVTIHFRLKTINLQSLINNEIPDCYTFSVLITFDNKAHSGRIPISLETQAHIQECKHPSVFQHGDNSLEHHHHHH 223 T 0.08 DUF1866 pdb F Eukaryota T 5tkj 3 C,F,I,L C,F,I,L ENV_HV1H2 HIV-1 fusion peptide residue 512-519 AVGIGAVF 8 T 4.2 DUF3918 pdbhh T Viruses F 5tkk 1 A A ENV_HV1H2 HIV-1 fusion peptide residue 512-519 AVGIGAVF 8 T 4.2 DUF3918 pdbhh T Viruses F 5tmc 6 G Z unknown protein XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 48 F F F 5tp6 2 B B NOS2_HUMAN HEPATOCYTE NOS,HEP-NOS,INDUCIBLE NO SYNTHASE,INOS,NOS TYPE II,PEPTIDYL-CYSTEINE S-NITROSYLASE NOS2 AGHMRPKRREIPLKVLVKAVLFACMLMRK 29 T 1.2 DUF488 unppercent F Eukaryota T 5tq1 2 B B INSR_RAT Insulin receptor PSSVXVPDEWE 11 T 9.2 RHH_1 pdbhh F Eukaryota T 5tqs 2 E,F,G,H E,F,H,G ERBB2_HUMAN Receptor protein-tyrosine kinase DNLYXWDQDPP 11 T 0.65 DUF2093 pdbhh F Eukaryota T 5tsc 1 A,B A,B Q5ZTL4_LEGPH Uncharacterized protein GMTTSKLEKTGLHVHEKIKHMVKNYGTMITGIPAEILGQNEAEISVGYVKKMGNMKENIAEVVRKSEMTQPTNSAGKASNEVCDLLLGTEGASEFEKSSYQVLSGDGSNLKGSLPNKNLLVRVEMDRFNAPQKYQKIKREEFNPETAEKNKIYLLEDQLVYLDIFGKVIDLGQTSDTCHRLFNAITTPFYQNYILYDEYIDPEESAEEAAMFEMGEIVKAKMKNIDCWTATHSFTIFVPESDSEDTRTLYPYQAYWTSHTLQQWFSGDKDEKLSRLGIDGYIEKLALLGTTTDSKIRSSIYGELFSPPGKEHVFCTGMNEKFSPLRVKFKVTEVNPEIALQNLEEVQEFIDTNYPGENAKDQCELYKIKAQEAMTKQLEMRLLIEPTHRKESVTPYKEKNTQSFFSKNLDTTSRIDKMISSVITMENLKILAKEADTLSGKDREKLVEYFKTLPSEQLAEIKA 463 T 0.035 V-ATPase_H_N unppercent F Bacteria T 5ttw 2 B,D B,D UNC4859 PSKFX 5 T 69 CHIPS pdbhh F F 5tu5 2 B B TYR-TYR-TYR YYY 3 T 91 DUF3890 pdbhh F F 5tu6 2 B B cyclic[INPYLYP] peptide INPYLYP 7 T 0.64 7tm_1 pdbhh F F 5tvz 1 A A PO152_YEAST NUCLEAR PORE PROTEIN POM152,P150,PORE MEMBRANE PROTEIN POM152 MSLRVKPSASLKLHHDLKLCLGDHSSVPVALKGQGPFTLTYDIIETFSSKRKTFEIKEIKTNEYVIKTPVFTTGGDYILSLVSIKDSTGCVVGLSQPDAKIQVRRDEGHHHHHH 114 T 0.0071 PKD_4 pdbpssm F Eukaryota T 5tw1 9 K G Unknown peptide XXXXXXXXXXXXXXXXX 17 F F F 5twg 2 B E STK4_HUMAN T353 peptide VASTMTDGANTMIEP 15 T 13 BBS2_N pdbhh F Eukaryota T 5twh 2 B E STK4_HUMAN T367 peptide DDTLPSQLGTMVINAED 17 T 1.7 OspE pdbhh F Eukaryota T 5twi 1 A A Cyclic tetrapeptide ALA-ARG-ALA-UN1 ARAXX 5 T 1300 Endonuclease_1 pdbhh F F 5two 2 B B PRGC1_HUMAN PRO-SER-LEU-LEU-LYS-LYS-LEU-LEU-LEU-ALA-PRO AEEPSLLKKLLLAPA 15 T 5.4 DUF1467 pdbhh F Eukaryota T 5tww 1 A A Cyclic peptide ALA-ALA-ALA-UN1-ALA-ARG-ALA-ALA-ARG-ALA-ALA-ARG-ALA AAAXARAARAARAX 14 T 62 AP1AR pdbhh F F 5tx1 3 N O A4ZKM1_9ADEN Fiber AKRLRVEDDFNPVYPYGYA 19 T 0.48 DUF5449 pdbhh T Viruses T 5tx1 8 EA,FA,GA X,Y,Z Unknown XXXXXXXXXX 10 F F F 5tx8 1 A A HH2 AEDCERIRKELEKNPNDEIKKKLEKCQA 28 T 2.4 UPF0228 pdbhh F T 5txe 2 B,D C,D E8RUP8_ASTEC Astexin3-dC4 GPTPMVGLDSVSGQYWDQHAPLAD 24 T 2.1 Cut12 pdbhh F Bacteria T 5txh 1 A,B,C,D A,B,C,D IFAEDV IFAEDV 6 T 7.9 EBV-NA1 pdbhh F T 5txj 1 A,B A,B amyloid-beta derived peptide IFAEDV 6 T 7.9 EBV-NA1 pdbhh F T 5txs 3 C C anapestic lymphoma kinase-derived neuroblastoma tumor antigen AQDIYRASY 9 T 0.21 ChaB pdbhh F T 5tyi 2 E,F,G,H L,M,N,P Peptide inhibitor KFEGXDNEX 9 T 33 DUF5840 pdbhh F T 5tzs 17 Q H Utp4 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 544 F F F 5tzs 18 R I UtpA_CTD1 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 176 F F F 5tzs 19 S,T J,K UtpA_CTD2 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 107 F F F 5tzs 20 U M Beta-propeller 2 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 258 F F F 5tzs 21 V N Utp17 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 545 F F F 5tzs 22 W O Utp1 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 638 F F F 5tzs 23 X P Utp6 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 306 F F F 5tzs 24 Y Q Utp12 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 710 F F F 5tzs 25 Z R Utp13 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 717 F F F 5tzs 26 AA S Utp18 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 250 F F F 5tzs 28 CA U Beta-propeller 5 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 284 F F F 5tzs 29 DA V Enp2 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 263 F F F 5tzs 30 EA W UtpA_CTD4 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 104 F F F 5tzs 31 FA X Kre33 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 640 F F F 5tzs 32 GA Y Kre33 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 641 F F F 5tzs 33 HA Z Imp3 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 151 F F F 5tzs 34 IA a Nop56 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 312 F F F 5tzs 35 JA b Nop58 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 341 F F F 5tzs 36 KA c Nop1 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 221 F F F 5tzs 37 LA d Nop1 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 216 F F F 5tzs 41 QA i BMS1_YEAST Bms1,Ribosome biogenesis protein BMS1,Bms1 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXWNIGKLIYMDNISPEECIRRWRGEDDDSKDESDIEEDVDDDFFRKKDGTVTKEGNKDHAVDLEKFVPYFDTFEKLAKKWKSVDAIKERFLXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 511 T 0.12 Inhibitor_I67 pdbpssm F Eukaryota T 5tzs 43 TA l Utp24 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 124 F F F 5tzs 44 UA m Imp4 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 156 F F F 5tzs 45 VA n Utp30 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 160 F F F 5tzs 46 WA o Unassigned KH domain XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 175 F F F 5tzs 47 XA p Utp20 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 924 F F F 5tzs 48 YA q Repeat protein 2 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 372 F F F 5tzs 50 AB,BB,CB s,t,u Beta-propeller 1 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 290 F F F 5tzs 51 DB v Repeat protein 1 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 580 F F F 5tzs 52 EB y Unassigned protein helices XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 507 F F F 5u06 2 E,F,G,H L,M,N,P bicyclic peptide inhibitor: LYS-PHE-GLU-GLY-CMF-ASP-ASN-GLU-CST KFEGXDNEX 9 T 1.1 YfbU pdbhh F T 5u0p 9 I D Mediator complex subunit 4 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 239 F F F 5u0p 15 O S Mediator complex subunit 19 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 139 F F F 5u0p 16 P J Mediator complex subunit 10 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 132 F F F 5u0s 9 I D Mediator complex subunit 4 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 239 F F F 5u0s 15 O S Mediator complex subunit 19 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 139 F F F 5u0s 16 P J Mediator complex subunit 10 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 132 F F F 5u1m 2 B B INSR_HUMAN IR XLYASSNPAX 10 T 6.8 MPS-4 pdbhh F Eukaryota T 5u1q 2 E,F,G,H L,M,N,P LYS-PHE-GLU-GLY-TYR-ASP-ASN-GLU-CST KFEGYDNEX 9 T 4.1 Crr6 pdbhh F T 5u30 1 A A C2C1_ALIAG AACC2C1 SMAVKSIKVKLRLDDMPEIRAGLWKLHKEVNAGVRYYTEWLSLLRQENLYRRSPNGDGEQECDKTAEECKAELLERLRARQVENGHRGPAGSDDELLQLARQLYELLVPQAIGAKGDAQQIARKFLSPLADKDAVGGLGIAKAGNKPRWVRMREAGEPGWEEEKEKAETRKSADRTADVLRALADFGLKPLMRVYTDSEMSSVEWKPLRKGQAVRTWDRDMFQQAIERMMSWESWNQRVGQEYAKLVEQKNRFEQKNFVGQEHLVHLVNQLQQDMKEASPGLESKEQTAHYVTGRALRGSDKVFEKWGKLAPDAPFDLYDAEIKNVQRRNTRRFGSHDLFAKLAEPEYQALWREDASFLTRYAVYNSILRKLNHAKMFATFTLPDATAHPIWTRFDKLGGNLHQYTFLFNEFGERRHAIRFHKLLKVENGVAREVDDVTVPISMSEQLDNLLPRDPNEPIALYFRDYGAEQHFTGEFGGAKIQCRRDQLAHMHRRRGARDVYLNVSVRVQSQSEARGERRPPYAAVFRLVGDNHRAFVHFDKLSDYLAEHPDDGKLGSEGLLSGLRVMSVALGLRTSASISVFRVARKDELKPNSKGRVPFFFPIKGNDNLVAVHERSQLLKLPGETESKDLRAIREERQRTLRQLRTQLAYLRLLVRCGSEDVGRRERSWAKLIEQPVDAANHMTPDWREAFENELQKLKSLHGICSDKEWMDAVYESVRRVWRHMGKQVRDWRKDVRSGERPKIRGYAKDVVGGNSIEQIEYLERQYKFLKSWSFFGKVSGQVIRAEKGSRFAITLREHIDHAKEDRLKKLADRIIMEALGYVYALDERGKGKWVAKYPPCQLILLAELSEYQFNNDRPPSENNQLMQWSHRGVFQELINQAQVHDLLVGTMYAAFSSRFDARTGAPGIRCRRVPARCTQEHNPEPFPWWLNKFVVEHTLDACPLRADDLIPTGEGEIFVSPFSAEEGDFHQIHAALNAAQNLQQRLWSDFDISQIRLRCDWGEVDGELVLIPRLTGKRTADSYSNKVFYTNTGVTYYERERGKKRRKVFAQEKLSEEEAELLVEADEAREKSVVLMRDPSGIINRGNWTRQKEFWSMVNQRIEGYLVKQIRSRVPLQDSACENTGDI 1130 T 0.0038 RuvC_1 unphh F Bacteria T 5u31 1 A A C2C1_ALIAG AACC2C1 SMAVKSIKVKLRLDDMPEIRAGLWKLHKEVNAGVRYYTEWLSLLRQENLYRRSPNGDGEQECDKTAEECKAELLERLRARQVENGHRGPAGSDDELLQLARQLYELLVPQAIGAKGDAQQIARKFLSPLADKDAVGGLGIAKAGNKPRWVRMREAGEPGWEEEKEKAETRKSADRTADVLRALADFGLKPLMRVYTDSEMSSVEWKPLRKGQAVRTWDRDMFQQAIERMMSWESWNQRVGQEYAKLVEQKNRFEQKNFVGQEHLVHLVNQLQQDMKEASPGLESKEQTAHYVTGRALRGSDKVFEKWGKLAPDAPFDLYDAEIKNVQRRNTRRFGSHDLFAKLAEPEYQALWREDASFLTRYAVYNSILRKLNHAKMFATFTLPDATAHPIWTRFDKLGGNLHQYTFLFNEFGERRHAIRFHKLLKVENGVAREVDDVTVPISMSEQLDNLLPRDPNEPIALYFRDYGAEQHFTGEFGGAKIQCRRDQLAHMHRRRGARDVYLNVSVRVQSQSEARGERRPPYAAVFRLVGDNHRAFVHFDKLSDYLAEHPDDGKLGSEGLLSGLRVMSVALGLRTSASISVFRVARKDELKPNSKGRVPFFFPIKGNDNLVAVHERSQLLKLPGETESKDLRAIREERQRTLRQLRTQLAYLRLLVRCGSEDVGRRERSWAKLIEQPVDAANHMTPDWREAFENELQKLKSLHGICSDKEWMDAVYESVRRVWRHMGKQVRDWRKDVRSGERPKIRGYAKDVVGGNSIEQIEYLERQYKFLKSWSFFGKVSGQVIRAEKGSRFAITLREHIDHAKEDRLKKLADRIIMEALGYVYALDERGKGKWVAKYPPCQLILLAELSEYQFNNDRPPSENNQLMQWSHRGVFQELINQAQVHDLLVGTMYAAFSSRFDARTGAPGIRCRRVPARCTQEHNPEPFPWWLNKFVVEHTLDACPLRADDLIPTGEGEIFVSPFSAEEGDFHQIHAALNAAQNLQQRLWSDFDISQIRLRCDWGEVDGELVLIPRLTGKRTADSYSNKVFYTNTGVTYYERERGKKRRKVFAQEKLSEEEAELLVEADEAREKSVVLMRDPSGIINRGNWTRQKEFWSMVNQRIEGYLVKQIRSRVPLQDSACENTGDI 1130 T 0.0038 RuvC_1 unphh F Bacteria T 5u33 1 A A C2C1_ALIAG AACC2C1 SMAVKSIKVKLRLDDMPEIRAGLWKLHKEVNAGVRYYTEWLSLLRQENLYRRSPNGDGEQECDKTAEECKAELLERLRARQVENGHRGPAGSDDELLQLARQLYELLVPQAIGAKGDAQQIARKFLSPLADKDAVGGLGIAKAGNKPRWVRMREAGEPGWEEEKEKAETRKSADRTADVLRALADFGLKPLMRVYTDSEMSSVEWKPLRKGQAVRTWDRDMFQQAIERMMSWESWNQRVGQEYAKLVEQKNRFEQKNFVGQEHLVHLVNQLQQDMKEASPGLESKEQTAHYVTGRALRGSDKVFEKWGKLAPDAPFDLYDAEIKNVQRRNTRRFGSHDLFAKLAEPEYQALWREDASFLTRYAVYNSILRKLNHAKMFATFTLPDATAHPIWTRFDKLGGNLHQYTFLFNEFGERRHAIRFHKLLKVENGVAREVDDVTVPISMSEQLDNLLPRDPNEPIALYFRDYGAEQHFTGEFGGAKIQCRRDQLAHMHRRRGARDVYLNVSVRVQSQSEARGERRPPYAAVFRLVGDNHRAFVHFDKLSDYLAEHPDDGKLGSEGLLSGLRVMSVALGLRTSASISVFRVARKDELKPNSKGRVPFFFPIKGNDNLVAVHERSQLLKLPGETESKDLRAIREERQRTLRQLRTQLAYLRLLVRCGSEDVGRRERSWAKLIEQPVDAANHMTPDWREAFENELQKLKSLHGICSDKEWMDAVYESVRRVWRHMGKQVRDWRKDVRSGERPKIRGYAKDVVGGNSIEQIEYLERQYKFLKSWSFFGKVSGQVIRAEKGSRFAITLREHIDHAKEDRLKKLADRIIMEALGYVYALDERGKGKWVAKYPPCQLILLAELSEYQFNNDRPPSENNQLMQWSHRGVFQELINQAQVHDLLVGTMYAAFSSRFDARTGAPGIRCRRVPARCTQEHNPEPFPWWLNKFVVEHTLDACPLRADDLIPTGEGEIFVSPFSAEEGDFHQIHAALNAAQNLQQRLWSDFDISQIRLRCDWGEVDGELVLIPRLTGKRTADSYSNKVFYTNTGVTYYERERGKKRRKVFAQEKLSEEEAELLVEADEAREKSVVLMRDPSGIINRGNWTRQKEFWSMVNQRIEGYLVKQIRSRVPLQDSACENTGDI 1130 T 0.0038 RuvC_1 unphh F Bacteria T 5u34 1 A A C2C1_ALIAG AACC2C1 SMAVKSIKVKLRLDDMPEIRAGLWKLHKEVNAGVRYYTEWLSLLRQENLYRRSPNGDGEQECDKTAEECKAELLERLRARQVENGHRGPAGSDDELLQLARQLYELLVPQAIGAKGDAQQIARKFLSPLADKDAVGGLGIAKAGNKPRWVRMREAGEPGWEEEKEKAETRKSADRTADVLRALADFGLKPLMRVYTDSEMSSVEWKPLRKGQAVRTWDRDMFQQAIERMMSWESWNQRVGQEYAKLVEQKNRFEQKNFVGQEHLVHLVNQLQQDMKEASPGLESKEQTAHYVTGRALRGSDKVFEKWGKLAPDAPFDLYDAEIKNVQRRNTRRFGSHDLFAKLAEPEYQALWREDASFLTRYAVYNSILRKLNHAKMFATFTLPDATAHPIWTRFDKLGGNLHQYTFLFNEFGERRHAIRFHKLLKVENGVAREVDDVTVPISMSEQLDNLLPRDPNEPIALYFRDYGAEQHFTGEFGGAKIQCRRDQLAHMHRRRGARDVYLNVSVRVQSQSEARGERRPPYAAVFRLVGDNHRAFVHFDKLSDYLAEHPDDGKLGSEGLLSGLRVMSVDLGLRTSASISVFRVARKDELKPNSKGRVPFFFPIKGNDNLVAVHERSQLLKLPGETESKDLRAIREERQRTLRQLRTQLAYLRLLVRCGSEDVGRRERSWAKLIEQPVDAANHMTPDWREAFENELQKLKSLHGICSDKEWMDAVYESVRRVWRHMGKQVRDWRKDVRSGERPKIRGYAKDVVGGNSIEQIEYLERQYKFLKSWSFFGKVSGQVIRAEKGSRFAITLREHIDHAKEDRLKKLADRIIMEALGYVYALDERGKGKWVAKYPPCQLILLAELSEYQFNNDRPPSENNQLMQWSHRGVFQELINQAQVHDLLVGTMYAAFSSRFDARTGAPGIRCRRVPARCTQEHNPEPFPWWLNKFVVEHTLDACPLRADDLIPTGEGEIFVSPFSAEEGDFHQIHADLNAAQNLQQRLWSDFDISQIRLRCDWGEVDGELVLIPRLTGKRTADSYSNKVFYTNTGVTYYERERGKKRRKVFAQEKLSEEEAELLVEADEAREKSVVLMRDPSGIINRGNWTRQKEFWSMVNQRIEGYLVKQIRSRVPLQDSACENTGDI 1130 T 0.0038 RuvC_1 unphh F Bacteria T 5u4k 2 B B TF65_HUMAN NUCLEAR FACTOR NF-KAPPA-B P65 SUBUNIT,NUCLEAR FACTOR OF KAPPA LIGHT POLYPEPTIDE GENE ENHANCER IN B-CELLS 3 GSPGYPNGLLSGDEDFSSIADMDFSALLSQISS 33 T 17 Orthopox_F14 pdbhh F Eukaryota T 5u4w 3 G,I,K G,I,K A0A024B7W1_ZIKV Protein E GALNSLGKGIHQIFGAAFKSLFGGMSWFSQILIGTLLMWLGLNTKNGSISLMCLALGGVLIFLSTA 66 T 0.33 COPI_assoc pdb T Viruses T 5u5c 1 A,B,C,D,E,F A,B,C,D,E,F Designed tetrameric coiled coil peptide with one terpyridine side chain XELAAIKEELAAIKXELAAIKQELAAIKQX 30 T 0.00064 DUF5320 pdbhh F T 5u5f 4 D D 5-DIPHENYL LONG MEDITOPE XCQFDXSTRRLRCGGSK 17 T 2.5 Flavi_NS1 pdbhh F T 5u5m 5 E D AZIDO-PEG4-MEDITOPE XCQFDXSTXRLRC 13 T 4.7 DUF6464 pdbhh F T 5u5p 1 A,B C,B MLH1_HUMAN MUTL PROTEIN HOMOLOG 1 SNPRKRHRED 10 T 4.8 T_cell_tran_alt pdbhh F Eukaryota T 5u5r 2 B B PMS2_HUMAN DNA MISMATCH REPAIR PROTEIN PMS2,PMS1 PROTEIN HOMOLOG 2 TPNTKRFKKEE 11 T 6.3 Nbs1_C pdbhh F Eukaryota T 5u66 1 A B STAPLED PEPTIDE FROM DOMAIN B OF PROTEIN A XFNMXQQRRFYXALH 15 T 2.1 B pdbhh F T 5u6a 5 E D meditope peptide XCQFDXSTXRLRCG 14 T 3.4 zinc_ribbon_12 pdbhh F T 5u6p 2 B,D,F,H E,F,G,H BRAIN CYCLIC NUCLEOTIDE-GATED CHANNEL 1 XXXXXXXXXXXXXXXXXXX 19 F F F 5u75 1 A A G0Z026_STAAU Enterotoxin-like toxin X STQNSSSVQDKQLQKVEEVPNNSEKALVKKLYDRYSKDTINGKSNKSRNWVYSERPLNENQVRIHLEGTYTVAGRVYTPKRNITLNKEVVTLKELDHIIRFAHISYGLYMGEHLPKGNIVINTKNGGKYTLESHKELQKNRENVEINTDDIKNVTFELVKSVNDIEQV 168 T 0.00015 Stap_Strp_tox_C unppercent F Bacteria T 5u96 1 A,B,C,D,E,F,G,H A,B,C,D,E,F,G,H Q928V6_LISIN Putative integrase ASLNEKLKIEHAKKKRLFDLYINGSYEVSELDSMMNDIDAQINYYEAQIEAN 52 T 0.00093 AAA_23 unppercent F Bacteria T 5u98 3 C,F C,F SPT5H_HUMAN VAL-THR-THR-ASP-ILE-GLN-VAL-LYS-VAL VTTDIQVKV 9 T 1.9 DUF460 pdbhh F Eukaryota T 5uae 1 A,B,C,D A,B,C,D Q928V6_LISIN Putative integrase KEDELDSLNEKLKIEHAKKKRLFDLYINGSYEVSELDSMMNDIDAQINYYEAQIEANEELKK 62 T 0.00093 AAA_23 unppercent F Bacteria T 5uak 2 B R CFTR,ATP-BINDING CASSETTE SUB-FAMILY C MEMBER 7,CHANNEL CONDUCTANCE-CONTROLLING ATPASE,CAMP-DEPENDENT CHLORIDE CHANNEL XXXXXXXXXXXXXXXXXXX 19 F F F 5ud5 1 A,B A,B PYLS_METMA PYRROLYSINE--TRNA(PYL) LIGASE,PYRROLYSYL-TRNA SYNTHETASE,PYLRS MGHHHHHHMDKKPLNTLISATGLWMSRTGTIHKIKHHEVSRSKIYIEMACGDHLVVNNSRSSRTARALRHHKYRKTCKRCRVSDEDLNKFLTKANEDQTSVKVKVVSAP 109 T 0.017 Zn_ribbon_recom pdbpercent F Archaea T 5uf5 1 A,B A,B Q5ZWW6_LEGPH effector protein SidK SNAEQYHSQVVGKIGYIARCMQTIDPENNLKKIREDYQDVLIWAEKNYRFEEILEASKSGKCPNDLDALSRRSLILQELLRLVSSISPFKMKLDLIESQYEKMKQHVNLWKSDYHVKLNQLNQLTDYLKNAAPTPKNNFLRAMTSVLQMQIAQYGITEDNEGINQLFKLGLHLLAMANEKIDEQYHLFKGYVKDQPEESPFEGILPAEDQKILVKTMIDYAMPKLSSKVLQDKLSALSSSDVLTKTLLDSIDRIVKENEKLNALSK 266 T 1 DUF3276 unppssm F Bacteria T 5ufk 1 A A Q5ZWW6_LEGPH effector protein SidK SNAEQYHSQVVGKIGYIARCMQTIDPENNLKKIREDYQDVLIWAEKNYRFEEILEASKSGKCPNDLDALSRRSLILQELLRLVSSISPFKMKLDLIESQYEKMKQHVNLWKSDYHVKLNQLNQLTDYLKNAAPTPKNNFLRAMTSVLQMQIAQYGITEDNEGINQLFKLGLHLLAMANEKIDEQYHLFKGYVKDQPEESPFEGILPAEDQKILVKTMIDYAMPKLSSKVLQDKLSALSSSDVLTKTLLDSIDRIVKENEKLNALSK 266 T 1 DUF3276 unppssm F Bacteria T 5ufs 2 C,D C,D NR0B2_HUMAN SHP NR Box 1 Peptide APAILYALLSS 11 T 8.5 NR_Repeat pdbhh F Eukaryota T 5ugk 1 A,B,C,D,E,F,G,H,I,J,K,L A,C,E,G,I,K,O,Q,S,U,W,Y ILE-HIS-VAL-HIS-LEU-GLN-ILE IHVHLQI 7 T 1.3 DUF4922 pdbhh F F 5uhr 1 A,B,C,D A,B,C,D ORN-CIR-LEU-ALA-ASN-PHE-LEU-VAL-ORN-ILE-LYS-HAO-LYS-A8E XXLANFLVXIKXKX 14 T 9.3 PsaX pdbhh F T 5ui6 1 A A Acinetodin GGKGPIFETWVTEGNYYG 18 T 2.2 YqeC pdbhh F T 5ui7 1 A A A0A1S4NYG0_KLEPN Klebsidin GSDGPIIEFFNPNGVMHYG 19 T 0.83 tRNA-synt_1c pdbhh F Bacteria T 5uie 2 G G ESCRT-III COMPLEX SUBUNIT VPS2, VACUOLAR PROTEIN-SORTING-ASSOCIATED PROTEIN 2, VACUOLAR PROTEIN-TARGETING PROTEIN 14, VPS2P XXXXXXXXX 9 F F F 5ujr 1 A A Q46313_CARML Bacteriocin WGWKEVVQNGQTIFSAGQKLGNMVGKIVPLPFG 33 T 0.048 Bacteriocin_IIc unp F Bacteria T 5ujt 3 C,F,I C,F,I insulin mimotope GVEELYLVAGEEGCGG 16 T 1.8 NTPase_1 pdbhh F T 5ukh 1 A A T1ZG69_STRIT Uncharacterized protein MGSSHHHHHHSQDPSDLSWSKRLSAYAALKDLTLSKQDKVFLEHLMTEYGFDSTTARQILKLKQGLERKFSSIFDDYTQEERDYLLFRIIGSVSYNGVKWDETAGYLSRYFYKEVVSNPVTGEKQKVPKSLLDIFQELGLSKAEAKQLQYNLSLQHEMAGGTLSTTGDMVKQDPDYYETAKNSYKLVYGTTEGFDKFWDERLKAYSNDGRGNADFTHQSITMATHLNPTSVQLSDIYGGRKHVKNLAGWEGDTTYNANERKPSIGEDDYKADLDSVNIIGRMKKGQSYQSAMSSYYSDVQKGHSVREKEFLKNKDWEKVKKTIYDSLVPNGINKNADSVVKDYIAKNYPDVSKFLSRLESVAGGQ 365 T 0.021 Seryl_tRNA_N unppercent F Bacteria T 5ulo 2 C,D C,D TBCD7_HUMAN TBC1 domain family member 7 XESGKLPRSPSFPX 14 F F Eukaryota T 5uml 2 B,D,F,H C,D,F,H PEPTIDE INHIBITOR M3 LTFLEYWAQLMQ 12 T 2.6 ParD_like pdbhh F T 5umm 2 B,D B,D PEPTIDE INHIBITOR M3 LTFLEYWAQLMQ 12 T 2.6 ParD_like pdbhh F T 5una 2 G,H,I,J,K M,N,O,Q,R unidentified peptide section/fragment XXXXXX 6 F F F 5unj 2 B C PRGC1_HUMAN Peroxisome proliferator-activated gamma coactivator 1-alpha EEPSLLKKLLLAPA 14 T 7 Cnl2_NKP2 pdbhh F Eukaryota T 5uoi 1 A A HHH_rd1_0142 RKWEEIAERLREEFNINPEEAREAVEKAGGNEEEARRIVKKRL 43 T 0.00066 DUF3606 pdbhh F T 5uow 4 E,F F,G GluN2B-specific Fab, termed 11D1 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 216 F F F 5up1 1 A A EEHEE_rd3_1049 MGSSHHHHHHSSGLVPRGSHMTTVKLGDIKVTFDNPEKAKKYAQKLAKIYQLTVHVHGDTIHVK 64 T 0.46 DUF2188 pdbhh F T 5up2 4 E,F F,G GluN2B-specific Fab, termed 11D1 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 216 F F F 5up5 1 A A EHEE_rd1_0284 TQTQEFDNEEEARKAEKELRKENRRVTVTQENGRWRVTWD 40 T 0.0015 SPOR pdb F T 5uqd 1 A A DPY21_CAEEL DumPY: shorter than wild-type MKSSWSHPQFEKGAMTGWSHPQFEKENLYFQSNATMRITNRNLKMLTRQFDLPKMSSRFRKFVRIRRHPNGMATIISCDYNQIKQHLGPNEMKHFERQFVRLGFAENNGVPLFAIGVMENAAEALHDQFEWLAKNSPNTQVKVGSLTNKQFIETMPMKKYYESAMETLDMGTFRFGPLMSLSMVGTKNEEAGGNFKEMLDALNAAPFLGPIMPWGDFSEVQGIKEDTSDDGPIFWVRPGEQMVPTDGKNRSTEPRHPLATRGNDRRETAFNDRTNAHADQVRESTEDDPTTTTTTTTTTSSSSSSSKSKKSAKSDPTFVKSTAAVGVLQGIRNPDANDDDEYYEDERKAVKEVIVFDAHDLHKVAHHLAMDLYEPPVSQCHRWVDDAILNTMRREGIRYAKLELHENDMYFLPRNVIHQFRTVSACSSVAWHVRLRHYYDVD 442 T 0.017 Cupin_8 unphh F Eukaryota T 5urn 2 B B TF65_HUMAN NUCLEAR FACTOR NF-KAPPA-B P65 SUBUNIT,NUCLEAR FACTOR OF KAPPA LIGHT POLYPEPTIDE GENE ENHANCER IN B-CELLS 3 GSPGYPNGLLSGDEDFSSIADMDFSALLSQISS 33 T 17 Orthopox_F14 pdbhh F Eukaryota T 5utf 1 A G Q2N0S6_9HIV1 Envelope glycoprotein gp120 AENLWVTVYYGVPVWKDAETTLFCASDAKAYETEKHNVWATHACVPTDPNPQEIHLENVTEEFNMWKNNMVEQMHTDIISLWDQSLKPCVKLTPLCVTLQCTNVTNNITDDMRGEMKNCSFNMTTELRDKKQKVYSLFWRLDVVQINENQGNRSNNSNKEYRLINCNTSACTQACPKVSFEPIPIHYCAPAGFAILKCKDKKFNGTGPCPSVSTVQCTHGIKPVVSTQLLLNGSLAEEEVMIRSENITNNAKNILVQFNTPVQINCTRPMNMTRKSIRIGPGQAFYALGDIIGDIRQAHCNVSKATWNETLGKVVKQLRKHFGNNTIIRFANSSGGDLEVTTHSFNCGGEFFYCNTSGLFNSTWISNTSVQGSNSTGSNDSITLPCRMKQIINMWQRIGQCMYAPPIQGVIRCVSNITGLILTRDGGSTNSTTETFRPGGGDMRDNWRSELYKYKVVKIEPLGVAPTRCKRRVVGRRRRRR 481 T 3.5E-54 GP120 pdbpercent T Viruses T 5uty 4 D G Q2N0S6_9HIV1 ENVELOPE GLYCOPROTEIN GP160 MPMGSLQPLATLYLLGMLVASVLAAENLWVTVYYGVPVWKDAETTLFCASDAKAYETEKHNVWATHACVPTDPNPQEIHLENVTEEFNMWKNNMVEQMHTDIISLWDQSLKPCVKLTPLCVTLQCTNVTNNITDDMRGEMKNCSFNMTTELRDKKQKVYSLFYRLDVVQINENQGNRSNNSNKEYRLINCNTSACTQACPKVSFEPIPIHYCAPAGFAILKCKDKKFNGTGPCPSVSTVQCTHGIKPVVSTQLLLNGSLAEEEVMIRSENITNNAKNILVQFNTPVQINCTRPMNMTRKSIRIGPGQAFYALGDIIGDIRQAHCNVSKATWNETLGKVVKQLRKHFGNNTIIRFANSSGGDLEVTTHSFNCGGEFFYCNTSGLFNSTWISNTSVQGSNSTGSNDSITLPCRIKQIINMWQRIGQCMYAPPIQGVIRCVSNITGLILTRDGGSTNSTTETFRPGGGDMRDNWRSELYKYKVVKIEPLGVAPTRCKRRVVGRRRRRR 505 T 3.8E-54 GP120 pdbpercent T Viruses T 5uup 2 B B Bfl-1-specific selected peptide XGVREIAYGLRRAADDVNAQVERX 24 T 0.0029 PUMA pdbhh F T 5uw3 2 B,D,F,H E,F,G,H F6LNL5_9CARY Presegetalin A1 GVPVWAFQAKDVENASAPV 19 T 10 Choline_sulf_C unphh F Eukaryota T 5uw5 2 B,D,F,H E,F,G,H F6LNL5_9CARY Presegetalin A1 GVPVWAFQAKDVENASAPV 19 T 10 Choline_sulf_C unphh F Eukaryota T 5uw6 2 B,D,F,H E,F,G,H F6LNL5_9CARY Presegetalin A1 NASAPV 6 T 15 PH_17 unphh F Eukaryota F 5uw7 2 B,D C,D F6LNL5_9CARY Presegetalin A1 GVPVWAFQAKDVENASAPV 19 T 10 Choline_sulf_C unphh F Eukaryota T 5uwh 4 D D PAXI_HUMAN Paxillin GGSYRELDELMASLSDFKFMAQ 22 T 0.86 KNOX2 pdbhh F Eukaryota T 5uwi 4 D D HDAC5_HUMAN Histone deacetylase 5 GGSYEAETVSAMALLSVG 18 T 11 GPHR_N pdbhh F Eukaryota T 5uwp 4 D D DIAP3_HUMAN Protein diaphanous homolog 3 GGSYSVPEVEALLARLRAL 19 T 0.98 DUF1128 pdbhh F Eukaryota T 5uws 4 D D APBA3_HUMAN Amyloid beta A4 precursor protein-binding family A member 3 GGSYSSLQELVQQFEALPGDLV 22 T 0.9 NikR_C pdbhh F Eukaryota T 5uww 4 D D DEAF1_HUMAN Deformed epidermal autoregulatory factor 1 homolog GGSSWLYLEEMVNSLLNTAQQ 21 T 0.16 Latarcin unppssm F Eukaryota T 5uy9 2 B B Brd4 peptide QASTPRX 7 T 110 Matrix pdbhh F T 5uyo 1 A A HEEH_rd4_0097 MGSSHHHHHHSSGLVPRGSHMDVEEQIRRLEEVLKKNQPVTWNGTTYTDPNEIKKVIEELRKSM 64 T 0.13 DUF6466 pdb F T 5uz9 4 I,J I,J L7P7M1_9CAUD ACRF1 KFIKYLSTAHLNYMNIAVYENGSKIKARVENVVNGKSVGARDFDSTEQLESWFYGLPGSGLGRIENAMNEISRRENP 77 T 0.16 DUF4982 pdb T Viruses T 5uz9 5 K K ACR30_BPD31 GENE PRODUCT 30, GP30, ACRF2 MHHHHHHIAQQHKDTVAACEAAEAIAIAKDQVWDGEGYTKYTFDDNSVLIQSGTTQYAMDADDADSIKGYADWLDDEARSAEASEIERLLESVEEE 96 T 0.13 Transglycosylas unp T Viruses T 5uzl 1 A A K9LL63_BRANA O-acyltransferase NVDVRYTYRPSVPAHRRVRESPLSSDAIFKQSH 33 T 140 GUCT pdbhh F Eukaryota T 5uzu 1 A B Q2G0X2_STAA8 Uncharacterised protein GSTKVYSQNGLVLHDDANFLEHELSYIDVLLDKNADQATKDNLRSYFADKGLHSIKDIINKAKQDGFDVSK 71 T 0.0051 CompInhib_SCIN unphh F Bacteria T 5uzw 2 B,D,F,H E,F,G,H F6LNL5_9CARY Presegetalin A1 NASAPV 6 T 15 PH_17 unphh F Eukaryota F 5uzz 2 B B 14-mer Peptide RVRTRGKRRIRRXP 14 T 13 Defensin_int pdbhh F T 5v0y 1 A A A0A3F2YLM5_AREMA arenicin-3 GFCWYVCVYRNGVRVCYRRCN 21 T 1.4 PilI pdbhh F Eukaryota T 5v11 1 A A AA139 GFCWYVCARRNGARVCYRRCN 21 T 3.4 Toxin_25 pdbhh F T 5v1a 1 A B ULP2_YEAST Ubiquitin-like-specific protease 2 AEFTSPYFGRPSLKTRAKQFEGVSSP 26 T 12 LtuB pdbhh F Eukaryota T 5v1d 2 B,E,G E,F,G 12-mer peptide ADPQPWRFYAPR 12 T 0.33 TM1586_NiRdase pdbhh F T 5v1e 1 A A Guavanin 2 RQYMRQIEQALRYGYRISRRX 21 T 1.4 Tyrosinase pdbhh F T 5v1t 2 B B SuiA 22mer MSKELEKVLESSAMAKGDGWHV 22 T 4.1 DUF1952 pdbhh F T 5v1u 2 E,F,G,H E,F,G,H D1CIY7_THET1 TbiA(beta) Thr(-5)Glu Leader MTKTYTAPTLVEYGGLERLT 20 T 2.1E-05 DUF5972 pdbhh F Bacteria T 5v1v 2 C,D C,D D1CIZ1_THET1 TbiA(alpha) Leader Peptide MKEYRSPELKEYGRVEDRTAG 21 T 0.029 DUF5972 unphh F Bacteria T 5v1y 3 E,F E,F PSMD1_HUMAN 26S PROTEASOME REGULATORY SUBUNIT RPN2, 26S PROTEASOME REGULATORY SUBUNIT S1, 26S PROTEASOME SUBUNIT P112, RPN2 GPQEPEPPEPFEYIDD 16 T 29 AgrD pdbhh F Eukaryota T 5v1z 3 E,F F,E PSMD1_HUMAN 26S PROTEASOME REGULATORY SUBUNIT RPN2, 26S PROTEASOME REGULATORY SUBUNIT S1, 26S PROTEASOME SUBUNIT P112, RPN2 GPKIEEEEQEPEPPEPFEYIDD 22 T 51 Bacillus_PapR pdbhh F Eukaryota T 5v2g 1 A,B,C A,B,C 20-mer Peptide KNPEAEEITRCKKLLDDSSS 20 T 0.12 DUF3151 pdb F T 5v2p 2 B B CAC1C_HUMAN CALCIUM CHANNEL, L TYPE,ALPHA-1 POLYPEPTIDE, ISOFORM 1, CARDIAC MUSCLE, VOLTAGE-GATED CALCIUM CHANNEL SUBUNIT ALPHA CAV1.2 CSPLECDLKGYLDWITQAE 19 T 5.4 MIIP pdbhh F Eukaryota T 5v2q 2 B B CAC1C_HUMAN CALCIUM CHANNEL, L TYPE,ALPHA-1 POLYPEPTIDE, ISOFORM 1, CARDIAC MUSCLE, VOLTAGE-GATED CALCIUM CHANNEL SUBUNIT ALPHA CAV1.2 ASPLEEDLCGYLCWITQAE 19 T 4.5 3-PAP pdbhh F Eukaryota T 5v3n 2 B B H0GHZ9_SACCK;TOF2_YEAST Ulp2p,Topoisomerase 1-associated factor 2 chimera SNAPYFGRPSLKTRAKQFEGVSSKDIGENCRRIEAFSD 38 T 1.6 DUF1499 pdbhh F Eukaryota T 5v3r 2 B B CHM4C_HUMAN CHROMATIN-MODIFYING PROTEIN 4C, CHMP4C, SNF7 HOMOLOG ASSOCIATED WITH ALIX 3, SNF7-3, HSNF7-3, VACUOLAR PROTEIN SORTING-ASSOCIATED PROTEIN 32-3, HVPS32-3 QRAEEEDDDIKQLAAWAT 18 T 1.1 Ribosomal_60s unppssm F Eukaryota T 5v4b 3 C C DISC1_HUMAN DISC1 peptide PEVPPTPPGSHSAFT 15 T 3.2 GSAP-16 pdbhh F Eukaryota T 5v4c 1 A A Q8I5P1_PLAF7 Peptide 38136 NVHTFRGINGHNSSSSL 17 T 0.0082 SseC unppercent F Eukaryota T 5v4r 1 A,B A,B LARP1_HUMAN LA RIBONUCLEOPROTEIN DOMAIN FAMILY MEMBER 1 GHSGGGGGGHMQHPSHELLKENGFTQHVYHKYRRRCLNERKRLGIGQSQEMNTLFRFWSFFLRDHFNKKMYEEFKQLALEDAKEGYRYGLECLFRYYSYGLEKKFRLDIFKDFQEETVKDYEAGQLYGLEKFWAFLKYSKAKNLDIDPKLQEYLGKFRRLED 162 T 0.62 Frankia_peptide pdbpercent F Eukaryota T 5v5l 3 E,F E,F POL_HV1B1 TW10 TSTLQEQIGW 10 T 3.2 Red1 pdbhh T Viruses T 5v5m 3 E,F E,F POL_HV1B1 TW10 TSTLQEQIGW 10 T 3.2 Red1 pdbhh T Viruses T 5v62 2 B I KS6A1_HUMAN S6K-ALPHA-1,90 KDA RIBOSOMAL PROTEIN S6 KINASE 1,P90S6K,MAP KINASE-ACTIVATED PROTEIN KINASE 1A,MAPKAPK-1A,RIBOSOMAL S6 KINASE 1,RSK-1 XGTAQLKPIESSILAQRRVRK 21 T 15 Microvir_lysis pdbhh F Eukaryota T 5v63 1 A A ORN-LYS-LEU-VAL-PHI-PHE-ALA-GLU-ORN-ALA-ILE-ILE-SAR-LEU-MET-VAL XKLVXFAEXAIIXLMVV 17 T 0.0033 Beta-APP pdbhh F T 5v64 1 A A ORN-GLN-LYS-LEU-VAL-PHI-PHE-ALA-ORN-ALA-ILE-ILE-SAR-LEU-MET-VAL XQKLVXFAXAIIXLMV 16 T 0.02 Beta-APP pdbhh F T 5v65 1 A,B,C,D,E,F,G,H,I,J,K,L,M,N,O,P A,B,C,D,E,F,G,H,I,J,K,L,M,N,O,P ORN-LEU-VAL-PHI-PHE-ALA-GLU-ASP-ORN-ALA-ILE-ILE-SAR-LEU-MET-VAL XLVXFAEDXAIIXLMV 16 T 0.00035 Beta-APP pdbhh F T 5v6e 1 A,C,E,G,I A,C,E,G,I GIPC1_MOUSE GAIP C-TERMINUS-INTERACTING PROTEIN,RGS-GAIP-INTERACTING PROTEIN,RGS19-INTERACTING PROTEIN 1,SEMAF CYTOPLASMIC DOMAIN-ASSOCIATED PROTEIN 1,SEMCAP-1,SYNECTIN GPHMSAFEEKAIEKVDDLLESYMGIRDTELAATMVELGKDKRNPDELAEALDERLGDFAFPDEFVFDVWGAIGDAKVGRY 80 T 0.00069 PWI pdbhh F Eukaryota T 5v6e 2 B,D,F,H,J B,D,F,H,J MYO6_MOUSE UNCONVENTIONAL MYOSIN-6 GPGSHDLSKWKYAELRDTINTSCDIELLAACREEFHRRLKVYHAWKSKN 49 T 0.12 Pox_MCEL pdbpssm F Eukaryota T 5v6h 1 A,C,E,G,I A,C,E,G,I GIPC2_MOUSE SEMAF CYTOPLASMIC DOMAIN-ASSOCIATED PROTEIN 2,SEMCAP-2 GPHMSEAKAKAIGKVDDLLELYMGIRDIDLATTMFEAGKDKSNPDEFAVALDETLGDFAFPDEFLFDVWGAISDMKQGR 79 T 0.0007 PWI pdbhh F Eukaryota T 5v6h 2 B,D,F,H,J B,D,F,H,J MYO6_MOUSE UNCONVENTIONAL MYOSIN-6 GPGSHDLSKWKYAELRDTINTSCDIELLAACREEFHRRLKVYHAWKSKN 49 T 0.12 Pox_MCEL pdbpssm F Eukaryota T 5v6i 1 A,B A,B G3BK00_COPCM Y3 PROTEIN QDPLSCYDNFGNRDVAACARFIDDFCDTLTPNIYRPRDNGQRCYVVNGHKCDFTVFNTNNGGSPIRASTPNCKTVLRAAANRCPTGGRGKINPSAPFLFAIDPNDGDCSTDF 112 T 0.00034 Fungal_lectin_2 unphh F Eukaryota T 5v6j 1 A,B,C,D A,B,C,D G3BK00_COPCM Y3 PROTEIN QDPLSCYDNFGNRDVAACARFIDDFCDTLTPNIYRPRDNGQRCYVVNGHKCDFTVFNTNNGGSPIRASTPNCKTVLRAAANRCPTGGRGKINPSAPFLFAIDPNDGDCSTDF 112 T 0.00034 Fungal_lectin_2 unphh F Eukaryota T 5v6x 1 A,B A,B PYLS_METMA PYRROLYSINE--TRNA(PYL) LIGASE,PYRROLYSYL-TRNA SYNTHETASE,PYLRS MGHHHHHHMNNKPLNTLISATGLWMSRTGTIHKIKHHEVSRSKIYIEMACGDHLVVNNSRSSRPARALRYHKYRKTCKRCRVSDEDLNKFLTKANEDQTSVKVKVVSAP 109 T 0.041 Zn_ribbon_recom pdbpercent F Archaea T 5v77 1 A,B A,B Q5F7U7_NEIG1 Uncharacterized protein MAHHHHHHMKKNIFHNVSLYEIIFSDNGNTLTLSFTDTIEGNYFGYIKCSNILNFKLDTNNFVDYEDKEDSLFPLFIPEIELYKYQFYSEIIIDVGIIIKISAETINFEPLGK 113 T 0.28 DUF6329 unppercent F Bacteria T 5v7c 1 A B LARP1_HUMAN LA RIBONUCLEOPROTEIN DOMAIN FAMILY MEMBER 1 GHSGGGGGGHMQHPSHELLKENGFTQHVYHKYRRRCLNERKRLGIGQSQEMNTLFRFWSFFLRDHFNKKMYEEFKQLALEDAKEGYRYGLECLFRYYSYGLEKKFRLDIFKDFQEETVKDYEAGQLYGLEKFWAFLKYSKAKNLDIDPKLQEYLGKFRRLED 162 T 0.62 Frankia_peptide pdbpercent F Eukaryota T 5v7j 1 A G Q2N0S6_9HIV1 Envelope glycoprotein gp160 ENLWVTVYYGVPVWKDAETTLFCASDAKAYETEKHNVWATHACVPTDPNPQEIHLENVTEEFNMWKNNMVEQMHTDIISLWDQSLKPCVKLTPLCVTLQCTNVTNNITDDMRGELKNCSFNMTTELRDKKQKVYSLFYRLDVVQINENQGNRSNNSNKEYRLINCNTAAITQACPKVSFEPIPIHYCAPAGFAILKCKDKKFNGTGPCPSVSTVQCTHGIKPVVSTQLLLNGSLAEEEVMIRSENIANNAKNILVQFNTPVQINCTRPNNNTRKSIRIGPGQAFYATGDIIGDIRQAHCNVSKATWNETLGKVVKQLRKHFGNNTIIRFANSAGGDLEVTTHSFNCGGEFFYCNTSGLFNSTWISNTSVQGSNSTGSNDSITLPCRIKQIINMWQRIGQAMYAPPIQGVIRCVSNITGLILTRDGGSTNSATETFRPGGGDMRDNWRSELYKYKVVKIEPLGVAPTRCKRRVVGRRRRRR 480 T 3.4E-54 GP120 pdbpercent T Viruses T 5v87 1 A,B A,B LARP1_HUMAN LA RIBONUCLEOPROTEIN DOMAIN FAMILY MEMBER 1 GHSGGGGGGHMQHPSHELLKENGFTQHVYHKYRRRCLNERKRLGIGQSQEMNTLFRFWSFFLRDHFNKKMYEEFKQLALEDAKEGYRYGLECLFRYYSYGLEKKFRLDIFKDFQEETVKDYEAGQLYGLEKFWAFLKYSKAKNLDIDPKLQEYLGKFRRLED 162 T 0.62 Frankia_peptide pdbpercent F Eukaryota T 5v8k 2 B B B0TAT4_HELMI proteinsubunit pshX YSPTFNVAHILAFFFLFLHIPFYFV 25 T 5 DUF4834 unphh F Bacteria T 5v8w 1 A,C,E,G A,C,E,G INT9_HUMAN INT9,PROTEIN RELATED TO CPSF SUBUNITS OF 74 KDA,RC-74 MKPLLSGSIPVEQFVQTLEKHGFSDIKVEDTAKGHIVLLQEAETLIQIEEDSTHIICDNDEMLRVRLRDLVLKFLQKF 78 T 0.086 FAM167 pdb F Eukaryota T 5v8w 2 B,D,F,H B,D,F,H INT11_HUMAN INT11,CLEAVAGE AND POLYADENYLATION-SPECIFIC FACTOR 3-LIKE PROTEIN,CPSF3-LIKE PROTEIN,PROTEIN RELATED TO CPSF SUBUNITS OF 68 KDA,RC-68 GSHMRLVSSEQALKELGLAEHQLRFTCRVHLHDTRKEQETALRVYSHLKSVLKDHCVQHLPDGSVTVESVLLQAAAPSEDPGTKVLLVSWTYQDEELGSFLTSLLKKGLPQAPS 114 T 0.021 CPSF73-100_C unppssm F Eukaryota T 5v93 33 GA b Capreomycin XXXXXA 6 T 1900 SEC-C pdbhh F F 5v9p 2 B B HISTONE DEMETHYLASE JARID1A, JUMONJI/ARID DOMAIN-CONTAINING PROTEIN 1A, RETINOBLASTOMA-BINDING PROTEIN 2, RBBP-2, KDM5A XXXXXXXXXX 10 F F F 5v9t 2 C G HISTONE DEMETHYLASE JARID1A, JUMONJI/ARID DOMAIN-CONTAINING PROTEIN 1A, RETINOBLASTOMA-BINDING PROTEIN 2, RBBP-2, KDM5A XXXXXXXXXX 10 F F F 5va9 2 C,D C,D Peptide Inhibitor piHA-L5(d10Y) XYGHSHIRFGYSYHVSYCGX 20 T 4.5 ZinT pdbhh F T 5vaq 3 C C FGF21_HUMAN FGF-21 PDVGSSDPLSMVGGSQGRSPSYES 24 T 23 DUF1335 pdbhh F Eukaryota T 5vav 1 A A cyc-MC12 GRCTQAWPPICFPD 14 T 0.38 Bowman-Birk_leg pdbhh F T 5vb9 2 C,D C,D Peptide inhibitor CWVLEYDMFGALHCR 15 T 1.8 Cytochrom_B559a pdbhh F T 5vbl 1 A A agonist peptide KFRRQRPXXEHKKXXPX 17 T 0.076 Apelin pdb F T 5vbn 2 B,D B,F DPOE1_HUMAN DNA POLYMERASE II SUBUNIT A AQFRDPCRSYVLPEVICRSCNFCRDLDLCKDSSFSEDGAVLPQWLCSNCQAPYDSSAIEMTLVEVLQKKLMAFTLQDLVCLKCRGVKETSMPVYCSCAGDFALTIHTQVFMEQIGIFRNIAQHYGMSYLLETLEWLLQKNPQLGH 145 T 0.0011 zinc_ribbon_15 pdbhh F Eukaryota T 5vcl 3 C P HA1L_MOUSE H-2 CLASS I HISTOCOMPATIBILITY ANTIGEN, L-D ALPHA CHAIN AMAPRTLLL 9 T 0.014 UL40 pdbhh F Eukaryota T 5vey 3 C C RN169_HUMAN RING FINGER PROTEIN 169,RING-TYPE E3 UBIQUITIN TRANSFERASE RNF169 GHMDPVLREMEQKLQQEEEDRQLALQLQRMFDNERRTVSRRKGSVDQYLLRSSNMAGAK 59 T 0.011 DUF4788 pdbpssm F Eukaryota T 5vf1 1 A,B,C,D,E,F A,B,C,D,E,F ORN-LYS-LEU-VAL-PHI-PHE-ALA-GLU-ORN-GLU-ALA-PHE-MEA-VAL-LEU-LYS XKLVXFAEXEAFXVLK 16 T 4.3 DUF4065 pdbhh F T 5vf3 5 Z z HOC XXXXXXXXXXXXXXX 15 F F F 5vfw 1 A A ANXA1_HUMAN ANNEXIN I,ANNEXIN-1,CALPACTIN II,CALPACTIN-2,CHROMOBINDIN-9,LIPOCORTIN I,PHOSPHOLIPASE A2 INHIBITORY PROTEIN,P35 AMVSEFLKQAWFIENEEQEYVQTVK 25 T 16 DUF807 pdbhh F Eukaryota T 5vgb 2 B B A0A2D0TCG3_NEIME Anti-CRISPR protein (AcrIIC1) MANKTYKIGKNAGYDGCGLCLAAISENEAIKVKYLRDICPDYDGDDKAEDWLRWGTDSRVKAAALEMEQYAYTSVGMASCWEFVEL 86 T 6.9 WIYLD pdbhh F Bacteria T 5vgd 3 C C SER-ALA-GLU-PRO-VAL-PRO-LEU-GLN-LEU SAEPVPLQL 9 T 21 REV pdbhh F T 5vi6 2 B B (3S,6S,9S,15AR)-6,9-DIBENZYL-3-{6,6-DIHYDROXY-6-[(2S)-OXIRAN-2-YL]HEXYL}OCTAHYDRO-2H-PYRIDO[1,2-A][1,4,7,10]TETRAAZACYCLODODECINE-1,4,7,10(3H,12H)-TETRONE FFXX 4 T 410 GM130_C pdbhh F F 5vid 2 F,G,H,I F,G,H,I Bot.0671.2 MGSSHHHHHHSSGLVPRGSHMQPMFAELKAKFFLEIGDRDAARNALRKAGYSDEEAERIIRKYELE 66 T 0.099 RuvA_C pdbhh F T 5vie 2 B,D B,D CSK21_HUMAN CKII YPGGSTPVSSANMM 14 T 13 Pr_beta_C pdbhh F Eukaryota T 5vif 2 B B CSK21_HUMAN CKII YPGGSTPVSSANMM 14 T 13 Pr_beta_C pdbhh F Eukaryota T 5vjh 2 G P FITC casein XXXXXXXXXXXXXXXXXXXXXXXXXX 26 F F F 5vji 1 A,B,D,E A,B,D,E CLOCK_MOUSE MCLOCK GAMDPEFSAQLGAMQHLKDQLEQRTRMIEANIHRQQEELRKIQEQLQMVHG 51 T 0.011 DUF641 pdbpercent F Eukaryota T 5vjj 1 A,B A,B B2ZCS6_MELLI Avirulence protein AvrP123 SNAQSNPNQELGVVQCLCRRIAPLTQPPFGVRCRATLNCPCDYIGDCPGPAEQYMYRCPNCGPRSHVACSGVHQGTCQQVHPGKDSVEYGG 91 T 0.032 LSR unppssm F Eukaryota T 5vjs 1 A A Reaction Center Maquette GSPELRQEHQQLAQEFQQLLQEIQQLGRELLKGELQGIKQLREASEKARNPEKKSVLQKILEDEEKHIELLETLQQTGQEAQQLLQELQQTGQELWQLGGSGGPELRQKHQQLAQKIQQLLQKHQQLGAKILEDEEKHIELLETILGGSGGDELRELLKGELQGIKQYRELQQLGQKAQQLVQKLQQTGQKLWQLG 196 T 0.0003 Rubrerythrin pdbpercent F T 5vjt 1 A A Reaction Center Maquette GSPELRQEHQQLAQEFQQLLQEIQQLGRELLKGELQGIKQLREASEKARNPEKKSVLQKILEDEEKHIELLETLQQTGQEAQQLLQELQQTGQELWQLGGSGGPELRQKHQQLAQKIQQLLQKHQQLGAKILEDEEKHIELLETILGGSGGDELRELLKGELQGIKQYRELQQLGQKAQQLVQKLQQTGQKLWQLG 196 T 0.0003 Rubrerythrin pdbpercent F T 5vjx 2 AA,B,C,CA,DA,E,F,H,I,K,L,N,O,Q,R,T,U,W,X,Z c,B,C,e,f,E,F,H,I,K,L,N,O,S,T,V,W,Y,Z,b CLOCK_MOUSE MCLOCK GAMDPEFSAQLGAMQHLKDQLEQRTRMIEANIHRQQEELRKIQEQLQMVHG 51 T 0.011 DUF641 pdbpercent F Eukaryota T 5vk0 2 B,D,F,H,J,L,N,P,R,T,V,X B,D,F,H,J,L,N,P,R,T,V,X Lysine-cysteine side chain dithiocarbamate stapled peptide inhibitor PMI XTSFAEYWXLLSCX 14 T 0.42 P53_TAD pdbhh F T 5vk1 2 B,D,F,H,J,L,N,P B,D,F,H,J,L,N,P Lysine-cysteine side chain dithiocarbamate stapled peptide inhibitor PMI XTSFXEYWCLLSPX 14 T 0.21 PDDEXK_7 pdbhh F T 5vkl 2 B B RPB1_YEAST RNA POLYMERASE II SUBUNIT B1,DNA-DIRECTED RNA POLYMERASE III LARGEST SUBUNIT,RNA POLYMERASE II SUBUNIT B220 ESGLVNADLDVKDELMFSPLVDS 23 T 0.16 Hemolysin_N pdbhh F Eukaryota T 5vko 2 B B RPB1_YEAST RNA POLYMERASE II SUBUNIT B1,DNA-DIRECTED RNA POLYMERASE III LARGEST SUBUNIT,RNA POLYMERASE II SUBUNIT B220 CGGVTPYSNESGLVNADLDVKDELMFSPLVDSGS 34 T 0.36 Hemolysin_N pdbhh F Eukaryota T 5vl6 1 A A Q8I5P1_PLAF7 Peptide 38138 NVHTFRGDNVHNSSSSL 17 T 0.0082 SseC unppercent F Eukaryota T 5vla 2 B Z THR-VAL-PHE-THR-SER-TRP-GLU-GLU-TYR-LEU-ASP-TRP-VAL-MET-PRO-TRP-ASN-LEU-VAL-ARG-ILE-GLY-LEU-LEU TVFTSWEEYLDWVGSGDLMPWNLVRIGLLR 30 T 0.9 SseB pdbhh F T 5vlh 2 B Y CYS-ARG-LEU-PRO-TRP-ASN-LEU-GLN-ARG-ILE-GLY-LEU-PRO-CYS CRLPWNLQRIGLPC 14 T 0.19 CIS_TMP pdbhh F T 5vlh 3 C Z ACE-THR-VAL-PHE-THR-SER-TRP-GLU-GLU-TYR-LEU-ASP-TRP-VAL-NH2 XTVFTSWEEYLDWVX 15 T 0.21 DUF5575 pdbhh F T 5vli 3 C C Computationally designed peptide HB1.6928.2.3 CIEQSFTTLFACQTAAEIWRAFGYTVKIMVDNGNCRLHVC 40 T 4.4 DUF4468 pdbhh F T 5vlk 2 B Y ACE-THR-VAL-PHE-THR-SER-TRP-GLU-GLU-TYR-LEU-ASP-TRP-VAL-NH2 peptide XTVFTSWEEYLDWVX 15 T 0.21 DUF5575 pdbhh F T 5vlk 3 C Z ACE-TRP-ASN-LEU-VAL-HRG-ILE-GLY-LEU-LEU peptide XWNLVXIGLLR 11 T 3.3 Abi_alpha pdbhh F T 5vll 2 B Y CYS-PHE-ILE-PRO-TRP-ASN-LEU-GLN-ARG-ILE-GLY-LEU-LEU-CYS CFIPWNLQRIGLLC 14 T 0.41 DUF2982 pdbhh F T 5vll 3 C Z ACE-THR-VAL-PHE-THR-SER-TRP-GLU-GLU-TYR-LEU-ASP-TRP-VAL-NH2 XTVFTSWEEYLDWVX 15 T 0.21 DUF5575 pdbhh F T 5vlp 4 D Z LDLR antagonist peptide XMESFPGWNLVXIGLLR 17 T 1 BOFC_N pdbhh F T 5vne 4 D D EMP24 SLV 3 T 640 zf-C2H2_6 pdbhh F F 5vnf 4 D D C-terminal VV Sorting motif: VAL-THR-SER-VAL-VAL VTSVV 5 T 390 IlvGEDA_leader pdbhh F F 5vng 4 D D C-terminal ILE-ILE EVTSII 6 T 10 UPF0561 pdbhh F F 5vnh 4 D D C-terminal SV motif EVTSSV 6 T 240 DUF4172 pdbhh F F 5vni 4 D D C-terminal FA VTSFA 5 T 150 Disulph_isomer pdbhh F F 5vnj 4 D D C-terminal FF Ergic-53 VTSFF 5 T 77 AIB pdbhh F F 5vnk 4 D D C-terminal LL VTSLL 5 T 300 PduV-EutP pdbhh F F 5vob 3 C C UL128_HCMVA Envelope glycoprotein UL128 MSPKDLTPFLTTLWLLLGHSRVPRVRAEECCEFINVNHPPERCYDFKMCNRFTVALRCPDGEVCYSPEKTAEIRGIVTTMTHSLTRQVVHNKLTSCNYNPLYLEADGRIRCGKVNDKAQYLLGAAGSVPYRWINLEYDKITRIVGLDQYLESVKKHKRLDVCRAKMGYMLQ 171 T 32 Auto_anti-p27 pdbhh T Viruses T 5vob 5 E E U131A_HCMVM Envelope glycoprotein UL131A MRLCRVWLSVCLCAVVLGQCQRETAEKNDYYRVPHYWDACSRALPDQTRYKYVEQLVDLTLNYHYDASHGLDNFDVLKRINVTEVSLLISDFRRQNRRGGTNKRTTFNAAGSLAPHARSLEFSVRLFAN 129 T 0.94 DDDD pdbpercent T Viruses T 5voc 3 C C UL128_HCMVA Envelope glycoprotein UL128 MSPKDLTPFLTTLWLLLGHSRVPRVRAEECCEFINVNHPPERCYDFKMCNRFTVALRCPDGEVCYSPEKTAEIRGIVTTMTHSLTRQVVHNKLTSCNYNPLYLEADGRIRCGKVNDKAQYLLGAAGSVPYRWINLEYDKITRIVGLDQYLESVKKHKRLDVCRAKMGYMLQ 171 T 32 Auto_anti-p27 pdbhh T Viruses T 5voc 5 E E U131A_HCMVM Envelope glycoprotein UL131A MRLCRVWLSVCLCAVVLGQCQRETAEKNDYYRVPHYWDACSRALPDQTRYKYVEQLVDLTLNYHYDASHGLDNFDVLKRINVTEVSLLISDFRRQNRRGGTNKRTTFNAAGSLAPHARSLEFSVRLFAN 129 T 0.94 DDDD pdbpercent T Viruses T 5vod 3 C C UL128_HCMVA Envelope glycoprotein UL128 MSPKDLTPFLTTLWLLLGHSRVPRVRAEECCEFINVNHPPERCYDFKMCNRFTVALRCPDGEVCYSPEKTAEIRGIVTTMTHSLTRQVVHNKLTSCNYNPLYLEADGRIRCGKVNDKAQYLLGAAGSVPYRWINLEYDKITRIVGLDQYLESVKKHKRLDVCRAKMGYMLQ 171 T 32 Auto_anti-p27 pdbhh T Viruses T 5vod 5 E E U131A_HCMVM Envelope glycoprotein UL131A MRLCRVWLSVCLCAVVLGQCQRETAEKNDYYRVPHYWDACSRALPDQTRYKYVEQLVDLTLNYHYDASHGLDNFDVLKRINVTEVSLLISDFRRQNRRGGTNKRTTFNAAGSLAPHARSLEFSVRLFAN 129 T 0.94 DDDD pdbpercent T Viruses T 5vox 15 DA d V-type proton ATPase subunit f XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 54 F F F 5vox 16 EA,FA,GA e,f,g effector protein SidK MSFIKVGIKMGGLTSEQYHSQVVGKIGYIARCMQTIDPENNLKKIREDYQDVLIWAEKNYRFEEILEASKSGKCPNDLDALSRRSLILLELLRLVSSISPFKMKLDLIESQYEKMKQHVNLWKSDYHVKLNQLNQLTDYLKNAAPTPKNNFLRAMTSVLQMQIAQYGITEDNEGINQLFKLGLHLLAMANEKIDEQYHLFKGYVKDQPEESPFEGILPAEDQKILVKTMIDYAMPKLSSKVLQDKLSALSSSDVLTKTLLDSIDRIVKENEKLNALSKVKLGKFGLDIREIEVIYSQALKISPQDALQYTAQQCDAQLLSMAFPDSQNYIIESISNKKVKTIAELIHSKEFIYQIIKTEVFKQVDPNEKIRLQAATELYQLLGRIMDKQINLFTKMNLEQINEYIQTKTKAILDKIPERVELLTFMGFEIPTFKGIETLMTDISHSQDNETLAIAQEFYTNIKNAKNQLLGDKLIEDITPQDVEKFFNQCSQYGSEAAEKLADNRPVLTKIADILTAIARWAISLIGFNTPPQFLAPTRTCVDQVSDEITKIKLKLEDTLGSLQKVQEESLSL 573 T 0.15 Chalcone pdb F T 5voy 15 DA d V-type proton ATPase subunit f XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 54 F F F 5voy 16 EA,FA,GA e,f,g effector protein SidK MSFIKVGIKMGGLTSEQYHSQVVGKIGYIARCMQTIDPENNLKKIREDYQDVLIWAEKNYRFEEILEASKSGKCPNDLDALSRRSLILLELLRLVSSISPFKMKLDLIESQYEKMKQHVNLWKSDYHVKLNQLNQLTDYLKNAAPTPKNNFLRAMTSVLQMQIAQYGITEDNEGINQLFKLGLHLLAMANEKIDEQYHLFKGYVKDQPEESPFEGILPAEDQKILVKTMIDYAMPKLSSKVLQDKLSALSSSDVLTKTLLDSIDRIVKENEKLNALSKVKLGKFGLDIREIEVIYSQALKISPQDALQYTAQQCDAQLLSMAFPDSQNYIIESISNKKVKTIAELIHSKEFIYQIIKTEVFKQVDPNEKIRLQAATELYQLLGRIMDKQINLFTKMNLEQINEYIQTKTKAILDKIPERVELLTFMGFEIPTFKGIETLMTDISHSQDNETLAIAQEFYTNIKNAKNQLLGDKLIEDITPQDVEKFFNQCSQYGSEAAEKLADNRPVLTKIADILTAIARWAISLIGFNTPPQFLAPTRTCVDQVSDEITKIKLKLEDTLGSLQKVQEESLSL 573 T 0.15 Chalcone pdb F T 5voz 15 DA d V-type proton ATPase subunit f XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 54 F F F 5voz 16 EA,FA,GA e,f,g G8UUS6_LEGPN Uncharacterized protein MSFIKVGIKMGGLTSEQYHSQVVGKIGYIARCMQTIDPENNLKKIREDYQDVLIWAEKNYRFEEILEASKSGKCPNDLDALSRRSLILLELLRLVSSISPFKMKLDLIESQYEKMKQHVNLWKSDYHVKLNQLNQLTDYLKNAAPTPKNNFLRAMTSVLQMQIAQYGITEDNEGINQLFKLGLHLLAMANEKIDEQYHLFKGYVKDQPEESPFEGILPAEDQKILVKTMIDYAMPKLSSKVLQDKLSALSSSDVLTKTLLDSIDRIVKENEKLNALSKVKLGKFGLDIREIEVIYSQALKISPQDALQYTAQQCDAQLLSMAFPDSQNYIIESISNKKVKTIAELIHSKEFIYQIIKTEVFKQVDPNEKIRLQAATELYQLLGRIMDKQINLFTKMNLEQINEYIQTKTKAILDKIPERVELLTFMGFEIPTFKGIETLMTDISHSQDNETLAIAQEFYTNIKNAKNQLLGDKLIEDITPQDVEKFFNQCSQYGSEAAEKLADNRPVLTKIADILTAIARWAISLIGFNTPPQFLAPTRTCVDQVSDEITKIKLKLEDTLGSLQKVQEESLSL 573 T 0.15 Chalcone pdb F T 5vqi 2 B,C A,C NIT2_NEUCR Nuclear localization sequence of NIT2 transcription factor (NIT2-NLS) TISSKRQRRHSKS 13 T 12 DUF4543 pdbhh F Eukaryota T 5vr1 1 A A Turripeptide DCCPCPAGAVRCRFACCX 18 T 1.3 MSC pdbhh F T 5vs7 2 B P H4K5ac peptide GRGXGGK 7 T 17 Bravo_FIGEY pdbhh F F 5vsg 1 A A Super Helical Repeat Peptide SHR-FF SXFSXFX 7 T 6.8 CTV_P6 pdbhh F F 5vt9 2 B,D C,D MYOA_TOXGO MYOA,TGM-A GASKKTPFIIRAQAHIRRHLVDNNVSPATVQPAFAAA 37 T 0.00015 IQ unppssm F Eukaryota T 5vtb 2 B B BC11A_HUMAN BCL-11A, B-CELL CLL/LYMPHOMA 11A, COUP-TF-INTERACTING PROTEIN 1, ECOTROPIC VIRAL INTEGRATION SITE 9 PROTEIN HOMOLOG, EVI-9, ZINC FINGER PROTEIN 856 SRRKQGKPQHLSKRE 15 T 460 VMAP-M12 pdbhh F Eukaryota T 5vte 1 A A de novo peptide 1 XELEAIAQKFEAIAKKFEAIAXKFEAIAQKX 31 T 4.2 DUF2967 pdbhh F T 5vte 2 B B de novo peptide 2 XELKAIAQEFKAIAKEFKAIAXEFKAIAQKX 31 T 2.6 DUF5741 pdbhh F T 5vud 3 C C Nonamer peptide: LEU-SER-SER-PRO-VAL-THR-LYS-SER-TRP LSSPVTKSW 9 T 24 HTH_WhiA pdbhh F T 5vue 3 C C Nonamer peptide: LEU-THR-VAL-GLN-VAL-ALA-ARG-VAL-TRP LTVQVARVW 9 T 2.2 TraV pdbhh F T 5vuf 3 C C Nonamer peptide: LEU-THR-VAL-GLN-VAL-ALA-ARG-VAL-TYR LTVQVARVY 9 T 8.7 APOBEC_C pdbhh F T 5vvp 3 C C LEU-SER-SER-PRO-VAL-THR-LYS-SER-TRP LSSPVTKSW 9 T 24 HTH_WhiA pdbhh F T 5vvt 2 B,D B,D ELK1_HUMAN ELK1 peptide FWSTLSPI 8 T 0.43 DUF5848 pdbhh F Eukaryota T 5vvu 2 B,D B,D TAB1_HUMAN TAB1 peptide VPYSSAQ 7 T 18 UPF0160 pdbhh F Eukaryota T 5vvx 2 C,D B,D LMNB1_HUMAN Lamin B1 KLSPSPSSRVTVS 13 T 0.41 CCDC85 unppercent F Eukaryota T 5vw1 3 C B A0A0E0UT28_LISMM anti-CRISPR protein AcrIIA4 GSMNINDLIREIKNKDYTVKLSGTDSNSITQLIIRVNNDGNEYVISESENESIVEKFISAFKNGWNQEYEDEEEFYNDMQTITLKSELN 89 T 0.041 DUF4930 pdb F Bacteria T 5vwd 3 C C Noamer peptide: LEU-THR-VAL-GLN-VAL-ALA-ARG-VAL-TRP LTVQVARVW 9 T 2.2 TraV pdbhh F T 5vwf 3 C C Nonamer peptide: LEU-THR-VAL-GLN-VAL-ALA-ARG-VAL-TYR LTVQVARVY 9 T 8.7 APOBEC_C pdbhh F T 5vwh 3 C C Nonamer peptide: LEU-SER-SER-PRO-VAL-THR-LYS-SER-TRP LSSPVTKSW 9 T 24 HTH_WhiA pdbhh F T 5vwi 2 C,D C,D beta-PIX PAWDETNL 8 T 1.1 IPP-2 pdbhh F T 5vwj 3 C C Nonamer peptide: LEU-THR-VAL-GLN-VAL-ALA-ARG-VAL-TRP LTVQVARVW 9 T 2.2 TraV pdbhh F T 5vwk 2 E,F,G,H E,F,G,H Beta-PIX PAWDETNL 8 T 1.1 IPP-2 pdbhh F T 5vwl 1 A A Q6TAN6_9HIV1 Cytoplasmic tail of HIV-1 gp41 protein SLALIWDDLRSLCLFSYHRLRDLLLIVTRIVELLGRRGWEALKYWWNLLQYWSQELKNSAVNLLNATAIAVAEGTDRVIEVLQAAYRAIRHIPRRIRQGLERILL 105 T 4.9 DUF6307 pdbhh T Viruses T 5vxv 1 A A PEX15_YEAST PEROXIN-15,PEROXISOME BIOSYNTHESIS PROTEIN PAS21 MSEVFQECVNLFIKRDIKDCLEKMSEVGFIDITVFKSNPMILDLFVSACDIMPSFTKLGLTLQSEILNIFTLDTPQCIETRKIILGDLSKLLVINKFFRCCIKVIQFNLTDHTEQEEKTLELESIMSDFIFVYITKMRTTIDVVGLQELIEIFIFQVKVKLHHKKPSPNMYWALCKTLPKLSPTLKGLYLSKDVSIEDAILNSIDNKIQKDKLEVLFQ 218 T 0.008 FGF-BP1 unppssm F Eukaryota T 5vy9 2 G P Alpha-S1-casein XXXXXXXXXXXXXXXXXXXXXXXXXXXX 28 F F F 5vya 2 G P Alpha-S1-casein XXXXXXXXXXXXXXXXXXXXXXXXXXXX 28 F F F 5vz2 2 AA,BA,O,P,Q,R,S,T,U,V,W,X,Y,Z b,c,H,J,O,P,Q,R,U,V,X,Y,Z,a Acyldepsipeptide XFSPXAX 7 T 130 GreA_GreB pdbhh F F 5vzl 3 C C A0A2D0TCG7_9VIRU phage anti-CRISPR AcrIIA4 MNINDLIREIKNKDYTVKLSGTDSNSITQLIIRVNNDGNEYVISESENESIVEKFISAFKNGWNQEYEDEEEFYNDMQTITLKSELN 87 T 0.033 DUF4930 pdb T Viruses T 5vzu 3 E,F E,F CCND1_HUMAN Cyclin D1 EEVDLACTPTDVRDVDI 17 T 8.2 RE_HaeIII pdbhh F Eukaryota T 5w0j 1 A,B A,B peptide 1 XELAQAFKEIAKAFKEIAKAFEXIAQAIEKX 31 T 4.3 DUF1241 pdbhh F T 5w0k 5 E,J X,Y GP42_EBVB9 GP42 KPNVEVWPVDPPPPVNFNKTAEQEYGDKEVKLPHW 35 T 5.4 MarB unphh T Viruses T 5w18 2 AA,BA,O,P,Q,R,S,T,U,V,W,X,Y,Z b,c,H,J,O,P,Q,R,U,V,X,Y,Z,a 9V7-PHE-SER-PRO-YCP-ALA-MP8 XFSPXAX 7 T 130 GreA_GreB pdbhh F F 5w1h 1 A A A0A2D0TCG9_9FIRM LbaCas13a (C2c2) SNAMKISKVREENRGAKLTVNAKTAVVSENRSQEGILYNDPSRYGKSRKNDEDRDRYIESRLKSSGKLYRIFNEDKNKRETDELQWFLSEIVKKINRRNGLVLSDMLSVDDRAFEKAFEKYAELSYTNRRNKVSGSPAFETCGVDAATAERLKGIISETNFINRIKNNIDNKVSEDIIDRIIAKYLKKSLCRERVKRGLKKLLMNAFDLPYSDPDIDVQRDFIDYVLEDFYHVRAKSQVSRSIKNMNMPVQPEGDGKFAITVSKGGTESGNKRSAEKEAFKKFLSDYASLDERVRDDMLRRMRRLVVLYFYGSDDSKLSDVNEKFDVWEDHAARRVDNREFIKLPLENKLANGKTDKDAERIRKNTVKELYRNQNIGCYRQAVKAVEEDNNGRYFDDKMLNMFFIHRIEYGVEKIYANLKQVTEFKARTGYLSEKIWKDLINYISIKYIAMGKAVYNYAMDELNASDKKEIELGKISEEYLSGISSFDYELIKAEEMLQRETAVYVAFAARHLSSQTVELDSENSDFLLLKPKGTMDKNDKNKLASNNILNFLKDKETLRDTILQYFGGHSLWTDFPFDKYLAGGKDDVDFLTDLKDVIYSMRNDSFHYATENHNNGKWNKELISAMFEHETERMTVVMKDKFYSNNLPMFYKNDDLKKLLIDLYKDNVERASQVPSFNKVFVRKNFPALVRDKDNLGIELDLKADADKGENELKFYNALYYMFKEIYYNAFLNDKNVRERFITKATKVADNYDRNKERNLKDRIKSAGSDEKKKLREQLQNYIAENDFGQRIKNIVQVNPDYTLAQICQLIMTEYNQQNNGCMQKKSAARKDINKDSYQHYKMLLLVNLRKAFLEFIKENYAFVLKPYKHDLCDKADFVPDFAKYVKPYAGLISRVAGSSELQKWYIVSRFLSPAQANHMLGFLHSYKQYVWDIYRRASETGTEINHSIAEDKIAGVDITDVDAVIDLSVKLCGTISSEISDYFKDDEVYAEYISSYLDFEYDGGNYKDSLNRFCNSDAVNDQKVALYYDGEHPKLNRNIILSKLYGERRFLEKITDRVSRSDIVEYYKLKKETSQYQTKGIFDSEDEQKNIKKFQEMKNIVEFRDLMDYSEIADELQGQLINWIYLRERDLMNFQLGYHYACLNNDSNKQATYVTLDYQGKKNRKINGAILYQICAMYINGLPLYYVDKDSSEWTVSDGKESTGAKIGEFYRYAKSFENTSDCYASGLEIFENISEHDNITELRNYIEHFRYYSSFDRSFLGIYSEVFDRFFTYDLKYRKNVPTILYNILLQHFVNVRFEFVSGKKMIGIDKKDRKIAKEKECARITIREKNGVYSEQFTYKLKNGTVYVDARDKRYLQSIIRLLFYPEKVNMDEMIEVKEKKKPSDNNTGKGYSKRDRQQDRKEYDKYKEKKKKEGNFLSGMGGNINWDEINAQLKN 1440 T 1.2 SesA unppercent F Bacteria T 5w1i 2 B,D A,C A0A2D0TCG9_9FIRM LbaCas13a (C2c2) SNAMKISKVREENRGAKLTVNAKTAVVSENRSQEGILYNDPSRYGKSRKNDEDRDRYIESRLKSSGKLYRIFNEDKNKRETDELQWFLSEIVKKINRRNGLVLSDMLSVDDRAFEKAFEKYAELSYTNRRNKVSGSPAFETCGVDAATAERLKGIISETNFINRIKNNIDNKVSEDIIDRIIAKYLKKSLCRERVKRGLKKLLMNAFDLPYSDPDIDVQRDFIDYVLEDFYHVRAKSQVSRSIKNMNMPVQPEGDGKFAITVSKGGTESGNKRSAEKEAFKKFLSDYASLDERVRDDMLRRMRRLVVLYFYGSDDSKLSDVNEKFDVWEDHAARRVDNREFIKLPLENKLANGKTDKDAERIRKNTVKELYRNQNIGCYRQAVKAVEEDNNGRYFDDKMLNMFFIHRIEYGVEKIYANLKQVTEFKARTGYLSEKIWKDLINYISIKYIAMGKAVYNYAMDELNASDKKEIELGKISEEYLSGISSFDYELIKAEEMLQRETAVYVAFAARHLSSQTVELDSENSDFLLLKPKGTMDKNDKNKLASNNILNFLKDKETLRDTILQYFGGHSLWTDFPFDKYLAGGKDDVDFLTDLKDVIYSMRNDSFHYATENHNNGKWNKELISAMFEHETERMTVVMKDKFYSNNLPMFYKNDDLKKLLIDLYKDNVERASQVPSFNKVFVRKNFPALVRDKDNLGIELDLKADADKGENELKFYNALYYMFKEIYYNAFLNDKNVRERFITKATKVADNYDRNKERNLKDRIKSAGSDEKKKLREQLQNYIAENDFGQRIKNIVQVNPDYTLAQICQLIMTEYNQQNNGCMQKKSAARKDINKDSYQHYKMLLLVNLRKAFLEFIKENYAFVLKPYKHDLCDKADFVPDFAKYVKPYAGLISRVAGSSELQKWYIVSRFLSPAQANHMLGFLHSYKQYVWDIYRRASETGTEINHSIAEDKIAGVDITDVDAVIDLSVKLCGTISSEISDYFKDDEVYAEYISSYLDFEYDGGNYKDSLNRFCNSDAVNDQKVALYYDGEHPKLNRNIILSKLYGERRFLEKITDRVSRSDIVEYYKLKKETSQYQTKGIFDSEDEQKNIKKFQEMKNIVEFRDLMDYSEIADELQGQLINWIYLRERDLMNFQLGYHYACLNNDSNKQATYVTLDYQGKKNRKINGAILYQICAMYINGLPLYYVDKDSSEWTVSDGKESTGAKIGEFYRYAKSFENTSDCYASGLEIFENISEHDNITELRNYIEHFRYYSSFDRSFLGIYSEVFDRFFTYDLKYRKNVPTILYNILLQHFVNVRFEFVSGKKMIGIDKKDRKIAKEKECARITIREKNGVYSEQFTYKLKNGTVYVDARDKRYLQSIIRLLFYPEKVNMDEMIEVKEKKKPSDNNTGKGYSKRDRQQDRKEYDKYKEKKKKEGNFLSGMGGNINWDEINAQLKN 1440 T 1.2 SesA unppercent F Bacteria T 5w2j 2 C F unidentified peptide AKGALQELGAGLTA 14 T 11 zf-C2HCIx2C pdbhh F T 5w3n 1 A,B,C,D,E,F,G,H,I A,B,C,D,E,F,G,H,I FUS_HUMAN 75 KDA DNA-PAIRING PROTEIN,ONCOGENE FUS,ONCOGENE TLS,POMP75,TRANSLOCATED IN LIPOSARCOMA PROTEIN MSYYHHHHHHDYDIPTTENLYFQGAMDPASNDYTQQATQSYGAYPTQPGQGYSQQSSQPYGQQSYSGYSQSTDTSGYGQSSYSSYGQSQNTGYGTQSTPQGYGSTGGYGSSQSSQSSYGQQSSYPGYGQQPAPSSTSGSYGSSSQSSSYGQPQSGSYSQQPSYGGQQQSYGQQQSYNPPQGYGQQNQYNSSSGGGGGGGGGGNYGQDQSSMSSGGGSGGGYGNQDQSGGGGSGGYGQQDRG 241 T 720 SelK_SelG pdbhh F Eukaryota T 5w4a 1 A,B,C,D A,B,C,D P-granule scaffold MDTNKREIVEFLGIRTYFFPNLALYAVNNDELLVSDPNKANSFAAYVFGASDKKPSVDDIVQILFPSGSDSGTILTSMDTLLALGPDFLTEFKKRNQDLARFNLTHDLSILAQGDEDAAKKKLNLMGRKAKLQKTEAAKILAILIKTINSEENYEKFTELSELCGLDLDFDAYVFTKILGLEDEDTADEVEVIRDNFLNRLDQTKPKLADIIRNGP 216 T 0.0014 SidE_PDE pdbpssm F T 5w4e 2 B,C A,D TDT_HUMAN human DNA repair polymerase Tdt SHLSPRKKRPRQTGAL 16 T 21 Doppel pdbhh F Eukaryota T 5w4f 2 B,C A,D DPOLM_HUMAN POL MU,TERMINAL TRANSFERASE LPKRRRARVGSPSGDAASSTPPSTRFPGV 29 T 0.0018 BRCT unppercent F Eukaryota T 5w4g 2 B A DPOLL_HUMAN POL LAMBDA,DNA POLYMERASE BETA-2,POL BETA2,DNA POLYMERASE KAPPA RGILKAFPKRQKIHADASSKVLAKIPRRE 29 T 13 Luteo_coat pdbhh F Eukaryota T 5w4h 1 A,B,C A,B,C A-beta 17_36 peptide: ORN-LYS-LEU-VAL-MEA-PHE-ALA-GLU-ORN-ALA-ILE-ILE-GLY-LEU-MET-VAL XKLVXFAEXAIIGLMV 16 T 0.0038 Beta-APP pdbhh F T 5w4i 1 A,B,C A,B,C A-beta 17_36: ORN-LYS-LEU-VAL-MEA-PHE-ALA-GLU-ORN-ALA-ILE-ILE-GLY-LEU-MET-VAL XKLVXFAEXAIIGLMV 16 T 0.0038 Beta-APP pdbhh F T 5w4j 1 A,B,C,D,E,F A,B,C,D,E,F A-beta 17_36 peptide: ORN-LYS-VAL-PHE-MEA-ALA-ALA-ASP-ORN-ALA-ILE-ILE-GLY-LEU-MET-VAL XKVFXAADXAIIGLMV 16 T 0.031 Beta-APP pdbhh F T 5w4k 58 ID,JD A,B Klebsazolicin XSPGNXASXSNSASANXX 18 T 1.1 Cytochrom_C pdbhh F T 5w54 1 A A A0A2D0TCH0_MANSE Stress Response Peptide-2 FGVKDGKCPSGRVRRLGICVPDDDY 25 T 0.78 NRF pdbhh F Eukaryota T 5w5s 3 C D CYCLIC PEPTIDE CP141019 (P5) XXXLEYXEWLSX 12 T 3.6 TerB_N pdbhh F T 5w5u 3 C D CYCLIC PEPTIDE CP141037 (P4) XXXXEYFEWLSX 12 T 3.6 TerB_N pdbhh F T 5w67 3 C C VAL-ARG-SER-ARG-ARG-ABA-LEU-ARG-LEU VRSRRXLRL 9 T 2 DUF1331 pdbhh F F 5w6i 3 C D CYCLIC PEPTIDE CP141046 (P3) XXXLEYFEWLSX 12 T 3.6 TerB_N pdbhh F T 5w6r 3 I,J,K,L M,O,N,Q CYCLIC PEPTIDE CP141099 (P6) XXXLEYFEWLSX 12 T 3.6 TerB_N pdbhh F T 5w6t 3 C F CYCLIC PEPTIDE CP151070 (P7) XXXXEYXEWLSX 12 T 3.6 TerB_N pdbhh F T 5w6u 3 C D CYCLIC PEPTIDE CP121068 (P2) XRXLEYFEWLSX 12 T 3.9 TerB_N pdbhh F T 5w6y 1 A,B A,B A9S498_PHYPA Chorismate mutase MACALSVSGILCASQAATSFSSAKPTKSQPHPVQLKAFVPISQPAALKSASLVVSPSRTSHASVEAETEPFTLANIRESLIRQEDTIIYALLQRAQFSFNAPTYDENSFSIPGFKGSLVEFMLKETETLHAKVRRYQAPDEHPFFPEDLSQPILPSLPKSRVLHPAAEKININKSIWSMYLQDLLPKLTVPDDDGNYGSASVCDVLCLQALSKRIHYGKFVAEAKFIEDPARFEGHIKAQDGDAILRELTFKNVEDNVKRRVANKARAYGQEVNEHGKVDNARYKIDPDLAGALYEDWVMPLTKQVQVAYLLRRLD 316 T 0.072 DUF5788 pdbpercent F Eukaryota T 5w7x 2 E,F,G,H H,E,F,G XRCC1_HUMAN X-RAY REPAIR CROSS-COMPLEMENTING PROTEIN 1 PYAGSTDEN 9 T 19 M3 pdbhh F Eukaryota T 5w7y 2 B,D D,C XRCC1_HUMAN X-RAY REPAIR CROSS-COMPLEMENTING PROTEIN 1 PYAGETDE 8 T 7.5 GH43_C pdbhh F Eukaryota T 5w82 1 A,B,C,D,E C,B,D,E,A E9KNV6_9VIRU Protein delta HMMPSEDYAIWYARATIAALQAAEYRLAMPSASYTAWFTDAVSDKLDKISESLNTLVECVIDKRLAVSVPEPLPVRVENKVQVEVEDEVRVRVENKVDVEVKN 103 T 0.12 DNMT1-RFD pdb T Viruses T 5w93 2 D,E,F D,E,F PAXI_MOUSE Paxillin MDDLDALLADLESTTSHISK 20 T 0.036 DUF883 pdb F Eukaryota T 5w94 2 B,D B,D SCC2_YEAST Sister chromatid cohesion protein 2 SNAMSYPGKDKNIPGRIIEALEDLPLSYLVPKDGLAALVNAPMRVSLPFDKTIFTSADDGRDVNINVLGTANSTTSSIKNEAEKERLVFKRPSNFTSSANSVDYVPTNFLEGLSPLAQSVLSTHKGLNDSINIEKKSEIVSRPEAKHKLESVTSNAGNLSFNDNSSNKKTKTSTGVTMTQANLA 184 T 0.4 HTH_25 pdbpercent F Eukaryota T 5w94 3 E,F E,H CENPP_YEAST Ctf19n MDFTSD 6 T 3.4 DUF6324 pdbhh F Eukaryota F 5w96 1 A,B A,B FZ7 LPSDDLEFWCHVMY 14 T 0.45 v110 pdbhh F T 5w9f 1 A A De novo mini protein gHEEE_02 SQETRKKCTEMKKKFKNCEVRCDESNHCVEVRCSDTKYTLC 41 T 14 DUF5651 pdbhh F T 5wa1 2 B B CHM4C_HUMAN CHROMATIN-MODIFYING PROTEIN 4C, CHMP4C, SNF7 HOMOLOG ASSOCIATED WITH ALIX 3, SNF7-3, HSNF7-3, VACUOLAR PROTEIN SORTING-ASSOCIATED PROTEIN 32-3, HVPS32-3 QRAEEEDDDIKQLAAWTT 18 T 1.1 Ribosomal_60s unppssm F Eukaryota T 5wa4 2 G,H,I,J,K,L M,N,O,P,Q,R D6Y501_THEBD TbtA 16-mer peptide MDLNDLPMDVFELADS 16 T 6.7 NPH3 pdbhh F Bacteria T 5wah 1 A A BAG_STRAG BETA ANTIGEN,B ANTIGEN GVEKTAGETSATDTGKREKQLQQWKNNLKNDVDNTILSHEQKNEFKTKIDETNDSDALLELENQFNETNRLLHIKQHEEVEKDKKAKQQKTLKQSDTKV 99 T 2.1 RtcB pdb F Bacteria T 5wai 2 B,F B,F SUZ12_HUMAN CHROMATIN PRECIPITATED E2F TARGET 9 PROTEIN,CHET 9 PROTEIN,JOINED TO JAZF1 PROTEIN,SUPPRESSOR OF ZESTE 12 PROTEIN HOMOLOG MEHVQADHELFLQAFEKPTQIYRFLRTRNLIAPIFLHRTLTYMSHRNSRTNIKRKTFKVDDMLSKVEKMKGEQESHSLSAHLQLTFTGFFHKNDKPSPNSENEQNSVTLEVLLVKVCHKKRKDVSCPIRQVPTGKKQVPLNPDLNQTKPGNFPSLAVSSNEFEPSNSHMVKSYSLLFRVTRPGRREFNGMINGETNENIDVNEELPARRKRNREDGEKTFVAQMTVFDKNRRLQLLDGEYEVAMQEMEECPISKKRATWETILDGKRLPPFETFSQGPTLQFTLRWTGETNDKSTAPIAKPLATRNSESLHQENKPGSVKPTQTIAVKESLTTDLQTRKEKDTPNENRQKLRIFYQFLYNNNTRQQTEARDDLHCPWCTLNCRKLYSLLKHLKLCHSRFIFNYVYHPKGARIDVSINECYDGSYAGNPQDIHRQPGFAFSRNGPVKRTPITHILVCRPKRTKASMSEFLEWSHPQFEK 478 T 0.17 zf_C2H2_6 unphh F Eukaryota T 5wai 3 C,G C,G AEBP2_HUMAN ADIPOCYTE ENHANCER-BINDING PROTEIN 2,AE-BINDING PROTEIN 2 SNARHRAICFNLSAHIESLGKGHSVVFHSTVIAKRKEDSGKIKLLLHWMPEDILPDVWVNESERHQLKTKVVHLSKLPKDTALLLDPNIYRTMPQKRLKR 100 T 0.011 Mtf2_C pdbhh F Eukaryota T 5wai 4 D,H D,H JARD2_HUMAN Jumonji, AT-rich interactive domain 2 LSKRKPKTEDFLTFLCLRG 19 T 0.86 GMAP pdbhh F Eukaryota T 5wak 2 B B SUZ12_HUMAN CHROMATIN PRECIPITATED E2F TARGET 9 PROTEIN,CHET 9 PROTEIN,JOINED TO JAZF1 PROTEIN,SUPPRESSOR OF ZESTE 12 PROTEIN HOMOLOG MEHVQADHELFLQAFEKPTQIYRFLRTRNLIAPIFLHRTLTYMSHRNSRTNIKRKTFKVDDMLSKVEKMKGEQESHSLSAHLQLTFTGFFHKNDKPSPNSENEQNSVTLEVLLVKVCHKKRKDVSCPIRQVPTGKKQVPLNPDLNQTKPGNFPSLAVSSNEFEPSNSHMVKSYSLLFRVTRPGRREFNGMINGETNENIDVNEELPARRKRNREDGEKTFVAQMTVFDKNRRLQLLDGEYEVAMQEMEECPISKKRATWETILDGKRLPPFETFSQGPTLQFTLRWTGETNDKSTAPIAKPLATRNSESLHQENKPGSVKPTQTIAVKESLTTDLQTRKEKDTPNENRQKLRIFYQFLYNNNTRQQTEARDDLHCPWCTLNCRKLYSLLKHLKLCHSRFIFNYVYHPKGARIDVSINECYDGSYAGNPQDIHRQPGFAFSRNGPVKRTPITHILVCRPKRTKASMSEFLEWSHPQFEK 478 T 0.17 zf_C2H2_6 unphh F Eukaryota T 5wb5 2 B B E9AFM3_LEIMA Uncharacterized protein GSPSVRTMYTREELLRIATLASAMDLGPEVLRKFDVIEVAEPVPTPKRRDAES 53 T 0.43 Spore_YtrH pdbhh F Eukaryota T 5wbh 2 F W KS6B1_HUMAN S6K1,70 KDA RIBOSOMAL PROTEIN S6 KINASE 1,P70-S6K 1,RIBOSOMAL PROTEIN S6 KINASE I,SERINE/THREONINE-PROTEIN KINASE 14A,P70 RIBOSOMAL S6 KINASE ALPHA,P70 S6KA TYVAPSVLESVKEKFSFEPKIRSPRR 26 T 19 PLN_propep pdbhh F Eukaryota T 5wbk 2 B T KS6B1_HUMAN S6K1,70 KDA RIBOSOMAL PROTEIN S6 KINASE 1,P70-S6K 1,RIBOSOMAL PROTEIN S6 KINASE I,SERINE/THREONINE-PROTEIN KINASE 14A,P70 RIBOSOMAL S6 KINASE ALPHA,P70 S6KA MAGVFDIDLDQPED 14 T 0.86 DUF1805 pdbhh F Eukaryota T 5wco 1 A,B,C A,B,C Q910W0_9ORTO NON-STRUCTURAL PROTEIN 2,ORF1 MGSSHHHHHHSSGLVPRGSHMNESQWIQKHLPCMREANPKPRELIRHALKKKKRPEVVYAMGVLLTLGGESGLTVEFPVPEGKTVKVKTLNQLVNGMISRATMTLYCVMKDPPSGSMATLMRDHIRNWLKEESGCQDADGGEEKWAMVYGMISPDMAEEKTMLKELKTMLHSRMQMYALGASSKALENLEKAIVAAVHRLPASCSTEKMVLLGYLK 216 T 0.081 CbiD unphh T Viruses T 5wcv 1 A A A0A2H4A2Y1_ANESU ShK homolog AsK132958 CENTISGCSRADCLLTHRKQGCQKTCGLC 29 T 0.0051 ShK pdb F Eukaryota T 5wd8 1 A,B A,B LPG2328 SNAPVTELTRLKEYMEDQIAKAKESSSLTAQLKFLENAHTEHFVKMGSLTTIYKGGSEVVDRLKIEIRSLYEEMLELKDKCRDQIQQYETS 91 T 0.21 Siah-Interact_N pdbpercent F T 5wd9 1 A A LPG2328 SNAPVTELTRLKEYMEDQIAKAKESSSLTAQLKFLENAHTEHFVKMGSLTTIYKGGSEVVDRLKIEIRSLYEEMLELKDKCRDQIQQYETS 91 T 0.21 Siah-Interact_N pdbpercent F T 5wdu 1 A,C,E G,F,Q Q2N0S6_9HIV1 ENV POLYPROTEIN ENLWVTVYYGVPVWKDAETTLFCASDAKAYETEKHNVWATCACVPTDPNPQEIHLENVTEEFNMWKNNMVEQMHTDIISLWDQSLKPCVKLTPLCVTLQCTNVTNNITDDMRGELKNCSFNMTTELRDKKQKVYSLFYRLDVVQINENQGNRSNNSNKEYRLINCNTSAITQACPKVSFEPIPIHYCAPAGFAILKCKDKKFNGTGPCPSVSTVQCTHGIKPVVSTQLLLNGSLAEEEVMIRSENITNNAKNILVQFNTPVQINCTRPNNNTRKSIRIGPGQAFYATGDIIGDIRQAHCNVSKATWNETLGKVVKQLRKHFGNNTIIRFANSSGGDLEVTTHSFNCGGEFFYCNTSGLFNSTWISNTSVQGSNSTGSNDSITLPCRIKQIINMWQRIGQAMYAPPIQGVIRCVSNITGLILTRDGGANNTSTETFRPGGGDMRDNWRSELYKYKVVKIEPLGVAPTRCKRR 471 T 3.5E-54 GP120 pdbpercent T Viruses T 5we0 1 A,D,G,J A,D,G,J POZ1_SCHPO POT1-ASSOCIATED PROTEIN POZ1 SNEKIRSQSVLNTLETFFIKENHYDMQREESSIVNACLRYLGYSKSMCHEKMPIFMDIAFIEYCFNLSLDPSSFQNLPITQTQPDSQQILWEYSLISNALERLENIELERQNCMREDGLSKYTNSLLLNKETLNNEALKLYSCAKAGICRWMAFHFLEQEPIDHINFTKFLQDWGSHNEKEMEALQRLSKHKIRKRLIYVSQHKKKMPWSKFNSVLSRYIQCTKLQLEVFCDYDFKQREIVKMLTSNIN 249 T 0.0063 Dcc1 pdbpercent F Eukaryota T 5we0 2 B,E,H,K B,E,H,K TPZ1_SCHPO MEIOTICALLY UP-REGULATED GENE 169 PROTEIN SEACEMCRLGLPHGSFFELLRDWKKIEEFRNKS 33 T 0.75 RPAP3_C pdbhh F Eukaryota T 5we1 1 A,C A,C POZ1_SCHPO POT1-ASSOCIATED PROTEIN POZ1,POT1-ASSOCIATED PROTEIN POZ1 SESSIVNACLRYLGYSKSMCHEKMPIFMDIAFIEYCFNLSLDPSSFQGGASQQILWEYSLISNALERLENIELERQNCMREDGLSKYTNSLLLNKETLNNEALKLYSCAKAGICRWMAFHFLEQEPIDHINFTKFLQDWGSHNEKEMEALQRLSKHKIRKRLIYVSQHKKKMPWSKFNSVLSRYIQCTKLQLEVFCDYDFKQREIVKMLTSNIN 214 T 0.005 Dcc1 pdbpercent F Eukaryota T 5we1 2 B,D B,D TPZ1_SCHPO MEIOTICALLY UP-REGULATED GENE 169 PROTEIN SEACEMCRLGLPHGSFFELLRDWKKIEEFRNKS 33 T 0.75 RPAP3_C pdbhh F Eukaryota T 5we2 1 A,C A,C POZ1_SCHPO POT1-ASSOCIATED PROTEIN POZ1 SNEKIRSQSVLNTLETFFIKENHYDMQREESSIVNACLRYLGYSKSMCHEKMPIFMDIAFIEYCFNLSLDPSSFQNLPITQTQPDSQQILWEYSLISNALERLENIELERQNCMREDGLVKYTNELLLNKETLNNEALKLYSCAKAGICRWMAFHFLEQEPIDHINFTKFLQDWGSHNEKEMEALQRLSKHKIRKRLIYVSQHKKKMPWSKFNSVLSRYIQCTKLQLEVFCDYDFKQREIVKMLTSNIN 249 T 0.14 DUF5896 unppssm F Eukaryota T 5we2 2 B,D B,D TPZ1_SCHPO MEIOTICALLY UP-REGULATED GENE 169 PROTEIN SEACEMCRLGLPHGSFFELLRDWKKIEEFRNKS 33 T 0.75 RPAP3_C pdbhh F Eukaryota T 5we2 3 E F RAP1_SCHPO DNA-binding protein rap1 SDNIFVKPGEDLEIPLLSDYSDSENISEKS 30 T 0.89 DUF3983 pdbhh F Eukaryota T 5wes 3 C P ENV_HV1BR ENVELOPE GLYCOPROTEIN GP160, ENV POLYPROTEIN RGPGC 5 F T Viruses F 5wet 3 C P ENV_HV1BR ENVELOPE GLYCOPROTEIN GP160, ENV POLYPROTEIN RGPGCA 6 F T Viruses F 5wfv 2 C P NF2L2_HUMAN Nrf2 ETGE peptide LDEETGEFL 9 T 0.055 Radial_spoke unppercent F Eukaryota T 5wg1 2 C P Nrf2 EAGE mutant peptide LDEEAGEFL 9 T 1.3 Herpes_US9 pdbhh F T 5wgd 3 D F (ACE)AILHKLLQDS(NH2) XAILHKLLQDSX 12 T 0.0022 SRC-1 pdbhh F T 5whn 1 A A TADBP_HUMAN TDP-43 NFGAFS 6 T 1.5 HU-CCDC81_bac_1 pdbhh F Eukaryota F 5whp 1 A A TADBP_HUMAN TDP-43 NFGTFS 6 T 0.19 HU-CCDC81_bac_1 pdbhh F Eukaryota F 5wia 1 A A TADBP_HUMAN TDP-43 GNNSYS 6 T 1.1 DUF2477 unppercent F Eukaryota F 5wiq 1 A,B A,B TADBP_HUMAN TDP-43 GFNGGFG 7 T 1.9 DUF4542 pdbhh F Eukaryota F 5wir 1 A,C D,C TERB1_HUMAN TERB1-TBM SKKILLTPRRRQRLS 15 T 0.73 WSK pdbhh F Eukaryota T 5wjc 2 B B MIS19_SCHPO Eic1 protein MDLMPLEKARAIEIAFDNVFHNTKIPDNLQQFDAILKRLERRRFIPTENQKPRVYETELLVLRFREFGVKDNHNHPINLHSLRSKSLIRAQGKKLDLHNRVFLRRNVRAVKM 112 T 6.6 Ins_allergen_rp pdbhh F Eukaryota T 5wje 2 B B Actin N-terminus peptide DDDIX 5 T 210 Ada3 pdbhh F F 5wk1 1 A,B,C,D,E,F,G X,L,Y,M,S,Z,K S5MS27_9CAUD Capsid Stabilizing Protein MANSKNSIFVGGAGRVKQTIEGLAQSAFKPGQLLARAAGDAIDVTAKASTTYGNEFLICDDQPQTLGGGTDVAVTAGDTVQAISVLPGQYVLLSFAATQNVTTKGAAVASNGDGNFKLGNPATEQTFAVTEEIINVTTAGTLVLCRAI 148 T 13 PP_kinase_N pdbhh T Viruses T 5wkb 1 A A TADBP_HUMAN TDP-43 NFGEFS 6 T 0.36 Peptidase_C58 pdbhh F Eukaryota F 5wkd 1 A A TADBP_HUMAN TDP-43 GNNQGSN 7 T 14 CBM_21 pdbhh F Eukaryota F 5wkt 2 B B Transducin Galpha peptide ILENLKDVGLF 11 T 2.1 Phage_holin_4_1 pdbhh F T 5wlb 2 B,E B,E 225-15 a SGPRRPRXPGDQASLEELHEYWARLWNYLYRVAH 34 T 0.00013 Hormone_3 pdbpssm F T 5wlc 17 Q LI UTP8_YEAST Utp8 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXRLFKQAIVTCPNLPLNELLEELFSIRNRELLLDISFRILQDFTRDSIKQEMKKLSKLDVQNFIEFITSGGEDSSPECFNPSQSTQLFQLLSLVLDSIGLFSLEGALLENLTLYIDKQVEIAERNTELWNLIDTKGFQHGFASSTFDNGTSQKRALPTYTMEYLDI 713 T 8.500000000000002E-245 Utp8 unppssm F Eukaryota T 5wlc 38 MA NE FAF1_YEAST Faf1 MTLDDDDYIKQMELQRKAFESQFGSLESMGFEDKTKNIRTEVDTRDSSGDEIDNSDHGSDFKDGTIESSNSSDEDSGNETAEENNQDSKPKXXXXXXXXXXXXXXXXXXXXXXXXXXXSGKTLTQINKKLESTEAKEEKEDETLEAENLQNDLELQQFLRESHLLSAFNNGGSGSTNSGVSLTLQSMGGGNDDGIVYQDDQVIGKARSRTLEMRLNRLSRVNGHQDKINKLEKVPMHIRRGMIDKHVKRIKKYEQEAAEGGIVLSKVKKGQFRKIESTYKKDIERRIGGSIKARDKEKATKRERGLKISSVGRSTRNGLIVSKRDIARISGGERSGKFNGKKKSRR 346 T 1.9E-09 DUF4602 pdbpssm F Eukaryota T 5wlc 57 IB SP Utp20 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 982 F F F 5wlc 64 PB SX Unassigned peptides XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 103 F F F 5wlh 1 A A A0A2D0TCH1_LACNK LbaCas13a H328A (C2c2) SNAMKISKVREENRGAKLTVNAKTAVVSENRSQEGILYNDPSRYGKSRKNDEDRDRYIESRLKSSGKLYRIFNEDKNKRETDELQWFLSEIVKKINRRNGLVLSDMLSVDDRAFEKAFEKYAELSYTNRRNKVSGSPAFETCGVDAATAERLKGIISETNFINRIKNNIDNKVSEDIIDRIIAKYLKKSLCRERVKRGLKKLLMNAFDLPYSDPDIDVQRDFIDYVLEDFYHVRAKSQVSRSIKNMNMPVQPEGDGKFAITVSKGGTESGNKRSAEKEAFKKFLSDYASLDERVRDDMLRRMRRLVVLYFYGSDDSKLSDVNEKFDVWEDAAARRVDNREFIKLPLENKLANGKTDKDAERIRKNTVKELYRNQNIGCYRQAVKAVEEDNNGRYFDDKMLNMFFIHRIEYGVEKIYANLKQVTEFKARTGYLSEKIWKDLINYISIKYIAMGKAVYNYAMDELNASDKKEIELGKISEEYLSGISSFDYELIKAEEMLQRETAVYVAFAARHLSSQTVELDSENSDFLLLKPKGTMDKNDKNKLASNNILNFLKDKETLRDTILQYFGGHSLWTDFPFDKYLAGGKDDVDFLTDLKDVIYSMRNDSFHYATENHNNGKWNKELISAMFEHETERMTVVMKDKFYSNNLPMFYKNDDLKKLLIDLYKDNVERASQVPSFNKVFVRKNFPALVRDKDNLGIELDLKADADKGENELKFYNALYYMFKEIYYNAFLNDKNVRERFITKATKVADNYDRNKERNLKDRIKSAGSDEKKKLREQLQNYIAENDFGQRIKNIVQVNPDYTLAQICQLIMTEYNQQNNGCMQKKSAARKDINKDSYQHYKMLLLVNLRKAFLEFIKENYAFVLKPYKHDLCDKADFVPDFAKYVKPYAGLISRVAGSSELQKWYIVSRFLSPAQANHMLGFLHSYKQYVWDIYRRASETGTEINHSIAEDKIAGVDITDVDAVIDLSVKLCGTISSEISDYFKDDEVYAEYISSYLDFEYDGGNYKDSLNRFCNSDAVNDQKVALYYDGEHPKLNRNIILSKLYGERRFLEKITDRVSRSDIVEYYKLKKETSQYQTKGIFDSEDEQKNIKKFQEMKNIVEFRDLMDYSEIADELQGQLINWIYLRERDLMNFQLGYHYACLNNDSNKQATYVTLDYQGKKNRKINGAILYQICAMYINGLPLYYVDKDSSEWTVSDGKESTGAKIGEFYRYAKSFENTSDCYASGLEIFENISEHDNITELRNYIEHFRYYSSFDRSFLGIYSEVFDRFFTYDLKYRKNVPTILYNILLQHFVNVRFEFVSGKKMIGIDKKDRKIAKEKECARITIREKNGVYSEQFTYKLKNGTVYVDARDKRYLQSIIRLLFYPEKVNMDEMIEVKEKKKPSDNNTGKGYSKRDRQQDRKEYDKYKEKKKKEGNFLSGMGGNINWDEINAQLKN 1440 T 0.19 APEH_N unppercent F Bacteria T 5wlj 1 A,B,C,D A,B,C,D De Novo Metal Binding Helical Bundle XIEELLRKILEDEARHVAELEDIEKWLX 28 T 0.027 Ribonuc_red_sm pdbhh F T 5wlk 1 A,B,C,D A,B,C,D Helical Bundle 4EH2 XIEELLRKIIEDEVRHIAELEDIEKWLX 28 T 0.09 Ribonuc_red_sm pdbhh F T 5wll 1 A,B,C,D A,B,C,D Helical Bundle 4DH1 XIEELLRKILEDDARHVAELEDIEKWLX 28 T 0.66 Ald_deCOase pdbhh F T 5wlm 1 A,B,C,D A,B,C,D Helical Bundle 4DH2 XIEELLRKIIEDDVRHIAELEDIEKWLX 28 T 1.5 Rubrerythrin pdbhh F T 5wlp 1 A A ATG32_YEAST EXTRACELLULAR MUTANT PROTEIN 37 SNATNSFVMPKLSLTQKNPVFRLLILGRTGSSFYQSIPKEYQSLFELPKYHDSATFPQYTGIVIIFQELREMVSLLNRIVQYSQGKPVIPICQPGQVIQVKNVLKSFLRNKLVKLLFPPVVVTNKRDLKKMFQRLQDLSLEYGED 145 T 0.0048 FleQ pdbpercent F Eukaryota T 5wmn 3 E,F E,F SPI peptide from Influenza A virus SPIVPSFDM 9 T 1.4 Bul1_N pdbhh F T 5wmp 3 C C TPR peptide from CMV TPRVTGGGAM 10 T 7.7 PGK pdbhh F T 5wmr 3 C C QIK peptide from CMV QIKVRVDMV 9 T 1.5 Herpes_IE1 pdbhh F T 5woc 1 A,B A,B SER-PRO-GLU-GLU-ARG-ALA-GLN-LEU-CYS-THR-ALA-ALA-GLU-LYS-ALA-ASP-GLU-LEU-GLY SPEERAQLCTAAEKADELG 19 T 1.7 DNA_pol_P_Exo pdbhh F T 5wod 1 A A 38-mer peptide SPEERAQLLTAAEKADELGCPEERAQLLTAAEKADELG 38 T 1.7 DUF3721 pdb F T 5wou 2 B V Q9VE13_DROME GUK-holder, isoform A LPSFETAL 8 T 5.5 Comm pdbhh F Eukaryota T 5wpp 1 A,B A,B A0A076NBW8_9CYAN HPIU5 MGSSHHHHHHSSGLVPRGSHMASTSVVSIPINNAGFEDPFIEVVDDYTVDTPPGWTTYNPNNLVPEKRTTMTSNNGVGYVGPGTQFYNQLAPEGRNIGYIYLAQKPGSGVAGFEQILDATLEPDTKYTLMVDVGNFGGEFQKISLAGFPGYRVELLAGDTVLAADHNNLYIKDGEFKTSTVTFTATPDNPYLDQKLGIRLINLLQGTFSGLDFDNVRLTVEPAQT 225 T 0.27 CBM_4_9 pdbpercent F Bacteria T 5wps 1 A A A0A076NBW8_9CYAN HPIU5 MGSSHHHHHHSSGLVPRGSHMASTSVVSIPINNAGFEDPFIEVVDDYTVDTPPGWTTYNPNNLVPEKRTTWTSNNGVGYVGPGTQFYNQLAPEGRNIGFIYLAQKPGSGVAGFEQILDATLEPDTKYTLKVDVGNFGGEFQKISLAGFPGYRVELLAGDTVLAADHNNLYIKDGEFKTSTVTFTATPDNPYLDQKLGIRLINLLQGTFSGLDFDNVRLTVEPAQT 225 T 3.4 DUF642 pdbhh F Bacteria T 5wpu 1 A A A0A076NBW8_9CYAN HPIU5 MGSSHHHHHHSSGLVPRGSHMASTSVVSIPINNAGFEDPFIEVVDDYTVDTPPGWTTYNPNNLVPEKRTTWTSNNGVGYVGPGTQFYNQLAPEGRNIGSIYLAQKPGSGVAGFEQILDATLEPDTKYTLKVDVGNFGGEFQKISLAGFPGYRVELLAGDTVLAADHNNLYIKDGEFKTSTVTFTATPDNPYLDQKLGIRLINLLQGTFSGLDFDNVRLTVEPAQT 225 T 2.5 DUF642 pdbhh F Bacteria T 5wqd 2 H,I,J,K,L,M,N H,I,J,K,L,M,N NBN_HUMAN NBS1 KMRIPNYQLSPTKLPS 16 T 1.1 SpoIISB_antitox pdbhh F Eukaryota T 5wqe 1 A A C2C1_ALIAG AACC2C1 MAVKSIKVKLRLDDMPEIRAGLWKLHKEVNAGVRYYTEWLSLLRQENLYRRSPNGDGEQECDKTAEECKAELLERLRARQVENGHRGPAGSDDELLQLARQLYELLVPQAIGAKGDAQQIARKFLSPLADKDAVGGLGIAKAGNKPRWVRMREAGEPGWEEEKEKAETRKSADRTADVLRALADFGLKPLMRVYTDSEMSSVEWKPLRKGQAVRTWDRDMFQQAIERMMSWESWNQRVGQEYAKLVEQKNRFEQKNFVGQEHLVHLVNQLQQDMKEASPGLESKEQTAHYVTGRALRGSDKVFEKWGKLAPDAPFDLYDAEIKNVQRRNTRRFGSHDLFAKLAEPEYQALWREDASFLTRYAVYNSILRKLNHAKMFATFTLPDATAHPIWTRFDKLGGNLHQYTFLFNEFGERRHAIRFHKLLKVENGVAREVDDVTVPISMSEQLDNLLPRDPNEPIALYFRDYGAEQHFTGEFGGAKIQCRRDQLAHMHRRRGARDVYLNVSVRVQSQSEARGERRPPYAAVFRLVGDNHRAFVHFDKLSDYLAEHPDDGKLGSEGLLSGLRVMSVDLGLRTSASISVFRVARKDELKPNSKGRVPFFFPIKGNDNLVAVHERSQLLKLPGETESKDLRAIREERQRTLRQLRTQLAYLRLLVRCGSEDVGRRERSWAKLIEQPVDAANHMTPDWREAFENELQKLKSLHGICSDKEWMDAVYESVRRVWRHMGKQVRDWRKDVRSGERPKIRGYAKDVVGGNSIEQIEYLERQYKFLKSWSFFGKVSGQVIRAEKGSRFAITLREHIDHAKEDRLKKLADRIIMEALGYVYALDERGKGKWVAKYPPCQLILLEELSEYQFNNDRPPSENNQLMQWSHRGVFQELINQAQVHDLLVGTMYAAFSSRFDARTGAPGIRCRRVPARCTQEHNPEPFPWWLNKFVVEHTLDACPLRADDLIPTGEGEIFVSPFSAEEGDFHQIHADLNAAQNLQQRLWSDFDISQIRLRCDWGEVDGELVLIPRLTGKRTADSYSNKVFYTNTGVTYYERERGKKRRKVFAQEKLSEEEAELLVEADEAREKSVVLMRDPSGIINRGNWTRQKEFWSMVNQRIEGYLVKQIRSRVPLQDSACENTGDILEHHHHHH 1137 T 0.0038 RuvC_1 pdbhh F Bacteria T 5wql 3 E,F F,H ALA-ALA-ALA-ALA-ALA-ALA AAAAAA 6 T 340 UPF0253 pdbhh F F 5wql 4 G E ALA-ALA-ALA-ALA AAAA 4 T 900 Cyclin_C pdbhh F F 5wql 5 H G LEU-SER-ARG-SER LSRS 4 T 170 bpX2 pdbhh F F 5wrd 2 C,D C,D FYCO1_MOUSE Peptide from FYVE and coiled-coil domain-containing protein 1 DDAVFDIITDEELCQIQES 19 T 1.8 ComC pdbhh F Eukaryota T 5wri 2 C,D D,E ASP-PHE-GLU-ASP-TYR-GLU-PHE-ASP DFEDYEFD 8 T 6.1 UL11 pdbhh F F 5wrk 2 B P IRS1_RAT IRS-1,PP185 GYMPMSPG 8 T 0.1 STAT1_TAZ2bind pdbhh F Eukaryota F 5wrl 2 B P IRS1_RAT IRS-1,PP185 DYMPMSPK 8 T 0.082 STAT1_TAZ2bind pdbhh F Eukaryota T 5wrm 2 B P IRS1_RAT IRS-1,PP185 GYMMMSPS 8 T 0.5 FliP pdbhh F Eukaryota F 5wrx 1 A A analogue peptide VG13P VARGWGRKCPLFG 13 T 0.0016 Flavi_glycoprot pdbhh F T 5wsh 3 C C GLY-VAL-TRP-ILE-ARG-THR-PRO-THR-ALA GVWIRTPTA 9 T 0.13 Hepatitis_core pdbhh F T 5wti 3 C Z A0A0D0F5I0_9BACI UNCHARACTERIZED PROTEIN MATRSFILKIEPNEEVKKGLWKTHEVLNHGIAYYMNILKLIRQEAIYEHHEQDPKNPKKVSKAEIQAELWDFVLKMQKCNSFTHEVDKDVVFNILRELYEELVPSSVEKKGEANQLSNKFLYPLVDPNSQSGKGTASSGRKPRWYNLKIAGDPSWEEEKKKWEEDKKKDPLAKILGKLAEYGLIPLFIPFTDSNEPIVKEIKWMEKSRNQSVRRLDKDMFIQALERFLSWESWNLKVKEEYEKVEKEHKTLEERIKEDIQAFKSLEQYEKERQEQLLRDTLNTNEYRLSKRGLRGWREIIQKWLKMDENEPSEKYLEVFKDYQRKHPREAGDYSVYEFLSKKENHFIWRNHPEYPYLYATFCEIDKKKKDAKQQATFTLADPINHPLWVRFEERSGSNLNKYRILTEQLHTEKLKKKLTVQLDRLIYPTESGGWEEKGKVDIVLLPSRQFYNQIFLDIEEKGKHAFTYKDESIKFPLKGTLGGARVQFDRDHLRRYPHKVESGNVGRIYFNMTVNIEPTESPVSKSLKIHRDDFPKFVNFKPKELTEWIKDSKGKKLKSGIESLEIGLRVMSIDLGQRQAAAASIFEVVDQKPDIEGKLFFPIKGTELYAVHRASFNIKLPGETLVKSREVLRKAREDNLKLMNQKLNFLRNVLHFQQFEDITEREKRVTKWISRQENSDVPLVYQDELIQIRELMYKPYKDWVAFLKQLHKRLEVEIGKEVKHWRKSLSDGRKGLYGISLKNIDEIDRTRKFLLRWSLRPTEPGEVRRLEPGQRFAIDQLNHLNALKEDRLKKMANTIIMHALGYCYDVRKKKWQAKNPACQIILFEDLSNYNPYEERSRFENSKLMKWSRREIPRQVALQGEIYGLQVGEVGAQFSSRFHAKTGSPGIRCSVVTKEKLQDNRFFKNLQREGRLTLDKIAVLKEGDLYPDKGGEKFISLSKDRKLVTTHADINAAQNLQKRFWTRTHGFYKVYCKAYQVDGQTVYIPESKDQKQKIIEEFGEGYFILKDGVYEWGNAGKLKIKKGSSKQSSSELVDSDILKDSFDLASELKGEKLMLYRDPSGNVFPSDKWMAAGVFFGKLERILISKLTNQYSISTIEDDSSKQSM 1108 T 0.0023 RuvC_1 pdbhh F Bacteria T 5wtj 1 A,B A,B C2C2_LEPSD ENDORNASE,LSHC2C2 MGNLFGHKRWYEVRDKKDFKIKRKVKVKRNYDGNKYILNINENNNKEKIDNNKFIRKYINYKKNDNILKEFTRKFHAGNILFKLKGKEGIIRIENNDDFLETEEVVLYIEAYGKSEKLKALGITKKKIIDEAIRQGITKDDKKIEIKRQENEEEIEIDIRDEYTNKTLNDCSIILRIIENDELETKKSIYEIFKNINMSLYKIIEKIIENETEKVFENRYYEEHLREKLLKDDKIDVILTNFMEIREKIKSNLEILGFVKFYLNVGGDKKKSKNKKMLVEKILNINVDLTVEDIADFVIKELEFWNITKRIEKVKKVNNEFLEKRRNRTYIKSYVLLDKHEKFKIERENKKDKIVKFFVENIKNNSIKEKIEKILAEFKIDELIKKLEKELKKGNCDTEIFGIFKKHYKVNFDSKKFSKKSDEEKELYKIIYRYLKGRIEKILVNEQKVRLKKMEKIEIEKILNESILSEKILKRVKQYTLEHIMYLGKLRHNDIDMTTVNTDDFSRLHAKEELDLELITFFASTNMELNKIFSRENINNDENIDFFGGDREKNYVLDKKILNSKIKIIRDLDFIDNKNNITNNFIRKFTKIGTNERNRILHAISKERDLQGTQDDYNKVINIIQNLKISDEEVSKALNLDVVFKDKKNIITKINDIKISEENNNDIKYLPSFSKVLPEILNLYRNNPKNEPFDTIETEKIVLNALIYVNKELYKKLILEDDLEENESKNIFLQELKKTLGNIDEIDENIIENYYKNAQISASKGNNKAIKKYQKKVIECYIGYLRKNYEELFDFSDFKMNIQEIKKQIKDINDNKTYERITVKTSDKTIVINDDFEYIISIFALLNSNAVINKIRNRFFATSVWLNTSEYQNIIDILDEIMQLNTLRNECITENWNLNLEEFIQKMKEIEKDFDDFKIQTKKEIFNNYYEDIKNNILTEFKDDINGCDVLEKKLEKIVIFDDETKFEIDKKSNILQDEQRKLSNINKKDLKKKVDQYIKDKDQEIKSKILCRIIFNSDFLKKYKKEIDNLIEDMESENENKFQEIYYPKERKNELYIYKKNLFLNIGNPNFDKIYGLISNDIKMADAKFLFNIDGKNIRKNKISEIDAILKNLNDKLNGYSKEYKEKYIKKLKENDDFFAKNIQNKNYKSFEKDYNRVSEYKKIRDLVEFNYLNKIESYLIDINWKLAIQMARFERDMHYIVNGLRELGIIKLSGYNTGISRAYPKRNGSDGFYTTTAYYKFFDEESYKKFEKICYGFGIDLSENSEINKPENESIRNYISHFYIVRNPFADYSIAEQIDRVSNLLSYSTRYNNSTYASVFEVFKKDVNLDYDELKKKFKLIGNNDILERLMKPKKVSVLELESYNSDYIKNLIIELLTKIENTNDTLLEHHHHHH 1397 T 0.067 PET117 pdbpercent F Bacteria T 5wtk 1 A A C2C2_LEPSD ENDORNASE,LSHC2C2 MGNLFGHKRWYEVRDKKDFKIKRKVKVKRNYDGNKYILNINENNNKEKIDNNKFIRKYINYKKNDNILKEFTRKFHAGNILFKLKGKEGIIRIENNDDFLETEEVVLYIEAYGKSEKLKALGITKKKIIDEAIRQGITKDDKKIEIKRQENEEEIEIDIRDEYTNKTLNDCSIILRIIENDELETKKSIYEIFKNINMSLYKIIEKIIENETEKVFENRYYEEHLREKLLKDDKIDVILTNFMEIREKIKSNLEILGFVKFYLNVGGDKKKSKNKKMLVEKILNINVDLTVEDIADFVIKELEFWNITKRIEKVKKVNNEFLEKRRNRTYIKSYVLLDKHEKFKIERENKKDKIVKFFVENIKNNSIKEKIEKILAEFKIDELIKKLEKELKKGNCDTEIFGIFKKHYKVNFDSKKFSKKSDEEKELYKIIYRYLKGRIEKILVNEQKVRLKKMEKIEIEKILNESILSEKILKRVKQYTLEHIMYLGKLRHNDIDMTTVNTDDFSRLHAKEELDLELITFFASTNMELNKIFSRENINNDENIDFFGGDREKNYVLDKKILNSKIKIIRDLDFIDNKNNITNNFIRKFTKIGTNERNRILHAISKERDLQGTQDDYNKVINIIQNLKISDEEVSKALNLDVVFKDKKNIITKINDIKISEENNNDIKYLPSFSKVLPEILNLYRNNPKNEPFDTIETEKIVLNALIYVNKELYKKLILEDDLEENESKNIFLQELKKTLGNIDEIDENIIENYYKNAQISASKGNNKAIKKYQKKVIECYIGYLRKNYEELFDFSDFKMNIQEIKKQIKDINDNKTYERITVKTSDKTIVINDDFEYIISIFALLNSNAVINKIRNRFFATSVWLNTSEYQNIIDILDEIMQLNTLRNECITENWNLNLEEFIQKMKEIEKDFDDFKIQTKKEIFNNYYEDIKNNILTEFKDDINGCDVLEKKLEKIVIFDDETKFEIDKKSNILQDEQRKLSNINKKDLKKKVDQYIKDKDQEIKSKILCRIIFNSDFLKKYKKEIDNLIEDMESENENKFQEIYYPKERKNELYIYKKNLFLNIGNPNFDKIYGLISNDIKMADAKFLFNIDGKNIRKNKISEIDAILKNLNDKLNGYSKEYKEKYIKKLKENDDFFAKNIQNKNYKSFEKDYNRVSEYKKIRDLVEFNYLNKIESYLIDINWKLAIQMARFERDMHYIVNGLRELGIIKLSGYNTGISRAYPKRNGSDGFYTTTAYYKFFDEESYKKFEKICYGFGIDLSENSEINKPENESIRNYISHFYIVRNPFADYSIAEQIDRVSNLLSYSTRYNNSTYASVFEVFKKDVNLDYDELKKKFKLIGNNDILERLMKPKKVSVLELESYNSDYIKNLIIELLTKIENTNDTLLEHHHHHH 1397 T 0.067 PET117 pdbpercent F Bacteria T 5wtt 3 C,F P,C Epitope peptide of Cyr61 CGLECNFG 8 T 0.17 DUF3330 pdbhh F T 5wuj 1 A A O25118_HELPY Flagellar M-ring protein FSEEEVRYEIILEKIRGTLKERPDEIAMLFKLLIKDE 37 T 0.061 Peptidase_C34 pdbpssm F Bacteria T 5wum 2 B,C B,C EBNA1_EBVB9 EBV NUCLEAR ANTIGEN 1 EKRPRSPSS 9 T 7.4 Ribosomal_S12 pdbhh T Viruses F 5wun 2 B,C B,C EBNA1_EBVB9 EBV NUCLEAR ANTIGEN 1 EKRPRSPSS 9 T 7.4 Ribosomal_S12 pdbhh T Viruses F 5wxe 1 A A A0A2R2JFU3_9LAMI jasmintide js3 QLCLLCQTSRDCNYIIWTVCRDGCCNIS 28 T 0.021 PAN_4 pdbpercent F Eukaryota T 5wxf 2 B P upain-2-2 peptide CSWXGLENHAAC 12 T 0.53 DUF2632 pdbhh F T 5wxn 2 C,D C,D STK11_HUMAN Serine/threonine-protein kinase STK11 RWRSMTVVPYLED 13 T 0.034 WWamide pdbhh F Eukaryota T 5wxo 2 B P upain-2-2-W3A peptide CSAXGLENHAAC 12 T 6.4 LRRNT pdbhh F T 5wxp 2 B P upain-2-3-W3A peptide CSAX 4 T 120 zf-H2C2_2 pdbhh F F 5wxq 2 B P upain-2-4 peptide GACSWRGLENHAAC 14 T 1 DUF2632 pdbhh F T 5wxr 2 B P upain-2-4-W3A peptide GACSARGLENHAAC 14 T 5.5 RE_SacI pdbhh F T 5wye 1 A A Au-VG16KRKP VARGWKRKCPLFGKGG 16 T 0.0078 Flavi_glycoprot pdbhh F T 5wyh 2 B,D B,D Interaptin LEEYIRMAKNKEFFDALEEIAESAKNDETLRNELAKVLDDILKTDPSDPEAFRKIVAEHQEFWDEHDPSLMEFNEGRFFGKSRKQYLKSDDFLNSTDPTYNFQKLHQFAAEQRVKLGLEKSDTDTLVAILKNNPEECRAYIESKKPGLGNFSEGNVHGWLKEEYTPTIPPKAINKSTGVLSDEAIKRIKEQARDLLLL 198 T 0.013 SE pdb F T 5wyj 8 J AA Utp4 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 776 F F F 5wyj 9 K AB Utp5 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 643 F F F 5wyj 10 L AC Utp8 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 713 F F F 5wyj 11 M AD Utp9 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 575 F F F 5wyj 13 O AF Utp15 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 513 F F F 5wyj 14 P AG Utp17 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 896 F F F 5wyj 25 BA E4 Enp2 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 707 F F F 5wyj 33 JA S1 Sof1 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 489 F F F 5wyj 54 EB U1 Utp7 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 554 F F F 5wyj 55 FB U2 Utp11 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 250 F F F 5wyj 56 GB U3 Utp20 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 2493 F F F 5wyj 59 JB UA Helical domain protein XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 1615 F F F 5wyj 60 KB UB Helical domain protein XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 987 F F F 5wyj 61 LB UC Unassigned helices XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 1033 F F F 5wyk 8 J AA Utp4 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 776 F F F 5wyk 9 K AB Utp5 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 643 F F F 5wyk 10 L AC Utp8 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 713 F F F 5wyk 11 M AD Utp9 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 575 F F F 5wyk 13 O AF Utp15 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 513 F F F 5wyk 14 P AG Utp17 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 896 F F F 5wyk 24 AA E4 Enp2 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 707 F F F 5wyk 31 HA S1 Sof1 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 489 F F F 5wyk 49 ZA U1 Utp7 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 554 F F F 5wyk 50 AB U2 Utp11 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 250 F F F 5wyk 53 DB UA Helical domain protein XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 1615 F F F 5wyk 54 EB UB Helical domain protein XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 987 F F F 5wyk 55 FB UC Unassigned helices XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 1033 F F F 5wyl 2 B,D B,D G0SCS8_CHATD UTP17 DLDMEDNEDTHAVVVAPQRLAEIFNAAPAFAMPPIEDVFYQVASLFSTKPVINA 54 T 2.7 VQ pdbhh F Eukaryota T 5wzz 2 E,F,G,H E,F,G,H AXIN1_HUMAN AXIS INHIBITION PROTEIN 1,HAXIN YRVPKEVRVEPQKFAEELIH 20 T 0.19 Cwf_Cwc_15 unppssm F Eukaryota T 5x0s 1 A A TXF1A_SCOSU SsTx EVIKKDTPYKKRKFPYKSECLKACATSFTGGDESRIQEGKPGFFKCTCYFTTG 53 T 0.08 DUF5760 pdbpercent F Eukaryota T 5x1e 2 B,E B,E Q5ZS31_LEGPH IcmW PDLSHEASAKYWFEYLDPMIYRVITFMESVENWTLDGNPELEEAMKQLGQELDDIEKIDLGLLAEEDKFIRIVGNIKSGRGLRLLQAIDTVHPGSASRVLIHAEETSLSSSDPAGFFLKRNIVFERLRLLSRVFCQYRLKLVLRALEG 148 T 0.11 DUF2335 pdbpercent F Bacteria T 5x1e 3 C C Q5ZYC6_LEGPH IcmO (DotL) EGALTIFSKLRIDPNAPPILVADKEVFSEPLLPINETRNQMITIERLAGAKDKYAGTVANELIKDFQIATSYPPEERDVIDVQELTGIIRDLSAKISAEREK 102 T 0.74 RecC_C unppssm F Bacteria T 5x1e 5 F F Q5ZYC6_LEGPH IcmO (DotL) ALTIFSKLRIDPNAPPILVADKEVFSEPLLPINETRNQMITIERLAGAKDKYAGTVANELIKDFQIATSYPPEERDVIDVQELTGIIRDLSAKISAEREK 100 T 0.1 Glypican pdbpssm F Bacteria T 5x1g 1 A C WHAMM_HUMAN WAS PROTEIN HOMOLOGY REGION 2 DOMAIN-CONTAINING PROTEIN 1,WH2 DOMAIN-CONTAINING PROTEIN 1 IQMKRDKIKEEEQKKKEWINQERQKTLQRLRSFK 34 T 0.042 Trimer_CC pdbpssm F Eukaryota T 5x1u 1 A,B A,B Q5WZ95_LEGPL Uncharacterized protein ALTPMEFARKYNLLRKDDALLDNPVPGEEMTAGIRRGDAKRVFTMQLGPYWDGFERCSPQAYALSAVFMARMNRDRDAANNILKVLDKTFVDGKPDFSVARPVMKKYQNSELVQEVVAKHAYVLTVIASLLEAAREDGVVPSSEFLWLKPVDRRLWYMLNCVGRQTPYSEVAGPFAHWKAEKEMGRRSLVPMIDEAIRALEIAVKEVR 208 T 0.31 Pkip-1 pdb F Bacteria T 5x3m 2 B D D-ubiquitin XXXXXXXXXGXXXXXXXXXXXXXXXXXXXXXXXXGXXXXXXXXXXXGXXXXXGXXXXXXXXXXXXXXXXXXXXXGG 76 F F F 5x3o 2 B D D-ubiquitin XXXXXXXXXGXXXXXXXXXXXXXXXXXXXXXXXXGXXXXXXXXXXXGXXXXXGXXXXXXXXXXXXXXXXXXXXXGG 76 F F F 5x42 2 B,D B,D Q5ZYC6_LEGPH IcmO (DotL) VEPPPDDYLMKLQKQLASFQSILESGDLSINKAVENEEITLISKALKESTIVEPIERGVAALIAFHGQNE 70 T 0.12 DUF2433 pdb F Bacteria T 5x54 2 C,D C,D ACE-GLU-TRP-TRP-TRP XEWWW 5 T 27 Svf1_C pdbhh F F 5x6x 1 A,B,C,D C,A,B,D MCE_RDVA mRNA capping enzyme P5 GGSMSNPDYCIPNFSQTVNERTIIDIFTICRYRSPLVVFCLSHNELAKKYAQDVSMSSGTHVHIIDGSVEITVSLYRTFRTIATQLLGRMQIVVFVTVDKSVVSTQVMKSIAWAFRGSFVELRNQSVDSSTLVSKLENLVSFAPLYNVPKCGPDYYGPTVYSELLSLATNARTHWYATIDYSMFTRSVLTGFVAKYFNEEAVPIDKRIVSIVGYNPPYVWTCLRHGIRPTYIEKSLPNPGGKGPFGLILPVINELVLKSKVKYVMHNPQIKLLCLDTFMLSTSMNILYIGAYPATHLLSLQLNGWTILAFDPKITSDWTDAMAKATGAKVIGVSKEFDFKSFSVQANQLNMFQNSKLSVIDDTWVETDYEKFQSEKQAYFEWLIDRTSIDVRLISMKWNRSKDTSVSHLLALLPQPYGASIREMRAFFHKKGASDIKILAAETEKYMDDFTAMSVSDQINTQKFMHCMITTVGDALKMDLDGGRAVIASYSLSNSSNSKERVLKFLSDANKAKAMVVFGAPNTHRLAYAKKVGLVLDSAIKMSKDLITFSNPTGRRWRDYGYSQSELYDAGYVEITIDQMVAYSSDVYNGVGYFANSTYNDLFSWYIPKWYVHKRMLMQDIRLSPAALVKCFTTLIRNICYVPHETYYRFRGILVDKYLRSKNVDPSQYSIVGSGSKTFTVLSHFEVPHECGPLVFEASTDVNISGHLLSLAIAAHFVASPMILWAEQMKYMAVDRMLPPNLDKSLFFDNKVTPSGALQRWHSREEVLLAAEICESYAAMMLNNKHSPDIIGTLKSAINLVFKI 804 T 5.8E-05 PARP_regulatory unphh T Viruses T 5x6y 1 A,B,C,D A,B,C,D MCE_RDVA mRNA capping enzyme P5 GGSMSNPDYCIPNFSQTVNERTIIDIFTICRYRSPLVVFCLSHNELAKKYAQDVSMSSGTHVHIIDGSVEITVSLYRTFRTIATQLLGRMQIVVFVTVDKSVVSTQVMKSIAWAFRGSFVELRNQSVDSSTLVSKLENLVSFAPLYNVPKCGPDYYGPTVYSELLSLATNARTHWYATIDYSMFTRSVLTGFVAKYFNEEAVPIDKRIVSIVGYNPPYVWTCLRHGIRPTYIEKSLPNPGGKGPFGLILPVINELVLKSKVKYVMHNPQIKLLCLDTFMLSTSMNILYIGAYPATHLLSLQLNGWTILAFDPKITSDWTDAMAKATGAKVIGVSKEFDFKSFSVQANQLNMFQNSKLSVIDDTWVETDYEKFQSEKQAYFEWLIDRTSIDVRLISMKWNRSKDTSVSHLLALLPQPYGASIREMRAFFHKKGASDIKILAAETEKYMDDFTAMSVSDQINTQKFMHCMITTVGDALKMDLDGGRAVIASYSLSNSSNSKERVLKFLSDANKAKAMVVFGAPNTHRLAYAKKVGLVLDSAIKMSKDLITFSNPTGRRWRDYGYSQSELYDAGYVEITIDQMVAYSSDVYNGVGYFANSTYNDLFSWYIPKWYVHKRMLMQDIRLSPAALVKCFTTLIRNICYVPHETYYRFRGILVDKYLRSKNVDPSQYSIVGSGSKTFTVLSHFEVPHECGPLVFEASTDVNISGHLLSLAIAAHFVASPMILWAEQMKYMAVDRMLPPNLDKSLFFDNKVTPSGALQRWHSREEVLLAAEICESYAAMMLNNKHSPDIIGTLKSAINLVFKI 804 T 5.8E-05 PARP_regulatory unphh T Viruses T 5x6z 1 A,B,C,D C,A,D,B MCE_RDVA mRNA capping enzyme P5 GGSMSNPDYCIPNFSQTVNERTIIDIFTICRYRSPLVVFCLSHNELAKKYAQDVSMSSGTHVHIIDGSVEITVSLYRTFRTIATQLLGRMQIVVFVTVDKSVVSTQVMKSIAWAFRGSFVELRNQSVDSSTLVSKLENLVSFAPLYNVPKCGPDYYGPTVYSELLSLATNARTHWYATIDYSMFTRSVLTGFVAKYFNEEAVPIDKRIVSIVGYNPPYVWTCLRHGIRPTYIEKSLPNPGGKGPFGLILPVINELVLKSKVKYVMHNPQIKLLCLDTFMLSTSMNILYIGAYPATHLLSLQLNGWTILAFDPKITSDWTDAMAKATGAKVIGVSKEFDFKSFSVQANQLNMFQNSKLSVIDDTWVETDYEKFQSEKQAYFEWLIDRTSIDVRLISMKWNRSKDTSVSHLLALLPQPYGASIREMRAFFHKKGASDIKILAAETEKYMDDFTAMSVSDQINTQKFMHCMITTVGDALKMDLDGGRAVIASYSLSNSSNSKERVLKFLSDANKAKAMVVFGAPNTHRLAYAKKVGLVLDSAIKMSKDLITFSNPTGRRWRDYGYSQSELYDAGYVEITIDQMVAYSSDVYNGVGYFANSTYNDLFSWYIPKWYVHKRMLMQDIRLSPAALVKCFTTLIRNICYVPHETYYRFRGILVDKYLRSKNVDPSQYSIVGSGSKTFTVLSHFEVPHECGPLVFEASTDVNISGHLLSLAIAAHFVASPMILWAEQMKYMAVDRMLPPNLDKSLFFDNKVTPSGALQRWHSREEVLLAAEICESYAAMMLNNKHSPDIIGTLKSAINLVFKI 804 T 5.8E-05 PARP_regulatory unphh T Viruses T 5x70 1 A,B,C,D A,C,D,B MCE_RDVA mRNA capping enzyme P5 GGSMSNPDYCIPNFSQTVNERTIIDIFTICRYRSPLVVFCLSHNELAKKYAQDVSMSSGTHVHIIDGSVEITVSLYRTFRTIATQLLGRMQIVVFVTVDKSVVSTQVMKSIAWAFRGSFVELRNQSVDSSTLVSKLENLVSFAPLYNVPKCGPDYYGPTVYSELLSLATNARTHWYATIDYSMFTRSVLTGFVAKYFNEEAVPIDKRIVSIVGYNPPYVWTCLRHGIRPTYIEKSLPNPGGKGPFGLILPVINELVLKSKVKYVMHNPQIKLLCLDTFMLSTSMNILYIGAYPATHLLSLQLNGWTILAFDPKITSDWTDAMAKATGAKVIGVSKEFDFKSFSVQANQLNMFQNSKLSVIDDTWVETDYEKFQSEKQAYFEWLIDRTSIDVRLISMKWNRSKDTSVSHLLALLPQPYGASIREMRAFFHKKGASDIKILAAETEKYMDDFTAMSVSDQINTQKFMHCMITTVGDALKMDLDGGRAVIASYSLSNSSNSKERVLKFLSDANKAKAMVVFGAPNTHRLAYAKKVGLVLDSAIKMSKDLITFSNPTGRRWRDYGYSQSELYDAGYVEITIDQMVAYSSDVYNGVGYFANSTYNDLFSWYIPKWYVHKRMLMQDIRLSPAALVKCFTTLIRNICYVPHETYYRFRGILVDKYLRSKNVDPSQYSIVGSGSKTFTVLSHFEVPHECGPLVFEASTDVNISGHLLSLAIAAHFVASPMILWAEQMKYMAVDRMLPPNLDKSLFFDNKVTPSGALQRWHSREEVLLAAEICESYAAMMLNNKHSPDIIGTLKSAINLVFKI 804 T 5.8E-05 PARP_regulatory unphh T Viruses T 5x71 1 A,B A,B MCE_RDVA mRNA capping enzyme P5 GGSMSNPDYCIPNFSQTVNERTIIDIFTICRYRSPLVVFCLSHNELAKKYAQDVSMSSGTHVHIIDGSVEITVSLYRTFRTIATQLLGRMQIVVFVTVDKSVVSTQVMKSIAWAFRGSFVELRNQSVDSSTLVSKLENLVSFAPLYNVPKCGPDYYGPTVYSELLSLATNARTHWYATIDYSMFTRSVLTGFVAKYFNEEAVPIDKRIVSIVGYNPPYVWTCLRHGIRPTYIEKSLPNPGGKGPFGLILPVINELVLKSKVKYVMHNPQIKLLCLDTFMLSTSMNILYIGAYPATHLLSLQLNGWTILAFDPKITSDWTDAMAKATGAKVIGVSKEFDFKSFSVQANQLNMFQNSKLSVIDDTWVETDYEKFQSEKQAYFEWLIDRTSIDVRLISMKWNRSKDTSVSHLLALLPQPYGASIREMRAFFHKKGASDIKILAAETEKYMDDFTAMSVSDQINTQKFMHCMITTVGDALKMDLDGGRAVIASYSLSNSSNSKERVLKFLSDANKAKAMVVFGAPNTHRLAYAKKVGLVLDSAIKMSKDLITFSNPTGRRWRDYGYSQSELYDAGYVEITIDQMVAYSSDVYNGVGYFANSTYNDLFSWYIPKWYVHKRMLMQDIRLSPAALVKCFTTLIRNICYVPHETYYRFRGILVDKYLRSKNVDPSQYSIVGSGSKTFTVLSHFEVPHECGPLVFEASTDVNISGHLLSLAIAAHFVASPMILWAEQMKYMAVDRMLPPNLDKSLFFDNKVTPSGALQRWHSREEVLLAAEICESYAAMMLNNKHSPDIIGTLKSAINLVFKI 804 T 5.8E-05 PARP_regulatory unphh T Viruses T 5x7v 1 A,B,C,D,E,F A,B,C,D,E,F Q9Y010_PLAFA Nucleosome assembly protein FMQDFEDIQKDIEQLDIKCAHEQMNIQKQYDEKKKPLFEKRDEIIQKIPGFWANTLRKHPALSDIVPEDIDILNHLVKLDLKDNMDNNGSYKITFIFGEKAKEFMEPLTLVKHVTFDNNQEKVVECTRIKWKEGKNPIAAVTHNRSDLDNEIPKWSIFEWFTTDELQDKPDVGELIRREIWHNPLSYYLGLEE 193 T 7.5E-07 NAP pdbpssm F Eukaryota T 5x8p 6 F 6 PSRP5_SPIOL protein cL37 MALLSPLLSLSSVPPITSIAVSSSSFPIKLQNVSVALLPTLGQRLMTHGPVIAQKRGTVVAMVSAAADETAGEDGDQSKVEEANISVQNLPLESKLQLKLEQKMKMKMAKKIRLRRNRLMRKRKLRKRGAWPPSKMKKLKNV 142 T 0.084 DUF3381 pdbpssm F Eukaryota T 5x8q 2 B,D,F,H B,D,F,H NCOR2_HUMAN N-COR2,CTG REPEAT PROTEIN 26,SMAP270,SILENCING MEDIATOR OF RETINOIC ACID AND THYROID HORMONE RECEPTOR,SMRT,T3 RECEPTOR-ASSOCIATING FACTOR,TRAC,THYROID-,RETINOIC-ACID-RECEPTOR-ASSOCIATED COREPRESSOR TNMGLEAIIRKALMGKYDQWEE 22 T 3.5 RHH_7 pdbhh F Eukaryota T 5x8t 6 F 6 PSRP5_SPIOL protein cL37 MALLSPLLSLSSVPPITSIAVSSSSFPIKLQNVSVALLPTLGQRLMTHGPVIAQKRGTVVAMVSAAADETAGEDGDQSKVEEANISVQNLPLESKLQLKLEQKMKMKMAKKIRLRRNRLMRKRKLRKRGAWPPSKMKKLKNV 142 T 0.084 DUF3381 pdbpssm F Eukaryota T 5x8x 2 B,D,F,H B,D,F,H NCOR2_HUMAN N-COR2,CTG REPEAT PROTEIN 26,SMAP270,SILENCING MEDIATOR OF RETINOIC ACID AND THYROID HORMONE RECEPTOR,SMRT,T3 RECEPTOR-ASSOCIATING FACTOR,TRAC,THYROID-,RETINOIC-ACID-RECEPTOR-ASSOCIATED COREPRESSOR TNMGLEAIIRKALMGKYDQWEE 22 T 3.5 RHH_7 pdbhh F Eukaryota T 5x90 2 B F Q5ZS31_LEGPH IcmW PDLSHEASAKYWFEYLDPMIYRVITFMESVENWTLDGNPELEEAMKQLGQELDDIEKIDLGLLAEEDKFIRIVGNIKSGRGLRLLQAIDTVHPGSASRVLIHAEETSLSSSDPAGFFLKRNIVFERLRLLSRVFCQYRLKLVLRALEGD 149 T 0.11 DUF2335 pdbpercent F Bacteria T 5x90 3 C G Q5ZYC6_LEGPH IcmO (DotL) EGALTIFSKLRIDPNAPPILVADKEVFSEPLLPINETRNQMITIERLAGAKDKYAGTVANELIKDFQIATSYPPEERDVIDVQELTGIIRDLSAKISAEREKANKKAA 108 T 0.061 Csm2_III-A pdbpssm F Bacteria T 5x90 4 D H Q5ZY48_LEGPH Hypothetical virulence protein LTMIDDLNNPLAIVERVYLIWWHWADFHLHVISPHIDTITPAIVIEPELIPGSNDHEFVYSIHDSGSKLSTSKSQDMFSAGMSMCKLFYTIEKMVYILVERLKSGGVSMEAEVQIAFAGHEIAQRKAFESIINLPYNVVVTNFDPGIWGEKYLQNVKRLADKGYGYPPESPR 172 T 0.21 Herpes_TK_C pdbpercent F Bacteria T 5x90 5 F B Q5ZS31_LEGPH IcmW PDLSHEASAKYWFEYLDPMIYRVITFMESVENWTLDGNPELEEAMKQLGQELDDIEKIDLGLLAEEDKFIRIVGNIKSGRGLRLLQAIDTVHPGSASRVLIHAEETSLSSSDPAGFFLKRNIVFERLRLLSRVFCQYRLKLVLRALEG 148 T 0.11 DUF2335 pdbpercent F Bacteria T 5x90 6 G C Q5ZYC6_LEGPH IcmO (DotL) EGALTIFSKLRIDPNAPPILVADKEVFSEPLLPINETRNQMITIERLAGAKDKYAGTVANELIKDFQIATSYPPEERDVIDVQELTGIIRDLSAKISAEREKANKKA 107 T 0.7 DUF3811 pdbpercent F Bacteria T 5x90 7 H D Q5ZY48_LEGPH Hypothetical virulence protein IDDLNNPLAIVERVYLIWWHWADFHLHVISPHIDTITPAIVIEPELIPGSNDHEFVYSIHDSGSKLSTSKSQDMFSAGMSMCKLFYTIEKMVYILVERLKSGGVSMEAEVQIAFAGHEIAQRKAFESIINLPYNVVVTNFDPGIWGEKYLQNVKRLADKGYGYPPESPRKI 171 T 0.21 Herpes_TK_C pdbpercent F Bacteria T 5x9x 1 A A Q9BML7_DROME Metabotropic GABA-B receptor subtype 1 MDSKEDEERYQKLVTENEQLQRLITQKEEKIRVLRQRLVERGDA 44 T 0.0039 Csm1_N pdb F Eukaryota T 5x9x 2 B B Q9BML6_DROME GABAB RECEPTOR 2 GPLGSSVSELEQRLRDVKNTNSRFRKALMEKENELQALIRKLGPE 45 T 0.011 MIP-T3_C pdbpercent F Eukaryota T 5xa5 2 B B HMP2_CAEEL PROTEIN HUMPBACK-2 GGIQTSAAEATNSTTSIVEMMQMPTQQLKQSVMDLLTYEGSNDMSGLS 48 T 3.7E-05 Adaptin_N unppssm F Eukaryota T 5xad 2 C,D C,D Q5ZUV9_LEGPH Uncharacterised protein GSIVDEFEELGEQESDIDEFDLLEG 25 T 14 LMP pdbhh F Bacteria T 5xbd 1 A A A0A2R2JFU8_9CARY pB1 QCKPNGAKCTEISIPPCCSNFCLRYAGQKSGTCANR 36 T 0.00093 Antifungal_pept pdb F Eukaryota T 5xbl 2 B D A0A247D711_LISMN Associated protein MNINDLIREIKNKDYTVKLSGTDSNSITQLIIRVNNDGNEYVISESENESIVEKFISAFKNGWNQEYEDEEEFYNDMQTITLKSELN 87 T 0.033 DUF4930 pdb F Bacteria T 5xco 2 B B ACE-ARG-ARG-ARG-ARG-CYS-PRO-LEU-TYR-ILE-SER-TYR-ASP-PRO-VAL-CYS-ARG-ARG-ARG-ARG-NH2 XRRRRCPLYISYDPVCRRRRX 21 T 1.5 YliH pdbhh F T 5xcq 3 C C C8 peptide PRGYPGQV 8 T 0.16 Gag_p19 pdbhh F T 5xcr 3 C,F C,F C8 peptide PRGYPGQV 8 T 0.16 Gag_p19 pdbhh F T 5xcs 3 C C HA peptide YPYDVPDYA 9 T 0.22 DUF3437 pdbhh F F 5xct 3 C C C8 peptide PRGYPGQV 8 T 0.16 Gag_p19 pdbhh F T 5xcu 3 C,F C,F HA peptide YPYDVPDYA 9 T 0.22 DUF3437 pdbhh F F 5xdp 1 A B D-ubiquitin XXXXXXXXXGXXXXXXXXXXXXXXXXXXXXXXXXGXXXXXXXXXXXGXXXXXGXXXXXXXXXXXXXXXXXXXXXGG 76 F F F 5xhp 1 A,B F,E E8XCX6_SALT4 Putative cytoplasmic protein MARFNAAFTRIKIMFSRIRGLISCQSNTQTIAPTLSPPSSGHVSFAGIDYPLLPLNHQTPLVFQWFERNPDRFGQNEIPIINTQKNPYLNNIINAAIIEKERIIGIFVDGDFSKGQRKALGKLEQNYRNIKVIYNSDLNYSMYDKKLTTIYLENITKLEAQSASERDEVLLNGVKKSLEDVLKNNPEETLISSHNKDKGHLWFDFYRNLFLLKGSDAFLEAGKPGCHHLQPGGGCIYLDADMLLTDKLGTLYLPDGIAIHVSRKDNHVSLENGIIAVNRSEHPALIKGLEIMHSKPYGDPYNDWLSKGLRHYFDGSHIQDYDAFCDFIEFKHENIIMNTSSLTASSWR 348 T 2.1E-05 Glyco_transf_88 pdbhh F Bacteria T 5xhz 2 C,D C,D ARAP1_MOUSE CENTAURIN-DELTA-2,CNT-D2 RPVPMKRHIFR 11 T 21 DUF924 pdbhh F Eukaryota T 5xiu 1 A A RN168_HUMAN HRNF168,RING FINGER PROTEIN 168,RING-TYPE E3 UBIQUITIN TRANSFERASE RNF168 GPGHMEETEINFTQKLIDLEHLLFERHKQEEQDRLLALQLQKEVDKEQM 49 T 7.2 DUF3629 pdbhh F Eukaryota T 5xiv 1 A A A0A247D712_GINBI beta-ginkgotide, beta-gB1 YETGCKRCCYLDEYGCIRCC 20 T 2.5 Antistasin pdbhh F Eukaryota T 5xj0 6 G,H G,H A7XX65_9CAUD gp39 GSHMVEGFVEPYIRLFEAIPDAETELATFYDADLDTLPPRMFLPSGDLYTPPGPVRLEEIKRKRRVRLVKVSIYRFEHVGLGLAARPYAYAYAWQGDNGILHLYHAPVVLEDVPEVLELDEVTYNESYVRLMRAMGHVDAFIDL 144 T 2.5 Abp2 unphh T Viruses T 5xjg 2 B,D B,D NVJ1_YEAST Nucleus-vacuole junction protein 1 NREKDCSSSSEVESQSKCRKESTAEPDSLSRDTRTTSSLKSSTSFPISFKGSIDLKSLNQPSSLLHIQVSPTKSSNLDAQVNTEQAYSQPFRY 93 T 0.06 Trypan_PARP unp F Eukaryota T 5xjm 4 D B ANGT_HUMAN Sar1, Ile8-angiotensin II XRVYIHPI 8 T 3 Ion_trans_N pdbhh F Eukaryota T 5xll 1 A,B A,B PKNI_MYCTU Serine/threonine-protein kinase PknI TAPVPLDGTYRIEIQRSKQTYDYTPTPQPPDVNTWWAFRTSCTPTECLAAATMLDDNDHTQAKTPPVRPFLMQFGEGQWKSRPETVQFPCVGPNGSPSTQATTQLLALRPQPQGDLVGEMVVTVHSNECGQQGAVIRIPAVASRSGDLPPAVTVPDPATIPDTPDTTSTATLTPPTTTAPGPGR 184 T 1.9 DEC-1_N pdb F Bacteria T 5xlm 1 A,B A,B PKNI_MYCTU Serine/threonine-protein kinase PknI RKTNTTATEVARPPTSGSAVPSAPTTTVAVTAPVPLDGTYRIEIQRSKQTYDYTPTPQPPDVNTWWAFRTSCTPTECLAAATMLDDNDHTQAKTPPVRPFLMQFGEGQWKSRPETVQFPCVGPNGSPSTQATTQLLALRPQPQGDLVGEMVVTVHSNECGQQGAVIRIPAVASRSGDLPPAVTVPDPATIPDTPDTTSTATLTPPTTTAPGPGR 214 T 0.24 Acyl-CoA_dh_C unp F Bacteria T 5xln 2 B B SYTC_HUMAN THREONYL-TRNA SYNTHETASE,THRRS GGKKKNKEGSGDGGRAELNPWPEYIYTRLEMYNILKAEHDSILAE 45 T 4.4 CAAP1 pdbhh F Eukaryota T 5xlo 3 H,I N,M L7P7M1_9CAUD Uncharacterized protein AcrF1 MKFIKYLSTAHLNYMNIAVYENGSKIKARVENVVNGKSVGARDFDSTEQLESWFYGLPGSGLGRIENAMNEISRRENP 78 T 0.16 DUF4982 pdb T Viruses T 5xlp 3 F M L7P7M1_9CAUD Uncharacterized protein AcrF1 MKFIKYLSTAHLNYMNIAVYENGSKIKARVENVVNGKSVGARDFDSTEQLESWFYGLPGSGLGRIENAMNEISRRENP 78 T 0.16 DUF4982 pdb T Viruses T 5xm4 1 A A A0A0B8ZWE6_9SPHN SUBTERISIN GPPGDRIEFGVLAQLPG 17 T 0.16 DUF5974 unphh F Bacteria T 5xn3 2 B B NOS2_HUMAN cR8 peptide from NOS2 RGDINNNV 8 T 3.8 DUF6373 pdbhh F Eukaryota T 5xn4 1 A X A0A247D711_LISMN Anti-CRISPR AcrIIA4 MNINDLIREIKNKDYTVKLSGTDSNSITQLIIRVNNDGNEYVISESENESIVEKFISAFKNGWNQEYEDEEEFYNDMQTITLKSELN 87 T 0.033 DUF4930 pdb F Bacteria T 5xnb 1 A,D,G,J,M,P A,D,G,J,M,P Q5ZYC6_LEGPH ICMO PROTEIN EPVEDIVEEEVEGALTIFSKLRIDPNAPPILVADKEVFSEPLLPINETRNQMITIERLAGAKDKYAGTVANELIKDFQIATSYPPEERDVIDVQELTGIIRDLSAKISAEREK 113 T 0.74 RecC_C unppssm F Bacteria T 5xnb 3 C,F,I,L,O,R C,F,I,L,O,R Q5ZS31_LEGPH ICMW PROTEIN MPDLSHEASAKYWFEYLDPMIYRVITFMESVENWTLDGNPELEEAMKQLGQELDDIEKIDLGLLAEEDKFIRIVGNIKSGRGLRLLQAIDTVHPGSASRVLIHAEETSLSSSDPAGFFLKRNIVFERLRLLSRVFCQYRLKLVLRALEGDE 151 T 0.11 DUF2335 unppercent F Bacteria T 5xnm 20 W,XA U,u A0A0K9RHP1_SPIOL Photosystem II luminal extrinsic protein Tn, PsbTn MASITMTASFLGTTVSKQPPTHHLRRGVVMAKAMPETTTTTKEETSSKRRDLVFAVAAAAACSVARIAMAEEPKRGTPEAKKKYAPVCVTMPSARICYK 99 T 0.014 PsbQ pdbpercent F Eukaryota T 5xo2 2 C,D X,Y GB_HHV1K GB GPATPAP 7 T 5.4 DUF765 pdbhh T Viruses F 5xo3 1 A A THAN_PODMA Thanatin GSKKPVPIIYCNRRTGKCQRF 21 T 2.6 YihI pdbhh F Eukaryota T 5xo4 1 A A THAN_PODMA Thanatin GSKKPVPIIYCNRRTGKCQRM 21 T 2.6 YihI pdbhh F Eukaryota T 5xo5 1 A A THAN_PODMA Thanatin GSKKPVPIIACNRRTGKCQRA 21 T 0.14 YihI pdbhh F Eukaryota T 5xo9 1 A,B A,B THAN_PODMA Thanatin GSKKPVPIIYCNRRTGKCQRM 21 T 2.6 YihI pdbhh F Eukaryota T 5xoa 1 A,B A,B THAN_PODMA Thanatin GSKKPVPIIYCNRRTGKCQRF 21 T 2.6 YihI pdbhh F Eukaryota T 5xod 2 B B SKI_HUMAN PROTO-ONCOGENE C-SKI GPGLQKTLEQFHLSSMSSLGGPAAFSASDED 31 T 2.7 DUF2520 pdbhh F Eukaryota T 5xof 2 E,F,G,H O,P,Q,R NOS3_HUMAN Peptide from Nitric oxide synthase, endothelial GPATPAP 7 T 5.4 DUF765 pdbhh F Eukaryota F 5xoj 4 D,E,F E,F,G E7Q297_YEASB Nup42p KPSAFGAPAFGSSAPINVNPPSTTSAFGAPSFGST 35 T 18 DUF2673 pdbhh F T 5xok 1 A,B A,B THAN_PODMA Thanatin GSKKPVPIIACNRRTGKCQRA 21 T 0.14 YihI pdbhh F Eukaryota T 5xol 1 A,B A,B THAN_PODMA Thanatin GSKKPVPIIYCNAATGKCQRM 21 T 2.6 YihI unphh F Eukaryota T 5xoq 2 C,D C,D GLY-PHE-SER-GLY-GLY-ASP-GLY-ILE GFSGGDGI 8 T 2.4 DNA_primase_S pdbhh F F 5xpk 2 B B UBB_HUMAN D-ubiquitin XXXXXXXXXGXXXXXXXXXXXXXXXXXXXXXXXXGXXXXXXXXXXXGXXXXXGXXXXXXXXXXXXXXXXXXXXXGG 76 F F Eukaryota F 5xpm 2 B C MED1_HUMAN ACTIVATOR-RECRUITED COFACTOR 205 KDA COMPONENT,ARC205,MEDIATOR COMPLEX SUBUNIT 1,PEROXISOME PROLIFERATOR-ACTIVATED RECEPTOR-BINDING PROTEIN,PPAR-BINDING PROTEIN,THYROID HORMONE RECEPTOR-ASSOCIATED PROTEIN COMPLEX 220 KDA COMPONENT,TRAP220,THYROID RECEPTOR-INTERACTING PROTEIN 2,TRIP-2,VITAMIN D RECEPTOR-INTERACTING PROTEIN COMPLEX COMPONENT DRIP205,P53 REGULATORY PROTEIN RB18A KNHPMLMNLLKDN 13 T 3.8 Gb3_synth pdbhh F Eukaryota T 5xpn 2 B C MED1_HUMAN ACTIVATOR-RECRUITED COFACTOR 205 KDA COMPONENT,ARC205,MEDIATOR COMPLEX SUBUNIT 1,PEROXISOME PROLIFERATOR-ACTIVATED RECEPTOR-BINDING PROTEIN,PPAR-BINDING PROTEIN,THYROID HORMONE RECEPTOR-ASSOCIATED PROTEIN COMPLEX 220 KDA COMPONENT,TRAP220,THYROID RECEPTOR-INTERACTING PROTEIN 2,TRIP-2,VITAMIN D RECEPTOR-INTERACTING PROTEIN COMPLEX COMPONENT DRIP205,P53 REGULATORY PROTEIN RB18A KNHPMLMNLLKDN 13 T 3.8 Gb3_synth pdbhh F Eukaryota T 5xpo 2 B C MED1_HUMAN ACTIVATOR-RECRUITED COFACTOR 205 KDA COMPONENT,ARC205,MEDIATOR COMPLEX SUBUNIT 1,PEROXISOME PROLIFERATOR-ACTIVATED RECEPTOR-BINDING PROTEIN,PPAR-BINDING PROTEIN,THYROID HORMONE RECEPTOR-ASSOCIATED PROTEIN COMPLEX 220 KDA COMPONENT,TRAP220,THYROID RECEPTOR-INTERACTING PROTEIN 2,TRIP-2,VITAMIN D RECEPTOR-INTERACTING PROTEIN COMPLEX COMPONENT DRIP205,P53 REGULATORY PROTEIN RB18A KNHPMLMNLLKDN 13 T 3.8 Gb3_synth pdbhh F Eukaryota T 5xpp 2 B C MED1_HUMAN ACTIVATOR-RECRUITED COFACTOR 205 KDA COMPONENT,ARC205,MEDIATOR COMPLEX SUBUNIT 1,PEROXISOME PROLIFERATOR-ACTIVATED RECEPTOR-BINDING PROTEIN,PPAR-BINDING PROTEIN,THYROID HORMONE RECEPTOR-ASSOCIATED PROTEIN COMPLEX 220 KDA COMPONENT,TRAP220,THYROID RECEPTOR-INTERACTING PROTEIN 2,TRIP-2,VITAMIN D RECEPTOR-INTERACTING PROTEIN COMPLEX COMPONENT DRIP205,P53 REGULATORY PROTEIN RB18A KNHPMLMNLLKDN 13 T 3.8 Gb3_synth pdbhh F Eukaryota T 5xpt 2 B B CHAP1_HUMAN ZINC FINGER PROTEIN 828 MSNPSASSGPWKPAKPAPSVS 21 T 5.2 GvpL_GvpF pdbhh F Eukaryota T 5xpu 2 B B CHAP1_HUMAN ZINC FINGER PROTEIN 828 MSNPSASSGPWKPAKPAPSVS 21 T 5.2 GvpL_GvpF pdbhh F Eukaryota T 5xqz 2 B,D C,D ST38L_HUMAN NDR2 PROTEIN KINASE,NUCLEAR DBF2-RELATED KINASE 2 SSGHMKLTLENFYSNLILQHEERETRQKKLEVAMEEEGLADEEKKLRRSQHARKETEFLRLKRTRLGL 68 T 0.12 Anoct_dimer pdbpssm F Eukaryota T 5xrr 1 A A FUS_HUMAN 75 KDA DNA-PAIRING PROTEIN,ONCOGENE FUS,ONCOGENE TLS,POMP75,TRANSLOCATED IN LIPOSARCOMA PROTEIN SYSSYG 6 T 1.5 Kinin pdbhh F Eukaryota F 5xs3 3 C C P VRSRRCLRL 9 T 5.3 ISAV_HA pdbhh F F 5xsg 1 A A FUS_HUMAN SER-TYR-SER-GLY-TYR-SER SYSGYS 6 T 4.4 DUF6156 pdbhh F Eukaryota F 5xsj 2 B L A6LW08_CLOB8 Signal transduction histidine kinase, LytS MGSSHHHHHHSQGSMLNNMLITNEIKQHVDSSLDNFNQYILNGTPSKKESYNNEVILAKQKIGNLKKNSDDVNQYILRDLDNTLDSYIESSKNTISAYENKEGYVFYYDDFVAAKNIASYCDAYASTLMQNFLEANSIAYKELNRNSS 148 T 0.00042 HBM pdbpercent F Bacteria T 5xtc 1 A Q NDUS2_HUMAN COMPLEX I-49KD,CI-49KD,NADH-UBIQUINONE OXIDOREDUCTASE 49 KDA SUBUNIT VRQWQPDVEWAQQFGGAVMYPSKETAHWKPPPWNDVDPPKDTIVKN 46 T 36 CCSAP pdbhh F Eukaryota T 5xtj 1 A,B A,B A0A2U8ZTY7_RHIZD ENDO BETA-1,4-MANNANASE ADRGTETVPGLGQRKQQILNSGGGVWDLAIAMLETKNLGTDYVYGDGKTYDSANFGIFKQNWFMLRTSTSQFKGQTTNQWNNGAVLNSNLQQDIKARQESQNYYGPDKWFAGHRNGESGLSNPYTQDITNYKDAVNWIHDQLASDPKYLSDDTRFWVDVTAI 162 T 0.036 Lys pdbpercent F Eukaryota T 5xtt 1 A,B A,B A0A2U8ZTY7_RHIZD beta-1,4-mannanase ADRGTETVPGLGQRKQQILNSGGGVWDLAIAMLETKNLGTDYVYGDGKTYDSANFGIFKQNWFMLRTSTSQFKGQTTNQWNNGAVLNSNLQQDIKARQESQNYYGPDKWFAGHRNGESGLSNPYTQDITNYKDAVNWIHDQLASDPKYLSDDTRFWVDVTAI 162 T 0.036 Lys pdbpercent F Eukaryota T 5xu5 1 A,B B,A A0A2U8ZTY7_RHIZD endo-1,4-beta-mannanase ADRGTETVPGLGQRKQQILNSGGGVWDLAIAMLETKNLGTDYVYGDGKTYDSANFGIFKQNWFMLRTSTSQFKGQTTNQWNNGAVLNSNLQQDIKARQESQNYYGPDKWFAGHRNGESGLSNPYTQDITNYKDAVNWIHDQLASDPKYLSDDTRFWVDVTAI 162 T 0.036 Lys pdbpercent F Eukaryota T 5xug 1 A,B B,A A0A2U8ZTY7_RHIZD endo-1,4-beta-mannanase ADRGTETVPGLGQRKQQILNSGGGVWDLAIAMLETKNLGTDYVYGDGKTYDSANFGIFKQNWFMLRTSTSQFKGQTTNQWNNGAVLNSNLQQDIKARQESQNYYGPDKWFAGHRNGESGLSNPYTQDITNYKDAVNWIHDQLASDPKYLSDDTRFWVDVTAI 162 T 0.036 Lys pdbpercent F Eukaryota T 5xul 1 A,B B,A A0A2U8ZTY7_RHIZD endo-1,4-beta-mannanase ADRGTETVPGLGQRKQQILNSGGGVWDLAIAMLETKNLGTDYVYGDGKTYDSANFGIFKQNWFMLRTSTSQFKGQTTNQWNNGAVLNSNLQQDIKARQESQNYYGPDKWFAGHRNGESGLSNPYTQDITNYKDAVNWIHDQLASDPKYLSDDTRFWVDVTAI 162 T 0.036 Lys pdbpercent F Eukaryota T 5xup 2 C,D C,D TERB1_HUMAN COILED-COIL DOMAIN-CONTAINING PROTEIN 79 KILLTPRRRQRL 12 T 0.86 WSK pdbhh F Eukaryota T 5xuq 2 B C MED1_HUMAN ACTIVATOR-RECRUITED COFACTOR 205 KDA COMPONENT,ARC205,MEDIATOR COMPLEX SUBUNIT 1,PEROXISOME PROLIFERATOR-ACTIVATED RECEPTOR-BINDING PROTEIN,PPAR-BINDING PROTEIN,THYROID HORMONE RECEPTOR-ASSOCIATED PROTEIN COMPLEX 220 KDA COMPONENT,TRAP220,THYROID RECEPTOR-INTERACTING PROTEIN 2,TRIP-2,VITAMIN D RECEPTOR-INTERACTING PROTEIN COMPLEX COMPONENT DRIP205,P53 REGULATORY PROTEIN RB18A KNHPMLMNLLKDN 13 T 3.8 Gb3_synth pdbhh F Eukaryota T 5xv8 1 A A UVSSA_HUMAN UV-stimulated scaffold protein A GSMRRRTEALGDAEEDEDDEDFVEVPEKEGYEPHIPDHLRPEYGLEAA 48 T 15 AgrD pdbhh F Eukaryota T 5xvw 2 C,F D,F RNG1A_ARATH RING 1A ILAWGRGGTRSNTR 14 T 0.25 PetN pdbhh F Eukaryota T 5xw1 3 C C Acetylated-Pro-Arg-Asn Inhibitor XPRN 4 T 170 MPC pdbhh F F 5xw5 2 C C SWI6_YEAST CELL-CYCLE BOX FACTOR SUBUNIT SWI6,MBF SUBUNIT P90,TRANS-ACTING ACTIVATOR OF HO ENDONUCLEASE GENE HRELGSPLKK 10 T 15 DUF4416 pdbhh F Eukaryota T 5xw8 3 C C Acetylated-Pro-Arg-Asn Inhibitor XPRN 4 T 170 MPC pdbhh F F 5xw9 3 C C Acetylated-Pro-Arg-Tyr Inhibitor XPRY 4 T 110 Hva1_TUDOR pdbhh F F 5xwa 3 C C Acetylated-Pro-Arg-Tyr Inhibitor XPRY 4 T 110 Hva1_TUDOR pdbhh F F 5xwe 1 A,B A,B 3S11H_OPHHA WTX DE-1 HOMOLOG 1 MKPVLLTLVVVTIVCLDLGYTRICLKQEPFQPETTTTCPEGEDACYNLFWSDHSEIKIEMGCGCPKTEPYTNLYCCKIDSCNK 83 T 0.021 Endomucin pdb F Eukaryota T 5xwj 2 B,D C,D Acetylated-THR-ARG-GLU Inhibitor XTRE 4 T 800 ArAE_1_C pdbhh F F 5xwl 2 B,D C,D Acetylated-THR-ARG-GLU Inhibitor XTRE 4 T 800 ArAE_1_C pdbhh F F 5xwp 1 A,F A,B CS13A_LEPBD Uncharacterized protein SMKVTKVGGISHKKYTSEGRLVKSESEENRTDERLSALLNMRLDMYIKNPSSTETKENQKRIGKLKKFFSNKMVYLKDNTLSLKNGKKENIDREYSETDILESDVRDKKNFAVLKKIYLNENVNSEELEVFRNDIKKKLNKINSLKYSFEKNKANYQKINENNIEKVEGKSKRNIIYDYYRESAKRDAYVSNVKEAFDKLYKEEDIAKLVLEIENLTKLEKYKIREFYHEIIGRKNDKENFAKIIYEEIQNVNNMKELIEKVPDMSELKKSQVFYKYYLDKEELNDKNIKYAFCHFVEIEMSQLLKNYVYKRLSNISNDKIKRIFEYQNLKKLIENKLLNKLDTYVRNCGKYNYYLQDGEIATSDFIARNRQNEAFLRNIIGVSSVAYFSLRNILETENENDITGRMRGKTVKNNKGEEKYVSGEVDKIYNENKKNEVKENLKMFYSYDFNMDNKNEIEDFFANIDEAISSIRHGIVHFNLELEGKDIFAFKNIAPSEISKKMFQNEINEKKLKLKIFRQLNSANVFRYLEKYKILNYLKRTRFEFVNKNIPFVPSFTKLYSRIDDLKNSLGIYWKTPKTNDDNKTKEIIDAQIYLLKNIYYGEFLNYFMSNNGNFFEISKEIIELNKNDKRNLKTGFYKLQKFEDIQEKIPKEYLANIQSLYMINAGNQDEEEKDTYIDFIQKIFLKGFMTYLANNGRLSLIYIGSDEETNTSLAEKKQEFDKFLKKYEQNNNIKIPYEINEFLREIKLGNILKYTERLNMFYLILKLLNHKELTNLKGSLEKYQSANKEEAFSDQLELINLLNLDNNRVTEDFELEADEIGKFLDFNGNKVKDNKELKKFDTNKIYFDGENIIKHRAFYNIKKYGMLNLLEKIADKAGYKISIEELKKYSNKKNEIEKNHKMQENLHRKYARPRKDEKFTDEDYESYKQAIENIEEYTHLKNKVEFNELNLLQGLLLRILHRLVGYTSIWERDLRFRLKGEFPENQYIEEIFNFENKKNVKYKGGQIVEKYIKFYKELHQNDEVKINKYSSANIKVLKQEKKDLYIANYIAAFNYIPHAEISLLEVLENLRKLLSYDRKLKNAVMKSVVDILKEYGFVATFKIGADKKIGIQTLESEKIVHLKNLKKKKLMTDRNSEELCKLVKIMFEYKMEEKKSEN 1160 T 0.66 DUF2316 unppercent F Bacteria T 5xwr 2 C,D C,D SALL4_HUMAN MET-SER-ARG-ARG-LYS-GLN-ALA-LYS-PRO-GLN-HIS-ILE MSRRKQAKPQHI 12 T 160 Loricrin pdbhh F Eukaryota T 5xwy 1 A A CS13A_LEPBD A type VI-A CRISPR-Cas RNA-guided RNA ribonuclease, Cas13a MKVTKVGGISHKKYTSEGRLVKSESEENRTDERLSALLNMRLDMYIKNPSSTETKENQKRIGKLKKFFSNKMVYLKDNTLSLKNGKKENIDREYSETDILESDVRDKKNFAVLKKIYLNENVNSEELEVFRNDIKKKLNKINSLKYSFEKNKANYQKINENNIEKVEGKSKRNIIYDYYRESAKRDAYVSNVKEAFDKLYKEEDIAKLVLEIENLTKLEKYKIREFYHEIIGRKNDKENFAKIIYEEIQNVNNMKELIEKVPDMSELKKSQVFYKYYLDKEELNDKNIKYAFCHFVEIEMSQLLKNYVYKRLSNISNDKIKRIFEYQNLKKLIENKLLNKLDTYVRNCGKYNYYLQDGEIATSDFIARNRQNEAFLRNIIGVSSVAYFSLRNILETENENDITGRMRGKTVKNNKGEEKYVSGEVDKIYNENKKNEVKENLKMFYSYDFNMDNKNEIEDFFANIDEAISSIRHGIVHFNLELEGKDIFAFKNIAPSEISKKMFQNEINEKKLKLKIFRQLNSANVFRYLEKYKILNYLKRTRFEFVNKNIPFVPSFTKLYSRIDDLKNSLGIYWKTPKTNDDNKTKEIIDAQIYLLKNIYYGEFLNYFMSNNGNFFEISKEIIELNKNDKRNLKTGFYKLQKFEDIQEKIPKEYLANIQSLYMINAGNQDEEEKDTYIDFIQKIFLKGFMTYLANNGRLSLIYIGSDEETNTSLAEKKQEFDKFLKKYEQNNNIKIPYEINEFLREIKLGNILKYTERLNMFYLILKLLNHKELTNLKGSLEKYQSANKEEAFSDQLELINLLNLDNNRVTEDFELEADEIGKFLDFNGNKVKDNKELKKFDTNKIYFDGENIIKHRAFYNIKKYGMLNLLEKIADKAGYKISIEELKKYSNKKNEIEKNHKMQENLHRKYARPRKDEKFTDEDYESYKQAIENIEEYTHLKNKVEFNELNLLQGLLLRILHRLVGYTSIWERDLRFRLKGEFPENQYIEEIFNFENKKNVKYKGGQIVEKYIKFYKELHQNDEVKINKYSSANIKVLKQEKKDLYIANYIAAFNYIPHAEISLLEVLENLRKLLSYDRKLKNAVMKSVVDILKEYGFVATFKIGADKKIGIQTLESEKIVHLKNLKKKKLMTDRNSEELCKLVKIMFEYKMEEKKSEN 1159 T 0.66 DUF2316 unppercent F Bacteria T 5xxa 1 A,B A,B A0A0A1NDE2_RHIZD endo-1,4-beta-mannanase ADRGTETVPGLGQRKQQILNSGGGVWDLAIAMLETKNLGTDYVYGDGKTYDSANFGIFKQNWFMLRTSTSQFKGQTTNQWNNGAVLNSNLQQDIKARQESQNYYGPDKWFAGHRNGESGLSNPYTQDITNYKDAVNWIHDQLASDPKYLSDDTRFWVDVTAI 162 T 0.036 Lys pdbpercent F Eukaryota T 5xxe 1 A,C A,B POZ1_SCHPO POT1-ASSOCIATED PROTEIN POZ1 GSNEKIRSQSVLNTLETFFIKENHYDMQREESSIVNACLRYLGYSKSMCHEKMPIFMDIAFIEYCFNLSLDPSSFQNLPITQTQPDSQQILWEYSLISNALERLENIELERQNCMREDGLVKYTNELLLNKETLNNEALKLYSCAKAGICRWMAFHFLEQEPIDHINFTKFLQDWGSHNEKEMEALQRLSKHKIRKRLIYVSQHKKKMPWSKFNSVLSRYIQCTKLQLEVFCDYDFKQREIVKMLTSNIN 250 T 0.14 DUF5896 unppssm F Eukaryota T 5xxe 2 B,D C,D TPZ1_SCHPO MEIOTICALLY UP-REGULATED GENE 169 PROTEIN EACEMCRLGLPHGSFFELLRDWKKIEEFRNKS 32 T 0.7 RPAP3_C pdbhh F Eukaryota T 5xxf 1 A,D A,B POZ1_SCHPO POT1-ASSOCIATED PROTEIN POZ1 GSNEKIRSQSVLNTLETFFIKENHYDMQREESSIVNACLRYLGYSKSMCHEKMPIFMDIAFIEYCFNLSLDPSSFQNLPITQTQPDSQQILWEYSLISNALERLENIELERQNCMREDGLVKYTNELLLNKETLNNEALKLYSCAKAGICRWMAFHFLEQEPIDHINFTKFLQDWGSHNEKEMEALQRLSKHKIRKRLIYVSQHKKKMPWSKFNSVLSRYIQCTKLQLEVFCDYDFKQREIVKMLTSN 248 T 0.14 DUF5896 unppssm F Eukaryota T 5xxf 2 B,E C,D TPZ1_SCHPO MEIOTICALLY UP-REGULATED GENE 169 PROTEIN ACEMCRLGLPHGSFFELLRDWKKIEEFRNKS 31 T 0.64 RPAP3_C pdbhh F Eukaryota T 5xxf 3 C,F E,F Rap1 NSDNIFVKPGEDLEIPL 17 T 0.27 DUF3983 pdbhh F T 5xxk 2 C,D C,D Hydrocarbon stapled peptide THC-SER-PHE-0EH-GLU-TYR-6CW-ALA-LEU-LEU-MK8-NH2 XSFXEYXALLXX 12 T 0.54 P53_TAD pdbhh F T 5xxq 2 C,D C,D ZN827_HUMAN Zinc finger protein 827 MPRRKQEQPKRLPS 14 T 21 Co_AT_N pdbhh F Eukaryota T 5xy9 2 C,D C,D STK26_HUMAN MST3 AND SOK1-RELATED KINASE,MAMMALIAN STE20-LIKE PROTEIN KINASE 4,STE20-LIKE KINASE MST4,SERINE/THREONINE-PROTEIN KINASE MASK TSRENNTHPEWS 12 T 3.7 DUF2811 pdbhh F Eukaryota T 5xyf 2 B C TERF2_HUMAN TTAGGG REPEAT-BINDING FACTOR 2,TELOMERIC DNA-BINDING PROTEIN SLQPKNKRMTISRLVLEE 18 T 10 VasL pdbhh F Eukaryota T 5xyf 3 C B ACD_HUMAN POT1 AND TIN2-INTERACTING PROTEIN ADPRSSLCARVQAARLPPQLMAWALHFLMDAQPGSEPTPM 40 T 15 DUF6525 pdbhh F Eukaryota T 5xyk 1 A E E8XCX6_SALT4 Putative cytoplasmic protein MARFNAAFTRIKIMFSRIRGLISCQSNTQTIAPTLSPPSSGHVSFAGIDYPLLPLNHQTPLVFQWFERNPDRFGQNEIPIINTQKNPYLNNIINAAIIEKERIIGIFVDGDFSKGQRKALGKLEQNYRNIKVIYNSDLNYSMYDKKLTTIYLENITKLEAQSASERDEVLLNGVKKSLEDVLKNNPEETLISSHNKDKGHLWFDFYRNLFLLKGSDAFLEAGKPGCHHLQPGGGCIYLDADMLLTDKLGTLYLPDGIAIHVSRKDNHVSLENGIIAVNRSEHPALIKGLEIMHSKPYGDPYNDWLSKGLRHYFDGSHIQDYDAFCDFIEFKHENIIMNTSSLTASSWR 348 T 2.1E-05 Glyco_transf_88 pdbhh F Bacteria T 5xym 31 EA a A0QTP4_MYCS2 Uncharacterized protein bL37 MAKRGRKKRDRKHSKANHGKRPNA 24 T 0.18 DUF6254 pdb F Bacteria T 5xyn 3 C C SHU1_YEAST Suppressor of HU sensitivity involved in recombination protein 1 MQFEERLQQLVESDWSLDQSSPNVLVIVLGDTARKYVELGGLKEHVTTNTVAGHVASRERVSVVFLGRVKYLYMYLTRMQAQANGPQYSNVLVYGLWDLTATQDGPQQLRLLSLVLRQCLSLPSKVEFYPEPPSSSVPARLLRFWDHIIR 150 T 28 RepA1_leader pdbhh F Eukaryota T 5xyn 4 D D SHU2_YEAST Suppressor of hydroxyurea sensitivity protein 2 MSKDVIEYSKLFAKLVNTNDDTKLDDTIASFLYYMFPRELFIRAISLLESSDMFIYILDRVHNKEGNEHTSLIDVLVDEFYKGSSNSLLEYRLIVKDTNDGAPPILVDIAHWFCSCEEFCKYFHEALEKTDEKEELHDVLINEVDDHLQFSDDRFAQLDPHSLSKQWYFKFDKVCCSHLLAFSILLRSSINVLKFFTVNSNKVFVIAIDNIDEWLNLHINIVE 223 T 0.5 SWIM pdbpssm F Eukaryota T 5xyv 2 C,D C,D DEL_DROME Protein deadlock MEKLDKIRMSQKLSCWQHILTTLGTSSKTEQEWNTFFKGFLESWRKPYCIQTSCDPSIPL 60 T 0.068 Herpes_IE68 pdbpssm F Eukaryota T 5xyw 2 C,D C,D B4Q3Z0_DROSI GD21652 MENLAKIRMSQKLACWQQILTTLGTSSMSEQEWNTFFRGFLESWQNPYCIQTSCDPSIPL 60 T 0.21 DUF4543 pdbpssm F Eukaryota T 5xzf 2 B C MED1_HUMAN ACTIVATOR-RECRUITED COFACTOR 205 KDA COMPONENT,ARC205,MEDIATOR COMPLEX SUBUNIT 1,PEROXISOME PROLIFERATOR-ACTIVATED RECEPTOR-BINDING PROTEIN,PPAR-BINDING PROTEIN,THYROID HORMONE RECEPTOR-ASSOCIATED PROTEIN COMPLEX 220 KDA COMPONENT,TRAP220,THYROID RECEPTOR-INTERACTING PROTEIN 2,TRIP-2,VITAMIN D RECEPTOR-INTERACTING PROTEIN COMPLEX COMPONENT DRIP205,P53 REGULATORY PROTEIN RB18A KNHPMLMNLLKDN 13 T 3.8 Gb3_synth pdbhh F Eukaryota T 5xzh 2 B C MED1_HUMAN ACTIVATOR-RECRUITED COFACTOR 205 KDA COMPONENT,ARC205,MEDIATOR COMPLEX SUBUNIT 1,PEROXISOME PROLIFERATOR-ACTIVATED RECEPTOR-BINDING PROTEIN,PPAR-BINDING PROTEIN,THYROID HORMONE RECEPTOR-ASSOCIATED PROTEIN COMPLEX 220 KDA COMPONENT,TRAP220,THYROID RECEPTOR-INTERACTING PROTEIN 2,TRIP-2,VITAMIN D RECEPTOR-INTERACTING PROTEIN COMPLEX COMPONENT DRIP205,P53 REGULATORY PROTEIN RB18A KNHPMLMNLLKDN 13 T 3.8 Gb3_synth pdbhh F Eukaryota T 5xzk 1 A,B,C A,B,C A0A384E107_9AGAR lectin (PhoSL) APVPVTKLVCDGDTYKCTAYLDYGDGKWVAQWDTAVFHTT 40 T 0.069 C2-set pdbhh F Eukaryota T 5xzx 2 B B RANB3_HUMAN RANBP3-B GSSPEGGEDSDREDGNYCPPVKRERTSSLT 30 T 12 Fib_alpha pdbhh F Eukaryota T 5y0h 1 A A N6 GFAWNVCVYRNGVRVCHRRAN 21 T 1.5 PilI pdbhh F T 5y0i 1 A A NZ17074(N1) GFCWNVCVYRNGVRVCHRRCN 21 T 1 PilI pdbhh F T 5y0j 1 A A N2 AFCWNVCVYRNAVRVCHRRCN 21 T 4.2 DUF2760 pdbhh F T 5y14 2 D,E,F F,E,D LP-40 YTSLIHSLIEESQNQQEKNEQELLELDK 28 T 0.00015 GP41 pdb F T 5y18 2 B B ATRX_HUMAN ATP-DEPENDENT HELICASE ATRX,X-LINKED HELICASE II,X-LINKED NUCLEAR PROTEIN,XNP,ZNF-HX SENRIAKKMLLEEIKANLSSDED 23 T 12 DUF6481 pdbhh F Eukaryota T 5y1u 2 C,D C,D AEBP2_HUMAN ADIPOCYTE ENHANCER-BINDING PROTEIN 2,AE-BINDING PROTEIN 2 KRRKLKNKRRRS 12 T 1.2 zf-C2H2_8 unppercent F Eukaryota F 5y21 2 C,D C,D RNG1A_ARATH RING 1A EVRQKKRRKRSTSR 14 T 3.3 CDC45 unppssm F Eukaryota T 5y24 2 B,D C,D GLY-MET-PRO-ARG-GLY-ALA GMPRGA 6 T 2.4 BCD pdbhh F F 5y28 2 D G UNK-UNK-UNK-UNK AAAA 4 T 900 Cyclin_C pdbhh F F 5y2d 2 B B UNK-UNK-UNK AAA 3 T 1200 RNase_HII pdbhh F F 5y2d 3 C C UNK-UNK-UNK-UNK-UNK AAAAA 5 T 440 HCV_NS4a pdbhh F F 5y2d 4 D D UNK-UNK-K-UNK-UNK-UNK-UNK-UNK-UNK-UNK AAKAAAAAAA 10 T 250 Mastoparan pdbhh F F 5y2d 5 E E UNK-UNK-UNK-UNK-UNK-UNK-UNK-UNK AAAAAAAA 8 T 280 Androgen_recep pdbhh F F 5y2d 6 F F UNK-UNK-UNK-UNK-UNK-UNK-UNK-UNK-UNK-UNK-UNK AAAAAAAAAAA 11 T 240 Ribosomal_L12_N pdbhh F F 5y3d 2 G,H G,H viral protein genome-linked (VPg) XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 35 F F F 5y3r 3 C K PRKDC-Helix AAAAAAAAAAAAAAA 15 T 200 Campylo_MOMP pdbhh F F 5y42 1 A,D A,D SGSL_TRIAN Lectin ANLRLSEANSGTYKTFIGRVREELGSETYRLYGIPVLKHSL 41 T 0.00035 RIP pdbhh F Eukaryota T 5y45 1 A,B,C,D,E,F A,B,C,D,E,F collagen-like peptide GPPGPPGPPGPPALPGPPGPPGPPGPP 27 T 0.00081 Collagen pdb F F 5y46 1 A,B,C A,B,C collagen-like peptide GPPGPPGPPGPPALPGPPGPPGPPGPP 27 T 0.00081 Collagen pdb F F 5y53 2 C,F D,F DRIP1_ARATH PEPTIDE FROM E3 UBIQUITIN PROTEIN LIGASE DRIP1 ETVTPKRMRTTQRKRSAT 18 T 36 Ribosomal_L41 pdbhh F Eukaryota T 5y59 2 B C Sir4p NSKLLSLLRSKT 12 T 8.8 SRC-1 pdbhh F T 5y5w 2 E,F,G E,F,G Histone peptide H4K20(me3) KRHRKVLDN 9 T 15 Phage_X pdbhh F T 5y6p 2 C,D C1,D1 LRC4 MAAAFTAPVNLKGSSLTSNTLPAVCSRPAPLTLTPRAQADLPPPGIPSGQDPLDNAPLRHYVPRPVETYEDRGFATILPRTWEGETNTIGAGDIEPVTKEEVEESRKVPVDAASTGAFVEYARMMKEERAQALADQARRNSAPTSGRPTCGETEGTEFVSNARPILVDGVKVVEYWGVPNGPVPRLFGGPGE 192 T 0.026 DUF4786 pdbpssm F T 5y6p 3 E,F E1,F1 LRC5 MAFVSATPVSQAVRPAPALGAQLAASPLRPEIAHASNSSTPRMGYGAYSYITDKTKGHVNQYYVDKFRIASDWTKGTPKTQADAVLGRTFKGAVLVPTEGIPQEFDPAIAPRDNTVDPDPRIAESEGEVYPWDINYFDPQFLPSAYSDVNDPETVDSSFADFRSSMWESRRESLTAQDFGAVARVQRIKNGLDEKYLMTLDGMLDARYARFQKIAEPAVLSPTGTPMTEIPGTPYLGSVGAMDFIAQEEESVAFWKSGPSTTPVNYKRPSGAQTPNLPYNTAAPVAAINEAQEAQKGQMQLSAGDDE 307 T 14 DUF5953 pdbhh F T 5y6p 12 AC,AE,AO,AY,BY,CAA,CG,CI,CT,CV,DAA,DP,DR,VN A2,a3,b9,dw,dx,ey,34,Y5,aY,bY,ez,U8,Z9,b8 LR_gamma4 MDSPAFAVNGMFSAVKVGNSSFTENKVTAVSKTAPTASVRMVVDPFQRKFQSIGKIGIDYSRPKKLATYKRVGYSVGLDFPNAVSMAGHYSLTDCTRAGGAAKILMKYDEYCAKGMLQVYKRSAVSTGVYTTKCTEATQPGVAYDVRVFNRTAAFRQAQKPVNVRLGEQYAARKACVTLAHNCSREEAQFKNMPMSCATFLAGKMEAMGTCYRTVRPSSKAEDYMAGSVRMQVYQKGNASGVYPVGGCEDGHAKGDADLRRVIALASEYRAAQQGAAAVTGAQYASSKMAIQLYGHSCNHEEGQFCDYPAVAAAMCRY 318 T 0.29 ACC_epsilon pdbpercent F T 5y6p 14 BO,CC,CE,CO,CP,CR,DC,DE,DG,DI,DO,DT,DV,WN,XN,YN c9,C2,c3,d9,T8,Y9,D2,d3,44,Z5,e9,aZ,bZ,c8,d8,e8 LR_gamma5 MYAFAPNTPFTASKAVVGKTSFTSPLPAQSESRPTAAPTMVLRTVLRSPVPSGAATVYGYVGRGNISVILAKADEYMAKSVRKQYLAKSNPYGTFGVQCTEGSVKFAADFSRIRALNAEFRAKLGSASKKTFDMYENRKNAISNSHGCHHEETQFVGYKGVSSMYNVSKSEASGSCSRYASPETVVEAAMLRFMDIQVKMAANPTGVYNISCNEGAARGQAEDVRVAALNAAFRQGQKSLGKLLDEKYQQKKQGYSFAHGCNYEEGLINKYPALGAAFRSKSYGY 285 T 0.052 rRNA_methylase pdbpssm F T 5y6p 22 CBA,DBA eY,eZ LR_gamma7 MTAPAFTAPISLTTPHAFSARGLRPATTSSAAPTAVPTPRMSAADKYMARTVTRTAKSAAAGFGVYTPQCTEASGGANTAEATRLAVLAADFRLRQAPLGARFADLYETRRAAVIQACNSSAEEGYATSFPSRAAASVAGRAEGLRACSRYFPQKPPVEEYMAACVDRQYKQMRVHGGVYSTLCADGRSAGDADTARIAALGARFRAQHLSKSQQTQMRYNAMSEARMLARGLCTYEEAQFNAYPKMAGMMRYGTGVYAASVRGPELVVGNKSMTVAEQVNGVNAESYWPSSKVRPAVARGTSPWMGLGVVKSYAAMSEAAMAYGIEQQSKPYVPQKYEGWSSGWKPKSSLM 352 T 0.39 DUF2477 pdbpercent F T 5y6p 23 CCA,CFA,DCA,DFA ly,hY,lz,hZ LR_gamma8 MEPAFVSSFAPKPVITTSLTASSPLSVTARKNAVSTPTMAAYSLDKYAQMSGANAVDTSGASPAASSTWWVAYRDSLKERFNPFRAPANPEVDVGKSKEYFFAQTAYGRILNMVNASRFGKGGDPDELVPPPGAQPADQYMANCIVKQYKAMATPTGVYTTQCTEGVVRGQAEEARNAALSAAFRMKQRSSAQKFGDFCESRRMAVIGAHGCSYEESLLTKFPAAARAYTTASSEAKGNCVRYADGTSPAETYMAACVDKQMKFRSVPMGVYDVLCSDGNTKGVAEYKRVSAMSVRFRSNQMSTLYKMQAKYNNAAYARNYFGHGCSYEENLFNKYPAVSASMRPSTARY 350 T 1.9 rRNA_methylase pdbpercent F T 5y6p 24 CEA,DEA gy,gz LR_gamma6 MAFITSFTPRNLASRSEFTSTSVSTRRPTLARNTIRALFTPPVDEFMASSVQSQYIQKACPSGVPPIQCIEGVTSDQPYAARTLKRQTELRYHQLPVAVKLRKAYETRRAAVVATHGCSHEEGRVLSYPRMASAMLIGQAEASKACSRYFVPNGPAEKHMLQAVENRYMAAVNGSGVFSGACTDGQTRYEAYLMQLRGKSAEFRAKQYSTFEKESMKYAARKQALIQKGHDCNAEEVIFSNYPIVASAMRPTFGYYTPIVKNPGIGSVINIMRPVWDKNSSISSPATLVGVGGFVQP 297 T 0.095 PRMT5 pdb F T 5y7d 1 A A CX04A_HUMAN ENDOTHELIAL-OVEREXPRESSED LIPOPOLYSACCHARIDE-ASSOCIATED FACTOR 1 GMKFGCLSFRQPYAGFVLNGIKTVETRWRPLLSSQRNCTIAVHIAHRDWEGDAWRELLVERLGMTPAQIQTLLRKGEKFGRGVIAGLVDIGETLQCPEDLTPDEVVELENQAVLTNLKQKYLTVISNPRWLLEPIPRKGGKDVFQVDIPEHLIPLGHEVLE 161 T 0.00028 ASCH pdbpercent F Eukaryota T 5y7w 2 C,D C,D YL-2 peptide LLPPTEQDLXKLXXYX 16 T 0.049 STAT6_C pdb F T 5y81 3 C D Eaf1-disorder domain XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 500 F F F 5y97 1 A A SGSL_TRIAN SGSL ANLRLSEANSGTYKTFIGRVREELGSETYRLYGIPVLKHSL 41 T 0.00035 RIP pdbhh F Eukaryota T 5yay 2 B B KI21A_MOUSE Kinesin-like protein KIF21A LMKLCGEVKPKNKARRRTTTQMELLYAD 28 T 4.2 Imm63 pdbhh F Eukaryota T 5yb2 3 D,E,F,J,K,L,N H,G,I,K,J,L,P LP-11 ELTWEEWEKKIEEYTKKIEEILK 23 T 0.069 GP41 pdbhh F T 5yb3 2 D,E,F H,G,I HP23L ELTWEEWEKKIEEYTKKIEEILK 23 T 0.069 GP41 pdbhh F T 5yb4 2 D,E,F H,G,I HP23L ELTWEEWEKKIEEYTKKIEEILK 23 T 0.069 GP41 pdbhh F T 5ybe 2 B B KI21A_MOUSE KIF21A PKNKARRRTTTQMELLYAD 19 T 2.3 Imm63 pdbhh F Eukaryota T 5ybu 2 B B KI21A_HUMAN KINESIN-LIKE PROTEIN KIF2,RENAL CARCINOMA ANTIGEN NY-REN-62 EVKPKNKARRRTTTQMELLYAD 22 T 2.9 Imm63 pdbhh F Eukaryota T 5ybv 3 C,D C,D KI21A_HUMAN KINESIN-LIKE PROTEIN KIF2,RENAL CARCINOMA ANTIGEN NY-REN-62 EVKPKNKARRRTTTQMELLYAD 22 T 2.9 Imm63 pdbhh F Eukaryota T 5yc0 2 D,E,F,J,K,L Q,W,P,H,I,G LP-46 WQEWEQKITALLEQAQIQQEKNEYELQKLDK 31 T 0.00047 GP41 pdbhh F T 5yc1 2 G,H,I G,I,K GPIb peptide RLRAR 5 T 120 FIN1 pdbhh F F 5yca 2 B C LEM2_SCHPO LEM DOMAIN PROTEIN 2 GSAEEDDELFQNYVLQQTRK 20 T 0.52 TFIIA unppercent F Eukaryota T 5yco 2 E,F E,F UHRF2_HUMAN E3 ubiquitin-protein ligase UHRF2 NEILQTLLDLFFPGYSK 17 T 0.0058 zf-RING_6 unphh F Eukaryota T 5yd3 2 B,D,F,H B,D,F,H CCR5_HUMAN Epitope peptide DINYYTSEP 9 T 5.5 Pico_P1A pdbhh F Eukaryota T 5yd4 2 B,D,F,H B,D,F,H CCR5_HUMAN Epitope peptide (mutation T6A) DINYYASEP 9 T 3.6 DEC-1_C pdbhh F Eukaryota T 5yd5 2 B,D B,D CCR5_HUMAN Peptide epitope (mutation N3A) DIAYYTSEP 9 T 4.3 DUF3417 pdbhh F Eukaryota T 5yd8 2 D,E,F W,U,V ZRAB3_HUMAN ANNEALING HELICASE 2,AH2,ZINC FINGER RAN-BINDING DOMAIN-CONTAINING PROTEIN 3 GSDITRFLVKK 11 T 0.15 DUF3460 pdbhh F Eukaryota T 5ydt 4 D UC Unassigned helices XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 34 F F F 5ye3 3 C C H4_HUMAN di-acetylated histone H4 SGRGXGGXGLGK 12 T 11 Shadoo unppercent F Eukaryota T 5ye4 3 E,F E,F H4_HUMAN di-acetylated histone H4 SGRGXGGXGLGK 12 T 11 Shadoo unppercent F Eukaryota T 5yf4 2 B B STK26_HUMAN MST3 AND SOK1-RELATED KINASE,MAMMALIAN STE20-LIKE PROTEIN KINASE 4,STE20-LIKE KINASE MST4,SERINE/THREONINE-PROTEIN KINASE MASK THPEWSFTTVRKKPDP 16 T 1.7 MRPL52 pdbhh F Eukaryota T 5ygd 2 B D PIWI_DROME ASP-GLN-GLY-ARG-GLY-ARG-2MR-ARG-PRO-LEU-ASN DQGRGRXRPLN 11 T 7.6 M157 pdbhh F Eukaryota T 5ygf 2 B D PIWI_DROME ASP-GLN-GLY-ARG-GLY-ARG-ARG-ARG-PRO DQGRGRRRP 9 T 15 DUF863 pdbhh F Eukaryota F 5yhr 1 A,B A,B ACR30_BPD31 GENE PRODUCT 30,GP30 GMTKTAQMIAQQHKDTVAACEAAEAIAIAKDQVWDGEGYTKYTFDDNSVLIQSGTTQYAMDADDADSIKGYADWLDDEARSAEASEIERLLESVEEE 97 T 0.13 Transglycosylas unp T Viruses T 5yi7 2 B,D B,D Q9W4I7_DROME RE65495P QQKLPTNPFEVLRQPPKKKKREHACFENPGLNLE 34 T 2.8 LAG1-DNAbind pdbhh F Eukaryota T 5yi8 2 B B Q9W4I7_DROME RE65495P KKKKREHACFENPGLNLELPEKQFNPYEVVRSA 33 T 14 HAGH_C pdbhh F Eukaryota T 5yip 2 B B ANK3_RAT ANK-3,ANKYRIN-G PEDDWTEFSSEEIREARQAAASHAPS 26 T 0.34 GHBP pdbhh F Eukaryota T 5yiq 2 B D ANK3_RAT ANK-3,ANKYRIN-G PEDDWTEFSSEEIREARQAAASHAPS 26 T 0.34 GHBP pdbhh F Eukaryota T 5yir 2 D,E,F C,G,H ANK2_HUMAN ANK-2,ANKYRIN-B,BRAIN ANKYRIN,NON-ERYTHROID ANKYRIN VEEEWVIVSDEEIEEARQKAPLEITEY 27 T 3 Pex14_N pdbhh F Eukaryota T 5yis 2 C,D D,C ANK2_HUMAN ANK-2,ANKYRIN-B,BRAIN ANKYRIN,NON-ERYTHROID ANKYRIN VEEEWVIVSDEEIEEARQKAPLEITEY 27 T 3 Pex14_N pdbhh F Eukaryota T 5ykk 1 A A Andersonin-Y1 (AY1) FLPKLFAKITKKNMAHIR 18 T 0.15 Antimicrobial_1 pdbhh F T 5ykl 1 A A designed AY1C FLPKLFAKITKKNMAHIRC 19 T 0.18 Antimicrobial_1 pdbhh F T 5ykq 1 A A designed CAY1 CFLPKLFAKITKKNMAHIR 19 T 0.15 Antimicrobial_1 pdbhh F T 5yl7 2 B B Copurified unknown peptide XXX 3 F F F 5ylx 3 C C RPOA_PRRS1 PRRSV-NSP9-TMP9 peptide TMPPGFELY 9 T 0.24 MucB_RseB_C pdbhh T Viruses T 5ym9 1 A,B A,B Q5ZTL3_LEGPH DEAMIDASE KLESPGFMVHKKLKSMSQSYGVMMTGVPAEVLGQMQAERSIPSINKTGNLKQQIAKEVSKVCHMMTEPTQSCGQASNDVCELLLGKIEAEKFHFTKYEALSADGDNLKNVLENTAPSSTNLLIRFEIDREDPPIVLVKTKNENFNPETAVKNKIYLLENKLYFIDKMGNLFNLGPGKKKCTQLFNAIGDSAEYSLCDPFVLEEPEKPEDFAISEIVDIFNEQKERFDFWIGSHSFTIYIPQTLGESPRQFYPYQAYFGSHTLQDWFVSDKDEYLSRIGIDKYIEKLAVLGKTTNTKERSDIYAEFFSKRGREAFFCAHLNEKRQPLRVKFKITEINPELALKNLQETQEFIDTHPGENPSDKVENYRNRAKLAMTEHLESLLDI 384 T 0.022 AgrD pdb F Bacteria T 5ymv 3 C,F C,F ALA-VAL-LYS-GLY-VAL-GLY-THR-MET-VAL AVKGVGTMV 9 T 1.6 Spec3 pdbhh F T 5ymw 3 C,F,I,L C,F,I,L SRC_RSVP LEU-PRO-ALA-CYS-VAL-LEU-GLU-VAL LPACVLEV 8 F T Viruses T 5ypo 2 C,D C,D DLGP1_HUMAN SAPAP AARRESYLKATQPSL 15 T 37 EABR pdbhh F Eukaryota T 5ypr 2 B B Synthesized GK inhibitor RIRREEYRRAINGQSF 16 T 5.9 DUF6026 pdbhh F T 5ypu 2 B,D B,D COBL_MOUSE Cordon-Bleu WH2 motif SLHSALXEAIHSSGGREKLRKV 22 T 0.00025 WH2 unppercent F Eukaryota T 5ypz 1 A,B,C A,B,C Q93I73_ECOLX CofB GPDEARRQIVSNALISEIAGIVDFVAEEQITVIEQGIEKEITNPLYEQSSGIPYINRTTNKDLNSTMSTNASEFINWGAGTSTRIFFTRKYCISTGTQGNYEFSKDYIPCEEPAILSNSDLKIDRIDFVATDNTVGSAIERVDFILTFDKSNANESFYFSNYVSSLEKAAEQHSISFKDIYVVERNSSGAAGWRLTTISGKPLTFSGLSKNIGSLDKTKNYGLRLSIDPNLGKFLRADGRVGADKLCWNIDNKMSGPCLAADDSGNNLVLTKGKGAKSNEPGLCWDLNTGTSKLCLTQIEGKDNNDKDASLIKLKDDNGNPATMLANILVEEKSMTDSTKKELRTIPNTIYAAFSNSNASDLVITNPGNYIGNVTSEKGRIELNVQDCPVSPDGNKLHPRLSASIASIVADTKDSNGKYQADFSSLAGNRNSGGQLGYLSGTAIQVNQSGSKWYITATMGVFDPLTNTTYVYLNPKFLSVNITTWCSTEPQT 492 T 2.1 PulG unphh F Bacteria T 5ypz 2 D,E,F D,E,F Q93I65_ECOLX CofJ SPSSSEGGAFTVNMPKTSTVDDIR 24 T 4.9 DUF5808 pdbhh F Bacteria T 5yq0 1 A,B,C,D,E,F,G,H A,B,C,D,E,F,G,H Q93I65_ECOLX CofJ SPSSSEGGAFTVNMPKTSTVDDIRGCPTLETPLKLTFTEDIQPRKENGSTYFYYDGWRGVGQTVNPWSPVLDNHKYAATEHEIHIYVEFFQTPSNRFADKNGAYSYIDANGVMYTNGEYSWEHVPALGKNIYKVVISDWNKGQTKSIYLPGRDFKTVEVFHFQNNRPQWDDRNSYENVKSRINNNISKSYSKAKLNEQLSTYVHDDGTDSLFLYQKLSRASLKESQINYYQLRGKFNGVNLGYWAQEYILFGGEGAEQLKNKIPDMSNYSMEDNGSFKNALKIESLDLRLMDNNRMAYGSTGTYIASFNRTDFSMTPENLKACGLD 326 T 9.5 DUF4999 unphh F Bacteria T 5yq7 5 GA Y Peptide from Precursor for L and M subunits of photosynthetic reaction center XXXXXXXXXXXXXXXXXXXXXXXXX 25 F F F 5yq7 6 HA X Subunit X XXXXXXXXXXXXXXXXXXXXXXX 23 F F F 5ytp 1 A,B A,B Q5SM04_THET8 TTHA0139 MAKKEKKRLQVVISEEQDALLTRAAYALSSPERAVSKSEVVRLAIEKIARELEEGKAKEELEALLKHLKAEEGEEEA 77 T 0.0032 TAN unppercent F Bacteria T 5ytq 1 A,B,C A,B,C Q5SM04_THET8 TTHA0139 MAKKEKKRLQVVISEEQDALLTRAAYALSSPERAVSKSEVVRLAIEKIARELEEGKAKEELEALLKHLKAEEGEEEA 77 T 0.0032 TAN unppercent F Bacteria T 5yty 3 C,D D,F echinomycin XAXXXAXX 8 T 190 RSF pdbhh F F 5ytz 2 C,D D,F Echinomycin XAXXXAXX 8 T 190 RSF pdbhh F F 5yu6 2 B,E E,F 13-mer peptide XXXXXXXXXXXXX 13 F F F 5yvi 2 B B FUS_HUMAN 75 KDA DNA-PAIRING PROTEIN,ONCOGENE FUS,ONCOGENE TLS,POMP75,TRANSLOCATED IN LIPOSARCOMA PROTEIN GSGGGPGGSHMGGNYGDDRRGGRGGYDRGGYRGRGGDRGGFRGGRGGGDRGGFGPGKMDSRGEHRQDRRERPY 73 T 860 DUF2219 pdbhh F Eukaryota T 5yvk 1 A A V5TER4_9CYAN AMBU4 MGSSHHHHHHSSGLVPRGSHMASTSAVSIPINNAGFENPFMDVVDDYTIDTPPGWTTYDPNNLVPEKRTTWTSNNGVGYVGPGTQFYNQLAPEGRNIGYIYLSQNPGSGVAGFEQILDATLEPDTKYTLTVDVGNLAGTFKGLSFAGFPGYRVELLAGDTVLAADHNNLFIKEGEFKTSTVTYTSTAKDLHLGQKLGIRLVNLLQDKFSGLDFDNVRLTTEPTET 225 T 2.1 DUF642 pdbhh F Bacteria T 5yvp 1 A,B,C,D A,B,C,D A0A1P8VSI6_9CYAN FILC1 MGSSHHHHHHSSGLVPRGSHMASTSVVSIPINNAGFEDPFIEVVDDYTVDTPPGWTTYNPNNLVPEKRTTWTSNNGVGYVGPGTQFYNQLAPEGRNIGYIYLAQKPGSGVAGFEQILDATLEPDTNYTLKVDVGNLAGTFKGLSFAGFPGYRVELLAGDTVLAADHNNLFIKEGEFKTSTVTYTSTAKDLHLGQKLGIRLVNLLQDKFSGLDFDNVRLTAEPTEA 225 T 2.2 DUF642 pdbhh F Bacteria T 5yy4 2 B B CCR5_HUMAN CCR5,CHEMR13,HIV-1 FUSION CORECEPTOR DINYYTSEP 9 T 5.5 Pico_P1A pdbhh F Eukaryota T 5yy9 2 C,D C,D DNLI1_HUMAN Ligase 1 IPKRRTARKQLPK 13 T 26 EAV_GS pdbhh F Eukaryota T 5yyf 2 B,D B,D Peptide inhibitor PHQ-H3(Q5-K9) XQTARKX 7 T 390 MC1 pdbhh F T 5yyz 2 B B HOP1_YEAST Meiosis-specific protein HOP1 QASIQPTQFVSNN 13 T 4.4 ParBc_2 pdbhh F Eukaryota T 5yz9 1 A A MTA70_HUMAN METHYLTRANSFERASE-LIKE PROTEIN 3,HMETTL3,N6-ADENOSINE-METHYLTRANSFERASE 70 KDA SUBUNIT,MT-A70 AHMSIVEKFRSRGRAQVQEFCDYGTKEECMKASDADRPCRKLHFRRIINKHTDESLGDCSFLNTCFHMDTCKYVHYEIDASMDSEAPGSKDHTPSQELALTQ 102 T 0.0081 DUF445 unp F Eukaryota T 5yzd 3 C B peptide CBZ-DPN-PHE-GLY XXFG 4 T 100 DUF6015 pdbhh F F 5yzg 45 CB 4 UNKNOW AAAPAGGGAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 37 T 890 GHMP_kinases_N pdbhh F F 5z08 2 C C G2R3T1_THITE Cenp-K RQKDEWAKKTSSLMKQLDWFIGEHLGAMLAAEELGGPVVGELMEIDPDDLSAGFNAHGKLKKATSQPDLDRRQRRIDDIWGPQDEQGQAHKRKRGADEALAASAEMRDLIEQLMNKLVEAGGDNSATYVEIPRESAAARFLVRSKVAMFHPNDARRLRLVDFGRDLDD 168 T 4E-08 CENP-K pdb F Eukaryota T 5z1v 1 A,B,C,D A,B,C,D A0A0H4ITX1_MAGOR AvrPib protein MSHHHHHHSMAMTQVTILKKGERITWVEVPKGESREFNIRGKYFTVSVSDDGTPSISGSKYTVE 64 T 0.24 Picorna_P3A unppercent F Eukaryota T 5z1y 1 A A C3Z8S4_BRAFL mBjAMP1 peptide NLCASLRARHTIPQCRKFGRR 21 T 6.7 CENP-O pdbhh F Eukaryota T 5z26 1 A A SC51_SHEEP SMAP-18 RGLRRLGRKIAHGVKKYG 18 T 0.095 CAP18_C unppercent F Eukaryota T 5z28 1 A,B A,B VAL2_ARATH PROTEIN HIGH-LEVEL EXPRESSION OF SUGAR-INDUCIBLE-LIKE 1,PROTEIN VP1/ABI3-LIKE 2 AIKVCMNALCGAASTSGEWKKGWPMRSGDLASLCDKCGCAYEQSIFCEVFHAKESGWRECNSCDKRLHCGCIASRFMMELLENGGVTCISCAKKSGLISMNVS 103 T 0.19 FrhB_FdhB_N pdb F Eukaryota T 5z2c 1 A,B,C,D,E,F,G,H,I A,B,C,D,E,F,G,H,I ALPK1_HUMAN CHROMOSOME 4 KINASE,LYMPHOCYTE ALPHA-PROTEIN KINASE MNNQKVVAVLLQECKQVLDQLLLEAPDVSEEDKSEDQRCRALLPSELRTLIQEAKEMKWPFVPEKWQYKQAVGPEDKTNLKDVIGAGLQQLLASLRASILARDCAAAAAIVFLVDRFLYGLDVSGKLLQVAKGLHKLQPATPIAPQVVIRQARISVNSGKLLKAEYILSSLISNNGATGTWLYRNESDKVLVQSVCIQIRGQILQKLGMWYEAAELIWASIVGYLALPQPDKKGLSTSLGILADIFVSMSKNDYEKFKNNPQINLSLLKEFDHHLLSAAEACKLAAAFSAYTPLFVLTAVNIRGTCLLSYSSSNDCPPELKNLHLCEAKEAFEIGLLTKRDDEPVTGKQELHSFVKAAFGLTTVHRRLHGETGTVHAASQLCKEAMGKLYNFSTSSRSQDREALSQEVMSVIAQVKEHLQVQSFSNVDDRSYVPESFECRLDKLIL 446 T 0.24 RNPP_C pdbhh F Eukaryota T 5z2o 1 A A G2,7,13A SMAP-18 analogue RALRRLARKIAHAVKKYG 18 T 3.4 DUF5664 pdbhh F T 5z31 1 A A LYS-ASN-LYS-SER-ARG-VAL-ALA-ARG-GLY-TRP-GLY-ARG-LYS-CYS-PRO-LEU-PHE-GLY KNKSRVARGWGRKCPLFG 18 T 0.0093 Flavi_glycoprot pdbhh F T 5z32 1 A A VAL-ALA-ARG-GLY-TRP-GLY-ARG-LYS-CYS-PRO-LEU-PHE-GLY-LYS-ASN-LYS-SER-ARG VARGWGRKCPLFGKNKSR 18 T 0.0017 Flavi_glycoprot pdbhh F T 5z3a 1 A A IMGH_KRIFD Glycoside hydrolase 15-related protein MGSSHHHHHHSSGLVPRGSHMTTSARDTGLDSHELARLHELARHSHAVITRHQDAGGAYPAAPTFSAYRGYAWLRDGSFTAEGISRYGDVASAGRFHDWVDGVLRRRRGQVDDLLAAVDRGEVPSNEGMLPTRFTFDGNDGSDPWWDFQTDGYGMWLWSVVTHAARHGLDLERWRAGIDVAVDYLLAFWDRPCYDWWEEHVEHRHVSTLGAIHGGLVAVGTCAALRSAPWSAATLQVAARIRSLVSAEGVVDGHLVKWLGSSAVDGSLPACVVPFGLVPPDDDVAAMTRAAVAKDLDVDGGVHRFAADVFYGGGQWILLSALLGWNLAAAGDTAGALRHLRWIADQADADGDLPEQVPHHLLHPGSRAEWVARWGTVATPLLWSHGMYLILADELGLLPPAAKDA 405 T 7.9 Glyco_hydro_15 unppssm F Bacteria T 5z3b 1 A A IMGH_KRIFD Glycoside hydrolase 15-related protein MGSSHHHHHHSSGLVPRGSHMTTSARDTGLDSHELARLHELARHSHAVITRHQDAGGAYPAAPTFSAFRGYAWLRDGSFTAEGISRYGDVASAGRFHDWVDGVLRRRRGQVDDLLAAVDRGEVPSNEGMLPTRFTFDGNDGSDPWWDFQTDGYGMWLWSVVTHAARHGLDLERWRAGIDVAVDYLLAFWDRPCYDWWEEHVEHRHVSTLGAIHGGLVAVGTCAALRSAPWSAATLQVAARIRSLVSAEGVVDGHLVKWLGSSAVDGSLPACVVPFGLVPPDDDVAAMTRAAVAKDLDVDGGVHRFAADVFYGGGQWILLSALLGWNLAAAGDTAGALRHLRWIADQADADGDLPEQVPHHLLHPGSRAEWVARWGTVATPLLWSHGMYLILADELGLLPPAAKDA 405 T 7.9 Glyco_hydro_15 unppssm F Bacteria T 5z3c 1 A A IMGH_KRIFD Glycoside hydrolase 15-related protein MGSSHHHHHHSSGLVPRGSHMTTSARDTGLDSHELARLHELARHSHAVITRHQDAGGAYPAAPTFSAYRGYAWLRDGSFTAEGISRYGDVASAGRFHDWVDGVLRRRRGQVDDLLAAVDRGEVPSNEGMLPTRFTFDGNDGSDPWWDFQTDGYGMWLWSVVTHAARHGLDLERWRAGIDVAVDYLLAFWDRPCYDWWAEHVEHRHVSTLGAIHGGLVAVGTCAALRSAPWSAATLQVAARIRSLVSAEGVVDGHLVKWLGSSAVDGSLPACVVPFGLVPPDDDVAAMTRAAVAKDLDVDGGVHRFAADVFYGGGQWILLSALLGWNLAAAGDTAGALRHLRWIADQADADGDLPEQVPHHLLHPGSRAEWVARWGTVATPLLWSHGMYLILADELGLLPPAAKDA 405 T 7.9 Glyco_hydro_15 unppssm F Bacteria T 5z3d 1 A A IMGH_KRIFD Glycoside hydrolase 15-related protein MGSSHHHHHHSSGLVPRGSHMTTSARDTGLDSHELARLHELARHSHAVITRHQDAGGAYPAAPTFSAYRGYAWLRDGSFTAEGISRYGDVASAGRFHDWVDGVLRRRRGQVDDLLAAVDRGEVPSNEGMLPTRFTFDGNDGSDPWWDFQTDGYGMWLWSVVTHAARHGLDLERWRAGIDVAVDYLLAFWDRPCYDWWEEHVEHRHVSTLGAIHGGLVAVGTCAALRSAPWSAATLQVAARIRSLVSAEGVVDGHLVKWLGSSAVDGSLPACVVPFGLVPPDDDVAAMTRAAVAKDLDVDGGVHRFAADVYYGGGQWILLSALLGWNLAAAGDTAGALRHLRWIADQADADGDLPEQVPHHLLHPGSRAEWVARWGTVATPLLWSHGMYLILADELGLLPPAAKDA 405 T 7.9 Glyco_hydro_15 unppssm F Bacteria T 5z3e 1 A A IMGH_KRIFD Glycoside hydrolase 15-related protein MGSSHHHHHHSSGLVPRGSHMTTSARDTGLDSHELARLHELARHSHAVITRHQDAGGAYPAAPTFSAYRGYAWLRDGSFTAEGISRYGDVASAGRFHDWVDGVLRRRRGQVDDLLAAVDRGEVPSNEGMLPTRFTFDGNDGSDPWWDFQTDGYGMWLWSVVTHAARHGLDLERWRAGIDVAVDYLLAFWDRPCYDWWEEHVEHRHVSTLGAIHGGLVAVGTCAALRSAPWSAATLQVAARIRSLVSAEGVVDGHLVKWLGSSAVDGSLPACVVPFGLVPPDDDVAAMTRAAVAKDLDVDGGVHRFAADVFYGGGQWILLSALLGWNLAAAGDTAGALRHLRWIADQADADGDLPAQVPHHLLHPGSRAEWVARWGTVATPLLWSHGMYLILADELGLLPPAAKDA 405 T 7.9 Glyco_hydro_15 unppssm F Bacteria T 5z3f 1 A A IMGH_KRIFD Glycoside hydrolase 15-related protein MGSSHHHHHHSSGLVPRGSHMTTSARDTGLDSHELARLHELARHSHAVITRHQDAGGAYPAAPTFSAYRGYAWLRDGSFTAEGISRYGDVASAGRFHDWVDGVLRRRRGQVDDLLAAVDRGEVPSNEGMLPTRFTFDGNDGSDPWWDFQTDGYGMWLWSVVTHAARHGLDLERWRAGIDVAVDYLLAFWDRPCYDWWEEHVEHRHVSTLGAIHGGLVAVGTCAALRSAPWSAATLQVAARIRSLVSAEGVVDGHLVKWLGSSAVDGSLPACVVPFGLVPPDDDVAAMTRAAVAKDLDVDGGVHRFAADVFYGGGQWILLSALLGWNLAAAGDTAGALRHLRWIADQADADGDLPAQVPHHLLHPGSRAEWVARWGTVATPLLWSHGMYLILADELGLLPPAAKDA 405 T 7.9 Glyco_hydro_15 unppssm F Bacteria T 5z3g 30 DA d Unassigned XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 154 F F F 5z53 1 A,B,C,D A,B,C,D A0A1P8VSI6_9CYAN FILC1 MKRKLIVAVVCLIFICFGINTPAHATSVVSIPINNAGFEDPFIEVVDDYTVDTPPGWTTYNPNNLVPEKRTTWTSNNGVGYVGPGTQFYNQLAPEGRNIGYIYLAQKPGSGVAGFEQILDATLEPDTNYTLKVDVGNLAGTFKGLSFAGFPGYRVELLAGDTVLAADHNNLFIKEGEFKTSTVTYTSTAKDLHLGQKLGIRLVNLLQDKFSGLDFDNVRLTAEPTEA 227 T 3.2 DUF642 pdbhh F Bacteria T 5z54 1 A,B,C,D A,B,C,D A0A076NBW8_9CYAN HPIU5 MKRKLIVAVVCLIFICFGINTPAHATSVVSIPINNAGFEDPFIEVVDDYTVDTPPGWTTYNPNNLVPEKRTTWTSNNGVGYVGPGTQFYNQLAPEGRNIGYIYLAQKPGSGVAGFEQILDATLEPDTKYTLKVDVGNFGGEFQKISLAGFPGYRVELLAGDTVLAADHNNLYIKDGEFKTSTVTFTATPDNPYLDQKLGIRLINLLQGTFSGLDFDNVRLTVEPAQT 227 T 3.4 DUF4969 pdbhh F Bacteria T 5z5s 2 B C PRGC1_HUMAN PPARGC-1-ALPHA,LIGAND EFFECT MODULATOR 6 QEAEEPSLLKKLLLAPANT 19 T 3.3 Apo-CIII pdbhh F Eukaryota T 5z6s 2 B C PRGC1_HUMAN PPARGC-1-ALPHA,LIGAND EFFECT MODULATOR 6 QEAEEPSLLKKLLLAPANT 19 T 3.3 Apo-CIII pdbhh F Eukaryota T 5z8h 2 B B Peptide inhibitor XAGESLYEX 9 T 24 DUF3928 pdbhh F T 5z93 2 B B V9H1G0_HUMAN Gene for histone H3 (germline gene) TARXSTGGKA 10 T 0.044 PAF unp F Eukaryota T 5z94 2 C,D C,D V9H1G0_HUMAN Gene for histone H3 (germline gene) ARTXQTARKSTGGKA 15 T 0.044 PAF unp F Eukaryota T 5zc3 1 A,B B,A A0A3F2YLV0_PHYCP RxLR effector MDTTDIKPVRPAINLQQPPFVVGRLLRTVQDEERGFTLPGAGKLADLFESTALKLAQSARINTWLVKGTSVDDAFLKLELNTAGSRIFENPKLLTWAVYVTKVEKQNPEEIILAKLSKQFTEGSLAKMIASAKLDSKTEGLATILQAQQRQVWVDAGKSSDEVFKLLQLDEAGTKLFKNQQFSTWTSFVDAFNRKYPEKAVSIFSKLAKTYDGFTLWKMLEAAKKVPKTEIIASKLQAQQIDAWLDAGKSTDEVFNLLKLQRTGDKLFKNSQFLTWVSYVEKFNKKDPDQAIAIFSKLAGVYDQVTLSSMLEAAKHVPSTKRIASYLQGQQNQHWLADGKSTDDIFKLLKLNTPSPENLIDPRLDAWTSFMRAFNMANEGKETTLIATLTTHYKDRGLAQLLQEGTKFASTKKIAEELQTAQFARWLQLGKTEDDIFALLKLKLTTPTTDPEAIVFYQYKLFMDAHMKLAAA 472 T 0.0012 RXLR pdbhh F Eukaryota T 5zck 1 A A RIPK3_HUMAN RIP-3 CORE REGION VQVG 4 F F Eukaryota F 5zcn 1 A A A0A381AKI5_BREDI brevunsin DGMGEEFIEGLVRDSLYPPAG 21 T 4.5 DUF4090 unphh F Bacteria T 5zeb 56 DB 8 A0QTP4_MYCS2 BL37 MAKRGRKKRDRKHSKANHGKRPNA 24 T 0.18 DUF6254 pdb F Bacteria T 5zep 24 X 0 BS1 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 479 F F F 5zep 56 DB 5 A0QTP4_MYCS2 Uncharacterized protein MAKRGRKKRDRKHSKANHGKRPNA 24 T 0.18 DUF6254 pdb F Bacteria T 5zet 34 HA 8 A0QTP4_MYCS2 Uncharacterized protein bL37 MAKRGRKKRDRKHSKANHGKRPNA 24 T 0.18 DUF6254 pdb F Bacteria T 5zfj 1 A,B,C,D A,B,C,D A0A1P8VSI6_9CYAN FILC1 MKRKLIVAVVCLIFICFGINTPAHATSVVSIPINNAGFEDPFIEVVDDYTVDTPPGWTTYNPNNLVPEKRTTWTSNNGVGYVGPGTQFYNQLAPEGRNIGYIYLAQKPGSGVAGFEQILDATLEPDTNYTLKVDVGNLAGTFKGLSFAGFPGYRVELLAGDTVLAADHNNLFIKEGEFKTSTVTYTSTAKDLHLGQKLGIRLVNLLQDKFSGLDFDNVRLTAEPTEA 227 T 3.2 DUF642 pdbhh F Bacteria T 5zgb 12 L O M1VFJ4_CYAM1 PsaO MYGFVSVLPVASALQRQQCTCAARCSFTTRAARVAPVRIALSRPQRLVGASSLRMFEVSDGEPYPLNPAVIFIALIGWSAVAAIPSNIPVLGGTGLTQAFLASIQRLLAQYPTGPKLDDPFWFYLIVYHVGLFALLIFGQIGYAGYAKGTYNRSA 155 T 23 YbgT_YccB pdbhh F Eukaryota T 5zgc 2 G,H,I,J,K,L G,H,I,J,K,L H4_HUMAN Histone H4K16bhb peptide GKGGAXRHRKV 11 T 11 Shadoo unppercent F Eukaryota T 5zgd 1 A A ROA1_HUMAN GLY-PHE-GLY-GLY-ASN-ASP-ASN-PHE-GLY GFGGNDNFG 9 T 2.3 UPF0738 pdbhh F Eukaryota F 5zgh 15 O O M1VFJ4_CYAM1 PsaO MYGFVSVLPVASALQRQQCTCAARCSFTTRAARVAPVRIALSRPQRLVGASSLRMFEVSDGEPYPLNPAVIFIALIGWSAVAAIPSNIPVLGGTGLTQAFLASIQRLLAQYPTGPKLDDPFWFYLIVYHVGLFALLIFGQIGYAGYAKGTYNRSA 155 T 23 YbgT_YccB pdbhh F Eukaryota T 5zgl 1 A,B A,B ROA1_HUMAN HNRNP A1,HELIX-DESTABILIZING PROTEIN,SINGLE-STRAND RNA-BINDING PROTEIN,HNRNP CORE PROTEIN A1 GGGYGGS 7 T 7.2 GMC_oxred_N pdbhh F Eukaryota F 5zia 3 C,F,I,L,O,R R,C,F,J,N,Q TAU_HUMAN phosphorylated tau peptide SPSSAKSRL 9 T 3.4 MinE pdbhh F Eukaryota T 5zji 17 Q O B6SQZ7_MAIZE 16kDa membrane protein MHLLASCCFTRGSRVSARNPLMSRNLERNGRITCMTFPRDWLRRDLSVIGFGLIGWMGPSSVPAINGNSLTGLFFSSIGQELAHFPTPPPVTSQFWLWLVTWHLGLFIVLTFGQIGFKGRTEDYFEK 127 T 0.0085 Plasmid_RAQPRD pdbpercent F Eukaryota T 5zjy 2 B B LYS-LYS-ARG-TYR-SER-ARG-2JN-GLN-LEU-LEU-2JN-PHE XKKRYSRXQLLXFX 14 T 6.6 Tachystatin_A pdbhh F T 5zjz 2 B B EIF-4G1 XKKRYSRXQLLXFWX 15 T 3.1 BURAN pdbhh F T 5zk5 2 B B IF4G1_HUMAN LYS-ARG-TYR-SER-ARG-GLU-GLN-LEU-LEU-MK8-PHE-GLN-ARG-MK8 XKKRYSREQLLXFQRXX 17 T 0.00025 eIF_4G1 unphh F Eukaryota T 5zk7 2 C,D C,D ACE-ARG-TYR-SER-ARG-MK8-GLN-LEU-LEU-MK8-LEU-PHE-ARG-NH2 XRYSRXQLLXLFRX 14 T 8.4 Hat1_N pdbhh F T 5zk9 2 B B ACE-ARG-ILE-ILE-TYR-SER-ARG-MK8-GLN-LEU-LEU-MK8-LEU-LYS-NH2 XRIIYSRXQLLXLKX 15 T 0.14 eIF_4EBP pdbhh F T 5zml 2 B B ACE-LYS-LYS-ARG-TYR-SER-ARG-MK8-GLN-LEU-LEU-MK8-PHE-ARG-ARG XKKRYSRXQLLXFRRR 16 T 5.4 BURAN pdbhh F T 5zmo 1 A A Q9L0M9_STRCO Uncharacterized protein McrA GSREAPKTFHRRVGDVRPARRAMGPALHRPVLLLWAIGQAVARAPRLQPWSTTRDAVAPLMEKYGQVEDGVDGVRYPFWALVRDDLWCVEQAEELTLTSRGRRPTLESLNAVDPSAGLREDDYNLLRSQPEAAASAAAGLIARYFHLLPAGLLEDFGLHELLAGRWPDALRP 172 T 29 RepA_C pdbhh F Bacteria T 5zmq 3 I,J I,K peptide PAC-DLY-DLY-DAR XXXX 4 F F F 5zmr 1 A A RPN5_YEAST PROTEASOME NON-ATPASE SUBUNIT 5 MSRDAPIKADKDYSQILKEEFPKIDSLAQNDCNSALDQLLVLEKKTRQASDLASSKEVLAKIVDLLASRNKWDDLNEQLTLLSKKHGQLKLSIQYMIQKVMEYLKSSKSLDLNTRISVIETIRVVTENKIFVEVER 136 T 0.013 ERp29 unppercent F Eukaryota T 5zms 3 I,J F,I 4-guanidinomethyl-phenylacetyl-Lys-Lys-Arg-H XXKX 4 T 450 DUF4628 pdbhh F F 5zmz 1 A A RIPK1_HUMAN Amyloid core of RIP1 IQIG 4 T 54 Poxvirus_B22R pdbhh F Eukaryota F 5zng 2 B C Q8J180_MAGGR AVR1-CO39 MAWKDCIIQRYKDGDVNNIYTANRNEEITIEEYKVFVNEACHPYPVILPDRSVLSGDFTSAYADDDESCYRHHHHHH 77 T 3.8 Ceramidase_alk pdbhh F Eukaryota T 5zob 3 I,J I,J 4-guanidinomethyl-phenylacetyl-Arg-Arg-Arg-4-amidinobenzylamide XRRRX 5 T 400 UCR_Fe-S_N pdbhh F F 5zoo 2 B A NCOR2_HUMAN SMRT corepressor SP1 fragment HIRGSITQGIPRSY 14 T 8.8 DUF1149 pdbhh F Eukaryota T 5zop 2 B A NCOR2_HUMAN SMRT corepressor SP2 fragment EGSITQGTPLKY 12 T 8.6 PrmC_N pdbhh F Eukaryota T 5zpw 2 B,D,F B,D,F MET-THR-TRP-GLU-GLU-TRP-ASP-MK8-LYS-ILE-GLU-MK8-TYR-THR-MK8-LYS-ILE-GLU-MK8-LEU-ILE-LYS-LYS-SER MTWEEWDXKIEXYTXKIEXLIKKS 24 T 0.036 GP41 pdbhh F T 5zqg 2 C C PEPTIDE LEU-ALA-GLN-LEU-GLN-VAL-ALA KLAQLQVAYHQ 11 T 14 GTP-bdg_M pdbhh F T 5zqv 2 E,F,G,H E,F,G,H PPR3A_HUMAN PROTEIN PHOSPHATASE 1 GLYCOGEN-ASSOCIATED REGULATORY SUBUNIT,PROTEIN PHOSPHATASE TYPE-1 GLYCOGEN TARGETING SUBUNIT,RG1 MEPSEVPSQISKDNFLEVPNLSDSLCEDEEVTFQPGFSPQPSRRGSDSSEDIYLDTPSSGTRRVSFADSFGFNLVSVKEFDSWELPSASTTFDLGTDIF 99 T 4.8 RSD-2 pdbhh F Eukaryota T 5zs3 2 B U GLY-ARG-LEU-LEU-PRO GRLLP 5 T 34 Thioredoxin_15 pdbhh F F 5zs6 2 B U GLY-ARG-LEU-LEU GRLL 4 T 77 rRNA_proc-arch pdbhh F F 5zt0 2 G,H,I,J G,H,I,J PPR3B_HUMAN Protein phosphatase 1 regulatory subunit 3B SKPLRPCIQLSSKNEASGMVAPAVQEKKVKKRVSFADNQGLALTMVKVFSEFDDPLDMPFNITELLDNIVSLTTA 75 T 0.018 DUF4913 pdbpssm F Eukaryota T 5zt3 1 A A M1SWB3_ORYSI WA352 AHMQEAANRSPPYAPYPYPVDEIIGGDSVQSIQRRLLGTNWNPSAHDMQMSRIQAEDLFELKVEIIRKMAGLHPSGDWMGWGARALDNPRTATGEEDLARLHQMLDDLQSRNEQSATFWRLVERVRLRAD 130 T 0.18 Rnk_N pdb F Eukaryota T 5zuj 2 B I TIFA_HUMAN TRAF2-BINDING PROTEIN SSQSQSPTEDDENES 15 T 41 Eapp_C pdbhh F Eukaryota T 5zut 2 B E Q5T4P3_HUMAN PHOSPHATIDYLINOSITOL 3-KINASE REGULATORY SUBUNIT GAMMA EVMMPYSTELIFYIEMDP 18 T 1.9 Colicin_Pyocin pdbhh F Eukaryota T 5zv3 1 A A TAU_HUMAN TAU PEPTIDE TEDGSEEPGSETSDAKSTPT 20 T 55 DUF6318 pdbhh F Eukaryota T 5zvf 1 A A CTHL4_BOVIN non glycosylated analogue of Indolicidin ILPWKWKWTPWRRX 14 T 0.21 SAG unp F Eukaryota T 5zvn 1 A A CTHL4_BOVIN glycosylated analogue of Indolicidin ILPWKWKWTPWRRX 14 T 0.21 SAG unp F Eukaryota T 5zvw 2 B B SER-ALA-ILE-ARG-GLY-ALA SAIRGA 6 T 15 Phospholamban pdbhh F F 5zw6 2 B,D C,D GLY-MET-PRO-ARG-GLY-ALA GMPRGA 6 T 2.4 BCD pdbhh F F 5zwe 2 B C MED1_HUMAN 13-meric peptide from DRIP205 NR2 BOX peptide KNHPMLMNLLKDN 13 T 3.8 Gb3_synth pdbhh F Eukaryota T 5zwf 2 B C MED1_HUMAN 13-meric peptide from DRIP205 NR2 BOX peptide KNHPMLMNLLKDN 13 T 3.8 Gb3_synth pdbhh F Eukaryota T 5zwh 2 B C MED1_HUMAN 13-meric peptide from DRIP205 NR2 BOX peptide KNHPMLMNLLKDN 13 T 3.8 Gb3_synth pdbhh F Eukaryota T 5zwi 2 B C MED1_HUMAN 13-meric peptide from DRIP205 NR2 BOX peptide KNHPMLMNLLKDN 13 T 3.8 Gb3_synth pdbhh F Eukaryota T 5zwn 19 S x U1 snRNP AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 49 T 8200 Porin_4 pdbhh F F 5zys 2 B B Nephrin LPFELRGHLV 10 T 2 Saccharop_dh_N pdbhh F T 5zz9 2 D,E,F D,E,F DREB_HUMAN DEVELOPMENTALLY-REGULATED BRAIN PROTEIN LLNFDELPEPPATFCDPEEVEGSGENLQ 28 T 31 DUF4604 pdbhh F Eukaryota T 6a0f 2 C,D C,D 5-mer peptide Asn-Phe-Ala-Ala-Arg NFAAR 5 T 110 DUF3950 pdbhh F F 6a0h 2 C,D C,D 5-mer peptide ASN-LEU-ALA-ALA-ARG NLAAR 5 T 230 DUF3566 pdbhh F F 6a22 2 B,D,F,H B,D,F,H NCOR2_HUMAN N-COR2,CTG REPEAT PROTEIN 26,SMAP270,SILENCING MEDIATOR OF RETINOIC ACID AND THYROID HORMONE RECEPTOR,SMRT,T3 RECEPTOR-ASSOCIATING FACTOR,TRAC,THYROID-,RETINOIC-ACID-RECEPTOR-ASSOCIATED COREPRESSOR TNMGLEAIIRKALMGKYDQWEE 22 T 3.5 RHH_7 pdbhh F Eukaryota T 6a27 1 A,B A,B PPRA_DEIRA PLEIOTROPIC PROTEIN PROMOTING DNA REPAIR MARAKAKDQTDGIYAAFDTLMSTAGVDSQIAALAASEADAGTLDAALTQSLQEAQGRWGLGLHHLRHEARLTDDGDIEILTDGRPSARVSEGFGALAQAYAPMQALDERGLSQWAALGEGYRAPGDLPLAQLKVLIEHARDFETDWSAGRGETFQRVWRKGDTLFVEVARPASAEAALSDAARDVIASIKDRAFQRELMRRSEKDGMLGALLGARHAGAKANLAQLPEAHFTVQAFVQTLSGAAARNAEEYRAALKTAAAALEEYQGVTTRQLSEVLRHGLRES 284 T 8.9 Ldt_C pdbhh F Bacteria T 6a28 1 A,B A,B PPRA_DEIRA PLEIOTROPIC PROTEIN PROMOTING DNA REPAIR MARAKAKDQTDGIYAAFDTLMSTAGVDSQIAALAASEADAGTLDAALTQSLQEAQGRWGLGLHHLRHEARLTDDGDIEILTDGRPSARVSEGFGALAQAYAPMQALDERGLSQWAALGEGYRAPGDLPLAQLKVLIEHARDFETDWSAGRGETFQRVWRKGDTLFVEVARPASAEAALSDAARDVIASIKDRAFQRELMRRSEKDGMLGALLGARHAGAKANLAQLPEAHFTVQAFVQTLSGAAARNAEEYRAALKTAAAALEEYQGVTTRQLSEVLRHGLRES 284 T 8.9 Ldt_C pdbhh F Bacteria T 6a29 1 A,B,C,D,E,F,G,H A,B,C,D,E,F,G,H PPRA_DEIRA PLEIOTROPIC PROTEIN PROMOTING DNA REPAIR MARAKAKDQTDGIYAAFDTLMSTAGVDSQIAALAASEADAGTLDAALTQSLQEAQGRWGLGLHHLRHEARLTDDGDIEILTDGRPSARVSEGFGALAQAYAPMQALDERGLSQWAALGEGYRAPGDLPLAQLKVLIEHRRDFETDWSAGRGETFQRVWRKGDTLFVEVARPASAEAALSDAAWDVIASIKDRAFQRELMRRSEKDGMLGALLGARHAGAKANLAQLPEAHFTVQAFVQTLSGAAARNAEEYRAALKTAAAALEEYQGVTTRQLSEVLRHGLRES 284 T 8.9 Ldt_C pdbhh F Bacteria T 6a2b 3 C C TYR-MET-MET-PRO-ARG-HIS-TRP-PRO-ILE YMMPRHWPI 9 T 2.9 RS4NT pdbhh F T 6a33 2 B I TIFA_HUMAN PUTATIVE MAPK-ACTIVATING PROTEIN PM14,PUTATIVE NF-KAPPA-B-ACTIVATING PROTEIN 20,TRAF2-BINDING PROTEIN SSQSSSPTEMDENES 15 T 22 RhoGEF67_u1 pdbhh F Eukaryota T 6a38 4 D D NS2_MUMIP MVM NS2 NES GGSTVDEMTKKFGTLTIHDT 20 T 0.33 DUF6118 unppssm T Viruses T 6a3a 4 D D NS2_MUMIP MVM NES mutant Nm2 GGSTVEDMTKKFGTLTIHDT 20 T 0.33 DUF6118 unppssm T Viruses T 6a3b 4 D D NS2_MUMIP MVM NES mutant Nm13 DDTVDEMTKKFGTLTIHD 18 T 0.33 DUF6118 unppssm T Viruses T 6a3c 4 D D NS2_MUMIP MVM NES mutant Nm12 GGSTVDEMTKKFGTLTIHDDD 21 T 0.33 DUF6118 unppssm T Viruses T 6a3e 4 D D NS2_MUMIP MVM NES mutant Nm15 GGSDDTVDELTKKFGTLTIHDDD 23 T 0.33 DUF6118 unppssm T Viruses T 6a48 1 A A RELN_MOUSE REELER PROTEIN EIHSDSVILRDDFDSYQQLELNPNIWVECSNCEMGEQCGTIMHGNAVTFCEPYGPRELTTTCLNTTTASVLQFSIGSGSCRFSYSDPSITVSYAKNNTADWIQLEKIRAPSNVSTVIHILYLPEEAKGESVQFQWKQDSLRVGEVYEACWALDNILVINSAHREVVLEDNLDPVDTGNWLFFPGATVKHSCQSDGNSIYFHGNEGSEFNFATTRDVDLSTEDIQEQWSEEFESQPTGWDILGAVVGADCGTVESGLSLVFLKDGERKLCTPYMDTTGYGNLRFYFVMGGICDPGVSHENDIILYAKIEGRKEHIALDTLTYSSYKVPSLVSVVINPELQTPATKFCLRQKSHQGYNRNVWAVDFFHVLPVLPSTMSHMIQFSINLGCGTHQPGNSVSLEFSTNHGRSWSLLHTECLPEICAGPHLPHSTVYSSENYSGWNRITIPLPNAALTRDTRIRWRQTGPILGNMWAIDNVYIGPSCLKFCSGRGQCTRHGCKCDPGFSGPACEMASQTFPMFISESFGSARLSSYHNFYSIRGAEVSFGCGVLASGKALVFNKDGRRQLITSFLDSSQSRFLQFTLRLGSKSVLSTCRAPDQPGEGVLLHYSYDNGITWKLLEHYSYVNYHEPRIISVELPDDARQFGIQFRWWQPYHSSQGEDVWAIDEIVMTSRLENLYF 677 T 0.0021 EGF_2 pdb F Eukaryota T 6a51 1 A,B A,B Q0PBQ6_CAMJE CYSTEINE PERMEASE MGSSHHHHHHSSMKSLILPPNEFLDHYILNAEFHRFAGISKNAYKFWKNVEIGRYQGTRIIFLHRNCILEKHQQALRQCSGLNGFVLASAFCSFTGLAPSHLVEKNNSSIYKLLELKEICGIKFVNLKKFYDFLGLNYHQHIYIEKCHFFSPAPFEKRIKITESMCVGYY 170 T 0.1 DUF1247 pdbpssm F Bacteria T 6a56 1 A,B A,B A0A2Z5WLM1_ANTJA AJLec QRCGGWVKLNTAPVCFSAKGNRPGSFTPSHHGFLKSVKLRHLRGLVTCQSSTDAHDSYWGCKNRDGFHNYPLNVFVTDKHNKVMFPKTGATYYLDPYVIKNRFYGVQGYNAMSPELVLQHGCNSPSDYIGPDSQLRVWYGEDLYNTMESDNSGKVCADVFGYFV 164 T 0.027 CTP_transf_like pdbpssm F Eukaryota T 6a5d 1 A,B A,B LLG1_ARATH LORELEI-LIKE-GPI-ANCHORED PROTEIN 1 SFISDGVFESQSLVLGRNLLQTKKTCPVNFEFMNYTIITSKCKGPKYPPKECCGAFKDFACPYTDQLNDLSSDCATTMFSYINLYGKYPPGLFANQCKEGKEGLECPAGSQLPPETSAEVNAATTSSSRLWLTVSA 136 T 0.53 MIT_LIKE_ACTX pdbpercent F Eukaryota T 6a5e 2 B,E C,D LLG2_ARATH LORELEI-LIKE-GPI-ANCHORED PROTEIN 2 TTCKEDFANKNYTIITSRCKGPNYPANVCCSAFKDFACPFAEVLNDEKNDCASTMFSYINLYGRYPPGIFANMCKEGKEGLDCT 84 T 16 SPARK pdbhh F Eukaryota T 6a5j 1 A A CHINESE BROWN FROG IKKILSKIKKLLK 13 T 0.36 DUF2786 pdb F F 6a5q 2 D,E,F D,E,F TFEB pS211-peptide LVGVTSSSCPADLTQ 15 T 7.5 Hormone_4 pdbhh F T 6a5s 2 B,D,F,H E,C,F,H TFEB pS211-peptide LVGVTSSSCPADLTQ 15 T 7.5 Hormone_4 pdbhh F T 6a6c 1 A A A0A1S4NYE1_9BACL Beta-1,3-glucanase ADFTQGADVSGNNVTLWFKSSVNTTWVDVHYKVNSGVQQNVRMSFNAGAARFEHTILTAAQAEIEYFFTYNNGVPAYDTTTFTYR 85 T 0.0051 DUF6209 pdbpercent F Bacteria T 6a6i 1 A,C,E,G A,C,E,G Q59FF6_HUMAN Excision repair cross-complementing rodent repair deficiency, complementation group 6 variant GPGHMLPERLESESGHLREASALLPTTEHDDLLVEMRNFIAFQAHTDGQASTREILQEFESKLSASQSCVFRELLRNLCTFHRTSGGEGIWKLKPEYC 98 T 0.0018 TFIIF_beta pdbpssm F Eukaryota T 6a6w 2 B B SAD1_SCHPO Spindle pole body-associated protein sad1 GPLSDNEEFENVVKNGH 17 T 0.013 UPF0257 unppercent F Eukaryota T 6a86 1 A,B,C,D,E,F A,B,C,D,E,F A0A384E107_9AGAR PHOSL APVPVTKLVCDGDTYKCTAYLDYGDGKWVAQWDTAVFHTT 40 T 0.069 C2-set pdbhh F Eukaryota T 6a87 1 A,B,C,D,E,F A,B,C,D,E,F A0A384E107_9AGAR PHOSL APVPVTKLVCDGDTYKCTAYLDYGDGKWVAQWDTAVFHTT 40 T 0.069 C2-set pdbhh F Eukaryota T 6a8g 1 A,D P,E muPAin-1-IG XCPAYSRYIGCX 12 T 3.2 DUF6438 pdbhh F T 6a8n 2 B,D P,C MUPAIN-1-IG-2 CPAYSRYIGC 10 T 1.9 DUF6438 pdbhh F T 6a8o 2 C P peptide inhibitor, CPAYSAYLDC 10 T 1.4 DUF6438 pdbhh F T 6a92 1 A,B,C,D A,B,C,D A0A1P8VSI6_9CYAN FILC1 MKRKLIVAVVCLIFICFGINTPAHATSVVSIPINNAGFEDPFIEVVDDYTVDTPPGWTTYNPNNLVPEKRTTWTSNNGVGYVGPGTQFYNQLAPEGRNIGYIYLAQKPGSGVAGFEQILDATLEPDTNYTLKVDVGNLAGTFKGLSFAGFPGYRVELLAGDTVLAADHNNLFIKEGEFKTSTVTYTSTAKDLHLGQKLGIRLVNLLQDKFSGLDFDNVRLTAEPTEA 227 T 3.2 DUF642 pdbhh F Bacteria T 6a98 1 A,B,C,D A,B,C,D A0A1P8VSL7_9CYAN aromatic prenyltransferase MGSSHHHHHHSSGLVPRGSHMASAVSIPIKNAGFEEPSLTVEDYYTIDTPPGWITYDPNGLVPAKRTRITSNNGVGYTGPNSAYYNHKAPEGRNVAYVYLAQEIGSGIAGLEQTLDAVLKPNTKYTLTVDIGNSGGSFQGFPLDGFPGYRVELLAGDTVLAADQNNLYIKEKDFKTTTVTFIATPESPYLGQHLGIRLINPLQGKFSGVDFDNVRLTAEPAET 223 T 0.26 CBM_4_9 unppercent F Bacteria T 6a99 1 A,B,C,D A,B,C,D A0A1P8VSL7_9CYAN aromatic prenyltransferase MGSSHHHHHHSSGLVPRGSHMASAVSIPIKNAGFEEPSLTVEDYYTIDTPPGWITYDPNGLVPAKRTRITSNNGVGYTGPNSAYYNHKAPEGRNVAYVYLAQEIGSGIAGLEQTLDAVLKPNTKYTLTVDIGNSGGSFQGFPLDGFPGYRVELLAGDTVLAADQNNLYIKEKDFKTTTVTFIATPESPYLGQHLGIRLINPLQGKFSGVDFDNVRLTAEPAET 223 T 0.26 CBM_4_9 unppercent F Bacteria T 6a9c 2 C E C4M4E9_ENTHI FP10(GEF) PEPTIDE KVAPPIPHR 9 T 19 NapB pdbhh F Eukaryota T 6a9f 1 A,B,C,D A,B,C,D A0A1P8VSL7_9CYAN aromatic prenyltransferase MGSSHHHHHHSSGLVPRGSHMASAVSIPIKNAGFEEPSLTVEDYYTIDTPPGWITYDPNGLVPAKRTRITSNNGVGYTGPNSAYYNHKAPEGRNVAYVYLAQEIGSGIAGLEQTLDAVLKPNTKYTLTVDIGNSGGSFQGFPLDGFPGYRVELLAGDTVLAADQNNLYIKEKDFKTTTVTFIATPESPYLGQHLGIRLINPLQGKFSGVDFDNVRLTAEPAET 223 T 0.26 CBM_4_9 unppercent F Bacteria T 6a9u 2 B B apstatin XPPAX 5 T 680 SEC-C pdbhh F F 6a9w 1 A A M5AAG8_9CAUD Primase MGSSHHHHHHSSGLVPRGSHMIMEIPAIKALSRYAQWVIWKKERDTKIPYNPNNGKKASSTDPLAWGDIDEAQAGLVRYGANGLGFVLTKSDPFVFIDLDHVLDENKRVKCEWARQLLKEIKSYTEISPSGDGLHVVVSGKLPDYIKHKTKFDDGSALEVYESGRYMTITGEVFDGRDDIKELDLSILGEFAEHKIETKNAPVQIESATTLDDEAIIDLMKRKGQWPDAPKDGDDWSSLDMSFANRLAFWCGKDIERMDRIFRQSPLMRQKWDRPTAGSTYGRITLKKACDFVDSVYDPALRNESDCPFEPYNEEGGPRN 320 T 0.0011 VirE_N pdbhh T Viruses T 6a9x 2 B A ANK3_RAT ANKG, ANK-3,ANKYRIN-G DDWTEFSSEEIREARQAAASHAPS 24 T 1.6 CFIA_Pcf11 pdbhh F Eukaryota T 6aab 1 A A THAN_PODMA Thanatin GSKKPVPIIYCNRRTGKCQRM 21 T 2.6 YihI pdbhh F Eukaryota T 6aaf 2 B B TM184_SCHPO HFL1(386-409) MLQFEIDDEMEPLYNQAKQMRYGDYLEVLFQ 31 T 0.028 DUF6022 pdbpssm F Eukaryota T 6aaw 2 B B ACE-LEU-THR-PHE-STQ-GLU-TYR-DTR-GLN-LEU-CBA-MK8-SER-ALA-ALA XLTFXEYXQLXXSAAX 16 T 2.6 Nmad2 pdbhh F T 6aay 1 A A K1LVU1_9FLAO Bergeyella zoohelcum Cas13b (R1177A) mutant MENKTSLGNNIYYNPFKPQDKSYFAGYFNAAMENTDSVFRELGKRLKGKEYTSENFFDAIFKENISLVEYERYVKLLSDYFPMARLLDKKEVPIKERKENFKKNFKGIIKAVRDLRNFYTHKEHGEVEITDEIFGVLDEMLKSTVLTVKKKKVKTDKTKEILKKSIEKQLDILCQKKLEYLRDTARKIEEKRRNQRERGEKELVAPFKYSDKRDDLIAAIYNDAFDVYIDKKKDSLKESSKAKYNTKSDPQQEEGDLKIPISKNGVVFLLSLFLTKQEIHAFKSKIAGFKATVIDEATVSEATVSHGKNSICFMATHEIFSHLAYKKLKRKVRTAEINYGEAENAEQLSVYAKETLMMQMLDELSKVPDVVYQNLSEDVQKTFIEDWNEYLKENNGDVGTMEEEQVIHPVIRKRYEDKFNYFAIRFLDEFAQFPTLRFQVHLGNYLHDSRPKENLISDRRIKEKITVFGRLSELEHKKALFIKNTETNEDREHYWEIFPNPNYDFPKENISVNDKDFPIAGSILDREKQPVAGKIGIKVKLLNQQYVSEVDKAVKAHQLKQRKASKPSIQNIIEEIVPINESNPKEAIVFGGQPTAYLSMNDIHSILYEFFDKWEKKKEKLEKKGEKELRKEIGKELEKKIVGKIQAQIQQIIDKDTNAKILKPYQDGNSTAIDKEKLIKDLKQEQNILQKLKDEQTVREKEYNDFIAYQDKNREINKVRDRNHKQYLKDNLKRKYPEAPARKEVLYYREKGKVAVWLANDIKRFMPTDFKNEWKGEQHSLLQKSLAYYEQCKEELKNLLPEKVFQHLPFKLGGYFQQKYLYQFYTCYLDKRLEYISGLVQQAENFKSENKVFKKVENECFKFLKKQNYTHKELDARVQSILGYPIFLERGFMDEKPTIIKGKTFKGNEALFADWFRYYKEYQNFQTFYDTENYPLVELEKKQADRKRKTKIYQQKKNDVFTLLMAKHIFKSVFKQDSIDQFSLEDLYQSREERLGNQERARQTGERNTNYIWNKTVDLKLCDGKITVENVKLKNVGDFIKYEYDQRVQAFLKYEENIEWQAFLIKESKEEENYPYVVEREIEQYEKVRREELLKEVHLIEEYILEKVKDKEILKKGDNQNFKYYILNGLLKQLKNEDVESYKVFNLNTEPEDVNINQLKQEATDLEQKAFVLTYIANKFAHNQLPKKEFWDYCQEKYGKIEKEKTYAEYFAEVFKKEKEALIKLEHHHHHH 1232 T 0.32 HcgB pdbpssm F Bacteria T 6aci 1 A A B7UI21_ECO27 T3SS secreted effector NleB homolog SGRPSFAGKEYSLEPIDERTPILFQWFEARPERYEKGEVPILNTKEHPYLSNIINAAKIENERIIGVLVDGNFTYEQKKEFLNLENEHQNIAIIYRADVDFSMYDKKLSDIYLENIHKQESYPASERDNYLLGLLREELKNIPEGKDSLIESYAEKREHTWFDFFRNLAILKAGSLFTETGKTGCHNISPCSGCIYLDADMIITDKLGVLYAPDGIAVHVDCNDEIKSLENGAIVVNRSNHPALLAGLDIMKSKVDAHPYYDGLGKGIKRHFNYSSLHNYNAFCDFIEFKHENIIPNTSMYTSSSW 306 T 2.2E-05 Glyco_transf_88 unphh F Bacteria T 6aco 2 B B H2B1C_HUMAN succinyl peptide H2BK120 AVTXYTS 7 T 42 DUF5611 pdbhh F Eukaryota T 6ad9 2 B B PRGC1_HUMAN PGC1-ALPHA PSLLKKLLLAPA 12 T 13 Neurokinin_B pdbhh F Eukaryota F 6adq 5 E,Q I,U A0R1B6_MYCS2 Cytochrome c oxidase subunit CtaJ MSTALTHGLIGGVPLVLFAVLALIFLTRKGPHPDTYKMSDPWTHAPILWAAEEPREHGHGGHGHDSHGVVIGGGASGKW 79 T 0.08 VPDSG-CTERM pdbpssm F Bacteria T 6adu 1 A,B,C,D A,B,C,D V5TER4_9CYAN acyclase MKRKLIVAVVCLIFICFGINTPAHATSVVSIPINNAGFEDPFIEVVDDYTVDTPPGWTTYNPNNLVPEKRTTWTSNNGVGYVGPGTQFYNQLAPEGRNIGYIYLAQKPGSGVAGFEQILDATLEPDTNYTLKVDVGNLAGTFKGLSFAGFPGYRVELLAGDTVLAADHNNLFIKEGEFKTSTVTYTSTAKDLHLGQKLGIRLVNLLQDKFSGLDFDNVRLTAEPTEA 227 T 3.2 DUF642 pdbhh F Bacteria T 6af0 3 C C G2Q3X1_MYCTT Cdc73 protein SAASGRAGRGTLDPRLAQIYSGERRMGDRNTALRGIKPTDFSHVRKLAAPFVTRKPGAAPSAGVGASATLALNQ 74 T 0.095 DUF5529 unppercent F Eukaryota T 6afq 1 A A THAN_PODMA Thanatin GSKKPVPIIYCNRRTGKCQRF 21 T 2.6 YihI pdbhh F Eukaryota T 6agu 1 A A Q9L9J3_SALTY Transferase MIPPLNRYVPALSKNELVKTVTNRDIQFTSFNGKDYPLCFLDEKTPLLFQWFERNPARFGKNDIPIINTEKNPYLNNIIKAATIEKERLIGIFVDGDFFPGQKDAFSKLEYDYENIKVIYRNDIDFSMYDKKLSEIYMENISKQESMPEEKRDCHLLQLLKKELSDIQEGNDSLIKSYLLDKGHGWFDFYRNMAMLKAGQLFLEADKVGCYDLSTNSGCIYLDADMIITEKLGGIYIPDGIAVHVERIDGRASMENGIIAVDRNNHPALLAGLEIMHTKFDADPYSDGVCNGIRKHFNYSLNEDYNSFCDFIEFKHDNIIMNTSQFTQSSWARHVQ 336 T 2.2E-05 Glyco_transf_88 pdbhh F Bacteria T 6aht 1 A A A1BZ87_BACCE DNA-BINDING PROTEIN LSNISMSSSEIIDVLCENLNDGIWALRVLYAEGAMNKEKLWDYINQYHKDYQIENEKDYEGKKILPSRYALDIMTARLEGAGLISFKAIGRVRIYDVTDLGNVLIKELEKR 111 T 5E-05 DUF3116 pdbhh F Bacteria T 6aht 2 B B A1BYM8_BACCE DNA-BINDING PROTEIN ISMSSSEIIDVLCENLNDGIWALRVLYAEGAMNKEKLWDYINQYHKDYQIENEKDYEGKKILPSRYALDIMTARLEGAGLISFKAIGRVRIYDVTDLGNVLIKELEKRVEKNN 113 T 6.9E-05 DUF3116 unphh F Bacteria T 6ai4 1 A,B A,B A0A0D7C3R7_ECOLX Non-LEE encoded effector protein NleB MLSPIRTTFHNSVNIVQSSPSQTVSFAGKEYELKVIDEKTPILFQWFEPNPERYKKDEVPIVNTKQHPYLDNVTNAARIESDRMIGIFVDGDFSVNQKTAFSKLERDFENVMIIYREDVDFSMYDRKLSDIYHDIICEQRLRTEDKRDEYLLNLLEKELREISKAQDSLISMYAKKRNHAWFDFFRNLALLKAGEIFRSTYNTKNHGISFGEGCIYLDMDMILTGKLGTIYAPDGISMHVDRRNDSVNIENSAIIVNRSNHPALLEGLSFMHSKVDAHPYYDGLGKGVKKYFNFTPLHNYNHFCDFIEFNHPNIIMNTSQYTCSSW 326 T 2.5E-05 Glyco_transf_88 pdbhh F Bacteria T 6aif 2 B B CYSE_SALTY SAT,SERINE TRANSACETYLASE WHTFEYGDGI 10 T 3.2 Cyanate_lyase pdbhh F Bacteria T 6ak0 1 A A A0A493R6M6_9ACTN CYS-LEU-GLY-VAL-GLY-SER-CYS-VAL-ASP-PHE-ALA-GLY-CYS-GLY-TYR-ALA-VAL-VAL-CYS-PHE-DTR CLGVGSCVDFAGCGYAVVCFX 21 T 1.4 CCAP pdbhh F Bacteria T 6ak2 2 C,D D,E peptide inhibitor KSL-128018 SHWXXDI 7 T 7.8 DUF3950 pdbhh F T 6al5 1 A A CD19_HUMAN B-LYMPHOCYTE SURFACE ANTIGEN B4,DIFFERENTIATION ANTIGEN CD19,T-CELL SURFACE ANTIGEN LEU-12 EEPLVVKVEEGDNAVLQCLKGTSDGPTQQLTWSRESPLKPFLKLSLGLPGLGIHMRPLAIWLFIFNVSQQMGGFYLCQPGPPSEKAWQPGWTVNVEGSGELFRWNVSDLGGLGCGLKQRSSEGPSSPSGKLMSPKLYVWAKDRPEIWEGEPPCLPPRDSLNQSLSQDLTMAPGSTLWLSCGVPPDSVSRGPLSWTHVHPKGPKSLLSLELKDDRPARDMWVMETGLLLPRATAQDAGKYYCHRGNLTMSFHLEITARGSHHHHHH 265 T 0.00011 G6B unphh F Eukaryota T 6al7 1 A,B,C,D A,B,D,E A0A076NBW8_9CYAN HPIU5 MGSSHHHHHHSSGLVPRGSHMASTSVVSIPINNAGFEDPFIEVVDDYTVDTPPGWTTYNPNNLVPEKRTTWTSNNGVGYVGPGTQFYNQLAPEGRNIGYIYLAQKPGSGVAGFEQILDATLEPDTKYTLKVDVGNSGGEFQKISLAGFPGYRVELLAGDTVLAADHNNLYIKDGEFKTSTVTFTATPDNPYLDQKLGIRLINLLQGTFSGLDFDNVRLTVEPAQT 225 T 2.7 DUF642 pdbhh F Bacteria T 6al8 1 A,B,C,D A,B,D,E A0A076NBW8_9CYAN HPIU5 MGSSHHHHHHSSGLVPRGSHMASTSVVSIPINNAGFEDPFIEVVDDYTVDTPPGWTTYNPNNLVPEKRTTWTSNNGVGYVGPGTQFYNQLAPEGRNIGFIYLAQKPGSGVAGFEQILDATLEPDTKYTLKVDVGNSGGEFQKISLAGFPGYRVELLAGDTVLAADHNNLYIKDGEFKTSTVTFTATPDNPYLDQKLGIRLINLLQGTFSGLDFDNVRLTVEPAQT 225 T 3.5 DUF3868 unphh F Bacteria T 6alg 8 I N VNUN_BPHK0 Transcription termination factor nun VKKTIYVNPDSGQNRKVSDRGLTSRDRRRIARWEKRIAYALKNGVTPGFNAIDDGPEYKINEDPMDKVDKALATPFPRDVEKIEDEKYEDVMHRVVNHAHQRNPNKKWS 109 T 0.0082 N36 unphh T Viruses T 6aly 1 A A MED15_YEAST AUTONOMOUS REPLICATION REGULATORY PROTEIN 3,BASAL EXPRESSION ACTIVATOR PROTEIN 1,DEFECTIVE SILENCING SUPPRESSOR PROTEIN 4,MEDIATOR COMPLEX SUBUNIT 15,TRANSCRIPTION REGULATORY PROTEIN GAL11,TY INSERTION SUPPRESSOR PROTEIN 13 NNPLQQQSSQNTVPNVLNQINQIFSPEEQRSLLQEAIETCKNFEKTQLGSTMTEPVKQSFIRKYINQKALRKIQALRDVKNNNNANNNGSNL 92 T 0.022 Gliadin pdb F Eukaryota T 6am5 3 C C SER-MET-LEU-GLY-ILE-GLY-ILE-VAL-PRO-VAL SMLGIGIVPV 10 T 4.4 Dehydratase_MU pdbhh F T 6amt 3 C,F C,F MET-MET-TRP-ASP-ARG-GLY-LEU-GLY-MET-MET MMWDRGLGMM 10 T 6.8 RNA_pol_Rpb5_N pdbhh F T 6amu 3 C C MET-MET-TRP-ASP-ARG-GLY-LEU-GLY-MET-MET MMWDRGLGMM 10 T 6.8 RNA_pol_Rpb5_N pdbhh F T 6ana 1 A K anti Kappa VHH domain XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 122 F F F 6and 1 A K Anti-kappa VHH domain XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 121 F F F 6anf 1 A A Capped-strapped peptide XTPRQARAARAAXCX 15 T 13 BssS pdbhh F T 6ani 1 A,B A,K Anti-Kappa VHH domain XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 121 F F F 6anm 1 A A DLE-DPN-BE2-DAL XXXX 4 F F F 6ann 1 A,B,C,D,E A,B,C,D,E cyclic DLE-ZAE-BE2-DAL XXXX 4 F F F 6anw 1 A,B,C A,B,C A0A073KP86_9GAMM anti-CRISPR protein AcrF10 GSMTTFRIENVRIETINDFDMVKFDLVTDLGRVELAEHVNYDSEGDFKSVEYTDSNIRYNMVDELCSVFDLTDKPSLMPAIDYVTFAEIIEAVEEMLEA 99 T 3.9 DUF6156 unphh F Bacteria T 6anz 1 A A B4RQJ2_NEIG2 NEGOA.19190.A.B1 MAHHHHHHMKTSTIVFGGFFITDNGERIQIPILENPNIKEINNFFSVSNFEKKAGVLVFRIIPEPEFGNTELTIYFEKGYYLPIIQTILEDGDIEVKNLKTENYSGNTMEILGDVYPIEHISKNISIIQDIISEFIMKNKPITIMI 146 T 0.025 Imm1 unphh F Bacteria T 6apr 2 B I PEPSTATIN XVVXAX 6 T 1700 FAM60A pdbhh F F 6ar2 2 C,D C,D STK3_HUMAN ASP-GLY-TPO-MET-LYS-ARG EEEDGTMKRN 10 T 0.28 Fib_succ_major unp F Eukaryota T 6arz 1 A,B,C A,B,C L7P7L6_9CAUD PHAGE ANTI-CRISPR PROTEIN MEKKLSDAQVALVAAWRKYPDLRESLEEAASILSLIVFQAETLSDQANELANYIRRQGLEEAEGACRNIDIMRAKWVEVCGEVNQHGIRVYGDAIDRDVDLEHHHHHH 108 T 0.99 DUF1040 unphh T Viruses T 6as3 1 A,B,C,D A,B,C,D L7P7L6_9CAUD NHis AcrE1 protein HHHHHHMEKKLSDAQVALVAAWRKYPDLRESLEEAASILSLIVFQAETLSDQANELANYIRRQGLEEAEGACRNIDIMRAKWVEVCGEVNQYGIRVYGDAIDRDVD 106 T 0.13 GAF_3 pdbpssm T Viruses T 6as4 1 A,B,C A,B,C L7P7L6_9CAUD PHAGE ANTI-CRISPR PROTEIN HHHHHHMEKKLSDAQVALVAAWRKYPDLRESLEEAASILSLIVFQAETLSDQANELANYIRRQGLEEAEGACRNIDIMRAKWVEVCGEVNQHGIRVYGDAIDRDVD 106 T 0.99 DUF1040 unphh T Viruses T 6at5 3 C C CTG1B_HUMAN AUTOIMMUNOGENIC CANCER/TESTIS ANTIGEN NY-ESO-1,CANCER/TESTIS ANTIGEN 6.1,CT6.1,L ANTIGEN FAMILY MEMBER 2,LAGE-2 APRGPHGGAASGL 13 T 10 FTCD_C pdbhh F Eukaryota T 6atz 3 E,F E,F FIBB_HUMAN FIBRINOGEN BETA- 74CIT69-81 GGYRAXPAKAAT 12 T 1.5 AT_hook pdbhh F Eukaryota T 6au5 3 E,F E,F meditope XQFDLSTXRLK 11 T 21 DUF4180 pdbhh F T 6au8 2 B C BAG6_HUMAN BAG FAMILY MOLECULAR CHAPERONE REGULATOR 6,BCL2-ASSOCIATED ATHANOGENE 6,BAG-6,HLA-B-ASSOCIATED TRANSCRIPT 3,PROTEIN G3,PROTEIN SCYTHE EPWAAAVPPEWVPIIQQDIQSQRKVKPQPPLSDAYLSGMPAKR 43 T 8.9 DUF5928 pdbhh F Eukaryota T 6avf 4 D P CTG1B_HUMAN ALA-PRO-ARG-GLY-PRO-HIS-GLY-GLY-ALA-ALA-SER-GLY-LEU APRGPHGGAASGL 13 T 10 FTCD_C pdbhh F Eukaryota T 6avg 5 I,J Q,P CTG1B_HUMAN ALA-PRO-ARG-GLY-PRO-HIS-GLY-GLY-ALA-ALA-SER-GLY-LEU APRGPHGGAASGL 13 T 10 FTCD_C pdbhh F Eukaryota T 6avz 3 C E part of HopQ loop XXX 3 F F F 6awb 6 G B 30S ribosomal protein S1 ESFAQLFEXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 153 T 350 DUF5572 pdbhh F T 6awc 6 G B 30S ribosomal protein S1 ESFAQLFEXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 153 T 350 DUF5572 pdbhh F T 6awk 1 A A PLP-12 FVGGTSFD 8 T 0.36 BDV_M pdbhh F T 6awm 1 A A GLY-LEU-LEU-GLY-ILE-THR-ASP GLLGITD 7 T 0.63 YjcB pdbhh F F 6ax2 1 A A TX22A_MACGS MU-HXTX-MG2A,NEUROTOXIN MAGI-3 GGCIKWNHSCQTTTLKCCGKCVVCYCHTPWGTNCRCDRTRLFCTED 46 T 0.012 Toxin_9 pdb F Eukaryota T 6ax4 2 B C histidine N(tau)-cyclized Macrocycle 5b XLXST 5 T 150 RAD51_interact pdbhh F F 6axi 1 A A ASP-LEU-PHE-VAL-PRO-PRO-ILE-ASP DLFVPPID 8 T 7.7 DUF5651 pdbhh F T 6axk 3 C E CSP_PLAFA ACE-ASN-PRO-ASN-ALA-ASN-PRO-ASN-ALA-ASN-PRO-ASN XNPNANPNANPNAX 14 T 1.9 Cas_Cas7 pdbhh F Eukaryota F 6axl 3 E,F G,I CSP_PLAFA Peptide ACE-ASN-PRO-ASN-ALA-ASN-PRO-ASN-ALA-ASN-PRO-ASN-ALA-NH2 XNPNANPNANPNAX 14 T 1.9 Cas_Cas7 pdbhh F Eukaryota F 6axp 3 E,F E,F meditope XQFDLSTXRLK 11 T 21 DUF4180 pdbhh F T 6ay9 2 B B CPSF6_HUMAN CLEAVAGE AND POLYADENYLATION SPECIFICITY FACTOR 68 KDA SUBUNIT,CPSF 68 KDA SUBUNIT,PRE-MRNA CLEAVAGE FACTOR IM 68 KDA SUBUNIT,PROTEIN HPBRII-4/7 PVLFPGQPFGQP 12 T 0.89 MF_alpha pdbhh F Eukaryota T 6ayn 3 E,F E,F Cyclic meditope XQFDLSTXRLK 11 T 21 DUF4180 pdbhh F T 6az0 2 G G poly(UNK) XXXXXXXXXX 10 F F F 6aza 1 A A KPHAB_ACTTE ARG-CYS-LYS-THR-CYS-SER-LYS-GLY-ARG-CYS-ARG-PRO-LYS-PRO-ASN-CYS-GLY-NH2 RCKTCSKGRCRPKPNCGX 18 T 1.4 DUF35_N unphh F Eukaryota T 6azf 1 A A GLY-SER-PRO-LEU-PHE-ASP GSPLFD 6 T 0.17 Peptidase_C24 pdbhh F T 6azg 1 A A GLY-SER-PRO-LEU-PHE-ASP GSPLFD 6 T 0.17 Peptidase_C24 pdbhh F T 6azk 3 E,F E,F meditope QFDLSTXRLKX 11 T 19 DUF4180 pdbhh F T 6azl 3 E,F E,F meditope QFDLSTXRLKX 11 T 19 DUF4180 pdbhh F T 6azm 3 E,F F,E CSP_PLAFA Circumsporozoite protein NANP 5-mer NANPNANPNANPNANPNANP 20 T 3.2 PT unppercent F Eukaryota F 6azp 2 B B Q2G0X2_STAA8 Staphylococcal Peroxidase Inhibitor ANFLEHELSYIDVLLDKNADQATKDNLRSYFADKGLHSIKDIINKAKQDGFDVSKYEHVK 60 T 0.0076 Drf_FH3 pdbpssm F Bacteria T 6azt 2 B B ALA-ALA-ASN tetrahedral intermediate AAX 3 T 2500 zf-met pdbhh F F 6b0u 2 D,E D,E Synthetic peptide ATKAPAKKA ATKAPAKKA 9 T 26 SpecificRecomb pdbhh F F 6b12 2 B,C B,C Q4K3B5_PSEF5 Tni2 MISDFERIREDGKVIDENMTVDQMIALGWSPCRVVEARWRWQEQLLSVVNSRGLLAIVVPDRQHLAILWNDDDTGVAATLYVVSGDRQQQIRIADQLLINGQLEAGIYSWFEQFPQVSPSIFTCMFSRQRDQAMFRVDIDASTGDIVSIQHSR 153 T 0.49 Skp1_POZ unppercent F Bacteria T 6b17 1 A,B,C,D,E,F A,B,E,F,D,C Capped-strapped peptide XTPRQARAARAAXCX 15 T 13 BssS pdbhh F T 6b27 2 G,H,I,J,K,L G,H,I,J,K,L CAC1S_HUMAN CALCIUM CHANNEL,L TYPE,ALPHA-1 POLYPEPTIDE,ISOFORM 3,SKELETAL MUSCLE,VOLTAGE-GATED CALCIUM CHANNEL SUBUNIT ALPHA CAV1.1 EDEPEIPLSPRPRP 14 T 18 OGFr_III pdbhh F Eukaryota T 6b2z 6 HA,O P,e ATPASE SUBUNIT E,TRANSLOCASE OF THE INNER MEMBRANE PROTEIN 11 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 49 F F F 6b2z 8 JA,Q R,g ATP synthase subunit g XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 106 F F F 6b34 1 A A Tyrocidine A analogue D-PHE-BE2-PHE-D-PHE-ASN-GLN-TYR-VAL-ORN-LEU XXFXNQYVXL 10 T 3.2 MFP2b pdbhh F T 6b35 1 A A Tyrocidine A analogue D-PHE-BE2-PHE-D-PHE-ASN-LYS-TYR-VAL-ORN-LEU XXFXNKYVXL 10 T 7.9 MSA_2 pdbhh F T 6b3r 2 B,E,F B,D,F Piezo-type mechanosensitive ion channel component 1, unknown fragment XXXXXXXXXXXXXXXX 16 F F F 6b46 2 G,H I,J L7P7M1_9CAUD Anti-CRISPR protein AcrF1 GSMKFIKYLSTAHLNYMNIAVYENGSKIKARVENVVNGKSVGARDFDSTEQLESWFYGLPGSGLGRIENAMNEISRRENP 80 T 0.075 UXS1_N pdb T Viruses T 6b47 4 I K ACR30_BPD31 Anti-CRISPR protein AcrF2 GSMIAQQHKDTVAACEAAEAIAIAKDQVWDGEGYTKYTFDDNSVLIQSGTTQYAMDADDADSIKGYADWLDDEARSAEASEIERLLESVEEE 92 T 0.13 Transglycosylas unp T Viruses T 6b48 4 I K A0A073KP86_9GAMM Anti-CRISPR protein AcrF10 GSMTTFRIENVRIETINDFDMVKFDLVTDLGRVELAEHVNYDSEGDFKSVEYTDSNIRYNMVDELCSVFDLTDKPSLMPAIDYVTFAEIIEAVEEMLEA 99 T 3.9 DUF6156 unphh F Bacteria T 6b4e 2 C,D C,D NUP42_YEAST NUCLEAR PORE PROTEIN NUP42 GPSGSELADLAEETLKIFRANKFELGLVPDIPPPPALVA 39 T 14 DUF5767 unphh F Eukaryota T 6b4f 2 C,D C,D NUPL2_HUMAN Nucleoporin like 2 GPSGSIIATDNVLFTPRDKLTVEELEQFQSKKFTLGKIPLKPPPLELLNV 50 T 7.7 Arcadin_1 pdbhh F Eukaryota T 6b4g 2 B,C,D,E B,D,F,H AMO1_CHATD NUCLEAR PORE PROTEIN AMO1 GPHMGSPEFDGTLVRIWMPDGAPAYTADTEAEDPKVYEDEGVKRQWQSFLEKGRFEGGMPEVPPRREWCVWDF 73 T 3.4 BTHB pdbhh F Eukaryota T 6b4h 1 A,C D,B AMO1_CHATD NUCLEAR PORE PROTEIN AMO1 GPHMGSPEFDGTLVRIWMPDGAPAYTADTEAEDPKVYEDEGVKRQWQSFLEKGRFEGGMPEVPPRREWCVWDF 73 T 3.4 BTHB pdbhh F Eukaryota T 6b4i 2 B,D C,D NUPL2_HUMAN Nucleoporin like 2 GPSGSIIATDNVLFTPRDKLTVEELEQFQSKKFTLGKIPLKPPPLELLNV 50 T 7.7 Arcadin_1 pdbhh F Eukaryota T 6b4j 2 B,D C,D NUPL2_HUMAN Nucleoporin like 2 GPSGSIIATDNVLFTPRDKLTVEELEQFQSKKFTLGKIPLKPPPLELLN 49 T 10 Arcadin_1 pdbhh F Eukaryota T 6b5l 3 C A CSP_PLAFA PfCSP peptide 20: ASN-PRO-ASP-PRO-ASN-ALA-ASN-PRO-ASN-VAL NPDPNANPNVD 11 T 9.1 DUF6112 pdbhh F Eukaryota F 6b5m 3 C A CSP_PLAFA pfCSP peptide 21: ASN-PRO-ASP-PRO-ASN-ALA-ASN-PRO-ASN-VAL-ASP-PRO-ASN NPDPNANPNVDPNAN 15 T 0.7 Cas_Cas7 pdbhh F Eukaryota F 6b5n 3 C A CSP_PLAFA pfCSP peptide 25: ASN-VAL-ASP-PRO-ASN-ALA-ASN-PRO-ASN-VAL-ASP NVDPNANPNVDPNAN 15 T 0.025 PT unppercent F Eukaryota F 6b5o 3 C A CSP_PLAFA PfCSP peptide 29: ASN-PRO-ASN-ALA-ASN-PRO-ASN-ALA-ASN NANPNANPNANPNAN 15 T 1.6 Cas_Cas7 pdbhh F Eukaryota F 6b5p 1 A A CSP_PLAFA pfCSP peptide 20: ASN-PRO-ASP-PRO-ASN-ALA-ASN-PRO-ASN-VAL-ASP NPDPNANPNVD 11 T 9.1 DUF6112 pdbhh F Eukaryota F 6b5q 2 C,D D,E Peptidomimetic Inhibitors DI-591 XXXKX 5 T 2800 EF-hand_5 pdbhh F F 6b5r 3 C A CSP_PLAFA PfCSP peptide 21: ASN-PRO-ASP-PRO-ASN-ALA-ASN-PRO-ASN-VAL-ASP-PRO-ASN NPDPNANPNVDPNAN 15 T 0.7 Cas_Cas7 pdbhh F Eukaryota F 6b5s 3 C A CSP_PLAFA pfCSP peptide 25: ASN-VAL-ASP-PRO-ASN-ALA-ASN-PRO-ASN-VAL-ASP-PRO-ASN NVDPNANPNVDPNAN 15 T 0.025 PT unppercent F Eukaryota F 6b5t 1 A A CSP_PLAFA pfCSP peptide 29: ALA-ASN-PRO-ASN-ALA-ASN-PRO-ASN-ALA-ASN-PRO-ASN-ALA NANPNANPNANPNAN 15 T 1.6 Cas_Cas7 pdbhh F Eukaryota F 6b5w 1 A A V4RMX4_9CAUL Benenodin-1 GVGFGRPDSILTQEQAKPM 19 T 0.15 DUF5974 unphh F Bacteria T 6b67 2 D,E,F D,E,F cyclic peptide c(MpSIpYVA) MSIXVAX 7 T 6.4 DOR pdbhh F T 6b7l 1 A A immune modulator A MEKAANSIAKRVPLALPEAGLYQANLMSRDGDKATPRMIKDLDGLALVYPKGETVQHWGVWVDHQVGKVETNSQWLGQADQKADKDGIYPVQLIRNSERLGTSTALSSVTNDHNLITFQDQPVIDLQGKEIKRWVFDFTRTGTKFSDNSPIYSGFSGHVAVTALTTKAVTTASWSATDSDGFSSEMVGKVDTTNNGGKLTVAIEFPAAGCTLVGEGSATAGLSKLTMTGFGKCNFKQSAAATPIENLWNAALARAMDNRVAYVTTFTADAKKEALVIGFPDTNGLLITADKRLEHHHHHH 300 T 0.68 Choline_bind_1 pdbpssm F T 6b8h 6 O,SA e,s ATP synthase subunit e, mitochondrial XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 49 F F F 6b8h 8 Q,UA g,u AATP synthase subunit g XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 106 F F F 6b8h 17 DA,HB h,v ATP synthase subunit h XXXXXXXXXXXXXXXXXXXXX 21 F F F 6b8j 2 B C VAL-SEP-ARG-ARG VSRR 4 T 110 DUF1848 pdbhh F F 6b9l 2 E,F,G,H E,F,G,H peptide 135E2, (DUG)SAYPDSVPFR XSAYPDSVPFR 11 T 6.1 DUF5623 pdbhh F T 6b9l 3 I J HIS TAG CLEAVED OFF AHHHHA 6 T 85 DUF3399 pdbhh F F 6b9m 2 D D UHRF1_HUMAN INVERTED CCAAT BOX-BINDING PROTEIN OF 90 KDA,NUCLEAR PROTEIN 95,NUCLEAR ZINC FINGER PROTEIN NP95,HNP95,RING FINGER PROTEIN 106,RING-TYPE E3 UBIQUITIN TRANSFERASE UHRF1,TRANSCRIPTION FACTOR ICBP90,UBIQUITIN-LIKE PHD AND RING FINGER DOMAIN-CONTAINING PROTEIN 1,HUHRF1,UBIQUITIN-LIKE-CONTAINING PHD AND RING FINGER DOMAINS PROTEIN 1 ASPRTGKGKWKRKSAGGGPSRAGSPRRTSKKTKVEPYSLTA 41 T 6.4 PMAIP1 pdbpercent F Eukaryota T 6b9y 5 E D meditope XSQFDFCTRRLQSGGK 16 T 2.1 Lambda_CIII pdbhh F T 6bae 5 E D meditope XCQFDLSTRRLKCX 14 T 4.4 Flavi_NS1 pdbhh F T 6bah 5 E D meditope XSQFDXCTRRLQS 13 T 1.7 Lambda_CIII pdbhh F T 6bb4 3 C,F,I P,Q,R TAU_HUMAN NEUROFIBRILLARY TANGLE PROTEIN,PAIRED HELICAL FILAMENT-TAU,PHF-TAU TDHGAEIVYKSPVVSGDTSPRHL 23 T 0.37 Tmemb_cc2 unp F Eukaryota T 6bba 2 H,I,J,K,L,M,N H,I,J,K,L,M,N Acyldepsipeptide ADEP-28 XXXPXAP 7 T 520 zf-CCHC pdbhh F F 6bc8 2 B B REV3L_HUMAN PROTEIN REVERSIONLESS 3-LIKE,HREV3 MEDKKIVIMPCKCAPSRQLVQVWLQAKE 28 T 0.42 SAM_3 pdbhh F Eukaryota T 6bcd 2 B B REV3L_HUMAN PROTEIN REVERSIONLESS 3-LIKE,HREV3 MEDKKIVIMPCKCAPSRQLVQVWLQAKE 28 T 0.42 SAM_3 pdbhh F Eukaryota T 6bcr 2 C,D,G,H C,D,G,H BAIP2_HUMAN Insulin receptor substrate protein of 53 kDa, peptide (IRSp53) LSDSYSNTLPVRKS 14 T 2.4 Glycolipid_bind pdbhh F Eukaryota T 6bcy 2 C,D,G,H C,D,G,H BAIP2_HUMAN Insulin receptor substrate protein of 53 kDa, peptide (IRSp53) ATTENKTLPRSSS 13 T 5.7 EAR pdbhh F Eukaryota T 6bd1 2 C,D,G,H C,D,G,H BAIP2_HUMAN Insulin receptor substrate protein of 53 kDa, peptide (IRSp53) TLPRSSSMAAGLEK 14 T 27 EAR pdbhh F Eukaryota T 6bd2 2 C C BAIP2_HUMAN PROTEIN BAP2,FAS LIGAND-ASSOCIATED FACTOR 3,FLAF3,INSULIN RECEPTOR SUBSTRATE P53/P58,IRSP53/58,INSULIN RECEPTOR SUBSTRATE PROTEIN OF 53 KDA,INSULIN RECEPTOR SUBSTRATE P53 DSYSNTLPVRKSVTPKNSYATTENKTLPRSSSMAAGLE 38 T 0.1 EAR pdb F Eukaryota T 6bdu 1 A,B A,B PPRA_DEIRA PLEIOTROPIC PROTEIN PROMOTING DNA REPAIR MRSGSHHHHHHRSDITSLYKKAGLENLYFQGQTDGIYAAFDTLMSTAGVDSQIAALAASEADAGTLDAALTQSLQEAQGRWGLGLHHLRHEARLTDDGDIEILTDGRPSARVSEGFGALAQAYAPMQALDERGLSQWAALGEGYRAPGDLPLAQLKVLIEHARDFETDWSAGRGETFQRVWRKGDTLFVEVARPASAEAALSKAAWKVIASIKDRAFQRELMRRSEKDGMLGALLGARHAGAKANLAQLPEAHFTVQAFVQTLSGAAARNAEEYRAALKTAAAALEEYQGVTTRQLSEVLRHGLRES 307 T 9.2 Ldt_C unphh F Bacteria T 6bdv 3 C C Acetyl-Asp-Glu-Val-Asp-CMK XDEVDX 6 T 200 ResIII pdbhh F F 6be7 1 A A DDPT(DPR)(DAR)Q(DGN) DDPTXXQX 8 T 15 RhodobacterPufX pdbhh F F 6be9 1 A A T(DLY)NDT(DSG)(DPR) TXNDTXX 7 T 45 CBP_CCPA pdbhh F F 6ben 1 A A (DAR)Q(DPR)(DGN)R(DGL)PQ XQXXRXPQ 8 T 19 MDFI pdbhh F F 6beo 1 A A (DPR)PY(DHI)PKDL(DGN) XPYXPKDLX 9 T 2.6 Gemin6 pdbhh F T 6beq 1 A A AAR(DVA)(DPR)R(DLE)(DTH)PE AARXXRXXPE 10 T 3.8 RRT14 pdbhh F F 6ber 1 A A E(DVA)DP(DGL)(DHI)(DPR)N(DAL)(DPR) EXDPXXXNXX 10 F F F 6bes 1 A A (DAL)Q(DPR)(DCY)(DLY)DS(DTY)(DCY)P(DSN) XQXXXDSXXPX 11 F F F 6bet 1 A A H(DPR)(DVA)CIP(DPR)E(DLY)VC(DGL) HXXCIPXEXVCX 12 T 1.1 ARMET_N pdbhh F T 6beu 1 A A (DCY)N(DVA)(DPR)DVYC(DPR)(DSG)KY(DVA)(DPR) XNXXDVYCXXKYXX 14 T 1.5 CcmE pdbhh F T 6bew 1 A A (DHI)P(DAS)(DGN)(DSN)(DGL)P XPXXXXP 7 F F F 6bf3 1 A A QDP(DPR)K(2TL)(DAS) QDPXKXX 7 T 110 RAMP pdbhh F F 6bf4 1 A,D A,G D7S2G1_9HIV1 HIV-1 clade AE gp120 core VWRDADTTLFCASDAKAHETEVHNVWATHACVPTDPNPQEIHLVNVTENFNMWKNKMVEQMQEDVISLWDESLKPCVKLTGGSVIKQACPKVSFDPIPIHYCTPAGYVILKCNDKNFNGTGPCKNVSSVQCTHGIKPVVSTQLLLNGSLAEEEIIIRSENLTDNAKNIIVHLNKSVEINCTRPSNGGSGSGGDIRKAYCEIDGTEWNKTLTQVAEKLKEHFNKTIVYQPPSGGDLEITMHHFNCRGEFFYCNTTQLFNNSVGNSTIKLPCRIKQIINMWQGVGQAMYAPPISGAINCLSNITGILLTRDGGGNNRSNETFRPGGGNIKDNWRSELYKYKVVEIE 344 T 2.2E-50 GP120 unppercent T Viruses T 6bf5 1 A A QDP(DPR)K(DTH)(DAS) QDPXKXX 7 T 110 RAMP pdbhh F F 6bfj 3 C C Ac-Asp-Glu-Val-Asp-CMK XDEVDX 6 T 200 ResIII pdbhh F F 6bfk 3 E,F F,E AC-ASP-GLU-VAL-ASP-CMK XDEVDX 6 T 200 ResIII pdbhh F F 6bfl 3 C C AC-ASP-GLU-VAL-ASP-CMK XDEVDX 6 T 200 ResIII pdbhh F F 6bfo 3 C D AC-ASP-GLU-VAL-ASP-CMK XDEVDX 6 T 200 ResIII pdbhh F F 6bg0 3 E,F G,F AC-ASP-GLU-VAL-ASP-CMK XDEVDX 6 T 200 ResIII pdbhh F F 6bg1 3 C B AC-ASP-GLU-VAL-ASP-CMK XDEVDX 6 T 200 ResIII pdbhh F F 6bg4 3 E,F C,D AC-ASP-GLU-VAL-ASP-CMK XDEVDX 6 T 200 ResIII pdbhh F F 6bga 5 E E Velcro peptide YVVVPDGTGGGSGSG 15 T 0.54 CFIA_Pcf11 pdbhh F T 6bgg 1 A A CHD4_HUMAN CHD4 KVAPLKIKLGGF 12 T 3.3 DUF2577 pdbhh F Eukaryota T 6bgh 2 B B SMCA4_HUMAN Brd3_ET RSVKVKIKLGRK 12 T 5.5 ProQ_C pdbhh F Eukaryota T 6bgk 3 E,F F,H ACE-ASP-GLU-VAL-ASP-0QE XDEVDX 6 T 200 ResIII pdbhh F F 6bgq 3 E,F K,L Ac-Asp-Glu-Val-Asp-CMK XDEVDX 6 T 200 ResIII pdbhh F F 6bgr 3 C C Ac-Asp-Glu-Val-Asp-CMK XDEVDX 6 T 200 ResIII pdbhh F F 6bgs 3 C B Ac-Asp-Glu-Val-Asp-CMK XDEVDX 6 T 200 ResIII pdbhh F F 6bh9 3 E,F D,G Ac-Asp-Glu-Val-Asp-CMK XDEVDX 6 T 200 ResIII pdbhh F F 6bha 3 C C Ac-Asp-Glu-Val-Asp-CMK XDEVDX 6 T 200 ResIII pdbhh F F 6bi7 2 B,D,F,H B,D,F,H REV3L_HUMAN PROTEIN REVERSIONLESS 3-LIKE,HREV3 MEDKKIVIMPCKCAPSRQLVQVWLQAKE 28 T 0.42 SAM_3 pdbhh F Eukaryota T 6bij 3 C C FIBB_HUMAN Citrullinated Fibrinogen 72,74Cit69-81 GGYXAXPAKAAAT 13 T 3.3 AT_hook pdbhh F Eukaryota T 6bil 3 C C FIBB_HUMAN Fibrinogen beta 74cit69-81 GGYRAXPAKAAAT 13 T 3.3 AT_hook pdbhh F Eukaryota T 6bin 2 B C CO2A1_HUMAN Type II Collagen 1240Cit 1237-1249 QYMXADQAAGGLR 13 T 0.022 DUF2600 unphh F Eukaryota T 6bir 3 C C VIME_HUMAN Vimentin 424Cit419-431 SSLNLXETNLDSL 13 T 11 LRR_1 pdbhh F Eukaryota T 6bj5 2 B,D,F F,G,H SPI1_MYXVL SERPIN-1 LIPRNALTAIVANKPFMFLIYHKPTTTVLFMGTITKGEKVIYDTEGRDDVVSSV 54 F T Viruses T 6bj8 5 E C VAL-PRO-LEU-THR-GLU-ASP-ALA-GLU-LEU VPLTEDAEL 9 T 6.3 EST1_DNA_bind pdbhh F T 6bk8 24 X X Unknown protein fragment XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 219 F F F 6bk8 25 Y Y Unknown protein fragment XXXXXXXXXXXXXXXX 16 F F F 6bkj 2 E,F,G,H F,G,H,I Leupeptin XLLX 4 T 590 DNTTIP1_dimer pdbhh F F 6bl5 1 A A A7XXR5_9CAUD Head decoration protein DKIQLFRTIGRVQYWERVPRLHAYGVFALPFPMDPDVEWGNWFAGPHPKAFLVSVHPSGPKAGHVYPTDLSDPDSVANVIGMVLDGHDYEADHNVTVTLRAAVPIEYVQQGIEAPPLQPDPAVLNAAPQLKLKVIKGHYFFDYTR 145 T 0.77 TRI9 pdbpssm T Viruses T 6bl9 1 A A Sm2a toxin EETEEPIRHAKKNPSEGECKKACADAFANGDQSKIAKAENFKDYYCNCHIIIH 53 T 0.022 BSMAP pdbpssm F T 6bmt 2 B B A0A2A4GXB5_9STAP Hypothetical Protein GSTGSMKKTLVAGFAVAALSTGIFAVSNEANAQVTSQNGIILHDDSRMLDHELQYVDVLINPNANPQTKERLKAYFESQGLNTVSEIVQKAKQDGLDTSKYDHLI 105 T 0.046 Drf_FH3 unppssm F Bacteria T 6bnt 2 B B IRS1_HUMAN IRS-1 CHTDDGYMPMSPGVA 15 T 0.14 ComGF pdbhh F Eukaryota T 6bo3 1 A,B A,B Q6Q0L4_9VIRU Uncharacterized protein MGEVFKEVKEKFERYKFDVVYVDREYPVSSNNLNVFFEIGERNSFSGLLINEGQAVIDVLLLKKSHEGLSPIPGEGTGIQLSAGQILKFYNVPIAEIIVEYDPSNVSGVSSNVKLKGTIHPLFEVPSQISIENFQPTENYLIYSGFGTSLPQTYTIPANGYLIISITNTSTGNIGQITLTIGSTTMTFNLQTGENKIPVIAGTQITNMTLTSSSAILIYEEVIHHHHHH 229 T 6.4 PSP1 pdbhh T Viruses T 6bpi 2 B B MLY-SER-THR-E2G KSTX 4 T 150 CoaE pdbhh F F 6bqb 3 C P Q7K740_PLAF7 N-terminal junction peptide XKQPADGNPDPNANPX 16 T 5.1 Nup54 pdbhh F Eukaryota T 6bqn 4 D D 7B1 fab XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 106 F F F 6bqn 5 E E 7B1 fab XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 121 F F F 6bqn 6 F F 10D4 fab XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 115 F F F 6bqn 7 G G 10D4 fab XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 112 F F F 6bqt 2 C,F,I,L C,F,I,L BAIP2_HUMAN PROTEIN BAP2,FAS LIGAND-ASSOCIATED FACTOR 3,FLAF3,INSULIN RECEPTOR SUBSTRATE P53/P58,IRSP53/58,INSULIN RECEPTOR SUBSTRATE PROTEIN OF 53 KDA,INSULIN RECEPTOR SUBSTRATE P53 DSYSNTLPVRKSVTPKNSYATTENKTLPRSSS 32 T 20 Glycolipid_bind pdbhh F Eukaryota T 6bra 2 C S Phage display-optimized HIV-1 protease substrate SGIFLETS 8 T 3.7 DUF2016 pdbhh F T 6brs 3 D E unidentified Ferredoxin peptide XXXX 4 F F F 6buu 2 C,D F,G GSK3B_HUMAN GLY-ARG-PRO-ARG-THR-THR-ZXW-PHE-ALA-GLU GRPRTTXFAE 10 T 7.5 AT_hook pdbhh F Eukaryota T 6bvh 2 B I SFTI1_HELAN Trypsin inhibitor 1 GTCTRSIPPICNPN 14 T 0.0038 Bowman-Birk_leg pdb F Eukaryota T 6bvu 1 A A SFTI1_HELAN Trypsin inhibitor 1 HFRW-1 CTASIPPICHXRWR 14 T 0.052 Bowman-Birk_leg unp F Eukaryota T 6bvv 2 B B W_NIPAV Protein W SRNIHLLGRKTCLGRRVVQPGMFEDHPPTKKARVSMRRMSN 41 T 1.3 Paramyxo_P_V_N unphh T Viruses T 6bvw 1 A A SFTI1_HELAN Trypsin inhibitor 1 HFRW-3 CTASIPPICHXXXR 14 T 0.052 Bowman-Birk_leg unp F Eukaryota T 6bvx 1 A A SFTI1_HELAN Trypsin inhibitor 1 HFRW-2 CTHXXWPICFPDGR 14 T 0.052 Bowman-Birk_leg unp F Eukaryota T 6bvy 1 A A SFTI1_HELAN Trypsin inhibitor 1 HFRW-4 CTASIPPICXXXWR 14 T 0.052 Bowman-Birk_leg unp F Eukaryota T 6bw0 1 A C W_NIPAV Protein W SRNIHLLGRKTCLGRRVVQPGMFEDHPPTKKARVSMRRMSN 41 T 1.3 Paramyxo_P_V_N unphh T Viruses T 6bw1 1 A C W_HENDH Protein W SRSLNMLGRKTCLGRRVVQPGMFADYPPTKKARVLLRRMSN 41 T 8.1 Paramyxo_P_V_N unphh T Viruses T 6bw3 2 B,D B,D MECOM_HUMAN MYELODYSPLASIA SYNDROME 1 PROTEIN,MYELODYSPLASIA SYNDROME-ASSOCIATED PROTEIN 1 MRSKGRARKLAT 12 T 11 DUF3824 pdbhh F Eukaryota T 6bw4 2 B,D B,D PRD16_HUMAN PR DOMAIN-CONTAINING PROTEIN 16,TRANSCRIPTION FACTOR MEL1,MDS1/EVI1-LIKE GENE 1 MRSKARARKLAK 12 T 10 TCP pdbhh F Eukaryota T 6bw9 2 B B W_HENDH Protein W SRSLNMLGRKTCLGRRVVQPGMFADYPPTKKARVLLRRMSN 41 T 8.1 Paramyxo_P_V_N unphh T Viruses T 6bwa 2 B B W_HENDH Protein W SRSLNMLGRKTCLGRRVVQPGMFADYPPTKKARVLLRRMSN 41 T 8.1 Paramyxo_P_V_N unphh T Viruses T 6bwb 2 B B W_HENDH Protein W SRSLNMLGRKTCLGRRVVQPGMFADYPPTKKARVLLRRMSN 41 T 8.1 Paramyxo_P_V_N unphh T Viruses T 6bwz 1 A A FUS_HUMAN SYSGYS peptide from low-complexity domain of FUS SYSGYS 6 T 4.4 DUF6156 pdbhh F Eukaryota F 6bx3 4 E F SPP1_YEAST COMPLEX PROTEINS ASSOCIATED WITH SET1 PROTEIN SPP1,SET1C COMPONENT SPP1,SUPPRESSOR OF PRP PROTEIN 1 HGREFVNDIWSRLKTDEDRAVVKKMVEQTGHIDKFKKFGQLDFIDNNIVVKTDDEKEIFDQIVVRDMTLKTLEDDLQEVQEISLPLFKKKLELLEVYLGWLDNVYTEMRKLDDDAASHVECGKEDSKGTKRKKKKNSSRSRARKNICGYCSTYERIPCSVEEFVRDFGSNEEATKIHEVCTKWKCNRHLDWVSTNQEQYLQQIDSLESMQERLQHLIQARKKQLNIQYYEEILRRGL 237 T 0.0087 Mod_r pdbpssm F Eukaryota T 6bxg 2 B B LEU-ILE-ALA LIA 3 T 660 DUF2591 pdbhh F F 6bxp 3 C C ENV_HV1H2 HIV peptide RKV-Kyn RVKEKYQHLX 10 T 0.25 FAS_meander pdbhh T Viruses T 6bxq 1 A A ENV_HV1H2 HIV peptide RKV RVKEKYQHLW 10 T 0.25 FAS_meander pdbhh T Viruses T 6bxr 1 A A A0A140H546_TOXGO Mitochondrial association factor 1 SQTVDLSCLSGTTVRFFGPSHHFGGFTPLYDPAPDKRVATVDAGANALFIGGGGLNGQFAKTLLEEAEKHGIRLTPEELSQHSQRIQQSLLRRAVKSPGKLVELDTGVASPVFARSFGFVPVVPGLMWEESEVGPNVGVTFVHILKPEVTPYGNLNNNVMMYTVAPSGAAPDKTYSLAYKTTIAGVIGAAAAYNDTPAGQQYPVQGLRLPLLGGGIFRRNRSLESIGRANAEGTSLAITRYGPNFELQYMYDPSNAALHGLQEAESTYLASAA 273 T 0.5 FAP unp F Eukaryota T 6bxs 1 A,B,C A,B,C A0A193AUK9_TOXGO mitochondrial association factor 1 GSMGTPDPLTLRFTCLGDRNVIFFGPSGRQDGFTPLYDPSPSKRVATVDAGTYGLFIGGVGMNGEFADTIIEEARRNRIPLTATELSAESQEIQERLLHDAERQPGTLVEIDSGRFSRVFARSFAYVAIVPNTVWDESETGKNVGATFLHILKPEVTPHGNEMNDVMLYTVAPFGNASDSAYNMAYKATMLGIVGAVSEYNKTPWGEVKPVEAIRLPLLGAGHFRGRRGLHSIGRANAVAVEAAITRFDPRVELQFMYEPSDTALRGLMESERKYKFPQGD 281 T 0.052 DUF6479 unppercent F Eukaryota T 6bxt 1 A,B,C A,B,C A0A193AUK9_TOXGO mitochondrial association factor 1 GSMGTPDPLTLRFTCLGDRNVIFFGPSGRQDGFTPLYDPSPSKRVATVDAGTYGLFIGGVGMNGEFADTIIEEARRNRIPLTATELSAESQEIQERLLHDAERQPGTLVEIDSGRFSRVFARSFAYVAIVPNTVWDESETGKNVGATFLHILKPEVTPHGNEMNDVMLYTVAPFGNASDSAYNMAYKATMLGIVGAVSEYNKTPWGEVKPVEAIRLPLLGAGHFRGRRGLHSIGRANAVAVEAAITRFDPRVELQFMYEPSDTALRGLMESERKYKFPQGD 281 T 0.052 DUF6479 unppercent F Eukaryota T 6bxv 1 A A FUS_HUMAN FUS SYSSYGQS 8 T 6.2 Kinin pdbhh F Eukaryota F 6bxw 1 A A A0A140H546_TOXGO Mitochondrial association factor 1 GSMGSQTVDLSCLSGTTVRFFGPSHHFGGFTPLYDPAPDKRVATVDAGANALFIGGGGLNGQFAKTLLEEAEKHGIRLTPEELSQHSQRIQQSLLRRAVKSPGKLVELDTGVASPVFARSFGFVPVVPGLMWEESEVGPNVGVTFVHILKPEVTPYGNLNNNVMMYTVAPSGAAPDKTYSLAYKTTIAGVIGAAAAYNDTPAGQQYPVQGLRLPLLGGGIFRRNRSLESIGRANAEGTSLAITRYGPNFELQYMYDPSNAALHGLQEAESTYLASAAA 278 T 0.5 FAP unp F Eukaryota T 6bxx 1 A A ROA1_HUMAN hnRNPA1 GYNGFG 6 T 6.9 Orbi_VP3 pdbhh F Eukaryota F 6byj 2 G,H,I G,P,T TSTTATPPVSQASSTTTSTW O-GlcNac peptide TSTTATPPVSQASSTTTSTW 20 T 84 Polyoma_coat pdbhh F T 6byk 2 E,F,G,H G,J,K,R ATPPVSQASSTT O-GlcNac peptide ATPPVSQASSTT 12 T 39 Luteo_coat pdbhh F T 6byl 2 G,H,I G,P,T TSASTTVPVTTATTTTTSTW O-GlcNac peptide TSASTTVPVTTATTTTTSTW 20 T 45 YjbE pdbhh F T 6byz 2 C,D D,E ALA-ALA-ALA AAA 3 T 1200 RNase_HII pdbhh F F 6bz9 3 C C Ac-FLTD-CMK XFLTDX 6 T 280 p25-alpha pdbhh F F 6bzd 2 E,F G,K GlcNAcylated peptide TSTTATPPVSQASSTTTSTW 20 T 84 Polyoma_coat pdbhh F T 6bzm 1 A,B A,B NUP98_HUMAN Nuclear pore complex protein Nup98-Nup96 GFGNFGTS 8 T 0.097 SNRNP27 pdbhh F Eukaryota F 6bzp 1 A,B A,B FUS_HUMAN FUS, 75 KDA DNA-PAIRING PROTEIN, ONCOGENE FUS, ONCOGENE TLS, POMP75, TRANSLOCATED IN LIPOSARCOMA PROTEIN STGGYG 6 T 1.9 Pox_ser-thr_kin pdbhh F Eukaryota F 6c0f 31 EA x Brx1-associated peptide XXXXXXXXXXXXXXXXXXXXXXXXXXXX 28 F F F 6c0f 32 FA m EBNA1-BINDING PROTEIN HOMOLOG XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 74 F F F 6c0f 36 JA q MAINTENANCE OF KILLER PROTEIN 11 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 285 F F F 6c1d 4 H F Phalloidin AXXPAWX 7 T 1.6 Fe_hyd_lg_C pdbhh F F 6c1q 2 B L PMX53 XFXPXWR 7 T 2.6 Filo_VP24 pdbhh F F 6c1r 2 B L PMX53 XFXPXWR 7 T 2.6 Filo_VP24 pdbhh F F 6c23 2 B E JARD2_HUMAN JUMONJI/ARID DOMAIN-CONTAINING PROTEIN 2 SNARKRPRLQAQRKFAQSQPNSPSTTPVKIVEPLLPPPATQISDLSKRKPKTEDFLTFLCLRGSPALPNSMVYFGSSQDEEEVEEEDDETEDVKTATNNASSSCQSTPRKGKTHKHVHNGHVFNGSSRSTREKEPVQKHKSKEATPAKEKHSDHRADSRREQASANHPAAAPSTGSSAKGLAATHHHPPLHRSAQDLRKQVSKVNGVTRMSSLGAGVTSAKKMREVRPSPSKTVKYTATVTKGAVTYTKAKRELVKDTKPNHHKPSSAVNHTISGKTESSNAKTRKQVLSLGGASKSTGPAVNGLKVSGRLNPKSCTKEVGGRQLREGLQLREGLRNSKRRLEEAHQA 348 T 0.054 Actin_micro pdbpercent F Eukaryota T 6c23 6 G O JARID2-substrate AAARKFA 7 T 85 CFTR_R pdbhh F F 6c23 8 J Z SUZ12 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 135 F F F 6c23 9 L B JARD2_HUMAN JUMONJI/ARID DOMAIN-CONTAINING PROTEIN 2 RKRPRLQAQRKFAQSQPNSPSTTPVKIVEPLLPPPATQISDLSKRKPKTEDFLTFLCLRGSPALPNSMVYFGSSQDEEEVEEEDDETEDVKTATNNASSSCQSTPRKGKTHKHVHNGHVFNGSSRSTREKEPVQKHKSKEATPAKEKHSDHRADSRREQASANHPAAAPSTGSSAKGLAATHHHPPLHRSAQDLRKQVSKVNGVTRMSSLGAGVTSAKKMREVRPSPSKTVKYTATVTKGAVTYTKAKRELVKDTKPNHHKPSSAVNHTISGKTESSNAKTRKQVLSLGGASKSTGPAVNGLKVSGRLNPKSCTKEVGGRQLREGLQLREGLRNSKRRLEEAHQA 345 T 0.053 Actin_micro pdbpercent F Eukaryota T 6c24 2 B B JARD2_HUMAN JUMONJI/ARID DOMAIN-CONTAINING PROTEIN 2 RKRPRLQAQRKFAQSQPNSPSTTPVKIVEPLLPPPATQISDLSKRKPKTEDFLTFLCLRGSPALPNSMVYFGSSQDEEEVEEEDDETEDVKTATNNASSSCQSTPRKGKTHKHVHNGHVFNGSSRSTREKEPVQKHKSKEATPAKEKHSDHRADSRREQASANHPAAAPSTGSSAKGLAATHHHPPLHRSAQDLRKQVSKVNGVTRMSSLGAGVTSAKKMREVRPSPSKTVKYTATVTKGAVTYTKAKRELVKDTKPNHHKPSSAVNHTISGKTESSNAKTRKQVLSLGGASKSTGPAVNGLKVSGRLNPKSCTKEVGGRQLREGLQLREGLRNSKRRLEEAHQA 345 T 0.053 Actin_micro pdbpercent F Eukaryota T 6c24 3 C E JARD2_HUMAN JUMONJI/ARID DOMAIN-CONTAINING PROTEIN 2 RKRPRLQAQRKFAQSQPNSPSTTPVKIVEPLLPPPATQISDLSKRKPKTEDFLTFLCLRGSPALPNSMVYFGSSQDEEEVEEEDDETEDVKTATNNASSSCQSTPRKGKTHKHVHNGHVFNGSSRSTREKEPVQKHKSKEATPAKEKHSDHRADSRREQASANHPAAAPSTGSSAKGLAATHHHPPLHRSAQDLRKQVSKVNGVTRMSSLGAGVTSAKKMREVRPSPSKTVKYTATVTKGAVTYTKAKRELVKDTKPNHHKPSSAVNHTISGKTESSNAKTRKQVLSLGGASKSTGPAVNGLKVSGRLNPKSCTKEVGGRQLREGLQLREGLRNSKRRLEEAHQA 345 T 0.053 Actin_micro pdbpercent F Eukaryota T 6c24 7 H O JARID2-substrate AAARKFA 7 T 85 CFTR_R pdbhh F F 6c24 9 K Z SUZ12 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 135 F F F 6c3f 1 A,B A,B ILE-TYR-LYS-VAL-GLU-ILE IYKVEI 6 T 15 DUF6519 pdbhh F F 6c3g 1 A,B A,B LYS-ALA-LEU-GLY-ILE-SER KALGIS 6 T 0.55 Tat pdbhh F T 6c3r 1 A,B A,B POLN_CRPVC Cricket paralysis virus 1A protein INSLEELAAQELIAAQFEGNLDGFFCTFYVQSKPQLLDLESECYCMDDFDCGCDRIKREEELRKLIFLTSDVYGYNFEEWKGLVWKFVQNYCPEHRYGSTFGNGLLIVSPRFFMDHLDWFQQWKLVSSNDECRAFLRKRTQ 141 T 7.1 AAA_lid_6 pdbhh T Viruses T 6c3s 1 A A TYR-THR-ILE-ALA-ALA-LEU YTIAAL 6 T 29 TDH pdbhh F F 6c3t 1 A,B A,B ALA-ALA-ASP-THR-TRP-GLU AADTWE 6 T 3.6 NRN1 pdbhh F F 6c48 2 C,D F,C MYBB_HUMAN B-MYB,MYB-LIKE PROTEIN 2 APMSSAWKTVACGGTRDQLFMQEKARQLLGRL 32 T 14 Atracotoxin pdbhh F Eukaryota T 6c4a 1 A,B,C,D,E,F,G,H A,B,C,D,E,F,G,H ACEA_MYCTU ICL1,ISOCITRASE,ISOCITRATASE,METHYLISOCITRATE LYASE,MICA MHHHHHHLVPRGSHMSVVGTPKSAEQIQQEWDTNPRWKDVTRTYSAEDVVALQGSVVEEHTLARRGAEVLWEQLHDLEWVNALGALTGNMAVQQVRAGLKAIYLSGWQVAGDANLSGHTYPDQSLYPANSVPQVVRRINNALQRADQIAKIEGDTSVENWLAPIVADGEAGFGGALNVYELQKALIAAGVAGSHWEDQLASEKKXGHLGGKVLIPTQQHIRTLTSARLAADVADVPTVVIARTDAEAATLITSDVDERDQPFITGERTREGFYRTKNGIEPCIARAKAYAPFADLIWMETGTPDLEAARQFSEAVKAEYPDQMLAYNCSPSFNWKKHLDDATIAKFQKELAAMGFKFQFITLAGFHALNYSMFDLAYGYAQNQMSAYVELQEREFAAEERGYTATKHQREVGAGYFDRIATTVDPNSSTTALTGSTEEGQFH 442 T 1.8E-47 ICL unp F Bacteria T 6c4c 1 A,B,C,D,E,F,G,H A,B,C,D,E,F,G,H ACEA_MYCTU ICL1,ISOCITRASE,ISOCITRATASE,METHYLISOCITRATE LYASE,MICA MHHHHHHLVPRGSHMSVVGTPKSAEQIQQEWDTNPRWKDVTRTYSAEDVVALQGSVVEEHTLARRGAEVLWEQLHDLEWVNALGALTGNMAVQQVRAGLKAIYLSGWQVAGDANLSGHTYPDQSLYPANSVPQVVRRINNALQRADQIAKIEGDTSVENWLAPIVADGEAGFGGALNVYELQKALIAAGVAGSHWEDQLASEKKXGHLGGKVLIPTQQHIRTLTSARLAADVADVPTVVIARTDAEAATLITSDVDERDQPFITGERTREGFYRTKNGIEPCIARAKAYAPFADLIWMETGTPDLEAARQFSEAVKAEYPDQMLAYNCSPSFNWKKHLDDATIAKFQKELAAMGFKFQFITLAGFHALNYSMFDLAYGYAQNQMSAYVELQEREFAAEERGYTATKHQREVGAGYFDRIATTVDPNSSTTALTGSTEEGQFH 442 T 1.8E-47 ICL unp F Bacteria T 6c4o 1 A,B A,B THR-ILE-ALA-ALA-LEU-LEU-SER TIAALLS 7 T 100 DUF6169 pdbhh F F 6c4x 1 A,B,C,D,E,F,G,H A,B,C,D,E,F,G,H cross-alpha amyloid-like membrane peptide alpha-AmMEM XSKLLLLLIILSEALHLAILLLIKWGX 27 T 2.9 GRP pdbhh F T 6c4y 1 A,B,C,D,E,F,G,H,I,J,K,L,M,N,O,P,Q,R A,B,C,D,E,F,G,H,K,I,J,L,M,N,O,P,Q,R Cross-alpha Amyloid-like Structure alphaAmG XSKLLELLRKLGEALHKAIELLEKWGX 27 T 2.1 BssS pdbhh F T 6c4z 1 A,B,C,D,E,F,G,H,I,J,K,L,M,N,O,P,Q,R A,B,C,D,P,O,R,Q,E,F,G,H,L,K,N,M,J,I Cross-alpha Amyloid-like Structure alphaAmG - low resolution XSKLLELLRKLGEALHKAIELLEKWGX 27 T 2.1 BssS pdbhh F T 6c50 1 A,AA,AB,AC,AD,AE,AF,AG,AH,AI,B,BA,BB,BC,BD,BE,BF,BG,BH,BI,C,CA,CB,CC,CD,CE,CF,CG,CH,D,DA,DB,DC,DD,DE,DF,DG,DH,E,EA,EB,EC,ED,EE,EF,EG,EH,F,FA,FB,FC,FD,FE,FF,FG,FH,G,GA,GB,GC,GD,GE,GF,GG,GH,H,HA,HB,HC,HD,HE,HF,HG,HH,I,IA,IB,IC,ID,IE,IF,IG,IH,J,JA,JB,JC,JD,JE,JF,JG,JH,K,KA,KB,KC,KD,KE,KF,KG,KH,L,LA,LB,LC,LD,LE,LF,LG,LH,M,MA,MB,MC,MD,ME,MF,MG,MH,N,NA,NB,NC,ND,NE,NF,NG,NH,O,OA,OB,OC,OD,OE,OF,OG,OH,P,PA,PB,PC,PD,PE,PF,PG,PH,Q,QA,QB,QC,QD,QE,QF,QG,QH,R,RA,RB,RC,RD,RE,RF,RG,RH,S,SA,SB,SC,SD,SE,SF,SG,SH,T,TA,TB,TC,TD,TE,TF,TG,TH,U,UA,UB,UC,UD,UE,UF,UG,UH,V,VA,VB,VC,VD,VE,VF,VG,VH,W,WA,WB,WC,WD,WE,WF,WG,WH,X,XA,XB,XC,XD,XE,XF,XG,XH,Y,YA,YB,YC,YD,YE,YF,YG,YH,Z,ZA,ZB,ZC,ZD,ZE,ZF,ZG,ZH A1,G3,N1,T3,11,73,e1,k3,r1,x3,A2,G4,N2,T4,12,74,e2,k4,r2,x4,A3,H1,N3,U1,13,81,e3,l1,r3,A4,H2,N4,U2,14,82,e4,l2,r4,B1,H3,O1,U3,21,83,f1,l3,s1,B2,H4,O2,U4,22,84,f2,l4,s2,B3,I1,O3,V1,23,91,f3,m1,s3,B4,I2,O4,V2,24,92,f4,m2,s4,C1,I3,P1,V3,31,93,g1,m3,t1,C2,I4,P2,V4,32,94,g2,m4,t2,C3,J1,P3,W1,33,a1,g3,n1,t3,C4,J2,P4,W2,34,a2,g4,n2,t4,D1,J3,Q1,W3,41,a3,h1,n3,u1,D2,J4,Q2,W4,42,a4,h2,n4,u2,D3,K1,Q3,X1,43,b1,h3,o1,u3,D4,K2,Q4,X2,44,b2,h4,o2,u4,E1,K3,R1,X3,51,b3,i1,o3,v1,E2,K4,R2,X4,52,b4,i2,o4,v2,E3,L1,R3,Y1,53,c1,i3,p1,v3,E4,L2,R4,Y2,54,c2,i4,p2,v4,F1,L3,S1,Y3,61,c3,j1,p3,w1,F2,L4,S2,Y4,62,c4,j2,p4,w2,F3,M1,S3,Z1,63,d1,j3,q1,w3,F4,M2,S4,Z2,64,d2,j4,q2,w4,G1,M3,T1,Z3,71,d3,k1,q3,x1,G2,M4,T2,Z4,72,d4,k2,q4,x2 Cross-alpha Amyloid-like Structure alphaAmS XSKLLELLRKLSEALHKAIELLEKWGX 27 T 3.1 BssS pdbhh F T 6c51 1 A,B,C,D A,C,B,D Cross-alpha Amyloid-like Structure alphaAmL XSKLLELLRKLLEALHKAIELLEKWGX 27 T 2.3 Antimicrobial19 pdbhh F T 6c52 1 A,B,C,D A,B,C,D Cross-alpha Amyloid-like Structure alphaTet XSKLEELRRKLQEAEHKARELQEKWGX 27 T 0.0094 DMPK_coil pdb F T 6c5l 44 SA,ZC BJ,DJ 50S ribosomal protein L10 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 130 F F F 6c5x 4 G G IL6RB_MOUSE GP130 peptide fragment TVEXSTVVHS 10 T 7.8 DUF2536 pdbhh F Eukaryota T 6c88 1 A,B A,B VAL-ALA-VAL-HIS-VAL-PHE VAVHVF 6 T 64 DUF5709 pdbhh F F 6c90 2 B B ZCHC8_HUMAN TRAMP-LIKE COMPLEX RNA-BINDING FACTOR ZCCHC8 SGDPIPDMSKFATGITPFEFENMAESTGMYLRIRSLLKNSPRNQQKNKKASE 52 T 6.5 DUF2621 pdbhh F Eukaryota T 6cae 56 ID,JD,KD A,B,C NOSO-95179 antibiotic KXAGXPHKX 9 T 30 Chisel pdbhh F T 6cb1 25 Y d Ribosome biogenesis protein YTM1 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 465 F F F 6cb1 33 GA m rRNA-processing protein EBP2 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 102 F F F 6cb1 39 MA x BRX1 associated peptide XXXXXXXXXXXXXXXXXXXXXXXXXXXX 28 F F F 6cb9 1 A A AALQSS AALQSS 6 T 7.3 CCD48 pdbhh F F 6cbi 2 G,H,I,J H,I,J,K CDN1A_HUMAN GLY-ARG-LYS-ARG-ARG-GLN-DAB-SER-MET-THR-GLU-PHE-TYR-HIS GRKRRQXSMTEFYH 14 T 1.4 HD_assoc pdbhh F Eukaryota T 6cbz 3 C,D C,D grip peptide AILHRLLQ 8 T 0.0019 SRC-1 pdbhh F T 6cce 8 I G poly(UNK) XXXXXXXXXXXXXXXXXXX 19 F F F 6cct 2 B B Tetrapeptide PTLV 4 T 100 DUF5972 pdbhh F F 6ccu 2 B B Short peptide PHRV 4 T 97 DUF924 pdbhh F F 6ccv 6 G G Unknown Peptide XXXXXXXXXXXXXXXXX 17 F F F 6cd8 2 C,D C,D Tetrapeptide PSRV PSRV 4 T 140 DUF659 pdbhh F F 6cd9 2 B B Tetrapeptide PSRW PSRW 4 T 22 DUF5123 pdbhh F F 6cdc 2 B C Tetrapeptide PGLW PGLW 4 T 22 DUF4746 pdbhh F F 6cdg 2 B B Hexapeptide PGLWKS PGLWKS 6 T 0.43 Herpes_TK_C pdbhh F T 6cdm 3 C,F C,F Q2N0S5_9HIV1 HIV fusion peptide (512-519) AVGIGAVF 8 T 4.2 DUF3918 pdbhh T Viruses F 6cdo 3 C C Q2N0S5_9HIV1 HIV-1 fusion peptide 512-519 AVGIGAVF 8 T 4.2 DUF3918 pdbhh T Viruses F 6cdp 3 C C Q2N0S5_9HIV1 HIV-1 fusion peptide 512-519 AVGIGAVF 8 T 4.2 DUF3918 pdbhh T Viruses F 6ce7 4 E P INSR_HUMAN IR QILKELEESSFRKTFEDYLHNVVFVPRPSR 30 T 0.00017 DUF4998 unphh F Eukaryota T 6ce9 2 C,H M,P INSR_HUMAN IR QILKELEESSFRKTFEDYLHNVVFVPRPSR 30 T 0.00017 DUF4998 unphh F Eukaryota T 6ceb 2 C,H M,P INSR_HUMAN IR QILKELEESSFRKTFEDYLHNVVFVPRPSR 30 T 0.00017 DUF4998 unphh F Eukaryota T 6cej 1 A A CDN1A_HUMAN CDK-INTERACTING PROTEIN 1, MELANOMA DIFFERENTIATION-ASSOCIATED PROTEIN 6, MDA-6, P21 GRKRRQTSMTDFYH 14 T 2.4 INCA1 pdbhh F Eukaryota T 6cen 2 B D ACE-GLY-VAL-NLE-ARG-ILE-NH2 XGVXRIX 7 T 47 DUF4656 pdbhh F F 6cew 1 A,B A,B AMMAAA AMMAAA 6 T 170 DUF2967 pdbhh F F 6cf4 1 A A TADBP_HUMAN NFGTFS NFGTFS 6 T 0.19 HU-CCDC81_bac_1 pdbhh F Eukaryota F 6cf6 2 C,D C,D RN146_HUMAN RNF146 NLARESSADGADS 13 T 47 DUF2788 pdbhh F Eukaryota T 6cfa 1 A A peptide PaAMP1R3 PMARNKKLLKKLRLKIAFK 19 T 7.9 RR_TM4-6 pdbhh F T 6cfb 1 A A A0A384E130_9METZ barrettide A DVSPCFCVEDETSGAKTCVPDNCDASRGTNP 31 T 8.4 NRF pdbhh F Eukaryota T 6cfh 1 A,B A,B TADBP_HUMAN TDP-43 SWGMMGMLASQ 11 T 0.29 Glucosaminidase unppercent F Eukaryota T 6cfw 4 D I I6U847_9EURY MBH subunit MFGYWDPLYFIIVFIIGLILAYLLNLWAKKSGMGTREVGEGTKIFISGEDPEKVIPGFEHLEGYYTGRNTMWGLVNGVKKFFATLKNDHTGLLPDYVSYLLMTTAFILVILLLRG 115 T 0.00077 Oxidored_q3 pdbpssm F Archaea T 6cfw 8 H E I6V287_9EURY MBH subunit MKRALGFLSLLVIFASLLVALSPEYGIKFGVGGEDWLKYRYTDNYYIEHGIEEVGGTNIVTDIVFDYRGYDTLGEATVLFTAIAGAVALLRPWRREENE 99 T 0.002 DUF2106 pdbhh F Archaea T 6cg3 1 A A ORN-LEU-VAL-PHI-PHE-ALA-GLU-ASP-ORN-ALA-ILE-ILE-EZY-LEU-ORN-VAL XLVXFAEDXAIIXLXV 16 T 0.35 Beta-APP pdbhh F T 6cg4 1 A,B,C A,B,C ORN-CYS-VAL-PHE-PHE-CYS-GLU-ASP-ORN-ALA-ILE-ILE-EZY-LEU-ORN-VAL XCVFFCEDXAIIXLXV 16 T 2.5 Endothelin pdbhh F T 6cg5 1 A,B,C A,B,C ORN-CYS-VAL-PHE-PHE-CYS-GLU-ASP-ORN-ALA-ILE-ILE-EZY-LEU-ORN-VAL XCVFFCEDXAIIXLXV 16 T 2.5 Endothelin pdbhh F T 6cgi 1 A,B,C,D A,B,C,D A0A0H3NMP8_SALTS Type III secretion system effector protein SNAPSSGSANFAGVEYPLLPLDQHTPLLFQWFERNPSRFGENQIPIINTQQNPYLNNIINAAIIEKERTIGVLVDGNFSAGQKKALAKLEKQYENIKVIYNSDLDYSMYDKKLSDIYLENIAKIEAQPANVRDEYLLGEIKKSLNEVLKNNPEESLVSSHDKRLGHVRFDFYRNLFLLKGSNAFLEAGKHGCHHLQPGGGCIYLDADMLLTGKLGTLYLPDGIAVHVSRKGNSMSLENGIIAVNRSEHPALKKGLEIMHSKPYGDPYIDGVCGGLRHYFNCSIRHNYEEFCNFIEFKHEHIFMDTSSLTISSWR 314 T 1.9E-05 Glyco_transf_88 unphh F Bacteria T 6cgw 1 A A JZTX5_CHIGU BETA/KAPPA-TRTX-CG2A, JINGZHAOTOXIN-5, JINGZHAOTOXIN-V, JZTX-V, PEPTIDE F8-15.73 YCQKWXWTCDSKRACCEGLRCKLWCRKEI 29 T 0.0016 Conotoxin unppercent F Eukaryota T 6ch7 4 D G Q2N0S6_9HIV1 ENV POLYPROTEIN AENLWVTVYYGVPVWKDAETTLFCASDAKAYETEKHNVWATHACVPTDPNPQEIHLENVTEEFNMWKNNMVEQMHTDIISLWDQSLKPCVKLTPLCVTLQCTNVTNNITDDMRTELKNCSFNMTTELRDKKQKVYSLFYRLDVVQINENQGNRSNNSNKEYRLINCNTSAITQACPKVSFEPIPIHYCAPAGFAILKCKDKKFNGTGPCPSVSTVQCTHGIKPVVSTQLLLNGSLAEEEVMIRSENITNNAKNILVQFNTPVQINCTRPNNNTRKSIRIGPGQAFYATGDIIGDIRQAHCNVSKATWNETLGKVVKQLRKHFGNNTIIRFANSSGGDLEVTTHSFNCGGEFFYCNTSGLFNSTWISNTSVQGSNSTGSNDSITLPCRIKQIINMWQRIGQAMYAPPIQGVIRCVSNITGLILTRDGGSTNSTTETFRPGGGDMRDNWRSELYKYKVVKIEPLGVAPTRCKRRVVGREKR 479 T 3.4E-54 GP120 pdbpercent T Viruses T 6cha 1 A,D A,E CTRA_BOVIN ALPHA-CHYMOTRYPSIN A CGVPAIQPVLSGL 13 T 2 CaM_bind pdbhh F Eukaryota T 6chg 6 G J H3 RTMQ 4 T 320 zf-CCHC pdbhh F F 6cht 2 C,F,I,L C,F,I,L PA2G4_HUMAN CELL CYCLE PROTEIN P38-2G4 HOMOLOG,HG4-1,ERBB3-BINDING PROTEIN 1 VQDAELKALLQSSASRKTQK 20 T 0.25 TraC unp F Eukaryota T 6cit 4 D D NS2_MUMIP NS2 GGSYSTVDEMTKKFGTLTIHD 21 T 0.33 DUF6118 unppssm T Viruses T 6civ 1 A C CDN1A_HUMAN p21 GRKRRQXSMTEFYH 14 T 1.4 HD_assoc pdbhh F Eukaryota T 6cix 1 A B CDN1A_HUMAN p21 GRKRRQKSMTEFYH 14 T 1.6 HD_assoc pdbhh F Eukaryota T 6cjd 1 A A A0A0H3NF83_SALTS TYPE III SECRETION SYSTEM EFFECTOR PROTEIN ORGC GHMVSLSARAAMLNNMDSAPLSNGGDVDLYDAFYQRLLALPESASSETLKDSIYQEMNAFKDPNSGDSAFVSFEQQTAMLQNMLAKVEPGTHLYEALNGVLVGSMNAQSQMTSWMQEIILSGGENKEAIDW 131 T 0.0082 AAA_35 pdbpercent F Bacteria T 6cka 1 A,B A,B A0A0H2UWN8_STRP3 Paratox MLYIDEFKEAIDKGYILGDTVAIVRKNGKIFDYVLPHEKVRDDEVVTVERVEEVMVELDKLEHHHHHH 68 T 0.019 Mesothelin unppercent F Bacteria T 6ckz 3 C C ACE-1MH-ASP-B3L-PHE-1U8 XXDXFX 6 T 910 SEC-C pdbhh F F 6cl0 3 C C ACE-1MH-ASP-PF5-PHE-1U8 XXDXFX 6 T 160 DUF5125 pdbhh F F 6cl1 3 E,F E,F ACE-1MH-ASP-B3L-PHE-1U8 ACEXDXFX 8 T 16 Arabinose_Isome pdbhh F T 6cl2 3 E,F E,F ACE-1MH-ASP-PF5-PHE-1U8 XXDXFX 6 T 160 DUF5125 pdbhh F F 6cl5 1 A,B,C,D,E,F A,B,C,D,E,F Q9KW03_PSEAI TAIL FIBER PROTEIN SGSEFVTAGMALAATDIPGLDASKLVSGVLAEQRLPVFARGLATAVSNSSDPNTATVPLMLTNHANGPVAGRYFYIQSMFYPDQNGNASQIATSYNATSEMYVRVSYAANPSIREWLPWQRCDIGGSFTKTTDGSIGNGVNINSFVNSGWWLQSTSEWAAGGANYPVGLAGLLIVYRAHADHIYQTYVTLNGSTYSRCCYAGSWRPWRQNWDDGNFDPASYLPKAGFTWAALPGKPATFPPSGHNHDTSQITSGILPLARGGLGANTAAGARNNIGAGVPATASRALNGWWKDNDTGLIVQWMQVNVGDHPGGIIDRTLTFPIAFPSACLHVVPTVKEVGRPATSASTVTVADVSVSNTGCVIVSSEYYGLAQNYGIRVMAIGY 384 T 0.0049 H_lectin pdbpssm F Bacteria T 6cl6 1 A,B,C,D,E,F A,B,C,D,E,F G3XD71_PSEAE Tail fiber protein SGSVTAGMALAATDIPGLDASKLVSGVLAEQRLPVFARGLATAVSNSSDPNTATVPLMLTNHANGPVAGRYFYIQSMFYPDQNGNASQIATSYNATSEMYVRVSYAANPSIREWLPWQRCDIGGSFTKEADGELPGGVNLDSMVTSGWWSQSFTAQAASGANYPIVRAGLLHVYAASSNFIYQTYQAYDGESFYFRCRHSNTWFPWRRMWHGGDFNPSDYLLKSGFYWNALPGKPATFPPSAHNHDVGQLTSGILPLARGGVGSNTAAGARSTIGAGVPATASLGASGWWRDNDTGLIRQWGQVTCPADADASITFPIPFPTLCLGGYANQTSAFHPGTDASTGFRGATTTTAVIRNGYFAQAVLSWEAFGR 372 T 0.6 Big_2 pdbpercent F Bacteria T 6cn8 2 B B Rufomycin I XXXAXLX 7 T 450 DUF4372 pdbhh F F 6cnl 2 M,N,O,P,Q,R,S,T,U,V,W,X M,O,X,V,R,T,W,P,N,U,S,Q PGAM5 Multimerization Motif Peptide GPGVWDPNWDRREP 14 T 1.9 IL17R_fnIII_D2 pdbhh F T 6cnu 1 A A JzTx-V(D) XXXXXXXXXXXXXXXXXXGXXXXXXXXXXXX 31 F F F 6co4 2 B B PSMD1_HUMAN 26S PROTEASOME REGULATORY SUBUNIT RPN2,26S PROTEASOME REGULATORY SUBUNIT S1,26S PROTEASOME SUBUNIT P112 GPGSQEPEPPEPFEYIDD 18 T 38 AgrD pdbhh F Eukaryota T 6cou 1 A A CSP2_STRPN CSP-2 AMRISRIILXFLFLRKK 17 T 0.95 DUF5841 unphh F Bacteria T 6cp8 1 A,B A,B A0A2A2CAY5_ECOLX CONTACT-DEPENDENT INHIBITOR A SNSFEVSSLPDANGKNHITAVKGDAKIPVDKIELYMRGKASGDLDSLQAEYNSLKDARISSQKEFAKDPNNAKRMEVLEKQIHNIERSQDMARVLEQAGIVNTASNNSMIMDKLLDSAQGATSANRKTSVVVSGPNGNVRIYATWTILPDGTKRLSTVTGTFK 163 T 0.016 DUF1090 pdb F Bacteria T 6cp8 2 C,D C,D A0A2A2C800_ECOLX CdiI SNAMINVNSTAKDIEGLESYLANGYVEANSFNDPEDDALECLSNLLVKDSRGGLSFCKKILNSNNIDGVFIKGSALNFLLLSEQWSYAFEYLTSNADNITLAELEKALFYFYCAKNETDPYPVPEGLFKKLMKRYEELKNDPDAKFYHLHETYDDFSKAYPLNN 164 T 0.06 DUF4007 pdbpssm F Bacteria T 6cp9 1 A,C,E,G A,C,E,G B5Y0C2_KLEP3 FILAMENTOUS HAEMAGGLUTININ FAMILY PROTEIN VPEITTAQTIANSVVDAKKFDYLFGKATGNSHTLDRTNQLALEMKRLGVADDINGHAVLAEHFTQATKDSNNIVKKYTDQYGSFEIRESFFIGPSGKATVFESTFEVMKDGSHRFITTIPKNGVTK 126 T 1.9 Exog_C pdbhh F Bacteria T 6cp9 2 B,D,F,H B,D,F,H CdiI MFIENKPGEIELLSFFESEPVSFERDNISFLYTAKNKCGLSVDFSFSVVEGWIQYTVRLHENEILHNSIDGVSSFSIRNDNLGDYIYAEIITKELINKIEIRIRPDIKIKSSSVIR 116 T 11 Imm50 pdbhh F T 6cpd 1 A,B A,B A0A452CSS7_9RHIZ PmoD SMGNMCMVMFGYDMIHITVFQPDKSRSEYCDEIPATGRTIMAFDIENPAFRDLPLELRIIRDPLTPVLPTGEKELDALTELHLPAKKYSKGTFSVEHNFANNGHYIGLVTLTRESGQQETAQFKFMVG 128 T 0.016 PKD_4 pdbpssm F Bacteria T 6csu 1 A,C B,D CE152_HUMAN CEP152 MGALEELRGQYIKAVKKIKCDMLRYIQESKERAAEMVKAEVLRERQETARKMRK 54 T 25 Viral_cys_rich pdbhh F Eukaryota T 6csu 2 B,D C,A CEP63_HUMAN CEP63 ACLNTRFLEEEELRSHHILERLDAHIEELKRESEKTVRQFTALK 44 T 0.021 HalX pdb F Eukaryota T 6ct4 1 A A O06514_ECOLX MERP PROTEIN PMKKLKLALRLAAKIAPVW 19 T 0.041 Mfp-3 unphh F Bacteria T 6ct8 1 A A G3XD71_PSEAE R2-type pyocin MHHHHHHSSGVDLGTENLYFQSNAGSFTKEADGELPGGVNLDSMVTSGWWSQSFTAQAASGANYPIVRAGLLHVYAASSNFIYQTYQAYDGESFYFRCRHSNTWFPWRRMWHGGDFNPSDYLLKSGFYWNALPGKPATFPPSAHNHDVGQLTSGILPLARGGVGSNTAAGARSTIGAGVPATASLGASGWWRDNDTGLIRQWGQVTCPADADASITFPIPFPTLCLGGYANQTSAFHPGTDASTGFRGATTTTAVIRNGYFAQAVLSWEAFGR 273 T 20 Neuraminidase pdbhh F Bacteria T 6ctg 1 A A AFP_CENMR CM-P1 SRSELIVHQRX 11 T 5.7 Nbs1_C unphh F Eukaryota T 6cu2 1 A A G3XD71_PSEAE R2-type pyocin MHHHHHHSSGVDLGTENLYFQSNAGSFTKEADGELPGGVNLDSMVTSGWWSQSFTAQAASGANYPIVRAGLLHVYAASSNFIYQTYQAYDGESFYFRCRHSNTWFPWRRMWHGGDFNPSDYLLKSGFYWNALPGKPATFPPSAHNHDVGQLTSGILPLARGGVGSNTAAGARSTIGAGVPATASLGASGWWRDNDTGLIRQWGQVTCPADADASITFPIPFPTLCLGGYANQTSAFHPGTDASTGFRGATTTTAVIRNGYFAQAVLSWEAFGR 273 T 20 Neuraminidase pdbhh F Bacteria T 6cuc 1 A A DKTX_CYRSC TAU-TRTX-HS1A,DOUBLE-KNOT TOXIN,DKTX GDCAKEGEVCSWGKKCCDLDNFYCPMEFIPHCKKYKPYVPVTTSFNCAKEGEVCGWGSKCCHGLDCPLAFIPYCEKYRGRND 82 T 0.071 Conotoxin_I2 pdb F Eukaryota T 6cvz 1 A,B,C A,B,C RFWD3_HUMAN RING FINGER AND WD REPEAT DOMAIN-CONTAINING PROTEIN 3,RING FINGER PROTEIN 201 GSPSSQGQHKHKYHFQKTFTVSQAGNCRIMAYCDALSCLVISQPSPQASFLPGFGVKMLSTANMKSSQYIPMHGKQIRGLAFSSYLRGLLLSASLDNTIKLTSLETNTVVQTYNAGRPVWSCCWCLDEANYIYAGLANGSILVYDVRNTSSHVQELVAQKARCPLVSLSYMPRAASAAFPYGGVLAGTLEDASFWEQKMDFSHWPHVLPLEPGGCIDFQTENSSRHCLVTYRPDKNHTTIRSVLMEMSYRLDDTGNPICSCQPVHTFFGGPTCKLLTKNAIFQSPENDGNILVCTGDEAANSALLWDAASGSLLQDLQTDQPVLDICPFEVNRNSYLATLTEKMVHIYKWE 351 T 0.002 WD40_2 pdb F Eukaryota T 6cwp 2 C F VAL-GLU-TYR-THR-LYS-HIS VEYTKH 6 T 16 DUF5428 pdbhh F T 6cxg 3 E,F A,C 10V1S glycopeptide XATKTNSKREKTXDNHVTIXRSIPWYTYRWLPNGSGSGXA 40 T 5.3 Peptidase_C54 pdbhh F T 6cxi 4 Q,R,S,T S,T,U,V Tropomyosin XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 127 F F F 6cxj 4 Q,R,S,T S,T,U,V Tropomyosin XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 127 F F F 6cxn 1 A A Hexapeptide HRFLRH HRFLRHX 7 T 6.8 DUF3877 pdbhh F F 6cxp 1 A A Hexapeptide HRFLRH HRFLRHX 7 T 6.8 DUF3877 pdbhh F F 6cxq 1 A A Hexapeptide HRFLRH HRFLRHX 7 T 6.8 DUF3877 pdbhh F F 6cxr 1 A A Hexapeptide HRFLRH HRFLRHX 7 T 6.8 DUF3877 pdbhh F F 6czo 2 B,D B,D Q05C46_HUMAN CASC5 protein GAMGHSSILKPPRSPLQDLRGGNETVQESNALRNKKNSRRVSFADTIKVFQTESHMKIVRKS 62 T 2.3 Consortin_C pdbhh F Eukaryota T 6d01 3 I,J I,J CSP_PLAFA NANP5 NANPNANPNANPNANPNANP 20 T 3.2 PT unppercent F Eukaryota F 6d02 1 A,AA,B,BA,C,CA,D,DA,E,EA,F,FA,G,H,I,J,K,L,M,N,O,P,Q,R,S,T,U,V,W,X,Y,Z A,a,B,b,C,c,D,d,E,e,F,f,G,H,I,J,K,L,M,N,O,P,Q,R,S,T,U,V,W,X,Y,Z alpha-Amyloid peptide alphaAmL XSKLLELLRKLLEALHKAIELLEKWGX 27 T 2.3 Antimicrobial19 pdbhh F T 6d0x 3 C C CSP_PLAFA NANP3 NANPNANPNANP 12 T 0.48 Cas_Cas7 pdbhh F Eukaryota F 6d0y 3 C B PRGC2_HUMAN PPARGC-1-BETA,PGC-1-RELATED ESTROGEN RECEPTOR ALPHA COACTIVATOR XSEEALPASGKSKXEAMDFDSLLKEAQQSLH 31 T 0.074 HALZ pdbpssm F Eukaryota T 6d11 3 E E CSP_PLAFA NANP5 NANPNANPNANPNANPNANP 20 T 3.2 PT unppercent F Eukaryota F 6d29 3 C C THR-SER-MET-SER-PHE-VAL-PRO-ARG-PRO-TRP TSMSFVPRPW 10 T 0.99 LSM_int_assoc pdbhh F T 6d2b 3 C C LEU-SER-ASP-SER-THR-ARG-ASP-VAL-THR-TRP LSDSTARDVTW 11 T 47 Alpha_TIF pdbhh F T 6d2c 1 A,B A,B A0A084JZF2_9FLAO Ulvan lyase MRKLKYNTTRVILMIAFISLSACSSEDAMIEEEQVIPDPDPVAQTDEDTGPVVDCTNQGTNPTRDTDIPNPRNIGDIDDRSCYANYSESSILGKFWGIYNITDGSNHMDAPNTLQPRIERSLSRSQATGAGSYARFRGVLRILEVGDTGTFSSSGSYFMQAKGKHTGGGGSPDPAICLYRAHPVYGDDGNGNQVQVSFDIWREQINFRGGSGSAGRTEVFLKNVLKNEQIDIELEVGFRDDPNNPGQTLHYADAKIGGEEFNWNIPEPERGIESGIRYGAYRVKGGRAQFRWANTSYTKDEVN 303 T 0.031 DUF4999 pdbhh F Bacteria T 6d2h 1 A A SER-ARG-PHE-GLU-LEU-ILE-VAL-HIS-GLN-ARG-NH2 SRFELIVHQRX 11 T 7.5 Ribosomal_S10 pdbhh F T 6d2r 3 C C GLY-SER-PHE-ASP-TYR-SER-GLY-VAL-HIS-LEU-TRP GSFDYSGVHLW 11 T 3.6 DUF2399 pdbhh F T 6d2t 3 C C LEU-ALA-LEU-LEU-THR-GLY-VAL-ARG-TRP LALLTGVRW 9 T 3.6 TMEM252 pdbhh F T 6d2u 1 A A DAB-VAL-ARG-THR-ARG-LYS-GLY-ARG-ARG-ILE-NOR-ILE-DPR-PRO XVRTRKGRRIXIXP 14 T 0.35 DUF2835 pdbhh F T 6d37 1 A A ALA-TYR-ALA-GLN-TRP-LEU-ALA-ASP-DAL-GLY-PRO-ALA-SER-DAL-NVA-PRO-PRO-PRO-SER XAYAQWLADXGPASXXPPPSX 21 T 5.6 Sec16_C pdbhh F T 6d3o 3 C,D C,D HH4 alpha/beta-Peptide NCDIHVXXEWXCFXR 15 T 9.4 PHP_C pdbhh F T 6d3u 1 A,B A,B A0A084JZF2_9FLAO Ulvan lyase MRKLKYNTTRVILMIAFISLSACSSEDAMIEEEQVIPDPDPVAQTDEDTGPVVDCTNQGTNPTRDTDIPNPRNIGDIDDRSCYANYSESSILGKFWGIYNITDGSNHMDAPNTLQPRIERSLSRSQATGAGSYARFRGVLRILEVGDTGTFSSSGSYFMQAMGKHTGGGGSPDPAICLYRAHPVYGDDGNGNQVQVSFDIWREQINFRGGSGSAGRTEVFLKNVLKNEQIDIELEVGFRDDPNNPGQTLHYADAKIGGEEFNWNIPEPERGIESGIRYGAYRVKGGRAQFRWANTSYTKDEVN 303 T 0.031 DUF4999 pdbhh F Bacteria T 6d3x 2 C,D C,D SFTI1_HELAN SFTI-1 GRCYKSKPPICFPD 14 T 0.052 Bowman-Birk_leg unp F Eukaryota T 6d3y 2 B C SFTI1_HELAN SFTI-1 GRCTKSRPPICFPD 14 T 0.011 Bowman-Birk_leg pdb F Eukaryota T 6d3z 2 B C SFTI1_HELAN SFTI-1 GRCYKSRPPICFPN 14 T 0.052 Bowman-Birk_leg unp F Eukaryota T 6d40 2 B C SFTI1_HELAN SFTI-1 GRCYKSIPPICFPD 14 T 0.052 Bowman-Birk_leg unp F Eukaryota T 6d53 1 A C Q81AN8_BACCR Hemolysin II GSHMDNQKALEEQMNSINSVNDKLNKGKGKLSLSMNGNQLKATSSNAGYGISYEDKNWGIFVNGEKVYTFNEKSTVGNISNDINKLNIKGPYIEIKQI 98 T 0.0039 CE2_N pdbpercent F Bacteria T 6d5f 1 A,B,C,D,E,F,G,H,I,J,K,L,M,N,O,P,Q,R,S,T,U,V,W,X,Y,Z A,B,C,D,E,F,G,H,I,J,K,L,M,N,O,P,Q,R,S,T,U,V,W,X,Y,Z Fimbrial protein MARKRTSKNDPLRMYLNYVRKLQTMGDAYDESAKYRIANFENGFKSLHMVENEFKQYLANVIDEAIKSGASPQDLPYVNEIKLALMKIFTSWLKYSNEKLGANEIAINVAGTATMTLTENLYGTRVSCEEAVSLINSIFAVWVGVEPFEAEEREGACLVTPRSPLPPVPISSPTGFSAPIQEVLQAKSPEEIIGVKGGA 199 T 0.00082 Sulf_coat_C pdbhh F T 6d5z 1 A D Q81AN8_BACCR Hemolysin II GSHMDNQKALEEQMNSINSVNDKLNKGKGKLSLSMNGNQLKATSSNAGYGISYEDKNWGIFVNGEKVYTFNEKSTVGNISNDINKLNIKGPYIEIKQI 98 T 0.0039 CE2_N pdbpercent F Bacteria T 6d6v 4 D E A0A0U8TRG9_TETTH Telomerase holoenzyme Teb2 subunit MSNRVQGGFDNNSGNNQSAQKQQAEKIPQITVPLNCFMINQIVKAAKENPQAHSGNHYEWYGAFENAIITAKFEFLQSINDSPKIMGKLSDSTGCIEVVIQKSKMSDELPEFVQAYEIELQNNGNRHKYVRAMLKMRKNAQIQLLYFSIVNDANEISRHGLDLCLRYLQRKHGIEDFMHMTNDKAHNNHNASAQKVHYQIDRNQQPKEQVLELMRQILKHNPNDQIPKSKIIEFFQSQLNQVQINQILQQLVSANEIFSVGSDNYLLNV 269 T 0.012 HSM3_N pdb F Eukaryota T 6d6v 7 G G Telomerase-associated protein 50 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 157 F F F 6d7a 1 A,B A,B A1E348_TOXGO Perforin-like protein 1 SNAVGLTPQDLSALTGVTRNLPKQLTQATQVAWSGPPPGFAKCPGGQVVILGFAMHLNFKEPGTDNFRIISCPPGREKCDGVGTASSETDEGRIYILCGEEPINEIQQVVAESPAHAGASVLEASCPDETVVVGGFGISVRGGSDGLDSFSIESCTTGQTICTKAPTRGSEKNFLWMMCVDKQYPGLRELVNVAELGSHGNANKRAVNSDGNVDVKCPANSSIVLGYVMEAHTNMQFVRDKFLQCPENASECKMTGKGVDHGMLWLFDRHALFGWIICKTVNEPAMHVATDVGKAKGNGKKKKGRKGKNKTNAPNEVEEGQQLGADSPSQVSVPADADSGPTSKTMSSLKLAPVKLLDL 359 T 10 Cutinase pdb F Eukaryota T 6d7k 4 D,H D,H Q27RN3_METSR Methane monooxygenase hydroxylase, MmoD SNAMAHSAEPTTEASRILIHSDARYEAFTVDLDYMWRWEILRDGEFVQEGCSLSFDSSRKAVAHVLSHFKRQDEAAQRPGDNSAEIKRLLQSLGTPIPVNEQNDSTKNELAQPE 114 T 1.9 DUF1508 pdbhh F Bacteria T 6d7y 1 A A Hemagglutinin IKTVLDTAQAPYKGSTVIGHALSKHAGRHPEIWGKVKGSMSGWNEQAMKHFKEIVRAPGEFRPTMNEKGITFLEKRLIDGRGVRLNLDGTFKGFID 96 T 35 Tcp10_C pdbhh F T 6d7y 2 B B immune protein MKELFEVIFEGVNTSRLFFLLKEIESKSDRIFDFNFSEDFFSSNVNVFSELLIDSFLGFNGDLYFGVSMEGFSVKDGLKLPVVLLRVLKYEGGVDVGLCFYMNDFNSAGKVMLEFQKYMNGISADFGFENFYGGLEPASDQETRFFTNNRLGPLL 155 T 0.017 Psb28 pdbpssm F T 6d80 1 A,B,C,D,E,F F,H,G,I,J,K Saposin A XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 81 F F F 6d8c 3 K,L,M N,O,P PHAD3_AMAPH Phalloidin PAWXAXC 7 T 2.7 CSN7a_helixI pdbhh F Eukaryota F 6d94 2 B B MED1_HUMAN ACTIVATOR-RECRUITED COFACTOR 205 KDA COMPONENT,ARC205,MEDIATOR COMPLEX SUBUNIT 1,PEROXISOME PROLIFERATOR-ACTIVATED RECEPTOR-BINDING PROTEIN,PPAR-BINDING PROTEIN,THYROID HORMONE RECEPTOR-ASSOCIATED PROTEIN COMPLEX 220 KDA COMPONENT,TRAP220,THYROID RECEPTOR-INTERACTING PROTEIN 2,TRIP-2,VITAMIN D RECEPTOR-INTERACTING PROTEIN COMPLEX COMPONENT DRIP205,P53 REGULATORY PROTEIN RB18A VSSMAGNTKNHPMLMNLLKDNPAQ 24 T 12 HEAT pdbhh F Eukaryota T 6da1 2 C C serine-rich region (SRR) peptide PSXDSXDXEDXPAALWX 17 T 2.6 SAS-6_N pdbhh F T 6dat 2 E,F E,F serine-rich region (SRR) peptide PSXDSXDXEDXPAALWX 17 T 2.6 SAS-6_N pdbhh F T 6dc8 3 C P TAU_HUMAN NEUROFIBRILLARY TANGLE PROTEIN,PAIRED HELICAL FILAMENT-TAU,PHF-TAU RENAKAKTDHGAEIVYKSPVVSGDTSPRHLX 31 T 0.37 Tmemb_cc2 unp F Eukaryota T 6dc9 3 C,F P,Q TAU_HUMAN NEUROFIBRILLARY TANGLE PROTEIN,PAIRED HELICAL FILAMENT-TAU,PHF-TAU RENAKAKTDHGAEIVYKSPVVSGDTSPRHL 30 T 0.37 Tmemb_cc2 unp F Eukaryota T 6dca 3 C,F,I,L P,Q,R,S TAU_HUMAN NEUROFIBRILLARY TANGLE PROTEIN,PAIRED HELICAL FILAMENT-TAU,PHF-TAU RENAKAKTDHGAEIVYKSPVVSGDTSPRHLX 31 T 0.37 Tmemb_cc2 unp F Eukaryota T 6dde 6 F D DAMGO YXGXX 5 T 100 FANCL_d1 pdbhh F F 6ddf 5 E D DAMGO YXGXX 5 T 100 FANCL_d1 pdbhh F F 6dec 9 Q Q unidentified XXXXXX 6 F F F 6dec 10 R R unidentified XXXXXXXXX 9 F F F 6dei 2 C,D C,D DSE3_YEAST DAUGHTER SPECIFIC EXPRESSION PROTEIN 3 SNAFGGTLKLKKRLESVPELFLHD 24 T 4 DUF3805 pdbhh F Eukaryota T 6dex 1 A A Q75DL0_ASHGO ABR011WP SNAERALLQLVVEDDAKALVFVLGQDARRYFEEELPASPFEFPSPQAVANSRQNVGVMFLDKLQYLYMYLTKLEVDEAPEYRTLVVYGLEQLLGAGGELDADQVRLASLIYNTAFRVRVRHGAAVRFVAHGAPHAQLQQLEAHWRLFT 148 T 0.099 CutC pdb F Eukaryota T 6dex 2 B B SHU2_ASHGO Suppressor of hydroxyurea sensitivity protein 2 MAETNFNYSKLLRNLVTEDNVLNEVVVSFLYQLFPRDLFVRAFSLLESADMFIYVWMPTPKEADELLESLYNGTPLYRPIVRPRGPDDRPVCVDLDHWFCSCTEFAATCRPHLVGDTPLSDALFRPTEAADPDDCFGMLAGLQHLRADPEKLMCEHLFAFAILLQTDLRVLRHFSTGPGAQVFVLGITSIDEWLKLHLNVV 201 T 1 SWIM pdbpssm F Eukaryota T 6df1 3 C A LEU-PTR LXL 3 T 220 DUF3744 pdbhh F F 6df2 3 C,F D,C LEU-PTR-LEU LXL 3 T 220 DUF3744 pdbhh F F 6dfd 1 A,B A,B CNNM3_HUMAN ANCIENT CONSERVED DOMAIN-CONTAINING PROTEIN 3,CYCLIN-M3 GPLGSSEDYRDTVVKRKPASLMAPLKRKEEFSLFKVSDDEYKVTISPQLLLATQRFLSREVDVFSPLRMSEKVLLHLLKHPSVNQEVRFDESNRLATHHYLYQRSQPVDYFILILQGRVEVEIGKEGLKFENGAFTYYGVSALMVPSSVHQSPVSSLQPIRHDLQPDPGDGTHSSMYCPDYTVRALSDLQLIKVTRLQYLNALMATRAQNLPQSPENTDLQMMPGSQTRLLGEKTTTAAGSSHSRPGVPVEGSPGRNPGV 260 T 0.0053 cNMP_binding pdbhh F Eukaryota T 6dfg 1 A,E,I A,C,D Q2N0S6_9HIV1 ENV POLYPROTEIN AENLWVTVYYGVPVWKDAETTLFCASDAKAYETEKHNVWATHACVPTDPNPQEIHLENVTEEFNMWKNNMVEQMHEDIISLWDQSLKPCVKLTPLCVTLQCTNVTNNITDDMRGELKNCSFNMTTELRDKKQKVYSLFYRLDVVQINENQGNRSNNSNKEYRLINCNTSAITQACPKVSFEPIPIHYCAPAGFAILKCKDKKFNGTGPCPSVSTVQCTHGIKPVVSTQLLLNGSLAEEEVIIRSENITNNAKNILVQLNTPVQINCTRPNNNTVKSIRIGPGQAFYYTGDIIGDIRQAHCNVSKATWNETLGKVVKQLRKHFGNNTIIRFAQSSGGDLEVTTHSFNCGGEFFYCNTSGLFNSTWISNTSVQGSNSTGSNDSITLPCRIKQIINMWQRIGQAMYAPPIQGVIRCVSNITGLILTRDGGSTNSTTETFRPGGGDMRDNWRSELYKYKVVKIEPLGVAPTRCKRRVVGR 476 T 3.1E-54 GP120 pdbpercent T Viruses T 6dg5 1 A A Neoleukin-2/15 GSHMPKKKIQLHAEHALYDALMILNIVKTNSPPAEEKLEDYAFNFELILEEIARLFESGDQKDEAEKAKRMKEWMKRIKTTASEDEQEEMANAIITILQSWIFS 104 T 0.0088 UvrD_C pdb F T 6dg6 1 A,B,C,D,E,F A,B,C,D,E,F Neoleukin-2/15 GSHMPKKKIQLHAEHALYDALMILNIVKTNSPPAEEKLEDYAFNFELILEEIARLFESGDQKDEAEKAKRMKEWMKRIKTTASEDEQEEMANAIITILQSWIFS 104 T 0.0088 UvrD_C pdb F T 6dgp 2 C,D C,D TRAP220 Coactivator Peptide (Mediator of RNA polymerase II transcription subunit 1) NTKNHPMLMNLLKDNPAQD 19 T 8.5 HEAT pdbhh F T 6dhx 1 A,B,C A,B,C T1ZH71_STRIT TipC2 MGSSHHHHHHSQDPMNQPKNIFDEIYQETEKTYRLNNIFNKLTDVEVHSYQEYSDDSKFYPSILYKDIAKTGNYTKIAIDFSFLNKNNNILIYFEKEIGPNVRVRIWNKYTRQDRTLTKSVKIALEKGDSDKYIEDETQVRAYLKKYGITAKDLDAHYEKIVNQKVLKDWCSIYKSKYSPKDYGQVTVKMQWEKW 195 T 0.0095 YflT unppercent F Bacteria T 6di8 1 A,D,G,J A,D,G,J CTRA_BOVIN Chymotrypsin A chain A CGVPAIQPVLSGL 13 T 2 CaM_bind pdbhh F Eukaryota T 6dig 3 C C 13-mer peptide: ALA-GLY-ASN-HIS-ALA-ALA-GLY-ILE-LEU-THR-LEU-GLY-LYS AGNHAAGILTLGK 13 T 1.2E-05 Orexin pdbhh F T 6dix 1 A,B,C,D A,B,C,D NFVFGT Immunoglobulin Light-Chain Variable Domain NFVFGT 6 T 35 FA_hydroxylase pdbhh F F 6diy 1 A A YTFGQ segment Light-Chain Variable Domain Kappa AL09 YTFGQ 5 T 35 Thrombin_light pdbhh F F 6dj0 1 A,B A,B ASLTVS segment from Light-Chain Variable Domain, Lambda Mcg ASLTVS 6 T 220 Trypco1 pdbhh F F 6dj3 1 A,B A,B CNNM2_HUMAN ANCIENT CONSERVED DOMAIN-CONTAINING PROTEIN 2,CYCLIN-M2 GPLGSTDLYTDNRTKKKVAHRERKQDFSAFKQTDSEMKVKISPQLLLAMHRFLATEVEAFSPSQMSEKILLRLLKHPNVIQELKYDEKNKKAPEYYLYQRNKPVDYFVLILQGKVEVEAGKEGMKFEASAFSYYGVMALTASPVIDAVTPTLGSSNNQLNSSLLQVYIPDYSVRALSDLQFVKISRQQYQNALMASRMD 199 T 0.0026 cNMP_binding pdbhh F Eukaryota T 6dj8 2 C,D C,D Natural product peptide XXXXLXLXPXG 11 T 15 NOP19 pdbhh F F 6djk 2 B B ACE-MVA-MP8-NZC-LEU-MP8-LEU-MVA-PRO-MLU-GLY XXXXLXLXPXG 11 T 15 NOP19 pdbhh F F 6djr 1 A,B,C,D A,B,C,D Transient Receptor Potential Cation Channel Subfamily C Member 3 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 490 F F F 6dju 2 E N casein polyAlanine model AAAAAAAAAAAAAAAAAAAAAAAAAA 26 T 920 DUF4699 pdbhh F F 6djv 2 E N casein polyAlanine model AAAAAAAAAAAAAAAAAAAAAAAAAA 26 T 920 DUF4699 pdbhh F F 6djy 1 A A A0A0A0UEE5_9REOV Clamp protein MTLTYWDKEKRMTLKQMIQQVAINEQENELTHYVFTTPLSMPTFGKPMLGYVPLNEVATSKFFSNVNDFDRDNQLAMAHFPDTTITQAYNLTNSIKPGDTSLPDAEVAALKWFWKFFTSINLVRQPPMDNVMYWACQFLSSGTSFLPLERDVEIVFSGFKGSHICMFSNLRQMNLSPILCPYYDLITNFKTTTEIRAYVDAHEELKSLLTYLCLCTIVGLCDTFTETRNMDTGEYVWKVRDVVSRNHTPAQNVEKFCYTIQNAKYMIQLVHVLLFPLTDNKYADLPNYVAVITQGAINQSRSHNVINTTDESNSNTTSDTAASTSGIVSGDTGTVASLYPDEFKYVQS 348 T 31 DUF5659 pdbhh T Viruses T 6djy 2 B,C B,C A0A0A0U7Z7_9REOV Major capsid protein MRPIRMYKNNQERTNLKHQEINEEQQNEQTTSNQGFTRSDNSGKINIERISSSRNQITDGKTVSSYSKIETNRSSQDSVQHGGSSITYTSDTTGNPRITNARTNNDETHATGPIEDLNSTSHGREPEIESFADRAELAMMIQGMTVGALTVQPMRSIRSTFANLANVLIFHDVFTTEDKPSAFIEYHSDEMIVNMPKQTYNPIDNLAKILYLPSLEKFKYGTGIVQLNYSPHISKLYQNTNNIINTITDGITYANRTEFFIRVMVLMMMDRKILTMEFYDVDTSAISNTAILPTIPTTTGVSPLLRIDTRTEPIWYNDAIKTLITNLTIQYGKIKTVLDANAVKRYSVVGYPIDQYRAYLYNHNLLEYLGKKVKREDIMSLIKALSYEFDLITISDLEYQNIPKWFSDNDLSRFIFSICMFPDIVRQFHALNIDYFSQANVFTVKSENAIVKMLNSNQNMEPTIINWFLFRICAIDKTVIDDYFSLEMTPIIMRPKLYDFDMKRGEPVSLLYILELILFSIMFPNVTQHMLGQIQARILYISMYAFRQEYLKFITKFGFYYKIVNGRKEYIQVTNQNERMTENNDVLTGNLYPSLFTDDPTLSAIAPTLAKIARLMKPTTSLTPDDRAIAAKFPRFKDSAHLNPYSSLNIGGRTQHSVTYTRMYDAIEEMFNLILRAFASSFAQRPRAGVTQLKSLLTQLADPLCLALDGHVYHLYNVMANMMQNFIPNTDGQFHSFRACSYAVKDGGNIYRVVQNGDELNESLLIDTAIVWGLLGNTDSSYGNAIGATGTANVPTKVQPVIPTPDNFITPTIHLKTSIDAICSVEGILLLILSRQTTIPGYEDELNKLRTGISQPKVTERQYRRARESIKNMLGSGDYNVAPLHFLLHTEHRSTKLSKPLIRRVLDNVVQPYVANLDPAEFENTPQLIENSNMTRLQIALKMLTGDMDDIVKGLILHKRACAKFDVYETLTIPTDVKTIVLTMQHISTQTQNNMVYYVFLIDGVKILAEDIKNVNFQIDITGIWPEYVITLLLRAINNGFNTYVSMPNILYKPTITADVRQFMNTTKAETLLISNKSIVHEIMFFDNALQPKMSSDTLALSEAVYRTIWNSSIITQRISARGLMNLEDARPPEAKISHQSELDMGKIDETSGEPIYTSGLQKMQSSKVSMANVVLSAGSDVIRQAAIKYNVVRTQEIILFE 1202 T 0.054 DUF6279 pdb T Viruses T 6djy 3 D D A0A0A0U955_9REOV Turret protein MIDLRLEEDILTATLPEFLSTRPKYRYAYTNTKQQDIRFQGPMRHVRLTHLYKQTKLWNLQYIERELAISEIDDALDEFIQTFSLPYVIEQGTYKYNMLLGMHAHNVNYQDDVSELIANNPQLLNYLDDNPFSAIFELVNVDLQIYQYGQNIFNNEAEHTILFLKDNTNYGVIQALQKHPFSATHINWHLHKHIFVFHSREQLLNKLLSAGLEDSQLYQRQKTYSTKRGDRPTERMVTYIEDDHIRRIQAVFPLLLDNIFDVKLHKDSSMTWLKSYADMIYDSVKNSNSTITPEIRKLYLRMYNQYMRIFLPIEQYMLYDNTCWPFSEKITLKINVRLISSRENQPVLWKTPIDTENLISIVQPDEPINKLNFTAIPSTMIRLNDNITMYRAVKDMFSAIEYLPDAIENIPTLTMKEQALSRYISPDSEAQNFFNNQPPYLNSIMNVNRQVFEAVKRGNIQVSTGSMEHLCLCMHVKSGLIVGRTVLIDDKVVLRRNFNASTAKMITCYVKAFAQLYGEGSLINPGLRMVFFGVETEPAIDILKLFYGDKSLYIQGFGDRGIGRDKFRTKIEDALTLRIGCDILISDIDQADYEDPNEEKFDDITDFVCYVTELVISNATVGLVKISMPTYYIMNKISSTLNNKFSNVAINIVKLSTQKPYTYEAYIMLSHGSTLTNKGYLRNPVCDVYLEKISLQPMDLKIISTISNEINYDKPTLYRFVVDKNDVTDVSIAMHILSIHCSTITTRSVMVRSDNTGAFVTMSGIKDMKRVAIMNRMTDGTSANSYMHEQNGKLYLQKVPYLEDLISAFPNGFGSTYQNDYDSSMSVINVNALIRQVVYRVISKSIPVALLESLSRIRIIGGRDLGEMNAVYKLYKTPIEVYDAVGITREYPHVQISYRAQRYSFTESIPNHTLLLANYVIMNDVDGAPISSLEQINTIKKIISKISLGSIAYIQVYTDIVARNINVMTKNDSFLISANADKTVFKVQVSGYKAVEMCNYEQLLQLVSDNTGVNIIKLTYQDVLESCVLSSGILGDTGSWLLDLVLASTYIIEIRG 1056 T 1.3 Reovirus_L2 pdbhh T Viruses T 6dkm 1 A,C,E,G A,C,E,G DHD131_A GSDESDRIRKIVEESDEIVKESRKLAERARELIKESEDKRVSEERNERLLEELLRILDENAELLKRNLELLKEVLYRTR 79 T 0.18 Syntaxin_2 pdb F T 6dky 1 A,B A,B ILE-LEU-GLY-SER-ILE-ILE-LEU-GLY ILGSIILG 8 T 2.5 YkpC pdbhh F F 6dkz 1 A A ribifolin SIILGILG 8 T 2.8 DUF4491 pdbhh F F 6dl0 1 A A pohlianin C TIIFGFGG 8 T 0.16 Sec16 pdbhh F F 6dl1 1 A A jatrophidin PGLLNLWG 8 T 4.3 DUF3959 pdbhh F F 6dlc 2 B B Designed protein DHD1:234_B HGDPKVVETYVELLKRHEKAVKELLEIAKTHAKKVE 36 T 0.46 Nup54_C pdbhh F T 6dly 2 C,D C,D Natural product peptide XXXXLXLXPXG 11 T 15 NOP19 pdbhh F F 6dm3 1 A,B A,B Q5ZWF6_LEGPH RavO ESEKIYKVMEEIFVDRHYKENIRTGEEVKQYFSKSKAEFILRWSSANESDTENKYVFIAASFQASDGIHSIRYGINKNGELFSINTASNKVTPIDILPLGVMATLTQHITQNKELIEKAL 120 T 6.9 IBP39 pdbhh F Bacteria T 6dm4 1 A,C,D,F A,B,C,D Q5ZWF6_LEGPH RavO ESEKIYKVMEEIFVDRHYKENIRTGEEVKQYFSKSKAEFILRWSSANESDTENKYVFIAASFQASDGIHSIRYGINKNGELFSINTASNKVTPIDILPLGVMATLTQHITQNKELIEKAL 120 T 6.9 IBP39 pdbhh F Bacteria T 6dm4 2 B,E,G E,G,H SHC1_HUMAN Shc1 phospho-Tyr317 peptide PSXVNVQ 7 T 0.25 SH3-WW_linker pdbhh F Eukaryota T 6dm6 2 C,D C,D Natural product peptide XXXXLXLXPXG 11 T 15 NOP19 pdbhh F F 6dm9 1 A,C A,C DHD15_extended_A MTREELLRENIELAKEHIEIMREILELLQKMEELLEKARGADEDVAKTIKELLRRLKEIIERNQRIAKEHEYIARERS 78 T 0.0058 Hormone_1 pdb F T 6dma 1 A,C,E,G A,C,E,G DHD15_closed_A MTREELLRENIELAKEHIEIMREILELLQKMEELLEKARGADEDVAKTIKELLRRLKEIIERNQRIAKEHEYIARERS 78 T 0.0058 Hormone_1 pdb F T 6dmp 1 A A Designed orthogonal protein DHD13_XAAA_A GTKEDILERQRKIIERAQEIHRRQQEILEELERIIRKPGSSEEAMKRMLKLLEESLRLLKELLELSEESAQLLYEQR 77 T 0.0023 Ku_C pdbpercent F T 6dmp 2 B B Designed orthogonal protein DHD13_XAAA_B TEKRLLEEAERAHREQKEIIKKAQELHRRLEEIVRQSGSSEEAKKEAKKILEEIRELSKRSLELLREILYLSQEQKGSLVPR 82 T 0.00053 Prefoldin_2 pdb F T 6dmx 1 A,F E,J HBZ_HTL1A BZIP factor GSHMASGLFRALPVSAPEDLLVEELVDGLLSLEEELKDKEEEKAVLDGLLSLEEESRG 58 T 0.6 Cupin_8 unp T Viruses T 6dn5 2 B B WDINNN(BAL) cyclic peptide inhibitor WDINNNX 7 T 18 Resistin pdbhh F F 6dn6 2 B B INNN(ABU) cyclic peptide inhibitor INNNX 5 T 330 Homez pdbhh F F 6dn7 2 B,D B,D WDINNN(BAL) Cyclic peptide inhibitor WDINNNX 7 T 18 Resistin pdbhh F F 6dn8 2 B,D,F B,D,F (GZJ)VDINNN(CY3) Cyclic peptide inhibitor XVDINNNX 8 T 1.1 DUF2729 pdbhh F F 6dnm 1 A A Export chaperone SatS SHMVVLGGDRDFWLQVGIDPIQIMTGTATFYTLRCYLDDRPIFLGRNGRISVFGSERALARYLADEHDHDLSDLSTYDDIRTAATDGSLAVAVTDDNVYVLSGLVDDFADGPDAVDREQLDLAVELLRDIGDYSEDSAVDKALETTRPLGQLVAYVLDPHSVGKPTAPYAAAVREWEKLERFVESRLRRE 190 T 0.04 DUF6482 pdbhh F T 6dno 2 B B PPR3A_RABIT PROTEIN PHOSPHATASE 1 GLYCOGEN-ASSOCIATED REGULATORY SUBUNIT,PROTEIN PHOSPHATASE TYPE-1 GLYCOGEN TARGETING SUBUNIT,RG1 RRVSFADNFGFNLVSVKEFDTWELPSVSTT 30 T 1.3 RSD-2 pdbhh F Eukaryota T 6dno 3 C C Microcystin-LR XLXRXXX 7 T 49 Phage_TAC_2 pdbhh F F 6dnq 1 A E Q2Q067_9DELA BZIP factor GSHMASGLFRALPVSAPEDLLVEELVDGLLSLEEELKDKEEEKAVLDGLLSLEEESRGRLRRGPPGEKAPPRGETHRDR 79 T 0.16 Cupin_8 unp T Viruses T 6dny 1 A A Cyclic tetrapeptide PYPV PYPV 4 T 49 zf-MYST pdbhh F F 6do1 3 G,H G,H Angiotensin-like peptide S1I8 XRVYIHPI 8 T 3 Ion_trans_N pdbhh F T 6do5 2 C,D D,C UBP1_HUMAN USP1 C-END DEGRON IGLLGG 6 T 0.016 HYPK_UBA unphh F Eukaryota F 6dql 2 C C SpeB-inducing peptide (SIP) MWLLLLFL 8 T 0.8 Prion_bPrPp pdbhh F F 6dqq 2 B B ALA-ALA-ALA-ALA AAAA 4 T 900 Cyclin_C pdbhh F F 6dqr 2 B B MET-GLY-GLY MGG 3 T 56 DUF829 pdbhh F F 6dqt 2 B B LEU-GLY-GLY LGG 3 T 170 RMI1_N pdbhh F F 6dqu 2 B B GLY-ILE-ILE-ASN-THR-LEU GIINTL 6 T 11 DUF1601 pdbhh F F 6dr4 1 A,B,C,D A,C,B,D ORN-CYS-VAL-PHE-MEA-CYS-GLU-ASP-ORN-ALA-VAL-ILE-GLY-LEU-ORN-VAL XCVFXCEDXAVIGLXV 16 T 0.35 Beta-APP pdbhh F T 6dr5 1 A,B,C,D,E,F A,B,C,D,E,F ORT-CYS-VAL-PHE-MEA-CYS-GLU-ASP-ORT-ALA-CHG-ILE-GLY-LEU-ORA-VAL XCVFXCEDXAXIGLXV 16 T 1.1 Beta-APP pdbhh F T 6dr6 1 A,B,C,D A,D,B,C ORT-CYS-VAL-PHE-XXX-CYS-GLU-ASP-ORT-ALA-ILE-ILE-GLY-LEU-ORA-VAL XCVFXCEDXAIIGLXV 16 T 2.3 Pox_A14 pdbhh F T 6drd 13 M M GRL1A_HUMAN DNA-DIRECTED RNA POLYMERASE II SUBUNIT M,GLUTAMATE RECEPTOR-LIKE PROTEIN 1A XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXPTQLLSIEESLALQKQQ 62 T 5.2 DUF6465 pdbpssm F Eukaryota T 6drq 1 A A Primosomal protein LGGDRDFWLQVGIDPIQIMTGTATFYTLRCYLDDRPIFLGRNGRISVFGSERALARYLADEHDHDLSDLSTYDDIRTAATDGSLAVAVTDDNVYVLSGLVDDFADGPDAVDREQLDLAVELLRDIGDYSEDSAVDKALETTRPLGQLVAYVLDPHSVGKPTAPYAAAVREWEKLERFVESRLRRE 185 T 0.037 DUF6482 pdbhh F T 6dsl 1 A A Consensus engineered intein CatN EFEALSGDTMIEILDDDGIIQKISMEDLYQRLA 33 T 1 Ish1 pdbhh F T 6dsl 2 B B Consensus engineered intein CatC DYKDDDDKMFKLNTKNIKVLTPSGFKSFSGIQKVYKPFYHHIIFDDGSEIKCSDNHSFGKDKIKASTIKVGDYLQGKKVLYNEIVEEGIYLYDLLNVGEDNLYYTNGIVSHACESRGK 118 T 0.12 DUF3857 pdb F T 6dt7 2 B B Q8LTE3_BPN4 RNAP2 MQTFTAREYLKIDIANNYGLDKEDWDDRIAWFDKNENNLLNLVREAEEPALFYAGVKAWMDVKEGKPIGYPVALDATSSGLQILACLTGDRRAAELCNVVNYRDESGKVKRRDAYTVIYNKMLNTLGKGARIKRNDCKQAIMTALYGSEAKPKEVFGEGIMLNVFESTMNVEAPAVWELNKFWLQCGNPEAFVYHWVMPDGFNVYIKVMVNEVETVHFLDKPYDCVRKVQGTEEKTRMLSANTTHSIDGLVVRELVRRCDYDKNQIEYIKALCNGEAEYKASEKNYGKAMELWGYYEKTGFLTARIFDYLDSETIKLVNTQDILDLIESMPKKPFHVLTVHDCFRCLPNYGNDIRRQYNNLLATIAKGDLLSFIMSQVIGQEVTIGKLDPTLWEDVLETEYALS 404 T 7.5E-42 RNA_pol pdbpercent T Viruses T 6dt8 2 B B Q8LTE3_BPN4 RNAP2 MQTFTAREYLKIDIANNYGLDKEDWDDRIAWFDKNENNLLNLVREAEEPALFYAGVKAWMDVKEGKPIGYPVALDATSSGLQILACLTGDRRAAELCNVVNYRDESGKVKRRDAYTVIYNKMLNTLGKGARIKRNDCKQAIMTALYGSEAKPKEVFGEGIMLNVFESTMNVEAPAVWELNKFWLQCGNPEAFVYHWVMPDGFNVYIKVMVNEVETVHFLDKPYDCVRKVQGTEEKTRMLSANTTHSIDGLVVRELVRRCDYDKNQIEYIKALCNGEAEYKASEKNYGKAMELWGYYEKTGFLTARIFDYLDSETIKLVNTQDILDLIESMPKKPFHVLTVHDCFRCLPNYGNDIRRQYNNLLATIAKGDLLSFIMSQVIGQEVTIGKLDPTLWEDVLETEYALS 404 T 7.5E-42 RNA_pol pdbpercent T Viruses T 6dta 2 B B Q8LTE3_BPN4 RNAP2 MQTFTAREYLKIDIANNYGLDKEDWDDRIAWFDKNENNLLNLVREAEEPALFYAGVKAWMDVKEGKPIGYPVALDATSSGLQILACLTGDRRAAELCNVVNYRDESGKVKRRDAYTVIYNKMLNTLGKGARIKRNDCKQAIMTALYGSEAKPKEVFGEGIMLNVFESTMNVEAPAVWELNKFWLQCGNPEAFVYHWVMPDGFNVYIKVMVNEVETVHFLDKPYDCVRKVQGTEEKTRMLSANTTHSIDGLVVRELVRRCDYDKNQIEYIKALCNGEAEYKASEKNYGKAMELWGYYEKTGFLTARIFDYLDSETIKLVNTQDILDLIESMPKKPFHVLTVHDCFRCLPNYGNDIRRQYNNLLATIAKGDLLSFIMSQVIGQEVTIGKLDPTLWEDVLETEYALS 404 T 7.5E-42 RNA_pol pdbpercent T Viruses T 6dtd 1 A A E6K398_9BACT nuclease MQKQDKLFVDRKKNAIFAFPKYITIMENKEKPEPIYYELTDKHFWAAFLNLARHNVYTTINHINRRLEIAELKDDGYMMGIKGSWNEQAKKLDKKVRLRDLIMKHFPFLEAAAYEMTNSKSPNNKEQREKEQSEALSLNNLKNVLFIFLEKLQVLRNYYSHYKYSEESPKPIFETSLLKNMYKVFDANVRLVKRDYMHHENIDMQRDFTHLNRKKQVGRTKNIIDSPNFHYHFADKEGNMTIAGLLFFVSLFLDKKDAIWMQKKLKGFKDGRNLREQMTNEVFCRSRISLPKLKLENVQTKDWMQLDMLNELVRCPKSLYERLREKDRESFKVPFDIFSDDYNAEEEPFKNTLVRHQDRFPYFVLRYFDLNEIFEQLRFQIDLGTYHFSIYNKRIGDEDEVRHLTHHLYGFARIQDFAPQNQPEEWRKLVKDLDHFETSQEPYISKTAPHYHLENEKIGIKFCSAHNNLFPSLQTDKTCNGRSKFNLGTQFTAEAFLSVHELLPMMFYYLLLTKDYSRKESADKVEGIIRKEISNIYAIYDAFANNEINSIADLTRRLQNTNILQGHLPKQMISILKGRQKDMGKEAERKIGEMIDDTQRRLDLLCKQTNQKIRIGKRNAGLLKSGKIADWLVNDMMRFQPVQKDQNNIPINNSKANSTEYRMLQRALALFGSENFRLKAYFNQMNLVGNDNPHPFLAETQWEHQTNILSFYRNYLEARKKYLKGLKPQNWKQYQHFLILKVQKTNRNTLVTGWKNSFNLPRGIFTQPIREWFEKHNNSKRIYDQILSFDRVGFVAKAIPLYFAEEYKDNVQPFYDYPFNIGNRLKPKKRQFLDKKERVELWQKNKELFKNYPSEKKKTDLAYLDFLSWKKFERELRLIKNQDIVTWLMFKELFNMATVEGLKIGEIHLRDIDTNTANEESNNILNRIMPMKLPVKTYETDNKGNILKERPLATFYIEETETKVLKQGNFKALVKDRRLNGLFSFAETTDLNLEEHPISKLSVDLELIKYQTTRISIFEMTLGLEKKLIDKYSTLPTDSFRNMLERWLQCKANRPELKNYVNSLIAVRNAFSHNQYPMYDATLFAEVKKFTLFPSVDTKKIELNIAPQLLEIVGKAIKEIEKSENKN 1127 T 0.13 Cdh1_DBD_1 pdbpssm F Bacteria T 6dtf 2 B B LYS-LYS-LYS KKK 3 T 580 Rrn6 pdbhh F F 6dtg 2 B B TYR-LEU-GLY-ALA-ASN-GLY YLGANG 6 T 46 Reovirus_L2 pdbhh F F 6dth 2 B B ARG-PRO-PRO-GLY-PHE RPPGF 5 T 3.2E-05 Bradykinin unphh F F 6dtn 2 B A (6D6)PPKRIA(NH2), DC100-1 XPPKRIAX 8 T 27 DUF5394 pdbhh F T 6du2 2 C,D C,D REST_HUMAN REST-pS861/4 EDLSPPSPPLPK 12 T 12 Tir_receptor_N pdbhh F Eukaryota T 6du3 2 C,D C,D REST_HUMAN REST-pS861 EDLSPPSPPLPK 12 T 12 Tir_receptor_N pdbhh F Eukaryota T 6dub 2 C,D E,F RCC1_HUMAN RCC1 XPKRIA 6 T 7.2 DUF5394 pdbhh F Eukaryota T 6dus 1 A,B A,B A0A0H3NMP8_SALTS Type III secretion system effector protein SSGSANFAGVEYPLLPLDQHTPLLFQWFERNPSRFGENQIPIINTQQNPYLNNIINAAIIEKERTIGVLVDGNFSAGQKKALAKLEKQYENIKVIYNSDLDYSMYDKKLSDIYLENIAKIEAQPANVRDEYLLGEIKKSLNEVLKNNPEESLVSSHDKRLGHVRFDFYRNLFLLKGSNAFLEAGKHGCHHLQPGGGCIYLDADMLLTGKLGTLYLPDGIAVHVSRKGNSMSLQNGIIAVNRSEHPALKKGLEIMHSKPYGDPYIDGVCGGLRHYFNCSIRHNYEEFCNFIEFKHEHIFMDTSSLTISSWR 310 T 1.9E-05 Glyco_transf_88 unphh F Bacteria T 6dym 1 A A EBO_DROME Ebony MLKMEAVPLRLEHRQEVIDIIVASFYNKADLEQWLKPGVLRTDYSDILNDIWNVLVERDLSFVVYDTNTDRIIGTALNFDARNEPEVDIKSKLLIVFEFLEFCEGPIRDNYLPKGLNQILHSFMMGTAEKLNPRENIACMHFMEHEVLRVAREKQFAGIFTTNTSPLTQQLADVYHYKTLLNFQVNEYVHSDGSRPFGDAPDEQRAIVHWKEVGKGSHHHHHH 223 T 0.00056 Acetyltransf_9 unphh F Eukaryota T 6dyn 1 A A EBO_DROME Ebony MLKMEAVPLRLEHRQEVIDIIVASFYNKADLEQWLKPGVLRTDYSDILNDIWNVLVERDLSFVVYDTNTDRIIGTALNFDARNEPEVDIKSKLLIVFEFLEFCEGPIRDNYLPKGLNQILHSFMMGTAEKLNPRENIACMHFMEHEVLRVAREKQFAGIFTTNTSPLTQQLADVYHYKTLLNFQVNEYVHSDGSRPFGDAPDEQRAIVHWKEVGKGSHHHHHH 223 T 0.00056 Acetyltransf_9 unphh F Eukaryota T 6dyo 1 A A EBO_DROME Ebony MLKMEAVPLRLEHRQEVIDIIVASFYNKADLEQWLKPGVLRTDYSDILNDIWNVLVERDLSFVVYDTNTDRIIGTALNFDARNEPEVDIKSKLLIVFEFLEFCEGPIRDNYLPKGLNQILHSFMMGTAEKLNPRENIACMHFMEHEVLRVAREKQFAGIFTTNTSPLTQQLADVYHYKTLLNFQVNEYVHSDGSRPFGDAPDEQRAIVHWKEVGKGSHHHHHH 223 T 0.00056 Acetyltransf_9 unphh F Eukaryota T 6dyr 1 A A EBO_DROME Ebony MLKMEAVPLRLEHRQEVIDIIVASFYNKADLEQWLKPGVLRTDYSDILNDIWNVLVERDLSFVVYDTNTDRIIGTALNFDARNEPEVDIKSKLLIVFEFLEFCEGPIRDNYLPKGLNQILHSFMMGTAEKLNPRENIACMHFMEHEVLRVAREKQFAGIFTTNTSPLTQQLADVYHYKTLLNFQVNEYVHSDGSRPFGDAPDEQRAIVHWKEVGKGSHHHHHH 223 T 0.00056 Acetyltransf_9 unphh F Eukaryota T 6dys 1 A A EBO_DROME Ebony MLKMEAVPLRLEHRQEVIDIIVASFYNKADLEQWLKPGVLRTDYSDILNDIWNVLVERDLSFVVYDTNTDRIIGTALNFDARNEPEVDIKSKLLIVFEFLEFCEGPIRDNYLPKGLNQILHSFMMGTAEKLNPRENIACMHFMEHEVLRVAREKQFAGIFTTNTSPLTQQLADVYHYKTLLNFQVNEYVHSDGSRPFGDAPDEQRAIVHWKEVGKGSHHHHHH 223 T 0.00056 Acetyltransf_9 unphh F Eukaryota T 6dz9 1 A A CPfox2 GSKRFRXPIIFNER 14 T 7.3 Hum_adeno_E3A pdbhh F T 6dza 1 A A CPfox4 GSKRFRFXPEIIFNER 16 T 5.7 PRC2_HTH_1 pdbhh F T 6dzb 1 A A CPfox5 GSRGFRFXPKIIFNER 16 T 1.6 PsbT pdbhh F T 6dzc 1 A A CPfox6 GSRGFRFXPKIIRNER 16 T 2.9 DUF3368 pdbhh F T 6dze 1 A A CPfox7 GSRRFRFXPKIIFNQR 16 T 2.4 PsbT pdbhh F T 6dzi 56 DB 3 A0QTP4_MYCS2 Uncharacterized protein AKRGRKKRDRKHSKANHGKRPNA 23 T 0.16 DUF6254 pdb F Bacteria T 6dzp 34 HA 3 A0QTP4_MYCS2 Uncharacterized protein AKRGRKKRDRKHSKANHGKRPNA 23 T 0.16 DUF6254 pdb F Bacteria T 6e00 1 A,AA,B,BA,C,CA,D,DA,E,EA,F,FA,G,H,I,J,K,L,M,N,O,P,Q,R,S,T,U,V,W,X,Y,Z A,k,B,l,C,m,D,n,E,o,F,p,G,H,I,J,K,L,M,N,O,P,a,b,c,d,e,f,g,h,i,j N-Me-p-iodo-D-Phe1,N-Me-D-Gln4,Lys10-teixobactin analogue XISXXISXAKI 11 T 94 RII_binding_1 pdbhh F F 6e10 2 G,I,K,M,O,Q,S B,A,G,F,E,D,C Q8IKC8_PLAF7 Exported protein 2 MKVSYIFSFFLLFFVYKNTNTVVCDNGYGDLAATSALTTVIKDPISLTIKDIYEHGVKNPFTKIIHKLKKFIRYRKVLRWSRMWWVLLVREIVGDNTIEKKTEKALREIWDQCTIAVYNNTLNAVESKPLLFLHGILNECRNNFATKLRQDPSLIVAKIDQIIKSQIYRFWVSEPYLKIGRSHTLYTHITPDAVPQLPKECTLKHLSSYMEEKLKSMESKKNIESGKYEFDVDSSETDSTKDDGKPDDDDDDDDNFDDDDNFDDDTVEEEDASGDLFKNEKKDENKE 287 T 0.086 Y_Y_Y pdbpercent F Eukaryota T 6e10 3 H,J,L,N,P,R,T,V a,g,f,e,d,c,b,h Q8ILA1_PLAF7 Translocon component PTEX150 SVKDIKKLIEEGILDYEDLTENELRKLAKPDDNFYELSPYASDEKDLSLNETSGLTNEQLKNFLGQNGTYHMSYDSKSIDYAKQKKSEKKEDQQEDDDGFYDAYKQIKNSYDGIPNNFNHEAPQLIGNNYVFTSIYDTKENLIKFLKKNSEYDLYDXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 207 T 0.02 Latexin pdb F Eukaryota T 6e10 4 U 0 Endogenous cargo polypeptide XXXXXXXXXXXXXXX 15 F F F 6e10 5 AA,BA,W,X,Y,Z m,n,i,j,k,l Unknown (Claw) XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 58 F F F 6e11 1 A,AA,B,BA,C,Y,Z i,l,j,m,k,n,h Unknown (Claw) XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 60 F F F 6e11 3 I,K,M,Q,S,U,W C,D,E,B,A,G,F Q8IKC8_PLAF7 Exported protein 2 MKVSYIFSFFLLFFVYKNTNTVVCDNGYGDLAATSALTTVIKDPISLTIKDIYEHGVKNPFTKIIHKLKKFIRYRKVLRWSRMWWVLLVREIVGDNTIEKKTEKALREIWDQCTIAVYNNTLNAVESKPLLFLHGILNECRNNFATKLRQDPSLIVAKIDQIIKSQIYRFWVSEPYLKIGRSHTLYTHITPDAVPQLPKECTLKHLSSYMEEKLKSMESKKNIESGKYEFDVDSSETDSTKDDGKPDDDDDDDDNFDDDDNFDDDTVEEEDASGDLFKNEKKDENKE 287 T 0.086 Y_Y_Y pdbpercent F Eukaryota T 6e11 4 L 0 Endogenous cargo polypeptide XXXXXX 6 F F F 6e11 5 N,O,P,R,T,V,X d,c,b,a,g,f,e Q8ILA1_PLAF7 Translocon component PTEX150 MRIIILALLIVCTIINYYCAVQNNGNKSLNVMPTCSMPGNDSDSNDNETGDVDNDKNNELGNANDNNEMNNENAESKNMQGENSNNQEQLNENVHANDDAMYEGTPSSDNPPQENVDANNNEQEYGPPQEEPVSENNVENVEVATDDSGNDNINNNDNFNNNDNYNDNDNFNEEPPSDDGNKNEDELTEGNQSDDKPMNEEEATINEMGKITNPFEDMLKGKVDDMDIGKMMNKDNLQSFLSSLTGNKDGSGKNPLSDMMNIFGVPQTGKEGAEGGVNKENQMKQINELKDKLETMLKGAGVNVDKIKDSIKNNDLLKNKQLLKEAISKLTLDPSMMNMLNNKDGANGKPFDINPDSMMKMFNALSNENGNLDDLKMKPTDGSFDSFNDGVDNNLVPSNPKGQNNNEEDDEEGGDDDDYDDKSFVVNSKYADNSFEDKFNTFDEKDDDVKYELFGENEEAEELNNNTTTASSKGDANNSVNTQEGEGEEESFSANEENINNNNNHNNKNYNNYNTSQQEEDDNSFNENDEPLISSSQFDNNKKNKMSVSTHNKKSKNLMDSLDLESTNYGSNSSSSMSNNYNSKNKNSKKNNKKKSSQKDYIRTDGKVSFDMATLQKTIKNFGGADNEIVQNILKKYVTIDNDDDNDADEDEDEDDDDDDDLDEDEFSVKDIKKLIEEGILDYEDLTENELRKLAKPDDNFYELSPYASDEKDLSLNETSGLTNEQLKNFLGQNGTYHMSYDSKSIDYAKQKKSEKKEDQQEDDDGFYDAYKQIKNSYDGIPNNFNHEAPQLIGNNYVFTSIYDTKENLIKFLKKNSEYDLYDDDDKEGGNFKSPLYDKYGGKLQKFKRQRAFNILKQWRAKEKKLKEKKKKEEMEENKEFDFSKNYNFSSKNDGGVTMFSKDQLEDMVKNFGGKPSAHVTDSFSRKENPFVPTNTKNNSNDDDDMDNGYVTFDGKNKVSENDDDEKGNNNDDENDNDDSNDEEELDEEEDDN 993 T 0.14 CLP_protease pdbpssm F Eukaryota T 6e1r 1 A,B,C,D,E,F A,B,C,D,E,F A0A221SBY4_9CAUD Tailspike protein NNPNLDMSGWLMNLKGVVNSKVELEGLSGSDGQVVLMTGYYAGQYMGGDHFKYDSTQALINNGVTVINGWVKQFSAGVLTVSACGADPSASDHSAALDLAVNTATSLKRKLVVDFDLRVNTTTELDATLRIEGDGGAVQFSRSITATADIPIFTVKAGFSSESSYFGKLMFKASTGGTATAFRSTSNGYLSQSTFDHCVFDRSLRYGIDANLILCDFQKCDFGTYMSTTNSIGFKAIRSLGVVGTREPNANTFYNCIFRKGTDDCMIEWDSYGTQWHFFACDLEQNLCTEALIKCTASSPIMFVGGYIEANTSTPYVIKTLGNSATGFVPLIKFQGIHMNRPCSVAIGKNTMANYPKYIFEGCYGQLISAVVESSTGVLNDVALIENSIANHFTLATGGSIGDIRTLTMPSGFNADSRNFQAAKITNLTSYKHNYKKTINRDFTVGSSVGVASLSHPSISGASYGGRLLVNAIFGTTAAAGTNSAVYELLVTSVGTAKYISQIGSAGLTSGAAASHPSFTWSINSSNVLVATAVGSTAGRFAMEVFTTGNVQAT 554 T 0.006 Pectate_lyase_3 pdbhh T Viruses T 6e2p 2 C,D C,D LEPR_HUMAN LEPR, LEP-R, HUB219, OB RECEPTOR, OB-R GSHQRMKKLFWEDVPNPKNCSWAQGLNFQKPETFEHLFIKHTASVTCGPLLLEPETISEDISVDTSWKNKDEGNS 75 T 3.5 RCR unphh F Eukaryota T 6e2q 2 E,F,G,H M,N,O,P EPOR_HUMAN EPOR, EPO-R GSGSGSGSGSGSGSSHRRALKQKIWPGIPSPESEFEGLFTTHKGNFQLWLYQNDGCLWWSPCTPFTEDPPASLEVLSERCGNS 83 T 0.0066 IFNGR1 unphh F Eukaryota T 6e37 2 B B CSK21_HUMAN TYR-PRO-GLY-GLY-SER-THR-PRO-VAL-SER-SER-ALA-ASN YPGGSTPVSSANMM 14 T 13 Pr_beta_C pdbhh F Eukaryota T 6e3c 1 A C Q5C838_9VIRU Dec protein GSHMANPNFTPSWPLYKDADGVYVSALPIKAIKYANDGSANAEFDGPYADQYMSAQTVAVFKPEVGGYLFRSQYGELLYMSKTAFEANYTSASGSVANAETADKLSTARTITLTGAVTGSASFDGSANVTIETTSGS 137 T 0.067 DUF5853 unppssm T Viruses T 6e3d 2 B B tetra-peptide picked up from the expression host SSVT 4 T 450 SDH_beta pdbhh F F 6e3i 2 B B peptide srt.F4 XQRVVHIAAGLRRTGDQLEAYGX 23 T 2.9 PMAIP1 pdbhh F T 6e3j 2 B B peptide srt.F10 XRRVVQIAAGLRRAGDQLEKYGX 23 T 0.79 BID pdbhh F T 6e49 2 D,E,F D,E,F PIF1_YEAST DNA REPAIR AND RECOMBINATION HELICASE PIF1, PETITE INTEGRATION FREQUENCY PROTEIN 1, TELOMERE STABILITY PROTEIN 1 NGIAAMLQRHSRKRFQL 17 T 2.3 CrtO pdbhh F Eukaryota T 6e4d 2 B F VAL-VAL-VAL-ALA VVVA 4 T 330 MannoseP_isomer pdbhh F F 6e4h 1 A,B A,B PALB2_MOUSE Partner and localizer of BRCA2 MEELSGKPLSYAEKEKLKEKLAFLKKEYSRTLARLQRAKRAEKAKNSKKAIEDGVPQPEALEHHHHHH 68 T 0.024 DUF1564 pdbpercent F Eukaryota T 6e4j 1 A A I6V394_9EURY Uncharacterized protein PF2048.1 MAHHHHHHGSVVKEKLEKALIEVRPYVEYYNELKALVSKISSSVNDLEEAIVVLREEEKKASEPFKTDIRILLDFLESKP 80 T 0.0002 Rnk_N unppssm F Archaea T 6e4y 3 C P PCSK9_HUMAN NEURAL APOPTOSIS-REGULATED CONVERTASE 1, NARC-1, PROPROTEIN CONVERTASE 9, PC9, SUBTILISIN/KEXIN-LIKE PROTEASE PC9 EDEDGDYEELVLALRSEEDGLA 22 T 8.7 PIN7 pdbhh F Eukaryota T 6e4z 3 C P PCSK9_HUMAN NEURAL APOPTOSIS-REGULATED CONVERTASE 1, NARC-1, PROPROTEIN CONVERTASE 9, PC9, SUBTILISIN/KEXIN-LIKE PROTEASE PC9 EDEDGDYEELVLALRSEEDGLA 22 T 8.7 PIN7 pdbhh F Eukaryota T 6e5h 1 A A Designed peptide NC_HEE_D1: Aib turn mutant NDKCKELKKRYXGCEVRCDXPRYEVHCX 28 T 1.1 DUF6410 pdbhh F T 6e5i 1 A A Designed peptide NC_HEE_D1: Orn turn mutant NDKCKELKKRYXCEVRCDXPRYEVHCX 27 T 9.4 DUF2152 pdbhh F T 6e5j 1 A A Designed peptide NC_HEE_D1: Aib turn, beta3 helix, N-methyl hairpin mutant NDXCKXLKXRYXGCEXRCDXPRYEXHCX 28 T 1.1 DUF6410 pdbhh F T 6e5k 1 A A Designed peptide NC_HEE_D1: Aib turn, Aib helix, N-methyl hairpin mutant NDXCKXLKXRYXGCEXRCDXPRYEXHCX 28 T 1.3 DUF6410 pdbhh F T 6e5n 1 A B MYO6_HUMAN UNCONVENTIONAL MYOSIN-6 GPLGSRPKMTPEQMAKEMSEFLSRGPAVLATKAAAGTKKYDLSKWKYAELRDTINTSCDIELLAACREEFHRRLKVYHAWKSKNKKR 87 T 0.027 BUD22 unppercent F Eukaryota T 6e5x 2 B B RBBP6_HUMAN PROLIFERATION POTENTIAL-RELATED PROTEIN, PROTEIN P2P-R, RING-TYPE E3 UBIQUITIN TRANSFERASE RBBP6, RETINOBLASTOMA-BINDING Q PROTEIN 1, RBQ-1, RETINOBLASTOMA-BINDING PROTEIN 6, P53-ASSOCIATED CELLULAR PROTEIN OF TESTIS PVFVPVPPPPLYPPP 15 T 6.7 Tryp_FSAP pdbhh F Eukaryota F 6e66 1 A A NLEB1_ECO27 NleB MLSSLNVLQSSFRGKTALSNSTLLQRPSFAGKEYSLEPIDERTPILFQWFEARPERYEKGEVPILNTKEHPYLSNIINAAKIENERIIGVLVDGNFTYEQKKEFLNLENEHQNIAIIYRADVDFSMYDKKLSDIYLENIHKQESYPASERDNYLLGLLREELKNIPEGKDSLIESYAEKREHTWFDFFRNLAILKAGSLFTETGKTGCHNISPCSGCIYLDADMIITDKLGVLYAPDGIAVHVDCNDEIKSLCNGAIVVNRSNHPALLAGLDIMKSKVDAHPYYDGLGKGIKRHFNYSSLHNYNAFCDFIEEGNPGIIIPNTSMYTSSSW 330 T 1.7E-05 Glyco_transf_88 pdbhh F Bacteria T 6e7i 2 B P Q62605_RAT EA2 PTTDSTTPAPTTK 13 T 39 DUF1263 pdbhh F Eukaryota T 6e8k 2 B B IL2RB_HUMAN INTERLEUKIN-2 RECEPTOR SUBUNIT BETA,IL-2RB,HIGH AFFINITY IL-2 RECEPTOR SUBUNIT BETA,INTERLEUKIN-15 RECEPTOR SUBUNIT BETA,P70-75,P75 YFTYDPXSEEDPD 13 T 1.3 Membrane_bind pdbhh F Eukaryota T 6e8m 2 B B DNJA1_HUMAN DNAJ HOMOLOG SUBFAMILY A MEMBER 1,DNAJ PROTEIN HOMOLOG 2,HSDJ,HEAT SHOCK 40 KDA PROTEIN 4,HEAT SHOCK PROTEIN J2,HSJ-2,HUMAN DNAJ PROTEIN 2,HDJ-2 HYNGEAXEDDEHH 13 T 9.7 BLOC1S3 pdbhh F Eukaryota T 6e9e 2 B A B0MS50_9FIRM EsCas13d MGKKIHARDLREQRKTDRTEKFADQNKKREAERAVPKKDAAVSVKSVSSVSSKKDNVTKSMAKAAGVKSVFAVGNTVYMTSFGRGNDAVLEQKIVDTSHEPLNIDDPAYQLNVVTMNGYSVTGHRGETVSAVTDNPLRRFNGRKKDEPEQSVPTDMLCLKPTLEKKFFGKEFDDNIHIQLIYNILDIEKILAVYSTNAIYALNNMSADENIENSDFFMKRTTDETFDDFEKKKESTNSREKADFDAFEKFIGNYRLAYFADAFYVNKKNPKGKAKNVLREDKELYSVLTLIGKLRHWCVHSEEGRAEFWLYKLDELKDDFKNVLDVVYNRPVEEINNRFIENNKVNIQILGSVYKNTDIAELVRSYYEFLITKKYKNMGFSIKKLRESMLEGKGYADKEYDSVRNKLYQMTDFILYTGYINEDSDRADDLVNTLRSSLKEDDKTTVYCKEADYLWKKYRESIREVADALDGDNIKKLSKSNIEIQEDKLRKCFISYADSVSEFTKLIYLLTRFLSGKEINDLVTTLINKFDNIRSFLEIMDELGLDRTFTAEYSFFEGSTKYLAELVELNSFVKSCSFDINAKRTMYRDALDILGIESDKTEEDIEKMIDNILQIDANGDKKLKKNNGLRNFIASNVIDSNRFKYLVRYGNPKKIRETAKCKPAVRFVLNEIPDAQIERYYEACCPKNTALCSANKRREKLADMIAEIKFENFSDAGNYQKANVTSRTSEAEIKRKNQAIIRLYLTVMYIMLKNLVNVNARYVIAFHCVERDTKLYAESGLEVGNIEKNKTNLTMAVMGVKLENGIIKTEFDKSFAENAANRYLRNARWYKLILDNLKKSERAVVNEFRNTVCHLNAIRNININIKEIKEVENYFALYHYLIQKHLENRFADKKVERDTGDFISKLEEHKTYCKDFVKAYCTPFGYNLVRYKNLTIDGLFDKNYPGKDDSDEQK 954 T 0.18 Orthopox_F14 pdbpercent F Bacteria T 6e9f 1 A A B0MS50_9FIRM EsCas13d MGKKIHARDLREQRKTDRTEKFADQNKKREAERAVPKKDAAVSVKSVSSVSSKKDNVTKSMAKAAGVKSVFAVGNTVYMTSFGRGNDAVLEQKIVDTSHEPLNIDDPAYQLNVVTMNGYSVTGHRGETVSAVTDNPLRRFNGRKKDEPEQSVPTDMLCLKPTLEKKFFGKEFDDNIHIQLIYNILDIEKILAVYSTNAIYALNNMSADENIENSDFFMKRTTDETFDDFEKKKESTNSREKADFDAFEKFIGNYRLAYFADAFYVNKKNPKGKAKNVLREDKELYSVLTLIGKLAHWCVASEEGRAEFWLYKLDELKDDFKNVLDVVYNRPVEEINNRFIENNKVNIQILGSVYKNTDIAELVRSYYEFLITKKYKNMGFSIKKLRESMLEGKGYADKEYDSVRNKLYQMTDFILYTGYINEDSDRADDLVNTLRSSLKEDDKTTVYCKEADYLWKKYRESIREVADALDGDNIKKLSKSNIEIQEDKLRKCFISYADSVSEFTKLIYLLTRFLSGKEINDLVTTLINKFDNIRSFLEIMDELGLDRTFTAEYSFFEGSTKYLAELVELNSFVKSCSFDINAKRTMYRDALDILGIESDKTEEDIEKMIDNILQIDANGDKKLKKNNGLRNFIASNVIDSNRFKYLVRYGNPKKIRETAKCKPAVRFVLNEIPDAQIERYYEACCPKNTALCSANKRREKLADMIAEIKFENFSDAGNYQKANVTSRTSEAEIKRKNQAIIRLYLTVMYIMLKNLVNVNARYVIAFHCVERDTKLYAESGLEVGNIEKNKTNLTMAVMGVKLENGIIKTEFDKSFAENAANRYLRNARWYKLILDNLKKSERAVVNEFANTVCALNAIRNININIKEIKEVENYFALYHYLIQKHLENRFADKKVERDTGDFISKLEEHKTYCKDFVKAYCTPFGYNLVRYKNLTIDGLFDKNYPGKDDSDEQK 954 T 0.18 Orthopox_F14 unppercent F Bacteria T 6eav 2 B I IBB_VIGUN CYS-THR-LYS CTK 3 F F Eukaryota F 6ecd 2 B B tetradepsipeptide XXXV 4 T 1400 Pkinase_C pdbhh F F 6ece 2 C,D C,D dodecadepsipeptide XXXVXXXVXXXV 12 T 180 DUF2659 pdbhh F F 6ecf 2 G,H,I,J,K,L G,I,H,K,J,L dodecadepsipeptide XXXVXXXVXXXV 12 T 180 DUF2659 pdbhh F F 6ee9 1 A X Stress-response Peptide-1 FGVRVGTCPSGYVRRGTFCFPDDDY 25 T 0.013 CPW_WPC pdbhh F T 6eex 1 A A L-GSTSTA from ice nucleaction protein, inaZ GSTSTA 6 T 0.00013 Ice_nucleation unp F F 6ef0 14 N s CCNB_ARBPU model substrate polypeptide SARLGGASIAVQ 12 T 3.9 DUF3182 pdbhh F Eukaryota T 6ef1 14 N s CCNB_ARBPU model substrate polypeptide NENVSARLGGASIAV 15 T 6.3 DUF3182 pdbhh F Eukaryota T 6ef2 14 N s CCNB_ARBPU model substrate polypeptide NNENVSARLGGASIAV 16 T 8.1 DUF3182 pdbhh F Eukaryota T 6ef3 21 U n PSB7_YEAST Proteasome subunit beta type-7 KWDFAKDIKGYGTQK 15 T 2.4 Ice_nucleation pdbhh F Eukaryota T 6ef3 23 W s CCNB_ARBPU Model substrate polypeptide GGKHTFNNENVSARLGGASIAVQAPAQPPPYSHHHHHH 38 T 22 Con-6 pdbhh F Eukaryota T 6ef5 2 E,H S,Q KKCC2_HUMAN ARG-SER-LEU-SEP-ALA-PRO-GLY RSLSAPG 7 T 9.1 DUF6439 pdbhh F Eukaryota T 6ef5 3 F,G R,P LYS-LEU-SEP-LEU-GLN KLSLQ 5 T 150 MukF_M pdbhh F F 6ef8 1 A,B,C,D,E,F,G A,B,C,D,E,F,G OMCS_GEOSL OUTER MEMBRANE CYTOCHROME S FHSGGVAECEGCHTMHNSLGGAVMNSATAQFTTGPMLLQGATQSSSCLNCHQHAGDTGPSSYHISTAEADMPAGTAPLQMTPGGDFGWVKKTYTWNVRGLNTSEGERKGHNIVAGDYNYVADTTLTTAPGGTYPANQLHCSSCHDPHGKYRRFVDGSIATTGLPIKNSGSYQNSNDPTAWGAVGAYRILGGTGYQPKSLSGSYAFANQVPAAVAPSTYNRTEATTQTRVAYGQGMSEWCANCHTDIHNSAYPTNLRHPAGNGAKFGATIAGLYNSYKKSGDLTGTQASAYLSLAPFEEGTADYTVLKGHAKIDDTALTGADATSNVNCLSCHRAHASGFDSMTRFNLAYEFTTIADASGNSIYGTDPNTSSLQGRSVNEMTAAYYGRTADKFAPYQRALCNKCHAKD 407 T 9.8E-05 Cytochrom_NNT unphh F Bacteria T 6efe 1 A A CLEA_CONVL Kappa-conotoxin vil14a GGLGRCIYNCMNSGGGLSFIQCKTMCY 27 T 0.024 Eclosion pdbhh F Eukaryota T 6efk 2 C,D C,D ACE-ILE-GLU-GLU-VAL-ASP XIEEVD 6 T 150 DUF4695 pdbhh F F 6ego 1 A A Hg(II)(GRAND CoilSerL12AL16C)3- EWEALEKKLAAAESKCQALEKKLQALEKKLEALEHG 36 T 0.00015 Cep57_CLD pdb F T 6egv 4 D D Q9IGK7_9VIRU minor capsid protein MiCP DNPHRFLPANVSNRWNEYSSAYLPRV 26 T 2.7 YLP pdbhh T Viruses T 6egx 4 D D Q9IGK7_9VIRU minor capsid protein MiCP DNPHRFLPANVSNRWNEYSSAYLPRV 26 T 2.7 YLP pdbhh T Viruses T 6eh1 4 D D Q9IGK7_9VIRU minor capsid protein MiCP DNPHRFLPANVSNRWNEYSSAYLPRV 26 T 2.7 YLP pdbhh T Viruses T 6eih 2 B P SER-ILE-SEP-ARG SISR 4 T 180 4PPT_N pdbhh F F 6eik 1 A,B,C,D,E,F,G A,B,C,D,E,F,G CC-Hept-I24E XGEIAKALREIAKALREIAWALREEAKALRGX 32 T 0.019 WXG100 pdbpssm F T 6eiw 4 D D Q9IGK7_9VIRU minor capsid protein MiCP DNPHRFLPANVSNRWNEYSSAYLPRV 26 T 2.7 YLP pdbhh T Viruses T 6eiz 1 A,B,C,D,E,F A,B,C,D,E,F CC-Hex2 XGEIAKSLKEIAKSLKEIAWSLKEIAKSLKGX 32 T 0.031 MCPsignal pdbpssm F T 6ej7 2 B B AMBP_HUMAN Protein AMBP QEEEGAGGGQGG 12 T 24 AGRB_N pdbhh F Eukaryota F 6ej8 2 B B AMBP_HUMAN Protein AMBP QEEEGSGGGQGG 12 T 17 DUF6054 pdbhh F Eukaryota F 6ej9 2 B B AMBP_HUMAN Protein AMBP QEPEGSGGGQG 11 T 13 DUF6054 pdbhh F Eukaryota F 6eja 2 B B AMBP_HUMAN BIKUNIN QEEEYSGGGQGG 12 T 29 Glypican pdbhh F Eukaryota F 6ejb 2 B B AMBP_HUMAN Protein AMBP QEEEGSAGGQGG 12 T 24 DUF4266 pdbhh F Eukaryota F 6ejc 2 B B AMBP_HUMAN BIKUNIN QEEEGSGVGQGG 12 T 37 DUF5639 pdbhh F Eukaryota F 6ejd 2 B B AMBP_HUMAN BIKUNIN QEEEGSGGPQGG 12 T 68 DUF6180 pdbhh F Eukaryota F 6eje 2 B B SDC1_HUMAN SYND1 PAAEGSGEQDFT 12 T 32 BING4CT pdbhh F Eukaryota T 6ejl 2 C,D C,D M3K5_HUMAN APOPTOSIS SIGNAL-REGULATING KINASE 1,ASK-1,MAPK/ERK KINASE KINASE 5,MEKK 5 RSISLPVP 8 T 4 Imm9 pdbhh F Eukaryota T 6ek1 1 A A A0A452CST7_PSEFL restriction endonuclease PfoI MQKYRLYEKDGSPVQDFNRFVKGWLDIEFGLKEHQPPKVFDTIRDKYNEAIEAVVLSGVAPRTAHKAALSTLTELLFGHDLAKELSARLDIQPIGVGGFRSAHSQAFAKNVGENFVNLMVYALACILKDNDDVLVDKGLPPHLKKALTLSRECRIKDTLREIKIPIEGDLCVFSRSNHCNAIVISAKTRLKEVFHIGTMWALFSDVAKDEYCLNKWGLKVESSESLKDTMYVFATADMINKDGARSQGCDVERETPRNLIAMDASFFDYVFVSKMGIGHVSSDLSLKYGRESLFHELGCIIDMIEQKFDILL 312 T 0.037 ChaB unppercent F Bacteria T 6eka 1 A,B,C,D,E A,B,C,D,E B2B1E9_PODAN Podospora anserina S mat+ genomic DNA chromosome 3, supercontig 2 MKTLSATRACRTGQKFGEMKTDDHSIAMQGIVGVAQPGVDQSFGSLTTTKSSRAFQGQMDAGSFSNLFSKLEHHHHHH 78 T 0.029 Fez1 unppssm F Eukaryota T 6eke 1 A,B,C A,C,B A0A3B6UEU4_9AGAR lectin GAMAPVPVTKLVCDGDTYKCTAYLDFGDGRWVAQWDTNVFHTG 43 T 0.1 C2-set pdbhh F Eukaryota T 6ekj 2 B B CHAP1_HUMAN ZINC FINGER PROTEIN 828 MSASSGPWKPAKPAPSVSPGPWKPIPSVS 29 T 19 FGAR-AT_N pdbhh F Eukaryota T 6ekl 2 B B CHAP1_HUMAN ZINC FINGER PROTEIN 828 MSASSGPWKPAKPAPSVSPGPWKPIPSVS 29 T 19 FGAR-AT_N pdbhh F Eukaryota T 6ekm 2 B B REV3L_HUMAN PROTEIN REVERSIONLESS 3-LIKE,HREV3 MGDKKIVIMPCKCAPSRQLVQVWLQAKE 28 T 0.37 SAM_3 pdbhh F Eukaryota T 6eko 1 A,B A,B A0A452CST9_PSEFL Restriction endonuclease PfoI MQKYRLYEKDGSPVQDFNRFVKGWLDIEFGLKEHQPPKVFDTIRDKYNEAIEAVVLSGVAPRTAHKAALSTLTELLFGHDLAKELSARLDIQPIGVGGFRSAHSQAFAKNVGENFVNLMVYALACILKDNDDVLVDKGLPPHLKKALTLSRECRIKDTLREIKIPIEGDLCVFSRSNHCNAIVISAATRLKEVFHIGTMWALFSDVAKDEYCLNKWGLKVESSESLKDTMYVFATADMINKDGARSQGCDVERETPRNLIAMDASFFDYVFVSKMGIGHVSSDLSLKYGRESLFHELGCIIDMIEQKFDILL 312 T 0.64 RE_BsaWI pdbhh F Bacteria T 6ekr 1 A A Q93K38_KLEPN Type ii site-specific deoxyribonuclease MDILKEKIDVASRLYNLNLDHIPATLQVIEHAMLLLKNNAGYGYFGSFNGKNTQEYHSFTFNGEYSRPVRDDLFITDYDFFVSGFREFNESLRDIGSKWSSFDSRRANKIIYTSVMSVACCFDLWKSGSRKTPGTFFEIFMAAVLKWMIPDEIFSKHIPLIDQLESDDESIDPSSVSTDIVIKSAYANASVVIPLKITTRERIVQPFAQQRILDSYFGNGVYFSFLACISETQQDKKKKKVNHICVPGTIRLYQKYLSSLSGMYYCDIPERYLERDLTDIIPVRTMGDFLFDIYSFFRSQGAAALEHHHHHH 312 T 0.033 Nop52 pdbpercent F Bacteria T 6emk 3 E,F E,F Target of rapamycin complex 2 subunit TSC11 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 303 F F F 6ena 1 A A NEMA1_LINLO Nemertide alpha-1 GCIATGSFCTLSKGCCTKNCGWNFKCNPPNQ 31 T 0.001 Conotoxin_I2 pdb F Eukaryota T 6enb 2 B B PRO-GLY-PRO PGP 3 T 81 zf-CCHC pdbhh F F 6epg 1 A,C,E,G A,C,E,G D5K9E3_NEIGO Epsilon_1 antitoxin MNKVEPQESNAIRMIKEACEKNRRMMTDEAFRKEVEKRLYAGPSPELLAKLRVLWAANKEQ 61 T 1.5 DUF6033 pdbhh F Bacteria T 6eph 1 A,C,E,G A,C,E,G D5K9E3_NEIGO Epsilon_1 antitoxin MNKVEPQESNAIRMIKEACEKNRRMMTDEAFRKEVEKRLYAGPSPELLAKLRVLWAANKEQ 61 T 1.5 DUF6033 pdbhh F Bacteria T 6epi 1 A,C,E,G A,C,E,G D5K9E3_NEIGO Epsilon_1 antitoxin MNKVEPQESNAIRMIKEACEKNRRMMTDEAFRKEVEKRLYAGPSPELLAKLRVLWAANKEQ 61 T 1.5 DUF6033 pdbhh F Bacteria T 6eqv 2 B D HY1-LLI-VAL-ARG-00S XXVRX 5 T 450 Consortin_C pdbhh F F 6eqw 2 B D AMA-ARG-TBG-ARG-00S XRXRX 5 T 450 Consortin_C pdbhh F F 6eqx 2 B D Arg-Arg-Arg-Val-Arg-00S RRRVRX 6 T 110 DUF4658 pdbhh F F 6er6 1 A B Endonuclease colEdes7 MESKRNKPGKATGKGKPVGDKWLDDAGKDSGAPIPDRIADKLRDKEFKNFDDFRKKFWEEVSKDPDLAKQFKRSNRKRIQQGYAPFAPQKDQVGGRTTFELHHDKPISQDGGVYDMNNIRVTTPKRAIDIHRGK 134 T 0.0047 HNH pdbpssm F T 6ere 1 A,D B,A colicin MESKRNKPGKATGKGKPVGDKWLDDAGKDSGAPIPDRIADKLRDKEFKNFDDFRKKFWKEVAKDPDLAKQFSKANQRNIKDGNAPFARESDQVGGRTTYELHHDKPISQDGGVYDMNNIRVTTPKRAIDIHRGK 134 T 0.0033 HNH pdbpssm F T 6erf 5 Q,R,S,T Q,R,S,T APLF_HUMAN APURINIC-APYRIMIDINIC ENDONUCLEASE APLF,PNK AND APTX-LIKE FHA DOMAIN-CONTAINING PROTEIN,XRCC1-INTERACTING PROTEIN 1 KQQPILAERKRILPTWML 18 T 0.032 PNISR pdbhh F Eukaryota T 6erg 3 C,F C,F NHEJ1_HUMAN PROTEIN CERNUNNOS,XRCC4-LIKE FACTOR SKVKRKKPRGLFS 13 T 2.4 DUF3487 pdbhh F Eukaryota T 6erh 5 G,J M,T NHEJ1_HUMAN PROTEIN CERNUNNOS,XRCC4-LIKE FACTOR LQRPQLSKVKRKKPRGLFS 19 T 7.6 DUF3487 pdbhh F Eukaryota T 6eri 57 EB BY bS1c XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 170 F F F 6et5 7 AB,BB,G,NA,OA,PA,QA,RA,SA,TA,UA,VA,WA,XA,YA,ZA y,5,2,I,O,R,U,X,a,d,g,j,m,p,s,v LHG_BLAVI Light-harvesting protein B-1015 gamma chain SDWNLWVPLGILGIPTIWIALTYR 24 T 0.72 Proton_antipo_C pdbhh F Bacteria T 6evh 1 A A Lipoaminopeptide helioferin A and B XPXAXIIXXXX 11 T 25 MWFE pdbhh F F 6evm 2 B C Pro-9 PPPPPPPPP 9 T 29 Adeno_E3_14_5 pdbhh F F 6evn 2 B C PRO-PRO-GLY-PRO-ALA-GLY-PRO-PRO-GLY PPGPAGPPG 9 T 0.58 DUF374 pdbhh F F 6evo 2 B C PRO-PRO-GLY-PRO-ARG-GLY-PRO-PRO-GLY PPGPRGPPG 9 T 0.29 DUF374 pdbhh F F 6evp 2 B C PRO-PRO-GLY-PRO-GLU-GLY-PRO-PRO-GLY PPGPEGPPG 9 T 0.35 DUF6053 pdbhh F F 6ew9 2 D,E,F P,Q,R DNRLGLVYQF PEPTIDE DNRLGLVYQF 10 T 1.2 POLO_box pdbhh F T 6ewa 3 C,G C,G POL_HV1H2 Polyprotein ILKEPVHGV 9 T 0.56 DUF2115 pdbhh T Viruses T 6ewc 3 C,G C,G RETR2_HUMAN Reticulophagy regulator 2 RLSSPLHFV 9 T 5.7 Pox_F15 pdbhh F Eukaryota T 6ewo 3 C,G C,G SYNEM_HUMAN DESMUSLIN RTFSPTYGL 9 T 1.6 Adipokin_hormo pdbhh F Eukaryota T 6eww 2 E,F,G,H E,F,G,H KKCC2_HUMAN ARG-LYS-LEU-SEP-LEU-GLN-GLU-ARG RKLSLQER 8 T 3.9 DUF2660 pdbhh F Eukaryota T 6ex9 2 B B Inhibitor Peptide WSYFYDGSYSYYDYE 15 T 2.8 DUF6058 pdbhh F T 6exa 1 A,B A,B Q5ZYC7_LEGPH IcmP (DotM) GPSGGGADVNKGPWAMALTPMEFARKYNLLRKDDALLDNPVPGEEMTAGIEEGDAKRVFTMQLGPYWDGFERCSPQAYALSAVFMARMNRDRDAANNILKVLDKTFVDGKPDFSVARPVMKKYQNSELVQEVVAKHAYVLTVIASLLEAAREDGVVPSSEFLWLKPVDRRLWYMLNCVGRQTPYSEVAGPFAHWKAEKEMGRRSLVPMIDEAIRALEIAVKEVRLTPRQMEELEP 235 T 17 DUF6015 pdbhh F Bacteria T 6exb 1 A,B A,B Q5ZYC7_LEGPH IcmP (DotM) GPSGGGADVNKGPWAMALTPMEFARKYNLLRKDDALLDNPVPGEEMTAGIRRGDAKRVFTMQLGPYWDGFERCSPQAYALSAVFMARMNRDRDAANNILKVLDKTFVDGKPDFSVARPVMKKYQNSELVQEVVAKHAYVLTVIASLLEAAREDGVVPSSEFLWLKPVDRRLWYMLNCVGRQTPYSEVAGPFAHWKAEKEMGRRSLVPMIDEAIRALEIAVKEVRLTPRQMEELEP 235 T 18 DUF6015 pdbhh F Bacteria T 6exc 1 A,B A,B Q5ZYC7_LEGPH IcmP (DotM) GPSGGGADVNKGPWAMALTPMEFARKYNLLRKDDALLDNPVPGEEMTAGIRRGDAKRVFTMQLGPYWDGFERCSPQAYALSAVFMARMNRDRDAANNILKVLDKTFVDGKPDFSVARPVMKKYQNSELVQEVVAKHAYVLTVIASLLEAAREDGVVPSSEFLWLKPVDEELWYMLNCVGRQTPYSEVAGPFAHWKAEKEMGRRSLVPMIDEAIRALEIAVKEVRLTPRQMEELEP 235 T 20 DUF6015 pdbhh F Bacteria T 6exd 1 A,B A,B Q5ZYC7_LEGPH IcmP (DotM) GPSGGGADVNKGPWAMALTPMEFARKYNLLRKDDALLDNPVPGEEMTAGIRRGDAKRVFTMQLGPYWDGFERCSPQAYALSAVFMARMNRDRDAANNILKVLDKTFVDGKPDFSVARPVMKKYQNSELVQEVVAKHAYVLTVIASLLEAAREDGVVPSSEFLWLKPVDRRLWYMLNCVGRQTPYSEVAGPFAHWKAEKEMGRRSLVPMIDEAIRALEIAVKEVRLTPRQMEELEP 235 T 18 DUF6015 pdbhh F Bacteria T 6exe 1 A,B A,B Q5ZYC7_LEGPH IcmP (DotM) GPSGGGADVNKGPWAMALTPMEFARKYNLLRKDDALLDNPVPGEEMTAGIRRGDAKRVFTMQLGPYWDGFEECSPQAYALSAVFMARMNRDRDAANNILKVLDKTFVDGKPDFSVARPVMKKYQNSELVQEVVAKHAYVLTVIASLLEAAREDGVVPSSEFLWLKPVDRRLWYMLNCVGRQTPYSEVAGPFAHWKAEKEMGRRSLVPMIDEAIRALEIAVKEVRLTPRQMEELEP 235 T 18 DUF6015 pdbhh F Bacteria T 6exj 2 B,D B,D SSR2_RAT SSTR2 XDLQTSI 7 T 39 RLL pdbhh F Eukaryota T 6exn 22 V X Unassigned structure XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 95 F F F 6exv 13 M M AAMAT_AMAPH AMATOXIN XXGIGCNP 8 T 0.85 DUF3085 pdbhh F Eukaryota T 6ey3 1 A A CYS-ARG-PRO-LEU-TRP-THR-ALA-CYS-GLY CRPLWTACG 9 T 0.58 Tmpp129 pdbhh F T 6eyr 1 A,B A,B A0A0H3NMP8_SALTS Type III secretion system effector protein GPLGSYSHTATPAITLPSSGSANFAGVEYPLLPLDQHTPLLFQWFERNPSRFGENQIPIINTQQNPYLNNIINAAIIEKERTIGVLVDGNFSAGQKKALAKLEKQYENIKVIYNSDLDYSMYDKKLSDIYLENIAKIEAQPANVRDEYLLGEIKKSLNEVLKNNPEESLVSSHDKRLGHVRFDFYRNLFLLKGSNAFLEAGKHGCHHLQPGGGCIYLDADMLLTGKLGTLYLPDGIAVHVSRKGNSMSLENGIIAVNRSEHPALKKGLEIMHSKPYGDPYIDGVCGGLRHYFNCSIRHNYEEFCNFIEFKHEHIFMDTSSLTISS 325 T 1.8E-05 Glyco_transf_88 pdbhh F Bacteria T 6eys 1 A,B,C,D B,A,C,D A0A0H2ZBG1_PSEAB PvdP HHHHHHSSGLEVLFQGTTHRHLTREEEPQTPDEASLDLAATDGIRLGDRLRGLWDLRLVGGDAELPGLPREGLQLVLDVAPKGRGLIGYLDTPERLLAAEPPRFRVLGDLLGASSASIRWRLVDQASGSVAPTHDCSAVFDEVWADYANAGDGTLSGRIQRLERSPLSPNEDFRFVAVKRHFPLAHERIVLNEKLLGWLVSPQHRLFHQLWHASRDKWHRLSEKQRNALRGVGWQPGPLDRERDARGPRKDRNASGIDFFFMHRHMLHTARSMQDLPSWERLPRPVVPLEYDRPGFIRYFDNPDGFSVPPAWVAVDDDEYSEWLHGLKSAEAYHANFLVWESQYQDPAYLAKLTLGQFGSELELGMHDWLHMRWASVTRDPSNGAPVMTDRFPADFAPRWFRPENDFLGDPFSSHVNPVFWSFHGWIDDRIEDWYRAHERFHPGEVQRREVEGIQWFAPGRWVEVGDPWLGPATHGCGLSDVQASSNSVELDVETMKLALRIIFSEEDQLSGWLKRAPRRPWYARNLKLARDQLRR 536 T 0.0015 Hemocyanin_M pdbhh F Bacteria T 6eyt 1 A,B A,B A0A0H3NMP8_SALTS Type III secretion system effector protein GPLGSYSHTATPAITLPSSGSANFAGVEYPLLPLDQHTPLLFQWFERNPSRFGENQIPIINTQQNPYLNNIINAAIIEKERTIGVLVDGNFSAGQKKALAKLEKQYENIKVIYNSDLDYSMYDKKLSDIYLENIAKIEAQPANVRDEYLLGEIKKSLNEVLKNNPEESLVSSHDKRLGHVRFDFYRNLFLLKGSNAFLEAGKHGCHHLQPGGGCIYLDADMLLTGKLGTLYLPDGIAVHVSRKGNSMSLENGIIAVNRSEHPALKKGLEIMHSKPYGDPYIDGVCGGLRHYFNCSIRHNYEEFCNFIEFKHEHIFMDTSSLTISSWR 327 T 1.8E-05 Glyco_transf_88 pdbhh F Bacteria T 6eyv 1 A,B B,A A0A0H2ZBG1_PSEAB PvdP HHHHHHSSGLEVLFQGTTHRHLTREEEPQTPDEASLDLAATDGIRLGDRLRGLWDLRLVGGDAELPGLPREGLQLVLDVAPKGRGLIGYLDTPERLLAAEPPRFRVLGDLLGASSASIRWRLVDQASGSVAPTHDCSAVFDEVWADYANAGDGTLSGRIQRLERSPLSPNEDFRFVAVKRHFPLAHERIVLNEKLLGWLVSPQHRLFHQLWHASRDKWHRLSEKQRNALRGVGWQPGPLDRERDARGPRKDRNASGIDFFFMHRHMLHTARSMQDLPSWERLPRPVVPLEYDRPGFIRYFDNPDGFSVPPAWVAVDDDEYSEWLHGLKSAEAYHANFLVWESQYQDPAYLAKLTLGQFGSELELGMHDWLHMRWASVTRDPSNGAPVMTDRFPADFAPRWFRPENDFLGDPFSSHVNPVFWSFHGWIDDRIEDWYRAHERFHPGEVQRREVEGIQWFAPGRWVEVGDPWLGPATHGCGLSDVQASSNSVELDVETMKLALRIIFSEEDQLSGWLKRAPRRPWYARNLKLARDQLRR 536 T 0.0015 Hemocyanin_M pdbhh F Bacteria T 6eyx 1 A,B A,B Q9XJC1_9CAUD AcrIIa6 MKINDDIKELILEYMSRYFKFENDFYKLPGIKFTDANWQKFKNGGTDIEKMGAARVNAMLDCLFDDFELAMIGKAQTNYYNDNSLKMNMPFYTYYDMFKKQQLLKWLKNNRDDVIGGTGRMYTASGNYIANAYLEVALESSSLGSGSYMLQMRFKDYSKGQEPIPSGRQNRLEWIENNLENIRLE 185 T 0.062 PDH_E1_M pdb T Viruses T 6eyy 1 A,B A,B A0A1S5PRR0_9CAUD AcrIIa6 MKINDDIKELILEYMSRYFKFENDFYKLPGIKFTDANWQKFKNGGTDIEKMGAARVNAMLDCLFDDFELAMIGKAQTNYYNDNSLKMNMPFYTYYDMFKKQQLLKWLKNNRDDVIGGTGRMYTASGNYIANAYLEVALESSSLGSGSYMLQMRFKDYSKGQEPIPSGRQNRLEWIENNLENIR 183 T 0.06 PDH_E1_M pdb T Viruses T 6ezi 2 B B S15A2_HUMAN KIDNEY H(+)/PEPTIDE COTRANSPORTER,OLIGOPEPTIDE TRANSPORTER,KIDNEY ISOFORM,PEPTIDE TRANSPORTER 2 IKLETKKTKL 10 T 8.8 Ac76 unphh F Eukaryota F 6f08 2 C,F,G,H D,K,N,Q SOS1_HUMAN SOS-1 PRRRPESAPAESS 13 T 21 DUF2754 pdbhh F Eukaryota F 6f09 2 B,D,F,H A,B,C,D UBP8_HUMAN DEUBIQUITINATING ENZYME 8,UBIQUITIN ISOPEPTIDASE Y,HUBPY,UBIQUITIN THIOESTERASE 8,UBIQUITIN-SPECIFIC-PROCESSING PROTEASE 8 KLKRSYSSPDITQ 13 T 1.9 STAC2_u1 pdbhh F Eukaryota T 6f0f 2 B B ip2_s GAMGTLTPKEAELARRIRGAGGRTLNGFG 29 T 0.032 cIII pdb F T 6f0g 2 C,D C,D ip3 ASTERKWAELARRIRGAGGVTLNGFG 26 T 1.7 ELF pdbhh F T 6f0h 2 B,D B,D ip4 ASTEEKWARLARRIAGAGGVTLDGFG 26 T 1.2 DUF1654 pdbhh F T 6f0w 2 B S B3RXX2_TRIAD Hypoxia inducible factor, alpha subunit EKEDYDDLAPFVPPPSFDNRL 21 T 0.096 Pilt unppercent F Eukaryota T 6f0y 2 B B RT109_YEAST histone acetyltransferase Rtt109 C-terminus LAITMLKPRKKAKAL 15 T 2.9 SOXp pdbhh F Eukaryota T 6f1s 1 A A H7C664_CORGT CglIIR protein MPTRANVLDKRKVGNLSGGVNYFAADPRIKNVEALDKKLLAYLDKHGEDSTIGMRAIITILNAFTVDPNDLDLATFKAALLDFERNQPHLTARMVLRTNRKVNQGTGALLSPTDQALSRAEVAHPLLILYRIEGVNDAAAQRGEPTWSSDPIWVPNIKLPGQRQFWCVDGGHHHHHHG 178 T 0.024 Imm30 pdbpssm F Bacteria T 6f1t 6 M M Dynactin Subunit 2 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 589 F F F 6f1t 7 N N Dynactin Subunit 2 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 618 F F F 6f1t 8 O,P O,P Dynactin Subunit 3 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 65 F F F 6f1t 9 Q,R Q,R Dynactin Subunit 2 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 87 F F F 6f1t 13 V Y Dynactin Subunit 4 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 263 F F F 6f1t 14 W Z Dynactin Subunit 1 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 52 F F F 6f1t 24 SA z Dynactin Subunit 1 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 53 F F F 6f2r 5 J M Unknown peptide from HspB2 or HspB3 XXXXXXXXXX 10 F F F 6f2r 6 K N Unknown peptide from HspB2 or HspB3 XXXXXXX 7 F F F 6f2r 7 L O Unknown peptide from HspB2 or HspB3 XXXXXX 6 F F F 6f2r 8 P,Q,R 1,2,3 Unknown peptide from HspB2 or HspB3 XXXXX 5 F F F 6f2r 9 S,T,U W,X,Y Unknown peptide from HspB2 or HspB3 XXXXXXXXXXXX 12 F F F 6f34 2 B C MGTS_ECOLI MgtS MLGNMNVFMAVLGIILFSGFLAAYFSH 27 T 0.23 Gram_pos_anchor unp F Bacteria T 6f36 3 L N D7P897_9CHLO Mitochondrial ATP synthase subunit ASA6 MMLRTLTRSSAVAGQAVRLFKTSAAAAEGNSVAGIIKSVNETSGANLLSSLKTIKAQAAPIYPAAASSTGYSTQAKIALFGALSWILYRADGQSKAHEWIVDLNLNVLQAAWLISFSSLIPFRAVYFAFRGMAPATASTLNGLKTFSSISL 151 T 0.00025 NrsF pdbpercent F Eukaryota T 6f38 6 M M Dynactin Subunit 2 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 587 F F F 6f38 7 N N Dynactin Subunit 2 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 616 F F F 6f38 8 O,P O,P Dynactin Subunit 3 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 65 F F F 6f38 9 Q,R Q,R Dynactin Subunit 2 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 87 F F F 6f38 12 RA,U x,X HOOK3 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 223 F F F 6f38 13 V Y Dynactin Subunit 4 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 264 F F F 6f38 14 W Z Dynactin Subunit 1 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 52 F F F 6f38 23 SA z Dynactin Subunit 1 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 53 F F F 6f3a 6 M M Dynactin Subunit 2 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 587 F F F 6f3a 7 N N Dynactin Subunit 2 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 616 F F F 6f3a 8 O,P O,P Dynactin Subunit 3 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 65 F F F 6f3a 9 Q,R Q,R Dynactin Subunit 2 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 87 F F F 6f3a 12 U Y Dynactin Subunit 4 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 264 F F F 6f3a 13 V Z Dynactin Subunit 1 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 52 F F F 6f3a 20 HA z Dynactin Subunit 1 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 53 F F F 6f3a 21 IA,JA 5,6 BICD2N XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 275 F F F 6f46 1 A A B2CL1_HUMAN BCL2-L-1,APOPTOSIS REGULATOR BCL-X GSGESRKGQERFNRWFLTGMTVAGVVLLGSLFSRK 35 T 0.014 DUF3094 pdbpssm F Eukaryota T 6f4p 2 B B RS6_HUMAN PHOSPHOPROTEIN NP33,SMALL RIBOSOMAL SUBUNIT PROTEIN ES6 VPRRLGPKRASRIRKL 16 T 41 DUF6408 pdbhh F Eukaryota T 6f4q 2 B B RS6_HUMAN PHOSPHOPROTEIN NP33,SMALL RIBOSOMAL SUBUNIT PROTEIN ES6 VPRRLGPKRCSRIRKL 16 T 40 DUF6408 pdbhh F Eukaryota T 6f4r 2 B B RCCD1_HUMAN RCC1 domain-containing protein 1 CARAY 5 T 41 zf-met pdbhh F Eukaryota F 6f4s 2 B B RCCD1_HUMAN RCC1 domain-containing protein 1 CARAY 5 T 41 zf-met pdbhh F Eukaryota F 6f4t 2 B B RCCD1_HUMAN RCC1 domain-containing protein 1 CARAY 5 T 41 zf-met pdbhh F Eukaryota F 6f55 2 B B TRPV4_CHICK TRANSIENT RECEPTOR POTENTIAL CATION CHANNEL SUBFAMILY V MEMBER 4 TKGPAPNPPPILKVW 15 T 0.1 Ank_2 unppercent F Eukaryota T 6f5e 3 C C JIP1_MOUSE JNK-INTERACTING PROTEIN 1,ISLET-BRAIN-1,IB-1,JNK MAP KINASE SCAFFOLD PROTEIN 1,MITOGEN-ACTIVATED PROTEIN KINASE 8-INTERACTING PROTEIN 1 RPKRPTTLNLF 11 T 12 CTP-dep_RFKase pdbhh F Eukaryota T 6f5p 4 G G RPB1_HUMAN DNA-directed RNA polymerase subunit YSPTSPSYSPTSPSYSPTSPSYSPTSPS 28 T 3.8E-05 RNA_pol_Rpb1_R pdbhh F Eukaryota F 6f5p 5 H H ALA-ALA-ALA-ALA-ALA-ALA-ALA-ALA-ALA-ALA AAAAAAAAAA 10 T 200 FAD_oxidored pdbhh F F 6f61 1 A A A0A4P1LYD9_9ARAC purotoxin-6 GYCATKGIKCNDIHCCSGLKCDSKRKVCVKG 31 T 0.0041 Toxin_7 pdb F Eukaryota T 6f8f 2 B G PDX1_HUMAN PDX-1,GLUCOSE-SENSITIVE FACTOR,GSF,INSULIN PROMOTER FACTOR 1,IPF-1,INSULIN UPSTREAM FACTOR 1,IUF-1,ISLET/DUODENUM HOMEOBOX-1,IDX-1,SOMATOSTATIN-TRANSACTIVATING FACTOR 1,STF-1 PEQDCAVTSGE 11 T 2.5 Rieske_3 pdbhh F Eukaryota T 6f8g 2 E,F,G,H E,F,G,H PDX1_MESAU HOMEODOMAIN PROTEIN PDX1,INSULIN PROMOTER FACTOR 1,IPF-1 EPEQDSAVTSGE 12 T 52 DUF5577 pdbhh F Eukaryota T 6f9i 2 B,D X,C CSTN1_MOUSE ALCADEIN-ALPHA,ALC-ALPHA NATRQLEWDDSTLSY 15 T 0.0031 CDC45 unppercent F Eukaryota T 6f9w 2 B B 4ET_HUMAN EIF4E TRANSPORTER,EUKARYOTIC TRANSLATION INITIATION FACTOR 4E NUCLEAR IMPORT FACTOR 1 GPLGSGLAKWFGSDMLQQPLPSMPAKVISVDELEYRQ 37 T 0.18 AbfS_sensor pdbpssm F Eukaryota T 6fad 1 A,B,C,D A,B,C,D SRPK1_HUMAN SFRS PROTEIN KINASE 1,SERINE/ARGININE-RICH PROTEIN-SPECIFIC KINASE 1,SR-PROTEIN-SPECIFIC KINASE 1 GSHMPEQEEEILGSDDDEQEDPNDYCKGGYHLVKIGDLFNGRYHVIRKLGWGHFSTVWLSWDIQGKKFVAMKVVKSAEHYTETALDEIRLLKSVRNSDPNDPNREMVVQLLDDFKISGVNGTHICMVFEVLGHHLLKWIIKSNYQGLPLPCVKKIIQQVLQGLDYLHTKCRIIHTDIKPENILLSVNEQYIRRLAAEATEWQRSGAPPPSGSAVSTAPQPKPADKMSKNKKKKLKKKQKRQAELLEKRMQEIEEMEKESGPGQKRPNKQEESESPVERPLKENPPNKMTQEKLEESSTIGQDQTLMERDTEGGAAEINCNGVIEVINYTQNSNNETLRHKEDLHNANDCDVQNLNQESSFLSSQNGDSSTSQETDSCTPITSEVSDTMVCQSSSTVGQSFSEQHISQLQESIRAEIPCEDEQEQEHNGPLDNKGKSTAGNFLVNPLEPKNAEKLKVKIADLGNACWVHKHFTEDIQTRQYRSLEVLIGSGYNTPADIWSTACMAFELATGDYLFEPHSGEEYTRDEDHIALIIELLGKVPRKLIVAGKYSKEFFTKKGDLKHITKLKPWGLFEVLVEKYEWSQEEAAGFTDFLLPMLELIPEKRATAAECLRHPWLNS 618 T 1.3E-06 WaaY unphh F Eukaryota T 6fau 2 B B TAU_HUMAN ACE-ARG-THR-PRO-SEP-LEU-PRO-GLY XRTPSLPG 8 T 6.1 UPF0167 pdbhh F Eukaryota T 6fau 3 D D TAU_HUMAN THR-PRO-SEP-LEU-PRO-GLY TPSLPG 6 T 51 NPP pdbhh F Eukaryota F 6fav 2 B B TAU_HUMAN ACE-ARG-THR-PRO-SEP-LEU-PRO-GLY XRTPSLPG 8 T 6.1 UPF0167 pdbhh F Eukaryota T 6fav 3 D D TAU_HUMAN THR-PRO-SEP-LEU-PRO-GLY TPSLPG 6 T 51 NPP pdbhh F Eukaryota F 6faw 2 B B TAU_HUMAN ACE-ARG-THR-PRO-SEP-LEU-PRO-GLY XRTPSLPG 8 T 6.1 UPF0167 pdbhh F Eukaryota T 6faw 3 D D TAU_HUMAN THR-PRO-SEP-LEU-PRO-GLY TPSLPG 6 T 51 NPP pdbhh F Eukaryota F 6fbb 2 B P SHRM3_HUMAN Shroom3 SRSSP 5 T 2.2 CCDC14 unphh F Eukaryota F 6fbk 2 B P WNK1_HUMAN ERYTHROCYTE 65 KDA PROTEIN,P65,KINASE DEFICIENT PROTEIN,PROTEIN KINASE LYSINE-DEFICIENT 1,PROTEIN KINASE WITH NO LYSINE 1,HWNK1 LTQVVHSAGRRFIVSPVPESRLR 23 T 0.73 NUC pdbhh F Eukaryota T 6fbt 2 B E NAG-anhNAMpentapeptide AXXXX 5 F F F 6fbw 2 B,D B,D TAU_HUMAN ARG-THR-PRO-SEP-LEU-PRO-GLY RTPSLPG 7 T 4.1 UPF0167 pdbhh F Eukaryota T 6fby 2 B B TAU_HUMAN ACE-ARG-THR-PRO-SEP-LEU-PRO-GLY XRTPSLPG 8 T 6.1 UPF0167 pdbhh F Eukaryota T 6fby 3 D D TAU_HUMAN THR-PRO-SEP-LEU-PRO-GLY TPSLPG 6 T 51 NPP pdbhh F Eukaryota F 6fc1 2 B,D B,D EAP1_YEAST EIF4E-ASSOCIATED PROTEIN 1 GPHMTDPITNYKPMDLQYKTYAYSMNELYHLKPSLASASYEEDPLISELVRSLPKRKFWRLRMG 64 T 0.047 CNTF pdbpssm F Eukaryota T 6fc2 2 B,D B,D EAP1_YEAST EIF4E-ASSOCIATED PROTEIN 1 GPHMTDPITNYKPMDLQYKTYAYSMNELYHLKPSLASASYEEDPLISELVRSLPKRKFWRLRMG 64 T 0.047 CNTF pdbpssm F Eukaryota T 6fc6 2 B B BIM1_YEAST Protein BIM1 SNNLIIDEETF 11 T 1.8 DUF3797 pdbhh F Eukaryota T 6fce 1 A A ACP-HIS-DPHE-ARG-TRP-ASP-NH2 XHXRWDX 7 T 0.019 ACTH_domain pdbhh F F 6fcp 2 B P SHRM3_HUMAN SHROOM-RELATED PROTEIN,HSHRML AGPVHVRSRSSLATA 15 T 2.2 CCDC14 unphh F Eukaryota T 6fcr 2 B,C F,G ALA-DGL-API-DAL AXXX 4 F F F 6fcs 2 B B ALA-DGL-API-DAL-DAL AXXXX 5 F F F 6fdp 2 B B HS90A_HUMAN HEAT SHOCK 86 KDA,HSP86,LIPOPOLYSACCHARIDE-ASSOCIATED PROTEIN 2,LPS-ASSOCIATED PROTEIN 2,RENAL CARCINOMA ANTIGEN NY-REN-38 DTSRMEEVD 9 T 6.5 Clathrin_lg_ch pdbhh F Eukaryota T 6fdt 2 B B HS71B_HUMAN HEAT SHOCK 70 KDA PROTEIN 2,HSP70.2 SGPTIEEVD 9 T 6 DUF3567 pdbhh F Eukaryota T 6fe8 3 D D CBF3C_YEAST CHROMOSOME TRANSMISSION FIDELITY PROTEIN 13,KINETOCHORE PROTEIN CTF13 MGPSFNPVRFLELPIDIRKEVYFHLDGNFCGAHPYPIDILYKSNDVELPGKPSYKRSKRSKKLLRYMYPVFATYLNIFEYSPQLIEKWLEYAFWLRYDCLVLDCFKVNHLYDGTLIDALEWTYLDNELRLAYFNKASMLEVWYTFKEYKKWVIDSVAFDELDLLNVSNIQFNIDNLTPQLVDKCLSILEQKDLFATIGEVQFGQDEEVGEEKDVDVSGANSDENSSPSSTIKNKKRSASKRSHSDNGNVGATHNQLTSISVIRTIRSMESMKSLRKITVRGEKLYELLINFHGFRDNPGKTISYIVKRRINEIRLSRMNQISRTGLADFTRWDNLQKLVLSRVAYIDLNSIVFPKNFKSLTMKRVSKIKWWNIEENILKELKVDKRTFKSLYIKEDDSKFTKFFNLRHTRIKELDKSEINQITYLRCQAIVWLSFRTLNHIKLQNVSEVFNNIIVPRALFDSKRVEIYRCEKISQVLVIGSRSGSENLYFQGSKRRWKKNFIAVSAANRFKKISSSGAL 519 T 0.088 Glft2_N unppercent F Eukaryota T 6fec 32 FA d EUKARYOTIC TRANSLATION INITIATION FACTOR 2 BETA SUBUNIT (eIF2-Beta) SEKEYVEMLDRLYSKLP 17 T 0.77 DUF6103 pdbhh F T 6fel 2 E,F,G,H E,F,G,H KKCC2_HUMAN CAMKK 2,CALCIUM/CALMODULIN-DEPENDENT PROTEIN KINASE KINASE BETA,CAMKK BETA RSLSAPGN 8 T 13 DUF6439 pdbhh F Eukaryota T 6fgm 1 A A ALA-CYS-PHE-LEU-THR-ARG-LEU-GLY-THR-TYR-VAL-CYS ACFLTRLGTYVC 12 T 3.9 zf-C3HC pdbhh F T 6fgp 1 A A CCR5_HUMAN CCR5,CHEMR13,HIV-1 FUSION CORECEPTOR MDYQVSSPIYDINYYTSEPAQKINVKQ 27 T 6 Polysacc_syn_2C pdbhh F Eukaryota T 6fhi 6 F X TYR-SER-PRO-THR-SEP-PRO-SER-TYR-SER-PRO-SER-TYR-SER-PRO-THR-SEP-PRO-SER-TYR YSPTSPSYSPSYSPTSPSY 19 T 8.9E-05 RNA_pol_Rpb1_R pdbhh F F 6fhu 2 E,F,G F,H,G ALA-ARG-TAM ARX 3 T 1200 GH97_C pdbhh F F 6fi4 2 B B TAU_HUMAN PRO-SEP-LEU-PRO-DVA PSLPX 5 T 110 Get5_bdg pdbhh F Eukaryota F 6fi5 2 B B TAU_HUMAN THR-PRO-SEP-LEU-PRO-DAL TPSLPX 6 T 95 DUF5613 pdbhh F Eukaryota F 6fkp 2 E,F,G E,F,G ALA-ARG-THR-ALA-ALA-THR-ALA-ARG ARTAATARKS 10 T 150 Dppa2_A pdbhh F F 6fkr 53 AB 1y Tur1A peptide RRIRFRPPYLPRPGRRPRFPPP 22 T 13 Consortin_C pdbhh F T 6fky 2 C,D C,I 3(R)-(phenylthio)succinyl-CPS1 peptide XVLKEYGV 8 F F T 6fkz 2 C E 3(S)-(phenylthio)succinyl-CPS1 peptide XVLXEYGV 8 T 25 Sulf_coat_C pdbhh F T 6fkz 3 D H 3(R)-(phenylthio)succinyl-CPS1 peptide XVLXEYGV 8 T 25 Sulf_coat_C pdbhh F T 6flg 2 C C 3(S)-(naphthylthio)succinyl-CPS1 peptide XVLXEYGV 8 T 25 Sulf_coat_C pdbhh F T 6fm1 1 A,B A,B G3FFN6_9CAUD Adenylosuccinate synthetase MGSSHHHHHHSSGLVPRGSHMKNVDLVIDLQFGSTGKGLIAGYLAEKNGYDTVINANMPNAGHTYINAEGRKWMHKVLPNGIVSPNLKRVMLGAGSVFSINRLMEEIEMSKDLLHDKVAILIHPMATVLDEEAHKKAEVGIATSIGSTGQGSMAAMVEKLQRDPTNNTIVARDVAQYDGRIAQYVCTVEEWDMALMASERILAEGAQGFSLSLNQEFYPYCTSRDCTPARFLADMGIPLPMLNKVIGTARCHPIRVGGTSGGHYPDQEELTWEQLGQVPELTTVTKKVRRVFSFSFIQMQKAMWTCQPDEVFLNFCNYLSPMGWQDIVHQIEVAAQSRYCDAEVKYLGFGPTFNDVELREDVM 363 T 1.1E-68 Adenylsucc_synt pdbpercent T Viruses T 6fmb 1 A A N1JJ94_BLUG1 CSEP0064 putative effector protein AAAYWDCDGTEIPERNVRAAVVLAFNYRKESFHGYPATFIIGSTFSGVGEVRQFPVEDSDANWQGGAVKYYILTNKRGSYLEVFSSVGSGNKCTFVEG 98 T 18 T2SS_PulS_OutS unphh F Eukaryota T 6fmp 2 C C ACY-ASP-GLU-GLU-THR-GLY-GLU-PHE XDEETGEF 8 T 4 DUF4585 pdbhh F T 6fmq 2 C D ACY-SC1-GLU-THR-GLY-GLU-LEU XXETGEL 7 T 72 ssDBP_DBD pdbhh F F 6fms 2 E,F,G,H E,F,G,H Globomycin XXSXX 5 T 380 GLF pdbhh F F 6fnz 2 E,F,G E,F,G possible peptide PESSEG 6 T 130 Involucrin pdbhh F F 6fos 14 O O M1VFJ4_CYAM1 PsaM SSLRMFEVSDGEPYPLNPAVIFIALIGWSAVAAIPSNIPVLGGTGLTQAFLASIQRLLAQYPTGPKLDDPFWFYLIVYHVGLFALLIFGQIGYAGYAK 98 T 23 YbgT_YccB unphh F Eukaryota T 6fpf 1 A,B,C,D A,C,D,E CMU1_USTMA Chromosome 16, whole genome shotgun sequence MAAVSGKSEAAEIEAGDRLDALRDQLQRYETPIIQTILARSALGGRAPSEQDEVRAALSRNAFEPSEVISEWLQTESGARFRSTRPLPPAVEFITPVVLSRDTVLDKPVVGKGIFPIGRRPQDPTNMDEFLDTSLLSLNQSSTVDLASAVSLDVSLLHLVSARVLLGYPIALAKFDWLHDNFCHILTNTTLSKSQKLANIIQQLTDHKQEVNVLSRVEQKSKSLSHLFRNDIPYPPHTQDRILRLFQAYLIPITTQIEAAAILDHANKCTLEHHHHHH 278 T 0.26 CM_2 pdbpssm F Eukaryota T 6fpg 1 A,B,E,F C,B,F,G CMU1_USTMA Chromosome 16, whole genome shotgun sequence MAAVSGKSEAAEIEAGDRLDALRDQLQRYETPIIQTILARSALGGRAPSEQDEVRAALSRNAFEPSEVISEWLQTESGARFRSTRPLPPAVEFITPVVLSRDTVLDKPVVGKGIFPIGRRPQDPTNMDEFLDTSLLSLNQSSTVDLASAVSLDVSLLHLVSARVLLGYPIALAKFDWLHDNFCHILTNTTLSKSQKLANIIQQLTDHKQEVNVLSRVEQKSKSLSHLFRNDIPYPPHTQDRILRLFQAYLIPITTQIEAAAILDHANKCTLEHHHHHH 278 T 0.26 CM_2 pdbpssm F Eukaryota T 6fq4 2 B B Q824H6_CHLCV TarP-VBS1 LLEAARNTTTMLSKTLSKVC 20 T 0.02 SipA_VBS pdb F Bacteria T 6frj 2 B B APD-SeThr-RP APDXRPX 7 T 130 SP2 pdbhh F F 6frk 52 ZA t Signal sequence LLLLLLLLLLL 11 T 22 GRP pdbhh F F 6ft6 57 EB NN MPP6 XXXXXXXXXXX 11 F F F 6ftg 52 ZA 2 TMEM258 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 60 F F F 6ftg 56 DB 6 DAD1 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 97 F F F 6ftg 57 EB 7 OST48 XXXXXXXXXXXXXXXXXXXXXXXXX 25 F F F 6ftg 58 FB 8 RPN2 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 80 F F F 6fti 53 AB 2 TMEM258 AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 60 T 12000 DUF4699 pdbhh F F 6fti 57 EB 6 DAD1 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 109 F F F 6fti 58 FB 7 OST48 XXXXXXXXXXXXXXXXXXXXXXXXX 25 F F F 6fti 59 GB 8 RPN1 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 80 F F F 6fti 60 HB 0 Unidentified TM XXXXXXXXXXXXXXXXXXXXXXXX 24 F F F 6ftj 52 ZA 2 TMEM258 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 60 F F F 6ftj 56 DB 6 DAD1 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 97 F F F 6ftj 57 EB 7 OST48 XXXXXXXXXXXXXXXXXXXXXXXXX 25 F F F 6ftj 58 FB 8 RPN2 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 80 F F F 6fto 2 C C MIT1_SCHPO MI2-LIKE INTERACTING WITH CLR3 PROTEIN 1,SNF2/HDAC-CONTAINING REPRESSOR COMPLEX PROTEIN MIT1,SHREC PROTEIN MIT1 MPKEDDSLCKIVVRREPLDVLLPYYDASETTVQKILHENDSTLSVKFLAGVEALIKKDELDKYKNGKACLRVWLKHKSGKR 81 T 0.12 Mad3_BUB1_I pdb F Eukaryota T 6fu9 2 B,D B,D C4B8B8_MAGOR AVR-PIK PROTEIN ( PIKMPROTEIN,PIKP PROTEIN ),AVRPI7 PROTEIN METGNKYIEKRAIDLSRERDPNFFDHPGIPVPECFWFMFKNNVRQDAGTCYSSWKMDMKVGPNWVHIKSDDNCNLSGDFPPGWIVLGKKRPGF 93 T 0.1 TMEM18 unp F Eukaryota T 6fub 1 A B C4B8C2_MAGOR AVR-Pik protein METGNKYIEKRAIDLSRERDPNFFDNPGIPVPECFWFMFKNNVRQDAGTCYSSWKMDMKVGPNWVHIKSDDNCNLSGDFPPGWIVLGKKRPGF 93 T 0.086 TMEM18 unp F Eukaryota T 6fud 2 B B C4B8B9_MAGOR AVR-PIKM PROTEIN METGNKYIEKRAIDLSRERDPNFFDNADIPVPECFWFMFKNNVRQDAGTCYSSWKMDMKVGPNWVHIKSDDNCNLSGDFPPGWIVLGKKRPGF 93 T 0.34 DIM unppssm F Eukaryota T 6fvl 2 E,F,G,H H,I,J,K P7 peptide XQXDLF 6 T 81 Zn_peptidase pdbhh F F 6fvm 2 C,D H,I P7 peptide XQXDLF 6 T 81 Zn_peptidase pdbhh F F 6fvn 2 B,D,F,H J,K,H,I P7 peptide XQXDLF 6 T 81 Zn_peptidase pdbhh F F 6fvo 2 E,F,G,H H,I,J,K P7 peptide XQXDLF 6 T 81 Zn_peptidase pdbhh F F 6fwn 1 A A VWF_HUMAN VWF GSMASACEVVTGSPRGDSQSSWKSVGSQWASPENPCLINECVRVKEEVFIQQRNVSCPQLEVPVCPSGFQLSCKTSACCPSCRCE 85 T 2.2 Antistasin pdbhh F Eukaryota T 6fx1 1 A,B,C,D,E,F,G,H,I,J,K,L A,B,C,D,E,F,G,H,I,J,K,L A0A3B6UEU4_9AGAR lectin GAMAPVPVTKLVCDGDTYKCTAYLDFGDGRWVAQWDTNVFHTG 43 T 0.1 C2-set pdbhh F Eukaryota T 6fx2 1 A,B A,B A0A3B6UEU4_9AGAR lectin GAMAPVPVTKLVCDGDTYKCTAYLDFGDGRWVAQWDTNVFHTG 43 T 0.1 C2-set pdbhh F Eukaryota T 6fx3 1 A,B A,B A0A3B6UEU4_9AGAR lectin GAMAPVPVTKLVCDGDTYKCTAYLDFGDGRWVAQWDTNVFHTG 43 T 0.1 C2-set pdbhh F Eukaryota T 6fzf 2 C,D C,D PRGC1_HUMAN PPARGC-1-ALPHA,LIGAND EFFECT MODULATOR 6 EEPSLLKKLLLAPA 14 T 7 Cnl2_NKP2 pdbhh F Eukaryota T 6fzj 2 C,D C,D PPARGC-1-ALPHA,LIGAND EFFECT MODULATOR 6 EEPSLLKKLLLAPA 14 T 7 Cnl2_NKP2 pdbhh F T 6fzp 2 B C PRGC1_HUMAN PPARGC-1-ALPHA,LIGAND EFFECT MODULATOR 6 EEPSLLKKLLLAPA 14 T 7 Cnl2_NKP2 pdbhh F Eukaryota T 6fzq 2 B P MUC1_HUMAN MUC-1,BREAST CARCINOMA-ASSOCIATED ANTIGEN DF3,CANCER ANTIGEN 15-3,CA 15-3,CARCINOMA-ASSOCIATED MUCIN,EPISIALIN,H23AG,KREBS VON DEN LUNGEN-6,KL-6,PEMT,PEANUT-REACTIVE URINARY MUCIN,PUM,POLYMORPHIC EPITHELIAL MUCIN,PEM,TUMOR-ASSOCIATED EPITHELIAL MEMBRANE ANTIGEN,EMA,TUMOR-ASSOCIATED MUCIN APDTRP 6 T 170 DDE_Tnp_1_assoc pdbhh F Eukaryota F 6fzr 2 B P MUC1_HUMAN MUC-1,BREAST CARCINOMA-ASSOCIATED ANTIGEN DF3,CANCER ANTIGEN 15-3,CA 15-3,CARCINOMA-ASSOCIATED MUCIN,EPISIALIN,H23AG,KREBS VON DEN LUNGEN-6,KL-6,PEMT,PEANUT-REACTIVE URINARY MUCIN,PUM,POLYMORPHIC EPITHELIAL MUCIN,PEM,TUMOR-ASSOCIATED EPITHELIAL MEMBRANE ANTIGEN,EMA,TUMOR-ASSOCIATED MUCIN APDTRP 6 T 170 DDE_Tnp_1_assoc pdbhh F Eukaryota F 6g0o 2 B B ATRX_HUMAN ATP-DEPENDENT HELICASE ATRX,X-LINKED HELICASE II,X-LINKED NUCLEAR PROTEIN,XNP,ZNF-HX HFPXGIXQIKY 11 T 0.27 PI_PP_C pdbhh F Eukaryota T 6g0p 2 B B E2F1_HUMAN E2F-1,PBR3,RETINOBLASTOMA-ASSOCIATED PROTEIN 1,RBAP-1,RETINOBLASTOMA-BINDING PROTEIN 3,RBBP-3,PRB-BINDING PROTEIN E2F-1 HPGXGVXSPGEKSRYE 16 T 0.17 Cucumo_2B pdbhh F Eukaryota T 6g0q 2 B B GATA1_HUMAN ERYF1,GATA-BINDING FACTOR 1,GF-1,NF-E1 DNA-BINDING PROTEIN ASGXGKXKRGY 11 T 9.7 RELT pdbhh F Eukaryota T 6g10 2 C C C4B8B8_MAGOR AVR-PIK PROTEIN ( PIKMPROTEIN,PIKP PROTEIN ),AVRPI7 PROTEIN METGNKYIEKRAIDLSRERDPNFFDHPGIPVPECFWFMFKNNVRQDAGTCYSSWKMDMKVGPNWVHIKSDDNCNLSGDFPPGWIVLGKKRPGF 93 T 0.1 TMEM18 unp F Eukaryota T 6g11 1 A,D C,F C4B8C2_MAGOR AVR-Pik protein METGNKYIEKRAIDLSRERDPNFFDNPGIPVPECFWFMFKNNVRQDAGTCYSSWKMDMKVGPNWVHIKSDDNCNLSGDFPPGWIVLGKKRPGF 93 T 0.086 TMEM18 unp F Eukaryota T 6g2t 3 M,N,O,P M,N,O,P Tropomyosin XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 135 F F F 6g3j 3 C,F C,F MET-THR-SER-ALA-ILE-GLY-ILE-LEU-PRO-VAL MTSAIGILPV 10 T 5 CLLAC pdbhh F T 6g3k 3 C,F C,F ILE-THR-SER-GLY-ILE-GLY-VAL-LEU-PRO-VAL ITSGIGVLPV 10 T 2.9 NAD_binding_6 pdbhh F T 6g41 1 A,B,C,D,E,F,G,H,I,J A,B,C,D,E,F,G,H,I,J A0A1L4BKA3_9VIRU Minor capsid protein GAMGMKQYIWLNETIKSNKQLAGPRGSYKRPVSVDIFRSSTILDPDKNYLLIVEEFHLHKIRLPLFKPAGHDYQVGIFNRSTDEIMGVREVDFSTFVDEDGYMYDYVDVGTAINETLAGLCDGIIGEEDIPVFSFNKHSKKFEITTTENFRNGHFIMFNDDMRVDFNSFEFDDIDEEYSLVILNEDVETQDASTLEFLTPISHIVIESNDLPVSYELLPSISKNTTISDNTGVFLTNYKYLQQNNQDYNSILFRVENSSNKYHNILQTNFNRFNLSFTIYDYDNEKHPLTLLPQTVIQLKLLFESID 307 T 0.051 SIN1 unp T Viruses T 6g42 1 A,B,C,D,E A,B,C,D,E A0A1L4BKA3_9VIRU Minor capsid protein GAMGMKQYIWLNETIKSNKQLAGPRGSYKRPVSVDIFRSSTILDPDKNYLLIVEEFHLHKIRLPLFKPAGHDYQVGIFNRSTDEIMGVREVDFSTFVDEDGYMYDYVDVGTAINETLAGLCDGIIGEEDIPVFSFNKHSKKFEITTTENFRNGHFIMFNDDMRVDFNSFEFDDIDEEYSLVILNEDVETQDASTLEFLTPISHIVIESNDLPVSYELLPSISKNTTISDNTGVFLTNYKYLQQNNQDYNSILFRVENSSNKYHNILQTNFNRFNLSFTIYDYDNEKHPLTLLPQTVIQLKLLFESID 307 T 0.051 SIN1 unp T Viruses T 6g43 1 A,B,C A,B,C A0A1L4BK98_9VIRU Putative major capsid protein GAMGMNTPPELDTVLQAPYAYNWPTSKNVKIASRIGIPYSTFQTIQPVSDAPNNGIGQITFNQPLGNLTGGAPRLRVSFTAEIKNILADSSLKDQIGLKSFPVNRSIPVAVINMNGKTFTSYPAQLIKLHQYNADPLELALLSPCSDVDEYNKIKAVSMNNPYRQGTESTDSRMSRGLGCNYAYYIHPRAAGSTSVKIDFVVDEALVANPTQYKNIKDPVPFRNLNTFKVILDGQFKPENMIGIADDVKLVAGKADFEVDITGFKINMLVQNWVAPLEIGDIPKTIIYNTPLISLEGNISSMCLNTKDPYGIPGERNKHILTTHSMAMNNVPSMFAVMVSQETPTKKFAPDQLAGIIGLEIKVDSDVGIFRELEQQQLYELSSSNGYNKRFSCFSGALANGLTVADPAVAAGNKFKEAIFGAGSVIFFRPSDLGLKDYNVMANANKSINMQVQATFVTPEAAGTGAHYKLEVFSIRDNLTYSFEDGTFMDDLTLYTPDQLLRSPLKLTDDNNKLMRVMGG 520 T 16 DUF4223 pdbhh T Viruses T 6g44 1 A,B,C A,B,C A0A1L4BK98_9VIRU Putative major capsid protein GAMGMNTPPELDTVLQAPYAYNWPTSKNVKIASRIGIPYSTFQTIQPVSDAPNNGIGQITFNQPLGNLTGGAPRLRVSFTAEIKNILADSSLKDQIGLKSFPVNRSIPVAVINMNGKTFTSYPAQLIKLHQYNADPLELALLSPCSDVDEYNKIKAVSMNNPYRQGTESTDSRMSRGLGCNYAYYIHPRAAGSTSVKIDFVVDEALVANPTQYKNIKDPVPFRNLNTFKVILDGQFKPENMIGIADDVKLVAGKADFEVDITGFKINMLVQNWVAPLEIGDIPKTIIYNTPLISLEGNISSMCLNTKDPYGIPGERNKHILTTHSMAMNNVPSMFAVMVSQETPTKKFAPDQLAGIIGLEIKVDSDVGIFRELEQQQLYELSSSNGYNKRFSCFSGALANGLTVADPAVAAGNKFKEAIFGAGSVIFFRPSDLGLKDYNVMANANKSINMQVQATFVTPEAAGTGAHYKLEVFSIRDNLTYSFEDGTFMDDLTLYTPDQLLRSPLKLTDDNNKLMRVMGG 520 T 16 DUF4223 pdbhh T Viruses T 6g45 1 A,B,C A,B,C A0A1L4BK98_9VIRU Putative major capsid protein GAMGMNTPPELDTVLQAPYAYNWPTSKNVKIASRIGIPYSTFQTIQPVSDAPNNGIGQITFNQPLGNLTGGAPRLRVSFTAEIKNILADSSLKDQIGLKSFPVNRSIPVAVINMNGKTFTSYPAQLIKLHQYNADPLELALLSPCSDVDEYNKIKAVSMNNPYRQGTESTDSRMSRGLGCNYAYYIHPRAAGSTSVKIDFVVDEALVANPTQYKNIKDPVPFRNLNTFKVILDGQFKPENMIGIADDVKLVAGKADFEVDITGFKINMLVQNWVAPLEIGDIPKTIIYNTPLISLEGNISSMCLNTKDPYGIPGERNKHILTTHSMAMNNVPSMFAVMVSQETPTKKFAPDQLAGIIGLEIKVDSDVGIFRELEQQQLYELSSSNGYNKRFSCFSGALANGLTVADPAVAAGNKFKEAIFGAGSVIFFRPSDLGLKDYNVMANANKSINMQVQATFVTPEAAGTGAHYKLEVFSIRDNLTYSFEDGTFMDDLTLYTPDQLLRSPLKLTDDNNKLMRVMGGSFMGDVMTNFNHMAAHPVTKTVTKLLRNAGPLKDYAGDGTMMGNIASVYGYGKKKTTTRKKKGGEIVLLGSGKKGGKKLSDKQLHDLRNL 610 T 19 DUF4223 pdbhh T Viruses T 6g4s 32 FA s RRP12 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 800 F F F 6g4s 33 GA k Unknown AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 39 T 4300 Chorion_S16 pdbhh F F 6g4w 28 BA s RRP12 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 800 F F F 6g4w 31 EA k UNKNOWN HELIX AAAAAAAAAAAAAAAAAAA 19 T 410 Adeno_PIX pdbhh F F 6g52 1 A,B,C,D,E,F,G,H,I I,B,C,D,E,F,G,H,A CNNM4_HUMAN ANCIENT CONSERVED DOMAIN-CONTAINING PROTEIN 4,CYCLIN-M4 AGMKISPQLLLAAHRFLATEVSQFSPSLISEKILLRLLKYPDVIQELKFDEHNKYYARHYLYTRNKPADYFILILQGKVEVEAGKENMKFETGAFSYYGTMALTSVPSDRSPAHPTPLSRSASLSYPDRTDVSTAATLAGSSNQFGSSVLGQYISDFSVRALVDLQYIKITRQQYQNGLLASRMENSPQ 189 T 0.00033 cNMP_binding pdb F Eukaryota T 6g57 1 A,B,C,D A,B,C,D KCTD8_HUMAN BTB/POZ domain-containing protein KCTD8 SMAQDKRSGFLTLGYRGSYTTVRDNQADAKFRRVARIMVCGRIALAKEVFGDTLNESRDPDRQPEKYTSRFYLKFTYLEQAFDRLSEAGFHMVACNSSGTAAFVNQYRDDKIWSSYTEYIFFRP 124 T 0.61 GFRP pdbhh F Eukaryota T 6g5f 2 C P SYT1_HUMAN SYNAPTOTAGMIN I,SYTI,P65 GEGKEDAFSKLKEKFMNELHK 21 T 0.01 PRIMA1 unphh F Eukaryota T 6g5g 2 C P SYT2_HUMAN SYNAPTOTAGMIN II,SYTII GESQEDMFAKLKEKLFNEINK 21 T 0.64 DUF4312 pdbhh F Eukaryota T 6g5k 2 B,D C,33 SYT1_HUMAN SYNAPTOTAGMIN I,SYTI,P65 GEGKEDAFSKLKEKFMNELHK 21 T 0.01 PRIMA1 unphh F Eukaryota T 6g65 1 A,B,C,D,E,F,G,H,I,J,K,L A,B,C,D,E,F,G,H,I,J,K,L CC-Type2-VV XGEVAQAVKEVAKAVKEVAWAVKEVAQAVKGX 32 T 0.0064 MCPsignal pdbpssm F T 6g66 1 A,B,C,D,E,F,G A,B,C,D,E,F,G CC-Type2-IV XGEVAQAIKEVAKAIKEVAWAIKEVAQAIKGX 32 T 0.007 DUF1241 pdb F T 6g69 1 A,B,C,D,E,F,G,H,I,J,K,L,M,N A,B,C,D,E,F,G,H,I,J,K,L,M,N CC-Type2-IL-Sg-L17E XGELAQSIKELAKSIKEEAWSIKELAQSIKGX 32 T 0.18 HTH_52 pdbhh F T 6g6e 1 A,B,C,D,E,F,G A,B,C,D,E,F,G CC-Type2-deLI XGEIAQAXKEIAKAXKEIAWAXKEIAQAXKGX 32 T 2.8 DUF1328 pdbpssm F T 6g6g 1 A,B,C,D,E,F,G,H,I,J,K,L A,B,C,D,E,F,G,H,I,J,K,L CC-Type2-FI XGEIAQAFKEIAKAFKEIAWAFKEIAQAFKGX 32 T 0.029 WXG100 pdbpssm F T 6g6h 1 A,B,C,D,E A,B,C,D,E 5H2L_2.1-I9L XTQEYLLKELMKLLKEQIKLLKEQIKMLKELEKQX 35 T 0.027 DUF5320 pdbhh F T 6g6x 2 B P YAP1_HUMAN YES-ASSOCIATED PROTEIN 1,PROTEIN YORKIE HOMOLOG,YES-ASSOCIATED PROTEIN YAP65 HOMOLOG XRAHSSXASLQ 11 T 0.00014 FAM181 unp F Eukaryota T 6g84 2 B D CBK1_YEAST CBK1 FTDVPALNYPATPPPH 16 T 0.8 CbtA pdbhh F Eukaryota T 6g84 3 D C CBK1_YEAST CBK1 AFTDVPALNYPATPPPH 17 T 0.48 CbtA pdbhh F Eukaryota T 6g85 2 C C CBK1_YEAST CBK1 FTDVPALNYPATPPPH 16 T 0.8 CbtA pdbhh F Eukaryota T 6g85 3 D D CBK1_YEAST CBK1 AFTDVPALNYPATPPPH 17 T 0.48 CbtA pdbhh F Eukaryota T 6g86 2 C,D D,C SIC1_YEAST CDK INHIBITOR P40 PSTTKSFKNAPLLAPP 16 T 12 BHD_1 pdbhh F Eukaryota T 6g8i 2 B P YAP1_HUMAN ALA-HIS-SEP-SER-PRO-ALA-SER-LEU-GLN XXAHSSPASLQ 11 T 0.00014 FAM181 unp F Eukaryota T 6g8j 2 B P YAP1_HUMAN ACE-ARG-ALA-HIS-SEP-SER-PRO-BAL-SER-LEU-GLN XRAHSSPXSLQ 11 T 0.00014 FAM181 unp F Eukaryota T 6g8k 2 B P YAP1_HUMAN ACE-ARG-ALA-HIS-SEP-SER-PRO-ALA-BSE-LEU-GLN XRAHSSPAXLQ 11 T 0.00014 FAM181 unp F Eukaryota T 6g8l 2 B P YAP1_HUMAN ACE-ARG-ALA-HIS-SEP-SER-PRO-ALA-SER-BLE-GLN XRAHSSPASXQ 11 T 0.00014 FAM181 unp F Eukaryota T 6g8p 2 B P YAP1_HUMAN YES-ASSOCIATED PROTEIN 1,PROTEIN YORKIE HOMOLOG,YES-ASSOCIATED PROTEIN YAP65 HOMOLOG XRAHSSXASXQ 11 T 0.00014 FAM181 unp F Eukaryota T 6g8q 2 B P YAP1_HUMAN YES-ASSOCIATED PROTEIN 1,PROTEIN YORKIE HOMOLOG,YES-ASSOCIATED PROTEIN YAP65 HOMOLOG XRAHSSPXSLX 11 T 0.00014 FAM181 unp F Eukaryota T 6g90 22 V X Unknown XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 51 F F F 6g9q 5 E P DOPO_MOUSE DOPAMINE BETA-MONOOXYGENASE KAPYDYAPI 9 T 5.4 DUF1043 pdbhh F Eukaryota T 6g9r 3 I,J,K,L P,I,J,K DOPO_MOUSE DOPAMINE BETA-MONOOXYGENASE,MDBM KAPYDYAPI 9 T 5.4 DUF1043 pdbhh F Eukaryota T 6gaw 55 IB Bz unassigned secondary structure elements AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 82 T 14000 zf-H2C2_2 pdbhh F F 6gaw 74 BC AZ unassigned secondary structure elements AAAAAAAAAAAAAAAAAA 18 T 330 Campylo_MOMP pdbhh F F 6gaz 21 U AZ unassigned secondary structure elements AAAAAAAAAAAAAAAAAA 18 T 330 Campylo_MOMP pdbhh F F 6gb2 55 IB Bz unassigned secondary structure elements AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 82 T 14000 zf-H2C2_2 pdbhh F F 6gb5 4 G,H G,H GLY-LEU GL 2 T 470 Tachykinin pdbhh F F 6gb7 5 M,N P,R TPSN_MOUSE GLY-GLY-LEU-SER EDAGGGGLSK 10 T 11 DUF5672 pdbhh F Eukaryota T 6gc3 2 B B SEN1_YEAST TRNA-SPLICING ENDONUCLEASE POSITIVE EFFECTOR DDDEDDYTPSIS 12 T 2.4 CM1 pdbhh F Eukaryota T 6gch 1 A E CTRA_BOVIN GAMMA-CHYMOTRYPSIN A CGVPAIQPVLSGL 13 T 2 CaM_bind pdbhh F Eukaryota T 6gcs 18 R S NESM SUBUNIT XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 159 F F F 6gcs 23 W Z NUZM SUBUNIT XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 137 F F F 6gcs 24 X a NIAM SUBUNIT XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 100 F F F 6gcs 25 Y b NEBM SUBUNIT XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 64 F F F 6gcs 28 BA e NUUM SUBUNIT XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 45 F F F 6gcs 30 DA g B5FVF3_YARLI NI9M SUBUNIT MINANPGFWNGPFRYLRWSAHNRPHLFFAFAIGIAGPVAALTLTPLRRKYLYPDHSPLPQSYP 63 T 0.054 DUF998 pdb F Eukaryota T 6gcs 32 FA i UNKNOWN SUBUNIT XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 53 F F F 6gcs 34 HA n NUNM SUBUNIT XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 93 F F F 6gd5 2 B B THAN_PODMA Thanatin GSKKPVPIIYCNRRTGKCQRM 21 T 2.6 YihI pdbhh F Eukaryota T 6gdj 1 A,B A,B YHP6_SCHPO Mto2 GGPAPLSTMQTALMRLRTYHPSPIILKPVEQAVNHAITLVNTSPSSVVDALCRSLAELCLGLVQEAIDASILSQQESSNSLDLVRHTP 88 T 7.4 SPDY pdbhh F Eukaryota T 6gej 1 A Z Vacuolar protein sorting-associated protein 72 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 131 F F F 6gen 1 A Z Vacuolar protein sorting-associated protein 72 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 131 F F F 6gev 2 B B NCOA1_HUMAN NCOA-1,CLASS E BASIC HELIX-LOOP-HELIX PROTEIN 74,BHLHE74,PROTEIN HIN-2,RIP160,RENAL CARCINOMA ANTIGEN NY-REN-52,STEROID RECEPTOR COACTIVATOR 1,SRC-1 PQAQQKSLLQQLLTE 15 T 1.8 GFD1 pdbhh F Eukaryota T 6gf6 1 A,B A,B A0A140JXP0_CHICK Zona pellucida sperm-binding protein 1,Zona pellucida sperm-binding protein 1 DAAQPALLQYHYDCGDFGMQLLAYPTRGRTVHFKVLDEFGTRFEVANCSICMHWLNTGEDGGLIFSAGYEGCHVLVKDGRYVLRVQLEEMLLSGVVAASYEVQMTCPRPAGYEILRDEKVHHHHHHHHQRPDRGNS 136 T 0.2 Translat_reg pdbpercent F Eukaryota T 6gf7 1 A,B A,B A0A140JXP0_CHICK Zona pellucida sperm-binding protein 1,Zona pellucida sperm-binding protein 1 DAAQPALLQYHYDCGDFGMQLLAYPTRGRTVHFKVLDEFGTRFEVANCSICMHWLNTGEDGGLIFSAGYEGCHVLVKDGRYVLRVQLEEMLLSGVVAASYEVQMTCPRPAGYEILRDEKVHHHHHHHHQRPDRGNS 136 T 0.2 Translat_reg pdbpercent F Eukaryota T 6gf8 1 A,B A,B A0A140JXP0_CHICK Zona pellucida sperm-binding protein 1,Zona pellucida sperm-binding protein 1 DAAQPALLQYHYDCGDFGMQLLAYPTRGRTVHFKVLDEFGTRFEVANCSICMHWLNTGEDGGLIFSAGYEGCHVLVKDGRYVLRVQLEEMLLSGVVAASYEVQMTCPRPAGYEILRDEKVHHHHHHHHQRPDRGNS 136 T 0.2 Translat_reg pdbpercent F Eukaryota T 6gg8 2 B B NCOA1_HUMAN NCOA-1,CLASS E BASIC HELIX-LOOP-HELIX PROTEIN 74,BHLHE74,PROTEIN HIN-2,RIP160,RENAL CARCINOMA ANTIGEN NY-REN-52,STEROID RECEPTOR COACTIVATOR 1,SRC-1 PQAQQKSLLQQLLTE 15 T 1.8 GFD1 pdbhh F Eukaryota T 6ggg 2 B B NCOA1 peptide PQAQQKSLLQQLLTE 15 T 1.8 GFD1 pdbhh F T 6ghj 2 B B PHE-ALA-GLN FAQ 3 T 160 DUF6108 pdbhh F F 6ghp 2 B P KCNK9_HUMAN ACID-SENSITIVE POTASSIUM CHANNEL PROTEIN TASK-3,TWIK-RELATED ACID-SENSITIVE K(+) CHANNEL 3,TWO PORE POTASSIUM CHANNEL KT3.2,TWO PORE K(+) CHANNEL KT3.2 XKRRKSV 7 T 40 DUF4739 pdbhh F Eukaryota F 6gif 1 A A AAPA1_HELPY AapA1 MATKHGKNSWKTLYLKISFLGCKVVALLKR 30 T 0.81 protein_MS5 unphh F Bacteria T 6gig 1 A A AAPA1_HELPY AapA1 MATKHGKNSWKTLYLKISFLGCKVVVLLKR 30 T 0.81 protein_MS5 unphh F Bacteria T 6gij 1 A A TEMB_RANTE temporinB_KKG6A KKLLPIVANLLKSLL 15 T 3.9 Nup188_C pdbhh F Eukaryota T 6gik 1 A A temporinB_L1FK FLPIVGLLKSLLK 13 T 2.6 PSI_8 pdbhh F T 6gil 1 A A TEMB_RANTE Temporin-B LLPIVGNLLKSLL 13 T 1.8 DUF5665 pdbhh F Eukaryota T 6giq 22 FA m Unknown Cox subunit XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 31 F F F 6gje 2 B,C,D B,C,D CUBN_HUMAN 460 KDA RECEPTOR,INTESTINAL INTRINSIC FACTOR RECEPTOR,INTRINSIC FACTOR-COBALAMIN RECEPTOR,INTRINSIC FACTOR-VITAMIN B12 RECEPTOR GELELQRQKRSINLQQPRMATERGNLVFLTGSAQNIEFRTGSLGKIKLNDEDLSECLHQIQKNKEDIIELKGSAIGLPQNISSQIYQLNSKLVDLERKFQGLQQTVDKKV 110 T 0.0088 hEGF unppercent F Eukaryota T 6gjh 2 I K ALA-LEU-SER-ARG-GLN ALSRQ 5 T 190 DUF6105 pdbhh F F 6gjh 3 J J LEU-SER-GLY-VAL LSGV 4 T 200 SNase pdbhh F F 6gjh 4 K,L L,I ALA-LEU-SER-ARG ALSR 4 T 220 DUF5762 pdbhh F F 6gk8 3 C I TAU_HUMAN TAU PEPTIDE A7731 (RESIDUES 52-71) TEDGSEEPGSETSDAKSTPT 20 T 55 DUF6318 pdbhh F Eukaryota T 6gkf 2 I,J,K,L,M,N,O,P I,J,K,L,M,N,O,P CASP2_HUMAN Caspase-2 YDLSLPFP 8 T 1.4 TUG-UBL1 pdbhh F Eukaryota T 6gkg 2 I,J,K,L,M,N I,J,K,L,M,N CASP2_HUMAN Caspase-2 VEHSLDNK 8 T 38 Lectin_leg-like pdbhh F Eukaryota T 6gmg 2 C,D C,D PAPI_STRMB SPI DIPIGXKMT 9 F F Bacteria T 6gmh 21 U X CDC73 XXXXXXXXXXXXXXXX 16 F F F 6gml 14 N U NELFA_HUMAN NELF-A,WOLF-HIRSCHHORN SYNDROME CANDIDATE 2 PROTEIN MASMRESDTGLWLHNKLGATDELWAPPSIASLLTAAVIDNIRLCFHGLSSAVKLKLLLGTLHLPRRTVDEMKGALMEIIQLASLDSDPWVLMVADILKSFPDTGSLNLELEEQNPNVQDILGELREKVGECEASAMLPLECQYLNKNALTTLAGPLTPPVKHFQLKRKPKSATLRAELLQKSTETAQQLKRSAGVPFHAKGRGLLRKMDTTTPLKGIPKQAPFRSPTAPSVFSPTGNRTPIPPSRTLLRKERGVKLLDISELDMVGAGREAKRRRKTLDAEVVEKPAKEETVVENATPDYAAGLVSTQKLGSLNNEPALPSTSYLPSTPSVVPASSYIPSSETPPAPSSREASRPPEEPSAPSPTLPAQFKQRAPMYNSGLSPATPTPAAPTSPLTPTTPPAVAPTTQTPPVAMVAPQTQAPAQQQPKKNLSLTREQMFAAQEMFKTANKVTRPEKALILGFMAGSRENPCQEQGDVIQIKLSEHTEDLPKADGQGSTTMLVDTVFEMNYATGQWTRFKKYKPMTNVS 528 T 0.008 PRCC unppercent F Eukaryota T 6gmz 2 B B ALA-HIS-HIS-ALA AHHA 4 T 120 DUF5993 pdbhh F F 6go0 1 A A O32830_LACPN Plantaricin S beta protein KKKKQSWYAAAGDAIVSFGEGFLNAW 26 T 0.00052 LcnG-beta unppssm F Bacteria T 6gos 1 A A MCBA_ECOLX MCCB17 MGHHHHHHMELKASEFGVVLSVDALKLSRQSPLGVGIGGGGGGGGGXGGQGGXGXNXGGNGXGXGSHI 68 T 0.51 Dehydratase_MU pdbhh F Bacteria T 6gp7 2 C D PBP1A MSDQFNSREARRKANSK 17 T 5.2 Birna_RdRp pdbhh F T 6gpz 2 C E LmPBPA1 MADKPQTRSQYRNKQ 15 T 4.1 Pox_A12 pdbhh F T 6gqn 2 C,D C,G SpPBP2a TILRRSRSDRKKLA 14 T 6.4 DUF1408 pdbhh F T 6grg 1 A A MCBA_ECOLX MCCB17 MGHHHHHHMELKASEFGVVLSVDALKLSRQSPLGVGIGGGGGGGGGXGGQGGXGXNXGGNGXGXGSHI 68 T 0.51 Dehydratase_MU pdbhh F Bacteria T 6grh 1 A A MCBA_ECOLX MCCB17 MGHHHHHHMELKASEFGVVLSVDALKLSRQSPLGVGIGGGGGGGGGXGGQGG 52 T 0.86 FeoB_associated unphh F Bacteria T 6gs5 1 A A TEML_RANTE Temporin-L FVQWFSKFLGRIL 13 T 0.063 MOSC_N pdbhh F Eukaryota T 6gsa 3 D D CBF3C_YEAST CHROMOSOME TRANSMISSION FIDELITY PROTEIN 13,KINETOCHORE PROTEIN CTF13 MGPSFNPVRFLELPIDIRKEVYFHLDGNFCGAHPYPIDILYKSNDVELPGKPSYKRSKRSKKLLRYMYPVFATYLNIFEYSPQLIEKWLEYAFWLRYDCLVLDCFKVNHLYDGTLIDALEWTYLDNELRLAYFNKASMLEVWYTFKEYKKWVIDSVAFDELDLLNVSNIQFNIDNLTPQLVDKCLSILEQKDLFATIGEVQFGQDEEVGEEKDVDVSGANSDENSSPSSTIKNKKRSASKRSHSDNGNVGATHNQLTSISVIRTIRSMESMKSLRKITVRGEKLYELLINFHGFRDNPGKTISYIVKRRINEIRLSRMNQISRTGLADFTRWDNLQKLVLSRVAYIDLNSIVFPKNFKSLTMKRVSKIKWWNIEENILKELKVDKRTFKSLYIKEDDSKFTKFFNLRHTRIKELDKSEINQITYLRCQAIVWLSFRTLNHIKLQNVSEVFNNIIVPRALFDSKRVEIYRCEKISQVLVIGSRSGSENLYFQGSKRRWKKNFIAVSAANRFKKISSSGAL 519 T 0.088 Glft2_N unppercent F Eukaryota T 6gt7 1 A A Q54324_SULIS functional pRN1 primase TVVEFEELRKELVKRDSGKPVEKIKEEICTKSPPKLIKEIICENKTYADVNIDRSRGDWHVILYLMKHGVTDPDKILELLPRDSKAKENEKWNTQKYFVITLSKAWSVVKKYLEA 115 T 1E-08 pRN1_helical pdbpssm F Archaea T 6gvk 2 B B 230 KDA BULLOUS PEMPHIGOID ANTIGEN,230/240 KDA BULLOUS PEMPHIGOID ANTIGEN,BULLOUS PEMPHIGOID ANTIGEN 1,BULLOUS PEMPHIGOID ANTIGEN,DYSTONIA MUSCULORUM PROTEIN,HEMIDESMOSOMAL PLAQUE PROTEIN DSNENLLLVHCGPTLINSCISFGSESFDGH 30 T 13 DUF4556 pdbhh F T 6gvl 2 B B 230 KDA BULLOUS PEMPHIGOID ANTIGEN,230/240 KDA BULLOUS PEMPHIGOID ANTIGEN,BULLOUS PEMPHIGOID ANTIGEN 1,BULLOUS PEMPHIGOID ANTIGEN,DYSTONIA MUSCULORUM PROTEIN,HEMIDESMOSOMAL PLAQUE PROTEIN DSNENLLLVHCGPTLINSCISFGSESFDGH 30 T 13 DUF4556 pdbhh F T 6gvq 2 B B Q54324_SULIS functional pRN1 primase TVVEFEELRKELVKRDSGKPVEKIKEEICTKSPPKLIKEIICENKTYADVNIDRSRGDWHVILYLMKHGVTDPDKILELLPRDSKAKENEKWNTQKYFVITLSKAWSVVKKYLEA 115 T 1E-08 pRN1_helical pdbpssm F Archaea T 6gvu 2 B B Q54324_SULIS functional pRN1 primase TVVEFEELRKELVKRDSGKPVEKIKEEICTKSPPKLIKEIICENKTYADVNIDRSRGDWHVILYLMKHGVTDPDKILELLPRDSKAKENEKWNTQKYFVITLSKAWSVVKKYLEA 115 T 1E-08 pRN1_helical pdbpssm F Archaea T 6gvw 5 E,J E,J UIMC1_MOUSE RECEPTOR-ASSOCIATED PROTEIN 80,UBIQUITIN INTERACTION MOTIF-CONTAINING PROTEIN 1 GGGRHYYWGIPFCPAGVDPNQYTNVILCQLEVYQKSLKMAQRQLVKKRGFGEPVLPRPPFLIQN 64 T 6.2 PRTRC_E pdbhh F Eukaryota T 6gw7 1 A A A0A2A6XLY0_HELPX DNA protecting protein DprA MLKDYHLKEMPEMEDEFLEYCAKNPSYEEAYLKFGDKLLEYELLGKIKRINHIVVLAHH 59 T 1.5 DnaI_N pdbhh F Bacteria T 6gx9 2 C,D C,D CPSF6_HUMAN CLEAVAGE AND POLYADENYLATION SPECIFICITY FACTOR 68 KDA SUBUNIT,CPSF 68 KDA SUBUNIT,CLEAVAGE FACTOR IM COMPLEX 68 KDA SUBUNIT,CFIM68,PRE-MRNA CLEAVAGE FACTOR IM 68 KDA SUBUNIT,PROTEIN HPBRII-4/7 ESKSYGSGSRRERSRERDHSRSREKSRRHKSRSRDRHDDYYRERSRERERHRDRDRDRDRERDREREYRH 70 T 0.12 PRP38_assoc unppercent F Eukaryota T 6gxc 2 B B GLY-ASP-GLN-DAB-ALA-THR-PPN-GLY GDQXATXG 8 T 1.3 S-AdoMet_synt_M pdbhh F T 6gy2 2 C,D C,D BRCA2_HUMAN Phosphopeptide of BRCA2 WSSSLATPPTLSSTVLI 17 T 18 CoV_NSP4_C pdbhh F Eukaryota T 6gym 29 CA Z Unknown protein XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 43 F F F 6gyp 2 B A CBF3C_YEAST CHROMOSOME TRANSMISSION FIDELITY PROTEIN 13,KINETOCHORE PROTEIN CTF13 MPSFNPVRFLELPIDIRKEVYFHLDGNFCGAHPYPIDILYKSNDVELPGKPSYKRSKRSKKLLRYMYPVFATYLNIFEYSPQLIEKWLEYAFWLRYDCLVLDCFKVNHLYDGTLIDALEWTYLDNELRLAYFNKASMLEVWYTFKEYKKWVIDSVAFDELDLLNVSNIQFNIDNLTPQLVDKCLSILEQKDLFATIGEVQFGQDEEVGEEKDVDVSGANSDENSSPSSTIKNKKRSASKRSHSDNGNVGATHNQLTSISVIRTIRSMESMKSLRKITVRGEKLYELLINFHGFRDNPGKTISYIVKRRINEIRLSRMNQISRTGLADFTRWDNLQKLVLSRVAYIDLNSIVFPKNFKSLTMKRVSKIKWWNIEENILKELKVDKRTFKSLYIKEDDSKFTKFFNLRHTRIKELDKSEINQITYLRCQAIVWLSFRTLNHIKLQNVSEVFNNIIVPRALFDSKRVEIYRCEKISQVLVI 478 T 0.088 Glft2_N pdbpercent F Eukaryota T 6gys 1 A,H A,H CBF3C_YEAST CHROMOSOME TRANSMISSION FIDELITY PROTEIN 13,KINETOCHORE PROTEIN CTF13 MPSFNPVRFLELPIDIRKEVYFHLDGNFCGAHPYPIDILYKSNDVELPGKPSYKRSKRSKKLLRYMYPVFATYLNIFEYSPQLIEKWLEYAFWLRYDCLVLDCFKVNHLYDGTLIDALEWTYLDNELRLAYFNKASMLEVWYTFKEYKKWVIDSVAFDELDLLNVSNIQFNIDNLTPQLVDKCLSILEQKDLFATIGEVQFGQDEEVGEEKDVDVSGANSDENSSPSSTIKNKKRSASKRSHSDNGNVGATHNQLTSISVIRTIRSMESMKSLRKITVRGEKLYELLINFHGFRDNPGKTISYIVKRRINEIRLSRMNQISRTGLADFTRWDNLQKLVLSRVAYIDLNSIVFPKNFKSLTMKRVSKIKWWNIEENILKELKVDKRTFKSLYIKEDDSKFTKFFNLRHTRIKELDKSEINQITYLRCQAIVWLSFRTLNHIKLQNVSEVFNNIIVPRALFDSKRVEIYRCEKISQVLVI 478 T 0.088 Glft2_N pdbpercent F Eukaryota T 6gyt 3 C C H4_XENLA Histone H4 GLGXGGAXA 9 T 11 Shadoo unppercent F Eukaryota F 6gyu 2 B A CBF3C_YEAST CHROMOSOME TRANSMISSION FIDELITY PROTEIN 13,KINETOCHORE PROTEIN CTF13 MPSFNPVRFLELPIDIRKEVYFHLDGNFCGAHPYPIDILYKSNDVELPGKPSYKRSKRSKKLLRYMYPVFATYLNIFEYSPQLIEKWLEYAFWLRYDCLVLDCFKVNHLYDGTLIDALEWTYLDNELRLAYFNKASMLEVWYTFKEYKKWVIDSVAFDELDLLNVSNIQFNIDNLTPQLVDKCLSILEQKDLFATIGEVQFGQDEEVGEEKDVDVSGANSDENSSPSSTIKNKKRSASKRSHSDNGNVGATHNQLTSISVIRTIRSMESMKSLRKITVRGEKLYELLINFHGFRDNPGKTISYIVKRRINEIRLSRMNQISRTGLADFTRWDNLQKLVLSRVAYIDLNSIVFPKNFKSLTMKRVSKIKWWNIEENILKELKVDKRTFKSLYIKEDDSKFTKFFNLRHTRIKELDKSEINQITYLRCQAIVWLSFRTLNHIKLQNVSEVFNNIIVPRALFDSKRVEIYRCEKISQVLVI 478 T 0.088 Glft2_N pdbpercent F Eukaryota T 6gzj 2 B B MAG_MOUSE SIGLEC-4A SEKRLGSERRLLGLRGESPELDLSYSHSDLGKRPTKDSYTLTEELAEYAEIRVK 54 T 0.23 DAG1 unphh F Eukaryota T 6gzl 2 B B MAG_MOUSE PRO-THR-LYS-ASP-SER-TYR-THR-LEU-THR-GLU-GLU-LEU KRPTKDSYTLTEELAEY 17 T 15 IFNGR1 unphh F Eukaryota T 6h06 3 I,J,K,L I,G,J,K TAU_HUMAN NEUROFIBRILLARY TANGLE PROTEIN,PAIRED HELICAL FILAMENT-TAU,PHF-TAU SPRHLSNVSSTGSIDMVDSPQLATLA 26 T 25 BAGE pdbhh F Eukaryota T 6h0b 2 C F ALA-THR-GLY-ALA-GLY-ALA-GLY-ALA-GLY-THR-THR-PRO-GLY-PRO-GLY GATGAGAGAGTTPGPG 16 T 45 MSP1_C pdbhh F F 6h0e 3 I,J,K,L I,G,J,K TAU_MOUSE NEUROFIBRILLARY TANGLE PROTEIN,PAIRED HELICAL FILAMENT-TAU,PHF-TAU RHLSNVSSTGSIDMVDSPQLATLA 24 T 24 BAGE pdbhh F Eukaryota T 6h0i 1 A A F4RME6_MELLP Secreted protein MCEFIEDSEDIQGLKSLRKSHTSLEDDDDGSRGGDCEGCSGTACSSDAQCRARGCDGCSTSGVCVLSSLHHHHHH 75 T 0.1 SPDY unppssm F Eukaryota T 6h1e 3 C S Histone H4 peptide SGRGKGGKGLGKGGAKRHRKV 21 T 11 Shadoo unppercent F T 6h1q 1 A,B A,B B4EYH7_PROMH Fimbrial adhesin IADPLVVTPPPMNFDGAADGTPAGTPITSTWIGETSVHNGFKCEKKFLQKCWVETLYANATGSKISGIYYYEGSNRYPVYSLPGVKGIGYAFGLKDNNDSVAYVPIDVDNGSGATVIYPAVGSTVNHNVDRVSLKGKVVFVVTDKHLETGVYNIPYTVIANTWSEYGGGHKGNNTSIVAINPVTITAHHHHHH 193 T 0.25 TMEM151 unppssm F Bacteria T 6h22 2 C,D C,D Stapled peptide XLTFAEYWAQLASX 14 T 0.11 PBP-Tp47_a pdbhh F T 6h2a 2 D H Possible protein degradation product GGGGGG 6 T 36 Sperm_act_pep pdbhh F F 6h41 2 B B VAL-ASP-GLU-CYS-TRP-ARG-ILE-ILE-ALA-SER-HIS-THR-TRP-PHE-CYS-ALA-GLU-GLU VDECWRIIASHTWFCAEE 18 T 2.6 DUF3750 pdbhh F T 6h48 1 A A A0A2S6DEV9_STAAU STL GPGKKREVTIEEIGEFHEKYLKLLFTNLETHNDRKKALAEIEKLKEESIYLGEKLRLVPNHHYDAIKGKPMYKLYLYEYPDRLEHQKKIILEKDTN 96 T 0.00083 LRRFIP pdb F Bacteria T 6h4b 2 B B Q9F0J8_STAAU Orf20 GPGKKREVTIEEIGEFHEKYLKLLFTNLETHNDRKKALAEIEKLKEESIYLGEKLRLVPNHHYDAIKGKPMYKLYLYEYPDRLEHQKKIILEKDTN 96 T 0.00083 LRRFIP pdb F Bacteria T 6h4k 1 A A UBP25_HUMAN DEUBIQUITINATING ENZYME 25,USP ON CHROMOSOME 21,UBIQUITIN THIOESTERASE 25,UBIQUITIN-SPECIFIC-PROCESSING PROTEASE 25 GPSHEHEDKSPETVLQSAIKLEYARLVKLAQEDTPPETDYRLHHVVVYFIQNQAPKKIIEKTLLEQFGDRNLSFDERCHNIMKVAQAKLEMIKPEEVNLEEYEEWHQDYRKFRETTMYLIIGLENFQRESYIDSLLFLICAYQNNKELLSKGLYRGHDEELISHYRRECLLKLNEQAAELFESGEDREVNNGLIIMNEFIVPFLPLLLVDEMEEKDILAVEDMRNRWCSYLGQEMEPHLQEKLTDFLPKLLDCSMEIKSFHEPPKLPSYSTHELCERFARIMLSLSRTPADGR 293 T 0.21 Imm30 pdb F Eukaryota T 6h6a 2 B,D,F E,H,K LCK_HUMAN GLY-CYS-GLY-CYS-SER-SER GCGCSSHPED 10 T 1.2 EPV_E5 pdbhh F Eukaryota T 6h7b 2 B,D B,D Q4Q7R3_LEIMA HIS-HIS-MET-ASN-PRO-ASN-ALA-THR-GLU-PHE-MET-PRO HHMNPNATEFMPGR 14 T 9.4E-05 PAM2 pdbhh F Eukaryota T 6h7i 1 A A Trpzip2 SWTWENGKWTWKX 13 T 0.64 Chibby pdbhh F T 6h7q 1 A A Trpzip2 SWTWENGKWTWKX 13 T 0.64 Chibby pdbhh F T 6h82 1 A,A10,A11,A12,A13,A14,A15,A16,A17,A18,A19,A2,A20,A21,A22,A23,A24,A25,A26,A27,A28,A29,A3,A30,A31,A32,A33,A34,A35,A36,A37,A38,A39,A4,A40,A41,A42,A43,A44,A45,A46,A47,A48,A49,A5,A50,A51,A52,A53,A54,A55,A56,A57,A58,A59,A6,A60,A7,A8,A9,C,C10,C11,C12,C13,C14,C15,C16,C17,C18,C19,C2,C20,C21,C22,C23,C24,C25,C26,C27,C28,C29,C3,C30,C31,C32,C33,C34,C35,C36,C37,C38,C39,C4,C40,C41,C42,C43,C44,C45,C46,C47,C48,C49,C5,C50,C51,C52,C53,C54,C55,C56,C57,C58,C59,C6,C60,C7,C8,C9,E,E10,E11,E12,E13,E14,E15,E16,E17,E18,E19,E2,E20,E21,E22,E23,E24,E25,E26,E27,E28,E29,E3,E30,E31,E32,E33,E34,E35,E36,E37,E38,E39,E4,E40,E41,E42,E43,E44,E45,E46,E47,E48,E49,E5,E50,E51,E52,E53,E54,E55,E56,E57,E58,E59,E6,E60,E7,E8,E9,G,G10,G11,G12,G13,G14,G15,G16,G17,G18,G19,G2,G20,G21,G22,G23,G24,G25,G26,G27,G28,G29,G3,G30,G31,G32,G33,G34,G35,G36,G37,G38,G39,G4,G40,G41,G42,G43,G44,G45,G46,G47,G48,G49,G5,G50,G51,G52,G53,G54,G55,G56,G57,G58,G59,G6,G60,G7,G8,G9,I,I10,I11,I12,I13,I14,I15,I16,I17,I18,I19,I2,I20,I21,I22,I23,I24,I25,I26,I27,I28,I29,I3,I30,I31,I32,I33,I34,I35,I36,I37,I38,I39,I4,I40,I41,I42,I43,I44,I45,I46,I47,I48,I49,I5,I50,I51,I52,I53,I54,I55,I56,I57,I58,I59,I6,I60,I7,I8,I9,K,K10,K11,K12,K13,K14,K15,K16,K17,K18,K19,K2,K20,K21,K22,K23,K24,K25,K26,K27,K28,K29,K3,K30,K31,K32,K33,K34,K35,K36,K37,K38,K39,K4,K40,K41,K42,K43,K44,K45,K46,K47,K48,K49,K5,K50,K51,K52,K53,K54,K55,K56,K57,K58,K59,K6,K60,K7,K8,K9,M,M10,M11,M12,M13,M14,M15,M16,M17,M18,M19,M2,M20,M21,M22,M23,M24,M25,M26,M27,M28,M29,M3,M30,M31,M32,M33,M34,M35,M36,M37,M38,M39,M4,M40,M41,M42,M43,M44,M45,M46,M47,M48,M49,M5,M50,M51,M52,M53,M54,M55,M56,M57,M58,M59,M6,M60,M7,M8,M9,O,O10,O11,O12,O13,O14,O15,O16,O17,O18,O19,O2,O20,O21,O22,O23,O24,O25,O26,O27,O28,O29,O3,O30,O31,O32,O33,O34,O35,O36,O37,O38,O39,O4,O40,O41,O42,O43,O44,O45,O46,O47,O48,O49,O5,O50,O51,O52,O53,O54,O55,O56,O57,O58,O59,O6,O60,O7,O8,O9,Q,Q10,Q11,Q12,Q13,Q14,Q15,Q16,Q17,Q18,Q19,Q2,Q20,Q21,Q22,Q23,Q24,Q25,Q26,Q27,Q28,Q29,Q3,Q30,Q31,Q32,Q33,Q34,Q35,Q36,Q37,Q38,Q39,Q4,Q40,Q41,Q42,Q43,Q44,Q45,Q46,Q47,Q48,Q49,Q5,Q50,Q51,Q52,Q53,Q54,Q55,Q56,Q57,Q58,Q59,Q6,Q60,Q7,Q8,Q9,S,S10,S11,S12,S13,S14,S15,S16,S17,S18,S19,S2,S20,S21,S22,S23,S24,S25,S26,S27,S28,S29,S3,S30,S31,S32,S33,S34,S35,S36,S37,S38,S39,S4,S40,S41,S42,S43,S44,S45,S46,S47,S48,S49,S5,S50,S51,S52,S53,S54,S55,S56,S57,S58,S59,S6,S60,S7,S8,S9,U,U10,U11,U12,U13,U14,U15,U16,U17,U18,U19,U2,U20,U21,U22,U23,U24,U25,U26,U27,U28,U29,U3,U30,U31,U32,U33,U34,U35,U36,U37,U38,U39,U4,U40,U41,U42,U43,U44,U45,U46,U47,U48,U49,U5,U50,U51,U52,U53,U54,U55,U56,U57,U58,U59,U6,U60,U7,U8,U9,Y,Y10,Y11,Y12,Y13,Y14,Y15,Y16,Y17,Y18,Y19,Y2,Y20,Y21,Y22,Y23,Y24,Y25,Y26,Y27,Y28,Y29,Y3,Y30,Y31,Y32,Y33,Y34,Y35,Y36,Y37,Y38,Y39,Y4,Y40,Y41,Y42,Y43,Y44,Y45,Y46,Y47,Y48,Y49,Y5,Y50,Y51,Y52,Y53,Y54,Y55,Y56,Y57,Y58,Y59,Y6,Y60,Y7,Y8,Y9 A,A,A,A,A,A,A,A,A,A,A,A,A,A,A,A,A,A,A,A,A,A,A,A,A,A,A,A,A,A,A,A,A,A,A,A,A,A,A,A,A,A,A,A,A,A,A,A,A,A,A,A,A,A,A,A,A,A,A,A,C,C,C,C,C,C,C,C,C,C,C,C,C,C,C,C,C,C,C,C,C,C,C,C,C,C,C,C,C,C,C,C,C,C,C,C,C,C,C,C,C,C,C,C,C,C,C,C,C,C,C,C,C,C,C,C,C,C,C,C,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,B,Q,Q,Q,Q,Q,Q,Q,Q,Q,Q,Q,Q,Q,Q,Q,Q,Q,Q,Q,Q,Q,Q,Q,Q,Q,Q,Q,Q,Q,Q,Q,Q,Q,Q,Q,Q,Q,Q,Q,Q,Q,Q,Q,Q,Q,Q,Q,Q,Q,Q,Q,Q,Q,Q,Q,Q,Q,Q,Q,Q,P,P,P,P,P,P,P,P,P,P,P,P,P,P,P,P,P,P,P,P,P,P,P,P,P,P,P,P,P,P,P,P,P,P,P,P,P,P,P,P,P,P,P,P,P,P,P,P,P,P,P,P,P,P,P,P,P,P,P,P,O,O,O,O,O,O,O,O,O,O,O,O,O,O,O,O,O,O,O,O,O,O,O,O,O,O,O,O,O,O,O,O,O,O,O,O,O,O,O,O,O,O,O,O,O,O,O,O,O,O,O,O,O,O,O,O,O,O,O,O,U,U,U,U,U,U,U,U,U,U,U,U,U,U,U,U,U,U,U,U,U,U,U,U,U,U,U,U,U,U,U,U,U,U,U,U,U,U,U,U,U,U,U,U,U,U,U,U,U,U,U,U,U,U,U,U,U,U,U,U,Z,Z,Z,Z,Z,Z,Z,Z,Z,Z,Z,Z,Z,Z,Z,Z,Z,Z,Z,Z,Z,Z,Z,Z,Z,Z,Z,Z,Z,Z,Z,Z,Z,Z,Z,Z,Z,Z,Z,Z,Z,Z,Z,Z,Z,Z,Z,Z,Z,Z,Z,Z,Z,Z,Z,Z,Z,Z,Z,Z,V,V,V,V,V,V,V,V,V,V,V,V,V,V,V,V,V,V,V,V,V,V,V,V,V,V,V,V,V,V,V,V,V,V,V,V,V,V,V,V,V,V,V,V,V,V,V,V,V,V,V,V,V,V,V,V,V,V,V,V,H,H,H,H,H,H,H,H,H,H,H,H,H,H,H,H,H,H,H,H,H,H,H,H,H,H,H,H,H,H,H,H,H,H,H,H,H,H,H,H,H,H,H,H,H,H,H,H,H,H,H,H,H,H,H,H,H,H,H,H,G,G,G,G,G,G,G,G,G,G,G,G,G,G,G,G,G,G,G,G,G,G,G,G,G,G,G,G,G,G,G,G,G,G,G,G,G,G,G,G,G,G,G,G,G,G,G,G,G,G,G,G,G,G,G,G,G,G,G,G,X,X,X,X,X,X,X,X,X,X,X,X,X,X,X,X,X,X,X,X,X,X,X,X,X,X,X,X,X,X,X,X,X,X,X,X,X,X,X,X,X,X,X,X,X,X,X,X,X,X,X,X,X,X,X,X,X,X,X,X H9AZX2_9VIRU VP4 QTQEYTINHTGGVLGDSYVTTASNQTSPQRETAVLSFECPRKFEEINYVGQRDATRFVPRTTESITGSANDDTVVDLTANIQPVAGEEVIAEQDYPVAVAYNVTQGVEVDVVDADYAADTVTLGTNPADGDEVKVWPIMSDGDVQFRLINQFGQEEGRVYPWSTPLYRWHDFPQLKRGREINLHGSASWSENETLEILLDAPQALTWEDSDYPRGQYVTTLEQDVEITL 229 T 6.3 DUF1344 unphh T Viruses T 6h82 2 B,B10,B11,B12,B13,B14,B15,B16,B17,B18,B19,B2,B20,B21,B22,B23,B24,B25,B26,B27,B28,B29,B3,B30,B31,B32,B33,B34,B35,B36,B37,B38,B39,B4,B40,B41,B42,B43,B44,B45,B46,B47,B48,B49,B5,B50,B51,B52,B53,B54,B55,B56,B57,B58,B59,B6,B60,B7,B8,B9,H,H10,H11,H12,H13,H14,H15,H16,H17,H18,H19,H2,H20,H21,H22,H23,H24,H25,H26,H27,H28,H29,H3,H30,H31,H32,H33,H34,H35,H36,H37,H38,H39,H4,H40,H41,H42,H43,H44,H45,H46,H47,H48,H49,H5,H50,H51,H52,H53,H54,H55,H56,H57,H58,H59,H6,H60,H7,H8,H9,N,N10,N11,N12,N13,N14,N15,N16,N17,N18,N19,N2,N20,N21,N22,N23,N24,N25,N26,N27,N28,N29,N3,N30,N31,N32,N33,N34,N35,N36,N37,N38,N39,N4,N40,N41,N42,N43,N44,N45,N46,N47,N48,N49,N5,N50,N51,N52,N53,N54,N55,N56,N57,N58,N59,N6,N60,N7,N8,N9,R,R10,R11,R12,R13,R14,R15,R16,R17,R18,R19,R2,R20,R21,R22,R23,R24,R25,R26,R27,R28,R29,R3,R30,R31,R32,R33,R34,R35,R36,R37,R38,R39,R4,R40,R41,R42,R43,R44,R45,R46,R47,R48,R49,R5,R50,R51,R52,R53,R54,R55,R56,R57,R58,R59,R6,R60,R7,R8,R9 D,D,D,D,D,D,D,D,D,D,D,D,D,D,D,D,D,D,D,D,D,D,D,D,D,D,D,D,D,D,D,D,D,D,D,D,D,D,D,D,D,D,D,D,D,D,D,D,D,D,D,D,D,D,D,D,D,D,D,D,T,T,T,T,T,T,T,T,T,T,T,T,T,T,T,T,T,T,T,T,T,T,T,T,T,T,T,T,T,T,T,T,T,T,T,T,T,T,T,T,T,T,T,T,T,T,T,T,T,T,T,T,T,T,T,T,T,T,T,T,J,J,J,J,J,J,J,J,J,J,J,J,J,J,J,J,J,J,J,J,J,J,J,J,J,J,J,J,J,J,J,J,J,J,J,J,J,J,J,J,J,J,J,J,J,J,J,J,J,J,J,J,J,J,J,J,J,J,J,J,W,W,W,W,W,W,W,W,W,W,W,W,W,W,W,W,W,W,W,W,W,W,W,W,W,W,W,W,W,W,W,W,W,W,W,W,W,W,W,W,W,W,W,W,W,W,W,W,W,W,W,W,W,W,W,W,W,W,W,W H9AZX1_9VIRU VP7 PEIGNNGAEKQISLHKGQPFIDTQDVGAADPNTPAVTIEGPSDYVIAIDAGTPVAPEFRDANGDKLDPSTRVTIQKCDKQGNPLGDGIVFSDTLGRFEYSKMRSDPDYMRKTTTSLMIDEREIVKIFVEVPPNANGMDADNSRITIGDDTSDYGKAVGIVEHGDLSPAESKA 172 T 0.82 aRib unppercent T Viruses T 6h82 3 D,D10,D11,D12,D13,D14,D15,D16,D17,D18,D19,D2,D20,D21,D22,D23,D24,D25,D26,D27,D28,D29,D3,D30,D31,D32,D33,D34,D35,D36,D37,D38,D39,D4,D40,D41,D42,D43,D44,D45,D46,D47,D48,D49,D5,D50,D51,D52,D53,D54,D55,D56,D57,D58,D59,D6,D60,D7,D8,D9,F,F10,F11,F12,F13,F14,F15,F16,F17,F18,F19,F2,F20,F21,F22,F23,F24,F25,F26,F27,F28,F29,F3,F30,F31,F32,F33,F34,F35,F36,F37,F38,F39,F4,F40,F41,F42,F43,F44,F45,F46,F47,F48,F49,F5,F50,F51,F52,F53,F54,F55,F56,F57,F58,F59,F6,F60,F7,F8,F9,J,J10,J11,J12,J13,J14,J15,J16,J17,J18,J19,J2,J20,J21,J22,J23,J24,J25,J26,J27,J28,J29,J3,J30,J31,J32,J33,J34,J35,J36,J37,J38,J39,J4,J40,J41,J42,J43,J44,J45,J46,J47,J48,J49,J5,J50,J51,J52,J53,J54,J55,J56,J57,J58,J59,J6,J60,J7,J8,J9,L,L10,L11,L12,L13,L14,L15,L16,L17,L18,L19,L2,L20,L21,L22,L23,L24,L25,L26,L27,L28,L29,L3,L30,L31,L32,L33,L34,L35,L36,L37,L38,L39,L4,L40,L41,L42,L43,L44,L45,L46,L47,L48,L49,L5,L50,L51,L52,L53,L54,L55,L56,L57,L58,L59,L6,L60,L7,L8,L9,P,P10,P11,P12,P13,P14,P15,P16,P17,P18,P19,P2,P20,P21,P22,P23,P24,P25,P26,P27,P28,P29,P3,P30,P31,P32,P33,P34,P35,P36,P37,P38,P39,P4,P40,P41,P42,P43,P44,P45,P46,P47,P48,P49,P5,P50,P51,P52,P53,P54,P55,P56,P57,P58,P59,P6,P60,P7,P8,P9,T,T10,T11,T12,T13,T14,T15,T16,T17,T18,T19,T2,T20,T21,T22,T23,T24,T25,T26,T27,T28,T29,T3,T30,T31,T32,T33,T34,T35,T36,T37,T38,T39,T4,T40,T41,T42,T43,T44,T45,T46,T47,T48,T49,T5,T50,T51,T52,T53,T54,T55,T56,T57,T58,T59,T6,T60,T7,T8,T9,V,V10,V11,V12,V13,V14,V15,V16,V17,V18,V19,V2,V20,V21,V22,V23,V24,V25,V26,V27,V28,V29,V3,V30,V31,V32,V33,V34,V35,V36,V37,V38,V39,V4,V40,V41,V42,V43,V44,V45,V46,V47,V48,V49,V5,V50,V51,V52,V53,V54,V55,V56,V57,V58,V59,V6,V60,V7,V8,V9,Z,Z10,Z11,Z12,Z13,Z14,Z15,Z16,Z17,Z18,Z19,Z2,Z20,Z21,Z22,Z23,Z24,Z25,Z26,Z27,Z28,Z29,Z3,Z30,Z31,Z32,Z33,Z34,Z35,Z36,Z37,Z38,Z39,Z4,Z40,Z41,Z42,Z43,Z44,Z45,Z46,Z47,Z48,Z49,Z5,Z50,Z51,Z52,Z53,Z54,Z55,Z56,Z57,Z58,Z59,Z6,Z60,Z7,Z8,Z9 E,E,E,E,E,E,E,E,E,E,E,E,E,E,E,E,E,E,E,E,E,E,E,E,E,E,E,E,E,E,E,E,E,E,E,E,E,E,E,E,E,E,E,E,E,E,E,E,E,E,E,E,E,E,E,E,E,E,E,E,F,F,F,F,F,F,F,F,F,F,F,F,F,F,F,F,F,F,F,F,F,F,F,F,F,F,F,F,F,F,F,F,F,F,F,F,F,F,F,F,F,F,F,F,F,F,F,F,F,F,F,F,F,F,F,F,F,F,F,F,R,R,R,R,R,R,R,R,R,R,R,R,R,R,R,R,R,R,R,R,R,R,R,R,R,R,R,R,R,R,R,R,R,R,R,R,R,R,R,R,R,R,R,R,R,R,R,R,R,R,R,R,R,R,R,R,R,R,R,R,S,S,S,S,S,S,S,S,S,S,S,S,S,S,S,S,S,S,S,S,S,S,S,S,S,S,S,S,S,S,S,S,S,S,S,S,S,S,S,S,S,S,S,S,S,S,S,S,S,S,S,S,S,S,S,S,S,S,S,S,K,K,K,K,K,K,K,K,K,K,K,K,K,K,K,K,K,K,K,K,K,K,K,K,K,K,K,K,K,K,K,K,K,K,K,K,K,K,K,K,K,K,K,K,K,K,K,K,K,K,K,K,K,K,K,K,K,K,K,K,M,M,M,M,M,M,M,M,M,M,M,M,M,M,M,M,M,M,M,M,M,M,M,M,M,M,M,M,M,M,M,M,M,M,M,M,M,M,M,M,M,M,M,M,M,M,M,M,M,M,M,M,M,M,M,M,M,M,M,M,N,N,N,N,N,N,N,N,N,N,N,N,N,N,N,N,N,N,N,N,N,N,N,N,N,N,N,N,N,N,N,N,N,N,N,N,N,N,N,N,N,N,N,N,N,N,N,N,N,N,N,N,N,N,N,N,N,N,N,N,a,a,a,a,a,a,a,a,a,a,a,a,a,a,a,a,a,a,a,a,a,a,a,a,a,a,a,a,a,a,a,a,a,a,a,a,a,a,a,a,a,a,a,a,a,a,a,a,a,a,a,a,a,a,a,a,a,a,a,a H9AZX1_9VIRU VP7 PEIGNNGAEKQISLHKGQPFIDTQDVGAADPNTPAVTIEGPSDYVIAIDAGTPVAPEFRDANGDKLDPSTRVTIQKCDKQGNPLGDGIVFSDTLGRFEYSKMRSDPDYMRKTTTSLMIDEREIVKIFVEVPPNANGMDADNSRITIGDDTSDYGKAVGIVEHGDLSPAESKAVRQ 175 T 0.82 aRib unppercent T Viruses T 6h82 4 AA,AA2,AA3,AA4,AA5,AA6,AA7,AA8,AA9,W,W10,W11,W12,W13,W14,W15,W16,W17,W18,W19,W2,W20,W21,W22,W23,W24,W25,W26,W27,W28,W29,W3,W30,W31,W32,W33,W34,W35,W36,W37,W38,W39,W4,W40,W41,W42,W43,W44,W45,W46,W47,W48,W49,W5,W50,W51,W52,W53,W54,W55,W56,W57,W58,W59,W6,W60,W7,W8,W9,X,X10,X11,X12,X13,X14,X15,X16,X17,X18,X19,X2,X20,X21,X22,X23,X24,X25,X26,X27,X28,X29,X3,X30,X31,X32,X33,X34,X35,X36,X37,X38,X39,X4,X40,X41,X42,X43,X44,X45,X46,X47,X48,X49,X5,X50,X51,X52,X53,X54,X55,X56,X57,X58,X59,X6,X60,X7,X8,X9 Y,Y,Y,Y,Y,Y,Y,Y,Y,L,L,L,L,L,L,L,L,L,L,L,L,L,L,L,L,L,L,L,L,L,L,L,L,L,L,L,L,L,L,L,L,L,L,L,L,L,L,L,L,L,L,L,L,L,L,L,L,L,L,L,L,L,L,L,L,L,L,L,L,I,I,I,I,I,I,I,I,I,I,I,I,I,I,I,I,I,I,I,I,I,I,I,I,I,I,I,I,I,I,I,I,I,I,I,I,I,I,I,I,I,I,I,I,I,I,I,I,I,I,I,I,I,I,I,I,I,I,I,I H9AZX1_9VIRU VP7 IGNNGAEKQISLHKGQPFIDTQDVGAADPNTPAVTIEGPSDYVIAIDAGTPVAPEFRDANGDKLDPSTRVTIQKCDKQGNPLGDGIVFSDTLGRFEYSKMRSDPDYMRKTTTSLMIDEREIVKIFVEVPPNANGMDADNSRITIGDDTSDYGKAVGIVEHG 161 T 0.82 aRib unppercent T Viruses T 6h82 5 BA b H9AZX8_9VIRU Uncharacterized protein QTADGRVGLVPVNSYVTLETDDLDTDEHPVTDAGTVALEPGESAPIVRYDLGQPAAVYAVGATDEANVEYELKVNNSKTVGGRTNSPLGVLNTPFSFVEKLGGAIPCETAATYWAHYSSDATGTVELAGRMHIEV 135 T 0.035 T4BSS_DotH_IcmK pdb T Viruses T 6h82 6 CA g VP16 (vertex complex) XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 217 F F F 6h82 7 DA,EA c,d GPS III XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 105 F F F 6h82 8 FA m polypeptide stretch (vertex complex) XXXXXXXXXXXXXXXXXXXXXXXXX 25 F F F 6h8c 2 B B UBA5_HUMAN UBIQUITIN-ACTIVATING ENZYME 5,THIFP1,UFM1-ACTIVATING ENZYME,UBIQUITIN-ACTIVATING ENZYME E1 DOMAIN-CONTAINING PROTEIN 1 GAMEIIHEDNEWGIELVSE 19 T 5.7 DUF2172 pdbhh F Eukaryota T 6h8k 10 J E NUEM protein XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 195 F F F 6h8k 16 P,XA Z,r Unknown polypeptide XXXXXXXXXXXXXXXXX 17 F F F 6h8k 17 LB,Q AF,Y Unknown polypeptide XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 40 F F F 6h8k 18 R U Unknown polypeptide XXXXXXXXXX 10 F F F 6h8k 19 S X Unknown polypeptide XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 57 F F F 6h8k 20 T W Unknown polypeptide XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 54 F F F 6h8k 21 U V Unknown polypeptide XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 63 F F F 6h8k 22 OB,SA,V AI,m,T Unknown polypeptide XXXXXXXXXXXXXXXXXXXX 20 F F F 6h8k 23 PB,RB,W AJ,AL,S Unknown polypeptide XXXXXXXXXXXXXXXXXXX 19 F F F 6h8k 24 X R Unknown polypeptide XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 50 F F F 6h8k 25 Y Q Unknown polypeptide XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 70 F F F 6h8k 26 Z P Unknown polypeptide XXXXXXXXXXXXXXXXXXXXXXXXXXXX 28 F F F 6h8k 27 AA,IB,LA,NB,SB F,AC,f,AH,AM Unknown polypeptide XXXXXXXXXXXXXXXXXX 18 F F F 6h8k 28 BA,RA O,l Unknown polypeptide XXXXXXXXXXXXXXXXXXXXXXXXX 25 F F F 6h8k 29 CA M Unknown polypeptide XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 51 F F F 6h8k 30 DA D Unknown polypeptide XXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 30 F F F 6h8k 31 EA J Unknown polypeptide XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 69 F F F 6h8k 32 FA N Unknown polypeptide XXXXXXXXXXXXXXX 15 F F F 6h8k 33 GA,OA a,i Unknown polypeptide XXXXXXXXXXXXXXXXXXXXXXXXXX 26 F F F 6h8k 34 HA b Unknown polypeptide XXXXXXXXXXXXXXXXXXXXXX 22 F F F 6h8k 35 IA,KB,MA,QB,UB c,AE,g,AK,AO Unknown polypeptide XXXXXXXXX 9 F F F 6h8k 36 HB,JA,TB,UA AB,d,AN,o Unknown polypeptide XXXXXXXXXXXXXXXX 16 F F F 6h8k 37 CB,FB,KA w,z,e Unknown polypeptide XXXXXXXXXXXXX 13 F F F 6h8k 38 NA h Unknown polypeptide XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 47 F F F 6h8k 39 PA j Unknown polypeptide XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 48 F F F 6h8k 40 QA,YA k,s Unknown polypeptide XXXXXXXXXXXXXXXXXXXXXXX 23 F F F 6h8k 41 TA,WA n,q Unknown polypeptide XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 36 F F F 6h8k 42 VA p Unknown polypeptide XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 76 F F F 6h8k 43 ZA t Unknown polypeptide XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 45 F F F 6h8k 44 AB u Unknown polypeptide XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 32 F F F 6h8k 45 BB,EB v,y Unknown polypeptide XXXXXXXXXXX 11 F F F 6h8k 46 DB,MB x,AG Unknown polypeptide XXXXXXXX 8 F F F 6h8k 47 GB AA Unknown polypeptide XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 58 F F F 6h8k 48 JB AD Unknown polypeptide XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 39 F F F 6h8p 2 C,D C,D H14_HUMAN HISTONE H1B,HISTONE H1S-4 TPVKKKARKSAGAAK 15 T 0.2 DUF5797 unp F Eukaryota T 6h8q 2 C,D G,H SCC1_YEAST Sister chromatid cohesion protein 1 TDAMTESQPKQTGTRRNSKLLNTKSIQIDEETENSESIASSNTYKEERSNNLLTPQPTNFTTKRLWSEITESMSYLPDPILKNFLSYESLKKRKIHNGRE 100 T 0.12 ABC2_membrane_7 pdb F Eukaryota T 6h9c 1 A,AA,BA,CA,DA,EA,FA,S,T,U,V,W,X,Y,Z D,M,N,I,L,a,Y,F,E,R,T,S,W,K,J A0A1C7A3R1_9VIRU VP7 MGNIGNLSAEKQISLYDGQPFISEQDVAAGDPNTPALTIEGPDGYVIAVDAGTPIAPEFRDSNGEKLDPSTRVIVQKCDRQGNPLGDGIIFNDTLGRFNYNKMRTDPDYMRKTAKSLMVDEREIVKVFVDVPDGANGYDAERSRFTLGDDTSDFGKAVEIVDHDDLTEGETQAVKSASQRSGGA 184 T 0.27 aRib pdbpercent T Viruses T 6h9c 2 B b A0A1C7A3R7_9VIRU VP9 MRDNQDLLVKRLGRLVNVLESKEFGGTTTVDKDLDVTKNVTRTDEPNEDNTPDYFSTGKDRVLVPDTEEWERLGFGIVAKTVNVRTTDDVLLAFANPNTNGPTFKIRSNESPFTIGGDAGIDTAFMWLKKAESAQNDPAVEIIAYR 146 T 0.058 Aft1_OSA unppercent T Viruses T 6h9c 3 C c GPS-III molecule located underneath the capsomer close to the icosahedral three-fold axis. XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 108 F F F 6h9c 4 D d GPS-II protein located underneath the two-tower capsomer NOT sitting on the icosahedral 2-fold axis. XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 75 F F F 6h9c 5 E e (Half) GPS-II protein located underneath the two-tower capsomer sitting ON the icosahedral 2-fold axis. XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 46 F F F 6h9c 6 F f Peripentonal unknown polypeptide XXXXXXXXXXXXXXXXXX 18 F F F 6h9c 7 G,H,I,J,K,L,M,N,O,P,Q,R Z,A,B,C,P,Q,O,V,U,H,G,X A0A1C7A3R2_9VIRU VP4 MADQTQEYTLSHTGGLLGSSKVTTASNQTAPQRETAIISFEVPRKFSEIEYVGQRDATRFVPRTTEEITGTANDDTVVQLQANIQPIAGEEDMADQDYPVVVAYNVTQGAQVEIADVNYATDEVTLATDPADGDTVKLWPIMGDGEVQFRLVNQFGQEEGRVYPWATPLYRWHDFPQLKRGREINLHGSVTWQENETVEVLLDAPQAITWEDADYPEGQYVSTFEQDVEITL 232 T 6.3 DUF1344 pdbhh T Viruses T 6h9h 1 A,B B,A Q5NWP0_AROAE Csf5 MGQQHLLRFALPAGKKLWPNDLREALAKHDLPPLFFSRDPQTGHAITRAMRNEKRVRGYIEQHGHEPPPPTEEQRANPLAIPGIRIVGSSTWVGILATGERYKPLLEAATLPAIQIVTQRCGRGVGVELEQHTLSIKGLDDPKRYFVRNLVMKRGLTKTAENTTQVASRILSALERQAVAYSLDLPPTAQVDIHVESVVRPRGMRLVTSTGATEQFVGLADVEFYACLDLKGYWFAGNLTSRGYGRIIADHPAMSTGRYAHHHHHH 266 T 0.0038 Cas6b_C unphh F Bacteria T 6h9i 1 A,B A,B Q5NWP0_AROAE Csf5 MGQQHLLRFALPAGKKLWPNDLREALAKHDLPPLFFSRDPQTGHAITRAMRNEKRVRGYIEQHGHEPPPPTEEQRANPLAIPGIRIVGSSTWVGILATGERYKPLLEAATLPAIQIVTQRCGRGVGVELEQHTLSIKGLDDPKRYFVRNLVMKRGLTKTAENTTQVASRILSALERQAVAYSLDLPPTAQVDIHVESVVRPRGMRLVTSTGATEQFVGLADVEFYACLDLKGYWFAGNLTSRGYGRIIADHPAMSTGRYAHHHHHH 266 T 0.0038 Cas6b_C unphh F Bacteria T 6hc0 1 A A Q6MPU8_BDEBA DgcB N-terminus KTSIVAS 7 T 7.3 Clink pdbhh F Bacteria T 6hc1 2 C C Q6MPU8_BDEBA DgcB N-terminus, phosphorylated LEKTSIVASDTX 12 T 5.2 Zea_mays_MuDR pdbhh F Bacteria T 6hc2 2 B,D,F,H,J,L,N,P,R,T,V,X B,D,F,H,J,L,N,P,R,T,V,X NUMA1_HUMAN NUCLEAR MATRIX PROTEIN-22,NMP-22,NUCLEAR MITOTIC APPARATUS PROTEIN,NUMA PROTEIN,SP-H ANTIGEN GPLGSPDYGNSALLSLPGYRPTTRSSARRSQAGVSSGAPPGRNSFYMGTCQDEPEQLDDWNRIAELQQRNR 71 T 0.072 zf-FPG_IleRS pdbpssm F Eukaryota T 6hcf 86 HC 1 nascent chain AAAAAAAAAAAAAAAAAAAAAA 22 T 460 DUF4699 pdbhh F F 6hcj 85 GC 1 nascent chain AAAAAAAAAAAAAAAAAAAAAA 22 T 460 DUF4699 pdbhh F F 6hcm 86 HC 1 nascent chain AAAAAAAAAAAAAAAAAAAAAA 22 T 460 DUF4699 pdbhh F F 6hcq 85 GC 1 nascent chain AAAAAAAAAAAAAAAAAAAAAA 22 T 460 DUF4699 pdbhh F F 6hcs 2 B,D,F,H B,D,F,H KCC2B_RAT CAMK-II SUBUNIT BETA LKKFNARRKLKGAILTTMLATRNFS 25 T 13 PACT_coil_coil pdbhh F Eukaryota T 6hd7 46 TA r ribosomal protein RPL1 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 210 F F F 6hd7 51 YA z nascent polypeptide chain XXXXXXXXXXXXXXXXXXXXXXX 23 F F F 6hem 1 A A UBP25_HUMAN DEUBIQUITINATING ENZYME 25,USP ON CHROMOSOME 21,UBIQUITIN THIOESTERASE 25,UBIQUITIN-SPECIFIC-PROCESSING PROTEASE 25 GPDFSKHLKEETIQIITKASHEHEDKSPETVLQSAIKLEYARLVKLAQEDTPPETDYRLHHVVVYFIQNQAPKKIIEKTLLEQFGDRNLSFDERCHNIMKVAQAKLEMIKPEEVNLEEYEEWHQDYRKFRETTMYLIIGLENFQRESYIDSLLFLICAYQNNKELLSKGLYRGHDEELISHYRRECLLKLNEQAAELFESGEDREVNNGLIIMNEFIVPFLPLLLVDEMEEKDILAVEDMRNRWCSYLGQEMEPHLQEKLTDFLPKLLDCSMEIKSFHEPPKLPSYSTHELCERFARIMLSLS 303 T 0.22 Imm30 pdb F Eukaryota T 6hfa 2 C,D C,D LM266, 1-[(2~{S})-2-azanyl-3-methyl-butyl]urea TSFAEYWXXXX 11 T 3.9 PBP-Tp47_a pdbhh F T 6hhs 2 H M Probable C-terminal region of MamM CTD E289D - Cadmium form XXXXXXXX 8 F F F 6hiv 1 A DA Q57UJ2_TRYB2 ms48 MMRRVASTGSNGGHCYVGSSLWADSVYMEVSSITLQKRFSFKYATKLQHDEMRQPYYIHEKRYGIFSNERNIAKARRGLPFITPLYTKHMNLWDTDTDASNNRFFRGYYYGQRELHQLLGRPHSLEATNADGSNDLSTYEANTSQLYKGIPRPAITNLHYEPAWRYTLYQAGAHGAQLSNPRSPFTAKVLGDELMQVRDIKSVEHCKAWFDRLQYLINLHYEAVGDIGEFKSRHTRHVHEFFVAFHDALSSLDFRDTYLFDQFKAVRPPELSDLFGIFLEMEANYVHEDYCPRCSLPYSTTRYCGEGDVDTPFRKHRGRWAPHQKWGREWYDVVARRAEALWYRATEDPYFGTPQHTQRQAEALLKVYVQAKQRGKAIDFMNKLRGSVEYLTGDITITTVMQESYDALLDTTPHPHLLTNGFTLESDAAKYTGEVNKAPLSPLQFRIDMEMNKYRRQQKEEGVVRVPPALWRLDTSAIVPYKVDSKTRRIVNWREVKEGIEKSFLSVGLPKEAYTGNEWREMLYLHDVIANREAKAAVLEQQQKLDREKLKARERASLKGEKSSNSASNGIGIIFPDKDGYQVFFVDEASLRPFGIGASGTLFKAVARVYPSPAAVPYDDPVHGKQFLIDTTDETCHLFGGFEHGDRLLLRAKSSGGDVDGGNNSNCRNTGSFTDEEEEVIVMGVSTEGSDAERMLCAMHVDTGKQRERGIVFLGTDCIDIRERWESVRYAASAYVKGRVTLLESQRTAREEFMGQVVGIRDGVLFVQWRLLKGGGSVLDRSVAVPIGDSTQVRELFVETKVLGVEPLQSPPSWRTPFRNDFAEERLKELQRAPFKREKYVSLIQGKYTPKVKKFGYTQHTTVDDFETKEYKDRLLSKQFFQNPQAFEVVPDRHERSVQFGGKWEYQRTHGLPTVDRNELENGWSEVEPISDGEMKVIEQALRDISGPRPGNFVKPPSKTKSLQLSESWWEPLGYGWEQHNEEQKALCDVTEQRLIDGSNLPFGGKLPPFGTSYGMGERMRAIIEDYSKGFGLGPHGHSPTHDTIHYNTLNAEGERVRDLGYTDALGRLFSEKLGDRDVHQWAVESCADGEADVRQLLLSLHEWRERGRPPSLLLANVLSKYLEEEIAAFNKGIPENAPKLQLETSDGTLAHSGSGGSRSGTMWADVDPTTFALHQASQSTRSTRCDEPFILHLVKRAKLGECVTNFTDTAYIAHLESSVINEFHLALTKLVGKGISPTLLAQKTGQLHRGSVRVGGNVVPFVKSRELSRLLERMGLTSESIAVITRGLANCPEQESVGDDFAVPVSLILSWDGPGSSGGSGSGDRRLNTTAARNEQQRKGSAALSSAIRQLSQHQKQRGSASGRGWQNEDKMVVHVLEELSLRNDGLVMDIQYAIRENKKNRRLRWEFVTTLLPVFGGNEAKVEQLYSDYCDGKYVPNITVAVEAFIAFLHNATQHPETYSASDYFDIDDGKSSTASGTGDQPSAGQYTTLKLLDPLEGPFVFDNVKVEYIQTVERFRRHGIRAGPVMAPATGFIAANCKSLNYFTRRQEEVVYVTNDSDQGLRRSLENSAYHKTIAANPALQYLLKARRGAALVETFNRFFYRTMPMLNFYQNVLKHYSETIQPLRQAAKSSTRGLARALESERSAAMEEFRRNSERYWRNIIEGRSVEQVVLANGESNRRTVGGGAGEQLQKPQGNRDAVRTNFTGEATDGSRKGQRQHQQPPLSQQQQHSRGEGERSAVHTSPRQGGSRGMADLLSKLNTPKTGSGVKGSTAKNTGNNRRGPKS 1788 T 12 DUF5053 pdbhh F Eukaryota T 6hiv 2 B DD Q385L8_TRYB2 ms51 MLRCTVVGHHRSGKARAFVFRDPSLRMMRAGSGYQQLRRMGMPMQVGMGWRKVDSFHANTQYQHAWPLLSHDDLGNSDQSNNTKNIMYSMYMPKRNKGTAPWFRGADTYSVKYCEQGRYEYQRYLMINRFPSEYKKHFLSFLSNIRMSSGSATIPQEALHWLLRMIVDNFNPQHVHYIAAMKTLQSAGELDMARDVWKIMERQQTWPCTATICAYLDVCVEAGEKTWAMEAWNRYCTELKFLEPGEVDPKPISRVPFSLTREELLYLPKWKKHFDHDPNLDVMDLNRFNRTREVYLRMAQVMLAGGERNAFQHFFTKLEEAMLNKPTPVPEPPNPHLVRRPRWAPYEHCKSVHHSPWRLQNNGRALALGPPVTIEDEMQSRFFSNDQFLVHSVKEVLRIVLQEHKRAHPTECTRCKTEAFFYKTKDADETLKFCDDLIERLFASLGVRLSNLNTSSLLSTILEVFRVVGKESGAALLQRANEFLERKASLGDAEGSRENLTASNYLQVLSGFADESAFVYNTKKDGTCQYKTGFDPRTTMRHLADVVQEIAGNPHVTWAADMHLQVVETMVGCGTMKANDYFVRNVLRQFSWDSRFLEALYVEYRRQDDVDMWAELTKRALVWTARYNAPASERLRRLIEDDYDTIRVQTRTFRELAVFQFRDVEERRHSRDVVNELPNPWYDYVAHALPFPDRDAGYPDEYGDLGQWRAPGGPGSPVRGPGYYAPPMEGEHQRGYTAEWRDLRNPMRPPEFPTPWERKYRQYARGQHPSYDMVYAGPMPEIFPMRRDFRKPTRWDFHDIEKQGKYRTSGPY 812 T 0.001 PPR_long pdbhh F Eukaryota T 6hiv 3 C DI Q587C2_TRYB2 ms56 MRKFCHFMANCWNSARSHATYGAVPLTHSQVTSVYATDGGKVDELGLLELVEERIFSWKLNKWEMRIPPNLPNDQKELIRQEQENLKQILSEWRKCFGALNADILQISSLTGVPKDVVREKNRTWLQEEVAKLRWMGEVNKAALLRDAFMRLEAFGSRDFMFMERLCCIYGLARQGTFDEAFTNYITEDPVTNDIFVDERNPFKELVAHIVRNYSQIDIIYDFLGFNYSEGYRSSLRRYMEYLQCKTAENVRASGRLVTGDKGEHNILFDYCVSRESLVSGDSCQGIIDFLYINGNDVTLIIIASDNPWLRNRQLPHRRQMEGIARRVCFVLGIPPSEVRIRNLLLPPTYLDKGSIVRLNDIVFRLSNEQSNLLIPWLTNYNKELDPKDVDYTALAKTTNEEEWLTL 407 T 0.2 ApoC-I pdbpercent F Eukaryota T 6hiv 4 D DL Q38BS2_TRYB2 ms59 MRCSCAFLDKSVFAAKRRVIVPIHPTPNFPAHFIKSAFTTDPLKEKQKARFSSGGEAMREVQDIPKNLEGERSRRDLASRGDTEFQALVEFIEGASYDQLISGRRFKKVYDVLSENDDMFIWLCHTAMSVLNPGDMRSRLIYNHLRILAESVASGEMTQRTAFRFFESAVRSPAYREIAKRQLEGGAATRLAGISAAADVMRRMGLTRRPMSSYFELYQRIVERSEAMTPWGFPPLFQFEERLSLEPRLKFFSRASQQSLERRRRGHVMTPHTTLHGRRIFWIPPTWNRAGRFLGPHVTLYPGMTPD 307 T 0.37 DUF2840 pdbpssm F Eukaryota T 6hiv 6 F DN Q38D60_TRYB2 ms61 MLCRTFLRQFRMSGGDMFVEYKVLSRDHRRSIRVEDAIVDPTFKRTVLPLGWLELLRSPSLRLPTGYFVEETVHVSLPNATSNGGKKEARPQKGGFASGSPSVGRNEANAIIAGPVVLYITGQSVPVVLNPYFVPEGTWDMRTRDGELDLRLGMDAIEQCTLFSELRPGGLLYGKLPENPNVRRNESLRATLGRYGMKCDLAESPLVPRPWTRMRYMFIDELQRGPKLTEFVGHNPRNGTPWRFSQNTKYFRIGIWRDTIRRNDMNEGLHAHSSWQKSPQQSVPEVRFLAPYP 293 T 5.9 AAA_11 pdbhh F Eukaryota T 6hiv 7 G DO Q383D1_TRYB2 ms62 MRRFCTARVSSYVKRAPLLLPRGGRRWRSGAGTTADGGGEKEYIADSFESTAGSNAYAAMELAAEARREIHELWLSAETALEREKRVQQVAALIEKYKLDPSTPREADVSRGLGDAFDRLLLLCLPLGKTDAKGTDNLERLMHLAGRNGRELSVRTIQHLFARTDSFAEALAVFYTMRRCHVAMNMEAYYSMLYSLQRLEEEGWGQHFRNEYEENGAPSEQAMDFIVKGISNALLPENKPWLGRVMFQDRNVPDRRYDTRDFDELDTAWTQRYKSGTPAGAH 282 T 0.084 ECSIT pdbhh F Eukaryota T 6hiv 8 H DP Q38F25_TRYB2 ms63 MLHGTPVRRASLRYRRPYWMMFLKGVDNWKIYTVIQQPDHQRTEMLYQAWLGGLDRPYTRPKCMANQPLWLSKKRHMLRKERLDGPETPLEKYVLEWHKKFHSFQGTERPTPDDLHTALDLVERPLDLSYALQLLGQCRNLNNIRFAKETFLVFLEACLRVGRRDCAEYALEHAEPLGFWFIDEDHRRYLQGEQTWYKLSPLDNLYYPVEENAKLNEGRKPITRLSPATESPGSGTAISDGEPSTDVEGETTVDDEIAQLEAELAALEREGGGK 274 T 0.19 MNE1 pdbpssm F Eukaryota T 6hiv 10 J DR C9ZPP1_TRYB9 ms65 MFSESLCILSRRFRYNTKFPALVSYNKLPWEVVNHETPQFHMHVAPHYEQLLTLAASSPVPHIIGSKHIDVPREHRLRLLPGMLYLLDGDTLPGEFTINRVLDPTALQYYGRLSSQIVTVEAVRMLVPDDLRLLCNCITFKGPLHLPVAPYASLASLRGASQGGTTGSETGSNCFTLYHFVRPNRPPKELQLEKYYIHAPCVAPLSEFASNSDERGNWRPRLQAPKRTRRATPLPAYRPPQSYLMGLAERLAVVPGGCFGRRSLMWGHWF 270 T 0.47 DUF3475 unp F Eukaryota T 6hiv 13 M DZ Q587C4_TRYB2 ms73 MLRRSILFRMKYADLELTTRGEFPHGMKEPAFVKKLDQNIPWYFSTYRSMYHWPITGDNWSDLNEAEKHHDLHMFYTLAWWKLGEGIFGVDEDS 94 T 9.7 Mastoparan_2 pdbhh F Eukaryota T 6hiv 14 N Da D0A3P2_TRYB9 ms74 MFSLTQTWLIAHWYCGHKFRHRFMRDKRFHPSLQASHDARNRFSKRRHFKTNRWNYQQAYRDMP 64 T 0.093 MLANA pdb F Eukaryota T 6hiv 15 O DB C9ZJE4_TRYB9 ms49 MIRRRVCDGARLAPNIRATFSAVRYQSGLHTFIRDSKPSNFSSVRRSENANGDATASPGATAGENPASSGDWASHMQRELFGEVDPLGGQAHKDYYRDVTRGYSPQYAPRNFANGGAVAYPHIQSPYEYEEAAHRRVWLDHDVDRMREEFTQHRASLRSLASAQEREELLRSRAAEYQVANTVHESESVHPIQQLYNSGGTSRSALKQQAVADRYSIAEQHSPLPLTTGVDRDALDEAQRTKDRILNDSFTAENLLITHGLREKEKHDFTILQRTVRIPFQGYDMDRFLAQQKGTPYGAQQLPPNVVPSSMEEAQRTLRGSSATATPLVDAVAQKVYARNTVVDRPAIGEQLTEQIINIMRASRTTAEQQREEERAQRFGLGRQGALVQDGGPDQRTLKKHTNDERIVDAMLFQQNAYRKTPTDEHWNPYIRRSTENGVGHLLQNKFDIMRREDRLSKGEQDLTERNTIHYGVPIQQIVDEFVFRHRNARGERPLDYFKPFPNFRALRLNRMYRDVEGFSLMKQRPEFLEWELFTRYRQHHQQRRRLALLHGLEPVANETAQERDTRRHRLDEICERTPFDEREMRVNDDEMRVSVETLRSWFGVYMLPSPTVVNAVLGGSASVNLHLYHLADEMGTADTREHVLSSRYLNRLLLLESYQNRVGRGFMNHVVGRAPEPVVPHEQPQEVLRHFSAEERAMYEQHVKEQTSRQLGEWERAMKRRRWLTDHQQYGHVVSHGLETSVVDLSHTETGAVLTVSTKAYEQEIEAVRMKTNATIKVDGMVYNLLPNSERRVVPLTVQLDSGEKIDMTSEDFDRCELEAFPRNLNHALNYGIANYAYNRGNYVETQDSIWEEQTASGQEGWSPATHADGLREGLPVRARRPIFSSSAEQRIAGGPQRAVIIQYHHQPFFNPEPRLVKVAFQCDGTIMEVPISDVMIWQRRYHGPERTVGDESRRYNPAAMRRYVDVTDPFNEKTSNTEHFLDKYEPKRNADTVADKYRTTKQITEIDKWTRYDSARADNYRPLSISHRRDYIRMGYIPRYTPWEWIAIQEADQPLIAEQIRQDNIGTSYFFSLNRYWRYKASPHGYIRHFENEVRDLLQYVDGVTPWKQAQKIRTYWEVRSHHPMPQFNRPEVAMHRNTVGLLPAHMWETDKKTGKVKSVKDSVRDYQTKTPYPKWVQL 1181 T 0.098 ACD unp F Eukaryota T 6hiv 16 P DC C9ZSK8_TRYB9 ms50 MFRVRRLAGFIQCSPCGKMGPLKQQTRRYTPIWKSDPAVDNVAPLRDEDERRALWAEVGPISDVGSAVTAWIRFGNDPVLHTAVPTMLGGKFRNQQREKESLLPNSSSPFAYVEDYMGTNLVFGSPVHAKESAAVWATYFERRYASRLRLSRRTVANYVGLINSPEVFDDESDRPETRWSQDTFFRECAYLSEKFLKEKVSNMQQFEAALKRASPEAYLAFFDAFQQQTQTQIPLPSPSVWHYEGERRKQWAEKFISISHKAQAFFKDVLSEDVKKYQEVPGKLLQKVKPVLADVGKILVKRHERWLKGRVWTSLTEEEREAYCMKEVKRQQMQVEDGEFDPMMEDDVDDTELEEWQREHDAIMKLMNSPIDGLHFTTLELWLHTMRCEELETEHIYTSARVRAIQVAARKKLYDTTSYEEVIQAVVESIARGTLDLGAGVLRPHFNEVWCQLNYAKFGSSTITQHTTTSRRQLLFFHAGSLKDIAATATLYYATKPLSNSLDYASPYKYRRSLITLCSNYGVETAYTTQRPLLRSAANLARAEDLIHAVVTAAAQPFGERRRAATRDLHMEFQRLAVPVERVIVANPVSALLESGADPDEKPVEGEKVNMWPLGAKRVVLYKWSAPNVEKLKAMESDAASAVSGSSLTAKRLREIQELKRRGFLEVSLWRRVTAQERKQRNEIVEAKKKQVEEVVRTVPSLAHLHQYATSLYSRIEERVAFPTETSTVTETTNMKEETNKPLEDSEWEFAVLLDDRVLLNKEESVELYLPYRDANGELLAQGEYRALVRAFDLEANPNLHPAYCSVGYSESFQVFDALPQLIAQFFRVKDATAEAAGVTHIPAADFTPFCAFLRDAGLDVPLRCEFEAGQAVTTDGDVYMDYFLQLLRGEAFHQSHAQAGLTEAQRAIEPLCRAHWVVHHPGADESEWATARRSVLDHAMQHEREWWFPNEMLDVKDVVTGSTNGLTPQMYPAAVRYGVELCTVLTAEGKFVDERGSGLSARCVVNGTGAAESVVFDTANCNGTNTTSVEDALRVAHGALRSAQDRHNTLAAFRLGPLSKQSQVLLFCGVNAYEFGGKYARTYAYAFEKAKKELEATAASGFMAPSLSHEDTERLSDQPTTSPSVDRFASTTHPEQRKAQFVPRVGPGSTPLEDPAADQKSEWS 1165 T 0.23 CT_C_D pdbpercent F Eukaryota T 6hiv 17 Q DE Q386Q7_TRYB2 mS52 MRRCSFASTCRYRSADDVPLGLGSAYVWTCRNITSRGQFNPIHNFSYAMERGVRARDVKAFEKLITNPGPLRVAYTPDYLDWLHRCYKAKGTYMDARAVAEKKFNGNIVSSELSAAVNRREGKRGDTRGDTVDNDHHNLPGAPPPGMFLRPAHSFRRLAGELKRRRAQSILDEVARAQGMLDLFERQPHFPAIHIDRCSRFHLVELFKEMVLERSLDSNMIWEKALLYRAILSERKPSYPTSFHYIFTAVEDTVFAPTISTPMEMAGSGGEGHDRSSHPLAAKCPTLEAYYYYVYLVKKYYIDNAVEAHVVLRCHREPNAADLLFSNPPPKDDTEIMKAVELLRNADIQRGVAAAAAVSDPTLPPGGEGSVIGNSDNXKNSEKXSEGSRGRPARPPVLPGAYPPIDMLWRCEENLPLLKVLLFGEFNLIVSENPFVKFPSAHGFLTRPYSTDSSRTLADGMSLANVMAEKRGHLLPSLPRNTATSIDARAQDIRRLQQKHHRDDIVSFQKLLRTHAEDSPSAFSSYSDWSYFNPRAVRAEERDRLTRKAVEALKLYDSATNDIYRHSFEDVQACHTQRVTERDRTMPPYLPTLPHFVAIIKKDPHISFLLHIGLPDRNSSEEGSAKHKELEKRIYYLARALYHTALEYHNETVRRVNRQKVNVAASLLDNFVEQEWTTILRDKHDVTDVTKTLNDTQNDKKQLARRLGRYMLFANRSLDDTGFPTDARADDYTRWMAPPSVGKVSL 746 T 7.6 Complex1_LYR_2 pdbpssm F Eukaryota T 6hiv 18 R DF Q38ET1_TRYB2 ms53 MSVSGVFSKGRGIGHEATTSILRYIPRARVPWQPSRFGRENLTAADMARLWSRGRYRDGPGNYNSGYCTERTHVLEENTVSIIPRRELEKYMPDITIGPKALVTPVSLMNARNGHRVTHDLLHSYDPHIGRLGKPAVVDHDNITVEDPNRVGLNAATLDCRGRIYRWLRRGPFFQVDNYFRRSVKLNRDGTLPTDFVHEAPLMRKIIRLAHRGHLKAACEEYRRVTTVPPVEVYRALTACCVPGAKLADAVSIFEDGDSKLFYVSRDGEVLHNLMRCAIAARHRARIMWVYNVMRGRFYENVVVRAEVDLIWRYRIAMIALEYLLDHECAEEAAAIYSYLVEEELLRCDVHVRVGLHMREAIAAGKPITLNNDVMNATSLVRDATAVAPEVARELQRRHAQTLQNNAVEAVGAENVREGSAPWSILGPLTAIGPTAEDTMVWLQQHYGDVDVMSIMRWARFRKGKDLMAKDRPQYLARAAAWIELLSKRNREMEEVPLTYMRKSKPLVLDTNSNVRVAWQTPLMRSGGPPRLLAREEGYVFHHSNSSRFVEETYRHPGESLQSRYLALQPLHTEVSAKEDFQRLYYQAQKHHKQQERLLPVSTAAIPSRIVHHSVMSALHGVSGKGEPANRSLFTXNKSNDSHSNTGVASGTSACTDGAGSATPEF 666 T 0.0009 PPR_long pdbhh F Eukaryota T 6hiv 21 U DJ Q584U8_TRYB2 ms57 MFRSTRPRSVGYTPVNPDTSPMVAYSQYHWHYNLPQGMERPHSVNRTFAAPFQSNHSLVNKYRGVWIEFDMHPAFSVALEPQLRKLPRGRTLPKTPAEEVIADYTALAPLVDDEKTRDLWLAKVFQHCAFQRCGGAMELWERYCHQRFTAEGATAKPPLSLVKSVLFYCNKTDNSGWRALFDRCLKDGWNYTPLFDTAQWSFMLKSIGRMGDEDGVRAVLEEMLDVQADLDRVEARSVVIALNAVTNADVYEFVKKYLFNFGERKVKFLRTTYSDLRGHGAGKLRIPLKENDNMYYHVCWHSSIRSPRQFSPRQLYFDYTPSTLGSSSHNPNAKIDDIVKDKIEKWKAEGLLPEDYVHEDRVYDRSAAFKNVARQEKWKKMPKILKSKRMGYTGDP 396 T 1.1 RPM2 pdbhh F Eukaryota T 6hiv 24 X DV Q57UZ6_TRYB2 ms69 MRRVRDGLFLSHPSSALQCSVAVVSTGEFDHPPFQFRQRHTFNTTPLHDANRFGGRTAYLREIGPVNIKKQGRRFKKDPRTVQFNVDVWCAQQTLRKRWKQRDWEVIEIPFRLVPREQQRVIPELYTDIPQMTDPARNDFSNIRNKVYDREELQGVLFPAAGAMLYPPLQRVDKQAMTLDKYL 183 T 0.13 CNRIP1 pdb F Eukaryota T 6hiv 25 Y DW D0A8P6_TRYB9 ms70 MLRFTHRALTATPERFSVLGTTHPKPKRTGFGRNNKMRSKPSDNVAWYDKGPVEWLPRPVRLTYDHLDQLQQWTMRATLDGRTEEFNRIRDLHREWSQHPLMPVLGDVEPKFPLNLFKQNHRAKKRFLVRWHKANTPANWLWMPRGPTVVTPLHRTNPTQYPENWKQMVQRKSGTGTPS 179 T 16 THDPS_M pdbhh F Eukaryota T 6hiv 26 Z DX Q383G5_TRYB2 ms71 MFHRAFVSSSDLTGCTIALSSVCTQKRYWAKPKKRPKVGQGFHEKAQKWREEYLLDRHRALADSLRAYVEFSTSKRVEPWDARFKPFDRIEKDGVYVLMRYMMEEKLQLCNYHHRPVKRLFCNIGLMGPQITTKARWKPYRFATNPAGTSKAERMYQRDKTVYTHGHND 169 T 0.18 Tox-MPTase5 pdbpercent F Eukaryota T 6hiv 27 AA DY Q57YD4_TRYB2 ms72 MLRSTLTSLNTFLTSSVATPPISVIRTGPKWWADPERMVRQKLMYFTLGVDQLPLRRTAVIQRDLQRFHMCKPPPRVGDSTGYKRSRAAQLNTWYRRIQYQEYHMQHLFTRHVWGLLRVYPGNTTKIQGKADDGYVGYDSVPFHRYNRAPLPFPARELYERRK 163 T 37 MBDa pdbhh F Eukaryota T 6hiv 28 BA CC E0A3K1_LEIAM uS3m MFLIHFVHYKTILQKYTFKFKHIFLSIDKYNSLFFNISGILIWLNIIHINIILIKYSFFILINNFEYLIILIST 74 T 0.022 Prophage_tail unppercent F Eukaryota T 6hiv 36 JA CN Q580I0_TRYB2 uS14m MFCFSRGLWMRHIGQDVPKRHTHFVLESRLMYEKSFRDCWLHSVCRAISQLDEPLSKTVVGTHQKMLQRKVTCFQYNQYGLFKTPYYRLANVDRYHAVQGVAGTREWVPYVNVSYWTMNKMVRGGNLLVHRVHYTGWGTDSHLKKGGWEHRWNKVLQRNVLQYSRI 166 T 12 DUF6462 pdbhh F Eukaryota T 6hiv 41 OA CS Q584T8_TRYB2 uS19m MAFRNTFTTPGKFSTVSKNIVLLLIWRVKVFLRAEGFAHSLVMLPVSLYSKILLCDVKKKIVYFHCCTRKKSMLRRCPCVWPDGPTKSSVSIGTAFTLQKRFLKIAKSAFGFYLARRGQRKYPFLRRPHIKNTHSMNPSAPYFWSFMTAKSQMAFLPEENYITGDWTGKFFVSKRQVYTLQHATSGAKVRVKSFPSIFEFNSPSRWNIGKEMNTLTKPRMDLIDEQMLTKKQRLDYVRAGLLPK 244 T 4.9 LIN52 unphh F Eukaryota T 6hiv 42 PA CU Q580M9_TRYB2 bS12m MLHTTRLWLGGYMMYHRKAMGTMKYSKWKGAHGGISHFYGRTPMVEEVRPNEPITLVDRRIMHYVHHSRLRHFQLFRSYQEKSNSTECKLREGEMLRRRWHRRLQKSFIAFMQFKTMKVLEDQARLVNTYGQAAVNAALGDPWNATDNVARERKSAAVRRQVRALPMVNVVPKHVATMKQIHNDRFNYRWRVN 193 T 2.6 HMD pdbhh F Eukaryota T 6hiv 43 QA CZ C9ZRZ4_TRYB9 mt-IF-3 MMQRCSTSLSRLCFRRLLRTPLLVYSIPPTRDVPSGIAHCPLSCSMRMVTSSNDDEFVFDPTLSIQKDAAIHTAKKSFETIVLEYVPAHAPEEARQKVKSYLTQHPIDILITQPKVQITHLEDAESGAETKVSLSPCDLPEALQQARERGMNLVQMGARGDVAYCRIRRESTRILSLIHTELEALREQEEKQQGKGRGGVQAAAKMGELIDHTFRDAVDAHFVGWRSKKIVEDIRRRHPVKLTIKEFQSPECAIGKLREMCQAMQHYAQEKVIYHHFTSIVANDREASITFAPALPMAKSDSWKHIKYPGEKEWTNALRRMEDACRKSGRYGTYAKSNKLKLRSLGQTSYRVDKYGRKMD 360 T 0.0011 mIF3 pdbhh F Eukaryota T 6hiv 44 RA Ca Q38DR3_TRYB2 mS22 MLRRAYIQRRYPFNKRGPREHKSWKHHVLTEPPKPLQWRDPKVWTRDLSVMKSFDAPQWDLWQSRPRSEDMDEALQPFMDMPKSLKDRRYDIPWWANPFGAWYLQNILSLELLKLKSKTNAEKIATYRSYMRSLASGKDNTMSDDDVIRNIIKERWKTLEFGDRNAGYPCTFGDYIQFLNEWFKSLDEEGMQRLREHFDRRIRPLLAVMSPVDILWLEALTQNSPHNKEQLQRKIAFQTSLGTPEFFDMSKRLRYEINEDYKVRDELGPELFALWSKAPERWPPERLSKMYGLDFTLVRKILVWHHFKACYDACVEPDWSLPKRLFALEWIRDVRARKHGLFYGKMRFAEQKITFYSDRFLFRDLVNRREASYANVWEMDDPYRFLQTEQDYEDYWGDNYDVYRRMFPEMIGRTGEPVQQYGQMPIWAGPHRQHANKSEHNWMFAEIGVNVGHEALKKLELDPTNEKRRRFVIRQPDGTLRSAKMSEMRAWYWKEEWADFRFWAPQMEWGIENTPSQEQYQEHVPDTTDADFRKQRRIQSRPVKWFYESHYTRTGSFAGFQPLRFMQRRTEREVRWPDVINAAVQIQKRKPAAYIFKAIPEL 602 T 0.26 MRP-S22 pdb F Eukaryota T 6hiv 45 SA Cb Q57VB2_TRYB2 mS23 MRQCVVRRYKMPKNMGVAPRFDTWNEKYEPWEHMKRMGRLVGTGFYIPPEWYNHFRMFPPINHNFQQEKTLNPHNASEPTQDDTSTLSPERVALRDELARKSRLVASEGMRYYNIFWVRKPLDTMEKEYYELKRRGVDHGEAIRKVLQGFYSGLAVKKRVAAIQAEEAKLTGRFITMREATVVLGVLAKLHKEQLTPHQVSLLAKEQGETTQSGAKLTAIVSRTQPHVNKEASSPATSEAVGSSTEESLSADALASMLSEDGEQSAVGTRYQVEVKETANDSVRQLREKAEDQTGFPDWYTGESPTYSGTSCSRDGFALMKANK 324 T 0.026 Herpes_ICP4_N unppercent F Eukaryota T 6hiv 46 TA Cd Q38DK6_TRYB2 mS26 MMRSSSFCRRQIRPYYNLPSKSEHGRKMTGFLTPYRHWMWKQNELWRNVHEAQFEHLRRVYKRQWLESFRVNADEYIYKYNITKAAQLAQWECEMKEQEKKRIEARQMMDGRQALKKKHLDLLREFHERQFFFWYERASERLQNMNLINYVPHAQLREHIDKELDKYVAGKNEPYPLNFVGQMPFLEDGDGNIVEVPESLLSNHMAEHPDSTAKPHEPHTSSSISEAAAFEERMLRAMVSAKEEDLKEWLGDDSRALSETIDDISREEEEREADIRVARSMEETDAEREVSRRAYIERGKTGSRSIFRPPTVSEGAGGTPSAPAGDANTPMRRRKKGKLDRVHALQAHQDELLAKLSSQGLKEGVDASSVPERGKIVQSRGRIRDKAVIPTHEVLMQKPELAAGSTPGARIQTKDMVDKMYHRGKYKKSGSGDKSDGEDL 440 T 0.028 ThylakoidFormat pdbpssm F Eukaryota T 6hiv 48 VA Ci Q57WW0_TRYB2 mS33 MVLRRWFPLLGVHRVGYTHPSTLPVPCAQRWDLRLARARIFQEYIEEKAPGAWQLEDERSMSPEFKTFTGYPMRDMRPGYGQNLPDYIMKKRLPNNTHYELFARRDIPNEDNAMYGKYLYDMTVHGTSLPSTYRMHKDINKAQRNDRKLSGNRFKVLCSSGAKNPPSGWEPIPDATEEEED 181 T 1.1 Nmad3 pdbhh F Eukaryota T 6hiv 51 YA Cm Q38C96_TRYB2 mS37 MKSSDIFFAYRLTPVVFKSRQHDSGVNQYGLKPTNAYDYINPTNLINFGRGTTFDNLGVRRAGRGEIDSSPSHSGSPVFTQAKLIGLSGEEQLTMCQSETMALRLCMAKAGKETCERESRALDSCLGRVGHLRRAMSEACWEFNDWFIQNVSDNHTKPFQHRPHDWRHFYAQEKLVRERQQNGHAYGRRPKQFSFGARYVKTDGYGKRPRLPYNK 215 T 0.054 Gypsy pdbpercent F Eukaryota T 6hiv 52 ZA Cn Q57VQ9_TRYB2 mS38 MRAGCVACSRIPKLGIAALGSTTTSMQATCQETQELRCATTQMAPAVIFPLPPPLRGSYIDRKPTAASFNEHHATASFRHHMIASADVSHRPRHFYFASGTRNDTGTRSVKEGERKQLKQQTNVEVDSVAYGDQLDELSARWAAKFYGQVTFGPRNYPYPSSRWLARRFQMKKHRIIKRFRFRRYKLAAVANLPFAKMIRVGMLPELKSSKTKRGDVVDPTLSGQLVSAVKNTGEKRKGQRTRPKSKYQV 250 T 28 DUF2996 pdbhh F Eukaryota T 6hiv 58 FB UO UNK XXXXX 5 F F F 6hiv 59 GB,WE UP,UW UNK XXXXXXX 7 F F F 6hiv 60 HB UQ UNK XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 32 F F F 6hiv 61 IB,VE,XE UR,UV,UX UNK XXXXXXXX 8 F F F 6hiv 62 JB US UNK XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 54 F F F 6hiv 63 KB UT UNK XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 44 F F F 6hiv 68 PB A4 Q38AB0_TRYB2 bL31m MVLHKWAVVSRSAPPPRGLRPIARTIPTHPRLRPVDYKIPYVLRTFIKDRHTSEVQHLENRGMFAEELSIERSRFPRFHSTFTIQTDGSLNEREFEFAVPPIVTLFHDRLSAHRERQLELAKIGKLRKERNWETEQKGEESVSMACNALAFPYCIPKNMLKRSRVVDPLNSKSSTQGVTSGGG 183 T 12 RNA_pol_L pdbhh F Eukaryota T 6hiv 69 QB A5 Q584F4_TRYB2 bL32m MFRRTFFTPMIAQPTLLMLGNKGGTPKRKKNPMQLRRKVYGLHFKEKYLKMEEWYYCPLCAEPKKPGEWCRREDCRQIKP 80 T 0.089 HVO_2753_ZBP pdbhh F Eukaryota T 6hiv 71 SB A8 D0A1K1_TRYB9 bL35m MGSEESNNICAYKRTISLAKIYIVLLVKTAMLRYSRLCFPKMGCEEITRKARRVQLQPTEYLAQHRMQVWQLRFKEMGPPFSRVWVALGGKMRRRRVGRQVDVKDMRYYWRPIEPQYQRLYMSRLRIRDHSNKLRQPMRLRATNADIGSGSSSIEWERASNRKYGAMLAPPKRQDFEFRVV 181 T 0.13 Cytochrom_B559a pdbpssm F Eukaryota T 6hiv 76 XB AJ C9ZPD1_TRYB9 uL10m MSPASPLPVAALSRLRITHRSFLTRSRGRHVCRSAVGVEYRPEQQKKVLDHSYARVINAEVVHGDEQKFWGERRTFYTQRNIFFPMWDRCAQALILITREVPRVPQEMAFRLMAVFLKLMLLPRLMMNTELMLPMWIASNAEGAMAAAKDGSKGKEQSSKQQGESKDDAKKEGDNTK 177 T 12 DUF5783 pdbhh F Eukaryota T 6hiv 82 DC AT Q4GZ98_TRYB2 bL19m MGYTRERTNRHFFVARANAFFSRLPIARVQRSLAMEAVREGRMRPWKYTKEQILGAPVTCNFEYNPRPVRLIGTVMDAHTEETSIKGGLKVYARNEETNMMLWIPPGNPKLRHEVTSTKGSFQHYLDERDKWDEAWITGRARMK 144 T 3.8 DUF2760 pdbhh F Eukaryota T 6hiv 83 EC AU Q383R2_TRYB2 bL20m MLRRTVCVQHYRAKLELDRIRSMLRGRARLERKVGLKRLFFLMRTQTRYRVEQQAHWERAIVRKNVDSAAREHGTGWQHLRNELGRQNVMLLPRSQQLLAQYEPLAFRAVVELCASRIPPPPPPVVASVPEESYTLWPPASHDNSECASTDGSDAPHGQQQSLSHPAARVELRCGVERVLRRGPSGLGNNVNELIDAWKEFDVSPLRKGEVNK 213 T 7.7E-05 Ribosomal_L20 pdbhh F Eukaryota T 6hiv 90 LC Ae D0A8I6_TRYB9 mL41 MLRCSCACRRGVYHNAPSVYPFVKPFHDTPYDQDRGRHDSVGQRYRKNSWPEWMDNGADGTGYGIGLHRTHPLSRLRGNLKRSPSHVPRVLGMMIQGVWHKSGVKLYFRGGKPPNPSVHPYLTGEPCPLYGWKVTDESVIRQFNMPSIDGTNFRYKPYVALQERKIMGVQVPSDSVSLASGRTTESKPLAKRLFFWR 197 T 1.9 MRP-L27 pdbhh F Eukaryota T 6hiv 91 MC Af Q383S1_TRYB2 mL42 MLRLCRVSLRVQSHQKKRAQHPNAGTRFGRVYNRGFIRYGFGGFGMSVYTPKKDRRFRVQPLPSLHANSLADDTPLVTTTRTLSPNFRAFALQDGGVFFTHPSHEQVMRVGQNILAEETKATGMTSMDTYVNSRIQSIIAENTVENVALSHWRRRHMWNLVRTHGKLQRHWGVSDATKGARSNIYGRPS 189 T 0.12 Toxin_10 pdb F Eukaryota T 6hiv 95 QC Ao Q385V2_TRYB2 mL52 MRLHTVRAPIITRAAMRGYSEARSNYDGTSLPAWPAPGKKPTYPAALSELRLPQPRMRKTRTEWMYYHGHGGCPGKYGPSREIADFEYADGTPASISGRRFAFKHHQDHLLVQLIRAAATVERYDASGLLPRIPGTAEQRNWDPAIPLFLDDVDEQGRPAPLRTAGDAPGTMVSHVCSRVVDERMGTPTHTPNELANRHEGETLEANTMFATNDPSAFVSDTVKLRDDKRPYWSRRRWALTDKFLVPKSPKPKNTIKDE 259 T 0.00017 MRPL52 unppssm F Eukaryota T 6hiv 96 RC Ap Q57YA9_TRYB2 mL53 MLNPPKHYSVESLRTVGLLPAQLALSRKPRLRPHVGNLKGLVYPLPYYAMWRGNHNKYTYNKSTVCLWGEGDTRSMYHQHYAHAKCPTDYGRGGREFEYLTVKRGKMLQKPLPRVQYVAEGSKPVWLFKSWHTPLSSPSMWEREVQYAEHTPEHIGAKRPLAVVAPRTMHRYLFLMHMEKVTITVSPLLFGYGHTIQKAVLDFYRRAISARSPFPKDKVFLFYAIDHITPRIEVTWLDGTSYVPPVLEGASSQDLIQMVMEEAWLAADRMAAEGRVLNPLAIDDYKWDQLVVFKKVRDKEASKGGGRKK 309 T 5.5 MRP_L53 unphh F Eukaryota T 6hiv 97 SC At C9ZU82_TRYB9 mL63 MLRHCTAHRRYRTAWRELLHPLPVRARKMEWLKRDAVEENEEILRRPYYTIKSYALPPAVGRQESIHNSNNIRGGMHSSHSLDLIMRQPRRVKTPEQLRALRDRLRFIGVTGPMPQATSVSTKSYTDTYGSRLRPRYPESWDTVPPHQPSRELL 154 T 20 DUF4113 pdbhh F Eukaryota T 6hiv 99 UC AB bL12m AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 56 T 10000 DUF4699 pdbhh F F 6hiv 100 VC,WC AC,AD bL12m AAAAAAAAAAAAAAAAAAAAAAAAAAAA 28 T 1200 DUF4699 pdbhh F F 6hiv 101 XC AG bL12m AAAAAAAAAAAAAAAAAAAAAAAAAAA 27 T 1100 DUF4699 pdbhh F F 6hiv 103 ZC BA D0A5V6_TRYB9 mL67 MLRRFALTSSVALRLRFERDSGHNTVRYRPIPESMQPKHLEDNFTPFPLPKFDESLEYGPVRLRNIPDIEAAKERRRGSRLAATEVLLQETLQEENQFATSGKGDGNMAIAITERHTEDVTTPAADSRFPSQTMSPCSHEEEMRGYVVSRDYPLIDRLHCTRSIEELVAQFEDRPQIESRVAALADMASTVSFRSDEELLRMFTAISAPFSVDGRGLNFLTVKVSKFGRPYYVPNSLLPAYVNLVDATTIALVREQPWRLSASPALFIQVLQFMALIKVFEPNKWFTFSDHAPSNRADYRHAIGVNHSTAFWGTGEELYDFMVELLRVEDDGRIPTMLDLCTREQMVDLLSGFCGVMPCGKAVGDVFKTITDAFLRRVRNDISGPWSAHDWAIVERMYLVTVLCDAGNNEILQLLLSDTASPRGPDFFAAVSRTKDTPTKKRALCLLQEAIDNASAKADKVTLLGLLESGSEFLLSLVDKGVAHTFATQNLFDYRILNSFLHCSLVADRLRVEQSVITSLIPSSLRDVQVQMLMSNERNALNPLTSSLPGNSGAIATAPKLKRPLMTMLSQLEYLNSIDSVFILHSSLMATSTDQLVSAVRRLPSGKDSLIVTMSCLRALSVKSLTSPSMKERIACARALEIVSYELEKGRAVLLPFSEEILLHDAGAYCDEDLMLWTVAAFLARELPLVKVHTLMHSNCTARTPYRFLKGGHNLLVSSRSLYDKGAPLLSSLHSKELRLVTHNVRLRTPVRDRKCTLQYYNPIRARFVYRRDKPLFDKYHVTARNLAPGFSRGALKHDWRALGVYTPDHPQVPYHPLQTWMLGETTVEAE 831 T 0.047 DUF5642 pdb F Eukaryota T 6hiv 106 CD BD C9ZR91_TRYB9 mL70 MCLKRKAPHLFCFCLWSIFLSFRCFCFRSYAIMLPLLSFPTIQISIFLSFKLPITTFLLSPCFVFVFVFAIRYCGELTLNAQLVLFLLYHCAQTQRGPLKEGEMPICPGLCGELAAVPFRVFLGTLPTLAVEERFLRQLQPVFAWYSSRKRVKEQANEFIEIDLASCDAELLLRYSHIYYVRRQLFDELIERQMTLLDSGKAPKMAEPSLLQCLAGCNMTIADRLQLEIRQLGAAKRAASVPGRRELDPVARLEVYDYACMMRLVEEDAGAVGDAEMKARAYLPREVIESKLGHLTQLLLGSDARAALDKKDVKLLNRMIPPDYTRVGCVEKLRPFDVTAYFRFYGERINNVKVENYFKRALWGHVYRRFATTPSFLSGVSTYWARHSGLDASFTTTTMPQEVAVAVCDQQIQFPAIKFRAQYVYTSPETARQLWRTDAAVPLMRLFPLMGSRTAEDLAAGVLTDAFWMHLGLSEEENLLQDSLLLKVRRFVDEVGDMYETNIDSVLKRVDDNFKQVVPQLKAEDLQVDAPLQDGEGEDTVRETVAA 547 T 0.26 MGAT2 unppssm F Eukaryota T 6hiv 107 DD BE Q57WG1_TRYB2 mL71 MLFRSVSCKNYQRGGWSPGSKHQKHMTLNPTLYLYRFPGPHGPGPYTMKYWWTLGCFPTGMEVPFRLHEFLSTYQQEHVPVEVEEWLRCYIKDPLSELVNASNDFFKAVEVYPEVESARGYKTLQPSIAPLLVPMKKFEEQLGVKISPVGLRSVLSNPVLKDRFLDDLFDYKSYVEKGGSTPHRRLARSRFEGSLSVLGECEKCLPEQHQVEISESLGTFIGATVSPAETTADDERSLILLLTTISEGCINAGNYSDAASVLADALMFCHDPDSQATTHANISFASLLNADFKGAEYNGREAALLQPQVKPTSTACARGYVGWAAAAAYQDDFEKAEAIVKDGLTLYVGNEHLEKLANKLQALREEQPSVYKQVPRSLRESRSHLPSQQSRGLLSGSGKGFSNEFDWVEFKNKLYPSKMDPRNNEMGSVFRRVGDLGSFISTSRSMERL 449 T 2.4E-05 TPR_21 pdbhh F Eukaryota T 6hiv 109 FD BG Q57Y49_TRYB2 mL73 MRLARTLHHVASATTGGGQMLEGLVNDGPEVAHKQHASFTPFSIQPWQARCVGASRRKLLPQMLMYHGARLGPRPLIILDHSTKGEAGVAEAARKYESILSQLSWDYGAVYIPLHAQCTDSSKDLLEQSCQRICAVMDALDVRWTHFLTYSYGALVAVRMASSQEFPHRVGTLMSLDTPLVTREFLRNMEQREDIAKAERDINVPEDGLAFAKQALLSSLEGPLPCPAAEDESLYRDYLFDPNRIFGAGGLVRDESRYVPLKSLLGVRHPVQLIVPSANPLSDAAAHSEVFGHRRPAVVKCCQRHEDLFKESAAKEVAGVLGAWMRRFEPDCFISKRYEQAANEMGQLMLSTAQVSSESAGKGGGEPRKKKEKKKSKA 378 T 7.3E-10 Ndr pdbhh F Eukaryota T 6hiv 110 GD BH Q38AM5_TRYB2 mL74 MLVPGLSLTRRAVTSSCCRPLHVVRGFSTTCTLFGLEQLQDVPTSTSRRPTGLHRGPGKRQTSEREAAQYKFIRRWELQMRDEWDQLEPFKGLPKPKRQFGNEAAEVIWPYALLLERVVKVHPFTKSIYVYYAQRQSTARGKLAAEIARSFAREFLIPITFHNSQVYTEAEMLLEYSETPWVVLHSLDNGQKPRILPVAPVEGTPAHTAVEQLLAEVVQGCEALGASVADPVTATRVLNERPLQNQYVRVDYQWFGDTPDERASHLVRWEFEPEQIEPKIRHRTRHVLDWLNYDGNLPTHRAVHVNAMREKARQKAPRTVAGPRTFYNSAGSRANARSSRFGGQAAVGK 349 T 0.023 L51_S25_CI-B8 pdbhh F Eukaryota T 6hiv 112 ID BJ Q383M2_TRYB2 mL76 MLRLSSWNLKSQHHNVLRRSRPHIHKYRELNRWQRQAQGISKWDQSHSHRPLPYVERFNPESVGLTRGTSAFAWKWWHTQYPWLPNVPPEAAQIDEAQKQERRSHRPPAWDDEFAKVVLNMNDAEIREYLMSKLTDVIFLETQRDGYELRRLDFEGKPLTSLPEPRIIENFVLEEETIRERVIYQVVEGVFRLSPTSADRRELRSVANIIDYVLTHVRAARPTDRERRQERPITSAALAVMQKCPIQPQLGFVHALPHDTRDALLQEWERMHHLDWQFGKAVYTPRSKENVRGNLTWLREDRHYDQRMKFMQEVESGEARAKHMKLIAEAAGN 333 T 1.8 KIX pdbpssm F Eukaryota T 6hiv 114 KD BL D0A4P5_TRYB9 mL78 MGASAGLIRRGGGVFPDAVSLTLTPSRRVYGGSGRGDLLYENPDARRHSGRALGVLNGVRHSSQATMPESGQLYYRKLILHSRPPNGSCAGLQRHCHDTCNWSYLIPSLHRCAESAISAKLWEKMCQLGLEDRSKAWVNLTQYERQRVRDGQNLYRYEVHQRLPLLEESIGWAQLDDLLGWFRSARRAWVRLPTSVTLSRESSEAGVASVVSPSSAMSCRLEGHADSRDTTPGRNQVFDTPERVEQLTEATVHRIREELQRLNRSERSDCEGSAAMRASARRLARDEELSRCVEEELGWHGVALQHRIPVPK 312 T 0.091 CAF1-p150_C2 pdb F Eukaryota T 6hiv 115 LD BM Q381N5_TRYB2 mL79 MRPTFPALGSRAKGYENRVMVYAHRRHRAWYLPPKLAHARSPLANKSPDEYGNTWDPRTGVEWYHRLRRRGAYRHWPWARWNDDPVRQHQELSCRRTFSAAVTGANEGVPLWNYYAEVGQEYGLPSHFPLSFMAPFIHQYTSRAWSRKEIERHLKVVEERTGLRTIQQACDATSELLEWGEEEMGVVPHGLLQHVVMLAEDIVLQNKKKAYRKAAHERGILRTTTMERYYALPHLRTGPPMPTTLEQPSGEFPWGKFSTMVGGTRIHPLYRPDGFFKDNMYPA 283 T 0.055 POTRA_2 pdbpssm F Eukaryota T 6hiv 116 MD BN Q585A3_TRYB2 mL80 MCCLYTSDVFFWSSLVVSPLPRIVRCVSAPQIRSIPCGSGDFSVMKRSLIARWQSGVHTPHGVVYRGAKMKNWPEQRIPENFKFTEEQRFRTKAIPRDVGTIPRNFVLGVLYRHQPCEVGGLWEHCTNDPEIVLDSKRHLREVLKQAREEGFVTFERDAISNEWLCFLTRERYEEVQRIVTAKSEAVDSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSHARRLEEEVANTTRYLRRFQQREIDYLPYTDLNGKVNFMWWYETRDVQQRADEVLTDSSSSKALAGEEEHKAGSQLEATTASTS 302 T 0.004 CCDC106 pdb F Eukaryota T 6hiv 117 ND BO D0A755_TRYB9 mL81 MKRLFPSAGVSVVLTSSSIVMSCPCNHIFTSRRAYYWPYNEDFVPEGAETSRFQSSGSPGTRRRVLQEYALSPLFGARVPCCVVGTLRTAKEIIVLKRDIQSLLCELMELPQGSVTLGPLQQLREMMLYRHGTPLSPRLGDERQLNMDAYARRPIAARTMMDVFLAEDMSLDEIVNFGRLSLGKLQIALNNLRKENAEKSSADCEDGSAVEPAQLLVDRMGISYCEIPSLDESDYVFAEVGGVAVTDEEAERVAQRWAERCE 262 T 10 DUF1382 unphh F Eukaryota T 6hiv 118 OD BP D0A0S6_TRYB9 mL82 MRRCAACLTSLPVSSPTGAAVGAPAAPLKTQAPSNRSLLGYPLRRAAAMEMLYGGICIQHLAQPPFPLRKIQSESLPPPSLQGERDDLELEVKDSTGNVMGYRLFPVNIGIRARTESVRVRSEDCYKRFLAQKHCAAAGVPLQFPAPSSITNSNCLATPRAASHFHPPSSSLSLFTRPADSQGGDVGRTTPADVAAYHPRAWRPYQMLKPMPHNWGPAVRSSGVRGPHMQLLQERIDKKGFGWKRKSRSLWQQDISTAGFRPKRYF 266 T 5.2 BH4 pdbhh F Eukaryota T 6hiv 120 QD BR Q586A6_TRYB2 mL84 MLRFTRLFREMAPELQLEYMPVLFTRTILGPQGGFAGEERLVKLEVARKYMEAGHAVTPTEELRRGLWCYNPDTDKYDCFIERNEEFLDFAARKRQWLDVYWRVNTGYLLFGRQSWGQGFLINCPLRKRDVAQKLWEQYKVRIDPRLIEFREKDRRTGIQELGHNWCWLYLPGAEELGINREVYDNKRVKVRIHVRKMNSMFALY 205 T 2.2 Ribosomal_L9_C pdbhh F Eukaryota T 6hiv 121 RD BS Q38FG8_TRYB2 mL85 MRRLGVFCRGQRNSPLLTRGIATGGRVTNEDRRWWLVHLECAPDITPGTFIAWLDCCGTHTCKKLIERNIWTIEQVAALDSDQVDELKYREGCLKMDVVWEHARTIITPLRQREVTGGVESELQGRIMELRKKRELERRREEILKERANVSEQREETLRK 160 T 0.048 RNase_Y_N unp F Eukaryota T 6hiv 122 SD BT C9ZPU8_TRYB9 mL86 MRRCIPARGGFTMKYKKGTGLWDEDHVNDYKTNRYLSARATMRWYQEMERHQTRNSLNARRATQSHNNNRGLHHTGRGAFERELERRGVQVEKYPLTTTTGTMRVAELVILRRMELEKRAEEALAEQRGELQKKNPTPSEWYDESKGPLNPNFLRSMRSHYEVDIANLPDTPLIRGQREFFIGEERGNGAA 191 T 3 DUF2663 pdbhh F Eukaryota T 6hiv 123 TD BU D0A7Z9_TRYB9 mL87 MLNPTFSLYRKTLQSYPVPPKIRHYDRRWSGSRTNPYNRQYWRVIMNENYSRPSFWVSDFRHRYLMRTGTDYQGQVPSSPQPGLYQGFSDVHKLLANHPKPQRESRHLPVLPMTPRIVFEHANEKRIDTAKRMRRDRRRIEELKTLEFWGWYMKLQRVRGRWCREQGVSSRGVYGPAVDAAELWG 185 T 1.3 Crl pdbhh F Eukaryota T 6hiv 124 UD BV C9ZUW3_TRYB9 mL88 MLQINAFKLVRATPFLLKRTGKPADTPDYKQVYLPYDAAPTERELERERRRFKQAYHGRMEHRKLVEVKEVPLNVYTYGKEGMSLPIAIFKDQKDPVIGPEWTYPGIYENKIAAQHWYTEELFDKESKEAFESPWQQQILDNQVKRRMAKVMFRMRQVNMKAVDLFQKERGSSRRSGGAGEKGKDGGGKK 190 T 2.5 Ribosomal_L37 pdbhh F Eukaryota T 6hiv 125 VD BW Q57WW5_TRYB2 mL89 MSGAFGCGSYRSVVAGTQNVPRRMTFYPSAYELIQLHKAHREVIRHFYVRDKIFDNKFPGNALANGLFKFVPNRRENYHMRELMESIRRRSIWMHRIKQQREINAKVVENMEVKYGKKAAASMLCFTTPDSNAYFAPHRYQDVANSWPNYWQHPSVNHVVPKPRWRRHRELGGITRVEDPFAVQASDY 188 T 24 Babuvirus_MP pdbhh F Eukaryota T 6hiv 127 XD BY Q580U5_TRYB2 mS91 MLFTRCLLAVTTINSSTASAAGRLIRIRKKSKWIDRRSKRIPHNGKDVWQFGEQPSCALCHVRFRFKQDYEAHKESELHQNRLRWVETMKWWEEIGEPHHQQHAASEWEWFRQRVLPAKAAAMGLSEEDAARELRRAVMHETPRWYSRIQPPNARSEIKEPRDQRWPSSPKW 172 T 0.0021 zf-met pdb F Eukaryota T 6hiv 129 ZD Ba D0A4T0_TRYB9 mL93 FKQGQGMFSHQLKRLLQKKSIHRYNWDPLPMYDPRKLVHASRHMDVETWREVPDPHWDERSYLVPDQMFYNIPVPPEYKDAYWWRELQARRVQCPVEWVSHRMYNKGDRQRYDFQDLAFRKKFEFSYEEVVKNAKDMRS 139 T 7.9 Pox_VP8_L4R unphh F Eukaryota T 6hiv 131 BE Bc Q389K3_TRYB2 mL95 MFRPTTAIADSFKEHYHRVHLPRRLALQRYARQQSLRNAAKGNVKAEEVPYKYNRWWVNEEHEFVHQYAFVEDPEVTKAKRETLPPVTRENIWKEPQQTFFLPFAPYVRVVDYPKDPDAKFLKPVNIPRWKDYMQRTKPVIPRTWY 146 T 1.8 DUF4653 pdbhh F Eukaryota T 6hiv 132 CE Bd Q381W9_TRYB2 mL96 MNDIYARRLAQATMFHQLMRCHGTLWAATQVTKEQMDYNFIREEFMRVNGRRAMPLLLGAAANENLHQLHLSHLSEHCAWGESARALAVQRQTPLSQRVAALGRMAETIHQVKTASTVQNLFNEQISCMEGISSFEEEPLIEGE 144 T 0.083 CCP_MauG pdb F Eukaryota T 6hiv 133 DE Be Q388L8_TRYB2 mL97 MSSRFFQKYFIRCGNCQTIQRYAKGYKPIPNPILFDSDAHCRSYHRERRDCTGLTGTLVTCRCDKCARVHSHWTVMDFQEFLDAKLVMTPEERTALLWPGAGSRAEPSSGTSN 113 T 0.15 Myticin-prepro pdb F Eukaryota T 6hiv 134 EE Bf Q388M2_TRYB2 mL98 MVLRGVRLRSVAVSCYGSSLTAATRCLSVRTEDFFSKEAISHARRVSWAPHTTEKKQGAFAKLARSNFGDPLPSSFAQEPYFEEEIEAHRKHHRPDVYIYKYNVSPTHFSLRE 113 T 6.1 DUF2975 pdbhh F Eukaryota T 6hiv 135 FE Bg Q587H8_TRYB2 mL99 MYQRTRFLWSSWRDYPLGSRDRRGRFNMDEAAAALQLNPAYAAALYRPLNYTFHIRGQLYPAQKGRPSRPGSLAASQGRMFPLYQRNDRLDKELFRLNSRGLTTE 105 T 1.5 Tenui_NS4 pdbhh F Eukaryota T 6hiv 136 GE Bh A0A1G4HYZ0_TRYEQ mL100 MALFSCFRCGYMYEFAVSNSYCRKLTLRNDHCPRCDQLTLFRFMSVSGMVGNMPFKPIGVPGPSYATLWWRKTREGKEASAPLDAVCKSDRW 92 T 0.004 DUF1178 unphh F Eukaryota T 6hiv 137 HE UA UNK XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 46 F F F 6hiv 138 IE UB UNK XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 40 F F F 6hiv 139 JE,OE UC,UH UNK XXXXXXXXXXXX 12 F F F 6hiv 140 KE UD UNK XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 177 F F F 6hiv 141 LE UE UNK XXXXXXXXXXXXXXXXXXXXXX 22 F F F 6hiv 142 ME,NE,TE UF,UG,UN UNK XXXXXXXXXXXXXXXXXXXXXXXX 24 F F F 6hiv 143 PE UI UNK XXXXXXXXXXXXXXXXX 17 F F F 6hiv 144 QE UK UNK XXXXXXXXXX 10 F F F 6hiv 145 RE UL UNK XXXXXXXXXXXXXXX 15 F F F 6hiv 146 SE UM UNK XXXXXX 6 F F F 6hiv 147 UE UU UNK XXXXXXXXXXX 11 F F F 6hiw 1 A DA Q57UJ2_TRYB2 mS48 MMRRVASTGSNGGHCYVGSSLWADSVYMEVSSITLQKRFSFKYATKLQHDEMRQPYYIHEKRYGIFSNERNIAKARRGLPFITPLYTKHMNLWDTDTDASNNRFFRGYYYGQRELHQLLGRPHSLEATNADGSNDLSTYEANTSQLYKGIPRPAITNLHYEPAWRYTLYQAGAHGAQLSNPRSPFTAKVLGDELMQVRDIKSVEHCKAWFDRLQYLINLHYEAVGDIGEFKSRHTRHVHEFFVAFHDALSSLDFRDTYLFDQFKAVRPPELSDLFGIFLEMEANYVHEDYCPRCSLPYSTTRYCGEGDVDTPFRKHRGRWAPHQKWGREWYDVVARRAEALWYRATEDPYFGTPQHTQRQAEALLKVYVQAKQRGKAIDFMNKLRGSVEYLTGDITITTVMQESYDALLDTTPHPHLLTNGFTLESDAAKYTGEVNKAPLSPLQFRIDMEMNKYRRQQKEEGVVRVPPALWRLDTSAIVPYKVDSKTRRIVNWREVKEGIEKSFLSVGLPKEAYTGNEWREMLYLHDVIANREAKAAVLEQQQKLDREKLKARERASLKGEKSSNSASNGIGIIFPDKDGYQVFFVDEASLRPFGIGASGTLFKAVARVYPSPAAVPYDDPVHGKQFLIDTTDETCHLFGGFEHGDRLLLRAKSSGGDVDGGNNSNCRNTGSFTDEEEEVIVMGVSTEGSDAERMLCAMHVDTGKQRERGIVFLGTDCIDIRERWESVRYAASAYVKGRVTLLESQRTAREEFMGQVVGIRDGVLFVQWRLLKGGGSVLDRSVAVPIGDSTQVRELFVETKVLGVEPLQSPPSWRTPFRNDFAEERLKELQRAPFKREKYVSLIQGKYTPKVKKFGYTQHTTVDDFETKEYKDRLLSKQFFQNPQAFEVVPDRHERSVQFGGKWEYQRTHGLPTVDRNELENGWSEVEPISDGEMKVIEQALRDISGPRPGNFVKPPSKTKSLQLSESWWEPLGYGWEQHNEEQKALCDVTEQRLIDGSNLPFGGKLPPFGTSYGMGERMRAIIEDYSKGFGLGPHGHSPTHDTIHYNTLNAEGERVRDLGYTDALGRLFSEKLGDRDVHQWAVESCADGEADVRQLLLSLHEWRERGRPPSLLLANVLSKYLEEEIAAFNKGIPENAPKLQLETSDGTLAHSGSGGSRSGTMWADVDPTTFALHQASQSTRSTRCDEPFILHLVKRAKLGECVTNFTDTAYIAHLESSVINEFHLALTKLVGKGISPTLLAQKTGQLHRGSVRVGGNVVPFVKSRELSRLLERMGLTSESIAVITRGLANCPEQESVGDDFAVPVSLILSWDGPGSSGGSGSGDRRLNTTAARNEQQRKGSAALSSAIRQLSQHQKQRGSASGRGWQNEDKMVVHVLEELSLRNDGLVMDIQYAIRENKKNRRLRWEFVTTLLPVFGGNEAKVEQLYSDYCDGKYVPNITVAVEAFIAFLHNATQHPETYSASDYFDIDDGKSSTASGTGDQPSAGQYTTLKLLDPLEGPFVFDNVKVEYIQTVERFRRHGIRAGPVMAPATGFIAANCKSLNYFTRRQEEVVYVTNDSDQGLRRSLENSAYHKTIAANPALQYLLKARRGAALVETFNRFFYRTMPMLNFYQNVLKHYSETIQPLRQAAKSSTRGLARALESERSAAMEEFRRNSERYWRNIIEGRSVEQVVLANGESNRRTVGGGAGEQLQKPQGNRDAVRTNFTGEATDGSRKGQRQHQQPPLSQQQQHSRGEGERSAVHTSPRQGGSRGMADLLSKLNTPKTGSGVKGSTAKNTGNNRRGPKS 1788 T 12 DUF5053 pdbhh F Eukaryota T 6hiw 2 B DD Q385L8_TRYB2 mS51 MLRCTVVGHHRSGKARAFVFRDPSLRMMRAGSGYQQLRRMGMPMQVGMGWRKVDSFHANTQYQHAWPLLSHDDLGNSDQSNNTKNIMYSMYMPKRNKGTAPWFRGADTYSVKYCEQGRYEYQRYLMINRFPSEYKKHFLSFLSNIRMSSGSATIPQEALHWLLRMIVDNFNPQHVHYIAAMKTLQSAGELDMARDVWKIMERQQTWPCTATICAYLDVCVEAGEKTWAMEAWNRYCTELKFLEPGEVDPKPISRVPFSLTREELLYLPKWKKHFDHDPNLDVMDLNRFNRTREVYLRMAQVMLAGGERNAFQHFFTKLEEAMLNKPTPVPEPPNPHLVRRPRWAPYEHCKSVHHSPWRLQNNGRALALGPPVTIEDEMQSRFFSNDQFLVHSVKEVLRIVLQEHKRAHPTECTRCKTEAFFYKTKDADETLKFCDDLIERLFASLGVRLSNLNTSSLLSTILEVFRVVGKESGAALLQRANEFLERKASLGDAEGSRENLTASNYLQVLSGFADESAFVYNTKKDGTCQYKTGFDPRTTMRHLADVVQEIAGNPHVTWAADMHLQVVETMVGCGTMKANDYFVRNVLRQFSWDSRFLEALYVEYRRQDDVDMWAELTKRALVWTARYNAPASERLRRLIEDDYDTIRVQTRTFRELAVFQFRDVEERRHSRDVVNELPNPWYDYVAHALPFPDRDAGYPDEYGDLGQWRAPGGPGSPVRGPGYYAPPMEGEHQRGYTAEWRDLRNPMRPPEFPTPWERKYRQYARGQHPSYDMVYAGPMPEIFPMRRDFRKPTRWDFHDIEKQGKYRTSGPY 812 T 0.001 PPR_long pdbhh F Eukaryota T 6hiw 3 C DI Q587C2_TRYB2 mS56 MRKFCHFMANCWNSARSHATYGAVPLTHSQVTSVYATDGGKVDELGLLELVEERIFSWKLNKWEMRIPPNLPNDQKELIRQEQENLKQILSEWRKCFGALNADILQISSLTGVPKDVVREKNRTWLQEEVAKLRWMGEVNKAALLRDAFMRLEAFGSRDFMFMERLCCIYGLARQGTFDEAFTNYITEDPVTNDIFVDERNPFKELVAHIVRNYSQIDIIYDFLGFNYSEGYRSSLRRYMEYLQCKTAENVRASGRLVTGDKGEHNILFDYCVSRESLVSGDSCQGIIDFLYINGNDVTLIIIASDNPWLRNRQLPHRRQMEGIARRVCFVLGIPPSEVRIRNLLLPPTYLDKGSIVRLNDIVFRLSNEQSNLLIPWLTNYNKELDPKDVDYTALAKTTNEEEWLTL 407 T 0.2 ApoC-I pdbpercent F Eukaryota T 6hiw 4 D DL Q38BS2_TRYB2 mS59 MRCSCAFLDKSVFAAKRRVIVPIHPTPNFPAHFIKSAFTTDPLKEKQKARFSSGGEAMREVQDIPKNLEGERSRRDLASRGDTEFQALVEFIEGASYDQLISGRRFKKVYDVLSENDDMFIWLCHTAMSVLNPGDMRSRLIYNHLRILAESVASGEMTQRTAFRFFESAVRSPAYREIAKRQLEGGAATRLAGISAAADVMRRMGLTRRPMSSYFELYQRIVERSEAMTPWGFPPLFQFEERLSLEPRLKFFSRASQQSLERRRRGHVMTPHTTLHGRRIFWIPPTWNRAGRFLGPHVTLYPGMTPD 307 T 0.37 DUF2840 pdbpssm F Eukaryota T 6hiw 6 F DN Q38D60_TRYB2 mS61 MLCRTFLRQFRMSGGDMFVEYKVLSRDHRRSIRVEDAIVDPTFKRTVLPLGWLELLRSPSLRLPTGYFVEETVHVSLPNATSNGGKKEARPQKGGFASGSPSVGRNEANAIIAGPVVLYITGQSVPVVLNPYFVPEGTWDMRTRDGELDLRLGMDAIEQCTLFSELRPGGLLYGKLPENPNVRRNESLRATLGRYGMKCDLAESPLVPRPWTRMRYMFIDELQRGPKLTEFVGHNPRNGTPWRFSQNTKYFRIGIWRDTIRRNDMNEGLHAHSSWQKSPQQSVPEVRFLAPYP 293 T 5.9 AAA_11 pdbhh F Eukaryota T 6hiw 7 G DO Q383D1_TRYB2 mS62 MRRFCTARVSSYVKRAPLLLPRGGRRWRSGAGTTADGGGEKEYIADSFESTAGSNAYAAMELAAEARREIHELWLSAETALEREKRVQQVAALIEKYKLDPSTPREADVSRGLGDAFDRLLLLCLPLGKTDAKGTDNLERLMHLAGRNGRELSVRTIQHLFARTDSFAEALAVFYTMRRCHVAMNMEAYYSMLYSLQRLEEEGWGQHFRNEYEENGAPSEQAMDFIVKGISNALLPENKPWLGRVMFQDRNVPDRRYDTRDFDELDTAWTQRYKSGTPAGAH 282 T 0.084 ECSIT pdbhh F Eukaryota T 6hiw 8 H DP Q38F25_TRYB2 mS63 MLHGTPVRRASLRYRRPYWMMFLKGVDNWKIYTVIQQPDHQRTEMLYQAWLGGLDRPYTRPKCMANQPLWLSKKRHMLRKERLDGPETPLEKYVLEWHKKFHSFQGTERPTPDDLHTALDLVERPLDLSYALQLLGQCRNLNNIRFAKETFLVFLEACLRVGRRDCAEYALEHAEPLGFWFIDEDHRRYLQGEQTWYKLSPLDNLYYPVEENAKLNEGRKPITRLSPATESPGSGTAISDGEPSTDVEGETTVDDEIAQLEAELAALEREGGGK 274 T 0.19 MNE1 pdbpssm F Eukaryota T 6hiw 10 J DR Q57UA2_TRYB2 mS65 MFSESLCILSRRFRYNTKFPALVSYNKLPWEVVNHETPQFHMHVAPHYEQLLTLAASSPVPHIIGSKHIDVPREHRLRLLPGMLYLLDGDTLPGEFTINRVLDPTALQYYGRLSSQIVTVEAVRMLVPDDLRLLCNCITFKGPLHLPVAPYASLASLRGASQGGTTGSETGSNCFTLYHFVRPNRPPKELQLEKYYIHAPCVAPLSEFASNSDERGNWRPRLQAPKRTRRATPLPAYRPPQSYLMGLAERLAVVPGGCFGRRSLMWGHWF 270 T 0.47 DUF3475 unp F Eukaryota T 6hiw 13 M DZ Q587C4_TRYB2 mS73 MLRRSILFRMKYADLELTTRGEFPHGMKEPAFVKKLDQNIPWYFSTYRSMYHWPITGDNWSDLNEAEKHHDLHMFYTLAWWKLGEGIFGVDEDS 94 T 9.7 Mastoparan_2 pdbhh F Eukaryota T 6hiw 14 N Da mS74 MFSLTQTWLIAHWYCGHKFRHRFMRDKRFHPSLQASHDARNRFSKRRHFKTNRWNYQQAYRDMP 64 T 0.093 MLANA pdb F T 6hiw 15 O DB Q586P5_TRYB2 mS49 MIRRRVCDGARLAPNIRATFSAVRYQSGLHTFIRDSKPSNFSSVRRSENANGDATASPGATAGENPASSGDWASHMQRELFGEVDPLGGQAHKDYYRDVTRGYSPQYAPRNFANGGAVAYPHIQSPYEYEEAAHRRVWLDHDVDRMREEFTQHRASLRSLASAQEREELLRSRAAEYQVANTVHESESVHPIQQLYNSGGTSRSALKQQAVADRYSIAEQHSPLPLTTGVDRDALDEAQRTKDRILNDSFTAENLLITHGLREKEKHDFTILQRTVRIPFQGYDMDRFLAQQKGTPYGAQQLPPNVVPSSMEEAQRTLRGSSATATPLVDAVAQKVYARNTVVDRPAIGEQLTEQIINIMRASRTTAEQQREEERAQRFGLGRQGALVQDGGPDQRTLKKHTNDERIVDAMLFQQNAYRKTPTDEHWNPYIRRSTENGVGHLLQNKFDIMRREDRLSKGEQDLTERNTIHYGVPIQQIVDEFVFRHRNARGERPLDYFKPFPNFRALRLNRMYRDVEGFSLMKQRPEFLEWELFTRYRQHHQQRRRLALLHGLEPVANETAQERDTRRHRLDEICERTPFDEREMRVNDDEMRVSVETLRSWFGVYMLPSPTVVNAVLGGSASVNLHLYHLADEMGTADTREHVLSSRYLNRLLLLESYQNRVGRGFMNHVVGRAPEPVVPHEQPQEVLRHFSAEERAMYEQHVKEQTSRQLGEWERAMKRRRWLTDHQQYGHVVSHGLETSVVDLSHTETGAVLTVSTKAYEQEIEAVRMKTNATIKVDGMVYNLLPNSERRVVPLTVQLDSGEKIDMTSEDFDRCELEAFPRNLNHALNYGIANYAYNRGNYVETQDSIWEEQTASGQEGWSPATHADGLREGLPVRARRPIFSSSAEQRIAGGPQRAVIIQYHHQPFFNPEPRLVKVAFQCDGTIMEVPISDVMIWQRRYHGPERTVGDESRRYNPAAMRRYVDVTDPFNEKTSNTEHFLDKYEPKRNADTVADKYRTTKQITEIDKWTRYDSARADNYRPLSISHRRDYIRMGYIPRYTPWEWIAIQEADQPLIAEQIRQDNIGTSYFFSLNRYWRYKASPHGYIRHFENEVRDLLQYVDGVTPWKQAQKIRTYWEVRSHHPMPQFNRPEVAMHRNTVGLLPAHMWETDKKTGKVKSVKDSVRDYQTKTPYPKWVQL 1181 T 0.098 ACD unp F Eukaryota T 6hiw 16 P DC Q57YB5_TRYB2 mS50 MFRVRRLAGFIQCSPCGKMGPLKQQTRRYTPIWKSDPAVDNVAPLRDEDERRALWAEVGPISDVGSAVTAWIRFGNDPVLHTAVPTMLGGKFRNQQREKESLLPNSSSPFAYVEDYMGTNLVFGSPVHAKESAAVWATYFERRYASRLRLSRRTVANYVGLINSPEVFDDESDRPETRWSQDTFFRECAYLSEKFLKEKVSNMQQFEAALKRASPEAYLAFFDAFQQQTQTQIPLPSPSVWHYEGERRKQWAEKFISISHKAQAFFKDVLSEDVKKYQEVPGKLLQKVKPVLADVGKILVKRHERWLKGRVWTSLTEEEREAYCMKEVKRQQMQVEDGEFDPMMEDDVDDTELEEWQREHDAIMKLMNSPIDGLHFTTLELWLHTMRCEELETEHIYTSARVRAIQVAARKKLYDTTSYEEVIQAVVESIARGTLDLGAGVLRPHFNEVWCQLNYAKFGSSTITQHTTTSRRQLLFFHAGSLKDIAATATLYYATKPLSNSLDYASPYKYRRSLITLCSNYGVETAYTTQRPLLRSAANLARAEDLIHAVVTAAAQPFGERRRAATRDLHMEFQRLAVPVERVIVANPVSALLESGADPDEKPVEGEKVNMWPLGAKRVVLYKWSAPNVEKLKAMESDAASAVSGSSLTAKRLREIQELKRRGFLEVSLWRRVTAQERKQRNEIVEAKKKQVEEVVRTVPSLAHLHQYATSLYSRIEERVAFPTETSTVTETTNMKEETNKPLEDSEWEFAVLLDDRVLLNKEESVELYLPYRDANGELLAQGEYRALVRAFDLEANPNLHPAYCSVGYSESFQVFDALPQLIAQFFRVKDATAEAAGVTHIPAADFTPFCAFLRDAGLDVPLRCEFEAGQAVTTDGDVYMDYFLQLLRGEAFHQSHAQAGLTEAQRAIEPLCRAHWVVHHPGADESEWATARRSVLDHAMQHEREWWFPNEMLDVKDVVTGSTNGLTPQMYPAAVRYGVELCTVLTAEGKFVDERGSGLSARCVVNGTGAAESVVFDTANCNGTNTTSVEDALRVAHGALRSAQDRHNTLAAFRLGPLSKQSQVLLFCGVNAYEFGGKYARTYAYAFEKAKKELEATAASGFMAPSLSHEDTERLSDQPTTSPSVDRFASTTHPEQRKAQFVPRVGPGSTPLEDPAADQKSEWS 1165 T 0.23 CT_C_D pdbpercent F Eukaryota T 6hiw 17 Q DE Q386Q7_TRYB2 mS52 MRRCSFASTCRYRSADDVPLGLGSAYVWTCRNITSRGQFNPIHNFSYAMERGVRARDVKAFEKLITNPGPLRVAYTPDYLDWLHRCYKAKGTYMDARAVAEKKFNGNIVSSELSAAVNRREGKRGDTRGDTVDNDHHNLPGAPPPGMFLRPAHSFRRLAGELKRRRAQSILDEVARAQGMLDLFERQPHFPAIHIDRCSRFHLVELFKEMVLERSLDSNMIWEKALLYRAILSERKPSYPTSFHYIFTAVEDTVFAPTISTPMEMAGSGGEGHDRSSHPLAAKCPTLEAYYYYVYLVKKYYIDNAVEAHVVLRCHREPNAADLLFSNPPPKDDTEIMKAVELLRNADIQRGVAAAAAVSDPTLPPGGEGSVIGNSDNKKNSEKTSEGSRGRPARPPVLPGAYPPIDMLWRCEENLPLLKVLLFGEFNLIVSENPFVKFPSAHGFLTRPYSTDSSRTLADGMSLANVMAEKRGHLLPSLPRNTATSIDARAQDIRRLQQKHHRDDIVSFQKLLRTTHAEDSPSAFSSYSDWSYFNPRAVRAEERDRLTRKAVEALKLYDSATNDIYRHSFEDVQACHTQRVTERDRTMPPYLPTLPHFVAIIKKDPHISFLLHIGLPDRNSSEEGSAKHKELEKRIYYLARALYHTALEYHNETVRRVNRQKVNVAASLLDNFVEQEWTTILRDKHDVTDVTKTLNDTQNDKKQLARRLGRYMLFANRSLDDTGFPTDARADDYTRWMAPPSVGKVSL 747 T 7.7 Complex1_LYR_2 unppssm F Eukaryota T 6hiw 18 R DF Q38ET1_TRYB2 mS53 MSVSGVFSKGRGIGHEATTSILRYIPRARVPWQPSRFGRENLTAADMARLWSRGRYRDGPGNYNSGYCTERTHVLEENTVSIIPRRELEKYMPDITIGPKALVTPVSLMNARNGHRVTHDLLHSYDPHIGRLGKPAVVDHDNITVEDPNRVGLNAATLDCRGRIYRWLRRGPFFQVDNYFRRSVKLNRDGTLPTDFVHEAPLMRKIIRLAHRGHLKAACEEYRRVTTVPPVEVYRALTACCVPGAKLADAVSIFEDGDSKLFYVSRDGEVLHNLMRCAIAARHRARIMWVYNVMRGRFYENVVVRAEVDLIWRYRIAMIALEYLLDHECAEEAAAIYSYLVEEELLRCDVHVRVGLHMREAIAAGKPITLNNDVMNATSLVRDATAVAPEVARELQRRHAQTLQNNAVEAVGAENVREGSAPWSILGPLTAIGPTAEDTMVWLQQHYGDVDVMSIMRWARFRKGKDLMAKDRPQYLARAAAWIELLSKRNREMEEVPLTYMRKSKPLVLDTNSNVRVAWQTPLMRSGGPPRLLAREEGYVFHHSNSSRFVEETYRHPGESLQSRYLALQPLHTEVSAKEDFQRLYYQAQKHHKQQERLLPVSTAAIPSRIVHHSVMSALHGVSGKGEPANRSLFTXNKSNDSHSNTGVASGTSACTDGAGSATPEF 666 T 0.0009 PPR_long pdbhh F Eukaryota T 6hiw 21 U DJ Q584U8_TRYB2 mS57 MFRSTRPRSVGYTPVNPDTSPMVAYSQYHWHYNLPQGMERPHSVNRTFAAPFQSNHSLVNKYRGVWIEFDMHPAFSVALEPQLRKLPRGRTLPKTPAEEVIADYTALAPLVDDEKTRDLWLAKVFQHCAFQRCGGAMELWERYCHQRFTAEGATAKPPLSLVKSVLFYCNKTDNSGWRALFDRCLKDGWNYTPLFDTAQWSFMLKSIGRMGDEDGVRAVLEEMLDVQADLDRVEARSVVIALNAVTNADVYEFVKKYLFNFGERKVKFLRTTYSDLRGHGAGKLRIPLKENDNMYYHVCWHSSIRSPRQFSPRQLYFDYTPSTLGSSSHNPNAKIDDIVKDKIEKWKAEGLLPEDYVHEDRVYDRSAAFKNVARQEKWKKMPKILKSKRMGYTGDP 396 T 1.1 RPM2 pdbhh F Eukaryota T 6hiw 24 X DV Q57UZ6_TRYB2 mS69 MRRVRDGLFLSHPSSALQCSVAVVSTGEFDHPPFQFRQRHTFNTTPLHDANRFGGRTAYLREIGPVNIKKQGRRFKKDPRTVQFNVDVWCAQQTLRKRWKQRDWEVIEIPFRLVPREQQRVIPELYTDIPQMTDPARNDFSNIRNKVYDREELQGVLFPAAGAMLYPPLQRVDKQAMTLDKYL 183 T 0.13 CNRIP1 pdb F Eukaryota T 6hiw 25 Y DW Q383N9_TRYB2 mS70 MLRFTHRALTATPERFSVLGTTHPKPKRTGFGRNNKMRSKPSDNVAWYDKGPVEWLPRPVRLTYDHLDQLQQWTMRATLDGRTEEFNRIRDLHREWSQHPLMPVLGDVEPKFPLNLFKQNHRAKKRFLVRWHKANTPANWLWMPRGPTVVTPLHRTNPTQYPENWKQMVQRKSGTGTPS 179 T 16 THDPS_M pdbhh F Eukaryota T 6hiw 26 Z DX Q383G5_TRYB2 mS71 MFHRAFVSSSDLTGCTIALSSVCTQKRYWAKPKKRPKVGQGFHEKAQKWREEYLLDRHRALADSLRAYVEFSTSKRVEPWDARFKPFDRIEKDGVYVLMRYMMEEKLQLCNYHHRPVKRLFCNIGLMGPQITTKARWKPYRFATNPAGTSKAERMYQRDKTVYTHGHND 169 T 0.18 Tox-MPTase5 pdbpercent F Eukaryota T 6hiw 27 AA DY Q57YD4_TRYB2 mS72 MLRSTLTSLNTFLTSSVATPPISVIRTGPKWWADPERMVRQKLMYFTLGVDQLPLRRTAVIQRDLQRFHMCKPPPRVGDSTGYKRSRAAQLNTWYRRIQYQEYHMQHLFTRHVWGLLRVYPGNTTKIQGKADDGYVGYDSVPFHRYNRAPLPFPARELYERRK 163 T 37 MBDa pdbhh F Eukaryota T 6hiw 28 BA CC uS3m MFLIHFVHYKTILQKYTFKFKHIFLSIDKYNSLFFNISGILIWLNIIHINIILIKYSFFILINNFEYLIILIST 74 T 0.022 Prophage_tail unppercent F T 6hiw 36 JA CN Q580I0_TRYB2 uS14m MFCFSRGLWMRHIGQDVPKRHTHFVLESRLMYEKSFRDCWLHSVCRAISQLDEPLSKTVVGTHQKMLQRKVTCFQYNQYGLFKTPYYRLANVDRYHAVQGVAGTREWVPYVNVSYWTMNKMVRGGNLLVHRVHYTGWGTDSHLKKGGWEHRWNKVLQRNVLQYSRI 166 T 12 DUF6462 pdbhh F Eukaryota T 6hiw 41 OA CS Q584T8_TRYB2 uS19m MAFRNTFTTPGKFSTVSKNIVLLLIWRVKVFLRAEGFAHSLVMLPVSLYSKILLCDVKKKIVYFHCCTRKKSMLRRCPCVWPDGPTKSSVSIGTAFTLQKRFLKIAKSAFGFYLARRGQRKYPFLRRPHIKNTHSMNPSAPYFWSFMTAKSQMAFLPEENYITGDWTGKFFVSKRQVYTLQHATSGAKVRVKSFPSIFEFNSPSRWNIGKEMNTLTKPRMDLIDEQMLTKKQRLDYVRAGLLPK 244 T 4.9 LIN52 unphh F Eukaryota T 6hiw 42 PA CU Q580M9_TRYB2 bS21m MLHTTRLWLGGYMMYHRKAMGTMKYSKWKGAHGGISHFYGRTPMVEEVRPNEPITLVDRRIMHYVHHSRLRHFQLFRSYQEKSNSTECKLREGEMLRRRWHRRLQKSFIAFMQFKTMKVLEDQARLVNTYGQAAVNAALGDPWNATDNVARERKSAAVRRQVRALPMVNVVPKHVATMKQIHNDRFNYRWRVN 193 T 2.6 HMD pdbhh F Eukaryota T 6hiw 43 QA CZ Q57WU2_TRYB2 mt-IF-3 MMQRCSTSLSRLCFRRLLRTPLLVYSIPPTRDVPSGIAHCPLSCSMRMVTSSNDDEFVFDPTLSIQKDAAIHTAKKSFETIVLEYVPAHAPEEARQKVKSYLTQHPIDILITQPKVQITHLEDAESGAETKVSLSPCDLPEALQQARERGMNLVQMGARGDVAYCRIRRESTRILSLIHTELEALREQEEKQQGKGRGGVQAAAKMGELIDHTFRDAVDAHFVGWRSKKIVEDIRRRHPVKLTIKEFQSPECAIGKLREMCQAMQHYAQEKVIYHHFTSIVANDREASITFAPALPMAKSDSWKHIKYPGEKEWTNALRRMEDACRKSGRYGTYAKSNKLKLRSLGQTSYRVDKYGRKMD 360 T 0.0011 mIF3 pdbhh F Eukaryota T 6hiw 44 RA Ca Q38DR3_TRYB2 mS22 MLRRAYIQRRYPFNKRGPREHKSWKHHVLTEPPKPLQWRDPKVWTRDLSVMKSFDAPQWDLWQSRPRSEDMDEALQPFMDMPKSLKDRRYDIPWWANPFGAWYLQNILSLELLKLKSKTNAEKIATYRSYMRSLASGKDNTMSDDDVIRNIIKERWKTLEFGDRNAGYPCTFGDYIQFLNEWFKSLDEEGMQRLREHFDRRIRPLLAVMSPVDILWLEALTQNSPHNKEQLQRKIAFQTSLGTPEFFDMSKRLRYEINEDYKVRDELGPELFALWSKAPERWPPERLSKMYGLDFTLVRKILVWHHFKACYDACVEPDWSLPKRLFALEWIRDVRARKHGLFYGKMRFAEQKITFYSDRFLFRDLVNRREASYANVWEMDDPYRFLQTEQDYEDYWGDNYDVYRRMFPEMIGRTGEPVQQYGQMPIWAGPHRQHANKSEHNWMFAEIGVNVGHEALKKLELDPTNEKRRRFVIRQPDGTLRSAKMSEMRAWYWKEEWADFRFWAPQMEWGIENTPSQEQYQEHVPDTTDADFRKQRRIQSRPVKWFYESHYTRTGSFAGFQPLRFMQRRTEREVRWPDVINAAVQIQKRKPAAYIFKAIPEL 602 T 0.26 MRP-S22 pdb F Eukaryota T 6hiw 45 SA Cb Q57VB2_TRYB2 mS23 MRQCVVRRYKMPKNMGVAPRFDTWNEKYEPWEHMKRMGRLVGTGFYIPPEWYNHFRMFPPINHNFQQEKTLNPHNASEPTQDDTSTLSPERVALRDELARKSRLVASEGMRYYNIFWVRKPLDTMEKEYYELKRRGVDHGEAIRKVLQGFYSGLAVKKRVAAIQAEEAKLTGRFITMREATVVLGVLAKLHKEQLTPHQVSLLAKEQGETTQSGAKLTAIVSRTQPHVNKEASSPATSEAVGSSTEESLSADALASMLSEDGEQSAVGTRYQVEVKETANDSVRQLREKAEDQTGFPDWYTGESPTYSGTSACSRDGFALMKANK 325 T 0.026 Herpes_ICP4_N unppercent F Eukaryota T 6hiw 46 TA Cd Q38DK6_TRYB2 mS26 MMRSSSFCRRQIRPYYNLPSKSEHGRKMTGFLTPYRHWMWKQNELWRNVHEAQFEHLRRVYKRQWLESFRVNADEYIYKYNITKAAQLAQWECEMKEQEKKRIEARQMMDGRQALKKKHLDLLREFHERQFFFWYERASERLQNMNLINYVPHAQLREHIDKELDKYVAGKNEPYPLNFVGQMPFLEDGDGNIVEVPESLLSNHMAEHPDSTAKPHEPHTSSSISEAAAFEERMLRAMVSAKEEDLKEWLGDDSRALSETIDDISREEEEREADIRVARSMEETDAEREVSRRAYIERXKTGSRSIFRPPTVSEGAGGTPSAPAGDANTPMRRRKKGKLDRVHALQAHQDELLAKLSSQGLKEGVDASSVPERGKIVQSRGRIRDKAVIPTHEVLMQKPELAAGSTPGARIQTKDMVDKMYHRGKYKKSGSGDKSDGEDL 440 T 0.028 ThylakoidFormat pdbpssm F Eukaryota T 6hiw 48 VA Ci Q57WW0_TRYB2 mS33 MVLRRWFPLLGVHRVGYTHPSTLPVPCAQRWDLRLARARIFQEYIEEKAPGAWQLEDERSMSPEFKTFTGYPMRDMRPGYGQNLPDYIMKKRLPNNTHYELFARRDIPNEDNAMYGKYLYDMTVHGTSLPSTYRMHKDINKAQRNDRKLSGNRFKVLCSSGAKNPPSGWEPIPDATEEEED 181 T 1.1 Nmad3 pdbhh F Eukaryota T 6hiw 51 YA Cm Q38C96_TRYB2 mS37 MKSSDIFFAYRLTPVVFKSRQHDSGVNQYGLKPTNAYDYINPTNLINFGRGTTFDNLGVRRAGRGEIDSSPSHSGSPVFTQAKLIGLSGEEQLTMCQSETMALRLCMAKAGKETCERESRALDSCLGRVGHLRRAMSEACWEFNDWFIQNVSDNHTKPFQHRPHDWRHFYAQEKLVRERQQNGHAYGRRPKQFSFGARYVKTDGYGKRPRLPYNK 215 T 0.054 Gypsy pdbpercent F Eukaryota T 6hiw 52 ZA Cn Q57VQ9_TRYB2 mS38 MRAGCVACSRIPKLGIAALGSTTTSMQATCQETQELRCATTQMAPAVIFPLPPPLRGSYIDRKPTAASFNEHHATASFRHHMIASADVSHRPRHFYFASGTRNDTGTRSVKEGERKQLKQQTNVEVDSVAYGDQLDELSARWAAKFYGQVTFGPRNYPYPSSRWLARRFQMKKHRIIKRFRFRRYKLAAVANLPFAKMIRVGMLPELKSSKTKRGDVVDPTLSGQLVSAVKNTGEKRKGQRTRPKSKYQV 250 T 28 DUF2996 pdbhh F Eukaryota T 6hiw 58 FB UO Unknown Protein XXXXX 5 F F F 6hiw 59 GB UP Unknown Protein XXXXXXX 7 F F F 6hiw 60 HB UQ Unknown Protein XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 32 F F F 6hiw 61 IB UR Unknown Protein XXXXXXXX 8 F F F 6hiw 62 JB US Unknown Protein XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 54 F F F 6hiw 63 KB UT Unknown Protein XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 44 F F F 6hix 5 E A4 Q38AB0_TRYB2 bl31m MVLHKWAVVSRSAPPPRGLRPIARTIPTHPRLRPVDYKIPYVLRTFIKDRHTSEVQHLENRGMFAEELSIERSRFPRFHSTFTIQTDGSLNEREFEFAVPPIVTLFHDRLSAHRERQLELAKIGKLRKERNWETEQKGEESVSMACNALAFPYCIPKNMLKRSRVVDPLNSKSSTQGVTSGGG 183 T 12 RNA_pol_L pdbhh F Eukaryota T 6hix 6 F A5 Q584F4_TRYB2 bl32m MFRRTFFTPMIAQPTLLMLGNKGGTPKRKKNPMQLRRKVYGLHFKEKYLKMEEWYYCPLCAEPKKPGEWCRREDCRQIKP 80 T 0.089 HVO_2753_ZBP pdbhh F Eukaryota T 6hix 8 H A8 D0A1K1_TRYB9 bl35m MGSEESNNICAYKRTISLAKIYIVLLVKTAMLRYSRLCFPKMGCEEITRKARRVQLQPTEYLAQHRMQVWQLRFKEMGPPFSRVWVALGGKMRRRRVGRQVDVKDMRYYWRPIEPQYQRLYMSRLRIRDHSNKLRQPMRLRATNADIGSGSSSIEWERASNRKYGAMLAPPKRQDFEFRVV 181 T 0.13 Cytochrom_B559a pdbpssm F Eukaryota T 6hix 13 M AJ C9ZPD1_TRYB9 ul10m MSPASPLPVAALSRLRITHRSFLTRSRGRHVCRSAVGVEYRPEQQKKVLDHSYARVINAEVVHGDEQKFWGERRTFYTQRNIFFPMWDRCAQALILITREVPRVPQEMAFRLMAVFLKLMLLPRLMMNTELMLPMWIASNAEGAMAAAKDGSKGKEQSSKQQGESKDDAKKEGDNTK 177 T 12 DUF5783 pdbhh F Eukaryota T 6hix 19 S AT Q4GZ98_TRYB2 bl19m MGYTRERTNRHFFVARANAFFSRLPIARVQRSLAMEAVREGRMRPWKYTKEQILGAPVTCNFEYNPRPVRLIGTVMDAHTEETSIKGGLKVYARNEETNMMLWIPPGNPKLRHEVTSTKGSFQHYLDERDKWDEAWITGRARMK 144 T 3.8 DUF2760 pdbhh F Eukaryota T 6hix 27 AA Ae D0A8I6_TRYB9 ml41 MLRCSCACRRGVYHNAPSVYPFVKPFHDTPYDQDRGRHDSVGQRYRKNSWPEWMDNGADGTGYGIGLHRTHPLSRLRGNLKRSPSHVPRVLGMMIQGVWHKSGVKLYFRGGKPPNPSVHPYLTGEPCPLYGWKVTDESVIRQFNMPSIDGTNFRYKPYVALQERKIMGVQVPSDSVSLASGRTTESKPLAKRLFFWR 197 T 1.9 MRP-L27 pdbhh F Eukaryota T 6hix 28 BA Af Q383S1_TRYB2 ml42 MLRLCRVSLRVQSHQKKRAQHPNAGTRFGRVYNRGFIRYGFGGFGMSVYTPKKDRRFRVQPLPSLHANSLADDTPLVTTTRTLSPNFRAFALQDGGVFFTHPSHEQVMRVGQNILAEETKATGMTSMDTYVNSRIQSIIAENTVENVALSHWRRRHMWNLVRTHGKLQRHWGVSDATKGARSNIYGRPS 189 T 0.12 Toxin_10 pdb F Eukaryota T 6hix 33 GA Ap Q57YA9_TRYB2 ml53 MLNPPKHYSVESLRTVGLLPAQLALSRKPRLRPHVGNLKGLVYPLPYYAMWRGNHNKYTYNKSTVCLWGEGDTRSMYHQHYAHAKCPTDYGRGGREFEYLTVKRGKMLQKPLPRVQYVAEGSKPVWLFKSWHTPLSSPSMWEREVQYAEHTPEHIGAKRPLAVVAPRTMHRYLFLMHMEKVTITVSPLLFGYGHTIQKAVLDFYRRAISARSPFPKDKVFLFYAIDHITPRIEVTWLDGTSYVPPVLEGASSQDLIQMVMEEAWLAADRMAAEGRVLNPLAIDDYKWDQLVVFKKVRDKEASKGGGRKK 309 T 5.5 MRP_L53 unphh F Eukaryota T 6hix 34 HA At C9ZU82_TRYB9 ml63 MLRHCTAHRRYRTAWRELLHPLPVRARKMEWLKRDAVEENEEILRRPYYTIKSYALPPAVGRQESIHNSNNIRGGMHSSHSLDLIMRQPRRVKTPEQLRALRDRLRFIGVTGPMPQATSVSTKSYTDTYGSRLRPRYPESWDTVPPHQPSRELL 154 T 20 DUF4113 pdbhh F Eukaryota T 6hix 36 JA AB bL12m AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 56 T 10000 DUF4699 pdbhh F F 6hix 37 KA,LA AC,AD bL12m AAAAAAAAAAAAAAAAAAAAAAAAAAAA 28 T 1200 DUF4699 pdbhh F F 6hix 38 MA AG bL12m AAAAAAAAAAAAAAAAAAAAAAAAAAA 27 T 1100 DUF4699 pdbhh F F 6hix 40 OA BA D0A5V6_TRYB9 ml67 MLRRFALTSSVALRLRFERDSGHNTVRYRPIPESMQPKHLEDNFTPFPLPKFDESLEYGPVRLRNIPDIEAAKERRRGSRLAATEVLLQETLQEENQFATSGKGDGNMAIAITERHTEDVTTPAADSRFPSQTMSPCSHEEEMRGYVVSRDYPLIDRLHCTRSIEELVAQFEDRPQIESRVAALADMASTVSFRSDEELLRMFTAISAPFSVDGRGLNFLTVKVSKFGRPYYVPNSLLPAYVNLVDATTIALVREQPWRLSASPALFIQVLQFMALIKVFEPNKWFTFSDHAPSNRADYRHAIGVNHSTAFWGTGEELYDFMVELLRVEDDGRIPTMLDLCTREQMVDLLSGFCGVMPCGKAVGDVFKTITDAFLRRVRNDISGPWSAHDWAIVERMYLVTVLCDAGNNEILQLLLSDTASPRGPDFFAAVSRTKDTPTKKRALCLLQEAIDNASAKADKVTLLGLLESGSEFLLSLVDKGVAHTFATQNLFDYRILNSFLHCSLVADRLRVEQSVITSLIPSSLRDVQVQMLMSNERNALNPLTSSLPGNSGAIATAPKLKRPLMTMLSQLEYLNSIDSVFILHSSLMATSTDQLVSAVRRLPSGKDSLIVTMSCLRALSVKSLTSPSMKERIACARALEIVSYELEKGRAVLLPFSEEILLHDAGAYCDEDLMLWTVAAFLARELPLVKVHTLMHSNCTARTPYRFLKGGHNLLVSSRSLYDKGAPLLSSLHSKELRLVTHNVRLRTPVRDRKCTLQYYNPIRARFVYRRDKPLFDKYHVTARNLAPGFSRGALKHDWRALGVYTPDHPQVPYHPLQTWMLGETTVEAE 831 T 0.047 DUF5642 pdb F Eukaryota T 6hix 41 PA BB D0A135_TRYB9 ml68 MLYTRRLMTTGGSATADGAVSYSKGSYHIVPKKYTVGKRIAVRSYLDRNRTELSDRTYMPQKAWFEPYTPKKFDMEHQRISHNFYNLETKLIWTAFDTPELIGILLHDETIKGAPHLYDAEFLESAVHWTRESRYWRCIGITKPFYNKTTLRAQCWHDRGLQVGTLVFSQAMRDALMDLERAVRRKELGLEPNYVWDRWGPVGFIDGARTDHLPRFAHNPYVDPDGVEVTEVDIAPFNTHEQIKERYGAFIDPDLRPFEGVFRAPSHGALTLDDVPHQEAVRLYRDLMEKADMPVMLGNGAEIPPMDMRALFHLSANPERMKAASELSSWREVRGMLAPVQEVCDEKVEALRLMENTRHDAARVRTFYEEKCGFSDFMRTPDKVITAAVLCYLQELQRICTETDWGKPLARCLTDLERVNVMGKDAFLVYRHIEDAILDKKRRVWATRFAGEANEESTLDYLLENFGRRTEQTRNVGTTGTEFDREQEPIGRQVQRRVLDSDKASKLAEVRQKRGKMWSNKKSVFDSLHQKQLQNVTYGVH 541 T 4.7 VirE_N pdbhh F Eukaryota T 6hix 43 RA BD mL70 MCLKRKAPHLFCFCLWSIFLSFRCFCFRSYAIMLPLLSFPTIQISIFLSFKLPITTFLLSPCFVFVFVFAIRYCGELTLNAQLVLFLLYHCAQTQRGPLKEGEMPICPGLCGELAAVPFRVFLGTLPTLAVEERFLRQLQPVFAWYSSRKRVKEQANEFIEIDLASCDAELLLRYSHIYYVRRQLFDELIERQMTLLDSGKAPKMAEPSLLQCLAGCNMTIADRLQLEIRQLGAAKRAASVPGRRELDPVARLEVYDYACMMRLVEEDAGAVGDAEMKARAYLPREVIESKLGHLTQLLLGSDARAALDKKDVKLLNRMIPPDYTRVGCVEKLRPFDVTAYFRFYGERINNVKVENYFKRALWGHVYRRFATTPSFLSGVSTYWARHSGLDASFTTTTMPQEVAVAVCDQQIQFPAIKFRAQYVYTSPETARQLWRTDAAVPLMRLFPLMGSRTAEDLAAGVLTDAFWMHLGLSEEENLLQDSLLLKVRRFVDEVGDMYETNIDSVLKRVDDNFKQVVPQLKAEDLQVDAPLQDGEGEDTVRETVAA 547 T 0.26 MGAT2 unppssm F T 6hix 46 UA BG Q57Y49_TRYB2 ml73 MRLARTLHHVASATTGGGQMLEGLVNDGPEVAHKQHASFTPFSIQPWQARCVGASRRKLLPQMLMYHGARLGPRPLIILDHSTKGEAGVAEAARKYESILSQLSWDYGAVYIPLHAQCTDSSKDLLEQSCQRICAVMDALDVRWTHFLTYSYGALVAVRMASSQEFPHRVGTLMSLDTPLVTREFLRNMEQREDIAKAERDINVPEDGLAFAKQALLSSLEGPLPCPAAEDESLYRDYLFDPNRIFGAGGLVRDESRYVPLKSLLGVRHPVQLIVPSANPLSDAAAHSEVFGHRRPAVVKCCQRHEDLFKESAAKEVAGVLGAWMRRFEPDCFISKRYEQAANEMGQLMLSTAQVSSESAGKGGGEPRKKKEKKKSKA 378 T 7.3E-10 Ndr pdbhh F Eukaryota T 6hix 47 VA BH Q38AM5_TRYB2 ml74 MLVPGLSLTRRAVTSSCCRPLHVVRGFSTTCTLFGLEQLQDVPTSTSRRPTGLHRGPGKRQTSEREAAQYKFIRRWELQMRDEWDQLEPFKGLPKPKRQFGNEAAEVIWPYALLLERVVKVHPFTKSIYVYYAQRQSTARGKLAAEIARSFAREFLIPITFHNSQVYTEAEMLLEYSETPWVVLHSLDNGQKPRILPVAPVEGTPAHTAVEQLLAEVVQGCEALGASVADPVTATRVLNERPLQNQYVRVDYQWFGDTPDERASHLVRWEFEPEQIEPKIRHRTRHVLDWLNYDGNLPTHRAVHVNAMREKARQKAPRTVAGPRTFYNSAGSRANARSSRFGGQAAVGK 349 T 0.023 L51_S25_CI-B8 pdbhh F Eukaryota T 6hix 49 XA BJ Q383M2_TRYB2 ml76 MLRLSSWNLKSQHHNVLRRSRPHIHKYRELNRWQRQAQGISKWDQSHSHRPLPYVERFNPESVGLTRGTSAFAWKWWHTQYPWLPNVPPEAAQIDEAQKQERRSHRPPAWDDEFAKVVLNMNDAEIREYLMSKLTDVIFLETQRDGYELRRLDFEGKPLTSLPEPRIIENFVLEEETIRERVIYQVVEGVFRLSPTSADRRELRSVANIIDYVLTHVRAARPTDRERRQERPITSAALAVMQKCPIQPQLGFVHALPHDTRDALLQEWERMHHLDWQFGKAVYTPRSKENVRGNLTWLREDRHYDQRMKFMQEVESGEARAKHMKLIAEAAGN 333 T 1.8 KIX pdbpssm F Eukaryota T 6hix 51 ZA BL D0A4P5_TRYB9 ml78 MGASAGLIRRGGGVFPDAVSLTLTPSRRVYGGSGRGDLLYENPDARRHSGRALGVLNGVRHSSQATMPESGQLYYRKLILHSRPPNGSCAGLQRHCHDTCNWSYLIPSLHRCAESAISAKLWEKMCQLGLEDRSKAWVNLTQYERQRVRDGQNLYRYEVHQRLPLLEESIGWAQLDDLLGWFRSARRAWVRLPTSVTLSRESSEAGVASVVSPSSAMSCRLEGHADSRDTTPGRNQVFDTPERVEQLTEATVHRIREELQRLNRSERSDCEGSAAMRASARRLARDEELSRCVEEELGWHGVALQHRIPVPK 312 T 0.091 CAF1-p150_C2 pdb F Eukaryota T 6hix 54 CB BO D0A755_TRYB9 ml81 MKRLFPSAGVSVVLTSSSIVMSCPCNHIFTSRRAYYWPYNEDFVPEGAETSRFQSSGSPGTRRRVLQEYALSPLFGARVPCCVVGTLRTAKEIIVLKRDIQSLLCELMELPQGSVTLGPLQQLREMMLYRHGTPLSPRLGDERQLNMDAYARRPIAARTMMDVFLAEDMSLDEIVNFGRLSLGKLQIALNNLRKENAEKSSADCEDGSAVEPAQLLVDRMGISYCEIPSLDESDYVFAEVGGVAVTDEEAERVAQRWAERCE 262 T 10 DUF1382 unphh F Eukaryota T 6hix 55 DB BP D0A0S6_TRYB9 ml82 MRRCAACLTSLPVSSPTGAAVGAPAAPLKTQAPSNRSLLGYPLRRAAAMEMLYGGICIQHLAQPPFPLRKIQSESLPPPSLQGERDDLELEVKDSTGNVMGYRLFPVNIGIRARTESVRVRSEDCYKRFLAQKHCAAAGVPLQFPAPSSITNSNCLATPRAASHFHPPSSSLSLFTRPADSQGGDVGRTTPADVAAYHPRAWRPYQMLKPMPHNWGPAVRSSGVRGPHMQLLQERIDKKGFGWKRKSRSLWQQDISTAGFRPKRYF 266 T 5.2 BH4 pdbhh F Eukaryota T 6hix 57 FB BR Q586A6_TRYB2 ml84 MLRFTRLFREMAPELQLEYMPVLFTRTILGPQGGFAGEERLVKLEVARKYMEAGHAVTPTEELRRGLWCYNPDTDKYDCFIERNEEFLDFAARKRQWLDVYWRVNTGYLLFGRQSWGQGFLINCPLRKRDVAQKLWEQYKVRIDPRLIEFREKDRRTGIQELGHNWCWLYLPGAEELGINREVYDNKRVKVRIHVRKMNSMFALY 205 T 2.2 Ribosomal_L9_C pdbhh F Eukaryota T 6hix 58 GB BS Q38FG8_TRYB2 ml85 MRRLGVFCRGQRNSPLLTRGIATGGRVTNEDRRWWLVHLECAPDITPGTFIAWLDCCGTHTCKKLIERNIWTIEQVAALDSDQVDELKYREGCLKMDVVWEHARTIITPLRQREVTGGVESELQGRIMELRKKRELERRREEILKERANVSEQREETLRKLREAIAAKKAAMXQKKQAASEAYGGSSDGGARKEGAEE 198 T 0.02 RNase_Y_N pdbpssm F Eukaryota T 6hix 59 HB BT C9ZPU8_TRYB9 ml86 MRRCIPARGGFTMKYKKGTGLWDEDHVNDYKTNRYLSARATMRWYQEMERHQTRNSLNARRATQSHNNNRGLHHTGRGAFERELERRGVQVEKYPLTTTTGTMRVAELVILRRMELEKRAEEALAEQRGELQKKNPTPSEWYDESKGPLNPNFLRSMRSHYEVDIANLPDTPLIRGQREFFIGEERGNGAA 191 T 3 DUF2663 pdbhh F Eukaryota T 6hix 60 IB BU D0A7Z9_TRYB9 ml87 MLNPTFSLYRKTLQSYPVPPKIRHYDRRWSGSRTNPYNRQYWRVIMNENYSRPSFWVSDFRHRYLMRTGTDYQGQVPSSPQPGLYQGFSDVHKLLANHPKPQRESRHLPVLPMTPRIVFEHANEKRIDTAKRMRRDRRRIEELKTLEFWGWYMKLQRVRGRWCREQGVSSRGVYGPAVDAAELWG 185 T 1.3 Crl pdbhh F Eukaryota T 6hix 61 JB BV C9ZUW3_TRYB9 ml88 MLQINAFKLVRATPFLLKRTGKPADTPDYKQVYLPYDAAPTERELERERRRFKQAYHGRMEHRKLVEVKEVPLNVYTYGKEGMSLPIAIFKDQKDPVIGPEWTYPGIYENKIAAQHWYTEELFDKESKEAFESPWQQQILDNQVKRRMAKVMFRMRQVNMKAVDLFQKERGSSRRSGGAGEKGKDGGGKK 190 T 2.5 Ribosomal_L37 pdbhh F Eukaryota T 6hix 62 KB BW Q57WW5_TRYB2 ml89 MSGAFGCGSYRSVVAGTQNVPRRMTFYPSAYELIQLHKAHREVIRHFYVRDKIFDNKFPGNALANGLFKFVPNRRENYHMRELMESIRRRSIWMHRIKQQREINAKVVENMEVKYGKKAAASMLCFTTPDSNAYFAPHRYQDVANSWPNYWQHPSVNHVVPKPRWRRHRELGGITRVEDPFAVQASDY 188 T 24 Babuvirus_MP pdbhh F Eukaryota T 6hix 64 MB BY Q580U5_TRYB2 ml91 MLFTRCLLAVTTINSSTASAAGRLIRIRKKSKWIDRRSKRIPHNGKDVWQFGEQPSCALCHVRFRFKQDYEAHKESELHQNRLRWVETMKWWEEIGEPHHQQHAASEWEWFRQRVLPAKAAAMGLSEEDAARELRRAVMHETPRWYSRIQPPNARSEIKEPRDQRWPSSPKW 172 T 0.0021 zf-met pdb F Eukaryota T 6hix 66 OB Ba D0A4T0_TRYB9 ml93 MFRVTGLQLKNPVVFKQGQGMFSHQLKRLLQKKSIHRYNWDPLPMYDPRKLVHASRHMDVETWREVPDPHWDERSYLVPDQMFYNIPVPPEYKDAYWWRELQARRVQCPVEWVSHRMYNKGDRQRYDFQDLAFRKKFEFSYEEVVKNAKDMRS 153 T 7.9 Pox_VP8_L4R pdbhh F Eukaryota T 6hix 68 QB Bc Q389K3_TRYB2 ml95 MFRPTTAIADSFKEHYHRVHLPRRLALQRYARQQSLRNAAKGNVKAEEVPYKYNRWWVNEEHEFVHQYAFVEDPEVTKAKRETLPPVTRENIWKEPQQTFFLPFAPYVRVVDYPKDPDAKFLKPVNIPRWKDYMQRTKPVIPRTWY 146 T 1.8 DUF4653 pdbhh F Eukaryota T 6hix 69 RB Bd Q381W9_TRYB2 ml96 MNDIYARRLAQATMFHQLMRCHGTLWAATQVTKEQMDYNFIREEFMRVNGRRAMPLLLGAAANENLHQLHLSHLSEHCAWGESARALAVQRQTPLSQRVAALGRMAETIHQVKTASTVQNLFNEQISCMEGISSFEEEPLIEGE 144 T 0.083 CCP_MauG pdb F Eukaryota T 6hix 71 TB Bf Q388M2_TRYB2 ml98 MVLRGVRLRSVAVSCYGSSLTAATRCLSVRTEDFFSKEAISHARRVSWAPHTTEKKQGAFAKLARSNFGDPLPSSFAQEPYFEEEIEAHRKHHRPDVYIYKYNVSPTHFSLRE 113 T 6.1 DUF2975 pdbhh F Eukaryota T 6hix 72 UB Bg Q587H8_TRYB2 ml99 MYQRTRFLWSSWRDYPLGSRDRRGRFNMDEAAAALQLNPAYAAALYRPLNYTFHIRGQLYPAQKGRPSRPGSLAASQGRMFPLYQRNDRLDKELFRLNSRGLTTE 105 T 1.5 Tenui_NS4 pdbhh F Eukaryota T 6hix 73 VB Bh A0A1G4HYZ0_TRYEQ ml100 MALFSCFRCGYMYEFAVSNSYCRKLTLRNDHCPRCDQLTLFRFMSVSGMVGNMPFKPIGVPGPSYATLWWRKTREGKEASAPLDAVCKSDRW 92 T 0.004 DUF1178 unphh F Eukaryota T 6hix 74 WB UA UNK XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 46 F F F 6hix 75 XB UB UNK XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 40 F F F 6hix 76 DC,YB UH,UC UNK XXXXXXXXXXXX 12 F F F 6hix 77 ZB UD UNK XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 177 F F F 6hix 78 AC UE UNK XXXXXXXXXXXXXXXXXXXXXX 22 F F F 6hix 79 BC,CC,IC UF,UG,UN UNK XXXXXXXXXXXXXXXXXXXXXXXX 24 F F F 6hix 80 EC UI UNK XXXXXXXXXXXXXXXXX 17 F F F 6hix 81 FC UK UNK XXXXXXXXXX 10 F F F 6hix 82 GC UL UNK XXXXXXXXXXXXXXX 15 F F F 6hix 83 HC UM UNK XXXXXX 6 F F F 6hix 84 JC UU UNK XXXXXXXXXXX 11 F F F 6hix 85 KC,MC UV,UX UNK XXXXXXXX 8 F F F 6hix 86 LC UW UNK XXXXXXX 7 F F F 6hiy 1 A DA Q57UJ2_TRYB2 mS48 MMRRVASTGSNGGHCYVGSSLWADSVYMEVSSITLQKRFSFKYATKLQHDEMRQPYYIHEKRYGIFSNERNIAKARRGLPFITPLYTKHMNLWDTDTDASNNRFFRGYYYGQRELHQLLGRPHSLEATNADGSNDLSTYEANTSQLYKGIPRPAITNLHYEPAWRYTLYQAGAHGAQLSNPRSPFTAKVLGDELMQVRDIKSVEHCKAWFDRLQYLINLHYEAVGDIGEFKSRHTRHVHEFFVAFHDALSSLDFRDTYLFDQFKAVRPPELSDLFGIFLEMEANYVHEDYCPRCSLPYSTTRYCGEGDVDTPFRKHRGRWAPHQKWGREWYDVVARRAEALWYRATEDPYFGTPQHTQRQAEALLKVYVQAKQRGKAIDFMNKLRGSVEYLTGDITITTVMQESYDALLDTTPHPHLLTNGFTLESDAAKYTGEVNKAPLSPLQFRIDMEMNKYRRQQKEEGVVRVPPALWRLDTSAIVPYKVDSKTRRIVNWREVKEGIEKSFLSVGLPKEAYTGNEWREMLYLHDVIANREAKAAVLEQQQKLDREKLKARERASLKGEKSSNSASNGIGIIFPDKDGYQVFFVDEASLRPFGIGASGTLFKAVARVYPSPAAVPYDDPVHGKQFLIDTTDETCHLFGGFEHGDRLLLRAKSSGGDVDGGNNSNCRNTGSFTDEEEEVIVMGVSTEGSDAERMLCAMHVDTGKQRERGIVFLGTDCIDIRERWESVRYAASAYVKGRVTLLESQRTAREEFMGQVVGIRDGVLFVQWRLLKGGGSVLDRSVAVPIGDSTQVRELFVETKVLGVEPLQSPPSWRTPFRNDFAEERLKELQRAPFKREKYVSLIQGKYTPKVKKFGYTQHTTVDDFETKEYKDRLLSKQFFQNPQAFEVVPDRHERSVQFGGKWEYQRTHGLPTVDRNELENGWSEVEPISDGEMKVIEQALRDISGPRPGNFVKPPSKTKSLQLSESWWEPLGYGWEQHNEEQKALCDVTEQRLIDGSNLPFGGKLPPFGTSYGMGERMRAIIEDYSKGFGLGPHGHSPTHDTIHYNTLNAEGERVRDLGYTDALGRLFSEKLGDRDVHQWAVESCADGEADVRQLLLSLHEWRERGRPPSLLLANVLSKYLEEEIAAFNKGIPENAPKLQLETSDGTLAHSGSGGSRSGTMWADVDPTTFALHQASQSTRSTRCDEPFILHLVKRAKLGECVTNFTDTAYIAHLESSVINEFHLALTKLVGKGISPTLLAQKTGQLHRGSVRVGGNVVPFVKSRELSRLLERMGLTSESIAVITRGLANCPEQESVGDDFAVPVSLILSWDGPGSSGGSGSGDRRLNTTAARNEQQRKGSAALSSAIRQLSQHQKQRGSASGRGWQNEDKMVVHVLEELSLRNDGLVMDIQYAIRENKKNRRLRWEFVTTLLPVFGGNEAKVEQLYSDYCDGKYVPNITVAVEAFIAFLHNATQHPETYSASDYFDIDDGKSSTASGTGDQPSAGQYTTLKLLDPLEGPFVFDNVKVEYIQTVERFRRHGIRAGPVMAPATGFIAANCKSLNYFTRRQEEVVYVTNDSDQGLRRSLENSAYHKTIAANPALQYLLKARRGAALVETFNRFFYRTMPMLNFYQNVLKHYSETIQPLRQAAKSSTRGLARALESERSAAMEEFRRNSERYWRNIIEGRSVEQVVLANGESNRRTVGGGAGEQLQKPQGNRDAVRTNFTGEATDGSRKGQRQHQQPPLSQQQQHSRGEGERSAVHTSPRQGGSRGMADLLSKLNTPKTGSGVKGSTAKNTGNNRRGPKS 1788 T 12 DUF5053 pdbhh F Eukaryota T 6hiy 2 B DD Q385L8_TRYB2 mS51 MLRCTVVGHHRSGKARAFVFRDPSLRMMRAGSGYQQLRRMGMPMQVGMGWRKVDSFHANTQYQHAWPLLSHDDLGNSDQSNNTKNIMYSMYMPKRNKGTAPWFRGADTYSVKYCEQGRYEYQRYLMINRFPSEYKKHFLSFLSNIRMSSGSATIPQEALHWLLRMIVDNFNPQHVHYIAAMKTLQSAGELDMARDVWKIMERQQTWPCTATICAYLDVCVEAGEKTWAMEAWNRYCTELKFLEPGEVDPKPISRVPFSLTREELLYLPKWKKHFDHDPNLDVMDLNRFNRTREVYLRMAQVMLAGGERNAFQHFFTKLEEAMLNKPTPVPEPPNPHLVRRPRWAPYEHCKSVHHSPWRLQNNGRALALGPPVTIEDEMQSRFFSNDQFLVHSVKEVLRIVLQEHKRAHPTECTRCKTEAFFYKTKDADETLKFCDDLIERLFASLGVRLSNLNTSSLLSTILEVFRVVGKESGAALLQRANEFLERKASLGDAEGSRENLTASNYLQVLSGFADESAFVYNTKKDGTCQYKTGFDPRTTMRHLADVVQEIAGNPHVTWAADMHLQVVETMVGCGTMKANDYFVRNVLRQFSWDSRFLEALYVEYRRQDDVDMWAELTKRALVWTARYNAPASERLRRLIEDDYDTIRVQTRTFRELAVFQFRDVEERRHSRDVVNELPNPWYDYVAHALPFPDRDAGYPDEYGDLGQWRAPGGPGSPVRGPGYYAPPMEGEHQRGYTAEWRDLRNPMRPPEFPTPWERKYRQYARGQHPSYDMVYAGPMPEIFPMRRDFRKPTRWDFHDIEKQGKYRTSGPY 812 T 0.001 PPR_long pdbhh F Eukaryota T 6hiy 3 C DI Q587C2_TRYB2 mS56 MRKFCHFMANCWNSARSHATYGAVPLTHSQVTSVYATDGGKVDELGLLELVEERIFSWKLNKWEMRIPPNLPNDQKELIRQEQENLKQILSEWRKCFGALNADILQISSLTGVPKDVVREKNRTWLQEEVAKLRWMGEVNKAALLRDAFMRLEAFGSRDFMFMERLCCIYGLARQGTFDEAFTNYITEDPVTNDIFVDERNPFKELVAHIVRNYSQIDIIYDFLGFNYSEGYRSSLRRYMEYLQCKTAENVRASGRLVTGDKGEHNILFDYCVSRESLVSGDSCQGIIDFLYINGNDVTLIIIASDNPWLRNRQLPHRRQMEGIARRVCFVLGIPPSEVRIRNLLLPPTYLDKGSIVRLNDIVFRLSNEQSNLLIPWLTNYNKELDPKDVDYTALAKTTNEEEWLTL 407 T 0.2 ApoC-I pdbpercent F Eukaryota T 6hiy 4 D DL Q38BS2_TRYB2 mS59 MRCSCAFLDKSVFAAKRRVIVPIHPTPNFPAHFIKSAFTTDPLKEKQKARFSSGGEAMREVQDIPKNLEGERSRRDLASRGDTEFQALVEFIEGASYDQLISGRRFKKVYDVLSENDDMFIWLCHTAMSVLNPGDMRSRLIYNHLRILAESVASGEMTQRTAFRFFESAVRSPAYREIAKRQLEGGAATRLAGISAAADVMRRMGLTRRPMSSYFELYQRIVERSEAMTPWGFPPLFQFEERLSLEPRLKFFSRASQQSLERRRRGHVMTPHTTLHGRRIFWIPPTWNRAGRFLGPHVTLYPGMTPD 307 T 0.37 DUF2840 pdbpssm F Eukaryota T 6hiy 6 F DN Q38D60_TRYB2 mS61 MLCRTFLRQFRMSGGDMFVEYKVLSRDHRRSIRVEDAIVDPTFKRTVLPLGWLELLRSPSLRLPTGYFVEETVHVSLPNATSNGGKKEARPQKGGFASGSPSVGRNEANAIIAGPVVLYITGQSVPVVLNPYFVPEGTWDMRTRDGELDLRLGMDAIEQCTLFSELRPGGLLYGKLPENPNVRRNESLRATLGRYGMKCDLAESPLVPRPWTRMRYMFIDELQRGPKLTEFVGHNPRNGTPWRFSQNTKYFRIGIWRDTIRRNDMNEGLHAHSSWQKSPQQSVPEVRFLAPYP 293 T 5.9 AAA_11 pdbhh F Eukaryota T 6hiy 7 G DO Q383D1_TRYB2 mS62 MRRFCTARVSSYVKRAPLLLPRGGRRWRSGAGTTADGGGEKEYIADSFESTAGSNAYAAMELAAEARREIHELWLSAETALEREKRVQQVAALIEKYKLDPSTPREADVSRGLGDAFDRLLLLCLPLGKTDAKGTDNLERLMHLAGRNGRELSVRTIQHLFARTDSFAEALAVFYTMRRCHVAMNMEAYYSMLYSLQRLEEEGWGQHFRNEYEENGAPSEQAMDFIVKGISNALLPENKPWLGRVMFQDRNVPDRRYDTRDFDELDTAWTQRYKSGTPAGAH 282 T 0.084 ECSIT pdbhh F Eukaryota T 6hiy 8 H DP Q38F25_TRYB2 mS63 MLHGTPVRRASLRYRRPYWMMFLKGVDNWKIYTVIQQPDHQRTEMLYQAWLGGLDRPYTRPKCMANQPLWLSKKRHMLRKERLDGPETPLEKYVLEWHKKFHSFQGTERPTPDDLHTALDLVERPLDLSYALQLLGQCRNLNNIRFAKETFLVFLEACLRVGRRDCAEYALEHAEPLGFWFIDEDHRRYLQGEQTWYKLSPLDNLYYPVEENAKLNEGRKPITRLSPATESPGSGTAISDGEPSTDVEGETTVDDEIAQLEAELAALEREGGGK 274 T 0.19 MNE1 pdbpssm F Eukaryota T 6hiy 10 J DR Q57UA2_TRYB2 mS65 MFSESLCILSRRFRYNTKFPALVSYNKLPWEVVNHETPQFHMHVAPHYEQLLTLAASSPVPHIIGSKHIDVPREHRLRLLPGMLYLLDGDTLPGEFTINRVLDPTALQYYGRLSSQIVTVEAVRMLVPDDLRLLCNCITFKGPLHLPVAPYASLASLRGASQGGTTGSETGSNCFTLYHFVRPNRPPKELQLEKYYIHAPCVAPLSEFASNSDERGNWRPRLQAPKRTRRATPLPAYRPPQSYLMGLAERLAVVPGGCFGRRSLMWGHWF 270 T 0.47 DUF3475 unp F Eukaryota T 6hiy 13 M DZ Q587C4_TRYB2 mS73 MLRRSILFRMKYADLELTTRGEFPHGMKEPAFVKKLDQNIPWYFSTYRSMYHWPITGDNWSDLNEAEKHHDLHMFYTLAWWKLGEGIFGVDEDS 94 T 9.7 Mastoparan_2 pdbhh F Eukaryota T 6hiy 14 N Da mS74 MFSLTQTWLIAHWYCGHKFRHRFMRDKRFHPSLQASHDARNRFSKRRHFKTNRWNYQQAYRDMP 64 T 0.093 MLANA pdb F T 6hiy 25 Y CU Q580M9_TRYB2 uS21m MLHTTRLWLGGYMMYHRKAMGTMKYSKWKGAHGGISHFYGRTPMVEEVRPNEPITLVDRRIMHYVHHSRLRHFQLFRSYQEKSNSTECKLREGEMLRRRWHRRLQKSFIAFMQFKTMKVLEDQARLVNTYGQAAVNAALGDPWNATDNVARERKSAAVRRQVRALPMVNVVPKHVATMKQIHNDRFNYRWRVN 193 T 2.6 HMD pdbhh F Eukaryota T 6hiy 26 Z CZ Q57WU2_TRYB2 mt-IF-3 MMQRCSTSLSRLCFRRLLRTPLLVYSIPPTRDVPSGIAHCPLSCSMRMVTSSNDDEFVFDPTLSIQKDAAIHTAKKSFETIVLEYVPAHAPEEARQKVKSYLTQHPIDILITQPKVQITHLEDAESGAETKVSLSPCDLPEALQQARERGMNLVQMGARGDVAYCRIRRESTRILSLIHTELEALREQEEKQQGKGRGGVQAAAKMGELIDHTFRDAVDAHFVGWRSKKIVEDIRRRHPVKLTIKEFQSPECAIGKLREMCQAMQHYAQEKVIYHHFTSIVANDREASITFAPALPMAKSDSWKHIKYPGEKEWTNALRRMEDACRKSGRYGTYAKSNKLKLRSLGQTSYRVDKYGRKMD 360 T 0.0011 mIF3 pdbhh F Eukaryota T 6hiy 27 AA Ca Q38DR3_TRYB2 mS22 MLRRAYIQRRYPFNKRGPREHKSWKHHVLTEPPKPLQWRDPKVWTRDLSVMKSFDAPQWDLWQSRPRSEDMDEALQPFMDMPKSLKDRRYDIPWWANPFGAWYLQNILSLELLKLKSKTNAEKIATYRSYMRSLASGKDNTMSDDDVIRNIIKERWKTLEFGDRNAGYPCTFGDYIQFLNEWFKSLDEEGMQRLREHFDRRIRPLLAVMSPVDILWLEALTQNSPHNKEQLQRKIAFQTSLGTPEFFDMSKRLRYEINEDYKVRDELGPELFALWSKAPERWPPERLSKMYGLDFTLVRKILVWHHFKACYDACVEPDWSLPKRLFALEWIRDVRARKHGLFYGKMRFAEQKITFYSDRFLFRDLVNRREASYANVWEMDDPYRFLQTEQDYEDYWGDNYDVYRRMFPEMIGRTGEPVQQYGQMPIWAGPHRQHANKSEHNWMFAEIGVNVGHEALKKLELDPTNEKRRRFVIRQPDGTLRSAKMSEMRAWYWKEEWADFRFWAPQMEWGIENTPSQEQYQEHVPDTTDADFRKQRRIQSRPVKWFYESHYTRTGSFAGFQPLRFMQRRTEREVRWPDVINAAVQIQKRKPAAYIFKAIPEL 602 T 0.26 MRP-S22 pdb F Eukaryota T 6hiy 28 BA Cb Q57VB2_TRYB2 mS23 MRQCVVRRYKMPKNMGVAPRFDTWNEKYEPWEHMKRMGRLVGTGFYIPPEWYNHFRMFPPINHNFQQEKTLNPHNASEPTQDDTSTLSPERVALRDELARKSRLVASEGMRYYNIFWVRKPLDTMEKEYYELKRRGVDHGEAIRKVLQGFYSGLAVKKRVAAIQAEEAKLTGRFITMREATVVLGVLAKLHKEQLTPHQVSLLAKEQGETTQSGAKLTAIVSRTQPHVNKEASSPATSEAVGSSTEESLSADALASMLSEDGEQSAVGTRYQVEVKETANDSVRQLREKAEDQTGFPDWYTGESPTYSGTSCSRDGFALMKANK 324 T 0.026 Herpes_ICP4_N unppercent F Eukaryota T 6hiy 29 CA Cd Q38DK6_TRYB2 mS26 MMRSSSFCRRQIRPYYNLPSKSEHGRKMTGFLTPYRHWMWKQNELWRNVHEAQFEHLRRVYKRQWLESFRVNADEYIYKYNITKAAQLAQWECEMKEQEKKRIEARQMMDGRQALKKKHLDLLREFHERQFFFWYERASERLQNMNLINYVPHAQLREHIDKELDKYVAGKNEPYPLNFVGQMPFLEDGDGNIVEVPESLLSNHMAEHPDSTAKPHEPHTSSSISEAAAFEERMLRAMVSAKEEDLKEWLGDDSRALSETIDDISREEEEREADIRVARSMEETDAEREVSRRAYIERGKTGSRSIFRPPTVSEGAGGTPSAPAGDANTPMRRRKKGKLDRVHALQAHQDELLAKLSSQGLKEGVDASSVPERGKIVQSRGRIRDKAVIPTHEVLMQKPELAAGSTPGARIQTKDMVDKMYHRGKYKKSGSGDKSDGEDL 440 T 0.028 ThylakoidFormat pdbpssm F Eukaryota T 6hiy 31 EA Cm Q38C96_TRYB2 mS37 MKSSDIFFAYRLTPVVFKSRQHDSGVNQYGLKPTNAYDYINPTNLINFGRGTTFDNLGVRRAGRGEIDSSPSHSGSPVFTQAKLIGLSGEEQLTMCQSETMALRLCMAKAGKETCERESRALDSCLGRVGHLRRAMSEACWEFNDWFIQNVSDNHTKPFQHRPHDWRHFYAQEKLVRERQQNGHAYGRRPKQFSFGARYVKTDGYGKRPRLPYNK 215 T 0.054 Gypsy pdbpercent F Eukaryota T 6hiy 32 FA Cn Q57VQ9_TRYB2 mS38 MRAGCVACSRIPKLGIAALGSTTTSMQATCQETQELRCATTQMAPAVIFPLPPPLRGSYIDRKPTAASFNEHHATASFRHHMIASADVSHRPRHFYFASGTRNDTGTRSVKEGERKQLKQQTNVEVDSVAYGDQLDELSARWAAKFYGQVTFGPRNYPYPSSRWLARRFQMKKHRIIKRFRFRRYKLAAVANLPFAKMIRVGMLPELKSSKTKRGDVVDPTLSGQLVSAVKNTGEKRKGQRTRPKSKYQV 250 T 28 DUF2996 pdbhh F Eukaryota T 6hiy 38 LA UQ Unknown protein XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 32 F F F 6hiy 39 MA UR Unknown protein XXXXXXXX 8 F F F 6hiy 40 NA US Unknown protein XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 54 F F F 6hiy 41 OA UT Unknown protein XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 44 F F F 6hiz 1 A DA Q57UJ2_TRYB2 mS48 MMRRVASTGSNGGHCYVGSSLWADSVYMEVSSITLQKRFSFKYATKLQHDEMRQPYYIHEKRYGIFSNERNIAKARRGLPFITPLYTKHMNLWDTDTDASNNRFFRGYYYGQRELHQLLGRPHSLEATNADGSNDLSTYEANTSQLYKGIPRPAITNLHYEPAWRYTLYQAGAHGAQLSNPRSPFTAKVLGDELMQVRDIKSVEHCKAWFDRLQYLINLHYEAVGDIGEFKSRHTRHVHEFFVAFHDALSSLDFRDTYLFDQFKAVRPPELSDLFGIFLEMEANYVHEDYCPRCSLPYSTTRYCGEGDVDTPFRKHRGRWAPHQKWGREWYDVVARRAEALWYRATEDPYFGTPQHTQRQAEALLKVYVQAKQRGKAIDFMNKLRGSVEYLTGDITITTVMQESYDALLDTTPHPHLLTNGFTLESDAAKYTGEVNKAPLSPLQFRIDMEMNKYRRQQKEEGVVRVPPALWRLDTSAIVPYKVDSKTRRIVNWREVKEGIEKSFLSVGLPKEAYTGNEWREMLYLHDVIANREAKAAVLEQQQKLDREKLKARERASLKGEKSSNSASNGIGIIFPDKDGYQVFFVDEASLRPFGIGASGTLFKAVARVYPSPAAVPYDDPVHGKQFLIDTTDETCHLFGGFEHGDRLLLRAKSSGGDVDGGNNSNCRNTGSFTDEEEEVIVMGVSTEGSDAERMLCAMHVDTGKQRERGIVFLGTDCIDIRERWESVRYAASAYVKGRVTLLESQRTAREEFMGQVVGIRDGVLFVQWRLLKGGGSVLDRSVAVPIGDSTQVRELFVETKVLGVEPLQSPPSWRTPFRNDFAEERLKELQRAPFKREKYVSLIQGKYTPKVKKFGYTQHTTVDDFETKEYKDRLLSKQFFQNPQAFEVVPDRHERSVQFGGKWEYQRTHGLPTVDRNELENGWSEVEPISDGEMKVIEQALRDISGPRPGNFVKPPSKTKSLQLSESWWEPLGYGWEQHNEEQKALCDVTEQRLIDGSNLPFGGKLPPFGTSYGMGERMRAIIEDYSKGFGLGPHGHSPTHDTIHYNTLNAEGERVRDLGYTDALGRLFSEKLGDRDVHQWAVESCADGEADVRQLLLSLHEWRERGRPPSLLLANVLSKYLEEEIAAFNKGIPENAPKLQLETSDGTLAHSGSGGSRSGTMWADVDPTTFALHQASQSTRSTRCDEPFILHLVKRAKLGECVTNFTDTAYIAHLESSVINEFHLALTKLVGKGISPTLLAQKTGQLHRGSVRVGGNVVPFVKSRELSRLLERMGLTSESIAVITRGLANCPEQESVGDDFAVPVSLILSWDGPGSSGGSGSGDRRLNTTAARNEQQRKGSAALSSAIRQLSQHQKQRGSASGRGWQNEDKMVVHVLEELSLRNDGLVMDIQYAIRENKKNRRLRWEFVTTLLPVFGGNEAKVEQLYSDYCDGKYVPNITVAVEAFIAFLHNATQHPETYSASDYFDIDDGKSSTASGTGDQPSAGQYTTLKLLDPLEGPFVFDNVKVEYIQTVERFRRHGIRAGPVMAPATGFIAANCKSLNYFTRRQEEVVYVTNDSDQGLRRSLENSAYHKTIAANPALQYLLKARRGAALVETFNRFFYRTMPMLNFYQNVLKHYSETIQPLRQAAKSSTRGLARALESERSAAMEEFRRNSERYWRNIIEGRSVEQVVLANGESNRRTVGGGAGEQLQKPQGNRDAVRTNFTGEATDGSRKGQRQHQQPPLSQQQQHSRGEGERSAVHTSPRQGGSRGMADLLSKLNTPKTGSGVKGSTAKNTGNNRRGPKS 1788 T 12 DUF5053 pdbhh F Eukaryota T 6hiz 2 B DL Q38BS2_TRYB2 mS59 MRCSCAFLDKSVFAAKRRVIVPIHPTPNFPAHFIKSAFTTDPLKEKQKARFSSGGEAMREVQDIPKNLEGERSRRDLASRGDTEFQALVEFIEGASYDQLISGRRFKKVYDVLSENDDMFIWLCHTAMSVLNPGDMRSRLIYNHLRILAESVASGEMTQRTAFRFFESAVRSPAYREIAKRQLEGGAATRLAGISAAADVMRRMGLTRRPMSSYFELYQRIVERSEAMTPWGFPPLFQFEERLSLEPRLKFFSRASQQSLERRRRGHVMTPHTTLHGRRIFWIPPTWNRAGRFLGPHVTLYPGMTPD 307 T 0.37 DUF2840 pdbpssm F Eukaryota T 6hiz 3 C DB Q586P5_TRYB2 mS49 MIRRRVCDGARLAPNIRATFSAVRYQSGLHTFIRDSKPSNFSSVRRSENANGDATASPGATAGENPASSGDWASHMQRELFGEVDPLGGQAHKDYYRDVTRGYSPQYAPRNFANGGAVAYPHIQSPYEYEEAAHRRVWLDHDVDRMREEFTQHRASLRSLASAQEREELLRSRAAEYQVANTVHESESVHPIQQLYNSGGTSRSALKQQAVADRYSIAEQHSPLPLTTGVDRDALDEAQRTKDRILNDSFTAENLLITHGLREKEKHDFTILQRTVRIPFQGYDMDRFLAQQKGTPYGAQQLPPNVVPSSMEEAQRTLRGSSATATPLVDAVAQKVYARNTVVDRPAIGEQLTEQIINIMRASRTTAEQQREEERAQRFGLGRQGALVQDGGPDQRTLKKHTNDERIVDAMLFQQNAYRKTPTDEHWNPYIRRSTENGVGHLLQNKFDIMRREDRLSKGEQDLTERNTIHYGVPIQQIVDEFVFRHRNARGERPLDYFKPFPNFRALRLNRMYRDVEGFSLMKQRPEFLEWELFTRYRQHHQQRRRLALLHGLEPVANETAQERDTRRHRLDEICERTPFDEREMRVNDDEMRVSVETLRSWFGVYMLPSPTVVNAVLGGSASVNLHLYHLADEMGTADTREHVLSSRYLNRLLLLESYQNRVGRGFMNHVVGRAPEPVVPHEQPQEVLRHFSAEERAMYEQHVKEQTSRQLGEWERAMKRRRWLTDHQQYGHVVSHGLETSVVDLSHTETGAVLTVSTKAYEQEIEAVRMKTNATIKVDGMVYNLLPNSERRVVPLTVQLDSGEKIDMTSEDFDRCELEAFPRNLNHALNYGIANYAYNRGNYVETQDSIWEEQTASGQEGWSPATHADGLREGLPVRARRPIFSSSAEQRIAGGPQRAVIIQYHHQPFFNPEPRLVKVAFQCDGTIMEVPISDVMIWQRRYHGPERTVGDESRRYNPAAMRRYVDVTDPFNEKTSNTEHFLDKYEPKRNADTVADKYRTTKQITEIDKWTRYDSARADNYRPLSISHRRDYIRMGYIPRYTPWEWIAIQEADQPLIAEQIRQDNIGTSYFFSLNRYWRYKASPHGYIRHFENEVRDLLQYVDGVTPWKQAQKIRTYWEVRSHHPMPQFNRPEVAMHRNTVGLLPAHMWETDKKTGKVKSVKDSVRDYQTKTPYPKWVQL 1181 T 0.098 ACD unp F Eukaryota T 6hiz 4 D DC Q57YB5_TRYB2 mS50 MFRVRRLAGFIQCSPCGKMGPLKQQTRRYTPIWKSDPAVDNVAPLRDEDERRALWAEVGPISDVGSAVTAWIRFGNDPVLHTAVPTMLGGKFRNQQREKESLLPNSSSPFAYVEDYMGTNLVFGSPVHAKESAAVWATYFERRYASRLRLSRRTVANYVGLINSPEVFDDESDRPETRWSQDTFFRECAYLSEKFLKEKVSNMQQFEAALKRASPEAYLAFFDAFQQQTQTQIPLPSPSVWHYEGERRKQWAEKFISISHKAQAFFKDVLSEDVKKYQEVPGKLLQKVKPVLADVGKILVKRHERWLKGRVWTSLTEEEREAYCMKEVKRQQMQVEDGEFDPMMEDDVDDTELEEWQREHDAIMKLMNSPIDGLHFTTLELWLHTMRCEELETEHIYTSARVRAIQVAARKKLYDTTSYEEVIQAVVESIARGTLDLGAGVLRPHFNEVWCQLNYAKFGSSTITQHTTTSRRQLLFFHAGSLKDIAATATLYYATKPLSNSLDYASPYKYRRSLITLCSNYGVETAYTTQRPLLRSAANLARAEDLIHAVVTAAAQPFGERRRAATRDLHMEFQRLAVPVERVIVANPVSALLESGADPDEKPVEGEKVNMWPLGAKRVVLYKWSAPNVEKLKAMESDAASAVSGSSLTAKRLREIQELKRRGFLEVSLWRRVTAQERKQRNEIVEAKKKQVEEVVRTVPSLAHLHQYATSLYSRIEERVAFPTETSTVTETTNMKEETNKPLEDSEWEFAVLLDDRVLLNKEESVELYLPYRDANGELLAQGEYRALVRAFDLEANPNLHPAYCSVGYSESFQVFDALPQLIAQFFRVKDATAEAAGVTHIPAADFTPFCAFLRDAGLDVPLRCEFEAGQAVTTDGDVYMDYFLQLLRGEAFHQSHAQAGLTEAQRAIEPLCRAHWVVHHPGADESEWATARRSVLDHAMQHEREWWFPNEMLDVKDVVTGSTNGLTPQMYPAAVRYGVELCTVLTAEGKFVDERGSGLSARCVVNGTGAAESVVFDTANCNGTNTTSVEDALRVAHGALRSAQDRHNTLAAFRLGPLSKQSQVLLFCGVNAYEFGGKYARTYAYAFEKAKKELEATAASGFMAPSLSHEDTERLSDQPTTSPSVDRFASTTHPEQRKAQFVPRVGPGSTPLEDPAADQKSEWS 1165 T 0.23 CT_C_D pdbpercent F Eukaryota T 6hiz 5 E DE Q386Q7_TRYB2 mS52 MRRCSFASTCRYRSADDVPLGLGSAYVWTCRNITSRGQFNPIHNFSYAMERGVRARDVKAFEKLITNPGPLRVAYTPDYLDWLHRCYKAKGTYMDARAVAEKKFNGNIVSSELSAAVNRREGKRGDTRGDTVDNDHHNLPGAPPPGMFLRPAHSFRRLAGELKRRRAQSILDEVARAQGMLDLFERQPHFPAIHIDRCSRFHLVELFKEMVLERSLDSNMIWEKALLYRAILSERKPSYPTSFHYIFTAVEDTVFAPTISTPMEMAGSGGEGHDRSSHPLAAKCPTLEAYYYYVYLVKKYYIDNAVEAHVVLRCHREPNAADLLFSNPPPKDDTEIMKAVELLRNADIQRGVAAAAAVSDPTLPPGGEGSVIGNSDNKKNSEKTSEGSRGRPARPPVLPGAYPPIDMLWRCEENLPLLKVLLFGEFNLIVSENPFVKFPSAHGFLTRPYSTDSSRTLADGMSLANVMAEKRGHLLPSLPRNTATSIDARAQDIRRLQQKHHRDDIVSFQKLLRTTHAEDSPSAFSSYSDWSYFNPRAVRAEERDRLTRKAVEALKLYDSATNDIYRHSFEDVQACHTQRVTERDRTMPPYLPTLPHFVAIIKKDPHISFLLHIGLPDRNSSEEGSAKHKELEKRIYYLARALYHTALEYHNETVRRVNRQKVNVAASLLDNFVEQEWTTILRDKHDVTDVTKTLNDTQNDKKQLARRLGRYMLFANRSLDDTGFPTDARADDYTRWMAPPSVGKVSL 747 T 7.7 Complex1_LYR_2 unppssm F Eukaryota T 6hiz 6 F DF Q38ET1_TRYB2 mS53 MSVSGVFSKGRGIGHEATTSILRYIPRARVPWQPSRFGRENLTAADMARLWSRGRYRDGPGNYNSGYCTERTHVLEENTVSIIPRRELEKYMPDITIGPKALVTPVSLMNARNGHRVTHDLLHSYDPHIGRLGKPAVVDHDNITVEDPNRVGLNAATLDCRGRIYRWLRRGPFFQVDNYFRRSVKLNRDGTLPTDFVHEAPLMRKIIRLAHRGHLKAACEEYRRVTTVPPVEVYRALTACCVPGAKLADAVSIFEDGDSKLFYVSRDGEVLHNLMRCAIAARHRARIMWVYNVMRGRFYENVVVRAEVDLIWRYRIAMIALEYLLDHECAEEAAAIYSYLVEEELLRCDVHVRVGLHMREAIAAGKPITLNNDVMNATSLVRDATAVAPEVARELQRRHAQTLQNNAVEAVGAENVREGSAPWSILGPLTAIGPTAEDTMVWLQQHYGDVDVMSIMRWARFRKGKDLMAKDRPQYLARAAAWIELLSKRNREMEEVPLTYMRKSKPLVLDTNSNVRVAWQTPLMRSGGPPRLLAREEGYVFHHSNSSRFVEETYRHPGESLQSRYLALQPLHTEVSAKEDFQRLYYQAQKHHKQQERLLPVSTAAIPSRIVHHSVMSALHGVSGKGEPANRSLFTXNKSNDSHSNTGVASGTSACTDGAGSATPEF 666 T 0.0009 PPR_long pdbhh F Eukaryota T 6hiz 9 I DJ Q584U8_TRYB2 mS57 MFRSTRPRSVGYTPVNPDTSPMVAYSQYHWHYNLPQGMERPHSVNRTFAAPFQSNHSLVNKYRGVWIEFDMHPAFSVALEPQLRKLPRGRTLPKTPAEEVIADYTALAPLVDDEKTRDLWLAKVFQHCAFQRCGGAMELWERYCHQRFTAEGATAKPPLSLVKSVLFYCNKTDNSGWRALFDRCLKDGWNYTPLFDTAQWSFMLKSIGRMGDEDGVRAVLEEMLDVQADLDRVEARSVVIALNAVTNADVYEFVKKYLFNFGERKVKFLRTTYSDLRGHGAGKLRIPLKENDNMYYHVCWHSSIRSPRQFSPRQLYFDYTPSTLGSSSHNPNAKIDDIVKDKIEKWKAEGLLPEDYVHEDRVYDRSAAFKNVARQEKWKKMPKILKSKRMGYTGDP 396 T 1.1 RPM2 pdbhh F Eukaryota T 6hiz 12 L DV Q57UZ6_TRYB2 mS69 MRRVRDGLFLSHPSSALQCSVAVVSTGEFDHPPFQFRQRHTFNTTPLHDANRFGGRTAYLREIGPVNIKKQGRRFKKDPRTVQFNVDVWCAQQTLRKRWKQRDWEVIEIPFRLVPREQQRVIPELYTDIPQMTDPARNDFSNIRNKVYDREELQGVLFPAAGAMLYPPLQRVDKQAMTLDKYL 183 T 0.13 CNRIP1 pdb F Eukaryota T 6hiz 13 M DW Q383N9_TRYB2 mS70 MLRFTHRALTATPERFSVLGTTHPKPKRTGFGRNNKMRSKPSDNVAWYDKGPVEWLPRPVRLTYDHLDQLQQWTMRATLDGRTEEFNRIRDLHREWSQHPLMPVLGDVEPKFPLNLFKQNHRAKKRFLVRWHKANTPANWLWMPRGPTVVTPLHRTNPTQYPENWKQMVQRKSGTGTPS 179 T 16 THDPS_M pdbhh F Eukaryota T 6hiz 14 N DX Q383G5_TRYB2 mS71 MFHRAFVSSSDLTGCTIALSSVCTQKRYWAKPKKRPKVGQGFHEKAQKWREEYLLDRHRALADSLRAYVEFSTSKRVEPWDARFKPFDRIEKDGVYVLMRYMMEEKLQLCNYHHRPVKRLFCNIGLMGPQITTKARWKPYRFATNPAGTSKAERMYQRDKTVYTHGHND 169 T 0.18 Tox-MPTase5 pdbpercent F Eukaryota T 6hiz 15 O DY Q57YD4_TRYB2 mS72 MLRSTLTSLNTFLTSSVATPPISVIRTGPKWWADPERMVRQKLMYFTLGVDQLPLRRTAVIQRDLQRFHMCKPPPRVGDSTGYKRSRAAQLNTWYRRIQYQEYHMQHLFTRHVWGLLRVYPGNTTKIQGKADDGYVGYDSVPFHRYNRAPLPFPARELYERRK 163 T 37 MBDa pdbhh F Eukaryota T 6hiz 16 P CC uS3m MFLIHFVHYKTILQKYTFKFKHIFLSIDKYNSLFFNISGILIWLNIIHINIILIKYSFFILINNFEYLIILIST 74 T 0.022 Prophage_tail unppercent F T 6hiz 20 T CN Q580I0_TRYB2 uS14m MFCFSRGLWMRHIGQDVPKRHTHFVLESRLMYEKSFRDCWLHSVCRAISQLDEPLSKTVVGTHQKMLQRKVTCFQYNQYGLFKTPYYRLANVDRYHAVQGVAGTREWVPYVNVSYWTMNKMVRGGNLLVHRVHYTGWGTDSHLKKGGWEHRWNKVLQRNVLQYSRI 166 T 12 DUF6462 pdbhh F Eukaryota T 6hiz 22 V CS Q584T8_TRYB2 uS19m MAFRNTFTTPGKFSTVSKNIVLLLIWRVKVFLRAEGFAHSLVMLPVSLYSKILLCDVKKKIVYFHCCTRKKSMLRRCPCVWPDGPTKSSVSIGTAFTLQKRFLKIAKSAFGFYLARRGQRKYPFLRRPHIKNTHSMNPSAPYFWSFMTAKSQMAFLPEENYITGDWTGKFFVSKRQVYTLQHATSGAKVRVKSFPSIFEFNSPSRWNIGKEMNTLTKPRMDLIDEQMLTKKQRLDYVRAGLLPK 244 T 4.9 LIN52 unphh F Eukaryota T 6hiz 24 X Ci Q57WW0_TRYB2 mS33 MVLRRWFPLLGVHRVGYTHPSTLPVPCAQRWDLRLARARIFQEYIEEKAPGAWQLEDERSMSPEFKTFTGYPMRDMRPGYGQNLPDYIMKKRLPNNTHYELFARRDIPNEDNAMYGKYLYDMTVHGTSLPSTYRMHKDINKAQRNDRKLSGNRFKVLCSSGAKNPPSGWEPIPDATEEEED 181 T 1.1 Nmad3 pdbhh F Eukaryota T 6hiz 27 AA UO Unknown protein XXXXX 5 F F F 6hiz 28 BA UP Unknown protein XXXXXXX 7 F F F 6hk3 2 C N GLY-SER-HIS-GLY-HIS-HIS-HIS-HIS GSHGHHHH 8 T 10 Nrf1_activ_bdg pdbhh F F 6hk4 2 C N GLY-SER-HIS-GLY-HIS-HIS-HIS-HIS-HIS GSHGHHHHH 9 T 7100 zf_CCCH_4 pdbhh F F 6hk5 1 A,B,C,D,E,F,G,H B,C,E,G,A,D,F,H P72321_RHORU CooJ MTESPERGRKRLGIYLAHFLDHVEGHMGEIGVQRDALAEDARLGALIDRALADMAVARASLNAVLRDL 68 T 2.4 DUF1569 pdbhh F Bacteria T 6hks 2 G,H,I,J,K,L G,H,I,J,K,L VE6_HPV16 Protein E6 RSSRTRRETQL 11 T 0.34 FpoO unphh T Viruses T 6hlb 2 B B PHE-(ALN)-ARG-ARG-ARG-ARG-SLL-ARG-00S FXRRRRXRX 9 T 4 Carla_C4 pdbhh F F 6hld 2 B B ALN-ARG-ARG-ARG-SLL-LYS-00S XRRRXKX 7 T 110 DUF5506 pdbhh F F 6hle 2 B B LYS-ARG-ARG-TBG-LYS-00S KRRXKX 6 T 170 CsiV pdbhh F F 6hm4 2 B B MDB1_SCHPO BRCT DOMAIN PROTEIN MDB1,MIDZONE AND DNA BREAK-LOCALIZING PROTEIN 1 GVMTVPNTPQKPNLQ 15 T 22 TPX2_importin pdbhh F Eukaryota T 6hm5 2 B B RAD9A_HUMAN HRAD9,DNA REPAIR EXONUCLEASE RAD9 HOMOLOG A SPVLAEDSEGE 11 T 9.3 Ham1p_like pdbhh F Eukaryota T 6hmz 2 B A Cyclosporin XXXXVXAXXXX 11 T 4.7 DUF6479 pdbhh F F 6hn9 1 A A A0A3G2WH77_9ANNE Nicomicin-1 GFWSSVWDGAKNVGTAIIKNAKVCVYAVCVSHK 33 T 2 MCPVI pdbhh F Eukaryota T 6hne 1 A A GLY-LEU-PHE-ASP-ILE-VAL-LYS-LYS-VAL-LEU-LYS-LEU-LEU-LYS-NHE GLFDIVKKVLKLLKX 15 T 0.00042 Antimicrobial20 pdbhh F T 6hng 1 A A LYS-LEU-LEU-LYS-LEU-LEU-LYS-LYS-LEU-LEU-LYS-LEU-LEU-LYS-NHE KLLKLLKKLLKLLKX 15 T 7.4 ESM4 pdbhh F F 6hnh 1 A A LYS-LEU-LEU-LYS-LEU-LEU-LYS-LYS-VAL-VAL-GLY-ALA-LEU-GLY-NHE KLLKLLKKVVGALGX 15 T 1.9 Antimicrobial20 pdbhh F T 6hoi 2 C,D F,G BECN1_HUMAN COILED-COIL MYOSIN-LIKE BCL2-INTERACTING PROTEIN,PROTEIN GT197 SANSFTLIGE 10 T 0.53 Stm1_N pdbhh F Eukaryota T 6hol 2 C,D C,D BAKOR_HUMAN BARKOR,AUTOPHAGY-RELATED PROTEIN 14-LIKE PROTEIN,ATG14L TDLGTDWENLPSPRF 15 T 0.4 Rop-like pdbhh F Eukaryota T 6hom 2 B,D B,D M9PCL9_DROME CCR4-NOT transcription complex subunit 4, isoform L DDDLGFDPFVETQKGLAELMENEVVQ 26 T 0.69 DUF6021 pdbhh F Eukaryota T 6hon 2 B,D B,D M9PCL9_DROME CCR4-NOT transcription complex subunit 4, isoform L DDDLGFDPFVETQKGLAELMENEVVQ 26 T 0.69 DUF6021 pdbhh F Eukaryota T 6hos 2 C C Expression tag from chain B, or symmetry related chain MGSSHHHHHHS 11 T 7700 zf_CCCH_4 pdbhh F F 6hp5 2 C,D C,D GLY-MET-PRO-ARG-GLY-ALA GMPRGA 6 T 2.4 BCD pdbhh F F 6hpg 2 G,H,I,J,K,L a,b,c,d,e,f HS904_ARATH ATHSP90-4,HEAT SHOCK PROTEIN 81-4,HSP81-4 GSKMEEVD 8 T 8 TMEM191C pdbhh F Eukaryota T 6hq6 1 A,B A,B Bacterial beta-1,3-oligosaccharide phosphorylase MGSSHHHHHHSSGLVPAGSMSQSPNTLANEETTSIDKSITMDMVSMNGEMFYKIANNDAMRPFFMTIVSDSNHWMFVSSNGGLTAGRKNAEYALFPYYTDDKITESADITGSKSIFQIQYNNELIVWEPFSERFTNKFKITRNLYKNYYGNKIIFEEINEDLGLTYRYQWCSSNQFGFVRKSELSNHSKNVYEISLLDGIQNIMPYGVSSDLQSSTSNLVDAYKRSELHPKSGLGIFALSAIIVDKAEPSEALKANIAWSLGLNNPKYLVSSLQLNHFRNGKSISPEDDIKGEKGAYFLNTVMTLEANTQKEWMIIANVNQDHSDIIAITETIQNNKKIAEDINTDIELGTKRLIELNASSDALQLTADNLRDTRHFSNTLFNIMRGGIFDNNYQIEKGDFSNYIKKANKLVFDKIDLNALGEIFSLNDLNEFASKQKDVDFDRLALEYLPLKFSRRHGDPSRPWNKFSINTQSEIDGSKVLDYEGNWRDIFQNWEALAHSFPNFIDSMIHKFLNASTFDGYNPYRVTKEGFDWETIEEDDPWSYIGYWGDHQIIYLLKFLEFIEKHQPGKLHSYFESECFVYAAVPYTIKPYEEILNNPKDTIGYNHEWEKVINERKKSIGADGALLKSNDKSIYHVNFIEKILATVLAKMSNFIPEAGIWLNTQRPEWNDANNALVGNGVSMVTLYYLRRFLKFFDQLLENSTLENIKISNEMVEFYHKVRETLMENQHLLAGSISDTDRKVILDKLGNAAADYRFQIYNSGFWGKKRTHSMQGLKNFTKVSLQFIDHSIKANQRPDKLYHAYNLMSVEKNKEIAISYLSEMLEGQVAVLSSGFLSSKENLAVLDGLKNSALFREDQYSYLLYPNKELPKFLDKNTISKEAVSKSELLSLLVSKSNKQVIEKDSIGEYHFNGEFNNASNLKQALEDLSQQNEYKDLVAKESKTVEAIFEDVFNHKAFTGRSGTFYGYEGLGSIYWHMVSKLQLAVLECCLKAVEEKESEEVIGRLLEHYYEINEGIGVHKSPSLYGAFPTDAYSHTPAGKGAQQPGMTGQVKEDILSRFGELGIFVKNGCLELNPCLLRKDEFLKEAKTFDYVTVNFQHQSLELVEKSLAFTYCQIPIIYKIANQKCIEVFTNDGKSAKAASLILDKQTSQDVFGRTGIINKIEVSILESDLR 1175 T 0.18 Glucodextran_N pdbhh F T 6hq8 1 A,B A,B Beta-1,3-oligosaccharide phosphorylase MGSSHHHHHHSSGLVPAGSMSQSPNTLANEETTSIDKSITMDMVSMNGEMFYKIANNDAMRPFFMTIVSDSNHWMFVSSNGGLTAGRKNAEYALFPYYTDDKITESADITGSKSIFQIQYNNELIVWEPFSERFTNKFKITRNLYKNYYGNKIIFEEINEDLGLTYRYQWCSSNQFGFVRKSELSNHSKNVYEISLLDGIQNIMPYGVSSDLQSSTSNLVDAYKRSELHPKSGLGIFALSAIIVDKAEPSEALKANIAWSLGLNNPKYLVSSLQLNHFRNGKSISPEDDIKGEKGAYFLNTVMTLEANTQKEWMIIANVNQDHSDIIAITETIQNNKKIAEDINTDIELGTKRLIELNASSDALQLTADNLRDTRHFSNTLFNIMRGGIFDNNYQIEKGDFSNYIKKANKLVFDKIDLNALGEIFSLNDLNEFASKQKDVDFDRLALEYLPLKFSRRHGDPSRPWNKFSINTQSEIDGSKVLDYEGNWRDIFQNWEALAHSFPNFIDSMIHKFLNASTFDGYNPYRVTKEGFDWETIEEDDPWSYIGYWGDHQIIYLLKFLEFIEKHQPGKLHSYFESECFVYAAVPYTIKPYEEILNNPKDTIGYNHEWEKVINERKKSIGADGALLKSNDKSIYHVNFIEKILATVLAKMSNFIPEAGIWLNTQRPEWNDANNALVGNGVSMVTLYYLRRFLKFFDQLLENSTLENIKISNEMVEFYHKVRETLMENQHLLAGSISDTDRKVILDKLGNAAADYRFQIYNSGFWGKKRTHSMQGLKNFTKVSLQFIDHSIKANQRPDKLYHAYNLMSVEKNKEIAISYLSEMLEGQVAVLSSGFLSSKENLAVLDGLKNSALFREDQYSYLLYPNKELPKFLDKNTISKEAVSKSELLSLLVSKSNKQVIEKDSIGEYHFNGEFNNASNLKQALEDLSQQNEYKDLVAKESKTVEAIFEDVFNHKAFTGRSGTFYGYEGLGSIYWHMVSKLQLAVLECCLKAVEEKESEEVIGRLLEHYYEINEGIGVHKSPSLYGAFPTDAYSHTPAGKGAQQPGMTGQVKEDILSRFGELGIFVKNGCLELNPCLLRKDEFLKEAKTFDYVTVNFQHQSLELVEKSLAFTYCQIPIIYKIANQKCIEVFTNDGKSAKAASLILDKQTSQDVFGRTGIINKIEVSILESDLR 1175 T 0.18 Glucodextran_N pdbhh F T 6hqa 5 G G Taf8 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 75 F F F 6hqa 6 H I Histone-fold XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 64 F F F 6hqa 7 I H Histone-fold XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 68 F F F 6hqc 1 A A TAPA_BACSU BIOFILM ASSEMBLY ACCESSORY PROTEIN TAPA DQSDLHISDQTDTKGTVCSPFALFAVLENTGEKLKKSKWKWELHKLENARKPLKDGNVIEKGFVSNQIGDSLYKIETKKKMKPGIYAFKVYKPAGYPANGSTFEWSEPMRLAKCDE 116 T 0.002 Herpes_PAP unp F Bacteria T 6hqe 1 A,AA,B,BA,C,CA,D,DA,E,EA,F,FA,G,GA,H,HA,I,IA,J,JA,K,KA,L,LA,M,MA,N,NA,O,OA,P,PA,Q,QA,R,RA,S,SA,T,TA,U,UA,V,VA,W,WA,X,XA,Y,YA,Z,ZA A,a,B,b,C,c,D,d,E,e,F,f,G,g,H,h,I,i,J,j,K,k,L,l,M,m,N,n,O,o,P,p,Q,q,R,r,S,s,T,t,U,u,V,v,W,w,X,x,Y,y,Z,z peptide LRV_M3delta1 PEALERLAADPDREVRAAVARRL 23 T 0.017 LRV pdb F T 6hrr 1 A,B A,B MCLN2_HUMAN TRANSIENT RECEPTOR POTENTIAL CHANNEL MUCOLIPIN 2,TRPML2 AFKEDNTVAFKHLFLKGYSGTDEDDYSCSVYTQEDAYESIFFAINQYHQLKDITLGTLGYGENEDNRIGLKVCKQHYKKGTMFPSNETLNIDNDVELDCVQLDLQDLSKKPPDWKNSSFFRLEFYRLLQVEISFHLKGIDLQTIHSRELPDCYVFQNTIIFDNKAHSGKIKIYFDSDAKIEECKDLNIFGS 191 T 0.79 Baseplate pdb F Eukaryota T 6hrs 1 A,B,C,D,E,F,G,H A,B,C,D,E,F,G,H MCLN2_HUMAN TRANSIENT RECEPTOR POTENTIAL CHANNEL MUCOLIPIN 2,TRPML2 SNQLVVAFKEDNTVAFKHLFLKGYSGTDEDDYSCSVYTQEDAYESIFFAINQYHQLKDITLGTLGYGENEDNRIGLKVCKQHYKKGTMFPSNETLNIDNDVELDCVQLDLQDLSKKPPDWKNSSFFRLEFYRLLQVEISFHLKGIDLQTIHSRELPDCYVFQNTIIFDNKAHSGKIKIYFDSDAKIEECKDLNIFGSTQ 199 T 0.8 Baseplate pdb F Eukaryota T 6hsn 2 B,C D,E GBRA3_RAT GABA(A) RECEPTOR SUBUNIT ALPHA-3 FNIVGTTYPC 10 T 0.97 DUF749 pdbhh F Eukaryota T 6hso 2 B,C D,I Glycine receptor beta subunit derived peptide FSIVGSLPRDC 11 T 0.35 MucB_RseB_C pdbhh F T 6hu9 22 FA,RA l,x YD19A_YEAST Cox26 MFFSQVLRSSARAAPIKRYTGGRIGESWVITEGRRLIPEIFQWSAVLSVCLGWPGAVYFFSKARKA 66 T 0.077 Phage_holin_Dp1 unppercent F Eukaryota T 6hua 2 C,D C,D XIP signaling peptide VPFFMIYY 8 T 2 Toxin_10 pdbhh F T 6hum 8 H P Proton-translocating NADH-quinone dehydrogenase subunit P NdhP AVISVKPILLAMTPVFILLCLFFGTRNGFYDTDQYHGNAAAH 42 T 0.13 MLANA pdbpssm F T 6hum 18 R Q Proton-translocating NADH-quinone dehydrogenase subunit Q NdhQ ATDFRAIMKFDGADSPAMIAISAVLILGFIAGLIWWALH 39 T 1.4 Rax2 pdbhh F T 6hv6 1 A A PATOX_PHOAA PHOTORHABDUS ASYMBIOTICA TOXIN,PATOX MGSSHHHHHHSSGLVPRGSHMASMTGGQQMGRGSKDTAWEFHTDVGLKGGGLKDFIDRFTKEPKEITISGYKFKRIKYNQENFDTMQRMALDYAYNPDSKGKIAQAQQAYKTGKEDYNAPQYDNFNGLSLDKKIERYISPDTDATTKGVLAGKMNESIKDINAFQTAKDAQSWKKSANKANKVVLTPQNLYLKGKPSEALPESVLMGWALQSSQDAKLSKMLMGIYSSNDITSNPLYKSLKELHANGNASKFNASATSISNINVSNLATSETKLFPTEISSVRVDAPKHTMLISKIKNRENKIKYVFYDPNYGMAYFDKHSDMAAFFQKKMQQYDFPDDSVSFHPLDYSNVSDIKISGRNLNEIIDGEIPLLYKQEGVQLEGITPRDGIYRVPPKNTLGVQETKHYIIVNNDIYQVEWDQTNNTWRVFDPSNTNRSRPTVPVKQDTNGVDKLAAALEHHHHHH 463 T 0.00014 Peptidase_C58 pdb F Bacteria T 6hvo 2 D,E,F D,F,E DPOD4_HUMAN DNA POLYMERASE DELTA SUBUNIT P12 MGRKRLITDSYPVVKRREG 19 T 0.012 Adeno_terminal unppssm F Eukaryota T 6hwh 2 B,O G,C Co-purified unknown transmembrane helices built as polyALA XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 74 F F F 6hwh 3 C,P H,D Co-purified unknown transmembrane helices built as polyALA XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 65 F F F 6hwh 4 D,Q I,E Co-purified unknown peptide built as polyALA XXXXXXXXXXXXXXXXXXXX 20 F F F 6hwh 5 E,R J,F Co-purified unknown peptide built as polyALA XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 35 F F F 6hwh 8 H,U R,N A0R1B6_MYCS2 MSMEG_4693 MSTALTHGLIGGVPLVLFAVLALIFLTRKGPHPDTYKMSDPWTHAPILWAAEEPREHGHGGHGHDSHGVVIGGGASGKW 79 T 0.08 VPDSG-CTERM pdbpssm F Bacteria T 6hwn 2 B,D,F,H,J,L,N H,I,J,K,L,M,N Unknown tripeptide XXX 3 F F F 6hxt 1 A,B,C A,B,C CCD61_HUMAN Coiled-coil domain-containing protein 61 GSMDQPAGLQVDYVFRGVEHAVRVMVSGQVLELEVEDRMTADQWRGEFDAGFIEDLTHKTGNFKQFNIFCHMLESALTQSSESVTLDLLTYTDLESLRNRKMGGRPGSLAPRSAQLNSKRYLILIYSVEFDRIHYPLPLPYQGKP 145 T 0.01 XRCC4 unphh F Eukaryota T 6hxv 1 A,B A,B CCD61_DANRE Coiled-coil domain-containing protein 61 GPHMEVGTVVQEEMKFRGSEFAVKVEMAERLLIVEISDVVTADQWRGEFGPAYIEDLTRKTGNFKQFPVFCSMLESAVHKSSDSVTLDLLTYSDLELLRNRKAGVVGRPRAQPQSPALSAKRYLILIYTVEEARIHYPLPLPYLGKPDPAELQKEIRALRSELKTLGLRGD 171 T 0.00045 XRCC4 unphh F Eukaryota T 6hxy 1 A,B A,B CCD61_DANRE Coiled-coil domain-containing protein 61 GGSMEVGTVVQEEMKFRGSEFAVKVEMAERLLIVEISDVVTADQWRGEFGPAYIEDLTRKTGNFKQFPVFCSMLESAVHKSSDSVTLDLLTYSDLELLRNRKAGVVGRPRAQPQSPALSAKRYLILIYTVEFDRIHYPLPLPYLGKPDPAELQKEIRALRSELKTLGLRGDHK 173 T 0.00045 XRCC4 unphh F Eukaryota T 6hy0 1 A,B A,B P1_BPPH6 MAJOR INNER CAPSID PROTEIN P1 MFNLKVKDLNGSARGLTQAFAIGELKNQLSVGALQLPLQFTRTFSASMTSELLWEVGKGNIDPVMYARLFFQYAQAGGALSVDELVNQFTEYHQSTACNPEIWRKLTAYITGSSNRAIKADAVGKVPPTAILEQLRTLAPSEHELFHHITTDFVCHVLSPLGFILPDAAYVYRVGRTATYPNFYALVDCVRASDLRRMLTALSSVDSKMLQATFKAKGALAPALISQHLANAATTAFERSRGNFDANAVVSSVLTILGRLWSPSTPKELDPSARLRNTNGIDQLRSNLALFIAYQDMVKQRGRAEVIFSDEELSSTIIPWFIEAMSEVSPFKLRPINETTSYIGQTSAIDHMGQPSHVVVYEDWQFAKEITAFTPVKLANNSNQRFLDVEPGISDRMSATLAPIGNTFAVSAFVKNRTAVYEAVSQRGTVNSNGAEMTLGFPSVVERDYALDRDPMVAIAALRTGIVDESLEARASNDLKRSMFNYYAAVMHYAVAHNPEVVVSEHQGVAAEQGSLYLVWNVRTELRIPVGYNAIEGGSIRTPEPLEAIAYNKPIQPSEVLQAKVLDLANHTTSIHIWPWHEASTEFAYEDAYSVTIRNKRYTAEVKEFELLGLGQRRERVRILKPTVAHAIIQMWYSWFVEDDRTLAAARRTSRDDAEKLAIDGRRMQNAVTLLRKIEMIGTTGIGASAVHLAQSRIVDQMAGRGLIDDSSDLHVGINRHRIRIWAGLAVLQMMGLLSRSEAEALTKVLGDSNALGMVVATTDIDPSL 769 T 0.22 STAG pdb T Viruses T 6hy0 3 D,E,F,G,H,I,J,K,L,M D,E,F,G,H,I,J,K,L,M CAPSD_BPPH6 Major Outer Capsid Protein P8 MLLPVVARAAVPAIESAIAATPGLVSRIAAAIGSKVSPSAILAAVKSNPVVAGLTLAQIGSTGYDAYQQLLENHPEVAEMLKDLSFKADEIQPDFIGNLGQYREELELVEDAARFVGGMSNLIRLRQALELDIKYYGLKMQLNDMGYRS 149 T 2.7 DnaI_N pdbhh T Viruses T 6hy2 2 B A TRP-MET-LEU-ASP-PRO-ILE-ALA-GLY-LYS-TRP-SER-ARG WMLDPIAGKWSR 12 T 0.055 FBPase pdbhh F T 6hyd 1 A A MDN1_YEAST DYNEIN-RELATED AAA-ATPASE REA1,MIDAS-CONTAINING PROTEIN,RIBOSOME EXPORT/ASSEMBLY PROTEIN 1,DYNEIN-RELATED AAA-ATPASE REA1,MIDAS-CONTAINING PROTEIN,RIBOSOME EXPORT/ASSEMBLY PROTEIN 1,DYNEIN-RELATED AAA-ATPASE REA1,MIDAS-CONTAINING PROTEIN,RIBOSOME EXPORT/ASSEMBLY PROTEIN 1 PIEESLAAVIPISHLGEVGKWANNVLNCTEYSEKKIAERLYVFITFLTDMGVLEKINNLYKPANLKFQKALGLHDKQLTEETVSLTLNEYVLPTVSKYSDKIKSPESLYLLSSLRLLLNSLNALKLINEKSTHGKIDELTYIELSAAAFNGRHLKNIPRIPIFCILYNILTVMSENLKTESLFCGSNQYQYYWDLLVIVIAALETAVTKDEARLRVYKELIDSWIASVKSKSDIEITPFLNINLEFTDVLQLSRGHSITLLWDIFRKNYPTTSNSWLAFEKLINLSEKFDKVRLLQFSESYNSIKDLMDVFRLLNDDVLNNKLSEFNLLLSKLEDGINELELISNKFLNKRKHYFADEFDNLIRYTFSVDTAELIKELAPASSLATQKLTKLITNKYNYPPIFDVLWTEKNAKLTSFTSTIFSSQFLEDVVRKSNNLKSFSGNQIKQSISDAELLLSSTIKCSPNLLKSQMEYYKNMLLSWLRKVIDIHVGGDCLKLTLKELCSLIEEKTASETRVTFAEYIFPALDLAESSKSLEELGEAWITFGTGLLLLFVPDSPYDPAIHDYVLYDLFLKTKTFSQNLMKSWRNVRKVISGDEEIFTEKLINTISDDDAPQSPRVYRTGMSIDSLFDEWMAFLSSTMSSRQIKELVSSYKCNSDQSDRRLEMLQQNSAHFLNRLESGYSKFADLNDILAGYIYSINFGFDLLKLQKSKDRASFQISPLWSMDPINISCAENVLSAYHELSRFFKKGDMEDTSIEKVLMYFLTLFKFHKRDTNLLEIFEAALYTLYSRWSVRRFRQEQEENEKSNMFKFNDNSDDYEADFRKLFPDYEDTALVTNEKDISSPENLDDIYFKLADTYISVFDKDHDANFSSELKSGAIITTILSEDLKNTRIEELKSGSLSAVINTLDAETQSFKNTEVFGNIDFYHDFSIPEFQKAGDIIETVLKSVLKLLKQWPEHATLKELYRVSQEFLNYPIKTPLARQLQKIEQIYTYLAEWEKYASSEVSLNNTVKLITDLIVSWRKLELRTWKGLFNSEDAKTRKSIGKWWFYLYESIVISNFVSEKKETAPNATLLVSSLNLFFSKSTLGEFNARLDLVKAFYKHIQLIGLRSSKIAGLLHNTIKFYYQFKPLIDERITNGKKSLEKEIDDIILLASWKDVNVDALKQSSRKSHNNLYKIVRKYRDLLNGDAKTIIEAGLLYXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXRNIDTVASNMDSYLEKISSQEFPNFADLASDFYAEAERLRKETPNVYTKENKKRLAYLKTQKSKLLGDALKELRRIGLKVNFREDIQKVQSSTTTILANIAPFNNEYLNSSDAFFFKILDLLPKLRSAASNPSDDIPVAAIERGMALAQSLMFSLITVRHPLSEFTNDYCKINGMMLDLEHFTCLKGDIVHSSLKANVDNVRLFEKWLPSLLDYAAQTLSVISKYSATSEQQKILLDAKSTLSSFFVHFNSSRIFDSSFIESYSRFELFINELLKKLENAKETGNAFVFDIIIEWIKANKGGPIKKEQKRGPSVEDVEQAFRRTFTSIILSFQKVIGDGIESISETDDNWLSASFKKVMVNVKLLRSSVVSKNIETALSLLKDFDFTTTESIYVKSVISFTLPVITRYYNAMTVVLERSRIYYTNTSRGMYILSTILHSLAKN 1676 T 0.0088 SseC pdbpssm F Eukaryota T 6hza 2 B B ARG-ARG-LYS-ARG-00S RRKRX 5 T 100 Tub_N pdbhh F F 6hzb 2 B B ARG-ARG-LYS-LYS-00S RRKKX 5 T 180 DUF6142 pdbhh F F 6hzc 2 B B LYS-ARG-ARG-TBG-LYS-00S KRRXKX 6 T 170 CsiV pdbhh F F 6hzd 2 B B ARG-ARG-ARG-LYS-ARG-00S RRRKRX 6 T 34 DUF3155 pdbhh F F 6i1j 1 A A A helical peptide containing a trinuclear Cu(II) center: HisAD GEIAAIKQEIAAHKKEHAAIKWEIAAIKQGYG 32 T 0.42 DUF5320 pdbhh F T 6i1m 1 A A A0A2H1BUS1_FASHE Cystatin VGGYTEPRSVTPEERSVFQPMILSKLLTAGSVVSSCELELLQVSTQVVAGTNYKFKVSGGATCPGCWEVVVFVPLYSSKSATSVGTPTRVSCT 93 T 2.1E-05 Cystatin pdbhh F Eukaryota T 6i2g 2 B B N7P-SER-ARG-LEU-GLU-GLU-GLU-LEU-ARG-ARG-ARG-LEU-THR-GLU-LPD XSRLEEELRRRLTEX 15 T 5.2 TF_AP-2 pdbhh F T 6i2p 3 E C UNK-UNK-UNK-UNK-UNK XXXXX 5 F F F 6i2t 2 E J lamellipodin-derived polyproline peptide PPPPPPPPPPPP 12 T 24 Orbi_NS3 pdbhh F F 6i31 1 A,B A,B EVA3_RHISA Evasin-3 LVSTIESRTSGDGADNFDVVSCNKNCTSGQNECPEGCFCGLLGQNKKGHCYKIIGNLSGEPPVVRR 66 T 0.059 Toxin_11 unppercent F Eukaryota T 6i41 2 B B BRD3_HUMAN RING3-LIKE PROTEIN KADTTTPTT 9 T 67 MDM1 pdbhh F Eukaryota F 6i4x 4 D D Erythropoietin receptor ASFEXTILDPS 11 T 7.1 SmpB pdbhh F T 6i56 1 A,B,C,D,E D,A,C,B,E XEPA_BACSU PROTEIN XKDY MVKYQYEFPLDKAGKAGAVKPYRGGKNDFVTPVSNLSGVAEILTNAALKATEAYSQLGQDRLGAVLISKVKGWAYADREGTLFIEESDNNNVWTTTAAVNVAAGVLTATDWVYLSKRYYRFRYVNGNLQQSEFVLYQSVGAGEMDVRVNEKTPLQIDFAENQTHDGRLKVEARKTFDFVFHENAESASEGAALPVDGAAHLLVEVYGTAEMSEVKFWGKSVSGQKLPIRGVKTDDATTASSTLGKAEAWAFDIKGFKEIIMEIISITGGTLSVKGTAVS 279 T 0.72 DUF6385 pdbhh F Bacteria T 6i5j 5 G,H,I,J I,J,K,L Growth hormone receptor peptide PVPDXTSIHIX 11 T 0.00021 GHBP pdbhh F T 6i5n 4 G,H,I,J J,K,I,L Growth hormone receptor peptide PVPDXTSIHIX 11 T 0.00021 GHBP pdbhh F T 6i5o 1 A,B,C,D,E A,B,C,D,E YOMS_BACSU SPBc2 prophage-derived uncharacterized protein YomS MTETTENVVITIPDKTSFTFHEAATSPSEGEEFVVGHFRELTVKISGSSTSREIKFYAVDENGEKTALSGTNKTDFQLGSSTLNTNEYWDFDIAGLFKVMFEVVSVTGDVTVKGIVVS 118 T 0.84 DUF4251 pdbhh F Bacteria T 6i5p 2 B,D,F,H B,D,F,H BRD3_HUMAN RING3-LIKE PROTEIN KADTTTPTT 9 T 67 MDM1 pdbhh F Eukaryota F 6i68 2 B,D,F,H B,D,F,H BRD3_HUMAN RING3-LIKE PROTEIN KADTTTPTT 9 T 67 MDM1 pdbhh F Eukaryota F 6i6h 2 B B AEKDEL peptide AEKDEL 6 T 160 DUF6442 pdbhh F F 6i7a 2 B,D,F,H B,D,F,H BRD3_HUMAN RING3-LIKE PROTEIN KADTTTPTT 9 T 67 MDM1 pdbhh F Eukaryota F 6i9e 2 H,I,J,K,L,M,N H,I,J,K,L,M,N A7XXC1_9CAUD Auxiliary protein MDKVKLFQTIGRVEYWERVPRLHAYGVFALPFPMDPDVNWAQWFTGPHPRAFLVSIHKYGPKAGHVYPTNLTDEDALLNVIGMVLDGHDYENDPNVTVTLKAAVPIEYVQQDPQAPALQPHQAVLDAAEVLKLKVIKGHYFFDYTR 146 T 0.78 TRI9 pdbpssm T Viruses T 6i9r 53 AB C Quinupristin XTXPXXXX 8 T 290 zf-C2H2_jaz pdbhh F F 6ia5 1 A,B,C,D,E D,A,C,B,E XEPA_BACSU PROTEIN XKDY MVKYQYEFPLDKAGKAGAVKPYRGGKNDFVTPVSNLSGVAEILTNAALKATEAYSQLGQDRLGAVLISKVKGWAYADREGTLFIEESDNNNVWTTTAAVNVAAGVLTATDWVYLSKRYYRFRYVNGNLQQSEFVLYQSVGAGEMDVRVNEKTPLQIDFAENQTHDGRLKVEARKTFDFVFHENAESASEGAALPVDGAAHLLVEVYGTAEMSEVKFWGKSVSGQKLPIRGVKTDDATTASSTLGKAEAWAFDIKGFKEIIMEIISITGGTLSVKGTAVS 279 T 0.72 DUF6385 pdbhh F Bacteria T 6iac 2 B B Q859I5_9CAUD Lower collar protein MARYTMTLYDFIKSELIKKGFNEFVNDNKLTFYDDEFQFMQKMLKFDKDVLAIVNEKVFKGFSLKDELSDLLFKKSFTIHFLDREINRQTVEAFGMQVITVCITHEDYLNVVYSSSEVEKYLQSQGFTEHNEDTTSNTDETSNQNATSLDNSTGMTANRNAYVSLPQSEVNIDVDNTTLRFADNNTIDNGKTVNKSSNESNQNAKRNQNQKGNAKGTQFTKQYLIDNIDKAYDLRKKILNEFDKKCFLQIW 251 T 0.12 AKAP95 pdb T Viruses T 6iac 4 F,G,H,I,J,K G,H,I,J,K,L Q859I1_9CAUD inner core protein MTEFDEIVKPDDKEETSESTEENLESTEETSESTEESTEESTEESTEDKTVETIEEENENKLEPTTTDEDSSKFDPVVLEQRIASLEQQVTTFLSSQMQQPQQVQQTQSDVTESNKEDNDYSDEELVDKLDLD 133 T 0.0095 TolA_bind_tri pdb T Viruses T 6iai 1 A,B,C,D A,B,C,D Q8Z7T2_SALTI STOD MGSSHHHHHHSSGLVPRGSHMFLTFPNVAITRDNRIDKLSENDLELIRDTAIQNGGRKIQVQLRDLLYEVSNRAVEGDNNTFKVSFSTTDRAMFRERHIEWQGNAIRLERQLNTGLNVSRG 121 T 0.029 Calici_MSP pdb F Bacteria T 6iam 2 B B SER-ALA-ARG-ALA-XY5-VAL-HIS-LEU-ARG-LYS-SER-ALA SARAXVHLRKSA 12 T 22 Peptidase_S31 pdbhh F T 6iam 3 C C SUMO5_HUMAN SUMO-5,SUMO1 PSEUDOGENE 1,UBIQUITIN-LIKE 2,UBIQUITIN-LIKE 6 EAKP 4 T 350 MDH pdbhh F Eukaryota F 6iat 1 A,B,C,D C,A,B,D Q859I3_9CAUD Major head protein MAQQSTKNETALLVAKSAKSALQDFNHDYSKSWTFGDKWDNSNTMFETFVNKYLFPKINETLLIDIALGNRFNWLAKEQDFIGQYSEEYVIMDTVPINMDLSKNEELMLKRNYPRMATKLYGNGIVKKQKFTLNNNDTRFNFQTLADATNYALGVYKKKISDINVLEEKEMRAMLVDYSLNQLSETNVRKATSKEDLASKVFEAILNLQNNSAKYNEVHRASGGAIGQYTTVSKLKDIVILTTDSLKSYLLDTKIANTFQIAGIDFTDHVISFDDLGGVFKVTKEFKLQNQDSIDFLRAYGDYQSQLGDTIPVGAVFTYDVSKLKEFTGNVEEIKPKSDLYAFILDINSIKYKRYTKGMLKPPFHNPEFDEVTHWIHYYSFKAISPFFNKILITDQDVNPKPEEELQE 408 T 12 ER pdbhh T Viruses T 6iat 2 E,F,G,H E,F,G,H Q859I2_9CAUD Arstotzka protein MYEGNNMRSMMGTSYEDSRLNKRTELNENMSIDTNKSEDSYGVQIHSLSKQSFTGDVEEE 60 T 0.048 DUF4958 pdb T Viruses T 6iaw 1 A,B,C,I,J,K A,B,C,M,N,O Q859I3_9CAUD Major head protein MAQQSTKNETALLVAKSAKSALQDFNHDYSKSWTFGDKWDNSNTMFETFVNKYLFPKINETLLIDIALGNRFNWLAKEQDFIGQYSEEYVIMDTVPINMDLSKNEELMLKRNYPRMATKLYGNGIVKKQKFTLNNNDTRFNFQTLADATNYALGVYKKKISDINVLEEKEMRAMLVDYSLNQLSETNVRKATSKEDLASKVFEAILNLQNNSAKYNEVHRASGGAIGQYTTVSKLKDIVILTTDSLKSYLLDTKIANTFQIAGIDFTDHVISFDDLGGVFKVTKEFKLQNQDSIDFLRAYGDYQSQLGDTIPVGAVFTYDVSKLKEFTGNVEEIKPKSDLYAFILDINSIKYKRYTKGMLKPPFHNPEFDEVTHWIHYYSFKAISPFFNKILITDQDVNPKPEEELQE 408 T 12 ER pdbhh T Viruses T 6iaw 2 D,E,F,G,L,M J,I,D,E,Q,S Q859I2_9CAUD Arstotzka protein MYEGNNMRSMMGTSYEDSRLNKRTELNENMSIDTNKSEDSYGVQIHSLSKQSFTGDVEEE 60 T 0.048 DUF4958 pdb T Viruses T 6iaw 3 H,N,R H,L,K Head fiber XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 67 F F F 6iaw 4 O,P,Q X,Y,Z inner core protein XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 41 F F F 6ib1 1 A,B,C,D C,D,B,A Q859I3_9CAUD Major head protein MAQQSTKNETALLVAKSAKSALQDFNHDYSKSWTFGDKWDNSNTMFETFVNKYLFPKINETLLIDIALGNRFNWLAKEQDFIGQYSEEYVIMDTVPINMDLSKNEELMLKRNYPRMATKLYGNGIVKKQKFTLNNNDTRFNFQTLADATNYALGVYKKKISDINVLEEKEMRAMLVDYSLNQLSETNVRKATSKEDLASKVFEAILNLQNNSAKYNEVHRASGGAIGQYTTVSKLKDIVILTTDSLKSYLLDTKIANTFQIAGIDFTDHVISFDDLGGVFKVTKEFKLQNQDSIDFLRAYGDYQSQLGDTIPVGAVFTYDVSKLKEFTGNVEEIKPKSDLYAFILDINSIKYKRYTKGMLKPPFHNPEFDEVTHWIHYYSFKAISPFFNKILITDQDVNPKPEEELQE 408 T 12 ER pdbhh T Viruses T 6ib1 2 E,F,G,H E,F,G,H Q859I2_9CAUD Uncharacterized protein MYEGNNMRSMMGTSYEDSRLNKRTELNENMSIDTNKSEDSYGVQIHSLSKQSFTGDVEEE 60 T 0.048 DUF4958 pdb T Viruses T 6ibh 1 A,B A,B A0A4P9I8G4_9AGAM Auxiliary activity CAZyme HFQLQWPGARGAFVANDEVYFCGAHNNVTTNRTDFPLDGSGFVSIKSGHAPYTVGAIISLETDADAWEDFKNSSGGDQIAIAYRQVDNSGTYCVPFNPSSLNIAGIQDGANATIQVVYTGGDGNLYQCADVTFRTTVANLNSSVCTNSTHHHHHH 155 T 0.64 Big_1 unp F Eukaryota T 6ibi 1 A,B,C,D A,B,C,D A0A4P9I8G4_9AGAM Auxiliary activity CAZyme HFQLQWPGARGAFVANDEVYFCGAHNNVTTNRTDFPLDGSGFVSIKSGHAPYTVGAIISLETDADAWEDFKNSSGGDQIAIAYRQVDNSGTYCVPFNPSSLNIAGIQDGANATIQVVYTGGDGNLYQCADVTFRTTVANLNSSVCTNSTHHHHHH 155 T 0.64 Big_1 unp F Eukaryota T 6ibj 1 A A A0A4P9I8G4_9AGAM Auxiliary activity CAZyme HFQLQWPGARGAFVANDEVYFCGAHNNVTTNRTDFPLDGSGFVSIKSGHAPYTVGAIISLETDADAWEDFKNSSGGDQIAIAYRQVDNSGTYCVPFNPSSLNIAGIQDGANATIQVVYTGGDGNLYQCADVTFRTTVANLNSSVCTNSTHHHHHH 155 T 0.64 Big_1 unp F Eukaryota T 6ibo 2 B C ALA-VAL-SER-ARG-ALA KKVAVSRAA 9 T 7.1 Viral_helicase1 pdbhh F F 6idx 2 B C AGRB1_MOUSE BRAIN-SPECIFIC ANGIOGENESIS INHIBITOR 1 RKSRYAELDFEKIMHTRKRHQDMFQ 25 T 7.7 YppG pdbhh F Eukaryota T 6ieh 2 B A NRDE2_HUMAN Protein NRDE2 homolog SFRTDKKPDPANWEYKSLYRGDIARYKRKGDSCLGINPKKQCISWEGTSTEKKHSRKQVERYFTKKSVGLMNIDGVAISSKTEPPSSEPISFIPVKDLEDAAPVT 105 T 0.043 DUF1283 pdbpercent F Eukaryota T 6ifg 2 C,D E,F Tripeptides (TYR-SER-ALA) YSA 3 T 230 zf-H2C2_2 pdbhh F F 6ifj 3 C,D C,D 13-mer peptide DCAWHLGELVWCT 13 T 6.1 FAT pdbhh F T 6ifo 3 E,F F,E AcrIIA2 MTLTRAQKKYAEAMHEFINMVDDFEESTPDFAKEVLHDSDYVVITKNEKYAVALCSLSTDECEYDTNLYLDEKLVDYSTVDVNGVTYYINIVETNDIDDLEIATDEDEMKSGNQEIILKSELK 123 T 0.13 DUF6376 pdb F T 6igm 5 I X unknown subunit XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 62 F F F 6iha 1 A A A0A0U2QEK1_9DIPT SibaCec-A KINKQKIKNGAKKALGVASKVAPVVAAFAR 30 T 1.3E-05 Cecropin unphh F Eukaryota T 6iht 2 B X His12 GGS 3 T 150 DUF3228 pdbhh F F 6ijo 22 W X ChainX AAAAAAAAAAAAAAAAAAAAAAAAAA 26 T 920 DUF4699 pdbhh F F 6ijq 1 A A P73_HUMAN P53-LIKE TRANSCRIPTION FACTOR,P53-RELATED PROTEIN DGGTTFEHLWSSLEPD 16 T 0.019 P53_TAD unphh F Eukaryota T 6ikg 2 E,F E,F MET-ALA-ALA MAA 3 T 420 Delta_lysin pdbhh F F 6ilc 3 C C HEV-1 DFANTFLP 8 T 0.0062 AAA_23 unppercent F T 6ile 3 C C HEV-1 DFANTFLP 8 T 0.0062 AAA_23 unppercent F T 6ilg 3 C C PHOSP_HENDH HEV-1-P8L DFANTFLL 8 T 0.0062 AAA_23 unppercent T Viruses T 6ilu 1 A,B A,B A0A218KCJ1_9CAUD Lysin GETAPVSEPEGIGVALSIYPDGYGVNLYERPSDPIYAGNITKKIPYKVFAGYWGGGDKDMICLGGEKQWAYNKHFTIDWYKVRSKYPVGWGVNFYDGPSGNFLGNIDGSEVYNAHNRVGGYVDIGGNRWIKEEHVTITAK 140 T 41 Mastoparan_2 pdbhh T Viruses T 6im4 2 C,D C,D GLY-MET-PRO-ARG-GLY-ALA GMPRGA 6 T 2.4 BCD pdbhh F F 6img 1 A A (ACE)-GLY-CYS-PRO-CYS-ILE-TRP-PRO-GLU-LEU-CYS-PRO-TRP-ILE-ARG-SER-CYS-(NH2) XGCPCIWPELCPWIRSCX 18 T 1.8 Toxin_26 pdbhh F T 6imh 1 A A (ACE)-GLY-CYS-PRO-CYS-GLU-PRO-SER-TYR-LEU-CYS-PRO-TRP-LEU-PRO-GLY-CYS-(NH2) XGCPCEPSYLCPWLPGCX 18 T 0.13 FOXP-CC pdbhh F T 6imu 1 A,B A,B A0A4V8H012_TALFU Endo-beta-1,2-glucanase AGIHHHHHHSSEPSCRFAHQYTQEQVLQNPSKFINDVLFWEGKFHQNNISYNSGNGMSYDGTNIDWVTGEGTVKHPFSAASKESLQVMLYAHAIAGSADAARFLSPNNPSAAPGIAASIMDTKLQTYLRFNETYPGFGGFLPWFTSSSQDLTPTWDWNNRVPGLDNGELLWAVYAFIQAAENTSNKSFIDLAKKWQTWMDYTKTTAAHIFYQGEGKVCAVTDIKNQSLPVYHPEQTYACEGTSYLNDPYEGELFTWWLQFFGGLSDADIEALWEYKRPQLVSVDYHIGNVGPITVQKGYWFSSHETWKVLEMPYYDIDIIRRVFQNAERARTCNSVVTQVPGMFASINNVTDPATGDVVGYISNAGIPSIANQTIQELDVITPYSVFPTVLFDKGVGMAWWRNMAIGKKMQNIYGSTESTRRDGTGVSALLTWDSKVSTVNAILGGVSGLVSQKMKAENIYNTFVERIEAEYSRVFKNLKGEHVPFCLPQETVPDTGLVDFTTCN 505 T 0.19 Glycoamylase pdbhh F Eukaryota T 6imv 1 A,B A,B A0A4V8H012_TALFU Endo-beta-1,2-glucanase AGIHHHHHHSSEPSCRFAHQYTQEQVLQNPSKFINDVLFWEGKFHQNNISYNSGNGMSYDGTNIDWVTGEGTVKHPFSAASKESLQVMLYAHAIAGSADAARFLSPNNPSAAPGIAASIMDTKLQTYLRFNETYPGFGGFLPWFTSSSQDLTPTWDWNNRVPGLDNGELLWAVYAFIQAAENTSNKSFIDLAKKWQTWMDYTKTTAAHIFYQGEGKVCAVTDIKNQSLPVYHPEQTYACEGTSYLNDPYEGELFTWWLQFFGGLSDADIEALWEYKRPQLVSVDYHIGNVGPITVQKGYWFSSHETWKVLEMPYYDIDIIRRVFQNAERARTCNSVVTQVPGMFASINNVTDPATGDVVGYISNAGIPSIANQTIQELDVITPYSVFPTVLFDKGVGMAWWRNMAIGKKMQNIYGSTESTRRDGTGVSALLTWDSKVSTVNAILGGVSGLVSQKMKAENIYNTFVERIEAEYSRVFKNLKGEHVPFCLPQETVPDTGLVDFTTCN 505 T 0.19 Glycoamylase pdbhh F Eukaryota T 6imw 1 A,B A,B A0A4V8H013_TALFU Endo-beta-1,2-glucanase AGIHHHHHHSSEPSCRFAHQYTQEQVLQNPSKFINDVLFWEGKFHQNNISYNSGNGMSYDGTNIDWVTGEGTVKHPFSAASKESLQVMLYAHAIAGSADAARFLSPNNPSAAPGIAASIMDTKLQTYLRFNETYPGFGGFLPWFTSSSQDLTPTWDWNNRVPGLDNGELLWAVYAFIQAAENTSNKSFIDLAKKWQTWMDYTKTTAAHIFYQGEGKVCAVTDIKNQSLPVYHPEQTYACEGTSYLNDPYQGELFTWWLQFFGGLSDADIEALWEYKRPQLVSVDYHIGNVGPITVQKGYWFSSHETWKVLEMPYYDIDIIRRVFQNAERARTCNSVVTQVPGMFASINNVTDPATGDVVGYISNAGIPSIANQTIQELDVITPYSVFPTVLFDKGVGMAWWRNMAIGKKMQNIYGSTESTRRDGTGVSALLTWDSKVSTVNAILGGVSGLVSQKMKAENIYNTFVERIEAEYSRVFKNLKGEHVPFCLPQETVPDTGLVDFTTCN 505 T 0.16 Glycoamylase unphh F Eukaryota T 6ip5 82 DC zx nascent peptide LSAKKLSSLLTCKYIPP 17 T 2 BLOC1S3 pdbhh F T 6ip6 81 CC zx nascent peptide LSAKKLSSLLTCKYIPP 17 T 2 BLOC1S3 pdbhh F T 6ip8 82 DC zx nascent peptide LSAKKLSSLLTCKYIPP 17 T 2 BLOC1S3 pdbhh F T 6ipv 1 A,B,C,D A,B,C,D A0A5A4PV77_STREX CqsB2 MSQRVPDESGLAQNYVLDRSDLQGLDLVWNENTGMDDMMKLMESKTKETYDHGEIFGQYCSLAEHINVPYDIVFEYAANARSLEEWTYSIRNMKHLGGGLYRADEMIQPNTDIYIRAEAQKGPEHGLVVYPCAWDQGHELWMRYYMTIIDSSKVLDKPGTVVLWTNCKHPYYDRSTENVPDYIAEGRARTDRVWVGDIWPVFHAGHSIEMGNLKRILEHRFGAGKAKLAAALEHHHHHH 239 T 0.0007 Polyketide_cyc2 unppercent F Bacteria T 6ipw 1 A,B,C,D A,B,C,D A0A5A4PV77_STREX CqsB2 MSQRVPDESGLAQNYVLDRSDLQGLDLVWNENTGMDDMMKLMESKTKETYDHGEIFGQYCSLAEHINVPYDIVFEYAANARSLEEWTYSIRNMKHLGGGLYRADEMIQPNTDIYIRAEAQKGPEHGLVVYPCAWDQGHELWMRYYMTIIDSSKVLDKPGTVVLWTNCKHPYYDRSTENVPDYIAEGRARTDRVWVGDIWPVFHAGHSIEMGNLKRILEHRFGAGKAKLAAALEHHHHHH 239 T 0.0007 Polyketide_cyc2 unppercent F Bacteria T 6iqg 2 C,D C,D 18-mer peptide G(HCS)DCAYHRGELVWCT(HCS)H(NH2) GXDCAYHRGELVWCTXHX 18 T 3.1 CHORD pdbhh F T 6iqh 2 B,D C,D 17-mer peptide (GPDCAYHKGELVWCTFH) GPDCAYHKGELVWCTFH 17 T 0.47 DUF1247 pdbhh F T 6iqj 2 C,D C,D FH1_ARATH ATFORMIN-8 RVPPPPPPPPPLP 13 T 19 MRP-S26 pdbhh F Eukaryota F 6iqk 2 K K AtPRF3 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 264 F F F 6iso 2 B,D,F,G,I,K C,D,F,H,L,J ARG-THR-LYS-GLN-THR-ALA-ARG RTKQTAR 7 T 180 NR_Repeat pdbhh F F 6ist 1 A,B,D A,B,D S5MRN1_9CAUD Lysin MFIYYKRTKQGSTEQWFVIGGKRIYLPTMTYVNEANDLIKRYGGNTNVTTYNHDNFGLKMMEAALPQVKV 70 T 0.022 Sial-lect-inser pdb T Viruses T 6itc 5 E B OMPA_ECOLI Translocating peptide MAKKTAIAIAVALAGFATVASYAQYEDGCSGELERQHTFAGGARSISGDGDSPHSYHSG 59 T 1.6999999999999998E-75 OmpA_membrane unp F Bacteria T 6iu7 2 B B TP53B_HUMAN P53BP1 SGKRKLITSEEERSPAKRGRKS 22 T 35 GMAP pdbhh F Eukaryota T 6iua 2 B B TP53B_HUMAN P53BP1 SGKRKLITSEEERDPAKRGRKS 22 T 44 KCTD18_C pdbhh F Eukaryota T 6iui 2 C,D C,D PAXI_HUMAN Paxillin GPGSEFSATRELDELMASLSDFKFMAQG 28 T 2.5 SAM_LFY pdbhh F Eukaryota T 6iv8 1 A,D A,C A0A1C5SD84_9FIRM The selenomethionine (SeMet)-labeled Cas13d MAKKNKMKPRELREAQKKARQLKAAEINNNAAPAIAAMPAAEVIAPVAEKKKSSVKAAGMKSILVSENKMYITSFGKGNSAVLEYEVDNNDYNKTQLSSKDNSNIELGDVNEVNITFSSKHGFGSGVEINTSNPTHRSGESSPVRGDMLGLKSELEKRFFGKTFDDNIHIQLIYNILDIEKILAVYVTNIVYALNNMLGIKDSESYDDFMGYLSARNTYEVFTHPDKSNLSDKVKGNIKKSLSKFNDLLKTKRLGYFGLEEPKTKDTRASEAYKKRVYHMLAIVGQIAQCVFHDKSGAKRFDLYSFINNIDPEYRDTLDYLVEERLKSINKDFIEGNKVNISLLIDMMKGYEADDIIRLYYDFIVLKSQKNLGFSIKKLREKMLEEYGFRFKDKQYDSVRSKMYKLMDFLLFCNYYRNDVAAGEALVRKLRFSMTDDEKEGIYADEAAKLWGKFRNDFENIADHMNGDVIKELGKADMDFDEKILDSEKKNASDLLYFSKMIYMLTYFLDGKEINDLLTTLISKFDNIKEFLKIMKSSAVDVECELTAGYKLFNDSQRITNELFIVKNIASMRKPAASAKLTMFRDALTILGIDDNITDDRISEILKLKEKGKGIHGLRNFITNNVIESSRFVYLIKYANAQKIREVAKNEKVVMFVLGGIPDTQIERYYKSCVEFPDMNSSLEAKRSELARMIKNISFDDFKNVKQQAKGRENVAKERAKAVIGLYLTVMYLLVKNLVNVNARYVIAIHCLERDFGLYKEIIPELASKNLKNDYRILSQTLCELCDDRNESSNLFLKKNKRLRKCVEVDINNADSSMTRKYANCIAHLTVVRELKEYIGDIRTVDSYFSIYHYVMQRCITKRGDDTKQEEKIKYEDDLLKNHGYTKDFVKALNSPFGYNIPRFKNLSIEQLFDRNEYLTEKLEHHHHHH 930 T 0.0023 RB_A pdbpercent F Bacteria T 6iv9 1 A A A0A1C5SD84_9FIRM Cas13d MAKKNKMKPRELREAQKKARQLKAAEINNNAAPAIAAMPAAEVIAPVAEKKKSSVKAAGMKSILVSENKMYITSFGKGNSAVLEYEVDNNDYNKTQLSSKDNSNIELGDVNEVNITFSSKHGFGSGVEINTSNPTHRSGESSPVRGDMLGLKSELEKRFFGKTFDDNIHIQLIYNILDIEKILAVYVTNIVYALNNMLGIKDSESYDDFMGYLSARNTYEVFTHPDKSNLSDKVKGNIKKSLSKFNDLLKTKRLGYFGLEEPKTKDTRASEAYKKRVYHMLAIVGQIAQCVFHDKSGAKRFDLYSFINNIDPEYRDTLDYLVEERLKSINKDFIEGNKVNISLLIDMMKGYEADDIIRLYYDFIVLKSQKNLGFSIKKLREKMLEEYGFRFKDKQYDSVRSKMYKLMDFLLFCNYYRNDVAAGEALVRKLRFSMTDDEKEGIYADEAAKLWGKFRNDFENIADHMNGDVIKELGKADMDFDEKILDSEKKNASDLLYFSKMIYMLTYFLDGKEINDLLTTLISKFDNIKEFLKIMKSSAVDVECELTAGYKLFNDSQRITNELFIVKNIASMRKPAASAKLTMFRDALTILGIDDNITDDRISEILKLKEKGKGIHGLRNFITNNVIESSRFVYLIKYANAQKIREVAKNEKVVMFVLGGIPDTQIERYYKSCVEFPDMNSSLEAKRSELARMIKNISFDDFKNVKQQAKGRENVAKERAKAVIGLYLTVMYLLVKNLVNVNARYVIAIHCLERDFGLYKEIIPELASKNLKNDYRILSQTLCELCDDRNESSNLFLKKNKRLRKCVEVDINNADSSMTRKYANCIAHLTVVRELKEYIGDIRTVDSYFSIYHYVMQRCITKRGDDTKQEEKIKYEDDLLKNHGYTKDFVKALNSPFGYNIPRFKNLSIEQLFDRNEYLTEKLEHHHHHH 930 T 0.0023 RB_A pdbpercent F Bacteria T 6ivx 2 B,D,F,H B,D,F,H NCOR2_HUMAN SMRT TNMGLEAIIRKALMGKYDQWEE 22 T 3.5 RHH_7 pdbhh F Eukaryota T 6iw8 1 A A GOGA2_HUMAN 130 KDA CIS-GOLGI MATRIX PROTEIN,GM130,GM130 AUTOANTIGEN,GOLGIN-95 GPLGSMSEETRQSKLAAAKKKLREYQQRNSPGVPTGAKKKKKIKNGSNPETTT 53 T 0.0086 DUF812 unphh F Eukaryota T 6iwa 1 A A GOGA2_HUMAN 130 KDA CIS-GOLGI MATRIX PROTEIN,GM130,GM130 AUTOANTIGEN,GOLGIN-95 AKKKLREYQQRNSPGVPTGAKKKKKIKN 28 T 0.0086 DUF812 unphh F Eukaryota T 6iwg 3 C C N-myristoylated 4-mer lipopeptide XGGAI 5 T 150 RAMP4 pdbhh F F 6iwh 3 C C C14-GGGI lipopeptide XGGGI 5 T 91 Mqo pdbhh F F 6ixk 1 A,B A,B A0A0H3NK84_SALTS GLYCOSYLTRANSFERASE MIPPLNRYVPALSKNELVKTVTNRDIQFTSFNGKDYPLCFLDEKTPLLFQWFERNPARFGKNDIPIINTEKNPYLNNIIKAATIEKERLIGIFVDGDFFPGQKDAFSKLEYDYENIKVIYRNDIDFSMYDKKLSEIYMENISKQESMPEEKRDCHLLQLLKKELSDIQEGNDSLIKSYLLDKGHGWADFYRNMAMLKAGQLFLEADKVGCYDLSTNSGCIYLDADMIITEKLGGIYIPDGIAVHVERIDGRASMENGIIAVDRNNHPALLAGLEIMHTKFDADPYSDGVCNGIRKHFNYSLNEDYNSFCDFIEFKHDNIIMNTSQFTQSSWARHVQ 336 T 2.2E-05 Glyco_transf_88 pdbhh F Bacteria T 6ixp 2 B,C,E,F B,C,E,F MMR1_YEAST MMR1 GPGSEFGNSARIPCPKTRLARVSVLDLKKIEEQPDSSSG 39 T 0.024 DUF2080 pdb F Eukaryota T 6ixq 2 B B SMY1_YEAST SUPPRESSOR PROTEIN SMY1 GPGSSSSSIATTGSQESFVARPFKKGLNLHSIKVTSSTPKGSENLYFQ 48 T 7.7 CCDC85 pdbhh F Eukaryota T 6ixr 2 B B INP2_YEAST INP2 SGSGSGSGSGSEFNHGFHLDILKGRK 26 T 0.016 Serglycin pdb F Eukaryota T 6izm 2 B C PRGC1_HUMAN PPARGC-1-ALPHA,LIGAND EFFECT MODULATOR 6 QEAEEPSLLKKLLLAPANT 19 T 3.3 Apo-CIII pdbhh F Eukaryota T 6izn 2 B C PRGC1_HUMAN PPARGC-1-ALPHA,LIGAND EFFECT MODULATOR 6 QEAEEPSLLKKLLLAPANT 19 T 3.3 Apo-CIII pdbhh F Eukaryota T 6j03 1 A A V5TER4_9CYAN AMBU4 SAVSIPINNAGFENPFMDVVDDYTIDTPPGWTTYDPNNLVPEKRTTWTSNNGVGYVGPGTQFYNQLAPEGRNIGYIYLSQNPGSGVAGFEQCLDATLEPDTKYTLTVDVGALAGTFKGLSFAGFPGYRVELLAGDTVLAADHNNLFIKEGEFKTSTVTYTSTAKDLHLGQKLGIRLVNLLQDKFSGLDFDNVRLTTEPTE 200 T 0.22 CBM_4_9 pdbpercent F Bacteria T 6j07 2 B B TERB1_HUMAN COILED-COIL DOMAIN-CONTAINING PROTEIN 79 YRCSGCIAVEKSLNSRNFSKLLHSCPYQCDRHKVIVEAEDRYKSELRKSLICNKKILLTP 60 T 5.7 WCCH pdbhh F Eukaryota T 6j0h 2 B B Actinomycin D TXPXXXTXPXX 11 T 61 DUF5572 pdbhh F F 6j0n 3 M,N,O,P,Q,R P,Q,R,S,T,U B6VNN3_PHOAA Pvc12 MSNQDALFHSVKDDIHFDTLLEQAHQVIEKQAEKLWSDTAEHDPGITFLQGISYGVSDLAYRHTLPLKDLLTPAPDEQQQEGIFPAEFGPHNTLTCGPVTADDYRKALLDLHSSDSLDGTQQDEGDFLFRSVQLVREPEKQRYTYWYDATKREYSFVNSEGAKEFTLRGNYWLYLEPTRWTQGNIAAATRQLTEFLTKNRNIGESVSNIIWLQPVDLPLLLDVELDDDVGAQDVPGIFAAVYSTAEQYLMPGAQRYRTEVLQNAGMSNDQIFEGPLLEHGWIPELPAARDYTQRLTLNLSRLVNSLLEIEGIKHVNRLRLDDSFDKTAIEPVKGDTWSWSIKEGYYPRLWGEDPLNQLAQQNGPLRVIAKGGISVSVSKEQIQASLPSQSLIQNEPVILAYGQHRDVGSYYPVSDTLPPCYGLQHSLSESEHLLPLHQFMLPFEQLLACGCQQIAMLPRLLAFQREGYEVWGDQWPFKSGSVNDDAHQDYAPALKDLLGQIALDSDHELDIINYLLGYFGTQRAPRTFTTQLDDFRAVQQGYLAQQPTLTYHRSNIRIDQVSSLQKRIAARMGLGGELFKPQPDLSQLPFYLIEHRALLPVKPNSQFDKEQKPASVTEEGGSQTGQHYVVIEQKGIDGKLTQGQVINLILYEGEQGETQFTIRGQMVFKTEGDKFWLDVNNSAQLEYNLARVMTAAKASKLFWQNSPVWMEDMGYRLAYASDQSSLPVNQRRLTRTVQTPFPPMVVVGSEITLLKQVGIVNLKKAESEKLYAKVVSFDRIEGTLIIERLGNSTLAFPTSEEAWRYSWYFSGEKYERTDRFSFVISVVVNSDLIKLPGVDPYKLEEWVKETILTEFPAHISMIIHWMDREAFLNFANTYQRWQNNGTPLGDAAYSILESLTLGKLPSALKGVGTMRIATSSQREEVVGSNGDQWNTDGITQNELFYVPKES 950 T 0.016 DUF276 pdbhh F Bacteria T 6j0x 2 B,D,F,H E,F,G,H MMS22_YEAST METHYL METHANESULFONATE-SENSITIVITY PROTEIN 22,SYNTHETICALLY LETHAL WITH MCM10 PROTEIN 2 SIIYEPEFNENYLWAE 16 T 1.9 Pept_S41_N pdbhh F Eukaryota T 6j0y 2 B,D C,D SLX4_YEAST Peptide from Structure-specific endonuclease subunit SLX4 GPLGSGSSIRVKLLQESVVKLNPKLVKHNFYRVEANDSEEEETEFDDQFCIADIQLVD 58 T 0.063 RNA_pol_Rpo13 pdbpssm F Eukaryota T 6j2d 3 C C PHOSP_HENDH HeV1 DFANTFLP 8 T 0.0062 AAA_23 unppercent T Viruses T 6j2h 3 C,F C,F PHOSP_HENDH HeV1 DFANTFLP 8 T 0.0062 AAA_23 unppercent T Viruses T 6j31 2 E,F,G,H E,F,G,H KITACINNAMYCIN XXVXVGGXX 9 T 2.4 DUF4183 pdbhh F F 6j3q 2 N,O,P,Q,R,S,T,U,V,W,X,Y,Z 0,3,1,4,2,6,5,7,b,8,c,9,d A0A4Y5TPY8_9CAUD cement protein MPLVYTPAVRGGANPASGSYLLDPQYVNSGVDILQATYGYNINGTANADQLLQRDAILAILEYALKDTAFVNAIQAVAAGSGVTTPASFVSACVTKLTA 99 T 0.14 PilW pdb T Viruses T 6j3y 22 TA,V 5,0 Unknown protein 0 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 31 F F F 6j3y 23 UA,W 6,1 Unknown protein 1 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 30 F F F 6j3y 24 VA,X 7,2 Unknown protein 2 XXXXXXXXXX 10 F F F 6j3z 22 TA,V 5,0 Unknown protein 0 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 31 F F F 6j3z 23 UA,W 6,1 Unknown protein 1 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 30 F F F 6j3z 24 VA,X 7,2 Unknown protein 2 XXXXXXXXXX 10 F F F 6j3z 26 EB 19 Fucoxanthin chlorophyll a/c-binding protein monomer 1 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 215 F F F 6j3z 27 FB 20 Fucoxanthin chlorophyll a/c-binding protein monomer 2 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 143 F F F 6j3z 28 GB 21 Fucoxanthin chlorophyll a/c-binding protein monomer 3 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 155 F F F 6j40 22 TA,V 5,0 Unknown protein 0 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 31 F F F 6j40 23 UA,W 6,1 Unknown protein 1 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 30 F F F 6j40 24 VA,X 7,2 Unknown protein 2 XXXXXXXXXX 10 F F F 6j40 26 EB,PB 19,39 Fucoxanthin chlorophyll a/c-binding protein monomer 1 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 215 F F F 6j40 27 FB,QB 20,40 Fucoxanthin chlorophyll a/c-binding protein monomer 2 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 143 F F F 6j40 28 GB,RB 21,41 Fucoxanthin chlorophyll a/c-binding protein monomer 3 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 155 F F F 6j4v 3 C C TBA1B_HUMAN ALPHA-TUBULIN UBIQUITOUS,TUBULIN K-ALPHA-1,TUBULIN ALPHA-UBIQUITOUS CHAIN DYEEVGVDSVEGEGEEEGECY 21 T 28 Hrs_helical unphh F Eukaryota T 6j54 3 C e ATP synthase subunit e, mitochondrial XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 63 F F F 6j54 5 E g ATP synthase subunit g, mitochondrial XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 84 F F F 6j54 7 G k subunit k analog XXXXXXXXXXXXXXXXXXXXXXXXXXXXX 29 F F F 6j54 11 R u ATP synthase membrane subunit 6.8PL XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 42 F F F 6j56 2 B,D C,D TOM1_HUMAN Peptide from Target of Myb protein 1 GVTSEGKFDKFLEERAKAADRLPNLSS 27 T 0.72 FUSC unp F Eukaryota T 6j5a 3 C e ATP synthase subunit e, mitochondrial XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 63 F F F 6j5a 5 E g ATP synthase subunit g, mitochondrial XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 84 F F F 6j5a 7 G k subunit k analog XXXXXXXXXXXXXXXXXXXXXXXXXXXXX 29 F F F 6j5a 11 R u ATP synthase membrane subunit 6.8PL XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 42 F F F 6j5i 11 O e ATP synthase subunit e XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 63 F F F 6j5i 13 Q g ATP synthase subunit g XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 84 F F F 6j5i 15 S k subunit k analog XXXXXXXXXXXXXXXXXXXXXXXXXXXXX 29 F F F 6j5i 19 DA u ATP synthase membrane subunit 6.8PL XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 42 F F F 6j5j 11 O e ATP synthase subunit e XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 63 F F F 6j5j 13 Q g ATP synthase subunit g XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 84 F F F 6j5j 15 S k subunit k analog XXXXXXXXXXXXXXXXXXXXXXXXXXXXX 29 F F F 6j5j 19 DA u ATP synthase membrane subunit 6.8PL XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 42 F F F 6j5k 11 AD,O,SA,WB Ce,e,Ae,Be ATP synthase subunit e XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 63 F F F 6j5k 13 CD,Q,UA,YB Cg,g,Ag,Bg ATP synthase subunit g XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 84 F F F 6j5k 15 AC,ED,S,WA Bk,Ck,k,Ak subunit k analog XXXXXXXXXXXXXXXXXXXXXXXXXXXXX 29 F F F 6j5k 19 DA,HB,LC,PD u,Au,Bu,Cu ATP synthase membrane subunit 6.8PL XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 42 F F F 6j60 1 A A ROA1_HUMAN 9-mer peptide (GFGGNDNFG) from Heterogeneous nuclear ribonucleoprotein A1 GFGGNDNFG 9 T 2.3 UPF0738 pdbhh F Eukaryota F 6j67 2 B C 3FB-PHE-B8R-LEU-5XU-PRO XFXLAP 6 T 190 DUF5525 pdbhh F F 6j68 2 C,D C,D LATS1_MOUSE LARGE TUMOR SUPPRESSOR HOMOLOG 1,WARTS PROTEIN KINASE GPGSVAEAPSYQGPPPPYPKHLLHQNPS 28 T 9.1 Nt_Gln_amidase pdbhh F Eukaryota T 6j7v 1 A A H9ABP6_9VIRU VP5 IAPLVGYAIGAAAISAVGGIGVGWTLREFEVVGSDDPAEGLTPDVLRNQLSDSVVKRKSNNQSTMVDNQNILDGVEHTAYTEAKIAAIEELNAGSSESAVLSAANSAIDSYETTVRTNFYKSWNETVRELEAMTQTVIAHADVGLSYITDFGDPRFGNLASGTSPNTLKDTTVSMPDGTNFTLLTFRHNTGWDSGNAAYSVVEYNPKEVVTSTNSNTYNTVDGTQYMKFSEWNAVETEMDTVFQNVRNGISTWVTNVYGDVQSGAIEISDLVTPRERATMMAQEEGMSQAIADLIALNVPVDAEREATITIQDTGATLPGTFALTDSSDGPLSAGQTYDPSTFSGDVYFTADMSLVEGPWDAINSGVDGGTITITSEPYEGTAIEVTTVESETVSVPAADWTDNGDGTWSYDASGDLETTITNVDSARFVSTATETTYDTLQLKGAFTVDKLVNKQSGEEVSSTSFTSSEPQTDSNYITQDEWDQLEQQNKELIEKYEQSQSGGGLDLGGLDMFGVPGEMVAVGAAAVIGFLMLGNN 537 T 0.062 B56 pdbpssm T Viruses T 6j8e 3 C D CM3A_CONKI Mu-conotoxin KIIIA CCNCSSKWCRDHSRCC 16 T 0.49 C5HCH pdbhh F Eukaryota T 6j8f 3 C C TBA1A_HUMAN 8-mer peptide GEEEGECY 8 T 3.7 Peptidase_C8 pdbhh F Eukaryota F 6j8o 3 C C TBA1B_HUMAN 8-mer peptide GEEEGEEY 8 T 81 DUF2981 pdbhh F Eukaryota F 6j9e 8 I J Q8LTJ5_9CAUD RNA POLYMERASE INHIBITOR P7 GAMAMNEFTQISGYVNAFGSQRGSVLTVKVENDEGWTLVEEDFDRADYGSDPEFVAEVSSYLKRNGGIKDLTKVLTR 77 T 0.18 DUF1494 unp T Viruses T 6j9f 8 I J Q8LTJ5_9CAUD RNA POLYMERASE INHIBITOR P7 GAMAMNEFTQISGYVNAFGSQRGSVLTVKVENDEGWTLVEEDFDRADYGSDPEFVAEVSSYLKRNGGIKDLTKVLTR 77 T 0.18 DUF1494 unp T Viruses T 6j9k 1 A A A0A425B3G2_NEIME AcrIIC2 SMASKNNIFNKYPTIIHGEARGENDEFVVHTRYPRFLARKSFDDNFTGEMPAKPVNGELGQIGEPRRLAYDSRLGLWLSDFIMLDNNKPKNMEDWLGQLKAACDRIAADDLMLNEDAADLEGWDD 125 T 0.029 SfsA_N pdbpercent F Bacteria T 6j9l 1 A,B A,B A0A425B3G2_NEIME AcrIIC2 SMASKNNIFNKYPTIIHGEARGENDEFVVHTRYPRFLARKSFDDNFTGEMPAKPVNGELGQIGEPRRLAYDSRLGLWLSDFIMLDNNKPKNMEDWLGQLKAACDRIAADDLMLNEDAADLEGWDD 125 T 0.029 SfsA_N pdbpercent F Bacteria T 6j9l 2 C E CAS9_FRATN HNH endonuclease family protein SKDSYTLLMNNRTARRHQRRGIDRKQLVKRLFKLIWTEQLNLEWDKD 47 T 5.5 RRXRR unphh F Bacteria T 6j9m 1 A,F A,F CAS9_NEIM8 CRISPR-associated endonuclease Cas9 SVPKTGDSLAMARRLARSVRRLTRRRAHRLLRTRRLLKREGVLQAANFDENGLIKSLPNTPWQLRAAALDRKLT 74 T 0.00058 Cas9-BH pdbhh F Bacteria T 6j9m 2 B,C,D,E B,C,D,E A0A425B3G2_NEIME AcrIIC2 SMASKNNIFNKYPTIIHGEARGENDEFVVHTRYPRFLARKSFDDNFTGEMPAKPVNGELGQIGEPRRLAYDSRLGLWLSDFIMLDNNKPKNMEDWLGQLKAACDRIAADDLMLNEDAADLEGWDD 125 T 0.029 SfsA_N pdbpercent F Bacteria T 6j9n 2 B B A0A425B395_NEIME AcrIIC3 SMAFKRAIIFTSFNGFEKVSRTEKRRLAKIINARVSIIDEYLRAKDTNASLDGQYRAFLFNDESPAMTEFLAKLKAFAESCTGISIDAWEIEESEYVRLPVERRDFLAAANGKEIFKI 118 T 1.8 Iron_traffic unphh F Bacteria T 6j9p 1 A A salt-resistant antimicrobial peptide RR12 RRLIRLILRLLR 12 T 12 LisH_TPL pdbhh F F 6jcu 2 B,D B,D COBL_MOUSE Peptide from Protein cordon-bleu SLHSALMEAIRSSGGREKLRKV 22 T 0.00025 WH2 unppercent F Eukaryota T 6jd7 1 A,B,C A,B,C AcrIIC2 SMSKNNIFNKYPTIIHGEARGENDEFVVHTRYPRFLARKSFDDNFTGEMPAKPVNGELGQIGEPRRLAYDSRLGLWLSDFIMLDNNKPKNMEDWLGQLKAACDRIAADDLMLNEDAADLEGWDD 124 T 0.028 SfsA_N pdbpercent F T 6jdj 1 A,B A,B AcrIIC2 SMSKNNIFNKYPTIIHGEARGENDEFVVHTRYPRFLARKSFDDNFTGEMPAKPVNGELGQIGEPRRLAYDSRLGLWLSDFIMLDNNKPKNMEDWLGQLKAACDRIAADDLMLNEDAADLEGWDD 124 T 0.028 SfsA_N pdbpercent F T 6jdj 2 C C CAS9_NEIM8 CRISPR-associated endonuclease Cas9 SMAAFKPNSINYILGLDIGIASVGWAMVEIDEEENPIRLIDLGVRVFERAEVPKTGDSLAMARRLARSVRRLTRRRAH 78 T 0.00011 Pox_A22 pdbhh F Bacteria T 6jdx 1 A,B A,B AcrIIC2 SMSKNNIFNKYPTIIHGEARGENDEFVVHTRYPRFLARKSFDDNFTGEMPAKPVNGELGQIGEPRRLAYDSRLGLWLSDFIMLDNNKPKNMEDWLGQLKAACDRIAADDLMLNEDAADLEGWDD 124 T 0.028 SfsA_N pdbpercent F T 6jdx 2 C C CAS9_NEIM8 CRISPR-associated endonuclease Cas9 SMAAFKPNSINYILGLDIGIASVGWAMVEIDEEENPIRLIDLGVRVFERAEVPKTGDSLAMARRLARSVRRLTRRRAH 78 T 0.00011 Pox_A22 pdbhh F Bacteria T 6je4 5 E,J,S,T I,J,S,T A0A425B395_NEIME AcrIIC3 SMFKRAIIFTSFNGFEKVSRTEKRRLAKIINARVSIIDEYLRAKDTNASLDGQYRAFLFNDESPAMTEFLAKLKAFAESCTGISIDAWEIEESEYVRLPVERRDFLAAANGKEIFKI 117 T 1.8 Iron_traffic unphh F Bacteria T 6je9 3 E,F E,F A0A425B395_NEIME AcrIIC3 SMFKRAIIFTSFNGFEKVSRTEKRRLAKIINARVSIIDEYLRAKDTNASLDGQYRAFLFNDESPAMTEFLAKLKAFAESCTGISIDAWEIEESEYVRLPVERRDFLAAANGKEIFKI 117 T 1.8 Iron_traffic unphh F Bacteria T 6jez 2 B C MED1_HUMAN ACTIVATOR-RECRUITED COFACTOR 205 KDA COMPONENT,ARC205,MEDIATOR COMPLEX SUBUNIT 1,PEROXISOME PROLIFERATOR-ACTIVATED RECEPTOR-BINDING PROTEIN,PPAR-BINDING PROTEIN,THYROID HORMONE RECEPTOR-ASSOCIATED PROTEIN COMPLEX 220 KDA COMPONENT,TRAP220,THYROID RECEPTOR-INTERACTING PROTEIN 2,TRIP-2,VITAMIN D RECEPTOR-INTERACTING PROTEIN COMPLEX COMPONENT DRIP205,P53 REGULATORY PROTEIN RB18A KNHPMLMNLLKDN 13 T 3.8 Gb3_synth pdbhh F Eukaryota T 6jf6 2 E E MET-ALA-SER MAS 3 T 280 zf-C2H2_4 pdbhh F F 6jfa 2 C C MET-ALA-SER MAS 3 T 280 zf-C2H2_4 pdbhh F F 6jfo 2 B B FME-ALA-SER MAS 3 T 280 zf-C2H2_4 pdbhh F F 6jg9 2 B,D C,D arbitrium peptide GMPRGA 6 T 2.4 BCD pdbhh F F 6jhc 1 A A A0A0H5BR52_9MYRI Hydroxynitrile lyase LTCDQLPKAAINPIQEFIDSNPLEFEYVLTETFECTTRIYVQPARWSTTKAPTALDIKGTQIMAYDFVGGPENSAHLNECHTGDKQVWYFQYTNLLTDNGSSYCAYRCNGTEIIEYKCASNNNGTDPLQHQAMEVAKTVPNGDKIHYAKSNCPETHGCFAFY 162 T 29 RNR_inhib pdbhh F Eukaryota T 6jhv 1 A,B A,B A0A425B395_NEIME AcrIIC3 MMFKRAIIFTSFNGFEKVSRTEKRRLAKIINARVSIIDEYLRAKDTNASLDGQYRAFLFNDESPAMTEFLAKLKAFAESCTGISIDAWEIEESEYVRLPVERRDFLAAANGKEIFKI 117 T 1.8 Iron_traffic unphh F Bacteria T 6jhw 1 A,C A,C A0A425B395_NEIME AcrIIC3 MAFKRAIIFTSFNGFEKVSRTEKRRLAKIINARVSIIDEYLRAKDTNASLDGQYRAFLFNDESPAMTEFLAKLKAFAESCTGISIDAWEIEESEYVRLPVERRDFLAAANGKEIFKI 117 T 1.8 Iron_traffic unphh F Bacteria T 6jhz 2 C Q 5-mer peptide GGGGG 5 T 56 Parvo_coat pdbhh F F 6ji7 1 A A coffeetide EGECSPLGEPCAGNPWGCCPGCICIWQLTDRCVGNC 36 T 0.0084 DUF5637 pdbhh F T 6jij 2 D,E,F D,E,F 02J-ALA-VAL-LEU-PJE-010 XAVLXX 6 T 1700 zf-H2C2_5 pdbhh F F 6jja 1 A A Q15BP7_9VIRU Nucleocapsid protein CP17 MNKRINNNRRTMRSRRGRGRTMGSNLIPYANSPVPIPYTPPVTPVTVIGNPRKTTWIDIDLSSEESGIYTLTVGSYRNRITKLGPSKPNFIIEKVAAYAAPGDYKVVLNDFKTGIQVVDEGSYAHRAAAGILYPPAAQMFYGISATGTLNTITTTAKDPVPVVRALVTYWDSEQ 174 T 0.036 ALMS_repeat pdb T Viruses T 6jjk 2 G,H,I,J,K,L,M,N,O,P,Q,R G,H,I,J,K,L,M,N,O,P,Q,R CYS-TYR-TYR-LYS-ILE CYYKI 5 T 24 ApeC pdbhh F F 6jjl 2 G,H,I,J,K,L,M,N,O,P,Q,R G,M,H,N,I,Q,J,O,K,R,L,P CYS-TYR-ARG-LYS-LEU CYRKL 5 T 29 zf-CCHC pdbhh F F 6jjo 2 G,H,I,J,K,L,M,N,O,P,Q,R G,M,H,N,I,O,J,P,K,Q,L,R TMB-CYRKL modulator CYRKL 5 T 29 zf-CCHC pdbhh F F 6jjw 2 B U PTN14_HUMAN PROTEIN-TYROSINE PHOSPHATASE PEZ GPGSSHRHSAIIVPSYRPTPDYETVMRQMKRG 32 T 8.7 SpoIISB_antitox pdbhh F Eukaryota T 6jjx 2 C,D D,C AMOT_HUMAN Peptide from Angiomotin GPGSGRTEGQLMRYQHPPEYGAARPA 26 T 0.46 DUF6092 pdbhh F Eukaryota T 6jk2 1 A A A0A3B6UEU4_9AGAR Lectin APVPVTKLVCDGDTYKCTAYLDFGDGRWVAQWDTNVFHTG 40 T 0.07 C2-set pdbhh F Eukaryota T 6jk3 1 A,B,C A,B,C A0A3B6UEU4_9AGAR Lectin APVPVTKLVCDGDTYKCTAYLDFGDGRWVAQWDTNVFHTG 40 T 0.07 C2-set pdbhh F Eukaryota T 6jky 1 A,D A,D Q5ZTL3_LEGPH MvcA ASLESPGFMVHKKLKSMSQSYGVMMTGVPAEVLGQMQAERSIPSINKTGNLKQQIAKEVSKVCHMMTEPTQSAGQASNDVCELLLGKIEAEKFHFTKYEALSADGDNLKNVLENTAPSSTNLLIRFEIDREDPPIVLVKTKNENFNPETAVKNKIYLLENKLYFIDKMGNLFNLGPGKKKCTQLFNAIGDSAEYSLCDPFVLEEPEKPEDFAISEIVDIFNEQKERFDFWIGSHSFTIYIPQTLGESPRQFYPYQAYFGSHTLQDWFVSDKDEYLSRIGIDKYIEKLAVLGKTTNTKERSDIYAEFFSKRGREAFFCAHLNEKRQPLRVKFKITEINPELALKNLQETQEFIDTHPGENPSDKVENYRNRAKLAMTEHLESLLDI 385 T 0.022 AgrD pdb F Bacteria T 6jle 2 B E MYO3A_HUMAN Myosin-IIIa GSDNKDSKATSEREACGLAIFSKQISKLSEEYFILQKKLNEMILSQQLKS 50 T 15 Uds1 pdbhh F Eukaryota T 6jlu 14 MA,N n,N Psb34 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 30 F F F 6jlu 18 QA,R r,R PsbG XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 73 F F F 6jmu 2 C,D C,D PAXI_MOUSE Paxillin GSGSGSGSGSSATRELDELMASLSDFKMQGLE 32 T 0.094 Serglycin pdb F Eukaryota T 6jn0 2 B B C0O-DAL-API XXX 3 F F F 6jn1 2 B B C0O-DAL-DAL XXX 3 F F F 6jod 2 B B ANGT_HUMAN Angiotensin II DRVYIHPF 8 T 0.67 Nairo_nucleo pdbhh F Eukaryota T 6jon 1 A A M5AAG8_9CAUD Primase MGSSHHHHHHSSGLVPRGSHMIMEIPAIKALSRYAQWVIWKKERDTKIPYNPNNGKKASSTDPLAWGDIDEAQAGLVRYGANGLGFVLTKSDPFVFIDLDHVLDENKRVKCEWARQLLKEIKSYTEISPSGDGLHVVVSGKLPDYIKHKTKFDDGSALEVYESGRYMTITGEVFDGRDDIKELDLSILGEFAEHKIETKNAPVQIESATTLDDEAIIDLMKRKGQWPDAPKDGDDWSSLDMSFANRLAFWCGKDIERMDRIFRQSPLMRQKWDRPTAGSTYGRITLKKACDFVDSVYDPALRNESDCPFEPYNEEGGPRN 320 T 0.0011 VirE_N pdbhh T Viruses T 6jop 1 A A M5AAG8_9CAUD Primase MGSSHHHHHHSSGLVPRGSHMIMEIPAIKALSRYAQWVIWKKERDTKIPYNPNNGKKASSTDPLAWGDIDEAQAGLVRYGANGLGFVLTKSDPFVFIDLDHVLDENKRVKCEWARQLLKEIKSYTEISPSGDGLHVVVSGKLPDYIKHKTKFDDGSALEVYESGRYMTITGEVFDGRDDIKELDLSILGEFAEHKIETKNAPVQIESATTLDDEAIIDLMKRKGQWPDAPKDGDDWSSLDMSFANRLAFWCGKDIERMDRIFRQSPLMRQKWDRPTAGSTYGRITLKKACDFVDSVYDPALRNESDCPFEPYNEEGGPRN 320 T 0.0011 VirE_N pdbhh T Viruses T 6joq 1 A A M5AAG8_9CAUD Primase MGSSHHHHHHSSGLVPRGSHMIMEIPAIKALSRYAQWVIWKKERDTKIPYNPNNGKKASSTDPLAWGDIDEAQAGLVRYGANGLGFVLTKSDPFVFIDLDHVLDENKRVKCEWARQLLKEIKSYTEISPSGDGLHVVVSGKLPDYIKHKTKFDDGSALEVYESGRYMTITGEVFDGRDDIKELDLSILGEFAEHKIETKNAPVQIESATTLDDEAIIDLMKRKGQWPDAPKDGDDWSSLDMSFANRLAFWCGKDIERMDRIFRQSPLMRQKWDRPTAGSTYGRITLKKACDFVDSVYDPALRNESDCPFEPYNEEGGPRN 320 T 0.0011 VirE_N pdbhh T Viruses T 6joz 3 C C ALA-THR-ILE-GLY-THR-ALA-MET-TYR-LYS ATIGTAMYK 9 T 1.6 DUF3362 pdbhh F T 6jp3 3 C C ALA-THR-ILE-GLY-THR ATIGT 5 T 210 K1 pdbhh F F 6jp3 4 D D ALA-MET-TYR-LYS AMYK 4 T 150 Leg1 pdbhh F F 6jpp 1 A A ELMO1_HUMAN PROTEIN CED-12 HOMOLOG GMPPPADIVKVAIEWPGAYPKLMEIDQKKPLSAIIKEVCDGWSLANHEYFALQHADSSNFYITEKNRNEIKNGTILRLTTSPAQNAQQLHERIQSSSMDAKLEALKDLASLSRD 114 T 0.003 FERM_N pdb F Eukaryota T 6jpw 3 I,J,K J,K,L SER-C0F-GLY-LYS-ARG-LYS SXGKRK 6 T 130 Ribosomal_S9 pdbhh F F 6jq0 2 G G unknown substrate XXXXXXXXXXXXXX 14 F F F 6jqa 1 A A X5IFG3_ONYPH Phytoplasmal effector causing phyllody 1 MNKDIASASNNNQNITNXSIEENIINLKXKIRKNAVKKINTEREIQQLSNNDPNKNTLLALKQNLENLIHNQKEQLKTYQKLLKTLNDENN 91 T 0.15 DUF3349 unppercent F Bacteria T 6jqa 2 B,C B,C X5IFG3_ONYPH Phytoplasmal effector causing phyllody 1 MNKDIASASNNNQNITNYSIEENIINLKYKIRKNAVKKINTEREIQQLSNNDPNKNTLLALKQNLENLIHNQKEQLKTYQKLLKTLNDENN 91 T 0.15 DUF3349 unppercent F Bacteria T 6jqa 3 D D X5IFG3_ONYPH Phytoplasmal effector causing phyllody 1 MNKDIASASNNNQNITNYSIEENIINLKXKIRKNAVKKINTEREIQQLSNNDPNKNTLLALKQNLENLIHNQKEQLKTYQKLLKTLNDENN 91 T 0.15 DUF3349 unppercent F Bacteria T 6jsh 3 C,F,I C,H,I FAS2_YEAST FATTY ACID SYNTHASE SR2 HELICES LNMKYRKRQLVTREAQIKDWVENELEALKLEAEEIPSEDQNEFLLERTREIHNEAESQLRAAQQQWGNDFY 71 T 5.5E-10 SpoVAD unphh F Eukaryota T 6jsi 3 C,F,I C,H,I FAS2_YEAST FATTY ACID SYNTHASE SR2 HELICES LNMKYRKRQLVTREAQIKDWVENELEALKLEAEEIPSEDQNEFLLERTREIHNEAESQLRAAQQQWGNDFY 71 T 5.5E-10 SpoVAD unphh F Eukaryota T 6jue 2 B A PAR6B_MOUSE THR-ILE-ILE-THR-LEU LEEDGTIITL 10 T 0.28 CIDE-N pdbhh F Eukaryota T 6jv7 2 B B CO5_RAT D-anaphylatoxin C5a XXXXXXXXXXXXXXXXXXXXXXXXXXXGXXXXXXXXXXXXXXXXXXGXXXXXXXXXXXXXXXXXXXXXXXXGXXXGX 77 F F Eukaryota F 6jwj 2 B C UFD1_YEAST UB FUSION PROTEIN 1,POLYMERASE-INTERACTING PROTEIN 3 GPGHMEPAKLDLPEGQLFFGFPM 23 T 19 AP-5_subunit_s1 pdbhh F Eukaryota T 6jwm 2 B B NOS2_HUMAN HEPATOCYTE NOS,HEP-NOS,INDUCIBLE NO SYNTHASE,INOS,NOS TYPE II,PEPTIDYL-CYSTEINE S-NITROSYLASE NOS2 RGDINNN 7 T 16 DUF6373 pdbhh F Eukaryota F 6jwn 2 B,D B,D NOS2_HUMAN CR9 PEPTIDE RGDINNNVE 9 T 5.5 DUF6373 pdbhh F Eukaryota T 6jxu 2 B B ICP0_HHV11 viral protein NNRDPIVISDSP 12 T 5 MTD pdbhh T Viruses T 6jxv 2 B B ICP0_HHV11 Phosphorylated SLS4-SIM from ubiquitin E3 ligase ICP0 LANNRDPIVISDSPPASPHR 20 T 3.3 Myf5 pdbhh T Viruses T 6jxw 2 B B ICP0_HHV11 SLS4-SIM from Ubiquitin E3 ligase ICP0 LANNRDPIVISDSPPASPHR 20 T 3.3 Myf5 pdbhh T Viruses T 6jxx 2 B B ICP0_HHV11 Phosphorylated SLS4 from E3 ubiquitin ligase ICP0 LANNRDPIVISDSPPASPHR 20 T 3.3 Myf5 pdbhh T Viruses T 6jzd 3 C C TBA1A_MOUSE GLU-GLY-GLU-GLU-TYR VDSVEGEGEEEGEEY 15 T 29 Hrs_helical unphh F Eukaryota T 6jzn 2 E,F,G,H G,F,H,E PDV1_ARATH PROTEIN PLASTID DIVISION1 DHLDVMMARG 10 T 9.3 LicD pdbhh F Eukaryota T 6k06 1 A A GOGA2_HUMAN PHOSPHOMIMETIC GM130,130 KDA CIS-GOLGI MATRIX PROTEIN,GM130,GM130 AUTOANTIGEN,GOLGIN-95 GPLGSMSEETRQSKLAAAKKKLREYQQRNDPGVPTGAKKKKKIKNGSNPETTT 53 T 0.0086 DUF812 unphh F Eukaryota T 6k07 2 B B SHLD3_HUMAN REV7-INTERACTING NOVEL NHEJ REGULATOR 1,SHIELD COMPLEX SUBUNIT 3 MGSKLPLRPKRSPPVISEEAAEDVKQYLTI 30 T 33 Sm_like unphh F Eukaryota T 6k08 2 B B SHLD3_HUMAN REV7-INTERACTING NOVEL NHEJ REGULATOR 1,SHIELD COMPLEX SUBUNIT 3 MGSKLPLRPKRSPPVISEEAAEDVKQYLTI 30 T 33 Sm_like unphh F Eukaryota T 6k0t 2 B,D B,D PRGC1_HUMAN Peroxisome proliferator-activated receptor gamma coactivator 1-alpha EEPSLLKKLLLA 12 T 12 Neurokinin_B pdbhh F Eukaryota T 6k11 1 A,B A,B Q5ZTL3_LEGPH Lpg2148(MvcA) LESPGFMVHKKLKSMSQSYGVMMTGVPAEVLGQMQAERSIPSINKTGNLKQQIAKEVSKVCHMMTEPTQSCGQASNDVCELLLGKIEAEKFHFTKYEALSADGDNLKNVLENTAPSSTNLLIRFEIDREDPPIVLVKTKNENFNPETAVKNKIYLLENKLYFIDKMGNLFNLGPGKKKCTQLFNAIGDSAEYSLCDPFVLEEPEKPEDFAISEIVDIFNEQKERFDFWIGSHSFTIYIPQTLGESPRQFYPYQAYFGSHTLQDWFVSDKDEYLSRIGIDKYIEKLAVLGKTTNTKERSDIYAEFFSKRGREAFFCAHLNEKRQPLRVKFKITEINPELALKNLQETQEFIDTHPGENPSDKVENYRNRAKLAMTEHLESLLDI 383 T 0.022 AgrD pdb F Bacteria T 6k15 8 I E HTL1_YEAST CHROMATIN STRUCTURE-REMODELING COMPLEX PROTEIN HTL1 MSQNNTISSMNPERAYNNVTLKNLTAFQLLSQRENICELLNLVESTERHNSIINPERQRMSLEEMKKMLDALKNERKK 78 T 0.064 DUF4976 pdbpercent F Eukaryota T 6k31 1 A,B A,B AiPEPCK GPGHMSSSPAAPTNSANSAIALRLELLGAPVPDHAARSDFDRETDRLVAPILARQRELTRRLANRPCAADRRIQAFLDSYLDGAAAQPKLPGATLVLDQPGLARALSLPVDATSFTSDYVESYRVLSGVLHNPRNDRRTTAGVFHVAEGGLPIPDDKKAVPRDVFARVLAAAVDAPDDLMTLPWASTQADPARCFVSLLLRPVVVPEVPGFSAERSMEIRFIAPGGLVSNLDFVEGIFGNGGDPYLPENDASLAPESWTGHTGCVILAPHLTRLTKKELGLPAWEEATERQRRDGMCWRGADELYNDGKAFKLVARDERGVIVTIIADNYYGYCKKEVKTQISYSANLFGCVEEEHSGGALAFPRYNLGQEYTDVHTPAGATVERVLARNPGRFEARADGSAVLLDDDGRPDEGIVLVPAGAHFSMRTQTVTWDRADGREASIPLLADRVYIAPGGYRVHAKHREGDATQWHLVGTAPWATQAHKPATVSGGGKSEISKSLLDAFVFGEAYVGDVDADLDAVQKILDGNYADRFVDPANKSAHHRPILSERRSLGSVIKLLTPSSMYTEEYNAFLESIPAHIKELIFTVKRYYQPGWGADWRSHFSVGIINGRKGNSLRLDGEVIKVNMLRVGFEDDGAWRLLSLRPDFSPAAKVQTEDDITSSIVAPGGLESTAGSSVSRKFVTNCESLLFQRPDDAIVRGYDKQTERDMSGTGLFISNYQPLTPADARAMVADAPGLSRFTEPMQELVRRAAAIPEAADPREETYWTSTANPRLVGGAPTRNPRYLQVRPDIANPRDVALADLSIHLYRDAPLAAPARHGVDVVAAGRRNNPPEPGVPALCAYNPLHYMELPELFMEFISSMTGKSPSTTGAGSEGALTKSPFNALPPVYDLNAALLSYALGGYDGWLSSAGYIGPKVKVAHDISLLVPEIFSRMTPQERDARALIEAGYLERLEDFDHEGRRIEASRLGYRMNAAFATAYFGRIFLHPDVVFTEEMLRPELQDPAIFADSVEVIVATHRAVAKHYVDDGSIQWAVPPLKALLEIMYSGRSEEGWTLSSPELRALFERENILASDWYAERVDAKVERDRKQAESAIAALTRFTTTQGNEEVTERLDIEGRLASARAWLDEVTSPAYRAHLVGTLGLQPSLA 1153 T 3 SKI pdbhh F T 6k32 2 B B C7EWL9_CPVBM VP4 FAIDPLKHSKLYEEYGLYLRPHQINQEIKPTTIKKKELAPTIRSIKYASLIHSMLAEHAARHNGTLINPRMYADMITLGNTKVTVTKGTPKAQIDTLKMNGLTVVSKSRRNNKKKPVSDTTATIDENTNAIVTYKALTEMSTLIESFRLPSGLTLIIFDDEKYQSLIPNYINQLIAYTQPHIIPTWQGIADFSDTYLRSYFKRPFELTASNLAAPQKHNLSPMTRSIFNNTGREDAVIRKLYGYGEYVFIKYEGCLITWTGIYGEVTMMVNLSKRDLGLDVGDDYLKEYKKLLFYGVITDAIPSGISTRSTIMKISPHKMMNPSGGALAVLSKFLEAVVSTNVINATLVVYAEKGAGKTSFLSTYAEQLSLASGQVVGHLSSDAYGRWLAKTKDIEEPSFAYDYVLSLDTDDNESYYEQKASELLMSHGISEVAQYELLSVRKKIKMMDEMNEVLIAQLENADTHSERNFYYMVSTGKTTPRILIVEGHFNAQDATIARTDTTVLLRTINDTTQAMRDRQRGGVVQLFLRDTYYRLLPALHTTVYPFEMLESIKRWKWV 559 T 0.00013 AAA_33 unppssm T Viruses T 6k32 5 E C CAPSD_CPVBM VP1 VQSRTDVFNEQFANEALHPMTKVIFNGLDVNTEVQPLSDDFKQISDPKGYLTYSVKYEDQFTKKDKLRASEADDRIVGPTVNLFKYGAAVVNIDLNRDFFDTATGIDLTKGIPLVQDLLVPIGVTAGAEQSAEYVSGLLMVLFKVMTDNRLVIVGETTTPMSNTLSTVVNNVLRTTYHNNVGVNPALLRDFTQVNWLNRDITNMLQQAGTKYGLGLTETRLDYVRLVKTIVGHALNIDHFAASVLNINLRALMEANVTADDRIKALQAHSMISTQFHGPNQGALRPELAFDHDHIIRCLMLAAANYPRLEGIIVQINTGYVASANVIRPVSEKRYFPENLEQNQSAARLVSAVKARASEADISSIHLAIAREVSPMFNVHELKKIAESFEDPSSIVVVLEFILFALFFPTEFNRIKGDIQNVLLLFFSRWYPVEYGIFIQRGATYTINAAGEFEFSGRNEKWDQSLYLSEHFPALFSDVPLAGANTIIAIMRLFTPQGFLRTDDLAIAANFPRASRNPQTYIPYTNQRGTVTNEFASRFRTIVATLANVVNERAVQDDMQKATRSCTKQWLRHLETQFDNIAVAHTDHLSVVYATMSNFMLNFTNNFSGNHATFKPDQYVITSPEGSYKPIIERQGETVDGLTIIDTSIVWPILCQCTYPLVRQSGKGVDAVSIMEEIVYPDPSTTLSQSLSVAQVLSKLTLPDAFINMILSGGDSVVMRTYQTEADDDLDEGIRMTTYDQYLSHIRERLHITNVPDPIYITGASTPDQIAASVQATHVAVVLYQSGVINGSASTYLRENEVLVVMPDYYDVVSRFANANLQMNNNRYHESVLEIADIFDQADFIQTSDAVRQLRALMPTLSTSQIRHAIERIAQITDVDSTDYGKLTLRFLGTLTRSLKMQNAQIRRIRPDGTVLRYDDQIDIEAFRWSRYFLDELRLRRLSVGLRLITNPRIARRFDGVRIMYLTDDDPDPDFVPDVPEGYVAVQYAHRLFSSSLANKRNRVTYTHPPTGMAYPSPTGRPHVHMTINERAGMSKLVADNIIASVIKSNWVVDIHDIEYTAEVMTPSEGYTQHVDAESIMTAPKGKLFHLQFMDGLLRPEPSAFDPPASGEDMRLIYPLQPISVARSMRAIVNHNEVDRPRGAVAPSSYEMDTGTLSRNGDLLYSPVANGQVGIPKLEVDHISFSNVVSMMTANIRTGDDMAVERVNPDDVRAINIRNA 1220 T 120 SRP19 pdbhh T Viruses T 6k32 6 F,G,H D,E,F CAPSD_CPVBM VP1 ALHPMTKVIFNGLDVNTEVQPLSDDFKQISDPKGYLTYSVKYEDQFTKKDKLRASEADDRIVGPTVNLFKYGAAVVNIDLNRDFFDTATGIDLTKGIPLVQDLLVPIGVTAGAEQSAEYVSGLLMVLFKVMTDNRLVIVGETTTPMSNTLSTVVNNVLRTTYHNNVGVNPALLRDFTQVNWLNRDITNMLQQAGTKYGLGLTETRLDYVRLVKTIVGHALNIDHFAASVLNINLRALMEANVTADDRIKALQAHSMISTQFHGPNQGALRPELAFDHDHIIRCLMLAAANYPRLEGIIVQINTGYVASANVIRPVSEKRYFPENLEQNQSAARLVSAVKARASEADISSIHLAIAREVSPMFNVHELKKIAESFEDPSSIVVVLEFILFALFFPTEFNRIKGDIQNVLLLFFSRWYPVEYGIFIQRGATYTINAAGEFEFSGRNEKWDQSLYLSEHFPALFSDVPLAGANTIIAIMRLFTPQGFLRTDDLAIAANFPRASRNPQTYIPYTNQRGTVTNEFASRFRTIVATLANVVNERAVQDDMQKATRSCTKQWLRHLETQFDNIAVAHTDHLSVVYATMSNFMLNFTNNFSGNHATFKPDQYVITSPEGSYKPIIERQGETVDGLTIIDTSIVWPILCQCTYPLVRQSGKGVDAVSIMEEIVYPDPSTTLSQSLSVAQVLSKLTLPDAFINMILSGGDSVVMRTYQTEADDDLDEGIRMTTYDQYLSHIRERLHITNVPDPIYITGASTPDQIAASVQATHVAVVLYQSGVINGSASTYLRENEVLVVMPDYYDVVSRFANANLQMNNNRYHESVLEIADIFDQADFIQTSDAVRQLRALMPTLSTSQIRHAIERIAQITDVDSTDYGKLTLRFLGTLTRSLKMQNAQIRRIRPDGTVLRYDDQIDIEAFRWSRYFLDELRLRRLSVGLRLITNPRIARRFDGVRIMYLTDDDPDPDFVPDVPEGYVAVQYAHRLFSSSLANKRNRVTYTHPPTGMAYPSPTGRPHVHMTINERAGMSKLVADNIIASVIKSNWVVDIHDIEYTAEVMTPSEGYTQHVDAESIMTAPKGKLFHLQFMDGLLRPEPSAFDPPASGEDMRLIYPLQPISVARSMRAIVNHNEVDRPRGAVAPSSYEMDTGTLSRNGDLLYSPVANGQVGIPKLEVDHISFSNVVSMMTANIRTGDDMAVERVNPDDVRAINIRNA 1205 T 120 SRP19 pdbhh T Viruses T 6k32 7 I G CAPSD_CPVBM VP1 KKPPTVVQSRTDVFNEQFANEALHPMTKVIFNGLDVNTEVQPLSDDFKQISDPKGYLTYSVKYEDQFTKKDKLRASEADDRIVGPTVNLFKYGAAVVNIDLNRDFFDTATGIDLTKGIPLVQDLLVPIGVTAGAEQSAEYVSGLLMVLFKVMTDNRLVIVGETTTPMSNTLSTVVNNVLRTTYHNNVGVNPALLRDFTQVNWLNRDITNMLQQAGTKYGLGLTETRLDYVRLVKTIVGHALNIDHFAASVLNINLRALMEANVTADDRIKALQAHSMISTQFHGPNQGALRPELAFDHDHIIRCLMLAAANYPRLEGIIVQINTGYVASANVIRPVSEKRYFPENLEQNQSAARLVSAVKARASEADISSIHLAIAREVSPMFNVHELKKIAESFEDPSSIVVVLEFILFALFFPTEFNRIKGDIQNVLLLFFSRWYPVEYGIFIQRGATYTINAAGEFEFSGRNEKWDQSLYLSEHFPALFSDVPLAGANTIIAIMRLFTPQGFLRTDDLAIAANFPRASRNPQTYIPYTNQRGTVTNEFASRFRTIVATLANVVNERAVQDDMQKATRSCTKQWLRHLETQFDNIAVAHTDHLSVVYATMSNFMLNFTNNFSGNHATFKPDQYVITSPEGSYKPIIERQGETVDGLTIIDTSIVWPILCQCTYPLVRQSGKGVDAVSIMEEIVYPDPSTTLSQSLSVAQVLSKLTLPDAFINMILSGGDSVVMRTYQTEADDDLDEGIRMTTYDQYLSHIRERLHITNVPDPIYITGASTPDQIAASVQATHVAVVLYQSGVINGSASTYLRENEVLVVMPDYYDVVSRFANANLQMNNNRYHESVLEIADIFDQADFIQTSDAVRQLRALMPTLSTSQIRHAIERIAQITDVDSTDYGKLTLRFLGTLTRSLKMQNAQIRRIRPDGTVLRYDDQIDIEAFRWSRYFLDELRLRRLSVGLRLITNPRIARRFDGVRIMYLTDDDPDPDFVPDVPEGYVAVQYAHRLFSSSLANKRNRVTYTHPPTGMAYPSPTGRPHVHMTINERAGMSKLVADNIIASVIKSNWVVDIHDIEYTAEVMTPSEGYTQHVDAESIMTAPKGKLFHLQFMDGLLRPEPSAFDPPASGEDMRLIYPLQPISVARSMRAIVNHNEVDRPRGAVAPSSYEMDTGTLSRNGDLLYSPVANGQVGIPKLEVDHISFSNVVSMMTANIRTGDDMAVERVNPDDVRAINIRNA 1226 T 130 SRP19 pdbhh T Viruses T 6k3a 2 B,D,F B,D,F DNMT1_HUMAN Peptide from DNA (cytosine-5)-methyltransferase 1 STRQTTITSHFAKGPAKRKP 20 T 0.14 AIB pdbhh F Eukaryota T 6k3b 1 A A Q5ZTL4_LEGPH Lpg2147 GSHMEKTGLHVHEKIKHMVKNYGTMITGIPAEILGQNEAEISVGYVKKMGNMKENIAEVVRKSEMTQPTNSAGKASNEVCDLLLGTEGASEFEKSSYQVLSGDGSNLKGSLPNKNLLVRVEMDRFNAPQKYQKIKREEFNPETAEKNKIYLLEDQLVYLDIFGKVIDLGQTSDTCHRLFNAITTPFYQNYILYDEYIDPEESAEEAAMFEMGEIVKAKMKNIDCWTATHSFTIFVPESDSEDTRTLYPYQAYWTSHTLQQWFSGDKDEKLSRLGIDGYIEKLALLGTTTDSKIRSSIYGELFSPPGKEHVFCTGMNEKFSPLRVKFKVTEVNPEIALQNLEEVQEFIDTNYPGENAKDQCELYKIKAQEAMTKQLEMRLLIE 382 T 0.035 V-ATPase_H_N unppercent F Bacteria T 6k3f 2 B,D,F,H,J,L U,V,W,X,Y,Z ACKR3_HUMAN CHEMOKINE-RELATED PROTEIN 1,C-X-C CHEMOKINE RECEPTOR TYPE 7,CXCR-7,CHEMOKINE ORPHAN RECEPTOR 1,G-PROTEIN COUPLED RECEPTOR 159,G-PROTEIN COUPLED RECEPTOR RDC1 HOMOLOG,RDC-1 IFKYSAKTGLTKLID 15 T 48 DUF2589 pdbhh F Eukaryota T 6k4k 1 A,B A,B SIDJ_LEGPH SidJ MFGFIKKVLDFFGVDQSEDNPSETAVETTDVSTKIKTTDTTQEESSVKTKTVVPTQPGGSVKPETIAPDQQKKHQIKTETTTSTTKQKGPKVTLMDGHVKQYYFARRGETSTHDTSLPPPVKVLSGRSIPLKEIPFEATRNELVQIYLTSIDKLIKSNKLNSIPSQQIASHYLFLRSLANSETDGIKKNQILSLAKPLGTYLASKEPHVWKMINELIEKSEYPIIHYLKNNRAHSNFMLALIHEYHKEPLTKNQSAFVQKFRDSSVFLFPNPIYTAWLAHSYDEDSSFNPMFRERLSTNFYHSTLTDNLLLRTEPKEVTLSSEHHYKKEKGPIDSSFRYQMSSDRLLRIQGRTLLFSTPQNDVVAVKVQKKGEPKSTLEEEFEMADYLLKHQRRLDVHSKLPQPLGQYSVKKSEILEISRGSLDFERFKTLIDDSKDLEVYVYKAPQSYFTYLHDKNQDLEDLTASVKTNVHDLFVLLREGIVFPQLADIFHTHFGEDEREDKGRYQALVQLLNVLQFQLGRIDKWQKAVEYVNLRSSGLADLGDSLPITSLFTSSDFTKHYFSELLTGGYHPTFFDKSSGTANSLFTGKRRLFGNYLYLNTIAEYLLVIQLTLGSYGDKVTRDMMDKPKKEAVWRELANVMFTSCAEAIHIMTGIPQSRALTLLKQRANIEKHFRQTQFWMTPDYSKLDEDTLQMEQYSIYSGEPEYEFTDKLVSGVGLSVDGVHQDLGGYNRESPLRELEKLLYATVTLIEGTMQLDKEFFKQLEQVEKILSGEIKTDANSCFEAVAQLLDLARPGCHFQKRLVLSYYEEAKLKYPSAPTDAYDSRFQVVARTNAAITIQRFWREARKNLSEKSDIDSEKPESERTTDKRL 873 T 0.34 IQ pdb F Bacteria T 6k4l 1 A,B A,B SIDJ_LEGPH SidJ MFGFIKKVLDFFGVDQSEDNPSETAVETTDVSTKIKTTDTTQEESSVKTKTVVPTQPGGSVKPETIAPDQQKKHQIKTETTTSTTKQKGPKVTLMDGHVKQYYFARRGETSTHDTSLPPPVKVLSGRSIPLKEIPFEATRNELVQIYLTSIDKLIKSNKLNSIPSQQIASHYLFLRSLANSETDGIKKNQILSLAKPLGTYLASKEPHVWKMINELIEKSEYPIIHYLKNNRAHSNFMLALIHEYHKEPLTKNQSAFVQKFRDSSVFLFPNPIYTAWLAHSYDEDSSFNPMFRERLSTNFYHSTLTDNLLLRTEPKEVTLSSEHHYKKEKGPIDSSFRYQMSSDRLLRIQGRTLLFSTPQNDVVAVKVQKKGEPKSTLEEEFEMADYLLKHQRRLDVHSKLPQPLGQYSVKKSEILEISRGSLDFERFKTLIDDSKDLEVYVYKAPQSYFTYLHDKNQDLEDLTASVKTNVHDLFVLLREGIVFPQLADIFHTHFGEDEREDKGRYQALVQLLNVLQFQLGRIDKWQKAVEYVNLRSSGLADLGDSLPITSLFTSSDFTKHYFSELLTGGYHPTFFDKSSGTANSLFTGKRRLFGNYLYLNTIAEYLLVIQLTLGSYGDKVTRDMMDKPKKEAVWRELANVMFTSCAEAIHIMTGIPQSRALTLLKQRANIEKHFRQTQFWMTPDYSKLDEDTLQMEQYSIYSGEPEYEFTDKLVSGVGLSVDGVHQDLGGYNRESPLRELEKLLYATVTLIEGTMQLDKEFFKQLEQVEKILSGEIKTDANSCFEAVAQLLDLARPGCHFQKRLVLSYYEEAKLKYPSAPTDAYDSRFQVVARTNAAITIQRFWREARKNLSEKSDIDSEKPESERTTDKRL 873 T 0.34 IQ pdb F Bacteria T 6k4r 1 A,B A,B SIDJ_LEGPH SidJ MFGFIKKVLDFFGVDQSEDNPSETAVETTDVSTKIKTTDTTQEESSVKTKTVVPTQPGGSVKPETIAPDQQKKHQIKTETTTSTTKQKGPKVTLMDGHVKQYYFARRGETSTHDTSLPPPVKVLSGRSIPLKEIPFEATRNELVQIYLTSIDKLIKSNKLNSIPSQQIASHYLFLRSLANSETDGIKKNQILSLAKPLGTYLASKEPHVWKMINELIEKSEYPIIHYLKNNRAHSNFMLALIHEYHKEPLTKNQSAFVQKFRDSSVFLFPNPIYTAWLAHSYDEDSSFNPMFRERLSTNFYHSTLTDNLLLRTEPKEVTLSSEHHYKKEKGPIDSSFRYQMSSDRLLRIQGRTLLFSTPQNDVVAVKVQKKGEPKSTLEEEFEMADYLLKHQRRLDVHSKLPQPLGQYSVKKSEILEISRGSLDFERFKTLIDDSKDLEVYVYKAPQSYFTYLHDKNQDLEDLTASVKTNVHDLFVLLREGIVFPQLADIFHTHFGEDEREDKGRYQALVQLLNVLQFQLGRIDKWQKAVEYVNLRSSGLADLGDSLPITSLFTSSDFTKHYFSELLTGGYHPTFFDKSSGTANSLFTGKRRLFGNYLYLNTIAEYLLVIQLTLGSYGDKVTRDMMDKPKKEAVWRELANVMFTSCAEAIHIMTGIPQSRALTLLKQRANIEKHFRQTQFWMTPDYSKLDEDTLQMEQYSIYSGEPEYEFTDKLVSGVGLSVDGVHQDLGGYNRESPLRELEKLLYATVTLIEGTMQLDKEFFKQLEQVEKILSGEIKTDANSCFEAVAQLLDLARPGCHFQKRLVLSYYEEAKLKYPSAPTDAYDSRFQVVARTNAAITIQRFWREARKNLSEKSDIDSEKPESERTTDKRL 873 T 0.34 IQ pdb F Bacteria T 6k4v 1 A A smart chimeric peptide G6 RVQGRWKVRASFFKGGGGSGFAWNVCVYRNGVRVCHRRAN 40 T 0.36 EipB_like pdbhh F T 6k4w 1 A A SCP-A6 RVQGRWKVRASFFKEAAAKEAAAKGFAWNVCVYRNGVRVCHRRAN 45 T 0.012 EipB_like pdb F T 6k5o 2 B C MED1_HUMAN ACTIVATOR-RECRUITED COFACTOR 205 KDA COMPONENT,ARC205,MEDIATOR COMPLEX SUBUNIT 1,PEROXISOME PROLIFERATOR-ACTIVATED RECEPTOR-BINDING PROTEIN,PPAR-BINDING PROTEIN,THYROID HORMONE RECEPTOR-ASSOCIATED PROTEIN COMPLEX 220 KDA COMPONENT,TRAP220,THYROID RECEPTOR-INTERACTING PROTEIN 2,TRIP-2,VITAMIN D RECEPTOR-INTERACTING PROTEIN COMPLEX COMPONENT DRIP205,P53 REGULATORY PROTEIN RB18A KNHPMLMNLLKDN 13 T 3.8 Gb3_synth pdbhh F Eukaryota T 6k5r 2 B B VIE2_HCMVA VIRAL TRANSCRIPTION FACTOR IE2, IE2,PROTEIN UL122 DTAGCIVISDSE 12 T 0.55 DUF2778 pdbhh T Viruses T 6k5t 2 B B VIE2_HCMVA IE2,PROTEIN UL122 DTAGCIVISDSE 12 T 0.55 DUF2778 pdbhh T Viruses T 6k7t 3 C C PHOSP_HENDH HeV1 DFANTFLP 8 T 0.0062 AAA_23 unppercent T Viruses T 6k7w 1 A A UBP19_HUMAN UBIQUITIN SPECIFIC PEPTIDASE 19,DEUBIQUITINATING ENZYME 19,UBIQUITIN THIOESTERASE 19,UBIQUITIN-SPECIFIC-PROCESSING PROTEASE 19,ZINC FINGER MYND DOMAIN-CONTAINING PROTEIN 9 TPELALDWRQSAEEVIVKLRVGVGPLQLEDVDAAFTDTDCVVRFAGGQQWGGVFYAEIKSSCAKVQTRKGSLLHLTLPKKVPMLTWPSLLVE 92 T 0.00018 CS pdbhh F Eukaryota T 6k8e 1 A,B A,D SARX_STAA8 STAPHYLOCOCCAL ACCESSORY REGULATOR X HHHHHHMGSMNTEKLETLLGFYKQYKALSEYIDKKYKLSLNDLAVLDLTMKHCKDEKVLMQSFLKTAMDELDLSRTKLLVSIRRLIEKERLSKVRSSKDERKIYIYLNNDDISKFNALFEDVEQFLNI 128 T 0.00054 AphA_like unphh F Bacteria T 6k8k 2 B,D,F,H E,C,F,H BIC2_ARATH BLUE-LIGHT INHIBITOR OF CRYPTOCHROMES 2 PETTVLSGRDRLKRHREEVAGKVPIPDSWGKEGLLMGWMDFSTFDAAFTSSQIVSARAALMADSGHHHHHH 71 T 0.16 BRD4_CDT pdbpssm F Eukaryota T 6k9a 1 A A M5AAG8_9CAUD Primase SMIMEIPAIKALSRYAQWVIWKKERDTKIPYNPNNGKKASSTDPLAWGDIDEAQAGLVRYGANGLGFVLTKSDPFVFIDLDHVLDENKRVKCEWARQLLKEIKSYTEISPSGDGLHVVVSGKLPDYIKHKTKFDDGSALEVYESGRYMTITGEVFDGRDDIKELDLSILGEFAEHKIETKNAPVQIESATTLDDEAIIDLMKRKGQWPDAPKDGDDWSSLDMSFANRLAFWCGKDIERMDRIFRQSPLMRQKWDRPTAGSTYGRITLKKACDFVDSVYDPALRNESDCPFEPYNEEGGPRNDKEEK 306 T 0.00098 VirE_N pdbhh T Viruses T 6k9b 1 A,B A,B M5AAG8_9CAUD Primase SMIMEIPAIKALSRYAQWVIWKKERDTKIPYNPNNGKKASSTDPLAWGDIDEAQAGLVRYGANGLGFVLTKSDPFVFIDLDHVLDENKRVKCEWARQLLKEIKSYTEISPSGDGLHVVVSGKLPDYIKHKTKFDDGSALEVYESGRYMTITGEVFDGRDDIKELDLSILGEFAEHKIETKNAPVQIESATTLDDEAIIDLMKRKGQWPDAPKDGDDWSSLDMSFANRLAFWCGKDIERMDRIFRQSPLMRQKWDRPTAGSTYGRITLKKACDFVDSVYDPALRNESDCPFEPYNEEGGPRNDKEEK 306 T 0.00098 VirE_N pdbhh T Viruses T 6kac 27 AA,CB 1,0 10 kDa photosystem II polypeptide PsbR (potential) GAAGAAAAAGAAAAAAAAGAAAAAA 25 T 210 S_layer_N pdbhh F F 6kac 28 BA,DB 4,3 Unindentified Stromal Protein (USP) AWAAAAGAAGAGYGVYRYEAAYGAA 25 T 0.017 PsbR pdb F T 6kbb 3 C,F F,E SWC5_YEAST SWR1-complex protein 5 GSMPEVETKIIPNEKEDEDEDGYIEEEDEDFQPEKDKLGGGSDDSDASDGGDDYDDGVNRDKGRNKVDYSRIESESGGLIK 81 T 10 DUF4637 pdbpercent F Eukaryota T 6kbm 2 B B ATG13_YEAST Autophagy-related protein 13 GGNSSTSALNSRRNSLDKSSNKQGMSGLPPIFGGESTSYHHDNKIQKYNQLGVEEDDDDENDRLLNQMGNSATKFKSSISPRSIDSISSSFIKSRIPIRQPYHYSQPTTAPFQAQAKFHKPANKLIDNG 129 T 20 RCS1 pdbhh F Eukaryota T 6kbn 2 B,D B,D ATG13_YEAST Autophagy-related protein 13 GGNSSTSALNSRRNSLDKSSNKQGMSGLPPIFGGESTSYHHDNKIQKYNQLGVEEDDDDENDRLLNQMGNSATKFKSSISPRSIDSISSSFIKSRIPIRQPYHYSQPTTAPFQAQAKFHKPANKLIDNG 129 T 20 RCS1 pdbhh F Eukaryota T 6kbo 1 A A VG16KRKP VARGWKRKCPLFGKGG 16 T 0.0078 Flavi_glycoprot pdbhh F T 6kbv 1 A A VG16KRKP VARGWKRKCPLFGKGG 16 T 0.0078 Flavi_glycoprot pdbhh F T 6kbw 1 A,B A,B A0A0B5RNJ4_9FLAO FLAVIN-CONTAINING MONOOXYGENASE MLNLKVGIIGAGPSGLAMLRAFESEQKKGNPIPEIKCYEKQDNWGGMWNYTWRTGVGKYGEPIHGSMYKYLWSNGPKECLEFSDYTFMEHFKQPISSYPPREVLFDYIQGRIKQSNARDFIKFNTVARWVDYLEDKKQFRVIFDDLVKNETFEEYFDYLVVGTGHFSTPNMPYFKGIDSFPGTVMHAHDFRGADQFIDKDILLIGSSYSAEDIGVQCFKHGSKSVTISYRTNPIGAKWPKGIEEKPIVTHFEDNVAHFKDGSKKEYDAVILCTGYQHKFPFLPDNLRLKTKNNLYPDNLYKGVVFNENERLIFLGMQDQYYTFNMFDTQAWFARDYMLGRIALPNKEIRDKDIAKWVELEKTSVTGEEHVDFQTDYIKELIEMTDYPTFDLDRVAEMFKSWLNDKETNILNYRDKVYTSVMTGVTAEEHHTPWMKELDDSLERYLDEVEVDELELSKENYYHHHHHH 467 T 1.6E-26 FMO-like unp F Bacteria T 6kc4 2 B,D,F,H,J,L B,D,F,H,J,L phosphopeptide (EDpYENVD) DEXENVD 7 T 4.1 HGAL pdbhh F F 6kd5 3 C C DECANOYL-ARG-VAL-LYS-ARG-CHLOROMETHYLKETONE INHIBITOR XRVKXX 6 T 280 MIB_HERC2 pdbhh F F 6kdq 2 C,D E,F CENPA_HUMAN CENP-A peptide XPRRRSR 7 T 47 Rubi_NSP_C pdbhh F Eukaryota F 6kds 2 B E CENPA_HUMAN CENP-A peptide XPRRRS 6 T 65 EpmC pdbhh F Eukaryota F 6ke6 66 RB RV FAF1_YEAST FORTY S ASSEMBLY FACTOR MTLDDDDYIKQMELQRKAFESQFGSLESMGFEDKTKNIRTEVDTRDSSGDEIDNSDHGSDFKDGTIESSNSSDEDSGNETAEENNQDSKPKTQPKVIRFNGPSDVYVPPSKKTQKLLRSGKTLTQINKKLESTEAKEEKEDETLEAENLQNDLELQQFLRESHLLSAFNNGGSGSTNSGVSLTLQSMGGGNDDGIVYQDDQVIGKARSRTLEMRLNRLSRVNGHQDKINKLEKVPMHIRRGMIDKHVKRIKKYEQEAAEGGIVLSKVKKGQFRKIESTYKKDIERRIGGSIKARDKEKATKRERGLKISSVGRSTRNGLIVSKRDIARISGGERSGKFNGKKKSRR 346 T 4.1E-08 DUF4602 unppssm F Eukaryota T 6ke6 67 SB X1 Unassigned helices XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 347 F F F 6key 2 B B NOS2_HUMAN HEPATOCYTE NOS,HEP-NOS,INDUCIBLE NO SYNTHASE,INOS,NOS TYPE II,PEPTIDYL-CYSTEINE S-NITROSYLASE NOS2 KDINNNVEK 9 T 8.2 ZirS_C pdbhh F Eukaryota T 6kf3 3 C C RPOA2_THEKO DNA-directed RNA polymerase subunit A'' MVAEKTIKSMVSKAELPDNIKEELYAKLIEYNEKYKLKKDEIQAIIDETVREYQKALIEPGEAVGTVAAQSIGEPSTQMTLNTFHYAGVAEINVTLGLPRIIEIVDARKNPSTPIMTVYLDEEHRYDRDKALEVARRIEGTTLENLAREETIDILNMEYVVEIDPERLEKAGLDMEKVVRKLTGSFKSAEFEAEGYTLVVRPKKVTKLSDLRKIAEKVKKHRLKGLSGVGKTIIRKEGDEYVIYTEGSNFKQVLKVPGVDPTRTRTNNIWEIAEVLGIEAARNAIIDEIVSTMREQGLEVDVRHIMLVADMMTLDGVIRPIGRHGIVGEKASVLARAAFEITTQHLFAAAERGEVDPLNGVVENVLIGQPVPVGTGIVKLAMSLPLRPKRE 391 T 0.0004 RNA_pol_Rpb1_6 pdbpercent F Archaea T 6kf4 3 C C RPOA2_THEKO DNA-directed RNA polymerase subunit A'' MVAEKTIKSMVSKAELPDNIKEELYAKLIEYNEKYKLKKDEIQAIIDETVREYQKALIEPGEAVGTVAAQSIGEPSTQMTLNTFHYAGVAEINVTLGLPRIIEIVDARKNPSTPIMTVYLDEEHRYDRDKALEVARRIEGTTLENLAREETIDILNMEYVVEIDPERLEKAGLDMEKVVRKLTGSFKSAEFEAEGYTLVVRPKKVTKLSDLRKIAEKVKKHRLKGLSGVGKTIIRKEGDEYVIYTEGSNFKQVLKVPGVDPTRTRTNNIWEIAEVLGIEAARNAIIDEIVSTMREQGLEVDVRHIMLVADMMTLDGVIRPIGRHGIVGEKASVLARAAFEITTQHLFAAAERGEVDPLNGVVENVLIGQPVPVGTGIVKLAMSLPLRPKRE 391 T 0.0004 RNA_pol_Rpb1_6 pdbpercent F Archaea T 6kf9 3 C C RPOA2_THEKO DNA-directed RNA polymerase subunit A'' MVAEKTIKSMVSKAELPDNIKEELYAKLIEYNEKYKLKKDEIQAIIDETVREYQKALIEPGEAVGTVAAQSIGEPSTQMTLNTFHYAGVAEINVTLGLPRIIEIVDARKNPSTPIMTVYLDEEHRYDRDKALEVARRIEGTTLENLAREETIDILNMEYVVEIDPERLEKAGLDMEKVVRKLTGSFKSAEFEAEGYTLVVRPKKVTKLSDLRKIAEKVKKHRLKGLSGVGKTIIRKEGDEYVIYTEGSNFKQVLKVPGVDPTRTRTNNIWEIAEVLGIEAARNAIIDEIVSTMREQGLEVDVRHIMLVADMMTLDGVIRPIGRHGIVGEKASVLARAAFEITTQHLFAAAERGEVDPLNGVVENVLIGQPVPVGTGIVKLAMSLPLRPKRE 391 T 0.0004 RNA_pol_Rpb1_6 pdbpercent F Archaea T 6kfa 1 A A A0A0H5BR52_9MYRI Hydroxynitrile lyase LTCDQLPKAAINPIQEFIDSNPLEFEYVLTETFECTTRIYVQPARWSTTKAPTALDIKGTQIMAYDFVGGPENSAHLNECHTGDKQVWYFQYTNLLTDNGSSYCAYRCNGTEIIEYKCASNNNGTDPLQHQAMEVAKTVPNGDKIHYAKSNCPETHGCFAFY 162 T 29 RNR_inhib pdbhh F Eukaryota T 6kfb 1 A A A0A0H5BR52_9MYRI Hydroxynitrile lyase LTCDQLPKAAINPIQEFIDSNPLEFEYVLTETFECTTRIYVQPARWSTTKAPTALDIKGTQIMAYDFVGGPENSAHLNECHTGDKQVWYFQYTNLLTDNGSSYCAYRCNGTEIIEYKCASNNNGTDPLQHQAMEVAKTVPNGDKIHYAKSNCPETHGCFAFY 162 T 29 RNR_inhib pdbhh F Eukaryota T 6kfc 1 A A A0A0H5BR52_9MYRI Hydroxynitrile lyase LTCDQLPKAAINPIQEFIDSNPLEFEYVLTETFECTTRIYVQPARWSTTKAPTALDIKGTQIMAYDFVGGPENSAHLNECHTGDKQVWYFQYTNLLTDNGSSYCAYRCNGTEIIEYKCASNNNGTDPLQHQAMEVAKTVPNGDKIHYAKSNCPETHGCFAFY 162 T 29 RNR_inhib pdbhh F Eukaryota T 6kfd 1 A A A0A0H5BR52_9MYRI Hydroxynitrile lyase LTCDQLPKAAINPIQEFIDSNPLEFEYVLTETFECTTRIYVQPARWSTTKAPTALDIKGTQIMAYDFVGGPENSAHLNECHTGDKQVWYFQYTNLLTDNGSSYCAYRCNGTEIIEYKCASNNNGTDPLQHQAMEVAKTVPNGDKIHYAKSNCPETHGCFAFY 162 T 29 RNR_inhib pdbhh F Eukaryota T 6kfe 1 A A A0A0H5BR52_9MYRI Hydroxynitrile lyase LTCDQLPKAAINPIQEFIDSNPLEFEYVLTETFECTTRIYVQPARWSTTKAPTALDIKGTQIMAYDFVGGPENSAHLNECHTGDKQVWYFQYTNLLTDNGSSYCAYRCNGTEIIEYKCASNNNGTDPLQHQAMEVAKTVPNGDKIHYAKSNCPETHGCFAFY 162 T 29 RNR_inhib pdbhh F Eukaryota T 6kfp 1 A A Q5ZTL4_LEGPH MavC EKTGLHVHEKIKHMVKNYGTMITGIPAEILGQNEAEISVGYVKKMGNMKENIAEVVRKSEMTQPTNSAGKASNEVCDLLLGTEGASEFEKSSYQVLSGDGSNLKGSLPNKNLLVRVEMDRFNAPQKYQKIKREEFNPETAEKNKIYLLEDQLVYLDIFGKVIDLGQTSDTCHRLFNAITTPFYQNYILYDEYIDPEESAEEAAMFEMGEIVKAKMKNIDCWTATHSFTIFVPESDSEDTRTLYPYQAYWTSHTLQQWFSGDKDEKLSRLGIDGYIEKLALLGTTTDSKIRSSIYGELFSPPGKEHVFCTGMNEKFSPLRVKFKVTEVNPEIALQNLEEVQEFIDTNYPGENAKDQCELYKIKAQEAMTKQLEMRLLIE 378 T 0.035 V-ATPase_H_N unppercent F Bacteria T 6kft 4 D D NS2_MUMIP MVM NS2 mutant Nm42 DDTVDEMTKKFGTLTIHD 18 T 0.33 DUF6118 unppssm T Viruses T 6kg6 1 A B Q5ZTL4_LEGPH MavC GPLGSEKTGLHVHEKIKHMVKNYGTMITGIPAEILGQNEAEISVGYVKKMGNMKENIAEVVRKSEMTQPTNSAGKASNEVCDLLLGTEGASEFEKSSYQVLSGDGSNLKGSLPNKNLLVRVEMDRFNAPQKYQKIKREEFNPETAEKNKIYLLEDQLVYLDIFGKVIDLGQTSDTCHRLFNAITTPFYQNYILYDEYIDPEESAEEAAMFEMGEIVKAKMKNIDCWTATHSFTIFVPESDSEDTRTLYPYQAYWTSHTLQQWFSGDKDEKLSRLGIDGYIEKLALLGTTTDSKIRSSIYGELFSPPGKEHVFCTGMNEKFSPLRVKFKVTEVNPEIALQNLEEVQEFIDTNYPGENAKDQCELYKIKAQEAMTKQLEMRLLIE 383 T 0.035 V-ATPase_H_N unppercent F Bacteria T 6kgx 3 BB,CE,M,RF 21,M4,M1,24 LR7 MAAFVVGAQVSSGFMGCAAPRKEQTRVGGAAKAAVSMRIIIQDKNSYRKYQTNTSTSKWDKLLNTKPMKRQVQPNPPTNETRALNLGNTFRSPAFKFLGTLKRSKDPSGLRLGFYGRKADDFMARSIAMQAKASAAGSGVYTTQCSEGASKGMAENARTASLAKQFRQAQRSAREMSFDYYEGRKYAMKAVGHICNYEEKIFQQYNKTAAAYVMGKQETLLSCDRYAQPANKAEEYIQKSVQMQMKKRSIPYGVYTTSCADGTVKGMAENARVAKESANFRARQMSAGAKAAARFNARRVANDWHNNGCNYEEKLTSRFPAAASSVRPTTNRY 333 T 0.022 APC_u5 pdb F T 6kgx 7 BS,EB,EG,EN,FZ,PD,RH,RK,SX,ZP AF,A2,A6,AB,AJ,Y3,A7,A9,AI,YD LR4 MAAFVSGFHGVQVGAPAENKLVCRAAKPAQLTMLTGYDSKSSPNFPNRAATRERRTVSFNARVARNKSQAKKILEKADEFFARSVTMQYKAFACPNGVYDIQCTEGTVKGAAYEKRAMAVSAAFRAKQASPAAKARALFENRRHAIIASHECQHEEDLFVRFPKLSAAYMMGKTEAMRTCSRYVVPDSLEEEYMAASVDRQMKERACPGGVYASSCVEGNAKGQAEQARVAALATAFRSAQKSASKTTAERYSSAAYGRDHFAHGCSYEESVFNTYPATAAAMRSKSYNY 290 T 0.14 Amidohydro_1 pdbpercent F T 6kgx 13 EJ,JV,KV,RL,WR,XR A8,wG,xG,AA,wE,xE LR5 MAAFVGSAASAFTGASAVKANEKRSVCSLQMVAMPQTGLVNSKFSARMAKKTAKQTKNKVDEYMARSVQRQYKQAAVATGVYGTQCTEGTVKGAAEASRSAALSRQFRIKQRSAFSKAHDLFEFRKHAIIAAAGCSYEEKMVTRFPKLAAAMVLGQTEMMRTCSRYVVPESVEEEYMAASVDKQMKRRGAPGGVYSLSCAEGVAKGQAEIARVSALGAAYRAASKSASAVTAERYNSMAYGRVHFAHGCSYEEQQFNKYPAAAAAMRSDSYGY 273 T 1.6 rRNA_methylase pdbpercent F T 6kgx 14 FJ,LV,SL,YR B8,yG,BA,yE LR8 METAFVSGFMGKAAVAKFGATAVCDKTARRSSSSNSQVHMVTGAVSSVNMRRFQRVPKVSGFSAKVTKKNVNKALDKADMFFAKSVTMEGKAAAIPYGVYGIQCMEGSAKGMAHEKRAMALSAAFRMNQRSAAEKTGAMYENRRLALILAQNDHQEKQYIKYPKLAAAALMASTEVTRACQRYAVPESIEEEFLAASVDKVNKMRGTTASGVYKSSCVEGNAKGQAEQARVAALAVAFRSAQKSASQFAAERYAQSKYGRDLFSSTHFEEGYANTYPAMAAAKRASSYGY 290 T 0.011 THF_DHG_CYH pdb F T 6kgx 23 MW,OX ZH,2H LRC4 MAFVACGPLRAGEGGARLGARKAACSMQLAPPGIPPGEDARNNQSLRQYVARPVETYQKRSFATPLPLTWTGETETVGAFDVVVPPQEKDLPVSGEATSAFVKYSDMVRAERKAALQALLSASAAGEGRPTCGAEGRKFVSNANPVLVNGVKCVEYWRK 159 T 14 CRM1_repeat unphh F T 6kgx 24 NW,PX aH,3H LRC5 MAFVSGAGVAVPAGAKASAPLCALRMSGYGDYSYSTDRTKGHVNQYYVDKARSRSDWGNRNVLPASEGDAVLGRTAKGAVAVPEFGIPQLDDPVLGFGPDSMVDPRIAEADGAVWRWDAGFVDESMTLASCADISDEAVADEAFAKFRGSVLAERGAMITKAESATASVITSLRDGLYSGEAQLLTASGQRLANVAGQEKIATISGYTWDGQPQTEIPGKPFVKSIGAMDYMDGVEGGDVVAAKVGAFWKPKAPKEVPYKRPMGANTPELPYNTVPRLVQAAGLAVQE 288 T 14 DUF5953 pdbhh F T 6khi 16 P P V5V507_9CYAN proton-translocating NADH-quinone dehydrogenase subunit P MDAVISVKPILLAMTPVFILLCLFFGTRNGFYDTDQYHGNGSAH 44 T 0.12 SPC25 unppercent F Bacteria T 6khi 17 Q Q V5V791_9CYAN proton-translocating NADH-quinone dehydrogenase subunit Q MATDFNRGIMKFDGADSPAMIAISAVLILGFIAALIWWALHTAYA 45 T 0.019 FixS pdb F Bacteria T 6khj 16 P P V5V507_9CYAN proton-translocating NADH-quinone dehydrogenase subunit P MDAVISVKPILLAMTPVFILLCLFFGTRNGFYDTDQYHGNGSAH 44 T 0.12 SPC25 unppercent F Bacteria T 6khj 17 Q Q V5V791_9CYAN proton-translocating NADH-quinone dehydrogenase subunit Q MATDFNRGIMKFDGADSPAMIAISAVLILGFIAALIWWALHTAYA 45 T 0.019 FixS pdb F Bacteria T 6ki9 1 A,B,C A,B,C A0A1C9HA64_9BACT FabMG, novel types of Enoyl-acyl carrier protein reductase MKSPIPLRDVPQSNIFRKGDVFVLFGELFGRGYANGLINEARDAGMTIVGITVGRRDENNALRALTAEELATAEANLGGRIINVPLMAGFDLDAPAGEPTPTDLLADMTLKSWQDDKLDWAHIEKCRAVGVQRFKDGVAKVMAELDGMIPDGANAFFAHTMAGGIPKVKVFLAIANRIYKGRGERFLSSSALLNSDLGKLILMNFDEVTANTFLHLIEGSAAIRARLEKSGGQVRYSAYGYHGTEILIDDKYQWQTYTSYTQGKAKMRLERIAEDAWKQGIKATVYNCPEIRTNSSDIFVGVELSLFPLLKALKKENGGAWAEAQWQACREVLSEGHTLESLLQKIDDYNASDVMKGFRNFEAWPMPNTAELADIMIGTSDEITKMHKSRDALVTDVLSALVLEGTGPLMFHESSNPAGPVLWLSHDVIAKQLNLMHRLEHHHHHH 446 T 0.16 DUF1566 pdb F Bacteria T 6kia 1 A,B,C A,B,C A0A1C9HA64_9BACT FABMG MKSPIPLRDVPQSNIFRKGDVFVLFGELFGRGYANGLINEARDAGMTIVGITVGRRDENNALRALTAEELATAEANLGGRIINVPLMAGFDLDAPAGEPTPTDLLADMTLKSWQDDKLDWAHIEKCRAVGVQRFKDGVAKVMAELDGMIPDGANAFFAHTMAGGIPKVKVFLAIANRIYKGRGERFLSSSALLNSDLGKLILMNFDEVTANTFLHLIEGSAAIRARLEKSGGQVRYSAYGYHGTEILIDDKYQWQTYTSYTQGKAKMRLERIAEDAWKQGIKATVYNCPEIRTNSSDIFVGVELSLFPLLKALKKENGGAWAEAQWQACREVLSEGHTLESLLQKIDDYNASDVMKGFRNFEAWPMPNTAELADIMIGTSDEITKMHKSRDALVTDVLSALVLEGTGPLMFHESSNPAGPVLWLSHDVIAKQLNLMHRLEHHHHHH 446 T 0.16 DUF1566 pdb F Bacteria T 6kir 1 A A CX040_MOUSE Uncharacterized protein CXorf40 homolog GGSMKFPCLSFRQPYAGLILNGVKTLETRWRPLLSSVQKYTIAIHIAHKDWEDDEWQEVLMERLGMTWTQIQTLLQAGEKYGRGVIAGLIDIGETFQCPETLTAEEAVELETQAVLTNLQLKYLTQVSNPRWLLEPIPRKGGKDIFQVDIPEHLIPLEKE 160 T 0.00014 ASCH unp F Eukaryota T 6kis 1 A A CX040_MOUSE Uncharacterized protein CXorf40 homolog GGSMKFPCLSFRQPYAGLILNGVKTLETRWRPLLSSVQKYTIAIHIAHKDWEDDEWQEVLMERLGMTWTQIQTLLQAGEKYGRGVIAGLIDIGETFQCPETLTAEEAVELETQAVLTNLQLKYLTQVSNPRWLLEPIPRKGGKDIFQVDIPEHLIPLEKE 160 T 0.00014 ASCH unp F Eukaryota T 6kit 1 A A CX040_MOUSE Uncharacterized protein CXorf40 homolog SMKFPCLSFRQPYAGLILNGVKTLETRWRPLLSSVQKYTIAIHIAHKDWEDDEWQEVLMERLGMTWTQIQTLLQAGEKYGRGVIAGLIDIGETFQCPETLTAEEAVELETQAVLTNLQLKYLTQVSNPRWLLEPIPRKGGKDIFQVDIPEHLIPLEK 157 T 0.00014 ASCH pdb F Eukaryota T 6kj1 1 A A FUS_HUMAN FUS LC RAC1 SYSGYS 6 T 4.4 DUF6156 pdbhh F Eukaryota F 6kj2 1 A A FUS_HUMAN FUS LC RAC1 SYSGYS 6 T 4.4 DUF6156 pdbhh F Eukaryota F 6kj3 1 A A FUS_HUMAN FUS LC RAC1 SYSGYS 6 T 4.4 DUF6156 pdbhh F Eukaryota F 6kj4 1 A A FUS_HUMAN FUS LC RAC1 SYSGYS 6 T 4.4 DUF6156 pdbhh F Eukaryota F 6kl4 1 A A Q5ZTL4_LEGPH MavC EKTGLHVHEKIKHMVKNYGTMITGIPAEILGQNEAEISVGYVKKMGNMKENIAEVVRKSEMTQPTNSCGKASNEVCDLLLGTEGASEFEKSSYQVLSGDGSNLKGSLPNKNLLVRVEMDRFNAPQKYQKIKREEFNPETAEKNKIYLLEDQLVYLDIFGKVIDLGQTSDTCHRLFNAITTPFYQNYILYDEYIDPEESAEEAAMFEMGEIVKAKMKNIDCWTATHSFTIFVPESDSEDTRTLYPYQAYWTSHTLQQWFSGDKDEKLSRLGIDGYIEKLALLGTTTDSKIRSSIYGELFSPPGKEHVFCTGMNEKFSPLRVKFKVTEVNPEIALQNLEEVQEFIDTNYPGENAKDQCELYKIKAQEAMTKQLEMRLLIE 378 T 0.035 V-ATPase_H_N unppercent F Bacteria T 6klm 1 A A Roseltide rT7 CVSSGIVDACSECCEPDKCIIMLPTWPPRYVCSV 34 T 2.4 Benyvirus_14KDa pdbhh F T 6klp 1 A,B B,A Tropomyosin XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 135 F F F 6klq 1 A,B A,B Tropomyosin XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 135 F F F 6kls 3 C,F C,F O66458_AQUAE Cytochrome c MNTWGLIKTIFFAGSTLVFFFLLWFYNPFKHVEHYEVDEEVKAIIDNPWKKTESGKTIAEEGRELFIASCSSCHSLRYDGIYIMSVAANPKWKNIEKTSGRPVYRFGTLYKDRFFVPKDVYEAFAHDDIQGLKASLGQVPPDLSSMYLARGEGYLYQFILNPQKVLPGTTMPQLFNPQFDPQAKEKVAKIVAYMKSVNTPPPKESAKRTVMGVIVIAYFIVMGLLLWKYRENLLKRLGYH 240 T 7E-06 Cytochrom_C1 pdbpssm F Bacteria T 6klv 3 C,F C,F O66458_AQUAE Cytochrome c MNTWGLIKTIFFAGSTLVFFFLLWFYNPFKHVEHYEVDEEVKAIIDNPWKKTESGKTIAEEGRELFIASCSSCHSLRYDGIYIMSVAANPKWKNIEKTSGRPVYRFGTLYKDRFFVPKDVYEAFAHDDIQGLKASLGQVPPDLSSMYLARGEGYLYQFILNPQKVLPGTTMPQLFNPQFDPQAKEKVAKIVAYMKSVNTPPPKESAKRTVMGVIVIAYFIVMGLLLWKYRENLLKRLGYH 240 T 7E-06 Cytochrom_C1 pdbpssm F Bacteria T 6km7 2 C,D C,D RBBP5_HUMAN RBBP-5,RETINOBLASTOMA-BINDING PROTEIN RBQ-3 DEELEDSKALLYLPIAPEVEDPEENPYGPPPDGSQPPKKKPKTTNIELQGVPNDEVHPLLGVKGDGKSK 69 T 0.014 DUF2457 unppercent F Eukaryota T 6kmh 2 C,D C,D APBA1_RAT ADAPTER PROTEIN X11ALPHA,NEURON-SPECIFIC X11 PROTEIN,NEURONAL MUNC18-1-INTERACTING PROTEIN 1,MINT-1 GPGSEFQRYSKEKRDAISLAIKDIKEAIEEVKTRTIRSPYTPDEPKEPIWVMRQDISPTRDCDDQR 66 T 25 GCIP pdbhh F Eukaryota T 6kmy 1 A A Czon1107-P5A GFRSACPPFCX 11 T 0.088 Pellino pdbhh F T 6kn2 1 A A CX07_CONZO Czon1107-WT (Conformer A) GFRSPCPPFCX 11 T 0.098 Peroxidase_2 unphh F Eukaryota T 6kn3 1 A A Czon1107-WT(Conformer B) GFRSPCPPFCX 11 T 0.098 Peroxidase_2 unphh F T 6kno 1 A A Czon1107-P7A(minor conformer) GFRSPCAPFCX 11 T 3.1 Chlam_OMP3 pdbhh F T 6knp 1 A A Czon1107-P7A(major) GFRSPCAPFCX 11 T 3.1 Chlam_OMP3 pdbhh F T 6kny 1 A,B A,B B2UR41_AKKM8 Protein Amuc_1100 IVNSKRSELDKKISIAAKEIKSANAAEITPSRSSNEELEKELNRYAKAVGSLETAYKPFLASSALVPTTPTAFQNELKTFRDSLISSCKKKNILITDTSSWLGFQVYSTQAPSVQAASTLGFELKAINSLVNKLAECGLSKFIKVYRPQLPIETPANNPEESDEADQAPWTPMPLEIAFQGDRESVLKAMNAITGMQDYLFTVNSIRIRNERMMPPPIANPAAAKPAAAQPATGAASLTPADEAAAPAAPAIQQVIKPYMGKEQVFVQVSLNLVHFNQPKAQEPSED 287 T 0.019 T2SSM unphh F Bacteria T 6ko2 2 B B H2AZ_HUMAN histone H2A.Z peptide GXAGXDS 7 T 120 PduV-EutP pdbhh F Eukaryota F 6kpb 2 B B IDD10_ARATH ID1-LIKE ZINC FINGER PROTEIN 3,PROTEIN INDETERMINATE-DOMAIN 10 SPMSATALLQKAAQMGS 17 T 1.9 fvmX5 pdbhh F Eukaryota T 6kpd 2 B B IDD9_ARATH ID1-LIKE ZINC FINGER PROTEIN 1,PROTEIN INDETERMINATE-DOMAIN 9 GPQIASMSATALLQKAAQMGSKRSSSSSSNSKTFGLMT 38 T 0.41 Vfa1 unppssm F Eukaryota T 6ks5 1 A,B A,B Q5ZV21_LEGPH Type IV secretion protein Dot MHTKKDKKVISLQERVENAVDVSGAFDNCFFHNFALYLLTNNLPLPDDLFHFKSIINRNSKAEQLFEFFHNPESLNLFSILDKENDVSEPSGYLFEKSLILGFLLREWFPTQLVNNSAVKAEMLEGEKGVFSAFKNYKEYRSFMSKEELKSTEFGALYEANEAFLEYFYNRSESTLINKDSPFEKYFVGSSSDEEAIKNYWDAEGYTLYCQHLAKPQVKLSYIEIMTMMKVINQPLTIYDRSTSSIVAEYVNPKVNLPDFEVAIDALQGHYFLLKTEETEKELEEYERSYAQYKRDRSEILAHSDKPVSSLLVRATCPKGHLDEDPFIALIESLSEINSLSQIDTNLKNENT 352 T 0.25 Ycf54 pdbpssm F Bacteria T 6kto 2 C C SHLD3_HUMAN REV7-INTERACTING NOVEL NHEJ REGULATOR 1,SHIELD COMPLEX SUBUNIT 3 MTTEVILHYRPCESDPTQLPKIAEKAIQDFPTRPLSRFIPWFPYDGSKLPLRPKRSPPVISEEA 64 T 6.8 Sm_like pdbhh F Eukaryota T 6kto 3 D D SHLD2_HUMAN PROTEIN FAM35A,RINN1-REV7-INTERACTING NOVEL NHEJ REGULATOR 2,SHIELD COMPLEX SUBUNIT 2 MGMSGGSQVHIFWGAPIAPLKITVSEDTASLMSVADPWKKIQLLYSQHSLYLKD 54 T 3 LPD38 pdbhh F Eukaryota T 6ku0 2 B,D B,D MICA1_HUMAN MOLECULE INTERACTING WITH CASL PROTEIN 1,MICAL-1,NEDD9-INTERACTING PROTEIN WITH CALPONIN HOMOLOGY AND LIM DOMAINS GPGSQPTRRQIRLSSPERQRLSSLNLT 27 T 3.1 DUF3156 pdbhh F Eukaryota T 6kva 3 E,F B,b CXCR2_HUMAN CXCR2 PEPTIDE DSFEDFWKGED 11 T 1.1 LRR_3 pdbhh F Eukaryota T 6kvf 3 E,F b,B CXCR2_HUMAN CXCR2 PEPTIDE DSFEDFWKGED 11 T 1.1 LRR_3 pdbhh F Eukaryota T 6kw3 15 T E HTL1_YEAST CHROMATIN STRUCTURE-REMODELING COMPLEX PROTEIN HTL1 MSQNNTISSMNPERAYNNVTLKNLTAFQLLSQRENICELLNLVESTERHNSIINPERQRMSLEEMKKMLDALKNERKK 78 T 0.064 DUF4976 pdbpercent F Eukaryota T 6kw4 15 S E HTL1_YEAST CHROMATIN STRUCTURE-REMODELING COMPLEX PROTEIN HTL1 MSQNNTISSMNPERAYNNVTLKNLTAFQLLSQRENICELLNLVESTERHNSIINPERQRMSLEEMKKMLDALKNERKK 78 T 0.064 DUF4976 pdbpercent F Eukaryota T 6kw5 8 I E HTL1_YEAST CHROMATIN STRUCTURE-REMODELING COMPLEX PROTEIN HTL1 MSQNNTISSMNPERAYNNVTLKNLTAFQLLSQRENICELLNLVESTERHNSIINPERQRMSLEEMKKMLDALKNERKK 78 T 0.064 DUF4976 pdbpercent F Eukaryota T 6kwn 3 C C NRAM_I96A0 peptide NSDTVGWSW 9 T 2.2 DUF4902 pdbhh T Viruses T 6kwo 3 C C NRAM_I96A0 peptide ESDTVGWSW 9 T 2 CDC24_OB1 pdbhh T Viruses T 6kx1 3 C C MUC1_HUMAN Synthetic MUC1 glycopepide XVTSAPDTRPAPGSTA 16 T 32 DUF3235 pdbhh F Eukaryota T 6kx9 3 C C 8-pepide (ARG-ARG-ALA-LEU-ARG-GLU-GLY-TYR) RRALREGY 8 T 4.6 RNR_inhib pdbhh F T 6kxx 2 B B PRGC1_HUMAN PGC1alpha PQEAEEPSLLKKLLLAPANTQL 22 T 4.1 Apo-CIII pdbhh F Eukaryota T 6kxy 2 B B PRGC1_HUMAN PGC1alpha PQEAEEPSLLKKLLLAPANTQL 22 T 4.1 Apo-CIII pdbhh F Eukaryota T 6kyf 1 A A AcrF11 GPLGSMSMELFHGSYEEISEIRDSGVFGGLFGAHEKETALSHGETLHRIISPLPLTDYALNYEIESAWEVALDVAGGDENVAEAIMAKACESDSNDGWELQRLRGVLAVRLGYTSVEMEDEHGTTWLCLPGCTVEKI 137 T 0.027 Strep_his_triad pdbpssm F T 6kyu 3 C C peptide LRKRQLTVL 9 T 5.9 FAM181 pdbhh F T 6kz1 2 B B MYO15_HUMAN Myosin XVa EKRLTLPPSEITLL 14 T 0.36 DUF4875 pdbhh F Eukaryota T 6kza 2 C,D C,D DNAC_ECOLI DNA replication protein DnaC MKNVGDLMQRLQKMMPAHIKPAFKTGEELLAWQKEQGAIRSAALERENRAMKMQRT 56 T 0.014 DUF6434 pdbpssm F Bacteria T 6kzf 2 B B D-calcicludine XXXXXXXXXXXXXGXXXXXXXXXXXXXXXXXXXXXXXXGXGGXXXXXXXXGXXXXXXXGX 60 F F F 6kzg 2 B,D C,Q CIC_HUMAN 8-MER FROM PROTEIN CAPICUA HOMOLOG RSMSETGT 8 T 1.6 Tis11B_N pdbhh F Eukaryota T 6kzh 2 B,D C,Q CIC_HUMAN 8-MER FROM PROTEIN CAPICUA HOMOLOG RTQSLSAL 8 T 15 APC_r pdbhh F Eukaryota T 6kzj 1 A A ANK2_HUMAN ANK-2,ANKYRIN-B,BRAIN ANKYRIN,NON-ERYTHROID ANKYRIN GPGSASPDLLSEVSEMKQDLIKMTAILTTDVSDKAGSIKVKELVKAAEEEPGEPFEIVERVKEDLEKVNEILRSGT 76 T 15 PKHD_C pdbhh F Eukaryota T 6kzu 2 B B 2JN-DAL-E03-DTY-2JN-DSG-TDF-DGL-MK8-DLE-DLE-2JN XXXXXXXXXXXX 12 F F F 6l00 1 A A S5MRN1_9CAUD Lysin MFIYYKRTKQGSTEQWFVIGGKRIYLPTMTYVNEANDLIKRYGGNTNVTTYNHDNFGLKMMEAALPQVKV 70 T 0.022 Sial-lect-inser pdb T Viruses T 6l0o 1 A A MCM8_HUMAN MINICHROMOSOME MAINTENANCE 8 MARSMSNRSTAKRFISALNNVAERTYNNIFQFHQLRQIAKELNIQVADFENFIGSLNDQGYLLKKGPKVYQLQTMHHHHHH 81 T 0.0086 TMP_3 pdbpercent F Eukaryota T 6l0v 2 B,D,F,H B,D,F,H Q5XVG3_ARATH LZY3 GPKWVKTDSDFIVLEI 16 T 1.3 DUF2286 pdbhh F Eukaryota T 6l0w 2 B,D B,D Q5XVG3_ARATH LZY3 GPKWVKTDSDFIVLEI 16 T 1.3 DUF2286 pdbhh F Eukaryota T 6l1f 1 A A DNMT1_HUMAN the K142me1 DNMT1 peptide RSKSDG 6 T 10 DUF1645 pdbhh F Eukaryota F 6l2w 1 A,B A,B A0A4Y5TR47_9CAUD freshwater cyanophage protein MMFVRLSYHSFDYLFNLFDAGVIDLNTKCPVSLSEIEDYDNFGWLELTAENLENVCEYCAKLGIEANGSLGDFRYWYSGDMSYHLELKSDQSENLEVKIREINLKLKELELIKNECLEHHHHHH 124 T 0.036 RNA_pol_Rpb1_1 pdb T Viruses T 6l4u 11 K 1u Unknown protein 1 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 130 F F F 6l4u 12 L 2u Photosystem I reaction center subunit Psa28 FAPMPRAAISTTQARTASMPSAPFTSLSMASEDMTWEGEYPPSKVLGPIMSKMPSGLLGLISIACAAVCAYSIAQSGVLQQQPGAYENGSWVKWYYVLGSFGGPLAWGTHVASWIQRKNGM 121 T 0.02 PIG-Y pdbpercent F T 6l5z 2 B C CYCLOPEPTIDE INHIBITOR XXAXXX 6 T 3100 MMR_HSR1_Xtn pdbhh F F 6l63 2 B,D B,D F3 XFAYDRRXLSNNXRNYXG 18 T 8 E2 pdbhh F T 6l65 2 B C PRO-ARG-LYS-GLN-LEU PRKQL 5 T 86 DUF2477 pdbhh F F 6l66 2 B C PRO-ARG-LYS-GLN-LEU-ALA PRKQLA 6 T 89 DUF3597 pdbhh F T 6l6g 1 A,B A,B Q5ZZ22_LEGPH Uncharacterized protein Lpg0189 NSDNNTDGLIFSPLPQNKNTVVRHYSNEQEMPNLSQMAQRTIDFPTQIVRVSGNLTGLELSCDDVENEIDQVFSKKISPNLFTYNTYVSCGYDVNDPEQHAINFSIQSYFDPLTDNAVDYLKSYLKEYNGYNLFNTTTLQIENAKGIIVSMNLNAGLKSNPDKTPFTLYRQDRNNFYFKSNFDVRKELISDIYQRFYSNDPDMILPFFDKWIFSYAGSVYYSILMASNYLELQPERIFVMENEGDIFVSDLRYYFANLCMKRNPNKHCL 269 T 0.02 DUF5012 unppercent F Bacteria T 6l6h 1 A,B A,B Q5ZZ22_LEGPH Uncharacterized protein Lpg0189 NSDNNTDGLIFSPLPQNKNTVVRHYSNEQEMPNLSQMAQRTIDFPTQIVRVSGNLTGLELSCDDVENEIDQVFSKKISPNLFTYNTYVSCGYDVNDPEQHAINFSIQSYFDPLTDNAVDYLKSYLKEYNGYNLFNTTTLQIENAKGIIVSMNLNAGLKSNPDKTPFTLYRQDRNNFYFKSNFDVRKELISDIYQRFYSNDPDMILPFFDKWIFSYAGSVYYSILMASNYLELQPERIFVMENEGDIFVSDLRYYFANLCMKRNPNKHCL 269 T 0.02 DUF5012 unppercent F Bacteria T 6l6v 1 A A GP44_BPSP1 GP44, GENE 44 PROTEIN MAKSNNVYVVNGEEKVSTLAEVAKVLGVSRVSKKDVEEGKYDVVVEEAAVSLADT 55 T 0.012 HTH_31 pdb T Viruses T 6l71 2 B C PRO-ARG-LYS-GLN-LEU-ALA PRKQLA 6 T 89 DUF3597 pdbhh F T 6l7c 3 AA,S,T,U,V,W,X,Y,Z a,S,T,U,V,W,X,Y,Z CSGA_ECOLI Major curlin subunit CsgA GVVPQYGGGGNHGGGGNNSGPN 22 T 0.35 YjbE unphh F Bacteria T 6l7i 1 A,B,C,D,E A,B,C,D,E Q9RN43_PHOLU TOXIN A RALEVERTVSLAEVYAGLPKDNGPFSLAQEIDKLVSQGSGSAGSGNNNLAFGAGTDTKTSLQASVSFADLKIREDYPASLGKIRRIKQISVTLPALLGPYQDVQAILSYGDKAGLANGCEALAVSHGMNDSGQFQLDFNDGKFLPFEGIAIDQGTLTLSFPNASMPEKGKQATMLKTLNDIILHIRYTIK 190 T 4.3999999999999996E-48 TcA_TcB_BD unppssm F Bacteria T 6l7i 4 H H Q93EP1_PHOLU TccC2 APEKGKYTKEVNFFDE 16 T 0.1 Ntox47 unphh F Bacteria T 6l7o 16 P P V5V507_9CYAN NAD(P)H-quinone oxidoreductase subunit P MDAVISVKPILLAMTPVFILLCLFFGTRNGFYDTDQYHGNGSAH 44 T 0.12 SPC25 unppercent F Bacteria T 6l7o 17 Q Q V5V791_9CYAN NAD(P)H-quinone oxidoreductase subunit Q MATDFNRGIMKFDGADSPAMIAISAVLILGFIAGLIWWALHTAYA 45 T 0.019 FixS unp F Bacteria T 6l7p 16 P P V5V507_9CYAN NAD(P)H-quinone oxidoreductase subunit P MDAVISVKPILLAMTPVFILLCLFFGTRNGFYDTDQYHGNGSAH 44 T 0.12 SPC25 unppercent F Bacteria T 6l7p 17 Q Q V5V791_9CYAN NAD(P)H-quinone oxidoreductase subunit Q MATDFNRGIMKFDGADSPAMIAISAVLILGFIAGLIWWALHTAYA 45 T 0.019 FixS unp F Bacteria T 6l7q 1 A A F8AFT0_PYRYC hypothetical protein MMLLTRHAKERIAKRLAKKRSLSHIYSSLWAFLERAVRIEIAEGVVAFTDGRKTLVCVPLDCERLSRGEILEKVRGVGVYECIFPEGRLAKLTRPEKFLESVPPGEYYFYMNDEKKVLYVGKRRPLLAITFRPAKRDERLFYIWA 145 T 1.2 DUF4258 pdbhh F Archaea T 6l7r 1 A A G0SED1_CHATD GCP3 GPLGSMQRINNAIDSLIGHLVPAAAGDDDDARTRRQAVFDLVRALLEQPGSNIPSDVNHASDLIKRRLISTNPSQALRFSNLYTRLLALPVLNQKWAILYLLHQLAD 107 T 0.068 DUF6415 pdb F Eukaryota T 6l8r 1 A A PD1L1_HUMAN HPD-L1,B7 HOMOLOG 1,B7-H1 GPRLRKGRMMDVKKCGIQDTNSKKQSDTHLEET 33 T 0.14 ASFV_J13L unphh F Eukaryota T 6l9k 2 B Q SER-PRO-SER-TYR-ALA-TYR-HIS-GLN-PHE SPSYAYHQF 9 T 8.4 F-112 pdbhh F T 6l9l 4 D,H B,F SER-PRO-SER-TYR-ALA-TYR-HIS-GLN-PHE SPSYAYHQF 9 T 8.4 F-112 pdbhh F T 6l9m 3 C,F,I,L C,F,I,L SER-PRO-SER-TYR-VAL-TYR-HIS-GLN-PHE SPSYVYHQF 9 T 0.51 DIPSY pdbhh F T 6l9n 3 C,F,I,L C,F,I,L SER-PRO-SER-TYR-ALA-TYR-HIS-GLN-PHE SPSYAYHQF 9 T 8.4 F-112 pdbhh F T 6lad 1 A,B,C,D,E,F A,B,C,D,E,F B2UR41_AKKM8 Amuc_1100 MIVNSKRSELDKKISIAAKEIKSANAAEITPSRSSNEELEKELNRYAKAVGSLETAYKPFLASSALVPTTPTAFQNELKTFRDSLISSCKKKNILITDTSSWLGFQVYSTQAPSVQAASTLGFELKAINSLVNKLAECGLSKFIKVYRPQLPIETPANNPEESDEADQAPWTPMPLEIAFQGDRESVLKAMNAITGMQDYLFTVNSIRIRNERMMPPPIANPAAAKPAAAQPATGAASLTPADEAAAPAAPAIQQVIKPYMGKEQVFVQVSLNLVHFNQPKAQEPSEDLEHHHHHH 296 T 0.019 T2SSM unphh F Bacteria T 6laf 1 A,B A,B B2UR41_AKKM8 Amuc_1100 MSLETAYKPFLASSALVPTTPTAFQNELKTFRDSLISSCKKKNILITDTSSWLGFQVYSTQAPSVQAASTLGFELKAINSLVNKLAECGLSKFIKVYRPQLPIETPANNPEESDEADQAPWTPMPLEIAFQGDRESVLKAMNAITGMQDYLFTVNSIRIRNERMMPPPIANPAAAKPAAAQPATGAASLTPADEAAAPAAPAIQQVIKPYMGKEQVFVQVSLNLVHFNQPKAQEPSEDLEHHHHHH 246 T 0.019 T2SSM unphh F Bacteria T 6lar 3 D,I F,J ECCC3_MYCS2 ESX CONSERVED COMPONENT C3,TYPE VII SECRETION SYSTEM PROTEIN ECCC3,T7SS PROTEIN ECCC3 MSRLIFEHQRRLTPPTTRKGTITIEPPPQLPRVVPPSLLRRVLPFLIVILIVGMIVALFATGMRLISPTMLFFPFVLLLAATALYRGGDNKMRTEEVDAERADYLRYLSVVRDNVRAHAAEQRAALEWSHPEPEVLATIPGTRRQWERDPRDRDFLVLRAGRHDVPLDAALKVKDTADEIDLEPVAHSALRGLLDVQRTVRDAPTGLDVAKLARITVIGEADEARAAIRAWIAQAVTWHDPTMLGVALAAPDLESGDWSWLKWLPHVDVPNEADGVGPARYLTTSTAELRERLAPALADRPLFPAESGAALKHLLVVLDDPDADPDDIARKPGLTGVTVIHRTTELPNREQYPDPERPILRVADGRIERWQVGGWQPCVDVADAMSAAEAAHIARRLSRWDSNPGYIRSTSTGSATFTTLLGIPDASALDVHLGGIKAFHHHHHHHHHH 449 T 0.0012 TauE pdbpercent F Bacteria T 6lbu 2 B B Q6CNW4_KLULA TEN1 GSSKIITDLDTIAGKIEEYAGDTLLRLRIFAQFQDISHSHERTDGIYLHFSNVPDFNAETNRERSYYFLIDETIYDEAFINTKSGERPHKGDILDMRCCYRKYDKVVEIMHLKVISIADLDSLREFLAKADDDSEIRSFLR 141 T 1.4 DUF2059 unphh F Eukaryota T 6lcn 1 A,B,C,D,E,F A,B,C,D,E,F D5SUT9_PLAL2 Serine O-acetyltransferase MGSSHHHHHHSSGLVPRGSHMATDLRLKDQLPEITDRIVESYRDFATTHHLGHCPLPSSEAVYEIAQDLQEILFPGYRRRQNLHMGNVTYHVGDLVDSLHDRLTQQIARALRHDYRRQHGISCADEVSHDFEALAQAKTITLLELLPRLRRTLALDVQAAFDGDPAAGSLDEIIFCYPGLHAVTIYRLAHELYLLDVPLIPRMLTEWAHSQTGIDIHPGATIGHSFFIDHGTGVVIGETCEIANHVKLYQGVTLGALSFPKDEQGNLLRRHKRHPTIEDHVVIYANATVLGGETVIGSHAVIGSSVSLSHSVPPNTIVTIEKPSLRYREAS 331 T 3.1E-10 DUF4954 pdbhh F Bacteria T 6ldv 3 C P M3K2_HUMAN GLY-M3L-GLY-GLY-THR-TYR-PRO NPIFEKFGKGGTYP 14 T 1.8 Thrombin_light pdbhh F Eukaryota T 6ldw 3 E,F D,C M3K2_HUMAN ILE-PHE-GLU-LYS-PHE-GLY-M3L-GLY-GLY NPIFEKFGKGGTYP 14 T 1.8 Thrombin_light pdbhh F Eukaryota T 6ldx 3 C B M3K2_HUMAN GLY-M3L-GLY-GLY-THR-TYR-PRO NPIFEKFGKGGTYP 14 T 1.8 Thrombin_light pdbhh F Eukaryota T 6ldy 3 C,F C,M M3K2_HUMAN Methylated peptide NPIFEKFGKGGTYP 14 T 1.8 Thrombin_light pdbhh F Eukaryota T 6lek 1 A A Q9GRC4_MEGRO Cement protein-20k MAHEEDGVCNSNAPCYHCDANGENCSCNCELFDCEAKKPDGSYAHPCRRCDANNICKCSCTAIPCNEDHPCHHCHEEDDGDTHCHCSCEHSHDHHDDDTHGECTKKAPCWRCEYNADLKHDVCGCECSKLPCNDEHPCYRKEGGVVSCDCKTITCNEDHPCYHSYEEDGVTKSDCDCEHSPGPSEHHHHHH 191 T 0.7 Inhibitor_I53 unphh F Eukaryota T 6lfn 1 A,B A,B LpCGTb GMSPPAPADVVSSAKPHVAVIPAAGMGHLNPTLRLAGELASRGCVVTFINPSPPVSLAEATSVAEFVASTPGVRLLDLPVQPLDPSCFPAHEDPFLRQFEAVRRSAPLLTPLLSDVSPPLAAIVCDIAICSTFLTVAAEISLPAYVFFSLSAQMLSLNLAFPTVADQVYGAGEGDEIRFPGLPESIPRSWLPPPLLDPAHLFAVHFVENGKAMPRAAGILVHSWEALEPEALAALRGGRVLAGLPPVLPIGPLYQKEKSNAVFLPWLDAQRDRSVLFVCFGNRSTHSPEQLREMAAGLERSGCRFVWVLKTKVVDKDEDEGAQKEILGEGYLERVKERGVVINGWVDQMTILSHRAVGGFFSHSGSSSVAEAAIGGQPLLLWPMGGDQRMSALVAERRGMGVWPRGWGWSADDKLIPGEEIARRIKDFMGDNALRAVAAKMKKETASAMAPGGSKDQWFDDFIARINRV 469 T 4E-27 UDPGT pdbhh F T 6lfz 1 A,B A,B SbCGTb GMSKSENAGQRPHVAVFPCAGMGHLLPYLRLAAMLHSRGCAVSVISAHPTISDAESRSLSSFFSLYPQIRSLEIQLLPLKRNPRFTNDDPFFIQRESIGNSIHLLRPLLASLSPPLSAIFVDFPVLTEFSPIAADFSLPTYTLIVTSARFFSLMAHLPRLLEQEDDISKKSEVCVPHLDPIQVSSIPPQMLDRRHFFVETITSNVASLSYLKGVLINTFTWLEPEAVEALKRNGVDHILPIGPLEAIKAEESDMDLPWLEEQAPKSVLFISFGSRGAHTKEQLREFAAALEKSGWRFLWVLKSGKVDREDKEETEDILGSSFLERTKNRGVVIKGWADQERILAHSAIGGFVSHCGWNSVVEAAKLGVPVLAWPPHGDQRVNAEVVEKVGLGLWVRGWGWAGERLIGRDEIAEKLIELRNDERLRERVKEVREKAREERESGGISETLIRDLIHSLKIK 459 T 9.6E-27 UDPGT pdbhh F T 6lg0 1 A,B,C,D,E,F A,B,C,D,E,F SbCGTa GMASTTKSENVGAHIALFPCAGMGHLLPFLRLAAMLDARGCAVTVITVKPTVSAAESDHLSAFFTIHPRITRLEFQLLPYQKSGLRNDDPFFIQMETIATSVHLLRPLLSSLSPPLSAIVSDFTLTSQVTDLVSDLPISTYTLMTSSAAFFCLMAYLPKLLQIDVANRDAIEIPDLGPISMSSIPPKMLDPSDFFSAFISSNVSSLHKVKGVLINTFNSFESEAIEAVRRNGVDHILPIGPLESYDAKKAHDLPWLDEQPPESVLFVSFGSRTALSKEQIRELGAALEKSGCRFLWVLKGGKVDKEDKEEVEDMLGASFVERTKKKGLIVKGWVKQEQILAHPAIGGFVSHCGWNSVIEAARLGVPVLAWPQHGDQSVNAGVVEKAGLGLWVREWGWGQTKLIGREEIAEKMIEVMQDEKLRVSAGEVRAKAKETREVDGDSEALLQRLIHSFNNITQNS 460 T 2.7E-26 UDPGT pdbhh F T 6lhf 3 C,H C,F RY0808 peptide RRREQTDY 8 T 15 HEPN_SAV_6107 pdbhh F T 6lhh 3 C C ARG-ARG-ARG-GLU-GLN-THR-ASP-TYR RRREQTDY 8 T 15 HEPN_SAV_6107 pdbhh F T 6lhv 1 A C Fanconi anemia complementation group G XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 400 F F F 6lhv 2 B,C B,A Fanconi anemia complementation group A XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 800 F F F 6lhw 1 A,B,C C,B,A Fanconi anemia complementation group A XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 1400 F F F 6ljk 2 B B BE2-SER-ALA-ILE-LYS-SER-NIY-GLY-SET XSAIKSXGX 9 T 2.8 LAG1-DNAbind pdbhh F T 6ljm 2 B B ACE-SER-LEU-GLY-SLL XSLGX 5 T 430 DUF295 pdbhh F F 6ljn 2 B B ACE-HIS-PHE-SER-SLL XHFSX 5 T 190 DUF724 pdbhh F F 6lk8 12 O K Nup214 complex Coiled-coil region 1, helix 1 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 69 F F F 6lk8 13 P L Nup214 complex coiled coil region 1, helix 2 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 80 F F F 6lk8 14 Q M Nup214 complex coiled coil region 1, helix 3 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 73 F F F 6lk8 15 R N Nup214 complex Coiled coil region 2, helix 1 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 31 F F F 6lk8 16 S O Nup214 complex Coiled coil region 2, helix 2 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 35 F F F 6lk8 17 T P Nup214 complex Coiled coil region 2, helix 3 XXXXXXXXXXXXXXXXXXXXXXXXXX 26 F F F 6lk8 18 U,V Q,R bridge domain XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 391 F F F 6lkf 1 A A A0A220GHA5_9CAUD AcrIIA5 MAYGKSRYNSYRKRSFNRSNKQRREYAQEMDRLEKAFENLDGWYLSSMKDSAYKDFGKYEIRLSNHSADNKYHDLENGRLIVNIKASKLNFVDIIENKLDKIIEKIDKLDLDKYRFINATNLEHDIKCYYKGFKTKKEVI 140 T 0.033 DUSP pdbpssm T Viruses T 6lkq 56 GB,HB,IB,JB,KB y,z,7,AA,BA Viomycin XXSSXX 6 T 1500 Sid-5 pdbhh F F 6llq 1 A A VAL88 GRVVVVVTSEQVKEEVRKKFPQVEVRVVTTEEDAKQVVKEVQKKGVQKVVVVGVSEKVVQKVKQEANVQVYRVTSNDEVEQVVKDVKGSGLEHHHHHH 98 T 0.00031 PrpR_N pdbpssm F T 6ln4 2 B B PRGC1_HUMAN PPARGC-1-ALPHA,LIGAND EFFECT MODULATOR 6 ASELLKYLTT 10 T 2.2 MTBP_mid pdbhh F Eukaryota T 6lnl 1 A,B,C A,B,C PP62_ASFB7 60 kDa polyprotein MPSNMKQFCKISVWLQQHDPDLLEIINNLCMLGNLSAAKYKHGVTFIYPKQAKIRDEIKKHAYSNDPSQAIKTLESLILPFYIPTPAEFTGEIGSYTGVKLEVEKTEANKVILKNGEAVLVPAADFKPFPDRRLAVWIMESGSMPLEGPPYKRKKEGGGNLEHHHHHHHH 170 T 0.28 mRNA_decap_C pdbpercent T Viruses T 6lnm 2 B,D,F B,D,F APBA1_MOUSE ADAPTER PROTEIN X11ALPHA,NEURON-SPECIFIC X11 PROTEIN,NEURONAL MUNC18-1-INTERACTING PROTEIN 1,MINT-1 GPGSEFISLAIKDIKEAIEEVKTRTIRSPYTPDEPKEPIWVMRQDISPTR 50 T 18 UPF0561 pdbhh F Eukaryota T 6lp2 1 A A Q5ZTL3_LEGPH Uncharacterized protein lpg2148 GSLESPGFMVHKKLKSMSQSYGVMMTGVPAEVLGQMQAERSIPSINKTGNLKQQIAKEVSKVCHMMTEPTQSAGQASNDVCELLLGKIEAEKFHFTKYEALSADGDNLKNVLENTAPSSTNLLIRFEIDREDPPIVLVKTKNENFNPETAVKNKIYLLENKLYFIDKMGNLFNLGPGKKKCTQLFNAIGDSAEYSLCDPFVLEEPEKPEDFAISEIVDIFNEQKERFDFWIGSHSFTIYIPQTLGESPRQFYPYQAYFGSHTLQDWFVSDKDEYLSRIGIDKYIEKLAVLGKTTNTKERSDIYAEFFSKRGREAFFCAHLNEKRQPLRVKFKITEINPELALKNLQETQEFIDTHPGENPSDKVENYRNRAKLAMTEHLESLLDIK 386 T 0.021 CIF pdbhh F Bacteria T 6lp3 1 A,B,D,E A,B,D,E YM11_YEAST Uncharacterized protein YMR124W GSPSKTKSAPVSYDKDGMNASEEDFSFDNTLAKPYEPLYARRGDITSAGSTSGEDSSQPKMITISGEQLNLITENKELMNELTLVSTELAESIKRETELEERIRLYETNNSAPSFDDSSSVSFSDFEKELRKKSSKIVQLIQQLNDERLKRFIAEEQLLLQENGTKPSSMELVGRIENLNKLIDERDSEIEMLKGRLQ 198 T 0.039 Yop-YscD_ppl pdbpercent F Eukaryota T 6lph 2 B B FUSED_DROME Serine/threonine-protein kinase fused AAPVINSHTCFVSGNSNMILNHMNDNFA 28 T 0.23 DUF4193 unp F Eukaryota T 6lpr 2 B P METHOXYSUCCINYL-ALA-ALA-PRO-NORLEUCINE BORONIC ACID INHIBITOR XAAPX 5 T 970 Pep_deformylase pdbhh F F 6lqp 66 RB RV FAF1_YEAST FORTY S ASSEMBLY FACTOR MTLDDDDYIKQMELQRKAFESQFGSLESMGFEDKTKNIRTEVDTRDSSGDEIDNSDHGSDFKDGTIESSNSSDEDSGNETAEENNQDSKPKTQPKVIRFNGPSDVYVPPSKKTQKLLRSGKTLTQINKKLESTEAKEEKEDETLEAENLQNDLELQQFLRESHLLSAFNNGGSGSTNSGVSLTLQSMGGGNDDGIVYQDDQVIGKARSRTLEMRLNRLSRVNGHQDKINKLEKVPMHIRRGMIDKHVKRIKKYEQEAAEGGIVLSKVKKGQFRKIESTYKKDIERRIGGSIKARDKEKATKRERGLKISSVGRSTRNGLIVSKRDIARISGGERSGKFNGKKKSRR 346 T 4.1E-08 DUF4602 unppssm F Eukaryota T 6lqp 69 UB X1 Unassigned helices XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 347 F F F 6lqq 64 PB RV FAF1_YEAST FORTY S ASSEMBLY FACTOR MTLDDDDYIKQMELQRKAFESQFGSLESMGFEDKTKNIRTEVDTRDSSGDEIDNSDHGSDFKDGTIESSNSSDEDSGNETAEENNQDSKPKTQPKVIRFNGPSDVYVPPSKKTQKLLRSGKTLTQINKKLESTEAKEEKEDETLEAENLQNDLELQQFLRESHLLSAFNNGGSGSTNSGVSLTLQSMGGGNDDGIVYQDDQVIGKARSRTLEMRLNRLSRVNGHQDKINKLEKVPMHIRRGMIDKHVKRIKKYEQEAAEGGIVLSKVKKGQFRKIESTYKKDIERRIGGSIKARDKEKATKRERGLKISSVGRSTRNGLIVSKRDIARISGGERSGKFNGKKKSRR 346 T 4.1E-08 DUF4602 unppssm F Eukaryota T 6lqq 66 RB X1 Unassigned helices XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 347 F F F 6lqr 64 PB X1 Unassigned helices XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 347 F F F 6lqs 61 LB X1 Unassigned helices 1 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 347 F F F 6lqs 62 MB X2 Unassigned helices 2 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 694 F F F 6lqt 56 FB X1 Unassigned helices XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 347 F F F 6lqu 65 QB RV FAF1_YEAST FORTY S ASSEMBLY FACTOR MTLDDDDYIKQMELQRKAFESQFGSLESMGFEDKTKNIRTEVDTRDSSGDEIDNSDHGSDFKDGTIESSNSSDEDSGNETAEENNQDSKPKTQPKVIRFNGPSDVYVPPSKKTQKLLRSGKTLTQINKKLESTEAKEEKEDETLEAENLQNDLELQQFLRESHLLSAFNNGGSGSTNSGVSLTLQSMGGGNDDGIVYQDDQVIGKARSRTLEMRLNRLSRVNGHQDKINKLEKVPMHIRRGMIDKHVKRIKKYEQEAAEGGIVLSKVKKGQFRKIESTYKKDIERRIGGSIKARDKEKATKRERGLKISSVGRSTRNGLIVSKRDIARISGGERSGKFNGKKKSRR 346 T 4.1E-08 DUF4602 unppssm F Eukaryota T 6lqu 68 TB X1 Unassigned helices XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 347 F F F 6lqv 54 FB X1 Unassigned helices XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 347 F F F 6lqz 2 B B STH1_YEAST ATP-DEPENDENT HELICASE STH1,CHROMATIN STRUCTURE-REMODELING COMPLEX PROTEIN STH1,SNF2 HOMOLOG SEVKSSSVEIINGSESKKKKPKLTVKIKLNKTTVLENNDGKRAEEKPESKSPAKKTAAKY 60 T 70 DUF167 pdbhh F Eukaryota T 6lrd 3 C B ASP-LEU-PRO-PHE DLPF 4 T 38 DUF3794 pdbhh F F 6lrp 1 A A A9WDE7_CHLAA Isocitrate lyase MRGSHHHHHHGSMDRAAQIKQIADSWNTPRFAGIVRPYTPEDVYRLRGSVQIEYTLARMGAERLWNLLHTEPYINALGALTGNQAMQQVKAGLKAIYLSGWQVAADANLAGQMYPDQSLYPANSGPQLVRNINNALRRADQIYHSEGRNDIYWFAPIVADAEAGFGGPLNVFEIMKAYIEAGAAGVHFEDQLASEKKCGHMGGKVLIPTQAAIRNLVAARLAADVMGVPTIIVARTDANAATLLTSDIDERDRPFCTGERTSEGFYRVRAGLDQAIARGLAYAPYADMIWCETSEPNLEEARRFAEAIHAQFPGKLLAYNCSPSFNWKKKLDDATIAAFQRELGAMGYKFQFVTLAGFHALNYSMFELARNYRDRGMAAYSELQQAEFAAEAYGYTATRHQREVGTGYFDEVAQVIAGGEISTTALTGSTEEEQFH 436 T 1.4999999999999998E-47 ICL unp F Bacteria T 6lrt 1 A,B,C,D,E,F,G,H A,D,G,J,M,P,S,V A9WDE7_CHLAA Isocitrate lyase MRGSHHHHHHGSMDRAAQIKQIADSWNTPRFAGIVRPYTPEDVYRLRGSVQIEYTLARMGAERLWNLLHTEPYINALGALTGNQAMQQVKAGLKAIYLSGWQVAADANLAGQMYPDQSLYPANSGPQLVRNINNALRRADQIYHSEGRNDIYWFAPIVADAEAGFGGPLNVFEIMKAYIEAGAAGVHFEDQLASEKKCGHMGGKVLIPTQAAIRNLVAARLAADVMGVPTIIVARTDANAATLLTSDIDERDRPFCTGERTSEGFYRVRAGLDQAIARGLAYAPYADMIWCETSEPNLEEARRFAEAIHAQFPGKLLAYNCSPSFNWKKKLDDATIAAFQRELGAMGYKFQFVTLAGFHALNYSMFELARNYRDRGMAAYSELQQAEFAAEAYGYTATRHQREVGTGYFDEVAQVIAGGEISTTALTGSTEEEQFH 436 T 1.4999999999999998E-47 ICL unp F Bacteria T 6ls6 2 C,D C,D H31_HUMAN 2 TKQTARXS 8 T 260 SpecificRecomb pdbhh F Eukaryota T 6lsb 2 B B H3_ACRFO Histone H3 ARTKQTARKSTGGXAPRKQLATKAAX 26 T 0.26 PAF pdbpercent F Eukaryota T 6lsd 2 C,D C,D H31_HUMAN 2 XAARXSAPA 9 T 25 DUF5826 pdbhh F Eukaryota F 6ltp 1 A,D A,G Cas12i2 SMSSAIKSYKSVLRPNERKNQLLKSTIQCLEDGSAFFFKMLQGLFGGITPEIVRFSTEQEKQQQDIALWCAVNWFRPVSQDSLTHTIASDNLVEKFEEYYGGTASDAIKQYFSASIGESYYWNDCRQQYYDLCRELGVEVSDLTHDLEILCREKCLAVATESNQNNSIISVLFGTGEKEDRSVKLRITKKILEAISNLKEIPKNVAPIQEIILNVAKATKETFRQVYAGNLGAPSTLEKFIAKDGQKEFDLKKLQTDLKKVIRGKSKERDWCCQEELRSYVEQNTIQYDLWAWGEMFNKAHTALKIKSTRNYNFAKQRLEQFKEIQSLNNLLVVKKLNDFFDSEFFSGEETYTICVHHLGGKDLSKLYKAWEDDPADPENAIVVLCDDLKNNFKKEPIRNILRYIFTIRQECSAQDILAAAKYNQQLDRYKSQKANPSVLGNQGFTWTNAVILPEKAQRNDRPNSLDLRIWLYLKLRHPDGRWKKHHIPFYDTRFFQEIYAAGNSPVDTCQFRTPRFGYHLPKLTDQTAIRVNKKHVKAAKTEARIRLAIQQGTLPVSNLKITEISATINSKGQVRIPVKFDVGRQKGTLQIGDRFCGYDQNQTASHAYSLWEVVKEGQYHKELGCFVRFISSGDIVSITENRGNQFDQLSYEGLAYPQYADWRKKASKFVSLWQITKKNKKKEIVTVEAKEKFDAICKYQPRLYKFNKEYAYLLRDIVRGKSLVELQQIRQEIFRFIEQDCGVTRLGSLSLSTLETVKAVKGIIYSYFSTALNASKNNPISDEQRKEFDPELFALLEKLELIRTRKKKQKVERIANSLIQTCLENNIKFIRGEGDLSTTNNATKKKANSRSMDWLARGVFNKIRQLAPMHNITLFGCGSLYTSHQDPLVHRNPDKAMKCRWAAIPVKDIGDWVLRKLSQNLRAKNIGTGEYYHQGVKEFLSHYELQDLEEELLKWRSDRKSNIPCWVLQNRLAEKLGNKEAVVYIPVRGGRIYFATHKVATGAVSIVFDQKQVWVCNADHVAAANIALTVKGIGEQSSDEENPDGSRIKLQLTS 1055 T 0.11 RHSP pdb F T 6ltr 1 A A Cas12i2 SMSSAIKSYKSVLRPNERKNQLLKSTIQCLEDGSAFFFKMLQGLFGGITPEIVRFSTEQEKQQQDIALWCAVNWFRPVSQDSLTHTIASDNLVEKFEEYYGGTASDAIKQYFSASIGESYYWNDCRQQYYDLCRELGVEVSDLTHDLEILCREKCLAVATESNQNNSIISVLFGTGEKEDRSVKLRITKKILEAISNLKEIPKNVAPIQEIILNVAKATKETFRQVYAGNLGAPSTLEKFIAKDGQKEFDLKKLQTDLKKVIRGKSKERDWCCQEELRSYVEQNTIQYDLWAWGEMFNKAHTALKIKSTRNYNFAKQRLEQFKEIQSLNNLLVVKKLNDFFDSEFFSGEETYTICVHHLGGKDLSKLYKAWEDDPADPENAIVVLCDDLKNNFKKEPIRNILRYIFTIRQECSAQDILAAAKYNQQLDRYKSQKANPSVLGNQGFTWTNAVILPEKAQRNDRPNSLDLRIWLYLKLRHPDGRWKKHHIPFYDTRFFQEIYAAGNSPVDTCQFRTPRFGYHLPKLTDQTAIRVNKKHVKAAKTEARIRLAIQQGTLPVSNLKITEISATINSKGQVRIPVKFDVGRQKGTLQIGDRFCGYDQNQTASHAYSLWEVVKEGQYHKELGCFVRFISSGDIVSITENRGNQFDQLSYEGLAYPQYADWRKKASKFVSLWQITKKNKKKEIVTVEAKEKFDAICKYQPRLYKFNKEYAYLLRDIVRGKSLVELQQIRQEIFRFIEQDCGVTRLGSLSLSTLETVKAVKGIIYSYFSTALNASKNNPISDEQRKEFDPELFALLEKLELIRTRKKKQKVERIANSLIQTCLENNIKFIRGAGDLSTTNNATKKKANSRSMDWLARGVFNKIRQLAPMHNITLFGCGSLYTSHQDPLVHRNPDKAMKCRWAAIPVKDIGDWVLRKLSQNLRAKNIGTGEYYHQGVKEFLSHYELQDLEEELLKWRSDRKSNIPCWVLQNRLAEKLGNKEAVVYIPVRGGRIYFATHKVATGAVSIVFDQKQVWVCNADHVAAANIALTVKGIGEQSSDEENPDGSRIKLQLTS 1055 T 0.11 RHSP pdb F T 6ltu 1 A A Cas12i2 SMSSAIKSYKSVLRPNERKNQLLKSTIQCLEDGSAFFFKMLQGLFGGITPEIVRFSTEQEKQQQDIALWCAVNWFRPVSQDSLTHTIASDNLVEKFEEYYGGTASDAIKQYFSASIGESYYWNDCRQQYYDLCRELGVEVSDLTHDLEILCREKCLAVATESNQNNSIISVLFGTGEKEDRSVKLRITKKILEAISNLKEIPKNVAPIQEIILNVAKATKETFRQVYAGNLGAPSTLEKFIAKDGQKEFDLKKLQTDLKKVIRGKSKERDWCCQEELRSYVEQNTIQYDLWAWGEMFNKAHTALKIKSTRNYNFAKQRLEQFKEIQSLNNLLVVKKLNDFFDSEFFSGEETYTICVHHLGGKDLSKLYKAWEDDPADPENAIVVLCDDLKNNFKKEPIRNILRYIFTIRQECSAQDILAAAKYNQQLDRYKSQKANPSVLGNQGFTWTNAVILPEKAQRNDRPNSLDLRIWLYLKLRHPDGRWKKHHIPFYDTRFFQEIYAAGNSPVDTCQFRTPRFGYHLPKLTDQTAIRVNKKHVKAAKTEARIRLAIQQGTLPVSNLKITEISATINSKGQVRIPVKFDVGRQKGTLQIGDRFCGYDQNQTASHAYSLWEVVKEGQYHKELGCFVRFISSGDIVSITENRGNQFDQLSYEGLAYPQYADWRKKASKFVSLWQITKKNKKKEIVTVEAKEKFDAICKYQPRLYKFNKEYAYLLRDIVRGKSLVELQQIRQEIFRFIEQDCGVTRLGSLSLSTLETVKAVKGIIYSYFSTALNASKNNPISDEQRKEFDPELFALLEKLELIRTRKKKQKVERIANSLIQTCLENNIKFIRGEGDLSTTNNATKKKANSRSMDWLARGVFNKIRQLAPMHNITLFGCGSLYTSHQDPLVHRNPDKAMKCRWAAIPVKDIGDWVLRKLSQNLRAKNIGTGEYYHQGVKEFLSHYELQDLEEELLKWRSDRKSNIPCWVLQNRLAEKLGNKEAVVYIPVRGGRIYFATHKVATGAVSIVFDQKQVWVCNADHVAAANIALTVKGIGEQSSDEENPDGSRIKLQLTS 1055 T 0.11 RHSP pdb F T 6lu0 1 A A Cas12i2 SMSSAIKSYKSVLRPNERKNQLLKSTIQCLEDGSAFFFKMLQGLFGGITPEIVRFSTEQEKQQQDIALWCAVNWFRPVSQDSLTHTIASDNLVEKFEEYYGGTASDAIKQYFSASIGESYYWNDCRQQYYDLCRELGVEVSDLTHDLEILCREKCLAVATESNQNNSIISVLFGTGEKEDRSVKLRITKKILEAISNLKEIPKNVAPIQEIILNVAKATKETFRQVYAGNLGAPSTLEKFIAKDGQKEFDLKKLQTDLKKVIRGKSKERDWCCQEELRSYVEQNTIQYDLWAWGEMFNKAHTALKIKSTRNYNFAKQRLEQFKEIQSLNNLLVVKKLNDFFDSEFFSGEETYTICVHHLGGKDLSKLYKAWEDDPADPENAIVVLCDDLKNNFKKEPIRNILRYIFTIRQECSAQDILAAAKYNQQLDRYKSQKANPSVLGNQGFTWTNAVILPEKAQRNDRPNSLDLRIWLYLKLRHPDGRWKKHHIPFYDTRFFQEIYAAGNSPVDTCQFRTPRFGYHLPKLTDQTAIRVNKKHVKAAKTEARIRLAIQQGTLPVSNLKITEISATINSKGQVRIPVKFDVGRQKGTLQIGDRFCGYDQNQTASHAYSLWEVVKEGQYHKELGCFVRFISSGDIVSITENRGNQFDQLSYEGLAYPQYADWRKKASKFVSLWQITKKNKKKEIVTVEAKEKFDAICKYQPRLYKFNKEYAYLLRDIVRGKSLVELQQIRQEIFRFIEQDCGVTRLGSLSLSTLETVKAVKGIIYSYFSTALNASKNNPISDEQRKEFDPELFALLEKLELIRTRKKKQKVERIANSLIQTCLENNIKFIRGAGDLSTTNNATKKKANSRSMDWLARGVFNKIRQLAPMHNITLFGCGSLYTSHQDPLVHRNPDKAMKCRWAAIPVKDIGDWVLRKLSQNLRAKNIGTGEYYHQGVKEFLSHYELQDLEEELLKWRSDRKSNIPCWVLQNRLAEKLGNKEAVVYIPVRGGRIYFATHKVATGAVSIVFDQKQVWVCNADHVAAANIALTVKGIGEQSSDEENPDGSRIKLQLTS 1055 T 0.11 RHSP pdb F T 6lu7 2 B C N-[(5-METHYLISOXAZOL-3-YL)CARBONYL]ALANYL-L-VALYL-N~1~-((1R,2Z)-4-(BENZYLOXY)-4-OXO-1-{[(3R)-2-OXOPYRROLIDIN-3-YL]METHYL}BUT-2-ENYL)-L-LEUCINAMIDE XAVLXX 6 T 1700 zf-H2C2_5 pdbhh F F 6lui 1 A A SAMD1_HUMAN STERILE ALPHA MOTIF DOMAIN-CONTAINING PROTEIN 1,SAM DOMAIN-CONTAINING PROTEIN 1 SASPHYQEWILDTIDSLRSRKARPDLERICRMVRRRHGPEPERTRAELEKLIQQRAVLRVSYKGSISYRNAARVQPPRRG 80 T 0.025 Linker_histone pdbpercent F Eukaryota T 6lum 3 C,H,M E,I,O A0R4D1_MYCS2 Succinate dehydrogenase subunit F MVLFFEILLVAAVLVITWFAVYALYRLVTDES 32 T 0.0077 CoxIIa unphh F Bacteria T 6lup 3 C,F C,F PHE-ALA-ASN-PHE-PHE-ILE-ARG-GLY-LEU FANFFIRGL 9 T 3.9 DUF2199 pdbhh F T 6lvb 2 B,D,F,H B,D,F,H I6NWZ0_9RHOB N,N-dimethylformamidase small subunit MTEASESCVRDPSNYRDRSADWYAFYDERRRKEIIDIIDEHPEIVEEHAANPFGYRKHPSPYLQRVHNYFRMQPTFGRYYIYSEREWDAYRIATIREFGELPELGDERFKTEEEAMHAVFLRRIEDVRAELA 132 T 0.18 UPF0158 pdb F Bacteria T 6lvc 2 B,D B,D I6NWZ0_9RHOB N,N-dimethylformamidase small subunit MTEASESCVRDPSNYRDRSADWYAFYDERRRKEIIDIIDEHPEIVEEHAANPFGYRKHPSPYLQRVHNYFRMQPTFGRYYIYSEREWDAYRIATIREFGELPELGDERFKTEEEAMHAVFLRRIEDVRAELA 132 T 0.18 UPF0158 pdb F Bacteria T 6lvd 2 B,D,F,H B,D,F,H I6NWZ0_9RHOB N,N-dimethylformamidase small subunit MTEASESCVRDPSNYRDRSADWYAFYDERRRKEIIDIIDEHPEIVEEHAANPFGYRKHPSPYLQRVHNYFRMQPTFGRYYIYSEREWDAYRIATIREFGELPELGDERFKTEEEAMHAVFLRRIEDVRAELA 132 T 0.18 UPF0158 pdb F Bacteria T 6lve 2 B,D,F,H B,D,F,H I6NWZ0_9RHOB N,N-dimethylformamidase small subunit MTEASESCVRDPSNYRDRSADWYAFYDERRRKEIIDIIDEHPEIVEEHAANPFGYRKHPSPYLQRVHNYFRMQPTFGRYYIYSEREWDAYRIATIREFGELPELGDERFKTEEEAMHAVFLRRIEDVRAELA 132 T 0.18 UPF0158 pdb F Bacteria T 6lvv 2 B,D,F,H,J,L,N,P B,D,F,H,J,L,N,P I6NWZ0_9RHOB N,N-dimethylformamidase small subunit MTEASESCVRDPSNYRDRSADWYAFYDERRRKEIIDIIDEHPEIVEEHAANPFGYRKHPSPYLQRVHNYFRMQPTFGRYYIYSEREWDAYRIATIREFGELPELGDERFKTEEEAMHAVFLRRIEDVRAELA 132 T 0.18 UPF0158 pdb F Bacteria T 6lw4 1 A A Q5ZTL3_LEGPH Uncharacterized protein Lpg2148 LESPGFMVHKKLKSMSQSYGVMMTGVPAEVLGQMQAERSIPSINKTGNLKQQIAKEVSKVCHMMTEPTQSAGQASNDVCELLLGKIEAEKFHFTKYEALSADGDNLKNVLENTAPSSTNLLIRFEIDREDPPIVLVKTKNENFNPETAVKNKIYLLENKLYFIDKMGNLFNLGPGKKKCTQLFNAIGDSAEYSLCDPFVLEEPEKPEDFAISEIVDIFNEQKERFDFWIGSHSFTIYIPQTLGESPRQFYPYQAYFGSHTLQDWFVSDKDEYLSRIGIDKYIEKLAVLGKTTNTKERSDIYAEFFSKRGREAFFCAHLNEKRQPLRVKFKITEINPELALKNLQETQEFIDTHPGENPSDKVENYRNRAKLAMTEHLESLLDI 383 T 0.021 CIF pdbhh F Bacteria T 6lw5 2 B B TRP-LYS-TYR-MET-VAL-QXV WKYMVX 6 T 32 DUF5891 pdbhh F T 6ly5 30 GA g PsaS XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 134 F F F 6ly5 31 HA h A0A6J4B118_9STRA PsaR MSKLSFFILSAVLAVSAAFAPMPRAAISTTHARTASMPSASFTSLSMASEDMTWEGEYPPSKVLGPIMSKMPSGLLGLISIACAAVCVYSIAQSGVLQQQPGAYENGSWVKWYYVLGSFGGPLAWGTHVASWIQRKNGM 139 T 0.51 DUF6520 pdbhh F Eukaryota T 6lyc 2 B B D4-2 XXRYSAVYSIHPSWCGX 17 T 0.95 BUD22 pdbhh F T 6lzx 1 A,B A,B B5MGN9_PHYAM Glycosyltransferase MNHKVHHHHHHLQENLYFQGMGAEPQQLHVVFFPIMAHGHMIPTLDIARLFAARNVRATIITTPLNAHTFTKAIEMGKKNGSPTIHLELFKFPAQDVGLPEGCENLEQALGSSLIEKFFKGVGLLREQLEAYLEKTRPNCLVADMFFPWATDSAAKFNIPRLVFHGTSFFSLCALEVVRLYEPHKNVSSDEELFSLPLFPHDIKMMRLQLPEDVWKHEKAEGKTRLKLIKESELKSYGVIVNSFYELEPNYAEFFRKELGRRAWNIGPVSLCNRSTEDKAQRGKQTSIDEHECLKWLNSKKKNSVIYICFGSTAHQIAPQLYEIAMALEASGQEFIWVVRNNNNNDDDDDDSWLPRGFEQRVEGKGLIIRGWAPQVLILEHEAIGAFVTHCGWNSTLEGITAGVPMVTWPIFAEQFYNEKLVNQILKIGVPVGANKWSRETSIEDVIKKDAIEKALREIMVGDEAEERRSRAKKLKEMAWKAVEEGGSSYSDLSALIEELRGYHA 505 T 8.000000000000001E-28 UDPGT pdbhh F Eukaryota T 6lzy 1 A,B A,B B5MGN9_PHYAM Glycosyltransferase MNHKVHHHHHHLQENLYFQGMGAEPQQLHVVFFPIMAHGHMIPTLDIARLFAARNVRATIITTPLNAHTFTKAIEMGKKNGSPTIHLELFKFPAQDVGLPEGCENLEQALGSSLIEKFFKGVGLLREQLEAYLEKTRPNCLVADMFFPWATDSAAKFNIPRLVFHGTSFFSLCALEVVRLYEPHKNVSSDEELFSLPLFPHDIKMMRLQLPEDVWKHEKAEGKTRLKLIKESELKSYGVIVNSFYELEPNYAEFFRKELGRRAWNIGPVSLCNRSTEDKAQRGKQTSIDEHECLKWLNSKKKNSVIYICFGSTAHQIAPQLYEIAMALEASGQEFIWVVRNNNNNDDDDDDSWLPRGFEQRVEGKGLIIRGWAPQVLILEHEAIGAFVTHCGWNSTLEGITAGVPMVTWPIFAEQFYNEKLVNQILKIGVPVGANKWSRETSIEDVIKKDAIEKALREIMVGDEAEERRSRAKKLKEMAWKAVEEGGSSYSDLSALIEELRGYHA 505 T 8.000000000000001E-28 UDPGT pdbhh F Eukaryota T 6m0q 2 B,D,F,H,J,L B,D,F,H,J,L Q82V11_NITEU Uncharacterized protein MNKVIVAAFVSAFVLGSTATFASGNLESSLAPISAKDMLDYLACKDKKPTDVVKSHTEVENGKIVRVKCGDIVALVQKAREQSGDAWQGGY 91 T 0.042 DUF6488 pdbpercent F Bacteria T 6m0r 6 F O YP17B_YEAST Uncharacterized protein YPR170W-B TGKAWCCTVLSAFGVVILSVIAHLFNTNHESFVGSINDPEDGPAVAHTVYLAALVYLVFFVFCGFQVYL 69 T 0.055 Tetraspanin pdbpssm F Eukaryota T 6m0s 3 C O YP17B_YEAST Uncharacterized protein YPR170W-B TGKAWCCTVLSAFGVVILSVIAHLFNTNHESFVGSINDPEDGPAVAHTVYLAALVYLVFFVFCGFQVYL 69 T 0.055 Tetraspanin pdbpssm F Eukaryota T 6m0y 1 A A LYS-ARG-ILE-VAL-LYS-ARG-ILE-LYS-LYS-TRP-LEU-ARG KRIVKRIKKWLR 12 T 0.78 FAM110_C pdbhh F T 6m19 1 A A lasso peptide LVVIVQADWNAPGFF 15 T 1.2 Exo_endo_phos pdbhh F T 6m1h 2 B B MAXA_LUTLO Maxadilan CDATCQFRKAIDDCQKQAHHSNVLQTSVQTTATFTSMDTSQLPGNSVFKECMKQKKKEFKA 61 T 0.74 Clavanin unphh F Eukaryota T 6m1u 1 A A MET16_HUMAN METHYLTRANSFERASE 10 DOMAIN-CONTAINING PROTEIN,METHYLTRANSFERASE-LIKE PROTEIN 16,N6-ADENOSINE-METHYLTRANSFERASE METTL16,U6 SMALL NUCLEAR RNA (ADENINE-(43)-N(6))-METHYLTRANSFERASE,METHYLTRANSFERASE 10 DOMAIN-CONTAINING PROTEIN,METHYLTRANSFERASE-LIKE PROTEIN 16,N6-ADENOSINE-METHYLTRANSFERASE METTL16,U6 SMALL NUCLEAR RNA (ADENINE-(43)-N(6))-METHYLTRANSFERASE MKPITFVVLASVMKELSLKASPLRSETAEGIVVVTTWIEKILTDLKVQHKRVPCGKEEVSLFLTAIENSWIHLRRKKRERVRQLREVPRAPEDVIQALEEKKGVAGQYLFKCLINVKKEVDDALVEMHWVEGQNRDLMNQLCTYIRNQIFRLVAVNLEHHHHHH 164 T 15 BLUF pdbhh F Eukaryota T 6m24 3 C C POLG_RHDVF VP60-2 ALMPGQFFV 9 T 0.46 tRNA_Me_trans pdbhh T Viruses T 6m2j 3 C C POLG_RHDVF VP60-1 TLIDLTELI 9 T 6.1 CAF1-p150_N pdbhh T Viruses F 6m2k 1 A C POLG_RHDVF VP60-10 FVPFNSPNI 9 T 6.5 DUF1919 pdbhh T Viruses T 6m3n 1 A A anti-CRIPSR AcrIF7 GHMTTFTSIVTTNPDFGGFEFYVEAGQQFDDSAYEEAYGVSVPSAVVEEMNAKAAQLKDGEWLNVSHEA 69 T 0.034 DUF3085 pdbpssm F T 6m5e 2 D,E,F F,G,H dalbavancin XYXXXXXX 8 T 840 DDE_Tnp_1_2 pdbhh F F 6m5e 3 G,H I,J dalbavancin XYXXXXXX 8 T 840 DDE_Tnp_1_2 pdbhh F F 6m64 2 B,D,F B,D,F CBP_HUMAN CBP GPPPAAVEAARQILREAQQQQHLYSDED 28 T 6.4 DUF2007 pdbhh F Eukaryota T 6m6g 4 I L coiled coils XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 84 F F F 6m6g 5 L P Coiled coils XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 87 F F F 6m6i 2 G G Coiled coils chain 1 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 81 F F F 6m6i 3 H H Coiled coils chain 2 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 83 F F F 6m6q 1 A,B A,B Q93413_CAEEL Dicer Related Helicase MQPTAIRLEDYDKSKLRLPFESPYFPAYFRLLKWKFLDVCVESTRNNDIGYFKLFESLFPPGKLEEIARMIIDEPTPVSHDPDMIKIRNADLDVKIRKQAETYVTLRHAHQQKVQRRRFSECFLNTVLFDEKGLRIADEVMFNYDKELYGYSHWEDLPDGWLTAETFKNKFYDEEEVTNNPFGYQKLDRVAGAARGMIIMKHLKSNPRCVSETTILAFEVFNKGNHQLSTDLVEDLLTEGPAFELKIENGEEKKYAVKKWSLHKTLTMFLAIIGFKSNDKKEKNEHEEWYYGFIDAMKNDPANRAALYFLDKNWPEELEEREKERDRIRLTLLKS 335 T 0.29 RE_NgoFVII unphh F Eukaryota T 6m7a 2 B,D D,C SHLD3_HUMAN REV7-INTERACTING NOVEL NHEJ REGULATOR 1,SHIELD COMPLEX SUBUNIT 3 QDFPTRPLSRFIPWFPYDGSKLPLRPKRSPPVISEEAAEDVKQYLT 46 T 4.6 Sm_like pdbhh F Eukaryota T 6m7b 2 C,D C,D SHLD3_HUMAN REV7-INTERACTING NOVEL NHEJ REGULATOR 1,SHIELD COMPLEX SUBUNIT 3 RFIPWFPYDGSKLPLRPKRSPPVISEEAAEDVKQYLT 37 T 1.2 Sm_like pdbhh F Eukaryota T 6m7m 1 A A L-GSTSTA from ice nucleation protein, inaZ, and its enantiomer, D-GSTSTA GSTSTA 6 T 0.00013 Ice_nucleation unp F F 6m80 1 A,B,C C,D,E Collagen peptide containing aza-proline and aza-glycine PPGPPGXPGPRXPPGPPGPPGPPGX 25 T 0.0018 Collagen pdbpssm F F 6m8r 2 K,L K,L GABR2_HUMAN GB2, G-PROTEIN COUPLED RECEPTOR 51, HG20 GPEKDPIEDINSPEHIQRRLSLQLPILHHAYLPSIGGVDAS 41 T 5.2 DUF776 pdbhh F Eukaryota T 6m8s 3 I,J,K,L,O A,O,P,B,M KCD12_HUMAN PFETIN,PREDOMINANTLY FETAL EXPRESSED T1 DOMAIN GPESLDGSRRSGYITIGYRGSYTIGRDAQADAKFRRVARITVCGKTSLAKEVFGDTLNESRDPDRPPERYTSRYYLKFNFLEQAFDKLSESGFHMVACSSTGTCAFASSTDQSEDKIWTSYTEYVFCRE 129 T 0.13 Baculo_VP91_N pdb F Eukaryota T 6m8w 2 B B AIAF PEPTIDE INHIBITOR XIAX 4 T 340 PD40 pdbhh F F 6m8y 2 B B AIPF PEPTIDE INHIBITOR XIPX 4 T 77 G0-G1_switch_2 pdbhh F F 6m90 3 C C CTNB1_HUMAN BETA-CATENIN CDRKAAVSHWQQQSYLDSGIHSGATTTAPSLSG 33 T 25 AvrPto pdbhh F Eukaryota T 6m91 3 C C CTNB1_HUMAN BETA-CATENIN CDRKAAVSHWQQQSYLDSGIHAGATTTAPSLSG 33 T 12 AvrPto pdbhh F Eukaryota T 6m92 3 C C CTNB1_HUMAN BETA-CATENIN CDRKAAVSHWQQQSYLDSGIHSGATTTAPSLSG 33 T 25 AvrPto pdbhh F Eukaryota T 6m93 3 C C CTNB1_HUMAN BETA-CATENIN CDRKAAVSHWQQQSYLDSGIHSGATTTAPSLSG 33 T 25 AvrPto pdbhh F Eukaryota T 6m94 3 C C CTNB1_HUMAN BETA-CATENIN CDRKAAVSHWQQQSYLDSGIHSGATTTAPSLSG 33 T 25 AvrPto pdbhh F Eukaryota T 6m9c 2 B B Pseudotyrostatin XYX 3 T 890 WW pdbhh F F 6m9d 2 B B Chymostatin A FXLX 4 T 160 HycA_repressor pdbhh F F 6m9f 2 B B Tyrostatin XYLX 4 T 880 Sigma70_r1_2 pdbhh F F 6m9i 1 A A ICEN_PSESY INAZ GSTSTA 6 T 0.00013 Ice_nucleation unp F Bacteria F 6m9j 1 A A ICEN_PSESY INAZ GSTSTA 6 T 0.00013 Ice_nucleation unp F Bacteria F 6m9k 2 D,E,F D,E,F VBET_LAMBD Recombination protein bet ITPVNDETMQEINTLLIALDKTWDDDLLPLCSQIFRRDIRASSELTQAEAVKALGFLKQKAAEQKVA 67 T 0.25 DUF1018 pdbhh T Viruses T 6ma1 2 B B HCFC1_HUMAN Host Cell Factor 1 peptide THETGTTNTATTATSN 16 T 54 Ice_nucleation pdbhh F Eukaryota T 6ma2 2 B B HCFC1_HUMAN Host Cell Factor 1 peptide THETGTTNTATTATSN 16 T 54 Ice_nucleation pdbhh F Eukaryota T 6ma3 2 B B HCFC1_HUMAN Host Cell Factor 1 peptide THETGTTNTATTATSN 16 T 54 Ice_nucleation pdbhh F Eukaryota T 6ma4 2 B B HCFC1_HUMAN Host Cell Factor 1 peptide THETGTTNTATTATSN 16 T 54 Ice_nucleation pdbhh F Eukaryota T 6ma5 2 B B HCFC1_HUMAN Host Cell Factor 1 peptide THETGTTNTATTATSN 16 T 54 Ice_nucleation pdbhh F Eukaryota T 6mat 2 G G unknown protein XXXXXXXXXXXXXXXXXXXXXXXXXXX 27 F F F 6mbb 2 B B dF1 XSYVDKIADVMREVAEKINSDLTX 24 T 1.7 DUF5806 pdbhh F T 6mbc 2 B B dF4 XSLLEKLAEYLRQMADEINKKYVKX 25 T 0.11 Amphi-Trp pdb F T 6mbd 2 C,D C,D dM1 XAPKEKEVAETLRKIGEEINEALKX 25 T 0.11 Bclx_interact pdb F T 6mbe 2 B B dM7 XDKTLEEIARELLKLALEIDKEIX 24 T 1.3 DUF2497 pdbhh F T 6mbm 1 A A MYO1H_HUMAN MYOSIN-1H KWAVRIIRKFIKGFIS 16 T 0.00051 IQ unppercent F Eukaryota T 6mc6 1 A,B A,B PPRA_DEIRA PLEIOTROPIC PROTEIN PROMOTING DNA REPAIR GMQTDGIYAAFDTLMSTAGVDSQIAALAASEADAGTLDAALTQSLQEAQGRWGLGLHHLRHEARLTDDGDIEILTDGRPSARVSEGFGALAQAYAPMQALDERGLSQWAALGEGYRAPGDLPLAQLKVLIEHARDFETDWSAGRGETFQRVWRKGDTLFVEVARPASAEAALSKAAWKVIASIKDRAFQRELMRRSEKDGMLGALLGARHAGAKANLAQLPEAHFTVQAFVQTLSGAAARNAEEYRAALKTAAAALEEYQGVTTRQLSEVLRHGLRES 278 T 8.6 Ldt_C pdbhh F Bacteria T 6mc8 1 A,B A,B C1D318_DEIDV DNA repair protein PprA MRSGSHHHHHHRSDITSLYKKAGLENLYFQGSVNPLARFAELVATAGLQSDVQALADSGADDTTLEAQLTQELRLAHDRWGLGLLHLQHSARLIHTDGVPSDIALLVDGAPRAQLSDGARAIAGTYASMQAPGPEGRSEWGILPEGHRVTLRPGLGQLRVLIEDARDFETHWTPGAAQTWTRTWRQGETLAVEVHRPATPATALAKAAWKVITSIKDRTFQRELMERSNQVGMLGALLGARHSGAGDALNQLPEAHFAVSSAVVRETGREGREVDRWKAMQREATETLDELQKAATRRLAAVLSGGLR 308 T 0.2 Asp_protease_2 unp F Bacteria T 6mcb 3 C C Anti-CRISPR protein AcrIIA2 MTLTRAQKKYAEAMHEFINMVDDFEESTPDFAKEVLHDSDYVVITKNEKYAVALCSLSTDECEYDTNLYLDEKLVDYSTVDVNGVTYYINIVETNDIDDLEIATDEDEMKSGNQEIILKSELK 123 T 0.13 DUF6376 pdb F T 6mcc 3 C C A0A4V8H027_9CAUD ACRIIA2B.3 MTTARKKFYQAISEFEAMTGKDVERTPQIADEVLNDAEYIAFTKTEKYALYLCTSNVEGLEDRYFLDEECLDSTFLETEDNETYYIHFLQETEFSEDDNEDELPLATEEQIEAYDKQEELKAVILKKELN 130 T 0.051 CARDB pdbpercent T Viruses T 6mcd 1 A A Pb(II)(GRAND Coil Ser L12CL16A)- XEWEALEKKLAACESKAQALEKKLQALEKKLEALEHGX 38 T 0.0005 Lebercilin pdbpssm F T 6mct 1 A,B,C,D,E,F,G,H,I,J,K,L,M,N,O A,B,C,D,E,F,G,H,I,J,K,L,M,N,O mini-eVgL membrane protein DSLKWIVFLLFLIVLLLLAIVFLLRGX 27 T 0.002 RCR pdbhh F T 6me1 3 C,F F,E ENV_HV1B1 ENV POLYPROTEIN AVGLGAVFLGHHHHHH 16 T 52 PBP_N unphh T Viruses T 6mem 1 A A Chlorophyll A/B binding protein 1 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 193 F F F 6mem 2 B B Chlorophyll A/B binding protein 5 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 195 F F F 6mem 3 C C Chlorophyll A/B binding protein 3 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 208 F F F 6mem 4 D D Chlorophyll A/B binding protein 7 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 206 F F F 6mem 5 E E Chlorophyll A/B binding protein 4 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 221 F F F 6mem 6 F F Chlorophyll A/B binding protein 9 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 218 F F F 6mem 7 G G Chlorophyll A/B binding protein 8 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 219 F F F 6mem 8 H H Chlorophyll A/B binding protein 2 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 198 F F F 6mem 9 I I Chlorophyll A/B binding protein 6 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 196 F F F 6mem 10 J J PsaA XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 743 F F F 6mem 11 K K Chlorophyll A/B binding protein 10 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 223 F F F 6mem 12 L L PsaB XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 733 F F F 6mem 13 M M Chlorophyll A/B binding protein 11 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 224 F F F 6mem 14 N N PsaC XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 80 F F F 6mem 15 O O Chlorophyll A/B binding protein 12 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 225 F F F 6mem 16 P P PsaD XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 143 F F F 6mem 17 Q Q PsaE XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 66 F F F 6mem 18 R R PsaF XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 154 F F F 6mem 19 S S PsaG XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 97 F F F 6mem 20 T T PsaH XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 88 F F F 6mem 21 U U PsaI XXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 30 F F F 6mem 22 V V PsaJ XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 42 F F F 6mem 23 W W PsaK XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 77 F F F 6mem 24 X X PsaL XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 157 F F F 6mew 2 B,D B,D RFX7_HUMAN RFX7 peptide KAFVHMPTLPNLDFHKT 17 T 0.5 DUF4739 pdbhh F Eukaryota T 6mf5 2 C,D C,D SPC72_YEAST Spc72 SLAQSSPAGSQ 11 T 82 NAD_kinase_C pdbhh F Eukaryota T 6mf6 2 C,D D,C DBF4_YEAST DUMBBELL FORMING PROTEIN 4 RARIERARSIEGAVQVSKGTG 21 T 2.2 PDGF_N pdbhh F Eukaryota T 6mf8 1 A A TRAC_MOUSE T-cell receptor alpha chain C region DATLTEKSFETDMNLNFQNLSVMGLRILLLKVAGFNLLMTLRLWSS 46 T 0.048 Ribonucleas_3_3 pdb F Eukaryota T 6mgq 2 D,E,F D,E,F Phosphinic inhibitor DG014 XXTFPETLTY 10 T 22 Tcp11 pdbhh F T 6mhe 2 B C KB752 peptide SRVTWYDFLMEDTKSR 16 T 2.1 DUF2760 pdbhh F T 6mhf 2 B C GRDN_HUMAN AKT PHOSPHORYLATION ENHANCER,APE,COILED-COIL DOMAIN-CONTAINING PROTEIN 88A,G ALPHA-INTERACTING VESICLE-ASSOCIATED PROTEIN,GIV,GIRDERS OF ACTIN FILAMENT,HOOK-RELATED PROTEIN 1,HKRP1 KTGSPGSEVVTLQQFLEESNKLTSVQIKSSS 31 T 4.3 DFRP_C pdbhh F Eukaryota T 6mi9 1 A A PRO-MET-ALA-ARG-ASN-LYS-ILE-LEU-GLY-LYS-ILE-LEU-ARG-LYS-ILE-ALA-ALA-PHE-LYS PMARNKILGKILRKIAAFKX 20 T 0.88 HATPase_c_4 pdbhh F T 6mic 1 A A A0A0H3AKH0_VIBC3 Toxin co-regulated pilus biosynthesis protein B GSHMFLEDSELCWDTAAGSAKSCLSVRYDTVGNKTELDLKQIDVVSAKGLSFESDGKTKTPVVSTYETFQDGGRAKTINAIECPTGLNNRFAAVVSSFSTAGQNANFSSESAKDSQGTTQKDGSKGPHALLSGISLNWTLTNKVWDVTASIGIESGILPTSGIDSGSLLRNPKSLSFIAFQWCEN 185 T 23 DapH_N pdbhh F Bacteria T 6mjb 2 C C Q6FKQ5_CANGA Kinetochore-associated protein DSN1 GDKDNGLHAGETDGDDEGFEFRRHSNLGVPTLGERLDSLHEIKSARRMDHFNSSRNSLR 59 T 6.9 DUF1752 pdbhh F Eukaryota T 6mjc 2 B B Q6FKQ5_CANGA Kinetochore-associated protein DSN1 SNAPTLGERLDSLHEIKSARRMDHFNDD 28 T 3 DUF1752 pdbhh F Eukaryota T 6mje 2 B,D,F,H B,D,F,H DSN1_YEAST Dsn1p DLKFKRHKNKHIQGFPTLGERLDNLQDIKKAKRVENFNSS 40 T 2.5 Cytadhesin_P30 unphh F Eukaryota T 6mjl 2 B A MLXPL_HUMAN ChREBP Peptide ASN-TYR-TRP-LYS-ARG-ARG-ILE-GLU-VAL NYWKRRIEV 9 T 0.015 DUF2635 pdbhh F Eukaryota T 6mk4 1 A A TXPR2_THRPR BETA/OMEGA-TRTX-TP2A, PROTX-II, PT-II, PROTOXIN-2, PROTX2 YCQKWMWTCDSERKCCKGMVCRLWCKKKLW 30 T 0.001 Toxin_12 unppercent F Eukaryota T 6mk8 1 A A Anti-Staphylococcal peptide DFT503 GLSLLLSLGLKLLX 14 T 12 SIT pdbhh F F 6ml1 3 E G Proteolyzed N-terminal tag of Ubv.15.1a construct MAHHHHHHDTSLYKKAGSTENLYFQG 26 T 52 PTase_Orf2 pdbhh F T 6mlc 2 E,F E,F H4_HUMAN Histone H4 SGRGKGGKGLGKGGAKRHRK 20 T 11 Shadoo unppercent F Eukaryota T 6mm1 1 A,B,C,D A,B,C,D EHMT2_HUMAN EUCHROMATIC HISTONE-LYSINE N-METHYLTRANSFERASE 2,HLA-B-ASSOCIATED TRANSCRIPT 8,HISTONE H3-K9 METHYLTRANSFERASE 3,H3-K9-HMTASE 3,LYSINE N-METHYLTRANSFERASE 1C,PROTEIN G9A GSGFEELPLCSCRMEAPKIDRISERAGHKCMATESVDGELSGCNAAILKRETMRPSSRVALMVLCETHRARMVKHHCCPGCGYFCTAGTFLECHPDFRVAHRFHKACVSQLNGMVFCPHCGEDASEAQEVTIPRGD 136 T 0.029 DZR pdb F Eukaryota T 6mm5 2 B C RYR2_MOUSE RYR2,CARDIAC MUSCLE RYANODINE RECEPTOR,CARDIAC MUSCLE RYANODINE RECEPTOR-CALCIUM RELEASE CHANNEL,TYPE 2 RYANODINE RECEPTOR LYNRTRRISQTS 12 T 18 SNN_linker pdbhh F Eukaryota T 6mnq 1 A P A0A0K0KAD3_9HIV1 Envelope glycoprotein NNTRKSIRIGPGQAFYATGGIIG 23 T 1.9E-05 GP120 pdbhh T Viruses T 6mnr 1 A P A0A0K0KAD3_9HIV1 Envelope glycoprotein NNTRKSIRIGPGQAFYATGGIIG 23 T 1.9E-05 GP120 pdbhh T Viruses T 6mon 2 C,D C,D LYS-LEU-NLE-SER-LYS-ARG-GLY KLXSKRG 7 T 32 YuiB pdbhh F T 6mpv 3 C C PfRipr XXXXXXXXXXXXXXXX 16 F F F 6mpv 4 D D PfRipr XXXXX 5 F F F 6mpw 1 A,B,C,D,E A,B,C,D,E mini-eVgL membrane protein DSLKWIVFLLFLIVLLLLAIVFLLRGX 27 T 0.002 RCR pdbhh F T 6mpz 2 E,F,G,H M,N,O,P peptide aldehyde inhibitor 1 based on the ProcA2.8 leader peptide GNLSDDELEGVAGX 14 T 0.00047 L_biotic_typeA pdbhh F T 6mq2 1 A,B,C,D,E A,B,C,D,E mini-eVgL membrane protein DSLKWIVFLLFLIVLLLLAIVFLLRGX 27 T 0.002 RCR pdbhh F T 6mqc 3 E,F C,D ENV_HV1H2 HIV fusion peptide residue 512-519 AVGIGAVF 8 T 4.2 DUF3918 pdbhh T Viruses F 6mqe 3 E,F C,D ENV_HV1H2 HIV fusion peptide AVGIGAVF 8 T 4.2 DUF3918 pdbhh T Viruses F 6mqm 3 C,F,I,L C,F,I,L ENV_HV1H2 HIV Env fusion peptide residue 512-519 AVGIGAV 7 T 2.5 DUF3918 pdbhh T Viruses F 6mqr 3 C A ENV_HV1H2 HIV fusion peptide residue 512-519 AVGIGAVF 8 T 4.2 DUF3918 pdbhh T Viruses F 6mqs 3 E,F E,F ENV_HV1H2 HIV fusion peptide residue 512-519 AVGIGAVF 8 T 4.2 DUF3918 pdbhh T Viruses F 6mrq 2 B I inhibitor from Tityus obscurus scorpion venom (TopI1) ILKRCKTYDDCKDVCKARKGKCEFGICKCMIK 32 T 0.012 Toxin_2 pdbhh F T 6mrr 1 A A Foldit1 GWSTELEKHREELKEFLKKEGITNVEIRIDNGRLEVRVEGGTERLKRFLEELRQKLEKKGYTVDIKIE 68 T 0.0019 HMA pdb F T 6mrs 1 A A Peak6 GSGRQEKVLKSIEETVRKMGVTMETHRSGNEVKVVIKGLHESQQEQLKKDVEETSKKQGVETRIEFHGDTVTIVVRE 77 T 0.0072 Phage_TAC_5 pdb F T 6ms1 2 C,D C,D APC C-terminus peptide GSYLVTSV 8 T 0.068 EB1_binding pdbhh F T 6ms4 2 B B DENR_HUMAN DRP,PROTEIN DRP1,SMOOTH MUSCLE CELL-ASSOCIATED PROTEIN 3,SMAP-3 GDYPLRVLYCGVCSLPTEYCEYMPDVAKCRQWLEKNFPNEFAKLTV 46 T 0.012 PHM7_cyt unppssm F Eukaryota T 6ms7 2 B B PRGC1_HUMAN PGC1 LXXLL motif PSLLKKLLLAP 11 T 10 Neurokinin_B pdbhh F Eukaryota F 6mse 20 T v substrate XXXXXXXXXXXXXXXXXXXXXXXXKXXX 28 T 6700 EF-hand_5 pdbhh F F 6msg 20 T v substrate XXXXXXXXXXXXXXXXXXXXXXXX 24 F F F 6msh 19 S v substrate XXXXXXXXXXXXXXXXXXXXXX 22 F F F 6msj 19 S v substrate XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 36 F F F 6msk 19 S v substrate XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 36 F F F 6msm 2 B B Piece of Molecule-1 XXXXXXXXXXXXXXXXX 17 F F F 6msp 1 A A De novo Designed Protein Foldit3 MGHHHHHHENLYFQSHMTDELLERLRQLFEELHERGTEIVVEVHINGERDEIRVRNISKEELKKLLERIREKIEREGSSEVEVNVHSGGQTWTFNEK 97 T 0.0032 DUF6175 pdb F T 6mt3 2 B B NP338 peptide FEDLRVLSF 9 T 0.74 Flu_NP pdbhh F T 6mt4 3 C C NP338-L7S peptide FEDLRVSSF 9 T 9.1 EKR pdbhh F T 6mt5 3 C C NP338-V6L peptide FEDLRLLSF 9 T 2.6 Flu_NP pdbhh F T 6mt6 2 B B NP388 peptide FEDLRVLSF 9 T 0.74 Flu_NP pdbhh F T 6mtl 3 C C NP338 peptide FEDLRVLSF 9 T 0.74 Flu_NP pdbhh F T 6mtm 3 C C NP338 influenza peptide FEDLRVLSF 9 T 0.74 Flu_NP pdbhh F T 6mtu 2 C,D C,D CRCM_HUMAN PROTEIN MCC PHTNETSL 8 T 8.4 TPR_MLP1_2 unphh F Eukaryota T 6mtv 2 C,D D,E CRCM_HUMAN Colorectal mutant cancer protein PHTNETSL 8 T 8.4 TPR_MLP1_2 unphh F Eukaryota T 6mv5 3 C P PCSK9_HUMAN NEURAL APOPTOSIS-REGULATED CONVERTASE 1, NARC-1, PROPROTEIN CONVERTASE 9, PC9, SUBTILISIN/KEXIN-LIKE PROTEASE PC9 XKDEDGDYEELVLALRSEEDGLA 23 T 7.9 PIN7 pdbhh F Eukaryota T 6mvz 1 A A Linear precursor of pseudoxylallemycin A XFXF 4 T 65 DUF4554 pdbhh F F 6mw0 1 A A Mle-Phe-Mle-D-Phe Linear tetrapeptide related to pseudoxylallemycin A XFXX 4 T 65 DUF4554 pdbhh F F 6mw1 1 A A Pseudoxylallemycin A XFXF 4 T 65 DUF4554 pdbhh F F 6mw2 1 A A pseudoxylallemycin A XFXX 4 T 65 DUF4554 pdbhh F F 6mw3 2 C,D I,J Ribonucleoside-diphosphate reductase NrdF beta subunit XXXXXXXX 8 F F F 6mw6 1 A A Citrocin GGVGKIIEYFIGGGVGRYG 19 T 8.1 Bac_chlorC pdbhh F T 6mwm 1 A A R1AB_BCHK4 NSP3, PAPAIN-LIKE PROTEINASE SHMQTPETAFINNVTSNGGYHSWHLVSGDLIVKDVCYKKLLHWSGQTICYADNKFYVVKNDVALPFSDLEACRAYLTSRAA 81 T 0.25 DUF3954 pdb T Viruses T 6mwz 2 C M ALA-HIS-HIS-HIS-HIS-ALA AHHHHA 6 T 85 DUF3399 pdbhh F F 6mxf 1 A A THCL_STRAJ Thiostrepton XIAXASXTXXXXTXXXXXX 19 T 0.93 CCER1 pdbhh F Bacteria F 6my1 1 A A GOME_ACAGO gomesin QCRRLCYKQRCVTYCRGRX 19 T 0.0046 PanZ unp F Eukaryota T 6my2 1 A A GOME_ACAGO gomesin QCRRLCYKQRCVTYCRGRX 19 T 0.0046 PanZ unp F Eukaryota T 6my3 1 A A GOME_ACAGO gomesin QCRRLCYKQRCVTYCRGRX 19 T 0.0046 PanZ unp F Eukaryota T 6myd 2 B,D B,D STING_DANRE STING CTT, Transmembrane protein 173 EPVETTDY 8 T 15 Swm2 pdbhh F Eukaryota T 6mye 2 B B ARHGQ_HUMAN SH3 DOMAIN-CONTAINING GUANINE EXCHANGE FACTOR XKPNGLLITDFPX 13 T 0.82 DUF1968 pdbhh F Eukaryota T 6mzc 10 J Z poly(UNK) XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 238 F F F 6mzd 12 L Y poly(UNK) XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 96 F F F 6mzl 16 U Y poly(UNK) XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 96 F F F 6mzl 17 V Z poly(UNK) XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 238 F F F 6mzm 17 Q Z Unk XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 238 F F F 6mzn 1 A A E7FDB6_DANRE Transforming growth factor beta receptor III GSPCELLPVGVGHPVQAMLKSFTALSGCASRGTTSHPQEVHIINLRKGSAQGAREKTAEVALHLRPIQSLHVHQKPLVFILNSPQPILWKVRTEKLAPGVKRIFHVVEGSEVHFEVGNFSKSCEVKVETLPHGNEHLLNWAHHRYTAVTSFSELRMAHDIYIKVGEDPVFSETCKIDNKFLSLNYLASYIEPQPSTGCVLSGPDHEQEVHIIELQAPNSSSAFQVDVIVDLRPLDGDIPLHRDVVLLLKCEKSVNWVIKAHKVMGKLEIMTSDTVSLSEDTERLMQVSKTVKQKLPAGSQALIQWAEENGFNPVTSYTNTPVANHFNLRLREHHHHHH 338 T 0.17 DUF108 pdbpssm F Eukaryota T 6mzp 1 A,B A,B E7FDB6_DANRE Transforming growth factor beta receptor III GSPCELLPVGVGHPVQAMLKSFTALSGCASRGTTSHPQEVHIINLRKGSAQGAREKTAEVALHLRPIQSLHVHQKPLVFILNSPQPILWKVRTEKLAPGVKRIFHVVEGSEVHFEVGNFSKSCEVKVETLPHGNEHLLNWAHHRYTAVTSFSELRMAHDIYIKVGEDPVFSETCKIDNKFLSLNYLASYIEPQPSTGCVLSGPDHEQEVHIIELQAPNSSSAFQVDVIVDLRPLDGDIPLHRDVVLLLKCEKSVNWVIKAHKVMGKLEIMTSDTVSLSEDTERLMQVSKTVKQKLPAGSQALIQWAEENGFNPVTSYTNTPVANHFNLRLREHHHHHH 338 T 0.17 DUF108 pdbpssm F Eukaryota T 6n05 1 A,B A,B A0A425B3G2_NEIME AcrIIC2 MNTIHHHHHNTSGSGGGGGRLVPRGSMSENLYFQGSMSKNNIFNKYPTIIHGEARGENDEFVVHTRYPRFLARKSFDDNFTGEMPAKPVNGELGQIGEPRRLAYDSRLGLWLSDFIMLDNNKPKNMEDWLGQLKAACDRIAADDLMLNEDAADLEGWDD 159 T 0.05 SfsA_N pdbpercent F Bacteria T 6n0s 1 A A A3DM20_STAMF MCRB GPMNKILGFSKYWVEINNWILPTLDHIGLTLWGMIKKHASEYRGIRYSLEKFGELKIIHYIGSRASHDLGKTFIGESVLSKENNKIVKEYSWPEIKKVLRKIFLDNGISSDTIDQYFIAIRRIIRPSRSDRFFLFRLAEYRKYENPVKYDQVRDIISHITWTGRYLVPIRPEDYEAIHSRG 181 T 2.9 Colicin_Pyocin pdbhh F Archaea T 6n16 3 I,J,K,L E,F,G,I ENV_HV1H2 HIV fusion peptide (512-519) AVGIGAVF 8 T 4.2 DUF3918 pdbhh T Viruses F 6n2b 1 A,B A,B E4S4B2_CALKI Tapirin MKTSTTYGGESLTEAYLYYFNQICEDAREAAYSYYFDGSGNFKSTYIGGKVSPQNEPKRIWDDLTAGHITQDEAKDRILSGMREIIVSEVNNFMNGLPSSISFKVSPSSPITINKLDDLKNYILKELKDISNFGVGSFTVSSWSAGDVKGYTVEFEVYKEQNTGTSPAKDTVRNMRIDIAVNKGVMPDIGTLNPTSSTSSWNDLFEYAVYSRGSFLPNYKFTVRGGSIYSGERIQTQGEFKAIGVNNLICKGPEVIVNGGGNSIEIKEIMYIQNKLVFNGAPNTNPNTLNANKIYTGLGGMELNGYGYYKANEIYSDGEVQVKNYGNFEIGSIGIVKKLTVTDNGRTTIKSGATLYCDQLEVRNNGRVFIEAGATLVTRAISISGGTIEGPGTRQVNPSATFPSYPPFIDDIKNFDFDSRMSVTTLPADPVGATTLGSVYDKSATPWEIVVYGESGINDSELITEVNSKLGSFPSNVRLYLASKGNITFSNPTSLPLYNPTTGKLVIEGAIITLGSTFNINISGAGIELIYKRAGSTIESSITSTLNYIPPPRSYSSSSAQTVNTMYQVKRRGMIIK 577 T 0.00018 PilX_N unphh F Bacteria T 6n2c 1 A,B A,B E4Q7C4_CALH1 Tapirin MLTSLIHSKETINKTQTSTAADSAMEYILFYISKAIAQAKRLTYAQFFDSTGRLIYTGDSFENDYLNTFNSYIADFFENRGNRIGIDMKLADNSSVQVSNVSELILLARQSCEYISNISFSRSGNSYILEVEALDSTTKTKRVERCVFTIPSPFEKVEIVSNSSSPDTLLPYLLAWDSNIFDFTTYGLFSSDKIIFNNNITVTTRNMYSSSDITLRSDNNRPGDYTIKADNIIVKNGSFIFGGNNKVVVNNLMYTKNGITFNGNNNRLESNSLLFSDGTISLSGKDEIVANALFCDTLDIRNGSSNLVTINEFAYFNKLNIWTDKMVLKSNSKLFGGDIEIRNDGILSADVGTVVYANNLDIIGSSATIDAPDTVLYCNNLKIDGEVKLNVKKIVCSGTITISNLNSGTNIRVSDKIECRSIPQNIPSGIRNLFVQNPNVNFQIPYPTIPAIIEEIKKNTFPTNWIRLDNIVEDKKDINGANYYSLVSTGQNSNDINEIFNKNKPNNPHSNVQIFVITKSGINVPPDQNHLDGVLIANGSLQFNGGNLNIEYVRMPQPLIDYLLSKNIIKIENVQPPVISNPTVTFLPRDVNLFIIARHFVVK 603 T 0.00019 PilX_N unphh F Bacteria T 6n37 1 A,B,C,D,E,F,G,H,I,J A,B,C,D,E,F,G,H,I,J TADBP_HUMAN TDP-43, SEGA MNFGAFSINPAMMAAAQAALQSSWGMMGMLASQQNQSGPSGNNQNQGNMQ 50 T 0.0075 Glucosaminidase pdbpercent F Eukaryota T 6n38 3 D,E K,L Unassigned protein XXXXXXXXXXXXXXXXXXXXX 21 F F F 6n3a 1 A,B,C,D,E,F,G,H,I,J A,B,C,D,E,F,G,H,I,J TADBP_HUMAN TDP-43, SEGA MNFGAFSINPAMMAAAQAALQSSWGMMGMLASQQNQSGPSGNNQNQGNMQ 50 T 0.0075 Glucosaminidase pdbpercent F Eukaryota T 6n3a 2 K,L,M,N,O,P,Q,R,S,T K,L,M,N,O,P,Q,R,S,T TADBP_HUMAN segA long small GMLASQQNQS 10 T 0.29 Glucosaminidase unppercent F Eukaryota T 6n3b 1 A,B,C,D,E,F,G,H,I,J A,B,C,D,E,F,G,H,I,J TADBP_HUMAN TDP-43 MNFGAFSINPAMMAAAQAALQSSWGMMGMLASQQNQSGPSGNNQNQGNMQ 50 T 0.0075 Glucosaminidase pdbpercent F Eukaryota T 6n3c 1 A,B,C,D,E,F,G,H,I,J,K,L,M,N,O,P,Q,R,S,T A,B,C,D,E,F,G,H,I,J,K,L,M,N,O,P,Q,R,S,T TADBP_HUMAN TDP-43, SEGB QGGFGNSRGGGAGLGNNQGSNMGGGMNFGEFSINPAMMAAAQAALQ 46 T 0.29 Glucosaminidase unppercent F Eukaryota T 6n3e 2 B B SF3B1_HUMAN SF3b1 U2AF ligand motif NRWDETP 7 T 0.022 SF3b1 pdbhh F Eukaryota T 6n3f 2 C D SF3B1_HUMAN SF3b1 U2AF ligand motif SRWD 4 F F Eukaryota F 6n4p 1 A,B A,C TAU_HUMAN NEUROFIBRILLARY TANGLE PROTEIN, PAIRED HELICAL FILAMENT-TAU, PHF-TAU RQEFEV 6 T 20 BING4CT pdbhh F Eukaryota F 6n60 6 G M MCJA_ECOLX MCCJ25 GGAGHVPEYFVGIGTPISFYG 21 T 0.13 Endonuc-BglII unp F Bacteria T 6n61 8 I I Capistruin GTPGFQTPDARVISRFGFN 19 T 4.4 P_C10 pdbhh F T 6n64 2 G,H G,H Uncharacterized peptide from Structural maintenance of chromosomes flexible hinge domain-containing protein 1 XXXXXXXXXXXXXXXXXXXXX 21 F F F 6n68 1 A A PROTO_AGEPP AGELAIA-CHEMOTACTIC PEPTIDE,AGELAIA-CP ILGTILGLLKGLX 13 T 0.47 DUF445 unphh F Eukaryota T 6n7o 1 A,B B,A Q7WSG2_9VIRU GIL01 gp7 GSMRDKLLDFIIELSQSSKQVVSKSYVIDRLMQVTKEDYKELEKNVEGKKDD 52 T 14 SpoOE-like pdbhh T Viruses T 6n7p 8 H H SNU71_YEAST U1 small nuclear ribonucleoprotein component SNU71,U1 small nuclear ribonucleoprotein component SNU71,Snu71 MRDIVFVSPQLYLSSQEGWKSDSAKSGFIPILKNDLQRFQDSLKHIVDARNSXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 105 T 1.1 PUD1_2 pdbpssm F Eukaryota T 6n7q 2 B C RON2 peptide CWTTRMSPPMQIP 13 T 3.2 Antimicrobial23 pdbhh F T 6n7r 8 H H SNU71_YEAST U1 small nuclear ribonucleoprotein component SNU71,U1 small nuclear ribonucleoprotein component SNU71,Snu71 MRDIVFVSPQLYLSSQEGWKSDSAKSGFIPILKNDLQRFQDSLKHIVDARNSXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 89 T 0.2 Vps5 pdbpssm F Eukaryota T 6n87 2 B C backbone-cyclised peptide bcRON2hp CWTTRMSPPMQIP 13 T 3.2 Antimicrobial23 pdbhh F T 6n8c 1 A,B,C,D A,B,C,D HD_HUMAN HUNTINGTON DISEASE PROTEIN,HD PROTEIN ATLEKLMKAFESLKSFQQQQQQQ 23 T 2 Mito_fiss_reg unphh F Eukaryota T 6n8k 1 A s Ribosomal Protein uL1 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 217 F F F 6n8l 2 B s Ribosomal Protein uL1 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 217 F F F 6n8m 18 R S Ribosomal Protein uL1 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 217 F F F 6n8n 21 U S Ribosomal Protein uL1 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 217 F F F 6n8o 23 W S Ribosomal Protein uL1 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 217 F F F 6n9j 2 B,D C,D Covalently bound peptide inhibitor XVLTX 5 T 1200 DUF592 pdbhh F F 6n9t 2 B,D E,F Photo-affinity peptide DCAWHLGELXWCT 13 T 0.46 YbjM pdbhh F T 6nah 2 AB,BB,CA,CB,DA,DB,EA,FA,GA,HA,IA,JA,KA,LA,MA,NA,OA,PA,QA,RA,SA,TA,UA,VA,WA,XA,YA,ZA 1,2,c,3,e,4,f,g,h,i,j,k,l,m,n,o,p,q,r,s,t,u,v,w,x,y,z,0 Acyldepsipeptide-14 XXPXAX 6 T 430 Gag_p6 pdbhh F F 6nbq 13 M P V5V507_9CYAN Proton-translocating NADH-quinone dehydrogenase subunit P NdhP MDAVISVKPILLAMTPVFILLCLFFGTRNGFYDTDQYHGNGSAH 44 T 0.12 SPC25 unppercent F Bacteria T 6nbx 16 P P V5V507_9CYAN Proton-translocating NADH-quinone dehydrogenase subunit P NdhP MDAVISVKPILLAMTPVFILLCLFFGTRNGFYDTDQYHGNGSAH 44 T 0.12 SPC25 unppercent F Bacteria T 6nbx 17 Q Q Proton-translocating NADH-quinone dehydrogenase subunit Q NdhQ MATDFNRGIMKFDGADSPAMIAISAVLILGFIAGLIWWALHTAYA 45 T 0.019 FixS unp F T 6nby 16 P P Proton-translocating NADH-quinone dehydrogenase subunit P NdhP MDAVISVKPILLAMTPVFILLCLFFGTRNGFYDTDQYHGNGSAH 44 T 0.12 SPC25 unppercent F T 6nby 17 Q Q Proton-translocating NADH-quinone dehydrogenase subunit Q NdhQ MATDFNRGIMKFDGADSPAMIAISAVLILGFIAGLIWWALHTAYA 45 T 0.019 FixS unp F T 6ncl 1 A a0 Q98550_PBCV1 P14 MQTPSIIQCGLLNSFARKMTDAISDNQIIATSRFFNIARDVADVVVSNTKLAQQYEQLSIDSLKEYLVSVAKFVAVDYSNTTSADVDDLIHKLRLFIEEECYQYNIDKEETCDGDVCVSDEYNEPAPKPKPKPKPKPAPKPKPAPKPKPAPKPAPKPAPKPAPKPAPKPAPKPAPKPAPEPAPEPAPEPAPKPAPEPAPEPAPIRPARRCDENPSNLETCCTNKALYGDFTDSSCDIVKKKTNWWLWGGIAILVIVLMIGGYFIYKRYFSAPKFENTGEFVNDMNFNNDVNFNNDVNFDNDMNYGNEGIDVSDLEILNLPVPSVSPVPSASIVPSVSPIPRGSPVPSASPIK 352 T 0.0076 Trypan_PARP pdbpssm T Viruses T 6ncl 3 C,D a2,a3 Q98505_PBCV1 P10 MMNFILVLLIVAMIGTILVSESKYLFSKPVCKNCGVKAVTLPVDISAGKLAKVAEAVKKQTEEIKTLLKQKQSAPKAPELTNPIEHIKASTTVVSGANGLENVIDEDLPFSDFKGVPVAETTVEGMIKGIRPPTYADPRVMNPALAAAPVQFSDPTQFGTFGVTDDVSPAFSTEDKIPKTNAKISSDISVEGYENSYDANGARLVMDGKVVKSECQLPSYQIRNSKHHTQLPMRSLNEPPPMVEDLVDESLFEGLQGYPVDEKLDLLTPPGTATPSSEWAAINYGLTNN 289 T 0.11 DUF4330 pdbpssm T Viruses T 6ncl 4 E a4 Q84580_PBCV1 P7 MQIYSEYYEKIGPRKLRLLVKRRLLFASTWLYINNYILLSIIMKLQTKHMILLGFVAVVVVFIIFMLTRKKKEGFSIGNIFGKVKGAVTGTVGKVVNVVKPQGYKPEFVNRVNFGKFWACPEGTTDWGSEDKQCLVSQYGPMMWRNKGGNEWGWSCPAGSAPNNSDDWNQKCVQGYSMKKLIDGQWRCTDTEIDTGKDWSNSDWFTAQQQCDRGNNKVFTRRMYIDGKWQCPDGTWDTGFTWSDGENGGKQCKYYP 256 T 0.014 Wzy_C pdb T Viruses T 6ncl 5 F a5 Q84523_PBCV1 P6 MILVGIAVLILLAVFAILYYKQKEKFVVVGKFVEPIPSNPGQDFTLLPMDQTYTFADPVPDTATAFDVVLSRFTDKKAPADLLKGATFPEAAPYTDSEVENISKLALSRVKGPDAPVLSFISVEYAAKGVDNKKNTHYDIAFMVYDQVKNFSLKLVLVAVLDAKNKLWIKKFSSFNSFTPKDKGPKGVENIDETPLAEFIPDFVQFSRLYKDNANV 216 T 3.7 Mid2 pdbhh T Viruses T 6ncl 6 G a6 Q84626_PBCV1 P1 MVETTQHFVSIESSNRPDPANTTPANYSIQLPQRYRNIWSAMLVNIALPAVSPPQKYVYLDIDKLNSIDSTSPSGGVNFALAKIPLSIAGTGNVFFADTMTSSFPNVPLQNPVATMDKLNIKLKDANGNVLTIPAGNEHSFMIQLTCGDYIPRGGGSTITQNGRVLGGTR 170 T 1.8 DUF2433 pdbhh T Viruses T 6ncl 7 H a7 Q84459_PBCV1 P12 MGNGPPMERAVSSDDILTYYNTFIFFIYFNFTNENIYIIYTIYMKVQNTIVYIVLLLIVVVIIWNFTRKEGWSDYNAPNDFMKIYYSNIVEDKKLAEKYPFFGTGPFTGLRCRKPNNVGCNTTWVSGQLVELTPKLKEQIECKFGIQYVKT 151 T 0.042 ID pdbpercent T Viruses T 6ncl 8 I a8 Q98576_PBCV1 P5 MDSRLSAAYAIRAARISMIPGGVDGLVINYAEGGEPAWVQYPLKKQKPLPNNLCYTPTLEDIARKREAVIAKYTKQPLETGTTFTHVLNASHLNEQYTRVKKSALPDKEFPIIETEKYPEPPILWETTIGAPSRLFDRSDGVKYVR 146 T 8.9 XRN1_DBM pdbhh T Viruses T 6ncl 9 J,K,KD,L,M,N,O,P,R,S,T,U a9,b0,l5,b1,b2,b3,b4,b5,b7,b8,c0,c1 Q84666_PBCV1 P11 MDMHMIVKVVAILAVLFLVYKLWESMNKPNASPLKIQNPYEKYMNSAEGGEYDAEDDDIYYPETDAEDDDIYTGETDDMYDGEDDDIYVQEGDDIEDAEDEPYDDSADMEQDVPKVQQPMMPLLTPSSQLLPKPSPEAADFAQFAPKNLQAQNFLTATQWIGVNTQGSSLKNANYDLRADPIIPKADVGPWMMSSVDPNIYQKPLFG 207 T 0.14 FeoB_associated pdbhh T Viruses T 6ncl 11 V,W,X,Y c2,c3,c4,c5 O41054_PBCV1 P4 MFSAFRDTASIGFSDTHQDEKTLRFLKKQISQFIKHLKEYYPNNELTKKLVMKYSDVQLLPYTKGATKDTYTSGLFDHTTGVIKIAPRDGLGNVRDEQSLNKSICHELAHGTRVKYPGESSHSDEWKDAWKTFLKIAADELGWKIEVPCSSVSFYGLTKDDCENCVWDQDPETCPKTAKLA 181 T 0.002 WLM pdbhh T Viruses T 6ncl 12 AA,BA,Z c7,c8,c6 Q98573_PBCV1 P3 MAMKTQRKENVLFQNVKPREIPLVDNPFSTYPYKHVITETQPTQAKNQAIWGLVQMGLSGEAAAMYGDVVVQKTTRACRKSEGGFKDVNTELWGTSPYLGRGDGEVYNMPASNQLLRGFESSLRGSRVRTQIDDKSFIPYTWQMIDVPLAAAKTSFIAGLDTRQQLAYGNP 171 T 3.2 B_solenoid_ydck pdbhh T Viruses T 6ncl 15 JD l4 Q98473_PBCV1 P13 MHKITPFLIAAVVAVIVLAVWLFKKDNKKETWFSRDLNYGKANSKIWNATVAKGLKGIANENAEIRKMYPYLGYGDFTGAICKGPNNQGCTYYANYTR 98 T 0.0083 DUF4381 pdbpssm T Viruses T 6ncp 3 E E ENV_HV1H2 HIV-1 Fusion Peptide (residues 512-520) AVGIGAVFL 9 T 2.2 OAD_gamma pdbhh T Viruses T 6ncp 4 F F His-tag of fusion peptide HHHHH 5 T 5700 zinc_ribbon_2 pdbhh F F 6nd4 5 E I UTP8_YEAST Utp8 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXRLFKQAIVTCPNLPLNELLEELFSIRNRELLLDISFRILQDFTRDSIKQEMKKLSKLDVQNFIEFITSGGEDSSPECFNPSQSTQLFQLLSLVLDSIGLFSLEGALLENLTLYIDKQVEIAERNTELWNLIDTKGFQHGFASSTFDNGTSQKRALPTYTMEYLDI 519 T 8.500000000000002E-245 Utp8 unppssm F Eukaryota T 6nd4 28 DA x Unidentified fragment XXXXXXXXXXXXXXXXXXXXXXXX 24 F F F 6ndy 2 F G Designed Cyclic Peptide GGDEIVNKVLGGSSGGXXXXXXXXGGKGCK 30 T 19 Eno-Rase_NADH_b pdbhh F T 6nef 1 A A OMCS_GEOSL OUTER MEMBRANE CYTOCHROME S FHSGGVAECEGCHTMHNSLGGAVMNSATAQFTTGPMLLQGATQSSSCLNCHQHAGDTGPSSYHISTAEADMPAGTAPLQMTPGGDFGWVKKTYTWNVRGLNTSEGERKGHNIVAGDYNYVADTTLTTAPGGTYPANQLHCSSCHDPHGKYRRFVDGSIATTGLPIKNSGSYQNSNDPTAWGAVGAYRILGGTGYQPKSLSGSYAFANQVPAAVAPSTYNRTEATTQTRVAYGQGMSEWCANCHTDIHNSAYPTNLRHPAGNGAKFGATIAGLYNSYKKSGDLTGTQASAYLSLAPFEEGTADYTVLKGHAKIDDTALTGADATSNVNCLSCHRAHASGFDSMTRFNLAYEFTTIADASGNSIYGTDPNTSSLQGRSVNEMTAAYYGRTADKFAPYQRALCNKCHAKD 407 T 9.8E-05 Cytochrom_NNT unphh F Bacteria T 6neo 1 A B PPRA_DEIRA PLEIOTROPIC PROTEIN PROMOTING DNA REPAIR GMQTDGIYAAFDTLMSTAGVDSQIAALAASEADAGTLDAALTQSLQEAQGRWGLGLHHLRHEARLTDDGDIEILTDGRPSARVSEGFGALAQAYAPMQALDERGLSQWAALGEGYRAPGDLPLAQLKVLIEHARDFETDWSAGRGETFQRVWRKGDTLFVEVARPASAEAALSKAAWKVIASIKDRAFQRELMRRSEKDGMLGALLGARHAGAKANLAQLPEAHFTVQAFVQTLSGAAARNAEEYRAALKTAAAALEEYQGVTTRQLSEVLRHGLRES 278 T 8.6 Ldt_C pdbhh F Bacteria T 6nf2 1 A,G,Q A,G,Q Q2N0S6_9HIV1 Envelope glycoprotein gp120 AENLWVTVYYGVPVWKDAETTLFCASDAKAYETEKHNVWATHACVPTDPNPQEIHLENVTEEFNMWKNNMVEQMHTDIISLWDQSLKPCVKLTPLCVTLQCTNVTNNITDDMRGELKNCSFNMTTELRDKKQKVYSLFYRLDVVQINENQGNRSNNSNKEYRLINCNTSACTQACPKVSFEPIPIHYCAPAGFAILKCKDKKFNGTGPCPSVSTVQCTHGIKPVVSTQLLLNGSLAEEEVMIRSENITNNAKNILVQFNTPVQINCTRPNNNTRKSIRIGPGQAFYATGDIIGDIRQAHCNVSKATWNETLGKVVKQLRKHFGNNTIIRFANSSGGDLEVTTHSFNCGGEFFYCNTSGLFNSTWISNTSVQGSNSTGSNDSITLPCRIKQIINMWQRIGQCMYAPPIQGVIRCVSNITGLILTRDGGSTNSTTETFRPGGGDMRDNWRSELYKYKVVKIEPLGVAPTRCKRRVVGRRRRR 480 T 3.4E-54 GP120 pdbpercent T Viruses T 6nfj 3 C,F C,F FGF19_HUMAN FGF-19 PMLPMVPEEPEDLRGHLESDMFSSPLETDSMDPFGLVTGLEAVRSPSFEK 50 T 4.6 Mrx7 pdbhh F Eukaryota T 6nfw 1 A A POLG_PVYN VPg GKNKSKRIQALKFRHARDKRAGFEIDNNDDTIEEFFGSAYRKKGKGKGTTVGMGKSSRRFINMYGFDPTEYSFIQFVDPLTGAQIEENVYADIRDIQERFSEVRKKMVENDDIEMQALGSNTTIHAYFRKDWSDKALKIDLMPHNPLKVCDKTNGIAKFPERELELRQTGPAVEVDVKDIPAQEVEHE 188 T 0.1 DUF4447 pdb T Viruses T 6nhw 1 A,B,C,D,E,F A,B,C,D,E,F TR10B_HUMAN DEATH RECEPTOR 5,TNF-RELATED APOPTOSIS-INDUCING LIGAND RECEPTOR 2,TRAIL-R2 MPGSLSGIIIGVTVAAVVLIVAVFVCKSLLWKKVLP 36 T 0.16 Psg1 pdbpssm F Eukaryota T 6nhy 1 A,B,C A,B,C TR10B_HUMAN DEATH RECEPTOR 5,TNF-RELATED APOPTOSIS-INDUCING LIGAND RECEPTOR 2,TRAIL-R2 MPGSLSGIIIYVTVAAVVLIVAVFVCKSLLWKKVLP 36 T 0.016 Psg1 pdbpssm F Eukaryota T 6ni2 5 E V V2R_HUMAN V2R,AVPR V2,ANTIDIURETIC HORMONE RECEPTOR,RENAL-TYPE ARGININE VASOPRESSIN RECEPTOR ARGRTPPSLGPQDESCTTASSSLAKD 26 T 16 DUF6352 pdbhh F Eukaryota T 6nid 2 D,E,F D,E,F NRX1A_HUMAN NEUREXIN I-ALPHA,NEUREXIN-1-ALPHA KKNKDKEYYV 10 T 8 Topo_Zn_Ribbon pdbhh F Eukaryota T 6nii 1 A,B B,A Uncharacterized protein RavD GPLGSMNLKAEVFLNQNCAEMMIKKAAQLILGSDLDFEYTRGVQDIQVDLGPAFMFSPDEEKTLWVSGKNQETLEKDLATLNKSSVYFFRTGTQGGAGHWQVLYYEAAKSGWVSYSSQSNHFQVTDSNGKLTASGKGLLVPHANWGKENGNYAFLLVNASAENIIHAANFVYILRTQNEVAAIEYCALNHEFHPEIKRTARAKAE 205 T 16 DUF2846 pdbhh F T 6nir 2 E X HOV protease fragment XXXXXX 6 F F F 6nj8 2 E,F,G E,F,G targeting peptide TVGSLIQ 7 T 6.6 Chlorosome_CsmC pdbhh F T 6njd 2 B,D B,D A0A509GV61_LEGPN RavD GPLGSMNLKAEVFLNQNCAEMMIKKAAQLILGSDLDFEYTRGIQDIQVDLGPAFMFSPDEEKTLWVSGKNQETLEKDLATLNKSSVYFFRTGTQGGAGHWQVLYYEAAKSGWVSYSSQSNHFQVTDSNGKLTASGKGLLVPHANWGKENGNYAFLLVNASAENIIHAANFVYILRTQNEVAAIEYCALNHEFHPEIKRTARAKAE 205 T 19 SWC7 pdbhh F Bacteria T 6njl 3 E,G E,G A'/C' auxiliary proteins XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 153 F F F 6njm 3 E,G E,G A'-C' auxiliary proteins XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 153 F F F 6njm 5 I,M I,M 5B2 Fab Light Chain XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 218 F F F 6njm 6 J,N J,N 5B2 Fab Heavy Chain XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 247 F F F 6njn 4 E,G E,G A'-C' auxiliary proteins XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 153 F F F 6njn 9 L,M L,M 5B2 Fab XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 224 F F F 6njv 1 A A B0RTN2_XANCB Xcc_CTR_I RDEEDLQRYIDVTRGEIFFSRGVILVEGDAERFIVPAFAEVLNIPLDMLGITVCSVGGTNFTPYVKLLGPEGLNIPHVILTDRDPTNGNHPLVRRRLINVLDVIEGGVDHEELDADEVIKLAEQYGYFVNENTLEPELFAGGLAEDMQEVIREELPRLRRETLNALQQWVDDPAQIDEDLLLRLIERIGKGRFAQALAPSVSEDVCPAYIRSALEHIRDAIALEHHHHHH 230 T 0.0022 DUF3226 pdbhh F Bacteria T 6njw 1 A A B0RTN2_XANCB Xcc_ctr_pt RDEEDLQRYIDVTRGEIFFSRGVILVEGDAERFIVPAFAEVLNIPLDMLGITVCSVGGTNFTPYVKLLGPEGLNIPHVILTDRDPTNGNHPLVRRRLINVLDVIEGGVDHEELDADEVIKLAEQYGYFVNENTLEPELFAGGLAEDMQEVIREELPRLRRETLNALQQWVDDPAQIDEDLLLRLIERIGKGRFAQALAPSVSEDVCPAYIRSALEHIRDAIALEHHHHHH 230 T 0.0022 DUF3226 pdbhh F Bacteria T 6njx 1 A A B0RTN2_XANCB Xcc_ctr_Hg RDEEDLQRYIDVTRGEIFFSRGVILVEGDAERFIVPAFAEVLNIPLDMLGITVCSVGGTNFTPYVKLLGPEGLNIPHVILTDRDPTNGNHPLVRRRLINVLDVIEGGVDHEELDADEVIKLAEQYGYFVNENTLEPELFAGGLAEDMQEVIREELPRLRRETLNALQQWVDDPAQIDEDLLLRLIERIGKGRFAQALAPSVSEDVCPAYIRSALEHIRDAIALEHHHHHH 230 T 0.0022 DUF3226 pdbhh F Bacteria T 6njz 2 C,D C,D YSA-GSGSK-bio peptide YSAYPDSVPMMSGSGSK 17 T 6.1 DUF4810 pdbhh F T 6nk0 2 C,D C,D bA-WLA-Yam XWLAYPDSVPYX 12 T 4.1 DUF3052 pdbhh F T 6nk1 2 C,D C,D bA-WLA-YRPKbio XWLAYPDSVPYRPK 14 T 6.5 DUF3052 pdbhh F T 6nk2 2 C,D C,D bA-WLA-YPRKbio peptide XWLAYPDSVPYRPK 14 T 6.5 DUF3052 pdbhh F T 6nk9 1 A A Aca Toxin 1 CGGAGAKCSTKSDCCSGLWCSGSGHCYHRRYT 32 T 1.2E-05 Toxin_30 pdbhh F T 6nkp 2 C,D C,D bA-WLA-YSKbio peptide XWLAYPDSVPYSK 13 T 3.9 FXR_C1 pdbhh F T 6nl1 1 A A B6SBM0_9TRYP Mitochondrial edited mRNA stability factor 1 GSHMRKQLFFTLARPCVAVGRRFISGDNKSIDSSAFISDDDALRGELASALDTEGHALPFDVHLQQPHSSGDGTAGDTSTIQLEKLSHPPARFDLLTNSFVYKWQTKAALARKVSGPMREWAAELKYRTGVHIELEPTYPERLSENAVKGSGSDDGDGTQWGAYETADDVDITVYLFGSERGIFNCHKLMEAAIQQDPVYVRLGIFRRLANSSEVEWLMLRRINRELRPPDIPPISLKLPGKWTLLYERYKEAAIRTLWEETGITVDASNVYPTGHLYQTVPQYYWRVPVRYFVAEVPSDIRVEGPQVVPLQYMRNWDARLLRQSPDPIDRAWAQLADPATGCAWMKASMIDQLQKPLRGDNYMAIRYTPPPYSNLQEVVGLGDGSITPSTGNGEDAS 398 T 0.00034 NUDIX pdb F Eukaryota T 6nm2 1 A A WW291 peptide WWWLRKIWX 9 T 0.32 DUF6273 pdbhh F T 6nm3 1 A A WW295 peptide RKIWWWWLX 9 T 0.57 DUF5976 pdbhh F T 6nmc 3 C,D B,C A0A5H1ZR46_9GAMM AcrVA1 SKAMYEAKERYAKKKMQENTKIDTLTDEQHDALAQLCAFRHKFHSNKDSLFLSESAFSGEFSFEMQSDENSKLREVGLPTIEWSFYDNSHIPDDSFREWFNFANYSELSETIQEQGLELDLDDDETYELVYDELYTEAMGEYEELNQDIEKYLRRIDEEHGTQYC 165 T 0.0032 ZnuA pdbpssm F Bacteria T 6nmd 3 C B A0A5H1ZR47_9GAMM AcrVA1 MSKAMYEAKERYAKKKMQENTKIDTLTDEQHDALAQLCAFRHKFHSNKDSLFLSESAFSGEFSFEMQSDENSKLREVGLPTIEWSFYDNSHIPDDSFREWFNFANYSELSETIQEQGLELDLDDDETYELVYDELYTEAMGEYEELNQDIEKYLRRIDEEHGTQYCPTGFARLR 174 T 0.0043 ZnuA pdbpssm F Bacteria T 6nnv 2 E,F,G,H I,J,K,L macrocyclic peptide XFXXXDVXYXWYLCKX 16 T 0.51 CNPase pdbhh F T 6nox 1 A A SFTI-KLK5 Peptide GFCHRSYPPECWPN 14 T 1.1 Bowman-Birk_leg pdbhh F T 6npo 2 B C Unknown peptide ligand XXXX 4 F F F 6npw 3 E E RPB1_YEAST Ser2/Ser5 phosphorylated peptide SPSYSPTSPSYSPTSPSYS 19 T 1.8E-05 RNA_pol_Rpb1_R pdbhh F Eukaryota F 6npz 2 C,D F,G bisubstrate GRPRTTXFAE 10 T 7.5 AT_hook pdbhh F T 6nq3 2 B,F B,F SUZ12_HUMAN CHROMATIN PRECIPITATED E2F TARGET 9 PROTEIN,CHET 9 PROTEIN,JOINED TO JAZF1 PROTEIN,SUPPRESSOR OF ZESTE 12 PROTEIN HOMOLOG MEHVQADHELFLQAFEKPTQIYRFLRTRNLIAPIFLHRTLTYMSHRNSRTNIKRKTFKVDDMLSKVEKMKGEQESHSLSAHLQLTFTGFFHKNDKPSPNSENEQNSVTLEVLLVKVCHKKRKDVSCPIRQVPTGKKQVPLNPDLNQTKPGNFPSLAVSSNEFEPSNSHMVKSYSLLFRVTRPGRREFNGMINGETNENIDVNEELPARRKRNREDGEKTFVAQMTVFDKNRRLQLLDGEYEVAMQEMEECPISKKRATWETILDGKRLPPFETFSQGPTLQFTLRWTGETNDKSTAPIAKPLATRNSESLHQENKPGSVKPTQTIAVKESLTTDLQTRKEKDTPNENRQKLRIFYQFLYNNNTRQQTEARDDLHCPWCTLNCRKLYSLLKHLKLCHSRFIFNYVYHPKGARIDVSINECYDGSYAGNPQDIHRQPGFAFSRNGPVKRTPITHILVCRPKRTKASMSEFLEWSHPQFEK 478 T 0.17 zf_C2H2_6 unphh F Eukaryota T 6nq3 4 D,H D,H JARD2_HUMAN JUMONJI/ARID DOMAIN-CONTAINING PROTEIN 2 LSKRKPKTEDFLTFLCLRG 19 T 0.86 GMAP pdbhh F Eukaryota T 6nqw 1 A A B0STJ8_LEPBP Flagellar coiling protein A GSAKDQVDELLKGELVPENDDAELTEDQKKKKKEIMEQESLWKNPDFKGYNKTFQELHQLSKTFANNQFRLALSNYQSGVNTIMKNRDWVEQYRKEEAEKKRLDEKWYWQKVDRKAREERVVYREKMKAKQDALNYFSKAINHLDEIKNPDLRERPEFKRLLSDVYRSWIMAEYDLQNLPQTIPILELYIEIDDNEKEYPAHKYLASAYSFEENMIKKTKGPDDMLFKYRYKKNVHLLRATELKYGKDSPEYKHIVNVINRDEVISVAQ 269 T 0.0087 TED_complement pdbpercent F Bacteria T 6nqx 1 A,B,C,D A,B,C,D B0STJ8_LEPBP Flagellar coiling protein A GSAKDQVDELLKGELVPENDDAELTEDQKKKKKEIMEQESLWKNPDFKGYNKTFQELHQLSKTFANNQFRLALSNYQSGVNTIMKNRDWVEQYRKEEAEKKRLDEKWYWQKVDRKAREERVVYREKMKAKQDALNYFSKAINHLDEIKNPDLRERPEFKRLLSDVYRSWIMAEYDLQNLPQTIPILELYIEIDDNEKEYPAHKYLASAYSFEENMIKKTKGPDDMLFKYRYKKNVHLLRATELKYGKDSPEYKHIVNVINRDEVISVAQ 269 T 0.0087 TED_complement pdbpercent F Bacteria T 6nqy 1 A,B A,B B0STJ8_LEPBP Flagellar coiling protein A GSAKDQVDELLKGELVPENDDAELTEDQKKKKKEIMEQESLWKNPDFKGYNKTFQELHQLSKTFANNQFRLALSNYQSGVNTIMKNRDWVEQYRKEEAEKKRLDEKWYWQKVDRKAREERVVYREKMKAKQDALNYFSKAINHLDEIKNPDLRERPEFKRLLSDVYRSWIMAEYDLQNLPQTIPILELYIEIDDNEKEYPAHKYLASAYSFEENMIKKTKGPDDMLFKYRYKKNVHLLRATELKYGKDSPEYKHIVNVINRDEVISVAQ 269 T 0.0087 TED_complement pdbpercent F Bacteria T 6nqz 1 A,B A,B Q72RA0_LEPIC Flagellar coiling protein B GSGSQQNSGSDQKSQPSSAQLGQSILETERKLDEKIFELNQRLTRHTVLMKMKVRVLPFRTVLFKGKANNDECTPAINQEDPANNCIRVEVYDFIRDEERGLNKNVQGALAKYMEIYFEGQNSNDPEPRTEPPRNINKLKSKIYKNNMVLEDKIISEVMDRGPNTQPSHNDKVEVFFQKDNYPEYGRPETPAEKGVGKYILAGVENTKTHPIRNSFKKEFYIKHLDQFDRLFTKIFDYNDQLGNENYKENVDALKDSLRY 260 T 0.074 VTC unppssm F Bacteria T 6nsv 2 C,D C,D ACE-LEU-TRP-TRP-PRO-ASP XLWWPD 6 T 0.32 Kindlin_2_N pdbhh F F 6nsx 2 B B Hsh155 SRWDVK 6 T 33 DUF6507 pdbhh F T 6nu2 46 TA l RM54_HUMAN MRP-L54,MITOCHONDRIAL LARGE RIBOSOMAL SUBUNIT PROTEIN ML54 SREYWRRLRKQNIWRHNRLSKNK 23 T 4.9 PPV_E2_N pdbhh F Eukaryota T 6nu2 53 AB t Unknown protein/protein extension XXXXXXXXXXXXXXXXXXXXXXXXXXXX 28 F F F 6nu3 53 AB t Unknown protein/protein extension XXXXXXXXXXXXXXXXXXXXXXXXXXXX 28 F F F 6nuw 7 G I CENPU_YEAST ASSOCIATED WITH MICROTUBULES AND ESSENTIAL PROTEIN 1,CENP-U HOMOLOG,CONSTITUTIVE CENTROMERE-ASSOCIATED NETWORK PROTEIN AME1 MDRDTKLAFRLRGSHSRRTDDIDDDVIVFKTPNAVYREENSPIQSPVQPILSSPKLANSFEFPITTNNVNAQDRHEHGYQPLDAEDYPMIDSENKSLISESPQNVRNDEDLTTRYNFDDIPIRQLSSSITSVTTIDVLSSLFINLFENDLIPQALKDFNKSDDDQFRKLLYKLDLRLFQTISDQMTRDLKDILDINVSNNELCYQLKQVLARKEDLNQQIISVRNEIQELKAGKDWHDLQNEQAKLNDKVKLNKRLNDLTSTLLGKYEGDRKIMSQDSEDDSIRDDSNILDIAHFVDLMDPYNGLLKKINKINENLSNELQPSLHHHHHH 330 T 0.0029 DUF1640 pdb F Eukaryota T 6nuw 10 K X Inner kinetochore subunit Mcm22 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 126 F F F 6nuw 11 L M Inner kinetochore subunit Mcm16 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 75 F F F 6nuw 12 M U Unknown (unassigned) XXXXXXXXXXXXXXXXXXXXXXX 23 F F F 6nw8 1 A A A0A5H1ZR48_CENNO Cn29 LCLSCRGGDYDCRVKGTCENGKCVCGS 27 T 0.0047 EGF_2 pdb F Eukaryota T 6nwe 2 B B ILENLKDVGLF peptide CT2 ILENLKDVGLF 11 T 2.1 Phage_holin_4_1 pdbhh F T 6nwk 2 B B PRGC1_HUMAN PPARGC-1-ALPHA,LIGAND EFFECT MODULATOR 6 PSLLKKLLLAPA 12 T 13 Neurokinin_B pdbhh F Eukaryota F 6nwl 2 B B PRGC1_HUMAN PPARGC-1-ALPHA,LIGAND EFFECT MODULATOR 6 PSLLKKLLLAPA 12 T 13 Neurokinin_B pdbhh F Eukaryota F 6nxf 2 C,D V,U ANR31_MOUSE Ankyrin repeat domain 31 SSRESMQTIPHYLQIKEILQISKQELLPCHVMEQHWKFYVGRSHSEALLSW 51 T 15 DUF525 pdbhh F Eukaryota T 6nyy 2 G,H,I,J G,H,I,J Substrate AAAAAAAAAAA 11 T 240 Ribosomal_L12_N pdbhh F F 6nz2 1 A A BCD1_YEAST Box C/D snoRNA protein 1 GPHMRDSTECQRIIRRGVNCLMLPKGMQRSSQNRSKWDKTMDLFVWSVEWILCPMQEKGEKKELFKHVSHRIKETDFLVQGMGKNVFQKCCEFYRLAGTSSCIEGEDGSETKEERTQILQKSGLKFYTKTFPYNTTHIMDSKKLVELAIHEKCIGELLKNTTVIEFPTIFVAMTEADLPEGYEVLHQE 188 T 0.045 MobA_MobL unppssm F Eukaryota T 6o09 2 B,D,F,H,J,L I,B,E,G,J,L K7MRE7_SOYBN Uncharacterized protein YPLVQTKIIDFFRIQRSPEA 20 T 11 DUF1378 pdbhh F Eukaryota T 6o0c 1 A,B,C A,B,C Design construct XAA_GVDQ mutant M4L GSHLGDLKYSLERLREILERLEENPSEKQIVEAIRAIVENNAQIVEAIRAIVENNAQIVENNRAIIEALEAIGVDQKILEEMKKQLKDLKRSLERG 96 T 0.00067 PLU-1 pdbpercent F T 6o0i 1 A,B,C A,B,C Design construct XAA GSHMGTEDLKYSLERLREILERLEENPSEKQIVEAIRAIVENNAQIVEAIRAIVENNAQIVENNRAIIEALEAIGGGTKILEEMKKQLKDLKRSLERG 98 T 0.0013 KinB_sensor pdbpercent F T 6o1o 1 A A Csm1 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 716 F F F 6o1o 2 B,G,H B,G,H Csm4 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 279 F F F 6o1o 3 C,D,E,F C,D,E,F Csm3 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 218 F F F 6o1o 4 I,J,K,L I,J,K,L Csm2 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 118 F F F 6o1q 1 A A NPHP1_HUMAN JUVENILE NEPHRONOPHTHISIS 1 PROTEIN GPMAMLARRQRDPLQALRRRNQELKQQVDSLLSESQLKEALEPNKRQHIYQRCIQLKQAIDENKNALQKLSKADESAPVANYNQRKEEEHTLLDKLTQQLQGLAVTISRENITEVGAPT 119 T 0.012 DUF6100 pdbpssm F Eukaryota T 6o1v 2 B B Unknown Peptide XXXXXXXXXXXXXXXXX 17 F F F 6o21 2 B B ACYCLIC SFTI-FCQR(ASN14) GFCQRSIPPICFPN 14 T 0.051 Bowman-Birk_leg pdb F T 6o23 3 E E CSP_PLAFO CS NANPNANPNANPNANPNANP 20 T 3.2 PT unppercent F Eukaryota F 6o24 4 D I CSP_PLAFO CS NANPNANPNANP 12 T 0.48 Cas_Cas7 pdbhh F Eukaryota F 6o25 4 J,K,L D,H,L CSP_PLAFO CS NANPNANPNANP 12 T 0.48 Cas_Cas7 pdbhh F Eukaryota F 6o26 3 C C CSP_PLAFO CS PNRNVDENANANSA 14 T 54 Tir_receptor_M pdbhh F Eukaryota T 6o28 3 E,F E,F CSP_PLAFO CS KQPADGNPDPNANPN 15 T 0.25 PT unppercent F Eukaryota T 6o29 3 E,F C,D CSP_PLAFO CS NPDPNANPNVDPNANP 16 T 1.3 Cas_Cas7 pdbhh F Eukaryota F 6o2a 4 J,K,L,M F,G,K,L CSP_PLAFO CS NANPNVDPNANP 12 T 0.17 Cas_Cas7 pdbhh F Eukaryota F 6o2b 4 L,M,N,O F,G,K,L CSP_PLAFO CS NVDPNANPNVDP 12 T 0.025 PT unppercent F Eukaryota F 6o2c 4 D C CSP_PLAFO CS NANPNANPNANP 12 T 0.48 Cas_Cas7 pdbhh F Eukaryota F 6o2k 1 A,B A,B Q9VHP9_DROME Centromeric protein-C, isoform A TPLRDEQEEASTKLMQWLRGVGDAPPSASMSDENASVSSANELIFCQVDGIDYAFYNTKEKAMLGYMRFKPYQKRSMKQAKVHPLKLLVQFGEFNVETLAVGEEKEVHSVLRVGDMIEIDRGTRYSIQNAIDKVSVLMCIRS 142 T 1.8E-05 CENP-C_C pdbhh F Eukaryota T 6o2p 2 B B Unknown Peptide XXXXXXXXXXXXXXXXX 17 F F F 6o33 2 B B peptide XEXXRX 6 T 810 Proteasome_A_N pdbhh F F 6o34 2 B B peptide XXXWXX 6 T 64 DPCD pdbhh F F 6o35 1 A,B,C,D A,B,C,D de novo designed WSHC8 GSSAEELLRRSREYLKKVKEEQERKAKEFQELLKELSERSEELIRELEEKGAASEAELARMKQQHMTAYLEAQLTAWEIESKSKIALLELQQNQLNLELRHI 102 T 0.024 MitMem_reg pdb F T 6o38 2 G,H,I,J,K,L G,H,K,L,M,N A0A2T7FJI6_ACINO Type II secretion chaperone CpaB MQSSSALTFSPESRQQSGAKMIESQNILNLSPSEKERLSQQQIVFNEVEKDQLHSKANFPLLKNAKGMVIKYDPKVIELKKVGDTVKFQMLEYGINRTGKIVEIEPVDQDIVRWTGRFDQGDPNQNFFTITQSQKDHYTIMQIFTEKGNYSAEIKDGVGLVQTMDEGVTDQELHHDHP 178 T 0.18 DUF2969 pdbpssm F Bacteria T 6o3h 2 H,I,J,K,L,M,N H,I,J,K,L,M,N A7XXR5_9CAUD P74-26 Head Decoration Protein MDKIQLFRTIGRVQYWERVPRLHAYGVFALPFPMDPDVEWGNWFAGPHPKAFLVSVHPSGPKAGHVYPTDLSDPDSVANVIGMVLDGHDYEADHNVTVTLRAAVPIEYVQQGIEAPPLQPDPAVLNAAPQLKLKVIKGHYFFDYTR 146 T 0.78 TRI9 pdbpssm T Viruses T 6o3n 1 A,B,C,D,E,F,G,H,I,J,K,L,M,N,O,P A,B,C,D,E,F,G,H,I,J,K,L,M,N,O,P Cross-alpha Amyloid-like Structure alphaAmA XSKLLELLRKLAEALHKAIELLEKWGX 27 T 1.4 BssS pdbhh F T 6o3w 1 A,B C,D SEN1_YEAST TRNA-SPLICING ENDONUCLEASE POSITIVE EFFECTOR EQEDDYKLPMEYIT 14 T 2.1 DUF3228 pdbhh F Eukaryota T 6o3x 2 D,E,F D,E,F SEN1_YEAST TRNA-SPLICING ENDONUCLEASE POSITIVE EFFECTOR DDEDDYTPSISD 12 T 1.9 CM1 pdbhh F Eukaryota T 6o3y 2 D,E,F D,E,F SEN1_YEAST TRNA-SPLICING ENDONUCLEASE POSITIVE EFFECTOR EAEDPYDLNPHPQ 13 T 1.1 Caveolin pdbhh F Eukaryota T 6o43 1 A A Q859J2_9CAUD Orf11 MNDQEKIDKFTHSYINDDFGLTIDQLVPKVKGYGRFNVWLGGNESKIRQVLKAVKEIGVSPTLFAVYEKNEGFSSGLGWLNHTSARGDYLTDAKFIARKLVSQSKQAGQPSWYDAGNIVHFVPQDVQRKGNADFAKNMKAGTIGRAYIPLTAAATWAAYYPLGLKASYNKVQNYGNPFLDGANTILAWGGKLDGKGGSPS 200 T 0.25 PPR_3 pdb T Viruses T 6o4m 1 A,D D,C MEL_APIME MLT,ALLERGEN API M 3,ALLERGEN API M III GXGXXXXXXXXGXXXXXXXXXXXXXXX 27 F F Eukaryota F 6o5l 1 A A L0A1P5_DEIPD PprA MRSGSHHHHHHRSDITSLYKKAGLENLYFQGREDALRGFDALMATAGVESTIVKHAASGADSQTLNDELTRSLQLAHDRWGLGLLHLRHEARLDRGEDTDVILLVDGREVARLSQGAAAISATYETMRAQNADDLSDWGVLPEGHRVTLKAGNNQMRVLVEDARDFETHWSSERGGAFVRTWRQGETLAVEVHRPASPGTALAKAAWKAIMSIKDRNFQRELMERSNSVGMLGALLGARHKDAGRALERLPEAHFAVRSTVVRMTGGAQREFDQWRSMVREGLDQLDELQKTTTRHLTEILRHGLK 306 T 0.46 ZapA pdbpercent F Bacteria T 6o5o 2 C,D C,D ACE-QNGFDNPNYQPQENMQA XQNGFDNPNYQPQENMQA 18 T 1 APP_amyloid pdbhh F T 6o7g 2 B A Histone H4 XGKGGAXRHRKVX 13 T 23 DUF4196 pdbhh F T 6o81 8 O,P d,e Translation initiation factor eiF2 beta-subunit XXXXXXXXXXXXXX 14 F F F 6o85 8 M d Eukaryotic translation initiation factor 2 subunit beta XXXXXXXXXXXXXX 14 F F F 6o8c 2 C,D D,E STING_HUMAN HSTING,ENDOPLASMIC RETICULUM INTERFERON STIMULATOR,ERIS,MEDIATOR OF IRF3 ACTIVATION,HMITA,TRANSMEMBRANE PROTEIN 173 STWGSLKTSAVPSTSTMSQEPELLISGMEKPLPLRTDFS 39 T 9.9 Herpes_IE68 pdbhh F Eukaryota T 6o8p 1 A X Circular bacteriocin, circularin A/uberolysin family AGKEKIRKKLKNEIKKKGRKAVIAW 25 T 0.00084 Bacteriocin_IId pdbhh F T 6o8r 1 A X Circular bacteriocin, circularin A/uberolysin family AWKEKIRKKLKNEIKKKGRKAVIAW 25 T 0.0011 Bacteriocin_IId pdbhh F T 6o8s 1 A X Circular bacteriocin, circularin A/uberolysin family AGKEKIRKKLKNEIKKKWRKAVIAW 25 T 0.0011 Bacteriocin_IId pdbhh F T 6o8t 1 A X Circular bacteriocin, circularin A/uberolysin family AWKEKIRKKLKNEIKKKWRKAVIAW 25 T 1.6 Bacteriocin_IId pdbhh F T 6o9b 3 C C CTNB1_HUMAN BETA-CATENIN TTAPSLSGK 9 T 6.2 LEA_6 pdbhh F Eukaryota T 6o9c 3 C C CTNB1_HUMAN BETA-CATENIN TTAPFLSGK 9 T 20 Fst_toxin pdbhh F Eukaryota T 6oab 2 F H poly(alanine) substrate AAAAAAAAAAAAAAAAAAAAAAAA 24 T 670 DUF4699 pdbhh F F 6oax 2 G P Alpha-S1-casein XXXXXXXXXXXXXXXXXXXXXXXXXX 26 F F F 6oay 2 G P Alpha-S1-casein XXXXXXXXXXXXXXXXXXXXXXXXXX 26 F F F 6obi 1 A A A0A5H1ZR50_MERUN Myosin-VI KQQEEEAERLRRIQEEMEKERKRREEDEQRRRKEEEERRMKLEMEAKRKQEEEERKKREDDEKRIQAE 68 T 9.6 Caldesmon pdbpssm F Eukaryota T 6obk 1 A A D3WAF4_BPLP2 Uncharacterized protein ORF47 MNKEHILAQKEVLTPIEYEHYVKHLFDIGEITKELYIELSSDL 43 T 2.1 DUF6442 pdbhh T Viruses T 6obq 2 C,D C,D Microcystin LR XLXRXXX 7 T 49 Phage_TAC_2 pdbhh F F 6obr 2 C,D D,E Microcystin LR XLXRXXX 7 T 49 Phage_TAC_2 pdbhh F F 6obu 2 C,D C,D Microcystin LR XLXRXXX 7 T 49 Phage_TAC_2 pdbhh F F 6ocp 2 P,Q,R P,Q,R GABR2_HUMAN GB2,G-PROTEIN COUPLED RECEPTOR 51,HG20 QLPILHHAYLPSIGG 15 T 30 UCH_C pdbhh F Eukaryota T 6ocx 2 E,F,G,H F,H,J,L Peptide inhibitor UNC10245109 DGGSFWYRAMKALYG 15 T 1.1 OCIA pdbhh F T 6od0 2 C,D D,E Peptide inhibitor UNC10245092 SFWYGAMKALYG 12 T 6.2 DUF5806 pdbhh F T 6od2 1 A A SPC42_YEASB Spindle pole body component SPC42 SDDDIMMYESAELKRVEEEIEELKRKILVRKKHDLRKLSLNNQLQELQSMMDG 53 T 0.11 TPD52 pdb F Eukaryota T 6odj 1 A,B,C,D,E,F,G,H,I,J,K,L,M,N,O,P,Q A,B,C,D,E,F,G,H,I,J,K,L,M,N,O,P,Q PolyAla Model of PRC from H.pylori XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 300 F F F 6oe6 1 A,B A,B Q72Q74_LEPIC Uncharacterized protein MAHHHHHHCFKPTGEFGWVLLDEEKFNIIEKKIMTVGEYTITRKNLIFPDDKTICYIYRFSRSVSESAETYVSLSKFQLGYNEMDVLRKRPNPVSQTIEGSFQGLSPGKYLLKVAYEGDVIDEVEFLVRSTRTPYIEDTSSSADDIEKAMK 151 T 1.3 DUF4969 unphh F Bacteria T 6oef 1 A,B,C,D,E,F,G,H,I,J,K,L,M,N A,B,C,D,E,F,G,H,I,J,K,L,M,N PolyAla Model of OMCC O-Layer XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 732 F F F 6oeh 1 A,B,C,D,E,F,G,H,I,J,K,L,M,N A,B,C,D,E,F,G,H,I,J,K,L,M,N PolyAla Model of OMCC I-Layer XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 470 F F F 6ofa 1 A A KKX1U_UROMN Wasabi Receptor Toxin ASPQQAKYCYEQCNVNKVPFDQCYQMCSPLERS 33 T 0.15 OATP unppssm F Eukaryota T 6ofq 2 B B SER-THR-SER-ALA STSA 4 T 600 GrpB pdbhh F F 6og3 2 D P Alpha S1-casein XXXXXXXXX 9 F F F 6ohz 1 A A Q04PE5_LEPBJ Uncharacterized protein MAHHHHHHMTEIDDLLRKNPELQKEWKRTVWTAAISSGVIAYRPPLLERAFREFPMETAKSALNLFVAAHKSKNRQSVDIITQNLKDAKTFPLGQLEEEIVTDILKYPNLLEKLLQTGWNPNLILEWEKHKSLSQNSKRSHRRPEILIKSNGKEFIEKQETTLLILAMQNDFIPMETVQILLKYGADPSLGVKRKSEGKEYLLYPLANINSNGNTILKELKQKTLIDWKK 230 T 0.098 Spore_III_AB unppssm F Bacteria T 6oi4 3 C,F E,F PSMD1_HUMAN 26S PROTEASOME REGULATORY SUBUNIT RPN2, 26S PROTEASOME REGULATORY SUBUNIT S1, 26S PROTEASOME SUBUNIT P112, RPN2 PQEPEPPEPFEXID 14 T 6.5 PrmC_N pdbhh F Eukaryota T 6oig 47 UA x P1 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 47 F F F 6oig 48 VA y P2 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 46 F F F 6ois 2 C,D,E,F C,D,E,F DMS3_ARATH PROTEIN INVOLVED IN DE NOVO 1 MADLYPTGQQISFQTTPLNVQDPTRMMNLDQSSPVARNETQNGGGIAHAEFAMFNSKRLESDLEAMGNKIKQHEDNLKFLKSQKNKMDEAIVDLQVHMSKLNSSPTPRSENSDNSLQGEDINAQILRHENSAAGVLSLVETLHGAQASQLMLTKGVVGVVAKLGKVNDENLSQILSNYLGTRSMLAVVCRNYESVTALEAYDNHGNIDINAGLHCLGSSIGREIGDSFDAICLENLRPYVGQHIADDLQRRLDLLKPKLPNGECPPGFLGFAVNMIQIDPAYLLCVTSYGYGLRETLFYNLFSRLQVYKTRADMISALPCISDGAVSLDGGIIRKTGIFNLGNRDEVNVRFAKPTASRTMDNYSEAEKKMKELKWKKEKTLEDIKREQVLREHAVFNFGKKKEEFVRCLAQSSCTNQPMNTPRGTLESGKETAAAKFERQHMDSSTSAA 449 T 0.001 DUF724 unp F Eukaryota T 6oit 2 C,D,E,F C,D,E,F DMS3_ARATH PROTEIN INVOLVED IN DE NOVO 1 MADLYPTGQQISFQTTPLNVQDPTRMMNLDQSSPVARNETQNGGGIAHAEFAMFNSKRLESDLEAMGNKIKQHEDNLKFLKSQKNKMDEAIVDLQVHMSKLNSSPTPRSENSDNSLQGEDINAQILRHENSAAGVLSLVETLHGAQASQLMLTKGVVGVVAKLGKVNDENLSQILSNYLGTRSMLAVVCRNYESVTALEAYDNHGNIDINAGLHCLGSSIGREIGDSFDAICLENLRPYVGQHIADDLQRRLDLLKPKLPNGECPPGFLGFAVNMIQIDPAYLLCVTSYGYGLRETLFYNLFSRLQVYKTRADMISALPCISDGAVSLDGGIIRKTGIFNLGNRDEVNVRFAKPTASRTMDNYSEAEKKMKELKWKKEKTLEDIKREQVLREHAVFNFGKKKEEFVRCLAQSSCTNQPMNTPRGTLESGKETAAAKFERQHMDSSTSAA 449 T 0.001 DUF724 unp F Eukaryota T 6oit 3 G G CHR35_ARATH PROTEIN DEFECTIVE IN MERISTEM SILENCING 1,PROTEIN DEFECTIVE IN RNA-DIRECTED DNA METHYLATION 1 GEFFAVSNMLEALDSGKFGSVSKELEEIADMRMDLVKRSIWLYPSLAYTVFEAEKTMDGGGGSDYKDDDDK 71 T 5.4 CSTF2_hinge pdbhh F Eukaryota T 6oj0 1 A,AA,B,BA,C,CA,D,DA,E,EA,F,FA,G,GA,H,HA,I,IA,J,JA,K,KA,L,LA,M,MA,N,NA,O,OA,P,PA,Q,R,S,T,U,V,W,X,Y,Z T,J,U,K,V,L,W,M,X,a,G,b,n,c,o,d,p,e,q,f,r,A,g,B,N,C,O,D,P,E,Q,F,R,S,h,i,j,k,l,m,H,I A0A1W6I187_9VIRU Structural protein VP4 MSESVTQQVFNFAVTKSQPFGGYVYSTNLTASTSSAVTSTQLTPLNLSITLGQITLSGNSLVIPATQIWYLTDAYVSVPDYTNITNGAEADGVILIYKDGVKLMLTTPLISSMSISNPARTHLAQAVKYSPQSILTMYFNPTKPATASTSYPNTVYFTVVVVDFSYAQNPARAVVSANAVM 181 T 32 DUF1684 pdbhh T Viruses T 6oj0 2 QA Z A0A1W6I162_9VIRU Uncharacterized protein MLSLDNYSYVHNITTQTNIDLSSQQTIHLASINGKGYIIFLRFFCEGSSACFTNVKFSVKANGLVLYSFRYIQLLELGQAIATAIPSSSQGFSTLLSNYNVLISSPIGTLPQLTLYDSYDNRYGAMLQPAFPLPFVNTLSLDVDILPVSQSSYDPIPYSLNDNQISTNAPTGKGNISIEYLLYNCLV 187 T 7.3 Class_IIIsignal pdbhh T Viruses T 6ole 83 EC y CADH1_HUMAN CAM 120/80,EPITHELIAL CADHERIN,E-CADHERIN,UVOMORULIN GVCRKAAQPVEAGLQIPAILGILGGILALLILILLLLLF 39 T 0.016 ASFV_J13L unphh F Eukaryota T 6olf 81 CC y CADH1_HUMAN CAM 120/80,EPITHELIAL CADHERIN,E-CADHERIN,UVOMORULIN GVCRKAAQPVEAGLQIPAILGILGGILALLILILLLLLF 39 T 0.016 ASFV_J13L unphh F Eukaryota T 6olg 85 GC A CADH1_HUMAN CAM 120/80,EPITHELIAL CADHERIN,E-CADHERIN,UVOMORULIN GVCRKAAQPEEAGLQIPAILGILGGILALLILILLLLLF 39 T 0.016 ASFV_J13L unphh F Eukaryota T 6oli 83 EC y Nascent chain XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 36 F F F 6olo 1 A A Designed trimeric coiled coil peptide XQIAAIKXAIAAIKQQIAAIKEAIAAIKQX 30 T 0.092 DUF5320 pdbhh F T 6olz 82 DC A PCSK9_HUMAN NEURAL APOPTOSIS-REGULATED CONVERTASE 1,NARC-1,PROPROTEIN CONVERTASE 9,PC9,SUBTILISIN/KEXIN-LIKE PROTEASE PC9 SWWPLPLLLLLLLLLGPAGARAQEDE 26 T 0.55 Chi-conotoxin pdbhh F Eukaryota T 6om0 83 EC y PCSK9_HUMAN NEURAL APOPTOSIS-REGULATED CONVERTASE 1,NARC-1,PROPROTEIN CONVERTASE 9,PC9,SUBTILISIN/KEXIN-LIKE PROTEASE PC9 SWWPLPLLLLLLLLLGPAGARAQEDE 26 T 0.55 Chi-conotoxin pdbhh F Eukaryota T 6om4 2 C,D C,D MCCC7,MICROCIN C51,MICROCIN C MRTGNAD 7 T 22 YqzL pdbhh F T 6om7 83 EC y PCSK9_HUMAN NEURAL APOPTOSIS-REGULATED CONVERTASE 1,NARC-1,PROPROTEIN CONVERTASE 9,PC9,SUBTILISIN/KEXIN-LIKE PROTEASE PC9 SWWPLPLLLLLLLLLGPAGARAQEDE 26 T 0.55 Chi-conotoxin pdbhh F Eukaryota T 6omb 2 F G Substrate of Cdc48 XXXXXXXXXXXXXXXXXXXXXX 22 F F F 6omm 2 B L Peptide agonist WKYMVX 6 T 32 DUF5891 pdbhh F T 6on2 2 G G Bound Y2853 Substrate AAAAAAA 7 T 270 DUF4179 pdbhh F F 6oni 2 B D NCOR1_HUMAN NCOR isoform c DPASNLGLEDIIRKALMGSFDDK 23 T 3.3 RuvA_C pdbhh F Eukaryota T 6onj 2 B C MED1_HUMAN ACTIVATOR-RECRUITED COFACTOR 205 KDA COMPONENT,ARC205,MEDIATOR COMPLEX SUBUNIT 1,PEROXISOME PROLIFERATOR-ACTIVATED RECEPTOR-BINDING PROTEIN,PPAR-BINDING PROTEIN,THYROID HORMONE RECEPTOR-ASSOCIATED PROTEIN COMPLEX 220 KDA COMPONENT,TRAP220,THYROID RECEPTOR-INTERACTING PROTEIN 2,TRIP-2,VITAMIN D RECEPTOR-INTERACTING PROTEIN COMPLEX COMPONENT DRIP205,P53 REGULATORY PROTEIN RB18A NTKNHPMLMNLLKDNPAQD 19 T 8.5 HEAT pdbhh F Eukaryota T 6oo2 2 F G Designed Cyclic Peptide GGDEIVNKVLGGSSGGXXXXXXXXGGKGCK 30 T 19 Eno-Rase_NADH_b pdbhh F T 6opc 2 G G Substrate bound to the central pore of the Cdc48 hexamer XXXXXXXXXXXXXXXXXXXXXX 22 F F F 6opd 3 C C Melanoma antigen variant ILNAMIVKI 9 T 7.1 DUF4408 pdbhh F T 6opj 2 B B Peptide inhibitor 25 XARWXXPXXPXRR 13 T 47 DUF4632 pdbhh F F 6opm 3 G,H H,J unknown XXXXXXXXXXXXX 13 F F F 6oqp 1 A A SER-LYS-TRP-ILE-CYS-ALA-ASN-ARG-SER-VAL-CYS-PRO-ILE SKWICANRSVCPI 13 T 2.1 Sprouty pdbhh F T 6oqq 1 A,C A,C SIDJ_LEGPH Legionella pneumophila SidJ MFGFIKKVLDFFGVDQSEDNPSETAVETTDVSTKIKTTDTTQEESSVKTKTVVPTQPGGSVKPETIAPDQQKKHQIKTETTTSTTKQKGPKVTLMDGHVKQYYFARRGETSTHDTSLPPPVKVLSGRSIPLKEIPFEATRNELVQIYLTSIDKLIKSNKLNSIPSQQIASHYLFLRSLANSETDGIKKNQILSLAKPLGTYLASKEPHVWKMINELIEKSEYPIIHYLKNNRAHSNFMLALIHEYHKEPLTKNQSAFVQKFRDSSVFLFPNPIYTAWLAHSYDEDSSFNPMFRERLSTNFYHSTLTDNLLLRTEPKEVTLSSEHHYKKEKGPIDSSFRYQMSSDRLLRIQGRTLLFSTPQNDVVAVKVQKKGEPKSTLEEEFEMADYLLKHQRRLDVHSKLPQPLGQYSVKKSEILEISRGSLDFERFKTLIDDSKDLEVYVYKAPQSYFTYLHDKNQDLEDLTASVKTNVHDLFVLLREGIVFPQLADIFHTHFGEDEREDKGRYQALVQLLNVLQFQLGRIDKWQKAVEYVNLRSSGLADLGDSLPITSLFTSSDFTKHYFSELLTGGYHPTFFDKSSGTANSLFTGKRRLFGNYLYLNTIAEYLLVIQLTLGSYGDKVTRDMMDKPKKEAVWRELANVMFTSCAEAIHIMTGIPQSRALTLLKQRANIEKHFRQTQFWMTPDYSKLDEDTLQMEQYSIYSGEPEYEFTDKLVSGVGLSVDGVHQDLGGYNRESPLRELEKLLYATVTLIEGTMQLDKEFFKQLEQVEKILSGEIKTDANSCFEAVAQLLDLARPGCHFQKRLVLSYYEEAKLKYPSAPTDAYDSRFQVVARTNAAITIQRFWREARKNLSEKSDIDSEKPESERTTDKRL 873 T 0.34 IQ pdb F Bacteria T 6ore 6 F A FME-PHE-PHE MFF 3 T 110 DUF4985 pdbhh F F 6ori 1 A A Q9Z4N7_ENTFL SURFACE PROTEIN NAQMGEGRLANYSASGNTFQENPGYTKNYNFSDLQFNPKAITGDVLQGNTIDFEVYGKHNIAASTANWEIRLQLDERLAQYVEKIQVDPKKGVGNSRRTFVRINDSLGRPTNIWKVNYIRANDGLFAGAETTDTQTAPNGVITFEKNLDEIFKEIGADNLKSDRLMYRIYLVSHQDDDKIVPGIESTGYFLTDQDDFYNKLDVSENNSDQFKHGSVNTKYEEANIQTKDGSGSTGANGAIILDHKLTKEKNFSYSTSAKGTPWYANYKIDERLVPYVSGIQMHMVQADKVAYNVAFESGKKVADLAIERREGHENYGMGSITDNDLTKLIDFANASPRPIVVRYVLQLTKPLDEILEEMKAADKIEENAPFGEDFIFDSWLSDTNKKLIQNTYGTGYYYLQDIDGLEVLFQ 411 T 0.55 ARL6IP6 pdbpssm F Bacteria T 6orj 1 A A Q8SCZ8_BPDPK PHIKZ164 MDEAVSLLSNMQDSEIQTSEFRLWSIGRATENKPRNSFTLMVLPIESATATDGETTFNPVEEVVDGVDADGRAYTTKVSVSRDIPCIWLPNEDNRATPPDVMRGEKIAIYRLGDTSQFYWRSMGLSNDLRTLESVVYTFNASLSPGGAGKNFDTCYFMQFSAHDKHVTIGTSKANGEPYRYSVQINTGTGAVYILDDIGNRFELVSKDKRLMLMNADNSFVKVEKKAIDLNADQYIKLTSGGSTLELNPTEFKVNTTNTTIKSSGTHIQEAGGTMTHKAGGNMLFTAPRYDFT 293 T 0.0017 DUF2345 pdbpercent T Viruses T 6orl 53 AB H FME-PHE-PHE MFF 3 T 110 DUF4985 pdbhh F F 6os0 3 C B ANGT_HUMAN SERPIN A8 DRVYIHPF 8 T 0.67 Nairo_nucleo pdbhh F Eukaryota T 6os1 3 C B TRV023 peptide XRVYKHPA 8 T 9.5 DUF3782 pdbhh F T 6os2 3 C B TRV026 peptide XRVYYHPX 8 T 1.5 VEFS-Box pdbhh F T 6os9 2 B L JMV449 KKPYIL 6 T 36 TrmO pdbhh F F 6osa 2 B L JMV449 KKPYIL 6 T 36 TrmO pdbhh F F 6osk 56 DB 6 FME-PHE-PHE MFF 3 T 110 DUF4985 pdbhh F F 6ost 55 CB 6 FME-PHE-PHE MFF 3 T 110 DUF4985 pdbhh F F 6osw 1 A A Q7T2G3_DANRE FORKHEAD BOX M1-LIKE GEFMRESPRRPIILKRRKLPFAKSTARSFPDGIRVMDHPTMPDTQVVVIPKSADLQSVISVLTAKGKEAGPQGRNKFILLSGDTSAEEENLYFQ 94 T 0.042 uDENN pdbpssm F Eukaryota T 6osw 2 B B Q7T2G3_DANRE FORKHEAD BOX M1-LIKE GAQAGAANRSLTEGFVLDTMNDSLSKILVDISFSGLEDEDLGMGNISWSQFIPEAK 56 T 8 LRR_RI_capping pdbhh F Eukaryota T 6ot3 56 DB 6 FME-PHE-PHE MFF 3 T 110 DUF4985 pdbhh F F 6ov6 1 A,B,C A,B,C G2EBB4_9FLAO C24 PROTEIN MNKQNFLQTGGFPLETDTLNAMQEAYSVFNALGELAGNKAIIKGCVVSGSTTTDGVVYINGEVFKFVGGQTQSRVKILETSTSKEFEDGSTNAVHFERYVTFASGTGSISWAEFAKLTTLRELSRRLLPAGTNPQLYSGSVNNIPSGWQLCDGTNGTENLKGSFIVGYDPNDSDYNAIGKVGGTKKVTPSGNLDSRSINVTVPRDGWSTFGSGLGAVKSGRIVVGSGQQENSEYLESLRASGIDRTLTSTPHSHTFTGNQQDNRAPYYTLAYIIYIG 277 T 0.058 DUF859 pdbhh F Bacteria T 6ov7 2 C,D C,D kCAL01 peptide ANSRWQVTRV 10 T 2.1 DUF6245 pdbhh F T 6ovf 2 C,D C,D STA03 XQNGFDNPNYQPQ 13 T 0.34 APP_amyloid pdbhh F T 6ox6 2 B B A0A090A233_PSEAI PA14_01140 MAIEKGEAFARRDIYIDYDFEDVTYRWDHRQGTIHVRFYGEAESPEPVEHDNRLFNDALRFGREITREEYETGFPKG 77 T 7.9 LSM14 pdbhh F Bacteria T 6oxl 6 F n Unknown region of Adaptin ear-binding coat-associated protein 2 Ex-domain XXXXXXX 7 F F F 6oyl 2 B B KIF4A_HUMAN CHROMOKINESIN-A GHMELKHVATEYQENKAPGKKKKRALASNTSFFSGLEPIEEEPE 44 T 0.03 DUF3584 unphh F Eukaryota T 6p07 2 G G polyglutamate peptide EEEEEEEEEEEEEEE 15 T 22 CAC1F_C pdbhh F F 6p0f 1 A A C5A3Z3_THEGJ GTPase subunit of restriction endonuclease MENQLFIIGIGTGTDEYENFEETILKGVKRNELEGQIGPDILDNCCSDVCYFWGRSKETIYEKKIDKGDMVLFYVGKRISRNKVDLNQETAVYLGIICETVEISENDVSFLNDFWRKGENFRFLMFFKKKPEKLHHSINEINSKLGYNPDYFPIAGYVKPERMSGVYDILKNILKKRGILKESDS 185 T 62 Endonuc-EcoRV pdbhh F Archaea T 6p0g 1 A A C5A3Z3_THEGJ GTPase subunit of restriction endonuclease MENQLFIIGIGTGTDEYENFEETILKGVKRNELEGQIGPDILDNCCSDVCYFWGRSKETIYEKKIDKGDMVLFYVGKRISRNKVDLNQETAVYLGIICETVEISENDVSFLNDFWRKGENFRFLMFFKKKPEKLHHSINEINSKLGYNPDYFPIAGYVKPERMSGVYDILKNILKKRGILKESDS 185 T 62 Endonuc-EcoRV pdbhh F Archaea T 6p23 3 C C MHC I-peptide RXRAAAKKKYCL 12 T 1.5 SBP_bac_10 pdbhh F T 6p25 3 C D acceptor peptide PYTV 4 T 63 TcdB_toxin_midC pdbhh F F 6p27 3 C C MHC I-peptide RXRAAAKKKYCL 12 T 1.5 SBP_bac_10 pdbhh F T 6p2c 3 C C MHC I-peptide RXRARARARARAAAKKKYCL 20 T 0.054 TCP pdbhh F T 6p2f 3 C C MHC I-peptide RXRARAAAKKGYCL 14 T 3.9 KGG pdbhh F T 6p2s 3 C C MHC I-peptide RXAAAKKKYCL 11 T 2.9 RNF111_N pdbhh F T 6p3w 3 C,F E,F ORC1_HUMAN ORC1 Peptide ARKRLEL 7 T 0.97 MBD_C pdbhh F Eukaryota F 6p5b 1 A A Q5ZTL4_LEGPH MavC GPLGSMTTSKLEKTGLHVHEKIKHMVKNYGTMITGIPAEILGQNEAEISVGYVKKMGNMKENIAEVVRKSEMTQPTNSAGKASNEVCDLLLGTEGASEFEKSSYQVLSGDGSNLKGSLPNKNLLVRVEMDRFNAPQKYQKIKREEFNPETAEKNKIYLLEDQLVYLDIFGKVIDLGQTSDTCHRLFNAITTPFYQNYILYDEYIDPEESAEEAAMFEMGEIVKAKMKNIDCWTATHSFTIFVPESDSEDTRTLYPYQAYWTSHTLQQWFSGDKDEKLSRLGIDGYIEKLALLGTTTDSKIRSSIYGELFSPPGKEHVFCTGMNEKFSPLRVKFKVTEVNPEIALQNLEEVQEFIDTNYPGENAKDQCELYKIKAQEAMTKQLEMRLLIE 389 T 0.035 V-ATPase_H_N unppercent F Bacteria T 6p5h 1 A,B A,B Q5ZTL4_LEGPH MavC GPLGSRFNAPQKYQKIKREEFNPETAEKNKIYLLEDQLVYLDIFGKVIDLGQTSDTCHRLFNAITTPFYQNYILYDEYIDPEESAEEAAMFEMGEIVKAKMKNID 105 T 0.08 HUN pdbpssm F Bacteria T 6p5l 2 C D EZH2_HUMAN PRO-ARG-LYS-LYS-LYS-ARG-LYS-HIS PRKKKRKH 8 T 1.5 Rrn6 pdbhh F Eukaryota F 6p5r 1 A A B6SBM0_9TRYP Mitochondrial edited mRNA stability factor 1 MGSSHHHHHHSSGLVPRGSHMDDALRGELAMGSSHHHHHHSSGLVPRGSHMDDALRGELASALDTEGHALPFDVHLQQPHSSGDGTAGDTSTIQLEKLSHPPARFDLLTNSFVYKWQTKAALARKVSGPMREWAAELKYRTGVHIELEPTYPERLSENAVKGSGSDDGDGTQWGAYETADDVDITVYLFGSERGIFNCHKLMEAAIQQDPVYVRLGIFRRLANSSEVEWLMLRRINRELRPPDIPPISLKLPGKWTLLYERYKEAAIRTLWEETGITVDASNVYPTGHLYQTVPQYYWRVPVRYFVAEVPSDIRVEGPQVVPLQYMRNWDARLLRQSPDPIDRAWAQLADPATGCAWMKASMIDQLQKPLRGDNYMAIRYTPPPYSNLQEVVGLGDGSITPSTGNGEDAS 410 T 0.00034 NUDIX unp F Eukaryota T 6p60 3 C,F,I,L E,F,I,L ENV_HV1H2 HIV fusion peptide residue 512-519 AVGIGAVF 8 T 4.2 DUF3918 pdbhh T Viruses F 6p64 5 I,J C,H HHAT_HUMAN Neoantigen peptide KQWLVWLFL KQWLVWLFL 9 T 0.54 DUF446 pdbhh F Eukaryota T 6p6a 2 B,C A,C NIT2_NEUCR NITROGEN REGULATORY PROTEIN 2,NIT2 TISSKRQRRHSKS 13 T 12 DUF4543 pdbhh F Eukaryota T 6p6e 2 B,C B,C PACC_NEUCR PAC3 NLS FDARKRQFDDLNDFFGSVKRRQIN 24 T 1.1 FKS1_dom1 pdbhh F Eukaryota T 6p7e 5 Q,R U,V ASP-THR-ASP-PHE peptide DTDF 4 T 67 DUF29 pdbhh F F 6p7e 6 S W THR-ASP-PHE peptide TDF 3 T 130 zf-C2HE pdbhh F F 6p7h 3 C C ENV_HV1H2 HIV fusion peptide residues (512-519) AVGIGAVF 8 T 4.2 DUF3918 pdbhh T Viruses F 6p7o 1 A A D7Y2H5_ECOLX E. coli MS115-1 NucC MSDWSLSQLFASLHEDIQLRLGTARKAFQHPGAKGDASEGVWIEMLDTYLPKRYQAANAFVVDSLGNFSDQIDVVVFDRQYSPFIFKFNEQIIVPAESVYAVFEAKQSASADLVAYAQRKVASVRRLHRTSLPIPHAGGTYPAKPLIPILGGLLTFESDWSPALGMSFDKALNGDLSDGRLDMGCVASHGHFYFNNIDSKFNFEHGNKPATAFLFRLIAQLQFSGTVPMIDIDAYGKWLAN 241 T 0.71 NERD pdbhh F Bacteria T 6p7p 1 A,B,C A,B,C D7Y2H5_ECOLX E. coli MS115-1 NucC 2-241 SNASDWSLSQLFASLHEDIQLRLGTARKAFQHPGAKGDASEGVWIEMLDTYLPKRYQAANAFVVDSLGNFSDQINVVVFDRQYSPFIFKFNEQIIVPAESVYAVFEAKQSASADLVAYAQRKVASVRRLHRTSLPIPHAGGTYPAKPLIPILGGLLTFESDWSPALGMSFDKALNGDLSDGRLDMGCVASHGHFYFNNIDSKFNFEHGNKPATAFLFRLIAQLQFSGTVPMIDIDAYGKWLAN 243 T 0.71 NERD unphh F Bacteria T 6p7q 1 A,B,C A,B,C D7Y2H5_ECOLX E. coli MS115-1 NucC SNASDWSLSQLFASLHEDIQLRLGTARKAFQHPGAKGDASEGVWIEMLDTYLPKRYQAANAFVVDSLGNFSDQINVVVFDRQYSPFIFKFNEQIIVPAESVYAVFEAKQSASADLVAYAQRKVASVRRLHRTSLPIPHAGGTYPAKPLIPILGGLLTFESDWSPALGMSFDKALNGDLSDGRLDMGCVASHGHFYFNNIDSKFNFEHGNKPATAFLFRLIAQLQFSGTVPMIDIDAYGKWLAN 243 T 0.71 NERD unphh F Bacteria T 6p7v 1 A C Q6CK37_KLULA KLLA0F13816P MFDTKLFLSLPIDIRYTVYFFLGDVVQNVRPPAKSDIFNDELIAYPNIREFNQSLVDKYSKHIGVYDYIPNFIPNWCRDFDLLRHDIILTDRLRVCLQYEEQWFSVQWIVVSGELEIGIFTTDEQFLQVSYTINEYCHLLSIAQQDLRLGINVSDINDVNELCKEIQHRWLFDTVSYISFINCWDLDHENVVSIIPCMESFNNLHMLRIESKNMFNNLINTQGVRENPGKTIVYNVRQNIFELELYTLRDLGYKSVVDLQKWEQLQCLSLSGCEFIDLNNLILPQHCKMLILKEVKYIIWWDLSHLLKRIRPQWIINGQVKKPTKKEEEEESEWYNLYLEVVQTYQPLNFIELHNAKRVKGNLILPARLVTESRIKISNGTKVDSVLLI 389 T 0.024 DUF5420 pdbpssm F Eukaryota T 6p7w 2 B C Q6CK37_KLULA KLLA0F13816P MFDTKLFLSLPIDIRYTVYFFLGDVVQNVRPPAKSDIFNDELIAYPNIREFNQSLVDKYSKHIGVYDYIPNFIPNWCRDFDLLRHDIILTDRLRVCLQYEEQWFSVQWIVVSGELEIGIFTTDEQFLQVSYTINEYCHLLSIAQQDLRLGINVSDINDVNELCKEIQHRWLFDTVSYISFINCWDLDHENVVSIIPCMESFNNLHMLRIESKNMFNNLINTQGVRENPGKTIVYNVRQNIFELELYTLRDLGYKSVVDLQKWEQLQCLSLSGCEFIDLNNLILPQHCKMLILKEVKYIIWWDLSHLLKRIRPQWIINGQVKKPTKKEEEEESEWYNLYLEVVQTYQPLNFIELHNAKRVKGNLILPARLVTESRIKISNGTKVDSVLLI 389 T 0.024 DUF5420 pdbpssm F Eukaryota T 6p7x 2 B C Q6CK37_KLULA KLLA0F13816P MFDTKLFLSLPIDIRYTVYFFLGDVVQNVRPPAKSDIFNDELIAYPNIREFNQSLVDKYSKHIGVYDYIPNFIPNWCRDFDLLRHDIILTDRLRVCLQYEEQWFSVQWIVVSGELEIGIFTTDEQFLQVSYTINEYCHLLSIAQQDLRLGINVSDINDVNELCKEIQHRWLFDTVSYISFINCWDLDHENVVSIIPCMESFNNLHMLRIESKNMFNNLINTQGVRENPGKTIVYNVRQNIFELELYTLRDLGYKSVVDLQKWEQLQCLSLSGCEFIDLNNLILPQHCKMLILKEVKYIIWWDLSHLLKRIRPQWIINGQVKKPTKKEEEEESEWYNLYLEVVQTYQPLNFIELHNAKRVKGNLILPARLVTESRIKISNGTKVDSVLLI 389 T 0.024 DUF5420 pdbpssm F Eukaryota T 6p81 2 B B Griselimycin XXXXLXLXPXG 11 T 15 NOP19 pdbhh F F 6p8b 2 D L FITC-RJPXD33 TNLYMLPKWDIP 12 T 1.7 MG3 pdbhh F T 6p8d 3 E,F F,C ENV_HV1H2 HIV fusion peptide residue 512-519 AVGIGAVF 8 T 4.2 DUF3918 pdbhh T Viruses F 6p8p 1 A,B,C,D A,B,C,D CAP8_PSEAI Uncharacterized protein MTTVVSRTFRSSPHRDALQTWDAIVELLTQGKDGTARSELRAVTGVAASLIADQAPKSAPIVATCDGPRTRIYCLFDEDAIDGDDANEEVLGFEPLKGDWGMSLPCPKEQLGWVQSALKKHSSRIIARDLSQGIATQAQADAGQAMSLDLGGFLKS 156 T 0.28 DUF3944 pdbpercent F Bacteria T 6p8s 2 C,D C,D CAP8_PSEAI HORMA1 MTTVVSRTFRSSPHRDALQTWDAIVELLTQGKDGTARSELRAVTGVAASLIADQAPKSAPIVATCDGPRTRIYCLFDEDAIDGDDANEEVLGFEPLKGDWGVSLPCPKEQLGWVQSALKKHSSRIIARDLSQG 133 T 0.28 DUF3944 unppercent F Bacteria T 6p8s 3 E,F E,F Peptide 1 SNAEVMEFNP 10 T 5.6 DUF1885 pdbhh F T 6p8u 3 C C Peptide 1 SNAEVMEFNP 10 T 5.6 DUF1885 pdbhh F T 6pbs 2 AA,BA,CA,DA,EA,FA,GA,HA,IA,JA,M,N,O,P,Q,R,S,T,U,V,W,X,Y,Z Z,a,b,c,d,f,g,h,i,j,D,E,F,H,J,L,N,P,Q,R,S,U,V,X ecumicin XVXTXVXVXXVXV 13 T 19 Conotoxin pdbhh F F 6pbv 3 E,F G,I Q7K740_PLAF7 Junctional peptide XKQPADGNPDPNANPX 16 T 5.1 Nup54 pdbhh F Eukaryota T 6pbw 3 C E Q7K740_PLAF7 NPNANPNANPNA peptide XNPNANPNANPNAX 14 T 1.9 Cas_Cas7 pdbhh F Eukaryota F 6pc5 8 H C VIRGINIAMYCIN S1 XTXPXXX 7 T 260 zf-C2H2_jaz pdbhh F F 6pcw 2 B B Peptide ARKRRRHPSGPPTA 14 T 1.5 DUF3019 pdbhh F T 6pdi 2 B B Peptide ARKRRRHPSGPPTA 14 T 1.5 DUF3019 pdbhh F T 6pdk 1 A A B2FJJ6_STRMK Uncharacterized protein MGSSHHHHHHSSGLVPRGSHMATQTGRTINGHTYTDAPVDVKLGPNTFRIPANYLDSQIAPWPGEGVTLVIEWPDMKPTAPGARANPRTNDFRKEIPIRINYVDRVPVETLLSRLSSNEAITEEGSVERGDPRDRLDQRVAKPQTLGLTPYAIDEAKMVVYAKKYEARYGKPPVRNPAYERDWYIARQGDGRISSFIKCDGEEFRRDGVRLEGREVISEPGEVAAGCVHYFVDIDNKLSVSLDYKRAFLKDWKRMEEAVRDVIARTRSK 269 T 1.3 PsbP_2 unphh F Bacteria T 6pdn 2 B B peptide ARKRRRHPSGPPTA 14 T 1.5 DUF3019 pdbhh F T 6pdo 2 B B Peptide ARKRRRHPSGPPTA 14 T 1.5 DUF3019 pdbhh F T 6pdp 2 B B Peptide ARKRRRHPSGPPTA 14 T 1.5 DUF3019 pdbhh F T 6pdq 3 E,F F,G Ac-DEVD inhibitor XDEVD 5 T 140 zf-NPL4 pdbhh F F 6pdr 3 C A ENV_HV1H2 HIV fusion peptide residue 512-519 AVGIGAVF 8 T 4.2 DUF3918 pdbhh T Viruses F 6pds 3 E,F G,C ENV_HV1H2 HIV-1 fusion peptide residue 512-519 AVGIGAVF 8 T 4.2 DUF3918 pdbhh T Viruses F 6pdu 3 C C ENV_HV1H2 HIV-1 fusion peptide residue 512-519 AVGIGAVF 8 T 4.2 DUF3918 pdbhh T Viruses F 6pdw 2 F G Unknown peptide XXXXXXXXXX 10 F F F 6pdy 2 G G Unknown E. coli peptide XXXXXXXXXX 10 F F F 6pdz 2 B,D C,D NCOR2_HUMAN N-COR2,CTG REPEAT PROTEIN 26,SMAP270,SILENCING MEDIATOR OF RETINOIC ACID AND THYROID HORMONE RECEPTOR,SMRT,T3 RECEPTOR-ASSOCIATING FACTOR,TRAC,THYROID-,RETINOIC-ACID-RECEPTOR-ASSOCIATED COREPRESSOR TNMGLEAIIRKALMGKYDQWEE 22 T 3.5 RHH_7 pdbhh F Eukaryota T 6pe0 2 G G Unknown E. coli peptide XXXXXXXXXX 10 F F F 6pe4 9 P Q A0A4Z7TVW3_VIBPH Cation transporter MGSSHHHHHHSQDLDEVDAGSMVNTTQKISQSPVPDLEQFRAIAAQKDDRVISKRGEVKEPSTFHKGHKFASVSEGVLRKKYTKFFQENIKTHLDLKQALLKEEKPETALLAYSLVSPSGYRGEPLTERKILEVVSLLDEVKVDGDTYQQLKNTFDSISKDPRMQVSLENQYPGKMDGFGAQLLEMGKEKLKGSGVNAAINLALPGVGLLVATGRELHKASVNGDAEAYHHQLEQISQLPGRDQRLSMPMQQTLAIGHAMLSAEGAVGATLGMATGGLGTFGVSSVATAGVTPIAKEAIGTALTTGIISGGGFVAGQAGAYGLNNEVQDQLKQGPMSGVLPRLEISNVKGDFTFSMQEPAAVRALMAYLGPKEDTSMSSPQAPKEAQEMEAARLTLKQMLGSSPNEHLVPDVDSLLKLSDEDMPSQTESTANGAFKKLLSEDWDWLMPAVRAMDKGEAGKINEKLTYKLPLDAANGRVYLDKSPNLSDAQLDALDKLGSPSQLRLMYLAEGWI 513 T 0.14 VP4_helical unppercent F Bacteria T 6pe5 9 P,Q Q,R A0A4Z7TVW3_VIBPH Cation transporter MGSSHHHHHHSQDLDEVDAGSMVNTTQKISQSPVPDLEQFRAIAAQKDDRVISKRGEVKEPSTFHKGHKFASVSEGVLRKKYTKFFQENIKTHLDLKQALLKEEKPETALLAYSLVSPSGYRGEPLTERKILEVVSLLDEVKVDGDTYQQLKNTFDSISKDPRMQVSLENQYPGKMDGFGAQLLEMGKEKLKGSGVNAAINLALPGVGLLVATGRELHKASVNGDAEAYHHQLEQISQLPGRDQRLSMPMQQTLAIGHAMLSAEGAVGATLGMATGGLGTFGVSSVATAGVTPIAKEAIGTALTTGIISGGGFVAGQAGAYGLNNEVQDQLKQGPMSGVLPRLEISNVKGDFTFSMQEPAAVRALMAYLGPKEDTSMSSPQAPKEAQEMEAARLTLKQMLGSSPNEHLVPDVDSLLKLSDEDMPSQTESTANGAFKKLLSEDWDWLMPAVRAMDKGEAGKINEKLTYKLPLDAANGRVYLDKSPNLSDAQLDALDKLGSPSQLRLMYLAEGWI 513 T 0.14 VP4_helical unppercent F Bacteria T 6pec 3 C A ENV_HV1H2 HIV-1 fusion peptide residue 512-519 AVGIGAVF 8 T 4.2 DUF3918 pdbhh T Viruses F 6pef 3 C,F C,F ENV_HV1H2 HIV fusion peptide residue 512-519 AVGIGAVF 8 T 4.2 DUF3918 pdbhh T Viruses F 6pek 2 F G substrate peptide, TYR-GLU-TYR-GLU-TYR-GLU-TYR-GLU EYEYEYEYEY 10 T 91 DUF4595 pdbhh F F 6pel 2 B B ILENLKDVGLF G alpha peptide CT2 ILENLKDVGLF 11 T 2.1 Phage_holin_4_1 pdbhh F T 6pen 2 F G EYEYEYEYEY EYEYEYEYEY 10 T 91 DUF4595 pdbhh F F 6peu 2 E,F,G,H M,N,P,Q MCRA_METJA GLY-ARG-LEU-GLY-PHE-TYR-GLY-TYR-ASP-LEU-GLN-ASP GRLGFYGYDLQD 12 T 3.2 Anth_synt_I_N pdbhh F Archaea T 6pfj 1 A T F2RFR7_STRVP RSIG GSRPPAQRTAESALPDRARPELGALRLPELRTLRREAQSDEADLSYVRRMLQGRIDILRAELARRTDGEAPVLDRLSEILADVPSRHRSSARHVTLSTPRGEEYRRLAAEMLSEVELSDLTARTDEELHAAMGRLAGYEQQISRRRHHLQRTADDCSAEIARRYREGEAQVDDLLA 176 T 0.13 OrfB_IS605 unppercent F Bacteria T 6pfv 1 A,C,E T,B,E F2RFR7_STRVP RSIG GSRPPAQRTAESALPDRARPELGALRLPELRTLRREAQSDEADLSYVRRMLQGRIDILRAELARRTDGEAPVLDRLSEILADVPSRHRSSARHVTLSTPRGEEYRRLAAEMLSEVELSDLTARTDEELHAAMGRLAGYEQQISRRRHHLQRTADDCSAEIARRYREGEAQVDDLLA 176 T 0.13 OrfB_IS605 unppercent F Bacteria T 6pgs 2 B B GNAT2_BOVIN G alpha CT2 peptide ILENLKDVGLF 11 T 2.1 Phage_holin_4_1 pdbhh F Eukaryota T 6ph7 2 B B GNAT2_BOVIN G protein CT2 peptide ILENLKDVGLF 11 T 2.1 Phage_holin_4_1 pdbhh F Eukaryota T 6phm 1 A A D-glucagon XXXGXXXXXXXXXXXXXXXXXXXXXXXXX 29 F F F 6phn 1 A A D-glucagon L-Val23 XXXGXXXXXXXXXXXXXXXXXXVXXXXXX 29 F F F 6phq 1 A A D-glucagon D-BrPhe 6,22 GXXXXXXXXXXXXXXXXXXXXXXXXX 26 F F F 6pi2 1 A A PPM1_LIMPO Tachyplesin II RWCFRVCYRGICYRKCRX 18 T 0.51 YlaC pdbhh F Eukaryota T 6pi3 1 A A TAC3_TACGI TACHYPLESIN III KWCFRVCYRGICYRKCRX 18 T 0.66 YlaC unphh F Eukaryota T 6pin 1 A A TAC1_TACTR TACHYPLESIN I KWCFRVCYRGICYRRCRG 18 T 0.021 Myticin-prepro unp F Eukaryota T 6pio 1 A A TAC2_TACTR TACHYPLESIN II RWCFRVCYRGICYRKCRG 18 T 0.04 Myticin-prepro unppercent F Eukaryota T 6pip 1 A A TAC3_TACGI TACHYPLESIN III KWCFRVCYRGICYRKCRG 18 T 0.53 YlaC pdbhh F Eukaryota T 6pir 1 A,B,C A,B,C Q5ZT21_LEGPH MavE SNATRFERNFLINSLMFLETILSVDKKLDDAIHHFTQGQYENPRYQINSRITNADDWSKEDKLKFTSAIAEAIALVSEKYENPTSETTEQIQSARNILLDNYVPLLTANTDPENRLKSVRENSSQIRKELIAKLKDE 137 T 0.008 DUF3502 unppercent F Bacteria T 6pit 3 C,D D,C Stapled Peptide 41A XHKKLHRXLQDS 12 T 0.0031 SRC-1 pdbhh F T 6pj4 2 B B GRK_DROME Peptide aldehyde inhibitor VRM 3 T 0.11 Ycf70 unppssm F Eukaryota F 6pj5 2 B B GRK_DROME Peptide aldehyde inhibitor VRMA 4 T 0.0014 DUF3844 unphh F Eukaryota F 6pj7 2 B B GRK_DROME Peptide aldehyde inhibitor VRMA 4 T 0.0014 DUF3844 unphh F Eukaryota F 6pj8 2 B B GRK_DROME Peptide aldehyde inhibitor AVRMA 5 T 0.0014 DUF3844 unphh F Eukaryota F 6pj9 2 B B GRK_DROME Peptide aldehyde inhibitor VRMA 4 T 0.0014 DUF3844 unphh F Eukaryota F 6pja 2 B B Peptide aldehyde inhibitor VRMAA 5 T 150 SDA1 pdbhh F F 6pjp 2 B B GRK_DROME Peptide aldehyde inhibitor RKVRMAAIVFSFP 13 T 0.0014 DUF3844 unphh F Eukaryota T 6pjq 2 B B GRK_DROME Peptide aldehyde inhibitor RKVRMAAIVFSFP 13 T 0.0014 DUF3844 unphh F Eukaryota T 6pjr 2 B B GRK_DROME Peptide aldehyde inhibitor RKVRMAAIVFSFP 13 T 0.0014 DUF3844 unphh F Eukaryota T 6pju 2 B B GRK_DROME Peptide aldehyde inhibitor RKVRMAAIVFSFP 13 T 0.0014 DUF3844 unphh F Eukaryota T 6pka 2 AA,BA,O,P,Q,R,S,T,U,V,W,X,Y,Z b,c,H,J,O,P,Q,R,U,V,X,Y,Z,a OO1-WFP-SER-PRO-YCP-ALA-MP8 ureadepsipeptide XXSPXAX 7 T 430 GreA_GreB pdbhh F F 6pl5 3 C D Unknown peptide XXXXXXXXXXX 11 F F F 6pl6 3 C D Unknown peptide XXXXXXXXXXX 11 F F F 6plh 3 C C IL21R_HUMAN IL-21R,NOVEL INTERLEUKIN RECEPTOR AGPMPGSSYQGTWSEWSDPVIFQTQSEELKEHHHHHH 37 T 0.00047 fn3 unppssm F Eukaryota T 6plm 1 A,B A,B SIDJ_LEGPH SidJ protein GHVKQYYFARRGETSTHDTSLPPPVKVLSGRSIPLKEIPFEATRNELVQIYLTSIDKLIKSNKLNSIPSQQIASHYLFLRSLANSETDGIKKNQILSLAKPLGTYLASKEPHVWKMINELIEKSEYPIIHYLKNNRAHSNFMLALIHEYHKEPLTKNQSAFVQKFRDSSVFLFPNPIYTAWLAHSYDEDSSFNPMFRERLSTNFYHSTLTDNLLLRTEPKEVTLSSEHHYKKEKGPIDSSFRYQMSSDRLLRIQGRTLLFSTPQNDVVAVKVQKKGEPKSTLEEEFEMADYLLKHQRRLDVHSKLPQPLGQYSVKKSEILEISRGSLDFERFKTLIDDSKDLEVYVYKAPQSYFTYLHDKNQDLEDLTASVKTNVHDLFVLLREGIVFPQLADIFHTHFGEDEREDKGRYQALVQLLNVLQFQLGRIDKWQKAVEYVNLRSSGLADLGDSLPITSLFTSSDFTKHYFSELLTGGYHPTFFDKSSGTANSLFTGKRRLFGNYLYLNTIAEYLLVIQLTLGSYGDKVTRDMMDKPKKEAVWRELANVMFTSCAEAIHIMTGIPQSRALTLLKQRANIEKHFRQTQFWMTPDYSKLDEDTLQMEQYSIYSGEPEYEFTDKLVSGVGLSVDGVHQDLGGYNRESPLRELEKLLYATVTLIEGTMQLDKEFFKQLEQVEKILSGEIKTDANSCFEAVAQLLDLARPGCHFQKRLVLSYYEEAKLKYPSAPTDAYDSRFQVVARTNAAITIQRFWREARKNLS 757 T 0.29 IQ pdb F Bacteria T 6pm9 2 E,F,G,H E,F,G,H OGA_HUMAN OGA,BETA-N-ACETYLGLUCOSAMINIDASE,BETA-N-ACETYLHEXOSAMINIDASE,BETA-HEXOSAMINIDASE,MENINGIOMA-EXPRESSED ANTIGEN 5,N-ACETYL-BETA-D-GLUCOSAMINIDASE,N-ACETYL-BETA-GLUCOSAMINIDASE,NUCLEAR CYTOPLASMIC O-GLCNACASE AND ACETYLTRANSFERASE,NCOAT MTLEDLQLLADLFYLPYEHGPKGAQMLREFQWLRANSSVVSVNCKGKDSEKIEEWRSRAAKFEEMCGLVMGMFTRLSNCANRTILYDMYSYVWDIKSIMSMVKSFVQWLGCRSHSSAQFLIGDQEPWAFRGGLAGEFQRLLPIDGANDLFFQPHHHHHHHH 161 T 12 Pinin_SDK_memA pdbhh F Eukaryota T 6pmd 2 O,P,Q,R,S,T,U,V,W,X,Y H,J,O,P,Q,R,U,V,X,Y,Z SHV-WFP-SER-PRO-YCP-ALA-MP8 Acyldepsipeptide XXSPXAX 7 T 430 GreA_GreB pdbhh F F 6po1 3 N S substrate peptide XXXXXXXXXXXX 12 F F F 6po3 3 N S substrate peptide XXHXXXX 7 T 1800 zf-CCCH pdbhh F F 6po6 1 A A YFAThiaGlu VFAX 4 T 350 SRA pdbhh F F 6pod 3 N S substrate peptide XXXXXXXXXXXXXXXXXXX 19 F F F 6por 1 A A A0A105L2P0_9BURK Ubonodin GGDGSIAEYFNRPMHIHDWQIMDSGYYG 28 T 0.2 MmoB_DmpM pdbhh F Bacteria T 6pos 3 N S substrate peptide XXXXRXXX 8 T 3300 zf-C2H2 pdbhh F F 6pp5 2 G S substrate peptide XXXXXXXXXXXX 12 F F F 6pp6 2 G S substrate peptide XXHXXXX 7 T 1800 zf-CCCH pdbhh F F 6pp7 2 G S substrate peptide XXXXXXXXXXXXXXXXXXX 19 F F F 6pp8 2 G S substrate peptide XXXXRXXX 8 T 3300 zf-C2H2 pdbhh F F 6ppc 1 A A CG2RA_CONMI CONOPEPTIDE MI045 EDCGSDCMPCGGECCCEPNSCIDGTCHHESSPN 33 T 0.57 FeoB_associated pdbhh F Eukaryota T 6ppm 3 C,F,K,L E,I,L,F VAL-GLU-ILE-ASP Inhibitor VEID 4 T 65 DUF72 pdbhh F F 6pq5 1 A,B A,B PRIO_HUMAN PRP, ASCR, PRP27-30, PRP33-35C AGAAAA 6 T 0.85 Pam17 unphh F Eukaryota F 6pqa 1 A A PRIO_HUMAN PRP, ASCR, PRP27-30, PRP33-35C GAVVGG 6 T 0.85 Pam17 unphh F Eukaryota F 6pqf 1 A A OlvA(BCS) ACGXGXGCAKXCAASCAAS 19 T 0.096 C_tripleX pdbhh F T 6pqg 1 A A OlvA(BC) ACGXGDGCAKXCAASCAAS 19 T 0.096 C_tripleX pdbhh F T 6pqt 1 A A G0SCF1_CHATD Dynein intermediate chain protein GAHMMQARREELLAKKARLAEIKRQRELRAQQAAGRSITPSELVSPTPSRANSRREIESLIDSILSSSAGANSPRRGSRPNSVISTGELSTD 92 T 0.097 kleA_kleC pdbpssm F Eukaryota T 6psa 1 A H PIE12 D-peptide XXGXXXXXXXXXXXXXXX 18 F F F 6psh 1 A A ANTIH_BPT4 PROTEIN RI MNVDPHFDKFMESGIRHVYMLFENKSVESSEQFYSFMRTTYKNDPCSSDFECIERGAEMAQSYARIMNIKLETEKLAAALEHHHHHH 87 T 4 LT-IIB unphh T Viruses T 6psk 1 A R ANTIH_BPT4 Antiholin MNVDPHFDKFMESGIRHVYMLFENKSVESSEQFYSFMRTTYKNDPCSSDFECIERGAEMAQSYARIMNIKLETE 74 T 4 LT-IIB unphh T Viruses T 6psl 1 A A teixobactin analogue XISXXISXARI 11 T 75 V_ATPase_I pdbhh F F 6pt2 2 C,D C,D Peptide agonist KGCHM07 XXFXX 5 T 25 Tnp_DNA_bind pdbhh F F 6pth 2 B G ACE-MVA-MP8-NCZ-LEU-MP8-LEU-MVA-PRO-MLU-GLY XXXXLXLXPXG 11 T 15 NOP19 pdbhh F F 6ptr 2 B,D X,Y ACE-MVA-MP8-NZC-LEU-MP8-LEU-MVA-PRO-MLU-GLY XXXXLXLXPXG 11 T 15 NOP19 pdbhh F F 6ptv 2 C,E X,Y ACE-MVA-MP8-NZC-LEU-MP8-LEU-MVA-PRO-MLU-GLY XXXXLXLXPXG 11 T 15 NOP19 pdbhh F F 6pu1 2 B B SC24C_HUMAN SEC24-RELATED PROTEIN C GPLLPGQSFGGPSVS 15 T 5.7 LT-IIB pdbhh F Eukaryota T 6pu3 2 B B SER-THR-SER-ALA STSA 4 T 600 GrpB pdbhh F F 6pun 3 E,F E,F P91820_CAEEL LST-1 GSNSSGLRSQKLHLTYIEKNKRVRAMIPQ 29 T 0.15 N_Asn_amidohyd unp F Eukaryota T 6pv9 2 B B macrocyclic peptide XFXNPHLXWSWXXRXGX 17 T 9.9 DUF5701 pdbhh F T 6pva 2 B A AMINO GROUP-()-LYSINE-()-LYSINE-()-PROLINE-()-AMINO-ACETALDEHYDE-()-5'-{[(3S)-3-amino-3-carboxypropyl](3-aminopropyl)amino}-5'-deoxyadenosine XXPKKX 6 T 360 tRNA_synt_1c_R2 pdbhh F F 6pvb 2 B A AMINO GROUP-()-(2~{S})-2-azanylpropanal-()-ISOLEUCINE-()-ARGININE-()-LYSINE-()-PROLINE-()-AMINO-ACETALDEHYDE-()-9-(5-{[(3S)-3-amino-3-carboxypropyl](pentyl)amino}-5-deoxy-beta-L-arabinofuranosyl)-9H-purin-6-amine XXPKRIAX 8 T 3.8 HIRA_B pdbhh F T 6pw3 1 A,B,C,D C,A,B,D LARP1_HUMAN LA RIBONUCLEOPROTEIN DOMAIN FAMILY MEMBER 1 GHSGGGGGGHMQHPSHELLKENGFTQHVYHKYRRRCLNERKRLGIGQSQEMNTLFRFWSYFLEDHFNKKMYEEFKQLALEDAKEGYRYGLECLFRYYSYGLEKKFRLDIFKDFQEETVKDYEAGQLYGLEKFWAFLKYSKAKNLDIDPKLQEYLGKFRRLED 162 T 0.62 Frankia_peptide pdbpercent F Eukaryota T 6pwb 2 AA,AF,BD,D,E,FC,FE,GA,GB,H,HB,HD,HE,IA,IB,ID,JB,JD,KD,M,MB,N,O,P,QA,QC,QD,RC,RE,SC,SD,TC,UA,V,WA,WB,X,XB,XD,Y,YB,YD,Z,ZB,ZC,ZD,ZE CC,GW,EV,AL,AM,DY,GJ,CI,CU,AU,CV,FF,GL,CK,CW,FG,CX,FH,FI,BE,EA,BF,BG,BH,BZ,EK,FO,EL,HE,EM,FQ,EN,DD,BN,DF,DP,BP,DQ,GA,CA,DR,GB,CB,DS,ET,GC,GV B0STJ8_LEPBP Flagellar coiling protein A (FcpA) LTEDQKKKKKEIMEQESLWKNPDFKGYNKTFQELHQLSKTFANNQFRLALSNYQSGVNTIMKNRDWVEQYRKEEAEKKRLDEKWYWQKVDRKAREERVVYREKMKAKQDALNYFSKAINHLDEIKNPDLRERPEFKRLLSDVYRSWIMAEYDLQNLPQTIPILELYIEIDDNEKEYPAHKYLASAYSFEENMIKKTKGPDDMLFKYRYKKNVHLLRATELKYGKDSPEYKHIVNVIN 237 T 0.029 RVT_2 pdbpssm F Bacteria T 6pwb 3 AC,AD,AE,BA,BC,BE,BF,CA,CC,CE,DA,F,GC,GE,HA,KB,LB,LD,MD,ND,PE,Q,R,RA,RD,S,SE,UC,VA,VC,W,WC DT,EU,GE,CD,DU,GF,GZ,CE,DV,GG,CF,AO,DZ,GK,CJ,CY,CZ,FJ,FK,FL,HB,BI,BJ,DA,FP,BK,HF,EO,DE,EP,BO,EQ B0SR03_LEPBP Flagellar coiling protein B (FcpB) SGKSMADTEKELDDNISEVNKRLRLHTVLFKMKVRTLPHKTVLYKGKPSADGERCEAADKQEAQDNTCLHLEVFDFVGSEDGKSSKNLGAKFKKMELFFEGSNNADPDPRKEQPRNLTKIRTYIYQNNFLLEDKVISVIADVAPNGEPAHNDKIELFYQHDDYPVWGTPETPSEKGVGKYILSNVENTKSNPIRNNFKKQFYFKNLDYFDKLFTKIFDYND 221 T 0.11 HTH_1 unppssm F Bacteria T 6pwd 1 A A A0A085GHR3_9GAMM Type III effector HopBF1 SMFNVSNNVAPSRYQGPSSTSVTPNAFHDVPSLGQKVGAGSQKDVFHSRQDPRQCICLFRPGTTGSIPAEQYAQKELETTKQLKNLGFPVVDAHALVKHQGSVGVAKDFIHNALDSEDIVNNKKSLPDNLKFNKNVLEDCNAIIRRLKNLEVHIEDLQFLVDHNGHVLINDPRDVVRSSPDKSISKVNELRSHALNNLLDIDSD 204 T 0.0036 Pkinase pdbhh F Bacteria T 6pwg 1 A,B A,B A0A085GHR3_9GAMM Type III effector HopBF1 SMFNVSNNVAPSRYQGPSSTSVTPNAFHDVPSLGQKVGAGSQKDVFHSRQDPRQCICLFRPGTTGSIPAEQYAQKELETTKQLKNLGFPVVDAHALVKHQGSVGVAKDFIHNALDSEDIVNNKKSLPDNLKFNKNVLEDCNAIIRRLKNLEVHIEDLQFLVDHNGHVLINDPRDVVRSSPDKSISKVNELRSHALNNLLDIDSD 204 T 0.0036 Pkinase pdbhh F Bacteria T 6px4 1 A,C R,A ANTIH_BPT4 Antiholin MNVDPHFDKFMESGIRHVYMLFENKSVESSEQFYSFMRTTYKNDPCSSDFECIERGAEMAQSYARIMNIKLETE 74 T 4 LT-IIB unphh T Viruses T 6px6 3 C C GLTC_WHEAT DQ2.2-glut-L1 APFSEQEQPVLG 12 T 21 BAGE pdbhh F Eukaryota T 6pxe 1 A,C,E,G R,A,C,E ANTIH_BPT4 PROTEIN RI MNVDPHFDKFMESGIRHVYMLFENKSVESSEQFYSFMRTTYKNDPCSSDFECIERGAEMAQSYARIMNIKLETE 74 T 4 LT-IIB unphh T Viruses T 6pxk 2 M X unidentified alpha helical sequence XXXXXXXXXXXXXXXX 16 F F F 6pxr 3 C A TAU_HUMAN NEUROFIBRILLARY TANGLE PROTEIN,PAIRED HELICAL FILAMENT-TAU,PHF-TAU AGTYGLGD 8 T 0.16 GshA pdbhh F Eukaryota T 6pxu 2 C,D C,D GAGATGAGAGYYITPRTGAGA GAGATGAGAGYYITPRTGAGA 21 T 0.32 YtxH pdbhh F T 6py2 3 C C GLTC_WHEAT DQ2.2-glut-L1 APFSEQEQPVLG 12 T 21 BAGE pdbhh F Eukaryota T 6pyl 3 C C POL_HV1H2 Self-peptide LRN KRWIILGLNK 10 T 1 COX2-transmemb pdbhh T Viruses T 6q0b 5 E 7 anti-VP1 mAb XXXXXXXXXXXXXXXXXXXXXXCXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXCXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXCXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXCXXXXXXXXXXXXXXXXXXXCXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXCXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXCXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXCXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXCXXXXXXXXXXXXXXXXXXXC 520 T 8800 Fer4_6 pdbhh F F 6q0m 2 C,D C,D peptide YYESDWL 7 T 0.43 MIX pdbhh F T 6q0n 2 C,D C,D peptide TGYETWV 7 T 0.49 DUF4860 pdbhh F T 6q0r 3 C C DCA15_HUMAN DDB1- and CUL4-associated factor 15 MDWSHPQFEKSAVGLNDIFEAQKIEWHEGGGGSGENLYFQGGGRMEPGYVNYTKLYYVLESGEGTEPEDELEDDKISLPFVVTDLRGRNLRPMRERTAVQGQYLTVEQLTLDFEYVINEVIRHDATWGHQFCSFSDYDIVILEVCPETNQVLINIGLLLLAFPSPTEEGQLRPKTYHTSLKVAWDLNTGIFETVSVGDLTEVKGQTSGSVWSSYRKSCVDMVMKWLVPESSGRYVNRMTNEALHKGCSLKVLADSERYTWIVL 263 T 15 LuxS pdbhh F Eukaryota T 6q0u 2 C,D C,D peptide YYESGWL 7 T 1 MORN_2 pdbhh F T 6q0v 3 C C DCA15_HUMAN DDB1- and CUL4-associated factor 15 MDWSHPQFEKSAVGLNDIFEAQKIEWHEGGGGSGENLYFQGGGRMEPGYVNYTKLYYVLESGEGTEPEDELEDDKISLPFVVTDLRGRNLRPMRERTAVQGQYLTVEQLTLDFEYVINEVIRHDATWGHQFCSFSDYDIVILEVCPETNQVLINIGLLLLAFPSPTEEGQLRPKTYHTSLKVAWDLNTGIFETVSVGDLTEVKGQTSGSVWSSYRKSCVDMVMKWLVPESSGRYVNRMTNEALHKGCSLKVLADSERYTWIVL 263 T 15 LuxS pdbhh F Eukaryota T 6q0w 3 C C DCA15_HUMAN DDB1- and CUL4-associated factor 15 MDWSHPQFEKSAVGLNDIFEAQKIEWHEGGGGSGENLYFQGGGRMEPGYVNYTKLYYVLESGEGTEPEDELEDDKISLPFVVTDLRGRNLRPMRERTAVQGQYLTVEQLTLDFEYVINEVIRHDATWGHQFCSFSDYDIVILEVCPETNQVLINIGLLLLAFPSPTEEGQLRPKTYHTSLKVAWDLNTGIFETVSVGDLTEVKGQTSGSVWSSYRKSCVDMVMKWLVPESSGRYVNRMTNEALHKGCSLKVLADSERYTWIVL 263 T 15 LuxS pdbhh F Eukaryota T 6q1h 1 A,B,C,E,F,G A,B,C,E,F,G NUCC_PSEAI Bacterial protein ORF C62 MSQWSLSQLLSSLHEDIQQRLSVVRKTFGHPGTKGDASENVWIDMLDTYLPKRYQAAKAHVVDSLGNFSQQINVVVFDRQYSPFIFTYENETIIPAESVYAVFEAKQTADAGLVAYAQEKVASVRRLHRTSLPIPHAGGTYPAKPLIPILGGLLTFESEWSPALGPSMDKALNANLTEGRLDIGCVAAHGHFFYDQASGAYSYTNENKPATAFLFKLIAQLQFSGTVPMIDVEAYGQWLTK 241 T 0.37 AdoMet_Synthase pdbpercent F Bacteria T 6q1u 2 C,D C,D SFTI1_HELAN GLY-ARG-ALA-TYR-LYS-SER-LYS-PRO-PRO-ILE-ALA-PHE-PRO-ASP GRAYKSKPPIAFPD 14 T 0.052 Bowman-Birk_leg unp F Eukaryota T 6q1x 1 A A A0A0U2WJJ9_9BURK Pandonodin GVLGNDAEGITLLPLCFKPICIPTLPPLTGGHA 33 T 0.17 PEP-utilizers_C unppercent F Bacteria T 6q36 2 C,D C,D ACE-PRO-6CW-ARG-LEU-ARG-LYS-2JH-HYP-ASP-SER-PHE-ALN-LYS-GLU-PRO-NH2 XPXRLRKXPDSFXKEPX 17 T 1.8 FAM181 pdbhh F T 6q3g 1 A,AA,AH,AI,AL,AY,B,BB,BG,BH,BL,BM,BY,C,CB,CC,CD,CG,CH,CL,CM,CW,CY,D,DB,DD,DF,DG,DM,DO,DQ,DS,DX,EB,ED,EF,EG,EM,EO,EQ,ER,ES,EV,EX,FA,FC,FD,FE,FF,FI,FO,FQ,FR,FS,FV,FW,FX,GA,GC,GE,GF,GI,GO,GQ,GS,GV,GW,GX,HA,HC,HE,HH,HI,HV,HW,I,IA,IC,IE,IH,II,IL,IW,IY,J,JB,JG,JH,JL,JM,JY,K,KB,KD,KG,KH,KM,KY,L,LB,LD,LF,LG,LM,LO,LS,LX,MB,MD,MF,MG,MM,MO,MQ,MS,MU,MV,MX,NC,ND,NE,NF,NI,NN,NO,NQ,NR,NS,NU,NV,NW,NX,OC,OE,OF,OI,ON,OO,OR,OS,OU,OV,OW,OX,PC,PE,PH,PI,PN,PR,PU,PV,PW,Q,QC,QE,QH,QI,QN,QR,QW,R,RB,RG,RH,RL,RM,S,SB,SG,SH,SM,T,TB,TF,TG,TM,TO,TS,TX,UB,UF,UG,UL,UM,UO,UV,UX,VC,VE,VF,VL,VN,VO,VQ,VR,VU,VV,VW,VX,WC,WE,WF,WL,WN,WO,WQ,WR,WU,WV,WW,WX,X,XC,XE,XH,XL,XN,XQ,XR,XV,XW,Y,YC,YE,YH,YN,YQ,YR,YW,Z,ZB,ZG,ZH,ZK C1,a1,h7,T8,BD,AS,A1,C3,I7,i7,CD,KE,BS,B1,A3,b3,X4,J7,j7,DD,LE,YQ,CS,D1,B3,Y4,Y6,K7,ME,QG,AJ,QL,QR,D3,Z4,Z6,L7,NE,RG,BJ,KK,RL,CQ,RR,f1,C4,a4,C6,a6,Y8,SG,CJ,LK,SL,AQ,bQ,SR,g1,A4,A6,b6,Z8,TG,DJ,TL,BQ,cQ,TR,h1,B4,B6,C8,a8,DQ,dQ,I1,i1,D4,D6,A8,b8,KD,eQ,IS,J1,I3,Q7,B8,LD,SE,JS,K1,J3,f4,R7,D8,TE,KS,L1,K3,g4,g6,S7,UE,YG,YL,YR,L3,h4,h6,T7,VE,ZG,KJ,ZL,AP,IQ,ZR,I4,i4,I6,i6,g8,CG,aG,LJ,CL,aL,BP,JQ,CR,aR,J4,J6,j6,h8,AG,bG,AL,bL,CP,KQ,AR,bR,K4,K6,I8,i8,BG,BL,DP,LQ,BR,Q1,L4,L6,J8,j8,DG,DL,DR,R1,Q3,Y7,K8,DE,aE,S1,R3,Z7,L8,bE,T1,S3,C7,a7,cE,gG,gL,gR,T3,A7,b7,AE,dE,hG,QQ,hR,Q4,Q6,B7,BE,IG,iG,AK,IL,KP,RQ,IR,iR,R4,R6,D7,CE,JG,jG,BK,JL,LP,SQ,JR,jR,X1,S4,S6,Q8,FE,KG,CK,KL,TQ,KR,Y1,T4,T6,R8,LG,DK,LL,LR,Z1,Y3,g7,S8,AD Q859I3_9CAUD Major head protein MAQQSTKNETALLVAKSAKSALQDFNHDYSKSWTFGDKWDNSNTMFETFVNKYLFPKINETLLIDIALGNRFNWLAKEQDFIGQYSEEYVIMDTVPINMDLSKNEELMLKRNYPRMATKLYGNGIVKKQKFTLNNNDTRFNFQTLADATNYALGVYKKKISDINVLEEKEMRAMLVDYSLNQLSETNVRKATSKEDLASKVFEAILNLQNNSAKYNEVHRASGGAIGQYTTVSKLKDIVILTTDSLKSYLLDTKIANTFQIAGIDFTDHVISFDDLGGVFKVTKEFKLQNQDSIDFLRAYGDYQSQLGDTIPVGAVFTYDVSKLKEFTGNVEEIKPKSDLYAFILDINSIKYKRYTKGMLKPPFHNPEFDEVTHWIHYYSFKAISPFFNKILITDQDVNPKPEEELQE 408 T 12 ER pdbhh T Viruses T 6q3g 2 AC,AD,AF,AG,AM,AO,AP,AR,AS,AW,AX,BA,BC,BD,BF,BI,BO,BR,BS,BW,BX,CA,CF,CI,CO,CR,CS,CX,DA,DC,DH,DI,DL,DW,DY,E,EA,EC,EH,EI,EL,EW,EY,F,FB,FG,FH,FL,FM,FY,G,GB,GD,GG,GH,GL,GM,GR,GY,H,HB,HD,HF,HG,HM,HO,HQ,HR,HS,HX,IB,ID,IF,IG,IM,IO,IQ,IS,IV,IX,JA,JC,JD,JE,JF,JI,JO,JQ,JS,JV,JW,JX,KA,KC,KE,KF,KI,KL,KO,KQ,KS,KV,KW,KX,LA,LC,LE,LH,LI,LL,LV,LW,LY,M,MA,MC,ME,MH,MI,MW,MY,N,NB,NG,NH,NM,O,OB,OD,OG,OH,OM,OQ,P,PB,PD,PF,PG,PM,PO,PQ,PS,PX,QB,QD,QF,QG,QM,QO,QS,QU,QV,QX,RC,RE,RF,RI,RN,RO,RR,RS,RU,RV,RW,RX,SC,SE,SF,SI,SL,SN,SO,SR,SS,SU,SV,SW,SX,TC,TE,TH,TI,TL,TN,TR,TU,TV,TW,U,UC,UE,UH,UI,UN,UR,US,UW,V,VB,VG,VH,VM,VS,W,WB,WG,WH,WM,XB,XF,XG,XM,XO,XU,XX,YB,YF,YG,YL,YM,YO,YU,YV,YX,ZC,ZE,ZF,ZL,ZN,ZO,ZQ,ZR,ZV,ZW,ZX Z3,V4,V6,H7,JE,NG,nG,FK,NL,WQ,NR,b1,a3,W4,W6,U8,OG,GK,OL,XQ,OR,c1,X6,V8,PG,HK,PL,PR,d1,c3,k7,W8,ED,ZQ,DS,E1,e1,d3,l7,X8,FD,aQ,ES,F1,E3,M7,m7,GD,OE,FS,G1,F3,b4,N7,n7,HD,PE,MK,GS,H1,G3,c4,c6,O7,QE,UG,EJ,NK,UL,UR,H3,d4,d6,P7,RE,VG,FJ,VL,EQ,VR,j1,E4,e4,E6,e6,c8,WG,GJ,WL,FQ,fQ,WR,k1,F4,F6,f6,d8,MD,XG,HJ,XL,GQ,gQ,XR,l1,G4,G6,E8,e8,ND,HQ,hQ,LS,M1,m1,H4,H6,F8,f8,iQ,MS,N1,M3,U7,G8,WE,O1,N3,j4,V7,H8,XE,MJ,P1,O3,k4,k6,W7,YE,cG,NJ,cL,cR,P3,l4,l6,X7,ZE,dG,dL,EP,MQ,dR,M4,M6,m6,k8,EG,eG,EL,eL,FP,NQ,ER,eR,N4,N6,n6,l8,EE,FG,fG,FL,fL,GP,OQ,FR,fR,O4,O6,M8,m8,GE,GG,GL,HP,PQ,GR,U1,P4,P6,N8,n8,HG,HL,hL,HR,V1,U3,c7,O8,eE,iL,W1,V3,d7,P8,fE,W3,E7,e7,gE,kG,MP,kR,X3,F7,f7,HE,hE,lG,NP,UQ,lR,U4,U6,G7,IE,MG,mG,EK,ML,VQ,MR,mR Q859I2_9CAUD Arstotzka protein MYEGNNMRSMMGTSYEDSRLNKRTELNENMSIDTNKSEDSYGVQIHSLSKQSFTGDVEEE 60 T 0.048 DUF4958 pdb T Viruses T 6q3g 4 AN,CP,KJ,LT,MK,OA,QP,SD,WI,XS,YJ,ZT BF,BH,BA,BN,BC,B2,BI,B5,B9,BM,BB,BO Q859I5_9CAUD Lower collar protein MARYTMTLYDFIKSELIKKGFNEFVNDNKLTFYDDEFQFMQKMLKFDKDVLAIVNEKVFKGFSLKDELSDLLFKKSFTIHFLDREINRQTVEAFGMQVITVCITHEDYLNVVYSSSEVEKYLQSQGFTEHNEDTTSNTDETSNQNATSLDNSTGMTANRNAYVSLPQSEVNIDVDNTTLRFADNNTIDNGKTVNKSSNESNQNAKRNQNQKGNAKGTQFTKQYLIDNIDKAYDLRKKILNEFDKKCFLQIW 251 T 0.12 AKAP95 pdb T Viruses T 6q3g 6 AE,AJ,BE,BJ,BT,CJ,CK,CT,DJ,DK,DT,DU,EJ,EK,EN,ET,EU,FJ,FK,FN,FT,FU,GK,GN,GP,GT,GU,HK,HN,HP,HU,IN,IP,IU,JN,JP,KP,LP,OJ,PJ,PT,QJ,QK,QT,RJ,RK,RT,SA,SJ,SK,ST,TA,TJ,TK,TT,UA,UK,UP,UT,VA,VK,VP,WA,WD,WP,XA,XD,XP,YD,YP,ZD,ZP K5,G9,L5,H9,GM,I9,GB,HM,J9,HB,IM,GO,K9,IB,GF,JM,HO,L9,JB,HF,KM,IO,KB,IF,GH,LM,JO,LB,JF,HH,KO,KF,IH,LO,LF,JH,KH,LH,GA,HA,GN,IA,GC,HN,JA,HC,IN,G2,KA,IC,JN,H2,LA,JC,KN,I2,KC,GI,LN,J2,LC,HI,K2,G5,II,L2,H5,JI,I5,KI,J5,LI Q859I1_9CAUD Tail fibre protein MTEFDEIVKPDDKEETSESTEENLESTEETSESTEESTEESTEESTEDKTVETIEEENENKLEPTTTDEDSSKFDPVVLEQRIASLEQQVTTFLSSQMQQPQQVQQTQSDVTESNKEDNDYSDEELVDKLDLD 133 T 0.0095 TolA_bind_tri pdb T Viruses T 6q3g 7 DR,DV,HL,HY,IR,LQ,ML,MR,NY,QL,QQ,RY,UQ,UU,ZU IK,SP,ID,HS,OK,IJ,OD,SK,NS,SD,OJ,SS,SJ,IP,OP Head fiber protein AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 55 T 10000 zf-C2H2_4 pdbhh F F 6q3g 8 AV,BV,CV,JR,KR,LR,NL,OL,OY,PL,PY,QY,RQ,SQ,TQ PP,QP,RP,PK,QK,RK,PD,QD,PS,RD,QS,RS,PJ,QJ,RJ Inner core protein AAAAAAAAAAAAAAAAA 17 T 260 Adeno_PIX pdbhh F F 6q3k 3 C P ASN-LEU-VAL-PRO-MET-VAL-ALA-THR-VAL NLVPMVATV 9 T 15 GDH_N pdbhh F T 6q3q 2 C,D a,b HS904_ARATH GLY-SER-LYS-MET-GLU-GLU-VAL-ASP GSKMEEVD 8 T 8 TMEM191C pdbhh F Eukaryota T 6q3v 1 A A N4BP1_HUMAN N4BP1 GSDEFTAPAEKAELLEQSRGRIEGLFGVSLAVLGALGAEEPLPARIWLQLCGAQEAVHSAKEYIKGICEPELEERECYPKDMHCIFVGAESLFLKSLIQDTCADLCILDIGLLGIRGSAEAVVMARSHIQQFVKLFENKENLPSSQKESEVKREFKQFVEAHADNYTMDLLILPTSLKKELLTLTQGE 188 T 0.25 YafQ_toxin pdb F Eukaryota T 6q4q 2 C,D C,D Stapled peptide XRLYGFKWH 9 T 1.1 Speriolin_C pdbhh F T 6q5h 1 A,B A,B CC-Hex*-L24D XGELKAIAQELKAIAKELKAIAWEDKAIAQGX 32 T 1.7 DUF5320 pdbhh F T 6q5i 1 A,B A,B CC-Hex*-L24E XGELKAIAQELKAIAKELKAIAWEEKAIAQGX 32 T 2 DUF5320 pdbhh F T 6q5j 1 A,B,C,D,E,F A,B,C,D,E,F CC-Hex*-L24E XGELKAIAQELKAIAKELKAIAWEEKAIAQGX 32 T 2 DUF5320 pdbhh F T 6q5k 1 A,B A,B CC-Hex*-L24K XGELKAIAQELKAIAKELKAIAWEKKAIAQGX 32 T 0.57 DUF2312 pdbhh F T 6q5l 1 A,B B,A CC-Hex*-L24H XGELKAIAQELKAIAKELKAIAWEHKAIAQGX 32 T 0.83 Rho_N pdbpssm F T 6q5m 1 A,B A,B CC-Hex*-L24Dab XGELKAIAQELKAIAKELKAIAWEXKAIAQGX 32 T 1.7 DUF5320 pdbhh F T 6q5n 1 A,B,C,D,E,F,G,H,I,J,K,L A,B,C,D,E,F,G,H,I,J,K,L CC-Hex*-L24Nle XGELKAIAQELKAIAKELKAIAWEXKAIAQGX 32 T 0.29 DUF5320 pdbhh F T 6q5o 1 A A CC-Hex*-LL XGELKALAQELKALAKELKALAWELKALAKGX 32 T 0.038 DUF5320 pdbhh F T 6q5p 1 A,B,C,D,E,F A,B,C,D,E,F CC-Hex*-II XGEIKAIAQEIKAIAKEIKAIAWEIKAIAQGX 32 T 0.0073 DUF2312 pdb F T 6q5q 1 A A CC-Hex-KgEb XGKLEAIAQKLEAIAKKLEAIAWKLEAIAQGAGX 34 T 0.053 Matrilin_ccoil pdb F T 6q5r 1 A,B,C,D,E,F,G,H,I,J,K,L A,B,C,D,E,F,G,H,I,J,K,L CC-Hex*-LL-KgEb XGKLEALAQKLEALAKKLEALAWKLEALAQGX 32 T 0.0071 Matrilin_ccoil pdb F T 6q5s 1 A,B,C,D A,B,C,D apCC-Tet XGELEALAQELEALAKKLKALAWKLKALAQGX 32 T 0.021 DUF5320 pdbhh F T 6q5z 1 A A H72_CONVC H_VC7.2 GAMGNVNCGGVPCKFGCCREDRCREIDCD 29 T 2.7 DUF4801 pdbhh F Eukaryota T 6q67 2 B B B8R1T8_9PICO 3A GLTIEAEPTELSYQDALEMLAESKPVSTTLSFER 34 T 0.15 Toprim_C_rpt pdbpercent T Viruses T 6q6r 2 E,F,G,H E,F,G,H DHX36_HUMAN DEAD/H BOX POLYPEPTIDE 36,DEAH-BOX PROTEIN 36,G4-RESOLVASE-1,G4R1,MLE-LIKE PROTEIN 1,RNA HELICASE ASSOCIATED WITH AU-RICH ELEMENT PROTEIN HPGHLKGREIGMWYAKKQGQKNKEAERQE 29 T 1.7 PsaL pdbhh F Eukaryota T 6q6w 2 B B SB5 XXXXXXXXXXXX 12 F F F 6q6x 2 B,D,F,H E,F,G,H SB6 XXXXXXXXXXXX 12 F F F 6q76 2 B B B9WZW9_MAGOR AVR-Pia protein GPAPARFCVYYDGHLPATRVLLMYVRIGTTATITARGHEFEVEAKDQNCKVILTNGKQAPDWLAAEPY 68 T 0.012 Pirin_C unppssm F Eukaryota T 6q77 2 B B SB12 XXXXXXXX 8 F F F 6q79 2 B,H E,H SB4 XXXXXXXXXXXXX 13 F F F 6q79 3 D,F F,G SB4 incomplete XXXXXX 6 F F F 6q85 2 B,D,F,H E,F,G,H SB11 XXXXXXXXXXXX 12 F F F 6q86 2 B,D C,D SB4 XXXXXXXXXXXXX 13 F F F 6q87 2 B B SB10 XXXXXXXXXXXXX 13 F F F 6q8d 2 B B SB15 XXXXXXXXXXX 11 F F F 6q8g 2 B,D,F E,F,G SB8 XXXXXXXXXXX 11 F F F 6q8h 2 B,C B,C SB10 XXXXXXXXXXXXX 13 F F F 6q95 57 EB 6 Nascent peptide MET-ALA MA 2 T 470 DUF3652 pdbhh F F 6q95 59 GB,HB H,I 50S RIBOSOMAL PROTEIN L10 AND L11 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 140 F F F 6q97 5 E 6 Nascent peptide AAAAAAAAAAAAAAAAAAAAAAAAAAAA 28 T 1200 DUF4699 pdbhh F F 6q98 5 E 6 Nascent peptide AAAAAAAAAAAAAAAAAAAAAA 22 T 460 DUF4699 pdbhh F F 6q9a 4 D 6 Nascent peptide AAAAAAAAAAAAAAAAAAAAAAA 23 T 560 DUF4699 pdbhh F F 6q9e 9 I,S x1,x2 Cytochrome b-c1 complex subunit Rieske, mitochondrial XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 33 F F F 6qax 1 A A LEU-GLY-GLN-GLN-GLN-PRO-PHE-PRO-PRO-GLN-GLN-PRO-TYR LGQQQPFPPQQPY 13 T 22 DUF3910 pdbhh F T 6qay 1 A A TAPA_BACSU BIOFILM ASSEMBLY ACCESSORY PROTEIN TAPA AFHDIETFDVSLQTCKDFQHTDKNCHYDKRWDQSDLHISDQTDTKGTVCSPFALFAVLENTGEKLKKSKWKWELHKLENARKPLKDGNVIEKGFVSNQIGDSLYKIETKKKMKPGIYAFKVYKPAGYPANGSTFEWSEPMRLAKCDE 147 T 0.002 Herpes_PAP unp F Bacteria T 6qb0 1 A A LEU-GLY-GLN-GLN-GLN-ALA-PHE-PRO-PRO-GLN-GLN-PRO-TYR LGQQQAFPPQQPY 13 T 24 NinD pdbhh F T 6qb1 1 A A LEU-GLY-GLN-GLN-GLN-PRO-ALA-PRO-PRO-GLN-GLN-PRO-TYR LGQQQPAPPQQPY 13 T 17 Oxidored-like pdbhh F T 6qb7 1 A,B,C,D,E A,B,C,D,E KCD16_HUMAN POTASSIUM CHANNEL TETRAMERIZATION DOMAIN-CONTAINING PROTEIN 16 SMEIKQSPDEFCHSDFEDASQGSDTRICPPSSLLPADRKWGFITVGYRGSCTLGREGQADAKFRRVPRILVCGRISLAKEVFGETLNESRDPDRAPERYTSRFYLKFKHLERAFDMLSECGFHMVACNSSVTASFINQYTDDKIWSSYTEYVFYREPSRWSPS 163 T 1.1 GFRP pdbhh F Eukaryota T 6qbb 2 B,D P,Q Strep-tag II peptide XSAWSHPQFEKX 12 T 2.7 PqqA pdbhh F T 6qbx 9 I,S x1,x2 UQCRFS1N XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 78 F F F 6qc0 2 D,E,F B,D,F DTL_HUMAN DDB1- AND CUL4-ASSOCIATED FACTOR 2,LETHAL(2) DENTICLELESS PROTEIN HOMOLOG,RETINOIC ACID-REGULATED NUCLEAR MATRIX-ASSOCIATED PROTEIN SSMRKICTYFHRKS 14 T 2.7 Flexi_CP pdbhh F Eukaryota T 6qc2 37 KA,UA x1,x2 UQCRFS1N XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 78 F F F 6qc3 9 I,S x1,x2 UQCRFS1N XXXXXXXXXXXXXXXXXXXXXXXXXX 26 F F F 6qc4 9 I,S x1,x2 UQCRFS1N XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 78 F F F 6qcg 2 G,H,I,J,K,L I,G,H,J,K,L CDT1_HUMAN DOUBLE PARKED HOMOLOG,DUP MEQRRVTDFFARRR 14 T 4.3 DAO_C pdbhh F Eukaryota T 6qdr 2 B B PAK6_HUMAN PAK-5,P21-ACTIVATED KINASE 6,PAK-6 XVISSNTLRGRS 12 T 1.4 Csm1_B pdbhh F Eukaryota T 6qds 2 B B PAK6_HUMAN PAK-5,P21-ACTIVATED KINASE 6,PAK-6 XVISSNTLRGRS 12 T 1.4 Csm1_B pdbhh F Eukaryota T 6qdv 23 W R SRRM2_HUMAN 300 KDA NUCLEAR MATRIX ANTIGEN,SERINE/ARGININE-RICH SPLICING FACTOR-RELATED NUCLEAR MATRIX PROTEIN OF 300 KDA,SER/ARG-RELATED NUCLEAR MATRIX PROTEIN OF 300 KDA,SPLICING COACTIVATOR SUBUNIT SRM300,TAX-RESPONSIVE ENHANCER ELEMENT-BINDING PROTEIN 803,TAXREB803 MYNGIGLPTPRGSGTNGYVQRNLSLV 26 T 27 Antimicrobial14 pdbhh F Eukaryota T 6qet 1 A A GLL11_CHICK GAL-11,BETA-DEFENSIN 11,VITELLINE MEMBRANE OUTER LAYER PROTEIN 2,VITELLINE MEMBRANE OUTER LAYER PROTEIN II,VMOII DTTSDFHTCQDKGGHCVSPKIRCLEEQLGLCPLKRWTCCKEI 42 T 1.7E-05 DEFB136 unphh F Eukaryota T 6qev 2 B D PRO-LYS-SER-ILE-ARG-ILE-GLY-PRO-GLY-GLN-ALA-PHE-TYR-ALA-DPR PKSIRIGPGQAFYAX 15 T 0.00049 GP120 pdbhh F T 6qfk 2 B B V3-IF PKSIRIGPGQAFYAX 15 T 0.00049 GP120 pdbhh F T 6qgi 1 A A H9ABL9_9VIRU VP5 IAPLVGVGLAAGAVGVGWALREFEIVGSDAPPEGLTADALKQQVYQTAKTRKSTNASTIVDNQNILDGVKHTAYTDAKIAAIEELNAGSAESAVLDAATTEVNSYLTTVQSNFLKTWNESVAELDSILSTVVNHPDIGKGDVFLMLNGSDNTIEDLLANPSGSTDATSFTLADGTTMSVGTVEVDRGTESYYYDPMSGLVGDLGDLKNGGPTVQYDGDSLVYLNASNWKPIYDEMDTVLQNVRSGISTWVSNVYGDVQSGEIEVSDLVTPRERAAMMAQEEGMSQAIADLIALNVPVDAEREATITIQDTGATLPGTFALTDASDGPLESGKTYDPSTFSGDVYFTADMSLVEGDWTAYQSGVDGGNVTLTSEPYSGTAVELNTAANETVAVDAGNWTATGNGTWYHDVSPELETDITSIESARFLSTAEQTQYETIQLQGSFTIDKLTNTQTGEEVTATSFDSSEPHTDSNYITQEEWDQLEQQNKELIEKYEQSQSGGGLDLGQFDMFGIPGEIVAVGVAALVGLGVLGNN 533 T 0.02 Sporozoite_P67 unppssm T Viruses T 6qgl 1 A,B A,B H9ABP6_9VIRU VP5 IAPLVGYAIGAAAISAVGGIGVGWTLREFEVVGSDDPAEGLTPDVLRNQLSDSVVKRKSNNQSTMVDNQNILDGVEHTAYTEAKIAAIEELNAGSSESAVLSAANSAIDSYETTVRTNFYKSWNETVRELEAMTQTVIAHADVGLSYITDFGDPRFGNLASGTSPNTLKDTTVSMPDGTNFTLLTFRHNTGWDSGNAAYSVVEYNPKEVVTSTNSNTYNTVDGTQYMKFSEWNAVETEMDTVFQNVRNGISTWVTNVYGDVQSGAIEISDLVTPRERATMMAQEEGMSQAIADLIALNVPVDAEREATITIQDTGATLPGTFALTDSSDGPLSAGQTYDPSTFSGDVYFTADMSLVEGPWDAINSGVDGGTITITSEPYEGTAIEVTTVESETVSVPAADWTDNGDGTWSYDASGDLETTITNVDSARFVSTATETTYDTLQLKGAFTVDKLVNKQSGEEVSSTSFTSSEPQTDSNYITQDEWDQLEQQNKELIEKYEQSQSGGGLDLGGLD 512 T 0.058 B56 pdbpssm T Viruses T 6qh6 5 E P TGN38 CARGO PEPTIDE DYQRLN 6 T 30 Fer4_24 pdbhh F T 6qh7 4 E P TGN38 CARGO PEPTIDE DYQRLN 6 T 30 Fer4_24 pdbhh F T 6qhg 1 A,B A,B L_RVFV Polymerase GPGAGTVGGFIKRQQSKVVQNKVVYYGVGIWRGFMDGYQVHLEIENDIGQPPRLRNVTTNCQSSPWDLSIPIRQWAEDMGVTNNQDYSSKSSRGARYWMHSFRMQGPSKPFGCPVYIIK 119 T 0.2 UNC80 pdbpssm T Viruses T 6qik 44 RA p uL1 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 210 F F F 6qiu 2 B P ATX1_HUMAN Ataxin-1 phosphopeptide KRRWSAPESR 10 T 0.85 ACTH_domain pdbhh F Eukaryota T 6qix 1 A,B A,B P43 MLVLFFPLLLTVGLSTAGHVKCPDFGDWKPWTDCLWYPPQHMYSKLSHACGMHAHRNLTGVMDLPHGHKTPPPCGHCSFKFRCRRRPNTEGCYPLDGEVEVCHDHSDICTLPKLPHLGCGYAFINEKLKQCFTRPDTPSYVRLGYRKMFESIPKKHCIEKDGMCKCCCGDYEPNESGTECIKPPAHDCPAYGPPSEWSECLWFPLKNIVSHVYDHCHVHKEPDGYEPHSVAPANVHIPEKCGFCSFRVKCMKRDKKDGCFPLKLGKKSCGKDDCPTCGDICTLDKINGSCAFPRVMKEKIWDDFTATSKEKHMPHWKRDGYAKMLMQLPYSNCKEVGDKCKCCCHPYEPNKDGTACVVKEYCKRVHELHHHDHHGHGEEHHKSSSSESKEHHHH 394 T 29 PNISR pdbhh F T 6qj3 3 C C Brn1 XXXXXXXXXXXXXXXXXXXX 20 F F F 6qj4 4 D D Brn1 XXXXXXXXXXXXXXXXXXXXXXX 23 F F F 6qjb 1 A A EVA3_RHISA Evasin-3 FDVVSCNKNCTSGQNECPEGCFCGLLGQNKKGHCYKIIGN 40 T 0.032 Toxin_11 pdbhh F Eukaryota T 6qkf 1 A A PG4_PIG PG-4 RGGRLCYCRGWICFCVGR 18 T 0.51 PCAF_N pdbhh F Eukaryota T 6qlc 1 A A C8ZKB3_9CAUD SSDNA-BINDING RNA POLYMERASE COFACTOR DRC GPLGSMALVKKNQARNTQATDNKGASAYLNFHFPTRDGKDVRLVSLGLRADDALHMQLQEFLTVDDKGKPLSETAYAERCKKLVSRLIIKLGVTRSEEERALDL 104 T 0.086 Hemolysin_N unppssm T Viruses T 6qld 1 A C CENPC_YEAST CENP-C HOMOLOG,CONSTITUTIVE CENTROMERE-ASSOCIATED NETWORK PROTEIN MIF2,MITOTIC FIDELITY OF CHROMOSOME TRANSMISSION PROTEIN 2 LRKSTRVKVAPLQYWRNEKIVY 22 T 1.4 CENP-C_mid pdbhh F Eukaryota T 6qld 12 L U CENPU_YEAST ASSOCIATED WITH MICROTUBULES AND ESSENTIAL PROTEIN 1,CENP-U HOMOLOG,CONSTITUTIVE CENTROMERE-ASSOCIATED NETWORK PROTEIN AME1 SVTTIDVLSSLFINLFENDLIPQALKDFNKSDDDQFRKLLYKLDLRLFQTISDQMTRDLKDILDINVSNNELCYQLKQVLARKEDLNQQIISVRNEIQELKAGKDWHDLQNEQAKLNDKVKLNKRLNDLTSTLLGKYEGDRKIMSQDSEDDSIRDDSNILDIAHFVDLMDPYNGLLKKINKINENLSNEL 190 T 0.0042 DUF1640 pdb F Eukaryota T 6qld 13 M Y NKP1_YEAST CONSTITUTIVE CENTROMERE-ASSOCIATED NETWORK PROTEIN NKP1,NON-ESSENTIAL KINETOCHORE PROTEIN 1 TDTYNSISNFIENELTALLSSDDYLMDDLAGELPNEVCRLLKAQVIEKRKDAMSRGKQDLLSKEIYDNESELRASQSQQIMELVGDIPKYSLGSELRNRVEGEPQSTSIERLIEDVLKLPQMEVADEEEVEVENDLKVLSEYSNLRKDLILKCQALQIGESKLSDILSQTNSINSLTTSIKEASEDDDISEYFATYNGKLVVALEEMKLLLEEAVKTFGNSPEKREKIKKILSELKK 237 T 0.00018 FTA4 unppercent F Eukaryota T 6qle 10 J Y NKP1_YEAST CONSTITUTIVE CENTROMERE-ASSOCIATED NETWORK PROTEIN NKP1,NON-ESSENTIAL KINETOCHORE PROTEIN 1 MTDTYNSISNFIENELTALLSSDDYLMDDLAGELPNEVCRLLKAQVIEKRKDAMSRGKQDLLSKEIYDNESELRASQSQQIMELVGDIPKYSLGSELRNRVEGEPQSTSIERLIEDVLKLPQMEVADEEEVEVENDLKVLSEYSNLRKDLILKCQALQIGESKLSDILSQTNSINSLTTSIKEASEDDDISEYFATYNGKLVVALEEMKLLLEEAVKTFGNSPEKREKIKKILSELKK 238 T 0.00018 FTA4 pdbpercent F Eukaryota T 6qlf 6 F U CENPU_YEAST ASSOCIATED WITH MICROTUBULES AND ESSENTIAL PROTEIN 1,CENP-U HOMOLOG,CONSTITUTIVE CENTROMERE-ASSOCIATED NETWORK PROTEIN AME1 MDRDTKLAFRLRGSHSRRTDDIDDDVIVFKTPNAVYREENSPIQSPVQPILSSPKLANSFEFPITTNNVNAQDRHEHGYQPLDAEDYPMIDSENKSLISESPQNVRNDEDLTTRYNFDDIPIRQLSSSITSVTTIDVLSSLFINLFENDLIPQALKDFNKSDDDQFRKLLYKLDLRLFQTISDQMTRDLKDILDINVSNNELCYQLKQVLARKEDLNQQIISVRNEIQELKAGKDWHDLQNEQAKLNDKVKLNKRLNDLTSTLLGKYEGDRKIMSQDSEDDSIRDDSNILDIAHFVDLMDPYNGLLKKINKINENLSNEL 320 T 0.0047 DUF1640 unp F Eukaryota T 6qlf 7 G Y NKP1_YEAST CONSTITUTIVE CENTROMERE-ASSOCIATED NETWORK PROTEIN NKP1,NON-ESSENTIAL KINETOCHORE PROTEIN 1 MTDTYNSISNFIENELTALLSSDDYLMDDLAGELPNEVCRLLKAQVIEKRKDAMSRGKQDLLSKEIYDNESELRASQSQQIMELVGDIPKYSLGSELRNRVEGEPQSTSIERLIEDVLKLPQMEVADEEEVEVENDLKVLSEYSNLRKDLILKCQALQIGESKLSDILSQTNSINSLTTSIKEASEDDDISEYFATYNGKLVVALEEMKLLLEEAVKTFGNSPEKREKIKKILSELKK 238 T 0.00018 FTA4 pdbpercent F Eukaryota T 6qm1 1 A A DAL-PRO-GLY-CYS-LYS XPGCK 5 T 22 Gallidermin pdbhh F F 6qnn 2 B B GTSE1_HUMAN GTSE-1,PROTEIN B99 HOMOLOG SQPLIDLPLIDFCDTPEAHVAVGSESRPLIDLMTNTPDMNKNVAKPSPVVGQLIDLSSPLIQLSPE 66 T 3 Gag_p12 pdbhh F Eukaryota T 6qnp 2 E,F,G,H H,I,J,K GTSE1_HUMAN GTSE-1,PROTEIN B99 HOMOLOG LAVTPDAASQPLIDLPLIDFCDTPEAHVAVGSESRPLIDLMTNTPDMNKNVAKPSPVVGQLIDLSSP 67 T 3.9 Gag_p12 pdbhh F Eukaryota T 6qns 2 B S SYT1_HUMAN SYNAPTOTAGMIN I,SYTI,P65 GEGKEDAFSKLKEKFMNELHK 21 T 0.01 PRIMA1 unphh F Eukaryota T 6qp2 2 C C Polyhistidine tag HHHHHH 6 T 6100 zf_CCCH_4 pdbhh F F 6qpk 1 A,B A,B A0A2H1G421_ZYMTR Uncharacterized protein GHMAVVYAARCKFGNPLVQNNRITRAVCDLTNEHTTKDGSWHYVEVDNECKYLAGDNPRDQPGWAVFVKYCTYYKGVPDA 80 T 0.016 DUF5948 unppssm F Eukaryota T 6qrm 2 C,D C,D AIFM3_HUMAN APOPTOSIS-INDUCING FACTOR-LIKE PROTEIN GNCFSKRRAA 10 T 0.062 NifU unphh F Eukaryota T 6qs0 1 A A Y2503_BORBU Putative outer membrane protein BBA03 GAMGTPLEKLVSRLNLNNTEKETLTFLTNLLKEKLVDPNIGLHFKNSGGDESKIEESVQKFLSELKEDEIKDLLAKIKENKDKKEKDPEELNTYKSILASGFDGIFNQADSKTTLNKLKDTI 122 T 0.00042 RRP36 pdbpercent F Bacteria T 6qs1 2 C,D E,F Bradykinin potentiating peptide b QGLPPRPKIPP 11 T 1.5 UPF0449 pdbhh F T 6qs4 2 G S casein AAAAAAAAAAAAAAAAAAAAAAAA 24 T 670 DUF4699 pdbhh F F 6qs6 2 G S casein XXXXXXXXXXXXXXXXXXXXXXXX 24 F F F 6qs7 2 G S casein XXXXXXXXXXXXXXXXXXXXXXXX 24 F F F 6qs8 2 G S casein XXXXXXXXXXXXXXXXXXXXXXXX 24 F F F 6qsy 2 B P Strep-tag II peptide XSAWSHPQFEKX 12 T 2.7 PqqA pdbhh F T 6qsz 1 A,C,E,G,I,K,M,O A,C,E,G,I,K,M,O SIR4_YEAST SILENT INFORMATION REGULATOR 4 GPKPKNTKENLSKSSWRQEWLANLKLISVSLVDEFPSELSDSDRQIINEKMQLLKDIFANNLKSAISNNFRESDIIILKGEIEDYPMSSEIKIYYNELQNKPDAKKARFWSFMKTQRFVSNMGFDIQ 127 T 0.092 DUF6120 pdbpercent F Eukaryota T 6qsz 2 B,D,F,H,J,L,N,P B,D,F,H,J,L,N,P ESC1_YEAST ESTABLISHES SILENT CHROMATIN PROTEIN 1 IPSTDLPSDPPSDKEE 16 T 30 bCoV_SUD_M pdbhh F Eukaryota T 6qt9 1 A,C A,D Q4KPG2_9VIRU ORF 25 EYTISHTGGTLGSSKVTTAANQTSPQRETAIIGFECPRKFAEIEYVGQRDSTRFIPRTTESITGTAGDDTVVSLTANIQPVAGETAIEDQDYPVAVAYNVTQGVQVDIDAVDYAADEVTLADNPADGDTVKVWPIMGDGDVQFRLVNQFGQEEGRVYPWATPLYRWHDFPQLKRGREINLHGSVTWEENETVEVLLDAPQAITWEDSDYPEGQYVSTFEQDVEITL 226 T 6.3 DUF1344 unphh T Viruses T 6qt9 2 B,D,E,F,G,H,I,J,K,L C,E,F,G,H,I,J,K,L,M Q4KPG2_9VIRU ORF 25 YTISHTGGTLGSSKVTTAANQTSPQRETAIIGFECPRKFAEIEYVGQRDSTRFIPRTTESITGTAGDDTVVSLTANIQPVAGETAIEDQDYPVAVAYNVTQGVQVDIDAVDYAADEVTLADNPADGDTVKVWPIMGDGDVQFRLVNQFGQEEGRVYPWATPLYRWHDFPQLKRGREINLHGSVTWEENETVEVLLDAPQAITWEDSDYPEGQYVSTFEQDVEITL 225 T 6.3 DUF1344 unphh T Viruses T 6qt9 3 AA,M,N,O,P,Q,R,U,V,W,Y o,a,b,c,d,e,f,i,j,k,m Q4KPG3_9VIRU ORF 24 GNIGNLSAEKQISVYDGQPFVDEQDVPADDPNTPALTIEGPDGYVIAVDAGTPIAPEFRDSNGNKLDPSTRVIVQKCDRQGNPLGDGIVFNDTLGRFDYEQMRTDPDFMRKTAKSLMIDEREIVKVFVDIPAGANGYDADKSRLTLGDDTSDFGKAVEIVDHDELSDAETRAV 173 T 6.6 HEF_HK pdbhh T Viruses T 6qt9 4 S,X,Z g,l,n Q4KPG3_9VIRU ORF 24 SAEKQISVYDGQPFVDEQDVPADDPNTPALTIEGPDGYVIAVDAGTPIAPEFRDSNGNKLDPSTRVIVQKCDRQGNPLGDGIVFNDTLGRFDYEQMRTDPDFMRKTAKSLMIDEREIVKVFVDIPAGANGYDADKSRLTLGDDTSDFGKAVEIVDHDELSDAETRAV 167 T 6.5 HEF_HK pdbhh T Viruses T 6qt9 5 T h Q4KPG3_9VIRU ORF 24 GNIGNLSAEKQISVYDGQPFVDEQDVPADDPNTPALTIEGPDGYVIAVDAGTPIAPEFRDSNGNKLDPSTRVIVQKCDRQGNPLGDGIVFNDTLGRFDYEQMRTDPDFMRKTAKSLMIDEREIVKVFVDIPAGANGYDADKSRLTLGDDTSDFGKAVEIVDHDELSDAETRAVKA 175 T 6.6 HEF_HK pdbhh T Viruses T 6qt9 6 BA Y Q4KPF6_9VIRU ORF 31 ERLGRLVDVLETKEFGDTTVERSVTQNIDRTRTDSPNNENQPIYFSTGPEAIAVENTEEWERLDFGIVAETVNIRTTDDIDIAFADPNKNGPVIRVREGESPFTIGGDAGIESAFIWLRQAETASNTPGIQIIAF 135 T 23 MREG pdbhh T Viruses T 6qt9 7 CA X VP12 XXXXXXXXXXXXXXXXXXXXXXXX 24 F F F 6qt9 8 DA W VP13 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 80 F F F 6qtf 1 A A DCY-LEU-GLY-ALA-THR XLGAT 5 T 76 UPF0014 pdbhh F F 6qtm 1 A,B,C A,B,C SIR4_YEAST SILENT INFORMATION REGULATOR 4 GPKPKNTKENLSKSSWRQEWLANLKLISVSLVDEFPSELSDSDRQIINEKMQLLKDIFANNLKSAISNNFRESDIIILKGEIEDYPMSSEIKIYYNELQNKPDAKKARFWSFMKTQRFVSNMGFDIQ 127 T 0.092 DUF6120 pdbpercent F Eukaryota T 6qtm 2 D,E,F D,E,F O42838_SACPA Ribonuclease H ESPPSLDSSPPNTSFNA 17 T 31 TOC159_MAD pdbhh F Eukaryota T 6qto 2 B B HY5_ARATH PROTEIN LONG HYPOCOTYL 5,BZIP TRANSCRIPTION FACTOR 56,ATBZIP56 XEIRRVPEFGGY 12 T 0.2 Macoilin unppercent F Eukaryota T 6qtq 2 B B UVR8_ARATH PROTEIN UV-B RESISTANCE 8,RCC1 DOMAIN-CONTAINING PROTEIN UVR8 XRYAVVPDE 9 T 2 CFIA_Pcf11 pdbhh F Eukaryota T 6qtr 2 B B HY5_ARATH PROTEIN LONG HYPOCOTYL 5,BZIP TRANSCRIPTION FACTOR 56,ATBZIP56 XEIRRVPEFGGY 12 T 0.2 Macoilin unppercent F Eukaryota T 6qts 2 B B UVR8_ARATH PROTEIN UV-B RESISTANCE 8,RCC1 DOMAIN-CONTAINING PROTEIN UVR8 XRYAVVPDE 9 T 2 CFIA_Pcf11 pdbhh F Eukaryota T 6qtt 2 B B HYH_ARATH HY5 HOMOLOG,BZIP TRANSCRIPTION FACTOR 64,ATBZIP64 XELLMVPDMY 10 T 0.0077 CASP_C unppercent F Eukaryota T 6qtu 2 B B BBX24_ARATH SALT TOLERANCE PROTEIN XEHFIVPDLY 10 T 0.89 MRP-L27 pdbhh F Eukaryota T 6qtv 2 B B HFR1_ARATH BASIC HELIX-LOOP-HELIX PROTEIN 26,BHLH 26,PROTEIN LONG HYPOCOTYL IN FAR-RED 1,PROTEIN REDUCED PHYTOCHROME SIGNALING,REDUCED SENSITIVITY TO FAR-RED LIGHT,TRANSCRIPTION FACTOR EN 68,BHLH TRANSCRIPTION FACTOR BHLH026 XYLQIVPEIHK 11 T 0.065 MRC1 unppercent F Eukaryota T 6qtx 2 B B COL3_ARATH Zinc finger protein CONSTANS-LIKE 3 XGFGVVPSFY 10 T 3 YqzE pdbhh F Eukaryota T 6qtz 44 RA p uL1 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 210 F F F 6qu1 2 B D SMRCD_HUMAN ATP-DEPENDENT HELICASE 1,HHEL1 LSELEDLKDAKLQTLKELFPQRSDNDLLKLIESTSTMDGAIAAALLMF 48 T 0.00016 CUE pdbpssm F Eukaryota T 6qvp 1 A,B,C,D,E,F A,E,C,D,B,F H9L4G5_SALTM MEMBRANE PROTEIN,PUTATIVE INNER MEMBRANE OR EXPORTED PROTEIN KTDITSTKNELVITYHGRLRSFSEEDTYKIKAWLEDKINSNLLIEMVIPQADISFSDSLRLGYERGIILMKEIKKIYPDVVIDMSVNSAASSTTSKAIITTINK 104 T 0.15 Na_Ca_ex_C unppssm F Bacteria T 6qw4 2 B P Strep-tag II peptide XSAWSHPQFEKX 12 T 2.7 PqqA pdbhh F T 6qxb 1 A A PHE-VAL-CAP-TRP-PHE-SER-LYS-PHE-LEU-GLY-ARG-ILE-LEU-NH2 FVXWFSKFLGRILX 14 T 0.013 Mim2 pdbhh F T 6qxc 1 A A PHE-VAL-TCP-TRP-PHE-SER-LYS-PHE-LEU-GLY-ARG-ILE-LEU-NH2 FVXWFSKFLGRILX 14 T 0.013 Mim2 pdbhh F T 6qxk 2 B A Pimtide ARKRRRHPSGPPTA 14 T 1.5 DUF3019 pdbhh F T 6qyr 1 A A DAL-LEU-GLY-CYS-THR XLGCT 5 T 55 DUF5390 pdbhh F F 6qys 1 A A DBB-PRO-GLY-CYS-LYS XPGCK 5 T 22 TSLP pdbhh F F 6qyt 1 A A DAL-LEU-SER-LEU-CYS-ALA XLSLCA 6 T 54 Herpes_PAP pdbhh F F 6qyu 1 A A PHE-DHA-DAL-LEU-DHA-LEU-CYS-ALA FXXLXLCA 8 T 3.2 Voldacs pdbhh F F 6qyv 1 A A PHE-SER-DAL-LEU-ALA-LEU-CYS-ALA FSXLALCA 8 T 11 PAGK pdbhh F T 6qyw 1 A A ILE-DBU-DAL-ILE-DHA-LEU-CYS-ALA IXXIXLCA 8 T 19 Gallidermin pdbhh F F 6qz9 2 M,N,O,P,Q,R,S,T,U,V,W,X 0A,0B,0C,0D,0E,0F,0G,0H,0I,0J,0K,0L TUB11_BPPH2 GENE PRODUCT 11,GP11,LOWER COLLAR PROTEIN,PROTEIN P11 MSSYTMQLRTYIEMWSQGETGLSTAEKIEKGRPKLFDFNYPIFDESYRTIFETHFIRNFYMREIGFETEGLFKFHLETWLMINMPYFNKLFESELIKYDPLENTRVGVKSNTKNDTDRNDNRDVKQDLTSNGTSSTDAKQNDTSKTTGNEKSSGSGSITDDNFKRDLNADTADDRLQLTTKDGEGVLEYASQIEEHNENKKRDTKTSNTTDTTSNTTGTSTLDSDSKTSNKANTTSNDKLNSQINSVEDYIEDRVGKIGTQSYARLVMDYREALLRIEQRIFNEMQELFMLVY 293 T 0.018 LAP1C pdbpssm T Viruses T 6qzf 2 M,N,O,P,Q,R,S,T,U,V,W,X 0A,0B,0C,0D,0E,0F,0G,0H,0I,0J,0K,0L TUB11_BPPH2 GENE PRODUCT 11,GP11,LOWER COLLAR PROTEIN,PROTEIN P11 MSSYTMQLRTYIEMWSQGETGLSTAEKIEKGRPKLFDFNYPIFDESYRTIFETHFIRNFYMREIGFETEGLFKFHLETWLMINMPYFNKLFESELIKYDPLENTRVGVKSNTKNDTDRNDNRDVKQDLTSNGTSSTDAKQNDTSKTTGNEKSSGSGSITDDNFKRDLNADTADDRLQLTTKDGEGVLEYASQIEEHNENKKRDTKTSNTTDTTSNTTGTSTLDSDSKTSNKANTTSNDKLNSQINSVEDYIEDRVGKIGTQSYARLVMDYREALLRIEQRIFNEMQELFMLVY 293 T 0.018 LAP1C pdbpssm T Viruses T 6qzl 1 A,B,C,D,E A,B,C,D,E KCD12_HUMAN PFETIN,PREDOMINANTLY FETAL EXPRESSED T1 DOMAIN SMDGSRRSGYITIGYRGSYTIGRDAQADAKFRRVARITVCGKTSLAKEVFGDTLNESRDPDRPPERYTSRYYLKFNFLEQAFDKLSESGFHMVACSSTGTCAFASSTDQSEDKIWTSYTEYVFCRE 126 T 0.12 Baculo_VP91_N pdb F Eukaryota T 6qzr 2 I,J,K,L,M,N,O,P R,J,M,N,O,P,T,U FOXO1_HUMAN FORKHEAD BOX PROTEIN O1A,FORKHEAD IN RHABDOMYOSARCOMA RPRSCTWPLPR 11 T 5.9 PCSK9_C1 pdbhh F Eukaryota T 6qzs 2 B,D P,C FOXO1_HUMAN FOXO1 pS256 site RRRAASMDNNSK 12 T 12 Lys_export pdbhh F Eukaryota T 6qzv 2 E,F,G,H E,F,G,H MET-PRO MP 2 T 59 SAMP pdbhh F F 6qzw 2 D,E,F D,E,F MET-PRO MP 2 T 59 SAMP pdbhh F F 6qzy 2 B B A0A2R2JFI5_OMPOL ASN-GLY-PHE-PRO-TRP-MVA-ILE-MVA-VAL-GLY-PRO-ILE-GLY NGFPWXIXVGPIGVIGSVMSTE 22 T 1.3 DUF2897 pdbhh F Eukaryota T 6r00 2 B B A0A2R2JFI5_OMPOL PHE-PRO-TRP-MVA-ILE-MVA-PHE-GLY-VAL-ILE-GLY-VAL-ILE-GLY FPWXIXFGVIGVIG 14 T 0.043 DUF2897 pdbhh F Eukaryota T 6r0j 1 A A GLUP_BACSU Rhomboid family serine protease MFLLEYTYWKIAAHLVNSGYGVIQAGESDEIWLEAPDKSSHDLVRLYKHDLDFRQEMVRDIEEQAERVERVRHQLGRRRMKLLNVFFSTEAPVDDWEEIAKKTFEKGTVSVEPAIVRGTMLRDDLQAVFPSFRTEDCSEEHASFENAQMARERFLSLVLKQEEQRKTEAAVFQNGKLERENLYFQ 185 T 0.013 TBP unppercent F Bacteria T 6r0q 2 E,F,G F,E,G ALA-ALA-ALA XXX 3 F F F 6r0q 3 H H ALA-ALA-ALA-ALA XXXX 4 F F F 6r0s 2 D F CEREBLON ISOFORM 4 XXX 3 F F F 6r12 2 D F Cereblon isoform 4 XXXXX 5 F F F 6r17 1 A,B A,B SYCE2_HUMAN CENTRAL ELEMENT SYNAPTONEMAL COMPLEX PROTEIN 1 GSMGLYFSSLDSSIDILQKRAQELIENINKSRQKDHALMTNFRNSLKTKVSDLTEKLEERIYQIYNDHNKIIQEKLQEFTQKMAKISHLETELKQVCHSVETVYKDLCLQPE 112 T 0.00044 Dynamitin unppssm F Eukaryota T 6r18 2 D D Cereblon isoform 4 XXX 3 F F F 6r19 2 D F Cereblon isoform 4 XXXXXX 6 F F F 6r1g 1 A,B A,B P22_BORBU ANTIGEN IPLA7 GAMGSNEYVEEQEAENSSKPDDSKIDEHTIGHVFHAMGVVHSKKDRKSLGKNIKVFYFSEEDGHFQTIPSKENAKLIVYFYDNVYAGEAPISISGKEAFIFVGITPDFKKIINSNLHGAKSDLIGTFKDLNIKNSKLEITVDENNSDAKTFLESVNYIIDGVEKISPMLTN 171 T 0.0071 DUF4969 unp F Bacteria T 6r1r 2 B,D,F,G D,E,F,P RIR2_ECOLI RIBONUCLEOTIDE REDUCTASE R2 PROTEIN YLVGQIDSEVDTDDLSNFQL 20 T 12 UL11 pdbhh F Bacteria T 6r1u 8 L L GLYR1_HUMAN 3-HYDROXYISOBUTYRATE DEHYDROGENASE-LIKE PROTEIN,CYTOKINE-LIKE NUCLEAR FACTOR N-PAC,GLYOXYLATE REDUCTASE 1 HOMOLOG,NUCLEAR PROTEIN NP60,NUCLEAR PROTEIN OF 60 KDA PLGSPEFSERGSKSPLKRAQEQSPRKRGRPPKDEKDLTIPESSTVKGMMAGPMAAFKWQPTASEPVKDADPHFHHFLLSQTEKPAVCYQAITKKLKICEEETGSTSIQAADSTAVNGSITPTDK 124 T 0.017 AT_hook pdb F Eukaryota T 6r21 3 AA,BA,CA,DA,Y,Z c,d,e,f,a,b TUBE2_BPT7 GENE PRODUCT 12,GP12 MALISQSIKNLKGGISQQPDILRYPDQGSRQVNGWSSETEGLQKRPPLVFLNTLGDNGALGQAPYIHLINRDEHEQYYAVFTGSGIRVFDLSGNEKQVRYPNGSNYIKTANPRNDLRMVTVADYTFIVNRNVVAQKNTKSVNLPNYNPNQDGLINVRGGQYGRELIVHINGKDVAKYKIPDGSQPEHVNNTDAQWLAEELAKQMRTNLSDWTVNVGQGFIHVTAPSGQQIDSFTTKDGYADQLINPVTHYAQSFSKLPPNAPNGYMVKIVGDASKSADQYYVRYDAERKVWTETLGWNTEDQVLWETMPHALVRAADGNFDFKWLEWSPKSCGDVDTNPWPSFVGSSINDVFFFRNRLGFLSGENIILSRTAKYFNFYPASIANLSDDDPIDVAVSTNRIAILKYAVPFSEELLIWSDEAQFVLTASGTLTSKSVELNLTTQFDVQDRARPFGIGRNVYFASPRSSFTSIHRYYAVQDVSSVKNAEDITSHVPNYIPNGVFSICGSGTENFCSVLSHGDPSKIFMYKFLYLNEELRQQSWSHWDFGENVQVLACQSISSDMYVILRNEFNTFLARISFTKNAIDLQGEPYRAFMDMKIRYTIPSGTYNDDTFTTSIHIPTIYGANFGRGKITVLEPDGKITVFEQPTAGWNSDPWLRLSGNLEGRMVYIGFNINFVYEFSKFLIKQTADDGSTSTEDIGRLQLRRAWVNYENSGTFDIYVENQSSNWKYTMAGARLGSNTLRAGRLNLGTGQYRFPVVGNAKFNTVYILSDETTPLNIIGCGWEGNYLRRSSGI 794 T 0.031 MelC1 pdbpercent T Viruses T 6r25 2 B L GLYR1_HUMAN NPAC DPHFHHFLLSQT 12 T 11 GREB1 unppercent F Eukaryota T 6r28 1 A A peptide P7 PSIXHVHRPDWPCWYR 16 T 1.9 DUF4172 pdbhh F T 6r2i 2 B B KASH5_HUMAN KASH5 GSMTSGTSGTSGGPSPPPTWPHLQLCYLQPPPV 33 T 0.28 B56 pdbhh F Eukaryota T 6r2l 3 C C AN30A_HUMAN SER-LEU-SER-LYS-ILE-LEU-ASP-THR-VAL SLSKILDTV 9 T 1.6E-05 SCP-1 unphh F Eukaryota T 6r4w 2 C,D C,D ACE-GLU-VAL-ASN-ALA-PRO-VAL-LPD XEVNAPVX 8 T 0.17 HIF-1a_CTAD pdbhh F T 6r4x 2 C,D C,D ACE-GLU-VAL-ASN-PRO-ALA-VAL-LPD XEVNPAVX 8 T 7.3 DUF5974 pdbhh F T 6r4y 2 C,D D,E ACE-GLU-VAL-ASN-PRO XEVNP 5 T 27 Ins134_P3_kin pdbhh F F 6r4z 2 C,D D,E ACE-GLU-VAL-ASN-PRO XEVNP 5 T 27 Ins134_P3_kin pdbhh F F 6r50 2 C,D C,D ACE-GLU-VAL-ASN-ALA-PRO-VAL-LPD XEVNAPVX 8 T 0.17 HIF-1a_CTAD pdbhh F T 6r51 2 B,E,F E,D,F ACE-SER-LEU-ARG-PRO-ALA-PRO-LPD XSLRPAPX 8 T 0.28 RhoGEF67_u2 pdbhh F T 6r57 2 C,D C,D ACE-GLU-VAL-ASN-PRO-PRO-VAL-LPD XEVNPPVX 8 T 5.3 HIF-1a_CTAD pdbhh F F 6r58 2 E,F,G,H E,F,I,G ACE-GLU-VAL-ASN-ALA-PRO-VAL-LPD XEVNAPVX 8 T 0.17 HIF-1a_CTAD pdbhh F T 6r59 2 C,D E,C ACE-GLU-VAL-ALA-PRO-PRO-VAL-LPD XEVAPPVX 8 T 51 DUF2315 pdbhh F F 6r5b 2 C,D C,E ACE-GLU-VAL-ASN-PRO-PRO-VAL-LPD XEVNPPVX 8 T 5.3 HIF-1a_CTAD pdbhh F F 6r5c 2 C,D C,D ACE-GLU-VAL-ASN-PRO-PRO-VAL-LPD XEVNPPVX 8 T 5.3 HIF-1a_CTAD pdbhh F F 6r5g 2 B B PDCD1_HUMAN ITSM EQTEXATIVFP 11 T 0.002 DUF4578 pdbhh F Eukaryota T 6r5l 2 B P P53_HUMAN p53pT387 KLMFKTEGPDSD 12 T 44 CopB pdbhh F Eukaryota T 6r5m 1 A,B,C A,B,C 3SX_DENPO Dendroaspis polylepis MT9 TICHIQISKTHGILKTCEENSCYKMSVRGWIIGRGCGCPSAVRPRQVQCCTSDKCNY 57 F F Eukaryota T 6r5q 2 B 1 XBP1_HUMAN XBP-1,TAX-RESPONSIVE ELEMENT-BINDING PROTEIN 5,TREB-5 DPVPYQPPFLCQWGRHQPAWKPLM 24 T 30 Prok-E2_E pdbhh F Eukaryota T 6r5w 1 A,B,C A,B,C Q8W5Z4_9CAUD Gp15 protein NPAQFAQKTVLDEHVNDADIHVTATDKTNWNAKETVEGAQAKADKALADAKAFFELSSSVQSVTLTPKNGFVASQPLIARYIKFGNRFLVIVSGIVGKGTGSGTGICATLPTFLAPDASWNKLYSAAQQSTAASNQANIYLSVSADINIVGVGSVDVNTGLDGIIYLTKEVTT 173 T 0.00052 Caudo_bapla_RBP pdbhh T Viruses T 6r64 1 A,B A,B MCRA_ECOLI ECOKMCRA GHHHHHHEFMHVFDNNGIELKAECSIGEEDGVYGLILESWGPGDRNKDYNIALDYIIERLVDSGVSQVVVYLASSSVRKHMHSLDERKIHPGEYFTLIGNSPRDIRLKMCGYQAYFSRTGRKEIPSGNRTKRILINVPGIYSDSFWASIIRG 152 T 0.3 DUF5616 pdbpssm F Bacteria T 6r6a 2 B D Pepstatin XVVXAX 6 T 1700 FAM60A pdbhh F F 6r6g 2 B 1 XBP1_HUMAN XBP-1,TAX-RESPONSIVE ELEMENT-BINDING PROTEIN 5,TREB-5 DPVPYQPPFLCQWGRHQPAWKPLM 24 T 30 Prok-E2_E pdbhh F Eukaryota T 6r6g 92 NC AG Signal sequence (HR2) LLLLLLLLLLLL 12 T 22 DUF316 pdbhh F F 6r6p 48 VA 1 XBP1_HUMAN XBP-1,TAX-RESPONSIVE ELEMENT-BINDING PROTEIN 5,TREB-5 DPVPYQPPFLCQWGRHQPAWKPLM 24 T 30 Prok-E2_E pdbhh F Eukaryota T 6r7l 1 A G SecG XXXXXXXXXXXXXXXXXXXXXX 22 F F F 6r7o 1 A,B A,B STAG1_HUMAN SCC3 HOMOLOG 1,STROMAL ANTIGEN 1 SMSPNGNLIRMLVLFFLESELHEHAAYLVDSLWESSQELLKDWECMTELLLEEPVQGEEAMSDRQESALIELMVCTIRQAAEAHPPVGRGTGKRVLTAKERKTQIDDRNKLTEHFIITLPMLLSKYSADAEKVANLLQIPQYFDLEIYSTGRMEKHLDALLKQIKFVVEKHVESDVLEACSKTYSILCSEEYTIQNRVDIARSQLIDEFVDRFNHSVEDLLQEGEEADDDDIYNVLSTLKRLTSFHNAHDLTKWDLFGNCYRLLKTGIEHGAMPEQIVVQALQCSHYSILWQLVKITDGSPSKEDLLVLRKTVKSFLAVCQQCLSNVNTPVKEQAFMLLCDLLMIFSHQLMTGGREGLQPLVFNPDTGLQSELLSFVMDHVFIDQDEENQSMEGDEEDEANKIEALHKRRNLLAAFSKLIIYDIVDMHAAADIFKHYMKYYNDYGDIIKETLSKTRQID 459 T 0.0089 Cnd3 pdb F Eukaryota T 6r7q 15 O 1 XBP1_HUMAN XBP-1,TAX-RESPONSIVE ELEMENT-BINDING PROTEIN 5,TREB-5 DPVPYQPPFLCQWGRHQPAWKPLM 24 T 30 Prok-E2_E pdbhh F Eukaryota T 6r7w 2 B B G8ULV2_TANFA Putative lipoprotein KRDPVYFIKLSTIK 14 T 6.8 DUF4786 pdbhh F Bacteria T 6r8i 2 B B SER-LEU-PRO-PHE-THR-PHE-LYS-VAL-PRO-ALA-PRO-PRO-PRO-SER-LEU-PRO-PRO-SER SLPFTFKVPAPPPSLPPSW 19 T 0.29 LDB19 pdbhh F T 6r8k 2 B C C4B8B8_MAGOR AVR-PIK PROTEIN ( PIKMPROTEIN,PIKP PROTEIN ),AVRPI7 PROTEIN METGNKYIEKRAIDLSRERDPNFFDHPGIPVPECFWFMFKNNVRQDAGTCYSSWKMDMKVGPNWVHIKSDDNCNLSGDFPPGWIVLGKKRPGF 93 T 0.1 TMEM18 unp F Eukaryota T 6r8m 2 C,F C,G C4B8C2_MAGOR AVR-Pik protein ETGNKYIEKRAIDLSRERDPNFFDNPGIPVPECFWFMFKNNVRQDAGTCYSSWKMDMKVGPNWVHIKSDDNCNLSGDFPPGWIVLGKKRPGF 92 T 0.086 TMEM18 unp F Eukaryota T 6r9z 1 A A ACE-GLU-VAL-ASN-PRO-PRO-VAL-PRO-NH2 XEVNPPVPX 9 T 6.7 HIF-1a_CTAD pdbhh F F 6rao 9 J J Q6HAC7_9GAMM Afp12 MSKENALFPAVKDAIVFDALWQQAHEKVTALSGEIWTDTGDHDPGVTLLQSATWNCSDLSYRASLSLNDLLTHQDQSTLFPEEFGPEQVLTCNTVTAEDYRRALLDVHSSDIQALDTPEQDFLFSDVSLTQEPKEHRFHWWYNAEKREYSFRKPTDSGEVNELKLRGNLWLSLVPTRYTQSLSPENLAAVEQCLAEFLAAHRNLGEVVSRITWLQPATFSPRMTIELADNIGDINQVAAQIYQVTDAFLRPAVARYTTEQRRALGDADDAIFEGPRLKHGWQQTAPSQITSGGYVLNLGPLVNLLLAIPGVASLSTLSVDKGDGHITAVTGDNLRWQVADGYYPLLWGAPPLSLLAGDDSPLTLVSKGGIRNTLESEAMAGYLTQADLIVTTPTVLPAGRFRDQTLYIPIGQRQPECYALQQPDTVIDDQTRAVHQFLLPVDQLLADGTAELAQLPTLLAFKNRGDAIRGTRWPYTNAMVQQAIHQPYAKTLEAIAQQDAAIFTQDKQPVGGNYARELDFLQYLLGYFGTQRAALPLTLDLPDFLATQRAYLAQQPALGYDRINIRIDQVSALQKRIAARIGLDSICFADNPDLGQLPFYLIEHRQLLPQTPDSTFDSEQTPSGFAVAEPDITLTQAGSVGKVVQGQLIDLIAIEGGSRLHVSRLLVIKAEGDSFTVSTENSQQLHNTLSRLETAWASHNLRWQNSNVWLQDMDYRLNYAEAKLQPANPQQRLLASNAQSPYPAMVSVGDGIVLRPAGLQFYMPGANATRAATLDADWQLAATVKAVDPIAGTLLIEKAAGSTEDFPSAESSFRYQWAFSQANYATTDRFSFVVSAVLNRRLIENPNIVPEQLVAWIQETIMAEFPAHVSLINHWLDDATFNNFGVTYSRWQNSGMPLGDDAFALMQILTLGHLPVTQLDIGLMRIATEEQRTEVIGDGSQWHEDVILREELFYVPKDVQTTL 963 T 0.059 DUF276 pdbhh F Bacteria T 6rd2 2 C,D C,D THR-GLU-ASP-GLU-NLW TEDEX 5 T 77 Chs7 pdbhh F F 6rd4 1 A 0 A0A5H1ZR95_9CHLO ASA-10: Polytomella F-ATP synthase associated subunit 10 MSYSAYFAKAGFQFPAGLSALVAGIVALNVCTGRPTKGTKEISNAEYNATPIGYLQSPDQHPTAFPKVPGMKDVHGSPHHHH 82 T 0.1 DUF6506 pdb F Eukaryota T 6rd4 2 B 1 Q85JD5_9CHLO ATP synthase associated protein ASA1 MMRAAQKAKQELPATVLTQTRSYLAPLRSDFTEEITAPKVASASNLVNEWNNKKQATENLMKLLQAYKDIGDAKSEPLLKNHNPRTFEDRDYPVPDFRTQNLKAGDVPKFFDTVISTRASAAIASKDKFWAGRKTEAEAASAKASAAFPRVAVPEWKKGKTVSIENLNTVTDKYAAALVPKRKLALPVLPEGVKKAVEDFAASVGQAKNASEVSELLAKSLAEKAVVTEGGKVVEGFSYVSKAVAAKVIATRRAEVHERLLKLWAKRLLVSPELAIVPLNEFDAQLASKFEGISPKYQELLSAVAQGNKTFAQRLNSSPAFSSFLLKREKAESEVPPSELELEAAQKAAELEDPEVALRTLLGPQMEALGASDLLLSEQIRVITEHRYTPDRLQYKEGMKLADKIAAQEAALKEELKVIYGDNVDVKHFQASPRTPVQQLFDSLKNAAANKERAAKEAAAAASPYLAYAVTKKQEVQADPSNIPFDEVLYPQLSEELLELELSDIREDEIALEKAEEEELWLLTLTQQFKHIQKHFGIDLPHSVVAHMDPLLIKKIDWETTNALEDFDITLDDMGAEDAKEQWGAENLSHHFLPLIRYRRDLARKNGDRYGPDLVNGN 618 T 0.21 Lipoprotein_10 pdb F Eukaryota T 6rd4 4 D 3 K0J903_9CHLO Mitochondrial F1F0 ATP synthase associated 32 kDa protein MRQASRLALSIRQAGNVEAASAVPAMTRQFSAPGSHEHHETPLSKVMPTVVSIPRKVACLALGATKKVVCGLASSGPSQNLVSTFANKVIVEENLVNVAEIDVPFWSYWLSSAGFTSKDAFVKFAEAVKPKVAALSTSDITNLTVAFKRANYYDKDLFTGIEANVSANFTKFETEQLLQIVATFDAFNHSSVAFLDDVADSITYCNHYLAPVRAGADELATLLTYYAKNGHERADLLATVARGFSEVSLGKLSAAQRKDTVLSALKAFQTFGFYPESIEAVIGAALVSPAEYSAEELKEVEAVKVAAENALGGEFVLIQEGAHGH 325 T 0.042 FAST_1 pdbpercent F Eukaryota T 6rd4 7 G 6 D7P897_9CHLO Mitochondrial ATP synthase subunit ASA6 MMLRTLTRSSAVAGQAVRLFKTSAAAAEGNSVAGIIKSVNETSGANLLSSLKTIKAQAAPIYPAAASSTGYSTQAKIALFGALSWILYRADGQSKAHEWIVDLNLNVLQAAWLISFSSLIPFRAVYFAFRGMAPATASTLNGLKTFSSISL 151 T 0.00025 NrsF pdbpercent F Eukaryota T 6rd4 8 H 7 D8V7I2_9CHLO Mitochondrial ATP synthase associated protein ASA7 MSSVRAGVEAGRRDLTTFTFSGLQDAPVAALSGSIKLNVAAKAGKAEVTVAAGAAKAATQVSAAALRKLSGSKISLAEVARISVLHSSIQNYLLSLSNERYQLLSQWPDFTTMYGKDFYYRAHPEDLKKFYDAADEYYKLYETVTEFDSLSALASQVVPNYAARRRSTVHPAIGSTVADGAFTNFLLSKQ 190 T 0.96 DUF4296 pdbhh F Eukaryota T 6rd4 9 I 8 D8V7I7_9CHLO Mitochondrial ATP synthase subunit ASA8 MVLGEVYLKDILRTPPTGAIPANVPHPFQTSFYTYATKKLIPRHWYLLGGFTFTITLYGILDGLRDSGKKKAYDEAIHAGKTPYTAGGH 89 T 0.3 Selenoprotein_S pdbhh F Eukaryota T 6rd4 10 J 9 A0A5H1ZR73_9CHLO Mitochondrial ATP synthase subunit ASA9 MAVTSFLGKAFEKYFYDFSAYEQFGLNRFLSSKGQYVALRHVGFVMVGVNVLLAANFPFNPPFPTIGMCPAGWEGTWVCQADKAKALEMYKEWKKSN 97 T 0.39 1-cysPrx_C pdbhh F Eukaryota T 6rd5 1 A 0 A0A5H1ZR95_9CHLO ASA-10: Polytomella F-ATP synthase associated subunit 10 MSYSAYFAKAGFQFPAGLSALVAGIVALNVCTGRPTKGTKEISNAEYNATPIGYLQSPDQHPTAFPKVPGMKDVHGSPHHHH 82 T 0.1 DUF6506 pdb F Eukaryota T 6rd5 2 B 1 Q85JD5_9CHLO ATP synthase associated protein ASA1 MMRAAQKAKQELPATVLTQTRSYLAPLRSDFTEEITAPKVASASNLVNEWNNKKQATENLMKLLQAYKDIGDAKSEPLLKNHNPRTFEDRDYPVPDFRTQNLKAGDVPKFFDTVISTRASAAIASKDKFWAGRKTEAEAASAKASAAFPRVAVPEWKKGKTVSIENLNTVTDKYAAALVPKRKLALPVLPEGVKKAVEDFAASVGQAKNASEVSELLAKSLAEKAVVTEGGKVVEGFSYVSKAVAAKVIATRRAEVHERLLKLWAKRLLVSPELAIVPLNEFDAQLASKFEGISPKYQELLSAVAQGNKTFAQRLNSSPAFSSFLLKREKAESEVPPSELELEAAQKAAELEDPEVALRTLLGPQMEALGASDLLLSEQIRVITEHRYTPDRLQYKEGMKLADKIAAQEAALKEELKVIYGDNVDVKHFQASPRTPVQQLFDSLKNAAANKERAAKEAAAAASPYLAYAVTKKQEVQADPSNIPFDEVLYPQLSEELLELELSDIREDEIALEKAEEEELWLLTLTQQFKHIQKHFGIDLPHSVVAHMDPLLIKKIDWETTNALEDFDITLDDMGAEDAKEQWGAENLSHHFLPLIRYRRDLARKNGDRYGPDLVNGN 618 T 0.21 Lipoprotein_10 pdb F Eukaryota T 6rd5 3 C 3 K0J903_9CHLO Mitochondrial F1F0 ATP synthase associated 32 kDa protein MRQASRLALSIRQAGNVEAASAVPAMTRQFSAPGSHEHHETPLSKVMPTVVSIPRKVACLALGATKKVVCGLASSGPSQNLVSTFANKVIVEENLVNVAEIDVPFWSYWLSSAGFTSKDAFVKFAEAVKPKVAALSTSDITNLTVAFKRANYYDKDLFTGIEANVSANFTKFETEQLLQIVATFDAFNHSSVAFLDDVADSITYCNHYLAPVRAGADELATLLTYYAKNGHERADLLATVARGFSEVSLGKLSAAQRKDTVLSALKAFQTFGFYPESIEAVIGAALVSPAEYSAEELKEVEAVKVAAENALGGEFVLIQEGAHGH 325 T 0.042 FAST_1 pdbpercent F Eukaryota T 6rd5 5 E 6 D7P897_9CHLO Mitochondrial ATP synthase subunit ASA6 MMLRTLTRSSAVAGQAVRLFKTSAAAAEGNSVAGIIKSVNETSGANLLSSLKTIKAQAAPIYPAAASSTGYSTQAKIALFGALSWILYRADGQSKAHEWIVDLNLNVLQAAWLISFSSLIPFRAVYFAFRGMAPATASTLNGLKTFSSISL 151 T 0.00025 NrsF pdbpercent F Eukaryota T 6rd5 6 F 8 D8V7I7_9CHLO Mitochondrial ATP synthase subunit ASA8 MVLGEVYLKDILRTPPTGAIPANVPHPFQTSFYTYATKKLIPRHWYLLGGFTFTITLYGILDGLRDSGKKKAYDEAIHAGKTPYTAGGH 89 T 0.3 Selenoprotein_S pdbhh F Eukaryota T 6rd5 7 G 9 A0A5H1ZR73_9CHLO Mitochondrial ATP synthase subunit ASA9 MAVTSFLGKAFEKYFYDFSAYEQFGLNRFLSSKGQYVALRHVGFVMVGVNVLLAANFPFNPPFPTIGMCPAGWEGTWVCQADKAKALEMYKEWKKSN 97 T 0.39 1-cysPrx_C pdbhh F Eukaryota T 6rd6 3 C 7 D8V7I2_9CHLO Mitochondrial ATP synthase associated protein ASA7 MSSVRAGVEAGRRDLTTFTFSGLQDAPVAALSGSIKLNVAAKAGKAEVTVAAGAAKAATQVSAAALRKLSGSKISLAEVARISVLHSSIQNYLLSLSNERYQLLSQWPDFTTMYGKDFYYRAHPEDLKKFYDAADEYYKLYETVTEFDSLSALASQVVPNYAARRRSTVHPAIGSTVADGAFTNFLLSKQ 190 T 0.96 DUF4296 pdbhh F Eukaryota T 6rd7 1 A 0 A0A5H1ZR95_9CHLO ASA-10: Polytomella F-ATP synthase associated subunit 10 MSYSAYFAKAGFQFPAGLSALVAGIVALNVCTGRPTKGTKEISNAEYNATPIGYLQSPDQHPTAFPKVPGMKDVHGSPHHHH 82 T 0.1 DUF6506 pdb F Eukaryota T 6rd7 2 B 1 Q85JD5_9CHLO ATP synthase associated protein ASA1 MMRAAQKAKQELPATVLTQTRSYLAPLRSDFTEEITAPKVASASNLVNEWNNKKQATENLMKLLQAYKDIGDAKSEPLLKNHNPRTFEDRDYPVPDFRTQNLKAGDVPKFFDTVISTRASAAIASKDKFWAGRKTEAEAASAKASAAFPRVAVPEWKKGKTVSIENLNTVTDKYAAALVPKRKLALPVLPEGVKKAVEDFAASVGQAKNASEVSELLAKSLAEKAVVTEGGKVVEGFSYVSKAVAAKVIATRRAEVHERLLKLWAKRLLVSPELAIVPLNEFDAQLASKFEGISPKYQELLSAVAQGNKTFAQRLNSSPAFSSFLLKREKAESEVPPSELELEAAQKAAELEDPEVALRTLLGPQMEALGASDLLLSEQIRVITEHRYTPDRLQYKEGMKLADKIAAQEAALKEELKVIYGDNVDVKHFQASPRTPVQQLFDSLKNAAANKERAAKEAAAAASPYLAYAVTKKQEVQADPSNIPFDEVLYPQLSEELLELELSDIREDEIALEKAEEEELWLLTLTQQFKHIQKHFGIDLPHSVVAHMDPLLIKKIDWETTNALEDFDITLDDMGAEDAKEQWGAENLSHHFLPLIRYRRDLARKNGDRYGPDLVNGN 618 T 0.21 Lipoprotein_10 pdb F Eukaryota T 6rd7 3 C 3 K0J903_9CHLO Mitochondrial F1F0 ATP synthase associated 32 kDa protein MRQASRLALSIRQAGNVEAASAVPAMTRQFSAPGSHEHHETPLSKVMPTVVSIPRKVACLALGATKKVVCGLASSGPSQNLVSTFANKVIVEENLVNVAEIDVPFWSYWLSSAGFTSKDAFVKFAEAVKPKVAALSTSDITNLTVAFKRANYYDKDLFTGIEANVSANFTKFETEQLLQIVATFDAFNHSSVAFLDDVADSITYCNHYLAPVRAGADELATLLTYYAKNGHERADLLATVARGFSEVSLGKLSAAQRKDTVLSALKAFQTFGFYPESIEAVIGAALVSPAEYSAEELKEVEAVKVAAENALGGEFVLIQEGAHGH 325 T 0.042 FAST_1 pdbpercent F Eukaryota T 6rd7 5 E 6 D7P897_9CHLO Mitochondrial ATP synthase subunit ASA6 MMLRTLTRSSAVAGQAVRLFKTSAAAAEGNSVAGIIKSVNETSGANLLSSLKTIKAQAAPIYPAAASSTGYSTQAKIALFGALSWILYRADGQSKAHEWIVDLNLNVLQAAWLISFSSLIPFRAVYFAFRGMAPATASTLNGLKTFSSISL 151 T 0.00025 NrsF pdbpercent F Eukaryota T 6rd7 6 F 8 D8V7I7_9CHLO Mitochondrial ATP synthase subunit ASA8 MVLGEVYLKDILRTPPTGAIPANVPHPFQTSFYTYATKKLIPRHWYLLGGFTFTITLYGILDGLRDSGKKKAYDEAIHAGKTPYTAGGH 89 T 0.3 Selenoprotein_S pdbhh F Eukaryota T 6rd7 7 G 9 A0A5H1ZR73_9CHLO ASA-9: Polytomella F-ATP synthase associated subunit 9 MAVTSFLGKAFEKYFYDFSAYEQFGLNRFLSSKGQYVALRHVGFVMVGVNVLLAANFPFNPPFPTIGMCPAGWEGTWVCQADKAKALEMYKEWKKSN 97 T 0.39 1-cysPrx_C pdbhh F Eukaryota T 6rd8 1 A 0 A0A5H1ZR95_9CHLO ASA-10: Polytomella F-ATP synthase associated subunit 10 MSYSAYFAKAGFQFPAGLSALVAGIVALNVCTGRPTKGTKEISNAEYNATPIGYLQSPDQHPTAFPKVPGMKDVHGSPHHHH 82 T 0.1 DUF6506 pdb F Eukaryota T 6rd8 2 B 1 Q85JD5_9CHLO ATP synthase associated protein ASA1 MMRAAQKAKQELPATVLTQTRSYLAPLRSDFTEEITAPKVASASNLVNEWNNKKQATENLMKLLQAYKDIGDAKSEPLLKNHNPRTFEDRDYPVPDFRTQNLKAGDVPKFFDTVISTRASAAIASKDKFWAGRKTEAEAASAKASAAFPRVAVPEWKKGKTVSIENLNTVTDKYAAALVPKRKLALPVLPEGVKKAVEDFAASVGQAKNASEVSELLAKSLAEKAVVTEGGKVVEGFSYVSKAVAAKVIATRRAEVHERLLKLWAKRLLVSPELAIVPLNEFDAQLASKFEGISPKYQELLSAVAQGNKTFAQRLNSSPAFSSFLLKREKAESEVPPSELELEAAQKAAELEDPEVALRTLLGPQMEALGASDLLLSEQIRVITEHRYTPDRLQYKEGMKLADKIAAQEAALKEELKVIYGDNVDVKHFQASPRTPVQQLFDSLKNAAANKERAAKEAAAAASPYLAYAVTKKQEVQADPSNIPFDEVLYPQLSEELLELELSDIREDEIALEKAEEEELWLLTLTQQFKHIQKHFGIDLPHSVVAHMDPLLIKKIDWETTNALEDFDITLDDMGAEDAKEQWGAENLSHHFLPLIRYRRDLARKNGDRYGPDLVNGN 618 T 0.21 Lipoprotein_10 pdb F Eukaryota T 6rd8 3 C 3 K0J903_9CHLO Mitochondrial F1F0 ATP synthase associated 32 kDa protein MRQASRLALSIRQAGNVEAASAVPAMTRQFSAPGSHEHHETPLSKVMPTVVSIPRKVACLALGATKKVVCGLASSGPSQNLVSTFANKVIVEENLVNVAEIDVPFWSYWLSSAGFTSKDAFVKFAEAVKPKVAALSTSDITNLTVAFKRANYYDKDLFTGIEANVSANFTKFETEQLLQIVATFDAFNHSSVAFLDDVADSITYCNHYLAPVRAGADELATLLTYYAKNGHERADLLATVARGFSEVSLGKLSAAQRKDTVLSALKAFQTFGFYPESIEAVIGAALVSPAEYSAEELKEVEAVKVAAENALGGEFVLIQEGAHGH 325 T 0.042 FAST_1 pdbpercent F Eukaryota T 6rd8 5 E 6 D7P897_9CHLO Mitochondrial ATP synthase subunit ASA6 MMLRTLTRSSAVAGQAVRLFKTSAAAAEGNSVAGIIKSVNETSGANLLSSLKTIKAQAAPIYPAAASSTGYSTQAKIALFGALSWILYRADGQSKAHEWIVDLNLNVLQAAWLISFSSLIPFRAVYFAFRGMAPATASTLNGLKTFSSISL 151 T 0.00025 NrsF pdbpercent F Eukaryota T 6rd8 6 F 8 D8V7I7_9CHLO Mitochondrial ATP synthase subunit ASA8 MVLGEVYLKDILRTPPTGAIPANVPHPFQTSFYTYATKKLIPRHWYLLGGFTFTITLYGILDGLRDSGKKKAYDEAIHAGKTPYTAGGH 89 T 0.3 Selenoprotein_S pdbhh F Eukaryota T 6rd8 7 G 9 A0A5H1ZR73_9CHLO ASA-9: Polytomella F-ATP synthase associated subunit 9 MAVTSFLGKAFEKYFYDFSAYEQFGLNRFLSSKGQYVALRHVGFVMVGVNVLLAANFPFNPPFPTIGMCPAGWEGTWVCQADKAKALEMYKEWKKSN 97 T 0.39 1-cysPrx_C pdbhh F Eukaryota T 6rd9 1 A 0 A0A5H1ZR95_9CHLO ASA-10: Polytomella F-ATP synthase associated subunit 10 MSYSAYFAKAGFQFPAGLSALVAGIVALNVCTGRPTKGTKEISNAEYNATPIGYLQSPDQHPTAFPKVPGMKDVHGSPHHHH 82 T 0.1 DUF6506 pdb F Eukaryota T 6rd9 2 B 1 Q85JD5_9CHLO ATP synthase associated protein ASA1 MMRAAQKAKQELPATVLTQTRSYLAPLRSDFTEEITAPKVASASNLVNEWNNKKQATENLMKLLQAYKDIGDAKSEPLLKNHNPRTFEDRDYPVPDFRTQNLKAGDVPKFFDTVISTRASAAIASKDKFWAGRKTEAEAASAKASAAFPRVAVPEWKKGKTVSIENLNTVTDKYAAALVPKRKLALPVLPEGVKKAVEDFAASVGQAKNASEVSELLAKSLAEKAVVTEGGKVVEGFSYVSKAVAAKVIATRRAEVHERLLKLWAKRLLVSPELAIVPLNEFDAQLASKFEGISPKYQELLSAVAQGNKTFAQRLNSSPAFSSFLLKREKAESEVPPSELELEAAQKAAELEDPEVALRTLLGPQMEALGASDLLLSEQIRVITEHRYTPDRLQYKEGMKLADKIAAQEAALKEELKVIYGDNVDVKHFQASPRTPVQQLFDSLKNAAANKERAAKEAAAAASPYLAYAVTKKQEVQADPSNIPFDEVLYPQLSEELLELELSDIREDEIALEKAEEEELWLLTLTQQFKHIQKHFGIDLPHSVVAHMDPLLIKKIDWETTNALEDFDITLDDMGAEDAKEQWGAENLSHHFLPLIRYRRDLARKNGDRYGPDLVNGN 618 T 0.21 Lipoprotein_10 pdb F Eukaryota T 6rd9 4 D 3 K0J903_9CHLO Mitochondrial F1F0 ATP synthase associated 32 kDa protein MRQASRLALSIRQAGNVEAASAVPAMTRQFSAPGSHEHHETPLSKVMPTVVSIPRKVACLALGATKKVVCGLASSGPSQNLVSTFANKVIVEENLVNVAEIDVPFWSYWLSSAGFTSKDAFVKFAEAVKPKVAALSTSDITNLTVAFKRANYYDKDLFTGIEANVSANFTKFETEQLLQIVATFDAFNHSSVAFLDDVADSITYCNHYLAPVRAGADELATLLTYYAKNGHERADLLATVARGFSEVSLGKLSAAQRKDTVLSALKAFQTFGFYPESIEAVIGAALVSPAEYSAEELKEVEAVKVAAENALGGEFVLIQEGAHGH 325 T 0.042 FAST_1 pdbpercent F Eukaryota T 6rd9 7 G 6 D7P897_9CHLO Mitochondrial ATP synthase subunit ASA6 MMLRTLTRSSAVAGQAVRLFKTSAAAAEGNSVAGIIKSVNETSGANLLSSLKTIKAQAAPIYPAAASSTGYSTQAKIALFGALSWILYRADGQSKAHEWIVDLNLNVLQAAWLISFSSLIPFRAVYFAFRGMAPATASTLNGLKTFSSISL 151 T 0.00025 NrsF pdbpercent F Eukaryota T 6rd9 8 H 7 D8V7I2_9CHLO Mitochondrial ATP synthase associated protein ASA7 MSSVRAGVEAGRRDLTTFTFSGLQDAPVAALSGSIKLNVAAKAGKAEVTVAAGAAKAATQVSAAALRKLSGSKISLAEVARISVLHSSIQNYLLSLSNERYQLLSQWPDFTTMYGKDFYYRAHPEDLKKFYDAADEYYKLYETVTEFDSLSALASQVVPNYAARRRSTVHPAIGSTVADGAFTNFLLSKQ 190 T 0.96 DUF4296 pdbhh F Eukaryota T 6rd9 9 I 8 D8V7I7_9CHLO Mitochondrial ATP synthase subunit ASA8 MVLGEVYLKDILRTPPTGAIPANVPHPFQTSFYTYATKKLIPRHWYLLGGFTFTITLYGILDGLRDSGKKKAYDEAIHAGKTPYTAGGH 89 T 0.3 Selenoprotein_S pdbhh F Eukaryota T 6rd9 10 J 9 A0A5H1ZR73_9CHLO ASA-9: Polytomella F-ATP synthase associated subunit 9 MAVTSFLGKAFEKYFYDFSAYEQFGLNRFLSSKGQYVALRHVGFVMVGVNVLLAANFPFNPPFPTIGMCPAGWEGTWVCQADKAKALEMYKEWKKSN 97 T 0.39 1-cysPrx_C pdbhh F Eukaryota T 6rda 1 A 0 A0A5H1ZR95_9CHLO ASA-10: Polytomella F-ATP synthase associated subunit 10 MSYSAYFAKAGFQFPAGLSALVAGIVALNVCTGRPTKGTKEISNAEYNATPIGYLQSPDQHPTAFPKVPGMKDVHGSPHHHH 82 T 0.1 DUF6506 pdb F Eukaryota T 6rda 2 B 1 Q85JD5_9CHLO ATP synthase associated protein ASA1 MMRAAQKAKQELPATVLTQTRSYLAPLRSDFTEEITAPKVASASNLVNEWNNKKQATENLMKLLQAYKDIGDAKSEPLLKNHNPRTFEDRDYPVPDFRTQNLKAGDVPKFFDTVISTRASAAIASKDKFWAGRKTEAEAASAKASAAFPRVAVPEWKKGKTVSIENLNTVTDKYAAALVPKRKLALPVLPEGVKKAVEDFAASVGQAKNASEVSELLAKSLAEKAVVTEGGKVVEGFSYVSKAVAAKVIATRRAEVHERLLKLWAKRLLVSPELAIVPLNEFDAQLASKFEGISPKYQELLSAVAQGNKTFAQRLNSSPAFSSFLLKREKAESEVPPSELELEAAQKAAELEDPEVALRTLLGPQMEALGASDLLLSEQIRVITEHRYTPDRLQYKEGMKLADKIAAQEAALKEELKVIYGDNVDVKHFQASPRTPVQQLFDSLKNAAANKERAAKEAAAAASPYLAYAVTKKQEVQADPSNIPFDEVLYPQLSEELLELELSDIREDEIALEKAEEEELWLLTLTQQFKHIQKHFGIDLPHSVVAHMDPLLIKKIDWETTNALEDFDITLDDMGAEDAKEQWGAENLSHHFLPLIRYRRDLARKNGDRYGPDLVNGN 618 T 0.21 Lipoprotein_10 pdb F Eukaryota T 6rda 4 D 3 K0J903_9CHLO Mitochondrial F1F0 ATP synthase associated 32 kDa protein MRQASRLALSIRQAGNVEAASAVPAMTRQFSAPGSHEHHETPLSKVMPTVVSIPRKVACLALGATKKVVCGLASSGPSQNLVSTFANKVIVEENLVNVAEIDVPFWSYWLSSAGFTSKDAFVKFAEAVKPKVAALSTSDITNLTVAFKRANYYDKDLFTGIEANVSANFTKFETEQLLQIVATFDAFNHSSVAFLDDVADSITYCNHYLAPVRAGADELATLLTYYAKNGHERADLLATVARGFSEVSLGKLSAAQRKDTVLSALKAFQTFGFYPESIEAVIGAALVSPAEYSAEELKEVEAVKVAAENALGGEFVLIQEGAHGH 325 T 0.042 FAST_1 pdbpercent F Eukaryota T 6rda 7 G 6 D7P897_9CHLO Mitochondrial ATP synthase subunit ASA6 MMLRTLTRSSAVAGQAVRLFKTSAAAAEGNSVAGIIKSVNETSGANLLSSLKTIKAQAAPIYPAAASSTGYSTQAKIALFGALSWILYRADGQSKAHEWIVDLNLNVLQAAWLISFSSLIPFRAVYFAFRGMAPATASTLNGLKTFSSISL 151 T 0.00025 NrsF pdbpercent F Eukaryota T 6rda 8 H 7 D8V7I2_9CHLO Mitochondrial ATP synthase associated protein ASA7 MSSVRAGVEAGRRDLTTFTFSGLQDAPVAALSGSIKLNVAAKAGKAEVTVAAGAAKAATQVSAAALRKLSGSKISLAEVARISVLHSSIQNYLLSLSNERYQLLSQWPDFTTMYGKDFYYRAHPEDLKKFYDAADEYYKLYETVTEFDSLSALASQVVPNYAARRRSTVHPAIGSTVADGAFTNFLLSKQ 190 T 0.96 DUF4296 pdbhh F Eukaryota T 6rda 9 I 8 D8V7I7_9CHLO Mitochondrial ATP synthase subunit ASA8 MVLGEVYLKDILRTPPTGAIPANVPHPFQTSFYTYATKKLIPRHWYLLGGFTFTITLYGILDGLRDSGKKKAYDEAIHAGKTPYTAGGH 89 T 0.3 Selenoprotein_S pdbhh F Eukaryota T 6rda 10 J 9 A0A5H1ZR73_9CHLO ASA-9: Polytomella F-ATP synthase associated subunit 9 MAVTSFLGKAFEKYFYDFSAYEQFGLNRFLSSKGQYVALRHVGFVMVGVNVLLAANFPFNPPFPTIGMCPAGWEGTWVCQADKAKALEMYKEWKKSN 97 T 0.39 1-cysPrx_C pdbhh F Eukaryota T 6rdc 1 A 0 A0A5H1ZR95_9CHLO ASA-10: Polytomella F-ATP synthase associated subunit 10 MSYSAYFAKAGFQFPAGLSALVAGIVALNVCTGRPTKGTKEISNAEYNATPIGYLQSPDQHPTAFPKVPGMKDVHGSPHHHH 82 T 0.1 DUF6506 pdb F Eukaryota T 6rdc 2 B 1 Q85JD5_9CHLO ATP synthase associated protein ASA1 MMRAAQKAKQELPATVLTQTRSYLAPLRSDFTEEITAPKVASASNLVNEWNNKKQATENLMKLLQAYKDIGDAKSEPLLKNHNPRTFEDRDYPVPDFRTQNLKAGDVPKFFDTVISTRASAAIASKDKFWAGRKTEAEAASAKASAAFPRVAVPEWKKGKTVSIENLNTVTDKYAAALVPKRKLALPVLPEGVKKAVEDFAASVGQAKNASEVSELLAKSLAEKAVVTEGGKVVEGFSYVSKAVAAKVIATRRAEVHERLLKLWAKRLLVSPELAIVPLNEFDAQLASKFEGISPKYQELLSAVAQGNKTFAQRLNSSPAFSSFLLKREKAESEVPPSELELEAAQKAAELEDPEVALRTLLGPQMEALGASDLLLSEQIRVITEHRYTPDRLQYKEGMKLADKIAAQEAALKEELKVIYGDNVDVKHFQASPRTPVQQLFDSLKNAAANKERAAKEAAAAASPYLAYAVTKKQEVQADPSNIPFDEVLYPQLSEELLELELSDIREDEIALEKAEEEELWLLTLTQQFKHIQKHFGIDLPHSVVAHMDPLLIKKIDWETTNALEDFDITLDDMGAEDAKEQWGAENLSHHFLPLIRYRRDLARKNGDRYGPDLVNGN 618 T 0.21 Lipoprotein_10 pdb F Eukaryota T 6rdc 4 D 3 K0J903_9CHLO Mitochondrial F1F0 ATP synthase associated 32 kDa protein MRQASRLALSIRQAGNVEAASAVPAMTRQFSAPGSHEHHETPLSKVMPTVVSIPRKVACLALGATKKVVCGLASSGPSQNLVSTFANKVIVEENLVNVAEIDVPFWSYWLSSAGFTSKDAFVKFAEAVKPKVAALSTSDITNLTVAFKRANYYDKDLFTGIEANVSANFTKFETEQLLQIVATFDAFNHSSVAFLDDVADSITYCNHYLAPVRAGADELATLLTYYAKNGHERADLLATVARGFSEVSLGKLSAAQRKDTVLSALKAFQTFGFYPESIEAVIGAALVSPAEYSAEELKEVEAVKVAAENALGGEFVLIQEGAHGH 325 T 0.042 FAST_1 pdbpercent F Eukaryota T 6rdc 7 G 6 D7P897_9CHLO Mitochondrial ATP synthase subunit ASA6 MMLRTLTRSSAVAGQAVRLFKTSAAAAEGNSVAGIIKSVNETSGANLLSSLKTIKAQAAPIYPAAASSTGYSTQAKIALFGALSWILYRADGQSKAHEWIVDLNLNVLQAAWLISFSSLIPFRAVYFAFRGMAPATASTLNGLKTFSSISL 151 T 0.00025 NrsF pdbpercent F Eukaryota T 6rdc 8 H 7 D8V7I2_9CHLO Mitochondrial ATP synthase associated protein ASA7 MSSVRAGVEAGRRDLTTFTFSGLQDAPVAALSGSIKLNVAAKAGKAEVTVAAGAAKAATQVSAAALRKLSGSKISLAEVARISVLHSSIQNYLLSLSNERYQLLSQWPDFTTMYGKDFYYRAHPEDLKKFYDAADEYYKLYETVTEFDSLSALASQVVPNYAARRRSTVHPAIGSTVADGAFTNFLLSKQ 190 T 0.96 DUF4296 pdbhh F Eukaryota T 6rdc 9 I 8 D8V7I7_9CHLO Mitochondrial ATP synthase subunit ASA8 MVLGEVYLKDILRTPPTGAIPANVPHPFQTSFYTYATKKLIPRHWYLLGGFTFTITLYGILDGLRDSGKKKAYDEAIHAGKTPYTAGGH 89 T 0.3 Selenoprotein_S pdbhh F Eukaryota T 6rdc 10 J 9 A0A5H1ZR73_9CHLO ASA-9: Polytomella F-ATP synthase associated subunit 9 MAVTSFLGKAFEKYFYDFSAYEQFGLNRFLSSKGQYVALRHVGFVMVGVNVLLAANFPFNPPFPTIGMCPAGWEGTWVCQADKAKALEMYKEWKKSN 97 T 0.39 1-cysPrx_C pdbhh F Eukaryota T 6rdd 1 A 0 A0A5H1ZR95_9CHLO ASA-10: Polytomella F-ATP synthase associated subunit 10 MSYSAYFAKAGFQFPAGLSALVAGIVALNVCTGRPTKGTKEISNAEYNATPIGYLQSPDQHPTAFPKVPGMKDVHGSPHHHH 82 T 0.1 DUF6506 pdb F Eukaryota T 6rdd 2 B 1 Q85JD5_9CHLO ATP synthase associated protein ASA1 MMRAAQKAKQELPATVLTQTRSYLAPLRSDFTEEITAPKVASASNLVNEWNNKKQATENLMKLLQAYKDIGDAKSEPLLKNHNPRTFEDRDYPVPDFRTQNLKAGDVPKFFDTVISTRASAAIASKDKFWAGRKTEAEAASAKASAAFPRVAVPEWKKGKTVSIENLNTVTDKYAAALVPKRKLALPVLPEGVKKAVEDFAASVGQAKNASEVSELLAKSLAEKAVVTEGGKVVEGFSYVSKAVAAKVIATRRAEVHERLLKLWAKRLLVSPELAIVPLNEFDAQLASKFEGISPKYQELLSAVAQGNKTFAQRLNSSPAFSSFLLKREKAESEVPPSELELEAAQKAAELEDPEVALRTLLGPQMEALGASDLLLSEQIRVITEHRYTPDRLQYKEGMKLADKIAAQEAALKEELKVIYGDNVDVKHFQASPRTPVQQLFDSLKNAAANKERAAKEAAAAASPYLAYAVTKKQEVQADPSNIPFDEVLYPQLSEELLELELSDIREDEIALEKAEEEELWLLTLTQQFKHIQKHFGIDLPHSVVAHMDPLLIKKIDWETTNALEDFDITLDDMGAEDAKEQWGAENLSHHFLPLIRYRRDLARKNGDRYGPDLVNGN 618 T 0.21 Lipoprotein_10 pdb F Eukaryota T 6rdd 4 D 3 K0J903_9CHLO Mitochondrial F1F0 ATP synthase associated 32 kDa protein MRQASRLALSIRQAGNVEAASAVPAMTRQFSAPGSHEHHETPLSKVMPTVVSIPRKVACLALGATKKVVCGLASSGPSQNLVSTFANKVIVEENLVNVAEIDVPFWSYWLSSAGFTSKDAFVKFAEAVKPKVAALSTSDITNLTVAFKRANYYDKDLFTGIEANVSANFTKFETEQLLQIVATFDAFNHSSVAFLDDVADSITYCNHYLAPVRAGADELATLLTYYAKNGHERADLLATVARGFSEVSLGKLSAAQRKDTVLSALKAFQTFGFYPESIEAVIGAALVSPAEYSAEELKEVEAVKVAAENALGGEFVLIQEGAHGH 325 T 0.042 FAST_1 pdbpercent F Eukaryota T 6rdd 7 G 6 D7P897_9CHLO Mitochondrial ATP synthase subunit ASA6 MMLRTLTRSSAVAGQAVRLFKTSAAAAEGNSVAGIIKSVNETSGANLLSSLKTIKAQAAPIYPAAASSTGYSTQAKIALFGALSWILYRADGQSKAHEWIVDLNLNVLQAAWLISFSSLIPFRAVYFAFRGMAPATASTLNGLKTFSSISL 151 T 0.00025 NrsF pdbpercent F Eukaryota T 6rdd 8 H 7 D8V7I2_9CHLO Mitochondrial ATP synthase associated protein ASA7 MSSVRAGVEAGRRDLTTFTFSGLQDAPVAALSGSIKLNVAAKAGKAEVTVAAGAAKAATQVSAAALRKLSGSKISLAEVARISVLHSSIQNYLLSLSNERYQLLSQWPDFTTMYGKDFYYRAHPEDLKKFYDAADEYYKLYETVTEFDSLSALASQVVPNYAARRRSTVHPAIGSTVADGAFTNFLLSKQ 190 T 0.96 DUF4296 pdbhh F Eukaryota T 6rdd 9 I 8 D8V7I7_9CHLO Mitochondrial ATP synthase subunit ASA8 MVLGEVYLKDILRTPPTGAIPANVPHPFQTSFYTYATKKLIPRHWYLLGGFTFTITLYGILDGLRDSGKKKAYDEAIHAGKTPYTAGGH 89 T 0.3 Selenoprotein_S pdbhh F Eukaryota T 6rdd 10 J 9 A0A5H1ZR73_9CHLO Mitochondrial ATP synthase subunit ASA9 MAVTSFLGKAFEKYFYDFSAYEQFGLNRFLSSKGQYVALRHVGFVMVGVNVLLAANFPFNPPFPTIGMCPAGWEGTWVCQADKAKALEMYKEWKKSN 97 T 0.39 1-cysPrx_C pdbhh F Eukaryota T 6rdf 1 A 0 A0A5H1ZR95_9CHLO ASA-10: Polytomella F-ATP synthase associated subunit 10 MSYSAYFAKAGFQFPAGLSALVAGIVALNVCTGRPTKGTKEISNAEYNATPIGYLQSPDQHPTAFPKVPGMKDVHGSPHHHH 82 T 0.1 DUF6506 pdb F Eukaryota T 6rdf 2 B 1 Q85JD5_9CHLO ATP synthase associated protein ASA1 MMRAAQKAKQELPATVLTQTRSYLAPLRSDFTEEITAPKVASASNLVNEWNNKKQATENLMKLLQAYKDIGDAKSEPLLKNHNPRTFEDRDYPVPDFRTQNLKAGDVPKFFDTVISTRASAAIASKDKFWAGRKTEAEAASAKASAAFPRVAVPEWKKGKTVSIENLNTVTDKYAAALVPKRKLALPVLPEGVKKAVEDFAASVGQAKNASEVSELLAKSLAEKAVVTEGGKVVEGFSYVSKAVAAKVIATRRAEVHERLLKLWAKRLLVSPELAIVPLNEFDAQLASKFEGISPKYQELLSAVAQGNKTFAQRLNSSPAFSSFLLKREKAESEVPPSELELEAAQKAAELEDPEVALRTLLGPQMEALGASDLLLSEQIRVITEHRYTPDRLQYKEGMKLADKIAAQEAALKEELKVIYGDNVDVKHFQASPRTPVQQLFDSLKNAAANKERAAKEAAAAASPYLAYAVTKKQEVQADPSNIPFDEVLYPQLSEELLELELSDIREDEIALEKAEEEELWLLTLTQQFKHIQKHFGIDLPHSVVAHMDPLLIKKIDWETTNALEDFDITLDDMGAEDAKEQWGAENLSHHFLPLIRYRRDLARKNGDRYGPDLVNGN 618 T 0.21 Lipoprotein_10 pdb F Eukaryota T 6rdf 4 D 3 K0J903_9CHLO Mitochondrial F1F0 ATP synthase associated 32 kDa protein MRQASRLALSIRQAGNVEAASAVPAMTRQFSAPGSHEHHETPLSKVMPTVVSIPRKVACLALGATKKVVCGLASSGPSQNLVSTFANKVIVEENLVNVAEIDVPFWSYWLSSAGFTSKDAFVKFAEAVKPKVAALSTSDITNLTVAFKRANYYDKDLFTGIEANVSANFTKFETEQLLQIVATFDAFNHSSVAFLDDVADSITYCNHYLAPVRAGADELATLLTYYAKNGHERADLLATVARGFSEVSLGKLSAAQRKDTVLSALKAFQTFGFYPESIEAVIGAALVSPAEYSAEELKEVEAVKVAAENALGGEFVLIQEGAHGH 325 T 0.042 FAST_1 pdbpercent F Eukaryota T 6rdf 7 G 6 D7P897_9CHLO Mitochondrial ATP synthase subunit ASA6 MMLRTLTRSSAVAGQAVRLFKTSAAAAEGNSVAGIIKSVNETSGANLLSSLKTIKAQAAPIYPAAASSTGYSTQAKIALFGALSWILYRADGQSKAHEWIVDLNLNVLQAAWLISFSSLIPFRAVYFAFRGMAPATASTLNGLKTFSSISL 151 T 0.00025 NrsF pdbpercent F Eukaryota T 6rdf 8 H 7 D8V7I2_9CHLO Mitochondrial ATP synthase associated protein ASA7 MSSVRAGVEAGRRDLTTFTFSGLQDAPVAALSGSIKLNVAAKAGKAEVTVAAGAAKAATQVSAAALRKLSGSKISLAEVARISVLHSSIQNYLLSLSNERYQLLSQWPDFTTMYGKDFYYRAHPEDLKKFYDAADEYYKLYETVTEFDSLSALASQVVPNYAARRRSTVHPAIGSTVADGAFTNFLLSKQ 190 T 0.96 DUF4296 pdbhh F Eukaryota T 6rdf 9 I 8 D8V7I7_9CHLO Mitochondrial ATP synthase subunit ASA8 MVLGEVYLKDILRTPPTGAIPANVPHPFQTSFYTYATKKLIPRHWYLLGGFTFTITLYGILDGLRDSGKKKAYDEAIHAGKTPYTAGGH 89 T 0.3 Selenoprotein_S pdbhh F Eukaryota T 6rdf 10 J 9 A0A5H1ZR73_9CHLO ASA-9: Polytomella F-ATP synthase associated subunit 9 MAVTSFLGKAFEKYFYDFSAYEQFGLNRFLSSKGQYVALRHVGFVMVGVNVLLAANFPFNPPFPTIGMCPAGWEGTWVCQADKAKALEMYKEWKKSN 97 T 0.39 1-cysPrx_C pdbhh F Eukaryota T 6rdh 1 A 0 A0A5H1ZR95_9CHLO ASA-10: Polytomella F-ATP synthase associated subunit 10 MSYSAYFAKAGFQFPAGLSALVAGIVALNVCTGRPTKGTKEISNAEYNATPIGYLQSPDQHPTAFPKVPGMKDVHGSPHHHH 82 T 0.1 DUF6506 pdb F Eukaryota T 6rdh 2 B 1 Q85JD5_9CHLO ATP synthase associated protein ASA1 MMRAAQKAKQELPATVLTQTRSYLAPLRSDFTEEITAPKVASASNLVNEWNNKKQATENLMKLLQAYKDIGDAKSEPLLKNHNPRTFEDRDYPVPDFRTQNLKAGDVPKFFDTVISTRASAAIASKDKFWAGRKTEAEAASAKASAAFPRVAVPEWKKGKTVSIENLNTVTDKYAAALVPKRKLALPVLPEGVKKAVEDFAASVGQAKNASEVSELLAKSLAEKAVVTEGGKVVEGFSYVSKAVAAKVIATRRAEVHERLLKLWAKRLLVSPELAIVPLNEFDAQLASKFEGISPKYQELLSAVAQGNKTFAQRLNSSPAFSSFLLKREKAESEVPPSELELEAAQKAAELEDPEVALRTLLGPQMEALGASDLLLSEQIRVITEHRYTPDRLQYKEGMKLADKIAAQEAALKEELKVIYGDNVDVKHFQASPRTPVQQLFDSLKNAAANKERAAKEAAAAASPYLAYAVTKKQEVQADPSNIPFDEVLYPQLSEELLELELSDIREDEIALEKAEEEELWLLTLTQQFKHIQKHFGIDLPHSVVAHMDPLLIKKIDWETTNALEDFDITLDDMGAEDAKEQWGAENLSHHFLPLIRYRRDLARKNGDRYGPDLVNGN 618 T 0.21 Lipoprotein_10 pdb F Eukaryota T 6rdh 4 D 3 K0J903_9CHLO Mitochondrial F1F0 ATP synthase associated 32 kDa protein MRQASRLALSIRQAGNVEAASAVPAMTRQFSAPGSHEHHETPLSKVMPTVVSIPRKVACLALGATKKVVCGLASSGPSQNLVSTFANKVIVEENLVNVAEIDVPFWSYWLSSAGFTSKDAFVKFAEAVKPKVAALSTSDITNLTVAFKRANYYDKDLFTGIEANVSANFTKFETEQLLQIVATFDAFNHSSVAFLDDVADSITYCNHYLAPVRAGADELATLLTYYAKNGHERADLLATVARGFSEVSLGKLSAAQRKDTVLSALKAFQTFGFYPESIEAVIGAALVSPAEYSAEELKEVEAVKVAAENALGGEFVLIQEGAHGH 325 T 0.042 FAST_1 pdbpercent F Eukaryota T 6rdh 7 G 6 D7P897_9CHLO Mitochondrial ATP synthase subunit ASA6 MMLRTLTRSSAVAGQAVRLFKTSAAAAEGNSVAGIIKSVNETSGANLLSSLKTIKAQAAPIYPAAASSTGYSTQAKIALFGALSWILYRADGQSKAHEWIVDLNLNVLQAAWLISFSSLIPFRAVYFAFRGMAPATASTLNGLKTFSSISL 151 T 0.00025 NrsF pdbpercent F Eukaryota T 6rdh 8 H 7 D8V7I2_9CHLO Mitochondrial ATP synthase associated protein ASA7 MSSVRAGVEAGRRDLTTFTFSGLQDAPVAALSGSIKLNVAAKAGKAEVTVAAGAAKAATQVSAAALRKLSGSKISLAEVARISVLHSSIQNYLLSLSNERYQLLSQWPDFTTMYGKDFYYRAHPEDLKKFYDAADEYYKLYETVTEFDSLSALASQVVPNYAARRRSTVHPAIGSTVADGAFTNFLLSKQ 190 T 0.96 DUF4296 pdbhh F Eukaryota T 6rdh 9 I 8 D8V7I7_9CHLO Mitochondrial ATP synthase subunit ASA8 MVLGEVYLKDILRTPPTGAIPANVPHPFQTSFYTYATKKLIPRHWYLLGGFTFTITLYGILDGLRDSGKKKAYDEAIHAGKTPYTAGGH 89 T 0.3 Selenoprotein_S pdbhh F Eukaryota T 6rdh 10 J 9 A0A5H1ZR73_9CHLO ASA-9: Polytomella F-ATP synthase associated subunit 9 MAVTSFLGKAFEKYFYDFSAYEQFGLNRFLSSKGQYVALRHVGFVMVGVNVLLAANFPFNPPFPTIGMCPAGWEGTWVCQADKAKALEMYKEWKKSN 97 T 0.39 1-cysPrx_C pdbhh F Eukaryota T 6rdi 1 A 0 A0A5H1ZR95_9CHLO ASA-10: Polytomella F-ATP synthase associated subunit 10 MSYSAYFAKAGFQFPAGLSALVAGIVALNVCTGRPTKGTKEISNAEYNATPIGYLQSPDQHPTAFPKVPGMKDVHGSPHHHH 82 T 0.1 DUF6506 pdb F Eukaryota T 6rdi 2 B 1 Q85JD5_9CHLO ATP synthase associated protein ASA1 MMRAAQKAKQELPATVLTQTRSYLAPLRSDFTEEITAPKVASASNLVNEWNNKKQATENLMKLLQAYKDIGDAKSEPLLKNHNPRTFEDRDYPVPDFRTQNLKAGDVPKFFDTVISTRASAAIASKDKFWAGRKTEAEAASAKASAAFPRVAVPEWKKGKTVSIENLNTVTDKYAAALVPKRKLALPVLPEGVKKAVEDFAASVGQAKNASEVSELLAKSLAEKAVVTEGGKVVEGFSYVSKAVAAKVIATRRAEVHERLLKLWAKRLLVSPELAIVPLNEFDAQLASKFEGISPKYQELLSAVAQGNKTFAQRLNSSPAFSSFLLKREKAESEVPPSELELEAAQKAAELEDPEVALRTLLGPQMEALGASDLLLSEQIRVITEHRYTPDRLQYKEGMKLADKIAAQEAALKEELKVIYGDNVDVKHFQASPRTPVQQLFDSLKNAAANKERAAKEAAAAASPYLAYAVTKKQEVQADPSNIPFDEVLYPQLSEELLELELSDIREDEIALEKAEEEELWLLTLTQQFKHIQKHFGIDLPHSVVAHMDPLLIKKIDWETTNALEDFDITLDDMGAEDAKEQWGAENLSHHFLPLIRYRRDLARKNGDRYGPDLVNGN 618 T 0.21 Lipoprotein_10 pdb F Eukaryota T 6rdi 4 D 3 K0J903_9CHLO Mitochondrial F1F0 ATP synthase associated 32 kDa protein MRQASRLALSIRQAGNVEAASAVPAMTRQFSAPGSHEHHETPLSKVMPTVVSIPRKVACLALGATKKVVCGLASSGPSQNLVSTFANKVIVEENLVNVAEIDVPFWSYWLSSAGFTSKDAFVKFAEAVKPKVAALSTSDITNLTVAFKRANYYDKDLFTGIEANVSANFTKFETEQLLQIVATFDAFNHSSVAFLDDVADSITYCNHYLAPVRAGADELATLLTYYAKNGHERADLLATVARGFSEVSLGKLSAAQRKDTVLSALKAFQTFGFYPESIEAVIGAALVSPAEYSAEELKEVEAVKVAAENALGGEFVLIQEGAHGH 325 T 0.042 FAST_1 pdbpercent F Eukaryota T 6rdi 7 G 6 D7P897_9CHLO Mitochondrial ATP synthase subunit ASA6 MMLRTLTRSSAVAGQAVRLFKTSAAAAEGNSVAGIIKSVNETSGANLLSSLKTIKAQAAPIYPAAASSTGYSTQAKIALFGALSWILYRADGQSKAHEWIVDLNLNVLQAAWLISFSSLIPFRAVYFAFRGMAPATASTLNGLKTFSSISL 151 T 0.00025 NrsF pdbpercent F Eukaryota T 6rdi 8 H 7 D8V7I2_9CHLO Mitochondrial ATP synthase associated protein ASA7 MSSVRAGVEAGRRDLTTFTFSGLQDAPVAALSGSIKLNVAAKAGKAEVTVAAGAAKAATQVSAAALRKLSGSKISLAEVARISVLHSSIQNYLLSLSNERYQLLSQWPDFTTMYGKDFYYRAHPEDLKKFYDAADEYYKLYETVTEFDSLSALASQVVPNYAARRRSTVHPAIGSTVADGAFTNFLLSKQ 190 T 0.96 DUF4296 pdbhh F Eukaryota T 6rdi 9 I 8 D8V7I7_9CHLO Mitochondrial ATP synthase subunit ASA8 MVLGEVYLKDILRTPPTGAIPANVPHPFQTSFYTYATKKLIPRHWYLLGGFTFTITLYGILDGLRDSGKKKAYDEAIHAGKTPYTAGGH 89 T 0.3 Selenoprotein_S pdbhh F Eukaryota T 6rdi 10 J 9 A0A5H1ZR73_9CHLO ASA-9: Polytomella F-ATP synthase associated subunit 9 MAVTSFLGKAFEKYFYDFSAYEQFGLNRFLSSKGQYVALRHVGFVMVGVNVLLAANFPFNPPFPTIGMCPAGWEGTWVCQADKAKALEMYKEWKKSN 97 T 0.39 1-cysPrx_C pdbhh F Eukaryota T 6rdk 1 A 0 A0A5H1ZR95_9CHLO ASA-10: Polytomella F-ATP synthase associated subunit 10 MSYSAYFAKAGFQFPAGLSALVAGIVALNVCTGRPTKGTKEISNAEYNATPIGYLQSPDQHPTAFPKVPGMKDVHGSPHHHH 82 T 0.1 DUF6506 pdb F Eukaryota T 6rdk 2 B 1 Q85JD5_9CHLO ATP synthase associated protein ASA1 MMRAAQKAKQELPATVLTQTRSYLAPLRSDFTEEITAPKVASASNLVNEWNNKKQATENLMKLLQAYKDIGDAKSEPLLKNHNPRTFEDRDYPVPDFRTQNLKAGDVPKFFDTVISTRASAAIASKDKFWAGRKTEAEAASAKASAAFPRVAVPEWKKGKTVSIENLNTVTDKYAAALVPKRKLALPVLPEGVKKAVEDFAASVGQAKNASEVSELLAKSLAEKAVVTEGGKVVEGFSYVSKAVAAKVIATRRAEVHERLLKLWAKRLLVSPELAIVPLNEFDAQLASKFEGISPKYQELLSAVAQGNKTFAQRLNSSPAFSSFLLKREKAESEVPPSELELEAAQKAAELEDPEVALRTLLGPQMEALGASDLLLSEQIRVITEHRYTPDRLQYKEGMKLADKIAAQEAALKEELKVIYGDNVDVKHFQASPRTPVQQLFDSLKNAAANKERAAKEAAAAASPYLAYAVTKKQEVQADPSNIPFDEVLYPQLSEELLELELSDIREDEIALEKAEEEELWLLTLTQQFKHIQKHFGIDLPHSVVAHMDPLLIKKIDWETTNALEDFDITLDDMGAEDAKEQWGAENLSHHFLPLIRYRRDLARKNGDRYGPDLVNGN 618 T 0.21 Lipoprotein_10 pdb F Eukaryota T 6rdk 4 D 3 K0J903_9CHLO Mitochondrial F1F0 ATP synthase associated 32 kDa protein MRQASRLALSIRQAGNVEAASAVPAMTRQFSAPGSHEHHETPLSKVMPTVVSIPRKVACLALGATKKVVCGLASSGPSQNLVSTFANKVIVEENLVNVAEIDVPFWSYWLSSAGFTSKDAFVKFAEAVKPKVAALSTSDITNLTVAFKRANYYDKDLFTGIEANVSANFTKFETEQLLQIVATFDAFNHSSVAFLDDVADSITYCNHYLAPVRAGADELATLLTYYAKNGHERADLLATVARGFSEVSLGKLSAAQRKDTVLSALKAFQTFGFYPESIEAVIGAALVSPAEYSAEELKEVEAVKVAAENALGGEFVLIQEGAHGH 325 T 0.042 FAST_1 pdbpercent F Eukaryota T 6rdk 7 G 6 D7P897_9CHLO Mitochondrial ATP synthase subunit ASA6 MMLRTLTRSSAVAGQAVRLFKTSAAAAEGNSVAGIIKSVNETSGANLLSSLKTIKAQAAPIYPAAASSTGYSTQAKIALFGALSWILYRADGQSKAHEWIVDLNLNVLQAAWLISFSSLIPFRAVYFAFRGMAPATASTLNGLKTFSSISL 151 T 0.00025 NrsF pdbpercent F Eukaryota T 6rdk 8 H 7 D8V7I2_9CHLO Mitochondrial ATP synthase associated protein ASA7 MSSVRAGVEAGRRDLTTFTFSGLQDAPVAALSGSIKLNVAAKAGKAEVTVAAGAAKAATQVSAAALRKLSGSKISLAEVARISVLHSSIQNYLLSLSNERYQLLSQWPDFTTMYGKDFYYRAHPEDLKKFYDAADEYYKLYETVTEFDSLSALASQVVPNYAARRRSTVHPAIGSTVADGAFTNFLLSKQ 190 T 0.96 DUF4296 pdbhh F Eukaryota T 6rdk 9 I 8 D8V7I7_9CHLO Mitochondrial ATP synthase subunit ASA8 MVLGEVYLKDILRTPPTGAIPANVPHPFQTSFYTYATKKLIPRHWYLLGGFTFTITLYGILDGLRDSGKKKAYDEAIHAGKTPYTAGGH 89 T 0.3 Selenoprotein_S pdbhh F Eukaryota T 6rdk 10 J 9 A0A5H1ZR73_9CHLO Mitochondrial ATP synthase subunit ASA9 MAVTSFLGKAFEKYFYDFSAYEQFGLNRFLSSKGQYVALRHVGFVMVGVNVLLAANFPFNPPFPTIGMCPAGWEGTWVCQADKAKALEMYKEWKKSN 97 T 0.39 1-cysPrx_C pdbhh F Eukaryota T 6rdl 1 A 0 A0A5H1ZR95_9CHLO ASA-10: Polytomella F-ATP synthase associated subunit 10 MSYSAYFAKAGFQFPAGLSALVAGIVALNVCTGRPTKGTKEISNAEYNATPIGYLQSPDQHPTAFPKVPGMKDVHGSPHHHH 82 T 0.1 DUF6506 pdb F Eukaryota T 6rdl 2 B 1 Q85JD5_9CHLO ATP synthase associated protein ASA1 MMRAAQKAKQELPATVLTQTRSYLAPLRSDFTEEITAPKVASASNLVNEWNNKKQATENLMKLLQAYKDIGDAKSEPLLKNHNPRTFEDRDYPVPDFRTQNLKAGDVPKFFDTVISTRASAAIASKDKFWAGRKTEAEAASAKASAAFPRVAVPEWKKGKTVSIENLNTVTDKYAAALVPKRKLALPVLPEGVKKAVEDFAASVGQAKNASEVSELLAKSLAEKAVVTEGGKVVEGFSYVSKAVAAKVIATRRAEVHERLLKLWAKRLLVSPELAIVPLNEFDAQLASKFEGISPKYQELLSAVAQGNKTFAQRLNSSPAFSSFLLKREKAESEVPPSELELEAAQKAAELEDPEVALRTLLGPQMEALGASDLLLSEQIRVITEHRYTPDRLQYKEGMKLADKIAAQEAALKEELKVIYGDNVDVKHFQASPRTPVQQLFDSLKNAAANKERAAKEAAAAASPYLAYAVTKKQEVQADPSNIPFDEVLYPQLSEELLELELSDIREDEIALEKAEEEELWLLTLTQQFKHIQKHFGIDLPHSVVAHMDPLLIKKIDWETTNALEDFDITLDDMGAEDAKEQWGAENLSHHFLPLIRYRRDLARKNGDRYGPDLVNGN 618 T 0.21 Lipoprotein_10 pdb F Eukaryota T 6rdl 4 D 3 K0J903_9CHLO Mitochondrial F1F0 ATP synthase associated 32 kDa protein MRQASRLALSIRQAGNVEAASAVPAMTRQFSAPGSHEHHETPLSKVMPTVVSIPRKVACLALGATKKVVCGLASSGPSQNLVSTFANKVIVEENLVNVAEIDVPFWSYWLSSAGFTSKDAFVKFAEAVKPKVAALSTSDITNLTVAFKRANYYDKDLFTGIEANVSANFTKFETEQLLQIVATFDAFNHSSVAFLDDVADSITYCNHYLAPVRAGADELATLLTYYAKNGHERADLLATVARGFSEVSLGKLSAAQRKDTVLSALKAFQTFGFYPESIEAVIGAALVSPAEYSAEELKEVEAVKVAAENALGGEFVLIQEGAHGH 325 T 0.042 FAST_1 pdbpercent F Eukaryota T 6rdl 7 G 6 D7P897_9CHLO Mitochondrial ATP synthase subunit ASA6 MMLRTLTRSSAVAGQAVRLFKTSAAAAEGNSVAGIIKSVNETSGANLLSSLKTIKAQAAPIYPAAASSTGYSTQAKIALFGALSWILYRADGQSKAHEWIVDLNLNVLQAAWLISFSSLIPFRAVYFAFRGMAPATASTLNGLKTFSSISL 151 T 0.00025 NrsF pdbpercent F Eukaryota T 6rdl 8 H 7 D8V7I2_9CHLO Mitochondrial ATP synthase associated protein ASA7 MSSVRAGVEAGRRDLTTFTFSGLQDAPVAALSGSIKLNVAAKAGKAEVTVAAGAAKAATQVSAAALRKLSGSKISLAEVARISVLHSSIQNYLLSLSNERYQLLSQWPDFTTMYGKDFYYRAHPEDLKKFYDAADEYYKLYETVTEFDSLSALASQVVPNYAARRRSTVHPAIGSTVADGAFTNFLLSKQ 190 T 0.96 DUF4296 pdbhh F Eukaryota T 6rdl 9 I 8 D8V7I7_9CHLO Mitochondrial ATP synthase subunit ASA8 MVLGEVYLKDILRTPPTGAIPANVPHPFQTSFYTYATKKLIPRHWYLLGGFTFTITLYGILDGLRDSGKKKAYDEAIHAGKTPYTAGGH 89 T 0.3 Selenoprotein_S pdbhh F Eukaryota T 6rdl 10 J 9 A0A5H1ZR73_9CHLO ASA-9: Polytomella F-ATP synthase associated subunit 9 MAVTSFLGKAFEKYFYDFSAYEQFGLNRFLSSKGQYVALRHVGFVMVGVNVLLAANFPFNPPFPTIGMCPAGWEGTWVCQADKAKALEMYKEWKKSN 97 T 0.39 1-cysPrx_C pdbhh F Eukaryota T 6rdn 1 A 0 A0A5H1ZR95_9CHLO ASA-10: Polytomella F-ATP synthase associated subunit 10 MSYSAYFAKAGFQFPAGLSALVAGIVALNVCTGRPTKGTKEISNAEYNATPIGYLQSPDQHPTAFPKVPGMKDVHGSPHHHH 82 T 0.1 DUF6506 pdb F Eukaryota T 6rdn 2 B 1 Q85JD5_9CHLO ATP synthase associated protein ASA1 MMRAAQKAKQELPATVLTQTRSYLAPLRSDFTEEITAPKVASASNLVNEWNNKKQATENLMKLLQAYKDIGDAKSEPLLKNHNPRTFEDRDYPVPDFRTQNLKAGDVPKFFDTVISTRASAAIASKDKFWAGRKTEAEAASAKASAAFPRVAVPEWKKGKTVSIENLNTVTDKYAAALVPKRKLALPVLPEGVKKAVEDFAASVGQAKNASEVSELLAKSLAEKAVVTEGGKVVEGFSYVSKAVAAKVIATRRAEVHERLLKLWAKRLLVSPELAIVPLNEFDAQLASKFEGISPKYQELLSAVAQGNKTFAQRLNSSPAFSSFLLKREKAESEVPPSELELEAAQKAAELEDPEVALRTLLGPQMEALGASDLLLSEQIRVITEHRYTPDRLQYKEGMKLADKIAAQEAALKEELKVIYGDNVDVKHFQASPRTPVQQLFDSLKNAAANKERAAKEAAAAASPYLAYAVTKKQEVQADPSNIPFDEVLYPQLSEELLELELSDIREDEIALEKAEEEELWLLTLTQQFKHIQKHFGIDLPHSVVAHMDPLLIKKIDWETTNALEDFDITLDDMGAEDAKEQWGAENLSHHFLPLIRYRRDLARKNGDRYGPDLVNGN 618 T 0.21 Lipoprotein_10 pdb F Eukaryota T 6rdn 4 D 3 K0J903_9CHLO Mitochondrial F1F0 ATP synthase associated 32 kDa protein MRQASRLALSIRQAGNVEAASAVPAMTRQFSAPGSHEHHETPLSKVMPTVVSIPRKVACLALGATKKVVCGLASSGPSQNLVSTFANKVIVEENLVNVAEIDVPFWSYWLSSAGFTSKDAFVKFAEAVKPKVAALSTSDITNLTVAFKRANYYDKDLFTGIEANVSANFTKFETEQLLQIVATFDAFNHSSVAFLDDVADSITYCNHYLAPVRAGADELATLLTYYAKNGHERADLLATVARGFSEVSLGKLSAAQRKDTVLSALKAFQTFGFYPESIEAVIGAALVSPAEYSAEELKEVEAVKVAAENALGGEFVLIQEGAHGH 325 T 0.042 FAST_1 pdbpercent F Eukaryota T 6rdn 7 G 6 D7P897_9CHLO Mitochondrial ATP synthase subunit ASA6 MMLRTLTRSSAVAGQAVRLFKTSAAAAEGNSVAGIIKSVNETSGANLLSSLKTIKAQAAPIYPAAASSTGYSTQAKIALFGALSWILYRADGQSKAHEWIVDLNLNVLQAAWLISFSSLIPFRAVYFAFRGMAPATASTLNGLKTFSSISL 151 T 0.00025 NrsF pdbpercent F Eukaryota T 6rdn 8 H 7 D8V7I2_9CHLO Mitochondrial ATP synthase associated protein ASA7 MSSVRAGVEAGRRDLTTFTFSGLQDAPVAALSGSIKLNVAAKAGKAEVTVAAGAAKAATQVSAAALRKLSGSKISLAEVARISVLHSSIQNYLLSLSNERYQLLSQWPDFTTMYGKDFYYRAHPEDLKKFYDAADEYYKLYETVTEFDSLSALASQVVPNYAARRRSTVHPAIGSTVADGAFTNFLLSKQ 190 T 0.96 DUF4296 pdbhh F Eukaryota T 6rdn 9 I 8 D8V7I7_9CHLO Mitochondrial ATP synthase subunit ASA8 MVLGEVYLKDILRTPPTGAIPANVPHPFQTSFYTYATKKLIPRHWYLLGGFTFTITLYGILDGLRDSGKKKAYDEAIHAGKTPYTAGGH 89 T 0.3 Selenoprotein_S pdbhh F Eukaryota T 6rdn 10 J 9 A0A5H1ZR73_9CHLO ASA-9: Polytomella F-ATP synthase associated subunit 9 MAVTSFLGKAFEKYFYDFSAYEQFGLNRFLSSKGQYVALRHVGFVMVGVNVLLAANFPFNPPFPTIGMCPAGWEGTWVCQADKAKALEMYKEWKKSN 97 T 0.39 1-cysPrx_C pdbhh F Eukaryota T 6rdo 1 A 0 A0A5H1ZR95_9CHLO ASA-10: Polytomella F-ATP synthase associated subunit 10 MSYSAYFAKAGFQFPAGLSALVAGIVALNVCTGRPTKGTKEISNAEYNATPIGYLQSPDQHPTAFPKVPGMKDVHGSPHHHH 82 T 0.1 DUF6506 pdb F Eukaryota T 6rdo 2 B 1 Q85JD5_9CHLO ATP synthase associated protein ASA1 MMRAAQKAKQELPATVLTQTRSYLAPLRSDFTEEITAPKVASASNLVNEWNNKKQATENLMKLLQAYKDIGDAKSEPLLKNHNPRTFEDRDYPVPDFRTQNLKAGDVPKFFDTVISTRASAAIASKDKFWAGRKTEAEAASAKASAAFPRVAVPEWKKGKTVSIENLNTVTDKYAAALVPKRKLALPVLPEGVKKAVEDFAASVGQAKNASEVSELLAKSLAEKAVVTEGGKVVEGFSYVSKAVAAKVIATRRAEVHERLLKLWAKRLLVSPELAIVPLNEFDAQLASKFEGISPKYQELLSAVAQGNKTFAQRLNSSPAFSSFLLKREKAESEVPPSELELEAAQKAAELEDPEVALRTLLGPQMEALGASDLLLSEQIRVITEHRYTPDRLQYKEGMKLADKIAAQEAALKEELKVIYGDNVDVKHFQASPRTPVQQLFDSLKNAAANKERAAKEAAAAASPYLAYAVTKKQEVQADPSNIPFDEVLYPQLSEELLELELSDIREDEIALEKAEEEELWLLTLTQQFKHIQKHFGIDLPHSVVAHMDPLLIKKIDWETTNALEDFDITLDDMGAEDAKEQWGAENLSHHFLPLIRYRRDLARKNGDRYGPDLVNGN 618 T 0.21 Lipoprotein_10 pdb F Eukaryota T 6rdo 4 D 3 K0J903_9CHLO Mitochondrial F1F0 ATP synthase associated 32 kDa protein MRQASRLALSIRQAGNVEAASAVPAMTRQFSAPGSHEHHETPLSKVMPTVVSIPRKVACLALGATKKVVCGLASSGPSQNLVSTFANKVIVEENLVNVAEIDVPFWSYWLSSAGFTSKDAFVKFAEAVKPKVAALSTSDITNLTVAFKRANYYDKDLFTGIEANVSANFTKFETEQLLQIVATFDAFNHSSVAFLDDVADSITYCNHYLAPVRAGADELATLLTYYAKNGHERADLLATVARGFSEVSLGKLSAAQRKDTVLSALKAFQTFGFYPESIEAVIGAALVSPAEYSAEELKEVEAVKVAAENALGGEFVLIQEGAHGH 325 T 0.042 FAST_1 pdbpercent F Eukaryota T 6rdo 7 G 6 D7P897_9CHLO Mitochondrial ATP synthase subunit ASA6 MMLRTLTRSSAVAGQAVRLFKTSAAAAEGNSVAGIIKSVNETSGANLLSSLKTIKAQAAPIYPAAASSTGYSTQAKIALFGALSWILYRADGQSKAHEWIVDLNLNVLQAAWLISFSSLIPFRAVYFAFRGMAPATASTLNGLKTFSSISL 151 T 0.00025 NrsF pdbpercent F Eukaryota T 6rdo 8 H 7 D8V7I2_9CHLO Mitochondrial ATP synthase associated protein ASA7 MSSVRAGVEAGRRDLTTFTFSGLQDAPVAALSGSIKLNVAAKAGKAEVTVAAGAAKAATQVSAAALRKLSGSKISLAEVARISVLHSSIQNYLLSLSNERYQLLSQWPDFTTMYGKDFYYRAHPEDLKKFYDAADEYYKLYETVTEFDSLSALASQVVPNYAARRRSTVHPAIGSTVADGAFTNFLLSKQ 190 T 0.96 DUF4296 pdbhh F Eukaryota T 6rdo 9 I 8 D8V7I7_9CHLO Mitochondrial ATP synthase subunit ASA8 MVLGEVYLKDILRTPPTGAIPANVPHPFQTSFYTYATKKLIPRHWYLLGGFTFTITLYGILDGLRDSGKKKAYDEAIHAGKTPYTAGGH 89 T 0.3 Selenoprotein_S pdbhh F Eukaryota T 6rdo 10 J 9 A0A5H1ZR73_9CHLO ASA-9: Polytomella F-ATP synthase associated subunit 9 MAVTSFLGKAFEKYFYDFSAYEQFGLNRFLSSKGQYVALRHVGFVMVGVNVLLAANFPFNPPFPTIGMCPAGWEGTWVCQADKAKALEMYKEWKKSN 97 T 0.39 1-cysPrx_C pdbhh F Eukaryota T 6rdq 1 A 0 A0A5H1ZR95_9CHLO ASA-10: Polytomella F-ATP synthase associated subunit 10 MSYSAYFAKAGFQFPAGLSALVAGIVALNVCTGRPTKGTKEISNAEYNATPIGYLQSPDQHPTAFPKVPGMKDVHGSPHHHH 82 T 0.1 DUF6506 pdb F Eukaryota T 6rdq 2 B 1 Q85JD5_9CHLO ATP synthase associated protein ASA1 MMRAAQKAKQELPATVLTQTRSYLAPLRSDFTEEITAPKVASASNLVNEWNNKKQATENLMKLLQAYKDIGDAKSEPLLKNHNPRTFEDRDYPVPDFRTQNLKAGDVPKFFDTVISTRASAAIASKDKFWAGRKTEAEAASAKASAAFPRVAVPEWKKGKTVSIENLNTVTDKYAAALVPKRKLALPVLPEGVKKAVEDFAASVGQAKNASEVSELLAKSLAEKAVVTEGGKVVEGFSYVSKAVAAKVIATRRAEVHERLLKLWAKRLLVSPELAIVPLNEFDAQLASKFEGISPKYQELLSAVAQGNKTFAQRLNSSPAFSSFLLKREKAESEVPPSELELEAAQKAAELEDPEVALRTLLGPQMEALGASDLLLSEQIRVITEHRYTPDRLQYKEGMKLADKIAAQEAALKEELKVIYGDNVDVKHFQASPRTPVQQLFDSLKNAAANKERAAKEAAAAASPYLAYAVTKKQEVQADPSNIPFDEVLYPQLSEELLELELSDIREDEIALEKAEEEELWLLTLTQQFKHIQKHFGIDLPHSVVAHMDPLLIKKIDWETTNALEDFDITLDDMGAEDAKEQWGAENLSHHFLPLIRYRRDLARKNGDRYGPDLVNGN 618 T 0.21 Lipoprotein_10 pdb F Eukaryota T 6rdq 4 D 3 K0J903_9CHLO Mitochondrial F1F0 ATP synthase associated 32 kDa protein MRQASRLALSIRQAGNVEAASAVPAMTRQFSAPGSHEHHETPLSKVMPTVVSIPRKVACLALGATKKVVCGLASSGPSQNLVSTFANKVIVEENLVNVAEIDVPFWSYWLSSAGFTSKDAFVKFAEAVKPKVAALSTSDITNLTVAFKRANYYDKDLFTGIEANVSANFTKFETEQLLQIVATFDAFNHSSVAFLDDVADSITYCNHYLAPVRAGADELATLLTYYAKNGHERADLLATVARGFSEVSLGKLSAAQRKDTVLSALKAFQTFGFYPESIEAVIGAALVSPAEYSAEELKEVEAVKVAAENALGGEFVLIQEGAHGH 325 T 0.042 FAST_1 pdbpercent F Eukaryota T 6rdq 7 G 6 D7P897_9CHLO Mitochondrial ATP synthase subunit ASA6 MMLRTLTRSSAVAGQAVRLFKTSAAAAEGNSVAGIIKSVNETSGANLLSSLKTIKAQAAPIYPAAASSTGYSTQAKIALFGALSWILYRADGQSKAHEWIVDLNLNVLQAAWLISFSSLIPFRAVYFAFRGMAPATASTLNGLKTFSSISL 151 T 0.00025 NrsF pdbpercent F Eukaryota T 6rdq 8 H 7 D8V7I2_9CHLO Mitochondrial ATP synthase associated protein ASA7 MSSVRAGVEAGRRDLTTFTFSGLQDAPVAALSGSIKLNVAAKAGKAEVTVAAGAAKAATQVSAAALRKLSGSKISLAEVARISVLHSSIQNYLLSLSNERYQLLSQWPDFTTMYGKDFYYRAHPEDLKKFYDAADEYYKLYETVTEFDSLSALASQVVPNYAARRRSTVHPAIGSTVADGAFTNFLLSKQ 190 T 0.96 DUF4296 pdbhh F Eukaryota T 6rdq 9 I 8 D8V7I7_9CHLO Mitochondrial ATP synthase subunit ASA8 MVLGEVYLKDILRTPPTGAIPANVPHPFQTSFYTYATKKLIPRHWYLLGGFTFTITLYGILDGLRDSGKKKAYDEAIHAGKTPYTAGGH 89 T 0.3 Selenoprotein_S pdbhh F Eukaryota T 6rdq 10 J 9 A0A5H1ZR73_9CHLO ASA-9: Polytomella F-ATP synthase associated subunit 9 MAVTSFLGKAFEKYFYDFSAYEQFGLNRFLSSKGQYVALRHVGFVMVGVNVLLAANFPFNPPFPTIGMCPAGWEGTWVCQADKAKALEMYKEWKKSN 97 T 0.39 1-cysPrx_C pdbhh F Eukaryota T 6rdr 1 A 0 A0A5H1ZR95_9CHLO ASA-10: Polytomella F-ATP synthase associated subunit 10 MSYSAYFAKAGFQFPAGLSALVAGIVALNVCTGRPTKGTKEISNAEYNATPIGYLQSPDQHPTAFPKVPGMKDVHGSPHHHH 82 T 0.1 DUF6506 pdb F Eukaryota T 6rdr 2 B 1 Q85JD5_9CHLO ATP synthase associated protein ASA1 MMRAAQKAKQELPATVLTQTRSYLAPLRSDFTEEITAPKVASASNLVNEWNNKKQATENLMKLLQAYKDIGDAKSEPLLKNHNPRTFEDRDYPVPDFRTQNLKAGDVPKFFDTVISTRASAAIASKDKFWAGRKTEAEAASAKASAAFPRVAVPEWKKGKTVSIENLNTVTDKYAAALVPKRKLALPVLPEGVKKAVEDFAASVGQAKNASEVSELLAKSLAEKAVVTEGGKVVEGFSYVSKAVAAKVIATRRAEVHERLLKLWAKRLLVSPELAIVPLNEFDAQLASKFEGISPKYQELLSAVAQGNKTFAQRLNSSPAFSSFLLKREKAESEVPPSELELEAAQKAAELEDPEVALRTLLGPQMEALGASDLLLSEQIRVITEHRYTPDRLQYKEGMKLADKIAAQEAALKEELKVIYGDNVDVKHFQASPRTPVQQLFDSLKNAAANKERAAKEAAAAASPYLAYAVTKKQEVQADPSNIPFDEVLYPQLSEELLELELSDIREDEIALEKAEEEELWLLTLTQQFKHIQKHFGIDLPHSVVAHMDPLLIKKIDWETTNALEDFDITLDDMGAEDAKEQWGAENLSHHFLPLIRYRRDLARKNGDRYGPDLVNGN 618 T 0.21 Lipoprotein_10 pdb F Eukaryota T 6rdr 4 D 3 K0J903_9CHLO Mitochondrial F1F0 ATP synthase associated 32 kDa protein MRQASRLALSIRQAGNVEAASAVPAMTRQFSAPGSHEHHETPLSKVMPTVVSIPRKVACLALGATKKVVCGLASSGPSQNLVSTFANKVIVEENLVNVAEIDVPFWSYWLSSAGFTSKDAFVKFAEAVKPKVAALSTSDITNLTVAFKRANYYDKDLFTGIEANVSANFTKFETEQLLQIVATFDAFNHSSVAFLDDVADSITYCNHYLAPVRAGADELATLLTYYAKNGHERADLLATVARGFSEVSLGKLSAAQRKDTVLSALKAFQTFGFYPESIEAVIGAALVSPAEYSAEELKEVEAVKVAAENALGGEFVLIQEGAHGH 325 T 0.042 FAST_1 pdbpercent F Eukaryota T 6rdr 7 G 6 D7P897_9CHLO Mitochondrial ATP synthase subunit ASA6 MMLRTLTRSSAVAGQAVRLFKTSAAAAEGNSVAGIIKSVNETSGANLLSSLKTIKAQAAPIYPAAASSTGYSTQAKIALFGALSWILYRADGQSKAHEWIVDLNLNVLQAAWLISFSSLIPFRAVYFAFRGMAPATASTLNGLKTFSSISL 151 T 0.00025 NrsF pdbpercent F Eukaryota T 6rdr 8 H 7 D8V7I2_9CHLO Mitochondrial ATP synthase associated protein ASA7 MSSVRAGVEAGRRDLTTFTFSGLQDAPVAALSGSIKLNVAAKAGKAEVTVAAGAAKAATQVSAAALRKLSGSKISLAEVARISVLHSSIQNYLLSLSNERYQLLSQWPDFTTMYGKDFYYRAHPEDLKKFYDAADEYYKLYETVTEFDSLSALASQVVPNYAARRRSTVHPAIGSTVADGAFTNFLLSKQ 190 T 0.96 DUF4296 pdbhh F Eukaryota T 6rdr 9 I 8 D8V7I7_9CHLO Mitochondrial ATP synthase subunit ASA8 MVLGEVYLKDILRTPPTGAIPANVPHPFQTSFYTYATKKLIPRHWYLLGGFTFTITLYGILDGLRDSGKKKAYDEAIHAGKTPYTAGGH 89 T 0.3 Selenoprotein_S pdbhh F Eukaryota T 6rdr 10 J 9 A0A5H1ZR73_9CHLO ASA-9: Polytomella F-ATP synthase associated subunit 9 MAVTSFLGKAFEKYFYDFSAYEQFGLNRFLSSKGQYVALRHVGFVMVGVNVLLAANFPFNPPFPTIGMCPAGWEGTWVCQADKAKALEMYKEWKKSN 97 T 0.39 1-cysPrx_C pdbhh F Eukaryota T 6rdt 1 A 0 A0A5H1ZR95_9CHLO ASA-10: Polytomella F-ATP synthase associated subunit 10 MSYSAYFAKAGFQFPAGLSALVAGIVALNVCTGRPTKGTKEISNAEYNATPIGYLQSPDQHPTAFPKVPGMKDVHGSPHHHH 82 T 0.1 DUF6506 pdb F Eukaryota T 6rdt 2 B 1 Q85JD5_9CHLO ATP synthase associated protein ASA1 MMRAAQKAKQELPATVLTQTRSYLAPLRSDFTEEITAPKVASASNLVNEWNNKKQATENLMKLLQAYKDIGDAKSEPLLKNHNPRTFEDRDYPVPDFRTQNLKAGDVPKFFDTVISTRASAAIASKDKFWAGRKTEAEAASAKASAAFPRVAVPEWKKGKTVSIENLNTVTDKYAAALVPKRKLALPVLPEGVKKAVEDFAASVGQAKNASEVSELLAKSLAEKAVVTEGGKVVEGFSYVSKAVAAKVIATRRAEVHERLLKLWAKRLLVSPELAIVPLNEFDAQLASKFEGISPKYQELLSAVAQGNKTFAQRLNSSPAFSSFLLKREKAESEVPPSELELEAAQKAAELEDPEVALRTLLGPQMEALGASDLLLSEQIRVITEHRYTPDRLQYKEGMKLADKIAAQEAALKEELKVIYGDNVDVKHFQASPRTPVQQLFDSLKNAAANKERAAKEAAAAASPYLAYAVTKKQEVQADPSNIPFDEVLYPQLSEELLELELSDIREDEIALEKAEEEELWLLTLTQQFKHIQKHFGIDLPHSVVAHMDPLLIKKIDWETTNALEDFDITLDDMGAEDAKEQWGAENLSHHFLPLIRYRRDLARKNGDRYGPDLVNGN 618 T 0.21 Lipoprotein_10 pdb F Eukaryota T 6rdt 4 D 3 K0J903_9CHLO Mitochondrial F1F0 ATP synthase associated 32 kDa protein MRQASRLALSIRQAGNVEAASAVPAMTRQFSAPGSHEHHETPLSKVMPTVVSIPRKVACLALGATKKVVCGLASSGPSQNLVSTFANKVIVEENLVNVAEIDVPFWSYWLSSAGFTSKDAFVKFAEAVKPKVAALSTSDITNLTVAFKRANYYDKDLFTGIEANVSANFTKFETEQLLQIVATFDAFNHSSVAFLDDVADSITYCNHYLAPVRAGADELATLLTYYAKNGHERADLLATVARGFSEVSLGKLSAAQRKDTVLSALKAFQTFGFYPESIEAVIGAALVSPAEYSAEELKEVEAVKVAAENALGGEFVLIQEGAHGH 325 T 0.042 FAST_1 pdbpercent F Eukaryota T 6rdt 7 G 6 D7P897_9CHLO Mitochondrial ATP synthase subunit ASA6 MMLRTLTRSSAVAGQAVRLFKTSAAAAEGNSVAGIIKSVNETSGANLLSSLKTIKAQAAPIYPAAASSTGYSTQAKIALFGALSWILYRADGQSKAHEWIVDLNLNVLQAAWLISFSSLIPFRAVYFAFRGMAPATASTLNGLKTFSSISL 151 T 0.00025 NrsF pdbpercent F Eukaryota T 6rdt 8 H 7 D8V7I2_9CHLO Mitochondrial ATP synthase associated protein ASA7 MSSVRAGVEAGRRDLTTFTFSGLQDAPVAALSGSIKLNVAAKAGKAEVTVAAGAAKAATQVSAAALRKLSGSKISLAEVARISVLHSSIQNYLLSLSNERYQLLSQWPDFTTMYGKDFYYRAHPEDLKKFYDAADEYYKLYETVTEFDSLSALASQVVPNYAARRRSTVHPAIGSTVADGAFTNFLLSKQ 190 T 0.96 DUF4296 pdbhh F Eukaryota T 6rdt 9 I 8 D8V7I7_9CHLO Mitochondrial ATP synthase subunit ASA8 MVLGEVYLKDILRTPPTGAIPANVPHPFQTSFYTYATKKLIPRHWYLLGGFTFTITLYGILDGLRDSGKKKAYDEAIHAGKTPYTAGGH 89 T 0.3 Selenoprotein_S pdbhh F Eukaryota T 6rdt 10 J 9 A0A5H1ZR73_9CHLO ASA-9: Polytomella F-ATP synthase associated subunit 9 MAVTSFLGKAFEKYFYDFSAYEQFGLNRFLSSKGQYVALRHVGFVMVGVNVLLAANFPFNPPFPTIGMCPAGWEGTWVCQADKAKALEMYKEWKKSN 97 T 0.39 1-cysPrx_C pdbhh F Eukaryota T 6rdu 1 A 0 A0A5H1ZR95_9CHLO ASA-10: Polytomella F-ATP synthase associated subunit 10 MSYSAYFAKAGFQFPAGLSALVAGIVALNVCTGRPTKGTKEISNAEYNATPIGYLQSPDQHPTAFPKVPGMKDVHGSPHHHH 82 T 0.1 DUF6506 pdb F Eukaryota T 6rdu 2 B 1 Q85JD5_9CHLO ATP synthase associated protein ASA1 MMRAAQKAKQELPATVLTQTRSYLAPLRSDFTEEITAPKVASASNLVNEWNNKKQATENLMKLLQAYKDIGDAKSEPLLKNHNPRTFEDRDYPVPDFRTQNLKAGDVPKFFDTVISTRASAAIASKDKFWAGRKTEAEAASAKASAAFPRVAVPEWKKGKTVSIENLNTVTDKYAAALVPKRKLALPVLPEGVKKAVEDFAASVGQAKNASEVSELLAKSLAEKAVVTEGGKVVEGFSYVSKAVAAKVIATRRAEVHERLLKLWAKRLLVSPELAIVPLNEFDAQLASKFEGISPKYQELLSAVAQGNKTFAQRLNSSPAFSSFLLKREKAESEVPPSELELEAAQKAAELEDPEVALRTLLGPQMEALGASDLLLSEQIRVITEHRYTPDRLQYKEGMKLADKIAAQEAALKEELKVIYGDNVDVKHFQASPRTPVQQLFDSLKNAAANKERAAKEAAAAASPYLAYAVTKKQEVQADPSNIPFDEVLYPQLSEELLELELSDIREDEIALEKAEEEELWLLTLTQQFKHIQKHFGIDLPHSVVAHMDPLLIKKIDWETTNALEDFDITLDDMGAEDAKEQWGAENLSHHFLPLIRYRRDLARKNGDRYGPDLVNGN 618 T 0.21 Lipoprotein_10 pdb F Eukaryota T 6rdu 4 D 3 K0J903_9CHLO Mitochondrial F1F0 ATP synthase associated 32 kDa protein MRQASRLALSIRQAGNVEAASAVPAMTRQFSAPGSHEHHETPLSKVMPTVVSIPRKVACLALGATKKVVCGLASSGPSQNLVSTFANKVIVEENLVNVAEIDVPFWSYWLSSAGFTSKDAFVKFAEAVKPKVAALSTSDITNLTVAFKRANYYDKDLFTGIEANVSANFTKFETEQLLQIVATFDAFNHSSVAFLDDVADSITYCNHYLAPVRAGADELATLLTYYAKNGHERADLLATVARGFSEVSLGKLSAAQRKDTVLSALKAFQTFGFYPESIEAVIGAALVSPAEYSAEELKEVEAVKVAAENALGGEFVLIQEGAHGH 325 T 0.042 FAST_1 pdbpercent F Eukaryota T 6rdu 7 G 6 D7P897_9CHLO Mitochondrial ATP synthase subunit ASA6 MMLRTLTRSSAVAGQAVRLFKTSAAAAEGNSVAGIIKSVNETSGANLLSSLKTIKAQAAPIYPAAASSTGYSTQAKIALFGALSWILYRADGQSKAHEWIVDLNLNVLQAAWLISFSSLIPFRAVYFAFRGMAPATASTLNGLKTFSSISL 151 T 0.00025 NrsF pdbpercent F Eukaryota T 6rdu 8 H 7 D8V7I2_9CHLO Mitochondrial ATP synthase associated protein ASA7 MSSVRAGVEAGRRDLTTFTFSGLQDAPVAALSGSIKLNVAAKAGKAEVTVAAGAAKAATQVSAAALRKLSGSKISLAEVARISVLHSSIQNYLLSLSNERYQLLSQWPDFTTMYGKDFYYRAHPEDLKKFYDAADEYYKLYETVTEFDSLSALASQVVPNYAARRRSTVHPAIGSTVADGAFTNFLLSKQ 190 T 0.96 DUF4296 pdbhh F Eukaryota T 6rdu 9 I 8 D8V7I7_9CHLO Mitochondrial ATP synthase subunit ASA8 MVLGEVYLKDILRTPPTGAIPANVPHPFQTSFYTYATKKLIPRHWYLLGGFTFTITLYGILDGLRDSGKKKAYDEAIHAGKTPYTAGGH 89 T 0.3 Selenoprotein_S pdbhh F Eukaryota T 6rdu 10 J 9 A0A5H1ZR73_9CHLO ASA-9: Polytomella F-ATP synthase associated subunit 9 MAVTSFLGKAFEKYFYDFSAYEQFGLNRFLSSKGQYVALRHVGFVMVGVNVLLAANFPFNPPFPTIGMCPAGWEGTWVCQADKAKALEMYKEWKKSN 97 T 0.39 1-cysPrx_C pdbhh F Eukaryota T 6rdw 1 A 0 A0A5H1ZR95_9CHLO ASA-10: Polytomella F-ATP synthase associated subunit 10 MSYSAYFAKAGFQFPAGLSALVAGIVALNVCTGRPTKGTKEISNAEYNATPIGYLQSPDQHPTAFPKVPGMKDVHGSPHHHH 82 T 0.1 DUF6506 pdb F Eukaryota T 6rdw 2 B 1 Q85JD5_9CHLO ATP synthase associated protein ASA1 MMRAAQKAKQELPATVLTQTRSYLAPLRSDFTEEITAPKVASASNLVNEWNNKKQATENLMKLLQAYKDIGDAKSEPLLKNHNPRTFEDRDYPVPDFRTQNLKAGDVPKFFDTVISTRASAAIASKDKFWAGRKTEAEAASAKASAAFPRVAVPEWKKGKTVSIENLNTVTDKYAAALVPKRKLALPVLPEGVKKAVEDFAASVGQAKNASEVSELLAKSLAEKAVVTEGGKVVEGFSYVSKAVAAKVIATRRAEVHERLLKLWAKRLLVSPELAIVPLNEFDAQLASKFEGISPKYQELLSAVAQGNKTFAQRLNSSPAFSSFLLKREKAESEVPPSELELEAAQKAAELEDPEVALRTLLGPQMEALGASDLLLSEQIRVITEHRYTPDRLQYKEGMKLADKIAAQEAALKEELKVIYGDNVDVKHFQASPRTPVQQLFDSLKNAAANKERAAKEAAAAASPYLAYAVTKKQEVQADPSNIPFDEVLYPQLSEELLELELSDIREDEIALEKAEEEELWLLTLTQQFKHIQKHFGIDLPHSVVAHMDPLLIKKIDWETTNALEDFDITLDDMGAEDAKEQWGAENLSHHFLPLIRYRRDLARKNGDRYGPDLVNGN 618 T 0.21 Lipoprotein_10 pdb F Eukaryota T 6rdw 4 D 3 K0J903_9CHLO Mitochondrial F1F0 ATP synthase associated 32 kDa protein MRQASRLALSIRQAGNVEAASAVPAMTRQFSAPGSHEHHETPLSKVMPTVVSIPRKVACLALGATKKVVCGLASSGPSQNLVSTFANKVIVEENLVNVAEIDVPFWSYWLSSAGFTSKDAFVKFAEAVKPKVAALSTSDITNLTVAFKRANYYDKDLFTGIEANVSANFTKFETEQLLQIVATFDAFNHSSVAFLDDVADSITYCNHYLAPVRAGADELATLLTYYAKNGHERADLLATVARGFSEVSLGKLSAAQRKDTVLSALKAFQTFGFYPESIEAVIGAALVSPAEYSAEELKEVEAVKVAAENALGGEFVLIQEGAHGH 325 T 0.042 FAST_1 pdbpercent F Eukaryota T 6rdw 7 G 6 D7P897_9CHLO Mitochondrial ATP synthase subunit ASA6 MMLRTLTRSSAVAGQAVRLFKTSAAAAEGNSVAGIIKSVNETSGANLLSSLKTIKAQAAPIYPAAASSTGYSTQAKIALFGALSWILYRADGQSKAHEWIVDLNLNVLQAAWLISFSSLIPFRAVYFAFRGMAPATASTLNGLKTFSSISL 151 T 0.00025 NrsF pdbpercent F Eukaryota T 6rdw 8 H 7 D8V7I2_9CHLO Mitochondrial ATP synthase associated protein ASA7 MSSVRAGVEAGRRDLTTFTFSGLQDAPVAALSGSIKLNVAAKAGKAEVTVAAGAAKAATQVSAAALRKLSGSKISLAEVARISVLHSSIQNYLLSLSNERYQLLSQWPDFTTMYGKDFYYRAHPEDLKKFYDAADEYYKLYETVTEFDSLSALASQVVPNYAARRRSTVHPAIGSTVADGAFTNFLLSKQ 190 T 0.96 DUF4296 pdbhh F Eukaryota T 6rdw 9 I 8 D8V7I7_9CHLO Mitochondrial ATP synthase subunit ASA8 MVLGEVYLKDILRTPPTGAIPANVPHPFQTSFYTYATKKLIPRHWYLLGGFTFTITLYGILDGLRDSGKKKAYDEAIHAGKTPYTAGGH 89 T 0.3 Selenoprotein_S pdbhh F Eukaryota T 6rdw 10 J 9 A0A5H1ZR73_9CHLO ASA-9: Polytomella F-ATP synthase associated subunit 9 MAVTSFLGKAFEKYFYDFSAYEQFGLNRFLSSKGQYVALRHVGFVMVGVNVLLAANFPFNPPFPTIGMCPAGWEGTWVCQADKAKALEMYKEWKKSN 97 T 0.39 1-cysPrx_C pdbhh F Eukaryota T 6rdx 1 A 0 A0A5H1ZR95_9CHLO ASA-10: Polytomella F-ATP synthase associated subunit 10 MSYSAYFAKAGFQFPAGLSALVAGIVALNVCTGRPTKGTKEISNAEYNATPIGYLQSPDQHPTAFPKVPGMKDVHGSPHHHH 82 T 0.1 DUF6506 pdb F Eukaryota T 6rdx 2 B 1 Q85JD5_9CHLO ATP synthase associated protein ASA1 MMRAAQKAKQELPATVLTQTRSYLAPLRSDFTEEITAPKVASASNLVNEWNNKKQATENLMKLLQAYKDIGDAKSEPLLKNHNPRTFEDRDYPVPDFRTQNLKAGDVPKFFDTVISTRASAAIASKDKFWAGRKTEAEAASAKASAAFPRVAVPEWKKGKTVSIENLNTVTDKYAAALVPKRKLALPVLPEGVKKAVEDFAASVGQAKNASEVSELLAKSLAEKAVVTEGGKVVEGFSYVSKAVAAKVIATRRAEVHERLLKLWAKRLLVSPELAIVPLNEFDAQLASKFEGISPKYQELLSAVAQGNKTFAQRLNSSPAFSSFLLKREKAESEVPPSELELEAAQKAAELEDPEVALRTLLGPQMEALGASDLLLSEQIRVITEHRYTPDRLQYKEGMKLADKIAAQEAALKEELKVIYGDNVDVKHFQASPRTPVQQLFDSLKNAAANKERAAKEAAAAASPYLAYAVTKKQEVQADPSNIPFDEVLYPQLSEELLELELSDIREDEIALEKAEEEELWLLTLTQQFKHIQKHFGIDLPHSVVAHMDPLLIKKIDWETTNALEDFDITLDDMGAEDAKEQWGAENLSHHFLPLIRYRRDLARKNGDRYGPDLVNGN 618 T 0.21 Lipoprotein_10 pdb F Eukaryota T 6rdx 4 D 3 K0J903_9CHLO Mitochondrial F1F0 ATP synthase associated 32 kDa protein MRQASRLALSIRQAGNVEAASAVPAMTRQFSAPGSHEHHETPLSKVMPTVVSIPRKVACLALGATKKVVCGLASSGPSQNLVSTFANKVIVEENLVNVAEIDVPFWSYWLSSAGFTSKDAFVKFAEAVKPKVAALSTSDITNLTVAFKRANYYDKDLFTGIEANVSANFTKFETEQLLQIVATFDAFNHSSVAFLDDVADSITYCNHYLAPVRAGADELATLLTYYAKNGHERADLLATVARGFSEVSLGKLSAAQRKDTVLSALKAFQTFGFYPESIEAVIGAALVSPAEYSAEELKEVEAVKVAAENALGGEFVLIQEGAHGH 325 T 0.042 FAST_1 pdbpercent F Eukaryota T 6rdx 7 G 6 D7P897_9CHLO Mitochondrial ATP synthase subunit ASA6 MMLRTLTRSSAVAGQAVRLFKTSAAAAEGNSVAGIIKSVNETSGANLLSSLKTIKAQAAPIYPAAASSTGYSTQAKIALFGALSWILYRADGQSKAHEWIVDLNLNVLQAAWLISFSSLIPFRAVYFAFRGMAPATASTLNGLKTFSSISL 151 T 0.00025 NrsF pdbpercent F Eukaryota T 6rdx 8 H 7 D8V7I2_9CHLO Mitochondrial ATP synthase associated protein ASA7 MSSVRAGVEAGRRDLTTFTFSGLQDAPVAALSGSIKLNVAAKAGKAEVTVAAGAAKAATQVSAAALRKLSGSKISLAEVARISVLHSSIQNYLLSLSNERYQLLSQWPDFTTMYGKDFYYRAHPEDLKKFYDAADEYYKLYETVTEFDSLSALASQVVPNYAARRRSTVHPAIGSTVADGAFTNFLLSKQ 190 T 0.96 DUF4296 pdbhh F Eukaryota T 6rdx 9 I 8 D8V7I7_9CHLO Mitochondrial ATP synthase subunit ASA8 MVLGEVYLKDILRTPPTGAIPANVPHPFQTSFYTYATKKLIPRHWYLLGGFTFTITLYGILDGLRDSGKKKAYDEAIHAGKTPYTAGGH 89 T 0.3 Selenoprotein_S pdbhh F Eukaryota T 6rdx 10 J 9 A0A5H1ZR73_9CHLO Mitochondrial ATP synthase subunit ASA9 MAVTSFLGKAFEKYFYDFSAYEQFGLNRFLSSKGQYVALRHVGFVMVGVNVLLAANFPFNPPFPTIGMCPAGWEGTWVCQADKAKALEMYKEWKKSN 97 T 0.39 1-cysPrx_C pdbhh F Eukaryota T 6rdz 1 A 0 A0A5H1ZR95_9CHLO ASA-10: Polytomella F-ATP synthase associated subunit 10 MSYSAYFAKAGFQFPAGLSALVAGIVALNVCTGRPTKGTKEISNAEYNATPIGYLQSPDQHPTAFPKVPGMKDVHGSPHHHH 82 T 0.1 DUF6506 pdb F Eukaryota T 6rdz 2 B 1 Q85JD5_9CHLO ATP synthase associated protein ASA1 MMRAAQKAKQELPATVLTQTRSYLAPLRSDFTEEITAPKVASASNLVNEWNNKKQATENLMKLLQAYKDIGDAKSEPLLKNHNPRTFEDRDYPVPDFRTQNLKAGDVPKFFDTVISTRASAAIASKDKFWAGRKTEAEAASAKASAAFPRVAVPEWKKGKTVSIENLNTVTDKYAAALVPKRKLALPVLPEGVKKAVEDFAASVGQAKNASEVSELLAKSLAEKAVVTEGGKVVEGFSYVSKAVAAKVIATRRAEVHERLLKLWAKRLLVSPELAIVPLNEFDAQLASKFEGISPKYQELLSAVAQGNKTFAQRLNSSPAFSSFLLKREKAESEVPPSELELEAAQKAAELEDPEVALRTLLGPQMEALGASDLLLSEQIRVITEHRYTPDRLQYKEGMKLADKIAAQEAALKEELKVIYGDNVDVKHFQASPRTPVQQLFDSLKNAAANKERAAKEAAAAASPYLAYAVTKKQEVQADPSNIPFDEVLYPQLSEELLELELSDIREDEIALEKAEEEELWLLTLTQQFKHIQKHFGIDLPHSVVAHMDPLLIKKIDWETTNALEDFDITLDDMGAEDAKEQWGAENLSHHFLPLIRYRRDLARKNGDRYGPDLVNGN 618 T 0.21 Lipoprotein_10 pdb F Eukaryota T 6rdz 4 D 3 K0J903_9CHLO Mitochondrial F1F0 ATP synthase associated 32 kDa protein MRQASRLALSIRQAGNVEAASAVPAMTRQFSAPGSHEHHETPLSKVMPTVVSIPRKVACLALGATKKVVCGLASSGPSQNLVSTFANKVIVEENLVNVAEIDVPFWSYWLSSAGFTSKDAFVKFAEAVKPKVAALSTSDITNLTVAFKRANYYDKDLFTGIEANVSANFTKFETEQLLQIVATFDAFNHSSVAFLDDVADSITYCNHYLAPVRAGADELATLLTYYAKNGHERADLLATVARGFSEVSLGKLSAAQRKDTVLSALKAFQTFGFYPESIEAVIGAALVSPAEYSAEELKEVEAVKVAAENALGGEFVLIQEGAHGH 325 T 0.042 FAST_1 pdbpercent F Eukaryota T 6rdz 7 G 6 D7P897_9CHLO Mitochondrial ATP synthase subunit ASA6 MMLRTLTRSSAVAGQAVRLFKTSAAAAEGNSVAGIIKSVNETSGANLLSSLKTIKAQAAPIYPAAASSTGYSTQAKIALFGALSWILYRADGQSKAHEWIVDLNLNVLQAAWLISFSSLIPFRAVYFAFRGMAPATASTLNGLKTFSSISL 151 T 0.00025 NrsF pdbpercent F Eukaryota T 6rdz 8 H 7 D8V7I2_9CHLO Mitochondrial ATP synthase associated protein ASA7 MSSVRAGVEAGRRDLTTFTFSGLQDAPVAALSGSIKLNVAAKAGKAEVTVAAGAAKAATQVSAAALRKLSGSKISLAEVARISVLHSSIQNYLLSLSNERYQLLSQWPDFTTMYGKDFYYRAHPEDLKKFYDAADEYYKLYETVTEFDSLSALASQVVPNYAARRRSTVHPAIGSTVADGAFTNFLLSKQ 190 T 0.96 DUF4296 pdbhh F Eukaryota T 6rdz 9 I 8 D8V7I7_9CHLO Mitochondrial ATP synthase subunit ASA8 MVLGEVYLKDILRTPPTGAIPANVPHPFQTSFYTYATKKLIPRHWYLLGGFTFTITLYGILDGLRDSGKKKAYDEAIHAGKTPYTAGGH 89 T 0.3 Selenoprotein_S pdbhh F Eukaryota T 6rdz 10 J 9 A0A5H1ZR73_9CHLO ASA-9: Polytomella F-ATP synthase associated subunit 9 MAVTSFLGKAFEKYFYDFSAYEQFGLNRFLSSKGQYVALRHVGFVMVGVNVLLAANFPFNPPFPTIGMCPAGWEGTWVCQADKAKALEMYKEWKKSN 97 T 0.39 1-cysPrx_C pdbhh F Eukaryota T 6re0 1 A 0 A0A5H1ZR95_9CHLO ASA-10: Polytomella F-ATP synthase associated subunit 10 MSYSAYFAKAGFQFPAGLSALVAGIVALNVCTGRPTKGTKEISNAEYNATPIGYLQSPDQHPTAFPKVPGMKDVHGSPHHHH 82 T 0.1 DUF6506 pdb F Eukaryota T 6re0 2 B 1 Q85JD5_9CHLO ATP synthase associated protein ASA1 MMRAAQKAKQELPATVLTQTRSYLAPLRSDFTEEITAPKVASASNLVNEWNNKKQATENLMKLLQAYKDIGDAKSEPLLKNHNPRTFEDRDYPVPDFRTQNLKAGDVPKFFDTVISTRASAAIASKDKFWAGRKTEAEAASAKASAAFPRVAVPEWKKGKTVSIENLNTVTDKYAAALVPKRKLALPVLPEGVKKAVEDFAASVGQAKNASEVSELLAKSLAEKAVVTEGGKVVEGFSYVSKAVAAKVIATRRAEVHERLLKLWAKRLLVSPELAIVPLNEFDAQLASKFEGISPKYQELLSAVAQGNKTFAQRLNSSPAFSSFLLKREKAESEVPPSELELEAAQKAAELEDPEVALRTLLGPQMEALGASDLLLSEQIRVITEHRYTPDRLQYKEGMKLADKIAAQEAALKEELKVIYGDNVDVKHFQASPRTPVQQLFDSLKNAAANKERAAKEAAAAASPYLAYAVTKKQEVQADPSNIPFDEVLYPQLSEELLELELSDIREDEIALEKAEEEELWLLTLTQQFKHIQKHFGIDLPHSVVAHMDPLLIKKIDWETTNALEDFDITLDDMGAEDAKEQWGAENLSHHFLPLIRYRRDLARKNGDRYGPDLVNGN 618 T 0.21 Lipoprotein_10 pdb F Eukaryota T 6re0 4 D 3 K0J903_9CHLO ASA-3: Polytomella F-ATP synthase associated subunit 3 MRQASRLALSIRQAGNVEAASAVPAMTRQFSAPGSHEHHETPLSKVMPTVVSIPRKVACLALGATKKVVCGLASSGPSQNLVSTFANKVIVEENLVNVAEIDVPFWSYWLSSAGFTSKDAFVKFAEAVKPKVAALSTSDITNLTVAFKRANYYDKDLFTGIEANVSANFTKFETEQLLQIVATFDAFNHSSVAFLDDVADSITYCNHYLAPVRAGADELATLLTYYAKNGHERADLLATVARGFSEVSLGKLSAAQRKDTVLSALKAFQTFGFYPESIEAVIGAALVSPAEYSAEELKEVEAVKVAAENALGGEFVLIQEGAHGH 325 T 0.042 FAST_1 pdbpercent F Eukaryota T 6re0 7 G 6 D7P897_9CHLO Mitochondrial ATP synthase subunit ASA6 MMLRTLTRSSAVAGQAVRLFKTSAAAAEGNSVAGIIKSVNETSGANLLSSLKTIKAQAAPIYPAAASSTGYSTQAKIALFGALSWILYRADGQSKAHEWIVDLNLNVLQAAWLISFSSLIPFRAVYFAFRGMAPATASTLNGLKTFSSISL 151 T 0.00025 NrsF pdbpercent F Eukaryota T 6re0 8 H 7 D8V7I2_9CHLO Mitochondrial ATP synthase associated protein ASA7 MSSVRAGVEAGRRDLTTFTFSGLQDAPVAALSGSIKLNVAAKAGKAEVTVAAGAAKAATQVSAAALRKLSGSKISLAEVARISVLHSSIQNYLLSLSNERYQLLSQWPDFTTMYGKDFYYRAHPEDLKKFYDAADEYYKLYETVTEFDSLSALASQVVPNYAARRRSTVHPAIGSTVADGAFTNFLLSKQ 190 T 0.96 DUF4296 pdbhh F Eukaryota T 6re0 9 I 8 D8V7I7_9CHLO Mitochondrial ATP synthase subunit ASA8 MVLGEVYLKDILRTPPTGAIPANVPHPFQTSFYTYATKKLIPRHWYLLGGFTFTITLYGILDGLRDSGKKKAYDEAIHAGKTPYTAGGH 89 T 0.3 Selenoprotein_S pdbhh F Eukaryota T 6re0 10 J 9 A0A5H1ZR73_9CHLO ASA-9: Polytomella F-ATP synthase associated subunit 9 MAVTSFLGKAFEKYFYDFSAYEQFGLNRFLSSKGQYVALRHVGFVMVGVNVLLAANFPFNPPFPTIGMCPAGWEGTWVCQADKAKALEMYKEWKKSN 97 T 0.39 1-cysPrx_C pdbhh F Eukaryota T 6re2 1 A 0 A0A5H1ZR95_9CHLO ASA-10: Polytomella F-ATP synthase associated subunit 10 MSYSAYFAKAGFQFPAGLSALVAGIVALNVCTGRPTKGTKEISNAEYNATPIGYLQSPDQHPTAFPKVPGMKDVHGSPHHHH 82 T 0.1 DUF6506 pdb F Eukaryota T 6re2 2 B 1 Q85JD5_9CHLO ATP synthase associated protein ASA1 MMRAAQKAKQELPATVLTQTRSYLAPLRSDFTEEITAPKVASASNLVNEWNNKKQATENLMKLLQAYKDIGDAKSEPLLKNHNPRTFEDRDYPVPDFRTQNLKAGDVPKFFDTVISTRASAAIASKDKFWAGRKTEAEAASAKASAAFPRVAVPEWKKGKTVSIENLNTVTDKYAAALVPKRKLALPVLPEGVKKAVEDFAASVGQAKNASEVSELLAKSLAEKAVVTEGGKVVEGFSYVSKAVAAKVIATRRAEVHERLLKLWAKRLLVSPELAIVPLNEFDAQLASKFEGISPKYQELLSAVAQGNKTFAQRLNSSPAFSSFLLKREKAESEVPPSELELEAAQKAAELEDPEVALRTLLGPQMEALGASDLLLSEQIRVITEHRYTPDRLQYKEGMKLADKIAAQEAALKEELKVIYGDNVDVKHFQASPRTPVQQLFDSLKNAAANKERAAKEAAAAASPYLAYAVTKKQEVQADPSNIPFDEVLYPQLSEELLELELSDIREDEIALEKAEEEELWLLTLTQQFKHIQKHFGIDLPHSVVAHMDPLLIKKIDWETTNALEDFDITLDDMGAEDAKEQWGAENLSHHFLPLIRYRRDLARKNGDRYGPDLVNGN 618 T 0.21 Lipoprotein_10 pdb F Eukaryota T 6re2 4 D 3 K0J903_9CHLO Mitochondrial F1F0 ATP synthase associated 32 kDa protein MRQASRLALSIRQAGNVEAASAVPAMTRQFSAPGSHEHHETPLSKVMPTVVSIPRKVACLALGATKKVVCGLASSGPSQNLVSTFANKVIVEENLVNVAEIDVPFWSYWLSSAGFTSKDAFVKFAEAVKPKVAALSTSDITNLTVAFKRANYYDKDLFTGIEANVSANFTKFETEQLLQIVATFDAFNHSSVAFLDDVADSITYCNHYLAPVRAGADELATLLTYYAKNGHERADLLATVARGFSEVSLGKLSAAQRKDTVLSALKAFQTFGFYPESIEAVIGAALVSPAEYSAEELKEVEAVKVAAENALGGEFVLIQEGAHGH 325 T 0.042 FAST_1 pdbpercent F Eukaryota T 6re2 7 G 6 D7P897_9CHLO Mitochondrial ATP synthase subunit ASA6 MMLRTLTRSSAVAGQAVRLFKTSAAAAEGNSVAGIIKSVNETSGANLLSSLKTIKAQAAPIYPAAASSTGYSTQAKIALFGALSWILYRADGQSKAHEWIVDLNLNVLQAAWLISFSSLIPFRAVYFAFRGMAPATASTLNGLKTFSSISL 151 T 0.00025 NrsF pdbpercent F Eukaryota T 6re2 8 H 7 D8V7I2_9CHLO Mitochondrial ATP synthase associated protein ASA7 MSSVRAGVEAGRRDLTTFTFSGLQDAPVAALSGSIKLNVAAKAGKAEVTVAAGAAKAATQVSAAALRKLSGSKISLAEVARISVLHSSIQNYLLSLSNERYQLLSQWPDFTTMYGKDFYYRAHPEDLKKFYDAADEYYKLYETVTEFDSLSALASQVVPNYAARRRSTVHPAIGSTVADGAFTNFLLSKQ 190 T 0.96 DUF4296 pdbhh F Eukaryota T 6re2 9 I 8 D8V7I7_9CHLO Mitochondrial ATP synthase subunit ASA8 MVLGEVYLKDILRTPPTGAIPANVPHPFQTSFYTYATKKLIPRHWYLLGGFTFTITLYGILDGLRDSGKKKAYDEAIHAGKTPYTAGGH 89 T 0.3 Selenoprotein_S pdbhh F Eukaryota T 6re2 10 J 9 A0A5H1ZR73_9CHLO ASA-9: Polytomella F-ATP synthase associated subunit 9 MAVTSFLGKAFEKYFYDFSAYEQFGLNRFLSSKGQYVALRHVGFVMVGVNVLLAANFPFNPPFPTIGMCPAGWEGTWVCQADKAKALEMYKEWKKSN 97 T 0.39 1-cysPrx_C pdbhh F Eukaryota T 6re3 1 A 0 A0A5H1ZR95_9CHLO ASA-10: Polytomella F-ATP synthase associated subunit 10 MSYSAYFAKAGFQFPAGLSALVAGIVALNVCTGRPTKGTKEISNAEYNATPIGYLQSPDQHPTAFPKVPGMKDVHGSPHHHH 82 T 0.1 DUF6506 pdb F Eukaryota T 6re3 2 B 1 Q85JD5_9CHLO ATP synthase associated protein ASA1 MMRAAQKAKQELPATVLTQTRSYLAPLRSDFTEEITAPKVASASNLVNEWNNKKQATENLMKLLQAYKDIGDAKSEPLLKNHNPRTFEDRDYPVPDFRTQNLKAGDVPKFFDTVISTRASAAIASKDKFWAGRKTEAEAASAKASAAFPRVAVPEWKKGKTVSIENLNTVTDKYAAALVPKRKLALPVLPEGVKKAVEDFAASVGQAKNASEVSELLAKSLAEKAVVTEGGKVVEGFSYVSKAVAAKVIATRRAEVHERLLKLWAKRLLVSPELAIVPLNEFDAQLASKFEGISPKYQELLSAVAQGNKTFAQRLNSSPAFSSFLLKREKAESEVPPSELELEAAQKAAELEDPEVALRTLLGPQMEALGASDLLLSEQIRVITEHRYTPDRLQYKEGMKLADKIAAQEAALKEELKVIYGDNVDVKHFQASPRTPVQQLFDSLKNAAANKERAAKEAAAAASPYLAYAVTKKQEVQADPSNIPFDEVLYPQLSEELLELELSDIREDEIALEKAEEEELWLLTLTQQFKHIQKHFGIDLPHSVVAHMDPLLIKKIDWETTNALEDFDITLDDMGAEDAKEQWGAENLSHHFLPLIRYRRDLARKNGDRYGPDLVNGN 618 T 0.21 Lipoprotein_10 pdb F Eukaryota T 6re3 4 D 3 K0J903_9CHLO Mitochondrial F1F0 ATP synthase associated 32 kDa protein MRQASRLALSIRQAGNVEAASAVPAMTRQFSAPGSHEHHETPLSKVMPTVVSIPRKVACLALGATKKVVCGLASSGPSQNLVSTFANKVIVEENLVNVAEIDVPFWSYWLSSAGFTSKDAFVKFAEAVKPKVAALSTSDITNLTVAFKRANYYDKDLFTGIEANVSANFTKFETEQLLQIVATFDAFNHSSVAFLDDVADSITYCNHYLAPVRAGADELATLLTYYAKNGHERADLLATVARGFSEVSLGKLSAAQRKDTVLSALKAFQTFGFYPESIEAVIGAALVSPAEYSAEELKEVEAVKVAAENALGGEFVLIQEGAHGH 325 T 0.042 FAST_1 pdbpercent F Eukaryota T 6re3 7 G 6 D7P897_9CHLO Mitochondrial ATP synthase subunit ASA6 MMLRTLTRSSAVAGQAVRLFKTSAAAAEGNSVAGIIKSVNETSGANLLSSLKTIKAQAAPIYPAAASSTGYSTQAKIALFGALSWILYRADGQSKAHEWIVDLNLNVLQAAWLISFSSLIPFRAVYFAFRGMAPATASTLNGLKTFSSISL 151 T 0.00025 NrsF pdbpercent F Eukaryota T 6re3 8 H 7 D8V7I2_9CHLO Mitochondrial ATP synthase associated protein ASA7 MSSVRAGVEAGRRDLTTFTFSGLQDAPVAALSGSIKLNVAAKAGKAEVTVAAGAAKAATQVSAAALRKLSGSKISLAEVARISVLHSSIQNYLLSLSNERYQLLSQWPDFTTMYGKDFYYRAHPEDLKKFYDAADEYYKLYETVTEFDSLSALASQVVPNYAARRRSTVHPAIGSTVADGAFTNFLLSKQ 190 T 0.96 DUF4296 pdbhh F Eukaryota T 6re3 9 I 8 D8V7I7_9CHLO Mitochondrial ATP synthase subunit ASA8 MVLGEVYLKDILRTPPTGAIPANVPHPFQTSFYTYATKKLIPRHWYLLGGFTFTITLYGILDGLRDSGKKKAYDEAIHAGKTPYTAGGH 89 T 0.3 Selenoprotein_S pdbhh F Eukaryota T 6re3 10 J 9 A0A5H1ZR73_9CHLO ASA-9: Polytomella F-ATP synthase associated subunit 9 MAVTSFLGKAFEKYFYDFSAYEQFGLNRFLSSKGQYVALRHVGFVMVGVNVLLAANFPFNPPFPTIGMCPAGWEGTWVCQADKAKALEMYKEWKKSN 97 T 0.39 1-cysPrx_C pdbhh F Eukaryota T 6re5 1 A 0 A0A5H1ZR95_9CHLO ASA-10: Polytomella F-ATP synthase associated subunit 10 MSYSAYFAKAGFQFPAGLSALVAGIVALNVCTGRPTKGTKEISNAEYNATPIGYLQSPDQHPTAFPKVPGMKDVHGSPHHHH 82 T 0.1 DUF6506 pdb F Eukaryota T 6re5 2 B 1 Q85JD5_9CHLO ATP synthase associated protein ASA1 MMRAAQKAKQELPATVLTQTRSYLAPLRSDFTEEITAPKVASASNLVNEWNNKKQATENLMKLLQAYKDIGDAKSEPLLKNHNPRTFEDRDYPVPDFRTQNLKAGDVPKFFDTVISTRASAAIASKDKFWAGRKTEAEAASAKASAAFPRVAVPEWKKGKTVSIENLNTVTDKYAAALVPKRKLALPVLPEGVKKAVEDFAASVGQAKNASEVSELLAKSLAEKAVVTEGGKVVEGFSYVSKAVAAKVIATRRAEVHERLLKLWAKRLLVSPELAIVPLNEFDAQLASKFEGISPKYQELLSAVAQGNKTFAQRLNSSPAFSSFLLKREKAESEVPPSELELEAAQKAAELEDPEVALRTLLGPQMEALGASDLLLSEQIRVITEHRYTPDRLQYKEGMKLADKIAAQEAALKEELKVIYGDNVDVKHFQASPRTPVQQLFDSLKNAAANKERAAKEAAAAASPYLAYAVTKKQEVQADPSNIPFDEVLYPQLSEELLELELSDIREDEIALEKAEEEELWLLTLTQQFKHIQKHFGIDLPHSVVAHMDPLLIKKIDWETTNALEDFDITLDDMGAEDAKEQWGAENLSHHFLPLIRYRRDLARKNGDRYGPDLVNGN 618 T 0.21 Lipoprotein_10 pdb F Eukaryota T 6re5 4 D 3 K0J903_9CHLO Mitochondrial F1F0 ATP synthase associated 32 kDa protein MRQASRLALSIRQAGNVEAASAVPAMTRQFSAPGSHEHHETPLSKVMPTVVSIPRKVACLALGATKKVVCGLASSGPSQNLVSTFANKVIVEENLVNVAEIDVPFWSYWLSSAGFTSKDAFVKFAEAVKPKVAALSTSDITNLTVAFKRANYYDKDLFTGIEANVSANFTKFETEQLLQIVATFDAFNHSSVAFLDDVADSITYCNHYLAPVRAGADELATLLTYYAKNGHERADLLATVARGFSEVSLGKLSAAQRKDTVLSALKAFQTFGFYPESIEAVIGAALVSPAEYSAEELKEVEAVKVAAENALGGEFVLIQEGAHGH 325 T 0.042 FAST_1 pdbpercent F Eukaryota T 6re5 7 G 6 D7P897_9CHLO Mitochondrial ATP synthase subunit ASA6 MMLRTLTRSSAVAGQAVRLFKTSAAAAEGNSVAGIIKSVNETSGANLLSSLKTIKAQAAPIYPAAASSTGYSTQAKIALFGALSWILYRADGQSKAHEWIVDLNLNVLQAAWLISFSSLIPFRAVYFAFRGMAPATASTLNGLKTFSSISL 151 T 0.00025 NrsF pdbpercent F Eukaryota T 6re5 8 H 7 D8V7I2_9CHLO Mitochondrial ATP synthase associated protein ASA7 MSSVRAGVEAGRRDLTTFTFSGLQDAPVAALSGSIKLNVAAKAGKAEVTVAAGAAKAATQVSAAALRKLSGSKISLAEVARISVLHSSIQNYLLSLSNERYQLLSQWPDFTTMYGKDFYYRAHPEDLKKFYDAADEYYKLYETVTEFDSLSALASQVVPNYAARRRSTVHPAIGSTVADGAFTNFLLSKQ 190 T 0.96 DUF4296 pdbhh F Eukaryota T 6re5 9 I 8 D8V7I7_9CHLO Mitochondrial ATP synthase subunit ASA8 MVLGEVYLKDILRTPPTGAIPANVPHPFQTSFYTYATKKLIPRHWYLLGGFTFTITLYGILDGLRDSGKKKAYDEAIHAGKTPYTAGGH 89 T 0.3 Selenoprotein_S pdbhh F Eukaryota T 6re5 10 J 9 A0A5H1ZR73_9CHLO ASA-9: Polytomella F-ATP synthase associated subunit 9 MAVTSFLGKAFEKYFYDFSAYEQFGLNRFLSSKGQYVALRHVGFVMVGVNVLLAANFPFNPPFPTIGMCPAGWEGTWVCQADKAKALEMYKEWKKSN 97 T 0.39 1-cysPrx_C pdbhh F Eukaryota T 6re6 1 A 0 A0A5H1ZR95_9CHLO ASA-10: Polytomella F-ATP synthase associated subunit 10 MSYSAYFAKAGFQFPAGLSALVAGIVALNVCTGRPTKGTKEISNAEYNATPIGYLQSPDQHPTAFPKVPGMKDVHGSPHHHH 82 T 0.1 DUF6506 pdb F Eukaryota T 6re6 2 B 1 Q85JD5_9CHLO ATP synthase associated protein ASA1 MMRAAQKAKQELPATVLTQTRSYLAPLRSDFTEEITAPKVASASNLVNEWNNKKQATENLMKLLQAYKDIGDAKSEPLLKNHNPRTFEDRDYPVPDFRTQNLKAGDVPKFFDTVISTRASAAIASKDKFWAGRKTEAEAASAKASAAFPRVAVPEWKKGKTVSIENLNTVTDKYAAALVPKRKLALPVLPEGVKKAVEDFAASVGQAKNASEVSELLAKSLAEKAVVTEGGKVVEGFSYVSKAVAAKVIATRRAEVHERLLKLWAKRLLVSPELAIVPLNEFDAQLASKFEGISPKYQELLSAVAQGNKTFAQRLNSSPAFSSFLLKREKAESEVPPSELELEAAQKAAELEDPEVALRTLLGPQMEALGASDLLLSEQIRVITEHRYTPDRLQYKEGMKLADKIAAQEAALKEELKVIYGDNVDVKHFQASPRTPVQQLFDSLKNAAANKERAAKEAAAAASPYLAYAVTKKQEVQADPSNIPFDEVLYPQLSEELLELELSDIREDEIALEKAEEEELWLLTLTQQFKHIQKHFGIDLPHSVVAHMDPLLIKKIDWETTNALEDFDITLDDMGAEDAKEQWGAENLSHHFLPLIRYRRDLARKNGDRYGPDLVNGN 618 T 0.21 Lipoprotein_10 pdb F Eukaryota T 6re6 4 D 3 K0J903_9CHLO Mitochondrial F1F0 ATP synthase associated 32 kDa protein MRQASRLALSIRQAGNVEAASAVPAMTRQFSAPGSHEHHETPLSKVMPTVVSIPRKVACLALGATKKVVCGLASSGPSQNLVSTFANKVIVEENLVNVAEIDVPFWSYWLSSAGFTSKDAFVKFAEAVKPKVAALSTSDITNLTVAFKRANYYDKDLFTGIEANVSANFTKFETEQLLQIVATFDAFNHSSVAFLDDVADSITYCNHYLAPVRAGADELATLLTYYAKNGHERADLLATVARGFSEVSLGKLSAAQRKDTVLSALKAFQTFGFYPESIEAVIGAALVSPAEYSAEELKEVEAVKVAAENALGGEFVLIQEGAHGH 325 T 0.042 FAST_1 pdbpercent F Eukaryota T 6re6 7 G 6 D7P897_9CHLO Mitochondrial ATP synthase subunit ASA6 MMLRTLTRSSAVAGQAVRLFKTSAAAAEGNSVAGIIKSVNETSGANLLSSLKTIKAQAAPIYPAAASSTGYSTQAKIALFGALSWILYRADGQSKAHEWIVDLNLNVLQAAWLISFSSLIPFRAVYFAFRGMAPATASTLNGLKTFSSISL 151 T 0.00025 NrsF pdbpercent F Eukaryota T 6re6 8 H 7 D8V7I2_9CHLO Mitochondrial ATP synthase associated protein ASA7 MSSVRAGVEAGRRDLTTFTFSGLQDAPVAALSGSIKLNVAAKAGKAEVTVAAGAAKAATQVSAAALRKLSGSKISLAEVARISVLHSSIQNYLLSLSNERYQLLSQWPDFTTMYGKDFYYRAHPEDLKKFYDAADEYYKLYETVTEFDSLSALASQVVPNYAARRRSTVHPAIGSTVADGAFTNFLLSKQ 190 T 0.96 DUF4296 pdbhh F Eukaryota T 6re6 9 I 8 D8V7I7_9CHLO Mitochondrial ATP synthase subunit ASA8 MVLGEVYLKDILRTPPTGAIPANVPHPFQTSFYTYATKKLIPRHWYLLGGFTFTITLYGILDGLRDSGKKKAYDEAIHAGKTPYTAGGH 89 T 0.3 Selenoprotein_S pdbhh F Eukaryota T 6re6 10 J 9 A0A5H1ZR73_9CHLO ASA-9: Polytomella F-ATP synthase associated subunit 9 MAVTSFLGKAFEKYFYDFSAYEQFGLNRFLSSKGQYVALRHVGFVMVGVNVLLAANFPFNPPFPTIGMCPAGWEGTWVCQADKAKALEMYKEWKKSN 97 T 0.39 1-cysPrx_C pdbhh F Eukaryota T 6re8 1 A 0 A0A5H1ZR95_9CHLO ASA-10: Polytomella F-ATP synthase associated subunit 10 MSYSAYFAKAGFQFPAGLSALVAGIVALNVCTGRPTKGTKEISNAEYNATPIGYLQSPDQHPTAFPKVPGMKDVHGSPHHHH 82 T 0.1 DUF6506 pdb F Eukaryota T 6re8 2 B 1 Q85JD5_9CHLO ATP synthase associated protein ASA1 MMRAAQKAKQELPATVLTQTRSYLAPLRSDFTEEITAPKVASASNLVNEWNNKKQATENLMKLLQAYKDIGDAKSEPLLKNHNPRTFEDRDYPVPDFRTQNLKAGDVPKFFDTVISTRASAAIASKDKFWAGRKTEAEAASAKASAAFPRVAVPEWKKGKTVSIENLNTVTDKYAAALVPKRKLALPVLPEGVKKAVEDFAASVGQAKNASEVSELLAKSLAEKAVVTEGGKVVEGFSYVSKAVAAKVIATRRAEVHERLLKLWAKRLLVSPELAIVPLNEFDAQLASKFEGISPKYQELLSAVAQGNKTFAQRLNSSPAFSSFLLKREKAESEVPPSELELEAAQKAAELEDPEVALRTLLGPQMEALGASDLLLSEQIRVITEHRYTPDRLQYKEGMKLADKIAAQEAALKEELKVIYGDNVDVKHFQASPRTPVQQLFDSLKNAAANKERAAKEAAAAASPYLAYAVTKKQEVQADPSNIPFDEVLYPQLSEELLELELSDIREDEIALEKAEEEELWLLTLTQQFKHIQKHFGIDLPHSVVAHMDPLLIKKIDWETTNALEDFDITLDDMGAEDAKEQWGAENLSHHFLPLIRYRRDLARKNGDRYGPDLVNGN 618 T 0.21 Lipoprotein_10 pdb F Eukaryota T 6re8 4 D 3 K0J903_9CHLO Mitochondrial F1F0 ATP synthase associated 32 kDa protein MRQASRLALSIRQAGNVEAASAVPAMTRQFSAPGSHEHHETPLSKVMPTVVSIPRKVACLALGATKKVVCGLASSGPSQNLVSTFANKVIVEENLVNVAEIDVPFWSYWLSSAGFTSKDAFVKFAEAVKPKVAALSTSDITNLTVAFKRANYYDKDLFTGIEANVSANFTKFETEQLLQIVATFDAFNHSSVAFLDDVADSITYCNHYLAPVRAGADELATLLTYYAKNGHERADLLATVARGFSEVSLGKLSAAQRKDTVLSALKAFQTFGFYPESIEAVIGAALVSPAEYSAEELKEVEAVKVAAENALGGEFVLIQEGAHGH 325 T 0.042 FAST_1 pdbpercent F Eukaryota T 6re8 7 G 6 D7P897_9CHLO Mitochondrial ATP synthase subunit ASA6 MMLRTLTRSSAVAGQAVRLFKTSAAAAEGNSVAGIIKSVNETSGANLLSSLKTIKAQAAPIYPAAASSTGYSTQAKIALFGALSWILYRADGQSKAHEWIVDLNLNVLQAAWLISFSSLIPFRAVYFAFRGMAPATASTLNGLKTFSSISL 151 T 0.00025 NrsF pdbpercent F Eukaryota T 6re8 8 H 7 D8V7I2_9CHLO Mitochondrial ATP synthase associated protein ASA7 MSSVRAGVEAGRRDLTTFTFSGLQDAPVAALSGSIKLNVAAKAGKAEVTVAAGAAKAATQVSAAALRKLSGSKISLAEVARISVLHSSIQNYLLSLSNERYQLLSQWPDFTTMYGKDFYYRAHPEDLKKFYDAADEYYKLYETVTEFDSLSALASQVVPNYAARRRSTVHPAIGSTVADGAFTNFLLSKQ 190 T 0.96 DUF4296 pdbhh F Eukaryota T 6re8 9 I 8 D8V7I7_9CHLO Mitochondrial ATP synthase subunit ASA8 MVLGEVYLKDILRTPPTGAIPANVPHPFQTSFYTYATKKLIPRHWYLLGGFTFTITLYGILDGLRDSGKKKAYDEAIHAGKTPYTAGGH 89 T 0.3 Selenoprotein_S pdbhh F Eukaryota T 6re8 10 J 9 A0A5H1ZR73_9CHLO ASA-9: Polytomella F-ATP synthase associated subunit 9 MAVTSFLGKAFEKYFYDFSAYEQFGLNRFLSSKGQYVALRHVGFVMVGVNVLLAANFPFNPPFPTIGMCPAGWEGTWVCQADKAKALEMYKEWKKSN 97 T 0.39 1-cysPrx_C pdbhh F Eukaryota T 6re9 1 A 0 A0A5H1ZR95_9CHLO ASA-10: Polytomella F-ATP synthase associated subunit 10 MSYSAYFAKAGFQFPAGLSALVAGIVALNVCTGRPTKGTKEISNAEYNATPIGYLQSPDQHPTAFPKVPGMKDVHGSPHHHH 82 T 0.1 DUF6506 pdb F Eukaryota T 6re9 2 B 1 Q85JD5_9CHLO ATP synthase associated protein ASA1 MMRAAQKAKQELPATVLTQTRSYLAPLRSDFTEEITAPKVASASNLVNEWNNKKQATENLMKLLQAYKDIGDAKSEPLLKNHNPRTFEDRDYPVPDFRTQNLKAGDVPKFFDTVISTRASAAIASKDKFWAGRKTEAEAASAKASAAFPRVAVPEWKKGKTVSIENLNTVTDKYAAALVPKRKLALPVLPEGVKKAVEDFAASVGQAKNASEVSELLAKSLAEKAVVTEGGKVVEGFSYVSKAVAAKVIATRRAEVHERLLKLWAKRLLVSPELAIVPLNEFDAQLASKFEGISPKYQELLSAVAQGNKTFAQRLNSSPAFSSFLLKREKAESEVPPSELELEAAQKAAELEDPEVALRTLLGPQMEALGASDLLLSEQIRVITEHRYTPDRLQYKEGMKLADKIAAQEAALKEELKVIYGDNVDVKHFQASPRTPVQQLFDSLKNAAANKERAAKEAAAAASPYLAYAVTKKQEVQADPSNIPFDEVLYPQLSEELLELELSDIREDEIALEKAEEEELWLLTLTQQFKHIQKHFGIDLPHSVVAHMDPLLIKKIDWETTNALEDFDITLDDMGAEDAKEQWGAENLSHHFLPLIRYRRDLARKNGDRYGPDLVNGN 618 T 0.21 Lipoprotein_10 pdb F Eukaryota T 6re9 4 D 3 K0J903_9CHLO Mitochondrial F1F0 ATP synthase associated 32 kDa protein MRQASRLALSIRQAGNVEAASAVPAMTRQFSAPGSHEHHETPLSKVMPTVVSIPRKVACLALGATKKVVCGLASSGPSQNLVSTFANKVIVEENLVNVAEIDVPFWSYWLSSAGFTSKDAFVKFAEAVKPKVAALSTSDITNLTVAFKRANYYDKDLFTGIEANVSANFTKFETEQLLQIVATFDAFNHSSVAFLDDVADSITYCNHYLAPVRAGADELATLLTYYAKNGHERADLLATVARGFSEVSLGKLSAAQRKDTVLSALKAFQTFGFYPESIEAVIGAALVSPAEYSAEELKEVEAVKVAAENALGGEFVLIQEGAHGH 325 T 0.042 FAST_1 pdbpercent F Eukaryota T 6re9 7 G 6 D7P897_9CHLO Mitochondrial ATP synthase subunit ASA6 MMLRTLTRSSAVAGQAVRLFKTSAAAAEGNSVAGIIKSVNETSGANLLSSLKTIKAQAAPIYPAAASSTGYSTQAKIALFGALSWILYRADGQSKAHEWIVDLNLNVLQAAWLISFSSLIPFRAVYFAFRGMAPATASTLNGLKTFSSISL 151 T 0.00025 NrsF pdbpercent F Eukaryota T 6re9 8 H 7 D8V7I2_9CHLO Mitochondrial ATP synthase associated protein ASA7 MSSVRAGVEAGRRDLTTFTFSGLQDAPVAALSGSIKLNVAAKAGKAEVTVAAGAAKAATQVSAAALRKLSGSKISLAEVARISVLHSSIQNYLLSLSNERYQLLSQWPDFTTMYGKDFYYRAHPEDLKKFYDAADEYYKLYETVTEFDSLSALASQVVPNYAARRRSTVHPAIGSTVADGAFTNFLLSKQ 190 T 0.96 DUF4296 pdbhh F Eukaryota T 6re9 9 I 8 D8V7I7_9CHLO Mitochondrial ATP synthase subunit ASA8 MVLGEVYLKDILRTPPTGAIPANVPHPFQTSFYTYATKKLIPRHWYLLGGFTFTITLYGILDGLRDSGKKKAYDEAIHAGKTPYTAGGH 89 T 0.3 Selenoprotein_S pdbhh F Eukaryota T 6re9 10 J 9 A0A5H1ZR73_9CHLO ASA-9: Polytomella F-ATP synthase associated subunit 9 MAVTSFLGKAFEKYFYDFSAYEQFGLNRFLSSKGQYVALRHVGFVMVGVNVLLAANFPFNPPFPTIGMCPAGWEGTWVCQADKAKALEMYKEWKKSN 97 T 0.39 1-cysPrx_C pdbhh F Eukaryota T 6reb 1 A 0 A0A5H1ZR95_9CHLO ASA-10: Polytomella F-ATP synthase associated subunit 10 MSYSAYFAKAGFQFPAGLSALVAGIVALNVCTGRPTKGTKEISNAEYNATPIGYLQSPDQHPTAFPKVPGMKDVHGSPHHHH 82 T 0.1 DUF6506 pdb F Eukaryota T 6reb 2 B 1 Q85JD5_9CHLO ATP synthase associated protein ASA1 MMRAAQKAKQELPATVLTQTRSYLAPLRSDFTEEITAPKVASASNLVNEWNNKKQATENLMKLLQAYKDIGDAKSEPLLKNHNPRTFEDRDYPVPDFRTQNLKAGDVPKFFDTVISTRASAAIASKDKFWAGRKTEAEAASAKASAAFPRVAVPEWKKGKTVSIENLNTVTDKYAAALVPKRKLALPVLPEGVKKAVEDFAASVGQAKNASEVSELLAKSLAEKAVVTEGGKVVEGFSYVSKAVAAKVIATRRAEVHERLLKLWAKRLLVSPELAIVPLNEFDAQLASKFEGISPKYQELLSAVAQGNKTFAQRLNSSPAFSSFLLKREKAESEVPPSELELEAAQKAAELEDPEVALRTLLGPQMEALGASDLLLSEQIRVITEHRYTPDRLQYKEGMKLADKIAAQEAALKEELKVIYGDNVDVKHFQASPRTPVQQLFDSLKNAAANKERAAKEAAAAASPYLAYAVTKKQEVQADPSNIPFDEVLYPQLSEELLELELSDIREDEIALEKAEEEELWLLTLTQQFKHIQKHFGIDLPHSVVAHMDPLLIKKIDWETTNALEDFDITLDDMGAEDAKEQWGAENLSHHFLPLIRYRRDLARKNGDRYGPDLVNGN 618 T 0.21 Lipoprotein_10 pdb F Eukaryota T 6reb 4 D 3 K0J903_9CHLO Mitochondrial F1F0 ATP synthase associated 32 kDa protein MRQASRLALSIRQAGNVEAASAVPAMTRQFSAPGSHEHHETPLSKVMPTVVSIPRKVACLALGATKKVVCGLASSGPSQNLVSTFANKVIVEENLVNVAEIDVPFWSYWLSSAGFTSKDAFVKFAEAVKPKVAALSTSDITNLTVAFKRANYYDKDLFTGIEANVSANFTKFETEQLLQIVATFDAFNHSSVAFLDDVADSITYCNHYLAPVRAGADELATLLTYYAKNGHERADLLATVARGFSEVSLGKLSAAQRKDTVLSALKAFQTFGFYPESIEAVIGAALVSPAEYSAEELKEVEAVKVAAENALGGEFVLIQEGAHGH 325 T 0.042 FAST_1 pdbpercent F Eukaryota T 6reb 7 G 6 D7P897_9CHLO Mitochondrial ATP synthase subunit ASA6 MMLRTLTRSSAVAGQAVRLFKTSAAAAEGNSVAGIIKSVNETSGANLLSSLKTIKAQAAPIYPAAASSTGYSTQAKIALFGALSWILYRADGQSKAHEWIVDLNLNVLQAAWLISFSSLIPFRAVYFAFRGMAPATASTLNGLKTFSSISL 151 T 0.00025 NrsF pdbpercent F Eukaryota T 6reb 8 H 7 D8V7I2_9CHLO Mitochondrial ATP synthase associated protein ASA7 MSSVRAGVEAGRRDLTTFTFSGLQDAPVAALSGSIKLNVAAKAGKAEVTVAAGAAKAATQVSAAALRKLSGSKISLAEVARISVLHSSIQNYLLSLSNERYQLLSQWPDFTTMYGKDFYYRAHPEDLKKFYDAADEYYKLYETVTEFDSLSALASQVVPNYAARRRSTVHPAIGSTVADGAFTNFLLSKQ 190 T 0.96 DUF4296 pdbhh F Eukaryota T 6reb 9 I 8 D8V7I7_9CHLO Mitochondrial ATP synthase subunit ASA8 MVLGEVYLKDILRTPPTGAIPANVPHPFQTSFYTYATKKLIPRHWYLLGGFTFTITLYGILDGLRDSGKKKAYDEAIHAGKTPYTAGGH 89 T 0.3 Selenoprotein_S pdbhh F Eukaryota T 6reb 10 J 9 A0A5H1ZR73_9CHLO ASA-9: Polytomella F-ATP synthase associated subunit 9 MAVTSFLGKAFEKYFYDFSAYEQFGLNRFLSSKGQYVALRHVGFVMVGVNVLLAANFPFNPPFPTIGMCPAGWEGTWVCQADKAKALEMYKEWKKSN 97 T 0.39 1-cysPrx_C pdbhh F Eukaryota T 6rec 1 A 0 A0A5H1ZR95_9CHLO ASA-10: Polytomella F-ATP synthase associated subunit 10 MSYSAYFAKAGFQFPAGLSALVAGIVALNVCTGRPTKGTKEISNAEYNATPIGYLQSPDQHPTAFPKVPGMKDVHGSPHHHH 82 T 0.1 DUF6506 pdb F Eukaryota T 6rec 2 B 1 Q85JD5_9CHLO ATP synthase associated protein ASA1 MMRAAQKAKQELPATVLTQTRSYLAPLRSDFTEEITAPKVASASNLVNEWNNKKQATENLMKLLQAYKDIGDAKSEPLLKNHNPRTFEDRDYPVPDFRTQNLKAGDVPKFFDTVISTRASAAIASKDKFWAGRKTEAEAASAKASAAFPRVAVPEWKKGKTVSIENLNTVTDKYAAALVPKRKLALPVLPEGVKKAVEDFAASVGQAKNASEVSELLAKSLAEKAVVTEGGKVVEGFSYVSKAVAAKVIATRRAEVHERLLKLWAKRLLVSPELAIVPLNEFDAQLASKFEGISPKYQELLSAVAQGNKTFAQRLNSSPAFSSFLLKREKAESEVPPSELELEAAQKAAELEDPEVALRTLLGPQMEALGASDLLLSEQIRVITEHRYTPDRLQYKEGMKLADKIAAQEAALKEELKVIYGDNVDVKHFQASPRTPVQQLFDSLKNAAANKERAAKEAAAAASPYLAYAVTKKQEVQADPSNIPFDEVLYPQLSEELLELELSDIREDEIALEKAEEEELWLLTLTQQFKHIQKHFGIDLPHSVVAHMDPLLIKKIDWETTNALEDFDITLDDMGAEDAKEQWGAENLSHHFLPLIRYRRDLARKNGDRYGPDLVNGN 618 T 0.21 Lipoprotein_10 pdb F Eukaryota T 6rec 4 D 3 K0J903_9CHLO Mitochondrial F1F0 ATP synthase associated 32 kDa protein MRQASRLALSIRQAGNVEAASAVPAMTRQFSAPGSHEHHETPLSKVMPTVVSIPRKVACLALGATKKVVCGLASSGPSQNLVSTFANKVIVEENLVNVAEIDVPFWSYWLSSAGFTSKDAFVKFAEAVKPKVAALSTSDITNLTVAFKRANYYDKDLFTGIEANVSANFTKFETEQLLQIVATFDAFNHSSVAFLDDVADSITYCNHYLAPVRAGADELATLLTYYAKNGHERADLLATVARGFSEVSLGKLSAAQRKDTVLSALKAFQTFGFYPESIEAVIGAALVSPAEYSAEELKEVEAVKVAAENALGGEFVLIQEGAHGH 325 T 0.042 FAST_1 pdbpercent F Eukaryota T 6rec 7 G 6 D7P897_9CHLO Mitochondrial ATP synthase subunit ASA6 MMLRTLTRSSAVAGQAVRLFKTSAAAAEGNSVAGIIKSVNETSGANLLSSLKTIKAQAAPIYPAAASSTGYSTQAKIALFGALSWILYRADGQSKAHEWIVDLNLNVLQAAWLISFSSLIPFRAVYFAFRGMAPATASTLNGLKTFSSISL 151 T 0.00025 NrsF pdbpercent F Eukaryota T 6rec 8 H 7 D8V7I2_9CHLO Mitochondrial ATP synthase associated protein ASA7 MSSVRAGVEAGRRDLTTFTFSGLQDAPVAALSGSIKLNVAAKAGKAEVTVAAGAAKAATQVSAAALRKLSGSKISLAEVARISVLHSSIQNYLLSLSNERYQLLSQWPDFTTMYGKDFYYRAHPEDLKKFYDAADEYYKLYETVTEFDSLSALASQVVPNYAARRRSTVHPAIGSTVADGAFTNFLLSKQ 190 T 0.96 DUF4296 pdbhh F Eukaryota T 6rec 9 I 8 D8V7I7_9CHLO Mitochondrial ATP synthase subunit ASA8 MVLGEVYLKDILRTPPTGAIPANVPHPFQTSFYTYATKKLIPRHWYLLGGFTFTITLYGILDGLRDSGKKKAYDEAIHAGKTPYTAGGH 89 T 0.3 Selenoprotein_S pdbhh F Eukaryota T 6rec 10 J 9 A0A5H1ZR73_9CHLO ASA-9: Polytomella F-ATP synthase associated subunit 9 MAVTSFLGKAFEKYFYDFSAYEQFGLNRFLSSKGQYVALRHVGFVMVGVNVLLAANFPFNPPFPTIGMCPAGWEGTWVCQADKAKALEMYKEWKKSN 97 T 0.39 1-cysPrx_C pdbhh F Eukaryota T 6ree 1 A 0 A0A5H1ZR95_9CHLO ASA-10: Polytomella F-ATP synthase associated subunit 10 MSYSAYFAKAGFQFPAGLSALVAGIVALNVCTGRPTKGTKEISNAEYNATPIGYLQSPDQHPTAFPKVPGMKDVHGSPHHHH 82 T 0.1 DUF6506 pdb F Eukaryota T 6ree 2 B 1 Q85JD5_9CHLO ATP synthase associated protein ASA1 MMRAAQKAKQELPATVLTQTRSYLAPLRSDFTEEITAPKVASASNLVNEWNNKKQATENLMKLLQAYKDIGDAKSEPLLKNHNPRTFEDRDYPVPDFRTQNLKAGDVPKFFDTVISTRASAAIASKDKFWAGRKTEAEAASAKASAAFPRVAVPEWKKGKTVSIENLNTVTDKYAAALVPKRKLALPVLPEGVKKAVEDFAASVGQAKNASEVSELLAKSLAEKAVVTEGGKVVEGFSYVSKAVAAKVIATRRAEVHERLLKLWAKRLLVSPELAIVPLNEFDAQLASKFEGISPKYQELLSAVAQGNKTFAQRLNSSPAFSSFLLKREKAESEVPPSELELEAAQKAAELEDPEVALRTLLGPQMEALGASDLLLSEQIRVITEHRYTPDRLQYKEGMKLADKIAAQEAALKEELKVIYGDNVDVKHFQASPRTPVQQLFDSLKNAAANKERAAKEAAAAASPYLAYAVTKKQEVQADPSNIPFDEVLYPQLSEELLELELSDIREDEIALEKAEEEELWLLTLTQQFKHIQKHFGIDLPHSVVAHMDPLLIKKIDWETTNALEDFDITLDDMGAEDAKEQWGAENLSHHFLPLIRYRRDLARKNGDRYGPDLVNGN 618 T 0.21 Lipoprotein_10 pdb F Eukaryota T 6ree 4 D 3 K0J903_9CHLO Mitochondrial F1F0 ATP synthase associated 32 kDa protein MRQASRLALSIRQAGNVEAASAVPAMTRQFSAPGSHEHHETPLSKVMPTVVSIPRKVACLALGATKKVVCGLASSGPSQNLVSTFANKVIVEENLVNVAEIDVPFWSYWLSSAGFTSKDAFVKFAEAVKPKVAALSTSDITNLTVAFKRANYYDKDLFTGIEANVSANFTKFETEQLLQIVATFDAFNHSSVAFLDDVADSITYCNHYLAPVRAGADELATLLTYYAKNGHERADLLATVARGFSEVSLGKLSAAQRKDTVLSALKAFQTFGFYPESIEAVIGAALVSPAEYSAEELKEVEAVKVAAENALGGEFVLIQEGAHGH 325 T 0.042 FAST_1 pdbpercent F Eukaryota T 6ree 7 G 6 D7P897_9CHLO Mitochondrial ATP synthase subunit ASA6 MMLRTLTRSSAVAGQAVRLFKTSAAAAEGNSVAGIIKSVNETSGANLLSSLKTIKAQAAPIYPAAASSTGYSTQAKIALFGALSWILYRADGQSKAHEWIVDLNLNVLQAAWLISFSSLIPFRAVYFAFRGMAPATASTLNGLKTFSSISL 151 T 0.00025 NrsF pdbpercent F Eukaryota T 6ree 8 H 7 D8V7I2_9CHLO Mitochondrial ATP synthase associated protein ASA7 MSSVRAGVEAGRRDLTTFTFSGLQDAPVAALSGSIKLNVAAKAGKAEVTVAAGAAKAATQVSAAALRKLSGSKISLAEVARISVLHSSIQNYLLSLSNERYQLLSQWPDFTTMYGKDFYYRAHPEDLKKFYDAADEYYKLYETVTEFDSLSALASQVVPNYAARRRSTVHPAIGSTVADGAFTNFLLSKQ 190 T 0.96 DUF4296 pdbhh F Eukaryota T 6ree 9 I 8 D8V7I7_9CHLO Mitochondrial ATP synthase subunit ASA8 MVLGEVYLKDILRTPPTGAIPANVPHPFQTSFYTYATKKLIPRHWYLLGGFTFTITLYGILDGLRDSGKKKAYDEAIHAGKTPYTAGGH 89 T 0.3 Selenoprotein_S pdbhh F Eukaryota T 6ree 10 J 9 A0A5H1ZR73_9CHLO ASA-9: Polytomella F-ATP synthase associated subunit 9 MAVTSFLGKAFEKYFYDFSAYEQFGLNRFLSSKGQYVALRHVGFVMVGVNVLLAANFPFNPPFPTIGMCPAGWEGTWVCQADKAKALEMYKEWKKSN 97 T 0.39 1-cysPrx_C pdbhh F Eukaryota T 6ref 1 A 0 A0A5H1ZR95_9CHLO ASA-10: Polytomella F-ATP synthase associated subunit 10 MSYSAYFAKAGFQFPAGLSALVAGIVALNVCTGRPTKGTKEISNAEYNATPIGYLQSPDQHPTAFPKVPGMKDVHGSPHHHH 82 T 0.1 DUF6506 pdb F Eukaryota T 6ref 2 B 1 Q85JD5_9CHLO ATP synthase associated protein ASA1 MMRAAQKAKQELPATVLTQTRSYLAPLRSDFTEEITAPKVASASNLVNEWNNKKQATENLMKLLQAYKDIGDAKSEPLLKNHNPRTFEDRDYPVPDFRTQNLKAGDVPKFFDTVISTRASAAIASKDKFWAGRKTEAEAASAKASAAFPRVAVPEWKKGKTVSIENLNTVTDKYAAALVPKRKLALPVLPEGVKKAVEDFAASVGQAKNASEVSELLAKSLAEKAVVTEGGKVVEGFSYVSKAVAAKVIATRRAEVHERLLKLWAKRLLVSPELAIVPLNEFDAQLASKFEGISPKYQELLSAVAQGNKTFAQRLNSSPAFSSFLLKREKAESEVPPSELELEAAQKAAELEDPEVALRTLLGPQMEALGASDLLLSEQIRVITEHRYTPDRLQYKEGMKLADKIAAQEAALKEELKVIYGDNVDVKHFQASPRTPVQQLFDSLKNAAANKERAAKEAAAAASPYLAYAVTKKQEVQADPSNIPFDEVLYPQLSEELLELELSDIREDEIALEKAEEEELWLLTLTQQFKHIQKHFGIDLPHSVVAHMDPLLIKKIDWETTNALEDFDITLDDMGAEDAKEQWGAENLSHHFLPLIRYRRDLARKNGDRYGPDLVNGN 618 T 0.21 Lipoprotein_10 pdb F Eukaryota T 6ref 4 D 3 K0J903_9CHLO Mitochondrial F1F0 ATP synthase associated 32 kDa protein MRQASRLALSIRQAGNVEAASAVPAMTRQFSAPGSHEHHETPLSKVMPTVVSIPRKVACLALGATKKVVCGLASSGPSQNLVSTFANKVIVEENLVNVAEIDVPFWSYWLSSAGFTSKDAFVKFAEAVKPKVAALSTSDITNLTVAFKRANYYDKDLFTGIEANVSANFTKFETEQLLQIVATFDAFNHSSVAFLDDVADSITYCNHYLAPVRAGADELATLLTYYAKNGHERADLLATVARGFSEVSLGKLSAAQRKDTVLSALKAFQTFGFYPESIEAVIGAALVSPAEYSAEELKEVEAVKVAAENALGGEFVLIQEGAHGH 325 T 0.042 FAST_1 pdbpercent F Eukaryota T 6ref 7 G 6 D7P897_9CHLO Mitochondrial ATP synthase subunit ASA6 MMLRTLTRSSAVAGQAVRLFKTSAAAAEGNSVAGIIKSVNETSGANLLSSLKTIKAQAAPIYPAAASSTGYSTQAKIALFGALSWILYRADGQSKAHEWIVDLNLNVLQAAWLISFSSLIPFRAVYFAFRGMAPATASTLNGLKTFSSISL 151 T 0.00025 NrsF pdbpercent F Eukaryota T 6ref 8 H 7 D8V7I2_9CHLO Mitochondrial ATP synthase associated protein ASA7 MSSVRAGVEAGRRDLTTFTFSGLQDAPVAALSGSIKLNVAAKAGKAEVTVAAGAAKAATQVSAAALRKLSGSKISLAEVARISVLHSSIQNYLLSLSNERYQLLSQWPDFTTMYGKDFYYRAHPEDLKKFYDAADEYYKLYETVTEFDSLSALASQVVPNYAARRRSTVHPAIGSTVADGAFTNFLLSKQ 190 T 0.96 DUF4296 pdbhh F Eukaryota T 6ref 9 I 8 D8V7I7_9CHLO Mitochondrial ATP synthase subunit ASA8 MVLGEVYLKDILRTPPTGAIPANVPHPFQTSFYTYATKKLIPRHWYLLGGFTFTITLYGILDGLRDSGKKKAYDEAIHAGKTPYTAGGH 89 T 0.3 Selenoprotein_S pdbhh F Eukaryota T 6ref 10 J 9 A0A5H1ZR73_9CHLO ASA-9: Polytomella F-ATP synthase associated subunit 9 MAVTSFLGKAFEKYFYDFSAYEQFGLNRFLSSKGQYVALRHVGFVMVGVNVLLAANFPFNPPFPTIGMCPAGWEGTWVCQADKAKALEMYKEWKKSN 97 T 0.39 1-cysPrx_C pdbhh F Eukaryota T 6rep 1 A 0 A0A5H1ZR95_9CHLO ASA-10: Polytomella F-ATP synthase associated subunit 10 MSYSAYFAKAGFQFPAGLSALVAGIVALNVCTGRPTKGTKEISNAEYNATPIGYLQSPDQHPTAFPKVPGMKDVHGSPHHHH 82 T 0.1 DUF6506 pdb F Eukaryota T 6rep 2 B 1 Q85JD5_9CHLO ATP synthase associated protein ASA1 MMRAAQKAKQELPATVLTQTRSYLAPLRSDFTEEITAPKVASASNLVNEWNNKKQATENLMKLLQAYKDIGDAKSEPLLKNHNPRTFEDRDYPVPDFRTQNLKAGDVPKFFDTVISTRASAAIASKDKFWAGRKTEAEAASAKASAAFPRVAVPEWKKGKTVSIENLNTVTDKYAAALVPKRKLALPVLPEGVKKAVEDFAASVGQAKNASEVSELLAKSLAEKAVVTEGGKVVEGFSYVSKAVAAKVIATRRAEVHERLLKLWAKRLLVSPELAIVPLNEFDAQLASKFEGISPKYQELLSAVAQGNKTFAQRLNSSPAFSSFLLKREKAESEVPPSELELEAAQKAAELEDPEVALRTLLGPQMEALGASDLLLSEQIRVITEHRYTPDRLQYKEGMKLADKIAAQEAALKEELKVIYGDNVDVKHFQASPRTPVQQLFDSLKNAAANKERAAKEAAAAASPYLAYAVTKKQEVQADPSNIPFDEVLYPQLSEELLELELSDIREDEIALEKAEEEELWLLTLTQQFKHIQKHFGIDLPHSVVAHMDPLLIKKIDWETTNALEDFDITLDDMGAEDAKEQWGAENLSHHFLPLIRYRRDLARKNGDRYGPDLVNGN 618 T 0.21 Lipoprotein_10 pdb F Eukaryota T 6rep 4 D 3 K0J903_9CHLO Mitochondrial F1F0 ATP synthase associated 32 kDa protein MRQASRLALSIRQAGNVEAASAVPAMTRQFSAPGSHEHHETPLSKVMPTVVSIPRKVACLALGATKKVVCGLASSGPSQNLVSTFANKVIVEENLVNVAEIDVPFWSYWLSSAGFTSKDAFVKFAEAVKPKVAALSTSDITNLTVAFKRANYYDKDLFTGIEANVSANFTKFETEQLLQIVATFDAFNHSSVAFLDDVADSITYCNHYLAPVRAGADELATLLTYYAKNGHERADLLATVARGFSEVSLGKLSAAQRKDTVLSALKAFQTFGFYPESIEAVIGAALVSPAEYSAEELKEVEAVKVAAENALGGEFVLIQEGAHGH 325 T 0.042 FAST_1 pdbpercent F Eukaryota T 6rep 7 G 6 D7P897_9CHLO Mitochondrial ATP synthase subunit ASA6 MMLRTLTRSSAVAGQAVRLFKTSAAAAEGNSVAGIIKSVNETSGANLLSSLKTIKAQAAPIYPAAASSTGYSTQAKIALFGALSWILYRADGQSKAHEWIVDLNLNVLQAAWLISFSSLIPFRAVYFAFRGMAPATASTLNGLKTFSSISL 151 T 0.00025 NrsF pdbpercent F Eukaryota T 6rep 8 H 7 D8V7I2_9CHLO Mitochondrial ATP synthase associated protein ASA7 MSSVRAGVEAGRRDLTTFTFSGLQDAPVAALSGSIKLNVAAKAGKAEVTVAAGAAKAATQVSAAALRKLSGSKISLAEVARISVLHSSIQNYLLSLSNERYQLLSQWPDFTTMYGKDFYYRAHPEDLKKFYDAADEYYKLYETVTEFDSLSALASQVVPNYAARRRSTVHPAIGSTVADGAFTNFLLSKQ 190 T 0.96 DUF4296 pdbhh F Eukaryota T 6rep 9 I 8 D8V7I7_9CHLO Mitochondrial ATP synthase subunit ASA8 MVLGEVYLKDILRTPPTGAIPANVPHPFQTSFYTYATKKLIPRHWYLLGGFTFTITLYGILDGLRDSGKKKAYDEAIHAGKTPYTAGGH 89 T 0.3 Selenoprotein_S pdbhh F Eukaryota T 6rep 10 J 9 A0A5H1ZR73_9CHLO ASA-9: Polytomella F-ATP synthase associated subunit 9 MAVTSFLGKAFEKYFYDFSAYEQFGLNRFLSSKGQYVALRHVGFVMVGVNVLLAANFPFNPPFPTIGMCPAGWEGTWVCQADKAKALEMYKEWKKSN 97 T 0.39 1-cysPrx_C pdbhh F Eukaryota T 6res 1 A 0 A0A5H1ZR95_9CHLO ASA-10: Polytomella F-ATP synthase associated subunit 10 MSYSAYFAKAGFQFPAGLSALVAGIVALNVCTGRPTKGTKEISNAEYNATPIGYLQSPDQHPTAFPKVPGMKDVHGSPHHHH 82 T 0.1 DUF6506 pdb F Eukaryota T 6res 2 B 1 Q85JD5_9CHLO ATP synthase associated protein ASA1 MMRAAQKAKQELPATVLTQTRSYLAPLRSDFTEEITAPKVASASNLVNEWNNKKQATENLMKLLQAYKDIGDAKSEPLLKNHNPRTFEDRDYPVPDFRTQNLKAGDVPKFFDTVISTRASAAIASKDKFWAGRKTEAEAASAKASAAFPRVAVPEWKKGKTVSIENLNTVTDKYAAALVPKRKLALPVLPEGVKKAVEDFAASVGQAKNASEVSELLAKSLAEKAVVTEGGKVVEGFSYVSKAVAAKVIATRRAEVHERLLKLWAKRLLVSPELAIVPLNEFDAQLASKFEGISPKYQELLSAVAQGNKTFAQRLNSSPAFSSFLLKREKAESEVPPSELELEAAQKAAELEDPEVALRTLLGPQMEALGASDLLLSEQIRVITEHRYTPDRLQYKEGMKLADKIAAQEAALKEELKVIYGDNVDVKHFQASPRTPVQQLFDSLKNAAANKERAAKEAAAAASPYLAYAVTKKQEVQADPSNIPFDEVLYPQLSEELLELELSDIREDEIALEKAEEEELWLLTLTQQFKHIQKHFGIDLPHSVVAHMDPLLIKKIDWETTNALEDFDITLDDMGAEDAKEQWGAENLSHHFLPLIRYRRDLARKNGDRYGPDLVNGN 618 T 0.21 Lipoprotein_10 pdb F Eukaryota T 6res 4 D 3 K0J903_9CHLO Mitochondrial F1F0 ATP synthase associated 32 kDa protein MRQASRLALSIRQAGNVEAASAVPAMTRQFSAPGSHEHHETPLSKVMPTVVSIPRKVACLALGATKKVVCGLASSGPSQNLVSTFANKVIVEENLVNVAEIDVPFWSYWLSSAGFTSKDAFVKFAEAVKPKVAALSTSDITNLTVAFKRANYYDKDLFTGIEANVSANFTKFETEQLLQIVATFDAFNHSSVAFLDDVADSITYCNHYLAPVRAGADELATLLTYYAKNGHERADLLATVARGFSEVSLGKLSAAQRKDTVLSALKAFQTFGFYPESIEAVIGAALVSPAEYSAEELKEVEAVKVAAENALGGEFVLIQEGAHGH 325 T 0.042 FAST_1 pdbpercent F Eukaryota T 6res 7 G 6 D7P897_9CHLO Mitochondrial ATP synthase subunit ASA6 MMLRTLTRSSAVAGQAVRLFKTSAAAAEGNSVAGIIKSVNETSGANLLSSLKTIKAQAAPIYPAAASSTGYSTQAKIALFGALSWILYRADGQSKAHEWIVDLNLNVLQAAWLISFSSLIPFRAVYFAFRGMAPATASTLNGLKTFSSISL 151 T 0.00025 NrsF pdbpercent F Eukaryota T 6res 8 H 7 D8V7I2_9CHLO Mitochondrial ATP synthase associated protein ASA7 MSSVRAGVEAGRRDLTTFTFSGLQDAPVAALSGSIKLNVAAKAGKAEVTVAAGAAKAATQVSAAALRKLSGSKISLAEVARISVLHSSIQNYLLSLSNERYQLLSQWPDFTTMYGKDFYYRAHPEDLKKFYDAADEYYKLYETVTEFDSLSALASQVVPNYAARRRSTVHPAIGSTVADGAFTNFLLSKQ 190 T 0.96 DUF4296 pdbhh F Eukaryota T 6res 9 I 8 D8V7I7_9CHLO Mitochondrial ATP synthase subunit ASA8 MVLGEVYLKDILRTPPTGAIPANVPHPFQTSFYTYATKKLIPRHWYLLGGFTFTITLYGILDGLRDSGKKKAYDEAIHAGKTPYTAGGH 89 T 0.3 Selenoprotein_S pdbhh F Eukaryota T 6res 10 J 9 A0A5H1ZR73_9CHLO ASA-9: Polytomella F-ATP synthase associated subunit 9 MAVTSFLGKAFEKYFYDFSAYEQFGLNRFLSSKGQYVALRHVGFVMVGVNVLLAANFPFNPPFPTIGMCPAGWEGTWVCQADKAKALEMYKEWKKSN 97 T 0.39 1-cysPrx_C pdbhh F Eukaryota T 6ret 1 A 0 A0A5H1ZR95_9CHLO ASA-10: Polytomella F-ATP synthase associated subunit 10 MSYSAYFAKAGFQFPAGLSALVAGIVALNVCTGRPTKGTKEISNAEYNATPIGYLQSPDQHPTAFPKVPGMKDVHGSPHHHH 82 T 0.1 DUF6506 pdb F Eukaryota T 6ret 2 B 1 Q85JD5_9CHLO ATP synthase associated protein ASA1 MMRAAQKAKQELPATVLTQTRSYLAPLRSDFTEEITAPKVASASNLVNEWNNKKQATENLMKLLQAYKDIGDAKSEPLLKNHNPRTFEDRDYPVPDFRTQNLKAGDVPKFFDTVISTRASAAIASKDKFWAGRKTEAEAASAKASAAFPRVAVPEWKKGKTVSIENLNTVTDKYAAALVPKRKLALPVLPEGVKKAVEDFAASVGQAKNASEVSELLAKSLAEKAVVTEGGKVVEGFSYVSKAVAAKVIATRRAEVHERLLKLWAKRLLVSPELAIVPLNEFDAQLASKFEGISPKYQELLSAVAQGNKTFAQRLNSSPAFSSFLLKREKAESEVPPSELELEAAQKAAELEDPEVALRTLLGPQMEALGASDLLLSEQIRVITEHRYTPDRLQYKEGMKLADKIAAQEAALKEELKVIYGDNVDVKHFQASPRTPVQQLFDSLKNAAANKERAAKEAAAAASPYLAYAVTKKQEVQADPSNIPFDEVLYPQLSEELLELELSDIREDEIALEKAEEEELWLLTLTQQFKHIQKHFGIDLPHSVVAHMDPLLIKKIDWETTNALEDFDITLDDMGAEDAKEQWGAENLSHHFLPLIRYRRDLARKNGDRYGPDLVNGN 618 T 0.21 Lipoprotein_10 pdb F Eukaryota T 6ret 4 D 3 K0J903_9CHLO Mitochondrial F1F0 ATP synthase associated 32 kDa protein MRQASRLALSIRQAGNVEAASAVPAMTRQFSAPGSHEHHETPLSKVMPTVVSIPRKVACLALGATKKVVCGLASSGPSQNLVSTFANKVIVEENLVNVAEIDVPFWSYWLSSAGFTSKDAFVKFAEAVKPKVAALSTSDITNLTVAFKRANYYDKDLFTGIEANVSANFTKFETEQLLQIVATFDAFNHSSVAFLDDVADSITYCNHYLAPVRAGADELATLLTYYAKNGHERADLLATVARGFSEVSLGKLSAAQRKDTVLSALKAFQTFGFYPESIEAVIGAALVSPAEYSAEELKEVEAVKVAAENALGGEFVLIQEGAHGH 325 T 0.042 FAST_1 pdbpercent F Eukaryota T 6ret 7 G 6 D7P897_9CHLO Mitochondrial ATP synthase subunit ASA6 MMLRTLTRSSAVAGQAVRLFKTSAAAAEGNSVAGIIKSVNETSGANLLSSLKTIKAQAAPIYPAAASSTGYSTQAKIALFGALSWILYRADGQSKAHEWIVDLNLNVLQAAWLISFSSLIPFRAVYFAFRGMAPATASTLNGLKTFSSISL 151 T 0.00025 NrsF pdbpercent F Eukaryota T 6ret 8 H 7 D8V7I2_9CHLO Mitochondrial ATP synthase associated protein ASA7 MSSVRAGVEAGRRDLTTFTFSGLQDAPVAALSGSIKLNVAAKAGKAEVTVAAGAAKAATQVSAAALRKLSGSKISLAEVARISVLHSSIQNYLLSLSNERYQLLSQWPDFTTMYGKDFYYRAHPEDLKKFYDAADEYYKLYETVTEFDSLSALASQVVPNYAARRRSTVHPAIGSTVADGAFTNFLLSKQ 190 T 0.96 DUF4296 pdbhh F Eukaryota T 6ret 9 I 8 D8V7I7_9CHLO Mitochondrial ATP synthase subunit ASA8 MVLGEVYLKDILRTPPTGAIPANVPHPFQTSFYTYATKKLIPRHWYLLGGFTFTITLYGILDGLRDSGKKKAYDEAIHAGKTPYTAGGH 89 T 0.3 Selenoprotein_S pdbhh F Eukaryota T 6ret 10 J 9 A0A5H1ZR73_9CHLO ASA-9: Polytomella F-ATP synthase associated subunit 9 MAVTSFLGKAFEKYFYDFSAYEQFGLNRFLSSKGQYVALRHVGFVMVGVNVLLAANFPFNPPFPTIGMCPAGWEGTWVCQADKAKALEMYKEWKKSN 97 T 0.39 1-cysPrx_C pdbhh F Eukaryota T 6rfk 3 C I GLU-GLY-AR7-0QE EGXX 4 T 320 DUF6399 pdbhh F F 6rfm 1 A A CYAA_BORPE AC-HLY,ACT,CYCLOLYSIN GSRSFSLGEVSDMAAVEAAELEMTRQVLHAGARQDDAEPGVSGASAHWGQRALQGAQAVAAAQRLVHAIALMTQFGRAGS 80 T 35 Hol_Tox pdbhh F Bacteria T 6rfq 17 Q S F2Z673_YARLI Subunit NESM of NADH:Ubiquinone Oxidoreductase (Complex I) MLKLHYRNFITAQHSTTNTTPTMIASVCKRAGLRAGPRAYPGVRQFALRAYNEEKELALKQRLSQLPPPGKAFVTAEGEPRPAKEAELAELAEIAALYKTDRVGILDILLLGNKHARLYRDNTALLKDYYYNGRRILDKIPVKDKQTGKVTWEIKREGAEKEDWVNQMYFLYAPSLILLLIVMVYKSREDITFWAKKELDQRVLDKHPEINDAPENERDALIVERIIAGDYDKLASLQKKATPTPATLI 249 T 0.0013 ESSS pdbpercent F Eukaryota T 6rfq 22 V Z Q6CI10_YARLI Subunit NUZM of NADH:Ubiquinone Oxidoreductase (Complex I) MLPGGPVPVFKKYTVGSKGIWEKLRVLLAIAPNRSTGNPIVPLYRVPTPGSRPEANVYQDPSSYPTNDIAENPYWKRDHRRAYPQTAFFDQKTVTGLLELGSEATPRIADGEAGTKALANIANGGVSFTQALGKSSKDVIYGEVLTVNGLPPVAPTLAPKQWKIIEGEAAIYPKGYPCRTFH 182 T 0.00017 CI-B14_5a pdbhh F Eukaryota T 6rfq 30 DA i A0A1H6Q311_YARLL Subunit N7BM of NADH:Ubiquinone Oxidoreductase (Complex I) MGGGRYPFPKDVISMTGGWWANPSNWKLNGLFATGIAVGLALWVSTATLPYTRRREGITSESDISKWNAAAGVWRERHGKISTGEAAESE 90 T 6 Sex_peptide pdbhh F Eukaryota T 6rfq 33 GA n Q6C1R9_YARLI Subunit NUNM of NADH:Ubiquinone Oxidoreductase (Complex I) FGSHGHGPELTPAVPFFQPYVLKWAGVSLGLVAFYQFNSSYEAKNGHTWVETFFHPKSREDILNEEAKIVQALNNQRELTIKMHELKREEKDYAHSYSPLFSDPVPQGGSIGKAPGSSRE 120 T 0.023 DUF5950 pdb F Eukaryota T 6rfr 18 R S F2Z673_YARLI Subunit NESM of NADH:Ubiquinone Oxidoreductase (Complex I) MLKLHYRNFITAQHSTTNTTPTMIASVCKRAGLRAGPRAYPGVRQFALRAYNEEKELALKQRLSQLPPPGKAFVTAEGEPRPAKEAELAELAEIAALYKTDRVGILDILLLGNKHARLYRDNTALLKDYYYNGRRILDKIPVKDKQTGKVTWEIKREGAEKEDWVNQMYFLYAPSLILLLIVMVYKSREDITFWAKKELDQRVLDKHPEINDAPENERDALIVERIIAGDYDKLASLQKKATPTPATLI 249 T 0.0013 ESSS pdbpercent F Eukaryota T 6rfr 23 W Z Q6CI10_YARLI Subunit NUZM of NADH:Ubiquinone Oxidoreductase (Complex I) MLPGGPVPVFKKYTVGSKGIWEKLRVLLAIAPNRSTGNPIVPLYRVPTPGSRPEANVYQDPSSYPTNDIAENPYWKRDHRRAYPQTAFFDQKTVTGLLELGSEATPRIADGEAGTKALANIANGGVSFTQALGKSSKDVIYGEVLTVNGLPPVAPTLAPKQWKIIEGEAAIYPKGYPCRTFH 182 T 0.00017 CI-B14_5a pdbhh F Eukaryota T 6rfr 32 FA i A0A1H6Q311_YARLL Subunit NUUM of NADH:Ubiquinone Oxidoreductase (Complex I) MGGGRYPFPKDVISMTGGWWANPSNWKLNGLFATGIAVGLALWVSTATLPYTRRREGITSESDISKWNAAAGVWRERHGKISTGEAAESE 90 T 6 Sex_peptide pdbhh F Eukaryota T 6rfr 34 HA n Q6C1R9_YARLI Subunit NUNM of NADH:Ubiquinone Oxidoreductase (Complex I) FGSHGHGPELTPAVPFFQPYVLKWAGVSLGLVAFYQFNSSYEAKNGHTWVETFFHPKSREDILNEEAKIVQALNNQRELTIKMHELKREEKDYAHSYSPLFSDPVPQGGSIGKAPGSSRE 120 T 0.023 DUF5950 pdb F Eukaryota T 6rfs 18 R S F2Z673_YARLI Subunit NESM of NADH:Ubiquinone Oxidoreductase (Complex I) MLKLHYRNFITAQHSTTNTTPTMIASVCKRAGLRAGPRAYPGVRQFALRAYNEEKELALKQRLSQLPPPGKAFVTAEGEPRPAKEAELAELAEIAALYKTDRVGILDILLLGNKHARLYRDNTALLKDYYYNGRRILDKIPVKDKQTGKVTWEIKREGAEKEDWVNQMYFLYAPSLILLLIVMVYKSREDITFWAKKELDQRVLDKHPEINDAPENERDALIVERIIAGDYDKLASLQKKATPTPATLI 249 T 0.0013 ESSS pdbpercent F Eukaryota T 6rfs 22 V Z Q6CI10_YARLI Subunit NUZM of NADH:Ubiquinone Oxidoreductase (Complex I) MLPGGPVPVFKKYTVGSKGIWEKLRVLLAIAPNRSTGNPIVPLYRVPTPGSRPEANVYQDPSSYPTNDIAENPYWKRDHRRAYPQTAFFDQKTVTGLLELGSEATPRIADGEAGTKALANIANGGVSFTQALGKSSKDVIYGEVLTVNGLPPVAPTLAPKQWKIIEGEAAIYPKGYPCRTFH 182 T 0.00017 CI-B14_5a pdbhh F Eukaryota T 6rfs 31 EA i A0A1H6Q311_YARLL Subunit NUUM of NADH:Ubiquinone Oxidoreductase (Complex I) MGGGRYPFPKDVISMTGGWWANPSNWKLNGLFATGIAVGLALWVSTATLPYTRRREGITSESDISKWNAAAGVWRERHGKISTGEAAESE 90 T 6 Sex_peptide pdbhh F Eukaryota T 6rfs 33 GA n Q6C1R9_YARLI Subunit NUNM of NADH:Ubiquinone Oxidoreductase (Complex I) FGSHGHGPELTPAVPFFQPYVLKWAGVSLGLVAFYQFNSSYEAKNGHTWVETFFHPKSREDILNEEAKIVQALNNQRELTIKMHELKREEKDYAHSYSPLFSDPVPQGGSIGKAPGSSRE 120 T 0.023 DUF5950 pdb F Eukaryota T 6rh6 2 B B AP2M1_RAT AP-2 MU CHAIN,ADAPTOR PROTEIN COMPLEX AP-2 SUBUNIT MU,ADAPTOR-RELATED PROTEIN COMPLEX 2 SUBUNIT MU,CLATHRIN ASSEMBLY PROTEIN COMPLEX 2 MU MEDIUM CHAIN,CLATHRIN COAT ASSEMBLY PROTEIN AP50,CLATHRIN COAT-ASSOCIATED PROTEIN AP50,MU2-ADAPTIN,PLASMA MEMBRANE ADAPTOR AP-2 50 KDA PROTEIN SQITSQVTGQIGWRR 15 T 74 DUF2553 pdbhh F Eukaryota T 6rhc 2 B P WWTR1_HUMAN TRANSCRIPTIONAL COACTIVATOR WITH PDZ-BINDING MOTIF RSHSSPASLQLGT 13 T 0.13 TFIIA unppercent F Eukaryota T 6rhe 2 B D ACE-ALA-HIS-CYS-GLY-NH2 XAHCGX 6 T 41 zf-ISL3 pdbhh F F 6ri5 44 RA p uL1 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 210 F F F 6rj9 1 A,B A,B A0A2U7VKE8_9CAUD AcrIIA6 MKINDDIKELILEYMSRYFKFENDFYKLPGIKFTDANWQKFKNGGTDIEKMGAARVNAMLDCLFDDFELAMIGKAQTNYYNDNSLKMNMPFYTYYDMFKKQQLLKWLKNNRDDVIGGTGRMYTASGNYIANAYLEVALESSSLGSGSYMLQMRFKDYSKGQEPIPSGRQNRLEWIENNLENIR 183 T 0.06 PDH_E1_M pdb T Viruses T 6rja 1 A,B A,B A0A2U7VKE8_9CAUD AcrIIA6 MKINDDIKELILEYMSRYFKFENDFYKLPGIKFTDANWQKFKNGGTDIEKMGAARVNAMLDCLFDDFELAMIGKAQTNYYNDNSLKMNMPFYTYYDMFKKQQLLKWLKNNRDDVIGGTGRMYTASGNYIANAYLEVALESSSLGSGSYMLQMRFKDYSKGQEPIPSGRQNRLEWIENNLENIR 183 T 0.06 PDH_E1_M pdb T Viruses T 6rjg 1 A,B A,B A0A2U7VKE8_9CAUD AcrIIA6 MKINDDIKELILEYMSRYFKFENDFYKLPGIKFTDANWQKFKNGGTDIEKMGAARVNAMLDCLFDDFELAMIGKAQTNYYNDNSLKMNMPFYTYYDMFKKQQLLKWLKNNRDDVIGGTGRMYTASGNYIANAYLEVALESSSLGSGSYMLQMRFKDYSKGQEPIPSGRQNRLEWIENNLENIR 183 T 0.06 PDH_E1_M pdb T Viruses T 6rjl 2 B P WWTR1_HUMAN TAZpS89 RSHSSPASLQLGT 13 T 0.13 TFIIA unppercent F Eukaryota T 6rjq 2 B P WWTR1_HUMAN TAZpS89 RSHSSPASLQLGT 13 T 0.13 TFIIA unppercent F Eukaryota T 6rjx 1 A A O51302_BORBU LysM domain protein GAMGESRESKNAKIAQPDNKNFQLRDIKDIKNELIRERGHLFYSKEFNEAERLEEAMKQSFSKKKAIEGNEIALKVLERYKTIIRETREKKEKTNYLKENIEKYLNDAEANEAYIWIPLEIDEVNNLYFEATRKYKNYDLDNALDMYSKAFNRAQQAAKNAKEAKALKETDERMYKQLKALEAASNLPI 189 T 0.063 DUF2686 pdbpercent F Bacteria T 6rjz 2 B P P53_HUMAN ANTIGEN NY-CO-13,PHOSPHOPROTEIN P53,TUMOR SUPPRESSOR P53 KLMFKTEGPDSD 12 T 44 CopB pdbhh F Eukaryota T 6rk8 2 B P P53_HUMAN ANTIGEN NY-CO-13,PHOSPHOPROTEIN P53,TUMOR SUPPRESSOR P53 KLMFKTEGPDSD 12 T 44 CopB pdbhh F Eukaryota T 6rki 2 B P P53_HUMAN ANTIGEN NY-CO-13,PHOSPHOPROTEIN P53,TUMOR SUPPRESSOR P53 KLMFKTEGPDSD 12 T 44 CopB pdbhh F Eukaryota T 6rkk 2 B P P53_HUMAN ANTIGEN NY-CO-13,PHOSPHOPROTEIN P53,TUMOR SUPPRESSOR P53 KLMFKTEGPDSD 12 T 44 CopB pdbhh F Eukaryota T 6rkm 2 B P P53_HUMAN ANTIGEN NY-CO-13,PHOSPHOPROTEIN P53,TUMOR SUPPRESSOR P53 KLMFKTEGPDSD 12 T 44 CopB pdbhh F Eukaryota T 6rko 3 C H YNHF_ECOLI Uncharacterized protein YnhF MSTDLKFSLVTTIIVLGLIVAVGLTAALH 29 T 1.1 DUF6520 pdbhh F Bacteria T 6rl3 2 B P P53_HUMAN ANTIGEN NY-CO-13,PHOSPHOPROTEIN P53,TUMOR SUPPRESSOR P53 KLMFKTEGPDSD 12 T 44 CopB pdbhh F Eukaryota T 6rl4 2 B P P53_HUMAN ANTIGEN NY-CO-13,PHOSPHOPROTEIN P53,TUMOR SUPPRESSOR P53 KLMFKTEGPDSD 12 T 44 CopB pdbhh F Eukaryota T 6rl6 2 B P P53_HUMAN ANTIGEN NY-CO-13,PHOSPHOPROTEIN P53,TUMOR SUPPRESSOR P53 KLMFKTEGPDSD 12 T 44 CopB pdbhh F Eukaryota T 6rlz 2 C,D C,D EtMe GXLDXLDL 8 T 17 CTP_transf_1 pdbhh F F 6rm3 32 FA SH0 eS7 MSAQTETEKQISEIIKKFVTDIDEKKLQEINIQTFIRDNKRKVMTVKVPVEIISKSQINFGSIIKNIKQKFQDYYIILVENIKNEEKTSWNDCKKIFKGACYPFNINGIRTDVISPEEEIVNVLLEKKCTFNEDEFKMIETAIKGLVGMNVVVSTNFHSLN 161 T 0.0037 Ribosomal_S7e pdbpercent F T 6rm3 43 QA LM0 eL14 MKFIELGRLVAPIIKKERNIKAIIIGIIDSTFVVLKKSNGENEVCPVSSLILLDEVYDIKNLSSEEIVKLIENKKEEGGASNDFERFKNKLREEVKKNILREKGI 105 T 0.0025 Ribosomal_L14e pdbhh F T 6rm3 48 VA LNN MDF2 MEKQQNEKKLNEEETEKLALTEEHPKKKVNEEDNLDTLPEKREEDIVFKKVNVEKNKEKEEDHNFSSNYADHKIDLLSVENKDFPKKQKKVKDSLHLLKENRDRDFGRSKRGHVRNKKQGRETGTRRIKIRKNNYESNDINNYNIKTISKKSRRKEAERQ 160 T 58 DUF2970 pdbhh F T 6rm3 72 TB LXX msL1 MASKLKKSWKDEKKKSKTAIFSLEKQKVMKQERLQKAKILREKIKGLKEKKKQYYLDLARQKADKLEADKLLN 73 T 0.086 FliJ pdbpssm F T 6rm5 2 B P P53_HUMAN ANTIGEN NY-CO-13,PHOSPHOPROTEIN P53,TUMOR SUPPRESSOR P53 KLMFKTEGPDSD 12 T 44 CopB pdbhh F Eukaryota T 6rm7 2 B P P53_HUMAN ANTIGEN NY-CO-13,PHOSPHOPROTEIN P53,TUMOR SUPPRESSOR P53 KLMFKTEGPDSD 12 T 44 CopB pdbhh F Eukaryota T 6rml 2 C C TP53B_HUMAN 53BP1 EVEEIPETPCESQGE 15 T 9 Perm-CXXC pdbhh F Eukaryota T 6rmm 2 E,F P,R TP53B_HUMAN 53BP1 SSDLVAPSPDAFRST 15 T 11 Tachykinin pdbhh F Eukaryota T 6rmv 3 C C TRPM1_MOUSE Transient receptor potential cation channel, subfamily M, member 3 KRPKALKLLGMEDDI 15 T 8.4 RPT pdbhh F Eukaryota T 6rn2 2 G S casein AAAAAAAAAAAAAAAAAAAAAAAA 24 T 670 DUF4699 pdbhh F F 6rn3 2 G S casein XXXXXXXXXXXXXXXXXXXXXXXX 24 F F F 6rn4 2 G S casein AAAAAAAAAAAAAAAAAAAAAAAA 24 T 670 DUF4699 pdbhh F F 6ro1 2 B B NVL_HUMAN NUCLEAR VCP-LIKE PROTEIN GPDSMKDSEGGWFIDKTPSVKKDSFFLDLSCEKSNPKKPITEIQDSKDSSLLESD 55 T 6.9 Amdo_NSP pdbhh F Eukaryota T 6ro6 1 A,B,C,D,E,F,G,H A,B,C,D,E,F,G,H DDROC_DEIDV HTH-type transcriptional regulator DdrOC MEGALPKGLSDLIADPTLGPQITPDWVRTLSRIELRGKRPRDKQDWYEIYLHLKRILS 58 T 3.6 DUF4936 pdbhh F Bacteria T 6roy 2 B,D D,C PDCD1_HUMAN immune receptor tyrosine-based inhibitory motif (ITIM) FSVDXGELDFQ 11 T 0.0063 SIT unphh F Eukaryota T 6roz 2 B,D B,D PDCD1_HUMAN immune receptor tyrosine-based switch motif (ITSM) EQTEXATIVFP 11 T 0.002 DUF4578 pdbhh F Eukaryota T 6rp4 1 A,B,C,D A,B,C,D SIDD_LEGPH SIDD SIGVSDGLLSYIKNENENKGFLGIYGFFTGADKNIEKATLYKNLIAKYQNNHFISLIILSALVSDSKTPLMTQYLVGYLDFPSKALLANKITELLLKELENPDMREILGSRLATDVIEELETKIIRYIHNPAGSDIHSTLNLWTADKIKAATNSSLTI 158 T 0.0095 Yuri_gagarin pdbpercent F Bacteria T 6rp6 2 B P WWTR1_HUMAN TRANSCRIPTIONAL COACTIVATOR WITH PDZ-BINDING MOTIF RSHSSPASLQLGT 13 T 0.13 TFIIA unppercent F Eukaryota T 6rqj 2 B C C5I_ORNMO Complement inhibitor MASHHHHHHHHHHSGDSESDCTGSEPVDAFQAFSEGKEAYVLVRSTDPKARDCLKGEPAGEKQDNTLPVMMTFKQGTDWASTDWTFTLDGAKVTATLGQLTQNREVVYDSQSHHCHVDKVEKEVPDYEMWMLDAGGLEVEVECCRQKLEELASGRNQMYPHLKDC 165 T 8.1E-05 His_binding pdbhh F Eukaryota T 6rqj 3 C D C5I1_RHIAP Rhipicephalus appendiculatus RaCI1 GPMEEVKTTPIPNHQCVNATCERKLDALGNAVITKCPQGCLCVVRGASNIVPANGTCFQLATTKPPMAPGDNKDNKEEESN 81 T 0.0095 UPAR_LY6_2 unppercent F Eukaryota T 6rqs 1 A A ARG-ARG-TRP-ARG-ARG-TRP-TRP-ARG-ARG-TRP-TRP-ARG-ARG-TRP-ARG-ARG XXRRWRRWWRRWWRRWRR 18 T 17 Trp_leader2 pdbhh F F 6rqx 2 B B PSE-LYS-HIS-HIS-ALA-PHE-SER-PHE-LYS XKHHAFSFK 9 T 13 SmaI pdbhh F T 6rr0 1 A,B,C,D,E,F,G A,B,C,D,E,F,G SIR4_YEAST SILENT INFORMATION REGULATOR 4 GPKPKNTKENLSKSSWRQEWLANLKLISVSLVDEFPSELSDSDRQIINEKMQLLKDIFANNLKSAISNNFRESDIIILKGEIEDYPMSSEIKIYYNELQNKPDAKKARFWSFMKTQRFVSNMGFDIQ 127 T 0.092 DUF6120 pdbpercent F Eukaryota T 6rr0 2 H,I,J,K,L,M,N H,I,J,K,L,M,N UBP10_YEAST DEUBIQUITINATING ENZYME 10,DISRUPTER OF TELOMERE SILENCING PROTEIN 4,UBIQUITIN THIOESTERASE 10,UBIQUITIN-SPECIFIC-PROCESSING PROTEASE 10 LSTELSTEPPSS 12 T 0.29 DUF1155 pdbhh F Eukaryota F 6rrc 2 B,D B,D RAD21_HUMAN HHR21,NUCLEAR MATRIX PROTEIN 1,NXP-1,SCC1 HOMOLOG KRKLIVDSVKELDSKTIRAQLSDYS 25 T 3.2 E2 pdbhh F Eukaryota T 6rre 1 A,B,C,D,E,F A,B,C,D,E,F SIDD_LEGPH SIDD MRSIITQICNGVLHGQSYQSGSNDLDKGNSEIFASSLFVHLNEQGKEIIKHKDSDDKIVIGYTKDGMAFQIVVDGFYGCERQAVFSFIDNYVLPLIDNFSLDLTRYPDSKKVTESLIHTIYSLRSKHAPLAEFTMSLCVTYQKDEQLFCAGFGIGDTGIAIKRNEGTIEQLVCHTEVDGFKDAFDNYSSANIDLVIERNSVFNTKVMPGDELVGYTYVPPMLEMTEKEFEVETVDGKKINKRIVRHLNLDPGNFDDKDPLFSQLLQVVKSKQKQLVEQAKETGQIQRFGDDFTVGRLVIPDQLLINQLRIHALSIGVSDGLLSYIKNENENKGFLGIYGFFTGADKNIEKATLYKNLIAKYQNNHFISLIILSALVSDSKTPLMTQYLVGYLDFPSKALLANKITELLLKELENPDMREILGSRLATDVIEELETKIIRYIHNPAGSDIHSTLNLWTADKIKAATNSSLTI 471 T 0.08 PP2C_2 pdbpssm F Bacteria T 6rrk 1 A,B B,A STAG1_HUMAN SCC3 HOMOLOG 1,STROMAL ANTIGEN 1 SMSPNGNLIRMLVLFFLESELHEHAAYLVDSLWESSQELLKDWECMTELLLEEPVQGEEAMSDRQESALIELMVCTIRQAAEAHPPVGRGTGKRVLTAKERKTQIDDRNKLTEHFIITLPMLLSKYSADAEKVANLLQIPQYFDLEIYSTGRMEKHLDALLKQIKFVVEKHVESDVLEACSKTYSILCSEEYTIQNRVDIARSQLIDEFVDRFNHSVEDLLQEGEEADDDDIYNVLSTLKRLTSFHNAHDLTKWDLFGNCYRLLKTGIEHGAMPEQIVVQALQCSHYSILWQLVKITDGSPSKEDLLVLRKTVKSFLAVCQQCLSNVNTPVKEQAFMLLCDLLMIFSHQLMTGGREGLQPLVFNPDTGLQSELLSFVMDHVFIDQDEENQSMEGDEEDEANKIEALHKRRNLLAAFSKLIIYDIVDMHAAADIFKHYMKYYNDYGDIIKETLSKTRQID 459 T 0.0089 Cnd3 pdb F Eukaryota T 6rrk 2 C,D C,D RAD21_HUMAN HHR21,NUCLEAR MATRIX PROTEIN 1,NXP-1,SCC1 HOMOLOG PTKKLMMWKETGGVEKLFSLPAQPLWNNRLLKLFTRCLTP 40 T 3.2 Vac_ImportDeg pdbhh F Eukaryota T 6rrl 1 A A peptide 3967 FRIMRILRVLKL 12 T 6.6 OGFr_N pdbhh F T 6rro 1 A A peptide 536_2 GFIVKRFKILV 11 T 1.8 CHDCT2 pdbhh F T 6rrp 1 A,B A,B Q9I188_PSEAE PvdP MTVSRRGFMAGLALTGAAALPVAYYTHRHLTREEEPQTPDEASLDLAATDGIRLGDRLRGLWDLRLVGGDAELPGLPREGLQLVLDVAPKGRGLIGYLDTPERLLAAEPPRFRVLGDLLGASSASIRWRLVDQASGSVAPTHDCSAVFDEVWADYANAGDGTLSGRIQRLERSPLSPNEDFRFVAVKRHFPLAHERIVLNEKLLGWLVSPQHRLFHQLWHASRDKWHRLSEKQRNALRGVGWQPGPLDRERDARGPRKDRNASGIDFFFMHRHMLHTARSMQDLPSWERLPRPVVPLEYDRPGFIRYFDNPDGFSVPPAWVAVDDDEYSEWLHGLKSAEAYHANFLVWESQYQDPAYLAKLTLGQFGSELELGMHDWLHMRWASVTRDPSNGAPVMTDRFPADFAPRWFRPENDFLGDPFSSHVNPVFWSFHGWIDDRIEDWYRAHERFHPGEVQRREVEGIQWFAPGRWVEVGDPWLGPATHGCGLSDVQASSNSVELDVETMKLALRIIFSEEDQLSGWLKRAPRRPWYARNLKLARDQLRR 544 T 0.0015 Hemocyanin_M pdbhh F Bacteria T 6rrq 1 A,B A,B Q9I188_PSEAE PvdP MTVSRRGFMAGLALTGAAALPVAYYTHRHLTREEEPQTPDEASLDLAATDGIRLGDRLRGLWDLRLVGGDAELPGLPREGLQLVLDVAPKGRGLIGYLDTPERLLAAEPPRFRVLGDLLGASSASIRWRLVDQASGSVAPTHDCSAVFDEVWADYANAGDGTLSGRIQRLERSPLSPNEDFRFVAVKRHFPLAHERIVLNEKLLGWLVSPQHRLFHQLWHASRDKWHRLSEKQRNALRGVGWQPGPLDRERDARGPRKDRNASGIDFFFMHRHMLHTARSMQDLPSWERLPRPVVPLEYDRPGFIRYFDNPDGFSVPPAWVAVDDDEYSEWLHGLKSAEAYHANFLVWESQYQDPAYLAKLTLGQFGSELELGMHDWLHMRWASVTRDPSNGAPVMTDRFPADFAPRWFRPENDFLGDPFSSHVNPVFWSFHGWIDDRIEDWYRAHERFHPGEVQRREVEGIQWFAPGRWVEVGDPWLGPATHGCGLSDVQASSNSVELDVETMKLALRIIFSEEDQLSGWLKRAPRRPWYARNLKLARDQLRR 544 T 0.0015 Hemocyanin_M pdbhh F Bacteria T 6rrr 1 A,B A,B Q9I188_PSEAE PvdP MTVSRRGFMAGLALTGAAALPVAYYTHRHLTREEEPQTPDEASLDLAATDGIRLGDRLRGLWDLRLVGGDAELPGLPREGLQLVLDVAPKGRGLIGYLDTPERLLAAEPPRFRVLGDLLGASSASIRWRLVDQASGSVAPTHDCSAVFDEVWADYANAGDGTLSGRIQRLERSPLSPNEDFRFVAVKRHFPLAHERIVLNEKLLGWLVSPQHRLFHQLWHASRDKWHRLSEKQRNALRGVGWQPGPLDRERDARGPRKDRNASGIDFFFMHRHMLHTARSMQDLPSWERLPRPVVPLEYDRPGFIRYFDNPDGFSVPPAWVAVDDDEYSEWLHGLKSAEAYHANFLVWESQYQDPAYLAKLTLGQFGSELELGMHDWLHMRWASVTRDPSNGAPVMTDRFPADFAPRWFRPENDFLGDPFSSHVNPVFWSFHGWIDDRIEDWYRAHERFHPGEVQRREVEGIQWFAPGRWVEVGDPWLGPATHGCGLSDVQASSNSVELDVETMKLALRIIFSEEDQLSGWLKRAPRRPWYARNLKLARDQLRR 544 T 0.0015 Hemocyanin_M pdbhh F Bacteria T 6rrv 1 A A SIR4_YEAST SILENT INFORMATION REGULATOR 4 GPKPKNTKENLSKSSWRQEWLANLKLISVSLVDEFPSELSDSDRQIINEKMQLLKDIFANNLKSAISNNFRESDIIILKGEIEDYPMSSEIKIYYNELQNKPDAKKARFWSFMKTQRFVSNMGFDIQ 127 T 0.092 DUF6120 pdbpercent F Eukaryota T 6rsm 1 A A peptide 12530 KFKKVIWKSFL 11 T 8.7 DUF5665 pdbhh F T 6ru6 2 C C P63_HUMAN P63,CHRONIC ULCERATIVE STOMATITIS PROTEIN,CUSP,KERATINOCYTE TRANSCRIPTION FACTOR KET,TRANSFORMATION-RELATED PROTEIN 63,TP63,TUMOR PROTEIN P73-LIKE,P73L,P40,P51 RTPSSASTVSVGY 13 T 24 Vac7 pdbhh F Eukaryota T 6ru7 2 C,D C,D P63_HUMAN P63,CHRONIC ULCERATIVE STOMATITIS PROTEIN,CUSP,KERATINOCYTE TRANSCRIPTION FACTOR KET,TRANSFORMATION-RELATED PROTEIN 63,TP63,TUMOR PROTEIN P73-LIKE,P73L,P40,P51 YTPSSASTVSVGSSET 16 T 26 GerPB pdbhh F Eukaryota T 6ru8 2 E,F,G,H E,F,G,H P63_HUMAN P63,CHRONIC ULCERATIVE STOMATITIS PROTEIN,CUSP,KERATINOCYTE TRANSCRIPTION FACTOR KET,TRANSFORMATION-RELATED PROTEIN 63,TP63,TUMOR PROTEIN P73-LIKE,P73L,P40,P51 SSASTVSVGSSY 12 T 8.5 HET-S pdbhh F Eukaryota T 6ruj 2 B B CONSENSUS ANKYRIN REPEAT DOMAIN-(D)3-hydroxy-Leu EVVKLLLEHGADVXA 15 T 0.00029 Shigella_OspC pdbhh F T 6rup 2 C C SER-SER-SER-SER SSSS 4 T 330 GerPB pdbhh F F 6rvc 1 A,B,C A,B,C PTC1_HUMAN PTC1 ETGHHHHHHRDGLDLTDIVPRETREYDFIAAQFKYFSFYNMYIVTQKADYPNIQHLLYDLHRSFSNVKYVMLEENKQLPKMWLHYFRDWLQGLQDAFDSDWETGKIMPNNYKNGSDDGVLAYKLLVQTGSRDKPIDISQLTKQRLVDADGIINPSAFYIYLTAWVSNDPVAYAASQANIRPHRPEWVHDKADYMPETRLRIPAAEPIEYAQFPFYLNGLRDTSDFVEAIEKVRTICSNYTSLGLSSYPNGYPFLFWEQYIG 261 T 3.3E-50 Patched unppercent F Eukaryota T 6rw2 2 B B ALA-ARG-ASP-CYS-PRO-LEU-VAL-ASN-PRO-LEU-CYS-LEU-HIS-PRO-GLY-TRP-THR-CYS ARDCPLVNPLCLHPGWTC 18 T 2.6 NusG_II pdbhh F T 6rwh 2 B P P53_HUMAN ANTIGEN NY-CO-13,PHOSPHOPROTEIN P53,TUMOR SUPPRESSOR P53 KLMFKTEGPDSD 12 T 44 CopB pdbhh F Eukaryota T 6rwi 2 B P P53_HUMAN ANTIGEN NY-CO-13,PHOSPHOPROTEIN P53,TUMOR SUPPRESSOR P53 KLMFKTEGPDSD 12 T 44 CopB pdbhh F Eukaryota T 6rws 2 B P P53_HUMAN ANTIGEN NY-CO-13,PHOSPHOPROTEIN P53,TUMOR SUPPRESSOR P53 KLMFKTEGPDSD 12 T 44 CopB pdbhh F Eukaryota T 6rwu 2 B P P53_HUMAN ANTIGEN NY-CO-13,PHOSPHOPROTEIN P53,TUMOR SUPPRESSOR P53 KLMFKTEGPDSD 12 T 44 CopB pdbhh F Eukaryota T 6rwy 1 A,G,H,I,J,K A,G,H,I,J,K Inner rod protein XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 59 F F F 6rwy 2 B,C,D,E,F B,C,D,E,F Inner rod protein XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 73 F F F 6rx2 2 B P P53_HUMAN ANTIGEN NY-CO-13,PHOSPHOPROTEIN P53,TUMOR SUPPRESSOR P53 KLMFKTEGPDSD 12 T 44 CopB pdbhh F Eukaryota T 6rx4 4 D D CYTOCHROME BD-I OXIDASE SUBUNIT Y,CYTOCHROME D UBIQUINOL OXIDASE SUBUNIT Y XXXXXXXXXXXXXXXXXXXXXXXXXX 26 F F F 6rxj 2 C,D C,D H4_YEAST Histone H4 KGGAXRHRKI 10 T 4.2 Shadoo unppercent F Eukaryota T 6rxk 2 B B H4_YEAST Histone H4 KGGAXRHRKIL 11 T 4.2 Shadoo unppercent F Eukaryota T 6rxl 2 B B H4_YEAST Histone H4 KGGAXRHRKIL 11 T 4.2 Shadoo unppercent F Eukaryota T 6rxm 2 G,H,I,J,K,L G,H,I,J,K,L H4_YEAST Histone H4 KGGAXRHRKIL 11 T 11 Shadoo unppercent F Eukaryota T 6rxo 2 C,D C,D H4_YEAST Histone H4 KGGAXRHRKIL 11 T 4.2 Shadoo unppercent F Eukaryota T 6rxp 2 C,D C,D H4_YEAST Histone H4 KGGAXRHRKIL 11 T 4.2 Shadoo unppercent F Eukaryota T 6rxq 2 E,F,G,H E,F,G,H H4_YEAST Histone H4 KGGAKRHRKIL 11 T 11 Shadoo unppercent F Eukaryota T 6rxr 2 E,F,G,H E,F,G,H H4_YEAST Histone H4 KGGAKRHRKIL 11 T 11 Shadoo unppercent F Eukaryota T 6rxs 2 B B H4_HUMAN Histone H4 KGGAXRHRKIL 11 T 11 Shadoo unppercent F Eukaryota T 6rxt 47 YA UH Y1780_CHATD Utp8 MAAKLQIHAPYVLHALPRPLDRSDGLGRYFSGEVFGQKQGGKRKKRTELAVAIDGVAVYLYDILSSQVVTSYLVSPQSCFTCPPSSLRWRPASSKTVTRYTYVSVATGDSVLAKREIRLFREETLSTGNTVVACISRTICSDSPIVHIFTSSPRNFLTNVPGKDIPNHDLIIITANGSIFALNGETLEEKWQVSPSVLSREILSDSKLALQVDFVQQTSAADVADGLFGGKNDLFGVFQERIHRDGFNPDFFVVITSQSGADSANARHLHVLALPSEREARQTGKENVISVFVAPLAVEETCRSFQLDVRSGTLQAISNKALVTYQFANGIAKLENRLQVPGLSSYLRLSKTSVLTSATDSLSVYNPIYRSLQAAARLEPTDDTNGHACEFVSYLASRELAVAIRGGSLVVIQIEAPKNRTAKRRAEGLLTDAIRRGISRKIAFEKRTKPEHVSDSTILADAVPGSLSDPSWSEWQNKAMQADELLQNNDIQSWEELMAEVFKVPIKPDETADAEKQTAPNPVVKLPEWEWPSSRSDYARVDRRWVVYAINKVFGWEGQLESNTGRLTCRLPESSVLIYLVDAGHLSTSNVKSAFKDDVREVDKVEELIGEQLPIILAEVDPTMELLVGYLSGTQLGSSELVSSIKLLLCSLGLFEDGSRLPAVGDNTHIEQVTGQENEVVNMELDRAEEELQITEHYLDEHRTRGLGIAFSKLAACPAAETVKSLRRLFKPDEVLVLLNVLRAELIKDGWTTRYLDKINADQEDDAPPDASIQLIADLMSRCIDAVGLSGWMAADVMLSSSRTHQDSANFFSQFQAEISVALEGVMEAVRLKGVIAEAANYAKRARRALADSAKGKAMTVHMSAELPLGLKTDNKISTERVRSGGEIVARSSRQIGHFISKRRGIYSIHRISEEMLLGAAGPTVVQEAR 930 T 3.1E-10 Utp8 pdbpssm F Eukaryota T 6rxu 54 FB UH Y1780_CHATD Utp8 MAAKLQIHAPYVLHALPRPLDRSDGLGRYFSGEVFGQKQGGKRKKRTELAVAIDGVAVYLYDILSSQVVTSYLVSPQSCFTCPPSSLRWRPASSKTVTRYTYVSVATGDSVLAKREIRLFREETLSTGNTVVACISRTICSDSPIVHIFTSSPRNFLTNVPGKDIPNHDLIIITANGSIFALNGETLEEKWQVSPSVLSREILSDSKLALQVDFVQQTSAADVADGLFGGKNDLFGVFQERIHRDGFNPDFFVVITSQSGADSANARHLHVLALPSEREARQTGKENVISVFVAPLAVEETCRSFQLDVRSGTLQAISNKALVTYQFANGIAKLENRLQVPGLSSYLRLSKTSVLTSATDSLSVYNPIYRSLQAAARLEPTDDTNGHACEFVSYLASRELAVAIRGGSLVVIQIEAPKNRTAKRRAEGLLTDAIRRGISRKIAFEKRTKPEHVSDSTILADAVPGSLSDPSWSEWQNKAMQADELLQNNDIQSWEELMAEVFKVPIKPDETADAEKQTAPNPVVKLPEWEWPSSRSDYARVDRRWVVYAINKVFGWEGQLESNTGRLTCRLPESSVLIYLVDAGHLSTSNVKSAFKDDVREVDKVEELIGEQLPIILAEVDPTMELLVGYLSGTQLGSSELVSSIKLLLCSLGLFEDGSRLPAVGDNTHIEQVTGQENEVVNMELDRAEEELQITEHYLDEHRTRGLGIAFSKLAACPAAETVKSLRRLFKPDEVLVLLNVLRAELIKDGWTTRYLDKINADQEDDAPPDASIQLIADLMSRCIDAVGLSGWMAADVMLSSSRTHQDSANFFSQFQAEISVALEGVMEAVRLKGVIAEAANYAKRARRALADSAKGKAMTVHMSAELPLGLKTDNKISTERVRSGGEIVARSSRQIGHFISKRRGIYSIHRISEEMLLGAAGPTVVQEAR 930 T 3.1E-10 Utp8 pdbpssm F Eukaryota T 6rxv 56 HB UH Y1780_CHATD Utp8 MAAKLQIHAPYVLHALPRPLDRSDGLGRYFSGEVFGQKQGGKRKKRTELAVAIDGVAVYLYDILSSQVVTSYLVSPQSCFTCPPSSLRWRPASSKTVTRYTYVSVATGDSVLAKREIRLFREETLSTGNTVVACISRTICSDSPIVHIFTSSPRNFLTNVPGKDIPNHDLIIITANGSIFALNGETLEEKWQVSPSVLSREILSDSKLALQVDFVQQTSAADVADGLFGGKNDLFGVFQERIHRDGFNPDFFVVITSQSGADSANARHLHVLALPSEREARQTGKENVISVFVAPLAVEETCRSFQLDVRSGTLQAISNKALVTYQFANGIAKLENRLQVPGLSSYLRLSKTSVLTSATDSLSVYNPIYRSLQAAARLEPTDDTNGHACEFVSYLASRELAVAIRGGSLVVIQIEAPKNRTAKRRAEGLLTDAIRRGISRKIAFEKRTKPEHVSDSTILADAVPGSLSDPSWSEWQNKAMQADELLQNNDIQSWEELMAEVFKVPIKPDETADAEKQTAPNPVVKLPEWEWPSSRSDYARVDRRWVVYAINKVFGWEGQLESNTGRLTCRLPESSVLIYLVDAGHLSTSNVKSAFKDDVREVDKVEELIGEQLPIILAEVDPTMELLVGYLSGTQLGSSELVSSIKLLLCSLGLFEDGSRLPAVGDNTHIEQVTGQENEVVNMELDRAEEELQITEHYLDEHRTRGLGIAFSKLAACPAAETVKSLRRLFKPDEVLVLLNVLRAELIKDGWTTRYLDKINADQEDDAPPDASIQLIADLMSRCIDAVGLSGWMAADVMLSSSRTHQDSANFFSQFQAEISVALEGVMEAVRLKGVIAEAANYAKRARRALADSAKGKAMTVHMSAELPLGLKTDNKISTERVRSGGEIVARSSRQIGHFISKRRGIYSIHRISEEMLLGAAGPTVVQEAR 930 T 3.1E-10 Utp8 pdbpssm F Eukaryota T 6rxx 54 DB UH Y1780_CHATD Utp8 MAAKLQIHAPYVLHALPRPLDRSDGLGRYFSGEVFGQKQGGKRKKRTELAVAIDGVAVYLYDILSSQVVTSYLVSPQSCFTCPPSSLRWRPASSKTVTRYTYVSVATGDSVLAKREIRLFREETLSTGNTVVACISRTICSDSPIVHIFTSSPRNFLTNVPGKDIPNHDLIIITANGSIFALNGETLEEKWQVSPSVLSREILSDSKLALQVDFVQQTSAADVADGLFGGKNDLFGVFQERIHRDGFNPDFFVVITSQSGADSANARHLHVLALPSEREARQTGKENVISVFVAPLAVEETCRSFQLDVRSGTLQAISNKALVTYQFANGIAKLENRLQVPGLSSYLRLSKTSVLTSATDSLSVYNPIYRSLQAAARLEPTDDTNGHACEFVSYLASRELAVAIRGGSLVVIQIEAPKNRTAKRRAEGLLTDAIRRGISRKIAFEKRTKPEHVSDSTILADAVPGSLSDPSWSEWQNKAMQADELLQNNDIQSWEELMAEVFKVPIKPDETADAEKQTAPNPVVKLPEWEWPSSRSDYARVDRRWVVYAINKVFGWEGQLESNTGRLTCRLPESSVLIYLVDAGHLSTSNVKSAFKDDVREVDKVEELIGEQLPIILAEVDPTMELLVGYLSGTQLGSSELVSSIKLLLCSLGLFEDGSRLPAVGDNTHIEQVTGQENEVVNMELDRAEEELQITEHYLDEHRTRGLGIAFSKLAACPAAETVKSLRRLFKPDEVLVLLNVLRAELIKDGWTTRYLDKINADQEDDAPPDASIQLIADLMSRCIDAVGLSGWMAADVMLSSSRTHQDSANFFSQFQAEISVALEGVMEAVRLKGVIAEAANYAKRARRALADSAKGKAMTVHMSAELPLGLKTDNKISTERVRSGGEIVARSSRQIGHFISKRRGIYSIHRISEEMLLGAAGPTVVQEAR 930 T 3.1E-10 Utp8 pdbpssm F Eukaryota T 6rxy 48 ZA UH Y1780_CHATD Utp8 MAAKLQIHAPYVLHALPRPLDRSDGLGRYFSGEVFGQKQGGKRKKRTELAVAIDGVAVYLYDILSSQVVTSYLVSPQSCFTCPPSSLRWRPASSKTVTRYTYVSVATGDSVLAKREIRLFREETLSTGNTVVACISRTICSDSPIVHIFTSSPRNFLTNVPGKDIPNHDLIIITANGSIFALNGETLEEKWQVSPSVLSREILSDSKLALQVDFVQQTSAADVADGLFGGKNDLFGVFQERIHRDGFNPDFFVVITSQSGADSANARHLHVLALPSEREARQTGKENVISVFVAPLAVEETCRSFQLDVRSGTLQAISNKALVTYQFANGIAKLENRLQVPGLSSYLRLSKTSVLTSATDSLSVYNPIYRSLQAAARLEPTDDTNGHACEFVSYLASRELAVAIRGGSLVVIQIEAPKNRTAKRRAEGLLTDAIRRGISRKIAFEKRTKPEHVSDSTILADAVPGSLSDPSWSEWQNKAMQADELLQNNDIQSWEELMAEVFKVPIKPDETADAEKQTAPNPVVKLPEWEWPSSRSDYARVDRRWVVYAINKVFGWEGQLESNTGRLTCRLPESSVLIYLVDAGHLSTSNVKSAFKDDVREVDKVEELIGEQLPIILAEVDPTMELLVGYLSGTQLGSSELVSSIKLLLCSLGLFEDGSRLPAVGDNTHIEQVTGQENEVVNMELDRAEEELQITEHYLDEHRTRGLGIAFSKLAACPAAETVKSLRRLFKPDEVLVLLNVLRAELIKDGWTTRYLDKINADQEDDAPPDASIQLIADLMSRCIDAVGLSGWMAADVMLSSSRTHQDSANFFSQFQAEISVALEGVMEAVRLKGVIAEAANYAKRARRALADSAKGKAMTVHMSAELPLGLKTDNKISTERVRSGGEIVARSSRQIGHFISKRRGIYSIHRISEEMLLGAAGPTVVQEAR 930 T 3.1E-10 Utp8 pdbpssm F Eukaryota T 6rxz 55 GB UH Y1780_CHATD Utp8 MAAKLQIHAPYVLHALPRPLDRSDGLGRYFSGEVFGQKQGGKRKKRTELAVAIDGVAVYLYDILSSQVVTSYLVSPQSCFTCPPSSLRWRPASSKTVTRYTYVSVATGDSVLAKREIRLFREETLSTGNTVVACISRTICSDSPIVHIFTSSPRNFLTNVPGKDIPNHDLIIITANGSIFALNGETLEEKWQVSPSVLSREILSDSKLALQVDFVQQTSAADVADGLFGGKNDLFGVFQERIHRDGFNPDFFVVITSQSGADSANARHLHVLALPSEREARQTGKENVISVFVAPLAVEETCRSFQLDVRSGTLQAISNKALVTYQFANGIAKLENRLQVPGLSSYLRLSKTSVLTSATDSLSVYNPIYRSLQAAARLEPTDDTNGHACEFVSYLASRELAVAIRGGSLVVIQIEAPKNRTAKRRAEGLLTDAIRRGISRKIAFEKRTKPEHVSDSTILADAVPGSLSDPSWSEWQNKAMQADELLQNNDIQSWEELMAEVFKVPIKPDETADAEKQTAPNPVVKLPEWEWPSSRSDYARVDRRWVVYAINKVFGWEGQLESNTGRLTCRLPESSVLIYLVDAGHLSTSNVKSAFKDDVREVDKVEELIGEQLPIILAEVDPTMELLVGYLSGTQLGSSELVSSIKLLLCSLGLFEDGSRLPAVGDNTHIEQVTGQENEVVNMELDRAEEELQITEHYLDEHRTRGLGIAFSKLAACPAAETVKSLRRLFKPDEVLVLLNVLRAELIKDGWTTRYLDKINADQEDDAPPDASIQLIADLMSRCIDAVGLSGWMAADVMLSSSRTHQDSANFFSQFQAEISVALEGVMEAVRLKGVIAEAANYAKRARRALADSAKGKAMTVHMSAELPLGLKTDNKISTERVRSGGEIVARSSRQIGHFISKRRGIYSIHRISEEMLLGAAGPTVVQEAR 930 T 3.1E-10 Utp8 pdbpssm F Eukaryota T 6ryo 2 B B Globomycin XXSXX 5 T 380 GLF pdbhh F F 6rzz 41 OA p uL1 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 210 F F F 6s05 44 RA p uL1 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 210 F F F 6s0k 35 IA k Cytoskeleton protein RodZ RKKRDGWLMTFTWLVLFVVIGLSGAWWWQAAAAAAAAAAAAKALIVYGAAAAAAAAA 57 T 0.006 TspO_MBR pdbpercent F T 6s0n 1 A A GLN-ASP-VAL-ASN-THR-ALA-VAL-ALA-TRP QDVNTAVAW 9 T 10 gp12-short_mid pdbhh F T 6s1c 4 E,H D,H CTF18_YEAST Chromosome transmission fidelity protein 18 GAMGNQTVKIWVKYNEGFSNAVRKNVTWNNLWE 33 T 2 Jnk-SapK_ap_N unppssm F Eukaryota T 6s1u 2 C I PRO-0A1-VAL-PSA-ALA-MET-THR PXVXAMT 7 T 34 Allatostatin pdbhh F T 6s1v 2 C I PRO-0A1-VAL-PSA-ALA-MET-THR PXVXAMT 7 T 34 Allatostatin pdbhh F T 6s22 2 B F FGF23_HUMAN FGF-23,PHOSPHATONIN,TUMOR-DERIVED HYPOPHOSPHATEMIA-INDUCING FACTOR NTPIPRRHTRSA 12 T 2.1 MGTL pdbhh F Eukaryota T 6s24 2 B F ALA-THR-GLY-ALA-GLY-ALA-GLY-ALA-GLY-THR-THR-PRO-GLY-PRO ATGAGAGAGTTPGP 14 T 29 MSP1_C pdbhh F F 6s29 2 B,D B,D MIS19_SCHPO EIGHTEEN-INTERACTING CENTROMERE PROTEIN 1,KINETOCHORE PROTEIN MIS19 PRVYETELLVLRFREFGVKDNHNHPINLHSLRSKSLIRAQGKKLDLHNRVFLRRNVRAVKM 61 T 14 DHOase pdbhh F Eukaryota T 6s2c 1 A A E1CI69_9VIRU Capsid protein PISADFSEVENAPSFLSLAENTDEVLKPYTGLEIQTIITNIVGDANPNQSRIFDQDRLRGNQYSAGGLVTQNAVSAIPFTNLIPRTIRVGNILVNSANRLQITETNVSEYYSNPIIATKLSEMISDQVKNNQFSTWRRDNTSLQGFNAFDIATINTAILPNGLSLESMLLKLSLLHSIKAMNVDAASINRSQYQVIDHNTVPTIGAPAVVGVNNSPVFGEDCGGNNPVYPFGGGTGAIAFHVTLQTVPDERKSYAIFVPPAILQATSDANEALALFALSMSEWPHALYTVTKQTTDLAGANAGQQVFIPTQSTIHIGGRRVLDLIIPRREIAPNPTTLVAANAMCMVRPQAGPDATAGAIPLAAGQLFNMNFIGAPAFEEWPLTSYLYSWAGRFDITTIRQYMGRLATMVGVKDAYWAAHELNVALSQVAPKMTTAAGGWAAQAANSAQQSDVCYSSLLTVTRSAANFPLANQPAADMRVYDTDPATWNKVALGLATAANLVPEQSMDVPFVVGDARTSFWERLQAIPMCIAWTMYYHSRGITTLAWDNAYTDNTNKWLQKMVRNTFSTTQSVGTIIPARYGKIVCNLYKNMFHRAPAYVATSVGGKELHITHFERWLPGGTYANVYSGAGAVVNCFSPVLIPDIWCQYFTAKLPLFAGAFPPAQGQNSTKGFNSKQGLMIHRNQNNNLVAPYLEKFADNSSYFPVGQGPEINDMATWNGRLWMTTGNVQYLDYSGAAIVEAVPPAGELPVGKQIPLLAGENAPIELTNAATTCVPRYSNDGRRIFTYLTTAQSVIPVQACNRAANLARSCWLLSNVYAEPALQALGDEVEDAFDTLTNS 840 T 0.16 TPP_enzyme_C pdb T Viruses T 6s2c 2 B B E1CI69_9VIRU Capsid protein PISADFSEVENAPSFLSLAENTDEVLKPYTGLEIQTIITNIVGDANPNQSRIFDQDRLRGNQYSAGGLVTQNAVSAIPFTNLIPRTIRVGNILVNSANRLQITETNVSEYYSNPIIATKLSEMISDQVKNNQFSTWRRDNTSLQGFNAFDIATINTAILPNGLSLESMLLKLSLLHSIKAMNVDAASINRSQYQVIDHNTVPTIGAPAVVGVNNSPVFGEDCGGNNPVYPFGGGTGAIAFHVTLQTVPDERKSYAIFVPPAILQATSDANEALALFALSMSEWPHALYTVTKQTTDLAGANAGQQVFIPTQSTIHIGGRRVLDLIIPRREIAPNPTTLVAANAMCMVRPQAGPDATAGAIPLAAGQLFNMNFIGAPAFEEWPLTSYLYSWAGRFDITTIRQYMGRLATMVGVKDAYWAAHELNVALSQVAPKMTTAAGGWAAQAANSAQQSDVCYSSLLTVTRSAANFPLANQPAADMRVYDTDPATWNKVALGLATAANLVPEQSMDVPFVVGDARTSFWERLQAIPMCIAWTMYYHSRGITTLAWDNAYTDNTNKWLQKMVRNTFSTTQSVGTIIPARYGKIVCNLYKNMFHRAPAYVATSVGGKELHITHFERWLPGGTYANVYSGAGAVVNCFSPVLIPDIWCQYFTAKLPLFAGAFPPAQGQNSTKGFNSKQGLMIHRNQNNNLVAPYLEKFADNSSYFPVGQGPEINDMATWNGRLWMTTGNVQYLDYSGAAIVEAVPPAGELPVGKQIPLLAGENAPIELTNAATTCVPRYSNDGRRIFTYLTTAQSVIPVQACNRAANLARSCWLLSNVYAEPALQALGDEVEDAFDTLTNSSF 842 T 0.16 TPP_enzyme_C pdb T Viruses T 6s2d 1 A A Q90VX5_PSEAM Pleurocidin-like prepropolypeptide GKGRWLERIGKAGGIIIGGALDHL 24 T 5600 Antimicrobial12 unppssm F Eukaryota T 6s2e 3 C E CTF18_YEAST Chromosome transmission fidelity protein 18 GAMGNQTVKIWVKYNEGFSNAVRKNVTWNNLWE 33 T 2 Jnk-SapK_ap_N unppssm F Eukaryota T 6s2f 3 C E CTF18_YEAST Chromosome transmission fidelity protein 18 GAMGNQTVKIWVKYNEGFSNAVRKNVTWNNLWE 33 T 2 Jnk-SapK_ap_N unppssm F Eukaryota T 6s35 3 C C ALA-ARG-(D)LYS-MET-GLN-GLU-ALA-ARG-LYS-SER-THR ARXMQEARKST 11 T 15 DUF5915 pdbhh F T 6s39 2 B P P53_HUMAN ANTIGEN NY-CO-13,PHOSPHOPROTEIN P53,TUMOR SUPPRESSOR P53,P53PT387 KLMFKTEGPDSD 12 T 44 CopB pdbhh F Eukaryota T 6s3c 2 B P P53_HUMAN p53pT387 KLMFKTEGPDSD 12 T 44 CopB pdbhh F Eukaryota T 6s3d 3 I,J,K,L M,N,O,P S0_2.126 ASPCDKQKNYIDKQLLPIVNKAGCSRPEEVEERIRRALKKMGDTSCFDEILKGLKEIKCGGSWLEHHHHHH 71 T 0.18 Spo0A_C pdb F T 6s40 2 B P P53_HUMAN p53pT387 KLMFKTEGPDSD 12 T 44 CopB pdbhh F Eukaryota T 6s5p 2 E,F,G H,E,F SBL2 KAAKACX 7 T 39 DUF6073 pdbhh F F 6s5r 2 E,F F,H SBD6 XXX 3 F F F 6s5s 2 B B SBD8 chain B KPLXFKX 7 T 51 DUF4133 pdbhh F F 6s5s 3 C,E C,E SBD8 chain C KPL 3 T 220 SEC-C pdbhh F F 6s5s 4 D D SBD8 chain D KPLXFK 6 T 35 DUF4133 pdbhh F F 6s5t 1 A A SIDJ_LEGPH SidJ MFGFIKKVLDFFGVDQSEDNPSETAVETTDVSTKIKTTDTTQEESSVKTKTVVPTQPGGSVKPETIAPDQQKKHQIKTETTTSTTKQKGPKVTLMDGHVKQYYFARRGETSTHDTSLPPPVKVLSGRSIPLKEIPFEATRNELVQIYLTSIDKLIKSNKLNSIPSQQIASHYLFLRSLANSETDGIKKNQILSLAKPLGTYLASKEPHVWKMINELIEKSEYPIIHYLKNNRAHSNFMLALIHEYHKEPLTKNQSAFVQKFRDSSVFLFPNPIYTAWLAHSYDEDSSFNPMFRERLSTNFYHSTLTDNLLLRTEPKEVTLSSEHHYKKEKGPIDSSFRYQMSSDRLLRIQGRTLLFSTPQNDVVAVKVQKKGEPKSTLEEEFEMADYLLKHQRRLDVHSKLPQPLGQYSVKKSEILEISRGSLDFERFKTLIDDSKDLEVYVYKAPQSYFTYLHDKNQDLEDLTASVKTNVHDLFVLLREGIVFPQLADIFHTHFGEDEREDKGRYQALVQLLNVLQFQLGRIDKWQKAVEYVNLRSSGLADLGDSLPITSLFTSSDFTKHYFSELLTGGYHPTFFDKSSGTANSLFTGKRRLFGNYLYLNTIAEYLLVIQLTLGSYGDKVTRDMMDKPKKEAVWRELANVMFTSCAEAIHIMTGIPQSRALTLLKQRANIEKHFRQTQFWMTPDYSKLDEDTLQMEQYSIYSGEPEYEFTDKLVSGVGLSVDGVHQDLGGYNRESPLRELEKLLYATVTLIEGTMQLDKEFFKQLEQVEKILSGEIKTDANSCFEAVAQLLDLARPGCHFQKRLVLSYYEEAKLKYPSAPTDAYDSRFQVVARTNAAITIQRFWREARKNLSEKSDIDSEKPESERTTDKRL 873 T 0.34 IQ pdb F Bacteria T 6s6q 2 C,D C,D CIF2_ARATH Protein CASPARIAN STRIP INTEGRITY FACTOR 2 DYGHSSPKPKLVRPPFKLIPN 21 T 0.2 Inhibitor_I53 unphh F Eukaryota T 6s7g 2 C,E,G,H F,G,H,E SBL1 KAKACX 6 T 200 zf-CCHC pdbhh F F 6s7l 2 B,D C,D GLY-ILE-VAL-ARG-GLY-ALA GIVRGA 6 T 50 PRP1_N pdbhh F F 6s7t 10 J K PEPTIDE AANATAA 7 T 700 UPF0176_N pdbhh F F 6s8r 2 B B GGYF1_DROME GIGYF family protein CG11148 DENLPEWAIENPSKLGGSFDASGAFHG 27 T 1.6 Tipalpha pdbhh F Eukaryota T 6s9k 2 B B CASP2_HUMAN CASP-2,NEURAL PRECURSOR CELL EXPRESSED DEVELOPMENTALLY DOWN-REGULATED PROTEIN 2,NEDD-2,PROTEASE ICH-1 DYDLSLPFPVCESCPLYKKLRLSTDTVEHSLDNK 34 T 10 DUF4073 pdbhh F Eukaryota T 6s9l 2 C,D C,D LYS-ARG-LYS-ARG-LYS-ARG-LYS-ARG-LYS-LEU-SER-PHE KRKRKRKRKLSF 12 T 3.6 DUF4604 pdbhh F F 6s9q 2 B P P53_HUMAN ANTIGEN NY-CO-13,PHOSPHOPROTEIN P53,TUMOR SUPPRESSOR P53 KLMFKTEGPDSD 12 T 44 CopB pdbhh F Eukaryota T 6sa8 2 B B LYS-ARG-LYS-ARG-LYS-ARG-LYS-ARG-LYS-ARG KRKRKRKRKR 10 T 3.8 RFX5_DNA_bdg pdbhh F F 6saa 1 A A A0A1V0FWW5_9ARAC U1-theraphotoxin-Pf3 RCLHAGAACSGPIQKIPCCGTCSRRKCT 28 T 0.066 Conotoxin pdbhh F Eukaryota T 6sad 2 C C CASP2_HUMAN CASP-2,NEURAL PRECURSOR CELL EXPRESSED DEVELOPMENTALLY DOWN-REGULATED PROTEIN 2,NEDD-2,PROTEASE ICH-1 DYDLSLPFPVCESCPLYKKLRLSTDTVEHSLDNK 34 T 10 DUF4073 pdbhh F Eukaryota T 6sat 2 C,D P,Q FTSZ_CORGL Cell division protein FtsZ DDLDVPSFLQ 10 T 2.2 DUF4809 pdbhh F Bacteria T 6sb1 1 A,B A,B MPEG1_MOUSE MPG-1,PERFORIN-2,P-2,PROTEIN MPS1 ETGGCTNVDSPNFNFQANMDDDSCDAKVTNFTFGGVYQECTELSGDVLCQNLEQKNLLTGDFSCPPGYSPVHLLSQTHEEGYSRLECKKKCTLKIFCKTVCEDVFRVAKAEFRAYWCVAAGQVPDNSGLLFGGVFTDKTINPMTNAQSCPAGYIPLNLFESLKVCVSLDYELGFKFSVPFGGFFSCIMGNPLVNSDTAKDVRAPSLKKCPGGFSQHLAVISDGCQVSYCVKAGIFTGGSLLPVRLPPYTKPPLMSQVATNTVIVTNSETARSWIKDPQTNQWKLGEGTKHHHHHH 295 T 0.37 UN_NPL4 pdb F Eukaryota T 6sb4 1 A,B,C,D,E,F,G,H A,B,C,D,E,F,G,H MPEG1_MOUSE MPG-1,PERFORIN-2,P-2,PROTEIN MPS1 ETGGCTNVDSPNFNFQANMDDDSCDAKVTNFTFGGVYQECTELSGDVLCQNLEQKNLLTGDFSCPPGYSPVHLLSQTHEEGYSRLECKKKCTLKIFCKTVCEDVFRVAKAEFRAYWCVAAGQVPDNSGLLFGGVFTDKTINPMTNAQSCPAGYIPLNLFESLKVCVSLDYELGFKFSVPFGGFFSCIMGNPLVNSDTAKDVRAPSLKKCPGGFSQHLAVISDGCQVSYCVKAGIFTGGSLLPVRLPPYTKPPLMSQVATNTVIVTNSETARSWIKDPQTNQWKLGEGTKHHHHHH 295 T 0.37 UN_NPL4 pdb F Eukaryota T 6sba 2 B B VGLL4_MOUSE Vestigial like 4 (Drosophila), isoform CRA_a SVEDHFAKALGDTWLQIKAA 20 T 0.00088 Vg_Tdu pdbhh F Eukaryota T 6scs 2 E,F,G,H P,Q,R,S FTSZ_CORGL Cell division protein FtsZ DDLDVPSFLQ 10 T 2.2 DUF4809 pdbhh F Bacteria T 6sft 1 A A A0A0H3CCM2_CAUVN Two-component receiver protein CleD SKPREWVEAVAYVGPDRRRFNSADYKGPRKRKADAS 36 T 14 DUF1816 pdbhh F Bacteria T 6sg9 3 C Cn Q57VQ9_TRYB2 mS38 MRAGCVACSRIPKLGIAALGSTTTSMQATCQETQELRCATTQMAPAVIFPLPPPLRGSYIDRKPTAASFNEHHATASFRHHMIASADVSHRPRHFYFASGTRNDTGTRSVKEGERKQLKQQTNVEVDSVAYGDQLDELSARWAAKFYGQVTFGPRNYPYPSSRWLARRFQMKKHRIIKRFRFRRYKLAAVANLPFAKMIRVGMLPELKSSKTKRGDVVDPTLSGQLVSAVKNTGEKRKGQRTRPKSKYQV 250 T 28 DUF2996 pdbhh F Eukaryota T 6sg9 4 D FJ D0A8R9_TRYB9 mt-SAF18 MLCHNRLLLVNKQTQKYRTKLRYRFRQPSVVPLRQTLQQRHNTILEVLRRRRINSGDQSPYRYVEERLYSKPSRLDREGVKVNKTYALQGLGDLEPLRYGANFGISEKDALKYETVAEKAKYMEPPIPYSSLAARKLAAGALWPAAPDPEGMISKEVRLLRHESSMSPSARAFSERVAYHLRRSLKACPGHIAEHIDFTQLIIQEVLGSRRSKEIYIVWFTVDPGARFELEPRLHQLNHWVQQLIIKRVKRRPHIPRVTWIYDGGRLERELPRDVKQELQSFVADAATTLESRVKYLKELDTMNQRMKDIPWFMPYLWSKEEKAARQKSMLADLEEVERRKNEHSSGRSAPPRTSPPPQFVR 362 T 0.041 RBFA pdbhh F Eukaryota T 6sg9 5 E FY A0A3L6L5M9_9TRYP mt-SAF28 MWRLSRSLRSNSLHNPGPFLDGALQLIKLHLAHKNAAADKNTKACSDIEGEFLRELEAFRACFTMSSSLKVAKLYTKKLHGALSYFQLYDDPLMRQLDMIIGKQTMQPSAGRQHGVFKAPVAARLDPFFLDEREETVLPSELPNPPKPDPSTPLRERALKVPAQHRGHWVLRDPDIAITREERRTDPW 188 T 7.4 Hist_rich_Ca-bd pdbhh F Eukaryota T 6sg9 6 F Fa Q57VU7_TRYB2 mt-SAF30 MPPNSATRWLPFVSSDLKDYLNRYWAVMFTVGARPIETGHIRHYVSWYCTRMKVVLLDHHVYVEPLRQQLQEASRTPELPLLFVNKKLVGTLRDVELLEREKKLKDVLHFGFEWRVGGSVAATNGQKSLMGALPAPYGDAEFFRGRYRGPPVARPVVSLPTLHPFALRSEE 171 T 0.15 Glutaredoxin pdbpercent F Eukaryota T 6sg9 8 H CC uS3m MFLIHFVHYKTILQKYTFKFKHIFLSIDKYNSLFFNISGILIWLNIIHINIILIKYSFFILINNFEYLIILIST 74 T 0.022 Prophage_tail unppercent F T 6sg9 11 K CN Q580I0_TRYB2 uS14m MFCFSRGLWMRHIGQDVPKRHTHFVLESRLMYEKSFRDCWLHSVCRAISQLDEPLSKTVVGTHQKMLQRKVTCFQYNQYGLFKTPYYRLANVDRYHAVQGVAGTREWVPYVNVSYWTMNKMVRGGNLLVHRVHYTGWGTDSHLKKGGWEHRWNKVLQRNVLQYSRI 166 T 12 DUF6462 pdbhh F Eukaryota T 6sg9 12 L CS uS19m MAFRNTFTTPGKFSTVSKNIVLLLIWRVKVFLRAEGFAHSLVMLPVSLYSKILLCDVKKKIVYFHCCTRKKSMLRRCPCVWPDGPTKSSVSIGTAFTLQKRFLKIAKSAFGFYLARRGQRKYPFLRRPHIKNTHSMNPSAPYFWSFMTAKSQMAFLPEENYITGDWTGKFFVSKRQVYTLQHATSGAKVRVKSFPSIFEFNSPSRWNIGKEMNTLTKPRMDLIDEQMLTKKQRLDYVRAGLLPK 244 T 4.9 LIN52 unphh F T 6sg9 14 N Ci Q57WW0_TRYB2 mS33 MVLRRWFPLLGVHRVGYTHPSTLPVPCAQRWDLRLARARIFQEYIEEKAPGAWQLEDERSMSPEFKTFTGYPMRDMRPGYGQNLPDYIMKKRLPNNTHYELFARRDIPNEDNAMYGKYLYDMTVHGTSLPSTYRMHKDINKAQRNDRKLSGNRFKVLCSSGAKNPPSGWEPIPDATEEEED 181 T 1.1 Nmad3 pdbhh F Eukaryota T 6sg9 16 P DB mS49 MIRRRVCDGARLAPNIRATFSAVRYQSGLHTFIRDSKPSNFSSVRRSENANGDATASPGATAGENPASSGDWASHMQRELFGEVDPLGGQAHKDYYRDVTRGYSPQYAPRNFANGGAVAYPHIQSPYEYEEAAHRRVWLDHDVDRMREEFTQHRASLRSLASAQEREELLRSRAAEYQVANTVHESESVHPIQQLYNSGGTSRSALKQQAVADRYSIAEQHSPLPLTTGVDRDALDEAQRTKDRILNDSFTAENLLITHGLREKEKHDFTILQRTVRIPFQGYDMDRFLAQQKGTPYGAQQLPPNVVPSSMEEAQRTLRGSSATATPLVDAVAQKVYARNTVVDRPAIGEQLTEQIINIMRASRTTAEQQREEERAQRFGLGRQGALVQDGGPDQRTLKKHTNDERIVDAMLFQQNAYRKTPTDEHWNPYIRRSTENGVGHLLQNKFDIMRREDRLSKGEQDLTERNTIHYGVPIQQIVDEFVFRHRNARGERPLDYFKPFPNFRALRLNRMYRDVEGFSLMKQRPEFLEWELFTRYRQHHQQRRRLALLHGLEPVANETAQERDTRRHRLDEICERTPFDEREMRVNDDEMRVSVETLRSWFGVYMLPSPTVVNAVLGGSASVNLHLYHLADEMGTADTREHVLSSRYLNRLLLLESYQNRVGRGFMNHVVGRAPEPVVPHEQPQEVLRHFSAEERAMYEQHVKEQTSRQLGEWERAMKRRRWLTDHQQYGHVVSHGLETSVVDLSHTETGAVLTVSTKAYEQEIEAVRMKTNATIKVDGMVYNLLPNSERRVVPLTVQLDSGEKIDMTSEDFDRCELEAFPRNLNHALNYGIANYAYNRGNYVETQDSIWEEQTASGQEGWSPATHADGLREGLPVRARRPIFSSSAEQRIAGGPQRAVIIQYHHQPFFNPEPRLVKVAFQCDGTIMEVPISDVMIWQRRYHGPERTVGDESRRYNPAAMRRYVDVTDPFNEKTSNTEHFLDKYEPKRNADTVADKYRTTKQITEIDKWTRYDSARADNYRPLSISHRRDYIRMGYIPRYTPWEWIAIQEADQPLIAEQIRQDNIGTSYFFSLNRYWRYKASPHGYIRHFENEVRDLLQYVDGVTPWKQAQKIRTYWEVRSHHPMPQFNRPEVAMHRNTVGLLPAHMWETDKKTGKVKSVKDSVRDYQTKTPYPKWVQL 1181 T 0.098 ACD unp F T 6sg9 17 Q DC mS50 MFRVRRLAGFIQCSPCGKMGPLKQQTRRYTPIWKSDPAVDNVAPLRDEDERRALWAEVGPISDVGSAVTAWIRFGNDPVLHTAVPTMLGGKFRNQQREKESLLPNSSSPFAYVEDYMGTNLVFGSPVHAKESAAVWATYFERRYASRLRLSRRTVANYVGLINSPEVFDDESDRPETRWSQDTFFRECAYLSEKFLKEKVSNMQQFEAALKRASPEAYLAFFDAFQQQTQTQIPLPSPSVWHYEGERRKQWAEKFISISHKAQAFFKDVLSEDVKKYQEVPGKLLQKVKPVLADVGKILVKRHERWLKGRVWTSLTEEEREAYCMKEVKRQQMQVEDGEFDPMMEDDVDDTELEEWQREHDAIMKLMNSPIDGLHFTTLELWLHTMRCEELETEHIYTSARVRAIQVAARKKLYDTTSYEEVIQAVVESIARGTLDLGAGVLRPHFNEVWCQLNYAKFGSSTITQHTTTSRRQLLFFHAGSLKDIAATATLYYATKPLSNSLDYASPYKYRRSLITLCSNYGVETAYTTQRPLLRSAANLARAEDLIHAVVTAAAQPFGERRRAATRDLHMEFQRLAVPVERVIVANPVSALLESGADPDEKPVEGEKVNMWPLGAKRVVLYKWSAPNVEKLKAMESDAASAVSGSSLTAKRLREIQELKRRGFLEVSLWRRVTAQERKQRNEIVEAKKKQVEEVVRTVPSLAHLHQYATSLYSRIEERVAFPTETSTVTETTNMKEETNKPLEDSEWEFAVLLDDRVLLNKEESVELYLPYRDANGELLAQGEYRALVRAFDLEANPNLHPAYCSVGYSESFQVFDALPQLIAQFFRVKDATAEAAGVTHIPAADFTPFCAFLRDAGLDVPLRCEFEAGQAVTTDGDVYMDYFLQLLRGEAFHQSHAQAGLTEAQRAIEPLCRAHWVVHHPGADESEWATARRSVLDHAMQHEREWWFPNEMLDVKDVVTGSTNGLTPQMYPAAVRYGVELCTVLTAEGKFVDERGSGLSARCVVNGTGAAESVVFDTANCNGTNTTSVEDALRVAHGALRSAQDRHNTLAAFRLGPLSKQSQVLLFCGVNAYEFGGKYARTYAYAFEKAKKELEATAASGFMAPSLSHEDTERLSDQPTTSPSVDRFASTTHPEQRKAQFVPRVGPGSTPLEDPAADQKSEWS 1165 T 0.23 CT_C_D pdbpercent F T 6sg9 18 R DE Q386Q7_TRYB2 mS52 MRRCSFASTCRYRSADDVPLGLGSAYVWTCRNITSRGQFNPIHNFSYAMERGVRARDVKAFEKLITNPGPLRVAYTPDYLDWLHRCYKAKGTYMDARAVAEKKFNGNIVSSELSAAVNRREGKRGDTRGDTVDNDHHNLPGAPPPGMFLRPAHSFRRLAGELKRRRAQSILDEVARAQGMLDLFERQPHFPAIHIDRCSRFHLVELFKEMVLERSLDSNMIWEKALLYRAILSERKPSYPTSFHYIFTAVEDTVFAPTISTPMEMAGSGGEGHDRSSHPLAAKCPTLEAYYYYVYLVKKYYIDNAVEAHVVLRCHREPNAADLLFSNPPPKDDTEIMKAVELLRNADIQRGVAAAAAVSDPTLPPGGEGSVIGNSDNKKNSEKTSEGSRGRPARPPVLPGAYPPIDMLWRCEENLPLLKVLLFGEFNLIVSENPFVKFPSAHGFLTRPYSTDSSRTLADGMSLANVMAEKRGHLLPSLPRNTATSIDARAQDIRRLQQKHHRDDIVSFQKLLRSTHAEDSPSAFSSYSDWSYFNPRAVRAEERDRLTRKAVEALKLYDSATNDIYRHSFEDVQACHTQRVTERDRTMPPYLPTLPHFVAIIKKDPHISFLLHIGLPDRNSSEEGSAKHKELEKRIYYLARALYHTALEYHNETVRRVNRQKVNVAASLLDNFVEQEWTTILRDKHDVTDVTKTLNDTQNDKKQLARRLGRYMLFANRSLDDTGFPTDARADDYTRWMAPPSVGKVSL 747 T 7.7 Complex1_LYR_2 pdbpssm F Eukaryota T 6sg9 19 S DF mS53 MSVSGVFSKGRGIGHEATTSILRYIPRARVPWQPSRFGRENLTAADMARLWSRGRYRDGPGNYNSGYCTERTHVLEENTVSIIPRRELEKYMPDITIGPKALVTPVSLMNARNGHRVTHDLLHSYDPHIGRLGKPAVVDHDNITVEDPNRVGLNAATLDCRGRIYRWLRRGPFFQVDNYFRRSVKLNRDGTLPTDFVHEAPLMRKIIRLAHRGHLKAACEEYRRVTTVPPVEVYRALTACCVPGAKLADAVSIFEDGDSKLFYVSRDGEVLHNLMRCAIAARHRARIMWVYNVMRGRFYENVVVRAEVDLIWRYRIAMIALEYLLDHECAEEAAAIYSYLVEEELLRCDVHVRVGLHMREAIAAGKPITLNNDVMNATSLVRDATAVAPEVARELQRRHAQTLQNNAVEAVGAENVREGSAPWSILGPLTAIGPTAEDTMVWLQQHYGDVDVMSIMRWARFRKGKDLMAKDRPQYLARAAAWIELLSKRNREMEEVPLTYMRKSKPLVLDTNSNVRVAWQTPLMRSGGPPRLLAREEGYVFHHSNSSRFVEETYRHPGESLQSRYLALQPLHTEVSAKEDFQRLYYQAQKHHKQQERLLPVSTAAIPSRIVHHSVMSALHGVSGKGEPANRSLFTGNKSNDSHSNTGVASGTSACTDGAGSATPEF 666 T 0.00091 PPR_long pdbhh F T 6sg9 22 V DJ Q584U8_TRYB2 mS57 MFRSTRPRSVGYTPVNPDTSPMVAYSQYHWHYNLPQGMERPHSVNRTFAAPFQSNHSLVNKYRGVWIEFDMHPAFSVALEPQLRKLPRGRTLPKTPAEEVIADYTALAPLVDDEKTRDLWLAKVFQHCAFQRCGGAMELWERYCHQRFTAEGATAKPPLSLVKSVLFYCNKTDNSGWRALFDRCLKDGWNYTPLFDTAQWSFMLKSIGRMGDEDGVRAVLEEMLDVQADLDRVEARSVVIALNAVTNADVYEFVKKYLFNFGERKVKFLRTTYSDLRGHGAGKLRIPLKENDNMYYHVCWHSSIRSPRQFSPRQLYFDYTPSTLGSSSHNPNAKIDDIVKDKIEKWKAEGLLPEDYVHEDRVYDRSAAFKNVARQEKWKKMPKILKSKRMGYTGDP 396 T 1.1 RPM2 pdbhh F Eukaryota T 6sg9 25 Y DV Q57UZ6_TRYB2 mS69 MRRVRDGLFLSHPSSALQCSVAVVSTGEFDHPPFQFRQRHTFNTTPLHDANRFGGRTAYLREIGPVNIKKQGRRFKKDPRTVQFNVDVWCAQQTLRKRWKQRDWEVIEIPFRLVPREQQRVIPELYTDIPQMTDPARNDFSNIRNKVYDREELQGVLFPAAGAMLYPPLQRVDKQAMTLDKYL 183 T 0.13 CNRIP1 pdb F Eukaryota T 6sg9 26 Z DW mS70 MLRFTHRALTATPERFSVLGTTHPKPKRTGFGRNNKMRSKPSDNVAWYDKGPVEWLPRPVRLTYDHLDQLQQWTMRATLDGRTEEFNRIRDLHREWSQHPLMPVLGDVEPKFPLNLFKQNHRAKKRFLVRWHKANTPANWLWMPRGPTVVTPLHRTNPTQYPENWKQMVQRKSGTGTPS 179 T 16 THDPS_M pdbhh F T 6sg9 27 AA DX Q383G5_TRYB2 mS71 MFHRAFVSSSDLTGCTIALSSVCTQKRYWAKPKKRPKVGQGFHEKAQKWREEYLLDRHRALADSLRAYVEFSTSKRVEPWDARFKPFDRIEKDGVYVLMRYMMEEKLQLCNYHHRPVKRLFCNIGLMGPQITTKARWKPYRFATNPAGTSKAERMYQRDKTVYTHGHND 169 T 0.18 Tox-MPTase5 pdbpercent F Eukaryota T 6sg9 28 BA DY Q57YD4_TRYB2 mS72 MLRSTLTSLNTFLTSSVATPPISVIRTGPKWWAHPERMVRQKLMYFTLGVDQLPLRRTAVIQRDLQRFHMCKPPPRVGDSTGYKRSRAAQLNTWYRRIQYQEYHMQHLFTRHVWGLLRVYPGNTTKIQGKADDGYVGYDSVPFHRYNRAPLPFPARELYERRK 163 T 37 MBDa pdbhh F Eukaryota T 6sg9 31 EA FD Q583U2_TRYB2 mt-SAF12 (KRIPP18) MLIRRLRRPHSTVRGCRISSASNSDSAGAFSANGTQLPDPPYYLPHSPRFDAERCGTFNKKWLLNLPALKPLVRNSTYLPKKEELWRAPTHEALETIIGHLPYHDALRYITEHSLFLLFPTVLRARDAPLPHVIYEDFMKSCTFASLQNPPEEQFALPSVLLRTLLCMAAYHCTLDADYFTTCQMLFGRMEQQQQTTPEVLSAWVYCCTASGRVDEALTYAKYMADCSAPFDVTVFSLMQHPSLNPIEVEDGSVPHSAKGLLLQRRLGNRLHTAYRSDAVAAHGMFVYYALTLSHVRKWEVIRAAAALGVTLAERTVVLAVEVFAREKGMRCGPKTVKALTHFLAQDGTVGHLLYVLLRARKNELLPEFRDLPHTTFSEEEQELVLQCVAQRARHDDSFAVAATLVSSLVREDDPSELLMAFARAARNHHSADVCGGDGDGSVCADVPAPVPESPPSNSSEIIEKDRWAVVQASVRSLLLDVNALDQASRRDAYHKHWKNGNKGVKNKENVLAKTLTLPHDSMHTATIQRKEKELWELMRSDTPVGVRELAQLNIMEELQEAKRLERAEMAWVNPDGTF 579 T 0.3 PPR_long pdbhh F Eukaryota T 6sg9 33 GA FG mt-SAF15 MRWSAVLCKVSKVPTRSVIPADALRYHPRYSVPRKMVLSNTFNVVGENNRYTSLKLILEKLSGHVGRRQYKMLCNLEKHYDKLKDEGIDWHELKWLSREELIVLFDKVLHLTRTERAALLPAIEAKVCGVLRQTDSRHSTVVCMNRGNANHGWRCGRNGHTNDVYFEGKAKANTAALQLCRRSSVRSVERSGVLVEVRSEPDFSVVGRFDHSLSPRVWSTPENPTFQVTTIGYEFRVHQEDPRVIPQIVEAAEEWELHANITKQVIWEMLEMYAVERDRQPLDLKPGEMGDPDIPSRHATAFNVQVVPPLSEGDEAIQVVEQRIVRADGSEVPWFQEPPPQLFSGGIPVILPFAPSIIVKSTFRQVTRSSAQDVTRQLLQPVVDVTCFLHPNVCFWWNAEDEQRCLGHIVDYAKRIPFALPFNLYFRVNLSKDLRGVQNYTEELGKRMSMKAHYFNLRSYGVR 463 T 0.089 DUF1285 pdbpercent F T 6sg9 34 HA FH mt-SAF16 MLVCCRSSLSLLARATMPLCCSRRFLTHQNNIDDISGPVDTNSNSVSDGRLHCSTGEGGKASTCERVSLRTIAESLGAAAAAELRAEVERDTRDGVAAIPPLPPLGWRVRHPSGSNYFVMTRTLKNGVQSAELNNRRYRSVHDIFLQSLQKGGHKYAQKGKGSEGRQKDAKEEEEPPSQEDGKSSPKVTVGYDREGRGHRATLQRMDELHDSPKLSRADVHLTVFAPFRVYDPSLHDPTVDICEWSSFDLVVQKTVPDNMVANKLLQPLSCTPQDGALSMYVCLASVNSEMRIRSIQLLSMKEAQALVEHACFGNGEPLFLELLRRRGRRRPLVERRFDDPRLRYEEVAQPQQVADEAAVACSSSCYGPYYPAFEMLMDSCGSAGEYSRALCYGGPYVSELSRELCDALLDYIKGDLGVSDQLCEYVCQMQFFLEQEEYMTWLGQVQHVANAVSRTA 457 T 3.8E-14 MAM33 pdbpssm F T 6sg9 35 IA FI mt-SAF17 MQRTLRSAARRKWGQKTWSPTATNGGAAPANGVSAQEALQIAYRPMPPSQTVEYEEDFGHNLMIHREYISKRCRDRVSFELSALSYSNLELRRGQEHLAGIMNRERRGVSVGASGAPDDQVQMQTDVDANSREVLSARYLFNERRLQFCDRFQNFFQSKLENSAASDSNGHEKQHLFSLMEACAVIFGCETEAARETYYRMFLGLDSETLLEEDEALRNRIADAKLVQRVLENNKGRQEVTQSPKLQQQQDQGKPLHAVSSGTSLLNDCEEERFISSIPELSLFEDETEARANGFVEGEDDVKANNGDLTAGSFSSPASSVNLPEEFEEYAPLYKAYITHAVGKGPVASYDISTLGSTGLTAERRRWRTLMEKIVREDYHTMTEVEQMDAIVLNEQLHTVKFFDLKIGDAIRDILQLLQRETGVGSSVNRDTPVGISPNNPERRV 445 T 4.8 DUF3221 pdbhh F T 6sg9 36 JA FK Q57XS8_TRYB2 mt-SAF19 MHSSLIILRHAYFSALHPARRVVPGSLLPVRTQFYTRHFTSTAGPTCGDGGETYKSEPTKVGASVEGTNSGNGVTDSPSLFSSSAPTVRRRALPPSDFPENALLKCIEKEIEDEALRLDKEECPPPPPTGWEMYHAPGTSVFYGRRWWLPATASAETRATPERHTIRVQLTKRDPSLDPECDVRGEHFPFSFFVQRAPSKGEAVRRDGTFRMGDSAAAGDVKGRTEGKEEEEEEEELGLYDQSIEVRADFVDGELLVDNVVFHGTFKTGSSCSKRSGNTSPEAAAATAAGQHDNTTGGRGKVEEVRYNNIFNGYPGPNLDEAEEEVLDGLQAWLAERCVDDQFGEFVGQYSVWVEQQEYEMWLKRLRDFVAA 372 T 1.9999999999999998E-26 MAM33 unppssm F Eukaryota T 6sg9 38 LA FV Q38C60_TRYB2 mt-SAF25 MMKRTILQRCIQNKSLEIARISRSDINSRAHLPFNFDVCYELGSREFTLFSSVGSTSVLVFCNVSSRRLRSVKGGQGETEFPPKRLNVKRRRQSGSADRSPVVFSAFVSMPTSGLTIEALCCSSLGLLVVDGVSFHQGPLTESMIPQDAHPGPEGHYQGPLLNQRSLAEMVVNTRGCITSVDPFRQQESFFDGHVNPWNARSLRFGHVPVHTAKPGFSDALCHFLEVFGVNDELAFFVEDFAHLVHREEETAWTNVLKTMMGGR 264 T 1E-07 MAM33 pdbhh F Eukaryota T 6sg9 39 MA Fe Q387L0_TRYB2 mt-SAF34 MSRSTVFGPGSLYSFTKFGSFNRSPTNCTLNKRMKDIFRLENQKHIRNDFDRERRYRMCTKCGITTVTINFNNVPSARVGLWGRCADDKDYTHHRMVDITQREYEVLRESPVEKRLNWWRYER 123 T 0.022 zf_C2H2_13 pdbpercent F Eukaryota T 6sg9 40 NA Ua UNK-a XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 70 F F F 6sg9 41 OA Ub UNK-b XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 42 F F F 6sg9 42 PA Uc UNK-c XXXXXXXXXXXX 12 F F F 6sg9 43 QA Ud UNK-d XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 59 F F F 6sg9 44 RA Ue UNK-e XXXXXXXXXXXXXXXXXXXXXXXXXXXXX 29 F F F 6sg9 45 SA Uf UNK-f XXXXXXXXX 9 F F F 6sg9 46 TA Ug UNK-g XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 167 F F F 6sg9 47 UA Uh UNK-h XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 255 F F F 6sg9 48 VA Ui UNK-i XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 32 F F F 6sg9 49 WA Uj UNK-j XXXXXXXXXXXXXXXXXXX 19 F F F 6sg9 50 XA Uk UNK-k XXXXXXXXXXXXXXXXXXXXXXXXXXX 27 F F F 6sg9 51 YA Ul UNK-l XXXXXXXXXXXXXX 14 F F F 6sg9 52 ZA Um UNK-m XXXXXXXXXXX 11 F F F 6sg9 53 AB Ux UNK-x XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 108 F F F 6sga 9 I Ca Q38DR3_TRYB2 mS22 MLRRAYIQRRYPFNKRGPREHKSWKHHVLTEPPKPLQWRDPKVWTRDLSVMKSFDAPQWDLWQSRPRSEDMDEALQPFMDMPKSLKDRRYDIPWWANPFGAWYLQNILSLELLKLKSKTNAEKIATYRSYMRSLASGKDNTMSDDDVIRNIIKERWKTLEFGDRNAGYPCTFGDYIQFLNEWFKSLDEEGMQRLREHFDRRIRPLLAVMSPVDILWLEALTQNSPHNKEQLQRKIAFQTSLGTPEFFDMSKRLRYEINEDYKVRDELGPELFALWSKAPERWPPERLSKMYGLDFTLVRKILVWHHFKACYDACVEPDWSLPKRLFALEWIRDVRARKHGLFYGKMRFAEQKITFYSDRFLFRDLVNRREASYANVWEMDDPYRFLQTEQDYEDYWGDNYDVYRRMFPEMIGRTGEPVQQYGQMPIWAGPHRQHANKSEHNWMFAEIGVNVGHEALKKLELDPTNEKRRRFVIRQPDGTLRSAKMSEMRAWYWKEEWADFRFWAPQMEWGIENTPSQEQYQEHVPDTTDADFRKQRRIQSRPVKWFYESHYTRTGSFAGFQPLRFMQRRTEREVRWPDVINAAVQIQKRKPAAYIFKAIPEL 602 T 0.26 MRP-S22 pdb F Eukaryota T 6sga 10 J Cb mS23 MRQCVVRRYKMPKNMGVAPRFDTWNEKYEPWEHMKRMGRLVGTGFYIPPEWYNHFRMFPPINHNFQQEKTLNPHNASEPTQDDTSTLSPERVALRDELARKSRLVASEGMRYYNIFWVRKPLDTMEKEYYELKRRGVDHGEAIRKVLQGFYSGLAVKKRVAAIQAEEAKLTGRFITMREATVVLGVLAKLHKEQLTPHQVSLLAKEQGETTQSGAKLTAIVSRTQPHVNKEASSPATSEAVGSSTEESLSADALASMLSEDGEQSAVGTRYQVEVKETANDSVRQLREKAEDQTGFPDWYTGESPTYSGTS 311 T 0.038 CHAD pdb F T 6sga 11 K Cd Q38DK6_TRYB2 mS26 MMRSSSFCRRQIRPYYNLPSKSEHGRKMTGFLTPYRHWMWKQNELWRNVHEAQFEHLRRVYKRQWLESFRVNADEYIYKYNITKAAQLAQWECEMKEQEKKRIEARQMMDGRQALKKKHLDLLREFHERQFFFWYERASERLQNMNLINYVPHAQLREHIDKELDKYVAGKNEPYPLNFVGQMPFLEDGDGNIVEVPESLLSNHMAEHPDSTAKPHEPHTSSSISEAAAFEERMLRAMVSAKEEDLKEWLGDDSRALSETIDDISREEEEREADIRVARSMEETDAEREVSRRAYIERGKTGSRSIFRPPTVSEGAGGTPSAPAGDANTPMRRRKKGKLDRVHALQAHQDELLAKLSSQGLKEGVDASSVPERGKIVQSRGRIRDKAVIPTHEVLMQKPELAAGSTPGARIQTKDMVDKMYHRGKYKKSGSGDKSDGEDL 440 T 0.028 ThylakoidFormat pdbpssm F Eukaryota T 6sga 13 M Cn Q57VQ9_TRYB2 mS38 MRAGCVACSRIPKLGIAALGSTTTSMQATCQETQELRCATTQMAPAVIFPLPPPLRGSYIDRKPTAASFNEHHATASFRHHMIASADVSHRPRHFYFASGTRNDTGTRSVKEGERKQLKQQTNVEVDSVAYGDQLDELSARWAAKFYGQVTFGPRNYPYPSSRWLARRFQMKKHRIIKRFRFRRYKLAAVANLPFAKMIRVGMLPELKSSKTKRGDVVDPTLSGQLVSAVKNTGEKRKGQRTRPKSKYQV 250 T 28 DUF2996 pdbhh F Eukaryota T 6sga 15 O DD mS51 (KRIPP1) MLRCTVVGHHRSGKARAFVFRDPSLRMMRAGSGYQQLRRMGMPMQVGMGWRKVDSFHANTQYQHAWPLLSHDDLGNSDQSNNTKNIMYSMYMPKRNKGTAPWFRGADTYSVKYCEQGRYEYQRYLMINRFPSEYKKHFLSFLSNIRMSSGSATIPQEALHWLLRMIVDNFNPQHVHYIAAMKTLQSAGELDMARDVWKIMERQQTWPCTATICAYLDVCVEAGEKTWAMEAWNRYCTELKFLEPGEVDPKPISRVPFSLTREELLYLPKWKKHFDHDPNLDVMDLNRFNRTREVYLRMAQVMLAGGERNAFQHFFTKLEEAMLNKPTPVPEPPNPHLVRRPRWAPYEHCKSVHHSPWRLQNNGRALALGPPVTIEDEMQSRFFSNDQFLVHSVKEVLRIVLQEHKRAHPTECTRCKTEAFFYKTKDADETLKFCDDLIERLFASLGVRLSNLNTSSLLSTILEVFRVVGKESGAALLQRANEFLERKASLGDAEGSRENLTASNYLQVLSGFADESAFVYNTKKDGTCQYKTGFDPRTTMRHLADVVQEIAGNPHVTWAADMHLQVVETMVGCGTMKANDYFVRNVLRQFSWDSRFLEALYVEYRRQDDVDMWAELTKRALVWTARYNAPASERLRRLIEDDYDTIRVQTRTFRELAVFQFRDVEERRHSRDVVNELPNPWYDYVAHALPFPDRDAGYPDEYGDLGQWRAPGGPGSPVRGPGYYAPPMEGEHQRGYTAEWRDLRNPMRPPEFPTPWERKYRQYARGQHPSYDMVYAGPMPEIFPMRRDFRKPTRWDFHDIEKQGKYRTSGPY 812 T 0.001 PPR_long pdbhh F T 6sga 16 P DI mS56 MRKFCHFMANCWNSARSHATYGAVPLTHSQVTSVYATDGGKVDELGLLELVEERIFSWKLNKWEMRIPPNLPNDQKELIRQEQENLKQILSEWRKCFGALNADILQISSLTGVPKDVVREKNRTWLQEEVAKLRWMGEVNKAALLRDAFMRLEAFGSRDFMFMERLCCIYGLARQGTFDEAFTNYITEDPVTNDIFVDERNPFKELVAHIVRNYSQIDIIYDFLGFNYSEGYRSSLRRYMEYLQCKTAENVRASGRLVTGDKGEHNILFDYCVSRESLVSGDSCQGIIDFLYINGNDVTLIIIASDNPWLRNRQLPHRRQMEGIARRVCFVLGIPPSEVRIRNLLLPPTYLDKGSIVRLNDIVFRLSNEQSNLLIPWLTNYNKELDPKDVDYTALAKTTNEEEWLTL 407 T 0.2 ApoC-I pdbpercent F T 6sga 17 Q DL mS59 MRCSCAFLDKSVFAAKRRVIVPIHPTPNFPAHFIKSAFTTDPLKEKQKARFSSGGEAMREVQDIPKNLEGERSRRDLASRGDTEFQALVEFIEGASYDQLISGRRFKKVYDVLSENDDMFIWLCHTAMSVLNPGDMRSRLIYNHLRILAESVASGEMTQRTAFRFFESAVRSPAYREIAKRQLEGGAATRLAGISAAADVMRRMGLTRRPMSSYFELYQRIVERSEAMTPWGFPPLFQFEERLSLEPRLKFFSRASQQSLERRRRGHVMTPHTTLHGRRIFWIPPTWNRAGRFLGPHVTLYPGMTPD 307 T 0.37 DUF2840 pdbpssm F T 6sga 18 R DO Q383D1_TRYB2 mS62 (KRIPP14) MRRFCTARVSSYVKRAPLLLPRGGRRWRSGAGTTADGGGEKEYIADSFESTAGSNAYAAMELAAEARREIHELWLSAETALEREKRVQQVAALIEKYKLDPSTPREADVSRGLGDAFDRLLLLCLPLGKTDAKGTDNLERLMHLAGRNGRELSVRTIQHLFARTDSFAEALAVFYTMRRCHVAMNMEAYYSMLYSLQRLEEEGWGQHFRNEYEENGAPSEQAMDFIVKGISNALLPENKPWLGRVMFQDRNVPDRRYDTRDFDELDTAWTQRYKSGTPAGAH 282 T 0.084 ECSIT pdbhh F Eukaryota T 6sga 19 S DP mS63 (KRIPP16) MLHGTPVRRASLRYRRPYWMMFLKGVDNWKIYTVIQQPDHQRTEMLYQAWLGGLDRPYTRPKCMANQPLWLSKKRHMLRKERLDGPETPLEKYVLEWHKKFHSFQGTERPTPDDLHTALDLVERPLDLSYALQLLGQCRNLNNIRFAKETFLVFLEACLRVGRRDCAEYALEHAEPLGFWFIDEDHRRYLQGEQTWYKLSPLDNLYYPVEENAKLNEGRKPITRLSPATESPGSGTAISDGEPSTDVEGETTVDDEIAQLEAELAALEREGGGK 274 T 0.19 MNE1 pdbpssm F T 6sga 20 T DR mS65 MFSESLCILSRRFRYNTKFPALVSYNKLPWEVVNHETPQFHMHVAPHYEQLLTLAASSPVPHIIGSKHIDVPREHRLRLLPGMLYLLDGDTLPGEFTINRVLDPTALQYYGRLSSQIVTVEAVRMLVPDDLRLLCNCITFKGPLHLPVAPYASLASLRGASQGGTTGSETGSNCFTLYHFVRPNRPPKELQLEKYYIHAPCVAPLSEFASNSDERGNWRPRLQAPKRTRRATPLPAYRPPQSYLMGLAERLAVVPGGCFGRRSLMWGHWF 270 T 0.47 DUF3475 unp F T 6sga 22 V DZ Q587C4_TRYB2 mS73 MLRRSILFRMKYADLELTTRGEFPHGMKEPAFVKKLDQNIPWYFSTYRSMYHWPITGDNWSDLNEAEKHHDLHMFYTLAWWKLGEGIFGVDEDS 94 T 9.7 Mastoparan_2 pdbhh F Eukaryota T 6sga 24 X F3 Q38E61_TRYB2 mt-SAF3 MLRGWHPNSSAMQVGMRHITIGGRHSRGGFRQPLGKHPQVKQGTVEGVPRRIPGTTKVTYTNKKGRTFSFSVPVSELTHPQVTLESAAGTWREMDTSFCELGDIEDDMPSPVDECLRGGSSLDKRLIQEVRERFVSFCREYVLMDTSGMKSTILSTELNAGPDYEHYDRRLRRKRHWLAIRHRFEDVRYVIWPDVVEETARGDSAQADVSLTNPSLTAGEMLEALLWLDAASTFCVRKVHPSDLGDKSEFLPLDLQREVEVVACHARRDLDFFDPSATSLEQFTACAALCVNHRVPFSLFFPAQDVCGDASVSTGQCIVANAPSPHTALGAVRIMALISEGSGSDIGKTIMFSDAFGAVTRFGILRGLSRVMSVEAFGCKDALENVNESELCIILHFCAEVREQNAAFFRRYEASEEDSDPQQVSFLAKYQQLSQIALARCKRLLYHPDSPRAQVMSEDGYIPLVELQRHAEGTNKAALIHYNLGIRSAQGMRRVALGAQSSARLAELVSRLEEASARVSGNTLVNDLVHHLSHKAAAGKMSLTLREVNTLLPLLSRMRRESPNGALDARFDRVFNAIDTAIGAAMRHNCTLDELLDLAEGLAACEMVPSALKQVEMVLIRSVMMHECSPMHLRRMLQAMFTLMRTSVPQVLLQSVASRVADYIKEASHMDSSSSNGGGDEKVKNHEECEQLLELLVVLGKCGYGALPGLVTIYWEAQLIDSMQLNPRLRCSYASLLASAAFALKKHDKRAWEGLADESHRLFMEYTRCNKENDIGRFAECVTGLAVLTQIKDNTNSSDVAFLKEYLSATSLELKSCEVIRVQELTDLLGRTLEWSEALGVVAPDVVIQLEKALFVMLENVSHTAPGVGIPDELVTAACCLVDMSSASLELRKAAAGVVGGAIVHAEEALETLRSGAPTQVRPGHSFDVAALASAERENVYKNSILQYCAALQRSGMSTHVEELWS 966 T 0.074 DNA_ligase_A_N unppercent F Eukaryota T 6sga 25 Y F5 mt-SAF5 MRHTIPFFRRSAFVPAPGSSLLNPRSQRAKVRRMVAAQKAQGENFERQALYAELGGSPSARAPRSKGERSKEATRRVGCEVAERAKHMTDAEWEGVPVDEKHAFAKYMHKVLQEHPTETTEQQRRRYFETTMADVFELDPRKTVRDEYERVKLGLPVHLKNPQYSLGVSQAVYDAADASLFDPENVHRLENAMTHVKQVFADYVHKKREGVSTEAERRMLANLTAELNLETQKHLANMFKYAEMRLRQVKLEERHHQLAEIERLRRMAQQRGGVKGRKGGSRKMSRMERLKRVINRAVGLDIAVAETVLTEMQAQEEFLQFCEVFARLTLGSGFKHTGKDENLSAYIESLRKLYSMDAATLSTLDVVQYYSSKEGAHPVDWAKRWYERALLLPLQSTPEYQKLLQIQQRDESTVKHIKETAGTGHAFACEAEAEVARIKTQKVVNLVEKMFMDPKDKRLESLHEKRLRYLAHMQMERQIRCVRENAKLFDGVENMPEAAQCRELYEKIMEKKTAQCNMTSPPEGEGSAIQSAKTEGDHCEVGPGMFNVYDDAEASSLFEKIREITLRVIRDRRVQSAAATKARMLNRIIRSLKGGERSIAEELRALHQQRKEKMTMRILGIIENDVKTEMEWLQNMEEAERPPLLPIPENMSYVSAADVQAWRELREDDERKAANPFERRRRTFQPELLGQAWSVPNKPLLFWGTGVSAVQQALRHVAEDAERKRQGLLLAPPYPCAENPWGWRLAKDILDDNN 754 T 0.12 FhuF pdb F T 6sga 28 BA F8 mt-SAF8 MMHRTAARLRMPRKPTPYVRKFLEGCPLPETLVDDIAGANLKSMAPFFTTAPRYIVAAESRLSKLFFHHALYPAGGARRPCRVLIVRGGRSVREPSFTINTGGGRGEVGGGSRGYRDPARRAYFYARPSTVGPFYSGNGGVSSNVAKCHSGVGSEGAGLVKRASVDGLLSPLCGVIEAHFAVGGTCNDAVATEGDGTESLTKGGEKLCVTLAGLLSGGHGGLMMDDGNCASNVRAAKRVARLLHDAAHHLSSFFYVHTQLPDSALFVSRPEASIASDGKGDGLLPSHLAREKDNDGRPSVEGVAVFRLAGGLEPTVHFAVGAPLSVLQRGVDGTASRGEKNSAEEGLNSAASTTVLPFGHIQCLLRVRTRGGKHCAAGKEGSEGTSNTPWCNTAGNDDITSNFAASGPQIAGGIVEPWKLGVSLDPKVPFFMRTLTEKRPSFSCGEGYLGTCSRSGDVDGNTVNDVSSSSGGSRTRAPGEVHMNHLLVRNDCETYLLPQRELLLSFHVPEEAEAMCKEQNEERMRRQAALGYGSPSHVFAEGPRTFARVLHGMKANLAAVEEASSTFRQGAAEGISPQVNGGSTTSGSSRVYEVRALPGDVVFVPRGWKYSVERIVGTAIIDAVAASTASPREALRAVFRTAPDPPLPQDVVRCDEGHAGAGEMPGGDSSNAEIVGVEVDAFVLCYKPYPVLSNAQASTYVAANYVHSGIDDFYAKGGNDVYHKYT 726 T 0.02 Cupin_4 pdbhh F T 6sga 29 CA F9 mt-SAF9 MTGPTRALFLSSGINLGRLRLAEQFSSMNGWQSKEDPAFDAYVKERRRKENYEAFDQRVERGYAAAAKLHKAEIQNAVKRRLKSSGAKFTAETLREMSSAVTERLAWLRDVWAQIDADYRSGDSARQETAAQEISAALRGEPNDYMRWVYETKRELRFAGPVGRRAIQEELQAAELPEVLDEEVNRYHDLKLNMMEIEREVKAKYGVAGQQHWAELQAAKDEEYIQKLDEAAEVYKQLLDQSARLDESRRSELQRSYVERVHQAQVRFKAAMELEGQREQLIEAHQAMKEERMRTEREKRRQLLREAAELRAQGKKSADVLTALKERQLDANAKRQAEYELKECEDILKRKSEMLDMIAHFKHDVEEREGREMLQRQKSDEERQVNVFGFYEEVGVEDGLSISSEGTTSQGGSSGLGTVSTSTSCAKSADSNSSAQPSQKLRKEELWKVINADTYEDPFRTVHQARLDAVKTYDPAYARTFPLNLVLGRKYSRQGAGEMAAGNETDKQILQKGNNILYSFQWGLNNGTVHDLDADGGTDYFMDGAFHVRDKETGDIDWRYEKKRGGPVFRGPKFYRLGAQREAADPGERAMDPTPYTSTPREHKWRSS 608 T 0.11 MDMPI_C pdbpssm F T 6sga 30 DA FA Q386U1_TRYB2 mt-SAF10 MPSASSGYTFADFLRRLERSPDSHMAPLYHEHRELFVRRHDMFARVISSVTWSKGVALVAAAGYTQAVNVTIYRALLARMLLHNRHVRQCGAGSVVPWSAALRTYSEAIATHGNAVPTRMTLSALRLCTPARQWVAAISLLMLSQANDKLTLPMLIDAAGCCATPAAWEKAMALLGRFHAQSLQVLPDSIQSLRPVGTSASTVDAAAHALLPRSEGPTPEQKHILTVINKVVSAVPWQVALSNEMCRSYLTHLVASTTLRPTEKTASLTTAVQQLPWEAFVTLMKTVTATVQEGSQGVLGSRSTPQLPPPQDGMERGDVAKSLLSNSIIREGVNLLQSEPETAIPFITTILYKLPSAEAAALFLSEATSAYRNSSSAVVAAAIRHPVVVGALLKRCADSNSWYLAASIFKSTSPTAIPCDVASDLVIQMRRANQAPLVVDVLQKYIVPSRTKLTEEAIEAALLCVLVHNRALAKASAVVAGTSPDNRTGKPNGIGVANGVHWISALSWATDLLEEGVESRILQTGTTPSVGGVNHEDPTVLLRKKTLSPRILSLLIYICVNAGSPRGGLFALGYARTVSKTELELSEEITALLYCMMYDRPREAESIIQHAVKKHGEYKGKYLGRLLVASQEAKGSALRNQT 642 T 1.4 Luteo_PO pdb F Eukaryota T 6sga 33 HA FJ mt-SAF18 MLCHNRLLLVNKQTQKYRTKLRYRFRQPSVVPLRQTLQQRHNTILEVLRRRRINSGDQSPYRYVEERLYSKPSRLDREGVKVNKTYALQGLGDLEPLRYGANFGISEKDALKYETVAEKAKYMEPPIPYSSLAARKLAAGALWPAAPDPEGMISKEVRLLRHESSMSPSARAFSERVAYHLRRSLKACPGHIAEHIDFTQLIIQEVLGSRRSKEIYIVWFTVDPGARFELEPRLHQLNHWVQQLIIKRVKRRPHIPRVTWIYDGGRLERELPRDVKQELQSFVADAATTLESRVKYLKELDTMNQRMKDIPWFMPYLWSKEEKAARQKSMLADLEEVERRKNEHSSGRSAPPRTSPPPQFVR 362 T 0.041 RBFA pdbhh F T 6sga 40 TA FY mt-SAF28 MWRLSRSLRSNSLHNPGPFLDGALQLIKLHLAHKNAAADKNTKACSDIEGEFLRELEAFRACFTMSSSLKVAKLYTKKLHGALSYFQLYDDPLMRQLDMIIGKQTMQPSAGRQHGVFKAPVAARLDPFFLDEREETVLPSELPNPPKPDPSTPLRERALKVPAQHRGHWVLRDPDIAITREERRTDPW 188 T 7.4 Hist_rich_Ca-bd pdbhh F T 6sga 41 UA FZ mt-SAF29 MRRKTTLNIGQVICFSSWNDGSEGYEWKSRALSEKRSLALEFLGNVNKRVSIHDAIRLKADINKKAISNVSCPSFFSGIEGADEDEDQSDMSLCSLLGVLEGEIETDCITHLSPSDASLLKEEFLCDYDPSDTKRMAKWVNLRSETSDYQSYGAIPEGERSLWSAWYLRNIKAGKKPI 178 T 0.02 PH_11 pdbpssm F T 6sga 42 VA Fa Q57VU7_TRYB2 mt-SAF30 MPPNSATRWLPFVSSDLKDYLNRYWAVMFTVGARPIETGHIRHYVSWYCTRMKVVLLDHHVYVEPLRQQLQEASRTPELPLLFVNKKLVGTLRDVELLEREKKLKDVLHFGFEWRVGGSVAATNGQKSLMGALPAPYGDAEFFRGRYRGPPVARPVVSLPTLHPFALRSEE 171 T 0.15 Glutaredoxin pdbpercent F Eukaryota T 6sga 43 WA Fb Q581U4_TRYB2 mt-SAF31 MNCSSTLACHAVVSAPSTASLITSCWTPQQHCYRQLMKSLRAAYFHDRSKLFWSRHRVLVEFYKYSEEANEEAVKQLVAIGLEVAAFIDHHMRTDVERIVKHNETMMALPVAQAKKFRSDYLLAEKQHDSWCKQKIKNIMKRRPPPPYPFF 151 T 0.019 Complex1_LYR pdb F Eukaryota T 6sga 46 ZA UA UNK-A XXXXXXXXXXXXXXXXXXXXX 21 F F F 6sga 47 AB UB UNK-B XXXXXXXXXXXXXXXXXXXXXXXXXXX 27 F F F 6sga 48 BB UC UNK-C XXXXXXXXXX 10 F F F 6sga 49 CB,KB,OB UD,UM,UQ UNK-D, UNK-M, UNK-Q XXXXXXXXX 9 F F F 6sga 50 DB,NB UE,UP UNK-E, UNK-P XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 45 F F F 6sga 51 EB UF UNK-F XXXXXXXXXXX 11 F F F 6sga 52 FB UG UNK-G XXXXXXXXXXXXXXXXX 17 F F F 6sga 53 GB UH UNK-H XXXXX 5 F F F 6sga 54 HB,LB UI,UN UNK-I, UNK-M XXXXXXXX 8 F F F 6sga 55 IB UJ UNK-J XXXXXXXXXXXXXXXX 16 F F F 6sga 56 JB UL UNK-L XXXXXXXXXXXXXXXXXXXXXX 22 F F F 6sga 57 MB UO UNK-O XXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 30 F F F 6sga 58 PB UU UNK-U XXXXXXXXXXXXXXXXXXXXXXXX 24 F F F 6sga 59 QB UY UNK-I XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 468 F F F 6sgb 9 I Ca Q38DR3_TRYB2 mS22 MLRRAYIQRRYPFNKRGPREHKSWKHHVLTEPPKPLQWRDPKVWTRDLSVMKSFDAPQWDLWQSRPRSEDMDEALQPFMDMPKSLKDRRYDIPWWANPFGAWYLQNILSLELLKLKSKTNAEKIATYRSYMRSLASGKDNTMSDDDVIRNIIKERWKTLEFGDRNAGYPCTFGDYIQFLNEWFKSLDEEGMQRLREHFDRRIRPLLAVMSPVDILWLEALTQNSPHNKEQLQRKIAFQTSLGTPEFFDMSKRLRYEINEDYKVRDELGPELFALWSKAPERWPPERLSKMYGLDFTLVRKILVWHHFKACYDACVEPDWSLPKRLFALEWIRDVRARKHGLFYGKMRFAEQKITFYSDRFLFRDLVNRREASYANVWEMDDPYRFLQTEQDYEDYWGDNYDVYRRMFPEMIGRTGEPVQQYGQMPIWAGPHRQHANKSEHNWMFAEIGVNVGHEALKKLELDPTNEKRRRFVIRQPDGTLRSAKMSEMRAWYWKEEWADFRFWAPQMEWGIENTPSQEQYQEHVPDTTDADFRKQRRIQSRPVKWFYESHYTRTGSFAGFQPLRFMQRRTEREVRWPDVINAAVQIQKRKPAAYIFKAIPEL 602 T 0.26 MRP-S22 pdb F Eukaryota T 6sgb 10 J Cb mS23 MRQCVVRRYKMPKNMGVAPRFDTWNEKYEPWEHMKRMGRLVGTGFYIPPEWYNHFRMFPPINHNFQQEKTLNPHNASEPTQDDTSTLSPERVALRDELARKSRLVASEGMRYYNIFWVRKPLDTMEKEYYELKRRGVDHGEAIRKVLQGFYSGLAVKKRVAAIQAEEAKLTGRFITMREATVVLGVLAKLHKEQLTPHQVSLLAKEQGETTQSGAKLTAIVSRTQPHVNKEASSPATSEAVGSSTEESLSADALASMLSEDGEQSAVGTRYQVEVKETANDSVRQLREKAEDQTGFPDWYTGESPTYSGTS 311 T 0.038 CHAD pdb F T 6sgb 11 K Cd Q38DK6_TRYB2 mS26 MMRSSSFCRRQIRPYYNLPSKSEHGRKMTGFLTPYRHWMWKQNELWRNVHEAQFEHLRRVYKRQWLESFRVNADEYIYKYNITKAAQLAQWECEMKEQEKKRIEARQMMDGRQALKKKHLDLLREFHERQFFFWYERASERLQNMNLINYVPHAQLREHIDKELDKYVAGKNEPYPLNFVGQMPFLEDGDGNIVEVPESLLSNHMAEHPDSTAKPHEPHTSSSISEAAAFEERMLRAMVSAKEEDLKEWLGDDSRALSETIDDISREEEEREADIRVARSMEETDAEREVSRRAYIERGKTGSRSIFRPPTVSEGAGGTPSAPAGDANTPMRRRKKGKLDRVHALQAHQDELLAKLSSQGLKEGVDASSVPERGKIVQSRGRIRDKAVIPTHEVLMQKPELAAGSTPGARIQTKDMVDKMYHRGKYKKSGSGDKSDGEDL 440 T 0.028 ThylakoidFormat pdbpssm F Eukaryota T 6sgb 13 M Cn Q57VQ9_TRYB2 mS38 MRAGCVACSRIPKLGIAALGSTTTSMQATCQETQELRCATTQMAPAVIFPLPPPLRGSYIDRKPTAASFNEHHATASFRHHMIASADVSHRPRHFYFASGTRNDTGTRSVKEGERKQLKQQTNVEVDSVAYGDQLDELSARWAAKFYGQVTFGPRNYPYPSSRWLARRFQMKKHRIIKRFRFRRYKLAAVANLPFAKMIRVGMLPELKSSKTKRGDVVDPTLSGQLVSAVKNTGEKRKGQRTRPKSKYQV 250 T 28 DUF2996 pdbhh F Eukaryota T 6sgb 15 O DD mS51 (KRIPP1) MLRCTVVGHHRSGKARAFVFRDPSLRMMRAGSGYQQLRRMGMPMQVGMGWRKVDSFHANTQYQHAWPLLSHDDLGNSDQSNNTKNIMYSMYMPKRNKGTAPWFRGADTYSVKYCEQGRYEYQRYLMINRFPSEYKKHFLSFLSNIRMSSGSATIPQEALHWLLRMIVDNFNPQHVHYIAAMKTLQSAGELDMARDVWKIMERQQTWPCTATICAYLDVCVEAGEKTWAMEAWNRYCTELKFLEPGEVDPKPISRVPFSLTREELLYLPKWKKHFDHDPNLDVMDLNRFNRTREVYLRMAQVMLAGGERNAFQHFFTKLEEAMLNKPTPVPEPPNPHLVRRPRWAPYEHCKSVHHSPWRLQNNGRALALGPPVTIEDEMQSRFFSNDQFLVHSVKEVLRIVLQEHKRAHPTECTRCKTEAFFYKTKDADETLKFCDDLIERLFASLGVRLSNLNTSSLLSTILEVFRVVGKESGAALLQRANEFLERKASLGDAEGSRENLTASNYLQVLSGFADESAFVYNTKKDGTCQYKTGFDPRTTMRHLADVVQEIAGNPHVTWAADMHLQVVETMVGCGTMKANDYFVRNVLRQFSWDSRFLEALYVEYRRQDDVDMWAELTKRALVWTARYNAPASERLRRLIEDDYDTIRVQTRTFRELAVFQFRDVEERRHSRDVVNELPNPWYDYVAHALPFPDRDAGYPDEYGDLGQWRAPGGPGSPVRGPGYYAPPMEGEHQRGYTAEWRDLRNPMRPPEFPTPWERKYRQYARGQHPSYDMVYAGPMPEIFPMRRDFRKPTRWDFHDIEKQGKYRTSGPY 812 T 0.001 PPR_long pdbhh F T 6sgb 16 P DI mS56 MRKFCHFMANCWNSARSHATYGAVPLTHSQVTSVYATDGGKVDELGLLELVEERIFSWKLNKWEMRIPPNLPNDQKELIRQEQENLKQILSEWRKCFGALNADILQISSLTGVPKDVVREKNRTWLQEEVAKLRWMGEVNKAALLRDAFMRLEAFGSRDFMFMERLCCIYGLARQGTFDEAFTNYITEDPVTNDIFVDERNPFKELVAHIVRNYSQIDIIYDFLGFNYSEGYRSSLRRYMEYLQCKTAENVRASGRLVTGDKGEHNILFDYCVSRESLVSGDSCQGIIDFLYINGNDVTLIIIASDNPWLRNRQLPHRRQMEGIARRVCFVLGIPPSEVRIRNLLLPPTYLDKGSIVRLNDIVFRLSNEQSNLLIPWLTNYNKELDPKDVDYTALAKTTNEEEWLTL 407 T 0.2 ApoC-I pdbpercent F T 6sgb 17 Q DL mS59 MRCSCAFLDKSVFAAKRRVIVPIHPTPNFPAHFIKSAFTTDPLKEKQKARFSSGGEAMREVQDIPKNLEGERSRRDLASRGDTEFQALVEFIEGASYDQLISGRRFKKVYDVLSENDDMFIWLCHTAMSVLNPGDMRSRLIYNHLRILAESVASGEMTQRTAFRFFESAVRSPAYREIAKRQLEGGAATRLAGISAAADVMRRMGLTRRPMSSYFELYQRIVERSEAMTPWGFPPLFQFEERLSLEPRLKFFSRASQQSLERRRRGHVMTPHTTLHGRRIFWIPPTWNRAGRFLGPHVTLYPGMTPD 307 T 0.37 DUF2840 pdbpssm F T 6sgb 18 R DO Q383D1_TRYB2 mS62 (KRIPP14) MRRFCTARVSSYVKRAPLLLPRGGRRWRSGAGTTADGGGEKEYIADSFESTAGSNAYAAMELAAEARREIHELWLSAETALEREKRVQQVAALIEKYKLDPSTPREADVSRGLGDAFDRLLLLCLPLGKTDAKGTDNLERLMHLAGRNGRELSVRTIQHLFARTDSFAEALAVFYTMRRCHVAMNMEAYYSMLYSLQRLEEEGWGQHFRNEYEENGAPSEQAMDFIVKGISNALLPENKPWLGRVMFQDRNVPDRRYDTRDFDELDTAWTQRYKSGTPAGAH 282 T 0.084 ECSIT pdbhh F Eukaryota T 6sgb 19 S DP mS63 (KRIPP16) MLHGTPVRRASLRYRRPYWMMFLKGVDNWKIYTVIQQPDHQRTEMLYQAWLGGLDRPYTRPKCMANQPLWLSKKRHMLRKERLDGPETPLEKYVLEWHKKFHSFQGTERPTPDDLHTALDLVERPLDLSYALQLLGQCRNLNNIRFAKETFLVFLEACLRVGRRDCAEYALEHAEPLGFWFIDEDHRRYLQGEQTWYKLSPLDNLYYPVEENAKLNEGRKPITRLSPATESPGSGTAISDGEPSTDVEGETTVDDEIAQLEAELAALEREGGGK 274 T 0.19 MNE1 pdbpssm F T 6sgb 20 T DR mS65 MFSESLCILSRRFRYNTKFPALVSYNKLPWEVVNHETPQFHMHVAPHYEQLLTLAASSPVPHIIGSKHIDVPREHRLRLLPGMLYLLDGDTLPGEFTINRVLDPTALQYYGRLSSQIVTVEAVRMLVPDDLRLLCNCITFKGPLHLPVAPYASLASLRGASQGGTTGSETGSNCFTLYHFVRPNRPPKELQLEKYYIHAPCVAPLSEFASNSDERGNWRPRLQAPKRTRRATPLPAYRPPQSYLMGLAERLAVVPGGCFGRRSLMWGHWF 270 T 0.47 DUF3475 unp F T 6sgb 22 V DZ Q587C4_TRYB2 mS73 MLRRSILFRMKYADLELTTRGEFPHGMKEPAFVKKLDQNIPWYFSTYRSMYHWPITGDNWSDLNEAEKHHDLHMFYTLAWWKLGEGIFGVDEDS 94 T 9.7 Mastoparan_2 pdbhh F Eukaryota T 6sgb 24 X F3 Q38E61_TRYB2 mt-SAF3 MLRGWHPNSSAMQVGMRHITIGGRHSRGGFRQPLGKHPQVKQGTVEGVPRRIPGTTKVTYTNKKGRTFSFSVPVSELTHPQVTLESAAGTWREMDTSFCELGDIEDDMPSPVDECLRGGSSLDKRLIQEVRERFVSFCREYVLMDTSGMKSTILSTELNAGPDYEHYDRRLRRKRHWLAIRHRFEDVRYVIWPDVVEETARGDSAQADVSLTNPSLTAGEMLEALLWLDAASTFCVRKVHPSDLGDKSEFLPLDLQREVEVVACHARRDLDFFDPSATSLEQFTACAALCVNHRVPFSLFFPAQDVCGDASVSTGQCIVANAPSPHTALGAVRIMALISEGSGSDIGKTIMFSDAFGAVTRFGILRGLSRVMSVEAFGCKDALENVNESELCIILHFCAEVREQNAAFFRRYEASEEDSDPQQVSFLAKYQQLSQIALARCKRLLYHPDSPRAQVMSEDGYIPLVELQRHAEGTNKAALIHYNLGIRSAQGMRRVALGAQSSARLAELVSRLEEASARVSGNTLVNDLVHHLSHKAAAGKMSLTLREVNTLLPLLSRMRRESPNGALDARFDRVFNAIDTAIGAAMRHNCTLDELLDLAEGLAACEMVPSALKQVEMVLIRSVMMHECSPMHLRRMLQAMFTLMRTSVPQVLLQSVASRVADYIKEASHMDSSSSNGGGDEKVKNHEECEQLLELLVVLGKCGYGALPGLVTIYWEAQLIDSMQLNPRLRCSYASLLASAAFALKKHDKRAWEGLADESHRLFMEYTRCNKENDIGRFAECVTGLAVLTQIKDNTNSSDVAFLKEYLSATSLELKSCEVIRVQELTDLLGRTLEWSEALGVVAPDVVIQLEKALFVMLENVSHTAPGVGIPDELVTAACCLVDMSSASLELRKAAAGVVGGAIVHAEEALETLRSGAPTQVRPGHSFDVAALASAERENVYKNSILQYCAALQRSGMSTHVEELWS 966 T 0.074 DNA_ligase_A_N unppercent F Eukaryota T 6sgb 25 Y F5 mt-SAF5 MRHTIPFFRRSAFVPAPGSSLLNPRSQRAKVRRMVAAQKAQGENFERQALYAELGGSPSARAPRSKGERSKEATRRVGCEVAERAKHMTDAEWEGVPVDEKHAFAKYMHKVLQEHPTETTEQQRRRYFETTMADVFELDPRKTVRDEYERVKLGLPVHLKNPQYSLGVSQAVYDAADASLFDPENVHRLENAMTHVKQVFADYVHKKREGVSTEAERRMLANLTAELNLETQKHLANMFKYAEMRLRQVKLEERHHQLAEIERLRRMAQQRGGVKGRKGGSRKMSRMERLKRVINRAVGLDIAVAETVLTEMQAQEEFLQFCEVFARLTLGSGFKHTGKDENLSAYIESLRKLYSMDAATLSTLDVVQYYSSKEGAHPVDWAKRWYERALLLPLQSTPEYQKLLQIQQRDESTVKHIKETAGTGHAFACEAEAEVARIKTQKVVNLVEKMFMDPKDKRLESLHEKRLRYLAHMQMERQIRCVRENAKLFDGVENMPEAAQCRELYEKIMEKKTAQCNMTSPPEGEGSAIQSAKTEGDHCEVGPGMFNVYDDAEASSLFEKIREITLRVIRDRRVQSAAATKARMLNRIIRSLKGGERSIAEELRALHQQRKEKMTMRILGIIENDVKTEMEWLQNMEEAERPPLLPIPENMSYVSAADVQAWRELREDDERKAANPFERRRRTFQPELLGQAWSVPNKPLLFWGTGVSAVQQALRHVAEDAERKRQGLLLAPPYPCAENPWGWRLAKDILDDNN 754 T 0.12 FhuF pdb F T 6sgb 28 BA F8 mt-SAF8 MMHRTAARLRMPRKPTPYVRKFLEGCPLPETLVDDIAGANLKSMAPFFTTAPRYIVAAESRLSKLFFHHALYPAGGARRPCRVLIVRGGRSVREPSFTINTGGGRGEVGGGSRGYRDPARRAYFYARPSTVGPFYSGNGGVSSNVAKCHSGVGSEGAGLVKRASVDGLLSPLCGVIEAHFAVGGTCNDAVATEGDGTESLTKGGEKLCVTLAGLLSGGHGGLMMDDGNCASNVRAAKRVARLLHDAAHHLSSFFYVHTQLPDSALFVSRPEASIASDGKGDGLLPSHLAREKDNDGRPSVEGVAVFRLAGGLEPTVHFAVGAPLSVLQRGVDGTASRGEKNSAEEGLNSAASTTVLPFGHIQCLLRVRTRGGKHCAAGKEGSEGTSNTPWCNTAGNDDITSNFAASGPQIAGGIVEPWKLGVSLDPKVPFFMRTLTEKRPSFSCGEGYLGTCSRSGDVDGNTVNDVSSSSGGSRTRAPGEVHMNHLLVRNDCETYLLPQRELLLSFHVPEEAEAMCKEQNEERMRRQAALGYGSPSHVFAEGPRTFARVLHGMKANLAAVEEASSTFRQGAAEGISPQVNGGSTTSGSSRVYEVRALPGDVVFVPRGWKYSVERIVGTAIIDAVAASTASPREALRAVFRTAPDPPLPQDVVRCDEGHAGAGEMPGGDSSNAEIVGVEVDAFVLCYKPYPVLSNAQASTYVAANYVHSGIDDFYAKGGNDVYHKYT 726 T 0.02 Cupin_4 pdbhh F T 6sgb 29 CA F9 mt-SAF9 MTGPTRALFLSSGINLGRLRLAEQFSSMNGWQSKEDPAFDAYVKERRRKENYEAFDQRVERGYAAAAKLHKAEIQNAVKRRLKSSGAKFTAETLREMSSAVTERLAWLRDVWAQIDADYRSGDSARQETAAQEISAALRGEPNDYMRWVYETKRELRFAGPVGRRAIQEELQAAELPEVLDEEVNRYHDLKLNMMEIEREVKAKYGVAGQQHWAELQAAKDEEYIQKLDEAAEVYKQLLDQSARLDESRRSELQRSYVERVHQAQVRFKAAMELEGQREQLIEAHQAMKEERMRTEREKRRQLLREAAELRAQGKKSADVLTALKERQLDANAKRQAEYELKECEDILKRKSEMLDMIAHFKHDVEEREGREMLQRQKSDEERQVNVFGFYEEVGVEDGLSISSEGTTSQGGSSGLGTVSTSTSCAKSADSNSSAQPSQKLRKEELWKVINADTYEDPFRTVHQARLDAVKTYDPAYARTFPLNLVLGRKYSRQGAGEMAAGNETDKQILQKGNNILYSFQWGLNNGTVHDLDADGGTDYFMDGAFHVRDKETGDIDWRYEKKRGGPVFRGPKFYRLGAQREAADPGERAMDPTPYTSTPREHKWRSS 608 T 0.11 MDMPI_C pdbpssm F T 6sgb 30 DA FA Q386U1_TRYB2 mt-SAF10 MPSASSGYTFADFLRRLERSPDSHMAPLYHEHRELFVRRHDMFARVISSVTWSKGVALVAAAGYTQAVNVTIYRALLARMLLHNRHVRQCGAGSVVPWSAALRTYSEAIATHGNAVPTRMTLSALRLCTPARQWVAAISLLMLSQANDKLTLPMLIDAAGCCATPAAWEKAMALLGRFHAQSLQVLPDSIQSLRPVGTSASTVDAAAHALLPRSEGPTPEQKHILTVINKVVSAVPWQVALSNEMCRSYLTHLVASTTLRPTEKTASLTTAVQQLPWEAFVTLMKTVTATVQEGSQGVLGSRSTPQLPPPQDGMERGDVAKSLLSNSIIREGVNLLQSEPETAIPFITTILYKLPSAEAAALFLSEATSAYRNSSSAVVAAAIRHPVVVGALLKRCADSNSWYLAASIFKSTSPTAIPCDVASDLVIQMRRANQAPLVVDVLQKYIVPSRTKLTEEAIEAALLCVLVHNRALAKASAVVAGTSPDNRTGKPNGIGVANGVHWISALSWATDLLEEGVESRILQTGTTPSVGGVNHEDPTVLLRKKTLSPRILSLLIYICVNAGSPRGGLFALGYARTVSKTELELSEEITALLYCMMYDRPREAESIIQHAVKKHGEYKGKYLGRLLVASQEAKGSALRNQT 642 T 1.4 Luteo_PO pdb F Eukaryota T 6sgb 33 HA FJ mt-SAF18 MLCHNRLLLVNKQTQKYRTKLRYRFRQPSVVPLRQTLQQRHNTILEVLRRRRINSGDQSPYRYVEERLYSKPSRLDREGVKVNKTYALQGLGDLEPLRYGANFGISEKDALKYETVAEKAKYMEPPIPYSSLAARKLAAGALWPAAPDPEGMISKEVRLLRHESSMSPSARAFSERVAYHLRRSLKACPGHIAEHIDFTQLIIQEVLGSRRSKEIYIVWFTVDPGARFELEPRLHQLNHWVQQLIIKRVKRRPHIPRVTWIYDGGRLERELPRDVKQELQSFVADAATTLESRVKYLKELDTMNQRMKDIPWFMPYLWSKEEKAARQKSMLADLEEVERRKNEHSSGRSAPPRTSPPPQFVR 362 T 0.041 RBFA pdbhh F T 6sgb 40 TA FY mt-SAF28 MWRLSRSLRSNSLHNPGPFLDGALQLIKLHLAHKNAAADKNTKACSDIEGEFLRELEAFRACFTMSSSLKVAKLYTKKLHGALSYFQLYDDPLMRQLDMIIGKQTMQPSAGRQHGVFKAPVAARLDPFFLDEREETVLPSELPNPPKPDPSTPLRERALKVPAQHRGHWVLRDPDIAITREERRTDPW 188 T 7.4 Hist_rich_Ca-bd pdbhh F T 6sgb 41 UA FZ mt-SAF29 MRRKTTLNIGQVICFSSWNDGSEGYEWKSRALSEKRSLALEFLGNVNKRVSIHDAIRLKADINKKAISNVSCPSFFSGIEGADEDEDQSDMSLCSLLGVLEGEIETDCITHLSPSDASLLKEEFLCDYDPSDTKRMAKWVNLRSETSDYQSYGAIPEGERSLWSAWYLRNIKAGKKPI 178 T 0.02 PH_11 pdbpssm F T 6sgb 42 VA Fa Q57VU7_TRYB2 mt-SAF30 MPPNSATRWLPFVSSDLKDYLNRYWAVMFTVGARPIETGHIRHYVSWYCTRMKVVLLDHHVYVEPLRQQLQEASRTPELPLLFVNKKLVGTLRDVELLEREKKLKDVLHFGFEWRVGGSVAATNGQKSLMGALPAPYGDAEFFRGRYRGPPVARPVVSLPTLHPFALRSEE 171 T 0.15 Glutaredoxin pdbpercent F Eukaryota T 6sgb 43 WA Fb Q581U4_TRYB2 mt-SAF31 MNCSSTLACHAVVSAPSTASLITSCWTPQQHCYRQLMKSLRAAYFHDRSKLFWSRHRVLVEFYKYSEEANEEAVKQLVAIGLEVAAFIDHHMRTDVERIVKHNETMMALPVAQAKKFRSDYLLAEKQHDSWCKQKIKNIMKRRPPPPYPFF 151 T 0.019 Complex1_LYR pdb F Eukaryota T 6sgb 46 ZA UA UNK-A XXXXXXXXXXXXXXXXXXXXX 21 F F F 6sgb 47 AB,ID UB,Uk UNK-B, UNK-k XXXXXXXXXXXXXXXXXXXXXXXXXXX 27 F F F 6sgb 48 BB UC UNK-C XXXXXXXXXX 10 F F F 6sgb 49 CB,DD,LB,PB UD,Uf,UM,UQ UNK-D, UNK-M, UNK-Q, UNK-f XXXXXXXXX 9 F F F 6sgb 50 DB,OB UE,UP UNK-E, UNK-P XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 45 F F F 6sgb 51 EB,KD UF,Um UNK-F, UNK-m XXXXXXXXXXX 11 F F F 6sgb 52 FB UG UNK-G XXXXXXXXXXXXXXXXX 17 F F F 6sgb 53 GB UH UNK-H XXXXX 5 F F F 6sgb 54 HB,MB UI,UN UNK-I, UNK-N XXXXXXXX 8 F F F 6sgb 55 IB UJ UNK-J XXXXXXXXXXXXXXXX 16 F F F 6sgb 56 JB UK UNK-K XXXXXXXXXXXXXXXXXXXXXXXX 24 F F F 6sgb 57 KB UL UNK-L XXXXXXXXXXXXXXXXXXXXXX 22 F F F 6sgb 58 NB UO UNK-O XXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 30 F F F 6sgb 59 QB UY UNK-Y XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 468 F F F 6sgb 61 SB CC mS3m MFLIHFVHYKTILQKYTFKFKHIFLSIDKYNSLFFNISGILIWLNIIHINIILIKYSFFILINNFEYLIILIST 74 T 0.022 Prophage_tail unppercent F T 6sgb 64 VB CN Q580I0_TRYB2 uS14m MFCFSRGLWMRHIGQDVPKRHTHFVLESRLMYEKSFRDCWLHSVCRAISQLDEPLSKTVVGTHQKMLQRKVTCFQYNQYGLFKTPYYRLANVDRYHAVQGVAGTREWVPYVNVSYWTMNKMVRGGNLLVHRVHYTGWGTDSHLKKGGWEHRWNKVLQRNVLQYSRI 166 T 12 DUF6462 pdbhh F Eukaryota T 6sgb 65 WB CS uS19m MAFRNTFTTPGKFSTVSKNIVLLLIWRVKVFLRAEGFAHSLVMLPVSLYSKILLCDVKKKIVYFHCCTRKKSMLRRCPCVWPDGPTKSSVSIGTAFTLQKRFLKIAKSAFGFYLARRGQRKYPFLRRPHIKNTHSMNPSAPYFWSFMTAKSQMAFLPEENYITGDWTGKFFVSKRQVYTLQHATSGAKVRVKSFPSIFEFNSPSRWNIGKEMNTLTKPRMDLIDEQMLTKKQRLDYVRAGLLPK 244 T 4.9 LIN52 unphh F T 6sgb 67 YB Ci Q57WW0_TRYB2 mS33 MVLRRWFPLLGVHRVGYTHPSTLPVPCAQRWDLRLARARIFQEYIEEKAPGAWQLEDERSMSPEFKTFTGYPMRDMRPGYGQNLPDYIMKKRLPNNTHYELFARRDIPNEDNAMYGKYLYDMTVHGTSLPSTYRMHKDINKAQRNDRKLSGNRFKVLCSSGAKNPPSGWEPIPDATEEEED 181 T 1.1 Nmad3 pdbhh F Eukaryota T 6sgb 69 AC DB mS49 MIRRRVCDGARLAPNIRATFSAVRYQSGLHTFIRDSKPSNFSSVRRSENANGDATASPGATAGENPASSGDWASHMQRELFGEVDPLGGQAHKDYYRDVTRGYSPQYAPRNFANGGAVAYPHIQSPYEYEEAAHRRVWLDHDVDRMREEFTQHRASLRSLASAQEREELLRSRAAEYQVANTVHESESVHPIQQLYNSGGTSRSALKQQAVADRYSIAEQHSPLPLTTGVDRDALDEAQRTKDRILNDSFTAENLLITHGLREKEKHDFTILQRTVRIPFQGYDMDRFLAQQKGTPYGAQQLPPNVVPSSMEEAQRTLRGSSATATPLVDAVAQKVYARNTVVDRPAIGEQLTEQIINIMRASRTTAEQQREEERAQRFGLGRQGALVQDGGPDQRTLKKHTNDERIVDAMLFQQNAYRKTPTDEHWNPYIRRSTENGVGHLLQNKFDIMRREDRLSKGEQDLTERNTIHYGVPIQQIVDEFVFRHRNARGERPLDYFKPFPNFRALRLNRMYRDVEGFSLMKQRPEFLEWELFTRYRQHHQQRRRLALLHGLEPVANETAQERDTRRHRLDEICERTPFDEREMRVNDDEMRVSVETLRSWFGVYMLPSPTVVNAVLGGSASVNLHLYHLADEMGTADTREHVLSSRYLNRLLLLESYQNRVGRGFMNHVVGRAPEPVVPHEQPQEVLRHFSAEERAMYEQHVKEQTSRQLGEWERAMKRRRWLTDHQQYGHVVSHGLETSVVDLSHTETGAVLTVSTKAYEQEIEAVRMKTNATIKVDGMVYNLLPNSERRVVPLTVQLDSGEKIDMTSEDFDRCELEAFPRNLNHALNYGIANYAYNRGNYVETQDSIWEEQTASGQEGWSPATHADGLREGLPVRARRPIFSSSAEQRIAGGPQRAVIIQYHHQPFFNPEPRLVKVAFQCDGTIMEVPISDVMIWQRRYHGPERTVGDESRRYNPAAMRRYVDVTDPFNEKTSNTEHFLDKYEPKRNADTVADKYRTTKQITEIDKWTRYDSARADNYRPLSISHRRDYIRMGYIPRYTPWEWIAIQEADQPLIAEQIRQDNIGTSYFFSLNRYWRYKASPHGYIRHFENEVRDLLQYVDGVTPWKQAQKIRTYWEVRSHHPMPQFNRPEVAMHRNTVGLLPAHMWETDKKTGKVKSVKDSVRDYQTKTPYPKWVQL 1181 T 0.098 ACD unp F T 6sgb 70 BC DC mS50 MFRVRRLAGFIQCSPCGKMGPLKQQTRRYTPIWKSDPAVDNVAPLRDEDERRALWAEVGPISDVGSAVTAWIRFGNDPVLHTAVPTMLGGKFRNQQREKESLLPNSSSPFAYVEDYMGTNLVFGSPVHAKESAAVWATYFERRYASRLRLSRRTVANYVGLINSPEVFDDESDRPETRWSQDTFFRECAYLSEKFLKEKVSNMQQFEAALKRASPEAYLAFFDAFQQQTQTQIPLPSPSVWHYEGERRKQWAEKFISISHKAQAFFKDVLSEDVKKYQEVPGKLLQKVKPVLADVGKILVKRHERWLKGRVWTSLTEEEREAYCMKEVKRQQMQVEDGEFDPMMEDDVDDTELEEWQREHDAIMKLMNSPIDGLHFTTLELWLHTMRCEELETEHIYTSARVRAIQVAARKKLYDTTSYEEVIQAVVESIARGTLDLGAGVLRPHFNEVWCQLNYAKFGSSTITQHTTTSRRQLLFFHAGSLKDIAATATLYYATKPLSNSLDYASPYKYRRSLITLCSNYGVETAYTTQRPLLRSAANLARAEDLIHAVVTAAAQPFGERRRAATRDLHMEFQRLAVPVERVIVANPVSALLESGADPDEKPVEGEKVNMWPLGAKRVVLYKWSAPNVEKLKAMESDAASAVSGSSLTAKRLREIQELKRRGFLEVSLWRRVTAQERKQRNEIVEAKKKQVEEVVRTVPSLAHLHQYATSLYSRIEERVAFPTETSTVTETTNMKEETNKPLEDSEWEFAVLLDDRVLLNKEESVELYLPYRDANGELLAQGEYRALVRAFDLEANPNLHPAYCSVGYSESFQVFDALPQLIAQFFRVKDATAEAAGVTHIPAADFTPFCAFLRDAGLDVPLRCEFEAGQAVTTDGDVYMDYFLQLLRGEAFHQSHAQAGLTEAQRAIEPLCRAHWVVHHPGADESEWATARRSVLDHAMQHEREWWFPNEMLDVKDVVTGSTNGLTPQMYPAAVRYGVELCTVLTAEGKFVDERGSGLSARCVVNGTGAAESVVFDTANCNGTNTTSVEDALRVAHGALRSAQDRHNTLAAFRLGPLSKQSQVLLFCGVNAYEFGGKYARTYAYAFEKAKKELEATAASGFMAPSLSHEDTERLSDQPTTSPSVDRFASTTHPEQRKAQFVPRVGPGSTPLEDPAADQKSEWS 1165 T 0.23 CT_C_D pdbpercent F T 6sgb 71 CC DE Q386Q7_TRYB2 mS52 MRRCSFASTCRYRSADDVPLGLGSAYVWTCRNITSRGQFNPIHNFSYAMERGVRARDVKAFEKLITNPGPLRVAYTPDYLDWLHRCYKAKGTYMDARAVAEKKFNGNIVSSELSAAVNRREGKRGDTRGDTVDNDHHNLPGAPPPGMFLRPAHSFRRLAGELKRRRAQSILDEVARAQGMLDLFERQPHFPAIHIDRCSRFHLVELFKEMVLERSLDSNMIWEKALLYRAILSERKPSYPTSFHYIFTAVEDTVFAPTISTPMEMAGSGGEGHDRSSHPLAAKCPTLEAYYYYVYLVKKYYIDNAVEAHVVLRCHREPNAADLLFSNPPPKDDTEIMKAVELLRNADIQRGVAAAAAVSDPTLPPGGEGSVIGNSDNKKNSEKTSEGSRGRPARPPVLPGAYPPIDMLWRCEENLPLLKVLLFGEFNLIVSENPFVKFPSAHGFLTRPYSTDSSRTLADGMSLANVMAEKRGHLLPSLPRNTATSIDARAQDIRRLQQKHHRDDIVSFQKLLRSTHAEDSPSAFSSYSDWSYFNPRAVRAEERDRLTRKAVEALKLYDSATNDIYRHSFEDVQACHTQRVTERDRTMPPYLPTLPHFVAIIKKDPHISFLLHIGLPDRNSSEEGSAKHKELEKRIYYLARALYHTALEYHNETVRRVNRQKVNVAASLLDNFVEQEWTTILRDKHDVTDVTKTLNDTQNDKKQLARRLGRYMLFANRSLDDTGFPTDARADDYTRWMAPPSVGKVSL 747 T 7.7 Complex1_LYR_2 pdbpssm F Eukaryota T 6sgb 72 DC DF mS53 MSVSGVFSKGRGIGHEATTSILRYIPRARVPWQPSRFGRENLTAADMARLWSRGRYRDGPGNYNSGYCTERTHVLEENTVSIIPRRELEKYMPDITIGPKALVTPVSLMNARNGHRVTHDLLHSYDPHIGRLGKPAVVDHDNITVEDPNRVGLNAATLDCRGRIYRWLRRGPFFQVDNYFRRSVKLNRDGTLPTDFVHEAPLMRKIIRLAHRGHLKAACEEYRRVTTVPPVEVYRALTACCVPGAKLADAVSIFEDGDSKLFYVSRDGEVLHNLMRCAIAARHRARIMWVYNVMRGRFYENVVVRAEVDLIWRYRIAMIALEYLLDHECAEEAAAIYSYLVEEELLRCDVHVRVGLHMREAIAAGKPITLNNDVMNATSLVRDATAVAPEVARELQRRHAQTLQNNAVEAVGAENVREGSAPWSILGPLTAIGPTAEDTMVWLQQHYGDVDVMSIMRWARFRKGKDLMAKDRPQYLARAAAWIELLSKRNREMEEVPLTYMRKSKPLVLDTNSNVRVAWQTPLMRSGGPPRLLAREEGYVFHHSNSSRFVEETYRHPGESLQSRYLALQPLHTEVSAKEDFQRLYYQAQKHHKQQERLLPVSTAAIPSRIVHHSVMSALHGVSGKGEPANRSLFTGNKSNDSHSNTGVASGTSACTDGAGSATPEF 666 T 0.00091 PPR_long pdbhh F T 6sgb 75 GC DJ Q584U8_TRYB2 mS57 MFRSTRPRSVGYTPVNPDTSPMVAYSQYHWHYNLPQGMERPHSVNRTFAAPFQSNHSLVNKYRGVWIEFDMHPAFSVALEPQLRKLPRGRTLPKTPAEEVIADYTALAPLVDDEKTRDLWLAKVFQHCAFQRCGGAMELWERYCHQRFTAEGATAKPPLSLVKSVLFYCNKTDNSGWRALFDRCLKDGWNYTPLFDTAQWSFMLKSIGRMGDEDGVRAVLEEMLDVQADLDRVEARSVVIALNAVTNADVYEFVKKYLFNFGERKVKFLRTTYSDLRGHGAGKLRIPLKENDNMYYHVCWHSSIRSPRQFSPRQLYFDYTPSTLGSSSHNPNAKIDDIVKDKIEKWKAEGLLPEDYVHEDRVYDRSAAFKNVARQEKWKKMPKILKSKRMGYTGDP 396 T 1.1 RPM2 pdbhh F Eukaryota T 6sgb 78 JC DV Q57UZ6_TRYB2 mS69 MRRVRDGLFLSHPSSALQCSVAVVSTGEFDHPPFQFRQRHTFNTTPLHDANRFGGRTAYLREIGPVNIKKQGRRFKKDPRTVQFNVDVWCAQQTLRKRWKQRDWEVIEIPFRLVPREQQRVIPELYTDIPQMTDPARNDFSNIRNKVYDREELQGVLFPAAGAMLYPPLQRVDKQAMTLDKYL 183 T 0.13 CNRIP1 pdb F Eukaryota T 6sgb 79 KC DW mS70 MLRFTHRALTATPERFSVLGTTHPKPKRTGFGRNNKMRSKPSDNVAWYDKGPVEWLPRPVRLTYDHLDQLQQWTMRATLDGRTEEFNRIRDLHREWSQHPLMPVLGDVEPKFPLNLFKQNHRAKKRFLVRWHKANTPANWLWMPRGPTVVTPLHRTNPTQYPENWKQMVQRKSGTGTPS 179 T 16 THDPS_M pdbhh F T 6sgb 80 LC DX Q383G5_TRYB2 mS71 MFHRAFVSSSDLTGCTIALSSVCTQKRYWAKPKKRPKVGQGFHEKAQKWREEYLLDRHRALADSLRAYVEFSTSKRVEPWDARFKPFDRIEKDGVYVLMRYMMEEKLQLCNYHHRPVKRLFCNIGLMGPQITTKARWKPYRFATNPAGTSKAERMYQRDKTVYTHGHND 169 T 0.18 Tox-MPTase5 pdbpercent F Eukaryota T 6sgb 81 MC DY Q57YD4_TRYB2 mS72 MLRSTLTSLNTFLTSSVATPPISVIRTGPKWWAHPERMVRQKLMYFTLGVDQLPLRRTAVIQRDLQRFHMCKPPPRVGDSTGYKRSRAAQLNTWYRRIQYQEYHMQHLFTRHVWGLLRVYPGNTTKIQGKADDGYVGYDSVPFHRYNRAPLPFPARELYERRK 163 T 37 MBDa pdbhh F Eukaryota T 6sgb 84 PC FD Q583U2_TRYB2 mt-SAF12 (KRIPP18) MLIRRLRRPHSTVRGCRISSASNSDSAGAFSANGTQLPDPPYYLPHSPRFDAERCGTFNKKWLLNLPALKPLVRNSTYLPKKEELWRAPTHEALETIIGHLPYHDALRYITEHSLFLLFPTVLRARDAPLPHVIYEDFMKSCTFASLQNPPEEQFALPSVLLRTLLCMAAYHCTLDADYFTTCQMLFGRMEQQQQTTPEVLSAWVYCCTASGRVDEALTYAKYMADCSAPFDVTVFSLMQHPSLNPIEVEDGSVPHSAKGLLLQRRLGNRLHTAYRSDAVAAHGMFVYYALTLSHVRKWEVIRAAAALGVTLAERTVVLAVEVFAREKGMRCGPKTVKALTHFLAQDGTVGHLLYVLLRARKNELLPEFRDLPHTTFSEEEQELVLQCVAQRARHDDSFAVAATLVSSLVREDDPSELLMAFARAARNHHSADVCGGDGDGSVCADVPAPVPESPPSNSSEIIEKDRWAVVQASVRSLLLDVNALDQASRRDAYHKHWKNGNKGVKNKENVLAKTLTLPHDSMHTATIQRKEKELWELMRSDTPVGVRELAQLNIMEELQEAKRLERAEMAWVNPDGTF 579 T 0.3 PPR_long pdbhh F Eukaryota T 6sgb 86 RC FG mt-SAF15 MRWSAVLCKVSKVPTRSVIPADALRYHPRYSVPRKMVLSNTFNVVGENNRYTSLKLILEKLSGHVGRRQYKMLCNLEKHYDKLKDEGIDWHELKWLSREELIVLFDKVLHLTRTERAALLPAIEAKVCGVLRQTDSRHSTVVCMNRGNANHGWRCGRNGHTNDVYFEGKAKANTAALQLCRRSSVRSVERSGVLVEVRSEPDFSVVGRFDHSLSPRVWSTPENPTFQVTTIGYEFRVHQEDPRVIPQIVEAAEEWELHANITKQVIWEMLEMYAVERDRQPLDLKPGEMGDPDIPSRHATAFNVQVVPPLSEGDEAIQVVEQRIVRADGSEVPWFQEPPPQLFSGGIPVILPFAPSIIVKSTFRQVTRSSAQDVTRQLLQPVVDVTCFLHPNVCFWWNAEDEQRCLGHIVDYAKRIPFALPFNLYFRVNLSKDLRGVQNYTEELGKRMSMKAHYFNLRSYGVR 463 T 0.089 DUF1285 pdbpercent F T 6sgb 87 SC FH mt-SAF16 MLVCCRSSLSLLARATMPLCCSRRFLTHQNNIDDISGPVDTNSNSVSDGRLHCSTGEGGKASTCERVSLRTIAESLGAAAAAELRAEVERDTRDGVAAIPPLPPLGWRVRHPSGSNYFVMTRTLKNGVQSAELNNRRYRSVHDIFLQSLQKGGHKYAQKGKGSEGRQKDAKEEEEPPSQEDGKSSPKVTVGYDREGRGHRATLQRMDELHDSPKLSRADVHLTVFAPFRVYDPSLHDPTVDICEWSSFDLVVQKTVPDNMVANKLLQPLSCTPQDGALSMYVCLASVNSEMRIRSIQLLSMKEAQALVEHACFGNGEPLFLELLRRRGRRRPLVERRFDDPRLRYEEVAQPQQVADEAAVACSSSCYGPYYPAFEMLMDSCGSAGEYSRALCYGGPYVSELSRELCDALLDYIKGDLGVSDQLCEYVCQMQFFLEQEEYMTWLGQVQHVANAVSRTA 457 T 3.8E-14 MAM33 pdbpssm F T 6sgb 88 TC FI mt-SAF17 MQRTLRSAARRKWGQKTWSPTATNGGAAPANGVSAQEALQIAYRPMPPSQTVEYEEDFGHNLMIHREYISKRCRDRVSFELSALSYSNLELRRGQEHLAGIMNRERRGVSVGASGAPDDQVQMQTDVDANSREVLSARYLFNERRLQFCDRFQNFFQSKLENSAASDSNGHEKQHLFSLMEACAVIFGCETEAARETYYRMFLGLDSETLLEEDEALRNRIADAKLVQRVLENNKGRQEVTQSPKLQQQQDQGKPLHAVSSGTSLLNDCEEERFISSIPELSLFEDETEARANGFVEGEDDVKANNGDLTAGSFSSPASSVNLPEEFEEYAPLYKAYITHAVGKGPVASYDISTLGSTGLTAERRRWRTLMEKIVREDYHTMTEVEQMDAIVLNEQLHTVKFFDLKIGDAIRDILQLLQRETGVGSSVNRDTPVGISPNNPERRV 445 T 4.8 DUF3221 pdbhh F T 6sgb 89 UC FK Q57XS8_TRYB2 mt-SAF19 MHSSLIILRHAYFSALHPARRVVPGSLLPVRTQFYTRHFTSTAGPTCGDGGETYKSEPTKVGASVEGTNSGNGVTDSPSLFSSSAPTVRRRALPPSDFPENALLKCIEKEIEDEALRLDKEECPPPPPTGWEMYHAPGTSVFYGRRWWLPATASAETRATPERHTIRVQLTKRDPSLDPECDVRGEHFPFSFFVQRAPSKGEAVRRDGTFRMGDSAAAGDVKGRTEGKEEEEEEEELGLYDQSIEVRADFVDGELLVDNVVFHGTFKTGSSCSKRSGNTSPEAAAATAAGQHDNTTGGRGKVEEVRYNNIFNGYPGPNLDEAEEEVLDGLQAWLAERCVDDQFGEFVGQYSVWVEQQEYEMWLKRLRDFVAA 372 T 1.9999999999999998E-26 MAM33 unppssm F Eukaryota T 6sgb 91 WC FV Q38C60_TRYB2 mt-SAF25 MMKRTILQRCIQNKSLEIARISRSDINSRAHLPFNFDVCYELGSREFTLFSSVGSTSVLVFCNVSSRRLRSVKGGQGETEFPPKRLNVKRRRQSGSADRSPVVFSAFVSMPTSGLTIEALCCSSLGLLVVDGVSFHQGPLTESMIPQDAHPGPEGHYQGPLLNQRSLAEMVVNTRGCITSVDPFRQQESFFDGHVNPWNARSLRFGHVPVHTAKPGFSDALCHFLEVFGVNDELAFFVEDFAHLVHREEETAWTNVLKTMMGGR 264 T 1E-07 MAM33 pdbhh F Eukaryota T 6sgb 92 XC Fe Q387L0_TRYB2 mt-SAF34 MSRSTVFGPGSLYSFTKFGSFNRSPTNCTLNKRMKDIFRLENQKHIRNDFDRERRYRMCTKCGITTVTINFNNVPSARVGLWGRCADDKDYTHHRMVDITQREYEVLRESPVEKRLNWWRYER 123 T 0.022 zf_C2H2_13 pdbpercent F Eukaryota T 6sgb 93 YC Ua UNK-a XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 47 F F F 6sgb 94 ZC Ub UNK-b XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 42 F F F 6sgb 95 AD Uc UNK-c XXXXXXXXXXXX 12 F F F 6sgb 96 BD Ud UNK-d XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 59 F F F 6sgb 97 CD Ue UNK-e XXXXXXXXXXXXXXXXXXXXXXXXXXXXX 29 F F F 6sgb 98 ED Ug UNK-g XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 167 F F F 6sgb 99 FD Uh UNK-h XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 255 F F F 6sgb 100 GD Ui UNK-i XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 32 F F F 6sgb 101 HD Uj UNK-j XXXXXXXXXXXXXXXXXXX 19 F F F 6sgb 102 JD Ul UNK-l XXXXXXXXXXXXXX 14 F F F 6sgb 103 LD Ux UNK-x XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 110 F F F 6sgc 83 EC XX poly-lysine nascent chain KKKKKKKKKKKKKKKK 16 T 23 Rib_recp_KP_reg pdbhh F F 6sgf 1 A,B,C,D,E,F A,B,C,D,E,F C7G9B5_9FIRM Beta-xylanase GAMGVKKVFTADQLKVAWGDADYELADGQWKLSFAKQYNQVKWTLPESIEMSQVNAVTFQVADQKVPISLKVYNGGDDATAANTQYGLSGQTEYTINPSGDGAIDAVGIMITEDKPENATVSLVSVTFELKAGAGDAKLGD 141 T 0.00088 BspA_v pdbpssm F Bacteria T 6sgo 1 A A F4S7L2_MELLP MLP124017 MELPESFEFILTEDMVTDLDVKGLGYDFIDLVTKSPDSVNSEHELAHFLGPHDPEIYVNGKIQTTTAFLQFFRQGLFKKLKDAEFAINVSGKVKEGEGYKLVWKSAAQRSHDQKIRWDEAEAYIWRRKDGSCWLHSVKFIMSKAAPYVAIDHHHHHH 157 T 4.3 WLM pdbhh F Eukaryota T 6sgw 3 D,H F,J ECCC3_MYCS2 ESX CONSERVED COMPONENT C3,TYPE VII SECRETION SYSTEM PROTEIN ECCC3,T7SS PROTEIN ECCC3 SRLIFEHQRRLTPPTTRKGTITIEPPPQLPRVVPPSLLRRVLPFLIVILIVGMIVALFATGMRLISPTMLFFPFVLLLAATALYRGGDNKMRTEEVDAERADYLRYLSVVRDNVRAHAAEQRAALEWSHPEPEVLATIPGTRRQWERDPRDRDFLVLRAGRHDVPLDAALKVKDTADEIDLEPVAHSALRGLLDVQRTVRDAPTGLDVAKLARITVIGEADEARAAIRAWIAQAVTWHDPTMLGVALAAPDLESGDWSWLKWLPHVDVPNEADGVGPARYLTTSTAELRERLAPALADRPLFPAESGAALKHLLVVLDDPDADPDDIARKPGLTGVTVIHRTTELPNREQYPDPERPILRVADGRIERWQVGGWQPCVDVADAMSAAEAAHIARRLSRWDS 401 T 0.07 Bac_export_3 pdb F Bacteria T 6sgx 3 D F ECCC3_MYCS2 ESX CONSERVED COMPONENT C3,TYPE VII SECRETION SYSTEM PROTEIN ECCC3,T7SS PROTEIN ECCC3 SRLIFEHQRRLTPPTTRKGTITIEPPPQLPRVVPPSLLRRVLPFLIVILIVGMIVALFATGMRLISPTMLFFPFVLLLAATALYRGGDNKMRTEEVDAERADYLRYLSVVRDNVRAHAAEQRAALEWSHPEPEVLATIPGTRRQWERDPRDRDFLVLRAGRHDVPLDAALKVKDTADEIDLEPVAHSALRGLLDVQRTVRDAPTGLDVAKLARITVIGEADEARAAIRAWIAQAVTWHDPTMLGVALAAPDLESGDWSWLKWLPHVDVPNEADGVGPARYLTTSTAELRERLAPALADRPLFPAESGAALKHLLVVLDDPDADPDDIARKPGLTGVTVIHRTTELPNREQYPDPERPILRVADGRIERWQVGGWQPCVDVADAMSAAEAAHIARRLSRWDS 401 T 0.07 Bac_export_3 pdb F Bacteria T 6sgz 3 D J ESX-3 secretion system protein EccC3 MSRLIFEHQRRLTPPTTRKGTITIEPPPQLPMRTEEVDAERADYLRYLSVVRDNVRAHAAEQRAALEWSHPEPEVLATIPGTRRQWERDPRDRDFLVLRAGRHDVPLDAALKVKDTADEIDLEPVAHSALRGLLDVQRTVRDAPTGLDVAKLARITVIGEADEARAAIRAWIAQAVTWHDPTMLGVALAAPDLESGDWSWLKWLPHVDVPNEADGVGPARYLTTSTAELRERLAPALADRPLFPAESGAALKHLLVVLDDPDADPDDIARKPGLTGVTVIHRTTELPNREQYPDPERPILRVADGRIERWQVGGWQPCVDVADAMSAAEAAHIARRLSRWDSN 343 T 22 FtsK_SpoIIIE pdbhh F T 6sh2 2 B DDD C-type natriuretic peptide fragment (CNP) AAAA 4 T 900 Cyclin_C pdbhh F F 6sin 2 B P P53_HUMAN ANTIGEN NY-CO-13,PHOSPHOPROTEIN P53,TUMOR SUPPRESSOR P53 KLMFKTEGPDSD 12 T 44 CopB pdbhh F Eukaryota T 6sio 2 B P P53_HUMAN ANTIGEN NY-CO-13,PHOSPHOPROTEIN P53,TUMOR SUPPRESSOR P53 KLMFKTEGPDSD 12 T 44 CopB pdbhh F Eukaryota T 6sip 2 B P P53_HUMAN ANTIGEN NY-CO-13,PHOSPHOPROTEIN P53,TUMOR SUPPRESSOR P53 KLMFKTEGPDSD 12 T 44 CopB pdbhh F Eukaryota T 6siq 2 B P P53_HUMAN ANTIGEN NY-CO-13,PHOSPHOPROTEIN P53,TUMOR SUPPRESSOR P53 KLMFKTEGPDSD 12 T 44 CopB pdbhh F Eukaryota T 6sjl 2 D,E,F D,E,F A0A377LA80_ECOLX Putative type VI secretion protein MTKYQGYDVTDATHKTSIHNDWKVVVAKKKPARGVTLTIGIFFDGTGNNRENTASRLMKFNECSAARQGVNQKDAQSCEDFLKEINKNSISNGSYRGYYSNIHWLNILYHPDQVLKKDQTSAQIKTYISGIGTAAGEADSVIGMGLGTSILDIFEGVVTKTDEAMERITQALSEFMGFNLSPDFCIAKIQFDVFGFSRGAAAARHFANRVMEQDPAIARAIAKGLRGDFYDGKPSGEVRFLGLFDTVAAIGGISNFFDINGRSNPGVKLELRPSVAKKVFQITAMNEYRYNFSLNSIKGMWPELALPGAHSDIGGGYNPVGSPLQENESLFLSCPEFEIVSDDTREMDTRVYRKAEQVRKMLMTLPALKHILPHGKLTTKIRSIGVNNSNQRRAGVIQKQVGAAVFFERMAVPNDWANVCLRVMLDAAQEAGVLFEPIRQTNTELQLPSELIFLADKAIAQGKAVRLGQEPQAFTEEELYIIGKYTHCSANWNIESDGNLWVDPTTGEIFIHRFGPKGNKAFVFPNKPNDRWIRSVWYMDDQQRLNDNAVKNTKVMMSGV 560 T 1.3E-11 DUF2235 pdbpssm F Bacteria T 6sjt 1 A,B AAA,BBB A0A0C9MKT2_LEGPN NttC MAHHHHHHVDDDDKMAPAYLTTHNRTGEESNAYIAGSIPSLYPTAAYSTNQVYWNLVRLACYGHTTNGQCPALIKMATNTANPIDIGYVTMDLNTGDITPKTLSAKGYSLRVIGPGEAEITKN 123 T 0.96 DUF6488 unphh F Bacteria T 6sjw 1 A A FRPC_NEIMC Iron-regulated protein FrpC PLALDLDGDGIETVATKGFSGSLFDHNRDGIRTATGWVSADDGLLVRDLNGNGIIDNGAELFGDNTKLADGSFAKHGYAALAELDSNGDNIINAADAAFQSLRVWQDLNQDGISQANELRTLEELGIQSLDLAYKDVNKNLGNGNTLAQQGSYTKTNGTTAKMGDLLLAADNLHSRFLE 179 T 0.08 SdrD_B pdb F Bacteria T 6sjx 1 A A FRPC_NEIMC Iron-regulated protein FrpC GSDALALDLDGDGIETVATKGFSGSLFDHNRDGIRTATGWVSADDGLLVRDLNGNGIIDNGAELFGDNTKLADGSFAKHGYAALAELDSNGDNIINAADAAFQSLRVWQDLNQDGISQANELRTLEELGIQSLDLAYKDVNKNLGNGNTLAQQGSYTKTNGTTAKMGDLLLAADNLHSRFLE 182 T 0.083 SdrD_B pdb F Bacteria T 6sjz 2 C,D E,F AIFM3_HUMAN APOPTOSIS-INDUCING FACTOR-LIKE PROTEIN XGNCFSKPR 9 T 0.062 NifU unphh F Eukaryota T 6sk2 2 C,D D,F AIFM3_HUMAN APOPTOSIS-INDUCING FACTOR-LIKE PROTEIN XGKSFSKPR 9 T 0.062 NifU unphh F Eukaryota T 6sk3 2 C,D C,D AIFM3_HUMAN Apoptosis-inducing factor 3 GNCFSKPR 8 T 0.062 NifU unphh F Eukaryota T 6sk8 2 C,D C,D AIFM3_HUMAN Apoptosis-inducing factor 3 GDCFSKPR 8 T 0.062 NifU unphh F Eukaryota T 6skg 65 QB Bq Unknown ribosomal protein AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 34 T 2500 Porin_4 pdbhh F F 6ski 1 A F A0A377LA80_ECOLX Putative type VI secretion protein MTKYQGYDVTDATHKTSIHNDWKVVVAKKKPARGVTLTIGIFFDGTGNNRENTASRLMKFNECSAARQGVNQKDAQSCEDFLKEINKNSISNGSYRGYYSNIHWLNILYHPDQVLKKDQTSAQIKTYISGIGTAAGEADSVIGMGLGTSILDIFEGVVTKTDEAMERITQALSEFMGFNLSPDFCIAKIQFDVFGFSRGAAAARHFANRVMEQDPAIARAIAKGLRGDFYDGKPSGEVRFLGLFDTVAAIGGISNFFDINGRSNPGVKLELRPSVAKKVFQITAMNEYRYNFSLNSIKGMWPELALPGAHSDIGGGYNPVGSPLQENESLFLSCPEFEIVSDDTREMDTRVYRKAEQVRKMLMTLPALKHILPHGKLTTKIRSIGVNNSNQRRAGVIQKQVGAAVFFERMAVPNDWANVCLRVMLDAAQEAGVLFEPIRQTNTELQLPSELIFLADKAIAQGKAVRLGQEPQAFTEEELYIIGKYTHCSANWNIESDGNLWVDPTTGEIFIHRFGPKGNKAFVFPNKPNDRWIRSVWYMDDQQRLNDNAVKNTKVMMSGV 560 T 1.3E-11 DUF2235 pdbpssm F Bacteria T 6skj 2 C,D C,D AIFM3_HUMAN APOPTOSIS-INDUCING FACTOR-LIKE PROTEIN GNCFSKPR 8 T 0.062 NifU unphh F Eukaryota T 6skw 1 A,B AAA,BBB A0A4Q5N6R9_LEGPN NttE MAHHHHHHVDDDDKMNSDDNADGLIFSPLPQNKNTVVRHYSNEQEMPNLSQMAQRTIDFPTQIVRVSGNLTGLELSCDDVENEIDQVFSKKISPNLFTYNTYVSCGYDVNDPEQHATNFSIQSYFDPLTDNAVDYLKSYLKEYNGYNLFNTTTLQIENAKGIIVSMNLNAGLKSNPDKTPFTLYRQDRNNFYFKSNFDVRKELISDIYQRFYSNDPDMILPFFDKWIFSYAGSVYYSILMASNYLELQPERIFVMENEGDIFVSDLRYYFANLCMKRNPNKHCL 284 T 0.07 FAM117 pdbpercent F Bacteria T 6sl5 13 M O PsaO LRVDPIVPAISFVGWTLPSNIGTSALNGQSLFGAFYESIGQNLAHWPTGFALDDKFWLYMVTWHTGLFIVMLLGQVGFKGRTEDYF 86 T 23 YWFCY pdbhh F T 6slg 2 B B ERK-tide AALAF 5 T 200 DUF433 pdbhh F F 6sli 3 C,I P,H ALA-SER-THR-THR-GLY-GLY-ASN-SER-GLN-ARG-GLY-SER-GLY ASTTGGNSQRGSG 13 T 56 CtsR pdbhh F T 6sli 4 F,L E,K ASTTGGNSQRGGG ASTTGGNSQRGGG 13 T 5.1 CtsR pdbhh F T 6slj 3 E P ALA-SER-THR-THR-GLY-ALA-ASN-SER-GLN-ARG-GLY-SER-GLY ASTTGANSQRGSG 13 T 56 CtsR pdbhh F T 6slj 4 F Q ALA-SER-THR-THR-GLY-ALA-ASN-SER-GLN-ARG ASTTGANSQR 10 T 40 Orbi_NS3 pdbhh F T 6sln 3 E,F P,Q GLN-THR-ALA-GLY-ALA-ASN-SER-GLN-ARG-GLY-SER-ALA-GLY QTAGANSQRGSAG 13 T 21 Ice_nucleation pdbhh F T 6slv 2 B P P53_HUMAN ANTIGEN NY-CO-13,PHOSPHOPROTEIN P53,TUMOR SUPPRESSOR P53 KLMFKTEGPDSD 12 T 44 CopB pdbhh F Eukaryota T 6slw 2 B P WWTR1_HUMAN TRANSCRIPTIONAL COACTIVATOR WITH PDZ-BINDING MOTIF RSHSSPASLQLGT 13 T 0.13 TFIIA unppercent F Eukaryota T 6slx 2 B P WWTR1_HUMAN TAZpS89 RSHSSPASLQLGT 13 T 0.13 TFIIA unppercent F Eukaryota T 6sm3 3 C C GLY-GLY-ALA-THR-THR-ALA-THR-THR-THR-THR-SER-THR-SER GGATTATTTTSTS 13 T 66 Lys_export pdbhh F F 6sml 3 C C GLY-THR-GLY-GLY-SER-THR-GLY-THR-THR-SER-ALA-GLY GTGGSTGTTSAG 12 T 1.3 Trp_dioxygenase pdbhh F F 6smq 3 C C SER-GLY-ALA-THR-THR-ALA-THR-THR-THR-THR-SER-ASN-SER SGATTATTTTSNS 13 T 120 DUF763 pdbhh F F 6snt 80 BC NC YEG7_YEAST Uncharacterized protein YEL057C MANDGIQRNDNRKGFKTVQFSAYSKEIDVIMKKISFLERNITQQLDTLPHFPKTLPPNHKDCVSRKHRARRGWSSQLKNLLGIYSKEEIFTLDNLAATLHDQVLKLQATLFPNAILKQVHLDNANIENKRILKEITYKYLSNENCKEENKFGTFIVKRIFFGDLSLGVSVLINRIAFESATSSIMVVRSSFIESDFFYEDYLIFDCRAKRRKKLKRKILFISTTMNFNYQTKV 233 T 1.8 DpnI_C unppssm F Eukaryota T 6sok 2 E,F,G,H P,Q,R,S Twin-Strep-tag peptide XSAWSHPQFEKGGGSGGGSGGSAWSHPQFEKX 32 T 3.1 PNPase_C pdbhh F T 6sos 2 E,F P,R Twin-Strep-tag peptide XSAWSHPQFEKGGGSGGGSGGSAWSHPQFEKX 32 T 3.1 PNPase_C pdbhh F T 6spw 2 B,C,D B,C,D ARC3140 XDDDDDK 7 T 80 Paralemmin pdbhh F F 6spx 2 B B ARC1502 XDDDDDK 7 T 80 Paralemmin pdbhh F F 6spz 2 B,C P,Q KCNJ2_HUMAN ARG-ARG-GLU-SER-GLU-ILE RRESEI 6 T 160 hNIFK_binding pdbhh F Eukaryota F 6sqk 2 C,D D,E H4-7 SGXGKGG 7 T 8.8 CTP_synth_N pdbhh F F 6sr6 2 B,D B,D G0RYD6_CHATD Putative ribosome associated protein GPAMNATVVSLPLPTLPEGWAAEKDFKAIGKLTQEGSSMRTLEPVGPHFLAHARRVRHKRTFS 63 T 5.6 Suv3_N pdbhh F Eukaryota T 6sri 1 A,LA,M A,3,a Unassigned secondary structure elements (central region, proposed FANCB-FAAP100: chain A,a; base region, proposed FANCC-FANC-E-FANCF: chain 3) XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 76 F F F 6sri 2 B,N B,b Unassigned secondary structure elements (central region, proposed FANCB-FAAP100) XXXXXXXXXXXXXXXXXXXXXXXXX 25 F F F 6sri 3 C,O C,c Unassigned secondary structure elements (central region, proposed FANCB-FAAP100) XXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 30 F F F 6sri 4 D,P D,d Unassigned secondary structure elements (central region, proposed FANCB-FAAP100) XXXXXXXXXXXXXXXXXXXXXXX 23 F F F 6sri 5 E,Q E,e Unassigned secondary structure elements (central region, proposed FANCB-FAAP100) XXXXXXXXX 9 F F F 6sri 6 F,NA,R F,Z,f Unassigned secondary structure elements (central region, proposed FANCB-FAAP100: chain F,f; base region, proposed FANCC-FANC-E-FANCF: chain Z) XXXXXXXXXXXXXXX 15 F F F 6sri 7 G,S G,g Unassigned secondary structure elements (central region, proposed FANCB-FAAP100) XXXXXXXXXXXXXXXXXXXXXXXXXX 26 F F F 6sri 8 H,J,T,V H,J,h,j Unassigned secondary structure elements (central region, proposed FANCB-FAAP100) XXXXXXXXXXXXXXXXX 17 F F F 6sri 9 I,U I,i Unassigned secondary structure elements (central region, proposed FANCB-FAAP100) XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 37 F F F 6sri 10 K,W K,k Unassigned secondary structure elements (central region, proposed FANCB-FAAP100) XXXXXXXXXXXXXXXXXXXXX 21 F F F 6sri 11 L,X L,l Unassigned secondary structure elements (central region, proposed FANCB-FAAP100) XXXXXXXXXXXXXXXXXX 18 F F F 6sri 12 AA,Y m,M Unassigned secondary structure elements (proposed FANCB) XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 120 F F F 6sri 13 BA,Z n,N Unassigned secondary structure elements (proposed FAAP100) XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 43 F F F 6sri 14 CA,QA o,O Unassigned secondary structure elements (proposed FANCB) XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 279 F F F 6sri 15 DA,RA p,P Unassigned secondary structure elements (proposed FAAP100) XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 276 F F F 6sri 16 EA U Unassigned secondary structure elements (base region, proposed FANCC-FANC-E-FANCF) XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 227 F F F 6sri 17 FA V Unassigned secondary structure elements (base region, proposed FANCC-FANC-E-FANCF) XXXXXXXXXXXXXXXXXXXXXXXXXXX 27 F F F 6sri 18 GA W Unassigned secondary structure elements (base region, proposed FANCC-FANC-E-FANCF) XXXXXXXXXXXXXXXXXXXX 20 F F F 6sri 19 HA X Unassigned secondary structure elements (base region, proposed FANCC-FANC-E-FANCF) XXXXXXXXXXXXX 13 F F F 6sri 20 IA Y Unassigned secondary structure elements (base region, proposed FANCC-FANC-E-FANCF) XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 84 F F F 6sri 21 JA 1 Unassigned secondary structure elements (base region, proposed FANCC-FANC-E-FANCF) XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 109 F F F 6sri 22 KA 2 Unassigned secondary structure elements (base region, proposed FANCC-FANC-E-FANCF) XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 62 F F F 6sri 23 MA 4 Unassigned secondary structure elements (base region, proposed FANCC-FANC-E-FANCF) XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 71 F F F 6sri 25 PA T base region, proposed FANCF XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 201 F F F 6sri 26 SA R Unassigned secondary structure elements (top region, proposed FANCG) XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 285 F F F 6sri 27 TA S Unassigned secondary structure elements (top region, proposed FANCG) XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 145 F F F 6srs 1 A,M A,a Unassigned secondary structure elements (central region, proposed FANCB-FAAP100) XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 76 F F F 6srs 2 B,N B,b Unassigned secondary structure elements (central region, proposed FANCB-FAAP100) XXXXXXXXXXXXXXXXXXXXXXXXX 25 F F F 6srs 3 C,O C,c Unassigned secondary structure elements (central region, proposed FANCB-FAAP100) XXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 30 F F F 6srs 4 D,P D,d Unassigned secondary structure elements (central region, proposed FANCB-FAAP100) XXXXXXXXXXXXXXXXXXXXXXX 23 F F F 6srs 5 E,Q E,e Unassigned secondary structure elements (central region, proposed FANCB-FAAP100) XXXXXXXXX 9 F F F 6srs 6 F,R F,f Unassigned secondary structure elements (central region, proposed FANCB-FAAP100) XXXXXXXXXXXXXXX 15 F F F 6srs 7 G,S G,g Unassigned secondary structure elements (central region, proposed FANCB-FAAP100) XXXXXXXXXXXXXXXXXXXXXXXXXX 26 F F F 6srs 8 H,J,T,V H,J,h,j Unassigned secondary structure elements (central region, proposed FANCB-FAAP100) XXXXXXXXXXXXXXXXX 17 F F F 6srs 9 I,U I,i Unassigned secondary structure elements (central region, proposed FANCB-FAAP100) XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 37 F F F 6srs 10 K,W K,k Unassigned secondary structure elements (central region, proposed FANCB-FAAP100) XXXXXXXXXXXXXXXXXXXXX 21 F F F 6srs 11 L,X L,l Unassigned secondary structure elements (central region, proposed FANCB-FAAP100) XXXXXXXXXXXXXXXXXX 18 F F F 6srs 12 FA,Y m,M Unassigned secondary structure elements (proposed FANCB) XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 120 F F F 6srs 13 GA,Z n,N Unassigned secondary structure elements (proposed FAAP100) XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 43 F F F 6srs 14 AA,HA O,o Unassigned secondary structure elements (proposed FANCB) XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 279 F F F 6srs 15 BA,IA P,p Unassigned secondary structure elements (proposed FAAP100) XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 276 F F F 6srs 16 CA,JA R,r Unassigned secondary structure elements (proposed FANCG) XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 285 F F F 6srs 17 DA,KA S,s Unassigned secondary structure elements (proposed FANCG) XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 145 F F F 6swg 2 C C TASOR_HUMAN Protein TASOR MSETTERTVLGEYNLFSRKIEEILKQKNVSYVSTVSTPIFSTQEKMKRLSEFIYSKTSKAGVQEFVDGLHEKLNTIIIKASAK 83 T 0.021 ERAP1_C pdb F Eukaryota T 6swy 1 A 5 VID28_YEAST GLUCOSE-INDUCED DEGRADATION PROTEIN 5 MTVAYSLENLKKISNSLVGDQLAKVDYFLAPKCQIFQCLLSIEQSDGVELKNAKLDLLYTLLHLEPQQRDIVGTYYFDIVSAIYKSMSLASSFTKNNSSTNYKYIKLLNLCAGVYPNCGFPDLQYLQNGFIQLVNHKFLRSKCKIDEVVTIIELLKLFLLVDEKNCSDFNKSKFMEEEREVTETSHYQDFKMAESLEHIIVKISSKYLDQISLKYIVRLKVSRPASPSSVKNDPFDNKGVDCTRAIPKKINISNMYDSSLLSLALLLYLRYHYMIPGDRKLRNDATFKMFVLGLLKSNDVNIRCVALKFLLQPYFTEDKKWEDTRTLEKILPYLVKSFNYDPLPWWFDPFDMLDSLIVLYNEITPMNNPVLTTLAHTNVIFCILSRFAQCLSLPQHNEATLKTTTKFIKICASFAASDEKYRLLLLNDTLLLNHLEYGLESHITLIQDFISLKDEIKETTTESHSMCLPPIYDHDFVAAWLLLLKSFSRSVSALRTTLKRNKIAQLLLQILSKTYTLTKECYFAGQDFMKPEIMIMGITLGSICNFVVEFSNLQSFMLRNGIIDIIEKMLTDPLFNSKKAWDDNEDERRIALQGIPVHEVKANSLWVLRHLMYNCQNEEKFQLLAKIPMNLILDFINDPCWAVQAQCFQLLRNLTCNSRKIVNILLEKFKDVEYKIDPQTGNKISIGSTYLFEFLAKKMRLLNPLDTQQKKAMEGILYIIVNLAAVNENKKQLVIEQDEILNIMSEILVETTTDSSSYGNDSNLKLACLWVLNNLLWNSSVSHYTQYAIENGLEPGHSPSDSENPQSTVTIGYNESVAGGYSRGKYYDEPDGDDSSSNANDDEDDDNDEGDDEGDEFVRTPAAKGSTSNVQVTRATVERCRKLVEVGLYDLVRKNITDESLSVREKARTLLYHMDLLLKVK 921 T 0.0017 HEAT_2 unppercent F Eukaryota T 6syf 2 E,G,I,K E,G,I,K ACE-LEU-ARG-LEU-ARG-GLY-CYS XLRLRGC 7 T 2.7 DUF1027 pdbhh F F 6syf 3 F,H,J,L F,H,J,L ACE-ILE-LYS-GLN-GLU XIKQE 5 T 140 Protein_K pdbhh F F 6syi 2 B B PB1-11 DYNPYLLFLK 10 T 0.3 Flu_PB1 pdbhh F T 6syj 1 A,B,C A,B,C ProM2 containing collagen model peptide. XPPGPPGPPGPPGPPGPRGPPGXGPPGPPGPPG 33 T 0.00064 Collagen pdbpercent F F 6sz9 2 B B Q5ZYC7_LEGPH IcmP (DotM) MYIEMAQQQQQSGSDNSMAPVWIVILLFITAYFVWALAHQYIVSFVFTINIWQARLVNLFLNNQLLANQIYLMQTLDPNTVNWDQMVTVMRAVGDYMRYPVICILVVLAFVLYNSNVTLKYRKTYDMKSLRAQEQFNWPAIMPIVKEDLVSQDVNKGPWAMALTPMEFARKYNLLRKDDALLDNPVPGEEMTAGIRRGDAKRVFTMQLGPYWDGFERCSPQAYALSAVFMARMNRDRDAANNILKVLDKTFVDGKPDFSVARPVMKKYQNSELVQEVVAKHAYVLTVIASLLEAAREDGVVPSSEFLWLKPVDRRLWYMLNCVGRQTPYSEVAGPFAHWKAEKEMGRRSLVPMIDEAIRALEIAVKEVRLTPRQMEELEP 380 T 0.031 B277 pdb F Bacteria T 6sz9 4 D D Q5ZV91_LEGPH DotZ MDEIKKDDELSQWLSTYGTITAERILGRYNISLPQDEILEAINIPSSFYRHLLQIPLKNVLNGIVIQQASDYHVYAQKLLIDYLLSGESSKEPDSQGAGTRESLEDERQRLVQLGDEFHKLELEQDNLIASSQASLMKISIDWNTKLETTLSKLNSLYKNTNSKIKKNAIRKALIKAFIHCDLVKDQSQKNKYQLIDKLNQTLAVSVGAELKESILTNLSELFQILEALNTKLDEFTDRTNHLSQQAKSFRTQFYEVILRIIELIKLLPEYKIDPAQDAINREPLYFDRTIGER 294 T 0.0097 EAP30 pdbpssm F Bacteria T 6sz9 5 E E Q5ZYR7_LEGPH DotY MPKYTLPTRDALLKAMQVGETSIEAAEYMATRFEQILTKAKLLPECNDMLEKIKEYAQFVKFKLLSSAQVWSGQERPTSDYQNTQENKAEFLASHLEGLPSGLKLEVAIGDDAKILRGFSSNGKMVEGDQLKTMDGLLEGWLAKNSLAISGGAVVKIDNTGNQTKVDPQEIRQLINDSEKGVAKYFADKGVGMEVAQRTYQEPKALETKREEIRQEIESGAEAPTTQSIR 230 T 0.019 GPW_gp25 pdbpercent F Bacteria T 6szs 55 CB y YQKK_BACSU Uncharacterized protein YqkK MAKSQAKKKRGHRLRNGGRDVLLSRGSTPSFSTHGRMTKSKKEILNKRKHKNPYDHTAVDDKDFFVPQKAA 71 T 0.068 DUF5988 pdbpercent F Bacteria T 6t0b 22 FA,SA l,y COX26_YEAST Cox26 MFFSQVLRSSARAAPIKRYTGGRIGESWVITEGRRLIPEIFQWSAVLSVCLGWPGAVYFFSKARKA 66 T 0.077 Phage_holin_Dp1 unppercent F Eukaryota T 6t15 22 FA l COX26_YEAST COX26; SYNONYM: Uncharacterized protein YDR119W-A MFFSQVLRSSARAAPIKRYTGGRIGESWVITEGRRLIPEIFQWSAVLSVCLGWPGAVYFFSKARKA 66 T 0.077 Phage_holin_Dp1 unppercent F Eukaryota T 6t1v 2 B C PRGC1_HUMAN PPARGC-1-ALPHA,LIGAND EFFECT MODULATOR 6 EEPSLLKKLLLAPA 14 T 7 Cnl2_NKP2 pdbhh F Eukaryota T 6t1y 2 F,G,H,I,J F,G,H,I,J phalloidin WXAXCPA 7 T 3.6 DUF6083 pdbhh F F 6t20 2 F,G,H,I,J F,G,H,I,J phalloidin WXAXCPA 7 T 3.6 DUF6083 pdbhh F F 6t21 1 A,B A,B MCRA_ECOLI ECOKMCRA GHHHHHHEFMHVFDNNGIELKAECSIGEEDGVYGLILESWGPGDRNKDYNIALDYIIERLVDSGVSQVVVYLASSSVRKHMHSLDERKIHPGEYFTLIGNSPRDIRLKMCGYQAYFSRTGRKEIPSGNRTKRILINVPGIYSDSFWASIIRG 152 T 0.3 DUF5616 pdbpssm F Bacteria T 6t22 1 A,B A,B MCRA_ECOLI EcoKMcrA modification dependent restriction endonuclease GHHHHHHEFMHVFDNNGIELKAECSIGEEDGVYGLILESWGPGDRNKDYNIALDYIIERLVDSGVSQVVVYLASSSVRKHMHSLDERKIHPGEYFTLIGNSPRDIRLKMCGYQAYFSRTGRKEIPSGNRTKRILINVPGIYSDSFWASIIRG 152 T 0.3 DUF5616 pdbpssm F Bacteria T 6t25 2 F,G,H,I,J F,G,H,I,J phalloidin WXAXCPA 7 T 3.6 DUF6083 pdbhh F F 6t2d 2 B B Stapled peptide GAR300-Gp LTFEQYWAQLESAA 14 T 0.7 PBP-Tp47_a pdbhh F T 6t2e 2 B B Stapled peptide GAR300-Gm LTFEQYWAQLESAA 14 T 0.7 PBP-Tp47_a pdbhh F T 6t2f 2 B B MDM2 in complex with GAR300-Am LTFDQYWAQLDSAA 14 T 0.15 PBP-Tp47_a pdbhh F T 6t33 1 A A F8QV07_RUMGN Ruminococcin C WGCVCSGSTAVANSHNAGPAYCVGYCGNNGVVTRNANANVAKTA 44 T 2.6 EPV_E5 pdbhh F Bacteria T 6t3o 1 A A MYOM1_HUMAN 190 KDA CONNECTIN-ASSOCIATED PROTEIN,190 KDA TITIN-ASSOCIATED PROTEIN,MYOMESIN FAMILY MEMBER 1 MGSSHHHHHHSSGLVPRGSHMKSELAVEILEKGQVRFWMQAEKLSGNAKVNYIFNEKEIFEGPKYKMHIDRNTGIIEMFMEKLQDEDEGTYTFQLQDGKATNHSTVVLVGDVFKKLQKEAEFQRQEWIRKQG 132 T 0.00041 V-set pdb F Eukaryota T 6t46 2 B,D,F,H B,D,F,H E9RIY7_BACNA Quorum-sensing secretion protein (processed) MKKINGWIVVALLAVTTVGAAAAIQYTNNADSPGQFQVAQKGMY 44 T 0.0046 PhrC_PhrF pdb F Bacteria T 6t4q 81 CC 8 nascent chain ESWMER 6 T 5.7 U3_snoRNA_assoc pdbhh F F 6t59 49 WA NI Nascent polypeptide-associated complex subunit alpha N-terminal region AAAAAAAAAALAAAAAPAAAAAAAAAAAA 29 T 590 Campylo_MOMP pdbhh F F 6t7t 81 DC A nascent chain AKKK 4 T 390 DUF5415 pdbhh F F 6t7v 2 B I NF2L2_HUMAN LEU-ASP-PRO-GLU-THR-GLY-GLU-PHE-LEU LDPETGEFL 9 T 0.0068 DUF4585 pdbhh F Eukaryota T 6t7y 2 B B DP2L_PYRAB cPIP motif from the DP2 large subunit of PolD KKRVISLEEFFS 12 T 1.9 Med29 pdbhh F Archaea T 6t7z 2 B B ACE-CYS-ASA-4FB-GLU-THR-GLY-GLU-CYS-NH2 XCXXETGECX 10 T 0.043 ASTN_1_2_N pdbhh F F 6t80 2 E,F,G,H E,F,G,H SNAT_HUMAN AANAT peptide GSGSLRRNSGCG 12 T 26 YopE pdbhh F Eukaryota T 6t84 1 A A A0QQF4_MYCS2 Uncharacterized protein GHMIDRRRGLGRRRKSWAKSHGFDYEYESEDLLKRWKRGVMSTVGDVTAKNVVLGQIRGEAVFIFDIEEVATVIALHRKVGTNVVVDLRLKGLKEPRENDIWLLGAIGPRMVYSTNLDAARRACDRRMVTFAHTAPDCAEIMWNEQNWTLVAMPVTSNRAQWDEGLRTVRQFNDLLRVLPPVPQNAS 187 T 0.51 PepSY_TM unppssm F Bacteria T 6t9i 12 L U unassigned sequence AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 155 T 16000 zf_CCCH_4 pdbhh F F 6t9k 11 K U unassigned sequence AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 155 T 16000 zf_CCCH_4 pdbhh F F 6t9m 2 B BBB Peptide in active site GPAMK 5 T 120 Endotoxin_C pdbhh F F 6t9q 1 A A TIM_HUMAN HTIM DPGTHIVLWTGDQELELQRLFEEFRDSDDVLGHIMKNITAKRSRARIVDKLLALGLVAERRELYKKR 67 T 0.0043 Myb_DNA-bind_6 pdbpercent F Eukaryota T 6taz 1 A B TIM_HUMAN HTIM DPSRRAPTWSPEEEAHLRELYLANKDVEGQDVVEAILAHLNTVPRTRKQIIHHLVQMGLADSVKDFQRKGTHIVLWTGDQELELQRLFEEFRDSDDVLGHIMKNITAKRSRARIVDKLLALGLVAERRELYKKRQKKLASS 141 T 0.003 DEP pdbpssm F Eukaryota T 6tb4 13 M B Transcriptional adapter 3 (Ada3) XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 76 F F F 6tb9 2 DA,EA,FA,GA,HA,JA,KA,LA,MA,NA,OA E3,D3,A3,C3,B3,A1,E2,D2,A2,C2,B2 Head spike base Rcc01079 MDVFAKHAVSLESPAVRHYEITPSDSTDLARRPRALRVQTGGTLVLRDETGITVTYTVFAGEILPVRPVRVLATGTTATAVGWE 84 T 0.26 DUF2835 pdbpssm F T 6tb9 3 IA,PA F3,F2 Head spike fiber Rcc01080 MIALGLGLGLAANGGPALRRYAVNGVAPVAVLDFERHFLSHPLALTRATSATYADALRAVQTAPADTPRYDYSTGKRALLLEASATNLLPNSAQFEAASWGKTRASVLANAALAPNGTMTADKLVEDTSNNSHFVARTGTQIAAGTSVTASIFVKAAERRWFALVTADSANAFRTTYFDLQTGTLGVVSQGAAGHVAQIVAAGNGWYRCSVTQTQAASGNFNFYPSVASANGATSYPGDGASGLYLWGAQLEAGAAVSSVIPTEAAAVTRAADLASVAVAAGSYDLRRVDAAGTAVTKGVAHPGGALTIGAGSLYLLSLFPAGAL 325 T 0.012 CBM_4_9 pdbpssm F T 6tba 2 AC,AF,AH,BC,BF,CC,CF,DA,DC,DF,EA,EC,FA,FF,GA,GF,HA,HF,IF,JA,JD,JF,KA,KD,KF,LA,LD,MA,MD,NA,ND,OA,PD,PG,QD,QG,RD,RG,SD,SG,TB,TD,TG,UB,UD,VB,VG,WB,WG,XB,XG,YG,ZB,ZE,ZG EM,DD,B7,DM,AD,AM,CD,E3,CM,BD,D3,BM,A3,AB,C3,EC,B3,DC,AC,A1,EI,CC,E2,DI,BC,D2,AI,A2,CI,C2,BI,B2,AG,E8,EH,D8,DH,A8,AH,C8,EN,CH,B8,DN,BH,AN,A6,CN,E7,BN,D7,A7,AL,ED,C7 D5AR33_RHOCB Uncharacterized protein MDVFAKHAVSLESPAVRHYEITPSDSTDLARRPRALRVQTGGTLVLRDETGITVTYTVFAGEILPVRPVRVLATGTTATAVGWE 84 T 0.26 DUF2835 pdbpssm F Bacteria T 6tba 3 BH,EF,FC,IA,LF,OD,PA,UG,VD,YB F7,FD,FM,F3,FC,FI,F2,F8,FH,FN D5AR34_RHOCB Uncharacterized protein MIALGLGLGLAANGGPALRRYAVNGVAPVAVLDFERHFLSHPLALTRATSATYADALRAVQTAPADTPRYDYSTGKRALLLEASATNLLPNSAQFEAASWGKTRASVLANAALAPNGTMTADKLVEDTSNNSHFVARTGTQIAAGTSVTASIFVKAAERRWFALVTADSANAFRTTYFDLQTGTLGVVSQGAAGHVAQIVAAGNGWYRCSVTQTQAASGNFNFYPSVASANGATSYPGDGASGLYLWGAQLEAGAAVSSVIPTEAAAVTRAADLASVAVAAGSYDLRRVDAAGTAVTKGVAHPGGALTIGAGSLYLLSLFPAGAL 325 T 0.012 CBM_4_9 pdbpssm F Bacteria T 6tbm 14 N N Spt8 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 400 F F F 6tbt 2 C,D C,D Apt48 peptide GPHGPRDWCLFGGP 14 T 1.5 Prog_receptor pdbhh F T 6tcb 1 A,B A,B Q9I0B9_PSEAE Uncharacterized protein PA2723 GHMDELFEEHLEIAKALFAQRLPYWCDVFLRPADQAFNAYLNARGQASTYLVLEGFDPVYVPRGCDLDAVRATARARARLREAGLGEDALPVLL 94 T 0.63 DUF2992 unphh F Bacteria T 6tch 2 B A DLY-NVA-PPN-KCJ-SEP-PPN-B3S-BAL-PPN-LYS XXXXXSXXXXKX 12 T 10 Glyoxalase_8 pdbhh F F 6tcj 2 C,D C,D Hybrid BTB-binding (HBP) peptide PGGFLCWDGRSIHEIPR 17 T 1.8 DUF1996 pdbhh F T 6tda 13 Q Q HTL1_YEAST CHROMATIN STRUCTURE-REMODELING COMPLEX PROTEIN HTL1 MSQNNTISSMNPERAYNNVTLKNLTAFQLLSQRENICELLNLVESTERHNSIINPERQRMSLEEMKKMLDALKNERKK 78 T 0.064 DUF4976 pdbpercent F Eukaryota T 6tda 19 W X Unknown protein XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 383 F F F 6tdb 2 E,F,G F,J,K C-terminal VEGFB167 peptide RKLRR 5 T 59 LEDGF pdbhh F F 6tdd 1 A,B A,B Q0B304_BURCM Beta-ketoacyl synthase GPGSMNKPTSSDGWKDDYLSRLSRLSKNQLMALALKLKQQQLEQG 45 T 4.2 LIN52 pdbhh F Bacteria T 6tdm 1 A,B A,B Q0B308_BURCM;Q0B309_BURCM Beta-ketoacyl synthase,Beta-ketoacyl synthase GPGSYDAALPIDELSALLRQEMGDDGGGSGGGSMQDIQQLLAKSLTEIKRLKAANQALEQARRE 64 T 0.00011 Docking unppssm F Bacteria T 6tdn 1 A,B A,B Q0B303_BURCM;Q0B304_BURCM Beta-ketoacyl synthase,Beta-ketoacyl synthase GPGSYAPLDTELSEIEGLQDDDLAALLGKEFIREGGGSGGGSGGGSMNKPTSSDGWKDDYLSRLSRLSKNQLMALALKLKQQQLEQG 87 T 8.5 DUF4266 pdbhh F Bacteria T 6tdu 2 B,T D,d ATPTB6 MTHAELHLFDLDEFMQTYKRLQTRQDWLIENKCKKSRLFSYVAAVIAFTVGKSATMSDEAILAKIDPYVTSEVRVQRGAWWRSGYFTKEEVEMMTPKGPIARYYKFLLGVRRFPLKHGALSWACGFVPAWLTFTSLNHWAQNRRLNRYLTQESVFGEMARELVRGKTADEATTSVMARVEKEILGVH 187 T 0.25 Phg_2220_C pdbpssm F T 6tdu 3 C,U E,e ATPTB12 MSSYTGAALAPKSERLRLAFEEKQKDHQKCIEEAKGKGLKKDELIDACAWTHRKTILALKDWFAYRPPFQDRRSKWAEYCSIRHDSGSWLGWSQKFF 97 T 0.074 Cofac_haem_bdg pdb F T 6tdu 4 D,V F,f ATP synthase subunit a MLNSNIYIIIYGGIIMYSIMIIIQMFLYNFSNKIYIEVEINKYILSKNNIDIYWIICNCTIIIIITTLNHIINKIGIYNMIEYNICYWLIGTGLGLYISPFIVFGYKFFVYIMDLNNYSLNIYHNNNKMNDIQQIYNGTNYNDTMIFFIKDINNIFTIYRSINFFMNWLYQMIYYGVRMWLVFVLHSFSLGSFGELITVITDNNLIFNVFYIGLLGLGFILYLIVIFYLGIQIYVYISFSLSFLHSTILLFLVNYIPHYNNKSIFNTFTNKSIY 274 T 4.4 DUF4514 pdbhh F T 6tdu 5 E,W G,g ATP synthase subunit b MPSTSPADKDVPMSILHTHGLSYVNWCMSLAPGLLVFEGFFRARYYRSRVPPSRTVLMNGLKMRMFSLARQQAPKIVHKPVLSPIPEHLRLVKNVAQVQIDMLKLLNAQAAK 112 T 0.097 TMEM33_Pom33 pdb F T 6tdu 6 F,X H,h ATP synthase subunit d MMRRACRIIRPSHVRGVSGVAPTIYLRSKAALPATSTTDVRPQLYALQRFAKAQLKTATEAERAAIEADIARYQEYLDSDLEKLKQDVAEDTAKKQKLIPLLDRYPDVPIEKIPEHANVLLKKIDACLEILSKDIGEVTDAEAHEMYFETSKFQILHIYTGCVASFPEGDVPPGAVECLPGQVIRTKVNGEDVMLEIDEVDPGYQVCWFKPDVPLPENAEILWSYPYEPTAALPTGTTWEEGQANVLIPAEPTPEAAVWPPTPVTNVYAPMAEKLALKSNPELKVLFKEALLQPAKLLPLDVDYQCSHDREVVEAKRDRYLTALVEAEQAPPLPFTPDVLQLQLEHNVLKGELIDRLRALEYTIVTEQLQARLHERRLRGDVIDEWEELDYHPLVRDDTYLAIDFGDPTFGRYIWKLFPHTDGDEECMFKDTRLDVLPPQVNPLNAILAQHTAQTPVHRSLEKRLWTEVRATAVSE 476 T 0.016 Apolipoprotein pdb F T 6tdu 7 G,Y I,i ATP synthase subunit f MAPLYPVLSQASLYKRHFFKNIKLFHVVFYVGAPCVTFGTAAWSGSNRNSREAIFMVIEERHGWDNFKKLSSHQQGVIMQEAAQESLLARNKGELHLP 98 T 10 Arm_3 pdbhh F T 6tdu 8 H,Z J,j ATP synthase subunit i/j MVYTNWQSSYTRLFVSKPWMWHPLAWMTLSVGIWWKFGKESLCNERSFYIHTHPKWAPHKFHTVYNWSRDPIKWTLAEQYASIIRNTNTDIEAVLKIKLPANAN 104 T 31 Spectrin_like pdbhh F T 6tdu 9 AA,I k,K ATP synthase subunit k MAFGRTRPTLSSPLVPVWNDLRALQVFTSQEYMQKRGPGFTNTLEYKLSCLNPVKWYDMMKVMPGGKAFVGTALGLALFGGWGVEFVKNISVMTKEKPPIDWNNEKLGHLTRS 113 T 9.8 Mt_ATP-synt_D pdbhh F T 6tdu 10 BA,J l,L ATP synthase subunit 8 LIPVSLVDLININIIFYILLLYTLLLFFIPLFLASINYTYHYIYKYYNYNYNFINNN 57 T 0.86 UPF0542 pdbhh F T 6tdu 11 CA,K m,M ATPEG1 MSLAKVWMYASWIPRGIPKAMANELSSAAAALAHPEAIARVAQLESQGKNPYRVARAEFWQMYLACWPYRFRNTVVEWETCKAKVLKGSVDLQDIVDLLYLLAWAYLFWILGEIYGRGSLYGYRFDGEIHRQEAQNVILYKEKEAQEMAVVMEKLEKEIQEWLKTMEQE 169 T 0.015 ATP-synt_G pdbhh F T 6tdu 12 DA,L n,N ATPEG2 MPLPNAVVQGYTSVRGPKRPLDHFYGRTPLNIDTLWHWVKFPHRYDNLRFAVCFWAFLVSAHFANKKQRNLRVEWEKNMEIQKKLHPSGLWSEEQAFAAAEKLGRPKAGHPMRVFEDGYQQFDLKPKLFDPDEEAHH 137 T 8.9 MF_alpha pdbhh F T 6tdu 13 EA,M o,O ATPEG3 MADHNKKDVGSWASPNEHLMFFDFSSWLLVDFGKRWERWVSFKKSFLTTTRSPYWSPQFFLLTFFQLRNSNVKLCENWNWAPKGDDFNLLHNSAAEPFGRDLKAHLEREAGAKHHH 116 T 2.4 BNR_6 pdbhh F T 6tdu 14 FA,N p,P ATPEG4 MGGDAHAAPAEKPDPALDATKALPKALEEVEFFQSYAVRRKTGFHLFNRATGSPTIVGPMFYNLYNFVRIGRVSKYVCWLSLPLVFQRMWMKNRATGMEYDIDLENYAPFEAKKNPMHGH 120 T 19 DUF3274 pdbhh F T 6tdu 16 HA,P r,R ATPEG6 MFGVTRKLLGELSEYVEVNEKGMPKPQALSLWNMPYAKRRALTKFARGVRWQFIVLFIALYNFKNRDDSHLLRRGAYN 78 T 7.5 DUF2845 pdbhh F T 6tdu 17 IA,Q s,S ATPEG7 MMRISRKLLVPVANFRPKKPWDGPWGIQISQKKDRPFIAMWILFPLLLVDHLTREYYAYWHSSKVPVTDVFGDF 74 T 17 Mem_trans pdbhh F T 6tdu 18 JA,R t,T ATPEG8 MGGKASEAVTIAFRFPHRTTFLVKQNVGQKLNKGHQTFWQLVAGGWLFFLLINRTSFKPKLAAPKV 66 T 2.8 SCIMP pdbhh F T 6tdu 26 XA,XB AN,BN inhibitor of F1 (IF1) MAAACAVRGFTTARPMLTPNKVKVPGRKPQDEEDLTWAEADRKLTPEERYARDKQMALLDKMTSQVEELEKSHTEQKKSNKGVKAQIEAISRQLEALKAQLKE 103 T 0.027 FlxA pdb F T 6tdu 29 JB,JC C,c ATPTB4 MFRGFRPVLAADAVKFQTLYNVLTGKQHLKDQVPVKDCNLTAIFGASWKADLNKWFDSEYAPKLPAAERDSAKKSLDLYLKRVDLTRYTREELTTYGILACGPGKVDALTEKHLLETGKARLEELTAGLGNKDEGVNAFRKEVEQEGKYANWPAEKSKALADKVIAASP 169 T 0.14 Hydantoinase_A pdbpssm F T 6tdv 3 C,V D,d ATPTB6 MTHAELHLFDLDEFMQTYKRLQTRQDWLIENKCKKSRLFSYVAAVIAFTVGKSATMSDEAILAKIDPYVTSEVRVQRGAWWRSGYFTKEEVEMMTPKGPIARYYKFLLGVRRFPLKHGALSWACGFVPAWLTFTSLNHWAQNRRLNRYLTQESVFGEMARELVRGKTADEATTSVMARVEKEILGVH 187 T 0.25 Phg_2220_C pdbpssm F T 6tdv 4 D,W E,e ATPTB12 MSSYTGAALAPKSERLRLAFEEKQKDHQKCIEEAKGKGLKKDELIDACAWTHRKTILALKDWFAYRPPFQDRRSKWAEYCSIRHDSGSWLGWSQKFF 97 T 0.074 Cofac_haem_bdg pdb F T 6tdv 5 E,X F,f subunit a MLNSNIYIIIYGGIIMYSIMIIIQMFLYNFSNKIYIEVEINKYILSKNNIDIYWIICNCTIIIIITTLNHIINKIGIYNMIEYNICYWLIGTGLGLYISPFIVFGYKFFVYIMDLNNYSLNIYHNNNKMNDIQQIYNGTNYNDTMIFFIKDINNIFTIYRSINFFMNWLYQMIYYGVRMWLVFVLHSFSLGSFGELITVITDNNLIFNVFYIGLLGLGFILYLIVIFYLGIQIYVYISFSLSFLHSTILLFLVNYIPHYNNKSIFNTFTNKSIY 274 T 4.4 DUF4514 pdbhh F T 6tdv 6 F,Y G,g subunit b MPSTSPADKDVPMSILHTHGLSYVNWCMSLAPGLLVFEGFFRARYYRSRVPPSRTVLMNGLKMRMFSLARQQAPKIVHKPVLSPIPEHLRLVKNVAQVQIDMLKLLNAQAAK 112 T 0.097 TMEM33_Pom33 pdb F T 6tdv 7 G,Z H,h subunit d MMRRACRIIRPSHVRGVSGVAPTIYLRSKAALPATSTTDVRPQLYALQRFAKAQLKTATEAERAAIEADIARYQEYLDSDLEKLKQDVAEDTAKKQKLIPLLDRYPDVPIEKIPEHANVLLKKIDACLEILSKDIGEVTDAEAHEMYFETSKFQILHIYTGCVASFPEGDVPPGAVECLPGQVIRTKVNGEDVMLEIDEVDPGYQVCWFKPDVPLPENAEILWSYPYEPTAALPTGTTWEEGQANVLIPAEPTPEAAVWPPTPVTNVYAPMAEKLALKSNPELKVLFKEALLQPAKLLPLDVDYQCSHDREVVEAKRDRYLTALVEAEQAPPLPFTPDVLQLQLEHNVLKGELIDRLRALEYTIVTEQLQARLHERRLRGDVIDEWEELDYHPLVRDDTYLAIDFGDPTFGRYIWKLFPHTDGDEECMFKDTRLDVLPPQVNPLNAILAQHTAQTPVHRSLEKRLWTEVRATAVSE 476 T 0.016 Apolipoprotein pdb F T 6tdv 8 AA,H i,I subunit f MAPLYPVLSQASLYKRHFFKNIKLFHVVFYVGAPCVTFGTAAWSGSNRNSREAIFMVIEERHGWDNFKKLSSHQQGVIMQEAAQESLLARNKGELHLP 98 T 10 Arm_3 pdbhh F T 6tdv 9 BA,I j,J subunit i/j MVYTNWQSSYTRLFVSKPWMWHPLAWMTLSVGIWWKFGKESLCNERSFYIHTHPKWAPHKFHTVYNWSRDPIKWTLAEQYASIIRNTNTDIEAVLKIKLPANAN 104 T 31 Spectrin_like pdbhh F T 6tdv 10 CA,J k,K subunit k MAFGRTRPTLSSPLVPVWNDLRALQVFTSQEYMQKRGPGFTNTLEYKLSCLNPVKWYDMMKVMPGGKAFVGTALGLALFGGWGVEFVKNISVMTKEKPPIDWNNEKLGHLTRS 113 T 9.8 Mt_ATP-synt_D pdbhh F T 6tdv 11 DA,K l,L subunit 8 LIPVSLVDLININIIFYILLLYTLLLFFIPLFLASINYTYHYIYKYYNYNYNFINNN 57 T 0.86 UPF0542 pdbhh F T 6tdv 12 EA,L m,M ATPEG1 MSLAKVWMYASWIPRGIPKAMANELSSAAAALAHPEAIARVAQLESQGKNPYRVARAEFWQMYLACWPYRFRNTVVEWETCKAKVLKGSVDLQDIVDLLYLLAWAYLFWILGEIYGRGSLYGYRFDGEIHRQEAQNVILYKEKEAQEMAVVMEKLEKEIQEWLKTMEQE 169 T 0.015 ATP-synt_G pdbhh F T 6tdv 13 FA,M n,N ATPEG2 MPLPNAVVQGYTSVRGPKRPLDHFYGRTPLNIDTLWHWVKFPHRYDNLRFAVCFWAFLVSAHFANKKQRNLRVEWEKNMEIQKKLHPSGLWSEEQAFAAAEKLGRPKAGHPMRVFEDGYQQFDLKPKLFDPDEEAHH 137 T 8.9 MF_alpha pdbhh F T 6tdv 14 GA,N o,O ATPEG3 MADHNKKDVGSWASPNEHLMFFDFSSWLLVDFGKRWERWVSFKKSFLTTTRSPYWSPQFFLLTFFQLRNSNVKLCENWNWAPKGDDFNLLHNSAAEPFGRDLKAHLEREAGAKHHH 116 T 2.4 BNR_6 pdbhh F T 6tdv 15 HA,O p,P ATPEG4 MGGDAHAAPAEKPDPALDATKALPKALEEVEFFQSYAVRRKTGFHLFNRATGSPTIVGPMFYNLYNFVRIGRVSKYVCWLSLPLVFQRMWMKNRATGMEYDIDLENYAPFEAKKNPMHGH 120 T 19 DUF3274 pdbhh F T 6tdv 17 JA,Q r,R ATPEG6 MFGVTRKLLGELSEYVEVNEKGMPKPQALSLWNMPYAKRRALTKFARGVRWQFIVLFIALYNFKNRDDSHLLRRGAYN 78 T 7.5 DUF2845 pdbhh F T 6tdv 18 KA,R s,S ATPEG7 MMRISRKLLVPVANFRPKKPWDGPWGIQISQKKDRPFIAMWILFPLLLVDHLTREYYAYWHSSKVPVTDVFGDF 74 T 17 Mem_trans pdbhh F T 6tdv 19 LA,S t,T ATPEG8 MGGKASEAVTIAFRFPHRTTFLVKQNVGQKLNKGHQTFWQLVAGGWLFFLLINRTSFKPKLAAPKV 66 T 2.8 SCIMP pdbhh F T 6tdw 2 B H subunit d MMRRACRIIRPSHVRGVSGVAPTIYLRSKAALPATSTTDVRPQLYALQRFAKAQLKTATEAERAAIEADIARYQEYLDSDLEKLKQDVAEDTAKKQKLIPLLDRYPDVPIEKIPEHANVLLKKIDACLEILSKDIGEVTDAEAHEMYFETSKFQILHIYTGCVASFPEGDVPPGAVECLPGQVIRTKVNGEDVMLEIDEVDPGYQVCWFKPDVPLPENAEILWSYPYEPTAALPTGTTWEEGQANVLIPAEPTPEAAVWPPTPVTNVYAPMAEKLALKSNPELKVLFKEALLQPAKLLPLDVDYQCSHDREVVEAKRDRYLTALVEAEQAPPLPFTPDVLQLQLEHNVLKGELIDRLRALEYTIVTEQLQARLHERRLRGDVIDEWEELDYHPLVRDDTYLAIDFGDPTFGRYIWKLFPHTDGDEECMFKDTRLDVLPPQVNPLNAILAQHTAQTPVHRSLEKRLWTEVRATAVSE 476 T 0.016 Apolipoprotein pdb F T 6tdw 4 D C ATPTB4 MFRGFRPVLAADAVKFQTLYNVLTGKQHLKDQVPVKDCNLTAIFGASWKADLNKWFDSEYAPKLPAAERDSAKKSLDLYLKRVDLTRYTREELTTYGILACGPGKVDALTEKHLLETGKARLEELTAGLGNKDEGVNAFRKEVEQEGKYANWPAEKSKALADKVIAASP 169 T 0.14 Hydantoinase_A pdbpssm F T 6tdw 6 F N subunit b MPSTSPADKDVPMSILHTHGLSYVNWCMSLAPGLLVFEGFFRARYYRSRVPPSRTVLMNGLKMRMFSLARQQAPKIVHKPVLSPIPEHLRLVKNVAQVQIDMLKLLNAQAAK 112 T 0.097 TMEM33_Pom33 pdb F T 6tdw 7 G T subunit 8 LIPVSLVDLININIIFYILLLYTLLLFFIPLFLASINYTYHYIYKYYNYNYNFINNNT 58 T 0.9 UPF0542 pdbhh F T 6tdy 8 N N inhibitor of F1 (IF1) MAAACAVRGFTTARPMLTPNKVKVPGRKPQDEEDLTWAEADRKLTPEERYARDKQMALLDKMTSQVEELEKSHTEQKKSNKGVKAQIEAISRQLEALKAQLKE 103 T 0.027 FlxA pdb F T 6tdy 10 Y h ATP synthase subunit d MMRRACRIIRPSHVRGVSGVAPTIYLRSKAALPATSTTDVRPQLYALQRFAKAQLKTATEAERAAIEADIARYQEYLDSDLEKLKQDVAEDTAKKQKLIPLLDRYPDVPIEKIPEHANVLLKKIDACLEILSKDIGEVTDAEAHEMYFETSKFQILHIYTGCVASFPEGDVPPGAVECLPGQVIRTKVNGEDVMLEIDEVDPGYQVCWFKPDVPLPENAEILWSYPYEPTAALPTGTTWEEGQANVLIPAEPTPEAAVWPPTPVTNVYAPMAEKLALKSNPELKVLFKEALLQPAKLLPLDVDYQCSHDREVVEAKRDRYLTALVEAEQAPPLPFTPDVLQLQLEHNVLKGELIDRLRALEYTIVTEQLQARLHERRLRGDVIDEWEELDYHPLVRDDTYLAIDFGDPTFGRYIWKLFPHTDGDEECMFKDTRLDVLPPQVNPLNAILAQHTAQTPVHRSLEKRLWTEVRATAVSE 476 T 0.016 Apolipoprotein pdb F T 6tdy 11 Z c ATPTB4 MFRGFRPVLAADAVKFQTLYNVLTGKQHLKDQVPVKDCNLTAIFGASWKADLNKWFDSEYAPKLPAAERDSAKKSLDLYLKRVDLTRYTREELTTYGILACGPGKVDALTEKHLLETGKARLEELTAGLGNKDEGVNAFRKEVEQEGKYANWPAEKSKALADKVIAASP 169 T 0.14 Hydantoinase_A pdbpssm F T 6tdz 2 B h subunit d MMRRACRIIRPSHVRGVSGVAPTIYLRSKAALPATSTTDVRPQLYALQRFAKAQLKTATEAERAAIEADIARYQEYLDSDLEKLKQDVAEDTAKKQKLIPLLDRYPDVPIEKIPEHANVLLKKIDACLEILSKDIGEVTDAEAHEMYFETSKFQILHIYTGCVASFPEGDVPPGAVECLPGQVIRTKVNGEDVMLEIDEVDPGYQVCWFKPDVPLPENAEILWSYPYEPTAALPTGTTWEEGQANVLIPAEPTPEAAVWPPTPVTNVYAPMAEKLALKSNPELKVLFKEALLQPAKLLPLDVDYQCSHDREVVEAKRDRYLTALVEAEQAPPLPFTPDVLQLQLEHNVLKGELIDRLRALEYTIVTEQLQARLHERRLRGDVIDEWEELDYHPLVRDDTYLAIDFGDPTFGRYIWKLFPHTDGDEECMFKDTRLDVLPPQVNPLNAILAQHTAQTPVHRSLEKRLWTEVRATAVSE 476 T 0.016 Apolipoprotein pdb F T 6tdz 3 C c subunit c MFRGFRPVLAADAVKFQTLYNVLTGKQHLKDQVPVKDCNLTAIFGASWKADLNKWFDSEYAPKLPAAERDSAKKSLDLYLKRVDLTRYTREELTTYGILACGPGKVDALTEKHLLETGKARLEELTAGLGNKDEGVNAFRKEVEQEGKYANWPAEKSKALADKVIAASP 169 T 0.14 Hydantoinase_A pdbpssm F T 6tdz 11 Z N inhibitor of F1 (IF1) MAAACAVRGFTTARPMLTPNKVKVPGRKPQDEEDLTWAEADRKLTPEERYARDKQMALLDKMTSQVEELEKSHTEQKKSNKGVKAQIEAISRQLEALKAQLKE 103 T 0.027 FlxA pdb F T 6te4 2 C C Pro-Pro-Leu-Ala-Ser-Lys PPLASK 6 T 170 DUF3629 pdbhh F F 6tf4 1 A A Q9KTB3_VIBCH Transcription/translation regulatory transformer protein RfaH GAMGEQLKHATKQLPEKGQTVRVARGQFAGIEAIYLEPDGDTRSIMLVKMISQQVPMSIENTDWEVT 67 T 2.8E-05 KOW pdbhh F Bacteria T 6tf9 1 A AP1 Helix 1 AAAAAAAAAAAAAAAAAAAAAAAAAAAAA 29 T 1400 DUF4699 pdbhh F F 6tf9 2 B,G,H,W CP1,HP1,IP1,XP1 Belt helices 1,2,3,4 AAAAAAAAAAAAA 13 T 220 K_channel_TID pdbhh F F 6tf9 3 C DP1 Belt helix 5 AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 31 T 1800 Chorion_S16 pdbhh F F 6tf9 4 D EP1 Belt helix 6 AAAAAAAAAAAAAAAAAAAAAAA 23 T 560 DUF4699 pdbhh F F 6tf9 5 E FP1 Belt helix 7 AAAAAAAAAAAAAAAAAA 18 T 330 Campylo_MOMP pdbhh F F 6tf9 6 F,K,M GP1,LP1,NP1 Belt helices 8,9,10 AAAAAAAAAAAAAAA 15 T 200 Campylo_MOMP pdbhh F F 6tf9 7 I,UA JP1,vP1 Belt helices 11,12 AAAAAAAAAAAAAA 14 T 250 Campylo_MOMP pdbhh F F 6tf9 8 J,N,O,TA KP1,OP1,PP1,uP1 Belt helices 13,14,15 and Helix 2 AAAAAAAAAAAAAAAA 16 T 240 Campylo_MOMP pdbhh F F 6tf9 9 L MP1 Belt helix 16 AAAAAAAAAAAAAAAAA 17 T 260 Adeno_PIX pdbhh F F 6tf9 16 FA gP1 Belt helix 17 AAAAAAAAAAAAAAAAAAA 19 T 410 Adeno_PIX pdbhh F F 6tfl 1 A,B,C,D,E,F,G,H,I,J,K,L,M,N A,B,C,D,E,F,G,H,I,J,K,L,M,N B0R5R2_HALS3 RNA-binding protein Lsm GAMSGRPLDVLEESLEETVTVRLKDGDEFTGVLTGYDQHMNVVIEGEDTTIIRGDNVVTIKP 62 T 0.087 LSM pdbpssm F Archaea T 6tg8 2 B PPP VAL-ILE-ASN-PRO-GLU-THR-GLY-GLU-GLN-ILE-GLN VINPETGEQIQ 11 T 0.17 2C_adapt pdbhh F T 6tgg 2 B P MUC1_HUMAN MUC-1,BREAST CARCINOMA-ASSOCIATED ANTIGEN DF3,CANCER ANTIGEN 15-3,CA 15-3,CARCINOMA-ASSOCIATED MUCIN,EPISIALIN,H23AG,KREBS VON DEN LUNGEN-6,KL-6,PEMT,PEANUT-REACTIVE URINARY MUCIN,PUM,POLYMORPHIC EPITHELIAL MUCIN,PEM,TUMOR-ASSOCIATED EPITHELIAL MEMBRANE ANTIGEN,EMA,TUMOR-ASSOCIATED MUCIN APDTRP 6 T 170 DDE_Tnp_1_assoc pdbhh F Eukaryota F 6tgx 2 C,D C,D MET-VAL-ASN-ALA MVNA 4 T 210 DUF2690 pdbhh F F 6th1 1 A R O57046_9BETA Immediate early protein 1 GPLGSEQQPGDRCPRHVARIIAENDPPIRCDLTLQELLSEVQVDFEPSASEVVAMEGLMDEQHFIPHDPHSKKAAVQSLVIAIKTADLLLQMIHENVKRDIRTTCIQMANESYARADIVRDSLIAASQGKYTALGKIVFHSYTNFMPVNANESEKRAWMEMLGECTSHGNKLCEMANAQVEQETRDIINIMFKNIDDVVTQTTRAMRGVFDPPDTVKALSAAAQLIRVWEHDNVINDQSVSTSSVVTAALEANENLAKALRDVSGYAEVQFNRLCLSILTSAKERIDIIYHSARSQHLACNVRMNVAQQNLATFILTNARERPNDAVIRTRRAVANTGILLFTGQHITRDALDKAAESKSVEEIVGMS 368 T 7.1 Herpes_IE1 pdbhh T Viruses T 6th7 2 C C Tutuilamide VXXXXXIXX 9 T 57 DUF2448 pdbhh F F 6th7 3 D D Tutuilamide XXXXVXIXX 9 T 150 KicB pdbhh F F 6thh 2 C C M9U4Y8_SULIS CRISPR-associated protein, CscA MRNLKRIVMGENKLIGLVRTALDSITLGQGVNEAKIKSPQSYAFHTISVGTISLDICKAIYSSSEIGRKQLENLSKKYNMPFEDLWFYGGFLHDWNKLSGKEESLENKEELTKKIIDKLKLPNEFLHGISTMAEGHLPDNLHLPLWVSIKLADMLLISDIGSVRDVFYFANSDSYRNAIEALKEYNLELNYVSSTFRLFTLIASKELLNDVFNEKSGYFPLISYADGIVFLKRKNSQPVLLSKIVDLLSRQVFSSSSEVIEEKISDIEKCIKNKEELFRQMNIDVKSAIYDEEGKVKQINAFLPTKVCKPFEDVVGNLDNKSKLQVAREVIERNRKDIPFGLLIYFVNKFSKNEEDYIRKGLGINEKSLKYLLNIGDVQKALDKILELLEKRYAEQSSDKTLLYYVKFSSSGNIIDDLPKITDRPNDYCVVCGMPIYSSNPVRFVQYASELGGRAEIWIPREKALDEIDNVRDDWKVCPICIYEANLMKDRVKPPYFIVTFYPGVPISLLNIIDFDFSQSSIKYYIDEEKDTYFTAFEKMGGRLEPYVKKVLPAYFSSKVIIKASEVSNFSLSTRLSKSELNKLLPYAPMISMIFLTSPVLISSNLYEMPIAHERVISITSTYNYTFMKSLNSNLLTLYSIFAYSAKYDAMRKICGRSDLDNCLGYLTEEMDLYSSVDPALGVLSIGMGVGTPIDTDEKFFSAFLPVSGYLLKVTGKVSKMGETLKSSIFSIAYALKDIIKSQKVSKYDVTGFLRDGVDMFFKTTSVIKDKEDRIGISVNAAISSLENKYALDDQHRAQVYSALQDIFKTLYSIEEESDRSLAISIANTLSNWLYIAYKLVLQGDKSLEHHHHHH 855 T 0.057 DUF2225 unphh F Archaea T 6thl 2 B B BCD1_YEAST Box C/D snoRNA protein 1 MRDSTECQRIIRRGVNCLMLPKGMQRSSQNRSKWDKTMDLFVWSVEWILCPMQEKGEKKELFKHVSHRIKETDFLVQGMGKNVFQKCCEFYRLAGTSSCIEGEDGSETKEERTQILQKSGLKFYTKTFPYNTTHIMDSKKLVELAIHEKCIGELLKNTTVIEFPTIFVAMTEADLPEGYEVLHQE 185 T 0.045 MobA_MobL unppssm F Eukaryota T 6ti2 1 A,B B,C CMU1_USTMA Chromosome 16, whole genome shotgun sequence MAAVSGKSEAAEIEAGDRLDALRDQLQRYETPIIQTILARSALGGRAPSEQDEVRAALSRNAFEPSEVISEWLQTESGARFRSTRPLPPAVEFITPVVLSRDTVLDKPVVGKGIFPIGRRPQDPTNMDEFLDTSLLSLNQSSTVDLASAVSLDVSLLHLVSARVLLGYPIALAKFDWLHDNFCHILTNTTLSKSQKLANIIQQLTDHKQEVNVLSRVEQKSKSLSHLFRNDIPYPPHTQDRILRLFQAYLIPITTQIEAAAILDHANKCTLEHHHHHH 278 T 0.26 CM_2 pdbpssm F Eukaryota T 6tid 1 A AAA B4EH86_BURCJ Lectin GHMPLLSASIVSAPVVTSETYVDIPGLYLDVAKAGIRDGKLQVILNVPTPYATGNNFPGIYFAIATNQGVVADGCFTYSSKVPESTGRMPFTLVATIDVGSGVTFVKGQWKSVRGSAMHIDSYASLSAIWGTAA 134 T 0.002 DUF1543 unppercent F Bacteria T 6tig 1 A,B,C AAA,BBB,CCC B4EH86_BURCJ Lectin GHMPLLSASIVSAPVVTSETYVDIPGLYLDVAKAGIRDGKLQVILNVPTPYATGNNFPGIYFAIATNQGVVADGCFTYSSKVPESTGRMPFTLVATIDVGSGVTFVKGQWKSVRGSAMHIDSYASLSAIWGTAA 134 T 0.002 DUF1543 unppercent F Bacteria T 6tip 2 C,D P,Q Strep-tag II peptide SAWSHPQFEK 10 T 1.8 PqqA pdbhh F T 6tj1 1 A,B,C A,B,C De novo designed WSHC6 MGSSHHHHHHSSGLVPRGSHMTEDEIRKLRKLLEEAEKKLYKLEDKTRRSEEISKTDDDPKAQSLQLIAESLMLIAESLLIIAISLLLSSRNG 93 T 0.01 Halogen_Hydrol pdb F T 6tj1 2 D D purification tag AALAAA 6 T 150 DUF4699 pdbhh F F 6tj3 1 A A Q8IJM4_PLAF7 PfELC MASDMEEKFREAFILFSSCSDHIEMYKFFELMNSFGIILTNDEKAALPNDINMDYWLNFAKKHYNYEQPFKHIN 74 T 0.024 Na_Ca_ex_C unppercent F Eukaryota T 6tj4 1 A,B A,B Q8IJM4_PLAF7 PfELC SMASDMEEKFREAFILFSSCSDHIEMYKFFELMNSFGIILTNDEKAALPNDINMDYWLNFAKKHYNYEQPFKHIN 75 T 0.024 Na_Ca_ex_C unppercent F Eukaryota T 6tj5 3 C C MYOA_TOXGO MYOA,TGM-A GAMASSWEPLVSVLEAYYAGRRHKKQLLKKTPFIIRAQAHIRRHLV 46 T 0.00015 IQ unppssm F Eukaryota T 6tj6 3 C C MYOA_TOXGO Myosin A GAMASSWEPLVSVLEAYYAGRRHKKQLLKKTPFIIRAQAHIRRHLV 46 T 0.00015 IQ unppssm F Eukaryota T 6tj7 3 C C MYOA_TOXGO MYOA,TGM-A SSWEPLVSVLEAYYAGRRHKKQLLKKTPFIIRAQAHIRRHLV 42 T 0.00015 IQ unppssm F Eukaryota T 6tjt 2 C,D D,F VEGFC_HUMAN VEGFC C terminal peptide XSIIRR 6 T 88 T3SS_needle_F pdbhh F Eukaryota F 6tka 2 B BBB HCF-1 pro-repeat 2 (11-26) THETGTTNTATTATSN 16 T 54 Ice_nucleation pdbhh F T 6tkg 3 C I TTI_GLOMM Tsetse thrombin inhibitor MKFFTVLFFLLSIIYLIVAAPGEPGAPIDYDEYGDSSEEVGGTPLHEIPGIRL 53 T 0.048 Cytochrom_B558a unppssm F Eukaryota T 6tkh 3 C I TTI_GLOMM Tsetse thrombin inhibitor MKFFTVLFFLLSIIYLIVAAPGEPGAPIDYDEYGDSSEEVGGTPLHEIPGIRL 53 T 0.048 Cytochrom_B558a unppssm F Eukaryota T 6tki 3 C I TTI_GLOMM Tsetse thrombin inhibitor MKFFTVLFFLLSIIYLIVAAPGEPGAPIDYDEYGDSSEEVGGTPLHEIPGIRL 53 T 0.048 Cytochrom_B558a unppssm F Eukaryota T 6tkj 3 C I TTI_GLOMM Tsetse thrombin inhibitor MKFFTVLFFLLSIIYLIVAAPGEPGAPIDYDEYGDSSEEVGGTPLHEIPGIRL 53 T 0.048 Cytochrom_B558a unppssm F Eukaryota T 6tkk 2 B B VEGFB_HUMAN ACE-ARG-PRO-GLN-PRO-ARG XRPQPR 6 T 110 DUF6121 pdbhh F Eukaryota F 6tkl 3 C I TTI_GLOMM Tsetse thrombin inhibitor MKFFTVLFFLLSIIYLIVAAPGEPGAPIDYDEYGDSSEEVGGTPLHEIPGIRX 53 T 0.048 Cytochrom_B558a unppssm F Eukaryota T 6tkt 1 A A Q53805_9ACTN Pre-phenomycin GAMANPKTIKAAAYNQARSTLADAGSRTAAKSHPIHGKTDVPVSYGTSLLAAARDEFRQADKKLPAKDKKSDMSIAHYNAVHSAAKTMGIDTW 93 T 0.85 DUF6388 pdbhh F Bacteria T 6tlx 1 A A A0A0L1KLX8_9EUGL Protein kinase GSSQKFSTSPSPTLDDGLDRIKCPKKHGMKLLRAFPKLNDTAGGTSDYGWGFWCDRCHKEVPALIKSKKRISKAQDERTHAPEENTFFYHCHCGYDLCKACGASIIHASNTLKENYSTELKNLAACFSTPS 131 T 0.0031 zf-RING_4 pdbpercent F Eukaryota T 6tm5 14 Q T Apc1 loop AAAAAQLAAAAAAAA 15 T 55 DUF4699 pdbhh F F 6tmg 1 A,Y q,Q A0A125YPS4_TOXGG ATPTG11 MVRNQRYPASPVQEIFLPEPVPFVQFDQTAPSPNSPPAPLPSPSLSQCEEQKDRYRDISSMFHRGVAGAEQVREAYNSMAKCFRRVSVAEVLESDPAFRQARNFTMDLKQAEDDQRYKQLQYGRVPSILTKYHL 134 T 15 Chor_lyase unphh F Eukaryota T 6tmg 2 B,Z i,I S7VXW3_TOXGG ATPTG7 MPSSSSEDAQGGNRFECVSNSTSPRRKNATKDEAACLQPRRSAVSGPREDVLCIRTTPQPHVRRGKSGPGRRKRMRFGRERERRDKKRGEGERKRTRFPFLRLHIEGGNANSRRPLCFPSRHSLLRNHYGSLSMAFRKVSPPKAPMSVFEARSSFLDLEQCARAAGPQRWEAECQGVRQRALQAAADVMSRECGAYGDSFFQCYRHGFRLEACQGEKATMQLLRCQRMVADRLVPL 236 T 0.068 FokI_N pdbpercent F Eukaryota T 6tmg 3 AA,C T,t A0A125YLH9_TOXGG ATPTG14 MPAPAASGAAAVLSKDIARSFRWMQAFAAVKGKPTAGSCAAGTAVVNPEDPTKVTLKGRYTNFSLQHIWEKYDYLQTHLLLRECMLSQVAKNPRLLDPEINAGLTPTVFMRVPPETQDPETQAKAAPQKGQAN 133 T 0.16 DUF5106 pdbpercent F Eukaryota T 6tmg 4 BA,D G,g S7WD71_TOXGG ATPTG5 MQNGVFTRENADFLVKSGADSPSSQSLLLRTSPSPLSLPRRRFIFLRSASVDLSERSSLACLAPFFCLASGVCLRSAFSLPFFARRGRPCLFFIFIFFFRVSFTANFRGKRVKMAASTIPISQWPSLLYAPPSSPANPAVEALPEMQFDDLHYPRQMLLCRGAGYSLEQCNRMAQPDARVTPENPAEKLLKEEAVAAIACLSQREGGKDEQCRYYIERMYKLANKEKQPEPGTLSKASTLACKLLGIHRPEA 252 T 0.027 CHCH pdbpssm F Eukaryota T 6tmg 6 DA,F K,k A0A125YSI9_TOXGG subunit a MAAGSRFPFCTAARLSSRGTLPRLGEATFFAGAESQRSAGAFAKTLQRPFLRAPSTQLFPVGNRLGVSSARALVANAMEPRRFFAAAASAKATHALQPTGTGSVAFTRPGQGSNAQFQTSLADKTRGLLGVGFLRPTKMASFAATFLLNFRFYFMYMARTTFQAVRPLLAFSVFGEVMKLVLATMSSGLFSFLFSFVLAFEVFYFFLQCYISYTFLTMFFTVLF 224 T 53 DUF5090 unphh F Eukaryota T 6tmg 7 EA,G J,j S7UQ82_TOXGG subunit i/j MGLSPAFAATAGCRLASPVANSSRFLSLLRLSRPRLNAAAPAAEAAKTLERNVPMKEILQPLWVVEPPNFLRQPVWKQFWEAQFANRSFFFFGNAWTSAAAFAFFIWWSRVFDPPPKERLDRYWLNSPKFRILSAFHNPGKRPGLKISLMTYEARYCYRGLDHPFTLNEMKDFLFKLREQYLVNKYEGIQFPFVFRQFNRVSTPGTLEVHTSPALQQQPHFHEEAAGHH 229 T 3.7 YokU pdbhh F Eukaryota T 6tmg 8 FA,H S,s A0A125YLN4_TOXGG ATPTG13 MSWATRLLRMSSPRLGLLPLGRSVKLGGAKERVSFSQFFDSEYFWTKANVGPFFLFLFTSPFWYQGIKTVYASCRYRKLNEREIISDRYTWLHERMLEDEVERVLLEQVPAGGFDKTRPGLLLGPSTL 128 T 0.24 DUF5378 pdbpercent F Eukaryota T 6tmg 9 GA,I U,u A0A125YRP0_TOXGG ATPTG15 MATPPLQDGAPTNGGAATKPSCGARLQNFARMAIKGPSVPHSILFGVGAGCCAYAGYYLYRAMRLTFFDTESVALQSRLRYAEKQKLFHQELDRELAAGHIASLVAEYDPVATRLPFQPMQDRYRV 126 T 3.2 DUF3067 unphh F Eukaryota T 6tmg 10 HA,J H,h A0A125YL08_TOXGG ATPTG6 MAETREGGQSGAASILGAEAFPELLSKVPLNPQMDEDKHFNKYKWGNEPIPVNRRTGSRMNSSIYDNRNHEAVRHPWSTDARTFHPNDNPEADRINTQYSNMVSDSFPEGGFSDAPRFSSNWERLLAYHHGLYSPEKFNSTTKTADEIRLAVNDFAAKVHADDPKNACKYLMIEEFKCLQSAQARIDPQGAATKCVKWFNEWRQCAWDQEKMVKGYNYIEDRRARKHKPYIGAPDLQYS 239 T 1.8 DnaJ pdb F Eukaryota T 6tmg 11 IA,K E,e A0A125YLR0_TOXGG ATPTG3 MGEKQEEEGEEEKEGKGEGGGEGGREEEDEDGSAPVVSWLRIVEERECHEETDEAPETKIALPFSAQRSSRGFEARQVEVLVSANSAFLLSVLLASLFLSSSLPSFCPPRFLLSVLLALFKDKMAGDAPAAAAAPQQAGRTASASGVRTPGYLDLVGHSLKATSMDHGMQYSSIYWETSHRTYLPFWASLTQKFSWKIMDDQIRSFLRLPKPVTTEPFVFSSGSPYIRRYFGDADISVPVPLHAPAHFAFVPTGTVSPWEETGMETGPQGAAARGAAATAFRAVLESAWKCDIDEQIKEKLHSRAGAGAFHASGSTGGCPIPTDF 325 T 4.7 DUF512 unphh F Eukaryota T 6tmg 12 JA,L X,x S7W180_TOXGG ATPTG17 MSTSPGLAFANLTLLLDVPQLPAIWAVNAWRELNGLFTEMKTLAGTSDLLYPSNRYNPQNEKTNRMGRPRKYNHGEWMFGNSY 83 T 4.8 AT_hook pdbhh F Eukaryota T 6tmg 13 KA,M B,b S7V2T0_TOXGG subunit b MNFSSSARWLAVRQSQTLGHTTRATVAAGRRVLAHSPAATEFTSFQSLHIGGDVCKLPLAVALGAAPSALGYGSAKHNQQRQYATLGSGWSFSKVQYTKYRITKPWTTDTTFDDIILSQPSKEDFAKFTKEAPLFLRFLKLVTDVEGRQEAFIQFAKRCENGLTVEKDVYVTKKELVDCLWKNGYTDTEINAFEIAFPADYKFHYPELAVLFDLTEEDCYKYCIRQRAATPEELVELKYTKPKNLVSSYGLCFLGVWFGLSNTVLSNAWFYSKTFPFGAVFYMLGSYFYRDIREKLWKEEKSLIHTAQENKNMGEESVYKQMKKYATDTKCLDYLSTFRTEVEDQIANYKVALVSQMRRQLTERLVEKLNGIQQAEKLIQGSLQDVMIREIVSSFKDLYKSRPELHDAAMQSAIQGLSGSDGAMDPVGAHFKASLQELAKVNLSTATADPMGTVVQRVAAVFQKREKEFLDTFTVKATEAQEIKTIVDKCHKGNTFDFHALSDEELRRLEQLYSTVNNRVGFETIHENSIKPVAPLSENSKGFVEFVNTQLEITKAKLRNARLTAFAHAFV 571 T 0.00013 Mt_ATP-synt_B pdbpercent F Eukaryota T 6tmg 14 LA,N R,r A0A125YKF7_TOXGG ATPTG12 MLNFIPKRCPSVSLLFGKRPVQRIEVGQARHQLEIPVETIEKIYEGVDSRLEYHNKDYNAMKWKDFMKLKLDAYHLLEASQSETAAKSALSDLNWFSDLADIYSGQQTMAEMDVALKAQGEQKLSYPIQGKNIK 134 T 12 DUF4416 pdbhh F Eukaryota T 6tmg 15 MA,O P,p A0A125YMA7_TOXGG ATPTG10 MSPPTASASVASSGSSPHMDRLLGDLKLLAAYDSAAGWQEPKAMESAFQSLSWDDADVLKALPQYLNCRGEQKRRVDFAYAALCPRPVDEKDPKQTLMSLWMKARLFSYDQKHPFVLSPFAATDKSTSAGAMTAEKPF 138 T 9.6 DUF6103 pdbhh F Eukaryota T 6tmg 16 NA,P V,v S7UQT7_TOXGG subunit f MGFHFQQYIAMAGRAINPVQWTRAWRRMEGKSATEVYRDALAWTNNQFAQISRASQYRAWWWQNPLGMGLVLYGTYKAWHMIYMVRKQKKTAQLVAAAYGQGGQWLNPVPR 111 T 0.11 DUF4468 unp F Eukaryota T 6tmg 17 OA,Q L,l S7W7F1_TOXGG ATPTG8 MTALPPPPSANVAVSFTAAPAEPLSRGEVKAASLKLELQNIERELKDWWMSRKILRDRNIGLFNLLQHHNFAGLSVNNAKLSDSQRVMWTDLVQGKPDVEDKLSVDAREMKVDMYEKLFKQAADLENPCRMPGVAYLRCLRDTLTETQSARRSSCLNAFSSFDACRTGLLKQQSAAVENSLVRQNMADVRAKALFERRAVLLDLVEGK 208 T 0.12 CHCH pdbhh F Eukaryota T 6tmg 19 QA,S D,d A0A125YV76_TOXGG ATPTG2 MSPVGRLFLGSKLPAQTWQSFRLQPALPQFAQKRFFSGGAAKPSWHVAREHRFGPTLPDHAYYGEHATYNYFVLFIRGMRPYLEKIFGDCASTIKNAAVAVYRPVNAFVVKHNPDLRLQFVAFASFIATHMAITKEFNDMYQRLVDITSLLELQAAQLHASEGFWDSESEQQEARLQRHAEHRNDLETTWEEALREATLARNFDVLVSYLNHGTSDGCGEHGACGHSGQNGIPPSVTWNFNAMPYGKENPDTKTFPIPDHEQPYRAFSLGFTANNLSGNWGDYIDRQDNKNALMRPARMMFTDVFIPTTK 310 T 10 PerC unphh F Eukaryota T 6tmg 20 RA,T M,m A0A125YPQ4_TOXGG subunit 8 MNTFFLTPAAAAARRVAVSFFARSSASGFPQHRVALRPFPSQRPAERAHNLAKSQTLRSVKAHGRQSGKKEQSTESGGRRGFRAAVGAGTGCMLAASPMLFTDYDNTASPKSELIFMAGNALGYCTERFFENEYGQSIFMFALGLAYLAMLGHEGKIHGAVWRMKHLFATNFKMVGHPRYAYALPKNPLLQDAAPTKTGSTSAKK 205 T 27 TMEM132D_C unphh F Eukaryota T 6tmg 23 UA,W W,w S7VTI0_TOXGG ATPTG16 MPFMWRQRAYCAPVPSAFASQQPNGLGGEAGVRKPLLRSNSESLSVFSQIPDGLLGHTTSVTMGNSDIFFLPKPSNLLKIALPAFVFMPNLTIFTRAFPFYAHTSA 106 T 7.1 DUF3561 pdbhh F Eukaryota T 6tmh 1 A i A0A125YJP2_TOXGG Inhibitor of F1 MSSPCCVAIRRVARTTLESGRRQVDSKSTDVSPFFTGTQQMSLPSAGMVTKIRNFSSVKFMDQKRSGEETVYFKKEDEALLRNLLANHPEYDPKYSVDHMNAEVGSIARDITLACQKHGMKDPSAAFMKDLISIFGAHGYAKNSK 145 T 1.4 DUF3223 pdbhh F Eukaryota T 6tmi 4 D R A0A125YKF7_TOXGG ATPTG12 MLNFIPKRCPSVSLLFGKRPVQRIEVGQARHQLEIPVETIEKIYEGVDSRLEYHNKDYNAMKWKDFMKLKLDAYHLLEASQSETAAKSALSDLNWFSDLADIYSGQQTMAEMDVALKAQGEQKLSYPIQGKNIK 134 T 12 DUF4416 pdbhh F Eukaryota T 6tmi 5 E B S7V2T0_TOXGG subunit b MNFSSSARWLAVRQSQTLGHTTRATVAAGRRVLAHSPAATEFTSFQSLHIGGDVCKLPLAVALGAAPSALGYGSAKHNQQRQYATLGSGWSFSKVQYTKYRITKPWTTDTTFDDIILSQPSKEDFAKFTKEAPLFLRFLKLVTDVEGRQEAFIQFAKRCENGLTVEKDVYVTKKELVDCLWKNGYTDTEINAFEIAFPADYKFHYPELAVLFDLTEEDCYKYCIRQRAATPEELVELKYTKPKNLVSSYGLCFLGVWFGLSNTVLSNAWFYSKTFPFGAVFYMLGSYFYRDIREKLWKEEKSLIHTAQENKNMGEESVYKQMKKYATDTKCLDYLSTFRTEVEDQIANYKVALVSQMRRQLTERLVEKLNGIQQAEKLIQGSLQDVMIREIVSSFKDLYKSRPELHDAAMQSAIQGLSGSDGAMDPVGAHFKASLQELAKVNLSTATADPMGTVVQRVAAVFQKREKEFLDTFTVKATEAQEIKTIVDKCHKGNTFDFHALSDEELRRLEQLYSTVNNRVGFETIHENSIKPVAPLSENSKGFVEFVNTQLEITKAKLRNARLTAFAHAFV 571 T 0.00013 Mt_ATP-synt_B pdbpercent F Eukaryota T 6tmj 1 A Q A0A125YPS4_TOXGG ATPTG11 MVRNQRYPASPVQEIFLPEPVPFVQFDQTAPSPNSPPAPLPSPSLSQCEEQKDRYRDISSMFHRGVAGAEQVREAYNSMAKCFRRVSVAEVLESDPAFRQARNFTMDLKQAEDDQRYKQLQYGRVPSILTKYHL 134 T 15 Chor_lyase unphh F Eukaryota T 6tmj 2 B K A0A125YSI9_TOXGG subunit a MAAGSRFPFCTAARLSSRGTLPRLGEATFFAGAESQRSAGAFAKTLQRPFLRAPSTQLFPVGNRLGVSSARALVANAMEPRRFFAAAASAKATHALQPTGTGSVAFTRPGQGSNAQFQTSLADKTRGLLGVGFLRPTKMASFAATFLLNFRFYFMYMARTTFQAVRPLLAFSVFGEVMKLVLATMSSGLFSFLFSFVLAFEVFYFFLQCYISYTFLTMFFTVLF 224 T 53 DUF5090 unphh F Eukaryota T 6tmk 1 A,Y q,Q A0A125YPS4_TOXGG ATPTG11 MVRNQRYPASPVQEIFLPEPVPFVQFDQTAPSPNSPPAPLPSPSLSQCEEQKDRYRDISSMFHRGVAGAEQVREAYNSMAKCFRRVSVAEVLESDPAFRQARNFTMDLKQAEDDQRYKQLQYGRVPSILTKYHL 134 T 15 Chor_lyase unphh F Eukaryota T 6tmk 2 B,Z i,I S7VXW3_TOXGG ATPTG7 MPSSSSEDAQGGNRFECVSNSTSPRRKNATKDEAACLQPRRSAVSGPREDVLCIRTTPQPHVRRGKSGPGRRKRMRFGRERERRDKKRGEGERKRTRFPFLRLHIEGGNANSRRPLCFPSRHSLLRNHYGSLSMAFRKVSPPKAPMSVFEARSSFLDLEQCARAAGPQRWEAECQGVRQRALQAAADVMSRECGAYGDSFFQCYRHGFRLEACQGEKATMQLLRCQRMVADRLVPL 236 T 0.068 FokI_N pdbpercent F Eukaryota T 6tmk 3 AA,C T,t A0A125YLH9_TOXGG ATPTG14 MPAPAASGAAAVLSKDIARSFRWMQAFAAVKGKPTAGSCAAGTAVVNPEDPTKVTLKGRYTNFSLQHIWEKYDYLQTHLLLRECMLSQVAKNPRLLDPEINAGLTPTVFMRVPPETQDPETQAKAAPQKGQAN 133 T 0.16 DUF5106 pdbpercent F Eukaryota T 6tmk 4 BA,D G,g S7WD71_TOXGG ATPTG5 MQNGVFTRENADFLVKSGADSPSSQSLLLRTSPSPLSLPRRRFIFLRSASVDLSERSSLACLAPFFCLASGVCLRSAFSLPFFARRGRPCLFFIFIFFFRVSFTANFRGKRVKMAASTIPISQWPSLLYAPPSSPANPAVEALPEMQFDDLHYPRQMLLCRGAGYSLEQCNRMAQPDARVTPENPAEKLLKEEAVAAIACLSQREGGKDEQCRYYIERMYKLANKEKQPEPGTLSKASTLACKLLGIHRPEA 252 T 0.027 CHCH pdbpssm F Eukaryota T 6tmk 6 DA,F K,k A0A125YSI9_TOXGG subunit a MAAGSRFPFCTAARLSSRGTLPRLGEATFFAGAESQRSAGAFAKTLQRPFLRAPSTQLFPVGNRLGVSSARALVANAMEPRRFFAAAASAKATHALQPTGTGSVAFTRPGQGSNAQFQTSLADKTRGLLGVGFLRPTKMASFAATFLLNFRFYFMYMARTTFQAVRPLLAFSVFGEVMKLVLATMSSGLFSFLFSFVLAFEVFYFFLQCYISYTFLTMFFTVLF 224 T 53 DUF5090 unphh F Eukaryota T 6tmk 7 EA,G J,j S7UQ82_TOXGG subunit i/j MGLSPAFAATAGCRLASPVANSSRFLSLLRLSRPRLNAAAPAAEAAKTLERNVPMKEILQPLWVVEPPNFLRQPVWKQFWEAQFANRSFFFFGNAWTSAAAFAFFIWWSRVFDPPPKERLDRYWLNSPKFRILSAFHNPGKRPGLKISLMTYEARYCYRGLDHPFTLNEMKDFLFKLREQYLVNKYEGIQFPFVFRQFNRVSTPGTLEVHTSPALQQQPHFHEEAAGHH 229 T 3.7 YokU pdbhh F Eukaryota T 6tmk 8 FA,H S,s A0A125YLN4_TOXGG ATPTG13 MSWATRLLRMSSPRLGLLPLGRSVKLGGAKERVSFSQFFDSEYFWTKANVGPFFLFLFTSPFWYQGIKTVYASCRYRKLNEREIISDRYTWLHERMLEDEVERVLLEQVPAGGFDKTRPGLLLGPSTL 128 T 0.24 DUF5378 pdbpercent F Eukaryota T 6tmk 9 GA,I U,u A0A125YRP0_TOXGG ATPTG15 MATPPLQDGAPTNGGAATKPSCGARLQNFARMAIKGPSVPHSILFGVGAGCCAYAGYYLYRAMRLTFFDTESVALQSRLRYAEKQKLFHQELDRELAAGHIASLVAEYDPVATRLPFQPMQDRYRV 126 T 3.2 DUF3067 unphh F Eukaryota T 6tmk 10 HA,J H,h A0A125YL08_TOXGG ATPTG6 MAETREGGQSGAASILGAEAFPELLSKVPLNPQMDEDKHFNKYKWGNEPIPVNRRTGSRMNSSIYDNRNHEAVRHPWSTDARTFHPNDNPEADRINTQYSNMVSDSFPEGGFSDAPRFSSNWERLLAYHHGLYSPEKFNSTTKTADEIRLAVNDFAAKVHADDPKNACKYLMIEEFKCLQSAQARIDPQGAATKCVKWFNEWRQCAWDQEKMVKGYNYIEDRRARKHKPYIGAPDLQYS 239 T 1.8 DnaJ pdb F Eukaryota T 6tmk 11 IA,K E,e A0A125YLR0_TOXGG ATPTG3 MGEKQEEEGEEEKEGKGEGGGEGGREEEDEDGSAPVVSWLRIVEERECHEETDEAPETKIALPFSAQRSSRGFEARQVEVLVSANSAFLLSVLLASLFLSSSLPSFCPPRFLLSVLLALFKDKMAGDAPAAAAAPQQAGRTASASGVRTPGYLDLVGHSLKATSMDHGMQYSSIYWETSHRTYLPFWASLTQKFSWKIMDDQIRSFLRLPKPVTTEPFVFSSGSPYIRRYFGDADISVPVPLHAPAHFAFVPTGTVSPWEETGMETGPQGAAARGAAATAFRAVLESAWKCDIDEQIKEKLHSRAGAGAFHASGSTGGCPIPTDF 325 T 4.7 DUF512 unphh F Eukaryota T 6tmk 12 JA,L X,x S7W180_TOXGG ATPTG17 MSTSPGLAFANLTLLLDVPQLPAIWAVNAWRELNGLFTEMKTLAGTSDLLYPSNRYNPQNEKTNRMGRPRKYNHGEWMFGNSY 83 T 4.8 AT_hook pdbhh F Eukaryota T 6tmk 13 KA,M B,b S7V2T0_TOXGG subunit b MNFSSSARWLAVRQSQTLGHTTRATVAAGRRVLAHSPAATEFTSFQSLHIGGDVCKLPLAVALGAAPSALGYGSAKHNQQRQYATLGSGWSFSKVQYTKYRITKPWTTDTTFDDIILSQPSKEDFAKFTKEAPLFLRFLKLVTDVEGRQEAFIQFAKRCENGLTVEKDVYVTKKELVDCLWKNGYTDTEINAFEIAFPADYKFHYPELAVLFDLTEEDCYKYCIRQRAATPEELVELKYTKPKNLVSSYGLCFLGVWFGLSNTVLSNAWFYSKTFPFGAVFYMLGSYFYRDIREKLWKEEKSLIHTAQENKNMGEESVYKQMKKYATDTKCLDYLSTFRTEVEDQIANYKVALVSQMRRQLTERLVEKLNGIQQAEKLIQGSLQDVMIREIVSSFKDLYKSRPELHDAAMQSAIQGLSGSDGAMDPVGAHFKASLQELAKVNLSTATADPMGTVVQRVAAVFQKREKEFLDTFTVKATEAQEIKTIVDKCHKGNTFDFHALSDEELRRLEQLYSTVNNRVGFETIHENSIKPVAPLSENSKGFVEFVNTQLEITKAKLRNARLTAFAHAFV 571 T 0.00013 Mt_ATP-synt_B pdbpercent F Eukaryota T 6tmk 14 LA,N R,r A0A125YKF7_TOXGG ATPTG12 MLNFIPKRCPSVSLLFGKRPVQRIEVGQARHQLEIPVETIEKIYEGVDSRLEYHNKDYNAMKWKDFMKLKLDAYHLLEASQSETAAKSALSDLNWFSDLADIYSGQQTMAEMDVALKAQGEQKLSYPIQGKNIK 134 T 12 DUF4416 pdbhh F Eukaryota T 6tmk 15 MA,O P,p A0A125YMA7_TOXGG ATPTG10 MSPPTASASVASSGSSPHMDRLLGDLKLLAAYDSAAGWQEPKAMESAFQSLSWDDADVLKALPQYLNCRGEQKRRVDFAYAALCPRPVDEKDPKQTLMSLWMKARLFSYDQKHPFVLSPFAATDKSTSAGAMTAEKPF 138 T 9.6 DUF6103 pdbhh F Eukaryota T 6tmk 16 NA,P V,v S7UQT7_TOXGG subunit f MGFHFQQYIAMAGRAINPVQWTRAWRRMEGKSATEVYRDALAWTNNQFAQISRASQYRAWWWQNPLGMGLVLYGTYKAWHMIYMVRKQKKTAQLVAAAYGQGGQWLNPVPR 111 T 0.11 DUF4468 unp F Eukaryota T 6tmk 17 OA,Q L,l S7W7F1_TOXGG ATPTG8 MTALPPPPSANVAVSFTAAPAEPLSRGEVKAASLKLELQNIERELKDWWMSRKILRDRNIGLFNLLQHHNFAGLSVNNAKLSDSQRVMWTDLVQGKPDVEDKLSVDAREMKVDMYEKLFKQAADLENPCRMPGVAYLRCLRDTLTETQSARRSSCLNAFSSFDACRTGLLKQQSAAVENSLVRQNMADVRAKALFERRAVLLDLVEGK 208 T 0.12 CHCH pdbhh F Eukaryota T 6tmk 19 QA,S D,d A0A125YV76_TOXGG ATPTG2 MSPVGRLFLGSKLPAQTWQSFRLQPALPQFAQKRFFSGGAAKPSWHVAREHRFGPTLPDHAYYGEHATYNYFVLFIRGMRPYLEKIFGDCASTIKNAAVAVYRPVNAFVVKHNPDLRLQFVAFASFIATHMAITKEFNDMYQRLVDITSLLELQAAQLHASEGFWDSESEQQEARLQRHAEHRNDLETTWEEALREATLARNFDVLVSYLNHGTSDGCGEHGACGHSGQNGIPPSVTWNFNAMPYGKENPDTKTFPIPDHEQPYRAFSLGFTANNLSGNWGDYIDRQDNKNALMRPARMMFTDVFIPTTK 310 T 10 PerC unphh F Eukaryota T 6tmk 20 RA,T M,m A0A125YPQ4_TOXGG subunit 8 MNTFFLTPAAAAARRVAVSFFARSSASGFPQHRVALRPFPSQRPAERAHNLAKSQTLRSVKAHGRQSGKKEQSTESGGRRGFRAAVGAGTGCMLAASPMLFTDYDNTASPKSELIFMAGNALGYCTERFFENEYGQSIFMFALGLAYLAMLGHEGKIHGAVWRMKHLFATNFKMVGHPRYAYALPKNPLLQDAAPTKTGSTSAKK 205 T 27 TMEM132D_C unphh F Eukaryota T 6tmk 23 UA,W W,w S7VTI0_TOXGG ATPTG16 MPFMWRQRAYCAPVPSAFASQQPNGLGGEAGVRKPLLRSNSESLSVFSQIPDGLLGHTTSVTMGNSDIFFLPKPSNLLKIALPAFVFMPNLTIFTRAFPFYAHTSA 106 T 7.1 DUF3561 pdbhh F Eukaryota T 6tmk 25 RB,WA i1,i2 A0A125YJP2_TOXGG Inhibitor of F1 MSSPCCVAIRRVARTTLESGRRQVDSKSTDVSPFFTGTQQMSLPSAGMVTKIRNFSSVKFMDQKRSGEETVYFKKEDEALLRNLLANHPEYDPKYSVDHMNAEVGSIARDITLACQKHGMKDPSAAFMKDLISIFGAHGYAKNSK 145 T 1.4 DUF3223 pdbhh F Eukaryota T 6tml 1 A,KD,MC,WG,Y,YF q7,Q8,q8,Q9,Q7,q9 A0A125YPS4_TOXGG ATPTG11 MVRNQRYPASPVQEIFLPEPVPFVQFDQTAPSPNSPPAPLPSPSLSQCEEQKDRYRDISSMFHRGVAGAEQVREAYNSMAKCFRRVSVAEVLESDPAFRQARNFTMDLKQAEDDQRYKQLQYGRVPSILTKYHL 134 T 15 Chor_lyase unphh F Eukaryota T 6tml 2 B,LD,NC,XG,Z,ZF i7,I8,i8,I9,I7,i9 S7VXW3_TOXGG ATPTG7 MPSSSSEDAQGGNRFECVSNSTSPRRKNATKDEAACLQPRRSAVSGPREDVLCIRTTPQPHVRRGKSGPGRRKRMRFGRERERRDKKRGEGERKRTRFPFLRLHIEGGNANSRRPLCFPSRHSLLRNHYGSLSMAFRKVSPPKAPMSVFEARSSFLDLEQCARAAGPQRWEAECQGVRQRALQAAADVMSRECGAYGDSFFQCYRHGFRLEACQGEKATMQLLRCQRMVADRLVPL 236 T 0.068 FokI_N pdbpercent F Eukaryota T 6tml 3 AA,AG,C,MD,OC,YG T7,t9,t7,T8,t8,T9 A0A125YLH9_TOXGG ATPTG14 MPAPAASGAAAVLSKDIARSFRWMQAFAAVKGKPTAGSCAAGTAVVNPEDPTKVTLKGRYTNFSLQHIWEKYDYLQTHLLLRECMLSQVAKNPRLLDPEINAGLTPTVFMRVPPETQDPETQAKAAPQKGQAN 133 T 0.16 DUF5106 pdbpercent F Eukaryota T 6tml 4 BA,BG,D,ND,PC,ZG G7,g9,g7,G8,g8,G9 S7WD71_TOXGG ATPTG5 MQNGVFTRENADFLVKSGADSPSSQSLLLRTSPSPLSLPRRRFIFLRSASVDLSERSSLACLAPFFCLASGVCLRSAFSLPFFARRGRPCLFFIFIFFFRVSFTANFRGKRVKMAASTIPISQWPSLLYAPPSSPANPAVEALPEMQFDDLHYPRQMLLCRGAGYSLEQCNRMAQPDARVTPENPAEKLLKEEAVAAIACLSQREGGKDEQCRYYIERMYKLANKEKQPEPGTLSKASTLACKLLGIHRPEA 252 T 0.027 CHCH pdbpssm F Eukaryota T 6tml 6 BH,DA,DG,F,PD,RC K9,K7,k9,k7,K8,k8 A0A125YSI9_TOXGG subunit a MAAGSRFPFCTAARLSSRGTLPRLGEATFFAGAESQRSAGAFAKTLQRPFLRAPSTQLFPVGNRLGVSSARALVANAMEPRRFFAAAASAKATHALQPTGTGSVAFTRPGQGSNAQFQTSLADKTRGLLGVGFLRPTKMASFAATFLLNFRFYFMYMARTTFQAVRPLLAFSVFGEVMKLVLATMSSGLFSFLFSFVLAFEVFYFFLQCYISYTFLTMFFTVLF 224 T 53 DUF5090 unphh F Eukaryota T 6tml 7 CH,EA,EG,G,QD,SC J9,J7,j9,j7,J8,j8 S7UQ82_TOXGG subunit i/j MGLSPAFAATAGCRLASPVANSSRFLSLLRLSRPRLNAAAPAAEAAKTLERNVPMKEILQPLWVVEPPNFLRQPVWKQFWEAQFANRSFFFFGNAWTSAAAFAFFIWWSRVFDPPPKERLDRYWLNSPKFRILSAFHNPGKRPGLKISLMTYEARYCYRGLDHPFTLNEMKDFLFKLREQYLVNKYEGIQFPFVFRQFNRVSTPGTLEVHTSPALQQQPHFHEEAAGHH 229 T 3.7 YokU pdbhh F Eukaryota T 6tml 8 DH,FA,FG,H,RD,TC S9,S7,s9,s7,S8,s8 A0A125YLN4_TOXGG ATPTG13 MSWATRLLRMSSPRLGLLPLGRSVKLGGAKERVSFSQFFDSEYFWTKANVGPFFLFLFTSPFWYQGIKTVYASCRYRKLNEREIISDRYTWLHERMLEDEVERVLLEQVPAGGFDKTRPGLLLGPSTL 128 T 0.24 DUF5378 pdbpercent F Eukaryota T 6tml 9 EH,GA,GG,I,SD,UC U9,U7,u9,u7,U8,u8 A0A125YRP0_TOXGG ATPTG15 MATPPLQDGAPTNGGAATKPSCGARLQNFARMAIKGPSVPHSILFGVGAGCCAYAGYYLYRAMRLTFFDTESVALQSRLRYAEKQKLFHQELDRELAAGHIASLVAEYDPVATRLPFQPMQDRYRV 126 T 3.2 DUF3067 unphh F Eukaryota T 6tml 10 FH,HA,HG,J,TD,VC H9,H7,h9,h7,H8,h8 A0A125YL08_TOXGG ATPTG6 MAETREGGQSGAASILGAEAFPELLSKVPLNPQMDEDKHFNKYKWGNEPIPVNRRTGSRMNSSIYDNRNHEAVRHPWSTDARTFHPNDNPEADRINTQYSNMVSDSFPEGGFSDAPRFSSNWERLLAYHHGLYSPEKFNSTTKTADEIRLAVNDFAAKVHADDPKNACKYLMIEEFKCLQSAQARIDPQGAATKCVKWFNEWRQCAWDQEKMVKGYNYIEDRRARKHKPYIGAPDLQYS 239 T 1.8 DnaJ pdb F Eukaryota T 6tml 11 GH,IA,IG,K,UD,WC E9,E7,e9,e7,E8,e8 A0A125YLR0_TOXGG ATPTG3 MGEKQEEEGEEEKEGKGEGGGEGGREEEDEDGSAPVVSWLRIVEERECHEETDEAPETKIALPFSAQRSSRGFEARQVEVLVSANSAFLLSVLLASLFLSSSLPSFCPPRFLLSVLLALFKDKMAGDAPAAAAAPQQAGRTASASGVRTPGYLDLVGHSLKATSMDHGMQYSSIYWETSHRTYLPFWASLTQKFSWKIMDDQIRSFLRLPKPVTTEPFVFSSGSPYIRRYFGDADISVPVPLHAPAHFAFVPTGTVSPWEETGMETGPQGAAARGAAATAFRAVLESAWKCDIDEQIKEKLHSRAGAGAFHASGSTGGCPIPTDF 325 T 4.7 DUF512 unphh F Eukaryota T 6tml 12 HH,JA,JG,L,VD,XC X9,X7,x9,x7,X8,x8 S7W180_TOXGG ATPTG17,ATPTG17,ATPTG17 MSTSPGLAFANLTLLLDVPQLPAIWAVNAWRELNGLFTEMKTLAGTSDLLYPSNRYNPQNEKTNRMGRPRKYNHGEWMFGNSY 83 T 4.8 AT_hook pdbhh F Eukaryota T 6tml 13 IH,KA,KG,M,WD,YC B9,B7,b9,b7,B8,b8 S7V2T0_TOXGG subunit b MNFSSSARWLAVRQSQTLGHTTRATVAAGRRVLAHSPAATEFTSFQSLHIGGDVCKLPLAVALGAAPSALGYGSAKHNQQRQYATLGSGWSFSKVQYTKYRITKPWTTDTTFDDIILSQPSKEDFAKFTKEAPLFLRFLKLVTDVEGRQEAFIQFAKRCENGLTVEKDVYVTKKELVDCLWKNGYTDTEINAFEIAFPADYKFHYPELAVLFDLTEEDCYKYCIRQRAATPEELVELKYTKPKNLVSSYGLCFLGVWFGLSNTVLSNAWFYSKTFPFGAVFYMLGSYFYRDIREKLWKEEKSLIHTAQENKNMGEESVYKQMKKYATDTKCLDYLSTFRTEVEDQIANYKVALVSQMRRQLTERLVEKLNGIQQAEKLIQGSLQDVMIREIVSSFKDLYKSRPELHDAAMQSAIQGLSGSDGAMDPVGAHFKASLQELAKVNLSTATADPMGTVVQRVAAVFQKREKEFLDTFTVKATEAQEIKTIVDKCHKGNTFDFHALSDEELRRLEQLYSTVNNRVGFETIHENSIKPVAPLSENSKGFVEFVNTQLEITKAKLRNARLTAFAHAFV 571 T 0.00013 Mt_ATP-synt_B pdbpercent F Eukaryota T 6tml 14 JH,LA,LG,N,XD,ZC R9,R7,r9,r7,R8,r8 A0A125YKF7_TOXGG ATPTG12 MLNFIPKRCPSVSLLFGKRPVQRIEVGQARHQLEIPVETIEKIYEGVDSRLEYHNKDYNAMKWKDFMKLKLDAYHLLEASQSETAAKSALSDLNWFSDLADIYSGQQTMAEMDVALKAQGEQKLSYPIQGKNIK 134 T 12 DUF4416 pdbhh F Eukaryota T 6tml 15 AD,KH,MA,MG,O,YD p8,P9,P7,p9,p7,P8 A0A125YMA7_TOXGG ATPTG10 MSPPTASASVASSGSSPHMDRLLGDLKLLAAYDSAAGWQEPKAMESAFQSLSWDDADVLKALPQYLNCRGEQKRRVDFAYAALCPRPVDEKDPKQTLMSLWMKARLFSYDQKHPFVLSPFAATDKSTSAGAMTAEKPF 138 T 9.6 DUF6103 pdbhh F Eukaryota T 6tml 16 BD,LH,NA,NG,P,ZD v8,V9,V7,v9,v7,V8 S7UQT7_TOXGG subunit f MGFHFQQYIAMAGRAINPVQWTRAWRRMEGKSATEVYRDALAWTNNQFAQISRASQYRAWWWQNPLGMGLVLYGTYKAWHMIYMVRKQKKTAQLVAAAYGQGGQWLNPVPR 111 T 0.11 DUF4468 unp F Eukaryota T 6tml 17 AE,CD,MH,OA,OG,Q L8,l8,L9,L7,l9,l7 S7W7F1_TOXGG ATPTG8 MTALPPPPSANVAVSFTAAPAEPLSRGEVKAASLKLELQNIERELKDWWMSRKILRDRNIGLFNLLQHHNFAGLSVNNAKLSDSQRVMWTDLVQGKPDVEDKLSVDAREMKVDMYEKLFKQAADLENPCRMPGVAYLRCLRDTLTETQSARRSSCLNAFSSFDACRTGLLKQQSAAVENSLVRQNMADVRAKALFERRAVLLDLVEGK 208 T 0.12 CHCH pdbhh F Eukaryota T 6tml 19 CE,ED,OH,QA,QG,S D8,d8,D9,D7,d9,d7 A0A125YV76_TOXGG ATPTG2 MSPVGRLFLGSKLPAQTWQSFRLQPALPQFAQKRFFSGGAAKPSWHVAREHRFGPTLPDHAYYGEHATYNYFVLFIRGMRPYLEKIFGDCASTIKNAAVAVYRPVNAFVVKHNPDLRLQFVAFASFIATHMAITKEFNDMYQRLVDITSLLELQAAQLHASEGFWDSESEQQEARLQRHAEHRNDLETTWEEALREATLARNFDVLVSYLNHGTSDGCGEHGACGHSGQNGIPPSVTWNFNAMPYGKENPDTKTFPIPDHEQPYRAFSLGFTANNLSGNWGDYIDRQDNKNALMRPARMMFTDVFIPTTK 310 T 10 PerC unphh F Eukaryota T 6tml 20 DE,FD,PH,RA,RG,T M8,m8,M9,M7,m9,m7 A0A125YPQ4_TOXGG subunit 8 MNTFFLTPAAAAARRVAVSFFARSSASGFPQHRVALRPFPSQRPAERAHNLAKSQTLRSVKAHGRQSGKKEQSTESGGRRGFRAAVGAGTGCMLAASPMLFTDYDNTASPKSELIFMAGNALGYCTERFFENEYGQSIFMFALGLAYLAMLGHEGKIHGAVWRMKHLFATNFKMVGHPRYAYALPKNPLLQDAAPTKTGSTSAKK 205 T 27 TMEM132D_C unphh F Eukaryota T 6tml 21 EE,GD,QH,SA,SG,U N8,n8,N9,N7,n9,n7 A0A125YUZ2_TOXGG ATPTG9 MSGDSVAPHQRAACEQLHSEYKQCLAKNGRTHFSACTDFHSKLRACENMLGTSYCIDEGINLMKCTKNPDPSFCAKEFVAMRECNRPQGPHLVLSSSPSSPPHYELRPEVKHLYNVDSTDLGSAVAPVRSKEQLDRVADSLKADLNLPGYGHIPYKWESLRPNPGA 166 T 0.0045 Cmc1 pdb F Eukaryota T 6tml 23 GE,ID,SH,UA,UG,W W8,w8,W9,W7,w9,w7 S7VTI0_TOXGG ATPTG16 MPFMWRQRAYCAPVPSAFASQQPNGLGGEAGVRKPLLRSNSESLSVFSQIPDGLLGHTTSVTMGNSDIFFLPKPSNLLKIALPAFVFMPNLTIFTRAFPFYAHTSA 106 T 7.1 DUF3561 pdbhh F Eukaryota T 6tml 26 EF,KE,QI,SB,WH,YA i4,i3,i6,i2,i5,i1 A0A125YJP2_TOXGG Inhibitor of F1 MSSPCCVAIRRVARTTLESGRRQVDSKSTDVSPFFTGTQQMSLPSAGMVTKIRNFSSVKFMDQKRSGEETVYFKKEDEALLRNLLANHPEYDPKYSVDHMNAEVGSIARDITLACQKHGMKDPSAAFMKDLISIFGAHGYAKNSK 145 T 1.4 DUF3223 pdbhh F Eukaryota T 6tms 1 A,B,C,D,E,G,H,I,J,K A,B,C,E,F,D,H,I,J,K a novel designed pore protein TEDEIRKLRKLLEEAEKKLYKLEDKTRRSEEISKTDDDPKAQSLQLIAESLMLIAESLLIIAISLLLSS 69 T 0.1 Matrilin_ccoil pdb F T 6tms 2 F,L G,L a novel designed pore protein TEDEIRKLKKLLEEAEKKLYKLEDKTRRSEEISKTDDDPKAQSLQLIAESLMLIAESLLIIAISLLLSS 69 T 0.075 Matrilin_ccoil pdb F T 6tms 3 M,N Q,R affinity purification tag HHHHHHSGLVPRGSHM 16 T 4000 zinc_ribbon_2 pdbhh F T 6tnh 1 A,B A,B G3FFN6_9CAUD Adenylosuccinate synthetase MGHHHHHHHHHHGLVPRGSHMENVDLVIDLQFGSTGKGLIAGYLAEKNGYDTVINANMPNAGHTYINAEGRKWMHKVLPNGIVSPNLKRVMLGAGSVFSINRLMEEIEMSKDLLHDKVAILIHPMATVLDEEAHKKAEVGIATSIGSTGQGSMAAMVEKLQRDPTNNTIVARDVAQYDGRIAQYVCTVEEWDMALMASERILAEGAQGFSLSLNQEFYPYCTSRDCTPARFLADMGIPLPMLNKVIGTARCHPIRVGGTSGGHYPDQEELTWEQLGQVPELTTVTKKVRRVFSFSFIQMQKAMWTCQPDEVFLNFCNYLSPMGWQDIVHQIEVAAQSRYCDAEVKYLGFGPTFNDVELREDVM 363 T 2.9999999999999996E-68 Adenylsucc_synt unppercent T Viruses T 6tno 2 B,D,F B,D,F CCG2_HUMAN Chains: B,D,F RIPSYRYR 8 T 1 TOC159_MAD pdbhh F Eukaryota F 6tnq 2 B,D,F B,D,F DLGP1_HUMAN Chains: B,D,F TSPKFRSR 8 T 0.77 ArfA pdbhh F Eukaryota T 6tnt 14 Q T UNIDENTIFIED PEPTIDE AAAAAQLAAAAAAAA 15 T 55 DUF4699 pdbhh F F 6tnu 83 EC BT 60S ribosomal protein L1 AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 210 T 17000 zf-C2H2_6 pdbhh F F 6tob 1 A A P71658_MYCTU Integration host factor MIHF GSHMVALPQLTDEQRAAALEKAAAARRARAELKDRLKRGGTNLTQVLKDAESDEVLGKMKVSALLEALPKVGKVKAQEIMTELEIAPTRRLRGLGDRQRKALLEKFGSA 109 T 0.00038 Ribosomal_S13 pdbhh F Bacteria T 6tof 2 B B ALA-TRP-VAL-ILE-PRO-ALA AWVIPA 6 T 0.39 DUF3950 pdbhh F F 6tog 2 B B ALA-TRP-VAL-ILE-PRO-ALA AWVIPA 6 T 0.39 DUF3950 pdbhh F F 6toh 2 B B ALA-TRP-VAL-ILE-PRO-ALA AWVIPA 6 T 0.39 DUF3950 pdbhh F F 6toi 2 B B ALA-TRP-VAL-ILE-PRO-ALA AWVIPA 6 T 0.39 DUF3950 pdbhh F F 6tok 2 B B ALA-TRP-VAL-ILE-PRO-ALA AWVIPA 6 T 0.39 DUF3950 pdbhh F F 6ton 2 B B ALA-TRP-VAL-ILE-PRO-ALA AWVIPA 6 T 0.39 DUF3950 pdbhh F F 6too 2 B B ALA-TRP-VAL-ILE-PRO-ALA AWVIPA 6 T 0.39 DUF3950 pdbhh F F 6tov 1 A A Teicoplanin Aglycone XXXXXXX 7 T 1.5 Defensin_2 pdbhh F F 6tq0 2 B,D,F,H,J,L,N,P B,D,F,H,J,L,N,P DLGP1_HUMAN repeat peptide 5 from GKAP MPGCFRMR 8 T 0.36 ANP pdbhh F Eukaryota T 6tqs 2 G,H,I,J,K G,H,I,J,K OSBL1_HUMAN OSBP-RELATED PROTEIN 1 GAMRSILSEDEFYDALSDSES 21 T 0.0027 DUF4298 pdb F Eukaryota T 6ts3 2 C,D C,D KCC2A_MOUSE ACE-ASN-ALA-ARG-ARG-LYS-LEU-LYS-GLY-ALA-ILE-LEU-THR-THR-MET-LEU-ALA-THR-ARG-ASN-PHE NARRKLKGAILTTMLATRNFSG 22 T 12 HycH pdbhh F Eukaryota T 6tsc 2 B B A0A2R2JFI5_OMPOL GLY-PHE-PRO-TRP-MVA-ILE-MVA-VAL-GLY-VAL-PRO-GLY GFPWXIXVGVPG 12 T 0.49 dCache_2 pdbhh F Eukaryota T 6tsl 2 B BBB PRO-VAL-PRO-ARG PVPR 4 T 85 DUF1230 pdbhh F F 6tsm 2 B BBB PRO-VAL-VAL-ARG PVVR 4 T 120 DUF1634 pdbhh F F 6tsu 2 DA,FA,GA,HA,IA,JA,LA,MA,NA,OA,PA D2,A2,C2,B2,E2,E3,B3,D3,C3,A3,A1 D5AR33_RHOCB Uncharacterized protein MDVFAKHAVSLESPAVRHYEITPSDSTDLARRPRALRVQTGGTLVLRDETGITVTYTVFAGEILPVRPVRVLATGTTATAVGWE 84 T 0.26 DUF2835 pdbpssm F Bacteria T 6tsu 3 EA,KA F2,F3 D5AR34_RHOCB Uncharacterized protein MIALGLGLGLAANGGPALRRYAVNGVAPVAVLDFERHFLSHPLALTRATSATYADALRAVQTAPADTPRYDYSTGKRALLLEASATNLLPNSAQFEAASWGKTRASVLANAALAPNGTMTADKLVEDTSNNSHFVARTGTQIAAGTSVTASIFVKAAERRWFALVTADSANAFRTTYFDLQTGTLGVVSQGAAGHVAQIVAAGNGWYRCSVTQTQAASGNFNFYPSVASANGATSYPGDGASGLYLWGAQLEAGAAVSSVIPTEAAAVTRAADLASVAVAAGSYDLRRVDAAGTAVTKGVAHPGGALTIGAGSLYLLSLFPAGAL 325 T 0.012 CBM_4_9 pdbpssm F Bacteria T 6tt6 1 A A PD-i6 peptide WXVXEAXD 8 T 0.81 ApeA_NTD1 pdbhh F T 6ttu 8 H I IKBA_HUMAN CYS-LYS-LYS-ALA-ARG-HIS-ASP-SEP-GLY CKKERLLDDRHDSGLDSMKDEEDYKDDDDK 30 T 16 GlutR_N pdbhh F Eukaryota T 6tui 2 DA,FA,GA,HA,IA,JA,LA,MA,NA,OA,PA D2,A2,C2,B2,E2,E3,B3,D3,C3,A3,A1 D5AR33_RHOCB Uncharacterized protein MDVFAKHAVSLESPAVRHYEITPSDSTDLARRPRALRVQTGGTLVLRDETGITVTYTVFAGEILPVRPVRVLATGTTATAVGWE 84 T 0.26 DUF2835 pdbpssm F Bacteria T 6tui 3 EA,KA F2,F3 D5AR34_RHOCB Uncharacterized protein MIALGLGLGLAANGGPALRRYAVNGVAPVAVLDFERHFLSHPLALTRATSATYADALRAVQTAPADTPRYDYSTGKRALLLEASATNLLPNSAQFEAASWGKTRASVLANAALAPNGTMTADKLVEDTSNNSHFVARTGTQIAAGTSVTASIFVKAAERRWFALVTADSANAFRTTYFDLQTGTLGVVSQGAAGHVAQIVAAGNGWYRCSVTQTQAASGNFNFYPSVASANGATSYPGDGASGLYLWGAQLEAGAAVSSVIPTEAAAVTRAADLASVAVAAGSYDLRRVDAAGTAVTKGVAHPGGALTIGAGSLYLLSLFPAGAL 325 T 0.012 CBM_4_9 pdbpssm F Bacteria T 6tvj 1 A A PD-i3 peptide LXXRYXDTMY 10 T 0.46 Ima1_N pdbhh F T 6tvw 2 B DDD MET-THR-TRP-MET-GLU-TRP-ASP-ARG-GLU EQIWNNMTWMEWDRE 15 T 0.00021 GP41 pdbhh F T 6tvw 3 C DbD ASN-ASN-TYR-THR-SER-LEU-ILE-HIS-SER-LEU-ILE-GLU-GLU NNYTSLIHSLIEESQ 15 T 7.9 DUF5470 pdbhh F T 6twb 2 C B Double Bridged Peptide F19 XVNIMXCRCPX 11 T 4.9 DUF4668 pdbhh F T 6twc 3 C C Double Bridged Peptide F21 TCVNIMCCRCPX 12 T 5.2 DUF4668 pdbhh F T 6twg 1 A A CRBL_VESCR Crabrolin Plus, mutant of Crabrolin peptide FLPKILRKIVRAL 13 T 1.9 Antimicrobial_8 unphh F Eukaryota T 6twq 2 B,C C,D VE6_HPV16 THR-ARG-ARG-GLU-THR-GLN-LEU SSRTRRETQL 10 T 0.34 FpoO unphh T Viruses T 6twu 2 C C VE6_HPV16 Protein E6 SSRTRREEQL 10 T 0.34 FpoO unphh T Viruses T 6twx 2 C,D C,D VE6_HPV16 16E6 peptide SSRTRRETQL 10 T 0.34 FpoO unphh T Viruses T 6twy 2 C C KS6A1_HUMAN Phosphomimetic RSK1 peptide RRVRKLPETTL 11 T 9.8 CITED pdbhh F Eukaryota T 6twz 2 E,F,G,H E,F,G,D000 phosphorylated 16E6 peptide SSRTRRETQL 10 T 0.34 FpoO unphh F T 6txs 2 B BBB CD44_HUMAN CDW44,EPICAN,EXTRACELLULAR MATRIX RECEPTOR III,ECMR-III,GP90 LYMPHOCYTE HOMING/ADHESION RECEPTOR,HUTCH-I,HEPARAN SULFATE PROTEOGLYCAN,HERMES ANTIGEN,HYALURONATE RECEPTOR,PHAGOCYTIC GLYCOPROTEIN 1,PGP-1,PHAGOCYTIC GLYCOPROTEIN I,PGP-I QKKKLVIN 8 T 0.044 RCR unphh F Eukaryota T 6ty8 2 B B Q9IR43_CPVBM Viral structural protein 4 MFAIDPLKHSKLYEEYGLYLRPHQINQEIKPTTIKKKELAPTIRSIKYASLIHSMLAKHAARHNGTLINPRMYADMITLGNTKVTVTKGTPKAQIDTLKMNGLTVVSKSRRNNKKKPVSDTTATIDENTDDIVTYKALTEMSTLIESFRLPSGLALIIFDDEKYQSLIPNYINQLIAYTQPHIIPTWQGIADFSDTYLRSYFKRPFELTASNLAAPQKYNLSPMTRSIFNNTGREDAVIRKLYGYGEYVFIRYEGCLITWTGIYGEVTMMVNLSKRDLGLDVGDDYLKEYKKLLFYGVITDAIPSGISARSTIMKISPHKMMNPSGGALAVLSKFLEAVVSTNVINATLVVYAEKGAGKTSFLSTYAEQLSLASGQVVGHLSSDAYGRWLAKNKDVEEPSFAYDYVLSLDTDDNESYYEQKASELLISHGISEVAQYELLSVRKKIKMMDEMNEVLIAQLENADTHSERNFYYMVSTGKTTPRTLIVEGHFNAQDATIARTDTTVLLRTINDTTQAMRDRQRGGVVQLFLRDTYYRLLPALHTTVYPFEMLESIRRWKWVH 561 T 0.00013 Zeta_toxin pdbhh T Viruses T 6ty9 2 B B Q9IR43_CPVBM Viral structural protein 4 MFAIDPLKHSKLYEEYGLYLRPHQINQEIKPTTIKKKELAPTIRSIKYASLIHSMLAKHAARHNGTLINPRMYADMITLGNTKVTVTKGTPKAQIDTLKMNGLTVVSKSRRNNKKKPVSDTTATIDENTDDIVTYKALTEMSTLIESFRLPSGLALIIFDDEKYQSLIPNYINQLIAYTQPHIIPTWQGIADFSDTYLRSYFKRPFELTASNLAAPQKYNLSPMTRSIFNNTGREDAVIRKLYGYGEYVFIRYEGCLITWTGIYGEVTMMVNLSKRDLGLDVGDDYLKEYKKLLFYGVITDAIPSGISARSTIMKISPHKMMNPSGGALAVLSKFLEAVVSTNVINATLVVYAEKGAGKTSFLSTYAEQLSLASGQVVGHLSSDAYGRWLAKNKDVEEPSFAYDYVLSLDTDDNESYYEQKASELLISHGISEVAQYELLSVRKKIKMMDEMNEVLIAQLENADTHSERNFYYMVSTGKTTPRTLIVEGHFNAQDATIARTDTTVLLRTINDTTQAMRDRQRGGVVQLFLRDTYYRLLPALHTTVYPFEMLESIRRWKWVH 561 T 0.00013 Zeta_toxin pdbhh T Viruses T 6tyt 2 B B A0A1L8ENT6_XENLA ALA-LYS-GLY-LEU-PHE-MET RPPAGASKPKKKAKGLFM 18 T 24 Aft1_HRR pdbhh F Eukaryota T 6tyt 3 C C APLF_HUMAN ARG-LYS-ARG-ILE-LEU-PRO-THR-TRP-MET-LEU-ALA LAERKRILPTWMLAEH 16 T 0.054 PNISR pdbhh F Eukaryota T 6tyu 2 B B CYREN_HUMAN LYS-THR-ARG-VAL-LEU-PRO-SER-TRP-LEU-THR-ALA SETKTRVLPSWLTAQV 16 T 0.14 PNISR pdbhh F Eukaryota T 6tyv 2 B B WRN_HUMAN THR-THR-ALA-GLN-GLN-ARG-LYS-CYS-PRO-GLU-TRP-MET-ASN TTAQQRKCPEWMNVQN 16 T 0.078 Polyoma_coat2 pdbhh F Eukaryota T 6tyw 2 B B APLF_HUMAN GLU-ARG-LYS-ARG-ILE-LEU-PRO-THR-TRP-MET-LEU-ALA-GLU LAERKRILPTWMLAEH 16 T 0.054 PNISR pdbhh F Eukaryota T 6tyx 2 C,D C,D A0A1L8ENT6_XENLA LYS-GLY-LEU-PHE-MET RPPAGASKPKKKAKGLFM 18 T 24 Aft1_HRR pdbhh F Eukaryota T 6tyz 2 B B APLF_HUMAN GLU-ARG-LYS-ARG-ILE-LEU-PRO-THR-TRP-MET-LEU-ALA LAERKRILPTWMLAEH 16 T 0.054 PNISR pdbhh F Eukaryota T 6tz0 2 B B Q9IR43_CPVBM Viral structural protein 4 MFAIDPLKHSKLYEEYGLYLRPHQINQEIKPTTIKKKELAPTIRSIKYASLIHSMLAKHAARHNGTLINPRMYADMITLGNTKVTVTKGTPKAQIDTLKMNGLTVVSKSRRNNKKKPVSDTTATIDENTDDIVTYKALTEMSTLIESFRLPSGLALIIFDDEKYQSLIPNYINQLIAYTQPHIIPTWQGIADFSDTYLRSYFKRPFELTASNLAAPQKYNLSPMTRSIFNNTGREDAVIRKLYGYGEYVFIRYEGCLITWTGIYGEVTMMVNLSKRDLGLDVGDDYLKEYKKLLFYGVITDAIPSGISARSTIMKISPHKMMNPSGGALAVLSKFLEAVVSTNVINATLVVYAEKGAGKTSFLSTYAEQLSLASGQVVGHLSSDAYGRWLAKNKDVEEPSFAYDYVLSLDTDDNESYYEQKASELLISHGISEVAQYELLSVRKKIKMMDEMNEVLIAQLENADTHSERNFYYMVSTGKTTPRTLIVEGHFNAQDATIARTDTTVLLRTINDTTQAMRDRQRGGVVQLFLRDTYYRLLPALHTTVYPFEMLESIRRWKWVH 561 T 0.00013 Zeta_toxin pdbhh T Viruses T 6tz1 2 B B Q9IR43_CPVBM Viral structural protein 4 MFAIDPLKHSKLYEEYGLYLRPHQINQEIKPTTIKKKELAPTIRSIKYASLIHSMLAKHAARHNGTLINPRMYADMITLGNTKVTVTKGTPKAQIDTLKMNGLTVVSKSRRNNKKKPVSDTTATIDENTDDIVTYKALTEMSTLIESFRLPSGLALIIFDDEKYQSLIPNYINQLIAYTQPHIIPTWQGIADFSDTYLRSYFKRPFELTASNLAAPQKYNLSPMTRSIFNNTGREDAVIRKLYGYGEYVFIRYEGCLITWTGIYGEVTMMVNLSKRDLGLDVGDDYLKEYKKLLFYGVITDAIPSGISARSTIMKISPHKMMNPSGGALAVLSKFLEAVVSTNVINATLVVYAEKGAGKTSFLSTYAEQLSLASGQVVGHLSSDAYGRWLAKNKDVEEPSFAYDYVLSLDTDDNESYYEQKASELLISHGISEVAQYELLSVRKKIKMMDEMNEVLIAQLENADTHSERNFYYMVSTGKTTPRTLIVEGHFNAQDATIARTDTTVLLRTINDTTQAMRDRQRGGVVQLFLRDTYYRLLPALHTTVYPFEMLESIRRWKWVH 561 T 0.00013 Zeta_toxin pdbhh T Viruses T 6tz2 2 B B Q9IR43_CPVBM Viral structural protein 4 MFAIDPLKHSKLYEEYGLYLRPHQINQEIKPTTIKKKELAPTIRSIKYASLIHSMLAKHAARHNGTLINPRMYADMITLGNTKVTVTKGTPKAQIDTLKMNGLTVVSKSRRNNKKKPVSDTTATIDENTDDIVTYKALTEMSTLIESFRLPSGLALIIFDDEKYQSLIPNYINQLIAYTQPHIIPTWQGIADFSDTYLRSYFKRPFELTASNLAAPQKYNLSPMTRSIFNNTGREDAVIRKLYGYGEYVFIRYEGCLITWTGIYGEVTMMVNLSKRDLGLDVGDDYLKEYKKLLFYGVITDAIPSGISARSTIMKISPHKMMNPSGGALAVLSKFLEAVVSTNVINATLVVYAEKGAGKTSFLSTYAEQLSLASGQVVGHLSSDAYGRWLAKNKDVEEPSFAYDYVLSLDTDDNESYYEQKASELLISHGISEVAQYELLSVRKKIKMMDEMNEVLIAQLENADTHSERNFYYMVSTGKTTPRTLIVEGHFNAQDATIARTDTTVLLRTINDTTQAMRDRQRGGVVQLFLRDTYYRLLPALHTTVYPFEMLESIRRWKWVH 561 T 0.00013 Zeta_toxin pdbhh T Viruses T 6u0o 3 C D FLAG peptide DYKDDDDK 8 T 3300 zf-met pdbhh F F 6u19 1 A A PSMD4_HUMAN 26S PROTEASOME REGULATORY SUBUNIT RPN10,26S PROTEASOME REGULATORY SUBUNIT S5A,ANTISECRETORY FACTOR 1,ASF,MULTIUBIQUITIN CHAIN-BINDING PROTEIN SADIDASSAMDTSEPAKEEDDYDVMQDPEFLQSVLENLPGVDPNNEAIRNAMGSLASQATKDGKKDKKEEDKK 73 T 0.081 DUF5797 pdbpercent F Eukaryota T 6u22 2 B C SFTI1_HELAN GLY-ARG-ALA-THR-LYS-SER-ILE-PRO-PRO-ILE-ALA-PHE-PRO-ASP GRATKSIPPIAFPD 14 T 0.052 Bowman-Birk_leg unp F Eukaryota T 6u24 1 A A SFTI1_HELAN GLY-ARG-ALA-THR-LYS-SER-ILE-PRO-PRO-ILE-ALA-PHE-PRO-ASP GRATKSIPPIAFPD 14 T 0.052 Bowman-Birk_leg unp F Eukaryota T 6u2f 2 B B Organo-peptide PCSK9 inhibitor XWNLKXIGLLR 11 T 6 SARG pdbhh F T 6u3i 2 B B cis-1-amino-4-phenylcyclohexaneacyl-WNLK(hR)I(D-ser)LLR - NH2 XWNLKXIXLLR 11 T 21 RNA_polI_A14 pdbhh F T 6u3m 3 E,F E,F Alpha1a peptide AQPMPMPELPYPGSGGSIEGR 21 T 7 DUF3148 pdbhh F T 6u3n 5 E C Peptide APMPMPELPYPGSGGSIEGR 20 T 15 Pertus-S5-tox pdbhh F T 6u42 8 BN,CN,DN 4V,4W,4X A8J0X0_CHLRE FLAGELLAR ASSOCIATED PROTEIN MPSPAREKLMTIKAMEEAKGRSQHARAPAIFRDTALDTHKSIQPEYFGPSTVPEKKEFSTRLSSGRTRSVTKHQRAAMEALQRTSQMAGQGEVRTVFMPTAEQMPVCAAAGERRGNVANSEWALLDTLEVNLYLNEKDARLRSQKAVQQTQRAILDTQVGMLAQAKLAAETAKAAERVELLATVAAHQAEERQRAEEQRAALTRLRTDREAMLAETRVQREAALSRKREEEAKLVAAAQAQLEADRQAAARKAAELKEQAAKTMADNEARLVARKAAEAAQRVADAETTKRMIEMAEAQDRARDRNMKSFHDMIQARARGVGQKAVDDRRDRLEREERLIAEAERAAAQREAERAAAEAERKARLKSDLVSGNEALKRAKAEKLAVEREAEARERAAAEQRVLAEKEAAERQMAGMRERATATKRFVAGQAAAVAERAKTDDIFMSEQERLLNKRLLEQAVATVQRPMQYSVKLY 475 T 0.043 OTCace pdb F Eukaryota T 6u42 11 KN 5E A0A2K3E5X9_CHLRE RIB30 MLNVTGGRRPVASWRTPPGFLERLADAWPAVLDGAVEQAGGDPARVTRDSFLAALREALPGLSAAEDDYARQVSLSVIQQVRGSNVFFPDLDYLQAALLQGRVPPQELDQPRSTLSLATFTTTTRSGTKSLDLFKTTGVTWKIPKGFLNRYNDCNHEVLRRAAALVGARHDGARDVVAGVWGRVDVPTFVEACRQVLGEISADEEEYLIALASEQVQDGTAYIRDLPFLDKCIQNGKTPTSIKGPELLPSIFLNDTTSGKTDGMTLRHTGGRIF 274 T 0.2 FliX pdb F Eukaryota T 6u42 13 NN,ON,PN,QN 5H,5I,5J,5K FLTOP_CHLRE CILIA- AND FLAGELLA-ASSOCIATED PROTEIN 126,FLAGELLUM-ASSOCIATED PROTEIN 126 MSRSYPGEQVEHAFNSKRLKNWEVPAVDKSQAISTSTGTRFGTLQPRSGRTQFIVDDNGHLKSGVPKLEKSAFNFTQTTPVFMDSAPRWPKENPTWPKNMKATMGYKGIQSNYLPTNTVTLKAVEVPGTTERNFNFM 137 T 6.8 DUF3697 pdbhh F Eukaryota T 6u42 19 JO 6E A0A2K3E1X6_CHLRE FAP143 MAEETQPYTSYNKQDEVPTLIGNWVEERELKELTGVTRNLAASQALKDTSDGTSPTRSLGDALTATHPRVIEHVQAQTHAADWQSTVQATYRPPSDATRNAAAYVNTSKMGPRERMLHEQLMREAQDLPPELQATLTGPAVPVTTASTYGADFHQHDLTGIVVGAKVMKDRDGRPAVRDPTFLAETQMMKKDAADRLMGETARQSGARDTTMLPNPDVPVTIYTEAVANKTYGGVFPGTTTLNTAAPFGKSTNFSKPMSDYSKVVVDE 268 T 0.093 DUF1143 pdbhh F Eukaryota T 6u42 21 LO,MO 6G,6H A0A2K3DZF0_CHLRE FAP166 MSLTLNGLDESMRRMQGYEVTRAPEDVGNSIPNFKEGIFTYKGSRQAPWKSEQTHSFSLPNAYTARVLNGTIVHTGGATEMAITTHHTVERPMMPPGTIRGSTWVKPQYIPTDDPALDELHAVAYVVSPQLPALMDACNSYHLHSADGWITTAGFMTAARRAGLTLSRAEYLALERALTKDTMGRINYLQLEALVQAVTAADQTGEGGAEPAAE 214 T 0.028 EF-hand_11 pdbhh F Eukaryota T 6u42 23 VO,WO,XO 6Q,6R,6S A0A2K3DTN6_CHLRE FAP276 MDLKQQVKNYTMTIRNTRPPTMIKEQDKSEFSHFRALQVLANGDEVPYEATLRNVIHDGARQPKLPPRQTQKHPGYIRNESGGFFTS 87 T 0.092 DUF3337 pdbpercent F Eukaryota T 6u42 26 IP,JP,KP 7D,7E,7F A8JF23_CHLRE FLAGELLAR ASSOCIATED PROTEIN 222 MATNSTGPWATGTFSPNSTGTVTQYNHPMFVSQRLTGNFTSQFEMNSLPSHKYETLPIRSGHLPGYQGHVPGGVGAIAQRKPAAAMHTMTHLATSGSLPKGSPQTDMSLVDLRPEQRSMAKVYMYAEGAKTSFLKFPTPKTFDHRN 146 T 0.74 DUF2475 pdbhh F Eukaryota T 6u42 27 LP 7G A8HSW0_CHLRE FLAGELLAR ASSOCIATED PROTEIN 95 MAAYAHNDGAPDISQAFQNTVLVKNWYEDRFQSQVASATGRTLRELPTHERVVHKAVPPGHPGLFQTTKQAAEEKLLTTPPPAKVKKPSMYTEANVAERLQTYGLADNIHYTIGPNAATEASWAPVHNLTTTNKEFYEIKPEAARAADPDTFRASGPSPFAKTGFCAKSVKGEASDETTVAGGKGARGEITRRPGESGNPYGVSVFVDEYGKWGSAIQGMPLTETRARMQTKYFP 235 T 0.25 DUF1143 pdbpercent F Eukaryota T 6u42 29 NP,OP 7I,7J A0A2K3DZI2_CHLRE FAP129 MVHKGPNQAGNKGLLTYNNAVGIPGYTGFMPSTNALALPVKGFEHTGRPAASAEVEKLTVKSVDPRKTSQYADDYHKKPADTKAFSKTGGGYWISQRVLPPHTAFTATTTYRAETLNAEPNTAAILDRSQGLASTLVGYEAARQAGEVRRSISADPRARAEDTARGIGTQTVLGRPGSGGNTSILATVAAAAPSSPQAGSSVMVSSARRPATVPTKYGELPGYQTTYGAATDKMARMQADNELNGTGSFAPSNMGDPRFKTLPRVMNPGMGRNYSSYVAEYGGDGHDPMARQAANKDTMTRISVTRDLAGGTTRNVSHIPRYTGHIPASEYATPEARAQGEAAEPRPDHKSQALTYTLDQYPRGRLPGYTGFKAQAPANIDAGLKHSMKLPCHSTTSGDATLRGTQFGVPHQDHTHYINSRAGLNSFFSNSVVGTEFVSDNGLFNAQVYYKEAKSQGALGIKTAQPSKLTHYGAPFRAAASMV 483 T 0.0018 SPATA48 pdb F Eukaryota T 6u42 30 PP,QP 7K,7L FAP21 MSLTTQSLRRTNYEAEMTQPQIPPAGITGKLHETAKDALTWNDERPSTPDDIKKYRQSTVHEPGKIVRHPGHADDPVPQGPFGVKSAASGGQNINEALKNYPDSELARWKLEQAEGVYASAQREPLGAGYVRGHRLPEGLGSERPFGVTYDARGKDLSRQAAAVIFPTDRPAEEDAATRAMYTRSHQDFQPGEQRRRDYNWDAAGIDPAQHRFGAVDRNGVGDGVRKALQPGLDPSLQAPKVLPKLHEDFKATATDYLGRPRQLGTGDRPQLAPDHAFGQPSMRKGREPGVGELLTGRFGADEQQPDADLGKSLREGYRNQPKPGDEGRAFGVPTIRTDVRLPRLRSVANACNYGNEPDAGQVLRPPRAADLGISDEAFVALRPKSELRQLVDEAGLALSDADFEAAWALAAEADGGAAAAGEGGGAAEGPEGRACVDTFFRARHHLLAQTLQIEPTF 458 T 0.035 DUF6395 pdbpssm F T 6u42 31 RP,SP 7M,7N A8IXN7_CHLRE FLAGELLAR ASSOCIATED PROTEIN 273 MSILGPADRRPELALTGTTISHLKTWRTEYLDEYSDIKLAAGVPEQRMEMAGITAHIGTITGRHTHMHKETTRLPTGHPPSSTYRAQDAVPIGTMTRGTGTITKLGDSCLYDKEQTWAHWRVAVDGKPADTRRKYRGVS 139 T 0.12 Autoind_bind pdbpercent F Eukaryota T 6u42 32 TP,UP 7O,7P A8JC52_CHLRE FLAGELLAR ASSOCIATED PROTEIN 107 MQGDRWSRNCGSGGVGHSGTVNEYRSGVLIGNFVENAAKTTGRMGETILSHTGPGAQTGIPTTTQKRSYTAEGKTGEYLVEASTRHDLNQPGVKGELLTRHGRFDEPPVQCLGTTYQLTYGRADGTDRRVQSYLWHGRKQVDYFVPHSTGGPSTLSLTARKQQEWGTQGATDAYLTTKMAATQPAALATAENPTRTQTLRPLGDSGLMPQPGQKPKGFARDELDKPHHRTGLRVNYRS 238 T 3.1 DUF1143 pdbhh F Eukaryota T 6u42 33 VP 7Q A0A2K3D7C7_CHLRE RIB21 MDATTKTLKSTTRVDNSTNPNFKHTSTFHTRGQWTPESPPPLTSTYTIFHGERPELPRYVPKYAVSPETAALTSRHGSSPYSFRATAERAGSTPDGRATYRFSGLPAGVSPYSTGTKLSSSTLGSSGLPPVQYKSYLTEYVDEYREPLEQLDTQRSLTLKYGTTGGYRTTQRSTRSDGQPKYQTRVVAF 189 T 76 DUF4851 pdbhh F Eukaryota T 6u42 34 WP,XP 7R,7S A8IPZ5_CHLRE OUTER DYNEIN ARM-DOCKING COMPLEX SUBUNIT 1 MAQKSTLKLPRLRTKEELLKTSPELCKLLGEDSDDGRSMSPFTAPPPAGTVKPPSRGLPAVSTKATKGPGMDTPRGLGEEELTEEELLRLELEKIKNERQVLLDSIKLVKAQAGTAGGEAQQNDIKALRRELELKKAKLNELHEDVRRKENVLNKQRDDTTDASRLTPGELSEEQAYIQQLQDEMKQIDEELVEAEAKNRLYYLLGERTRREHLAMDMKVRASQQLKKDSADDLYTLTAHFNEMRAAKEQAERELARMKRMLEETRVDWQKKLRERRREVRELKKRQQKQLERERKMREKQLERERQERELQAKLKMEQDSYEMRVAALAPKVEAMEHSWNRIRTISGADTPEEVLAYWEGLKAKEEQMRSLVSLAEQRESSAKSEIAALLENRSGMYEKGSAAAADVGEGSEERATLITEVERNMEGAKGKFNKLRSVCIGAEQGLRSLQERLMIALEEIHPDQLRASHMKGGHDAKARGKGAASAGARRGSAHAHTPDRNKRGPATGSRSQSPALVPHSPAGDKPSSPLHGTSPEHGHEPIPEGAEELAGEAEMVSPLGADGNTIDDEHFFPELPELLTSVTDRLNRVLVLAAELDAQEPAGAGEDGLPLSGEPGADGAEGAAPASPSRGAPEGLSESERTLVKGMNRRTWTGAPLLETINASPSEAALTLNIKRKKGKKKEQQVQPDLNRILGYTGSDVEEEEPESEEETEEEANKDDGVVDRDYIKLRALKMSQRLANQQRAIKV 749 T 0.00034 CALCOCO1 pdbhh F Eukaryota T 6u42 37 DQ 7Y A8HPK6_CHLRE FLAGELLAR ASSOCIATED PROTEIN 68 MGAANENIHMTDGIRRETMKKETLARERSLAAQSPYMAQVATYRARNPPLDHSRLMQDPKVQDWASIAGTRRSLATNVPDGGPRVNVNLLKYKRDADFISTTPYDGGPSYNAETCMQNWAEDRRDKHYKSGFHPKELRRSTRYDSEYSARFKPTSADYVGRLTHTYNTTSRFEGLTRVGTNGIAAPVLPKRSADTSGEHVFYAKDGYGPTPWMDHTAPTARGRFWVGTAPHVAHDTITHSTLRSEPLEFQQRCPTEDARSKILMGNKPLTHESDRTLRIRDDLVATNTFTRTWRTMYQSDHVDFSRRPATVR 312 T 0.034 DUF1143 pdbpercent F Eukaryota T 6u48 32 FA A phazolicin TXARXDSXSRXGAXGKXSGXAS 22 T 6.7 zf-Dof pdbhh F T 6u4a 2 C,D D,C cyclic peptide 3.1_3 XWWIIPXVKXGCX 13 T 4.2 DUF5989 pdbhh F T 6u61 2 C,D E,D cyclic peptide 3.1_3 XWWIIPXVKXGCX 13 T 4.2 DUF5989 pdbhh F T 6u6k 2 B C cyclic peptide 3.1_3 XWWIIPXVKXGCX 13 T 4.2 DUF5989 pdbhh F T 6u6l 2 B B Cyclic peptide 3.1_2 XWKTIXGXTWRTXQC 15 T 5.8 CyRPA pdbhh F T 6u71 2 B B cyclic peptide 3.1_3 XWWIIPXVKXGCX 13 T 4.2 DUF5989 pdbhh F T 6u72 2 C C 3.1_2_AcK5toA XWKTIAGXTWRTXQC 15 T 6 BNR pdbhh F T 6u74 2 E,F E,F cyclic peptide 3.1_2 XWKTIXGXTWRTXQCX 16 T 6.8 CyRPA pdbhh F T 6u7q 1 A A SFTI1_HELAN GLY-ARG-CYS-THR-LYS-SER-ILE-PRO-PRO-ARG-CYS-PHE-PRO-ASP inhibitor GRCTKSIPPRCFPD 14 T 0.0013 Bowman-Birk_leg pdb F Eukaryota T 6u7r 1 A A SFTI1_HELAN GLY-LYS-CYS-LEU-PHE-SER-ASN-PRO-PRO-ILE-CYS-PHE-PRO-ASN inhibitor GKCLFSNPPICFPN 14 T 0.052 Bowman-Birk_leg unp F Eukaryota T 6u7s 1 A A SFTI1_HELAN GLY-ARG-CYS-TYR-LYS-SER-LYS-PRO-PRO-ILE-CYS-PHE-PRO-ASP inhibitor GRCYKSKPPICFPD 14 T 0.052 Bowman-Birk_leg unp F Eukaryota T 6u7u 1 A A SFTI1_HELAN GLY-ARG-ALA-THR-LYS-SER-ILE-PRO-PRO-ARG-ALA-PHE-PRO-ASP GRATKSIPPRAFPD 14 T 0.052 Bowman-Birk_leg unp F Eukaryota T 6u7w 1 A A GLY-LYS-ALA-LEU-PHE-SER-ASN-PRO-PRO-ILE-ALA-PHE-PRO-ASN GKALFSNPPIAFPN 14 T 7.8 ANAPC16 pdbhh F T 6u7x 1 A A SFTI1_HELAN GLY-ARG-ALA-TYR-LYS-SER-LYS-PRO-PRO-ILE-ALA-PHE-PRO-ASP GRAYKSKPPIAFPD 14 T 0.052 Bowman-Birk_leg unp F Eukaryota T 6u8g 2 E,F E,F cyclic peptide 3.1_2_AcK7toA XWKTIXGATWRTXQCX 16 T 8.1 PSP94 pdbhh F T 6u8h 2 B C cyclic peptide 3.2_2 XWSWLCKXYNLIH 13 T 3.2 Tet_res_leader pdbhh F T 6u8i 2 B B cyclic peptide 3.2_2 XWSWLCKXYNLIH 13 T 3.2 Tet_res_leader pdbhh F T 6u8m 2 C C cyclic peptide 3.2_1 XWXKAILPGXILKTLHIC 18 T 2.8 Packaging_FI pdbhh F T 6u96 2 F,G,H,I,J F,G,H,I,J PHALLOIDIN Derivative XAXCPAW 7 T 1.3 Fe_hyd_lg_C pdbhh F F 6u9e 1 A,B,C A,C,E Q7X3I9_FRANO PdpA MIAVKDITDLNIQDIISQLTSEVINGDTTPSSAKFACEINSYIINYNLSNINLINTQLKNTKILYRKGLISKLDYEKYKRYCIISRFKNNIDEFILYFSTNYKDSQSLKIAIKELQNSCSSSLILELPHDYIRKIDVLLTSIDSAIQRSSDLNKTIIKQLNKLRSSLSRYIGYNNVLQKQEITINIKPINKNFELEDISFVSTRNKQYFKHNSLTLKNPHIEKLEVCENIYGINGWLTFDLAYINNHKDFNFLLSPNQPILLDIQINDSFNFYKKESKKDHHKRTTRFIAIGFNSNSIDIHENFEYSIYSYTKNVSSGVKKFKIQFHDPLKALWTKHKPSYIALNKSLDDIFKDNFFFDSLFSLDTNKSNNLKIRIPQAFISTVNRNFYDFFIQQLEQNKCYLKYFCDKKSGKVSYHVVDQVDNDLQRNIVNSDEDLKDKLSPYDISCFKKQILISNKSNFYVKEKNICPDVTLNTQRKEDRKISDTLVKPFSSILKDNLQSVEYIQSNNDDKQEIITTGFEILLTSRNTLPFLDTEITLSKLDNDQNYLLGATDIKSLYISQRKLLFKRSKYCSKQLYENLHNFHYKSDSESDVYEKIAFTKYPSLTHDNSITYKIKDYSNLTPEYPKYKSFSNFYINGRITIGENVNNDSKKAYKFFKNHKPEESSIAEFQENGEKGTSAILNSKADILYAIEIAKEMLSDKSSDKPIIYLPLKVNINSANNQFIPLRNDDIILIEIQSFTKGEIIELISNSAISTKKAQQQLLQRQLLGSKENCEMAYTQTSDSETFSLTQVNEDCENSFLINDKKGIFLRYKSKGN 820 T 0.4 Usg pdbpercent F Bacteria T 6u9e 2 D,E,F B,D,F Q7X3I8_FRANO VgrG MDYKDDDDKDYKDDDDKDYKDDDDKGSKADHIFNLEEQGLLIDIKDDSKGCTTKLESSGKITHNATESIESSADKQIIENVKDSKISITEKEILLATKKSSIMLSEDKIVIKIGNSLIILDDSNISLESATINIKSSANINIQASQNIDIKSLNNSIKADVNLNAEGLDVNIKGSVTASIKGSAATMVG 189 T 0.0018 DUF2345 pdbpssm F Bacteria T 6u9f 1 A,B,C A,C,E A0Q7H0_FRATN PdpA MIAVKDITDLNIQDIISQLTSEVINGDTTPSSAKFACEINSYIINYNLSNINLINTQLKNTKILYRKGLISKLDYEKYKRYCIISRFKNNIDEFILYFSTNYKDSQSLKIAIKELQNSCSSSLILELPHDYIRKIDVLLTSIDSAIQRSSDLNKTIIKQLNKLRSSLSRYIGYNNVLQKQEITINIKPINKNFELEDISFVSTRNKQYFKHNSLTLKNPHIEKLEVCENIYGINGWLTFDLAYINNHKDFNFLLSPNQPILLDIQINDSFNFYKKESKKDHHKRTTRFIAIGFNSNSIDIHENFEYSIYSYTKNVSSGVKKFKIQFHDPLKALWTKHKPSYIALNKSLDDIFKDNFFFDSLFSLDTNKSNNLKIRIPQAFISTVNRNFYDFFIQQLEQNKCYLKYFCDKKSGKVSYHVVDQVDNDLQRNIVNSDEDLKDKLSPYDISCFKKQILISNKSNFYVKEKNICPDVTLNTQRKEDRKISDTLVKPFSSILKDNLQSVEYIQSNNDDKQEIITTGFEILLTSRNTLPFLDTEITLSKLDNDQNYLLGATDIKSLYISQRKLLFKRSKYCSKQLYENLHNFHYKSDSESDVYEKIAFTKYPSLTHDNSITYKIKDYSNLTPEYPKYKSFSNFYINGRITIGENVNNDSKKAYKFFKNHKPEESSIAEFQENGEKGTSAILNSKADILYAIEIAKEMLSDKSSDKPIIYLPLKVNINSANNQFIPLRNDDIILIEIQSFTKGEIIELISNSAISTKKAQQQLLQRQLLGSKENCEMAYTQTSDSETFSLTQVNEDCENSFLINDKKGIFLRYKSKGN 820 T 0.4 Usg pdbpercent F Bacteria T 6u9f 2 D,E,F B,D,F A0Q7H3_FRATN VgrG MDYKDDDDKDYKDDDDKDYKDDDDKGSKADHIFNLEEQGLLIDIKDDSKGCTTKLESSGKITHNATESIESSADKQIIENVKDSKISITEKEILLATKKSSIMLSEDKIVIKIGNSLIILDDSNISLESATINIKSSANINIQASQNIDIKSLNNSIKADVNLNAEGLDVNIKGSVTASIKGSAATMVG 189 T 0.0018 DUF2345 pdbpssm F Bacteria T 6u9g 1 A,B,C A,C,E A0Q7H0_FRATN PdpA MIAVKDITDLNIQDIISQLTSEVINGDTTPSSAKFACEINSYIINYNLSNINLINTQLKNTKILYRKGLISKLDYEKYKRYCIISRFKNNIDEFILYFSTNYKDSQSLKIAIKELQNSCSSSLILELPHDYIRKIDVLLTSIDSAIQRSSDLNKTIIKQLNKLRSSLSRYIGYNNVLQKQEITINIKPINKNFELEDISFVSTRNKQYFKHNSLTLKNPHIEKLEVCENIYGINGWLTFDLAYINNHKDFNFLLSPNQPILLDIQINDSFNFYKKESKKDHHKRTTRFIAIGFNSNSIDIHENFEYSIYSYTKNVSSGVKKFKIQFHDPLKALWTKHKPSYIALNKSLDDIFKDNFFFDSLFSLDTNKSNNLKIRIPQAFISTVNRNFYDFFIQQLEQNKCYLKYFCDKKSGKVSYHVVDQVDNDLQRNIVNSDEDLKDKLSPYDISCFKKQILISNKSNFYVKEKNICPDVTLNTQRKEDRKISDTLVKPFSSILKDNLQSVEYIQSNNDDKQEIITTGFEILLTSRNTLPFLDTEITLSKLDNDQNYLLGATDIKSLYISQRKLLFKRSKYCSKQLYENLHNFHYKSDSESDVYEKIAFTKYPSLTHDNSITYKIKDYSNLTPEYPKYKSFSNFYINGRITIGENVNNDSKKAYKFFKNHKPEESSIAEFQENGEKGTSAILNSKADILYAIEIAKEMLSDKSSDKPIIYLPLKVNINSANNQFIPLRNDDIILIEIQSFTKGEIIELISNSAISTKKAQQQLLQRQLLGSKENCEMAYTQTSDSETFSLTQVNEDCENSFLINDKKGIFLRYKSKGN 820 T 0.4 Usg pdbpercent F Bacteria T 6u9g 2 D,E,F B,D,F A0Q7H3_FRATN VgrG MDYKDDDDKDYKDDDDKDYKDDDDKGSKADHIFNLEEQGLLIDIKDDSKGCTTKLESSGKITHNATESIESSADKQIIENVKDSKISITEKEILLATKKSSIMLSEDKIVIKIGNSLIILDDSNISLESATINIKSSANINIQASQNIDIKSLNNSIKADVNLNAEGLDVNIKGSVTASIKGSAATMVG 189 T 0.0018 DUF2345 pdbpssm F Bacteria T 6u9x 1 A,B D,A B6SBM0_9TRYP Mitochondrial edited mRNA stability factor 1 GSHMDDALRGELASALDTEGHALPFDVHLQQPHSSGDGTAGDTSTIQLEKLSHPPARFDLLTNSFVYKWQTKAALARKVSGPMREWAAELKYRTGVHIELEPTYPERLSENAVKGSGSDDGDGTQWGAYETADDVDITVYLFGSERGIFNCHKLMEAAIQQDPVYVRLGIFRRLANSSEVEWLMLRRINRELRPPDIPPISLKLPGKWTLLYERYKEAAIRTLWEETGITVDASNVYPTGHLYQTVPQYYWRVPVRYFVAEVPSDIRVEGPQVVPLQYMRNWDARLLRQSPDPIDRAWAQLADPATGCAWMKASMIDQLQKPLRGDNYMAIRYTPPPYSNLQEVVGLGDGSITPSTGNGEDAS 363 T 0.00029 NUDIX pdb F Eukaryota T 6uai 2 B B YSAM peptide YSAM 4 T 150 Polo_box_2 pdbhh F F 6uao 2 B B Peptide EEYSAM EEYSAM 6 T 7.4 Toxin_36 pdbhh F F 6ube 2 B B Peptide LFRAL LFRAL 5 T 55 HPD pdbhh F F 6ubh 2 E,F,G,H E,F,G,H peptide KNFDFWV 7 T 0.55 DUF5926 pdbhh F T 6ubi 3 C,F C,F ENV_HV1H2 HIV fusion peptide 512-519 AVGIGAVF 8 T 4.2 DUF3918 pdbhh T Viruses F 6uc5 3 C P CSP_PLAFA NPNA peptide NPNANPNANPN 11 T 0.89 Cas_Cas7 pdbhh F Eukaryota F 6uce 3 C C ENV_HV1H2 HIV fusion peptide AVGIGAVF 8 T 4.2 DUF3918 pdbhh T Viruses F 6ucf 3 C A ENV_HV1H2 HIV fusion peptide AVGIGAVF 8 T 4.2 DUF3918 pdbhh T Viruses F 6ucx 1 A A S2-1, Wednesday PXVXXEXK 8 T 30 HAV_VP pdbhh F F 6ud9 1 A A S2-2, Morticia PXVXXKXE 8 T 140 SYCE1 pdbhh F F 6udr 1 A A S2-3, Lurch crystal form 1 XTRPDQXXXX 10 T 56 SH3_11 pdbhh F T 6udw 1 A A S2-3, Lurch crystal form 2 XTRPDQXXXX 10 T 56 SH3_11 pdbhh F T 6udz 1 A A S2-4, Pusgley crystal form 1 EXXKXXPPXV 10 T 1.6 Peptidase_C62 pdbhh F F 6uf2 1 A A Q31PX7_SYNE7 Biofilm-related protein MRIDELVPADPRAVSLYTPYYSQANRRRYLPYALSLYQGSSIEGSRAVEGGAPISFVATWTVTPLPADMTRCHLQFNNDAELTYEILLPNHEFLEYLIDMLMGYQRMQKTDFPGAFYRRLLGYDS 125 T 4.3 ATP13 pdbhh F Bacteria T 6uf4 1 A,B A,B S2-4, Pusgley crystal form 2 EXXKXXPPXV 10 F F F 6uf7 1 A A S2-5, Uncle Fester XSEXRPXXIX 10 F F T 6uf8 1 A A S2-6, London Bridge XNXXPXAXKHXE 12 F F T 6uf9 1 A A S4-1, Tim apo-form KLXXXHXXQEXXKLXXXHXXQEXX 24 T 59 SRC-1 pdbhh F T 6ufa 1 A A S4-1, Tim, Zinc-bound form KLXXXHXXQEXXKLXXXHXXQEXX 24 T 59 SRC-1 pdbhh F T 6ufu 1 A A C2-1, Zappy, crystal form 1 XXXSLXXXSL 10 T 29 DUF4548 pdbhh F F 6ug2 1 A A C2-1, Zappy, crystal form 2 XSXXSLXXXSL 11 T 29 DUF4548 pdbhh F F 6ug3 1 A,B A,B C3-1, Sporty, crystal form 1 PRXDPRXDPRXD 12 T 11 RRP36 pdbhh F F 6ug6 1 A,B,C,D A,B,C,D C3-1, Sporty, crystal form 2 PRXDPRXDPRXD 12 T 11 RRP36 pdbhh F F 6ugb 1 A,B A,B C3 symmetric peptide design number 2, Baby Basil PRD 3 T 160 DUF902 pdbhh F F 6ugc 1 A,B,C,D,E,F,G,H,I,J,K A,B,C,D,E,F,G,H,I,J,K C3-3 cyclic peptide design PDDPDDPDD 9 T 0.73 DUF3742 pdbhh F F 6ugd 2 G G Polyglutamate peptide EEEEEEEEEEEEEE 14 T 25 NOA36 pdbhh F F 6uge 2 G G Polyglutamate peptide EEEEEEEEEEEE 12 T 32 DUF5571 pdbhh F F 6ugf 2 G G Polyglutamate peptide EEEEEEEEEEEE 12 T 32 DUF5571 pdbhh F F 6ugm 11 N R H3 N-terminus TMQ 3 T 360 BRX_N pdbhh F F 6uib 3 C C Peptide 23-652 DTLTKSFCYFGTWCQMYGST 20 T 3.6 DUF1911 pdbhh F T 6uj0 2 C,D C,D unidentified polypeptide XXXXXXX 7 F F F 6ujo 3 C C HHAT_HUMAN Protein-cysteine N-palmitoyltransferase HHAT KQWLVWLFL 9 T 0.54 DUF446 pdbhh F Eukaryota T 6ujq 3 C C HHAT_HUMAN Protein-cysteine N-palmitoyltransferase HHAT KQWLVWLLL 9 T 3 tRNA_anti-like pdbhh F Eukaryota F 6uk2 3 C C HHAT_HUMAN HEDGEHOG ACYLTRANSFERASE,MELANOMA ANTIGEN RECOGNIZED BY T-CELLS 2,MART-2,SKINNY HEDGEHOG PROTEIN 1 KQWLVWLLL 9 T 3 tRNA_anti-like pdbhh F Eukaryota F 6uk4 3 C C HHAT_HUMAN Protein-cysteine N-palmitoyltransferase HHAT KQWLVWLFL 9 T 0.54 DUF446 pdbhh F Eukaryota T 6uka 2 B B ELMO2_MOUSE PROTEIN CED-12 HOMOLOG A MPPPSDIVKVAIEWPGANAQLLEIDQKRPLASIIKEVCDGWSLPNPEYYTLRYADGPQLYVTEQTRNDIKNGTILQLAVSA 81 T 0.029 DUF3697 pdbpssm F Eukaryota T 6uke 1 A X I3DBY6_HAEPH HhaI Restriction Endonuclease MNWKEFEVFCVTYLNKTYGNKFAKKGESDSTTSDILFTGNNPFYIEAKMPHSQCGQFVLIPNRAEYKFDYSPKNKSEINPYTQKIMQFMSENFSEYANLSTKGKIIPLPESVFVNWIKEYYKSKSVKFFITSNGDFIIFPIEHFEHYFNVSCTYRIKKSGSRHLNSKSLPDFKQALDKKGISYTMRGLELHSDENIHDKRISGDDKDFLIKENNGAYHVKILSNTFNANVIFSISLKNNISLFILNEDRKAFEAAISL 258 T 0.03 DUF4105 pdbpssm F Bacteria T 6ukf 1 A X I3DBY6_HAEPH HhaI Restriction Endonuclease MNWKEFEVFCVTYLNKTYGNKFAKKGESDSTTSDILFTGNNPFYIEAKMPHSQCGQFVLIPNRAEYKFDYSPKNKSEINPYTQKIMQFMSENFSEYANLSTKGKIIPLPESVFVNWIKEYYKSKSVKFFITSNGDFIIFPIEHFEHYFNVSCTYRIKKSGSRHLNSKSLPDFKQALDKKGISYTMRGLELHSDENIHDKRISGDDKDFLIKENNGAYHVKILSNTFNANVIFSISLKNNISLFILNEDRKAFEAAISL 258 T 0.03 DUF4105 pdbpssm F Bacteria T 6ukg 1 A X I3DBY6_HAEPH HhaI Restriction Endonuclease MNWKEFEVFCVTYLNKTYGNKFAKKGESDSTTSDILFTGNNPFYIEAKMPHSQCGQFVLIPNRAEYKFDYSPKNKSEINPYTQKIMQFMSENFSEYANLSTKGKIIPLPESVFVNWIKEYYKSKSVKFFITSNGDFIIFPIEHFEHYFNVSCTYRIKKSGSRHLNSKSLPDFKQALDKKGISYTMRGLELHSDENIHDKRISGDDKDFLIKENNGAYHVKILSNTFNANVIFSISLKNNISLFILNEDRKAFEAAISL 258 T 0.03 DUF4105 pdbpssm F Bacteria T 6ukh 1 A X I3DBY6_HAEPH HhaI Restriction Endonuclease MNWKEFEVFCVTYLNKTYGNKFAKKGESDSTTSDILFTGNNPFYIEAKMPHSQCGQFVLIPNRAEYKFDYSPKNKSEINPYTQKIMQFMSENFSEYANLSTKGKIIPLPESVFVNWIKEYYKSKSVKFFITSNGDFIIFPIEHFEHYFNVSCTYRIKKSGSRHLNSKSLPDFKQALDKKGISYTMRGLELHSDENIHDKRISGDDKDFLIKENNGAYHVKILSNTFNANVIFSISLKNNISLFILNEDRKAFEAAISL 258 T 0.03 DUF4105 pdbpssm F Bacteria T 6uki 1 A,D X,Y I3DBY6_HAEPH HhaI Restriction Endonuclease MNWKEFEVFCVTYLNKTYGNKFAKKGESDSTTSDILFTGNNPFYIEAKMPHSQCGQFVLIPNRAEYKFDYSPKNKSEINPYTQKIMQFMSENFSEYANLSTKGKIIPLPESVFVNWIKEYYKSKSVKFFITSNGDFIIFPIEHFEHYFNVSCTYRIKKSGSRHLNSKSLPDFKQALDKKGISYTMRGLELHSDENIHDKRISGDDKDFLIKENNGAYHVKILSNTFNANVIFSISLKNNISLFILNEDRKAFEAAISL 258 T 0.03 DUF4105 pdbpssm F Bacteria T 6ule 3 E I CSP_PLAFO CS NANPNANPNANPNANPNANP 20 T 3.2 PT unppercent F Eukaryota F 6ulf 4 D P CSP_PLAFO CS NANPNVDPNANP 12 T 0.17 Cas_Cas7 pdbhh F Eukaryota F 6ulh 1 A A Q5ZTL4_LEGPH LPG2147 (MavC) GPLGSMTTSKLEKTGLHVHEKIKHMVKNYGTMITGIPAEILGQNEAEISVGYVKKMGNMKENIAEVVRKSEMTQPTNSAGKASNEVCDLLLGTEGASEFEKSSYQVLSGDGSNLKGSLPNKNLLVRVEMDRFNAPQKYQKIKREEFNPETAEKNKIYLLEDQLVYLDIFGKVIDLGQTSDTCHRLFNAITTPFYQNYILYDEYIDPEESAEEAAMFEMGEIVKAKMKNIDCWTATHSFTIFVPESDSEDTRTLYPYQAYWTSHTLQQWFSGDKDEKLSRLGIDGYIEKLALLGTTTDSKIRSSIYGELFSPPGKEHVFCTGMNEKFSPLRVKFKVTEVNPEIALQNLEEVQEFIDTNYPGENAKDQCELYKIKAQEAMTKQLEMRLLIE 389 T 0.035 V-ATPase_H_N unppercent F Bacteria T 6ulo 1 A,B A,B Q72TN2_LEPIC Uncharacterized protein MAHHHHHHKIEENQNVSLNEGDIVSKLKETPQETLVPTKWDVGDTTVSNEDRLDLLIPHVQNLGNVYVGVGSEQNLTIAAWAKSDFIYLMDFTQIVVHANTITILFLQKSEKKEDFIRLWGKEGEKEALELIQVSFSDPEVYKKVYKQASPFIRKRHKTNLMLSKKYNYKMFQTDDEQYSYIRKLAIEGKILPIRGNLLGNITLTGIGNTLKKIGRKVGIIYFSNAEEYFAYPQEFKNSILNLPVSESSLVVRTISVRKDLFPWSPGSEISTDRGFHYCVQKISNFQKWLSSGKPGLRSLQVMVEGGTVDKKNGITVVDKEPVVTEDKLPKTGG 334 T 0.014 Hypoth_Ymh unppssm F Bacteria T 6ulp 2 C C Cyclic peptide 3.2_3 XWXQWKXYGLKICX 14 T 0.39 Fungal_KA1 pdbhh F T 6ulq 2 D,E,F D,E,F Cyclic peptide 4.2_3 XWXGYLCLRXRIQRTYNX 18 T 4.2 DUF2569 pdbhh F T 6uls 2 B B E2F1_HUMAN Diacetylated E2F1 Peptide (K117ac and K120ac) HPGXGVXSPGX 11 T 0.11 TP1 unp F Eukaryota T 6ult 2 I,J,K,L I,J,K,L Cyclic peptide 4.2_3 XWXGYLCLRXRIQ 13 T 2.2 RRM_DME pdbhh F T 6ulv 2 E,F E,G Cyclic peptide 4.2_3 XWXNWCWLXRXLLLRX 16 T 1.1 UL17 pdbhh F T 6umm 4 E,J E,J ECCC3_MYCS2 ESX CONSERVED COMPONENT C3,TYPE VII SECRETION SYSTEM PROTEIN ECCC3,T7SS PROTEIN ECCC3 MSRLIFEHQRRLTPPTTRKGTITIEPPPQLPRVVPPSLLRRVLPFLIVILIVGMIVALFATGMRLISPTMLFFPFVLLLAATALYRGGDNKMRTEEVDAERADYLRYLSVVRDNVRAHAAEQRAALEWSHPEPEVLATIPGTRRQWERDPRDRDFLVLRAGRHDVPLDAALKVKDTADEIDLEPVAHSALRGLLDVQRTVRDAPTGLDVAKLARITVIGEADEARAAIRAWIAQAVTWHDPTMLGVALAAPDLESGDWSWLKWLPHVDVPNEADGVGPARYLTTSTAELRERLAPALADRPLFPAESGAALKHLLVVLDDPDADPDDIARKPGLTGVTVIHRTTELPNREQYPDPERPILRVADGRIERWQVGGWQPCVDVADAMSAAEAAHIARRLSRWDSN 403 T 0.071 Bac_export_3 pdb F Bacteria T 6ump 2 B A Q5ZTL4_LEGPH MavC GPLGSMTTSKLEKTGLHVHEKIKHMVKNYGTMITGIPAEILGQNEAEISVGYVKKMGNMKENIAEVVRKSEMTQPTNSAGKASNEVCDLLLGTEGASEFEKSSYQVLSGDGSNLKGSLPNKNLLVRVEMDRFNAPQKYQKIKREEFNPETAEKNKIYLLEDQLVYLDIFGKVIDLGQTSDTCHRLFNAITTPFYQNYILYDEYIDPEESAEEAAMFEMGEIVKAKMKNIDCWTATHSFTIFVPESDSEDTRTLYPYQAYWTSHTLQQWFSGDKDEKLSRLGIDGYIEKLALLGTTTDSKIRSSIYGELFSPPGKEHVFCTGMNEKFSPLRVKFKVTEVNPEIALQNLEEVQEFIDTNYPGENAKDQCELYKIKAQEAMTKQLEMRLLIE 389 T 0.035 V-ATPase_H_N unppercent F Bacteria T 6ums 1 A A Q5ZTL4_LEGPH MavC GPLGSMTTSKLEKTGLHVHEKIKHMVKNYGTMITGIPAEILGQNEAEISVGYVKKMGNMKENIAEVVRKSEMTQPTNSAGKASNEVCDLLLGTEGASEFEKSSYQVLSGDGSNLKGSLPNKNLLVRVEMDRFNAPQKYQKIKREEFNPETAEKNKIYLLEDQLVYLDIFGKVIDLGQTSDTCHRLFNAITTPFYQNYILYDEYIDPEESAEEAAMFEMGEIVKAKMKNIDCWTATHSFTIFVPESDSEDTRTLYPYQAYWTSHTLQQWFSGDKDEKLSRLGIDGYIEKLALLGTTTDSKIRSSIYGELFSPPGKEHVFCTGMNEKFSPLRVKFKVTEVNPEIALQNLEEVQEFIDTNYPGENAKDQCELYKIKAQEAMTKQLEMRLLIE 389 T 0.035 V-ATPase_H_N unppercent F Bacteria T 6uop 1 A A PYL5_ORYSJ PYR1-LIKE PROTEIN 11,OSPYL11,PYR1-LIKE PROTEIN 5,OSPYL5,REGULATORY COMPONENTS OF ABA RECEPTOR 5 AVAAGA 6 T 22 DUF5987 pdbhh F Eukaryota F 6uoq 1 A A PYL5_ORYSJ PYR1-LIKE PROTEIN 11,OSPYL11,PYR1-LIKE PROTEIN 5,OSPYL5,REGULATORY COMPONENTS OF ABA RECEPTOR 5 AVAAGA 6 T 22 DUF5987 pdbhh F Eukaryota F 6uor 1 A A PYL5_ORYSJ PYR1-LIKE PROTEIN 11,OSPYL11,PYR1-LIKE PROTEIN 5,OSPYL5,REGULATORY COMPONENTS OF ABA RECEPTOR 5 AVAAGA 6 T 22 DUF5987 pdbhh F Eukaryota F 6uos 1 A A PYL5_ORYSJ PYR1-LIKE PROTEIN 11,OSPYL11,PYR1-LIKE PROTEIN 5,OSPYL5,REGULATORY COMPONENTS OF ABA RECEPTOR 5 AVAAGA 6 T 22 DUF5987 pdbhh F Eukaryota F 6uou 1 A A PYL5_ORYSJ PYR1-LIKE PROTEIN 11,OSPYL11,PYR1-LIKE PROTEIN 5,OSPYL5,REGULATORY COMPONENTS OF ABA RECEPTOR 5 AVAAGA 6 T 22 DUF5987 pdbhh F Eukaryota F 6uow 1 A A PYL5_ORYSJ PYR1-LIKE PROTEIN 11,OSPYL11,PYR1-LIKE PROTEIN 5,OSPYL5,REGULATORY COMPONENTS OF ABA RECEPTOR 5 AVAAGA 6 T 22 DUF5987 pdbhh F Eukaryota F 6up7 4 D V unidentified peptide XXXXXXXXX 9 F F F 6upw 1 A,C L,M VINC_HUMAN METAVINCULIN,MV MPVFHTRTIESILEPVAQQISHLVIMHEEGEVDGKAIPDLTAPVAAVQAAVSNLVRVGKETVQTTEDQILKRDMPPAFIKVENACTKLVQAAQMLQSDPYSVPARDYLIDGSRGILSGTSDLLLTFDEAEVRKIIRVCKGILEYLTVAEVVETMEDLVTYTKNLGPGMTKMAKMIDERQQELTHQEHRVMLVNSMNTVKELLPVLISAMKIFVTTKNSKNQGIEEALKNRNFTVEKMSAEINEIIRVLQLTSWDEDAWASKDTEAMKRALASIDSKLNQAKGWLRDPSASPGDAGEQAIRQILDEAGKVGELCAGKERREILGTCKMLGQMTDQVADLRARGQGSSPVAMQKAQQVSQGLDVLTAKVENAARKLEAMTNSKQSIAKKIDAAQNWLADPNGGPEGEEQIRGALAEARKIAELCDDPKERDDILRSLGEISALTSKLADLRRQGKGDSPEARALAKQVATALQNLQTKTNRAVANSRPAKAAVHLEGKIEQAQRWIDNPTVDDRGVGQAAIRGLVAEGHRLANVMMGPYRQDLLAKCDRVDQLTAQLADLAARGEGESPQARALASQLQDSLKDLKARMQEAMTQEVSDVFSDTTTPIKLLAVAATAPPDAPNREEVFDERAANFENHSGKLGATAEKAAAVGTANKSTVEGIQASVKTARELTPQVVSAARILLRNPGNQAAYEHFETMKNQWIDNVEKMTGLVDEAIDTKSLLDASEEAIKKDLDKCKVAMANIQPQMLVAGATSIARRANRILLVAKREVENSEDPKFREAVKAASDELSKTISPMVMDAKAVAGNISDPGLQKSFLDSGYRILGAVAKVREAFQPQEPDFPPPPPDLEQLRLTDELAPPKPPLPEGEVPPPRPPPPEEKDEEFPEQKAGEVINQPMMMAARQLHDEARKWSSKPGIPAAEVGIGVVAEADAADAAGFPVPPDMEDDYEPELLLMPSNQPVNQPILAAAQSLHREATKWSSKGNDIIAAAKRMALLMAEMSRLVRGGSGTKRALIQCAKDIAKASDEVTRLAKEVAKQCTDKRIRTNLLQVCERIPTISTQLKILSTVKATMLGRTNISDEESEQATEMLVHNAQNLMQSVKETVREAEAASIKIRTDAGFTLRWVRKTPWYQ 1134 T 1.9E-200 Vinculin pdb F Eukaryota T 6uqe 3 U X RepA-GFP XXXXXXXXXX 10 F F F 6uqe 4 V Y RepA-GFP XXXXXXXXXXX 11 F F F 6uqo 3 U,V X,Y RepA-GFP XXXXXXXXX 9 F F F 6utc 1 A A Q9RP86_VIBVL TRANSCRIPTIONAL REGULATOR,TRANSMEMBRANE TRANSCRIPTION ACTIVATOR MAHHHHHHTNPSESKFRLLENVNGVEVLTPLNHPPLQAWMPSIRQCVNKYAETHTGDSAPVKVIATGGQGNQLILNYIHTLPHSNENVTLRIFSEQNDLGSICK 104 T 0.12 TPPK_C unppercent F Bacteria T 6uud 3 C A CSP_PLAFA Circumsporozoite protein EDNEKLRKPKHKKLKQPA 18 T 8.1 P120R pdbhh F Eukaryota T 6uue 1 A A Q9RP86_VIBVL TRANSCRIPTIONAL REGULATOR,TRANSMEMBRANE TRANSCRIPTION ACTIVATOR MAHHHHHHTNPSESKFRLLENVNGVEVLTPLNHPPLQAWMPSIRQCVNKYAETHTGDSAPVKVIATGGQGNQLILNYIHTLPHSNENVTLRIFSEQNDLGSICK 104 T 0.12 TPPK_C unppercent F Bacteria T 6uvn 6 L N Cas8_HelicalBundle XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 98 F F F 6ux5 1 A A ACR1_ACTEQ U-AITX-AEQ5A,ACRORHAGIN I,ACRORHAGIN-1 SSTPDGTWVKCRHDCFTKYKSCQMSDSCHDEQSCHQCHVKHTDCVNTGCP 50 T 0.027 DUF4802 pdbhh F Eukaryota T 6uxc 1 A,B A,B A0A0H3MBU8_CHLT2 CT253 SNSGSYNARLYTKGSKAKGVVAMLPVFYRTEKSAELLPWNLQAEFSEEISRRLHSSDKLLLIKHHASAGVAAQFFSPTPNISPELATQLLPAEFVVAAEILEQKTTEDVLNPSISASVRVRVFDIRHNKVSMIYQEILDASQSLASGSNDYHRYGWRSKNFDSTPMGLMHQRLFREIVARVEGYVCANYS 190 T 6.1E-05 CsgG unphh F Bacteria T 6uxd 1 A,B A,B A0A0H3MCU1_CHLT2 CT021 AHSPLQSSIQEKILTARPGDYAVLSRGSQKFFFLIRQSSSEATWVEMSEFASLTQQEKKLVEQSSWKNAFHQLQSSKKVYLLRISKNPLMIFVLKNAQWMPLSEKDPLPFFVKILRLPLSPAPSHLIKYKGKERTPWSPRTSLNGELITLPSSAWISVWPKDSSPLSEKNILIYFSNNERLAFPLWTSIDTPTGTVIIKTIEMGHQAASSYPALPNF 217 T 2.7 DUF3868 unphh F Bacteria T 6uxf 1 A A NUCC_VIBMT Vibrio meotecus sp. RC341 NucC MAQDWQLSELLENLHADVQHKLTTVRKSFKHSVVKGDGAENVWVDLFNQYLPERYRASRAFVVDSENQFSEQIDVVIYDRQYSPFIFHYAEQLIIPAESVYAVFEVKQTLNKQHIDAARKKVASVRALHRTSLPIPHAGGVHSPRELIGIIGGLLTLENELKIPDTLMGHLDHDKADKGMLNIGCAADDCFFYYDNDHQRMQVMQHKKATTAFLFELLSQLQKCGTVPMIDIHAYGKWLTPRISE 245 T 0.53 NERD pdbhh F Bacteria T 6uxg 1 A,B,C A,B,C NUCC_VIBMT Vibrio metoecus sp. RC341 NucC MAQDWQLSELLENLHADVQHKLTTVRKSFKHSVVKGDGAENVWVDLFNQYLPERYRASRAFVVDSENQFSEQIDVVIYDRQYSPFIFHYAEQLIIPAESVYAVFEVKQTLNKQHIDAARKKVASVRALHRTSLPIPHAGGVHSPRELIGIIGGLLTLENELKIPDTLMGHLDHDKADKGMLNIGCAADDCFFYYDNDHQRMQVMQHKKATTAFLFELLSQLQKCGTVPMIDIHAYGKWLTPRISE 245 T 0.53 NERD pdbhh F Bacteria T 6uxs 1 A A Cyclic peptide 3.1B XWKTIXGXTWRTXQCX 16 T 6.8 CyRPA pdbhh F T 6uxv 6 I I SNF6_YEAST SWI/SNF COMPLEX COMPONENT SNF6 MGVIKKKRSHHGKASRQQYYSGVQVGGVGSMGAINNNIPSLTSFAEENNYQYGYSGSSAGMNGRSLTYAQQQLNKQRQDFERVRLRPEQLSNIIHDESDTISFRSNLLKNFISSNDAFNMLSLTTVPCDRIEKSRLFSEKTIRYLMQKQHXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 179 T 0.017 Glyco_transf_34 pdbpercent F Eukaryota T 6uxv 7 J J Unknown protein XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 67 F F F 6uxv 8 K K Unknown protein XXXXXXXXXXXXXXXXXXXXXXXXXXXX 28 F F F 6uxv 9 L,O L,O Unknown protein XXXXXXXXXXXXXXXXXX 18 F F F 6uxv 10 M M SWI/SNF global transcription activator complex subunit SWP82 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 83 F F F 6uxv 11 N N Unknown protein XXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 30 F F F 6uxw 15 V I SNF6_YEAST SWI/SNF COMPLEX COMPONENT SNF6 MGVIKKKRSHHGKASRQQYYSGVQVGGVGSMGAINNNIPSLTSFAEENNYQYGYSGSSAGMNGRSLTYAQQQLNKQRQDFERVRLRPEQLSNIIHDESDTISFRSNLLKNFISSNDAFNMLSLTTVPCDRIEKSRLFSEKTIRYLMQKQHXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 179 T 0.017 Glyco_transf_34 pdbpercent F Eukaryota T 6uxw 16 AA,BA,W,X,Y N,O,J,K,L Unknown protein XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 67 F F F 6uxw 17 Z M SWI/SNF global transcription activator complex subunit SWP82 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 83 F F F 6uyi 4 D B PSMD1_HUMAN 26S PROTEASOME REGULATORY SUBUNIT RPN2,26S PROTEASOME REGULATORY SUBUNIT S1,26S PROTEASOME SUBUNIT P112 GPGSQEPEPPEPFEYIDD 18 T 38 AgrD pdbhh F Eukaryota T 6uyj 4 D B PSMD1_HUMAN 26S PROTEASOME REGULATORY SUBUNIT RPN2,26S PROTEASOME REGULATORY SUBUNIT S1,26S PROTEASOME SUBUNIT P112 GPGSQEPEPPEPFEYIDD 18 T 38 AgrD pdbhh F Eukaryota T 6uyx 2 B,D D,B phosphorylated DAXX GSGEAEERIIVLSDSDY 17 T 1.5 Rnk_N pdbhh F T 6uyy 2 B B phosphorylated DAXX GSGEAEERIIVLSDSDY 17 T 1.5 Rnk_N pdbhh F T 6uyz 2 B,D B,D phosphorylated DAXX GSGEAEERIIVLSDSDY 17 T 1.5 Rnk_N pdbhh F T 6uz7 47 UA AK GDPCP XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 147 F F F 6uzm 3 C C Synthetic peptide HIS-LEU-ALA-SER-SER-GLY-HIS-SER-LEU HLASSGHSL 9 T 12 CCDC84 pdbhh F F 6uzn 3 C C Synthetic peptide THR-VAL-ARG-ALA-SER-GLY-HIS-SER-TYR TVRASGHSY 9 T 5.3 Gly_radical pdbhh F T 6uzo 3 C C Synthetic peptide HIS-LEU-ALA-SER-SER-GLY-HIS-SER-TYR HLASSGHSY 9 T 9.3 DUF562 pdbhh F T 6uzp 3 C C Synthetic peptide HIS-LEU-ALA-SER-SER-GLY-HIS-SER-LEU HLASSGHSL 9 T 12 CCDC84 pdbhh F F 6uzq 3 C C Synthetic peptide THR-VAL-ARG-ALA-SER-GLY-HIS-SER-TYR TVRASGHSY 9 T 5.3 Gly_radical pdbhh F T 6uzs 3 C C Synthetic peptide HIS-LEU-ALA-SER-SER-GLY-HIS-SER-TYR HLASSGHSY 9 T 9.3 DUF562 pdbhh F T 6v0n 3 C C RIOK1_HUMAN Riok1 PBM peptide VVPGQFDDADSSD 13 T 0.00026 COPR5 pdbhh F Eukaryota T 6v0o 3 C D ICLN_HUMAN PBM peptide TVAGQFEDADVDH 13 T 0.007 COPR5 pdbhh F Eukaryota T 6v0y 3 C C FIBB_HUMAN Fibrinogen beta 72,74cit69-81 GGYXAXPAKAAAT 13 T 3.3 AT_hook pdbhh F Eukaryota T 6v13 3 C C FIBB_HUMAN Fibrinogen beta 74cit69-81 GGYRAXPAKAAAT 13 T 3.3 AT_hook pdbhh F Eukaryota T 6v15 3 C C FIBB_HUMAN Fibrinogen beta 72,74cit69-81 GGYXAXPAKAAAT 13 T 3.3 AT_hook pdbhh F Eukaryota T 6v18 3 C C FIBB_HUMAN Fibrinogen beta GGYRAXPAKAAAT 13 T 3.3 AT_hook pdbhh F Eukaryota T 6v19 3 C C FIBB_HUMAN Fibrinogen beta 72,74cit69-81 GGYXAXPAKAAAT 13 T 3.3 AT_hook pdbhh F Eukaryota T 6v1a 3 C C FIBB_HUMAN Fibrinogen beta 74cit69-81 GGYRAXPAKAAAT 13 T 3.3 AT_hook pdbhh F Eukaryota T 6v2d 2 G,H,I,J,K,L J,L,B,D,F,H UNC3866 XFALXX 6 T 800 zf-C2H2_11 pdbhh F F 6v2r 2 B B UNC3866 XFALXX 6 T 800 zf-C2H2_11 pdbhh F F 6v2s 2 C,D C,D UNC3866 XFALXX 6 T 800 zf-C2H2_11 pdbhh F F 6v3x 2 B B LCOR_HUMAN PALI1 peptide LKKFPG 6 T 6.8 DUF4136 pdbhh F Eukaryota F 6v3y 2 B B LCOR_HUMAN PALI1 peptide LQKYA 5 T 54 MFAP1 pdbhh F Eukaryota F 6v4b 1 A,B A,B V4JF97_9DELT Neur_chan_LBD domain-containing protein MHNLQQLLPTRSLIWIFSFLTSISIWCTVAHAETEGRVQHFTGYIEDGRGIFYSLPDMKQGDIIYASMQNTGGNLDPLVGIMAEEIDPAVSLGQVLEKALASENDLISELTAVADRIFLGWDDDGGKGYSASLEFTIPRDGTYHIFAGSTITNQRLDKFQPTYTTGSFQLILGLNAPQVISGEGEPEGEVFASLASLEIKPE 202 T 0.088 PPC pdbpercent F Bacteria T 6v4e 2 C,D C,D Stapled peptide QSQQTF(0EH)NLWRLL(MK8)QN(NH2) QSQQTFXNLWRLLXQNX 17 T 0.0017 P53_TAD pdbhh F T 6v4g 2 B B Stapled peptide QSQQTF(0EH)NLWRLE(MK8)QN(NH2) QSQQTFXNLWRLEXQNX 17 T 0.011 P53_TAD pdbhh F T 6v4i 1 A A DanD peptide GXXPIPX 7 T 1.5 NinD pdbhh F F 6v67 1 A,B A,B PD-1 Binding Miniprotein GR918.2 GSCFCVCITGPQWDYRYGNKEQCKKFLTECEQKNPGAEVEIQC 43 T 3.5 SBP_bac_8 pdbhh F T 6v6a 2 B,D B,D S7V0W9_TOXGG Apical Cap Protein 9 (AC9) STRPKFVPCLSTAAAGAGSWMSGNREPSEYPQGM 34 T 1.6 DUF1168 pdbhh F Eukaryota T 6v6s 6 N,O,P,Q,R N,O,Q,R,S Unassigned poly-alanine chain ("staple") XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 50 F F F 6v6s 8 U,X V,Y Unassigned poly-alanine model ("CC") XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 66 F F F 6v6s 9 V W Unassigned poly-alanine model ("HB") XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 165 F F F 6v6s 10 W X Unassigned poly-alanine model ("Lumenal bridge helical bundles") XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 263 F F F 6v7b 3 C,D,E,F,G,H,I,J,K,L,M,N,O,P,Q,R,S,T,U,V,W,X,Y A,B,C,D,E,F,G,H,I,J,K,L,M,N,O,P,Q,R,S,T,U,V,W A0A140F3K6_9VIRU Structural protein VP1 MSVVTTRARIAETLTEKHTLGIEKVVATDSWRVGITSREKKLERINISAEISRRIQDEAIAYARNKGIPYLPGINGIAWKLLRLKWLGYTDQINVVMRTVPAEWRDFLTQIMENTQMESMYSELRKVRV 129 T 0.15 NDUFA12 pdb T Viruses T 6v7k 2 C X alpha/beta-Peptide HH4 XEXNCDIHVXXEWXCFXRX 19 T 16 Desulfoferrodox pdbhh F T 6v7p 2 B,D D,B Protein PIAS GSGEAEERIISLD 13 T 20 Lsm_C pdbhh F T 6v7q 2 B,D D,B Protein PIAS GSGEAEERIISLD 13 T 20 Lsm_C pdbhh F T 6v7r 2 B,D D,B Protein PIAS GSGEAEERIISLD 13 T 20 Lsm_C pdbhh F T 6v7s 2 B,D D,B Protein PIAS GSGEAEERIISLD 13 T 20 Lsm_C pdbhh F T 6v7u 1 A,B A,B A0SML3_9CAUD Quorum sensing anti-activator protein Aqs1 MTNTDLKPLLDNLRNATEFWNLVAAASATDESTVHNRSYRDALDWLESAALALGDALIAQRKAVGGDHE 69 T 0.42 YqaH unphh T Viruses T 6v7v 1 A,B,C,D,E,F A,B,C,D,E,F A0SML3_9CAUD Quorum sensing anti-activator Aqs1 MTNTDLKPLLDNLRNATEFWNLVAAASATDESTVHNRSYRDALDWLESAALALGDALIAQRKAVGGDHE 69 T 0.42 YqaH unphh T Viruses T 6v7w 2 B,C,E,F A,C,D,F A0SML3_9CAUD QUORUM SENSING ANTI-ACTIVATOR PROTEIN AQS1 MTNTDLKPLLDNLRNATEFWNLVKEASATDESTVHNRSYRDALDWLESAALALGDALIAQRKAVGGDHE 69 T 0.42 YqaH unphh T Viruses T 6v7x 1 A,C A,C A0SML3_9CAUD QUORUM SENSING ANTI-ACTIVATOR PROTEIN AQS1 MTNTDLKPLLDNLRNATEFWNLVKEASATDESTVHNRSYRDALDWLESAALALGDALIAQRKAVGGDHE 69 T 0.42 YqaH unphh T Viruses T 6v84 2 B,D C,D LyCALAc ANSRLPTSXI 10 T 10 DUF3697 pdbhh F T 6v8i 3 D,H,L CF,BF,AF A4ZFC2_9CAUD Tape Measure Protein, gp57 MTEYKIKATIEASVAKFKRQIDSAVKSVQRFKRVADQTKDVELNANDKKLQKTIKVAKKSLDAFSNKNVKAKLDASIQDLQQKILESNFELDKLNSKEASPEVKLQKQKLTKDIAEAENKLSELEKKRVNIDVNADNSKFNRVLKVSKASLEALNRSKAKAILDVDNSVANSKIKRTKEELKSIPNKTRSRLDVDTRLSIPTIYAFKKSLDALPNKKTTKVDVDTNGLKKVYAYIIKANDNFQRQMGNLANMFRVFGTVGSNMVGGLLTSSFSILIPVIASVVPVVFALLNAIKVLTGGVLALGGAVAIAGAGFVAFGAMAISAIKMLNDGTLQASSATNEYKKALDGVKSAWTDIIKQNQSAIFTTLANGLNTVKTAMQSLQPFFSGISRGMEEASQSVLKWAENSSVASRFFNMMNTTGVSVFNKLLSAAGGFGDGLVNVFTQLAPLFQWSADWLDRLGQSFSNWANSAAGENSITRFIEYTKTNLPIIGNIFKNVFAGINNLMNAFSGSSTGIFQSLEQMTAKFREWSEQVGQSQGFKDFVSYIQTNGPLIMQLIGNIARGLVAFATAMAPIASAVLRVAVAITGWIANLFEAHPATAQLVGVIITLVGAFRFLIAPILAVMDFLGPLAARLVALVTKFGWAKTGTLVLSKAMTSLKGPIKLVTAIFQLLFGKIGLIRNAITGLVTVFGILGGPITIVIGVIAALIAIFVLLWNKNEGFRNFIINAWNAIKTFMVNVWNVLKAVASVVWNAILTAITTAVSNVYNFIMIVWNQIVAYLQGLWNGIIAIATTVWNLLVTIITTVFTTIMTIVMTIWTAIWTFLSTIWNTIITIATTIWNLLVTVITTVFTTIMTIAMTIWNAIWTFLQTLWNTIVTVATKVWNAITTAISTALQAAWSFISNIWNTIWSFLSGILTTIWNKVVSIFTQVVSTISDKMSQAWNFIVTKGMQWVSTITSTLINFVNRVIQGFVNVVNKVSQGMTNAVNKIKSFIGDFVSAGADMIRGLIRGIGQMAGQLVDAAKNVAKKALDAAKSALGIHSPSREFMDVGMYSMLGFVKGIDNHSSKVIRNVSNVADKVVDAFQPTLNAPDISSITGNLSNLGGNINAQVQHTHSIETSPNMKTVKVEFDVNNDALTSIVNGRNAKRNSEYYL 1154 T 0.077 Nucleoporin_FG2 pdbhh T Viruses T 6v8o 1 A C HTL1_YEAST CHROMATIN STRUCTURE-REMODELING COMPLEX PROTEIN HTL1 MSQNNTISSMNPERAYNNVTLKNLTAFQLLSQRENICELLNLVESTERHNSIINPERQRMSLEEMKKMLDALKNERKK 78 T 0.064 DUF4976 pdbpercent F Eukaryota T 6v8o 14 Q 2 Unknown protein XXXXXXXXXXXXXXXXXXXXXXXXXXXX 28 F F F 6v8o 15 R,S 3,4 Unknown Protein XXXXXXXXXXXXXXXXXXX 19 F F F 6v8o 16 T 5 Unknown protein XXXXXXXXXXXXXX 14 F F F 6v8o 17 U 6 Unknown Protein XXXXXXXXXXXXXXX 15 F F F 6v8o 18 V 7 Unknown Protein XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 49 F F F 6v8w 2 B,BA,D,DA,F,FA,H,HA,J,JA,L,LA,N,NA,P,PA,R,RA,T,TA,V,VA,X,XA,Z,ZA AA,NN,BB,OO,CC,PP,DD,QQ,EE,RR,FF,SS,GG,TT,HH,UU,II,VV,JJ,WW,KK,XX,LL,YY,MM,ZZ IVA-PHE-ALA-PHE-5T3-SER-NH2 XFAFXSX 7 T 250 Glyco_transf_43 pdbhh F F 6v92 5 E C HTL1_YEAST CHROMATIN STRUCTURE-REMODELING COMPLEX PROTEIN HTL1 MSQNNTISSMNPERAYNNVTLKNLTAFQLLSQRENICELLNLVESTERHNSIINPERQRMSLEEMKKMLDALKNERKK 78 T 0.064 DUF4976 pdbpercent F Eukaryota T 6v92 17 T 2 Unknown Protein XXXXXXXXXXXXXXXXXXXXXXXXXXXX 28 F F F 6v92 18 U,V 3,4 Unknown Protein XXXXXXXXXXXXXXXXXXX 19 F F F 6v92 19 W 5 Unknown Protein XXXXXXXXXXXXXX 14 F F F 6v92 20 X 6 Uuknown Protein XXXXXXXXXXXXXXX 15 F F F 6v92 21 Y 7 Unknown Protein XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 49 F F F 6v98 1 A A Q9KMN9_VIBCH Cysteine hydrolase SGNDDFLIPVVFPDYLISVADEQSFELWGVKIKTPAVKAPYLGHAGVILINGETGVTRYYEYGRYKNPKSDIPGNVRKVGVSNVTIKSGLITESSLLKVLKEVSLRSGQEGRISGVVLRGKFFSEADSWLRGKMDLNNSPDKIPYDLDSHNXMTFVIDLADAMGLDPAWKPPVVVPSAYIEQFQLSEIDLDYDYKTNKLTVSE 203 T 0.027 DUF4105 pdbhh F Bacteria T 6v9z 2 B,D C,D A3DCU2_HUNT2 CtA SNAMSEAKKLNIGRELTDEELMEMTGGSTFSIQCQKDYTYKPSLPVVKYGVVIDEPEVVIKYGVGPIVGIKYGVEPIGPIQPMYGIKPVETLK 93 T 0.017 L_biotic_typeA unphh F Bacteria T 6vb0 3 C C Synthetic peptide GLU-LEU-ARG-ALA-ARG-GLU-GLU-SER-TYR ELRAREESY 9 T 0.04 Alpha_TIF pdbhh F T 6vb1 3 C C Synthetic peptide GLU-LEU-ARG-ALA-ARG-GLU-GLU-ALA-TYR ELRAREEAY 9 T 0.12 SUIM_assoc pdbhh F F 6vb2 3 C C LOXE3_HUMAN Synthetic peptide GLU-LEU-ARG-ALA-ARG-GLN-GLU-CYS-TYR ELRARQECY 9 T 1.4 SUIM_assoc pdbhh F Eukaryota T 6vb3 3 C C Synthetic peptide THR-VAL-ALA-ALA-SER-GLY-HIS-SER-TYR TVAASGHSY 9 T 2.5 BAAT_C pdbhh F T 6vb4 3 C C Synthetic peptide THR-VAL-ALA-ALA-SER-GLY-HIS-SER-TYR TVAASGHSY 9 T 2.5 BAAT_C pdbhh F T 6vb5 3 C C Synthetic peptide THR-VAL-ARG-ALA-SER-GLY-HIS-SER-TYR TVRASGHSY 9 T 5.3 Gly_radical pdbhh F T 6vb6 3 C C Synthetic peptide THR-VAL-ARG-ALA-SER-GLY-HIS-SER-TYR TVRASGHSY 9 T 5.3 Gly_radical pdbhh F T 6vb7 3 C C Synthetic peptide THR-VAL-ARG-ALA-SER-GLY-HIS-SER-TYR TVRASGHSY 9 T 5.3 Gly_radical pdbhh F T 6vb9 1 A,B,C,D A,B,C,D ACEA_MYCTU Isocitrate lyase GSHMSVVGTPKSAEQIQQEWDTNPRWKDVTRTYSAEDVVALQGSVVEEHTLARRGAEVLWEQLHDLEWVNALGALTGNMAVQQVRAGLKAIYLSGWQVAGDANLSGHTYPDQSLYPANSVPQVVRRINNALQRADQIAKIEGDTSVENWLAPIVADGEAGFGGALNVYELQKALIAAGVAGSHWEDQLASEKKXGHLGGKVLIPTQQHIRTLTSARLAADVADVPTVVIARTDAEAATLITSDVDERDQPFITGERTREGFYRTKNGIEPCIARAKAYAPFADLIWMETGTPDLEAARQFSEAVKAEYPDQMLAYNCSPSFNWKKHLDDATIAKFQKELAAMGFKFQFITLAGFHALNYSMFDLAYGYAQNQMSAYVELQEREFAAEERGYTATKHQREVGAGYFDRIATTVDPNSSTTALTGSTEEGQFH 431 T 1.8E-47 ICL pdb F Bacteria T 6vc1 1 A,B,C A,B,C Octreotide XCFXKTCT 8 T 0.0016 Urotensin_II pdbhh F F 6vcs 3 F G UNK-UNK-UNK-UNK XXXXXX 6 F F F 6vdb 2 B H ALA-PRO-ARG-PHE-GLY-GLY-VAL-MET-ARG-PRO-ASN-ARG APRFGGVMRPNRYR 14 T 5.3 MraY_sig1 pdbhh F T 6vdp 1 A A SFMD_STRLA 3-methyl-L-tyrosine peroxygenase MTAPADTVHPAGQPDYVAQVATVPFRLGRPEELPGTLDELRAAVSARAGEAVRGLNRPGARTDLAALLAATERTRAALAPVGAGPVGDDPSESEANRDNDLAFGIVRTRGPVAELLVDAALAALAGILEVAVDRGSDLEDAAWQRFIGGFDALLGWLADPHSAPRPATVPGAGPAGPPVHQDALRRWVRGHHVFMVLAQGCALATACLRDSAARGDLPGAEASAAAAEALMRGCQGALLYAGDANREQYNEQIRPTLMPPVAPPKMSGLHWRDHEVLIKELAGSRDAWEWLSAQGSERPATFRAALAETYDSHIGVCGHFVGDQSPSLLAAQGSTRSAVGVIGQFRKIRLSALPEQPATQQGEPS 365 T 0.00091 Hs1pro-1_C pdbhh F Bacteria T 6vdq 1 A A SFMD_STRLA 3-methyl-L-tyrosine peroxygenase MTAPADTVHPAGQPDYVAQVATVPFRLGRPEELPGTLDELRAAVSARAGEAVRGLNRPGARTDLAALLAATERTRAALAPVGAGPVGDDPSESEANRDNDLAFGIVRTRGPVAELLVDAALAALAGILEVAVDRGSDLEDAAWQRFIGGFDALLGWLADPHSAPRPATVPGAGPAGPPVHQDALRRWVRGHHVFMVLAQGCALATACLRDSAARGDLPGAEASAAAAEALMRGCQGALLYAGDANREQYNEQIRPTLMPPVAPPKMSGLHWRDHEVLIKELAGSRDAWEWLSAQGSERPATFRAALAETYDSHIGVCGHFVGDQSPSLLAAQGSTRSAVGVIGQFRKIRLSALPEQPATQQGEPS 365 T 0.00091 Hs1pro-1_C pdbhh F Bacteria T 6vdz 1 A A SFMD_STRLA 3-methyl-L-tyrosine peroxygenase MTAPADTVHPAGQPDYVAQVATVPFRLGRPEELPGTLDELRAAVSARAGEAVRGLNRPGARTDLAALLAATERTRAALAPVGAGPVGDDPSESEANRDNDLAFGIVRTRGPVAELLVDAALAALAGILEVAVDRGSDLEDAAWQRFIGGFDALLGWLADPHSAPRPATVPGAGPAGPPVHQDALRRWVRGHHVFMVLAQGCALATACLRDSAARGDLPGAEASAAAAEALMRGCQGALLYAGDANREQYNEQIRPTLMPPVAPPKMSGLHWRDHEVLIKELAGSRDAWEWLSAQGSERPATFRAALAETYDSHIGVCGHFVGDQSPSLLAAQGSTRSAVGVIGQFRKIRLSALPEQPATQQGEPS 365 T 0.00091 Hs1pro-1_C pdbhh F Bacteria T 6ve0 1 A A SFMD_STRLA 3-methyl-L-tyrosine peroxygenase MTAPADTVHPAGQPDYVAQVATVPFRLGRPEELPGTLDELRAAVSARAGEAVRGLNRPGARTDLAALLAATERTRAALAPVGAGPVGDDPSESEANRDNDLAFGIVRTRGPVAELLVDAALAALAGILEVAVDRGSDLEDAAWQRFIGGFDALLGWLADPHSAPRPATVPGAGPAGPPVHQDALRRWVRGHHVFMVLAQGCALATACLRDSAARGDLPGAEASAAAAEALMRGCQGALLYAGDANREQYNEQIRPTLMPPVAPPKMSGLHWRDHEVLIKELAGSRDAWEWLSAQGSERPATFRAALAETYDSHIGVCGHFVGDQSPSLLAAQGSTRSAVGVIGQFRKIRLSALPEQPATQQGEPS 365 T 0.00091 Hs1pro-1_C pdbhh F Bacteria T 6ve5 2 B B SHLD3_HUMAN REV7-INTERACTING NOVEL NHEJ REGULATOR 1,SHIELD COMPLEX SUBUNIT 3 MWFPYDGSKLPLRPKRSPPVISEEAAEDVKQYLTI 35 T 21 MauJ pdbhh F Eukaryota T 6ve7 4 H H FLTOP_CHLRE CILIA- AND FLAGELLA-ASSOCIATED PROTEIN 126,FLAGELLUM-ASSOCIATED PROTEIN 126 MSRSYPGEQVEHAFNSKRLKNWEVPAVDKSQAISTSTGTRFGTLQPRSGRTQFIVDDNGHLKSGVPKLEKSAFNFTQTTPVFMDSAPRWPKENPTWPKNMKATMGYKGIQSNYLPTNTVTLKAVEVPGTTERNFNFM 137 T 6.8 DUF3697 pdbhh F Eukaryota T 6ve7 6 W,XA W,x A0A2K3DTN6_CHLRE FAP276 MDLKQQVKNYTMTIRNTRPPTMIKEQDKSEFSHFRALQVLANGDEVPYEATLRNVIHDGARQPKLPPRQTQKHPGYIRNESGGFFT 86 T 0.092 DUF3337 unppercent F Eukaryota T 6vek 1 A A CONTACT-DEPENDENT INHIBITOR A MVENNYLSVSEKTELEIAKQKLKNSKDPAEREKAQQKYDALLEKDISSDKAVITACSNGQAASAACAGERLKVIAAKGGYETGHYNNQVSDMYPDAYGQIVNLLNITSVDAQNQQQVKDAMVNYAMVQFGVDRATAQAYVETYDGMKVVAASMAPVIGAAAASKIEVLAGKQRLSNSFEVSSLPDANGKNHITAVKGDAKIPVDKIELYMRGKASGDLDSLQAEYNSLKDARISSQKEFAKDPNNAKRMEVLEKQIHNIERSQDMARVLEQAGIVNTASNNSMIMDKLLDSAQGATSANRKTSVVVSGPNGNVRIYATWTILPDGTKRLSTVNTGTFK 338 T 0.0083 ORF6C pdb F T 6vek 2 B I A0A2A2C800_ECOLX contact-dependent immunity protein CdiI MINVNSTAKDIEGLESYLANGYVEANSFNDPEDDALECLSNLLVKDSRGGLSFCKKILNSNNIDGVFIKGSALNFLLLSEQWSYAFEYLTSNADNITLAELEKALFYFYCAKNETDPYPVPEGLFKKLMKRYEELKNDPDAKFYHLHETYDDFSKAYPLNNHHHHHH 167 T 0.2 PIG-X unp F Bacteria T 6ven 11 O O BRE2_YEAST BREFELDIN-A SENSITIVITY PROTEIN 2,COMPLEX PROTEINS ASSOCIATED WITH SET1 PROTEIN BRE2,SET1C COMPONENT BRE2 MKLGIIPYQEGTDIVYKNALQGQQEGKRPNLPQMEATHQIKSSVQGTSYEFVRTEDIPLNRRHFVYRPCSANPFFTILGYGCTEYPFDHSGMSVMDRSEGLSISRDGNDLVSVPDQYGWRTARSDVCIKEGMTYWEVEVIRGGNKKFADGVNNKENADDSVDEVQSGIYEKMHKQVNDTPHLRFGVCRREASLEAPVGFDVYGYGIRDISLESIHEGKLNCVLENGSPLKEGDKIGFLLSLPSIHTQIKQAKEFTKRRIFALNSHMDTMNEPWREDAENGPSRKKLKQETTNKEFQRALLEDIEYNDVVRDQIAIRYKNQLFFEATDYVKTTKPEYYSSDKRERQDYYQLEDSYLAIFQNGKYLGKAFENLKPLLPPFSELQYNEKFYLGYWQHGEARDESNDKNTTSAKKKKQQQKKKKGLILRNKYVNNNKLGYYPTISCFNGGTARIISEEDKLEYLDQIRSAYCVDGNSKVNTLDTLYKEQIAEDIVWDIIDELEQIALQQ 505 T 0.00011 Neuralized pdbhh F Eukaryota T 6vep 6 F,L,R,X F,L,R,X INSR_HUMAN IR TFEDYLHNVVFVPRPS 16 T 0.00017 DUF4998 unphh F Eukaryota T 6veq 4 D,H F,L INSR_HUMAN IR TFEDYLHNVVFVPRPS 16 T 0.00017 DUF4998 unphh F Eukaryota T 6vfs 2 G G Unidentified protein substrate XXXXXXX 7 F F F 6vfx 2 G G Unidentified peptide substrate XXXXXXX 7 F F F 6vg7 1 A A De novo designed protein RO2_25 MGSSHHHHHHSSGLVPRGSHMTLFVLILSNDKKLIEEARKMAEKANLILITVGDEEELKKAIKKADDIAKKQNSSEAKILILLEKPVSPEYEKKLQKYADAEVRVRTVTSPDEAKRWIKEFSEE 124 T 0.0037 Regulator_TrmB pdbpercent F T 6vga 1 A A De novo designed protein RO2_1 MGSSHHHHHHSSGLVPRGSHMRLVVLIVSNDKKLIEEARKMAEKANLELITVPGSPEEAIRLAQEIAEKAPGPVKVLVLITGSADPDEKTKAKKAAEEARKWNVRVRTVTSPDEAKRWIKEFSEE 125 T 0.0014 Regulator_TrmB pdb F T 6vgb 1 A A De novo designed protein RO2_20 MGSSHHHHHHSSGLVPRGSHMGLLVLIWSNDKKLIEEARKMAEKANLYLLTLETDDKKIEDILKSLGPPVKILVLLEDTKDADKVKKEIEKKARKKNLPVRIRKVTSPDEAKRWIKEFSEE 121 T 0.068 IF3_N pdbpssm F T 6vgn 3 O,P,Q,R,S,T,U V,W,X,Y,Z,a,b R0M-WFP-ALO-PRO-YCP-ALA-MP8 XXXPXAX 7 T 520 zf-CCHC pdbhh F F 6vgq 3 O,P,Q,R,S,T,U U,O,P,Q,R,S,T Z-Gly-leu-phe-CH2Cl XGLFX 5 T 290 gag_pre-integrs pdbhh F F 6vh8 1 A A Excelsatoxin A LPRCDSPFCSLFRIGLCGDKCTCVPLPIFGLCVPDV 36 T 0.0036 Albumin_I pdbhh F T 6vhj 1 A A LAN11_PROMM Prochlorosin 1.1 FFCVQGXANRFXINVC 16 T 0.0065 Bacteriocin_IIc unppercent F Bacteria T 6vi1 3 M,N,O,P,Q,R M,N,O,P,Q,R TERL_BPP22 DNA-PACKAGING PROTEIN GP2,GENE PRODUCT 2,GP2 MELDAILDNLSDEEQIELLELLEEEENYRNTHL 33 T 0.0057 DUF3775 pdbhh T Viruses T 6viu 3 C C THR-VAL-ALA-ALA-SER-GLY-HIS-SER-TYR TVAASGHSY 9 T 2.5 BAAT_C pdbhh F T 6vj8 2 B B Peptide chloromethylketone inhibitor XVRMX 5 T 0.0014 DUF3844 unphh F F 6vj9 2 B B ACE-VAL-ARG-MET-B2A XVRMX 5 T 290 RTP801_C pdbhh F F 6vjq 1 A A Q7TUK2_PROMM Prochlorosin 2.1 CCIXGESPGXAPXNDYKCXKGRGPGGCY 28 T 0.009 NHase_alpha unphh F Bacteria T 6vjz 4 D D USA1_YEAST U1 SNP1-associating protein 1 VRAADNTSSANDNNTVENDESAWNRRVVRPLRNSFPLLLVLIRTFYLIGYNSLVPFFIILEFGSFLPWKYIILLSLLFIFRTVWNTQEVWNLWRDYLHLNEIDEVKFSQIKEFINSNSLTLNFYKKCKDTQSAIDLLMIPNLHEQRLSVYSKYDIEYDTNTPDVGQLNLLFIKVLSGEIPKDALDELFKEFFELYETTRNMNTLYPQDSLNELLLMIWKESQKKDINTLPKYRRWFQTLCSQIAEHNVLDVVLRYIIPDPVNDRVITAVIKNFVLFWVTLLPYVKEKLDDIVAQRARDREQPAPSAQQQENEDEALIIPDEEEPTATGAQPHLYIPDED 339 T 0.03 GyrB_insert pdb F Eukaryota T 6vk0 1 A D USA1_YEAST U1 SNP1-associating protein 1 VRAADNTSSANDNNTVENDESAWNRRVVRPLRNSFPLLLVLIRTFYLIGYNSLVPFFIILEFGSFLPWKYIILLSLLFIFRTVWNTQEVWNLWRDYLHLNEIDEVKFSQIKEFINSNSLTLNFYKKCKDTQSAIDLLMIPNLHEQRLSVYSKYDIEYDTNTPDVGQLNLLFIKVLSGEIPKDALDELFKEFFELYETTRNMNTLYPQDSLNELLLMIWKESQKKDINTLPKYRRWFQTLCSQIAEHNVLDVVLRYIIPDPVNDRVITAVIKNFVLFWVTLLPYVKEKLDDIVAQRARDREQPAPSAQQQENEDEALIIPDEEEPTATGAQPHLYIPDED 339 T 0.03 GyrB_insert pdb F Eukaryota T 6vk9 2 AA,BA,CA,DA,EA,FA,Q,R,S,T,U,V,W,X,Y,Z F,X,Z,1,3,5,D,B,P,R,T,V,H,J,L,N Q74D22_GEOSL Geopilin domain 2 protein AGKIPTTTMGGKDFTFKPSTNVSVSYFTTNGATSTAGTVNTDYAVNTKNSSGNRVFTSTNNTSNIWYIENDAWKGKAVSDSDVTALGTGDVGKSDFSGTEWKSQ 104 T 0.11 DUF1445 pdb F Bacteria T 6vl2 1 A A Stigmurin FFSLIPSLVGGLISAFKX 18 T 0.55 Endotoxin_N pdbhh F T 6vlj 1 A A Q7V447_PROMM Prochlorosin 2.8 AACHNHAPXMPPXYWEGEC 19 T 0.0038 NHase_alpha unphh F Bacteria T 6vln 4 D P CSP_PLAFO CS NVDPNANPNVDP 12 T 0.025 PT unppercent F Eukaryota F 6vlz 85 IC u P-site finger XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 65 F F F 6vm7 5 E C PMEL_HUMAN ME20-M,ME20M,MELANOCYTE PROTEIN PMEL 17,MELANOCYTES LINEAGE-SPECIFIC ANTIGEN GP100,MELANOMA-ASSOCIATED ME20 ANTIGEN,P1,P100,PREMELANOSOME PROTEIN,SILVER LOCUS PROTEIN HOMOLOG ITDQVPFSV 9 T 4.8 PatG_C pdbhh F Eukaryota T 6vm8 5 E C PMEL_HUMAN ME20-M,ME20M,MELANOCYTE PROTEIN PMEL 17,MELANOCYTES LINEAGE-SPECIFIC ANTIGEN GP100,MELANOMA-ASSOCIATED ME20 ANTIGEN,P1,P100,PREMELANOSOME PROTEIN,SILVER LOCUS PROTEIN HOMOLOG IMDQVPFSV 9 T 4.3 DUF1422 pdbhh F Eukaryota T 6vm9 3 C C PMEL_HUMAN ME20-M,ME20M,MELANOCYTE PROTEIN PMEL 17,MELANOCYTES LINEAGE-SPECIFIC ANTIGEN GP100,MELANOMA-ASSOCIATED ME20 ANTIGEN,P1,P100,PREMELANOSOME PROTEIN,SILVER LOCUS PROTEIN HOMOLOG IMDQVPFSV 9 T 4.3 DUF1422 pdbhh F Eukaryota T 6vma 5 E C PMEL_HUMAN ME20-M,ME20M,MELANOCYTE PROTEIN PMEL 17,MELANOCYTES LINEAGE-SPECIFIC ANTIGEN GP100,MELANOMA-ASSOCIATED ME20 ANTIGEN,P1,P100,PREMELANOSOME PROTEIN,SILVER LOCUS PROTEIN HOMOLOG ITDQVPFSV 9 T 4.8 PatG_C pdbhh F Eukaryota T 6vmc 5 E C PMEL_HUMAN ME20-M,ME20M,MELANOCYTE PROTEIN PMEL 17,MELANOCYTES LINEAGE-SPECIFIC ANTIGEN GP100,MELANOMA-ASSOCIATED ME20 ANTIGEN,P1,P100,PREMELANOSOME PROTEIN,SILVER LOCUS PROTEIN HOMOLOG ILDQVPFSV 9 T 6.2 Spin-Ssty pdbhh F Eukaryota T 6vmi 85 IC u P-site finger XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 65 F F F 6vo5 2 C,D C,D H4_HUMAN Histone H4 SGRGKGGKGLGAGGAKRHRK 20 T 11 Shadoo unppercent F Eukaryota T 6vps 1 A,B,C,D,E,F,G,H,I A,B,C,D,E,F,G,H,I ORB2_DROME Translational regulator orb2 QLHQQQHQQQHQQHQQHQQQQQLHQHQQQLS 31 T 0.00077 TFIIA unp F Eukaryota F 6vpx 1 A,B,C A,E,C B9V6B3_9HIV1 Envelope glycoprotein gp120 AEQLWVTVYYGVPVWKEATTTLFCASDARAYDTEVHNVWATHACVPTDPNPQEVVLENVTENFNMWKNNMVEQMHEDIISLWDQSLKPCVKLTPLCVTLNCTDLRNSSSGEKMEGGEIKNCSFNITTSMRDKVQKEYALFYKLDVVPIKNDNTSYRLISCNTSVITQACPKVSFEPIPIHYCAPAGFAILKCNDKKFNGTGPCTNVSTVQCTHGIRPVVSTQLLLNGSLAEEEVVIRSANFTDNAKIIIVQLNKSVEINCTRPNNNTRKSIHIGPGRAFYTTGEIIGDIRQAHCNISGTKWNDTLKQIVVKLKEQFGNKTIVFNHSSGGDPEIVMHSFNCGGEFFYCNSTQLFNSTWNDTEGSNNTKGNGTIVLPCRIKQIVNMWQEVGKAMYAPPIKGQIRCSSNITGLILIRDGGNNNESTEIFRPGGGDMRDNWRSELYKYKVVKIEPLGIAPTKAKRRVVQ 465 T 2.9999999999999995E-54 GP120 pdb T Viruses T 6vpz 3 C C POL_HV1H2 11-mer peptide KRWIILGLNKI 11 T 4.2 COX2-transmemb pdbhh T Viruses T 6vq2 3 C C POL_HV1H2 14-mer peptide KRWIILGLNKIVRM 14 T 6.2 COX2-transmemb pdbhh T Viruses T 6vq6 8 P,Q,R Q,R,S Q5ZWW6_LEGPH Effector protein SidK GMSFIKVGIKMGGLTSEQYHSQVVGKIGYIARCMQTIDPENNLKKIREDYQDVLIWAEKNYRFEEILEASKSGKCPNDLDALSRRSLILQELLRLVSSISPFKMKLDLIESQYEKMKQHVNLWKSDYHVKLNQLNQLTDYLKNAAPTPKNNFLRAMTSVLQMQIAQYGITEDNEGINQLFKLGLHLLAMANEKIDEQYHLFKGYVKDQPEESPFEGILPAEDQKILVKTMIDYAMPKLSSKVLQDKLSALSSSDVLTKTLLDSIDRIVKENEKLNALSKDYKDHDGDYKDHDIDYKDDDDK 301 T 1 DUF3276 unppssm F Bacteria T 6vq7 8 P,Q,R Q,R,S Q5ZWW6_LEGPH Effector protein SidK GMSFIKVGIKMGGLTSEQYHSQVVGKIGYIARCMQTIDPENNLKKIREDYQDVLIWAEKNYRFEEILEASKSGKCPNDLDALSRRSLILQELLRLVSSISPFKMKLDLIESQYEKMKQHVNLWKSDYHVKLNQLNQLTDYLKNAAPTPKNNFLRAMTSVLQMQIAQYGITEDNEGINQLFKLGLHLLAMANEKIDEQYHLFKGYVKDQPEESPFEGILPAEDQKILVKTMIDYAMPKLSSKVLQDKLSALSSSDVLTKTLLDSIDRIVKENEKLNALSKDYKDHDGDYKDHDIDYKDDDDK 301 T 1 DUF3276 unppssm F Bacteria T 6vq8 8 P,Q,R Q,R,S Q5ZWW6_LEGPH Effector protein SidK GMSFIKVGIKMGGLTSEQYHSQVVGKIGYIARCMQTIDPENNLKKIREDYQDVLIWAEKNYRFEEILEASKSGKCPNDLDALSRRSLILQELLRLVSSISPFKMKLDLIESQYEKMKQHVNLWKSDYHVKLNQLNQLTDYLKNAAPTPKNNFLRAMTSVLQMQIAQYGITEDNEGINQLFKLGLHLLAMANEKIDEQYHLFKGYVKDQPEESPFEGILPAEDQKILVKTMIDYAMPKLSSKVLQDKLSALSSSDVLTKTLLDSIDRIVKENEKLNALSKDYKDHDGDYKDHDIDYKDDDDK 301 T 1 DUF3276 unppssm F Bacteria T 6vq9 6 N,O,P Q,R,S Q5ZWW6_LEGPH Effector protein SidK GMSFIKVGIKMGGLTSEQYHSQVVGKIGYIARCMQTIDPENNLKKIREDYQDVLIWAEKNYRFEEILEASKSGKCPNDLDALSRRSLILQELLRLVSSISPFKMKLDLIESQYEKMKQHVNLWKSDYHVKLNQLNQLTDYLKNAAPTPKNNFLRAMTSVLQMQIAQYGITEDNEGINQLFKLGLHLLAMANEKIDEQYHLFKGYVKDQPEESPFEGILPAEDQKILVKTMIDYAMPKLSSKVLQDKLSALSSSDVLTKTLLDSIDRIVKENEKLNALSKDYKDHDGDYKDHDIDYKDDDDK 301 T 1 DUF3276 unppssm F Bacteria T 6vqa 6 N,O,P Q,R,S Q5ZWW6_LEGPH Effector protein SidK GMSFIKVGIKMGGLTSEQYHSQVVGKIGYIARCMQTIDPENNLKKIREDYQDVLIWAEKNYRFEEILEASKSGKCPNDLDALSRRSLILQELLRLVSSISPFKMKLDLIESQYEKMKQHVNLWKSDYHVKLNQLNQLTDYLKNAAPTPKNNFLRAMTSVLQMQIAQYGITEDNEGINQLFKLGLHLLAMANEKIDEQYHLFKGYVKDQPEESPFEGILPAEDQKILVKTMIDYAMPKLSSKVLQDKLSALSSSDVLTKTLLDSIDRIVKENEKLNALSKDYKDHDGDYKDHDIDYKDDDDK 301 T 1 DUF3276 unppssm F Bacteria T 6vqb 6 N,O,P Q,R,S Q5ZWW6_LEGPH Uncharacterized protein GMSFIKVGIKMGGLTSEQYHSQVVGKIGYIARCMQTIDPENNLKKIREDYQDVLIWAEKNYRFEEILEASKSGKCPNDLDALSRRSLILQELLRLVSSISPFKMKLDLIESQYEKMKQHVNLWKSDYHVKLNQLNQLTDYLKNAAPTPKNNFLRAMTSVLQMQIAQYGITEDNEGINQLFKLGLHLLAMANEKIDEQYHLFKGYVKDQPEESPFEGILPAEDQKILVKTMIDYAMPKLSSKVLQDKLSALSSSDVLTKTLLDSIDRIVKENEKLNALSKDYKDHDGDYKDHDIDYKDDDDK 301 T 1 DUF3276 unppssm F Bacteria T 6vqd 3 C C POL_HV1H2 8-mer peptide KRWIILGL 8 T 0.56 COX2-transmemb pdbhh T Viruses T 6vqe 3 C C POL_HV1H2 13-mer peptide KRWIILGLNKIVR 13 T 5.7 COX2-transmemb pdbhh T Viruses T 6vqp 2 B Q CalU17 His-Tagged protein MGSSHHHHHHSSGLVPRGS 19 T 9200 zf_CCCH_4 pdbhh F T 6vqv 1 A,B A,B AcrF9 MKAAYIIKEVQNINSEREGTQIEATSLSQAKRIASKEQCFHGTVMRIETVNGLWLAYKEDGKRWVDCQ 68 T 0.11 Ribosomal_L19 pdbpssm F T 6vqw 4 I A AcrF8 MARIAPNEDSTMSTAYIIFNSSVAAVVDTEIANGANVTFSTVTVKEEINANRDFNLVNAQNGKISRAKRWGNEASKCEYFGREINPTEFFIK 92 T 10 PHF12_MRG_bd pdbhh F T 6vqx 1 A A A0A6B1YCA6_PSEAI AcrF6 MKVPAFFAANILTIEQIIEAINNDGSAMTSAPEIAGYYAWDAATDALESENDLEQLTEDDFVAHLEVLEERGAKIDRDAAIAVALQFQAAAVNDLHSGDE 100 T 0.47 SUB1_ProdP9 pdb F Bacteria T 6vqy 3 E,F G,F POL_HV1H2 7-mer peptide KRWIILG 7 T 0.7 COX2-transmemb pdbhh T Viruses T 6vqz 3 E,F G,F POL_HV1H2 6-mer peptide KRWIIL 6 T 6.1 HofP pdbhh T Viruses F 6vr4 1 A,B A,B S0A2C3_9CAUD DNA-dependent RNA polymerase MGSSHHHHHHSQDPMACKIENIKYKGKEVESKLGSQLIDIFNDLDRAKEEYDKLSSPEFIAKFGDWINDEVERNVNEDGEPLLIQDVRQDSSKHYFFILKNGERFDLLTREFDSFTSPDLTNEIKEITDQLSYYIYNKHFSSDFEQVEGAKLNIQNEISQFVKEGKAPVQAAYNKLQDPDIKDLLDYYDNIEKHSDEFESEIVKFFSEKKLIIKDAELEDVTQEGLNEGLQGGDLVQAFEKNSKDNATANVKLMLSFLPKIDNLTGEPALGDYLNKPVFRSFDSIHSELLEVLSDITTLHVQGEVLDVFSSMYNKIKELADFKKSFKPLLEILDTIDEQKKTEFVQAFYLSKINFYTTTIETLETEDQNNTLTTFKVQNVSNANNPISSKLTEYYTNFKYKILPGGKLNKGKLKDLQSTVTSLLEKTRKENNPKYKSDSDFYEVFEEGVVELMQVFEDLGVDSITFEAMDIFLKQFRFDLPENNAYKIMYQQYQGKLTNLNNLLKDIQSNKINPYKINPFKNYSNLIFNSLAEAENYFIENNNESTIFSNGKTYWNFARPSYISNRINTFKNNPGVLRQLLNTSYGQSSLWAKHLLGEEKNVTGDFVLAGNARESASENRLKSLELSIFNSLQEKDKGAEGNDNGSISIVDQLADKLNKVLRGGTKNGTSIYSTVTPGDKSTLHEIKIDHFIPETISSFSNGTMIFNDKIVNAFTDHFVSEVNRMKEAYQELETLPESKRVVHYHTDARGNVMKDGKLAGNAFKSGHILSELSFDQITQDDNEMLKLYNEDGSPINPKGAVSNEQKILIKQTINKVLNQRIKENIRYFKDQGLVIDTVNKDGNKGFHFHGLDKSIMSEYTDDIQLTEFDISHVVSDFTLNSILASIEYTKLFTGDPANYKNMVDFFKRVPATYTNGTNLRLGLEANDHLFDVAVLENIVKPSAYLKEIGESLKLSDLSEAEKKYILEAYEDVNQTDAQAWITPKRWAFLISRTGKWNSKYQSVYNKILKSESLDASEMKLAAQPLKGVYFGLVNNTPTYLKYSQAVLLPQLVAGTQLQSLADAMNKQDIGESIVLDGVKVGATTPNIVTDENGDILKSISLNPLTLSNADWKLQQDLPVKTIKPTLLGSQIQKNIYSSLTDEATYTIENEAFNGSGMFQAINDTVSAMSNLSIAGLSSELGKDSEGKIDKRKLYDMLEREMLDKGSAINLLKSIQKNLPIEAMPGIKDKLYNIVFSKINSAAVKLKTNGGSFIQLSNFGLDKQTADAKGITWLVEPSDLKPPVIEKDADGKNYIRPGQIFMSHVQIAKLVPDYAKMDSKTLSSMIDPKALRAIGYRIPNQGQSSNDPLQIVGILPEAMGDTIVAYTEIPTKTGSDFDIDKMYVMLPNFKVEHTKKSFKLAKDYIAQNEITVEEMYDELEDHGFNIDDIANGEEVTESAITEAFIKNHILNSNSELEYHNDFVKQHNIDAVNKIDFLGYSEELHKNKSEQLQNRLFDLYWAVLTNEKTYGDLITPIDFPHVKDEIKRVFGDNSKQTGENLKFHDPLYQLKLKFTYAGGKSGVGITANMLVDHNRSKGIDMQFNQYNLGVGHTQNGNTVFDKEYSEELNGTRFKIKDTISAFLNAFVDNAKDPYINDGNFNTYTSSVAFMLIRAGVHPDWIISFIGQPVLRELADFTQRYESKIIPKEDVGKSSFDIIVEKYETINQESYKDAESRAFSLDTLQESIEVGVHGIDLDVLKTFKGFQEQAKRLNESVQLSRFDTNGSGKNILDLIILKNKIKNLYVSEQTQQKGSMMNHFKKYHNNGKITSLGTQVKNTLLFTDDILNNNPSLFLLGSKPIQDLVNSISNNLVDSRGGSRGLLTNEDVGKLFYKEVYKYIMADFAPFKVGDPMAYIKDTIFDLVNYKTEDKQYDSSNFFIENMTVYENSFGITNKNKSVDFQDRLYRSAYDLMMENPELANKMFISSFLMNGFENKLIDIKEYIPYQWFLENDIRSFIESKNTGLKDSSESLRSFEEQFIKNNSDSNILAPKVSQSVIKSIKGIKSKHVFELPINDKTKRYILGATETKEEVLPNYVKVGSDLYRLKAYREKSGVYVRTNKLGFEDPKSFLSIKEYKFGTRTGGNFTGELTKQELVYTNQWVNENITLANGYISADSRTVDNPADKILEQNSLENILFSQNNVVSSDENDITKQECK 2194 T 0.038 HTH_40 pdbpercent T Viruses T 6vrb 2 B A CS13A_LISSS CRISPR-ASSOCIATED ENDORIBONUCLEASE C2C2,ENDORNASE,LSEC2C2 TMRITKVEVDRKKVLISRDKNGGKLVYENEMQDNTEQIMHHKKSSFYKSVVNKTICRPEQKQMKKLVHGLLQENSQEKIKVSDVTKLNISNFLNHRFKKSLYYFPENSPDKSEEYRIEINLSQLLEDSLKKQQGTFICWESFSKDMELYINWAENYISSKTKLIKKSIRNNRIQSTESRSGQLMDRYMKDILNKNKPFDIQSVSEKYQLEKLTSALKATFKEAKKNDKEINYKLKSTLQNHERQIIEELKENSELNQFNIEIRKHLETYFPIKKTNRKVGDIRNLEIGEIQKIVNHRLKNKIVQRILQEGKLASYEIESTVNSNSLQKIKIEEAFALKFINACLFASNNLRNMVYPVCKKDILMIGEFKNSFKEIKHKKFIRQWSQFFSQEITVDDIELASWGLRGAIAPIRNEIIHLKKHSWKKFFNNPTFKVKKSKIINGKTKDVTSEFLYKETLFKDYFYSELDSVPELIINKMESSKILDYYSSDQLNQVFTIPNFELSLLTSAVPFAPSFKRVYLKGFDYQNQDEAQPDYNLKLNIYNEKAFNSEAFQAQYSLFKMVYYQVFLPQFTTNNDLFKSSVDFILTLNKERKGYAKAFQDIRKMNKDEKPSEYMSYIQSQLMLYQKKQEEKEKINHFEKFINQVFIKGFNSFIEKNRLTYICHPTKNTVPENDNIEIPFHTDMDDSNIAFWLMCKLLDAKQLSELRNEMIKFSCSLQSTEEISTFTKAREVIGLALLNGEKGCNDWKELFDDKEAWKKNMSLYVSEELLQSLPYTQEDGQTPVINRSIDLVKKYGTETILEKLFSSSDDYKVSAKDIAKLHEYDVTEKIAQQESLHKQWIEKPGLARDSAWTKKYQNVINDISNYQWAKTKVELTQVRHLHQLTIDLLSRLAGYMSIADRDFQFSSNYILERENSEYRVTSWILLSENKNKNKYNDYELYNLKNASIKVSSKNDPQLKVDLKQLRLTLEYLELFDNRLKEKRNNISHFNYLNGQLGNSILELFDDARDVLSYDRKLKNAVSKSLKEILSSHGMEVTFKPLYQTNHHLKIDKLQPKKIHHLGEKSTVSSNQVSNEYCQLVRTLLTMK 1087 T 0.51 MbeB_N unp F Bacteria T 6vrb 3 C C AcrVIA1 MIYYIKDLKVKGKIFENLMNKEAVEGLITFLKKAEFEIYSRENYSKYNKWFEMWKSPTSSLVFWKNYSFRCHLLFVIEKDGECLGIPASVFESVLQIYLADPFAPDTKELFVEVCNLYECLADVTVVEHFEAEESAWHKLTHNETEVSKRVYSKDDDELLKYIPEFLDTIATNKKSQKYNQIQGKIQEINKEIATLYESSEDYIFTEYVSNLYRESAKLEQHSKQILKE 229 T 0.32 DUF5377 pdb F T 6vrc 1 A A CS13A_LISSS CRISPR-ASSOCIATED ENDORIBONUCLEASE C2C2,ENDORNASE,LSEC2C2 MWISIKTLIHHLGVLFFCDYMYNRREKKIIEVKTMRITKVEVDRKKVLISRDKNGGKLVYENEMQDNTEQIMHHKKSSFYKSVVNKTICRPEQKQMKKLVHGLLQENSQEKIKVSDVTKLNISNFLNHRFKKSLYYFPENSPDKSEEYRIEINLSQLLEDSLKKQQGTFICWESFSKDMELYINWAENYISSKTKLIKKSIRNNRIQSTESRSGQLMDRYMKDILNKNKPFDIQSVSEKYQLEKLTSALKATFKEAKKNDKEINYKLKSTLQNHERQIIEELKENSELNQFNIEIRKHLETYFPIKKTNRKVGDIRNLEIGEIQKIVNHRLKNKIVQRILQEGKLASYEIESTVNSNSLQKIKIEEAFALKFINACLFASNNLRNMVYPVCKKDILMIGEFKNSFKEIKHKKFIRQWSQFFSQEITVDDIELASWGLRGAIAPIRNEIIHLKKHSWKKFFNNPTFKVKKSKIINGKTKDVTSEFLYKETLFKDYFYSELDSVPELIINKMESSKILDYYSSDQLNQVFTIPNFELSLLTSAVPFAPSFKRVYLKGFDYQNQDEAQPDYNLKLNIYNEKAFNSEAFQAQYSLFKMVYYQVFLPQFTTNNDLFKSSVDFILTLNKERKGYAKAFQDIRKMNKDEKPSEYMSYIQSQLMLYQKKQEEKEKINHFEKFINQVFIKGFNSFIEKNRLTYICHPTKNTVPENDNIEIPFHTDMDDSNIAFWLMCKLLDAKQLSELRNEMIKFSCSLQSTEEISTFTKAREVIGLALLNGEKGCNDWKELFDDKEAWKKNMSLYVSEELLQSLPYTQEDGQTPVINRSIDLVKKYGTETILEKLFSSSDDYKVSAKDIAKLHEYDVTEKIAQQESLHKQWIEKPGLARDSAWTKKYQNVINDISNYQWAKTKVELTQVRHLHQLTIDLLSRLAGYMSIADRDFQFSSNYILERENSEYRVTSWILLSENKNKNKYNDYELYNLKNASIKVSSKNDPQLKVDLKQLRLTLEYLELFDNRLKEKRNNISHFNYLNGQLGNSILELFDDARDVLSYDRKLKNAVSKSLKEILSSHGMEVTFKPLYQTNHHLKIDKLQPKKIHHLGEKSTVSSNQVSNEYCQLVRTLLTMKHHHHHH 1126 T 0.12 FRB_dom pdbpercent F Bacteria T 6vro 2 B B CRBG1_HUMAN ABSENT IN MELANOMA 1 PROTEIN KRKKARMPNSPAPHFAMPPIHEDHLE 26 T 11 DUF3320 pdbhh F Eukaryota T 6vrw 1 A,C,E G,A,D A0A0N9FF17_9HIV1 Envelope glycoprotein gp120 GLWVTVYYGVPVWREAKTTLFCASDAKSYEKEVHNVWATHACVPTDPNPQELVLENVTENFNMWKNDMVDQMHEDIISLWDQSLKPCVKLTPLCVTLNCSDAKVNATYKGTREEIKNCSFNATTELRDKKRREYALFYRLDIVPLSGEGNNNSEYRLINCNTSVITQICPKVTFDPIPIHYCAPAGYAILKCNNKTFNGTGPCNNVSTVQCTHGIKPVVSTQLLLNGSLAEEEIIIRSENLTDNVKTIIVHLNESVEITCTRPNNMTRKSVRIGPGQTFYALGDIIGDIRQPHCNISEIKWEKTLQRVSEKLREHFNKTIIFNQSSGGDLEITTHSFNCGGEFFYCNTSDLFFNKTFNETYSTGSNSTNSTITLPCRIKQIINMWQEVGRAMYAPPIAGNITCKSNITGLLLTRDGGGNNSTKETFRPGGGNMRDNWRSELYKYKVVEVKPLGIAPTECNRTVVQRRRRRR 471 T 3.2999999999999996E-53 GP120 pdbpssm T Viruses T 6vtt 1 A,B,C F,E,G A0A0N9FF17_9HIV1 Envelope glycoprotein gp120 GLWVTVYYGVPVWREAKTTLFCASDAKSYEKEVHNVWATHACVPTDPNPQELVLENVTENFNMWKNDMVDQMHEDIISLWDQSLKPCVKLTPLCVTLNCSDAKVNATYKGTREEIKNCSFNATTELRDKKRREYALFYRLDIVPLSGEGNNNSEYRLINCNTSVITQICPKVTFDPIPIHYCAPAGYAILKCNNKTFNGTGPCNNVSTVQCTHGIKPVVSTQLLLNGSLAEEEIIIRSENLTDNVKTIIVHLNESVEITCTRPNNMTRKSVRIGPGQTFYALGDIIGDIRQPHCNISEIKWEKTLQRVSEKLREHFNKTIIFNQSSGGDLEITTHSFNCGGEFFYCNTSDLFFNKTFNETYSTGSNSTNSTITLPCRIKQIINMWQEVGRAMYAPPIAGNITCKSNITGLLLTRDGGGNNSTKETFRPGGGNMRDNWRSELYKYKVVEVKPLGIAPTECNRTVVQRRRRRR 471 T 3.2999999999999996E-53 GP120 pdbpssm T Viruses T 6vtw 1 A A S4_2.45 CSVVVGENYSIKCDATKCTIEDKNRGIIKTVTGSRCEELAKAVQKAQ 47 T 6.3 DUF2997 pdbhh F T 6vu4 1 A,B C,A APP,ABPP,APPI,ALZHEIMER DISEASE AMYLOID PROTEIN,AMYLOID PRECURSOR PROTEIN,AMYLOID-BETA A4 PROTEIN,CEREBRAL VASCULAR AMYLOID PEPTIDE,CVAP,PREA4,PROTEASE NEXIN-II,PN-II HXKLVXFAEXAIIGLMV 17 T 0.0046 Beta-APP pdbhh F T 6vvs 6 G G unknown XXXXXXXXXXXXXXXXX 17 F F F 6vw9 2 B B S6FCX2_CAEEL K+/Cl-Cotransporter SKMHTAVRLNELLLQHSANSQLILLNLPKPPVHKDQQALDDYVHYLEVMTDKLNRVIFVRGTGKEVITESS 71 F F Eukaryota T 6vxg 1 A A RAGE_HUMAN RECEPTOR FOR ADVANCED GLYCOSYLATION END PRODUCTS MWQRRQRRGEERKAPENQEEEEERAELNQSEEPEAGESSTGGP 43 T 0.0011 TMEM154 unphh F Eukaryota T 6vxy 2 B C SFTI1_HELAN SFTI1 inhibitor GLY-ARG-GLY-THR-LYS-SER-ILE-PRO-PRO-ILE-ALA-PHE-PRO-ASP GRGTKSIPPIAFPD 14 T 1 Antimicrobial23 pdbhh F Eukaryota T 6vy2 1 A,C,E A,C,E A0A0A7I3C6_9HIV1 SURFACE PROTEIN GP120, SU, GP120 MPMGSLQPLATLYLLGMLVASVLAAENLWVTVYYGVPVWKEAKTTLFCASDAKAYEKKVHNVWATHACVPTDPNPQEMVLKNVTENFNMWKNDMVDQMHEDVISLWDQSLKPCVKLTPLCVTLNCTNATASNSSIIEGMKNCSFNITTELRDKREKKNALFYKLDIVQLDGNSSQYRLINCNTSVITQACPKVSFDPIPIHYCAPAGYAILKCNNKTFTGTGPCNNVSTVQCTHGIKPVVSTQLLLNGSLAEGEIIIRSENITNNVKTIIVHLNESVKIECTRPNNKTRTSIRIGPGQWFYATGQVIGDIREAYCNINESKWNETLQRVSKKLKEYFPHKNITFQPSSGGDLEITTHSFNCGGEFFYCNTSSLFNRTYMANSTDMANSTETNSTRTITIHCRIKQIINMWQEVGRAMYAPPIAGNITCISNITGLLLTRDGGKNNTETFRPGGGNMKDNWRSELYKYKVVKIEPLGVAPTRCKRRVV 487 T 1.2E-52 GP120 pdbpssm T Viruses T 6vy8 1 A A SFTI1_HELAN Trypsin inhibitor GLY-ARG-RVJ-THR-LYS-SER-ILE-PRO-PRO-ILE-2AG-PHE-PRO-ASP GRXTKSIPPIXFPD 14 T 0.052 Bowman-Birk_leg unp F Eukaryota T 6vzi 4 D G A0A0N9FF17_9HIV1 ENV POLYPROTEIN GLWVTVYYGVPVWREAKTTLFCASDAKSYEKEVHNVWATHACVPTDPNPQELVLENVTENFNMWKNDMVDQMHEDIISLWDQSLKPCVKLTPLCVTLNCSDAKVNATYKGTREEIKNCSFNATTELRDKKRREYALFYRLDIVPLSGEGNNNSEYRLINCNTSVITQICPKVTFDPIPIHYCAPAGYAILKCNNKTFNGTGPCNNVSTVQCTHGIKPVVSTQLLLNGSLAEEEIIIRSENLTDNVKTIIVHLNESVEITCTRPNNMTRKSVRIGPGQTFYALGDIIGDIRQPHCNISEIKWEKTLQRVSEKLREHFNKTIIFNQSSGGDLEITTHSFNCGGEFFYCNTSDLFFNKTFNETYSTGSNSTNSTITLPCRIKQIINMWQEVGRAMYAPPIAGNITCKSNITGLLLTRDGGGNNSTKETFRPGGGNMRDNWRSELYKYKVVEVKPLGIAPTECRRRVVQRRRRRR 471 T 3.2999999999999996E-53 GP120 pdbpssm T Viruses T 6vzr 2 E F TTLL6 unregistered chain XXXXXXXXXXXX 12 F F F 6vzr 3 F G TTLL6 unregistered chain XXXXXXXXXXX 11 F F F 6vzt 2 C D TTLL6 unregistered chain XXXXXXXXXXX 11 F F F 6vzu 2 E M TTLL6 unregistered chain XXXXXXXXXXXXX 13 F F F 6vzu 3 F N TTLL6 unregistered chain XXXXXXXXXXXXXXX 15 F F F 6vzv 2 E I TTLL6 unregistered chain XXXXXXXXXXXXXXX 15 F F F 6vzv 3 F J TTLL6 unregistered chain XXXXXXXXXXXXXXXX 16 F F F 6vzx 1 A,B,C A,B,C collagen mimetic peptide XPPGPPGPPGPKGEPGPPGPPGPPGX 26 T 0.00013 Collagen pdbpssm F F 6w00 4 D P CSP_PLAFA NPNA2 peptide XNPNANPNA 9 T 3.2 PT unppercent F Eukaryota F 6w03 4 D G Q2N0S6_9HIV1 ENV POLYPROTEIN AENLWVTVYYGVPVWKDAETTLFCASDAKAYETEKHNVWATHACVPTDPNPQEIHLENVTEEFNMWKNNMVEQMHTDIISLWDQSLKPCVKLTPLCVTLQCTNVTNAITDDMRGELKNCSFNMTTELRDKKQKVYSLFYRLDVVQINENQGNRSNNSNKEYRLINCNTSACTQACPKVSFEPIPIHYCAPAGFAILKCKDKKFNGTGPCPSVSTVQCTHGIKPVVSTQLLLNGSLAEEEVMIRSENITNNAKNILVQFNTPVQINCTRPNNMTRKSIRIGPGQAFYALGDIIGDIRQPHCNVSKATWNETLGKVVKQLRKHFGNNTIIRFANSSGGDLEVTTHSFNCGGEFFYCNTSGLFNSTWISNTSVQGSNSTGSNDSITLPCRIKQIINMWQRIGQCMYAPPIQGVIRCVSNITGLILTRDGGSTNSTTETFRPGGGDMRDNWRSELYKYKVVKIEPLGVAPTRCKRRVVGRRRRRR 481 T 3.4E-54 GP120 pdbpercent T Viruses T 6w05 3 C P CSP_PLAFA NPNA2 peptide NPNANPNA 8 T 3 Cas_Cas7 pdbhh F Eukaryota F 6w0l 2 B P W_NIPAV Phosphorylated W peptide ARVSMRRMSN 10 T 1.3 Paramyxo_P_V_N unphh T Viruses T 6w0v 1 A A A0A1P8L021_PSEAI PYS8 DEPGVATGNGQPVTGNWLAGASQGDGVPIPSQIADQLRGKEFKSWRDFREQFWVAVANDPELVKYFRKTNAKGMRDGLSPFTPKAEQAGGRDKYAIHHVVQISQGGAVYDIDNLRVMTPKMHIQV 125 T 0.0052 HNH pdbpssm F Bacteria T 6w17 10 M,N,O,P,Q M,N,O,P,Q PHAD1_AMAPH Phalloidin PAWXAXC 7 T 2.7 CSN7a_helixI pdbhh F Eukaryota F 6w1s 1 A A Mediator of RNA polymerase II transcription subunit 1 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 382 F F F 6w1x 6 K,L I,J C0AVY5_9GAMM anti-CRISPR AcrIF9 MKSTYIIKEVQNINSDREGVKVETTSLTSAKRIASKNQFFHGTVLRIESESGNWLAYKEDGKRWIECE 68 T 0.15 SurA_N pdb F Bacteria T 6w1z 3 U X RepA, green fluorescent protein fusion XXXXXXXXXXXXXXXXXXXXXXXX 24 F F F 6w20 3 U X RepA, green fluorescent protein fusion XXXXXXXXXXXXXXXXXXXXXXXX 24 F F F 6w21 2 G X RepA, green fluorescent protein fusion XXXXXXXXXXXXXXXXXXXXXXXX 24 F F F 6w22 2 G X RepA, green fluorescent protein fusion XXXXXXXXXXXXXXXXXXXXXXXX 24 F F F 6w23 2 G X RepA, green fluorescent protein fusion XXXXXXXXXXXXXXXXXXXXXXXX 24 F F F 6w24 2 G X RepA, green fluorescent protein fusion XXXXXXXXXXXXXXXXXXXXXXXX 24 F F F 6w25 2 B B SHU9119 XXDHXRWKX 9 T 23 ACTH_domain pdbhh F T 6w2r 1 A,B,C,D A,B,C,D Junction 19 DHR54-DHR79 MGTTEDERRELEKVARKAIEAAREGNTDEVREQLQRALEIARESGTKTAVKLALDVALRVAQEAAKRGNKDAIDEAAEVVVRIAEESNNSDALEQALRVLEEIAKAVLKSEKTEDAKKAVKLVQEAYKAAQRAIEAAKRTGTPDVIKLAIKLAKLAARAALEVIKRPKSEEVNEALKKIVKAIQEAVESLREAEESGDPEKREKARERVREAVERAEEVQRDPSSGWLEHHHHHH 235 T 0.0008 SPO22 pdb F T 6w3j 3 C C CE192_HUMAN CEP192/SPD-2 IDDEMFYDDHLEAYFEQLAIPG 22 T 3.2 SUB1_ProdP9 pdbhh F Eukaryota T 6w5c 1 A A Cas12i MSNKEKNASETRKAYTTKMIPRSHDRMKLLGNFMDYLMDGTPIFFELWNQFGGGIDRDIISGTANKDKISDDLLLAVNWFKVMPINSKPQGVSPSNLANLFQQYSGSEPDIQAQEYFASNFDTEKHQWKDMRVEYERLLAELQLSRSDMHHDLKLMYKEKCIGLSLSTAHYITSVMFGTGAKNNRQTKHQFYSKVIQLLEESTQINSVEQLASIILKAGDCDSYRKLRIRCSRKGATPSILKIVQDYELGTNHDDEVNVPSLIANLKEKLGRFEYECEWKCMEKIKAFLASKVGPYYLGSYSAMLENALSPIKGMTTKNCKFVLKQIDAKNDIKYENEPFGKIVEGFFDSPYFESDTNVKWVLHPHHIGESNIKTLWEDLNAIHSKYEEDIASLSEDKKEKRIKVYQGDVCQTINTYCEEVGKEAKTPLVQLLRYLYSRKDDIAVDKIIDGITFLSKKHKVEKQKINPVIQKYPSFNFGNNSKLLGKIISPKDKLKHNLKCNRNQVDNYIWIEIKVLNTKTMRWEKHHYALSSTRFLEEVYYPATSENPPDALAARFRTKTNGYEGKPALSAEQIEQIRSAPVGLRKVKKRQMRLEAARQQNLLPRYTWGKDFNINICKRGNNFEVTLATKVKKKKEKNYKVVLGYDANIVRKNTYAAIEAHANGDGVIDYNDLPVKPIESGFVTVESQVRDKSYDQLSYNGVKLLYCKPHVESRRSFLEKYNGTMKDNRGNNIQIDFMKDFEAIADDETSLYYFNMKYCKLLQSSIRNHSSQAKEYREEIFELLRDGKLSVLKLSSLSNLSFVMFKVAKSLIGTYFGHLLKKPKNSKSDVKAPPITDEDKQKADPEMFALRLALEEKRLNKVKSKKEVIANKIVAKALELRDKYGPVLIKGENISDTTKKGKKSSTNSFLMDWLARGVANKVKEMVMMHQGLEFVEVNPNFTSHQDPFVHKNPENTFRARYSRCTPSELTEKNRKEILSFLSDKPSKRPTNAYYNEGAMAFLATYGLKKNDVLGVSLEKFKQIMANILHQRSEDQLLFPSRGGMFYLATYKLDADATSVNWNGKQFWVCNADLVAAYNVGLVDIQKDFKKK 1092 T 0.37 DUF1910 pdbpercent F T 6w62 1 A A Cas12i MSNKEKNASETRKAYTTKMIPRSHDRMKLLGNFMDYLMDGTPIFFELWNQFGGGIDRDIISGTANKDKISDDLLLAVNWFKVMPINSKPQGVSPSNLANLFQQYSGSEPDIQAQEYFASNFDTEKHQWKDMRVEYERLLAELQLSRSDMHHDLKLMYKEKCIGLSLSTAHYITSVMFGTGAKNNRQTKHQFYSKVIQLLEESTQINSVEQLASIILKAGDCDSYRKLRIRCSRKGATPSILKIVQDYELGTNHDDEVNVPSLIANLKEKLGRFEYECEWKCMEKIKAFLASKVGPYYLGSYSAMLENALSPIKGMTTKNCKFVLKQIDAKNDIKYENEPFGKIVEGFFDSPYFESDTNVKWVLHPHHIGESNIKTLWEDLNAIHSKYEEDIASLSEDKKEKRIKVYQGDVCQTINTYCEEVGKEAKTPLVQLLRYLYSRKDDIAVDKIIDGITFLSKKHKVEKQKINPVIQKYPSFNFGNNSKLLGKIISPKDKLKHNLKCNRNQVDNYIWIEIKVLNTKTMRWEKHHYALSSTRFLEEVYYPATSENPPDALAARFRTKTNGYEGKPALSAEQIEQIRSAPVGLRKVKKRQMRLEAARQQNLLPRYTWGKDFNINICKRGNNFEVTLATKVKKKKEKNYKVVLGYDANIVRKNTYAAIEAHANGDGVIDYNDLPVKPIESGFVTVESQVRDKSYDQLSYNGVKLLYCKPHVESRRSFLEKYRNGTMKDNRGNNIQIDFMKDFEAIADDETSLYYFNMKYCKLLQSSIRNHSSQAKEYREEIFELLRDGKLSVLKLSSLSNLSFVMFKVAKSLIGTYFGHLLKKPKNSKSDVKAPPITDEDKQKADPEMFALRLALEEKRLNKVKSKKEVIANKIVAKALELRDKYGPVLIKGENISDTTKKGKKSSTNSFLMDWLARGVANKVKEMVMMHQGLEFVEVNPNFTSHQDPFVHKNPENTFRARYSRCTPSELTEKNRKEILSFLSDKPSKRPTNAYYNEGAMAFLATYGLKKNDVLGVSLEKFKQIMANILHQRSEDQLLFPSRGGMFYLATYKLDADATSVNWNGKQFWVCNADLVAAYNVGLVDIQKDFKKK 1093 T 0.37 DUF1910 pdbpercent F T 6w64 1 A A Cas12i MSNKEKNASETRKAYTTKMIPRSHDRMKLLGNFMDYLMDGTPIFFELWNQFGGGIDRDIISGTANKDKISDDLLLAVNWFKVMPINSKPQGVSPSNLANLFQQYSGSEPDIQAQEYFASNFDTEKHQWKDMRVEYERLLAELQLSRSDMHHDLKLMYKEKCIGLSLSTAHYITSVMFGTGAKNNRQTKHQFYSKVIQLLEESTQINSVEQLASIILKAGDCDSYRKLRIRCSRKGATPSILKIVQDYELGTNHDDEVNVPSLIANLKEKLGRFEYECEWKCMEKIKAFLASKVGPYYLGSYSAMLENALSPIKGMTTKNCKFVLKQIDAKNDIKYENEPFGKIVEGFFDSPYFESDTNVKWVLHPHHIGESNIKTLWEDLNAIHSKYEEDIASLSEDKKEKRIKVYQGDVCQTINTYCEEVGKEAKTPLVQLLRYLYSRKDDIAVDKIIDGITFLSKKHKVEKQKINPVIQKYPSFNFGNNSKLLGKIISPKDKLKHNLKCNRNQVDNYIWIEIKVLNTKTMRWEKHHYALSSTRFLEEVYYPATSENPPDALAARFRTKTNGYEGKPALSAEQIEQIRSAPVGLRKVKKRQMRLEAARQQNLLPRYTWGKDFNINICKRGNNFEVTLATKVKKKKEKNYKVVLGYDANIVRKNTYAAIEAHANGDGVIDYNDLPVKPIESGFVTVESQVRDKSYDQLSYNGVKLLYCKPHVESRRSFLEKYNGTMKDNRGNNIQIDFMKDFEAIADDETSLYYFNMKYCKLLQSSIRNHSSQAKEYREEIFELLRDGKLSVLKLSSLSNLSFVMFKVAKSLIGTYFGHLLKKPKNSKSDVKAPPITDEDKQKADPEMFALRLALEEKRLNKVKSKKEVIANKIVAKALELRDKYGPVLIKGENISDTTKKGKKSSTNSFLMDWLARGVANKVKEMVMMHQGLEFVEVNPNFTSHQDPFVHKNPENTFRARYSRCTPSELTEKNRKEILSFLSDKPSKRPTNAYYNEGAMAFLATYGLKKNDVLGVSLEKFKQIMANILHQRSEDQLLFPSRGGMFYLATYKLDADATSVNWNGKQFWVCNADLVAAYNVGLVDIQKDFKKK 1092 T 0.37 DUF1910 pdbpercent F T 6w6e 3 H N Substrate XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 33 F F F 6w6g 2 G N Substrate XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 33 F F F 6w6h 2 G N Substrate XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 31 F F F 6w6i 2 G N Substrate XXXXXXXXXXXXXXXXXXXXXXXXXXXXX 29 F F F 6w6j 2 G N Substrate XXXXXXXXXXXXXXXXXXXXXXXXXXXXX 29 F F F 6w6l 47 UA y Nascent chain mixture XXXXXXXXXXXXXXXXXXXXXXXXXXX 27 F F F 6w6x 1 A,B A,B De novo designed ABLE protein SVKSEYAEAAAVGQEAVAVFNTMKAAFQNGDKEAVAQYLARLASLYTRHEELLNRILEKARREGNKEAVTLMNEFTATFQTGKSIFNAMVAAFKNGDDDSFESYLQALEKVTAKGETLADQIAKAL 126 T 0.028 ASD2 pdb F T 6w70 1 A,B A,C De novo designed ABLE SVKSEYAEAAAVGQEAVAVFNTMKAAFQNGDKEAVAQYLARLASLYTRHEELLNRILEKARREGNKEAVTLMNEFTATFQTGKSIFNAMVAAFKNGDDDSFESYLQALEKVTAKGETLADQIAKAL 126 T 0.028 ASD2 pdb F T 6w8u 1 A,AA,B,BA,C,CA,D,DA,E,EA,F,FA,G,GA,H,HA,I,IA,J,JA,K,KA,L,LA,M,MA,N,NA,O,OA,P,Q,R,S,T,U,V,W,X,Y,Z A,a,B,b,C,c,D,d,E,e,F,f,G,g,H,h,I,i,J,j,K,k,L,l,M,m,N,n,O,o,P,Q,R,S,T,U,V,W,X,Y,Z A4WH64_PYRAR pilin MTSLEIAIIVAIVLVIAIAVGWYLYTTFAAAGQQTGLTATKATIYVTKDGNVYLNVTLVPQGAAQVAISSIEVAGVSIPCTSSNLVKAPGEYVIELSSVSVSVGQVLTGRIVLASGAISPFTATVVAADHVPSTENKLCSSQ 142 T 1.8E-05 DUF973 unphh F Archaea T 6w9k 2 B B PRGC1_HUMAN PPARGC-1-ALPHA,LIGAND EFFECT MODULATOR 6 SLLKKLLLAP 10 T 30 PHtD_u1 pdbhh F Eukaryota F 6w9l 2 B B PRGC1_HUMAN PPARGC-1-ALPHA,LIGAND EFFECT MODULATOR 6 PSLLKKLLLAPA 12 T 13 Neurokinin_B pdbhh F Eukaryota F 6w9m 2 B B NR0B2_HUMAN ORPHAN NUCLEAR RECEPTOR SHP, SMALL HETERODIMER PARTNER RPAILYALLSS 11 T 7.3 GTA_holin_3TM pdbhh F Eukaryota T 6w9s 1 A A B1EF49_ESCAT OTU domain-containing protein EschOTU SSPQSVFSDSVSSSRLELKKQIIKALDLDYWQGSGGEIMPLVLIDFYKRHNININIYLNHCKVNNFDKKAINLINAGNHYNALTMNSRGNIERIDVPGDGNCLYHAVVKSHQITRKPKPYGNELQKDKPEWCILKESLKTHFDKDFDQFVEQVKCILISENTHEANKILDKVAQYSGVK 179 T 0.0021 OTU pdbhh F Bacteria T 6w9y 1 A,B,C,D A,B,C,D De novo designed receptor transmembrane domain proMP 1.2 EPELLFILVAILGGLFGAIVAFLLALRRLX 30 T 0.08 DUF1294 pdb F T 6w9z 1 A,B,C,D,E,F A,B,C,D,E,F De novo designed receptor transmembrane domain ProMP C2.1 EPELTVALILGIFLGTFIAFWVVYLLRRLX 30 T 0.12 YtxH pdbhh F T 6wa0 1 A,B,C,D,E,F,G,H,I A,B,C,D,E,F,G,H,I De novo designed receptor transmembrane domain proMP C3.1 EPETALLVAFVAYYTALIALIFAILATRRLX 31 T 7.1 MSA_2 pdbhh F T 6wa1 1 A,B A,B Q81AN8_BACCR Hemolysin II DNQKALEEQMNSINSVNDKLNKGKGKLSLSMNGNQLKATSSNAGYGISYEDKNWGIFVNGEKVYTFNEKSTVGNISNDINKLNIKGMYIEIKQI 94 T 0.0097 Gal-bind_lectin pdbpercent F Bacteria T 6wb3 1 A,B A,B SEPT4_HUMAN APOPTOSIS-RELATED PROTEIN IN THE TGF-BETA SIGNALING PATHWAY,ARTS,BRADEION BETA,BRAIN PROTEIN H5,CE5B3 BETA,CELL DIVISION CONTROL-RELATED PROTEIN 2,HCDCREL-2,CEREBRAL PROTEIN 7,PEANUT-LIKE PROTEIN 2 XETEKLIREKDEELRRMQEMLHKIQKQMKENX 32 T 0.097 AAA_23 unp F Eukaryota T 6wb9 1 A 0 EMC10_YEAST Endoplasmic reticulum membrane protein complex subunit 10 MLVRLLRVILLASMVFCADILQLSYSDDAKDAIPLGTFEIDSTSDGNVTVTTVNIQDVEVSGEYCLNAQIEGKLDMPCFSYMKLRTPLKYDLIVDVDEDNEVKQVSLSYDETNDAITATVRYPEAGPTAPVTKLKKKTKTYADKKASKNKDGSTAQFEEDEEVKEVSWFQKNWKMLLLGLLIYNFVAGSAKKQQQGGAGADQKTE 205 T 0.039 PFU pdb F Eukaryota T 6wba 2 B,C C,B MLH1_HUMAN MUTL PROTEIN HOMOLOG 1 SSNPAKRHRED 11 T 2.9 T_cell_tran_alt pdbhh F Eukaryota T 6wbb 2 B,C C,B MLH1_HUMAN MUTL PROTEIN HOMOLOG 1 SSNPRKRHRAD 11 T 8.6 DUF5592 pdbhh F Eukaryota T 6wbc 2 B B MLH1_HUMAN MUTL PROTEIN HOMOLOG 1 SSNPRKKHRED 11 T 2.3 T_cell_tran_alt pdbhh F Eukaryota T 6wbe 1 A,B,C,D A,B,C,D SEPT1_HUMAN LARP,PEANUT-LIKE PROTEIN 3,SEROLOGICALLY DEFINED BREAST CANCER ANTIGEN NY-BR-24 DTEKLIREKDEELRRMQEMLEKMQAQMQQS 30 T 0.04 HHV-5_US34A pdb F Eukaryota T 6wc3 2 B B Q753G8_ASHGO AFR344CP MGSMPYATQLALLQDELLDMLEPRDGEGLRTADIIDKTLRFRELLGCYRLQVEKSTRQLELPRQVRTAAALRGAHAPASQAPALAQLLLWERFLADYRRRLDAAIVHEHEATAARQLQARPTAARAPRAPMTAKDRLLA 139 T 0.12 MRP-S31 unppercent F Eukaryota T 6wc6 3 C C KDM1A_HUMAN LYSINE-SPECIFIC HISTONE DEMETHYLASE 1A SEEERNAKAEKEKKL 15 T 3 NDUF_B8 pdbhh F Eukaryota T 6wcu 1 A,B A,B SEPT5_HUMAN CELL DIVISION CONTROL-RELATED PROTEIN 1,CDCREL-1,PEANUT-LIKE PROTEIN 1 ETEKLIRMKDEELRRMQEMLQRMKQQMQDQ 30 T 0.046 IL32 pdbpssm F Eukaryota T 6wdp 1 A A I12R1_HUMAN IL-12RB1,IL-12 RECEPTOR BETA COMPONENT SECCFQDPPYPDADSGSASGPRDLRCYRISSDRYECSWQYEGPTAGVSHFLRCCLSSGRCCYFAAGSATRLQFSDQAGVSVLYTVTLWVESWARNQTEKSPEVTLQLYNSVKYEPPLGDIKVSKLAGQLRMEWETPDNQVGAEVQFRHRTPSSPWKLGDCGPQDDDTESCLCPLEMNVAQEFQLRRRQLGSQGSSWSKWSSPVCVPPENPPQPHHHHHH 219 T 6.2E-05 LIFR_D2 pdbhh F Eukaryota T 6wdr 1 A A RSSA1_YEAST NUCLEIC ACID-BINDING PROTEIN NAB1A,SMALL RIBOSOMAL SUBUNIT PROTEIN US2-A SLPATFDLTPEDAQLLLAANTHLGARNVQVHQEPYVFNARPDGVHVINVGKTWEKLVLAARIIAAIPNPEDVVAISSRTFGQRAVLKFAAHTGATPIAGRFTPGSFTNYITRSFKEPRLVIVTDPRSDAQAIKEASYVNIPVIALTDLDSPSEFVDVAIPCNNRGKHSIGLIWYLLAREVLRLRGALVDRTQPWSIMPDLYFYRFP 206 T 2.9E-13 Ribosomal_S2 pdb F Eukaryota T 6weg 3 E P Q5NHR4_FRATT Peptide KRNVFSRCWINMNLYSVIKAKS 22 T 0.92 XTBD pdbhh F Bacteria T 6wes 1 A A C5IAW5_PHANO Tox3 YIKANDINFGTRSVHDCRERTGIQRDVKVRADIPFETDDGPNQVLRVTWSNALNVDRFDPLPIVTVPGNAASTTITAIHDFCLMNPTTSPPTRCLYQLRQPFTLGFDRTRMHNNIYLTPPNPQRPTMHEVCIRADECPAGRVFLECSTRTYGAIPRGE 158 T 0.077 DUF6286 unppercent F Eukaryota T 6wf3 2 G,H,I,J,K,L G,H,I,J,K,L ACE-MET-LEU-GLY-PRO-NH2 XMLGPX 6 T 250 Rrp40_N pdbhh F F 6wf5 2 C,D C,D ACE-MET-LEU-GLY-PRO-NH2 XMLGPX 6 T 250 Rrp40_N pdbhh F F 6wfw 4 D P CSP_PLAFA NPNA2 peptide NPNANPNA 8 T 3 Cas_Cas7 pdbhh F Eukaryota F 6wfx 3 C P CSP_PLAFA NPNA2 peptide XNPNANPNAX 10 T 3.2 PT unppercent F Eukaryota F 6wfy 3 C P CSP_PLAFA NPNA4 peptide XNPNANPNANPNANPNAX 18 T 3.2 PT unppercent F Eukaryota F 6wfz 3 C,F C,P CSP_PLAFA NPNA3 peptide XNPNANPNANPNAX 14 T 1.9 Cas_Cas7 pdbhh F Eukaryota F 6wg0 3 C P CSP_PLAFA NPNA3 peptide NPNANPNANPN 11 T 0.89 Cas_Cas7 pdbhh F Eukaryota F 6wg1 3 E C CSP_PLAFA NPNA6 peptide XNPNANPNANPNANPNANPNANPNAX 26 T 3.2 PT unppercent F Eukaryota F 6wg2 3 E P CSP_PLAFA NPNA4 peptide XNPNANPNANPNANPNAX 18 T 3.2 PT unppercent F Eukaryota F 6wgg 1 A 8 CDT1_YEAST SIC1 INDISPENSABLE PROTEIN 2,TOPOISOMERASE-A HYPERSENSITIVE PROTEIN 11 MSGTANSRRKEVLRVPVIDLNRVSDEEQLLPVVRAILLQHDTFLLKNYANKAVLDALLAGLTTKDLPDTSQGFDANFTGTLPLEDDVWLEQYIFDTDPQLRFDRKCRNESLCSIYSRLFKLGLFFAQLCVKSVVSSAELQDCISTSHYATKLTRYFNDNGSTHDGADAGATVLPTGDDFQYLFERDYVTFLPTGVLTIFPCAKAIRYKPSTMATTDNSWVSIDEPDCLLFHTGTLLARWSQGMHTTSPLQIDPRANIVSLTIWPPLTTPISSKGEGTIANHLLEQQIKAFPKVAQQYYPRELSILRLQDAMKFVKELFTVCETVLSLNALSRSTGVPPELHVLLPQISSMMKRKIVQDDILKLLTIWSDAYVVELNSRGELTMNLPKRDNLTTLTNKSRTLAFVERAESWYQQVIASKDEIMTDVPAFKINKRRSSSNSKTVLSSKVQTKSSNANALNNSRYLANSKENFMYKEKMPDSQANLMDRLRERERRSAALLSQRQKRYQQFLAMKMTQVFDILFSLTRGQPYTETYLSSLIVDSLQDSNNPIGTKEASEILAGLQGILPMDISVHQVDGGLKVYRWNSLDKNRFSKLLQIHKSKQQD 604 T 1.3E-07 CDT1 pdbhh F Eukaryota T 6wgi 14 N L CDT1_YEAST SIC1 INDISPENSABLE PROTEIN 2,TOPOISOMERASE-A HYPERSENSITIVE PROTEIN 11 MSGTANSRRKEVLRVPVIDLNRVSDEEQLLPVVRAILLQHDTFLLKNYANKAVLDALLAGLTTKDLPDTSQGFDANFTGTLPLEDDVWLEQYIFDTDPQLRFDRKCRNESLCSIYSRLFKLGLFFAQLCVKSVVSSAELQDCISTSHYATKLTRYFNDNGSTHDGADAGATVLPTGDDFQYLFERDYVTFLPTGVLTIFPCAKAIRYKPSTMATTDNSWVSIDEPDCLLFHTGTLLARWSQGMHTTSPLQIDPRANIVSLTIWPPLTTPISSKGEGTIANHLLEQQIKAFPKVAQQYYPRELSILRLQDAMKFVKELFTVCETVLSLNALSRSTGVPPELHVLLPQISSMMKRKIVQDDILKLLTIWSDAYVVELNSRGELTMNLPKRDNLTTLTNKSRTLAFVERAESWYQQVIASKDEIMTDVPAFKINKRRSSSNSKTVLSSKVQTKSSNANALNNSRYLANSKENFMYKEKMPDSQANLMDRLRERERRSAALLSQRQKRYQQFLAMKMTQVFDILFSLTRGQPYTETYLSSLIVDSLQDSNNPIGTKEASEILAGLQGILPMDISVHQVDGGLKVYRWNSLDKNRFSKLLQIHKSKQQD 604 T 1.3E-07 CDT1 pdbhh F Eukaryota T 6wgn 2 D,E,F E,F,G Cyclic Peptide KD2 GXFVNFRNFRTFRCG 15 T 9.3 Bac_DNA_binding pdbhh F T 6wh3 1 A,AA,AB,B,BA,BB,C,CA,CB,D,DA,DB,E,EA,EB,F,FA,FB,G,GA,GB,H,HA,HB,I,IA,J,JA,K,KA,L,LA,M,MA,N,NA,O,OA,P,PA,Q,QA,R,RA,S,SA,T,TA,U,UA,V,VA,W,WA,X,XA,Y,YA,Z,ZA A,8,u,B,7,v,C,6,w,D,5,x,E,4,y,F,3,z,G,a,1,H,b,2,I,c,J,d,K,e,L,f,M,g,N,h,O,i,P,j,Q,k,R,l,S,m,T,n,U,o,V,p,W,q,X,r,Y,s,Z,t Penaeus monodon metallodensovirus major capsid protein MSDEVSSSTDVVSRKRRRHDEGGKALEDIAVHGASEGDGSAPGGSVWQTTDYIALSMVVYRTAIKLRNFVNIRGLTPTEMIVIPWNVMRFYCEYNTGTYGLSGNVHHKNYSMLLACKAHRPTKVGYTLSNLILTSDELVSTGGTLGTTTTFNTSPYMIHSIDDQQCLSKVYPKTDTVWPVSSMRELDYVASTVSGDNAIIPSTIFNKNRYWKQGDDALHFSHDLDLGFWFGSDYGNAYVPQNNDSMNAVGTIPTSKHINVRGVNNRGMAGHYLSFPPIRTNDGQFKLNAQFTLETEIEFEFRLWEQGVQGINSVHTNLNPANDSLWIQSYGSLVSITESKINNIQFGPTCPRVDARNKGGKMSMLFDHH 369 T 29 DUF4752 pdbhh F T 6wh7 1 A,AA,AB,B,BA,BB,C,CA,CB,D,DA,DB,E,EA,EB,F,FA,FB,G,GA,GB,H,HA,HB,I,IA,J,JA,K,KA,L,LA,M,MA,N,NA,O,OA,P,PA,Q,QA,R,RA,S,SA,T,TA,U,UA,V,VA,W,WA,X,XA,Y,YA,Z,ZA A,1,u,B,2,v,C,3,w,D,4,x,E,5,y,F,6,z,G,a,7,H,b,8,I,c,J,d,K,e,L,f,M,g,N,h,O,i,P,j,Q,k,R,l,S,m,T,n,U,o,V,p,W,q,X,r,Y,s,Z,t Penaeus monodon metallodensovirus major capsid protein MSDEVSSSTDVVSRKRRRHDEGGKALEDIAVHGASEGDGSAPGGSVWQTTDYIALSMVVYRTAIKLRNFVNIRGLTPTEMIVIPWNVMRFYCEYNTGTYGLSGNVHHKNYSMLLACKAHRPTKVGYTLSNLILTSDELVSTGGTLGTTTTFNTSPYMIHSIDDQQCLSKVYPKTDTVWPVSSMRELDYVASTVSGDNAIIPSTIFNKNRYWKQGDDALHFSHDLDLGFWFGSDYGNAYVPQNNDSMNAVGTIPTSKHINVRGVNNRGMAGHYLSFPPIRTNDGQFKLNAQFTLETEIEFEFRLWEQGVQGINSVHTNLNPANDSLWIQSYGSLVSITESKINNIQFGPTCPRVDARNKGGKMSMLFDHH 369 T 29 DUF4752 pdbhh F T 6wh8 2 C,D C,D 4HP-PRO-LYS-ARG-NH2, BM-30 XPKRX 5 T 380 DUF2002 pdbhh F F 6whi 6 K,L I,J C0AVY5_9GAMM anti-CRISPR AcrIF9 MKSTYIIKEVQNINSDREGVKVETTSLTSAKRIASKNQFFHGTVLRIESESGNWLAYKEDGKRWIECE 68 T 0.15 SurA_N pdb F Bacteria T 6whn 2 D,E,F F,G,H U2M-ASN-PRO-LYS-GLN-DLY-TRP-GLY peptide macrocycle XNPKQXWG 8 T 1.7 Pox_A28 pdbhh F T 6who 2 D,E,F F,G,H U2M-ASN-PRO-GLU-GLN-DLY-TRP-GLY peptide macrocycle XNPEQXWG 8 T 3.1 DUF2340 pdbhh F T 6whq 2 D,E,F F,G,H U2M-ASN-PRO-GLU-GLN-DLY-TRP-GLY peptide macrocycle XNPEQXWG 8 T 3.1 DUF2340 pdbhh F T 6whz 2 D,E,F D,F,G U2M-ASN-HYP-LYS-GLN-DLY-TRP-GLY peptide macrocycle XNPKQXWG 8 T 1.7 Pox_A28 pdbhh F T 6wi3 2 D,E,F F,G,H (SHA)W(DTH)DN(DSN)(DME)(DAS)K peptide macrocycle XWXDNXXXK 9 T 9.3 Galanin pdbhh F F 6wi4 2 C C Caspase-3 KLFSFGG 7 T 1.8 RNA_pol_RpbG pdbhh F F 6wi4 3 D,E D,E ACE-DEVD inhibitor XDEVD 5 T 140 zf-NPL4 pdbhh F F 6win 1 A A Q8Z969_SALTI Type 6 secretion amidase effector 2 APYVYANAKALQDTEKVGNHHQCVELIQHYIRVGQASTWQQGAAVFGNKNIEVGTVIATFVNGRYPNHNSGNHAAFFLGQDTGGIWVMDQWKDDIAKPRVSKRYIRKLHNGSVRSDGTYIRMSNNAEAYFIVELEHHHHH 140 T 0.95 GATA unppssm F Bacteria T 6wj2 8 H H S38A9_HUMAN SOLUTE CARRIER FAMILY 38 MEMBER 9,UP-REGULATED IN LUNG CANCER 11 GGTMANMNSDSRHLGTSEVDHERDPGPMNIQFEPSDLRSKRPFCIEPTNIVNVNHVIQRVSDHASAMNKRIHYYSRLTTPADKALIAPDHVVPAPEECYVYSPLGSAYKLQSYTEGYGKNTS 122 T 37 CoV_NSP4_C pdbhh F Eukaryota T 6wj3 8 H H S38A9_HUMAN SOLUTE CARRIER FAMILY 38 MEMBER 9,UP-REGULATED IN LUNG CANCER 11 GGTMANMNSDSRHLGTSEVDHERDPGPMNIQFEPSDLRSKRPFCIEPTNIVNVNHVIQRVSDHASAMNKRIHYYSRLTTPADKALIAPDHVVPAPEECYVYSPLGSAYKLQSYTEGYGKNTS 122 T 37 CoV_NSP4_C pdbhh F Eukaryota T 6wj7 2 B A AN6-GLZ-PRO-LYS-ARG-ILE-ALA-NH2 XXPKRIAX 8 T 3.8 HIRA_B pdbhh F T 6wjq 2 C,D C,D PDPK1_HUMAN HPDK1 XARTTSQLYDAVPIQSX 17 T 3.7 HHA pdbhh F Eukaryota T 6wkk 2 G,H,I,J,K,L,M,N,O,P,Q,R,S,T,U,V,W,X G,H,I,J,K,L,M,N,O,P,Q,R,S,T,U,V,W,X G3MB96_9CAUD Gp26 capsid decoration protein PYVRLGYEGILNGAHDIDVAGLNGVEQLAGKFATIGANGVKLAGDNGTNAVGLFREDLGDMVNASEKASFYFRGGEYYVNISRTSLTAAGIAAGDEITCDADGKMIKFTGTGKALGVVTHVGEYRAGNMYEKATQGVTDTDTFIGFIMYV 150 T 0.082 DUF4265 pdb T Viruses T 6wkr 6 F,R B,E JARD2_HUMAN JUMONJI/ARID DOMAIN-CONTAINING PROTEIN 2 MSKERPKRNIIQKKYDDSDGIPWSEERVVRKVLYLSLKEFKNSQKRQHAEGIAGSLKTVNGLLGNDQSKGLGPASEQSENEKDDASQVSSTSNDVSSSDFEEGPSRKRPRLQAQRKFAQSQPNSPSTTPVKIVEPLLPPPATQISDLSKRKPKTEDFLTFLCLRGSPALPNSMVYFGSSQDEEEVEEEDDETEDVKTATNNASSSCQSTPRKGKTHKHVHNGHVFNGSSRSTREKEPVQKHKSKEATPAKEKHSDHRADSRREQASANHPAAAPSTGSSAKGLAATHHHPPLHRSAQDLRKQVSKVNGVTRMSSLGAGVTSAKKMREVRPSPSKTVKYTATVTKGAVTYTKAKRELVKDTKPNHHKPSSAVNHTISGKTESSNAKTRKQVLSLGGASKSTGPAVNGLKVSGRLNPKSCTKEVGGRQLREGLQLREGLRNSKRRLEEAHQA 450 T 0.11 Actin_micro pdb F Eukaryota T 6wkx 1 A,AA,AB,AC,AD,B,BA,BB,BC,C,CA,CB,CC,D,DA,DB,DC,E,EA,EB,EC,F,FA,FB,FC,G,GA,GB,GC,H,HA,HB,HC,I,IA,IB,IC,J,JA,JB,JC,K,KA,KB,KC,L,LA,LB,LC,M,MA,MB,MC,N,NA,NB,NC,O,OA,OB,OC,P,PA,PB,PC,Q,QA,QB,QC,R,RA,RB,RC,S,SA,SB,SC,T,TA,TB,TC,U,UA,UB,UC,V,VA,VB,VC,W,WA,WB,WC,X,XA,XB,XC,Y,YA,YB,YC,Z,ZA,ZB,ZC A,p,AA,VA,qA,V,q,BA,WA,W,r,CA,Q,X,s,L,XA,Y,G,DA,YA,B,t,EA,ZA,Z,u,FA,aA,a,v,GA,R,b,w,M,bA,c,H,HA,cA,C,x,IA,dA,d,y,JA,eA,e,z,KA,S,f,0,N,fA,g,I,LA,gA,D,1,MA,hA,h,2,NA,iA,i,3,OA,T,j,4,O,jA,k,J,PA,kA,E,5,QA,lA,l,6,RA,mA,m,7,SA,U,n,8,P,nA,o,K,TA,oA,F,9,UA,pA peptide 15-10-3 QAEILRAYARILEAQ 15 T 2.7 Inhibitor_I10 pdbhh F T 6wky 1 A,AA,B,BA,C,CA,D,DA,E,EA,F,FA,G,GA,H,HA,I,IA,J,JA,K,KA,L,LA,M,MA,N,NA,O,P,Q,R,S,T,U,V,W,X,Y,Z B,g,C,h,D,i,A,j,E,k,F,l,G,m,H,n,I,o,J,p,K,q,L,r,M,s,N,t,O,P,Q,R,S,T,a,b,c,d,e,f peptide 29-24-3 QAEILRAYARILEADAKILEAHAEILKAQ 29 T 20 Rad33 pdbhh F T 6wl0 1 A,AA,B,BA,C,CA,D,DA,E,EA,F,FA,G,GA,H,HA,I,IA,J,JA,K,KA,L,LA,M,MA,N,NA,O,OA,P,Q,R,S,T,U,V,W,X,Y,Z A,a,B,b,C,c,D,d,E,e,F,f,G,g,H,h,I,i,J,j,K,k,L,l,M,m,N,n,O,o,P,Q,R,S,T,U,V,W,X,Y,Z peptide 36-31-3-RD TLEELRAEARILEAKAEILKAKAEVLKAKAEILKAQ 36 T 7.2 DUF6327 pdbhh F T 6wl1 1 A,AA,B,BA,C,CA,D,DA,E,EA,F,FA,G,GA,H,HA,I,IA,J,JA,K,KA,L,LA,M,MA,N,NA,O,OA,P,PA,Q,QA,R,RA,S,SA,T,TA,U,UA,V,VA,W,WA,X,XA,Y,YA,Z,ZA A,a,B,b,C,c,D,d,E,e,F,f,G,g,H,h,I,i,J,j,K,k,L,l,M,m,N,n,O,o,P,p,Q,q,R,r,S,s,T,t,U,u,V,v,W,w,X,x,Y,y,Z,z peptide 36-31-3 QAEILRAYARILEADAEILKAQAKILEAHAEILKAQ 36 T 0.11 RnlB_antitoxin pdb F T 6wl3 2 B,E,H B,E,H NCAP_VSIVA ARG-GLY-TYR-LEU-TYR-GLN-GLY-LEU RGYLYQGL 8 T 1.1 Cap4_nuclease pdbhh T Viruses F 6wl7 1 A,AA,AB,AC,AD,AE,B,BA,BB,BC,BD,BE,C,CA,CB,CC,CD,CE,D,DA,DB,DC,DD,DE,E,EA,EB,EC,ED,EE,F,FA,FB,FC,FD,FE,G,GA,GB,GC,GD,GE,H,HA,HB,HC,HD,HE,I,IA,IB,IC,ID,IE,J,JA,JB,JC,JD,JE,K,KA,KB,KC,KD,KE,L,LA,LB,LC,LD,LE,M,MA,MB,MC,MD,ME,N,NA,NB,NC,ND,NE,O,OA,OB,OC,OD,OE,P,PA,PB,PC,PD,PE,Q,QA,QB,QC,QD,QE,R,RA,RB,RC,RD,RE,S,SA,SB,SC,SD,SE,T,TA,TB,TC,TD,TE,U,UA,UB,UC,UD,V,VA,VB,VC,VD,W,WA,WB,WC,WD,X,XA,XB,XC,XD,Y,YA,YB,YC,YD,Z,ZA,ZB,ZC,ZD A,u,GA,N,xA,JB,Z,v,HA,cA,yA,KB,a,w,J,dA,zA,W,b,x,IA,eA,0A,LB,c,F,JA,fA,S,MB,d,y,KA,gA,1A,NB,B,z,LA,O,2A,OB,e,0,MA,hA,3A,PB,f,1,K,iA,4A,X,g,2,NA,jA,5A,QB,h,G,OA,kA,T,RB,i,3,PA,lA,6A,SB,C,4,QA,P,7A,TB,j,5,RA,mA,8A,UB,k,6,L,nA,9A,Y,l,7,SA,oA,AB,VB,m,H,TA,pA,U,WB,n,8,UA,qA,BB,XB,D,9,VA,Q,CB,YB,o,AA,WA,rA,DB,ZB,p,BA,M,sA,EB,q,CA,XA,tA,FB,r,I,YA,uA,V,s,DA,ZA,vA,GB,E,EA,aA,R,HB,t,FA,bA,wA,IB peptide 29-20-2 QAEILEADARILRAYAEILKAHAEILKAQ 29 T 7.3 DUF5799 pdbhh F T 6wl8 1 A,AA,AB,AC,AD,B,BA,BB,BC,BD,C,CA,CB,CC,D,DA,DB,DC,E,EA,EB,EC,F,FA,FB,FC,G,GA,GB,GC,H,HA,HB,HC,I,IA,IB,IC,J,JA,JB,JC,K,KA,KB,KC,L,LA,LB,LC,M,MA,MB,MC,N,NA,NB,NC,O,OA,OB,OC,P,PA,PB,PC,Q,QA,QB,QC,R,RA,RB,RC,S,SA,SB,SC,T,TA,TB,TC,U,UA,UB,UC,V,VA,VB,VC,W,WA,WB,WC,X,XA,XB,XC,Y,YA,YB,YC,Z,ZA,ZB,ZC A,N,a,n,1,0,EA,RA,eA,rA,B,O,b,o,2,FA,SA,fA,C,P,c,p,3,GA,TA,gA,D,Q,d,q,4,HA,UA,hA,E,R,e,r,5,IA,VA,iA,F,S,f,s,6,JA,WA,jA,G,T,g,t,7,KA,XA,kA,H,U,h,u,8,LA,YA,lA,I,V,i,v,9,MA,ZA,mA,J,W,j,w,AA,NA,aA,nA,K,X,k,x,BA,OA,bA,oA,L,Y,l,y,CA,PA,cA,pA,M,Z,m,z,DA,QA,dA,qA Form 2 peptide QAKILEADAEILKAYAKILEAHAEILKAQ 29 T 2.4 DUF5320 pdbhh F T 6wl9 1 A,AA,AB,AC,B,BA,BB,BC,C,CA,CB,CC,D,DA,DB,DC,E,EA,EB,EC,F,FA,FB,FC,G,GA,GB,GC,H,HA,HB,HC,I,IA,IB,IC,J,JA,JB,JC,K,KA,KB,KC,L,LA,LB,LC,M,MA,MB,MC,N,NA,NB,NC,O,OA,OB,OC,P,PA,PB,PC,Q,QA,QB,QC,R,RA,RB,RC,S,SA,SB,SC,T,TA,TB,TC,U,UA,UB,UC,V,VA,VB,VC,W,WA,WB,WC,X,XA,XB,XC,Y,YA,YB,YC,Z,ZA,ZB,ZC A,N,a,n,0,DA,QA,dA,B,O,b,o,1,EA,RA,eA,C,P,c,p,2,FA,SA,fA,D,Q,d,q,3,GA,TA,gA,E,R,e,r,4,HA,UA,hA,F,S,f,s,5,IA,VA,iA,G,T,g,t,6,JA,WA,jA,H,U,h,u,7,KA,XA,kA,I,V,i,v,8,LA,YA,lA,J,W,j,w,9,MA,ZA,mA,K,X,k,x,AA,NA,aA,nA,L,Y,l,y,BA,OA,bA,oA,M,Z,m,z,CA,PA,cA,pA peptide Form2a QAEILKADAEILKAYAKILEAHAEILKAQ 29 T 4.5 DUF5320 pdbhh F T 6wlg 1 A,B A,B INT3_HUMAN INT3,SOSS COMPLEX SUBUNIT A,SENSOR OF SINGLE-STRAND DNA COMPLEX SUBUNIT A,SENSOR OF SSDNA SUBUNIT A HPIKETVVEEPVDITPYLDQLDESLRDKVLQLQKGSDTEAQCEVMQEIVDQVLEEDFDSEQLSVLASCLQELFKAHFRGEVLPEEITEESLEESVGKPLYLIFRNLCQMQEDNSSFSLLLDLLSELYQKQPKIGYHLLYYLRASKAAAGKMNLYESFAQATQLGDLHTCLMMDMKACQEDDVRLLCHLTPSIYTEFPDETLRSGELLNMIVAVIDSAQLQELVCHVMMGNLVMFRKDSVLNILIQSLDWETFEQYCAWQLFLAHNIPLETIIPILQHLKYKEHPEALSCLLLQLRREKPSEEMVKMVLSRPCHPDDQFTTSILRHWCMKHDELLAEHIKSLLIKNNSLPRKRQSLRSSSSKLAQLTLEQILEHLDNLRLNLTNTKQNFFSQTPILQALQHVQASCDEAHKMKFSDLFSLAEEY 423 T 0.0011 IFRD pdbpssm F Eukaryota T 6wlw 6 N T RNK_HUMAN RNASE KAPPA MGWLRPGPRPLCPPARASWAFSHRFPSPLAPRRSPTPFFMASLLCCGPKLAACGIVLSAWGVIMLIMLGIFFNVHSAVLIEDVPFTEKDFENGPQNIYNLYEQVSYNCFIAAGLYLLLGGFSFCQVRLNKRKEYMVR 137 T 0.0026 DUF2650 pdbpercent F Eukaryota T 6wlx 2 B B CTNB1_HUMAN BETA-CATENIN KKRLSVE 7 T 0.051 Adaptin_N unppssm F Eukaryota T 6wlz 3 G,H,I X,Y,Z Q5ZWW6_LEGPH SidK MSFIKVGIKMGGLTSEQYHSQVVGKIGYIARCMQTIDPENNLKKIREDYQDVLIWAEKNYRFEEILEASKSGKCPNDLDALSRRSLILQELLRLVSSISPFKMKLDLIESQYEKMKQHVNLWKSDYHVKLNQLNQLTDYLKNAAPTPKNNFLRAMTSVLQMQIAQYGITEDNEGINQLFKLGLHLLAMANEKIDEQYHLFKGYVKDQPEESPFEGILPAEDQKILVKTMIDYAMPKLSSKVLQDKLSALSSSDVLTKTLLDSIDRIVKENEKLNALSKVKLGKFGLDIREIEVIYSQALKISPQDALQYTAQQCDAQLLSMAFPDSQNYIIESISNKKVKTIAELIHSKEFIYQIIKTEVFKQVDPNEKIRLQAATELYQLLGRIMDKQINLFTKMNLEQINEYIQTKTKAILDKIPERVELLTFMGFEIPTFKGIETLMTDISHSQDNETLAIAQEFYTNIKNAKNQLLGDKLIEDITPQDVEKFFNQCSQYGSEAAEKLADNRPVLTKIADILTAIARWAISLIGFNTPPQFLAPTRTCVDQVSDEITKIKLKLEDTLGSLQKVQEESLSL 573 T 0.15 Chalcone pdb F Bacteria T 6wm1 2 C,D B,D ACE-PTR-02K-ASN-PRA XXXNX 5 T 430 DUF3673 pdbhh F F 6wm2 8 P,Q,R X,Y,Z Q5ZWW6_LEGPH SidK MSFIKVGIKMGGLTSEQYHSQVVGKIGYIARCMQTIDPENNLKKIREDYQDVLIWAEKNYRFEEILEASKSGKCPNDLDALSRRSLILQELLRLVSSISPFKMKLDLIESQYEKMKQHVNLWKSDYHVKLNQLNQLTDYLKNAAPTPKNNFLRAMTSVLQMQIAQYGITEDNEGINQLFKLGLHLLAMANEKIDEQYHLFKGYVKDQPEESPFEGILPAEDQKILVKTMIDYAMPKLSSKVLQDKLSALSSSDVLTKTLLDSIDRIVKENEKLNALSKVKLGKFGLDIREIEVIYSQALKISPQDALQYTAQQCDAQLLSMAFPDSQNYIIESISNKKVKTIAELIHSKEFIYQIIKTEVFKQVDPNEKIRLQAATELYQLLGRIMDKQINLFTKMNLEQINEYIQTKTKAILDKIPERVELLTFMGFEIPTFKGIETLMTDISHSQDNETLAIAQEFYTNIKNAKNQLLGDKLIEDITPQDVEKFFNQCSQYGSEAAEKLADNRPVLTKIADILTAIARWAISLIGFNTPPQFLAPTRTCVDQVSDEITKIKLKLEDTLGSLQKVQEESLSL 573 T 0.15 Chalcone pdb F Bacteria T 6wm2 15 GA T RNK_HUMAN RNASE KAPPA MGWLRPGPRPLCPPARASWAFSHRFPSPLAPRRSPTPFFMASLLCCGPKLAACGIVLSAWGVIMLIMLGIFFNVHSAVLIEDVPFTEKDFENGPQNIYNLYEQVSYNCFIAAGLYLLLGGFSFCQVRLNKRKEYMVR 137 T 0.0026 DUF2650 pdbpercent F Eukaryota T 6wm3 6 J T RNK_HUMAN RNASE KAPPA MGWLRPGPRPLCPPARASWAFSHRFPSPLAPRRSPTPFFMASLLCCGPKLAACGIVLSAWGVIMLIMLGIFFNVHSAVLIEDVPFTEKDFENGPQNIYNLYEQVSYNCFIAAGLYLLLGGFSFCQVRLNKRKEYMVR 137 T 0.0026 DUF2650 pdbpercent F Eukaryota T 6wm3 14 DA,EA,FA Z,X,Y Q5ZWW6_LEGPH SidK MSFIKVGIKMGGLTSEQYHSQVVGKIGYIARCMQTIDPENNLKKIREDYQDVLIWAEKNYRFEEILEASKSGKCPNDLDALSRRSLILQELLRLVSSISPFKMKLDLIESQYEKMKQHVNLWKSDYHVKLNQLNQLTDYLKNAAPTPKNNFLRAMTSVLQMQIAQYGITEDNEGINQLFKLGLHLLAMANEKIDEQYHLFKGYVKDQPEESPFEGILPAEDQKILVKTMIDYAMPKLSSKVLQDKLSALSSSDVLTKTLLDSIDRIVKENEKLNALSKVKLGKFGLDIREIEVIYSQALKISPQDALQYTAQQCDAQLLSMAFPDSQNYIIESISNKKVKTIAELIHSKEFIYQIIKTEVFKQVDPNEKIRLQAATELYQLLGRIMDKQINLFTKMNLEQINEYIQTKTKAILDKIPERVELLTFMGFEIPTFKGIETLMTDISHSQDNETLAIAQEFYTNIKNAKNQLLGDKLIEDITPQDVEKFFNQCSQYGSEAAEKLADNRPVLTKIADILTAIARWAISLIGFNTPPQFLAPTRTCVDQVSDEITKIKLKLEDTLGSLQKVQEESLSL 573 T 0.15 Chalcone pdb F Bacteria T 6wm4 6 J T RNK_HUMAN RNASE KAPPA MGWLRPGPRPLCPPARASWAFSHRFPSPLAPRRSPTPFFMASLLCCGPKLAACGIVLSAWGVIMLIMLGIFFNVHSAVLIEDVPFTEKDFENGPQNIYNLYEQVSYNCFIAAGLYLLLGGFSFCQVRLNKRKEYMVR 137 T 0.0026 DUF2650 pdbpercent F Eukaryota T 6wm4 9 Q,R,S Y,Z,X Q5ZWW6_LEGPH SidK MSFIKVGIKMGGLTSEQYHSQVVGKIGYIARCMQTIDPENNLKKIREDYQDVLIWAEKNYRFEEILEASKSGKCPNDLDALSRRSLILQELLRLVSSISPFKMKLDLIESQYEKMKQHVNLWKSDYHVKLNQLNQLTDYLKNAAPTPKNNFLRAMTSVLQMQIAQYGITEDNEGINQLFKLGLHLLAMANEKIDEQYHLFKGYVKDQPEESPFEGILPAEDQKILVKTMIDYAMPKLSSKVLQDKLSALSSSDVLTKTLLDSIDRIVKENEKLNALSKVKLGKFGLDIREIEVIYSQALKISPQDALQYTAQQCDAQLLSMAFPDSQNYIIESISNKKVKTIAELIHSKEFIYQIIKTEVFKQVDPNEKIRLQAATELYQLLGRIMDKQINLFTKMNLEQINEYIQTKTKAILDKIPERVELLTFMGFEIPTFKGIETLMTDISHSQDNETLAIAQEFYTNIKNAKNQLLGDKLIEDITPQDVEKFFNQCSQYGSEAAEKLADNRPVLTKIADILTAIARWAISLIGFNTPPQFLAPTRTCVDQVSDEITKIKLKLEDTLGSLQKVQEESLSL 573 T 0.15 Chalcone pdb F Bacteria T 6wmf 2 B B KASH5_HUMAN COILED-COIL DOMAIN-CONTAINING PROTEIN 155,KASH DOMAIN-CONTAINING PROTEIN 5 GPGGSGGPSPPPTWPHLQLCYLQPPPV 27 T 1.7 B56 pdbhh F Eukaryota T 6wmk 1 A,C A,C Beta sheet heterodimer LHD29 - Chain A SGGSTWQWVLINISEEARQRIEEYVRRISKKEGTEVHFEKDDGVLHIRVKNLHEKRAREIHEYAKRVIL 69 T 0.005 TnpV pdb F T 6wmk 2 B,D B,D Beta sheet heterodimer LHD29 - Chain B SGGSSSIFLLSNVSEEARQRAEEYVRRISKKEGTEVRFEKDDGFLTIEVKNLSEERLREIAEYLWRVAV 69 T 0.013 MecA pdb F T 6wmq 2 C,D E,F NCOR1_HUMAN N-COR1 RTHRLITLADHICQIITQDFARN 23 T 40 Es2 pdbhh F Eukaryota T 6wms 2 C,D E,F NCOR1_HUMAN NCOR isoform c DPASNLGLEDIIRKALMGSFDDK 23 T 3.3 RuvA_C pdbhh F Eukaryota T 6wmt 9 I,J K,L aCTDs XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 73 F F F 6wnm 1 A,B A,B A0A509JD33_PSEAI Pf4r MSTPADRARLLIKKIGPKKVSLHGGDYERWKSVSKGAIRVSTEEIDVLVKIFPNYALWIASGSIAPEVGQTSPDYDEANLNLSNQNAGAHHHHHH 95 T 0.0022 BetR unphh F Bacteria T 6wnx 3 C,F,I C,F,I CTNB1_HUMAN BETA-CATENIN LDSGIHSGA 9 T 2.4 Peptidase_C9 pdbhh F Eukaryota T 6wo2 2 C,D C,D ACE-PTR-02K-ASN-U67 XXXNX 5 T 430 DUF3673 pdbhh F F 6wo6 1 A,B A,B A0A3A6VZ03_LEGPN RavA SNATFTCDELKGLEHPYEVLGNGDALAENREELNKLTNDAALVLASRLVLECPVNELKDFAHAIEAARMPQDDSDTFHSFLFQAYQVKKRIISLLDPRNINPHSMILEKEFDGELFNNFNKLAIDVLTNNEVAIALRLAETTPAQDRSRVSQNINNIFPQSLFAAKVGHAFAVRRDIERLLLGDRPDQFFSSREFKIDSCIEFASLFNVINDKESSIAGKLALRTPAENRTDVVMKIKGFCAEDSELAIKVQSAFALRRDIERNLLGDNPEQFFSSRDFSVDLCLEFAILFPELLKGHEQAIGEKLAKLDAKVRSDISRKLEMINGAAHEQ 331 T 0.29 PORR pdbpssm F Bacteria T 6woo 47 UA K L10 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 152 F F F 6wpb 1 A A HLP1_BOAPU HSP1-NH2 GILDAIKAIAKAAGX 15 T 0.48 Antimicrobial_2 unphh F Eukaryota T 6wpd 1 A A HLP1_BOAPU HSP1 GILDAIKAIAKAAGX 15 T 0.48 Antimicrobial_2 unphh F Eukaryota T 6wpv 1 A A Xanthoxycyclin D GTVAVQFL 8 T 9.9 MatB pdbhh F T 6wpz 1 A,B A,B A0A509JD33_PSEAI Pf4r MSTPADRARLLIKKIGPKKVSLHGGDYERWKSVSKGAIRVSTEEIDVLVKIFPNYALWIASGSIAPEVGQTSPDYDEANLNLSNQNAGAHHHHHH 95 T 0.0022 BetR unphh F Bacteria T 6wq2 3 C,D,E,F,G,H,I,J,K,L,M,N,O,P,Q,R,S A,B,C,D,E,F,G,H,I,J,K,L,M,N,P,R,T Y035_SIFVH Structural protein MCP2 MARRNRRLSSASVYRYYLKRISMNIGTTGHVNGLSIAGNPEIMRAIARLSEQETYNWVTDYAPSHLAKEVVKQISGKYNIPGAYQGLLMAFAEKVLANYILDYKGEPLVEIHHNFLWELMQRQSGAGLGVTSGFIYTFVRKDGKPVTVDMSKVLTEIEDALFKLVKK 167 T 0.022 DUF1581 pdb T Viruses T 6wq2 4 AA,BA,CA,DA,EA,FA,GA,HA,IA,JA,T,U,V,W,X,Y,Z h,i,j,k,l,m,n,p,r,t,a,b,c,d,e,f,g Y036_SIFVH Structural protein MCP1 MAGRQSHKKIDVRNDTSTRYKGKLYGIFVNYMGEKYAQQLVENMYSNYNDVFVEIYNKMHNALRPTLVKLAGAGATFPLWQLVNEAIYAVYLTHKETASFLVTKYVARGVPAMTVKTLLAEVGNQLKELVPAVAEQIGSVTLDHTNVVSTVDNIVTSMPALPNSYAGVLMKTKVPTVTPHYAGTGTFSSMESAYKALEDIERGL 204 T 0.019 MMS22L_C pdb T Viruses T 6wqh 2 G S Ig2 substrate XXXXXXXXXXX 11 F F F 6wqj 1 A A A0A0A0L4Q9_CUCSA Vicilin-buried peptide-10 QKETEICRQWCQVMKPQGGEEQRRCQQECEERLRD 35 T 4.8E-05 Vicilin_N unphh F Eukaryota T 6wqr 1 A A HSTX1_HAESL HSTX-I ACKEYWECGAFLFCIEGICVPMIX 24 T 0.0053 DUF3397 unppercent F Eukaryota T 6wqu 4 D D NOTC3_HUMAN NOTCH 3 ARRKREHSTLWFPEGFSL 18 T 0.062 VPS9 unphh F Eukaryota T 6wrv 2 D,E,F D,C,F Computationally designed protein 3DS18 DEREEEQRRRLEEVKEEAKRRERSEQDLAVLYLEAVNAAVVFVADSEEEAKRVADIVKKLVPEVIIFVHDNFVVFVVDSDEAARRVYEIVERAQ 94 T 0.0011 MCM6_C pdb F T 6wrw 2 C,D C,D Computationally designed protein 2DS25.5 DEEEIQKAIEELLRKGVSEEEAAIIIVQRFNVAVVVVVQDERQGKHISEYIRRYIPEADVILFANLVVIKVETHELSTRVWEAAQKAY 88 T 0.023 NUDIX_2 pdb F T 6wrx 2 C,D C,D Computationally designed protein 2DS25.1 DEEEIQKAIEELLRKGVSEEEAAIIIVQRFNVAVVVVVQDERQAKHISEYIRRYIPEADVILFANIVVIKVETHELRKRVWEAAQKAY 88 T 0.014 NUDIX_2 pdb F T 6ws0 3 C ZZZ REV3L_HUMAN PROTEIN REVERSIONLESS 3-LIKE,HREV3 MLTPTPDSSPRSTSSPSQSKNGSFTPRTANILKPLMSPPSREEIMATLLDHD 52 T 7 CaM_bind pdbhh F Eukaryota T 6ws5 3 C ZZZ REV3L_HUMAN PROTEIN REVERSIONLESS 3-LIKE,HREV3 MLTPTPDSSPRSTSSPSQSKNGSFTPRTANILKPLMSPPSREEIMATLLDHD 52 T 7 CaM_bind pdbhh F Eukaryota T 6wsj 2 B I cyclopeptide des4.3.1 XXDXXFXK 8 T 1.2 DUF2274 pdbhh F F 6wt4 1 A,B A,B CAP13_FLASX Bacterial STING SEAEYSPAFALAVGYFKNFIFPAITQIKENGEVNPKICIYKPKHFDELTSTNIDMIKAELTNKKYNLSEINLSLKGARARDILTLNKKSKIHSYFDFPNTLLSLYSYVDFKIASSNNNSSELKKKKFVELLIEQFYLKLNELIQENNLTNNITFCDKNLQGL 162 T 0.00067 TMEM173 pdbhh F Bacteria T 6wt5 1 A,B,C,D A,B,C,D CAP12_CAPGB ABC-TYPE SUGAR TRANSPORT SYSTEM, PERIPLASMIC COMPONENT SNDSDINFFPSSTLAAVYYENFIKPTCSHIINNGGLLDKNGYIYKKCTIKIIIPKKLTSDVNSQFQRIKAKIETKELSFEYLGRPRNINVEIIAEDGEVMIIDFPTILSGINYAISNLLPQDFNSMSVDYEAILSRELERFVYTLKKIALRDGFDDLIKIVDEDN 165 T 0.00018 TMEM173 pdbhh F Bacteria T 6wt8 1 A A CDNE_FLASX FSCDNE SQKNYLELIKKVRERSNPDLVQMTKMYSETLSGSKLFENKSIEYSDVSIYIKESMKGVAPSYTMNSKVAANKVEAHLKKSHGNLVDFERQGSVMTNTHILKENDVDLVQITNKSSEFDHKGLEKALNNTSVLKTEEILNLKKHKENFSPYQGNQIDDLKYVRLKSELVLSSTYKTVDIEKENSIYVKVTEPERDIDVVTATYYKSVDFMKTNDKSRKGIQIYNKKTGKINDVDYPFLSIERINVKDIISNRRLKNMIRFLKNIKYDCPHIENKGSIRSFHINAICYNIDVKKYEDLHYLDLVSILYQELTNIISNKSYRDNIKSVDGCEYIFEFDCAKKLIEIEFLSQELDSIIADLHNQSLLVG 365 T 0.00046 NTP_transf_2 pdbpercent F Bacteria T 6wt9 1 A A CDNE_CAPGB CGCDNE SEKKNYSALFENLQNRSNPEKLQEITTKFFSDNPDVKYNDVLKYITLAMNGVSPEYTNKSREAGEKVKLHLQDILLDVEYQYQGSVMTNTHIKGYSDIDLLVISDKFYTLDERNIIENLEVNKFSLSQEKIQKLQQELLGKKYHSATNDLKNNRLLSEQKLSSVYEICDITHPKAIKITNKSMGRDVDIVIANWYDDAQSVINNRQIEYRGIQIYNKRSNTIENRDFPFLSIQRINKRSSETKGRLKKMIRFLKNLKADSDEKIELSSFDINAICYNIEKNKYLHSNKYQLVPILYEQLNELVSNSNKINSLKSVDGHEYIFSRNNIDKKESLKMLLQEVKIIYSNLQSYL 351 T 1.3E-05 NTP_transf_2 pdb F Bacteria T 6wth 4 D,E D,E 7B1 Fab XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 118 F F F 6wth 5 F,G F,G 10D4 Fab XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 115 F F F 6wuc 4 D W CENPW_YEAST CENP-W HOMOLOG,CONSTITUTIVE CENTROMERE-ASSOCIATED NETWORK PROTEIN WIP1,W-LIKE PROTEIN 1 SNAMDTEALANYLLRQLSLDAEENKLEDLLQRQNEDQESSQEYNKKLLLACGFQAILRKILLDARTRATAEGLREVYPYHIEAATQAFLDSQ 92 T 4.6E-05 CENP-W pdbhh F Eukaryota T 6wuc 5 E T CENPT_YEAST CENP-T HOMOLOG,CO-PURIFIED WITH NNF1 PROTEIN 1,CONSTITUTIVE CENTROMERE-ASSOCIATED NETWORK PROTEIN CNN1 MSTPRKAAGNNENTEVSEIRTPFRERALEEQRLKDEVLIRNTPGYRKLLSASTKSHDILNKDPNEVRSFLQDLSQVLARKSQGNDTTTNKTQARNLIDELAYEESQPEENELLRSRSEKLTDNNIGNETQPDYTSLSQTVFAKLQERDKGLKSRKIDPIIIQDVPTTGHEDELTVHSPDKANSISMEVLRTSPSIGMDQVDEPPVRDPVPISITQQEEPLSEDLPSDDKEETEEAENEDYSFENTSDENLDDIGNDPIRLNVPAVRRSSIKPLQIMDLKHLTRQFLNENRIILPKQTWSTIQEESLNIMDFLKQKIGTLQKQELVDSFIDMGIINNVDDMFELAHELLPLELQSRIESYLF 361 T 0.0019 CENP-T_C pdbhh F Eukaryota T 6wud 2 B B TMC1_MOUSE BEETHOVEN PROTEIN,DEAFNESS PROTEIN,TRANSMEMBRANE COCHLEAR-EXPRESSED PROTEIN 1 GGGDDNTFNFSWKVFCSWDYLIGNPETADNKFNSITMNFKEAIIEERAAQVEENI 55 T 4.4 NAD_binding_5 pdbhh F Eukaryota T 6wuu 2 E,F,G,H G,H,I,J VIR250 XXXGX 5 T 1400 UPF0547 pdbhh F F 6wux 1 A,B A,B HTRSN_PHYTS Homotarsinin NLVSDIIGSKKHMEKLISIIKKCRX 25 T 7 Spore_IV_A unphh F Eukaryota T 6wvs 1 A A DeNovoTIM15 hyperstable de novo TIM barrel MDILIVNATDVDEMLKQVEILRRLGAKQIAVVSDDWRILQEALKKGGDILIVNATDVDEMLKQVEILRRLGAKQIAVVSDDWRILQEALKKGGDILIVNATDVDEMLKQVEILRRLGAKQIAVVSDDWRILQEALKKGGDILIVNATDVDEMLKQVEILRRLGAKQIAVVSDDWRILQEALKKGGLEHHHHHH 193 T 0.0034 DNA_photolyase pdbpercent F T 6ww7 4 D D ER Membrane Protein Complex Subunit 4 XXXXXXXXXXXXXX 14 F F F 6ww7 9 I I EMC10_HUMAN HEMATOPOIETIC SIGNAL PEPTIDE-CONTAINING MEMBRANE DOMAIN-CONTAINING PROTEIN 1 MAAASAGATRLLLLLLMAVAAPSRARGSGCRAGTGARGAGAEGREGEACGTVGLLLEHSFEIDDSANFRKRGSLLWNQQDGTLSLSQRQLSEEERGRLRDVAALNGLYRVRIPRRPGALDGLEAGGYVSSFVPACSLVESHLSDQLTLHVDVAGNVVGVSVVTHPGGCRGHEVEDVDLELFNTSVQLQPPTTAPGPETAAFIERLEMEQAQKAKNPQEQKSFFAKYWMYIIPVVLFLMMSGAPDTGGQGGGGGGGGGGGSGR 262 T 0.0033 2OG-FeII_Oxy_5 pdbpssm F Eukaryota T 6ww9 2 B,D X,Y SHLD3_HUMAN REV7-INTERACTING NOVEL NHEJ REGULATOR 1,SHIELD COMPLEX SUBUNIT 3 MLSRFIPWFPYDGSKLPLRPKRSPPASREEIMATL 35 T 2.2 Sm_like pdbhh F Eukaryota T 6wwc 3 C,E C,F ENV_HV1H2 fusion peptide AVGIGAVF 8 T 4.2 DUF3918 pdbhh T Viruses F 6wx2 3 C,F C,F Q2N0S7_9HIV1 fusion peptide AVGIGAVF 8 T 4.2 DUF3918 pdbhh T Viruses F 6wx4 2 B I VIR251 GXXGX 5 T 490 zf-C2H2_10 pdbhh F F 6wxh 2 D D CEA1_ECOLX Colicin-E1 METAVAYYKDGVPYDDKGQVIITLLNGTPDGSGSGGGGGKGGSKSESSAAIHATAKWSTAQLKKTQAEQAARAKAAAEAQAKAKANRDALTQRLKDIVNEALRHNASRTPSATELAHANNAAMQAEDERLRLAKAEEKARKEAEAAEKAFQEAEQRRKEIEREKAETERQLKLAEAEEKRLAALSEEAKALEHHHHHH 198 T 0.21 GTP1_OBG unppercent F Bacteria T 6wxo 1 A,B A,B TFD-HE MGHHHHHHGGWGGSGGENLYFQGDILIVNAKDVDEMLKQVEILRRLGAKQIAVHSSDWRILQEALKKGGDILIVNGGGMTITFRGDDLEALLKAAIEMIKQALKFGATITLSLDGNDLNINITGVPEQVRKELAKEAERLAKEFGITVTRTGGGDVDEMLKQVEILRRLGAKQIAVESDDWRILQEALKKG 191 T 0.0014 YicC_N pdbpercent F T 6wy6 2 C,D C,D EDE1_YEAST BUD SITE SELECTION PROTEIN 15, EDE1 ADSESEFENVANAGSMEQFETIDHKDLX 28 T 0.063 ComC pdbpssm F Eukaryota T 6wyv 8 H C VIRGINIAMYCIN S1 XTXPXXX 7 T 260 zf-C2H2_jaz pdbhh F F 6wzx 2 C,D C,D ILE-GLY-LEU-TRP-LYS peptide IGLWKS 6 T 4.5 DUF3876 pdbhh F T 6wzz 2 B B VGLWKS peptide VGLWKS 6 T 1.1 CCD48 pdbhh F T 6x1g 1 A,B A,C B3CVM3_ORITI ULP_PROTEASE domain-containing protein MERLVKKVTSNLETELKFFKGRLVQELMQIVKNENGRIDHTSKNWQESASVLLNSQEKGAVSLAEVERAVSKMTQKLRDQKVSEEEVVNIESKLKFERASLEAKLFDDNEIKELINKRIKEDALRAIPFLGSDSESFMEKISPFVKLPDDSYSLLKANDKHHPFQNILYSNALKFFADSSDIGYLNDDSLKNLTPENLNAFEQAVAADIDKLMHHHHHH 219 T 0.0088 RAB3GAP2_C pdbpssm F Bacteria T 6x1h 1 A,B,C,D,E,F E,D,B,F,C,A B3CVM3_ORITI ULP_PROTEASE domain-containing protein MERLVKKVTSNLETELKFFKGRLVQELMQIVKNENGRIDHTSKNWQESASVLLNSQEKGAVSLAEVERAVSKMTQKLRDQKVSEEEVVNIESKLKFERASLEAKLFDDNEIKELINKRIKEDALRAIPFLGSDSESFMEKISPFVKLPDDSYSLLKANDKHHPFQNILYSNALKFFADSSDIGYLNDDSLKNLTPENLNAFEQAVAADIDKLMHHHHHH 219 T 0.0088 RAB3GAP2_C pdbpssm F Bacteria T 6x1s 3 I,J,K,L G,I,J,K NM23-1-pTza peptide RNIIXGSDS 9 T 0.6 Nbs1_C pdbhh F T 6x1t 3 M,N,O,P,Q,R M,N,O,P,Q,R NM23-1-pTza peptide RNIIXGSDS 9 T 0.6 Nbs1_C pdbhh F T 6x1u 3 E,F D,C ACLYana-3-pTza peptide AGAGXAGAG 9 T 7.5 Pyr_redox_2 pdbhh F F 6x1v 3 C D ACLYana-3-pTza peptide AGAGXAGAG 9 T 7.5 Pyr_redox_2 pdbhh F F 6x1w 3 C A ACLYana-3-pTza peptide AGAGXAGAG 9 T 7.5 Pyr_redox_2 pdbhh F F 6x23 2 B B KCNJ9_HUMAN GIRK-3,INWARD RECTIFIER K(+) CHANNEL KIR3.3,POTASSIUM CHANNEL,INWARDLY RECTIFYING SUBFAMILY J MEMBER 9 LPPPESESKV 10 T 22 ANAPC9 pdbhh F Eukaryota T 6x2p 4 D D MP2K1_HUMAN MKK1,ERK ACTIVATOR KINASE 1,MAPK/ERK KINASE 1,MEK 1 TNLEALQKKLEELELDE 17 T 0.07 CAMSAP_CC1 unppercent F Eukaryota T 6x2s 4 D D MP2K1_HUMAN MKK1,ERK ACTIVATOR KINASE 1,MAPK/ERK KINASE 1,MEK 1 NLEALQKKLEELELNQ 16 T 0.07 CAMSAP_CC1 unppercent F Eukaryota T 6x2x 4 D D MP2K1_HUMAN MKK1,ERK ACTIVATOR KINASE 1,MAPK/ERK KINASE 1,MEK 1 ALEALQKKLEELELDE 16 T 0.07 CAMSAP_CC1 unppercent F Eukaryota T 6x2y 4 D D DIAP3_HUMAN DIAPHANOUS-RELATED FORMIN-3,DRF3,MDIA2 VEALLARLRAL 11 T 5.3 Packaging_FI pdbhh F Eukaryota F 6x3b 1 A,B,C,D A,B,C,D RMD_PSEAE NAD-DEPENDENT EPIMERASE/DEHYDRATASE FAMILY PROTEIN MGSSHHHHHHSSENLYFQGHMTQRLFVTGLSGFVGKHLQAYLAAAHTPWALLPVPHRYDLLEPDSLGDLWPELPDAVIHLAGQTYVPEAFRDPARTLQINLLGTLNLLQALKARGFSGTFLYISSGDVYGQVAEAALPIHEELIPHPRNPYAVSKLAAESLCLQWGITEGWRVLVARPFNHIGPGQKDSFVIASAARQIARMKQGLQANRLEVGDIDVSRDFLDVQDVLSAYLRLLSHGEAGAVYNVCSGQEQKIRELIELLADIAQVELEIVQDPARMRRAEQRRVRGSHARLHDATGWKPEITIKQSLRAILSDWESRVREE 324 T 0.00018 GDP_Man_Dehyd pdbpercent F Bacteria T 6x5g 2 B B LRRC7_HUMAN DENSIN-180,DENSIN,PROTEIN LAP1 SKSRSTSSHGRRPLIRQDRIVG 22 T 3.7 Rubredoxin_C pdbhh F Eukaryota T 6x5i 1 A,AA,AB,B,BA,BB,C,CA,CB,D,DA,DB,E,EA,EB,F,FA,FB,G,GA,GB,H,HA,HB,I,IA,IB,J,JA,JB,K,KA,KB,L,LA,LB,M,MA,MB,N,NA,NB,O,OA,OB,P,PA,PB,Q,QA,QB,R,RA,RB,S,SA,T,TA,U,UA,V,VA,W,WA,X,XA,Y,YA,Z,ZA B,c,BA,D,d,CA,E,e,DA,F,f,EA,G,g,FA,H,h,GA,I,i,HA,J,j,IA,K,k,JA,L,l,KA,M,m,LA,N,n,MA,O,o,NA,P,p,OA,Q,q,PA,R,r,QA,S,s,RA,T,t,SA,U,u,V,v,W,w,X,x,Y,y,Z,z,a,0,b,AA 1-KMe3 peptide-like fibril XXXXK 5 F F F 6x5q 2 B B GRIA1_HUMAN GLUR-1,AMPA-SELECTIVE GLUTAMATE RECEPTOR 1,GLUR-A,GLUR-K1,GLUTAMATE RECEPTOR IONOTROPIC,AMPA 1,GLUA1 SKRMKGFCLIPQQSINEAIR 20 T 0.21 PROCT pdbhh F Eukaryota T 6x5r 2 C,D C,D A2-Asn ANK 3 T 540 Proteasome_A_N pdbhh F F 6x5s 2 C,D C,D A3'-Asn ANK 3 T 540 Proteasome_A_N pdbhh F F 6x5v 2 B B Peptide GYTS 4 T 62 Ice_nucleation pdbhh F F 6x5w 2 B B peptide AGYTD 5 T 32 DUF5724 pdbhh F F 6x64 1 A,AA,B,BA,C,CA,D,DA,E,EA,F,FA,G,GA,H,HA,I,IA,J,JA,K,L,M,N,O,P,Q,R,S,T,U,V,W,X,Y,Z AV,NV,AW,NW,BV,OV,BW,OW,CV,PV,CW,PW,DV,QV,DW,QW,EV,RV,EW,RW,FV,FW,GV,GW,HV,HW,IV,IW,JV,JW,KV,KW,LV,LW,MV,MW T4SS XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 330 F F F 6x65 1 A,AA,AD,AE,B,BA,BD,C,CA,CC,CD,D,DA,DC,E,EA,EB,EC,F,FA,FB,G,GA,GB,GE,H,HA,HE,I,IA,ID,IE,J,JA,JD,K,KC,KD,L,LC,M,MB,MC,N,NB,O,OA,OB,P,PA,Q,QA,QD,R,RD,S,SC,SD,T,TC,U,UB,UC,V,VB,W,WA,WB,X,XA,Y,YA,YD,Z,ZD AV,NV,IX,LZ,AW,NW,IY,BV,OV,FX,IZ,BW,OW,FY,CV,PV,CX,FZ,CW,PW,CY,DV,QV,CZ,MX,DW,QW,MY,EV,RV,JX,MZ,EW,RW,JY,FV,GX,JZ,FW,GY,GV,DX,GZ,GW,DY,HV,AX,DZ,HW,AY,IV,AZ,KX,IW,KY,JV,HX,KZ,JW,HY,KV,EX,HZ,KW,EY,LV,BX,EZ,LW,BY,MV,BZ,LX,MW,LY Type IV secretion system unknown protein fragment XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 228 F F F 6x66 5 AB,AC,BB,CA,CB,DA,E,EA,F,G,GC,HC,IB,IC,JB,KA,KB,LA,M,MA,N,O,OC,PC,QB,QC,RB,SA,SB,TA,U,UA,V,W,WC,XC,YB,YC,ZB GX,JZ,GY,DX,GZ,DY,AX,DZ,AY,AZ,KX,KY,HX,KZ,HY,EX,HZ,EY,BX,EZ,BY,BZ,LX,LY,IX,LZ,IY,FX,IZ,FY,CX,FZ,CY,CZ,MX,MY,JX,MZ,JY Type IV secretion system unknown protein fragment AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 330 T 18000 zf-C2H2_6 pdbhh F F 6x6f 1 A,B,C,D,E,F A,B,C,D,F,H Pf6r MESIQSRARTLIDKAGIDRLVRHGEISHSRWQSVRYKDIRMSTEELEVLQSLFPHYRLWLISGEVMPEAGQVSPDFEEASRNLAGQNAGAHHHHHH 96 T 7E-05 BetR pdbhh F T 6x6h 2 B A2 A9ZMR8_ECOLX STX2A PECQITGDRPVIKINNTLWESNTAAAFLNRKSQFLYTTGK 40 T 7.7 CdiA_C pdbhh F Bacteria T 6x6h 4 H P P11 peptide GFGLFD 6 T 7 TMEM65 pdbhh F F 6x6m 2 B B peptide DSTD 4 T 320 BNR_6 pdbhh F F 6x6o 1 A,B A,B SPAC_BPT4 Protein spackle MKKFIFATIFALASCAAQPAMAGYDKDLCEWSMTADQTEVETQIEADIMNIVKRDRPEMKAEVQKQLKSGGVMQYNYVLYCDKNFNNKNIIAEVVGELEHHHHHH 105 T 0.0054 Mfp-3 unphh T Viruses T 6x6q 2 B B ASP-SER-ASP DSD 3 T 180 Glyco_trans_2_3 pdbhh F F 6x6s 1 A,AA,AB,AF,B,BA,BF,C,CA,CE,CF,D,DE,DF,E,ED,EE,EF,FD,FE,GC,GD,GE,HC,HD,IB,IC,ID,JB,JC,KA,KB,KC,LA,LB,M,MA,MB,N,NA,O,OA,OE,P,PE,Q,QD,QE,RD,RE,SC,SD,SE,TC,TD,UB,UC,UD,VB,VC,WA,WB,WC,XA,XB,Y,YA,YB,Z,ZA AA,CC,EE,NA,AB,CD,NB,AC,CE,LA,NC,AD,LB,ND,AE,JA,LC,NE,JB,LD,HA,JC,LE,HB,JD,FA,HC,JE,FB,HD,DA,FC,HE,DB,FD,BA,DC,FE,BB,DD,BC,DE,MA,BD,MB,BE,KA,MC,KB,MD,IA,KC,ME,IB,KD,GA,IC,KE,GB,ID,EA,GC,IE,EB,GD,CA,EC,GE,CB,ED A0A2J9KJK3_HELPX Type IV secretion system apparatus protein Cag3 MFRKLATAVSLIGLLTSNTLYAKEISEADKVIKATKETKETKKEAKRLKKEAKQRQQIPDHKKPQYVSVDDTKTQALFDIYDTLNVNDKSFGDWFGNSALKDKTYLYAMDLLDYNNYLSIENPIIKTRAMGTYADLIIITGSLEQVNGYYNILKALNKRNAKFVLKINENMPYAQATFLRVPKRSDPNAHTLDKGASIDENKLFEQQKKMYFNYANDVICRPDDEVCSPLRDEMVAMPTSDSVTQKPNIIAPYSLYRLKETNNANEAQPSPYATATAPENSKEKLIEELIANSQLVANEEEREKKLLAEKEKQEAELAKYKLKDLENQKKLKALEAELKKKNAKKPRVVEVPVSPQTSNSDETMRVVKEKENYNGLLVDKETTIKRSYEGTLISENSYSKKTPLNPNDLRSLEEEIKSYYIKSNGLCYTNGINLYVKIKNDPYKEGMLCGYESVQNLLSPLKDKLKYDKQKLQKALLKDSK 481 T 0.12 RRP36 pdbpssm F Bacteria T 6x6s 2 AE,BB,CD,DA,EC,F,FF,GB,HE,IA,JD,K,KF,LC,ME,NB,OD,PA,QC,R,SB,TE,UA,VD,W,XC,YE,ZB Km,EM,Im,CM,Gm,AM,NM,Em,LM,Cm,JM,Am,Nm,HM,Lm,FM,Jm,DM,Hm,BM,Fm,MM,Dm,KM,Bm,IM,Mm,GM A0A2J9KJL4_HELPX Type IV secretion system apparatus protein CagM MLAKIVFSSLVAFGVLSANVEQFGSFFNEIKKEQEEVAAKEDALKARKKLLNNTHDFLEDLIFRKQKIKELMDHRAKVLSDLENKYKKEKEALEKETRGKILTAKSKAYGDLEQALKDNPLYRKLLPNPYAYVLNQETFTKEDRERLSYYYPQVKTSSIFKKTTATTKDKAQALLQMGVFSLDEEQNKKASRLALSYKQAIEEYSNNVSNLLSRKELDNIDYYLQLERNKFDSKAKDIAQKATNTLIFNSERLAFSMAIDKINEKYLRGYEAFSNLLKNVKDDVELNTLTKNFTNQKLSFAQKQKLCLLVLDSFNFDTQSKKSILKKTNEYNIFVDSDPMMSDKTTMQKEHYKIFNFFKTVVSAYRNNVAKNNPFE 376 T 0.021 DUF4363 pdb F Bacteria T 6x6s 4 BC,DB,FA,H,HF,JE,LD,NC,PB,RA,T,VE,XD,ZC GU,EU,CU,AU,NU,LU,JU,HU,FU,DU,BU,MU,KU,IU Unknown protein fragment XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 160 F F F 6x78 3 E,F G,I HIV fusion peptide 512-519 V2 AVGLGAVF 8 T 4 DUF3918 pdbhh F F 6x7i 1 A A B2CL1_HUMAN BCL2-L-1,APOPTOSIS REGULATOR BCL-X GQERFNRWFLTGMTVAGVVLLGSLFSRK 28 T 0.057 DUF3094 pdb F Eukaryota T 6x7w 2 B,F G,D HIV fusion peptide 512-519 XAVGLGAVF 9 T 4.6 DUF3918 pdbhh F T 6x89 1 A A Unknown Peptide XXXXXXXXXXXXXXXXXX 18 F F F 6x89 5 E A7 A0A1S3UVC7_VIGRR NDUA7 MAKSASNSLVQTLKRYIKKPWEITGPCADPEYRSAVPLATEYRLQCPATTKEKPCIPNSLPETVYDIKYFSRDQRRNRPPIRRTVLKKADVEKLAKEQTFAVSDFPPVYLNSAVEEDINAIGGGYQG 127 T 0.0011 CI-B14_5a pdb F Eukaryota T 6x89 26 Z B Unknown Peptide XXXXXXXXXXXXXXXXXXXXXXXXX 25 F F F 6x89 27 AA C Unknown Peptide XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 43 F F F 6x89 29 CA P2 A0A1S3TGE7_VIGRR Protein At2g27730, mitochondrial MAARVAARYGSRRLFSSGSGKILSEEEKAAENAYFKKAEQDKLEKLARKGPQPEASSGGSVIDAKPSGSGHTGASAERVSTDKHRNYAVVAGTITILGALGWYLKGTAKKPEVQD 115 T 0.0082 IATP unphh F Eukaryota T 6x8h 3 C C Ac-DW3-KE XXDXFX 6 T 910 SEC-C pdbhh F F 6x8i 3 E,F E,F ketomethylene inhibitor XDEVX 5 T 570 Helicase_RecD pdbhh F F 6x8j 3 E,F E,F ketomethylene inhibitor XDEVX 5 T 570 Helicase_RecD pdbhh F F 6x8k 3 E,F E,F ketomethylene inhibitor XDEVXAAA 8 T 78 UPF0160 pdbhh F F 6x8l 3 E,F E,F ketomethylene inhibitor XDEVXAAA 8 T 78 UPF0160 pdbhh F F 6x8n 1 A,B A,B De novo designed ABLE protein SVKSEYAEAAAVGQEAVAVFNTMKAAFQNGDKEAVAQYLARLASLYTRAEELLNRILEKARREGNKEAVTLMNEFTATFQTGKSIFNAMVAAFKNGDDDSFESYLQALEKVTAKGETLADQIAKAL 126 T 0.028 ASD2 pdbpssm F T 6x8p 3 C P CSP_PLABA NPND peptide PPPPNPNDPPPPNPND 16 T 110 Herpes_US12 pdbhh F Eukaryota F 6x8q 3 C P CSP_PLABA PAPP peptide PAPPNANDPAPPNAND 16 T 62 TMEM220 pdbhh F Eukaryota F 6x8r 1 A A SxIIIC peptide RGCCNGRGGCSSRWCRDHARCCX 23 T 0.041 Mu-conotoxin pdbpssm F T 6x8s 3 C P CSP_PLABA NAND peptide PPPPNANDPPPPNAND 16 T 14 SRA1 pdbhh F Eukaryota F 6x8u 3 C P CSP_PLABA Mixed peptide PPPPNPNDPAPPNAND 16 T 16 Orbi_NS3 pdbhh F Eukaryota F 6xa1 83 EC NC Stalled Nascent chain GLQIPAILGILGGILALLILILNPN 25 T 0.022 Phage_holin_5_2 pdb F T 6xa4 2 B B inhibitor UAW241 XLLX 4 T 1700 EF-hand_1 pdbhh F F 6xar 2 C,D C,D SLAP2_MOUSE SRC-LIKE ADAPTER PROTEIN 2,SLAP-2 LSEGLRESLSSYISLAEDP 19 T 0.59 RHH_6 pdbhh F Eukaryota T 6xaw 2 B B UME6_YEAST NEGATIVE TRANSCRIPTIONAL REGULATOR OF IME2,REGULATOR OF INDUCER OF MEIOSIS PROTEIN 16,UNSCHEDULED MEIOTIC GENE EXPRESSION PROTEIN 6 GPRSRLLLGPNSASSSTKLDDDLGTAAAVLSNMRSSPYRTHDKPIS 46 T 4.6 ACC_epsilon pdbhh F Eukaryota T 6xbd 5 M,N M,N MSP1D1 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 164 F F F 6xbe 2 B,D,F,H F,G,H,I macrocycle inhibitor NDM1i-1F XXLXPVPE 8 T 1.6 LAG1-DNAbind pdbhh F F 6xbf 2 B,D,F,H F,G,H,I macrocycle inhibitor NDM1i-1G XXLXPIPE 8 T 3 LAG1-DNAbind pdbhh F F 6xbg 2 C,D C,E inhibitor UAW246 XLXX 4 T 2000 zf-C2H2 pdbhh F F 6xbh 2 B C inhibitor UAW247 XFX 3 T 530 zf-C2H2_11 pdbhh F F 6xbi 2 C,D D,E inhibitor UAW248 XLLXX 5 T 2000 EF-hand_1 pdbhh F F 6xc0 2 C,D C,D SPAC_BPT4 Protein spackle MKKFIFATIFALASCAAQPAMAGYDKDLCEWSMTADQTEVETQIEADIMNIVKRDRPEMKAEVQKQLKSGGVMQYNYVLYCDKNFNNKNIIAEVVGELEHHHHHH 105 T 0.0054 Mfp-3 unphh T Viruses T 6xc1 2 B C SPAC_BPT4 Protein spackle MKKFIFATIFALASCAAQPAMAGYDKDLCEWSMTADQTEVETQIEADIMNIVKRDRPEMKAEVQKQLKSGGVMQYNYVLYCDKNFNNKNIIAEVVGELEHHHHHH 105 T 0.0054 Mfp-3 unphh T Viruses T 6xch 2 B C Leupeptin XLLX 4 T 590 DNTTIP1_dimer pdbhh F F 6xci 2 C H Macrocycle NDM1i-3D XPXPEXXX 8 F F F 6xf7 1 A,B B,C LMBD1_REOVL Lambda 1 protein QRHITEFISSWQNHPIVQVSADVENKKTAQLLHADTPRLVTWDAGLCTSFKIVPIVPAQVPQDVLAYTFFTSSYAIQSPFPEAAVSRIVVHTRWASNVDFDRDSSVIMAPPTENNIHLFKQLLNTETLSVRGANPLMFRANVLHMLLEFVLDNLYLNRHTGFSQDHTPFTEGANLRSLPGPDAEKWYSIMYPTRMGTPNVSKICNFVASCVRNRVGRFDRAQMMNGAMSEWVDVFETSDALTVSIRGRWMARLARMNINPTEIEWALTECAQGYVTVTSPYAPSVNRLMPYRISNAERQISQIIRIMNIGNNATVIQPVLQDISVLLQRISPLQIDPTIISNTMSTVSESTTQTLSPASSILGKLRPSNSDFSSFRVALAGWLYNGVVTTVIDDSSYPKDGGSVTSLENLWDFFILALALPLTTDPCAPVKAFMTLANMMVGFETIPMDNQIYTQSRRASAFSTPHTWPRCFMNIQLISPIDAPILRQWAEIIHRYWPNPSQIRYGAPNVFGSANLFTPPEVLLLPIDHQPANVTTPTLDFTNELTNWRARVCELMKNLVDNQRYQPGWTQSLVSSMRGTLDKLKLIKSMTPMYLQQLAPVELAVIAPMLPFPPFQVPYVRLDRDRVPTMVGVTRQSRDTITQPALSLSTTNTTVGVPLALDARAITVALLSGKYPPDLVTNVWYADAIYPMYADTEVFSNLQRDMITCEAVQTLVTLVAQISETQYPVDRYLDWIPSLRASAATAATFAEWVNTSMKTAFDLSDMLLEPLLSGDPRMTQLAIQYQQYNGRTFNVIPEMPGSVIADCVQLTAEVFNHEYNLFGIARGDIIIGRVQSTHLWSPLAPPPDLVFDRDTPGVHIFGRDCRISFGMNGAAPMIRDETGMMVPFEGNWIFPLALWQMNTRYFNQQFDAWIKTGELRIRIEMGAYPYMLHYYDPRQYANAWNLTSAWLEEITPTSIPSVPFMVPISSDHDISSAPAVQYIISTEYNDRSLFCTNSSSPQTIAGPDKHIPVERYNILTNPDAPPTQIQLPEVVDLYNVVTRYAYETPPITAVVMGVP 1059 T 0.59 DUF5810 unppercent T Viruses T 6xf8 4 G,H C,B LMBD1_REOVL LAMBDA1,ATP-DEPENDENT DNA HELICASE LAMBDA-1,LAMBDA1(HEL) QRHITEFISSWQNHPIVQVSADVENKKTAQLLHADTPRLVTWDAGLCTSFKIVPIVPAQVPQDVLAYTFFTSSYAIQSPFPEAAVSRIVVHTRWASNVDFDRDSSVIMAPPTENNIHLFKQLLNTETLSVRGANPLMFRANVLHMLLEFVLDNLYLNRHTGFSQDHTPFTEGANLRSLPGPDAEKWYSIMYPTRMGTPNVSKICNFVASCVRNRVGRFDRAQMMNGAMSEWVDVFETSDALTVSIRGRWMARLARMNINPTEIEWALTECAQGYVTVTSPYAPSVNRLMPYRISNAERQISQIIRIMNIGNNATVIQPVLQDISVLLQRISPLQIDPTIISNTMSTVSESTTQTLSPASSILGKLRPSNSDFSSFRVALAGWLYNGVVTTVIDDSSYPKDGGSVTSLENLWDFFILALALPLTTDPCAPVKAFMTLANMMVGFETIPMDNQIYTQSRRASAFSTPHTWPRCFMNIQLISPIDAPILRQWAEIIHRYWPNPSQIRYGAPNVFGSANLFTPPEVLLLPIDHQPANVTTPTLDFTNELTNWRARVCELMKNLVDNQRYQPGWTQSLVSSMRGTLDKLKLIKSMTPMYLQQLAPVELAVIAPMLPFPPFQVPYVRLDRDRVPTMVGVTRQSRDTITQPALSLSTTNTTVGVPLALDARAITVALLSGKYPPDLVTNVWYADAIYPMYADTEVFSNLQRDMITCEAVQTLVTLVAQISETQYPVDRYLDWIPSLRASAATAATFAEWVNTSMKTAFDLSDMLLEPLLSGDPRMTQLAIQYQQYNGRTFNVIPEMPGSVIADCVQLTAEVFNHEYNLFGIARGDIIIGRVQSTHLWSPLAPPPDLVFDRDTPGVHIFGRDCRISFGMNGAAPMIRDETGMMVPFEGNWIFPLALWQMNTRYFNQQFDAWIKTGELRIRIEMGAYPYMLHYYDPRQYANAWNLTSAWLEEITPTSIPSVPFMVPISSDHDISSAPAVQYIISTEYNDRSLFCTNSSSPQTIAGPDKHIPVERYNILTNPDAPPTQIQLPEVVDLYNVVTRYAYETPPITAVVMGVP 1059 T 0.59 DUF5810 unppercent T Viruses T 6xfk 2 B B SCTC_SALTY T3SS SECRETIN,PROTEIN INVG DDKLQKWVRVYLDRGQ 16 T 0.062 DUF3963 pdbhh F Bacteria T 6xfl 2 B B SCTC_SALTY T3SS SECRETIN,PROTEIN INVG GSHMDPLTPDASESVNNILKQSGAWSGDDKLQKWVRVYLDRGQEAIK 47 T 0.11 DUF3485 pdbpssm F Bacteria T 6xfm 1 A,B,C,D,E,F,G,H 1,2,3,4,5,6,7,8 FUS_HUMAN 75 KDA DNA-PAIRING PROTEIN,ONCOGENE FUS,ONCOGENE TLS,POMP75,TRANSLOCATED IN LIPOSARCOMA PROTEIN GSYGSSSQSSSYGQPQSGSYSQQPSYGGQQQSYGQQQSYNPPQGYGQQNQYNSSSGGGGGGGGGGNYGQDQSSMSSGGGSGGGYGNQDQSGGGGSGGYGQGDRG 104 T 920 HMMR_N pdbhh F Eukaryota T 6xfn 2 B B UAW243 XLXX 4 T 2000 zf-C2H2 pdbhh F F 6xi2 5 E G ALA-ALA-ALA-ALA-ALA-ALA-ALA-ALA-ALA-ALA AAAAAAAAAA 10 T 200 FAD_oxidored pdbhh F F 6xi2 6 F H ALA-GLY-ALA-GLY-ALA-ALA-ALA-ALA-ALA-ALA AGAGAAAAAA 10 T 2.5 DUF3918 pdbhh F F 6xi6 1 A A helical fusion design GKELEIVARLQQLNIELARKLLEAVARLQELNIDLVRKTSELTDEKTIREEIRKVKEESKRIVEEAEQEIRKAEAESLRLTAEAAADAARKAALRMGDERVRRLAAELVRLAQEAAEEATRDPNSSDQNEALRLIILAIEAAVRALDKAIEKGDPEDRERAREMVRAAVRAAELVQRYPSASAANEALKALVAAIDEGDKDAARCAEELVEQAEEALRKKNPEEARAVYEAARDVLEALQRLEEAKRRGDEEERREAEERLRQACERARKKN 272 T 0.00026 SMBP pdb F T 6xi8 1 A C TFB3_YEAST RNA POLYMERASE II TRANSCRIPTION FACTOR B 38 KDA SUBUNIT,RNA POLYMERASE II TRANSCRIPTION FACTOR B P38 SUBUNIT PFNGDREAHPPFTLKGSVYNDPFIKDLEHRKEFIASGFNTNYAYERVLTEAFMGLGCVISEEL 63 T 6.1 DUF6190 pdbhh F Eukaryota T 6xib 3 C I Peptide 30 XCKGWWDHYXCA 12 T 0.047 EPV_E5 pdbhh F T 6xic 3 C I Peptide 40 XCKXWWDHYXX 11 T 1.7 DUF5958 pdbhh F T 6xid 3 C I Peptide 51 XCKXWWPTYXCA 12 T 0.35 ZinT pdbhh F T 6xie 3 C I Peptide 77 XCAXXWQTXXC 11 T 11 DUF2754 pdbhh F T 6xif 3 C I Peptide 83 XGAXXXXXXXG 11 T 500 DUF5507 pdbhh F F 6xkb 2 F,G,H,I,J F,G,I,J,K RPB1_HUMAN S2,S5p-CTD peptide XSPSYSPTSPSYSPTSPSYS 20 T 0.022 RNA_pol_Rpb1_R pdbpssm F Eukaryota F 6xke 1 A,B,C A,B,C A0A1Y9G8D0_ANOAL Albicin ANNHIRTVLKLFRTIDLDDSKKSFYLTAAKYGIQTQLREPIIRIVGGYLPSTKLSEACVKNMISEVYEIEGDFYSKFSYACEDHAPYSVECLEDARDDYLTQLVELFKETKKCLRE 116 T 0.094 Herpes_UL55 pdbpssm F Eukaryota T 6xl7 1 A A SG7.AF ARKHVQELLKTFRRIDFDETRKSVYLQSAKFGVQSQLREPLTKKVLNYWDDVKLSKTCLDRMVTKVNDVKETFYAGFSYACESHNQYSVDCLEAAKPSYLTALGEIRGETEKCLTTRLK 119 T 5.8 HECW1_helix pdbhh F T 6xli 3 G,H,I E,F,P TAU_HUMAN Tau Phosphopeptide (Ac-SR(pT)PSLP(pT)PPTRE-OH) XSRTPSLPTPPTRE 14 T 2.3 UPF0449 pdbhh F Eukaryota T 6xmb 1 A,B,C A,B,C A5HUP6_ANOST Anophensin TEATRKHVQQLMKVFRAIDFDFTKKAFYLHRAKYGVQNQLRNPLYLKAMSLPRSAKLSQPCLNKMIDEVNDLESTFYAGFSFNCHDHDQYSMDCLEAAEPTYLDGLKKLAASTEQCLVQK 120 T 0.025 PL48 pdbpssm F Eukaryota T 6xmi 3 E,F F,C TERL_BPP22 DNA-PACKAGING PROTEIN GP2,GENE PRODUCT 2,GP2 MELDAILDNLSDEEQIELLELLEEEENYRNTHL 33 T 0.0057 DUF3775 pdbhh T Viruses T 6xmn 2 B B CXCR1_HUMAN CXCR-1,CDW128A,HIGH AFFINITY INTERLEUKIN-8 RECEPTOR A,IL-8R A,IL-8 RECEPTOR TYPE 1 MSNITDPQMWDFDDLNFTGMPPADEDYSP 29 T 0.01 FA_desaturase unppercent F Eukaryota T 6xmu 2 B B Putative endogenous substrate transmembrane helix XXXXXXXXXXXXXXXXXXXX 20 F F F 6xn9 1 A A Recifin modulatory peptide QEAFCYSDRFCQNYIGSIPDCCFGRGSYSFELQPPPWECYQC 42 T 3.5 Rubredoxin pdbhh F T 6xnj 2 B B Q8XAN6_ECO57 NleG8 peptide LATQNICTRI 10 T 5 DUF3894 pdbhh F Bacteria T 6xnr 1 A,B,C,D,E AAA,BBB,CCC,DDD,EEE Antifreeze protein MYSCRAVGVDASTVTDVQGTCHAKATGPGAVASGTSVDGSTSTATATGSGATATSTSTGTGTATTTATSNAAATSNAIGQGTATSTATGTAAARAIGSSTTSASATEPTQTKTVSGPGAQTATAIAIDTATTTVTASLEHHHHHH 145 T 4.8 Sporozoite_P67 pdbpercent F T 6xns 1 A,B,C,D,E,F A,B,C,D,E,F C3_crown-05 MGDRSDHAKKLKTFLENLRRHLDRLDKHIKQLRDILSENPEDERVKDVIDLSERSVRIVKTVIKIFEDSVRKLLKQINKEAEELAKSPDPEDLKRAVELAEAVVRADPGSNLSKKALEIILRAAAELAKLPDPDALAAAARAASKVQQEQPGSNLAKAAQEIMRQASRAAEEAARRAKETLEKAEKDGDPETALKAVETVVKVARALNQIATMAGSEEAQERAARVASEAARLAERVLELAEKQGDPEVARRARELQEKVLDILLDILEQILQTATKIIDDANKLLEKLRRSERKDPKVVETYVELLKRHERLVKQLLEIAKAHAEAVEGGSLEHHHHHH 340 T 0.0088 DUF327 pdbpercent F T 6xod 2 B B PEX22_ARATH PEROXIN-22,ATPEX22 GPAVQDVVDQFFQPVKPTLGQIVRQKLSEGRKVTCRLLGVILEETSPEELQKQATVRSSVLEVLLEITKYSDLYLMERVLDDESEAKVLQALENAGVFTSGGLVKDKVLFCSTEIGRTSFVRQLEPDWHIDTNPEISTQLARFIKYQLHVATVKPERTAPNVFTSQSIEQFFGSV 175 T 0.00058 Peroxin-22 pdbhh F Eukaryota T 6xor 1 A,B A,B SWA_DROME Protein swallow SFDRLLAENESLQQKINSLEVEAKRLQGFNEYVQERLDRITDDFVKMKDNFETLRTELSEAQQKLRRQQDN 71 T 0.00023 HALZ pdbpssm F Eukaryota T 6xov 3 C C Amyloid-beta precursor protein XXX 3 F F F 6xp5 14 N b Unknown peptide XXXXXXXXXXXXXXXXXXXXX 21 F F F 6xp5 15 O c HEAT XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 200 F F F 6xp6 5 I,J C,F DQ2-glia-a2 peptide AAPQPELPYPQPGSGGSIEGRGGSGA 26 T 23 Dicty_CAD pdbhh F T 6xqi 3 G,H H,I ASN-PRO-LEU-GLU-PHE-LEU NPLEFL 6 T 3 DBB pdbhh F F 6xr1 1 A A dTor_9x57R GNLELALKALQILVNAAYVLAEIARDRGNEELLEKAARLAEEAARQAEEIARQARKEGNLELALKALQILVNAAYVLAEIARDRGNEELLEKAARLAEEAARQAEEIARQARKEGNLELALKALQILVNAAYVLAEIARDRGNEELLEKAARLAEEAARQAEEIARQARKEGNLELALKALQILVNAAYVLAEIARDRGNEELLEKAARLAEEAARQAEEIARQARKEGNLELALKALQILVNAAYVLAEIARDRGNEELLEKAARLAEEAARQAEEIARQARKEGNLELALKALQILVNAAYVLAEIARDRGNEELLEKAARLAEEAARQAEEIARQARKEGNLELALKALQILVNAAYVLAEIARDRGNEELLEKAARLAEEAARQAEEIARQARKEGNLELALKALQILVNAAYVLAEIARDRGNEELLEKAARLAEEAARQAEEIARQARKEGNLELALKALQILVNAAYVLAEIARDRGNEELLEKAARLAEEAARQAEEIARQARKEG 514 T 0.31 Ribosomal_S21 pdb F T 6xr2 1 A,B,C,D,E,F A,B,C,D,E,F dTor_3x57R GNLELALKALQILVNAAYVLAEIARDRGNEELLEKAARLAEEAARQAEEIARQARKEGNLELALKALQILVNAAYVLAEIARDRGNEELLEKAARLAEEAARQAEEIARQARKEGNLELALKALQILVNAAYVLAEIARDRGNEELLEKAARLAEEAARQAEEIARQARKEG 172 T 0.053 Ribosomal_S21 pdb F T 6xrf 2 C,F,I C,F,I TSE6_PSEAE PAAR motif family protein MDAQAAARLGDEIAHGFGVAAMVAGAVAGALIGAAVVAATAATGGLAAVILAGSIAAGGLSHHHHHH 67 T 0.14 MaoC_dehydratas pdb F Bacteria T 6xro 2 B B peptide boronate inhibitor KRFRSMQYSA 10 T 5 NmU-R2_C_term pdbhh F T 6xrp 2 B B peptide ketoamide inhibitor ARVWHA 6 T 18 Glyco_hydro_64 pdbhh F F 6xs5 2 B B RT-D1 XXIIDTPLGVFLSSLKR 17 T 0.27 RNA_GG_bind pdbhh F T 6xs7 2 B B 48V-DTY-THR-THR-ILE-TYR-TRP-THR-PRO-LEU-GLY-THR-PHE-PRO-ARG-ILE-ARG XXTTIYWTPLGTFPRIR 17 T 2.4 DUF3438 pdbhh F T 6xs8 2 B B 48V-DTY-GLY-TYR-ASP-PRO-LEU-GLY-LEU-LYS-TYR-PHE-ALA XXGYDPLGLKYFA 13 T 1.5 Ntox47 pdbhh F T 6xs9 3 C,D,E,F C,D,E,F 48V-TYR-ILE-LYS-THR-PRO-LEU-GLY-THR-PHE-PRO-ASN-ARG-HIS-GLY XYIKTPLGTFPNRHG 15 T 0.078 MtrB pdbhh F T 6xsa 2 B B 48V-TYR-LEU-PRO-THR-ILE-THR-GLY-VAL-GLY-HIS-LEU-TRP-HIS-PRO-LEU XYLPTITGVGHLWHPL 16 T 0.031 Exonuc_VII_L pdbhh F T 6xt4 1 A A 1BH_69 MGHHHHHHGGDSLDMLEWSLGSNDEKEKLKELLKRAEELAKSPDPEDLKEAVRLAEEVVRERPGSNLAKKALEIILRAAEELAKLPDPKALIAAVLAAIKVVREQPGSNLAKKALEIILRAAEELAKLPDPLALAAAVVAATIVVLTQPGSELAKKALEIIERAAEELKKSPDPLAQLLAIAAEALVIALKSSSEETIKEMVKLTTLALLTSLLILILILLDLKEMLERLEKNPDKDVIVKVLKVIVKAIEASVLNQAISAINQILLALSD 271 T 0.013 Dak2 pdbpercent F T 6xtd 1 A A A0A1C3HFI3_SERMA RHS1-CT MGSSHHHHHHSQDPKPRCAATKANDHNQAAFGRQWQGRGIYKGRDSWSNIMLKEGDIVYGGAPGQSGFYFNKATLDAAGGSRAKLWESLQVLPHEKFGYRSKIQAYRVKRETIAGTGKAISQDPTRFGEGGGTQFFLSNYKTVLEPIDKPFEIGL 155 T 0.28 TetR_C_18 pdb F Bacteria T 6xtd 2 B B RHSI1 MMQLDTYDGTLELAGITLGTATTREMLIKGSRLWEGWPEKSDGRTTSYRTIISTKKEKAGDIYIIADFSGAFITDAVLCSWRFAPEKLMMGIQKKVEGAITKNLRTWFYEKTHIQLPVSGSWGHIDAAYDPHNLTGTIVCNYRSAFHTEDEWRKYCKRNNIIY 163 T 1.3 Fip1 pdbhh F T 6xte 2 B C Antipain XRVX 4 T 41 Receptor_IA-2 pdbhh F F 6xth 1 A A A0A5P9PRQ2_9PSEU Felipeptin A1 GSRGWGFEPGVRCLIWCD 18 T 9.2 2EXR pdbhh F Bacteria T 6xti 1 A A A0A5P9PSL4_9PSEU Felipeptin A2 GGGGRGYEYNKQCLIFC 17 T 6 MFA1_2 pdbhh F Bacteria T 6xtt 1 A B Q5ZVQ5_LEGPH NttA MAHHHHHHVDDDDKMEDTANPNEMTKDAWLNSMTPLLPDLICKGFIQDPDLKKRFDEIKMTYEQCVTLIPESTKKCQDELYASMPDKINSETAGTWGRSLGECIGKDFAEKHLIPK 116 T 0.62 PAGK unphh F Bacteria T 6xu2 2 B B Antipain XRVX 4 T 41 Receptor_IA-2 pdbhh F F 6xvd 2 B P upain-1-W3F CSFRGLENHAMC 12 T 1.2 LRRNT pdbhh F T 6xvt 2 C,D G,H ACY-SC1-SC2-SC3-SC4-SC5-NME XXPPPX 6 T 310 SK_channel pdbhh F F 6xwd 2 B P AMPN_HUMAN Amino peptidase N 38-46 NKNANSSPV 9 T 93 DUF6446 pdbhh F Eukaryota T 6xwi 1 A A S0_2.126 ASPCDKQKNYIDKQLLPIVNKAGCSRPEEVEERIRRALKKMGDTSCFDEILKGLKEIKCGGSWLEHHHHHH 71 T 0.18 Spo0A_C pdb F T 6xwu 1 A A Q9VHP9_DROME RE68959p MSKPQNNDTLELDDILSQPVKDKERFAAFMMRKLAENKPAQNDNLFGNFKLDFDLDFEVPLIKKSQAKPKSKLPEVQPLGELVSKNSAATEKVNEPPVDQAPNENVPPRRSPTLSPNNRRSMRRSGNVPGSDKLRRHAIRRRSRSCGRQLLPEFEETVNLTRSISSPVNFLPEISSTPCTEKQKEEVAKNTTRVETDKPAEKPMELSQEPEPENPLQTKVTSPARNPILAAEIEQICKERQSSFHKNVLQLDYSGRAPYSRPPTPSSPSVAGLRRTYTMEKGPAPGQLLLSPSHRYDTPSKMPVVKAKRFNQELMVPDTPERQSHDPAWQSEPQPEFVVPETQPQDLGELVQTLSRSAISPIVVINTSNSNRSVRRDAVAMKSVPTSPVTALSSPPIAPSPRRSAAASPQKSIAQLPRVEENMDAIMTDDESDEHPSTVPLNLAPSGGNTTRQRRLRSSNRARATIESQESSMRLLNLHKSVNAKKSKPRKTAIPLNKAPSAPINGEQFARELTRMSNYEILDLRKRNSLNEIYPLNGHRNHRSEKLILEEEIQRELLRRNLMDEAEGLPKQQSSDDSNEDYIPVPPKTQSLRTKSNDRSQGRGRPRSTRRDLPMTTELVNYLGLSQTLETRRKSSKDGKRCLYTKGSSDHEDNDSLSPVKLPRLSKSIQIVPPPPVSLRYSQSLQNLPCSGKFDFDNVVMAAPPDFHDSVNSDAIEIAPPPPEYVVNTRGRSTSGRKSNKNDLVLPPPGYEGGQEEEHDERPSQPRCTAKELQQSTQNGRRAMENELVPPPIEYVEEENRNNEQSRRSTKNGNLVDRNTHNAVEYCEPPEPPEYDDSDHGQASILRRSGKKLQHSKQSVQKSNKEQIIAPSYENNEDYDSDEEPIYNEEYGKEESQNKNVTRRKSDKDEMASHTLECIEGPDPNWNSSCNKQNRNHQNASKSKENDKLANRSSKSQKLSNPRQNAVGTEKSVALSNRGEECTEKSSDVMESLRVNTPTPPIDQNSDDVPSRNPSPSRTLLSDDVPSTSRAALEFLQRSQNMSKSRPPDESSADVVFKKPLAPAPRAKSKKGKSEVDKLKLAKMPVEAEELNTTGIRRSKRGQVPLQMSWCHTMDPSKFNFMSGFIEPRSKNSKTKKGNLSKAKKASATKPKPTVEKNLPDNRGPLCSSTPRISEKLPGAIPHSESLGLSTLTWEETEVQAEAEKVPKKRGRPKKAVGGVQTDTEAEPEPEPEPMISSVAPLTSDQEEPDVPDEQAPYTEAALGPVVFSTPLRDEQEEASTKLMQWLRGVGDAPPSASMSDENASVSSANELIFYQVDGIDYAFYNTKEKAMLGYMRFKPYQKRSMKQAKVHPLKLLVQFGEFNVETLAVGEEKEVHSVLRVGDMIEIDRGTRYSIQNAIDKVSVLMCIRS 1411 T 0.00049 CENP-C_C unppssm F Eukaryota T 6xwv 1 A,C,D,E A,B,C,D Q9VHP9_DROME Calmodulin MSKPQNNDTLELDDILSQPVKDKERFAAFMMRKLAENKPAQNDNLFGNFKLDFDLDFEVPLIKKSQAKPKSKLPEVQPLGELVSKNSAATEKVNEPPVDQAPNENVPPRRSPTLSPNNRRSMRRSGNVPGSDKLRRHAIRRRSRSCGRQLLPEFEETVNLTRSISSPVNFLPEISSTPCTEKQKEEVAKNTTRVETDKPAEKPMELSQEPEPENPLQTKVTSPARNPILAAEIEQICKERQSSFHKNVLQLDYSGRAPYSRPPTPSSPSVAGLRRTYTMEKGPAPGQLLLSPSHRYDTPSKMPVVKAKRFNQELMVPDTPERQSHDPAWQSEPQPEFVVPETQPQDLGELVQTLSRSAISPIVVINTSNSNRSVRRDAVAMKSVPTSPVTALSSPPIAPSPRRSAAASPQKSIAQLPRVEENMDAIMTDDESDEHPSTVPLNLAPSGGNTTRQRRLRSSNRARATIESQESSMRLLNLHKSVNAKKSKPRKTAIPLNKAPSAPINGEQFARELTRMSNYEILDLRKRNSLNEIYPLNGHRNHRSEKLILEEEIQRELLRRNLMDEAEGLPKQQSSDDSNEDYIPVPPKTQSLRTKSNDRSQGRGRPRSTRRDLPMTTELVNYLGLSQTLETRRKSSKDGKRCLYTKGSSDHEDNDSLSPVKLPRLSKSIQIVPPPPVSLRYSQSLQNLPCSGKFDFDNVVMAAPPDFHDSVNSDAIEIAPPPPEYVVNTRGRSTSGRKSNKNDLVLPPPGYEGGQEEEHDERPSQPRCTAKELQQSTQNGRRAMENELVPPPIEYVEEENRNNEQSRRSTKNGNLVDRNTHNAVEYCEPPEPPEYDDSDHGQASILRRSGKKLQHSKQSVQKSNKEQIIAPSYENNEDYDSDEEPIYNEEYGKEESQNKNVTRRKSDKDEMASHTLECIEGPDPNWNSSCNKQNRNHQNASKSKENDKLANRSSKSQKLSNPRQNAVGTEKSVALSNRGEECTEKSSDVMESLRVNTPTPPIDQNSDDVPSRNPSPSRTLLSDDVPSTSRAALEFLQRSQNMSKSRPPDESSADVVFKKPLAPAPRAKSKKGKSEVDKLKLAKMPVEAEELNTTGIRRSKRGQVPLQMSWCHTMDPSKFNFMSGFIEPRSKNSKTKKGNLSKAKKASATKPKPTVEKNLPDNRGPLCSSTPRISEKLPGAIPHSESLGLSTLTWEETEVQAEAEKVPKKRGRPKKAVGGVQTDTEAEPEPEPEPMISSVAPLTSDQEEPDVPDEQAPYTEAALGPVVFSTPLRDEQEEASTKLMQWLRGVGDAPPSASMSDENASVSSANELIFYQVDGIDYAFYNTKEKAMLGYMRFKPYQKRSMKQAKVHPLKLLVQFGEFNVETLAVGEEKEVHSVLRVGDMIEIDRGTRYSIQNAIDKVSVLMCIRS 1411 T 0.00049 CENP-C_C unppssm F Eukaryota T 6xxc 2 B B ERR3_HUMAN Estrogen Related Receptor gamma phosphopeptide KRRRKSCQA 9 T 23 PHD20L1_u1 pdbhh F Eukaryota T 6xxf 2 B BBB RyR2 Peptide KKAVWHKLLSKQRKRAVVACF 21 T 3.2 DUF5463 pdbhh F T 6xxr 2 C,D F,G Ac-[2-Cl-F]-PPPPTEDEA-NH2 XXPPPPX 7 T 100 Agenet pdbhh F F 6xxs 2 C,D,F,H C,D,G,H NCOR1_HUMAN N-COR1 GITTIKEMGRSIHEIPR 17 T 0.61 DUF211 pdbhh F Eukaryota T 6xxx 2 B BBB LYS-LYS-ALA-VAL-TRP-HIS-LYS-LEU-LEU-SER-LYS-GLN-ARG-LYS-ARG-ALA-VAL-VAL-ALA-CYS-PHE KKAVWHKLLSKQRKRAVVACF 21 T 3.2 DUF5463 pdbhh F T 6xxz 1 A,B A,B 2-EK-4 XGEIKQQLAEIKQQLAEIKWQLAEIKQQLAGX 32 T 0.0016 DUF5320 pdbhh F T 6xy0 1 A,B,C,D A,B,C,D 3-EK-4 XGAIQQELKAIQQELKAIQWELKAIQQELKGX 32 T 0.0037 DUF5320 pdbhh F T 6xy1 1 A,B,C,D A,B,C,D 4-KE-4 XGEIQKQLKEIQKQLKEIQWQLKEIQKQLKGX 32 T 0.0015 DUF5320 pdbhh F T 6xy3 2 B BBB RyR2 peptide KKAVWHKLLSKQRKRAVVACF 21 T 3.2 DUF5463 pdbhh F T 6xy5 2 B B ERR3_HUMAN Estrogen Related Receptor gamma phosphopeptide KRRRKSCQA 9 T 23 PHD20L1_u1 pdbhh F Eukaryota T 6xya 1 A B L_SFTS RNA-dependent RNA polymerase GPAQSGTLGGFSKPQKTFVRPGGGVGYKGKGVWTGVMEDTHVQILIDGDGTSNWLEEIRLSSDARLYDVIESIRRLCDDLGINNRVASAYRGHCMVRLSGFKIKPASRTDGCPVRIME 118 T 2.2 DUF5363 pdbhh T Viruses T 6xyb 1 A,B,C,D A,B,C,D Q4D6Q6_TRYCC Uncharacterized protein GSHMPNLCVSATFNPPVITMLGSALREETVKLLEQRIPTGVSTSSSPSKDPVKFLFYPNPDHWRMELSQHFCDDLHKSAVFLTIIEGLEGEGWNLRASNSIRDSESGKDTTKLFFARRN 119 T 0.034 DUF4177 unphh F Eukaryota T 6xyd 1 A,B,C,D A,B,C,D Q4D6Q6_TRYCC Uncharacterized protein GSHMPNLCVSATFNPPVITMLGSALREETVKLLEQRIPTGVSTSSSPSKDPVKFLFYPNPDHWRMELSQHFCDDLHKSAVFLTIIEGLEGEGWNLRASNSIRDSESGKDTTKLFFARRN 119 T 0.034 DUF4177 unphh F Eukaryota T 6xyh 1 A A AMS3 GCKNLNSHCYRQHRECCHGLVCRRPNYGNGRGILWKCVRA 40 T 0.0092 Toxin_22 pdbhh F T 6xyi 1 A A AMS9.3.1 GCKKLNSYCTRQHRECCHGLVCRRPDYGIGRGILWKCTRARK 42 T 0.0037 Toxin_12 pdb F T 6xyw 33 GA AI Q8L7U3_ARATH DECOY MPRSSLRLLAKPLLESRRGFCTSSDKIVASVLFERLRVVIPKPDPAVYAFQEFKFNWQQQFRRRYPDEFLDIAKNRAKGEYQMDYVPAPRITEADKNNDRKSLYRALDKKLYLLIFGKPFGATSDKPVWHFPEKVYDSEPTLRKCAESALKSVVGDLTHTYFVGNAPMAHMAIQPTEEMPDLPSYKRFFFKCSVVAASKYDISNCEDFVWVTKDELLEFFPEQAEFFNKMIIS 233 T 5.6E-10 MRP-L46 pdbhh F Eukaryota T 6xyw 37 KA AM Q9C9B5_ARATH TUMOR NECROSIS FACTOR RECEPTOR FAMILY PROTEIN MWFAGGGGGLRKLCRASAIFDNEISYNSLLVRYMSRERAVNVRKINPKVPIQEAYAISNSLYDLFKLHGPLSVPNTWLRAQEAGVSGLNSKTHMKLLLKWMRGKKMLKLICNQVGSSKKFFHTVLPEDPLQEQPAAPIENKKQAVKKKRSK 151 T 0.3 HARE-HTH pdbhh F Eukaryota T 6xyw 38 LA AN Q9SD44_ARATH PROTEIN TRANSLOCASE SUBUNIT MGFGAIRSILRPLSRTLVSRAVVNYSSAPFNATIPAAKPELCSFFGGSMTHLRLPWIPMANHFHSLSLTDTRLPKRRPMTHPKRKRSKLKPPGPYAYVQYTPGQPISSNNPNEGSVKRRNAKKRIGQRRAFILSEKKKRQALVQEAKRKKRIKQVERKMAAVARDRAWAERLIELQQLEEEKKKSMSS 188 T 0.032 DUF6087 pdb F Eukaryota T 6xyw 40 NA AP rPPR* AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 669 T 19000 zf_CCCH_4 pdbhh F F 6xyw 42 PA AR UNK-6 AAAAAAAAAAAAAAAAAAAAAAAAAAAAA 29 T 1400 DUF4699 pdbhh F F 6xyw 58 FB BP UNK-5 AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 91 T 15000 WW pdbhh F F 6xyw 59 GB BF mS31/mS46 AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 123 T 16000 zf_CCCH_4 pdbhh F F 6xyw 78 ZB BI rPPR* AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 266 T 18000 zf-C2H2_6 pdbhh F F 6xyw 79 AC BJ rPPR* AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 349 T 18000 zf-C2H2_6 pdbhh F F 6xyw 81 CC BN UNK-3 AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 69 T 13000 zf-C2H2_jaz pdbhh F F 6xyw 82 DC BM UNK-2 AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 79 T 14000 zf-H2C2_2 pdbhh F F 6xyw 83 EC BO UNK-4 AAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 30 T 1600 DUF4699 pdbhh F F 6xyw 84 FC BL UNK-1 AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 64 T 13000 zf-H2C2_5 pdbhh F F 6xyw 88 JC BK rPPR* AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 316 T 18000 zf-C2H2_6 pdbhh F F 6xyx 2 C,D D,C NCOR1_HUMAN N-COR1 GITTIKEMGRSIHEIPR 17 T 0.61 DUF211 pdbhh F Eukaryota T 6xzh 2 B B ARG-HIS-LYS-ILE-URL-URK-URL-LEU-GLN RHKIXXXLQ 9 T 7.1 SRC-1 pdbhh F T 6xzi 2 B B ARG-HIS-LYS-ILE-LEU-URK-UIL-URL RHKILXXX 8 T 1.3 SRC-1 pdbhh F T 6xzj 2 B B ARG-HIS-LYS-ILE-LEU-URR-UIL-URL-GLN RHKILXXXQ 9 T 1.8 SRC-1 pdbhh F T 6xzk 2 B B GLU-ASN-ALA-UIA-URL-URY-URV-UZN-LYS ENAXXXXXK 9 T 1700 zf-CCHC pdbhh F F 6xzv 2 B B URA-UIA-URL-URY-URV-UZN-LYS XXXXXXK 7 T 3400 EF-hand_5 pdbhh F F 6xzz 2 B B NCOR1_HUMAN N-COR1 RERIAAASSDLYLRPGS 17 T 3.7 B3R pdbhh F Eukaryota T 6y0g 3 C C4 Nascent peptide XX 2 F F F 6y0u 2 B,D,G E,F,H bp71 XXXLLXCLXCLLKX 14 T 4.8 DUF6395 pdbhh F F 6y0v 2 B,E,G E,G,H bp71 XXXLLXCLXCLLKX 14 T 4.8 DUF6395 pdbhh F F 6y0w 2 B,D C,D cFucRH46D XXXXXXXXXXXXXX 14 F F F 6y0x 2 C,F,N I,K,R SB6 XXXXXXXXXXXX 12 F F F 6y0x 3 D,G,I,K,M J,L,M,O,Q SB6 XXXXXXXXXXXXX 13 F F F 6y13 1 A A bp70 XHXXYXCIRCYAX 13 T 1.4 S_tail_recep_bd pdbhh F T 6y14 1 A,B A,B bp65 XKKLLKCLKCLLX 13 T 6.6 Destabilase pdbhh F F 6y18 2 B B ERR3_HUMAN Estrogen Related Receptor gamma phosphopeptide KRRRKSCQA 9 T 23 PHD20L1_u1 pdbhh F Eukaryota T 6y1q 1 A A Analog 5 PCKNXFXKTFTSCK 14 T 0.00046 Somatostatin pdb F T 6y1s 1 A A bp70 XHXXYXCIRCYAX 13 T 1.4 S_tail_recep_bd pdbhh F T 6y26 3 C C GLY-ARG-LEU-ASN-ALA-PRO-ILE-LYS-VAL GRLNAPIKV 9 T 16 DUF4861 pdbhh F T 6y27 3 C C mA GRLNAPIKV 9 T 16 DUF4861 pdbhh F T 6y28 3 C C GLY-ARG-LEU-ASN-GLU-PRO-ILE-LYS-VAL GRLNEPIKV 9 T 3.5 DUF863 pdbhh F T 6y29 3 C C mE GRLNEPIKV 9 T 3.5 DUF863 pdbhh F T 6y2a 3 C C mQ GRLNQPIKV 9 T 2 DUF3560 pdbhh F T 6y2b 3 C C mQ GRLNQPIKV 9 T 2 DUF3560 pdbhh F T 6y2l 3 C C4 Nascent polypeptide XX 2 F F F 6y38 2 C,D C,D MYO15_MOUSE Chains: C,D ERLTLPPSEITLL 13 T 5.4 DUF4875 pdbhh F Eukaryota T 6y3m 2 B P ATPase peptide QSYTV 5 T 130 DUF4642 pdbhh F F 6y3o 2 B P KKCC2_HUMAN CAMKK2 RKLSLQER 8 T 3.9 DUF2660 pdbhh F Eukaryota T 6y3r 2 B P GAB2_HUMAN Chain P PRRNTLPAMDNS 12 T 2.6 PKI pdbhh F Eukaryota T 6y3s 2 B P GAB2_HUMAN Gab2 NARSASFSQG 10 T 20 DUF6425 pdbhh F Eukaryota T 6y44 2 B P SOS1_HUMAN SOS-1 PRRRPESAPAESS 13 T 21 DUF2754 pdbhh F Eukaryota F 6y4e 1 A A B4EUK6_PROMH Fimbrial adhesin SIFSYITESTGTPSNATYTYVIERWDPETSGILNPCYGWPVCYVTVNHKHTVNGTGGNPAFQIARIEKLRTLAEVRDVVLKNRSFPIEGQTTHRGPSLNSNQECVGLFYQPNSSGISPRGKLLPGSLCGAHHHHHH 136 T 1.8 PSI_8 unp F Bacteria T 6y4f 1 A A B4EUK6_PROMH Fimbrial adhesin SIFSYITESTGTPSNATYTYVIERWDPETSGILNPCYGWPVCYVTVNHKHTVNGTGGNPAFQIARIEKLRTLAEVRDVVLKNRSFPIEGQTTHRGPSLNSNQECVGLFYQPNSSGISPRGKLLPGSLCGIAPPPVHHHHHH 141 T 1.8 PSI_8 unp F Bacteria T 6y4k 2 C,D E,F KKCC2_HUMAN CAMKK 2,CALCIUM/CALMODULIN-DEPENDENT PROTEIN KINASE KINASE BETA,CAMKK BETA RKLSLQER 8 T 3.9 DUF2660 pdbhh F Eukaryota T 6y4o 2 B B RYR2_HUMAN RYR2,CARDIAC MUSCLE RYANODINE RECEPTOR,CARDIAC MUSCLE RYANODINE RECEPTOR-CALCIUM RELEASE CHANNEL,TYPE 2 RYANODINE RECEPTOR SNARSKKAVWHKLLSKQRKRAVVACFRMAP 30 T 1.7 Spc110_C pdbhh F Eukaryota T 6y4p 2 B B RYR2_HUMAN RYR2,CARDIAC MUSCLE RYANODINE RECEPTOR,CARDIAC MUSCLE RYANODINE RECEPTOR-CALCIUM RELEASE CHANNEL,TYPE 2 RYANODINE RECEPTOR SNARSKKAVWHKLLSKQRKRAVVACFRMAP 30 T 1.7 Spc110_C pdbhh F Eukaryota T 6y4q 2 C,D C,D ACE-LEU-THR-PHE-GLY-GLU-TYR-TRP-ALA-GLN-LEU-ALA-SER XLTFGEYWAQLAS 13 T 0.42 P53_TAD pdbhh F T 6y58 2 B P ERR3_HUMAN Estrogen Related Receptor gamma phosphopeptide VYRSLSFE 8 T 1.7 AbiJ_NTD5 pdbhh F Eukaryota T 6y6b 2 C,D C,D KKCC2_HUMAN CAMKK 2,CALCIUM/CALMODULIN-DEPENDENT PROTEIN KINASE KINASE BETA,CAMKK BETA RKLSLQER 8 T 3.9 DUF2660 pdbhh F Eukaryota T 6y79 18 R S F2Z673_YARLI Subunit NESM of NADH:Ubiquinone Oxidoreductase (Complex I) MLKLHYRNFITAQHSTTNTTPTMIASVCKRAGLRAGPRAYPGVRQFALRAYNEEKELALKQRLSQLPPPGKAFVTAEGEPRPAKEAELAELAEIAALYKTDRVGILDILLLGNKHARLYRDNTALLKDYYYNGRRILDKIPVKDKQTGKVTWEIKREGAEKEDWVNQMYFLYAPSLILLLIVMVYKSREDITFWAKKELDQRVLDKHPEINDAPENERDALIVERIIAGDYDKLASLQKKATPTPATLI 249 T 0.0013 ESSS pdbpercent F Eukaryota T 6y79 23 W Z Q6CI10_YARLI Subunit NUZM of NADH:Ubiquinone Oxidoreductase (Complex I) MLPGGPVPVFKKYTVGSKGIWEKLRVLLAIAPNRSTGNPIVPLYRVPTPGSRPEANVYQDPSSYPTNDIAENPYWKRDHRRAYPQTAFFDQKTVTGLLELGSEATPRIADGEAGTKALANIANGGVSFTQALGKSSKDVIYGEVLTVNGLPPVAPTLAPKQWKIIEGEAAIYPKGYPCRTFH 182 T 0.00017 CI-B14_5a pdbhh F Eukaryota T 6y79 32 FA i A0A1H6Q311_YARLL Subunit NUUM of NADH:Ubiquinone Oxidoreductase (Complex I) MGGGRYPFPKDVISMTGGWWANPSNWKLNGLFATGIAVGLALWVSTATLPYTRRREGITSESDISKWNAAAGVWRERHGKISTGEAAESE 90 T 6 Sex_peptide pdbhh F Eukaryota T 6y79 34 HA n Q6C1R9_YARLI Subunit NUNM of NADH:Ubiquinone Oxidoreductase (Complex I) FGSHGHGPELTPAVPFFQPYVLKWAGVSLGLVAFYQFNSSYEAKNGHTWVETFFHPKSREDILNEEAKIVQALNNQRELTIKMHELKREEKDYAHSYSPLFSDPVPQGGSIGKAPGSSRE 120 T 0.023 DUF5950 pdb F Eukaryota T 6y7v 2 B B GLU-HIS-ASP-GLU-LEU EHDEL 5 T 210 PcfJ pdbhh F F 6y8a 2 B P KKCC2_HUMAN CAMKK 2,CALCIUM/CALMODULIN-DEPENDENT PROTEIN KINASE KINASE BETA,CAMKK BETA RSLSAPGN 8 T 13 DUF6439 pdbhh F Eukaryota T 6y8b 2 B P CASP2_HUMAN TYR-ASP-LEU-SEP-LEU-PRO-PHE-PRO YDLSLPFP 8 T 1.4 TUG-UBL1 pdbhh F Eukaryota T 6y8d 2 B B CASP2_HUMAN VAL-GLU-HIS-SEP-LEU-ASP-ASN-LYS VEHSLDNK 8 T 38 Lectin_leg-like pdbhh F Eukaryota T 6y8k 2 B PPP BCY10916 CIEEGQYCFADPYXC 15 T 0.0042 DUF5637 pdb F T 6y9h 4 D C D-Phe-Pro-m-Trifluoromethylbenzylamide derivative (phe2) XPX 3 T 160 DUF2795 pdbhh F F 6y9l 1 A,B,C,D B,D,A,C GP_TSWV1 Glycoprotein KVEIIRGDHPEVYDDSAENEVPTAASIQRKAILETLTNLMLESQTPGTRQIREEESTIPIFAESTTQKTISVSDLPNNCLNASSLKCEIKGISTYNVYYQVENNGVIYSCVSDSAEGLEKCDNSLNLPKRFSKVPVIPITKLDNKRHFSVGTKFFISESLTQDNYPITYNSYPTNGTVSLQTVKLSGDCKITKSNFANPYTVSITSPEKIMGYLIKKPGENVEHKVISFSGSASITFTEEMLDGEHNLLCGDKSAKIPKTNKRVRDCIIKYSKSIYKQTACINFSWIR 288 T 0.018 Bunya_G2 unphh T Viruses T 6y9m 1 A,B,C,D A,B,C,D GP_TSWV1 Glycoprotein KVEIIRGDHPEVYDDSAENEVPTAASIQRKAILETLTNLMLESQTPGTRQIREEESTIPIFAESTTQKTISVSDLPNNCLNASSLKCEIKGISTYNVYYQVENNGVIYSCVSDSAEGLEKCDNSLNLPKRFSKVPVIPITKLDNKRHFSVGTKFFISESLTQDNYPITYNSYPTNGTVSLQTVKLSGDCKITKSNFANPYTVSITSPEKIMGYLIKKPGENVEHKVISFSGSASITFTEEMLDGEHNLLCGDKSAKIPKTNKRVRDCIIKYSKSIYKQTACINF 284 T 0.016 Bunya_G2 unphh T Viruses T 6y9n 2 B B MYO15_MOUSE UNCONVENTIONAL MYOSIN-15 ERLTLPPSEITLL 13 T 5.4 DUF4875 pdbhh F Eukaryota T 6y9o 2 B C CSKP_MOUSE CALCIUM/CALMODULIN-DEPENDENT SERINE PROTEIN KINASE TAPQWVPVSWVY 12 T 1.8 DUF463 unphh F Eukaryota T 6y9p 2 C,D,F,H,J,L C,D,F,H,J,L Q6PPF3_RAT Harmonin a1 PKEYDDELTFF 11 T 7.5 DUF3601 pdbhh F Eukaryota T 6ya0 1 A,B,C A,B,C GP_TSWV1 Glycoprotein KVEIIRGDHPEVYDDSAENEVPTAASIQRKAILETLTNLMLESQTPGTRQIREEESTIPIFAESTTQKTISVSDLPNNCLNASSLKCEIKGISTYNVYYQVENNGVIYSCVSDSAEGLEKCDNSLNLPKRFSKVPVIPITKLDNKRHFSVGTKFFISESLTQDNYPITYNSYPTNGTVSLQTVKLSGDCKITKSNFANPYTVSITSPEKIMGYLIKKPGENVEHKVISFSGSASITFTEEMLDGEHNLLCGDKSAKIPKTNKRVRDCIIKYSKSIYKQTACINFSWIR 288 T 0.018 Bunya_G2 unphh T Viruses T 6ya2 1 A,B,C A,B,C GP_TSWV1 Glycoprotein STTQKTISVSDLPNNCLNASSLKCEIKGISTYNVYYQVENNGVIYSCVSDSAEGLEKCDNSLNLPKRFSKVPVIPITKLDNKRHFSVGTKFFISESLTQDNYPITYNSYPTNGTVCLQTVKLSGDCKITKSNFANPYTVSITSPEKIMGYLIKKPGENVEHKVISFSGSASITFTEEMLDGEHNLLCGDKSAKIPKTNK 199 T 0.018 Bunya_G2 unphh T Viruses T 6ya7 3 C C MCM2_HUMAN MINICHROMOSOME MAINTENANCE PROTEIN 2 HOMOLOG,NUCLEAR PROTEIN BM28 RRTDALTXSPGRDLP 15 T 110 P53 pdbhh F Eukaryota T 6yaz 1 A,B,C,D,E,F,G,H,I,J,K,L A,B,C,D,E,F,G,H,I,J,K,L CC-Type2-(TaId)5 XGEIAQATKEIAQATKEIAKATKEIAWATKEIAQATKGX 39 T 0.00096 MCPsignal pdb F T 6yb0 1 A,B,C,D,E,F,G,H,I,J,K,L A,B,C,D,E,F,G,H,I,J,K,L CC-Type2-(TaSd)2 XGEIAQALKEIAQALKESAKATKESAWATKEIAQALKGX 39 T 0.014 MCPsignal pdb F T 6yb1 1 A,B,C,D A,B,C,D K2-CCTM-VbIc XGKKSAWATVISALATVISALATVISAWATVGX 33 T 0.059 DUF6486 pdb F T 6yb2 1 A,B,C,D,E,F A,B,C,D,E,F CC-Type2-(TaId)2 XGEIAQALKEIAKATKEIAWATKEIAQALKGX 32 T 0.013 MCPsignal pdbpssm F T 6yb6 4 D D D-Phe-Pro-3-chloro-1,3-dihydroxybenzylamide derivative XPX 3 T 160 DUF2795 pdbhh F F 6ycr 2 B B FFIVIRDRVFR(CCS)G(NH2) FFIVIRDRVFRXGX 14 T 4 BOFC_N pdbhh F T 6ycx 3 D F A0A2I0BQX1_PLAFO Uncharacterized protein MASDMEEKFREAFILFSSCSDHIEMYKFFELMNSFGIILTNDEKAALPNDINMDYWLNFAKKHYNYEQPFKHINNVNEQNTNVQIKIDNFLGIMKALDTRLTESDLNILLQITNPENKTTLNLKTVSQKLTESI 134 T 0.024 Na_Ca_ex_C unppercent F Eukaryota T 6ycx 4 E G A0A2I0BQX1_PLAFO Uncharacterized protein MASDMEEKFREAFILFSSCSDHIEMYKFFELMNSFGIILTNDEKAALPNDINMDYWLNFAKKHYNYEQPFKHINNVNEQNTNVQIKIDNFLGIMKALDTRLTESDLNILLQITNPENKSTLNLKTVSQKLTESI 134 T 0.024 Na_Ca_ex_C pdbpercent F Eukaryota T 6ycy 3 C E Q8IJM4_PLAF7 Myosin essential light chain ELC MASDMEEKFREAFILFSSCSDHIEMYKFFELMNSFGIILTNDEKAALPNDINMDYWLNFAKKHYNYEQPFKHINNVNEQNTNVQIKIDNFLGIMKALDTRLTESDLNILLQITNPENKSTLNLKTVSQKLTESI 134 T 0.024 Na_Ca_ex_C pdbpercent F Eukaryota T 6ycz 3 C C A0A2I0BQX1_PLAFO Uncharacterized protein MASDMEEKFREAFILFSSCSDHIEMYKFFELMNSFGIILTNDEKAALPNDINMDYWLNFAKKHYNYEQPFKHINNVNEQNTNVQIKIDNFLGIMKALDTRLTESDLNILLQITNPENKSTLNLKTVSQKLTESI 134 T 0.024 Na_Ca_ex_C pdbpercent F Eukaryota T 6yd2 2 B 611 4-aminomethyl-phenylacetyl-canavanine-Tle-Arg-Amba XXXKX 5 T 2000 EF-hand_1 pdbhh F F 6yd3 2 B 611 4-guanidinomethyl-phenylacetyl-Canavanine-Tle-Arg-Amba XXXKX 5 T 2000 EF-hand_1 pdbhh F F 6yd4 2 B B 4-guanidinomethyl-phenylacetyl-Canavanine-Tle-Canavanine-Amba XXXXX 5 T 3200 zf-CCHC_2 pdbhh F F 6yd7 2 B B 4-guanidinomethyl-phenylacetyl-Arg-Tle-Canavanine-Amba XRXXX 5 T 1500 zf-C2H2_4 pdbhh F F 6ydp 48 VA AZ unknown peptide AAAAAAAAAAAAAAAAAA 18 T 330 Campylo_MOMP pdbhh F F 6ydw 48 WA AZ unknown peptide AAAAAAAAAAAAAAAAAA 18 T 330 Campylo_MOMP pdbhh F F 6yes 1 A,B A,B M9U4Y8_SULIS CRISPR-associated protein, CscA MRNLKRIVMGENKLIGLVRTALDSITLGQGVNEAKIKSPQSYAFHTISVGTISLDICKAIYSSSEIGRKQLENLSKKYNMPFEDLWFYGGFLHDWNKLSGKEESLENKEELTKKIIDKLKLPNEFLHGISTMAEGHLPDNLHLPLWVSIKLADMLLISDIGSVRDVFYFANSDSYRNAIEALKEYNLELNYVSSTFRLFTLIASKELLNDVFNEKSGYFPLISYADGIVFLKRKNSQPVLLSKIVDLLSRQVFSSSSEVIEEKISDIEKCIKNKEELFRQMNIDVKSAIYDEEGKVKQINAFLPTKVCKPFEDVVGNLDNKSKLQVAREVIERNRKDIPFGLLIYFVNKFSKNEEDYIRKGLGINEKSLKYLLNIGDVQKALDKILELLEKRYAEQSSDKTLLYYVKFSSSGNIIDDLPKITDRPNDYCVVCGMPIYSSNPVRFVQYASELGGRAEIWIPREKALDEIDNVRDDWKVCPICIYEANLMKDRVKPPYFIVTFYPGVPISLLNIIDFDFSQSSIKYYIDEEKDTYFTAFEKMGGRLEPYVKKVLPAYFSSKVIIKASEVSNFSLSTRLSKSELNKLLPYAPMISMIFLTSPVLISSNLYEMPIAHERVISITSTYNYTFMKSLNSNLLTLYSIFAYSAKYDAMRKICGRSDLDNCLGYLTEEMDLYSSVDPALGVLSIGMGVGTPIDTDEKFFSAFLPVSGYLLKVTGKVSKMGETLKSSIFSIAYALKDIIKSQKVSKYDVTGFLRDGVDMFFKTTSVIKDKEDRIGISVNAAISSLENKYALDDQHRAQVYSALQDIFKTLYSIEEESDRSLAISIANTLSNWLYIAYKLVLQGDKSLEHHHHHH 855 T 0.057 DUF2225 unphh F Archaea T 6yf7 1 A,AA,AB,AC,AD,AE,AF,AG,AH,AI,AJ,B,BA,BB,BC,BD,BE,BF,BG,BH,BI,BJ,C,CA,CB,CC,CD,CE,CF,CG,CH,CI,CJ,D,DA,DB,DC,DD,DE,DF,DG,DH,DI,DJ,E,EA,EB,EC,ED,EE,EF,EG,EH,EI,EJ,F,FA,FB,FC,FD,FE,FF,FG,FH,FI,FJ,G,GA,GB,GC,GD,GE,GF,GG,GH,GI,GJ,H,HA,HB,HC,HD,HE,HF,HG,HH,HI,HJ,I,IA,IB,IC,ID,IE,IF,IG,IH,II,IJ,J,JA,JB,JC,JD,JE,JF,JG,JH,JI,JJ,K,KA,KB,KC,KD,KE,KF,KG,KH,KI,L,LA,LB,LC,LD,LE,LF,LG,LH,LI,M,MA,MB,MC,MD,ME,MF,MG,MH,MI,N,NA,NB,NC,ND,NE,NF,NG,NH,NI,O,OA,OB,OC,OD,OE,OF,OG,OH,OI,P,PA,PB,PC,PD,PE,PF,PG,PH,PI,Q,QA,QB,QC,QD,QE,QF,QG,QH,QI,R,RA,RB,RC,RD,RE,RF,RG,RH,RI,S,SA,SB,SC,SD,SE,SF,SG,SH,SI,T,TA,TB,TC,TD,TE,TF,TG,TH,TI,U,UA,UB,UC,UD,UE,UF,UG,UH,UI,V,VA,VB,VC,VD,VE,VF,VG,VH,VI,W,WA,WB,WC,WD,WE,WF,WG,WH,WI,X,XA,XB,XC,XD,XE,XF,XG,XH,XI,Y,YA,YB,YC,YD,YE,YF,YG,YH,YI,Z,ZA,ZB,ZC,ZD,ZE,ZF,ZG,ZH,ZI AA,BA,CA,DA,EA,FA,GA,HA,IA,JA,KA,AB,BB,CB,DB,EB,FB,GB,HB,IB,JB,KB,AC,BC,CC,DC,EC,FC,GC,HC,IC,JC,KC,AD,BD,CD,DD,ED,FD,GD,HD,ID,JD,KD,AE,BE,CE,DE,EE,FE,GE,HE,IE,JE,KE,AF,BF,CF,DF,EF,FF,GF,HF,IF,JF,KF,AG,BG,CG,DG,EG,FG,GG,HG,IG,JG,KG,AH,BH,CH,DH,EH,FH,GH,HH,IH,JH,KH,AI,BI,CI,DI,EI,FI,GI,HI,II,JI,KI,AJ,BJ,CJ,DJ,EJ,FJ,GJ,HJ,IJ,JJ,KJ,AK,BK,CK,DK,EK,FK,GK,HK,IK,JK,AL,BL,CL,DL,EL,FL,GL,HL,IL,JL,AM,BM,CM,DM,EM,FM,GM,HM,IM,JM,AN,BN,CN,DN,EN,FN,GN,HN,IN,JN,AO,BO,CO,DO,EO,FO,GO,HO,IO,JO,AP,BP,CP,DP,EP,FP,GP,HP,IP,JP,AQ,BQ,CQ,DQ,EQ,FQ,GQ,HQ,IQ,JQ,AR,BR,CR,DR,ER,FR,GR,HR,IR,JR,AS,BS,CS,DS,ES,FS,GS,HS,IS,JS,AT,BT,CT,DT,ET,FT,GT,HT,IT,JT,AU,BU,CU,DU,EU,FU,GU,HU,IU,JU,AV,BV,CV,DV,EV,FV,GV,HV,IV,JV,AW,BW,CW,DW,EW,FW,GW,HW,IW,JW,AX,BX,CX,DX,EX,FX,GX,HX,IX,JX,AY,BY,CY,DY,EY,FY,GY,HY,IY,JY,AZ,BZ,CZ,DZ,EZ,FZ,GZ,HZ,IZ,JZ A0A068EP60_9VIRU Coat protein SITKYSESAGPIGQSIYTFTGVTVPAQYMPRLVATTTVNKAGTNIEYKIAVNYPLVSVVDGANVALNTIRANLSFTALQSVINTDEKLRVLDEIVSFITANKANIIDGNVLTVTP 115 T 0.14 Hexokinase_2 pdbpercent T Viruses T 6yf9 1 A,AA,AB,AC,AD,AE,AF,B,BA,BB,BC,BD,BE,BF,C,CA,CB,CC,CD,CE,CF,D,DA,DB,DC,DD,DE,DF,E,EA,EB,EC,ED,EE,EF,F,FA,FB,FC,FD,FE,FF,G,GA,GB,GC,GD,GE,GF,H,HA,HB,HC,HD,HE,HF,I,IA,IB,IC,ID,IE,IF,J,JA,JB,JC,JD,JE,JF,K,KA,KB,KC,KD,KE,KF,L,LA,LB,LC,LD,LE,LF,M,MA,MB,MC,MD,ME,MF,N,NA,NB,NC,ND,NE,NF,O,OA,OB,OC,OD,OE,OF,P,PA,PB,PC,PD,PE,PF,Q,QA,QB,QC,QD,QE,QF,R,RA,RB,RC,RD,RE,RF,S,SA,SB,SC,SD,SE,SF,T,TA,TB,TC,TD,TE,TF,U,UA,UB,UC,UD,UE,UF,V,VA,VB,VC,VD,VE,VF,W,WA,WB,WC,WD,WE,WF,X,XA,XB,XC,XD,XE,XF,Y,YA,YB,YC,YD,YE,Z,ZA,ZB,ZC,ZD,ZE AA,BA,CA,DA,EA,FA,GA,AB,BB,CB,DB,EB,FB,GB,AC,BC,CC,DC,EC,FC,GC,AD,BD,CD,DD,ED,FD,GD,AE,BE,CE,DE,EE,FE,GE,AF,BF,CF,DF,EF,FF,GF,AG,BG,CG,DG,EG,FG,GG,AH,BH,CH,DH,EH,FH,GH,AI,BI,CI,DI,EI,FI,GI,AJ,BJ,CJ,DJ,EJ,FJ,GJ,AK,BK,CK,DK,EK,FK,GK,AL,BL,CL,DL,EL,FL,GL,AM,BM,CM,DM,EM,FM,GM,AN,BN,CN,DN,EN,FN,GN,AO,BO,CO,DO,EO,FO,GO,AP,BP,CP,DP,EP,FP,GP,AQ,BQ,CQ,DQ,EQ,FQ,GQ,AR,BR,CR,DR,ER,FR,GR,AS,BS,CS,DS,ES,FS,GS,AT,BT,CT,DT,ET,FT,GT,AU,BU,CU,DU,EU,FU,GU,AV,BV,CV,DV,EV,FV,GV,AW,BW,CW,DW,EW,FW,GW,AX,BX,CX,DX,EX,FX,GX,AY,BY,CY,DY,EY,FY,AZ,BZ,CZ,DZ,EZ,FZ coat protein AAPSLALVGANSTLASTLVNYSLRSQNGNNVDYVCTDPDSTLSAPGLINAKFDIKAPGITGNDRIHANLRKVVLDEKTNLPSTGSVTIQVSIPRNPAWNASMTVSLLKQAADYLAGTSATVSGQTDTSGFPAKWAGLMFP 140 T 0.3 CLP1_P pdb F T 6yfa 1 A,AA,AB,B,BA,BB,C,CA,CB,D,DA,DB,E,EA,EB,F,FA,FB,G,GA,GB,H,HA,HB,I,IA,J,JA,K,KA,L,LA,M,MA,N,NA,O,OA,P,PA,Q,QA,R,RA,S,SA,T,TA,U,UA,V,VA,W,WA,X,XA,Y,YA,Z,ZA AA,BA,CA,AB,BB,CB,AC,BC,CC,AD,BD,CD,AE,BE,CE,AF,BF,CF,AG,BG,CG,AH,BH,CH,AI,BI,AJ,BJ,AK,BK,AL,BL,AM,BM,AN,BN,AO,BO,AP,BP,AQ,BQ,AR,BR,AS,BS,AT,BT,AU,BU,AV,BV,AW,BW,AX,BX,AY,BY,AZ,BZ coat protein SKILSTNNSNSNFVDTSFTLKVPVYSKDYRVTQDEPDEVVVANRQQPFGVKNTARYGIRQIADVYRNTTIDRAYQSPSKKGTSLVVQVTETWTVASTDDETYGYSLPFSAHVIVNVPQDALITEEILYDALKRLMGHFYEGNDTTSPTTTSVRLKDMLQGALVPQSL 167 T 34 DUF3626 pdbhh F T 6yfb 1 A,AA,AB,AC,AD,AE,AF,AG,AH,B,BA,BB,BC,BD,BE,BF,BG,BH,C,CA,CB,CC,CD,CE,CF,CG,D,DA,DB,DC,DD,DE,DF,DG,E,EA,EB,EC,ED,EE,EF,EG,F,FA,FB,FC,FD,FE,FF,FG,G,GA,GB,GC,GD,GE,GF,GG,H,HA,HB,HC,HD,HE,HF,HG,I,IA,IB,IC,ID,IE,IF,IG,J,JA,JB,JC,JD,JE,JF,JG,K,KA,KB,KC,KD,KE,KF,KG,L,LA,LB,LC,LD,LE,LF,LG,M,MA,MB,MC,MD,ME,MF,MG,N,NA,NB,NC,ND,NE,NF,NG,O,OA,OB,OC,OD,OE,OF,OG,P,PA,PB,PC,PD,PE,PF,PG,Q,QA,QB,QC,QD,QE,QF,QG,R,RA,RB,RC,RD,RE,RF,RG,S,SA,SB,SC,SD,SE,SF,SG,T,TA,TB,TC,TD,TE,TF,TG,U,UA,UB,UC,UD,UE,UF,UG,V,VA,VB,VC,VD,VE,VF,VG,W,WA,WB,WC,WD,WE,WF,WG,X,XA,XB,XC,XD,XE,XF,XG,Y,YA,YB,YC,YD,YE,YF,YG,Z,ZA,ZB,ZC,ZD,ZE,ZF,ZG AA,BA,CA,DA,EA,FA,GA,HA,IA,AB,BB,CB,DB,EB,FB,GB,HB,IB,AC,BC,CC,DC,EC,FC,GC,HC,AD,BD,CD,DD,ED,FD,GD,HD,AE,BE,CE,DE,EE,FE,GE,HE,AF,BF,CF,DF,EF,FF,GF,HF,AG,BG,CG,DG,EG,FG,GG,HG,AH,BH,CH,DH,EH,FH,GH,HH,AI,BI,CI,DI,EI,FI,GI,HI,AJ,BJ,CJ,DJ,EJ,FJ,GJ,HJ,AK,BK,CK,DK,EK,FK,GK,HK,AL,BL,CL,DL,EL,FL,GL,HL,AM,BM,CM,DM,EM,FM,GM,HM,AN,BN,CN,DN,EN,FN,GN,HN,AO,BO,CO,DO,EO,FO,GO,HO,AP,BP,CP,DP,EP,FP,GP,HP,AQ,BQ,CQ,DQ,EQ,FQ,GQ,HQ,AR,BR,CR,DR,ER,FR,GR,HR,AS,BS,CS,DS,ES,FS,GS,HS,AT,BT,CT,DT,ET,FT,GT,HT,AU,BU,CU,DU,EU,FU,GU,HU,AV,BV,CV,DV,EV,FV,GV,HV,AW,BW,CW,DW,EW,FW,GW,HW,AX,BX,CX,DX,EX,FX,GX,HX,AY,BY,CY,DY,EY,FY,GY,HY,AZ,BZ,CZ,DZ,EZ,FZ,GZ,HZ coat protein SYTQSFGYTIPTEKDTLEIPQYQALLAKKASYMDDSQGKNTATYMNTAAPKDQPETITFGVNKVDNVYKQSNVQNQTFYASSSKGTKIRIDGKRIWRTQSTDVNTGLPVIVDCPLWTSFTLGFADFTLVDDSARKSTIEWMISQLELLKDDGVWSKLCSGVTRIYG 166 T 7.3 DUF4325 pdbhh F T 6yfc 1 A,AA,AB,AC,AD,AE,AF,B,BA,BB,BC,BD,BE,BF,C,CA,CB,CC,CD,CE,CF,D,DA,DB,DC,DD,DE,DF,E,EA,EB,EC,ED,EE,EF,F,FA,FB,FC,FD,FE,FF,G,GA,GB,GC,GD,GE,GF,H,HA,HB,HC,HD,HE,HF,I,IA,IB,IC,ID,IE,IF,J,JA,JB,JC,JD,JE,JF,K,KA,KB,KC,KD,KE,KF,L,LA,LB,LC,LD,LE,LF,M,MA,MB,MC,MD,ME,MF,N,NA,NB,NC,ND,NE,NF,O,OA,OB,OC,OD,OE,OF,P,PA,PB,PC,PD,PE,PF,Q,QA,QB,QC,QD,QE,QF,R,RA,RB,RC,RD,RE,RF,S,SA,SB,SC,SD,SE,SF,T,TA,TB,TC,TD,TE,TF,U,UA,UB,UC,UD,UE,UF,V,VA,VB,VC,VD,VE,VF,W,WA,WB,WC,WD,WE,WF,X,XA,XB,XC,XD,XE,XF,Y,YA,YB,YC,YD,YE,Z,ZA,ZB,ZC,ZD,ZE AA,BA,CA,DA,EA,FA,GA,AB,BB,CB,DB,EB,FB,GB,AC,BC,CC,DC,EC,FC,GC,AD,BD,CD,DD,ED,FD,GD,AE,BE,CE,DE,EE,FE,GE,AF,BF,CF,DF,EF,FF,GF,AG,BG,CG,DG,EG,FG,GG,AH,BH,CH,DH,EH,FH,GH,AI,BI,CI,DI,EI,FI,GI,AJ,BJ,CJ,DJ,EJ,FJ,GJ,AK,BK,CK,DK,EK,FK,GK,AL,BL,CL,DL,EL,FL,GL,AM,BM,CM,DM,EM,FM,GM,AN,BN,CN,DN,EN,FN,GN,AO,BO,CO,DO,EO,FO,GO,AP,BP,CP,DP,EP,FP,GP,AQ,BQ,CQ,DQ,EQ,FQ,GQ,AR,BR,CR,DR,ER,FR,GR,AS,BS,CS,DS,ES,FS,GS,AT,BT,CT,DT,ET,FT,GT,AU,BU,CU,DU,EU,FU,GU,AV,BV,CV,DV,EV,FV,GV,AW,BW,CW,DW,EW,FW,GW,AX,BX,CX,DX,EX,FX,GX,AY,BY,CY,DY,EY,FY,AZ,BZ,CZ,DZ,EZ,FZ coat protein MRLTDVDLTVGEETREYAVSEQQGTLFRFVDKSGTVANNTGVFSLEQRFGAANSNRKVTMLLTDPVVVKDASGADMTIKANASVTFSLPKTYPNEHITKLRQTLIAWLGQQCVSDPVDSGLNNY 124 T 0.0063 Phage_coat pdbhh F T 6yfd 1 A,AA,AB,AC,B,BA,BB,BC,C,CA,CB,CC,D,DA,DB,DC,E,EA,EB,EC,F,FA,FB,FC,G,GA,GB,GC,H,HA,HB,HC,I,IA,IB,IC,J,JA,JB,JC,K,KA,KB,KC,L,LA,LB,LC,M,MA,MB,N,NA,NB,O,OA,OB,P,PA,PB,Q,QA,QB,R,RA,RB,S,SA,SB,T,TA,TB,U,UA,UB,V,VA,VB,W,WA,WB,X,XA,XB,Y,YA,YB,Z,ZA,ZB AA,BA,CA,DA,AB,BB,CB,DB,AC,BC,CC,DC,AD,BD,CD,DD,AE,BE,CE,DE,AF,BF,CF,DF,AG,BG,CG,DG,AH,BH,CH,DH,AI,BI,CI,DI,AJ,BJ,CJ,DJ,AK,BK,CK,DK,AL,BL,CL,DL,AM,BM,CM,AN,BN,CN,AO,BO,CO,AP,BP,CP,AQ,BQ,CQ,AR,BR,CR,AS,BS,CS,AT,BT,CT,AU,BU,CU,AV,BV,CV,AW,BW,CW,AX,BX,CX,AY,BY,CY,AZ,BZ,CZ coat protein TKRNRNNQARGQLYMGQQGPVQSSRTTFGVNPDRQANARPVYLAPAAPMENTYTYLGSIQFAAGRHIFGEPASNVLPPQNIVPGVPTKHGEYVTTNTGDRLMASSTTVTRDVSNGRTKVSIDIPYYDRNAVETLKASAIPGAVAPVGSFKVNVEVLGGGVLTGTDANAQFALDELLSNMLMDAARIAQDGPKNTARLVAASHGVMPQA 208 T 21 FeS_assembly_P pdbhh F T 6yfg 1 A,AA,AB,AC,AD,AE,AF,AG,AH,AI,AJ,AK,AL,AM,B,BA,BB,BC,BD,BE,BF,BG,BH,BI,BJ,BK,BL,BM,C,CA,CB,CC,CD,CE,CF,CG,CH,CI,CJ,CK,CL,CM,D,DA,DB,DC,DD,DE,DF,DG,DH,DI,DJ,DK,DL,DM,E,EA,EB,EC,ED,EE,EF,EG,EH,EI,EJ,EK,EL,EM,F,FA,FB,FC,FD,FE,FF,FG,FH,FI,FJ,FK,FL,FM,G,GA,GB,GC,GD,GE,GF,GG,GH,GI,GJ,GK,GL,GM,H,HA,HB,HC,HD,HE,HF,HG,HH,HI,HJ,HK,HL,HM,I,IA,IB,IC,ID,IE,IF,IG,IH,II,IJ,IK,IL,IM,J,JA,JB,JC,JD,JE,JF,JG,JH,JI,JJ,JK,JL,JM,K,KA,KB,KC,KD,KE,KF,KG,KH,KI,KJ,KK,KL,KM,L,LA,LB,LC,LD,LE,LF,LG,LH,LI,LJ,LK,LL,LM,M,MA,MB,MC,MD,ME,MF,MG,MH,MI,MJ,MK,ML,MM,N,NA,NB,NC,ND,NE,NF,NG,NH,NI,NJ,NK,NL,NM,O,OA,OB,OC,OD,OE,OF,OG,OH,OI,OJ,OK,OL,OM,P,PA,PB,PC,PD,PE,PF,PG,PH,PI,PJ,PK,PL,PM,Q,QA,QB,QC,QD,QE,QF,QG,QH,QI,QJ,QK,QL,QM,R,RA,RB,RC,RD,RE,RF,RG,RH,RI,RJ,RK,RL,RM,S,SA,SB,SC,SD,SE,SF,SG,SH,SI,SJ,SK,SL,SM,T,TA,TB,TC,TD,TE,TF,TG,TH,TI,TJ,TK,TL,TM,U,UA,UB,UC,UD,UE,UF,UG,UH,UI,UJ,UK,UL,UM,V,VA,VB,VC,VD,VE,VF,VG,VH,VI,VJ,VK,VL,VM,W,WA,WB,WC,WD,WE,WF,WG,WH,WI,WJ,WK,WL,X,XA,XB,XC,XD,XE,XF,XG,XH,XI,XJ,XK,XL,Y,YA,YB,YC,YD,YE,YF,YG,YH,YI,YJ,YK,YL,Z,ZA,ZB,ZC,ZD,ZE,ZF,ZG,ZH,ZI,ZJ,ZK,ZL AA,BA,CA,DA,EA,FA,GA,HA,IA,JA,KA,LA,MA,NA,AB,BB,CB,DB,EB,FB,GB,HB,IB,JB,KB,LB,MB,NB,AC,BC,CC,DC,EC,FC,GC,HC,IC,JC,KC,LC,MC,NC,AD,BD,CD,DD,ED,FD,GD,HD,ID,JD,KD,LD,MD,ND,AE,BE,CE,DE,EE,FE,GE,HE,IE,JE,KE,LE,ME,NE,AF,BF,CF,DF,EF,FF,GF,HF,IF,JF,KF,LF,MF,NF,AG,BG,CG,DG,EG,FG,GG,HG,IG,JG,KG,LG,MG,NG,AH,BH,CH,DH,EH,FH,GH,HH,IH,JH,KH,LH,MH,NH,AI,BI,CI,DI,EI,FI,GI,HI,II,JI,KI,LI,MI,NI,AJ,BJ,CJ,DJ,EJ,FJ,GJ,HJ,IJ,JJ,KJ,LJ,MJ,NJ,AK,BK,CK,DK,EK,FK,GK,HK,IK,JK,KK,LK,MK,NK,AL,BL,CL,DL,EL,FL,GL,HL,IL,JL,KL,LL,ML,NL,AM,BM,CM,DM,EM,FM,GM,HM,IM,JM,KM,LM,MM,NM,AN,BN,CN,DN,EN,FN,GN,HN,IN,JN,KN,LN,MN,NN,AO,BO,CO,DO,EO,FO,GO,HO,IO,JO,KO,LO,MO,NO,AP,BP,CP,DP,EP,FP,GP,HP,IP,JP,KP,LP,MP,NP,AQ,BQ,CQ,DQ,EQ,FQ,GQ,HQ,IQ,JQ,KQ,LQ,MQ,NQ,AR,BR,CR,DR,ER,FR,GR,HR,IR,JR,KR,LR,MR,NR,AS,BS,CS,DS,ES,FS,GS,HS,IS,JS,KS,LS,MS,NS,AT,BT,CT,DT,ET,FT,GT,HT,IT,JT,KT,LT,MT,NT,AU,BU,CU,DU,EU,FU,GU,HU,IU,JU,KU,LU,MU,NU,AV,BV,CV,DV,EV,FV,GV,HV,IV,JV,KV,LV,MV,NV,AW,BW,CW,DW,EW,FW,GW,HW,IW,JW,KW,LW,MW,AX,BX,CX,DX,EX,FX,GX,HX,IX,JX,KX,LX,MX,AY,BY,CY,DY,EY,FY,GY,HY,IY,JY,KY,LY,MY,AZ,BZ,CZ,DZ,EZ,FZ,GZ,HZ,IZ,JZ,KZ,LZ,MZ coat protein SKPIAIFKLRELSSDSTLFTLPGHSVTLPNTLGIVSHLPTPRKGNPGTVKTMRNLRKTILLGAGTASERAVPIVIKTETSFPVGTTEEDRAEVLKQMASFLIEEVKNNQELAYSGYVQDKYFIEDLVITE 130 T 0.077 AAA_11 pdbhh F T 6yfj 1 A,AA,AB,B,BA,BB,C,CA,CB,D,DA,DB,E,EA,EB,F,FA,FB,G,GA,GB,H,HA,HB,I,IA,J,JA,K,KA,L,LA,M,MA,N,NA,O,OA,P,PA,Q,QA,R,RA,S,SA,T,TA,U,UA,V,VA,W,WA,X,XA,Y,YA,Z,ZA AA,BA,CA,AB,BB,CB,AC,BC,CC,AD,BD,CD,AE,BE,CE,AF,BF,CF,AG,BG,CG,AH,BH,CH,AI,BI,AJ,BJ,AK,BK,AL,BL,AM,BM,AN,BN,AO,BO,AP,BP,AQ,BQ,AR,BR,AS,BS,AT,BT,AU,BU,AV,BV,AW,BW,AX,BX,AY,BY,AZ,BZ coat protein AYKLIKMAGGNSAIQTYAREDKTTQTLSTQKTISVLRNGSTSTRIIKVHINSTAPVTINTCDPTKCGPTVPMGVSFKSSMPEDADPAEVLKAAKAALALFEANLNSAFNKNVDEISVA 118 T 0.14 MRF_C2 pdb F T 6yfl 1 A,AA,AB,B,BA,BB,C,CA,CB,D,DA,DB,E,EA,EB,F,FA,FB,G,GA,GB,H,HA,HB,I,IA,J,JA,K,KA,L,LA,M,MA,N,NA,O,OA,P,PA,Q,QA,R,RA,S,SA,T,TA,U,UA,V,VA,W,WA,X,XA,Y,YA,Z,ZA AA,BA,CA,AB,BB,CB,AC,BC,CC,AD,BD,CD,AE,BE,CE,AF,BF,CF,AG,BG,CG,AH,BH,CH,AI,BI,AJ,BJ,AK,BK,AL,BL,AM,BM,AN,BN,AO,BO,AP,BP,AQ,BQ,AR,BR,AS,BS,AT,BT,AU,BU,AV,BV,AW,BW,AX,BX,AY,BY,AZ,BZ coat protein AYSPSTPVTGAAQTGFTSPTYTLTSDTAPTALGKQHAVTATGGTQTGVTTHSVSSPFTITFTRPKTMKTVGVPNSNGVITNIGRNTYGFLVRKGVIPAVNQSPQVMLVRVEISVPAGADTYDAANVKAALSAAIGVLSQQSAGIGDTALSGIL 153 T 3.2 Lin0512_fam pdbhh F T 6yfm 1 A,AA,B,BA,C,CA,D,DA,E,EA,F,FA,G,GA,H,HA,I,IA,J,JA,K,KA,L,LA,M,MA,N,NA,O,OA,P,PA,Q,QA,R,RA,S,SA,T,U,V,W,X,Y,Z AA,BA,AB,BB,AC,BC,AD,BD,AE,BE,AF,BF,AG,BG,AH,BH,AI,BI,AJ,BJ,AK,BK,AL,BL,AM,BM,AN,BN,AO,BO,AP,BP,AQ,BQ,AR,BR,AS,BS,AT,AU,AV,AW,AX,AY,AZ coat protein SSQANITVFDGAATPVSHVLVPLGVGIDENLGSVAKWRENLATVPLYANVRVTTMQKKLKSGIERVEIRVEVPVMEAVSGQNAFGYTAAPKVAFTDSGSFVGYFSERSAQSNRRLVKQILTNLLGNVSTSVAAPTTGFASELIDSGITAS 150 F F T 6yfo 1 A,AA,AB,AC,B,BA,BB,BC,C,CA,CB,CC,D,DA,DB,DC,E,EA,EB,EC,F,FA,FB,FC,G,GA,GB,GC,H,HA,HB,HC,I,IA,IB,IC,J,JA,JB,JC,K,KA,KB,KC,L,LA,LB,LC,M,MA,MB,N,NA,NB,O,OA,OB,P,PA,PB,Q,QA,QB,R,RA,RB,S,SA,SB,T,TA,TB,U,UA,UB,V,VA,VB,W,WA,WB,X,XA,XB,Y,YA,YB,Z,ZA,ZB AA,BA,CA,DA,AB,BB,CB,DB,AC,BC,CC,DC,AD,BD,CD,DD,AE,BE,CE,DE,AF,BF,CF,DF,AG,BG,CG,DG,AH,BH,CH,DH,AI,BI,CI,DI,AJ,BJ,CJ,DJ,AK,BK,CK,DK,AL,BL,CL,DL,AM,BM,CM,AN,BN,CN,AO,BO,CO,AP,BP,CP,AQ,BQ,CQ,AR,BR,CR,AS,BS,CS,AT,BT,CT,AU,BU,CU,AV,BV,CV,AW,BW,CW,AX,BX,CX,AY,BY,CY,AZ,BZ,CZ coat protein SIIGSSIKTGATSASITGGSDITFALTGQTVTNGLNVSVSEDTDYRTRRNATFKSRVPTVVNGNYSKGKNEVVFVIPMSLDSGETVFNSVRIALEIHPALASASVKDLRLIGAQLLTDADYDSFWTLGALA 131 T 0.044 RRXRR pdbpercent F T 6yfp 1 A,AA,AB,AC,B,BA,BB,BC,C,CA,CB,CC,D,DA,DB,DC,E,EA,EB,EC,F,FA,FB,FC,G,GA,GB,GC,H,HA,HB,HC,I,IA,IB,IC,J,JA,JB,JC,K,KA,KB,KC,L,LA,LB,LC,M,MA,MB,N,NA,NB,O,OA,OB,P,PA,PB,Q,QA,QB,R,RA,RB,S,SA,SB,T,TA,TB,U,UA,UB,V,VA,VB,W,WA,WB,X,XA,XB,Y,YA,YB,Z,ZA,ZB AA,BA,CA,DA,AB,BB,CB,DB,AC,BC,CC,DC,AD,BD,CD,DD,AE,BE,CE,DE,AF,BF,CF,DF,AG,BG,CG,DG,AH,BH,CH,DH,AI,BI,CI,DI,AJ,BJ,CJ,DJ,AK,BK,CK,DK,AL,BL,CL,DL,AM,BM,CM,AN,BN,CN,AO,BO,CO,AP,BP,CP,AQ,BQ,CQ,AR,BR,CR,AS,BS,CS,AT,BT,CT,AU,BU,CU,AV,BV,CV,AW,BW,CW,AX,BX,CX,AY,BY,CY,AZ,BZ,CZ coat protein SYTIDINCSTGDTQANLVLTEIPAEPYVHVSGDNKSTIEYLDTGSDNSLLVRPTQQFNCVSSQYPYRNYSKIPRSQQDPLAVRREFYTRRVEYWRKADASNVDAPEYTLPQSCSIRLASTVTKETTAADIAGIVLRTLAPIFPNGSGDWIKLQQLIDGLPRIFG 164 T 5.2 CtsR pdbhh F T 6yfr 1 A,AA,AB,B,BA,BB,C,CA,CB,D,DA,DB,E,EA,EB,F,FA,FB,G,GA,GB,H,HA,HB,I,IA,J,JA,K,KA,L,LA,M,MA,N,NA,O,OA,P,PA,Q,QA,R,RA,S,SA,T,TA,U,UA,V,VA,W,WA,X,XA,Y,YA,Z,ZA AA,BA,CA,AB,BB,CB,AC,BC,CC,AD,BD,CD,AE,BE,CE,AF,BF,CF,AG,BG,CG,AH,BH,CH,AI,BI,AJ,BJ,AK,BK,AL,BL,AM,BM,AN,BN,AO,BO,AP,BP,AQ,BQ,AR,BR,AS,BS,AT,BT,AU,BU,AV,BV,AW,BW,AX,BX,AY,BY,AZ,BZ coat protein ASLPVTQYSPPVTPLGKSTWNVTGSTNPPGLVPQVVQTESINARKSNIMSKISVYYYIPSTNSVSCCTEWDTIRCEFSLTLLQLSSNTDVAARTVDVLDTMISFLAKRRNSILAGNLLLPDNP 123 T 0.18 CHB_HEX_C pdbpercent F T 6yfs 1 A,AA,AB,AC,B,BA,BB,BC,C,CA,CB,CC,D,DA,DB,DC,E,EA,EB,EC,F,FA,FB,FC,G,GA,GB,GC,H,HA,HB,HC,I,IA,IB,IC,J,JA,JB,JC,K,KA,KB,KC,L,LA,LB,LC,M,MA,MB,N,NA,NB,O,OA,OB,P,PA,PB,Q,QA,QB,R,RA,RB,S,SA,SB,T,TA,TB,U,UA,UB,V,VA,VB,W,WA,WB,X,XA,XB,Y,YA,YB,Z,ZA,ZB AA,BA,CA,DA,AB,BB,CB,DB,AC,BC,CC,DC,AD,BD,CD,DD,AE,BE,CE,DE,AF,BF,CF,DF,AG,BG,CG,DG,AH,BH,CH,DH,AI,BI,CI,DI,AJ,BJ,CJ,DJ,AK,BK,CK,DK,AL,BL,CL,DL,AM,BM,CM,AN,BN,CN,AO,BO,CO,AP,BP,CP,AQ,BQ,CQ,AR,BR,CR,AS,BS,CS,AT,BT,CT,AU,BU,CU,AV,BV,CV,AW,BW,CW,AX,BX,CX,AY,BY,CY,AZ,BZ,CZ coat protein AQHNMRLQLTSGTSLTWVDPNDFRSTFRINLNVNQKVAGAVSVYNARSEVITNRAPLVVIEGCTDACSVNRENISIRTTISGSVENKAAVLAALLDHLHNLGLARDDLVAGLLPTTIQPVVEYTGS 126 T 0.005 YopH_N pdb F T 6yft 1 A,AA,AB,AC,AD,AE,AF,AG,AH,AI,AJ,AK,AL,AM,B,BA,BB,BC,BD,BE,BF,BG,BH,BI,BJ,BK,BL,BM,C,CA,CB,CC,CD,CE,CF,CG,CH,CI,CJ,CK,CL,CM,D,DA,DB,DC,DD,DE,DF,DG,DH,DI,DJ,DK,DL,DM,E,EA,EB,EC,ED,EE,EF,EG,EH,EI,EJ,EK,EL,EM,F,FA,FB,FC,FD,FE,FF,FG,FH,FI,FJ,FK,FL,FM,G,GA,GB,GC,GD,GE,GF,GG,GH,GI,GJ,GK,GL,GM,H,HA,HB,HC,HD,HE,HF,HG,HH,HI,HJ,HK,HL,HM,I,IA,IB,IC,ID,IE,IF,IG,IH,II,IJ,IK,IL,IM,J,JA,JB,JC,JD,JE,JF,JG,JH,JI,JJ,JK,JL,JM,K,KA,KB,KC,KD,KE,KF,KG,KH,KI,KJ,KK,KL,KM,L,LA,LB,LC,LD,LE,LF,LG,LH,LI,LJ,LK,LL,LM,M,MA,MB,MC,MD,ME,MF,MG,MH,MI,MJ,MK,ML,MM,N,NA,NB,NC,ND,NE,NF,NG,NH,NI,NJ,NK,NL,NM,O,OA,OB,OC,OD,OE,OF,OG,OH,OI,OJ,OK,OL,OM,P,PA,PB,PC,PD,PE,PF,PG,PH,PI,PJ,PK,PL,PM,Q,QA,QB,QC,QD,QE,QF,QG,QH,QI,QJ,QK,QL,QM,R,RA,RB,RC,RD,RE,RF,RG,RH,RI,RJ,RK,RL,RM,S,SA,SB,SC,SD,SE,SF,SG,SH,SI,SJ,SK,SL,SM,T,TA,TB,TC,TD,TE,TF,TG,TH,TI,TJ,TK,TL,TM,U,UA,UB,UC,UD,UE,UF,UG,UH,UI,UJ,UK,UL,UM,V,VA,VB,VC,VD,VE,VF,VG,VH,VI,VJ,VK,VL,VM,W,WA,WB,WC,WD,WE,WF,WG,WH,WI,WJ,WK,WL,X,XA,XB,XC,XD,XE,XF,XG,XH,XI,XJ,XK,XL,Y,YA,YB,YC,YD,YE,YF,YG,YH,YI,YJ,YK,YL,Z,ZA,ZB,ZC,ZD,ZE,ZF,ZG,ZH,ZI,ZJ,ZK,ZL AA,BA,CA,DA,EA,FA,GA,HA,IA,JA,KA,LA,MA,NA,AB,BB,CB,DB,EB,FB,GB,HB,IB,JB,KB,LB,MB,NB,AC,BC,CC,DC,EC,FC,GC,HC,IC,JC,KC,LC,MC,NC,AD,BD,CD,DD,ED,FD,GD,HD,ID,JD,KD,LD,MD,ND,AE,BE,CE,DE,EE,FE,GE,HE,IE,JE,KE,LE,ME,NE,AF,BF,CF,DF,EF,FF,GF,HF,IF,JF,KF,LF,MF,NF,AG,BG,CG,DG,EG,FG,GG,HG,IG,JG,KG,LG,MG,NG,AH,BH,CH,DH,EH,FH,GH,HH,IH,JH,KH,LH,MH,NH,AI,BI,CI,DI,EI,FI,GI,HI,II,JI,KI,LI,MI,NI,AJ,BJ,CJ,DJ,EJ,FJ,GJ,HJ,IJ,JJ,KJ,LJ,MJ,NJ,AK,BK,CK,DK,EK,FK,GK,HK,IK,JK,KK,LK,MK,NK,AL,BL,CL,DL,EL,FL,GL,HL,IL,JL,KL,LL,ML,NL,AM,BM,CM,DM,EM,FM,GM,HM,IM,JM,KM,LM,MM,NM,AN,BN,CN,DN,EN,FN,GN,HN,IN,JN,KN,LN,MN,NN,AO,BO,CO,DO,EO,FO,GO,HO,IO,JO,KO,LO,MO,NO,AP,BP,CP,DP,EP,FP,GP,HP,IP,JP,KP,LP,MP,NP,AQ,BQ,CQ,DQ,EQ,FQ,GQ,HQ,IQ,JQ,KQ,LQ,MQ,NQ,AR,BR,CR,DR,ER,FR,GR,HR,IR,JR,KR,LR,MR,NR,AS,BS,CS,DS,ES,FS,GS,HS,IS,JS,KS,LS,MS,NS,AT,BT,CT,DT,ET,FT,GT,HT,IT,JT,KT,LT,MT,NT,AU,BU,CU,DU,EU,FU,GU,HU,IU,JU,KU,LU,MU,NU,AV,BV,CV,DV,EV,FV,GV,HV,IV,JV,KV,LV,MV,NV,AW,BW,CW,DW,EW,FW,GW,HW,IW,JW,KW,LW,MW,AX,BX,CX,DX,EX,FX,GX,HX,IX,JX,KX,LX,MX,AY,BY,CY,DY,EY,FY,GY,HY,IY,JY,KY,LY,MY,AZ,BZ,CZ,DZ,EZ,FZ,GZ,HZ,IZ,JZ,KZ,LZ,MZ coat protein STFSSLVIGSNTFIPTAPGYYSLSTRGFSDPRNQIKISGGKFNAKTGRVTAAVSRLWETDVTVAGLPVRSAAEVAIIMTLGRGITATNADVLLSDLNTLLDPARLDQILQGGF 113 T 0.096 DUF1194 pdb F T 6yfu 1 A,AA,AB,B,BA,BB,C,CA,CB,D,DA,DB,E,EA,EB,F,FA,FB,G,GA,GB,H,HA,HB,I,IA,J,JA,K,KA,L,LA,M,MA,N,NA,O,OA,P,PA,Q,QA,R,RA,S,SA,T,TA,U,UA,V,VA,W,WA,X,XA,Y,YA,Z,ZA AA,BA,CA,AB,BB,CB,AC,BC,CC,AD,BD,CD,AE,BE,CE,AF,BF,CF,AG,BG,CG,AH,BH,CH,AI,BI,AJ,BJ,AK,BK,AL,BL,AM,BM,AN,BN,AO,BO,AP,BP,AQ,BQ,AR,BR,AS,BS,AT,BT,AU,BU,AV,BV,AW,BW,AX,BX,AY,BY,AZ,BZ coat protein SIKYIFKKTDTLPRSVIGNVLRTTGPDTTVYSLPGHTPVNPFTLTAVSRLPVPRKGNAGTTKTTLSLRREVTINKGTDQEKIVPMIARIETSVPVGVSQDDFKAMIEGLACPLLLDEIHVNDLFLSGLPIATTDVPDNEPLPPALL 146 T 0.13 AAA_11 pdbhh F T 6yfy 1 A,B,C,D A,B,E,F D-Arg4,Leu10-Teixobactin XISXXISXALI 11 T 3.8 YlzJ pdbhh F F 6yfy 2 E,F,G,H C,D,G,H Lipid II AXKXX 5 F F F 6ygc 4 D D ARL3_YEAST ARF-LIKE GTPASE 3 MFHLVGSRRR 10 T 0.077 TniB unppercent F Eukaryota T 6ygj 2 B,D B,I MLXPL_HUMAN CHREBP,CLASS D BASIC HELIX-LOOP-HELIX PROTEIN 14,BHLHD14,MLX INTERACTOR,MLX-INTERACTING PROTEIN-LIKE,WS BASIC-HELIX-LOOP-HELIX LEUCINE ZIPPER PROTEIN,WS-BHLH,WILLIAMS-BEUREN SYNDROME CHROMOSOMAL REGION 14 PROTEIN RDKIRLNNAIWRAWYIQYVQR 21 T 0.0087 DUF1752 pdb F Eukaryota T 6yh0 2 B EEE PRO-VAL-PRO-ARG PVPRAHS 7 T 83 DUF4462 pdbhh F T 6yi1 2 C,D,E C,D,E Glu(gamma-hydrazide)-Phe-Ala XFAX 4 T 690 zf-met pdbhh F F 6yia 2 B P SMAD2_HUMAN SMAD2 XWPSVRCSSMS 11 T 5.3 DUF5466 pdbhh F Eukaryota T 6yib 2 B P SMAD3_HUMAN SMAD3 XWPSIRCSSVS 11 T 4.7 Peptidase_Prp pdbhh F Eukaryota T 6yj4 29 CA c Q6CI10_YARLI Subunit NUZM of NADH:Ubiquinone Oxidoreductase (Complex I) MLPGGPVPVFKKYTVGSKGIWEKLRVLLAIAPNRSTGNPIVPLYRVPTPGSRPEANVYQDPSSYPTNDIAENPYWKRDHRRAYPQTAFFDQKTVTGLLELGSEATPRIADGEAGTKALANIANGGVSFTQALGKSSKDVIYGEVLTVNGLPPVAPTLAPKQWKIIEGEAAIYPKGYPCRTFH 182 T 0.00017 CI-B14_5a pdbhh F Eukaryota T 6yj4 33 GA g F2Z673_YARLI Subunit NESM of NADH:Ubiquinone Oxidoreductase (Complex I) MLKLHYRNFITAQHSTTNTTPTMIASVCKRAGLRAGPRAYPGVRQFALRAYNEEKELALKQRLSQLPPPGKAFVTAEGEPRPAKEAELAELAEIAALYKTDRVGILDILLLGNKHARLYRDNTALLKDYYYNGRRILDKIPVKDKQTGKVTWEIKREGAEKEDWVNQMYFLYAPSLILLLIVMVYKSREDITFWAKKELDQRVLDKHPEINDAPENERDALIVERIIAGDYDKLASLQKKATPTPATLI 249 T 0.0013 ESSS pdbpercent F Eukaryota T 6yj4 34 HA h Q6C1R9_YARLI subunit NUNM of protein NADH:Ubiquinone Oxidoreductase (Complex I) MLRHTVRATQTLRQARNVRFGSHGHGPELTPAVPFFQPYVLKWAGVSLGLVAFYQFNSSYEAKNGHTWVETFFHPKSREDILNEEAKIVQALNNQRELTIKMHELKREEKDYAHSYSPLFSDPVPQGGSIGKAPGSSRE 139 T 0.033 DUF5950 pdb F Eukaryota T 6yj4 35 IA i A0A1H6Q311_YARLL Subunit NUUM of NADH:Ubiquinone Oxidoreductase (Complex I) MGGGRYPFPKDVISMTGGWWANPSNWKLNGLFATGIAVGLALWVSTATLPYTRRREGITSESDISKWNAAAGVWRERHGKISTGEAAESE 90 T 6 Sex_peptide pdbhh F Eukaryota T 6ylu 2 B B BLNK_HUMAN BLNKpT152 ARLTSTLPALTA 12 T 1.8 DUF1685 pdbhh F Eukaryota T 6ymx 12 L m COX26_YEAST Cytochrome c oxidase subunit 26, mitochondrial ESWVITEGRRLIPEIFQWSAVLSVCLGWPGAVYFFSKA 38 T 0.077 Phage_holin_Dp1 unppercent F Eukaryota T 6ymy 12 L m COX26_YEAST Cytochrome c oxidase subunit 26, mitochondrial ESWVITEGRRLIPEIFQWSAVLSVCLGWPGAVYFFSKA 38 T 0.077 Phage_holin_Dp1 unppercent F Eukaryota T 6yn0 2 B B FTSN_ECOLI Cell division protein FtsN LPPKPEERWRYIKELESRQ 19 T 0.54 TFIIA unp F Bacteria T 6yn1 5 DA,E,IA,J,NA,O,T,Y d,E,i,J,n,O,T,Y APLF_HUMAN APURINIC-APYRIMIDINIC ENDONUCLEASE APLF,PNK AND APTX-LIKE FHA DOMAIN-CONTAINING PROTEIN,XRCC1-INTERACTING PROTEIN 1 GLDEDNDNVGQPNEYDLNDSFLDDEEEDYEPTDEDSDWEPGKE 43 T 0.00014 HUN pdbpercent F Eukaryota T 6yns 2 AA,BA,CA,DA,EA,FA,GA,HA,IA,JA,M,N,O,P,Q,R,S,T,U,V,W,X,Y,Z c,d,e,f,g,h,i,j,k,l,N,O,Q,R,S,T,U,V,W,X,Y,Z,a,b CYAA_BORPE CYCLOLYSIN WGQRALQGAQAVAAAQRLVHAIAL 24 T 3.3 Neuropep_like pdbhh F Bacteria T 6ynu 2 B,D B,D CYAA_BORPE CYCLOLYSIN WGQRALQGAQAVAAAQRLVHAIAL 24 T 3.3 Neuropep_like pdbhh F Bacteria T 6ynw 4 M e I7MMW3_TETTS subunit epsilon MCIEFAFKKAGIPIVRNFLHSTEGVIYGLPQRVQRNLAINYTVKQYKEGKAVSAKTIKTLQEAFPSKGDTK 71 T 0.062 YopH_N pdbpercent F Eukaryota T 6ynx 1 A,T a,A Q951C1_TETTH subunit a MGRENVLPVHNDVYEDFVFTTPYFQPESTFKSVPKLFSDILLGGVEWVYTTSESVLAYDYKLWYLWSGVSNLDESFDMFFNQYWALSLSTSVFQLFYAVILDRYLSVLFQNTPYTNDWFRMMLHSKETALIWLYHPELSWHINGLNQFFTYFYGGILEFVYFDKSNPDMCILVHTLWIHLLILFLIFTGFVTILFSFYGNPNTEENTIDSDYLAASGTVEAEKEITSIDDYLGLVFAIAYVFGVFFYVHGWTSMLSHAVLLLSCYSIIIMFLFILGMPTLLLYDFGIFFLAYLKGAGKYISSVAEMMFDYTACLVFYIRILAQWIRVVLMVVTFISLSHYVSDFDITNSALIGSENQSDSMNELNTNFSMTYYILTVLPGKFIYWIYEILHTFFVVCSQFVAFFAIVFWLFLFLYTFFIIEKHEDFFSKKREERKKKLKELWNLKN 446 T 5.9 ETRAMP pdbpercent F Eukaryota T 6ynx 4 D,W f,F Q24I07_TETTS subunit f MSLHEKMQTDYLWVKDHSQADSWAKARTHGYNYIAHTVPNKKERYEMIWRSMGKSTDWELEKFRLGKKFPDRGNKRRWFKNLFRLIKNPMGYIFWKTYKARLAKPSLIVTSMFIGFTLGFIKLKAQSIAYSKKQYATLRAGKNIEGSGQVHFGYHDQKWGMPAIPMFQLMYYELPGNSIVVNPCRNQNYRLYFEMRKKLGILPA 204 T 4 DUF6249 pdbhh F Eukaryota T 6ynx 5 E,X i,I I7LZW2_TETTS subunit i/j MNPIQKAWLKILEPVSYVINEKMAKRTGIIGKLGRFFAIGPREYGVHPINRMFIFMNRKYMAFQAVALHRYSFVKSLTHNGFHMLRVFRHFAFVLPATVLAGLGLFVYWGDDNKCYSPDRFPYLKKRAGDMALPLNSLNQRTSAHYIEINAIYGAEMMKRYHKVWENIIEERSKATDQEKKTRYAHPSYQYSPLPVVSIPNVLNPLNLQ 209 T 11 mit_SMPDase pdbhh F Eukaryota T 6ynx 6 F,Y k,K I7LSX6_TETTS subunit k MWYKYFSKQSWNLRVWRKANLKYNQDDFGMTQPKYIARFGDFRFRLVRTEGALRGCMFFVGFGCFSIINYLYGRYGYIINESSQKRAAQDLLDNDMAADKILFKNRVGAPTRPLRSLDDMMAFLSGSATYDQLADYASYNHAMDVNQDQQAGLDSWMSEKDKNMVKYYQRSLGKKVEGI 179 T 5 RTP801_C pdbhh F Eukaryota T 6ynx 7 G,Z c,C Q950Y8_TETTH subunit 8 MITILDYLFLLDLNDDLTRKAVFEQVIIFIFIYCTMNFLAWSTVVELIWPTHFFNRRHSSSQEFIRFRTYTEVLLKISAYNDFFYVLNNYYYNQKLILKN 100 T 2.1 ATP-synt_B pdbhh F Eukaryota T 6ynx 8 AA,H G,g I7M8Q3_TETTS ATPTT3 MINRSTAFISKNLRQANLTQSSLAMKTQYNQMGFSSDNPYNKRWEYKWKHSYYTYPRDYEHTEVRKPQDSKDVPPIYFAYYKDFVDRWLPGMNMWWQRRHRIFDKFNVYFLPGMSLFFYQFADLALGFKIMAAFPLFLAYTRIRDKTLDPDFKETYLRDMIYQNPEITKYFNEETIHVLDYEFEYLPGYLCPEKFPEYQNKTWQFFNTDTAQAEGFFKFGDVESGATMTLKFKTMPIPGKFRYQVGEPFYFYDLRAEIKCDGVYKEVVLVDEKESLKKIRPFLFLI 286 T 35 EIID-AGA unphh F Eukaryota T 6ynx 9 BA,I H,h I7MCZ0_TETTS ATPTT4 MQQRKKIYLRQKRKIYIQLKNKEKKKNNQFIQKREKMGYKIRNKSIFWTRAGWKNNWHPKNFNAPRPSYGEFTMGIRCRNDHHSFLRYVQTYRNMSRHCKQYFLGDKQLEETFILGLRSLFLVPYDSQCLTDQIKHGGERRFVDQLDRDFELISYNTHPYQLFTYTVRNEHLAWKNEQYEKIQKGEKTFEQELLDYLDEQVLAEKAKLRDGQNFSIERMTEIALHVFRKARAGKVRPAQDVRGPDGNVNDFLEQRRPFEHPNPTGVTH 268 T 0.075 Staphylokinase pdb F Eukaryota T 6ynx 10 CA,J J,j Q228N4_TETTS ATPTT5 MSENKAPGQIYAYDIHNTHYPYVNIKQDSQTQLLASFRRSIASINPFSYRQVPSQDRAAFGLRWGNAWYAPNPYPNGIHFDRVFPTHYDPLAETNRTKANLQLIKYAPGNYSTLVVTSEKLPRPCIRTIQNYRRCQMVNGTEKCNSEAQDILAICPNWALDHMKEKVRFYTKALAINNQTYIRAMQVEEYNQGRTVADVAPKTWIHGTRQHLRPDTMWADDRYTNITQTEINEAIKRVEARKAREHEKKPVEQANVNANTGEQPVRVEKSLYP 273 T 4.7 CX9C pdbhh F Eukaryota T 6ynx 11 DA,K L,l I7MCQ6_TETTS ATPTT6 MPVKEGQAKLWFSTKEEADAYDDKMISNIELKSQDYEDENFSPVFNRKTQEYFLEPSEKFKSDFAELLRPLRSLSFNQVVDRYVLIPPNHTFYRNWTYEKFLGGFGLSYLILRELPLRNFYARVFVMYAFAAKVLDHLGNPFPFSGHGQIVAAADRWNHWDVRCYDNVMKALKYIRIPTVQNNIPEATRWYGRQPGHLLRADTYWIPNLVSQRFAKHQPAHWDGTQNMPIFRLADPKHKDSYMVQFR 247 T 0.011 YebG pdb F Eukaryota T 6ynx 12 EA,L M,m I7M980_TETTS ATPTT7 MDNYFTAITLLGLRDQNLPPFKDARLQRYKSIKKMIDLIETTTKLAPPMPVELFMLNPTDPEWDDDMTYPTITHATALYKSSALAGNLFLYAYNYNNFTANIRLRTMRYLFPVVSLAIFGNIYWDYRSQLVKVNLFDEYIQARAQELVKQNEYLLEHEDVKRYVWWYEDLKETLARVHRQANNHKACDFKDSEIILQDFIRRYTNPKDNLPIKFHPQGQTF 221 T 1.7 PDR_CDR pdbpssm F Eukaryota T 6ynx 13 FA,M N,n I7LVK6_TETTS ATPTT8 MEGFIQNKRKKEKEGEEEEESKEKRKQINQLNKQKQEEEKIYQQKKDQKRKKYLYQRKEMTIFAETWEASEYQYRNKANLKTLPVNHLGKLAELKFDFVEYKAHQLIACHLYERMTIHCMNQYGLFKDFYRPECLDAQYYFKTCVELNAAYGIQKKFFPEHFVGSPYARPVPQFQQLGL 179 T 0.14 Pet20 pdbpssm F Eukaryota T 6ynx 14 GA,N O,o Q24HK1_TETTS ATPTT9 MKQKINKLLKNKGVQDKYKYLSKLILLDQEIKGKIKRKNKKEKQKRKNKLILEEMQNTTNIVHVPVHMGHTHYFDYIDSFPKLKEGPTLEENHITNQKILREQLISGQQGLEQNLCLRNCFKLSQKRYIEFCLDRKCGGADFQRAATILGYTKN 154 T 3.7 Arteri_nucleo pdbpssm F Eukaryota T 6ynx 15 HA,O P,p I7LZE5_TETTS ATPTT10 MSNIFLELQDGDKTVYTHTSLIEESKQEQIQAIYDKVPQWTNGGRFLGFWLSMEAVNRVQSVAKLPIYYRAGIVATSTLLGGLVSSLVFWKSGNENQVAKLANGAPVYLKKWEVPELSKLYFFLDDDNNFKPSLNHHAVTQGRQYYKIYQHN 152 T 4.8 Glyco_transf_43 pdbhh F Eukaryota T 6ynx 17 JA,Q R,r I7M0G0_TETTS ATPTT12 MSQDPKIVNPQLWPNPNKLRFADLYKYQGVEMKKINDSIKNYKAAKFYIGGILGGCLVFKFFIDAAVDKYIFGENGNGGKFLEMQTINSNYDYYYNRQFQRMRYLTEDPAGDDPLQKTKDEHLVDLGFIPKVFGANVEVRKRAPHDKYL 149 T 0.029 Disaggr_repeat pdbpercent F Eukaryota T 6ynx 18 KA,R S,s I7MLU7_TETTS ATPTT13 MNSLSSKKANSLVFKSIRNFTLQWGSLAERPMVDRVMSTSTWPVPYYQRLFKAYPIREKKDKMSLLLSDIDIDDTNWYQAKDFLRGSFRGRQIVDYVENNIASNTYILIQQDVANMAKAYVHDICGYIDVANKENVRILSKGDLI 145 T 20 Tryp_FSAP unphh F Eukaryota T 6ynx 20 MA,NA i1,i2 I7M7C0_TETTS Inhibitor of F1 (IF1) MNRSVNIAKNLIQTYRAMSVQSRFAFSTREEEWLDKRTKSQEKVYFDQEDRKAMKRLLEKLNTTSKFVEDSEYLAPQNLEVENILKRYHINYTQALIDELVDWKTGKN 108 T 0.043 DUF5673 pdbpssm F Eukaryota T 6yny 1 A,T a,A Q951C1_TETTH subunit a MGRENVLPVHNDVYEDFVFTTPYFQPESTFKSVPKLFSDILLGGVEWVYTTSESVLAYDYKLWYLWSGVSNLDESFDMFFNQYWALSLSTSVFQLFYAVILDRYLSVLFQNTPYTNDWFRMMLHSKETALIWLYHPELSWHINGLNQFFTYFYGGILEFVYFDKSNPDMCILVHTLWIHLLILFLIFTGFVTILFSFYGNPNTEENTIDSDYLAASGTVEAEKEITSIDDYLGLVFAIAYVFGVFFYVHGWTSMLSHAVLLLSCYSIIIMFLFILGMPTLLLYDFGIFFLAYLKGAGKYISSVAEMMFDYTACLVFYIRILAQWIRVVLMVVTFISLSHYVSDFDITNSALIGSENQSDSMNELNTNFSMTYYILTVLPGKFIYWIYEILHTFFVVCSQFVAFFAIVFWLFLFLYTFFIIEKHEDFFSKKREERKKKLKELWNLKN 446 T 5.9 ETRAMP pdbpercent F Eukaryota T 6yny 2 B,U b,B I7MJ84_TETTS subunit b MHSTLRVFTKNNCLSFTNMNRFSTAAQVAQANYSKFRADYSASVAAFQQRIKTIEKENTGSMKKPMAKAYEHPYNSEHHPLNFSAVKIAETFHDFIGPEQVSPHYESFAMSRKFLLTFWGGFFVLNFGMATVDLNWIMKSTYIPWIFWFQLMYFYVEGKNSMFMPLLQRFYRRAAANEIFTMEAFYHENIENKLRNLMRITKGQLEYWDIHTSYGEIRADSINNFLANEYLRLQSHITSRALNILKQAQAYETMNQAALLQKLIDDATSAIDNALKGDKKAEVLARSLDSAIDGLSKGYMDYQNDPLLPLILSSIEANVKKITTLSAQEQANLIGLTAEQLKSIKENDVRARKEFLESQPKLDNNLKNIESVKKILATWGK 381 T 0.12 Tipalpha pdbpssm F Eukaryota T 6yny 4 D,W f,F Q24I07_TETTS subunit f MSLHEKMQTDYLWVKDHSQADSWAKARTHGYNYIAHTVPNKKERYEMIWRSMGKSTDWELEKFRLGKKFPDRGNKRRWFKNLFRLIKNPMGYIFWKTYKARLAKPSLIVTSMFIGFTLGFIKLKAQSIAYSKKQYATLRAGKNIEGSGQVHFGYHDQKWGMPAIPMFQLMYYELPGNSIVVNPCRNQNYRLYFEMRKKLGILPA 204 T 4 DUF6249 pdbhh F Eukaryota T 6yny 5 E,X i,I I7LZW2_TETTS subunit i/j MNPIQKAWLKILEPVSYVINEKMAKRTGIIGKLGRFFAIGPREYGVHPINRMFIFMNRKYMAFQAVALHRYSFVKSLTHNGFHMLRVFRHFAFVLPATVLAGLGLFVYWGDDNKCYSPDRFPYLKKRAGDMALPLNSLNQRTSAHYIEINAIYGAEMMKRYHKVWENIIEERSKATDQEKKTRYAHPSYQYSPLPVVSIPNVLNPLNLQ 209 T 11 mit_SMPDase pdbhh F Eukaryota T 6yny 6 F,Y k,K I7LSX6_TETTS subunit k MWYKYFSKQSWNLRVWRKANLKYNQDDFGMTQPKYIARFGDFRFRLVRTEGALRGCMFFVGFGCFSIINYLYGRYGYIINESSQKRAAQDLLDNDMAADKILFKNRVGAPTRPLRSLDDMMAFLSGSATYDQLADYASYNHAMDVNQDQQAGLDSWMSEKDKNMVKYYQRSLGKKVEGI 179 T 5 RTP801_C pdbhh F Eukaryota T 6yny 7 G,Z c,C Q950Y8_TETTH subunit 8 MITILDYLFLLDLNDDLTRKAVFEQVIIFIFIYCTMNFLAWSTVVELIWPTHFFNRRHSSSQEFIRFRTYTEVLLKISAYNDFFYVLNNYYYNQKLILKN 100 T 2.1 ATP-synt_B pdbhh F Eukaryota T 6yny 8 AA,H G,g I7M8Q3_TETTS ATPTT3 MINRSTAFISKNLRQANLTQSSLAMKTQYNQMGFSSDNPYNKRWEYKWKHSYYTYPRDYEHTEVRKPQDSKDVPPIYFAYYKDFVDRWLPGMNMWWQRRHRIFDKFNVYFLPGMSLFFYQFADLALGFKIMAAFPLFLAYTRIRDKTLDPDFKETYLRDMIYQNPEITKYFNEETIHVLDYEFEYLPGYLCPEKFPEYQNKTWQFFNTDTAQAEGFFKFGDVESGATMTLKFKTMPIPGKFRYQVGEPFYFYDLRAEIKCDGVYKEVVLVDEKESLKKIRPFLFLI 286 T 35 EIID-AGA unphh F Eukaryota T 6yny 9 BA,I H,h I7MCZ0_TETTS ATPTT4 MQQRKKIYLRQKRKIYIQLKNKEKKKNNQFIQKREKMGYKIRNKSIFWTRAGWKNNWHPKNFNAPRPSYGEFTMGIRCRNDHHSFLRYVQTYRNMSRHCKQYFLGDKQLEETFILGLRSLFLVPYDSQCLTDQIKHGGERRFVDQLDRDFELISYNTHPYQLFTYTVRNEHLAWKNEQYEKIQKGEKTFEQELLDYLDEQVLAEKAKLRDGQNFSIERMTEIALHVFRKARAGKVRPAQDVRGPDGNVNDFLEQRRPFEHPNPTGVTH 268 T 0.075 Staphylokinase pdb F Eukaryota T 6yny 10 CA,J J,j Q228N4_TETTS ATPTT5 MSENKAPGQIYAYDIHNTHYPYVNIKQDSQTQLLASFRRSIASINPFSYRQVPSQDRAAFGLRWGNAWYAPNPYPNGIHFDRVFPTHYDPLAETNRTKANLQLIKYAPGNYSTLVVTSEKLPRPCIRTIQNYRRCQMVNGTEKCNSEAQDILAICPNWALDHMKEKVRFYTKALAINNQTYIRAMQVEEYNQGRTVADVAPKTWIHGTRQHLRPDTMWADDRYTNITQTEINEAIKRVEARKAREHEKKPVEQANVNANTGEQPVRVEKSLYP 273 T 4.7 CX9C pdbhh F Eukaryota T 6yny 11 DA,K L,l I7MCQ6_TETTS ATPTT6 MPVKEGQAKLWFSTKEEADAYDDKMISNIELKSQDYEDENFSPVFNRKTQEYFLEPSEKFKSDFAELLRPLRSLSFNQVVDRYVLIPPNHTFYRNWTYEKFLGGFGLSYLILRELPLRNFYARVFVMYAFAAKVLDHLGNPFPFSGHGQIVAAADRWNHWDVRCYDNVMKALKYIRIPTVQNNIPEATRWYGRQPGHLLRADTYWIPNLVSQRFAKHQPAHWDGTQNMPIFRLADPKHKDSYMVQFR 247 T 0.011 YebG pdb F Eukaryota T 6yny 12 EA,L M,m I7M980_TETTS ATPTT7 MDNYFTAITLLGLRDQNLPPFKDARLQRYKSIKKMIDLIETTTKLAPPMPVELFMLNPTDPEWDDDMTYPTITHATALYKSSALAGNLFLYAYNYNNFTANIRLRTMRYLFPVVSLAIFGNIYWDYRSQLVKVNLFDEYIQARAQELVKQNEYLLEHEDVKRYVWWYEDLKETLARVHRQANNHKACDFKDSEIILQDFIRRYTNPKDNLPIKFHPQGQTF 221 T 1.7 PDR_CDR pdbpssm F Eukaryota T 6yny 13 FA,M N,n I7LVK6_TETTS ATPTT8 MEGFIQNKRKKEKEGEEEEESKEKRKQINQLNKQKQEEEKIYQQKKDQKRKKYLYQRKEMTIFAETWEASEYQYRNKANLKTLPVNHLGKLAELKFDFVEYKAHQLIACHLYERMTIHCMNQYGLFKDFYRPECLDAQYYFKTCVELNAAYGIQKKFFPEHFVGSPYARPVPQFQQLGL 179 T 0.14 Pet20 pdbpssm F Eukaryota T 6yny 14 GA,N O,o Q24HK1_TETTS ATPTT9 MKQKINKLLKNKGVQDKYKYLSKLILLDQEIKGKIKRKNKKEKQKRKNKLILEEMQNTTNIVHVPVHMGHTHYFDYIDSFPKLKEGPTLEENHITNQKILREQLISGQQGLEQNLCLRNCFKLSQKRYIEFCLDRKCGGADFQRAATILGYTKN 154 T 3.7 Arteri_nucleo pdbpssm F Eukaryota T 6yny 15 HA,O P,p I7LZE5_TETTS ATPTT10 MSNIFLELQDGDKTVYTHTSLIEESKQEQIQAIYDKVPQWTNGGRFLGFWLSMEAVNRVQSVAKLPIYYRAGIVATSTLLGGLVSSLVFWKSGNENQVAKLANGAPVYLKKWEVPELSKLYFFLDDDNNFKPSLNHHAVTQGRQYYKIYQHN 152 T 4.8 Glyco_transf_43 pdbhh F Eukaryota T 6yny 17 JA,Q R,r I7M0G0_TETTS ATPTT12 MSQDPKIVNPQLWPNPNKLRFADLYKYQGVEMKKINDSIKNYKAAKFYIGGILGGCLVFKFFIDAAVDKYIFGENGNGGKFLEMQTINSNYDYYYNRQFQRMRYLTEDPAGDDPLQKTKDEHLVDLGFIPKVFGANVEVRKRAPHDKYL 149 T 0.029 Disaggr_repeat pdbpercent F Eukaryota T 6yny 18 KA,R S,s I7MLU7_TETTS ATPTT13 MNSLSSKKANSLVFKSIRNFTLQWGSLAERPMVDRVMSTSTWPVPYYQRLFKAYPIREKKDKMSLLLSDIDIDDTNWYQAKDFLRGSFRGRQIVDYVENNIASNTYILIQQDVANMAKAYVHDICGYIDVANKENVRILSKGDLI 145 T 20 Tryp_FSAP unphh F Eukaryota T 6yny 20 MA,NA i2,i1 I7M7C0_TETTS Inhibitor of F1 (IF1) MNRSVNIAKNLIQTYRAMSVQSRFAFSTREEEWLDKRTKSQEKVYFDQEDRKAMKRLLEKLNTTSKFVEDSEYLAPQNLEVENILKRYHINYTQALIDELVDWKTGKN 108 T 0.043 DUF5673 pdbpssm F Eukaryota T 6yny 28 CC,IB e2,e1 I7MMW3_TETTS subunit epsilon MCIEFAFKKAGIPIVRNFLHSTEGVIYGLPQRVQRNLAINYTVKQYKEGKAVSAKTIKTLQEAFPSKGDTK 71 T 0.062 YopH_N pdbpercent F Eukaryota T 6ynz 1 A,DC,T,WC a,a3,A,A3 Q951C1_TETTH Ymf66 MGRENVLPVHNDVYEDFVFTTPYFQPESTFKSVPKLFSDILLGGVEWVYTTSESVLAYDYKLWYLWSGVSNLDESFDMFFNQYWALSLSTSVFQLFYAVILDRYLSVLFQNTPYTNDWFRMMLHSKETALIWLYHPELSWHINGLNQFFTYFYGGILEFVYFDKSNPDMCILVHTLWIHLLILFLIFTGFVTILFSFYGNPNTEENTIDSDYLAASGTVEAEKEITSIDDYLGLVFAIAYVFGVFFYVHGWTSMLSHAVLLLSCYSIIIMFLFILGMPTLLLYDFGIFFLAYLKGAGKYISSVAEMMFDYTACLVFYIRILAQWIRVVLMVVTFISLSHYVSDFDITNSALIGSENQSDSMNELNTNFSMTYYILTVLPGKFIYWIYEILHTFFVVCSQFVAFFAIVFWLFLFLYTFFIIEKHEDFFSKKREERKKKLKELWNLKN 446 T 5.9 ETRAMP pdbpercent F Eukaryota T 6ynz 2 B,EC,U,XC b,b3,B,B3 I7MJ84_TETTS subunit b MHSTLRVFTKNNCLSFTNMNRFSTAAQVAQANYSKFRADYSASVAAFQQRIKTIEKENTGSMKKPMAKAYEHPYNSEHHPLNFSAVKIAETFHDFIGPEQVSPHYESFAMSRKFLLTFWGGFFVLNFGMATVDLNWIMKSTYIPWIFWFQLMYFYVEGKNSMFMPLLQRFYRRAAANEIFTMEAFYHENIENKLRNLMRITKGQLEYWDIHTSYGEIRADSINNFLANEYLRLQSHITSRALNILKQAQAYETMNQAALLQKLIDDATSAIDNALKGDKKAEVLARSLDSAIDGLSKGYMDYQNDPLLPLILSSIEANVKKITTLSAQEQANLIGLTAEQLKSIKENDVRARKEFLESQPKLDNNLKNIESVKKILATWGK 381 T 0.12 Tipalpha pdbpssm F Eukaryota T 6ynz 4 D,GC,W,ZC f,f3,F,F3 Q24I07_TETTS subunit f MSLHEKMQTDYLWVKDHSQADSWAKARTHGYNYIAHTVPNKKERYEMIWRSMGKSTDWELEKFRLGKKFPDRGNKRRWFKNLFRLIKNPMGYIFWKTYKARLAKPSLIVTSMFIGFTLGFIKLKAQSIAYSKKQYATLRAGKNIEGSGQVHFGYHDQKWGMPAIPMFQLMYYELPGNSIVVNPCRNQNYRLYFEMRKKLGILPA 204 T 4 DUF6249 pdbhh F Eukaryota T 6ynz 5 AD,E,HC,X I3,i,i3,I I7LZW2_TETTS subunit i/j MNPIQKAWLKILEPVSYVINEKMAKRTGIIGKLGRFFAIGPREYGVHPINRMFIFMNRKYMAFQAVALHRYSFVKSLTHNGFHMLRVFRHFAFVLPATVLAGLGLFVYWGDDNKCYSPDRFPYLKKRAGDMALPLNSLNQRTSAHYIEINAIYGAEMMKRYHKVWENIIEERSKATDQEKKTRYAHPSYQYSPLPVVSIPNVLNPLNLQ 209 T 11 mit_SMPDase pdbhh F Eukaryota T 6ynz 6 BD,F,IC,Y K3,k,k3,K I7LSX6_TETTS subunit k MWYKYFSKQSWNLRVWRKANLKYNQDDFGMTQPKYIARFGDFRFRLVRTEGALRGCMFFVGFGCFSIINYLYGRYGYIINESSQKRAAQDLLDNDMAADKILFKNRVGAPTRPLRSLDDMMAFLSGSATYDQLADYASYNHAMDVNQDQQAGLDSWMSEKDKNMVKYYQRSLGKKVEGI 179 T 5 RTP801_C pdbhh F Eukaryota T 6ynz 7 CD,G,JC,Z C3,c,c3,C Q950Y8_TETTH Ymf56 MITILDYLFLLDLNDDLTRKAVFEQVIIFIFIYCTMNFLAWSTVVELIWPTHFFNRRHSSSQEFIRFRTYTEVLLKISAYNDFFYVLNNYYYNQKLILKN 100 T 2.1 ATP-synt_B pdbhh F Eukaryota T 6ynz 8 AA,DD,H,KC G,G3,g,g3 I7M8Q3_TETTS ATPTT3 MINRSTAFISKNLRQANLTQSSLAMKTQYNQMGFSSDNPYNKRWEYKWKHSYYTYPRDYEHTEVRKPQDSKDVPPIYFAYYKDFVDRWLPGMNMWWQRRHRIFDKFNVYFLPGMSLFFYQFADLALGFKIMAAFPLFLAYTRIRDKTLDPDFKETYLRDMIYQNPEITKYFNEETIHVLDYEFEYLPGYLCPEKFPEYQNKTWQFFNTDTAQAEGFFKFGDVESGATMTLKFKTMPIPGKFRYQVGEPFYFYDLRAEIKCDGVYKEVVLVDEKESLKKIRPFLFLI 286 T 35 EIID-AGA unphh F Eukaryota T 6ynz 9 BA,ED,I,LC H,H3,h,h3 I7MCZ0_TETTS ATPTT4 MQQRKKIYLRQKRKIYIQLKNKEKKKNNQFIQKREKMGYKIRNKSIFWTRAGWKNNWHPKNFNAPRPSYGEFTMGIRCRNDHHSFLRYVQTYRNMSRHCKQYFLGDKQLEETFILGLRSLFLVPYDSQCLTDQIKHGGERRFVDQLDRDFELISYNTHPYQLFTYTVRNEHLAWKNEQYEKIQKGEKTFEQELLDYLDEQVLAEKAKLRDGQNFSIERMTEIALHVFRKARAGKVRPAQDVRGPDGNVNDFLEQRRPFEHPNPTGVTH 268 T 0.075 Staphylokinase pdb F Eukaryota T 6ynz 10 CA,FD,J,MC J,J3,j,j3 Q228N4_TETTS ATPTT5 MSENKAPGQIYAYDIHNTHYPYVNIKQDSQTQLLASFRRSIASINPFSYRQVPSQDRAAFGLRWGNAWYAPNPYPNGIHFDRVFPTHYDPLAETNRTKANLQLIKYAPGNYSTLVVTSEKLPRPCIRTIQNYRRCQMVNGTEKCNSEAQDILAICPNWALDHMKEKVRFYTKALAINNQTYIRAMQVEEYNQGRTVADVAPKTWIHGTRQHLRPDTMWADDRYTNITQTEINEAIKRVEARKAREHEKKPVEQANVNANTGEQPVRVEKSLYP 273 T 4.7 CX9C pdbhh F Eukaryota T 6ynz 11 DA,GD,K,NC L,L3,l,l3 I7MCQ6_TETTS ATPTT6 MPVKEGQAKLWFSTKEEADAYDDKMISNIELKSQDYEDENFSPVFNRKTQEYFLEPSEKFKSDFAELLRPLRSLSFNQVVDRYVLIPPNHTFYRNWTYEKFLGGFGLSYLILRELPLRNFYARVFVMYAFAAKVLDHLGNPFPFSGHGQIVAAADRWNHWDVRCYDNVMKALKYIRIPTVQNNIPEATRWYGRQPGHLLRADTYWIPNLVSQRFAKHQPAHWDGTQNMPIFRLADPKHKDSYMVQFR 247 T 0.011 YebG pdb F Eukaryota T 6ynz 12 EA,HD,L,OC M,M3,m,m3 I7M980_TETTS ATPTT7 MDNYFTAITLLGLRDQNLPPFKDARLQRYKSIKKMIDLIETTTKLAPPMPVELFMLNPTDPEWDDDMTYPTITHATALYKSSALAGNLFLYAYNYNNFTANIRLRTMRYLFPVVSLAIFGNIYWDYRSQLVKVNLFDEYIQARAQELVKQNEYLLEHEDVKRYVWWYEDLKETLARVHRQANNHKACDFKDSEIILQDFIRRYTNPKDNLPIKFHPQGQTF 221 T 1.7 PDR_CDR pdbpssm F Eukaryota T 6ynz 13 FA,ID,M,PC N,N3,n,n3 I7LVK6_TETTS ATPTT8 MEGFIQNKRKKEKEGEEEEESKEKRKQINQLNKQKQEEEKIYQQKKDQKRKKYLYQRKEMTIFAETWEASEYQYRNKANLKTLPVNHLGKLAELKFDFVEYKAHQLIACHLYERMTIHCMNQYGLFKDFYRPECLDAQYYFKTCVELNAAYGIQKKFFPEHFVGSPYARPVPQFQQLGL 179 T 0.14 Pet20 pdbpssm F Eukaryota T 6ynz 14 GA,JD,N,QC O,O3,o,o3 Q24HK1_TETTS ATPTT9 MKQKINKLLKNKGVQDKYKYLSKLILLDQEIKGKIKRKNKKEKQKRKNKLILEEMQNTTNIVHVPVHMGHTHYFDYIDSFPKLKEGPTLEENHITNQKILREQLISGQQGLEQNLCLRNCFKLSQKRYIEFCLDRKCGGADFQRAATILGYTKN 154 T 3.7 Arteri_nucleo pdbpssm F Eukaryota T 6ynz 15 HA,KD,O,RC P,P3,p,p3 I7LZE5_TETTS ATPTT10 MSNIFLELQDGDKTVYTHTSLIEESKQEQIQAIYDKVPQWTNGGRFLGFWLSMEAVNRVQSVAKLPIYYRAGIVATSTLLGGLVSSLVFWKSGNENQVAKLANGAPVYLKKWEVPELSKLYFFLDDDNNFKPSLNHHAVTQGRQYYKIYQHN 152 T 4.8 Glyco_transf_43 pdbhh F Eukaryota T 6ynz 17 JA,MD,Q,TC R,R3,r,r3 I7M0G0_TETTS ATPTT12 MSQDPKIVNPQLWPNPNKLRFADLYKYQGVEMKKINDSIKNYKAAKFYIGGILGGCLVFKFFIDAAVDKYIFGENGNGGKFLEMQTINSNYDYYYNRQFQRMRYLTEDPAGDDPLQKTKDEHLVDLGFIPKVFGANVEVRKRAPHDKYL 149 T 0.029 Disaggr_repeat pdbpercent F Eukaryota T 6ynz 18 KA,ND,R,UC S,S3,s,s3 I7MLU7_TETTS ATPTT13 MNSLSSKKANSLVFKSIRNFTLQWGSLAERPMVDRVMSTSTWPVPYYQRLFKAYPIREKKDKMSLLLSDIDIDDTNWYQAKDFLRGSFRGRQIVDYVENNIASNTYILIQQDVANMAKAYVHDICGYIDVANKENVRILSKGDLI 145 T 20 Tryp_FSAP unphh F Eukaryota T 6ynz 20 MA,NA,PD,QD i2,i1,i5,i4 I7M7C0_TETTS Inhibitor of F1 (IF1) MNRSVNIAKNLIQTYRAMSVQSRFAFSTREEEWLDKRTKSQEKVYFDQEDRKAMKRLLEKLNTTSKFVEDSEYLAPQNLEVENILKRYHINYTQALIDELVDWKTGKN 108 T 0.043 DUF5673 pdbpssm F Eukaryota T 6ynz 28 CC,FF,IB,LE e2,e5,e1,e4 I7MMW3_TETTS subunit epsilon MCIEFAFKKAGIPIVRNFLHSTEGVIYGLPQRVQRNLAINYTVKQYKEGKAVSAKTIKTLQEAFPSKGDTK 71 T 0.062 YopH_N pdbpercent F Eukaryota T 6yo0 5 I i1 I7M7C0_TETTS Inhibitor of F1 (IF1) MNRSVNIAKNLIQTYRAMSVQSRFAFSTREEEWLDKRTKSQEKVYFDQEDRKAMKRLLEKLNTTSKFVEDSEYLAPQNLEVENILKRYHINYTQALIDELVDWKTGKN 108 T 0.043 DUF5673 pdbpssm F Eukaryota T 6yo0 6 J s I7MLU7_TETTS ATPTT13 MNSLSSKKANSLVFKSIRNFTLQWGSLAERPMVDRVMSTSTWPVPYYQRLFKAYPIREKKDKMSLLLSDIDIDDTNWYQAKDFLRGSFRGRQIVDYVENNIASNTYILIQQDVANMAKAYVHDICGYIDVANKENVRILSKGDLI 145 T 20 Tryp_FSAP unphh F Eukaryota T 6yo0 7 K b I7MJ84_TETTS subunit b MHSTLRVFTKNNCLSFTNMNRFSTAAQVAQANYSKFRADYSASVAAFQQRIKTIEKENTGSMKKPMAKAYEHPYNSEHHPLNFSAVKIAETFHDFIGPEQVSPHYESFAMSRKFLLTFWGGFFVLNFGMATVDLNWIMKSTYIPWIFWFQLMYFYVEGKNSMFMPLLQRFYRRAAANEIFTMEAFYHENIENKLRNLMRITKGQLEYWDIHTSYGEIRADSINNFLANEYLRLQSHITSRALNILKQAQAYETMNQAALLQKLIDDATSAIDNALKGDKKAEVLARSLDSAIDGLSKGYMDYQNDPLLPLILSSIEANVKKITTLSAQEQANLIGLTAEQLKSIKENDVRARKEFLESQPKLDNNLKNIESVKKILATWGK 381 T 0.12 Tipalpha pdbpssm F Eukaryota T 6yo5 2 D GGG ALA-HIS-ALA AHA 3 T 350 DUF4258 pdbhh F F 6yo8 2 E,F,G,H E,F,G,H GCR_HUMAN GR,NUCLEAR RECEPTOR SUBFAMILY 3 GROUP C MEMBER 1 KTIVPATLPQLTP 13 T 5.7 DUF2064 pdbhh F Eukaryota T 6yoo 2 B B SAM50_HUMAN TRANSFORMATION-RELATED GENE 3 PROTEIN,TRG-3 EEAEFVEVEPEA 12 T 35 PIF6 pdbhh F Eukaryota F 6yp6 1 A A G3CFL3_9CAUD 933WP42, VB_24B_21 GVTTLLSYLASESEGSLKVQGWSASGGRAEVVSDAEGTGGKAVKLTKEAGKSSWVLEYAAGNGAALLQKGGQIRCRFKVSGALAANQYVMAFYWPVSSLPQGVALTGDGGNNLLAAFYIQTDAKDLNVMYHNAKVATNNLKLGTFGAFDNEWHTLAFRFAGNNSLQVTPVIDGQDGTPFTLTQSPVSAFAADKLHVTDITRGATYPVLIDSIAVEVNS 218 T 2.4 Sial-lect-inser unphh T Viruses T 6ypc 3 C T CENPT_YEAST CENP-T HOMOLOG,CO-PURIFIED WITH NNF1 PROTEIN 1,CONSTITUTIVE CENTROMERE-ASSOCIATED NETWORK PROTEIN CNN1 MSTPRKAAGNNENTEVSEIRTPFRERALEEQRLKDEVLIRNTPGYRKLLSASTKSHDILNKDPNEVRSFLQDLSQVLARKSQGNDTTTNKTQARNLIDELAYEESQPEENELLRSRSEKLTDNNIGNETQPDYTSLSQTVFAKLQERDKGLKSRKIDPIIIQDVPTTGHEDELTVHSPDKANSISMEVLRTSPSIGMDQVDEPPVRDPVPISITQQEEPLSEDLPSDDKEETEEAENEDYSFENTSDENLDDIGNDPIRLNVPAVRRSSIKPLQIMDLKHLTRQFLNENRIILPKQTWSTIQEESLNIMDFLKQKIGTLQKQELVDSFIDMGIINNVDDMFELAHELLPLELQSRIESYLFENLYFQ 367 T 0.0019 CENP-T_C unphh F Eukaryota T 6ypc 4 D W CENPW_YEAST CENP-W HOMOLOG,CONSTITUTIVE CENTROMERE-ASSOCIATED NETWORK PROTEIN WIP1,W-LIKE PROTEIN 1 MDTEALANYLLRQLSLDAEENKLEDLLQRQNEDQESSQEYNKKLLLACGFQAILRKILLDARTRATAEGLREVYPYHIEAATQAFLDSQ 89 T 4.4E-05 CENP-W pdbhh F Eukaryota T 6yqf 1 A,B A,B SYCE2_HUMAN CENTRAL ELEMENT SYNAPTONEMAL COMPLEX PROTEIN 1 GSMGLYFSSLDSSIDILQKRAQELIENINKSRQKDHALMTNFRNSLKTKVSDLTEKLEERIYQIYNDHNKIIQEKLQEFTQKMAKISHLETELKQVCHSVETVYKDLCLQPE 112 T 0.00044 Dynamitin unppssm F Eukaryota T 6yqx 1 A A de novo designed TIM barrel DeNovoTIM13 MDVDEMLKQVEILRRLGAKQIAVRSDDWRILQEALKKGGDILIVDATDVDEMLKQVEILRRLGAKQIAVRSDDWRILQEALKKGGDILIVDATDVDEMLKQVEILRRLGAKQIAVRSDDWRILQEALKKGGDILIVDATDVDEMLKQVEILRRLGAKQIAVRSDDWRILQEALKKGGDILIVDATGLEHHHHHH 194 T 0.00012 NanE pdbhh F T 6yqy 1 A A de novo designed TIM barrel sTIM11noCys MDKDEAWKQVEQLRREGATQIAYRSDDWRDLKEAWKKGADILIVDATDKDEAWKQVEQLRREGATQIAYRSDDWRDLKEAWKKGADILIVDATDKDEAWKQVEQLRREGATQIAYRSDDWRDLKEAWKKGADILIVDATDKDEAWKQVEQLRREGATQIAYRSDDWRDLKEAWKKGADILIVDATGLEHHHHHH 194 T 0.00015 NanE pdbhh F T 6yr5 2 B,D,F,H O,P,Q,R MDM4_HUMAN DOUBLE MINUTE 4 PROTEIN,MDM2-LIKE P53-BINDING PROTEIN,PROTEIN MDMX,P53-BINDING PROTEIN MDM4 DCRRTISAPVVRPK 14 T 0.93 DUF6143 unppercent F Eukaryota T 6yr6 2 B,D,F,H B,D,F,H MDM2_HUMAN hDM2-186 QRKRHKSDSISLS 13 T 1.4 NTF3_N pdbhh F Eukaryota T 6yr7 2 C,D Q,C MDM4_HUMAN DOUBLE MINUTE 4 PROTEIN,MDM2-LIKE P53-BINDING PROTEIN,PROTEIN MDMX,P53-BINDING PROTEIN MDM4 SKLTHSLSTSDITAIPEKENEGNDVPDCRRTISAPVVRPK 40 T 0.01 Cript unp F Eukaryota T 6yro 1 A,B,C,D,E D,A,B,C,E G5DSS1_STRSU SadP MHHHHHHSSGLVPRGSHMKQQSPLIQTSNADYKSGKDQEKLRTSVSINLLKAEEGQIQWKVTFDTSEWSFNVKHGGVYFILPNGLDLTKIVDNNQHDITASFPTDINDYRNSGQEKYRFFSSKQGLDNENGFNSQWNWSAGQANPSETVNSWKSGNRLSKIYFIDQITDTTELTYTLTAKVTEPNQQSFPLLAVMKSFTYTNSKSTEVTSLGAREITLEKEKT 223 T 2.4 DUF5377 unphh F Bacteria T 6ys8 2 C,D,E,F,G G,F,E,D,C Q5EGM4_FLAJO PROTEIN INVOLVED IN GLIDING MOTILITY GLDL MALLSKKVMNFAYGMGAAVVIVGALFKITHFEIGPLTGTVMLSIGLLTEALIFALSAFEPVEDELDWTLVYPELANGQARKKEAKAETATDAQGLLSQKLDAMLKEAKVDGELMASLGNSIKNFEGAAKAISPTVDSIAGQKKYAEEMSMAAAQMESLNSLYKVQLESASRNAQANSEIAENAAKLKEQMASMTANIASLNSVYGGMLSAMSNKG 215 T 0.0059 TPR_MLP1_2 pdbpssm F Bacteria T 6yse 1 A A A9J6U1_BPLUZ GP4 MKSPYEAAHERALMVNRLQKLTRMLRVHPDPKWKQEQQELIKRLKK 46 T 0.79 CBF_beta pdbhh T Viruses T 6ysr 55 CB v P-site fMet-Phe-tRNA(Phe) MF 2 T 120 KcnmB2_inactiv pdbhh F F 6ysz 1 A,B,C,D,E,F A,B,C,D,E,F GP15_BPT7 GENE PRODUCT 15,GP15 MRGSHHHHHHGMASMTGGQQMGRDLYDDDDKDPSSMSKIESALQAAQPGLSRLRGGAGGMGYRAATTQAEQPRSSLLDTIGRFAKAGADMYTAKEQRARDLADERSNEIIRKLTPEQRREALNNGTLLYQDDPYAMEALRVKTGRNAAYLVDDDVMQKIKEGVFRTREEMEEYRHSRLQEGAKVYAEQFGIDPEDVDYQRGFNGDITERNISLYGAHDNFLSQQAQKGAIMNSRVELNGVLQDPDMLRRPDSADFFEKYIDNGLVTGAIPSDAQATQLISQAFSDASSRAGGADFLMRVGDKKVTLNGATTTYRELIGEEQWNALMVTAQRSQFETDAKLNEQYRLKINSALNQEDPRTAWEMLQGIKAELDKVQPDEQMTPQREWLISAQEQVQNQMNAWTKAQAKALDDSMKSMNKLDVIDKQFQKRINGEWVSTDFKDMPVNENTGEFKHSDMVNYANKKLAEIDSMDIPDGAKDAMKLKYLQADSKDGAFRTAIGTMVTDAGQEWSAAVINGKLPERTPAMDALRRIRNADPQLIAALYPDQAELFLTMDMMDKQGIDPQVILDADRLTVKRSKEQRFEDDKAFESALNASKAPEIARMPASLRESARKIYDSVKYRSGNESMAMEQMTKFLKESTYTFTGDDVDGDTVGVIPKNMMQVNSDPKSWEQGRDILEEARKGIIASNPWITNKQLTMYSQGDSIYLMDTTGQVRVRYDKELLSKVWSENQKKLEEKAREKALADVNKRAPIVAATKAREAAAKRVREKRKQTPKFIYGRKE 782 T 0.15 DUF4404 pdbpercent T Viruses T 6yt5 1 A,B,C,D,E,F A,B,C,D,E,F GP15_BPT7 GENE PRODUCT 15,GP15 MRGSHHHHHHGMASMTGGQQMGRDLYDDDDKDPSSMSKIESALQAAQPGLSRLRGGAGGMGYRAATTQAEQPRSSLLDTIGRFAKAGADMYTAKEQRARDLADERSNEIIRKLTPEQRREALNNGTLLYQDDPYAMEALRVKTGRNAAYLVDDDVMQKIKEGVFRTREEMEEYRHSRLQEGAKVYAEQFGIDPEDVDYQRGFNGDITERNISLYGAHDNFLSQQAQKGAIMNSRVELNGVLQDPDMLRRPDSADFFEKYIDNGLVTGAIPSDAQATQLISQAFSDASSRAGGADFLMRVGDKKVTLNGATTTYRELIGEEQWNALMVTAQRSQFETDAKLNEQYRLKINSALNQEDPRTAWEMLQGIKAELDKVQPDEQMTPQREWLISAQEQVQNQMNAWTKAQAKALDDSMKSMNKLDVIDKQFQKRINGEWVSTDFKDMPVNENTGEFKHSDMVNYANKKLAEIDSMDIPDGAKDAMKLKYLQADSKDGAFRTAIGTMVTDAGQEWSAAVINGKLPERTPAMDALRRIRNADPQLIAALYPDQAELFLTMDMMDKQGIDPQVILDADRLTVKRSKEQRFEDDKAFESALNASKAPEIARMPASLRESARKIYDSVKYRSGNESMAMEQMTKFLKESTYTFTGDDVDGDTVGVIPKNMMQVNSDPKSWEQGRDILEEARKGIIASNPWITNKQLTMYSQGDSIYLMDTTGQVRVRYDKELLSKVWSENQKKLEEKAREKALADVNKRAPIVAATKAREAAAKRVREKRKQTPKFIYGRKE 782 T 0.15 DUF4404 pdbpercent T Viruses T 6yvh 2 C,J,K,L C,I,E,G CWC27_HUMAN ANTIGEN NY-CO-10,PROBABLE INACTIVE PEPTIDYL-PROLYL CIS-TRANS ISOMERASE CWC27 HOMOLOG,PPIASE CWC27,SEROLOGICALLY DEFINED COLON CANCER ANTIGEN 10 RSMGTSREDQTLALLNQFKSKLTQAIAETPENDIPETEVEDDEGWMSHVLQFEDKSR 57 T 11 DUF4407 unppssm F Eukaryota T 6yw1 2 B B PHD2-SPECIFIC RaPID CYCLIC PEPTIDE 3C (14-MER) XXVWLTDTWVLSRT 14 T 0.21 DUF4571 pdbhh F T 6yw2 2 B B PHD2-SPECIFIC RaPID CYCLIC PEPTIDE 3C (14-MER) XXVWLTDTWVLSRT 14 T 0.21 DUF4571 pdbhh F T 6yw3 2 B B PHD2-SPECIFIC RaPID CYCLIC PEPTIDE 3C (14-MER) XXVWLTDTWVLSRT 14 T 0.21 DUF4571 pdbhh F T 6yw4 2 B B PHD2-SPECIFIC RaPID CYCLIC PEPTIDE 3C (14-MER) XXVWLTDTWVLSRT 14 T 0.21 DUF4571 pdbhh F T 6yw5 22 V VV Q7SHR9_NEUCR mS26 MAPTLARPSLSGVQFILSSPTTTCAATSVVTRAIAARSFSTTRSARDSVSIPPDSPNYIKVPEPPQSSEVRHPFVKGHLPIPRSIFPKKGVPEKVQSGYVNRIAPKSAAELAGLPPKSKQESWRRKMAEARRQSLEAGLQGLWQRKVKRDQKQAKESKARYLANKRAAQAPERLDEVFTRATIRESTAKNTFVPLDPEAFVKAEEARIKHAEKEAMKSEARRDAVVQLYVASKNFIVDEKELEEHVNKHFTEKIHNAGLWESGRSIWDSQKNPISMRELRNEFSGFNDRVTATTSAAVKTTVRQKNVAEELTGGKL 316 T 0.17 T2SSE_N unppercent F Eukaryota T 6yw5 23 W WW Q7RYW7_NEUCR mS27 MAGRAPQHALRVGCRAVPEALSKPAQQSRCLSSTVPRQATYPVVSFNKTSSPELKEALETLREKVILPTYLPPELRQKIFNKKYEKELAHDPVTIQIDGQPQRFSYINMLTDMPNTPKNIRAALLSMKNGGDFANLSGLLEGMHRANRKLPYWLSAQIVRKACKAGHLQLILNMVRDVKRTGFTLERHETVNELLFWIQRFAWKSDYSEPETRKALREVQEILDALEGDERHMSKDRKRQQALTRFPYHRDPQFLAARLNLTAELAARRAVTGQTSEQQLNSANDVKNLVKYAEQLVRLWPADKALLDMYTDEAYVARVDLRYLIKPQVHLRYASFTLQALKNAAKIVGQLGHGPLAAQLINRAAAVEAESQLAYAKVDDGMAGQKIYEMVVGGKK 396 T 0.52 Chloroplast_duf pdbpercent F Eukaryota T 6yw5 27 AA 11 Q7S4Y4_NEUCR 37S ribosomal protein mrp10, mitochondrial MPNKPIRLPPLKQLRVRQANKAEENPCIAVMSSVLACWASAGYNSAGCATVENALRACMDAPKPAPKPNNTINYHLSRFQERLTQGKSKK 90 T 0.00011 CHCH pdbpercent F Eukaryota T 6yw5 32 GA 77 Q7SG49_NEUCR mS46 MNRQVVTSTLGRRGVASTILNAQQQQRPFSSTTTRCAAEDDSKKPAAAPSTPRAAAPGPISASRQKSEAAVGKLTQLRGSFTSLTNDNSFHKTLPAGARDARRLAAAPIAGKGAGAGAVAPLGGGGGASGAPKVINVRSLKGTLGSRGSNNIPGAVAPGAALRPRFAAGPGAAAGRPRFGAAASPGAGPTGAARRPPFGARRARPAGDKKRSGGSGDKRPRGDDYDAPPTEEEKAFLRGLEQGKVTEYVPKLTPDTLLGYGPPVATDAALGKVESAMRTMRILGGGLPFNDQSGVTSDPTAIKHRYVHEKKPVFFSSVEEKEWVRESLDKFAVSEGPEKKTKQKILETSVLGKYEEPKYVESLTETVKMVEKYQGGTFSYAPSDADKFNKKLNQLLAAGLPRAAPAPAQAQKKA 414 T 0.072 UPF0164 pdbpercent F Eukaryota T 6yw7 7 G C ARC1A_HUMAN SOP2-LIKE PROTEIN MSLHQFLLEPITCHAWNRDRTQIALSPNNHEVHIYKKNGSQWVKAHELKEHNGHITGIDWAPKSDRIVTCGADRNAYVWSQKDGVWKPTLVILRINRAATFVKWSPLENKFAVGSGARLISVCYFESENDWWVSKHIKKPIRSTVLSLDWHPNNVLLAAGSCDFKCRVFSAYIKEVDEKPASTPWGSKMPFGQLMSEFGGSGTGGWVHGVSFSASGSRLAWVSHDSTVSVADASKSVQVSTLKTEFLPLLSVSFVSENSVVAAGHDCCPMLFNYDDRGCLTFVSKLDIPKQSIQRNMSAMERFRNMDKRATTEDRNTALETLHQNSITQVSIYEVDKQDCRKFCTTGIDGAMTIWDFKTLESSIQGLRIM 370 T 0.00016 WD40 pdb F Eukaryota T 6ywc 3 C,F C,F De novo design 4E1H_95 MKYFDCTVSGERGIIKTYGIQLPEEALKEHVREYVEKLREGSAITITCTAGDRVFKFKDKVGSWGSHHHHHH 72 T 0.31 DUF3577 pdbhh F T 6ywd 3 C C De novo designed protein 4H_01 MEVERELRNWLSEVLSKINDAPVTNDIKKAISNQVLKVAEQVWNGHSKEELQERVRKEVCSVCSNVPACWAICGGLLEVVKYQGSHHHHHH 91 T 0.0054 Glycoprotein_G pdbhh F T 6ywe 9 I f Q6M9C4_NEUCS Related to ribosomal protein YmL11, mitochondrial MSLRLSRPAVRGLGSAIKSSRISSRSAALLVPSTSTSAFSTASPQRAAAAGHLRLPDDYVPPTQPPSARPVDTRKSQLLRTYTSMLRSTPLMLIFQHNNLTAIEWAAIRRELSLALSNVPVPEGAPDITSKIHLQVVRTRIFDVALKTVEFFDPSTVEPTTATTATGTKVPATYNHDLSKHAWKAVKEATKNTEAVEKTVYGQLAPLLVGPVAILTLPSVSPAHLGAALSVLAPSPPAFPAPSRKKNPGYYDLTCQSGLQKLLLVGGRIEGKAFDYDGIKWVGGIENGIEGLRAQLVHMLQSAGMGLTSVLEGAGKSLWLTMESRRSVLEEEQNPKKEGEGEEEKKE 347 T 8.5E-09 Ribosomal_L10 pdbhh F Eukaryota T 6ywe 68 PB VV Q7SHR9_NEUCR mS26 MAPTLARPSLSGVQFILSSPTTTCAATSVVTRAIAARSFSTTRSARDSVSIPPDSPNYIKVPEPPQSSEVRHPFVKGHLPIPRSIFPKKGVPEKVQSGYVNRIAPKSAAELAGLPPKSKQESWRRKMAEARRQSLEAGLQGLWQRKVKRDQKQAKESKARYLANKRAAQAPERLDEVFTRATIRESTAKNTFVPLDPEAFVKAEEARIKHAEKEAMKSEARRDAVVQLYVASKNFIVDEKELEEHVNKHFTEKIHNAGLWESGRSIWDSQKNPISMRELRNEFSGFNDRVTATTSAAVKTTVRQKNVAEELTGGKL 316 T 0.17 T2SSE_N unppercent F Eukaryota T 6ywe 69 QB WW Q7RYW7_NEUCR mS27 MAGRAPQHALRVGCRAVPEALSKPAQQSRCLSSTVPRQATYPVVSFNKTSSPELKEALETLREKVILPTYLPPELRQKIFNKKYEKELAHDPVTIQIDGQPQRFSYINMLTDMPNTPKNIRAALLSMKNGGDFANLSGLLEGMHRANRKLPYWLSAQIVRKACKAGHLQLILNMVRDVKRTGFTLERHETVNELLFWIQRFAWKSDYSEPETRKALREVQEILDALEGDERHMSKDRKRQQALTRFPYHRDPQFLAARLNLTAELAARRAVTGQTSEQQLNSANDVKNLVKYAEQLVRLWPADKALLDMYTDEAYVARVDLRYLIKPQVHLRYASFTLQALKNAAKIVGQLGHGPLAAQLINRAAAVEAESQLAYAKVDDGMAGQKIYEMVVGGKK 396 T 0.52 Chloroplast_duf pdbpercent F Eukaryota T 6ywe 73 UB 11 A0A0B0E339_NEUCS 37S ribosomal protein mrp10, mitochondrial MPNKPIRLPPLKQLRVRQANKAEENPCIAVMSSVLACWASAGYNSAGCATVENALRACMDAPKPAPKPNNTINYHLSRFQERLTQGKSKK 90 T 0.00011 CHCH pdbpercent F Eukaryota T 6ywe 78 AC 77 A0A0B0DYB0_NEUCS mS46 MNRQVVTSTLGRRGVASTILNAQQQQRPFSSTTTRCAAEDDSKKPAAAPSTPRAAAPGPISASRQKSEAAVGKLTQLRGSFTSLTNDNSFHKTLPAGARDARRLAAAPIAGKGAGAGAVAPLGGGGGASGAPKVINVRSLKGTLGSRGSNNIPGAVAPGAALRPRFAAGPGAAAGRPRFGAAASPGAGPTGAARRPPFGARRARPAGDKKRSGGSGDKRPRGDDYDAPPTEEEKAFLRGLEQGKVTEYVPKLTPDTLLGYGPPVATDAALGKVESAMRTMRILGGGLPFNDQSGVTSDPTAIKHRYVHEKKPVFFSSVEEKEWVRESLDKFAVSEGPEKKTKQKILETSVLGKYEEPKYVESLTETVKMVEKYQGGTFSYAPSDADKFNKKLNQLLAAGLPRAAPAPAQAQKKA 414 T 0.072 UPF0164 pdbpercent F Eukaryota T 6yws 8 H f Q7RZ62_NEUCR Uncharacterized protein MSLRLSRPAVRGLGSAIKSSRISSRSAALLVPSTSTSAFSTASPQRAAAAGHLRLPDDYVPPTQPPSARPVDTRKSQLLRTYTSMLRSTPLMLIFQHNNLTAIEWAAIRRELSLALSNVPVPEGAPDITSKIHLQVVRTRIFDVALKTVEFFDPSTVEPTTATTATGTKVPATYNHDLSKHAWKAVKEATKNTEAVEKTVYGQLAPLLVGPVAILTLPSVSPAHLGAALSVLAPSPPAFPAPSRKKNPGYYDLTCQSGLQKLLLVGGRIEGKAFDYDGIKWVGGIENGIEGLRAQLVHMLQSAGMGLTSVLEGAGKSLWLTMESRRSVLEEEQNPKKEGEGEEEKKE 347 T 8.5E-09 Ribosomal_L10 pdbhh F Eukaryota T 6ywv 7 G f Q7RZ62_NEUCR Uncharacterized protein MSLRLSRPAVRGLGSAIKSSRISSRSAALLVPSTSTSAFSTASPQRAAAAGHLRLPDDYVPPTQPPSARPVDTRKSQLLRTYTSMLRSTPLMLIFQHNNLTAIEWAAIRRELSLALSNVPVPEGAPDITSKIHLQVVRTRIFDVALKTVEFFDPSTVEPTTATTATGTKVPATYNHDLSKHAWKAVKEATKNTEAVEKTVYGQLAPLLVGPVAILTLPSVSPAHLGAALSVLAPSPPAFPAPSRKKNPGYYDLTCQSGLQKLLLVGGRIEGKAFDYDGIKWVGGIENGIEGLRAQLVHMLQSAGMGLTSVLEGAGKSLWLTMESRRSVLEEEQNPKKEGEGEEEKKE 347 T 8.5E-09 Ribosomal_L10 pdbhh F Eukaryota T 6ywx 8 H f Q7RZ62_NEUCR uL10m MSLRLSRPAVRGLGSAIKSSRISSRSAALLVPSTSTSAFSTASPQRAAAAGHLRLPDDYVPPTQPPSARPVDTRKSQLLRTYTSMLRSTPLMLIFQHNNLTAIEWAAIRRELSLALSNVPVPEGAPDITSKIHLQVVRTRIFDVALKTVEFFDPSTVEPTTATTATGTKVPATYNHDLSKHAWKAVKEATKNTEAVEKTVYGQLAPLLVGPVAILTLPSVSPAHLGAALSVLAPSPPAFPAPSRKKNPGYYDLTCQSGLQKLLLVGGRIEGKAFDYDGIKWVGGIENGIEGLRAQLVHMLQSAGMGLTSVLEGAGKSLWLTMESRRSVLEEEQNPKKEGEGEEEKKE 347 T 8.5E-09 Ribosomal_L10 pdbhh F Eukaryota T 6ywx 65 MB VV Q7SHR9_NEUCR mS26 MAPTLARPSLSGVQFILSSPTTTCAATSVVTRAIAARSFSTTRSARDSVSIPPDSPNYIKVPEPPQSSEVRHPFVKGHLPIPRSIFPKKGVPEKVQSGYVNRIAPKSAAELAGLPPKSKQESWRRKMAEARRQSLEAGLQGLWQRKVKRDQKQAKESKARYLANKRAAQAPERLDEVFTRATIRESTAKNTFVPLDPEAFVKAEEARIKHAEKEAMKSEARRDAVVQLYVASKNFIVDEKELEEHVNKHFTEKIHNAGLWESGRSIWDSQKNPISMRELRNEFSGFNDRVTATTSAAVKTTVRQKNVAEELTGGKL 316 T 0.17 T2SSE_N unppercent F Eukaryota T 6ywx 66 NB WW Q7RYW7_NEUCR mS27 MAGRAPQHALRVGCRAVPEALSKPAQQSRCLSSTVPRQATYPVVSFNKTSSPELKEALETLREKVILPTYLPPELRQKIFNKKYEKELAHDPVTIQIDGQPQRFSYINMLTDMPNTPKNIRAALLSMKNGGDFANLSGLLEGMHRANRKLPYWLSAQIVRKACKAGHLQLILNMVRDVKRTGFTLERHETVNELLFWIQRFAWKSDYSEPETRKALREVQEILDALEGDERHMSKDRKRQQALTRFPYHRDPQFLAARLNLTAELAARRAVTGQTSEQQLNSANDVKNLVKYAEQLVRLWPADKALLDMYTDEAYVARVDLRYLIKPQVHLRYASFTLQALKNAAKIVGQLGHGPLAAQLINRAAAVEAESQLAYAKVDDGMAGQKIYEMVVGGKK 396 T 0.52 Chloroplast_duf pdbpercent F Eukaryota T 6ywx 70 RB 11 Q7S4Y4_NEUCR 37S ribosomal protein mrp10, mitochondrial MPNKPIRLPPLKQLRVRQANKAEENPCIAVMSSVLACWASAGYNSAGCATVENALRACMDAPKPAPKPNNTINYHLSRFQERLTQGKSKK 90 T 0.00011 CHCH pdbpercent F Eukaryota T 6ywx 75 XB 77 Q7SG49_NEUCR mS46 MNRQVVTSTLGRRGVASTILNAQQQQRPFSSTTTRCAAEDDSKKPAAAPSTPRAAAPGPISASRQKSEAAVGKLTQLRGSFTSLTNDNSFHKTLPAGARDARRLAAAPIAGKGAGAGAVAPLGGGGGASGAPKVINVRSLKGTLGSRGSNNIPGAVAPGAALRPRFAAGPGAAAGRPRFGAAASPGAGPTGAARRPPFGARRARPAGDKKRSGGSGDKRPRGDDYDAPPTEEEKAFLRGLEQGKVTEYVPKLTPDTLLGYGPPVATDAALGKVESAMRTMRILGGGLPFNDQSGVTSDPTAIKHRYVHEKKPVFFSSVEEKEWVRESLDKFAVSEGPEKKTKQKILETSVLGKYEEPKYVESLTETVKMVEKYQGGTFSYAPSDADKFNKKLNQLLAAGLPRAAPAPAQAQKKA 414 T 0.072 UPF0164 pdbpercent F Eukaryota T 6ywy 8 H f Q6M9C4_NEUCS Related to ribosomal protein YmL11, mitochondrial MSLRLSRPAVRGLGSAIKSSRISSRSAALLVPSTSTSAFSTASPQRAAAAGHLRLPDDYVPPTQPPSARPVDTRKSQLLRTYTSMLRSTPLMLIFQHNNLTAIEWAAIRRELSLALSNVPVPEGAPDITSKIHLQVVRTRIFDVALKTVEFFDPSTVEPTTATTATGTKVPATYNHDLSKHAWKAVKEATKNTEAVEKTVYGQLAPLLVGPVAILTLPSVSPAHLGAALSVLAPSPPAFPAPSRKKNPGYYDLTCQSGLQKLLLVGGRIEGKAFDYDGIKWVGGIENGIEGLRAQLVHMLQSAGMGLTSVLEGAGKSLWLTMESRRSVLEEEQNPKKEGEGEEEKKE 347 T 8.5E-09 Ribosomal_L10 pdbhh F Eukaryota T 6ywy 65 MB VV Q7SHR9_NEUCR mS26 MAPTLARPSLSGVQFILSSPTTTCAATSVVTRAIAARSFSTTRSARDSVSIPPDSPNYIKVPEPPQSSEVRHPFVKGHLPIPRSIFPKKGVPEKVQSGYVNRIAPKSAAELAGLPPKSKQESWRRKMAEARRQSLEAGLQGLWQRKVKRDQKQAKESKARYLANKRAAQAPERLDEVFTRATIRESTAKNTFVPLDPEAFVKAEEARIKHAEKEAMKSEARRDAVVQLYVASKNFIVDEKELEEHVNKHFTEKIHNAGLWESGRSIWDSQKNPISMRELRNEFSGFNDRVTATTSAAVKTTVRQKNVAEELTGGKL 316 T 0.17 T2SSE_N unppercent F Eukaryota T 6ywy 66 NB WW Q7RYW7_NEUCR mS27 MAGRAPQHALRVGCRAVPEALSKPAQQSRCLSSTVPRQATYPVVSFNKTSSPELKEALETLREKVILPTYLPPELRQKIFNKKYEKELAHDPVTIQIDGQPQRFSYINMLTDMPNTPKNIRAALLSMKNGGDFANLSGLLEGMHRANRKLPYWLSAQIVRKACKAGHLQLILNMVRDVKRTGFTLERHETVNELLFWIQRFAWKSDYSEPETRKALREVQEILDALEGDERHMSKDRKRQQALTRFPYHRDPQFLAARLNLTAELAARRAVTGQTSEQQLNSANDVKNLVKYAEQLVRLWPADKALLDMYTDEAYVARVDLRYLIKPQVHLRYASFTLQALKNAAKIVGQLGHGPLAAQLINRAAAVEAESQLAYAKVDDGMAGQKIYEMVVGGKK 396 T 0.52 Chloroplast_duf pdbpercent F Eukaryota T 6ywy 70 RB 11 A0A0B0E339_NEUCS 37S ribosomal protein mrp10, mitochondrial MPNKPIRLPPLKQLRVRQANKAEENPCIAVMSSVLACWASAGYNSAGCATVENALRACMDAPKPAPKPNNTINYHLSRFQERLTQGKSKK 90 T 0.00011 CHCH pdbpercent F Eukaryota T 6ywy 75 XB 77 A0A0B0DYB0_NEUCS mS46 MNRQVVTSTLGRRGVASTILNAQQQQRPFSSTTTRCAAEDDSKKPAAAPSTPRAAAPGPISASRQKSEAAVGKLTQLRGSFTSLTNDNSFHKTLPAGARDARRLAAAPIAGKGAGAGAVAPLGGGGGASGAPKVINVRSLKGTLGSRGSNNIPGAVAPGAALRPRFAAGPGAAAGRPRFGAAASPGAGPTGAARRPPFGARRARPAGDKKRSGGSGDKRPRGDDYDAPPTEEEKAFLRGLEQGKVTEYVPKLTPDTLLGYGPPVATDAALGKVESAMRTMRILGGGLPFNDQSGVTSDPTAIKHRYVHEKKPVFFSSVEEKEWVRESLDKFAVSEGPEKKTKQKILETSVLGKYEEPKYVESLTETVKMVEKYQGGTFSYAPSDADKFNKKLNQLLAAGLPRAAPAPAQAQKKA 414 T 0.072 UPF0164 pdbpercent F Eukaryota T 6ywy 81 EC cc Poly-Peptide AAAAAAA 7 T 270 DUF4179 pdbhh F F 6yx0 2 C,D C,D PWQ-THR-ARG-LEU TRL 3 T 580 40S_S4_C pdbhh F F 6yx2 2 C,D C,D PWW-THR-ARG-LEU TRL 3 T 580 40S_S4_C pdbhh F F 6yxm 1 A BBB CII-C-39-CIT LPGQXGERG 9 T 4.4 DotA pdbhh F T 6yxq 1 A A ASCC3_HUMAN ASC-1 COMPLEX SUBUNIT P200,ASC1P200,HELICASE,ATP BINDING 1,TRIP4 COMPLEX SUBUNIT P200 GAEFMALPRLTGALRSFSNVTKQDNYNEEVADLKIKRSKLHEQVLDLGLTWKKIIKFLNEKLEKSKMQSINEDLKDILHAAKQIVGTDNGREAIESGAAFLFMTFHLKDSVGHKETKAIKQMFGPFPSSSATAACNATNRIISHFSQDDLTALVQMTEKEHGDRVFFGKNLAFSFDMHDLDHFDELPINGETQKTISLDYKKFLNEHLQEA 211 T 0.015 SNase pdbpercent F Eukaryota T 6yxq 2 B B ASCC2_HUMAN ASC-1 COMPLEX SUBUNIT P100,TRIP4 COMPLEX SUBUNIT P100 GAMAMPALPLDQLQITHKDPKTGKLRTSPALHPEQKADRYFVLYKPPPKDNIPALVEEYLERATFVANDLDWLLALPHDKFWCQVIFDETLQKCLDSYLRYVPRKFDEGVASAPEVVDMQKRLHRSVFLTFLRMSTHKESKDHFISPSAFGEILYNNFLFDIPKILDLCVLFGKGNSPLLQKMIGNIFTQQPSYYSDLDETLPTILQVFSNILQHCGLQGDGANTTPQKLEERGRLTPSDMPLLELKDIVLYLCDTCTTLWAFLDIFPLACQTFQKHDFCYRLASFYEAAIPEMESAIKKRRLEDSKLLGDLWQRLSHSRKKLMEIFHIILNQICLLPILESSCDNIQGFIEEFLQIFSSLLQEKRFLRDYDALFPVAEDISLLQQASSVLDETRTAYILQAVESAWEGVDRRKATDAKDPSVIEEPNGEPNGVTVTA 438 T 0.18 DUF325 pdbpssm F Eukaryota T 6yxx 2 B E1 Q57WG6_TRYB2 mt-LAF21 MWHSSLRYVSFKRLPFGRRSTSGGVNFNKGLLTDRERGDPFTEPHAYRNKKSIAAISKVAKKQDILLREEKQRKELDKIQSGYVTERELHIGCDKPLGGNANEIARVIDEQALISPTPGEKCSTALRELMENEVDRRNHMMDKFGQPVGAREFHRLFKELRHADNEAETIERHQTRLVEEYGVYPSLRLDAYMLDDDTYFPEWVNALPYSIRDRVKFGSLGLTEKDEALRVTLGRMPLDRRRREWERLKKAKEYKAAKEETLTLAELRDARQGKRRFHWLQRKRQKRASILRRLALRKPDAFELWPSRVVDYSQRIAFIAQHVENGLDTKGQWPLDPEELARARVRRSKEEAERTFLMSAEEKRAHKKLSGRSGDGSISEMLQSLEVPDKPFKRLSRKVYANRVNAIVHGDQDEYGRRYRKMETRSKRRMRPYASLGEIGLENELRKEPRINAKGLNNTDDEDWPRHTKSWGDGMPSMRYGS 482 T 0.016 Ten_N pdb F Eukaryota T 6yxx 8 H A5 Q584F4_TRYB2 bL32m MFRRTFFTPMIAQPTLLMLGNKGGTPKRKKNPMQLRRKVYGLHFKEKYLKMEEWYYCPLCAEPKKPGEWCRREDCRQIKP 80 T 0.089 HVO_2753_ZBP pdbhh F Eukaryota T 6yxx 9 I E5 mt-LAF25 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 376 F F F 6yxx 11 K A8 D0A1K1_TRYB9 bL35m MGSEESNNICAYKRTISLAKIYIVLLVKTAMLRYSRLCFPKMGCEEITRKARRVQLQPTEYLAQHRMQVWQLRFKEMGPPFSRVWVALGGKMRRRRVGRQVDVKDMRYYWRPIEPQYQRLYMSRLRIRDHSNKLRQPMRLRATNADIGSGSSSIEWERASNRKYGAMLAPPKRQDFEFRVV 181 T 0.13 Cytochrom_B559a pdbpssm F Eukaryota T 6yxx 13 M BA D0A5V6_TRYB9 mL67 MLRRFALTSSVALRLRFERDSGHNTVRYRPIPESMQPKHLEDNFTPFPLPKFDESLEYGPVRLRNIPDIEAAKERRRGSRLAATEVLLQETLQEENQFATSGKGDGNMAIAITERHTEDVTTPAADSRFPSQTMSPCSHEEEMRGYVVSRDYPLIDRLHCTRSIEELVAQFEDRPQIESRVAALADMASTVSFRSDEELLRMFTAISAPFSVDGRGLNFLTVKVSKFGRPYYVPNSLLPAYVNLVDATTIALVREQPWRLSASPALFIQVLQFMALIKVFEPNKWFTFSDHAPSNRADYRHAIGVNHSTAFWGTGEELYDFMVELLRVEDDGRIPTMLDLCTREQMVDLLSGFCGVMPCGKAVGDVFKTITDAFLRRVRNDISGPWSAHDWAIVERMYLVTVLCDAGNNEILQLLLSDTASPRGPDFFAAVSRTKDTPTKKRALCLLQEAIDNASAKADKVTLLGLLESGSEFLLSLVDKGVAHTFATQNLFDYRILNSFLHCSLVADRLRVEQSVITSLIPSSLRDVQVQMLMSNERNALNPLTSSLPGNSGAIATAPKLKRPLMTMLSQLEYLNSIDSVFILHSSLMATSTDQLVSAVRRLPSGKDSLIVTMSCLRALSVKSLTSPSMKERIACARALEIVSYELEKGRAVLLPFSEEILLHDAGAYCDEDLMLWTVAAFLARELPLVKVHTLMHSNCTARTPYRFLKGGHNLLVSSRSLYDKGAPLLSSLHSKELRLVTHNVRLRTPVRDRKCTLQYYNPIRARFVYRRDKPLFDKYHVTARNLAPGFSRGALKHDWRALGVYTPDHPQVPYHPLQTWMLGETTVEAE 831 T 0.047 DUF5642 pdb F Eukaryota T 6yxx 15 O,TB UA,Uf UNK XXXXXXXXXX 10 F F F 6yxx 16 P BB D0A135_TRYB9 mL68 MLYTRRLMTTGGSATADGAVSYSKGSYHIVPKKYTVGKRIAVRSYLDRNRTELSDRTYMPQKAWFEPYTPKKFDMEHQRISHNFYNLETKLIWTAFDTPELIGILLHDETIKGAPHLYDAEFLESAVHWTRESRYWRCIGITKPFYNKTTLRAQCWHDRGLQVGTLVFSQAMRDALMDLERAVRRKELGLEPNYVWDRWGPVGFIDGARTDHLPRFAHNPYVDPDGVEVTEVDIAPFNTHEQIKERYGAFIDPDLRPFEGVFRAPSHGALTLDDVPHQEAVRLYRDLMEKADMPVMLGNGAEIPPMDMRALFHLSANPERMKAASELSSWREVRGMLAPVQEVCDEKVEALRLMENTRHDAARVRTFYEEKCGFSDFMRTPDKVITAAVLCYLQELQRICTETDWGKPLARCLTDLERVNVMGKDAFLVYRHIEDAILDKKRRVWATRFAGEANEESTLDYLLENFGRRTEQTRNVGTTGTEFDREQEPIGRQVQRRVLDSDKASKLAEVRQKRGKMWSNKKSVFDSLHQKQLQNVTYGVH 541 T 4.7 VirE_N pdbhh F Eukaryota T 6yxx 19 S BD mL70 MCLKRKAPHLFCFCLWSIFLSFRCFCFRSYAIMLPLLSFPTIQISIFLSFKLPITTFLLSPCFVFVFVFAIRYCGELTLNAQLVLFLLYHCAQTQRGPLKEGEMPICPGLCGELAAVPFRVFLGTLPTLAVEERFLRQLQPVFAWYSSRKRVKEQANEFIEIDLASCDAELLLRYSHIYYVRRQLFDELIERQMTLLDSGKAPKMAEPSLLQCLAGCNMTIADRLQLEIRQLGAAKRAASVPGRRELDPVARLEVYDYACMMRLVEEDAGAVGDAEMKARAYLPREVIESKLGHLTQLLLGSDARAALDKKDVKLLNRMIPPDYTRVGCVEKLRPFDVTAYFRFYGERINNVKVENYFKRALWGHVYRRFATTPSFLSGVSTYWARHSGLDASFTTTTMPQEVAVAVCDQQIQFPAIKFRAQYVYTSPETARQLWRTDAAVPLMRLFPLMGSRTAEDLAAGVLTDAFWMHLGLSEEENLLQDSLLLKVRRFVDEVGDMYETNIDSVLKRVDDNFKQVVPQLKAEDLQVDAPLQDGEGEDTVRETVAA 547 T 0.26 MGAT2 unppssm F T 6yxx 22 V BE Q57WG1_TRYB2 mL71 MLFRSVSCKNYQRGGWSPGSKHQKHMTLNPTLYLYRFPGPHGPGPYTMKYWWTLGCFPTGMEVPFRLHEFLSTYQQEHVPVEVEEWLRCYIKDPLSELVNASNDFFKAVEVYPEVESARGYKTLQPSIAPLLVPMKKFEEQLGVKISPVGLRSVLSNPVLKDRFLDDLFDYKSYVEKGGSTPHRRLARSRFEGSLSVLGECEKCLPEQHQVEISESLGTFIGATVSPAETTADDERSLILLLTTISEGCINAGNYSDAASVLADALMFCHDPDSQATTHANISFASLLNADFKGAEYNGREAALLQPQVKPTSTACARGYVGWAAAAAYQDDFEKAEAIVKDGLTLYVGNEHLEKLANKLQALREEQPSVYKQVPRSLRESRSHLPSQQSRGLLSGSGKGFSNEFDWVEFKNKLYPSKMDPRNNEMGSVFRRVGDLGSFISTSRSMERL 449 T 2.4E-05 TPR_21 pdbhh F Eukaryota T 6yxx 27 AA EG Q387S8_TRYB2 mt-LAF7 MPNIKGGVGSFLMRRAAPKSIRQKYQTGPQFYKRKFFQFQKGHHRLHRRISGVQTGSPTHQREYERFHHLPGDVRTRPQFDFTFGETRADRVMFAWRKRGDLQLYQMSGRGETFVCYRCGYPVRSQLVAVKADNWDYRMCYRCYTNTVHRGMENDT 156 T 0.042 Ring_hydroxyl_B unppercent F Eukaryota T 6yxx 28 BA BH Q38AM5_TRYB2 mL74 MLVPGLSLTRRAVTSSCCRPLHVVRGFSTTCTLFGLEQLQDVPTSTSRRPTGLHRGPGKRQTSEREAAQYKFIRRWELQMRDEWDQLEPFKGLPKPKRQFGNEAAEVIWPYALLLERVVKVHPFTKSIYVYYAQRQSTARGKLAAEIARSFAREFLIPITFHNSQVYTEAEMLLEYSETPWVVLHSLDNGQKPRILPVAPVEGTPAHTAVEQLLAEVVQGCEALGASVADPVTATRVLNERPLQNQYVRVDYQWFGDTPDERASHLVRWEFEPEQIEPKIRHRTRHVLDWLNYDGNLPTHRAVHVNAMREKARQKAPRTVAGPRTFYNSAGSRANARSSRFGGQAAVGK 349 T 0.023 L51_S25_CI-B8 pdbhh F Eukaryota T 6yxx 29 CA EH A0A1G4IEQ9_TRYEQ mt-LAF8 MKSVFDIARSHVTFPVSRDGTALRRVLKDWLDYTECQSLQAKPAFPAELCITVHPSVKSMSRVYTQGDPKMSGEVCRRPGEAASLSYTKRVRLLLWSAVLPWEVQRGLSMSVLEPPGNGGSVLGEVGVRHGLGGSGGAVDGGSAVATKGWKEVEDSLGVVTDPTTXTAADFVSGPQCSTLGASIIPGMLFVMSPESAAQGLCFWSGAIRQPIDIAFIAPVEPPAADTPSFSELRQRRLQLSLEGYDISRFFPDGELESPTVTFAVQSHSYLDPFPDCEQRDQQVGCGSSRERNKGIEDSRRYTATPEGVGRGNENVRYVLETRRNLLRDSIRSALRECRCTHGGVVWASNTGCAADGNPTGTGECDVEVTISLTLSDELKEDLREKARLYTNYVVPLEGHVRRHIKCLSGISPHPKGDATDGSDALQQKGTEAVWGEEGCVCTNGSAPFVAPPPIIKPPLPVKVGTLASTRPRSPMLADEAEGRPTRLAPSVFGRHDAPALQRAQQECNQLISASALARIPNTSPRAPEIPPIDYEIFDLCLRLGLCQSEAIYYFYGRIMREWSKELRRLRAAKSHGEGGVNDGNDMVLREEDVHRMLRLVHDPSLQVPPELSACVEAVASLRKITNEVGVPVV 634 T 0.45 DUF192 pdbhh F Eukaryota T 6yxx 32 FA,FC UI,Ur UNK XXXXXXXXXXXXXXXXXXXXX 21 F F F 6yxx 33 GA BJ Q383M2_TRYB2 mL76 MLRLSSWNLKSQHHNVLRRSRPHIHKYRELNRWQRQAQGISKWDQSHSHRPLPYVERFNPESVGLTRGTSAFAWKWWHTQYPWLPNVPPEAAQIDEAQKQERRSHRPPAWDDEFAKVVLNMNDAEIREYLMSKLTDVIFLETQRDGYELRRLDFEGKPLTSLPEPRIIENFVLEEETIRERVIYQVVEGVFRLSPTSADRRELRSVANIIDYVLTHVRAARPTDRERRQERPITSAALAVMQKCPIQPQLGFVHALPHDTRDALLQEWERMHHLDWQFGKAVYTPRSKENVRGNLTWLREDRHYDQRMKFMQEVESGEARAKHMKLIAEAAGN 333 T 1.8 KIX pdbpssm F Eukaryota T 6yxx 36 JA UK UNK XXXXXXXXXXXXXXXXXXXXXXXXX 25 F F F 6yxx 37 KA BL D0A4P5_TRYB9 mL78 MGASAGLIRRGGGVFPDAVSLTLTPSRRVYGGSGRGDLLYENPDARRHSGRALGVLNGVRHSSQATMPESGQLYYRKLILHSRPPNGSCAGLQRHCHDTCNWSYLIPSLHRCAESAISAKLWEKMCQLGLEDRSKAWVNLTQYERQRVRDGQNLYRYEVHQRLPLLEESIGWAQLDDLLGWFRSARRAWVRLPTSVTLSRESSEAGVASVVSPSSAMSCRLEGHADSRDTTPGRNQVFDTPERVEQLTEATVHRIREELQRLNRSERSDCEGSAAMRASARRLARDEELSRCVEEELGWHGVALQHRIPVPK 312 T 0.091 CAF1-p150_C2 pdb F Eukaryota T 6yxx 38 LA EL C9ZVC0_TRYB9 mt-LAF12 MRRLFITTASTLCHSLHCTDTRTGGAGKESTPTEVQCEMTLQCSDESGCSPFLSSLLSPVETVPLHDVTRTYSTMDVVDPPARYNPMVPNVEPSSSSAGHMEQXLENEEEEGPVACAHKNGKLWGVFEGSEDNKPPAWFYRLCKDLFYRTNSEDNMDDAALVSDIEPSHYISSTENRHVDGSDTTQRSAEAGTDVSDGVDPYVWIPFNLLDEADYHVGPYRFPSTATYTHEQRTLLCLGDTRREYVHFCDSYAFPGRAQIPTSVGTCPSKLYVNPKQQQPVVYIQLSNDIPPAMWLPVKGTAASVRRVLAEFASMAALHRDWHHDEFMERHATAVRMLELQRLPAGEGDILRYMAYDARNAQFAFAPIREFPNQQEFFLGEHDDPEKLMEHVDLCPLLFAIPHMRTVVDLHAEHMIPTIAGPGVATSLYRCIYSKALLFVQVHLSSEVKLPPQDPEAFKFMWKDSQVLPKMRIPVFVRVVWPTNERMSGGGGLLRRFNRLFGTEFASDIPVDAAMALLYVMQWSGHIKDFLGVRGMRQRLADLLLASQQPEPTKLYPGTREIPNPEYTVAERLGMHVQYLAQLHDPDISLTIQRLLPVASAPVRMGCAKAALIAGDRELFRHIVSSEPPGRMQTYMTKLVRKRKTRDLVDAEPRLLEDQYEFAAPLWTKRGKRLDSNTLEGVVEAQSRLSG 691 T 12 RNF152_C unphh F Eukaryota T 6yxx 40 NA UM UNK XXXXXXXX 8 F F F 6yxx 42 PA BN C9ZQF0_TRYB9 mL80 MCCLYTSDVFFWSSLVVSPLPRIVRCVSAPQIRSIPCGSGDFSVMKRSLIARWQSGVHTPHGVVYRGAKMKNWPEQRIPENFKFTEEQRFRTKAIPRDVGTIPRNFVLGVLYRHQPCEVGGLWEHCTNDPEIVLDSKRHLREVLKQAREEGFVTFERDAISNEWLCFLTRERYEEVQRIVTAKSEAVDTHSGLRGAAATETSTYAEKFREMNVEAKEAHARRLEEEVANTTRYLRRFQQREIDYLPYTDLNGKVNFMWWYETRDVQQRADEVLTDSSSSKALAGEEEHKAGSQLEATTASTS 302 T 0.023 DUF2203 pdbpssm F Eukaryota T 6yxx 43 QA EN C9ZPS0_TRYB9 mt-LAF14 MRRCGNDRAVALLRATTEAMRLVKLKLSPDRTRNEEIQDRQNAFVWSDEHIFRPHQHFTHDPCSWSRSLEQSMKKQRKLSMVERLRSLEQRQLEEKQSASATAGGSSKCANHMDGEKAEGPRFYGAVGDSEDLKEYVANEDYFYTMQQEEKPNDPPLQELVDEVQSLHVLLSSPRYEDTPLATVERLQCAYSEALRCVFDRVRNASVGKTMSCNALLFSWSLLLQGVPALLESLAEKRTEECLVRALSTVHEALNIVLQEFNRITHSKERVELLPLEGWIESLDVVTHPLTNKDYTSLKGNIRLPESSFKPQCKLDSATVEFVHSRAIQAAAIRMIENDQSDVETEPLDPYHLYILLRCMVRLAEKGVNDSHIHRAALLTGMVGERIFSSLERTVAPPRRYSLRHALLGKQLRDASKPHAIPLDVCAPPGGVKKPPTAADDVLLLTRACTLLMKVATNVLPQTKFKVLETVDTVLKTLSYAPNYDLSTADTVIFSNMVLEELHHVDEASATDRHLRVLLLLSRLRLSMCADRSALSHLFSCLCNLLPPHSIQQDKLREWKRLRGLVMRHLLYSVRGEEVEQHYTRVLKSSETWVEHLAFGQYSGGLPLSLWLEACHIYLTAGRKLTVSCAEALITLRGRCKDGGVLRSSNSAGVGPLDFVSVTLLAQLLEVVSHGCCSADDLVASPVAWDKVRQTIQGAIGEDENTIQLLRAGRLCVADRQATGSLVTTYP 731 T 1.1 DUF4048 pdbpssm F Eukaryota T 6yxx 44 RA BO D0A755_TRYB9 mL81 MKRLFPSAGVSVVLTSSSIVMSCPCNHIFTSRRAYYWPYNEDFVPEGAETSRFQSSGSPGTRRRVLQEYALSPLFGARVPCCVVGTLRTAKEIIVLKRDIQSLLCELMELPQGSVTLGPLQQLREMMLYRHGTPLSPRLGDERQLNMDAYARRPIAARTMMDVFLAEDMSLDEIVNFGRLSLGKLQIALNNLRKENAEKSSADCEDGSAVEPAQLLVDRMGISYCEIPSLDESDYVFAEVGGVAVTDEEAERVAQRWAERCE 262 T 10 DUF1382 unphh F Eukaryota T 6yxx 45 SA,UA EO,EP C9ZMA6_TRYB9 mt-LAF15a MFTVSCNVAFLCHPAVHHSLLLLRALRQRHTLAIERMGANVTNIGGTVSLSQCGNHISIVPPNLHGSKCVTSGGSIGTVGESPLCVAEHGLQRVHDPQHILYLFSSASPVRQSALDGQIQSYLNAVVVSNQVLRAADDVLIALSIGEMEAVRQTHGNLIDCVAALDASLQQTTENEEGGGGNGATQEVDCLSTWPLFTTIQFLVEEGGLPLGPFPRMSRAYYRLKESTPVVAHSQLVWRTFELSRGPEGPTGELPAWPHRGFLRDIQRQIAEYTTDPPERIMAGVTGEKGPLRARVSGARLGLQRTPARIPWTMQGLHR 319 T 0.42 Hemerythrin pdbpssm F Eukaryota T 6yxx 49 XA BR Q586A6_TRYB2 mL84 MLRFTRLFREMAPELQLEYMPVLFTRTILGPQGGFAGEERLVKLEVARKYMEAGHAVTPTEELRRGLWCYNPDTDKYDCFIERNEEFLDFAARKRQWLDVYWRVNTGYLLFGRQSWGQGFLINCPLRKRDVAQKLWEQYKVRIDPRLIEFREKDRRTGIQELGHNWCWLYLPGAEELGINREVYDNKRVKVRIHVRKMNSMFALY 205 T 2.2 Ribosomal_L9_C pdbhh F Eukaryota T 6yxx 53 BB AT Q4GZ98_TRYB2 bL19m MGYTRERTNRHFFVARANAFFSRLPIARVQRSLAMEAVREGRMRPWKYTKEQILGAPVTCNFEYNPRPVRLIGTVMDAHTEETSIKGGLKVYARNEETNMMLWIPPGNPKLRHEVTSTKGSFQHYLDERDKWDEAWITGRARMK 144 T 3.8 DUF2760 pdbhh F Eukaryota T 6yxx 54 CB BT A0A3L6L8W0_9TRYP mL86 MRRCIPARGGFTMKYKKGTGLWDEDHVNDYKTNRYLSARATMRWYQEMERHQTRNSLNARRATQSHNNNRGLHHTGRGAFERELERRGVQVEKYPLTTTTGTMRVAELVILRRMELEKRAEEALAEQRGELQKKNPTPSEWYDESKGPLNPNFLRSMRSHYEVDIANLPDTPLIRGQREFFIGEERGNGAA 191 T 3 DUF2663 pdbhh F Eukaryota T 6yxx 55 DB ET Q383S4_TRYB2 mt-LAF19 MAFRRLVKRHKITNNQMLLMRRREPYKPTMKDRQEIADRAKLEEFERKNADGLMFVPEKALPPWQKSLAHNAKALGSRINFRGFRVRVADGQDEPGFPTPFR 102 T 5.2 Ribosomal_S6e pdbhh F Eukaryota T 6yxx 56 EB AU Q383R2_TRYB2 bL20m MLRRTVCVQHYRAKLELDRIRSMLRGRARLERKVGLKRLFFLMRTQTRYRVEQQAHWERAIVRKNVDSAAREHGTGWQHLRNELGRQNVMLLPRSQQLLAQYEPLAFRAVVELCASRIPPPPPPVVASVPEESYTLWPPASHDNSECASTDGSDAPHGQQQSLSHPAARVELRCGVERVLRRGPSGLGNNVNELIDAWKEFDVSPLRKGEVNK 213 T 7.7E-05 Ribosomal_L20 pdbhh F Eukaryota T 6yxx 57 FB BU D0A7Z9_TRYB9 mL87 MLNPTFSLYRKTLQSYPVPPKIRHYDRRWSGSRTNPYNRQYWRVIMNENYSRPSFWVSDFRHRYLMRTGTDYQGQVPSSPQPGLYQGFSDVHKLLANHPKPQRESRHLPVLPMTPRIVFEHANEKRIDTAKRMRRDRRRIEELKTLEFWGWYMKLQRVRGRWCREQGVSSRGVYGPAVDAAELWG 185 T 1.3 Crl pdbhh F Eukaryota T 6yxx 59 HB BV C9ZUW3_TRYB9 mL88 MLQINAFKLVRATPFLLKRTGKPADTPDYKQVYLPYDAAPTERELERERRRFKQAYHGRMEHRKLVEVKEVPLNVYTYGKEGMSLPIAIFKDQKDPVIGPEWTYPGIYENKIAAQHWYTEELFDKESKEAFESPWQQQILDNQVKRRMAKVMFRMRQVNMKAVDLFQKERGSSRRSGGAGEKGKDGGGKK 190 T 2.5 Ribosomal_L37 pdbhh F Eukaryota T 6yxx 61 JB BW Q57WW5_TRYB2 mL89 MSGAFGCGSYRSVVAGTQNVPRRMTFYPSAYELIQLHKAHREVIRHFYVRDKIFDNKFPGNALANGLFKFVPNRRENYHMRELMESIRRRSIWMHRIKQQREINAKVVENMEVKYGKKAAASMLCFTTPDSNAYFAPHRYQDVANSWPNYWQHPSVNHVVPKPRWRRHRELGGITRVEDPFAVQASDY 188 T 24 Babuvirus_MP pdbhh F Eukaryota T 6yxx 65 NB Ba D0A4T0_TRYB9 mL93 MFRVTGLQLKNPVVFKQGQGMFSHQLKRLLQKKSIHRYNWDPLPMYDPRKLVHASRHMDVETWREVPDPHWDERSYLVPDQMFYNIPVPPEYKDAYWWRELQARRVQCPVEWVSHRMYNKGDRQRYDFQDLAFRKKFEFSYEEVVKNAKDMRS 153 T 7.9 Pox_VP8_L4R pdbhh F Eukaryota T 6yxx 67 PB Bc Q389K3_TRYB2 mL95 MFRPTTAIADSFKEHYHRVHLPRRLALQRYARQQSLRNAAKGNVKAEEVPYKYNRWWVNEEHEFVHQYAFVEDPEVTKAKRETLPPVTRENIWKEPQQTFFLPFAPYVRVVDYPKDPDAKFLKPVNIPRWKDYMQRTKPVIPRTWY 146 T 1.8 DUF4653 pdbhh F Eukaryota T 6yxx 68 QB Ae Q383U6_TRYB2 mL41 MLRCSCARRRGVYHNAPSVYPFVKPFHDTPYDQDRGRHDSVGQRYRKNSWPEWMDNGADGTGYGIGLHRTHPLSRLRGNLKRSPSHVPRVLGMMIQGVWHKSGVKLYFRGGKPPNPSVHPYLTGEPCPLYGWKVTDESVIRQFNMPSIDGTNFRYKPYVALQERKIMGVQVPSDSVSLASGRTTESKPLAKRLFFWR 197 T 1.9 MRP-L27 pdbhh F Eukaryota T 6yxx 69 RB Af Q383S1_TRYB2 mL42 MLRLCRVSLRVQSHQKKRAQHPNAGTRFGRVYNRGFIRYGFGGFGMSVYTPKKDRRFRVQPLPSLHANSLADDTPLVTTTRTLSPNFRAFALQDGGVFFTHPSHEQVMRVGQNILAEETKATGMTSMDTYVNSRIQSIIAENTVENVALSHWRRRHMWNLVRTHGKLQRHWGVSDATKGARSNIYGRPS 189 T 0.12 Toxin_10 pdb F Eukaryota T 6yxx 70 SB Bf Q388M2_TRYB2 mL98 MVLRGVRLRSVAVSCYGSSLTAATRCLSVRTEDFFSKEAISHARRVSWAPHTTEKKQGAFAKLARSNFGDPLPSSFAQEPYFEEEIEAHRKHHRPDVYIYKYNVSPTHFSLRE 113 T 6.1 DUF2975 pdbhh F Eukaryota T 6yxx 72 VB E7 mt-LAF27 MKRFIQSRRVWNLMDIRRRPPVLNRLRGVLQFAQQPGALSRHRQGGDYNCSMRVSIYSRPGKVSRLNNADATWHNRSRKEKPPDFDPSAFRRRYRDS 97 T 0.034 CobW_C pdb F T 6yxx 74 XB E9 A0A3L6L206_9TRYP mt-LAF29 MPRHLSSSLLQKSLTPGAARLLPQNLIPQRRPAPGRMTYRGPLLAELDRVRDTCTTEATSSGVVEESMVSNGAFDAVRPAALGSTSSVADEYLVPTPRATKLKKAAMAASRTPTFVLPNSRSATDGEGGARQANDERDGTNSSPQLFSIESYETQQRALRNIFNEAGRHCVRLRKDSKWLLEERRAFRQSHQRAPTTKEVSPHVDVALAPVGALKLSKYLSPCSASREVVEHSLLLHRTVTKSKLVQSSEKTLFRCLRCFHVYAARPRTLLRGEVAQSWLEYEAEAEKEERARQLARRPHLRKKKYSIGRQRASLANDPRCCPLCRSTKAQWMMEYVHHHTHG 343 T 0.092 Sigma70_r3 pdbpercent F Eukaryota T 6yxx 76 ZB Ul UNK XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 238 F F F 6yxx 77 AC Um UNK XXXXXXXXXXXXXXXXXXXXXXXXXXX 27 F F F 6yxx 78 BC Un UNK XXXXXXXXXXXXXXXXXXXXXXXXXXXX 28 F F F 6yxx 79 CC Ao mL52 MHIWRTAASNWVKRHARSNAAQWRRPEPASSSAAAKTIARLLAVGGHGCWSEATALYAEALATKSVQRYGVVTPAHRHDCIALILGAMSSSVDASGSDINRGEASLTPHYGMLAERLSLDIIVPHSDVGQEEKEAASVACIAVFRALLTSGRTHAAQRFAQQLLRSRNSREHGLWALRTIMLAVTAREEGMYRQLFLHGGLLKDLEHLTGLDAFTVVMSSCDGHEKNIGPCGGRTEGNLGRDEVHRVAARWCVDVALHMHTCPWRFCLTNEKEPRRETAFNSEVTSYCCAKELPMPAFAVTMLPFEVLRSNAVGVLRCCTAAGNGSHGNEVVTFDGTASNVSGGRSGDSTSCSTTAPSLGLPALKRLQDVVSAAIEVPLSTHEPVECLYLLLQISRDSTAPDPRRVMPSTMNILWWRRHLQSCVLESVKSAVRWKSVCDFVGLSRLVTERVPHCFPLLVAHAAESPDALRTLFNEVSRQEGAAAXPVDDIPRSCVMGLLQMCGDERSAAVIDWVVEHCXDHESIAQFVLDSLGQHQELASVVVRRLFQRSEQSXDVINALSRLLEKGTCRTLLADVCSLVPQAQQWTVALRFVSGMSAPEVGHVHTAFMAFVGNCDEQFAINMALLWSDETFTWLKADVNPPPRRVYMDDRWSDALRLVALGAERLTQRPELLGKYIRYFSKRVAVNSRLYMELEALLACSKVGGGSARNCGRSSNWNPIASTVTGGPPLETSLVGVRTEVLPKGRQWMEACAHASEVGVTRGVLEILAKRGRWEEATILLARAEAKRRVEWAPLVIRAARLSSQWRAALSVAEKIAANGMKLHYSVVAELLACCFSADVPLEVVQLWLHRRSVGESTSSHVLFGLGPGSSLRTEQRDANDKHNVFGDASRWLGSAQAELFFVLLESSSSRSSDHQWRYALQLLKEHVLLSGGVPSARVFRSTYSILHRAERWRESLQLLGLQRSVCGSPTVKCVHLVLSTLPSTAWQYALGALQCIPPGDSGSIHRVLPLLLPVSWESALGLMIDHRVMTNTAMECVVGCEDVPLALRLQAWRRLLPSLPVSMKHRAAPIYLRLAAGVADGDVGLEGTNGLDAMCVIERSIRGLRGDYINYASFVYHRALLHRHWCNDSLHPSGGVAAFRALSGIAEVDQQCEVSAAALEQLSNIVERLEQVCGTVSASTATHGSQHLERTVPVVNSGEILQGSAPTNARRDPETHCFSFVSSYWLSLFYLFRYCRCVISSSLFPFFKKDQGSGLPIFDVMRLHTVRAPIITRAAMRGYSEARSNYDGTSLPAWPAPGKKPTYPAALSELRLPQPRMRKTRTEWMYYHGHGGCPGKYGPSREIADFEYADGTPASISGRRFAFKHHQDHLLVQLIRAAATVERYDASGLLPRIPGTAEQRNWDPAIPLFLDDVDEQGRPAPLRTAGDAPGTMVSHVCSRVVDERMGTPTHTPNELANRHEGETLEANTMFATNDPSAFVSDTVKLRDDKRPYWSRRRWALTDKFLVPKSPKPKNTIKDE 1520 T 0.00017 MRPL52 pdbpssm F T 6yxx 80 DC Ap A0A3L6L3K9_9TRYP mL53 MLNPPKHYSVESLRTVGLLPAQLALSRKPRLRPHVGNLKGLVYPLPYYAMWRGNHNKYTYNKSTVCLWGEGDTRSMYHQHYAHAKCPTDYGRGGREFEYLTVKRGKMLQKPLPRVQYVAEGSKPVWLFKSWHTPLSSPSMWEREVQYAEHTPEHIGAKRPLAVVAPRTMHRYLFLMHMEKVTITVSPLLFGYGHTIQKAVLDFYRRAISARSPFPKDKVFLFYAIDHITPRIEVTWLDGTSYVPPVLEGASSQDLIQMVMEEAWLAADRMAAEGRVLNPLAIDDYKWDQLVVFKKVRDKEASKGGGRKK 309 T 5.5 MRP_L53 unphh F Eukaryota T 6yxx 81 EC Up UNK XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 87 F F F 6yxx 82 GC Us UNK XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 79 F F F 6yxx 83 HC At C9ZU82_TRYB9 mL63 MLRHCTAHRRYRTAWRELLHPLPVRARKMEWLKRDAVEENEEILRRPYYTIKSYALPPAVGRQESIHNSNNIRGGMHSSHSLDLIMRQPRRVKTPEQLRALRDRLRFIGVTGPMPQATSVSTKSYTDTYGSRLRPRYPESWDTVPPHQPSRELL 154 T 20 DUF4113 pdbhh F Eukaryota T 6yxx 84 IC Av D0A934_TRYB9 mL64 MLRGTRGFLAVSPGVGIAPETTPVKYTPMMLNIQNMMWWNGKRNLYRATYREKTWYEISRTGAFTKGRRPVMRQKYSREALQAALAMVPPGFEVADVPRPPQRILAQSEGIVGRWYSNYWTLHSMRYQCLLAGVEWPLGERQRPRTNYDEPFFFADFEESKAIRDYRSRWINVNRSLVGMTKRMKEAEEEARYMQFRKLQDTFWSNRKVLVNRVKSMYNQGARTSAKDMPIKTINIKAFLSE 242 T 0.022 DUF1672 unppercent F Eukaryota T 6yxy 4 D A5 Q584F4_TRYB2 bL32m MFRRTFFTPMIAQPTLLMLGNKGGTPKRKKNPMQLRRKVYGLHFKEKYLKMEEWYYCPLCAEPKKPGEWCRREDCRQIKP 80 T 0.089 HVO_2753_ZBP pdbhh F Eukaryota T 6yxy 5 E A8 D0A1K1_TRYB9 bL35m MGSEESNNICAYKRTISLAKIYIVLLVKTAMLRYSRLCFPKMGCEEITRKARRVQLQPTEYLAQHRMQVWQLRFKEMGPPFSRVWVALGGKMRRRRVGRQVDVKDMRYYWRPIEPQYQRLYMSRLRIRDHSNKLRQPMRLRATNADIGSGSSSIEWERASNRKYGAMLAPPKRQDFEFRVV 181 T 0.13 Cytochrom_B559a pdbpssm F Eukaryota T 6yxy 7 G BA Q386Z1_TRYB2 mL67 MLRRFALTSSVALRLRFERDSGHNTVRYRPIPESMQPKHLEDNFTPFPLPKFDESLEYGPVRLRNIPDIEAAKERRRGSRLAATEVLLQETLQEENQFATSGKGDGNMAIAITERHTEDVTTPAADSRFPSQTMSPCSHEEEMRGYVVSRDYPLIDRLHCTRSIEELVAQFEDRPQIESRVAALADMASTVSFRSDEELLRMFTAISAPFSVDGRGLNFLTVKVSKFGRPYYVPNSLLPAYVNLVDATTIALVREQPWRLSASPALFIQVLQFMALIKVFEPNKWFTFSDHAPSNRADYRHAIGVNHSTAFWGTGEELYDFMVELLRVEDDGRIPTMLDLCTREQMVDLLSGFCGVMPCGKAVGDVFKTITDAFLRRVRNDISGPWSAHDWAIVERMYLVTVLCDAGNNEILQLLLSDTASPRGPDFFAAVSRTKDTPTKKRALCLLQEAIDNASAKADKVTLLGLLESGSEFLLSLVDKGVAHTFATQNLFDYRILNSFLHCSLVADRLRVEQSVITSLIPSSLRDVQVQMLMSNERNALNPLTSSLPGNSGAIATAPKLKRPLMTMLSQLEYLNSIDSVFILHSSLMATSTDQLVSAVRRLPSGKDSLIVTMSCLRALSVKSLTSPSMKERIACARALEIVSYELEKGRAVLLPFSEEILLHDAGAYCDEDLMLWTVAAFLARELPLVKVHTLMHSNCTARTPYRFLKGGHNLLVSSRSLYDKGAPLLSSLHSKELRLVTHNVRLRTPVRDRKCTLQYYNPIRARFVYRRDKPLFDKYHVTARNLAPGFSRGALKHDWRALGVYTPDHPQVPYHPLQTWMLGETTVEAE 831 T 0.047 DUF5642 pdb F Eukaryota T 6yxy 9 I BB D0A135_TRYB9 mL68 MLYTRRLMTTGGSATADGAVSYSKGSYHIVPKKYTVGKRIAVRSYLDRNRTELSDRTYMPQKAWFEPYTPKKFDMEHQRISHNFYNLETKLIWTAFDTPELIGILLHDETIKGAPHLYDAEFLESAVHWTRESRYWRCIGITKPFYNKTTLRAQCWHDRGLQVGTLVFSQAMRDALMDLERAVRRKELGLEPNYVWDRWGPVGFIDGARTDHLPRFAHNPYVDPDGVEVTEVDIAPFNTHEQIKERYGAFIDPDLRPFEGVFRAPSHGALTLDDVPHQEAVRLYRDLMEKADMPVMLGNGAEIPPMDMRALFHLSANPERMKAASELSSWREVRGMLAPVQEVCDEKVEALRLMENTRHDAARVRTFYEEKCGFSDFMRTPDKVITAAVLCYLQELQRICTETDWGKPLARCLTDLERVNVMGKDAFLVYRHIEDAILDKKRRVWATRFAGEANEESTLDYLLENFGRRTEQTRNVGTTGTEFDREQEPIGRQVQRRVLDSDKASKLAEVRQKRGKMWSNKKSVFDSLHQKQLQNVTYGVH 541 T 4.7 VirE_N pdbhh F Eukaryota T 6yxy 12 L BD mL70 MCLKRKAPHLFCFCLWSIFLSFRCFCFRSYAIMLPLLSFPTIQISIFLSFKLPITTFLLSPCFVFVFVFAIRYCGELTLNAQLVLFLLYHCAQTQRGPLKEGEMPICPGLCGELAAVPFRVFLGTLPTLAVEERFLRQLQPVFAWYSSRKRVKEQANEFIEIDLASCDAELLLRYSHIYYVRRQLFDELIERQMTLLDSGKAPKMAEPSLLQCLAGCNMTIADRLQLEIRQLGAAKRAASVPGRRELDPVARLEVYDYACMMRLVEEDAGAVGDAEMKARAYLPREVIESKLGHLTQLLLGSDARAALDKKDVKLLNRMIPPDYTRVGCVEKLRPFDVTAYFRFYGERINNVKVENYFKRALWGHVYRRFATTPSFLSGVSTYWARHSGLDASFTTTTMPQEVAVAVCDQQIQFPAIKFRAQYVYTSPETARQLWRTDAAVPLMRLFPLMGSRTAEDLAAGVLTDAFWMHLGLSEEENLLQDSLLLKVRRFVDEVGDMYETNIDSVLKRVDDNFKQVVPQLKAEDLQVDAPLQDGEGEDTVRETVAA 547 T 0.26 MGAT2 unppssm F T 6yxy 15 O BE Q57WG1_TRYB2 mL71 MLFRSVSCKNYQRGGWSPGSKHQKHMTLNPTLYLYRFPGPHGPGPYTMKYWWTLGCFPTGMEVPFRLHEFLSTYQQEHVPVEVEEWLRCYIKDPLSELVNASNDFFKAVEVYPEVESARGYKTLQPSIAPLLVPMKKFEEQLGVKISPVGLRSVLSNPVLKDRFLDDLFDYKSYVEKGGSTPHRRLARSRFEGSLSVLGECEKCLPEQHQVEISESLGTFIGATVSPAETTADDERSLILLLTTISEGCINAGNYSDAASVLADALMFCHDPDSQATTHANISFASLLNADFKGAEYNGREAALLQPQVKPTSTACARGYVGWAAAAAYQDDFEKAEAIVKDGLTLYVGNEHLEKLANKLQALREEQPSVYKQVPRSLRESRSHLPSQQSRGLLSGSGKGFSNEFDWVEFKNKLYPSKMDPRNNEMGSVFRRVGDLGSFISTSRSMERL 449 T 2.4E-05 TPR_21 pdbhh F Eukaryota T 6yxy 17 Q UE UNK XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 33 F F F 6yxy 21 U EG Q387S8_TRYB2 mt-LAF7 MPNIKGGVGSFLMRRAAPKSIRQKYQTGPQFYKRKFFQFQKGHHRLHRRISGVQTGSPTHQREYERFHHLPGDVRTRPQFDFTFGETRADRVMFAWRKRGDLQLYQMSGRGETFVCYRCGYPVRSQLVAVKADNWDYRMCYRCYTNTVHRGMENDT 156 T 0.042 Ring_hydroxyl_B unppercent F Eukaryota T 6yxy 22 V BH Q38AM5_TRYB2 mL74 MLVPGLSLTRRAVTSSCCRPLHVVRGFSTTCTLFGLEQLQDVPTSTSRRPTGLHRGPGKRQTSEREAAQYKFIRRWELQMRDEWDQLEPFKGLPKPKRQFGNEAAEVIWPYALLLERVVKVHPFTKSIYVYYAQRQSTARGKLAAEIARSFAREFLIPITFHNSQVYTEAEMLLEYSETPWVVLHSLDNGQKPRILPVAPVEGTPAHTAVEQLLAEVVQGCEALGASVADPVTATRVLNERPLQNQYVRVDYQWFGDTPDERASHLVRWEFEPEQIEPKIRHRTRHVLDWLNYDGNLPTHRAVHVNAMREKARQKAPRTVAGPRTFYNSAGSRANARSSRFGGQAAVGK 349 T 0.023 L51_S25_CI-B8 pdbhh F Eukaryota T 6yxy 23 W EH Q57ZS6_TRYB2 mt-LAF8 MKSVFDIARSHVTFPVSRDGTALRRVLKDWLDYTECQSLQAKPAFPAELCITVHPSVKSMSRVYTQGDPKMSGEVCRRPGEAASLSYTKRVRLLLWSAVLPWEVQRGLSMSVLEPPGNGGSVLGEVGVRHGLGGSGGAVDGGSAVATKGWKEVEDSLGVVTDPTTXTAADFVSGPQCSTLGASIIPGMLFVMSPESAAQGLCFWSGAIRQPIDIAFIAPVEPPAADTPSFSELRQRRLQLSLEGYDISRFFPDGELESPTVTFAVQSHSYLDPFPDCEQRDQQVGCGSSRERNKGIEDSRRYTATPEGVGRGNENVRYVLETRRNLLRDSIRSALRECRCTHGGVVWASNTGCAADGNPTGTGECDVEVTISLTLSDELKEDLREKARLYTNYVVPLEGHVRRHIKCLSGISPHPKGDATDGSDALQQKGTEAVWGEEGCVCTNGSAPFVAPPPIIKPPLPVKVGTLASTRPRSPMLADEAEGRPTRLAPSVFGRHDAPALQRAQQECNQLISASALARIPNTSPRAPEIPPIDYEIFDLCLRLGLCQSEAIYYFYGRIMREWSKELRRLRAAKSHGEGGVNDGNDMVLREEDVHRMLRLVHDPSLQVPPELSACVEAVASLRKITNEVGVPVV 634 T 0.45 DUF192 pdbhh F Eukaryota T 6yxy 27 AA UI UNK XXXXXXXXXXXXXXXXXXXXXXX 23 F F F 6yxy 28 BA BJ Q383M2_TRYB2 mL76 MLRLSSWNLKSQHHNVLRRSRPHIHKYRELNRWQRQAQGISKWDQSHSHRPLPYVERFNPESVGLTRGTSAFAWKWWHTQYPWLPNVPPEAAQIDEAQKQERRSHRPPAWDDEFAKVVLNMNDAEIREYLMSKLTDVIFLETQRDGYELRRLDFEGKPLTSLPEPRIIENFVLEEETIRERVIYQVVEGVFRLSPTSADRRELRSVANIIDYVLTHVRAARPTDRERRQERPITSAALAVMQKCPIQPQLGFVHALPHDTRDALLQEWERMHHLDWQFGKAVYTPRSKENVRGNLTWLREDRHYDQRMKFMQEVESGEARAKHMKLIAEAAGN 333 T 1.8 KIX pdbpssm F Eukaryota T 6yxy 33 GA UK UNK XXXXXXXXXXXXXXXXXXXXXXXXX 25 F F F 6yxy 34 HA BL Q389N4_TRYB2 mL78 MGASAGLIRRGGGVFPDAVSLTLTPSRRVYGGSGRGDLLYENPDARRHSGRALGVLNGVRHSSQATMPESGQLYYRKLILHSRPPNGSCAGLQRHCHDTCNWSYLIPSLHRCAESAISAKLWEKMCQLGLEDRSKAWVNLTQYERQRVRDGQNLYRYEVHQRLPLLEESIGWAQLDDLLGWFRSARRAWVRLPTSVTLSRESSEAGVASVVSPSSAMSCRLEGHADSRDTTPGRNQVFDTPERVEQLTEATVHRIREELQRLNRSERSDCEGSAAMRASARRLARDEELSRCVEEELGWHGVALQHRIPVPK 312 T 0.091 CAF1-p150_C2 pdb F Eukaryota T 6yxy 35 IA EL C9ZVC0_TRYB9 mt-LAF12 MRRLFITTASTLCHSLHCTDTRTGGAGKESTPTEVQCEMTLQCSDESGCSPFLSSLLSPVETVPLHDVTRTYSTMDVVDPPARYNPMVPNVEPSSSSAGHMEQXLENEEEEGPVACAHKNGKLWGVFEGSEDNKPPAWFYRLCKDLFYRTNSEDNMDDAALVSDIEPSHYISSTENRHVDGSDTTQRSAEAGTDVSDGVDPYVWIPFNLLDEADYHVGPYRFPSTATYTHEQRTLLCLGDTRREYVHFCDSYAFPGRAQIPTSVGTCPSKLYVNPKQQQPVVYIQLSNDIPPAMWLPVKGTAASVRRVLAEFASMAALHRDWHHDEFMERHATAVRMLELQRLPAGEGDILRYMAYDARNAQFAFAPIREFPNQQEFFLGEHDDPEKLMEHVDLCPLLFAIPHMRTVVDLHAEHMIPTIAGPGVATSLYRCIYSKALLFVQVHLSSEVKLPPQDPEAFKFMWKDSQVLPKMRIPVFVRVVWPTNERMSGGGGLLRRFNRLFGTEFASDIPVDAAMALLYVMQWSGHIKDFLGVRGMRQRLADLLLASQQPEPTKLYPGTREIPNPEYTVAERLGMHVQYLAQLHDPDISLTIQRLLPVASAPVRMGCAKAALIAGDRELFRHIVSSEPPGRMQTYMTKLVRKRKTRDLVDAEPRLLEDQYEFAAPLWTKRGKRLDSNTLEGVVEAQSRLSG 691 T 12 RNF152_C unphh F Eukaryota T 6yxy 36 JA UL UNK XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 72 F F F 6yxy 38 LA UM UNK XXXXXXXX 8 F F F 6yxy 40 NA BN C9ZQF0_TRYB9 mL80 MCCLYTSDVFFWSSLVVSPLPRIVRCVSAPQIRSIPCGSGDFSVMKRSLIARWQSGVHTPHGVVYRGAKMKNWPEQRIPENFKFTEEQRFRTKAIPRDVGTIPRNFVLGVLYRHQPCEVGGLWEHCTNDPEIVLDSKRHLREVLKQAREEGFVTFERDAISNEWLCFLTRERYEEVQRIVTAKSEAVDTHSGLRGAAATETSTYAEKFREMNVEAKEAHARRLEEEVANTTRYLRRFQQREIDYLPYTDLNGKVNFMWWYETRDVQQRADEVLTDSSSSKALAGEEEHKAGSQLEATTASTS 302 T 0.023 DUF2203 pdbpssm F Eukaryota T 6yxy 41 OA EN C9ZPS0_TRYB9 mt-LAF14 MRRCGNDRAVALLRATTEAMRLVKLKLSPDRTRNEEIQDRQNAFVWSDEHIFRPHQHFTHDPCSWSRSLEQSMKKQRKLSMVERLRSLEQRQLEEKQSASATAGGSSKCANHMDGEKAEGPRFYGAVGDSEDLKEYVANEDYFYTMQQEEKPNDPPLQELVDEVQSLHVLLSSPRYEDTPLATVERLQCAYSEALRCVFDRVRNASVGKTMSCNALLFSWSLLLQGVPALLESLAEKRTEECLVRALSTVHEALNIVLQEFNRITHSKERVELLPLEGWIESLDVVTHPLTNKDYTSLKGNIRLPESSFKPQCKLDSATVEFVHSRAIQAAAIRMIENDQSDVETEPLDPYHLYILLRCMVRLAEKGVNDSHIHRAALLTGMVGERIFSSLERTVAPPRRYSLRHALLGKQLRDASKPHAIPLDVCAPPGGVKKPPTAADDVLLLTRACTLLMKVATNVLPQTKFKVLETVDTVLKTLSYAPNYDLSTADTVIFSNMVLEELHHVDEASATDRHLRVLLLLSRLRLSMCADRSALSHLFSCLCNLLPPHSIQQDKLREWKRLRGLVMRHLLYSVRGEEVEQHYTRVLKSSETWVEHLAFGQYSGGLPLSLWLEACHIYLTAGRKLTVSCAEALITLRGRCKDGGVLRSSNSAGVGPLDFVSVTLLAQLLEVVSHGCCSADDLVASPVAWDKVRQTIQGAIGEDENTIQLLRAGRLCVADRQATGSLVTTYP 731 T 1.1 DUF4048 pdbpssm F Eukaryota T 6yxy 43 QA BO Q385L5_TRYB2 mL81 MKRLFPSAGVSVVLTSSSIVMSCPCNHIFTSRRAYYWPYNEDFVPEGAETSRFQSSGSPGTRRRVLQEYALSPLFGARVPCCVVGTLRTAKEIIVLKRDIQSLLCELMELPQGSVTLGPLQQLREMMLYRHGTPLSPRLGDERQLNMDAYARRPIAARTMMDVFLAEDMSLDEIVNFGRLSLGKLQIALNNLRKENAEKSSADCEDGSAVEPAQLLVDRMGISYCEIPSLDESDYVFAEVGGVAVTDEEAERVAQRWAERCE 262 T 10 DUF1382 unphh F Eukaryota T 6yxy 44 RA,TA EO,EP C9ZMA6_TRYB9 mt-LAF15a MFTVSCNVAFLCHPAVHHSLLLLRALRQRHTLAIERMGANVTNIGGTVSLSQCGNHISIVPPNLHGSKCVTSGGSIGTVGESPLCVAEHGLQRVHDPQHILYLFSSASPVRQSALDGQIQSYLNAVVVSNQVLRAADDVLIALSIGEMEAVRQTHGNLIDCVAALDASLQQTTENEEGGGGNGATQEVDCLSTWPLFTTIQFLVEEGGLPLGPFPRMSRAYYRLKESTPVVAHSQLVWRTFELSRGPEGPTGELPAWPHRGFLRDIQRQIAEYTTDPPERIMAGVTGEKGPLRARVSGARLGLQRTPARIPWTMQGLHR 319 T 0.42 Hemerythrin pdbpssm F Eukaryota T 6yxy 49 XA BR Q586A6_TRYB2 mL84 MLRFTRLFREMAPELQLEYMPVLFTRTILGPQGGFAGEERLVKLEVARKYMEAGHAVTPTEELRRGLWCYNPDTDKYDCFIERNEEFLDFAARKRQWLDVYWRVNTGYLLFGRQSWGQGFLINCPLRKRDVAQKLWEQYKVRIDPRLIEFREKDRRTGIQELGHNWCWLYLPGAEELGINREVYDNKRVKVRIHVRKMNSMFALY 205 T 2.2 Ribosomal_L9_C pdbhh F Eukaryota T 6yxy 52 BB AT Q4GZ98_TRYB2 bL19m MGYTRERTNRHFFVARANAFFSRLPIARVQRSLAMEAVREGRMRPWKYTKEQILGAPVTCNFEYNPRPVRLIGTVMDAHTEETSIKGGLKVYARNEETNMMLWIPPGNPKLRHEVTSTKGSFQHYLDERDKWDEAWITGRARMK 144 T 3.8 DUF2760 pdbhh F Eukaryota T 6yxy 53 CB BT A0A3L6L8W0_9TRYP mL86 MRRCIPARGGFTMKYKKGTGLWDEDHVNDYKTNRYLSARATMRWYQEMERHQTRNSLNARRATQSHNNNRGLHHTGRGAFERELERRGVQVEKYPLTTTTGTMRVAELVILRRMELEKRAEEALAEQRGELQKKNPTPSEWYDESKGPLNPNFLRSMRSHYEVDIANLPDTPLIRGQREFFIGEERGNGAA 191 T 3 DUF2663 pdbhh F Eukaryota T 6yxy 54 DB ET Q383S4_TRYB2 mt-LAF19 MAFRRLVKRHKITNNQMLLMRRREPYKPTMKDRQEIADRAKLEEFERKNADGLMFVPEKALPPWQKSLAHNAKALGSRINFRGFRVRVADGQDEPGFPTPFR 102 T 5.2 Ribosomal_S6e pdbhh F Eukaryota T 6yxy 55 EB AU Q383R2_TRYB2 bL20m MLRRTVCVQHYRAKLELDRIRSMLRGRARLERKVGLKRLFFLMRTQTRYRVEQQAHWERAIVRKNVDSAAREHGTGWQHLRNELGRQNVMLLPRSQQLLAQYEPLAFRAVVELCASRIPPPPPPVVASVPEESYTLWPPASHDNSECASTDGSDAPHGQQQSLSHPAARVELRCGVERVLRRGPSGLGNNVNELIDAWKEFDVSPLRKGEVNK 213 T 7.7E-05 Ribosomal_L20 pdbhh F Eukaryota T 6yxy 56 FB BU D0A7Z9_TRYB9 mL87 MLNPTFSLYRKTLQSYPVPPKIRHYDRRWSGSRTNPYNRQYWRVIMNENYSRPSFWVSDFRHRYLMRTGTDYQGQVPSSPQPGLYQGFSDVHKLLANHPKPQRESRHLPVLPMTPRIVFEHANEKRIDTAKRMRRDRRRIEELKTLEFWGWYMKLQRVRGRWCREQGVSSRGVYGPAVDAAELWG 185 T 1.3 Crl pdbhh F Eukaryota T 6yxy 57 GB EU mt-LAF20 MFTPSLALWSQFLKRTFVGGMGYNVKRPYRIEIRMEHDKKRRMRRRNIGCRRMMKS 56 T 0.081 MCPVI pdb F T 6yxy 59 IB BV C9ZUW3_TRYB9 mL88 MLQINAFKLVRATPFLLKRTGKPADTPDYKQVYLPYDAAPTERELERERRRFKQAYHGRMEHRKLVEVKEVPLNVYTYGKEGMSLPIAIFKDQKDPVIGPEWTYPGIYENKIAAQHWYTEELFDKESKEAFESPWQQQILDNQVKRRMAKVMFRMRQVNMKAVDLFQKERGSSRRSGGAGEKGKDGGGKK 190 T 2.5 Ribosomal_L37 pdbhh F Eukaryota T 6yxy 61 KB BW Q57WW5_TRYB2 mL89 MSGAFGCGSYRSVVAGTQNVPRRMTFYPSAYELIQLHKAHREVIRHFYVRDKIFDNKFPGNALANGLFKFVPNRRENYHMRELMESIRRRSIWMHRIKQQREINAKVVENMEVKYGKKAAASMLCFTTPDSNAYFAPHRYQDVANSWPNYWQHPSVNHVVPKPRWRRHRELGGITRVEDPFAVQASDY 188 T 24 Babuvirus_MP pdbhh F Eukaryota T 6yxy 64 NB UX UNK XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 180 F F F 6yxy 67 QB Ba D0A4T0_TRYB9 mL93 MFRVTGLQLKNPVVFKQGQGMFSHQLKRLLQKKSIHRYNWDPLPMYDPRKLVHASRHMDVETWREVPDPHWDERSYLVPDQMFYNIPVPPEYKDAYWWRELQARRVQCPVEWVSHRMYNKGDRQRYDFQDLAFRKKFEFSYEEVVKNAKDMRS 153 T 7.9 Pox_VP8_L4R pdbhh F Eukaryota T 6yxy 69 SB Bc Q389K3_TRYB2 mL95 MFRPTTAIADSFKEHYHRVHLPRRLALQRYARQQSLRNAAKGNVKAEEVPYKYNRWWVNEEHEFVHQYAFVEDPEVTKAKRETLPPVTRENIWKEPQQTFFLPFAPYVRVVDYPKDPDAKFLKPVNIPRWKDYMQRTKPVIPRTWY 146 T 1.8 DUF4653 pdbhh F Eukaryota T 6yxy 70 TB Ae Q383U6_TRYB2 mL41 MLRCSCARRRGVYHNAPSVYPFVKPFHDTPYDQDRGRHDSVGQRYRKNSWPEWMDNGADGTGYGIGLHRTHPLSRLRGNLKRSPSHVPRVLGMMIQGVWHKSGVKLYFRGGKPPNPSVHPYLTGEPCPLYGWKVTDESVIRQFNMPSIDGTNFRYKPYVALQERKIMGVQVPSDSVSLASGRTTESKPLAKRLFFWR 197 T 1.9 MRP-L27 pdbhh F Eukaryota T 6yxy 71 UB Af Q383S1_TRYB2 mL42 MLRLCRVSLRVQSHQKKRAQHPNAGTRFGRVYNRGFIRYGFGGFGMSVYTPKKDRRFRVQPLPSLHANSLADDTPLVTTTRTLSPNFRAFALQDGGVFFTHPSHEQVMRVGQNILAEETKATGMTSMDTYVNSRIQSIIAENTVENVALSHWRRRHMWNLVRTHGKLQRHWGVSDATKGARSNIYGRPS 189 T 0.12 Toxin_10 pdb F Eukaryota T 6yxy 72 VB Bf Q388M2_TRYB2 mL98 MVLRGVRLRSVAVSCYGSSLTAATRCLSVRTEDFFSKEAISHARRVSWAPHTTEKKQGAFAKLARSNFGDPLPSSFAQEPYFEEEIEAHRKHHRPDVYIYKYNVSPTHFSLRE 113 T 6.1 DUF2975 pdbhh F Eukaryota T 6yxy 74 XB Bg Q587H8_TRYB2 mL99 MYQRTRFLWSSWRDYPLGSRDRRGRFNMDEAAAALQLNPAYAAALYRPLNYTFHIRGQLYPAQKGRPSRPGSLAASQGRMFPLYQRNDRLDKELFRLNSRGLTTE 105 T 1.5 Tenui_NS4 pdbhh F Eukaryota T 6yxy 75 YB Bh A0A3L6L2V1_9TRYP mL100 MALFSCFRCGYMYEFAVSNSYCRKLTLRNDHCPRCDQLTLFRFMSVSGMVGNMPFKPIGVPGPSYATLWWRKTREGKEASAPLDAVCKSDRW 92 T 0.004 DUF1178 unphh F Eukaryota T 6yxy 76 ZB Bi Q4GZ80_TRYB2 mL101 MLVRTWMLLNRPKGPQGLRPGKEYRLTVPYRSEVTMLRLANHKAINSNIRELFKKPLVMNNIKAIPRDLGEIPRDYVLRLLFFHQPIRLVDLWTICKEHDDVPLDSAKHLRLVLKIAKLQRWVYAEKNQTNNLYYYYVHQSRMQEVQQMVRASEVRKKEQESVREIEAEKLRMEEQERRKVALDENIVALQNALVSNIAQIQEFDPGFARSKIYVTESGAVNVGWGLNDGGSAACSSDLDGQQVA 245 T 0.0025 Tfb2 pdb F Eukaryota T 6yxy 78 BC Ao Q385V2_TRYB2 mL52 MHIWRTAASNWVKRHARSNAAQWRRPEPASSSAAAKTIARLLAVGGHGCWSEATALYAEALATKSVQRYGVVTPAHRHDCIALILGAMSSSVDASGSDINRGEASLTPHYGMLAERLSLDIIVPHSDVGQEEKEAASVACIAVFRALLTSGRTHAAQRFAQQLLRSRNSREHGLWALRTIMLAVTAREEGMYRQLFLHGGLLKDLEHLTGLDAFTVVMSSCDGHEKNIGPCGGRTEGNLGRDEVHRVAARWCVDVALHMHTCPWRFCLTNEKEPRRETAFNSEVTSYCCAKELPMPAFAVTMLPFEVLRSNAVGVLRCCTAAGNGSHGNEVVTFDGTASNVSGGRSGDSTSCSTTAPSLGLPALKRLQDVVSAAIEVPLSTHEPVECLYLLLQISRDSTAPDPRRVMPSTMNILWWRRHLQSCVLESVKSAVRWKSVCDFVGLSRLVTERVPHCFPLLVAHAAESPDALRTLFNEVSRQEGAAAXPVDDIPRSCVMGLLQMCGDERSAAVIDWVVEHCXDHESIAQFVLDSLGQHQELASVVVRRLFQRSEQSXDVINALSRLLEKGTCRTLLADVCSLVPQAQQWTVALRFVSGMSAPEVGHVHTAFMAFVGNCDEQFAINMALLWSDETFTWLKADVNPPPRRVYMDDRWSDALRLVALGAERLTQRPELLGKYIRYFSKRVAVNSRLYMELEALLACSKVGGGSARNCGRSSNWNPIASTVTGGPPLETSLVGVRTEVLPKGRQWMEACAHASEVGVTRGVLEILAKRGRWEEATILLARAEAKRRVEWAPLVIRAARLSSQWRAALSVAEKIAANGMKLHYSVVAELLACCFSADVPLEVVQLWLHRRSVGESTSSHVLFGLGPGSSLRTEQRDANDKHNVFGDASRWLGSAQAELFFVLLESSSSRSSDHQWRYALQLLKEHVLLSGGVPSARVFRSTYSILHRAERWRESLQLLGLQRSVCGSPTVKCVHLVLSTLPSTAWQYALGALQCIPPGDSGSIHRVLPLLLPVSWESALGLMIDHRVMTNTAMECVVGCEDVPLALRLQAWRRLLPSLPVSMKHRAAPIYLRLAAGVADGDVGLEGTNGLDAMCVIERSIRGLRGDYINYASFVYHRALLHRHWCNDSLHPSGGVAAFRALSGIAEVDQQCEVSAAALEQLSNIVERLEQVCGTVSASTATHGSQHLERTVPVVNSGEILQGSAPTNARRDPETHCFSFVSSYWLSLFYLFRYCRCVISSSLFPFFKKDQGSGLPIFDVMRLHTVRAPIITRAAMRGYSEARSNYDGTSLPAWPAPGKKPTYPAALSELRLPQPRMRKTRTEWMYYHGHGGCPGKYGPSREIADFEYADGTPASISGRRFAFKHHQDHLLVQLIRAAATVERYDASGLLPRIPGTAEQRNWDPAIPLFLDDVDEQGRPAPLRTAGDAPGTMVSHVCSRVVDERMGTPTHTPNELANRHEGETLEANTMFATNDPSAFVSDTVKLRDDKRPYWSRRRWALTDKFLVPKSPKPKNTIKDE 1520 T 0.00017 MRPL52 pdbpssm F Eukaryota T 6yxy 79 CC Ap A0A3L6L3K9_9TRYP mL53 MLNPPKHYSVESLRTVGLLPAQLALSRKPRLRPHVGNLKGLVYPLPYYAMWRGNHNKYTYNKSTVCLWGEGDTRSMYHQHYAHAKCPTDYGRGGREFEYLTVKRGKMLQKPLPRVQYVAEGSKPVWLFKSWHTPLSSPSMWEREVQYAEHTPEHIGAKRPLAVVAPRTMHRYLFLMHMEKVTITVSPLLFGYGHTIQKAVLDFYRRAISARSPFPKDKVFLFYAIDHITPRIEVTWLDGTSYVPPVLEGASSQDLIQMVMEEAWLAADRMAAEGRVLNPLAIDDYKWDQLVVFKKVRDKEASKGGGRKK 309 T 5.5 MRP_L53 unphh F Eukaryota T 6yxy 80 DC At C9ZU82_TRYB9 mL63 MLRHCTAHRRYRTAWRELLHPLPVRARKMEWLKRDAVEENEEILRRPYYTIKSYALPPAVGRQESIHNSNNIRGGMHSSHSLDLIMRQPRRVKTPEQLRALRDRLRFIGVTGPMPQATSVSTKSYTDTYGSRLRPRYPESWDTVPPHQPSRELL 154 T 20 DUF4113 pdbhh F Eukaryota T 6yxy 81 EC Av D0A934_TRYB9 mL64 MLRGTRGFLAVSPGVGIAPETTPVKYTPMMLNIQNMMWWNGKRNLYRATYREKTWYEISRTGAFTKGRRPVMRQKYSREALQAALAMVPPGFEVADVPRPPQRILAQSEGIVGRWYSNYWTLHSMRYQCLLAGVEWPLGERQRPRTNYDEPFFFADFEESKAIRDYRSRWINVNRSLVGMTKRMKEAEEEARYMQFRKLQDTFWSNRKVLVNRVKSMYNQGARTSAKDMPIKTINIKAFLSE 242 T 0.022 DUF1672 unppercent F Eukaryota T 6yzf 2 D FFF GLU-HIS-SER EHS 3 T 360 OAR pdbhh F F 6yzh 2 B D P8C9 AARLYGFKXX 10 T 1.3 B5 pdbhh F T 6z00 2 C,D C,D MET-VAL-ASN-ALA-LEU MVNAL 5 T 120 DUF4213 pdbhh F F 6z0g 1 A A TREM2_HUMAN TREM-2,TRIGGERING RECEPTOR EXPRESSED ON MONOCYTES 2 GSGRSLLEGEIPFPPTSILLLLACIFLIKILAASALWAAAWHGQKPGTH 49 T 0.00029 SIT unphh F Eukaryota T 6z0h 1 A A TREM2_HUMAN TREM-2,TRIGGERING RECEPTOR EXPRESSED ON MONOCYTES 2 GSGRSLLEGEIPFPPTSILLLLACIFLIAILAASALWAAAWHGQKPGTH 49 T 0.00029 SIT unphh F Eukaryota T 6z0i 1 A A TREM2_HUMAN TREM-2,TRIGGERING RECEPTOR EXPRESSED ON MONOCYTES 2 GSGRSLLEGEIPFPPTSILLLLACIFLIKILAASALWAAAWHGQKPGTH 49 T 0.00029 SIT unphh F Eukaryota T 6z0l 1 A,C,E,G A,C,E,G Positive Strand XNLAALRSELQALRREGFSPERLAALESRLQALERRLAALRSRLQALRGX 50 T 0.0025 DUF5320 pdbhh F T 6z0m 1 A,C,E,G A,C,E,G Positive Strand XNLAALRSELQALRREGFSPERLAALESRLQALERRLAALRSRLQALRGX 50 T 0.0025 DUF5320 pdbhh F T 6z13 1 A P bicyclic peptide 3C CDIHVXWEWECFEKL 15 T 9.7 PDE6_gamma pdbhh F T 6z19 2 B C P2 ALYGFKWA 8 T 5 DUF2627 pdbhh F T 6z1h 2 C C Residues 249 to 266 of chain A and 246 to 258 of chain B could not be identified and has been included as UNK in chain C and D, respectively. XXXXXXXXXXXXXXXXXX 18 F F F 6z1h 3 D D Residues 249 to 266 of chain A and 246 to 258 of chain B could not be identified and has been included as UNK in chain C and D, respectively. XXXXXXXXXXXXX 13 F F F 6z1p 6 F Af Q951A2_TETTH Ymf69 MLNNIFIFEKYLKKNNFKKNKIILKKKFTPLRFFLFLLSLFLTPFNCMFIISFKINNKLEINFESYLI 68 T 0.55 DUF3667 pdbpssm F Eukaryota T 6z1p 8 H Ah bL7/L12m XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 106 F F F 6z1p 9 I Ai bL7/12m XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 61 F F F 6z1p 10 J Aj bL7/12m XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 69 F F F 6z1p 13 M Am Q951B5_TETTH Ymf74 XXXXXXXXXMKFKREFKFLIKKKNFKFKKFKILLKIYYSIKNLINFYKIIKLNNFKIKSTLLINNNYYFNYITNGLDLKYDNTFQNFELNTLSIKNYKNKNLIISNNNQLDIIKFQKFLFIIDNKYVNSLICDNLFDFFFISIILTNSLILEFYKNIILINLIKIN 166 T 0.94 Interfer-bind unppssm F Eukaryota T 6z1p 30 DA AD bL32m XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 36 F F F 6z1p 34 HA AH I7LT48_TETTS mL40 MSQFLAKAVRSDLTQLCSQTLRWNKKGKVNEATVQARKEKKRLKETNQFTGDYTGERPPHPWLSAKRLRSIMTQYQVFANKNRLNIKVDKKDAVEMKKQFEEIGAHEYYRMRTEQIINEKNNQVIDQINKEIETLPFNIYEEITKMPKDKIYNFNTDSNSPYILYFEQIARMFDEEHLTKLKVSQRLQKLAEDKLGEND 199 T 0.017 DUF5446 pdb F Eukaryota T 6z1p 39 MA AM Q22KC0_TETTS mL53 MDAISIKIIIVKKLRTINIFFELVINQNLICLEKLYYCQYTSQIVSIDRQKYSKIYQMGKKRRVQVFTGSMLQPLYRYLLSAKIGVNPLDLHGQNIAKQIFLKCKQARPKYLQNDFNATLVQDNEAPASYFHAKFVNGYEQKWYLHKSTEEEINRNLKYFNYQIEMERNLQGHDDEYEDEETQL 184 T 3.7 MRP_L53 pdbhh F Eukaryota T 6z1p 42 PA AP I7M3V9_TETTS mL101 MFQAIKSIELQTKSLFCQLQNGFKRTSASQKLRQKIKYRSSRPDKFKLVSKLGKFDFTKPNLSFPVSIPLKLSYVYQPAKHTPNLPTHDFLNFKTMTGNEILLNLENYENLRPSEICGALIELSKREGHEEINWNEHEWVAATTEHVTKMMPTYTPSVVCYLMVAFQRLRITHEKLWKNLTFAIEKTIHKFNAKSFAYTYIAYLEDTSRSSEEFRKKLVELLPIHLHQMNPNQLTRCFELTFERGYMNEYLFEQHFHVLYWRRNVWFGVNNIIKVLEIYPKLNFVDDCDFFEGAILANIPKVKTQLNEQNTKALIEAIQALESKYPDLKINSTLKFLNTHLTFCQTKLKAIENSKFYKIVLNDFEYYKIKESQRLEKEAKQAEKTN 386 T 0.36 NPV_P10 pdb F Eukaryota T 6z1p 43 QA AQ W7WYR3_TETTS mL102 MNALIFRNTNFLFNWISQSSSMLGLLGIRKNMSFQLQEETAEDKTQKKLNELLQGAFDFTNRNARPPKKANHGARPCSSVMRRLKKKYFYRRTKEAMTPEMEPKKKFEL 109 T 16 DUF5528 pdbhh F Eukaryota T 6z1p 44 RA AR I7MKV5_TETTS mL103 MLLSQSIQKAVANAFKQISRSQYKAISCFSSSDKNDNGSNNQEGDSSKNNEKKQATTNDDIYIHKTSYNLEQFKSYTQNVEKALKDLNQEKEKLENSSPFDLLPRRKRRVYDRPLHDLDISNYECWRSYDKMIFKTHKHAARVVCKINLSPRALKHAFGIGSDVSRNSDVSTREYDFEDSNLDSFLLYDYKATTEYHGNNDPNYDYQNQDQVPPKKRKQQHPTPQEFWESDEPHAFRVNCSNYADYHKFKKWIQTEIEKRASEKSYEEKIIERFGPYKIYDQYDQKYDLVKEPSVFKYGREYYLEKGKKFSEKEMEENPYLKPIKPAKQMEDKYRVAWPYQWNPKPTQ 348 T 0.14 BLI1 pdbpssm F Eukaryota T 6z1p 45 SA AS I7LTP6_TETTS mL104 MIGQVVNKGTSSVSSLFQQIRRGFKGFDKNRWATSAEVFPEKPFKYGYDKQQKIKTKQDRQWYEAPVGAMTKKRFQTADFRAYQTELEEAAQKTLEGQKIDVEEEVVKVEKDEIFGEQGAPKRYYDRILQKELYFHHRNGKHFFEQADRKISFLNQFTPQIEKVPELAETLKTILKIATESEGSPLKRAGIDQEFIIEDDFYSENEECKKLFEEIKQKVSNMNYRLLPDLCLVLSFKLRYNKDVFGIWEKIEQNFMQSIHHYPVMELVKMRYASCALSPKSLSRDCLKAIHDIVFTELHNVPSVLDLSHLLFAFRHINSLKYYNLILDEICRRPIKTLQEAIALLFVFSHSLFPNYKRKEIREKDQDLKEKHKIVDHLADALANNAKQIQGDDFVRVLIGLNNLQLTTFKDVLTHIERYIIKNIDTLDAFQTSNALYGFSKANNNAGFGSEQLYKALQKAAEKHWSQFSNADKARTFYAFAFQDLVDPVFRKKFIQPWLNENLESNLSHSELHYVAFSLMFEQNKDAEIWKKFVKNICKNQYVVPVLNYYPIKLARYYMQSIFPKWNFDIYKLACQDAEATWDASRQIDSIMENNKEWKSIGVLLQNRLEFNSIGLDNFENLLLIDWAIMPQRVAIMIQGARQTLPNGKPTPLHRLKLQLLENHKFAVFNLIYKDFEAISPDQKIPYLKKTIEELIAKQDTYIKDVEEPQQWLSFMDRMQELTYRNIMIGEATKGGVIDPEIQEVQFDWSKLQKELKERQKQEE 764 T 0.00023 RAP pdbpssm F Eukaryota T 6z1p 47 UA AU Q23Q81_TETTS mL106 MIVSKQQSIISLLWRSACGFSKFSKRQTTPKILRDHRFTRKGMVIRKQSKNYDYDLTNLAASQAHQSHLFFNKEEDFALLKKQADEKEKNMKNMHVFEDHSVPETILLEVQDKFLVQKHPEKVLNNLVELDKQFSKKGGEITELIQSALSKLIKEQILSFNLKNFGVLTTLAKKYLPNDAKLWENLANNYCRLMQKSEYNDLKTLDSARSQKYIENSVLTLTTVLKFSNKFHHENCVENISQVATSYLSTNFDVVQDVNTRFLLISNILPLIQSYKQVELIGLVKQKMDLIPKLQANTITALTHSIYKVKQQNSKNSRFPADLIDVSFLQKLEQTWLKTFDKSNTQLLAIFSYSIASLGYSGETKKFTQEYVEKNIKDITNLKDLAFFGESLKKFRALSQKYFTGAEQTLKQALSNNSQELHAELALQLLRVYSKNLFLNSEIASALIKKVDDAYYYEQFKPKASQNEMIVKTLQKYSQIVDLSNLRIYQNLIGSKKLF 499 T 1.2 MbeB_N pdbpercent F Eukaryota T 6z1p 51 YA Bc Q950X8_TETTH Ymf73 MINKKLNLFLIENKKLLKTNTEIYNLNKNFNLIKFFKLTNYKEIKALISLLKCINCLNKLNKSIFIFNKNFITVVYKTNFFKKLLTYKFINIELMLTLKLFIFFNTRIFINTSDTFIKFKSEYETYPEILFDCYHNHFSRKRVKNLSYKMFLLIMYNLI 159 T 18 Ring_hydroxyl_A pdbhh F Eukaryota T 6z1p 52 ZA Bd Q951A8_TETTH Ymf64 MKLLIFLKKINTIKKIENLNSENFTNQNYYYNLTNSLDQSEFITLKHVTFFFLKNKLMYSLSKYDRLNININENFYTFLKSLDKVDVYSLKFFKNMIFNKTYSNYNVNLFYSISIKEIKKNKEEFFKSNTNKILLIRNSFKTTNLKLLRKISIIDLILKKYLESELNDKISINFDKYNLKFMRKKRLYVKVLRRKLRRMRKMLRWAKISLRNFIRLTLIFLCTKDIDIFSKVLVKIMDSMHYKNHRRFLYYLKLFISKSMHYYFNILRFEGFFFYLSGKISGGGNSKKKNYAVKCGKYSLTNKMLKLKYKKGLIHTKTGVLGYKLMISYK 330 T 0.019 VAR1 unphh F Eukaryota T 6z1p 53 AB Be Q951C0_TETTH Ymf76 MFLVKFKIKKLRLKKKIKKFYKLINYNFSNLLNNFYHKKPNFLTLYNNTNNFFLKILFYIKYINLISKTISNKKLFKFLNNPKIRNRKKFKYKYSDKIKFILNILKSKKTKIKNLLFFIKYFSVLRKRQSRIFNLARVKSRLSKRRFFKKKLKKKKIAKYFFQMFKKLKFKHKKYINLINLDFYFIRNKRFFRLHRLYDIRKKYIRYLNNNRNIYKFYKFRIKHNFKFIKRHIKSISKLSIKDRVHFYELSLRNIAIKLKYAFTLRNANLFTKSGFIFLNGHQELNPFKYAYKGDIIELPFSKFILKLRRKMKKKMFNSMRKYKKYNWRTLKNKVNPEQRRLRISRFSENTLNFKTKLTKLFQYDYRTLSYCVVLDTNFKRDLTYLNKKLIPIYLLKLFNWKIIS 405 T 0.3 S4_2 unphh F Eukaryota T 6z1p 56 DB Bh Q951A4_TETTH Ymf63 MKQLKKIMINKTKIMSNDIIYIKRTYHQKIVNLKIYNNFKTEPKFYNLKFIEFQNLLNNVNLNKVFYTEYTSYPLEDRFATSKFHTFDSYLTSLELIDCTFLKKKFNYKYKYSMFTYFIPFLIKNGKKLSTINFILKGISTIYDNLKYNKLSQFESYSYVNQFKHYIDTADDVYNINFLIHWIINIYKPVFDVKCFNVPKVHKKKSAKTVLFKIVYLSEKNRLKTAYKHISTCIRQDNSAKLNNRITNIFLDLLLNYKKSYLYTRKMYIYEQVMDM 276 T 0.02 Ribosomal_S7 unphh F Eukaryota T 6z1p 58 FB Bj Q951A1_TETTH Ymf59 MNNKIYINIKYKMNFNPQVINSRNILSKNKSNRIYCKNFIFTILFFDFFNSTFSKNFLPYKYNLHITKKRKHVGSILRAPYKNKIAQFSLGLYRYFLNLSFFINSEFLPNINNKFEFKLLFIKFLNSYNYFESTLVTQVSRVIKIPTQIQII 152 T 37 GDYXXLXY unphh F Eukaryota T 6z1p 59 GB Bk Q950Z8_TETTH Ymf61 MVQKKFNFKKDSFFYEGYVWNHSLNIIHDIQLNYLDKNSNAIAIKYAKTLNIMSSLYRNLTFKKFDFIKIWYWYYLYYIKNIYFKNLINKNNNYVFEKPNIFVFNIKSKQIRLAVLTSKNYVYNLTVGKILSSLNIKEKSKKKSNKGERLFSEYLENFFKNKNIRFGTKKLAIIKLKYFKKGFKLHESIFKTLNKNLFIINTIYDFKVPNNFFKFKKIRSIKKRIKKKLIKDENTLNF 238 T 0.15 Pectate_lyase_3 pdb F Eukaryota T 6z1p 62 JB Bn Q951B0_TETTH Ribosomal protein S14 MLFKRRKDILKCKYKKIYIFKNKIKNIILKSIFFNRNIKNINRAYAYMILNNSKILYKKYHKICKFSGYRKNVNKFTGIGRHELNRKATLGQLQNISMNSW 101 T 2.3 Ribosomal_S14 unphh F Eukaryota T 6z1p 68 PB Bt Q22BA0_TETTS bS21m MNPIQFSQFVVNSITKHYFKEVAQGALRPGQVRREVKSVYKDYNPLKIHIFKYKHPRMLEIPMPGHQRRAERVEAQRRRNREQINFFCRYVLAKQGKTIPYN 102 T 13 DUF3755 pdbhh F Eukaryota T 6z1p 69 QB Bu Q23YQ0_TETTS mS23 MYLRKWSRMEWWDSVRYFKKYNLLDYKEEHRLLHLFPPTWQQYLPTKADLREVNFRDKQDKQLVKVLFQKYPDLRYDTSGRMYDKDGGQDNYANYVVRFILKQKEYMKRGMSANAAFIETEKIFQDRMQRKIDQNNLTRGIAINNRARSFMNFYQQMAEREARWKVQRMKRDVQQYLHEREIFEKEINDDGDMEDDFAEEENIYNRVLLKFQNAGMPEITKQDEKVSTQREFIERSENMFKVYYERAAIYDRLQGLTDSQIRSEIQNSPAKMKKRTRNLVKKLERLGXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXEKLQVVIDKLRRTKYKIDQLSMKHAQDLMFEEHEVYGKDILIDEKVGYEDLRDYLFQPSEIRRKTELDVINDNKIQEMIKISTIKEDLVNPYNNEYASKLNLQDTIEYQRSKQEKIKTLRAEKEREE 567 T 0.0021 MRP-S23 pdbhh F Eukaryota T 6z1p 70 RB Bv I7M0P0_TETTS mS26 MNFINKARSQLGLFKEFIVMPRMGWKLRKPSRKVTIYEQERRAQKQLEKEYRAKVISDYWNSQTILENEYIEKYTREELERKKKSDQNFRDSIIRIAKATQNHVEFLKKRSALDEAKERKHILEQDVKAMNKKRILNIMQQESRHWINQQNVNNINPDSIMPATIYDETDYYLKLHEQAFLFEQGRLEEMEKVSLETEEIQYKNSVLMPIYQDVISMIKHLKSTESFKLEKEFQAAKRILIEDCRNMQIEDQLEEKMAKLEKAFNTLRKAQKEKFDQPENQLEFLHEHLLILYNMLRKWGEYTNMLKIPATVVRDILHERQVMLEKKKLIRKQNLEQAKNQDRDTEENEQDSDIQSNDERETSDSEDEIDIAKLIEEEKLRQEKIKEQQRLFEERKLQMEKEQAEEEQAEQKQTIQDDLLNPEKAIENLMLQFEKKAEEELNAEFHIEKNSLYGNDKNAVDSRDFYKGIDLENVFPRALIDNSFDFTKLSGLPFQNVKEALRNEEKIELLRGEDPNNSPRKFETALIVEVFKLKSQQLRAASLTRAQQQKLNDIDTLLDLIGEIKVEEPTFLLKIWKNF 579 T 0.17 LRR_12 pdb F Eukaryota T 6z1p 72 TB Bx Q23UD3_TETTS mS31 MKIKIVYENLFRVINEGQLKSKRVKKLINFNKEELIYISKCCDMIKALKSVQRQLVFVQKAKFCSVPNQNNQGQGSNTNEPQQQAAAAATTTASQAEKPQQPAFLNANKDAQKDQKKTQNHEQKDQKQHQNQSSIYGSKINVGNSEEIKKSIQKSVNDFSSTYKLNLSDKKIKAGNKQTFAKKQDQESISKKKEKLLKGIQPTDEKLVDKHIGNVPIAQQIVVDKLKKYALRVSDTKRIHDKLKSEEGVDFSYVEPRNLEILNSFVHRKSEEMEELISTYLGVNQTLSKEEQLDDWRQQQALDHINFTPETMGKPEVFYPGSDRGPHPLDDPQNYIQWYEKHCPLPYRPLIDQMVQMTDMKITDNNMPSYIKKWIDQIQEDPQEDLDQEKEDDQEEDLDSDEEADMSEDEDVGLTDNKAQNDELEQQLCSVVGGGQGVFFYRTNPLPRLQVDNTSLWSLDQNSELPEDTSPEEQIINCEGNITFHSGLQDYEIPAFPETLSSYITHYQIPSIEKWSIFRQFPNMYHWWQKFYETNKKLSEQPLVRYQYSGLPSVYTYFYTMPEFARNNIVVQNVARCFEFNRPELNHQQKIMALNYAAKFSLPLDDLIVHAASQMIVSQKHFLTAKEEDRLKTVNQFYYEADTEAWDIHLAHEEHTIEQIENFQPPKRLGVDDIEEQLCDLPLEYYDNDDGFWNDFIKEKLNRNNAAYPATQGRAFFKH 719 T 13 Ndc1_Nup pdbpssm F Eukaryota T 6z1p 76 XB BB I7MMM6_TETTS mS37 MAPPPKYVISRKLVKRFFDKYLPRQPMDVQNESGKLMKCWQQYGIDDPRCKEYEVLYDHMYTLTRNYRAKIEGLRIKEDVMGALNRPKYHSEVKGRWRTGKTTEWDVYDGVQ 112 T 0.2 CHCH pdb F Eukaryota T 6z1p 77 YB BC mS38 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 37 F F F 6z1p 79 AC BE Q23DN6_TETTS mS45 MIQKAIKILVNKQIQCFSYSKQSSFARMSAFRNDFDEKIKQKYIKNKTNDERFQQMNPEYIKKAINEEYEEAKEELFQNGGILTELRKSMLGKEEDEKETLGAEEFEDPVDNYLHDGLTHDEFLYRASNIKKQLFAKQIPDFFEKDEINSQYGNYSSFDKNFAKLKSMKTHLPDIEQSGFAGYKLEKWVNSLKGKKEIDDDEDLDEVREIIEENEINITEDEKHLLKWKIADIMRKENDEYVPFLEELEGENEKDPFEEDDTEDGRLSCKARDDIYELYQKGWSIKDICTRYGIVPERAKAVIWMCEKYYFQILPKADALAIHMAQEMEEEWEEENGWQDYGIDLEELAEREKGMHTLSFKRYREVDVGKPSKNILSEEDYTLVQKINTPRQEKITLKLDGGKYQRGYLIKDWKINKGRGRRDVSKMFRRIIENSHDISKLPSSVQLRVREGPRNASKGYSSKL 464 T 9E-05 Bot1p pdbhh F Eukaryota T 6z1p 80 BC BF mS75 XXXXXXXXXXXXXXXXXXXXXXX 23 F F F 6z1p 81 CC BG Q24G80_TETTS mS76 MRQLVKTQLLKSNIQELRSSIDIRKVLNNKYRDSESSDFYKKNREQLVSRIKQDMDVDSYTGATTRRTQFQQYYQDQPSLGFVYPMNPGRGAFMEPDCRFTGNDFMEKINTHFATAITSSYQDLDKVVKIQVNDSKKNIEQNLEKLFREKNPNKEFNFDKEYARYLKLDSKKFKKEYGYETD 182 T 0.0099 DUF4932 pdbpercent F Eukaryota T 6z1p 82 DC BH mS77 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 63 F F F 6z1p 84 FC BJ I7M7B1_TETTS mS78 MLSRIASKKISKQIGLNKKIARQSSSINKVIQDLLETSSKEESDLRVSQFAKNLDYLREKHEERSKIINQYLHRMRAASINSNIPLKEILRQFISSTRLGFDQKMTSETLFYLSATLNSLGPNPNHYFDYNLELLKDSWRFDDLVGDFRYGLKSGVEFSPERIAQGLRSLKKLGYSNSRITREAIQKIHRMLTKNDDQFNIDTENIIDNNPSIHKPLYYMRSPLDAKQIVENPEFQKFLKATIQKQEQELNKLNQKKNLQKDEDAQTITLRNELNEEEQEILDEIEIISKKINKSIQKSALAMKKTIESVIQIRRHYLRMIESQEKVQSVNLTPQLNLDLLNLEESMVEAGLISINEVKNPQLIIDTNQNRESIAQIVSNLIVEQNESFLPYFDNLIQSKPQLLQEIDSDRSLPEINVQFFANYTHSQFAEALLAVTEYSNSKLSEFKGQTFLNEKWADFFPQVDEADRLFFVQFLDSQKLFIEVSEVVLEQTKNEKDVNALGKYAAAFGNIGLIGVSKELVKRITSLEGMSTQTGICILRVCSEFPEEFSQLSENIIQILEKSSDITASQAIDLVYYEIALNKVSEASLNILKQQSKHILDENPRLNYVSQYLKLQGVDLGYNASGASIFENNLFQKNPIKDKLVELIGKAENLSHTGLGDYKPDFMCLERNEQGEIVQKAIFITPAEYSYFDIAKPAVEYTLLSKYLSHKVKNMVFEFIPITKFIDVNHQDHMITIKRDNQLFIELFDKAHHNIYSNLNNDLLVIGENLIDEEYNILKSKIKQIFRLSGGRRYLQLSVSDLMQVKYSLFSMAEDFNTVLSETSQQNLNQLCQKHFNQDFVSLLKSHSKLNFSVKNTQEEMEFFLKQKWVGKRLSIDLLPKGNEKYNQSDAFFDHLYISSENYYEYPEWQELLTQEYGVSNIAQINPTNFLAHQVSDVCNQTNTAYIIPKKKEVRSQIRPLDQRIVTSKQDRANYSLTWEQDYFTQGTNGELVYRGENHKISEGTKYIANLLNFKWELKKAFSSQERLDFLHKLNLTDKLIENHLHTSQNHQPKHAHFAQRDPKKYFEYLQNKTPSSLYCQEYLEFFTRKVDQKNKIIQLSKLHFTKIKEIYKSYQKGYISRQQFDQQKEGIQKDLLKTLRQYDAANNSLDSLLQNETIRYIDVQFDAESLLKDKSYVRYNLENDRDYLACKKDLNQTLNLKRIRAEEKLIKAKILSKISAEQTLTSLEQQYLEKWNSGEITLPKEPTFLTYKELDNSDVQLLNSLKFSDLVSFDKMQVNDLVVEFSSLFEQAIINNADVHLKGKNALKEWGISPEWISKSATNSMNNYLIALSETQAWKDGQKLKEHEKENVCSMLRLLQEQSYQDKIFTANENVLRKEALDVFNQSENRFSIFMKWLESREQEGFSISSDNTEHVLKLWENIFEKSYENTTQEELHLFTHNLVQRLYVLSKYPPSAFGSFLSKLLLHPKLHIIDKNILICGIDAFKFNAYLSSKELIDFSRNVRAAHTPQDLAALTRANVFASENILQKISKLVKN 1539 T 0.0033 DUF6076 pdb F Eukaryota T 6z1p 87 IC BM Q24E31_TETTS mS81 MIAKLFIRSSQRLIALNARFFSTNPANQFNNQETSSTYQNQRNNRREPSEFRRNNQERYQKREGEEQYRPRKESITWDEYFNLYATNKIHNISEAKFPYNFRGMQDFVPIKKEFDNMIFAKGENYADFCKQFDRRMVWFMKSLATNKDDDLPYSELNFLLQAKKGAELLRRNGYQINLIEDNEGFQKFGGKKGGQPFEITHIMGLNSNRTRNAGTDAYSIIRDLEEKGLIMFIGNQLKMDENGNYDFKLTRDDDKQMILRVHYKFATPYILQVTDSSGKVVSPPPESYIHTAVFENQLRLPPKFSRLDLHFLDWIKLYRIQNEWKLVDFDRFLSGNKLIYSEKEQRQLFKGEKDN 355 T 0.46 SID-1_RNA_chan pdbpercent F Eukaryota T 6z1p 88 JC BN Q22GF7_TETTS mS82 MAASQKVVNGIAKGLQEYVNPTKLAPFKKAVRDQMMEIEGLQAFEQGLYHNKDYENMIKQLVESRKAFRNSRSKTERQSIAKQQYDEWSKYVEIRKSQLTEDFQIPKNFQSQMDQVWGFVKNRKESTIHSSKMLDFHYELMNQFKFSIPIEPRLLVQMIHPHFGYLSNYPGNFTQEDILEVYKCKLVASMERVLGQDLLANEIAAYTYWKIYDKQAQGSFDLKTFGEFMKTFRFNLDGTAENFKQEFKFALQLHPGELSNDLQESDQLVRFDFYRYLFLERNL 283 T 0.0023 EF-hand_1 pdbpercent F Eukaryota T 6z1p 89 KC BO Q22N51_TETTS PARP alpha-helical domain-containing protein,mS83 DYKPIPFGVNVEDSVEYYNKQLESLEKHMPLNIFSCXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 142 T 9 DUF2102 pdbhh F Eukaryota T 6z1p 90 LC BP mS84,mS84 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXKVETLNEEDVAEKIIENLTDVKIIEKDSKRKVLKKTGVHKFKDIDSYIKETRSSRDKKK 100 T 0.00025 TraT pdbpssm F T 6z1p 91 MC BQ Q22UP3_TETTS mS85 MLRVVHNLGKSIKINLTSKNILRVSFSSDQKVSTEPESGLTFEQKAEIFERFSNSFVGIDRFKETQQTLKKIVEANYSQSAIKEELIQELKEVYGKNYEKILNLRFAVEYDGHKDGVAVGEFELFPKNLQDVNKFENYSKNGDLIKQLQQTTYISVDPKETHKYVVPKDSHFSLLLDEYIADEYVSELNDNQVCLFGFPLTCDESDITNLLNNEFKGNFTSSVIGEDILSLPAYVVLTFSSPKEAAEYKQKVNALQYTIEKRPIYATTFEDSRREHSTNRTLLVTGFKKNEYINDMLNLFSSFGSVMHFEIVEDPVHSKLPTTEQVIEYLKKTIKEGDDSVIYEITDFSDGAPSITEYPPFNPNTELMRDAKKVHEVKDEIELEKERQQQLAKRQLIPRIVLYSWDSESRVPEEYRMKNASEEQKKIIQSIENDLRTQYQNKQYLFVTYACTQQAQIAFHALNNLRNYEVTLKKSIEHYHCDTIHQTSVFKEIKQIKGDFKIKFNEQSLVKTEEEKVLAQTHQEMREKLLEQANSQQFTQELSKQLENEIIGKTKYGVEHQLHLKSDKLGRDFNSRVTQDDLNQLASQFQENQKKNLQTLEEVRKDEEELLNLYKAKVMYSDINKITHPYIEADKETIQKVEENYYQQQLKEYKLKQKEYMKEQQIREKWLEESKEMLEKKFVFGRNYKKKVIADKAGVDDVPDIKEPKEEEADQYYAPYNDYIQQKRYKKYLRYVDEMQRLYDGEYSEAMKNKIFVEGGKKTCDSDGNQFVTQNQGEVFNKILLSDEQFEMLKYYTSIADVLPNKRVQELSTMLEETPEETIYMMKQLKYPTKVFDRSKIPELDENSIPISNEDFVNDLNKYVSGLGQRYAVQKDARGDEKIVMYENTPHPVPLQALNVDEIQLLRDCLTTYGFDAEATEREIQYFIKHGDYSEEVLKIVGNEQTIDEESELEALINATGLTKAELESIMKLDLEKEGSNVLLSLQQQREELSLELSRATPQPKDLIKTNNTKLRNKDKQGRYKTSSFKLF 1032 T 0.24 Nab6_mRNP_bdg pdbhh F Eukaryota T 6z1p 93 OC BS A4VCP7_TETTS mS87 MIRSILKQVKGNLTKGNSFNAKLNEIPVRCFSSSTGEGNEGDAPKNQQEQQQQKDQQPQQQQQPLQNQGKNQKQFDNKRNFQNNQQGANADRNADKQKKNFTPFKQQSNNQNYRKREDGESDQQNQGGFRSNSQNQQNSTGFTSSQNQRNQPKKEVLSFNLKKAEDQSNRDSSNQQDNKQRPQRPQRENQNDQASEQSEGQSYQKSSTSSYGSEGMFLNQLFKSEQNKTEKQKGANQHQQLMKRIKSYEQNGNPSEQEMRMAIECYNSCGLYDKTIATFQTYKDNFVKGKQGVSLNETILNSVFESYLKNSSSKFNDVNEFFLIHFAQTKQLKLIYKDNIQNYISRVCIDPYLNLSQRIEFLSEFVNMFTNSQDADSLVTSSFNNIDLSSLSSLFGNADTQQYIQASKTLATLMNLSIQHNKGSFNQLGKNTEFNKVQFEIINTKKIFNNLLNLKQYEICREIVEALSRGNLLHSHTFTSKSSEQQKYIDYKDLIKFSLDFQVSVNYWIDVIAKKVDFESFTQLFSSAILELRMNQNPPKVFSIEQVYDFFYNISNYGALNPTQVYILMDISIYLKEYNLAIELFTHHQKYTNRRRDNFVYEKMIQVVNSINLRQQAGKSNKDHVLQAYRKLYTEAEQQTGKPIGFLNSKIYEIKSCILDQNHDSAYSIFNDRFLKESIANYQALKMKYFLLLLLEKQDERDLQYSEISINKFINGLAPKQLIQIWNDDRNDTYLAKQLANKENDYNEFVKRIEVEKYYSIIKRDINNGFYTIEDRRQNMDVMMNEYERYVIDNFEVLDPEYTLDGKLKGAVLDGILKIRKYEEVHLKKEERRKEKQSKKGEKESDSQTSQALIQHQKEMEDTQIEINKILGRPIDTPVNVPELYKKYDTEDKMNNKYIERYVKNLQQNEFNKEGRLYKMKFMNLDQYKEASQCYPELVEQAEILARPQIPSDVNLVFELMTWGAENQQPLAIQLGEIYCDLNGIPVPSSLVEKINKAIDPYNDNLDFINQITSAKRLIGDINRERVYQTYTHFSQKPKFAEQVNQLGEEDKYRDEATYVLQQEQAASRLTKYGIPKKYIMEQLKA 1086 T 2.2 FlgM unppercent F Eukaryota T 6z1p 94 PC BT mS88 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXAAAAXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 297 T 15000 Inhibitor_I36 pdbhh F F 6z1p 95 QC BU Q22EB6_TETTS mS89 MMNRNLFKLLQLSTKSVSFNCTKLNYKFATFPSKEQMRFQKNMGYNGFQPNIAFKDDLYFPLDNPVQRQGLEDLINHIKVNPNLAIEGRGLCTILYIIAREGKDEPFIFKELERHLYKFKENLSPRLSFGGLYASYKSNLASPYQVSFFEDEFTRNSQQINAYEAIEILQTMFENTTKVNEHKIQYFHQSVKPIIVSNFSKQVRPYTGNLLKLFIGLRNMNIYDEELHELILKYLPYRRGLNNVKDIAEVYETLCDYKEKGILKQNIDAHIEALEKKLTTKDDCRWRYNLKEKRFYTYDELIANRDNYTIKDQLNHKYRFSNPELIEKFNLVQSDKDAIKAELEARERSRELENLVLEMFELKNRGEVAQTEDKNTLKGTYENVIFVKEGEELEEEEGAEEIDNEPAEEVDEGLDFDLKSSNKPKVKKEKGQKQKNKNN 439 T 0.95 L31 pdb F Eukaryota T 6z1p 96 RC BV Q24HL0_TETTS mS90 TNAKEYYDYLLRFTPQDERGYIKFHPGQFSKMVKIASTEEDIKSIRDAYYNFIGHKQKFTNAQVDRFLEKAAELKAAPLINEILINHNFLMYYPHSSVLHKLAEHYIQENNAEGLNELTRIYSNTHFLKLEDRTLELVSNYAIEQKNQGIILNVAQIAYRKVLTSINENTINNILTGIARQKIANPEPKEGTQNKVETKFLAHLKKCSQSYHTIIGRAYLALANXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 310 T 0.079 YBD unppssm F Eukaryota T 6z1p 97 SC BW mS91 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 285 F F F 6z1p 98 TC BX Q951B8_TETTH Ribosomal protein S3 MGLKSLPMLNKSGISMYWHNIWDSIKLYKKYSLSFLFLNEVINHFLNENLYYYCIMKIRPTDPRLKGFRGNKSININKIKKSWNMRHFYLGKILFLKYQGWVLVLINYYCSRRNKLYINYKSFKAFKKIAKSFRQGVTSYVYKMDKYKFKF 151 T 0.058 Rib_hydrolayse unppercent F Eukaryota T 6z2d 1 A A B2UR60_AKKM8 O-glycan protease EVTVPDALKDRIALKKTARQLNIVYFLGSDTEPVPDYERRLSELLLYLQQFYGKEMQRHGYGARSFGLDIKSPGRVNIIEYKAKNPAAHYPYENGGGWKAAQELDEFFKAHPDRKKSQHTLIIMPTWNDEKNGPDNPGGVPFYGMGRNCFALDYPAFDIKHLGQKTREGRLLTKWYGGMAHELGHGLNLPHNHQTASDGKKYGTALMGSGNYTFGTSPTFLTPASCALLDACEVFSVTPSQQFYEGKPEVEVGDVAISFKGDQILVSGNYKSPQTVKALNVYIQDPPYAVNQDYDAVSFSRRLGKKSGKFSMKIDKKELEGLNNNEFRISLMFILANGLHMQKHFTFHWDALQDYRDGSKSGSGHHHHHH 370 T 0.00025 Metallopep pdbpercent F Bacteria T 6z2i 1 A A de novo designed TIM barrel DeNovoTIM6 MDKDEAWKQVEILRRLGAKQIAYRSDDWRDLQEALKKGGDILIVDATDKDEAWKQVEILRRLGAKQIAYRSDDWRDLQEALKKGGDILIVDATDKDEAWKQVEILRRLGAKQIAYRSDDWRDLQEALKKGGDILIVDATDKDEAWKQVEILRRLGAKQIAYRSDDWRDLQEALKKGGDILIVDATGLEHHHHHH 194 T 0.00021 NanE pdbhh F T 6z2o 1 A A B2UR60_AKKM8 O-glycan protease MEVTVPDALKDRIALKKTARQLNIVYFLGSDTEPVPDYERRLSELLLYLQQFYGKEMQRHGYGARSFGLDIKSPGRVNIIEYKAKNPAAHYPYENGGGWKAAQELDEFFKAHPDRKKSQHTLIIMPTWNDEKNGPDNPGGVPFYGMGRNCFALDYPAFDIKHLGQKTREGRLLTKWYGGMAHELGHGLNLPHNHQTASDGKKYGTALMGSGNYTFGTSPTFLTPASCALLDACEVFSVTPSQQFYEGKPEVEVGDVAISFKGDQILVSGNYKSPQTVKALNVYIQDPPYAVNQDYDAVSFSRRLGKKSGKFSMKIDKKELEGLNNNEFRISLMFILANGLHMQKHFTFHWDALQDYRDGSKSGSGHHHHHH 371 T 0.00025 Metallopep pdbpercent F Bacteria T 6z2p 1 A A B2UR60_AKKM8 O-glycan protease MEVTVPDALKDRIALKKTARQLNIVYFLGSDTEPVPDYERRLSELLLYLQQFYGKEMQRHGYGARSFGLDIKSPGRVNIIEYKAKNPAAHYPYENGGGWKAAQELDEFFKAHPDRKKSQHTLIIMPTWNDEKNGPDNPGGVPFYGMGRNCFALDYPAFDIKHLGQKTREGRLLTKWYGGMAAALGHGLNLPHNHQTASDGKKYGTALMGSGNYTFGTSPTFLTPASCALLDACEVFSVTPSQQFYEGKPEVEVGDVAISFKGDQILVSGNYKSPQTVKALNVYIQDPPYAVNQDYDAVSFSRRLGKKSGKFSMKIDKKELEGLNNNEFRISLMFILANGLHMQKHFTFHWDALQDYRDGSKSGSGHHHHHH 371 T 0.0033 Metallopep unppercent F Bacteria T 6z2p 2 B C DROS_DROME Glycodrosocin GKPRPYSPRPTSHPRPIRV 19 T 0.0059 DIM unppercent F Eukaryota T 6z2q 1 A A B2UR60_AKKM8 O-glycan protease MEVTVPDALKDRIALKKTARQLNIVYFLGSDTEPVPDYERRLSELLLYLQQFYGKEMQRHGYGARSFGLDIKSPGRVNIIEYKAKNPAAHYPYENGGGWKAAQELDEFFKAHPDRKKSQHTLIIMPTWNDEKNGPDNPGGVPFYGMGRNCFALDYPAFDIKHLGQKTREGRLLTKWYGGMAHELGHGLNLPHNHQTASDGKKYGTALMGSGNYTFGTSPTFLTPASCALLDACEVFSVTPSQQFYEGKPEVEVGDVAISFKGDQILVSGNYKSPQTVKALNVYIQDPPYAVNQDYDAVSFSRRLGKKSGKFSMKIDKKELEGLNNNEFRISLMFILANGLHMQKHFTFHWDALQDYRDGSKSGSGHHHHHH 371 T 0.00025 Metallopep pdbpercent F Bacteria T 6z2q 2 B D DROS_DROME Glycodrosocin GKPRPYSPRPTSHPRPIRV 19 T 0.0059 DIM unppercent F Eukaryota T 6z3f 1 A P Chains: P CDIHVXWEWDCFEKLX 16 T 8.9 NIPSNAP pdbhh F T 6z3r 4 D E RENT1_HUMAN ATP-DEPENDENT HELICASE RENT1,NONSENSE MRNA REDUCING FACTOR 1,NORF1,UP-FRAMESHIFT SUPPRESSOR 1 HOMOLOG,HUPF1 QPELSQDSYLG 11 T 0.61 DUF4629 pdbhh F Eukaryota T 6z3t 3 C C Protein transport protein Sec61 subunit beta XXXXXXXXXXXXXXXXXXXX 20 F F F 6z3u 3 C,F C,F G0SF48_CHATD RING-type domain-containing protein GPYDPFGGMEFVPSRYRVREELNHPSLDKYRIDQQHITGGYSFLDYISRAMFEAFAGLAVFIEDEKEAG 69 T 3.1 MotCF pdbhh F Eukaryota T 6z3w 9 I I EMC10_HUMAN HEMATOPOIETIC SIGNAL PEPTIDE-CONTAINING MEMBRANE DOMAIN-CONTAINING PROTEIN 1 MAAASAGATRLLLLLLMAVAAPSRARGSGCRAGTGARGAGAEGREGEACGTVGLLLEHSFEIDDSANFRKRGSLLWNQQDGTLSLSQRQLSEEERGRLRDVAALNGLYRVRIPRRPGALDGLEAGGYVSSFVPACSLVESHLSDQLTLHVDVAGNVVGVSVVTHPGGCRGHEVEDVDLELFNTSVQLQPPTTAPGPETAAFIERLEMEQAQKAKNPQEQKSFFAKYWMYIIPVVLFLMMSGAPDTGGQGGGGGGGGGGGSGR 262 T 0.0033 2OG-FeII_Oxy_5 pdbpssm F Eukaryota T 6z41 1 A A B3PJ79_CELJU Carbohydrate binding protein, putative, cpb33A MGNCISPVYVDGSSYANNALVQNNGSEYRCLVGGWCTVGGPYAPGTGWAWANAWELVRSCQAHHHHHH 68 T 0.19 P2X_receptor pdbpercent F Bacteria T 6z4x 3 C,F C,F G0SF48_CHATD RING-type domain-containing protein GPYDPFGGMEFVPSRYRVREELNHPSLDKYRIDQQHITGGYSFLDYISRAMFEAFAGLAVFIEDEKEAG 69 T 3.1 MotCF pdbhh F Eukaryota T 6z5s 1 A W Q6N1K3_RHOPA Light harvesting complex 1 Protein W MMLLLVLTAIAFVATAVVARVLAASAPEGKLYCQAAGAASMVVGPFITLVAAFVLGKAGIGGEVLDATAMLRVAALPAFGTLFVGPVVFWFFRRQRRTVAAA 102 T 0.54 DUF4229 pdbhh F Bacteria T 6z5y 1 A,B A,B D0N2F7_PHYIT Lytic Polysaccharide Monooxygenase HGYIAKPAPSWKASKTNNWVVEIEPQWKGGWDESKGDEGLLATFKELAPKNNFKDVRSLMDGNPVFGEECGFTDPKGKPSEPPSDGTATFSRGIVHAGPCEIWLDDKMVLQNDDCQSAYGDGTQQTIAVFKPVDYSSCAAGGCMLRFYWLALQRLKGKTVWQAYKNCIPLTGWSHPQFEK 180 T 0.0049 PA14 pdb F Eukaryota T 6z6e 1 A,B,C A,B,C Q9MBW4_BPHK7 Terminase small subunit ADKRIRSDSSAAAVQAMKNAAVDTIDPPSHAGLEKKAEPFWHDNIRSKALDSWTPADLLAAVELANNQLYITVLRKDLRKEERIRGEERDEGLIKDLRKQIVELQRTILAQRRDLQIHSHATNGESRDQKKRNQNDRDARNTKNEHQDQDDNLIAFPKHG 160 T 0.011 Terminase_4 unppercent T Viruses T 6z6j 2 B C5 LSO2_YEAST LATE-ANNOTATED SMALL OPEN READING FRAME 2 MGKRFSESAAKKAAGLARKRDQAHAKQRAQMEQLEAEEASKWEQGSRKENAKKLEEEQKRQEKARAKKERDALLTAEEEQLGKGGKGKRKMK 92 T 4 F-protein pdb F Eukaryota T 6z6k 2 B C5 LSO2_YEAST LATE-ANNOTATED SMALL OPEN READING FRAME 2 GKRFSESAAKKAAGLARKRDQAHAKQRAQMEQLEAEEASKWEQGSRKENAKKLEEEQKRQEKARAKKERDALLTAEEEQLGKGGKGKRKMK 91 T 3.8 F-protein pdb F Eukaryota T 6z7n 8 FA X Fiber protein FNPVYPY 7 T 0.55 DUF3463 pdbhh F F 6z7n 9 GA,HA,JA W,V,Z Unknown XXXXXXXXXX 10 F F F 6z7n 10 IA Y Unknown XXXXXX 6 F F F 6z8d 1 A,B A,B CAPSD_HPBVH Capsid protein precursor MKQNDTKKTTQRRNSKKYSSKTNRGTKRAPRDQEVGTGAQESTRNDVAWYARYPHILEEATRLPFAYPIGQYYDTGYSVASATEWSKYVDTSLTIPGVMCVNFTPTPGESYNKNSPINIAAQNVYTYVRHMNSGHANYEQADLMMYLLAMDSLYIFHSYVRKILAISKLYTPVNKYFPRALLVALGVDPEDVFANQAQWEYFVNMVAYRAGAFAAPASMTYYERHAWMSNGLYVDQDVTRAQIYMFKPTMLWKYENLGTTGTKLVPLMMPKAGDNRKLVDFQVLFNNLVSTMLGDEDFGIMSGDVFKAFGADGLVKLLAVDSTTMTLPTYDPLILAQIHSARAVGAPILETSTLTGFPGRQWQITQNPDVNNGAIIFHPSFGYDGQDHEELSFRAMCSNMILNLPGEAHSAEMIIEATRLATMFQVKAVPAGDTSKPVLYLPNGFGTEVVNDYTMISVDKATPHDLTIHTFFNNILVPNAKENYVANLELLNNIIQFDWAPQLYLTYGIAQESFGPFAQLNDWTILTGETLARMHEVCVTSMFDVPQMGFNK 552 T 55 DUF4549 pdbhh T Viruses T 6z8e 1 A,B A,B CAPSD_HPBVH Capsid protein precursor MRGSHHHHHHGMASMTGGQQMGRDLYDDDDKDRWGSMKQNDTKKTTQRRNSKKYSSKTNRGTKRAPRDQEVGTGAQESTRNDVAWYARYPHILEEATRLPFAYPIGQYYDTGYSVASATEWSKYVDTSLTIPGVMCVNFTPTPGESYNKNSPINIAAQNVYTYVRHMNSGHANYEQADLMMYLLAMDSLYIFHSYVRKILAISKLYTPVNKYFPRALLVALGVDPEDVFANQAQWEYFVNMVAYRAGAFAAPASMTYYERHAWMSNGLYVDQDVTRAQIYMFKPTMLWKYENLGTTGTKLVPLMMPKAGDNRKLVDFQVLFNNLVSTMLGDEDFGIMSGDVFKAFGADGLVKLLAVDSTTMTLPTYDPLILAQIHSARAVGAPILETSTLTGFPGRQWQITQNPDVNNGAIIFHPSFGYDGQDHEELSFRAMCSNMILNLPGEAHSAEMIIEATRLATMFQVKAVPAGDTSKPVLYLPNGFGTEVVNDYTMISVDKATPHDLTIHTFFNNILVPNAKENYVANLELLNNIIQFDWAPQLYLTYGIAQESFGPFAQLNDWTILTGETLARMHEVCVTSMFDVPQMGFNK 588 T 35 Protamine_3 pdbhh T Viruses T 6z8f 1 A,B A,B CAPSD_HPBVH Capsid protein precursor ESTRNDVAWYARYPHILEEATRLPFAYPIGQYYDTGYSVASATEWSKYVDTSLTIPGVMCVNFTPTPGESYNKNSPINIAAQNVYTYVRHMNSGHANYEQADLMMYLLAMDSLYIFHSYVRKILAISKLYTPVNKYFPRALLVALGVDPEDVFANQAQWEYFVNMVAYRAGAFAAPASMTYYERHAWMSNGLYVDQDVTRAQIYMFKPTMLWKYENLGTTGTKLVPLMMPKAGDNRKLVDFQVLFNNLVSTMLGDEDFGIMSGDVFKAFGADGLVKLLAVDSTTMTLPTYDPLILAQIHSARAVGAPILETSTLTGFPGRQWQITQNPDVNNGAIIFHPSFGYDGQDHEELSFRAMCSNMILNLPGEAHSAEMIIEATRLATMFQVKAVPAGDTSKPVLYLPNGFGTEVVNDYTMISVDKATPHDLTIHTFFNNILVPNAKENYVANLELLNNIIQFDWAPQLYLTYGIAQESFGPFAQLNDWTILTGETLARMHEVCVTSMFDVPQMGFNK 512 T 30 DUF4549 pdbhh T Viruses T 6z9l 2 B,C F,G Poly-alanine peptide AAAAAAAAA 9 T 160 FAD_oxidored pdbhh F F 6z9v 3 C,F C,F ILE-ILE-GLY-TRP-MET-TRP-ILE-PRO-VAL IIGWMWIPV 9 T 0.94 Acyl-CoA_dh_C pdbhh F T 6z9x 3 C,F C,F LEU-LEU-SER-TUR-PHE-GLY-THR-PRO-THR LLSXFGTPT 9 T 1.9 DUF6120 pdbhh F T 6zbr 1 A P Chains: P CDIHVXWEWKCFEEL 15 T 2.9 Metal_hydrol pdbhh F T 6zbt 2 E,F,G,H E,F,G,H NED4L_HUMAN HECT-TYPE E3 UBIQUITIN TRANSFERASE NED4L,NEDD4.2,NEDD4-2 LRSCSVTDAV 10 T 6.9 YodL pdbhh F Eukaryota T 6zbu 2 C,D,G,J,K,L C,D,G,K,L,H NCOR1_HUMAN N-COR1 GITTIKEMGRSIHEIPR 17 T 0.61 DUF211 pdbhh F Eukaryota T 6zbx 2 C E UNK-UNK-UNK-UNK XXXX 4 F F F 6zc9 2 E,F,G,H E,F,G,H NED4L_HUMAN HECT-TYPE E3 UBIQUITIN TRANSFERASE NED4L,NEDD4.2,NEDD4-2 PRSLSSPTVT 10 T 39 ASF1_hist_chap pdbhh F Eukaryota T 6zcd 1 A P Derived from V114 peptide CDIHVXWEWKCFEDL 15 T 3.5 WWE pdbhh F T 6zce 43 QA j RNA recognition motif (unknown) AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 77 T 14000 zf-H2C2_2 pdbhh F F 6zcj 2 B P LCP2_HUMAN SLP76pS376 FPQSASLPPY 10 F F Eukaryota T 6zd3 1 A,B B,A YTH domain containing 1 MHHHHHHSSGRENLYFQGTSKLKYVLQDARFFLIKSNNHENVSLAKAKGVWSTLPVNEKKLNLAFRSARSVILIFSVRESGKFQGFARLSSESHHGGSPIHWVLPAGMSAKALGGVFKIDWICRRELPFTKSAHLTNPWNEHKPVKIGRDGQEIELECGTQLCLLFPPDESIDLYQVIHKMRH 183 T 8.6E-31 YTH pdbhh F T 6zd4 1 A,B A,B YTH domain containing 1 MHHHHHHSSGRENLYFQGTSKLKYVLQDARFFLIKSNNHENVSLAKAKGVWATLPVNEKKLNLAFRSARSVILIFSVRESGKFQGFARLSSESHHGGSPIHWVLPAGMSAKMLGGVFKIDWICRRELPFTKSAHLTNPWNEHKPVKIGRDGQEIELECGTQLCLLFPPDESIDLYQVIHKMRH 183 T 3.8999999999999997E-31 YTH pdbhh F T 6zd5 1 A,B A,B YTH domain containing 1 MHHHHHHSSGRENLYFQGTSKLKYVLQDARFFLIKSNNHENVSLAKAKGVWATLPVNEKKLNLAFRSARSVILIFSVRESGKFQGFARLSSESHHGGSPIHWVLPAGMSAKMLGGVFKIDWICRRELPFTKSAHLTNPWNEHKPVKIGRDGQEIELECGTQLCLLFPPDESIDLYQVIHKMRH 183 T 3.8999999999999997E-31 YTH pdbhh F T 6zda 1 A,B A,B YTHDC1 MHHHHHHSSGRENLYFQGTSKLKYVLQDARFFLIKSNNHENVSLAKAKGVWSTLPVNEKKLNLAFRSARSVILIFSVRESGKFQGFARLSSESHHGGSPIHWVLPAGMSAKALGGVFKIDWICRRELPFTKSAHLTNPWNEHKPVKIGRDGQEIELECGTQLCLLFPPDESIDLYQVIHKMRH 183 T 8.6E-31 YTH pdbhh F T 6zdw 1 A,B AAA,BBB A0A162MUP0_MUCCL DRBM domain-containing protein MTDTDTVEQFIHTIFARVTDDHGRPVDITAALPLLKQILTGYTQEVAEHKFNYIGESAVQFAMHLILADHFSKYENGCLSAIAKKYTVPLQLYKLIGKQIHLKEYVRPVYLKETLDMIVGILFRCYGITAVYKFIQEEFILLVNQDINNANSPKKPSSPSLSTNQADNPVKLLHELIQAKSGTLEAEAHETEDKKWEVKIVAKLNEKALPFSHARTNASKQKAKTEASRDILTYFTNYPDVCQHLQVPVEGEVEIHVLPISENDYCHLFAET 272 T 7.5E-05 Ribonucleas_3_3 unphh F Eukaryota T 6zef 2 C,D C,D PHAR1_HUMAN Phosphatase and actin regulator GPLGSRKILIRFSDYVEVADAQDYDRRADKPWTRLTAADKAAIRKELNEFKSTEMEVHELSRHLTRFHRP 70 T 2.3 DUF6344 pdbhh F Eukaryota T 6zeg 2 B,D C,D PHAR1_HUMAN Phosphatase and actin regulator GPLGSRKILIRFSDYVEVADAQDYDRRADKPWTRLTAADKAAIRKELNEFKSTEMEVHELSRHLTRFHRP 70 T 2.3 DUF6344 pdbhh F Eukaryota T 6zeh 2 B,D C,D PHAR1_HUMAN Phosphatase and actin regulator GPLGSRKILIRFSDYVEVADAQDYDRRADKPWTRLTAADKAAIRKELNEFKSTEMEVHELSRHLTRFHRP 70 T 2.3 DUF6344 pdbhh F Eukaryota T 6zei 2 B,D C,D PHAR1_HUMAN Phosphatase and actin regulator GPLGSRKILIRFSDYVEVADAQDYDRRADKPWTRLTAADKAAIRKELNEFKSTEMEVHELSRHLTRFHRP 70 T 2.3 DUF6344 pdbhh F Eukaryota T 6zfm 2 C,F C,F peptide 12 YMWDGWYM 8 T 0.096 BRCA-2_OB1 pdbhh F F 6zga 4 D,H D,H PRO-PRO-PRO PPP 3 T 62 SK_channel pdbhh F F 6zh1 1 A A W5SB08_BORHE FACTOR H-BINDING PROTEIN A MAHHHHHHVDDDDKDLFNKNKKLDADLLKTLDNLLKTLDNNQKQALIYFKDKLQDKKYLNDLMEQQKSFLDNLQKKKEDPDLQDRLKKTLNSEYDESQFNKLLNELGNAKAKQFLQQLHIMLQSIKDGTLTSFSSSNFNDLQNLEQKKERALQYINGKLYVEYYFYINGISNADNFFETIMEYLKT 186 T 0.0047 FUT8_N_cat unppssm F Bacteria T 6zhi 2 E,F,G,H E,F,G,H ASN-ARG-LEU-LEU-LEU-THR-GLY NRLLLTG 7 T 10 PgaPase_1 pdbhh F F 6zhx 7 K,L K,L CHD1L_HUMAN AMPLIFIED IN LIVER CANCER PROTEIN 1 EKASQEGRSLRNKGSVLIPGLVEGSTKRKRVLSPEEK 37 T 0.47 ATP-synt_E unp F Eukaryota T 6zhy 7 I K CHD1L_HUMAN AMPLIFIED IN LIVER CANCER PROTEIN 1 EKASQEGRSLRNKGSVLIPGLVEGSTKRKRVLSPEEK 37 T 0.47 ATP-synt_E unp F Eukaryota T 6zj3 80 BC Lo Ribosomal protein eLEgr1 MAEVELVSVPECKAQTVDKHVLWSCINFGTSNVALIDPYHPAHRGARKYINQFHSGKVPKTAAKAKEAKAEE 72 T 7.9 MRP_L53 pdbhh F T 6zj3 97 SC L6 Ribosomal protein eLEgr2 MGGDDFEKKPLPDCLKELHEKQQAKLAKSKENYTPPKYNTPRKTTRERLNRRAQIKAALQRKKDKLKAE 69 T 14 NepR pdbhh F T 6zj3 98 TC L7 Ribosomal protein eLEgr3 MPLKNNCFRRVYHSNWEYLLSLEKEADAEPKQKALRYKQEKKQQFREKGLKLAAAKTAEAAKSA 64 T 17 Phage_antiter_Q pdbhh F T 6zlm 2 B,C,D,E,F,G,H,I,J,K,L,M XC,K,Q,X,DA,JA,PA,VA,BB,HB,NB,TB Pyruvate dehydrogenase X component XXXXXXXXXXXXX 13 F F F 6zm5 86 MC v Nascent polypeptide AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 32 T 2100 Chorion_S16 pdbhh F F 6zm9 1 A A Chains: A SLMERLGGGGFSARIFVGLNVGDKPTYTIEDVVKDTIAIRKRQGILPDASFVAQRGVYTEQRSGQLVTENSVQIIIIDLEGLSKEDFTGKVQALGKELREDFKQESVIVEIQERGIVQDVYSITAEWYEEGPMRPLRVDLQPSLIS 146 T 7.1E-05 DUF3574 pdbhh F T 6zmg 1 A A Chains: A SLMERLGGGGFSARIFVGLNVGDKPTYTIEDVVKDTIAIRKRQGILPDASFVAQRGVYTEQRSGQLVTENSVQIIIIDLEGLSKEDFTGKVQALGKELREDFKQESVIVEIQERGIVQDVYSITAEWYEEGPMRPLRVDLQPSLIS 146 T 7.1E-05 DUF3574 pdbhh F T 6zmp 2 C,D C,D CMC-MET-ASP-GLU-LEU MDEL 4 T 93 DDE_Tnp_IS1 pdbhh F F 6zn3 1 A,D,G,J,M A,D,G,J,M Q8IJM4_PLAF7 Myosin essential light chain ELC SMASDMEEKFREAFILFSSCSDHIEMYKFFELMNSFGIILTNDEKAALPNDINMDYWLNFAKKHYNYEQPFKHINNVNEQNTNVQIKIDNFLGIMKALDTRLTESDLNILLQITNPENKSTLNLKTVSQKLTESI 135 T 0.024 Na_Ca_ex_C unppercent F Eukaryota T 6zn3 3 C,F,I,L,O C,F,I,L,O MYOA_PLAF7 PFM-A SVEWENCVSVIEAAILKHKYKQKVNKNIPSLLRVQAHIRKKMV 43 T 0.00021 IQ unppercent F Eukaryota T 6znb 1 A AA Phage SAM lyase Svi3-3 SLMERLGGGGFSARIFVGLNVGDKPTYTIEDVVKDTIAIRKRQGILPDASFVAQRGVYTEQRSGQLVTENSVQIIIIDLEGLSKEDFTGKVQALGKELREDFKQESVIVEIQERGIVQDVYSITAEWYEEGPMRPLRVDLQPSLIS 146 T 7.1E-05 DUF3574 pdbhh F T 6znl 10 R Y A0A4X1TB62_PIG DYNACTIN SUBUNIT 4 MASLLQSERVLYLVQGEKKVRAPLSQLYFCRYCSELRSLECVSHEVDSHYCPSCLENMPSAEAKLKKNRCANCFDCPGCMHTLSTRATSISTQLPDDPAKTAVKKAYYLACGFCRWTSRDVGMADKSVASGGWQEPDHPHTQRMNKLIEYYQQLAQKEKVERDRKKLARRRNYMPLAFSQHTIHVVDKYGLGTRLQRPRAGTTITALAGLSLKEGEDQKEIKIEPAQAVDEVEPLPEDYYTRPVNLTEVTTLQQRLLQPDFQPICASQLYPRHKHLLIKRSLRCRQCEHNLSKPEFNPTSIKFKIQLVAVNYIPEVRIMSIPNLRYMKESQVLLTLTNPVENLTHVTLLECEEGDPDDTNSTAKVSVPPTELVLAGKDAAAEYDELAEPQDFPDDPDVVAFRKANKVGVFIKVTPQREEGDVTVCFKLKHDFKNLAAPIRPVEEADPGAEVSWLTQHVELSLGPLLP 467 T 3.9000000000000004E-28 Dynactin_p62 pdbpercent F Eukaryota T 6znm 8 I Y A0A4X1TB62_PIG DYNACTIN SUBUNIT 4 MASLLQSERVLYLVQGEKKVRAPLSQLYFCRYCSELRSLECVSHEVDSHYCPSCLENMPSAEAKLKKNRCANCFDCPGCMHTLSTRATSISTQLPDDPAKTAVKKAYYLACGFCRWTSRDVGMADKSVASGGWQEPDHPHTQRMNKLIEYYQQLAQKEKVERDRKKLARRRNYMPLAFSQHTIHVVDKYGLGTRLQRPRAGTTITALAGLSLKEGEDQKEIKIEPAQAVDEVEPLPEDYYTRPVNLTEVTTLQQRLLQPDFQPICASQLYPRHKHLLIKRSLRCRQCEHNLSKPEFNPTSIKFKIQLVAVNYIPEVRIMSIPNLRYMKESQVLLTLTNPVENLTHVTLLECEEGDPDDTNSTAKVSVPPTELVLAGKDAAAEYDELAEPQDFPDDPDVVAFRKANKVGVFIKVTPQREEGDVTVCFKLKHDFKNLAAPIRPVEEADPGAEVSWLTQHVELSLGPLLP 467 T 3.9000000000000004E-28 Dynactin_p62 pdbpercent F Eukaryota T 6znn 5 F,G S,T putative Hook3 coiled coil XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 68 F F F 6znn 8 J X Hook3 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 93 F F F 6znn 9 K Y A0A4X1TB62_PIG DYNACTIN SUBUNIT 4 MASLLQSERVLYLVQGEKKVRAPLSQLYFCRYCSELRSLECVSHEVDSHYCPSCLENMPSAEAKLKKNRCANCFDCPGCMHTLSTRATSISTQLPDDPAKTAVKKAYYLACGFCRWTSRDVGMADKSVASGGWQEPDHPHTQRMNKLIEYYQQLAQKEKVERDRKKLARRRNYMPLAFSQHTIHVVDKYGLGTRLQRPRAGTTITALAGLSLKEGEDQKEIKIEPAQAVDEVEPLPEDYYTRPVNLTEVTTLQQRLLQPDFQPICASQLYPRHKHLLIKRSLRCRQCEHNLSKPEFNPTSIKFKIQLVAVNYIPEVRIMSIPNLRYMKESQVLLTLTNPVENLTHVTLLECEEGDPDDTNSTAKVSVPPTELVLAGKDAAAEYDELAEPQDFPDDPDVVAFRKANKVGVFIKVTPQREEGDVTVCFKLKHDFKNLAAPIRPVEEADPGAEVSWLTQHVELSLGPLLP 467 T 3.9000000000000004E-28 Dynactin_p62 pdbpercent F Eukaryota T 6znn 10 L x Hook3 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 92 F F F 6zno 7 H Y A0A4X1TB62_PIG DYNACTIN SUBUNIT 4 MASLLQSERVLYLVQGEKKVRAPLSQLYFCRYCSELRSLECVSHEVDSHYCPSCLENMPSAEAKLKKNRCANCFDCPGCMHTLSTRATSISTQLPDDPAKTAVKKAYYLACGFCRWTSRDVGMADKSVASGGWQEPDHPHTQRMNKLIEYYQQLAQKEKVERDRKKLARRRNYMPLAFSQHTIHVVDKYGLGTRLQRPRAGTTITALAGLSLKEGEDQKEIKIEPAQAVDEVEPLPEDYYTRPVNLTEVTTLQQRLLQPDFQPICASQLYPRHKHLLIKRSLRCRQCEHNLSKPEFNPTSIKFKIQLVAVNYIPEVRIMSIPNLRYMKESQVLLTLTNPVENLTHVTLLECEEGDPDDTNSTAKVSVPPTELVLAGKDAAAEYDELAEPQDFPDDPDVVAFRKANKVGVFIKVTPQREEGDVTVCFKLKHDFKNLAAPIRPVEEADPGAEVSWLTQHVELSLGPLLP 467 T 3.9000000000000004E-28 Dynactin_p62 pdbpercent F Eukaryota T 6zno 8 I Z Dynactin subunit 1 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 189 F F F 6zno 9 J z Dynactin subunit 1 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 188 F F F 6zo4 1 A,B 5,6 BICD2 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 145 F F F 6zo4 8 J Y A0A4X1TB62_PIG DYNACTIN SUBUNIT 4 MASLLQSERVLYLVQGEKKVRAPLSQLYFCRYCSELRSLECVSHEVDSHYCPSCLENMPSAEAKLKKNRCANCFDCPGCMHTLSTRATSISTQLPDDPAKTAVKKAYYLACGFCRWTSRDVGMADKSVASGGWQEPDHPHTQRMNKLIEYYQQLAQKEKVERDRKKLARRRNYMPLAFSQHTIHVVDKYGLGTRLQRPRAGTTITALAGLSLKEGEDQKEIKIEPAQAVDEVEPLPEDYYTRPVNLTEVTTLQQRLLQPDFQPICASQLYPRHKHLLIKRSLRCRQCEHNLSKPEFNPTSIKFKIQLVAVNYIPEVRIMSIPNLRYMKESQVLLTLTNPVENLTHVTLLECEEGDPDDTNSTAKVSVPPTELVLAGKDAAAEYDELAEPQDFPDDPDVVAFRKANKVGVFIKVTPQREEGDVTVCFKLKHDFKNLAAPIRPVEEADPGAEVSWLTQHVELSLGPLLP 467 T 3.9000000000000004E-28 Dynactin_p62 pdbpercent F Eukaryota T 6zon 47 UA Y Unknown factor XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 78 F F F 6zop 1 A A A0DFJ7_PARTE DDE_Tnp_1_7 domain-containing protein GPLGSPEFSYFAKIQPHTFIEGEEIVKCSECGNETKVFCQECTILKAEVVGLCHEKDTIKCQRFHEFMDFELDKNKEVIDKRKG 84 T 0.011 UPF0167 pdb F Eukaryota T 6zp4 48 VA X Unknown factor AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 78 T 14000 zf-H2C2_2 pdbhh F F 6zpj 1 A,B A,B E9AN40_LEIMU LEISHMANIA MEXICANA KKT4 GSTANKLTEAQRRIAELEKELQRTTQRVDQLSDVVQQQKDELQAAKDRHALEMEETRHAYNAVIHRKDEVQEEALRQLLKSRQLMVSAARYEAVVAAKKLHAQEFELGAPAGRQACGRIMLKSNRK 126 T 0.0076 SEN1_N pdb F Eukaryota T 6zpm 1 A,B A,B A0A2V2WCI2_TRYCR Trypanosoma cruzi KKT4 117-218 GSSLQRYEKLVKECRRLEEELEQKTHEASDASQRVRQLERETTRLMRRVEQLVSAVEGQKQKLDETEAKHKLELAEIENRHELEIQSKMSSHEEALRRLMDARR 104 T 0.0004 AAA_13 pdbpssm F Eukaryota T 6zpp 1 A A A0A151GCU7_9HYPO VIRULENCE FACTOR SRLSNAFVLATTASAAAVPSPALPADDILLAINQSLRLVDSRAAMLVSQVRHGAINNVGSLADSYHELIFSLRGAVRAVDDVWRPLPKDAPMRIVESLRPFQKIPASLRSALKERLDAIAERPGGCQAVDDNNRQLGLDFDRLYWEIASSSSFSAIHETVSSQQKQFETAMRELTDEFSSRCLRRAQASA 190 T 0.2 FliT pdb F Eukaryota T 6zqa 39 PA JN FAF1_YEAST FORTY S ASSEMBLY FACTOR MTLDDDDYIKQMELQRKAFESQFGSLESMGFEDKTKNIRTEVDTRDSSGDEIDNSDHGSDFKDGTIESSNSSDEDSGNETAEENNQDSKPKTQPKVIRFNGPSDVYVPPSKKTQKLLRSGKTLTQINKKLESTEAKEEKEDETLEAENLQNDLELQQFLRESHLLSAFNNGGSGSTNSGVSLTLQSMGGGNDDGIVYQDDQVIGKARSRTLEMRLNRLSRVNGHQDKINKLEKVPMHIRRGMIDKHVKRIKKYEQEAAEGGIVLSKVKKGQFRKIESTYKKDIERRIGGSIKARDKEKATKRERGLKISSVGRSTRNGLIVSKRDIARISGGERSGKFNGKKKSRR 346 T 4.1E-08 DUF4602 unppssm F Eukaryota T 6zqb 44 VA JN FAF1_YEAST FORTY S ASSEMBLY FACTOR MTLDDDDYIKQMELQRKAFESQFGSLESMGFEDKTKNIRTEVDTRDSSGDEIDNSDHGSDFKDGTIESSNSSDEDSGNETAEENNQDSKPKTQPKVIRFNGPSDVYVPPSKKTQKLLRSGKTLTQINKKLESTEAKEEKEDETLEAENLQNDLELQQFLRESHLLSAFNNGGSGSTNSGVSLTLQSMGGGNDDGIVYQDDQVIGKARSRTLEMRLNRLSRVNGHQDKINKLEKVPMHIRRGMIDKHVKRIKKYEQEAAEGGIVLSKVKKGQFRKIESTYKKDIERRIGGSIKARDKEKATKRERGLKISSVGRSTRNGLIVSKRDIARISGGERSGKFNGKKKSRR 346 T 4.1E-08 DUF4602 unppssm F Eukaryota T 6zqc 45 WA JN FAF1_YEAST FORTY S ASSEMBLY FACTOR MTLDDDDYIKQMELQRKAFESQFGSLESMGFEDKTKNIRTEVDTRDSSGDEIDNSDHGSDFKDGTIESSNSSDEDSGNETAEENNQDSKPKTQPKVIRFNGPSDVYVPPSKKTQKLLRSGKTLTQINKKLESTEAKEEKEDETLEAENLQNDLELQQFLRESHLLSAFNNGGSGSTNSGVSLTLQSMGGGNDDGIVYQDDQVIGKARSRTLEMRLNRLSRVNGHQDKINKLEKVPMHIRRGMIDKHVKRIKKYEQEAAEGGIVLSKVKKGQFRKIESTYKKDIERRIGGSIKARDKEKATKRERGLKISSVGRSTRNGLIVSKRDIARISGGERSGKFNGKKKSRR 346 T 4.1E-08 DUF4602 unppssm F Eukaryota T 6zqt 2 B,D C,D RBP1_HUMAN RALBP1,76 KDA RAL-INTERACTING PROTEIN,DINITROPHENYL S-GLUTATHIONE ATPASE,DNP-SG ATPASE,RAL-INTERACTING PROTEIN 1 GPLGSETQAGIKEEIRRQEFLLNSLHRDLQGGIKDLSKEHRLWEVLRILTALRRKLREA 59 T 0.01 G10 pdbpercent F Eukaryota T 6zrc 2 B,C P,Q MTA1_HUMAN macrocyclic peptide based on residues 659-672 of the metastasis-associated protein MTA1 XCTKRAARRPYKPCAX 16 T 0.74 MTA_R1 unp F Eukaryota T 6zrd 2 C,D P,Q MTA1_HUMAN macrocyclic peptide based on residues 659-672 of the metastasis-associated protein MTA1 XCTKRCARRPYKPCAX 16 T 1.9 Toxin_27 pdbhh F Eukaryota T 6zrn 2 C,D C,D RBP1_HUMAN RALBP1,76 KDA RAL-INTERACTING PROTEIN,DINITROPHENYL S-GLUTATHIONE ATPASE,DNP-SG ATPASE,RAL-INTERACTING PROTEIN 1 GPLGSETQAGIKEEIRRQEFLLNSLHRDLQGGIKDLSKESRMWEVLRILTALRRKLREA 59 T 0.01 G10 pdbpercent F Eukaryota T 6zrw 1 A,B,C,D,E,F A,B,C,D,E,F B3VS76_COPCI Mucin-binding lectin 1 AIFHTGSELFIITRGPGKLTLLTWGGLNNLRSVIGAIPTENTGVTKWAVSFSHNYTRFSFIWEGQGEACYQIGNGLTRSPVGRSWSSSSTIHWGSSTVITEDVTSVVPGAVNRDKVTTAYALPDNL 126 T 41 IMS_HHH pdbhh F Eukaryota T 6zs9 86 MC A Quinupristin XTXPXXXX 8 T 290 zf-C2H2_jaz pdbhh F F 6zsb 87 NC A Quinupristin XTXPXXXX 8 T 290 zf-C2H2_jaz pdbhh F F 6zsc 86 MC A Quinupristin XTXPXXXX 8 T 290 zf-C2H2_jaz pdbhh F F 6zsd 88 OC A Quinupristin XTXPXXXX 8 T 290 zf-C2H2_jaz pdbhh F F 6zse 88 OC A Quinupristin XTXPXXXX 8 T 290 zf-C2H2_jaz pdbhh F F 6zsg 89 PC A Quinupristin XTXPXXXX 8 T 290 zf-C2H2_jaz pdbhh F F 6zsu 1 A,B A,B A0A0K1ECI7_CHOCO CGNE GMGGRRTIGIRSGEGAIMNASDFYALLRGRGMPVVVDDAEAAAVVSELGFRTVPFEAFDFDSPSEDPALVIVAQMGNVDALHGLWERSGTPLMHLALAKFDGGLSRLRAGLARVLAVDTDAALKRRAEAYEQLFSSASVEIASGEGVLRCHIGDEVEVGNCGDTLEQGFLYSVAEFLEASVVNLEGERSTFWVEGELPFDGFIHLSNSAALKERWGGMLDEFMRRSREGANLVRFADNVIDRLVVGGVDVTSALAGLSQGEERGMAATEFGLGCADAEAAEPFGVNSLLHKSAGGAYIGIGKGLRIPHIDFIARGATIRFIPAAEG 326 T 0.069 DUF2806 pdbpercent F Bacteria T 6zsv 1 A,B A,B A0A0K1EBZ5_CHOCO Uncharacterized protein GAMADIGSMDVLEYFERLKNRELAFVLDDLQLSDMVTRRGFSVIPFDDFDLAREDHPPAFVLVTRLDYHGKLMQAWETAKGISSHLSLAKFDTSPKSVEYSLDQLLSMDFAETLKRRGDYYDSVASTNRMEVVTPGAVLTCDFGNEIEIANNDVEMQKGWLYSVAEFFETSVINLEADRSSYTLNGDLCFTGLIYLCNRPDLKERASATMDELMRMSTRGRNVVSFVDNQIVRMELGGVDMTATLRELIVGKEREGSSTEFAMGCVEYPLAQDWTINSVMNEGSHGIHVGVGMGKEIPHMDFIAKGAELRIAESSDA 317 T 0.055 YgbA_NO unppssm F Bacteria T 6zsy 1 A A GRND_DROME Protein grindelwald GESRDCHGTICHPVNEFCYVATERCHPCIEVCNNQTHNYDAFLCAKECSAYK 52 T 0.22 MSSP pdbpercent F Eukaryota T 6zsz 1 A A GRND_DROME Protein grindelwald GESRDCHGTICHPVNEFCYVATERCHPCIEVCNNQTHNYDAFLCAKECSAYK 52 T 0.22 MSSP pdbpercent F Eukaryota T 6zt0 2 B AAAA GRND_DROME Protein grindelwald GESRDCHGTICHPVNEFCYVATERCHPCIEVCNNQTHNYDAFLCAKECSAYK 52 T 0.22 MSSP pdbpercent F Eukaryota T 6zt1 1 A,B,C,D,E,F,G,H,I,J,K,L A,B,C,D,E,F,G,H,I,J,K,L CC-Type2-(LaIdGe)4 XGEIGQALKEIGKALKEIGXALKEIGQALKGX 32 T 0.013 ApoC-I pdbpssm F T 6zts 1 A,B A,B LMBD1_REOVL Lambda-1 VSRIVVHTRWASNVDFDRDSSVIMAPPTENNIHLFKQLLNTETLSVRGANPLMFRANVLHMLLEFVLDNLYLNRHTGFSQDHTPFTEGANLRSLPGPDAEKWYSIMYPTRMGTPNVSKICNFVASCVRNRVGRFDRAQMMNGAMSEWVDVFETSDALTVSIRGRWMARLARMNINPTEIEWALTECAQGYVTVTSPYAPSVNRLMPYRISNAERQISQIIRIMNIGNNATVIQPVLQDISVLLQRISPLQIDPTIISNTMSTVSESTTQTLSPASSILGKLRPSNSDFSSFRVALAGWLYNGVVTTVIDDSSYPKDGGSVTSLENLWDFFILALALPMTTDPCAPVKAFMTLANMMVGFETIPMDNQIYTQSRRASAFSTPHTWPRCFMNIQLISPIDAPILRQWAEIIHRYWPNPSQIRYGAPNVFGSANLFTPPEVLLLPIDHQPANVTTPTLDFTNELTNWRARVCELMKNLVDNQRYQPGWTQSLVSSMRGTLDKLKLIKSMTPMYLQQLAPVELAVIAPMLPFPPFQVPYVRLDRDRVPTMVGVTRQSRDAITQPALSLSTTNTTVGVPLALDARAITVALLSGKYPPDLVTNVWYADAIYPMYADTEVFSNLQRDMITCEAVQTLVTLVAQISDTQYPVDRYLDWIPSLRASAATAATFAEWVNTSLKTAFDLSDMLLEPLLSGDPRMTQLAIQYQQYNGRTFDVIPEMPGSVIADCVQLTAEVFNHEYNLFGIARGDIIIGRVQSTHLWSPLAPPPDLVFDRDTPGVHIFGRDCRISFGMNGAAPMIRDETGMMVPFEGNWIFPLALWQMNTRYFNQQFDAWIKTGELRIRIEMGAYPYMLHYYDPRQYANAWNLASAWLEEITPTSIPSVPFMVPISSDHDISSAPAVQYIISTEYNDRSLFCTNSSSPQTIAGPDKHIPVERYNILTNPDAPPTQIQLPEVIDLYNVVTRYAYETPPITAVVMGVP 975 T 25 Peptidase_C36 pdbhh T Viruses T 6ztz 1 A B LMBD1_REOVL LAMBDA 1 NKKTAQLLHADTPRLVTWDAGLCTSFKIVPIVPAQVPQDVLAYTFFTSSYAIQSPFPEAAVSRIVVHTRWASNVDFDRDSSVIMAPPTENNIHLFKQLLNTETLSVRGANPLMFRANVLHMLLEFVLDNLYLNRHTGFSQDHTPFTEGANLRSLPGPDAEKWYSIMYPTRMGTPNVSKICNFVASCVRNRVGRFDRAQMMNGAMSEWVDVFETSDALTVSIRGRWMARLARMNINPTEIEWALTECAQGYVTVTSPYAPSVNRLMPYRISNAERQISQIIRIMNIGNNATVIQPVLQDISVLLQRISPLQIDPTIISNTMSTVSESTTQTLSPASSILGKLRPSNSDFSSFRVALAGWLYNGVVTTVIDDSSYPKDGGSVTSLENLWDFFILALALPLTTDPCAPVKAFMTLANMMVGFETIPMDNQIYTQSRRASAFSTPHTWPRCFMNIQLISPIDAPILRQWAEIIHRYWPNPSQIRYGAPNVFGSANLFTPPEVLLLPIDHQPANVTTPTLDFTNELTNWRARVCELMKNLVDNQRYQPGWTQSLVSSMRGTLDKLKLIKSMTPMYLQQLAPVELAVIAPMLPFPPFQVPYVRLDRDRVPTMVGVTRQSRDTITQPALSLSTTNTTVGVPLALDARAITVALLSGKYPPDLVTNVWYADAIYPMYADTEVFSNLQRDMITCEAVQTLVTLVAQISETQYPVDRYLDWIPSLRASAATAATFAEWVNTSMKTAFDLSDMLLEPLLSGDPRMTQLAIQYQQYNGRTFNVIPEMPGSVIADCVQLTAEVFNHEYNLFGIARGDIIIGRVQSTHLWSPLAPPPDLVFDRDTPGVHIFGRDCRISFGMNGAAPMIRDETGMMVPFEGNWIFPLALWQMNTRYFNQQFDAWIKTGELRIRIEMGAYPYMLHYYDPRQYANAWNLTSAWLEEITPTSIPSVPFMVPISSDHDISSAPAVQYIISTEYNDRSLFCTNSSSPQTIAGPDKHIPVERYNILTNPDAPPTQIQLPEVVDLYNVVTRYAYETPPITAVVMGVP 1035 T 27 Peptidase_C36 pdbhh T Viruses T 6ztz 2 B C LMBD1_REOVL LAMBDA 1 AGLCTSFKIVPIVPAQVPQDVLAYTFFTSSYAIQSPFPEAAVSRIVVHTRWASNVDFDRDSSVIMAPPTENNIHLFKQLLNTETLSVRGANPLMFRANVLHMLLEFVLDNLYLNRHTGFSQDHTPFTEGANLRSLPGPDAEKWYSIMYPTRMGTPNVSKICNFVASCVRNRVGRFDRAQMMNGAMSEWVDVFETSDALTVSIRGRWMARLARMNINPTEIEWALTECAQGYVTVTSPYAPSVNRLMPYRISNAERQISQIIRIMNIGNNATVIQPVLQDISVLLQRISPLQIDPTIISNTMSTLSPASSILGKLRPSNSDFSSFRVALAGWLYNGVVTTVIDDSSYPKDGGSVTSLENLWDFFILALALPLTTDPCAPVKAFMTLANMMVGFETIPMDNQIYTQSRRASAFSTPHTWPRCFMNIQLISPIDAPILRQWAEIIHRYWPNPSQIRYGAPNVFGSANLFTPPEVLLLPIDHQPANVTTPTLDFTNELTNWRARVCELMKNLVDNQRYQPGWTQSLVSSMRGTLDKLKLIKSMTPMYLQQLAPVELAVIAPMLPFPPFQVPYVRLDRDRVPTMVGVTRQSRDTITQPALSLSTTNTTVGVPLALDARAITVALLSGKYPPDLVTNVWYADAIYPMYADTEVFSNLQRDMITCEAVQTLVTLVAQISETQYPVDRYLDWIPSLRASAATAATFAEWVNTSMKTAFDLSDMLLEPLLSGDPRMTQLAIQYQQYNGRTFNVIPEMPGSVIADCVQLTAEVFNHEYNLFGIARGDIIIGRVQSTHLWSPLAPPPDLVFDRDTPGVHIFGRDCRISFGMNGAAPMIRDETGMMVPFEGNWIFPLALWQMNTRYFNQQFDAWIKTGELRIRIEMGAYPYMLHYYDPRQYANAWNLTSAWLEEITPTSIPSVPFMVPISSDHDISSAPAVQYIISTEYNDRSLFCTNSSSPQTIAGPDKHIPVERYNILTNPDAPPTQIQLPEVVDLYNVVTRYAYETPPITAVVMGVP 1008 T 26 Peptidase_C36 pdbhh T Viruses T 6zu2 1 A,B,C,D,E,F AAA,BBB,CCC,DDD,EEE,FFF B3VS76_COPCI Mucin-binding lectin 1 AIFHTGSELFIITRGPGKLTLLTWGGLNNLRSVIGAIPTENTGVTKWAVSFSHNYTRFSFIWEGQGEACYQIGNGLTRSPVGRSWSSSSTIHWGSSTVITEDVTSVVPGAVNRDKVTTAYALPDNL 126 T 41 IMS_HHH pdbhh F Eukaryota T 6zuf 2 C,D C,D C2 foldamer/peptide hybrid inhibitor of histone chaperone ASF1 EKXXXXRIA 9 T 73 YcbB pdbhh F T 6zuq 1 A A A0A1P8YXI8_PASFU Extracellular protein 11-1 GHMLDCKAVALKWVHQFRIPGGDNCNFYCSYDSLYQQFNLWKKNDACQGADGFSTAIPKIQEAPCSDCPGSKTCICSVQATAWRVRNGKWFDGQQWFDCDVKPYTERVLGRRWYDESEADKDIYVGYYSRGFISNDNVHCGSQ 143 T 0.047 TSGP1 unphh F Eukaryota T 6zus 1 A A A0A1P8YXI8_PASFU Extracellular protein 11-1 GHMLDCKAVALKWVHQFRIPGGDNCNFYCSYDSLYQQFNLWKKNDACQGADGFSTAIPKIQEAPCSDCPGSKTCICSVQATAWRVRNGKWFDGQQWFDCDVKPYTERVLGRRWYDESEADKDIYVGYYSRGFISNDNVHCGSQ 143 T 0.047 TSGP1 unphh F Eukaryota T 6zv5 1 A,B,C,D,E,F AAA,BBB,CCC,DDD,EEE,FFF B3VS76_COPCI Mucin-binding lectin 1 AIFHTGSELFIITRGPGKLTLLTWGGLNNLRSVIGAIPTENTGVTKWAVSFSHNYTRFSFIWEGQGEACYQIGNGLTRSPVGRSWSSSSTIHWGSSTVITEDVTSVVPGAVNRDKVTTAYALPDNL 126 T 41 IMS_HHH pdbhh F Eukaryota T 6zvb 2 B P GAB2_HUMAN phosphorylated Gab2pT391 peptide IPRRNTLPAMDNS 13 T 36 TbpB_A pdbhh F Eukaryota T 6zvc 2 B P GAB2_HUMAN phosphorylated Gab2pT391 peptide IPRRNTLPAMDNS 13 T 36 TbpB_A pdbhh F Eukaryota T 6zvd 2 B P GAB2_HUMAN phosphorylated Gab2pT391 peptide IPRRNTLPAMDNS 13 T 36 TbpB_A pdbhh F Eukaryota T 6zve 2 B P GAB2_HUMAN phosphorylated Gab2pT391 peptide IPRRNTLPAMDNS 13 T 36 TbpB_A pdbhh F Eukaryota T 6zvf 3 C P LEG3_HUMAN GAL-3,35 KDA LECTIN,CARBOHYDRATE-BINDING PROTEIN 35,CBP 35,GALACTOSE-SPECIFIC LECTIN 3,GALACTOSIDE-BINDING PROTEIN,GALBP,IGE-BINDING PROTEIN,L-31,LAMININ-BINDING PROTEIN,LECTIN L-29,MAC-2 ANTIGEN XQAPPGAYPG 10 T 0.76 HMMR_N pdbhh F Eukaryota T 6zvh 36 JA y LYAR_HUMAN Cell growth-regulating nucleolar protein KFNWKGTIKAILKQAPDNEITIKKLRKKVLAQYYTVTDEHHRSEEELLVIFNKKISKNPTFKLLKDKVKLVK 72 T 0.0019 DEK_C pdb F Eukaryota T 6zvj 49 WA Y RNA recognition motif (unknown) AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 78 T 14000 zf-H2C2_2 pdbhh F F 6zvn 2 B BBB SYT1_HUMAN SYNAPTOTAGMIN I,SYTI,P65 GEGKEDAFSKLKEKFMNELHK 21 T 0.01 PRIMA1 unphh F Eukaryota T 6zvq 2 B B SKI_HUMAN PROTO-ONCOGENE C-SKI FQPHPGLQKTLEQFHLSSMSSLGGPAAFSARWAQE 35 T 120 DUF2520 pdbhh F Eukaryota T 6zwk 2 G,H,I,J,K,L G,H,I,J,K,L H2AX_HUMAN H2A/X,HISTONE H2A.X CKATQASQEY 10 T 13 Class_IIIsignal pdbhh F Eukaryota T 6zx9 2 B B DCAF1_HUMAN HIV-1 VPR-BINDING PROTEIN,VPRBP,SERINE/THREONINE-PROTEIN KINASE VPRBP,VPR-INTERACTING PROTEIN MAPINFTSRLNRRASFPKYGGVDGGCFDRHLIFSRFRPISVFREANEDESGFTCCAFSARERFLMLGTCTGQLKLYNVFSGQEEASYNCHNSAITHLEPSRDGSLLLTSATWSQPLSALWGMKSVFDMKHSFTEDHYVEFSKHSQDRVIGTKGDIAHIYDIQTGNKLLTLFNPDLANNYKRNCATFNPTDDLVLNDGVLWDVRSAQAIHKFDKFNMNISGVFHPNGLEVIINTEIWDLRTFHLLHTVPALDQCRVVFNHTGTVMYGAMLQADDEDDLMEERMKSPFGSSFRTFNATDYKPIATIDVKRNIFDLCTDTKDCYLAVIENQGSMDALNMDTVCRLYEVGRQRLAEELALVPRG 360 T 0.0003 ANAPC4_WD40 pdbpssm F Eukaryota T 6zxr 2 B B ALA-GLU-ARG-ASP-GLU-LEU AERDEL 6 T 58 SCHIP-1 pdbhh F F 6zyg 1 A A A0A1X9WII3_9GAMM Protealysin-associated protein MKPLPVLNQDTVIELAREGGFAFIPKLAGQRRIALADITPEQRQRLNQLLNQTLPYAQEEGQPDSPGSGDQRYFRVQISYYSQTLRSEIVLLIPETSAPQALVDLWKTGQVDE 113 T 4.8 DUF3500 unphh F Bacteria T 6zyw 17 Q Y Q22YU3_TETTS Shulin MFNFFSSANINQNIPKYSVNDFVFRLKKIEKIVVKEGLDGFLLINGVDSRENTEYVKLTNWLFLGNSGLEIEENEYLNQIYSDMIVLIKKGTTHIFIDPEALNSLQTLIYSIPNVDVFCPTEKQYEDKDEMELLKMAFFLRVMKPTKKVGILLGQKDKGKINSIEKWPLIQSYGLEELGVGFFSMNHEVVDLTLRLNAVYKNYDKFFVSKLIYVVAKRLTGHFNSAAGQLGDMKMHKRNLATESQLTEIFRDTYEIEEISKWVQIRGVNAALPKPRVLFGKNTSADCSKEPSVAPLKDLKYSETFHSFHATFETFDLRTCLRAARTYFLAKGVKEERNLITLNDDEGVPQGYELNIDENQQYKDQDFLANLYLSIIIGFNEVMQLITKDYKNMTEEFIQDYIFQKVSKVYAGFQIPESEITLDKIQIILKAYNSFGEEVKIDFKDTISFKLTPYFFMVRIEQKNIKSQILNNTVLGSLVFAESFILQEGCYLLLTKEIPYFDLWNCQNDYSEKIEKMKKRILWEPLGKQISDELPKNRIFVQTGRKSNYGFDIPIMQASYYMHELGLRIETQRLGWFILFFKEMKEIQITQKMNHTWLIFKVDSNITFNSISKDTIALEFTGDALEQSFFKIKNYFEENQIKYEYQVDIPAIFQESQIAKKQILNQQSQGQKLITMNSIQNEQFFISYIESKQLMILNQMKDLKLSAYKNLYEQMQISQAITPVENHIGVILVNGSYCSGKRKFAENLIRFGSDNNLRLHLYKFDLNEMSELTEKSYLSGLLKFASEKKIQNTDVIVASVPHFINTKILIDYFSKSEKISNAFYIRTIATKININNIYSNFNKNPVNNVFTYGVEGYSQFLLLDTYNNYDADVNALNKTLSGVLPGAKIYKIMNNILNPALAKDILTSITFISEQNNLNRLKYSVQYDLLTSNGPSSVVFIPFKLPILREKIRDLIYKKILQNGNQTLVDTIEAEQKIAEFKELNKNSKDPLMIEIIKLKEKIEIQNAQTSDQAIKIDYVKGILRYDSKLKEGLEEITITPNYFIERTVKGVDAKEFTEELNGVSFKNVKYTGITNSIINDMGFVFAGKNLNKEKLLELLYKLVKPLNKQKLRQRKDLTEEEIVDIQFRNRGEGLENGEFYDGQFWRNIQGLILPHHPKKDEFIEEYLKQEEVRINQINEQLQQEWETWKQVYDKIHLDK 1200 T 0.00092 cobW unphh F Eukaryota T 6zyx 8 H Y Q22YU3_TETTS Shulin MFNFFSSANINQNIPKYSVNDFVFRLKKIEKIVVKEGLDGFLLINGVDSRENTEYVKLTNWLFLGNSGLEIEENEYLNQIYSDMIVLIKKGTTHIFIDPEALNSLQTLIYSIPNVDVFCPTEKQYEDKDEMELLKMAFFLRVMKPTKKVGILLGQKDKGKINSIEKWPLIQSYGLEELGVGFFSMNHEVVDLTLRLNAVYKNYDKFFVSKLIYVVAKRLTGHFNSAAGQLGDMKMHKRNLATESQLTEIFRDTYEIEEISKWVQIRGVNAALPKPRVLFGKNTSADCSKEPSVAPLKDLKYSETFHSFHATFETFDLRTCLRAARTYFLAKGVKEERNLITLNDDEGVPQGYELNIDENQQYKDQDFLANLYLSIIIGFNEVMQLITKDYKNMTEEFIQDYIFQKVSKVYAGFQIPESEITLDKIQIILKAYNSFGEEVKIDFKDTISFKLTPYFFMVRIEQKNIKSQILNNTVLGSLVFAESFILQEGCYLLLTKEIPYFDLWNCQNDYSEKIEKMKKRILWEPLGKQISDELPKNRIFVQTGRKSNYGFDIPIMQASYYMHELGLRIETQRLGWFILFFKEMKEIQITQKMNHTWLIFKVDSNITFNSISKDTIALEFTGDALEQSFFKIKNYFEENQIKYEYQVDIPAIFQESQIAKKQILNQQSQGQKLITMNSIQNEQFFISYIESKQLMILNQMKDLKLSAYKNLYEQMQISQAITPVENHIGVILVNGSYCSGKRKFAENLIRFGSDNNLRLHLYKFDLNEMSELTEKSYLSGLLKFASEKKIQNTDVIVASVPHFINTKILIDYFSKSEKISNAFYIRTIATKININNIYSNFNKNPVNNVFTYGVEGYSQFLLLDTYNNYDADVNALNKTLSGVLPGAKIYKIMNNILNPALAKDILTSITFISEQNNLNRLKYSVQYDLLTSNGPSSVVFIPFKLPILREKIRDLIYKKILQNGNQTLVDTIEAEQKIAEFKELNKNSKDPLMIEIIKLKEKIEIQNAQTSDQAIKIDYVKGILRYDSKLKEGLEEITITPNYFIERTVKGVDAKEFTEELNGVSFKNVKYTGITNSIINDMGFVFAGKNLNKEKLLELLYKLVKPLNKQKLRQRKDLTEEEIVDIQFRNRGEGLENGEFYDGQFWRNIQGLILPHHPKKDEFIEEYLKQEEVRINQINEQLQQEWETWKQVYDKIHLDK 1200 T 0.00092 cobW unphh F Eukaryota T 6zyy 2 B Y Q22YU3_TETTS Shulin MFNFFSSANINQNIPKYSVNDFVFRLKKIEKIVVKEGLDGFLLINGVDSRENTEYVKLTNWLFLGNSGLEIEENEYLNQIYSDMIVLIKKGTTHIFIDPEALNSLQTLIYSIPNVDVFCPTEKQYEDKDEMELLKMAFFLRVMKPTKKVGILLGQKDKGKINSIEKWPLIQSYGLEELGVGFFSMNHEVVDLTLRLNAVYKNYDKFFVSKLIYVVAKRLTGHFNSAAGQLGDMKMHKRNLATESQLTEIFRDTYEIEEISKWVQIRGVNAALPKPRVLFGKNTSADCSKEPSVAPLKDLKYSETFHSFHATFETFDLRTCLRAARTYFLAKGVKEERNLITLNDDEGVPQGYELNIDENQQYKDQDFLANLYLSIIIGFNEVMQLITKDYKNMTEEFIQDYIFQKVSKVYAGFQIPESEITLDKIQIILKAYNSFGEEVKIDFKDTISFKLTPYFFMVRIEQKNIKSQILNNTVLGSLVFAESFILQEGCYLLLTKEIPYFDLWNCQNDYSEKIEKMKKRILWEPLGKQISDELPKNRIFVQTGRKSNYGFDIPIMQASYYMHELGLRIETQRLGWFILFFKEMKEIQITQKMNHTWLIFKVDSNITFNSISKDTIALEFTGDALEQSFFKIKNYFEENQIKYEYQVDIPAIFQESQIAKKQILNQQSQGQKLITMNSIQNEQFFISYIESKQLMILNQMKDLKLSAYKNLYEQMQISQAITPVENHIGVILVNGSYCSGKRKFAENLIRFGSDNNLRLHLYKFDLNEMSELTEKSYLSGLLKFASEKKIQNTDVIVASVPHFINTKILIDYFSKSEKISNAFYIRTIATKININNIYSNFNKNPVNNVFTYGVEGYSQFLLLDTYNNYDADVNALNKTLSGVLPGAKIYKIMNNILNPALAKDILTSITFISEQNNLNRLKYSVQYDLLTSNGPSSVVFIPFKLPILREKIRDLIYKKILQNGNQTLVDTIEAEQKIAEFKELNKNSKDPLMIEIIKLKEKIEIQNAQTSDQAIKIDYVKGILRYDSKLKEGLEEITITPNYFIERTVKGVDAKEFTEELNGVSFKNVKYTGITNSIINDMGFVFAGKNLNKEKLLELLYKLVKPLNKQKLRQRKDLTEEEIVDIQFRNRGEGLENGEFYDGQFWRNIQGLILPHHPKKDEFIEEYLKQEEVRINQINEQLQQEWETWKQVYDKIHLDK 1200 T 0.00092 cobW unphh F Eukaryota T 6zzw 1 A,B,C A,C,B B4EH86_BURCJ Lectin GHMPLLSASIVSAPVVTSETYVDIPGLYLDVAKAGIRDGKLQVILNVPTPYATGNNFPGIYFAIATNQGVVADGCFTYSSKVPESTGRMPFTLVATIDVGSGVTFVKGQWKSVRGSAMHIDSYASLSAIWGTAA 134 T 0.002 DUF1543 unppercent F Bacteria T 6zzx 14 N O A0A2P6THB2_CHLSO Photosystem I subunit O WGAYEEPLSLVAGFLGWFAPSNIKVPAFGNESLFGAFHASMLENLANFPQGPALTDKFWILMITWHLGLFLALTLGNIGQAARKQGY 87 T 24 YkpC pdbhh F Eukaryota T 7a00 2 C,D C,D L6F mutant of C-terminal hexapeptide from Guanylate kinase-associated protein XEAQTRF 7 T 19 GKAP pdbhh F T 7a09 49 WA X RNA recognition motif AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 78 T 14000 zf-H2C2_2 pdbhh F F 7a0n 1 A,B,C,D A,B,C,D G0S5K3_CHATD Uncharacterized protein,Uncharacterized protein SMTSSGSSRDLFRALNSFIQTPTLPPPADLDAIISSYLERHDKPEEGSGDRLNDELLAIWDKAVQDHPEKYAAFVAVLRQLRPGLGAPARTFQWWDKLLDPVLDNATREKGLARSFMDFTLEILSSSEYDDPEAWGEEGFIPWLNRLLVRWMELRESRADFRPSTDLKEQVLTDALLAFGKKDPKGFMNALNAFVLRREHRNSAFSLLCAFVNSGPPHLYLILQTPLFGNILQSLQKDESTFTVNLALIALVMLLPFFPGDIVPYLPTLFNIYARLLFWDRDSYFAQQHTEMGENHGESGTDTPWDKVLLDPDYDGHSVPYLPEYFTILYGLYPINFVDYIRKPHNYLPHAGSDDDIDVHAAEIRERSERFRKQHLLHPNFYEYTIETEKTNITRWLKSEADEIIADCMALVVDRGTADESRPGVEIIEQVSLLRYQRHRLLNDLQYERFVRQQHMSHMGELRRRQ 466 T 1.9E-05 CCDC14 unphh F Eukaryota T 7a1i 2 B,D B,D A0A3L6LBE5_9TRYP FPC4 SSLSPYLRYLPSDVSGGEWDKPDVGDVLCFQAKEPQRRRVLTSPVPDELLIK 52 T 1.7 USP7_ICP0_bdg pdbhh F Eukaryota T 7a1s 2 B B TNKS2_HUMAN TANKYRASE-2 NLEVAEYLLQHGADVNAQDK 20 T 0.00021 Ank pdb F Eukaryota T 7a1t 1 A,B,C,D,E,F A,B,C,D,E,F CC-Type2-(GgLaId)4-W19BrPhe XGEIAQGLKEIAKGLKEIAXGLKEIAQGLKGX 32 T 0.091 WXG100 pdb F T 7a23 11 K S Q9SD78_ARATH B14.5a MAKSVSTAASSLVQNLRRYIKKPWQITGPCAHPEYLEAVPKATEYRLRCPATIDEEAIVPSSDPETVYNIVYHGRDQRRNRPPIRRYVLTKDNVVQMMNEKKSFDVSDFPKVYLTTTVEEDLDTRGGGYEK 131 T 0.11 CI-B14_5a pdbpercent F Eukaryota T 7a23 29 CA r Unk1 AAAAAAAAAAAAAAAAAAAAAAAA 24 T 670 DUF4699 pdbhh F F 7a23 30 DA n UMP2_ARATH P2 MATRNALRIVSRRFSSGKVLSEEERAAENVFIKKMEQEKLQKLARQGPGEQAAGSASEAKVAGATASASAESGPKVSEDKNRNYAVVAGVVAIVGSIGWYLKAGGKKQPEVQE 113 T 0.01 IATP pdbhh F Eukaryota T 7a23 42 QA s Unk2 AAAAAAAAAAAAAAAAAAAA 20 T 510 Adeno_PIX pdbhh F F 7a24 11 K S Q9SD78_ARATH B14.5a MAKSVSTAASSLVQNLRRYIKKPWQITGPCAHPEYLEAVPKATEYRLRCPATIDEEAIVPSSDPETVYNIVYHGRDQRRNRPPIRRYVLTKDNVVQMMNEKKSFDVSDFPKVYLTTTVEEDLDTRGGGYEK 131 T 0.11 CI-B14_5a pdbpercent F Eukaryota T 7a24 29 CA r Unk1 AAAAAAAAAAAAAAAAAAAAAAAA 24 T 670 DUF4699 pdbhh F F 7a24 30 DA n UMP2_ARATH P2 MATRNALRIVSRRFSSGKVLSEEERAAENVFIKKMEQEKLQKLARQGPGEQAAGSASEAKVAGATASASAESGPKVSEDKNRNYAVVAGVVAIVGSIGWYLKAGGKKQPEVQE 113 T 0.01 IATP pdbhh F Eukaryota T 7a2w 2 B B VSL12 XVSLARRPLPPLP 13 T 0.95 DUF4522 pdbhh F T 7a2x 2 B B VSL12 high affinity synthetic peptide acetylated in the amino-terminus XVSLARRPLPPLP 13 T 0.95 DUF4522 pdbhh F T 7a2y 2 B B VSL12 VSLARRPLPPLP 12 T 0.73 DUF4522 pdbhh F T 7a2z 2 B B VSL12 VSLARRPLPPLP 12 T 0.73 DUF4522 pdbhh F T 7a48 2 B B APH colied-coil XLEEELKQLEEELQAIEEQLAQLQWKAQARKEKLAQLKEKLX 42 T 0.0018 DUF5320 pdbhh F T 7a4d 3 E,F,K,L E,F,K,L APH coiled-coil XLEEELKQLEEELQAIEEQLAQLQWKAQARKEKLAQLKEKLX 42 T 0.0018 DUF5320 pdbhh F T 7a4p 20 U H Photosystem I reaction center subunit VI-chloroplastic-like LPT 3 T 200 CoV_NSP15_N pdbhh F F 7a50 2 B,D B,D Coiled-coil APH XLEEELKQLEEELQAIEEQLAQLQWKAQARKEKLAQLKEKLX 42 T 0.0018 DUF5320 pdbhh F T 7a5f 1 A Y2 nascent chain AAAAAAAAAAAAAAAAAAAAAAAAAAAAA 29 T 1400 DUF4699 pdbhh F F 7a5f 53 BB A5 Oxa1L AAAAAAAAAAAAAAAAAAAAAAAAAAAA 28 T 1200 DUF4699 pdbhh F F 7a5g 1 A Y2 nascent chain AAAAAAAAAAAAAAAAAAAAAAAAAAAAA 29 T 1400 DUF4699 pdbhh F F 7a5g 54 CB A5 Oxa1L tail XXXXXXXXXXXXXXXXXXXXXXXXXXXX 28 F F F 7a5h 50 XA t Unknown protein/protein extension XXXXXXXXXXXXXXXXXXXXXXXXXXXX 28 F F F 7a5h 57 EB Y2 nascent chain XXXXXXXXXXXXXXXXXXXXXXXXXKKXK 29 T 4700 Keratin_assoc pdbhh F F 7a5h 59 GB G MRES1_HUMAN Mitochondrial transcription rescue factor 1 MAMASVKLLAGVLRKPDAWIGLWGVLRGTPSSYKLCTSWNRYLYFSSTKLRAPNYKTLFYNIFSLRLPGLLLSPECIFPFSVRLKSNIRSTKSTKKSLQKVDEEDSDEESHHDEMSEQEEELEDDPTVVKNYKDLEKAVQSFRYDVVLKTGLDIGRNKVEDAFYKGELRLNEEKLWKKSRTVKVGDTLDLLIGEDKEAGTETVMRILLKKVFEEKTESEKYRVVLRRWKSLKLPKKRMSK 240 T 0.002 S4 pdbpercent F Eukaryota T 7a5i 1 A Y2 nascent chain XXXXXXXXXXXXXXXXXXXXXXXXXXXXX 29 F F F 7a5i 54 BB,CB t3,A5 Unknown protein/protein extension XXXXXXXXXXXXXXXXXXXXXXXXXXXX 28 F F F 7a5j 51 YA t Unknown protein/protein extension XXXXXXXXXXXXXXXXXXXXXXXXXXXX 28 F F F 7a5j 58 FB Y2 nascent chain XXXXXXXXXXXXXXXXXXXXXXXXXXXXX 29 F F F 7a5j 60 IB y Unknown protein/protein extension XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 32 F F F 7a5j 61 JB z unknown protein/protein extension XXXXXXXXXXXXXX 14 F F F 7a5k 2 B Y2 nascent chain XXXXXXXXXXXXXXXXXXXXXXXXXXXXX 29 T 1400 DUF4699 pdbhh F F 7a5k 90 OC B Unknown protein XXXXXXXXXXXXXXXXXXXXXXXXXXXX 28 F F F 7a5m 2 B,C B,C Ac-[2-Cl-F]-[ProM-2]-[ProM-17]-OMe GXXXX 5 F F F 7a5p 4 D 8 UNKNOWN XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 204 F F F 7a66 1 A,B,C A,B,C Pcc2 MKIRAKVELTWEYEDEETAKAIANAVNVDNISIPEKLKKSLNLITFPDGARVVTKVKYEGEIESLVVALDDLIFAIKVAEEVLWSH 86 T 0.00032 Pcc1 pdbpercent F T 7a67 2 B B Pcc2 MKIRAKVELTWEYEDEETAKAIANAVNVDNISIPEKLKKSLNLITFPDGARVVTKVKYEGEIESLVVALDDLIFAIKVAEEVLWSHPQFEK 91 T 0.00084 Pcc1 pdbpercent F T 7a6r 2 E,F,G,H E,F,G,L DAPK2_HUMAN DAPK2 C-terminal peptide RRRSSTS 7 T 4.2 CAF20 pdbhh F Eukaryota F 7a6y 2 E,F,G,H J,K,L,M DAPK2_HUMAN DAPK2 C-terminal peptide RRRSSTS 7 T 4.2 CAF20 pdbhh F Eukaryota F 7a8s 1 A A sTIM11_h3 MDKDEAWKQVEQLRREGATQIAYRSDDWRDLKEAWKKGADILIVDATDKDEAWKQVEQLRREGATQIAYRSDDWRDLKEAWKKGADILIVSEEMARHAPKDEAWKQVEQLRREGATQIAYRSDDWRDLKEAWKKGADILIVDATDKDEAWKQVEQLRREGATQIAYRSDDWRDLKEAWKKGADILIVDATGLEHHHHHH 199 T 0.00077 NanE pdbhh F T 7a8w 2 C,F CCC,FFF G4MXW3_MAGO7 Uncharacterized protein METGNKYIEKRAIDLSRERDPNFFDNPGIPVPECFWFMFKNNVRQDDGTCYSSWKMDMKVGPNWVHIKSDDNCNLSGDFPPGWIVLGKKRPGF 93 T 0.086 TMEM18 unp F Eukaryota T 7a8x 2 C,F C,F G4MXW3_MAGO7 AVR-PIK PROTEIN ( PIKMPROTEIN,PIKP PROTEIN ),AVRPI7 PROTEIN METGNKYIEKRAIDLSRERDPNFFDNPGIPVPECFWFMFKNNVRQDDGTCYSSWKMDMKVGPNWVHIKSDDNCNLSGDFPPGWIVLGKKRPGF 93 T 0.086 TMEM18 unp F Eukaryota T 7a9w 1 A A RMD9_YEAST REQUIRED FOR MEIOTIC NUCLEAR DIVISION PROTEIN 9 MSHHHHHHKNVPKGVLDKKNGREQRKTEQNVFNVDPASPWRHELLSFDECVSSALKYSTTPLQNTYKRIGNNQLNKNPSFAMFWDSMGRAMELYYSLRESPDFNAYRVSRLIHLLHNGLRSTRDQLVKLSRKPDYDSQSFHKEMMNFLCNSLKDISDDILIGKVSVSGYGATHLLTSFKELSFDDDCIRIWEASKNLSDETTSQAFQEPKVVGFMLPLLYAKTRSLTEPNELYNQIIQSKEFIHPNLYSGLIKVFIKAEDYEKALSLFGQLCEKAEVRNYGYLIETHLSFIGDSKNLTLAESFFDKIINDEMPYKIILQVSTVNSFLQNIWKAQNDFDHVYRIWEKAVKFYGNTVNPGILSSLNNTFFTIFFENYINDNINGFRKLQEIITFYSGVKKIDEPFFNVMLTRASIWHERSIIDFIDKNYTLYHIPRTIISYRILLKSLGSIDNTNNEEILDRWLELVKKLNELGQQYIANADLSALRDATVVWSQSKRDEKVFSAKAKGTPATTTTTEDDIKVPKPLENLKNEDSTSNSEDRIELYLKILKRYTPYFRATKQVYRYTTGCAESYPILNEYLSGYSDLSAEDIPVPQLHSFIAKEQ 603 T 2.6E-05 MRP-S27 unphh F Eukaryota T 7a9x 1 A A RMD9_YEAST REQUIRED FOR MEIOTIC NUCLEAR DIVISION PROTEIN 9 MSHHHHHHKNVPKGVLDKKNGREQRKTEQNVFNVDPASPWRHELLSFDECVSSALKYSTTPLQNTYKRIGNNQLNKNPSFAMFWDSMGRAMELYYSLRESPDFNAYRVSRLIHLLHNGLRSTRDQLVKLSRKPDYDSQSFHKEMMNFLCNSLKDISDDILIGKVSVSGYGATHLLTSFKELSFDDDCIRIWEASKNLSDETTSQAFQEPKVVGFMLPLLYAKTRSLTEPNELYNQIIQSKEFIHPNLYSGLIKVFIKAEDYEKALSLFGQLCEKAEVRNYGYLIETHLSFIGDSKNLTLAESFFDKIINDEMPYKIILQVSTVNSFLQNIWKAQNDFDHVYRIWEKAVKFYGNTVNPGILSSLNNTFFTIFFENYINDNINGFRKLQEIITFYSGVKKIDEPFFNVMLTRASIWHERSIIDFIDKNYTLYHIPRTIISYRILLKSLGSIDNTNNEEILDRWLELVKKLNELGQQYIANADLSALRDATVVWSQSKRDEKVFSAKAKGTPATTTTTEDDIKVPKPLENLKNEDSTSNSEDRIELYLKILKRYTPYFRATKQVYRYTTGCAESYPILNEYLSGYSDLSAEDIPVPQLHSFIAKEQ 603 T 2.6E-05 MRP-S27 unphh F Eukaryota T 7aa4 2 B B polymer Cyclomarin A analogue WXAXVXI 7 T 12 DUF446 pdbhh F F 7aa9 2 B,D,F,H,J,L B,D,F,H,J,L pT13/PT15 SCOC LIR EDSTFTNISLAD 12 T 6.3 DUF2370 pdbhh F T 7aam 2 C C PTN22_HUMAN HEMATOPOIETIC CELL PROTEIN-TYROSINE PHOSPHATASE 70Z-PEP,LYMPHOID PHOSPHATASE,LYP,PEST-DOMAIN PHOSPHATASE,PEP GFANRFSKPKGPRNPPPTWNI 21 T 44 Ral pdbhh F Eukaryota T 7abl 1 A,B,C,D A,B,C,D CAPSD_HBVCJ CORE ANTIGEN,CORE PROTEIN,HBCAG,P21.5 MDIDPYKEFGASVELLSFLPSDFFPSIRDLLDTASALYREALESPEHCSPHHTALRQAILCWGELMNLATWVGSNLEDPASRELVVSYVNVNMGLKIRQLLWFHISCLTFGRETVLEYLVSFGVWIRTPPAYRPPNAPILSTLPETTVVRRRGRSPRRRTPSPRRRRSQSPRRRRSQSRESQC 183 T 1.1E-26 Hepatitis_core pdb T Viruses T 7abr 2 G S substrate polypeptide XXXXXXXXXXXXXXXXXXXXXXXXXX 26 F F F 7abt 2 B B PRO-ARG-PRO-ARG-PRO-ARG-PRO-ARG PRPRPRPR 8 T 2.6 AT_hook pdbhh F F 7ac6 2 B Y Di-or tripeptide:H+ symporter AF 2 T 320 zf-C2H2_11 pdbhh F F 7ac7 29 CA W Nascent peptide FA 2 T 350 SEC-C pdbhh F F 7acb 1 A,B,C,D A,B,C,D W5VVI0_CAPHI Capra hircus Cathelicidin-1 (dodecylphosphocholine) RICQFVLIRVCR 12 T 1.7 bpX4 pdbhh F Eukaryota T 7ace 1 A,B A,B W5VVI0_CAPHI Capra hircus Cathelicidin-1 RICQFVLIRVCR 12 T 1.7 bpX4 pdbhh F Eukaryota T 7acj 26 Z W Nascent peptide FA 2 T 350 SEC-C pdbhh F F 7acr 26 Z W Nascent peptide FA 2 T 350 SEC-C pdbhh F F 7acv 2 C C Q9AEM2_CLODI S-LAYER PROTEIN MADIIADADSPAKITIKANKLKDLKDYVDDLKTYNNTYSNVVLEHHHHHH 50 T 0.03 DUF4458 unp F Bacteria T 7acw 2 B,D B,D Q9AEM2_CLODI S-LAYER PROTEIN MADIIADADSPAKITIKANKLKDLKDYVDDLKTYNNTYSNVVLEHHHHHH 50 T 0.03 DUF4458 unp F Bacteria T 7ad0 2 G,H,I,J,K,L H,I,J,K,L,M Modified p53 peptide ATSFAEYWALLXPA 14 T 0.49 P53_TAD pdbhh F T 7ad5 1 A A V5TFR9_LEPMC Avirulence protein LmJ1 GHMHDCHQVTVSRDVTLQNKERHDCNQVCASIDKETENKLNTDIIPRLTRYMSVKGNSIIARVQQSNSDPKCSCTWRAIIWRVYKAYDENSLNVALHVSHPNQQIGENPDWSLVISNPNVHCLKH 125 T 3.8 Antimicrobial_6 pdbhh F Eukaryota T 7ad6 2 C C K92 knob domain CPEGWSECGVAIYGYACGRWGCGHFLNSGPNISP 34 T 0.18 Toxin_4 pdbhh F T 7ad7 2 C C K8 peptide SVCPDGFDWGYGCAAGSSRFCTRHDWCCYDERADSHTYGFCTGNRVENLYFQ 52 T 0.098 DUF4716 pdb F T 7ad9 1 A,C,E,G,I A,C,L,E,G AB140_YEAST Lifeact MGVADLIKKFESISKEE 17 T 1.6 Antimicrobial_8 pdbhh F Eukaryota T 7ad9 3 K,L,M,N,O O,P,Q,R,S Phalloidin PAWXAXC 7 T 2.7 CSN7a_helixI pdbhh F F 7adj 1 A A A0A7Z7PMS6_MYCMC Putative immunoglobulin-blocking virulence protein MGSSHHHHHHSSGLVPRGSHISFDTSSNGITDAELAPINNAINDAIVSNRDNKLKPSEEKIIKETEKKIEEKIIIPPAKKEEKIEAAKPIPKPVVRKPETKITSPKITRRKQTITIAGIEVEAEIEGPPGFVTHQRDKDRKISNPTKPYQNHTVNKILSVKVTDKLKEQVAKDALSGGNGYDEGVGLFNNSIFNVFKEEFNSGKELNDILSSLESVARQNSGAFQNTLERYKKMLDSNNVINFLKSEAQKEYPKLKSKFQTKNQEYIWLIANLDQSKFTKIASTSEKYLEKGLTISPRSAFINEAGEIDSNGWGPPDEYNTVTSRLRRDNSEYRVFDYDEYYSRSSDRIANGTYPGWVKEDVSEPYSKKYNFKASDGIRFSKLERINPNPAKGKLNSGLVLDLDVSNDEAYRRSKELIEKLQKDGEQITSYRIKNMGEKNSDQAFKDILGALPKDIQQLELFFSDKATNTASLIALENKNIKELSLYTSGNSLKKAWSYNPLALRNTTWINTIDYNVSAEYSSHDKITTRITFNTLAFDQEDFSNGSYERINDGLRMVYYARNNEPFFQGGHGPGLEPDKKLGQNSYPTGLDFSRVTGIKSLKGLRFDDDLDTSNEPRKITELTLYNNESYFEISSDELNEANLQHLSTGEGNPEKPKIHFSNGNNTTSIRISGKTLLSDEGRRNLDKYFEYNESLRNSGKQIQIPNGSDELKKQLEGWGYKVSTASDRSFT 732 T 0.02 DUF3403 pdb F Bacteria T 7ado 9 I I EMC10_HUMAN HEMATOPOIETIC SIGNAL PEPTIDE-CONTAINING MEMBRANE DOMAIN-CONTAINING PROTEIN 1 MAAAASAGATRLLLLLLMAVAAPSRARGSGCRAGTGARGAGAEGREGEACGTVGLLLEHSFEIDDSANFRKRGSLLWNQQDGTLSLSQRQLSEEERGRLRDVAALNGLYRVRIPRRPGALDGLEAGGYVSSFVPACSLVESHLSDQLTLHVDVAGNVVGVSVVTHPGGCRGHEVEDVDLELFNTSVQLQPPTTAPGPETAAFIERLEMEQAQKAKNPQEQKSFFAKYWMYIIPVVLFLMMSGAPDTGGQGGGGGGGGGGGSGR 263 T 0.002 2OG-FeII_Oxy_5 pdbpssm F Eukaryota T 7ado 10 J K Unassigned helix XXXXXXXXXXXXXXXX 16 F F F 7adp 8 H I EMC10_HUMAN HEMATOPOIETIC SIGNAL PEPTIDE-CONTAINING MEMBRANE DOMAIN-CONTAINING PROTEIN 1 MAAAASAGATRLLLLLLMAVAAPSRARGSGCRAGTGARGAGAEGREGEACGTVGLLLEHSFEIDDSANFRKRGSLLWNQQDGTLSLSQRQLSEEERGRLRDVAALNGLYRVRIPRRPGALDGLEAGGYVSSFVPACSLVESHLSDQLTLHVDVAGNVVGVSVVTHPGGCRGHEVEDVDLELFNTSVQLQPPTTAPGPETAAFIERLEMEQAQKAKNPQEQKSFFAKYWMYIIPVVLFLMMSGAPDTGGQGGGGGGGGGGGSGR 263 T 0.002 2OG-FeII_Oxy_5 pdbpssm F Eukaryota T 7ads 1 A A A0A0R8HV90_ORFV Apoptosis inhibitor GPLGSMANRDDIDASAVMAAYLAREYAEAVEEQLTPRERDALEALRVSGEEVRSPLLQELSNAGEHRANPENSHIPAALVSALLEAPTSPGRMVTAVELCAQMGRLWTRGRQLVDFMRLVYVLLDRLPPTADEDLGAWLQAVARVHGT 148 T 0.029 VMAP-M0 pdbpssm T Viruses T 7adt 1 A,C A,B A0A0R8HV90_ORFV Apoptosis inhibitor GPLGSMANRDDIDASAVMAAYLAREYAEAVEEQLTPRERDALEALRVSGEEVRSPLLQELSNAGEHRANPENSHIPAALVSALLEAPTSPGRMVTAVELCAQMGRLWTRGRQLVDFMRLVYVLLDRLPPTADEDLGAWLQAVARVHGT 148 T 0.029 VMAP-M0 pdbpssm T Viruses T 7adz 2 G,H,I,J,K,L 1A,1B,1C,1D,1E,1F A3HTC3_9BACT CAP ADAPTOR PROTEIN (ALGO2) MQVSSSFRSFLKLDILHSYFLNDGEKDFSSMNEEESKTQLKSYNWKDFLEIYPSQKTSHMMRGNKIFFKSFNDSIILAIKVESGTENQPFNELYEDESMTFLLSLKDQYFGNYTDLDLADQLLYFSNKTPVLPEAFTFKPIDRINQSGTVGEEYLYEGENKKHLLEEAHLNPGGGVLGIIQIYMKGDTPVLSLINNDGTLKNSLPHFKIHFSNRKSTWKYINLKDDFETETKKDYPLTKFGFILLDKKSDFISPPAHFEKYVFPNPDARRIKITPTKNYSEIFI 284 T 0.12 PanZ pdb F Bacteria T 7ae4 2 G,H,I,J,K,L a,b,c,d,e,f SHDD_SEDHY PHENOLIC ACID DECARBOXYLASE SUBUNIT D,PAD MKCHRCGSDNVRKMVDSPVGDAWEVYVCEKCCYSWRSTENPVVMEKFKLDDNKIANMGVIPPIPPLKK 68 T 0.00011 YjdM_Zn_Ribbon pdbhh F Bacteria T 7ae5 2 G,H,I,J,K,L b,c,d,e,f,a SHDD_SEDHY PHENOLIC ACID DECARBOXYLASE SUBUNIT D,PAD MKCHRCGSDNVRKMVDSPVGDAWEVYVCEKCCYSWRSTENPVVMEKFKLDDNKIANMGVIPPIPPLKK 68 T 0.00011 YjdM_Zn_Ribbon pdbhh F Bacteria T 7ae7 2 G,H,I,J,K,L a,b,c,d,e,f SHDD_SEDHY PHENOLIC ACID DECARBOXYLASE SUBUNIT D,PAD MKCHRCGSDNVRKMVDSPVGDAWEVYVCEKCCYSWRSTENPVVMEKFKLDDNKIANMG 58 T 2.5E-05 DUF1936 pdbhh F Bacteria T 7aeb 1 A,B,C,D,E,F A,B,C,D,E,F A3HTB3_9BACT BASEPLATE PROTEIN (ALGO12) MSTLNKHISIPKDMSSKDDLDFHFLREEGIRYIKELGSNFWTDYNTHDPGITMLEVLCYAISDLGNRINIPIEDLIANEEGGVKGQFYKVQEILPSAPTSELDLRKLFIDIEGIKNCWIKRERVTVFADLKNQKLSYEKTIWEDLKENQKAQFDLKGLYRILVETEDADKVLSESLEKAVFTKFHANRNLCEDLIKVEKVATEPISVCANVEVAPEADEELIHAQILIAIEDYLAPSPRHYSLKQMVDKGYTMDEIFEGPFLENGFIDTVELKASELRKEVRLSDIINIIMSIDGVKIVKEITLGNCDENDGIENNQWVICIPENKKPKLCKKTTINYFKGILPINLNPVRVDNHKSKILASRLENDLKAKDDLEPAIPQGTFADWGEYSSIQHEFPETYGISDIGLPPKLGVKRAVLARQLKGYLLFFDQILASYFEHLSKIKSLLSLDQGPSFTYFTQAIKDIKDVEELFKDPTLLENDEELTKSLIGKLDDTIERRNQLMDHLIARFAENFSSYAFLMKFLYGESTDEIVLQDKQSFLREYKEISRERGEGFNFYEQSNDNLWDTLNVSGAQKRISKLVGVKDYSRRNLSDTAVEIYRYEHVDGNWVYRWRIRDENGKVLLSATTSYPTYNSAGNEMYFAILKILETPLSDLEKLLEVNFRNENEAGSFHFHKAATSNKFSFDIINPVIDSESSSDFIVAKQYTYYPDRTQAVLGAISLLNFIKYTFTEEGIYLVEHILLRPSPLDPEYLAMQTDAGKEYIEGNFLPFCSDDYENCKMIDPYSFRVSIVLPGFTYRFANKDFRDYLENLIREELPAHIVAKICWIGYRKGEEPELFQEDVENPETPIFKENQLEIFEKAYKNYLFELTDIHKRKGFIASMNKYNQVLNEMTSSLTGLHTIYPTGRLYDCEDEEEELDGKLILGKTNLGTL 933 T 0.00032 DUF276 pdbhh F Bacteria T 7aed 1 A,B A,B D1LHF8_ENTFL PrgL SVGQRKQVNTNEKQVKVEKKEELTTSTVKKFLIAYYTKKDLGENRNRYEPLVTSAMYNELVNVEKQPVNQAYKGYVVNQVLDTYKIYIDTENNEVIVDVTYKNTQRTKRNNDEGALKNQSNQEALKLTFVKQGANFLVDKMAPVTLTNELQEEPNSYNTHVVTTEESAKESANSGEKLEVLFQGPHHHHHHHHHH 195 T 0.00049 TraE unphh F Bacteria T 7aef 1 A,B,C,D,E,F A,B,C,D,E,F A3HTB3_9BACT BASEPLATE PROTEIN (ALGO12) MSTLNKHISIPKDMSSKDDLDFHFLREEGIRYIKELGSNFWTDYNTHDPGITMLEVLCYAISDLGNRINIPIEDLIANEEGGVKGQFYKVQEILPSAPTSELDLRKLFIDIEGIKNCWIKRERVTVFADLKNQKLSYEKTIWEDLKENQKAQFDLKGLYRILVETEDADKVLSESLEKAVFTKFHANRNLCEDLIKVEKVATEPISVCANVEVAPEADEELIHAQILIAIEDYLAPSPRHYSLKQMVDKGYTMDEIFEGPFLENGFIDTVELKASELRKEVRLSDIINIIMSIDGVKIVKEITLGNCDENDGIENNQWVICIPENKKPKLCKKTTINYFKGILPINLNPVRVDNHKSKILASRLENDLKAKDDLEPAIPQGTFADWGEYSSIQHEFPETYGISDIGLPPKLGVKRAVLARQLKGYLLFFDQILASYFEHLSKIKSLLSLDQGPSFTYFTQAIKDIKDVEELFKDPTLLENDEELTKSLIGKLDDTIERRNQLMDHLIARFAENFSSYAFLMKFLYGESTDEIVLQDKQSFLREYKEISRERGEGFNFYEQSNDNLWDTLNVSGAQKRISKLVGVKDYSRRNLSDTAVEIYRYEHVDGNWVYRWRIRDENGKVLLSATTSYPTYNSAGNEMYFAILKILETPLSDLEKLLEVNFRNENEAGSFHFHKAATSNKFSFDIINPVIDSESSSDFIVAKQYTYYPDRTQAVLGAISLLNFIKYTFTEEGIYLVEHILLRPSPLDPEYLAMQTDAGKEYIEGNFLPFCSDDYENCKMIDPYSFRVSIVLPGFTYRFANKDFRDYLENLIREELPAHIVAKICWIGYRKGEEPELFQEDVENPETPIFKENQLEIFEKAYKNYLFELTDIHKRKGFIASMNKYNQVLNEMTSSLTGLHTIYPTGRLYDCEDEEEELDGKLILGKTNLGTL 933 T 0.00032 DUF276 pdbhh F Bacteria T 7aeg 2 B B N-[(benzyloxy)carbonyl]-L-valyl-N-[(1S)-1-(carboxymethyl)-3-fluoro-2-oxopropyl]-L-alaninamide XVADX 5 T 1100 RE_HindIII pdbhh F F 7aew 2 B,C CCC,BBB AMPN_HUMAN HAPN,ALANYL AMINOPEPTIDASE,AMINOPEPTIDASE M,AP-M,MICROSOMAL AMINOPEPTIDASE,MYELOID PLASMA MEMBRANE GLYCOPROTEIN CD13,GP150 EKNKNANSSPVASTTPSASATTNPASATTLDQSKAWNR 38 T 0.069 MacB_PCD unppssm F Eukaryota T 7ag5 1 A A Laspartomycin C double mutant G4D D-allo-Thr9D-Dap DXXDDGDGXIP 11 T 5.6 LCAT pdbhh F F 7agb 2 E,F,G,H I,J,K,L KB70 XVVXAXA 7 T 5.5 DUF5807 pdbhh F F 7agc 2 D,F,G,H E,I,C,G KB74 XVVXAXA 7 T 5.5 DUF5807 pdbhh F F 7agd 2 E,F,G,H I,J,K,L KB75 XVVXAXA 7 T 5.5 DUF5807 pdbhh F F 7age 2 B,D,F,H I,C,E,G Pepstatin XVVXAXA 7 T 5.5 DUF5807 pdbhh F F 7agw 1 A,B A,B KHTT_BACSU K(+)/H(+) antiporter subunit KhtT GSGLNIKENDLPGIGKKFEIETRSHEKMTIIIHDDGRREIYRFNDRDPDELLSNISLDDSEARQIAAILGG 71 T 0.067 Imm40 unp F Bacteria T 7agy 1 A,B A,B KHTT_BACSU K(+)/H(+) antiporter subunit KhtT GSGLNIKENDLPGIGKKFEIETRSHEKMTIIIHDDGRREIYRFNDRDPDELLSNISLDDSEARQIAAILGG 71 T 0.067 Imm40 unp F Bacteria T 7ah0 1 A A 4D2 GSPELREKHRALAEQVYATGQEMLKNTSNSPELREKHRALAEQVYATGQEMLKNGSVSPSPELREKHRALAEQVYATGQEMLKNTSNSPELREKHRALAEQVYATGQEMLKN 112 T 0.055 Cluap1 pdb F T 7ah9 5 Q 1Z SptP3x-GFP-FLAG XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 141 F F F 7ahi 5 Q 1Z SptP3x-GFP-FLAG XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 141 F F F 7aht 1 A,B A,B KHTT_BACSU K(+)/H(+) antiporter subunit KhtT GSGLNIKENDLPGIGKKFEIETRSHEKMTIIIHDDGRREIYRFNDRDPDELLSNISLDDSEARQIAAILGG 71 T 0.067 Imm40 unp F Bacteria T 7aih 4 D D Q4QI77_LEIMA uL10m MFSRGAAATAMAKVSRLVSPRLRIIHRDYLTRRGGRTHQRCSAVAVDYTPTYFATYKSDPGQCPRLIDAEAVHGDEQAFWSARRDFYRGGASRSYYPAWDRQAQALIMLTREVPRIPQEAAFRLFTLGLKMMLLPRLVAGVELMLPSWVTMNAESVLNEGLEGKVAEADGDGKATGAAADAAALPSASSAGANEDSGSANAEKR 204 T 1.8 DUF5783 pdbhh F Eukaryota T 7aih 10 J J Q4QCY7_LEIMA bL19m MGYTRERTNRHFFVSRANAFFSRLPISRIQRALAMEAIKKGSMKPWKHTKEQIIGSPITCNFEYNPRPVRLIGTVMDAHTEETSIKGGLKVYARSEEANMMLWIPAGNPKLKYEVTAAKGSFEHYLDERSKWDEAWLTGRARMK 144 T 0.074 Endonuc_Holl pdb F Eukaryota T 7aih 16 P P Q4QG34_LEIMA bL27m MLRITPSRYASKVTAGNAKNQAGSPRQKAKLFHVIPGTPVTPVEKLKEQRRRFGQDRYSRQPEYRPGRNVRMDPNTFTLYATTKGVMTIRTSRINPSYKWLDVEPDIQKVYRSRCMRAALQARGKASMMVAGNVHYRAELDHVTEPHWRERVMRVPKATERFQDPNYFTRGLVPSLRPLSRYSYE 185 T 0.31 Ribosomal_L27 pdb F Eukaryota T 7aih 17 Q Q Q4Q719_LEIMA bL28m MLRQSSLLCFSTFALNPETSRAPHGPPRGLINRYISMGLPPWAAWCNRVNRHALYRMSDVSPRSFLPKAPHEMDVIWMNERVRERVRTSRQVQHVYRQLKYPFVKTGIHYSDTLDHWVQVPMVEAAMFEIEKDGGFDNFILKRSGPELRSTYGERIRRHLLVRQKETQKNFVLDQQAKALAEVTQAELMKATSEEELDAVLAKYGMDAEEFKRLMAKRVMEQRKSVAAAGLRSK 234 T 0.012 Methyltransf_5 pdb F Eukaryota T 7aih 19 S S Q9U0Z7_LEIMA uL30m MRRCVMAKGEDPAHVAGWDDRQDAVEWWWTEANDSRGRQRLEAAAAVAAAAASSTVGLPLFPRFSPGRRRRRRPPAPPPPPPPLFLSRHLHSMPWLWCTCVKMQMYYTPTALTCPLSNSLAAHVGHIIVGVAALLPYSMLLFLTVMCNPRKHEPVLRAQRIRWLTFHSLMFRLLRCITASPAVAASVAVAAAQTPTSLRPAAVCRRGVHLAPSVLAASAPPPPPQQQQQPTSAAVPASTATSTTTIAAGPYRRVGNVFIVTCIDHPFKFSWEVNRMLRELRLEFMGQTTVVPDIPPVRKRIWRVRHVVRVDQLDLDEAKALIGIPEHISFRDLAGQIPPTFGRGGSVANPHMRSKMNFMRLRRMRLRDVMHRDQLEKRLLEERHHALQQQQQQQQGGGEAAAAAAATTA 409 T 0.00017 Ribosomal_L30 pdbpercent F Eukaryota T 7aih 20 T T Q4Q2W9_LEIMA bL32m MLQRTTLRCYSALVGQATPVLLGSKGGTPKRKKNPMQLRRKTYGLHFKERYLKLEEWYFCPLCAEPKKQGEWCRREDCRQIKP 83 T 0.12 Metallothio_Pro pdbpercent F Eukaryota T 7aih 21 U U Q4Q2Q8_LEIMA bL33m MFRASCTLLGHGQYKTRLKKRMVGFIPKVIPRKIRNNMVALRSEANTGHMEGYIKTEAERLDATGRKLQKTMWDPVLQRYTLMKETKVRGPFLTKSNIARKVDFPVGALHGTKLGGKK 118 T 0.0038 Ribosomal_L33 pdbpercent F Eukaryota T 7aih 22 V V Q4QCK6_LEIMA bL35m MFRISLICFPKAGCEEITRQGRRVVLKPQEYFAQHRMQVWQMRFKEMGPPFSRVWVALGGKMRRRRIGRQIDVKDMRYYWRPIEPQYQRLYMSRLRIKDHSNKRVQPMRLRATNNDIGQASSLKEWERSSDRKYGAALAPPKKRDFEFRVF 151 T 10 Gln_deamidase_2 pdbhh F Eukaryota T 7aih 23 W W Q4Q6A3_LEIMA bL36m MLQYTSSARQALRATALVLNFFPLGYTCGPKNKQVFFPPNNLDGRTTHQMKKLQGSTDKHPGLVPRDKLKLHCEFCRFHWVQDTLVVRCAAHPKEHNQREIWLEPTWTWGKQQPYQYYKYMPVNINPRTGMPLAREDAKGMNNERRSQGLPTKTRLLERERRGISRAITGLGIYNQRWQTRFPFAT 186 T 4.8E-05 Ribosomal_L36 pdbhh F Eukaryota T 7aih 25 Y Y Q4Q448_LEIMA mL40 MWTLSRPCLAAVRTAVLCQKKQTAAGYMASAGKVGNEEKWAQAAMEYIHEKNHVNDARKRQQDVDQERSIANAYDRYSAVSEAKFDERLSRLIARMSEALEEMRNLGLEEALEEAVLLNSEQPPGHYRRPSLTPPLAGYEPGFGLDVPQLRSQQAEYPPLRRPTDWLEFGEGGADDFPYVDTHKIEDLTAKHEAQLEEQHGVLREAAPLTGVEGEGWEAYVALHRKALARQHLIMDLHNDPELRDKYNADEAFRAAEWERRGMGALSIEAPLERDLELHYAQVPAYEAFRSH 292 T 0.00047 MRP-L28 pdb F Eukaryota T 7aih 26 Z Z Q4Q152_LEIMA mL41 MLWCTGPRRIVFHNAPSVYPFTKPFHDTPYDQDRGRFDKTKNILRENKWPAWMDHGADGTGFGIGLNRTHPLSKLRGNLRRNPSEIPRVLNMMIQGVWHKSGNKLYFRGGKPPNPSTHPYLTGEPCPVYGWKVTDPGVIREFNLPQPEDKTRYKPYVALQERKIMGMQAPTKEHSAASTSAASTDSKPLMKRLFFWK 197 T 3.3 MRP-L27 pdbhh F Eukaryota T 7aih 27 AA BA E9ACP5_LEIMA mL94 MAQWIPKTAWKVSNLNKRYGAPYVAKGYASLDPRCSLDAYSSFQQTVTSADMKKALLSIDSTSSGALVIDVRSEPERRLRPLLSPAIVALHPHDILSGAACPILPSNKERAEMFVVASEAQRAVNACTALRRWGFSRVTAVSVDAVSEAIAAVQKPADAATSSSTKS 167 T 0.0043 Rhodanese pdbhh F Eukaryota T 7aih 28 BA UA UA XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 203 F F F 7aih 29 CA BB Q4Q4D6_LEIMA mL95 MLHRSCVLVDSFKEHYHRVHLPRRLALQRYIKREEARLSRHKGKAVAAAAAAGVQPGEVAYKYNRWWVSNDHEFVHQFAFVEDPDVTREKRNTLPLVTKENIWKEPQQTFFLPFAPFVRVVDYAKDPDTKFLKPVNIPRWKDYMQRTKPIVPRTWY 156 T 0.14 DUF4653 pdbpssm F Eukaryota T 7aih 30 DA Aw E9AD00_LEIMA mL89 MSSGAVGRGSFHSVVAGANPRRIPTYYNSAYELIQLHRAHREVTRNFLVRDKVFDNKFPGCSLANGLFKMVPNKRGNFHTRELTESIRHRTIWGQRIQQQRTINAAILEDATKVLSPAQMEDRFSYRTPDAAAYFSPQEYTAANNWPNYWQHPTEKHVVPKPRWRREPELGGITRVRDAVATPIADY 187 T 0.092 DUF3295 pdb F Eukaryota T 7aih 31 EA Bj Q4Q0E1_LEIMA bL31m MVLNKWAAVTKSAPPAAGLRPLARTVSPNPKLRPADYKVPYVLRTFIKDRHSSEMQHIENRGMYREELAIERSRFPRMQKTLTIQTDGSLNEREFEFAVPPVVMLFQDRLSAHRQRQVALAKIGKLKRVKSWETSVRGKESLNPVCNALVFPYCVPKKMLVRPRIVDPLSAKSMADNRRSRDDPS 185 T 11 RNA_pol_L pdbhh F Eukaryota T 7aih 32 FA An Q4Q0F5_LEIMA mL76 MLQCTALVLKSQHKNVLRKGRPHMQKYKELNRWQREAQGITKWEQGHSHRPQPYVERFNPEGAGLTRGTSAYAWKWWHTQYPWLPNVAPADYVPPSPRGIRPAAWDDEFADVVLSMSDEEIQSYLLDKLTEVIFAETQRDGYELRRLDFEGKPLTELPERRIIENFVFEEETLRERVLDRVVEGVFRLVPTSTDRLELKSVANIIDFVLTHVTVARKPLQHEIPEAARTVMRSHPLQPQLGFVHALPTDNRDAVVQEWERMHHLDWQFGKAVYEPRSAENERGNLTWLREVRHHEAREAFQADVDSGEARRRHMAKIKAAAQVPHTGTTSQ 331 T 2.9 LRR_1 pdbhh F Eukaryota T 7aih 33 GA Al Q4Q1C8_LEIMA mL74 MLSSAHRAAFARPTATLWASARSFGAGPTRLLLGLEQVQDVPTSTDRKPTGMHRGPGKRQTAPKEAAQYQFIKKWDLQMRETWDELEPFKGLPKPKVQFGNEAAEVIWPYALLLENVIKVHPYTKSIYVYYSQRQSTPLGELAARVAKRVSQAYLIPITFHNSHVYVEAEMLLEYSETPWVVVHCLDGTHKLIPVKPQAGQTVKEGAEEVLNGIVSACNEIGSAVKNPKEVMRLLSERPLQNQYVRVNYQWYGDTPEERMSHLVKWDYEPEEVVPQLRNRTQHVLDWMNYDGNLPTHNSVRVNIHREAARMRKPNVSAGPKTFFNSSGSRANARTARFDNSRSSQS 346 T 0.011 L51_S25_CI-B8 pdbhh F Eukaryota T 7aih 35 IA Az Q4Q4D9_LEIMA mL93 MLRFTQVIRKNPVVFKQGQGMFSHQLKRILNKKSLHKYNWDPLHMYDPRKLVHANRYVDHDTYEEKYDPHWEHNAHLVPDQQFYNIPVPKEYKDAYWWRDLQARRVQCPTEWVHFRMHTKDKLKYDFQDLAFRKKFEYSYEDVVANAKDMCS 152 T 7.6 Pox_VP8_L4R pdbhh F Eukaryota T 7aih 36 JA At Q4Q4L5_LEIMA mL86 MRPSALCLGGFTMKYKRGTGLWDEDHVNDFDANKYLSARSTMRWYYGMERLQTRNNMNARRATQSYNNNMGLHHSGRGAFERELERRGIQVDKYPLTTTTGAARVAEMVLLRRQELEAHAKKAMESQRQARRRDAPSEWYDETEGPLNPRFLASMQSNYTQVITELPSSPVTGRRELPGASFA 183 T 2.3 DUF2663 pdbhh F Eukaryota T 7aih 37 KA BC Q4Q5D8_LEIMA mL96 MNDIYARRLAQTSMFHQLMRSHGTLWAATQVTKEKLNLAFVKEEMMRVNGRRAMPLLIGAAANENLNDTHFTHLTEHCAWTESARAFAVQRQTPLTQHIASMGRMAETITQAKTASTSQLLFNEHLARIDGISEFEEEPFVDDEDDS 147 T 0.027 Chloroa_b-bind pdb F Eukaryota T 7aih 40 NA Ap Q4Q7V3_LEIMA mL80 MQRCLARLFQAGVHTPHGSRYNAARMKNWPVQEVPQNFNFTNEQRFKAKAVPRDTGKIPRDFLLSVLYRNQPCEVASLWEHCLHDPQIVLDSKRHLREVLQQARAEGFVSFEKDAVTDRWVCHLTRERFEEVRALVGARAEAQDLYSGLRGASATETSAYSESFREMNEDTKREHFRLLSEQVADTTTHLRKFQRMEMDYLPYTDLNGKVNFMWWYEMSDTRDATALPEAAAEGSPKLSE 240 T 0.15 Gluconate_2-dh3 pdbpssm F Eukaryota T 7aih 41 OA Au Q4Q8J6_LEIMA mL87 MMLQHTSLLCRKALQSYPVPPRARNYERRWSSSRTNPYNRMFWRTVLNEDFARPSFWVSDFRHKYLAKHGMDYQGRVPASPAPGMYQGFSDVHKILANHPKPQRESRHLPVMPMTPRVVFEHAQEKRIDYAKKMHRDRRLVEQLRTHEFWGWYMKLQRVRGRWCKEHGVSSRGVYGPAVDAAELWG 186 T 1.2 Crl pdbhh F Eukaryota T 7aih 42 PA Aa Q4Q183_LEIMA mL42 MLRLTQAVLRVQSHQKKRAQHPNAGTRFGRVYNRGFVRYGFGGFGMSVYSSKKDRTFKVMPVPPPPPATTAVEQRDDFADNRGLSATTRTLSPTFRMFALEDGGVLVSHPSHAQIMRWNQRVHTEEGKAANSTVMDEYVNSRIQAIIADNTIENTSLSQWRKAHMWNVIKSHGKLQRRWGTPDFVMGARSTLYNN 195 T 5.8 RHINO pdbhh F Eukaryota T 7aih 43 QA Ao Q4Q547_LEIMA mL79 MLRTTHVSWASTAKGYMNRVMVYAHRRRKARYLAPKNAHVRSPLAHKMPEEYGNTWDPRSGVEWHNRMRNRNHYRHWPWARWTDDPVRFHQDSVCHRTVSALSTVANNGAPEWDYYAEVGQAYETPSHFPLSYTAPFIYQYTAQCWSREDLQSYLERIEQSSGLRTIADAASRREALYTWWHNAGMNVIPLGVLQHLELVSRDIVAQNARKSYRIEQHERGILRTPEMERYYALPHLRGPSMPVQLAQPSGKYPSGKFTQMMEDVAIHPLQKPDARYKHNMYPA 284 T 6.4 BNR_6 pdbhh F Eukaryota T 7aih 44 RA BM Q4Q703_LEIMA mL70 MSVFPGLCGDVATTNYRVFLGTLPNLAVEERFLRQVQPVFPWYASRKHVKEQASEFLEIDLASCDPELLLRYTHVYYVRRQLYDELVDRQLTLMETGKAAKVADSALLTCLAQVNAAITPRLQYELHLLQQAKKACRVPRRRELNPDAALEAHDYLCMMRVVEEDVGGIPDAEMQARAYLPREVLEAKVKELAAMIFGDGGSATKGTGAALERKEQKLLQRMIPADYNKVGAVEKLRPVDVTALYRFTGERVCGRPADKPFARALWGHVFRKVGSHPLYLQRASLYWARHSGLDPQSATSAMPADLATAVCVQQALFPALKYRCQYLYTSPDIARQQWRTGHVVPLLRLFPLLGAPAAEDLAAQLVVEGEWAKLGIEADTNLLHDTVLRQLKDMVEQVSALYESDAGAVLKRVEDGAKVLCPSLSERESLTMRGAPEDTSREVSAAAAARVANAAPA 457 T 0.37 DUF4911 pdb F Eukaryota T 7aih 45 SA Ar Q4Q712_LEIMA mL84 MLRWSRLLREMAPELQLEYIPIIFTRTILGPQGGFAGEERLIKREVAQKYMSEGNAVTPSAEFHQGVWCYNPDSEQYDRFVERNAEFLDFAARKRQWLDVYWRVNTGYLLFGRQSWGQGWLLNCPLRKKDIAQKLWEQYKVRVDPRLIEFREKDRRTGIQDLGHNWCWLYLPGAEELAIDREVYDNKRVKVRMHIRKMSSYGALY 205 T 1.8 Ribosomal_L9_C pdbhh F Eukaryota T 7aih 46 TA Aj Q4Q728_LEIMA mL72 MCTRVFDVSQRFILLVSLPPLSLSLRPSCCSRKAATRRRRSTASVLLATDALSLFVRTHYPSPLLFLLLPNFPLPIHSPRKQMRAAVGAVLQPSSSVGALRCQARFITRLYTSYFKGELFPNQLARPLERLPRGVSLAAARKGQQAAAPSSSGSGNPATTASLDVVTWGDVDSTDLVHANEQSRAVAPQAGLAPRRPYVPLGEVAKLELQGDYLTEGGLHQEALEYYGVVAKAYELAYPKDHPQVAGIRLKLAGAFRRTGRLTSSKANCEAVLQMLDSAVQPPLELIVEALFELGLTSEAMSDAAAGTVFEEAVALVDMFHNSGQSHKMLRLLPRLGRRFNLNFEEKFVYFSPFDYDRVFALADQCLERAEVFYQARNDRAGVMRVLQQRKELIDKKFFNMRDFAGRIHTMRGHWKRRAQVLTNAPTPDELLRYSPTIHQVYRDFKYELNAPIGREKEVQPGVNRVVHDMGNPYRRSGVRSQRMFRDAEKNFEKYIRADAFEA 503 T 0.0013 TPR_12 pdb F Eukaryota T 7aih 49 WA Aq Q4QD92_LEIMA mL82 MLYGGSRVQYLVQPPFTLHKIRSENLPPPSLYAERHDLGLEMQLPRDMHVYNSINMAIQRQVGGDSSTLDGEQQQLQGGHADGFDMGAFFTEQHHPERHHNSSLPYAKHDTNNVLAMRLFPVNVGVRARTEAIRIRTDDCLQRLRDADLCAKMRLPLEHPLPLSRRSQYAAIHRVRQERCYDAPTEAAGERAAAAAEEASRTAHLRGAAAHPPPSELSIVTRPVDRLGSHSGSSAAACTADADHLSFPVHPFAAAAVSSGCHSARSGSAARLASQRWLPLQTLKPMGHNWSAATRSSGVRGPHMQLMQERLDQKGFGWKRKSRSLWQQDVATAGFRPHRYF 341 T 0.21 T3SS_ExsE pdbpercent F Eukaryota T 7aih 50 XA BE Q4QE16_LEIMA mL98 MLGGLRPLAAATRRTVGGALVSPALITPSRALSVRTEDFFSKEAVSHARRVSWAPHTTEKKVGAFAKLSRSNFNDPLPVSFQSEPYFEEEIEAYRAHHRPDVYVYKYNVSPTHLSLRE 118 T 3.7 DUF2975 pdbhh F Eukaryota T 7aih 52 ZA BP Q4QGE0_LEIMA mL52,mL52 MRRRDWCGVCLPAATLHALARRYSEYRSSYTGARSAPWAAPEAAPAYPSARSPFPLERPRFRKTHIEWMLHHGHGDRYGKYGPSREIADFEYADGTPSSISGKRFALKHHQDHLLVQLIRSAAIVERFEEEELLPRIPGTPEQRSWDPEIPLFLEDVDEFGRPPRPVAGNMVARVIEERFAQESGRTPVNLANKHAGEVLEPNTMFATYDPAAFVSDDIKKDVRRPFWSRRRWALSDNFMVPMSPKPKNTIKDE 254 T 0.0014 MRPL52 pdbhh F Eukaryota T 7aih 54 BB BF Q4QIQ1_LEIMA mL99 MRRTVRALYNSFERGWKDKTVHPLDRRGRFNLDEAAAELQLDEAYVASLYKPLHYTYSMKGQRYPAEQGRTSRPGSLAASRDRMFPLYRRNYKLNRELRVLDHRRISTD 109 T 0.52 DUF6416 pdbhh F Eukaryota T 7aih 55 CB Av Q4QIT7_LEIMA mL88 MFQRTCTPRLLACTSALLKRSGKPSDLPDYKQVYLPYDTAPTKTELDRERRKFMHAYSGRMEHRKMVEVKDVPQNMYTYGKEGMSIPISIFKDQADPVIGPEWTYPGIFENKIVAQHWYMEELFDREKSNTFESPWQRQVLDNQVKRRLGKVAWRMSMLNIKTIDIFHKERGASKRPGAGDTKAPATPAGKK 192 T 2.1 Ribosomal_L37 pdbhh F Eukaryota T 7aih 56 DB Af Q4QJB6_LEIMA mL63 MLRRSPVPRRYRTAWRELLHPLPVWARRQQWLKRDTVEMNEAILREPYYRIKTFAQPAAFVSPRVSESAAHEPDTQQSSRYGVDRQLRGPRRAVSPERLQELREQLQFVGSIGPKVPPAAGAGTAYQDEYGTRLRPRYPQSWDTVPPHQPSRSEI 155 T 28 PsbP pdbhh F Eukaryota T 7aih 57 EB As E9ABZ5_LEIMA mL85 MRRLPLFCRRPSRCCGATASGSGSSSAAVLAASAAPSVLVLAARGIATSGRVTNEDRRWWLVHLECAPDVTPGTFVSWLDCCGTHTTKKLIERNIWTIEQVAELDSDRVDELKYKEGCLKMDVVWEHARTIITPLKQREVSGGVESQLQSRILELRKKRELERQRELLARERATVSDKREETLRRLRESVAAKKAALRKKLDEQHGEATPAASESASTEAHRGTAEAAVEDEAVGNIVDRMSGGNPPRA 249 T 0.39 OmpH pdbpercent F Eukaryota T 7aih 58 FB Ae E9ACG2_LEIMA mL53 MTAPASHYTFANLKKLGLCAPQVALSRQPRLRPHVGHLNGLVYPLPYYAMWRGNHDKYTYNQATPARWGEGNTNTMYHQHYAHAKCPTDYGRGGREFQFLSVKRGKLKRKPLPTVQYVDPNSKPQWVFKSWHNPLSAPSMWEREVQYPEHTPAHTGAKRPLAVVAPKTSHKHLFLMHMEKVTVTVSPLLFGYGHTLQKAALDFYRRGLSARSPFPSDKMFLYYSIDHITPKIEVTWLDGSVYVPPLIEGVKAQDLIQMVMEQAWLAADRMSAEGRVLNPIAIDDYKWEQLIAFKQKRAKGAEAAKGGAKKK 311 T 5.7 MRP_L53 pdbhh F Eukaryota T 7aih 60 HB Ah Q4QC45_LEIMA mL68 MHLHISSIPHRNSNNSKGGVLDATGPMLSAKRGALLLQAYHRPGEVISYKAGDYHLVPKKFTVGKRIAVRSYLDRNRTELSDRTFMPQKNWFRPYDLQDGCFDRDHERLSYRFYNLETKVIWKAFDTPELIGMLLHDETVKGNSGMYAPDMLDAALHYTREARYWRCIGITKPFYDRNTLRAHCWEDNGLQVGTLVMSQAMRHALMDLERAVRRKELGLEPNYLWDRWGPIGFIDGARADYLPRFEHNPYVDPDGVDVTEIDVLPFNTHEQIRERYRDFIEPDTAPFEEVFRSPSHGSLTTLADIPNASVVALYKDLKLKAGTPVAGDAVELAPADVRTLFYLSANPEWRAVADGKASWEEVVDAMQPVQAELDEKIDAARLLQNTRHNAERVRAFFEEKCGFHDFMYTPDKTITAAVLCYLTELRRICTETAWGAALAKCLTDMERVQGMGRDAFLVYRHIEDAILDKKRRLWAGRFAGESHEESTLDYLLENFGRRAERPRNVGTTGVEFDREQEPIGRQVQRRVLDSDKANKLAEIRRSRGKMWSKKRSVFDALHEKQLQNFNYGVH 570 T 3.9 VirE_N pdbhh F Eukaryota T 7aih 61 IB BD Q4QE11_LEIMA mL97 MSNRFFQKFYLRCGNCSAIQRSAQGYQPIANPILFKSDEHCRNYHDEQRRAAGYSGMVVTCRCHRCERVHSNWKVLDAQQFLDAKLRMTPEERAQRLWVSKS 102 T 0.91 Mu-like_Com pdbpercent F Eukaryota T 7aih 62 JB Ay E9ADN7_LEIMA C2H2-type domain-containing protein MLRIGRTLLAEVTTINSTTASVSGRLIRIRKKSKWIDRRSTRVPHNGKDIWYFGDQPSCALCHIRFRYKQDYEAHKESELHVNRLRWVETMNWWRETGEPAYLKASNEQWEWFEQHVLPTKAQEMGCTLDEARRVYRQAIMTETPTWHRPLQCPTVKQEVQEPRDQRWPASPKW 174 T 0.0078 zf-met pdbpercent F Eukaryota T 7aih 63 KB Ag Q4Q829_LEIMA mL54/69 MAFRGSSARLAATPGVGIAPETTPVKYVPEMLNIQNAKWWNGRGKPVYRSTYNEKSWLEKARWGAFTKGSRPVMRQRYSAAALKEALEMVPEGFETCDVPRPPQRIRAQSEGVVGRWYTNYWTLHSVRYQCQLAGVEWQFGERQRPRTNYDEPHMYTDFEETKAIRDYRSRWINVNRSLVGMSRRMKESEEEARYLHFKKVQDTFWSNRKVLVNRIKSMHNQGTLQSAKDLPIKTINIKAFLAE 244 T 0.036 DUF1672 pdbpercent F Eukaryota T 7aih 67 OB BG A0A504WW14_LEIDO mL100 MLARYLDPSVHPLRVGQVVAYDYLHAAKTWQWTLGTVREIKDYTAVVQQWGLHTGDIDTLRSILLKEVDTENGRMKNYHDMLAIAREKLASIRRSNEDRVSHVRGHFDKAREKVELIDEVDLRKVTAQAAPSPVAVAVLKAVWAVAKCDPTAVEFYEWADVQLEYRKPAALDEIAKTDVLAKLYPSAESLQQSLEQDPKLNYKAAARDSPVVASLHAWVITALAYQQAYNLLAHDKRIQEQNDAIAAAIAGMKACRAKIAKLKDELSSKDTAALPGQVTSFTRTSVLVTIPLSAVISPVNVDTDVKRCVLTKDEVEQIPIDAKITRYAQKQKLAITGSHLLDQYAAATTTHIYVTELEDRLFFFQHYMASALRDAQTAAVDAHQRLAVSLHELEAFRQKRHDAKKARAAEPELADADGVEPSSGPTSSRSPTGRAAPRGQSAAPRGTASQQHKLLGPAYQSIDPATIANEPLYAVTIEEYKAKDAAGERAMDEAERMADEVQRLAVELEDAKAAADKLAEELAAKDEELAAHRQKRHDAQQARASDPALAAADAVAPRSGKGAASPHVGAVQRQAVDPATVPVAPAVIAEEPLYVATAEELQHVRDFADQLAEELEAFRQKRHDAKKARAAEPELADADGVEPSSGPTSSRSPTGRAAPRGQSAAPRGTASQQHKLLGPAYQSIDPATIANEPLYAVTIEEYKAKDAAGERAMDEAERMADEVQRLAVELEDAKAAADKLAEELAAKDEELAAHRQKRHDAQQARASDPALAAADAVAPRSGKGAASPHVGAVQRQAVDPATVPVAPAVIAEEPLYVATAEELQHVRDFADQLAEELEAFRQKRHDAKKARAAEPELADADGVEPSSGPTSSRSPTGRAAPRGQSAAPRGTASQQHKLLGPAYQSIDPATIANEPLYAVTIEEYKAKDAAGERAMDEAERMADEVQRLAVELEDAKAAADKLAEELAAKDEELAAHRQKRHDAQQARASDPALAAADAVAPRSGKGAASPHVGAVQRQAVDPATVPVAPAVIAEEPLYVATAEELQHVRDFADQAAHDAATREAEVAGTVENLRNELDDVREMNAKLEDEVFALKEQLSDAEDAYKKLAGALVVAEDERQELCDDLEAALDELEQKKDEYDELLGNLEEVQGLLEAADVAGRTAVEALEQRNRDMADLQGELANALDASKENENLRALLDAKEREIDRLKEYNSFWTDTVGTGKQKVTHRLTKIFDGDWTRLMRHRPEALKAAFVIDSSNACHVPGDQIFLVSNSFTRRLLTRTDHCPKCDRLSTFRFMSVSGMVGRMPYKPVDTPGPSYATLYWRKQRSGKIASQPLNEVCNKNEF 1347 T 0.012 Fez1 pdbpercent F Eukaryota T 7aih 68 PB UB UB XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 67 F F F 7aih 69 QB UC UC XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 144 F F F 7aih 70 RB UD UD XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 95 F F F 7ajt 45 WA JN FAF1_YEAST FORTY S ASSEMBLY FACTOR MTLDDDDYIKQMELQRKAFESQFGSLESMGFEDKTKNIRTEVDTRDSSGDEIDNSDHGSDFKDGTIESSNSSDEDSGNETAEENNQDSKPKTQPKVIRFNGPSDVYVPPSKKTQKLLRSGKTLTQINKKLESTEAKEEKEDETLEAENLQNDLELQQFLRESHLLSAFNNGGSGSTNSGVSLTLQSMGGGNDDGIVYQDDQVIGKARSRTLEMRLNRLSRVNGHQDKINKLEKVPMHIRRGMIDKHVKRIKKYEQEAAEGGIVLSKVKKGQFRKIESTYKKDIERRIGGSIKARDKEKATKRERGLKISSVGRSTRNGLIVSKRDIARISGGERSGKFNGKKKSRR 346 T 4.1E-08 DUF4602 unppssm F Eukaryota T 7ajz 2 C C NAG-NAM(tetrapeptide) AXXX 4 F F F 7ako 2 C,D C,D CLSPN_HUMAN HCLASPIN MEELLNLCSGKFTSQD 16 T 0.37 RPAP1_C pdbhh F Eukaryota T 7aks 2 B,D,F,H BaB,DaD,FaF,HaH modified peptide XAKSAPAPKKG 11 T 46 NOB1_Zn_bind pdbhh F T 7al0 1 A A Heymonin APCKLGCKIKKVKQKIKQKLKAKVNAVKTVIGKISEHLG 39 T 7.9 Herpes_UL33 pdbhh F T 7al2 2 B B B9AGF7_METSM Cell division protein FtsZ QLDDFIDGIF 10 T 2.1 DUF4316 pdbhh F Archaea T 7ald 1 A A R7TSD6_CAPTE BRICHOS domain-containing protein SPRVCIRVCRNGVCYRRCWG 20 T 0.037 Toxin_25 pdbhh F Eukaryota T 7alo 3 C,F C,F VIPR1_HUMAN VIP-R-1,PITUITARY ADENYLATE CYCLASE-ACTIVATING POLYPEPTIDE TYPE II RECEPTOR,PACAP TYPE II RECEPTOR,PACAP-R-2,PACAP-R2,VPAC1 RRKWRRWXL 9 T 2.7 zf-CW pdbhh F Eukaryota F 7am2 8 H J Q4QCY7_LEIMA bL19m MGYTRERTNRHFFVSRANAFFSRLPISRIQRALAMEAIKKGSMKPWKHTKEQIIGSPITCNFEYNPRPVRLIGTVMDAHTEETSIKGGLKVYARSEEANMMLWIPAGNPKLKYEVTAAKGSFEHYLDERSKWDEAWLTGRARMK 144 T 0.074 Endonuc_Holl pdb F Eukaryota T 7am2 14 N Q Q4Q719_LEIMA bL28m MLRQSSLLCFSTFALNPETSRAPHGPPRGLINRYISMGLPPWAAWCNRVNRHALYRMSDVSPRSFLPKAPHEMDVIWMNERVRERVRTSRQVQHVYRQLKYPFVKTGIHYSDTLDHWVQVPMVEAAMFEIEKDGGFDNFILKRSGPELRSTYGERIRRHLLVRQKETQKNFVLDQQAKALAEVTQAELMKATSEEELDAVLAKYGMDAEEFKRLMAKRVMEQRKSVAAAGLRSK 234 T 0.012 Methyltransf_5 pdb F Eukaryota T 7am2 16 P S Q9U0Z7_LEIMA uL30m MRRCVMAKGEDPAHVAGWDDRQDAVEWWWTEANDSRGRQRLEAAAAVAAAAASSTVGLPLFPRFSPGRRRRRRPPAPPPPPPPLFLSRHLHSMPWLWCTCVKMQMYYTPTALTCPLSNSLAAHVGHIIVGVAALLPYSMLLFLTVMCNPRKHEPVLRAQRIRWLTFHSLMFRLLRCITASPAVAASVAVAAAQTPTSLRPAAVCRRGVHLAPSVLAASAPPPPPQQQQQPTSAAVPASTATSTTTIAAGPYRRVGNVFIVTCIDHPFKFSWEVNRMLRELRLEFMGQTTVVPDIPPVRKRIWRVRHVVRVDQLDLDEAKALIGIPEHISFRDLAGQIPPTFGRGGSVANPHMRSKMNFMRLRRMRLRDVMHRDQLEKRLLEERHHALQQQQQQQQGGGEAAAAAAATTA 409 T 0.00017 Ribosomal_L30 pdbpercent F Eukaryota T 7am2 17 Q T Q4Q2W9_LEIMA bL32m MLQRTTLRCYSALVGQATPVLLGSKGGTPKRKKNPMQLRRKTYGLHFKERYLKLEEWYFCPLCAEPKKQGEWCRREDCRQIKP 83 T 0.12 Metallothio_Pro pdbpercent F Eukaryota T 7am2 18 R V Q4QCK6_LEIMA bL35m MFRISLICFPKAGCEEITRQGRRVVLKPQEYFAQHRMQVWQMRFKEMGPPFSRVWVALGGKMRRRRIGRQIDVKDMRYYWRPIEPQYQRLYMSRLRIKDHSNKRVQPMRLRATNNDIGQASSLKEWERSSDRKYGAALAPPKKRDFEFRVF 151 T 10 Gln_deamidase_2 pdbhh F Eukaryota T 7am2 19 S Z Q4Q152_LEIMA mL41 MLWCTGPRRIVFHNAPSVYPFTKPFHDTPYDQDRGRFDKTKNILRENKWPAWMDHGADGTGFGIGLNRTHPLSKLRGNLRRNPSEIPRVLNMMIQGVWHKSGNKLYFRGGKPPNPSTHPYLTGEPCPVYGWKVTDPGVIREFNLPQPEDKTRYKPYVALQERKIMGMQAPTKEHSAASTSAASTDSKPLMKRLFFWK 197 T 3.3 MRP-L27 pdbhh F Eukaryota T 7am2 20 T BA E9ACP5_LEIMA mL94 MAQWIPKTAWKVSNLNKRYGAPYVAKGYASLDPRCSLDAYSSFQQTVTSADMKKALLSIDSTSSGALVIDVRSEPERRLRPLLSPAIVALHPHDILSGAACPILPSNKERAEMFVVASEAQRAVNACTALRRWGFSRVTAVSVDAVSEAIAAVQKPADAATSSSTKS 167 T 0.0043 Rhodanese pdbhh F Eukaryota T 7am2 21 U CA Q4QGU5_LEIMA TRUD domain-containing protein MKALGRGPITRLANTAAPGGFAAPGAVYNRDDWNAGRDVSAEERQCGILTRLCTLAATAPREAASPACGLAPLEAVIRVQSTDAHVTEVDANGGGAFLEKAPKGRWRKISRSKTLLVEDTATPFSNSDKSFSPRVQSYGEYVRRIGKLPEGRPLLRFAMFRDGYSLDSVCHRLRYEIGVPHDGVYLHEPPGGSFAAVTQFGVAVGVTREQLPHASRHYNVHALIFDDRGYHALDELPRLSVAPQAYLHRILLRCVSGDEAAVAQRLRHLSSNGFINYFGLESFGIGSNTLFDMAAFAFRREPHRSVGAYLQTLAECSPLHHQPYLSYANAEESTVAGAVAEWLRVCERAKLPRETRELLRKLHCYHLSQCHPSDATTISMEDVWKACPIMHRAEQSAAAFVWNAMASQRLLSFGSRPVKGDLVCRIGNRGAIEIAEVASDTDASHYTIDDVVLPIPCGGTPAAELRYPTHSVNEAFFTQFAKKHSLSFLFNSGVDPTPRAAATLGPYRRLVSRPRNLQAAVLQDPSSCAALKSDLFLLQEHQPTEGWSLDYRQRVREPSNFNVSERFRERMSCIRKRRAGEHSVALAFVLPAGSSPWVALREAFHMHYGTFHDFYGVS 618 T 0.069 TruD pdbpercent F Eukaryota T 7am2 22 V UA UA AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 203 T 17000 EF-hand_5 pdbhh F F 7am2 23 W BB Q4Q4D6_LEIMA mL95 MLHRSCVLVDSFKEHYHRVHLPRRLALQRYIKREEARLSRHKGKAVAAAAAAGVQPGEVAYKYNRWWVSNDHEFVHQFAFVEDPDVTREKRNTLPLVTKENIWKEPQQTFFLPFAPFVRVVDYAKDPDTKFLKPVNIPRWKDYMQRTKPIVPRTWY 156 T 0.14 DUF4653 pdbpssm F Eukaryota T 7am2 25 Y BK E9AD80_LEIMA mL67 MRFDNAAPPPPPPPEAAGSTTTSSSTAASALPRVSYRTLPSSMYPKEVSSDFIPFPLPKHDASMGFGPVRLRNVPDIEEARARMAKAAQGPPRGSASTQLQDADGDNAEAASALWAELSGETTATSVGLSTTSVTGASPEDPEALSRPPDAQDEDLVGYHVHRHFPLLDVLGCDRSVNDLLAQFWNRPQREARTATVLDFAATLQRHSNEELTRVLYELSSLFEWDGNGLQFIAAKVLKYGRSYTVSSELTKAFVQLVDAMTVAFVEEQPHRLAESPALLAQVLHFLALVKIMEPNKWYTLNPNAPQNRADYTHPRGVNRTCGHVTTGRALLDFLEDMVTSGHNWTGEAAVDDQQGSLARSPVPQHATNRFTEGWSEDDILDVMAGFSGVMPDGKASSPVLYALLDELWMRWSKVGFVLSGSEQAVRLERLYMLLQVMDMQRDAVLDALLGGQLRAHSTAPSTSTLPTLFCERDDTPPLTLAQSLTQTRGPDFFSAVSRDKRAMVKAAALRLLTASLAKARDDSDAVLHQALVESGTELLQSLTSKSAALSFAQREQFDVITLRAVPHMADVAERLAEQRAEAPFFPLTASAGGLPDTAAVLAHLSSHPAPYIVLCKGRRVHPVRTLVSNLDHVAAVENVFLLHSSGVSKCVDALVAVARRLRSGKDALIVTASCLRALQAAAQYGATEKRRATADRALDIVSYELEAGRAILMPVTDELYLHDAGTYCDEDLMLWTLAAYLARDVPLVKVHTIMSSRSRARNPQHALRGEHSPLTSTDDLYNKSTPLLQALRSKELRAVTHHPVVQRPVRDPPQTLYNVNPIRARFVYRRDKALFDKYHVTARNLAPGFSQGALNSDLRALGFYTPDHPQVPYTPLSELKCHPVPANPPASQ 893 T 1.6 NRDE-2 pdbpercent F Eukaryota T 7am2 27 AA BN Q4QAP7_LEIMA mL81 MRAAPTRLAPSTVASLGRSHCGSQAYHLDAAGAAGWRRRRRLTGVSMTTTHTVAPAVSALPFSLSTARRTYYWPYPENLVPEGATTSPFQSSPVPSVRERIIREYALGPLFGSRTPCCVLGFAGTARDVAACKRDVRRWVARALGKSEADVELGALVQAKEMLLHRSGTDESPRLGDGPGSRPDAEQRRVTRYARLPVQARTLLEVYLPGEEHGEVDAAADADATILAHGYFLQEQLHRHMTTTASSGSDPTGRRDSEDCQPEEPARKMQSHCQGCSEPNDEADVKSSVAALHDVCGVIYCEVPVLDESDFAFDQLCGKEVDDETTERVTDRWTRRAMQQQPQL 344 T 17 DUF1382 pdbhh F Eukaryota T 7am2 28 BA BE Q4QE16_LEIMA mL98 MLGGLRPLAAATRRTVGGALVSPALITPSRALSVRTEDFFSKEAVSHARRVSWAPHTTEKKVGAFAKLSRSNFNDPLPVSFQSEPYFEEEIEAYRAHHRPDVYVYKYNVSPTHLSLRE 118 T 3.7 DUF2975 pdbhh F Eukaryota T 7am2 30 DA At Q4Q4L5_LEIMA mL86 MRPSALCLGGFTMKYKRGTGLWDEDHVNDFDANKYLSARSTMRWYYGMERLQTRNNMNARRATQSYNNNMGLHHSGRGAFERELERRGIQVDKYPLTTTTGAARVAEMVLLRRQELEAHAKKAMESQRQARRRDAPSEWYDETEGPLNPRFLASMQSNYTQVITELPSSPVTGRRELPGASFA 183 T 2.3 DUF2663 pdbhh F Eukaryota T 7am2 31 EA Au Q4Q8J6_LEIMA mL87 MMLQHTSLLCRKALQSYPVPPRARNYERRWSSSRTNPYNRMFWRTVLNEDFARPSFWVSDFRHKYLAKHGMDYQGRVPASPAPGMYQGFSDVHKILANHPKPQRESRHLPVMPMTPRVVFEHAQEKRIDYAKKMHRDRRLVEQLRTHEFWGWYMKLQRVRGRWCKEHGVSSRGVYGPAVDAAELWG 186 T 1.2 Crl pdbhh F Eukaryota T 7am2 32 FA Ae E9ACG2_LEIMA mL53 MTAPASHYTFANLKKLGLCAPQVALSRQPRLRPHVGHLNGLVYPLPYYAMWRGNHDKYTYNQATPARWGEGNTNTMYHQHYAHAKCPTDYGRGGREFQFLSVKRGKLKRKPLPTVQYVDPNSKPQWVFKSWHNPLSAPSMWEREVQYPEHTPAHTGAKRPLAVVAPKTSHKHLFLMHMEKVTVTVSPLLFGYGHTLQKAALDFYRRGLSARSPFPSDKMFLYYSIDHITPKIEVTWLDGSVYVPPLIEGVKAQDLIQMVMEQAWLAADRMSAEGRVLNPIAIDDYKWEQLIAFKQKRAKGAEAAKGGAKKK 311 T 5.7 MRP_L53 pdbhh F Eukaryota T 7am2 33 GA Af Q4QJB6_LEIMA mL63 MLRRSPVPRRYRTAWRELLHPLPVWARRQQWLKRDTVEMNEAILREPYYRIKTFAQPAAFVSPRVSESAAHEPDTQQSSRYGVDRQLRGPRRAVSPERLQELREQLQFVGSIGPKVPPAAGAGTAYQDEYGTRLRPRYPQSWDTVPPHQPSRSEI 155 T 28 PsbP pdbhh F Eukaryota T 7am2 34 HA Ah Q4QC45_LEIMA mL68 MHLHISSIPHRNSNNSKGGVLDATGPMLSAKRGALLLQAYHRPGEVISYKAGDYHLVPKKFTVGKRIAVRSYLDRNRTELSDRTFMPQKNWFRPYDLQDGCFDRDHERLSYRFYNLETKVIWKAFDTPELIGMLLHDETVKGNSGMYAPDMLDAALHYTREARYWRCIGITKPFYDRNTLRAHCWEDNGLQVGTLVMSQAMRHALMDLERAVRRKELGLEPNYLWDRWGPIGFIDGARADYLPRFEHNPYVDPDGVDVTEIDVLPFNTHEQIRERYRDFIEPDTAPFEEVFRSPSHGSLTTLADIPNASVVALYKDLKLKAGTPVAGDAVELAPADVRTLFYLSANPEWRAVADGKASWEEVVDAMQPVQAELDEKIDAARLLQNTRHNAERVRAFFEEKCGFHDFMYTPDKTITAAVLCYLTELRRICTETAWGAALAKCLTDMERVQGMGRDAFLVYRHIEDAILDKKRRLWAGRFAGESHEESTLDYLLENFGRRAERPRNVGTTGVEFDREQEPIGRQVQRRVLDSDKANKLAEIRRSRGKMWSKKRSVFDALHEKQLQNFNYGVH 570 T 3.9 VirE_N pdbhh F Eukaryota T 7am2 35 IA Ap Q4Q7V3_LEIMA mL80 MQRCLARLFQAGVHTPHGSRYNAARMKNWPVQEVPQNFNFTNEQRFKAKAVPRDTGKIPRDFLLSVLYRNQPCEVASLWEHCLHDPQIVLDSKRHLREVLQQARAEGFVSFEKDAVTDRWVCHLTRERFEEVRALVGARAEAQDLYSGLRGASATETSAYSESFREMNEDTKREHFRLLSEQVADTTTHLRKFQRMEMDYLPYTDLNGKVNFMWWYEMSDTRDATALPEAAAEGSPKLSE 240 T 0.15 Gluconate_2-dh3 pdbpssm F Eukaryota T 7am2 36 JA Al Q4Q1C8_LEIMA mL74 MLSSAHRAAFARPTATLWASARSFGAGPTRLLLGLEQVQDVPTSTDRKPTGMHRGPGKRQTAPKEAAQYQFIKKWDLQMRETWDELEPFKGLPKPKVQFGNEAAEVIWPYALLLENVIKVHPYTKSIYVYYSQRQSTPLGELAARVAKRVSQAYLIPITFHNSHVYVEAEMLLEYSETPWVVVHCLDGTHKLIPVKPQAGQTVKEGAEEVLNGIVSACNEIGSAVKNPKEVMRLLSERPLQNQYVRVNYQWYGDTPEERMSHLVKWDYEPEEVVPQLRNRTQHVLDWMNYDGNLPTHNSVRVNIHREAARMRKPNVSAGPKTFFNSSGSRANARTARFDNSRSSQS 346 T 0.011 L51_S25_CI-B8 pdbhh F Eukaryota T 7am2 38 LA Aa Q4Q183_LEIMA mL42 MLRLTQAVLRVQSHQKKRAQHPNAGTRFGRVYNRGFVRYGFGGFGMSVYSSKKDRTFKVMPVPPPPPATTAVEQRDDFADNRGLSATTRTLSPTFRMFALEDGGVLVSHPSHAQIMRWNQRVHTEEGKAANSTVMDEYVNSRIQAIIADNTIENTSLSQWRKAHMWNVIKSHGKLQRRWGTPDFVMGARSTLYNN 195 T 5.8 RHINO pdbhh F Eukaryota T 7am2 39 MA BP Q4QGE0_LEIMA mL52,mL52 MRRRDWCGVCLPAATLHALARRYSEYRSSYTGARSAPWAAPEAAPAYPSARSPFPLERPRFRKTHIEWMLHHGHGDRYGKYGPSREIADFEYADGTPSSISGKRFALKHHQDHLLVQLIRSAAIVERFEEEELLPRIPGTPEQRSWDPEIPLFLEDVDEFGRPPRPVAGNMVARVIEERFAQESGRTPVNLANKHAGEVLEPNTMFATYDPAAFVSDDIKKDVRRPFWSRRRWALSDNFMVPMSPKPKNTIKDE 254 T 0.0014 MRPL52 pdbhh F Eukaryota T 7am2 40 NA Az Q4Q4D9_LEIMA mL93 MLRFTQVIRKNPVVFKQGQGMFSHQLKRILNKKSLHKYNWDPLHMYDPRKLVHANRYVDHDTYEEKYDPHWEHNAHLVPDQQFYNIPVPKEYKDAYWWRDLQARRVQCPTEWVHFRMHTKDKLKYDFQDLAFRKKFEYSYEDVVANAKDMCS 152 T 7.6 Pox_VP8_L4R pdbhh F Eukaryota T 7am2 42 PA As E9ABZ5_LEIMA mL85 MRRLPLFCRRPSRCCGATASGSGSSSAAVLAASAAPSVLVLAARGIATSGRVTNEDRRWWLVHLECAPDVTPGTFVSWLDCCGTHTTKKLIERNIWTIEQVAELDSDRVDELKYKEGCLKMDVVWEHARTIITPLKQREVSGGVESQLQSRILELRKKRELERQRELLARERATVSDKREETLRRLRESVAAKKAALRKKLDEQHGEATPAASESASTEAHRGTAEAAVEDEAVGNIVDRMSGGNPPRA 249 T 0.39 OmpH pdbpercent F Eukaryota T 7am2 43 QA BG A0A504WW14_LEIDO mL100 MLARYLDPSVHPLRVGQVVAYDYLHAAKTWQWTLGTVREIKDYTAVVQQWGLHTGDIDTLRSILLKEVDTENGRMKNYHDMLAIAREKLASIRRSNEDRVSHVRGHFDKAREKVELIDEVDLRKVTAQAAPSPVAVAVLKAVWAVAKCDPTAVEFYEWADVQLEYRKPAALDEIAKTDVLAKLYPSAESLQQSLEQDPKLNYKAAARDSPVVASLHAWVITALAYQQAYNLLAHDKRIQEQNDAIAAAIAGMKACRAKIAKLKDELSSKDTAALPGQVTSFTRTSVLVTIPLSAVISPVNVDTDVKRCVLTKDEVEQIPIDAKITRYAQKQKLAITGSHLLDQYAAATTTHIYVTELEDRLFFFQHYMASALRDAQTAAVDAHQRLAVSLHELEAFRQKRHDAKKARAAEPELADADGVEPSSGPTSSRSPTGRAAPRGQSAAPRGTASQQHKLLGPAYQSIDPATIANEPLYAVTIEEYKAKDAAGERAMDEAERMADEVQRLAVELEDAKAAADKLAEELAAKDEELAAHRQKRHDAQQARASDPALAAADAVAPRSGKGAASPHVGAVQRQAVDPATVPVAPAVIAEEPLYVATAEELQHVRDFADQLAEELEAFRQKRHDAKKARAAEPELADADGVEPSSGPTSSRSPTGRAAPRGQSAAPRGTASQQHKLLGPAYQSIDPATIANEPLYAVTIEEYKAKDAAGERAMDEAERMADEVQRLAVELEDAKAAADKLAEELAAKDEELAAHRQKRHDAQQARASDPALAAADAVAPRSGKGAASPHVGAVQRQAVDPATVPVAPAVIAEEPLYVATAEELQHVRDFADQLAEELEAFRQKRHDAKKARAAEPELADADGVEPSSGPTSSRSPTGRAAPRGQSAAPRGTASQQHKLLGPAYQSIDPATIANEPLYAVTIEEYKAKDAAGERAMDEAERMADEVQRLAVELEDAKAAADKLAEELAAKDEELAAHRQKRHDAQQARASDPALAAADAVAPRSGKGAASPHVGAVQRQAVDPATVPVAPAVIAEEPLYVATAEELQHVRDFADQAAHDAATREAEVAGTVENLRNELDDVREMNAKLEDEVFALKEQLSDAEDAYKKLAGALVVAEDERQELCDDLEAALDELEQKKDEYDELLGNLEEVQGLLEAADVAGRTAVEALEQRNRDMADLQGELANALDASKENENLRALLDAKEREIDRLKEYNSFWTDTVGTGKQKVTHRLTKIFDGDWTRLMRHRPEALKAAFVIDSSNACHVPGDQIFLVSNSFTRRLLTRTDHCPKCDRLSTFRFMSVSGMVGRMPYKPVDTPGPSYATLYWRKQRSGKIASQPLNEVCNKNEF 1347 T 0.012 Fez1 pdbpercent F Eukaryota T 7am2 45 SA Aw E9AD00_LEIMA mL89 MSSGAVGRGSFHSVVAGANPRRIPTYYNSAYELIQLHRAHREVTRNFLVRDKVFDNKFPGCSLANGLFKMVPNKRGNFHTRELTESIRHRTIWGQRIQQQRTINAAILEDATKVLSPAQMEDRFSYRTPDAAAYFSPQEYTAANNWPNYWQHPTEKHVVPKPRWRREPELGGITRVRDAVATPIADY 187 T 0.092 DUF3295 pdb F Eukaryota T 7am2 47 UA Aj Q4Q728_LEIMA mL72 MCTRVFDVSQRFILLVSLPPLSLSLRPSCCSRKAATRRRRSTASVLLATDALSLFVRTHYPSPLLFLLLPNFPLPIHSPRKQMRAAVGAVLQPSSSVGALRCQARFITRLYTSYFKGELFPNQLARPLERLPRGVSLAAARKGQQAAAPSSSGSGNPATTASLDVVTWGDVDSTDLVHANEQSRAVAPQAGLAPRRPYVPLGEVAKLELQGDYLTEGGLHQEALEYYGVVAKAYELAYPKDHPQVAGIRLKLAGAFRRTGRLTSSKANCEAVLQMLDSAVQPPLELIVEALFELGLTSEAMSDAAAGTVFEEAVALVDMFHNSGQSHKMLRLLPRLGRRFNLNFEEKFVYFSPFDYDRVFALADQCLERAEVFYQARNDRAGVMRVLQQRKELIDKKFFNMRDFAGRIHTMRGHWKRRAQVLTNAPTPDELLRYSPTIHQVYRDFKYELNAPIGREKEVQPGVNRVVHDMGNPYRRSGVRSQRMFRDAEKNFEKYIRADAFEA 503 T 0.0013 TPR_12 pdb F Eukaryota T 7am2 48 VA Ar Q4Q712_LEIMA mL84 MLRWSRLLREMAPELQLEYIPIIFTRTILGPQGGFAGEERLIKREVAQKYMSEGNAVTPSAEFHQGVWCYNPDSEQYDRFVERNAEFLDFAARKRQWLDVYWRVNTGYLLFGRQSWGQGWLLNCPLRKKDIAQKLWEQYKVRVDPRLIEFREKDRRTGIQDLGHNWCWLYLPGAEELAIDREVYDNKRVKVRMHIRKMSSYGALY 205 T 1.8 Ribosomal_L9_C pdbhh F Eukaryota T 7am2 49 WA An Q4Q0F5_LEIMA mL76 MLQCTALVLKSQHKNVLRKGRPHMQKYKELNRWQREAQGITKWEQGHSHRPQPYVERFNPEGAGLTRGTSAYAWKWWHTQYPWLPNVAPADYVPPSPRGIRPAAWDDEFADVVLSMSDEEIQSYLLDKLTEVIFAETQRDGYELRRLDFEGKPLTELPERRIIENFVFEEETLRERVLDRVVEGVFRLVPTSTDRLELKSVANIIDFVLTHVTVARKPLQHEIPEAARTVMRSHPLQPQLGFVHALPTDNRDAVVQEWERMHHLDWQFGKAVYEPRSAENERGNLTWLREVRHHEAREAFQADVDSGEARRRHMAKIKAAAQVPHTGTTSQ 331 T 2.9 LRR_1 pdbhh F Eukaryota T 7am2 50 XA BF Q4QIQ1_LEIMA mL99 MRRTVRALYNSFERGWKDKTVHPLDRRGRFNLDEAAAELQLDEAYVASLYKPLHYTYSMKGQRYPAEQGRTSRPGSLAASRDRMFPLYRRNYKLNRELRVLDHRRISTD 109 T 0.52 DUF6416 pdbhh F Eukaryota T 7am2 51 YA Av Q4QIT7_LEIMA mL88 MFQRTCTPRLLACTSALLKRSGKPSDLPDYKQVYLPYDTAPTKTELDRERRKFMHAYSGRMEHRKMVEVKDVPQNMYTYGKEGMSIPISIFKDQADPVIGPEWTYPGIFENKIVAQHWYMEELFDREKSNTFESPWQRQVLDNQVKRRLGKVAWRMSMLNIKTIDIFHKERGASKRPGAGDTKAPATPAGKK 192 T 2.1 Ribosomal_L37 pdbhh F Eukaryota T 7am2 52 ZA BM Q4Q703_LEIMA mL70 MSVFPGLCGDVATTNYRVFLGTLPNLAVEERFLRQVQPVFPWYASRKHVKEQASEFLEIDLASCDPELLLRYTHVYYVRRQLYDELVDRQLTLMETGKAAKVADSALLTCLAQVNAAITPRLQYELHLLQQAKKACRVPRRRELNPDAALEAHDYLCMMRVVEEDVGGIPDAEMQARAYLPREVLEAKVKELAAMIFGDGGSATKGTGAALERKEQKLLQRMIPADYNKVGAVEKLRPVDVTALYRFTGERVCGRPADKPFARALWGHVFRKVGSHPLYLQRASLYWARHSGLDPQSATSAMPADLATAVCVQQALFPALKYRCQYLYTSPDIARQQWRTGHVVPLLRLFPLLGAPAAEDLAAQLVVEGEWAKLGIEADTNLLHDTVLRQLKDMVEQVSALYESDAGAVLKRVEDGAKVLCPSLSERESLTMRGAPEDTSREVSAAAAARVANAAPA 457 T 0.37 DUF4911 pdb F Eukaryota T 7am2 53 AB Ag Q4Q829_LEIMA mL59/64 MAFRGSSARLAATPGVGIAPETTPVKYVPEMLNIQNAKWWNGRGKPVYRSTYNEKSWLEKARWGAFTKGSRPVMRQRYSAAALKEALEMVPEGFETCDVPRPPQRIRAQSEGVVGRWYTNYWTLHSVRYQCQLAGVEWQFGERQRPRTNYDEPHMYTDFEETKAIRDYRSRWINVNRSLVGMSRRMKESEEEARYLHFKKVQDTFWSNRKVLVNRIKSMHNQGTLQSAKDLPIKTINIKAFLAE 244 T 0.036 DUF1672 pdbpercent F Eukaryota T 7am2 65 NB U7 mL78 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 40 F F F 7am2 66 OB U6 U6 AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 187 T 17000 EF-hand_5 pdbhh F F 7am2 67 PB U1 U1 AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 46 T 7100 zf-CCHC pdbhh F F 7am2 68 QB U3 U3 AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 75 T 13000 zf-H2C2_2 pdbhh F F 7am2 69 RB U4 U4 AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 136 T 21 Keratin_2_tail pdbpssm F F 7am2 70 SB U5 U5 AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 94 T 15000 EF-hand_5 pdbhh F F 7am2 71 TB BR A0A504XZ90_LEIDO mL78 MILITSNCARVAAFARKAACRGALCDRHRCISGGGRGDLFTRHASAFKPPAFGVLRGLTHSSQTTHPQTGQLRARKLIQHAMHDRTLSGSGHHRTAVATWSYLLPSLRQNVEQAVPDTLYEKLLTDEVPLTPAESRQLADAHRLLRFELQKRIGLLEDSLADAALPYLLQWPALFQRAWLRLPMDQGSASVTDAKRDAVVPAPPSFVHAPAALPITEWAPTLSPASPSACSNGLIGRGALVPRLAQLTRHVIRCVEEDLGRLEHGSEPREDGAPRSRRQVTLEAAWRAQWASLLSWHAGTS 301 T 0.37 DUF3945 pdbpercent F Eukaryota T 7am2 72 UB U2 U2 AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 37 T 3600 Chorion_S16 pdbhh F F 7am2 77 ZB U8 U8 AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 59 T 11000 DUF4699 pdbhh F F 7am6 3 D P ICIC_HIRME LEU-PRO-GLU-GLY-SER-PRO-VAL-THR-ASP-LEU-ARG-TYR LPEGSPVTLDLRY 13 T 0.62 Inhibitor_I78 pdbhh F Eukaryota T 7am7 3 D P ICIC_HIRME Eglin C fragment LPEGSPVTLDLRY 13 T 0.62 Inhibitor_I78 pdbhh F Eukaryota T 7ane 2 B h Q4QIP8_LEIMA uS14m MLSCKGVLLMRHIGQDVPRRHTHFVLESRLMYEKSFRDEWLRSLCQGLANVDEPLAKSLSGLPQQMLQRKVTCFSYNQFGLFKVPYYRLANVDRYYAVQGALGTREWVPYANVSSWTMNKMVRSGNILVHRVHYKGWGTDNALNQGGWEHRWNKVMQRNALQYNRI 166 T 12 Phage_Cox pdbhh F Eukaryota T 7ane 3 C aw Q4QHA2_LEIMA mS69 MHDANRFGGRTAYLREIGPIDHKKKGRLFKRDLPTLQFNVDVWCAQQTLRKQWKGRDWDVVEMPFEMAPKELQRVVPEKYTDVPIMTDPARHDYMNIRRKVFDREDMQDALFASGGAGQSPYPAIQRVDKAAMTLDKYL 139 T 0.15 DUF4993 pdbpercent F Eukaryota T 7ane 5 E f Q4QJG8_LEIMA uS11m MLKSSIVLLRRGKPRPRAGMFPEKYRRVPTLLKPQQGGQQYFNEFLIRSANDALEAQQQGYGSAFRVGGSGAARILADGNATAADGIHDGEISSVHPRLPQADIDGVLQRSRAETIQAELKQLVAQDGFISQRGFNERLWYEQEHHRLRTHGDGVEASPSEVAATAAASSSTAEAVSATGEGRAVPERILGDDYFQSKFGYSLLKGRSPDASAVADNVKAYAQLDLWGEMPMYSRDFVFLYLVSRRRNTYAVAYDYDGKRLLPTYTAGNRGLKGGDRGFRGDGSTDNGHQVTSMYLNDLLPKIREARAASGRPLGRGEKIDLVVRVMGFYNGRQGAVRAVQDRSADFRVRYLEDVTPFPLNGPKMPRGVFR 371 T 0.00026 Ribosomal_S11 pdbpssm F Eukaryota T 7ane 6 F s E9ACZ5_LEIMA mS33 MLRSSLRYGVHKVGYTHPHHLPVPCAQRWDLRLARARIFQEYIEEKAPGAWQLEDERHMSPEFNTFTGYPMRNLRPGYGQNLPEFIMKKRLPNNTHYELFARRDIPNEDNAMYGKLLYDMTIHGTSLPSIYRMHKDINKAQRNDRKLSGNRFKVLNSSGAKSPPSGFEAIPDAVEEEDD 179 T 0.95 Nmad3 pdbhh F Eukaryota T 7ane 8 H am mS59 MSPPALLRASGVLLDKSMFAAKRRVIVPIQPTPGYPAHFIKTSFTTDPLKEKQKARFSSGGDAMREVQDIPKRLEGQRSRAELTSRGDEDFAALIEFIQGASYDQLISGRRFRKVYEKLSENDDMFVWLCHTAMAVLNPGDMRSRLIHNHLKALAEAVASGEMTQRTAFRFFESAVRSPAYREIAARQLETGTATRLAGLAAAADVMREMGLTRRPMSSYFELYQRIVERSEAMTPWGFPPLFQFEERLALEPRLKFFSRVGQQQLERRRRGSIFSPHTILQGRRIFWIPPTWNRAGRFIGPHINLYPGLTPD 313 T 0.4 DUF2840 pdbpssm F T 7ane 9 I n Q4Q7N2_LEIMA uS19m MLRRCASAVAPAAHIPSPAAAVSGVQKRFLKIAKSTFGFYLARRGQRKFPFHRRPHIKNTQAMNLNAPYFWSYMTAKSQSFFLPEENYITGDWTGKFFVSKRQVYTLQHATSGGKVRVKSFPSVFELNSPSRWNVGKEMNTLTKPRMDLIDDQMLTKKQRLDYVKAGLLPK 171 T 4.3 LIN52 pdbhh F Eukaryota T 7ane 10 J ae Q4QFA7_LEIMA mS53 MSVIGVFSKGRATGHASVMSVLRYVPRARVPWQPSRFGRENLADDDMAQLWGTGRYRTGPGNYNSGYSTKKTHALEDSTVSIIPKHELEKFMPDISLGSKALVTPVSLMSARNGHRVTHDLIHSYDPYIGRLQKPAVVDHDNITVEDPNRVGLNAATLDCRSRIYRWLRRGPFFQEDNYFRRSTKLQRNGPVPVSVHEVPLMQRIIRLARRGHLKAACEEYRRVTSVPPVDVYRALTAACVPGAKLADAIAIFEDGHSRLFYVARDGEVLHNMLRCAIRAKHRVRVMWVYNVMVGRHYENVVVRAEVDVIWRYRIASLALEYLLDSNAGEEARTVYDYLVENELVDCDLHVRLGHVMQQALKEGKTVHVQQDALDGMALSQNVVAVAPQVAVAVYARYLETMQEGAAWTDARGLPLTDPAKTGADTNGAAAVAWLKAAFPDIDPVAVLRLARFRRSSKDLMAKDRPVYVQRAAQWVELLSSAHQTREEAPLTYLRKSRPSMANPNVRVAWLPERQRAHALLASDEGFKFAYAGPHTRFVEETFAYGENTLQSRYLAQQPVHTEVTPSVALGAAAVAAGMSSGSALPRLPGSSAAPQILHASVLLDGSLNGSSGSGTSTLRSGRNESAKAAASPTAGISSRPSSASSSAGLDDTHF 655 T 0.00058 PPR_long pdbhh F Eukaryota T 7ane 11 K ay Q4Q7W4_LEIMA mS71 MNRSFVSSADLRGLTAAFCGSLTCQKRFWAKPKKRPKVGPGFHEKAQKWRDEYLLDRHRVLADSLRAYVDFSSTKRVEPWDTRFAPFDRVEKDGVYILTRYLMDDKLQLCNYHHRPVKRMLCNVGLMGPQVTTTARWKPYRFATNPANTTRAERTFTKDKTVFTGYHHD 169 T 0.29 Tox-MPTase5 pdbpercent F Eukaryota T 7ane 13 M aj Q4Q7Q8_LEIMA mS57 MLRRTSRRLLGYTPINPDTSPMLMYSQCHWHYNLPQGMERPSSVNRSLPAPYQPHHSSVNKYRGVWISTEMHPAFLVGLAPQLKKLPHGRVVPQTPVAEVIDEFNKLSPLIDDAAARDGWLAKIFQHCAFQRSGAEAMALWDKHCAPRFMRDDSASAPPLPLVQAILFCCSKSDSAEWRPIFTKCLKDGWNYTPSFDTPQWSYLLKSLGRQGDEEGVRLVLEEMADVQADLDRVEARSLVYALNAVHDKAIYNYVKKYLFYLGERKVKFLRITYADLRGHGAEKLRVPLKENDSMFYHVCWHASIRQPRQFSPRQLYFDYAPSQLATSGHSPNAKVDGIVKDKIDKWKAEGLLPEDYVHEDRVYDRTAAFKSVARQEKWKKVPRIVKSKRFGYSGEP 397 T 0.2 RPM2 pdbhh F Eukaryota T 7ane 16 P az Q4QBM5_LEIMA mS72 MLRQTAARLNTYLTRSVATPPISVIRTGPKWWAEPERMVKHKVMYFTMGIDQLPLRRTAVIQKDLKRFHMCKPPPRVGDATGYKRSRGAQLTTWYRRIQYQEYHLQHLFVRHMWGLLRMYPGNTTKIQGKADDGYVGYDSVHFHRYNRSPLPFPAREIYERRK 163 T 39 MBDa pdbhh F Eukaryota T 7ane 17 Q ax Q4Q103_LEIMA mS70 MRRTRLVCTATPEKFSILGTTHPKPKRNGMGRNNKMRSKPSDNVAWYDKGPVEWLPRPVRLTYDQLDQLRDWMMRETIAGRTEEFNKIRHLHREWSQHPLMPVLGDVEPKFPLNLYKQNHRAKRRFLVRWHKANSPTYWMWMPRGPAVATPLHRSSPSQFPEHWKSLARTSSSSSSSGSSSAAP 184 T 11 THDPS_M pdbhh F Eukaryota T 7ane 21 U aa Q4QEP4_LEIMA mS48 MLRARARSAYTVARSSAARCSAAASIGAGSGALNGSTGLGSGVGAGAASASLHEPRRYSFKYATKRQHNEARQPSYIHDKRYGLFSNEHNIGKSRRGLPHITPIYTKHMSLWETDTDASTNRFFRHYVFGQREEHLLLGRPHGFEADQAGGRQGDSAYELNTDQRYKGVPRPAITNLHYEPVWNQTLYRNTAHGNQLTNPSSKLTAAVLGPDLMEVRDIKSVEHCKAWFDRLSFLIQQHYDAVGDIGAFRSRHAQHVHEFFVAFHDALSSFDFQDRYLYDQFAKARPKHLEDLFAIFLEMEANYVNEAYCPRCSLPYATTRYCGEGDPNTPFRKHRGRWAPHQYWGKEWYDVVMRRAEALWYRATEDPFFGTTAHTQRQAEALLSVYVKTKHRAKMVDFLHALRGSKEFLLGQLQITPAMQKAADDLLDSTPHEHLLTNAFKLESSAAKYTGDAQSVPHSPLQMRLDMEMNKYRRQQREEGVVRVPPASWKLDTSAIVPYKVDPQTKHVANWRAVKEGIEQSFLATGLPKEAYTSEEWREMLYLKERIASRGARRAQLEAERQAEEAAMAKKYGQRSPAATSSSSWCVFPDTAWYRVFETAAEALKPYGVTHAGQRVLMRSQTYANPAAVPYTDPVRNASVLLDTTPEACSVFGGFESGEVLRLTPKEPEGAAAFDVVVVGVDKSGAEAEWSLYAMHRDVASQTSKGLLNLGTDCLQLLSSYAKVETTRRRAVLTVLDAQRTLPEEHLGQVVGVRRGELYVQWHLQRGGSSELDRSVAVPLGNPETVKQLYKLQTLTDADGAPALLQEPPSWRTPFRNDFVDERLKELEQAPFKREQWASLIQGKYTPKVKKYGYAQHTTQDDFQTKEYKDRLLARQYFHSPQAFSVIPERHERSVKFMGKWEHQRVCGLPTADRDELEKGWGEAEEISDAAVGAIEQALRDISGRRPGNFVKSPTETQSLRLNESWWTPLEFGWEEHNREQMAFLDSSERSIVEGARLPFGGKRPPFGTTYGMGERISEIAADYAKGFGLGPHGHSPQHDTAHFNTLEAEQQRVKVLGLGNALVRLFHEKLGNQDIQAWSLQQCGESDANVRQLLLSVEEWRTKGRAPSLLLRKVLQRYLKEELDAFNSGLPAHVPRLAVPCADASAADSIGSSSAGCIWVDVDRNAFALEHASQFRRGDSDEPYIVGLVQRAGMSSGSGGALAASPAGSGADNTYAEYIQQDVLQRFEIGLARIVGRGIPSSIIMERTVKNSRGLERESAKMLTLVLVGELAKLLSKMRVTPDNISMVVRGLAQCPEKELLGGGDFAVPVSLIFSWNGPQSNVSAATSNSSSSNANAGASRGAGISAVQQMLERTRHGSSPGSSGSGDGEKMVGKVLEELAWSQDGVAADVLYALQQNKANPTLRQEFLNAFLPVCSNDHQKAQHMYADYTLGKFVPNITVAIEAFVKFLGNISTHPGQLTSDVEYFEVDNARRADGTEGTGQYTQVRLLPPEIGPFQYENTLIESIETAERFRRYGILAGPARVPASGFIAANCKSLTYMTHRDKEVVYVTTENDQGLANALRSSALFKSIASNPKLSYLLKGITGGAPSHPLLVDSFNRFFYRVAPMLSFYQSLLQEYSATMPSAQAEAQIANFGLARALESEASTAIEQDFRRNAERYWRNVLEGRSTEEAALSSGGRESASAQGRRPSQQGSGRAGQSGSGSRRGSGEFNLASVVGHRAAAGARSEKSRVPVSSSSSSSSSTVATAASTSTRVKGLLGSLKGGGSGAGRGKGSRPASPSGVASGSRGGRSSSSNNSGDKNGSTSGRK 1813 T 0.78 Metallothio_2 pdbpssm F Eukaryota T 7ane 22 V ab E9ADG8_LEIMA mS49 MSSAGSAAPPPPHTSSFGADVELPMSDWALRLQRELMSPVDPLGGLAHKDYYRDPATGYAPQYAPRDFVHGGSIAYPHMQGSGSAHDSYAAAAARRNWLEHDVESMAFMSQDARATARQLSSDAEREAFTQRHVPADRHRSAFPGNASLAAMDQLRTSGPQSDEKVYQQAILDRYRAAATSSSSSTAPGVSYTAATGLSGGELVDALAEDYAAAVDDGMDEELRIAHGLRAKERFDFKVMQRTSRVPFQGYDMDRFAAQREGRPHGAQQLPPVIPPSSMEEAMKNMRGGAAALLDTEAQAWQTYAQNTTSEEPKLGEALTGDVINSLHARRWSAQHAKEQARKQRFGLGRQGALVQDGGPDRRTLKKHTNDERLLDAVNFASDAYRRTITDEHVDPYVRRSTERGVGHLLTNSFDMARREDRVAHGQQDLTERNTVHYGVPIQQSIDEFVLSHRNARGERPLDYFKPFPDFRAQRLIRMYRDIEGFSLLKQRPEAFEWELFTRYRAHHQQRRELALLHGLEPVANETAAERTARRLALDELCEKTPFDPSKLHLNDDEVEIDAETLRNWFGVYVLPSPTIVESVVRAEGGALNLHLQHAADEMNTADTREHILSSRYMNRLLLFEGFQHRWNRGFTKEVAGKAPEPVIKYAQPQEVLKYFDSDERAMYQQYVQQESDAQLSEWAKVTRGRRYIAEKEQYGEVAGQGYKVPVVDVQHQETGAVLTVSSKLVEKSAAAALADKKLAGGSSSSTTSSSSMVHFDGQAYFVLPGSKRTVTPLSIRLESGESMEMTDEVFSAYPLEVSASAKYNHALNYGIGEYDYNRGNYIETQDAIWEKATADQEEGWSPATHADGLCPGLPVRARRRLAAAGEDKTGAAITGDFQRGRIVQYYRQPFFNPDPRLVTVAFYADGVVQEVPLANVMIWQRRYHGPERTVGDESRRYNPAGLRRYIDVADPNNKKLSPSSSAGAGANGAGDHFLEKYEGRLTNSVAASRYRTTKQITEIDQWNRFDTSRADNHRPLSISHRRDYVRQGYLPRYTPWEWIAIQEADQPIIHETMRTDNIGASYFFSLNRSWRYKARPHGYLRNYENEVRDMLQFVDGVTPWKQAQKIRTYWEVRQHHPMPQFNRPEVAMHRNSAGLLPSHMWEMDKKTGKVRAVKDSVRDYQTKIPVPKWVQL 1177 T 0.04 PSD5 pdb F Eukaryota T 7ane 23 W ak E9ACK8_LEIMA mS58 MSFRYTNHLVATLKHRLFLEAAHRQLVRQTFTGVCNGIEVTCTAYGSVVGIRMLDRAVWEPHYQVAADNKSEAAAATPTAPAAASPSRPSSSSSSSSTASGKTGIDLVKLSASIQAATWQAIQKVRAAKEETHSRSLRRNPQVLAEARLRDWYEQDANTLHPRPFDGLKNLEATEWMQAVRFGVPQPARYRRPNAAPKDLGEGGDGAHTDQKPRETITVLRDEDCDPANIPIGSVHPLFAPGLLQLEVDPNVATNGGSRVDEFFVLSEQRKEMRRDEEAFWERVELIRRSQLATIPKGGVKRGYADMADTVQDSIEEKVQLRFTQ 325 T 0.00057 YbaB_DNA_bd pdbhh F Eukaryota T 7ane 24 X ac Q4QBP8_LEIMA mS50 MQCHHNVLVGWANSGSSTAAFLTQQQQQPLPPSPLRFLWADQPLGSPSVLAPGCGMYRARSCGVVAASAAAPRVRTALDMVIRSYTPIYAPDPATDHLGALRSADECRTLWAQHIPVPSLTRAIELWLRFGNDPVVHTAASTASAERAEGDAAPSSTSPFAYVEDYMGSNMVTGTPEHVKESAELWSEYFETKYVRRMRQSRRTSKQYVGVLGAAGRGRGAGESGGGSASSSIANLLLDEADHPNTKWEADTFFCEVAYLSERHLKTRVTNHLQLDKLLWGGTAKPDAFVQFFEAFQQQTITRIPLPVPSIWVHESTEAKKKWAEHYLPACSAAHEFFQEKLRPHAADAAAQAKLLADVAAAYRQVHAILLERRARQVQAGVYPSTWTGGGAAATATEEAWAANEAEKEQRRMDEGVYDPEDLLDTTAEWATEHAKIQAILEQPLTSSGSNGEKSYGFSLQDFWLHTERREALETVHVLESESLARVAAAARRRLYSETPLPDVFAGLEESVAKARLDLRAAVLKPHFNSVWCRMHYVKFGAASLVQHTHTASRQLLFHYAASTQVVAATAEMYYATKPLSSQLDYASPYTFRRSLARHCTRYGVEMAHAAQQPLLLSAAYLAKAEGVIGRVARQAAAPFGARRRARYSAAQLNNQRLLNPVKSVQVTAPAPELLAAGADLLTILREERTPKAKAAGEALKVWPLGSRQTVSYDWTSPALDKLRLTDSSLTAEQAAQRDQLRQAGRLEISLWRRRTAEERQKVRAEMKKEAVDVQALVAETPVLQEVLAYASHLYRKLTREQEQEQSYSDTVPTPHAWDEASGEWVFAVMLDDDVPLSETQSTEVFLPYVDAAGRRLPNGEYRVAVRAVDRELNPTEHPTLMSAATSSPFSVVDALPQLYAQYTRHPQPADKATGTPTEAPLEGDVAGKDLMSFCAFLREAGLHISLSAEFAMGQSLDKQGNVSVAEVAAVLRGTEYHRSQCEHGITDAQRTIEPQCRLHWSLYHPGATEQEWAAARRRVLRRAMAEERDWWLPDPMLEVTDVRTDSAGAASFSFGAYPAVARYGTELCTVLPAHGSNHMEYAVLPPPPGVKATARGIGAQVQAECTVDGTGAIASLHYGAPISAADVTVEDALRAAMEAIQVAQMRHNTLSMVKLCAFEKQAQTMLFCGIQGLEFGGKHGRTYAYALEKAKREMAATAEAGQVASLQAADAEKLRLSDQEQTSAAVDRFASQTNPEQRLTRFVPRSTMSGYSMEDVGPERASTWGL 1267 T 0.76 MIase pdbpercent F Eukaryota T 7ane 25 Y ad Q4QAQ0_LEIMA mS51 MMRHTVVRHHRTGKARAFIMRDPSLKILRAGSGFQQLKRMGMPSQKTLGYRQVDNFYANNQYQHAWPLLTHDDLGNSDQSNQTKNILYSMYMPKRNKGTAPWFRGADTYSVKYCEQGRYEYQRYLMINRFPSEYKRHFLRFLGNIRGSKSRDAVPQEALHWLLRMIVDNFNPQHVHYIAAMRTLQDSGELDMARDVWKIMERQQTWPDTATICAYLDVCVEAGEKTWAVEAWNRYCTELRFLQAGEVDPKPITRTPFSLTREELLYLPKWKKHFDHDPNLDVPDLNRFNRTREVYLRMAKVMLASDDMSMFEHFFDKLQAAMLTTPTPVPEPPNPHLVRRPRWSPYEHQKSLHHSPWRMDNNGRAMALGPSRTIEGEMQSRFFSNPQFLVHAVKEAVAVVLQRHMAMFPEATDAQTAAPAFFELTETAQETLAFCDGLVQRMMERLEDKLGSLGTSSLLSTLLCIRRVVGKQSGRALLEYANQFLAKKATLSADGLRESLTAPNYFQILAAYADESAYHYDPKTRQYTYAPGFRPTETMKGLSATLNEISANQHVAWSAEMHLQVVRTLVGCGTMKANAYFVENVLRQFKWDSRFLEALYAEYRRHNTVDGWAELTKRALVWTARYNVIASERLKRLIEDDYDIIHVQTRTFRELAVFQFRDAEEKRHARDVVNELPNPWIDYVTHALPFPDRDAGYPDEYGDIGQWRAPGGPGSPVKGPGYYAPPMEGEHMRGYTAEWRDLKNPMKPPAFPEPWERKYKQYARGQHPSYDMVYAGPMPEIFPGRRDFRKPTRWDYHDVEKQGKHKISGPY 811 T 0.0012 PPR_long pdbhh F Eukaryota T 7ane 27 AA ao E9AF47_LEIMA mS61 MLRTCRVLRFRMKLGSMYVDYKIVSRNHRRSIRVEDALVDPLLPTTVVPLHWLEQLRCPSTRLLTGYHTEEAVYAKPNYGDRVSRTPALLSLPDAAAKTADNGAHANAIRAGPVVLYITGQSIPVVLNPLFVQPDEWGLTQSNGEWDLRIGMDAIEQCSLYAELRPGGLLYSKLPHASLTEAMEPVQDTLKRYGMRCALAESPLVPRPWTRMRYMFIDELQRGQKMTEFVGYNPRNGTQWRFSQHTKYFRTGIWRETIRRNEMNDGLHAHSSWQKSPQQAVPEISFLAPYP 291 T 5.8 AAA_11 pdbhh F Eukaryota T 7ane 28 BA ap Q4Q847_LEIMA mS62 MERAVDARRAIYELWSRTAAAEEHAQFSSDSTSTGEEAAAEAKAAEERSTAVAALLDKYKLDPATPREEDISRGLGDALDRLLLLCVPLSSRHGADLLVKLMQVSAQQGRQFSMRTIQHLFARTSSYAEALAVFYAMRRSNFAMSMEAYHAMLYSLQRLEEEGWAARFHEEFAASKGEAISEQALDFVLRGVDNQLMPENKPWLGRIMFAEVKDNKATQRQSMASFDAMGKLWVQRYKNGGTAPE 245 T 0.045 zf-C3HC4_5 pdbpssm F Eukaryota T 7ane 29 CA aq Q4QF40_LEIMA mS63 MLSTSQAFLASLRYRRPYWMLFLKGADNWKIYTVIQQPDHQRTEMLYQAWLGGLDRPYVRPKCMAHQPVWLSKKRHLLQKARLEGPETSVEKYVLEWYKKFHSFQGTDRPTAEDLHTAFDLVERPLDLSYACQLLNQCRNHYYIRLSDDSFEIFLEACLRVDRRDCAIYALEHAEELGFWHVSDNCRRYLAGEQTWYKRSPVDLLYYPLEENAERNTAVTSGTAASAVAASDKVKGDEKVEGAPRTSATTSATAASAENEGEAEVTDDEIARLQAELEALEREIGSEGADGTDKD 295 T 0.034 NMT1 pdb F Eukaryota T 7ane 30 DA as Q4QIF7_LEIMA mS65 MMFRGTSCALARSFRANLKYPSLVSYNKLPWEVVSHDSTKLHMHLAPNYEQLLTLAAVTDVPHLTLASHLIVPEAERLRVMPGVVYLLGGQAAHENPSSFTAYRIADPTSLQYYGRIHHNLAPIRRVDMCASADLRLLCLAMHFDGVLTNTSAGSTLDGVTTASQEGHFSLFYFFRPNRPANELTQPFEKFYRHRPSLASLDAFNAASPGKAESWTPVLQVPRRTAEKARLTPAEPYRPPQNYLMGLAERLGVRPGNAFGRRSLMWGTWF 270 T 31 PELOTA_1 pdbhh F Eukaryota T 7ane 34 HA v Q4QCC8_LEIMA mS37 MKSSDIFHACKYTPILLKSRTNDSGVNQYGLRPVNSYDYLNPTNLVNFGRGTAFDNLGVRRSERGQIDSAPSLGGSPVFTQAKLLGLSGDDQLRLCEAETTQLRMCMAKGGSACERESLLLDACLSKVGHLRRAISQAGSEFNDWFIQNVSDNHTKPFQHRPHDWRHYYAQEKLVREKQQNGHAYGRRPKEFSFGARYVKTEGYGKRPRLPYNK 214 T 8.9 CHCH pdbhh F Eukaryota T 7ane 36 JA p Q4QFH6_LEIMA mS23 MRRSSSCLYKIPKNTGVAPRFDTWNEKYEPWEHLKRMGRLAGTGFYIPPEWYNHFRMFPPINHNFQQEKTLNPHNVSEPTQDDTDYLSVERIALRDELARKSRLLASEGMRYYNVFWIRKPLDRMERHYYELERKGVAHSVAIKRVLQMFYDELTVKKRVAAIQAEEAKLSGKYISMREATVLMGVLTQLQKEQLTPHQVTLLAKEQREKAQAGEAFAATVERSTETVADAAKDAAEGDEVMSADSLAELLSTEADLDDSSPSISYKVTIHETEHDSVKQLQELALDHTGKSDWYTGASPVLHMEEAMPAKRTPASKKTES 321 T 0.051 MRP-S25 pdbhh F Eukaryota T 7ane 37 KA j Q4Q8H5_LEIMA bS16m MFSTNTVLWARALVDRKSPQLWGAPGAPIIRMRGHHVTWKFQSYDMFVEHTHRRRNSDIRLLHYLGKHCPHPQKSLWSPDTPVTQDRHLFMLTTIDVDAFKYWFGVKRCRLSVGPWNILAKSGLLPPSYKQNSKIMPKPIFDKEQLMRYYLANRKDQRQMEREDYLNYKNSMVKSPEERAAERPVAPFL 189 T 0.067 Ribosomal_S16 pdbpercent F Eukaryota T 7ane 38 LA l Q4Q3T6_LEIMA mS52 MITNPGPLRVAYSPDYLDWLYRAYRSKLKYTDERKKAEEVFNGLLLTNQTDEQGPAAGAALPGAPPPGQTLRPRHSVRRQAGEARRAAAQAKLDTLAKQQGMLDLFERQPQFPAIHIDKAARFHVVELFKEMVLDRAWKPEEVWDKALLYRAILTERQASYPASYRYILDTAQRVNLAPRESGSSDTGSSSSSADARSSNESAGGIVSESTLVIPREDNYMYFVYLVRRYYIDNAVEGHVVLRCHRQPNASELLFSHPPPKDEHEVLRSLYRPGTATATQGKDASAKAQDGAATAPQQRPSGAATIARPRPPSSYPPIEALWRCEENEALLRVLVFGELNLLVSENPFVRFPKAQAYLTRPSASTPVPGAAGGSVEGADGYGGPQQQRRGGGHRGIGSDGGISLSSVIAEKRGHLLAPLSRNVAMMIDSRANDVRRLQQRYEREDTASFQKMLRGSAQVEENPGLYSAYSDWSYFNPRAVRAEERDALSRQTVAALKTYDEASRDIYRVGFEEAEARSAVRPVEGVNNAPSYVPTLPHFVALVKKDPHVSFLSHVALPEVYNTASHAAAGVSAKHQLEKLVVQLARALYRTALEFHKEQLRRVNRQKVQVAASLLDRFVTERWRVHCVAHPSSEGVRDMARRFRAYVPFEGRILDESGFPTDARVEDYERWMAAPSV 677 T 0.066 WcbI pdb F Eukaryota T 7ane 41 OA ai Q4Q6W8_LEIMA mS56 MLAKYGDLTVVKDDLTLLEKTESYIAKWRLNRWEFRVPPLLYPAVREKVMLQQEILKALCLNRAEEHKHVLGDIQIVASITGISPESVREKNRAWLQEEASKLRWKGEVNKAKELRDAFLRLEVYGSRDHRLLERLCCIYGMGMQGTFDEAFSNIIVQDPSTGKLAVDEANPFAELQAYILSRYPQIDLIHDFLGLNVVSGYRPSLGRFLIHCLSKKNNISNPVSNGRVLLHVSTSKETLFDYGDSKNQVAHDDSIYGLPDFMYVRGNDIFLIIIAADNHWLRKRQVPHTKQLEGIARRCSFVLGIPFDKVRIRNLLLPPNYVDSSSLRRLTETVFDMSPASVKEAVPWISLYEKGLDAQDVDYCELEKTVNEEEWLTL 379 T 0.12 ApoC-I pdbpercent F Eukaryota T 7ane 44 RA g E9AE13_LEIMA bS21m MQCTSRLLGGYMMYHRKSMSTMRYSKWKGARGGLSHFYNRTAMLEKVPVNMPVSIVDRRMMAYVHRSRLRHFQLFRSYQQKSNSTECKLREGEFLRRRWHRQLQKSFIAFMQFKTMKVLEEQAKLVSQYGQASVNAALGDPQAAAGDVAHERKYAALHRRVQTLPRIQLVPKHVATMKQIHNDRFNYRWRVN 192 T 3.4 HMD pdbhh F Eukaryota T 7ane 45 SA o E9AFL9_LEIMA mS22 MLRRSALARRYPFTKRGPRERKSWKHHVLTEPPKPVEWRDPKVWTKDLSQMKSFDAPQWDLWLNRSRSQDMDEALQPFMDMPQSLKDRRYDIPWWANPFGAWYLQNVLSVELMKLPGRTNAEKIAIYRGRKRPATWDKSKEGLMDDEVLLKQIVKERWRTLEFGDRDAGYPCTFSDYIQFLNEWFKSLDEEGLQRLREHFDRKIRPLLAVMTHVDLMWLEALTQNSVQNKEQLERRIGFQTSLGTPEFFDMSKRLRYEINEDYKVRDELGPELFALWSKAPERWPPERLAKMYGLDFTLVRKILVWHHFKACYDACVEPDWTLPKRLFALEWIRDVRARKQGLFYGKLRFAEQKITFYSDKFLFRDLVNRREASYANVWEMDDPYRFLQTEQDYEDYWGDNYDVYRRMFPEMIGKSGEPVQQYSQMPVWAGPHRDHANKSEHNWMFAEIGVNVGHEPLKKLELDPTNEKRRRFVIRQPDGSLRSAKMSEMRAWYWKEEWADFRFWAPHMEWGVENTPSMEQYQEHVPDTPDADYRKQRRIQSRPVKWFYESHYSRSGSFAGFQPLRFMQRRTQREVRWPDVINAAVQIEKSKPSSYVFKAIPEI 604 T 0.00082 INCENP_N pdb F Eukaryota T 7ane 48 VA q E9AFH3_LEIMA mS26 MHSSGVARRQMRPYYNLPSKSEHGRRMTGFLTPYRHWMWKQNELWRNVHEAQFEHLRKLYKRQWLESFRVNADEYIYKYNITKSAQLAQWEHEMQGQERKRRESQQLAQGRQALKKKHLDLLREFHERQFFYWYERASERLQYMTHINYVPQASIQEHIDRELDKYTVGSKAPYPLNFVGQMPMLEDKDGNIAQVPANLMTNHATENPDGGVTMYEAPEGTAVAEEKLLQMIASAQEEELRIRPEDSDALSESMEDMDRSESARIDSRKVARTMEETDEEREVNRRAYIDRGKTGSKTIFRRPRLDSDGATPPSAGGTTPMKRRKSKMDRMHALQEQQDAVAAKATAKALKDEDSVAKATKRGEVAENRGRLRDRMIMPSLETLQQSPEMMAQNKPGGRVRTHHLMEKVYGIGKFKKGGGGDEDA 425 T 0.026 ThylakoidFormat pdbpssm F Eukaryota T 7ane 51 YA ba Q4Q6W6_LEIMA mS73 LRRSNRWCMKYANLELTTRGEFPHGMKEPGFVKKLDKNIPWYFSTYRSMYHWPVAGDGWSDLNEAEKHHDLHMYYTLAWWKLGEGIFDADDEDR 94 T 9.2 Mastoparan_2 pdbhh F Eukaryota T 7ane 52 ZA z Q4Q190_LEIMA mS47 LRRRVSATPSLAVSPAGPSSLSLTPSPADSQQRRSLKTLDVREYRPLGTPIEFRFYQRYANHPNRQSGVQFLTHYNTHQRFRVNKDFIDYMHWGKEQGQARLPHRHQRVAFDFDDSLQPTRAEGAVGAWFAGQDPTMRSHPDISASFDPNKKLFSHPEHWNKMFSKRRPGEGDIKLNVIPSNSLLGPMVTQTDTQDMAYFKTETCGPTHGRVPGINAPFKGEMDRKMMQAMSRPLNRSCTLTGNNGRFSNTIFINDPKRHQTLSATLAKELNREVDRATNGLYSKLTVLTSAQSGLTDFFCGGTDLQCIGFDLTMAQLLRKEADALTKSAASGSKKVEAKVHELLRDAERYEERADSVLRENAAVIWRAYTSPRALMTLVNGKCRGTGCGLALAAKYAGLQDASEFIVDGPNVGLTPYSGMTRLLARPETSLKYPGLAEFVMLTGASLFAGDALRLGWSDLFTSLPDMPYHIKDWFDSTEHMHNDAVAWQLGHLLEKCFQMKDRWHTSAMERCAMTPIRARWVEDAFADQSSIEEILKTLSAMEKLPLTDRHNTYDPSYATPYTLASVAEGVEKLGASRLRYTLSPWDATPPEEAVEVRQAAEIFTSYVLERRGKVNIVAHRDRHKVQAWQKQREREYVAYSNMKNAPHRRHVYVRLEGCEGTLVDFDFTIDPAGDAAAAAAEKGAGVDDRNELVHTASVERLKRAVLQAMGMPADRDVDLCWYLPTLDTCPIRNDEELVDVLHSDPGFEDPSAQLRYPPIYFLVKRNTLHLSEWAYAVKHQLLLQSPYALKATLQLLQEVRGDGSAKAVRSLADTLATEYRYAARLLKRPDFYQVGQHVDKSPEEWDVVKEERMRYVHKAHLPTRPLPDYEVVFERNVQLDGHTFQLRPRWSPRTVQEVTAESLAPLATPLDFEKDGAVEFNVVVHASKADRLAGMIEDAGGFEVVAHLGEVDKEGNAKVPPLHGDAHVPTNVNFYEMARHPWEDTPSSTRRDGFTAGSKEYFEQQYKKAEKAVYDEAGRGQRNYWPSKAAVDGVTGEESNALLEERFFAKLRDAERGVESWARQLRKKAVEGKLDNKPEIATQQEKIYDDDYYRWFIQPGHNPNPSGLLRGRKVADSGSSSVDKDLEVFLNQLLSGAAERGADGTTGDEGEALSLPEEDTDEAADST 1169 T 3.1E-05 ECH_2 pdb F Eukaryota T 7ane 53 AB bd uS3m TKKGATKILFIYKLSKLNVYNNESYKIKLLFNHLYCIDNYNSIYFNLNGILIWLNVLHINIILIKYAFLILLNNLEYLIIFKYNIISIK 89 T 150 Cytadhesin_P30 pdbhh F T 7ane 57 EB D Q4QI77_LEIMA uL10m MFSRGAAATAMAKVSRLVSPRLRIIHRDYLTRRGGRTHQRCSAVAVDYTPTYFATYKSDPGQCPRLIDAEAVHGDEQAFWSARRDFYRGGASRSYYPAWDRQAQALIMLTREVPRIPQEAAFRLFTLGLKMMLLPRLVAGVELMLPSWVTMNAESVLNEGLEGKVAEADGDGKATGAAADAAALPSASSAGANEDSGSANAEKR 204 T 1.8 DUF5783 pdbhh F Eukaryota T 7ane 63 KB J Q4QCY7_LEIMA bL19m GYTRERTNRHFFVSRANAFFSRLPISRIQRALAMEAIKKGSMKPWKHTKEQIIGSPITCNFEYNPRPVRLIGTVMDAHTEETSIKGGLKVYARSEEANMMLWIPAGNPKLKYEVTAAKGSFEHYLDERSKWDEAWLTGRARMK 143 T 0.073 Endonuc_Holl pdb F Eukaryota T 7ane 69 QB P Q4QG34_LEIMA bL27m LRITPSRYASKVTAGNAKNQAGSPRQKAKLFHVIPGTPVTPVEKLKEQRRRFGQDRYSRQPEYRPGRNVRMDPNTFTLYATTKGVMTIRTSRINPSYKWLDVEPDIQKVYRSRCMRAALQARGKASMMVAGNVHYRAELDHVTEPHWRERVMRVPKATERFQDPNYFTRGLVPSLRPLSRYSYE 184 T 0.31 Ribosomal_L27 pdb F Eukaryota T 7ane 70 RB Q Q4Q719_LEIMA bL28m LRQSSLLCFSTFALNPETSRAPHGPPRGLINRYISMGLPPWAAWCNRVNRHALYRMSDVSPRSFLPKAPHEMDVIWMNERVRERVRTSRQVQHVYRQLKYPFVKTGIHYSDTLDHWVQVPMVEAAMFEIEKDGGFDNFILKRSGPELRSTYGERIRRHLLVRQKETQKNFVLDQQAKALAEVTQAELMKATSEEELDAVLAKYGMDAEEFKRLMAKRVMEQRKSVAAAGLRSK 233 T 0.011 Methyltransf_5 pdb F Eukaryota T 7ane 72 TB S Q9U0Z7_LEIMA uL30m RRCVMAKGEDPAHVAGWDDRQDAVEWWWTEANDSRGRQRLEAAAAVAAAAASSTVGLPLFPRFSPGRRRRRRPPAPPPPPPPLFLSRHLHSMPWLWCTCVKMQMYYTPTALTCPLSNSLAAHVGHIIVGVAALLPYSMLLFLTVMCNPRKHEPVLRAQRIRWLTFHSLMFRLLRCITASPAVAASVAVAAAQTPTSLRPAAVCRRGVHLAPSVLAASAPPPPPQQQQQPTSAAVPASTATSTTTIAAGPYRRVGNVFIVTCIDHPFKFSWEVNRMLRELRLEFMGQTTVVPDIPPVRKRIWRVRHVVRVDQLDLDEAKALIGIPEHISFRDLAGQIPPTFGRGGSVANPHMRSKMNFMRLRRMRLRDVMHRDQLEKRLLEERHHALQQQQQQQQGGGEAAAAAAATTA 408 T 0.00017 Ribosomal_L30 pdbpercent F Eukaryota T 7ane 73 UB T Q4Q2W9_LEIMA bL32m LQRTTLRCYSALVGQATPVLLGSKGGTPKRKKNPMQLRRKTYGLHFKERYLKLEEWYFCPLCAEPKKQGEWCRREDCRQIKP 82 T 0.11 Metallothio_Pro pdbpercent F Eukaryota T 7ane 74 VB U Q4Q2Q8_LEIMA bL33m FRASCTLLGHGQYKTRLKKRMVGFIPKVIPRKIRNNMVALRSEANTGHMEGYIKTEAERLDATGRKLQKTMWDPVLQRYTLMKETKVRGPFLTKSNIARKVDFPVGALHGTKLGGKK 117 T 0.0038 Ribosomal_L33 pdbpercent F Eukaryota T 7ane 75 WB V Q4QCK6_LEIMA bL35m FRISLICFPKAGCEEITRQGRRVVLKPQEYFAQHRMQVWQMRFKEMGPPFSRVWVALGGKMRRRRIGRQIDVKDMRYYWRPIEPQYQRLYMSRLRIKDHSNKRVQPMRLRATNNDIGQASSLKEWERSSDRKYGAALAPPKKRDFEFRVF 150 T 10 Gln_deamidase_2 pdbhh F Eukaryota T 7ane 76 XB W Q4Q6A3_LEIMA bL36m LQYTSSARQALRATALVLNFFPLGYTCGPKNKQVFFPPNNLDGRTTHQMKKLQGSTDKHPGLVPRDKLKLHCEFCRFHWVQDTLVVRCAAHPKEHNQREIWLEPTWTWGKQQPYQYYKYMPVNINPRTGMPLAREDAKGMNNERRSQGLPTKTRLLERERRGISRAITGLGIYNQRWQTRFPFAT 185 T 4.8E-05 Ribosomal_L36 unphh F Eukaryota T 7ane 78 ZB Y Q4Q448_LEIMA mL40 MWTLSRPCLAAVRTAVLCQKKQTAAGYMASAGKVGNEEKWAQAAMEYIHEKNHVNDARKRQQDVDQERSIANAYDRYSAVSEAKFDERLSRLIARMSEALEEMRNLGLEEALEEAVLLNSEQPPGHYRRPSLTPPLAGYEPGFGLDVPQLRSQQAEYPPLRRPTDWLEFGEGGADDFPYVDTHKIEDLTAKHEAQLEEQHGVLREAAPLTGVEGEGWEAYVALHRKALARQHLIMDLHNDPELRDKYNADEAFRAAEWERRGMGALSIEAPLERDLELHYAQVPAYEAFRSH 292 T 0.00047 MRP-L28 pdb F Eukaryota T 7ane 79 AC Z Q4Q152_LEIMA mL41 MLWCTGPRRIVFHNAPSVYPFTKPFHDTPYDQDRGRFDKTKNILRENKWPAWMDHGADGTGFGIGLNRTHPLSKLRGNLRRNPSEIPRVLNMMIQGVWHKSGNKLYFRGGKPPNPSTHPYLTGEPCPVYGWKVTDPGVIREFNLPQPEDKTRYKPYVALQERKIMGMQAPTKEHSAASTSAASTDSKPLMKRLFFWK 197 T 3.3 MRP-L27 pdbhh F Eukaryota T 7ane 80 BC BA E9ACP5_LEIMA mL94 MAQWIPKTAWKVSNLNKRYGAPYVAKGYASLDPRCSLDAYSSFQQTVTSADMKKALLSIDSTSSGALVIDVRSEPERRLRPLLSPAIVALHPHDILSGAACPILPSNKERAEMFVVASEAQRAVNACTALRRWGFSRVTAVSVDAVSEAIAAVQKPADAATSSSTKS 167 T 0.0043 Rhodanese pdbhh F Eukaryota T 7ane 81 CC UA UA XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 203 F F F 7ane 82 DC BB Q4Q4D6_LEIMA mL95 MLHRSCVLVDSFKEHYHRVHLPRRLALQRYIKREEARLSRHKGKAVAAAAAAGVQPGEVAYKYNRWWVSNDHEFVHQFAFVEDPDVTREKRNTLPLVTKENIWKEPQQTFFLPFAPFVRVVDYAKDPDTKFLKPVNIPRWKDYMQRTKPIVPRTWY 156 T 0.14 DUF4653 pdbpssm F Eukaryota T 7ane 83 EC Aw E9AD00_LEIMA mL89 MSSGAVGRGSFHSVVAGANPRRIPTYYNSAYELIQLHRAHREVTRNFLVRDKVFDNKFPGCSLANGLFKMVPNKRGNFHTRELTESIRHRTIWGQRIQQQRTINAAILEDATKVLSPAQMEDRFSYRTPDAAAYFSPQEYTAANNWPNYWQHPTEKHVVPKPRWRREPELGGITRVRDAVATPIADY 187 T 0.092 DUF3295 pdb F Eukaryota T 7ane 84 FC Bj Q4Q0E1_LEIMA bL31m MVLNKWAAVTKSAPPAAGLRPLARTVSPNPKLRPADYKVPYVLRTFIKDRHSSEMQHIENRGMYREELAIERSRFPRMQKTLTIQTDGSLNEREFEFAVPPVVMLFQDRLSAHRQRQVALAKIGKLKRVKSWETSVRGKESLNPVCNALVFPYCVPKKMLVRPRIVDPLSAKSMADNRRSRDDPS 185 T 11 RNA_pol_L pdbhh F Eukaryota T 7ane 85 GC An Q4Q0F5_LEIMA mL76 MLQCTALVLKSQHKNVLRKGRPHMQKYKELNRWQREAQGITKWEQGHSHRPQPYVERFNPEGAGLTRGTSAYAWKWWHTQYPWLPNVAPADYVPPSPRGIRPAAWDDEFADVVLSMSDEEIQSYLLDKLTEVIFAETQRDGYELRRLDFEGKPLTELPERRIIENFVFEEETLRERVLDRVVEGVFRLVPTSTDRLELKSVANIIDFVLTHVTVARKPLQHEIPEAARTVMRSHPLQPQLGFVHALPTDNRDAVVQEWERMHHLDWQFGKAVYEPRSAENERGNLTWLREVRHHEAREAFQADVDSGEARRRHMAKIKAAAQVPHTGTTSQ 331 T 2.9 LRR_1 pdbhh F Eukaryota T 7ane 86 HC Al Q4Q1C8_LEIMA mL74 MLSSAHRAAFARPTATLWASARSFGAGPTRLLLGLEQVQDVPTSTDRKPTGMHRGPGKRQTAPKEAAQYQFIKKWDLQMRETWDELEPFKGLPKPKVQFGNEAAEVIWPYALLLENVIKVHPYTKSIYVYYSQRQSTPLGELAARVAKRVSQAYLIPITFHNSHVYVEAEMLLEYSETPWVVVHCLDGTHKLIPVKPQAGQTVKEGAEEVLNGIVSACNEIGSAVKNPKEVMRLLSERPLQNQYVRVNYQWYGDTPEERMSHLVKWDYEPEEVVPQLRNRTQHVLDWMNYDGNLPTHNSVRVNIHREAARMRKPNVSAGPKTFFNSSGSRANARTARFDNSRSSQS 346 T 0.011 L51_S25_CI-B8 pdbhh F Eukaryota T 7ane 88 JC Az Q4Q4D9_LEIMA mL93 MLRFTQVIRKNPVVFKQGQGMFSHQLKRILNKKSLHKYNWDPLHMYDPRKLVHANRYVDHDTYEEKYDPHWEHNAHLVPDQQFYNIPVPKEYKDAYWWRDLQARRVQCPTEWVHFRMHTKDKLKYDFQDLAFRKKFEYSYEDVVANAKDMCS 152 T 7.6 Pox_VP8_L4R pdbhh F Eukaryota T 7ane 89 KC At Q4Q4L5_LEIMA mL86 MRPSALCLGGFTMKYKRGTGLWDEDHVNDFDANKYLSARSTMRWYYGMERLQTRNNMNARRATQSYNNNMGLHHSGRGAFERELERRGIQVDKYPLTTTTGAARVAEMVLLRRQELEAHAKKAMESQRQARRRDAPSEWYDETEGPLNPRFLASMQSNYTQVITELPSSPVTGRRELPGASFA 183 T 2.3 DUF2663 pdbhh F Eukaryota T 7ane 90 LC BC Q4Q5D8_LEIMA mL96 MNDIYARRLAQTSMFHQLMRSHGTLWAATQVTKEKLNLAFVKEEMMRVNGRRAMPLLIGAAANENLNDTHFTHLTEHCAWTESARAFAVQRQTPLTQHIASMGRMAETITQAKTASTSQLLFNEHLARIDGISEFEEEPFVDDEDDS 147 T 0.027 Chloroa_b-bind pdb F Eukaryota T 7ane 93 OC Ap Q4Q7V3_LEIMA mL80 MQRCLARLFQAGVHTPHGSRYNAARMKNWPVQEVPQNFNFTNEQRFKAKAVPRDTGKIPRDFLLSVLYRNQPCEVASLWEHCLHDPQIVLDSKRHLREVLQQARAEGFVSFEKDAVTDRWVCHLTRERFEEVRALVGARAEAQDLYSGLRGASATETSAYSESFREMNEDTKREHFRLLSEQVADTTTHLRKFQRMEMDYLPYTDLNGKVNFMWWYEMSDTRDATALPEAAAEGSPKLSE 240 T 0.15 Gluconate_2-dh3 pdbpssm F Eukaryota T 7ane 94 PC Au Q4Q8J6_LEIMA mL87 MMLQHTSLLCRKALQSYPVPPRARNYERRWSSSRTNPYNRMFWRTVLNEDFARPSFWVSDFRHKYLAKHGMDYQGRVPASPAPGMYQGFSDVHKILANHPKPQRESRHLPVMPMTPRVVFEHAQEKRIDYAKKMHRDRRLVEQLRTHEFWGWYMKLQRVRGRWCKEHGVSSRGVYGPAVDAAELWG 186 T 1.2 Crl pdbhh F Eukaryota T 7ane 95 QC Aa Q4Q183_LEIMA mL42 MLRLTQAVLRVQSHQKKRAQHPNAGTRFGRVYNRGFVRYGFGGFGMSVYSSKKDRTFKVMPVPPPPPATTAVEQRDDFADNRGLSATTRTLSPTFRMFALEDGGVLVSHPSHAQIMRWNQRVHTEEGKAANSTVMDEYVNSRIQAIIADNTIENTSLSQWRKAHMWNVIKSHGKLQRRWGTPDFVMGARSTLYNN 195 T 5.8 RHINO pdbhh F Eukaryota T 7ane 96 RC Ao Q4Q547_LEIMA mL79 MLRTTHVSWASTAKGYMNRVMVYAHRRRKARYLAPKNAHVRSPLAHKMPEEYGNTWDPRSGVEWHNRMRNRNHYRHWPWARWTDDPVRFHQDSVCHRTVSALSTVANNGAPEWDYYAEVGQAYETPSHFPLSYTAPFIYQYTAQCWSREDLQSYLERIEQSSGLRTIADAASRREALYTWWHNAGMNVIPLGVLQHLELVSRDIVAQNARKSYRIEQHERGILRTPEMERYYALPHLRGPSMPVQLAQPSGKYPSGKFTQMMEDVAIHPLQKPDARYKHNMYPA 284 T 6.4 BNR_6 pdbhh F Eukaryota T 7ane 97 SC BM Q4Q703_LEIMA mL70 MSVFPGLCGDVATTNYRVFLGTLPNLAVEERFLRQVQPVFPWYASRKHVKEQASEFLEIDLASCDPELLLRYTHVYYVRRQLYDELVDRQLTLMETGKAAKVADSALLTCLAQVNAAITPRLQYELHLLQQAKKACRVPRRRELNPDAALEAHDYLCMMRVVEEDVGGIPDAEMQARAYLPREVLEAKVKELAAMIFGDGGSATKGTGAALERKEQKLLQRMIPADYNKVGAVEKLRPVDVTALYRFTGERVCGRPADKPFARALWGHVFRKVGSHPLYLQRASLYWARHSGLDPQSATSAMPADLATAVCVQQALFPALKYRCQYLYTSPDIARQQWRTGHVVPLLRLFPLLGAPAAEDLAAQLVVEGEWAKLGIEADTNLLHDTVLRQLKDMVEQVSALYESDAGAVLKRVEDGAKVLCPSLSERESLTMRGAPEDTSREVSAAAAARVANAAPA 457 T 0.37 DUF4911 pdb F Eukaryota T 7ane 98 TC Ar Q4Q712_LEIMA mL84 MLRWSRLLREMAPELQLEYIPIIFTRTILGPQGGFAGEERLIKREVAQKYMSEGNAVTPSAEFHQGVWCYNPDSEQYDRFVERNAEFLDFAARKRQWLDVYWRVNTGYLLFGRQSWGQGWLLNCPLRKKDIAQKLWEQYKVRVDPRLIEFREKDRRTGIQDLGHNWCWLYLPGAEELAIDREVYDNKRVKVRMHIRKMSSYGALY 205 T 1.8 Ribosomal_L9_C pdbhh F Eukaryota T 7ane 99 UC Aj Q4Q728_LEIMA mL72 MCTRVFDVSQRFILLVSLPPLSLSLRPSCCSRKAATRRRRSTASVLLATDALSLFVRTHYPSPLLFLLLPNFPLPIHSPRKQMRAAVGAVLQPSSSVGALRCQARFITRLYTSYFKGELFPNQLARPLERLPRGVSLAAARKGQQAAAPSSSGSGNPATTASLDVVTWGDVDSTDLVHANEQSRAVAPQAGLAPRRPYVPLGEVAKLELQGDYLTEGGLHQEALEYYGVVAKAYELAYPKDHPQVAGIRLKLAGAFRRTGRLTSSKANCEAVLQMLDSAVQPPLELIVEALFELGLTSEAMSDAAAGTVFEEAVALVDMFHNSGQSHKMLRLLPRLGRRFNLNFEEKFVYFSPFDYDRVFALADQCLERAEVFYQARNDRAGVMRVLQQRKELIDKKFFNMRDFAGRIHTMRGHWKRRAQVLTNAPTPDELLRYSPTIHQVYRDFKYELNAPIGREKEVQPGVNRVVHDMGNPYRRSGVRSQRMFRDAEKNFEKYIRADAFEA 503 T 0.0013 TPR_12 pdb F Eukaryota T 7ane 102 XC Aq Q4QD92_LEIMA mL82 MLYGGSRVQYLVQPPFTLHKIRSENLPPPSLYAERHDLGLEMQLPRDMHVYNSINMAIQRQVGGDSSTLDGEQQQLQGGHADGFDMGAFFTEQHHPERHHNSSLPYAKHDTNNVLAMRLFPVNVGVRARTEAIRIRTDDCLQRLRDADLCAKMRLPLEHPLPLSRRSQYAAIHRVRQERCYDAPTEAAGERAAAAAEEASRTAHLRGAAAHPPPSELSIVTRPVDRLGSHSGSSAAACTADADHLSFPVHPFAAAAVSSGCHSARSGSAARLASQRWLPLQTLKPMGHNWSAATRSSGVRGPHMQLMQERLDQKGFGWKRKSRSLWQQDVATAGFRPHRYF 341 T 0.21 T3SS_ExsE pdbpercent F Eukaryota T 7ane 103 YC BE Q4QE16_LEIMA mL98 MLGGLRPLAAATRRTVGGALVSPALITPSRALSVRTEDFFSKEAVSHARRVSWAPHTTEKKVGAFAKLSRSNFNDPLPVSFQSEPYFEEEIEAYRAHHRPDVYVYKYNVSPTHLSLRE 118 T 3.7 DUF2975 pdbhh F Eukaryota T 7ane 105 AD BP mL52 MRRRDWCGVCLPAATLHALARRYSEYRSSYTGARSAPWAAPEAAPAYPSARSPFPLERPRFRKTHIEWMLHHGHGDRYGKYGPSREIADFEYADGTPSSISGKRFALKHHQDHLLVQLIRSAAIVERFEEEELLPRIPGTPEQRSWDPEIPLFLEDVDEFGRPPRPVAGNMVARVIEERFAQESGRTPVNLANKHAGEVLEPNTMFATYDPAAFVSDDIKKDVRRPFWSRRRWALSDNFMVPMSPKPKNTIKDE 254 T 0.0014 MRPL52 pdbhh F T 7ane 107 CD BF Q4QIQ1_LEIMA mL99 MRRTVRALYNSFERGWKDKTVHPLDRRGRFNLDEAAAELQLDEAYVASLYKPLHYTYSMKGQRYPAEQGRTSRPGSLAASRDRMFPLYRRNYKLNRELRVLDHRRISTD 109 T 0.52 DUF6416 pdbhh F Eukaryota T 7ane 108 DD Av Q4QIT7_LEIMA mL88 MFQRTCTPRLLACTSALLKRSGKPSDLPDYKQVYLPYDTAPTKTELDRERRKFMHAYSGRMEHRKMVEVKDVPQNMYTYGKEGMSIPISIFKDQADPVIGPEWTYPGIFENKIVAQHWYMEELFDREKSNTFESPWQRQVLDNQVKRRLGKVAWRMSMLNIKTIDIFHKERGASKRPGAGDTKAPATPAGKK 192 T 2.1 Ribosomal_L37 pdbhh F Eukaryota T 7ane 109 ED Af Q4QJB6_LEIMA mL63 MLRRSPVPRRYRTAWRELLHPLPVWARRQQWLKRDTVEMNEAILREPYYRIKTFAQPAAFVSPRVSESAAHEPDTQQSSRYGVDRQLRGPRRAVSPERLQELREQLQFVGSIGPKVPPAAGAGTAYQDEYGTRLRPRYPQSWDTVPPHQPSRSEI 155 T 28 PsbP pdbhh F Eukaryota T 7ane 110 FD As E9ABZ5_LEIMA mL85 MRRLPLFCRRPSRCCGATASGSGSSSAAVLAASAAPSVLVLAARGIATSGRVTNEDRRWWLVHLECAPDVTPGTFVSWLDCCGTHTTKKLIERNIWTIEQVAELDSDRVDELKYKEGCLKMDVVWEHARTIITPLKQREVSGGVESQLQSRILELRKKRELERQRELLARERATVSDKREETLRRLRESVAAKKAALRKKLDEQHGEATPAASESASTEAHRGTAEAAVEDEAVGNIVDRMSGGNPPRA 249 T 0.39 OmpH pdbpercent F Eukaryota T 7ane 111 GD Ae E9ACG2_LEIMA mL53 MTAPASHYTFANLKKLGLCAPQVALSRQPRLRPHVGHLNGLVYPLPYYAMWRGNHDKYTYNQATPARWGEGNTNTMYHQHYAHAKCPTDYGRGGREFQFLSVKRGKLKRKPLPTVQYVDPNSKPQWVFKSWHNPLSAPSMWEREVQYPEHTPAHTGAKRPLAVVAPKTSHKHLFLMHMEKVTVTVSPLLFGYGHTLQKAALDFYRRGLSARSPFPSDKMFLYYSIDHITPKIEVTWLDGSVYVPPLIEGVKAQDLIQMVMEQAWLAADRMSAEGRVLNPIAIDDYKWEQLIAFKQKRAKGAEAAKGGAKKK 311 T 5.7 MRP_L53 pdbhh F Eukaryota T 7ane 113 ID Ah Q4QC45_LEIMA mL68 MHLHISSIPHRNSNNSKGGVLDATGPMLSAKRGALLLQAYHRPGEVISYKAGDYHLVPKKFTVGKRIAVRSYLDRNRTELSDRTFMPQKNWFRPYDLQDGCFDRDHERLSYRFYNLETKVIWKAFDTPELIGMLLHDETVKGNSGMYAPDMLDAALHYTREARYWRCIGITKPFYDRNTLRAHCWEDNGLQVGTLVMSQAMRHALMDLERAVRRKELGLEPNYLWDRWGPIGFIDGARADYLPRFEHNPYVDPDGVDVTEIDVLPFNTHEQIRERYRDFIEPDTAPFEEVFRSPSHGSLTTLADIPNASVVALYKDLKLKAGTPVAGDAVELAPADVRTLFYLSANPEWRAVADGKASWEEVVDAMQPVQAELDEKIDAARLLQNTRHNAERVRAFFEEKCGFHDFMYTPDKTITAAVLCYLTELRRICTETAWGAALAKCLTDMERVQGMGRDAFLVYRHIEDAILDKKRRLWAGRFAGESHEESTLDYLLENFGRRAERPRNVGTTGVEFDREQEPIGRQVQRRVLDSDKANKLAEIRRSRGKMWSKKRSVFDALHEKQLQNFNYGVH 570 T 3.9 VirE_N pdbhh F Eukaryota T 7ane 114 JD BD Q4QE11_LEIMA mL97 MSNRFFQKFYLRCGNCSAIQRSAQGYQPIANPILFKSDEHCRNYHDEQRRAAGYSGMVVTCRCHRCERVHSNWKVLDAQQFLDAKLRMTPEERAQRLWVSKS 102 T 0.91 Mu-like_Com pdbpercent F Eukaryota T 7ane 115 KD Ay E9ADN7_LEIMA C2H2-type domain-containing protein MLRIGRTLLAEVTTINSTTASVSGRLIRIRKKSKWIDRRSTRVPHNGKDIWYFGDQPSCALCHIRFRYKQDYEAHKESELHVNRLRWVETMNWWRETGEPAYLKASNEQWEWFEQHVLPTKAQEMGCTLDEARRVYRQAIMTETPTWHRPLQCPTVKQEVQEPRDQRWPASPKW 174 T 0.0078 zf-met pdbpercent F Eukaryota T 7ane 116 LD Ag Q4Q829_LEIMA mL59/64 MAFRGSSARLAATPGVGIAPETTPVKYVPEMLNIQNAKWWNGRGKPVYRSTYNEKSWLEKARWGAFTKGSRPVMRQRYSAAALKEALEMVPEGFETCDVPRPPQRIRAQSEGVVGRWYTNYWTLHSVRYQCQLAGVEWQFGERQRPRTNYDEPHMYTDFEETKAIRDYRSRWINVNRSLVGMSRRMKESEEEARYLHFKKVQDTFWSNRKVLVNRIKSMHNQGTLQSAKDLPIKTINIKAFLAE 244 T 0.036 DUF1672 pdbpercent F Eukaryota T 7ane 120 PD BG mL100 MLARYLDPSVHPLRVGQVVAYDYLHAAKTWQWTLGTVREIKDYTAVVQQWGLHTGDIDTLRSILLKEVDTENGRMKNYHDMLAIAREKLASIRRSNEDRVSHVRGHFDKAREKVELIDEVDLRKVTAQAAPSPVAVAVLKAVWAVAKCDPTAVEFYEWADVQLEYRKPAALDEIAKTDVLAKLYPSAESLQQSLEQDPKLNYKAAARDSPVVASLHAWVITALAYQQAYNLLAHDKRIQEQNDAIAAAIAGMKACRAKIAKLKDELSSKDTAALPGQVTSFTRTSVLVTIPLSAVISPVNVDTDVKRCVLTKDEVEQIPIDAKITRYAQKQKLAITGSHLLDQYAAATTTHIYVTELEDRLFFFQHYMASALRDAQTAAVDAHQRLAVSLHELEAFRQKRHDAKKARAAEPELADADGVEPSSGPTSSRSPTGRAAPRGQSAAPRGTASQQHKLLGPAYQSIDPATIANEPLYAVTIEEYKAKDAAGERAMDEAERMADEVQRLAVELEDAKAAADKLAEELAAKDEELAAHRQKRHDAQQARASDPALAAADAVAPRSGKGAASPHVGAVQRQAVDPATVPVAPAVIAEEPLYVATAEELQHVRDFADQLAEELEAFRQKRHDAKKARAAEPELADADGVEPSSGPTSSRSPTGRAAPRGQSAAPRGTASQQHKLLGPAYQSIDPATIANEPLYAVTIEEYKAKDAAGERAMDEAERMADEVQRLAVELEDAKAAADKLAEELAAKDEELAAHRQKRHDAQQARASDPALAAADAVAPRSGKGAASPHVGAVQRQAVDPATVPVAPAVIAEEPLYVATAEELQHVRDFADQLAEELEAFRQKRHDAKKARAAEPELADADGVEPSSGPTSSRSPTGRAAPRGQSAAPRGTASQQHKLLGPAYQSIDPATIANEPLYAVTIEEYKAKDAAGERAMDEAERMADEVQRLAVELEDAKAAADKLAEELAAKDEELAAHRQKRHDAQQARASDPALAAADAVAPRSGKGAASPHVGAVQRQAVDPATVPVAPAVIAEEPLYVATAEELQHVRDFADQAAHDAATREAEVAGTVENLRNELDDVREMNAKLEDEVFALKEQLSDAEDAYKKLAGALVVAEDERQELCDDLEAALDELEQKKDEYDELLGNLEEVQGLLEAADVAGRTAVEALEQRNRDMADLQGELANALDASKENENLRALLDAKEREIDRLKEYNSFWTDTVGTGKQKVTHRLTKIFDGDWTRLMRHRPEALKAAFVIDSSNACHVPGDQIFLVSNSFTRRLLTRTDHCPKCDRLSTFRFMSVSGMVGRMPYKPVDTPGPSYATLYWRKQRSGKIASQPLNEVCNKNEF 1347 T 0.012 Fez1 pdbpercent F T 7ane 121 QD UB UB XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 67 F F F 7ane 122 RD UC UC XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 144 F F F 7ane 123 SD UD UD XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 95 F F F 7any 1 A,B,C,D,E,F,G,H,I,J,K,L A,B,C,D,E,F,G,H,I,J,K,L Laspartomycin C Friulimicin-like mutant XNXXDDGDGXVP 12 T 1.4 LCAT pdbhh F T 7aoi 4 D A5 A0A3L6LD92_9TRYP bL32m PKRKKNPMQLRRKVYGLHFKEKYLKMEEWYYCPLCAEPKKPGEWCRREDCRQIKP 55 T 0.036 Metallothio_Pro pdbpercent F Eukaryota T 7aoi 5 E A8 A0A3L6L070_9TRYP bL33m PKMGCEEITRKARRVQLQPTEYLAQHRMQVWQLRFKEMGPPFSRVWVALGGKMRRRRVGRQVDVKDMRYYWRPIEPQYQRLYMSRLRIRDHSNKLRQPMRLRATNADIGSGSSSIEWERASNRKYGAMLAPPKRQDFEFRVV 142 T 0.095 Cytochrom_B559a pdbpssm F Eukaryota T 7aoi 14 N AT A0A3L6LD66_9TRYP bL19m GYTRERTNRHFFVARANAFFSRLPIARVQRSLAMEAVREGRMRPWKYTKEQILGAPVTCNFEYNPRPVRLIGTVMDAHTEETSIKGGLKVYARNEETNMMLWIPPGNPKLRHEVTSTKGSFQHYLDERDKWDEAWITG 138 T 3.5 DUF2760 pdbhh F Eukaryota T 7aoi 15 O AU A0A3L6KV21_9TRYP bL20m HYRAKLELDRIRSMLRGRARLERKVGLKRLFFLMRTQTRYRVEQQAHWERAIVRKNVDSAAREHGTGWQHLRNELGRQNVMLLPRSQQLLAQYEPLAFRAVVELCASRIPPPPPPVVASVPEESYTLWPPASHDNSECASTDGSDAPHGQQQSLSHPAARVELRCGVERVLRRGPSGLGNNVNELIDAWKEFDVSP 196 T 6.2E-05 Ribosomal_L20 pdbhh F Eukaryota T 7aoi 20 T Ae A0A3L6KTI0_9TRYP mL41 KNSWPEWMDNGADGTGYGIGLHRTHPLSRLRGNLKRSPSHVPRVLGMMIQGVWHKSGVKLYFRGGKPPNPSVHPYLTGEPCPLYGWKVTDESVIRQFNMPSIDGTNFRYKPYVALQ 116 T 0.71 MRP-L27 pdbhh F Eukaryota T 7aoi 21 U Af A0A3L6KXE9_9TRYP mL42 FGGFGMSVYTPKKDRRFRVQPLPSLHANSLADDTPLVTTTRTLSPNFRAFALQDGGVFFTHPSHEQVMRVGQNILAEETKATGMTSMDTYVNSRIQSIIAENTVENVALSHWRRRHMWNLVRTHGKLQRHWGV 133 T 0.064 ATP-grasp_6 pdbpssm F Eukaryota T 7aoi 25 Y Ap A0A3L6L3K9_9TRYP mL53 TVGLLPAQLALSRKPRLRPHVGNLKGLVYPLPYYAMWRGNHNKYTYNKSTVCLWGEGDTRSMYHQHYAHAKCPTDYGRGGREFEYLTVKRGKMLQKPLPRVQYVAEGSKPVWLFKSWHTPLSSPSMWEREVQYAEHTPEHIGAKRPLAVVAPRTMHRYLFLMHMEKVTITVSPLLFGYGHTIQKAVLDFYRRAISARSPFPKDKVFLFYAIDHITPRIEVTWLDGTSYVPPVLEGASSQDLIQMVMEEAWLAADRMAAEGRVLNPLAIDDYKWDQLVVFKKVRDKEAS 288 T 5.2 MRP_L53 pdbhh F Eukaryota T 7aoi 26 Z At C9ZU82_TRYB9 mL63 RYRTAWRELLHPLPVRARKMEWLKRDAVEENEEILRRPYYTIKSYALPPAVGRQESIHNSNNIRGGMHSSHSLDLIMRQPRRVKTPEQLRALRDRLRFIGVTGPMPQATSVSTKSYTDTYGSRLRPRYPESWDTVPPHQPSRELL 145 T 21 DUF4113 unphh F Eukaryota T 7aoi 27 AA Av A0A3L6KTC7_9TRYP mL64,mL64 TPMMLNIQNMMWWNGKRNLYRATYREKTWYEISRTGAFTKGRRPVMRQKYSREALQAALAMVPPGFEVADVPRPPQRILAQSEGIVGRWYSNYWTLHSMRYQCLLAGVEWPLGERQRPRTNYDEPFFFADFEESKARRDYRSRWINVNRSLVGMTKRMKEAEEEARYMQFRKLQDTFWSNRKVLVNRVKSMYNQGAX 197 T 0.022 DUF1672 unppercent F Eukaryota T 7aoi 28 BA BA D0A5V6_TRYB9 mL67 RYRPIPESMQPKHLEDNFTPFPLPKFDESLEYGPVRLRNIPDIEAAKERRRGSRLAATEVLLQETLQEENQFATSGKGDGNMAIAITERHTEDVTTPAADSRFPSQTMSPCSHEEEMRGYVVSRDYPLIDRLHCTRSIEELVAQFEDRPQIESRVAALADMASTVSFRSDEELLRMFTAISAPFSVDGRGLNFLTVKVSKFGRPYYVPNSLLPAYVNLVDATTIALVREQPWRLSASPALFIQVLQFMALIKVFEPNKWFTFSDHAPSNRADYRHAIGVNHSTAFWGTGEELYDFMVELLRVEDDGRIPTMLDLCTREQMVDLLSGFCGVMPCGKAVGDVFKTITDAFLRRVRNDISGPWSAHDWAIVERMYLVTVLCDAGNNEILQLLLSDTASPRGPDFFAAVSRTKDTPTKKRALCLLQEAIDNASAKADKVTLLGLLESGSEFLLSLVDKGVAHTFATQNLFDYRILNSFLHCSLVADRLRVEQSVITSLIPSSLRDVQVQMLMSNERNALNPLTSSLPGNSGAIATAPKLKRPLMTMLSQLEYLNSIDSVFILHSSLMATSTDQLVSAVRRLPSGKDSLIVTMSCLRALSVKSLTSPSMKERIACARALEIVSYELEKGRAVLLPFSEEILLHDAGAYCDEDLMLWTVAAFLARELPLVKVHTLMHSNCTARTPYRFLKGGHNLLVSSRSLYDKGAPLLSSLHSKELRLVTHNVRLRTPVRDRKCTLQYYNPIRARFVYRRDKPLFDKYHVTARNLAPGFSRGALKHDWRALGVYTPDHPQVPYHPLQTWMLG 798 T 0.045 DUF5642 pdb F Eukaryota T 7aoi 29 CA BB A0A3L6KX69_9TRYP mL68,mL68,mL68 KAWFEPYTPKKFDMEHQRISHNFYNLETKLIWTAFDTPELIGILLHDETIKGAPHLYDAEFLESAVHWTRESRYWRCIGITKPFYNKTTLRAQCWHDRGLQVGTLVFSQAMRDALMDLERAVRRKELGLEPNYVWDRWGPVGFIDGARTDHLPRFAHNPYVDPDGVEVTEVDIAPFNTHEQIKERYGAFIDPDLRPFAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAALAAAAEKVEALRLMENTRHDAARVRTFYEEKCGFSDFMRTPDKVITAAVLCYLQELQRICTETDWGKPLARCLTDLERVNVMGKDAFLVYRHIEDAILDKKRRVWATRFA 371 T 1.7 DUF2339 pdb F Eukaryota T 7aoi 30 DA BD A0A3L6L6S3_9TRYP mL70 PICPGLCGELAAVPFRVFLGTLPTLAVEERFLRQLQPVFAWYSSRKRVKEQANEFIEIDLASCDAELLLRYSHIYYVRRQLFDELIERQMTLLDSGKAPKMAEPSLLQCLAGCNMTIADRLQLEIRQLGAAKRAASVPGRRELDPVARLEVYDYACMMRLVEEDAGAVGDAEMKARAYLPREVIESKLGHLTQLLLGSDARAALDKKDVKLLNRMIPPDYTRVGCVEKLRPFDVTAYFRFYGERINNVKVENYFKRALWGHVYRRFATTPSFLSGVSTYWARHSGLDASFTTTTMPQEVAVAVCDQQIQFPAIKFRAQYVYTSPETARQLWRTDAAVPLMRLFPLMGSRTAEDLAAGVLTDAFWMHLGLSEEENLLQDSLLLKVRRFVDEVGDMYETNIDSVLKRVDDNFKQVVPQL 417 T 19 Vps8 pdbhh F Eukaryota T 7aoi 31 EA BE C9ZSQ8_TRYB9 mL71 YQRGGWSPGSKHQKHMTLNPTLYLYRFPGPHGPGPYTMKYWWTLGCFPTGMEVPFRLHEFLSTYQQEHVPVEVEEWLRCYIKDPLSELVNASNDFFKAVEVYPEVESARGYKTLQPSIAPLLVPMKKFEEQLGVKISPVGLRSVLSNPVLKDRFLDDLFDYKSYVEKGGSTPHRRLARSRFEGSLSVLGECEKCLPEQHQVEISESLGTFIGATVSPAETTADDERSLILLLTTISEGCINAGNYSDAASVLADALMFCHDPDSQATTHANISFASLLNADFKGAEYNGREAALLQPQVKPTSTACARGYVGWAAAAAYQDDFEKAEAIVKDGLTLYVGNEHLEKLANKLQALREEQPSVYKQVPRSLRESRSHLPSQQSRGLLSGSGKGFSNEFDWVEFKNKLYPSKMDPRNNEMGSVFRRVGDLGSFISTSRSMER 438 T 1.7E-05 TPR_21 unphh F Eukaryota T 7aoi 33 GA BH A0A3L6KXN2_9TRYP mL74 PFKGLPKPKRQFGNEAAEVIWPYALLLERVVKVHPFTKSIYVYYAQRQSTARGKLAAEIARSFAREFLIPITFHNSQVYTEAEMLLEYSETPWVVLHSLDNGQKPRILPVAPVEGTPAHTAVEQLLAEVVQGCEALGASVADPVTATRVLNERPLQNQYVRVDYQWFGDTPDERASHLVRWEFEPEQIEPKIRHRTRHVLDWLNYDGNLPTHRAVHVNAMREKAR 225 T 0.012 L51_S25_CI-B8 pdbhh F Eukaryota T 7aoi 35 IA BJ A0A3L6KX00_9TRYP mL76 LEEETIRERVIYQVVEGVFRLSPTSADRRELRSVANIIDYVLTHVRAARPTDRERRQERPITSAALAVMQKCPIQPQLGFVHALPHDTRDALLQEWERMHHLDWQFGKAVYTPRSKENVRGNLTWLREDRHYDQRMKFMQEVESGEARAKHMKLIAEAAGN 161 T 0.17 Gluconate_2-dh3 pdbpercent F Eukaryota T 7aoi 38 LA BN C9ZQF0_TRYB9 mL80 WQSGVHTPHGVVYRGAKMKNWPEQRIPENFKFTEEQRFRTKAIPRDVGTIPRNFVLGVLYRHQPCEVGGLWEHCTNDPEIVLDSKRHLREVLKQAREEGFVTFERDAISNEWLCFLTRERYEEVQRIVTAKSEAVDTHSGLRGAAATETSTYAEKFREMNVEAKEAHARRLEEEVANTTRYLRRFQQREIDYLPYTDLNGKVNFMWWYETRDVQ 214 T 0.12 SHR3_chaperone pdb F Eukaryota T 7aoi 39 MA BO A0A3L6KS29_9TRYP mL81 YWPYNEDFVPEGAETSRFQSSGSPGTRRRVLQEYALSPLFGARVPCCVVGTLRTAKEIIVLKRDIQSLLCELMELPQGSVTLGPLQQLREMMLYRHGTPLSPRLGDERQLNMDAYARRPIAARTMMDVFLAEDMSLDEIVNFGRLSLGKLQIALNNLRKENAEKSSADCEDGSAVEPAQLLVDRMGISYCEIPSLDESDYVFAEVGGVAVTDEEAERVAQRWAERCE 227 T 0.081 MCR_gamma unppercent F Eukaryota T 7aoi 41 OA BR A0A3L6L538_9TRYP mL84 MAPELQLEYMPVLFTRTILGPQGGFAGEERLVKLEVARKYMEAGHAVTPTEELRRGLWCYNPDTDKYDCFIERNEEFLDFAARKRQWLDVYWRVNTGYLLFGRQSWGQGFLINCPLRKRDVAQKLWEQYKVRIDPRLIEFREKDRRTGIQELGHNWCWLYLPGAEELGINREVYDNKRVKVRIHVRKMNSMFALY 195 T 2.2 Ribosomal_L9_C pdbhh F Eukaryota T 7aoi 43 QA BT A0A3L6L8W0_9TRYP mL86 GFTMKYKKGTGLWDEDHVNDYKTNRYLSARATMRWYQEMERHQTRNSLNARRATQSHNNNRGLHHTGRGAFERELERRGVQVEKYPLTTTTGTMRVAELVILRRMELEKRAEEALAEQRGELQKKNPTPSEWYDESKGPLNPNFLRSMRSHYEVDIANLPDTPLIRG 167 T 2.7 DUF2663 pdbhh F Eukaryota T 7aoi 44 RA BU A0A3L6KX50_9TRYP mL87 ESRHLPVLPMTPRIVFEHANEKRIDTAKRMRRDRRRIEELKTLEFWGWYMKLQRVRGRWCREQGVSSRGVYGPAVDAAELWG 82 T 0.35 Crl pdbhh F Eukaryota T 7aoi 45 SA BW Q57WW5_TRYB2 mL89 SGAFGCGSYRSVVAGTQNVPRRMTFYPSAYELIQLHKAHREVIRHFYVRDKIFDNKFPGNALANGLFKFVPNRRENYHMRELMESIRRRSIWMHRIKQQREINAKVVENMEVKYGKKAAASMLCFTTPDSNAYFAPHRYQDVANSWPNYWQHPSVNHVVPKPRWRRHRELGGITRVEDPFAVQASDY 187 T 23 Babuvirus_MP pdbhh F Eukaryota T 7aoi 48 VA Ba D0A4T0_TRYB9 mL93 QGMFSHQLKRLLQKKSIHRYNWDPLPMYDPRKLVHASRHMDVETWREVPDPHWDERSYLVPDQMFYNIPVPPEYKDAYWWRELQARRVQCPVEWVSHRMYNKGDRQRYDFQDLAFRKKFEFSYEEVVKNAKDMRS 135 T 7.9 Pox_VP8_L4R unphh F Eukaryota T 7aoi 50 XA Bc A0A3L6L276_9TRYP mL95 DSFKEHYHRVHLPRRLALQRYARQQSLRNAAKGNVKAEEVPYKYNRWWVNEEHEFVHQYAFVEDPEVTKAKRETLPPVTRENIWKEPQQTFFLPFAPYVRVVDYPKDPDAKFLKPVNIPRWKDYMQRTKPVIPRTWY 137 T 1.4 DUF4653 pdbhh F Eukaryota T 7aoi 51 YA Bf A0A3L6L4A5_9TRYP mL98 LSVRTEDFFSKEAISHARRVSWAPHTTEKKQGAFAKLARSNFGDPLPSSFAQEPYFEEEIEAHRKHHRPDVYIYKYNVSPTHFSLR 86 T 0.053 DUF2975 unppssm F Eukaryota T 7aoi 52 ZA Bg A0A3L6LDF0_9TRYP mL99 GRFNMDEAAAALQLNPAYAAALYRPLNYTFHIRGQLYPAQKGRPSRPGSLAASQGRMFPLYQRNDRLDKELFRLNSRGLTTE 82 T 0.98 Tenui_NS4 unphh F Eukaryota T 7aoi 53 AB Bh A0A3L6L2V1_9TRYP mL100,mL100 ALFSCFRCGYMYEFAVSNSYCRKLTLRNDHCPRCDQLTLFRFMSVSGMVGNMPFKPIGVPGPSYATLWWRKTREGKEASAPLDAVCKSDRWX 92 T 0.0035 DUF1178 pdbhh F Eukaryota T 7aoi 54 BB UA UNK1 AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 46 T 7100 zf-CCHC pdbhh F F 7aoi 55 CB UB UNK2 AAAAAAAAA 9 T 160 FAD_oxidored pdbhh F F 7aoi 56 DB UC UNK3 AAAAAAAAAAAAAA 14 T 250 Campylo_MOMP pdbhh F F 7aoi 57 EB UD UNK4 AAAAAAAA 8 T 280 Androgen_recep pdbhh F F 7aoi 58 FB UE UNK5 AAAAAAAAAAAA 12 T 250 K_channel_TID pdbhh F F 7aoi 59 GB UF UNK6/mt-LAF15_2 AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 139 T 16000 zf_CCCH_4 pdbhh F F 7aoi 60 HB UG UNK7 AAAAAAAAAAAAAAAAAAA 19 T 410 Adeno_PIX pdbhh F F 7aoi 61 IB UH UNK8 AAAAAAAAAAAAAAAAA 17 T 260 Adeno_PIX pdbhh F F 7aoi 62 JB UI UNK9 AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 39 T 4300 Chorion_S16 pdbhh F F 7aoi 63 KB UJ UNK10 AAAAAAAAAAAAAAAAAAAAAAAAA 25 T 790 DUF4699 pdbhh F F 7aoi 64 LB UK UNK11 AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 47 T 7500 SEC-C pdbhh F F 7aoi 65 MB XA Q387S8_TRYB2 mt-LAF7,mt-LAF7 NIKGGVGSFLMRRAAPKSIRQKYQTGPQFYKRKFFQFQKGHHRLHRRISGVQTGSPTHQREYERFHHLPGDVRTRPQFDFTFGETRADRVMFAWRKRGDLQLYQMSGRGETFVCYRCGYPVRSQLVAVKADNWDYRMCYRCYTNTVHRGMENDTX 155 T 0.023 Ring_hydroxyl_B pdbpercent F Eukaryota T 7aoi 69 RB XF A0A3L6LB16_9TRYP mt-LAF15_1 QSYLNAVVVSNQVLRAADDVLIALSIGEMEAVRQTHGNLIDCVAALDASLQQTTENEEGGGGNGATQEVDCLSTWPLFTTIQFLVEEGGLPLGPFPRMSRAYYRLKESTPVVAHSQLVWRTFELSRGPEGPTGELPAWPHRGFLRDIQRQIAEYTTDPPERIMAGVTGEKGPLRARVSGARLGLQRTPARIPWTMQGL 198 T 0.42 Hemerythrin unppssm F Eukaryota T 7aoi 71 TB XH mt-LAF8 KSVFDIARSHVTFPVSRDGTALRRVLKDWLDYTECQSLQAKPAFPAELCITVHPSYTKRVRLLLWSAVLPWEVQRGLSMSVLIIPGMLFVMSPESAAQGLCFWSGAIRQPIDIAFIAPVEPPAADTPSFSELRQRRLQLSLEGYDISRFFPDGELESPTVTFAVQSHSYLDPFPDCEQRDQQVGCGSSRERNKGIEDSRRYTATPEGVGRGNENVRYVLETRRNLLRDSIRSALRECRCTHGGVVWASNTGCGADGNPTGTGECDVEVTISLTLSDELKEDLREKARLYTNYVVPLEGHVRRHIKCLSGISPHPKGDATDGSDALQQKGTEAVWGEEGCVCTNGSAPFVAPPPIIKPPLPVKVGTLASTRPRSPMLADEAEGRPTRLAPSVFGRHDAPALQRAQQECNQLISASALARIPNTSPRAPEIPPIDYEIFDLCLRLGLCQSEAIYYFYGRIMREWSKELRRLRAAKSHGEGGVNDGNVMVLREEDVHRMLRLVHDPSLQVPPELSACVEAVASLRKITNEVGVPVV 533 T 0.0043 DUF192 pdbpssm F T 7aoi 72 UB XI Q57U79_TRYB2 mt-LAF14 LKLSPDRTRNEEIQDRQNAFVWSDEHIFRPHQHFTHDPCSWSRSLEQSMKKQRKLSMVERLRSLEQRQLEEKQSASATAGGSSKCANHMDGEKAEGPRFYGAVGDSEDLKEYVANEDYFYTMQQEEKPNDPPLQELVDEVQSLHVLLSSPRYEDTPLATVERLQCAYSEALRCVFDRVRNASVGKTMSCNALLFSWSLLLQGLPALLESLAEKRTEECLVRALSTVHEALNIVLQEFNRITHSKERVELLPLEGWIESLDVVTHPLTNKDYTSLKGNIRLPESSFKPQCKLDSATVEFVHSRAIQAAAIRMIENDQSDVETEPLDPYHLYILLRCMVRLAEKGVNDSHIHRAALLTGMVGERIFSSLERTVAPPRRYSLRHALLGKQLRDASKPHAIPLDVCAPPGGVKKPPTAADDVLLLTRACTLLMNVATNVLPQTKFKVLETVDTVLKTLSYAPNYDLSTADTVIFSNMVLEELHHVDEASATDRHLRVLLLLSRLRLSMCADRSALSHLLSCLCNLLPPHSIQQDKLREWKRLRGLVMRHLLYSVRGEEVEQHYTRVLKSSETWVEHLAFGQYSGGLPLSLWLEACHIYLTAGRKLTVSCAEALITLRGRCKDGGVLRSSNSAGVCPLDFVSVTLLAQLLEVVSHGCCSADDLVASPVAWDKVRQTIQGAIGEDENTIQLLRAGRLCVADRQATGSLV 703 T 1.1 DUF4048 unppssm F Eukaryota T 7aoi 76 YB XN Q57YY3_TRYB2 mt-LAF12 CSPFLSSLLSPVETVPLHDVTRTYSTMDVVDPPARYNPMVPNVEPSSSSAGHMEQMLENEEEEGPVACAHKNGKLWGVFEGSEDNKPPAWFYRLCKDLFYRTNSEDNMDDAALVSDIEPSHYISSTENLHIDGCDTTQRSAEAGTDVRDGVDPYVWIPFNLLDEADYHVGPYRFPSTATYTHEQRTLLCLGDTRREYVHFCDSYAFPGRAQIPTSVGTCPSKLYVNPKQQQPVVYIQLSNDIPPAMWLPVKGTAASVRRVLAEFASMAALHRDWHHDEFMERHATAVRMLELQRLPAGEGDILRYMAYDARNAQFAFAPIREFPNQQEFFLGEHDDPEKLMEHVDLCPLLFAIPHMRTVVDLHAEHMIPTIAGPGVATSLYRCIYSKALLFVQVHLSSEVKLPPQDPEAFKFMWKDSQVLPKMRIPVFVRVVWPTNERMSGGGGLLRRFNRLFGTEFASDIPVDAAMALLYVMQWSGHIKDFLGVRGMRQRLADLLLASQQPEPTKLYPGTREIPNPEYTVAERLGMHVQYLAQLHDPDISLTIQRLLPVASAPVRMGCAKAALIAGDRELFRHIVSSEPPGRMQTYMTKLVRKRKTRDLVDAEPRLLEDQYEFAAPLWT 620 T 11 RNF152_C unphh F Eukaryota T 7aoi 81 DC XS Q4GZ80_TRYB2 mL101 KEYRLTVPYRSEVTMLRLANHKAINSNIRELFKKPLVMNNIKAIPRDLGEIPRDYVLRLLFFHQPIRLVDLWTICKEHDDVPLDSAKHLRLVLKIAKLQRWVYAEKNQTNNLYYYYVHQSRIQEVQQMVRASEVRKKEQESVREIEAEKLRMEEQERRKVALDENIVALQNALVSNIAQIQE 182 T 0.00067 DUF2514 pdbpercent F Eukaryota T 7aoi 82 EC XT A0A3L6KWY9_9TRYP mt-LAF19 LVKRHKITNNQMLLMRRREPYKPTMKDRQEIADRAKLEEFERKNADGLMFVPEKALPPWQKSLAHNAKALGSRINFRGFRVRVADGQDEPGFPTPFR 97 T 0.44 Ribosomal_S6e pdbpercent F Eukaryota T 7aor 2 B s Q4CTU7_TRYCC mS33 MALRGVPIRLGVHRVGYTHPSTLPVPCAQRWDLRLARARIFQEYIEEKAPGAWQLEDERSMSPEFKTFTGYPMREMRPGYGQNLPDFIMKKRLPNNTHYELFARRDIPNEDNAMYGKYLYDMTVHGTSLPSTYRMHKDINKAQRNDRKLSGNRFRVLCSSGAKKPPSGWEPIPDATEEEE 180 T 0.97 Nmad3 pdbhh F Eukaryota T 7aor 3 C r Q4DW24_TRYCC mS29 MRSRLVGKSLASLYVRPPVTCYTDACEAPVAMWNGAIPLKEVRVMQKGVPVRYVTKLYSHPLEASPTRLSFNDINSMYCVGNDELMQFFPEGLGGKVMQLMPPGHPRGFLYRKEAHLLNLFIDKIQHWQAKRNVLSSLTNNRPGFIIDGPKGCGKSALMCQVVHYARSRNLLTLYVPNAKEWTHGEWCWPSTILPGFFDAPDAARFFLRYFAKANRSTLLSWRLKCTPNDLPVEQGERQPQNLYELCEWGHQVVAPASIDRQSVCVKFLMDELSAEKKLPIVIVVDGWNLFSHDTHFRYPHPDFLRTLASLNDDSTDIDLYPQELPRIPASRLGFVRGLNKMILSKDEPNKFFFTCTTRDFKPFDGISGFPDVETDRFTNSLDEYAPYDAEKDSLFHPIQLGNFDEYEFRAFTRFLVNSGELAGLGWGPLWHFSSDFERKLYKIGFLSNRNPQGVIDHYHQELVWRYEYQRTRQKQYLLHRNMELVVSKRKNRHAEPKGG 500 T 3.3E-10 DAP3 pdbpercent F Eukaryota T 7aor 4 D n Q4D583_TRYCC uS19 MLRRCCPATLHVAPSTAMVAGVFVNNQKRFLKMAKSAFGFYLARRGQRKFPFLRRPHIKNTHAMNLSAPYFWSFMTAKSQTYFLPEENYITGDWTGKFFVSKLQVYTLQHATSGSTVRVKSFPSVFELSSPSRWNIGKELNTLTKPRMDLIDEQMLTKKQRLDYVKAGLLPK 172 T 4.3 LIN52 pdbhh F Eukaryota T 7aor 5 E h Q4DRG2_TRYCC uS14m MLRLSVGFLMRHIGQDVPKRHTHFVLESRLMYEKSFRDSWLHSVCRAVSQIDEPLSKTISGTRQKMLQRKVTCFQYNQYGLFKVPYYRLANVDRYHAVQGVPGTREWVPYANVSYWTMNKMVRSGNLLVHRVHYTGWGTDPHLKKGGWEHRWNKVMQRNALQYSRI 166 T 12 Phage_Cox pdbhh F Eukaryota T 7aor 7 G az Q4DX04_TRYCC mS72 MFGTTRVWRNTFLTKSVATPPISVIRTGPKWWADPERMVRQKLMYFTLGVDQLPLRRTAVIQKDLHRFHMCKPPPRIGDTTGYKRSRAAQLTTWYRRIQYQEYHLQHLFTRHVWGLVRAYPGNTTKIQGKADDGYVGYDSVPYHRYNRTPLPFPAREIYGRRE 163 T 41 MBDa pdbhh F Eukaryota T 7aor 8 H ay Q4DX19_TRYCC mS71 MYCYYLHTSPASNENDVCHSASAKSNFIFPSFSSLVGFIVFFLWKRQMPLFHRLFVSGADLRGCHTALSSTFTQRRYWAKPKKRPKVGQGFHEKAQKWRDEFLLDRHRILADSLRAYVEFSASKRTEPWDTRFRPFDRVEKDGVYVLMRHLMEDKFQLCNYHHRPVKRLFCNVGLLGPQVTTKARWKPYRYATNPANTSKAERIFQKDKTLYTHGHND 218 T 0.3 Tox-MPTase5 pdbpercent F Eukaryota T 7aor 9 I ax Q4DLH4_TRYCC mS70 MRRTIAALTATPERFSILGTTHPKPKRTGFGRNNKMRSKPSDNVAWYDKGPVEWLPRPVRLTYDHLDQLRDWMMRETLDGKTEEFNRIRDMHREWSQHPLMPVLGDVEPKFPLNLFKQNHRAKRRFLVRWHKANTPANWLWMPRGPTVVTPLHHTNPSQYPESWRQMVRKKK 172 T 12 THDPS_M pdbhh F Eukaryota T 7aor 10 J aw Q4D0Q8_TRYCC mS69 MQRAGCGIVRPGRGCHTTPLYCSLATISTGVFDHLPFQHRRQHAFNTLPLHDANHFGGRTAYLREIGPVNIKKSGRRFKKDLRTVQFNVDIWCAQQTLRKRWKQRDWEVIEVPFRLAPAEQQRVIPEMYTDVPPMTDPERHDFSNIRNKVYDREELQGVLFGASGPLPYPPLQRIDRQAMTLDKFL 186 T 0.076 DUF4993 pdbpercent F Eukaryota T 7aor 12 L ak Q4DTE7_TRYCC mS58 MSFRYTNGLVGALKHRMMLESSHRELVRRRFTGHCRGVEVVCSGYGTVLAVRLVDKTVWEPFYRKGHPSPSGADMDSAPPSSHAEGTPVGGQSAAPLDFERIAESIKAALWDATRKIRSAKEAALNRSLSHNQQLRAQAHLEHWYDEDANTLQPLAFEALKHEAATPWMQFVQFGKYKHAAAVMHSESGGKTSEAFTGETKRAGADNEEGPCVTALDEKDVDPTSIPIGSVHPLFLPALIQFESRVDNSLNDDAIRQEQRREMSRDEQLFWERVELIRKGQVATIKGGHKRDYADEAAVASDNAVDKVQLRFTQ 314 T 0.00033 YbaB_DNA_bd pdbhh F Eukaryota T 7aor 13 M aj Q4DA51_TRYCC mS57 MLRRTSWRAVGYTPVNPDTSPMLAYSQYHWHYNLPQGMERPHGVNRTMTAPYQSAHSLVNKYRGVWIELDMHPAFRVALEPQLRKLPQGRTIPKTSVDEVISDYINTAHLIQDEMTRDLWLAKVLQHCAFQRSNEGMALWEKYCHSRFIADGATATPPLPLVKAILFYCSKIDYQGWSSIFQKCLKNDWNYTPLFDTAQWNFLLKSVGRMGDEKGVRLILEEMLDVQADLDRVEARSIVIALNAVTDNDIYEYIKKYLFNFGERKVKFLRIIYSDLRGHGAGKLRIPLKENDKMFYHVCWHSSIRAPRQFSPRQLYFDYTPSTLGASVHNPNAKIDDIVKDKIEKWKTEGLLPEDYVHEDRVYDRGTAFKNVARQEKWKKMPRIVKSKKMGYTGDP 396 T 1.4 RPM2 pdbhh F Eukaryota T 7aor 14 N ag Q4DWR5_TRYCC mS55 MLSQNVAKTTVPSYYMIRTNLPQRKPQNQWEGVYYFGGITKRQRHLILLQRKREREARMRAFSASCSNLLRLLEGDTQEQQQAKTQTIQLSSPHGPFDLAIRLAQHGLYQQASRIVDELHQQRALRMSHYGLLIDALSAPCLGQRILYGSAQCDPALTYKLLGDENGEERAQEAHRWFDMAFALLTTECRMSGSEHRLPQATAAATHLVNALMRALLTCGYTHVSAVPDAVYDRMGLMGISPTISTYELVMLALSLQGNMKEAESVFSFLRRHHNEHVTIGSFNALLLGHRECRQFDRCDAIWQELVDRRWPRASTLTAELYLRSIVDHSYTPTSGPLQRFGNINVVEKKKIPLVLAQMDDLGIPRAHLSRPLMDEVEDALRKFHIYKSRYYEWGRAVKQFNFIEFRRRNGWMYDLHLMKNTTKQVGPLRDFNQPDATQAPVATVEIPAFFNERPAWEQPPLEETLYVTESRERYDDVRSGDIYEDRTRSLHDRSPTWMNEVPETRYDHLYGVNHPDIAKIGIRRHLNAEYVNRKEVVERDAALMKKNLSTGRRLRRKVESSRTHRNAGSMSGAAPASASR 581 T 0.025 PPR_1 pdbpssm F Eukaryota T 7aor 16 P ae Q4D651_TRYCC mS53 MSVTGVFSKGRGIGHAAVTSILRYIPRARVPWQPSRFGRENLSASDLAVLWSRGRYRDGPGNYNSGYHTEKTHVLEDNTVTMIPKHELEKYMPDISIGPKALVTPVSLMSARNGHRVTHDLLHSYDPHIGRLDKPAVVDHDNITVEDPNRVGLNAATLDCRGRIYRWLRRGPFFQEDHYFRRSLRLNRDGTVPTAAHEAPLMRKIVRLAQRGHLKAACEEYRRVTTVPPVEVYRALTACCIPGGLIADAVAIFEDGNSKLFYVARDGEVLHNVMRCAIKAKNRVRVMWVYNVMRGRYYENVIVRAEIDPIWRYRIALLALEYFLDHNCAEEAGTVYSYLVEEDLLQCDVHLRVGLHMREALSKGKSVGLSDEVLRATSLVTDVATVAPEVARELYQRHVEALRENEKSNDCDMNTKNDGATGCAWSAHGLLTALDFTQKDDALPWMQQNFGDVDVASVLRWARFYHSKDLMAKDRPRYLARAVAWIELLSKRSHMMEEAPLTYMRKSKPLSLNTNSNLRVAWQTPVARPDGPPRLLAREEGYTFHHNEHSRFVTETYRHPGETLQSRFLAMQPIHTEVSAKEDFQEIYAQQQEQRALPSGVVSPTARILHHSVTSEIHGGGGGQHHRSMSRAESSTGGAVKKEKLNLVSPHVAAEKRGGSGTGNIGGTAAGGGVTPEF 678 T 0.0011 PPR_long pdbhh F Eukaryota T 7aor 17 Q l Q4E2R4_TRYCC mS52 MRRRTVISFGYVRVAASAVGVIGMGANHRHHYQQQQRPITTRGQFNPVHDFTYAMERGVRARDEKTFEKLITNPGPLRIAYSPDYLDWLYRCYKAKGKYMDARAAAEKKFNGNIISGGAVTTSSGEMIHPLPGAPPPGMFLRPPNSFRRLSGEMKRKHAQETLDEVSKAQGMLDLFERQPQFPAIHIDRCTRFHLVELFKEMVLERSLEAVAIWDKALLYRAILSERKASYPASFRYIFKAVEDTVFAHSSVNCPSLEAYYYFLYLVKKYYIDNAVEAHVVLRCHREPNATDLLFSNPPPKDEVDVRNAIEALQSAEATSHAASSATSGTAKHDDPAATDEKHTRGDNDQKDSANANAKHPCQFAPPSSYPPIEALWRCEENVPLLEILLFGEFNLIVSENPFVKFPTAHAFLTRPYSTESSKGPIDGASLANVIAEKRGHLLPSFPMNVASAIDGRAQELRRLQQKHHRDDTVSFQTLLRSTHVDDNPSTFSSYSDWSYFNPRAVRAEERDRLTRKGIDALKEYDSATEDIYRRSFEDAQASNFQRVTEAWNTFPPYLPTLPHFVSIIKKDSHISFLLHVGLPERCSSAEAAAKHKEFERRIYQLARALYHTALEFHKETVRRVNRQKVNVAASLLDNFFEQEWVAMLRESESLENSLEQGAWPDKKTDMARRLGRYIPFARRSLDENGFPTDARADDYARWMEAPARMKGAA 714 T 0.16 CM_2 pdbpercent F Eukaryota T 7aor 18 R ac Q4CW80_TRYCC mS50 MMRARRVVVALSPLAQLCVHVQWRLYTPIWQPDPAVDHVAPLRESDENRTLWASSAPIANVSDAIAAWIRFGNDPVLHTALPVIHAGQNERTRTDGSSASLSLSSLPSPSSTSPFATVEDYMGTNMVFGSPEHVKDSAAVWASYFERRYLSQLRHSRRTAANHVGLVNAPDVFTDEADRPETKWSQDTRFRERAYMAEKFLKEKVANLQQLEQALKQAKPAEYIAFHDALQQQTLTLIPLPSPSVWHYGGARRTQWAERFLPLSHEAQQFFTTVLAEDLKRAGDAPEKVLQKVAAVFAEVGKILLQRHRRCLGGREWSALAPHEKDEFCMKEVERWKQQVEVGEFDPPLDGDDDPTSTEWQSEHDAIMQLMTATIDGLSFSALEFWTHTIRCEEMETEHIHTEKRVRAISAAARRAMYDTTSYEAVLQGIVDAVAKGQLDMKAAGFKPHMNDIWCQLNYAKFGASTVTQHTTTARRQLNYFHAGLLKEVAATAALYYATKPLSSSLDYASPYKFRRSLVGLFSTYGVEMVYAVQRPLLFSAANLAKAEDLIRGVVKNVARPFGERRRAKLKQLRANHRRLATPVQGVVVSAVVSDLLESGADVSEAKKAEKMQESVTFWPLGARRVVSYDWPTPHFDALKRRVAAAGSAVTAQSTKEIQEIKRNAFVEVSLWRRVTAEETKQRRDAVEEETRRVADVVRTIPPLAQVQQYATSLYQRIEDAAPFPAATDNNAKSEQEDDESSWEFVVMLDDRVVLNANQAAELYLPYTDASGVPIPQGECRVRVRGFDVDVNPTLNPAFCSEAFSTPFQVFDAIPQLVQQFFGTAKPSVAEVSDIPSSKFIQFCAFLREAGLDVPVQCEFEAGQVLNAEGDVFMEYFLNLLRSDRFHRSCAQAGLTEMQRVIESSCRAHWEVHHPGANEAEWAEARRRVLDRAMEKEREWWFPNEMLDVTNMSPGSNHGLRLPMYPATVRYGRELCTLLAAEGQFDNNSGLSATCAVNGTGAAESITFSTGDHISSTFSMEEALAVAKGALRNAHDRQNTLAAFRLGPLSKHSQVLLFCGINATEFGGKYARTYTYAFEKAKKELAETFVSGRVVPGVDEDELLRVSDKEGVDRFASSTHPEQRKTQFVPRVGPGGAPIEDPTADQKTQWGR 1152 T 0.31 MIase pdbpercent F Eukaryota T 7aor 19 S ab Q4DNX8_TRYCC mS49 MIRRRLCFVSRPTKAASISVTLFSVQRQKGGLHTFIRDARSSSFTTPRQASHAEGEHTSSSLNSTDWATQMQRELFGETDPLGGQAHKDYYRDPARGYSPQYAPRNFAEGGAISYHHAQSPMEYAEATHRRSWLDHDVARMEAAFQEQRALLRGMESATERDELARRYAAEHHVADIVVENQSLLPSTQVHHSTSTSGSALRQQAVVDRFQIADQQSPLATSDGMGREELAHTYRMRSETVHNDWIEENLRIVHGLREKEKYDFTVLQRATRIPFQGYDMDRFLAQQKGTPYGAQSLPPNTASSTMEEAQRTLRDPTATVPSFEAISQKAFARNTVRDHPTTGEELTQEVVDTIRTSREASEWQREQERAQRFGLGRQGALVQDGGPDKRTLKKHVNDERIMDAMFFRSDAYRKTQTDEHWNPYMRQDTTHGVAHLLNNKFDIARREDRLSKGEQDLTERSVMHFGVPIQQTIDEFVFRHRNARGERPLDYFKPFPGFRDFRLNRMYRDVEGFSLMKQRPEFLEWELFTRYRAHHQQRRRIALLHGLEPVANETAQERDARREKLDEICERTPFDERELHTNDDEMQVSGETLRSWFGVYMLPSPTVVEAVVGASASVNLHLFPLADEMGTADTRENVLSSRYFNRLLLMEGFQNRISRAFMGNVSGKAPEPVVQYMQPPEVLRHFTAEERAMYEQYVKEQTSKQLGEWATAMRRRRWIPDRQQYGHVVAQGYGVSVVDLEHADTAAVLTVSAKAFERELAAAKGNTSHIIMVEGQAYKLRPDSERFVVPLSVRLESGEVLDMTDEAFGRYELELLPRNVNHALNYGIGDYAYNRGNYIETQDVIWEEQTASGEEGWSPATHADGLRAGLPVRARRHVGMNANGSRIVSSPQRAVIVAYDRQPFFNPEPRLVRVAFQSDGSVEEVPLANIMIWQRRYHGPERTVGDESRRFSPASLRRYIDVSDPFNEKKSKGEHFLDKYEAARTSEVAAGKYRTTKQITEIDQWTRFDVSRADNFRPLSISHRRDYIRLGYMHRYTPWEWIAVQEADQPLIAEQIRQDNIGTSYFFSLNRYWRYKARPHGYIRHFDNEVRDLFQFVDGVTPWKQAQKIRTYWEVRAHHPMPQFNRPEVAMHRNTVGLLPAHMWETDKKTGKVKAVKDSVRDYQTKTPLPKWVQL 1175 T 22 UL11 pdbhh F Eukaryota T 7aor 20 T ad Q4DV41_TRYCC mS51 MFLRTHVERHRSGKARAFVFRDPTLKMMRAGSGYQQLRRMGMPIQVSKGWRKVDHFHANNQYQHAWPLLSHDDLGNSDQSNNTRNIMYSMYLPKRNKGTAPWFRGADTYSVKYCEQGRYEYQRYLMINRFPSEYRKHFMNFLSNIRSSSGPATIPQEALHWLLRMIVDNFNPQHVHYIAAMKTLQNAGELDMARDVWKIMERQQTWPCTSTICAYLDVCVEAGEKTWAMEAWNRYCTELKFLQPGEVDPKPVSRVPFSLTREELLYLPKWKKHFDHDPNLDVVDLNRFNRTREVYLRMAQVMLAGGERDSFQHFYTKLEEAMLSTPTPVPEPPNPHLVRRPQWSPYEHCKSVHHSPWRVGNNGRAMALGPSLTTEDEMQSRFFSNDQFLVHMLKEILRIVLQEHRRRHPEACSRGEGEAFFDQVVDARETLNFCNELIERLFAVLGQKMHGLNTSSLLSVILELYRVMGKETGMALLRRANQFLERKAALEDGAKESLTAPNYLQVLMGFADESAYVYDSKRKGLCRYRSGFDPRTTMQQLAATVQEIAGNPHVTWAADMHLQVVCTMVGCGTMKANDYFVRNVLRQFCWDSRFLEALYMEYRRHDDVDMWAELTKRALVWTARYNVNASERLKRLIEDDYDTIQVHTRTFRELAVFQFRDVEEKRHSRDVVNELPNPWTDYVSHALPFPDRDAGYPDEYGDIGQWRAPGGPGSPVKGPGYYAPPMEGEHQRGYTAEWRDLKNPMRPPEFPTPWERKYKQYARGQHPSYDMVYAGPMPEIFPNRYDFRKPTRWDFHDIEKQGKYKTSGPY 810 T 0.0043 RPM2 pdbhh F Eukaryota T 7aor 21 U m Q4E4E0_TRYCC bS18m MNRMGGSVYANAMAQFAICRQPWNEYINLLTKQDSTPYHVEPQEKPAYRGRKRGREGWLFGQQVQLHYHRFPDEQLLTNLTRWRTGETVGDIALQQFRNAQPFDIEDKDPQGMQRPSPEVYMKLNYKNPATISRFLTRTGHMYPADILPLNPEAVAKLRVAKAQAVRIGLYPRFGNPFWFRSQKFRPKAYQENYDPTTYSTKHTMEHFAYNWVQTDRIRRYFKELEELQKNASNGARGGSATTAEQKQQNQFYAPENQPISMHRNNISYMAEVERSMKNPTVPGLMSTKGMKKKFHNLYSSTSTKRMGFSNPTLGIKKV 319 T 1.2E-05 Ribosomal_S18 pdbhh F Eukaryota T 7aor 23 W f Q4CT44_TRYCC uS11m MRQSLVLLRRGKPRPRAGMFPDKYRRVPALLKPQQGGQQFFNQFLIRFTNDRLMRRDVEDGEDKKESKIAAQLPQMDWEHMSARSSSDAIREEMHRLVEGDAVQHQRVFNERIWYEEEERRRLQTGADAAAPTEDAGGAKEHDIPPRVLGNDYFQSRFGYSLVKQSEMPQGVTDYNQLDMWGEMPKYTRDMVFLYLISRRRNTYAVAYTYEGKRILSTYTAGNRGLKGGDRGFRSDGSTDNGHQVTSMYLNDLLPKVRELRANEGRPIGRGEKIELVVRVMGFYNGRQGAVRAVQDRANEFHVRYFEDITPFPLNGPKMPRGVFK 325 T 0.00013 Ribosomal_S11 pdbpssm F Eukaryota T 7aor 26 Z ba Q4D7F8_TRYCC mS73 MITELRTKGATRTPAIRYSYPAPPPSKNAAPQQPRGTPRPRTAKTNVRKPKRSDARRRARAGSGFGVVSPVRARRRRVVGPPFADFFGALFPATRGIAFLSDLPTHIFSSFLLSFLAGRGVAMLRMCRRLAMKYADLELTTRGEFPHGMKEPGFVKKLDQNIPWYFSTYRSMYHWPITGDNWSDLNEAEKHHDLHMFYTLAWWKLGEGIFDANDEDN 217 T 19 Mastoparan_2 pdbhh F Eukaryota T 7aor 30 DA aq Q4E0X6_TRYCC mS63 MLHGSLPSLASLRYRRPYWMLFLKDVDNWKIYTVIQQPDHQRTEMLYQAWLGGLDRPYTRPKCMANQPLWLSKKRHILRKDRLDGPETPLEKYVLEWHKRFHSFQGTERPTVDDLHTALDLVERPLDLSYAFQLLNQCRNVNNIRFAKDTFLVFLEACLRVDRKDCALYATENAEALGFWHIEEDYRRYLRGEQSWYRLSPLDNMYYPLEENVKLNAGRSPPSSAAVAEGDAEGTAETTGFEAAAGAAAWSDDEGSMKAGSSRESMTVDDEIARLEAELAALEEEGTDGDNDVKGPKGH 299 T 0.087 ABC_tran_CTD pdbpercent F Eukaryota T 7aor 31 EA ap Q4D014_TRYCC mS62 MRRFCFPVATFAQAALRHRGIRWNTTMADNESHTGAKSSASSSTPSEANEISAMERASEARREIHDLWMSTEKMLDLENRVRSVASLIEKYKLDPSTPRENDVSRGLGDAFDRLLLLCVPLGKDSSKGTDDLERLMNLAGRNGREISVRTIQHLFARTDSFSEALAVFYAMRRCHVAMNMEAYYAMLYSLQRLEEEGWAQRFREECEEKGGVSEQAMDFVVKGINNALLPENKPWLGRVMFGDRDAPAQRREARDYDELSAMWTERYRDGSAFPTSP 277 T 0.095 ECSIT pdbhh F Eukaryota T 7aor 32 FA ao Q4D7Y5_TRYCC mS61 MLRSTRPWRFRMKGGEMFVEYKIMSRDHRRSIRVEDAIVDPSVARTVVPLSWLEQLRSPSLRLHTGYHMEEAVYVPPAYAAVDEKEGRRLSEKSTMTPNAILAGPVVLSITGQSVPVVLNPYFVPDDTWGIRRNRDEWDLRLGMDAIEQCTLFSELRPGGLLYNKLPSSQNVTRHEPVRATLQRYGMKCGLAESPLVPRPWTRMRYMFIDELQRGPKLTEFVGHNPRNGTQWRFSQHSKYFRIGVWRETIRRNDMNEGLHGHSSWQKSPQQAVPEVRLMAPYP 283 T 6.2 AAA_11 pdbhh F Eukaryota T 7aor 33 GA ai Q4D4C7_TRYCC mS56 MRSIRGVCCFSSLYTTQFRVHATYDVAPLSHKELFSIYQNWDKTRDELDLLEEVEERISKWKLNKWEMRIPPLLTAREKELMRQQQELLKSIFFDWGKCRDALNKDLELISSITGLPKGTVREKNRAWLQEEAAKLRWVGEVSKATRLRDAFLRLEVYGSRDHRLLERLCCIYGLGLQGSFESAFSNYIVEDPITKKIYVDEKNSFRDLLAYIIHTYPQIDIIYDFLGFNFIGGYRSSLRRYLECMVSRSTEGEKIPGRLVFGRGKPAEILFDFGNSNESLVSGECTQGFPDFVFVKGSDMTLIIIASENSWLRNRQLPHRKQMEGIARRASFVLGIPFSEVRVRNLLLPPTYLDKDSIVRINEAVLGLSKEEQRNLAPWLEMYQKELDSKDVDFCSLMKSTNEEEWLTL 410 T 2.3 RB_B pdbhh F Eukaryota T 7aor 39 MA v Q4DRC8_TRYCC mS37 MKSSDIFHAYRYTPVFLKARQHDSGVNQYGLKPVNAYDFINPTNLVNFGRGTSFDNLGVRRAGRGEIDSSPSLGGSPVFTQAKLVGLSGEEQLTMCQSETMALRVCMARGGQDTCERESRALDACLSRVGHLRRAMSEACGEFNDWFIQNVSDNHTKPFQHRPHDWRHFYAQEKLVRERQQNGHAYGRRPKQFSFGARYVKTEGYGKRPRLPYNK 215 T 0.11 Gypsy pdbpercent F Eukaryota T 7aor 40 NA bb Q4DMI0_TRYCC mS38 MRGCGVLCAGHKRAAVIATTTCSLGPPTLSPSLTLRPVACAATPVTIPIFPPPMRGSFIDRNPVWASFNEKHTAKSFRHRIVSSADVSLRPPQFYLSNEKVSAGEAAVVQKRAEATDSYGEQLDEISARWAAKFYGRVTFGPRNYPYPSSRWLARRFKMKKHRIIKRFRFRRYKLAAVANLPFAKMIRVGLLPELKSSKTKKGDDVEADLSGRLVSAVRSSGGKKKGNRGRPKSKYQI 238 T 22 DUF2996 pdbhh F Eukaryota T 7aor 42 PA al Q4DRU1_TRYCC mS59 MMRCCCVLQDKSMFAAKRRVIVPIHPTPNYPAHFIKASFTTDPLKEKQKARFSSGGEAMREVQMIPKNLEGERSRRELMSRGDTEFEALVEFIQGASYDQLISGRRFKKVYDKLSENDDTFVWLCHTAMSVLNPGDVRSRLVYNHLRTLAEAVANGEMTLRTAFRFYESAVRSPAYREIAKRQMEGGAATRLAGISAAADVMRRMGLTRRPMASYFELYQRIVERSEAMTPWGFPPLFQFEERLSLEPRLKFFSRASQQALERRRRGHIMSAYTTLQGRRIFWIPPTWNRAGRFLGPHVTLYPGMTPD 308 T 0.35 HLH pdb F Eukaryota T 7aor 43 QA q Q4DUA8_TRYCC mS26 MMRCSRGCRRQLRPYYNLPSKSDHGRKMTGFLTPYRHWMWKQNELWRNVHEAQFEHLRKVYKRQWLESFRVNADEYIYKYNITKAAQLAQWEYEMQAQEKKRLEAKQMTEGRQALKKKHLDLLREFHERQFFFWYERASERLQSMNLIQYIPQSRVQEHIERELDKYVAGKSEAYPLNFVGQMPLVEDREGNIVEVPEGLLTNHVSEHPESTVKPHQPHESTSVSVEEQLLRTMASAREESLEEWIDDSRALSETIDDISREEEQRDEDTRVARSMEETDNEREISRRMYIDRGKTGSKAIFRRPTLSETDGGSASIPVGGVAASPADTSAPMRRRKKGKLDKAHALQEQQDAMIARMSAKSLKDGESSISIVKRGEIATSRGRIRDKAAIPTQEVLMQKPELAAGSVPNARISFKDKVDQLYHRGKYKQKKEDDNNPNEDL 442 T 0.033 ThylakoidFormat pdbpercent F Eukaryota T 7aor 44 RA p Q4DRR8_TRYCC mS23 MRSTALRLYKMPKNMGVAPRFDVWNESYEPWQHMKRMGRLVGTGFYIPPEWYNHFRMFPPINHNFQQEKTLNPRNESEPTQDDTGSLSQERAALRDELARKSRLLASEGMRYYNIFWVRKPLDRMEKEYYELKRKGIAHSEAIKKVLEGFYKELAVKKRVAAVQAEEAKLSGRFITMREATVVLNVLAQLHREQLTPHQVTLLAREQHQATEKASGLTATVSRVSAPAAGIDADSTASKESGSDEALSADSLANMLEDDHASGAGTQYQVEVKHSARDSVRQLHEKSTDDTGSPDWYTGASPVYNGAA 308 T 0.049 MRP-S25 pdbhh F Eukaryota T 7aor 45 SA Ca Q4E4S6_TRYCC mS22 MFRRGLVHRRYPFNKRGPRERKSWKHHVLTDPPKPIQWRDPKVWTKDLTTMKSFDAPQWDLWQSRARSEDIDEALQPFMDMPQSLKDRRYDIPWWANPFGAWYLQNILSVELLKLPSRTNAEKVAIYRNQKHSLSSKKKGEAAQDDEILANIIKERWRTLEFGDRDAGYPCTFSDYIQFLNEWFKSLDEEGMQRLREHFDRRIRPLLAVMSPVDILWLEALTQNSPHNKEQLQRRIAFQTSLGTPEFFDMSKRLRYEINEDYKVRDELGPELFALWSKAPERWPPERLSKMYGLDFTLVRKILVWHHFKACYDACVEPDWTLPKRLFALEWIRDVRARKHGLFYGKMRFAEQKITFYSDRFLFRDLVNRREASYANVWEMDDPYRFLQTEQDYEDYWGDNYDVYRRMFPEMIGKSGEPVQQYGQMPVWTGPHRQHANKSQHNWMFAEIGVNVGHEALKKLELDPTNEKRRRFVIRQPDGTLRSAKMSEMRAWYWKEEWADFRFWAPNMEWGIENTPSQEQYQEHVPDTPDADFRKQRRIQSRPVKWFYESHYTRTGNFAGFQPLRFMQRRTEREVRWPDVINAAVQIQKRKPDAYIFKAIPEI 603 T 0.014 INCENP_N pdbpercent F Eukaryota T 7aor 46 TA g Q4DET1_TRYCC bS21m MLRITSACRGGYMMYHRKSMGTMKYSKWKGAHGGVSHFYGRTPMVEEVKRNEPITLIDRRIMHYVHRSRLRHFQLFRSYQQKSNATECKLREGEMLRRRWHRKLQKSFIAFMQFKTMKVLEDQAKLVNTYGQAAVNAALGDSWDAVSDKEKDRKYVTIRRQVKALPVVSVVPKHVATMKQIHNDRFNYRWRVN 193 T 0.072 HMG_box pdbpercent F Eukaryota T 7aor 47 UA j Q4D6Y0_TRYCC bS16m MLHFTSLFRARAIIKRRTPQLWGAPGAPIIRMRGHHVVWKFQSYDLFVEHTHKRRNSDARLLHYLGKHCPHPQKSLWSPDTPVAQDRHLFMLTTVDVDAFKYWFGVKRCRLSMRPWALLAKAGLLPPSLRQNSKIMPKPIFDKEQLMRYYLANRKEEATIEREDYLNYKNSLVKSEEERAAERPVAPYL 189 T 0.11 Ribosomal_S16 pdb F Eukaryota T 7aor 49 WA bd MAXICIRCLE UNASSIGNED READING FRAME 5 MTKKGATKILFIYKLSKLNVYNNESYKIKLLFNHLYCIDNYNSIYFNLNGILIWLNVLHINIILIKYAFLILLNNLEYLIIFKYNIISIK 90 T 150 Cytadhesin_P30 pdbhh F T 7aor 51 YA bc Q4D913_TRYCC mt-iF3 MKKMVFCKMCRPLLVFCATSCWRRSARLPLPFFFSPLNLQVAPFTSWNLQARYFSTTAGGGREGTNSSEDDYVFDPTLSVQKDAAIHVAKKSLDAIVRDLLPENAPDAATQKVRAYLQQHPMDTLITQPTVHITHVEDPESGRETKMSLSPCDLSEALEQAQEREMNLVQMGTRGDVAYCRIRREIPRILGLVGPELEALREEEKQEGSSHRGGSDQAGGKIRELVDHSFRDVVDAHFVGWKSKKIVEDIKKRHPVKITIKEFQSPEAAIGKIREMCQAMQRYAEEKLIYHHFTSIVANDREVSVSFVPSLPSEKGNSWKHIKYPGEKEWAHANKRMEEACRKSGRYGTYVKNNMLKPRSLGQTFFRVDKYGRKID 376 T 0.00073 mIF3 pdbhh F Eukaryota T 7aor 52 ZA aa Q4DVD2_TRYCC mS48 MYAHVVSWTLFLHVFFFFLSLSRTLFFFFFFFVCLFFPFLCVYLEEEMLRRLVSSHGCSNGGGTNGGRCVRPVKEEPPVFSSLVLQRRFSFKYATKLQHDEMRQPFYIHEKRHGIFSNEKNIRKSRRGLPFITPLYTRHMNLWETDTDASKNRFFRGYVFGQRELHQLLGRPHGFEANNTDGSNDISAYEMTTDQRYKGIPRPAITNLHYEPEWNYTLYRAGTHGSQLSNPRSPLTAEVLGDELMKIRDIKSFDHCKAWFDRLQYLIKLHYDAVGDIGEFKSRHTQHVHEFFVAFHDALSSFDFGDSYLFEQFHAARPSELTDLFGIFLEMEANYVHEDYCPRCSLPYSTTRYCGEGDANTPFRKHRGRWAPHQRWGREWYAVVARRAEALWYRATEDPYFGTPQHTQRQAEALLRVYVQTKQRGKAIDFMNALRGSKEFLLGSICITPEMQESYDRLLDTTPHPHLLTNGFTLESNAAKYTGEVQKVPFSPLQFRIDMEMNKYRRQQKEEGAVRVPPAMWRIDTSAIVPYKVDPKTKRVINWREVKEGIEKSFLSTGLPKEAYTGSEWREMLHLKSIIAGRAAKAAELERRLHTDKVKLLEVSSKSKIASETNTSTGSSNSSSIIFPDKDGYHVFDTSVASLRPFGVSQSGAVFQSVTHTYPSPHAVLYNDPVHGKQFILDTTNESCHLFGGFEHGDRLLIRARKIDKNDNNPNSVLAVKGNEFEVIVVGVNKEGSDSEWQLCAMHVDPALQREYGLVFLGTDCVDIHERWASVRYAAAAPHVKGRVTLLEERRTAREESLGQIVGVRDGVLFVQWRLLRGGGSEMDRSVAEPIGTVEQVRNAYQITEQGVEELMHPPSWRTPFRNDFAEERLEELRQAPFKRENWVSLIQGRYTPKLKRFGYTQHTTMDDFETKEYKDRLLSKQFFHNPQAFEVIPDRRDRAVTFGGKWEYQRTHGLPTVDRNELENGWSEVEAVTDAEMHVIEQALRDISGRRPGNFIKSPTKKNTLQLNESWWEPLEFGWEQHNKEQKALVDPTEQRLIDSASLPFGGKIPPFGTTIGIGERIREIAEDYAKGFGLGPHGHSPSHDTCQYNTLNAEEDRVRELGYKDALVRLFDEKMADKDVHQWAVEQCADGEADVRQLLLSLHEWRERGRPPSLMLLQVLSKYLEQEIAAFNEGVPSSVPKLSLQTVDGTLSPSGNSGERSGTIWADVEPTAYALQYASQANHSSLDEPFILQLLKSAQLGGRNAQFTDPFYNAYLENSVVSEFQLGLAALAGKGVSPSLLAQKISQLHRGSVRLSGNVIPFVKSRELAHLLERMGLSSENIAVVTRGLANCPEQESVGDDFAVPVSVILSWGGPGSGSSTNAADRKGNATQSRNELQRKGSAALSSAIRQLGQKRSSSKNWQNEDKMMVHVIEELALRDDGLVMDIQYIVRENRRNPVLRHEFFAALLPVFAGKHEKVAQLYDEYCEGKYVPNITLAIEAFIAFLCNVTKHADVYPGSSYFDVDTTNGPNAGQYISLKLLDPLDGPFIFDNIKAEHIETVERFKQHGIQVGPVRAPATGFIAANSKSLSYFTRRPEEVVYVSTDADQGLRRSLERSAHYKTIAASPAMQFLLHTQNGAGLVATFNRFFYRTMPMLSFYQRILKHYSDNVQPLRQKAQNSVRGLARVLENERSAAMEEFRRNSERYWRNVLEGRSVEQAMGGSGGSGGGGGGTTPPVSSSPSQSSQEAMAADVARAIGSGRTGDQKGGAARQQQQQQQRTAGDDGGVRTTFASRKGGSRSMTDLLSKLNKPKGSNTSGTAKGPTKRGNPKSHTDGGRGAKP 1827 T 11 DUF5053 pdbhh F Eukaryota T 7aor 55 CB as Q4D4G1_TRYCC mS65 MFGRSALCLAKRFRYNTKYPSLVSYNKLPWEILNHETPEFHMHVAPHYEQIMTLAASTHVPHIVGKKHLEMPPEHRLRLLPGMFYMLDGDSIPEGFTANRVLDPTALQYYGRLESLVAPVQAVRMLISDDLRIVCNSVTLQGPLQLPVASYASLASLEVVTNKASASFTLFHFVRPNRPPSELQLEKYYIHAPRAMALAEFNSTSNTSWEPKLQAPKRSKRVTPLPAYRPPQSYLMGLAERLAVVPGSSFGRRSLMWGHWF 261 T 32 PELOTA_1 pdbhh F Eukaryota T 7apk 8 DA,O x,X THOC2 anchor (putative) XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 37 F F F 7aqc 21 V W nascent polyalanine AAAAAAA 7 T 270 DUF4179 pdbhh F F 7aqd 21 U W nascent polyalanine AAAAAAA 7 T 270 DUF4179 pdbhh F F 7aqq 17 Q u unknown XXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 30 F F F 7aqq 18 R v UMP2_ARATH Uncharacterized protein At2g27730, mitochondrial MATRNALRIVSRRFSSGKVLSEEERAAENVFIKKMEQEKLQKLARQGPGEQAAGSASEAKVAGATASASAESGPKVSEDKNRNYAVVAGVVAIVGSIGWYLKAGGKKQPEVQE 113 T 0.01 IATP pdbhh F Eukaryota T 7aqr 17 Q r Q9SD78_ARATH Furry MAKSVSTAASSLVQNLRRYIKKPWQITGPCAHPEYLEAVPKATEYRLRCPATIDEEAIVPSSDPETVYNIVYHGRDQRRNRPPIRRYVLTKDNVVQMMNEKKSFDVSDFPKVYLTTTVEEDLDTRGGGYEK 131 T 0.11 CI-B14_5a pdbpercent F Eukaryota T 7aqw 4 D c Q8VZT9_ARATH Transmembrane protein MGGGDHGHGAEGGDFRAKVWSMTGGPNCRPKHWRRNTAIAMFGVFLVCIPIAKLSAKLEQRPHMPVRPIPSQIWCKNFGTKDDYEKEH 88 T 1.8 Chordopox_A13L pdbhh F Eukaryota T 7aqw 9 I l NDUB8_ARATH NADH dehydrogenase [ubiquinone] 1 beta subcomplex subunit 8, mitochondrial MAGRLSGVASRIMGGNGVVARSVGSSLRQRAGMGLPVGKHIVPDKPLSVNDELMWDNGTAFPEPCIDRIADTVGKYEALAWLSGGLGFFVGLGLLAVLNDKASKVPFTPRVYPYDNLRVELGGEP 125 T 0.00028 NDUF_B8 pdbhh F Eukaryota T 7aqw 13 M p NDBAB_ARATH NADH dehydrogenase [ubiquinone] 1 beta subcomplex subunit 10-B MGRKKGLPEFEESAPDGFDPENPYKDPVAMVEMREHIVREKWIQIEKAKILREKVKWCYRVEGVNHYQKCRHLVQQYLDSTRGVGWGKDHRPISLHGPKPEAVEAE 106 T 0.00012 NDUFB10 pdb F Eukaryota T 7ar7 26 Z b Q9ZPY5_ARATH EXPRESSED PROTEIN,F11C10.23/F11C10.23,FIBER PVMEKLRMFVAQEPVVAASCLIGGVGLFLPAVVRPILDSL 40 T 0.0026 NADHdh_A3 pdb F Eukaryota T 7ar7 27 AA c Q8VZT9_ARATH Transmembrane protein GDFRAKVWSMTGGPNCRPKHWRRNTAIAMFGVFLVCIPIAKLSAKLEQRPHMPVRPIPSQIWCKNFGTKDDYEKEH 76 T 1.8 Chordopox_A13L unphh F Eukaryota T 7ar7 28 BA d Q94AL6_ARATH UNCHARACTERIZED PROTEIN AT4G20150 PISATMVGALLGLGTQMYSNALRKLPYMRHPWEHVVGMGLGAVFANQLVKWDVKLKEDLDVMLAKARAANERRYF 75 T 0.00096 NDUF_C2 unppercent F Eukaryota T 7ar7 35 IA l NDUB8_ARATH NADH dehydrogenase [ubiquinone] 1 beta subcomplex subunit 8, mitochondrial YEALAWLSGGLGFFVGLGLLAVLNDKASKVPFTPRVYPYDNLRVEL 46 T 0.00028 NDUF_B8 unphh F Eukaryota T 7ar7 36 JA m Q9SIQ8_ARATH AT2G31490,EXPRESSED PROTEIN,NEURONAL ACETYLCHOLINE RECEPTOR SUBUNIT ALPHA-5,UNCHARACTERIZED PROTEIN AT2G31490 METNKNKFIEDWGSARENLEHNFRWTRRNFALIGIFGIALPIIVYKGIVKDFHMQDEDAGRPHRKFL 67 T 0.006 NDUF_B4 pdbpercent F Eukaryota T 7ar7 41 OA r B14.5a PPIRRYVLTK 10 T 0.34 Gp45_2 pdbhh F T 7ar7 42 PA u unknown XXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 30 F F F 7ar7 43 QA v UMP2_ARATH Uncharacterized protein At2g27730, mitochondrial PKVSEDKNRNYAVVAGVVAIVGSIGWYLKA 30 T 0.051 Gram_pos_anchor pdb F Eukaryota T 7ar7 44 RA x GCAL2_ARATH ATCAL2,GAMMA CAL2 PKSQVTPSPDRVKWDYRGQRQIIPLGQWLPKVAVDAYVAPNVVLAGQVTVWDGSSVWNGAVLRGDLNKITVGFCSNVQERCVVHAAWSSPTGLPAQTLIDRYVTVGAYSLLRSCTIEPECIIGQHSILMEGSLVETRSILEAGSVLPPGRRIPSGELWGGNPARFIRTLTNEETLEIPKLAVAINHLSGDYFSEFLPYSTIYLEVEKFKKSLGI 214 T 0.00013 Hexapep pdbpercent F Eukaryota T 7ar8 28 BA c Q8VZT9_ARATH Transmembrane protein MGGGDHGHGAEGGDFRAKVWSMTGGPNCRPKHWRRNTAIAMFGVFLVCIPIAKLSAKLEQRPHMPVRPIPSQIWCKNFGTKDDYEKEH 88 T 1.8 Chordopox_A13L pdbhh F Eukaryota T 7ar8 36 JA l NDUB8_ARATH NADH dehydrogenase [ubiquinone] 1 beta subcomplex subunit 8, mitochondrial MAGRLSGVASRIMGGNGVVARSVGSSLRQRAGMGLPVGKHIVPDKPLSVNDELMWDNGTAFPEPCIDRIADTVGKYEALAWLSGGLGFFVGLGLLAVLNDKASKVPFTPRVYPYDNLRVELGGEP 125 T 0.00028 NDUF_B8 pdbhh F Eukaryota T 7ar8 40 NA p NDBAB_ARATH NADH dehydrogenase [ubiquinone] 1 beta subcomplex subunit 10-B MGRKKGLPEFEESAPDGFDPENPYKDPVAMVEMREHIVREKWIQIEKAKILREKVKWCYRVEGVNHYQKCRHLVQQYLDSTRGVGWGKDHRPISLHGPKPEAVEAE 106 T 0.00012 NDUFB10 pdb F Eukaryota T 7ar8 42 PA r Q9SD78_ARATH Furry MAKSVSTAASSLVQNLRRYIKKPWQITGPCAHPEYLEAVPKATEYRLRCPATIDEEAIVPSSDPETVYNIVYHGRDQRRNRPPIRRYVLTKDNVVQMMNEKKSFDVSDFPKVYLTTTVEEDLDTRGGGYEK 131 T 0.11 CI-B14_5a pdbpercent F Eukaryota T 7ar8 43 QA u unknown XXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 30 F F F 7ar8 44 RA v UMP2_ARATH Uncharacterized protein At2g27730, mitochondrial MATRNALRIVSRRFSSGKVLSEEERAAENVFIKKMEQEKLQKLARQGPGEQAAGSASEAKVAGATASASAESGPKVSEDKNRNYAVVAGVVAIVGSIGWYLKAGGKKQPEVQE 113 T 0.01 IATP pdbhh F Eukaryota T 7ar9 7 G N Q8LYW0_9CHLO ND2 MWENLWYLDILINVLIITIFGLISCSSSATKSYDLKGCFIISMVGGVYDIPSAILWCLASLSILNFNGFFASLFLVFTWISNLFAMQSLNFLGIYLAFEMQSLCLLVLGKITANENQRWFAYRGLLKYLVLSLIAGSIFIFHASSSYLQSGVMISDSLVTYVFLLFKLGVAPFHMYTLELFSVVSRHVAFVFSTLPKLSVLYLISNSNIGSECVWWGLISLWLGSISQYQSVFVRSILLYSSVAEIGLVLLVLQEGFSWEAFSWVSIYFLSLSGVWHANSKFVSAISVASIAGLPPFLGFIGKAQILKSLVSINLGILIFSSILAATISFIGYLRLIRLMYLVSPVKWKNNKDSSFINWSTWMLTVGTLPMVYSV 375 T 5.5E-09 Proton_antipo_M pdbpssm F Eukaryota T 7ar9 8 H O C1-FDX MALLRALAKPLRSLQAVSSVAQVSLRQFGAASHHDDHHDDHDHYTPPKTVFEDTITINVLDYDGKKHAVKALIGTPLNKALVEYGFSSTYFFPNMGYYTQHISDAHVFIPEEYWKYVENVDLKTDDAEAIKLMFKLVVQDYQRETSFFASYLTLNKEMDNMTIGFGPIKPWHITPKWSFNGHHNVKDRMFDRLETGPFIE 200 T 0.2 HEPN_DZIP3 pdbpssm F T 7ar9 14 N b B9 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 54 F F F 7ar9 15 O c KFYI MGGHDHHHPSVPEPPYAKYLANKSHYCPPDFHYSREIYAPYGGYFNDPKGWRTNTAIATLVMLAGAYAVFCFGNAREERLRAPKGWIPSQLWNDNVPTPVDYRGKVLKDE 110 T 2.3 Deltameth_res pdbhh F T 7ar9 16 P d B14.5b MGWEYAGTYGALCGMVYAIGSNVISGRAWFRRPWVHVTSVTLSYLGSKLLDEVQDTYYLEHLKRVERKGLQVTEEHKKLFSAY 83 T 0.00022 NDUF_C2 pdbhh F T 7ar9 20 T h NUOP4 MAGGNYASLKADTSMDHVFGDSTNKLNYDFQLMSSKEAFFWNYTLYPIVGFPIFLYLYQFNKLENFEAEIAAAKAAKAASE 81 T 0.058 DUF2517 pdbpercent F T 7ar9 21 U i NUOP5 MFFFEFLQGKISDSQKEVDSQAEWYAEYDKLEKARQKRRIWKWRDSDSRDEYAINAEEPVIYIRSSLFGRTEVDPTGKNTNRNHQYLYNLKVLGHKTYTRRDPNELQKAQAEVDTLSAAGRLGPLSPF 128 T 5.9 PTPlike_phytase pdbhh F T 7ar9 22 V j AGGG XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 87 F F F 7ar9 24 X l A0A7S0YPK2_9CHLO ASHI MSLTLGLRSLSRGALAARNALPKRAGAGGPVKLSPPVDKPLPYNYDFWMDNGIYPGQPIYDGMFGSMVGNMSLEYMAKGWLVLLPILSAPFVYEHCIADDVTRNPFVPRQYPSEVREFLALFKNGFVLNDYSQPDYEEMKRRQSGLLTPIY 151 T 0.39 NDUF_B8 pdbhh F Eukaryota T 7ar9 25 Y m A0A7S0Y945_9CHLO B15 MSALQAVKNLTTRMRPFAFQQIRKASNSSRIAGDTTGKYTPNIFSPETPMDRSFSHVPKNPFWEAWVFRRDNIQREFVWTWQTIFDLATFVGGLYVAMYATASFCSRQNDKRNGYPERNYYFSDSKSNFVIPDEREFY 138 T 0.00082 NDUF_B4 pdbhh F Eukaryota T 7ar9 28 BA p PDSW MTTATIEERRAFHKEVLDIVQSKLANKNSEWARPEEPILHNLKSEKEEPHVYYHNNNFRVTRQLIHFEKTKIFEDELDKCMRTHGEAKYRKCQEIAKRFQASCRVASNLERGPNARKRDVGFIYQNNKLRELEKDAKELGLNNPFPPSSPRTTIGY 156 T 0.00011 NDUFB10 pdbhh F T 7ar9 29 CA s A0A7S0VJV3_9CHLO NUOP7 MVKTLADYIHWRNKPSSIPPVDEYRPPVPLVNYDKLSTQFFSKLDNDPVINRVLRAPKVTVMATSLPIVNHPAFLFVAGALTGFSLTYAITSHYVGRKEIENLVKFDPRYFPEYTKSS 118 T 1.1 YtxH pdbhh F Eukaryota T 7ar9 30 DA t A0A7S0YCV2_9CHLO NUOP8 MRSALRLANATRLSTFRLTSAPAVRLASPSFFVQKEDEENTRSIHTSNSSFHDEPKHQIPGNALDNWAFLRTYAKPLPDMIHYYYYVYLFGFFFVYKVADFPEYSPRVLVMAALIGSLFYVRRDWVHREFKDSP 134 T 7.4 DUF2555 pdbhh F Eukaryota T 7ar9 31 EA u unknown XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 50 F F F 7ar9 32 FA w unknown XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 41 F F F 7arb 28 BA c Q8VZT9_ARATH KFYI -- 1 SUBUNIT C1 MGGGDHGHGAEGGDFRAKVWSMTGGPNCRPKHWRRNTAIAMFGVFLVCIPIAKLSAKLEQRPHMPVRPIPSQIWCKNFGTKDDYEKEH 88 T 1.8 Chordopox_A13L pdbhh F Eukaryota T 7arb 36 JA l NDUB8_ARATH NADH dehydrogenase [ubiquinone] 1 beta subcomplex subunit 8, mitochondrial MAGRLSGVASRIMGGNGVVARSVGSSLRQRAGMGLPVGKHIVPDKPLSVNDELMWDNGTAFPEPCIDRIADTVGKYEALAWLSGGLGFFVGLGLLAVLNDKASKVPFTPRVYPYDNLRVELGGEP 125 T 0.00028 NDUF_B8 pdbhh F Eukaryota T 7arb 40 NA p NDBAB_ARATH NADH dehydrogenase [ubiquinone] 1 beta subcomplex subunit 10-B MGRKKGLPEFEESAPDGFDPENPYKDPVAMVEMREHIVREKWIQIEKAKILREKVKWCYRVEGVNHYQKCRHLVQQYLDSTRGVGWGKDHRPISLHGPKPEAVEAE 106 T 0.00012 NDUFB10 pdb F Eukaryota T 7arb 42 PA r Q9SD78_ARATH Furry MAKSVSTAASSLVQNLRRYIKKPWQITGPCAHPEYLEAVPKATEYRLRCPATIDEEAIVPSSDPETVYNIVYHGRDQRRNRPPIRRYVLTKDNVVQMMNEKKSFDVSDFPKVYLTTTVEEDLDTRGGGYEK 131 T 0.11 CI-B14_5a pdbpercent F Eukaryota T 7arb 43 QA u unknown XXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 30 F F F 7arb 44 RA v UMP2_ARATH Uncharacterized protein At2g27730, mitochondrial MATRNALRIVSRRFSSGKVLSEEERAAENVFIKKMEQEKLQKLARQGPGEQAAGSASEAKVAGATASASAESGPKVSEDKNRNYAVVAGVVAIVGSIGWYLKAGGKKQPEVQE 113 T 0.01 IATP pdbhh F Eukaryota T 7arc 16 P r A0A7S0YAP9_9CHLO B14.5a MSGILKTVQSIFYSVGLKEPWKMTGIRSLPDFEYYLPFGLTYRGISPGNQPIKAVVPHDVPKLVYDIKYFARDYRRNNSYTVRSVDSKTPFDYSKVFGSAPLKPADVKTVRIPEVMPHRGC 121 T 0.012 CI-B14_5a pdbpssm F Eukaryota T 7ard 14 N N ND2 MWENLWYLDILINVLIITIFGLISCSSSATKSYDLKGCFIISMVGGVYDIPSAILWCLASLSILNFNGFFASLFLVFTWISNLFAMQSLNFLGIYLAFEMQSLCLLVLGKITANENQRWFAYRGLLKYLVLSLIAGSIFIFHASSSYLQSGVMISDSLVTYVFLLFKLGVAPFHMYTLELFSVVSRHVAFVFSTLPKLSVLYLISNSNIGSECVWWGLISLWLGSISQYQSVFVRSILLYSSVAEIGLVLLVLQEGFSWEAFSWVSIYFLSLSGVWHANSKFVSAISVASIAGLPPFLGFIGKAQILKSLVSINLGILIFSSILAATISFIGYLRLIRLMYLVSPVKWKNNKDSSFINWSTWMLTVGTLPMVYSV 375 T 5.5E-09 Proton_antipo_M pdbpssm F T 7ard 15 O O C1-FDX MALLRALAKPLRSLQAVSSVAQVSLRQFGAASHHDDHHDDHDHYTPPKTVFEDTITINVLDYDGKKHAVKALIGTPLNKALVEYGFSSTYFFPNMGYYTQHISDAHVFIPEEYWKYVENVDLKTDDAEAIKLMFKLVVQDYQRETSFFASYLTLNKEMDNMTIGFGPIKPWHITPKWSFNGHHNVKDRMFDRLETGPFIE 200 T 0.2 HEPN_DZIP3 pdbpssm F T 7ard 28 BA b B9 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 54 F F F 7ard 29 CA c KFYI MGGHDHHHPSVPEPPYAKYLANKSHYCPPDFHYSREIYAPYGGYFNDPKGWRTNTAIATLVMLAGAYAVFCFGNAREERLRAPKGWIPSQLWNDNVPTPVDYRGKVLKDE 110 T 2.3 Deltameth_res pdbhh F T 7ard 30 DA d B14.5b MGWEYAGTYGALCGMVYAIGSNVISGRAWFRRPWVHVTSVTLSYLGSKLLDEVQDTYYLEHLKRVERKGLQVTEEHKKLFSAY 83 T 0.00022 NDUF_C2 pdbhh F T 7ard 34 HA h NUOP4 MAGGNYASLKADTSMDHVFGDSTNKLNYDFQLMSSKEAFFWNYTLYPIVGFPIFLYLYQFNKLENFEAEIAAAKAAKAASE 81 T 0.058 DUF2517 pdbpercent F T 7ard 35 IA i NUOP5 MFFFEFLQGKISDSQKEVDSQAEWYAEYDKLEKARQKRRIWKWRDSDSRDEYAINAEEPVIYIRSSLFGRTEVDPTGKNTNRNHQYLYNLKVLGHKTYTRRDPNELQKAQAEVDTLSAAGRLGPLSPF 128 T 5.9 PTPlike_phytase pdbhh F T 7ard 36 JA j AGGG XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 87 F F F 7ard 38 LA l ASHI MSLTLGLRSLSRGALAARNALPKRAGAGGPVKLSPPVDKPLPYNYDFWMDNGIYPGQPIYDGMFGSMVGNMSLEYMAKGWLVLLPILSAPFVYEHCIADDVTRNPFVPRQYPSEVREFLALFKNGFVLNDYSQPDYEEMKRRQSGLLTPIY 151 T 0.39 NDUF_B8 pdbhh F T 7ard 39 MA m B15 MSALQAVKNLTTRMRPFAFQQIRKASNSSRIAGDTTGKYTPNIFSPETPMDRSFSHVPKNPFWEAWVFRRDNIQREFVWTWQTIFDLATFVGGLYVAMYATASFCSRQNDKRNGYPERNYYFSDSKSNFVIPDEREFY 138 T 0.00082 NDUF_B4 pdbhh F T 7ard 42 PA p PDSW MTTATIEERRAFHKEVLDIVQSKLANKNSEWARPEEPILHNLKSEKEEPHVYYHNNNFRVTRQLIHFEKTKIFEDELDKCMRTHGEAKYRKCQEIAKRFQASCRVASNLERGPNARKRDVGFIYQNNKLRELEKDAKELGLNNPFPPSSPRTTIGY 156 T 0.00011 NDUFB10 pdbhh F T 7ard 44 RA r B14.5a MSGILKTVQSIFYSVGLKEPWKMTGIRSLPDFEYYLPFGLTYRGISPGNQPIKAVVPHDVPKLVYDIKYFARDYRRNNSYTVRSVDSKTPFDYSKVFGSAPLKPADVKTVRIPEVMPHRGC 121 T 0.012 CI-B14_5a pdbpssm F T 7ard 45 SA s NUOP7 MVKTLADYIHWRNKPSSIPPVDEYRPPVPLVNYDKLSTQFFSKLDNDPVINRVLRAPKVTVMATSLPIVNHPAFLFVAGALTGFSLTYAITSHYVGRKEIENLVKFDPRYFPEYTKSS 118 T 1.1 YtxH pdbhh F T 7ard 46 TA t NUOP8 MRSALRLANATRLSTFRLTSAPAVRLASPSFFVQKEDEENTRSIHTSNSSFHDEPKHQIPGNALDNWAFLRTYAKPLPDMIHYYYYVYLFGFFFVYKVADFPEYSPRVLVMAALIGSLFYVRRDWVHREFKDSP 134 T 7.4 DUF2555 pdbhh F T 7ard 47 UA u unknown XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 50 F F F 7ard 48 VA w unknown XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 41 F F F 7arr 1 A,B,C,D A,B,C,D alpha/beta-peptide LSEEEIQRIFGLSSEQIKSLPEEXYKKXVEXTGYL 35 T 0.043 CSTF_C pdbhh F T 7ars 1 A,B A,B alpha/beta-peptide LSEEEIQRIFGLSSEQIKSLPEEXYKKXVEXTGYM 35 T 0.042 CSTF_C pdbhh F T 7arx 3 C C SFTI1_HELAN SFMI1 - Sunflower MASP1 inhibitor GICSRSLPPICIPD 14 T 0.052 Bowman-Birk_leg unp F Eukaryota T 7asd 2 B,D,F,H,J,L,N,P AB,BB,CB,DB,EB,FB,GB,HB APIM_APIME APISIN SUBUNIT APISIMIN,ROYAL JELLY PROTEIN RJP54 MSKIVAVVVLAAFCVAMLVSDVSAKTSISVKGESNVDVVSQINSLVSSIVSGANVSAVLLAQTLVNILQILIDANVFA 78 T 0.44 SRP54_N pdbpssm F Eukaryota T 7ase 22 V F Q4CQU0_TRYCC 40S ribosomal protein SA MTSVESGAKVLRMKEGDVQKLVAMHCHLGTKNRSNAMKKYIHSRTKEGTNIIDLHMTWEKLILAARVIAAVENPQDVTVCSTRLFGQRAIFKFSQLVGTSFLAGRFIPGTFTNQIQKKFMQPRVLLVTDPRTDHQALREASLVNIPVIAFCDTDAPLEFVDIAIPCNNRGRYSISMMYWLLAREVLRLRGTIPRSVPWDVKVDLFFYRDPEEALKHEEVNQAAAPVAEVDEGFGWVERDNNAWEQ 245 T 1.5E-13 Ribosomal_S2 pdb F Eukaryota T 7ast 17 Q Y DNA-directed RNA polymerase III subunit RPC7-beta XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 36 F F F 7asy 1 A A TNFA_HUMAN CACHECTIN,TNF-ALPHA,TUMOR NECROSIS FACTOR LIGAND SUPERFAMILY MEMBER 2,TNF-A RRCLFLSLFSFLIVAGATTLFCLLHFGVIGPQR 33 T 6.2 PBP1_TM unphh F Eukaryota T 7at7 1 A A TNFA_HUMAN CACHECTIN,TNF-ALPHA,TUMOR NECROSIS FACTOR LIGAND SUPERFAMILY MEMBER 2,TNF-A RRCLFLPLFSFLIVAGATTLFCLLHFGVIGPQR 33 T 6.2 PBP1_TM unphh F Eukaryota T 7atb 1 A A TNFA_HUMAN CACHECTIN,TNF-ALPHA,TUMOR NECROSIS FACTOR LIGAND SUPERFAMILY MEMBER 2,TNF-A RRCLFLSLFSFLIVLLLTTLFCLLHFGVIGPQR 33 T 6.2 PBP1_TM unphh F Eukaryota T 7ath 1 A AAA A0A3Q9JIL7_9MICO UipA MGSSHHHHHHSSGENLYFQIGDEFGDDDRSSMSDDGPRHDADDHGPRGEDRGDDDRGNAPSNGRGPVTGIGTASADELIAIADAARGAADGEVTSIDAKRDGTWEVQLTTAAGAETEVRVDEALVASVTSTDAADGDDTGPALTLDDETIRALVSAALAEAEGMITDLDVDGDDVSPYDASVLTSDNRSIDIDFSADFAVVGTDID 206 T 0.0072 HPTransfase pdbpercent F Bacteria T 7atk 1 A AAA A0A3Q9JIL7_9MICO UipA MGSSHHHHHHSSGENLYFQIGDEFGDDDRSSMSDDGPRHDADDHGPRGEDRGDDDRGNAPSNGRGPVTGIGTASADELIAIADAARGAADGEVTSIDAKRDGTWEVQLTTAAGAETEVRVDEALVASVTSTDAADGDDTGPALTLDDETIRALVSAALAEAEGMITDLDVDGDDVSPYDASVLTSDNRSIDIDFSADFAVVGTDID 206 T 0.0072 HPTransfase pdbpercent F Bacteria T 7atr 2 B B YEJA_ECOLI Uncharacterized protein YejA LGEPRYAFNFN 11 T 10 Cas9_C pdbhh F Bacteria T 7aue 6 F C AMINOSERINE XRCXHXRWX 9 T 0.081 ACTH_domain pdbhh F F 7ax1 2 B B CNOT7_HUMAN BTG1-BINDING FACTOR 1,CCR4-ASSOCIATED FACTOR 1,CAF-1,CAF1A GPHMLEPAATVDHSQRICEVWACNLDEEMKKIRQVIRKYNYVAMDTEFPGVVARPIGEFRSNADYQYQLLRCNVDLLKIIQLGLTFMNEQGEYPPGTSTWQFNFKFNLTEDMYAQDSIELLTTSGIQFKKHEEEGIETQYFAELLMTSGVVLCEGVKWLSFHSGYDFGYLIKILTNSNLPEEELDFFEILRLFFPVIYDVKYLMKSCKNLKGGLQEVAEQLELERIGPQHQAGSDSLLTGMAFFKMREMFFEDHIDDAKYCGHLYGLGSGSSYVQNGTGNAYEEEANKQS 290 T 6.1E-32 CAF1 unphh F Eukaryota T 7axp 2 B B CS-VIP8 XXRXXXX 7 T 480 IL23 pdbhh F F 7axq 2 B C CS-VIP8 XXRXXXX 7 T 480 IL23 pdbhh F F 7axs 2 B B CS-VIP8, (ALQ)(4FO)R(ABA)(DPN)(EDN)(S7Z) XXRXXXX 7 T 480 IL23 pdbhh F F 7axx 2 B B (ALQ)(4FO)R(ABA)(DPN)(EDN)(S7Z) XXRXXXX 7 T 480 IL23 pdbhh F F 7ay8 1 A A Tbo-IT2 CIQRHRSCRKSSECCGCSVCQCNLFGQNCQCKSGGLIAC 39 T 0.00031 Toxin_9 pdb F T 7az5 2 E,F,G,H H,I,J,K Peptide 47 XQXDLXL 7 T 310 DUF6394 pdbhh F F 7az6 2 B H Peptide 36 XRQXXLX 7 T 15 DUF4059 pdbhh F F 7az7 2 B H Peptide 37 XQXXLX 6 F F F 7az8 2 C,D H,I Peptide 43 XQXDLPL 7 T 40 CAF1-p150_N pdbhh F F 7azc 2 E,F,G,H H,I,J,K Peptide 22 XQXDLF 6 T 81 Zn_peptidase pdbhh F F 7azd 2 E,F,G,H H,I,J,K Peptide 20 XQXDLF 6 T 81 Zn_peptidase pdbhh F F 7aze 2 C,D H,I Peptide 18 XQXDLF 6 T 81 Zn_peptidase pdbhh F F 7azf 2 E,F,G,H H,I,J,K Peptide 8 XQXDLF 6 T 81 Zn_peptidase pdbhh F F 7azg 2 I,J,K,L,M,N,O,P I,J,K,L,M,N,O,P Peptide 4 GQXXLF 6 T 14 DUF6248 pdbhh F F 7azk 2 B,D,F,H H,I,J,K Peptide 35 XQXXLX 6 T 280 PcfK pdbhh F F 7azl 2 E,F,G,H E,G,F,H Peptide 38 XQXXLX 6 T 280 PcfK pdbhh F F 7azx 2 C C HUWE1_HUMAN E3 ubiquitin-protein ligase HUWE1 SHDQHAVLVLQPAVEAFFLVHATERESK 28 T 0.49 DUF3652 pdbhh F Eukaryota T 7b0n 29 CA c A0A1D8N3H5_YARLL Subunit NUZM of NADH:Ubiquinone Oxidoreductase (Complex I) MLPGGPVPVFKKYTVGSKGIWEKLRVLLAIAPNRSTGNPIVPLYRVPTPGSRPEANVYQDPSSYPTNDIAENPYWKRDHRRAYPQTAFFDQKTVTGLLELGSEATPRIADGEAGTKALANIANGGVSFTQALGKSSKDVIYGEVLTVNGLPPVAPTLAPKQWKIIEGEAAIYPKGYPCRTFH 182 T 0.00017 CI-B14_5a pdbhh F Eukaryota T 7b0n 30 DA d A0A1D8NGI5_YARLL subunit NEBM of protein NADH:Ubiquinone Oxidoreductase (Complex I) [Yarrowia lipolytica] ALFTSLVGASGLGFATKFLSNKIRLKPAGYYPLGYVFSGVAWAGLGLVLHNVHQHSLEVLEKKKTALSEQRTE 73 T 0.065 DUF6404 pdbpssm F Eukaryota T 7b0n 33 GA g Q6ZY23_YARLL Subunit NESM of NADH:Ubiquinone Oxidoreductase (Complex I) MLKLHYRNFITAQHSTTNTTPTMIASVCKRAGLRAGPRAYPGVRQFALRAYNEEKELALKQRLSQLPPPGKAFVTAEGEPRPAKEAELAELAEIAALYKTDRVGILDILLLGNKHARLYRDNTALLKDYYYNGRRILDKIPVKDKQTGKVTWEIKREGAEKEDWVNQMYFLYAPSLILLLIVMVYKSREDITFWAKKELDQRVLDKHPEINDAPENERDALIVERIIAGDYDKLASLQKKATPTPATLI 249 T 0.0013 ESSS pdbpercent F Eukaryota T 7b0n 34 HA h A0A371CFV9_YARLL subunit NUNM of protein NADH:Ubiquinone Oxidoreductase (Complex I) MLRHTVRATQTLRQARNVRFGSHGHGPELTPAVPFFQPYVLKWAGVSLGLVAFYQFNSSYEAKNGHTWVETFFHPKSREDILNEEAKIVQALNNQRELTIKMHELKREEKDYAHSYSPLFSDPVPQGGSIGKAPGSSRE 139 T 0.033 DUF5950 pdb F Eukaryota T 7b0n 35 IA i A0A1H6Q311_YARLL Subunit NUUM of NADH:Ubiquinone Oxidoreductase (Complex I) MGGGRYPFPKDVISMTGGWWANPSNWKLNGLFATGIAVGLALWVSTATLPYTRRREGITSESDISKWNAAAGVWRERHGKISTGEAAESE 90 T 6 Sex_peptide pdbhh F Eukaryota T 7b13 2 B P SHN3pS542 ASHSMPSAAC 10 T 6.1 Equine_IAV_S2 pdbhh F T 7b15 2 B P SHN3pT869 DRPDTEPEP 9 T 39 Prd1-P2 pdbhh F F 7b1f 2 C,D C,D BUB1_HUMAN HBUB1,BUB1A KVQPSPTVHTKEALGFIMNMFQAPTS 26 T 2.7 Feld-I_B pdbhh F Eukaryota T 7b1h 2 C,D,G,H C,D,G,H BUB1_HUMAN HBUB1,BUB1A KVQPSPTVHTKEALGFIMNMFQAPTS 26 T 2.7 Feld-I_B pdbhh F Eukaryota T 7b1i 2 B C A0A219T3Y8_MAGOR AVR-PIK PROTEIN ( PIKMPROTEIN,PIKP PROTEIN ),AVRPI7 PROTEIN GPETGNKYIEKRAIDLSRERDPNFFDNADIPVPECFWFMFKNNVRQDAGTCYSSWKMDKKVGPNWVHIKSDDNCNLSGDFPPGWIVLGKKRPGF 94 T 2.8 DIM unphh F Eukaryota T 7b1j 2 C,D C,D BUB1_HUMAN HBUB1,BUB1A KVQPSPTVHTKEALGFIMNMFQAPTS 26 T 2.7 Feld-I_B pdbhh F Eukaryota T 7b26 3 C C CirpA1 GPMGEDQETDFSSTDGAELIAKEPEVYPIDQFMNNTEIWVFNTTQPDPPNCKKDKSKSMTQTATSFVRSHVKNGNIIEENLVGNFTYFNDKEKVYDGIYISGESSGVYAEHLYYVSEDKKCGLFQVFAHVNDKTTIWRDVRVSGRPEEGVPLELNCTKEFDEYVKLVNATSKSPYTSECQ 180 T 0.0082 FBA_1 pdbpercent F T 7b2a 1 A A CirpA5 MGQSEKQEEPDYPINKFMNTTDEIWVFRTTQENVQKCKKDKNKYMTTSATFFTRSHEEQDQIHEQELVGKFANFYDKPDGVYDRIDITGDKTGVYEEALAYASKENTCGVVGVWAFDGETTVVWRELRVRNRPNDATKVDEMCKKKFDDYVQVVNKSWTSPYNEKCK 167 T 4.8 His_binding pdbhh F T 7b2b 1 A B A0A3D9UGN9_9GAMM PEPTIDE SYNTHETASE XPSB (MODULAR PROTEIN) MNNNELTSLPLAERKRLLELAKAAKLSRQHY 31 T 1.1 CSTF_C pdbhh F Bacteria T 7b2d 1 A A CirpA1 GPMGEDQETDFSSTDGAELIAKEPEVYPIDQFMNNTEIWVFNTTQPDPPNCKKDKSKSMTQTATSFVRSHVKNGNIIEENLVGNFTYFNDKEKVYDGIYISGESSGVYAEHLYYVSEDKKCGLFQVFAHVNDKTTIWRDVRVSGRPEEGVPLELNCTKEFDEYVKLVNATSKSPYTSECQ 180 T 0.0082 FBA_1 pdbpercent F T 7b2f 1 A A A0A3D9UGN9_9GAMM Peptide synthetase XpsB (Modular protein) MNNNELTSLPLAERKRLLELAKAAKLSRQHY 31 T 1.1 CSTF_C pdbhh F Bacteria T 7b3j 2 B B D3 all D-enantimeric peptide XXXXXXXXXXXX 12 F F F 7b3k 2 B B D3 all D-enantimeric peptide XXXXXXXXXXXX 12 F F F 7b4t 1 A B HIV-1 envelope variable loop 3 crown mimetic peptide V3-IF (BG505) KSIRIGPGQAFYAXP 15 T 0.00056 GP120 pdbhh F T 7b4u 2 C,D D,B HIV-1 envelope variable loop 3 crown mimetic peptide V3-IF (BG505) KSIRIGPGQAFYAXP 15 T 0.00056 GP120 pdbhh F T 7b4v 2 C,D B,D HIV-1 envelope variable loop 3 crown mimetic peptide V3-IF (BG505) KSIRIGPGQAFYAXP 15 T 0.00056 GP120 pdbhh F T 7b4w 2 C,D B,D HIV-1 envelope variable loop 3 crown mimetic peptide V3-IF (BG505) KSIRIGPGQAFYAXP 15 T 0.00056 GP120 pdbhh F T 7b5h 1 A,B,C,DC,EC,FC,GA,HA,IA,MB,NB,OB,R,S,T,XA,YA,ZA AA,AB,AC,FA,FB,FC,CA,CB,CC,EA,EB,EC,BA,BB,BC,DA,DB,DC Q8YRX8_NOSS1 All3314 protein MTTTITIPNSYPIFTPNQVLTNKDLNRVVTYLDEQNRLTRVYLIGMGIVAGMEVSSIYQPGDVNIVVAPGCGITSEGYIISLAETKLTHYQSGVSVPSALFAPSEEQTAASTDQLVELFEQEGNNRLALKNLPDENAFARFLADQTLVVVYELQDQQRDSCLLDCDDTGKDRNFRLRYFLLPRSVPEKLSAEALLQQGFSREPLPQQWRDFSINDIFQAQSSFFQNFFPQVRRFGYTLETPPVIRLSNIVDYDAFLKGYQQVCLQAIDEIDRTFPNLFRLFSPFFSSFNPAPSDFTGLKTLLNQRLSDIVSGSSAENRRSPISQIEAQYALQYFYDYLSQLVSAFRELAESAFDLMDDATPDTRRFPKFLMLGLVPLPNQKPEVYALNSPYRSNFSQSPIYNGNQLRVKQVRFLYDRLVRLCAADSFYLLPFYDTPLKITPSKDRAATLSQQAIPYYLNYPQLYQYWSYDTYRKGRSQSHPAYFYPNNANITPNSDLLHRLDDYSFYRIEGHIGEANATALQRILDYQQRYNLAFDVITLKIGNLQSFQDINISGQFDDLNADFGRIKDTFAKLWQRYEESWSRNVFLYTLKRVFFDKTSLAEIKSDQLFNPIVARASVKEAYEFVKESGDSYRLYLRNAAGIRIARFETVINFSGLSGDSLTQEQERIIGDLLACLPLGKITYGVEPESANNPLSYYLRFSLADELDLPANRGTADISFISLNFFTVNFEGNSPIINQPEFQDFETLYSLLRDVPESSIRVNRLELRMGDRLAADTLNYFELKGLMTAYQQRLAQIMELQLFHKFAQNNPGMEHLGGVPKGGTFVLVYVDGRELVRNLLSADRDPTYQARTEVIKKYASLPPGSPQELATSRELLNREDIVVGDFCLPYRFSSKTPTVSYVLTQPRPIVLLDRTTFCAGDETRYEFILDPTGGTLKGEGSFFADGKYYFQPSRITDDITSETAITFTYVVESSYDTLSVTVYPLPDASFQIKTNFCSNENPVTLRATQPGGNFRAFDSETDISASVINNQEFNPSAVNLGGATEKVITLVYTITSDQGCTNELSRDITIFAVPNATFQVGQGKTRFCSNDEPVDLIARVPGGTFQVRDGAEDISADVINRLTTPPQFDPSAVNLGVAREKVITLEYSISNQGCSNKFTQELRIFAVPNANFRLSTGNRDTFTNNDPPVGLIATQLGGTFQAFDGEEDITADVISPTTPPQFNPSAVNLGDEEEKVITLRYTISNQGCSNNTERRVTIVPPPEVPVRDVEDTSNPDSGDAPTENPIPHPEVRAVNLLAISNNEVINSTNLDGDRTFNLSDFNPNNQYTFEAMTVPEKVNSVIFTYTKPNGSRQALTANTAPYRMPDDWQPSIGIHEIQAQAIREVNGDRLEGATIKVIIRVIDADTDTSPSRSTNPDNLFTRIQNLFPLNRGEIITKIKLPQLLAMSTAIFMLIVGWTYSSSKQVGSTPPSVIKPR 1476 T 0.035 Cadherin_5 pdbpercent F Bacteria T 7b5h 2 AB,D,GC,JA,PB,U DD,AD,FD,CD,ED,BD Q8YRX7_NOSS1 All3315 protein MPEYLSISKQKPDFPPYLNFQTLRDIGITHLQALSGKIWTDYNLHDPGVTILEVLCYAITDLGYRNNLDIADLLALNPQDGNSRENNFFTPDAVLTCNPVTELDVRKRLIDIPGVRNAWLQKVTSYEPNIYVNFSDKRLQYNPPTAESKTLNPRGLYTVRLDLDQDYRKNACGQIDRSWGDTLDEVKQVLCDSRNLCEDFADIVILGEEEIGICADIQLETNADAEDVLVNIYVRIQQFLSPRLKFYTLQELLDKGKSPAEIFAGRPSVFDGENRLYKSHGFIDTDELEALTLPTILHTSDLYQEILQVPGVSAIKKLSIANYINGLRQTQGHPWYLQLTDQYRPVLGVKTSKINFFKSELPIGVDEEEVERRYYEQQAAYIKTIRDRDELDIPVPKGSYYDLADHYSIHHDFPTTYGISEDGLPPTVPALRKAQALQLKAYLVFFDQLLASYLAQLSHIRDLFSWEVDVTQPQQNDYATRLQEKQRTYFTQKLDFPEIEKIIPDNYLDVLDEAPETYRDRRNRFLDHLLARFSESFSDYVLLNYQMFATRNNKATQETEIIHDKAQFLQDYPTLSRDRFRAYNYYDCHAVWDTDNVAGFKKRVLRLLGIDDVRRRHLSHYRVDKDSRNLFLSIDFSSDDLTLTSKQRYATTEQAQADQDKLLLFALHPNFYKRLSYKYYYHYSWEILDTQNQSIVRSDRFFPSTKERAAALEPLLQSLLTQLSQLDDTALQNLVITQPTDEDLYSFRLQIPLNSGVITFTGVQRYFSRTEAVDAGVISLRLIQDVQNYRNITLGQDQGTTPQKFTYYGYGLVDHQGSLLSEYTHHFPTELERELSLQRWLTHIQANQNQYKFAIETITNGYVFVINDITNSQTLLRGISSYATEYLAWQAASEFAENLRYLNRYLSPAKDHTGQTYSLGITDKTGKLLAVTTTESDRLLTFQRLNALEPFLVIEAATTPTSGYRYRLVDRQETTILQSIQIYGDETTARDRFYQDVLGTLFETGVINPTTTNKEFGFRILSRPRDTNSVAAIHTQTYTSEAERDAAIEHLLLLVRTARLRISTNSLDSLAYISQIYNPDNQLILQGTQRYTSEDIAWEQGNTLMELAQDEENFRLIDSDDGVYGWELTNEGKDEIFAAQYYNSREERTAAIAEIQKYSNDEGFHLLEHILLRPRTKLPDLTAGDGFLPILVTPEDVNTEPDDPYLLARTDPYSFWVTIVLPYWPQRFRDIPFRRFVERTLRLEAPAHIALKIAWVNVRQMRDFELAYRHWLEQLALESCENAACDLTGTLNRLLKILPQLRNVYPKATLHDCEESSADNNPAILNQTALGTAND 1335 T 0.001 DUF276 pdbhh F Bacteria T 7b5h 3 BB,CB,DB,E,F,G,HC,IC,JC,KA,LA,MA,QB,RB,SB,V,W,X DE,DF,DG,AE,AF,AG,FE,FF,FG,CE,CF,CG,EE,EF,EG,BE,BF,BG Q8YRX6_NOSS1 All3316 protein MNLRKRNELKSLFKNKSRLSETYFVELIDSTLNKRDDRFHGIWKPGQTYQKGDVVYYNHSLWEMQSENEICAKEEQTPGISTDWKSLLKELEQKVDKLQHELETLHQEFTEYQKQMEIRLQLLARFIPILFIGLGIMFFWLLGQSTVHILAGTT 154 T 0.00052 CLZ pdbpssm F Bacteria T 7b5i 2 AA,B,G,L,Q,V FB,AB,BB,CB,DB,EB Q8YRW6_NOSS1 All3326 protein MKILYKKILNLELWHDFYLGQPNTPGSLPNNYDISRTLALVPTQECLRVLANLRWVFRPQLYGASLFANVNAAPSGQFPTIFPIDRVYRLTFWLVVSDRYFANFTNLSLINSRNQIYYFSNLSGNEGHALFLTQPLSAYTTNNEYQLGQLVTHADKTLESLTYQGNATNIPNPSDWDSLPASQYVSELDHLPRQGTYRTQVITNANPDNTYNFTLVNTNEQESWAIDVIVPDTHKSGEPFSTSLNFVGQTPGHYRLLENDTQVAEFVLVDNSLPEAFALVEVILNPELVPSAFSLLQASAGQTFIQPKTYVIRFKNRATRWRYRYEQPHGCSAANLPSYFNLIDTHTYATARPIGLRQRPDSLLNDCQDRPLPAPSITLIQPETDGSQRIARIFSDIYL 399 T 0.66 Y_Y_Y pdbpercent F Bacteria T 7b5k 53 AB v ermC Nacent chain GIFSIFVI 8 T 0.16 Leader_Erm pdb F F 7b76 1 A A V5TFR9_LEPMC Avirulence protein LmJ1 GHMHDCHQVTVSRDVTLQNKERHDCNQVCASIDKETENKLNTDIIPRLTRYMSVKGNSIIARVQQSNSDPKCSCTWRAIIWRVYKAYDENSLNVALHVSHPNQQIGENPDWSLVISNPNVHCLKH 125 T 3.8 Antimicrobial_6 pdbhh F Eukaryota T 7b7u 12 L N Unknown XXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 30 F F F 7b9v 25 Y X Unassigned structure XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 240 F F F 7b9v 27 AA Z NTC20 isoform 1 MPSLRDLSLERDQELNQLRARINQLGKTGKEEANDFVGLNISNEPVYDTVIQTGQSSNATNSFVQETIQKTKQKESGQPYIIPQKNEHQRYIDKVCETSDLKAKLAPIMEVLEKKTNEKIKGIIRKRVLQEPDRDNDDSG 140 T 9.3E-05 cwf18 pdbhh F T 7bag 3 C C Compstatin CP40 XICVXQDWXAHRCX 14 T 1.9 Inhibitor_I36 pdbhh F T 7bas 1 A,B,C,D,E A,B,C,D,E CC-Type2-(TgLaId)4-W19BrPhe. XGEIAQTLKEIAKTLKEIAXTLKEIAQTLKGX 32 T 0.004 ApoC-I pdb F T 7bat 1 A,B,C A,B,C CC-Type2-(GgIaId)4 XGEIAQGIKEIAKGIKEIAWGIKEIAQGIKGX 32 T 0.05 MCPsignal pdbpssm F T 7bau 1 A,B,C,D,E,F,G,H,I,J A,B,C,D,E,F,G,H,I,J CC-Type2-(TgLaId)4-W19BrPhe. XGEIAQTIKEIAKTIKEIAXTIKEIAQTIKGX 32 T 0.0016 MCPsignal pdb F T 7bav 1 A,B,C,D,E A,B,C,D,E CC-Type2-(TgLaId)4-W19BrPhe XGEIAQTLKEIAKTLKEIAXTLKEIAQTLKGX 32 T 0.004 ApoC-I pdb F T 7baw 1 A,B,C,D,E,F,G A,B,C,D,E,F,G CC-Type2-(GgIaId)4 XGEIAQGIKEIAKGIKEIAWGIKEIAKGIKGX 32 T 0.062 MCPsignal pdbpssm F T 7bb5 1 A A G4A6K5_AGGAC AcrIF9 MGSSHHHHHHSQDPMTNVVYYFTETNNINAYATAEALKAQTLADAKREASRRQCFQGTTLKIGTIYSLNSDGLLVDEITSKEDGKKWVDRY 91 T 3.2 DVNP unphh F Bacteria T 7bbp 2 E,F FFF,GGG Q0VAS5_HUMAN Histone H4 SGRGXGGXGL 10 T 4.7 G3P_acyltransf pdbhh F Eukaryota F 7bcj 1 A A Q6A6F6_CUTAK Radical Oxygenase of Propionibacterium acnes MTPIDESQLPVGPQVSVTDSAQHTGPFAASSPLTITVKPGAPCVRADGYQESMVTRVLDDKGHQVWTGTFDESKLIGGTGLGTATFHVGSPAAAFNFHGSERTTYRTLSYCAYPHYVNGTRERLSQVSVKTFMVDPALNLEHHHHHH 147 T 2.2 EAGR_box pdbhh F Bacteria T 7bcy 2 C,D P,Q LANA1_HHV8P ORF 73 XCRKRNRSPERX 12 T 0.022 DUF5401 unphh T Viruses T 7bdu 2 C,D,E,F,G,H C,D,E,F,G,H COQA1_HUMAN 21er collagen model peptide XPPGPPGPPGPRGLPGPPGPPG 22 T 0.0017 Collagen pdb F Eukaryota F 7bdx 1 A,B,C,D A,B,C,D HSF2B_HUMAN Heat shock factor 2-binding protein AEMGAAACTLLWGVSSSEEVVKAILGGDKALKFFSITGQTMESFVKSLDGDVQELDSDESQFVFALAGIVTNVAAIACGREFLVNSSRVLLDTILQLLGDLKPGQCTKLKVLMLMSLYNVSINLKGLKYISESPGFIPLLWWLLSDPDAEVCLHVLRLVQSVVLEPEVFSKSASEFRSSLPLQRILAMSKSRNPRLQTAAQELLEDLRTLEHNV 214 T 0.0026 KAP unphh F Eukaryota T 7bdx 2 E,F E,F BRCA2_HUMAN FANCONI ANEMIA GROUP D1 PROTEIN NEFDRIIENQEKSLKASKSTPDGTIKDRRLFMHHVSLEPITTVPFRTTKERQENLYFQG 59 T 8.8 GMP_synt_C pdbhh F Eukaryota T 7bee 2 C,D,E,F,G,H C,D,E,F,G,H COQA1_HUMAN 21er collagen model peptide XPPGPPGPPGPRGFPGPPGPPG 22 T 0.0015 Collagen pdb F Eukaryota F 7bfi 2 B,F,G,H,I,J F,H,I,J,E,G COQA1_HUMAN 15R8 collagen model peptide XPPGPPGPRGPPGPPGX 17 T 0.013 Collagen pdbpssm F Eukaryota F 7bfp 4 D U Unknown XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 36 F F F 7bfq 4 D U Unknown XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 171 F F F 7bfy 1 A A B4EH86_BURCJ Lectin MPLLSASIVSAPVVTSETYVDIPGLYLDVAKAGIRDGKLQVILNVPTPYATGNNFPGIYFAIATNQGVVADGCFTYSSKVPESTGRMPFTLVATIDVGSGVTFVKGQWKSVRGSAMHIDSYASLSAIWGTA 131 T 0.002 DUF1543 unppercent F Bacteria T 7bgh 1 A A OEP21_PEA CHLOROPLASTIC OUTER ENVELOPE PORE PROTEIN OF 21 KDA,GOEP21 METSLRYGGDSKALKIHAKEKLRIDTNTFFQVRGGLDTKTGQPSSGSALIRHFYPNFSATLGVGVRYDKQDSVGVRYAKNDKLRYTVLAKKTFPVTNDGLVNFKIKGGCDVDQDFKEWKSRGGAEFSWNVFNFQKDQDVRLRIGYEAFEQVPYLQIRENNWTFNADYKGRWNVRYDLLEHHHHHHHHHH 189 T 0.021 Fmp27_GFWDK unppssm F Eukaryota T 7bgt 2 E,F G,F peptidomimetic inhibitor PYVXAMH 7 T 55 Allatostatin pdbhh F T 7bgu 2 E,F G,F peptidomimetic inhibitor PXVXAMT 7 T 34 Allatostatin pdbhh F T 7bh8 5 I,J P,Q VL9 leader peptide VMAPRTVLL 9 T 0.0013 UL40 pdbhh F T 7bim 1 A,AA,B,BA,C,CA,D,DA,E,EA,F,FA,G,GA,H,HA,I,IA,J,JA,K,L,M,N,O,P,Q,R,S,T,U,V,W,X,Y,Z A,a,B,b,C,c,D,d,E,e,F,f,G,g,H,h,I,i,J,j,K,L,M,N,O,P,Q,R,S,T,U,V,W,X,Y,Z Nonameric de novo coiled coil CC-Type2-(GgLaId)4 XGEIAQGLKEIAKGLKEIAWGLKEIAQGLKGX 32 T 0.048 WXG100 pdbpssm F T 7bjs 1 A,B A,B KINH_DROME Kinesin heavy chain SMSFLENNLDQLTKVHKQLVRDNADLRCELPKLEKRLRCTMERVKALETALKEAKEGAMRDRKRYQYEVDRIKEAVRQKHLGRRGPQAQ 89 T 1.2E-05 SMC_N unphh F Eukaryota T 7bkx 1 A AAA Q6SVB5_DIPPU Milk protein MRQVWFSWIVGLFLCFFNVSSAKEPCPPENLQLTPRALVGKWYLRTTSPDIFKQVSNITEFYSAHGNDYYGTVTDYSPEYGLEAHRVNLTVSGRTLKFYMNDTHEYDSEYEILAVDKDYFIFYGHPPAAPSGLALIHYRQSCPKEDIIKRVKKSLKNVCLDYKYFGNDTSVHCRYLE 177 T 0.23 CE2_N pdbpercent F Eukaryota T 7bl1 5 E FFF unknown peptide AAAAAAAAAAAAAAAAAAAAAA 22 T 460 DUF4699 pdbhh F F 7blo 3 C,F N,H NRAM2_HUMAN C-term (residues 493-54) of Wls (fitted sequence corresponds to hDMT1-II) QPELYLLNTM 10 T 4.9 DUF5081 pdbhh F Eukaryota T 7blq 4 G,H V,U NRAM2_HUMAN The C-terminal portion of Kex2 cargo, fitted with Phi-X-(L/M) sorting motif of hDMT1-II cargo. QPELYLLNTM 10 T 4.9 DUF5081 pdbhh F Eukaryota T 7blz 15 O O M1VFJ4_CYAM1 PsaO FEVSDGEPYPLNPAVIFIALIGWSAVAAIPSNIPVLGGTGLTQAFLASIQRLLAQYPTGPKLDDPFWFYLIVYHVGLFALLIFGQIGYAGYAKGTYN 97 T 23 YbgT_YccB unphh F Eukaryota T 7bm9 2 B B VAL-ASN-LEU-SEP-ILE VNLSI 5 T 130 Ribosomal_L50 pdbhh F F 7bmc 2 B B VAL-ASN-LEU-SEP-ILE VNLSI 5 T 130 Ribosomal_L50 pdbhh F F 7bn1 2 C,D E,F MUNS_REOVL Protein mu-NS from Reovirus type 1 VDGAADLIDFSVPTDEY 17 T 5.4 HAUS-augmin3 unphh T Viruses T 7bn2 2 C,D CCC,DDD Non structured protein 3 from Eastern Equine Encephalitis Virus SDHSVDLITFDSVTDIY 17 T 2.9 DUF3343 pdbhh F T 7bnt 1 A,B A,B Predicted ancestral HMA domain of Pik-1 from Oryza spp. GPGMKQKIVIKVPMASDKCRSKAMALVASTGGVDSVALVGDLRDKIEVVGDGIDSIKLVSALRKKVGHAELLQVS 75 T 0.00011 HMA pdbpercent F T 7bnt 2 C C C4B8B8_MAGOR AVR-PIK PROTEIN ( PIKMPROTEIN,PIKP PROTEIN ),AVRPI7 PROTEIN,FRAGMENT OF MAGNAPORTHE ORYZAE AVR-PIKD GPETGNKYIEKRAIDLSRERDPNFFDHPGIPVPECFWFMFKNNVRQDAGTCYSSWKMDMKVGPNWVHIKSDDNCNLSGDFPPGWIVLGKKRPGF 94 T 0.1 TMEM18 unp F Eukaryota T 7bny 1 A,B,C,D A,B,C,D POLG_ENMGO Genome polyprotein SPNPLDVSKTYPTLHILLQFNHRGLEARIFRHGQLWAETHAEVVLRSKTKQISFLSNGSYPSMDATTPLNPWKSTYQAVLRAEPHRVTMDVYHKRIRPFRLPLVQKEWRTCEENVFGLYHVFETHYAGYFSDLLIHDVETNPGGSKHHHHHH 152 T 0.0044 LZ3wCH pdbpssm T Viruses T 7bo8 1 A,B,C,D,E,F A,B,C,D,E,F CC-Type2-(VaYd)4-Y3F-W19(BrPhe)-Y24F XGEFAQAVKEYAKAVKEYAXAVKEFAQAVKGX 32 T 0.053 IFT20 pdb F T 7bo9 1 A,B,C,D,E,F A,B,C,D,E,F CC-Type2-(VaYd)4-Y3F-W19(BrPhe) XGEFAQAVKEYAKAVKEYAXAVKEYAQAVKGX 32 T 0.08 IFT20 pdb F T 7boa 1 A,B,C,D,E,F,G,H A,B,C,D,E,F,G,H CC-Type2-(YaFd)4-W19(BrPhe) XGEFAQAYKEFAKAYKEFAXAYKEFAQAYKGX 32 T 8.6 Cas_Csy3 pdbhh F T 7boc 2 B B RIOK1_HUMAN peptide SRVVPGQFDDADSSD 15 T 0.048 COPR5 pdbhh F Eukaryota T 7bos 2 B B Myristoyl thiourea inhibitor, No.13 XKRRX 5 T 420 PPV_E1_N pdbhh F F 7bot 2 B B myristoyl thiourea inhibitor, No.23 XKXRX 5 T 740 TAT_ubiq pdbhh F F 7bow 1 A,B A,B Hydroxynitrile lyase GSLTCDKLPKVIPPGIDAFTSHNPFEFSYVLTDDLDCTARVYVQPVHGLTNYSGTAFDIKGTHITINDFTIGADGLTAYLTNCDTGEKQVWHFQYVDLGDPQGANYCAYSCNGPQIAEYKCTTNTGYISPKQLQAVKEARSVPNGDKIHLAQVDCPPHLYCPLYY 165 T 2.6 VanY pdbhh F T 7boy 1 A,B,C,D,E,F s,t,u,v,w,x TUBE2_BPT7 GENE PRODUCT 12,GP12 MALISQSIKNLKGGISQQPDILRYPDQGSRQVNGWSSETEGLQKRPPLVFLNTLGDNGALGQAPYIHLINRDEHEQYYAVFTGSGIRVFDLSGNEKQVRYPNGSNYIKTANPRNDLRMVTVADYTFIVNRNVVAQKNTKSVNLPNYNPNQDGLINVRGGQYGRELIVHINGKDVAKYKIPDGSQPEHVNNTDAQWLAEELAKQMRTNLSDWTVNVGQGFIHVTAPSGQQIDSFTTKDGYADQLINPVTHYAQSFSKLPPNAPNGYMVKIVGDASKSADQYYVRYDAERKVWTETLGWNTEDQVLWETMPHALVRAADGNFDFKWLEWSPKSCGDVDTNPWPSFVGSSINDVFFFRNRLGFLSGENIILSRTAKYFNFYPASIANLSDDDPIDVAVSTNRIAILKYAVPFSEELLIWSDEAQFVLTASGTLTSKSVELNLTTQFDVQDRARPFGIGRNVYFASPRSSFTSIHRYYAVQDVSSVKNAEDITSHVPNYIPNGVFSICGSGTENFCSVLSHGDPSKIFMYKFLYLNEELRQQSWSHWDFGENVQVLACQSISSDMYVILRNEFNTFLARISFTKNAIDLQGEPYRAFMDMKIRYTIPSGTYNDDTFTTSIHIPTIYGANFGRGKITVLEPDGKITVFEQPTAGWNSDPWLRLSGNLEGRMVYIGFNINFVYEFSKFLIKQTADDGSTSTEDIGRLQLRRAWVNYENSGTFDIYVENQSSNWKYTMAGARLGSNTLRAGRLNLGTGQYRFPVVGNAKFNTVYILSDETTPLNIIGCGWEGNYLRRSSGI 794 T 0.031 MelC1 pdbpercent T Viruses T 7bp4 3 C,F L,C ASP-ASP-ASP-ASP-TYR DDDDY 5 T 45 BioW pdbhh F F 7bp5 3 C C NP1L1_HUMAN ASN-ASP-PRO-ASP-TYR EENDPDY 7 T 1.1 TrbL_3 pdbhh F Eukaryota F 7bp6 3 C A NRP1_ARATH 7-MER FROM NAP1-RELATED PROTEIN 1 DADEEDF 7 T 0.026 Drc1-Sld2 unppercent F Eukaryota F 7bph 2 B B GN13 XXFESVYAIWGTLCGX 16 F F T 7bpl 1 A A NF1 GDADKIMEQAKRQDPNAQVYKVTTPDEIEEAVRRIEKYGAQVVLIIYTSSGIVILVAVRDPSQADQILKEAKKQNPSATFVRLEGVSPDDLRRQVEDVWRGSLEHHHHHH 110 T 0.0052 GGDEF_2 pdb F T 7bpm 1 A A NF2 GTEIELESKNGQREHYTATSEDEARKIIEKAVRRGIKRIELRGASEQLIRDMQEIAKQIGLQYRTDGSLEHHHHHH 76 T 0.1 DUF6506 pdb F T 7bpn 1 A A NF7 GQIQYFNVDENPEQVRKLIEQAGLDPDELREAEVIIIIISRTPEQLEKLSRQVKELGADRLLEFNVDENPEQASKLAKTAGISEKQLREADYIILILVRDEKKAKKFADSLRKKGSLEHHHHHH 124 T 0.015 AAA_12 pdb F T 7bpo 1 A,B A,B Hydroxynitrile lyase GSLTCDKLPKVIPPGIDAFTSHNPFEFSYVLTDDLDCTARVYVQPVHGLTNYSGTAFDIKGTHITINDFTIGADGLTAYLTNCDTGEKQVWHFQYVDLGDPQGANYCAYSCNGPQIAEYKCTTNTGYISPKQLQAVKEARSVPNGDKIHLAQVDCPPHLYCPLYY 165 T 2.6 VanY pdbhh F T 7bpp 1 A A NF5 GEDDEILQRAKDILKEDPNRKILIILNPDGKIELYEVTSEEDIKRIAKKAGISEELLRRILQSFRDGQYDLFFIAKTEDDERRARELKERMGKPVEILRGSLEHHHHHH 109 T 0.002 Nucleoporin_N pdb F T 7bq9 1 A,B B,A PP62_ASFB7 60 kDa polyprotein RSPWPSNMKQFCKISVWLQQHDPDLLEIINNLCMLGNLSAAKYKHGVTFIYPKQAKIRDEIKKHAYSNDPSQAIKTLESLILPFYIPTPAEFTGEIGSYTGVKLEVEKTEANKVILKNGEAVLVPAADFKPFPDRRLAVWIMESGSMPLEGPPYKRKKEGGGNLESRGPFEGKPIPNPLLGLDSTRTGHHHHHH 194 T 0.34 Fasciclin unppssm T Viruses T 7bqa 1 A,B B,A PP62_ASFB7 CP530R,PCP530R RSPWDPPVPKHISPYTPRTRIAIEVEKAFDDCMRQNWCSVNNPYLAKSVSLLSFLSLNHPTEFIKVLPLIDFDPLVTFYLLLEPYKTHGDDFLIPETILFGPTGWNGTDLYQSAMLEFKKFFTQITRQTFMDIADSATKEVDVPICYSDPETVHSYTNHVRTEILHHNAVNKVTTPNLVVQAYNELEQTNTIRHYGPIFPESTINALRFWKKLWQDEQRFVIHGLHRTLMDQPTYETSEFAEIVRNLRFSRPGNNYINELNITSPAMYGDKHTTGDIAPNDRFAMLVAFINSTDFLYTAIPEEKVGGNLESRGPFEGKPIPNPLLGLDSTRTGHHHHHH 339 T 30 Imm74 pdbhh T Viruses T 7bqb 1 A A NF6 GKLYEVDSPDSVEKIARELGLSEEQLRRIQKEFERAERKGKLVIVYLTSDGKVEIREVTSEEELEKILKKLGVDEEIIRRIKRLRKEGQIKLVIIEGSLEHHHHHH 106 T 0.00032 HTH_23 pdb F T 7bqc 1 A A NF4 GSEEIRELVRKIYETVRKENPNVKILIFIIFTSDGTIKVIIVIIADDPNDAKRIVKKIQERFPKLTIKQSRNEEEAEKRIQKELEERNPNAEIQVVRSEDELKEILDKLDEKKGSWSLEHHHHHH 125 T 0.011 DUF6377 pdb F T 7bqd 1 A A NF8 GTILIFLDKNKEQAEKLAKEVGVTEIYESDNLEELYREIKERIERENPNATILTVTDPNELKKIQDEGKVDRIILLIKGSLEHHHHHH 88 T 0.023 Methyltransf_25 pdb F T 7bqe 1 A A NF3 GSDEEIRKKLEELAKRKGKDLQLRRYNDPNEVEKSIREALKKGRTLIIIINGVFVVVSTDEDLIREIKRLIKESNPNKKTLDVTTEEDLEEVLRRIKKGSWSLEHHHHHH 110 T 0.03 BtrH_N pdb F T 7bqm 1 A A Chantal GEEEKEIDKLVELFAQAYEDAREKKRNGTPEEWVRDAIEEAARRVGRSRSRVVEALRRYAEKHGKEELLKRAGITPEALKVIEKIEKEEGSLEHHHHHH 99 T 0.0054 HTH_28 pdb F T 7bqn 1 A A Rei GDEAEKQAERALELVRKSPDLLKKLLEAMAEELKRQGKSPDEIQKAKDEVKTKVEQAIREWKQGNEEQARKDMRKVLKSPAFKQAVKVMEEQEPNNPEVQELKKAMEEAERGSLEHHHHHH 121 T 0.024 EcoEI_R_C pdb F T 7bqq 1 A A Gogy GDERKLEEVTEEMRKMAENMDGQDPEKVKEIVRRALQQMANDNPEVSEQLRELAKRKGTSPSEVIKDLAEQVWRAMERAREGDKDTARELIRKFADDLGISPEQVKKFIKIMREVQRKEDGSLEHHHHHH 130 T 0.001 PSK_trans_fac pdbhh F T 7bqr 1 A A Mussoc GDEDKEKLKREAERALSEALSEFEKQGKITPETLKRLAEEIAEAALAQQQGDSERLEKAARRFAETLLRALKESGASAEEIEEAIERIRKALSKAPSPQLQKLANSPQWQTALQEAIKKARQEKKEKGSLEHHHHHH 137 T 0.0043 TMP_3 pdb F T 7bqs 1 A A Nomur GETKAKAAQEALRAAREQATTPEAQKALEELEKVLKTASPEQWRQAAEKIFEAFREASNGNTEKAKKLLEEAARTAGASPEIIKKLASALERLAEEGAAKEAARQAEEVRKRGSLEHHHHHH 122 T 0.0038 BTG pdb F T 7bqy 2 B C N-[(5-METHYLISOXAZOL-3-YL)CARBONYL]ALANYL-L-VALYL-N~1~-((1R,2Z)-4-(BENZYLOXY)-4-OXO-1-{[(3R)-2-OXOPYRROLIDIN-3-YL]METHYL}BUT-2-ENYL)-L-LEUCINAMIDE XAVLXX 6 T 1700 zf-H2C2_5 pdbhh F F 7br1 1 A,B A,B A0A7I6N400_PARLM Hydroxynitrile lyase SLTCDKLPKVIPPGIDAFTSHNPFEFSYVLTDDLDCTARVYVQPVHGLTNYSGTAFDIKGTHITINDFTIGADGLTAYLTNCDTGEKQVWHFQYVDLGDPQGANYCAYSCNGPQIAEYKCTTNTGYISPKQLQAVKEARSVPNGDKIHLAQVDCPPHLYCPLYY 164 T 2.6 VanY pdbhh F Eukaryota T 7bre 3 E,F C,F RBBP5_HUMAN RBBP-5,RETINOBLASTOMA-BINDING PROTEIN RBQ-3 SAFAPDFKELDENVEYEERESEFDIED 27 T 0.014 DUF2457 unppercent F Eukaryota T 7bsb 1 A,B,C,D,E,F,G,H,I,J,K,L,M,N B,G,A,D,E,F,C,H,I,J,K,L,M,N D7DTD6_METV3 lectin MAQNDNYIYSTEVGGVGGTPFTFMQESGTITSIKFNWSDQYKLLHHIEVKFINNANIYATGDPKGNHEVILEIDDDETIIGSVIGYKKGNDGRCTGVKLTTSKGKSIMAGYFEESLITTYTGKLAGIKGGAGSDIDRLGLIFLKK 145 T 0.00015 Jacalin pdbpssm F Archaea T 7bsm 1 A,B,C,D,E,F,G,H,I,J,K,L,M,N A,B,C,D,E,F,G,H,I,J,K,L,M,N D7DTD6_METV3 lectin MAQNDNYIYSTEVGGVGGTPFTFMQESGTITSIKFNWSDQYKLLHHIEVKFINNANIYATGDPKGNHEVILEIDDDETIIGSVIGYKKGNDGRCTGVKLTTSKGKSIMAGYFEESLITTYTGKLAGIKGGAGSDIDRLGLIFLKK 145 T 0.00015 Jacalin pdbpssm F Archaea T 7bsn 1 A,B,C,D,E,F,G,H,I,J,K,L,M,N A,B,C,D,E,F,G,H,I,J,K,L,M,N D7DTD6_METV3 lectin MAQNDNYIYSTEVGGVGGTPFTFMQESGTITSIKFNWSDQYKLLHHIEVKFINNANIYATGDPKGNHEVILEIDDDETIIGSVIGYKKGNDGRCTGVKLTTSKGKSIMAGYFEESLITTYTGKLAGIKGGAGSDIDRLGLIFLKK 145 T 0.00015 Jacalin pdbpssm F Archaea T 7bt8 1 A,B,C,D,E,F,G B,G,A,D,E,F,C D7DTD6_METV3 lectin DNYIYSTEVGGVGGTPFTFMQESGTITSIKFNWSDQYKLLHHIEVKFINNANIYATGDPKGNHEVILEIDDDETIIGSVIGYKKGNDGRCTGVKLTTSKGKSIMAGYFEESLITTYTGKLAGIKGGAGSDIDRLGLIFLK 140 T 0.00015 Jacalin unppssm F Archaea T 7bt9 1 A,B,C,D,E,F,G,H,I,J,K,L,M,N B,G,A,D,E,F,C,H,I,J,K,L,M,N D7DTD6_METV3 lectin MAQNDNYIYSTEVGGVGGTPFTFMQESGTITSIKFNWSDQYKLLHHIEVKFINNANIYATGDPKGNHEVILEIDDDETIIGSVIGYKKGNDGRCTGVKLTTSKGKSIMAGYFEESLITTYTGKLAGIKGGAGSDIDRLGLIFLKK 145 T 0.00015 Jacalin pdbpssm F Archaea T 7bte 2 F,G,H L,M,N AB140_YEAST Lifeact MGVADLIKKFESISKEE 17 T 1.6 Antimicrobial_8 pdbhh F Eukaryota T 7bth 1 A,B,C,D,E,F,G B,G,A,D,E,F,C D7DTD6_METV3 lectin MAQNDNYIYSTEVGGVGGTPFTFMQESGTITSIKFNWSDQYKLLHHIEVKFINNANIYATGDPKGNHEVILEIDDDETIIGSVIGYKKGNDGRCTGVKLTTSKGKSIMAGYFEESLITTYTGKLAGIKGGAGSDIDRLGLIFLKK 145 T 0.00015 Jacalin pdbpssm F Archaea T 7bti 2 F,G,H X,Y,Z PHAD2_AMAPH Phalloidin PAWXAXC 7 T 2.7 CSN7a_helixI pdbhh F Eukaryota F 7btl 1 A,B,C,D,E,F,G B,G,A,D,E,F,C D7DTD6_METV3 lectin DNYIYSTEVGGVGGTPFTFMQESGTITSIKFNWSDQYKLLHHIEVKFINNANIYATGDPKGNHEVILEIDDDETIIGSVIGYKKGNDGRCTGVKLTTSKGKSIMAGYFEESLITTYTGKLAGIKGGAGSDIDRLGLIFLK 140 T 0.00015 Jacalin unppssm F Archaea T 7bu5 2 B B SLX4_HUMAN BTB/POZ DOMAIN-CONTAINING PROTEIN 12 RKKNLPPKVPITPMPQYSIMETPVLKKELDRFGVRPLPKRQMVLKLKEIFQYTHQTLDSDS 61 T 0.067 SAP pdbpercent F Eukaryota T 7bv3 1 A,B A,B A0A346A6C4_SIRGR UGT TRANSFERASE KVELVFVPGPGIGHLSTALQIADLLLRRDHRLSVTVLSIPLPWEAKTTTQPESLFPSSTTTTTSRIRFISLPQRPLPDDAKGPFQFQAVFETQKQNVKEAVAKLSDSSILAGLVLDMFCVTMVDVAKQLGVPSYVFFTSSAGYLSFTSHLQDLSDRHGKETQQLMRSDVEIAVPGFTNPVPGKVIPGVYFNKNMAEWLHDCARRFRETNGILVNTFSELESQVMDSFSDATAASQFPAVYAVGPILSLNKNTSAASSESQSGDEILKWLDQQPPSSVVFLCFGSKGSLNPDQAREIAHALERSGHRFVWSLRQPSPKGKFEKPIEYDNIEDVLPEGFLDRTAEMGRVIGWAPQVEILGHPATGGFVSHCGWNSTLESLWYGVPIATWPMYAEQHFNAFEMGVELGLAVGISSESSIEEGVIVSAEKIEEGIRKLMGGGGGGGGGEVRKLVKAKSEESRKSVMEGGSSFTSLNRFIDEVMKSPF 483 T 1.7999999999999997E-24 UDPGT pdb F Eukaryota T 7bv4 2 B,D,F,H C,D,F,H STX17_HUMAN Syntaxin-17 NAAESWETLEADLIELSQLVTD 22 T 0.0009 Syntaxin unphh F Eukaryota T 7bv7 1 A,B A,B INT3_HUMAN INT3,SOSS COMPLEX SUBUNIT A,SENSOR OF SINGLE-STRAND DNA COMPLEX SUBUNIT A,SENSOR OF SSDNA SUBUNIT A TVVEEPVDITPYLDQLDESLRDKVLQLQKGSDTEAQCEVMQEIVDQVLEEDFDSEQLSVLASCLQELFKAHFRGEVLPEEITEESLEESVGKPLYLIFRNLCQMQEDNSSFSLLLDLLSELYQKQPKIGYHLLYYLRASKAAAGKMNLYESFAQATQLGDLHTCLMMDMKACQEDDVRLLCHLTPSIYTEFPDETLRSGELLNMIVAVIDSAQLQELVCHVMMGNLVMFRKDSVLNILIQSLDWETFEQYCAWQLFLAHNIPLETIIPILQHLKYKEHPEALSCLLLQLRREKPSEEMVKMVLSRPCHPDDQFTTSILRHWCMKHDELLAEHIKSLLIKNNSLPRKRQSLRSSSSKLAQLTLEQILEHLDNLRLNLTNTKQNFFSQTPILQALQHVQASCDEAHKMKFSDLFSLAEEYEDSSTKPPKSRRKAALSS 436 T 0.0083 IFRD pdbpssm F Eukaryota T 7bw5 1 A A A0A2M8WFL4_9SPHN lasso peptide koreensin GPKGDFPDVGDGRILAG 17 T 0.022 DUF5974 unphh F Bacteria T 7bwk 1 A,F A,F Q5ZYC6_LEGPH IcmO (DotL) GQNEPEPVEDIVEEEVEGALTIFSKLRIDPNAPPILVADKEVFSEPLLPINETRNQMITIERLAGAKDKYAGTVANELIKDFQIATSYPPEERDVIDVQELTGIIRDLSAKISAEREKANKKAAEELT 128 T 0.098 DUF1840 pdb F Bacteria T 7bwk 3 C,H C,H Q5ZS31_LEGPH IcmW MPDLSHEASAKYWFEYLDPMIYRVITFMESVENWTLDGNPELEEAMKQLGQELDDIEKIDLGLLAEEDKFIRIVGNIKSGRGLRLLQAIDTVHPGSASRVLIHAEETSLSSSDPAGFFLKRNIVFERLRLLSRVFCQYRLKLVLRALEGDE 151 T 0.11 DUF2335 unppercent F Bacteria T 7bwk 4 D,I D,I Q5ZY48_LEGPH Hypothetical virulence protein MADGDIEIKAGFVDTDLDDRKLTMIDDLNNPLAIVERVYLIWWHWADFHLHVISPHIDTITPAIVIEPELIPGSNDHEFVYSIHDSGSKLSTSKSQDMFSAGMSMCKLFYTIEKMVYILVERLKSGGVSMEAEVQIAFAGHEIAQRKAFESIINLPYNVVVTNFDPGIWGEKYLQNVKRLADKGYGYPPESPRKIYMHPVSSGTTARK 208 T 3.1 Herpes_TK_C pdbhh F Bacteria T 7bwk 5 E,J E,J Q5ZW60_LEGPH PNPLA domain-containing protein NSSQQQEQLKEKTMLFKSRLQSFKQGEGVKPWSQHVENAIDRLMSLKGEITKAQVDLGRTWFDIKSENADPAVRLKKFNDAFLASPLAKPSSNQQEINFSKEIRKEIDLLKGLPGLNNTSSHCTEEFNEQ 130 T 0.041 Antigen_Bd37 pdb F Bacteria T 7bx2 1 A A VAL-LYS-TRP-VAL-LYS-LYS-VAL-VAL-LYS-TRP-VAL-LYS-LYS-VAL VKWVKKVVKWVKKV 14 T 1.2 Pico_P2B pdbhh F F 7bxf 1 A A Q5ZTL3_LEGPH MvcA SMIVRGINMTKIKLESPGFMVHKKLKSMSQSYGVMMTGVPAEVLGQMQAERSIPSINKTGNLKQQIAKEVSKVCHMMTEPTQSCGQASNDVCELLLGKIEAEKFHFTKYEALSADGDNLKNVLENTAPSSTNLLIRFEIDREDPPIVLVKTKNENFNPETAVKNKIYLLENKLYFIDKMGNLFNLGPGKKKCTQLFNAIGDSAEYSLCDPFVLEEPEKPEDFAISEIVDIFNEQKERFDFWIGSHSFTIYIPQTLGESPRQFYPYQAYFGSHTLQDWFVSDKDEYLSRIGIDKYIEKLAVLGKTTNTKERSDIYAEFFSKRGREAFFCAHLNEKRQPLRVKFKITEINPELALKNLQETQEFIDTHPGENPSDKVENYRNRAKLAMTEHLESLLDIKPESS 401 T 0.023 AgrD pdb F Bacteria T 7bxg 1 A A Q5ZTL4_LEGPH MavC SMTTSKLEKTGLHVHEKIKHMVKNYGTMITGIPAEILGQNEAEISVGYVKKMGNMKENIAEVVRKSEMTQPTNSCGKASNEVCDLLLGTEGASEFEKSSYQVLSGDGSNLKGSLPNKNLLVRVEMDRFNAPQKYQKIKREEFNPETAEKNKIYLLEDQLVYLDIFGKVIDLGQTSDTCHRLFNAITTPFYQNYILYDEYIDPEESAEEAAMFEMGEIVKAKMKNIDCWTATHSFTIFVPESDSEDTRTLYPYQAYWTSHTLQQWFSGDKDEKLSRLGIDGYIEKLALLGTTTDSKIRSSIYGELFSPPGKEHVFCTGMNEKFSPLRVKFKVTEVNPEIALQNLEEVQEFIDTNYPGENAKDQCELYKIKAQEAMTKQLEMRLLIEP 386 T 0.035 V-ATPase_H_N unppercent F Bacteria T 7bxh 2 B B Q5ZTL4_LEGPH MavC SMTTSKLEKTGLHVHEKIKHMVKNYGTMITGIPAEILGQNEAEISVGYVKKMGNMKENIAEVVRKSEMTQPTNSCGKASNEVCDLLLGTEGASEFEKSSYQVLSGDGSNLKGSLPNKNLLVRVEMDRFNAPQKYQKIKREEFNPETAEKNKIYLLEDQLVYLDIFGKVIDLGQTSDTCHRLFNAITTPFYQNYILYDEYIDPEESAEEAAMFEMGEIVKAKMKNIDCWTATHSFTIFVPESDSEDTRTLYPYQAYWTSHTLQQWFSGDKDEKLSRLGIDGYIEKLALLGTTTDSKIRSSIYGELFSPPGKEHVFCTGMNEKFSPLRVKFKVTEVNPEIALQNLEEVQEFIDTNYPGENAKDQCELYKIKAQEAMTKQLEMRLLIEP 386 T 0.035 V-ATPase_H_N unppercent F Bacteria T 7bxt 7 K,L K,L F1NSD9_CHICK CENTROMERE PROTEIN CENP-C QKIVLPSNTPNVRRTKRIRLKPLEYWRGERVTYTLKPSGRL 41 T 0.073 DUF3141 pdbpercent F Eukaryota T 7bxv 3 C A TYR-GLU-VAL-HIS-HIS YEVHH 5 T 85 DR2241 pdbhh F F 7by7 1 A A GP46_BPSP1 Putative gene 46 protein MMTEDQKFKYLTKIEELEAGCFSDWTKEDITGDLKYLKKGIIEESIELIRAVNGLTYSEELHDFTQEIIEELDISPL 77 T 3.6 DUF1244 pdbhh T Viruses T 7byd 3 C,H C,H GLY-GLY-ALA-ILE GGAI 4 T 75 Glyco_transf_4 pdbhh F F 7bye 1 A,D A,D A6TJ72_KLEP7 Antitoxin MazE KAGPTLEELLGQCTAENRHHEYLCDSQGKEML 32 T 4.4 ACT_3 pdbhh F Bacteria T 7byf 2 B,F B,E NUP98_MOUSE Peptidase S59 domain-containing protein PTGTTIKFNPPTGTDTMVKAGVSTNISTKHQCITAMKEYESKSLEELRLEDYQANRK 57 T 14 PRC2_HTH_1 pdbhh F Eukaryota T 7bz7 1 A A lasso peptide LVVIVQADWNAPGWY 15 T 0.58 InlK_D3 pdbhh F T 7bz8 1 A A lasso peptide LVAIVQADWNAPGWF 15 T 0.94 DUF6446 pdbhh F T 7bz9 1 A A lasso peptide LVVAVQADWNAPGWF 15 T 1.1 DUF6446 pdbhh F T 7bza 1 A A lasso peptide LVVIVQADWNAPGWF 15 T 1.6 DUF6446 pdbhh F T 7bzh 1 A A D2PEW5_SULID Sul7s MEDVKQSVEKIIKDREWVTFNDLLKYIPYPAPEVYDALSQLIKENKVGRRGRYFYYIKR 59 T 1.8E-05 SelB-wing_1 pdbhh F Archaea T 7c06 2 B,E,H,K,N,Q,T,W,Z B,E,H,K,N,Q,T,W,Z U2AF2_SCHPO U2 AUXILIARY FACTOR 59 KDA SUBUNIT,U2AF59,U2 SNRNP AUXILIARY FACTOR LARGE SUBUNIT SSVGRSRSPPPSRERSVRSIEQELEQLRDVTPINQWKRKRSLWDIKPPGYELVTADQAKMSGVFPLPGA 69 T 9.8 Transformer unphh F Eukaryota T 7c07 2 B,E,H,K,N,Q,T,W,Z B,E,H,K,N,Q,T,W,Z U2AF2_SCHPO U2 AUXILIARY FACTOR 59 KDA SUBUNIT,U2AF59,U2 SNRNP AUXILIARY FACTOR LARGE SUBUNIT SSVGRSRSPPPSRERSVRSIEQELEQLRDVTPINQWKRKRSLWDIKPPGYELVTADQAKMSGVFPLPGA 69 T 9.8 Transformer unphh F Eukaryota T 7c08 2 B,E,H,K,N,Q,T,W,Z B,E,H,K,N,Q,T,W,Z U2AF2_SCHPO U2 AUXILIARY FACTOR 59 KDA SUBUNIT,U2AF59,U2 SNRNP AUXILIARY FACTOR LARGE SUBUNIT SSVGRSRSPPPSRERSVRSIEQELEQLRDVTPINQWKRKRSLWDIKPPGYELVTADQAKMSGVFPLPGA 69 T 9.8 Transformer unphh F Eukaryota T 7c0n 1 A,B A,B Self-assembling galactosylated tyrosine-rich peptide YYCYY 5 T 22 DUF4936 pdbhh F F 7c1m 2 B B TBA1A_HUMAN Carboxy-terminal peptide from tyrosinated alpha-tubulin VEGEGEEEGEEY 12 T 88 DUF6522 pdbhh F Eukaryota F 7c1x 1 A,B A,B Q94AT3_ARATH PfkB-like carbohydrate kinase family protein MEPVIIGALILDVHAKPSTTPISGTTVPGQVLFAPGGVARNVADCIFKLGITPFMIGTLGLDGPANVLLKEWKLSMKGILRREDISTPIVSLVYDTNGEVAAGVAGVDAVENFLTPEWIQRFEYNISSARLLMVDANLSSLALEASCKLAAESSVPVWFEPVSVTKSQRIASIAKYVTIVSPNQDELIAMANALCAKNLFHPFRSDENKLSIEDMFRALKPAILVLLKNGVKVVIVTLGSNGALLCSKGNPKKALNIDRKFLRSGEVFKRVQSVCSPNRFSELGSNRSPSLFAMHFPTIPAKVKKLTGAGDCLVGGTVASLSDGLDLIQSLAVGIASAKAAVESDDNVPPEFKLDLISGDAELVYNGAKMLMVHQSML 378 T 1.2E-18 PfkB pdbpercent F Eukaryota T 7c1y 1 A,B A,B Q94AT3_ARATH PfkB-like carbohydrate kinase family protein MEPVIIGALILDVHAKPSTTPISGTTVPGQVLFAPGGVARNVADCIFKLGITPFMIGTLGLDGPANVLLKEWKLSMKGILRREDISTPIVSLVYDTNGEVAAGVAGVDAVENFLTPEWIQRFEYNISSARLLMVDANLSSLALEASCKLAAESSVPVWFEPVSVTKSQRIASIAKYVTIVSPNQDELIAMANALCAKNLFHPFRSDENKLSIEDMFRALKPAILVLLKNGVKVVIVTLGSNGALLCSKGNPKKALNIDRKFLRSGEVFKRVQSVCSPNRFSELGSNRSPSLFAMHFPTIPAKVKKLTGAGDCLVGGTVASLSDGLDLIQSLAVGIASAKAAVESDDNVPPEFKLDLISGDAELVYNGAKMLMVHQSML 378 T 1.2E-18 PfkB pdbpercent F Eukaryota T 7c1z 1 A,B A,B Q94AT3_ARATH PfkB-like carbohydrate kinase family protein MEPVIIGALILDVHAKPSTTPISGTTVPGQVLFAPGGVARNVADCIFKLGITPFMIGTLGLDGPANVLLKEWKLSMKGILRREDISTPIVSLVYDTNGEVAAGVAGVDAVENFLTPEWIQRFEYNISSARLLMVDANLSSLALEASCKLAAESSVPVWFEPVSVTKSQRIASIAKYVTIVSPNQDELIAMANALCAKNLFHPFRSDENKLSIEDMFRALKPAILVLLKNGVKVVIVTLGSNGALLCSKGNPKKALNIDRKFLRSGEVFKRVQSVCSPNRFSELGSNRSPSLFAMHFPTIPAKVKKLTGAGDCLVGGTVASLSDGLDLIQSLAVGIASAKAAVESDDNVPPEFKLDLISGDAELVYNGAKMLMVHQSML 378 T 1.2E-18 PfkB pdbpercent F Eukaryota T 7c24 1 A A IMGH_KRIFD Isomaltose glucohydrolase MGSSHHHHHHSSGLVPRGSHMTTSARDTGLDSHELARLHELARHSHAVITRHQDAGGAYPAAPTFSAYRGYAWLRDGSFTAEGISRYGDVASAGRFHDWVDGVLRRRRGQVDDLLAAVDRGEVPSNEGMLPTRFTFDGNDGSDPWWDFQTDGYGMWLWSVVTHAARHGLDLERWRAGIDVAVDYLLAFWDRPCYDWWEEHVEHRHVSTLGAIHGGLVAVGTCAALRSAPWSAATLQVAARIRSLVSAEGVVDGHLVKWLGSSAVDGSLPACVVPFGLVPPDDDVAAMTRAAVAKDLDVDGGVHRFAADVYYGGGQWILLSALLGWNLAAAGDTAGALRHLRWIADQADADGDLPEQVPHHLLHPGSRAEWVARWGTVATPLLWSHGMYLILADELGLLPPAAKDA 405 T 7.9 Glyco_hydro_15 unppssm F Bacteria T 7c25 1 A A IMGH_KRIFD Isomaltose glucohydrolase MGSSHHHHHHSSGLVPRGSHMTTSARDTGLDSHELARLHELARHSHAVITRHQDAGGAYPAAPTFSAYRGYAWLRDGSFTAEGISRYGDVASAGRFHDWVDGVLRRRRGQVDDLLAAVDRGEVPSNEGMLPTRFTFDGNDGSDPWWDFQTDGYGMWLWSVVTHAARHGLDLERWRAGIDVAVDYLLAFWDRPCYDWWEEHVEHRHVSTLGAIHGGLVAVGTCAALRSAPWSAATLQVAARIRSLVSAEGVVDGHLVKWLGSSAVDGSLPACVVPFGLVPPDDDVAAMTRAAVAKDLDVDGGVHRFAADVFYGGGQWILLSALLGWNLAAAGDTAGALRHLRWIADQADADGDLPEQVPHHLLHPGSRAEWVARWGTVATPLLWSHGMYLILADELGLLPPAAKDA 405 T 7.9 Glyco_hydro_15 unppssm F Bacteria T 7c26 1 A A IMGH_KRIFD Isomaltose glucohydrolase MGSSHHHHHHSSGLVPRGSHMTTSARDTGLDSHELARLHELARHSHAVITRHQDAGGAYPAAPTFSAYRGYAWLRDGSFTAEGISRYGDVASAGRFHDWVDGVLRRRRGQVDDLLAAVDRGEVPSNEGMLPTRFTFDGNDGSDPWWDFQTDGYGMWLWSVVTHAARHGLDLERWRAGIDVAVDYLLAFWDRPCYDWWEEHVEHRHVSTLGAIHGGLVAVGTCAALRSAPWSAATLQVAARIRSLVSAEGVVDGHLVKWLGSSAVDGSLPACVVPFGLVPPDDDVAAMTRAAVAKDLDVDGGVHRFAADVFYGGGQWILLSALLGWNLAAAGDTAGALRHLRWIADQADADGDLPEQVPHHLLHPGSRAEWVARWGTVATPLLWSHGMYLILADELGLLPPAAKDA 405 T 7.9 Glyco_hydro_15 unppssm F Bacteria T 7c27 1 A A IMGH_KRIFD Isomaltose glucohydrolase MGSSHHHHHHSSGLVPRGSHMTTSARDTGLDSHELARLHELARHSHAVITRHQDAGGAYPAAPTFSAYRGYAWLRDGSFTAEGISRYGDVASAGRFHDWVDGVLRRRRGQVDDLLAAVDRGEVPSNEGMLPTRFTFDGNDGSDPWWDFQTDGYGMWLWSVVTHAARHGLDLERWRAGIDVAVDYLLAFWDRPCYDWWEEHVEHRHVSTLGAIHGGLVAVGTCAALRSAPWSAATLQVAARIRSLVSAEGVVDGHLVKWLGSSAVDGSLPACVVPFGLVPPDDDVAAMTRAAVAKDLDVDGGVHRFAADVYYGGGQWILLSALLGWNLAAAGDTAGALRHLRWIADQADADGDLPEQVPHHLLHPGSRAEWVARWGTVATPLLWSHGMYLILADELGLLPPAAKDA 405 T 7.9 Glyco_hydro_15 unppssm F Bacteria T 7c4j 3 C C SNF6_YEAST SWI/SNF COMPLEX COMPONENT SNF6 MGVIKKKRSHHGKASRQQYYSGVQVGGVGSMGAINNNIPSLTSFAEENNYQYGYSGSSAGMNGRSLTYAQQQLNKQRQDFERVRLRPEQLSNIIHDESDTISFRSNLLKNFISSNDAFNMLSLTTVPCDRIEKSRLFSEKTIRYLMQKQHEMKTQAAELQEKPLTPLKYTKLIAAAEDGSRSTKDMIDAVFEQDSHLRYQPDGVVVHRDDPALVGKLRGDLREAPADYWTHAYRDVLAQYHEAKERIRQKEVTAGEAQDEASLQQQQQQDLQQQQQVVTTVASQSPHATATEKEPVPAVVDDPLENMFGDYSNEPFNTNFDDEFGDLDAVFF 332 T 0.0043 CENP-Q pdbpercent F Eukaryota T 7c4j 8 I G Unkown XXXXXGRXXXXXPXXXXXXXXXXXXXXXVXXTXXVTXLXXXXXXXXXXXXXXXXXXXXXXXXXTRXYLRFHXXXYXXXXXXX 82 T 210 YlbE pdbhh F T 7c4u 1 A,B A,B Vancomycin XXNXXXX 7 T 95 P53_C pdbhh F F 7c4v 1 A,B A,B Vancomycin XXNXXXX 7 T 95 P53_C pdbhh F F 7c53 1 A,B,C,D,E,F A,B,C,D,E,F SPIKE_SARS2 Spike protein S2',pan-CoVs inhibitor EK1 GVTQNVLYENQKLIANQFNSAIGKIQDSLSSTASALGKLQDVVNQNAQALNTLVKQLSSNFGAISSGGRGGSLDQINVTFLDLEYEMKKLEEAIKKLEESYIDLKEL 107 T 1.2E-07 CoV_S2 pdbpssm T Viruses T 7c5v 1 A,B A,B Q8YT18_NOSS1 iota-carbonic anhydrase GSHDSGDKITATSSLKTPIVNRAITESEVLAAQKAWGEALVAISTTYDAKGKASAKALAEKVIDDAYGYQFGPVLFKPTLAISPRTFRTTRAGALAYFVGDDKAFPEDKGFALSSWRKVEIKNAAIFITGNTATTMGNVIITDKQGKATTVDKTWQFLKDDHGKLRIITHHSSLPYEQ 178 T 0.0077 SnoaL_3 unphh F Bacteria T 7c5w 1 A,B A,B Q8YT18_NOSS1 iota-carbonic anhydrase GSHDSGDKITATSSLKTPIVNRAITESEVLAAQKAWGEALVAISTTYDAKGKASAKALAEKVIDDAYGYQFGPVLFKPTLAISPRTFRTTRAGALAYFVGDDKAFPEDKGFALSSWRKVEIKNAAIFITGNTATTMGNVIITDKQGKATTVDKTWQFLKDDHGKLRIITHHSSLPYEQ 178 T 0.0077 SnoaL_3 unphh F Bacteria T 7c5x 1 A,B A,B iota-carbonic anhydrase GSHDATITEAEVLNAQSKWAEAIKTISRTYLNGGDYIKTAGDAAAELYGYGKSKVLFKPTKAAEFPFRPTGEEAMSYFVGGNAVEKGYKEDAGFAINGGKGWSNVVFNNHDIDINGNTAVAMGSYVFTCATTGTETKVEYTFGYKRNDDGKVRIFLHHSSVPYSESPAPVTLKEVTECQEKWANAIQTISKTYLDGGDYIGEAGKQAGILYGYGNTNVLFKPTKATDHPFRPTGEQAMSYFVGGDVVDNGYVGEDAGFAINGGKGWSKVVFRNHQVDLNGPVAIAMGDYVFTSAADGSETRVEYTFGYKRNDDGNVRIFVHHSSVPYKEEVAPITEAEVLECQKNWANAIQTISKTYLDGGDYIGEAGKQAGILYGYGNTNVLFKPTKATDHPFRPTGEEAMSYFVGGDVVENGYVGEDAGFAINGGKGWKNVVFRNHQLDFNGPVAIAMGDYVFTSAADNSETRVEYTFGYKRNPDGKPRIFLHHSSVPYKEEPVTNTIRKRLFASA 508 T 0.032 SnoaL_3 pdbhh F T 7c5y 1 A,B A,B iota-carbonic anhydrase GSHDATITEAEVLNAQSKWAEAIKTISRTYLNGGDYIKTAGDAAAELYGYGKSKVLFKPTKAAEFPFRPTGEEAMSYFVGGNAVEKGYKEDAGFAINGGKGWSNVVFNNHDIDINGNTAVAMGSYVFTCATTGTETKVEYTFGYKRNDDGKVRIFLHHSSVPYSESPAPVTLKEVTECQEKWANAIQTISKTYLDGGDYIGEAGKQAGILYGYGNTNVLFKPTKATDHPFRPTGEQAMSYFVGGDVVDNGYVGEDAGFAINGGKGWSKVVFRNHQVDLNGPVAIAMGDYVFTSAADGSETRVEYTFGYKRNDDGNVRIFVHHSSVPYKEEVAPITEAEVLECQKNWANAIQTISKTYLDGGDYIGEAGKQAGILYGYGNTNVLFKPTKATDHPFRPTGEEAMSYFVGGDVVENGYVGEDAGFAINGGKGWKNVVFRNHQLDFNGPVAIAMGDYVFTSAADNSETRVEYTFGYKRNPDGKPRIFLHHSSVPYKEEPVTNTIRKRLFASA 508 T 0.032 SnoaL_3 pdbhh F T 7c5z 1 A,B A,B A7J936_SVCV Phosphoprotein SWEEESTGIDLGFGPGIVMPSVSNHEGGTYVRYNGLGNVDPNYKNLISKMMRSLIGQIGNKYGYDIDLFDYQGDFLEVFLPHKPSK 86 T 0.022 Cass2 pdbpssm T Viruses T 7c6a 4 D B ANGT_HUMAN SAR1, ILE8-ANGIOTENSIN II XRVYIHPI 8 T 3 Ion_trans_N pdbhh F Eukaryota T 7c78 1 A A A0A2S2CJ39_9GAMM AcrIF9 GSMKAAYIIKEVQNINSEREGTQIEATSLSQAKRIASKEQCFHGTVMRIETVNGLWLAYKEDGKRWVDCQ 70 T 1.7 YhzD unphh F Bacteria T 7c7v 2 B C MED1_HUMAN Mediator of RNA polymerase II transcription subunit 1 KNHPMLMNLLKDN 13 T 3.8 Gb3_synth pdbhh F Eukaryota T 7c7w 2 B C MED1_HUMAN Mediator of RNA polymerase II transcription subunit 1 KNHPMLMNLLKDN 13 T 3.8 Gb3_synth pdbhh F Eukaryota T 7c8b 2 B D Z-VAD(OMe)-FMK XVAXX 5 T 1100 RE_HindIII pdbhh F F 7c8e 2 C,D C,D 9J10 LNRTPGRRRNSN 12 T 3.7 PSRT pdbhh F T 7cbc 1 A,B A,B De novo designed switch protein caging a hemagglutinin binder (sCageHA267_1S) MSELARKLLEASTKLQRLNIRLAEALLEAMARLQELNLELVYLAVELTDPKRIRDEIKEVKDKSKEIIRRAEKEIDDAAKESEKILEEAREAISGSGSYLAKLLLKAIAETQDLNLRAAKAFLEAAAKLQELNIRAVELLVKLYDPATIREALEHAKRRSKEIIDEAERAIRAAKRESERIIEEARRLIEKGSGSGSELARELLRAHAQLQRLNLELLRELLRALAQLQELNLDLLRLASELTDPDEARKAIARSKRESKRIVEDAERGGGTFACRIAAKIAAEFGYSEEQIKELLKNAGCSEDEARDAVEYLRSRPGL 319 T 0.012 PhoU pdbpercent F T 7cc9 1 A,B,C A,B,C A0A0M4DML1_STRPR HNHc domain-containing protein LTDTDRSEDFLRRVRGLKAARTANGPRLYQPITLLWAVGRARRGEARTLAWADTDEAIGALLKRHGARGERPRPDYPVLALHRAGLWTLEGHVGEVPTAHGDSALRNWFAEQRPVGGLAEPFHDLLHRSGHSRVSVIEALLTTYFAGLDPVPLLEDTGLYDEG 163 T 20 TMEM214 pdbhh F Bacteria T 7ccd 1 A,D A,B A0A0M4DML1_STRPR HNHc domain-containing protein PLTDTDRSEDFLRRVRGLKAARTANGPRLYQPITLLWAVGRARRGEARTLAWADTDEAIGALLKRHGARGERPRPDYPVLALHRAGLWTLEGHVGEVPTAHGDSALRNWFAEQRPVGGLAEPFHDLLHRSGHSRVSVIEALLTTYFAGLDPVPLLEDTGLYDEGHHHHH 169 T 20 DUF4014 pdbhh F Bacteria T 7ccj 1 A,D A,B A0A0M4DML1_STRPR HNHc domain-containing protein MPLTDTDRSEDFLRRVRGLKAARTANGPRLYQPITLLWAVGRARRGEARTLAWADTDEAIGALLKRHGARGERPRPDYPVLALHRAGLWTLEGHVGEVPTAHGDSALRNWFAEQRPVGGLAEPFHDLLHRSGHSRVSVIEALLTTYFAGLDPVPLLEDTGLYDEG 165 T 0.26 Hemerythrin pdbpercent F Bacteria T 7ccn 1 A A LBT3 FIDTNNDGWIEGDELLA 17 T 0.0037 EF-hand_6 pdb F T 7cco 1 A A LBT3 FIDTNNDGWIEGDELLA 17 T 0.0037 EF-hand_6 pdb F T 7cdb 2 C C GBRG2_MOUSE GABA(A) RECEPTOR SUBUNIT GAMMA-2 ERDEEYGYECLDGKDCAS 18 T 11 FOLN pdbhh F Eukaryota T 7cdc 3 C C PRO-ARG-SER-PHE-LEU-VAL-ARG-ARG-PRO PRSFLVRRP 9 T 2.9 pPIWI_RE_Y pdbhh F T 7cdd 3 C C PRO-ARG-SER-PHE-LEU-VAL-ARG-ARG PRSFLVRR 8 T 9.6 HOOK pdbhh F T 7cde 3 C C PRO-ARG-SER-PHE-LEU-VAL-ARG-LYS-ARG PRSFLVRKR 9 T 1.2 HOOK pdbhh F T 7cdf 3 C C PRO-ARG-SER-PHE-LEU-VAL-ARG-ARG-LYS PRSFLVRRK 9 T 2.3 hNIFK_binding pdbhh F T 7cdg 3 C C PRO-ARG-SER-PHE-LEU-VAL-ARG-ARG-ARG PRSFLVRRR 9 T 3.5 HOOK pdbhh F T 7cfc 2 F,G,H,I F,G,H,I AGO3_DROME AGO3 NISVGRGRARLIDTLK 16 T 2.8 DUF1343 pdbhh F Eukaryota T 7cfd 2 B,D,E,H,L,N,O,P I,J,K,L,N,M,Z,O AUB_DROME PROTEIN STING NPVIARGRGXGRK 13 T 0.023 Tristanin_u2 pdbhh F Eukaryota T 7cg0 1 A,B,C,D,E 0,1,2,3,4 Flagellar MS ring L2 XXXXXXXXXXXXXXX 15 F F F 7cg1 1 A A A0A0L6JMH4_9FIRM Anti-sigma factor RsgI, N-terminal MVEIAINPASEITATSAFISGTVTKFEQSKGFYGSGCNISLLYWEASNPMHVKVASSISKKDFPADISATIKDLKPHTTYQFKVTVNFYFSSSLQTFKTLALESKSTSIVSTSTPTPSMPVKVTLEHHHHHH 132 T 0.0018 fn3 pdb F Bacteria T 7cg5 1 A A A0A0L6JMH4_9FIRM Anti-sigma factor RsgI, N-terminal MVEIAINPASEITATSAFISGTVTKFEQSKGFYGSGCNISLLYWEASNPMHVKVASSISKKDFPADISATIKDLKPHTTYQFKVTVNFYFSSSLQTFKTLALESKSTSIVSTSTPTPSMPVKVTLEHHHHHH 132 T 0.0018 fn3 pdb F Bacteria T 7cg8 1 A,B,C,D A,B,C,D A0A0L6JMH4_9FIRM Anti-sigma factor RsgI, N-terminal SVSPVEIAINPASEITATSAFISGTVTKFEQSKGFYGSGCNISLLYWEASNPMHVKVASSISKKDFPADISATIKDLKPHTTYQFKVTVNFYFSSSLQTFKTLAL 105 T 0.00081 fn3 pdb F Bacteria T 7cgo 3 AA,BA,CA,DA,EA 0,1,2,3,4 Flagellar MS ring L2 XXXXXXXXXXXXXXX 15 F F F 7cgo 13 DD,ED,FD,GD,WC GC,GE,GD,GB,GA FlgB-Dc loop XXXXXXXXXXXX 12 F F F 7cgo 14 AD,BD,CD,XC,YC,ZC GI,GJ,GK,GF,GG,GH FliE helix 1 XXXXXXXXXXXXXXXXXX 18 F F F 7chk 2 B B Q9JGP1_9SECO VP24 protein GSDPFSFLLNYSHCGTLVESSLNKGGMWCVPVSPVNLAAYTLQGEALVFNDAFVSKTHNWLHFMASTTAYWRGTLHYQMRVTYKDRNAACRNLVAFYTTNNESLFGFNNKPVGDTGISSVMGDSFSVDITVPFLIPTCYLQTIRGKFDYLNSCNGCIYFHLPTKSATSVQLWVRPGQDFDFARFRLLKAGYT 192 T 1.4E-05 CRPV_capsid pdbhh T Viruses T 7chq 1 A A A0A125RN64_9CAUD anti-CRISPR AcrIE2 MNTYLIDPRKNNDNSGERFTVDAVDITAAAKSAAQQILGEEFEGLVYRETGESNGSGMFQAYHHLHGTNRTETTVGYPFHVMELLEHHHHHH 92 T 9.2 Baculo_E66 unphh T Viruses T 7chr 1 A A A0A2S2CJ39_9GAMM anti-CRISPR AcrIF9 MKAAYIIKEVQNINSEREGTQIEATSLSQAKRIASKEQCFHGTVMRIETVNGLWLAYKEDGKRWVDCQLEHHHHHH 76 T 1.7 YhzD unphh F Bacteria T 7ci1 1 A,B A,B AcrVA2 SMHHTIARMNAFNKAFANAKDCYKKMQAWHLLNKPKHAFFPMQNTPALDNGLAALYELRGGKEDAHILSILSRLYLYGAWRNTLGIYQLDEEIIKDCKELPDDTPTSIFLNLPDWCVYVDISSAQIATFDDGVAKHIKGFWAIYDIVEMNGINHDVLDFVVDTDTDDNVYVPQPFILSSGQSVAEVLDYGASLFDDDTSNTLIKGLLPYLLWLCVAEPDITYKGLPVSREELTRPKHSINKKTGAFVTPSEPFIYQIGERLGSEVRRYQSIIDGEQKRNRPHTKRPHIRRGHWHGYWQGTGQAKEFRVRWQPAVFVNSGRVSS 323 T 10 SeqA_N pdbhh F T 7ci2 1 A,B A,B AcrVA2 SMHHTIARMNAFNKAFANAKDCYKKMQAWHLLNKPKHAFFPMQNTPALDNGLAALYELRGGKEDAHILSILSRLYLYGAWRNTLGIYQLDEEIIKDCKELPDDTPTSIFLNLPDWCVYVDISSAQIATFDDGVAKHIKGFWAIYDIVEMNGINHDVLDFVVDTDTDDNVYVPQPFILSSGQSVAEVLDYGASLFDDDTSNTLIKGLLPYLLWLCVAEPDITYKGLPVSREELTRPKHSINKKTGAFVTPSEPFIYQIGERLGSEVRRYQSIIDGEQKRNRPHTKRPHIRRGHWHGYWQGTGQAKEFRVRWQPAVFVNSGRVSS 323 T 10 SeqA_N pdbhh F T 7ci2 2 C,D C,D A0A0U2B2X7_9GAMM MbCpf1 NTGKSVYQKMIYKLLPGPNKMLPKVFFAKSNLD 33 T 23 DUF5100 pdbhh F Bacteria T 7cio 2 B B CTLA4_HUMAN CYTOTOXIC T-LYMPHOCYTE-ASSOCIATED ANTIGEN 4,CTLA-4 GVXVKMPP 8 T 0.2 TMEM190 unppssm F Eukaryota T 7ciz 4 J,K,L D,H,L DNJC9_HUMAN HDJC9,DNAJ PROTEIN SB73 GPLGSKESKQKMNARKRRAQEEAKEAEMSRKELGLDEGVDSLKAAIQSRQKDRQKEMDNFLAQMEAKYSKSSKGG 75 T 0.032 CobN-Mg_chel pdbpercent F Eukaryota T 7cj0 4 D,H D,A DNJC9_HUMAN HDJC9,DNAJ PROTEIN SB73 GPLGSEVPSYNAFVKESKQKMNARKRRAQEEAKEAEMSRKELGLDEGVDSLKAAIQSRQKDRQKEMDNFLAQMEAKYSKSSKGG 84 T 0.0043 CobN-Mg_chel pdbpssm F Eukaryota T 7ck5 1 A A B1B578_P1AMV PlAMV replicase peptide from RNA-dependent RNA polymerase FEDILSGNLLQRMLRPLRSGLTQLLDFF 28 T 11 Lambda_CIII pdbhh T Viruses T 7cl0 1 A A SIR6_HUMAN REGULATORY PROTEIN SIR2 HOMOLOG 6,SIR2-LIKE PROTEIN 6 MSVNYAAGLSPYADKGKCGLPEIFDPPEELERKVWELARLVWQSSSVVFHTGAGISTASGIPDFRGPHGVWTMEERGLAPKFDTTFESARPTQTHMALVQLERVGLLRFLVSQNVDGLHVRSGFPRDKLAELHGNMFVEECAKCKTQYVRDTVVGTMGLKATGRLCTVAKARGLRACRGELRDTILDWEDSLPDRDLALADEASRNADLSITLGTSLQIRPSGNLPLATKRRGGRLVIVNLQPTKHDRHADLRIHGYVDEVMTRLMKHLGLEIPAWDGPRVLERALPPLPRPPTPKLEPKEESPTRINGSIPAGPKQEPCAQHNGSEPASPKRERPTSPAPHRPPKRVKAKAVPS 355 T 3.2E-07 SIR2 unppercent F Eukaryota T 7cl1 1 A A SIR6_HUMAN REGULATORY PROTEIN SIR2 HOMOLOG 6,SIR2-LIKE PROTEIN 6 MSVNYAAGLSPYADKGKCGLPEIFDPPEELERKVWELARLVWQSSSVVFHTGAGISTASGIPDFRGPHGVWTMEERGLAPKFDTTFESARPTQTHMALVQLERVGLLRFLVSQNVDGLHVRSGFPRDKLAELHGNMFVEECAKCKTQYVRDTVVGTMGLKATGRLCTVAKARGLRACRGELRDTILDWEDSLPDRDLALADEASRNADLSITLGTSLQIRPSGNLPLATKRRGGRLVIVNLQPTKHDRHADLRIHGYVDEVMTRLMKHLGLEIPAWDGPRVLERALPPLPRPPTPKLEPKEESPTRINGSIPAGPKQEPCAQHNGSEPASPKRERPTSPAPHRPPKRVKAKAVPS 355 T 3.2E-07 SIR2 unppercent F Eukaryota T 7clv 2 C C COX4_YEAST COX4 isoform 1 MLSLRQSIRFFKPATRTLCSSRYLL 25 T 9.7 OTCace_N pdbhh F Eukaryota T 7cma 1 A,B A,C A0A2X0TC55_ASF OXYGENASE MNKKIIVMMALLHKEKLIECIYHELENGGTILLLTKNIVVSEISYIGNTYKYFTFNDNHDLISKEDLKGATSKNIAKMIYNWIIKNPQNNKIWSGEPRTQIYFENDLYHTNYNHKCIKDFWNVSTSVGPHIFNDRSIWCTKCTSFYPFTNIMSPNIFQ 158 T 0.12 ox_reductase_C pdbpssm T Viruses T 7cmx 1 A,B,C,D A,B,C,D Q81GQ9_BACCR Isocitrate lyase MKNERIEKLQESWELDERWEGITRPYSAEDVIRLRGSIDIEHTLARRGAEKLWTSLHTEDYINALGALTGNQAMQQVKAGLKAIYLSGWQVAADANLSGHMYPDQSLYPANSVPAVVKRINQTLQRADQIQHMEGSDDTDYFVPIVADAEAGFGGQLNVFELMKGMIEAGASGVHFEDQLSSEKKCGHLGGKVLLPTQTAVRNLISARLAADVMGVPTIIVARTDADAADLITSDIDPVDKAFITGERTPEGFYRTNAGLDQAIARGLAYAPYADLVWCETSEPNLEDAKRFADAIHKEHPGKLLAYNCSPSFNWKQKLDEKAIASFQKEIASYGYKFQFVTLAGFHSLNYGMFELARGYKERGMAAYSELQQAEFAAEKHGYSATRHQREVGTGYFDEVAQVITGGTSSTTALKGSTEEAQFTKLEHHHHHH 433 T 1.6E-47 ICL pdb F Bacteria T 7cmy 1 A,B C,A Q81GQ9_BACCR Isocitrate lyase MKNERIEKLQESWELDERWEGITRPYSAEDVIRLRGSIDIEHTLARRGAEKLWTSLHTEDYINALGALTGNQAMQQVKAGLKAIYLSGWQVAADANLSGHMYPDQSLYPANSVPAVVKRINQTLQRADQIQHMEGSDDTDYFVPIVADAEAGFGGQLNVFELMKGMIEAGASGVHFEDQLSSEKKCGHLGGKVLLPTQTAVRNLISARLAADVMGVPTIIVARTDADAADLITSDIDPVDKAFITGERTPEGFYRTNAGLDQAIARGLAYAPYADLVWCETSEPNLEDAKRFADAIHKEHPGKLLAYNCSPSFNWKQKLDEKAIASFQKEIASYGYKFQFVTLAGFHSLNYGMFELARGYKERGMAAYSELQQAEFAAEKHGYSATRHQREVGTGYFDEVAQVITGGTSSTTALKGSTEEAQFTKLEHHHHHH 433 T 1.6E-47 ICL pdb F Bacteria T 7cmz 2 B B PHF8_HUMAN PHD FINGER PROTEIN 8,[HISTONE H3]-DIMETHYL-L-LYSINE(36) DEMETHYLASE PHF8,[HISTONE H3]-DIMETHYL-L-LYSINE(9) DEMETHYLASE PHF8 GACFKDAEYIYPSLESDDDDPA 22 T 8.7 Ph1570 pdbhh F Eukaryota T 7cn6 1 A,B,C A,B,C SPAC_BPT4 Protein spackle GYDKDLCEWSMTADQTEVETQIEADIMNIVKRDRPEMKAEVQKQLKSGGVMQYNYVLYCDKNFNNKNIIAEVVGE 75 T 0.068 Autoind_synth pdb T Viruses T 7cn7 2 B C SPAC_BPT4 Protein spackle GYDKDLCEWSMTADQTEVETQIEADIMNIVKRDRPEMKAEVQKQLKSGGVMQYNYVLYCDKNFNNKNIIAEVVGE 75 T 0.068 Autoind_synth pdb T Viruses T 7cna 2 B,D B,E SPNDC_HUMAN SPIN1-DOCKING PROTEIN,SPIN-DOC ETFAAPAEVRHFTDGSFPAGFVLQLFSHTQ 30 T 29 DUF2852 pdbhh F Eukaryota T 7cna 4 F F ALA-ARG-THR-M3L-GLN-THR-ALA-ARG-M3L-SER-GLY ARTKQTARKSGG 12 T 0.24 Histone pdbhh F T 7cnc 2 B B DGCR8_HUMAN DIGEORGE SYNDROME CRITICAL REGION 8 PRTARHAPAVRKFSPDLKLLKDVKISVSFTE 31 T 7.7 CoV_NSP15_M pdbhh F Eukaryota T 7cnw 2 B,D B,D PSD_ECOLI Phosphatidylserine decarboxylase alpha chain XTVINLFAPGKVNLVEQLESLSVTKIGQPLAVSTGHHHHHHG 42 T 13 Herpes_UL51 pdbhh F Bacteria T 7cnx 2 B,D,F,H B,D,F,H PSD_ECOLI Phosphatidylserine decarboxylase alpha chain XTVINLFAPGKVNLVEQLESLSVTKIGQPLAVSTGHHHHHHG 42 T 13 Herpes_UL51 pdbhh F Bacteria T 7cny 2 B,D B,D PSD_ECOLI Phosphatidylserine decarboxylase alpha chain XTVINLFAPGKVNLVEQLESLSVTKIGQPLAVSTGHHHHHHG 42 T 13 Herpes_UL51 pdbhh F Bacteria T 7cnz 2 B,D,F,H B,D,F,H PSD_ECOLI Phosphatidylserine decarboxylase alpha chain XTVINLFAPGKVNLVEQLESLSVTKIGQPLAVSTGHHHHHHG 42 T 13 Herpes_UL51 pdbhh F Bacteria T 7co1 2 B,D,F B,D,F CBP_HUMAN HISTONE LYSINE ACETYLTRANSFERASE CREBBP,PROTEIN-LYSINE ACETYLTRANSFERASE CREBBP GPPPAAVEAARQIEREAQQQQHLYSDED 28 T 0.8 WPP pdbhh F Eukaryota T 7co2 1 A C TRP-VAL-PHE WVF 3 T 31 DUF1455 pdbhh F F 7co3 1 A C TRP-VAL-PHE WVF 3 T 31 DUF1455 pdbhh F F 7co5 1 A,C,E,G,I,K G,A,C,E,I,K decapeptide SVRDELRWVF SVRDELRWVF 10 T 9.7 Chisel pdbhh F T 7co7 1 A C decapeptide SVRDELRWVF SVRDELRWVF 10 T 9.7 Chisel pdbhh F T 7coy 6 BA,F,Q cF,aF,bF B0C7S7_ACAM1 Photosystem I protein PsaF MRRLFAVLLVMTLFLGVVPPASADIGGLVPCSESPKFQERAAKARNTTADPNSGQKRFEMYSSALCGPEDGLPRIIAGGPMRRAGDFLIPGLFFIYIAGGIGNSSRNYQIANRKKNAKNPAMGEIIIDVPLAVSSTIAGMAWPLTAFRELTSGELTVPDSDVTVSPR 167 T 3.4E-06 PSI_PsaF pdbpssm F Bacteria T 7coy 7 CA,G,R cI,aI,bI Photosystem I protein Psa27 MISDILPAIMTPLVVLIGGGAAMTAFFYYVEREG 34 T 0.0026 PSI_8 pdbpercent F T 7cp1 1 A,B A,B ACEA_MYCTU ICL,ISOCITRASE,ISOCITRATASE MSVVGTPKSAEQIQQEWDTNPRWKDVTRTYSAEDVVALQGSVVEEHTLARRGAEVLWEQLHDLEWVNALGALTGNMAVQQVRAGLKAIYLSGWQVAGDANLSGHTYPDQSLYPANSVPQVVRRINNALQRADQIAKIEGDTSVENWLAPIVADGEAGFGGALNVYELQKALIAAGVAGSHWEDQLASEKKCGHLGGKVLIPTQQHIRTLTSARLAADVADVPTVVIARTDAEAATLITSDVDERDQPFITGERTREGFYRTKNGIEPCIARAKAYAPFADLIWMETGTPDLEAARQFSEAVKAEYPDQMLAYNCSPSFNWKKHLDDATIAKFQKELAAMGFKFQFITLAGFHALNYSMFDLAYGYAQNQMSAYVELQEREFAAEERGYTATKHQREVGAGYFDRIATTVDPNSSTTALTGSTEEGQFHLEHHHHHH 436 T 1.8E-47 ICL unp F Bacteria T 7cp2 1 A,B,C A,B,C PP62_ASFB7 CP530R CDS PROTEIN,CP530R PROTEIN,PP62 PSNMKQFCKISVWLQQHDPDLLEIINNLCMLGNLSAAKYKHGVTFIYPKQAKIRDEIKKHAYSNDPSQAIKTLESLILPFYIPTPAEFTGEIGSYTGVKLEVEKTEANKVILKNGEAVLVPAADFKPFPDRRLAVWIMESGSMPLEGPPYKR 152 T 0.34 Fasciclin unppssm T Viruses T 7cpo 3 C C HIS-VAL-TYR-GLY-PRO-LEU-LYS-PRO-ILE HVYGPLKPI 9 T 0.63 DUF952 pdbhh F T 7cqh 2 B A TRP_DROME Transient receptor potential protein GPMNQTQLIEFNPNLGDVTRATRVAYVKFMRKKMAADEVSLADD 44 T 0.18 LEM pdbpssm F Eukaryota T 7cqp 2 B C TRPC4_MOUSE TRPC4,CAPACITATIVE CALCIUM ENTRY CHANNEL TRP4,RECEPTOR-ACTIVATED CATION CHANNEL TRP4 GPDKRKNLSLFDLTTLIHPRSAAIASERHN 30 T 4.1 Pox_RNA_Pol_19 pdbhh F Eukaryota T 7cqv 2 C B TRP_DROME Transient receptor potential protein GPNNNWDVPDIEKKSQGVARTTKGKVMERRILKDFQIGFVENLKQEMSESESGRDIFSSLAKVIGRKKTQKGDKDWNAIARK 82 T 5.3 DUF1331 pdbhh F Eukaryota T 7crb 1 A J ATR1_HYAAE ARABIDOPSIS THALIANA RECOGNIZED PROTEIN 1 MRVCYFVLVPSVALAVIATESSETSGTIVHVFPLRDVADHRNDALINRALRAQTALDDDEERWPFGPSAVEALIETIDRHGRVSLNDEAKMKKVVRTWKKLIERDDLIGEIGKHYFEAPGPLHDTYDEALATRLVTTYSDRGVARAILHTRPSDPLSKKAGQAHRLEEAVASLWKGRGYTSDNVVSSIATGHDVDFFAPTAFTFLVKCVESEDDANNAIFEYFGSNPSRYFSAVLHAMEKPDADSRVLESSKKWMFQCYAQKQFPTPVFERTLAAYQSEDYAIRGARNHYEKLSLSQIEELVEEYSRIYSV 311 T 0.0016 RXLR pdbhh F Eukaryota T 7crc 2 D,E,F,G E,H,F,G ATR1_HYAAE ARABIDOPSIS THALIANA RECOGNIZED PROTEIN 1 MRVCYFVLVPSVALAVIATESSETSGTIVHVFPLRDVADHRNDALINRALRAQTALDDDEERWPFGPSAVEALIETIDRHGRVSLNDEAKMKKVVRTWKKLIERDDLIGEIGKHYFEAPGPLHDTYDEALATRLVTTYSDRGVARAILHTRPSDPLSKKAGQAHRLEEAVASLWKGRGYTSDNVVSSIATGHDVDFFAPTAFTFLVKCVESEDDANNAIFEYFGSNPSRYFSAVLHAMEKPDADSRVLESSKKWMFQCYAQKQFPTPVFERTLAAYQSEDYAIRGARNHYEKLSLSQIEELVEEYSRIYSV 311 T 0.0016 RXLR pdbhh F Eukaryota T 7cu6 1 A A lasso peptide C24_A11V2C LCVIVQADWNCPGWF 15 T 1.3 Exo_endo_phos pdbhh F T 7cui 1 A,C A,C POT1_SCHPO Protection of telomeres protein 1 SENPFIAHELKQTSVNEITAHVINEPASLKLTTISTILHAPLQNLLKPRKHRLRVQVVDFWPKSLTQFAVLSQPPSSYVWMFALLVRDVSNVTLPVIFFDSDAAELINSSKIQPCNLADHPQMTLQLKERLFLIWGNLEERIQHHISKGESPTLAAEDVETPWFDIYVKEYIPVIGNTKDHQSLTFLQKRWRGFGTKIV 199 T 7.8 CDC24_OB3 pdbhh F Eukaryota T 7cui 2 B,D B,D TPZ1_SCHPO MEIOTICALLY UP-REGULATED GENE 169 PROTEIN SQQEKPNDNTSNSRDIKNNIQFHWKNMTSLSIEECIIPKGQQLILEKESEENTTHGIYLEERKMAQGLHNSVSETPE 77 T 0.0062 TEBP_beta unphh F Eukaryota T 7cuj 1 A,C A,B CCQ1_SCHPO STRUCTURAL MAINTENANCE OF CHROMOSOMES PROTEIN CCQ1,SMC PROTEIN CCQ1 ITKSSKSSFSVLDIGLPMSALQRKMMHRLVQYFAFCIDHFCTGPSDSRIQEKIRLFIQSAHNIAKHPSLYDTEVRNPSAAESTNSHVSLDASNFSSYAENSSKFLFLQELFKNLSPSYSKTFFLFISNQFLANTLTQWLKSQNIDAELWAEEDAKTSQHPAIWICVSKKAPSASHFLQSCPDLSATIFYDIEAYMSVTSSLPSIQSLVLRLIHLGSIEHAIKCFQSSYNASFLVNIVGVVATLSSSSEENSEASNLSTLFEKSGNFEEILGSESHSSITEKTRDIAKNVATWLKNGENFSSWPLPPLMDLASLSVAE 317 T 0.00039 HDA2-3 unppssm F Eukaryota T 7cuj 2 B,D C,D TPZ1_SCHPO MEIOTICALLY UP-REGULATED GENE 169 PROTEIN QIELEYKRKPIPDYDFMKGLETTLQELYVEHQSKKRRLELFQLTN 45 T 0.071 Radical_SAM_N unp F Eukaryota T 7cun 7 G H INT8_HUMAN INT8 MSAEAADREAATSSRPCTPPQTCWFEFLLEESLLEKHLRKPCPDPAPVQLIVQFLEQASKPSVNEQNQVQPPPDNKRNRILKLLALKVAAHLKWDLDILEKSLSVPVLNMLLNELLCISKVPPGTKHVDMDLATLPPTTAMAVLLYNRWAIRTIVQSSFPVKQAKPGPPQLSVMNQMQQEKELTENILKVLKEQAADSILVLEAALKLNKDLYVHTMRTLDLLAMEPGMVNGETESSTAGLKVKTEEMQCQVCYDLGAAYFQQGSTNSAVYENAREKFFRTKELIAEIGSLSLHCTIDEKRLAGYCQACDVLVPSSDSTSQQLTPYSQVHICLRSGNYQEVIQIFIEDNLTLSLPVQFRQSVLRELFKKAQQGNEALDEICFKVCACNTVRDILEGRTISVQFNQLFLRPNKEKIDFLLEVCSRSVNLEKASESLKGNMAAFLKNVCLGLEDLQYVFMISSHELFITLLKDEERKLLVDQMRKRSPRVNLCIKPVTSFYDIPASASVNIGQLEHQLILSVDPWRIRQILIELHGMTSERQFWTVSNKWEVPSVYSGVILGIKDNLTRDLVYILMAKGLHCSTVKDFSHAKQLFAACLELVTEFSPKLRQVMLNEMLLLDIHTHEAGTGQAGERPPSDLISRVRGYLEMRLPDIPLRQVIAEECVAFMLNWRENEYLTLQVPAFLLQSNPYVKLGQLLAATCKELPGPKESRRTAKDLWEVVVQICSVSSQHKRGNDGRVSLIKQRESTLGIMYRSELLSFIKKLREPLVLTIILSLFVKLHNVREDIVNDITAEHISIWPSSIPNLQSVDFEAVAITVKELVRYTLSINPNNHSWLIIQADIYFATNQYSAALHYYLQAGAVCSDFFNKAVPPDVYTDQVIKRMIKCCSLLNCHTQVAILCQFLREIDYKTAFKSLQEQNSHDAMDSYYDYIWDVTILEYLTYLHHKRGETDKRQIAIKAIGQTELNASNPEEVLQLAAQRRKKKFLQAMAKLYF 995 T 0.0069 TPR_12 pdbpercent F Eukaryota T 7cun 12 L U unknown XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 594 F F F 7cut 2 B B Z-VAD(OMe)-FMK XVAXX 5 T 1100 RE_HindIII pdbhh F F 7cwj 1 A,B,C,D A,B,C,D G9MQD3_HYPVG Root induced effector protein Tsp1 MAAPTPADKSMMAAVPEWTITNLKRVCNAGNTSCTWTFGVDTHLATATSCTYVVKANANASQASGGPVTCGPYTITSSWSGQFGPNNGFTTFAVTDFSKKLIVWPAYTDVQVQAGKVVSPNQSYAPANLPLEHHHHHH 138 T 5.5 DUF6520 unphh F Eukaryota T 7cwp 1 A,B,C,D A,B,C,D G9MQD3_HYPVG Root induced signalling protein MAAPTPADKSMMAAVPEWTITNLKRVCNAGNTSCTWTFGVDTHLATATSCTYVVKANANASQASGGPVTCGPYTITSSWSGQFGPNNGFTTFAVTDFSKKLIVWPAYTDVQVQAGKVVSPNQSYAPANLPLEHHHHHH 138 T 5.5 DUF6520 unphh F Eukaryota T 7cwz 1 A,B,C A,B,C TYRDC_ENTFA TYROSINE DECARBOXYLASE MKNEKLAKGEMNLNALFIGDKAENGQLYKDLLIDLVDEHLGWRQNYMPQDMPVISSQERTSKSYEKTVNHMKDVLNEISSRMRTHSVPWHTAGRYWGHMNSETLMPSLLAYNFAMLWNGNNVAYESSPATSQMEEEVGHEFAHLMSYKNGWGHIVADGSLANLEGLWYARNIKSLPFAMKEVKPELVAGKSDWELLNMPTKEIMDLLESAEDEIDEIKAHSARSGKHLQAIGKWLVPQTKHYSWLKAADIIGIGLDQVIPVPVDHNYRMDINELEKIVRGLAEEQIPVLGVVGVVGSTEEGAVDSIDKIIALRDELMKDGIYYYVHVDAAYGGYGRAIFLDEDNNFIPYEDLQDVHEEYGVFKEKKEHISREVYDAYKAIELAESVTIDPHAMGYIPYSAGGIVIQDIRMRDVISYFATYVFEKGADIPALLGAYILEGSKAGATAASVWAAHHVLPLNVAGYGKLIGASIEGSHHFYNFLNDLTFKVGDKEIEVHTLTHPDFNMVDYVFKEKGNDDLVAMNKLNHDVYDYASYVKGNIYNNEFITSHTDFAIPDYGNSPLKFVNSLGFSDEEWNRAGKVTVLRAAVMTPYMNDKEEFDVYAPKIQAALQEKLEQIYDVK 620 T 3.9E-18 Pyridoxal_deC pdbpssm F Bacteria T 7cyl 2 B B FUS_HUMAN 75 KDA DNA-PAIRING PROTEIN,ONCOGENE FUS,ONCOGENE TLS,POMP75,TRANSLOCATED IN LIPOSARCOMA PROTEIN GGSRGGYDRGGYRGRGGDRGGFRGGRGGGDRGGFGPGKMDSRGEHRQDRRERLY 54 T 510 DUF6114 pdbhh F Eukaryota T 7cz6 1 A C A0A7M3VBX7_9VIRU Capsid protein IDCDSSVFGNNFNITTSPQTLTMSGPLAPGKYQTTLTVQALIGGTGVVVGTVTFAGKTVAYQVFDDSFASFDLGTVTVSASTTPSVIWTGSTGATLTMAVNIICKPITPTSVAISGQPIWTTPYAP 126 T 0.11 DUF3459 pdb T Viruses T 7czm 2 C,D C,D OPTN_HUMAN Optineurin LIR SSEDSFVEIRMAE 13 T 11 DUF5856 pdbhh F Eukaryota T 7d0e 2 B B CCPG1_HUMAN Cell cycle progression protein 1 FIR2 SDDSDIVTLEPPK 13 T 39 YodL pdbhh F Eukaryota T 7d0j 13 M O A8JCL6_CHLRE Photosystem I subunit O ASNKSFPRDWVKTDPLVPVLGFAGWTIPANIGVSAFGGQSLFGLFTQSIGENLAHFPTGPALDDKFWLYLITYHLGLFLTITLGQIGVQGRKQ 93 T 25 YkpC pdbhh F Eukaryota T 7d0k 1 A,B A,B A0A7M3VBX7_9VIRU Capsid protein PISADFSEVENAPSFLSLAENTDEVLKPYTGLEIQTIITNIVGDANPNQSRIFDQDRLRGNQYSAGGLVTQNAVSAIPFTNLIPRTIRVGNILVNSANRLQITETNVSEYYSNPIIATKLSEMISDQVKNNQFSTWRRDNTSLQGFNAFDIATINTAILPNGLSLESMLLKLSLLHSIKAMNVDAASINRSQYQVIDHNTVPTIGAPAVVGVNNSPVFGEDCGGNNPVYPFGGGTGAIAFHVTLQTVPDERKSYAIFVPPAILQATSDANEALALFALSMSEWPHALYTVTKQTTDLAGANAGQQVFIPTQSTIHIGGRRVLDLIIPRREIAPNPTTLVAANAMCMVRPQAGPDATAGAIPLAAGQLFNMNFIGAPAFEEWPMTSYLYSWAGRFDITTIRQYMGRLATMVGVKDAYWAAHELNVALSQVAPKMTTAAGGWAAQAANSAQQSDVCYSSLLTVTRSAANFPLANQPAADMRVYDTDPATWNKVALGLATAANLVPEQSMDVPFVVGDARASFWERLQAIPMCIAWTMYYHSRGITTLAWDNAYTDNTNKWLQKMVRNTFSTTQSVGTIIPARYGKIVCNLYKNMFHRAPAYVATSVGGKELHITHFERWLPGGTYANVYSGAGAVVNCFSPVLIPDIWCQYFTAKLPLFAGAFPPAQGQNSTKGFNSKQGLMIHRNQNNNLVAPYLEKFADNSSYFPVGQGPEINDMATWNGRLWMTTGNVQYLDYSGAAIVEAVPPAGELPVGKQIPLLAGENAPIELTNAATTCVPRYSNDGRRIFTYLTTAQSVIPVQACNRAANLARSCWLLSNVYAEPALQALGDEVEDAFDTLTNSSFLDVAKSVAESAGEVPATKALTDLQAVDVSSLPSTSDPSNVLSQPAPLMSPPTSSS 897 T 0.17 TPP_enzyme_C pdb T Viruses T 7d0l 1 A,B A,B A0A7M3VBX7_9VIRU Capsid protein PISADFSEVENAPSFLSLAENTDEVLKPYTGLEIQTIITNIVGDANPNQSRIFDQDRLRGNQYSAGGLVTQNAVSAIPFTNLIPRTIRVGNILVNSANRLQITETNVSEYYSNPIIATKLSEMISDQVKNNQFSTWRRDNTSLQGFNAFDIATINTAILPNGLSLESMLLKLSLLHSIKAMNVDAASINRSQYQVIDHNTVPTIGAPAVVGVNNSPVFGEDCGGNNPVYPFGGGTGAIAFHVTLQTVPDERKSYAIFVPPAILQATSDANEALALFALSMSEWPHALYTVTKQTTDLAGANAGQQVFIPTQSTIHIGGRRVLDLIIPRREIAPNPTTLVAANAMCMVRPQAGPDATAGAIPLAAGQLFNMNFIGAPAFEEWPMTSYLYSWAGRFDITTIRQYMGRLATMVGVKDAYWAAHELNVALSQVAPKMTTAAGGWAAQAANSAQQSDVCYSSLLTVTRSAANFPLANQPAADMRVYDTDPATWNKVALGLATAANLVPEQSMDVPFVVGDARASFWERLQAIPMCIAWTMYYHSRGITTLAWDNAYTDNTNKWLQKMVRNTFSTTQSVGTIIPARYGKIVCNLYKNMFHRAPAYVATSVGGKELHITHFERWLPGGTYANVYSGAGAVVNCFSPVLIPDIWCQYFTAKLPLFAGAFPPAQGQNSTKGFNSKQGLMIHRNQNNNLVAPYLEKFADNSSYFPVGQGPEINDMATWNGRLWMTTGNVQYLDYSGAAIVEAVPPAGELPVGKQIPLLAGENAPIELTNAATTCVPRYSNDGRRIFTYLTTAQSVIPVQACNRAANLARSCWLLSNVYAEPALQALGDEVEDAFDTLTNSSFLDVAKSVAESAGEVPATKALTDLQAVDVSSLPSTSDPSNVLSQPAPLMSPPTSSS 897 T 0.17 TPP_enzyme_C pdb T Viruses T 7d13 1 A A VGF_HUMAN Neurosecretory protein VGF SQAEATRQAAAQEERLADLASDLLLQYLLQGGARQRGLG 39 T 3.6 AcylCoA_dehyd_C pdbhh F Eukaryota T 7d16 1 A A VGF_HUMAN Neurosecretory protein VGF SQAEATRQAAAQEERLADLASDLLLQYLLQGGARQRGLG 39 T 3.6 AcylCoA_dehyd_C pdbhh F Eukaryota T 7d2f 1 A,B A,B Q5ZWG6_LEGPH HISTIDINE ACID PHOSPHATASE DKLIFAVDIIRHGDRTPIVALPTVNYQWQEGLGQLTAEGMQQEYKMGVAFRKKYIEESHLLPEHYEYGTIYVRSTDYARTLMSAQSLLMGLYPPGTGPTIPAGTSALPHAFQPIPVFSAPSKYDEVIIQQVDRKEREKLMEQYVFSTREWQQKNNELKDKYPLWSRLTGINIDNLGDLETVGHTLYIHQIHNAPMPEGLASNDIETIINSAEWAFMAQEKPQQIANVYSSKLMTNIADYLNSGSMKKSKLKYVLLSAHATTIASVLSFLGAPLEKSPPYASNVNFSLYDNGANYYTVKITYNGNPVSIPACGGSVCELQQLINLVHDS 328 T 0.012 His_Phos_2 pdbpssm F Bacteria T 7d2l 1 A A 12i1-D647A MSNKEKNASETRKAYTTKMIPRSHDRMKLLGNFMDYLMDGTPIFFELWNQFGGGIDRDIISGTANKDKISDDLLLAVNWFKVMPINSKPQGVSPSNLANLFQQYSGSEPDIQAQEYFASNFDTEKHQWKDMRVEYERLLAELQLSRSDMHHDLKLMYKEKCIGLSLSTAHYITSVMFGTGAKNNRQTKHQFYSKVIQLLEESTQINSVEQLASIILKAGDCDSYRKLRIRCSRKGATPSILKIVQDYELGTNHDDEVNVPSLIANLKEKLGRFEYECEWKCMEKIKAFLASKVGPYYLGSYSAMLENALSPIKGMTTKNCKFVLKQIDAKNDIKYENEPFGKIVEGFFDSPYFESDTNVKWVLHPHHIGESNIKTLWEDLNAIHSKYEEDIASLSEDKKEKRIKVYQGDVCQTINTYCEEVGKEAKTPLVQLLRYLYSRKDDIAVDKIIDGITFLSKKHKVEKQKINPVIQKYPSFNFGNNSKLLGKIISPKDKLKHNLKCNRNQVDNYIWIEIKVLNTKTMRWEKHHYALSSTRFLEEVYYPATSENPPDALAARFRTKTNGYEGKPALSAEQIEQIRSAPVGLRKVKKRQMRLEAARQQNLLPRYTWGKDFNINICKRGNNFEVTLATKVKKKKEKNYKVVLGYAANIVRKNTYAAIEAHANGDGVIDYNDLPVKPIESGFVTVESQVRDKSYDQLSYNGVKLLYCKPHVESRRSFLEKYRNGTMKDNRGNNIQIDFMKDFEAIADDETSLYYFNMKYCKLLQSSIRNHSSQAKEYREEIFELLRDGKLSVLKLSSLSNLSFVMFKVAKSLIGTYFGHLLKKPKNSKSDVKAPPITDEDKQKADPEMFALRLALEEKRLNKVKSKKEVIANKIVAKALELRDKYGPVLIKGENISDTTKKGKKSSTNSFLMDWLARGVANKVKEMVMMHQGLEFVEVNPNFTSHQDPFVHKNPENTFRARYSRCTPSELTEKNRKEILSFLSDKPSKRPTNAYYNEGAMAFLATYGLKKNDVLGVSLEKFKQIMANILHQRSEDQLLFPSRGGMFYLATYKLDADATSVNWNGKQFWVCNADLVAAYNVGLVDIQKDFKKKLEHHHHHH 1101 T 25 DUF4060 pdbhh F T 7d2o 1 A A Q9BLZ2_9MAXI GLUC TGKPTENNEDFNIVAVASNFATTDLDADRGKLPGKKLPLEVLKEMEANARKAGCTRGCLICLSHIKCTPKMKKFIPGRCHTYEGDKESAQGGIGEAIVDIPAIPRFKDLEPMEQFIAQVDLCVDCTTGCLKGLANVQCSDLLKKWLPQRCATFASKIQGQVDKIKGAGGDIEGR 174 T 0.024 GASA pdbpssm F Eukaryota T 7d3c 2 C,D C,E N-[(5-METHYLISOXAZOL-3-YL)CARBONYL]ALANYL-L-VALYL-N~1~-((1R,2Z)-4-(BENZYLOXY)-4-OXO-1-{[(3R)-2-OXOPYRROLIDIN-3-YL]METHYL}BUT-2-ENYL)-L-LEUCINAMIDE XAVLXX 6 T 1700 zf-H2C2_5 pdbhh F F 7d3d 2 B,D a,b GLU-VAL-SER-ILE-ILE-GLN-GLY-ALA-ASP-SER-THR-THR EVSIIQGADSTT 12 T 10 DUF6180 pdbhh F T 7d3j 1 A A 12i1-WT MSNKEKNASETRKAYTTKMIPRSHDRMKLLGNFMDYLMDGTPIFFELWNQFGGGIDRDIISGTANKDKISDDLLLAVNWFKVMPINSKPQGVSPSNLANLFQQYSGSEPDIQAQEYFASNFDTEKHQWKDMRVEYERLLAELQLSRSDMHHDLKLMYKEKCIGLSLSTAHYITSVMFGTGAKNNRQTKHQFYSKVIQLLEESTQINSVEQLASIILKAGDCDSYRKLRIRCSRKGATPSILKIVQDYELGTNHDDEVNVPSLIANLKEKLGRFEYECEWKCMEKIKAFLASKVGPYYLGSYSAMLENALSPIKGMTTKNCKFVLKQIDAKNDIKYENEPFGKIVEGFFDSPYFESDTNVKWVLHPHHIGESNIKTLWEDLNAIHSKYEEDIASLSEDKKEKRIKVYQGDVCQTINTYCEEVGKEAKTPLVQLLRYLYSRKDDIAVDKIIDGITFLSKKHKVEKQKINPVIQKYPSFNFGNNSKLLGKIISPKDKLKHNLKCNRNQVDNYIWIEIKVLNTKTMRWEKHHYALSSTRFLEEVYYPATSENPPDALAARFRTKTNGYEGKPALSAEQIEQIRSAPVGLRKVKKRQMRLEAARQQNLLPRYTWGKDFNINICKRGNNFEVTLATKVKKKKEKNYKVVLGYDANIVRKNTYAAIEAHANGDGVIDYNDLPVKPIESGFVTVESQVRDKSYDQLSYNGVKLLYCKPHVESRRSFLEKYRNGTMKDNRGNNIQIDFMKDFEAIADDETSLYYFNMKYCKLLQSSIRNHSSQAKEYREEIFELLRDGKLSVLKLSSLSNLSFVMFKVAKSLIGTYFGHLLKKPKNSKSDVKAPPITDEDKQKADPEMFALRLALEEKRLNKVKSKKEVIANKIVAKALELRDKYGPVLIKGENISDTTKKGKKSSTNSFLMDWLARGVANKVKEMVMMHQGLEFVEVNPNFTSHQDPFVHKNPENTFRARYSRCTPSELTEKNRKEILSFLSDKPSKRPTNAYYNEGAMAFLATYGLKKNDVLGVSLEKFKQIMANILHQRSEDQLLFPSRGGMFYLATYKLDADATSVNWNGKQFWVCNADLVAAYNVGLVDIQKDFKKKLEHHHHHH 1101 T 0.38 DUF1910 pdbpercent F T 7d4i 64 OB X1 Unassigned peptides 1 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 611 F F F 7d4i 65 PB X2 Unassigned peptides 2 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 694 F F F 7d55 1 A,B,C,D A,B,C,D A0A2Z6FZW5_9CAUD Putative N-acetylmuramoyl-L-alanine amidase MYCLYERPINSKTGVLEWNGDAWTVMFCNGVNCRRVSHPDEMKVIEDIYRKNNGKDIPFYSQKEWNKNAPWYNRLETVCPVVGITKKS 88 T 0.0018 Lipoprotein_15 pdbpssm T Viruses T 7d5s 55 FB X1 Unassigned peptides 1 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 347 F F F 7d5t 53 CB X1 Unassigned peptides 1 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 300 F F F 7d63 62 NB X1 Unassigned peptides 1 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 611 F F F 7d6c 2 C,D,E,F 3,4,5,F CCMN_SYNE7 CARBON DIOXIDE CONCENTRATING MECHANISM PROTEIN CCMN,ORF I FQSNMHLPPLEPPISDRYFASGEVTIAADVVIAPGVLLIAEADSRIEIASGVCIGLGSVIHARGGAIIIQAGALLAAGVLIVGQSIVGRQACLGASTTLVNTSIEAGGVTAPGSLLSAETPP 122 T 1.2E-05 Fucokinase unphh F Bacteria T 7d6f 2 B B KDIS_RAT ARMS, ANKYRIN REPEAT-RICH MEMBRANE-SPANNING PROTEIN GPGSSSESTGFGEERESIL 19 T 29 CCDC85 pdbhh F Eukaryota T 7d6r 3 G G MMA betaAla peptide MAMMXRRRRAX 11 T 6 Nup_retrotrp_bd pdbhh F F 7d6v 3 C C A0QPJ4_MYCS2 Succinate dehydrogenase (Membrane anchor subunit) MSAPTADRRATGVFSPRRAQIPERTLRTDRWWQAPLLTNLGLAAFVIYATIRAFWGSAYWVADYHYLTPFYSPCVSTACAPGSSHFGQWVGDLPWFIPMAFISLPFLLAFRLTCYYYRKAYYRSVWQSPTACAVAEPHAKYTGETRFPLILQNIHRYFFYAAVLISLVNTYDAITAFHSPSGFGFGLGNVILTGNVILLWVYTLSCHSCRHVTGGRLKHFSKHPVRYWIWTQVSKLNTRHMLFAWITLGTLVLTDFYIMLVASGTISDLRFIGHHHHHHHHHH 283 T 0.044 IncE unppercent F Bacteria T 7d6x 3 C C A0QPJ4_MYCS2 Succinate dehydrogenase (Membrane anchor subunit) MSAPTADRRATGVFSPRRAQIPERTLRTDRWWQAPLLTNLGLAAFVIYATIRAFWGSAYWVADYHYLTPFYSPCVSTACAPGSSHFGQWVGDLPWFIPMAFISLPFLLAFRLTCYYYRKAYYRSVWQSPTACAVAEPHAKYTGETRFPLILQNIHRYFFYAAVLISLVNTYDAITAFHSPSGFGFGLGNVILTGNVILLWVYTLSCHSCRHVTGGRLKHFSKHPVRYWIWTQVSKLNTRHMLFAWITLGTLVLTDFYIMLVASGTISDLRFIGHHHHHHHHHH 283 T 0.044 IncE unppercent F Bacteria T 7d7c 5 F F gp55 MSETKPKYNYVNNKELLQAIIDWKTELANNKDPNKVVRQNDTIGLAIMLIAEGLSKRFNFSGYTQSWKQEMIADGIEASIKGLHNFDETKYKNPHAYITQACFNAFVQRIKKERKEVAKKYSYFVHNVYDSRDDDMVALVDETFIQDIYDKMTHYEESTYRTPGAEKKSVVDDSPSLDFLYEAND 185 T 0.0025 Sigma70_r2 pdbpssm F T 7d7d 5 F F gp55 MSETKPKYNYVNNKELLQAIIDWKTELANNKDPNKVVRQNDTIGLAIMLIAEGLSKRFNFSGYTQSWKQEMIADGIEASIKGLHNFDETKYKNPHAYITQACFNAFVQRIKKERKEVAKKYSYFVHNVYDSRDDDMVALVDETFIQDIYDKMTHYEESTYRTPGAEKKSVVDDSPSLDFLYEAND 185 T 0.0025 Sigma70_r2 pdbpssm F T 7d87 2 B E V9H1G0_HUMAN Gene for histone H3 (germline gene) ARTKQTARKSTGGKAPRKQLATKAA 25 T 0.044 PAF unp F Eukaryota T 7d8a 2 B E V9H1G0_HUMAN Gene for histone H3 (germline gene) ARTKQTARKSTGGSGSGS 18 T 0.044 PAF unp F Eukaryota T 7d8c 1 A A 12i1 MSNKEKNASETRKAYTTKMIPRSHDRMKLLGNFMDYLMDGTPIFFELWNQFGGGIDRDIISGTANKDKISDDLLLAVNWFKVMPINSKPQGVSPSNLANLFQQYSGSEPDIQAQEYFASNFDTEKHQWKDMRVEYERLLAELQLSRSDMHHDLKLMYKEKCIGLSLSTAHYITSVMFGTGAKNNRQTKHQFYSKVIQLLEESTQINSVEQLASIILKAGDCDSYRKLRIRCSRKGATPSILKIVQDYELGTNHDDEVNVPSLIANLKEKLGRFEYECEWKCMEKIKAFLASKVGPYYLGSYSAMLENALSPIKGMTTKNCKFVLKQIDAKNDIKYENEPFGKIVEGFFDSPYFESDTNVKWVLHPHHIGESNIKTLWEDLNAIHSKYEEDIASLSEDKKEKRIKVYQGDVCQTINTYCEEVGKEAKTPLVQLLRYLYSRKDDIAVDKIIDGITFLSKKHKVEKQKINPVIQKYPSFNFGNNSKLLGKIISPKDKLKHNLKCNRNQVDNYIWIEIKVLNTKTMRWEKHHYALSSTRFLEEVYYPATSENPPDALAARFRTKTNGYEGKPALSAEQIEQIRSAPVGLRKVKKRQMRLEAARQQNLLPRYTWGKDFNINICKRGNNFEVTLATKVKKKKEKNYKVVLGYDANIVRKNTYAAIEAHANGDGVIDYNDLPVKPIESGFVTVESQVRDKSYDQLSYNGVKLLYCKPHVESRRSFLEKYRNGTMKDNRGNNIQIDFMKDFEAIADDETSLYYFNMKYCKLLQSSIRNHSSQAKEYREEIFELLRDGKLSVLKLSSLSNLSFVMFKVAKSLIGTYFGHLLKKPKNSKSDVKAPPITDEDKQKADPEMFALRLALEEKRLNKVKSKKEVIANKIVAKALELRDKYGPVLIKGENISDTTKKGKKSSTNSFLMDWLARGVANKVKEMVMMHQGLEFVEVNPNFTSHQDPFVHKNPENTFRARYSRCTPSELTEKNRKEILSFLSDKPSKRPTNAYYNEGAMAFLATYGLKKNDVLGVSLEKFKQIMANILHQRSEDQLLFPSRGGMFYLATYKLDADATSVNWNGKQFWVCNADLVAAYNVGLVDIQKDFKKKLEHHHHHH 1101 T 0.38 DUF1910 pdbpercent F T 7da6 2 B C PHE-ARG-GLY-LYS FRGK 4 T 44 BDHCT_assoc pdbhh F F 7dbt 1 A,B A,B A0A1D1VPD8_RAMVA AMNP/g12777 GSHMGRFADFFRIETEIQRLDNPAGILANGKKCDFTGACDPVVTAFLDLESPLSPWPGSVAASKWKTIFEATDQNSPTIGRSVIRDMCGGSASNVNLRVLVNDADSLSSQDEIGKFSCLFQLDARDVAMDSLSAQWGPSTECTAEAQQGKIRLFARRRAFEIPSTSCRAPSSL 173 T 0.082 MNNL pdbpssm F Eukaryota T 7dbu 1 A,B A,B A0A1D1VPD8_RAMVA AMNP/g12777 GSHMGRFADFFRIETEIQRLDNPAGILANGKKCDFTGACDPVVTAFLDLESPLSPWPGSVAASKWKTIFEATDQNSPTIGRSVIRDMCGGSASNVNLRVLVNDADSLSSQDEIGKFSCLFQLDARDVAMDSLSAQWGPSTECTAEAQQGKIRLFARRRAFEIPSTSCRAPSSL 173 T 0.082 MNNL pdbpssm F Eukaryota T 7dcv 1 A A PD1L1_HUMAN HPD-L1,B7 HOMOLOG 1,B7-H1 AHPPNERTHLVILGAILLALGVALTFIFRLRKGRLLDVKKSGIQDTNSKKQSDTHLEET 59 T 0.0032 RIF5_SNase_1 pdbpssm F Eukaryota T 7dd1 1 A A SRPK1_HUMAN SRSF protein kinase 1,SRSF protein kinase 1 HHHHHHSSGLVPRGSHMDPNDYCKGGYHLVKIGDLFNGRYHVIRKLGWGHFSTVWLSWDIQGKKFVAMKVVKSAEHYTETALDEIRLLKSVRNSDPNDPNREMVVQLLDDFKISGVNGTHICMVFEVLGHHLLKWIIKSNYQGLPLPCVKKIIQQVLQGLDYLHTKCRIIHTDIKPENILLSVNEQYIRRLAAEATEWQRSGAPPPSGSAVSTAPATAGNFLVNPLEPKNAEKLKVKIADLGNACWVHKHFTEDIQTRQYRSLEVLIGSGYNTPADIWSTACMAFELATGDYLFEPHSGEEYTRDEDHIALIIELLGKVPRKLIVAGKYSKEFFTKKGDLKHITKLKPWGLFEVLVEKYEWSQEEAAGFTDFLLPMLELIPEKRATAAECLRHPWLNS 398 T 3.2 RFXA_RFXANK_bdg unppercent F Eukaryota T 7dd1 2 B B ARG-GLU-ARG-ALA-ARG-THR-ARG RERARTR 7 T 180 Dpp_8_9_N pdbhh F F 7dda 1 A A A0A2U9GDM4_WSSV ENVELOPE PROTEIN VP37 ADFLLDRMTPVSEEDIEGFAASTFKEVSDSKTATVIVKADCETGDIDEVYNLAPSFGVTQEIKIYRSNNSSELDNVADSFHIYKISATDSDSGNTKKLLYGLRNKKAGYTCLCRIFAEIESDGIMANTNIGVAENNRDEIDENEEGKYGFLIPKQPAGAKLIIYFFLNCWT 171 T 12 DUF5043 pdbhh T Viruses T 7ddm 2 B E ALA-ALA-ARG-ASP-ALA-ALA-VAL-SER-ASP-ALA-ALA-ALA AARDAAVSDAAA 12 T 4.5 FlgT_N pdbhh F F 7de2 1 A,B A,B NVFI_ASPN1 NvfI MGSSHHHHHHSSGLVPRGSHMVGSRTWCESEMLFVQPDAGTKEELYYRVTPKPGQTQANFNWTPHKVRFHDARPQRDSFDLNTHGFTFVEDAISPQLIERIRADDTAAVEGDYFASVAALVKRVTGADHVVCFSPYTRKENSEKGIFGQPARTVHCDHTPAAAIELTHKLCGEDAVRLLQSRFRAFSVWRPLVEPVLDWPLAVVDGRTIAPDDLHPVHWLRYEKKDTEPPFQLSFSETQKWYYLSRQRSDEVSIVKNYDSEVVPSPRSAHCAFKHPFVPKDAPPRESIDVRCLVFGGR 298 T 0.28 EF-hand_5 unppercent F Eukaryota T 7de7 1 A,B A,B PDZD7_MOUSE PDZ domain-containing protein 7 GPGSTVNEQVQAWESRRPLIQDLARRLLTDDEVLAVTRHCSRYVHEGGVEDLVRPLLAILDRPTKLLLLRDIRSVVAPTDLGRFDSMVMPVELEAFEALKSRAVG 105 T 0.005 CCM2_C pdbhh F Eukaryota T 7ded 1 A,B,C,D,E,F,G D,E,A,B,C,G,F D7DTD6_METV3 lectin DNYIYSTEVGGVGGTPFTFMQESGTITSIKFNWSDQYKLLHHIEVKFINNANIYATGDPKGNHEVILEIDDDETIIGSVIGYKKGNDGRCTGVKLTTSKGKSIMAGYFEESLITTYTGKLAGIKGGAGSDIDRLGLIFLK 140 T 0.00015 Jacalin unppssm F Archaea T 7deg 2 B,E C,F G0LWX8_AQUAO Cytochrome oxidase subunit IIa FFPSGTIAFFIFMMVFYAVLWFMIYWVLLERG 32 T 0.0016 CoxIIa pdb F Bacteria T 7df2 1 A A A0A1D1UCW7_RAMVA C2 domain protein MGSSHHHHHHHSSENLYFQGLIDRGQDLADVAKYPLITGFFRIEMNVVRLDTQGKSHTGLPCDIFDKCDPKIIAFIDTEKPNNDFGGDSVPYSNYITLVDANNTPDVVEIDKTISRDVCGKGVRKIAMRVRAIDKDGLNDDKIDNYKCHITGERNPPAENEKVAQWSPEIACAGEDRASSKVYLRYRWYNIPESTCRPSSNGQGLFSGLFSR 212 T 0.36 UPAR_LY6_2 pdbpssm F Eukaryota T 7df9 2 B V V2R_HUMAN VASOPRESSIN V2 RECEPTOR PHOSPHOPEPTIDE RTPPSLGPQDESCTTASSSLRKD 23 T 27 DUF6352 pdbhh F Eukaryota T 7dfa 4 D V V2R_HUMAN VaRpp-4 RTPPSLGPQDESCTTASSSLAKD 23 T 110 DUF6352 pdbhh F Eukaryota T 7dfb 2 B V V2R_HUMAN V2Rpp-6-7 RTPPSLGPQDESCTTASSSLRKD 23 T 27 DUF6352 pdbhh F Eukaryota T 7dfc 2 B V V2R_HUMAN V2Rpp-3 RTPPSLGPQDESCTTASSSLRK 22 T 23 DUF6352 pdbhh F Eukaryota T 7dgu 1 A A de novo designed protein H4A1R MGDEYKKYYQQAIQLIQQLKKALEGNPEMKKLADKVLALLKQAYAAFKAGRSPEEIRALLRKAIEAAKKLAKLGASLGGFDLAKRIIELLKKMYELGGLEHHHHHH 106 T 0.0022 Gp-FAR-1 pdb F T 7dgw 1 A A de novo designed protein H4A2S MGEDYLKLLEEALKIAREVLENYPLTPVMRAAARAIIEAVKMAKKYGDEELIKLVVEAARLLRQAAKQGDLELARQALAAARQALAFARRVAGLEHHHHHH 101 T 0.05 VMAP-M8 pdb F T 7dgx 1 A,B A,B Q57W63_TRYB2 Coronin EFSQLLALASLLGQQQAEVQRCREDLQKKESLVMETIAKIKALALEHHHHHH 52 T 0.079 PEARLI-4 pdbpssm F Eukaryota T 7dgy 1 A A de novo designed protein H4C2R MGHPEIVAAAVAFVRQIWEYARQGMSLDEMIAWAVKYAKKIFDLVKKMGASDEVLKKVMDAVLAAAQAYAQQLNDEAAQRLLVAAQVIVQVLQQLGLEHHHHHH 104 T 1.3 DUF2277 pdb F T 7dh4 1 A,B A,B Q57W63_TRYB2 Coronin EFSQLLALASLLGQQQAEIQRCREDLQKKESLMMETIAKIKALALEHHHH 50 T 0.11 DUF2486 pdb F Eukaryota T 7dhb 1 A,B A,B Q57W63_TRYB2 Coronin FSQLLALASLLGQQQAEVQRCREDLQKKESLMMETIAKIKALALEHHHHH 50 T 0.11 DUF3391 pdbpssm F Eukaryota T 7di3 1 A A A0A160P685_STRLU Cytochrome P450 hydroxylase MTEAVAFPQNRSCPYHPPTAYEPLREERPLSRVTLWNGRQVWFVTGHQAARALLGDQRLSTDSTREDFPLPTERSESLRRQRRGALLGWDDPEHNEQRRMLIPSFTLRRAESMRPRIQAIVDRLLDDMIAAGPSAELVGAFALPVPSMVICELLGVPYGDHEFFEEQSRRLLRGPAAEDIEKAFRSLEGYFGELIETKRTDPGEGVIDDLVARQREEGRPDDDELVQFATVLLVAGHETTANMISLATYTLLEHPARLAELRADPGLVPAAVEELLRFLSIADGLVRVAREDVPVGDQVIRAGEGVVFPTSLINRDDSVYEHPDTLDWSRSARHHVAFGFGIHQCLGQNLARIELEIALGTLLRRLPGLRLAAPADRIPFKPGDTIQGMLELPVTW 396 T 3E-05 p450 pdbpssm F Bacteria T 7dii 1 A,B A,B O67854_AQUAE LEUT MEVKREHWATRLGLILAMAGNAVGLGNFLRFPVQAAENGGGAFMIPYIIAFLLVGIPLMWIEWAMGRYGGAQGHGTTPAIFYLLWRNRFAKILGVFGLWIPLVVAIYYVYIESWTLGFAIKFLVGLVPEPPPNATDPDSILRPFKEFLYSYIGVPKGDEPILKPSLFAYIVFLITMFINVSILIRGISKGIERFAKIAMPTLFILAVFLVIRVFLLETPNGTAADGLNFLWTPDFEKLKDPGVWIAAVGQIFFTLSLGFGAIITYASYVRKDQDIVLSGLTAATLNEKAEVILGGSISIPAAVAFFGVANAVAIAKAGAFNLGFITLPAIFSQTAGGTFLGFLWFFLLFFAGLTSSIAIMQPMIAFLEDELKLSRKHAVLWTAAIVFFSAHLVMFLNKSLDEMDFWAGTIGVVFFGLTELIIFFWIFGADKAWEEINRGGIIKVPRIYYYVMRYITPAFLAVLLVVWAREYIPKIMEETHWTVWITRFYIIGLFLFLTFLVFLAERRRNHESA 513 T 7.399999999999999E-33 SNF unppercent F Bacteria T 7dix 1 A,B A,B O67854_AQUAE LEUT MEVKREHWATRLGLILAMAGNAVGLGNFLRFPVQAAENGGGAFMIPYIIAFLLVGIPLMWIEWAMGRYGGAQGHGTTPAIFYLLWRNRFAKILGVFGLWIPLVVAIYYVYIESWTLGFAIKFLVGLVPEPPPNATDPDSILRPFKEFLYSYIGVPKGDEPILKPSLFAYIVFLITMFINVSILIRGISKGIERFAKIAMPTLFILAVFLVIRVFLLETPNGTAADGLNFLWTPDFEKLKDPGVWIAAVGQIFFTLSLGFGAIITYASYVRKDQDIVLSGLTAATLNEKAEVILGGSISIPAAVAFFGVANAVAIAKAGAFNLGFITLPAIFSQTAGGTFLGFLWFFLLFFAGLTSSIAIMQPMIAFLEDELKLSRKHAVLWTAAIVFFSAHLVMFLNKSLDEMDFWAGTIGVVFFGLTELIIFFWIFGADKAWEEINRGGIIKVPRIYYYVMRYITPAFLAVLLVVWAREYIPKIMEETHWTVWITRFYIIGLFLFLTFLVFLAERRRNHESA 513 T 7.399999999999999E-33 SNF unppercent F Bacteria T 7djq 1 A C A0A1V9TQZ2_9LACO C-Terminal peptide of ribosomal S4 Domain protein EEYREDFSI 9 T 7 Pox_A6 pdbhh F Bacteria T 7dkh 3 C,G,K C,G,K CDC73_YEAST RNA POLYMERASE-ASSOCIATED PROTEIN CDC73 NDSEVSDPVVVETMKHERILVDHNSALRGAKPINFGYLIKDAELKLVQSIKGSLRGS 57 T 0.34 CDC73_N unppercent F Eukaryota T 7dkh 4 D,H,L D,H,L RTF1_YEAST RNA polymerase-associated protein RTF1 KTRTKVYYQEIQKEENAKAKEIAQQEKLQEDKDAKDKREKELLVAQFRRLGGLERMVGELDIKFDLKF 68 T 0.14 DUF1366 pdbpercent F Eukaryota T 7dkk 1 A,B,C,D A,B,C,D De novo design protein XM2H MHSWSATVDSRSEEAVRAAARRLAERLLAAGISGKIKIEVEANGIKYEYEVEGPATEEVAKKIVEYAVAAALRAIAAGATSVTITVGLE 89 T 0.094 NAGLU_N pdb F T 7dko 1 A,B,C A,B,C de novo designed protein AM2M MASAEAEVKPDATIEEIRAAARRLAEALRKAGVSGPVTVTAEAGDVSFSYTADLDGTEEGLKRVVEAIVRAAIAALKATGGTKPVLLSAVLE 92 T 0.014 DUF1887 pdb F T 7dl2 4 F F unknown protein XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 261 F F F 7dls 1 A A A0A160P685_STRLU Cytochrome P450 hydroxylase MTEAVAFPQNRSCPYHPPTAYEPLREERPLSRVTLWNGRQVWFVTGHQAARALLGDQRLSTDSTREDFPLPTERSESLRRQRRGALLGWDDPEHNEQRRMLIPSFTLRRAESMRPRIQAIVDRLLDDMIAAGPSAELVGAFALPVPSMVICELLGVPYGDHEFFEEQSRRLLRGPAAEDIEKAFRSLEGYFGELIETKRTDPGEGVIDDLVARQREEGRPDDDELVQFATVLLVAGHETTANMISLATYTLLEHPARLAELRADPGLVPAAVEELLRFLSIADGLVRVAREDVPVGDQVIRAGEGVVFPTSLINRDDSVYEHPDTLDWSRSARHHVAFGFGIHQCLGQNLARIELEIALGTLLRRLPGLRLAAPADRIPFKPGDTIQGMLELPVTW 396 T 3E-05 p450 pdbpssm F Bacteria T 7dmf 1 A A Designed protein EXTD-3 DCQQELSLVQTVTRGSRAFLSREEAQHFVKECGLLNCEAVLELLICHLRLGMEIMKLGRQLREAVRANDVDAMLKIAKEIIKVIGETGLDEVYRQLLKAAKEFLERRAENFSHEEAVAFAQQIIQLIKQVECVQMRALGAVASLGCTDLLPQEHILLLTRPRLQELSAGSPGPVTNKATKILRHFEASC 189 T 0.074 AviRa pdb F T 7dmn 1 A A FSA2_FUSSF FUSARISETIN A BIOSYNTHESIS PROTEIN 2 GSHMSNVTVSAFTVDKSISEEHVLPSSFIPGSGNIFPKFTSAIPKTAWELWYFDGISKDDKSSIVIGVTRNAEGLKHGGFKVQVFVIWADERTWHRDLFFPESVVSINESGVTDGIWKDATSNSSISFSCAGDLSKASLVFDVPGVVQGDMHLEALPGDTGLDTDARLGPSVYYVRPIGRASVKAQLSLYSSDATAAEQFSLGTSANGGMDRVWSPLSWPQVMTESYYLRTQVGPYAMQIMRIFPPAGSEDQPSTMARLYREGQLVCVAQHVVTREDALMTHDSLILSKQDNSDSEDVVTGGYRDKNTGYTVEFVEKGNEGQRWKFQVRHERIIWNTPTSRPGPDATGNTGFVEVLCGGTIGESYEGVGTGGQCELS 377 T 0.00021 Svf1_C unphh F Eukaryota T 7dmo 1 A,B,C,D,E,F A,B,C,D,E,F PHM7_PYRSX Diels-Alderase GSHMSEPTSSSSLDITSNCIIETPLQPSDFLPKSANLFPKFPERISVDSWELWEFDTFDTNGSVAFGCSLYRDARGVEQGGFHAEVNALWPDGTHWGETLYFAVSEVVENSDGTTGGKWLSKDGGSITFHIASDYTAAALDFNVPGKVSGTMELRNHANVSPTSNLPASDAEAQLCPGVYYTFPMGPVATSVTATFSSVGANGESRELFISSGYGGMVRGWSARPWPTFMNDAYYVVAQVGPYMLQILRTLGSVFVQHKPFAVARLYLDGSLVSAANTVVGDELTAHADDVKGDAVRLTKVQPDEKSQGLSGKFRDGNVGYVLEFAKKDSEHGWTFQISHKRAVWSEPTSAPGPDGTGKSGWIEAISGGAKGENYEGHGFGGQLQIPVP 389 T 0.0068 Svf1_C unphh F Eukaryota T 7dmq 1 A A CS13A_LEPSD CRISPR/Cas system Cas13a GSMGNLFGHKRWYEVRDKKDFKIKRKVKVKRNYDGNKYILNINENNNKEKIDNNKFIRKYINYKKNDNILKEFTRKFHAGNILFKLKGKEGIIRIENNDDFLETEEVVLYIEAYGKSEKLKALGITKKKIIDEAIRQGITKDDKKIEIKRQENEEEIEIDIRDEYTNKTLNDCSIILRIIENDELETKKSIYEIFKNINMSLYKIIEKIIENETEKVFENRYYEEHLREKLLKDDKIDVILTNFMEIREKIKSNLEILGFVKFYLNVGGDKKKSKNKKMLVEKILNINVDLTVEDIADFVIKELEFWNITKRIEKVKKVNNEFLEKRRNRTYIKSYVLLDKHEKFKIERENKKDKIVKFFVENIKNNSIKEKIEKILAEFKIDELIKKLEKELKKGNCDTEIFGIFKKHYKVNFDSKKFSKKSDEEKELYKIIYRYLKGRIEKILVNEQKVRLKKMEKIEIEKILNESILSEKILKRVKQYTLEHIMYLGKLRHNDIDMTTVNTDDFSRLHAKEELDLELITFFASTNMELNKIFSRENINNDENIDFFGGDREKNYVLDKKILNSKIKIIRDLDFIDNKNNITNNFIRKFTKIGTNERNRILHAISKERDLQGTQDDYNKVINIIQNLKISDEEVSKALNLDVVFKDKKNIITKINDIKISEENNNDIKYLPSFSKVLPEILNLYRNNPKNEPFDTIETEKIVLNALIYVNKELYKKLILEDDLEENESKNIFLQELKKTLGNIDEIDENIIENYYKNAQISASKGNNKAIKKYQKKVIECYIGYLRKNYEELFDFSDFKMNIQEIKKQIKDINDNKTYERITVKTSDKTIVINDDFEYIISIFALLNSNAVINKIRNRFFATSVWLNTSEYQNIIDILDEIMQLNTLRNECITENWNLNLEEFIQKMKEIEKDFDDFKIQTKKEIFNNYYEDIKNNILTEFKDDINGCDVLEKKLEKIVIFDDETKFEIDKKSNILQDEQRKLSNINKKDLKKKVDQYIKDKDQEIKSKILCRIIFNSDFLKKYKKEIDNLIEDMESENENKFQEIYYPKERKNELYIYKKNLFLNIGNPNFDKIYGLISNDIKMADAKFLFNIDGKNIRKNKISEIDAILKNLNDKLNGYSKEYKEKYIKKLKENDDFFAKNIQNKNYKSFEKDYNRVSEYKKIRDLVEFNYLNKIESYLIDINWKLAIQMARFERDMHYIVNGLRELGIIKLSGYNTGISRAYPKRNGSDGFYTTTAYYKFFDEESYKKFEKICYGFGIDLSENSEINKPENESIRNYISHFYIVRNPFADYSIAEQIDRVSNLLSYSTRYNNSTYASVFEVFKKDVNLDYDELKKKFKLIGNNDILERLMKPKKVSVLELESYNSDYIKNLIIELLTKIENTNDTL 1391 T 0.067 PET117 pdbpercent F Bacteria T 7dms 1 A A Q665A4_YERPS Fe(II)-binding effector SMKTDNAMKKIKLAIDGINQAIDNFNEVQTFTTINQLNHFKEKLMNCEHLIQLNNIPDKSHRNLGISRIIIDQWPFDSELGCMIINAESEYKSL 94 T 0.0065 Laminin_I pdbpssm F Bacteria T 7dn2 1 A,B,C,D,E,F,G,H,I a,b,c,d,e,f,g,h,i ORF14_BPKHP Major structural protein ORF14 MLEKLNNINFNNISNNLNLGIEVGREIQNASWIKSPFFSITGTGADRGVRLFSVASQQPFRPRIKAQLSGSGVSGNTDFEANYDNLEILSQTIYPDAFGNSLRSKIKAYSELERIDFIKESVDSLTTWMNEERDKRIVASLTNDFTNYLYTQTMNVATIRKAIFHARNGLKGDNSKAFPIKPIRATMQSVGNVMVQNTSYIILLDSYQANQLKADSEFKELRKLYAFAGEDKGMLYSGLLGVIDNCPVIDAGVWNKFNVGMPNSSISDSDFMRYLNKANVSSIVTPRQFKEKLNQEKDEKKRSINKEISIGCLIGASAVLLAGSKETRFYIDETVDAGRKSLVGVDCLLGVSKARYQSTDGVVTPYDNQDYAVIGLVSDME 381 T 0.00023 DUF4043 pdbpssm T Viruses T 7dn2 2 J,K,L,M,N,O,P,Q,R 1,2,3,4,5,6,7,8,9 I7HFW5_BPKHP Cement protein gp15 MKQKVHSVSYLAKAEFKFNNGVYNLVALPSGAEVVKVSLEVVGNPIATSTTSVSVGFEDETTKNYFLTLDNLAVDDASKKHTTSAKDYTATSNKVVVAEVKNANDNNVKGVLRVLYFLPSVIEVEY 126 T 0.056 Spore_III_AF pdbpercent T Viruses T 7dne 2 C,D C,D V3-IY (MN) crown mimetic peptide KRIHIGPGRAFYTTXP 16 F F T 7dnf 2 B,E B,E V3-IY (MN) crown mimetic peptide PKRIHIGPGRAFYTTXP 17 T 0.0018 GP120 pdbhh F T 7dno 2 C,D C,D CYS-ARG-THR-LEU-PRO-PHE CRTLPFHEC 9 T 1.6 ORC3_ins pdbhh F T 7dns 1 A,B A,B de novo designed protein GGHMGEEQKEIETLVELFAEAFREAKRQKKNGTPEEWARDAVEEAARQQGRSRKDVVEALTKYAQEQGRDELLKRLGITPEIYKVIQQIRKEEGSLE 97 T 0.0081 DUF2226 pdb F T 7doc 8 I I 3-PYRIDIN-4-YL-2,4-DIHYDRO-INDENO[1,2-.C.]PYRAZOLE XGKRK 5 T 130 Orexin pdbhh F F 7doh 1 A I BACR1_HALMA GLY-THR-GLY-ALA-THR-PRO-ALA-ASP-ASP GTGATPADD 9 T 0.53 DUF2877 pdbhh F Archaea F 7doq 1 A,B,C,D A,B,C,D A0A2S6F805_LEGPN HISTIDINE ACID PHOSPHATASE,HISTIDINE-TYPE PHOSPHATASE HHMEDKLIFAVDIIRHGDRTPIVALPTVNYQWQEGLGQLTAEGMQQEYKMGVAFRKKYIEESHLLPEHYEYGTIYVRSTDYARTLMSAQSLLMGLYPPGTGPTIPAGTSALPHAFQPIPVFSAPSKYDEVIIQQVDRKEREKLMEQYVFSTREWQQKNNELKDKYPLWSRLTGINIDNLGDLETVGHTLYIHQIHNAPMPEGLASNDIETIINSAEWAFMAQEKPQQIANVYSSKLMTNIADYLNSGSMKKSKLKYVLLSAHDTTIASVLSFLGAPLEKSPPYASNVNFSLYDNGANYYTVKITYNGNPVSIPACGGSVCELQQLINLVHDSKNS 335 T 0.018 His_Phos_2 pdbpssm F Bacteria T 7dou 1 A,B,C 4,5,6 I7GUT5_9CAUD Cement protein gp16 MKQKVHSVSYLAKAEFEYKNGVYDLVALPTGAEVIKISLEVVGLPTAGHVSVGFKDESKKNYSSILTLPVNETSGVVTKDYTVKSDKIVAAEVKDALAEGSDGRPVKCVLRALYFLPSVIEVEY 124 T 0.0053 Clathrin_bdg pdbpssm T Viruses T 7dq0 3 C C Actinomycin D TXPXXXTXPXX 11 T 61 DUF5572 pdbhh F F 7dq0 4 D D Echinomycin XAXXXAXX 8 T 190 RSF pdbhh F F 7dq8 3 C D Echinomycin XAXXXAXX 8 T 190 RSF pdbhh F F 7dq8 4 D C Actinomycin D TXPXXXTXPXX 11 T 61 DUF5572 pdbhh F F 7drm 2 E,F E,F A6GH40_9DELT PsnA214-38, Precursor peptide LFIEDLGKVTGGKGGPYTTLAIGEE 25 T 0.016 LcnG-beta unphh F Bacteria T 7drn 2 E,F E,F A6GH40_9DELT PsnA214-38, Precursor peptide LFIEDLGKVTGGKGGPYTTLAIGEE 25 T 0.016 LcnG-beta unphh F Bacteria T 7dro 2 G,H,I,J G,H,I,J A6GH40_9DELT PsnA214-38, Precursor peptide LFIEDLGKVTGGKGGPYTTLAIGEE 25 T 0.016 LcnG-beta unphh F Bacteria T 7drp 2 E,F F,E A6GH40_9DELT PsnA214-38, Precursor peptide, phospho-mimic LFIEDLGKVTGGKGGPYTTLAIGXE 25 T 0.016 LcnG-beta unphh F Bacteria T 7ds2 3 C C TWF1_MOUSE PROTEIN A6 GPLGSKQHAHKQSFAKPKGPAGKRGIRRLIRGPAEAEATTD 41 T 24 Cylicin_N unphh F Eukaryota T 7ds3 3 C C TWF2_MOUSE A6-RELATED PROTEIN,MA6RP,TWINFILIN-1-LIKE PROTEIN KQHAFKQAFAKPKGPGGKRGHKRLIRGPGE 30 T 17 Tristanin_u2 unphh F Eukaryota T 7ds4 3 C C TWF1_MOUSE PROTEIN A6 KQHAHKQSKAKPKGPAGKRGIRRLIRGPAE 30 T 24 Cylicin_N unphh F Eukaryota T 7ds6 3 C C CD2AP_HUMAN;TWF1_MOUSE PROTEIN A6,ADAPTER PROTEIN CMS,CAS LIGAND WITH MULTIPLE SH3 DOMAINS KQHAHKQSFAKPKMPGRRLPGRFNG 25 T 7.1 CAP-ZIP_m unphh F Eukaryota T 7ds8 3 C C CD2AP_HUMAN;TWF1_MOUSE ADAPTER PROTEIN CMS,CAS LIGAND WITH MULTIPLE SH3 DOMAINS,PROTEIN A6 NLLHLTANRPKGPAGKRGIRRLIRGPAE 28 T 7.1 CAP-ZIP_m unphh F Eukaryota T 7dsb 4 D D TWF1_MOUSE PROTEIN A6 KQHAHKQSFAKPKGPAGKRGIRRLIRGPAE 30 T 24 Cylicin_N unphh F Eukaryota T 7dsz 1 A,B,C A,B,C B2UR43_AKKM8 Amuc_1102 MGHHHHHHMQTTSNPRMQVRVSLEKLSLYMRQSPNVLTQDDPRPLPKPKKWADFEIPFKVEAAPTPKSGYIDALTFKFYIAVVNPDRSRQYLKLYKEVKYVNVPVGENTYASVYLSPSSVKRITGVEGGRGKWVKYQGVVVEYNGKIVATYSSERGKMEKWWTIQSPSIVETSYYPLLNKDETPFSVFWYDRYPEIMRPNSQQAASSSVPAPFGTPVEPPADGELEHHHHHH 232 T 0.14 OMP_b-brl unppssm F Bacteria T 7dta 1 A A S2A4R_HUMAN GLUT4 ENHANCER FACTOR,GEF,HUNTINGTON DISEASE GENE REGULATORY REGION-BINDING PROTEIN 1,HDBP-1 GDAKKCRKVYGMERRDLWCTACRWKKACQRFLD 33 T 0.59 ERG4_ERG24 pdbpssm F Eukaryota T 7dtr 1 A A AcrIF24 MNAIHIGPFSITPAARGLHYGGLPHHQWTLYYGPREMAIKTLPDSYTSSEVRDEFSDIIAEFVIDARHRYAPGSGSSGSAPPLAWITGLLPGEVLTHDAEEWRPPTSWELRHVVGEGSFTGVSGAAAAALLGMSATNFRKYTAGDSAANRQKISFAAWHYLLDRLGVKRAS 171 T 0.00017 DUF4447 pdbhh F T 7du0 1 A A A0A0R6PCL0_9CAUD AcrIF14 AMKKIEMIEISQNRQNLTAFLHISEIKAINAKLADGVDVDKKSFDEICSIVLEQYQAKQISNKQASEIFETLAKANKSFKIEKFRCSHGYNEIYKYSPDHEAYLFYCKGGQGQLNKLIAENGRFM 125 T 0.053 GatB_Yqey unp T Viruses T 7du4 2 C Q MAZE7_MYCTU peptide DEDREWEGTVGDGLG 15 T 0.23 FUSC unppercent F Bacteria T 7du5 2 C C MAZE9_MYCTU A fragment of MazE-mt1 TLEDDYANAWQEWSAAG 17 T 0.0075 ParD_antitoxin unppercent F Bacteria T 7duv 1 A,B,C A,B,C Q981B2_SACS2 SegB MSELDFLLKKKRKSEDEEKIINNNENAKKEEITNEEEKIKNDMLKYIEKDPKIGVWSYPAFLVLQYLYHTVPGFKMSRTAKEALEKGLKEMYPTLFTIAEKIAKERFKEHHHHHH 115 T 0.0063 CBFD_NFYB_HMF unppercent F Archaea T 7dv2 1 A,B,C,D A,B,C,D Q981B2_SACS2 SegB MNEEEKIKNDMLKYIEKDPKIGVWSYPAFLVLQYLYHTVPGFKMSRTAKEALEKGLKEMYPTLFTIAEKIAKERFKEHHHHHH 83 T 0.0062 CBFD_NFYB_HMF pdbpercent F Archaea T 7dwq 6 F F B0C7S7_ACAM1 Photosystem I protein PsaF MRRLFAVLLVMTLFLGVVPPASADIGGLVPCSESPKFQERAAKARNTTADPNSGQKRFEMYSSALCGPEDGLPRIIAGGPMRRAGDFLIPGLFFIYIAGGIGNSSRNYQIANRKKNAKNPAMGEIIIDVPLAVSSTIAGMAWPLTAFRELTSGELTVPDSDVTVSPR 167 T 3.4E-06 PSI_PsaF pdbpssm F Bacteria T 7dwq 10 J W Photosystem I protein Psa27 MISDILPAIMTPLVVLIGGGAAMTAFFYYVEREG 34 T 0.0026 PSI_8 pdbpercent F T 7dxa 2 B B Q8DMP8_THEEB Tsl0063 protein MRYTTDEGGRLNNFAIEPKVYQAQPWTPQQKVRAALLVGGGLLLVAGLVAIAVGVS 56 T 0.026 DUF2157 pdbpssm F Bacteria T 7dxa 14 N C unidentified transmembrane protein AAAAAAAAAAAAAAAAAAAAAAA 23 T 560 DUF4699 pdbhh F F 7dxh 2 B B Q8DMP8_THEEB Tsl0063 protein MRYTTDEGGRLNNFAIEPKVYQAQPWTPQQKVRAALLVGGGLLLVAGLVAIAVGVS 56 T 0.026 DUF2157 pdbpssm F Bacteria T 7dxh 18 R C unidentified transmembrane protein AAAAAAAAAAAAAAAAAAAAAAA 23 T 560 DUF4699 pdbhh F F 7dyr 3 C,F,I A,D,G MCEA_KLEPN MCCE492 GETDPNTQLLNDLGNNMAWGAALGAPGGLGSAALGAAGGALQTVGQGLIDHGPVNVPIPVLIGPSWNGSGSGYNSATSSSGSGS 84 T 0.00094 MccV unphh F Bacteria T 7dz7 13 M O A8JCL6_CHLRE Photosystem I subunit O MAVAMRSAAMPSLASRPRVSSRRSVVVRAEASNKSFPRDWVKTDPLVPVLGFAGWTIPANIGVSAFGGQSLFGLFTQSIGENLAHFPTGPALDDKFWLYLITYHLGLFLTITLGQIGVQGRKQGYW 126 T 42 YkpC pdbhh F Eukaryota T 7dz8 13 M O A8JCL6_CHLRE Photosystem I subunit O MAVAMRSAAMPSLASRPRVSSRRSVVVRAEASNKSFPRDWVKTDPLVPVLGFAGWTIPANIGVSAFGGQSLFGLFTQSIGENLAHFPTGPALDDKFWLYLITYHLGLFLTITLGQIGVQGRKQGYW 126 T 42 YkpC pdbhh F Eukaryota T 7dz9 3 C,F D,C E3BK13_9VIBR MbnC MEEILDRIINPLSAKPLTKKEHIYTSLVLQSSQSLILSACPSLQSQRQFCSFEYHQQFIDWCFFNKKRTDWCLALSFYQYLSYKNEQVSVEILKELIHLACSQWTYADKSTNQTVVICHTRLPSMVFGGNKSLFAQEFREVFLLETEQLKPFIQSHVPDGYFVYWILRDDSEYPSTMGEK 180 T 0.14 SEFIR unppercent F Bacteria T 7dz9 4 D E MbnA MKNDKKVVVKVKDKEMTCGAFNK 23 T 5.1 ATP-synt pdbhh F T 7dz9 5 E F MbnA MKNDKKVVVKVKDKEMTCGAFN 22 T 3.8 ATP-synt pdbhh F T 7dzm 3 C C POL_HV1H2 Gag-Pol polyprotein TPQDLNTML 9 T 0.13 Gag_p24 unphh T Viruses T 7dzn 3 C C POL_HV1H2 Gag-Pol polyprotein TPQDLNTML 9 T 0.13 Gag_p24 unphh T Viruses T 7e0b 2 B B PBM DDVQTSF 7 T 55 DUF2150 pdbhh F T 7e0c 1 A A Q8L3C7_9ACTN L-glutamate oxidase MNEMTYEQLARELLLVGPAPTNEDLKLRYLDVLIDNGLNPPGPPKRILIVGAGIAGLVAGDLLTRAGHDVTILEANANRVGGRIKTFHAKKGEPSPFADPAQYAEAGAMRLPSFHPLTLALIDKLGLKRRLFFNVDIDPQTGNQDAPVPPVFYKSFKDGKTWTNGAPSPEFKEPDKRNHTWIRTNREQVRRAQYATDPSSINEGFHLTGCETRLTVSDMVNQALEPVRDYYSVKQDDGTRVNKPFKEWLAGWADVVRDFDGYSMGRFLREYAEFSDEAVEAIGTIENMTSELHLAFFHSFLGRSDIDPRATYWEIEGGSRMLPETLAKDLRDQIVMGQRMVRLEYYDPGRDGHHGELTGPGGPAVAIQTVPEGEPYAATQTWTGDLAIVTIPFSSLRFVKVTPPFSYKKRRAVIETHYDQATKVLLEFSRRWWEFTEADWKRELDAIAPGLYDYYQQWGEDDAEAALALPQSVRNLPTGLLGAHPSVDESRIGEEQVEYYRNSELRGGVRPATNAYGGGSTTDNPNRFMYYPSHPVPGTQGGVVLAAYSWSDDAARWDSFDDAERYGYALENLQSVHGRRIEVFYTGAGQTQSWLRDPYACGEAAVYTPHQMTAFHLDVVRPEGPVYFAGEHVSLKHAWIEGAVETAVRAAIAVNEAPVGDTGVTAAAGRRGAAAATEPMREEALTS 687 T 2.2999999999999998E-32 Amino_oxidase unppercent F Bacteria T 7e0d 1 A A Q8L3C7_9ACTN L-glutamate oxidase MNEMTYEQLARELLLVGPAPTNEDLKLRYLDVLIDNGLNPPGPPKRILIVGAGIAGLVAGDLLTRAGHDVTILEANANRVGGRIKTFHAKKGEPSPFADPAQYAEAGAMRLPSFHPLTLALIDKLGLKRRLFFNVDIDPQTGNQDAPVPPVFYKSFKDGKTWTNGAPSPEFKEPDKRNHTWIRTNREQVRRAQYATDPSSINEGFHLTGCETRLTVSDMVNQALEPVRDYYSVKQDDGTRVNKPFKEWLAGWADVVRDFDGYSMGRFLREYAEFSDEAVEAIGTIENMTSELHLAFFHSFLGRSDIDPRATYWEIEGGSRMLPETLAKDLRDQIVMGQRMVRLEYYDPGRDGHHGELTGPGGPAVAIQTVPEGEPYAATQTWTGDLAIVTIPFSSLRFVKVTPPFSYKKRRAVIETHYDQATKVLLEFSRRWWEFTEADWKRELDAIAPGLYDYYQQWGEDDAEAALALPQSVRNLPTGLLGAHPSVDESRIGEEQVEYYRNSELRGGVRPATNAYGGGSTTDNPNRFMYYPSHPVPGTQGGVVLAAYSWSDDAARWDSFDDAERYGYALENLQSVHGRRIEVFYTGAGQTQSWLRDPYACGEAAVYTPHQMTAFHLDVVRPEGPVYFAGEHVSLKHAWIEGAVETAVRAAIAVNEAPVGDTGVTAAAGRRGAAAATEPMREEALTS 687 T 2.2999999999999998E-32 Amino_oxidase unppercent F Bacteria T 7e15 2 B,E B,E Q5JF31_THEKO Gins51 MGSSHHHHHHSSGENLYFQGHMSKEVPKEAYIIQIDLPAVLGPDMKEYGPFMAGDMAIIPTVIGRALVEREAARRVRIFL 80 T 0.31 SSURE unppercent F Archaea T 7e15 3 C,F C,F Q5JET1_THEKO POLDP1, POL II,EXODEOXYRIBONUCLEASE SMALL SUBUNIT MLVEDLLKNNYLITPSAYYLLSDHYKKAFTLAELIKFAKNRGTFVVDSNLAREFLAEKGIISSG 64 T 0.12 DUF2492 pdbhh F Archaea T 7e1v 5 E,L I,U A0R1B6_MYCS2 Cytochrome c oxidase subunit CtaJ MSTALTHGLIGGVPLVLFAVLALIFLTRKGPHPDTYKMSDPWTHAPILWAAEEPREHGHGGHGHDSHGVVIGGGASGKW 79 T 0.08 VPDSG-CTERM pdbpssm F Bacteria T 7e1w 5 E,L I,U A0R1B6_MYCS2 Cytochrome c oxidase subunit CtaJ MSTALTHGLIGGVPLVLFAVLALIFLTRKGPHPDTYKMSDPWTHAPILWAAEEPREHGHGGHGHDSHGVVIGGGASGKW 79 T 0.08 VPDSG-CTERM pdbpssm F Bacteria T 7e1x 5 E,L I,U A0R1B6_MYCS2 Cytochrome c oxidase subunit CtaJ MSTALTHGLIGGVPLVLFAVLALIFLTRKGPHPDTYKMSDPWTHAPILWAAEEPREHGHGGHGHDSHGVVIGGGASGKW 79 T 0.08 VPDSG-CTERM pdbpssm F Bacteria T 7e22 1 A A FSA2_FUSSF FUSARISETIN A BIOSYNTHESIS PROTEIN 2 GSHMSNVTVSAFTVDKSISEEHVLPSSFIPGSGNIFPKFTSAIPKTAWELWYFDGISKDDKSSIVIGVTRNAEGLKHGGFKVQVFVIWADERTWHRDLFFPESVVSINESGVTDGIWKDATSNSSISFSCAGDLSKASLVFDVPGVVQGDMHLEALPGDTGLDTDARLGPSVYYVRPIGRASVKAQLSLYSSDATAAEQFSLGTSANGGMDRVWSPLSWPQVMTESYYLRTQVGPYAMQIMRIFPPAGSEDQPSTMARLYREGQLVCVAQHVVTREDALMTHDSLILSKQDNSDSEDVVTGGYRDKNTGYTVEFVEKGNEGQRWKFQVRHERIIWNTPTSRPGPDATGNTGFVEVLCGGTIGESYEGVGTGGQCELS 377 T 0.00021 Svf1_C unphh F Eukaryota T 7e2e 2 B,D P,Q PRGC1_HUMAN PPARGC-1-ALPHA,LIGAND EFFECT MODULATOR 6 RPASELLKYLTT 12 T 4.4 MTBP_mid pdbhh F Eukaryota T 7e2h 1 A D DISP1_HUMAN Protein dispatched homolog 1 MAMSNGNNDFVVLSNSSIATSAANPSPLTPCDGDHAAQQLTPKEATRTKVSPNGCLQLNGTVKSSFLPLDNQRMPQMLPQCCHPCPYHHPLTSHSSHQECHPEAGPAAPSALASCCMQPHSEYSASLCPNHSPVYQTTCCLQPSPSFCLHHPWPDHFQHQPVQQHIANIRPSRPFKLPKSYAALIADWPVVVLGMCTMFIVVCALVGVLVPELPDFSDPLLGFEPRGTAIGQRLVTWNNMVKNTGYKATLANYPFKYADEQASSLEVLFQ 270 T 0.76 Adeno_E3_14_5 pdbhh F Eukaryota T 7e4j 1 A,B,C,D A,B,C,D VPB12_MYCTU ANTITOXIN VAPB12,CONSERVED PROTEIN OF UNCHARACTERIZED FUNCTION,POSSIBLE ANTITOXIN VAPB12 MSAMVQIRNVPDELLHELKARAAAQRMSLSDFLLARLAEIAEEP 44 T 0.00013 ParD unppercent F Bacteria T 7e57 1 A,B A,B TNF18_MOUSE GITR LIGAND,GITRL,GLUCOCORTICOID-INDUCED TNF-RELATED LIGAND MEEMPLRESSPQRAERCKKSWLLCIVALLLMLLCSLGTLIYTSLKPTAIESCMVKFELSSSKWHMTSPKPHCVNTTSDGKLKILQSGTYLIYGQVIPVDKKYIKDNAPFVVQIYKKNDVLQTLMNDFQILPIGGVYELHAGDNIYLKFNSKDHIQKTNTYWGIILMPDLPFIS 173 T 0.00013 TNF pdb F Eukaryota T 7e5e 2 E,F,G,H E,F,G,H GD20 XXLITFRQWAFNLPCGX 17 T 3.8 DUF2433 pdbhh F T 7e5t 1 A,B,C,D,E,F,G,H A,B,C,D,E,F,G,H FSA2_FUSSF FUSARISETIN A BIOSYNTHESIS PROTEIN 2 MGSSHHHHHHMSNVTVSAFTVDKSISEEHVLPSSFIPGSGNIFPKFTSAIPKTAWELWYFDGISKDDKSSIVIGVTRNAEGLKHGGFKVQVFVIWADERTWHRDLFFPESVVSINESGVTDGIWKDATSNSSISFSCAGDLSKASLVFDVPGVVQGDMHLEALPGDTGLDTDARLGPSVYYVRPIGRASVKAQLSLYSSDATAAEQFSLGTSANGGMDRVWSPLSWPQVMTESYYLRTQVGPYAMQIMRIFPPAGSEDQPSTMARLYREGQLVCVAQHVVTREDALMTHDSLILSKQDNSDSEDVVTGGYRDKNTGYTVEFVEKGNEGQRWKFQVRHERIIWNTPTSRPGPDATGNTGFVEVLCGGTIGESYEGVGTGGQCELS 384 T 0.00021 Svf1_C unphh F Eukaryota T 7e5u 1 A,B,C A,B,C PHM7_PYRSX Diels-Alderase GSHMSEPTSSSSLDITSNCIIETPLQPSDFLPKSANLFPKFPERISVDSWELWEFDTFDTNGSVAFGCSLYRDARGVEQGGFHAEVNALWPDGTHWGETLYFAVSEVVENSDGTTGGKWLSKDGGSITFHIASDYTAAALDFNVPGKVSGTMELRNHANVSPTSNLPASDAEAQLCPGVYYTFPMGPVATSVTATFSSVGANGESRELFISSGYGGMVRGWSARPWPTFMNDAYYVVAQVGPYMLQILRTLGSVFVQHKPFAVARLYLDGSLVSAANTVVGDELTAHADDVKGDAVRLTKVQPDEKSQGLSGKFRDGNVGYVLEFAKKDSEHGWTFQISHKRAVWSEPTSAPGPDGTGKSGWIEAISGGAKGENYEGHGFGGQLQIPVP 389 T 0.0068 Svf1_C unphh F Eukaryota T 7e5v 1 A,B,C A,B,C PHM7_PYRSX Diels-Alderase GSHMSEPTSSSSLDITSNCIIETPLQPSDFLPKSANLFPKFPERISVDSWELWEFDTFDTNGSVAFGCSLYRDARGVEQGGFHAEVNALWPDGTHWGETLYFAVSEVVENSDGTTGGKWLSKDGGSITFHIASDYTAAALDFNVPGKVSGTMELRNHANVSPTSNLPASDAEAQLCPGVYYTFPMGPVATSVTATFSSVGANGESRELFISSGYGGMVRGWSARPWPTFMNDAYYVVAQVGPYMLQILRTLGSVFVQHKPFAVARLYLDGSLVSAANTVVGDELTAHADDVKGDAVRLTKVQPDEKSQGLSGKFRDGNVGYVLEFAKKDSEHGWTFQISHKRAVWSEPTSAPGPDGTGKSGWIEAISGGAKGENYEGHGFGGQLQIPVP 389 T 0.0068 Svf1_C unphh F Eukaryota T 7e74 2 E,F,G,H E,F,G,H ALA-ALA-ARG-ALY AARX 4 T 260 G2BR pdbhh F F 7e7c 2 B B Histone H3K27ac(24-27) peptide AARX 4 T 260 G2BR pdbhh F F 7e80 3 AA,BA,CA,DA,EA 0,1,2,3,4 Flagellar MS ring L2 XXXXXXXXXXXXXXX 15 F F F 7e81 2 EA,FA,GA,HA,X GC,GE,GD,GB,GA FlgB-Dc loop XXXXXXXXXXXX 12 F F F 7e81 3 AA,BA,CA,DA,Y,Z GH,GI,GJ,GK,GF,GG FliE helix 1 XXXXXXXXXXXXXXXXXX 18 F F F 7e82 3 AA,BA,CA,DA,EA 0,1,2,3,4 Flagellar MS ring L2 XXXXXXXXXXXXXXX 15 F F F 7e87 3 C,D,G,H J,I,E,F DPP6_HUMAN DPPX,DIPEPTIDYL AMINOPEPTIDASE-RELATED PROTEIN,DIPEPTIDYL PEPTIDASE 6,DIPEPTIDYL PEPTIDASE IV-LIKE PROTEIN,DIPEPTIDYL PEPTIDASE VI,DPP VI WKGIAIALLVILVICSLIVTSVILLTPA 28 T 0.11 TMEM_230_134 unppercent F Eukaryota T 7e8e 3 C,I K,I DPP6_HUMAN DPPX,DIPEPTIDYL AMINOPEPTIDASE-RELATED PROTEIN,DIPEPTIDYL PEPTIDASE 6,DIPEPTIDYL PEPTIDASE IV-LIKE PROTEIN,DIPEPTIDYL PEPTIDASE VI,DPP VI AAAAKGIAIALLVILVICSLIVTSVILLTPA 31 T 0.11 TMEM_230_134 unppercent F Eukaryota T 7e8e 5 F,L L,J DPP6_HUMAN DPPX,DIPEPTIDYL AMINOPEPTIDASE-RELATED PROTEIN,DIPEPTIDYL PEPTIDASE 6,DIPEPTIDYL PEPTIDASE IV-LIKE PROTEIN,DIPEPTIDYL PEPTIDASE VI,DPP VI AKGIAIALLVILVICSLIVTSVILLTPA 28 T 0.11 TMEM_230_134 unppercent F Eukaryota T 7e9k 2 E,F C,F DAG1_HUMAN mono-mannosyl peptide (379Man long peptide) XTIRTRGAIIQTPTLGPIQPTRX 23 T 19 Ste5 pdbhh F Eukaryota T 7e9l 2 C C DAG1_HUMAN mono-mannosyl peptide (379Man short peptide) XQTPTLGPIQPTRX 14 T 5.1 Ste5 pdbhh F Eukaryota T 7e9m 2 B,D B,D SPNDC_HUMAN SPIN1-DOCKING PROTEIN,SPIN-DOC FAAPAEVRHFTDGSFPAGFVLQLFSHT 27 T 23 DUF2852 pdbhh F Eukaryota T 7e9s 2 B B a polypeptide linked to an inhibitory N-glycosylation sequon-containing peptide APYXVTASCR 10 T 7.4 SsgA pdbhh F T 7ea1 2 B,D B,D SPNDC_HUMAN SPINDOC DOCPEP2 VRKKRGRPMTKN 12 T 3 AT_hook pdbhh F Eukaryota T 7ea7 2 C C AZI2_HUMAN NAP1_LIR motif EDDICILNHEK 11 F F Eukaryota T 7eau 1 A A N4VVN8_COLOR SIN1 QEGKCTAKGECQENTSGVKLFCTSGSCAKKEGQACTRNGPGSSNSASCPK 50 T 0.011 Nodulin_late unp F Eukaryota T 7ebd 1 A,B A,B A0A133PTK7_9BACT STING SGGGLPSTVIAISYFEGFVKLAAEWIVTEMPTTEIDGKTYTSGKLYIKMPETLDTDIKKSAMLFYKKQGLNETQMSTNHRNYPIHIVSKEEGDTLEVYDMPTILSGIDKAIDMYFRVGHIGKTTEQQLAEDNEMNNFKRVLQLLINEDSFCRECVEILRQA 161 T 1.7E-05 TMEM173 pdbhh F Bacteria T 7ebl 1 A,B A,B STING GIHLGELGLLPSTVLAIGYFENLVNIICESLNMLPKLEVSGKEYKKFKFTIVIPKDLDANIKKRAKIYFKQKSLIEIEIPTSSRNYPIHIQFDENSTDDILHLYDMPTTIGGIDKAIEMFMRKGHIGKTDQQKLLEERELRNFKTTLENLIATDAFAKEMVEVIIEE 167 T 0.26 Pih1_fungal_CS pdbpercent F T 7ec3 2 C,D D,G 2-acetamido-2-deoxy-alpha-D-glucopyranose-(1-35)-[2-acetamido-2-deoxy-alpha-D-galactopyranose-(1-65)]5,6-DIHYDRO-BENZO[H]CINNOLIN-3-YLAMINE SDSDSDSD 8 T 8 Ripply pdbhh F F 7ec6 2 C D ASP-SER-ASP DSD 3 T 180 Glyco_trans_2_3 pdbhh F F 7eca 2 B B NF2L2_MOUSE LEU-ASP-GLU-GLU-THR-GLY-GLU-PHE-LEU-PRO EQEKAFFAQFQLDEETGEFLPIQPA 25 T 0.055 Radial_spoke unppercent F Eukaryota T 7ecd 1 A A R5MX27_9FIRM Phosphatidate cytidylyltransferase MHHHHHHMKDFIKEFLNERPEVVAAFGYGSGVFKQLGYDSKEKPQIDLILIVNDMKLWHKENIKKNPKDYSFIGRNFFLNSSIDEIKGITGITYQSNIEYKGHLFKYGIIEYGDFVRHMQTWDSFYVPGRFQKPILTIKSNNFIDELILQNRRNACKVGLLCLNNKDLKDLYLTICNLSYSGDTRMKVAENPKKVENIVGASYDKFNEMYNFNDLYQKNGERIEYEIDIDELPSSLEKYIKDDKTKEKVMEYLSDLNRKESSLQTMKGIKTN 272 T 4.6E-13 Tam41_Mmp37 unppssm F Bacteria T 7ecv 5 J,K I,J A0A0R6PCL0_9CAUD AcrIF14 MKKIEMIEISQNRQNLTAFLHISEIKAINAKLADGVDVDKKSFDEICSIVLEQYQAKQISNKQASEIFETLAKANKSFKIEKFRCSHGYNEIYKYSPDHEAYLFYCKGGQGQLNKLIAENGRFM 124 T 0.053 GatB_Yqey pdb T Viruses T 7ecw 4 I,J I,J A0A0R6PCL0_9CAUD AcrIF14 MKKIEMIEISQNRQNLTAFLHISEIKAINAKLADGVDVDKKSFDEICSIVLEQYQAKQISNKQASEIFETLAKANKSFKIEKFRCSHGYNEIYKYSPDHEAYLFYCKGGQGQLNKLIAENGRFM 124 T 0.053 GatB_Yqey pdb T Viruses T 7edo 3 C,F C,F CYS-ASN-VAL-THR-LEU-ASN-TYR-PRO CNVTLNYP 8 T 1.2 DUF2884 pdbhh F T 7edp 1 A B DOT1L_HUMAN DOT1-LIKE PROTEIN,HISTONE H3-K79 METHYLTRANSFERASE,H3-K79-HMTASE,LYSINE N-METHYLTRANSFERASE 4 DWATLSLEKLLKEKQALKSQISEKQRHCLELQISIVELEK 40 T 0.02 HALZ pdbpercent F Eukaryota T 7edz 1 A,B,C,D D,C,B,A PPCS_HUMAN PHOSPHOPANTOTHENOYLCYSTEINE SYNTHETASE,PPC SYNTHETASE MAEMDPVAEFPQPPGAARWAEVMARFAARLGAQGRRVVLVTSGGTKVPLEARPVRFLDNFSSGRRGATSAEAFLAAGYGVLFLYRARSAFPYAHRFPPQTWLSALRPSGPALSGLLSLEAEENALPGFAEALRSYQEAAAAGTFLVVEFTTLADYLHLLQAAAQALNPLGPSAMFYLAAAVSDFYVPVSEMPEHKIQSSGGPLQITMKMVPKLLSPLVKDWAPKAFIISFKLETDPAIVINRARKALEIYQHQVVVANILESRQSFVLIVTKDSETKLLLSEEEIEKGVEIEEKIVDNLQSRHTAFIGDRN 311 T 1.7E-07 DFP pdbpercent F Eukaryota T 7eeb 12 L N Unknown XXXXXXXXXXXXXXXXXXXXXXXXXXXX 28 F F F 7eeb 14 N K CTSRZ_MOUSE CATSPER-ZETA,CATSPERZETA,PROTEIN EXPRESSED IN MALE LEPTOTENE AND ZYGOTENE SPERMATOCYTES 622,MLZ-622,TESTIS-EXPRESSED PROTEIN 40 MEESVKPVPKHANHRRSSVRSSLYGDVRDLWSTATMSTANVSVSDVCEDFDEEGKSVRNRIRKYSQTISIRDSLNLEPEEIQQQARRELELCHGRSLEHGEDHEESETSLASSTSESLIFSLWKPHRTYWTEQQNRLPLPLMELMETEVLDILKKALITYRSTIGRNHFMTKELQGYIEGIRKRRNKRLYFLDQ 194 T 18 EGL-1 pdbhh F Eukaryota T 7eel 2 H,I,J,K,L,M,N H,I,J,K,L,M,N Cement (decoration) proteins MPATNSAQARLAAPGHGFGGNVKVSYGSVAFTGTITTADAATVCNLPVGAIVLGVTLESDDLDTNATPTITLNVGDAGSATRYFSASTVAQAGTSSSAPATTGLLWTVTEGNTAVRIAVANNAATSADGSVRVAVTYYLP 140 T 57 DUF6476 pdbhh F T 7eep 1 A,B,C,D,E,F,G,H,I,J,K,L A,B,C,D,E,F,G,H,I,J,K,L Pam1 portal proteins MEDTMTMPSHAQLKAYFEEARDANEEYRKEAFIDRDYFDGHQWTEEELQKLEARKQPATYFNEVKLSIRGLVGVFEQGDSDPRAWPRNPQDEDSADIATKALRYVKDYSEWSDERSRAALNYFVEGTCAAIVGVDENGRPEIEPIRFEEFFHDPRSRELDFSDARFKGVAKWRFADEVGMEYGIKGEIDGALDGDSEGLSIGGDTFGDRPDGKISSWIDSKLRRVFVVEMYVRWNGVWIRALFWGRGILEMSVSAYLDRNGKPTCPIEARSCYIDRENRRYGEVRDLRSPQDAINKRESKLLHMLNNRQAIATNPEYAYNSDAEMVRKEMSKPDGIIPPGWQPASMTDLANGQFALLSSAREFIQRIGQNPSVLAAQSASASGRAQLARQQAGMVDSAMALNGLRRFELAVYRQAWLRCRQFWKAPDYIRVTDDEGAPQFVGINQPIKGPPQPVLNEMGQVVIAEPILGYENALAELDVDINIDAVPDTANLAQEQFLQLTELARLYGPQEVPFDDLLELSSMPEKTKLIAKRRERSEQMAQVQAQQGQMQEQIAMQGAMAEIENTQADTAYLAARAQNEMLKPQIEAFKAGFGAA 596 T 0.00011 P22_portal pdbpssm F T 7eep 2 M,N,O,P,Q,R,S,T,U,V,W,X M,N,O,P,Q,R,S,T,U,V,W,X Pam1 adaptor proteins MITCRDIITLGLQQARVVPLGREPKAKEADAGLTVLQSIYDSMFADGPLGPFTEVYATSAYTAQENERIVTNGAAITIPQTITEGNETRKPYDLTAIIVINGAAQENHVFSLGRWQTAHDLTLNSEAPLAERDKAGLAALFAMEFAEMFGAELPPRTTARGFRFKGAISQKLATKRDDPVYY 182 F F T 7eeq 2 G,H,I,J,K,L,M,N,O,P,Q,R,S,T,U,V,W,X A,B,C,D,E,F,G,H,I,J,K,L,M,N,O,P,Q,R Tailspike head-binding domain MAAELHITPSRATSSNGLNLDGAKWFFYQTGTTTPQSVYTTAALSVAHSNPVVADAAGKFPAIYFDTTLEYRGVLKTADEATTIYDIDPINSGILSVLGTSS 102 T 0.00073 Big_1 pdbpercent F T 7ees 1 A A GLY-THR-ILE-ASP-PRO-GLN-ASN-SER-GLU-GLU-HIS-PRO-VAL-LEU-SER-ARG-ARG-LEU-GLU-ASN GTIDPQNSEEHPVLSRRLEN 20 T 9.9 DUF3243 pdbhh F T 7ef0 2 C P H32_MAIZE Histone H3.2 ARTKQTARMSTGGKAPRKQ 19 T 350 Sirohm_synth_M pdbhh F Eukaryota T 7egb 23 W Q TF2AA_HUMAN GENERAL TRANSCRIPTION FACTOR IIA SUBUNIT 1,TFIIAL,TRANSCRIPTION INITIATION FACTOR TFIIA 42 KDA SUBUNIT,TFIIA-42 MANSANTNTVPKLYRSVIEDVINDVRDIFLDDGVDEQVLMELKTLWENKLMQSRAVDGFHSEEQQLLLQVQQQHQPQQQQHHHHHHHQQAQPQQTVPQQAQTQQVLIPASQQATAPQVIVPDSKLIQHMNASNMSAAATAATLALPAGVTPVQQILTNSGQLLQVVRAANGAQYIFQPQQSVVLQQQVIPQMQPGGVQAPVIQQVLAPLPGGISPQTGVIIQPQQILFTGNKTQVIPTTVAAPTPAQAQITATGQQQPQAQPAQTQAPLVLQVDGTGDTSSEEDEDEEEDYDDDEEEDKEKDGAEDGQVEEEPLNSEDDVSDEEGQELFDTENVVVCQYDKIHRSKNKWKFHLKDGIMNLNGRDYIFSKAIGDAEW 376 F F Eukaryota T 7egm 6 F I SNF6_YEAST SWI/SNF COMPLEX COMPONENT SNF6 MGVIKKKRSHHGKASRQQYYSGVQVGGVGSMGAINNNIPSLTSFAEENNYQYGYSGSSAGMNGRSLTYAQQQLNKQRQDFERVRLRPEQLSNIIHDESDTISFRSNLLKNFISSNDAFNMLSLTTVPCDRIEKSRLFSEKTIRYLMQKQHEMKTQAAELQEKPLTPLKYTKLIAAAEDGSRSTKDMIDAVFEQDSHLRYQPDGVVVHRDDPALVGKLRGDLREAPADYWTHAYRDVLAQYHEAKERIRQKEVTAGEAQDEASLQQQQQQDLQQQQQVVTTVASQSPHATATEKEPVPAVVDDPLENMFGDYSNEPFNTNFDDEFGDLDAVFF 332 T 0.0043 CENP-Q pdbpercent F Eukaryota T 7egp 5 E I SNF6_YEAST SWI/SNF COMPLEX COMPONENT SNF6 MGVIKKKRSHHGKASRQQYYSGVQVGGVGSMGAINNNIPSLTSFAEENNYQYGYSGSSAGMNGRSLTYAQQQLNKQRQDFERVRLRPEQLSNIIHDESDTISFRSNLLKNFISSNDAFNMLSLTTVPCDRIEKSRLFSEKTIRYLMQKQHEMKTQAAELQEKPLTPLKYTKLIAAAEDGSRSTKDMIDAVFEQDSHLRYQPDGVVVHRDDPALVGKLRGDLREAPADYWTHAYRDVLAQYHEAKERIRQKEVTAGEAQDEASLQQQQQQDLQQQQQVVTTVASQSPHATATEKEPVPAVVDDPLENMFGDYSNEPFNTNFDDEFGDLDAVFF 332 T 0.0043 CENP-Q pdbpercent F Eukaryota T 7egs 1 A A RPOB_ECOLI RNAP SUBUNIT BETA,RNA POLYMERASE SUBUNIT BETA,TRANSCRIPTASE SUBUNIT BETA AMDSPGVFFDSDKGKTHSSGKVLYNARIIPYRGSWLDFEFDPKDNLFVRIDRRRKLPATIILRALNYTTEQILDLFFEKVIFEIRDNKLQMELVPERLRGETASFDIEANGKVYVEKGRRITARHIRQLEKDDVKLIEVPVEYIAGKVVAKDYIDESTGELICAANMELSLDLLAKLSQSGHKRIETLFTNDLDHGPYISETLRVDPTNDRLSALVEIYRMMRPGEPPTREAAESLFENLFFSEDRYDLSAVGRMKFNRSLLREEIEGSGILSKDDIIDVMKKLIDIRNGKGEVD 295 T 5.6E-08 RNA_pol_Rpb2_2 pdb F Bacteria T 7egs 2 B B UVRD_ECOLI DNA helicase II AMDVSHQRMGTPMVENDSGYKLGQRVRHAKFGEGTIVNMEGSGEHSRLQVAFQGQGIKWLVAAYARLESV 70 T 0.00015 Tudor_1_RapA pdbhh F Bacteria T 7egu 2 B B macrocyclic peptide X FPLIFPRKGCGG 12 T 0.28 MOSC_N pdbhh F T 7ehz 2 B B macrocyclic peptide 2 GXPYRPXXC 9 T 1.4 LINES_C pdbhh F T 7ei2 2 B B macrocyclic peptide 8 GXPYKPXXC 9 T 1.8 LINES_C pdbhh F T 7eic 1 A C H4_HUMAN Histone H4 SGRGXGGXGLGK 12 T 11 Shadoo unppercent F Eukaryota T 7eid 2 C,D C,D H4_HUMAN Histone H4 KGGXGLGXGG 10 T 11 Shadoo unppercent F Eukaryota F 7eii 1 A,B A,B FAD dependent L-Lys oxidase MGNKNTPLNSGKHPDLKIEVAIIGAGTSGLYTAYRLVTDKKFKAHDVQIFDMNNKLGGRLESVIMPGMNFWGELGGMRYLTSQQIVTTLIEGYPLSEKDPNKRTPVLKDKMTPVPFPMGDPSKLLMYLRKERFKQNAWNEAQKKGEKLPTRYYLNENDLGFSSDQLFNKIIYDVLMADPWVAETYGSKIIKGSSVYDYSFKLTSRDWDDIKPKLVYNFPNSPYDQRKVNDIGFWNLIKDQVSQEGYEFLANAGGYYSNTINWNSAEAFPYMVGDFSAGTIYKTIEEGYDSIAYAVANSYMEHEGACIWSENKLLTFTKDHPLTNTHKYELTFLNLKTNTQWKVYANSIVLAMPRKSLELLDQNNFFFNINKNSVLNNNIRSVIMEPAFAILMGFEYPWWKELGIDSGHSITDLPMRQCYYFGTDPETNNSMLLGSYGDMETETFWKALSDDKVLFEVKAAKSASLRELHQLDDVQATKLMVGELMNQLRELHGDTVTIPEPYVTYFKDWTDEPFGAGYHAWKAGFSVENVMPYMRKPLTDEQIHICGEAYSDQQGWVEGAFCEAEKMLQEYFGLDRPYWLSPDYYLGWEHHHHHH 595 T 3.4E-17 MCRA pdbhh F T 7eij 1 A,B A,B FAD dependent L-Lys oxidase MGNKNTPLNSGKHPDLKIEVAIIGAGTSGLYTAYRLVTDKKFKAHDVQIFDMNNKLGGRLESVIMPGMNFWGELGGMRYLTSQQIVTTLIEGYPLSEKDPNKRTPVLKDKMTPVPFPMGDPSKLLMYLRKERFKQNAWNEAQKKGEKLPTRYYLNENDLGFSSDQLFNKIIYDVLMADPWVAETYGSKIIKGSSVYDYSFKLTSRDWDDIKPKLVYNFPNSPYDQRKVNDIGFWNLIKDQVSQEGYEFLANAGGYYSNTINWNSAEAFPYMVGDFSAGTIYKTIEEGYDSIAYAVANSYMEHEGACIWSENKLLTFTKDHPLTNTHKYELTFLNLKTNTQWKVYANSIVLAMPRKSLELLDQNNFFFNINKNSVLNNNIRSVIMEPAFAILMGFEYPWWKELGIDSGHSITDLPMRQCYYFGTDPETNNSMLLGSYGDMETETFWKALSDDKVLFEVKAAKSASLRELHQLDDVQATKLMVGELMNQLRELHGDTVTIPEPYVTYFKDWTDEPFGAGYHAWKAGFSVENVMPYMRKPLTDEQIHICGEAYSDQQGWVEGAFCEAEKMLQEYFGLDRPYWLSPDYYLGWEHHHHHH 595 T 3.4E-17 MCRA pdbhh F T 7ein 2 C,D C,D leupeptin GLLX 4 T 590 DNTTIP1_dimer pdbhh F F 7ekn 1 A,C,E,G B,D,F,H ipep SQIEWAKARVEKLRKRNQALKSQTSELQRQIAELEASNAELKK 43 T 0.00013 GIT_CC pdb F T 7el1 5 E E 100AA MKSVKYISNMSKQEKGYRVYVNVVNEDTDKGFLFPSVPKEVIENDKIDELFNFEHHKPYVQKAKSRYDKNGIGYKIVQLDEGFQKFIELNKEKMKENLDY 100 T 24 ATP-synt_DE_N pdbhh F T 7elh 4 D,F,G,I,J,L,M,O,P,R D,E,F,G,H,I,J,K,L,M F1ARN3_9REOV LAMBDA1 YQCHVCSAVLFSPLDLDAHVASHGLHGNMTLTSSDIQRHITEFISSWQNHPIVQVSADVENKKTAQLLHADTPRLVTWDAGLCTSFKIVPIVPAQVPQDVLAYTFFTSSYAIQSPFPEAAVSRIVVHTRWASNVDFDRDSSVIMAPPTENNIHLFKQLLNTETLSVRGANPLMFRANVLHMLLEFVLDNLYLNRHTGFSQDHTPFTEGANLRSLPGPDAEKWYSIMYPTRMGTPNVSKICNFVASCVRNRVGRFDRAQMMNGAMSEWVDVFETSDALTVSIRGRWMARLARMNINPTEIEWALTECAQGYVTVTSPYAPSVNRLMPYRISNAERQISQIIRIMNIGNNATVIQPVLQDISVLLQRISPLQIDPTIISNTMSTVSESTTQTLSPASSILGKLRPSNSDFSSFRVALAGWLYNGVVTTVIDDSSYPKDGGSVTSLENLWDFFILALALPLTTDPCAPVKAFMTLANMMVGFETIPMDNQIYTQSRRASAFSTPHTWPRCFMNIQLISPIDAPILRQWAEIIHRYWPNPSQIRYGAPNVFGSANLFTPPEVLLLPIDHQPANVTTPTLDFTNELTNWRARVCELMKNLVDNQRYQPGWTQSLVSSMRGTLDKLKLIKSMTPMYLQQLAPVELAVIAPMLPFPPFQVPYVRLDRDRVPTMVGVTRQSRDTITQPALSLSTTNTTVGVPLALDARAITVALLSGKYPPDLVTNVWYADAIYPMYADTEVFSNLQRDMITCEAVQTLVTLVAQISETQYPVDRYLDWIPSLRASAATAATFAEWVNTSMKTAFDLSDMLLEPLLSGDPRMTQLAIQYQQYNGRTFNIIPEMPGSVIADCVQLTAEVFNHEYNLFGIARGDIIIGRVQSTHLWSPLAPPPDLVFDRDTPGVHIFGRDCRISFGMNGAAPMIRDETGLMVPFEGNWIFPLALWQMNTRYFNQQFDAWIKTGELRIRIEMGAYPYMLHYYDPRQYANAWNLTSAWLEEITPTSIPSVPFMVPISSDHDISSAPAVQYIISTEYNDRSLFCTNSSSPQTIAGPDKHIPVERYNILTNPDAPPTQIQLPEVVDLYNVVTRYAYETPPITAVVMGVP 1095 T 0.0075 zf_C2H2_6 unphh T Viruses T 7elh 5 E,H,K,N,Q e,g,i,k,m F1ARN3_9REOV LAMBDA1 MKRIPRKTKGKSSGKGNDSTERADDGSSQLRDKQNNKAGPATTEPGTSNREQYKARPGIASVQRATESAEMPMKNNDEGTPDKKGNTKGDLVNEHSEAKDEADEATKKQAKDTDKSKAQVTYSDTGINNANELSRSGNVDNEGGSNQKPMSTRIAEATSAIVSKHPARVGLPPTASSGHG 180 T 0.25 zf-C2H2_3rep unphh T Viruses T 7elj 3 C C IS1 VYYR 4 T 48 DUF1996 pdbhh F F 7elm 5 S,T U,V A0A8G3G219_PSEAI AcrIF24 MNAIHIGPFSITPAARGLHYGGLPHHQWTLYYGPREMAIKTLPDSYTSSEVRDEFSDIIAEFVIDARHRYAPDVLELVNSDGDAVLARVAVSRLPEALSGCIPDDRFPYWLLTASRPRLGLPVTLNEYTALAVELSAPPLAWITGLLPGEVLTHDAEEWRPPTSWELRHVVGEGSFTGVSGAAAAALLGMSATNFRKYTAGDSAANRQKISFAAWHYLLDRLGVKRAS 228 T 0.00054 DUF4447 pdbhh F Bacteria T 7eln 4 Q,R U,V A0A8G3G219_PSEAI AcrIF24 MNAIHIGPFSITPAARGLHYGGLPHHQWTLYYGPREMAIKTLPDSYTSSEVRDEFSDIIAEFVIDARHRYAPDVLELVNSDGDAVLARVAVSRLPEALSGCIPDDRFPYWLLTASRPRLGLPVTLNEYTALAVELSAPPLAWITGLLPGEVLTHDAEEWRPPTSWELRHVVGEGSFTGVSGAAAAALLGMSATNFRKYTAGDSAANRQKISFAAWHYLLDRLGVKRAS 228 T 0.00054 DUF4447 pdbhh F Bacteria T 7ely 1 A A 16X_BCL GCMILLDTDIWCPCSHPYACPENICC 26 T 1.6 WCCH pdbhh F T 7em9 3 C C SER-LEU-ASP-GLU-TYR-SER-SER-ASP-VAL SLDEYSSDV 9 T 9.7 V-ATPase_H_C pdbhh F T 7ema 3 C C TYR-SER-SER-ASP-VAL-THR-THR-LEU-VAL YSSDVTTLV 9 T 15 Peptidase_S66 pdbhh F T 7emb 3 C C ALA-ALA-ALA-ILE-GLU-GLU-GLU-ASP-ILE AAAIEEEDI 9 T 12 Dehydratase_MU pdbhh F F 7emc 3 C,F,I C,F,I ALA-THR-GLU-ILE-ARG-GLU-LEU-LEU-VAL ATEIRELLV 9 T 0.66 DUF5908 pdbhh F T 7emd 3 C C CAPSH_ASFB7 TYR-GLY-ASP-PHE-PHE-HIS-ASP-MET-VAL YGDFFHDMV 9 T 0.7 Ebp2 pdbhh T Viruses T 7emf 2 B B Unknown Chain (poly A) AAAAAAAAAAAAAAAAAAAA 20 T 510 Adeno_PIX pdbhh F F 7emz 1 A,B A,B NVFI_ASPN1 NvfI W199F MGSSHHHHHHSSGLVPRGSHMVGSRTWCESEMLFVQPDAGTKEELYYRVTPKPGQTQANFNWTPHKVRFHDARPQRDSFDLNTHGFTFVEDAISPQLIERIRADDTAAVEGDYFASVAALVKRVTGADHVVCFSPYTRKENSEKGIFGQPARTVHCDHTPAAAIELTHKLCGEDAVRLLQSRFRAFSVWRPLVEPVLDWPLAVVDGRTIAPDDLHPVHFLRYEKKDTEPPFQLSFSETQKWYYLSRQRSDEVSIVKNYDSEVVPSPRSAHCAFKHPFVPKDAPPRESIDVRCLVFGGR 298 T 0.28 EF-hand_5 unppercent F Eukaryota T 7ena 6 F DQ TFIIA-a MANSANTNTVPKLYRSVIEDVINDVRDIFLDDGVDEQVLMELKTLWENKLMQSRAVDGFHSEEQQLLLQVQQQHQPQQQQHHHHHHHQQAQPQQTVPQQAQTQQVLIPASQQATAPQVIVPDSKLIQHMNASNMSAAATAATLALPAGVTPVQQILTNSGQLLQVVRAANGAQYIFQPQQSVVLQQQVIPQMQPGGVQAPVIQQVLAPLPGGISPQTGVIIQPQQILFTGNKTQVIPTTVEEEPLNSEDDVSDEEGQELFDTENVVVCQYDKIHRSKNKWKFHLKDGIMNLNGRDYIFSKAIGDAEW 307 T 8.3E-42 TFIIA pdb F T 7enb 1 A,B A,B NVFI_ASPN1 NvfI MGSSHHHHHHSSGLVPRGSHMVGSRTWCESEMLFVQPDAGTKEELYYRVTPKPGQTQANFNWTPHKVRFHDARPQRDSFDLNTHGFTFVEDAISPQLIERIRADDTAAVEGDYFASVAALVKRVTGADHVVCFSPYTRKENSEKGIFGQPARTVHCDHTPAAAIELTHKLCGEDAVRLLQSRFRAFSVWRPLVEPVLDWPLAVVDGRTIAPDDLHPVHWLRYEKKDTEPPFQLSFSETQKWYYLSRQRSDEVSIVKNYDSEVVPSPRSAHCAFKHPFVPKDAPPRESIDVRCLVFGGR 298 T 0.28 EF-hand_5 unppercent F Eukaryota T 7enc 50 DB DQ TFIIA-a MANSANTNTVPKLYRSVIEDVINDVRDIFLDDGVDEQVLMELKTLWENKLMQSRAVDGFHSEEQQLLLQVQQQHQPQQQQHHHHHHHQQAQPQQTVPQQAQTQQVLIPASQQATAPQVIVPDSKLIQHMNASNMSAAATAATLALPAGVTPVQQILTNSGQLLQVVRAANGAQYIFQPQQSVVLQQQVIPQMQPGGVQAPVIQQVLAPLPGGISPQTGVIIQPQQILFTGNKTQVIPTTVEEEPLNSEDDVSDEEGQELFDTENVVVCQYDKIHRSKNKWKFHLKDGIMNLNGRDYIFSKAIGDAEW 307 T 8.3E-42 TFIIA pdb F T 7enh 2 B B AcrIIA14 protein MKSVKYISNMSKQEKGYRVYVNVVNEDTDKGFLFPSVPKEVIENDKIDELFNFEHHKPYVQKAKSRYDKNGIGYKIVQLDEGFQKFIELNKEKMKENLDY 100 T 24 ATP-synt_DE_N pdbhh F T 7eni 3 C C AcrIIA13 protein MNKSIEIKDQNNIVLIDSLGQFFTDIENDNNGRYNIDYVLLNEVEHDNGNTYYEVGMYRTEEVPFSDKVTQDNVELLEDKWLQIDQQGESYVESIFFENEEDAREYIKLVLKGHETFEETAKAIGVIK 128 T 0.054 EABR pdb F T 7enj 2 B B Unknown Chain XXXXXXXXXXXXXXXXXXXX 20 F F F 7enm 1 A A AcrIIA14 protein SMKSVKYISNMSKQEKGYRVYVNVVNEDTDKGFLFPSVPKEVIENDKIDELFNFEHHKPYVQKAKSRYDKNGIGYKIVQLDEGFQKFIELNKEKMKENLDY 101 T 25 ATP-synt_DE_N pdbhh F T 7enr 3 C C AcrIIA14 MKSVKYISNMSKQEKGYRVYVNVVNEDTDKGFLFPSVPKEVIENDKIDELFNFEHHKPYVQKAKSRYDKNGIGYKIVQLDEGFQKFIELNKEKMKENLDY 100 T 24 ATP-synt_DE_N pdbhh F T 7ep0 1 A,B A,B ZY11B_HUMAN Protein zyg-11 homolog B GSTEQTAQLGTELFIVRQLLQIVKQKTNQNSVDTTLKFTLSALWNLTDESPTTCRHFIENQGLELFMRVLESFPTESSIQQKVLGLLNNIAEVQELHSELMWKDFIDHISSLLHSVEVEVSYFAAGIIAHLISRGEQAWTLSRSQRNSLLDDLHSAILKWPTPECEMVAYRSFNPFFPLLGCFTTPGVQLWAVWAMQHVCSKNPSRYCSMLIEEGGLQHLYNIKDHEHTDPHVQQIAVAILDSLEKHIVR 250 T 0.0059 DUF3361 pdb F Eukaryota T 7ep1 1 A,B A,B ZY11B_HUMAN Protein zyg-11 homolog B GFLHVGAQLGTELFIVRQLLQIVKQKTNQNSVDTTLKFTLSALWNLTDESPTTCRHFIENQGLELFMRVLESFPTESSIQQKVLGLLNNIAEVQELHSELMWKDFIDHISSLLHSVEVEVSYFAAGIIAHLISRGEQAWTLSRSQRNSLLDDLHSAILKWPTPECEMVAYRSFNPFFPLLGCFTTPGVQLWAVWAMQHVCSKNPSRYCSMLIEEGGLQHLYNIKDHEHTDPHVQQIAVAILDSLEKHIVR 250 T 0.00014 DUF4704 pdb F Eukaryota T 7ep2 1 A,B,C,D A,C,D,B ZY11B_HUMAN Protein zyg-11 homolog B GGFNRFEAAKLVMQWLCNHEDQNMQRMAVAIISILAAKLSTEQTAQLGTELFIVRQLLQIVKQKTNQNSVDTTLKFTLSALWNLTDESPTTCRHFIENQGLELFMRVLESFPTESSIQQKVLGLLNNIAEVQELHSELMWKDFIDHISSLLHSVEVEVSYFAAGIIAHLISRGEQAWTLSRSQRNSLLDDLHSAILKWPTPECEMVAYRSFNPFFPLLGCFTTPGVQLWAVWAMQHVCSKNPSRYCSMLIEEGGLQHLYNIKDHEHTDPHVQQIAVAILDSLEKHIVR 288 T 0.00019 V-ATPase_H_N pdbpercent F Eukaryota T 7ep3 1 A A ZER1_HUMAN HZYG,ZYG-11 HOMOLOG B-LIKE PROTEIN,ZYG11B-LIKE PROTEIN GAGNKMGFVVTMLKLIQKKLLDKTCDQVMEFSWSALWNITDETPDNCEMFLNFNGMKLFLDCLKEFPEKQELHRNMLGLLGNVAEVKELRPQLMTSQFISVFSNLLESKADGIEVSYNACGVLSHIMFDGPEAWGVCEPQREEVEERMWAAIQSWDINSRRNINYRSFEPILRLLPQGISPVSQHWATWALYNLVSVYPDKYCPLLIKEGGMPLLRDIIKMATARQETKEMARKVIEHCSNFKEENMDTSR 251 T 0.0015 Arm_3 pdbpercent F Eukaryota T 7ep4 1 A,B B,A ZER1_HUMAN HZYG,ZYG-11 HOMOLOG B-LIKE PROTEIN,ZYG11B-LIKE PROTEIN GFLHVGKMGFVVTMLKLIQKKLLDKTCDQVMEFSWSALWNITDETPDNCEMFLNFNGMKLFLDCLKEFPEKQELHRNMLGLLGNVAEVKELRPQLMTSQFISVFSNLLESKADGIEVSYNACGVLSHIMFDGPEAWGVCEPQREEVEERMWAAIQSWDINSRRNINYRSFEPILRLLPQGISPVSQHWATWALYNLVSVYPDKYCPLLIKEGGMPLLRDIIKMATARQETKEMARKVIEHCSNFKEENMDTSR 253 T 0.0007 Arm_3 pdbpercent F Eukaryota T 7ep5 1 A,B A,B ZER1_HUMAN HZYG,ZYG-11 HOMOLOG B-LIKE PROTEIN,ZYG11B-LIKE PROTEIN GKLHVGKMGFVVTMLKLIQKKLLDKTCDQVMEFSWSALWNITDETPDNCEMFLNFNGMKLFLDCLKEFPEKQELHRNMLGLLGNVAEVKELRPQLMTSQFISVFSNLLESKADGIEVSYNACGVLSHIMFDGPEAWGVCEPQREEVEERMWAAIQSWDINSRRNINYRSFEPILRLLPQGISPVSQHWATWALYNLVSVYPDKYCPLLIKEGGMPLLRDIIKMATARQETKEMARKVIEHCSNFKEENMDTSR 253 T 0.001 Arm_3 pdbpercent F Eukaryota T 7ep7 2 B B WHRN_HUMAN AUTOSOMAL RECESSIVE DEAFNESS TYPE 31 PROTEIN TNQHFVMVEVHRPDSEPDVNEVRALPQTRT 30 T 190 Cwf_Cwc_15 pdbhh F Eukaryota T 7eqc 1 A,B,E,G B,C,D,G Q9XUS9_CAEEL CYK-4 GAGSMKSSTSKEKVCGENSRHIFNMILNSQRPQFDIKDIGMFHLIDEIERLRKLWKDSEESKKRLNADMREAEEALAKARKKLAMFDIDVKDTQKHLRALMEENKALKLDLNVYETREKQLKDA 124 T 0.0019 BRE1 pdbpercent F Eukaryota T 7eqg 3 H,I,M,N,O J,K,P,Q,R L7P7V3_9CAUD AcrIF5 MSRPTVVTVTETPRNPGSYEVNVERDGKMVVGRARAGSDPGAAAAKAMQMAMEWGSPNYVILGSNKVLAFIPEQLRVKM 79 T 0.067 Flagellin_D3 pdb T Viruses T 7esi 2 B B Peptide P1 EPSQQVTEIYQHHA 14 T 16 Inhibitor_I34 pdbhh F T 7esi 3 C C Peptide P2 DYAPTKLLPQQP 12 T 9.5 DUF724 pdbhh F T 7esw 1 A,B A,B G9MQD3_HYPVG Trichoderma secreted protein Tsp1 MAAPTPADKSMMAAVPEWTITNLKRVCNAGNTSCTWTFGVDTHLATATSCTYVVKANANASQASGGPVTCGPYTITSSWSGQFGPNNGFTTFAVTDFSKKLIVWPAYTDVQVQAGKVVSPNQSYAPANLPLEHHHHHH 138 T 5.5 DUF6520 unphh F Eukaryota T 7esx 1 A,B A,B B3CP62_WOLPP Bacteria factor 1 MPTQKELRDTMSKKLQEAIKHPDPAVVAGRKSAIKRWVGVLQDNFMEHIKYFKGDKLKFLHNVFQDEGCWSGVRLDNAALGQRFTEEKIGGIDNPLRKYEMACSYCVVDKIHPLFQKRFESYRNKFPPGAFDGKTETEFGKYVRNSLLDSIKRKGPVFDFWIDRESGELKKYDAVEGFDSAVKFKWSEGVEYFYNHLKEEDKEKKLTEAILALSRVQSVEKDAPILDFCVNKIVDKDTLLQKLSQKDKGVYSLFAELIESCFFDTVHDLVQCWCYKEVSAGGDHSEKIFSQRDYELFLSSLSDTMLKNPELSVQARSLIMEFWECGSLYQYRKAAVNTSNYTVPTSGVFAELIVNWRREDIYKTDEEKEIEKKEILDMMSFAKDCFPEKFELFKKLIIRDLRLCGREGKRVNVDYGLFAEELFSELEKTILPPGPVGDGPCSNLRSRSKAHGSKKTTLPVDDSPQSELGTPSVSGVSSYKKKSVFTLSGNKLEHHHHHH 499 T 0.21 DUF3437 pdbpssm F Bacteria T 7esy 1 A A B3CP62_WOLPP Bacteria factor 1 MPTQKELRDTMSKKLQEAIKHPDPAVVAGRKSAIKRWVGVLQDNFMEHIKYFKGDKLKFLHNVFQDEGCWSGVRLDNAALGQRFTEEKIGGIDNPLRKYEMACSYCVVDKIHPLFQKRFESYRNKFPPGAFDGKTETEFGKYVRNSLLDSIKRKGPVFDFWIDRESGELKKYDAVEGFDSAVKFKWSEGVEYFYNHLKEEDKEKKLTEAILALSRVQSVEKDAPILDFCVNKIVDKDTLLQKLSQKDKGVYSLFAELIESCFFDTVHDLVQCWCYKEVSAGGDHSEKIFSQRDYELFLSSLSDTMLKNPELSVQARSLIMEFWECGSLYQYRKAAVNTSNYTVPTSGVFAELIVNWRREDIYKTDEEKEIEKKEILDMMSFAKDCFPEKFELFKKLIIRDLRLCGREGKRVNVDYGLFAEELFSELEKTILPPGPVGDGPCSNLRSRSKAHGSKKTTLPVDDSPQSELGTPSVSGVSSYKKKSVFTLSGNKLEHHHHHH 499 T 0.21 DUF3437 pdbpssm F Bacteria T 7esy 2 B B B3CP63_WOLPP ULP_PROTEASE domain-containing protein MSNGDGLIRSLVDGDLEGFRQGFESFLDQCPSFLYHVSAGRFLPVFFFSMFSTAHDANILNANERVYFRFDNHGVNPRNGENRNTANLKVAVYRDGQQVVRCYSISDRPNSDGLRFSTRERNALVQEIRRQNPNLREEDLNFEQYKVCMHGKGKSQGEAIATVFEVIREKDRQGRDKFAKYSASEVHFLRQLFRNHRLTIKEIEGRQLNQNQLRQLGRSVNFTRVEPGQQRIDNFMEMLASNQRQDVRDSLRGDILEYVTDTYNNYRAQIENNIEGRSQKFESHGFLLGFLANFSHRYTIGVDLDLSPRNSHVAFLVRHQVERENIPIVINLATRAPPYIALNRARSHAERLHVFSFIPIHTESRNTVCVGLNFNLNLDPFSVDTVGLQQDRFPLVQRLFECLENEGIRENIRDFLLHHLPAEIPRNAENYDRIFDCITGFAFGNSAFDRHPLELEEEDEAPITKYIFRHGDEGLRCLTMVFHAEGSDIVILHIRAHDAQQQGAINLQTLNVNGNDVHVWEVSCTLNNQLELDIDLPNDLGLYHDYQNNNANNFLAGDLVQVPNTENVHNTLNQVVNDGWKNIAQHRGLFQEISGALMPLVDTINVNSEDKFRSILHGTFYASDNPYKVLAMYKVGQTYSLKRGQEEEGERVILTRITEQRLDLLLLRQPRENDLDTHPIGYVLRLANNAEEVGQQQNDARQEIGRLKKQHRGFIPITSGNEVVLFPIVFNRDAHEAGNLILFPEGIGREEHVHRLDRHVRLEHHHHHH 769 T 3.6E-05 PDDEXK_9 pdbhh F Bacteria T 7esz 2 B,D B,D B3CP73_WOLPP BACTERIA FACTOR A MESGLDHNYNKILDILKGAIKGDDNQVKARKHLRVERWLRAYIQLIEDFDEEKLIFFSDIFSDNSCWDGIKLKNKAVGERLTEEKNKNGKENPLDLADRYYLACKYCLEDKIPGLFEQVFMRFKRSAFEEDGSDDDLRRELLENIEETSPIEAFWSFLIDKQIGKLNEYKSVEGLQKSIQINSNKNWEEGIEFFYNKLHNDSSISSQDKDDLLIEAALSAVKGYKEVDTIEFCLSKMDDEQKKKLLDRDYKENTYYAVLNVLVGQYYFDSFMELSRLCSQIECERYTTFLSSLSDQVLKNPDLSEETKKCMMNVWERIIKLKTQDRGEQSISSIFVDYSVTYTIANLIVDPSRQGVSKEEILGKILKHVKEMSGEEMIKVKDSVLSKIQLFHGGKKLQLGEQVFSKLAQEASKESILREAGDTLPQSSLSTTDTPYNIKSLSHSKLEHHHHHH 453 T 34 Ldr_toxin pdbhh F Bacteria T 7et0 2 B,D B,D B3CP73_WOLPP Bacteria factor A MESGLDHNYNKILDILKGAIKGDDNQVKARKHLRVERWLRAYIQLIEDFDEEKLIFFSDIFSDNSCWDGIKLKNKAVGERLTEEKNKNGKENPLDLADRYYLACKYCLEDKIPGLFEQVFMRFKRSAFEEDGSDDDLRRELLENIEETSPIEAFWSFLIDKQIGKLNEYKSVEGLQKSIQINSNKNWEEGIEFFYNKLHNDSSISSQDKDDLLIEAALSAVKGYKEVDTIEFCLSKMDDEQKKKLLDRDYKENTYYAVLNVLVGQYYFDSFMELSRLCSQIECERYTTFLSSLSDQVLKNPDLSEETKKCMMNVWERIIKLKTQDRGEQSISSIFVDYSVTYTIANLIVDPSRQGVSKEEILGKILKHVKEMSGEEMIKVKDSVLSKIQLFHGGKKLQLGEQVFSKLAQEASKESILREAGDTLPQSSLSTTDTPYNIKSLSHSKLEHHHHHH 453 T 34 Ldr_toxin pdbhh F Bacteria T 7etn 1 A,B A,B PRO-PHE-LEU-ILE PFLI 4 T 44 DUF4123 pdbhh F F 7etp 1 A,B,C A,B,C Pro-Phe-Leu-Phe PFLF 4 T 41 NRDE-2 pdbhh F F 7etq 1 A A Pro-Met-Leu-Leu PMLL 4 T 110 GP52 pdbhh F F 7ett 1 A B peptide-inhibitor hit QFPFV 5 T 24 PatG_C pdbhh F F 7etu 1 A B peptide-inhibitor hit SFPFT 5 T 29 DUF1894 pdbhh F F 7etv 2 B B peptide-inhibitor hit DFPFV 5 T 23 OST_IS pdbhh F F 7eu3 15 O T Unidentified stromal protein XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 61 F F F 7eu3 24 X 9 F2CPQ4_HORVV Photosynthetic NDH subunit of subcomplex B4 NQRDWVVTKSIWHLSDTAIKSFYTFYAMFTVWGVCFFASMKASMADPFYDSEHYRGQGGDGTVHWYYDRQEDIEATARGDLLR 83 T 0.023 CCDC142 unp F Eukaryota T 7eu3 25 Y 0 F2DWH9_HORVV Photosynthetic NDH subunit of subcomplex B5 GPLTEIEPDLQEDPIDKWRTNGVSPEDFVYGVYDGHHTYDEGQEKKGFWEDVSEWYQEAEPPQGFQALISWSFPPAVILGMAFDVPGEYLYIGAAIFIVVFCIIEMDKPDKPHNFEPEIYMMERSKRDKLIADYNSMDIWDFNEKYGELWDFTVN 155 T 0.06 DUF3098 pdb F Eukaryota T 7eu9 1 A A Cas12i1 D647A mutant MSNKEKNASETRKAYTTKMIPRSHDRMKLLGNFMDYLMDGTPIFFELWNQFGGGIDRDIISGTANKDKISDDLLLAVNWFKVMPINSKPQGVSPSNLANLFQQYSGSEPDIQAQEYFASNFDTEKHQWKDMRVEYERLLAELQLSRSDMHHDLKLMYKEKCIGLSLSTAHYITSVMFGTGAKNNRQTKHQFYSKVIQLLEESTQINSVEQLASIILKAGDCDSYRKLRIRCSRKGATPSILKIVQDYELGTNHDDEVNVPSLIANLKEKLGRFEYECEWKCMEKIKAFLASKVGPYYLGSYSAMLENALSPIKGMTTKNCKFVLKQIDAKNDIKYENEPFGKIVEGFFDSPYFESDTNVKWVLHPHHIGESNIKTLWEDLNAIHSKYEEDIASLSEDKKEKRIKVYQGDVCQTINTYCEEVGKEAKTPLVQLLRYLYSRKDDIAVDKIIDGITFLSKKHKVEKQKINPVIQKYPSFNFGNNSKLLGKIISPKDKLKHNLKCNRNQVDNYIWIEIKVLNTKTMRWEKHHYALSSTRFLEEVYYPATSENPPDALAARFRTKTNGYEGKPALSAEQIEQIRSAPVGLRKVKKRQMRLEAARQQNLLPRYTWGKDFNINICKRGNNFEVTLATKVKKKKEKNYKVVLGYAANIVRKNTYAAIEAHANGDGVIDYNDLPVKPIESGFVTVESQVRDKSYDQLSYNGVKLLYCKPHVESRRSFLEKYRNGTMKDNRGNNIQIDFMKDFEAIADDETSLYYFNMKYCKLLQSSIRNHSSQAKEYREEIFELLRDGKLSVLKLSSLSNLSFVMFKVAKSLIGTYFGHLLKKPKNSKSDVKAPPITDEDKQKADPEMFALRLALEEKRLNKVKSKKEVIANKIVAKALELRDKYGPVLIKGENISDTTKKGKKSSTNSFLMDWLARGVANKVKEMVMMHQGLEFVEVNPNFTSHQDPFVHKNPENTFRARYSRCTPSELTEKNRKEILSFLSDKPSKRPTNAYYNEGAMAFLATYGLKKNDVLGVSLEKFKQIMANILHQRSEDQLLFPSRGGMFYLATYKLDADATSVNWNGKQFWVCNADLVAAYNVGLVDIQKDFKKKLEHHHHHH 1101 T 25 DUF4060 pdbhh F T 7euo 6 F C Peptide Agonist fMLF MLF 3 T 140 DUF3719 pdbhh F F 7eus 1 A,B A,B CTB9_CERBT 2-oxoglutarate (2-OG)-dependent dioxygenase MTSTTTTTETLQEAVPFVAPPSPPEDVNNKELPEKPYYDVEFNYRLDPRDGGDEVIWGGTVGLMRRKYETRTVRINNERGNEHNFNLDTHGFAWVKHKTSVTEFADYLAIRQGPYYGEVAEMLKRVTGATKVHVIGHLHRSLNYNDTTEEEKNAPDMTMTKGQTPGRFVHVDQSYQGAVRRLYLDLPQEEARRLEKTRWAIINVWRPVRKVTNEPLAVCDARSVREDELFNTLHLVPMRWPDAAPQENQMWAVAPPKTPTQHKWHYVSGMTEDEALLIKMFDSKKDGTARRVPHSSFPTPDDFGEPRASTETRCFVFWEDQEAEALEHHHHHH 333 T 0.5 EF-hand_5 pdbpercent F Eukaryota T 7eut 1 A,B A,B CTB9_CERBT 2-oxoglutarate (2-OG)-dependent dioxygenase MTSTTTTTETLQEAVPFVAPPSPPEDVNNKELPEKPYYDVEFNYRLDPRDGGDEVIWGGTVGLMRRKYETRTVRINNERGNEHNFNLDTHGFAWVKHKTSVTEFADYLAIRQGPYYGEVAEMLKRVTGATKVHVIGHLHRSLNYNDTTEEEKNAPDMTMTKGQTPGRFVHVDQSYQGAVRRLYLDLPQEEARRLEKTRWAIINVWRPVRKVTNEPLAVCDARSVREDELFNTLHLVPMRWPDAAPQENQMWAVAPPKTPTQHKWHYVSGMTEDEALLIKMFDSKKDGTARRVPHSSFPTPDDFGEPRASTETRCFVFWEDQEAEALEHHHHHH 333 T 0.5 EF-hand_5 pdbpercent F Eukaryota T 7euu 1 A,B A,B CTB9_CERBT 2-oxoglutarate (2-OG)-dependent dioxygenase MTSTTTTTETLQEAVPFVAPPSPPEDVNNKELPEKPYYDVEFNYRLDPRDGGDEVIWGGTVGLMRRKYETRTVRINNERGNEHNFNLDTHGFAWVKHKTSVTEFADYLAIRQGPYYGEVAEMLKRVTGATKVHVIGHLHRSLNYNDTTEEEKNAPDMTMTKGQTPGRFVHVDQSYQGAVRRLYLDLPQEEARRLEKTRWAIINVWRPVRKVTNEPLAVCDARSVREDELFNTLHLVPMRWPDAAPQENQMWAVAPPKTPTQHKWHYVSGMTEDEALLIKMFDSKKDGTARRVPHSSFPTPDDFGEPRASTETRCFVFWEDQEAEALEHHHHHH 333 T 0.5 EF-hand_5 pdbpercent F Eukaryota T 7eux 2 B S ALA-PRO-GLU-ALA-VAL APEAV 5 T 180 DUF6436 pdbhh F F 7euy 2 B S ALA-PRO-GLU-ALA-VAL APEAV 5 T 180 DUF6436 pdbhh F F 7ev4 2 B S F-b20-Q peptide {ortho-aminobenzoic acid (Abz)- QLRSLNGEWRFAWFPAPEAV[Tyr(3-NO2)]A} AVXA 4 T 280 Qn_am_d_aIII pdbhh F F 7ev6 2 B S F-b20-Q peptide {ortho-aminobenzoic acid (Abz)- QLRSLNGEWRFAWFPAPEAV[Tyr(3-NO2)]A} EAVXA 5 T 140 FRG2 pdbhh F F 7ev8 2 B B PHOSP_PI3H4 Phosphoprotein MESDAKNYQIMDSWEEEPRDKSTNISSALNIIEFILSTDPQE 42 T 1.7 SBP_bac_1 pdbhh T Viruses T 7evn 2 B C SF3B1_HUMAN PRE-MRNA-SPLICING FACTOR SF3B 155 KDA SUBUNIT,SF3B155,SPLICEOSOME-ASSOCIATED PROTEIN 155,SAP 155 MASDYKDDDDKASDEVDAGTMKSVNDQPSGNLPFLKPDDIQYFDKLLVDVDESTLSPEEQKERKIMKLLLKIKNGTPPMRKAALRQITDKAREFGAGPLFNQILPLLMSPTLEDQERHLLVKVIDRILYKLDDLVRPYVHKILVVIEPLLIDEDYYARVEGREIISNLAKAAGLATMISTMRPDIDNMDEYVRNTTARAFAVVASALGIPSLLPFLKAVCKSKKSWQARHTGIKIVQQIAILMGCAILPHLRSLVEIIEHGLVDEQQKVRTISALAIAALAEAATPYGIESFDSVLKPLWKGIRQHRGKGLAAFLKAIGYLIPLMDAEYANYYTREVMLILIREFQSPDEEMKKIVLKVVKQCCGTDGVEANYIKTEILPPFFKHFWQHRMALDRRNYRQLVDTTVELANKVGAAEIISRIVDDLKDEAEQYRKMVMETIEKIMGNLGAADIDHKLEEQLIDGILYAFQEQTTEDSVMLNGFGTVVNALGKRVKPYLPQICGTVLWRLNNKSAKVRQQAADLISRTAVVMKTCQEEKLMGHLGVVLYEYLGEEYPEVLGSILGALKAIVNVIGMHKMTPPIKDLLPRLTPILKNRHEKVQENCIDLVGRIADRGAEYVSAREWMRICFELLELLKAHKKAIRRATVNTFGYIAKAIGPHDVLATLLNNLKVQERQNRVCTTVAIAIVAETCSPFTVLPALMNEYRVPELNVQNGVLKSLSFLFEYIGEMGKDYIYAVTPLLEDALMDRDLVHRQTASAVVQHMSLGVYGFGCEDSLNHLLNYVWPNVFETSPHVIQAVMGALEGLRVAIGPCRMLQYCLQGLFHPARKVRDVYWKIYNSIYIGSQDALIAHYPRIYNDDKNTYIRYELDYIL 872 T 0.0012 Adaptin_N pdbpercent F Eukaryota T 7evp 2 B,D C,D GP168_BPTWO GP168 MLFFKEKFYNELSYYRGGHKDLESMFELALEYIEKLEEEDEQQVTDYENAMEEELRDAVDVIESQLEIIKDIVR 74 T 0.067 DUF3810 pdb T Viruses T 7evr 2 B,D B,D SETD2_HUMAN HIF-1,HUNTINGTIN YEAST PARTNER B,HUNTINGTIN-INTERACTING PROTEIN 1,HIP-1,HUNTINGTIN-INTERACTING PROTEIN B,LYSINE N-METHYLTRANSFERASE 3A,PROTEIN-LYSINE N-METHYLTRANSFERASE SETD2,SET DOMAIN-CONTAINING PROTEIN 2,HSET2,P231HBP YPPGYPMQAYVDPSNPNAGKVLLPTP 26 T 0.027 DUF3592 pdb F Eukaryota T 7evs 2 C,D C,D SETD2_HUMAN HIF-1,HUNTINGTIN YEAST PARTNER B,HUNTINGTIN-INTERACTING PROTEIN 1,HIP-1,HUNTINGTIN-INTERACTING PROTEIN B,LYSINE N-METHYLTRANSFERASE 3A,PROTEIN-LYSINE N-METHYLTRANSFERASE SETD2,SET DOMAIN-CONTAINING PROTEIN 2,HSET2,P231HBP SNPNAGKVLLPTP 13 T 40 ANAPC16 pdbhh F Eukaryota T 7ew8 1 A,B A,B A0A2S6F2G5_LEGPN ANKD MGSSHHHHHHSSGLVPRGSHMASMLTPPPDSKISTTDKSLDKLSAPLDMLKQMNESTMEQTKLDELRKKMSLQAEILNKAKADNDMFFRLLIELMSLKLQGELFKEQLSKISKESGYDSAQSALIQATNSEGQSPLQYALQKQDFSTAKYFLDNGAKAGPIEKAVFEIALDSKAAKEFGFPPLPPEKEKLHPVKNFGLVLGIKTTSVDGTPSQFGHIAPTYQLMTDSVSHFAKSHPGNKNFQEIANAFQFSNEASAFKFSTPQRNPEAGNDLARRIQGGELTTIPVSCKGHAMGLSYVPDGPGSKSGYLVYTNRGLGAKSSEHGTHIFRIEDSSKITPEFINNMTSGHSNGASHDEIMSQIKAAAGNKEPIHHIKQKGQKNDNCTIANSKSNIEGILLCQKAREVGGFDKLTESDMDSVKKEYKEFTKHMRVEKVNELAKALKENPQDPDLNNLTKEYLKQHPNADPKLKQTLETALKQASESSMTLSQPGKTI 494 T 0.00015 Shigella_OspC unphh F Bacteria T 7ewj 2 C,F,I C,F,I L7PH55_STAAU PEMI INHIBITOR KSIEDRIKNFFQSGGKYTELEVDWEERVGREI 32 T 0.00075 DtxR unppercent F Bacteria T 7exe 2 C C ADA22_MOUSE ADAM 22 RPRSNSWQGNMGGNKKKIRGKRFRPRSNSTE 31 T 61 BBP1_N pdbhh F Eukaryota T 7exx 1 A A A0A0P7R7Y1_ECOLX DNA phosphorothioation-dependent restriction protein DptG GSHMYPIATNLKVSNNQLDSYLPIRNKNNNIDWQIVTGLVLSYAVKYKIDTYSLEQFREDCKTHLQILIDEPAFLSVLERMYFSSQDIFRVSPLFLLFHAQFDGEKISAGSTADKRLGTLFANLMRDFSLNNPIQDKLNFIEKEMLNKLNKKLIRLGEGPFAKEQPYLPYLVTCFQSDLAFLAEHPQYLLQELTNTLRLYAFSWCAQLALNLDNWQDGEPQSKSLFFILDTEKASSERDKIKLFGYKWFARQSEKLFPVLSALEVLQVKGEEKRPLWQVYQDCLGYSDTSNRVLNELNNYIQKFISKEERDLPERDRATNLEDAFKQLLSVAVEQFQGKKTERAAVNRKYINELESQICTDFIQVRGRAGKVLVLNQDRLLLLTNLTVGKNKKLRLHELLRGFEQRGFYLDNQSTQMLVAFYERMGNVERMSDSGDAVYVRKTV 444 T 0.04 DUF1798 pdbpercent F Bacteria T 7ey7 2 S,T,U,V,W,X s,t,u,v,w,x TUBE2_BPT7 GENE PRODUCT 12,GP12 MALISQSIKNLKGGISQQPDILRYPDQGSRQVNGWSSETEGLQKRPPLVFLNTLGDNGALGQAPYIHLINRDEHEQYYAVFTGSGIRVFDLSGNEKQVRYPNGSNYIKTANPRNDLRMVTVADYTFIVNRNVVAQKNTKSVNLPNYNPNQDGLINVRGGQYGRELIVHINGKDVAKYKIPDGSQPEHVNNTDAQWLAEELAKQMRTNLSDWTVNVGQGFIHVTAPSGQQIDSFTTKDGYADQLINPVTHYAQSFSKLPPNAPNGYMVKIVGDASKSADQYYVRYDAERKVWTETLGWNTEDQVLWETMPHALVRAADGNFDFKWLEWSPKSCGDVDTNPWPSFVGSSINDVFFFRNRLGFLSGENIILSRTAKYFNFYPASIANLSDDDPIDVAVSTNRIAILKYAVPFSEELLIWSDEAQFVLTASGTLTSKSVELNLTTQFDVQDRARPFGIGRNVYFASPRSSFTSIHRYYAVQDVSSVKNAEDITSHVPNYIPNGVFSICGSGTENFCSVLSHGDPSKIFMYKFLYLNEELRQQSWSHWDFGENVQVLACQSISSDMYVILRNEFNTFLARISFTKNAIDLQGEPYRAFMDMKIRYTIPSGTYNDDTFTTSIHIPTIYGANFGRGKITVLEPDGKITVFEQPTAGWNSDPWLRLSGNLEGRMVYIGFNINFVYEFSKFLIKQTADDGSTSTEDIGRLQLRRAWVNYENSGTFDIYVENQSSNWKYTMAGARLGSNTLRAGRLNLGTGQYRFPVVGNAKFNTVYILSDETTPLNIIGCGWEGNYLRRSSGI 794 T 0.031 MelC1 pdbpercent T Viruses T 7ey7 3 AA,BA,CA,DA,Y,Z C,D,E,F,A,B GP14_BPT7 GENE PRODUCT 14,GP14 MCWAAAIPIAISGAQAISGQNAQAKMIAAQTAAGRRQAMEIMRQTNIQNADLSLQARSKLEEASAELTSQNMQKVQAIGSIRAAIGESMLEGSSMDRIKRVTEGQFIREANMVTENYRRDYQAIFAQQLGGTQSAASQIDEIYKSEQKQKSKLQMVLDPLAIMGSSAASAYASGAFDSKSTTKAPIVAAKGTKTGR 196 T 0.056 Cpn60_TCP1 pdbpssm T Viruses T 7ey9 2 S,T,U,V,W,X s,t,u,v,w,x TUBE2_BPT7 GENE PRODUCT 12,GP12 MALISQSIKNLKGGISQQPDILRYPDQGSRQVNGWSSETEGLQKRPPLVFLNTLGDNGALGQAPYIHLINRDEHEQYYAVFTGSGIRVFDLSGNEKQVRYPNGSNYIKTANPRNDLRMVTVADYTFIVNRNVVAQKNTKSVNLPNYNPNQDGLINVRGGQYGRELIVHINGKDVAKYKIPDGSQPEHVNNTDAQWLAEELAKQMRTNLSDWTVNVGQGFIHVTAPSGQQIDSFTTKDGYADQLINPVTHYAQSFSKLPPNAPNGYMVKIVGDASKSADQYYVRYDAERKVWTETLGWNTEDQVLWETMPHALVRAADGNFDFKWLEWSPKSCGDVDTNPWPSFVGSSINDVFFFRNRLGFLSGENIILSRTAKYFNFYPASIANLSDDDPIDVAVSTNRIAILKYAVPFSEELLIWSDEAQFVLTASGTLTSKSVELNLTTQFDVQDRARPFGIGRNVYFASPRSSFTSIHRYYAVQDVSSVKNAEDITSHVPNYIPNGVFSICGSGTENFCSVLSHGDPSKIFMYKFLYLNEELRQQSWSHWDFGENVQVLACQSISSDMYVILRNEFNTFLARISFTKNAIDLQGEPYRAFMDMKIRYTIPSGTYNDDTFTTSIHIPTIYGANFGRGKITVLEPDGKITVFEQPTAGWNSDPWLRLSGNLEGRMVYIGFNINFVYEFSKFLIKQTADDGSTSTEDIGRLQLRRAWVNYENSGTFDIYVENQSSNWKYTMAGARLGSNTLRAGRLNLGTGQYRFPVVGNAKFNTVYILSDETTPLNIIGCGWEGNYLRRSSGI 794 T 0.031 MelC1 pdbpercent T Viruses T 7eyb 1 A,B,C,D,E,F,G,H a,b,c,d,e,f,g,h GP14_BPT7 GENE PRODUCT 14,GP14 MCWAAAIPIAISGAQAISGQNAQAKMIAAQTAAGRRQAMEIMRQTNIQNADLSLQARSKLEEASAELTSQNMQKVQAIGSIRAAIGESMLEGSSMDRIKRVTEGQFIREANMVTENYRRDYQAIFAQQLGGTQSAASQIDEIYKSEQKQKSKLQMVLDPLAIMGSSAASAYASGAFDSKSTTKAPIVAAKGTKTGR 196 T 0.056 Cpn60_TCP1 pdbpssm T Viruses T 7eyb 2 I,J,K,L,M,N,O,P A,B,C,D,E,F,G,H GP15_BPT7 GENE PRODUCT 15,GP15 MSKIESALQAAQPGLSRLRGGAGGMGYRAATTQAEQPRSSLLDTIGRFAKAGADMYTAKEQRARDLADERSNEIIRKLTPEQRREALNNGTLLYQDDPYAMEALRVKTGRNAAYLVDDDVMQKIKEGVFRTREEMEEYRHSRLQEGAKVYAEQFGIDPEDVDYQRGFNGDITERNISLYGAHDNFLSQQAQKGAIMNSRVELNGVLQDPDMLRRPDSADFFEKYIDNGLVTGAIPSDAQATQLISQAFSDASSRAGGADFLMRVGDKKVTLNGATTTYRELIGEEQWNALMVTAQRSQFETDAKLNEQYRLKINSALNQEDPRTAWEMLQGIKAELDKVQPDEQMTPQREWLISAQEQVQNQMNAWTKAQAKALDDSMKSMNKLDVIDKQFQKRINGEWVSTDFKDMPVNENTGEFKHSDMVNYANKKLAEIDSMDIPDGAKDAMKLKYLQADSKDGAFRTAIGTMVTDAGQEWSAAVINGKLPERTPAMDALRRIRNADPQLIAALYPDQAELFLTMDMMDKQGIDPQVILDADRLTVKRSKEQRFEDDKAFESALNASKAPEIARMPASLRESARKIYDSVKYRSGNESMAMEQMTKFLKESTYTFTGDDVDGDTVGVIPKNMMQVNSDPKSWEQGRDILEEARKGIIASNPWITNKQLTMYSQGDSIYLMDTTGQVRVRYDKELLSKVWSENQKKLEEKAREKALADVNKRAPIVAATKAREAAAKRVREKRKQTPKFIYGRKE 747 T 18 DUF3135 pdbpercent T Viruses T 7ezm 1 A A GNAI1_HUMAN;GNAQ_HUMAN ADENYLATE CYCLASE-INHIBITING G ALPHA PROTEIN,GUANINE NUCLEOTIDE-BINDING PROTEIN ALPHA-Q MGCTLSAEDKAAVERSKMIDRNLREDGEKARRELKLLLLGTGESGKSTFIKQMRIIHGSGYSDEDKRGFTKLVYQNIFTAMQAMIRAMDTLKIPYKYEHNKAHAQLVREVDVEKVSAFENPYVDAIKSLWNDPGIQECYDRRREYQLSDSTKYYLNDLDRVADPAYLPTQQDVLRVRVPTTGIIEYPFDLQSVIFRMVDVGAQRSERRKWIHCFENVTSIMFLVALSEYDQVLVESDNENRMEESKALFRTIITYPWFQNSSVILFLNKKDLLEEKIMYSHLVDYFPEYDGPQRDAQAAREFILKMFVDLNPDSDKIIYSHFTCSTDTENIRFVFAAVKDTILQLNLKEYNLV 353 T 6.9E-126 G-alpha unp F Eukaryota T 7ezn 1 A A J9VTB5_CRYNH Protein-tyrosine-phosphatase KEAMGHMQEVVDGLWVGDLVAANDDDELEKNGIKNILSALRPSLKFSDKYAVYPLEIDDSADTDLLSHLPSCVAWIKEILDLRQKAAEPSSQKNGTENGESLKRSPDIDTVAQPGKPGGVLVHCQAGMSRSASIVAAYLMSQYDLDPMEAMTMIREKRPVVEPSATFWHQLGLFYTTDGKVSLKDRSTRQYYMERTTTQFINGDG 205 T 1.1E-22 DSPc pdbpssm F Eukaryota T 7ezw 2 B B ALA-CYS-GLU-MET-GLY-PHE-PHE-GLN-ASP-CYS-GLY ACEMGFFQDCGX 12 T 1.2 RNA_pol_Rbc25 pdbhh F T 7ezx 2 B,DP,PE,PJ,QC,QL A1,AK,A9,AF,A7,AI A0A5J4YX19_PORPP R-phycoerythrin gamma chain, chloroplastic MAAFVSGFHGVQVGAPAENKLVCRAAKPAQLTMLTGYDSKSSPNFPNRAATRERRTVSFNARVARNKSQAKKILEKADEFFARSVTMQYKAFACPNGVYDIQCTEGTVKGAAYEKRAMAVSAAFRAKQASPAAKARALFENRRHAIIASHECQHEEDLFVRFPKLSAAYMMGKTEAMRTCSRYVVPDSLEEEYMAASVDRQMKERACPGGVYASSCVEGNAKGQAEQARVAALATAFRSAQKSASKTTAERYSSAAYGRDHFAHGCSYEESVFNTYPATAAAMRSKSYNY 290 T 0.14 Amidohydro_1 pdbpercent F Eukaryota T 7ezx 11 MC,ME D5,D8 A0A5J4YX67_PORPP CaRSPs2 MWEQQRPRRCEAPAAPSSRPAERRAAARRSRAQLRMKQDDYEQWKTEFAGGFPGGEAFYKKWIEEGAKGDVPALEEELQPRSPNKKPTIYEEQMISNRGQQKGVDPTWKTLLAGGFPGGEFFFKKWIGEGAQGEVPNLDADLQPGSGSAKKTGKKEDADKSSPGGIMTPGRIMVPSGLGEEEETVDKEAPRPELYNKYFSADRLHKAPEILFEYNKTKYDRVGVRYTEVTSKASERFFPKSRMNRAPVIEISYREGAVSTASVSLSMPEISGPPALPFPVPKGDVTTTMVTDPTTGRLKLEFKVDGAAVSAYSDPRATEAYNKMIRK 327 T 0.016 Fz pdbpssm F Eukaryota T 7ezx 12 NC,NE E5,E8 A0A5J4YJY8_PORPP CaRSPs1 MAAFVSGGCGVGGQRRAWPAKGAAVARTHACPTTMVVLPVARAGLAATAKKNQYMGTSVAPEIVLTDKGSDMSRKVKTEDKKVAADQAAAMGILANMSLYASLNPVKRMTYKAKEQAPAYVKKTGNPVEDFYPSSWRNMAPVISLSANRVAVAFEKIDAASNGVKANSNNKPFWKSGATTKNYVAPEAPAQSEPETLDDAAYQRYFPARIRNKAPAMEFRRPSFANTEDPSAYFMLQKETVPLRMALAEKLLTKLGRKGDGSEAEEEKVKPQKKAAKKDAKDDAKDDE 288 T 89 DUF6243 pdbhh F Eukaryota T 7ezx 14 NG,NL,OU,UX MA,MG,MN,MQ A0A5J4YNU6_PORPP R-phycoerythrin gamma chain, chloroplastic MAAFVVGAQVSSGFMGCAAPRKEQTRVGGAAKAAVSMRIIIQDKNSYRKYQTNTSTSKWDKLLNTKPMKRQVQPNPPTNETRALNLGNTFRSPAFKFLGTLKRSKDPSGLRLGFYGRKADDFMARSIAMQAKASAAGSGVYTTQCSEGASKGMAENARTASLAKQFRQAQRSAREMSFDYYEGRKYAMKAVGHICNYEEKIFQQYNKTAAAYVMGKQETLLSCDRYAQPANKAEEYIQKSVQMQMKKRSIPYGVYTTSCADGTVKGMAENARVAKESANFRARQMSAGAKAAARFNARRVANDWHNNGCNYEEKLTSRFPAAASSVRPTTNRY 333 T 0.022 APC_u5 pdb F Eukaryota T 7ezx 16 LS,MS,PG,QS,YO,ZO wL,xL,AB,AM,wJ,xJ A0A5J4YZM7_PORPP R-phycoerythrin gamma chain, chloroplastic MAAFVGSAASAFTGASAVKANEKRSVCSLQMVAMPQTGLVNSKFSARMAKKTAKQTKNKVDEYMARSVQRQYKQAAVATGVYGTQCTEGTVKGAAEASRSAALSRQFRIKQRSAFSKAHDLFEFRKHAIIAAAGCSYEEKMVTRFPKLAAAMVLGQTEMMRTCSRYVVPESVEEEYMAASVDKQMKRRGAPGGVYSLSCAEGVAKGQAEIARVSALGAAYRAASKSASAVTAERYNSMAYGRVHFAHGCSYEEQQFNKYPAAAAAMRSDSYGY 273 T 1.6 rRNA_methylase pdbpercent F Eukaryota T 7ezx 17 AP,NS,QG,RS yJ,yL,BB,BM A0A5J4YZH3_PORPP R-phycoerythrin gamma chain, chloroplastic METAFVSGFMGKAAVAKFGATAVCDKTARRSSSSNSQVHMVTGAVSSVNMRRFQRVPKVSGFSAKVTKKNVNKALDKADMFFAKSVTMEGKAAAIPYGVYGIQCMEGSAKGMAHEKRAMALSAAFRMNQRSAAEKTGAMYENRRLALILAQNDHQEKQYIKYPKLAAAALMASTEVTRACQRYAVPESIEEEFLAASVDKVNKMRGTTASGVYKSSCVEGNAKGQAEQARVAALAVAFRSAQKSASQFAAERYAQSKYGRDLFSSTHFEEGYANTYPAMAAAKRASSYGY 290 T 0.011 THF_DHG_CYH pdb F Eukaryota T 7ezx 26 DW,FX ZP,2P A0A5J4YTV6_PORPP Lrc4 MAFVACGPLRAGEGGARLGARKAACSMQLAPPGIPPGEDARNNQSLRQYVARPVETYQKRSFATPLPLTWTGETETVGAFDVVVPPQEKDLPVSGEATSAFVKYSDMVRAERKAALQALLSASAAGEGRPTCGAEGRKFVSNANPVLVNGVKCVEYWRK 159 T 14 CRM1_repeat unphh F Eukaryota T 7ezx 27 EW,GX aP,3P A0A5J4Z2M2_PORPP LRC5 MAFVSGAGVAVPAGAKASAPLCALRMSGYGDYSYSTDRTKGHVNQYYVDKARSRSDWGNRNVLPASEGDAVLGRTAKGAVAVPEFGIPQLDDPVLGFGPDSMVDPRIAEADGAVWRWDAGFVDESMTLASCADISDEAVADEAFAKFRGSVLAERGAMITKAESATASVITSLRDGLYSGEAQLLTASGQRLANVAGQEKIATISGYTWDGQPQTEIPGKPFVKSIGAMDYMDGVEGGDVVAAKVGAFWKPKAPKEVPYKRPMGANTPELPYNTVPRLVQAAGLAVQE 288 T 14 DUF5953 pdbhh F Eukaryota T 7f0c 2 B C DPP-SER-DPP-UAL-MYN-KBE XXXXXS 6 T 2200 zf-H2C2_2 pdbhh F F 7f0f 2 B C CMN IIB XXXXXA 6 T 1900 SEC-C pdbhh F F 7f0l 8 GA U U5NME9_CERS4 protein-U MPEVSEFAFRLMMAAVIFVGVGIMFAFAGGHWFVGLVVGGLVAAFFAATPNSN 53 T 0.05 TrbC pdbpercent F Bacteria T 7f0r 5 F G Q9HTR9_PSEAE Transcriptional factor SutA GAMGMSEEELEQDELDGADEDDGEELAAADDGEADSGDGDEAPAPGKKAKAAVVEEELPSVEAKQKERDALAKAMEEFLSRGGKVQEIEPNVVADPPKKPDSKYGSRPI 109 T 0.014 fvmX3 unppssm F Bacteria T 7f1n 1 A,B A,B A0A256XQM7_9CREN Beta-galactosidase MPFPEKFFWGASSSGFQFEMGDPEGKSIDPNTDWFKWVHDETNIRRGVVSGDLPEHGINYWDLFRSDHELAASIGMNAYRIGIEWSRIFPKPTLDVRVGIELDPEGYITRVEVDDKAIEELDLLANKEAVSRYREIILDLRDRGLKVFVCLNHFTLPLWIHDPIACRDTKLKRGPKGWVDKTTILEFAKYSAYMAWSLGNIVDYWVTFNEPMVVTEAGYFQPEVGFPPGLRNISAFKTACLNIANAHVVAYDLIKKYDKVRADDDSPSAAYVGIVHNIVPIKPYSERKLDLKAADLMNYIHNKWILEFIVRGKIDRSLVGREKYLIDKFKDKLDWLGVNYYTRIVLKGKWVPPLISPVPVIPDIVKGYGFNCTPGGRSLDGMPVSDFGWEVYPQGLSDALDIASEYGKPLIVTENGIADSEDNIRPYFLVSHLKVLEEYVEKKKNVYGYLHWALTDNYEWAQGFKMRFGLTDVDLETKERKPRESSEVFKIIASEKTVPEELVEKYPKPIF 511 T 1.4E-35 Glyco_hydro_1 unppercent F Archaea T 7f2d 2 B B CRU1_ARATH Cruciferin 1 C-terminal peptide RVAAA 5 T 200 RNase_HII pdbhh F Eukaryota F 7f2i 2 B B CRU1_ARATH Cruciferin 1 C-terminal peptide RVAAA 5 T 200 RNase_HII pdbhh F Eukaryota F 7f2p 1 A,B,C,D,E,F,G,H,I 1,2,3,4,5,6,7,8,9 I7GUT5_9CAUD Cement protein gp16 MKQKVHSVSYLAKAEFEYKNGVYDLVALPTGAEVIKISLEVVGLPTAGHVSVGFKDESKKNYSSILTLPVNETSGVVTKDYTVKSDKIVAAEVKDALAEGSDGRPVKCVLRALYFLPSVIEVEY 124 T 0.0053 Clathrin_bdg pdbpssm T Viruses T 7f32 1 A A SYCNCLCRRGVCRCICTI SYCNCLCRRGVCRCICTI 18 T 3.5 EB pdbhh F T 7f38 1 A,B,C,D,E,F,G,H,I A,B,C,D,E,F,G,H,I A0A191SAV5_9CAUD Putative major capsid protein MPLTNLTPTELLANKAVDYLANSFLVETPMLGLLANRVINQKQKAIEWGAKVAQGVVGGRTRTGALANDTQGTIKGASLSVPDYYIKHQFDVGKDEIVNSDATGKISAVRDPVGTAIADAFDVLSKKINSVLYTASGVADATNYGIFGLDAAAGTTVANSATGTYAGISKVTFPRWRSIIQGGAVPGTNEALTIARMTAMLRARRTAGVTYKGNQNQRLVILTSDNIENDVLRPLYGTVVDNQNVDFTRLDKDLLPYVNYMVKGIPVVSDIDCPANKMYLLNLDKLAIYSFDQSDADQSNGKITYIPLRYVDETGDTPSESTLWVRLADVSDEHPDLLKFELSVALQLVAFDLIDSISVIRDITQ 365 T 0.083 Phage_cap_P2 pdbpercent T Viruses T 7f45 1 A,B A,P A0A8G3QEZ8_PSEAI AcrIF5 MSRPTVVTVTETPRNPGSYEVNVERDGKMVVGRARAGSDPGAAAAKAMQMAMEWGSPNYVILGSNKVLAFIPEQLRVKM 79 T 0.067 Flagellin_D3 pdb F Bacteria T 7f4i 6 F U SHU9119 XDHXRWK 7 T 13 TSA pdbhh F T 7f4l 1 A,F B,A I7M8B9_TETTS Transmembrane protein, putative MKKNGKSQNQPLDFTQYAKNMRKDLSNQDICLEDGALNHSYFLTKKGQYWTPLNQKALQRGIELFGVGNWKEINYDEFSGKANIVELELRTCMILGINDITEYYGKKISEEEQEEIKKSNIAKGKKENKLKDNIYQKLQQMQ 142 T 0.00014 Myb_DNA-binding unppercent F Eukaryota T 7f4m 1 A,F B,A I7M8B9_TETTS Transmembrane protein, putative MKKNGKSQNQPLDFTQYAKNMRKDLSNQDICLEDGALNHSYFLTKKGQYWTPLNQKALQRGIELFGVGNWKEINYDEFSGKANIVELELRTCMILGINDITEYYGKKISEEEQEEIKKSNIAKGKKENKLKDNIYQKLQQMQ 142 T 0.00014 Myb_DNA-binding unppercent F Eukaryota T 7f4n 1 A,F B,G I7M8B9_TETTS Transmembrane protein, putative MKKNGKSQNQPLDFTQYAKNMRKDLSNQDICLEDGALNHSYFLTKKGQYWTPLNQKALQRGIELFGVGNWKEINYDEFSGKANIVELELRTCMILGINDITEYYGKKISEEEQEEIKKSNIAKGKKENKLKDNIYQKLQQMQ 142 T 0.00014 Myb_DNA-binding unppercent F Eukaryota T 7f4o 1 A B I7M8B9_TETTS Transmembrane protein, putative MKKNGKSQNQPLDFTQYAKNMRKDLSNQDICLEDGALNHSYFLTKKGQYWTPLNQKALQRGIELFGVGNWKEINYDEFSGKANIVELELRTCMILGINDITEYYGKKISEEEQEEIKKSNIAKGKKENKLKDNIYQKLQQMQ 142 T 0.00014 Myb_DNA-binding unppercent F Eukaryota T 7f4p 1 A B I7M8B9_TETTS Transmembrane protein, putative MKKNGKSQNQPLDFTQYAKNMRKDLSNQDICLEDGALNHSYFLTKKGQYWTPLNQKALQRGIELFGVGNWKEINYDEFSGKANIVELELRTCMILGINDITEYYGKKISEEEQEEIKKSNIAKGKKENKLKDNIYQKLQQMQ 142 T 0.00014 Myb_DNA-binding unppercent F Eukaryota T 7f4q 1 A B I7M8B9_TETTS Transmembrane protein, putative MKKNGKSQNQPLDFTQYAKNMRKDLSNQDICLEDGALNHSYFLTKKGQYWTPLNQKALQRGIELFGVGNWKEINYDEFSGKANIVELELRTCMILGINDITEYYGKKISEEEQEEIKKSNIAKGKKENKLKDNIYQKLQQMQ 142 T 0.00014 Myb_DNA-binding unppercent F Eukaryota T 7f4v 7 AA,G,Q cI,aI,bI PSAZ_GLOVI PSI-Z MQSYNVFPALVIITTLVVPFMAAAALLFIIERDPS 35 T 0.72 MWFE pdbhh F Bacteria T 7f4v 8 BA,H,R cJ,aJ,bJ Unknown protein XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 33 F F F 7f55 6 F L bremelanotide XDHXRWK 7 T 0.047 ACTH_domain pdbhh F T 7f66 8 N O eIF2beta XXXXXXXXXXXXXX 14 F F F 7f67 8 N,Q O,R eIF2beta XXXXXXXXXXXXXX 14 F F F 7f69 1 A A WIPI2_HUMAN WIPI-2,WIPI49-LIKE PROTEIN 2 GPGSGQLLFANFNQDNTSLAVGSKSGYKFFSLSSVDKLEQIYECTDTEDVCIVERLFSSSLVAIVSLKAPRKLKVCHFKKGTEICNYSYSNTILAVKLNRQRLIVCLEESLYIHNIRDMKVLHTIRETPPNPAGLCALSINNDNCYLAYPGSATIGEVQVFDTINLRAANMIPAHDSPLAALAFDASGTKLATASEKGTVIRVFSIPEGQKLFEFRRGVKRCVSICSLAFSMDGMFLSASSNTETVHIFKLETVKEKPPEEPTTWTGYFGKVLMASTSYLPSQVTEMFNQGRAFATVRLPFCGHKNICSLATIQKIPRLLVGAADGYLYMYNLDPQEGGECALMKQHRLDGSLV 354 T 0.084 WD40 unppercent F Eukaryota T 7f6g 2 B L SAR1-AngII XRVYIHPF 8 T 0.9 Adeno_PVIII pdbhh F T 7f6i 2 B L KNG1_HUMAN Kallidin KRPPGFSPFR 10 T 3.2E-05 Bradykinin unphh F Eukaryota T 7f6j 2 C C PDZD8_HUMAN SARCOMA ANTIGEN NY-SAR-84/NY-SAR-104 SAMGNSTGIKLVRKEGGLDDSVFIAVKEIGRDLYRGLPTEERIQKLEFMLDKLQNEIDQELEHNNSLVREEKETTDTRKKSLLSAALAKSGERLQALTLLMIHYRAGIEDIETLESLSLDQHSKKISKYTDDT 133 T 0.011 AAA_32 pdbpercent F Eukaryota T 7f6m 2 B B MAI-516 inhibitor XAGESLYEX 9 T 24 DUF3928 pdbhh F T 7f7g 2 C,D C,D UNK-ARG-ILE-ARG-ARG-ASP-GLU-TYR-LEU-LYS-ALA-ILE-GLN-UNK XRIRRDEYLKAIQX 14 T 4.4 DUF6026 pdbhh F T 7f7i 2 G,H,I,J,K,L G,H,I,J,K,L ACE-ARG-ILE-ARG-ARG-ASP-GLU-TYR-LEU-LYZ-ALA-ILE-GLN-NH2 XRIRRDEYLKAIQX 14 T 4.4 DUF6026 pdbhh F T 7f7o 2 B B Tracer 7 XAGESLYEKX 10 T 12 RPN6_C_helix pdbhh F T 7f7p 1 A,B A,B A0A377JKY9_HAEPA anti-CRISPR protein AcrIIC4 MKITSSNFATIATSENFAKLSVLPKNHREPIKGLFKSAVEQFSSARDFFKNENYSKELAEKFNKEAVNEAVEKLQKAIDLAEKQGIQFLEHHHHHH 96 T 1.3 Nif11 unphh F Bacteria T 7f87 2 C,D F,G Self Derived Peptide ALT 3 T 1000 Hyd_WA pdbhh F F 7f8x 2 B C ASP-SMF-NLE-GLY-TRP-NLE-OEM-MEA-NH2 (NN9056) DXXGWXXXX 9 T 0.19 DUF3452 pdbhh F F 7f91 1 A,B A,B THRCO_CORXX Thrombocorticin MGHHHHHHMTACTTGPQTISFPAGLIVSLNASVKSSRNESVEVKDSNGNTVSRGSGSSSSGGTFTVINMEPPTFISDGNDYTVELSPQATPGILQTESSRVDNGRLIWQNYAFGANDGGCIVGDRDFNDVFVLITGLVRG 140 T 0.0024 FlgD_ig unppercent F Eukaryota T 7f9f 1 A,B,C,D A,B,C,D THRCO_CORXX Thrombocorticin TACTTGPQTISFPAGLIVSLNASVQSSRNESVEVKDSNGNTVSRGSGSSSSGGTFTVINMEPPTFISDGNDYTVELSPQATPGILQTESSRVDNGRLIWQNYAFGANDGGCIVGDRDFNDVFVLITGLVRG 131 T 0.0024 FlgD_ig pdbpercent F Eukaryota T 7f9g 1 A,B A,B THRCO_CORXX Thrombocorticin MGHHHHHHMTACTTGPQTISFPAGLIVSLNASVQSSRNESVEVKDSNGNTVSRGSGSSSSGGTFTVINMEPPTFISDGNDYTVELSPQATPGILQTESSRVDNGRLIWQNYAFGANDGGCIVGDRDFNDVFVLITGLVRG 140 T 0.0024 FlgD_ig unppercent F Eukaryota T 7f9j 1 A,B A,B THRCO_CORXX Thrombocorticin Q25K mutant MGHHHHHHMTACTTGPQTISFPAGLIVSLNASVKSSRNESVEVKDSNGNTVSRGSGSSSSGGTFTVINMEPPTFISDGNDYTVELSPQATPGILQTESSRVDNGRLIWQNYAFGANDGGCIVGDRDFNDVFVLITGLVRG 140 T 0.0024 FlgD_ig unppercent F Eukaryota T 7f9o 30 DA Z Unidentified stromal protein XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 61 F F F 7f9o 39 MA 9 Photosynthetic NDH subunit of subcomplex B4 NQRDWVVTKSIWHLSDTAIKSFYTFYAMFTVWGVCFFASMKASMADPFYDSEHYRGQGGDGTVHWYYDRQEDIEATARGDLLR 83 T 0.023 CCDC142 unp F T 7f9o 40 NA 0 F2DWH9_HORVV Photosynthetic NDH subunit of subcomplex B5 GPLTEIEPDLQEDPIDKWRTNGVSPEDFVYGVYDGHHTYDEGQEKKGFWEDVSEWYQEAEPPQGFQALISWSFPPAVILGMAFDVPGEYLYIGAAIFIVVFCIIEMDKPDKPHNFEPEIYMMERSKRDKLIADYNSMDIWDFNEKYGELWDFTVN 155 T 0.06 DUF3098 pdb F Eukaryota T 7f9x 1 A A Q5ZTB4_LEGPH LotA TIDNPGLGNCAFYAFAIGLVNIIQEEAKYNRRTMFDRWVGLDRSISGQYDEILKLNLEDPDKELLDRLQSSLRIVTYQYQIRELRNVCVFRNGNYNRLTGNSNFVNFAALYYGDPLDTDSRFNPFADSVPILIKMANIDRDSVHPGHENDVLVPLFLDLLYGDTTNPADITLETEPKSDSPIITAMNNITQDFFWGTHLDLNYLAEAFEVNLHVLRNNSPIQEFVDIPERHTLTLTNSNNTHWTTQITTAR 251 T 0.026 Coagulase pdb F Bacteria T 7fad 1 A,C,E A,C,E Q6DDJ4_XENLA Gamma-tubulin complex component GPLGSMSEFRIHHDVNELISLLHVFGLEGADVYIDLLQKNRTPYVTTSVSTHSAKVKIAEFSRTPDDFLKKYEELKSKNTRNLDPLVYLLSKLIEDKETLQYLQQNAKDKAELATSSVTSVSLPIAPNTSKISMQELEELRRQLETATVAVSCSHQPVEVLRKFLRDK 168 T 0.014 DUF1993 pdb F Eukaryota T 7fao 1 A,B A,C Top7 Surface mutant GSHMDIQVQVNIDDNGKNFDYTYTVTTESELQKVLNELMDYIKAAGAARVRISITARTSSEAEKFAAILRKVFAELGYNDINVTFDGDTVTVEGQLE 97 T 0.0043 Yop-YscD_ppl pdbhh F T 7fax 2 B B Q38DC5_TRYB2 TbLeo1 peptide GSTLEDLFGPLFYVDKSL 18 T 1.3 RE_LlaMI pdbhh F Eukaryota T 7fb5 2 B B RETR1_HUMAN RETICULOPHAGY RECEPTOR 1 EGDDFELLDQSELDQIESELGLTQDQ 26 T 4.4 Uds1 pdbhh F Eukaryota T 7fb8 1 A B ASP-ASP-LYS-ASP-CYS-ASP-GLU-TYR-CYS-LYS-LYS-THR-LYS-GLU-NH2 DDKDCDEYCKKTKEX 15 T 0.71 Macin pdbhh F T 7fb8 2 B A GLU-LE1-THR-GLY-HIS-ILE-GLU-GLY-PRO-THR-LE1-THR-LE1-HIS-CYS-LYS-NH2 EXTGHIEGPTXTXHCKX 17 T 79 CoV_NSP10 pdbhh F T 7fba 1 A A GLU-CYS-ARG-GLU-TYR-GLY-PRO-LE1-LYS-LE1-LE1-ALA-NH2 ECREYGPXKXXAX 13 T 3 PHA-1 pdbhh F T 7fba 2 B B ALA-LE1-CYS-GLU-CYS-GLY-PRO-THR-ARG-GLU-CYS-LYS-NH2 AXCECGPTRECKX 13 T 0.36 DUF6315 pdbhh F T 7fbh 1 A,B,C A,B,C A0A1V4D079_9ACTN;A0A2Z5X7B9_9ACTN BezA MSNLDELASSRQTVLEPQDEVRIVGQYYDDKTAKLVRKYGPGPRIHYHVGYYPSSEAPRHTRDVTPDAFRRSIRLHQEGLLRYAAKIWGAEHRLSGRILDVGCGLGGGSLFWAQEYGADVTAVTNAPEHAPIVEGFARECGVGGRVRTLVCDAMHLPLDGGPYDAAVAIESSGYFDRPVWFERLAHVLRPGGSVCIEEVFTTRPHGADVWAEYFYTKPATVLDYAEAAKAAGFELVDDVDATSETLPFWEESTAWTKAVLDSDSTLSAVDRRQLRISLMANQALGAEWQAGGLRLGFLRFERK 303 T 2.7000000000000002E-30 CMAS unppssm F Bacteria T 7fbl 1 A,B A,B THRCO_CORXX Thrombocorticin MGHHHHHHMTACTTGPQTISFPAGLIVSLNASVQSSRNESVEVKDSNGNTVSRGSGSSSSGGTFTVINMEPPTFISDGNDYTVELSPQATPGILQTESSRVDNGRLIWQNYAFGANDGGCIVGDRDFNDVFVLITGLVRG 140 T 0.0024 FlgD_ig unppercent F Eukaryota T 7fbo 1 A,B,C A,B,C A0A1V4D079_9ACTN;A0A2Z5X7B9_9ACTN BezA MSNLDELASSRQTVLEPQDEVRIVGQYYDDKTAKLVRKYGPGPRIHYHVGYYPSSEAPRHTRDVTPDAFRRSIRLHQEGLLRYAAKIWGAEHRLSGRILDVGCGLGGGSLFWAQEYGADVTAVTNAPEHAPIVEGFARECGVGGRVRTLVCDAMHLPLDGGPYDAAVAIESSGYFDRPVWFERLAHVLRPGGSVCIEEVFTTRPHGADVWAEYFYTKPATVLDYAEAAKAAGFELVDDVDATSETLPFWEESTAWTKAVLDSDSTLSAVDRRQLRISLMANQALGAEWQAGGLRLGFLRFERK 303 T 2.7000000000000002E-30 CMAS unppssm F Bacteria T 7fbr 1 A A MATR3_MOUSE Matrin-3 GSSGSSGQKGRVETRRVVHIMDFQRGKNLRYQLLQLVEPFGVISNHLILNKINEAFIEMATTEDAQAAVDYYTTTPALVFGKPVRVHLSQKYKRIKSGPSSG 102 F F Eukaryota T 7fc0 1 A,F A,D RrMbnA precosur peptide MKIVIVKKVEIQVAGRTGMRCASSCGAKS 29 T 2.8 DUF5522 pdbhh F T 7fc0 4 D,E F,C A0A1I4IFH0_9BURK Methanobactin biosynthesis cassette protein MbnC MNAPTTAAAGAAPGRQVKDSELLARLADPAARGDFPPGCRAHVRIDISIRAYWHTLFDICPGLLDIADPDGMAIFAPFMDWARRENLTMGWSFYIWVGRWLAQSPWRERLDEELTQALLSASAARWAVLDRSADVGVVLGRRGSDDWIIGWKPNTLAAGRRVELVSLDGQLPRPAEDVGVFHLAGYELDSFPGWLALPR 199 T 4.2 PSD5 pdbpssm F Bacteria T 7fcn 1 A,B,C,D A,B,C,D G5DBH3_9GAMM Insecticidal protein SVYSNSPVPVYKDLNAVGPLSELTISPHASVEVFRIDTPIIPESRKSLRVVNTGLANSVTAKFYWSHSFTSEWFESGSIDVGLGEDKVLNVPSNSFYYSKFVIYNNTDKVAYVTANLV 118 T 0.84 DUF916 pdbhh F Bacteria T 7fd4 2 G S Alpha-S1-casein XXXXXXXXXXXXXXXXXXXXXX 22 F F F 7fd5 2 G S Alpha-S1-casein XXXXXXXXXXXXXXXXXXXXXX 22 F F F 7fdm 2 C,D C,D MYB29_ARATH MYB-RELATED PROTEIN 29,ATMYB29,PROTEIN HIGH ALIPHATIC GLUCOSINOLATE 3,PROTEIN PRODUCTION OF METHIONINE-DERIVED GLUCOSINOLATE 2 SSKKRCFKRSSSTSKLLNKVAARASSMGTILGASIEGTLISSTPLSSCL 49 T 55 DUF2375 pdbhh F Eukaryota T 7fe0 1 A,B,C A,B,C A0A8F7LEJ5_9ACTN AvmM MGSSHHHHHHSSGLVPRGSHMTSTVSTDGPVYREYKGFRVNDNIVADFIGVPAVITPGETIEFSVFYTNRGRYAYPDTGMNLVIWFSDRDDLRREDFKLFYKVSRADWQEQDPAKCWDPQFPAEGGVHIACQMSGPDGGILSKPDGTVPLPEVESVTAHVRLAFREGITSEHAGIFALPGMLDAPGDKSIIPGLFGNVFGRLQQASFRLGEGPSSLY 217 T 0.14 HSP90 unp F Bacteria T 7fe5 1 A,B,C A,B,C A0A8F7LEJ5_9ACTN AvmM MGSSHHHHHHSSGLVPRGSHMTSTVSTDGPVYREYKGFRVNDNIVADFIGVPAVITPGETIEFSVFYTNRGRYAYPDTGLNLVIWFSDRDDLRREDFKLFYKVSRADWQEQDPAKCWDPQFPAEGGVHIACQLSGPDGGILSKPDGTVPLPEVESVTAHVRLAFREGITSEHAGIFALPGMLDAPGDKSIIPGLFGNVFGRLQQASFRLGEGPSSLY 217 T 0.14 HSP90 unp F Bacteria T 7fe6 1 A,B,C A,B,C A0A8F7LEJ5_9ACTN AvmM MTSTVSTDGPVYREYKGFRVNDNIVADFIGVPAVITPGETIEFSVFYTNRGRYAYPDTGLNLVIWFSDRDDLRREDFKLFYKVSRADWQEQDPAKCWDPQFPAEGGVHIACQLSGPDGGILSKPDGTVPLPEVESVTAHVRLAFREGITSEHAGIFALPGMLDAPGDKSIIPGLFGNVFGRLQQASFRLGEGPSSLY 197 T 0.14 HSP90 pdb F Bacteria T 7fep 2 AA,BA,O,P,Q,R,S,T,U,V,W,X,Y,Z a,b,O,P,Q,R,S,T,U,V,W,X,Y,Z ADEP1 XFSPXAX 7 T 45 Acp26Ab pdbhh F F 7fer 2 AA,BA,O,P,Q,R,S,T,U,V,W,X,Y,Z a,b,O,P,Q,R,S,T,U,V,W,X,Y,Z ADEP1 XFSPXAX 7 T 45 Acp26Ab pdbhh F F 7fgj 3 C C VQILNK VQILNK 6 T 56 pKID pdbhh F T 7fgm 2 B B FAF1_HUMAN HFAF1,UBX DOMAIN-CONTAINING PROTEIN 12,UBX DOMAIN-CONTAINING PROTEIN 3A RQIVERQPRMLDFRVEYRDRNVDVVLEDTCTVGEIKQILENELQIPVSKMLLKGWKTGDVEDSTVLKSLHLPKNNSLYVLT 81 T 3.2E-05 YukD pdbhh F Eukaryota T 7fgn 1 A A FAF1_HUMAN HFAF1,UBX DOMAIN-CONTAINING PROTEIN 12,UBX DOMAIN-CONTAINING PROTEIN 3A GSHMLDFRVEYRDRNVDVVLEDTCTVGEIKQILENELQIPVSKMLLKGWKTGDVEDSTVLKSLHLPKNNSLYVLTPDL 78 T 3.1E-05 YukD pdbhh F Eukaryota T 7fgr 3 C C VQIFNK VQIFNK 6 T 49 ArlS_N pdbhh F T 7fhk 2 B,D B,D AtTPC1-Cter XXXXXXXXXXXX 12 F F F 7fhl 2 B,D B,D AtTPC1-Cter XXXXXXXXXXXX 12 F F F 7fi3 2 E,F,G,H E,F,G,H endogenous pentapeptide XXXXX 5 F F F 7fi4 1 A,B,C,D,E,F,G,H A,B,C,D,E,F,G,H A0A3A9QXE8_MORCA AcrIF13 AGSMKLLNIKINEFAVTANTEAGDELYLQLPHTPDSQHSINHEPLDDDDFVKEVQEICDEYFGKGDRTMARLSYAGGQAYDSYTEEDGVYTTNTGDQFVEHSYADYYNVEVYCKADLV 118 T 0.019 DUF1882 unppssm F Bacteria T 7fia 1 A A A0A8F9PCN6_PSEAI AcrIF23 GSMTNFQTWLDSADIPVQQNGQWIDLETGIAYDPSYNYAANTRRASLSPRGIDARAVAKTFGGRALTGTARQKEWAEKIRAEKVQQMNQDQAEMACDPSGLLTAAKFWIENRNDSAQEIAGFVMQQKALLAQHRSAKAAGQADKVAKIAAEYNALTARWGF 161 T 8.3 DUF6440 pdbhh F T 7fid 2 G S unknown endogenous substrate XXXXXXXXXXXXXXXXXXXXXX 22 F F F 7fie 2 G S Unknown endogenous substrate XXXXXXXXXXXXXXXXXXXXXX 22 F F F 7fik 10 J J A0A1L8H1I9_XENLA NUCLEAR PORE COMPLEX PROTEIN NUP133-LIKE XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXYFREVSQMEIIFECLVDKEEADLESTSIDSVEWANIVVNVNTILKDMLHVACQYRQSKNSLYKNESGIQEPEHVPWTASSGTAGIRSVVTRQHGIILKVYPQADSGLRTILIEQLAALLNYLLDDYVTQLKSIDKLANEERYNILEMEYAQKRSELLSPLLILGQYAWASNLAEKYCDFDILVQICEMTDNQSRLQRYMTLFAEQNFSDFLFRWYLEKGKRGKLLSQPASQHGQLAAFLQAHDHLSWLHELNSQEFEKAHRTLQTLANMETRYFCKKKTLLGLSKLAALASDFQEDVLQEKVEEIAEQEHFLLHQETLPKKLLEEKQLDLNAMPVLAPFQLIQLYVCEENKRANENDFMKALDLLEYIGDDSEVDVEELKLEILCKAIKRDEWSATDGKDDPIEATKDSIFVKVLQNLLNKGIELKGYLPKAETLLQSEELNSLKTNSYFEFSLKANYECYMKMQS 1140 T 9.7 Nucleoporin_C pdbpercent F Eukaryota T 7fik 14 S X Nup98 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 161 F F F 7fit 1 A A Q73HD5_WOLPM bacteria factor 1 MPIETKRQAEVLKKLQDVIKHTDRDIAAGRKLAIKRWVETYIEYIKLFKDDKLEFLYNVFRDEGCWLGTRLNNTVLGQKLTEEKIGEIDNPLPRYGMASRYCITGKIGDFFNKQFVLSRGQFTSEEVDSQGNPISDQYVRNILLSSMKRNGPVFDFWIDRESGELKKYDAVEGFDSTVKLKWSEGVEYFYNQLEEKDKEKKLTEAIVALSRPQSVKRDAPILDFCVRNIGDKDTLLQKLLQKDKGVYFLLAELIESCFFDTVHDLVQCWCYKGVSAGGDCSDKIFSQQDYELFLYSLSNVMLKNPELSVQARSLIMEIWKCERFAEYRETSVNTSNYTVPIKSVLGGLIINWKREDVCKPDREIEKEEILDMISFAKGCFPEKFDLFKEVMIENLRICGREGKRKGVDYGKFAEELFLQLEKVTLPSVGDGPWNNLRSQSKVSLPLDGSGDGPQSEFEAPSVSGISGSHKKRRILEHHHHHH 482 T 0.5 PMAIP1 unppssm F Bacteria T 7fiv 1 A A A0A2K9VS01_9RICK CidA_I gamma/2 protein MPTQKELRDTMSKKLQEAIKHPDPAVVAGRKSAIKRWVGVLQDNFMEHIKYFKGDKLKFLHNVFQDEGCWSGVRLDNAALGQRFTEEKIGGIDNPLRKYEMACSYCVVDKIHPLFQKRFESYRNKFPPGAFDGKTETEFGKYVRNSLLDSIKRKGPVFDFWIDRESGELKKYDAVEGFDSAVKFKWSEGVEYFYNHLKEEDKEKKLTEAILALSRVQSVEKDAPILDFCVNKIVDKDTLLQKLSQKDKGVYSLFVELIESCFFDTVHDLVQCWCYKEVSAGGDHSEKIFSQRDYELFLSSLSDTMLKNPELSVQARSLIMEFWECGSLYQYRKAAVNTSNYTVPTSGVFAELIVNWRREDIYKTDEEKEIEKKEILDMMSFAKDCFPEKFELFKKLIIRDLRLCGREGKRVNVDYGLFAEELFSELEKTILPPGPVGDGPCSNLRSRSKAHGSKKTTLPVDDSPQSELGTPSVSGVSSYKKKSVFTLSGNKLEHHHHHH 499 T 0.012 DUF3437 pdbpssm F Bacteria T 7fiv 2 B B A0A2K9VS18_9RICK CidB_I b/2 protein MSNGDGLIRSLVDGDLEGFRQGFESFLDQCPSFLYHVSAGRFLPVFFFSMFSTAHDANILNANERVYFRFDNHGVNPRNGENRNTANLKVAVYRDGQQVVRCYSISDRPNSDGLRFSTRERNALVQEIRRQNPNLREEDLNFEQYKVCMHGKGKSQGEAIATVFEVIREKDRQGRDKFAKYSASEINLIRRLLGDHRLTIKEIEGRQLNQNQLRQLGRLVNFAQVAQGQQGIDNFMEMLASDRRQDVRDRIRREILPYITDIYNNYRQVLENNIENRNQRFEGHGFLLGFLANFSHRYTIGVDLDLSPRNSHVAFLVRHQVERENIPIVINLATRAPPYIALNRARSHAERLHVFSFIPIHTESRNTVCVGLNFNLNLDPFSVDTVGLQQDRFPLVQRLFECLENEGIRENIRDFLLHHLPAEIPRNAENYDRIFDCITGFAFGNSAFDRHPLELEEEDEAPITKYIFRHGDEGLRCLTMVFHAEGSDIVILHIRAHDAQQQGAINLQTLNVNGNDVHVWEVSCTLNNQLELDIDLPNDLGLYHDYQNNNANNFLAGDLVQVPNTENVHNTLNQVVNDGWKNIAQHRGLFQEISGALMPLVDTINVNSEDKFRSILHGTFYASDNPYKVLAMYKVGQTYSLKRGQEEEGERVILTRITEQRLDLLLLRQPRENDLDTHPIGYVLRLANNAEEVGQQQNDARQEIGRLKKQHRGFIPITSGNEVVLFPIVFNRDAHEAGNLILFPEGIGREEHVHRLDRHVRLEHHHHHH 769 T 3.8E-05 PDDEXK_9 pdbhh F Bacteria T 7fiw 1 A A B3CP63_WOLPP BACTERIA FACTOR 3 MSNGDGLIRSLVDGDLEGFRQGFESFLDQCPSFLYHVSAGRFLPVFFFSMFSTAHDANILNANERVYFRFDNHGVNPRNGENRNTANLKVAVYRDGQQVVRCYSISDRPNSDGLRFSTRERNALVQEIRRQNPNLREEDLNFEQYKVCMHGKGKSQGEAIATVFEVIREKDRQGRDKFAKYSASEVHFLRQLFRNHRLTIKEIEGRQLNQNQLRQLGRSVNFTRVEPGQQRIDNFMEMLASNQRQDVRDSLRGDILEYVTDTYNNYRAQIENNIEGRSQKFESHGFLLGFLANFSHRYTIGVDLDLSPRNSHVAFLVRHQVERENIPIVINLATRAPPYIALNRARSHAERLHVFSFIPIHTESRNTVCVGLNFNLNLDPFSVDTVGLQQDRFPLVQRLFECLENEGIRENIRDFLLHHLPAEIPRNAENYDRIFDCITGFAFGNSAFDRHPLELEEEDEAPITKYIFRHGDEGLRCLTMVFHAEGSDIVILHIRAHDAQQQGAINLQTLNVNGNDVHVWEVSCTLNNQLELDIDLPNDLGLYHDYQNNNANNFLAGDLVQVPNTENVHNTLNQVVNDGWKNIAQHRGLFQEISGALMPLVDTINVNSEDKFRSILHGTFYASDNPYKVLAMYKVGQTYSLKRGQEEEGERVILTRITEQRLDLLLLRQPRENDLDTHPIGYVLRLANNAEEVGQQQNDARQEIGRLKKQHRGFIPITSGNEVVLFPIVFNRDAHEAGNLILFPEGIGREEHVHRLDRHVRLEHHHHHH 769 T 3.6E-05 PDDEXK_9 pdbhh F Bacteria T 7fiw 2 B B A0A5B8WHG9_9RICK;Q73HD5_WOLPM bacteria factor 4,CidA I(Zeta/1) protein MPIETKKQAEVLKKLQDVIKHTDRDIAAGRKLAIKRWVETYIEYIKYFKDDKLEFLYNVFRDEGCWLGTRLNNTVLGQKLTEEKIGEIDNPLRRYGMASRYCITGKIHPLFQKRFESYRNKFPPGAFDGKTETEFGKYVRNSLLDSIKRKGPVFDFWIDRESGELKKYDAVEGFDSTVKLKWSEGVEYFYNQLEEKDKEKKLTEAIVALSRPQSVKRDAPILDFCVRNIGDKDTLLQKLLQKDKGVYFLLAELIESCFFDTVHDLVQCWCYKGVSAGGDCSDKIFSQRDYELFLSSLSDVMLKNPELSVQARSLIMEIWKCERFAEYRETSVNTSNYTVPIKSVLGELIINWKREDVCKPDREIEKEEILDMISFAKGCFPEKFDLFKEVMIRNLRLCGREGKRKGVDYGKFAEELFLQLEKVTLPSVGDGPWNNLRSQSKVSLPLDGSGDGPQSEFEAPSVSGISGSHKKRRILEHHHHHH 482 T 0.28 EFG_III pdbpercent F Bacteria T 7fix 6 F,FA,S F1,F3,F2 PSAF_THEVB PSI-F HHHHHHHHHHMRRFLALLLVLTLWLGFTPLASADVAGLVPCKDSPAFQKRAAAAVNTTADPASGQKRFERYSQALCGEDGLPHLVVDGRLSRAGDFLIPSVLFLYIAGWIGWVGRAYLIAVRNSGEANEKEIIIDVPLAIKCMLTGFAWPLAALKELASGELTAKDNEITVSPR 174 T 2.5E-07 PSI_PsaF unppercent F Bacteria T 7fiz 2 G S Unknown endogenous substrate XXXXXXXXXXXXXXXXXXXXXX 22 F F F 7fj1 7 AA,Z Z,Y G3G8Y0_9ALPH VP1/2 RVVESDTLINRRYMRATGLGALALLIAACRLIARRLRETRTTLKGSARRFNVDLFQVRLILG 62 T 7.1 Alpha_GJ pdbpssm T Viruses T 7gch 1 A E CTRA_BOVIN GAMMA-CHYMOTRYPSIN A CGVPAIQPVLSGL 13 T 2 CaM_bind pdbhh F Eukaryota T 7hvp 2 C C INHIBITOR ACE-SER-LEU-ASN-PHE-PSI(CH(OH)-CH2N)-PRO-ILE VME (JG-365) XSLNXIX 7 T 490 Rad54_N pdbhh F F 7ins 3 G G GENERAL PROTAMINE CHAIN XXXXXXXXXXXXXXXX 16 F F F 7jfo 3 Q,R,S,T,U,V,W,X q,r,s,t,u,v,w,x Q94ET8_CHLRE LCI5 TNRVSPTRSVLPANWRQELESLRN 24 T 0.047 LigXa_C unp F Eukaryota T 7jfr 5 G L Auristatin XVXXX 5 T 3200 zf-CCHC_2 pdbhh F F 7jgx 1 A A neuroVAL derived peptide ILE-PHE-TRP-LEU-PHE-ARG-GLY-LYS-ALA-ASP-VAL-ALA-LEU-NH2 IFWLFRGKADVALX 14 T 0.94 FRG pdbhh F T 7jgy 1 A A PROTO_AGEPP Protonectin peptide ILE-LEU-GLY-THR-ILE-LEU-GLY-LEU-LEU-LYS-GLY-LEU-NH2 ILGTILGLLKGLX 13 T 0.47 DUF445 unphh F Eukaryota T 7jh6 1 A,B,C,D A,B,C,D Two-domain di-Zn(II) and porphyrin-binding protein DYLRELLKLELQAIKQYEKLRQTGDELVQAFQRLREIFDKGDDDSLEQVLEEIEELIQKHRQLASELPKLELQAIKQYREALEYVKLPVLAKILEDEEKHIEWLKEAAKQGDQWVQLFQRFREAIDKGDKDSLEQLLEELEQALQKIRELTEKTGRKILEDEEKHIEWLETILG 174 T 0.00053 DUF5667 pdbpercent F T 7jh7 3 I,J I,J tropomyosin XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 135 F F F 7jhf 1 A A Protonectin-F derived peptide ILE-PHE-GLY-THR-ILE-LEU-GLY-PHE-LEU-LYS-GLY-LEU-NH2 IFGTILGFLKGLX 13 T 0.13 DUF445 pdbhh F T 7jhx 2 C,D C,D EPG5_HUMAN Ectopic P granules protein 5 homolog DEDPETSWILLN 12 T 0.36 TRI9 pdbhh F Eukaryota T 7jhy 3 H,I,J,K,L h,g,j,k,i L0JA79_9MYCO Csf4 (Cas11) MTTPTPTQVWRATVPELPPLVDEAGDTGSATARAADTAERLLLLLHYSIDWESSWVADPKHRKTYWDELLPGRVRRAAYRADTLDRWWSEVAGQLGAPAPRHRDRRLELATLLREPALPVITVLRDSLPALLLRVRIIAEAVAAQRGNNSAATSSADPNEPA 162 T 17 RBDV_coat pdbhh F Bacteria T 7ji0 2 C,D C,D SHP3 DIIIIVGG 8 T 0.083 PGA2 pdbhh F F 7ji2 3 E,F C,F OVA mutant peptide SIIQFEHL 8 T 9 KCTD4_C pdbhh F T 7jic 1 A A CD19_HUMAN B-LYMPHOCYTE SURFACE ANTIGEN B4,DIFFERENTIATION ANTIGEN CD19,T-CELL SURFACE ANTIGEN LEU-12 DYKDDDDLEVLFQGPPEEPLVVKVEEGDNAVLQCLKGTSDGPTQQLTWSRESPLKPFLKLSLGLPGLGIHMRPLAIWLFIFNVSQQMGGFYLCQPGPPSEKAWQPGWTVNVEGSGELFRWNVSDLGGLGCGLKNRSSEGPSSPSGKLMSPKLYVWAKDRPEIWEGEPPCLPPRDSLNQSLSQDLTMAPGSTLWLSCGVPPDSVSRGPLSWTHVHPKGPKSLLSLELKDDRPARDMWVMETGLLLPRATAQDAGKYYCHRGNLTMSFHLEITARPVLWHWLLRTGGWKVSAVTLAYLIFCLCSLVGILHLQRALVLRRKRKRMT 323 T 0.00011 G6B unphh F Eukaryota T 7jil 2 B C A0A1M5L9Q4_FLAJO 50S ribosomal protein L3 MSGLIGKKIGMTSIFDENGKNIPCTVIEAGPCVVTQVRTNEVDGYEALQLGFDDKNEKHSTKAALGHFKKAGTVAKKKVVEFQDFAAAQALGDLIDVSIFEEGEFVDVQGVSKGKGFQGVVKRHGFGGVGQATHGQHQRLRAPGSVGASSYPSRVFKGMRMAGRMGGDNVKVQNLRVLKVVAEKNLLVVKGCIPGHKNSYVIIQK 205 T 0.28 T2SS-T3SS_pil_N pdbpssm F Bacteria T 7jil 28 BA 5 A0A4V2PMH1_FLAJO 30S ribosomal protein S22 MPSGKKRKRHKVATHKRKKRARANRHKKKK 30 T 5.9 DUF1713 pdbpercent F Bacteria T 7jiy 1 A A A8E5C4_DANRE Granulin 1 CEGNFYCPAEKFCCKTRTGQWGCC 24 T 0.00024 Granulin unppercent F Eukaryota T 7jjc 2 E,F,G,H E,F,G,H SPIKE_SARS2 S GLYCOPROTEIN,E2,PEPLOMER PROTEIN,SPIKE GLYCOPROTEIN NSPRRAR 7 T 29 Stanniocalcin pdbhh T Viruses F 7jjl 2 B B KDM1A_HUMAN BRAF35-HDAC COMPLEX PROTEIN BHC110,FLAVIN-CONTAINING AMINE OXIDASE DOMAIN-CONTAINING PROTEIN 2,[HISTONE H3]-DIMETHYL-L-LYSINE(4) FAD-DEPENDENT DEMETHYLASE 1A TPEGRRTSRRKRAKVEYREMDESLAN 26 T 48 EFG_IV pdbhh F Eukaryota T 7jjm 1 A A KDM1A_HUMAN BRAF35-HDAC COMPLEX PROTEIN BHC110,FLAVIN-CONTAINING AMINE OXIDASE DOMAIN-CONTAINING PROTEIN 2,[HISTONE H3]-DIMETHYL-L-LYSINE(4) FAD-DEPENDENT DEMETHYLASE 1A TPEGRRTSRRKRAKVEYREMDESLAN 26 T 48 EFG_IV pdbhh F Eukaryota T 7jjv 1 A,B A,B GrAFP antifreeze protein MQCDGLDGADGTSNGQAGASGLAGGPNCNGGKGGKGAPGVGTAGGAGGVGGAGGTGNTNGGAGGSGGNSDVAAGGAGAAGGAAGGAGTGGTGGNGGAGKPGGAPGAGGAGTPAGSAGSPGQTTVLEHHHHHH 132 T 1100 NIP_1 pdbhh F T 7jk7 1 A A KDM1A_HUMAN BRAF35-HDAC COMPLEX PROTEIN BHC110,FLAVIN-CONTAINING AMINE OXIDASE DOMAIN-CONTAINING PROTEIN 2,[HISTONE H3]-DIMETHYL-L-LYSINE(4) FAD-DEPENDENT DEMETHYLASE 1A TPEGRRTERRKRAKVEYREMDESLAN 26 T 24 EFG_IV pdbhh F Eukaryota T 7jk9 1 A,AA,B,BA,C,CA,D,DA,E,EA,F,FA,G,GA,H,HA,I,IA,J,JA,K,KA,L,LA,M,MA,N,NA,O,P,Q,R,S,T,U,V,W,X,Y,Z A,BA,B,CA,C,DA,D,EA,E,FA,F,GA,G,HA,H,IA,I,JA,J,KA,K,LA,L,MA,M,NA,N,OA,O,P,Q,R,S,T,V,W,X,Y,Z,AA PORB_ARATH PCR B,NADPH-PROTOCHLOROPHYLLIDE OXIDOREDUCTASE B,POR B,LIGHT-DEPENDENT PROTOCHLOROPHYLLIDE OXIDOREDUCTASE MALQAASLVSSAFSVRKDAKLNASSSSFKDSSLFGASITDQIKSEHGSSSLRFKREQSLRNLAIRAQTAATSSPTVTKSVDGKKTLRKGNVVVTGASSGLGLATAKALAETGKWNVIMACRDFLKAERAAKSVGMPKDSYTVMHLDLASLDSVRQFVDNFRRTETPLDVLVCNAAVYFPTAKEPTYSAEGFELSVATNHLGHFLLARLLLDDLKKSDYPSKRLIIVGSITGNTNTLAGNVPPKANLGDLRGLAGGLNGLNSSAMIDGGDFDGAKAYKDSKVCNMLTMQEFHRRFHEETGVTFASLYPGCIASTGLFREHIPLFRALFPPFQKYITKGYVSETESGKRLAQVVSDPSLTKSGVYWSWNNASASFENQLSEEASDVEKARKVWEISEKLVGLA 401 T 0.07 adh_short pdbpercent F Eukaryota T 7jl6 1 A,B A,B SRRB_STAAU STAPHYLOCOCCAL RESPIRATORY RESPONSE PROTEIN B AMGRDSLINSMVEGVLGINESRQIILSNKMANDIMDNIDEDAKAFLLRQIEDTFKSKQTEMRDLEMNTRFFVVTTSYIDKIEQGGKSGVVVTVRDMTNEHNLDQ 104 T 0.0008 PAS_4 pdbhh F Bacteria T 7jl7 3 E F ASP-GLU-VAL-ASP peptide DEVD 4 T 76 POX pdbhh F F 7jls 2 B B Peptide SER-VAL-ALA SVA 3 T 810 Ribosomal_S25 pdbhh F F 7jmn 1 A E G0SGD2_CHATD MEDIATOR COMPLEX SUBUNIT 5 MVTVTDPLTARLEAAIKAWSDFFSDAEHERLDPAIFADQSQTLFANHPLAPVPLADLLLRPTPSNRECVDQRTLQYLQVLQKQGRITTAAVLRALYKYSTAHTRAQTPDGKPKHGAGDSSTNDADVGGSSKADLTSRMVRWRNSYMVEEDVLWRLARAVNHGTGIKTSHDVTEVAKVLARWTALFAEVSAAISRDAFNSMNGLQVKDESEDARNAFVLFYFAFCENQIVNETLSQPVCKDICRKLLDSLDAFLPTLMHLTADITGRLEHFRSEVLARYAPQEKKSMDMPSFMNDLSMSLESFQVPELPVVNTRAGLYIYLGAALVGRPMIDDEALFSYLHNRYQGDLQAMAVHLILASFDLLANAVFRNEGAKTGHLLKSFLINKVPLILVQLVAYAATTMYPFNAEMCITEALNQVDINMFPTLSGMFDMPNNNSFNDSVRQDFCFACQLHGLLSQAAIETLLGDITYQSLPPEGRYVKEQLVQACLQEPDRTLKLIGELDNMNGNVGAAAQAIVEICRDLASKPLSLDVLLLFDKPHKILHPLCELLDNWAGYEEDHGEYQPVYEEFGSVLLLLLAFVYRYNLSTADLGIRSSGSFVAKLLNGVDRCQPLEQLSEQEKSHLGGWIHGLFDTEAGGLGDELMSSCPPQDFYLLAPTLFHQIVNALSAGYLTDEMLKGGLEYLVDVLLLPALVPALLYLSNLLWADNQPIQNAVIKILQPILKPTSISNEASTMLSSVLNIVAKPLEHALKSYQRQDPECQKIEPLLLAIADNLAVSRRTGGADHTELESWCSAQITNPATGALIHGGLAAAVRTTIQQLVQWAQNPTLNSMNGMPAPYTHRQTLAAQQILGPHRLLGIILDELKSSPEPGIAYDVVTTMICAPDVRNSTISSPSTQSNNSSDHNDQAQNHQSQDAKHKHPHRLTLRDALRLEAHDFRAHLRADPVLAETVVRLYRRVEAQLTPLALPLPPAAAAAPAVGVNVGVDAATAAAAAAAAAMMPDALGLGVVGGVELGGMEGAIAAAVAAANGSGTGGAGGDGTQGGAGDAGMGLDGQQQGQGGSSAGDMGLGGGTADDIFSGLSGPDDFGADFGSWSMDLS 1099 T 7.6E-85 Med5 unppercent F Eukaryota T 7jmn 5 E X Unknown peptide XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 52 F F F 7jn6 1 A A DF204_ARATH Defensin-like protein 204 AHCDHFLGEAPVYPCKEKACKSVCKEHYHHACKGECEYHGREVHCHCYGDYH 52 T 0.0013 SLR1-BP unp F Eukaryota T 7jqd 2 B B MAXA_LUTLO Peptide-43 CDATCQFRKAIDDCARQAYHSSVFKACMKQKKKEWKAGX 39 T 0.74 Clavanin unphh F Eukaryota T 7jql 55 CB,FD 1z,2z Bac7-001 WRIRPRPPRLPRPRPR 16 T 11 DUF1639 pdbhh F F 7jqm 55 CB,FD 1z,2z Bac7-002 RRIRPRPPRLPRPRWR 16 T 2.5 Agenet pdbhh F F 7jqr 1 A A Abeta 16-36 beta-hairpin mimic VAL-ORN-LYS-LEU-VAL-MEA-PHE-ALA-GLY-ORN-ALA-ILE-ILE-GLY-LEU-MET VXKLVXFAGXAIIGLM 16 T 3.2 Beta-APP pdbhh F T 7jqs 1 A A Abeta 16-36 beta-hairpin mimic VAL-ORN-LYS-LEU-VAL-MEA-PHE-ALA-ASP-ORN-ALA-ILE-ILE-GLY-LEU-MET VXKLVXFADXAIIGLM 16 T 0.11 Beta-APP pdbhh F T 7jqt 1 A A Abeta 16-36 beta-hairpin mimic VAL-ORT-LYS-LEU-VAL-MEA-PHE-ALA-LYS-ORT-ALA-ILE-ILE-GLY-LEU-MET VXKLVXFAKXAIIGLM 16 T 0.79 Beta-APP pdbhh F T 7jqu 1 A,B,C A,B,C Abeta 16-36 beta-hairpin mimic VAL-ORN-LYS-LEU-VAL-MEA-PHE-ALA-GLN-ORN-ALA-ILE-ILE-GLY-LEU-MET VXKLVXFAQXAIIGLM 16 T 3.6 Beta-APP pdbhh F T 7jr9 4 E F Flagellar radial spoke protein 6 XXXXXXXX 8 F F F 7jr9 5 F G Flagellar radial spoke protein 10 XXXXXX 6 F F F 7jrg 10 J,T K,W QCR10 MAGLPARLRIQPADVKAAAMWGVAAATGGLYLVQVSILVLPPVKVVFHFYLVSGFRICLDVKDLRTMPCSAPRIWRLYILI 81 T 0.063 QCR10 pdbhh F T 7jrh 1 A A Cyclic peptide ASP-GLN-TRP-MLE-GLN-VAL-ASP-ORD-GLU-VAL-THR-GLY-ILE-ILE-THR-ORD DQWXQVDXEVTGIITX 16 T 6.3 GPHH pdbhh F T 7jrj 5 E J unknown protein XXXX 4 F F F 7jrj 11 L L Flagellar radial spoke protein 6 XXXXXX 6 F F F 7jrj 12 M M Flagellar radial spoke protein 10 XXXXX 5 F F F 7jrj 13 N N Flagellar radial spoke protein 1 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 89 F F F 7jrj 14 O O Flagellar radial spoke protein 2 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 69 F F F 7jro 8 H h A0A1S3V319_VIGRR Cytochrome c oxidase subunit 5C MAGPRIAHATLKGPSVVKEIIIGITLGLAAGSVWKMHHWNEQRKIRTFYDLLEKGEIGVVVDEQ 64 T 0.001 COX6C pdbhh F Eukaryota T 7jrp 10 J,T K,W QCR10 MAGLPARLRIQPADVKAAAMWGVAAATGGLYLVQVSILVLPPVKVVFHFYLVSGFRICLDVKDLRTMPCSAPRIWRLYILI 81 T 0.063 QCR10 pdbhh F T 7jrp 18 BA h A0A1S3V319_VIGRR Cytochrome c oxidase subunit 5C MAGPRIAHATLKGPSVVKEIIIGITLGLAAGSVWKMHHWNEQRKIRTFYDLLEKGEIGVVVDEQ 64 T 0.001 COX6C pdbhh F Eukaryota T 7jrx 1 A,E A,a CTRA_BOVIN Chymotrypsin A chain A CGVPAIQPVLSGL 13 T 2 CaM_bind pdbhh F Eukaryota T 7js6 1 A A des-citrulassin F LLGRSGNDRLILSKN 15 T 0.95 hemP pdbhh F T 7jsd 1 A,B,C,D A,B,C,D Lysine hydroxylase GPHMDVHEIDETLEKFLAENYTPERVQQLADRFQRTGFVKFDSHMRIVPEELITAVRAEADRLVREHKERRDLVLGTTGGTPRNLSVVKSQDVEQSDLIRAVTRSEVLLTFLAGITRERIIPEVSDDERYLITHQEFASDTHGWHWDDYSFAFNWALRMPPIASGGMVQAVPHTHWDKNAPRINETLCERQIDTYGLVSGDLYLLRSDTTMHRTVPLTEDGAVRTMLVVSWSAERDLGKVLTGNDRWWENPEAGAAQPVHRAG 263 T 0.0012 2OG-FeII_Oxy_3 pdbpercent F T 7jsq 1 A A DNJB6_HUMAN HHDJ1,HEAT SHOCK PROTEIN J2,HSJ-2,MRJ,MSJ-1 MGNFKSISTSTKMVNGRKITTKRIVENGQERVEVEEDGQLKSLTINGKEQLLRLDNK 57 T 28 DUF1408 pdbhh F Eukaryota T 7jsx 3 Q,R,S,T,U,V,W,X q,r,s,t,u,v,w,x A0A2K3DA85_CHLRE EPYC1 RSSSASKKAVTPSRSALPSNWKQELESLRS 30 T 3.1 3-alpha pdbhh F Eukaryota T 7jta 1 A,B A,B A0A5B9TEE9_9BACT NTF2-like nuclease/anti-CRISPR GSSMGMVVEETRDLAETADCVVIEAILVDDGLRYRQLSVGIKDENGDIIRIVPISTVLI 59 T 0.029 Urease_alpha pdbpercent F Bacteria T 7jtk 15 EA e Unknown protein XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 34 F F F 7jtk 18 IA s A0A2K3D359_CHLRE Uncharacterized protein MSDPEAEQGEQGYEESPEEPGPGSEAPSPSRIDNGLDTIIDIDPQTQHAEEGSNTAYESEQPDVISSYTGGQQEEDGEQAGNGAIDETTEEAAGEADDGGKASGFAVEVDAGTDAAAEGDLEPEPEPERPASASGEPQPTASTSRPASGAAARPASARPTSARPGSAAPRQPSASGGSRPGSGHPVNLAPDSVGLAQQQQQKSQIEVGAQAYEARGSSRPQSGGDAYGQAEEASAAAAAGRPSTSQSGSRPPPSREGVAVVPSIPEDQPLAVPIHIERYIAPGLKAIEVEVAQGPGMPHRLVRVLLDYTQCDAKPYLGGFRNKRTGAVYHHGATQTPRAPKYSEADRKLSRETQTVKIKQHSQQTVREQATQMARPGVLLDNDYDKEVTPGRYQTADERDEIVLRSTLRIQRWVRGWLGRKRAAYLRGKKMEREAFLRDQEARAQSEAEEHRRREIQRRMHPRTAADFEVLYNELEAWRLQETRKIKEAGLAKEQEQQVLQQLLHKETKLLQTIDRLKINANQENKEARIQHTLNEMSKPKKFALRNGGKVDVHTPFTTRAKELQQLYNGLNLPLLTVDERLDVLLHVKWTVKEFDCDLTRELVDLIDREADLLNRGRNPKMLEGLRKRISSLFLNFIETPEFNPEAVRFQIVPMDFEAYLYEQVGKATAKAGTSVGTRTLS 682 T 0.00029 IQ pdbpercent F Eukaryota T 7jts 4 L s A0A2K3D359_CHLRE FAP253 MSDPEAEQGEQGYEESPEEPGPGSEAPSPSRIDNGLDTIIDIDPQTQHAEEGSNTAYESEQPDVISSYTGGQQEEDGEQAGNGAIDETTEEAAGEADDGGKASGFAVEVDAGTDAAAEGDLEPEPEPERPASASGEPQPTASTSRPASGAAARPASARPTSARPGSAAPRQPSASGGSRPGSGHPVNLAPDSVGLAQQQQQKSQIEVGAQAYEARGSSRPQSGGDAYGQAEEASAAAAAGRPSTSQSGSRPPPSREGVAVVPSIPEDQPLAVPIHIERYIAPGLKAIEVEVAQGPGMPHRLVRVLLDYTQCDAKPYLGGFRNKRTGAVYHHGATQTPRAPKYSEADRKLSRETQTVKIKQHSQQTVREQATQMARPGVLLDNDYDKEVTPGRYQTADERDEIVLRSTLRIQRWVRGWLGRKRAAYLRGKKMEREAFLRDQEARAQSEAEEHRRREIQRRMHPRTAADFEVLYNELEAWRLQETRKIKEAGLAKEQEQQVLQQLLHKETKLLQTIDRLKINANQENKEARIQHTLNEMSKPKKFALRNGGKVDVHTPFTTRAKELQQLYNGLNLPLLTVDERLDVLLHVKWTVKEFDCDLTRELVDLIDREADLLNRGRNPKMLEGLRKRISSLFLNFIETPEFNPEAVRFQIVPMDFEAYLYEQVGKATAKAGTSVGTRTLS 682 T 0.00029 IQ pdbpercent F Eukaryota T 7jtv 2 C,D E,H GLU-ALA-PRO-SER-ALA GAEAEAPSAVPDAAG 15 T 68 DUF6412 pdbhh F T 7ju4 4 D 3 Unknown protein XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 72 F F F 7ju4 16 BB s A0A2K3D359_CHLRE FAP253 MSDPEAEQGEQGYEESPEEPGPGSEAPSPSRIDNGLDTIIDIDPQTQHAEEGSNTAYESEQPDVISSYTGGQQEEDGEQAGNGAIDETTEEAAGEADDGGKASGFAVEVDAGTDAAAEGDLEPEPEPERPASASGEPQPTASTSRPASGAAARPASARPTSARPGSAAPRQPSASGGSRPGSGHPVNLAPDSVGLAQQQQQKSQIEVGAQAYEARGSSRPQSGGDAYGQAEEASAAAAAGRPSTSQSGSRPPPSREGVAVVPSIPEDQPLAVPIHIERYIAPGLKAIEVEVAQGPGMPHRLVRVLLDYTQCDAKPYLGGFRNKRTGAVYHHGATQTPRAPKYSEADRKLSRETQTVKIKQHSQQTVREQATQMARPGVLLDNDYDKEVTPGRYQTADERDEIVLRSTLRIQRWVRGWLGRKRAAYLRGKKMEREAFLRDQEARAQSEAEEHRRREIQRRMHPRTAADFEVLYNELEAWRLQETRKIKEAGLAKEQEQQVLQQLLHKETKLLQTIDRLKINANQENKEARIQHTLNEMSKPKKFALRNGGKVDVHTPFTTRAKELQQLYNGLNLPLLTVDERLDVLLHVKWTVKEFDCDLTRELVDLIDREADLLNRGRNPKMLEGLRKRISSLFLNFIETPEFNPEAVRFQIVPMDFEAYLYEQVGKATAKAGTSVGTRTLS 682 T 0.00029 IQ pdbpercent F Eukaryota T 7ju9 1 A A Q7V450_PROMM PCN2.11 GRIDXCPAGGGXXEQXGXCC 20 T 0.03 Bacteriocin_IIc unppssm F Bacteria T 7jvf 1 A A Q7V449_PROMM Prochlorosin 2.10 AGGXIPXLMXGCGWLXGLCVR 21 T 0.00033 L_biotic_typeA unphh F Bacteria T 7jvs 2 B C RL27_STAA8 L27 ribosomal peptide XKLNLQFFASKKGX 14 T 13 mit_SMPDase pdbhh F Bacteria T 7jvv 2 C,D C,D ACE-ARG-HIS-ALY-ALY-MCM XRHXXX 6 T 360 Viral_helicase1 pdbhh F F 7jwp 3 Q,R,S,T,U,V,W,X Q,R,S,T,U,V,W,X IL-18 peptide GDLESD 6 T 2.2 MtrE pdbhh F F 7jwq 3 E,F V,P IL-1beta peptide LFFEVD 6 T 0.26 Herpes_UL24 pdbhh F F 7jx4 1 A,B,C A,B,C Collagen mimetic peptide with N-Lysine guest XGPPGPPGPPGXPGPPGPPGPPX 23 T 0.0008 Collagen pdbpssm F F 7jx5 1 A,B,C,D,E,F C,A,B,E,F,D Collagen mimetic peptide with N-Phenylalanine guest XGPPGPPGPPGXPGPPGPPGPPX 23 T 0.0008 Collagen pdbpssm F F 7jx7 2 B B Diacetylated-H2A.Z peptide GGXAGX 6 T 40 Zea_mays_MuDR pdbhh F F 7jxt 1 A,B A,B PGH1_SHEEP CYCLOOXYGENASE-1,COX-1,PROSTAGLANDIN H2 SYNTHASE 1,PHS 1,PROSTAGLANDIN-ENDOPEROXIDE SYNTHASE 1 PVNPCCYYPCQHQGICVRFGLDRYQCDCTRTGYSGPNCTIPEIWTWLRTTLRPSPSFIHFLLTHGRWLWDFVNATFIRDTLMRLVLTVRSNLIPSPPTYNIAHDYISWESFSNVSYYTRILPSVPRDCPTPMDTKGKKQLPDAEFLSRRFLLRRKFIPDPQSTNLMFAFFAQHFTHQFFKTSGKMGPGFTKALGHGVDLGHIYGDNLERQYQLRLFKDGKLKYQMLNGEVYPPSVEEAPVLMHYPRGIPPQSQMAVGQEVFGLLPGLMLYATIWLREHNRVCDLLKAEHPTWGDEQLFQTARLILIGETIKIVIEEYVQQLSGYFLQLKFDPELLFGAQFQYRNRIAMEFNQLYHWHPLMPDSFRVGPQDYSYEQFLFNTSMLVDYGVEALVDAFSRQPAGRIGGGRNIDHHILHVAVDVIKESRVLRLQPFNEYRKRFGMKPYTSFQELTGEKEMAAELEELYGDIDALEFYPGLLLEKCHPNSIFGESMIEMGAPFSLKGLLGNPICSPEYWKASTFGGEVGFNLVKTATLKKLVCLNTKTCPYVSFHVPD 553 T 2.5E-05 An_peroxidase pdb F Eukaryota T 7jyn 2 B B NSD3_HUMAN NUCLEAR SET DOMAIN-CONTAINING PROTEIN 3,PROTEIN WHISTLE,WHSC1-LIKE 1 ISOFORM 9 WITH METHYLTRANSFERASE ACTIVITY TO LYSINE,WOLF-HIRSCHHORN SYNDROME CANDIDATE 1-LIKE PROTEIN 1,WHSC1-LIKE PROTEIN 1 EFTGSPEIKLKITKTIQNGRELFESSLCGDLLNEVQASE 39 T 47 DUF3587 pdbhh F Eukaryota T 7jzl 2 B,D,F E,G,F LCB1 DKEWILQKIYEIMRLLDELGHAEASMRVSDLIYEFMKKGDERLLEEAERLLEEVE 55 T 0.29 ER pdbhh F T 7jzo 2 C,D C,D LyCALTPP peptide core ANSRLPTSKI 10 T 10 DUF3697 pdbhh F T 7jzp 2 C,D C,D LyCALBF peptide core ANSRLPTSKI 10 T 10 DUF3697 pdbhh F T 7jzq 2 B C LyCALPMB peptide core ANSRLPTSKI 10 T 10 DUF3697 pdbhh F T 7jzr 2 C,D C,D LyCALAEB peptide core ANSRLPTSKI 10 T 10 DUF3697 pdbhh F T 7jzu 1 A A LCB1 GGSDKEWILQKIYEIMRLLDELGHAEASMRVSDLIYEFMKKGDERLLEEAERLLEEVERGS 61 T 0.5 ER pdbhh F T 7jzw 5 J J L7P7U3_9CAUD Type I-F anti-CRISPR protein MMTISKTDIDCYLQTYVVIDPVSNGWQWGIDENGVGGALHHGRVEMVEGENGYFGLRGATHPTEKEAMAAALGYLWKCRQDLVAIARNDAIEAEKYRAKA 100 T 1.8 TAL_effector pdbhh T Viruses T 7jzx 6 K J B3G1L5_PSEAI AcrF7 MSHASHNGEAPKRIEAMTTFTSIVTTNPDFGGFEFYVEAGQQFDDSAYEEAYGVSVPSAVVEEMNAKAAQLKDGEWLNVSHEA 83 T 0.15 Ribosomal_S19 pdbpercent F Bacteria T 7jzy 6 K,L J,K C0AVY5_9GAMM AcrF9 MKAAYIIKEVQNINSEREGTQIEATSLSQAKRIASKEQCFHGTVMRIETVNGLWLAYKEDGKRWVDCQ 68 T 0.11 Ribosomal_L19 pdbpssm F Bacteria T 7jzz 5 J,K J,K A0A0R6PCL0_9CAUD AcrF14 MKKIEMIEISQNRQNLTAFLHISEIKAINAKLADGVDVDKKSFDEICSIVLEQYQAKQISNKQASEIFETLAKANKSFKIEKFRCSHGYNEIYKYSPDHEAYLFYCKGGQGQLNKLIAENGRFM 124 T 0.053 GatB_Yqey pdb T Viruses T 7k04 1 A E RAD33_YEAST DNA repair protein RAD33 MSKSTNVSYERVELFENPKVPIEVEDEILEKYAESSLDHDMTVNELPRFFKDLQLEPTIWKLVRNEDVIIEGTDVIDFTKLVRCTCQLLILMNNLTVIDDLWSMLIRNCGRDVDFPQVALRDHVLSVKDLQKISNLIGADQSSGTIEMISCATDGKRLFMTYLDFGCVLGKLGYLKM 177 T 8.4E-13 Rad33 pdbpssm F Eukaryota T 7k1m 1 A A GLY-CYS-HIS-TYR-THR-PRO-PHE-GLY-LEU-ILE-CYS-PHE peptide GCHYTPFGLICF 12 T 2.3 DUF3951 pdbhh F T 7k1y 1 A,B,C,D,E B,C,D,E,G Vac14 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 532 F F F 7k28 2 C P NF2L2_HUMAN Nrf2 peptide,ADEETGEFL XADEETGEFLX 11 T 3.2 DUF4585 pdbhh F Eukaryota T 7k29 2 C P NF2L2_HUMAN ACE-LEU-ASP-GLU-GLU-THR-GLY-GLU-ALA-LEU-NH2 XLDEETGEALX 11 T 4.9 Adeno_100 pdbhh F Eukaryota T 7k2a 2 C P NF2L2_HUMAN ACE-LEU-ASP-GLU-GLU-THR-GLY-GLU-PHE-ALA-NH2 XLDEETGEFAX 11 T 1.3 MBF1 pdbhh F Eukaryota T 7k2b 2 C P NF2L2_HUMAN ACE-ALA-ASP-GLU-GLU-THR-GLY-GLU-PHE-ALA-NH2 XADEETGEFAX 11 T 10 DUF4585 pdbhh F Eukaryota T 7k2c 2 C P NF2L2_HUMAN Nrf2 peptide,ADEETGEAA XADEETGEAAX 11 T 41 Phi29_Phage_SSB pdbhh F Eukaryota T 7k2d 2 C P NF2L2_HUMAN Nrf2 linear peptide, Ace-GDEETGE-NH2 XGDEETGEX 9 T 24 Phi29_Phage_SSB pdbhh F Eukaryota F 7k2e 2 C P NF2L2_HUMAN GLY-ASP-GLU-GLU-THR-GLY-GLU GDEETGE 7 T 12 Phi29_Phage_SSB pdbhh F Eukaryota F 7k2f 2 C C Nrf2 cyclic peptide,c[GAEETGE] GAEETGE 7 T 240 Curto_V3 pdbhh F F 7k2g 2 C P GLY-ASP-GLU-GLU-ALA-GLY-GLU GDEEAGE 7 T 12 DUF1491 pdbhh F F 7k2h 2 C P GLY-ASP-PRO-GLU-THR-GLY-GLU GDPETGE 7 T 0.55 TRCF pdbhh F F 7k2i 2 C P Nrf2 cyclic peptide,c[GAPETGE] GAPETGE 7 T 70 Adeno_100 pdbhh F F 7k2j 2 C P Nrf2 cyclic peptide,c[GDPEAGE] GDPEAGE 7 T 3.7 DUF1491 pdbhh F F 7k2k 2 C P NF2L2_HUMAN BAL-ASP-GLU-GLU-THR-GLY-GLU XDEETGE 7 T 8.3 Phi29_Phage_SSB pdbhh F Eukaryota F 7k2l 3 C P Nrf2 cyclic peptide,c[BAL-NPETGE] XNPETGE 7 T 1.7 RBDV_coat pdbhh F T 7k2m 2 C P Nrf2 cyclic peptide,c[GEPETGE] GEPETGE 7 T 6.9 Lectin_C_term pdbhh F F 7k2n 2 C P (BAL)DPETGE XDPETGE 7 T 0.37 DUF4585 pdbhh F T 7k2o 2 C P (ABU)DPETGE XDPETGE 7 T 0.37 DUF4585 pdbhh F T 7k2p 2 C P (DAV)DPETGE XDPETGE 7 T 0.37 DUF4585 pdbhh F T 7k2q 2 C P ACA-ASP-PRO-GLU-THR-GLY-GLU XDPETGE 7 T 0.37 DUF4585 pdbhh F T 7k2r 2 C P B3A-ASP-PRO-GLU-THR-GLY-GLU XDPETGE 7 T 1.4 Adeno_100 pdbhh F T 7k2s 2 C P B3A-ASP-PRO-GLU-THR-GLY-GLU XDPETGE 7 T 1.4 Adeno_100 pdbhh F T 7k3h 1 A,B A,B Network hallucinated protein 0217 MGSSHHHHHHSSGLVPRGSHMSPIARQALDIAKSVLEHSKGMFDYWEGMLEQYEKTGDPDQANKLRQTLNRVKNSVGRLESALKRAERAYDTGNPDAAVGAVVELIGNVHEIMSTFHELFG 121 T 0.015 DUF2379 pdb F T 7k3j 2 B,D,F,H B,D,F,H PANX_DROME PROTEIN SILENCIO STLYKNAATQTERRTATRDAGTQVRLE 27 T 7.3 CFAP91 pdbhh F Eukaryota T 7k3k 2 B B PANX_DROME PROTEIN SILENCIO STLYKNAATQTERR 14 T 1 CFAP91 pdbhh F Eukaryota T 7k3l 2 B B PANX_DROME PROTEIN SILENCIO STATRDAGTQVRLE 14 T 7.8 NifQ unppercent F Eukaryota T 7k3s 1 A A BRCA1_MOUSE RING-TYPE E3 UBIQUITIN TRANSFERASE BRCA1 MNLSEDCSQSDILTTQQRATMKYNLIKLQQEMAHLEAVLEQRGNQPSGHSPSLEHHHHHH 60 T 0.0056 HrpB7 pdbpssm F Eukaryota T 7k3s 2 B B PALB2_MOUSE Partner and localizer of BRCA2 MEELSGKPLSYAEKEKLKEKLAFLKKEYSRTLARLQRAKRAEKAKNSKKAIEDGVPQPEALEHHHHHH 68 T 0.024 DUF1564 pdbpercent F Eukaryota T 7k58 11 K E Q23FU1_TETTS Flagellar outer dynein arm intermediate protein, putative KEFNNPINFQDTETRYGGIQNQVVNINQYVQRNPNFIDLDNIAELSEHSVNTERVKTGDRGMSHKEGGWPGNVDPNEAQETGRFKKRIEKDTSFPQAVKDLKEGVEKCIYQNNQIDLLEEYFEGETSEHVVENLSSKTLMLFKDEKEICKRSVSEISWHPEGPTKVAVSYAIMRFQQMPEKMPTQAYVWDLLNPNSPEIKLMSPSAVTNISYNQKIPDQIGGGCYNGLLAVWDGRKGENPIMISPVENSHYEPVTHFHWLMSKTGSECVTTSTDGKVMWWDTRKFEAGPVEKLNIIEGLGENEEIIGGTALEYNVEAGPSKFLIGTESGSILTANKKLKKPVEITTRYGLDQGRHLGPVYSINRSNQNPKYFLSVGDWSCKIWVEDLKTPIIRTKYHGSYLSDGCWSPTRSGAFFLVRRDGWMDVWDYYYRQNEIAFSHKVSDSPLTCIKINQTGGAYHNSGKLCAIGDQDGTVTILELCDSLYTMQPKEKDIINEMFEREYRKEKNLETIKKQQELAKRQVQKDMGSQKEKWEKKKLEMIETAEASFHENLAKNPV 557 T 0.13 DUF2247 pdb F Eukaryota T 7k58 12 L D I7M008_TETTS Dynein intermediate chain 2 LTAQELNEDMPSKMLEPKNPQAPKNITVYDYYTRKFKTDELVDQMIVHFSMDGDYIWKESNEYKTQEEIRDTKKALIKEAMRKQESEEPGANHDEEAIKQTLRNKFNYNTRECQTINPSIRERGVSTEPPPSDTICGNITQWEIFDAYYAEIMKDHQIENKKKKEVDQDKKQDQSMYSTSFKRCCKIMERMVVQNDQEDKYHDYRYYWSQGDNLEAGKNEGHLLPIWRFSNEKQRKKNVTSICWNPLYPDLFAVSLGSYDFTKQRMGLICLYSLKNTTHPEYAFNCEAGVMCLDFHPKSAALLAVGLYDGTVLVYDIRNKHKKPIYQSTVRNQKHTDPVWQVKWNPDTSKNYNFYSISSDGRVMNWILMKNKLEPEEVILLRLVGKNEEESTLIGLACGLCFDFNKFEPHIFLVGTEEGKIHKCSRAYSGQYQETYNGHLLAVYKVKWNNFHPRTFISASADWTVRIWDSKYTSQIICFDLSMMVVDAVWAPYSSTVFACATMDKVQVYDLNVDKLNKLAEQKIVKQPKLTNLSFNYKDPILLVGDSHGGVTLVKLSPNLCKSGPEIKQTEDKKAMEEFKNVKIEDYEREKMENL 595 T 0.004 WD40 pdb F Eukaryota T 7k5b 4 D D I7M008_TETTS Dynein intermediate chain 2 LTAQELNEDMPSKMLEPKNPQAPKNITVYDYYTRKFKTDELVDQMIVHFSMDGDYIWKESNEYKTQEEIRDTKKALIKEAMRKQESEEPGANHDEEAIKQTLRNKFNYNTRECQTINPSIRERGVSTEPPPSDTICGNITQWEIFDAYYAEIMKDHQIENKKKKEVDQDKKQDQSMYSTSFKRCCKIMERMVVQNDQEDKYHDYRYYWSQGDNLEAGKNEGHLLPIWRFSNEKQRKKNVTSICWNPLYPDLFAVSLGSYDFTKQRMGLICLYSLKNTTHPEYAFNCEAGVMCLDFHPKSAALLAVGLYDGTVLVYDIRNKHKKPIYQSTVRNQKHTDPVWQVKWNPDTSKNYNFYSISSDGRVMNWILMKNKLEPEEVILLRLVGKNEEESTLIGLACGLCFDFNKFEPHIFLVGTEEGKIHKCSRAYSGQYQETYNGHLLAVYKVKWNNFHPRTFISASADWTVRIWDSKYTSQIICFDLSMMVVDAVWAPYSSTVFACATMDKVQVYDLNVDKLNKLAEQKIVKQPKLTNLSFNYKDPILLVGDSHGGVTLVKLSPNLCKSGPEIKQTEDKKAMEEFKNVKIEDYEREKMENL 595 T 0.004 WD40 pdb F Eukaryota T 7k5b 5 E E Q23FU1_TETTS Flagellar outer dynein arm intermediate protein, putative KEFNNPINFQDTETRYGGIQNQVVNINQYVQRNPNFIDLDNIAELSEHSVNTERVKTGDRGMSHKEGGWPGNVDPNEAQETGRFKKRIEKDTSFPQAVKDLKEGVEKCIYQNNQIDLLEEYFEGETSEHVVENLSSKTLMLFKDEKEICKRSVSEISWHPEGPTKVAVSYAIMRFQQMPEKMPTQAYVWDLLNPNSPEIKLMSPSAVTNISYNQKIPDQIGGGCYNGLLAVWDGRKGENPIMISPVENSHYEPVTHFHWLMSKTGSECVTTSTDGKVMWWDTRKFEAGPVEKLNIIEGLGENEEIIGGTALEYNVEAGPSKFLIGTESGSILTANKKLKKPVEITTRYGLDQGRHLGPVYSINRSNQNPKYFLSVGDWSCKIWVEDLKTPIIRTKYHGSYLSDGCWSPTRSGAFFLVRRDGWMDVWDYYYRQNEIAFSHKVSDSPLTCIKINQTGGAYHNSGKLCAIGDQDGTVTILELCDSLYTMQPKEKDIINEMFEREYRKEKNLETIKKQQELAKRQVQKDMGSQKEKWEKKKLEMIETAEASFHENLAKNPV 557 T 0.13 DUF2247 pdb F Eukaryota T 7k5c 1 A,C,E,G,I,K H,A,C,E,I,K GP15_BPT7 GENE PRODUCT 15,GP15 MSKIESALQAAQPGLSRLRGGAGGMGYRAATTQAEQPRSSLLDTIGRFAKAGADMYTAKEQRARDLADERSNEIIRKLTPEQRREALNNGTLLYQDDPYAMEALRVKTGRNAAYLVDDDVMQKIKEGVFRTREEMEEYRHSRLQEGAKVYAEQFGIDPEDVDYQRGFNGDITERNISLYGAHDNFLSQQAQKGAIMNSRVELNGVLQDPDMLRRPDSADFFEKYIDNGLVTGAIPSDAQATQLISQAFSDASSRAGGADFLMRVGDKKVTLNGATTTYRELIGEEQWNALMVTAQRSQFETDAKLNEQYRLKINSALNQEDPRTAWEMLQGIKAELDKVQPDEQMTPQREWLISAQEQVQNQMNAWTKAQAKALDDSMKSMNKLDVIDKQFQKRINGEWVSTDFKDMPVNENTGEFKHSDMVNYANKKLAEIDSMDIPDGAKDAMKLKYLQADSKDGAFRTAIGTMVTDAGQEWSAAVINGKLPERTPAMDALRRIRNADPQLIAALYPDQAELFLTMDMMDKQGIDPQVILDADRLTVKRSKEQRFEDDKAFESALNASKAPEIARMPASLRESARKIYDSVKYRSGNESMAMEQMTKFLKESTYTFTGDDVDGDTVGVIPKNMMQVNSDPKSWEQGRDILEEARKGIIASNPWITNKQLTMYSQGDSIYLMDTTGQVRVRYDKELLSKVWSENQKKLEEKAREKALADVNKRAPIVAATKAREAAAKRVREKRKQTPKFIYGRKE 747 T 18 DUF3135 pdbpercent T Viruses T 7k5m 1 A,B,C,D,E,F A,B,C,D,E,F CAPSD_HBVD1 CORE ANTIGEN,CORE PROTEIN,HBCAG,P21.5 SMDIDPYKEFGATVELLSFLPSDFFPSVRDLLDTAAALYRDALESPEHCSPHHTALRQAILCWGDLMTLATWVGTNLEDPASRDLVVSYVNTNVGLKFRQLLWFHISCLTFGRETVLEYLVSFGVWIRTPPAARPPNAPILSTLPETTVVKLENLYFQ 158 T 3.9E-25 Hepatitis_core unp T Viruses T 7k75 3 I,J E,F PfCSP N-terminal peptide P17 KPKHKKLK 8 T 51 CTK3_C pdbhh F F 7k76 3 E P PfCSP N-terminal peptide P17 KLRKPKHKKLKQPAD 15 T 7.6 P120R pdbhh F T 7k79 1 A K CBF3C_YEAST CHROMOSOME TRANSMISSION FIDELITY PROTEIN 13,KINETOCHORE PROTEIN CTF13 MGPSFNPVRFLELPIDIRKEVYFHLDGNFCGAHPYPIDILYKSNDVELPGKPSYKRSKRSKKLLRYMYPVFATYLNIFEYSPQLIEKWLEYAFWLRYDCLVLDCFKVNHLYDGTLIDALEWTYLDNELRLAYFNKASMLEVWYTFKEYKKWVIDSVAFDELDLLNVSNIQFNIDNLTPQLVDKCLSILEQKDLFATIGEVQFGQDEEVGEEKDVDVSGANSDENSSPSSTIKNKKRSASKRSHSDNGNVGATHNQLTSISVIRTIRSMESMKSLRKITVRGEKLYELLINFHGFRDNPGKTISYIVKRRINEIRLSRMNQISRTGLADFTRWDNLQKLVLSRVAYIDLNSIVFPKNFKSLTMKRVSKIKWWNIEENILKELKVDKRTFKSLYIKEDDSKFTKFFNLRHTRIKELDKSEINQITYLRCQAIVWLSFRTLNHIKLQNVSEVFNNIIVPRALFDSKRVEIYRCEKISQVLVIGSRSGSENLYFQGSKRRWKKNFIAVSAANRFKKISSSGAL 519 T 0.088 Glft2_N unppercent F Eukaryota T 7k7a 1 A,B,C A,B,C TNR1A_HUMAN TUMOR NECROSIS FACTOR RECEPTOR 1,TNF-R1,TUMOR NECROSIS FACTOR RECEPTOR TYPE I,TNFR-I,P55,P60 GTTVLLPLVIFFGLALLSLLFIGLAYRYQR 30 T 0.13 Papilloma_E5A pdbhh F Eukaryota T 7k7h 4 H G A0A4Z0MXD9_SALET PERTUSSIS-LIKE TOXIN SUBUNIT ARTA FYDARPVIELILSK 14 T 3.5 DUF4334 pdbhh F Bacteria T 7k7r 3 C,F C,F EBNA1_EBVB9 EBNA1 peptide AA386-405 SQSSSSGSPPRRPPPGRRPF 20 T 26 ODV-E18 pdbhh T Viruses T 7k9b 1 A,B A,B Q9KGD7_BACHD OAPB GPSPEIGQIVKIVKGRDRDQFSVIIKRVDDRFVYIADGDKRKVDRAKRKNMNHLKLIDHISPEVRHSFEETGKVTNGKLRFALKKFLEEHADLLKEGE 98 T 0.023 FERM_F2 pdbpssm F Bacteria T 7k9c 1 A,B A,B Q9KGD7_BACHD OAPB GPSPEIGQIVKIVKGRDRDQFSVIIKRVDDRFVYIADGDKRKVDRAKRKNMNHLKLIDHISPEVRHSFEETGKVTNGKLRFALKKFLEEHADLLKEGE 98 T 0.023 FERM_F2 pdbpssm F Bacteria T 7k9d 1 A A Q9KGD7_BACHD OAPB GPSPEIGQIVKIVKGRDRDQFSVIIKRVDDRFVYIADGDKRKVDRAKRKNMNHLKLIDHISPEVRHSFEETGKVTNGKLRFALKKFLEEHADLLKEGE 98 T 0.023 FERM_F2 pdbpssm F Bacteria T 7k9e 1 A,C A,C Q9KGD7_BACHD OAPB GPSPEIGQIVKIVKGRDRDQFSVIIKRVDDRFVYIADGDKRKVDRAKRKNMNHLKLIDHISPEVRHSFEETGKVTNGKLRFALKKFLEEHADLLKEGE 98 T 0.023 FERM_F2 pdbpssm F Bacteria T 7kai 7 G G Protein transport protein Sec62 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 56 F F F 7kaj 7 G G Protein transport protein Sec62 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 56 F F F 7kap 7 G G Protein transport protein Sec62 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 56 F F F 7kaq 7 G G Protein transport protein Sec62 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 56 F F F 7kas 7 G G Protein transport protein SEC62 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 56 F F F 7kau 7 G G Protein transport protein Sec62 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 56 F F F 7kbb 3 C C UL128_HCMVA UL128 EECCEFINVNHPPERCYDFKMCNRFTVALRCPDGEVCYSPEKTAEIRGIVTTMTHSLTRQVVHNKLTSCNYNPLYLEADGRIRCGKVNDKAQYLLGAAGSVPYRWINLEYDKITRIVGLDQYLESVKKHKRLDVCRAKMGYMLQ 144 T 12 SH3_19 pdbhh T Viruses T 7kbb 5 E E U131A_HCMVM UL131A QCQRETAEKNDYYRVPHYWDACSRALPDQTRYKYVEQLVDLTLNYHYDASHGLDNFDVLKRINVTEVSLLISDFRRQNRRGGTNKRTTFNAAGSLAPHARSLEFSVRLFAN 111 T 0.064 Prion pdbpercent T Viruses T 7kbj 1 A,E G,I GANAB_MOUSE ALPHA-GLUCOSIDASE 2,GLUCOSIDASE II SUBUNIT ALPHA MGILPSPGMPALLSLVSLLSVLLMGCVAETGVDRSNFKTCDESSFCKRQRSIRPGLSPYRALLDTLQLGPDALTVHLIHEVTKVLLVLELQGLQKDMTRIRIDELEPRRPRYRVPDVLVADPPTARLSVSGRDDNSVELTVAEGPYKIILTAQPFRLDLLEDRSLLLSVNARGLMAFEHQRAPR 184 T 0.0016 NtCtMGAM_N pdbpercent F Eukaryota T 7kbq 1 A A DE NOVO DESIGNED OR689 MRIIVIIVTDEQKIEDMWEILKEIGVDRIVIITSNKQLAERAKELGVDRIFLLTDDELIAEIVKKLGADIVFSENRDIAKKIIRKLKNIIILSNDEQLVKELQKEASDARVFNVQTKQDFKDLIEKILEHHHHHH 135 T 0.00077 ADH_zinc_N pdb F T 7kbr 1 A,E G,I GANAB_MOUSE ALPHA-GLUCOSIDASE 2,GLUCOSIDASE II SUBUNIT ALPHA MGILPSPGMPALLSLVSLLSVLLMGCVAETGVDRSNFKTCDESSFCKRQRSIRPGLSPYRALLDTLQLGPDALTVHLIHEVTKVLLVLELQGLQKDMTRIRIDELEPRRPRYRVPDVLVADPPTARLSVSGRDDNSVELTVAEGPYKIILTAQPFRLDLLEDRSLLLSVNARGLMAFEHQRAP 183 T 0.0089 NtCtMGAM_N pdbpercent F Eukaryota T 7kd6 5 E,J,O,T F,L,R,X INSR_HUMAN Insulin receptor isoform A alphaCT peptide TFEDYLHNVVFVPRPS 16 T 0.00017 DUF4998 unphh F Eukaryota T 7kd7 2 B,D E,B SER-GLY-ARG-GLY-LYS SGRGK 5 T 22 Rad17 pdbhh F F 7kdf 5 E E STU2_YEAST Y55_G0035590.MRNA.1.CDS.1 EESYKRAAAVTSTLKARIEKMKAKSRREGTTRT 33 T 0.1 SSP160 pdbpssm F Eukaryota T 7kdq 1 A A NDB4S_TITST Stigmurin analog StigA15 FFSLIPKLVGGLIKAFKX 18 T 0.48 Endotoxin_N pdbhh F Eukaryota T 7kei 3 C C HA peptide from 2009 H1N1 pandemic flu virus. AMERNAGSGIIISDGGGGSLVPRGS 25 T 2.9 Cuticle_1 pdbhh F T 7kev 3 C C cyclic peptide LDLR disruptor XFVSTXXXDRPCGX 14 T 33 Sod_Ni pdbhh F T 7kfa 3 C D 1-[2,6,10.14-TETRAMETHYL-HEXADECAN-16-YL]-2-[2,10,14-TRIMETHYLHEXADECAN-16-YL]GLYCEROL XFVPTTXXEAPCX 13 T 26 Chromadorea_ALT pdbhh F T 7kgb 51 YA v A0A3E0UTA6_MYCTX 50S ribosomal protein L37 MAKRGRKKRDRKYSKANHGKRPNS 24 T 21 Protamine_3 pdbhh F Bacteria T 7kgv 1 A,B A,B S38A9_DANRE SOLUTE CARRIER FAMILY 38 MEMBER 9 MDEDSKPLLGSVPTGDYYTDSLDPKQRRPFHVEPRNIVGEDVQERVSAEAAVLSSRVHYYSRLTGSSDRLLAPPDHVIPSHEDIYIYSPLGTAFKVQGGDSPIKNPSIVTIFAIWNTMMGTSILSIPWGIKQAGFTLGIIIIVLMGLLTLYCCYRVLKSTKSIPYVDTSDWEFPDVCKYYFGGFGKWSSLVFSLVSLIGAMVVYWVLMSNFLFNTGKFIFNYVHNVQTSDAFGTQGTERVICPYPDVDPHGQSSTSLYSGSDQSTGLEFDHWWSKTNTIPFYLILLLLPLLNFRSASFFARFTFLGTISVIYLIFLVTYKAIQLGFHLEFHWFDSSMFFVPEFRTLFPQLSGVLTLAFFIHNCIITLMKNNKHQENNVRDLSLAYLLVGLTYLYVGVLIFAAFPSPPLSKECIEPNFLDNFPSSDILVFVARTFLLFQMTTVYPLLGYLVRVQLMGQIFGNHYPGFLHVFVLNVFVVGAGVLMARFYPNIGSIIRYSGALCGLALVFVLPSLIHMVSLKRRGELRWTSTLFHGFLILLGVANLLGQFFM 549 T 1.3E-22 Aa_trans unppercent F Eukaryota T 7kh0 3 C A GNAI3_HUMAN;GNAS2_HUMAN G(I) ALPHA-3,ADENYLATE CYCLASE-STIMULATING G ALPHA PROTEIN GCTLSAEDKAAVERSKMIEKQLQKDKQVYRATHRLLLLGAGESGKSTIVKQMRILHVNGFNGDSEKATKVQDIKNNLKEAIETIVAAMSNLVPPVELANPENQFRVDYILSVMNVPDFDFPPEFYEHAKALWEDEGVRACYERSNEYQLIDCAQYFLDKIDVIKQDDYVPSDQDLLRCRVLTSGIFETKFQVDKVNFHMFDVGGQRDERRKWIQCFNDVTAIIFVVASSSYNMVIREDNQTNRLQEALNLFKSIWNNRWLRTISVILFLNKQDLLAEKVLAGKSKIEDYFPEFARYTTPEDATPEPGEDPRVTRAKYFIRDEFLRISTASGDGRHYCYPHFTCAVDTENIRRVFNDCRDIIQRMHLRQYELL 372 T 2.3E-123 G-alpha unp F Eukaryota T 7kh1 3 M,N,O,P,Q,R A3,B3,C3,D3,E3,F3 baseplate organization protein, gp11 MSLVNGMVESLNNTKSETEIGIGGYRLFARVRETVNYRNIVPTDTLEDGSSSTDDIINEPITVSIEGVVSNLFVEERQYPQLVSRDFSAVGEITALLPAKSQQQIQRISQIDSQIRDAVLAAERAERLAGKPYEFFGNSGNSAKTEQEKFIDFMEALYFSRRPTEVSVNFRDYKNMALVSFIPVRDNNTKDTRFTADFQQINYSTLVYTPVSSPSKSVSGKVSDASNKGGQNPESNETGERSLLSSLVGG 250 T 64 T2SSM_b pdbhh F T 7kh1 4 S,T,U,V,W,X A4,B4,C4,D4,E4,F4 baseplate stabilizing protein, gp12 MNLIENITSEYIQTHALEFSRGFAVLTLIYEQAVQMWKMNVVYTRAGDEEPQPPIYGVKLALSTTHIKHRNWPFDFTVIDTTNNGMDPYRADDFETGRCQLYFITPEEMIQVRGVDVQ 118 T 21 Frankia_peptide pdbhh F T 7kh1 7 QA,RA,SA,TA,UA,VA A7,B7,C7,D7,E7,F7 tail sheath initiator protein, gp15 MRVRTLDDNGDWTFGRGKADYITSKKAIAQTVSTRIKSWANDNPLAMNANIDWKDLLGRKGTEDTILREIERVVVQTDGVIRVTELEVIKTEKRVQSILLSYDTIYDDSETLEINDL 117 T 0.0097 DUF2634 pdbpssm F T 7kht 2 B B PRGC1_HUMAN PPARGC-1-ALPHA,LIGAND EFFECT MODULATOR 6 EEPSLLKKLLLAPA 14 T 7 Cnl2_NKP2 pdbhh F Eukaryota T 7kix 1 A A A0A125RN64_9CAUD Anti-CRISPR protein AcrIE2 MNTYLIDPRKNNDNSGERFTVDAVDITAAAKSAAQQILGEEFEGLVYRETGESNGSGMFQAYHHLHGTNRTETTVGYPFHVMELEHHHHHH 91 T 8.5 Baculo_E66 pdbhh T Viruses T 7kiy 2 B B Q8I060_PLAFA RHOPH2 MIKVTIFLLLSIFSFNLYGLELNEKVSIKYGAEQGVGSADSNTKLCSDILKYLYMDEYLSEGDKATFEKKCHNVIGNIRNTFSNKNTIKEGNEFLMSILHMKSLYGNNNNNNAGSESDVTLKSLYLSLKGSQNTEGESEVPSDDEINKTIMNFVKFNKYLLDNSNDIKKVHDFLVLTSQSNENLLPNKEKLFEQIVDQIKYFDEYFFASGGKIKVKKGYLKYNFLDIYKQPVCSAYLHLCSRYYESVSIYIRLKKVFNGIPAFLDKNCRKVKGEEFKKLMDMELKHNHIVERFDKYIISDDLYYVNMKVFDLKNVDKIQVSKIDDINNLNIYEHKETMHLSAKNLSRYIDIKKELNDEKAYKQLMSAIRKYVTTLTKADSDITYFVKQLDDEEIERFLIDLNFFLYNGFLRITEDKHLINADDVSPSYINLYRSNNIVALYILKTQYEENKLSEYRAHKFYRRKRVSNITNDMIKKDFTQTNALTNLPNLDNKKTTEYYLKEYENFVENFQPDLHDIMKLQLFFTMAFKDCNVNQNFTETSKKLWFDLLYAYDKFGWFYIHPNEVINSINKTDFVRHVLVSRNFLLKNNDQLTFLETQVAKIVEIINLSLEVDKSPDSLDFSIPMNFFNHKNGYHVMNDDKLKLLTSYEYIDSIANNYFFLSEYKNDVFRTGNNFKLYFNLPNIYSLAYQLFNELAININVITNVPLKKYLKYNASYAYFTLMNMIGKNHDIYSKGSRFVYASYILGLVFFIESHIDIARLKPKDFFFMKQSLPIIDHVYHKDLKTLKKNCTLLTDFMKINKNSQNYSLTHTEEMIKILGLLTVTLWAKEGKKSVYYDDDVSLYRKLMVSCVFNGGETIQEKLANNIEKSCDISQYGIKSKNLKDMIDINLSIHKWNPAEIEKLAYSFVLSCKMQKLMYKPMNVEKLPLEDYYKLPLAPDMVKTYHCYKLGKQAAKLLESIILKKKFVRFRVTDAIDVYDFFYIKKVLSSHIKKEYNEFLQDKRAFEKKELETILNNSPFSEEQTMKLINSYECHWFTSYENFRILWMHASSNLGTGTYLKNFFSELWQNIRFLFKSKLKIRDMEYFSGDISQMNLLDYYSPMVHSESHCQEKMQVLFITLRDSKEENRSEIAQKVKSAYYQCKLDYYKNHHSDFIHRIHPNDFLNNKVYVLKQPYYLMSNVPLNNPKKVSRLFVTEGTLEYLLLDKINIPECFGPCTKLHFNKVVIKESKQRIYDMTINNALVPEIQPYNRRKYMTIYINEAYIKNIVSDALTSEEIKRHDIQKGNIKICMGKSTYLTEPILTEEHFNLTHKPVYDFSSVKHNLKVFHMKNEHLVSEDPNDDCFINYPLATINLDISDPYKEISEDLIKNLYILKSS 1378 T 1.3 Crystall_2 pdbpercent F Eukaryota T 7kiy 3 C C A0A024X9S2_PLAFC RHOPH3 MRSKHLVTLFIITFLSFSTVKVWGKDVFAGFVTKKLKTLLDCNFALYYNFKGNGPDAGSFLDFVDEPEQFYWFVEHFLSVKFRVPKHLKDKNIHNFTPCLNRSWVSEFLKEYEEPFVNPVMKFLDKEQRLFFTYNFGDVEPQGKYTYFPVKEFHKYCILPPLIKTNIKDGESGEFLKYQLNKEEYKVFLSSVGSQMTAIKNLYSTVEDEQRKQLLKVIIENESTNDISVQCPTYNIKLHYTKECANSNNILKCIDEFLRKTCEKKTESKHPSADLCEHLQFLFESLKNPYLDNFKKFMTNSDFTLIKPQSVWNVPIFDIYKPKNYLDSVQNLDTECFKKLNSKNLIFLSFHDDIPNNPYYNVELQEIVKLSTYTYSIFDKLYNFFFVFKKSGAPISPVSVKELSHNITDFSFKEDNSEIQCQNVRKSLDLEVDVETMKGIAAEKLCKIIEKFILTKDDASKPEKSDIHRGFRILCILISTHVEAYNIVRQLLNMESMISLTRYTSLYIHKFFKSVTLLKGNFLYKNNKAIRYSRACSKASLHVPSVLYRRNIYIPETFLSLYLGLSNLVSSNPSSPFFEYAIIEFLVTYYNKGSEKFVLYFISIISVLYINEYYYEQLSCFYPKEFELIKSRMIHPNIVDRILKGIDNLMKSTRYDKMRTMYLDFESSDIFSREKVFTALYNFDSFIKTNEQLKKKNLEEISEIPVQLETSNDGIGYRKQDVLYETDKPQTMDEASYEETVDEDAHHVNEKQHSAHFLDAIAEKDILEEKTKDQDLEIELYKYMGPLKEQSKSTSAASTSDELAGSEGPSTESTSTGNQGEDKTTDNTYKEMEELEEAEGTSNLKKGLEFYKSSLKLDQLDKEKPKKKKSKRKKKRDSSSDRILLEESKTFTSENEL 897 T 11 Phage_TAC_10 pdbhh F Eukaryota T 7kj6 1 A,B A,B Q5ZSR1_LEGPH Ankyrin repeat-containing protein SNALTPPPDSKISTTDKSLDKLSAPLDMLKQMNESTMEQTKLDELRKKMSLQAEILNKAKADNDMFFRLLIELMSLKLQGELFKEQLSKISKESGYDSAQSALIQATNSEGQSPLQYALQKQDFSTAKYFLDNGAKAGPIEKAVFEIALDSKAAKEFGFPPLPPEKEKLHPVKNFGLVLGIKTTSVDGTPSQFGHIAPTYQLMTDSVSHFAKSHPGNKNFQEIANAFQFSNEASAFKFSTPQRNPEAGNDLARRIQGGELTTIPVSCKGHAMGLSYVPDGPGSKSGYLVYTNRGLGAKSSEHGTHIFRIEDSSKITPEFINNMTSGHSNGASHDEIMSQIKAAAGNKEPIHHIKQKGQKNDNCTIANSKSNIEGILLCQKAREVGGFDKLTESDMDSVKKEYKEFTKHMRVEKVNELAKALKENPQDPDLNNLTKEYLKQHPNADPKLKQTLETALKQASESSMTLSQPGKTI 473 T 0.00015 Shigella_OspC pdbhh F Bacteria T 7kjk 1 A,B,C,D,E,F A5,B5,C5,D5,E5,F5 Tail terminator protein MSQSIINVARYIRDLLDYDENLIQFDRKNTQQSDTVTGYIVVNGSGVQNVLSHGSSYDGDAEIMEYSKSESRLITLEFYGSDAYENAELFSLLNQSQKAKEVSRGLGLTIYNVSQATDVKQLLGYQYGNRVHVDFNIQYCPSVYVETLRVDASEFEILVDD 161 T 33 Collectrin pdbhh F T 7kjk 3 AA,BA,CA,DA,Y,Z C4,D4,E4,F4,A4,B4 Head completion protein MLPNMRSALKMFEQSVLLKSVETIRVDFVDDIIITATPIRAVVQVADKKKLNLDSLDWSKQYIWVHSGSKMEIGQFIEWHGKDFKLVAAGDDYSDYGYNAWYGEETLKPVLVSS 114 T 0.014 Hepatitis_core pdbpssm F T 7kjm 2 B,D B,D D-PMI-omega XXXXXXXXXXXX 12 F F F 7kjn 2 B,C B,C D-PMI-omega XXXXXXXXXXXX 12 F F F 7kkm 1 A,B,C,D A,B,C,D TNKS1_HUMAN PARP GSQGTILLDLAPEDKEYQSVEEEMQSTIREHRDGGNAGGIFNRYNVIRIQKVVNKKLRERFCHRQKEVSEENHNHHNERMLFHGSPFINAIIHKGFDERHAYIGGMFGAGIYFAENSSKSNQYVYGIGGGTGCPTHKDRSCYICHRQMLFCRVTLGKSFLQFSTMKMAHAPPGHHSVIGRPSVNGLAYAEYVIYRGEQAYPEYLITYQIMKPE 213 T 0.011 PARP pdb F Eukaryota T 7kkn 1 A,B,C,D A,B,C,D TNKS1_HUMAN PARP GSQGTILLDLAPEDKEYQSVEEEMQSTIREHRDGGNAGGIFNRYNVIRIQKVVNKKLRERFCHRQKEVSEENHNHHNERMLFHGSPFINAIIHKGFDERHAYIGGMFGAGIYFAENSSKSNQYVYGIGGGTGCPTHKDRSCYICHRQMLFCRVTLGKSFLQFSTMKMAHAPPGHHSVIGRPSVNGLAYAEYVIYRGEQAYPEYLITYQIMKPE 213 T 0.011 PARP pdb F Eukaryota T 7kko 1 A,B,C A,B,C TNKS1_HUMAN PARP GSQGTILLDLAPEDKEYQSVEEEMQSTIREHRDGGNAGGIFNRYNVIRIQKVVNKKLRERFCHRQKEVSEENHNHHNERMLFHGSPFINAIIHKGFDERHAYIGGMFGAGIYFAENSSKSNQYVYGIGGGTGCPTHKDRSCYICHRQMLFCRVTLGKSFLQFSTMKMAHAPPGHHSVIGRPSVNGLAYAEYVIYRGEQAYPEYLITYQIMKPE 213 T 0.011 PARP pdb F Eukaryota T 7kkp 1 A,B A,B TNKS1_HUMAN PARP GSQGTILLDLAPEDKEYQSVEEEMQSTIREHRDGGNAGGIFNRYNVIRIQKVVNKKLRERFCHRQKEVSEENHNHHNERMLFHGSPFINAIIHKGFDERHAYIGGMFGAGIYFAENSSKSNQYVYGIGGGTGCPTHKDRSCYICHRQMLFCRVTLGKSFLQFSTMKMAHAPPGHHSVIGRPSVNGLAYAEYVIYRGEQAYPEYLITYQIMKPE 213 T 0.011 PARP pdb F Eukaryota T 7kkq 1 A,B,C,D A,B,C,D TNKS1_HUMAN PARP GSQGTILLDLAPEDKEYQSVEEEMQSTIREHRDGGNAGGIFNRYNVIRIQKVVNKKLRERFCHRQKEVSEENHNHHNERMLFHGSPFINAIIHKGFDERHAYIGGMFGAGIYFAENSSKSNQYVYGIGGGTGCPTHKDRSCYICHRQMLFCRVTLGKSFLQFSTMKMAHAPPGHHSVIGRPSVNGLAYAEYVIYRGEQAYPEYLITYQIMKPE 213 T 0.011 PARP pdb F Eukaryota T 7kkv 1 A A Q9KGD7_BACHD OAPB GPSPEIGQIVKIVKGRDRDQFSVIIKRVDDRFVYIADGDKRKVDRAKRKNMNHLKLIDHISPEVRHSFEETGKVTNGKLRFALKKFLEEHADLLKEGE 98 T 0.023 FERM_F2 pdbpssm F Bacteria T 7kl5 2 B B RYR2_HUMAN RYR-2,RYR2,HRYR-2,CARDIAC MUSCLE RYANODINE RECEPTOR,CARDIAC MUSCLE RYANODINE RECEPTOR-CALCIUM RELEASE CHANNEL,TYPE 2 RYANODINE RECEPTOR FALRYNILTLMRMLSLKSLKKQMKKVKKMT 30 T 5.8 SRP_SPB pdbhh F Eukaryota T 7klc 5 E A Q2N0S5_9HIV1 HIV-1 clade A BG505 gp120,HIV-1 clade A BG505 gp120 VWKDAETTLFCASDAKAYETEKHNVWATHACVPTDPNPQEIHLENVTEEFNMWKNNMVEQMHTDIISLWDQSLKPCVKLTPLCVGAGNCNTSAITQACPKVSFEPIPIHYCAPAGFAILKCKDKKFNGTGPCPSVSTVQCTHGIKPVVSTQLLLNGSLAEEEVMIRSENITNNAKNILVQFNTPVQINCTRPNNNTRKSIRIGPGQAFYATGDIIGDIRQAHCNVSKATWNETLGKVVKQLRKHFGNNTIIRFANSSGGDLEVTTHSFNCGGEFFYCNTSGLFNSTWISNTSVQGSNSTGSNDSITLPCRIKQIINMWQRIGQAMYAPPIQGVIRCVSNITGLILTRDGGSTNSTTETFRPGGGDMRDNWRSELYKYKVVKIE 383 T 1.5999999999999998E-49 GP120 unp T Viruses T 7kld 2 C,E,F C,E,Q LYS-LEU-ASN-LEU-GLN-PHE-PCS KLNLQFX 7 T 2.2 Aquarius_N pdbhh F T 7klz 2 C,D C,D GEMI_HUMAN Geminin peptide AEGTVSSSTDALPCI 15 T 23 BCAS2 pdbhh F Eukaryota T 7kme 4 D J SEL2711 XXXXLPX 7 T 240 LRR_12 pdbhh F F 7kmx 1 A,B,C,D,E,F,G a,b,c,d,e,f,g A0A7D7FKF5_9CAUD Minor capsid protein MAFNNAVLQEVSDLPAGEVIKASPHNVSAFEVFQNGLIEGRFVKFDAGSIDILDASATPTIAGIAKRKVTGEIGPGVYSTSGIEIDQVAEVINFGFATVTVQDAAAPSKYDPVYAINLDSAEAGKATENSGATGALAVADCVFWEQKAANVWLVRMNKFL 160 T 13 DUF2292 pdbhh T Viruses T 7knf 2 C,D C,D DTY-ASP-TYR-PRO-GLY-ASP-HIS-CYS-TYR-LEU-TYR-GLY-THR XXDYPGDHCYLYGTX 15 T 7.9 Gln_deamidase_2 pdbhh F T 7kng 2 C,D C,D DTY-ASP-TYR-PRO-GLY-ASP-PHE-CYS-TYR-LEU-TYR-GLY-THR-CYS XXDYPGDFCYLYGTCX 16 T 0.3 DUF5714 pdbhh F T 7kpk 2 B B PDX1_HUMAN Pdx1 peptide LSASPQPSSVAPRRPQEPR 19 T 25 Pim pdbhh F Eukaryota T 7kpo 1 A A Q9KUA1_VIBCH Response regulator GSSKQDLMRAVLVEAMTSALNYWERVSGQSKFTFAEQSGLWRVYLDRSTLQTRTLDKYLRIETLPKTPRWRTVLNSLDYILEHCKEAGPERTHIEMQRDKLQKLLTSE 108 T 0.079 DUF3024 pdb F Bacteria T 7kpq 1 A A FAD-dependent monooxygenase CtdE MTKTPEAPVPRTMEKDHTQQINVIIVGLGIAGLTAAIECHRKGHKVIAFEKTPKMMHIGDIFSIGPNAESVIRQWKDGAISRALNEARCAIDEIKVFDETGKLQNVNTMEGYREGEGYVINRAEAVDIFFEYAQSLGIDIRFNSNVTEYWETPHNAGIIVDGLKIEADCVIATDGIHSKARNAICGAVVQPKKTGSAIYRSGYAMEELRGHSGAVWLTEGKEDVDQLYHFIGKDITVLVGTGRRGKDVYWGCMHKSLHDVSESWIQVSDVRRAIELISDWNVRDRLEPIMACTPQGKCFDHLVMTMDQLPSWVSPKHRMIVLGDAAHPFLPNTGQGANQAIEDGATVAICLELAGKNQVTKGVQVAERLRYQRVAKIQELGHRMLKTLQNADWDGEKDEDAPTMITRPAWIYSHDCQQYAYNEFQTVAQLVSERRDFHHHHHH 443 T 3.6E-12 FAD_binding_3 pdbpercent F T 7kpr 1 A,B A,B PPM1H_HUMAN Protein phosphatase 1H GSHMSDLPLRFPYGRPEFLGLSQDEVECSADHIARPILILKETRRLPWATGYAEVINAGKSTHNEDQASCEVLTVKKKAGAVTSTPNRNSSKRRSSLPNGEGLQLKENSESEGVSCHYWSLFDGHAGSGAAVVASRLLQHHITEQLQDIVDILKNSAVLPPTCLGEEPENTPANSRTLTRAASLRGGVGAPGSPSTPPTRFFTEKKIPHECLVIGALESAFKEMDLQIERERSSYNISGGCTALIVICLLGKLYVANAGASRAIIIRNGEIIPMSSEFTPETERQRLQYLAFMQPHLLGNEFTHLEFPRRVQRKELGKKMLYRDFNMTGWAYKTIEDEDLKFPLIYGEGKKARVMATIGVTRGLGDHDLKVHDSNIYIKPFLSSAPEVRIYDLSKYDHGSDDVLILATDGLWDVLSNEEVAEAITQFLPNCDPDDPHRYTLAAQDLVMRARGVLKDRGWRISNDRLGSGDDISVYVIPLIHGNKLS 486 T 8.5E-20 PP2C pdbpercent F Eukaryota T 7kpt 1 A A FAD-dependent monooxygenase CtdE MTKTPEAPVPRTMEKDHTQQINVIIVGLGIAGLTAAIECHRKGHKVIAFEKTPKMMHIGDIFSIGPNAESVIRQWKDGAISRALNEARCAIDEIKVFDETGKLQNVNTMEGYREGEGYVINRAEAVDIFFEYAQSLGIDIRFNSNVTEYWETPHNAGIIVDGLKIEADCVIATDGIHSKARNAICGAVVQPKKTGSAIYRSGYAMEELRGHSGAVWLTEGKEDVDQLYHFIGKDITVLVGTGRRGKDVYWGCMHKSLHDVSESWIQVSDVRRAIELISDWNVRDRLEPIMACTPQGKCFDHLVMTMDQLPSWVSPKHRMIVLGDAAHPFLPNTGQGANQAIEDGATVAICLELAGKNQVTKGVQVAERLRYQRVAKIQELGHRMLKTLQNADWDGEKDEDAPTMITRPAWIYSHDCQQYAYNEFQTVAQLVSERRDFHHHHHH 443 T 3.6E-12 FAD_binding_3 pdbpercent F T 7kpu 2 B,D E,B bisubstrate analogue (CMC-ACE-SER-GLY-ARG-GLY-LYS) SGRGK 5 T 22 Rad17 pdbhh F F 7kq0 2 B,D,F B,D,F CDN1A_HUMAN LYS-ARG-ARG-GLN-THR-SER-MET-THR-ASP-TYR-TYR-HIS-SER-LYS-ARG KRRQTSMTDYYHSKR 15 T 1.7 CDC27 pdbhh F Eukaryota T 7kq1 2 B,D,F B,D,F CDN1A_HUMAN LYS-ARG-ARG-GLN-THR-SER-MET-THR-ASP-PHE-TYR-HIS-SER-LYS-ARG KRRQTSMTDFYHSKR 15 T 0.37 CDC27 pdbhh F Eukaryota T 7kqk 3 C,F C,P TAU_HUMAN pTau peptide KKVAVVRTPP 10 T 2 Sulfotransfer_2 pdbhh F Eukaryota T 7kqr 1 A,B A,B Heme-dependent L-tyrosine hydroxylase GHMNTGTGTVLTELPDHGRWDFGDFPYGLEPLTLPEPGSLEAADSGSVPAEFTLTCRHIAAIAAGGGPAERVQPADSSDRLYWFRWITGHQVTFILWQLLSRELARLPEEGPERDAALKAMTRYVRGYCAMLLYTGSMPRTVYGDVIRPSMFLQHPGFSGTWAPDHKPVQALFRGKKLPCVRDSADLAQAVHVYQVIHAGIAARMVPSGRSLLQEASVPSGVQHPDVLGVVYDNYFLTLRSRPSSRDVVAQLLRRLTAIALDVKDNALYPDGREAGSELPEELTRPEVTGHERDFLAILSEVAEEATGSPALASDR 316 T 3.1 Hs1pro-1_C pdbhh F T 7kqs 1 A,B A,B Heme-dependent L-tyrosine hydroxylase GHMNTGTGTVLTELPDHGRWDFGDFPYGLEPLTLPEPGSLEAADSGSVPAEFTLTCRHIAAIAAGGGPAERVQPADSSDRLYWFRWITGHQVTFILWQLLSRELARLPEEGPERDAALKAMTRYVRGYCAMLLYTGSMPRTVYGDVIRPSMFLQHPGFSGTWAPDHKPVQALFRGKKLPCVRDSADLAQAVHVYQVIHAGIAARMVPSGRSLLQEASVPSGVQHPDVLGVVYDNYFLTLRSRPSSRDVVAQLLRRLTAIALDVKDNALYPDGREAGSELPEELTRPEVTGHERDFLAILSEVAEEATGSPALASDR 316 T 3.1 Hs1pro-1_C pdbhh F T 7kqt 1 A,B A,B Heme-dependent L-tyrosine hydroxylase GHMNTGTGTVLTELPDHGRWDFGDFPYGLEPLTLPEPGSLEAADSGSVPAEFTLTCRHIAAIAAGGGPAERVQPADSSDRLYWFRWITGHQVTFILWQLLSRELARLPEEGPERDAALKAMTRYVRGYCAMLLYTGSMPRTVYGDVIRPSMFLQHPGFSGTWAPDHKPVQALFRGKKLPCVRDSADLAQAVHVYQVIHAGIAARMVPSGRSLLQEASVPSGVQHPDVLGVVYDNYFLTLRSRPSSRDVVAQLLRRLTAIALDVKDNALYPDGREAGSELPEELTRPEVTGHERDFLAILSEVAEEATGSPALASDR 316 T 3.1 Hs1pro-1_C pdbhh F T 7kqu 1 A,B A,B Heme-dependent L-tyrosine hydroxylase GHMNTGTGTVLTELPDHGRWDFGDFPYGLEPLTLPEPGSLEAADSGSVPAEFTLTCRHIAAIAAGGGPAERVQPADSSDRLYWFRWITGHQVTFILWQLLSRELARLPEEGPERDAALKAMTRYVRGYCAMLLYTGSMPRTVYGDVIRPSMFLQHPGFSGTWAPDHKPVQALFRGKKLPCVRDSADLAQAVHVYQVIHAGIAARMVPSGRSLLQEASVPSGVQHPDVLGVVYDNYFLTLRSRPSSRDVVAQLLRRLTAIALDVKDNALYPDGREAGSELPEELTRPEVTGHERDFLAILSEVAEEATGSPALASDR 316 T 3.1 Hs1pro-1_C pdbhh F T 7kra 8 H H EMC10_YEAST Endoplasmic reticulum membrane protein complex subunit 10 MLVRLLRVILLASMVFCADILQLSYSDDAKDAIPLGTFEIDSTSDGNVTVTTVNIQDVEVSGEYCLNAQIEGKLDMPCFSYMKLRTPLKYDLIVDVDEDNEVKQVSLSYDETNDAITATVRYPEAGPTAPVTKLKKKTKTYADKKASKNKDGSTAQFEEDEEVKEVSWFQKNWKMLLLGLLIYNFVAGSAKKQQQGGAGADQKTE 205 T 0.039 PFU pdb F Eukaryota T 7kra 11 K,L M,N Unassigned helix XXXXXXXXXXXXXXXXXXXXXXXX 24 F F F 7krz 2 G G Endogenous co-purified substrate XXXXXXXXXXXX 12 F F F 7ksm 2 G G Unidentified endogenous substrate XXXXXXXXXXXX 12 F F F 7ksq 18 R O A0A2K1JDE1_PHYPA PsaO NRDWLRRDLSVIGFGLIGWLAPSSLPVINGNSLTGLFLGSIGPELAHFPTGPALTSPFWLWMVTWHVGLFIVLTFGQIGFKGRQDGYW 88 T 0.1 Plasmid_RAQPRD unppercent F Eukaryota T 7ktr 3 C C SP20H_HUMAN P38-INTERACTING PROTEIN,P38IP, SUPT20H MQQALELALDRAEYVIESARQRPPKRKYLSSGRKSVFQKLYDLYIEECEKEPEVKKLRRNVNLLEKLVMQETLSCLVVNLYPGNEGYSLMLRGKNGSDSETIRLPYEEGELLEYLDAEELPPILVDLLEKSQVNIFHCGCVIAEIRDYRQSSNMKSPGYQSRHILLRPTMQTLICDVHSITSDNHKWTQEDKLLLESQLILATAEPLCLDPSIAVTCTANRLLYNKQKMNTRPMKRCFKRYSRSSLNRQQDLSHCPPPPQLRLLDFLQKRKERKAGQHYDLKISKAGNCVDMWKRSPCNLAIPSEVDVEKYAKVEKSIKSDDSQPTVWPAHDVKDDYVFECEAGTQYQKTKLTILQSLGDPLYYGKIQPCKADEESDSQMSPSHSSTDDHSNWFIIGSKTDAERVVNQYQELVQNEAKCPVKMSHSSSGSASLSQVSPGKETDQTETVSVQSSVLGKGVKHRPPPIKLPSSSGNSSSGNYFTPQQTSSFLKSPTPPPSSKPSSIPRKSSVDLNQVSMLSPAALSPASSSQRSGTPKPSTPTPTPSSTPHPPDAQSSTPSTPSATPTPQDSGFTPQPTLLTQFAQQQRSLSQAMPVTTIPLSTMVTSITPGTTATQVMANSAGLNFINVVGSVCGAQALMSGSNPMLGCNTGAITPAGINLSGLLPSGGLLPNALPSAMQAASQAGVPFGLKNTSSLRPLNLLQLPGGSLIFNTLQQQQQQLSQFTPQQPQQPTTCSPQQPGEQGSEQGSTSQEQALSAQQAAVINLTGVGSFMQSQAAAVAILAASNGYGSSSSTNSSATSSSAYRQPVKK 811 T 6.5E-20 Spt20 unp F Eukaryota T 7ktr 10 J J TADA1_HUMAN SPT3-ASSOCIATED FACTOR 42,STAF42,TRANSCRIPTIONAL ADAPTER 1-LIKE PROTEIN MATFVSELEAAKKNLSEALGDNVKQYWANLKLWFKQKISKEEFDLEAHRLLTQDNVHSHNDFLLAILTRCQILVSTPDGAGSLPWPGGSAAKPGKPKGKKKLSSVRQKFDHRFQPQNPLSGAQQFVAKDPQDDDDLKLCSHTMMLPTRGQLEGRMIVTAYEHGLDNVTEEAVSAVVYAVENHLKDILTSVVSRRKAYRLRDGHFKYAFGSNVTPQPYLKNSVVAYNNLIESPPAFTAPCAGQNPASHPPPDDAEQQAALLLACSGDTLPASLPPVNMYDLFEALQVHREVIPTHTVYALNIERIITKLWHPNHEELQQDKVHRQRLAAKEGLLLC 335 T 4.4E-17 SAGA-Tad1 pdbpercent F Eukaryota T 7kts 3 C C SP20H_HUMAN P38-INTERACTING PROTEIN,P38IP MQQALELALDRAEYVIESARQRPPKRKYLSSGRKSVFQKLYDLYIEECEKEPEVKKLRRNVNLLEKLVMQETLSCLVVNLYPGNEGYSLMLRGKNGSDSETIRLPYEEGELLEYLDAEELPPILVDLLEKSQVNIFHCGCVIAEIRDYRQSSNMKSPGYQSRHILLRPTMQTLICDVHSITSDNHKWTQEDKLLLESQLILATAEPLCLDPSIAVTCTANRLLYNKQKMNTRPMKRCFKRYSRSSLNRQQDLSHCPPPPQLRLLDFLQKRKERKAGQHYDLKISKAGNCVDMWKRSPCNLAIPSEVDVEKYAKVEKSIKSDDSQPTVWPAHDVKDDYVFECEAGTQYQKTKLTILQSLGDPLYYGKIQPCKADEESDSQMSPSHSSTDDHSNWFIIGSKTDAERVVNQYQELVQNEAKCPVKMSHSSSGSASLSQVSPGKETDQTETVSVQSSVLGKGVKHRPPPIKLPSSSGNSSSGNYFTPQQTSSFLKSPTPPPSSKPSSIPRKSSVDLNQVSMLSPAALSPASSSQRSGTPKPSTPTPTPSSTPHPPDAQSSTPSTPSATPTPQDSGFTPQPTLLTQFAQQQRSLSQAMPVTTIPLSTMVTSITPGTTATQVMANSAGLNFINVVGSVCGAQALMSGSNPMLGCNTGAITPAGINLSGLLPSGGLLPNALPSAMQAASQAGVPFGLKNTSSLRPLNLLQLPGGSLIFNTLQQQQQQLSQFTPQQPQQPTTCSPQQPGEQGSEQGSTSQEQALSAQQAAVINLTGVGSFMQSQAAAVAILAASNGYGSSSSTNSSATSSSAYRQPVKK 811 T 6.5E-20 Spt20 unp F Eukaryota T 7kts 10 J J TADA1_HUMAN SPT3-ASSOCIATED FACTOR 42,STAF42,TRANSCRIPTIONAL ADAPTER 1-LIKE PROTEIN MATFVSELEAAKKNLSEALGDNVKQYWANLKLWFKQKISKEEFDLEAHRLLTQDNVHSHNDFLLAILTRCQILVSTPDGAGSLPWPGGSAAKPGKPKGKKKLSSVRQKFDHRFQPQNPLSGAQQFVAKDPQDDDDLKLCSHTMMLPTRGQLEGRMIVTAYEHGLDNVTEEAVSAVVYAVENHLKDILTSVVSRRKAYRLRDGHFKYAFGSNVTPQPYLKNSVVAYNNLIESPPAFTAPCAGQNPASHPPPDDAEQQAALLLACSGDTLPASLPPVNMYDLFEALQVHREVIPTHTVYALNIERIITKLWHPNHEELQQDKVHRQRLAAKEGLLLC 335 T 4.4E-17 SAGA-Tad1 pdbpercent F Eukaryota T 7ktt 1 A A VINC_HUMAN VINCULIN, MV MPVFHTRTIESILEPVAQQISHLVIMHEEGEVDGKAIPDLTAPVAAVQAAVSNLVRVGKETVQTTEDQILKRDMPPAFIKVENACTKLVQAAQMLQSDPYSVPARDYLIDGSRGILSGTSDLLLTFDEAEVRKIIRVCKGILEYLTVAEVVETMEDLVTYTKNLGPGMTKMAKMIDERQQELTHQEHRVMLVNSMNTVKELLPVLISAMKIFVTTKNSKNQGIEEALKNRNFTVEKMSAEINEIIRVLQLTSWDEDAWASKDTEAMKRALASIDSKLNQAKGWLRDPSASPGDAGEQAIRQILDEAGKVGELCAGKERREILGTCKMLGQMTDQVADLRARGQGSSPVAMQKAQQVSQGLDVLTAKVENAARKLEAMTNSKQSIAKKIDAAQNWLADPNGGPEGEEQIRGALAEARKIAELCDDPKERDDILRSLGEISALTSKLADLRRQGKGDSPEARALAKQVATALQNLQTKTNRAVANSRPAKAAVHLEGKIEQAQRWIDNPTVDDRGVGQAAIRGLVAEGHRLANVMMGPYRQDLLAKCDRVDQLTAQLADLAARGEGESPQARALASQLQDSLKDLKARMQEAMTQEVSDVFSDTTTPIKLLAVAATAPPDAPNREEVFDERAANFENHSGKLGATAEKAAAVGTANKSTVEGIQASVKTARELTPQVVSAARILLRNPGNQAAYEHFETMKNQWIDNVEKMTGLVDEAIDTKSLLDASEEAIKKDLDKCKVAMANIQPQMLVAGATSIARRANRILLVAKREVENSEDPKFREAVKAASDELSKTISPMVMDAKAVAGNISDPGLQKSFLDSGYRILGAVAKVREAFQPQEPDFPPPPPDLEQLRLTDELAPPKPPLPEGEVPPPRPPPPEEKDEEFPEQKAGEVINQPMMMAARQLHDEARKWSSKPGIPAAEVGIGVVAEADAADAAGFPVPPDMEDDYEPELLLMPSNQPVNQPILAAAQSLHREATKWSSKGNDIIAAAKRMALLMAEMSRLVRGGSGTKRALIQCAKDIAKASDEVTRLAKEVAKQCTDKRIRTNLLQVCERIPTISTQLKILSTVKATMLGRTNISDEESEQATEMLVHNAQNLMQSVKETVREAEAASIKIRTDAGFTLRWVRKTPWYQHHHHHHHH 1142 T 1.9E-200 Vinculin pdb F Eukaryota T 7ktu 1 A A VINC_HUMAN VINCULIN, MV MPVFHTRTIESILEPVAQQISHLVIMHEEGEVDGKAIPDLTAPVAAVQAAVSNLVRVGKETVQTTEDQILKRDMPPAFIKVENACTKLVQAAQMLQSDPYSVPARDYLIDGSRGILSGTSDLLLTFDEAEVRKIIRVCKGILEYLTVAEVVETMEDLVTYTKNLGPGMTKMAKMIDERQQELTHQEHRVMLVNSMNTVKELLPVLISAMKIFVTTKNSKNQGIEEALKNRNFTVEKMSAEINEIIRVLQLTSWDEDAWASKDTEAMKRALASIDSKLNQAKGWLRDPSASPGDAGEQAIRQILDEAGKVGELCAGKERREILGTCKMLGQMTDQVADLRARGQGSSPVAMQKAQQVSQGLDVLTAKVENAARKLEAMTNSKQSIAKKIDAAQNWLADPNGGPEGEEQIRGALAEARKIAELCDDPKERDDILRSLGEISALTSKLADLRRQGKGDSPEARALAKQVATALQNLQTKTNRAVANSRPAKAAVHLEGKIEQAQRWIDNPTVDDRGVGQAAIRGLVAEGHRLANVMMGPYRQDLLAKCDRVDQLTAQLADLAARGEGESPQARALASQLQDSLKDLKARMQEAMTQEVSDVFSDTTTPIKLLAVAATAPPDAPNREEVFDERAANFENHSGKLGATAEKAAAVGTANKSTVEGIQASVKTARELTPQVVSAARILLRNPGNQAAYEHFETMKNQWIDNVEKMTGLVDEAIDTKSLLDASEEAIKKDLDKCKVAMANIQPQMLVAGATSIARRANRILLVAKREVENSEDPKFREAVKAASDELSKTISPMVMDAKAVAGNISDPGLQKSFLDSGYRILGAVAKVREAFQPQEPDFPPPPPDLEQLRLTDELAPPKPPLPEGEVPPPRPPPPEEKDEEFPEQKAGEVINQPMMMAARQLHDEARKWSSKPGIPAAEVGIGVVAEADAADAAGFPVPPDMEDDYEPELLLMPSNQPVNQPILAAAQSLHREATKWSSKGNDIIAAAKRMALLMAEMSRLVRGGSGTKRALIQCAKDIAKASDEVTRLAKEVAKQCTDKRIRTNLLQVCERIPTISTQLKILSTVKATMLGRTNISDEESEQATEMLVHNAQNLMQSVKETVREAEAASIKIRTDAGFTLRWVRKTPWYQHHHHHHHH 1142 T 1.9E-200 Vinculin pdb F Eukaryota T 7ktv 1 A A VINC_HUMAN VINCULIN, MV MPVFHTRTIESILEPVAQQISHLVIMHEEGEVDGKAIPDLTAPVAAVQAAVSNLVRVGKETVQTTEDQILKRDMPPAFIKVENACTKLVQAAQMLQSDPYSVPARDYLIDGSRGILSGTSDLLLTFDEAEVRKIIRVCKGILEYLTVAEVVETMEDLVTYTKNLGPGMTKMAKMIDERQQELTHQEHRVMLVNSMNTVKELLPVLISAMKIFVTTKNSKNQGIEEALKNRNFTVEKMSAEINEIIRVLQLTSWDEDAWASKDTEAMKRALASIDSKLNQAKGWLRDPSASPGDAGEQAIRQILDEAGKVGELCAGKERREILGTCKMLGQMTDQVADLRARGQGSSPVAMQKAQQVSQGLDVLTAKVENAARKLEAMTNSKQSIAKKIDAAQNWLADPNGGPEGEEQIRGALAEARKIAELCDDPKERDDILRSLGEISALTSKLADLRRQGKGDSPEARALAKQVATALQNLQTKTNRAVANSRPAKAAVHLEGKIEQAQRWIDNPTVDDRGVGQAAIRGLVAEGHRLANVMMGPYRQDLLAKCDRVDQLTAQLADLAARGEGESPQARALASQLQDSLKDLKARMQEAMTQEVSDVFSDTTTPIKLLAVAATAPPDAPNREEVFDERAANFENHSGKLGATAEKAAAVGTANKSTVEGIQASVKTARELTPQVVSAARILLRNPGNQAAYEHFETMKNQWIDNVEKMTGLVDEAIDTKSLLDASEEAIKKDLDKCKVAMANIQPQMLVAGATSIARRANRILLVAKREVENSEDPKFREAVKAASDELSKTISPMVMDAKAVAGNISDPGLQKSFLDSGYRILGAVAKVREAFQPQEPDFPPPPPDLEQLRLTDELAPPKPPLPEGEVPPPRPPPPEEKDEEFPEQKAGEVINQPMMMAARQLHDEARKWSSKPGIPAAEVGIGVVAEADAADAAGFPVPPDMEDDYEPELLLMPSNQPVNQPILAAAQSLHREATKWSSKGNDIIAAAKRMALLMAEMSRLVRGGSGTKRALIQCAKDIAKASDEVTRLAKEVAKQCTDKRIRTNLLQVCERIPTISTQLKILSTVKATMLGRTNISDEESEQATEMLVHNAQNLMQSVKETVREAEAASIKIRTDAGFTLRWVRKTPWYQHHHHHHHH 1142 T 1.9E-200 Vinculin pdb F Eukaryota T 7ktw 1 A A VINC_HUMAN VINCULIN, MV MPVFHTRTIESILEPVAQQISHLVIMHEEGEVDGKAIPDLTAPVAAVQAAVSNLVRVGKETVQTTEDQILKRDMPPAFIKVENACTKLVQAAQMLQSDPYSVPARDYLIDGSRGILSGTSDLLLTFDEAEVRKIIRVCKGILEYLTVAEVVETMEDLVTYTKNLGPGMTKMAKMIDERQQELTHQEHRVMLVNSMNTVKELLPVLISAMKIFVTTKNSKNQGIEEALKNRNFTVEKMSAEINEIIRVLQLTSWDEDAWASKDTEAMKRALASIDSKLNQAKGWLRDPSASPGDAGEQAIRQILDEAGKVGELCAGKERREILGTCKMLGQMTDQVADLRARGQGSSPVAMQKAQQVSQGLDVLTAKVENAARKLEAMTNSKQSIAKKIDAAQNWLADPNGGPEGEEQIRGALAEARKIAELCDDPKERDDILRSLGEISALTSKLADLRRQGKGDSPEARALAKQVATALQNLQTKTNRAVANSRPAKAAVHLEGKIEQAQRWIDNPTVDDRGVGQAAIRGLVAEGHRLANVMMGPYRQDLLAKCDRVDQLTAQLADLAARGEGESPQARALASQLQDSLKDLKARMQEAMTQEVSDVFSDTTTPIKLLAVAATAPPDAPNREEVFDERAANFENHSGKLGATAEKAAAVGTANKSTVEGIQASVKTARELTPQVVSAARILLRNPGNQAAYEHFETMKNQWIDNVEKMTGLVDEAIDTKSLLDASEEAIKKDLDKCKVAMANIQPQMLVAGATSIARRANRILLVAKREVENSEDPKFREAVKAASDELSKTISPMVMDAKAVAGNISDPGLQKSFLDSGYRILGAVAKVREAFQPQEPDFPPPPPDLEQLRLTDELAPPKPPLPEGEVPPPRPPPPEEKDEEFPEQKAGEVINQPMMMAARQLHDEARKWSSKPGIPAAEVGIGVVAEADAADAAGFPVPPDMEDDYEPELLLMPSNQPVNQPILAAAQSLHREATKWSSKGNDIIAAAKRMALLMAEMSRLVRGGSGTKRALIQCAKDIAKASDEVTRLAKEVAKQCTDKRIRTNLLQVCERIPTISTQLKILSTVKATMLGRTNISDEESEQATEMLVHNAQNLMQSVKETVREAEAASIKIRTDAGFTLRWVRKTPWYQHHHHHHHH 1142 T 1.9E-200 Vinculin pdb F Eukaryota T 7ktx 8 H H EMC10_YEAST Endoplasmic reticulum membrane protein complex subunit 10 MLVRLLRVILLASMVFCADILQLSYSDDAKDAIPLGTFEIDSTSDGNVTVTTVNIQDVEVSGEYCLNAQIEGKLDMPCFSYMKLRTPLKYDLIVDVDEDNEVKQVSLSYDETNDAITATVRYPEAGPTAPVTKLKKKTKTYADKKASKNKDGSTAQFEEDEEVKEVSWFQKNWKMLLLGLLIYNFVAGSAKKQQQGGAGADQKTE 205 T 0.039 PFU pdb F Eukaryota T 7ktx 11 K,L M,N Unassigned helix XXXXXXXXXXXXXXXXXXXXXXXX 24 F F F 7ku5 1 A O A0A2K1JDE1_PHYPA PsaO NRDWLRRDLSVIGFGLIGWLAPSSLPVINGNSLTGLFLGSIGPELAHFPTGPALTSPFWLWMVTWHVGLFIVLTFGQIGFKGRQDGYW 88 T 0.1 Plasmid_RAQPRD unppercent F Eukaryota T 7kuw 1 A A Sequence-Based Designed Protein nmt_0994_guided_02 DEREIARKVASELQKFSEWVKKLKEVIKKASPEQQTKIAQWVAKLAGVRPEDVKKIIKAFND 62 T 0.004 FliG_N pdb F T 7kw6 1 A A Q65JI8_BACLD PROCESSIVE CELLULASE FROM GLYCOSIDE HYDROLASE FAMILY 48 MDNKTRFMQLYEQIKNPNNGYFSPEGIPYHSVETLICEAPDYGHMTTSEAYSYWLWLEAMYGRYTQDWSKLEAAWDNMEKYIIPVNEGDNNEEQPTMNYYNPSSPATYAAEHPYPDLYPSALTGQYPAGNDPLDAELKATYGSNETYLMHWLLDVDNWYGFGNLLNPSHTAVYVNTYQRGEQESVWETVPHPSQDNQTFGKPNEGFMSLFTKENQAPAPQWRYTNATDADARAVQAMFWARQWGYSNTNYLEKAKKMGDFLRYGMYDKYFQEIGSAADGSPSRGAGKNACHYLMAWYTAWGGGLGQYANWAWRIGASHVHQGYQNPVASYALSTAEGGLIPNSSTARSDWEKALKRQLELYTWLLSSEGAVAGGATNSWNGNYSAYPQNVSTFYEMAYTEAPVYHDPPSNNWFGMQVWPLERVAELYYIFAEKGDKSSESFHMAKHVIEKWIAYSLDYVFVGERPVTDEEGYYLNDAGERVLGGQNPQIAVQSDPGEFWIPANLEWSGQPDPWKGFDSFTGNPGLHVTTKNPSQDVGVLGSYIKTLVFFAAGTKAETGGFTALGNKAKNLAKELLDAAWSKNDGIGIAAEEEHEDYIRYFTKEIYFPNGWSGRNGQGNTIPGPNTVPSDPAKGGNGVYISHAELRPKIKNDPMWPYLENKYQTSWNPNTGKWENGLPTFVYHRFWSQVDMATAYAEYDRLIGNA 704 T 7.1E-66 Glyco_hydro_48 pdbpercent F Bacteria T 7kwt 1 A,B A,B Q7Y3F3_9CAUD PlyCB SKINVNVENVSGVQGFLFHTDGKESYGHRAFINGVEIGIKDIETVQGFQQIIPSINISKSDVEAIRKAMKK 71 T 0.087 DUF3201 pdb T Viruses T 7kww 1 A,B A,B Q7Y3F3_9CAUD PlyCB MSKINVNVENVSGVQGFLFHTDGKESYGYRAFINGVEIGIKDIETVQGFQQIIPSINISHSDVEAIRKAMKK 72 T 2.6 DUF3213 pdbhh T Viruses T 7kwy 1 A,B A,B Q7Y3F3_9CAUD PlyCB SKINVNVENVSGVQGFLFHTDGKESYGYRAFINGVEIGIKDIETVQGFQQIIPSINISKSDVEAIKKAMKK 71 T 2.5 DUF3213 pdbhh T Viruses T 7kwz 1 A,B,C,D,E A,B,C,D,E TADBP_HUMAN TDP-43 NRQLERSGRFGGNPGGFGNQGGFGNSRGGGAGLGNNQGSNMGGGMNFGAFSINPAMMAAAQAALQSSWGMMGMLASQQNQSGPSGNNQNQGNMQREPNQAFGSGNNSYSGSNSGAAIGWGSASNAGSGSGFNGGFGSSMDSKSSGWGM 148 T 0.043 Glucosaminidase pdbpssm F Eukaryota T 7kzl 2 B B XDJ-XDD-XDY-XDJ-XDY-XDJ-XDD-XDY-XDD-XDV XXXXXXXXXX 10 F F F 7kzm 16 EA,GA X,X1 A8IPZ5_CHLRE Outer dynein arm-docking complex subunit 1 MAQKSTLKLPRLRTKEELLKTSPELCKLLGEDSDDGRSMSPFTAPPPAGTVKPPSRGLPAVSTKATKGPGMDTPRGLGEEELTEEELLRLELEKIKNERQVLLDSIKLVKAQAGTAGGEAQQNDIKALRRELELKKAKLNELHEDVRRKENVLNKQRDDTTDASRLTPGELSEEQAYIQQLQDEMKQIDEELVEAEAKNRLYYLLGERTRREHLAMDMKVRASQQLKKDSADDLYTLTAHFNEMRAAKEQAERELARMKRMLEETRVDWQKKLRERRREVRELKKRQQKQLERERKMREKQLERERQERELQAKLKMEQDSYEMRVAALAPKVEAMEHSWNRIRTISGADTPEEVLAYWEGLKAKEEQMRSLVSLAEQRESSAKSEIAALLENRSGMYEKGSAAAADVGEGSEERATLITEVERNMEGAKGKFNKLRSVCIGAEQGLRSLQERLMIALEEIHPDQLRASHMKGGHDAKARGKGAASAGARRGSAHAHTPDRNKRGPATGSRSQSPALVPHSPAGDKPSSPLHGTSPEHGHEPIPEGAEELAGEAEMVSPLGADGNTIDDEHFFPELPELLTSVTDRLNRVLVLAAELDAQEPAGAGEDGLPLSGEPGADGAEGAAPASPSRGAPEGLSESERTLVKGMNRRTWTGAPLLETINASPSEAALTLNIKRKKGKKKEQQVQPDLNRILGYTGSDVEEEEPESEEETEEEANKDDGVVDRDYIKLRALKMSQRLANQQRAIKV 749 T 0.00034 CALCOCO1 pdbhh F Eukaryota T 7kzm 17 FA X0 DC1 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 162 F F F 7kzm 19 IA Y0 DC2 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 168 F F F 7kzn 14 Q X DC1 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 121 F F F 7kzn 15 R Y DC2 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 168 F F F 7kzo 4 P,R X,X1 A8IPZ5_CHLRE Outer dynein arm-docking complex subunit 1 MAQKSTLKLPRLRTKEELLKTSPELCKLLGEDSDDGRSMSPFTAPPPAGTVKPPSRGLPAVSTKATKGPGMDTPRGLGEEELTEEELLRLELEKIKNERQVLLDSIKLVKAQAGTAGGEAQQNDIKALRRELELKKAKLNELHEDVRRKENVLNKQRDDTTDASRLTPGELSEEQAYIQQLQDEMKQIDEELVEAEAKNRLYYLLGERTRREHLAMDMKVRASQQLKKDSADDLYTLTAHFNEMRAAKEQAERELARMKRMLEETRVDWQKKLRERRREVRELKKRQQKQLERERKMREKQLERERQERELQAKLKMEQDSYEMRVAALAPKVEAMEHSWNRIRTISGADTPEEVLAYWEGLKAKEEQMRSLVSLAEQRESSAKSEIAALLENRSGMYEKGSAAAADVGEGSEERATLITEVERNMEGAKGKFNKLRSVCIGAEQGLRSLQERLMIALEEIHPDQLRASHMKGGHDAKARGKGAASAGARRGSAHAHTPDRNKRGPATGSRSQSPALVPHSPAGDKPSSPLHGTSPEHGHEPIPEGAEELAGEAEMVSPLGADGNTIDDEHFFPELPELLTSVTDRLNRVLVLAAELDAQEPAGAGEDGLPLSGEPGADGAEGAAPASPSRGAPEGLSESERTLVKGMNRRTWTGAPLLETINASPSEAALTLNIKRKKGKKKEQQVQPDLNRILGYTGSDVEEEEPESEEETEEEANKDDGVVDRDYIKLRALKMSQRLANQQRAIKV 749 T 0.00034 CALCOCO1 pdbhh F Eukaryota T 7kzp 2 B,J B,O FANCB_HUMAN PROTEIN FACB,FANCONI ANEMIA-ASSOCIATED POLYPEPTIDE OF 95 KDA,FAAP95 MDYKDDDDKENLYFQGGGRKLGTGSMTSKQAMSSNEQERLLCYNGEVLVFQLSKGNFADKEPTKTPILHVRRMVFDRGTKVFVQKSTGFFTIKEENSHLKIMCCNCVSDFRTGINLPYIVIEKNKKNNVFEYFLLILHSTNKFEMRLSFKLGYEMKDGLRVLNGPLILWRHVKAFFFISSQTGKVVSVSGNFSSIQWAGEIENLGMVLLGLKECCLSEEECTQEPSKSDYAIWNTKFCVYSLESQEVLSDIYIIPPAYSSVVTYVHICATEIIKNQLRISLIALTRKNQLISFQNGTPKNVCQLPFGDPCAVQLMDSGGGNLFFVVSFISNNACAVWKESFQVAAKWEKLSLVLIDDFIGSGTEQVLLLFKDSLNSDCLTSFKITDLGKINYSSEPSDCNEDDLFEDKQENRYLVVPPLETGLKVCFSSFRELRQHLLLKEKIISKSYKALINLVQGKDDNTSSAEEKECLVPLCGEEENSVHILDEKLSDNFQDSEQLVEKIWYRVIDDSLVVGVKTTSSLKLSLNDVTLSLLMDQAHDSRFRLLKCQNRVIKLSTNPFPAPYLMPCEIGLEAKRVTLTPDSKKEESFVCEHPSKKECVQIITAVTSLSPLLTFSKFCCTVLLQIMERESGNCPKDRYVVCGRVFLSLEDLSTGKYLLTFPKKKPIEHMEDLFALLAAFHKSCFQITSPGYALNSMKVWLLEHMKCEIIKEFPEVYFCERPGSFYGTLFTWKQRTPFEGILIIYSRNQTVMFQCLHNLIRILPINCFLKNLKSGSENFLIDNMAFTLEKELVTLSSLSSAIAKHESNFMQRCEVSKGKSSVVAAALSDRRENIHPYRKELQREKKKMLQTNLKVSGALYREITLKVAEVQLKSDFAAQKLSNL 884 T 0.46 DP unp F Eukaryota T 7kzq 2 B,J B,O FANCB_HUMAN PROTEIN FACB,FANCONI ANEMIA-ASSOCIATED POLYPEPTIDE OF 95 KDA,FAAP95 MDYKDDDDKENLYFQGGGRKLGTGSMTSKQAMSSNEQERLLCYNGEVLVFQLSKGNFADKEPTKTPILHVRRMVFDRGTKVFVQKSTGFFTIKEENSHLKIMCCNCVSDFRTGINLPYIVIEKNKKNNVFEYFLLILHSTNKFEMRLSFKLGYEMKDGLRVLNGPLILWRHVKAFFFISSQTGKVVSVSGNFSSIQWAGEIENLGMVLLGLKECCLSEEECTQEPSKSDYAIWNTKFCVYSLESQEVLSDIYIIPPAYSSVVTYVHICATEIIKNQLRISLIALTRKNQLISFQNGTPKNVCQLPFGDPCAVQLMDSGGGNLFFVVSFISNNACAVWKESFQVAAKWEKLSLVLIDDFIGSGTEQVLLLFKDSLNSDCLTSFKITDLGKINYSSEPSDCNEDDLFEDKQENRYLVVPPLETGLKVCFSSFRELRQHLLLKEKIISKSYKALINLVQGKDDNTSSAEEKECLVPLCGEEENSVHILDEKLSDNFQDSEQLVEKIWYRVIDDSLVVGVKTTSSLKLSLNDVTLSLLMDQAHDSRFRLLKCQNRVIKLSTNPFPAPYLMPCEIGLEAKRVTLTPDSKKEESFVCEHPSKKECVQIITAVTSLSPLLTFSKFCCTVLLQIMERESGNCPKDRYVVCGRVFLSLEDLSTGKYLLTFPKKKPIEHMEDLFALLAAFHKSCFQITSPGYALNSMKVWLLEHMKCEIIKEFPEVYFCERPGSFYGTLFTWKQRTPFEGILIIYSRNQTVMFQCLHNLIRILPINCFLKNLKSGSENFLIDNMAFTLEKELVTLSSLSSAIAKHESNFMQRCEVSKGKSSVVAAALSDRRENIHPYRKELQREKKKMLQTNLKVSGALYREITLKVAEVQLKSDFAAQKLSNL 884 T 0.46 DP unp F Eukaryota T 7kzr 2 B,J B,O FANCB_HUMAN PROTEIN FACB,FANCONI ANEMIA-ASSOCIATED POLYPEPTIDE OF 95 KDA,FAAP95 MDYKDDDDKENLYFQGGGRKLGTGSMTSKQAMSSNEQERLLCYNGEVLVFQLSKGNFADKEPTKTPILHVRRMVFDRGTKVFVQKSTGFFTIKEENSHLKIMCCNCVSDFRTGINLPYIVIEKNKKNNVFEYFLLILHSTNKFEMRLSFKLGYEMKDGLRVLNGPLILWRHVKAFFFISSQTGKVVSVSGNFSSIQWAGEIENLGMVLLGLKECCLSEEECTQEPSKSDYAIWNTKFCVYSLESQEVLSDIYIIPPAYSSVVTYVHICATEIIKNQLRISLIALTRKNQLISFQNGTPKNVCQLPFGDPCAVQLMDSGGGNLFFVVSFISNNACAVWKESFQVAAKWEKLSLVLIDDFIGSGTEQVLLLFKDSLNSDCLTSFKITDLGKINYSSEPSDCNEDDLFEDKQENRYLVVPPLETGLKVCFSSFRELRQHLLLKEKIISKSYKALINLVQGKDDNTSSAEEKECLVPLCGEEENSVHILDEKLSDNFQDSEQLVEKIWYRVIDDSLVVGVKTTSSLKLSLNDVTLSLLMDQAHDSRFRLLKCQNRVIKLSTNPFPAPYLMPCEIGLEAKRVTLTPDSKKEESFVCEHPSKKECVQIITAVTSLSPLLTFSKFCCTVLLQIMERESGNCPKDRYVVCGRVFLSLEDLSTGKYLLTFPKKKPIEHMEDLFALLAAFHKSCFQITSPGYALNSMKVWLLEHMKCEIIKEFPEVYFCERPGSFYGTLFTWKQRTPFEGILIIYSRNQTVMFQCLHNLIRILPINCFLKNLKSGSENFLIDNMAFTLEKELVTLSSLSSAIAKHESNFMQRCEVSKGKSSVVAAALSDRRENIHPYRKELQREKKKMLQTNLKVSGALYREITLKVAEVQLKSDFAAQKLSNL 884 T 0.46 DP unp F Eukaryota T 7kzs 2 B,I B,O FANCB_HUMAN PROTEIN FACB,FANCONI ANEMIA-ASSOCIATED POLYPEPTIDE OF 95 KDA,FAAP95 MDYKDDDDKENLYFQGGGRKLGTGSMTSKQAMSSNEQERLLCYNGEVLVFQLSKGNFADKEPTKTPILHVRRMVFDRGTKVFVQKSTGFFTIKEENSHLKIMCCNCVSDFRTGINLPYIVIEKNKKNNVFEYFLLILHSTNKFEMRLSFKLGYEMKDGLRVLNGPLILWRHVKAFFFISSQTGKVVSVSGNFSSIQWAGEIENLGMVLLGLKECCLSEEECTQEPSKSDYAIWNTKFCVYSLESQEVLSDIYIIPPAYSSVVTYVHICATEIIKNQLRISLIALTRKNQLISFQNGTPKNVCQLPFGDPCAVQLMDSGGGNLFFVVSFISNNACAVWKESFQVAAKWEKLSLVLIDDFIGSGTEQVLLLFKDSLNSDCLTSFKITDLGKINYSSEPSDCNEDDLFEDKQENRYLVVPPLETGLKVCFSSFRELRQHLLLKEKIISKSYKALINLVQGKDDNTSSAEEKECLVPLCGEEENSVHILDEKLSDNFQDSEQLVEKIWYRVIDDSLVVGVKTTSSLKLSLNDVTLSLLMDQAHDSRFRLLKCQNRVIKLSTNPFPAPYLMPCEIGLEAKRVTLTPDSKKEESFVCEHPSKKECVQIITAVTSLSPLLTFSKFCCTVLLQIMERESGNCPKDRYVVCGRVFLSLEDLSTGKYLLTFPKKKPIEHMEDLFALLAAFHKSCFQITSPGYALNSMKVWLLEHMKCEIIKEFPEVYFCERPGSFYGTLFTWKQRTPFEGILIIYSRNQTVMFQCLHNLIRILPINCFLKNLKSGSENFLIDNMAFTLEKELVTLSSLSSAIAKHESNFMQRCEVSKGKSSVVAAALSDRRENIHPYRKELQREKKKMLQTNLKVSGALYREITLKVAEVQLKSDFAAQKLSNL 884 T 0.46 DP unp F Eukaryota T 7kzt 2 B,I B,O FANCB_HUMAN PROTEIN FACB,FANCONI ANEMIA-ASSOCIATED POLYPEPTIDE OF 95 KDA,FAAP95 MDYKDDDDKENLYFQGGGRKLGTGSMTSKQAMSSNEQERLLCYNGEVLVFQLSKGNFADKEPTKTPILHVRRMVFDRGTKVFVQKSTGFFTIKEENSHLKIMCCNCVSDFRTGINLPYIVIEKNKKNNVFEYFLLILHSTNKFEMRLSFKLGYEMKDGLRVLNGPLILWRHVKAFFFISSQTGKVVSVSGNFSSIQWAGEIENLGMVLLGLKECCLSEEECTQEPSKSDYAIWNTKFCVYSLESQEVLSDIYIIPPAYSSVVTYVHICATEIIKNQLRISLIALTRKNQLISFQNGTPKNVCQLPFGDPCAVQLMDSGGGNLFFVVSFISNNACAVWKESFQVAAKWEKLSLVLIDDFIGSGTEQVLLLFKDSLNSDCLTSFKITDLGKINYSSEPSDCNEDDLFEDKQENRYLVVPPLETGLKVCFSSFRELRQHLLLKEKIISKSYKALINLVQGKDDNTSSAEEKECLVPLCGEEENSVHILDEKLSDNFQDSEQLVEKIWYRVIDDSLVVGVKTTSSLKLSLNDVTLSLLMDQAHDSRFRLLKCQNRVIKLSTNPFPAPYLMPCEIGLEAKRVTLTPDSKKEESFVCEHPSKKECVQIITAVTSLSPLLTFSKFCCTVLLQIMERESGNCPKDRYVVCGRVFLSLEDLSTGKYLLTFPKKKPIEHMEDLFALLAAFHKSCFQITSPGYALNSMKVWLLEHMKCEIIKEFPEVYFCERPGSFYGTLFTWKQRTPFEGILIIYSRNQTVMFQCLHNLIRILPINCFLKNLKSGSENFLIDNMAFTLEKELVTLSSLSSAIAKHESNFMQRCEVSKGKSSVVAAALSDRRENIHPYRKELQREKKKMLQTNLKVSGALYREITLKVAEVQLKSDFAAQKLSNL 884 T 0.46 DP unp F Eukaryota T 7kzv 2 B,I B,O FANCB_HUMAN PROTEIN FACB,FANCONI ANEMIA-ASSOCIATED POLYPEPTIDE OF 95 KDA,FAAP95 MDYKDDDDKENLYFQGGGRKLGTGSMTSKQAMSSNEQERLLCYNGEVLVFQLSKGNFADKEPTKTPILHVRRMVFDRGTKVFVQKSTGFFTIKEENSHLKIMCCNCVSDFRTGINLPYIVIEKNKKNNVFEYFLLILHSTNKFEMRLSFKLGYEMKDGLRVLNGPLILWRHVKAFFFISSQTGKVVSVSGNFSSIQWAGEIENLGMVLLGLKECCLSEEECTQEPSKSDYAIWNTKFCVYSLESQEVLSDIYIIPPAYSSVVTYVHICATEIIKNQLRISLIALTRKNQLISFQNGTPKNVCQLPFGDPCAVQLMDSGGGNLFFVVSFISNNACAVWKESFQVAAKWEKLSLVLIDDFIGSGTEQVLLLFKDSLNSDCLTSFKITDLGKINYSSEPSDCNEDDLFEDKQENRYLVVPPLETGLKVCFSSFRELRQHLLLKEKIISKSYKALINLVQGKDDNTSSAEEKECLVPLCGEEENSVHILDEKLSDNFQDSEQLVEKIWYRVIDDSLVVGVKTTSSLKLSLNDVTLSLLMDQAHDSRFRLLKCQNRVIKLSTNPFPAPYLMPCEIGLEAKRVTLTPDSKKEESFVCEHPSKKECVQIITAVTSLSPLLTFSKFCCTVLLQIMERESGNCPKDRYVVCGRVFLSLEDLSTGKYLLTFPKKKPIEHMEDLFALLAAFHKSCFQITSPGYALNSMKVWLLEHMKCEIIKEFPEVYFCERPGSFYGTLFTWKQRTPFEGILIIYSRNQTVMFQCLHNLIRILPINCFLKNLKSGSENFLIDNMAFTLEKELVTLSSLSSAIAKHESNFMQRCEVSKGKSSVVAAALSDRRENIHPYRKELQREKKKMLQTNLKVSGALYREITLKVAEVQLKSDFAAQKLSNL 884 T 0.46 DP unp F Eukaryota T 7kzw 1 A A Q5NEJ0_FRATT FTT_1639c DKYQARELPLLKHGYSKKNMTAYNMFGFCCDNTPSGIFNIMDKKPTEFLVNIYVGDNQGCKFIYAADTKGKQGEITQTGSFTAYLSGRNELLKLECKGKDSNIDYKVIAYANAIEYDRVGNLSYLVESGGL 131 T 0.11 DUF4972 unphh F Bacteria T 7l04 1 A C C0LA97_9VIRU VP1 YPKKKKARIE 10 T 27 MARCKS pdbhh T Viruses T 7l08 84 FC u P-site finger XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 65 F F F 7l0u 1 A,AA,AB,B,BA,BB,C,CA,CB,D,DA,DB,E,EA,EB,F,FA,FB,G,GA,GB,H,HA,HB,I,IA,J,JA,K,KA,L,LA,M,MA,N,NA,O,OA,P,PA,Q,QA,R,RA,S,SA,T,TA,U,UA,V,VA,W,WA,X,XA,Y,YA,Z,ZA A,a,1,B,b,2,C,c,3,D,d,4,E,e,5,F,f,6,G,g,7,H,h,8,I,i,J,j,K,k,L,l,M,m,N,n,O,o,P,p,Q,q,R,r,S,s,T,t,U,u,V,v,W,w,X,x,Y,y,Z,z B9UYL6_HBOC2 VP2 PROTEIN GSGVGISTGGWVGGSYFTDSYVITKNTRQFLVKIQNDHKYRTENIIPSNAGGKSQRCVSTPWSYFNFNQYSSHFSPQDWQRLTNEYKRFKPRKMHVKIYNLQIKQILSNGADTTYNNDLTAGVHIFCDGEHAYPNATHPWDEDVMPELPYETWYLFQYGYIPVIHELAEMEDANAVEKAIALQIPFFMLENSDHEVLRTGESTEFTFDFDCEWINNERAYIPPGLMFNPKVPTRRAQYIRQHGNTASSNTRIQPYAKPTSWMTGPGLLSAQRVGPAGSDTASWMVVVNPDGTAVNSGMAGVGSGFDPPSGSLRPTDLEYKIQWYQTPEGTNSDGNIISNPPLSMLRDQALYRGNQTTYNLCSDVWMFPNQIWDRYPITRENPIWCKKPRSDKNTIIDPFDGTLAMDHPPGTIFIKMAKIPVPSNNNADSYLNIYCTGQVSCEIVWEVERYATKNWRPERRHTALGLGIGGEENINPTYHVDKNGKYIQPTTWDMCYPIKTNINKVL 506 T 4.2E-12 Parvo_coat pdbpssm T Viruses T 7l0v 1 A,AA,AB,B,BA,BB,C,CA,CB,D,DA,DB,E,EA,EB,F,FA,FB,G,GA,GB,H,HA,HB,I,IA,J,JA,K,KA,L,LA,M,MA,N,NA,O,OA,P,PA,Q,QA,R,RA,S,SA,T,TA,U,UA,V,VA,W,WA,X,XA,Y,YA,Z,ZA A,a,1,B,b,2,C,c,3,D,d,4,E,e,5,F,f,6,G,g,7,H,h,8,I,i,J,j,K,k,L,l,M,m,N,n,O,o,P,p,Q,q,R,r,S,s,T,t,U,u,V,v,W,w,X,x,Y,y,Z,z B9UYL6_HBOC2 VP2 PROTEIN GSGVGISTGGWVGGSYFTDSYVITKNTRQFLVKIQNDHKYRTENIIPSNAGGKSQRCVSTPWSYFNFNQYSSHFSPQDWQRLTNEYKRFKPRKMHVKIYNLQIKQILSNGADTTYNNDLTAGVHIFCDGEHAYPNATHPWDEDVMPELPYETWYLFQYGYIPVIHELAEMEDANAVEKAIALQIPFFMLENSDHEVLRTGESTEFTFDFDCEWINNERAYIPPGLMFNPKVPTRRAQYIRQHGNTASSNTRIQPYAKPTSWMTGPGLLSAQRVGPAGSDTASWMVVVNPDGTAVNSGMAGVGSGFDPPSGSLRPTDLEYKIQWYQTPEGTNSDGNIISNPPLSMLRDQALYRGNQTTYNLCSDVWMFPNQIWDRYPITRENPIWCKKPRSDKNTIIDPFDGTLAMDHPPGTIFIKMAKIPVPSNNNADSYLNIYCTGQVSCEIVWEVERYATKNWRPERRHTALGLGIGGEENINPTYHVDKNGKYIQPTTWDMCYPIKTNINKVL 506 T 4.2E-12 Parvo_coat pdbpssm T Viruses T 7l0w 1 A,AA,AB,B,BA,BB,C,CA,CB,D,DA,DB,E,EA,EB,F,FA,FB,G,GA,GB,H,HA,HB,I,IA,J,JA,K,KA,L,LA,M,MA,N,NA,O,OA,P,PA,Q,QA,R,RA,S,SA,T,TA,U,UA,V,VA,W,WA,X,XA,Y,YA,Z,ZA A,a,1,B,b,2,C,c,3,D,d,4,E,e,5,F,f,6,G,g,7,H,h,8,I,i,J,j,K,k,L,l,M,m,N,n,O,o,P,p,Q,q,R,r,S,s,T,t,U,u,V,v,W,w,X,x,Y,y,Z,z H9C5X6_HBOC1 VP2 GSGVGISTGGWVGGSHFSDKYVVTKNTRQFITTIQNGHLYKTEAIETTNQSGKSQRCVTTPWTYFNFNQYSCHFSPQDWQRLTNEYKRFRPKAMQVKIYNLQIKQILSNGADTTYNNDLTAGVHIFCDGEHAYPNASHPWDEDVMPDLPYKTWKLFQYGYIPIENELADLDGNAAGGNATEKALLYQMPFFLLENSDHQVLRTGESTEFTFNFDCEWVNNERAYIPPGLMFNPKVPTRRVQYIRQNGSTAASTGRIQPYSKPTSWMTGPGLLSAQRVGPQSSDTAPFMVCTNPEGTHINTGAAGFGSGFDPPSGCLAPTNLEYKLQWYQTPEGTGNNGNIIANPSLSMLRDQLLYKGNQTTYNLVGDIWMFPNQVWDRFPITRENPIWCKKPRADKHTIMDPFDGSIAMDHPPGTIFIKMAKIPVPTASNADSYLNIYCTGQVSCEIVWEVERYATKNWRPERRHTALGMSLGGESNYTPTYHVDPTGAYIQPTSYDQCMPVKTNINKVL 510 T 5.8E-14 Parvo_coat unppercent T Viruses T 7l0x 1 A,AA,AB,B,BA,BB,C,CA,CB,D,DA,DB,E,EA,EB,F,FA,FB,G,GA,GB,H,HA,HB,I,IA,J,JA,K,KA,L,LA,M,MA,N,NA,O,OA,P,PA,Q,QA,R,RA,S,SA,T,TA,U,UA,V,VA,W,WA,X,XA,Y,YA,Z,ZA A,a,1,B,b,2,C,c,3,D,d,4,E,e,5,F,f,6,G,g,7,H,h,8,I,i,J,j,K,k,L,l,M,m,N,n,O,o,P,p,Q,q,R,r,S,s,T,t,U,u,V,v,W,w,X,x,Y,y,Z,z B9UYL6_HBOC2 VP2 PROTEIN GATGSVGGGKGSGVGISTGGWVGGSYFTDSYVITKNTRQFLVKIQNDHKYRTENIIPSNAGGKSQRCVSTPWSYFNFNQYSSHFSPQDWQRLTNEYKRFKPRKMHVKIYNLQIKQILSNGADTTYNNDLTAGVHIFCDGEHAYPNATHPWDEDVMPELPYETWYLFQYGYIPVIHELAEMEDANAVEKAIALQIPFFMLENSDHEVLRTGESTEFTFDFDCEWINNERAYIPPGLMFNPKVPTRRAQYIRQHGNTASSNTRIQPYAKPTSWMTGPGLLSAQRVGPAGSDTASWMVVVNPDGTAVNSGMAGVGSGFDPPSGSLRPTDLEYKIQWYQTPEGTNSDGNIISNPPLSMLRDQALYRGNQTTYNLCSDVWMFPNQIWDRYPITRENPIWCKKPRSDKNTIIDPFDGTLAMDHPPGTIFIKMAKIPVPSNNNADSYLNIYCTGQVSCEIVWEVERYATKNWRPERRHTALGLGIGGEENINPTYHVDKNGKYIQPTTWDMCYPIKTNINKVL 516 T 8.4E-11 Parvo_coat unppercent T Viruses T 7l0y 1 A,AA,AB,B,BA,BB,C,CA,CB,D,DA,DB,E,EA,EB,F,FA,FB,G,GA,GB,H,HA,HB,I,IA,J,JA,K,KA,L,LA,M,MA,N,NA,O,OA,P,PA,Q,QA,R,RA,S,SA,T,TA,U,UA,V,VA,W,WA,X,XA,Y,YA,Z,ZA A,a,1,B,b,2,C,c,3,D,d,4,E,e,5,F,f,6,G,g,7,H,h,8,I,i,J,j,K,k,L,l,M,m,N,n,O,o,P,p,Q,q,R,r,S,s,T,t,U,u,V,v,W,w,X,x,Y,y,Z,z H9C5X6_HBOC1 VP2 GTGSIGGGKGSGVGISTGGWVGGSHFSDKYVVTKNTRQFITTIQNGHLYKTEAIETTNQSGKSQRCVTTPWTYFNFNQYSCHFSPQDWQRLTNEYKRFRPKAMQVKIYNLQIKQILSNGADTTYNNDLTAGVHIFCDGEHAYPNASHPWDEDVMPDLPYKTWKLFQYGYIPIENELADLDGNAAGGNATEKALLYQMPFFLLENSDHQVLRTGESTEFTFNFDCEWVNNERAYIPPGLMFNPKVPTRRVQYIRQNGSTAASTGRIQPYSKPTSWMTGPGLLSAQRVGPQSSDTAPFMVCTNPEGTHINTGAAGFGSGFDPPSGCLAPTNLEYKLQWYQTPEGTGNNGNIIANPSLSMLRDQLLYKGNQTTYNLVGDIWMFPNQVWDRFPITRENPIWCKKPRADKHTIMDPFDGSIAMDHPPGTIFIKMAKIPVPTASNADSYLNIYCTGQVSCEIVWEVERYATKNWRPERRHTALGMSLGGESNYTPTYHVDPTGAYIQPTSYDQCMPVKTNINKVL 519 T 8.2E-15 Parvo_coat pdbpercent T Viruses T 7l1b 3 C C PK3CA_HUMAN PHOSPHATIDYLINOSITOL 4,5-BISPHOSPHATE 3-KINASE CATALYTIC SUBUNIT ALPHA ISOFORM,PTDINS-3-KINASE SUBUNIT ALPHA,PHOSPHATIDYLINOSITOL 4,5-BISPHOSPHATE 3-KINASE 110 KDA CATALYTIC SUBUNIT ALPHA,P110ALPHA,PHOSPHOINOSITIDE-3-KINASE CATALYTIC ALPHA POLYPEPTIDE,SERINE/THREONINE PROTEIN KINASE PIK3CA AHHGGWTTK 9 T 0.029 Prion_octapep pdbhh F Eukaryota T 7l1c 3 C C PK3CA_HUMAN PHOSPHATIDYLINOSITOL 4,5-BISPHOSPHATE 3-KINASE CATALYTIC SUBUNIT ALPHA ISOFORM,PTDINS-3-KINASE SUBUNIT ALPHA,PHOSPHATIDYLINOSITOL 4,5-BISPHOSPHATE 3-KINASE 110 KDA CATALYTIC SUBUNIT ALPHA,P110ALPHA,PHOSPHOINOSITIDE-3-KINASE CATALYTIC ALPHA POLYPEPTIDE,SERINE/THREONINE PROTEIN KINASE PIK3CA ALHGGWTTK 9 T 3.1 CFC pdbhh F Eukaryota T 7l1d 3 C C PK3CA_HUMAN PHOSPHATIDYLINOSITOL 4,5-BISPHOSPHATE 3-KINASE CATALYTIC SUBUNIT ALPHA ISOFORM,PTDINS-3-KINASE SUBUNIT ALPHA,PHOSPHATIDYLINOSITOL 4,5-BISPHOSPHATE 3-KINASE 110 KDA CATALYTIC SUBUNIT ALPHA,P110ALPHA,PHOSPHOINOSITIDE-3-KINASE CATALYTIC ALPHA POLYPEPTIDE,SERINE/THREONINE PROTEIN KINASE PIK3CA ALHGGWTTK 9 T 3.1 CFC pdbhh F Eukaryota T 7l1k 4 D D MLGP peptide MLGP 4 T 110 H2TH pdbhh F F 7l20 53 AB u 39 S P-site finger XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 65 F F F 7l2m 2 E,F E,F DKTX_CYRSC TAU-TRTX-HS1A,DOUBLE-KNOT TOXIN,DKTX MDCAKEGEVCSWGKKCCDLDNFYCPMEFIPHCKKYKPYVPVTTNCAKEGEVCGWGSKCCHGLDCPLAFIPYCEKYR 76 T 0.079 Conotoxin_I2 unp F Eukaryota T 7l2r 2 B,C F,E DKTX_CYRSC TAU-TRTX-HS1A,DOUBLE-KNOT TOXIN,DKTX MDCAKEGEVCSWGKKCCDLDNFYCPMEFIPHCKKYKPYVPVTTNCAKEGEVCGWGSKCCHGLDCPLAFIPYCEKYR 76 T 0.079 Conotoxin_I2 unp F Eukaryota T 7l2t 2 B,C F,E DKTX_CYRSC TAU-TRTX-HS1A,DOUBLE-KNOT TOXIN,DKTX MDCAKEGEVCSWGKKCCDLDNFYCPMEFIPHCKKYKPYVPVTTNCAKEGEVCGWGSKCCHGLDCPLAFIPYCEKYR 76 T 0.079 Conotoxin_I2 unp F Eukaryota T 7l2u 2 B,C F,E DKTX_CYRSC TAU-TRTX-HS1A,DOUBLE-KNOT TOXIN,DKTX MDCAKEGEVCSWGKKCCDLDNFYCPMEFIPHCKKYKPYVPVTTNCAKEGEVCGWGSKCCHGLDCPLAFIPYCEKYR 76 T 0.079 Conotoxin_I2 unp F Eukaryota T 7l33 1 A,B,C A,B,C Cu-3SCC XGIAAIKQEHAAIKQEIAAIKQEIAAIKWEGX 32 T 0.016 DivIC pdbpssm F T 7l4z 2 F,G,H,I,J S,T,R,U,V ACE-DTY-LYS-ALA-GLY-VAL-VAL-TYR-GLY-TYR-ASN-ALA-TRP-ILE-ARG-CYS-NH2 XXKAGVVYGYNAWIRCX 17 T 2.1 DUF3212 pdbhh F T 7l51 1 A,B A,B Cyclic plant protein PDP-23 GFCWHHSCVPSGTCADFPWPLGHQCFPD 28 T 0.46 DUF5763 pdbhh F T 7l53 1 A A Cyclic plant protein PDP-23 GFCWHHSCVPSGTCADFPWPLGHQCFPD 28 T 0.46 DUF5763 pdbhh F T 7l54 1 A A Cyclic plant protein PDP-23 GFCWHHSCVPSGTCADFPWPLGHQCFPD 28 T 0.46 DUF5763 pdbhh F T 7l55 1 A A Cyclic plant protein PDP-23 GFCWHHSCVPSGTCADFPWPLGHQCFPD 28 T 0.46 DUF5763 pdbhh F T 7l6g 1 A,B,C,D,E,F A,B,C,D,E,F A0A2D2CY67_METTR Metallo-mystery pair system four-Cys motif protein AGVKTQPVAVRFALVADGKEVGCGAPLANLGSGRLAGKLHEARLYVYGFELVDAKGKHTPIALTQNDWQYADVALLDFKDARGGNAACTPGNPAKNTTVVGAAPQGAYVGLAFSVGAPVESLVDGKPVFVNHSNVEAAPPPLDISGMAXNWQAGRRFVTIEVIPPAAVIKPDGSKSRTWMVHVGSTGCKGNPATGEIVACAHENRFPVVFDRFDPKTQRVELDLTTLFESSDISVDKGGAVGCMSALDDPDCPAVFRALGLNLADSAPGANDAGKPSRPGVSPIFSVGAAASKVAGGKQ 299 T 0.79 DUF4382 pdbhh F Bacteria T 7l6n 2 G N Substrate XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 33 F F F 7l6o 1 A,C,E a,c,e A0A1W6IPB2_9HIV1 ENVELOPE GLYCOPROTEIN GP120 AENLWVTVYYGVPVWKEAKTTLFCASDARAYEKEVHNVWATHACVPTDPSPQELVLGNVTENFNMWKNDMVDQMHEDIISLWDQSLKPCVKLTPLCVTLICSNATVKNGTVEEMKNCSFNTTTEIRDKEKKEYALFYKPDIVPLSETNNTSEYRLINCNTSACTQACPKVTFEPIPIHYCAPAGYAILKCNDETFNGTGPCSNVSTVQCTHGIRPVVSTQLLLNGSLAEKEIVIRSENLTNNAKIIIVHLHTPVEIVCTRPNNNTRKSVRIGPGQTFYATGDIIGDIKQAHCNISEEKWNDTLQKVGIELQKHFPNKTIKYNQSAGGDMEITTHSFNCGGEFFYCNTSNLFNGTYNGTYISTNSSANSTSTITLQCRIKQIINMWQGVGRCMYAPPIAGNITCRSNITGLLLTRDGGTNSNETETFRPAGGDMRDNWRSELYKYKVVKIEPLGVAPTRCKRRVVGR 466 T 3.5E-53 GP120 pdbpssm T Viruses T 7l7a 1 A A NuxVA GCCPAPLTCHCVIY 14 T 3 US10 pdbhh F T 7l7t 1 A,C,E A,C,E Q2N0S6_9HIV1 BG505 SOSIP.v5.2(7S) - gp120 MKRGLCCVLLLCGAVFVSPSQEIHARFRRGARAENLWVTVYYGVPVWKDAETTLFCASDAKAYETKKHNVWATHCCVPTDPNPQEIHLENVTEEFNMWKNNMVEQMHTDIISLWDQSLKPCVKLTPLCVTLQCTNVTNNITDDMRGELKNCSFNMTTELRDKKQKVYSLFYRLDVVQINENQGNRSNNSNKEYRLINCNTSAITQACPKVSFEPIPIHYCAPAGFAILKCKDKKFNGTGPCTNVSTVQCTHGIKPVVSTQLLLNGSLAEEEVIIRSENITNNAKNILVQLNESVQINCTRPNNNTVKSIRIGPGQWFYYTGDIIGDIRQAHCNVSKATWNETLGKVVKQLRKHFGNNTIIRFANSSGGDLEVTTHSFNCGGEFFYCNTSGLFNSTWISNTSVQGSNSTGSNDSITLPCRIKQIINMWQRIGQAMYAPPIQGVIRCVSNITGLILTRDGGSTNSTTETFRPGGGDMRDNWRSELYKYKVVKIEPLGVAPTRCKRRV 505 T 4.1E-54 GP120 pdbpercent T Viruses T 7l7u 1 A,C,E A,C,E Q2N0S6_9HIV1 BG505 SOSIP.v5.2(7S) - gp120 MKRGLCCVLLLCGAVFVSPSQEIHARFRRGARAENLWVTVYYGVPVWKDAETTLFCASDAKAYETKKHNVWATHCCVPTDPNPQEIHLENVTEEFNMWKNNMVEQMHTDIISLWDQSLKPCVKLTPLCVTLQCTNVTNNITDDMRGELKNCSFNMTTELRDKKQKVYSLFYRLDVVQINENQGNRSNNSNKEYRLINCNTSAITQACPKVSFEPIPIHYCAPAGFAILKCKDKKFNGTGPCTNVSTVQCTHGIKPVVSTQLLLNGSLAEEEVIIRSENITNNAKNILVQLNESVQINCTRPNNNTVKSIRIGPGQWFYYTGDIIGDIRQAHCNVSKATWNETLGKVVKQLRKHFGNNTIIRFANSSGGDLEVTTHSFNCGGEFFYCNTSGLFNSTWISNTSVQGSNSTGSNDSITLPCRIKQIINMWQRIGQAMYAPPIQGVIRCVSNITGLILTRDGGSTNSTTETFRPGGGDMRDNWRSELYKYKVVKIEPLGVAPTRCKRRV 505 T 4.1E-54 GP120 pdbpercent T Viruses T 7l86 1 A H Rh.32034 pAbC-1 - Heavy Chain XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 119 F F F 7l86 2 B L Rh.32034 pAbC-1 - Light Chain XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 105 F F F 7l86 3 C,E,G E,C,A Q2N0S6_9HIV1 BG505 SOSIP MD39 - gp120 MGILPSPGMPALLSLVSLLMGCVAETGAENLWVTVYYGVPVWKDAETTLFCASDAKAYETKKHNVWATHCCVPTDPNPQEIHLENVTEEFNMWKNNMVEQMHEDIISLWDQSLKPCVKLTPLCVTLQCTNVTNNITDDMRGELKNCSFNMTTELRDKKQKVYSLFYRLDVVQINENQGNRSNNSNKEYRLINCNTSAITQACPKVSFEPIPIHYCAPAGFAILKCKDKKFNGTGPCPSVSTVQCTHGIKPVVSTQLLLNGSLAEEEVIIRSENITNNAKNILVQLNTPVQINCTRPNNNTVKSIRIGPGQWFYYTGDIIGDIRQAHCNVSKATWNETLGKVVKQLRKHFGNNTIIRFAQSSGGDLEVTTHSFNCGGEFFYCNTSGLFNSTWISNTSVQGSNSTGSNDSITLPCRIKQIINMWQRIGQAMYAPPIQGVIRCVSNITGLILTRDGGSTNSTTETFRPGGGDMRDNWRSELYKYKVVKIEPLGVAPTRCKRRV 500 T 3.9999999999999995E-54 GP120 pdbpercent T Viruses T 7l87 1 A H Rh.32034 pAbC-2 - Heavy Chain XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 121 F F F 7l87 2 B L Rh.32034 pAbC-2 - Light Chain XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 103 F F F 7l87 3 C,E,G C,A,D Q2N0S6_9HIV1 BG505 SOSIP MD39 - gp120 MGILPSPGMPALLSLVSLLMGCVAETGAENLWVTVYYGVPVWKDAETTLFCASDAKAYETKKHNVWATHCCVPTDPNPQEIHLENVTEEFNMWKNNMVEQMHEDIISLWDQSLKPCVKLTPLCVTLQCTNVTNNITDDMRGELKNCSFNMTTELRDKKQKVYSLFYRLDVVQINENQGNRSNNSNKEYRLINCNTSAITQACPKVSFEPIPIHYCAPAGFAILKCKDKKFNGTGPCPSVSTVQCTHGIKPVVSTQLLLNGSLAEEEVIIRSENITNNAKNILVQLNTPVQINCTRPNNNTVKSIRIGPGQWFYYTGDIIGDIRQAHCNVSKATWNETLGKVVKQLRKHFGNNTIIRFAQSSGGDLEVTTHSFNCGGEFFYCNTSGLFNSTWISNTSVQGSNSTGSNDSITLPCRIKQIINMWQRIGQAMYAPPIQGVIRCVSNITGLILTRDGGSTNSTTETFRPGGGDMRDNWRSELYKYKVVKIEPLGVAPTRCKR 498 T 3.9E-54 GP120 pdbpercent T Viruses T 7l88 1 A,C,E D,A,C Q2N0S6_9HIV1 BG505 SOSIP MD39 - gp120 MGILPSPGMPALLSLVSLLMGCVAETGAENLWVTVYYGVPVWKDAETTLFCASDAKAYETKKHNVWATHCCVPTDPNPQEIHLENVTEEFNMWKNNMVEQMHEDIISLWDQSLKPCVKLTPLCVTLQCTNVTNNITDDMRGELKNCSFNMTTELRDKKQKVYSLFYRLDVVQINENQGNRSNNSNKEYRLINCNTSAITQACPKVSFEPIPIHYCAPAGFAILKCKDKKFNGTGPCPSVSTVQCTHGIKPVVSTQLLLNGSLAEEEVIIRSENITNNAKNILVQLNTPVQINCTRPNNNTVKSIRIGPGQWFYYTGDIIGDIRQAHCNVSKATWNETLGKVVKQLRKHFGNNTIIRFAQSSGGDLEVTTHSFNCGGEFFYCNTSGLFNSTWISNTSVQGSNSTGSNDSITLPCRIKQIINMWQRIGQAMYAPPIQGVIRCVSNITGLILTRDGGSTNSTTETFRPGGGDMRDNWRSELYKYKVVKIEPLGVAPTRCKR 498 T 3.9E-54 GP120 pdbpercent T Viruses T 7l88 3 G H Rh.32034 pAbC-3 - Heavy Chain XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 114 F F F 7l88 4 H L Rh.32034 pAbC-3 - Light Chain XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 107 F F F 7l89 1 A H Rh.32034 pAbC-4 - Heavy Chain XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 114 F F F 7l89 2 B L Rh.32034 pAbC-4 - Light Chain XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 97 F F F 7l89 3 C,E,G E,A,B Q2N0S6_9HIV1 BG505 SOSIP MD39 - gp120 MGILPSPGMPALLSLVSLLMGCVAETGAENLWVTVYYGVPVWKDAETTLFCASDAKAYETKKHNVWATHCCVPTDPNPQEIHLENVTEEFNMWKNNMVEQMHEDIISLWDQSLKPCVKLTPLCVTLQCTNVTNNITDDMRGELKNCSFNMTTELRDKKQKVYSLFYRLDVVQINENQGNRSNNSNKEYRLINCNTSAITQACPKVSFEPIPIHYCAPAGFAILKCKDKKFNGTGPCPSVSTVQCTHGIKPVVSTQLLLNGSLAEEEVIIRSENITNNAKNILVQLNTPVQINCTRPNNNTVKSIRIGPGQWFYYTGDIIGDIRQAHCNVSKATWNETLGKVVKQLRKHFGNNTIIRFAQSSGGDLEVTTHSFNCGGEFFYCNTSGLFNSTWISNTSVQGSNSTGSNDSITLPCRIKQIINMWQRIGQAMYAPPIQGVIRCVSNITGLILTRDGGSTNSTTETFRPGGGDMRDNWRSELYKYKVVKIEPLGVAPTRCKR 498 T 3.9E-54 GP120 pdbpercent T Viruses T 7l8a 1 A,E,G E,A,C Q2N0S6_9HIV1 BG505 SOSIP MD39 - gp120 NLWVTVYYGVPVWKDAETTLFCASDAKAYETEKHNVWATHACVPTDPNPQEIHLENVTEEFNMWKNNMVEQMHEDIISLWDQSLKPCVKLTPLCVTLQCTNVTNNITDDMRGELKNCSFNMTTELRDKKQKVYSLFYRLDVVQINENQGNRSNNSNKEYRLINCNTSAITQACPKVSFEPIPIHYCAPAGFAILKCKDKKFNGTGPCPSVSTVQCTHGIKPVVSTQLLLNGSLAEEEVIIRSENITNNAKNILVQLNTPVQINCTRPNNNTVKSIRIGPGQAFYYTGDIIGDIRQAHCNVSKATWNETLGKVVKQLRKHFGNNTIIRFAQSSGGDLEVTTHSFNCGGEFFYCNTSGLFNSTWISNTSVQGSNSTGSNDSITLPCRIKQIINMWQRIGQAMYAPPIQGVIRCVSNITGLILTRDGGSTNSTTETFRPGGGDMRDNWRSELYKYKVVKIEPLGVAPTRCKR 469 T 3.5E-54 GP120 pdbpercent T Viruses T 7l8a 3 C H Rh.33104 pAbC-1 - Heavy Chain XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 115 F F F 7l8a 4 D L Rh.33104 pAbC-1 - Light Chain XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 105 F F F 7l8b 1 A,C,E C,A,E Q2N0S6_9HIV1 BG505 SOSIP MD39 - gp120 NLWVTVYYGVPVWKDAETTLFCASDAKAYETEKHNVWATHACVPTDPNPQEIHLENVTEEFNMWKNNMVEQMHEDIISLWDQSLKPCVKLTPLCVTLQCTNVTNNITDDMRGELKNCSFNMTTELRDKKQKVYSLFYRLDVVQINENQGNRSNNSNKEYRLINCNTSAITQACPKVSFEPIPIHYCAPAGFAILKCKDKKFNGTGPCPSVSTVQCTHGIKPVVSTQLLLNGSLAEEEVIIRSENITNNAKNILVQLNTPVQINCTRPNNNTVKSIRIGPGQAFYYTGDIIGDIRQAHCNVSKATWNETLGKVVKQLRKHFGNNTIIRFAQSSGGDLEVTTHSFNCGGEFFYCNTSGLFNSTWISNTSVQGSNSTGSNDSITLPCRIKQIINMWQRIGQAMYAPPIQGVIRCVSNITGLILTRDGGSTNSTTETFRPGGGDMRDNWRSELYKYKVVKIEPLGVAPTRCKR 469 T 3.5E-54 GP120 pdbpercent T Viruses T 7l8b 3 G H Rh.33104 pAbC-2 - Heavy Chain XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 118 F F F 7l8b 4 H L Rh.33104 pAbC-2 - Light Chain XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 108 F F F 7l8c 1 A,C,E A,C,D Q2N0S6_9HIV1 BG505 SOSIP MD39 - gp120 NLWVTVYYGVPVWKDAETTLFCASDAKAYETEKHNVWATHACVPTDPNPQEIHLENVTEEFNMWKNNMVEQMHEDIISLWDQSLKPCVKLTPLCVTLQCTNVTNNITDDMRGELKNCSFNMTTELRDKKQKVYSLFYRLDVVQINENQGNRSNNSNKEYRLINCNTSAITQACPKVSFEPIPIHYCAPAGFAILKCKDKKFNGTGPCPSVSTVQCTHGIKPVVSTQLLLNGSLAEEEVIIRSENITNNAKNILVQLNTPVQINCTRPNNNTVKSIRIGPGQAFYYTGDIIGDIRQAHCNVSKATWNETLGKVVKQLRKHFGNNTIIRFAQSSGGDLEVTTHSFNCGGEFFYCNTSGLFNSTWISNTSVQGSNSTGSNDSITLPCRIKQIINMWQRIGQAMYAPPIQGVIRCVSNITGLILTRDGGSTNSTTETFRPGGGDMRDNWRSELYKYKVVKIEPLGVAPTRCKR 469 T 3.5E-54 GP120 pdbpercent T Viruses T 7l8c 3 G H Rh.33104 pAbC-3 - Heavy Chain XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 113 F F F 7l8c 4 H L Rh.33104 pAbC-3 - Light Chain XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 107 F F F 7l8d 1 A H Rh.33104 pAbC-3 - Heavy Chain XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 107 F F F 7l8d 2 B L Rh.33104 pAbC-3 - Light Chain XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 102 F F F 7l8d 3 C,E,G A,C,D Q2N0S6_9HIV1 BG505 SOSIP MD39 - gp120 NLWVTVYYGVPVWKDAETTLFCASDAKAYETEKHNVWATHACVPTDPNPQEIHLENVTEEFNMWKNNMVEQMHEDIISLWDQSLKPCVKLTPLCVTLQCTNVTNNITDDMRGELKNCSFNMTTELRDKKQKVYSLFYRLDVVQINENQGNRSNNSNKEYRLINCNTSAITQACPKVSFEPIPIHYCAPAGFAILKCKDKKFNGTGPCPSVSTVQCTHGIKPVVSTQLLLNGSLAEEEVIIRSENITNNAKNILVQLNTPVQINCTRPNNNTVKSIRIGPGQAFYYTGDIIGDIRQAHCNVSKATWNETLGKVVKQLRKHFGNNTIIRFAQSSGGDLEVTTHSFNCGGEFFYCNTSGLFNSTWISNTSVQGSNSTGSNDSITLPCRIKQIINMWQRIGQAMYAPPIQGVIRCVSNITGLILTRDGGSTNSTTETFRPGGGDMRDNWRSELYKYKVVKIEPLGVAPTRCKR 469 T 3.5E-54 GP120 pdbpercent T Viruses T 7l8e 2 G H Rh.33172 pAbC-1 Heavy Chain XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 111 F F F 7l8e 3 H L Rh.33172 pAbC-1 Light Chain XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 99 F F F 7l8f 2 G H Rh.33172 pAbC-2 Heavy Chain XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 127 F F F 7l8f 3 H L Rh.33172 pAbC-2 Light Chain XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 107 F F F 7l8g 2 G H Rh.33172 pAbC-3 Heavy Chain XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 109 F F F 7l8g 3 H L Rh.33172 pAbC-3 Light Chain XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 100 F F F 7l8s 3 G H Rh.33172 pAbC-4 - Heavy Chain XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 112 F F F 7l8s 4 H L Rh.33172 pAbC-4 - Light Chain XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 100 F F F 7l8t 1 A,C,E A,C,E Q2N0S6_9HIV1 BG505 SOSIP.v5.2 N241/N289 - gp120 MKRGLCCVLLLCGAVFVSPSQEIHARFRRGARAENLWVTVYYGVPVWKDAETTLFCASDAKAYETKKHNVWATHCCVPTDPNPQEIHLENVTEEFNMWKNNMVEQMHTDIISLWDQSLKPCVKLTPLCVTLQCTNVTNNITDDMRGELKNCSFNMTTELRDKKQKVYSLFYRLDVVQINENQGNRSNNSNKEYRLINCNTSAITQACPKVSFEPIPIHYCAPAGFAILKCKDKKFNGTGPCTNVSTVQCTHGIKPVVSTQLLLNGSLAEEEVIIRSENITNNAKNILVQLNESVQINCTRPNNNTRKSIRIGPGQWFYATGDIIGDIRQAHCNVSKATWNETLGKVVKQLRKHFGNNTIIRFANSSGGDLEVTTHSFNCGGEFFYCNTSGLFNSTWISNTSVQGSNSTGSNDSITLPCRIKQIINMWQRIGQAMYAPPIQGVIRCVSNITGLILTRDGGSTNSTTETFRPGGGDMRDNWRSELYKYKVVKIEPLGVAPTRCKR 503 T 1.5999999999999998E-49 GP120 unp T Viruses T 7l8t 3 G H Rh.33311 pAbC-1 - Heavy Chain XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 116 F F F 7l8t 4 H L Rh.33311 pAbC-1 - Light Chain XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 102 F F F 7l8u 1 A,C,E C,A,E Q2N0S6_9HIV1 BG505 SOSIP.v5.2 N241/N289 - gp120 MKRGLCCVLLLCGAVFVSPSQEIHARFRRGARAENLWVTVYYGVPVWKDAETTLFCASDAKAYETKKHNVWATHCCVPTDPNPQEIHLENVTEEFNMWKNNMVEQMHTDIISLWDQSLKPCVKLTPLCVTLQCTNVTNNITDDMRGELKNCSFNMTTELRDKKQKVYSLFYRLDVVQINENQGNRSNNSNKEYRLINCNTSAITQACPKVSFEPIPIHYCAPAGFAILKCKDKKFNGTGPCTNVSTVQCTHGIKPVVSTQLLLNGSLAEEEVIIRSENITNNAKNILVQLNESVQINCTRPNNNTRKSIRIGPGQWFYATGDIIGDIRQAHCNVSKATWNETLGKVVKQLRKHFGNNTIIRFANSSGGDLEVTTHSFNCGGEFFYCNTSGLFNSTWISNTSVQGSNSTGSNDSITLPCRIKQIINMWQRIGQAMYAPPIQGVIRCVSNITGLILTRDGGSTNSTTETFRPGGGDMRDNWRSELYKYKVVKIEPLGVAPTRCKR 503 T 1.5999999999999998E-49 GP120 unp T Viruses T 7l8u 3 G H Rh.33311 pAbC-2 - Heavy Chain XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 113 F F F 7l8u 4 H L Rh.33311 pAbC-2 - Light Chain XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 105 F F F 7l8w 1 A,C,E A,C,E Q2N0S6_9HIV1 BG505 SOSIP.v5.2 N241/N289 - gp120 MKRGLCCVLLLCGAVFVSPSQEIHARFRRGARAENLWVTVYYGVPVWKDAETTLFCASDAKAYETKKHNVWATHCCVPTDPNPQEIHLENVTEEFNMWKNNMVEQMHTDIISLWDQSLKPCVKLTPLCVTLQCTNVTNNITDDMRGELKNCSFNMTTELRDKKQKVYSLFYRLDVVQINENQGNRSNNSNKEYRLINCNTSAITQACPKVSFEPIPIHYCAPAGFAILKCKDKKFNGTGPCTNVSTVQCTHGIKPVVSTQLLLNGSLAEEEVIIRSENITNNAKNILVQLNESVQINCTRPNNNTRKSIRIGPGQWFYATGDIIGDIRQAHCNVSKATWNETLGKVVKQLRKHFGNNTIIRFANSSGGDLEVTTHSFNCGGEFFYCNTSGLFNSTWISNTSVQGSNSTGSNDSITLPCRIKQIINMWQRIGQAMYAPPIQGVIRCVSNITGLILTRDGGSTNSTTETFRPGGGDMRDNWRSELYKYKVVKIEPLGVAPTRCKR 503 T 1.5999999999999998E-49 GP120 unp T Viruses T 7l8w 3 G H Rh.33311 pAbC-3 - Heavy Chain XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 109 F F F 7l8w 4 H L Rh.33311 pAbC-3 - Light Chain XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 103 F F F 7l8x 1 A,C,G A,C,E Q2N0S6_9HIV1 BG505 SOSIP.v5.2 N241/N289 - gp120 MKRGLCCVLLLCGAVFVSPSQEIHARFRRGARAENLWVTVYYGVPVWKDAETTLFCASDAKAYETKKHNVWATHCCVPTDPNPQEIHLENVTEEFNMWKNNMVEQMHTDIISLWDQSLKPCVKLTPLCVTLQCTNVTNNITDDMRGELKNCSFNMTTELRDKKQKVYSLFYRLDVVQINENQGNRSNNSNKEYRLINCNTSAITQACPKVSFEPIPIHYCAPAGFAILKCKDKKFNGTGPCTNVSTVQCTHGIKPVVSTQLLLNGSLAEEEVIIRSENITNNAKNILVQLNESVQINCTRPNNNTRKSIRIGPGQWFYATGDIIGDIRQAHCNVSKATWNETLGKVVKQLRKHFGNNTIIRFANSSGGDLEVTTHSFNCGGEFFYCNTSGLFNSTWISNTSVQGSNSTGSNDSITLPCRIKQIINMWQRIGQAMYAPPIQGVIRCVSNITGLILTRDGGSTNSTTETFRPGGGDMRDNWRSELYKYKVVKIEPLGVAPTRCKR 503 T 1.5999999999999998E-49 GP120 unp T Viruses T 7l8x 3 E H Rh.33311 pAbC-4 - Heavy Chain XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 116 F F F 7l8x 4 F L Rh.33311 pAbC-4 - Light Chain XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 99 F F F 7l8y 1 A H Rh.33311 pAbC-5 - Heavy Chain XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 113 F F F 7l8y 2 B L Rh.33311 pAbC-5 - Light Chain XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 99 F F F 7l8y 3 C,E,G C,A,E Q2N0S6_9HIV1 BG505 SOSIP.v5.2 N241/N289 - gp120 MKRGLCCVLLLCGAVFVSPSQEIHARFRRGARAENLWVTVYYGVPVWKDAETTLFCASDAKAYETKKHNVWATHCCVPTDPNPQEIHLENVTEEFNMWKNNMVEQMHTDIISLWDQSLKPCVKLTPLCVTLQCTNVTNNITDDMRGELKNCSFNMTTELRDKKQKVYSLFYRLDVVQINENQGNRSNNSNKEYRLINCNTSAITQACPKVSFEPIPIHYCAPAGFAILKCKDKKFNGTGPCTNVSTVQCTHGIKPVVSTQLLLNGSLAEEEVIIRSENITNNAKNILVQLNESVQINCTRPNNNTRKSIRIGPGQWFYATGDIIGDIRQAHCNVSKATWNETLGKVVKQLRKHFGNNTIIRFANSSGGDLEVTTHSFNCGGEFFYCNTSGLFNSTWISNTSVQGSNSTGSNDSITLPCRIKQIINMWQRIGQAMYAPPIQGVIRCVSNITGLILTRDGGSTNSTTETFRPGGGDMRDNWRSELYKYKVVKIEPLGVAPTRCKR 503 T 1.5999999999999998E-49 GP120 unp T Viruses T 7l8z 1 A H Rh.33311 pAbC-7 - Heavy Chain XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 109 F F F 7l8z 2 B L Rh.33311 pAbC-7 - Light Chain XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 85 F F F 7l8z 3 C,E,G A,C,D Q2N0S6_9HIV1 BG505 SOSIP.v5.2 N241/N289 - gp120 MKRGLCCVLLLCGAVFVSPSQEIHARFRRGARAENLWVTVYYGVPVWKDAETTLFCASDAKAYETKKHNVWATHCCVPTDPNPQEIHLENVTEEFNMWKNNMVEQMHTDIISLWDQSLKPCVKLTPLCVTLQCTNVTNNITDDMRGELKNCSFNMTTELRDKKQKVYSLFYRLDVVQINENQGNRSNNSNKEYRLINCNTSAITQACPKVSFEPIPIHYCAPAGFAILKCKDKKFNGTGPCTNVSTVQCTHGIKPVVSTQLLLNGSLAEEEVIIRSENITNNAKNILVQLNESVQINCTRPNNNTRKSIRIGPGQWFYATGDIIGDIRQAHCNVSKATWNETLGKVVKQLRKHFGNNTIIRFANSSGGDLEVTTHSFNCGGEFFYCNTSGLFNSTWISNTSVQGSNSTGSNDSITLPCRIKQIINMWQRIGQAMYAPPIQGVIRCVSNITGLILTRDGGSTNSTTETFRPGGGDMRDNWRSELYKYKVVKIEPLGVAPTRCKR 503 T 1.5999999999999998E-49 GP120 unp T Viruses T 7l90 1 A,C,E A,C,E Q2N0S6_9HIV1 BG505 SOSIP.v5.2 N241/N289 - gp120 MKRGLCCVLLLCGAVFVSPSQEIHARFRRGARAENLWVTVYYGVPVWKDAETTLFCASDAKAYETKKHNVWATHCCVPTDPNPQEIHLENVTEEFNMWKNNMVEQMHTDIISLWDQSLKPCVKLTPLCVTLQCTNVTNNITDDMRGELKNCSFNMTTELRDKKQKVYSLFYRLDVVQINENQGNRSNNSNKEYRLINCNTSAITQACPKVSFEPIPIHYCAPAGFAILKCKDKKFNGTGPCTNVSTVQCTHGIKPVVSTQLLLNGSLAEEEVIIRSENITNNAKNILVQLNESVQINCTRPNNNTRKSIRIGPGQWFYATGDIIGDIRQAHCNVSKATWNETLGKVVKQLRKHFGNNTIIRFANSSGGDLEVTTHSFNCGGEFFYCNTSGLFNSTWISNTSVQGSNSTGSNDSITLPCRIKQIINMWQRIGQAMYAPPIQGVIRCVSNITGLILTRDGGSTNSTTETFRPGGGDMRDNWRSELYKYKVVKIEPLGVAPTRCKR 503 T 1.5999999999999998E-49 GP120 unp T Viruses T 7l90 3 G H Rh.33311 pAbC-8 - Heavy Chain XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 108 F F F 7l90 4 H L Rh.33311 pAbC-8 - Light Chain XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 93 F F F 7l98 1 A A Cyclic peptide ALA-MEA-PRO-ILE-PRO-ITZ AXPIPX 6 T 29 Csm4_C pdbhh F F 7l9d 1 A A Cyclic peptide ALA-PHE-PRO-ILE-PRO-ITZ AFPIPX 6 T 29 Csm4_C pdbhh F F 7l9e 1 A,E E,G GANAB_MOUSE ALPHA-GLUCOSIDASE 2,GLUCOSIDASE II SUBUNIT ALPHA MGILPSPGMPALLSLVSLLSVLLMGCVAETGVDRSNFKTCDESSFCKRQRSIRPGLSPYRALLDTLQLGPDALTVHLIHEVTKVLLVLELQGLQKDMTRIRIDELEPRRPRYRVPDVLVADPPTARLSVSGRDDNSVELTVAEGPYKIILTAQPFRLDLLEDRSLLLSVNARGLMAFEHQRAPR 184 T 0.0016 NtCtMGAM_N pdbpercent F Eukaryota T 7lbn 2 B D Calpain I Inhibitor XLLX 4 T 1700 EF-hand_1 pdbhh F F 7lc0 1 A,B,C,D,E,F,G,H A,B,C,D,E,F,G,H Q8ZL27_SALTY Putative selenocysteine synthase (L-seryl-tRNA(Ser) selenium transferase) MTPNIYQQLGLKKVINACGKMTILGVSSVAPEVMQATARAASAFVEIDALVEKTGELVSRYTGAEDSYITSCASAGIAIAVAAAITHGDRARVALMPDSSGMANEVVMLRGHNVDYGAPVTSAIRLGGGRIVEVGSSNLATRWQLESAINEKTAALLYVKSHHCVQKGMLSIDDFVQVAQANHLPLIVDAAAEEDLRGWVASGADMVIYSGAKAFNAPTSGFITGRKTWIAACKAQHQGIARAMKIGKENMVGLVYALENYHQGQTTVTAAQLQPVAEAISAIHGLYADIEQDEAGRAIWRIRVRVNASELGLNAQDVEAQLRGGEIAIYARKYQLHQGVFSLDPRTVAEGEMALIVARLREIAEHAAD 369 T 1.4 SelA pdbpssm F Bacteria T 7lc2 2 C,D D,E SIN1_HUMAN TORC2 SUBUNIT MAPKAP1,MITOGEN-ACTIVATED PROTEIN KINASE 2-ASSOCIATED PROTEIN 1,STRESS-ACTIVATED MAP KINASE-INTERACTING PROTEIN 1,MSIN1 GSKESLFVRINAAHGFSLIQVDNTKVTMKEILLKAVKRRKGSQKVSGPQYRLEKQSEPNVAVDLDSTLESQSAWEFCLVRENSSRADG 88 T 0.051 E3_UbLigase_RBR unp F Eukaryota T 7lce 1 A,B,C,D A,B,C,D DGAE_SALT1 SELENOCYSTEINE SYNTHASE,TRANSFERASE MTPNIYQQLGLKKVINACGKMTILGVSSVAPEVMQATARAASAFVEIDALVEKTGELVSRYTGAEDSYITSCASAGIAIAVAAAITHGDRARVALMPDSSGMANEVVMLRGHNVDYGAPVTSAIRLGGGRIVEVGSSNLATRWQLESAINEKTAALLYVKSHHCVQKGMLSIDDFVQVAQANHLPLIVDAAAEEDLRGWVASGADMVIYSGAKAFNAPTSGFITGRKTWIAACKAQHQGIARAMKIGKENMVGLVYALENYHQGQTTVTAAQLQPVAEAISAIHGLYADIEQDEAGRAIWRIRVRVNASELGLNAQDVEAQLRGGEIAIYARKYQLHQGVFSLDPRTVAEGEMALIVARLREIAEHAAD 369 T 1.4 SelA pdbpssm F Bacteria T 7lcw 1 A A A0A1H8GYX0_9BACL Lasso Peptide Lihuanodin GSKYSDTADESSYRW 15 T 0.015 DUF5972 unppercent F Bacteria T 7ldf 1 A A Minimal thioredoxin fold protein, ems_thioM_802 MDEVKVHVGDDQFEEVSREIKKAGWKVEVHKHPSNTSQVTVTKGNKQWTFKDPKQAVEFVQKSLEHHHHHH 71 T 0.077 AvrPtoB-E3_ubiq pdb F T 7ldg 1 A,C A,C HSF2B_HUMAN MEILB2 SARLETVQADNIREKKEKLALRQQLNEAKQQLLQQAEYCTEMGAAACTLLWGVSSSEEVVKAILGGDKALKFFSITGQTMESFVKSLDGDVQELDSDESQFVFALAGIVTNVAAIACGREFLVNSSRVLLDTILQLLGDLKPGQCTKLKVLMLMSLYNVSINLKGLKYISESPGFIPLLWWLLSDPDAEVCLHVLRLVQSVVLEPEVFSKSASEFRSSLPLQRILAMSKSRNPRLQTAAQELLEDLRTLEHNV 253 T 0.0026 KAP unphh F Eukaryota T 7ldg 2 B,D B,D BRCA2_HUMAN FANCONI ANEMIA GROUP D1 PROTEIN MKRRGEPLILVGEPSIKRNLLNEFDRIIENQEKSLKASKSTPDGTIKDRRLFMHHVSLEPITCVPF 66 T 5.3 TFIIA_gamma_N pdbhh F Eukaryota T 7leq 2 B B NFKB1_HUMAN DNA-BINDING FACTOR KBF1,EBP-1,NUCLEAR FACTOR OF KAPPA LIGHT POLYPEPTIDE GENE ENHANCER IN B-CELLS 1 DKEEVQRKRQKLMP 14 T 0.63 ETAA1 pdbhh F Eukaryota T 7let 2 B B NFKB1_HUMAN DNA-BINDING FACTOR KBF1,EBP-1,NUCLEAR FACTOR OF KAPPA LIGHT POLYPEPTIDE GENE ENHANCER IN B-CELLS 1 DKEEVQRKRQKLMP 14 T 0.63 ETAA1 pdbhh F Eukaryota T 7let 3 C C TF65_HUMAN NUCLEAR FACTOR NF-KAPPA-B P65 SUBUNIT,NUCLEAR FACTOR OF KAPPA LIGHT POLYPEPTIDE GENE ENHANCER IN B-CELLS 3 DRHRIEEKRKRTYETFKSIMKK 22 T 0.32 Fimbrial_CS1 unp F Eukaryota T 7leu 2 B,C B,C TF65_HUMAN NUCLEAR FACTOR NF-KAPPA-B P65 SUBUNIT,NUCLEAR FACTOR OF KAPPA LIGHT POLYPEPTIDE GENE ENHANCER IN B-CELLS 3 DRHRIEEKRKRTYETFKSIMKK 22 T 0.32 Fimbrial_CS1 unp F Eukaryota T 7lf4 2 B,E B,F NFKB1_HUMAN DNA-BINDING FACTOR KBF1,EBP-1,NUCLEAR FACTOR OF KAPPA LIGHT POLYPEPTIDE GENE ENHANCER IN B-CELLS 1 DKEEVQRKRQKLMP 14 T 0.63 ETAA1 pdbhh F Eukaryota T 7lf4 3 D,F D,E TF65_HUMAN NUCLEAR FACTOR NF-KAPPA-B P65 SUBUNIT,NUCLEAR FACTOR OF KAPPA LIGHT POLYPEPTIDE GENE ENHANCER IN B-CELLS 3 DRHRIEEKRKRTYETFKSIMKK 22 T 0.32 Fimbrial_CS1 unp F Eukaryota T 7lfc 2 B B NFKB1_HUMAN DNA-BINDING FACTOR KBF1,EBP-1,NUCLEAR FACTOR OF KAPPA LIGHT POLYPEPTIDE GENE ENHANCER IN B-CELLS 1 DKEEVQRKRQKLMP 14 T 0.63 ETAA1 pdbhh F Eukaryota T 7lfu 2 B A Papain-like protease peptide inhibitor VIR250 XXXGX 5 T 1400 UPF0547 pdbhh F F 7lfv 2 C,D D,F Papain-like protease peptide inhibitor VIR251 GXXGX 5 T 490 zf-C2H2_10 pdbhh F F 7lfz 3 C C R1AB_SARS2 ORF1ab IPRRNVATL 9 T 0.79 TbpB_A pdbhh T Viruses T 7lgj 2 E,F,G,H E,F,G,H 8x(Asp-Arg)-NH2 XXXXXXXXX 9 T 22 DUF3134 pdbhh F F 7lgq 2 E,F,G,H,I,J,K,L E,F,G,H,I,J,K,L 8x(Asp-Arg)-Asn XXXXXXXXN 9 T 2.5 Cytomega_TRL10 pdbhh F F 7lh5 44 SA,ZC BJ,DJ 50S ribosomal protein L10 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 173 F F F 7lh5 45 AD,TA DK,BK 50S ribosomal protein L11 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 147 F F F 7lhy 2 B B H3_CAEEL H3(7-20)K14ac ARKSTGGXAPRKQL 14 T 0.12 Sirohm_synth_M pdbpercent F Eukaryota T 7lib 1 A A Cyclic peptide ORD-TYR-LEU-LEU-PHI-TYR-THR-GLU-GMO-LYS-VAL-THR-MVA-THR-VAL-LYS XYLLXYTEXKVTXTVK 16 T 1.3 Peptidase_C98 pdbhh F T 7lin 2 B B TP53B_HUMAN P53BP1 PATPTASSSSSTTPT 15 T 33 Cryptochrome_C pdbhh F Eukaryota F 7lio 2 B,D C,D TP53B_HUMAN P53BP1 PATPTASSSSSTTPT 15 T 33 Cryptochrome_C pdbhh F Eukaryota F 7lix 1 A A A0A5J4YJY8_PORPP CaRSP1 MAAFVSGGCGVGGQRRAWPAKGAAVARTHACPTTMVVLPVARAGLAATAKKNQYMGTSVAPEIVLTDKGSDMSRKVKTEDKKVAADQAAAMGILANMSLYASLNPVKRMTYKAKEQAPAYVKKTGNPVEDFYPSSWRNMAPVISLSANRVAVAFEKIDAASNGVKANSNNKPFWKSGATTKNYVAPEAPAQSEPETLDDAAYQRYFPARIRNKAPAMEFRRPSFANTEDPSAYFMLQKETVPLRMALAEKLLTKLGRKGDGSEAEEEKVKPQKKAAKKDAKDDAKDDE 288 T 89 DUF6243 pdbhh F Eukaryota T 7liy 1 A A A0A5J4YX67_PORPP CaRSP2 MWEQQRPRRCEAPAAPSSRPAERRAAARRSRAQLRMKQDDYEQWKTEFAGGFPGGEAFYKKWIEEGAKGDVPALEEELQPRSPNKKPTIYEEQMISNRGQQKGVDPTWKTLLAGGFPGGEFFFKKWIGEGAQGEVPNLDADLQPGSGSAKKTGKKEDADKSSPGGIMTPGRIMVPSGLGEEEETVDKEAPRPELYNKYFSADRLHKAPEILFEYNKTKYDRVGVRYTEVTSKASERFFPKSRMNRAPVIEISYREGAVSTASVSLSMPEISGPPALPFPVPKGDVTTTMVTDPTTGRLKLEFKVDGAAVSAYSDPRATEAYNKMIRK 327 T 0.016 Fz pdbpssm F Eukaryota T 7lkb 3 C,F A,D CSP_PLAF7 CS NPDPNANPNVDPNAN 15 T 0.7 Cas_Cas7 pdbhh F Eukaryota F 7lkc 1 A,B A,B Keratinimicin A peptide moiety XXFXXXX 7 T 40 TraQ pdbhh F F 7lkg 3 C,F C,F CSP_PLAF7 CS NPDPNANPNVDPNAN 15 T 0.7 Cas_Cas7 pdbhh F Eukaryota F 7lki 3 C,F CCC,FFF Epitope III peptide GLY-ALA-PRO-THR-TYR-SER-TRP-GLY DRSGAPTYSWGANDK 15 T 5.7E-05 HCV_NS1 pdbhh F T 7lky 2 I,J,K,L,M,N,O,P J,I,K,L,M,N,O,P Peptidomimetic inhibitor UNC6641 GGVXKPLR 8 T 24 DUF104 pdbhh F T 7ll7 1 A A GLY-GLY-ALA-GLY-HIS-VAL-PRO-GLU-TYR-PHE GGAGHVPEYF 10 T 0.13 Endonuc-BglII unp F T 7ll7 2 B B VAL-CYS-ILE-GLY-THR-PRO-ILE-SER-PHE-TYR-CYS VCIGTPISFYC 11 T 0.41 KSHV_K1 pdbhh F T 7ll8 2 C,D C,D RFX-V1 XXXXXXXXXXXXXXXXXXXXXGXXXXXXXXXXXXXXXXXXXXGXXXXXXXXXX 53 F F F 7ll9 2 C,D,G,H C,D,G,H RFX-V2 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 58 F F F 7llk 1 A,E,I E,A,I O55774_9HIV1 Envelope glycoprotein gp120 AENLWVTVYYGVPVWRDADTTLFCASDAKAYETEKHNVWATHACVPTDPNPQEIHLDNVTEKFNMWKNNMVEQMHTDIISLWDQSLKPCVKLTPLCVTLHCTNVTSVNTTGDREGLKNCSFNMTTELRDKRQKVYSLFYRLDIVPINENQGSEYRLINCNTSACTQACPKVSFEPIPIHYCAPAGFAILKCKDEGFNGTGLCKNVSTVQCTHGIKPVVSTQLLLNGSLAEKNITIRSENITNNAKIIIVQLVQPVTIKCIRPNNNTRKSIRIGPGQAFYATGDIIGDIRQAHCNVSRSRWNKTLQEVAEKLRTYFGNKTIIFANSSGGDLEITTHSFNCGGEFFYCNTSGLFNSTWYVNSTWNDTDSTQESNDTITLPCRIKQIINMWQRAGQCMYAPPIPGVIKCESNITGLLLTRDGGKDNNVNETFRPGGGDMRDNWRSELYKYKVVKIEPLGVAPTRCKRRVVERRRRRR 474 T 9.7E-54 GP120 pdbpssm T Viruses T 7lma 4 D E RFA2_TETTS Telomerase holoenzyme Teb2 subunit MSNRVQGGFDNNSGNNQSAQKQQAEKIPQITVPLNCFMINQIVKAAKENPQAHSGNHYEWYGAFENAIITAKFEFLQSINDSPKIMGKLSDSTGCIEVVIQKSKMSDELPEFVQAYEIELQNNGNRHKYVRAMLKMRKNAQIQLLYFSIVNDANEISRHGLDLCLRYLQRKHGIEDFMHMTNDKAHNNHNASAQKVHYQIDRNQQPKEQVLELMRQILKHNPNDQIPKSKIIEFFQSQLNQVQINQILQQLVSANEIFSVGSDNYLLNV 269 T 0.012 HSM3_N pdb F Eukaryota T 7lma 6 F G TAP50_TETTS P50 MKLLLQNQNIFQKLKNTLNGCIKKFYDTYQDLEQMQKFEMIVEDKLLFRYSCSQSEMFSAQIQAHYLEKRVLQLTDGNVKYIVNFRDKGVLDKANFFDTPNNSLVIIRQWSYEIYYTKNTFQINLVIDEMRCIDIITTIFYCKLELDFTQGIKGISKSSSFSNQIYEYSAQYYKAIQLLKKLLINDSYISELYNSTKSKQQPRLFIFQSFKPKMNLAEQNLSRQFEQCQQDDFGDGCLLQIVNYTHQSLKQIENKNNSNQIVNGQNEISKKKRVLKSNEDLYKISLQKQLKIFQEEEIELHSQSTIRNQTNQQLETFESDTSKRNSEKILHSINELNTSKQKVNQMNSSQHQIQKLENNNLNKNILNQINENDIKNELEERQQQHLTQSFNSKAQLKKIITLKKNQDILLFKPQEQEGSKKY 422 T 0.093 DUF6235 pdb F Eukaryota T 7lmb 6 F E RFA2_TETTS Telomerase holoenzyme Teb2 subunit MSNRVQGGFDNNSGNNQSAQKQQAEKIPQITVPLNCFMINQIVKAAKENPQAHSGNHYEWYGAFENAIITAKFEFLQSINDSPKIMGKLSDSTGCIEVVIQKSKMSDELPEFVQAYEIELQNNGNRHKYVRAMLKMRKNAQIQLLYFSIVNDANEISRHGLDLCLRYLQRKHGIEDFMHMTNDKAHNNHNASAQKVHYQIDRNQQPKEQVLELMRQILKHNPNDQIPKSKIIEFFQSQLNQVQINQILQQLVSANEIFSVGSDNYLLNV 269 T 0.012 HSM3_N pdb F Eukaryota T 7lmb 8 H G TAP50_TETTS P50 MKLLLQNQNIFQKLKNTLNGCIKKFYDTYQDLEQMQKFEMIVEDKLLFRYSCSQSEMFSAQIQAHYLEKRVLQLTDGNVKYIVNFRDKGVLDKANFFDTPNNSLVIIRQWSYEIYYTKNTFQINLVIDEMRCIDIITTIFYCKLELDFTQGIKGISKSSSFSNQIYEYSAQYYKAIQLLKKLLINDSYISELYNSTKSKQQPRLFIFQSFKPKMNLAEQNLSRQFEQCQQDDFGDGCLLQIVNYTHQSLKQIENKNNSNQIVNGQNEISKKKRVLKSNEDLYKISLQKQLKIFQEEEIELHSQSTIRNQTNQQLETFESDTSKRNSEKILHSINELNTSKQKVNQMNSSQHQIQKLENNNLNKNILNQINENDIKNELEERQQQHLTQSFNSKAQLKKIITLKKNQDILLFKPQEQEGSKKY 422 T 0.093 DUF6235 pdb F Eukaryota T 7lmk 2 E,F,G,H F,G,H,I H4_HUMAN Histone H4 GAKRHRKVLRDNY 13 T 0.27 UPF0137 unp F Eukaryota T 7lmm 2 E,F,G,H F,G,I,H H4_HUMAN Histone H4 GAKRHRKVLRDNY 13 T 0.27 UPF0137 unp F Eukaryota T 7lmv 1 A,B,C,D,E,F,G,H,I,J,K,L A,B,C,D,E,F,G,H,I,J,K,L Integrin inhibitor AVVRFVFRGDLAELMLRAVKDHLKKEGPHWNITSRGNELVVRGIHESDAKRIQKEFPSVQSTIQAAAAAA 70 T 0.0011 NIR_SIR_ferr pdb F T 7lmx 1 A,B,C A,B,C Integrin inhibitor STKCVVRFVFRGDLATLMLRAVKDHLKKEGPHWNITSTNNGAELVVRGIHESDAKRIAKWVEKRFPGVHTETQCD 75 T 0.17 Pox_H7 pdb F T 7lmz 2 G G Hexa-ubiquitin XXXXXXXXX 9 F F F 7ln0 2 G G Hexa-ubiquitin XXXXXXXXX 9 F F F 7ln1 2 G G Hexa-ubiquitin XXXXXXXXX 9 F F F 7ln2 2 G G polyubiquitinated Ub-Eos XXXXXXXXXXXXXXXXXXXXXX 22 F F F 7ln3 2 G G polyubiquitinated Ub-Eos XXXXXXXXXXXXXXXXXXXXXX 22 F F F 7ln4 2 G G polyubiquitinated Ub-Eos XXXXXXXXXXXXXXXXXXXXXX 22 F F F 7ln5 2 G G polyubiquitinated Ub-Eos XXXXXXXXXXXXXXXXXXXXXX 22 F F F 7ln6 2 G G polyubiquitinated Ub-Eos XXXXXXXXXXXXXXXXXXXXXXX 23 F F F 7lo0 2 I,J,K,L,M,N,O,P,Q,R,S,T,U,V,W,X I,J,K,L,M,N,O,P,Q,R,T,U,V,W,X,Y TLK2_HUMAN HSHPK,PKU-ALPHA,TOUSLED-LIKE KINASE 2 EELHSLDPRRQELLEARFTGV 21 T 47 Plasmid_stab_B pdbhh F Eukaryota T 7lp2 2 B,D,F B,D,F AMOT_HUMAN Angiomotin MEHRGPPPEYPFKGM 15 T 1.2 GvpL_GvpF pdbhh F Eukaryota T 7lp3 2 B,D B,D AMOT_HUMAN Angiomotin EHRGPPPEYPFKGM 14 T 0.95 GvpL_GvpF pdbhh F Eukaryota T 7lpr 2 B P METHOXYSUCCINYL-ALA-ALA-PRO-LEUCINE BORONIC ACID INHIBITOR XAAPX 5 T 500 Suv3_C_1 pdbhh F F 7lq2 1 A,B,C,D,E,F B,C,F,E,D,A A0A023X3Z4_9ACTN RR RsiG GSHMRESAEEVWGGTEDLTSLSVEELKGLMARFDEEEKRISYRRRVMQGRIDVIRAEIVRRGGAVLSPEELARVLMGDVGDESE 84 T 0.019 DUF1192 pdb F Bacteria T 7lq3 1 A,B,C B,A,D A0A023X3Z4_9ACTN RsiG GSHMARESAEEVWGGTEDLTSLSVEELKGLMARFDEEEKRISYRRRVMQGRIDVIRAEIVRRGGAVLSPEELARVLMGDVGDESE 85 T 0.02 DUF1192 pdb F Bacteria T 7lq4 1 A,B D,T A0A023X3Z4_9ACTN RsiG MGEETYEGTSGREGGRHEEEVETRAARESAEEVWGGTEDLTSLSVEELKGLLARFDEEEKRISYRRRVIQGRIDVIRAEIVRRGGAVLSPEELARVLMGDVGDESEGGASGDRRGDGA 118 T 0.021 KfrA_N pdbpercent F Bacteria T 7lqe 1 A,AA,AB,B,BA,BB,C,CA,CB,D,DA,DB,E,EA,EB,F,FA,FB,G,GA,GB,H,HA,HB,I,IA,IB,J,JA,JB,K,KA,KB,L,LA,LB,M,MA,MB,N,NA,NB,O,OA,OB,P,PA,PB,Q,QA,QB,R,RA,RB,S,SA,SB,T,TA,TB,U,UA,V,VA,W,WA,X,XA,Y,YA,Z,ZA A,l,e,J,m,5,K,H,6,L,n,7,B,o,f,M,p,8,N,I,9,O,q,AA,C,r,g,P,s,BA,Q,a,CA,R,t,DA,D,u,h,S,v,EA,T,b,FA,U,w,GA,E,x,i,V,y,HA,W,c,IA,X,z,JA,F,0,Y,1,Z,d,j,2,G,3,k,4 KFE8 peptide XKFEFKFX 8 T 10 Macoilin pdbhh F F 7lqf 1 A,AA,AB,B,BA,BB,C,CA,CB,D,DA,DB,E,EA,EB,F,FA,FB,G,GA,GB,H,HA,HB,I,IA,IB,J,JA,JB,K,KA,KB,L,LA,LB,M,MA,N,NA,O,OA,P,PA,Q,QA,R,RA,S,SA,T,TA,U,UA,V,VA,W,WA,X,XA,Y,YA,Z,ZA A,j,f,I,k,3,J,H,4,K,l,5,B,m,g,L,n,6,M,a,7,N,o,8,C,p,h,O,q,9,P,b,AA,Q,r,BA,D,s,R,t,S,c,T,u,E,v,U,w,V,d,W,x,F,y,X,z,Y,e,Z,0,G,1,i,2 KFE8 peptide XKFEFKFX 8 T 10 Macoilin pdbhh F F 7lqg 1 A,AA,AB,AC,AD,AE,B,BA,BB,BC,BD,BE,C,CA,CB,CC,CD,CE,D,DA,DB,DC,DD,DE,E,EA,EB,EC,ED,EE,F,FA,FB,FC,FD,FE,G,GA,GB,GC,GD,GE,H,HA,HB,HC,HD,HE,I,IA,IB,IC,ID,IE,J,JA,JB,JC,JD,JE,K,KA,KB,KC,KD,KE,L,LA,LB,LC,LD,LE,M,MA,MB,MC,MD,ME,N,NA,NB,NC,ND,NE,O,OA,OB,OC,OD,P,PA,PB,PC,PD,Q,QA,QB,QC,QD,R,RA,RB,RC,RD,S,SA,SB,SC,SD,T,TA,TB,TC,TD,U,UA,UB,UC,UD,V,VA,VB,VC,VD,W,WA,WB,WC,WD,X,XA,XB,XC,XD,Y,YA,YB,YC,YD,Z,ZA,ZB,ZC,ZD A,o,BA,YA,e,HB,J,p,CA,ZA,vA,IB,K,q,DA,b,wA,JB,L,r,EA,aA,xA,KB,M,s,H,bA,yA,LB,N,t,FA,cA,zA,MB,O,E,GA,dA,0A,i,P,u,HA,eA,1A,NB,B,v,IA,fA,f,OB,Q,w,JA,gA,2A,PB,R,x,KA,c,3A,QB,S,y,LA,hA,4A,RB,T,z,I,iA,5A,SB,U,0,MA,jA,6A,TB,V,F,NA,kA,7A,W,1,OA,lA,8A,C,2,PA,mA,g,X,3,QA,nA,9A,Y,4,RA,d,AB,Z,5,SA,oA,BB,j,6,a,pA,CB,k,7,TA,qA,DB,l,G,UA,rA,EB,m,8,VA,sA,FB,D,9,WA,tA,h,n,AA,XA,uA,GB KFE8 peptide XKFEFKFX 8 T 10 Macoilin pdbhh F F 7lqh 1 A,AA,AB,AC,AD,AE,B,BA,BB,BC,BD,BE,C,CA,CB,CC,CD,CE,D,DA,DB,DC,DD,DE,E,EA,EB,EC,ED,EE,F,FA,FB,FC,FD,FE,G,GA,GB,GC,GD,GE,H,HA,HB,HC,HD,HE,I,IA,IB,IC,ID,IE,J,JA,JB,JC,JD,JE,K,KA,KB,KC,KD,KE,L,LA,LB,LC,LD,LE,M,MA,MB,MC,MD,ME,N,NA,NB,NC,ND,NE,O,OA,OB,OC,OD,P,PA,PB,PC,PD,Q,QA,QB,QC,QD,R,RA,RB,RC,RD,S,SA,SB,SC,SD,T,TA,TB,TC,TD,U,UA,UB,UC,UD,V,VA,VB,VC,VD,W,WA,WB,WC,WD,X,XA,XB,XC,XD,Y,YA,YB,YC,YD,Z,ZA,ZB,ZC,ZD A,o,BA,YA,e,HB,J,p,CA,ZA,vA,IB,K,q,DA,b,wA,JB,L,r,EA,aA,xA,KB,M,s,H,bA,yA,LB,N,t,FA,cA,zA,MB,O,E,GA,dA,0A,i,P,u,HA,eA,1A,NB,B,v,IA,fA,f,OB,Q,w,JA,gA,2A,PB,R,x,KA,c,3A,QB,S,y,LA,hA,4A,RB,T,z,I,iA,5A,SB,U,0,MA,jA,6A,TB,V,F,NA,kA,7A,W,1,OA,lA,8A,C,2,PA,mA,g,X,3,QA,nA,9A,Y,4,RA,d,AB,Z,5,SA,oA,BB,j,6,a,pA,CB,k,7,TA,qA,DB,l,G,UA,rA,EB,m,8,VA,sA,FB,D,9,WA,tA,h,n,AA,XA,uA,GB KFE8 peptide XKFEFKFX 8 T 10 Macoilin pdbhh F F 7lqi 1 A,AA,AB,AC,B,BA,BB,BC,C,CA,CB,CC,D,DA,DB,DC,E,EA,EB,EC,F,FA,FB,FC,G,GA,GB,GC,H,HA,HB,HC,I,IA,IB,IC,J,JA,JB,JC,K,KA,KB,KC,L,LA,LB,LC,M,MA,MB,MC,N,NA,NB,NC,O,OA,OB,OC,P,PA,PB,PC,Q,QA,QB,QC,R,RA,RB,RC,S,SA,SB,SC,T,TA,TB,TC,U,UA,UB,UC,V,VA,VB,VC,W,WA,WB,X,XA,XB,Y,YA,YB,Z,ZA,ZB A,y,YA,8A,K,z,ZA,9A,L,0,aA,d,M,1,bA,GB,N,G,cA,HB,B,5,dA,IB,R,6,eA,JB,S,7,fA,KB,T,8,b,LB,U,H,mA,MB,C,CA,nA,NB,Y,DA,oA,OB,Z,EA,pA,e,f,FA,qA,VB,g,I,rA,WB,D,JA,sA,XB,k,KA,tA,YB,l,LA,uA,ZB,m,MA,c,aB,n,J,1A,bB,E,QA,2A,cB,r,RA,3A,dB,s,SA,4A,t,TA,5A,u,a,6A,F,XA,7A KFE8 peptide XKFEFKFX 8 T 10 Macoilin pdbhh F F 7lqs 1 A A Alpha-conotoxin CIC CCSNPACQVQHSDLC 15 T 0.00012 Toxin_8 pdbhh F T 7lrw 1 A A Hact-2 CAPECRSFCPDQKCLKDCGCI 21 T 0.59 C_tripleX pdbhh F T 7lso 1 A A A0A0U1U1Y3_BOAPU L-Phenylseptin peptide FFFDTLKNLAGKVIGALT 18 T 0.091 TRM13 unp F Eukaryota T 7lsp 1 A A A0A0U1U1Y3_BOAPU D-Phenylseptin FXFDTLKNLAGKVIGALT 18 T 0.091 TRM13 unp F Eukaryota T 7lsv 1 A,B A,B Q5ZTM4_LEGPH Calmodulin-dependent protein kinase SNAELESEALGLQAYKNQMSKQQLLGEIQGFKENYWNMKDLLTLTNRHHLRVFLEYLDNICSAFKDDKTDEKSARAAYDFLNAQINKLFEDNSKNSKPSFESFSEDVQRFLIHIDTYLMKNPSACSNSIASTIQLLKQLDNKKSFNPEQSFKDFCSYKEITIQLLLKPFETPVAEMAS 178 T 0.016 Radial_spoke pdbpssm F Bacteria T 7lt3 4 D,N Q,R Unknown peptide XXXXXXXXXXXXXXXXXXXX 20 F F F 7lt7 1 A A Hact-3 FNPVGVAFKGNNGKYLSRIHRSGIDYTEFAKDNTD 35 T 0.093 Agglutinin pdbhh F T 7ltb 1 A,B A,B Keratinicyclin B peptide moiety XFXXXX 6 T 66 Kelch_3 pdbhh F F 7lu9 6 N,P,R d,e,f M4M097_9HIV1 CH505 GP120 ENLWVTVYYGVPVWKEAKTTLFCASDAKAYEKEVHNVWATHACVPTDPNPQEMVLKNVTENFNMWKNDMVDQMHEDVISLWDQSLKPCVKLTPLCVTLNCTNATASNSSIIEGMKNCSFNITTELRDKREKKNALFYKLDIVQLDGNSSQYRLINCNTSVITQACPKVSFDPIPIHYCAPAGYAILKCNNKTFTGTGPCNNVSTVQCTHGIKPVVSTQLLLNGSLAEGEIIIRSENITNNVKTIIVHLNESVKIECTRPNNKTRTSIRIGPGQWFYATGQVIGDIREAYCNINESKWNETLQRVSKKLKEYFPHKNITFQPSSGGDLEITTHSFNCGGEFFYCNTSSLFNRTYMANSTDMANSTETNSTRTITIHCRIKQIINMWQEVGRAMYAPPIAGNITCISNITGLLLTRDGGKNNTETFRPGGGNMKDNWRSELYKYKVVKIEPLGVAPTRCKRRV 461 T 8.4E-53 GP120 pdbpssm T Viruses T 7lua 1 A,C,E a,c,e A0A1W6IPB2_9HIV1 ENVELOPE GLYCOPROTEIN GP120 AENLWVTVYYGVPVWKEAKTTLFCASDARAYEKEVHNVWATHACVPTDPSPQELVLGNVTENFNMWKNDMVDQMHEDIISLWDQSLKPCVKLTPLCVTLICSNATVKNGTVEEMKNCSFNTTTEIRDKEKKEYALFYKPDIVPLSETNNTSEYRLINCNTSACTQACPKVTFEPIPIHYCAPAGYAILKCNDETFNGTGPCSNVSTVQCTHGIRPVVSTQLLLNGSLAEKEIVIRSENLTNNAKIIIVHLHTPVEIVCTRPNNNTRKSVRIGPGQTFYATGDIIGDIKQAHCNISEEKWNDTLQKVGIELQKHFPNKTIKYNQSAGGDMEITTHSFNCGGEFFYCNTSNLFNGTYNGTYISTNSSANSTSTITLQCRIKQIINMWQGVGRCMYAPPIAGNITCRSNITGLLLTRDGGTNSNETETFRPAGGDMRDNWRSELYKYKVVKIEPLGVAPTRCKRRVVGR 466 T 3.5E-53 GP120 pdbpssm T Viruses T 7lul 2 B B peptide RWYERWV 7 T 0.37 FRB_dom pdbhh F F 7luo 2 B B Skp2 Motif 1 uncharacterized fragment 1 XXXXXXXX 8 F F F 7luo 3 D D Skp2 Motif 1 uncharacterized fragment 2 XXXXXXX 7 F F F 7luu 1 A A A0A1L5BQA7_SPHIB Subclass B3 metallo-beta-lactamase MIATMTIAASLAISPAAAATGPEPEAMAAMDRAGGARASDDPLTRPMAVERAKEWLAPLPPERVFGNSYLVGFAGLSVALIDTGAGLVLIDGALPQAAPMILSNVRKLGFDPRDIKFILSTEPHYDHAGGIAALARDTGATVVASRRGAEGLRAGAHAKDDPQFDYGGAWPAVSRLRVMKDGEVLRIGRASITAHATPGHTMGSMTWSWNACEGKRCKAIVFASSLNPVSADRYRFTAPSSAPIVKGFEASYRRMGALKCDILISAHPDNAGAGRYGSGSGACRSYAERSRRLLAKRLAEERRETSK 307 T 0.53 Lactamase_B pdbpssm F Bacteria T 7lw7 1 A A EXO5_HUMAN EXO V, HEXO5, DEFECTS IN MORPHOLOGY PROTEIN 1 HOMOLOG SNALEDAQESKALVNMPGPSSESLGKDDKPISLQNWKRGLDILSPMERFHLKYLYVTDLATQNWCELQTAYGKELPGFLAPEKAAVLDTGASIHLARELELHDLVTVPVTTKEDAWAIKFLNILLLIPTLQSEGHIREFPVFGEVEGVLLVGVIDELHYTAKGELELAELKTRRRPMLPLEAQKKKDCFQVSLYKYIFDAMVQGKVTPASLIHHTKLCLEKPLGPSVLRHAQQGGFSVKSLGDLMELVFLSLTLSDLPVIDILKIEYIHQETATVLGTEIVAFKEKEVRAKVQHYMAYWMGHREPQGVDVEEAWKCRTCTYADICEWRKGSGVLSSTLAPQVKKAK 346 T 2.4999999999999997E-43 Exo5 pdbpercent F Eukaryota T 7lw8 1 A A EXO5_HUMAN EXO V, HEXO5, DEFECTS IN MORPHOLOGY PROTEIN 1 HOMOLOG SNALEDAQESKALVNMPGPSSESLGKDDKPISLQNWKRGLDILSPMERFHLKYLYVTDLATQNWCELQTAYGKELPGFLAPEKAAVLDTGASIHLARELELHDLVTVPVTTKEDAWAIKFLNILLLIPTLQSEGHIREFPVFGEVEGVLLVGVIDELHYTAKGELELAELKTRRRPMLPLEAQKKKDCFQVSLYKYIFDAMVQGKVTPASLIHHTKLCLEKPLGPSVLRHAQQGGFSVKSLGDLMELVFLSLTLSDLPVIDILKIEYIHQETATVLGTEIVAFKEKEVRAKVQHYMAYWMGHREPQGVDVEEAWKCRTCTYADICEWRKGSGVLSSTLAPQVKKAK 346 T 2.4999999999999997E-43 Exo5 pdbpercent F Eukaryota T 7lw9 1 A A EXO5_HUMAN EXO V, HEXO5, DEFECTS IN MORPHOLOGY PROTEIN 1 HOMOLOG SNALEDAQESKALVNMPGPSSESLGKDDKPISLQNWKRGLDILSPMERFHLKYLYVTDLATQNWCELQTAYGKELPGFLAPEKAAVLDTGASIHLARELELHDLVTVPVTTKEDAWAIKFLNILLLIPTLQSEGHIREFPVFGEVEGVLLVGVIDELHYTAKGELELAELKTRRRPMLPLEAQKKKDCFQVSLYKYIFDAMVQGKVTPASLIHHTKLCLEKPLGPSVLRHAQQGGFSVKSLGDLMELVFLSLTLSDLPVIDILKIEYIHQETATVLGTEIVAFKEKEVRAKVQHYMAYWMGHREPQGVDVEEAWKCRTCTYADICEWRKGSGVLSSTLAPQVKKAK 346 T 2.4999999999999997E-43 Exo5 pdbpercent F Eukaryota T 7lwa 1 A A EXO5_HUMAN EXO V, HEXO5, DEFECTS IN MORPHOLOGY PROTEIN 1 HOMOLOG SNALEDAQESKALVNMPGPSSESLGKDDKPISLQNWKRGLDILSPMERFHLKYLYVTDLAEQNWCELQTAYGKELPGFLAPEKAAVLDTGASIHLARELELHDLVTVPVTTKEDAWAIKFLNILLLIPTLQSEGHIREFPVFGEVEGVLLVGVIDELHYTAKGELELAELKTRRRPMLPLEAQKKKDCFQVSLYKYIFDAMVQGKVTPASLIHHTKLCLEKPLGPSVLRHAQQGGFSVKSLGDLMELVFLSLTLSDLPVIDILKIEYIHQETATVLGTEIVAFKEKEVRAKVQHYMAYWMGHREPQGVDVEEAWKCRTCTYADICEWRKGSGVLSSTLAPQVKKAK 346 T 1.6999999999999998E-43 Exo5 pdbpercent F Eukaryota T 7lwh 2 B B LATS1_HUMAN LARGE TUMOR SUPPRESSOR HOMOLOG 1,WARTS PROTEIN KINASE,H-WARTS PKFGTHHKALQEIRNSLLPFANE 23 T 0.049 EcoEI_R_C unp F Eukaryota T 7lwy 1 A,B B,A Q9JE95_9VIRU Capsid protein SNPRLTKVLDEMSKKPCVNINEIRKMIRNFQPQFIQPRNGNRPNAQPRTVDSFEWVVRIQSTVETQLLGATNTVPQQTLNLDISFTDDSTTITPASIPGSISMLDNSRHIPAIQSMIQNFKARYLGSLQDTAQLQSPQYPQLLAYLFGQLIAIKDRLDLFRPSNPLSLADALFGFTLAQNARPRYDDHRHAKACQGPLVIPAATNSDCGPCGFVQINANQGLTLPLGACLFVNPETVNDQSFQDFLWLIFATHHRMPNQMQNNWPFSLNIVSTCAAPGRQAPHAGELTDERVRLALDTGHRILLSMFNDDEETLRYYQRKGIETMFRPCCFYTEGGLLRKATRYVSMVPLNGLYYYNGATSYVVSPIHTDAHPGITAAIESFVDIMVLQAVFSFSGPKVVAAKVNASQIDAAMVFGPAVAEGDGFVYDPLRPAPPLSAFYTEFIHRPAEQRIFQMAMSQIYGSHAPLIIANVINSIHNCKTKIVNNKLRATFVRRPPGAPHLKADTAIINRFHDPELAYALGILADGIAPLDGSHEYNVLDELDYLFNGGDIRNCFGLNALNTRGLGQIVHIRPKREPGKRPRRGFYTTLDGQVHPVTQDAPLDEIYHWRDHGNLTRPYSCHILDSQGLEFADVSNGRSRGKILVVVNSPLKTCAAYQGPSFAPK 665 T 0.0092 HEPN_Swt1 pdbpercent T Viruses T 7lx4 1 A A Hact-SCRiP1 QSEFCGHDVGECVPPKLVCRPPTHECLHFPCPGYLKCCCYP 41 T 2.5 CLIP_SPH_mas pdbhh F T 7lxk 1 A A ALL12_ARAHY ALLERGEN ARA H I GKSSPYQKKTENPCAQRCLQSCQQEPDDLKQKACESRCTKLEYDPRCVYDPRGHTGTTN 59 T 0.27 PAN_2 pdbpercent F Eukaryota T 7lxm 1 A,E,I A,C,E HIV-1 Env glycoprotein gp120 MDAMKRGLCCVLLLCGAVFVSPSQEIHARFRRGARAENLWVTVYYGVPVWKDAETTLFCASDAKAYDTEKRNVWATHCCVPTDPNPQEIVLENVTENFNMWKNNMVEQMHTDIISLWDQSLKPCVKLTPLCVTLNCTDVNATNNTTNNEEIKNCSFNITTELRDKKKKVYALFYKLDVVPIDDNNSYRLINCNTSAITQACPKVSFEPIPIHYCAPAGFAILKCNDKKFNGTGPCKNVSTVQCTHGIKPVVSTQLLLNGSLAEEEIIIRSENITNNAKTIIVQLNESVEINCTRPNNNTRKSIRIGPGQWFYATGDIIGDIRQAHCNISRTKWNKTLQQVAKKLREHFNKTIIFNPSSGGDLEITTHSFNCGGEFFYCNTSELFNSTWNGTNNTITLPCRIKQIINMWQRVGQAMYAPPIEGKIRCTSNITGLLLTRDGGNNNTETFRPGGGDMRDNWRSELYKYKVVKIEPLGVAPTRCKRRVVERRRRRRA 493 T 9.1E-54 GP120 pdbpssm F T 7lxn 1 A,E,I A,C,E HIV-1 Env glycoprotein gp120 MDAMKRGLCCVLLLCGAVFVSPSQEIHARFRRGARAENLWVTVYYGVPVWKDAETTLFCASDAKAYDTEKRNVWATHCCVPTDPNPQEIVLENVTENFNMWKNNMVEQMHTDIISLWDQSLKPCVKLTPLCVTLNCTDVNATNNTTNNEEIKNCSFNITTELRDKKKKVYALFYKLDVVPIDDNNSYRLINCNTSAITQACPKVSFEPIPIHYCAPAGFAILKCNDKKFNGTGPCKNVSTVQCTHGIKPVVSTQLLLNGSLAEEEIIIRSENITNNAKTIIVQLNESVEINCTRPNNNTRKSIRIGPGQWFYATGDIIGDIRQAHCNISRTKWNKTLQQVAKKLREHFNKTIIFNPSSGGDLEITTHSFNCGGEFFYCNTSELFNSTWNGTNNTITLPCRIKQIINMWQRVGQAMYAPPIEGKIRCTSNITGLLLTRDGGNNNTETFRPGGGDMRDNWRSELYKYKVVKIEPLGVAPTRCKRRVVERRRRRR 492 T 9.1E-54 GP120 pdbpssm F T 7ly9 3 C,G,K G,D,K A0A0N9FF17_9HIV1 Envelope glycoprotein gp120 GLWVTVYYGVPVWREAKTTLFCASDAKSYEKEVHNVWATHACVPTDPNPQELVLENVTENFNMWKNDMVDQMHEDIISLWDQSLKPCVKLTPLCVTLNCSDAKVNATYKGTREEIKNCSFKATTELRDKKRREYALFYRLDIVPLSGEGNNNSEYRLINCNTSVITQICPKVTFDPIPIHYCAPAGYAILKCNNKTFNGTGPCNNVSTVQCTHGIKPVVSTQLLLNGSLAEEEIIIRSENLTDNVKTIIVHLNESVEITCTRPNNMTRKSVRIGPGQTFYALGDIIGDIRQPHCNISEIKWEKTLQRVSEKLREHFNKTIIFNQSSGGDLEITTHSFNCGGEFFYCNTSDLFFNKTFNETYSTGSNSTNSTITLPCRIKQIINMWQEVGRAMYAPPIAGNITCKSNITGLLLTRDGGGNNSTKETFRPGGGNMRDNWRSELYKYKVVEVKPLGIAPTECNRTVVQRRRRRR 471 T 3.2999999999999996E-53 GP120 pdbpssm T Viruses T 7lzj 1 A,B,C A,B,C A0A3S7W7I3_9CAUD Depolymerase MALYREGKAAMAADGTVTGTGTKWQSSLSLIRPGATIMFLSSPIQMAVVNKVVSDTEIKAITTSGAVVASTDYAILLSDSLTVDGLAQDVAETLRHYQSQETVIADAVEFFKSFDFDSLQNLANQIKADSESAESSAAAAAASESKAKTSEDNAKSSENAAKNSEVAAETTRDQIQQIIDNAGDQSTLVVLAQPDGFDSIGRVSSFAALRNLKPKKSGQHVLLTSYYDGWAAENKMPTGGGEFISSIGTATDDGGYIAAGPGYYWTRVVNNNSFTAEDFGCKTTATPPPNFNVLPAELFDNTAMMQAAFNLAISKSFKLNLSTGTYYFESSDTLRITGPIHIEGRPGTVFYHNPSNKANPKTDAFMNISGCSAGRISSINCFSNSYLGKGINFDRSVGDNRKLVLEHVYVDTFRWGFYVGEPECINQIEFHSCRAQSNYFQGIFIESFKEGQEYGHSAPVHFFNTICNGNGPTSFALGATYKTTKNEYIKVMDSVNDVGCQAYFQGLSNVQYIGGQLSGHGSPRNTSLATITQCNSFIIYGTDLEDINGFTTDGTAITADNIDAIESNYLKDISGAAIVVSSCPGFKIDSPHIFKIKTLSTIKLMNNTYNYEIGGFTPDEALKYNVWDANGLATNRISGVIHPRLVNSRLGINSVAFDNMSNKLDVSSLIHNETSQIVGLTPSTGSNVPHTRKMWSNGAMYSSTDLNNGFRLNYLSNHNEPLTPMHLYNEFSVSEFGGSVTESNALDEIKYIFIQTTYANSGDGRFIIQALDASGSVLSSNWYSPQSFNSTFPISGFVRFDVPTGAKKIRYGFVNSANYTGSLRSHFMSGFAYNKRFFLKIYAVYNDLGRYGQFEPPYSVAIDRFRVGDNTTQMPSIPASSATDVAGVNEVINSLLASLKANGFMSS 907 T 0.0099 Pectate_lyase_3 pdbhh T Viruses T 7m0q 1 A,B A,B Network hallucinated protein 0738_mod MGSSHHHHHHSSGLVPRGSHMNIQVSLQWEDPKKGKVFSHTVNIPPGGTAEQIADNILDMARSLQDEGWDKLTVQVTVNPGFPKETAMRVAAALKEAFEDRGLRLTSIETSGNSIHLKFRY 121 T 0.012 Pilus_CpaD pdb F T 7m10 2 B B VRK1_HUMAN VACCINIA-RELATED KINASE 1 PRVKAAQAGRQS 12 T 34 KGG pdbhh F Eukaryota T 7m12 1 A,B B,A Q9JE95_9VIRU Capsid protein SNPRLTKVLDEMSKKPCVNINEIRKMIRNFQPQFIQPRNGNRPNAQPRTVDSFEWVVRIQSTVETQLLGATNTVPQQTLNLDISFTDDSTTITPASIPGSISMLDNSRHIPAIQSMIQNFKARYLGSLQDTAQLQSPQYPQLLAYLFGQLIAIKDRLDLFRPSNPLSLADALFGFTLAQNARPRYDDHRHAKACQGPLVIPAATNSDCGPCGFVQINANQGLTLPLGACLFVNPETVNDQSFQDFLWLIFATHHRMPNQMQNNWPFSLNIVSTCAAPGRQAPHAGELTDERVRLALDTGHRILLSMFNDDEETLRYYQRKGIETMFRPCCFYTEGGLLRKATRYVSMVPLNGLYYYNGATSYVVSPIHTDAHPGITAAIESFVDIMVLQAVFSFSGPKVVAAKVNASQIDAAMVFGPAVAEGDGFVYDPLRPAPPLSAFYTEFIHRPAEQRIFQMAMSQIYGSHAPLIIANVINSIHNCKTKIVNNKLRATFVRRPPGAPHLKADTAIINRFHDPELAYALGILADGIAPLDGSHEYNVLDELDYLFNGGDIRNCFGLNALNTRGLGQIVHIRPKREPGKRPRRGFYTTLDGQVHPVTQDAPLDEIYHWRDHGNLTRPYSCHILDSQGLEFADVSNGRSRGKILVVVNSPLKTCAAYQGPSFAPK 665 T 0.0092 HEPN_Swt1 pdbpercent T Viruses T 7m22 1 A C UL128_HCMVA UL128 EECCEFINVNHPPERCYDFKMCNRFTVALRCPDGEVCYSPEKTAEIRGIVTTMTHSLTRQVVHNKLTSCNYNPLYLEADGRIRCGKVNDKAQYLLGAAGSVPYRWINLEYDKITRIVGLDQYLESVKKHKRLDVCRAKMGYMLQ 144 T 12 SH3_19 pdbhh T Viruses T 7m22 3 C E U131A_HCMVM UL131A QCQRETAEKNDYYRVPHYWDACSRALPDQTRYKYVEQLVDLTLNYHYDASHGLDNFDVLKRINVTEVSLLISDFRRQNRRGGTNKRTTFNAAGSLAPHARSLEFSVRLFAN 111 T 0.064 Prion pdbpercent T Viruses T 7m25 1 A A PawL-Derived Peptide PLP-13 TFGVVIAD 8 T 0.47 DUF3905 pdbhh F T 7m27 1 A A PawL-Derived Peptide PLP-16 GLFPYGPD 8 T 0.0096 LINES_C pdbhh F T 7m28 1 A A PawL-Derived Peptide PLP-22 GLPPYVD 7 T 2.5 DUF2119 pdbhh F T 7m29 1 A A PawL-Derived Peptide PLP-29 GYFPVGVD 8 T 1.7 PhnH pdbhh F T 7m2a 1 A A PawL-Derived Peptide PLP-38 GLYPYPD 7 T 1.4 Periviscerokin pdbhh F F 7m2b 1 A A PawL-Derived Peptide PLP-42 TFFNPVID 8 T 0.53 PsbK pdbhh F T 7m2c 1 A A PawL-Derived Peptide PLP-46 GYITPLD 7 T 3.1 Caleosin pdbhh F T 7m2p 2 B B Inhibitor 18 in bound form XVXX 4 T 2300 SWIM pdbhh F F 7m2t 1 A,B,C,D,E,F,G,H,I,IB,J,JB,K,KB,L,LB,M,MB,N,NB,O,OB,PB,QB,RB,SB,TB,UB,VB,WB A,B,C,D,E,F,G,H,I,BB,J,CC,K,DD,L,EE,M,FF,N,GG,O,HH,II,JJ,KK,LL,MM,NN,OO,PP COAT_STMV Coat protein MGRGKVKPNRKSTGDNSNVVTMIRAGSYPKVNPTPTWVRAIPFEVSVQSGIAFKVPVGSLFSANFRTDSFTSVTVMSVRAWTQLTPPVNEYSFVRLKPLFKTGDSTEEFEGRASNINTRASVGYRIPTNLRQNTVAADNVCEVRSNCRQVALVISCCFN 159 T 24 Csm4_C pdbhh T Viruses T 7m2u 10 J E RAD33_YEAST DNA repair protein RAD33 MSKSTNVSYERVELFENPKVPIEVEDEILEKYAESSLDHDMTVNELPRFFKDLQLEPTIWKLVRNEDVIIEGTDVIDFTKLVRCTCQLLILMNNLTVIDDLWSMLIRNCGRDVDFPQVALRDHVLSVKDLQKISNLIGADQSSGTIEMISCATDGKRLFMTYLDFGCVLGKLGYLKM 177 T 8.4E-13 Rad33 pdbpssm F Eukaryota T 7m2v 1 A,AA,B,C,D,DA,E,F,G,GA,H,I,K,LA,N,O,P,T,V,X A,L,B,C,E,O,J,M,HH,K,II,JJ,FX,NN,D,KK,G,H,GG,I COAT_STMV Coat protein MGRGKVKPNRKSTGDNSNVVTMIRAGSYPKVNPTPTWVRAIPFEVSVQSGIAFKVPVGSLFSANFRTDSFTSVTVMSVRAWTQLTPPVNEYSFVRLKPLFKTGDSTEEFEGRASNINTRASVGYRIPTNLRQNTVAADNVCEVRSNCRQVALVISCCFN 159 T 24 Csm4_C pdbhh T Viruses T 7m30 2 B C UL128_HCMVA UL128 EECCEFINVNHPPERCYDFKMCNRFTVALRCPDGEVCYSPEKTAEIRGIVTTMTHSLTRQVVHNKLTSCNYNPLYLEADGRIRCGKVNDKAQYLLGAAGSVPYRWINLEYDKITRIVGLDQYLESVKKHKRLDVCRAKMGYMLQ 144 T 12 SH3_19 pdbhh T Viruses T 7m30 4 D E U131A_HCMVM UL131A QCQRETAEKNDYYRVPHYWDACSRALPDQTRYKYVEQLVDLTLNYHYDASHGLDNFDVLKRINVTEVSLLISDFRRQNRRGGTNKRTTFNAAGSLAPHARSLEFSVRLFAN 111 T 0.064 Prion pdbpercent T Viruses T 7m3g 2 C,D C,D etelcalcetide XXXXXXXXX 9 F F F 7m3r 1 A,B,C,D,E,F,G,H,I,J,K,L,M,N,O,P,Q,R,S,T A,B,C,D,E,F,G,H,I,J,K,L,M,N,O,GG,HH,II,JJ,KK COAT_STMV Coat protein MGRGKVKPNRKSTGDNSNVVTMIRAGSYPKVNPTPTWVRAIPFEVSVQSGIAFKVPVGSLFSANFRTDSFTSVTVMSVRAWTQLTPPVNEYSFVRLKPLFKTGDSTEEFEGRASNINTRASVGYRIPTNLRQNTVAADNVCEVRSNCRQVALVISCCFN 159 T 24 Csm4_C pdbhh T Viruses T 7m3t 1 A,B,BA,C,CA,D,DA,E,EA,F,FA,H,I,J,K,L,M,O A,B,GG,C,HH,D,II,E,JJ,F,KK,H,I,J,K,L,M,O COAT_STMV Coat protein MGRGKVKPNRKSTGDNSNVVTMIRAGSYPKVNPTPTWVRAIPFEVSVQSGIAFKVPVGSLFSANFRTDSFTSVTVMSVRAWTQLTPPVNEYSFVRLKPLFKTGDSTEEFEGRASNINTRASVGYRIPTNLRQNTVAADNVCEVRSNCRQVALVISCCFN 159 T 24 Csm4_C pdbhh T Viruses T 7m3t 2 G G COAT_STMV Coat protein MGRGKVKPNRKSTGDNSNVVTMIRAGSYPKVNPTPTWVRAIPFEVSVQSGIAFKVPVGSLFSANFRTDSFTSVTVMSVRAWTQLTPPVNEYSFVRLKPLFKTGDSTEEFEGRASNINTRASVGYRIPSNLRQNTVAADNVCEVRSNCRQVALVISCCFN 159 T 24 Csm4_C pdbhh T Viruses T 7m3t 3 N N COAT_STMV Coat protein MGRGKVKPNRKSTGDNSNVVTMIRAGSYPKVNPTPTWVRAIPFEVSVQSGIAFKVPVGSLFSANFRTDSFTSVTVMSVRAWTQLTPPVNEYSFVRLKPLFKTGDSTEEFEGRASNINTRASVGYRIPTNLRQNTVAVDNVCEVRSNCRQVALVISCCFN 159 T 24 Csm4_C unphh T Viruses T 7m3u 1 A A PawS-Derived Peptide PDP-24 GFCWQHTCLPSGCADFPWPVGHQCFPD 27 T 0.098 DUF5763 pdbhh F T 7m4n 1 A,B A,B RN216_HUMAN RING FINGER PROTEIN 216,RING-TYPE E3 UBIQUITIN TRANSFERASE RNF216,TRIAD DOMAIN-CONTAINING PROTEIN 3,UBIQUITIN-CONJUGATING ENZYME 7-INTERACTING PROTEIN 1,ZINC FINGER PROTEIN INHIBITING NF-KAPPA-B GPEELAEKDDIKYRTSIEEKMTAARIRKCHKCGTGLIKSEGANRMSCRCGAQMCYLCRVSINGYDHFCQHPRSPGAPCQECSRCSLWTDPTEDDEKLIEEIQKEAEEEQKRKNGENTFKRIGPPLEKPVEKVQRVEAL 138 T 0.0044 Rhodanese_C pdbpercent F Eukaryota T 7m4o 1 A A RN216_HUMAN RING FINGER PROTEIN 216,RING-TYPE E3 UBIQUITIN TRANSFERASE RNF216,TRIAD DOMAIN-CONTAINING PROTEIN 3,UBIQUITIN-CONJUGATING ENZYME 7-INTERACTING PROTEIN 1,ZINC FINGER PROTEIN INHIBITING NF-KAPPA-B GPEELAEKDDIKYRTSIEEKMTAARIRKCHKCGTGLIKSEGANRMSCRCGAQMCYLCRVSINGYDHFCQHPRSPGAPCQECSRCSLWTDPTEDDEKLIEEIQKEAEEEQKRKNGENTFKRIGPPLEKPVEKVQRVEAL 138 T 0.0044 Rhodanese_C pdbpercent F Eukaryota T 7m50 1 A,B,BA,C,CA,D,DA,E,EA,F,FA,G,H,I,J,K,L,M,N,O A,B,GG,C,HH,D,II,E,JJ,F,KK,G,H,I,J,K,L,M,N,O COAT_STMV Coat protein MGRGKVKPNRKSTGDNSNVVTMIRAGSYPKVNPTPTWVRAIPFEVSVQSGIAFKVPVGSLFSANFRTDSFTSVTVMSVRAWTQLTPPVNEYSFVRLKPLFKTGDSTEEFEGRASNINTRASVGYRIPTNLRQNTVAADNVCEVRSNCRQVALVISCCFN 159 T 24 Csm4_C pdbhh T Viruses T 7m54 1 A,B,C,D,E,F,G,H,I,J,K,L,M,N,O,P,Q,R,S,T A,B,C,D,E,F,G,H,I,J,K,L,M,N,O,GG,HH,II,JJ,KK COAT_STMV Coat protein MGRGKVKPNRKSTGDNSNVVTMIRAGSYPKVNPTPTWVRAIPFEVSVQSGIAFKVPVGSLFSANFRTDSFTSVTVMSVRAWTQLTPPVNEYSFVRLKPLFKTGDSTEEFEGRASNINTRASVGYRIPTNLRQNTVAADNVCEVRSNCRQVALVISCCFN 159 T 24 Csm4_C pdbhh T Viruses T 7m57 1 A,B,C,D,E,F,G,H,I,IB,J,JB,K,KB,L,LB,M,MB,N,NB,O,OB,PB,QB,RB,SB,TB,UB,VB,WB A,B,C,D,E,F,G,H,I,BB,J,CC,K,DD,L,EE,M,FF,N,GG,O,HH,II,JJ,KK,LL,MM,NN,OO,PP COAT_STMV Coat protein MGRGKVKPNRKSTGDNSNVVTMIRAGSYPKVNPTPTWVRAIPFEVSVQSGIAFKVPVGSLFSANFRTDSFTSVTVMSVRAWTQLTPPVNEYSFVRLKPLFKTGDSTEEFEGRASNINTRASVGYRIPTNLRQNTVAADNVCEVRSNCRQVALVISCCFN 159 T 24 Csm4_C pdbhh T Viruses T 7m5d 59 GB 6 FME-PHE-PHE MFF 3 T 110 DUF4985 pdbhh F F 7m5f 1 A A CdiI MKEIKLMADYHCYPLWGTTPDDFGDISPDELPISLGLKNSLEAWAKRYDAILNTDDPALSGFKSVEEEKLFIDDGYKLAELLQEELGSAYKVIYHADY 98 T 0.052 RHH_5 pdbpercent F T 7m5l 2 D,E,F E,D,F Peptide mimetic (ACE)RQCSMTCFYHSK(NH2) with linker XRQCSMTCFYHSKX 14 T 0.56 Fer4_12 pdbhh F T 7m5m 2 D,E D,E Peptide mimetic (ACE)RQCSMTCFYHSK(NH2) with linker XRQCSMTCFYHSKX 14 T 0.56 Fer4_12 pdbhh F T 7m5n 2 D,E E,D Peptide mimetic (ACE)RQCSMTCFYHSK(NH2) with linker XRQCSMTCFYHSKX 14 T 0.56 Fer4_12 pdbhh F T 7m5t 1 A A De novo designed protein 0515 MDFTERLDRLVKYAKEIAKWYKESGDPDFANSVDNVLGHLENIRKAFKHGDPARAMDHVSNVVGSLDSIQTSFKQTGNPEIATRWQELTQEVRELYAYLG 100 T 0.0097 PEX11 pdb F T 7m5u 2 B B UNC5246 XFAFXSX 7 T 250 Glyco_transf_43 pdbhh F F 7m60 1 A,B C,B MLH1_HUMAN MUTL PROTEIN HOMOLOG 1 SANPRKRHRED 11 T 0.57 T_cell_tran_alt pdbhh F Eukaryota T 7m67 1 A A A0A5K4F6V0_SCHMA Schistocin-1 antimicrobial peptide GILDIKNKVSNLFKKIKGEKX 21 T 2.3 KxDL pdbhh F Eukaryota T 7m6t 4 D D Non-canonical peptide F3 ALQHLMDKWMAM 12 T 2 ATG16 pdbhh F T 7m73 1 A A A0A5K4F6V0_SCHMA Schistocin-2 antimicrobial peptide GILDIKNKVSNLFKKIKX 18 T 2 C_Hendra pdbhh F Eukaryota T 7m77 1 A A G4VEE0_SCHMA Schistocin-3 antimicrobial peptide GILDIKNKVSNLFX 14 T 6 DUF3484 pdbhh F Eukaryota T 7m79 1 A A G4VEE0_SCHMA Schistocin-4 antimicrobial peptide GILDILNKVSNLFX 14 T 0.25 Antimicrobial20 pdbhh F Eukaryota T 7m7a 1 A,B,C,D A,B,C,D Q5ZRA8_LEGPH Phosphoinositide 3-kinase MavQ SEFELRRQASMGLPKKALKESQLQFLTAGTAVSDSSHQTYKVSFIENGVIKNAFYKKLDPKNHYPELLAKISVAVSLFKRIFQGRRSAEERLVFDDEERLVGTLSISVDGFKGFNFHKESVPQESSAKEQVIPSTRTLIEKSFMEILLGRWFLDDDDGHPHNLSLAGDIDFDMFFYWFTIYMKEPRPAIGIPKTRVNLTVRDWEGFPNVKDSKPFHWPTYKNPGQETLPTVLPVQDKLVNLILEKTYPDPGQFEQLAHEPVAQEQKFAAALKILLTYQPEMIRKRLTELFGEMTLNYTSLDETDVALRNQYEKTFPHLCNENTNIKPFVDFIMNLYQMHYDNLYRVVVFYMGCENNGYGVPLPATNSALYHKPSFYKDIVEWARTQNITIFSKDDSSIKFDEDELRRRYHQVWRDAYAPTFRDLLHDSYSLTNKLLQQVSTFHVVLDEVEGKKPTDDTLTNAWELFGTMPELSLEKITPLISVDKDSKLRTALILLVEFTTQFHAVAKTYYQKDRKDLTEEDNLEFSEQLVQLYTNYNLKIRQSLAHTSTLAGEFNRIAVGLKQYTERANFQLHLTTTDEQMKEATVATT 590 T 0.12 SYF2 pdbpssm F Bacteria T 7m7x 1 A A CsrA-binding peptide XVCSELCWX 9 T 0.65 RPAP2_Rtr1 pdbhh F T 7m98 2 B B H4_HUMAN Histone H4 SGRGXGGKGLGKGGA 15 T 11 Shadoo unppercent F Eukaryota T 7mb9 2 C,D C,D R1AB_SARS2 ARG-GLU-PRO-MET-LEU-GLN REPMLQ 6 T 74 Phageshock_PspD pdbhh T Viruses T 7md4 1 A,B N,M INSR_HUMAN IR AAAKELEESSFRKTFEDYLHNVVFVPSPSR 30 T 0.00017 DUF4998 unphh F Eukaryota T 7md5 2 C,D M,N INSR_HUMAN IR AAAKELEESSFRKTFEDYLHNVVFVPSPSR 30 T 0.00017 DUF4998 unphh F Eukaryota T 7mdt 1 A,E,F A,C,E Q2N0S6_9HIV1 SU, GLYCOPROTEIN 120, GP120 MKRGLCCVLLLCGAVFVSPSQEIHARFRRGARAENLWVTVYYGVPVWKDAETTLFCASDAKAYETKKHNVWATHCCVPTDPNPQEIHLENVTEEFNMWKNNMVEQMHTDIISLWDQSLKPCVKLTPLCVTLQCTNVTNNITDDMRGELKNCSFNMTTELRDKKQKVYSLFYRLDVVQINENQGNRSNNSNKEYRLINCNTSAITQACPKVSFEPIPIHYCAPAGFAILKCKDKKFNGTGPCPSVSTVQCTHGIKPVVSTQLLLNGSLAEEEVMIRSENITNNAKNILVQFNTPVQINCTRPNNNTRKSIRIGPGQWFYATGDIIGDIRQAHCNVSKATWNETLGKVVKQLRKHFGNNTIIRFANSSGGDLEVTTHSFNCGGEFFYCNTSGLFNSTWISNTSVQGSNSTGSNDSITLPCRIKQIINMWQRIGQAMYAPPIQGVIRCVSNITGLILTRDGGSTNSTTETFRPGGGDMRDNWRSELYKYKVVKIEPLGVAPTRCKRRVVGRRRRRR 513 T 3.8E-54 GP120 pdbpercent T Viruses T 7mdu 2 B A Q2N0S6_9HIV1 SU, GLYCOPROTEIN 120, GP120 MKRGLCCVLLLCGAVFVSPSQEIHARFRRGARAENLWVTVYYGVPVWKDAETTLFCASDAKAYETEKHNVWATHACVPTDPNPQEIHLENVTEEFNMWKNNMVEQMHEDIISLWDQSLKPCVKLTPLCVTLQCTNVTNNITDDMRGELKNCSFNMTTELRDKKQKVYSLFYRLDVVQINENQGNRSNNSNKEYRLINCNTSAITQACPKVSFEPIPIHYCAPAGFAILKCKDKKFNGTGPCPSVSTVQCTHGIKPVVSTQLLLNGSLAEEEVIIRSENITNNAKNILVQLNTPVQINCTRPNNNTVKSIRIGPGQAFYYTGDIIGDIRQAHCNVSKATWNETLGKVVKQLRKHFGNNTIIRFAQSSGGDLEVTTHSFNCGGEFFYCNTSGLFNSTWISNTSVQGSNSTGSNDSITLPCRIKQIINMWQRIGQAMYAPPIQGVIRCVSNITGLILTRDGGSTNSTTETFRPGGGDMRDNWRSELYKYKVVKIEPLGVAPTRCKRRVVGRRRRRR 513 T 3.8E-54 GP120 pdbpercent T Viruses T 7mdx 4 E E Triacyl-lipoprotein XXXXAA 6 T 940 FliC_SP pdbhh F F 7mex 3 C D N-degron RHGSGSGAWLLPVSLVKRKTTLAPNTQTASPPSYRALADSLMQ 43 T 19 EZH2_N pdbhh F T 7mey 5 E F Monoubiquitinated N-degron RHGSGSGAWLLPVSLVKRKTTLAPNTQTASPPSYRALADSLMQ 43 T 19 EZH2_N pdbhh F T 7mfr 3 E,F P,Q GLY-GLY-MET GGM 3 T 81 Glyco_transf_4 pdbhh F F 7mgr 2 B B R1AB_SARS2 ALA-VAL-LYS-LEU-GLN-ASN-ASN-GLU AVKLQNNEL 9 T 5.2 Phospho_p8 pdbhh T Viruses T 7mgs 2 B B R1AB_SARS2 SER-ALA-VAL-LEU-GLN-SER-GLY-PHE SAVLQSGFR 9 T 12 GPAT_N pdbhh T Viruses T 7mgv 2 B,C V,U CdnA3 Leader peptide KEPFFAAFLEKQ 12 T 0.028 Inhibitor_I10 pdb F T 7mgv 3 E T CdnA3 Core peptide TLKYPSDSDEG 11 T 1.1E-05 Inhibitor_I10 pdbhh F T 7mhs 2 F G Unknown substrate XXXXXXXXXXXXXXXXXXXXXX 22 F F F 7mir 1 A A SIDJ_LEGPH Calmodulin-dependent glutamylase SidJ SGHVKQYYFARRGETSTHDTSLPPPVKVLSGRSIPLKEIPFEATRNELVQIYLTSIDKLIKSNKLNSIPSQQIASHYLFLRSLANSETDGIKKNQILSLAKPLGTYLASKEPHVWKMINELIEKSEYPIIHYLKNNRAHSNFMLALIHEYHKEPLTKNQSAFVQKFRDSSVFLFPNPIYTAWLAHSYDEDSSFNPMFRERLSTNFYHSTLTDNLLLRTEPKEVTLSSEHHYKKEKGPIDSSFRYQMSSDRLLRIQGRTLLFSTPQNDVVAVKVQKKGEPKSTLEEEFEMADYLLKHQRRLDVHSKLPQPLGQYSVKKSEILEISRGSLDFERFKTLIDDSKDLEVYVYKAPQSYFTYLHDKNQDLEDLTASVKTNVHDLFVLLREGIVFPQLADIFHTHFGEDEREDKGRYQALVQLLNVLQFQLGRIDKWQKAVEYVNLRSSGLADLGDSLPITSLFTSSDFTKHYFSELLTGGYHPTFFDKSSGTANSLFTGKRRLFGNYLYLNTIAEYLLVIQLTLGSYGDKVTRDMMDKPKKEAVWRELANVMFTSCAEAIHIMTGIPQSRALTLLKQRANIEKHFRQTQFWMTPDYSKLDEDTLQMEQYSIYSGEPEYEFTDKLVSGVGLSVDGVHQDLGGYNRESPLRELEKLLYATVTLIEGTMQLDKEFFKQLEQVEKILSGEIKTDANSCFEAVAQLLDLARPGCHFQKRLVLSYYEEAKLKYPSAPTDAYDSRFQVVARTNAAITIQRFWREARKN 756 T 0.29 IQ pdb F Bacteria T 7mis 1 A A SIDJ_LEGPH Calmodulin-dependent glutamylase SidJ SGHVKQYYFARRGETSTHDTSLPPPVKVLSGRSIPLKEIPFEATRNELVQIYLTSIDKLIKSNKLNSIPSQQIASHYLFLRSLANSETDGIKKNQILSLAKPLGTYLASKEPHVWKMINELIEKSEYPIIHYLKNNRAHSNFMLALIHEYHKEPLTKNQSAFVQKFRDSSVFLFPNPIYTAWLAHSYDEDSSFNPMFRERLSTNFYHSTLTDNLLLRTEPKEVTLSSEHHYKKEKGPIDSSFRYQMSSDRLLRIQGRTLLFSTPQNDVVAVKVQKKGEPKSTLEEEFEMADYLLKHQRRLDVHSKLPQPLGQYSVKKSEILEISRGSLDFERFKTLIDDSKDLEVYVYKAPQSYFTYLHDKNQDLEDLTASVKTNVHDLFVLLREGIVFPQLADIFHTHFGEDEREDKGRYQALVQLLNVLQFQLGRIDKWQKAVEYVNLRSSGLADLGDSLPITSLFTSSDFTKHYFSELLTGGYHPTFFDKSSGTANSLFTGKRRLFGNYLYLNTIAEYLLVIQLTLGSYGDKVTRDMMDKPKKEAVWRELANVMFTSCAEAIHIMTGIPQSRALTLLKQRANIEKHFRQTQFWMTPDYSKLDEDTLQMEQYSIYSGEPEYEFTDKLVSGVGLSVDGVHQDLGGYNRESPLRELEKLLYATVTLIEGTMQLDKEFFKQLEQVEKILSGEIKTDANSCFEAVAQLLDLARPGCHFQKRLVLSYYEEAKLKYPSAPTDAYDSRFQVVARTNAAITIQRFWREARKN 756 T 0.29 IQ pdb F Bacteria T 7mj3 1 A A Hact-4 QPRSHVDCPALHGQCQSLPCTYPLVFVGPDPFHCGPYPQFGCCA 44 T 5.3 Toxin_14 pdbhh F T 7mj5 1 A,N,O,P,Q,R A,N,O,P,Q,R A2IAB2_9NEOP Putative secreted salivary protein KPVEAEVAQPKLYQRGEGGNGMEPIPEDVLNEALNA 36 T 3.3 FixH unphh F Eukaryota T 7mj6 3 C C IBPL1_HUMAN IGFBP-RELATED PROTEIN 10,INSULIN-LIKE GROWTH FACTOR-BINDING-RELATED PROTEIN 4,IGFBP-RP4 LLLPLLPPL 9 T 1.9 CaM_bind pdbhh F Eukaryota F 7mj7 3 C C IBPL1_HUMAN IGFBP-RELATED PROTEIN 10,INSULIN-LIKE GROWTH FACTOR-BINDING-RELATED PROTEIN 4,IGFBP-RP4 LLLPLLPPLSP 11 T 5.3 CaM_bind pdbhh F Eukaryota F 7mj8 3 C C IBPL1_HUMAN IGFBP-RELATED PROTEIN 10,INSULIN-LIKE GROWTH FACTOR-BINDING-RELATED PROTEIN 4,IGFBP-RP4 LLLPLLPPLSPS 12 T 6.8 CaM_bind pdbhh F Eukaryota F 7mj9 3 C C IBPL1_HUMAN IGFBP-RELATED PROTEIN 10,INSULIN-LIKE GROWTH FACTOR-BINDING-RELATED PROTEIN 4,IGFBP-RP4 LLLPRLPPL 9 T 13 Sperm_Ag_HE2 pdbhh F Eukaryota F 7mja 3 C C PHX2B_HUMAN NEUROBLASTOMA PHOX,NBPHOX,PHOX2B HOMEODOMAIN PROTEIN,PAIRED-LIKE HOMEOBOX 2B QYNPIRTTF 9 T 4 Myb_DNA-bind_3 unp F Eukaryota T 7mjb 1 A,B A,B Nanoluc Luciferase MGSSHHHHHHSSGMVFTLEDFVGDWRQTAGYNLDQVLEQGGVSSLFQNLGVSVTPIQRIVLSGENGLKIDIHVIIPYEGLSGDQMGQIEKIFKVVYPVDDHHFKVILHYGTLVIDGVTPNMIDYFGRPYEGIAVFDGKKITVTGTLWNGNKIIDERLINPDGSLLFRVTINGVTGWQLCERILA 184 T 0.021 Lipocalin_7 pdbhh F T 7mkk 1 A,C,E,G B,A,E,G Q9W3W6_DROME SMALL OVARY,ISOFORM B,ISOFORM C MDTSMKEKVKAKLVEIRKFVPFIRRVRIDFQDTLSKVQGHRLDALVNLLDREDVSMSSLNKIEVIIDKLRTRFNPRIE 78 T 0.007 Merozoite_SPAM pdbpssm F Eukaryota T 7mkk 2 B,D,F,H C,D,F,H PANX_DROME PROTEIN SILENCIO SMEPKIKEDADNAMLDSLLADPFENNSP 28 T 15 DBB pdbhh F Eukaryota T 7mmm 1 A A LYT1_LYCER Toxin LyeTx 1 IWLTALKFLGKNLGKHLAKQQLAKLX 26 T 0.019 Cu pdbpercent F Eukaryota T 7mmy 1 A,B A,B PDP-23 GFCWHHSCVPSGTCADFPWPLGHQCFPD 28 T 0.46 DUF5763 pdbhh F T 7mn3 1 A A HTRSN_PHYTS Homotarsinin NLVSDIIGSKKHMEKLISIIKKCRX 25 T 7 Spore_IV_A unphh F Eukaryota T 7mnj 1 A,B,C A,B,C RBP2_HUMAN 358 KDA NUCLEOPORIN,NUCLEAR PORE COMPLEX PROTEIN NUP358,NUCLEOPORIN NUP358,RAN-BINDING PROTEIN 2,RANBP2,P270 DGWNKLFDLIQSELYVRPDDVHVNIRLVEVYRSTKRLKDAVAHCHEAERNIALRSSLEWNSCVVQTLKEYLESLQCLESDKSDWRATNTDLLLAYANLMLLTLSTRDVQESRELLQSFDSALQSVKSLGGNDELSATFLEMKGHFYMHAGSLLLKMGQHSSNVQWRALSELAALCYLIAFQVPRPKIKLIKGEAGQNLLEMMACDRLSQSGHMLLNLSRGKQDFLKEIVETFANKSGQSALYDALFSSQSPKDTSFLGSDDIGNIDVREPELEDLTRYDVGAIRAHNGSLQHLTWLGLQWNSLPALPGIRKWLKQLFHHLPHETSRLETNAPESICILDLEVFLLGVVYTSHLQLKEKCNSHHSSYQPLCLPLPVCKQLCTERQKSWWDAVCTLIHRKAVPGNVAKLRLLVQHEINTLRAQEKHGLQPALLVHWAECLQKTGSGLNSFYDQREYIGRSVHYWKKVLPLLKIIKKKNSIPEPIDPLFKHFHSVDIQASEIVEYEEDAHITFAILDAVNGNIEDAVTAFES 529 T 0.015 TPR_19 pdbpercent F Eukaryota T 7mnk 1 A,B,C,D A,B,C,D RBP2_HUMAN 358 KDA NUCLEOPORIN,NUCLEAR PORE COMPLEX PROTEIN NUP358,NUCLEOPORIN NUP358,RAN-BINDING PROTEIN 2,RANBP2,P270 SYEDQNSLLKMICQQVEAIKKEMQELKLNS 30 T 0.4 DUF5320 pdbhh F Eukaryota T 7moq 20 IA,V x,V Docking complex 1/2 protein XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 130 F F F 7moq 21 X X Q22T00_TETTS Docking complex 1 protein MRASSATKSQKDLKTLEEEYIHQSKKSNLLENDRKIFHKNAEETKNNNMQIIESLKKENKQLKTLRDELIANKRASTPGMSKTQGSLVSWSGDIKDENYWRRKFDEARHATKNKKSQLLQLQDKLNEVSDAKFGAVEESPLMRQIRILENRLDKVMIKFNEAQSIRKTYEQIVKRLKEERVGYDNQLAAIERSLKGKEHDFEELLLLAHDATHAKELAAAELKKYEHKKAAVRELRKTYIAEKRKAIEQREAVISRMEKKDKDNDDRNLEKSQANNLNELNNPQIEPQNHQDATFQRQKLNDYDEAFRKLYEATGVTDVNEIIQKFTTQDETSKSLKDLQREYQDTIDDKKKQRDDLKAGLNALKYEGNENPNRKQLDEIEKNVNNAVNKCDKAKLKYERVSKILVDVKAGIEHLYEKLEFYKLEGKPNIVITDETLVEGLSQIVEKMKLIFQPVKNDPSYNPEDFKQTAKGVSNYINLNLRDKSGRIESISKNIRVKLPEKDEEEVSNDEIEDDIDIETTTKLKQKYQAQAKQEKAARNKQKKQLGSTQQGRKV 555 T 0.051 CALCOCO1 pdbhh F Eukaryota T 7moq 22 Z Z Q233H6_TETTS Outer dynein arm docking complex protein oda protein MNENLEKKKKMEELEEYQRKFRNLESDRKAYAEETVALIKKQRGIVDKLKNENQQLKDIISKMNAQKIQQSNTMYGKPSSDSLVEELKQKIEVERRQQMEIEKHVVDFQKKIIEKRSNIGGYNAGAENDSSLAKQIKILENRLDKANQKFNEAIAVNKQLRQQIDSLRRERVIFDNLYKKLEKELHEKRKQMANIIETANTAYEERDRANDQIQNLKMLAKKESENFEKDLRELSHIMEKNKKALDYIKLTEKNRDDNKLNNDLLDSDKFARTTSQKLYKDRNVNQTQSEKIQRYEEDFAKIQAATKVNDFEKLVNTFIENEEKNFQTFKFVNELSNEIEELEKQIGELRSELDQYKGGSNMDIQYKRKIKEFEEVMTRAENKSESYEFKRHDAQKLINSLTNWIETLFNTIECDKKVAKELAGSHSVTDGNMMIFLAIIENKVNQIVQAFSAIDAQGANENYHTLLQNVSNLSTALMANKQRQDAPDNDEFEEEEGEGDRILNIEDFRKKALEKLDDRKQTQQSKKLPKAITNRRKR 538 T 0.0012 Imm30 pdbpssm F Eukaryota T 7mp3 2 E,F L,N bicyclic peptide B8 KFEGYDNEFP 10 T 4.6 DUF1284 pdbhh F T 7mpa 1 A A DWORF_HUMAN SERCA REGULATOR DWORF,DWARF OPEN READING FRAME,DWORF,SMALL TRANSMEMBRANE REGULATOR OF ION TRANSPORT 1 AMAEKAGSTFSHLLVPILLLIGWIVGCIIMIYVVFS 36 T 0.23 GNVR pdbpssm F Eukaryota T 7mq8 46 VA NR RRP12-like protein XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 861 F F F 7mq8 65 RB ST Nucleolar protein 14 MAKAKKVGARRKASGAPAGARGGPAKANSNPFEVKVNRQKFQILGRKTRHDVGLPGVSRARALRKRTQTLLKEYKERDKSNVFRDKXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXTEAELAKEEQEHLRKLEAERLRRMLXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXAEGNKAKLEKLFGFLLEYVGDLATDDPPDLTVIDKLVVHLYHLCQMFPESASDAIKFVLRDAMHEMEEMIETKGRAALPGLDVLIYLKITGLLFPTSDFWHPVVTPALVCLSQLLTKCPILSLQDVVKGLFVCCLFLEYVALSQRFIPELINFLLGILYIATPNKASQGSTLVHPFRALGKNSELLVVSAREDVATWQQSSLSLRWASRLRAPTSTEANHIRLSCLAVGLALLKRCVLMYGSLPSFHAIMGPLQALLTDHLADCSHPQELQELCQSTLTEMESQKQLCRPLTCEKSKPVPLKLFTPRLVKVLEFGRKQGSSKEEQERKRLIHKHKREFKGAVREIRKDNQFLARMQLSEIMERDAERKRKVKQLFNSLATQEGEWKALKRKKFKK 632 T 7.999999999999999E-26 Nop14 pdbpssm F T 7mq8 68 UB SX Unassigned peptides XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 177 F F F 7mq9 40 OA NR RRP12-like protein XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 861 F F F 7mq9 56 GB SX Unassigned peptides XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 177 F F F 7mq9 66 QB ST Nucleolar protein 14 MAKAKKVGARRKASGAPAGARGGPAKANSNPFEVKVNRQKFQILGRKTRHDVGLPGVSRARALRKRTQTLLKEYKERDKSNVFRDKXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXTEAELAKEEQEHLRKLEAERLRRMLXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXAEGNKAKLEKLFGFLLEYVGDLATDDPPDLTVIDKLVVHLYHLCQMFPESASDAIKFVLRDAMHEMEEMIETKGRAALPGLDVLIYLKITGLLFPTSDFWHPVVTPALVCLSQLLTKCPILSLQDVVKGLFVCCLFLEYVALSQRFIPELINFLLGILYIATPNKASQGSTLVHPFRALGKNSELLVVSAREDVATWQQSSLSLRWASRLRAPTSTEANHIRLSCLAVGLALLKRCVLMYGSLPSFHAIMGPLQALLTDHLADCSHPQELQELCQSTLTEMESQKQLCRPLTCEKSKPVPLKLFTPRLVKVLEFGRKQGSSKEEQERKRLIHKHKREFKGAVREIRKDNQFLARMQLSEIMERDAERKRKVKQLFNSLATQEGEWKALKRKKFKK 632 T 7.999999999999999E-26 Nop14 pdbpssm F T 7mqa 44 SA NR RRP12-like protein XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 861 F F F 7mqa 63 OB ST Nucleolar protein 14 MAKAKKVGARRKASGAPAGARGGPAKANSNPFEVKVNRQKFQILGRKTRHDVGLPGVSRARALRKRTQTLLKEYKERDKSNVFRDKXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXTEAELAKEEQEHLRKLEAERLRRMLXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXAEGNKAKLEKLFGFLLEYVGDLATDDPPDLTVIDKLVVHLYHLCQMFPESASDAIKFVLRDAMHEMEEMIETKGRAALPGLDVLIYLKITGLLFPTSDFWHPVVTPALVCLSQLLTKCPILSLQDVVKGLFVCCLFLEYVALSQRFIPELINFLLGILYIATPNKASQGSTLVHPFRALGKNSELLVVSAREDVATWQQSSLSLRWASRLRAPTSTEANHIRLSCLAVGLALLKRCVLMYGSLPSFHAIMGPLQALLTDHLADCSHPQELQELCQSTLTEMESQKQLCRPLTCEKSKPVPLKLFTPRLVKVLEFGRKQGSSKEEQERKRLIHKHKREFKGAVREIRKDNQFLARMQLSEIMERDAERKRKVKQLFNSLATQEGEWKALKRKKFKK 632 T 7.999999999999999E-26 Nop14 pdbpssm F T 7mqa 66 RB SX Unassigned peptides XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 228 F F F 7mqq 1 A A A0A5B0MRS6_PUCGR Stem rust effector protein AvrSr50 GPMARSLIKTDWSGSEYTILGANHYEEPNTGAAAQFPGTMAEDDGRSPYIVRKLRNSSGKRFYVFTDHPQQPIIWNPHEEIEIQFSRKYLIAVLTEFEADSKVFTHFARRQHRS 114 T 6.3 SelB-wing_3 pdbhh F Eukaryota T 7mrr 2 B B LEUPEPTIN XLLX 4 T 590 DNTTIP1_dimer pdbhh F F 7mrw 2 B B A0A2I0BSI4_PLAFO High molecular weight rhoptry protein 2 MIKVTIFLLLSIFSFNLYGLELNEKVSIKYGAEQGVGSADSNTKLCSDILKYLYMDEYLSEGDKATFEKKCHNVIGNIRNTFSNKNTIKEGNEFLMSILHMKSLYGNNNNNNAGSESDVTLKSLYLSLKGSQNTEGESEVPSDDEINKTIMNFVKFNKYLLDNSNDIKKVHDFLVLTSQSNENLLPNKEKLFEQIVDQIKYFDEYFFASGGKIKVKKGYLKYNFLDIYKQPVCSAYLHLCSRYYESVSIYIRLKKVFNGIPAFLDKNCRKVKGEEFKKLMDMELKHNHIVERFDKYIISDDLYYVNMKVFDLKNVDKIQVSKIDDINNLNIYEHKETMHLSAKNLSRYIDIKKELNDEKAYKQLMSAIRKYVTTLTKADSDITYFVKQLDDEEIERFLIDLNFFLYNGFLRITEDKHLINADDVSPSYINLYRSNNIVALYILKTQYEENKLSEYRAHKFYRRKRVSNITNDMIKKDFTQTNALTNLPNLDNKKTTEYYLKEYENFVENFQPDLHDIMKLQLFFTMAFKDCNVNQNFTETSKKLWFDLLYAYDKFGWFYIHPNEVINSINKTDFVRHVLVSRNFLLKNNDQLTFLETQVAKIVEIINLSLEVDKSPDSLDFSIPMNFFNHKNGYHVMNDDKLKLLTSYEYIDSIANNYFFLSEYKNDVFRTGNNFKLYFNLPNIYSLAYQLFNELAININVITNVPLKKYLKYNASYAYFTLMNMIGKNHDIYSKGSRFVYASYILGLVFFIESHIDIARLKPKDFFFMKQSLPIIDHVYHKDLKTLKKNCTLLTDFMKINKNSQNYSLTHTEEMIKILGLLTVTLWAKEGKKSVYYDDDVSLYRKLMVSCVFNGGETIQEKLANNIEKSCDISQYGIKSKNLKDMIDINLSIHKWNPAEIEKLAYSFVLSCKMQKLMYKPMNVEKLPLEDYYKLSLAPDMVKTYHCYKLGKQAAELLESIILKKKFVRFRVTDAIDVYDFFYIKKVLSSRIKKEYNEFLQDKRAFEKKELETILNNSPFSEEQTMKLINSYECHWFTSYENFRILWMHASSNLGTGTYLKNFFSELWQNIRFLFKSKLKIRDMEYFSGDISQMNLLDYYSPMVHSESHCQEKMQVLFITLRDSKEENRSEIAQKVKSAYYQCKLDYYKNHHSDFIHRIHPNDFLNNKVYVLKQPYYLMSNVPLNNPKKVSRLFVTEGTLEYLLLDKINIPECFGPCTKLHFNKVVIKESKQRIYDMTINNALVPEIQPYNRRKYMTIYINEAYIKNIVSDALTSEEIKRHDIQKGNIKICMGKSTYLTEPILTEEHFNLTHKPVYDFSSVKHNLKVFHMKNEHLVSEDPNDDCFINYPLATINLDISDPYKEISEDLIKNLYILKSS 1378 T 1.3 Crystall_2 pdbpercent F Eukaryota T 7mrw 3 C C W7JUX6_PLAFO High molecular weight rhoptry protein 3 MRSKHLVTLFIITFLSFSTVKVWGKDVFAGFVTKKLKTLLDCNFALYYNFKGNGPDAGSFLDFVDEPEQFYWFVEHFLSVKFRVPKHLKDKNIHNFTPCLNRSWVSEFLKEYEEPFVNPVMKFLDKEQRLFFTYNFGDVEPQGKYTYFPVKEFHKYCILPPLIKTNIKDGESGEFLKYQLNKEEYKVFLSSVGSQMTAIKNLYSTVEDEQRKQLLKVIIENESTNDISVQCPTYNIKLHYTKECANSNNILKCIDEFLRKTCEKKTESKHPSADLCEHLQFLFESLKNPYLDNFKKFMTNSDFTLIKPQSVWNVPIFDIYKPKNYLDSVQNLDTECFKKLNSKNLIFLSFHDDIPNNPYYNVELQEIVKLSTYTYSIFDKLYNFFFVFKKSGAPISPVSVKELSHNITDFSFKEDNSEIQCQNVRKSLDLEVDVETMKGIAAEKLCKIIEKFILTKDDASKPEKSDIHRGFRILCILISTHVEAYNIVRQLLNMESMISLTRYTSLYIHKFFKSVTLLKGNFLYKNNKAIRYSRACSKASLHVPSVLYRRNIYIPETFLSLYLGLSNLVSSNPSSPFFEYAIIEFLVTYYNKGSEKFVLYFISIISVLYINEYYYEQLSCFYPKEFELIKSRMIHPNIVDRILKGIDNLMKSTRYDKMRTMYLDFESSDIFSREKVFTALYNFDSFIKTNEQLKKKNLEEISEIPVQLETSNDGIGYRKQDVLYETDKPQTMDEASYEETVDEDAHHVNEKQHSAHFLDAIAEKDILEEKTKDQDLEIELYKYMGPLKEQSKSTSAASTSDEISGSEGPSTESTSTGNQGEDKTTDNTYKEMEELEEAEGTSNLKKGLEFYKSSLKLDQLDKEKPKKKKSKRKKKRDSSSDRILLEESKTFTSENEL 897 T 11 Phage_TAC_10 pdbhh F Eukaryota T 7msc 7 G 7 A0A3E0UTA6_MYCTX 50S ribosomal protein L37 MAKRGRKKRDRKYSKANHGKRPNS 24 T 21 Protamine_3 pdbhh F Bacteria T 7msh 7 G 7 A0A3E0UTA6_MYCTX 50S ribosomal protein L37 MAKRGRKKRDRKYSKANHGKRPNS 24 T 21 Protamine_3 pdbhh F Bacteria T 7msl 1 A A A0A377JKY9_HAEPA AcrIIC4 GSMAMKITSSNFATIATSENFAKLSVLPKNHREPIKGLFKSAVEQFSSARDFFKNENYSKELAEKFNKEAVNEAVEKLQKAIDLAEKQGIQF 92 T 1.3 Nif11 unphh F Bacteria T 7msm 7 G 7 A0A3E0UTA6_MYCTX 50S ribosomal protein L37 MAKRGRKKRDRKYSKANHGKRPNS 24 T 21 Protamine_3 pdbhh F Bacteria T 7mso 2 C,D E,F Cyclic Peptide Inhibitor ZO1-GLN-SER-TPO-45W-MLL XQSTXX 6 T 800 SARS_3b pdbhh F F 7msz 7 G 7 A0A3E0UTA6_MYCTX 50S ribosomal protein L37 MAKRGRKKRDRKYSKANHGKRPNS 24 T 21 Protamine_3 pdbhh F Bacteria T 7mt2 7 G 7 A0A3E0UTA6_MYCTX 50S ribosomal protein L37 MAKRGRKKRDRKYSKANHGKRPNS 24 T 21 Protamine_3 pdbhh F Bacteria T 7mt3 7 G 7 A0A3E0UTA6_MYCTX 50S ribosomal protein L37 MAKRGRKKRDRKYSKANHGKRPNS 24 T 21 Protamine_3 pdbhh F Bacteria T 7mt7 7 G 7 A0A3E0UTA6_MYCTX 50S ribosomal protein L37 MAKRGRKKRDRKYSKANHGKRPNS 24 T 21 Protamine_3 pdbhh F Bacteria T 7mto 1 A A SET_HUMAN HLA-DR-ASSOCIATED PROTEIN II,INHIBITOR OF GRANZYME A-ACTIVATED DNASE,IGAAD,PHAPII,PHOSPHATASE 2A INHIBITOR I2PP2A,I-2PP2A,TEMPLATE-ACTIVATING FACTOR I,TAF-I GTSEKEQQEAIEHIDEVQNEIDRLNEQASEEILKVEQKYNKLRQPFFQKRSELIAKIPNFWVTTFVNHPQVSALLGEEDEEALHYLTRVEVTEFEDIKSGYRIDFYFDENPYFENKVLSKEFHLNESGDPSSKSTEIKWKSGKDLTKRSSQTQNKASRKRQHEEPESFFTWFTDHSDAGADELGEVIKDDIWPNPLQYYLVPDM 204 T 6E-06 NAP pdb F Eukaryota T 7mu2 1 A,C A,C WIPI2_HUMAN WIPI2 MAGQLLFANFNQDNTSLAVGSKSGYKFFSLSSVDKLEQIYECTDTEDVCIVERLFSSSLVAIVSLKAPRKLKVCHFKKGTEICNYSYSNTILAVKLNRQRLIVCLEESLYIHNIRDMKVLHTIRETPPNPAGLCALSINNDNCYLAYPGSATIGEVQVFDTINLRAANMIPAHDSPLAALAFDASGTKLATASEKGTVIRVFSIPEGQKLFEFRRGVKRCVSICSLAFSMDGMFLSASSNTETVHIFKLETVGSGSFNQGRAFATVRLPFCGHKNICSLATIQKIPRLLVGAADGYLYMYNLDPQEGGECALMKQHRLDGSLET 324 T 0.00048 WD40 pdbpercent F Eukaryota T 7mu2 2 B,D B,D E7EVC7_HUMAN Autophagy-related protein 16-1 YAENEKDSRRRQARLQKELAEAAKE 25 T 0.0039 Macoilin unphh F Eukaryota T 7mu9 1 A A Q8PJC6_XANAC CARBOXYPEPTIDASE, VIPCD GSHMSDPRHPDNAMYNGAVSKLEALGERGGFANRKELEQAAGQIVFESKVSGLQRIDHVVPNKSGDGFFAVQGELTDPAMQRVFVDRNQAQNQPLENSSRQAAEE 105 T 0.068 DUF4369 pdb F Bacteria T 7muc 9 I,IA,IB,IC,ID,IE,IF,V,VA,VB,VC,VD,VE AN,CN,EN,GN,IN,KN,MN,BN,DN,FN,HN,JN,LN A0A2S6FAR3_LEGPN Neurogenic locus notch MLFLKIKTNQRTTMNILKPKAFLLASVFVLSISPAFAADGCCSKMGGINYCDSSAGRLVCNNGFYSTCYCTRHAVMDLQFLMGCCLWHGGVYPQLNSSGLVVCNDGYVSEECSLQKPVEQISVY 124 T 0.067 PAN_2 pdbpercent F Bacteria T 7muc 10 J,JA,JB,JC,JD,JE,JF,W,WA,WB,WC,WD,WE AU,CU,EU,GU,IU,KU,MU,BU,DU,FU,HU,JU,LU Unknown protein fragment XXXXXXXXX 9 F F F 7muc 11 CG,GG,K,KA,KB,KC,KD,KE,KF,QF,UF,X,XA,XB,XC,XD,XE,YF YX,ZX,AX,CX,EX,GX,IX,KX,MX,VX,WX,BX,DX,FX,HX,JX,LX,XX Unknown protein fragment XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 48 F F F 7mud 7 AA,CD,EB,G,IC,KA,MD,OB,Q,SC,UA,WD,YB CN,KN,FN,AN,IN,DN,LN,GN,BN,JN,EN,MN,HN A0A2S6FAR3_LEGPN Neurogenic locus notch MLFLKIKTNQRTTMNILKPKAFLLASVFVLSISPAFAADGCCSKMGGINYCDSSAGRLVCNNGFYSTCYCTRHAVMDLQFLMGCCLWHGGVYPQLNSSGLVVCNDGYVSEECSLQKPVEQISVY 124 T 0.067 PAN_2 pdbpercent F Bacteria T 7mud 8 BA,DD,FB,H,JC,LA,ND,PB,R,TC,VA,XD,ZB CU,KU,FU,AU,IU,DU,LU,GU,BU,JU,EU,MU,HU Unknown protein fragment XXXXXXXXX 9 T 160 FAD_oxidored pdbhh F F 7mue 3 F,G,H,I,J,K,L,M,N,O,P,Q,R,S,T,U,V,W AX,BX,CX,DX,EX,FX,GX,HX,IX,JX,KX,LX,MX,VX,WX,XX,YX,ZX Unknown protein fragment XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 48 F F F 7muq 9 AC,IE,L,LF,MA,OB,QC,QG,UF,VD,Y,ZA,ZE LN,BN,GN,DN,IN,KN,MN,FN,EN,AN,HN,JN,CN A0A2S6FAR3_LEGPN Neurogenic locus notch MLFLKIKTNQRTTMNILKPKAFLLASVFVLSISPAFAADGCCSKMGGINYCDSSAGRLVCNNGFYSTCYCTRHAVMDLQFLMGCCLWHGGVYPQLNSSGLVVCNDGYVSEECSLQKPVEQISVY 124 T 0.067 PAN_2 pdbpercent F Bacteria T 7muq 10 AB,AF,BC,JE,M,MF,NA,PB,RC,RG,VF,WD,Z JU,CU,LU,BU,GU,DU,IU,KU,MU,FU,EU,AU,HU Unknown protein fragment XXXXXXXXX 9 F F F 7muq 11 AA,BB,BF,CC,DD,ID,KE,N,OA,PD,QB,R,SC,TD,TG,WF,XD,YC HX,JX,CX,LX,WX,XX,BX,GX,IX,YX,KX,DX,MX,ZX,FX,EX,AX,VX Unknown protein fragment XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 48 F F F 7mus 8 DE,I,JA,KB,KF,OC,PG,SD,TF,V,VA,XB,YE BN,GN,IN,KN,DN,MN,FN,AN,EN,HN,JN,LN,CN A0A2S6FAR3_LEGPN Neurogenic locus notch MLFLKIKTNQRTTMNILKPKAFLLASVFVLSISPAFAADGCCSKMGGINYCDSSAGRLVCNNGFYSTCYCTRHAVMDLQFLMGCCLWHGGVYPQLNSSGLVVCNDGYVSEECSLQKPVEQISVY 124 T 0.067 PAN_2 pdbpercent F Bacteria T 7mus 9 EE,J,KA,LB,LF,PC,QG,TD,UF,W,WA,YB,ZE BU,GU,IU,KU,DU,MU,FU,AU,EU,HU,JU,LU,CU Unknown protein fragment XXXXXXXXX 9 F F F 7mus 10 AD,AF,GD,GE,K,LA,LD,MF,NB,QC,QD,RG,UD,VF,WC,X,XA,ZB WX,CX,XX,BX,GX,IX,YX,DX,KX,MX,ZX,FX,AX,EX,VX,HX,JX,LX Unknown protein fragment XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 48 F F F 7muv 5 CC,DA,DB,DD,DG,E,NE,OB,P,PA,QC,RF,YE KN,GN,IN,MN,DN,EN,AN,JN,FN,HN,LN,CN,BN A0A2S6FAR3_LEGPN Neurogenic locus notch MLFLKIKTNQRTTMNILKPKAFLLASVFVLSISPAFAADGCCSKMGGINYCDSSAGRLVCNNGFYSTCYCTRHAVMDLQFLMGCCLWHGGVYPQLNSSGLVVCNDGYVSEECSLQKPVEQISVY 124 T 0.067 PAN_2 pdbpercent F Bacteria T 7muv 6 EA,EB,EC,F,FD,OG,PB,PE,Q,QA,RC,SF,ZE GU,IU,KU,EU,MU,DU,JU,AU,FU,HU,LU,CU,BU Unknown protein fragment XXXXXXXXX 9 F F F 7muv 7 AF,BE,FA,FB,FC,G,GD,GE,ND,PG,QB,QE,R,RA,RD,SC,TF,VD BX,YX,GX,IX,KX,EX,MX,ZX,VX,DX,JX,AX,FX,HX,WX,LX,CX,XX Unknown protein fragment XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 48 F F F 7muw 6 DG,F,GA,GB,HC,ID,ME,R,RF,SA,SB,UC,YE DN,EN,GN,IN,KN,MN,AN,FN,CN,HN,JN,LN,BN A0A2S6FAR3_LEGPN Neurogenic locus notch MLFLKIKTNQRTTMNILKPKAFLLASVFVLSISPAFAADGCCSKMGGINYCDSSAGRLVCNNGFYSTCYCTRHAVMDLQFLMGCCLWHGGVYPQLNSSGLVVCNDGYVSEECSLQKPVEQISVY 124 T 0.067 PAN_2 pdbpercent F Bacteria T 7muw 7 G,HA,HB,IC,JD,NE,OG,S,SF,TA,TB,VC,ZE EU,GU,IU,KU,MU,AU,DU,FU,CU,HU,JU,LU,BU Unknown protein fragment XXXXXXXXX 9 F F F 7muw 8 AF,GE,H,IA,IB,JC,KD,KE,PE,PG,RD,T,TF,UA,UB,VD,WC,ZD BX,YX,EX,GX,IX,KX,MX,ZX,AX,DX,VX,FX,CX,HX,JX,WX,LX,XX Unknown protein fragment XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 48 F F F 7muy 8 AF,I,KA,LB,MC,MD,PE,QG,UF,V,XA,YB,YC BN,EN,GN,IN,KN,MN,AN,DN,CN,FN,HN,JN,LN A0A2S6FAR3_LEGPN Neurogenic locus notch MLFLKIKTNQRTTMNILKPKAFLLASVFVLSISPAFAADGCCSKMGGINYCDSSAGRLVCNNGFYSTCYCTRHAVMDLQFLMGCCLWHGGVYPQLNSSGLVVCNDGYVSEECSLQKPVEQISVY 124 T 0.067 PAN_2 pdbpercent F Bacteria T 7muy 9 BF,J,LA,MB,NC,ND,QE,RG,VF,W,YA,ZB,ZC BU,EU,GU,IU,KU,MU,AU,DU,CU,FU,HU,JU,LU Unknown protein fragment XXXXXXXXX 9 F F F 7muy 10 AC,AD,CE,CF,JE,K,MA,NB,NE,OC,OD,RE,SG,UD,WF,X,YD,ZA JX,LX,XX,BX,YX,EX,GX,IX,ZX,KX,MX,AX,DX,VX,CX,FX,WX,HX Unknown protein fragment XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 48 F F F 7mvt 1 A B NIC96_CHATD NUCLEAR PORE PROTEIN NIC96 SEPGKAHYFLAASGVDPGAAVRDLGALGLQAKTERTAASVGPAAGPSGVSTTGFGTGLGEVDVDTYLSNLQTKTTLSMIADGLERSARDFDAFLEENVTLEWEAQRKRIYQHFGIK 116 T 2.6 UPF0172 pdbhh F Eukaryota T 7mvu 2 B B NIC96_CHATD NUCLEAR PORE PROTEIN NIC96 SGTGLGEVDVDTYLSNLQTKTTLSMIADGLERSARDFDAFLEENVTLEWEAQRKRIYQHFGIK 63 T 0.19 UPF0172 pdbpssm F Eukaryota T 7mvv 2 B B NIC96_CHATD NUCLEAR PORE PROTEIN NIC96 SGTGLGEVDVDTYLSNLQTKTTLSMIADGLERSARDFDAFLEENVTLEWEAQRKRIYQHFGIK 63 T 0.19 UPF0172 pdbpssm F Eukaryota T 7mvv 3 C D NUP53_CHATD NUCLEAR PORE PROTEIN NUP53 SQQDGSLRSRKANLETGAFGKSTRRTRSKAATPAKRED 38 T 33 DUF6374 pdbhh F Eukaryota T 7mvv 4 D C NU145_CHATD N-NUP145 SATPLSGKAKVKSRSILPMYKLSPANASRLVTTPQKRAYGFSFSAYGSPTSPSSSASSTPGAFGQSILS 69 T 68 DUF3591 pdbhh F Eukaryota T 7mvw 1 A A NU188_CHATD NUCLEAR PORE PROTEIN NUP188 GPHNMATLTDRTYLPPLEDCLTGRTVILSWRLVASALEDADLARLTSPALSTFLRDGFVHELLKHPARVFEPKDLKQEFETKTSSIQTVAPGVDTIKKDALWLADAVAINQVAALRIVLIEYQTRAHSHLVLPLSTQDVANIQEAAGVGDAHASSILSLLNPASAVDAETMWCDFETEARRRERILATYLSERRSFTAAVDALVTFLLHSAPGQHKDLDSLRRALLKDAFAFDEDLDVPDRSKLLTMAPTYMNLVEDCIARAQALPAKLGESFKTEAFELDWLRTAITEAVHSLSIAFQALDLDTPYFAPHELLSEWFELMNSSLFLESILGFEVVADLAMPARSLVSAICLKMLNIDRTIQFLHDFDYPDGEEPYLLSSQTLNKIHTAVTNAVNSGVAASLPVAFAWSLIVHQMHLGYQERAERRDLLVNQRAQAGFELEFQPSASTPNRRRRNSAGSIVSLEASPYDDFLREQRLDNDIAPVEQIAMLATSRGQVYQVMSEMALCLGTTHEAAFRPAVGARARLVFQDLLKRSAYLIPYQDEPVFSLLAILATGRQYWDVTDALSASSLNQVYTDMLDDETLFTQFTMQAINRFPYEFNPFSVLCRVLAAALITNKDKADVVTGWLWRTPTLTVDWNPAWDRSYELCFEDENTNSFRLTRDVDLFGSASPARPRHLAAEERFIIPEGTLGRFVTDVGRTARLEFEHSALALLGKRLEVKAAEEICDSGMAPLDVDEQAEAVAMLATVLRAESLKSTAKGGDPEAPLKFLKEASRLLPHNKDILTVISDTIDGLVEKELLELDGPQIAVLASCLQFLHAALAVCPGRVWAYMSRCALIAGDARPGRLSRITGSLDMYAERFDLLSSAVKLFAALIDSAACSAVQRRAGSTALVSVRSAVENPWLGTSEKILSRVALAIAQAALDVYESTTTWRFRSELDRSILVRDVVGLMHKLVVHAHTLSSHLTSTLSPAAAHIISSFLTPPPSASSLRFQPLLGTLLVALITPRATLYPGQSRILAERVTSVLAFCTSLLRAADFLGQTHIPLQTHLFQSACLLARLPAANAVYRAPVLELLRALVEVAGRAANGSGEPPSLLGYLGSHAARSFISLVEGIDKPFGRVEHAVVTWRFFAAVIRN 1138 T 8.4E-19 Nup188 pdbpercent F Eukaryota T 7mvx 2 B B NIC96_CHATD NUCLEAR PORE PROTEIN NIC96 SGTGLGEVDVDTYLSNLQTKTTLSMIADGLERSARDFDAFLEENVTLEWEAQRKRIYQHFGIK 63 T 0.19 UPF0172 pdbpssm F Eukaryota T 7mvy 2 B B NIC96_CHATD NUCLEAR PORE PROTEIN NIC96 SGTGLGEVDVDTYLSNLQTKTTLSMIADGLERSARDFDAFLEENVTLEWEAQRKRIYQHFGIK 63 T 0.19 UPF0172 pdbpssm F Eukaryota T 7mvz 2 B B NIC96_CHATD NUCLEAR PORE PROTEIN NIC96 SGTGLGEVDVDTYLSNLQTKTTLSMIADGLERSARDFDAFLEENVTLEWEAQRKRIYQHFGIK 63 T 0.19 UPF0172 pdbpssm F Eukaryota T 7mvz 3 C C NU145_CHATD N-NUP145 SNASRLVTTPQKRAYGFSFSAYGSPTSPSSSASSTPGAFGQSILSSSINRGLNKSISASNLRRSLNVEDSILQPGAFSANSSMRLLGGPGSHKK 94 T 85 WSK pdbhh F Eukaryota T 7mw1 2 C,D D,C NUP35_HUMAN 35 KDA NUCLEOPORIN,MITOTIC PHOSPHOPROTEIN 44,MP-44,NUCLEAR PORE COMPLEX PROTEIN NUP53,NUCLEOPORIN NUP53 SDKSGAPPVRSIYDDISSPGLGSTPLTSRRQPNISVMQSPLVGVTSTPGTGQSMFSPASIGQPRKTTL 68 T 0.27 DUF4712 pdbpercent F Eukaryota T 7mwq 1 A,C A,C LHD29A53 SSIFLLSNVSEDAAQLAEELVREISKKEGTEVRFEKDDGFLTIEVKNLSEERLREIAKALQLIVDVANAERVVRERPGSNLAKKALEIILRAAEELAKLDLKASLKAAVRAAEKVVREQPGSNLAKKALEIILRAAEELAKLPDPEALKEAVKAAEKVVREQPGSELAKKALEIIERAAEELKKSPDPEAQKEAKKAEQKVREERPGS 208 T 0.0026 Activator-TraM pdb F T 7mwq 2 B,D B,D LHD29B53 TWQWVLINISEEARQLIEKAVRAISKKEGTEVHFEKDDGVLHIRVKNLHEKRAREIHKVAKLILEVAAAERIVRERPGSNLAKKALEIILRAAEELAKADVDAALEAAVRAAEKVVREQPGSNLAKKALEIILRAAEELAKLPDPEALKEAVKAAEKVVREQPGSELAKKALEIIERAAEELKKSPDPEAQKEAKKAEQKVREERPGS 208 T 0.0089 YfiO pdbpssm F T 7mwr 1 A A LHD101A54 GSNDEKEKLKELLKRAEELAKSPDPEDLKEAVRLAEEVVRERPGSNLAKKALEIILRAAEELAKLPDPEALKEAVKAAEKVVREQPGSNLAKKALEIILRAAAALANLPDPESRKEADKAADKVRREQPGSELAVVAAIISAVARMGVKMELHPSGNEVKVVIKGLHIKQQRQLYRDVREAAKKAGVEVEIEVEGDTVTIVVRG 204 T 0.034 YfiO pdbpssm F T 7mwr 2 B B LHD101B4 YEDECEEKARRVAEKVERLKRSGTSEDEIAEEVAREISEVIRTLKESGSSYEVICECVARIVAEIVEALKRSGTSEDEIAEIVARVISEVIRTLKESGSSYEVICECVARIVAEIVEALKRSGTSEEEIAEIVARVIQEVIRTLKESGSSYEVIRECLRRILEEVIEALKRSGVDSSEIVLIIIKIAVAVMGVTMEEHRSGNEVKVVIKGLHESQQEELLELVLRAAELAGVRVRIRFKGDTVTIVVRG 249 T 0.00012 mTERF pdb F T 7mx1 2 C,D C,E ACE-PRO-LEU-ALA-SER-TPO XPLAST 6 T 280 TruB_N pdbhh F T 7mx2 4 D D peptide portion of bisubstrate inhibitor MLGP 4 T 110 H2TH pdbhh F F 7mzv 1 A A PUS7_YEAST RNA PSEUDOURIDYLATE SYNTHASE 7,RNA-URIDINE ISOMERASE 7,TRNA PSEUDOURIDINE(13) SYNTHASE MSDSSEATVKRPLDAHVGPSENAAKKLKIEQRTQADGIHEADVGITLFLSPELPGFRGQIKQRYTDFLVNEIDQEGKVIHLTDKGFKMPKKPQRSKEEVNAEKESEAARRQEFNVDPELRNQLVEIFGEEDVLKIESVYRTANKMETAKNFEDKSVRTKIHQLLREAFKNELESVTTDTNTFKIARSNRNSRTNKQEKINQTRDANGVENWGYGPSKDFIHFTLHKENKDTMEAVNVITKLLRVPSRVIRYAGTKDRRAVTCQRVSISKIGLDRLNALNRTLKGMIIGNYNFSDASLNLGDLKGNEFVVVIRDVTTGNSEVSLEEIVSNGCKSLSENGFINYFGMQRFGTFSISTHTIGRELLLSNWKKAAELILSDQDNVLPKSKEARKIWAETKDAALALKQMPRQCLAENALLYSLSNQRKEEDGTYSENAYYTAIMKIPRNLRTMYVHAYQSYVWNSIASKRIELHGLKLVVGDLVIDTSEKSPLISGIDDEDFDEDVREAQFIRAKAVTQEDIDSVKYTMEDVVLPSPGFDVLYPSNEELKQLYVDILKADNMDPFNMRRKVRDFSLAGSYRTVIQKPKSLEYRIIHYDDPSQQLVNTDLDILNNTRAKESGQKYMKAKLDRYMPDKGGEKTAVVLKFQLGTSAYATMALRELMKLETSRRGDMCDVKENI 676 T 0.038 TruD pdbhh F Eukaryota T 7n0w 2 F G CA1A_CONAV Ribbon alpha-conotoxin AusIA SCCARNPACRHNHPCV 16 T 0.0034 Toxin_8 pdbhh F Eukaryota T 7n0y 2 F G CA1A_CONAV Globular alpha-conotoxin AusIA SCCARNPACRHNHPCV 16 T 0.0034 Toxin_8 pdbhh F Eukaryota T 7n10 1 A,C A,C A0A0H2UWN8_STRP3 Prx MLYIDEFKEAIDKGYILGDTVAIVRKNGKIFDYVLPHEKVRDDEVVTVERVEEVMVELDKLEHHHHHH 68 T 0.019 Mesothelin unppercent F Bacteria T 7n19 3 C,F,I,L C,F,I,L HST4 peptide GGIGSDNKVTRRGG 14 T 9.2 DUF3976 pdbhh F T 7n1j 2 B,D B,D Binder GGGDRRKEMDKVYRTAFKRITSTPDKEKRKEVVKEATEQLRRIAKDEEEKKKAAYMILFLKTLG 64 T 0.055 UQCC3 pdb F T 7n1k 1 A A Binder GGGDRRKEMDKVYRTAFKRITSTPDKEKRKEVVKEATEQLRRIAKDEEEKKKAAYMILFLKTLG 64 T 0.055 UQCC3 pdb F T 7n1n 1 A A A0A0H2UWN8_STRP3 Prx MLYIDEFKEAIDKGYILGDTVAIVRKNGKIFDYVLPHEKVRDDEVVTVERVEEVMVELDKLEHHHHHH 68 T 0.019 Mesothelin unppercent F Bacteria T 7n1p 57 EB Pp Chains: Pp MFK 3 T 100 PNTB_4TM pdbhh F F 7n20 1 A A CA1B_CONAN Alpha-conotoxin AnIB GGCCSHPACAANNQDYCX 18 T 0.0004 Toxin_8 pdbpssm F Eukaryota T 7n21 1 A A CA1B_CONAN Alpha-conotoxin AnIB GGCCSHPACAANNQDYC 17 T 2E-05 Toxin_8 pdbhh F Eukaryota T 7n22 1 A A CA1B_CONAN Alpha-conotoxin AnIB GGCCSHPACAANNQDYCX 18 T 0.0004 Toxin_8 pdbpssm F Eukaryota T 7n23 1 A A CA1B_CONAN Alpha-conotoxin AnIB GGCCSHPACAANNQDYC 17 T 2E-05 Toxin_8 pdbhh F Eukaryota T 7n27 2 B,D,F,H,J,L G,H,I,J,K,L inhibitor UNC6261 XXAFXA 6 T 400 ICAP-1_inte_bdg pdbhh F F 7n2c 59 GB Pp Polypeptide MFK 3 T 100 PNTB_4TM pdbhh F F 7n2d 1 A A ZN292_HUMAN zinc finger protein 292 FRNWQAYMQ 9 T 0.046 zinc_ribbon_11 unp F Eukaryota T 7n2e 1 A C CPEB3_HUMAN CPEB3 QIGLAQTQ 8 T 41 DUF5315 pdbhh F Eukaryota T 7n2f 1 A A CPEB3_HUMAN CPEB3 QIGLAQTQ 8 T 41 DUF5315 pdbhh F Eukaryota T 7n2g 1 A A CPEB3_HUMAN CPEB3 QIGLAQTQ 8 T 41 DUF5315 pdbhh F Eukaryota T 7n2h 1 A A prion New1p NYNNYQ 6 T 79 SIR4_SID pdbhh F F 7n2k 1 A A prion New1p NYNNYQ 6 T 79 SIR4_SID pdbhh F F 7n2p 3 C C A0A2R8Y7R8_HUMAN Ribonuclease H2 subunit B GQVMVVAPR 9 T 7.1 DUF45 pdbhh F Eukaryota T 7n2u 54 BB Pp Nascent peptide MFK 3 T 100 PNTB_4TM pdbhh F F 7n2v 57 EB Pp Nascent peptide MFK 3 T 100 PNTB_4TM pdbhh F F 7n2y 1 A A Apo-(GRAND CoilSerL16CL23C)3 EWEALEKKLAALESKCQALEKKCQALEKKLEALEHG 36 T 0.0003 Lebercilin pdb F T 7n2z 1 A A Pb(II)2-(GRAND CoilSerL16CL23C)3 EWEALEKKLAALESKCQALEKKCQALEKKLEALEHG 36 T 0.0003 Lebercilin pdb F T 7n30 54 BB Pp Nascent peptide MFK 3 T 100 PNTB_4TM pdbhh F F 7n31 54 BB Pp Polypeptide MFK 3 T 100 PNTB_4TM pdbhh F F 7n3o 1 A A Cas12k MSQITIQARLISFESNRQQLWKLMADLNTPLINELLCQLGQHPDFEKWQQKGKLPSTVVSQLCQPLKTDPRFAGQPSRLYMSAIHIVDYIYKSWLAIQKRLQQQLDGKTRWLEMLNSDAELVELSGDTLEAIRVKAAEILAIAMPASESDSASPKGKKGKKEKKPSSSSPKRSLSKTLFDAYQETEDIKSRSAISYLLKNGCKLTDKEEDSEKFAKRRRQVEIQIQRLTEKLISRMPKGRDLTNAKWLETLLTATTTVAEDNAQAKRWQDILLTRSSSLPFPLVFETNEDMVWSKNQKGRLCVHFNGLSDLIFEVYCGNRQLHWFQRFLEDQQTKRKSKNQHSSGLFTLRNGHLVWLEGEGKGEPWNLHHLTLYCCVDNRLWTEEGTEIVRQEKADEITKFITNMKKKSDLSDTQQALIQRKQSTLTRINNSFERPSQPLYQGQSHILVGVSLGLEKPATVAVVDAIANKVLAYRSIKQLLGDNYELLNRQRRQQQYLSHERHKAQKNFSPNQFGASELGQHIDRLLAKAIVALARTYKAGSIVLPKLGDMREVVQSEIQAIAEQKFPGYIEGQQKYAKQYRVNVHRWSYGRLIQSIQSKAAQTGIVIEEGKQPIRGSPHDKAKELALSAYNLRLTRRS 639 T 0.0027 RuvC_1 pdbhh F T 7n3p 1 A A Cas12k MSQITIQARLISFESNRQQLWKLMADLNTPLINELLCQLGQHPDFEKWQQKGKLPSTVVSQLCQPLKTDPRFAGQPSRLYMSAIHIVDYIYKSWLAIQKRLQQQLDGKTRWLEMLNSDAELVELSGDTLEAIRVKAAEILAIAMPASESDSASPKGKKGKKEKKPSSSSPKRSLSKTLFDAYQETEDIKSRSAISYLLKNGCKLTDKEEDSEKFAKRRRQVEIQIQRLTEKLISRMPKGRDLTNAKWLETLLTATTTVAEDNAQAKRWQDILLTRSSSLPFPLVFETNEDMVWSKNQKGRLCVHFNGLSDLIFEVYCGNRQLHWFQRFLEDQQTKRKSKNQHSSGLFTLRNGHLVWLEGEGKGEPWNLHHLTLYCCVDNRLWTEEGTEIVRQEKADEITKFITNMKKKSDLSDTQQALIQRKQSTLTRINNSFERPSQPLYQGQSHILVGVSLGLEKPATVAVVDAIANKVLAYRSIKQLLGDNYELLNRQRRQQQYLSHERHKAQKNFSPNQFGASELGQHIDRLLAKAIVALARTYKAGSIVLPKLGDMREVVQSEIQAIAEQKFPGYIEGQQKYAKQYRVNVHRWSYGRLIQSIQSKAAQTGIVIEEGKQPIRGSPHDKAKELALSAYNLRLTRRS 639 T 0.0027 RuvC_1 pdbhh F T 7n3t 2 C,D C,D Designed TrkA-binding miniprotein MSHHHHHHHHSENLYFQSGGGRDEIKERIFKAVVRAIVTGNPEQLKEAKKLLEKLKKLGRLDQDAKKFEKAIRQVEKRLRS 81 T 0.012 LCD1 pdb F T 7n50 1 A A Gasdermin SNXSRDTGDELMAALLAEGINLILPPRDNIAPGDLIIADPQGGARLGGWHEVFNLQLSPEVATDPGFKSFQFRASSILQVGVAASVMGRVLQALGLGSGSFSSAFSSSNADTIQLSIVAPANKELTNFDAVLVQMNEAKAEPAQGYTDRNFFVVTKVWRARGIRISVADKSKKQVDLSAKAVEELTAKAKMELKREDTGSYAFLAASQLIFGLTLREVTYKDGAIVDVAPTGPLKFRGKGPGDPFAFIGDDAFVDLPES 259 T 0.0034 Gasdermin pdbhh F T 7n51 1 A A A0A2T4VDM4_9DELT Gasdermin SGLXSDPAITYLKRLGYNVVRLPREGIQPLHLLGQQRGTVEYLGSLEKLITQPPSEPPAITRDQAAAGINGQKTENLSFSIGINILKSVLAQFGAGAGIEAQYNQARKVRFEFSNVLADSVEPLAVGQFLKMAEVDADNPVLKQYVLGNGRLYVITQVIKSNEFTVAAEKSGGGSIQLDVPEIQKVVGGKLKVEASVSSQSTVTYKGEKQLVFGFKCFEIGVKNGEITLFASQPGAIAMALDAAGGVMPSDSALLDEGGLLDLEGF 266 T 0.00083 Gasdermin pdbpercent F Bacteria T 7n52 1 A,B,C,D A,B,C,D Gasdermin SEXNDPFVVALKDKGYSLVAYPKTSIRPLHIYEHTIKNAFKRIWIQSEAQPTSGFIKSLFSDKIHGAIGLSDGQGIDIDLRKTNSLSSAVAAKILESYFQDSAPSFDLAFENSSSVIFHIEEIITTDADEISLRNWLNDNQNELREIYKEEIKKGNFFVATSLLRAKKMRMQFERKNKGELGVDVSKIKNLPVDAKLESKIEGSTYDRLVFETPDEGIVFGVKLVRLFFSDNGILTIDKKQDFNRVLGENMALNLFTEIQDAGFIEVT 268 T 0.13 AKAP95 pdb F T 7n5c 3 C C PA_I97A1 RNA-DIRECTED RNA POLYMERASE SUBUNIT P2 SSLCNFRAYV 10 T 3.9 P34-Arc pdbhh T Viruses T 7n5p 3 C C PA_I97A1 RNA-DIRECTED RNA POLYMERASE SUBUNIT P2 SSLCNFRAYV 10 T 3.9 P34-Arc pdbhh T Viruses T 7n5q 3 C,F C,H PA_I97A1 RNA-DIRECTED RNA POLYMERASE SUBUNIT P2 SSLCNFRAYV 10 T 3.9 P34-Arc pdbhh T Viruses T 7n61 4 K,L 0K,0L A0A2K3DV98_CHLRE FAP239 MPPQLGREVQERVKVYGPLNELTYEGRLLTQTLQDELNRSISAPAGPRSPWYEGDPELESMRERVRQQRAIREAQRRRDHAALTASIQKRNLQEEQRRDAMLGSLLGDVIGGLTDPNSPLAEAEAALSHADKVRRKKKESLHNEWSTQVFDTIQGRLQAAVDARDPAAIESRLKTQYDQYLHTTNTKVAVFRDVIIEQDYNPLAAADAAIRVPTGDIRDPLKRDVLKGEYERRLMTGGRGGGGASPTGRGGAAAAGAGSIYGPLGKETLGTQQWGELAVKATPYGHCTDGQGGYVARPLSGSAVALRASRVPMDHYDYPVGNAAAAAEVPPGKRIVPGPEQRRGRQDLFDVVQHTVHLKPQGYTGGDQWLEHKGKGNAPGPEQRRGRRDLADVLQQKAVADGPRGTSAPARGDQLQHKEQGDAWLDAKGKRRVEGPEMRRGRQGLYETLQQTSNPYQGGNKVGDAWLEHKGRKVQPRPEPEAAAALSAVPPLPTVRPPRVGDDKKYAVNIEAAMGQMTVKDGAKVTGW 528 T 0.42 Histone pdbpssm F Eukaryota T 7n61 6 O,P 0O,0P FLAGELLAR ASSOCIATED PROTEIN MEGAAGPSGFRNVEPLSRQERAAARDKDLLEKSRLQARNRGGPLKQPENVVGNPVMPARNAPAFCDEYDRFNRDVAGEMNAKKQQNLQKKEEVYAVKRAEQYHRERSNWETQAQAAAREAARLEASRTTGTGAKRNQGSESYNIISLNYNNSSGGQQLAAKDTAVKEARQARAVNLYSKSHSVSHNIITGEPIKFPTAGKE 201 T 0.36 VIR_N pdbpssm F T 7n61 8 S 0T Unassigned protein-1 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 169 F F F 7n61 9 T 0U Unassigned protein-2 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 170 F F F 7n61 10 U 0V Unassigned protein-3 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 160 F F F 7n61 11 V 0W Unassigned protein-4 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 39 F F F 7n61 15 GA 1H Unassigned protein-5 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 88 F F F 7n61 16 HA,IE 1M,1L Unassigned protein-6 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 174 F F F 7n65 1 A,E,I A,E,I A0A6H1VCM1_9PLVG ENV POLYPROTEIN MDAMKRGLCCVLLLCGAVFVSPSQEIHARFRRGARAENLWVTVYYGVPVWKDAETTLFCASDAKAYETEKHNVWATHACVPTDPNPQEIHLENVTEEFNMWKNNMVEQMHTDIISLWDQSLKPCVKLTPLCVTLQCTNVTNNITDDMRGELKNCSFNMTTELRDKKQKVYSLFYRLDVVQINENQGNRSNNSNKEYRLINCNTSAITQACPKVSFEPIPIHYCAPAGFAILKCKDKKFNGTGPCPSVSTVQCTHGIKPVVSTQLLLNGSLAEEEVMIRSENITNNAKNILVQFNTPVQINCTRPNNNTRKSIRIGPGQAFYATGDIIGDIRQAHCNVSKATWNETLGKVVKQLRKHFGNNTIIRFANSSGGDLEVTTHSFNCGGEFFYCNTSGLFNSTWISNTSVQGSNSTGSNDSITLPCRIKQIINMWQRIGQAMYAPPIQGVIRCVSNITGLILTRDGGSTNSTTETFRPGGGDMRDNWRSELYKYKVVKIEPLGVAPTRCKRRV 508 T 4.1E-54 GP120 pdbpercent T Viruses T 7n6g 9 LA 1Q A0A2K3E6N2_CHLRE FAP297 MPPLPTLWDPAGAVDKLPQPFRMIDKILADIVEQVVDMIGTRESQRRAEDASRVDLTFAPHAVMEVAPETCCFLPIGIAGIAAVAMPDGEVQVRSARDPSICFSDRTHTSPVTAMEAGMAVCTSGRLLAAASRTALTLHEVDIKTCEISLLATVPLPADPDDTSPPVRLHWSDNLGHLAVCRRSGALALFTLSLPPPNVSLESVGAFVALSLKAFGETRVEELLRVPAATVRAFCLGSAAVGPSNLVWQLQQRPDKSPHSDRYHKSARGAYVWWEGANRLLLLDFEAAAGAAAAAGGAGGAVVPPPEVEAAVKAGAAGTSKPSTPATKAASSKSVAPPGEASSLLTASASAAAAAYAGPERPPEVPQIAPMARDWLLPHDVTAAATTSDHKTMAWGLADGSVVIWDDRSCCSTKVLPRLKGGITALSWVNGVAHKLVCASAGGHIFIADVIKPEDSSQKPYEFPQAIHEVHTLPNEPFALCICRGHTSSDGEIASTHRGGRGGPSILSTMSGHGPGGGGHDVMRVPPQRPRVFWYNVLEEKPVAELMGPRAEQGFGLACCVPPPPSLAPPPPRPDTAATDAGAADGKSAAASAAATPAPGAAPKGKGGAAAPAPPSGGGGGGAAPAASQEAPSEPAMSEAQRALIKAALDAMGANSTTPVLVPVKLHVPAGQVVSYPACVFRDTYLLAGGDVVDKVVRSNFADDDAVPQRVTQLYMYKVDALLRHLLPEDESTSRLGKVVLDRLLADLEAPKMNRKKGKRVKMDVEDPDAPKLSSAMRKPDPFDTTGSRPGSRAANRHITFGGGDADGLFDDVLEGGRAKPKKETKVFPKETKKGLKTGPGDVKMIDKNASGRPRLAPLDLEKAKEAPLPFSERSQSPPWHHTNPLARIHPDWEEAPVLVRIMDRIGSKGGGRKRRDKRLEALTTELMTKYSKEAGAKPNLLVPT 945 T 0.0003 Lgl_C pdbhh F Eukaryota T 7n6g 10 MA,NA 1R,1S A0A2K3DQN7_CHLRE FAP108 MPLYFEEVAPDPKAKKERDAKQQRPAILVERKGPPPAPMHLESQVIPTLIRKVGDWKTGRISQAMCEAYLDRHTLVFDRELLTKLFKEADYQKEGSLDTRALTIAIAGRFPKREHTPEWRLLTALLLGLPELVLTTDAEVTTLRTTHERPVGGGTYNSGNFWDSPPPPLPPVRRRTGSGRSTVGKVTAHEPSPEWLDTLNRTAAAASMSAGGSPSASMAGSFAGAASLNASMLRTGSVGAMDPGGAGVVGTTGGLKQTTQIADEARLNAALMGGAASTFATQREFADWSRGLEVMPRLAADTAGPGPGSEFGGGVRTATHLGSPKAPVRVWAAPLPPSAISLPSSALRTLRETVRSTASTKPDFVKGVKPLDSHELDLKKTLGEPLDVGMSLARVEPVRDTKVLPNADYVTWGDYAANCRTGPTGWYSKHPTAQAQDTGEHKYPWC 446 T 0.61 EF-hand_8 pdbhh F Eukaryota T 7n6g 14 XA,YA 2F,2G A8J870_CHLRE FLAGELLAR ASSOCIATED PROTEIN MAAKGKQQWDFLKADANTPASPAHYYEPLNAKKEGEFKPGWNTKRRGPAWEAERQAAIMTKEQKNIGCVALRSERLNNAQQQSGFNPIAHTERAADGSWVPATNAWMHQKVGVKQQDPRAAAADTLKHQAEGASRAAAIAEMRKERIAAGGASRPAAGGGVKDALTWG 168 T 0.17 Nop25 pdbpssm F Eukaryota T 7n6g 15 ZA 2H Unknown protein XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 196 F F F 7n6g 16 AB 2I Unknown protein XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 108 F F F 7n6g 17 BB,CB 2J,2K Unknown protein XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 100 F F F 7n6g 18 DB 2L Unknown protein XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 54 F F F 7n6g 19 EB 2M Unknown protein XXXXXXXXXXXXXXXXXX 18 F F F 7n6g 20 FB 2N Unknown protein XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 52 F F F 7n6g 21 GB 2O Unknown protein XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 47 F F F 7n6g 22 HB 2P Unknown protein XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 106 F F F 7n6g 23 IB,JB 2Q,2R Unknown protein XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 110 F F F 7n6g 24 KB,LB,MB,NB 2S,2T,2U,2V A8JCZ9_CHLRE FLAGELLAR ASSOCIATED COILED-COIL PROTEIN MALTVEPLSPNLLHTDLLTIKHTSDPALLFPSHAYGRGGVGHFLCTGHSRAGIHRLPQAASKAASLSLTARGRASSSRASDYADGMDPAATGNGSSISGFSLISSDPAAIGATRAWEAGTVAHASNFVSACRSVRPTDVPRGSKWAAVSFFENDPKEVSRHMQVLDFIQAKHDMAKAAERQTEQARHNIWAEQQRETFRRQRLEQASRYTRSGIRPRSAYAELGAVAAAEQAHNGYGSGSVLGDSQDGSRFGGGGGGGRLGGIAPPSGDPASRYYGALGRDGTFGSRISGTGSAQGGGGSGSMSYYPGGKPRPSTALPAGSVYTGRGGPLTAAQAATANPFNATLSATAATAAAGGGGPIPLPKKTYIHVYDRMAAEAAETPAARAAAQAAAQAAAEDERRQAVLDGELAELDSFEARMRSLQRARSRAAHSERRRGSNAFAEDSDA 447 T 7.7 PDDEXK_1 pdbpssm F Eukaryota T 7n6g 25 OB,PB 2W,2X Unknown protein XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 167 F F F 7n6g 26 QB,RB 2Y,2Z Unknown protein XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 65 F F F 7n6g 27 AC,BC 3A,3B Unknown protein XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 36 F F F 7n6g 29 DC 3E A0A2K3CZ11_CHLRE FAP92 MAPKAKGPDPAEAAAPVGPSQELHLVLKVRMRLKPPAPERPPQEPPAQPSTPPSAEPSAATGADTASAAKDAKGRAKSPPKKPVTPGSARKAAAAAAAAEAAAAAAAAAAAALAKPVDPAALVLTVKYTPLGGAEAVVGPVKPLKPAVAAAAAAAAAAAAAEAAAAAAAAEAAAASKSGTAAGGKKPPPAKPGASAAPSPPPPATPSPPLTPPPPPPAAASAAAPAGPEPYAEVCVEHHVKVVVDEAVVRSLAAANALLPVSVSLASPSDEEGADGKGAKKPTPAAGKSKAALAAAAAAPPTPVRSYGAVLMLDVSGLLVGDTSARAVWPDKAKGLPSALEEVAEGVEAELQLLSLPRPENPTTPAAGKGRPAAAATAAKPVSAKPGGKKGEAPPEPELKDLPGEPIGLLPPELITQLNPIVINIRKAKELPAAPATRAQLDNNCASPALFLRWPPGVPPREQPGLASPWQLPASAATSVTGLTHNGSSGGAEVLVLAGPAGAAMSRVPPSAGRRYLLFGQPEVFFAGDLPGGGEEALRLCRECPLLVEVHDRTAIPEPPEFPLEPLPAADGGAAGGAAGGQAAPPESEGYVCGLARVPMLDLARGYTRFRFHTSLAPHTTVRGAASLDWTKRPGNYAEAGSILKGEVRCACPLPSTRGADPSSRVFARALFIMDYRDSDLFHLLEDTVRKNNAWRLGLAEPQDKVSPDQLPPELKAMRAAGGGVPGVGAGVGGHHSHSHPAHDDDDDDGRERRPSTDERQRRGSASSGSSAQAAGGGGAGGGLGRRSSSMYSYTEPNSPTRTGGLVPDRGRPGGGGGVPPLPPAGVAGGLQRRESTSPGALAAMASGLGGVDGRRHVIWDSESEEDDEPDEPPTAAELALDALCVDIDRISLELRELQALSTAQLTPEQAADRQLDLLTGWHLVDGRERVIVVEGLAEGAMKIVKGISDWALEDPKPEEGWRRRSVLLSTSPSLRSAWRLYSPLGVDLWIVKLRAPLPKLLGEPGSFATGRVRPDCVQGIRRLGALRSVAAWSRQAHDLSLWPSPQQLQLVDKKFGGELLAADVLGVEANEPDSEDEDGGPRALGGLLDADARSAKSSKSNRSRRSSRSRRSGKSGRSGKSGKTGKSGKRRQRLRQSRVPPLDTHNDGYLALRRQARQRRLQRDWLSLNRDSLRELERRTSHIKDTWREWNPQRTAREQLDAAIRAGTIPPEELEALSTARKRSPAPLPGTVPRQDALARGWYPHPSPFKWPSPRVPADFRQLPARPTDFRVQQLEEPWEEGALHRGVDLRGGPGTAGVGSAKDEFLTAVRGDQTGLFGTDPEYWRTVHLGGAGREAELISTRRAEAEEWRRRLVVEDPVMRTVLPQGPPVPAQADRLKPLLKDEPSKKGFKVAAVPPAPLNSQLAYPWTDPSSAAAQAATVGRGRDDKTKFIDPGHEFRNVTGKPKTAVHKLSYNQSWSASTHDKYYNQ 1471 T 28 BLUF pdbhh F Eukaryota T 7n6g 30 EC,FC 3F,3G CFA99_CHLRE CILIA- AND FLAGELLA-ASSOCIATED PROTEIN 99 MESRPGSASSYHSHAAGNTQSRPGSATSVESSNRLSGKPLGAEKLMAACEKIYKSFNPAQVTLDTHVDNCIGQLSVHNSFDDSFIRQVVYGTVRYRRLLGALMDSFYHYNGGAASREDVDMYKLYSYLTIFRLEELNFSNFQRLVDAMTPQKMYVLLKYMFNPQYMREVVREDWLKLYDKEFVDEIIDRLLSWKSESKEMLGRLEYKVTLSRKEDDEKETLGTGYAAARSTTVPQPFNLTQPKPRPLPVEDPPPPPIRAKPAPRPREGPTKEEVALAAAREANRAAAERKAAKAAPFKLRVLERPTNIDKIREELEAERTRELTFKGIRAAPPPPVPNAQVRLNAAAILREDALYRRKQQEEADALKRYEAELRDASSFKAWQNAMLEQDEAARAASVERRRQEMAAAQENAIRARMAAQEANAELARAAKEEAKRIEEDLKRDREEQARLNALRRDAVVEARQNVQAAVEKMSQERRLAAEEERRKQQEDARARAEVAAREMAERRDIILQLKALEKVPKQRVKEFDPTETGPDHGLLETMSLVELRERLNVAKRRQREEEERQRAEILRQKQERESALLEKAANIQRVRRVAAAQAAQRRATSAETIQRKNTEVSKAREADVLQLADKLDAKRAALAAERARLAAEQKRTRFEQMQAAAGAAVVEETKFRELRAGAQREAKTRQENALASATVYEATKARQQNVRLKNVRQELKAKDDFMRAYDEKLAALRGQAGAESAADLARRTQMAQTQRAAEATVRNRTTTTAYRPYEGGSTSMQARLAALGQGMELED 795 T 0.038 DUF1948 pdbhh F Eukaryota T 7n6g 34 HD,ID,LC,MC,NC,OC,PC,QC,RC,SC,TC,UC,VC,WC,XC,YC 4A,4B,3M,3N,3O,3P,3Q,3R,3S,3T,3U,3V,3W,3X,3Y,3Z A8IVW2_CHLRE FLAGELLA ASSOCIATED PROTEIN MPSPKRSGLGSGRLIGGQASTTSLGSPGAGTSQFQINHENTLRKNRNHFQGQIEQYSIDYHSSHNKALAELPVLQEQHALEIEEYQNAIEETVTRHLGRIAQLRQDYSLLIKEASRRHMELQQRRFAGARVDIPQVAQQPIAGLPALSAQPQIVAPSAVPNGSLAASRSGNVSTSSEQAGSHAMGQVNPNAYLRTLNNDAAAAMSQYNTRPLPATPGGIAALSISTPASPLTARLATPSEATARNSDFARLEAMHEKYMGRTKSQHEQGIAARINEVSNWQQTFAACMAALAQHQSSALAHLSSCVEEAEAWHAQQRGALSAAHDEVMTEAERLSRQLEAAQTAAAAKLNGLLASFLERVLPSGEVGLAEAQASYTRSAANLRNEHVAALEAAEAALRALVPRHAAMRQHMSASYSSGLASHEAALAAAGGHYDSRGIPELRRQYQDAESRHRDTLAAIRAEHLKGLAGSRDGWMGEAAALLEEYRARMQELKQQYMLAYDVNLTEV 507 T 0.022 AAA_13 pdb F Eukaryota T 7n6g 35 JD 4C A0A2K3DQM4_CHLRE FAP81 MSSAHILSTFQSTFPGLYQAPKKGEDEPPPEAPAPEPVTQHDDEPDQYSTRIAGITSKFERMRASADEMEQYLRSAAEDAKEAEARALAKADEDFTPAWRNVGLPLKPSHLHLDHGAMAGARLVNPKAIVQEYQAIKGREVLNPPRIAEYADTSAKPNYLKSTHAMEERKMRTMSPDRTARIQALSARHLAWQTLTPEEVAAKMEEAEQRRRQLGLKMPRAQFELEQKIIQSMHHKLTFLRNPRHPLPPAVKTLMELRPDANRWVGPRTTVLEGVKPPIKADMTSRPDQVFVVEPAEVTFTNYAVGRAYEQVVRVRNVTAVSRSLRIFPPASQYFHASLPRFPGEVGVLAPGMAAEVTLRFCPDSLGDYEDAIAVDATHSRQTVPLRARRPPPSLTLPEEIDMGQVVIGNVKTEQVTFKNMGGAGRFRIVPEAHWPDFAMDAPTDRAVVGQFKIWPLYFEMAAGEQLGLNVSYEPTEWGNTEERLVLVCDNCQVKTFSLSGNAVGVDVLLHSVDGRMLEPRELDLPLWFGECAPGAGFSKTVSVRNTTKLPFAFEWGLTKFPQVQNRRRANEPLQTEAQYDEEQDDEGHVLLVDNKSLRGTSPLRLGTGGGGAAAAPPAVGGGAGGGAGGGAGGVSSSAAGVNGSVAQAPGGGVAGAKPPGPMKALAGAENAGTPWGVHCGNEAHGPDPLALGAVVEDLFRVVPRSGVLQPGEVMEFLVTFTPPGQARYERWAQLRVDRKPISVPSGASPMVRGSGSGTGRSHAAIAASATCDVLVAEVGLEGLGCPVQLSAAPRLVSLPGKLMPAEGTTRHVTLRNPTRAQVVVRATVDNPAIAVSPSEFRMPSLGAISLAVTVRAPPDAAPGPLSGRVLLEVEHGPPVPIEVRAAVGSSYARLITPRINFGDVPLSGSSEQRLVIRNMSATCPTPWSIRELTPALVAAEKSRLLRAQLLQSSRMLDPQAAAAAIDALVQEEEEAEAAAAAREEEEAVGYRGASVTGHHARFAEAQRSPAASTSGALVALPASAAERRAVFAAGRHTDSSLAQALPPPDTTHVTFEPSSGVLEPNQELTVRVTCHALTDGRHRSIIQLRSGAPHAGGGPDGGLHMECLEAFACVVTPACVVDRPVMDLGVTFVGVQVRQTLYLTNLSQLPVLYRWTAEAEDEGSQTAGLAELKIKPDHGELEPGEDVEIQVRYTPRYPGPCVMYGVCELEGAPEPLGFRVSSAIHGLDVTYDLLTQEQYDDYMAHDQAAAALGSTGPKGAAAMAVAAAAAGGANTGSGGYYDSESGEVAGAGGEGGGPSRAEEIAAALDKVNFMRVGRHGKAEVDVEAFSGLTHLPPDVLSRVASHRHASTSAAGSKAASRRASARPGAQARPRSGAAAGGGVAAWPPPTPQRHLVADFGHNVPLGETRQMYLVVTNRTAMHTSIRTWLERFGVADASRFVRGTESGAAPPGGGADKAGGAEGGAGKGGASRRQTKDEDAHHPQGPKLSRYSKYTPIKLAGTDAEHRAPFRADKGNEMMATRRLQEEADEALGNKGLAVSVTPPESTLEPWSRLVLTVSCFNDMCGAYMDMMHVKVGDLPARDIPVLVGVSGTPLVVQRERVLVRGLRARSWRTDLEWGQVPQGVEQTRTFYVFNTGSLDMHLAWEARRYHDYVDLERLPPTTDVGAGGTGQHDSLWGGAPGTLRDTRAGMKLFDVKLQPDERAGCVRLATERHADPTDDVPFRVEPEEQVIKGNTTAKFTVTFCASESRRHGGYLHGTQRVFSPESPLELRVWTAGENADRVGALLSGTFHPYAGAPPTPLQPLRVDLGAQAQACRLEPDGQTDLSWVVTSIQQPGSHAAFVRSVTLSNTAHCPQVFSLDVEGPWDMVAASPSVPQDPVAYRGTSTLLGPAAASGRLGTSAADGGLTFLPPGESVDVTLRFSPGKGDMEALPVRFAAMQAKQRVIETVNDYKNTGALCITFANGDSQSLPLVAEMLHPRLEVKPRKLDFKKVHLQSPKEMFVMLSNPTNVDAAWAVTVEGHKPRFPTLPGAGAAASAAAAKEAKEEAAAAAAAAGGGGSASAQPTPRSASGGNLAGEASAPDSRAASAVSGASRPATVDGGAAGAAAPAGGVPPPPKLPGAGGPGTLPGVTGIIAEARIGPYVVKPASGVLSGRGLGMPRSQRISITFAPTEAEAYEGELIFAVLRGKQCSVDVDGEGSIEETDETKGNLFVI 2215 T 0.0019 PapD-like pdbhh F Eukaryota T 7n6g 45 KF,LF,MF 5W,5X,5Y Unknown protein XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 137 F F F 7n6g 46 NF 5Z Unknown protein XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXWAMHVVFARXXXXXXXXXXXXXXXXVHAAVVMXXXXXXXXXXXVAAHAVAVVAVAAAVXXXXXXXXXMAAAAALLXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 181 T 410 HD_3 pdbhh F T 7n6g 52 JG,KG,LG 6O,6P,6Q Unknown protein XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGYSVYESXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 89 T 730 DUF1939 pdbhh F T 7n6j 2 B B PPB_ECOLI APASE RKQSTIALALLPLLFTPRR 19 T 6.7 ASTN_1_2_N pdbhh F Bacteria T 7n6k 2 B B PPB_ECOLI APASE RALALLPLSR 10 T 8.9 DUF3561 pdbhh F Bacteria F 7n82 1 A A Q31PX7_SYNE7 Biofilm-related protein MRIDELVPADPRAVSLYTPYYSQANRRRYLPYALSLYQGSSIEGSRAVEGGAPISFVATWTVTPLPADMTRCHLQFNNDAELTYEILLPNHEFLEYLIDMLMGYQRMQKTDFPGAFYRRLLGYDS 125 T 4.3 ATP13 pdbhh F Bacteria T 7n84 2 B,C Y,Z unknown XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 63 F F F 7n84 3 D,K a,l NU120_YEAST NUCLEAR PORE PROTEIN NUP120 MACLSRIDANLLQYYEKPEPNNTVDLYVSNNSNNNGLKEGDKSISTPVPQPYGSEYSNCLLLSNSEYICYHFSSRSTLLTFYPLSDAYHGKTINIHLPNASMNQRYTLTIQEVEQQLLVNVILKDGSFLTLQLPLSFLFSSANTLNGEWFHLQNPYDFTVRVPHFLFYVSPQFSVVFLEDGGLLGLKKVDGVHYEPLLFNDNSYLKSLTRFFSRSSKSDYDSVISCKLFHERYLIVLTQNCHLKIWDLTSFTLIQDYDMVSQSDSDPSHFRKVEAVGEYLSLYNNTLVTLLPLENGLFQMGTLLVDSSGILTYTFQNNIPTNLSASAIWSIVDLVLTRPLELNVEASYLNLIVLWKSGTASKLQILNVNDESFKNYEWIESVNKSLVDLQSEHDLDIVTKTGDVERGFCNLKSRYGTQIFERAQQILSENKIIMAHNEDEEYLANLETILRDVKTAFNEASSITLYGDEIILVNCFQPYNHSLYKLNTTVENWFYNMHSETDGSELFKYLRTLNGFASTLSNDVLRSISKKFLDIITGELPDSMTTVEKFTDIFKNCLENQFEITNLKILFDELNSFDIPVVLNDLINNQMKPGIFWKKDFISAIKFDGFTSIISLESLHQLLSIHYRITLQVLLTFVLFDLDTEIFGQHISTLLDLHYKQFLLLNLYRQDKCLLAEVLLKDSSEFSFGVKFFNYGQLIAYIDSLNSNVYNASITENSFFMTFFRSYIIENTSHKNIRFFLENVECPFYLRHNEVQEFMFAMTLFSCGNFDQSYEIFQLHDYPEAINDKLPTFLEDLKSENYHGDSIWKDLLCTFTVPYRHSAFYYQLSLLFDRNNSQEFALKCISKSAEYSLKEIQIEELQDFKEKQHIHYLNLLIHFRMFEEVLDVLRLGHECLSDTVRTNFLQLLLQEDIYSRDFFSTLLRLCNAHSDNGELYLRTVDIKIVDSILSQNLRSGDWECFKKLYCFRMLNKSERAAAEVLYQYILMQADLDVIRKRKCYLMVINVLSSFDSAYDQWILNGSKVVTLTDLRDELRGL 1037 T 0.086 TPR_1 pdbpssm F Eukaryota T 7n85 3 C,D 5,6 Unknown connectors XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 36 F F F 7n89 2 C,D C,D R1AB_SARS2 ACE-SER-ALA-VAL-LEU-GLN-SER-GLY-PHE-NH2 XSAVLQSGFX 10 T 14 GPAT_N pdbhh T Viruses T 7n8j 2 B B BIMAX2 GSRRRRRRKRKREWDDDDDPPKKRRRLD 28 T 0.74 Med24_N pdbhh F T 7n8r 1 A,B A,B NUP54_HUMAN FGTGFG segment from the Nucleoporin p54, residues 63-68 FGTGFG 6 T 1.1 DUF543 pdbhh F Eukaryota F 7n9f 3 C,D 5,6 orphans bound to Nup192 NTD XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 36 F F F 7n9f 12 EA,FA u,v NUP82_YEAST NUCLEAR PORE PROTEIN NUP82 MSQSSRLSALPIFQASLSASQSPRYIFSSQNGTRIVFIQDNIIRWYNVLTDSLYHSLNFSRHLVLDDTFHVISSTSGDLLCLFNDNEIFVMEVPWGYSNVEDVSIQDAFQIFHYSIDEEEVGPKSSIKKVLFHPKSYRDSCIVVLKEDDTITMFDILNSQEKPIVLNKPNNSFGLDARVNDITDLEFSKDGLTLYCLNTTEGGDIFAFYPFLPSVLLLNEKDLNLILNKSLVMYESLDSTTDVIVKRNVIKQLQFVSKLHENWNSRFGKVDIQKEYRLAKVQGPFTINPFPGELYDYTATNIATILIDNGQNEIVCVSFDDGSLILLFKDLEMSMSWDVDNYVYNNSLVLIERVKLQREIKSLITLPEQLGKLYVISDNIIQQVNFMSWASTLSKCINESDLNPLAGLKFESKLEDIATIERIPNLAYINWNDQSNLALMSNKTLTFQNISSDMKPQSTAAETSISTEKSDTVGDGFKMSFTQPINEILILNDNFQKACISPCERIIPSADRQIPLKNEASENQLEIFTDISKEFLQRIVKAQTLGVSIHNRIHEQQFELTRQLQSTCKIISKDDDLRRKFEAQNKKWDAQLSRQSELMERFSKLSKKLSQIAESNKFKEKKISHGEMKWFKEIRNQILQFNSFVHSQKSLQQDLSYLKSELTRIEAETIKVDKKSQNEWDELRKMLEIDSKIIKECNEELLQVSQEFTTKTQ 713 T 2.4E-12 Nup88 pdbpercent F Eukaryota T 7n9f 14 DB,KA a,h NU120_YEAST NUCLEAR PORE PROTEIN NUP120 MACLSRIDANLLQYYEKPEPNNTVDLYVSNNSNNNGLKEGDKSISTPVPQPYGSEYSNCLLLSNSEYICYHFSSRSTLLTFYPLSDAYHGKTINIHLPNASMNQRYTLTIQEVEQQLLVNVILKDGSFLTLQLPLSFLFSSANTLNGEWFHLQNPYDFTVRVPHFLFYVSPQFSVVFLEDGGLLGLKKVDGVHYEPLLFNDNSYLKSLTRFFSRSSKSDYDSVISCKLFHERYLIVLTQNCHLKIWDLTSFTLIQDYDMVSQSDSDPSHFRKVEAVGEYLSLYNNTLVTLLPLENGLFQMGTLLVDSSGILTYTFQNNIPTNLSASAIWSIVDLVLTRPLELNVEASYLNLIVLWKSGTASKLQILNVNDESFKNYEWIESVNKSLVDLQSEHDLDIVTKTGDVERGFCNLKSRYGTQIFERAQQILSENKIIMAHNEDEEYLANLETILRDVKTAFNEASSITLYGDEIILVNCFQPYNHSLYKLNTTVENWFYNMHSETDGSELFKYLRTLNGFASTLSNDVLRSISKKFLDIITGELPDSMTTVEKFTDIFKNCLENQFEITNLKILFDELNSFDIPVVLNDLINNQMKPGIFWKKDFISAIKFDGFTSIISLESLHQLLSIHYRITLQVLLTFVLFDLDTEIFGQHISTLLDLHYKQFLLLNLYRQDKCLLAEVLLKDSSEFSFGVKFFNYGQLIAYIDSLNSNVYNASITENSFFMTFFRSYIIENTSHKNIRFFLENVECPFYLRHNEVQEFMFAMTLFSCGNFDQSYEIFQLHDYPEAINDKLPTFLEDLKSENYHGDSIWKDLLCTFTVPYRHSAFYYQLSLLFDRNNSQEFALKCISKSAEYSLKEIQIEELQDFKEKQHIHYLNLLIHFRMFEEVLDVLRLGHECLSDTVRTNFLQLLLQEDIYSRDFFSTLLRLCNAHSDNGELYLRTVDIKIVDSILSQNLRSGDWECFKKLYCFRMLNKSERAAAEVLYQYILMQADLDVIRKRKCYLMVINVLSSFDSAYDQWILNGSKVVTLTDLRDELRGL 1037 T 0.086 TPR_1 pdbpssm F Eukaryota T 7n9h 2 B A TADBP_HUMAN TDP-43 KDNKRKMDETDASSAVKVKRAVQK 24 T 1.2E-05 DUF4523 unphh F Eukaryota T 7nac 3 C 7 LOC1_YEAST LOCALIZATION OF ASH1 MRNA PROTEIN 1 MAPKKPSKRQNLRREVAPEVFQDSQARNQLANVPHLTEKSAQRKPSKTKVKKEQSLARLYGAKKDKKGKYSEKDLNIPTLNRAIVPGVKIRRGKKGKKFIADNDTLTLNRLITTIGDKYDDIAESKLEKARRLEEIRELKRKEIERKEALKQDKLEEKKDEIKKKSSVARTIRRKNKRDMLKSEAKASESKTEGRKVKKVSFAQ 204 T 2 PIN_6 pdb F Eukaryota T 7naf 3 C 8 NOC2_YEAST Nucleolar complex protein 2 KVSKSTKKFQSKHLKHTLDQRRKEKIQKKRIQGRRGNKT 39 T 1.5 Hid1 unppssm F Eukaryota T 7naf 14 N s NUG1_YEAST NUCLEAR GTPASE 1 MRVRKRQ 7 T 0.0095 RNR_inhib pdbhh F Eukaryota F 7nbv 1 A A POLG_TMEVG CAPSID PROTEIN VP1,CAPSID PROTEIN VP2,CAPSID PROTEIN VP3,CAPSID PROTEIN VP4,GENOME POLYPROTEIN,LEADER PROTEIN,P1A,P1B,P1C,P1D,PICORNAIN 3C,PROTEASE 3C,PROTEIN 2C,PROTEIN 3A,RNA-DIRECTED RNA POLYMERASE,VP4-VP2,VPG,VIRION PROTEIN 1,VIRION PROTEIN 2,VIRION PROTEIN 3,VIRION PROTEIN 4,PROTEIN 2B, 2A PROTEIN (DERIVED FROM GENOME POLYPROTEIN) GPLGSNPASLYRIDLFITFTDELITFDYKVHGRPVLTFRIPGFGLTPAGRMLVCMGEKPAHSPFTSSKSLYHVIFTSTCNSFSFTIYKGRYRSWKKPIHDELVDRGYTTFREFFKAVRGYHADYYKQRLIHDVEMNPG 138 T 0.11 SpoVAD pdbpssm T Viruses T 7ncr 1 A,B A,B F8VBM8_9VIRU PUTATIVE COAT PROTEIN MGDRVNAQDDDTVVPHQAPLQPAALQQDLTRSADYLLDNVRIGNHRQRYDKYRRYVLLRSSEIFTSLVAIYAHIFSSYWQHFRRFTDQFQAPTGVQLPTFVARVYISTWLHDLYCSIREATRSISPLAFNERYSYELLPYSTEYDPFLAFLSMSIKPTHIQHTPENTLWIPILCENYDWDRNEANHNPFGITNFTLNSNLFYGLLAILKERKEFKLSTLTTNTIGRPCWLFDWHDNVQVCAWFPREANFNSQDVTAAYIIGVACTPKLGPSDDDAWKYYASLNSVPTFTPTEPRLTNRRSYGAYEVRTRETENNYFLPDSLLNIIEDFTATGTTQRRKIRRPSATSASTGAAIIIRDTPGTASTATTSTTETEVTFPPVIRTKIRDWYYHSRVILELEDNSRTAALRMFIIA 412 T 13 Peptidase_C62 pdbhh T Viruses T 7nef 2 I,J,K,L,M,N,O,P I,J,K,L,M,N,O,P Fln65 KKLLKLLKLLLX 12 T 16 Sec16 pdbhh F F 7nev 2 B B LEUPEPTIN XLLX 4 T 590 DNTTIP1_dimer pdbhh F F 7new 2 E,F,G,H E,G,H,F Heterochiral peptide Fdln69 KKXXKXXKXXXX 12 F F F 7nff 1 A,B A,B CC-Type2-(LaId)4-I24A XGEIAQALKEIAKALKEIAWALKEAAQALKGX 32 T 0.002 WXG100 pdbpssm F T 7nfg 1 A,B,C,D,E,F A,B,C,D,E,F CC-Type2-(LaId)4-L14A XGEIAQALKEIAKAAKEIAWALKEIAQALKGX 32 T 0.015 MCPsignal pdbpssm F T 7nfh 1 A,B,C,D,E,F,G A,B,C,D,E,F,G CC-Type2-(MaId)4 XGEIAQAMKEIAKAMKEIAWAMKEIAQAMKGX 32 T 0.0046 WXG100 pdbpssm F T 7nfi 1 A,B A,B CC-Type2-(LaId)4-L7Y XGEIAQAYKEIAKALKEIAWALKEIAQALKGX 32 T 0.0046 WXG100 pdbpssm F T 7nfj 1 A,B,C,D,E,F,G A,B,C,D,E,F,G CC-Type2-(LaId)4-L28Y XGEIAQALKEIAKALKEIAWALKEIAQAYKGX 32 T 0.0009 WXG100 pdbpssm F T 7nfk 1 A,B A,B CC-Type2-(LaId)4-I24S XGEIAQALKEIAKALKEIAWALKESAQALKGX 32 T 0.0037 WXG100 pdbpssm F T 7nfl 1 A,B A,B CC-Type2-(LaId)4-I24N XGEIAQALKEIAKALKEIAWALKENAQALKGX 32 T 0.0048 WXG100 pdbpssm F T 7nfm 1 A,B,C,D,E,F,G,H,I,J,K,L,M,N,O,P A,B,C,D,E,F,G,H,I,J,K,L,M,N,O,P CC-Type2-(LaId)4-L21K XGEIAQALKEIAKALKEIAWAKKEIAQALKGX 32 T 0.042 MCPsignal pdbpssm F T 7nfn 1 A,B,C,D,E,F,G,H,I,J,K,L,M,N A,B,C,D,E,F,G,H,I,J,K,L,M,N CC-Type2-(LaId)4-L21N-I24N XGEIAKALREIAKALREIAWANRENAKALRGX 32 T 0.098 Ada3 pdbpssm F T 7nfo 1 A,B,C A,B,C CC-Type2-(LaId)4-I17C XGEIAQALKEIAKALKECAWALKEIAQALKGX 32 T 0.026 WXG100 pdbpssm F T 7nfp 1 A,B,C,D,E,F,G A,B,C,D,E,F,G CC-Type2-(LaId)4-I17K XGEIAKALREIAKALREKAXALREIAKALRG 31 T 0.13 HTH_38 pdbpercent F T 7nfx 48 VA s Signal Sequence XXXXXXXXXXXXXXXXXXXXX 21 F F F 7nfy 2 G G substrate protein XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 55 F F F 7ng2 1 A A S8F6K2_TOXGM CPSF4 GDPFGHVASPQSTKRFFIIKSNRMSNIYTSIQHGVWATSKGNSRKLSNAFTSTDHVLLLFSANESGGFQGFGRMMSLPDPQLFPGIWGPVQLRLGSNFRVMWLKQCKIEFEELGKVTNPWNDDLPLRKSRDGTEVPPALGSLLCTWMSQRPSEDLLAGTGIDPATR 166 T 5.6E-31 YTH pdbhh F Eukaryota T 7ng4 2 G G substrate protein, chain G XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 55 F F F 7ng5 2 G G Substrate protein chain:G XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 55 F F F 7ng8 2 D D Klebicin C activity MADNQPVPLTPAPPGMVSLGVNENGEEEMTVIGGDGSGTGFSGNEAPIIPGSGSLQADLGKKSLTRLQAESSAAIHATAKWTTENLAKTQAAQAERAKAAMLSQQAAKAKQAKLTQHLKDVVDRALQNNKTRPTVIDLAHQNNQQMAAMAEFIGRQKAIEEARKKAEREAKRAEEAYQAALRAQEEEQRKQAEIERKLQEARKQEAAAKAKAEADRIAAEKAEAEARAKAEAERRKAEEARKALFAKAGIKDTPGCLEHHHHHH 264 T 0.043 DUF2612 pdb F T 7ngc 2 G G substrate protein XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 55 F F F 7ngf 2 G G substrate protein chain:G XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 55 F F F 7nh2 1 A A S8F6K2_TOXGM CPSF4 GDPFGHVASPQSTKRFFIIKSNRMSNIYTSIQHGVWATSKGNSRKLSNAFTSTDHVLLLFSANESGGFQGFGRMMSLPDPQLFPGIWGPVQLRLGSNFRVMWLKQCKIEFEELGKVTNPWNDDLPLRKSRDGTEVPPALGSLLCTWMSQRPSEDLLAGTGIDPATR 166 T 5.6E-31 YTH pdbhh F Eukaryota T 7nhp 18 R 3 Q8DMP8_THEEB Tsl0063 protein MRYTTDEGGRLNNFAIEPKVYQAQPWTPQQKVRAALLVGGGLLLVAGLVAIAVGVS 56 T 0.026 DUF2157 pdbpssm F Bacteria T 7nhq 17 Q 3 Q8DMP8_THEEB Tsl0063 protein MRYTTDEGGRLNNFAIEPKVYQAQPWTPQQKVRAALLVGGGLLLVAGLVAIAVGVS 56 T 0.026 DUF2157 pdbpssm F Bacteria T 7nht 15 O,P c,d AKIR2_HUMAN Akirin-2 MACGATLKRTLDFDPLLSPASPKRRRCAPLSAPTSAAASPLSAAAATAASFSAAAASPQKYLRMEPSPFGDVSSRLTTEQILYNIKQEYKRMQKRRHLETSFQQTDPCCTSDAQPHAFLLSGPASPGTSSAASSPLKKEQPLFTLRQVGMICERLLKEREEKVREEYEEILNTKLAEQYDAFVKFTHDQIMRRYGEQPASYVS 203 T 4.2 ATP-synt_E_2 pdbhh F Eukaryota T 7nix 2 B B TBCD4_HUMAN AKT SUBSTRATE OF 160 KDA,AS160 RRRAHTFSHPP 11 T 9.3 THEG4 pdbhh F Eukaryota T 7njc 1 A A S8F6K2_TOXGM CPSF4 GDPFGHVASPQSTKRFFIIKSNRMSNIYTSIQHGVWATSKGNSRKLSNAFTSTDHVLLLFSANESGGFQGFGRMMSLPDPQLFPGIWGPVQLRLGSNFRVMWLKQCKIEFEELGKVTNPWNDDLPLRKSRDGTEVPPALGSLLCTWMSQRPSEDLLAGTGIDPATR 166 T 5.6E-31 YTH pdbhh F Eukaryota T 7njz 3 C C CCR5_HUMAN C-C CKR-5,CC-CKR-5,CCR-5,CCR5,CHEMR13,HIV-1 FUSION CORECEPTOR, PIYDIN PIYDIN 6 T 9.9 Herpes_UL95 pdbhh F Eukaryota F 7nkv 1 A A Q8XAD6_ECO57 Phage repressor protein CI MQKKEIRRLRLKEWFKDKTLPPKEKSYLSQLMSGRASFGEKAARRIEQTYGMPEGYLDAEYAEQPGSSHHHHHH 74 T 0.0032 HTH_3 unppssm F Bacteria T 7nlj 2 B B APikL2A MQNEYLDAKKHGIDLSRERAPNFVDHPGIPPSDCFWFLYKNYVRQDAGVCQSDWSFDMKIGQYWVTIHTDEGCRLSGIIPAGWLILGIKRLGF 93 T 2.9 OPA1_C pdbhh F T 7nma 2 B P AMOT_HUMAN ANGIOMOTIN GHVRSLSERLMQM 13 T 2.8E-05 Macoilin unphh F Eukaryota T 7nmd 3 C,F C,F GLN-LEU-PRO-ARG-LEU-PHE-PRO-LEU-LEU QLPRLFPLL 9 T 2.9 DUF6308 pdbhh F F 7nme 3 C C GLN-LEU-PRO-ARG-LEU-PHE-PRO-LEU-LEU QLPRLFPLL 9 T 2.9 DUF6308 pdbhh F F 7nmf 3 C C GLN-LEU-PRO-ARG-LEU-PHE-PRO-LEU-LEU QLPRLFPLL 9 T 2.9 DUF6308 pdbhh F F 7nmg 3 C C INS_HUMAN Diabetes epitope LWMRLLPLL LWMRLLPLL 9 T 2.5 MitoNEET_N pdbhh F Eukaryota F 7nmm 2 I,J,K,L,M,N,O,P I,J,K,L,M,N,O,P APikL2F GPMQNEYIDAKKHGIDLSRERAPNFVDHPGIPPSDCFWFLYKNYVRQNAGVCQSDWSFDMKIGQYWVTIHTDEGCRLSGIIPAGWLILGMKRPGF 95 T 2.7 OPA1_C pdbhh F T 7nmw 2 B P AMOT_HUMAN ANGIOMOTIN GHVRSLSERLMQM 13 T 2.8E-05 Macoilin unphh F Eukaryota T 7nmx 2 B P AMOT_HUMAN ANGIOMOTIN GHVRSLSERLMQM 13 T 2.8E-05 Macoilin unphh F Eukaryota T 7nn2 2 B P AMOT_HUMAN ANGIOMOTIN GHVRSLSERLMQM 13 T 2.8E-05 Macoilin unphh F Eukaryota T 7nn3 1 A,B,C,D A,B,C,D E4S6E9_CALKI Beta-xylanase MGSSHHHHHHSSENLYFQGHIETLPDSFTFYDGTKVQRLSDWPKRAQELKDLYQFYMYGYKPDTSVEDVTYSVNGNTLTITVKVGDKQASFNATVRLPQANSGYQPPYPVIISLGYLAGFNWQTWQFIDYSTNAVNRGYAVISFMPNDVARDDSSYTGAFYTLYPHSNKVENDTGVLMAWAWGASKILDALEKGAIPEIDAKKAIVTGFSRYGKAALVAGAFDERFAVVNPHASGQGGAASFRYSFAGKQYSWGVAGNAEAFSNLQGNTEGHWFNAVFREFKDPRQLPFDQHELIALCAPRTVLITGGYSDWGTNPEGTWVSFVGARKVYEFLGVADRIGFALRDGSHAITEEDVNNLLDFCDWQLRGIQPTKDFSTSRFAIDPAWDTISVPTLYRNAD 399 T 0.00012 AXE1 pdbpercent F Bacteria T 7nn6 1 A A TOXR_VIBCH ToxR MRGSHHHHHHGSPSQTSFKPLTVVDGVAVNMPNNHPDLSNWLPSIELCVKKYNEKHTGGLKPIEVIATGGQNNQLTLNYIHSPEVSGENITLRIVANPNDAIKVCE 106 T 0.051 Oest_recep pdb F Bacteria T 7nna 1 A A Q5V9K0_KLEPN KLEBC TOL BINDING DOMAIN MSGSLQADLGKKSLTRLQAESSAAIHATAKWTTENLAKTQAAQAERAKAAMLSQQAAKAKQAKLTQHLKDVVDRALQNNKTRPTVIDLAHQNNQQMAAMAEFIGRQKAIEEARKKAEREAKRAEEAYQAALRAQEEEQRKQAEIERKLQEARKQEAAAKAKAEADRIAAEKAEAEARAKAEAERRKAEEARKALFAKAGIKDTPLEHHHHHH 212 T 0.027 DUF2612 pdb F Bacteria T 7nnd 2 B P AMOT_HUMAN ANGIOMOTIN GHVRSLSERLMQM 13 T 2.8E-05 Macoilin unphh F Eukaryota T 7nne 2 B P AMOT_HUMAN ANGIOMOTIN GHVRSLSERLMQM 13 T 2.8E-05 Macoilin unphh F Eukaryota T 7np2 2 B P AMOT_HUMAN ANGIOMOTIN GHVRSLSERLMQMSLATSGV 20 T 2.8E-05 Macoilin unphh F Eukaryota T 7npb 2 B P AMOT_HUMAN ANGIOMOTIN GHVRSLSERLMQMSLATSGV 20 T 2.8E-05 Macoilin unphh F Eukaryota T 7npg 2 B P AMOT_HUMAN ANGIOMOTIN GHVRSLSERLMQMSLATSGV 20 T 2.8E-05 Macoilin unphh F Eukaryota T 7nqc 2 B B RALA_HUMAN PRO-ASN-GLY-LYS-LYS-LYS-ARG-LYS-SER-LEU-ALA-LYS-ARG-ILE-ARG-GLU-ARG-CMF PNGKKKRKSLAKRIRERC 18 T 12 Protamine_3 pdbhh F Eukaryota T 7nqd 1 A,B A,B H2J4R1_MARPK TPR_REGION domain-containing protein KAPSQNAIKRFMTLFSGREDVFSIQYEGGYRPIRRPLNFQDIKNHFSGKKTLGIYLLKKNDTVKFAAYDIDIKKHYLNREDKFVYEENSKKVAKRLSRELNLENITHYFEFTGNRGYHIWIFFDIPVSAYKIKYIMEKILDRIELEEGIDVEIFPKQTSLNGGLGNLIKVPLGVHKKTGKKCLFVDNDFNVIENQIEFLNNIKENKATEINKLFREIFNEND 222 T 0.024 DNA_primase_S pdbpssm F Bacteria T 7nqe 1 A A H2J4R1_MARPK TPR_REGION domain-containing protein KAPSQNAIKRFMTLFSGREDVFSIQYEGGYRPIRRPLNFQDIKNHFSGKKTLGIYLLKKNDTVKFAAYDIDIKKHYLNREDKFVYEENSKKVAKRLSRELNLENITHYFEFTGNRGYHIWIFFDIPVSAYKIKYIMEKILDRIELEEGIDVEIFPKQTSLNGGLGNLIKVPLGVHKKTGKKCLFVDNDFNVIENQIEFLNNIKENKATEINKLFREIFNEND 222 T 0.024 DNA_primase_S pdbpssm F Bacteria T 7nqf 1 A,B A,B H2J4R1_MARPK TPR_REGION domain-containing protein GPSQNAIKRFMTLFSGREDVFSIQYEGGYRPIRRPLNFQDIKNHFSGKKTLGIYLLKKNDTVKFAAYDIDIKKHYLNREDKFVYEENSKKVAKRLSRELNLENITHYFEFTGNRGYHIWIFFDIPVSAYKIKYIMEKILDRIELEEGIDVEIFPKQTSLNGGLGNLIKVPLGVHKKTGKKCLFVDNDFNVIENQIEFLNNIKENKATEINKLFREIFNE 219 T 0.019 DUF1882 pdbhh F Bacteria T 7nqh 70 WB AZ unknown XXXXXXXXXXXXXXXXXX 18 F F F 7nql 66 QB AZ unknown AAAAAAAAAAAAAAAAAA 18 T 330 Campylo_MOMP pdbhh F F 7nrc 39 MA Sp GIR2 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 48 F F F 7nrc 85 GC Lt L1 60S ribosomal protein XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 210 F F F 7nrc 86 HC A GCN1 AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAGDYLGILMEKLLNPTVASSMRKGAAWGIAGLVKGYGISALSEFDIIRNLIEAAEDKKEPKRRESVGFCFQYLSESLGKFFEPYVIEILPNILKNLGDAVPEVRDATARATKAIMAHTTGYGVKKLIPVAVSNLDEIAWRTKRGSVQLLGNMAYLDPTQLSASVSTIVPEIVGVLNDSHKEVRKAADESLKRFGEVIRNAAIQKLVPVLLQAIGDPTKYTEEALDSLIQTQFVHYIDGPSLALIIHIIHRGMHDRSANIKRKACKIVGNMAILVDTKDLIPYLQQLLDEVEIAMVDPVPNTRATAARALGALVERLAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 1209 T 0.00065 CLASP_N pdbpssm F T 7nre 2 B C Darobactin WNXSKSF 7 T 5.7 TMP pdbhh F T 7nrf 2 B B Darobactin WNXSKSF 7 T 5.7 TMP pdbhh F T 7nri 6 F G 3-PYRIDIN-4-YL-2,4-DIHYDRO-INDENO[1,2-.C.]PYRAZOLE WNXSKSF 7 T 5.7 TMP pdbhh F T 7nrn 1 A A GIPC1_MOUSE GAIP C-TERMINUS-INTERACTING PROTEIN,RGS-GAIP-INTERACTING PROTEIN,RGS19-INTERACTING PROTEIN 1,SEMAF CYTOPLASMIC DOMAIN-ASSOCIATED PROTEIN 1,SEMCAP-1,SYNECTIN GAMGDLPSAFEEKAIEKVDDLLESYMGIRDTELAATMVELGKDKRNPDELAEALDERLGDFAFPDEFVFDVWGAIGDAKVGRY 83 T 0.0092 PWI unphh F Eukaryota T 7ns0 1 A,B,C A1,B2,C3 A0A0B6VL42_9VIRU Capsid protein VP2 MVRRRGGKTAGSKRPKMSSKNFGANRKRDFRRPARKSKAKKARSMAPAKTVRKSTTAGAHSKHFSVIGNPFSKATQQPQIPDGRMLESLPRRCQLVTEIRNNVTVGSNPTYILVAPSLGLAFQAYQDTNVPGGLDSSVYGLQNRGCTVRANLSATSIENYNDIAKWRIVSQGINLKLNNVEDENDGWYEACRFQHDWTPDELCLRSTENDASTISQDEDLVMGVISSSFMNGALNTIGNNMVEQRGYESGLLKNIHKRMFQLHNNTSAIRPKTLQGQFNYGSEITFSGTESEARFTDVPSNRQLVDSLWHNDYDCILIKLYPRENTGAAGQTGSALIVNAIQNLELQYSPTSDLSTYHIANKRARMVEAKLDKKNNTDAAGEPFVPGSSR 390 T 0.082 Peptidase_A6 pdbhh T Viruses T 7ns3 1 A 5 VID28_YEAST GLUCOSE-INDUCED DEGRADATION PROTEIN 5 MTVAYSLENLKKISNSLVGDQLAKVDYFLAPKCQIFQCLLSIEQSDGVELKNAKLDLLYTLLHLEPQQRDIVGTYYFDIVSAIYKSMSLASSFTKNNSSTNYKYIKLLNLCAGVYPNCGFPDLQYLQNGFIQLVNHKFLRSKCKIDEVVTIIELLKLFLLVDEKNCSDFNKSKFMEEEREVTETSHYQDFKMAESLEHIIVKISSKYLDQISLKYIVRLKVSRPASPSSVKNDPFDNKGVDCTRAIPKKINISNMYDSSLLSLALLLYLRYHYMIPGDRKLRNDATFKMFVLGLLKSNDVNIRCVALKFLLQPYFTEDKKWEDTRTLEKILPYLVKSFNYDPLPWWFDPFDMLDSLIVLYNEITPMNNPVLTTLAHTNVIFCILSRFAQCLSLPQHNEATLKTTTKFIKICASFAASDEKYRLLLLNDTLLLNHLEYGLESHITLIQDFISLKDEIKETTTESHSMCLPPIYDHDFVAAWLLLLKSFSRSVSALRTTLKRNKIAQLLLQILSKTYTLTKECYFAGQDFMKPEIMIMGITLGSICNFVVEFSNLQSFMLRNGIIDIIEKMLTDPLFNSKKAWDDNEDERRIALQGIPVHEVKANSLWVLRHLMYNCQNEEKFQLLAKIPMNLILDFINDPCWAVQAQCFQLLRNLTCNSRKIVNILLEKFKDVEYKIDPQTGNKISIGSTYLFEFLAKKMRLLNPLDTQQKKAMEGILYIIVNLAAVNENKKQLVIEQDEILNIMSEILVETTTDSSSNGNDSNLKLACLWVLNNLLWNSSVSHYTQYAIENGLEPGHSPSDSENPQSTVTIGYNESVAGGYSRGKYYDEPDGDDSSSNANDDEDDDNDEGDDEGDEFVRTPAAKGSTSNVQVTRATVERCRKLVEVGLYDLVRKNITDESLSVREKARTLLYHMDLLLKVK 921 T 0.0017 HEAT_2 pdbpercent F Eukaryota T 7nsi 72 TB AZ unknown XXXXXXXXXXXXXXXXXX 18 F F F 7nsj 69 QB AZ unknown XXXXXXXXXXXXXXXXXX 18 F F F 7nso 33 GA 7 ermDL MTHSMRL 7 T 0.0092 Ery_res_leader2 pdb F T 7nsp 33 GA 7 ermDL MTHSMRL 7 T 0.0092 Ery_res_leader2 pdb F T 7nsq 33 GA 7 ermDL MTHSMRL 7 T 0.0092 Ery_res_leader2 pdb F T 7nus 2 D,E,F D,E,F p53/MDM2 macrocyclic peptide inhibitor FSDXSSVPNXXRNXX 15 T 2.8 DUF1244 pdbhh F T 7nuv 1 A,B A,B E9RJ22_BACNA Aux2pLS20 GPMAKVKKHLTFSGPTESPYGIAYIEKEMKAKNCSKMNETIELIFAEHDEMKARLSEQDALVEKIFQRFKKTLDVIRVRAGHTDKNAQINLELWNAFLMANPLPVTVLTDQHTSESVSMAKEKVSNDIATFKQRKDEQKAKQEMQKGEK 149 T 0.0015 DUF1433 unppercent F Bacteria T 7nvr 33 GA Y Unassigned peptide, likely TFIIE-beta XXXXXXXXXXXXXXXXXXX 19 F F F 7nvr 34 HA Z Unassigned peptide, likely XPB XXXXXXXX 8 F F F 7nvr 52 ZA r unassigned peptide (MED29 or MED30) XXXXXXXXXXXXXXXXXXXX 20 F F F 7nvr 54 BB v unassigned peptide (MED14) XXXXXXXXXXXXXXXXXXXXXX 22 F F F 7nvr 55 CB w unassigned peptide (MED6) XXXXXXXXXXXXXXXX 16 F F F 7nvr 56 DB x unassigned peptide (MED17) XXXXXXXXXXX 11 F F F 7nvr 57 EB y unassigned peptide (MED29 or MED30) XXXXXXXXXXXXXXXXXX 18 F F F 7nvr 58 FB z unassigned peptide (MED29 or MED30) XXXXXXXXXXXXXXXXXXXXXXX 23 F F F 7nvv 7 G Y Unassigned peptide, likely XPB XXXXXXXX 8 F F F 7nvv 8 H Z Unassigned Peptide, likely TFIIE-Beta XXXXXXXXXXXXXXXX 16 F F F 7nvw 12 L Y Unassigned Peptide, likely XPB XXXXXXXX 8 F F F 7nvw 13 M Z Unassigned Peptide, likely TFIIE-Beta XXXXXXXXXXXXXXXXXXX 19 F F F 7nvx 12 L Y Unassigned Peptide, likely XPB XXXXXXXX 8 F F F 7nvx 13 M Z Unassigned Peptide, likely TFIIE-Beta XXXXXXXXXXXXXXXX 16 F F F 7nvy 31 EA Y Unassigned peptide, likely TFIIE-beta XXXXXXXXXXXXXXXXXXX 19 F F F 7nvy 32 FA Z Unassigned peptide, likely XPB XXXXXXXX 8 F F F 7nvz 31 EA Y Unassigned peptide, likely TFIIE-beta XXXXXXXXXXXXXXXXXXX 19 F F F 7nvz 32 FA Z Unassigned peptide, likely XPB XXXXXXXX 8 F F F 7nw0 31 EA Y Unassigned peptide, likely TFIIE-Beta XXXXXXXXXXXXXXXX 16 F F F 7nw0 32 FA Z Unassigned peptide, likely XPB XXXXXXXX 8 F F F 7nw1 2 C,D CCC,FFF UBA5_HUMAN UBIQUITIN-ACTIVATING ENZYME 5,THIFP1,UFM1-ACTIVATING ENZYME,UBIQUITIN-ACTIVATING ENZYME E1 DOMAIN-CONTAINING PROTEIN 1,UBA5 DSGESLEDLMAKMKNM 16 T 0.39 DUF5786 pdbhh F Eukaryota T 7nw3 3 C A CCR5_HUMAN C-C CKR-5,CC-CKR-5,CCR-5,CCR5,CHEMR13,HIV-1 FUSION CORECEPTOR, PIYDIN PIYDIN 6 T 9.9 Herpes_UL95 pdbhh F Eukaryota F 7nwt 57 EB,FB,GB AA,BB,CC POLG_ENMGO P2A,G SPNPLDVSKTYPTLHILLQFNHRGLEARIFRHGQLWAETHAEVVLRSKTKQISFLSNGSYPSMDATTPLNPWKSTYQAVLRAEPHRVTMDVYHKRIRPFRLPLVQKEWRTCEENVFGLYHVFETHYAGYFSDLLIHDVETNPGGSKHHHHHH 152 T 0.0044 LZ3wCH pdbpssm T Viruses T 7nx5 1 A,B,E,F A,B,E,F BZLF1_EBVB9 EB1,ZEBRA MLEIKRYKNRVASRKSRAKFKQLLQHYREVAAAKSSENDRLRLLLKQMCPSLDVDSIIPRTPD 63 T 0.0012 bZIP_2 pdb T Viruses T 7nyi 1 A A BS222_STAPS Bacteriocin BacSp222 MAGLLRFLLSKGRALYNWAXSHVGKVWEWLKSGATYEQIKEWIENALGWR 50 F F Bacteria T 7nyj 1 A A A0A7M7KVA6_VARDE Odorant Binding Protein 1 from Varoa destructor, form P3<2>21 APQAPASATPAKVPVIEWGKCEQLKPSESERTSKAAVVDKCLQSLPLPDPEKATQQEIDKHRESVTTCALKAEGWFDDEGVYKFDRARNEIKNKKLDSEVEEAVLLKHDACQKEATEKHDDYINQVQLYQACMDYNISQICGIKVMV 147 T 0.0003 PBP_GOBP pdbpssm F Eukaryota T 7nza 1 A,B A,B A0A7M7KVA6_VARDE Odorant Binding Protein from Varroa destructor, form P2<1> APQAPASATPAKVPVIEWGKCEQLKPSESERTSKAAVVDKCLQSLPLPDPEKATQQEIDKHRESVTTCALKAEGWFDDEGVYKFDRARNEIKNKKLDSEVEEAVLLKHDACQKEATEKHDDYINQVQLYQACMDYNISQICGIKVMV 147 T 0.0003 PBP_GOBP pdbpssm F Eukaryota T 7nzf 3 C CCC mutant human collagen type II,259-273 AGFAGEQGPAGEP 13 T 2.8 FokI_C pdbhh F T 7nzh 3 E,F EEE,FFF citrullinated cartilage intermediate layer protein (CILP) peptide 982-996 GKLYGIXDVXSTRD 14 T 4.7 DUF6489 pdbhh F T 7o07 2 B P YAP1_HUMAN Transcriptional coactivator YAP1 XRAHSSPAXLQX 12 T 0.00014 FAM181 unp F Eukaryota T 7o0u 3 WA C A0A143BHR6_9BACT MULTIHEME_CYTC domain-containing protein MVPVSLLTLGACGDAATDTVQVGYRGTAMEQNYDHGDLKTKFAQVKLPQSPPPAGESPPGPLPWKNVQVLNDISIAEFNRTMIAMSTWVAGTGNCAYCHNVAAFQDDTLPNGKPLYTKIVARRMLQMTRNINGNYSQHVKNTGVTCYTCHMGKPLPNGLWFYSSQTDYLRHYLDRDGARVITQGVAPSNANRSSTKQAEWTYALMISQSRSLGVNCTYCHNTRQFASWREAPPARVTAYHGILMLRDVNQNYLAPLQPVYPAVRLGAMGDAPKAQCVTCHNGAYKPLYGAQMAKDFPAMWGRADWNGVPFPGIMRVAADSTKTDSTVVAAPAAAPAQRTSARPGSVTTPVGGVN 354 T 2.7999999999999997E-68 CytoC_RC pdbpercent F Bacteria T 7o0u 4 XA C1 RC-S MPASPSPLPRSSRVRNAAVVVALVAVGLAARGRDAQGTQPPVAPPAAPTATAAPDLAVQDSTKADSTAVADTLMDLSMVMAAEAAAATVTTAPVAVAPTAWPVDPTTGQTLINGRPVVGRVFIMRKTDGTVKYPNVADVVAHEALAPLPPVVGSSYQQAPITNQRRMRGIMIQSTLWDMDRKRSATRQRYYPASTPANQLGQ 202 T 0.42 DUF126 pdbhh F T 7o0v 5 WA C A0A143BHR6_9BACT MULTIHEME_CYTC domain-containing protein MVPVSLLTLGACGDAATDTVQVGYRGTAMEQNYDHGDLKTKFAQVKLPQSPPPAGESPPGPLPWKNVQVLNDISIAEFNRTMIAMSTWVAGTGNCAYCHNVAAFQDDTLPNGKPLYTKIVARRMLQMTRNINGNYSQHVKNTGVTCYTCHMGKPLPNGLWFYSSQTDYLRHYLDRDGARVITQGVAPSNANRSSTKQAEWTYALMISQSRSLGVNCTYCHNTRQFASWREAPPARVTAYHGILMLRDVNQNYLAPLQPVYPAVRLGAMGDAPKAQCVTCHNGAYKPLYGAQMAKDFPAMWGRADWNGVPFPGIMRVAADSTKTDSTVVAAPAAAPAQRTSARPGSVTTPVGGVN 354 T 2.7999999999999997E-68 CytoC_RC pdbpercent F Bacteria T 7o0v 6 XA C1 RC-S MPASPSPLPRSSRVRNAAVVVALVAVGLAARGRDAQGTQPPVAPPAAPTATAAPDLAVQDSTKADSTAVADTLMDLSMVMAAEAAAATVTTAPVAVAPTAWPVDPTTGQTLINGRPVVGRVFIMRKTDGTVKYPNVADVVAHEALAPLPPVVGSSYQQAPITNQRRMRGIMIQSTLWDMDRKRSATRQRYYPASTPANQLGQ 202 T 0.42 DUF126 pdbhh F T 7o0w 3 WA C A0A143BHR6_9BACT MULTIHEME_CYTC domain-containing protein MVPVSLLTLGACGDAATDTVQVGYRGTAMEQNYDHGDLKTKFAQVKLPQSPPPAGESPPGPLPWKNVQVLNDISIAEFNRTMIAMSTWVAGTGNCAYCHNVAAFQDDTLPNGKPLYTKIVARRMLQMTRNINGNYSQHVKNTGVTCYTCHMGKPLPNGLWFYSSQTDYLRHYLDRDGARVITQGVAPSNANRSSTKQAEWTYALMISQSRSLGVNCTYCHNTRQFASWREAPPARVTAYHGILMLRDVNQNYLAPLQPVYPAVRLGAMGDAPKAQCVTCHNGAYKPLYGAQMAKDFPAMWGRADWNGVPFPGIMRVAADSTKTDSTVVAAPAAAPAQRTSARPGSVTTPVGGVN 354 T 2.7999999999999997E-68 CytoC_RC pdbpercent F Bacteria T 7o0w 4 XA C1 RC-S MPASPSPLPRSSRVRNAAVVVALVAVGLAARGRDAQGTQPPVAPPAAPTATAAPDLAVQDSTKADSTAVADTLMDLSMVMAAEAAAATVTTAPVAVAPTAWPVDPTTGQTLINGRPVVGRVFIMRKTDGTVKYPNVADVVAHEALAPLPPVVGSSYQQAPITNQRRMRGIMIQSTLWDMDRKRSATRQRYYPASTPANQLGQ 202 T 0.42 DUF126 pdbhh F T 7o0w 5 YA C2 A0A143BK87_9BACT RC-U MNMHSSDATVSIPDDIDLILVDSVPVNDGIWAWYGIDDDRPMAAWSRFHATRCVEQLAINRARVGAAEWALADVQARGIVPCIAKAAAHLARARAELADWEAQGHRLEAARKVTPGAWTTPVIES 125 T 8 DUF5563 pdbhh F Bacteria T 7o0x 3 WA C A0A143BHR6_9BACT MULTIHEME_CYTC domain-containing protein MVPVSLLTLGACGDAATDTVQVGYRGTAMEQNYDHGDLKTKFAQVKLPQSPPPAGESPPGPLPWKNVQVLNDISIAEFNRTMIAMSTWVAGTGNCAYCHNVAAFQDDTLPNGKPLYTKIVARRMLQMTRNINGNYSQHVKNTGVTCYTCHMGKPLPNGLWFYSSQTDYLRHYLDRDGARVITQGVAPSNANRSSTKQAEWTYALMISQSRSLGVNCTYCHNTRQFASWREAPPARVTAYHGILMLRDVNQNYLAPLQPVYPAVRLGAMGDAPKAQCVTCHNGAYKPLYGAQMAKDFPAMWGRADWNGVPFPGIMRVAADSTKTDSTVVAAPAAAPAQRTSARPGSVTTPVGGVN 354 T 2.7999999999999997E-68 CytoC_RC pdbpercent F Bacteria T 7o0x 4 XA C1 RC-S MPASPSPLPRSSRVRNAAVVVALVAVGLAARGRDAQGTQPPVAPPAAPTATAAPDLAVQDSTKADSTAVADTLMDLSMVMAAEAAAATVTTAPVAVAPTAWPVDPTTGQTLINGRPVVGRVFIMRKTDGTVKYPNVADVVAHEALAPLPPVVGSSYQQAPITNQRRMRGIMIQSTLWDMDRKRSATRQRYYPASTPANQLGQ 202 T 0.42 DUF126 pdbhh F T 7o0x 5 YA C2 A0A143BK87_9BACT RC-U MNMHSSDATVSIPDDIDLILVDSVPVNDGIWAWYGIDDDRPMAAWSRFHATRCVEQLAINRARVGAAEWALADVQARGIVPCIAKAAAHLARARAELADWEAQGHRLEAARKVTPGAWTTPVIES 125 T 8 DUF5563 pdbhh F Bacteria T 7o1f 2 G,H,I,J J,K,M,O G0RZ52_CHATD Peptide fragment from PolD4 KHQSTLNFKHRVTKP 15 T 0.068 DUF4643 unppercent F Eukaryota T 7o2z 3 C P P/A#1 epitope peptide XAPAPAAPA 9 T 240 AGP pdbhh F F 7o30 3 C,F P,Q PAS#1 epitope peptide QAPASPAAPA 10 T 84 DUF4301 pdbhh F F 7o31 4 D P PAS#1 epitope peptide QAPASPAAPA 10 T 84 DUF4301 pdbhh F F 7o33 3 C P APSA epitope peptide QAPSAAPSAAPSA 13 T 15 OGFr_III pdbhh F F 7o3j 3 AA,C,DA,F,GA,I,JA,L,MA,O,PA,R,U,X a,C,d,F,g,I,j,L,m,O,p,R,U,X O50334_ECOLX TrwH protein MKTIIFAILMTGLLSACASAPKPKQPSDFNREPVNKTVPVEIQRGAL 47 T 0.0089 LPAM_1 pdbpssm F Bacteria T 7o4i 24 X Q T2FA_YEAST TFIIF-ALPHA,TFIIF LARGE SUBUNIT,TRANSCRIPTION FACTOR G 105 KDA SUBUNIT,P105 GPGMSRRNPPGSRNGGGPTNASPFIKRDRMRRNFLRMRMGQNGSNSSSPGVPNGDNSRGSLVKKDDPEYAEEREKMLLQIGVEADAGRSNVKVKDEDPNEYNEFPLRAIPKEDLENMRTHLLKFQSKKKINPVTDFHLPVRLHRKDTRNLQFQLTRAEIVQRQKEISEYKKKAEQERSTPNSGGMNKSGTVSLNNTVKDGSQTPTVDSVTKDNTANGVNSSIPTVTGSSVPPASPTTVSAVESNGLSNGSTSAANGLDGNASTANLANGRPLVTKLEDAGPAEDPTKVGMVKYDGKEVTNEPEFEEGTMDPLADVAPDGGGRAKRGNLRRKTRQLKVLDENAKKLRFEEFYPWVMEDFDGYNTWVGSYEAGNSDSYVLLSVEDDGSFTMIPADKVYKFTARNKYATLTIDEAEKRMDKKSGEVPRWLMKHLDNIGTTTTRYDRTRRKLKAVADQQAMDEDDRDDNSEVELDYDEEFADDEEAPIIDGNEQENKESEQRIKKEMLQANAMGLRDEEAPSENEEDELFGEKKIDEDGERIKKALQKTELAALYSSDENEINPYLSESDIENKENESPVKKEEDSDTLSKSKRSSPKKQQKKATNAHVHKEPTLRVKSIKNCVIILKGDKKILKSFPEGEWNPQTTKAVDSSNNASNTVPSPIKQEEGLNSTVAEREETPAPTITEKDIIEAIGDGKVNIKEFGKFIRRKYPGAENKKLMFAIVKKLCRKVGNDHMELKKE 738 T 9.4E-36 TFIIF_alpha pdbhh F Eukaryota T 7o4j 24 X Q T2FA_YEAST TFIIF-ALPHA,TFIIF LARGE SUBUNIT,TRANSCRIPTION FACTOR G 105 KDA SUBUNIT,P105 GPGMSRRNPPGSRNGGGPTNASPFIKRDRMRRNFLRMRMGQNGSNSSSPGVPNGDNSRGSLVKKDDPEYAEEREKMLLQIGVEADAGRSNVKVKDEDPNEYNEFPLRAIPKEDLENMRTHLLKFQSKKKINPVTDFHLPVRLHRKDTRNLQFQLTRAEIVQRQKEISEYKKKAEQERSTPNSGGMNKSGTVSLNNTVKDGSQTPTVDSVTKDNTANGVNSSIPTVTGSSVPPASPTTVSAVESNGLSNGSTSAANGLDGNASTANLANGRPLVTKLEDAGPAEDPTKVGMVKYDGKEVTNEPEFEEGTMDPLADVAPDGGGRAKRGNLRRKTRQLKVLDENAKKLRFEEFYPWVMEDFDGYNTWVGSYEAGNSDSYVLLSVEDDGSFTMIPADKVYKFTARNKYATLTIDEAEKRMDKKSGEVPRWLMKHLDNIGTTTTRYDRTRRKLKAVADQQAMDEDDRDDNSEVELDYDEEFADDEEAPIIDGNEQENKESEQRIKKEMLQANAMGLRDEEAPSENEEDELFGEKKIDEDGERIKKALQKTELAALYSSDENEINPYLSESDIENKENESPVKKEEDSDTLSKSKRSSPKKQQKKATNAHVHKEPTLRVKSIKNCVIILKGDKKILKSFPEGEWNPQTTKAVDSSNNASNTVPSPIKQEEGLNSTVAEREETPAPTITEKDIIEAIGDGKVNIKEFGKFIRRKYPGAENKKLMFAIVKKLCRKVGNDHMELKKE 738 T 9.4E-36 TFIIF_alpha pdbhh F Eukaryota T 7o50 2 C,D H,C GLY-SER-ASN XGSN 4 T 410 DUF3561 pdbhh F F 7o54 2 B B CYNT_SYNE7 CARBONATE DEHYDRATASE GWLAPEQQQRIYRGNAS 17 T 0.42 NPA pdbhh F Bacteria T 7o55 3 C C Inhibitor MI-2231 KKXGXX 6 T 680 zf-RanBP pdbhh F F 7o5b 5 E h MifM-stalling construct GPGTMFVESINDVLFLVDFFTIILPALTAIGIAFLLRECRAGEQWKSKRTDGPGKPIPNPLLGLDSTDFLIIIYHRITTWIRKVFRMNSPVNDEED 96 T 0.03 DUF4231 pdbpercent F T 7o5y 1 A,B,C,D B,C,D,A A0A0B7GNW3_STRSA Type IV pilus biogenesis protein PilA MDHHHHHHDTGQSQTQRMYNYLKAKYTATSGTQLAWGAYLDPVDGNPSSVYAEFDERAHNVDPSTEPIKSTHTFKDGSVAEIEMNGQLVDGLTGPENYNITIKSKSKLAGSNDYYEHIVTFNFDTKGIRSEEGHLRSAQK 140 T 0.0049 DUF3377 unppercent F Bacteria T 7o6n 2 C,D C,D PID3_CAEEL PIRNA BIOGENESIS AND CHROMOSOME SEGREGATION PROTEIN 1,PIRNA-INDUCED SILENCING DEFECTIVE PROTEIN 3 GPDSMWTFDKVLFNSEDIKDSVFKVLHAEEEPRGADQEN 39 T 0.031 Nup35_RRM unphh F Eukaryota T 7o6t 1 A A VIN3_ARATH Protein VERNALIZATION INSENSITIVE 3 KDLGHIVKTIRCLEEEGHIDKSFREDFLTWYSLRATHREVRVVKDFVETFMEDLSSLGQQLVDTFSESILSKK 73 T 3 DUF6429 pdbhh F Eukaryota T 7o6t 2 B B VIN3_ARATH Protein VERNALIZATION INSENSITIVE 3 KDLGHIVKTIRCLEEEGHIDKSFREDFLTWYSLRATHREVRVVKDFVETFMEDLSSLGQQLVDTFSESILSKK 73 T 3 DUF6429 pdbhh F Eukaryota T 7o6u 1 A A VIN3_ARATH Protein VERNALIZATION INSENSITIVE 3 DLGHIVKTIRCLEEEGHIDKSFAEDFLTWYSLRATHREVRVVKIFVETFMEDLSSLGQQLVDTFSESILS 70 T 2.7 DUF6429 pdbhh F Eukaryota T 7o6v 1 A,B,C,D A,B,C,D VIL2_ARATH VERNALIZATION5/VIN3-LIKE PROTEIN 1 GGTESGLEHCVKIIRQLECSGHIDKNFAQDFLTWYSLRATSQEIRVVKDFIDTFIDDPMALAEQLIDTFDDRVSIKR 77 T 0.23 DUF6184 pdb F Eukaryota T 7o6w 1 A,B A,B VIL2_ARATH VERNALIZATION5/VIN3-LIKE PROTEIN 1 GLEHCVKIIRQLECSGHIDKNFRQKFLTWYSLRATSQEIRVVKDFIDTFIDDPMALAEQLIDTFDDRVS 69 T 2.9 DUF6429 pdbhh F Eukaryota T 7o6y 9 I S F2Z673_YARLI Subunit NESM of NADH:Ubiquinone Oxidoreductase (Complex I) MLKLHYRNFITAQHSTTNTTPTMIASVCKRAGLRAGPRAYPGVRQFALRAYNEEKELALKQRLSQLPPPGKAFVTAEGEPRPAKEAELAELAEIAALYKTDRVGILDILLLGNKHARLYRDNTALLKDYYYNGRRILDKIPVKDKQTGKVTWEIKREGAEKEDWVNQMYFLYAPSLILLLIVMVYKSREDITFWAKKELDQRVLDKHPEINDAPENERDALIVERIIAGDYDKLASLQKKATPTPATLI 249 T 0.0013 ESSS pdbpercent F Eukaryota T 7o6y 31 EA Z Q6CI10_YARLI Subunit NUZM of NADH:Ubiquinone Oxidoreductase (Complex I) MLPGGPVPVFKKYTVGSKGIWEKLRVLLAIAPNRSTGNPIVPLYRVPTPGSRPEANVYQDPSSYPTNDIAENPYWKRDHRRAYPQTAFFDQKTVTGLLELGSEATPRIADGEAGTKALANIANGGVSFTQALGKSSKDVIYGEVLTVNGLPPVAPTLAPKQWKIIEGEAAIYPKGYPCRTFH 182 T 0.00017 CI-B14_5a pdbhh F Eukaryota T 7o6y 39 MA i A0A1H6Q311_YARLL Subunit NUUM of NADH:Ubiquinone Oxidoreductase (Complex I) MGGGRYPFPKDVISMTGGWWANPSNWKLNGLFATGIAVGLALWVSTATLPYTRRREGITSESDISKWNAAAGVWRERHGKISTGEAAESE 90 T 6 Sex_peptide pdbhh F Eukaryota T 7o6y 40 NA n Q6C1R9_YARLI Subunit NUNM of NADH:Ubiquinone Oxidoreductase (Complex I) MLRHTVRATQTLRQARNVRFGSHGHGPELTPAVPFFQPYVLKWAGVSLGLVAFYQFNSSYEAKNGHTWVETFFHPKSREDILNEEAKIVQALNNQRELTIKMHELKREEKDYAHSYSPLFSDPVPQGGSIGKAPGSSRE 139 T 0.033 DUF5950 pdb F Eukaryota T 7o71 9 I S F2Z673_YARLI Subunit NESM of NADH:Ubiquinone Oxidoreductase (Complex I) MLKLHYRNFITAQHSTTNTTPTMIASVCKRAGLRAGPRAYPGVRQFALRAYNEEKELALKQRLSQLPPPGKAFVTAEGEPRPAKEAELAELAEIAALYKTDRVGILDILLLGNKHARLYRDNTALLKDYYYNGRRILDKIPVKDKQTGKVTWEIKREGAEKEDWVNQMYFLYAPSLILLLIVMVYKSREDITFWAKKELDQRVLDKHPEINDAPENERDALIVERIIAGDYDKLASLQKKATPTPATLI 249 T 0.0013 ESSS pdbpercent F Eukaryota T 7o71 31 EA Z Q6CI10_YARLI Subunit NUZM of NADH:Ubiquinone Oxidoreductase (Complex I) MLPGGPVPVFKKYTVGSKGIWEKLRVLLAIAPNRSTGNPIVPLYRVPTPGSRPEANVYQDPSSYPTNDIAENPYWKRDHRRAYPQTAFFDQKTVTGLLELGSEATPRIADGEAGTKALANIANGGVSFTQALGKSSKDVIYGEVLTVNGLPPVAPTLAPKQWKIIEGEAAIYPKGYPCRTFH 182 T 0.00017 CI-B14_5a pdbhh F Eukaryota T 7o71 39 MA i A0A1H6Q311_YARLL Subunit NUUM of NADH:Ubiquinone Oxidoreductase (Complex I) MGGGRYPFPKDVISMTGGWWANPSNWKLNGLFATGIAVGLALWVSTATLPYTRRREGITSESDISKWNAAAGVWRERHGKISTGEAAESE 90 T 6 Sex_peptide pdbhh F Eukaryota T 7o71 40 NA n Q6C1R9_YARLI Subunit NUNM of NADH:Ubiquinone Oxidoreductase (Complex I) FGSHGHGPELTPAVPFFQPYVLKWAGVSLGLVAFYQFNSSYEAKNGHTWVETFFHPKSREDILNEEAKIVQALNNQRELTIKMHELKREEKDYAHSYSPLFSDPVPQGGSIGKAPGSSRE 120 T 0.023 DUF5950 pdb F Eukaryota T 7o72 24 X Q T2FA_YEAST TFIIF-ALPHA,TFIIF LARGE SUBUNIT,TRANSCRIPTION FACTOR G 105 KDA SUBUNIT,P105 GPGMSRRNPPGSRNGGGPTNASPFIKRDRMRRNFLRMRMGQNGSNSSSPGVPNGDNSRGSLVKKDDPEYAEEREKMLLQIGVEADAGRSNVKVKDEDPNEYNEFPLRAIPKEDLENMRTHLLKFQSKKKINPVTDFHLPVRLHRKDTRNLQFQLTRAEIVQRQKEISEYKKKAEQERSTPNSGGMNKSGTVSLNNTVKDGSQTPTVDSVTKDNTANGVNSSIPTVTGSSVPPASPTTVSAVESNGLSNGSTSAANGLDGNASTANLANGRPLVTKLEDAGPAEDPTKVGMVKYDGKEVTNEPEFEEGTMDPLADVAPDGGGRAKRGNLRRKTRQLKVLDENAKKLRFEEFYPWVMEDFDGYNTWVGSYEAGNSDSYVLLSVEDDGSFTMIPADKVYKFTARNKYATLTIDEAEKRMDKKSGEVPRWLMKHLDNIGTTTTRYDRTRRKLKAVADQQAMDEDDRDDNSEVELDYDEEFADDEEAPIIDGNEQENKESEQRIKKEMLQANAMGLRDEEAPSENEEDELFGEKKIDEDGERIKKALQKTELAALYSSDENEINPYLSESDIENKENESPVKKEEDSDTLSKSKRSSPKKQQKKATNAHVHKEPTLRVKSIKNCVIILKGDKKILKSFPEGEWNPQTTKAVDSSNNASNTVPSPIKQEEGLNSTVAEREETPAPTITEKDIIEAIGDGKVNIKEFGKFIRRKYPGAENKKLMFAIVKKLCRKVGNDHMELKKE 738 T 9.4E-36 TFIIF_alpha pdbhh F Eukaryota T 7o73 24 X Q T2FA_YEAST TFIIF-ALPHA,TFIIF LARGE SUBUNIT,TRANSCRIPTION FACTOR G 105 KDA SUBUNIT,P105 GPGMSRRNPPGSRNGGGPTNASPFIKRDRMRRNFLRMRMGQNGSNSSSPGVPNGDNSRGSLVKKDDPEYAEEREKMLLQIGVEADAGRSNVKVKDEDPNEYNEFPLRAIPKEDLENMRTHLLKFQSKKKINPVTDFHLPVRLHRKDTRNLQFQLTRAEIVQRQKEISEYKKKAEQERSTPNSGGMNKSGTVSLNNTVKDGSQTPTVDSVTKDNTANGVNSSIPTVTGSSVPPASPTTVSAVESNGLSNGSTSAANGLDGNASTANLANGRPLVTKLEDAGPAEDPTKVGMVKYDGKEVTNEPEFEEGTMDPLADVAPDGGGRAKRGNLRRKTRQLKVLDENAKKLRFEEFYPWVMEDFDGYNTWVGSYEAGNSDSYVLLSVEDDGSFTMIPADKVYKFTARNKYATLTIDEAEKRMDKKSGEVPRWLMKHLDNIGTTTTRYDRTRRKLKAVADQQAMDEDDRDDNSEVELDYDEEFADDEEAPIIDGNEQENKESEQRIKKEMLQANAMGLRDEEAPSENEEDELFGEKKIDEDGERIKKALQKTELAALYSSDENEINPYLSESDIENKENESPVKKEEDSDTLSKSKRSSPKKQQKKATNAHVHKEPTLRVKSIKNCVIILKGDKKILKSFPEGEWNPQTTKAVDSSNNASNTVPSPIKQEEGLNSTVAEREETPAPTITEKDIIEAIGDGKVNIKEFGKFIRRKYPGAENKKLMFAIVKKLCRKVGNDHMELKKE 738 T 9.4E-36 TFIIF_alpha pdbhh F Eukaryota T 7o75 24 X Q T2FA_YEAST TFIIF-ALPHA,TFIIF LARGE SUBUNIT,TRANSCRIPTION FACTOR G 105 KDA SUBUNIT,P105 GPGMSRRNPPGSRNGGGPTNASPFIKRDRMRRNFLRMRMGQNGSNSSSPGVPNGDNSRGSLVKKDDPEYAEEREKMLLQIGVEADAGRSNVKVKDEDPNEYNEFPLRAIPKEDLENMRTHLLKFQSKKKINPVTDFHLPVRLHRKDTRNLQFQLTRAEIVQRQKEISEYKKKAEQERSTPNSGGMNKSGTVSLNNTVKDGSQTPTVDSVTKDNTANGVNSSIPTVTGSSVPPASPTTVSAVESNGLSNGSTSAANGLDGNASTANLANGRPLVTKLEDAGPAEDPTKVGMVKYDGKEVTNEPEFEEGTMDPLADVAPDGGGRAKRGNLRRKTRQLKVLDENAKKLRFEEFYPWVMEDFDGYNTWVGSYEAGNSDSYVLLSVEDDGSFTMIPADKVYKFTARNKYATLTIDEAEKRMDKKSGEVPRWLMKHLDNIGTTTTRYDRTRRKLKAVADQQAMDEDDRDDNSEVELDYDEEFADDEEAPIIDGNEQENKESEQRIKKEMLQANAMGLRDEEAPSENEEDELFGEKKIDEDGERIKKALQKTELAALYSSDENEINPYLSESDIENKENESPVKKEEDSDTLSKSKRSSPKKQQKKATNAHVHKEPTLRVKSIKNCVIILKGDKKILKSFPEGEWNPQTTKAVDSSNNASNTVPSPIKQEEGLNSTVAEREETPAPTITEKDIIEAIGDGKVNIKEFGKFIRRKYPGAENKKLMFAIVKKLCRKVGNDHMELKKE 738 T 9.4E-36 TFIIF_alpha pdbhh F Eukaryota T 7o9k 63 QB UNK UNK XXXXXXXXXXXXXXXXXXXXXXXXXXXX 28 F F F 7o9m 61 NB UNK Unknown residues XXXXXXXXXXXXXXXXXXXXXXXXXX 26 F F F 7o9o 1 A A AWP3b MGSSHHHHHHSSGLVPRGSHMASMTGGQQMGRGSDQTVRSVAGDQRVTDPVIVGDNSILDYYGGSNYDFSNNFEIGRGTLYIGKESYFSSFQSAPTDVPNSFHLLIKNTNNLQNNGQFIIENIKRHANQCSNSSIQVFPINFQNDGEFEIISGGVEGRCCLPTSVIAPQNFLNNGKFYYKVLTDTGSIYSGSCMQNVDIGASTTTTVNNNLWEFTGSINAQINGAVSGAAQINLDGSNMFVNANTFSGQVVNLINGGSFLQTSDPLSNIVVINGLGTSDTGVTSIAVKGKGKSFTYNPSSGIVKLTTVEGKTYAYQIGCGYNTKKFITNNDSGASYESADNFFVLTYSEPYSPQTCQLEN 360 T 2.9 Put_Phosphatase pdbpercent F T 7o9p 1 A A AWP3b MGSSHHHHHHSSGLVPRGSHMASMTGGQQMGRGSDQTVRSVAGDQRVTDPVIVGDNSILDYYGGSNYDFSNNFEIGRGTLYIGKESYFSSFQSAPTDVPNSFHLLIKNTNNLQNNGQFIIENIKRHANQCSNSSIQVFPINFQNDGEFEIISGGVEGRCCLPTSVIAPQNFLNNGKFYYKVLTDTGSIYSGSCMQNVDIGASTTTTVNNNLWEFTGSINAQINGAVSGAAQINLDGSNMFVNANTFSGQVVNLINGGSFLQTSDPLSNIVVINGLGTSDTGVTSIAVKGKGKSFTYNPSSGIVKLTTVEGKTYAYQIGCGYNTKKFITNNDSGASYESADNFFVLTYSEPYSPQTCQLEN 360 T 2.9 Put_Phosphatase pdbpercent F T 7o9q 1 A,B A,B Q6FPN0_CANGA Awp1A MGSSHHHHHHSSGLVPRGSHMASMTGGQQMGRGSLDILTPTTLTGDQTFNEDVSVVSSLTLNDGSQYLFNNLLQIAPSSASVTANALAAVSVFTFSLPPSSSLSNSGTLIISNSNTGPSTEQHIVITPNVMANTGTITLSLAHTNTDSSSTLIIDPVTFYNTGTINYESIGSETNDPSLTGNILSIGSSGRTLQNLGTINLNAANSYYLLGTITENSGSINVQKGFLYVNALDFIGNTINLSTTTALAFISPVSQVVRVRGVFFGNIIASVGSSGTFSYNTQTGILTVTTNGVYSYDIGCGYNPALMSGQQETLSFQGNLYDTFLVLVNQPIPSDLTCAAV 341 T 1.8 Cadherin_4 pdbpssm F Eukaryota T 7o9t 1 A A MEN1_HUMAN Menin AMGLKAAQKTLFPLRSIDDVVRLFAAELGREEPDLVLLSLVLGFVEHFLAVNRVGLTYFPVADLSIIAALYARFTAQIRGAVDLSLYPREGGVSSRELVKKVSDVIWNSLSRSYFKDRAHIQSLFSFITGTKLDSSGVAFAVVGACQALGLRDVHLALSEDHAWVVFGPNGEQTAEVTWHGKGNEDRRGQTVNAGVAERSWLYLKGSYMRCDRKMEVAFMVCAINPSIDLHTDSLELLQLQQKLLWLLYDLGHLERYPMALGNLADLEELEPTPGRPDPLTLYHKGIASAKTYYRDEHIYPYMYLAGYHCRNRNVREALQAWADTATVIQDYNYCREDEEIYKEFFEVANDVIPNLLKEAASLLEAGQSQGSALQDPECFAHLLRFYDGICKWEEGSPTPVLHVGWATFLVQSLGRFEGQVRQKVRIVSGTVAGTARGPEGGSTAQVPAPAASPPPEGPVLTFQSEKMKGMKELLVATKINSSAIKLQLTAQSQ 494 T 1.8E-23 Menin unp F Eukaryota T 7o9x 1 A A MEN1_HUMAN Menin AMGLKAAQKTLFPLRSIDDVVRLFAAELGREEPDLVLLSLVLGFVEHFLAVNRVGLTYFPVADLSIIAALYARFTAQIRGAVDLSLYPREGGVSSRELVKKVSDVIWNSLSRSYFKDRAHIQSLFSFITGTKLDSSGVAFAVVGACQALGLRDVHLALSEDHAWVVFGPNGEQTAEVTWHGKGNEDRRGQTVNAGVAERSWLYLKGSYMRCDRKMEVAFMVCAINPSIDLHTDSLELLQLQQKLLWLLYDLGHLERYPMALGNLADLEELEPTPGRPDPLTLYHKGIASAKTYYRDEHIYPYMYLAGYHCRNRNVREALQAWADTATVIQDYNYCREDEEIYKEFFEVANDVIPNLLKEAASLLEAGQSQGSALQDPECFAHLLRFYDGICKWEEGSPTPVLHVGWATFLVQSLGRFEGQVRQKVRIVSGTVAGTARGPEGGSTAQVPAPAASPPPEGPVLTFQSEKMKGMKELLVATKINSSAIKLQLTAQSQ 494 T 1.8E-23 Menin unp F Eukaryota T 7o9z 1 A A MEN1_HUMAN Menin AMGLKAAQKTLFPLRSIDDVVRLFAAELGREEPDLVLLSLVLGFVEHFLAVNRVGLTYFPVADLSIIAALYARFTAQIRGAVDLSLYPREGGVSSRELVKKVSDVIWNSLSRSYFKDRAHIQSLFSFITGTKLDSSGVAFAVVGACQALGLRDVHLALSEDHAWVVFGPNGEQTAEVTWHGKGNEDRRGQTVNAGVAERSWLYLKGSYMRCDRKMEVAFMVCAINPSIDLHTDSLELLQLQQKLLWLLYDLGHLERYPMALGNLADLEELEPTPGRPDPLTLYHKGIASAKTYYRDEHIYPYMYLAGYHCRNRNVREALQAWADTATVIQDYNYCREDEEIYKEFFEVANDVIPNLLKEAASLLEAGQSQGSALQDPECFAHLLRFYDGICKWEEGSPTPVLHVGWATFLVQSLGRFEGQVRQKVRIVSGTVAGTARGPEGGSTAQVPAPAASPPPEGPVLTFQSEKMKGMKELLVATKINSSAIKLQLTAQSQ 494 T 1.8E-23 Menin unp F Eukaryota T 7oa7 1 A A A0A0B7GRV8_STRSA TYPE IV PILUS PROTEIN MDHHHHHHMKVTIPSGKRYYYAGMGITTPGGKVDIADSKKKSKTRIYTESGWFLSDRAIGQGVSGIVPVGTIGQKGDGTISQTLFPEMPTDFKQLSKLETGIHITDDMRGKYLTFAARAINSYGRVGNYQEADRIWIMGLPVTQNVRLHTDADLALLKNGNTTSLIPTDNQLHTNTEVRDYFNDVVYGATIPVLNYKEPAINQTRQLIALDGRTMQFSNHNFNNGYTTSVLIGNRQQTGPLLTYKLDDTLTWGINLENDGRIAIKTVDTTTANNGGQEYIQNVKLDYSNDNSIQVRSAAKNGSLGIEIFINGQSVYNKTVSLTRNRTTHNISSGQIIFGGNTYINEFAVYTESLNNSNIQKLAEYFRDKYKAS 373 T 0.11 SlpA unppercent F Bacteria T 7oa9 1 A A MEN1_HUMAN Isoform 2 of Menin AMGLKAAQKTLFPLRSIDDVVRLFAAELGREEPDLVLLSLVLGFVEHFLAVNRVGLTYFPVADLSIIAALYARFTAQIRGAVDLSLYPREGGVSSRELVKKVSDVIWNSLSRSYFKDRAHIQSLFSFITGTKLDSSGVAFAVVGACQALGLRDVHLALSEDHAWVVFGPNGEQTAEVTWHGKGNEDRRGQTVNAGVAERSWLYLKGSYMRCDRKMEVAFMVCAINPSIDLHTDSLELLQLQQKLLWLLYDLGHLERYPMALGNLADLEELEPTPGRPDPLTLYHKGIASAKTYYRDEHIYPYMYLAGYHCRNRNVREALQAWADTATVIQDYNYCREDEEIYKEFFEVANDVIPNLLKEAASLLEAGEERPGEQSQGTQSQGSALQDPECFAHLLRFYDGICKWEEGSPTPVLHVGWATFLVQSLGRFEGQVRQKVRIVSGTVAGTARGPEGGSTAQVPAPAASPPPEGPVLTFQSEKMKGMKELLVATKINSSAIKLQLTAQSQ 505 T 2.1999999999999996E-24 Menin unppssm F Eukaryota T 7ob2 1 A A RiLK1 RLKWVRIWRR 10 T 7 SNX17_FERM_C pdbhh F T 7ob5 2 B P LDB1_HUMAN LIM DOMAIN-BINDING PROTEIN 1,LDB-1,CARBOXYL-TERMINAL LIM DOMAIN-BINDING PROTEIN 2,CLIM-2,LIM DOMAIN-BINDING FACTOR CLIM2,HLDB1,NUCLEAR LIM INTERACTOR KSENPTSQASQ 11 T 13 DUF999 pdbhh F Eukaryota T 7ob6 1 A,B A,B CPR-C4 TMITHHHHHHGSMHYKAQLQKLLTTEEKKILARLSTPQKIQDFLDTIKNKDLAEGEHTMWSPRAVLKHKHAHCMEGAMLAALALAYHGHSPLLMDLQTTDEDEDHVVALFKIDGHWGAISKTNHPVLRYRDPIYKSVRELAMSYFHEYFIWWTKKNGGKKTLRAYSNPFDLTRYKPERWVIATGDLDWLAEALDDSKHFPILNKKMQKQLRPASRIETKAASLSEWPKRKTNS 233 T 0.00031 DUF553 pdb F T 7ob7 1 A A CPR-C4 TMITHHHHHHGSMHYKAQLQKLLTTEEKKILARLSTPQKIQDFLDTIKNKDLAEGEHTMWSPRAVLKHKHAHCMEGAMLAALALAYHGHSPLLMDLQTTDEDEDHVVALFKIDGHWGAISKTNHPVLRYRDPIYKSVRELAMSYFHEYFIWWTKKNGGKKTLRAYSNPFDLTRYKPERWVIATGDLDWLAEALDDSKHFPILNKKMQKQLRPASRIETKAASLSEWPKRKTNS 233 T 0.00031 DUF553 pdb F T 7ob8 2 B B LDB1_HUMAN LIM DOMAIN-BINDING PROTEIN 1,LDB-1,CARBOXYL-TERMINAL LIM DOMAIN-BINDING PROTEIN 2,CLIM-2,LIM DOMAIN-BINDING FACTOR CLIM2,HLDB1,NUCLEAR LIM INTERACTOR KSENPTSQASQ 11 T 13 DUF999 pdbhh F Eukaryota T 7obc 2 B B RND3_HUMAN PROTEIN MEMB,RHO FAMILY GTPASE 3,RHO-RELATED GTP-BINDING PROTEIN RHO8,RND3 TDLRKDKAKSC 11 T 34 ALC pdbhh F Eukaryota T 7obd 2 B B RND3_HUMAN PROTEIN MEMB,RHO FAMILY GTPASE 3,RHO-RELATED GTP-BINDING PROTEIN RHO8,RND3 TDLRKDKAKSC 11 T 34 ALC pdbhh F Eukaryota T 7obk 2 B B E2AK2_HUMAN INTERFERON-INDUCED, DOUBLE-STRANDED RNA-ACTIVATED PROTEIN KINASE,EUKARYOTIC TRANSLATION INITIATION FACTOR 2-ALPHA KINASE 2,EIF-2A PROTEIN KINASE 2,INTERFERON-INDUCIBLE RNA-DEPENDENT PROTEIN KINASE,P1/EIF-2A PROTEIN KINASE,PROTEIN KINASE RNA-ACTIVATED,PKR,PROTEIN KINASE R,TYROSINE-PROTEIN KINASE EIF2AK2,P68 KINASE KSPEKNERHTC 11 T 1.3 VEK-30 pdbhh F Eukaryota T 7obl 2 B B E2AK2_HUMAN INTERFERON-INDUCED, DOUBLE-STRANDED RNA-ACTIVATED PROTEIN KINASE,EUKARYOTIC TRANSLATION INITIATION FACTOR 2-ALPHA KINASE 2,EIF-2A PROTEIN KINASE 2,INTERFERON-INDUCIBLE RNA-DEPENDENT PROTEIN KINASE,P1/EIF-2A PROTEIN KINASE,PROTEIN KINASE RNA-ACTIVATED,PKR,PROTEIN KINASE R,TYROSINE-PROTEIN KINASE EIF2AK2,P68 KINASE KSPEKNERHTC 11 T 1.3 VEK-30 pdbhh F Eukaryota T 7obq 3 C s EM14S01-3B_G0054400.mRNA.1.CDS.1 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 62 F F F 7obr 48 VA s DAP2_YEAST DPAP B,YSCV LDKLIRVGIILVLLIWGTVLLLKSIPHHSNTPDYQEPNSNYTNDGKLKVSFSVVRNNTFHPKYHELH 67 T 0.00011 DPPIV_rep unppercent F Eukaryota T 7obs 2 B B RIPK2_HUMAN CARD-CONTAINING INTERLEUKIN-1 BETA-CONVERTING ENZYME-ASSOCIATED KINASE,CARD-CONTAINING IL-1 BETA ICE-KINASE,RIP-LIKE-INTERACTING CLARP KINASE,RECEPTOR-INTERACTING PROTEIN 2,RIP-2,TYROSINE-PROTEIN KINASE RIPK2 PSLNLLQNKSM 11 T 16 FtsK_alpha pdbhh F Eukaryota T 7obt 2 B B RIPK2_HUMAN CARD-CONTAINING INTERLEUKIN-1 BETA-CONVERTING ENZYME-ASSOCIATED KINASE,CARD-CONTAINING IL-1 BETA ICE-KINASE,RIP-LIKE-INTERACTING CLARP KINASE,RECEPTOR-INTERACTING PROTEIN 2,RIP-2,TYROSINE-PROTEIN KINASE RIPK2 PSLNLLQNKSM 11 T 16 FtsK_alpha pdbhh F Eukaryota T 7obv 3 C D Inhibitor MI-2248 XGXXKK 6 T 260 Phage_coatGP8 pdbhh F F 7obx 2 B B SSBP4_HUMAN SINGLE-STRANDED DNA-BINDING PROTEIN 4 ESYSPGMTMSV 11 T 23 SCAB-PH pdbhh F Eukaryota T 7oby 2 B B SSBP4_HUMAN SINGLE-STRANDED DNA-BINDING PROTEIN 4 ESYSPGMTMSV 11 T 23 SCAB-PH pdbhh F Eukaryota T 7oc2 3 C D Cyclic 1[2-CHLORO-4-METHOXY-PHENYL-OXYMETHYL]-4-[2,6-DICHLORO-PHENYL-OXYMETHYL]-BENZENE-(7-3)-7-BENZYL-1,3-DIMETHYL-8-PIPERAZIN-1-YL-3,7-DIHYDRO-PURINE-2,6-DIONE-(7-19)-N-ACETYL-L-CYSTEINE-(8-25)-[3R-[3A,4A,5B(S*)]]-5-(1-CARBOXY-1-PHOSPHONOETHOXY)-4-HYDROXY-3-(PHOSPHONOOXY)-1-CYCLOHEXENE-1-CARBOXYLIC ACID-()-(6E,11E)-HEPTADECA-6,11-DIENE-9,9-DIYLBIS(PHOSPHONIC ACID) XXXKK 5 T 970 UPF0715 pdbhh F F 7oc4 1 A,B A,B ASR6_SARSH XENOVULENE A BIOSYNTHESIS CLUSTER PROTEIN R6 GAMPVTTPTKMATLTTKQMWQTIKDYFGDGFVTGSAPISYNVHTCDMQLQPDSGIHAASDGIHYGVQISEDSMPLFSIMGDTAAPPCTCHRVDEIVKHIDEFLERAPEALPDDGAITSGKPCDTNPDQVSLYAMRDSLSWWVHWGGNLRPEHYWKQIYIGFAAIPDDVQISPREFLDGTYRYLGHTWDDCLSGLEEEGVSPDEIEFANMCMWRQMLTQWLEKADPELLPLLKGKISLMLQYRVLTANTLGCLALFMNATADPKDGPIHYADSSYEMEIASVAQCVTLDMAKEAMGILQGERTEVVAGDRAQRKRELRWIYVRCMQILESQPHAHMLRRYGSAGLHYVPMMDRYLERVSGHTRFPIRDGAARILERFINRAELPKESEDINPNGRSLKVSAKMNGNGQLHHEVNGNAKLHLEAERPDVTTAVG 432 T 19 NETI unphh F Eukaryota T 7oc5 1 A,B A,B ASR6_SARSH XENOVULENE A BIOSYNTHESIS CLUSTER PROTEIN R6 GAMPVTTPTKMATLTTKQMWQTIKDYFGDGFVTGSAPISYNVHTCDMQLQPDSGIHAASDGIHYGVQISEDSMPLFSIMGDTAAPPCTCHRVDEIVKHIDEFLERAPEALPDDGAITSGKPCDTNPDQVSLYAMRDSLSWWVHWGGNLRPEHYWKQIYIGFAAIPDDVQISPREFLDGTYRYLGHTWDDCLSGLEEEGVSPDEIEFANMCMWRQMLTQWLEKADPELLPLLKGKISLMLQYRVLTANTLGCLALFMNATADPKDGPIHYADSSYEMEIASVAQCVTLDMAKEAMGILQGERTEVVAGDRAQRKRELRWIYVRCMQILESQPHAHMLRRYGSAGLHYVPMMDRYLERVSGHTRFPIRDGAARILERFINRAELPKESEDINPNGRSLKVSAKMNGNGQLHHEVNGNAKLHLEAERPDVTTAVG 432 T 19 NETI unphh F Eukaryota T 7oc6 1 A A ASR6_SARSH XENOVULENE A BIOSYNTHESIS CLUSTER PROTEIN R6 MRGSHHHHHHGMASMTGGQQMGRDLYDDDDKDHPFTMPVTTPTKMATLTTKQMWQTIKDYFGDGFVTGSAPISYNVHTCDMQLQPDSGIHAASDGIHYGVQISEDSMPLFSIMGDTAAPPCTCHRVDEIVKHIDEFLERAPEALPDDGAITSGKPCDTNPDQVSLYAMRDSLSWWVHWGGNLRPEHYWKQIYIGFAAIPDDVQISPREFLDGTYRYLGHTWDDCLSGLEEEGVSPDEIEFANMCMWRQMLTQWLEKADPELLPLLKGKISLMLQYRVLTANTLGCLALFMNATADPKDGPIHYADSSYEMEIASVAQCVTLDMAKEAMGILQGERTEVVAGDRAQRKRELRWIYVRCMQILESQPHAHMLRRYGSAGLHYVPMMDRYLERVSGHTRFPIRDGAARILERFINRAELPKESEDINPNGRSLKVSAKMNGNGQLHHEVNGNAKLHLEAERPDVTTAVG 466 T 19 NETI unphh F Eukaryota T 7oc9 1 A A Q6MQ12_BDEBA Bd0675 GGNDFVSRLKALDGREGKIVSSYDDENTGRCRLELQKYELEDGSQGLAVYLQDTGMYFTPSAGLDKETKLKDANTAVVSTSSERPGGDACGDFGGALGYKKVLVLKDNQVTIRETFRCVMDGFKKYDLSTTCQF 134 T 8.1 Fimbrial_PilY2 unphh F Bacteria T 7oca 3 C,H G,E CNIH2_RAT CNIH-2,CORNICHON FAMILY AMPA RECEPTOR AUXILIARY PROTEIN 2,CORNICHON-LIKE PROTEIN MAFTFAAFCYMLTLVLCASLIFFVIWHIIAFDELRTDFKNPIDQGNPARARERLKNIERICCLLRKLVVPEYSIHGLFCLMFLCAAEWVTLGLNIPLLFYHLWRYFHRPADGSEVMYDAVSIMNADILNYCQKESWCKLAFYLLSFFYYLYSMVYTLVSFENLYFQSGGSTETSQVAPAYPYDVPDYA 188 T 2E-13 Cornichon pdbpssm F Eukaryota T 7oce 3 C,H G,E CNIH2_RAT CNIH-2,CORNICHON FAMILY AMPA RECEPTOR AUXILIARY PROTEIN 2,CORNICHON-LIKE PROTEIN MAFTFAAFCYMLTLVLCASLIFFVIWHIIAFDELRTDFKNPIDQGNPARARERLKNIERICCLLRKLVVPEYSIHGLFCLMFLCAAEWVTLGLNIPLLFYHLWRYFHRPADGSEVMYDAVSIMNADILNYCQKESWCKLAFYLLSFFYYLYSMVYTLVSFENLYFQSGGSTETSQVAPAYPYDVPDYA 188 T 2E-13 Cornichon pdbpssm F Eukaryota T 7ocf 3 C,F G,E CNIH2_RAT CNIH-2,CORNICHON FAMILY AMPA RECEPTOR AUXILIARY PROTEIN 2,CORNICHON-LIKE PROTEIN MAFTFAAFCYMLTLVLCASLIFFVIWHIIAFDELRTDFKNPIDQGNPARARERLKNIERICCLLRKLVVPEYSIHGLFCLMFLCAAEWVTLGLNIPLLFYHLWRYFHRPADGSEVMYDAVSIMNADILNYCQKESWCKLAFYLLSFFYYLYSMVYTLVSFENLYFQSGGSTETSQVAPAYPYDVPDYA 188 T 2E-13 Cornichon pdbpssm F Eukaryota T 7oci 9 I I Dolichyl-diphosphooligosaccharide--protein glycosyltransferase subunit OST6 - TM1 XXXXXXXXXXXXXXXXXXXXXXXX 24 F F F 7ock 2 I,J,K,L L,A,K,J ADOM_BPT3 SAM hydrolase MIFTKEPANVFYVLVSAFRSNLCDEVNMSRHRHMVSTLRAAPGLYGSVESTDLTGCYREAISSAPTEEKTVRVRCKDKAQALNVARLACNEWEQDCVLVYKSQTHTAGLVYAKGIDGYKAERLPGSFQEVPKGAPLQGCFTIDEFGRRWQVQHHHHHH 158 T 0.0035 DUF3293 unphh T Viruses T 7oco 1 A,B,C,D B,A,C,D CAPSD_HBVD3 CORE ANTIGEN,CORE PROTEIN,HBCAG,P21.5 MDIDPYKEFGATVELLSFLPSDFFPSVRDLLDTASALYREALESPEHCSPHHTALRQAIVCWGELMTLATWVGVNLEDPASRDLVVSYVNTNMGLKFRQLLWFHISCLTFGRETVIEYLVSFGVWIRTPPAYRPPNAPILSTLPETTVVRRRGRSPRRRTPSPRRRRSQSPRRRRSQSRESQC 183 T 1.5999999999999998E-25 Hepatitis_core pdb T Viruses T 7ocw 1 A,B,C,D B,A,C,D CAPSD_HBVD3 CORE ANTIGEN,CORE PROTEIN,HBCAG,P21.5 MDIDTYKEFGATVELLSFLPSDFFPSVRDLLDTASALYREALESPEHCSPHHTALRQAILCWGELMTLATWVGVNLEDPASRDLVVSYVNTNMGLKFRQLLWFHISCLTFGRETVIEYLVSFGVWIRTPPAYRPPNAPILSTLPETTVVRRRGRSPRRRTPSPRRRRSQSPRRRRSQSRESQC 183 T 1.9E-17 Hepatitis_core unp T Viruses T 7ocz 1 A,B A,B PID3_CAEEL PIRNA BIOGENESIS AND CHROMOSOME SEGREGATION PROTEIN 1,PIRNA-INDUCED SILENCING DEFECTIVE PROTEIN 3 GPDSMPRGADQENMLKISGYPGMLNTFGIAQLLTPYRVNGITITGAQSAVVALENKFQVYQAVQDFNGKKLDRNHKLQVSSLVV 84 T 0.0014 RRM_1 pdbpercent F Eukaryota T 7od2 1 A A K1A_ANEER KAPPA-AITX-AER3A,ANERK,POTASSIUM CHANNEL TOXIN AETX K ACKDYLPKSECTQFRCRTSMKYKYTNCKKTCGTC 34 T 0.0073 ShK pdbpercent F Eukaryota T 7od6 2 E,F F,E Inhibitory Peptide P2 (GSLLGRMKGA) XXXXXXGSLLGRMKGA 16 T 6.6 Aconitase_B_N pdbhh F T 7od7 2 E E SLLRGM XXXXXXSLLRGM 12 T 39 Leu_leader pdbhh F T 7od8 1 A,B,C,D B,A,C,D CAPSD_HBVD3 CORE ANTIGEN,CORE PROTEIN,HBCAG,P21.5 MDIDPYKEFGATVELLSFLPSDFFPSVRDLLDTASALYREALESPEHCSPHHTALRQAIVCWGELMTLATWVGVNLEDPASRDLVVSYVNTNMGLKFRQLLWFHISCLTFGRETVIEYLVSFGVWIRTPPAYRPPNAPILSTLPETTVVRRRGRSPRRRTPSPRRRRSQSPRRRRSQSRESQC 183 T 1.5999999999999998E-25 Hepatitis_core pdb T Viruses T 7od8 2 E,F E,F peptide GSLLGRMKGA XXXXXXXXXXGSLLGRMKGA 20 T 11 Aconitase_B_N pdbhh F T 7odv 3 C,F CCC,FFF IDA_ARATH PROTEIN INFLORESCENCE DEFICIENT IN ABSCISSION YVPIPPSAPSKRHN 14 T 0.34 Disulph_isomer pdbhh F Eukaryota T 7odx 1 A A PURZ_BPS2L Succinoaminodeoxyadenylate synthetase (PurZ) GTGDGSMLSIPPYYRVKNCNLIVDCQYGSTGKGLLAGYLGALEAPQVLCMAPSPNAGHTLVEEDGTARVHKMLPLGITSPSLERIYLGPGSVIDMDRLLEEYLALPRQVELWVHQNAAVVLQEHRDEEAAGGLAPGSTRSGAGSAFIAKIRRRPGTLLFGEAVRDHPLHGVVRVVDTRTAQDMLFRTRSIQAEGCQGYSLSVHHGAYPYCTARDVTTAQLIADCGLPYDVARIARVVGSMRTYPIRVANRPEAGEWSGPCYPDSVECQFADLGLEQEYTTVTKLPRRIFTFSAIQAHEAIAQNGVDEVFLNFAQYPPSLGALEDILDAIEARAEVTYVGFGPKVTDVYHTPTRAELEGLYARYRR 365 T 1.9999999999999999E-56 Adenylsucc_synt pdbpssm T Viruses T 7oe2 2 F,G,H,I,J 1,2,3,4,5 D0LZ73_HALO1 Haliangium ochraceum Encapsulated ferritin localisation sequence MSSEQLHEPAELLSEETKNMHRALVTLIEELEAVDWYQQRADACSEPGLHDVLIHNKNEEVEHAMMTLEWIRRRSPVFDAHMRTYLFTERPILELEEEDTGSSSSVAASPTSAPSHGSLGIGSLRQEGKED 131 T 0.00024 Rubrerythrin pdbpercent F Bacteria T 7oec 1 A A DP2L_PYRHO POL II,EXODEOXYRIBONUCLEASE LARGE SUBUNIT SGNAFPGDTRILVQINGTPQRVTLKELYELFDEEHYESMVYVRKKPKVDIKVYSFNPEEGKVVLTDIEEVIKAPATDHLIRFELELGSSFETTVDHPVLVYENGKFVEKRAFEVREGNIIIIIDESTLEPLKVAVKKIEFIEPPEDFVFSLNAKKYHTVIINENIVTHQ 169 T 1.4E-08 Intein_splicing pdbhh F Archaea T 7oen 1 A,B,C,D B,A,C,D CAPSD_HBVD3 CORE ANTIGEN,CORE PROTEIN,HBCAG,P21.5 MDIDTYKEFGATVELLSFLPSDFFPSVRDLLDTASALYREALESPEHCSPHHTALRQAILCWGELMTLATWVGVNLEDPASRDLVVSYVNTNMGLKFRQLLWFHISCLTFGRETVIEYLVSFGVWIRTPPAYRPPNAPILSTLPETTVVRRRGRSPRRRTPSPRRRRSQSPRRRRSQSRESQC 183 T 1.9E-17 Hepatitis_core unp T Viruses T 7oen 2 E,F E,F GSLLGRMKGA XXXXXXXXXXGSLLGRMKGA 20 T 11 Aconitase_B_N pdbhh F T 7oeu 2 B,C,D,E,F 1,2,3,4,5 D0LZ73_HALO1 Haliangium ochraceum encapsulated ferritin MSSEQLHEPAELLSEETKNMHRALVTLIEELEAVDWYQQRADACSEPGLHDVLIHNKNEEVEHAMMTLEWIRRRSPVFDAHMRTYLFTERPILELEEEDTGSSSSVAASPTSAPSHGSLGIGSLRQEGKED 131 T 0.00024 Rubrerythrin pdbpercent F Bacteria T 7oev 2 E,F E,F GSLLGRMKGA XXXXXXXXXXGSLLGRMKGA 20 T 11 Aconitase_B_N pdbhh F T 7oew 2 E,F E,F MHRSLLGRMKGA XXXXXXXXXXXXMHRSLLGRMKGA 24 T 19 Aconitase_B_N pdbhh F T 7ofm 1 A A BAK_HUMAN APOPTOSIS REGULATOR BAK,BCL-2-LIKE PROTEIN 7,BCL2-L-7 SLGNGPILNVLVVLGVVLLGQFVVRRFFKS 30 T 3.5 FeoB_associated pdbhh F Eukaryota T 7ofo 1 A A BAK_HUMAN APOPTOSIS REGULATOR BAK,BCL-2-LIKE PROTEIN 7,BCL2-L-7 SLGNGPILNVLVVLGVVLLGQFVVRRFFKS 30 T 3.5 FeoB_associated pdbhh F Eukaryota T 7ofv 2 B B EphA4 agonist ligand XXXXGX 6 T 180 Bac_GH3_C pdbhh F F 7og1 5 E,H GGG,DDD FCHO2_HUMAN F-BAR domain only protein 2 GSPEFNIPDVDEEGYSIKPETNQNDTKENHFYSSSDSDSEDEEPKKYRIEIKPMHPNNSHHTMASLDELKVSIGNITLSPAISRHSPVQMNRNLSNEELTKSKPSAPPNEKGTSDLLAWDPLFGPSLDSSSSSSLTEFPGRPHHHHHHHHHH 152 T 94 Nas2_N pdbhh F Eukaryota T 7og1 6 G PPP TGN38 CARGO PEPTIDE DYQRLN 6 T 30 Fer4_24 pdbhh F T 7og2 1 A,B A,B A0A166WMK8_9GAMM Amine oxidoreductase MTHYTFGKEITDKQLPSQVKVAIVGAGMSGLYSAWRLQQEANCQDLAIFERSDRTGGRLDSDLIEFKNLRSDEPKTITVKEEQGGMRFLFDGMDDLMALFLKLNLQDDIVPFPMNSGGNNRLFFRGESFSVSDAQQDDYAIWSHLYNLDQSEQGVNPKDIVNVVFNRILEANPQFQQRPKVRGPQFWQDFRLECQWKGQGLNQWTLWDLYTDMGYSQECITMLYRVLGFNGTFLSQMNAGVAYQLLEDFPAGVKFKTFKDGFSTLPNKLVEEVGTNNIHLQTTIEEIDFNEESGLYELSYAHIDAHGKIHKGLVKAEKVILGLPRLALEKLFVRSNVINRLDQDRSELLWNTLQSASNQPLLKINLYYDSAWWGRGTTGRPAVEFGPNFADLPTGSVYPFYAVNEELAAALMYEERTTHPSDAVEAKLERIGNDKYERPAALTIYCDYLNINFWSNLQNIGETYHNPKQDHYVENVPDDIYPASTAVVEQATRFFKDIFNTHYVPAPVLTSARIWEGSVKFDIPANRQFGYGVHQWAVGANDKEVMATLSEPLPNLFTCGEAFSDYQGWVEGALRSTDLALEKGFGLKPLSQAYFESTHISSSDAIKAVYEENSSKLINQYIETNFAASAAPIEKADDEQSVIGVNLSYFDVK 653 T 9.2E-21 Amino_oxidase pdbhh F Bacteria T 7ogo 3 C,F CCC,FFF IDL1_ARATH Protein IDA-LIKE 1 YVLVPPSGPSMRHN 14 T 0.021 Sperm_Ag_HE2 unp F Eukaryota T 7ogp 2 B B Q8SD94_BPDPK PHIKZ068 MEIIVTGVQGTGFTEVATEHNGKRLTWTTTAYSKIRVQDQQRVFQEINDYWSGLSAEAQQHIWNCYVEIRKIMDMAMHPMRIAMSLSYYIKEMYKAMPMNSFRRWLLTIGKLYIPVDIEEVITDDSRYNRPDQTYLKHDYINLASVSLALRPLVPIWGEFIDQGTSQEMHKECEVISLISDCEVNHWPVDEISIDGTPVETAYDKLSAYVKFCVEDEAPTLANLYRGMSSAEVPDILQAKVMVRRLTILPLNDATSHSIVSNMFRYVKSNLNPAERSTADRVNDKRPDKGGIDDDDKTSFIESHKTKQRVTPGDIVAYNLDALDVVKLVHKIDDTVPVELIQECLDCVAVTATKDIYPHQILLAQWVMHKAFPARAFSHINKNAVNHLLAAAQSLMWHWGFQQVAVFMQVELYYSGEHAMSIQPRNSTRIQIKYKDVMDELYPHQRQQRAINGVPVAPVNIAGIAVQSAHASIRSSNWIYHGPDRLFKEAEQVTQNKVLVVPATIKSVITELVIHLGKLNQ 521 T 0.16 FF pdbpercent T Viruses T 7ogp 5 E E Q8SD39_BPDPK PHIKZ123 MPDPFLIEKIRENTPCMNPTLANGITVEHTMTRDPNTGVNMTRRYIDSLFDISSVLFPDGFKYEGNRACTPLKHFEEITREYNAKRIANIAPTDMYMIDLMFSYKGEMLYPRPMLLPAFKRGNMVTINGAKYIGSPVLTDVGFSVLNDSIFIPFRRTKLTFKQTDHHYMCNGQRKIMYVIWSQIHNEMAKRTKRDLGNRPHIESCLAHYFFCQFGVTQTFKQWANVDVKCGLLSDFPEEEYPREKWNIYSSATLKGKHPTGEMVLVIPRHQESIFATRLIAGFWYVVDAFPMRFTRPEYVDSTNLWRVILGHMVFGDFEHQGKVEENIDSHLHSFCNSLDEMTIEELKTVGVNVSTIWELLYEIMTSLAHHLYATDIDETSMYGKRLTVLHYLMSEFNYAVSMFGYMFQSRRDREWTVQELNEGLKRSFKLQTAIKRLTVDHGELDTMSNPNSSMLIKGTSILVTQDRAKTAKAHNKSLINDSSRIIHASIAEVGQYKNQPKNNPDGRGRLNMYTKVGPTGLVERREEVREIIDNAQLMFRAK 543 T 0.4 OGG_N unp T Viruses T 7ogq 3 C CCC IDL2_ARATH Protein IDA-LIKE 2 YVPVPASGPSRKHN 14 T 0.033 RSN1_TM unp F Eukaryota T 7ogr 1 A X UNK helices AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 32 T 2100 Chorion_S16 pdbhh F F 7ogr 3 C B Q8SD94_BPDPK PHIKZ068 MEIIVTGVQGTGFTEVATEHNGKRLTWTTTAYSKIRVQDQQRVFQEINDYWSGLSAEAQQHIWNCYVEIRKIMDMAMHPMRIAMSLSYYIKEMYKAMPMNSFRRWLLTIGKLYIPVDIEEVITDDSRYNRPDQTYLKHDYINLASVSLALRPLVPIWGEFIDQGTSQEMHKECEVISLISDCEVNHWPVDEISIDGTPVETAYDKLSAYVKFCVEDEAPTLANLYRGMSSAEVPDILQAKVMVRRLTILPLNDATSHSIVSNMFRYVKSNLNPAERSTADRVNDKRPDKGGIDDDDKTSFIESHKTKQRVTPGDIVAYNLDALDVVKLVHKIDDTVPVELIQECLDCVAVTATKDIYPHQILLAQWVMHKAFPARAFSHINKNAVNHLLAAAQSLMWHWGFQQVAVFMQVELYYSGEHAMSIQPRNSTRIQIKYKDVMDELYPHQRQQRAINGVPVAPVNIAGIAVQSAHASIRSSNWIYHGPDRLFKEAEQVTQNKVLVVPATIKSVITELVIHLGKLNQ 521 T 0.16 FF pdbpercent T Viruses T 7ogr 6 F E Q8SD39_BPDPK PHIKZ123 MPDPFLIEKIRENTPCMNPTLANGITVEHTMTRDPNTGVNMTRRYIDSLFDISSVLFPDGFKYEGNRACTPLKHFEEITREYNAKRIANIAPTDMYMIDLMFSYKGEMLYPRPMLLPAFKRGNMVTINGAKYIGSPVLTDVGFSVLNDSIFIPFRRTKLTFKQTDHHYMCNGQRKIMYVIWSQIHNEMAKRTKRDLGNRPHIESCLAHYFFCQFGVTQTFKQWANVDVKCGLLSDFPEEEYPREKWNIYSSATLKGKHPTGEMVLVIPRHQESIFATRLIAGFWYVVDAFPMRFTRPEYVDSTNLWRVILGHMVFGDFEHQGKVEENIDSHLHSFCNSLDEMTIEELKTVGVNVSTIWELLYEIMTSLAHHLYATDIDETSMYGKRLTVLHYLMSEFNYAVSMFGYMFQSRRDREWTVQELNEGLKRSFKLQTAIKRLTVDHGELDTMSNPNSSMLIKGTSILVTQDRAKTAKAHNKSLINDSSRIIHASIAEVGQYKNQPKNNPDGRGRLNMYTKVGPTGLVERREEVREIIDNAQLMFRAK 543 T 0.4 OGG_N unp T Viruses T 7ogu 3 C,F,I,L CCC,FFF,III,LLL CLE9_ARATH CLAVATA3/ESR (CLE)-related protein 9 RLVPSGPNPLHN 12 T 21 DUF502 pdbhh F Eukaryota T 7ogz 3 C,F CCC,FFF IDL3_ARATH PEPTIDE FROM PROTEIN IDA-LIKE 3 PVPTSGPSRKHN 12 T 2.3 Disulph_isomer pdbhh F Eukaryota T 7ohi 2 B B FCHO1_HUMAN F-BAR domain only protein 1 QSEEQVSKNLFGPPLESAFDHED 23 T 4.4 CDI pdbhh F Eukaryota T 7oiq 2 C,D CCC,DDD FCHO2_HUMAN F-BAR domain only protein 2 SDLLAWDPLFG 11 T 1.1 DUF1871 pdbhh F Eukaryota T 7oit 2 B BBB FCHO2_HUMAN F-BAR domain only protein 2 SDLLAWDPLFG 11 T 1.1 DUF1871 pdbhh F Eukaryota T 7oj1 1 A A IMDH_BACSU Inosine-5'-monophosphate dehydrogenase WESKFSKEGLTFDDVLLVPAKSEVLPRDVDLSVELTKTLKLNIPVISAGMDTVTESAMAIAMARQGGLGIIHKNMSIEQQAEQVDKVKRSERGITNPFFLTPDHQVFDAEHLMGKRISGVPIEEDLVGIITNRDLRFISMKISDVMTKEELVTASVGTTLDEAEKILQKHKIEKLPLVGLITIKDIEKVIEFPNSSKDIHGRLIVGAAVGVTGDTMTRVKKLVEANVDVIVIDTAHGHSQGVLNTVTKIRETYPELNIIAGNVATAEATRALIEAGADVVKVGIGPICTTRVVAGVGVPQITAIYDCATEARKHGKTIIADGGIKFSGDITKALAAGGHAVMLGSLLAGTSESPGETPYKGPVEETVYQLVGGLRSGMGYCGSKDLRALREEAQFIRMTGA 401 T 1.7E-11 IMPDH pdb F Bacteria T 7oj9 1 A B POLN_EEEV1 EEEV nsP3 peptide AERLIPRRPAPPVPVPARIPSPR 23 T 31 EspF pdbhh T Viruses T 7oju 2 B H MVNAL Peptide MVNAL 5 T 120 DUF4213 pdbhh F F 7ok6 2 C,D BBB,CCC LCK peptide GCGCSSHPED 10 T 1.2 EPV_E5 pdbhh F T 7ok9 2 M,N,O,P,Q,R,S,T,U,V P,Q,R,S,T,U,V,W,X,Y pentaglycine GGGGG 5 T 56 Parvo_coat pdbhh F F 7oke 2 B B ALA-TRP-VAL-ILE-PRO-ALA AWVIPA 6 T 0.39 DUF3950 pdbhh F F 7okf 2 B B ALA-TRP-VAL-ILE-PRO-ALA AWVIPA 6 T 0.39 DUF3950 pdbhh F F 7okg 2 B B ALA-TRP-VAL-ILE-PRO-ALA AWVIPA 6 T 0.39 DUF3950 pdbhh F F 7okh 2 B B ALA-TRP-VAL-ILE-PRO-ALA AWVIPA 6 T 0.39 DUF3950 pdbhh F F 7oki 2 B B ALA-TRP-VAL-ILE-PRO-ALA AWVIPA 6 T 0.39 DUF3950 pdbhh F F 7okj 2 B B ALA-TRP-VAL-ILE-PRO-ALA AWVIPA 6 T 0.39 DUF3950 pdbhh F F 7okk 2 B B ALA-TRP-VAL-ILE-PRO-ALA AWVIPA 6 T 0.39 DUF3950 pdbhh F F 7okl 2 B B ALA-TRP-VAL-ILE-PRO-ALA AWVIPA 6 T 0.39 DUF3950 pdbhh F F 7okm 2 B B ALA-TRP-VAL-ILE-PRO-ALA AWVIPA 6 T 0.39 DUF3950 pdbhh F F 7oko 2 AB,BA,C,F,FB,GA,K,KB,LA,QA,R,VA,W n,O,AC,2,s,T,7,x,Y,d,E,i,J TraB PGMMDSQEFS 10 T 2.1 NADPH_Ox pdbhh F T 7old 31 EA LW G0S1P9_CHATD 60S ribosomal protein L24-like protein MRTYEDTFSGQRIYPGKVRFPISHEGDNGDISHPEEIRTGRRKIAPATRQLRAEVQKTSMKGKLYVRGDSKIFRFQNGKSESLFLQRKNPRRIAWTVLYRRQHRKGISEEVAKKRTRRTIKSQRAIVGASLEVIKERRSMRPEARNAARLAAIKESKEKKAAAQAAKKAEKAKNAAAAAKGQPQGRVTSKQGAKGAPVKVAAKSR 205 T 0.03 Ribosomal_L24e pdbpssm F Eukaryota T 7ole 3 G H TTI1_HUMAN PROTEIN SMG10 MAVFDTPEEAFGVLRPVCVQLTKTQTVENVEHLQTRLQAVSDSALQELQQYILFPLRFTLKTPGPKRERLIQSVVECLTFVLSSTCVKEQELLQELFSELSACLYSPSSQKPAAVSEELKLAVIQGLSTLMHSAYGDIILTFYEPSILPRLGFAVSLLLGLAEQEKSKQIKIAALKCLQVLLLQCDCQDHPRSLDELEQKQLGDLFASFLPGISTALTRLITGDFKQGHSIVVSSLKIFYKTVSFIMADEQLKRISKVQAKPAVEHRVAELMVYREADWVKKTGDKLTILIKKIIECVSVHPHWKVRLELVELVEDLLLKCSQSLVECAGPLLKALVGLVNDESPEIQAQCNKVLRHFADQKVVVGNKALADILSESLHSLATSLPRLMNSQDDQGKFSTLSLLLGYLKLLGPKINFVLNSVAHLQRLSKALIQVLELDVADIKIVEERRWNSDDLNASPKTSATQPWNRIQRRYFRFFTDERIFMLLRQVCQLLGYYGNLYLLVDHFMELYHQSVVYRKQAAMILNELVTGAAGLEVEDLHEKHIKTNPEELREIVTSILEEYTSQENWYLVTCLETEEMGEELMMEHPGLQAITSGEHTCQVTSFLAFSKPSPTICSMNSNIWQICIQLEGIGQFAYALGKDFCLLLMSALYPVLEKAGDQTLLISQVATSTMMDVCRACGYDSLQHLINQNSDYLVNGISLNLRHLALHPHTPKVLEVMLRNSDANLLPLVADVVQDVLATLDQFYDKRAASFVSVLHALMAALAQWFPDTGNLGHLQEQSLGEEGSHLNQRPAALEKSTTTAEDIEQFLLNYLKEKDVADGNVSDFDNEEEEQSVPPKVDENDTRPDVEPPLPLQIQIAMDVMERCIHLLSDKNLQIRLKVLDVLDLCVVVLQSHKNQLLPLAHQAWPSLVHRLTRDAPLAVLRAFKVLRTLGSKCGDFLRSRFCKDVLPKLAGSLVTQAPISARAGPVYSHTLAFKLQLAVLQGLGPLCERLDLGEGDLNKVADACLIYLSVKQPVKLQEAARSVFLHLMKVDPDSTWFLLNELYCPVQFTPPHPSLHPVQLHGASGQQNPYTTNVLQLLKELQXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGXGXXXXXXXXXXXXXXXXXXXXXXXXXGXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 1733 T 0.00097 Proteasom_PSMB pdbhh F Eukaryota T 7ole 4 H J TTI2_HUMAN TELO2-interacting protein 2,TELO2-interacting protein 2,TTI2 MELDSALEAPSQEDSNLSEELSHSAFGQAFSKILHCLARPEARRGNVKDAVLKDLGDLIEATEFDRLFEGTGARLRGMPETLGQVAKALEKYAAPSKEEEGGGDGHSEAAEKAAQVGLLFLKLLGKVETAKNSLVGPAWQTGLHHLAGPVYIFAITHSLEQPWTTPRSREVAREVLTSLLQVTECGSVAGFLHGENEDEKGRLSVILGLLKPDLYKESWKNNPAIKHVFSWTLQQVTRPWLSQHLERVLPASLVISDDYQTENKILGVHCLHHIVLNVPAADLLQYNRAQVLYHAISNHLYTPEHHLIQAVLLCLLDLFPILEKTLHWKGDGARPTTHCDEVLRLILTHMEPEHRLLLRRTYARNLPAFVNRLGILTVRHLKRLERVIIGYLEVYDGPEEEARLKILETLKLLMQHTWPRVSCRLVVLLKALLKLICDVARDPNLTPESVKSALLQEATDCLILLDRCSQGRVKGLLAKIPQSCEDRKVVNYIRKVQQVSEGAPYNGTXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGXGXXXXXXXXXXXXXXXXXXXXXXXXXGXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGXXGXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 964 T 2.1 Tti2 pdbpssm F Eukaryota T 7ole 5 I K TELO2_HUMAN PROTEIN CLK-2 HOMOLOG,HCLK2,PROTEIN CLK-2 HOMOLOG,HCLK2,PROTEIN CLK-2 HOMOLOG,HCLK2,PROTEIN CLK-2 HOMOLOG,HCLK2 MEPAPSEVRLAVREAIHALSSSEDGGHIFCTLESLKRYLGEMEPPALPREXXXXXXXXXXKEEFASAHFSPVLRCLASRLSPAWLELLPHGRLEXXXXXXXXELWASFFLEGPADQAFLVLMETIEGAAGPSFRLMKMARLLARFLREGRLAVLMEAQCRQQTQPGFILLRETLLGKVVXXXXXXXXXXXXXXXXXXALPDHLGNRLQQENLAEFFPQNYFRLLGEEVVRVLQAVVDSLQGGLDSSVSFVSQVLGKACVHGRQQEILGVLVPRLAALTQGSYLHQRVCWRLVEQVPDRAMEAVLTGLVEAALGPEVLSRLLGNLVVKNKKAQFVMTQKLLFLQSRLTTPMLQSLLGHLAMDSQRRPLLLQVLKELLETWGSSSAIRHTPLPQQRHVSKAVLICLAQLGEPELRDSRDELLASMMAGVKCRLDSSLPPVRRLGMIVAEVVSARIHPEGPPLKFQYEEDELSLELLALASPQPAGDGASEAGT 491 T 0.0069 Ribosomal_60s pdbpssm F Eukaryota T 7olg 2 C,D C,D DYST_HUMAN 11MACF KPSKIPTPQRK 11 T 4.6 DUF3697 pdbhh F Eukaryota T 7oln 1 A AAA B4EH86_BURCJ Lectin GHMPLLSASIVSAPVVTSETYVDIPGLYLDVAKAGIRDGKLQVILNVPTPYATGNNFPGIYFAIATNQGVVADGCFTYSSKVPESTGRMPFTLVATIDVGSGVTFVKGQWKSVRGSAMHIDSYASLSAIWGTAA 134 T 0.002 DUF1543 unppercent F Bacteria T 7olu 1 A AAA B4EH86_BURCJ Lectin GHMPLLSASIVSAPVVTSETYVDIPGLYLDVAKAGIRDGKLQVILNVPTPYATGNNFPGIYFAIATNQGVVADGCFTYSSKVPESTGRMPFTLVATIDVGSGVTFVKGQWKSVRGSAMHIDSYASLSAIWGTAA 134 T 0.002 DUF1543 unppercent F Bacteria T 7olw 1 A AAA B4EH86_BURCJ Lectin GHMPLLSASIVSAPVVTSETYVDIPGLYLDVAKAGIRDGKLQVILNVPTPYATGNNFPGIYFAIATNQGVVADGCFTYSSKVPESTGRMPFTLVATIDVGSGVTFVKGQWKSVRGSAMHIDSYASLSAIWGTAA 134 T 0.002 DUF1543 unppercent F Bacteria T 7onb 5 E F UNK AAAAARAARAAAWRAEQAAAA 21 T 16 DUF5308 pdbhh F F 7oo3 18 R x CSB element XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 35 F F F 7oop 16 P R LEO1 helix XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 40 F F F 7oow 2 B B INHIBITOR ARC-1415 XXXXXXXXX 9 F F F 7oox 2 B B Inhibitor ARC-3126 XXXXXXXXX 9 T 24 Mucin15 pdbhh F F 7op0 3 C C K92chemFE TCPEGWSECGVAIYGYACGRWGCGHFLNSGPNISP 35 T 0.14 Toxin_4 pdbhh F T 7opb 2 D,E,F D,E,F IL7R binder SVIEKLRKLEKQARKQGDEVLVMLARMVLEYLEKGWVSEEDADESADRIEEVLKK 55 T 0.89 TyeA pdbhh F T 7opc 16 P R LEO1 helix XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 40 F F F 7opd 16 P R LEO1 helix XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 40 F F F 7opm 2 B B P28 MQLXLDSSNLARRRRRRR 18 T 6.9 UCMA pdbhh F T 7opm 3 C C ORF45_HHV8P Protein ORF45 RPPVKFIFPPPPLS 14 T 0.98 AIM3 pdbhh T Viruses T 7opo 2 B,D,F,H,J,L B,D,F,H,J,L ORF45_HHV8P Protein ORF45 GSRMLPIEGAPRRRPPVKFIFPPPPLSSLPGFGRPRGYAGPTVIDMSAPDDVFAEDTPSPPAT 63 T 43 Corona_NS1 pdbhh T Viruses T 7oq4 14 N Z RIP_ATV RIP MKNMLHPQKYETHVLDDLMEFYEGVIGYPEIDLRLAGEEAWLKGVNPELAEAVKKIIKTIRRYLEGSPYDGSEKPIPRYIIAEIFSQIAPEVQLLVNALDTEGKYGFLKHIKKLNLNSLAMLSKNYNENDKLWKELENEGYVYLELVPR 149 T 0.11 MnmE_helical pdbpssm T Viruses T 7oqe 17 Q D PRP39_YEAST Pre-mRNA-processing factor 39 MPDETNFTIEDIEPRPDALRGLDTQFLQDNTALVQAYRGLDWSDISSLTQMVDVIEQTVVKYGNPNDSIKLALETILWQILRKYPLLFGFWKRFATIEYQLFGLKKSIAVLATSVKWFPTSLELWCDYLNVLCVNNPNETDFIRNNFEIAKDLIGKQFLSHPFWDKFIEFEVGQKNWHNVQRIYEYIIEVPLHQYARFFTSYKKFLNEKNLKTTRNIDIVLRKTQTTVNEIWQFESKIKQPFFNLGQVLNDDLENWSRYLKFVTDPSKSLDKEFVMSVFDRCLIPCLYHENTWMMYIKWLTKKNISDEVVVDIYQKANTFLPLDFKTLRYDFLRFLKRKYRSNNTLFNNIFNETVSRYLKIWPNDILLMTEYLCMLKRHSFKNSLDQSPKEILEKQTSFTKILETSITNYINNQIDAKVHLQTLINDKNLSIVVVELIKTTWLVLKNNMQTRKYFNLYQKNILIKNSVPFWLTYYKFEKSNVNFTKLNKFIRELGVEIYLPTTVMNDILTDYKTFYLTHSNIVTYESSIIDSNTFDPILYPELKMSNPKYDPVLNTTANVDWHKKTEWKEAGHIGITTERPQISNSIIECNSGTLIQKPISLPNFRNLEKINQVKINDLYTEEFLKEGK 629 F F Eukaryota T 7oqg 2 B P AMOT_HUMAN ANGIOMOTIN GHVRSLSERLMQM 13 T 2.8E-05 Macoilin unphh F Eukaryota T 7oqj 2 B P AMOT_HUMAN ANGIOMOTIN GHVRSLSERLMQM 13 T 2.8E-05 Macoilin unphh F Eukaryota T 7oqs 2 B P AMOT_HUMAN ANGIOMOTIN GHVRSLSERLMQM 13 T 2.8E-05 Macoilin unphh F Eukaryota T 7oqu 2 B P AMOT_HUMAN ANGIOMOTIN GHVRSLSERLMQM 13 T 2.8E-05 Macoilin unphh F Eukaryota T 7oqv 1 A,B,C,D AAA,BBB,CCC,DDD VIN3_ARATH Protein VERNALIZATION INSENSITIVE 3 GDKDLGHIVKTIRCLEEEGHIDKSFRERFLTWYSLRATHREVRVVKDFVETFMEDLSSLGQQLVDTFSESILSKR 75 T 3.3 DUF6495 pdbhh F Eukaryota T 7oqw 2 B P AMOT_HUMAN ANGIOMOTIN GHVRSLSERLMQM 13 T 2.8E-05 Macoilin unphh F Eukaryota T 7or3 2 B B NOTC4_HUMAN NOTCH 4,HNOTCH4 RGRRFSAGMRG 11 T 7.3 RNA_polI_A14 pdbhh F Eukaryota T 7or5 2 B B NOTC4_HUMAN NOTCH 4,HNOTCH4 RGRRFSAGMRG 11 T 7.3 RNA_polI_A14 pdbhh F Eukaryota T 7or7 2 B B NOTC4_HUMAN NOTCH 4,HNOTCH4 RGRRFSAGMRG 11 T 7.3 RNA_polI_A14 pdbhh F Eukaryota T 7or8 2 B P CDN1B_HUMAN CYCLIN-DEPENDENT KINASE INHIBITOR P27,P27KIP1 TPKKPGLRRRQT 12 T 10 MepB pdbhh F Eukaryota T 7org 2 B P CDN1B_HUMAN CYCLIN-DEPENDENT KINASE INHIBITOR P27,P27KIP1 TPKKPGLRRRQT 12 T 10 MepB pdbhh F Eukaryota T 7orh 2 B P CDN1B_HUMAN CYCLIN-DEPENDENT KINASE INHIBITOR P27,P27KIP1 TPKKPGLRRRQT 12 T 10 MepB pdbhh F Eukaryota T 7ors 2 B P CDN1B_HUMAN CYCLIN-DEPENDENT KINASE INHIBITOR P27,P27KIP1 TPKKPGLRRRQT 12 T 10 MepB pdbhh F Eukaryota T 7ort 2 B P CDN1B_HUMAN CYCLIN-DEPENDENT KINASE INHIBITOR P27,P27KIP1 TPKKPGLRRRQT 12 T 10 MepB pdbhh F Eukaryota T 7os0 1 A,B A,C D5AUW0_RHOCB Cas13a MQIGKVQGRTISEFGDPAGGLKRKISTDGKNRKELPAHLSSDPKALIGQWISGIDKIYRKPDSRKSDGKAIHSPTPSKMQFDARDDLGEAFWKLVSEAGLAQDSDYDQFKRRLHPYGDKFQPADSGAKLKFEADPPEPQAFHGRWYGAMSKRGNDAKELAAALYEHLHVDEKRIDGQPKRNPKTDKFAPGLVVARALGIESSVLPRGMARLARNWGEEEIQTYFVVDVAASVKEVAKAAVSAAQAFDPPRQVSGRSLSPKVGFALAEHLERVTGSKRCSFDPAAGPSVLALHDEVKKTYKRLCARGKNAARAFPADKTELLALMRHTHENRVRNQMVRMGRVSEYRGQQAGDLAQSHYWTSAGQTEIKESEIFVRLWVGAFALAGRSMKAWIDPMGKIVNTEKNDRDLTAAVNIRQVISNKEMVAEAMARRGIYFGETPELDRLGAEGNEGFVFALLRYLRGCRNQTFHLGARAGFLKEIRKELEKTRWGKAKEAEHVVLTDKTVAAIRAIIDNDAKALGARLLADLSGAFVAHYASKEHFSTLYSEIVKAVKDAPEVSSGLPRLKLLLKRADGVRGYVHGLRDTRKHAFATKLPPPPAPRELDDPATKARYIALLRLYDGPFRAYASGITGTALAGPAARAKEAATALAQSVNVTKAYSDVMEGRTSRLRPPNDGETLREYLSALTGETATEFRVQIGYESDSENARKQAEFIENYRRDMLAFMFEDYIRAKGFDWILKIEPGATAMTRAPVLPEPIDTRGQYEHWQAALYLVMHFVPASDVSNLLHQLRKWEALQGKYELVQDGDATDQADARREALDLVKRFRDVLVLFLKTGEARFEGRAAPFDLKPFRALFANPATFDRLFMATPTTARPAEDDPEGDGASEPELRVARTLRGLRQIARYNHMAVLSDLFAKHKVRDEEVARLAEIEDETQEKSQIVAAQELRTDLHDKVMKCHPKTISPEERQSYAAAIKTIEEHRFLVGRVYLGDHLRLHRLMMDVIGRLIDYAGAYERDTGTFLINASKQLGAGADWAVTIAGAANTDARTQTRKDLAHFNVLDRADGTPDLTALVNRAREMMAYDRKRKNAVPRSILDMLARLGLTLKWQMKDHLLQDATITQAAIKHLDKVRLTVGGPAAVTEARFSQDYLQMVAAVFNGSVQNPKPRRRDDGDAWHKPPKPATAQSQPDQKPPNKAPSAGSRLPPPQVGEVYEGVVVKVIDTGSLGFLAVEGVAGNIGLHISRLRRIREDAIIVGRRYRFRVEIYVPPKSNTSKLNAADLVRIDENLYFQKLAAALEHHHHHH 1304 T 0.54 D5_N unppercent F Bacteria T 7os1 2 B F WBP4_HUMAN WBP-4,FORMIN-BINDING PROTEIN 21,WW DOMAIN-CONTAINING-BINDING PROTEIN 4 GAMAFNPHTSDLPSSKVNENSLGTLDESKSSDSHSDSDGEQEAEEGGVSTETEKPKIKFKEKNKNSDGGSDPETQKEKSIQKQNSLGSNEEKSKTLKKSNPYGEWQEIKQEVESHEEVDLELPSTENEYVSTSEADGGGEPKVVFKEKTVTSLGVMADGVAPVFKKRRTENGKSRNLRQRGDDQ 184 T 0.91 CDC45 pdbpssm F Eukaryota T 7os8 1 A A TPL_RANTE PHE-VAL-PRO-TRP-PHE-SER-LYS-PHE-DLE-GLY-ARG-ILE-LEU-NH2 FVPWFSKFXGRILX 14 T 0.013 Mim2 pdbhh F Eukaryota T 7osc 1 A A A0A2Y9FJE4_PHYMC cathelicidin-1-like QICRIIVVRVCRPICRITVIRVCS 24 T 15 IF3_N pdbhh F Eukaryota T 7osd 1 A A TPL_RANTE PHE-VAL-PRO-TRP-PHE-LYS-LYS-PHE-DLE-GLU-ARG-ILE-LEU-NH2 FVPWFKKFXERILX 14 T 0.063 MOSC_N unphh F Eukaryota T 7osu 1 A A sTIM11noCys-SB MDKDEAWKQVEQLRREGATRIAYRSDDWRDLKEAWKKGADILIVDATDKDEAWKQVEQLRREGATEIAYRSDDWRDLKEAWKKGADILIVDATDKDEAWKQVEQLRREGATRIAYRSDDWRDLKEAWKKGADILIVDATDKDEAWKQVEQLRREGATEIAYRSDDWRDLKEAWKKGADILIVDATGLEHHHHHH 194 T 0.00014 NanE pdbhh F T 7osv 1 A A DeNovoTIM6-SB MDKDEAWKQVEILRRLGAKRIAYRSDDWRDLQEALKKGADILIVDATDKDEAWKQVEILRRLGAKEIAYRSDDWRDLQEALKKGADILIVDATDKDEAWKQVEILRRLGAKRIAYRSDDWRDLQEALKKGADILIVDATDKDEAWKQVEILRRLGAKEIAYRSDDWRDLQEALKKGADILIVDATGLEHHHHHH 194 T 1.2E-05 NanE pdbhh F T 7ot7 1 A A sTIM11noCys-SB MDKDEAWKQVEQLRREGATRIAYRSDDWRDLKEAWKKGADILIVDATDKDEAWKQVEQLRREGATEIAYRSDDWRDLKEAWKKGADILIVDATDKDEAWKQVEQLRREGATRIAYRSDDWRDLKEAWKKGADILIVDATDKDEAWKQVEQLRREGATEIAYRSDDWRDLKEAWKKGADILIVDATGLEHHHHHH 194 T 0.00014 NanE pdbhh F T 7ot8 1 A,B A,B DeNovoTIM6-SB MDKDEAWKQVEILRRLGAKRIAYRSDDWRDLQEALKKGADILIVDATDKDEAWKQVEILRRLGAKEIAYRSDDWRDLQEALKKGADILIVDATDKDEAWKQVEILRRLGAKRIAYRSDDWRDLQEALKKGADILIVDATDKDEAWKQVEILRRLGAKEIAYRSDDWRDLQEALKKGADILIVDATGLEHHHHHH 194 T 1.2E-05 NanE pdbhh F T 7oui 23 Z,ZA U,u PST2_ARATH PsbTn EPKRGTEAAKKKYAQVCVTMPTAKICRY 28 T 0.47 Surface_antigen pdbhh F Eukaryota T 7oun 2 B B macrocyclic peptide AFLFVIRDRVFRCG 14 T 3.9 BOFC_N pdbhh F T 7oup 2 B F ((2R,4S,5S)-5-((S)-2-amino-3-methylbutanamido)-2-benzyl-4-hydroxy-6-methylheptanoyl)-L-prolyl-L-tryptophan VXPW 4 T 45 Glyco_transf_8C pdbhh F F 7ovb 2 B B Q5ZYC7_LEGPH IcmP (DotM) MYIEMAQQQQQSGSDNSMAPVWIVILLFITAYFVWALAHQYIVSFVFTINIWQARLVNLFLNNQLLANQIYLMQTLDPNTVNWDQMVTVMRAVGDYMRYPVICILVVLAFVLYNSNVTLKYRKTYDMKSLRAQEQFNWPAIMPIVKEDLVSQDVNKGPWAMALTPMEFARKYNLLRKDDALLDNPVPGEEMTAGIRRGDAKRVFTMQLGPYWDGFERCSPQAYALSAVFMARMNRDRDAANNILKVLDKTFVDGKPDFSVARPVMKKYQNSELVQEVVAKHAYVLTVIASLLEAAREDGVVPSSEFLWLKPVDRRLWYMLNCVGRQTPYSEVAGPFAHWKAEKEMGRRSLVPMIDEAIRALEIAVKEVRLTPRQMEELEP 380 T 0.031 B277 pdb F Bacteria T 7ovb 4 D D Q5ZV91_LEGPH DotZ MDEIKKDDELSQWLSTYGTITAERILGRYNISLPQDEILEAINIPSSFYRHLLQIPLKNVLNGIVIQQASDYHVYAQKLLIDYLLSGESSKEPDSQGAGTRESLEDERQRLVQLGDEFHKLELEQDNLIASSQASLMKISIDWNTKLETTLSKLNSLYKNTNSKIKKNAIRKALIKAFIHCDLVKDQSQKNKYQLIDKLNQTLAVSVGAELKESILTNLSELFQILEALNTKLDEFTDRTNHLSQQAKSFRTQFYEVILRIIELIKLLPEYKIDPAQDAINREPLYFDRTIGER 294 T 0.0097 EAP30 pdbpssm F Bacteria T 7ovb 5 E E Q5ZYR7_LEGPH DotY MPKYTLPTRDALLKAMQVGETSIEAAEYMATRFEQILTKAKLLPECNDMLEKIKEYAQFVKFKLLSSAQVWSGQERPTSDYQNTQENKAEFLASHLEGLPSGLKLEVAIGDDAKILRGFSSNGKMVEGDQLKTMDGLLEGWLAKNSLAISGGAVVKIDNTGNQTKVDPQEIRQLINDSEKGVAKYFADKGVGMEVAQRTYQEPKALETKREEIRQEIESGAEAPTTQSIR 230 T 0.019 GPW_gp25 pdbpercent F Bacteria T 7ovc 2 B B UBA5_HUMAN UBIQUITIN-ACTIVATING ENZYME 5,THIFP1,UFM1-ACTIVATING ENZYME,UBIQUITIN-ACTIVATING ENZYME E1 DOMAIN-CONTAINING PROTEIN 1 GMSVTELTVEDSGESLEDLMAKMKNMW 27 T 0.43 DUF5786 pdbhh F Eukaryota T 7ovx 2 B Q GLYG_HUMAN Peptide G DNIKRKLDTYLQ 12 T 3.1 NTS_2 pdbhh F Eukaryota T 7ow2 2 E,F,G,H E,F,G,H RN187_HUMAN E3 ubiquitin-protein ligase RNF187 peptide GLSMLLQ 7 T 0.22 DUF2028 pdbhh F Eukaryota F 7owm 2 C C HPCA_HUMAN CALCIUM-BINDING PROTEIN BDR-2 GKQNSKLR 8 T 0.07 EF-hand_7 unppercent F Eukaryota T 7own 2 C D ALA-LYS-SER-PHE-SER-LYS-PRO-ARG AKSFSKPR 8 T 19 Crystall_4 pdbhh F T 7owo 2 C,D D,F N-Acetyl-LYS-SER-PHE-SER-LYS-PRO-ARG XKSFSKPR 8 T 19 Crystall_4 pdbhh F T 7owp 2 C,D E,D ACE-GLY-ORN-SER-PHE-SER-LYS-PRO-ARG XGXSFSKPR 9 T 7.7 cIII pdbhh F T 7owq 2 C,D D,F MGY-ASN-CYS-PHE-SER-LYS-PRO-ARG XNCFSKPR 8 T 0.062 NifU unphh F T 7owr 2 C E GLY-GLY-LYS-SER-PHE-SER-LYS-PRO-ARG GGKSFSKPR 9 T 3 zf_C2H2_6 pdbhh F T 7owu 2 C,D C,D ALA-ASN-CYS-PHE-SER-LYS-PRO-ARG ANCFSKPR 8 T 4 Flexi_CP_N pdbhh F T 7ox1 3 G,J,K,L G,X,Y,Z IL9_HUMAN IL-9,CYTOKINE P40,T-CELL GROWTH FACTOR P40 GSHMQGCPTLAGILDINFLINKMQEDPASKCHCSANVTSCLCLGIPSDNCTRPCFSERLSQMTNTTMQTRYPLIFSRVKKSVEVLKNNKCPYFSCEQPCNQTTAGNALTFLKSLLEIFQKEKMRGMRGKI 130 T 0.0044 Dynamin_M unppercent F Eukaryota T 7ox2 3 C,F,I,L T,O,M,N IL9_HUMAN IL-9,CYTOKINE P40,T-CELL GROWTH FACTOR P40 GSHMQGCPTLAGILDINFLINKMQEDPASKCHCSANVTSCLCLGIPSDNCTRPCFSERLSQMTNTTMQTRYPLIFSRVKKSVEVLKNNKCPYFSCEQPCNQTTAGNALTFLKSLLEIFQKEKMRGMRGKI 130 T 0.0044 Dynamin_M unppercent F Eukaryota T 7ox3 3 C C IL9_HUMAN IL-9,CYTOKINE P40,T-CELL GROWTH FACTOR P40 GSHMQGCPTLAGILDINFLINKMQEDPASKCHCSANVTSCLCLGIPSDNCTRPCFSERLSQMTNTTMQTRYPLIFSRVKKSVEVLKNNKCPYFSCEQPCNQTTAGNALTFLKSLLEIFQKEKMRGMRGKI 130 T 0.0044 Dynamin_M unppercent F Eukaryota T 7ox4 3 C C IL9_MOUSE IL-9,CYTOKINE P40,T-CELL GROWTH FACTOR P40 GSHMQRCSTTWGIRDTNYLIENLKDDPPSKCSCSGNVTSCLCLSVPTDDCTTPCYREGLLQLTNATQKSRLLPVFHRVKRIVEVLKNITCPSFSCEKPCNQTMAGNTLSFLKSLLGTFQKTEMQRQKSRP 130 T 0.0041 Dynamin_M unppssm F Eukaryota T 7ox5 2 B,D,F,H,J,L,N,P B,D,F,H,J,L,N,P IL9_HUMAN IL-9,CYTOKINE P40,T-CELL GROWTH FACTOR P40 GSHMQGCPTLAGILDINFLINKMQEDPASKCHCSANVTSCLCLGIPSDNCTRPCFSERLSQMTNTTMQTRYPLIFSRVKKSVEVLKNNKCPYFSCEQPCNQTTAGNALTFLKSLLEIFQKEKMRGMRGKI 130 T 0.0044 Dynamin_M unppercent F Eukaryota T 7ox6 1 A A IL9_HUMAN IL-9,CYTOKINE P40,T-CELL GROWTH FACTOR P40 GSHMQGCPTLAGILDINFLINKMQEDPASKCHCSANVTSCLCLGIPSDNCTRPCFSERLSQMTNTTMQTRYPLIFSRVKKSVEVLKNNKCPYFSCEQPCNQTTAGNALTFLKSLLEIFQKEKMRGMRGKI 130 T 0.0044 Dynamin_M unppercent F Eukaryota T 7oxe 2 B B THR-ALA-GLU-HIS-ASP-GLU-PHE TAEHDEF 7 T 110 Histidinol_dh pdbhh F T 7oxh 3 C E Fragment of 30S ribosomal protein S2 peptide XX 2 F F F 7oxj 3 C E Fragment of 30S ribosomal protein S2 peptide XX 2 F F F 7oxp 1 A,B,C,D,E,F,G,H,I,J A,B,C,D,E,F,G,H,I,J A0A6A5PUS7_YEASX HLJ1_G0030540.MRNA.1.CDS.1,SEIPIN,Y55_G0030470.MRNA.1.CDS.1 MKINVSRPLQFLQWSSYIVVAFLIQLLIILPLSILIYHDFYLRLLPADSSNVVPLNTFNILNGVQFGTKFFQSIKSIPVGTDLPQTIDNGLSQLIPMRDNMEYKLDLNLQLYCQSKTDHLNLDNLLIDVYRGPGPLLGAPGGSNSKDEKIFHTSRPIVCLALTDSMSPQEIEQLGPSRLDVYDEEWLNTIRIEDKISLESSYETISVFLKTEIAQRNLIIHPESGIKFRMNFEQGLRNLMLRKRFLSYIIGISIFHCIICVLFFITGCTAFIFVRKGQEKSKKHSGRRIPGLINGGGGGGDYKDHDGDYKDHDIDYKDDDDK 322 T 0.23 Seipin pdbpercent F Eukaryota T 7oxr 1 A,B,C,D,E,F,G,H,I,J A,B,C,D,E,F,G,H,I,J A0A6A5PUS7_YEASX HLJ1_G0030540.MRNA.1.CDS.1,SEIPIN,Y55_G0030470.MRNA.1.CDS.1 MKINVSRPLQFLQWSSYIVVAFLIQLLIILPLSILIYHDFYLRLLPADSSNVVPLNTFNILNGVQFGTKFFQSIKSIPVGTDLPQTIDNGLSQLIPMRDNMEYKLDLNLQLYCQSKTDHLNLDNLLIDVYRGPGPLLGAPGGSNSKDEKIFHTSRPIVCLALTDSMSPQEIEQLGPSRLDVYDEEWLNTIRIEDKISLESSYETISVFLKTEIAQRNLIIHPESGIKFRMGGSGGSRFLSYIIGISIFHCIICVLFFITGCTAFIFVRKGQEKSKKHSGRRIPGLINGGGGGGDYKDHDGDYKDHDIDYKDDDDK 315 T 0.047 Telomerase_RBD pdbpercent F Eukaryota T 7oxu 2 B B MED1_HUMAN ACTIVATOR-RECRUITED COFACTOR 205 KDA COMPONENT,ARC205,MEDIATOR COMPLEX SUBUNIT 1,PEROXISOME PROLIFERATOR-ACTIVATED RECEPTOR-BINDING PROTEIN,PBP,PPAR-BINDING PROTEIN,THYROID HORMONE RECEPTOR-ASSOCIATED PROTEIN COMPLEX 220 KDA COMPONENT,TRAP220,THYROID RECEPTOR-INTERACTING PROTEIN 2,TR-INTERACTING PROTEIN 2,TRIP-2,VITAMIN D RECEPTOR-INTERACTING PROTEIN COMPLEX COMPONENT DRIP205,P53 REGULATORY PROTEIN RB18A NHPMLMNLLK 10 T 14 CoV_NSP8 pdbhh F Eukaryota T 7oye 2 B B THR-ALA-GLU-HIS-ASP-GLU-LEU TAEHDEL 7 T 150 FAS_I_H pdbhh F T 7oym 2 B B Hit2 (MH65) XHPYKAHA 8 T 25 RRN9 pdbhh F T 7oyn 2 B B Hit3 (MH57) XSLPFTVYX 9 T 8.2 Soc pdbhh F T 7oyq 2 B,C C,B Hit3-t2 (MH174) XSL 3 T 1100 zinc_ribbon_2 pdbhh F F 7oyr 2 B,C C,B Hit3-t4 (MH181) XSLXFX 6 T 140 DUF5001 pdbhh F F 7ozs 6 F E unknown XXXXXXXXXXXXXXXXXXXXXXX 23 F F F 7p02 4 D A GNAI1_HUMAN;GNAS2_HUMAN ADENYLATE CYCLASE-INHIBITING G ALPHA PROTEIN,ADENYLATE CYCLASE-STIMULATING G ALPHA PROTEIN MGCTLSAEDKAAVERSKMIEKQLQKDKQVYRATHRLLLLGADNSGKSTIVKQMRILHGGSGGSGGTSGIFETKFQVDKVNFHMFDVGGQRDERRKWIQCFNDVTAIIFVVDSSDYNRLQEALNLFKSIWNNRWLRTISVILFLNKQDLLAEKVLAGKSKIEDYFPEFARYTTPEDATPEPGEDPRVTRAKYFIRDEFLRISTASGDGRHYCYPHFTCAVDTENARRIFNDCRDIIQRMHLRQYELL 246 T 9E-10 G-alpha pdb F Eukaryota T 7p09 2 G G Unknown peptide from human mitochondrial transcription factor A (TFAM) XXXXXXXXXXX 11 F F F 7p0m 2 G,H,I,J,K,L,M G,I,J,M,H,L,K Unknown peptide from human mitochondrial transcription factor A (TFAM) XXXXXXXXXXX 11 F F F 7p0s 1 A,C A,B A0A0R8HV90_ORFV Apoptosis inhibitor GPLGSMANRDDIDASAVMAAYLAREYAEAVEEQLTPRERDALEALRVSGEEVRSPLLQELSNAGEHRANPENSHIPAALVSALLEAPTSPGRMVTAVELCAQMGRLWTRGRQLVDFMRLVYVLLDRLPPTADEDLGAWLQAVARVHGT 148 T 0.029 VMAP-M0 pdbpssm T Viruses T 7p0u 1 A,C,E,G A,B,D,F A0A0R8HV90_ORFV Apoptosis inhibitor GPLGSMANRDDIDASAVMAAYLAREYAEAVEEQLTPRERDALEALRVSGEEVRSPLLQELSNAGEHRANPENSHIPAALVSALLEAPTSPGRMVTAVELCAQMGRLWTRGRQLVDFMRLVYVLLDRLPPTADEDLGAWLQAVARVHGT 148 T 0.029 VMAP-M0 pdbpssm T Viruses T 7p12 1 A A DeNovoTIM13-SB MDVDEMLKQVEILRRLGAKRIAVRSDDWRILQEALKKGGDILIVDATDVDEMLKQVEILRRLGAKEIAVRSDDWRILQEALKKGGDILIVDATDVDEMLKQVEILRRLGAKRIAVRSDDWRILQEALKKGGDILIVDATDVDEMLKQVEILRRLGAKEIAVRSDDWRILQEALKKGGDILIVDATLEHHHHHH 193 T 0.00032 NanE pdbhh F T 7p1c 2 B B TRP-ASN-UX8-THR-LYS-ARG-PHE WNXTKRF 7 T 11 TMP pdbhh F T 7p1g 3 C,F,I,L,O H,F,G,I,J Phalloidin WXAXCPA 7 T 3.6 DUF6083 pdbhh F F 7p3h 1 A,B,C A,B,C Peptide HC02 XEWEAIEKKIAANESKDQAIEKKIQAIEKKIEAIEHGX 38 T 0.028 FlaC_arch pdb F T 7p46 1 A,B,C,D,E,F,G,H A,B,C,D,E,F,G,H T23O_XANCP TDO,TRYPTAMIN 2,3-DIOXYGENASE,TRYPTOPHAN OXYGENASE,TO,TRPO,TRYPTOPHAN PYRROLASE,TRYPTOPHANASE KNLRDLEPGIHTDLEGRLTYGGYLRLDQLLSAQQPLSEPAHHDEMLFIIQSQTSELWLKLLAHELRAAIVHLQRDEVWQCRKVLARSKQVLRQLTEQWSVLETLTPSEYMGFRDVLGPSSGFQSLQYRYIEFLLGNKNPQMLQVFAYDPAGQARLREVLEAPSLYEEFLRYLARFGHAIPQQYQARDWTAAHVADDTLRPVFERIYENTDRYWREYSLCEDLVDVETQFQLWRFRHMRTVMRVIGFKRGTGGSSGVGFLQQALALTFFPELFDVRTSVGVDN 282 T 6.3E-41 Trp_dioxygenase unp F Bacteria T 7p47 1 A B SMC5_YEAST Structural maintenance of chromosomes protein 5 MGTDEFLKAKEKINEIFEKLNTIRDEVIKKKNQNEYYRGRTGTRKDVSQKIKDIDDQIQQLLLKQRHLLSKMASSMKSLKNCQK 84 T 0.00014 Phe_tRNA-synt_N unp F Eukaryota T 7p4a 2 B,D E,D A0A659I9D5_STAAU Sri MVTKEFLKIKLECSDMYAQKLIDEAQGDENKLYDLFIQKLAERHTRPAIVEY 52 T 0.42 DUF3173 pdbhh F Bacteria T 7p4n 1 A A VWF_HUMAN VWF GSMATACTIQLRGGQIMTLKRDETLQDGCDTHFCKVNERGEYFWEKRVTGCPPFDEHKCLAEGGKIMKIPGTCCDTCE 78 T 0.099 zf_CCCH_5 pdb F Eukaryota T 7p5u 2 C,D CCC,EEE MGC0122 DRAATPHHRPQPR 13 T 15 Holin_2-3 pdbhh F T 7p5z 7 M 1 CDC7_YEAST Cell division control protein 7 MTSKTKNIDDIPPEIKEEMIQLYHDLPGIENEYKLIDKIGEGTFSSVYKAKDITGKITKKFASHFWNYGSNYVALKKIYVTSSPQRIYNELNLLYIMTGSSRVAPLCDAKRVRDQVIAVLPYYPHEEFRTFYRDLPIKGIKKYIWELLRALKFVHSKGIIHRDIKPTNFLFNLELGRGVLVDFGLAEAQMDYKSMISSQNDYDNYANTNHDGGYSMRNHEQFCPCIMRNQYSPNSHNQTPPMVTIQNGKVVHLNNVNGVDLTKGYPKNETRRIKRANRAGTRGFRAPEVLMKCGAQSTKIDIWSVGVILLSLLGRRFPMFQSLDDADSLLELCTIFGWKELRKCAALHGLGFEASGLIWDKPNGYSNGLKEFVYDLLNKECTIGTFPEYSVAFETFGFLQQELHDRMSIEPQLPDPKTNMDAVDAYELKKYQEEIWSDHYWCFQVLEQCFEMDPQKRSSAEDLLKTPFFNELNENTYLLDGESTDEDDVVSSSEADLLDKDVLLISE 507 T 7.6E-21 Pkinase pdbpssm F Eukaryota T 7p6r 1 A,B A,B GP2_HUMAN PANCREATIC ZYMOGEN GRANULE MEMBRANE PROTEIN GP-2,ZAP75 VQRGYGNPIEASSYGLDLDCGAPGTPEAHVCFDPCQNYTLLDEPFRSTENSAGSQGCDKNMSGWYRFVGEGGVRMSETCVQVHRCQTDAPMWLNGTHPALGDGITNHTACAHWSGNCCFWKTEVLVKACPGGYHVYRLEGTPWCNLRYCTDPSHHHHHHHH 161 T 140 Diphtheria_R pdbhh F Eukaryota T 7p6s 1 A A GP2_HUMAN PANCREATIC ZYMOGEN GRANULE MEMBRANE PROTEIN GP-2,ZAP75 VQRGYGNPIEASSYGLDLDCGAPGTPEAHVCFDPCQNYTLLDEPFRSTENSAGSQGCDKNMSGWYRFVGEGGVRMSETCVQVHRCQTDAPMWLNGTHPALGDGITNHTACAHWSGNCCFWKTEVLVKACPGGYHVYRLEGTPWCNLRYCTDPSHHHHHHHH 161 T 140 Diphtheria_R pdbhh F Eukaryota T 7p6t 1 A A GP2_HUMAN PANCREATIC ZYMOGEN GRANULE MEMBRANE PROTEIN GP-2,ZAP75 VQRGYGNPIEASSYGLDLDCGAPGTPEAHVCFDPCQNYTLLDEPFRSTENSAGSQGCDKNMSGWYRFVGEGGVRMSETCVQVHRCQTDAPMWLNGTHPALGDGITNHTACAHWSGNCCFWKTEVLVKACPGGYHVYRLEGTPWCNLRYCTDPSHHHHHHHH 161 T 140 Diphtheria_R pdbhh F Eukaryota T 7p6u 2 G S (UNK)(UNK)(UNK)(UNK)(UNK)(UNK)(UNK) XXXXXXX 7 F F F 7p6z 50 XA Z nascent peptide XXXXX 5 F F F 7p70 1 A C VE6_HPV35 Protein E6 TDDSKPTRRETEV 13 T 0.38 E6 unphh T Viruses T 7p71 2 C,D C,D VE6_HPV35 Protein E6 TDDSKPTRRETEV 13 T 0.38 E6 unphh T Viruses T 7p73 2 B B TAX_HTL1A PROTEIN X-LOR,PROTEIN PX,TRANS-ACTIVATING TRANSCRIPTIONAL REGULATORY PROTEIN OF HTLV-1 TDDSEKHFRETEV 13 T 6 VGLL4 pdbhh T Viruses T 7p74 2 B B KS6A1_HUMAN S6K-ALPHA-1,90 KDA RIBOSOMAL PROTEIN S6 KINASE 1,P90-RSK 1,P90RSK1,P90S6K,MAP KINASE-ACTIVATED PROTEIN KINASE 1A,MAPK-ACTIVATED PROTEIN KINASE 1A,MAPKAP KINASE 1A,MAPKAPK-1A,RIBOSOMAL S6 KINASE 1,RSK-1 TDDSRRVRKLPSTTL 15 T 9.6 DUF1088 pdbhh F Eukaryota T 7p7q 15 O H RL3_ENTFA 50S ribosomal protein L3 MTKGILGKKVGMTQIFTESGELIPVTVVEATPNVVLQVKTVETDGYEAIQVGYQDKREVLSNKPAKGHVAKANTAPKRFIKEFKNVELGEYEVGKEIKVDVFQAGDVVDVTGTTKGKGFQGAIKRHGQSRGPMSHGSRYHRRPGSMGPVAPNRVFKNKRLAGRMGGDRVTIQNLEVVKVDVERNVILIKGNIPGAKKSLITIKSAVKAK 209 F F Bacteria T 7p80 2 H,I I,J ADEP2 XXSPXAX 7 T 430 GreA_GreB pdbhh F F 7p81 2 CA,DA,EA,FA,GA,HA,IA,JA,KA,LA,MA,NA,OA,PA,QA,RA,SA,TA,UA,VA c,d,e,f,g,h,i,j,k,l,m,n,o,p,q,r,t,u,v,w ADEP2 XXSPXAX 7 T 430 GreA_GreB pdbhh F F 7p8x 2 B M CCR2_HUMAN C-C CKR-2,CC-CKR-2,CCR-2,CCR2,MONOCYTE CHEMOATTRACTANT PROTEIN 1 RECEPTOR,MCP-1-R XDYDYG 6 T 59 DUF4223 pdbhh F Eukaryota F 7p93 2 B,C B,M ACKR1_HUMAN DUFFY ANTIGEN/CHEMOKINE RECEPTOR,FY GLYCOPROTEIN,GPFY,GLYCOPROTEIN D,PLASMODIUM VIVAX RECEPTOR XDSFPDGDYGANLE 14 T 0.67 DUF2716 pdbhh F Eukaryota T 7p9j 1 A,B,C A,B,C H2J4R1_MARPK TPR_REGION domain-containing protein GPSQNAIKRFMTLFSGREDVFSIQYEGGYRPIRRPLNFQDIKNHFSGKKTLGIYLLKKNDTVKFAAYDIDIKKHYLNREDKFVYEENSKKVAKRLSRELNLENITHYFEFTGNRGYHIWIFFDIPVSAYKIKYIMEKILDRIELEEGIDVEIFPKQTSLNGGLGNLIKVPLGVHKKTGKKCLFVDNDFNVIENQIEFLNNIKENKATEINKLFREIFNE 219 T 0.019 DUF1882 pdbhh F Bacteria T 7pal 24 X Z nascent peptide AATVV 5 T 330 XdhC_CoxI pdbhh F F 7pc3 2 B C TAX_HTL1A PROTEIN X-LOR,PROTEIN PX,TRANS-ACTIVATING TRANSCRIPTIONAL REGULATORY PROTEIN OF HTLV-1 SEKHFRETEV 10 T 6.9 DUF6428 pdbhh T Viruses T 7pc4 2 B C TAX_HTL1A PROTEIN X-LOR,PROTEIN PX,TRANS-ACTIVATING TRANSCRIPTIONAL REGULATORY PROTEIN OF HTLV-1 SEKHFRETEV 10 T 6.9 DUF6428 pdbhh T Viruses T 7pc5 2 B B EXOC4_HUMAN EXOCYST COMPLEX COMPONENT SEC8 ATKDKKITTV 10 T 78 FAM76 pdbhh F Eukaryota T 7pc7 2 C,D E,F PTEN_HUMAN MUTATED IN MULTIPLE ADVANCED CANCERS 1,PHOSPHATASE AND TENSIN HOMOLOG EDQHTQITXV 10 T 63 Pas_Saposin pdbhh F Eukaryota T 7pc8 2 C,D C,D KS6A1_HUMAN S6K-ALPHA-1,90 KDA RIBOSOMAL PROTEIN S6 KINASE 1,P90-RSK 1,P90RSK1,P90S6K,MAP KINASE-ACTIVATED PROTEIN KINASE 1A,MAPK-ACTIVATED PROTEIN KINASE 1A,MAPKAP KINASE 1A,MAPKAPK-1A,RIBOSOMAL S6 KINASE 1,RSK-1 RVRKLPETTL 10 T 6.2 CITED pdbhh F Eukaryota T 7pc9 2 C C TAX_HTL1A PROTEIN X-LOR,PROTEIN PX,TRANS-ACTIVATING TRANSCRIPTIONAL REGULATORY PROTEIN OF HTLV-1 SEKHFRETEV 10 T 6.9 DUF6428 pdbhh T Viruses T 7pcj 2 B,D B,E Cyclosporin A XXXXXXXXVXA 11 F F F 7pd3 53 BB t mL65 XXXXXXXXXXXXXXXXXXXXXXXXXXXX 28 F F F 7pda 2 B B UNK XXXXX 5 F F F 7pdz 4 I,J,K,L,M Q,R,S,T,P Phalloidin PAWXAXC 7 T 2.7 CSN7a_helixI pdbhh F F 7pfo 13 M C CDC45_HUMAN PORC-PI-1,PORC-PI-1 MFVSDFRKEFYEVVQSQRVLLFVASDVDALCACKILQALFQCDHVQYTLVPVSGWQELETAFLEHKEQFHYFILINCGANVDLLDILQPDEDTIFFVCDTHRPVNVVNVYNDTQIKLLIKQDDDLEVPAYEDIFRDEEEDEEHSGNDSDGSEPSEKRTRLDYKDDDEEEIVEQTMRRRQRREWEARRRDILFDYEQYEYHGTSSAMVMFELAWMLSKDLNDMLWWAIVGLTDQWVQDKITQMKYVTDVGVLQRHVSRHNHRNEDEENTLSVDCTRISFEYDLRLVLYQHWSLHDSLCNTSYTAARFKLWSVHGQKRLQEFLADMGLPLKQVKQKFQAMDISLKENLREMIEESANKFGMKDMRVQTFSIHFGFKHKFLASDVVFATMSLMESPEKDGSGTDHFIQALDSLSRSNLDKLYHGLELAKKQLRATQQTIASCLCTNLVISQGPFLYCSLMEGTPDVMLFSRPASLSLLSKHLLKSFVCSTKNRRCKLLPLVMAAPLSMEHGTVTVVGIPPETDSSDRKNFFGRAFEKAAESTSSRMLHNHFDLSVIELKAEDRSKFLDALISLLS 572 T 2.1E-35 CDC45 unppssm F Eukaryota T 7pfo 19 U Q CLSPN_HUMAN HCLASPIN LEVLFQGPDYKDDDDKDYKDDDDKDYKDDDDKMTGEVGSEVHLEINDPNVISQEEADSPSDSGQGSYETIGPLSEGDSDEEIFVSKKLKNRKVLQDSDSETEDTNASPEKTTYDSAEEENKENLYAGKNTKIKRIYKTVADSDESYMEKSLYQENLEAQVKPCLELSLQSGNSTDFTTDRKSSKKHIHDKEGTAGKAKVKSKRRLEKEERKMEKIRQLKKKETKNQEDDVEQPFNDSGCLLVDKDLFETGLEDENNSPLEDEESLESIRAAVKNKVKKHKKKEPSLESGVHSFEEGSELSKGTTRKERKAARLSKEALKQLHSETQRLIRESALNLPYHMPENKTIHDFFKRKPRPTCHGNAMALLKSSKYQSSHHKEIIDTANTTEMNSDHHSKGSEQTTGAENEVETNALPVVSKETQIITGSDESCRKDLVKNEELEIQEKQKQSDIRPSPGDSSVLQQESNFLGNNHSEECQVGGLVAFEPHALEGEGPQNPEETDEKVEEPEQQNKSSAVGPPEKVRRFTLDRLKQLGVDVSIKPRLGADEDSFVILEPETNRELEALKQRFWKHANPAAKPRAGQTVNVNVIVKDMGTDGKEELKADVVPVTLAPKKLDGASHTKPGEKLQVLKAKLQEAMKLRRFEERQKRQALFKLDNEDGFEEEEEEEEEMTDESEEDGEEKVEKEEKEEELEEEEEKEEEEEEEGNQETAEFLLSSEEIETKDEKEMDKENNDGSSEIGKAVGFLSVPKSLSSDSTLLLFKDSSSKMGYFPTEEKSETDENSGKQPSKLDEDDSCSLLTKESSHNSSFELIGSTIPSYQPCNRQTGRGTSFFPTAGGFRSPSPGLFRASLVSSASKSSGKLSEPSLPIEDSQDLYNASPEPKTLFLGAGDFQFCLEDDTQSQLLDADGFLNVRNHRNQYQALKPRLPLASMDENAMDANMDELLDLCTGKFTSQAEKHLPRKSDKKENMEELLNLCSGKFTSQDASTPASSELNKQEKESSMGDPMEEALALCSGSFPTDKEEEDEEEEFGDFRLVSNDNEFDSDEDEHSDSGNDLALEDHEDDDEEELLKRSEKLKRQMRLRKYLEDEAEVSGSDVGSEDEYDGEEIDEYEEDVIDEVLPSDEELQSQIKKIHMKTMLDDDKRQLRLYQERYLADGDLHSDGPGRMRKFRWKNIDDASQMDLFHRDSDDDQTEEQLDESEARWRKERIEREQWLRDMAQQGKITAEEEEEIGEDSQFMILAKKVTAKALQKNASRPMVIQESKSLLRNPFEAIRPGSAQQVKTGSLLNQPKAVLQKLAALSDHNPSAPRNSRNFVFHTLSPVKAEAAKESSKSQVKKRGPSFMTSPSPKHLKTDDSTSGLTRSIFKYLESLEVLFQGPDYKDDDDKDYKDDDDKDYKDDDDK 1403 T 0.00057 BUD22 pdbpercent F Eukaryota T 7pfq 3 C D Inhibitor MI-2247 XGXXKK 6 T 520 zf-HC5HC2H_2 pdbhh F F 7pfy 3 C D Inhibitor MI-2241 XGXVKK 6 T 550 z-alpha pdbhh F F 7pfz 3 C C Inhibitor MI-2267 XKKXX 5 T 1700 Bombesin pdbhh F F 7pg1 3 C D Inhibitor MI-2221 XGXGKK 6 T 34 Ploopntkinase1 pdbhh F F 7pg8 3 Q,R,S,T,U,V,W,X C,E,H,K,O,Q,T,W F7IVA8_RUEPO;Q0ABW0_ALKEH Ion transport protein,Voltage-gated sodium channel GPSSPSLLRAIPGIAWIALLLLVIFYVFAVMGTKLFAQSFPEWFGTLGASMYTLFQVMTLESWSMGIARPVIEAYPWAWIYFVSFILVSSFTVLNLFIGIIIESMQSAHHAEDGERTDAYRDEVLARLEQIDQRLNALGETKK 143 T 1.3E-51 Ion_trans unppssm F Bacteria T 7pgb 3 C,CA,DA,F,I,L,O,R,U,X c,h,e,C,T,W,Z,d,l,o F7IVA8_RUEPO;Q0ABW0_ALKEH Ion transport protein,Voltage-gated sodium channel GPSSPSLLRAIPGIAWIALLLLVIFYVFAVMGTKLFAQSFPEWFGTLGASMYTLFQVMTLESWSMGIARPVIEAYPWAWIYFVSFILVSSFTVLNLFIGIIIESMQSAHHAEDGERTDAYRDEVLARLEQIDQRLNALGETKK 143 T 1.3E-51 Ion_trans unppssm F Bacteria T 7pgc 3 C C Inhibitor MI-2191 XXXKK 5 T 2200 LPAM_1 pdbhh F F 7pgh 1 A,B,C,D,E,F,G,H F,A,B,C,D,E,G,H Q0ABW0_ALKEH;Q6TMY8_9RHOB Ion transport protein,Voltage-gated sodium channel subunit GPSSPSLLRAIPGIAWIALLLLVIFYVFAVMGTKLFAQSFPEWFGTLGASMYTLFQVMTLESWSMGIARPVIEAYPWAWIYFVSFILVSSFTVLNLFIGIIIESMQSAHHAEDGERTDAYRDEVLARLEQIDQRLNALGETKK 143 T 1.3E-51 Ion_trans unppssm F Bacteria T 7ph8 1 A A IGF1R_HUMAN INSULIN-LIKE GROWTH FACTOR I RECEPTOR,IGF-I RECEPTOR NFIHLIIALPVAVLLIVGGLVIMLYVFHRKR 31 T 0.00041 Insulin_TMD pdbhh F Eukaryota T 7phb 24 X Z nascent peptide XXXXX 5 F F F 7phx 3 C I TTI_GLOMM Tsetse thrombin inhibitor GEPGAPIDXDEXGDSSEEVGGTPLHEIPGIRL 32 T 0.048 Cytochrom_B558a unppssm F Eukaryota T 7pi0 25 XA,Y u,U PsbU QRVRTVLDMDDPAKEETVKELRKDINN 27 T 0.018 LMSTEN pdb F T 7pi5 25 XA,Y u,U PsbU QRVRTVLDMDDPAKEETVKELRKDINN 27 T 0.018 LMSTEN pdb F T 7pil 6 FA UU U5NME9_CERS4 RC-Y EVSEFAFRLMMAAVIFVGVGIMFAFAGGHWFVGLVVGGLVAAFFAATPN 49 T 0.054 DUF3487 pdbhh F Bacteria T 7pin 25 UC,WB,XA,Y u1,U1,u,U PsbU QRVRTVLDMDDPAKEETVKELRKDINN 27 T 0.018 LMSTEN pdb F T 7piu 2 B P Setmelanotide (other names RM-493; BIM-22493; IRC-022493; Imcivree) RCXHXRWC 8 T 5.2 RyR pdbhh F F 7piw 25 UC,WB,XA,Y u1,U1,u,U PsbU QRVRTVLDMDDPAKEETVKELRKDINN 27 T 0.018 LMSTEN pdb F T 7pjo 1 A,B AAA,BBB CPR-C4 GHMASMTGGQQMGRGSMHYKAQLQKLLTTEEKKILARLSTPQKIQDFLDTIKNKDLAEGEHTMWSPRAVLKHKHAHCMEGAMLAALALAYHGHSPLLMDLQTTDEDEDHVVALFKIDGHWGAISKTNHPVLRYRDPIYKSVRELAMSYFHEYFIWWTKKNGGKKTLRAYSNPFDLTRYKPERWVIATGDLDWLAEALDDSKHFPILNKKMQKQLRPASRIETKAASLSEWPKRKTNS 237 T 0.00035 DUF553 pdb F T 7pjs 57 EB y Dipeptide (FME-PHE) MF 2 T 120 KcnmB2_inactiv pdbhh F F 7pjt 57 EB y Dipeptide (FME-PHE) MF 2 T 120 KcnmB2_inactiv pdbhh F F 7pju 57 EB y Dipeptide (FME-PHE) MF 2 T 120 KcnmB2_inactiv pdbhh F F 7pjv 58 FB y Dipeptide (FME-PHE) MF 2 T 120 KcnmB2_inactiv pdbhh F F 7pjw 58 FB y Dipeptide (FME-PHE) MF 2 T 120 KcnmB2_inactiv pdbhh F F 7pjx 58 FB y Dipeptide (FME-PHE) MF 2 T 120 KcnmB2_inactiv pdbhh F F 7pjy 58 FB y Dipeptide (FME-PHE) MF 2 T 120 KcnmB2_inactiv pdbhh F F 7pjz 58 FB y Dipeptide (FME-PHE) MF 2 T 120 KcnmB2_inactiv pdbhh F F 7pkq 1 A B A0A2K3CST5_CHLRE mS35 MTLSTQRRALLAGFGKARGGQWTQEGIAALHSTTTTNAQVETQPDGELAESSADDASRLFQELSRRNRPNSAGTEPGPSPRPVLAQLPAAAELMERAAGTPSSQALYDLPTYLSVTHPHARVEPRNPAYDWRRSPQLEAGGPRRAALLLAVDAHMAAPESREALLRMAQLELAHLWYYQQHAAELPAAASPSASAASAASTATPDAAAAGQRRGGVAKQREAEPAAASTSAAAGDKAQAGAGTGTGAGAGAEAEAGDDDIFAAADRKARERAAAVEAATAAAASSAAKSLGRRGGRLPAELEARVRDMQLRYGMASRDAASVLRLLADSFSRRLPGGGRRLEGEAEAEAAVGLGEAEAEAGGDPTAALTWALSGGGRGGSGGTISGLSRHLAKQAAQAKAADRAIITAMQSLADAVNTASSSSSSATASSSTASSSGSSAYWAAHPALAYDQTLAARAHRQGLAWRAAAEAGPDGAASLSAAITALSASLRRPTSSPTSSASASSPPATPVLDLVLDYLQAKYADLLTAWETAARLREATERVSAAVARARATVPMSAVPPLPPALAAELERSWQNAAAAFRPALLQPLLQPAGSGAAARNSLAAAAASGSPALLQAQSSLTRPLDWAKAKALIEQHYAKQQQAAALAGATAAAEATAAAAAAASAAAAAGSPRGAVEALLQPLLQRAADRHRALLAGGDAAAAEAAAAAAAAAAHGSSTAAAGGAAPSEGAASVVCFRETLTLDASSAYANSSFDSRVTLEFNVDRLAAAEPGLGGAWGARFLERLLATPALPGSGRRIRGGGGSSGMFGRARASPAARAAANAAAYPVDVEHSFCRRTRTAALSTARYGSREANRKLLLEHYNELLRAAALAAAAGASAAV 883 T 31 4HPAD_g_N pdbhh F Eukaryota T 7pkq 3 C D mS45-insert XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 425 F F F 7pkq 4 D K mS31/46 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 186 F F F 7pkq 6 H O A0A2K3DZV7_CHLRE mS106 MDLIGRGGAPSASALGTDALVLLCGPEAQAGISTCSAAAESAASPQSSCSWVGTAQHSSATRAEAAGPSAPCGPAPLLRNLRTPSQSGRLSGAPPTWISAAAASLGNSPRFAASARCSTADVDVSRSGLEDASALGDCVDRWSTPAATVSGIALLHQYHNHHHNHHHIGHSSSSCGSVASSTSSVPGSAAGSPFPPRPSLPSSATNSSRFLMQSQSQQLRSVSFTAATAAPKAAAKGASAKAGSSSSSGAAAPSPAAAALRTPEWARVPGHLHELQDLYRKRQKRMAALRHGAQVELEAREAVAWKTGGRGRAVAAAAKAALDALPLPPPEDGGKAELAAALAADTTFAAAHEALTGLLRRGVVLDAAEHLAPLLRKAGDAGQLASALGVAAANHLSQAARQLGAHSPRHGPLHAVLVQECRRLKAAAPLVTLWESLHEHGLAPEAADAAAAVRAAVELGDGGAAVRLLMLACMYGEAPLAGAAEAGAVLQLLQQGNPDQAAQLRELLPKLGLRGA 516 T 1.2 PPR_long pdbhh F Eukaryota T 7pkq 7 I P A0A2K3CRX4_CHLRE mS107 MRKRELLNEARALVPEGSGWLEAYTRNISPRQLTWRLGKRDSLAAMTEGWQLYQGKFDTVAMAALLRRLRHAQLQDPGFDPLAAQRLLDDLVPRLRSVGLRFGKLRDITAYLHALAKLRSPAPSASSSAASPRAGAAAASLLTQPDALVLDLAVFATRNRTELLHASPQRLATLLWALMRLLPPQLYGSEQLQVVLDRMALASLGRLQNFAPLDLRWAALAFATFGPHGPSSKATATTTAAAAGTRAAAGAGSGPALPEWPRVVRGEAAAAATAGAQGAEDVTARRQSRNARVIKALCDELAARSGNLALPQPEPRDLALAAHALGLVAAASSSAGGAVAPPALLVKAAGVAARSLPALSGEEAVGLVETLAVWGLRQPALLEALRDAAGRWGEGEQAEALRGRLQAAYTRLGVEL 416 T 0.8 Med3 pdb F Eukaryota T 7pkq 8 J U Unk1 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 33 F F F 7pkq 9 K V Unk2 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 152 F F F 7pkq 10 L W Unk3 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 54 F F F 7pkq 11 M X Unk4 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 81 F F F 7pkq 12 N Y Unk5 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 108 F F F 7pkq 13 O Z Unk6 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 571 F F F 7pkq 14 P d A0A2K3E198_CHLRE uS4m MQRTLRSLATRCAGAITGSTAQTGASGCIKAEAGVSTSALSGLISQDAIHGLCPRGAAFASLASSGVSAAAGAGPASQLRAGAPLSCLWLLALAPSRTGLAATASSSSSCGACGSHSSTSSPFAPLPASAAGLRHYAKAAKGGAAAAPAAPSGPKRPRTSTTKLYSCRNDRLRHTHEQIWPTLQLTEYEQAMFKRNSRLFVVDMGRSLSLRDKFRMGAYEPATAASTGAAAEEAGGAGAGDALVPAAGGASYRRVPYWQARSLLHESNLHLDALGENPRYLRLRRVGSLFATKLQNVRKLRLLLGFQRRGFVQKLYEHSLLARGSDRMWKMVCAMEATLPMTVTRMGLAEDVVGAATAIRNDKIYVNGKQPVMPRKGLLEPGDVVGPAAGGAAYLRKRVARSMEPLASVVTRDYV 415 T 0.0089 S4 pdbpercent F Eukaryota T 7pkq 31 GA h A0A2K3CNM3_CHLRE uS8m MAAPLLDPLVSKLRQTTATAARAAEVMRAAFPGATHETAGRNTIAVQLPRKDVPTYVMANQRPQPWELLPMKAAAMTQYPNFFNNSCTFFGSIKRDVVNGVPFCLLRPSRLALDMAKVVRNLGIVDGFEVVQRRSRLGAHDFVWLPEQQPQEPEHLYDTSLFRQRLIRLHLRTDLFSRLPGAPGSGAGGPQPASAQLAPAVGLLPLSVKNISKASQPVLMYPRQLEEAAARLPAGVFMCYHPQLGLITDAMAQQYDVPALVAAHVGLPLSQAAAIRGALRVKAAEEAGKELRHVTQLKDWNMMELLRQRMVERRAALEAGMGVGGEVAARLQELREAGLRLRDEASDRVTSALNVAQDLEDGALAWQLVHSRALGAAPGAAAAGVDEGAGGEEQAGAGEGRTSPRGQPRRRR 412 T 1.6 Ribosomal_S8 unppercent F Eukaryota T 7pkq 40 PA x A0A2K3DXG4_CHLRE mS29 MTSALLVASRRARQAQGLPRCLLHAIGIHAGTRAEFASVALQEAGTTPSTSGQEQPSSAQLTPAHLRSYYPLNLALLPEAARGSAGAFYTPRDPGHERRGGCKALQQEMEATGRASILYRPIMAALNGAVAAGQQPRLLLTGPAGCGKSLALLGLVEWARQQGWLVVYVPSCLALVRGGYFARRGRGAAGGWDTLTSAQQLLKGVMDAHGPLLQSLPVLPVPGRAARRQQQQQHEPRQADKPAKVEEGQGQGQGQAGGSGLLEEGSGSASGAGGGGGRTLQDVALRGLSSDDNAQLAVDSALQLIRQLQLLGSGAAQPPDSQPGQPPRVLFALDDYNYLYGPTDYGVQPPSASPLQGRRRVLDAGELILARGLRLLESELGTNPVAAAAAGGGAGGVGGAVVVAATTATPALPAPRSLALEVPHTVVEVPGFDEAETAAALAHYAATGAATRAASAAEARHLFALTGGNGRELRAKAGALGVRVG 485 T 0.0048 DAP3 pdbpercent F Eukaryota T 7pks 13 M M I3LJR4_PIG RPBI C-terminal domain peptide SPSYSPTSPSYSP 13 T 3.2E-05 RNA_pol_Rpb1_R pdbhh F Eukaryota F 7pks 17 Q U NELFA_HUMAN NELF-A,WOLF-HIRSCHHORN SYNDROME CANDIDATE 2 PROTEIN MASMRESDTGLWLHNKLGATDELWAPPSIASLLTAAVIDNIRLCFHGLSSAVKLKLLLGTLHLPRRTVDEMKGALMEIIQLASLDSDPWVLMVADILKSFPDTGSLNLELEEQNPNVQDILGELREKVGECEASAMLPLECQYLNKNALTTLAGPLTPPVKHFQLKRKPKSATLRAELLQKSTETAQQLKRSAGVPFHAKGRGLLRKMDTTTPLKGIPKQAPFRSPTAPSVFSPTGNRTPIPPSRTLLRKERGVKLLDISELDMVGAGREAKRRRKTLDAEVVEKPAKEETVVENATPDYAAGLVSTQKLGSLNNEPALPSTSYLPSTPSVVPASSYIPSSETPPAPSSREASRPPEEPSAPSPTLPAQFKQRAPMYNSGLSPATPTPAAPTSPLTPTTPPAVAPTTQTPPVAMVAPQTQAPAQQQPKKNLSLTREQMFAAQEMFKTANKVTRPEKALILGFMAGSRENPCQEQGDVIQIKLSEHTEDLPKADGQGSTTMLVDTVFEMNYATGQWTRFKKYKPMTNVS 528 T 0.008 PRCC unppercent F Eukaryota T 7pks 20 T X Negative elongation factor E XXXXXXXXXXXXXXXXXXXXXX 22 F F F 7pks 28 BA h INT8_HUMAN INT8,PROTEIN KAONASHI-1 MSAEAADREAATSSRPCTPPQTCWFEFLLEESLLEKHLRKPCPDPAPVQLIVQFLEQASKPSVNEQNQVQPPPDNKRNRILKLLALKVAAHLKWDLDILEKSLSVPVLNMLLNELLCISKVPPGTKHVDMDLATLPPTTAMAVLLYNRWAIRTIVQSSFPVKQAKPGPPQLSVMNQMQQEKELTENILKVLKEQAADSILVLEAALKLNKDLYVHTMRTLDLLAMEPGMVNGETESSTAGLKVKTEEMQCQVCYDLGAAYFQQGSTNSAVYENAREKFFRTKELIAEIGSLSLHCTIDEKRLAGYCQACDVLVPSSDSTSQQLTPYSQVHICLRSGNYQEVIQIFIEDNLTLSLPVQFRQSVLRELFKKAQQGNEALDEICFKVCACNTVRDILEGRTISVQFNQLFLRPNKEKIDFLLEVCSRSVNLEKASESLKGNMAAFLKNVCLGLEDLQYVFMISSHELFITLLKDEERKLLVDQMRKRSPRVNLCIKPVTSFYDIPASASVNIGQLEHQLILSVDPWRIRQILIELHGMTSERQFWTVSNKWEVPSVYSGVILGIKDNLTRDLVYILMAKGLHCSTVKDFSHAKQLFAACLELVTEFSPKLRQVMLNEMLLLDIHTHEAGTGQAGERPPSDLISRVRGYLEMRLPDIPLRQVIAEECVAFMLNWRENEYLTLQVPAFLLQSNPYVKLGQLLAATCKELPGPKESRRTAKDLWEVVVQICSVSSQHKRGNDGRVSLIKQRESTLGIMYRSELLSFIKKLREPLVLTIILSLFVKLHNVREDIVNDITAEHISIWPSSIPNLQSVDFEAVAITVKELVRYTLSINPNNHSWLIIQADIYFATNQYSAALHYYLQAGAVCSDFFNKAVPPDVYTDQVIKRMIKCCSLLNCHTQVAILCQFLREIDYKTAFKSLQEQNSHDAMDSYYDYIWDVTILEYLTYLHHKRGETDKRQIAIKAIGQTELNASNPEEVLQLAAQRRKKKFLQAMAKLYF 995 T 0.0069 TPR_12 pdbpercent F Eukaryota T 7pks 33 GA u Unknown XXXXXWXXXXXXXXXXXXXXXXXXXXX 27 T 2300 zf-H2C2 pdbhh F F 7pkt 4 D d A0A2K3DYN2_CHLRE uL5m MLKLQPRSWDALPRLTAIEVSIPAIETQLERDVVDKSELLLYALALEVLAGKPAGFTAPANKALGTRATGVAVRLDAVTEPEAAHLFMEKLVHVLLPNQVGFEGVPPPMLVPPPRRSKAAEAAQARKAALDHRKAPAKAHFTEIKVGNLLTYPDFEQNFSLFEPLRGMRVRLVMEGASAADCAALLGGMSLPVLSGAAAEAALAEITAEVARRARG 216 T 0.0091 Ribosomal_L5_C pdbpssm F Eukaryota T 7pkt 28 BA E A8J2J1_CHLRE mL41 MTVRSIVVSLIRGANKASRQHQGDIGREAVVDLIQQSAAKQSGIRKGWQVKAATWVKRVHVDRGDVKVGRLEGGEFQVLPHLRPRYFVPADLDKFQLKPYVEVEKKVEAAKQ 112 T 0.00012 MRP-L27 pdbhh F Eukaryota T 7pkt 31 EA I A8IS96_CHLRE mL63/57/60 MFFSRCVMVVFKTTGGRSWNPPSGLRPLSPAQRRNRTKNLALTMKNMSILKLAEANQPEVPVRLYKPLNFSRMQWMKKKLEETRAALGWDMEARALQEQARALRVGGGRQGAAGSLLPPAARAALQGSVGDK 132 T 0.12 L31 pdbhh F Eukaryota T 7pkt 33 GA K A0A2K3DBX4_CHLRE mL80 MHASVSGSLDTAPSSSSGTALATASTSSAPLDLREQRHLYLDGTRTADPGEPRYTAPYWVPPSARAGIPNILFSEPWPSHEEPQLRRQHAAMCLEALKRADRPLTAEQVHEAVNSTAGYSASASAGDAGDSGAGADKPVLSTLAYTKKLLEHLRRTRFVYGRKNPDSMLSPGHPDHPRLYEALPFQAARYGKPETLAAADEAARAAAIAKAQKRLRNGKAPYPQHRRRARFSIWQHELAQEALRELQAK 249 T 0.0046 DUF4777 pdbpercent F Eukaryota T 7pkt 34 HA L A8J535_CHLRE mL87 MLALLAVRARSPSLPSITLPARLLSTQTSASVSETYSNRPTSSAESTEAVSSSGQSASKWDWKWVLGKASGRKPAITRPRRHQWHYCNPEYDPAAPLPEVLRSPFGPPGAERSHDWATYARHLQLQPENRRDLKRYRARFVRFMQLRELDWREAFQRGVAEDSRVSNKVARAKAEAQRQDAWSDYKQAMWQRAQLADSHQSHGTGR 206 T 4.2 Statherin pdbpssm F Eukaryota T 7pkt 35 IA C bL36m XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 36 F F F 7pkt 36 JA M A0A2K3DVD3_CHLRE mL113 MRQLSLSLLARLGRGSRGSLQPASSAINSGVDPGVLSGEECSTSAPAGMPSWLRHSRRYAHQYNLIQPVDTNHINALLSSATALEHAAVPVLRYSAWFDPEQVTRTMQRVPRMLQYQRRKGRRGAAYASSPSSSADLARSLLDALGSRLAALAPACSDQQLARALWALGAARHPHPQALAAACEVLPQRLKGASGAAAAAGAGSGAGGMAMTDLATAAWGLAAAASAGPQSVREPVRRALQEVARHLVASRPADLSATPALPQPSSPSSISSPSSGAVAPAADEVAAAASAAALAADRPWLDPRSAVKLAWAFASCEVKDAAALDVVAEAAEARIASQLQAHDPTTGPLTPRATYMYQTIRGWQAWPRPRPRVIRSAASAARGGRSRYLYDDRPRVVLRDFTAGSLAQLLAALAAAGHRHEGLMQAAAAHLTASSGRSLRVDPHDLKRLAAAFARLDLAAPAAASGGAATAAALTALLSAAQLSSLPAPLLARLAILAAESGVRRRSVYDRLVRQLMARAWVPXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 876 T 0.94 CIS_TMP unppercent F Eukaryota T 7pkt 37 KA N A0A2K3DR53_CHLRE mL114 MRRAVARCTAQIAAEACSVSSACSTSGRQVQEQLDNCLRWMTYARPRSRIDYYDPTRNVYKEFQRKWDRANAAAAAAASQAEAAAARPSHSAPPSAAPQQQPYGGAGSYPFGSSLTTPCSSYGSGYGAGYGGTAAAGADWPPGYEALLQTYLEQELPLSAEEASTLVAAAKKGLLPGSRSLIRTRFMHIKDLEPRFPGFDARAAVLGEPRLLRHAADKVMRAMLVFQDHWPSHPVGPLMGRIGCPVIRDPAGVGHRLYALTRALKTDLHYELDPHRLTPESEGFLASGVSPFELEARVSALVTIFGREGAGRLLDVSLDVLTYAPRDLDRAVLALREVFSAAGDRGYGRHSLSTPEGAAAAAADRGYVTDLAVAWPGVLALPGRLGGADGVARLLARVRRAGGARYRGAVGRRALLSEVLERPELLQAAAEAAMRGEEEDEEEDVEGRAELEGKV 455 T 5.1 NpwBP pdbhh F Eukaryota T 7pkt 39 MA P A0A2K3DXQ3_CHLRE mL116 MRQQAGMLLGEAVASTSGRASPALQLIIQRTLSLVASGIPSPRADVALQLSTPHGGHINRMINTSESIVELDSILYRFRKRLRPANIGAAAMRLEHLNRLERRTPYALRVQRVAAELQKYVATYTDRLALTQAANVLRGLSAVRHRLPPELVLRLAAGAVADGGAALRLAPDVDVRDLCFGLAGQGFNNTAFWARLCAAVLPRLRSFDPNTLPALVTALQAAQQLPAPASASASSGSAGSAVAAAAGGSTPQAAVAAEALRLLSRSETLAALAPARLADAASLLAGLGPALGVAVDARLVEAVQTATARALPSLSPNQLPGLLLAVAALRRAAAPAEAAAAAAATAPQQQLPAALLATALPHLSAGAVTMDLTAVMRAARLLAPHAAEPAAADTLVRLARRTLLLLPAPGSSTGGPTGSASGSSSSNGEGLVTLSRVPRGGQAAGAVLAAAAPAGQLQGRTAGAVEGVARAFAAAAPAVAPQPALVGELAARLAAAGEAAAARGLLDEAQLASLGRSVEVLAAAGAAKSG 530 T 0.41 HrpB1_HrpK pdbpssm F Eukaryota T 7pkt 40 NA Q A0A2K3CXJ4_CHLRE mL117 MLQALAGGALGGLQTNGPANLVGALGLLQRAAAAVVTGVPSSSSPVPPHADRSLASLSAGAQSAAESACSHGGCGHDEAPCCSARSSSNSSSDAGAPRGLQQQLRSQQLQQHQHQQRRGIATSAGSALAYKFQSNVSPASSRGSGRGSKVATRDNYQRWRESGGDVRVAQDILREAEGSGRGGGGGGGAGAARRGDSRLRRGADRPGGSGAGSGGGTGAGAGVVDVQDELRAMVLGCRDLSELQVVVCECGADLNPFLVCAAAARLHKLKQATPPGASPAALARRVGESLMVLLQDRAAEAPLSQLAGAAHGLAEAGLAPGAALLEALAARCEAASPRGGXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXAAASELGAGMLRPLCDALTPRVPALSCADVASLATGLAAALGAAGGEAAAAAADGAPPLLSPSHFGSLPRLLSDLLLLRGPGQFGGRNFASVALALALVTGGPAGAGGGGAAAAGSLPPAFWSKLAAVALPEVPAMDAGSLSRLAGAFCXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 820 T 0.19 DUF4799 pdb F Eukaryota T 7pkt 41 OA R A0A2K3DTE5_CHLRE mL118 MAFLVAARRQLKGGLSLAEVALPVSVLGTCTRSNAVFTEALAGCSGASLHCSAEQPLSRATSYSDAGAGAAGVPACGSASPRQPSPEPCSSSGSHATFNSLGRYRSQHVGISRGASALSALAEASASPSPASNLPRGQHQHQQHHHQQLRTYHAWYYGSKLRNRAISQAESLEELGEMLVREGHRLDHVNLTALLAQLKRVARAAEEEAVAEATGGSSSSSSNSSSSGSSAVTAAAAAAAARAVRVRVAELAAVAARLVRRRAKWYDPRHAALAVAHTAALRHTDGRLLHDMTGRALARLDEAYSRDVLLLLRGLCAHQHMQQLAAASSPPAVAAAVAPAVPAVAGAGKPYGGAPAVLLGGVKVFLTAKVPTGRMPPENLAGLLRHWRALAPPGRRLGPAVCGVVAADLQTRTAIYAPEPLAGVLATLSAERHALPPPLLDAAAEQFAAHALTHGSGAAAARFLAAVGAQLRLQQQAXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 653 T 15 Tachystatin_B pdbhh F Eukaryota T 7pkt 42 PA S A0A2K3D424_CHLRE mL119 MATGAVAASAADTASTSAPAPPSSSPFMPRWLRNLFPGGEHLPASPAMERQTSGASASSSGGSGPGSEADDIKQLEEMRNMDMQGYVEYCKKMRGGAPPPRPRRPSVSPDHYDYRTMQDQRRIAFLRMQQHEHIGSLVTKEESDLILAKREDVVKNRALLQAIADRTGVYIDLEVKDCIEQFLETRENAGQMHRYATEFGMPLPKGSQEQREMRRFMKRVEAEEKLAVALEKRDLTSCSLRHKLSWAGPTALCDQTTLRYHECCXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXKREVVDKDRERQVHGMFKSRAEGILRKAAMDGVKPRLRDY 334 T 0.1 VipB_2 pdbpercent F Eukaryota T 7pkt 43 QA X Unk1 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 31 F F F 7pkt 44 RA Y PPR* XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 172 F F F 7pkt 45 SA Z Unk2 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 49 F F F 7pl4 1 A A INSRR_HUMAN Insulin receptor-related protein beta chain GGLHVLLTATPVGLTLLIVLAALGFFYGKKR 31 T 0.0071 DUF6203 pdbhh F Eukaryota T 7pla 1 A A A0A8X6EH11_9CYAN ShCas12k SNASQITIQARLISFESNRQQLWKLMADLNTPLINELLCQLGQHPDFEKWQQKGKLPSTVVSQLCQPLKTDPRFAGQPSRLYMSAIHIVDYIYKSWLAIQKRLQQQLDGKTRWLEMLNSDAELVELSGDTLEAIRVKAAEILAIAMPASESDSASPKGKKGKKEKKPSSSSPKRSLSKTLFDAYQETEDIKSRSAISYLLKNGCKLTDKEEDSEKFAKRRRQVEIQIQRLTEKLISRMPKGRDLTNAKWLETLLTATTTVAEDNAQAKRWQDILLTRSSSLPFPLVFETNEDMVWSKNQKGRLCVHFNGLSDLIFEVYCGNRQLHWFQRFLEDQQTKRKSKNQHSSGLFTLRNGHLVWLEGEGKGEPWNLHHLTLYCCVDNRLWTEEGTEIVRQEKADEITKFITNMKKKSDLSDTQQALIQRKQSTLTRINNSFERPSQPLYQGQSHILVGVSLGLEKPATVAVVDAIANKVLAYRSIKQLLGDNYELLNRQRRQQQYLSHERHKAQKNFSPNQFGASELGQHIDRLLAKAIVALARTYKAGSIVLPKLGDMREVVQSEIQAIAEQKFPGYIEGQQKYAKQYRVNVHRWSYGRLIQSIQSKAAQTGIVIEEGKQPIRGSPHDKAKELALSAYNLRLTRRS 641 T 0.0027 RuvC_1 pdbhh F Bacteria T 7pll 2 B B FAK2_HUMAN Pyk2-PRR2 peptide TAFQEPPPKPSRPKYRPPP 19 T 61 NinE pdbhh F Eukaryota T 7pln 1 A,B A,B A0A1G4H7E1_PLAVI Sporozoite micronemal protein essential for cell traversal, putative ETGKDIVKILTASTTVTKTGPPPISAECPHNMVVLFGFVVKQNFWDHTNKLQSYEMEICESGASSCTSKQGNTNKYDVSYTYIECGPQALPFTEQVVSVSGTTYNSVKCPNDYSVLFGFGMATSSGRHQSALYSYFTPCRPGLKSCSLNMNEHDDKSYIYLVCVDATIWTGLNALSMIAKDDLHSAVNRYQQFNDGELVVTCPSEGTILTGFYGETHTSSPYVTVPFGKCAKSLKACSVHGSGQAIGIHNYRTLFTVALCKNNKTHHHHHH 271 T 9.2 DERM pdbhh F Eukaryota T 7plo 1 A Q CLSPN_HUMAN HCLASPIN MTGEVGSEVHLEINDPNVISQEEADSPSDSGQGSYETIGPLSEGDSDEEIFVSKKLKNRKVLQDSDSETEDTNASPEKTTYDSAEEENKENLYAGKNTKIKRIYKTVADSDESYMEKSLYQENLEAQVKPCLELSLQSGNSTDFTTDRKSSKKHIHDKEGTAGKAKVKSKRRLEKEERKMEKIRQLKKKETKNQEDDVEQPFNDSGCLLVDKDLFETGLEDENNSPLEDEESLESIRAAVKNKVKKHKKKEPSLESGVHSFEEGSELSKGTTRKERKAARLSKEALKQLHSETQRLIRESALNLPYHMPENKTIHDFFKRKPRPTCHGNAMALLKSSKYQSSHHKEIIDTANTTEMNSDHHSKGSEQTTGAENEVETNALPVVSKETQIITGSDESCRKDLVKNEELEIQEKQKQSDIRPSPGDSSVLQQESNFLGNNHSEECQVGGLVAFEPHALEGEGPQNPEETDEKVEEPEQQNKSSAVGPPEKVRRFTLDRLKQLGVDVSIKPRLGADEDSFVILEPETNRELEALKQRFWKHANPAAKPRAGQTVNVNVIVKDMGTDGKEELKADVVPVTLAPKKLDGASHTKPGEKLQVLKAKLQEAMKLRRFEERQKRQALFKLDNEDGFEEEEEEEEEMTDESEEDGEEKVEKEEKEEELEEEEEKEEEEEEEGNQETAEFLLSSEEIETKDEKEMDKENNDGSSEIGKAVGFLSVPKSLSSDSTLLLFKDSSSKMGYFPTEEKSETDENSGKQPSKLDEDDSCSLLTKESSHNSSFELIGSTIPSYQPCNRQTGRGTSFFPTAGGFRSPSPGLFRASLVSSASKSSGKLSEPSLPIEDSQDLYNASPEPKTLFLGAGDFQFCLEDDTQSQLLDADGFLNVRNHRNQYQALKPRLPLASMDENAMDANMDELLDLCTGKFTSQAEKHLPRKSDKKENMEELLNLCSGKFTSQDASTPASSELNKQEKESSMGDPMEEALALCSGSFPTDKEEEDEEEEFGDFRLVSNDNEFDSDEDEHSDSGNDLALEDHEDDDEEELLKRSEKLKRQMRLRKYLEDEAEVSGSDVGSEDEYDGEEIDEYEEDVIDEVLPSDEELQSQIKKIHMKTMLDDDKRQLRLYQERYLADGDLHSDGPGRMRKFRWKNIDDASQMDLFHRDSDDDQTEEQLDESEARWRKERIEREQWLRDMAQQGKITAEEEEEIGEDSQFMILAKKVTAKALQKNASRPMVIQESKSLLRNPFEAIRPGSAQQVKTGSLLNQPKAVLQKLAALSDHNPSAPRNSRNFVFHTLSPVKAEAAKESSKSQVKKRGPSFMTSPSPKHLKTDDSTSGLTRSIFKYLESLEVLFQGPDYKDDDDKDYKDDDDKDYKDDDDK 1371 T 0.0022 BUD22 pdbpercent F Eukaryota T 7plp 1 A,B A,B TEN4_HUMAN TEN-4,PROTEIN ODD OZ/TEN-M HOMOLOG 4,TENASCIN-M4,TEN-M4,TENEURIN TRANSMEMBRANE PROTEIN 4 HHHHHHGSMETACGDSKDNDGDGLVDCMDPDCCLQPLCHINPLCLGAAA 49 T 0.017 DUF6085 pdb F Eukaryota T 7plt 3 C H Phalloidin WXAXCPA 7 F F F 7plu 3 C,G,J H,I,J Phalloidin WXAXCPA 7 F F F 7plv 4 D H Phalloidin WXAXCPA 7 F F F 7plw 4 D H Phalloidin WXAXCPA 7 F F F 7plx 4 D H Phalloidin WXAXCPA 7 F F F 7pm5 4 D H Phalloidin WXAXCPA 7 F F F 7pm6 4 D,H,J H,I,J Phalloidin WXAXCPA 7 F F F 7pm7 1 A H Phalloidin WXAXCPA 7 F F F 7pm8 1 A H Phalloidin WXAXCPA 7 F F F 7pm9 1 A H Phalloidin WXAXCPA 7 F F F 7pma 1 A H Phalloidin WXAXCPA 7 F F F 7pmb 1 A H Phalloidin WXAXCPA 7 F F F 7pmc 1 A H Phalloidin WXAXCPA 7 F F F 7pmd 3 C H Phalloidin WXAXCPA 7 F F F 7pme 4 D,H,J H,I,J Phalloidin WXAXCPA 7 F F F 7pmf 4 D H Phalloidin WXAXCPA 7 F F F 7pmg 3 C H Phalloidin WXAXCPA 7 F F F 7pmh 4 D H Phalloidin WXAXCPA 7 F F F 7pmi 3 C H Phalloidin WXAXCPA 7 F F F 7pmj 4 D H Phalloidin WXAXCPA 7 F F F 7pml 4 D H Phalloidin WXAXCPA 7 F F F 7pmp 1 A A Q5ZVW8_LEGPH Type II protein secretion LspD MAHHHHHHVDDDDKMGSKLWNLRNADIRAVIAEVSRITGKNFVIDPRVQGKVSIVSSTPLSSRELYQVFLSVLQVSGYAAIPNGEIIKIIPNIDAKTQSPDLLSGMKSPPR 111 T 0.26 DUF3738 pdbhh F Bacteria T 7pmw 2 B B ALA-PHE AF 2 T 320 zf-C2H2_11 pdbhh F F 7pmx 2 B B ALA-PHE AF 2 T 320 zf-C2H2_11 pdbhh F F 7pmy 2 B B ALA-PHE AF 2 T 320 zf-C2H2_11 pdbhh F F 7pnb 1 A,B,C,D,E,F,G,H,I A,B,C,D,E,F,G,H,I Q4JBK8_SULAC Sulfolobus acidocaldarius 0406 filament. MNKKRLLLLSVFLVSVFVPVLVADVIYYYQGQITVGNVAPPMYFAIQPNGNAKIGNNSNVPSYINAQPSSGGSGFTAQVNITNATYNYYFNFMGLAVSKTGYIYLAKVAYSYTATNNPIQNATLYIMNQQGQIVYKYKLIVNGVVNSTLPSTPLQINSGSYIVSLLIVPYQGTLPKTPSNDLATITVNFGFSPMTASPPPIPLPSP 206 T 0.024 PPC pdbpssm F Archaea T 7pnk 24 RA,X u,U PsbU QRVRTVLDMDDPAKEETVKELRKDINN 27 T 0.018 LMSTEN pdb F T 7pnt 2 B B RT02_MOUSE MRP-S2,S2MT MAPAPAVLTRLLCAGVRRWPGFLQKAIPGPAEQNGRKVTGAPVPAVSEPQDGDDFQSRILDTPLQHSDFFNVKELFSVKSLFEARVHLGHKAGCRHRFMEPYIFGNRLGQDIIDLDQTALNLQLALNFTAHVAYRKGIILFVSRNRQFSHLIETTAQACGEYAHTRYFKGGLLTNAQLLFGPSVRLPDLIIFLHTLNNVFEPHVAVRDAAKMNIPTVGIVDTNCNPCLITYPIPGNDDSPQAIQLFCKLFRTTINRAKEKRRQMEALHRLQSPKGSEGSGTSPVPDKSHSP 291 T 1.0000000000000001E-29 Ribosomal_S2 pdbpercent F Eukaryota T 7pnu 2 B B RT02_MOUSE MRP-S2,S2MT MAPAPAVLTRLLCAGVRRWPGFLQKAIPGPAEQNGRKVTGAPVPAVSEPQDGDDFQSRILDTPLQHSDFFNVKELFSVKSLFEARVHLGHKAGCRHRFMEPYIFGNRLGQDIIDLDQTALNLQLALNFTAHVAYRKGIILFVSRNRQFSHLIETTAQACGEYAHTRYFKGGLLTNAQLLFGPSVRLPDLIIFLHTLNNVFEPHVAVRDAAKMNIPTVGIVDTNCNPCLITYPIPGNDDSPQAIQLFCKLFRTTINRAKEKRRQMEALHRLQSPKGSEGSGTSPVPDKSHSP 291 T 1.0000000000000001E-29 Ribosomal_S2 pdbpercent F Eukaryota T 7pnv 2 B B RT02_MOUSE MRP-S2,S2MT MAPAPAVLTRLLCAGVRRWPGFLQKAIPGPAEQNGRKVTGAPVPAVSEPQDGDDFQSRILDTPLQHSDFFNVKELFSVKSLFEARVHLGHKAGCRHRFMEPYIFGNRLGQDIIDLDQTALNLQLALNFTAHVAYRKGIILFVSRNRQFSHLIETTAQACGEYAHTRYFKGGLLTNAQLLFGPSVRLPDLIIFLHTLNNVFEPHVAVRDAAKMNIPTVGIVDTNCNPCLITYPIPGNDDSPQAIQLFCKLFRTTINRAKEKRRQMEALHRLQSPKGSEGSGTSPVPDKSHSP 291 T 1.0000000000000001E-29 Ribosomal_S2 pdbpercent F Eukaryota T 7pnw 2 B B RT02_MOUSE MRP-S2,S2MT MAPAPAVLTRLLCAGVRRWPGFLQKAIPGPAEQNGRKVTGAPVPAVSEPQDGDDFQSRILDTPLQHSDFFNVKELFSVKSLFEARVHLGHKAGCRHRFMEPYIFGNRLGQDIIDLDQTALNLQLALNFTAHVAYRKGIILFVSRNRQFSHLIETTAQACGEYAHTRYFKGGLLTNAQLLFGPSVRLPDLIIFLHTLNNVFEPHVAVRDAAKMNIPTVGIVDTNCNPCLITYPIPGNDDSPQAIQLFCKLFRTTINRAKEKRRQMEALHRLQSPKGSEGSGTSPVPDKSHSP 291 T 1.0000000000000001E-29 Ribosomal_S2 pdbpercent F Eukaryota T 7po6 2 B,C,D B,A,C YTDC1_HUMAN SPLICING FACTOR YT521,YT521-B GGGGTSKLKYVLQDARFFLIKSNNHENVSLAKAKGVWSTLPVNEKKLNLAFRSARSVILIFSVRESGKFQGFARLSSESHHGGSPIHWVLPAGMSAKMLGGVFKIDWICRRELPFTKSAHLTNPWNEHKPVKIGRDGQEIELECGTQLCLLFPPDESIDLYQVIHKMRH 169 T 7.399999999999999E-32 YTH pdbhh F Eukaryota T 7poh 1 A,B A,B SRYD_DROME Serendipity locus protein delta PEFMDTCFFCGAVDLSDTGSSSSMRYETLSAKVPSSQKTVSLVLTHLANCIQTQLDLKPGARLCPRCFQELSDYDTIMVNLMTTQKRLTTQLKLDK 96 T 0.0082 zf-AD pdbpercent F Eukaryota T 7pp2 2 B B C4B8B7_MAGOR AVR-Pii protein LPTPASLNGNTEVATISDVKLEARSDTTYHKCSKCGYGSDDSDAYFNHKCN 51 T 0.0029 zf_C2H2_6 pdbhh F Eukaryota T 7ppl 2 B B IRS1_HUMAN IRS-1 GRKGSGDXMPMSPKS 15 T 0.7 STAT1_TAZ2bind pdbhh F Eukaryota T 7ppm 2 B B IRS1_HUMAN IRS-1 EPKSPGEXVNIEF 13 T 0.29 DUF4834 pdbhh F Eukaryota T 7ppn 2 B B CD28_HUMAN TP44 RSRLLHSDXMNMTPRR 16 T 0.0099 WBP-1 unppssm F Eukaryota T 7ppo 2 B C SIDJ_LEGPH Calmodulin-dependent glutamylase SidJ HHHHHHSAGLEVLFQGPMVKQYYFARRGETSTHDTSLPPPVKVLSGRSIPLKEIPFETTRNELVQIYLTSVDQLIKSNKLNSIPSQQIASHYLFLRSLANSETDGIKKNQILSLAKPLGIYLASKEPHVWKTINELIEKSEYPIIHYLKNNRAHSNFMLALIHEYHKEPLTKNQSAFVQKFRDSSVFLFPNPIYTAWLAHSYDEDSSFNPMFRERLSTNFYHSTLTDNLLLRTEPKEVTLSSEHHYKKEKGPIDSSFRYQMSSDRLLRIQGRTLLFSTPQNDVVAVKVQKRGEPKSTLEEEFQMADYLLKHQSRLDVYSKLPQPLGQYSVKKSEILEISRGSLDFERFKTLIGDSKDLEVYVYKAPLTYFTYLHDKNQDLEDLTASVKTNVHDLFVLLREGIMFPQLADIFHTHFGEDEREDKGRYQALVQLLNVLQFQLGRIDKWQKAVEYVNLRSSGLADLGDSLPITSLFTSSDFTKHYFSALLTGGYHPTFFDKSSGTANSLFTGKRRLFGNYLYLNTIAEYLLVIQLTLGSYGDKVTRDMMDKPKKEAVWRELANVMFTSCAEAIHIMTGIPQSRALTLLKQRANIEKHFRQTQFWMTPDYSKLDEDTLQMEQYSIYSGEPEYEFTDKLVSGVGLSVDGTHQDLGGYNRESPLRELEKLLYATVTLIEGTMQLDKEFFKQLQQVEKILSGEIKTDANSCFEAVAQLLDLARPRCHFQKRLVLSYYEEAKLKYPSAPTDAYDSRFQVVAKTNAAITIQRFWRETRKNLSENSDIESEKPESERTTDKRLK 794 T 0.28 DUF5415 pdbpercent F Bacteria T 7pqd 6 FA,GA,OB,PB UA,UB,ua,ub PufZ MAYMFGIIVFLAMLAVCWFGFMAAERQAGRL 31 T 0.031 Orai-1 pdbpercent F T 7pqd 7 HA,QB UU,uu PufY EVSEFAFRLMMAAVIFVGVGIMFAFAGGHWFVGLVVGGLVAAFFAATPN 49 T 0.054 DUF3487 pdbhh F T 7pqe 2 B C SIDJ_LEGPH Calmodulin-dependent glutamylase SidJ HHHHHHSAGLEVLFQGPMVKQYYFARRGETSTHDTSLPPPVKVLSGRSIPLKEIPFETTRNELVQIYLTSVDQLIKSNKLNSIPSQQIASHYLFLRSLANSETDGIKKNQILSLAKPLGIYLASKEPHVWKTINELIEKSEYPIIHYLKNNRAHSNFMLALIHEYHKEPLTKNQSAFVQKFRDSSVFLFPNPIYTAWLAHSYDEDSSFNPMFRERLSTNFYHSTLTDNLLLRTEPKEVTLSSEHHYKKEKGPIDSSFRYQMSSDRLLRIQGRTLLFSTPQNDVVAVKVQKRGEPKSTLEEEFQMADYLLKHQSRLDVYSKLPQPLGQYSVKKSEILEISRGSLDFERFKTLIGDSKDLEVYVYKAPLTYFTYLHDKNQDLEDLTASVKTNVHDLFVLLREGIMFPQLADIFHTHFGEDEREDKGRYQALVQLLNVLQFQLGRIDKWQKAVEYVNLRSSGLADLGDSLPITSLFTSSDFTKHYFSALLTGGYHPTFFDKSSGTANSLFTGKRRLFGNYLYLNTIAEYLLVIQLTLGSYGDKVTRDMMDKPKKEAVWRELANVMFTSCAEAIHIMTGIPQSRALTLLKQRANIEKHFRQTQFWMTPDYSKLDEDTLQMEQYSIYSGEPEYEFTDKLVSGVGLSVDGTHQDLGGYNRESPLRELEKLLYATVTLIEGTMQLDKEFFKQLQQVEKILSGEIKTDANSCFEAVAQLLDLARPRCHFQKRLVLSYYEEAKLKYPSAPTDAYDSRFQVVAKTNAAITIQRFWRETRKNLSENSDIESEKPESERTTDKRLK 794 T 0.28 DUF5415 pdbpercent F Bacteria T 7pqw 1 A A BCR4 DFDPTEFKGPFPTIEICSKYCAVVCNYTSRPCYCVEAAKERDQWFPYCYD 50 T 6.2 Ragweed_pollen pdbhh F T 7pr7 2 B B HPSE_HUMAN Heparanase 8 kDa subunit QDVVDLDFFTQEPLHLVSPSFLSVTIDANLATDPRFLILLGSPKLRTLARGLSPAYLRFGGTKTDFLIFDPKKE 74 T 3.5 Glyco_hydro_79n unppercent F Eukaryota T 7pr8 2 B B HPSE_HUMAN Heparanase 8 kDa subunit QDVVDLDFFTQEPLHLVSPSFLSVTIDANLATDPRFLILLGSPKLRTLARGLSPAYLRFGGTKTDFLIFDPKKE 74 T 3.5 Glyco_hydro_79n unppercent F Eukaryota T 7prd 1 A A NAB3_YEAST;NRD1_YEAST Protein NRD1,HLJ1_G0022400.mRNA.1.CDS.1 TANTASQQLSLDPKQRSKQILSNLKKSPPLNLNISLPTDLTSTDPAKQQAALFQVIAALQKHFKTNMENVNYDLLQKQVKYIMDSNMLNLPQFQHLPQEEKMSAILAMLNSNSDTALSVPPHDST 125 T 0.066 OSK pdbpercent F Eukaryota T 7pre 1 A A NAB3_YEAST HLJ1_G0022400.mRNA.1.CDS.1 GSWGSMENVNYDLLQKQVKYIMDSNMLNLPQFQHLPQEEKMSAILAMLNSNSD 53 T 0.34 DUF5452 pdbhh F Eukaryota T 7prt 2 B B HPSE_HUMAN Heparanase 8 kDa subunit QDVVDLDFFTQEPLHLVSPSFLSVTIDANLATDPRFLILLGSPKLRTLARGLSPAYLRFGGTKTDFLIFDPKKE 74 T 3.5 Glyco_hydro_79n unppercent F Eukaryota T 7prv 4 E F PRGC1_HUMAN PGC-1-ALPHA,PPAR-GAMMA COACTIVATOR 1-ALPHA,PPARGC-1-ALPHA,LIGAND EFFECT MODULATOR 6, PGC1A COACTIVATOR FRAGMENT PPQEAEEPSLLKKLLLAPANT 21 T 100 Neurokinin_B pdbhh F Eukaryota T 7prw 4 E F PRGC1_HUMAN PGC-1-ALPHA,PPAR-GAMMA COACTIVATOR 1-ALPHA,PPARGC-1-ALPHA,LIGAND EFFECT MODULATOR 6, PGC1A COACTIVATOR FRAGMENT PPQEAEEPSLLKKLLLAPANT 21 T 100 Neurokinin_B pdbhh F Eukaryota T 7prx 2 B B PRGC1_HUMAN PGC-1-ALPHA,PPAR-GAMMA COACTIVATOR 1-ALPHA,PPARGC-1-ALPHA,LIGAND EFFECT MODULATOR 6, PGC1A COACTIVATOR FRAGMENT PPQEAEEPSLLKKLLLAPANT 21 T 100 Neurokinin_B pdbhh F Eukaryota T 7pt6 1 A,J 1,A Undefined Mcm4 flexible N-terminal tail XXXX 4 F F F 7pt6 8 H,Q 8,H CDC7_YEAST Cell division control protein 7 MTSKTKNIDDIPPEIKEEMIQLYHDLPGIENEYKLIDKIGEGTFSSVYKAKDITGKITKKFASHFWNYGSNYVALKKIYVTSSPQRIYNELNLLYIMTGSSRVAPLCDAKRVRDQVIAVLPYYPHEEFRTFYRDLPIKGIKKYIWELLRALKFVHSKGIIHRDIKPTNFLFNLELGRGVLVDFGLAEAQMDYKSMISSQNDYDNYANTNHDGGYSMRNHEQFCPCIMRNQYSPNSHNQTPPMVTIQNGKVVHLNNVNGVDLTKGYPKNETRRIKRANRAGTRGFRAPEVLMKCGAQSTKIDIWSVGVILLSLLGRRFPMFQSLDDADSLLELCTIFGWKELRKCAALHGLGFEASGLIWDKPNGYSNGLKEFVYDLLNKECTIGTFPEYSVAFETFGFLQQELHDRMSIEPQLPDPKTNMDAVDAYELKKYQEEIWSDHYWCFQVLEQCFEMDPQKRSSAEDLLKTPFFNELNENTYLLDGESTDEDDVVSSSEADLLDKDVLLISE 507 T 7.6E-21 Pkinase pdbpssm F Eukaryota T 7pt7 1 A 1 Undefined Mcm4 flexible N-terminal tail XXXX 4 F F F 7pt7 8 H 8 CDC7_YEAST Cell division control protein 7 MTSKTKNIDDIPPEIKEEMIQLYHDLPGIENEYKLIDKIGEGTFSSVYKAKDITGKITKKFASHFWNYGSNYVALKKIYVTSSPQRIYNELNLLYIMTGSSRVAPLCDAKRVRDQVIAVLPYYPHEEFRTFYRDLPIKGIKKYIWELLRALKFVHSKGIIHRDIKPTNFLFNLELGRGVLVDFGLAEAQMDYKSMISSQNDYDNYANTNHDGGYSMRNHEQFCPCIMRNQYSPNSHNQTPPMVTIQNGKVVHLNNVNGVDLTKGYPKNETRRIKRANRAGTRGFRAPEVLMKCGAQSTKIDIWSVGVILLSLLGRRFPMFQSLDDADSLLELCTIFGWKELRKCAALHGLGFEASGLIWDKPNGYSNGLKEFVYDLLNKECTIGTFPEYSVAFETFGFLQQELHDRMSIEPQLPDPKTNMDAVDAYELKKYQEEIWSDHYWCFQVLEQCFEMDPQKRSSAEDLLKTPFFNELNENTYLLDGESTDEDDVVSSSEADLLDKDVLLISE 507 T 7.6E-21 Pkinase pdbpssm F Eukaryota T 7pua 3 C CC uS3m MFLIHFVHYKTILQKYTFKFKHIFLSIDKYNSLFFNISGILIWLNIIHINIILIKYSFFILINNFEYLIILIST 74 T 0.022 Prophage_tail unppercent F T 7pua 8 H CJ C9ZPU0_TRYB9 LysM domain-containing protein MVRRSHVVAYYWSRYRMPTQMPKFDGPAPVAAPQSMNSTKTNEFIDPIDDKFPMSIRGPLVRPDVPEDQYVDSWYICTSMTHHMGDYRPWSASAPPNAFRFRPFNEFDAKGREYVQYMREFARFDPRKSRGNGQKGFPFRDAYLTKMNEANQKTPPPTLETIMDRAVREHHQHARILSPLEVQRDVGRLEPIPSYAGKINADRSVFPFQWKTEDWYEYEVAKVRNRRFVFENTEEDGIRGSEVTYKIVLEGFWDHHVMKLAEDVCMFLKDVGRQIVEEKLVAVRRLLQGGAVDPELLAAFNCARAGPFGGLDEYDKEEVANFLRSDLRRLEEQCLSVINRCNVPVPGATNIYDPHTSWPHVEKLEPWVRMAEFWTSSSDTSFTELEMSTAHYEFRKFFRVIICKLPFQSTEFEKRMYDIRHWLHRQTSCEFHTIYRRNVIHDSAVFPTEHDPATPTTHEHHRMFSFALDWQSAPVNRLSTDTVHEGESWDAVAQRLGCSVGELKDANAERETIEAGVVINVPVTATRRLTSFGATPLVLPLKTTSAKDGERIRTWEEAAAILDCTVEELQQCNGHAALTYQKKESEAGEFDSSVTELVAPLSCWTSTSESEFSPVERVHANDTLVAIARRLQCSEEALRAVNDGITDVSGLDFVRVPPEARRPRRLVEPQLRPQAATDALLARTIAEEETFKLKSIPHLPQNAERFPHEYHTPTSRFPPTPSETPATQDWMAYTAKYLDKQFTISAEPAPVYNVNKLWPMQQIPGKVDQTPFEEDQTWLLHSIPVQQLEMHHHEKDLQDLPFINHEQFPRSLEWNAP 817 F F Eukaryota T 7pua 11 K CN Q580I0_TRYB2 uS14m MFCFSRGLWMRHIGQDVPKRHTHFVLESRLMYEKSFRDCWLHSVCRAISQLDEPLSKTVVGTHQKMLQRKVTCFQYNQYGLFKTPYYRLANVDRYHAVQGVAGTREWVPYVNVSYWTMNKMVRGGNLLVHRVHYTGWGTDSHLKKGGWEHRWNKVLQRNVLQYSRI 166 T 12 DUF6462 pdbhh F Eukaryota T 7pua 16 P CS A0A3L6L621_9TRYP uS19m MAFRNTFTTPGKFSTVSKNIVLLLIWRVKVFLRAEGFAHSLVMLPVSLYSKILLCDVKKKIVYFHCCTRKKSMLRRCPCVWPDGPTKSSVSIGTAFTLQKRFLKIAKSAFGFYLARRGQRKYPFLRRPHIKNTHSMNPSAPYFWSFMTAKSQMAFLPEENYITGDWTGKFFVSKRQVYTLQHATSGAKVRVKSFPSIFEFNSPSRWNIGKEMNTLTKPRMDLIDEQMLTKKQRLDYVRAGLLPK 244 T 4.9 LIN52 unphh F Eukaryota T 7pua 17 Q CU Q580M9_TRYB2 bS21m MLHTTRLWLGGYMMYHRKAMGTMKYSKWKGAHGGISHFYGRTPMVEEVRPNEPITLVDRRIMHYVHHSRLRHFQLFRSYQEKSNSTECKLREGEMLRRRWHRRLQKSFIAFMQFKTMKVLEDQARLVNTYGQAAVNAALGDPWNATDNVARERKSAAVRRQVRALPMVNVVPKHVATMKQIHNDRFNYRWRVN 193 T 2.6 HMD pdbhh F Eukaryota T 7pua 18 R Ca Q38DR3_TRYB2 mS22 MLRRAYIQRRYPFNKRGPREHKSWKHHVLTEPPKPLQWRDPKVWTRDLSVMKSFDAPQWDLWQSRPRSEDMDEALQPFMDMPKSLKDRRYDIPWWANPFGAWYLQNILSLELLKLKSKTNAEKIATYRSYMRSLASGKDNTMSDDDVIRNIIKERWKTLEFGDRNAGYPCTFGDYIQFLNEWFKSLDEEGMQRLREHFDRRIRPLLAVMSPVDILWLEALTQNSPHNKEQLQRKIAFQTSLGTPEFFDMSKRLRYEINEDYKVRDELGPELFALWSKAPERWPPERLSKMYGLDFTLVRKILVWHHFKACYDACVEPDWSLPKRLFALEWIRDVRARKHGLFYGKMRFAEQKITFYSDRFLFRDLVNRREASYANVWEMDDPYRFLQTEQDYEDYWGDNYDVYRRMFPEMIGRTGEPVQQYGQMPIWAGPHRQHANKSEHNWMFAEIGVNVGHEALKKLELDPTNEKRRRFVIRQPDGTLRSAKMSEMRAWYWKEEWADFRFWAPQMEWGIENTPSQEQYQEHVPDTTDADFRKQRRIQSRPVKWFYESHYTRTGSFAGFQPLRFMQRRTEREVRWPDVINAAVQIQKRKPAAYIFKAIPEL 602 T 0.26 MRP-S22 pdb F Eukaryota T 7pua 19 S Cb C9ZNU0_TRYB9 mS23 MRQCVVRRYKMPKNMGVAPRFDTWNEKYEPWEHMKRMGRLVGTGFYIPPEWYNHFRMFPPINHNFQQEKTLNPHNASEPTQDDTSTLSPERVALRDELARKSRLVASEGMRYYNIFWVRKPLDTMEKEYYELKRRGVDHGEAIRKVLQGFYSGLAVKKRVAAIQAEEAKLTGRFITMREATVVLGVLAKLHKEQLTPHQVSLLAKEQGETTQSGAKLTAIVSRTQPHVNKEASSPATSEAVGSSTEESLSADALASMLSEDGEQSAVGTRYQVEVKETANDSVRQLREKAEDQTGFPDWYTGESPTYSGTSXCSRDGFALMKANK 325 T 0.035 CHAD pdb F Eukaryota T 7pua 20 T Cd Q38DK6_TRYB2 mS26 MMRSSSFCRRQIRPYYNLPSKSEHGRKMTGFLTPYRHWMWKQNELWRNVHEAQFEHLRRVYKRQWLESFRVNADEYIYKYNITKAAQLAQWECEMKEQEKKRIEARQMMDGRQALKKKHLDLLREFHERQFFFWYERASERLQNMNLINYVPHAQLREHIDKELDKYVAGKNEPYPLNFVGQMPFLEDGDGNIVEVPESLLSNHMAEHPDSTAKPHEPHTSSSISEAAAFEERMLRAMVSAKEEDLKEWLGDDSRALSETIDDISREEEEREADIRVARSMEETDAEREVSRRAYIERXKTGSRSIFRPPTVSEGAGGTPSAPAGDANTPMRRRKKGKLDRVHALQAHQDELLAKLSSQGLKEXVDASSVPERGKIVQSRGRIRDKAVIPTHEVLMQKPELAAGSTPGARIQTKDMVDKMYHRGKYKKSGSGDKSDGEDL 440 T 0.028 ThylakoidFormat pdbpssm F Eukaryota T 7pua 22 V Ci Q57WW0_TRYB2 mS33 MVLRRWFPLLGVHRVGYTHPSTLPVPCAQRWDLRLARARIFQEYIEEKAPGAWQLEDERSMSPEFKTFTGYPMRDMRPGYGQNLPDYIMKKRLPNNTHYELFARRDIPNEDNAMYGKYLYDMTVHGTSLPSTYRMHKDINKAQRNDRKLSGNRFKVLCSSGAKNPPSGWEPIPDATEEEED 181 T 1.1 Nmad3 pdbhh F Eukaryota T 7pua 23 W Cj Q57UK0_TRYB2 mS34 MLRCARVALRADPLNGGSSMTLGSKGSKLSPEPHRRRMPWTAAKEYVPGVVLNARDKMVLDGVQLLDIESIDRASQLDPLEVLRAVVATREYNISTGKNIFQLASQATYNGRGQRFYRKEWQEGTYDKYVTLSAIDFDRDGNKGTAYGYITFHGETTTRPVQVDFADVPGWYMDFVEERAVPFTGIVPPPPSIGTDVPVDPHSYRLKAYPYYDAPNPPEFVERLLKDRGVLPDTPTETADVDKDPTTSDGSVHYDGK 257 F F Eukaryota T 7pua 25 Y Cm Q38C96_TRYB2 mS37 MKSSDIFFAYRLTPVVFKSRQHDSGVNQYGLKPTNAYDYINPTNLINFGRGTTFDNLGVRRAGRGEIDSSPSHSGSPVFTQAKLIGLSGEEQLTMCQSETMALRLCMAKAGKETCERESRALDSCLGRVGHLRRAMSEACWEFNDWFIQNVSDNHTKPFQHRPHDWRHFYAQEKLVRERQQNGHAYGRRPKQFSFGARYVKTDGYGKRPRLPYNK 215 T 0.054 Gypsy pdbpercent F Eukaryota T 7pua 26 Z Cn Q57VQ9_TRYB2 mS38 MRAGCVACSRIPKLGIAALGSTTTSMQATCQETQELRCATTQMAPAVIFPLPPPLRGSYIDRKPTAASFNEHHATASFRHHMIASADVSHRPRHFYFASGTRNDTGTRSVKEGERKQLKQQTNVEVDSVAYGDQLDELSARWAAKFYGQVTFGPRNYPYPSSRWLARRFQMKKHRIIKRFRFRRYKLAAVANLPFAKMIRVGMLPELKSSKTKRGDVVDPTLSGQLVSAVKNTGEKRKGQRTRPKSKYQV 250 T 28 DUF2996 pdbhh F Eukaryota T 7pua 28 BA DB C9ZJE4_TRYB9 mS49 MIRRRVCDGARLAPNIRATFSAVRYQSGLHTFIRDSKPSNFSSVRRSENANGDATASPGATAGENPASSGDWASHMQRELFGEVDPLGGQAHKDYYRDVTRGYSPQYAPRNFANGGAVAYPHIQSPYEYEEAAHRRVWLDHDVDRMREEFTQHRASLRSLASAQEREELLRSRAAEYQVANTVHESESVHPIQQLYNSGGTSRSALKQQAVADRYSIAEQHSPLPLTTGVDRDALDEAQRTKDRILNDSFTAENLLITHGLREKEKHDFTILQRTVRIPFQGYDMDRFLAQQKGTPYGAQQLPPNVVPSSMEEAQRTLRGSSATATPLVDAVAQKVYARNTVVDRPAIGEQLTEQIINIMRASRTTAEQQREEERAQRFGLGRQGALVQDGGPDQRTLKKHTNDERIVDAMLFQQNAYRKTPTDEHWNPYIRRSTENGVGHLLQNKFDIMRREDRLSKGEQDLTERNTIHYGVPIQQIVDEFVFRHRNARGERPLDYFKPFPNFRALRLNRMYRDVEGFSLMKQRPEFLEWELFTRYRQHHQQRRRLALLHGLEPVANETAQERDTRRHRLDEICERTPFDEREMRVNDDEMRVSVETLRSWFGVYMLPSPTVVNAVLGGSASVNLHLYHLADEMGTADTREHVLSSRYLNRLLLLESYQNRVGRGFMNHVVGRAPEPVVPHEQPQEVLRHFSAEERAMYEQHVKEQTSRQLGEWERAMKRRRWLTDHQQYGHVVSHGLETSVVDLSHTETGAVLTVSTKAYEQEIEAVRMKTNATIKVDGMVYNLLPNSERRVVPLTVQLDSGEKIDMTSEDFDRCELEAFPRNLNHALNYGIANYAYNRGNYVETQDSIWEEQTASGQEGWSPATHADGLREGLPVRARRPIFSSSAEQRIAGGPQRAVIIQYHHQPFFNPEPRLVKVAFQCDGTIMEVPISDVMIWQRRYHGPERTVGDESRRYNPAAMRRYVDVTDPFNEKTSNTEHFLDKYEPKRNADTVADKYRTTKQITEIDKWTRYDSARADNYRPLSISHRRDYIRMGYIPRYTPWEWIAIQEADQPLIAEQIRQDNIGTSYFFSLNRYWRYKASPHGYIRHFENEVRDLLQYVDGVTPWKQAQKIRTYWEVRSHHPMPQFNRPEVAMHRNTVGLLPAHMWETDKKTGKVKSVKDSVRDYQTKTPYPKWVQL 1181 T 0.098 ACD unp F Eukaryota T 7pua 29 CA DC A0A3L6L3Q2_9TRYP mS50 MFRVRRLAGFIQCSPCGKMGPLKQQTRRYTPIWKSDPAVDNVAPLRDEDERRALWAEVGPISDVGSAVTAWIRFGNDPVLHTAVPTMLGGKFRNQQREKESLLPNSSSPFAYVEDYMGTNLVFGSPVHAKESAAVWATYFERRYASRLRLSRRTVANYVGLINSPEVFDDESDRPETRWSQDTFFRECAYLSEKFLKEKVSNMQQFEAALKRASPEAYLAFFDAFQQQTQTQIPLPSPSVWHYEGERRKQWAEKFISISHKAQAFFKDVLSEDVKKYQEVPGKLLQKVKPVLADVGKILVKRHERWLKGRVWTSLTEEEREAYCMKEVKRQQMQVEDGEFDPMMEDDVDDTELEEWQREHDAIMKLMNSPIDGLHFTTLELWLHTMRCEELETEHIYTSARVRAIQVAARKKLYDTTSYEEVIQAVVESIARGTLDLGAGVLRPHFNEVWCQLNYAKFGSSTITQHTTTSRRQLLFFHAGSLKDIAATATLYYATKPLSNSLDYASPYKYRRSLITLCSNYGVETAYTTQRPLLRSAANLARAEDLIHAVVTAAAQPFGERRRAATRDLHMEFQRLAVPVERVIVANPVSALLESGADPDEKPVEGEKVNMWPLGAKRVVLYKWSAPNVEKLKAMESDAASAVSGSSLTAKRLREIQELKRRGFLEVSLWRRVTAQERKQRNEIVEAKKKQVEEVVRTVPSLAHLHQYATSLYSRIEERVAFPTETSTVTETTNMKEETNKPLEDSEWEFAVLLDDRVLLNKEESVELYLPYRDANGELLAQGEYRALVRAFDLEANPNLHPAYCSVGYSESFQVFDALPQLIAQFFRVKDATAEAAGVTHIPAADFTPFCAFLRDAGLDVPLRCEFEAGQAVTTDGDVYMDYFLQLLRGEAFHQSHAQAGLTEAQRAIEPLCRAHWVVHHPGADESEWATARRSVLDHAMQHEREWWFPNEMLDVKDVVTGSTNGLTPQMYPAAVRYGVELCTVLTAEGKFVDERGSGLSARCVVNGTGAAESVVFDTANCNGTNTTSVEDALRVAHGALRSAQDRHNTLAAFRLGPLSKQSQVLLFCGVNAYEFGGKYARTYAYAFEKAKKELEATAASGFMAPSLSHEDTERLSDQPTTSPSVDRFASTTHPEQRKAQFVPRVGPGSTPLEDPAADQKSEWS 1165 T 0.23 CT_C_D pdbpercent F Eukaryota T 7pua 30 DA DD D0A752_TRYB9 mS51 XXXXXXXXXXXXXXXXXFVFRDPSLXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXHANTQYQHAWPLLSHDDLGNSDQSNNTKNIMYSMYMPKRNKGTAPWFRGADTYSVKYCEQGRYEYQRYLMINRFPSEYKKHFLSFLSNIRMSSGSATIPQEALHWLLRMIVDNFNPQHVHYIAAMKTLQSAGELDMARDVWKIMERQQTWPCTATICAYLDVCVEAGEKTWAMEAWNRYCTELKFLEPGEVDPKPISRVPFSLTREELLYLPKWKKHFDHDPNLDVMDLNRFNRTREVYLRMAQVMLAGGERNAFQHFFTKLEEAMLNKPTPVPEPPNPHLVRRPRWAPYEHCKSVHHSPWRLQNNGRALALGPPVTIEDEMQSRFFSNDQFLVHSVKEVLRIVLQEHKRAHPTECTRCKTEAFFYKTKDADETLKFCDDLIERLFASLGVRLSNLNTSSLLSTILEVFRVVGKESGAALLQRANEFLERKASLGDAEGSRENLTASNYLQVLSGFADESAFVYNTKKDGTCQYKTGFDPRTTMRHLADVVQEIAGNPHVTWAADMHLQVVETMVGCGTMKANDYFVRNVLRQFSWDSRFLEALYVEYRRQDDVDMWAELTKRALVWTARYNAPASERLRRLIEDDYDTIRVQTRTFRELAVFQFRDVEERRHSRDVVNELPNPWYDYVAHALPFPDRDAGYPDEYGDLGQWRAPGGPGSPVRGPGYYAPPMEGEHQRGYTAEWRDLRNPMRPPEFPTPWERKYRQYARGQHPSYDMVYAGPMPEIFPMRRDFRKPTRWDFHDIEKQGKYRTSGPY 812 T 0.0012 PPR_long pdbhh F Eukaryota T 7pua 31 EA DE Q386Q7_TRYB2 mS52 MRRCSFASTCRYRSADDVPLGLGSAYVWTCRNITSRGQFNPIHNFSYAMERGVRARDVKAFEKLITNPGPLRVAYTPDYLDWLHRCYKAKGTYMDARAVAEKKFNGNIVSSELSAAVNRREGKRGDTRGDTVDNDHHNLPGAPPPGMFLRPAHSFRRLAGELKRRRAQSILDEVARAQGMLDLFERQPHFPAIHIDRCSRFHLVELFKEMVLERSLDSNMIWEKALLYRAILSERKPSYPTSFHYIFTAVEDTVFAPTISTPMEMAGSGGEGHDRSSHPLAAKCPTLEAYYYYVYLVKKYYIDNAVEAHVVLRCHREPNAADLLFSNPPPKDDTEIMKAVELLRNADIQRGVAAAAAVSDPTLPPGGEGSVIGNSDNXKNSEKXSEGSRGRPARPPVLPGAYPPIDMLWRCEENLPLLKVLLFGEFNLIVSENPFVKFPSAHGFLTRPYSTDSSRTLADGMSLANVMAEKRGHLLPSLPRNTATSIDARAQDIRRLQQKHHRDDIVSFQKLLRSTHAEDSPSAFSSYSDWSYFNPRAVRAEERDRLTRKAVEALKLYDSATNDIYRHSFEDVQACHTQRVTERDRTMPPYLPTLPHFVAIIKKDPHISFLLHIGLPDRNSSEEGSAKHKELEKRIYYLARALYHTALEYHNETVRRVNRQKVNVAASLLDNFVEQEWTTILRDKHDVTDVTKTLNDTQNDKKQLARRLGRYMLFANRSLDDTGFPTDARADDYTRWMAPPSVGKVSL 747 T 7.7 Complex1_LYR_2 unppssm F Eukaryota T 7pua 32 FA DF C9ZXX4_TRYB9 mS53 MSVSGVFSKGRGIGHEATTSILRYIPRARVPWQPSRFGRENLTAADMARLWSRGRYRDGPGNYNSGYCTERTHVLEENTVSIIPRRELEKYMPDITIGPKALVTPVSLMNARNGHRVTHDLLHSYDPHIGRLGKPAVVDHDNITVEDPNRVGLNAATLDCRGRIYRWLRRGPFFQVDNYFRRSVKLNRDGTLPTDFVHEAPLMRKIIRLAHRGHLKAACEEYRRVTTVPPVEVYRALTACCVPGAKLADAVSIFEDGDSKLFYVSRDGEVLHNLMRCAIAARHRARIMWVYNVMRGRFYENVVVRAEVDLIWRYRIAMIALEYLLDHECAEEAAAIYSYLVEEELLRCDVHVRVGLHMREAIAAGKPITLNNDVMNATSLVRDATAVAPEVARELQRRHAQTLQNNAVEAVGAENVREGSAPWSILGPLTAIGPTAEDTMVWLQQHYGDVDVMSIMRWARFRKGKDLMAKDRPQYLARAAAWIELLSKRNREMEEVPLTYMRKSKPLVLDTNSNVRVAWQTPLMRSGGPPRLLAREEGYVFHHSNSSRFVEETYRHPGESLQSRYLALQPLHTEVSAKEDFQRLYYQAQKHHKQQERLLPVSTAAIPSRIVHHSVMSALHGVSGKGEPANRSLFTXNKSNDSHSNTGVASGTSACTDGAGSATPEF 666 T 0.0009 PPR_long pdbhh F Eukaryota T 7pua 33 GA DG C9ZNY4_TRYB9 mS54 MFRRAIPLLSANIPRSVWDPAQHNPNWSDSYGHDITNRRAWPARKWTVGLEPCTPREWLQFSHRNLAYAYNGALRACHSLPSMLLLYKEMKQRGVKVDVDTMNVLLTRAARHEHIQVDDVFLLFDELVALGARPDLAAAETLHTVLSHSASMPEEWREARRLQLVELYNNLAMEEVERLAPHRADRLLKEQMKRFRGNLQQLGSGLRPTVYCRYLHTTHTAAVLLEEVHNFLWELVPNDHPAMEIPALQLRVPFVASVLRRPSVNPGVSLASVSRAEFGDTDVCAVFLAAAERMVDADFDDQRPVSERRLFLSLLTMISYSGVLYTSDLMAQLMEMVKYSNNDETRDSDAQRVLRYALRGSSAAQDSASRTLWHSVEKVADCRVVGRYIGARNPWNPIRVCFDEQGVFKAYPISTTTTTREVSPPEGNGAVTQEQRASCVEGRTLEALNMRWDDVRRLIECTGVLVTPPSERCPQQQKMEVFTGMAVYLRTVATGRRYEGGEDVLSDGAVATSSCEQRRRGTLFAEGYDFDVWVRLFSLVQEVRHDMEKFMADHTLQCVEPEFECWEALLVTLRCALDFCVVQMQGGGARGTEREVVERLFRDVVALREELIEESRTRFGGRMRVLWLQEA 631 F F Eukaryota T 7pua 34 HA DH A0A3L6LGC8_9TRYP mS55 MLSQNVAKTTVPSYYMIRTNLPHRKPQNQWEGVYYYSGITKRQRHLILLHRKREREAHMRSFNISRASVLQRLEQLSGDRKQESLPPHVRLDLAVRLAQHGLYQQATPIVDELHHQKALHAGHYALLINALACPRLGQRILHCDAQCDPALTYKLLGDENGEERAQEAYRWFDLALTSLAVDCGGRTQPSHFVPYLPQGTAAASHITNALMRTLLTCGYTHVAAIPDSVYDRMGSMGISPTISTYELVMLALSLQGNMVEAESILSFLRSHHSEHITVESFNALLLGHREARQFDCCDAIWQELVDRRWPRASPLTAELYLRSIMDHANTPTSEPLQSFANINVVEKKKVPLVLAQMDELGVPRTHLSRVLMDEVEDSLRKFQTYRSRFYEWGRAVKQFDFIEFRRRNGWLYDLHLMKCTTKQVGPLRDFNDPDAVQGAVATAEIPAFFNERPAWERPPLEETLYVTTNKERYDDVRGGDIYYDDTRGLHDRSPTWMNEVPETRYDRLYGVNHPDIAKIGIRRHLNVEYVNRKEVVERDAALMKKTLSSGRRLRHRVESSRTHRNAGSLSGISSTAGGGSR 581 F F Eukaryota T 7pua 35 IA DI A0A3L6L6C6_9TRYP mS56 MRKFCHFMANCWNSARSHATYGAVPLTHSQVTSVYATDGGKVDELGLLELVEERIFSWKLNKWEMRIPPNLPNDQKELIRQEQENLKQILSEWRKCFGALNADILQISSLTGVPKDVVREKNRTWLQEEVAKLRWMGEVNKAALLRDAFMRLEAFGSRDFMFMERLCCIYGLARQGTFDEAFTNYITEDPVTNDIFVDERNPFKELVAHIVRNYSQIDIIYDFLGFNYSEGYRSSLRRYMEYLQCKTAENVRASGRLVTGDKGEHNILFDYCVSRESLVSGDSCQGIIDFLYINGNDVTLIIIASDNPWLRNRQLPHRRQMEGIARRVCFVLGIPPSEVRIRNLLLPPTYLDKGSIVRLNDIVFRLSNEQSNLLIPWLTNYNKELDPKDVDYTALAKTTNEEEWLTL 407 T 0.2 ApoC-I pdbpercent F Eukaryota T 7pua 36 JA DJ Q584U8_TRYB2 mS57 MFRSTRPRSVGYTPVNPDTSPMVAYSQYHWHYNLPQGMERPHSVNRTFAAPFQSNHSLVNKYRGVWIEFDMHPAFSVALEPQLRKLPRGRTLPKTPAEEVIADYTALAPLVDDEKTRDLWLAKVFQHCAFQRCGGAMELWERYCHQRFTAEGATAKPPLSLVKSVLFYCNKTDNSGWRALFDRCLKDGWNYTPLFDTAQWSFMLKSIGRMGDEDGVRAVLEEMLDVQADLDRVEARSVVIALNAVTNADVYEFVKKYLFNFGERKVKFLRTTYSDLRGHGAGKLRIPLKENDNMYYHVCWHSSIRSPRQFSPRQLYFDYTPSTLGSSSHNPNAKIDDIVKDKIEKWKAEGLLPEDYVHEDRVYDRSAAFKNVARQEKWKKMPKILKSKRMGYTGDP 396 T 1.1 RPM2 pdbhh F Eukaryota T 7pua 37 KA DK A0A3L6L3U6_9TRYP mS58 MSFRYTNNLIGALKHRLLLESSYREIASRKFIGNCRGVEVVCSGYGTVLAVQLTDKAVWESFYRKGGRPTVSGGGDVSGDAETGTQGSATTTGATGDLDLDKLAESIKTALWDATRKIRSAKEAALHRSLSHNTRMRASADLKHWYEEDANTLRPLAFEALKHEAATPWMQLVQHGKKEEAAALLKEFEQKGDAAEATPTRVKDDRVKGTRTELSNREQPPLASTLKAEDSNPATIPIGSVHPLFTPALVQIEEAGGGSVSNEAVCRAQLWELSRDEQLFWERVELIRKGQVASIGSSHKRGYADEAAFAKDDTEEKVQLRFTQ 324 T 0.00027 YbaB_DNA_bd pdbhh F Eukaryota T 7pua 38 LA DL D0A232_TRYB9 mS59 MRCSCAFLDKSVFAAKRRVIVPIHPTPNFPAHFIKSAFTTDPLKEKQKARFSSGGEAMREVQDIPKNLEGERSRRDLASRGDTEFQALVEFIEGASYDQLISGRRFKKVYDVLSENDDMFIWLCHTAMSVLNPGDMRSRLIYNHLRILAESVASGEMTQRTAFRFFESAVRSPAYREIAKRQLEGGAATRLAGISAAADVMRRMGLTRRPMSSYFELYQRIVERSEAMTPWGFPPLFQFEERLSLEPRLKFFSRASQQSLERRRRGHVMTPHTTLHGRRIFWIPPTWNRAGRFLGPHVTLYPGMTPD 307 T 0.37 DUF2840 pdbpssm F Eukaryota T 7pua 39 MA DO Q383D1_TRYB2 mS62 MRRFCTARVSSYVKRAPLLLPRGGRRWRSGAGTTADGGGEKEYIADSFESTAGSNAYAAMELAAEARREIHELWLSAETALEREKRVQQVAALIEKYKLDPSTPREADVSRGLGDAFDRLLLLCLPLGKTDAKGTDNLERLMHLAGRNGRELSVRTIQHLFARTDSFAEALAVFYTMRRCHVAMNMEAYYSMLYSLQRLEEEGWGQHFRNEYEENGAPSEQAMDFIVKGISNALLPENKPWLGRVMFQDRNVPDRRYDTRDFDELDTAWTQRYKSGTPAGAH 282 T 0.084 ECSIT pdbhh F Eukaryota T 7pua 40 NA DP C9ZXR0_TRYB9 mS63 MLHGTPVRRASLRYRRPYWMMFLKGVDNWKIYTVIQQPDHQRTEMLYQAWLGGLDRPYTRPKCMANQPLWLSKKRHMLRKERLDGPETPLEKYVLEWHKKFHSFQGTERPTPDDLHTALDLVERPLDLSYALQLLGQCRNLNNIRFAKETFLVFLEACLRVGRRDCAEYALEHAEPLGFWFIDEDHRRYLQGEQTWYKLSPLDNLYYPVEENAKLNEGRKPITRLSPATESPGSGTAISDGEPSTDVEGETTVDDEIAQLEAELAALEREGGGK 274 T 0.19 MNE1 pdbpssm F Eukaryota T 7pua 41 OA DR C9ZPP1_TRYB9 mS65 MFSESLCILSRRFRYNTKFPALVSYNKLPWEVVNHETPQFHMHVAPHYEQLLTLAASSPVPHIIGSKHIDVPREHRLRLLPGMLYLLDGDTLPGEFTINRVLDPTALQYYGRLSSQIVTVEAVRMLVPDDLRLLCNCITFKGPLHLPVAPYASLASLRGASQGGTTGSETGSNCFTLYHFVRPNRPPKELQLEKYYIHAPCVAPLSEFASNSDERGNWRPRLQAPKRTRRATPLPAYRPPQSYLMGLAERLAVVPGGCFGRRSLMWGHWF 270 T 0.47 DUF3475 unp F Eukaryota T 7pua 44 RA DV Q57UZ6_TRYB2 mS69 MRRVRDGLFLSHPSSALQCSVAVVSTGEFDHPPFQFRQRHTFNTTPLHDANRFGGRTAYLREIGPVNIKKQGRRFKKDPRTVQFNVDVWCAQQTLRKRWKQRDWEVIEIPFRLVPREQQRVIPELYTDIPQMTDPARNDFSNIRNKVYDREELQGVLFPAAGAMLYPPLQRVDKQAMTLDKYL 183 T 0.13 CNRIP1 pdb F Eukaryota T 7pua 45 SA DW D0A8P6_TRYB9 mS70 MLRFTHRALTATPERFSVLGTTHPKPKRTGFGRNNKMRSKPSDNVAWYDKGPVEWLPRPVRLTYDHLDQLQQWTMRATLDGRTEEFNRIRDLHREWSQHPLMPVLGDVEPKFPLNLFKQNHRAKKRFLVRWHKANTPANWLWMPRGPTVVTPLHRTNPTQYPENWKQMVQRKSGTGTPS 179 T 16 THDPS_M pdbhh F Eukaryota T 7pua 46 TA DX Q383G5_TRYB2 mS71 MFHRAFVSSSDLTGCTIALSSVCTQKRYWAKPKKRPKVGQGFHEKAQKWREEYLLDRHRALADSLRAYVEFSTSKRVEPWDARFKPFDRIEKDGVYVLMRYMMEEKLQLCNYHHRPVKRLFCNIGLMGPQITTKARWKPYRFATNPAGTSKAERMYQRDKTVYTHGHND 169 T 0.18 Tox-MPTase5 pdbpercent F Eukaryota T 7pua 47 UA DY Q57YD4_TRYB2 mS72 MLRSTLTSLNTFLTSSVATPPISVIRTGPKWWAHPERMVRQKLMYFTLGVDQLPLRRTAVIQRDLQRFHMCKPPPRVGDSTGYKRSRAAQLNTWYRRIQYQEYHMQHLFTRHVWGLLRVYPGNTTKIQGKADDGYVGYDSVPFHRYNRAPLPFPARELYERRK 163 T 37 MBDa pdbhh F Eukaryota T 7pua 48 VA DZ Q587C4_TRYB2 mS73 MLRRSILFRMKYADLELTTRGEFPHGMKEPAFVKKLDQNIPWYFSTYRSMYHWPITGDNWSDLNEAEKHHDLHMFYTLAWWKLGEGIFGVDEDS 94 T 9.7 Mastoparan_2 pdbhh F Eukaryota T 7pua 49 WA Da A0A3L6KY67_9TRYP mS74 MFSLTQTWLIAHWYCGHKFRHRFMRDKRFHPSLQASHDARNRFSKRRHFKTNRWNYQQAYRDMP 64 T 0.093 MLANA pdb F Eukaryota T 7pua 51 YA F3 Q38E61_TRYB2 mt-SAF3 MLRGWHPNSSAMQVGMRHITIGGRHSRGGFRQPLGKHPQVKQGTVEGVPRRIPGTTKVTYTNKKGRTFSFSVPVSELTHPQVTLESAAGTWREMDTSFCELGDIEDDMPSPVDECLRGGSSLDKRLIQEVRERFVSFCREYVLMDTSGMKSTILSTELNAGPDYEHYDRRLRRKRHWLAIRHRFEDVRYVIWPDVVEETARGDSAQADVSLTNPSLTAGEMLEALLWLDAASTFCVRKVHPSDLGDKSEFLPLDLQREVEVVACHARRDLDFFDPSATSLEQFTACAALCVNHRVPFSLFFPAQDVCGDASVSTGQCIVANAPSPHTALGAVRIMALISEGSGSDIGKTIMFSDAFGAVTRFGILRGLSRVMSVEAFGCKDALENVNESELCIILHFCAEVREQNAAFFRRYEASEEDSDPQQVSFLAKYQQLSQIALARCKRLLYHPDSPRAQVMSEDGYIPLVELQRHAEGTNKAALIHYNLGIRSAQGMRRVALGAQSSARLAELVSRLEEASARVSGNTLVNDLVHHLSHKAAAGKMSLTLREVNTLLPLLSRMRRESPNGALDARFDRVFNAIDTAIGAAMRHNCTLDELLDLAEGLAACEMVPSALKQVEMVLIRSVMMHECSPMHLRRMLQAMFTLMRTSVPQVLLQSVASRVADYIKEASHMDSSSSNGGGDEKVKNHEECEQLLELLVVLGKCGYGALPGLVTIYWEAQLIDSMQLNPRLRCSYASLLASAAFALKKHDKRAWEGLADESHRLFMEYTRCNKENDIGRFAECVTGLAVLTQIKDNTNSSDVAFLKEYLSATSLELKSCEVIRVQELTDLLGRTLEWSEALGVVAPDVVIQLEKALFVMLENVSHTAPGVGIPDELVTAACCLVDMSSASLELRKAAAGVVGGAIVHAEEALETLRSGAPTQVRPGHSFDVAALASAERENVYKNSILQYCAALQRSGMSTHVEELWS 966 T 0.074 DNA_ligase_A_N unppercent F Eukaryota T 7pua 52 ZA F5 D0AAF7_TRYB9 mt-SAF5 MRHTIPFFRRSAFVPAPGSSLLNPRSQRAKVRRMVAAQKAQGENFERQALYAELGGSPSARAPRSKGERSKEATRRVGCEVAERAKHMTDAEWEGVPVDEKHAFAKYMHKVLQEHPTETTEQQRRRYFETTMADVFELDPRKTVRDEYERVKLGLPVHLKNPQYSLGVSQAVYDAADASLFDPENVHRLENAMTHVKQVFADYVHKKREGVSTEAERRMLANLTAELNLETQKHLANMFKYAEMRLRQVKLEERHHQLAEIERLRRMAQQRGGVKGRKGGSRKMSRMERLKRVINRAVGLDIAVAETVLTEMQAQEEFLQFCEVFARLTLGSGFKHTGKDENLSAYIESLRKLYSMDAATLSTLDVVQYYSSKEGAHPVDWAKRWYERALLLPLQSTPEYQKLLQIQQRDESTVKHIKETAGTGHAFACEAEAEVARIKTQKVVNLVEKMFMDPKDKRLESLHEKRLRYLAHMQMERQIRCVRENAKLFDGVENMPEAAQCRELYEKIMEKKTAQCNMTSPPEGEGSAIQSAKTEGDHCEVGPGMFNVYDDAEASSLFEKIREITLRVIRDRRVQSAAATKARMLNRIIRSLKGGERSIAEELRALHQQRKEKMTMRILGIIENDVKTEMEWLQNMEEAERPPLLPIPENMSYVSAADVQAWRELREDDERKAANPFERRRRTFQPELLGQAWSVPNKPLLFWGTGVSAVQQALRHVAEDAERKRQGLLLAPPYPCAENPWGWRLAKDILDDNN 754 T 0.12 FhuF pdb F Eukaryota T 7pua 55 CB F9 C9ZSL5_TRYB9 mt-SAF9 MTGPTRALFLSSGINLGRLRLAEQFSSMNGWQSKEDPAFDAYVKERRRKENYEAFDQRVERGYAAAAKLHKAEIQNAVKRRLKSSGAKFTAETLREMSSAVTERLAWLRDVWAQIDADYRSGDSARQETAAQEISAALRGEPNDYMRWVYETKRELRFAGPVGRRAIQEELQAAELPEVLDEEVNRYHDLKLNMMEIEREVKAKYGVAGQQHWAELQAAKDEEYIQKLDEAAEVYKQLLDQSARLDESRRSELQRSYVERVHQAQVRFKAAMELEGQREQLIEAHQAMKEERMRTEREKRRQLLREAAELRAQGKKSADVLTALKERQLDANAKRQAEYELKECEDILKRKSEMLDMIAHFKHDVEEREGREMLQRQKSDERQVNVFGFYEEVGVEDGLSISSEGTTSQGGSSGLGTVSTSTSCAKSADSNSSAQPSQKLRKEELWKVINADTYEDPFRTVHQARLDAVKTYDPAYARTFPLNLVLGRKYSRQGAGEMAAGNETDKQILQKGNNILYSFQWGLNNGTVHDLDADGGTDYFMDGAFHVRDKETGDIDWRYEKKRGGPVFRGPKFYRLGAQREAADPGERAMDPTPYTSTPREHKWRSS 607 T 0.27 HMGL-like unp F Eukaryota T 7pua 61 JB FZ A0A3L6KWY8_9TRYP mt-SAF29 MRRKTTLNIGQVICFSSWNDGSEGYEWKSRALSEKRSLALEFLGNVNKRVSIHDAIRLKADINKKAISNVSCPSFFSGIEGADEDEDQSDMSLCSLLGVLEGEIETDCITHLSPSDASLLKEEFLCDYDPSDTKRMAKWVNLRSETSDYQSYGAIPEGERSLWSAWYLRNIKAGKKPI 178 T 0.02 PH_11 pdbpssm F Eukaryota T 7pua 62 KB Fb Q581U4_TRYB2 mt-SAF31 MNCSSTLACHAVVSAPSTASLITSCWTPQQHCYRQLMKSLRAAYFHDRSKLFWSRHRVLVEFYKYSEEANEEAVKQLVAIGLEVAAFIDHHMRTDVERIVKHNETMMALPVAQAKKFRSDYLLAEKQHDSWCKQKIKNIMKRRPPPPYPFF 151 T 0.019 Complex1_LYR pdb F Eukaryota T 7pua 67 PB Fh Q38A63_TRYB2 mt-SAF37 MWQRLRFDRLSSSVRRTNLNPLKPCAALTEQRAELRNLHQYPTARHKSLVKDRLRFARNWWLTGGNNYELVHEVGHEREATECFAEYAQDSSRDVYLMSTNRLSDLPPGDRLKAIVGLMRSRWEVKDANRGYDKAKLLLQALECFSEMKASGQIGDFNSLPEPDQDTFLQYVEGCSRFAQACSHSHPDAVRVLLRAAQICEEMRCVEKRDEMIQVTEAAANRMDRAYAFSRPHDTLRAAPPSLHENEDCVRLKNTEELRRRFGNTAPHVLEKPKRVDCLRIHRNRPLLLHPMKDNNKLLELSKLPARPEFDSWTSHQT 318 T 0.077 ATG8 pdbpercent F Eukaryota T 7pua 68 QB Fi C9ZNX5_TRYB9 mt-SAF38 MRGSLPLLFNPVLPPSTARLRLLTYPMALAQPHATVPLIQPTIDGTHDGRNGATVSLRTQARMHGTADGTMATAGDSSQNNSVMDSPRWLRNPDELCVAALRRSRDVNKINSYVATYKFDDPQWAPLLLPEVTLRPQVNSTGDKPNGGNEAAADVVSVGPSVSATPESTPPPPPSSSSSSPYSCPADCVSISHNKMIMLECMSRHVNFSLRHIVQKGHGIYLIYHAQHSILQPKGLVEQSFVTCSFGIRGERLRTDIVHVGPIDAADVMELQPSEGHDHPRCCFNLYQKSDVRRGVIAVSQVEGYGTWFQRKPMLWQRSRRIGALQSQLGAFAYDLVDPHEVGKWRDCEVSLLAPHMRFFRNGLNGAEAVGIIASSQVAQQRRLYLGEFEAPAITALDAVQQLAHASALRCKLVTPVVDPNGVGGTGSGSLGDENMDKHIDMETLLPLSWATRTPPPYVPLEADLPFKLQMSRPTVFAESHQQNQAYPTGGTVGSPFVRGAPMMMFEYNMHQGVDHYVYDDAPSARPMKWWSQKSNMPYSGYMYFARSGLVDRFTPSEDIPNPLEPTSKRKPLHAVVPPTKVVQERLRKYRRKQQEGHKQRRRASSGSGVSNEPDAVNRQESVSRGTCE 629 T 51 RCDG1 pdbhh F Eukaryota T 7pua 70 SB IB D0A0V4_TRYB9 mt-SAF39 MRRSGRGSAVRWSSLCKCQCCLYRTPLGGTYFEQALPRSLGARQGKGVLSTVNTALSRKALKRRQSLPRKKLNVPLTAEGLKERLKQLSAEERELSIKNNTEECDEPSPNEFTTTHEARVALARVLHHGENAGERKEVAMRIPSFCRSPAVSETQSIVVDDKEGDITNAAVHVGCSVLGSDLDHLERDMIRDYHQRGKKLPTFDNIYRTLGCGRKGTSVSDTEPEDENSSGAIQSECGLGDAGRRGTVVVAPSHLHHSTPPTKGRSGEEEEEGGCFDTNTLPADANPHFPPGACDNEVLAPLSGGCAASEQTEITDTASFIPSNSRLSTAVYDAYRQRPADDRLVVLRGTDFWDNEENRARLQELTDYAEEDFAREMLMEGAMDTSEVGYSTNKVRKETLLYFQAHPINEMIQEPFARVRSILPSDGGPEVHFPADDPDTDVDIPTAQARTMARELGLDLIRVGTLYTPINDRRVVAVCTIADHREHMRDMIRFKIKKLGVQRPPTKEGIEVPFRGGTHPHAVRFKSIGIAKHLLLGHVVRINLTDFGTVREGFPVFGSILDEVARQALQLHAYHTAGVVRANYNEVYCYLYPSTGRSPKSTVLHPTQEQLATVRDRCLLEREREVYFDGLYDKKTPRERLTYMRKLQDGTAWADRDDGLSLQRQRDMKVMLGYLPKGNHELYAARGDVNVPAPFRASHPTSVDRWTHPQESNLEQAARGSAVLAKRLSMTVSEMHDRQETAENPATLDRFYYRIQGPALEAGELKEALGLKGNRKRLPRRAPGWATLGMEKVSPQEPGHAAK 803 T 0.00057 mIF3 pdbhh F Eukaryota T 7pua 71 TB U8 Unk8 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 30 F F F 7pua 72 UB UC UnkC XXXXXXXXX 9 F F F 7pua 73 VB UD UnkD AAAAFVLFMAAAA 13 T 9.4 MRAP pdbhh F F 7pua 74 WB UF UnkF XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 36 F F F 7pua 75 XB UG UnkG AAAAAAETDWKVIAA 15 T 14 Mastoparan pdbhh F T 7pua 76 YB UI UnkI XXXXXX 6 F F F 7pua 77 ZB UK UnkK XXXXXXXXXXXXXXXXXXXXXXXX 24 F F F 7pua 78 AC,BC,DC UM,UN,UQ Unk XXXXXXXX 8 F F F 7pua 79 CC UP UnkP XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 32 F F F 7pua 80 EC Ua Unka XXXXXXXXXX 10 F F F 7pua 81 FC Ug Unkg XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 271 F F F 7pub 2 B CC uS3m MFLIHFVHYKTILQKYTFKFKHIFLSIDKYNSLFFNISGILIWLNIIHINIILIKYSFFILINNFEYLIILIST 74 T 0.022 Prophage_tail unppercent F T 7pub 7 G CJ Q57Z45_TRYB2 US10M MVRRSHVVAYYWSRYRMPTQMPKFDGPAPVAAPQSMNSTKTNEFIDPIDDKFPMSIRGPLVRPDVPEDQYVDSWYICTSMTHHMGDYRPWSASAPPNAFRFRPFNEFDAKGREYVQYMREFARFDPRKSRGNGQKGFPFRDAYLTKMNEANQKTPPPTLETIMDRAVREHHQHARILSPLEVQRDVGRLEPIPSYAGKINADRSVFPFQWKTEDWYEYEVAKVRNRRFVFENTEEDGIRGSEVTYKIVLEGFWDHHVMKLAEDVCMFLKDVGRQIVEEKLVAVRRLLQGGAVDPELLAAFNCARAGPFGGLDEYDKEEVANFLRSDLRRLEEQCLSVINRCNVPVPGATNIYDPHTSWPHVEKLEPWVRMAEFWTSSSDTSFTELEMSTAHYEFRKFFRVIICKLPFQSTEFEKRMYDIRHWLHRQTSCEFHTIYRRNVIHDSAVFPTEHDPATPTTHEHHRMFSFALDWQSAPVNRLSTDTVHEGESWDAVAQRLGCSVGELKDANAERETIEAGVVINVPVTATRRLTSFGATPLVLPLKTTSAKDGERIRTWEEAAAILDCTVEELQQCNGHAALTYQKKESEAGEFDSSVTELVAPLSCWTSTSESEFSPVERVHANDTLVAIARRLQCSEEALRAVNDGITDVSGLDFVRVPPEARRPRRLVEPQLRPQAATDALLARTIAEEETFKLKSIPHLPQNAERFPHEYHTPTSRFPPTPSETPATQDWMAYTAKYLDKQFTISAEPAPVYNVNKLWPMQQIPGKVDQTPFEEDQTWLLHSIPVQQLEMHHHEKDLQDLPFINHEQFPRSLEWNAP 817 F F Eukaryota T 7pub 10 J CN Q580I0_TRYB2 uS14m MFCFSRGLWMRHIGQDVPKRHTHFVLESRLMYEKSFRDCWLHSVCRAISQLDEPLSKTVVGTHQKMLQRKVTCFQYNQYGLFKTPYYRLANVDRYHAVQGVAGTREWVPYVNVSYWTMNKMVRGGNLLVHRVHYTGWGTDSHLKKGGWEHRWNKVLQRNVLQYSRI 166 T 12 DUF6462 pdbhh F Eukaryota T 7pub 15 O CS uS19m MAFRNTFTTPGKFSTVSKNIVLLLIWRVKVFLRAEGFAHSLVMLPVSLYSKILLCDVKKKIVYFHCCTRKKSMLRRCPCVWPDGPTKSSVSIGTAFTLQKRFLKIAKSAFGFYLARRGQRKYPFLRRPHIKNTHSMNPSAPYFWSFMTAKSQMAFLPEENYITGDWTGKFFVSKRQVYTLQHATSGAKVRVKSFPSIFEFNSPSRWNIGKEMNTLTKPRMDLIDEQMLTKKQRLDYVRAGLLPK 244 T 4.9 LIN52 unphh F T 7pub 16 P CU Q580M9_TRYB2 bS21m MLHTTRLWLGGYMMYHRKAMGTMKYSKWKGAHGGISHFYGRTPMVEEVRPNEPITLVDRRIMHYVHHSRLRHFQLFRSYQEKSNSTECKLREGEMLRRRWHRRLQKSFIAFMQFKTMKVLEDQARLVNTYGQAAVNAALGDPWNATDNVARERKSAAVRRQVRALPMVNVVPKHVATMKQIHNDRFNYRWRVN 193 T 2.6 HMD pdbhh F Eukaryota T 7pub 17 Q Ca Q38DR3_TRYB2 mS22 MLRRAYIQRRYPFNKRGPREHKSWKHHVLTEPPKPLQWRDPKVWTRDLSVMKSFDAPQWDLWQSRPRSEDMDEALQPFMDMPKSLKDRRYDIPWWANPFGAWYLQNILSLELLKLKSKTNAEKIATYRSYMRSLASGKDNTMSDDDVIRNIIKERWKTLEFGDRNAGYPCTFGDYIQFLNEWFKSLDEEGMQRLREHFDRRIRPLLAVMSPVDILWLEALTQNSPHNKEQLQRKIAFQTSLGTPEFFDMSKRLRYEINEDYKVRDELGPELFALWSKAPERWPPERLSKMYGLDFTLVRKILVWHHFKACYDACVEPDWSLPKRLFALEWIRDVRARKHGLFYGKMRFAEQKITFYSDRFLFRDLVNRREASYANVWEMDDPYRFLQTEQDYEDYWGDNYDVYRRMFPEMIGRTGEPVQQYGQMPIWAGPHRQHANKSEHNWMFAEIGVNVGHEALKKLELDPTNEKRRRFVIRQPDGTLRSAKMSEMRAWYWKEEWADFRFWAPQMEWGIENTPSQEQYQEHVPDTTDADFRKQRRIQSRPVKWFYESHYTRTGSFAGFQPLRFMQRRTEREVRWPDVINAAVQIQKRKPAAYIFKAIPEL 602 T 0.26 MRP-S22 pdb F Eukaryota T 7pub 18 R Cb mS23 MRQCVVRRYKMPKNMGVAPRFDTWNEKYEPWEHMKRMGRLVGTGFYIPPEWYNHFRMFPPINHNFQQEKTLNPHNASEPTQDDTSTLSPERVALRDELARKSRLVASEGMRYYNIFWVRKPLDTMEKEYYELKRRGVDHGEAIRKVLQGFYSGLAVKKRVAAIQAEEAKLTGRFITMREATVVLGVLAKLHKEQLTPHQVSLLAKEQGETTQSGAKLTAIVSRTQPHVNKEASSPATSEAVGSSTEESLSADALASMLSEDGEQSAVGTRYQVEVKETANDSVRQLREKAEDQTGFPDWYTGESPTYSGTSXCSRDGFALMKANK 325 T 0.035 CHAD pdb F T 7pub 19 S Cd Q38DK6_TRYB2 mS26 MMRSSSFCRRQIRPYYNLPSKSEHGRKMTGFLTPYRHWMWKQNELWRNVHEAQFEHLRRVYKRQWLESFRVNADEYIYKYNITKAAQLAQWECEMKEQEKKRIEARQMMDGRQALKKKHLDLLREFHERQFFFWYERASERLQNMNLINYVPHAQLREHIDKELDKYVAGKNEPYPLNFVGQMPFLEDGDGNIVEVPESLLSNHMAEHPDSTAKPHEPHTSSSISEAAAFEERMLRAMVSAKEEDLKEWLGDDSRALSETIDDISREEEEREADIRVARSMEETDAEREVSRRAYIERXKTGSRSIFRPPTVSEGAGGTPSAPAGDANTPMRRRKKGKLDRVHALQAHQDELLAKLSSQGLKEXVDASSVPERGKIVQSRGRIRDKAVIPTHEVLMQKPELAAGSTPGARIQTKDMVDKMYHRGKYKKSGSGDKSDGEDL 440 T 0.028 ThylakoidFormat pdbpssm F Eukaryota T 7pub 21 U Ci Q57WW0_TRYB2 mS33 MVLRRWFPLLGVHRVGYTHPSTLPVPCAQRWDLRLARARIFQEYIEEKAPGAWQLEDERSMSPEFKTFTGYPMRDMRPGYGQNLPDYIMKKRLPNNTHYELFARRDIPNEDNAMYGKYLYDMTVHGTSLPSTYRMHKDINKAQRNDRKLSGNRFKVLCSSGAKNPPSGWEPIPDATEEEED 181 T 1.1 Nmad3 pdbhh F Eukaryota T 7pub 22 V Cj Q57UK0_TRYB2 mS34 MLRCARVALRADPLNGGSSMTLGSKGSKLSPEPHRRRMPWTAAKEYVPGVVLNARDKMVLDGVQLLDIESIDRASQLDPLEVLRAVVATREYNISTGKNIFQLASQATYNGRGQRFYRKEWQEGTYDKYVTLSAIDFDRDGNKGTAYGYITFHGETTTRPVQVDFADVPGWYMDFVEERAVPFTGIVPPPPSIGTDVPVDPHSYRLKAYPYYDAPNPPEFVERLLKDRGVLPDTPTETADVDKDPTTSDGSVHYDGK 257 F F Eukaryota T 7pub 24 X Cm Q38C96_TRYB2 mS37 MKSSDIFFAYRLTPVVFKSRQHDSGVNQYGLKPTNAYDYINPTNLINFGRGTTFDNLGVRRAGRGEIDSSPSHSGSPVFTQAKLIGLSGEEQLTMCQSETMALRLCMAKAGKETCERESRALDSCLGRVGHLRRAMSEACWEFNDWFIQNVSDNHTKPFQHRPHDWRHFYAQEKLVRERQQNGHAYGRRPKQFSFGARYVKTDGYGKRPRLPYNK 215 T 0.054 Gypsy pdbpercent F Eukaryota T 7pub 25 Y Cn Q57VQ9_TRYB2 mS38 MRAGCVACSRIPKLGIAALGSTTTSMQATCQETQELRCATTQMAPAVIFPLPPPLRGSYIDRKPTAASFNEHHATASFRHHMIASADVSHRPRHFYFASGTRNDTGTRSVKEGERKQLKQQTNVEVDSVAYGDQLDELSARWAAKFYGQVTFGPRNYPYPSSRWLARRFQMKKHRIIKRFRFRRYKLAAVANLPFAKMIRVGMLPELKSSKTKRGDVVDPTLSGQLVSAVKNTGEKRKGQRTRPKSKYQV 250 T 28 DUF2996 pdbhh F Eukaryota T 7pub 30 DA DA Q57UJ2_TRYB2 mS48 MMRRVASTGSNGGHCYVGSSLWADSVYMEVSSITLQKRFSFKYATKLQHDEMRQPYYIHEKRYGIFSNERNIAKARRGLPFITPLYTKHMNLWDTDTDASNNRFFRGYYYGQRELHQLLGRPHSLEATNADGSNDLSTYEANTSQLYKGIPRPAITNLHYEPAWRYTLYQAGAHGAQLSNPRSPFTAKVLGDELMQVRDIKSVEHCKAWFDRLQYLINLHYEAVGDIGEFKSRHTRHVHEFFVAFHDALSSLDFRDTYLFDQFKAVRPPELSDLFGIFLEMEANYVHEDYCPRCSLPYSTTRYCGEGDVDTPFRKHRGRWAPHQKWGREWYDVVARRAEALWYRATEDPYFGTPQHTQRQAEALLKVYVQAKQRGKAIDFMNKLRGSVEYLTGDITITTVMQESYDALLDTTPHPHLLTNGFTLESDAAKYTGEVNKAPLSPLQFRIDMEMNKYRRQQKEEGVVRVPPALWRLDTSAIVPYKVDSKTRRIVNWREVKEGIEKSFLSVGLPKEAYTGNEWREMLYLHDVIANREAKAAVLEQQQKLDREKLKARERASLKGEKSSNSASNGIGIIFPDKDGYQVFFVDEASLRPFGIGASGTLFKAVARVYPSPAAVPYDDPVHGKQFLIDTTDETCHLFGGFEHGDRLLLRAKSSGGDVDGGNNSNCRNTGSFTDEEEEVIVMGVSTEGSDAERMLCAMHVDTGKQRERGIVFLGTDCIDIRERWESVRYAASAYVKGRVTLLESQRTAREEFMGQVVGIRDGVLFVQWRLLKGGGSVLDRSVAVPIGDSTQVRELFVETKVLGVEPLQSPPSWRTPFRNDFAEERLKELQRAPFKREKYVSLIQGKYTPKVKKFGYTQHTTVDDFETKEYKDRLLSKQFFQNPQAFEVVPDRNERSVQFGGKWEYQRTHGLPTVDRNELENGWSEVEPISDGEMKVIEQALRDISGPRPGNFVKPPSKTKSLQLSESWWEPLGYGWEQHNEEQKALCDVTEQRLIDGSNLPFGGKLPPFGTSYGMGERMRAIIEDYSKGFGLGPHGHSPTHDTIHYNTLNAEGERVRDLGYTDALGRLFSEKLGDRDVHQWAVESCADGEADVRQLLLSLHEWRERGRPPSLLLANVLSKYLEEEIAAFNKGIPENAPKLQLETSDGTLAHSGSGGSRSGTMWADVDPTTFALHQASQSTRSTRCDEPFILHLVKRAKLGECVTNFTDTAYIAHLESSVINEFHLALTKLVGKGISPTLLAQKTGQLHRGSVRVGGNVVPFVKSRELSRLLERMGLTSESIAVITRGLANCPEQESVGDDFAVPVSLILSWDGPGSSGGSGSGDRRLNTTAARNEQQRKGSAALSSAIRQLSQHQKQRGSASGRGWQNEDKMVVHVLEELSLRNDGLVMDIQYAIRENKKNRRLRWEFVTTLLPVFGGNEAKVEQLYSDYCDGKYVPNITVAVEAFIAFLHNATQHPETYSASDYFDIDDGKSSTASGTGDQPSAGQYTTLKLLDPLEGPFVFDNVKVEYIQTVERFRRHGIRAGPVMAPATGFIAANCKSLNYFTRRQEEVVYVTNDSDQGLRRSLENSAYHKTIAANPALQYLLKARRGAALVETFNRFFYRTMPMLNFYQNVLKHYSETIQPLRQAAKSSTRGLARALESERSAAMEEFRRNSERYWRNIIEGRSVEQVVLANGESNRRTVGGGAGEQLQKPQGNRDAVRTNFTGEATDGSRKGQRQHQQPPLSQQQQHSRGEGERSAVHTSPRQGGSRGMADLLSKLNTPKTGSGVKGSTAKNTGNNRRGPKS 1788 T 10 DUF5053 pdbhh F Eukaryota T 7pub 31 EA DB Q586P5_TRYB2 mS49 MIRRRVCDGARLAPNIRATFSAVRYQSGLHTFIRDSKPSNFSSVRRSENANGDATASPGATAGENPASSGDWASHMQRELFGEVDPLGGQAHKDYYRDVTRGYSPQYAPRNFANGGAVAYPHIQSPYEYEEAAHRRVWLDHDVDRMREEFTQHRASLRSLASAQEREELLRSRAAEYQVANTVHESESVHPIQQLYNSGGTSRSALKQQAVADRYSIAEQHSPLPLTTGVDRDALDEAQRTKDRILNDSFTAENLLITHGLREKEKHDFTILQRTVRIPFQGYDMDRFLAQQKGTPYGAQQLPPNVVPSSMEEAQRTLRGSSATATPLVDAVAQKVYARNTVVDRPAIGEQLTEQIINIMRASRTTAEQQREEERAQRFGLGRQGALVQDGGPDQRTLKKHTNDERIVDAMLFQQNAYRKTPTDEHWNPYIRRSTENGVGHLLQNKFDIMRREDRLSKGEQDLTERNTIHYGVPIQQIVDEFVFRHRNARGERPLDYFKPFPNFRALRLNRMYRDVEGFSLMKQRPEFLEWELFTRYRQHHQQRRRLALLHGLEPVANETAQERDTRRHRLDEICERTPFDEREMRVNDDEMRVSVETLRSWFGVYMLPSPTVVNAVLGGSASVNLHLYHLADEMGTADTREHVLSSRYLNRLLLLESYQNRVGRGFMNHVVGRAPEPVVPHEQPQEVLRHFSAEERAMYEQHVKEQTSRQLGEWERAMKRRRWLTDHQQYGHVVSHGLETSVVDLSHTETGAVLTVSTKAYEQEIEAVRMKTNATIKVDGMVYNLLPNSERRVVPLTVQLDSGEKIDMTSEDFDRCELEAFPRNLNHALNYGIANYAYNRGNYVETQDSIWEEQTASGQEGWSPATHADGLREGLPVRARRPIFSSSAEQRIAGGPQRAVIIQYHHQPFFNPEPRLVKVAFQCDGTIMEVPISDVMIWQRRYHGPERTVGDESRRYNPAAMRRYVDVTDPFNEKTSNTEHFLDKYEPKRNADTVADKYRTTKQITEIDKWTRYDSARADNYRPLSISHRRDYIRMGYIPRYTPWEWIAIQEADQPLIAEQIRQDNIGTSYFFSLNRYWRYKASPHGYIRHFENEVRDLLQYVDGVTPWKQAQKIRTYWEVRSHHPMPQFNRPEVAMHRNTVGLLPAHMWETDKKTGKVKSVKDSVRDYQTKTPYPKWVQL 1181 T 0.098 ACD unp F Eukaryota T 7pub 32 FA DC Q57YB5_TRYB2 mS50 MFRVRRLAGFIQCSPCGKMGPLKQQTRRYTPIWKSDPAVDNVAPLRDEDERRALWAEVGPISDVGSAVTAWIRFGNDPVLHTAVPTMLGGKFRNQQREKESLLPNSSSPFAYVEDYMGTNLVFGSPVHAKESAAVWATYFERRYASRLRLSRRTVANYVGLINSPEVFDDESDRPETRWSQDTFFRECAYLSEKFLKEKVSNMQQFEAALKRASPEAYLAFFDAFQQQTQTQIPLPSPSVWHYEGERRKQWAEKFISISHKAQAFFKDVLSEDVKKYQEVPGKLLQKVKPVLADVGKILVKRHERWLKGRVWTSLTEEEREAYCMKEVKRQQMQVEDGEFDPMMEDDVDDTELEEWQREHDAIMKLMNSPIDGLHFTTLELWLHTMRCEELETEHIYTSARVRAIQVAARKKLYDTTSYEEVIQAVVESIARGTLDLGAGVLRPHFNEVWCQLNYAKFGSSTITQHTTTSRRQLLFFHAGSLKDIAATATLYYATKPLSNSLDYASPYKYRRSLITLCSNYGVETAYTTQRPLLRSAANLARAEDLIHAVVTAAAQPFGERRRAATRDLHMEFQRLAVPVERVIVANPVSALLESGADPDEKPVEGEKVNMWPLGAKRVVLYKWSAPNVEKLKAMESDAASAVSGSSLTAKRLREIQELKRRGFLEVSLWRRVTAQERKQRNEIVEAKKKQVEEVVRTVPSLAHLHQYATSLYSRIEERVAFPTETSTVTETTNMKEETNKPLEDSEWEFAVLLDDRVLLNKEESVELYLPYRDANGELLAQGEYRALVRAFDLEANPNLHPAYCSVGYSESFQVFDALPQLIAQFFRVKDATAEAAGVTHIPAADFTPFCAFLRDAGLDVPLRCEFEAGQAVTTDGDVYMDYFLQLLRGEAFHQSHAQAGLTEAQRAIEPLCRAHWVVHHPGADESEWATARRSVLDHAMQHEREWWFPNEMLDVKDVVTGSTNGLTPQMYPAAVRYGVELCTVLTAEGKFVDERGSGLSARCVVNGTGAAESVVFDTANCNGTNTTSVEDALRVAHGALRSAQDRHNTLAAFRLGPLSKQSQVLLFCGVNAYEFGGKYARTYAYAFEKAKKELEATAASGFMAPSLSHEDTERLSDQPTTSPSVDRFASTTHPEQRKAQFVPRVGPGSTPLEDPAADQKSEWS 1165 T 0.23 CT_C_D pdbpercent F Eukaryota T 7pub 33 GA DD Q385L8_TRYB2 mS51 MLRCTVVGHHRSGKARAFVFRDPSLRMMRAGSGYQQLRRMGMPMQVGMGWRKVDSFHANTQYQHAWPLLSHDDLGNSDQSNNTKNIMYSMYMPKRNKGTAPWFRGADTYSVKYCEQGRYEYQRYLMINRFPSEYKKHFLSFLSNIRMSSGSATIPQEALHWLLRMIVDNFNPQHVHYIAAMKTLQSAGELDMARDVWKIMERQQTWPCTATICAYLDVCVEAGEKTWAMEAWNRYCTELKFLEPGEVDPKPISRVPFSLTREELLYLPKWKKHFDHDPNLDVMDLNRFNRTREVYLRMAQVMLAGGERNAFQHFFTKLEEAMLNKPTPVPEPPNPHLVRRPRWAPYEHCKSVHHSPWRLQNNGRALALGPPVTIEDEMQSRFFSNDQFLVHSVKEVLRIVLQEHKRAHPTECTRCKTEAFFYKTKDADETLKFCDDLIERLFASLGVRLSNLNTSSLLSTILEVFRVVGKESGAALLQRANEFLERKASLGDAEGSRENLTASNYLQVLSGFADESAFVYNTKKDGTCQYKTGFDPRTTMRHLADVVQEIAGNPHVTWAADMHLQVVETMVGCGTMKANDYFVRNVLRQFSWDSRFLEALYVEYRRQDDVDMWAELTKRALVWTARYNAPASERLRRLIEDDYDTIRVQTRTFRELAVFQFRDVEERRHSRDVVNELPNPWYDYVAHALPFPDRDAGYPDEYGDLGQWRAPGGPGSPVRGPGYYAPPMEGEHQRGYTAEWRDLRNPMRPPEFPTPWERKYRQYARGQHPSYDMVYAGPMPEIFPMRRDFRKPTRWDFHDIEKQGKYRTSGPY 812 T 0.001 PPR_long pdbhh F Eukaryota T 7pub 34 HA DE Q386Q7_TRYB2 mS52 MRRCSFASTCRYRSADDVPLGLGSAYVWTCRNITSRGQFNPIHNFSYAMERGVRARDVKAFEKLITNPGPLRVAYTPDYLDWLHRCYKAKGTYMDARAVAEKKFNGNIVSSELSAAVNRREGKRGDTRGDTVDNDHHNLPGAPPPGMFLRPAHSFRRLAGELKRRRAQSILDEVARAQGMLDLFERQPHFPAIHIDRCSRFHLVELFKEMVLERSLDSNMIWEKALLYRAILSERKPSYPTSFHYIFTAVEDTVFAPTISTPMEMAGSGGEGHDRSSHPLAAKCPTLEAYYYYVYLVKKYYIDNAVEAHVVLRCHREPNAADLLFSNPPPKDDTEIMKAVELLRNADIQRGVAAAAAVSDPTLPPGGEGSVIGNSDNXKNSEKXSEGSRGRPARPPVLPGAYPPIDMLWRCEENLPLLKVLLFGEFNLIVSENPFVKFPSAHGFLTRPYSTDSSRTLADGMSLANVMAEKRGHLLPSLPRNTATSIDARAQDIRRLQQKHHRDDIVSFQKLLRSTHAEDSPSAFSSYSDWSYFNPRAVRAEERDRLTRKAVEALKLYDSATNDIYRHSFEDVQACHTQRVTERDRTMPPYLPTLPHFVAIIKKDPHISFLLHIGLPDRNSSEEGSAKHKELEKRIYYLARALYHTALEYHNETVRRVNRQKVNVAASLLDNFVEQEWTTILRDKHDVTDVTKTLNDTQNDKKQLARRLGRYMLFANRSLDDTGFPTDARADDYTRWMAPPSVGKVSL 747 T 7.7 Complex1_LYR_2 unppssm F Eukaryota T 7pub 35 IA DF Q38ET1_TRYB2 mS53 MSVSGVFSKGRGIGHEATTSILRYIPRARVPWQPSRFGRENLTAADMARLWSRGRYRDGPGNYNSGYCTERTHVLEENTVSIIPRRELEKYMPDITIGPKALVTPVSLMNARNGHRVTHDLLHSYDPHIGRLGKPAVVDHDNITVEDPNRVGLNAATLDCRGRIYRWLRRGPFFQVDNYFRRSVKLNRDGTLPTDFVHEAPLMRKIIRLAHRGHLKAACEEYRRVTTVPPVEVYRALTACCVPGAKLADAVSIFEDGDSKLFYVSRDGEVLHNLMRCAIAARHRARIMWVYNVMRGRFYENVVVRAEVDLIWRYRIAMIALEYLLDHECAEEAAAIYSYLVEEELLRCDVHVRVGLHMREAIAAGKPITLNNDVMNATSLVRDATAVAPEVARELQRRHAQTLQNNAVEAVGAENVREGSAPWSILGPLTAIGPTAEDTMVWLQQHYGDVDVMSIMRWARFRKGKDLMAKDRPQYLARAAAWIELLSKRNREMEEVPLTYMRKSKPLVLDTNSNVRVAWQTPLMRSGGPPRLLAREEGYVFHHSNSSRFVEETYRHPGESLQSRYLALQPLHTEVSAKEDFQRLYYQAQKHHKQQERLLPVSTAAIPSRIVHHSVMSALHGVSGKGEPANRSLFTXNKSNDSHSNTGVASGTSACTDGAGSATPEF 666 T 0.0009 PPR_long pdbhh F Eukaryota T 7pub 36 JA DG Q57ZP8_TRYB2 mS54 MFRRAIPLLSANIPRSVWDPAQHNPNWSDSYGHDITNRRAWPARKWTVGLEPCTPREWLQFSHRNLAYAYNGALRACHSLPSMLLLYKEMKQRGVKVDVDTMNVLLTRAARHEHIQVDDVFLLFDELVALGARPDLAAAETLHTVLSHSASMPEEWREARRLQLVELYNNLAMEEVERLAPHRADRLLKEQMKRFRGNLQQLGSGLRPTVYCRYLHTTHTAAVLLEEVHNFLWELVPNDHPAMEIPALQLRVPFVASVLRRPSVNPGVSLASVSRAEFGDTDVCAVFLAAAERMVDADFDDQRPVSERRLFLSLLTMISYSGVLYTSDLMAQLMEMVKYSNNDETRDSDAQRVLRYALRGSSAAQDSASRTLWHSVEKVADCRVVGRYIGARNPWNPIRVCFDEQGVFKAYPISTTTTTREVSPPEGNGAVTQEQRASCVEGRTLEALNMRWDDVRRLIECTGVLVTPPSERCPQQQKMEVFTGMAVYLRTVATGRRYEGGEDVLSDGAVATSSCEQRRRGTLFAEGYDFDVWVRLFSLVQEVRHDMEKFMADHTLQCVEPEFECWEALLVTLRCALDFCVVQMQGGGARGTEREVVERLFRDVVALREELIEESRTRFGGRMRVLWLQEA 631 F F Eukaryota T 7pub 37 KA DH Q580V1_TRYB2 mS55 MLSQNVAKTTVPSYYMIRTNLPHRKPQNQWEGVYYYSGITKRQRHLILLHRKREREAHMRSFNISRASVLQRLEQLSGDRKQESLPPHVRLDLAVRLAQHGLYQQATPIVDELHHQKALHAGHYALLINALACPRLGQRILHCDAQCDPALTYKLLGDENGEERAQEAYRWFDLALTSLAVDCGGRTQPSHFVPYLPQGTAAASHITNALMRTLLTCGYTHVAAIPDSVYDRMGSMGISPTISTYELVMLALSLQGNMVEAESILSFLRSHHSEHITVESFNALLLGHREARQFDCCDAIWQELVDRRWPRASPLTAELYLRSIMDHANTPTSEPLQSFANINVVEKKKVPLVLAQMDELGVPRTHLSRVLMDEVEDSLRKFQTYRSRFYEWGRAVKQFDFIEFRRRNGWLYDLHLMKCTTKQVGPLRDFNDPDAVQGAVATAEIPAFFNERPAWERPPLEETLYVTTNKERYDDVRGGDIYYDDTRGLHDRSPTWMNEVPETRYDRLYGVNHPDIAKIGIRRHLNVEYVNRKEVVERDAALMKKTLSSGRRLRHRVESSRTHRNAGSLSGISSTAGGGSR 581 F F Eukaryota T 7pub 38 LA DI Q587C2_TRYB2 mS56 MRKFCHFMANCWNSARSHATYGAVPLTHSQVTSVYATDGGKVDELGLLELVEERIFSWKLNKWEMRIPPNLPNDQKELIRQEQENLKQILSEWRKCFGALNADILQISSLTGVPKDVVREKNRTWLQEEVAKLRWMGEVNKAALLRDAFMRLEAFGSRDFMFMERLCCIYGLARQGTFDEAFTNYITEDPVTNDIFVDERNPFKELVAHIVRNYSQIDIIYDFLGFNYSEGYRSSLRRYMEYLQCKTAENVRASGRLVTGDKGEHNILFDYCVSRESLVSGDSCQGIIDFLYINGNDVTLIIIASDNPWLRNRQLPHRRQMEGIARRVCFVLGIPPSEVRIRNLLLPPTYLDKGSIVRLNDIVFRLSNEQSNLLIPWLTNYNKELDPKDVDYTALAKTTNEEEWLTL 407 T 0.2 ApoC-I pdbpercent F Eukaryota T 7pub 39 MA DJ Q584U8_TRYB2 mS57 MFRSTRPRSVGYTPVNPDTSPMVAYSQYHWHYNLPQGMERPHSVNRTFAAPFQSNHSLVNKYRGVWIEFDMHPAFSVALEPQLRKLPRGRTLPKTPAEEVIADYTALAPLVDDEKTRDLWLAKVFQHCAFQRCGGAMELWERYCHQRFTAEGATAKPPLSLVKSVLFYCNKTDNSGWRALFDRCLKDGWNYTPLFDTAQWSFMLKSIGRMGDEDGVRAVLEEMLDVQADLDRVEARSVVIALNAVTNADVYEFVKKYLFNFGERKVKFLRTTYSDLRGHGAGKLRIPLKENDNMYYHVCWHSSIRSPRQFSPRQLYFDYTPSTLGSSSHNPNAKIDDIVKDKIEKWKAEGLLPEDYVHEDRVYDRSAAFKNVARQEKWKKMPKILKSKRMGYTGDP 396 T 1.1 RPM2 pdbhh F Eukaryota T 7pub 41 OA DL Q38BS2_TRYB2 mS59 MRCSCAFLDKSVFAAKRRVIVPIHPTPNFPAHFIKSAFTTDPLKEKQKARFSSGGEAMREVQDIPKNLEGERSRRDLASRGDTEFQALVEFIEGASYDQLISGRRFKKVYDVLSENDDMFIWLCHTAMSVLNPGDMRSRLIYNHLRILAESVASGEMTQRTAFRFFESAVRSPAYREIAKRQLEGGAATRLAGISAAADVMRRMGLTRRPMSSYFELYQRIVERSEAMTPWGFPPLFQFEERLSLEPRLKFFSRASQQSLERRRRGHVMTPHTTLHGRRIFWIPPTWNRAGRFLGPHVTLYPGMTPD 307 T 0.37 DUF2840 pdbpssm F Eukaryota T 7pub 43 QA DN Q38D60_TRYB2 mS61 MLCRTFLRQFRMSGGDMFVEYKVLSRDHRRSIRVEDAIVDPTFKRTVLPLGWLELLRSPSLRLPTGYFVEETVHVSLPNATSNGGKKEARPQKGGFASGSPSVGRNEANAIIAGPVVLYITGQSVPVVLNPYFVPEGTWDMRTRDGELDLRLGMDAIEQCTLFSELRPGGLLYGKLPENPNVRRNESLRATLGRYGMKCDLAESPLVPRPWTRMRYMFIDELQRGPKLTEFVGHNPRNGTPWRFSQNTKYFRIGIWRDTIRRNDMNEGLHAHSSWQKSPQQSVPEVRFLAPYP 293 T 5.9 AAA_11 pdbhh F Eukaryota T 7pub 44 RA DO Q383D1_TRYB2 mS62 MRRFCTARVSSYVKRAPLLLPRGGRRWRSGAGTTADGGGEKEYIADSFESTAGSNAYAAMELAAEARREIHELWLSAETALEREKRVQQVAALIEKYKLDPSTPREADVSRGLGDAFDRLLLLCLPLGKTDAKGTDNLERLMHLAGRNGRELSVRTIQHLFARTDSFAEALAVFYTMRRCHVAMNMEAYYSMLYSLQRLEEEGWGQHFRNEYEENGAPSEQAMDFIVKGISNALLPENKPWLGRVMFQDRNVPDRRYDTRDFDELDTAWTQRYKSGTPAGAH 282 T 0.084 ECSIT pdbhh F Eukaryota T 7pub 45 SA DP Q38F25_TRYB2 mS63 MLHGTPVRRASLRYRRPYWMMFLKGVDNWKIYTVIQQPDHQRTEMLYQAWLGGLDRPYTRPKCMANQPLWLSKKRHMLRKERLDGPETPLEKYVLEWHKKFHSFQGTERPTPDDLHTALDLVERPLDLSYALQLLGQCRNLNNIRFAKETFLVFLEACLRVGRRDCAEYALEHAEPLGFWFIDEDHRRYLQGEQTWYKLSPLDNLYYPVEENAKLNEGRKPITRLSPATESPGSGTAISDGEPSTDVEGETTVDDEIAQLEAELAALEREGGGK 274 T 0.19 MNE1 pdbpssm F Eukaryota T 7pub 47 UA DR Q57UA2_TRYB2 mS65 MFSESLCILSRRFRYNTKFPALVSYNKLPWEVVNHETPQFHMHVAPHYEQLLTLAASSPVPHIIGSKHIDVPREHRLRLLPGMLYLLDGDTLPGEFTINRVLDPTALQYYGRLSSQIVTVEAVRMLVPDDLRLLCNCITFKGPLHLPVAPYASLASLRGASQGGTTGSETGSNCFTLYHFVRPNRPPKELQLEKYYIHAPCVAPLSEFASNSDERGNWRPRLQAPKRTRRATPLPAYRPPQSYLMGLAERLAVVPGGCFGRRSLMWGHWF 270 T 0.47 DUF3475 unp F Eukaryota T 7pub 51 YA DV Q57UZ6_TRYB2 mS69 MRRVRDGLFLSHPSSALQCSVAVVSTGEFDHPPFQFRQRHTFNTTPLHDANRFGGRTAYLREIGPVNIKKQGRRFKKDPRTVQFNVDVWCAQQTLRKRWKQRDWEVIEIPFRLVPREQQRVIPELYTDIPQMTDPARNDFSNIRNKVYDREELQGVLFPAAGAMLYPPLQRVDKQAMTLDKYL 183 T 0.13 CNRIP1 pdb F Eukaryota T 7pub 52 ZA DW Q383N9_TRYB2 mS70 MLRFTHRALTATPERFSVLGTTHPKPKRTGFGRNNKMRSKPSDNVAWYDKGPVEWLPRPVRLTYDHLDQLQQWTMRATLDGRTEEFNRIRDLHREWSQHPLMPVLGDVEPKFPLNLFKQNHRAKKRFLVRWHKANTPANWLWMPRGPTVVTPLHRTNPTQYPENWKQMVQRKSGTGTPS 179 T 16 THDPS_M pdbhh F Eukaryota T 7pub 53 AB DX Q383G5_TRYB2 mS71 MFHRAFVSSSDLTGCTIALSSVCTQKRYWAKPKKRPKVGQGFHEKAQKWREEYLLDRHRALADSLRAYVEFSTSKRVEPWDARFKPFDRIEKDGVYVLMRYMMEEKLQLCNYHHRPVKRLFCNIGLMGPQITTKARWKPYRFATNPAGTSKAERMYQRDKTVYTHGHND 169 T 0.18 Tox-MPTase5 pdbpercent F Eukaryota T 7pub 54 BB DY Q57YD4_TRYB2 mS72 MLRSTLTSLNTFLTSSVATPPISVIRTGPKWWAHPERMVRQKLMYFTLGVDQLPLRRTAVIQRDLQRFHMCKPPPRVGDSTGYKRSRAAQLNTWYRRIQYQEYHMQHLFTRHVWGLLRVYPGNTTKIQGKADDGYVGYDSVPFHRYNRAPLPFPARELYERRK 163 T 37 MBDa pdbhh F Eukaryota T 7pub 55 CB DZ Q587C4_TRYB2 mS73 MLRRSILFRMKYADLELTTRGEFPHGMKEPAFVKKLDQNIPWYFSTYRSMYHWPITGDNWSDLNEAEKHHDLHMFYTLAWWKLGEGIFGVDEDS 94 T 9.7 Mastoparan_2 pdbhh F Eukaryota T 7pub 56 DB Da mS74 MFSLTQTWLIAHWYCGHKFRHRFMRDKRFHPSLQASHDARNRFSKRRHFKTNRWNYQQAYRDMP 64 T 0.093 MLANA pdb F T 7pub 57 EB F3 Q38E61_TRYB2 mt-SAF3 MLRGWHPNSSAMQVGMRHITIGGRHSRGGFRQPLGKHPQVKQGTVEGVPRRIPGTTKVTYTNKKGRTFSFSVPVSELTHPQVTLESAAGTWREMDTSFCELGDIEDDMPSPVDECLRGGSSLDKRLIQEVRERFVSFCREYVLMDTSGMKSTILSTELNAGPDYEHYDRRLRRKRHWLAIRHRFEDVRYVIWPDVVEETARGDSAQADVSLTNPSLTAGEMLEALLWLDAASTFCVRKVHPSDLGDKSEFLPLDLQREVEVVACHARRDLDFFDPSATSLEQFTACAALCVNHRVPFSLFFPAQDVCGDASVSTGQCIVANAPSPHTALGAVRIMALISEGSGSDIGKTIMFSDAFGAVTRFGILRGLSRVMSVEAFGCKDALENVNESELCIILHFCAEVREQNAAFFRRYEASEEDSDPQQVSFLAKYQQLSQIALARCKRLLYHPDSPRAQVMSEDGYIPLVELQRHAEGTNKAALIHYNLGIRSAQGMRRVALGAQSSARLAELVSRLEEASARVSGNTLVNDLVHHLSHKAAAGKMSLTLREVNTLLPLLSRMRRESPNGALDARFDRVFNAIDTAIGAAMRHNCTLDELLDLAEGLAACEMVPSALKQVEMVLIRSVMMHECSPMHLRRMLQAMFTLMRTSVPQVLLQSVASRVADYIKEASHMDSSSSNGGGDEKVKNHEECEQLLELLVVLGKCGYGALPGLVTIYWEAQLIDSMQLNPRLRCSYASLLASAAFALKKHDKRAWEGLADESHRLFMEYTRCNKENDIGRFAECVTGLAVLTQIKDNTNSSDVAFLKEYLSATSLELKSCEVIRVQELTDLLGRTLEWSEALGVVAPDVVIQLEKALFVMLENVSHTAPGVGIPDELVTAACCLVDMSSASLELRKAAAGVVGGAIVHAEEALETLRSGAPTQVRPGHSFDVAALASAERENVYKNSILQYCAALQRSGMSTHVEELWS 966 T 0.074 DNA_ligase_A_N unppercent F Eukaryota T 7pub 60 HB F9 Q57YC0_TRYB2 mt-SAF9 MTGPTRALFLSSGINLGRLRLAEQFSSMNGWQSKEDPAFDAYVKERRRKENYEAFDQRVERGYAAAAKLHKAEIQNAVKRRLKSSGAKFTAETLREMSSAVTERLAWLRDVWAQIDADYRSGDSARQETAAQEISAALRGEPNDYMRWVYETKRELRFAGPVGRRAIQEELQAAELPEVLDEEVNRYHDLKLNMMEIEREVKAKYGVAGQQHWAELQAAKDEEYIQKLDEAAEVYKQLLDQSARLDESRRSELQRSYVERVHQAQVRFKAAMELEGQREQLIEAHQAMKEERMRTEREKRRQLLREAAELRAQGKKSADVLTALKERQLDANAKRQAEYELKECEDILKRKSEMLDMIAHFKHDVEEREGREMLQRQKSDEERQVNVFGFYEEVGVEDGLSISSEGTTSQGGSSGLGTVSTSTSCAKSADSNSSAQPSQKLRKEELWKVINADTYEDPFRTVHQARLDAVKTYDPAYARTFPLNLVLGRKYSRQGAGEMAAGNETDKQILQKGNNILYSFQWGLNNGTVHDLDADGGTDYFMDGAFHVRDKETGDIDWRYEKKRGGPVFRGPKFYRLGAQREAADPGERAMDPTPYTSTPREHKWRSS 608 T 0.11 MDMPI_C pdbpssm F Eukaryota T 7pub 64 LB Fh Q38A63_TRYB2 mt-SAF37 MWQRLRFDRLSSSVRRTNLNPLKPCAALTEQRAELRNLHQYPTARHKSLVKDRLRFARNWWLTGGNNYELVHEVGHEREATECFAEYAQDSSRDVYLMSTNRLSDLPPGDRLKAIVGLMRSRWEVKDANRGYDKAKLLLQALECFSEMKASGQIGDFNSLPEPDQDTFLQYVEGCSRFAQACSHSHPDAVRVLLRAAQICEEMRCVEKRDEMIQVTEAAANRMDRAYAFSRPHDTLRAAPPSLHENEDCVRLKNTEELRRRFGNTAPHVLEKPKRVDCLRIHRNRPLLLHPMKDNNKLLELSKLPARPEFDSWTSHQT 318 T 0.077 ATG8 pdbpercent F Eukaryota T 7pub 65 MB Fi Q57ZP1_TRYB2 mt-SAF38 MRGSLPLLFNPVLPPSTARLRLLTYPMALAQPHATVPLIQPTIDGTHDGRNGATVSLRTQARMHGTADGTMATAGDSSQNNSVMDSPRWLRNPDELCVAALRRSRDVNKINSYVATYKFDDPQWAPLLLPEVTLRPQVNSTGDKPNGGNEAAADVVSVGPSVSATPESTPPPPPSSSSSSPYSCPADCVSISHNKMIMLECMSRHVNFSLRHIVQKGHGIYLIYHAQHSILQPKGLVEQSFVTCSFGIRGERLRTDIVHVGPIDAADVMELQPSEGHDHPRCCFNLYQKSDVRRGVIAVSQVEGYGTWFQRKPMLWQRSRRIGALQSQLGAFAYDLVDPHEVGKWRDCEVSLLAPHMRFFRNGLNGAEAVGIIASSQVAQQRRLYLGEFEAPAITALDAVQQLAHASALRCKLVTPVVDPNGVGGTGSGSLGDENMDKHIDMETLLPLSWATRTPPPYVPLEADLPFKLQMSRPTVFAESHQQNQAYPTGGTVGSPFVRGAPMMMFEYNMHQGVDHYVYDDAPSARPMKWWSQKSNMPYSGYMYFARSGLVDRFTPSEDIPNPLEPTSKRKPLHAVVPPTKVVQERLRKYRRKQQEGHKQRRRASSGSGVSNEPDAVNRQESVSRGTCE 629 T 51 RCDG1 pdbhh F Eukaryota T 7pub 67 OB IB Q387Q6_TRYB2 mt-SAF39 MRRSGRGSAVRWSSLCKCQCCLYRTPLGGTYFEQALPRSLGARQGKGVLSTVNTALSRKALKRRQSLPRKKLNVPLTAEGLKERLKQLSAEERELSIKNNTEECDEPSPNEFTTTHEARVALARVLHHGENAGERKEVAMRIPSFCRSPAVSETQSIVVDDKEGDITNAAVHVGCSVLGSDLDHLERDMIRDYHQRGKKLPTFDNIYRTLGCGRKGTSVSDTEPEDENSSGAIQSECGLGDAGRRGTVVVAPSHLHHSTPPTKGRSGEEEEEGGCFDTNTLPADANPHFPPGACDNEVLAPLSGGCAASEQTEITDTASFIPSNSRLSTAVYDAYRQRPADDRLVVLRGTDFWDNEENRARLQELTDYAEEDFAREMLMEGAMDTSEVGYSTNKVRKETLLYFQAHPINEMIQEPFARVRSILPSDGGPEVHFPADDPDTDVDIPTAQARTMARELGLDLIRVGTLYTPINDRRVVAVCTIADHREHMRDMIRFKIKKLGVQRPPTKEGIEVPFRGGTHPHAVRFKSIGIAKHLLLGHVVRINLTDFGTVREGFPVFGSILDEVARQALQLHAYHTAGVVRANYNEVYCYLYPSTGRSPKSTVLHPTQEQLATVRDRCLLEREREVYFDGLYDKKTPRERLTYMRKLQDGTAWADRDDGLSLQRQRDMKVMLGYLPKGNHELYAARGDVNVPAPFRASHPTSVDRWTHPQESNLEQAARGSAVLAKRLSMTVSEMHDRQETAENPATLDRFYYRIQGPALEAGELKEALGLKGNRKRLPRRAPGWATLGMEKVSPQEPGHAAK 803 T 0.00057 mIF3 pdbhh F Eukaryota T 7pub 68 PB,VB U6,UJ Unk XXXXXXXXXXXXXXXXXXXXX 21 F F F 7pub 69 QB U7 Unk7 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 40 F F F 7pub 70 RB UE UnkE XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 53 F F F 7pub 71 SB UF UnkF XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 39 F F F 7pub 72 TB UG UnkG XXXXXXXXXXXXX 13 F F F 7pub 73 UB UI UnkI XXXXXXXXXX 10 F F F 7pub 74 WB UK UnkK XXX 3 F F F 7pub 75 XB UL UnkL XXXXXXXXXXXXXXXXXXXX 20 F F F 7pul 2 B P GLY-ALA-GLY-ALA-ALA GAGAA 5 T 54 Stomoxyn pdbhh F F 7pvm 1 A A G0S058_CHATD 5'-3' exoribonuclease GGEAKARLCKLCGQKGHDERSCKGEAKQKQG 31 T 0.0018 zf-CCHC pdbpercent F Eukaryota T 7pvt 2 B,D B,D VSL12 VSLARRPLPPLP 12 T 0.73 DUF4522 pdbhh F T 7pvx 2 B,D B,D VSL12 peptide VSLARRPLPPLP 12 T 0.73 DUF4522 pdbhh F T 7pwf 10 J A RSSA_GIAIC 40S ribosomal protein SA MSTEKTSQASKEYQLKEADVKKMLVATTHLGVRNIDRRMQFYIFDRQKDGTFVFNLQKVWAKIVFAARILVTIDDPAEIAVVANRPDAQRAILKFCKYTHATAFPGRFIPGNFTNRMNPNYCEPRLLLVNDPVVDRQAILEASYVNIPTISLCNSDANLKFIDVAIPCNNKTPMSIGLIYWLLAREVLRLKGSISRTEEWDVKPDLFVALPEEIPDEEESEDFYDDDEEEDEEFSAGNGNLFDEY 245 T 1.1E-11 Ribosomal_S2 pdb F Eukaryota T 7pwo 74 VB A1 RSSA_GIAIC 40S ribosomal protein SA MSTEKTSQASKEYQLKEADVKKMLVATTHLGVRNIDRRMQFYIFDRQKDGTFVFNLQKVWAKIVFAARILVTIDDPAEIAVVANRPDAQRAILKFCKYTHATAFPGRFIPGNFTNRMNPNYCEPRLLLVNDPVVDRQAILEASYVNIPTISLCNSDANLKFIDVAIPCNNKTPMSIGLIYWLLAREVLRLKGSISRTEEWDVKPDLFVALPEEIPDEEESEDFYDDDEEEDEEFSAGNGNLFDEY 245 T 1.1E-11 Ribosomal_S2 pdb F Eukaryota T 7px1 1 A,B A,B Conus mucronatus GENSDNLTHCRLFEFRLCLLECMSLTLDHCYARCTTVITQIHGSDTNRFDCTIFKTCYYRCYVLGKTEDHCWKGTATSVTGDVGDLEFC 89 T 6.1 Toxin_25 pdbhh F T 7px2 1 A,B,C,D,E,F A,B,C,D,E,F Conotoxin Mu8.1 GENSDNLTHCRLFEFRLCLLECMSLTLDHCYARCTTVITQIHGSDTNRFDCTIFKTCYYRCYVLGKTEDHCWKGTATSVTGDVGDLEFC 89 T 6.1 Toxin_25 pdbhh F T 7pzl 1 A,B,C,D A,B,C,D CAPSD_HBVD3 CORE ANTIGEN,CORE PROTEIN,HBCAG,P21.5 MDIDPYKEFGATVELLSFLPSDFFPSVRDLLDTASALYREALESPEHCSPHHTALRQAIVCWGELMTLATWVGVNLEDPASRDLVVSYVNTNMGLKFRQLLWFHISCLTFGRETVIEYLVSFGVWIRTPPAYRPPNAPILSTLPETTVVRRRGRSPRRRTPSPRRRRSQSPRRRRSQSRESQC 183 T 1.5999999999999998E-25 Hepatitis_core pdb T Viruses T 7pzm 1 A,B,C,D B,A,C,D CAPSD_HBVD3 CORE ANTIGEN,CORE PROTEIN,HBCAG,P21.5 MDIDTYKEFGATVELLSFLPSDFFPSVRDLLDTASALYREALESPEHCSPHHTALRQAILCWGELMTLATWVGVNLEDPASRDLVVSYVNTNMGLKFRQLLWFHISCLTFGRETVIEYLVSFGVWIRTPPAYRPPNAPILSTLPETTVVRRRGRSPRRRTPSPRRRRSQSPRRRRSQSRESQC 183 T 1.9E-17 Hepatitis_core unp T Viruses T 7pzn 2 E M SLLGRM, modelled as poly-A,SLLGRM, modelled as poly-A XXXXSLLGRM 10 T 16 Cas_CT1975 pdbhh F T 7pzt 1 A,B A,B A0A4P8JK46_ALCFA Urea amidohydrolase MNLTEKGTKTAKLSASDRIIYADNHLIHGPDDITAYMKGVCYDAAAYMRYLYNAKISFDQLTSISAQNWLPVFKFAEGRMWDGRNSLPGGKAIGFCRVKGMEFFHAAVAVGGTEIRAINGGLLGAGWLHPVDLRKVLTQKNPDGSFKYDGTDIFVYISNL 160 T 10 AAA_assoc_C pdbhh F Bacteria T 7q02 1 A A Q6SVB5_DIPPU Milk protein IAAILVANAKEPCPPENLQLTPRALVGKWYLRTTSPDIFKQVSNITEFYSAHGNDYYGTVTDYSPEYGLEAHRVNLTVSGRTLKFYMNDTHEYDSEYEILAVDKDYFIFYGHPPAAPSGLALIHYRQSCPKEDIIKRVKKSLKNVCLDYKYFGNDTSVHCRYLE 164 T 4.6 Transglut_N pdbhh F Eukaryota T 7q1e 4 E P CENPJ_HUMAN CENP-J,CENTROSOMAL P4.1-ASSOCIATED PROTEIN,LAG-3-ASSOCIATED PROTEIN,LYST-INTERACTING PROTEIN 1 MVNIEERPIKAAIGERKQTFEDYMEEQIQLEEQELKQKQLKEAEGPLPIKAKPKQPFLKRGEGLARFTNAKSKFQKGKE 79 T 0.11 DRAT pdb F Eukaryota T 7q1f 4 E,J P,V CENPJ_HUMAN Centromere protein J MVNIEERPIKAAIGERKQTFEDYLEEQIQLEEQELKQKQLKEGGGGGKPKQPFLKRGEGLARFTNAKSKFQKGKE 75 T 0.64 DUF1654 unppssm F Eukaryota T 7q1r 1 A,B A,B apCC-Di XGQLEQELAALDQQIAALKQRRAALKWQIQGX 32 T 0.0002 DivIC pdb F T 7q1s 1 A,C,F,G,J,K,N F,L,J,D,N,H,B apCC-Di-B_var XGQLKQRLAALDQRIAALKQRRAALKWQIQGX 32 T 0.0029 DivIC pdb F T 7q1s 2 B,D,E,H,I,L,M E,K,I,C,M,G,A apCC-Di-A_var XGQLEQELAALDQEIAALEQERAALEWQIQGX 32 T 0.00059 ABC_tran_CTD pdbpssm F T 7q1t 1 A A apCC-Di-A GQLEQELAALDQEIAAAEQELAALDWQIQG 30 T 0.0029 ABC_tran_CTD pdb F T 7q1t 2 B B apCC-Di-B GQLKQRRAALKQRIAALKQRRAALKWQIQG 30 T 0.0019 DivIC pdb F T 7q21 1 A,K Y,y Co-purified unknown transmembrane helices built as polyALA (AscD) XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 96 F F F 7q21 11 U X Co-purified unknown peptide built as polyALA (AscE) CXXXXXXXXXXXXXXXXXXXX 21 T 1400 Fer4_6 pdbhh F F 7q21 12 V x Co-purified unknown peptide built as polyALA (AscE) CXXXXXXXXXXXXXXXXXXXXX 22 T 1400 Fer4_6 pdbhh F F 7q21 13 W,X V,v Q8NS61_CORGL Actinobacterial supercomplex, subunit C (AscC) MFPEFERMYDMANVEKKHFVDPAWPEHNPADGHVVTELISKVAGASSPWGDDKEFPVSAEETGYVHPYTRINR 73 T 10 PHYHIP_C pdbhh F Bacteria T 7q21 14 Y,Z K,k Q8NSJ8_CORGL Hypothetical membrane protein MYMGKSFALLVLGAIILAGGVWYTIEVGYSVMAIVAALIMAAGGGIITWGLAVAADVNSPTSHKI 65 T 0.00051 DsbD_2 pdbpssm F Bacteria T 7q3u 1 A,B,C,D,E A,B,C,D,E TADBP_HUMAN TDP-43 NPGGFGNQGGFGNSRGGGAGLGNNQGSNMGGGMNFGAFSINPAMMAAAQAALQSSWGMMGMLASQQNQSGPSGNNQNQGNMQ 82 T 0.024 Glucosaminidase pdbpssm F Eukaryota T 7q41 1 A,B,F B,D,F UBE3A_HUMAN Ubiquitin-protein ligase E3A (E6AP) peptide AKDEDKDEDEKEKAA 15 T 0.11 PCM1_C unp F Eukaryota F 7q42 2 D,E,F D,B,F BAZ2B_HUMAN HWALP4 EDDDDKDQDESDSDT 15 T 0.0019 SDA1 unppssm F Eukaryota T 7q44 2 D,E,F D,B,F UBP35_HUMAN DEUBIQUITINATING ENZYME 35,UBIQUITIN THIOESTERASE 35,UBIQUITIN-SPECIFIC-PROCESSING PROTEASE 35, UBIQUITIN CARBOXYL-TERMINAL HYDROLASE 35 GFDEDKDEDEGSPGG 15 T 12 DUF3245 pdbhh F Eukaryota T 7q45 2 D,E,F B,D,F MYT1_HUMAN MYT1,MYELIN TRANSCRIPTION FACTOR I,MYTI,PLPB1,PROTEOLIPID PROTEIN-BINDING PROTEIN RSDDDKDEDTHSRK 14 T 1.3 PTN13_u3 pdbhh F Eukaryota T 7q47 1 A,B A,B H6WYJ5_9CAUD Endolysin GAKTSLPRGIRNNNPGNIEWGSPWQGLQARTAASDPRFCQFIDPASGIRALAVILTTYFDKRKAADGSKIDTIREVIERWAPPKKNGVVENNTTAYANQIARVLNMQPDDETLNLHDYETMRKMVEGIIRHENGSPEDYDRAPYNNINQWYSDEQIAEGLRRAGLVKPKT 170 T 0.044 MPAB_Lcp_cat unppssm T Viruses T 7q4i 2 C,D G,F MUC1_HUMAN MUC-1,BREAST CARCINOMA-ASSOCIATED ANTIGEN DF3,CANCER ANTIGEN 15-3,CA 15-3,CARCINOMA-ASSOCIATED MUCIN,EPISIALIN,H23AG,KREBS VON DEN LUNGEN-6,KL-6,PEMT,PEANUT-REACTIVE URINARY MUCIN,PUM,POLYMORPHIC EPITHELIAL MUCIN,PEM,TUMOR-ASSOCIATED EPITHELIAL MEMBRANE ANTIGEN,EMA,TUMOR-ASSOCIATED MUCIN APDTRX 6 T 170 DDE_Tnp_1_assoc pdbhh F Eukaryota T 7q4k 53 AB D1 MsrDL: FME-TYR-LEU-ILE-PHE-MET MYLIFM 6 T 9.7 JAMP pdbhh F F 7q4q 3 E,F E,F A2GL_HUMAN LRG1 epitope GNKLQVLGKDLLLPQ 15 T 1.9 DUF3719 pdbhh F Eukaryota T 7q4t 2 B LbL ALA-DGL AX 2 T 1000 zf-H2C2_2 pdbhh F F 7q50 2 B B FDVSWFMG peptide FDVSWFM 7 T 1.7 DUF5724 pdbhh F T 7q51 2 B B FWLPANLW peptide FWLPANLW 8 T 1.7 Pinin_SDK_memA pdbhh F T 7q5a 1 A,B,C,D,E,F,G,H A,B,C,D,E,F,G,H Lanreotide XCYXKVCTX 9 T 0.021 Urotensin_II pdbhh F T 7q5g 1 A,B A,B LAN-DAP5 DERIVATIVE OF LANREOTIDE XCYXXVCTX 9 T 0.72 DUF968 pdbhh F F 7q5w 2 G,H,I,J,K,L GGG,HHH,III,JJJ,KKK,LLL TYOBP_HUMAN DNAX-ACTIVATION PROTEIN 12,KILLER-ACTIVATING RECEPTOR-ASSOCIATED PROTEIN,KAR-ASSOCIATED PROTEIN ESPXQELQGQRSDVXSDLNT 20 T 0.0049 ITAM unphh F Eukaryota T 7q64 1 A,AA,B,BA,C,CA,D,DA,E,F,G,H,I,J,K,L,M,N,O,P,Q,R,S,T,U,V,W,X,Y,Z A,W,a,X,b,Y,c,Z,d,B,C,D,E,F,G,H,I,J,K,L,M,N,O,P,Q,R,S,T,U,V NUP98_HUMAN Nuclear pore complex protein Nup98 TGTANTLFGTASTGTSLFSSQNNAFAQNKPTGFGNFGTST 40 T 0.43 Nucleoporin_FG unp F Eukaryota T 7q65 1 A,B,C,D,E,F,G,H,I,J,K,L,M,N,O,P,Q,R,S,T,U,V A,B,C,D,E,F,G,H,I,J,K,L,M,N,O,P,Q,R,S,T,U,V NUP98_HUMAN 98 KDA NUCLEOPORIN,NUCLEOPORIN NUP98,NUP98 TGTANTLFGTASTGTSLFSSQNNAFAQNKPTGFGNFGTST 40 T 0.43 Nucleoporin_FG unp F Eukaryota T 7q66 1 A,B,C,D,E,F,G,H,I,J,K,L,M,N,O,P,Q,R,S,T,U,V A,B,C,D,E,F,G,H,I,J,K,L,M,N,O,P,Q,R,S,T,U,V NUP98_HUMAN 98 KDA NUCLEOPORIN,NUCLEOPORIN NUP98,NUP98 TGTANTLFGTASTGTSLFSSQNNAFAQNKPTGFGNFGTST 40 T 0.43 Nucleoporin_FG unp F Eukaryota T 7q67 1 A,B,C,D,E,F,G,H,I,J,K A,B,C,D,E,F,G,H,I,J,K NUP98_HUMAN 98 KDA NUCLEOPORIN,NUCLEOPORIN NUP98,NUP98 TGTANTLFGTASTGTSLFSSQNNAFAQNKPTGFGNFGTST 40 T 0.43 Nucleoporin_FG unp F Eukaryota T 7q6i 2 Q,R X,Y Cell division protein FtsN (polyAla model) MANRDYVRRGKGTSRRPAKKKTSGKKPWRXXXXXXXX 37 T 8.1 MRP-S33 pdbhh F T 7q72 2 B,D C,D RED1_SCHPO NURS complex subunit red1 GAMGISLPLLKQDDWLSSSKPFGSSTPNVVIEFDSDDDGDDFSNSKIEQSNLEKPPSNSENGGSHHHHHH 70 T 5.2 DnaA_N pdbhh F Eukaryota T 7q7s 2 B B ALA-TRP-VAL-ILE-PRO-ALA AWVIPA 6 T 0.39 DUF3950 pdbhh F F 7q7t 2 B B ALA-TRP-VAL-ILE-PRO-ALA AWVIPA 6 T 0.39 DUF3950 pdbhh F F 7q8d 2 C,D PA,PB ASP-LEU-GLU(AMI) XTRESEDLEX 10 T 64 AbiEii pdbhh F T 7q8f 2 C,D PB,PA GNYKEAKK Peptide XGNYKEAKKX 10 T 1.5 TPR_3 pdbhh F T 7q8g 2 C PA ALAASS Peptide ALAASS 6 T 220 Peptidase_S49_N pdbhh F F 7q8g 3 D PB EYS Peptide EYS 3 T 240 Toxin_36 pdbhh F F 7q8h 2 C,D PA,PB EVCKKKK Peptide XEVCKKKKX 9 T 11 NOZZLE pdbhh F F 7q8i 2 C,D PA,PB AVAEKQ peptide AVAEKQ 6 T 380 Tafi-CsgC pdbhh F F 7q8j 2 C,E PA,A IILKEK Peptide IILKEK 6 T 140 PipA pdbhh F F 7q8j 3 D PB EYS Peptide EYS 3 T 240 Toxin_36 pdbhh F F 7q8k 2 C,D PA,PB LLKVAL Peptide LLKVAL 6 T 33 KCTD11_21_C pdbhh F F 7q8l 2 C,D PA,PB VPCGTAHE Peptide XVPCGTAHEX 10 T 2.8 Sod_Ni pdbhh F T 7q8m 2 C,D PA,PB KPKKKTK Peptide XKPKKKTKX 9 T 56 SOXp pdbhh F F 7q8n 2 C,D PA,PB KKYDAFLA Peptide XKKYDAFLAX 10 T 4.3 Pollen_allerg_2 pdbhh F T 7q8o 2 C,D PB,PA LLSGKE Peptide LLSGKE 6 T 100 Sm_like pdbhh F F 7q8p 2 C PA LLKVAL Peptide LLKVAL 6 T 33 KCTD11_21_C pdbhh F F 7q8p 3 D PB EYS Peptide EYS 3 T 240 Toxin_36 pdbhh F F 7q8q 2 C,D PA,PB RLSAKP Peptide RLSAKP 6 T 1.7 HMG14_17 pdbhh F T 7q98 3 C,F,I,L,O C,F,I,L,O ASN-LEU-SER-ALA-LEU-GLY-ILE-PHE-SER-THR NLSALGIFST 10 T 13 NPH-II pdbhh F T 7q99 3 C C ASN-LEU-SER-ALA-LEU-GLY-ILE-PHE-SER-THR NLSALGIFST 10 T 13 NPH-II pdbhh F T 7q9a 3 C C LEU-LEU-LEU-GLY-ILE-GLY-ILE-LEU-VAL-LEU LLLGIGILVL 10 T 2.8 UAF_Rrn10 pdbhh F F 7q9c 2 C,D,E PAA,PAC,PBA RLSAKP Peptide XRLSAKPX 8 T 3.9 HMG14_17 pdbhh F T 7q9h 2 C,D,E PAA,PAC,PB LLKAVAEKQ Peptide XLLKAVAEKQX 11 T 23 YebO pdbhh F T 7q9s 2 C,D CCC,DDD KRas DGKKKKKKSKTKC 13 T 4.5 TMEMspv1-c74-12 pdbhh F T 7qal 1 A A PGAM5_HUMAN BCL-XL-BINDING PROTEIN V68,PHOSPHOGLYCERATE MUTASE FAMILY MEMBER 5 AFRQALQLAALGLAGGSAAVLFSAVAVGKPRAGGD 35 T 290 TMEM210 pdbhh F Eukaryota T 7qam 1 A A PGAM5_HUMAN BCL-XL-BINDING PROTEIN V68,PHOSPHOGLYCERATE MUTASE FAMILY MEMBER 5 AFRQALQLAACGLAGGSAAVLFSAVAVGKPRAGGD 35 T 370 TMEM210 pdbhh F Eukaryota T 7qan 1 A,B AAA,BBB F4F6Q5_MICM1 Cytochrome P450 MAHHHHHHSSGLEVLFQGPMIEIPSAATASPQYPQRRACPYRPAGGYERPVTRVRLYDGRPAWLVTGHETARQVLLDAATFSSDRQHPAFPALAARFEAARAVRNFIGMDPPEHTAQRRMLISGFTAKRVATLRPAITEIVDSLLDEVVRRGPGVDLVATFTLPVPSVVICRLLGVPYADHEFFEHQSRRIAAGTSTAAESADAFGQLKRYLLGLIETKGRGGEDMLDVLVDEQVATGTVTTPDLVDLALLLLVAGHETTASTLALGVALLLEQDGGAVAADPTRVGAVVEEILRHTAVADGVARFATRDTEVAGVRIAAGDAVVVALSAANRDPGPFPDPDRFDPRRGGRQHVTFGHGPHQCIGANLARAELEIALSRLFTRLPTLALAVPVEELGGKEAGGVQGVQRLPVTW 414 T 1.3E-33 p450 unppercent F Bacteria T 7qao 1 A A PGAM5_HUMAN BCL-XL-BINDING PROTEIN V68,PHOSPHOGLYCERATE MUTASE FAMILY MEMBER 5 AFRQALQLAASGLAGGSAAVLFSAVAVGKPRAGGD 35 T 300 TMEM210 pdbhh F Eukaryota T 7qap 1 A A PGAM5_HUMAN BCL-XL-BINDING PROTEIN V68,PHOSPHOGLYCERATE MUTASE FAMILY MEMBER 5 AFRQALQLAACGLAGLSAAVLFSAVAVGKPRAGGD 35 T 290 TMEM210 pdbhh F Eukaryota T 7qaz 1 A,B,C A,B,C H2J4R1_MARPK TPR_REGION domain-containing protein GPSQNAIKRFMTLFSGREDVFSIQYEGGYRPIRRPLNFQDIKNHFSGKKTLGIYLLKKNDTVKFAAYDIDIKKHYLNREDKFVYEENSKKVAKRLSRELNLENITHYFEFTGNRGYHIWIFFDIPVSAYKIKYIMEKILDRIELEEGIDVEIFPKQTSLNGGLGNLIKVPLGVHKKTGKKCLFVDNDFNVIENQIEFLNNIKENKATEINKLFREIFNE 219 T 0.019 DUF1882 pdbhh F Bacteria T 7qb2 2 B D Pimtide ARKRRRHPSGPPTA 14 T 1.5 DUF3019 pdbhh F T 7qby 1 A A DNJB6_HUMAN HHDJ1,HEAT SHOCK PROTEIN J2,HSJ-2,MRJ,MSJ-1 MGNFKSISASTKMVNGRKITTKRIVENGQERVEVEEDGQLKSLTINGKEQLLRLDNK 57 T 5.2 AGA2 pdbpercent F Eukaryota T 7qca 24 X LM0 S7XVN9_SPRLO eL14 LM0 KYFSYPLMYFIKPGCLIKKKFSIYTSIVISVIDNNSVVIQSYDKSDNGDVISIDREVINVSKIVPIGNIDIKNKSKKEIDGILVEENRNLKNKDDVLLMNDFERFKEQLKKEVEDMVIEEMA 122 T 0.00025 Ribosomal_L14e pdbhh F Eukaryota T 7qdi 1 A,B,C,D,E,F,G,H A,B,C,D,E,F,G,H D-310HD XGXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGX 33 F F F 7qdj 1 A A PK-10+PK-11 XGELXXLKELXXLKXXXWKGX 21 T 13 Beta_protein pdbhh F T 7qdk 1 A,B,C A,B,C CC-TypeN-LaLd XGELAALKQELAALKWELAALKEELAALKXGX 32 T 0.0005 DUF5320 pdbhh F T 7qdw 1 A A Q8I2Y4_PLAF7 Zinc finger protein, putative GPHMDYDMLTEEQKKKLKEDHTLKILLKNNYVREVFKQFTLSNDKIGYLSHYINDPTIVQVIDHIMKTIDDT 72 T 0.00019 STI1 pdbhh F Eukaryota T 7qdw 2 B B Q8IK99_PLAF7 NUFIP1 domain-containing protein DIYTYEKKLIKSIEYITKNKFFDDS 25 T 1 Sec34 pdbhh F Eukaryota T 7qec 1 A A E4SK47_LACAR S-layer SVSFYEIANGNEVHTGSLNMTANPTSHELNVSAVLAAAKAKYAAHQLENGASNGASVAVTTDVKDLTDQLTKAGIKVDPLGNFQAQASFSFNLAAKSAQNAATATLPITVSVAN 114 T 0.039 DUF6074 pdb F Bacteria T 7qep 38 LA M4 I7L8J2_ENCCU ECU06_1215 protein MRTVRLGRIVTPALKERRHTYAIIVGIIDVTFVLLQRKDGEREICSVANLHLEDEAFDIKGLSAEEIGKLIPEDTYIEDTTNDFDRFKLKLRKRVEEELLKEKGLA 106 T 0.0021 Ribosomal_L14e pdbpercent F Eukaryota T 7qep 45 SA MS I7IV41_ENCCU ECU06_1135 protein MSKTYLKSWKEKKEKMPNAALSFKQRLRIKQQKRVERSALLSKIKILKTRKRNFLRERQKQREMKKQENMAKS 73 T 13 Cgr1 pdbhh F Eukaryota T 7qep 68 PB S0 RSSA_ENCCU 40S ribosomal protein S0 MPQDNTRISDSIKIPDEFVKLLIVSQSHLGGTSTNKSFARYLYGTRPRDRINIIDINATWEKLIIAARAFCGIKHPSSIAVVSTKTFGRKPVVKFCEAVGATPITGRFIPGSFTNSEVKRVYDPRVLIVSDTYADKQAILESQYCNLPTIAFVNTDNSLVGVDIAIPMNNRSPSAIAAGFFILSRLINYMKTGAELVRDMKEVELFLFRDSVELEQLVEEQLLETTDSILNVGKEGILSGIGTGNADEWNSF 252 T 2.5E-12 Ribosomal_S2 pdbpercent F Eukaryota T 7qf9 2 C EEE HRas peptide SGPGCMSCKC 10 T 0.85 DUF4536 pdbhh F T 7qfb 2 B B PPR3C_HUMAN PROTEIN PHOSPHATASE 1 REGULATORY SUBUNIT 5,PP1 SUBUNIT R5,PROTEIN TARGETING TO GLYCOGEN,PTG AKKRVVFADSKGLSLTAIHVFSDLPEE 27 T 0.079 PBCV_basic_adap pdb F Eukaryota T 7qff 2 C,E PA,PB ACE-VAL-ALA-CYS-LYS XVACKSSQPX 10 T 13 KI67R pdbhh F T 7qff 3 D PAB GLU-TYR-SER EYS 3 T 240 Toxin_36 pdbhh F F 7qfh 2 C,D PA,PB LYS-VAL-LEU-AMI XAYFKKVL 8 T 4.7 DUF5339 pdbhh F T 7qfi 1 A,B A,B Q5FLN0_LACAC SlpX MGDTAVNVGSAAGTGANTTNTTTQAPQNKPYFTYNNEIIGEATQSNPLGNVVRTTISFKSDDKVSDLISTISKAVQFHKNNSASGENVTINENDFINQLKANGVTVKTVQPSNKNEKAYEAIDKVPSTSFNITLSATGDNNQTATIQIPMVPQGLEHHHHHH 162 T 0.0028 T2SSC pdbpercent F Bacteria T 7qfj 1 A,B,C,D,E,F A,B,C,D,E,F Q5FLN0_LACAC SlpX MGSTPTDTTQNPQINWTKGGQAQSSSLNGQVFQVAVGSNFNPLNFTNSNGENIIVSAQQSKNNTTFASIEATSNPVNTSEAGRYYNVTLTATGNTGKKTTATYTVLITSSQKQTLYGNGESTISTYSIYGNNVLSNSTTFKDGDQVYVSDQTKTVGGVSYSQVSPKSKNDANSSNIWVKTSLEHHHHHH 189 T 0.00023 DUF5011 pdbpssm F Bacteria T 7qfk 1 A,B,C,D A,B,C,D Q5FLN0_LACAC SlpX MGSTPTDTTQNPQINWTKGGQAQSSSLNGQVFQVAVGSNFNPLNFTNSNGENIIVSAQQSKNNTTFASIEATSNPVNTSEAGRYYNVTLTATGNTGKKTTATYTVLITSSQKQTLYGNGESTISTYSIYGNNVLCNSTTFKDGDQVYVSDQTKTVGGVSYSQVSPKSKNDANSSNIWVKTSLEHHHHHH 189 T 0.00023 DUF5011 pdbpssm F Bacteria T 7qfl 1 A A SLAP_LACAC SURFACE LAYER PROTEIN,SA-PROTEIN MGHHHHHHHHHHSSGHIEGRMGNVNFYDVTSGATVTNGAVSVNADNQGQVNVANVVAAINSKYFAAQYADKKLNTRTANTEDAIKAALKDQKIDVNSVGYFKAPHTFTVNVKATSNTNGKSATLPVVVTVPN 132 T 0.067 TrmE_N pdb F Bacteria T 7qfm 2 B D Pimtide ARKRRRHPSGPPTA 14 T 1.5 DUF3019 pdbhh F T 7qg8 2 B s VemP nascent chain MHHHHHHHHHHGDYKDDDDKENLYFQGSAQIDQKAHVPHFSKLQPFVAVSVSPNSSVDFSEASEESSQSPVSEGHASLDSVALFNSQRWTSYLREGLDDEHVDFVGDLTTPFYADAGYAYSLMDINWRHNQSTFYHFTSDHRISGWKETNAMYVALNSQFSALEVLFQGPYPYDVPDYA 179 T 6.3 DUF4022 unphh F T 7qg9 1 A,D,E,G,K,L,O,P,S,T,X,Y R,Q,S,N,M,O,J,L,U,K,P,T FIBL2_BPT5 L-shaped tail fiber protein p132 MSTENRVIDLVVDENVPYGLLMQFMDVDDSVYPSTSKPVDLTDFSLRGSIKSSLEDGAETVASFTTAIVDAAQGVASISLPVSAVTTIASKASKERDRYNPRQRLAGYYDVIITRTAVGSAASSFRIMEGKVYISDGVTQ 140 T 3.3E-05 BppU_N pdbhh T Viruses T 7qg9 2 B,H,Q I,H,G TAIL1_BPT5 TAIL PROTEIN P140 MFYSLMRESKIVIEYDGRGYHFDALSNYDASTSFQEFKTLRRTIHNRTNYADSIINAQDPSSISLAINFSTTLIESNFFDWMGFTREGNSLFLPRNTPNIEPIMFNMYIINHNNSCIYFENCYVSTVDFSLDKSIPILNVGIESGKFSEVSTFRDGYTITQGEVLPYSAPAVYTNSSPLPALISASMSFQQQCSWREDRNIFDINKIYTNKRAYVNEMNASATLAFYYVKRLVGDKFLNLDPETRTPLIIKNKYVSITFPLARISKRLNFSDLYQVEYDVIPTADSDPVEINFFGERK 298 T 0.023 DUF4965 pdbpssm T Viruses T 7qgg 84 FC y ALA-ALA-LYS-ALA AAKA 4 T 600 SPAR_C pdbhh F F 7qgn 3 C s VemP nascent chain MHHHHHHHHHHGDYKDDDDKENLYFQGSAQIDQKAHVPHFSKLQPFVAVSVSPNSSVDFSEASEESSQSPVSEGHASLDSVALFNSQRWTSYLREGLDDEHVDFVGDLTTPFYADAGYAYSLMDINWRHNQSTFYHFTSDHRISGWKETNAMYVALNSQFSALEVLFQGPYPYDVPDYA 179 T 6.3 DUF4022 unphh F T 7qgu 53 AB 2 A0A0C3GRP6_BACIU YqzJ MTMFVESINDVLFLVDFFTIILPALTAIGIAFLLRECRAGEQWKSKRTDEHQTVFHINRTDFLIIIYHRITTWIRKVFRMNSPVNDEEDAGSLLL 95 T 0.052 PqiA pdb F Bacteria T 7qgv 1 A,B,C,D A,B,C,D Teixobactin XISXXISXAXI 11 T 13 RII_binding_1 pdbhh F F 7qgv 2 E,F,G,H E,F,G,H Lipid II AXKXX 5 F F F 7qh3 1 A,B,C,D A,B,C,D I2N5H0_STRT9 RsfG MNDTTAAAPGTAADPGPDAAVRALDRLIGTWRVSGGAEGTVSYRGLEGGHFLLQDIALEQFGQPVTGVEVIGRLKEFGAEEPGEDIRSRYYDSRGNTFDYVYELDGDTLTIWGGEKGSPAYYRATFSADGNTLSGAWVYPGGGGYDSVMTRVAV 154 T 0.0017 DUF1579 pdbpssm F Bacteria T 7qh7 5 E I RM10_HUMAN L10MT,MRP-L10,39S RIBOSOMAL PROTEIN L8,MITOCHONDRIAL,L8MT,MRP-L8,MITOCHONDRIAL LARGE RIBOSOMAL SUBUNIT PROTEIN UL10M RRVMHFQRQKLMAVTEYIPPKPAIHPSCLP 30 T 0.19 UL42 unppssm F Eukaryota T 7qh7 17 Q V RM24_HUMAN L24MT,MRP-L24,MITOCHONDRIAL LARGE RIBOSOMAL SUBUNIT PROTEIN UL24M TWIDGPKDTSVEDALERTYVPCLKTLQEEVMEAMGIKETRKYKKVYWY 48 T 0.1 DUF3848 pdbpssm F Eukaryota T 7qh7 34 HA f RM48_HUMAN 39S ribosomal protein L48, mitochondrial YKTKPTHGIGKYKHLIK 17 T 0.14 Lysozyme_like unp F Eukaryota T 7qhj 2 C,D PA,PB SER-ALA-ALA-AMI XGAKSAAX 8 T 510 DUF4334 pdbhh F F 7qhk 2 C,D PA,PB GLN-LEU-ARG-GLN QLRQQE 6 T 49 TSC21 pdbhh F F 7qhm 10 J,W J,W Q8NTD4_CORGL Hypothetical membrane protein MNTMSSAKKKPAPERMHYIKGYVPVAYSSPHSSLERSATWLGMGFLLTALAGVGAVLFAVGANSVGQQQEHWVLYSIIGVVFAVVCTVLGTVLIIKGRAPYNRYVKETGRTQ 112 T 0.00033 Phage_holin_3_6 pdbpssm F Bacteria T 7qhm 11 K,X K,X Q8NS61_CORGL Actinobacterial supercomplex, subunit C (AscC) MFPEFERMYDMANVEKKHFVDPAWPEHNPADGHVVTELISKVAGASSPWGDDKEFPVSAEETGYVHPYTRINR 73 T 10 PHYHIP_C pdbhh F Bacteria T 7qhm 12 L,Y L,Y Q8NSJ8_CORGL Hypothetical membrane protein MYMGKSFALLVLGAIILAGGVWYTIEVGYSVMAIVAALIMAAGGGIITWGLAVAADVNSPTSHKI 65 T 0.00051 DsbD_2 pdbpssm F Bacteria T 7qho 10 J,W J,W Q8NTD4_CORGL Hypothetical membrane protein MNTMSSAKKKPAPERMHYIKGYVPVAYSSPHSSLERSATWLGMGFLLTALAGVGAVLFAVGANSVGQQQEHWVLYSIIGVVFAVVCTVLGTVLIIKGRAPYNRYVKETGRTQ 112 T 0.00033 Phage_holin_3_6 pdbpssm F Bacteria T 7qho 11 K,X K,X Q8NS61_CORGL Actinobacterial supercomplex, subunit C (AscC) MFPEFERMYDMANVEKKHFVDPAWPEHNPADGHVVTELISKVAGASSPWGDDKEFPVSAEETGYVHPYTRINR 73 T 10 PHYHIP_C pdbhh F Bacteria T 7qho 12 L,Y L,Y Q8NSJ8_CORGL Hypothetical membrane protein MYMGKSFALLVLGAIILAGGVWYTIEVGYSVMAIVAALIMAAGGGIITWGLAVAADVNSPTSHKI 65 T 0.00051 DsbD_2 pdbpssm F Bacteria T 7qi5 80 BC l RM54_HUMAN L54MT,MRP-L54,MITOCHONDRIAL LARGE RIBOSOMAL SUBUNIT PROTEIN ML54 MATKRLFGATRTWAGWGAWELLNPATSGRLLARDYAKKPVMKGAKSGKGAVTSEALKDPDVCTDPVQLTTYAMGVNIYKEGQDVPLKPDAEYPEWLFEMNLGPPKTLEELDPESREYWRRLRKQNIWRHNRLSKNKRL 138 F F Eukaryota T 7qi6 78 ZB l RM54_HUMAN L54MT,MRP-L54,MITOCHONDRIAL LARGE RIBOSOMAL SUBUNIT PROTEIN ML54 MATKRLFGATRTWAGWGAWELLNPATSGRLLARDYAKKPVMKGAKSGKGAVTSEALKDPDVCTDPVQLTTYAMGVNIYKEGQDVPLKPDAEYPEWLFEMNLGPPKTLEELDPESREYWRRLRKQNIWRHNRLSKNKRL 138 F F Eukaryota T 7qik 2 C,D E,F NCAP_SARS2 SER-SER-ARG-ASN-SEP-THR-PRO-GLY SSRNSTPG 8 T 20 FTCD_C pdbhh T Viruses T 7qil 1 A A DnaE intein SGGALSYDTEILTTEYGLLPIGDIVESETECTVYSVDSDGSTYTQGVAEWHDRGEQEVFEYCLEDGSTIRATKDHKFMTTDGEMLPIDEIFESELDLMRVDSSGDTKIATREYTGSEDVYDIGVESDHNFALSDGFIASN 140 T 1.1E-06 Intein_splicing pdbpercent F T 7qim 2 F F nebulin (mouse) XXXXXXXXXXXXXXXXXXXXXYXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXYXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXYXXXXXXXXXXXXX 105 T 7800 WW pdbhh F F 7qim 3 G G nebulin (mouse) XXXXXXXXXXXXXXXXXXXXXYXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXYXXXXXXXXXXXXX 70 T 6500 WW pdbhh F F 7qin 2 F F nebulin (mouse) XXXXXXXXXXXXXXXXXXXXXYXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXYXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXYXXXXXXXXXXXXX 105 T 7800 WW pdbhh F F 7qin 3 G G nebulin (mouse) XXXXXXXXXXXXXXXXXXXXXYXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXYXXXXXXXXXXXXX 70 T 6500 WW pdbhh F F 7qin 4 H H tropomyosin, alpha-1 (mouse) XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 117 F F F 7qin 5 I I tropomyosin, alpha-1 (mouse) XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 118 F F F 7qin 6 J,K J,K tropomyosin, alpha-1 (mouse) XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 88 F F F 7qip 2 C,D C,D NCAP_SARS2 ARG-GLY-TPO-SER-PRO-ALA-ARG-MET SSRGTSPARM 10 T 6.8 RNA_pol_Rpa2_4 pdbhh T Viruses T 7qiq 1 A,E A,E CTRA_BOVIN Chymotrypsin A chain A CGVPAIQPVLSGL 13 T 2 CaM_bind pdbhh F Eukaryota T 7qir 1 A,E A,E CTRA_BOVIN Chymotrypsin A chain A CGVPAIQPVLSGL 13 T 2 CaM_bind pdbhh F Eukaryota T 7qis 1 A,E A,E CTRA_BOVIN Chymotrypsin A chain A CGVPAIQPVLSGL 13 T 2 CaM_bind pdbhh F Eukaryota T 7qit 1 A,E A,E CTRA_BOVIN Chymotrypsin A chain A CGVPAIQPVLSGL 13 T 2 CaM_bind pdbhh F Eukaryota T 7qix 3 C E A0A3Q7H1U4_SOLLC 40S ribosomal protein SA MATQDVRTLSTKEADIQMMLAAEVHLGTKNCDFQMERYAFKRRNDGIYIINLGKTWEKLQMAARVIVAIENPQDIIVQSARPYGQRAVLKFAQYTGAHAIAGRHTPGTFTNQLQTSYSEPRLLILTDPRTDHQPIKEAALGNIPTIAFCDTDSPMRYVDIGIPANNKGKHSIGVLFWILARMVLQMRGAINQGPKWDVMVDLFFYREPEEAKEQEEEVPAIADYADYSASAALGGDWTSSQIPEAQWTADAAAPAVGGGWAGDGAADGGWDAAAAPAPVPLPVPDVAPTSGATGWE 296 T 1.9E-12 Ribosomal_S2 pdb F Eukaryota T 7qiz 62 JB u A0A3Q7H1U4_SOLLC 40S ribosomal protein SA MATQDVRTLSTKEADIQMMLAAEVHLGTKNCDFQMERYAFKRRNDGIYIINLGKTWEKLQMAARVIVAIENPQDIIVQSARPYGQRAVLKFAQYTGAHAIAGRHTPGTFTNQLQTSYSEPRLLILTDPRTDHQPIKEAALGNIPTIAFCDTDSPMRYVDIGIPANNKGKHSIGVLFWILARMVLQMRGAINQGPKWDVMVDLFFYREPEEAKEQEEEVPAIADYADYSASAALGGDWTSSQIPEAQWTADAAAPAVGGGWAGDGAADGGWDAAAAPAPVPLPVPDVAPTSGATGWE 296 T 1.9E-12 Ribosomal_S2 pdb F Eukaryota T 7qjf 1 A A LLP_BPT5 Lytic conversion lipoprotein GSTFGPKDIKCEAYYMQDHVKYKANVFDRKGDMFLVSPIMAYGSFWAPVSYFTEGNTCEGVF 62 T 0.0032 Mfp-3 unphh T Viruses T 7qjh 24 TC,X KM0,LM0 S7XVN9_SPRLO Transposase MYFIKPGCLIKKKFSIYTSIVISVIDNNSVVIQSYDKSDNGDVISIDREVINVSKIVPIGNIDIKNKSKKEIDGILVEENRNLKNKDDVLLMNDFERFKEQLKKEVEDMVIEEMA 115 T 0.00024 Ribosomal_L14e pdbhh F Eukaryota T 7qke 1 A A CP125_MYCTU CHOLEST-4-EN-3-ONE 26-MONOOXYGENASE,CHOLEST-4-EN-3-ONE C26-MONOOXYGENASE [(25S)-3-OXOCHOLEST-4-EN-26-OATE FORMING],CHOLESTEROL C26-MONOOXYGENASE,CHOLESTEROL C26-MONOOXYGENASE [(25S)-3BETA-HYDROXYCHOLEST-5-EN-26-OATE FORMING],CYTOCHROME P450 125,STEROID C27-MONOOXYGENASE NGPSPNLPPGFDFTDPAIYAERLPVAEFAELRSAAPIWWNGQDPGKGGGFHDGGFWAITKLNDVKEISRHSDVFSSYENGVIPRFKNDIAREDIEVQRFVMLNMDAPHHTRLRKIISRGFTPRAVGRLHDELQERAQAIAAEAAAAGSGDFVEQVSCELPLQAIAGLLGVPQEDRGKLFHWSNEMTGNEDPEYAHIDPKASSAELIGYAMKMAEEKAKNPADDIVTQLIQADIDGEKLSDDEFGFFVVMLAVAGNETTRNSITQGMMAFAEHPDQWELYKKVRPETAADEIVRWATPVTAFQRTALRDYELSGVQIAAGQRVVMFYRSANFDEEVFQDPFTFNILRNPNPHVGFGGTGAHYCIGANLARMTINLIFNAVADHMPDLKPISAPERLRSGWLNGIKHWQVDYTGRCPVAH 418 T 3.4000000000000003E-22 p450 unppssm F Bacteria T 7qla 2 B C G0SD94_CHATD Ccz1 MTTPVSPSPSGIIPAQLGFLAIYNPALGTTDETLEDQIVYYATASTLSQARRRHRRPRRRDRQRAQSVVKDSRPNAAGATGDSEAVAEDKDPVSKEERHERLRQIGLAQGMVEFAKSFSDGEPVDTIDTEKARVILVEVEEGWWILASIDLTRLPLPQIKTPTSSSAPPPAPNLNPLPPEPAYEYSSREVKPPSLLRADLLRAYDLFLLHHGSSLSSLLASQGRAQLVASLTRFWDHFLATWNVLLHGNPACDVFGGIKLAASGELGIGVGEEERGSGEREVLEGLVERVEGLVDVVVGRYGGPPSEKGPEEEQWLGLGGEVGEEDGAVFLGVGALDRKSLRGVVQWMEEVYVWGENAFGKPRRDLSTGHFLLGLSECSEEELTSSQANPKAIFVELKPSYQHPSRKIPPEDPQPLGKVGPELPRDHTARLRPVIYVSQPFIYILLFSEITPSPSTWPTLAESLHAQLSPLQKPLLHSTSYRPERPVVETTSSSGTTTQHQIFDLVYDTETLTLQSTIPNIPDPFPYSATTPTGHSTGQQHHQQSIWTRVEALQTHAQILAILSSGRAIPTDPSSFTHLPWEEGERTCKTARGWWIVWTRVVEHSPPDAVSLHHARDDDDNDDDASCSVLGHLRSVSSSHAAGSTSSSSGSGFGLGAIPGLGGLGGWAADGATRLAQGIGIDTRRYVEGLLTSLGR 696 T 0.045 Intu_longin_1 pdbpssm F Eukaryota T 7qld 1 A,B B,A SLAP_LACAC SURFACE LAYER PROTEIN,SA-PROTEIN RHMATTINASSSAINTNTNAKYDVDVTPSVSAVAANTANNTPAIAGNLTGTISASYNGKTYTANLKADTENATITAAGSTTAVKPAELAAGVAYTVTVNDVSFNFGSENAGKTVTLGCANSNVKFTGTNSDNQTETNVSTLKVKLDQNGVASLTNVSIANVYAINTTDNS 170 T 0.0049 Cadherin-like pdb F Bacteria T 7qle 1 A,B A,B SLAP_LACAC SURFACE LAYER PROTEIN,SA-PROTEIN MGHHHHHHHHHHSSGHIEGRHMATTINASSSAINTNTNAKYDVDVTPSVSAVAANTANNTPAIAGNLTGTISASYNGKTYTANLKADTENATITAAGSTTAVKPAELAAGVAYTVTVNDVSFNFGSENAGKTVTLGSANSNVKFTGTNSDNQTETNVSTLKVKLDQNGVASLTNVSIANVYAINTTDNS 189 T 0.0057 Cadherin-like pdb F Bacteria T 7qlh 1 A,B A,B E4SK47_LACAR S-layer MGKGDVNVTSNVQAITSPQTTTIDNQTGAVTYSNWDGKVNGTVTATYNGQSYTATLNETAGKENSRVTPWYTQDGGKTWNVLKKDGGVYRLEPAGKYQLSVNNVSFNFGTANANKKNITLTSSNGVQFRENGQWKDSIKVSTDQNGAVSQPLTLLIPITPVDVTNAKSHHHHHH 174 T 0.052 BNR pdb F Bacteria T 7qlr 1 A,B,C,D A,B,C,D A0A1J1J928_9CAUD CDHS1_22 Putative tail fiber protein MSWAETYKVNSDLQGEPLNFLSYLQDIKLNGLDSYVLFIGNARIWEELYLNSLYLFSDRGIRETVYTAFSETDIDNLFNKSTKLGEQLNAFYRTDIFSLGNADNVVKEMTIEHYNSLEEKFKAGYDRYVTREQEKSTIGAWFNSTFSLDNTDLENLTTIEEILANVEATNAILNNSNAIVALTMCKSSMDAVVASSNAMDLLGQYILRVTTESPVIRAILKNNVIRDAIINSDEAMTQISSNENSVMEIFNDLEATKVLVQNQNSINKILTNNVTVEKIIPNLLEMKYNLQTSLNYINTIKSNIASGKGQIMAITYNEEIFPILKNAVKNYDGMETTRNISQRDIEEKIKISDAILESSIAMATFANNSIIVNKVGDRVGIIESIFSKTVSLNAFMKSTTAINILVNKTTAFTKIANNSTAFNAMLTISENNVTIANNTTAMGIIANNAQAMSTVANNDTSISVFVNNTTAMGIIANSSTAMTKITLTGLALNRMVKSNTAKSILISKNSTLQTYKNNIQNTIQGSTAYFRTITGFADADDNPPQTINSTYVGITYCYGYKGNSYYGIVYHGYNTSIEAGRGNGYKDETKKFITLGGARYDQSGDGYFTYAMYQAI 616 T 0.00067 HC2 pdbpssm T Viruses T 7qnn 1 A A CP125_MYCTU CHOLEST-4-EN-3-ONE 26-MONOOXYGENASE,CHOLEST-4-EN-3-ONE C26-MONOOXYGENASE [(25S)-3-OXOCHOLEST-4-EN-26-OATE FORMING],CHOLESTEROL C26-MONOOXYGENASE,CHOLESTEROL C26-MONOOXYGENASE [(25S)-3BETA-HYDROXYCHOLEST-5-EN-26-OATE FORMING],CYTOCHROME P450 125,STEROID C27-MONOOXYGENASE NGPSPNLPPGFDFTDPAIYAERLPVAEFAELRSAAPIWWNGQDPGKGGGFHDGGFWAITKLNDVKEISRHSDVFSSYENGVIPRFKNDIAREDIEVQRFVMLNMDAPHHTRLRKIISRGFTPRAVGRLHDELQERAQAIAAEAAAAGSGDFVEQVSCELPLQAIAGLLGVPQEDRGKLFHWSNEMTGNEDPEYAHIDPKASSAELIGYAMKMAEEKAKNPADDIVTQLIQADIDGEKLSDDEFGFFVVMLAVAGNETTRNSITQGMMAFAEHPDQWELYKKVRPETAADEIVRWATPVTAFQRTALRDYELSGVQIAAGQRVVMFYRSANFDEEVFQDPFTFNILRNPNPHVGFGGTGAHYCIGANLARMTINLIFNAVADHMPDLKPISAPERLRSGWLNGIKHWQVDYTGRCPVAH 418 T 3.4000000000000003E-22 p450 unppssm F Bacteria T 7qnq 1 A,B,C,D,E,F,G,H A,B,C,D,E,F,G,H E9RJ22_BACNA Auxiliary relaxosome protein GPMAKVKKHLTFSGPTESPYGIAYIEKEMKAKNCSKMNETIELIFAEHDEMKARLSEQDALVEKIFQRFKKTLDVIRVRAGHTDKNAQINLELWNAFLMANPLPVTVLTDQHTSESVSMAKEKVSNDIATFKQRKDEQKAKQEMQKGEK 149 T 0.0015 DUF1433 unppercent F Bacteria T 7qns 2 C,D,E PC,PB,PA VAL-TYR-GLU-LYS-LYS-PRO VYEKKP 6 T 1.2 UL41A pdbhh F F 7qo2 2 C,D PAA,PAB GAKSAA Peptide GAKSAA 6 T 320 DUF4334 pdbhh F F 7qo2 3 E PB EYS Peptide EYS 3 T 240 Toxin_36 pdbhh F F 7qof 1 A,B,C,D,E,F,G,H,I A,B,C,D,E,F,G,H,I A0A385DVU6_9CAUD Major capsid protein gp32 MAGKLGKFQMLGFQHWKGLTSDNHLGAIFQQAPQKATNLMVQLLAFYRGKSLDTFLNSFPTREFEDDNEYYWDVIGSSRRNIPLVEARDENGVVVAANAANVGVGTSPFYLVFPEDWFADGEVIVGNLNQVYPFRILGDARMEGTNAVYKVELMGGNTQGVPAERLQQGERFSIEFAPVEKELSRKVGDVRFTSPVSMRNEWTTIRIQHKVAGNKLNKKLAMGIPMVRNLESGKQVKDTANMWMHYVDWEVELQFDEYKNNAMAWGTSNRNLNGEYMNFGKSGNAIKTGAGIFEQTEVANTMYYNTFSLKLLEDALYELSASKLAMDDRLFVIKTGERGAIQFHKEVLKTVSGWTTFVLDNNSTRVVEKVQSRLHSNALSAGFQFVEYKAPNGVRVRLDVDPFYDDPVRNKILHPMGGVAFSYRYDIWYIGTMDQPNIFKCKIKGDNEYRGYQWGIRNPFTGQKGNPYMSFDEDSAVIHRMATLGVCVLDPTRTMSLIPAILQG 504 T 0.75 DUF5309 pdbhh T Viruses T 7qof 2 J,K,L,M,N,O,P,Q,R a,b,c,d,e,f,g,h,i A0A385DVS7_9CAUD Auxiliary capsid protein gp36 MVISINQVRQLYVAKALKANTAALTTAGDIVPKADTAKTTLYFQSMSPAGIVASDKINLKHVLYAKATPSEALAHKLVRYSVTLDADVSATPVAGQNYILRLAFRQYIGLSEEDQYFKYGEVIARSGMTASDFYKKMAISLAKNLENKTESTPLVNIYLISAAAASTDVPVTSATKESDLTATDYNQIIIEETEQPWVLGMMPQAFIPFTPQFLTITVDGEDRLWGVATVVTPTKTVPDGHLIADLEYFCMGARGDIYRGMGYPNIIKTTYLVDPGAVYDVLDIHYFYTGSNESVQKSEKTITLVAVDDGSHTAMNALIGAINTASGLTIATL 333 T 3.4 FTP pdbpercent T Viruses T 7qof 3 S,T j,k A0A385DVL5_9CAUD Head fiber trimer protein gp21 MKRVLNLGNLSRIVEGDPNEITDDEILVIKDKIIEGKIIDIQKRVDGKLVSLITEKYTYTINPTPADAIVVINGSTTKSIRAAKGHTVTWSVSKTGFVTQSGSDVISGDVSKDVTLVANPAS 122 T 0.0062 PEGA unppssm T Viruses T 7qog 1 A A A0A385DT68_9CAUD Portal protein gp20 MADFLNFPRQMLPFSKKTKQWRKDCLLWANQKTFFNYSLVRKSVIHKKINYDLLNGRLHMSDLELVLNPDGIKAAYIPDRLQHYPIMNSKLNVLRGEESKRVFDFKVVVTNPNAISEIEDNKKNELLQRLQEMITDTSISEDEYNIKLEKLNDYYTYEWQDIREVRANELLNHYIKEYDIPLIFNNGFMDAMTCGEEIYQCDIVGGEPVIERVNPLKIRIFKSGYSNKVEDADMIILEDYWSPGRVIDTYYDVLSPKDIKYIETMPDYIGQGAVDQMDNIDERYGFVNQNMIGDEITVRDGTYFFDPANLFTEGIANSLLPYDLAGNLRVLRLYWKSKRKILKVKSYDPETGEEEWNFYPENYVVNKEAGEEVQSFWVNEAWEGTMIGNEIFVNMRPRLIQYNRLNNPSRCHFGIVGSIYNLNDSRPFSLVDMMKPYNYLYDAIHDRLNKAIASNWGSILELDLSKVPKGWDVGKWMYYARVNHIAVIDSFKEGTIGASTGKLAGALNNAGKGMIETNIGNYIQQQINLLEFIKMEMADVAGISKQREGQISQRETVGGVERATLQSSHITEWLFTIHDDVKKRALECFLETAKVALKGRNKKFQYILSDTSTRVMEIDGDEFAEADYGLVVDNSNGTQELQQKLDTLAQAALQTQTLSFSTITKLYTSSSLAEKQRLIEKDEKQIRERQAQAQKEQLEAQQQIAAMQQQQKEAELLQKEEANIRDNQTKIIIAQIQSEGGPDEEDGIMIDDYSPEAKANLAEKIREFDEKLKLDKDKLKLDKKKAETDASIKRQALRKKSSTTNK 806 T 0.21 RbsD_FucU pdbpssm T Viruses T 7qog 2 B B A0A385DT91_9CAUD Ring protein 1 gp43 MVNNINWVKLPVILDRLLRHPLLTDLNLETAIQYTLDFISAMGLPNVYVDKIETIDIKEYRGELPCDLISINQVRLHKNGIALRAMTDNFNAYPTHDHKEGDWYERGEPSFKTQGRVIFTSIKHEKVDISYKAIMLDDEGLPLIPDNPIFLKTLELYIKKEWFTILFDMGKISPAVLNNTQQEYAFKAGQCNNEFVIPSVSEMEAITNMWNQLIPRVTEFRRGFKNLGDKEYIRVH 236 T 3.5 PriX pdbhh T Viruses T 7qog 3 C C A0A385DT87_9CAUD Ring protein 2 gp40 MTYNELIYMVLDELKLSSDDSYYTPDHVIFLLVKYRSFLLKQRYSDIKKQIPDSDYQSICLDLIEVPAISGEPCEGSSYLRSKNKVPTTMMIGNPRVYPMDFYQGEITYISRDRMRYVGYNKFLRNIIYCSKAPDGYLYFKSWNPQFLHLEKVSFNAIFEDAKEASEMACPEENGTICKLEDKEFPIEDALVPPLIELVVKELRGPEYSPKDEDNNAKDDLPDAR 225 T 0.18 DUF547 pdbpssm T Viruses T 7qog 4 D,E M,N A0A385DV85_9CAUD Cargo protein 1 gp45 MAKKKIKRRGKMPPNIFDTGGQSWGQQSSGQFSNAFKGENLGNSIGSIGGAVGGIAQAGISNAQIADTSGIEAQNKAQKNMVVGASSNDDLMSEWGSWNKVKDDYSWKDVRGGSTGQRVTNTIGAAGQGAAAGASVGGPIGAIVGGVVGLGSAIGGWLGGNRKAKRKAKKLNKEAKEANERALTSFETRADNIDTQNDFNMLANFSAYGGPLEFGSGAIGYEFDNRYLNNQEMSAVAKQRLTSLPNSFQALPEMNTYNAFAEGGGLSREKNYGSKKKPYPSVPSGDFAGPHRSYPIPTKADARDALRLAGLHGNESVRRKVLAKYPSLKAFGGSLFDSVVGNNFNQSFTQGIQGMFQQEPEQTVQAANIAKDGGDIKIKEKNKGKFTAYCGGKVTEACIRKGKNSSNPTTRKRATFAQNARNWNAFGGWLNTQGGDFTNGVTFINEGGSHEENPYQGIQIGVDPEGAPNLVEQGEVVYDDYVFSDRMEIPDDIRKEYKLRGKTFAKAAKSAQRESEERPNDPLSTKGLQAAMERIATAQEEARQRKEAHREGNEYPSMFAYGGDTNPYGLALEDPMSVEELEALMVQSGETGEIAPEGNNGNRQTWTRYAPIIGSGLASLSDLFSKPDYDSADLISGVDLGAEAVGYAPIGNYLSYRPLDRDFYINKMNQQAAATRRGLMNTSGGNRLNAQAGILAADYNYGQNMGNLARQAEEYNQQLRERVEAFNRGTNMFNTETGLKASMFNAESRNAAKRARLGQATTVAQLRQGIKDQDAARRSANITNFLQGLGDMGWENEQANWLDTLAKSGVLKMNTKGEYTGGTKKAKGGKVRTKKKKGLTYG 842 T 0.0068 RTX pdb T Viruses T 7qoh 1 A,B,C,D,E,F A,B,C,D,E,F A0A385DVU6_9CAUD Major capsid protein gp32 MAGKLGKFQMLGFQHWKGLTSDNHLGAIFQQAPQKATNLMVQLLAFYRGKSLDTFLNSFPTREFEDDNEYYWDVIGSSRRNIPLVEARDENGVVVAANAANVGVGTSPFYLVFPEDWFADGEVIVGNLNQVYPFRILGDARMEGTNAVYKVELMGGNTQGVPAERLQQGERFSIEFAPVEKELSRKVGDVRFTSPVSMRNEWTTIRIQHKVAGNKLNKKLAMGIPMVRNLESGKQVKDTANMWMHYVDWEVELQFDEYKNNAMAWGTSNRNLNGEYMNFGKSGNAIKTGAGIFEQTEVANTMYYNTFSLKLLEDALYELSASKLAMDDRLFVIKTGERGAIQFHKEVLKTVSGWTTFVLDNNSTRVVEKVQSRLHSNALSAGFQFVEYKAPNGVRVRLDVDPFYDDPVRNKILHPMGGVAFSYRYDIWYIGTMDQPNIFKCKIKGDNEYRGYQWGIRNPFTGQKGNPYMSFDEDSAVIHRMATLGVCVLDPTRTMSLIPAILQG 504 T 0.75 DUF5309 pdbhh T Viruses T 7qoh 2 G,H,I,J,K a,b,d,e,f A0A385DVS7_9CAUD Auxiliary capsid protein gp36 MVISINQVRQLYVAKALKANTAALTTAGDIVPKADTAKTTLYFQSMSPAGIVASDKINLKHVLYAKATPSEALAHKLVRYSVTLDADVSATPVAGQNYILRLAFRQYIGLSEEDQYFKYGEVIARSGMTASDFYKKMAISLAKNLENKTESTPLVNIYLISAAAASTDVPVTSATKESDLTATDYNQIIIEETEQPWVLGMMPQAFIPFTPQFLTITVDGEDRLWGVATVVTPTKTVPDGHLIADLEYFCMGARGDIYRGMGYPNIIKTTYLVDPGAVYDVLDIHYFYTGSNESVQKSEKTITLVAVDDGSHTAMNALIGAINTASGLTIATL 333 T 3.4 FTP pdbpercent T Viruses T 7qoh 3 L,M g,h A0A385DTA3_9CAUD Portal vertex capsid protein gp57 MAGQQGIYCAPDNIVPNRDRVDVGCAPDGAMQLWVMEYEVTGIGKGCAMCKAINPQQAEMLLKSNGIYNGSSYLYKVTRIEQVIVPPCNGLMAEQVVTYKDVVS 104 T 0.092 DUF2931 pdb T Viruses T 7qoh 5 Q,R l,m A0A385DT68_9CAUD Portal protein gp20 MADFLNFPRQMLPFSKKTKQWRKDCLLWANQKTFFNYSLVRKSVIHKKINYDLLNGRLHMSDLELVLNPDGIKAAYIPDRLQHYPIMNSKLNVLRGEESKRVFDFKVVVTNPNAISEIEDNKKNELLQRLQEMITDTSISEDEYNIKLEKLNDYYTYEWQDIREVRANELLNHYIKEYDIPLIFNNGFMDAMTCGEEIYQCDIVGGEPVIERVNPLKIRIFKSGYSNKVEDADMIILEDYWSPGRVIDTYYDVLSPKDIKYIETMPDYIGQGAVDQMDNIDERYGFVNQNMIGDEITVRDGTYFFDPANLFTEGIANSLLPYDLAGNLRVLRLYWKSKRKILKVKSYDPETGEEEWNFYPENYVVNKEAGEEVQSFWVNEAWEGTMIGNEIFVNMRPRLIQYNRLNNPSRCHFGIVGSIYNLNDSRPFSLVDMMKPYNYLYDAIHDRLNKAIASNWGSILELDLSKVPKGWDVGKWMYYARVNHIAVIDSFKEGTIGASTGKLAGALNNAGKGMIETNIGNYIQQQINLLEFIKMEMADVAGISKQREGQISQRETVGGVERATLQSSHITEWLFTIHDDVKKRALECFLETAKVALKGRNKKFQYILSDTSTRVMEIDGDEFAEADYGLVVDNSNGTQELQQKLDTLAQAALQTQTLSFSTITKLYTSSSLAEKQRLIEKDEKQIRERQAQAQKEQLEAQQQIAAMQQQQKEAELLQKEEANIRDNQTKIIIAQIQSEGGPDEEDGIMIDDYSPEAKANLAEKIREFDEKLKLDKDKLKLDKKKAETDASIKRQALRKKSSTTNK 806 T 0.21 RbsD_FucU pdbpssm T Viruses T 7qoi 1 A,AB,B,BB,C,D,E,F,GA,HA,IA,JA,KA,LA,MB,NB,OB,PB,Q,QB,R,RB,S,T,U,V,WA,XA,YA,ZA AA,DE,AB,DF,AC,AD,AE,AF,CA,CB,CC,CD,CE,CF,EA,EB,EC,ED,BA,EE,BB,EF,BC,BD,BE,BF,DA,DB,DC,DD A0A385DVU6_9CAUD Major capsid protein gp32 MAGKLGKFQMLGFQHWKGLTSDNHLGAIFQQAPQKATNLMVQLLAFYRGKSLDTFLNSFPTREFEDDNEYYWDVIGSSRRNIPLVEARDENGVVVAANAANVGVGTSPFYLVFPEDWFADGEVIVGNLNQVYPFRILGDARMEGTNAVYKVELMGGNTQGVPAERLQQGERFSIEFAPVEKELSRKVGDVRFTSPVSMRNEWTTIRIQHKVAGNKLNKKLAMGIPMVRNLESGKQVKDTANMWMHYVDWEVELQFDEYKNNAMAWGTSNRNLNGEYMNFGKSGNAIKTGAGIFEQTEVANTMYYNTFSLKLLEDALYELSASKLAMDDRLFVIKTGERGAIQFHKEVLKTVSGWTTFVLDNNSTRVVEKVQSRLHSNALSAGFQFVEYKAPNGVRVRLDVDPFYDDPVRNKILHPMGGVAFSYRYDIWYIGTMDQPNIFKCKIKGDNEYRGYQWGIRNPFTGQKGNPYMSFDEDSAVIHRMATLGVCVLDPTRTMSLIPAILQG 504 T 0.75 DUF5309 pdbhh T Viruses T 7qoi 2 AA,CB,DB,EB,FB,G,GB,H,I,J,K,MA,NA,OA,PA,QA,SB,TB,UB,VB,W,WB,X,Y,Z Bf,Da,Db,Dd,De,Aa,Df,Ab,Ad,Ae,Af,Ca,Cb,Cd,Ce,Cf,Ea,Eb,Ed,Ee,Ba,Ef,Bb,Bd,Be A0A385DVS7_9CAUD Auxiliary capsid protein gp36 MVISINQVRQLYVAKALKANTAALTTAGDIVPKADTAKTTLYFQSMSPAGIVASDKINLKHVLYAKATPSEALAHKLVRYSVTLDADVSATPVAGQNYILRLAFRQYIGLSEEDQYFKYGEVIARSGMTASDFYKKMAISLAKNLENKTESTPLVNIYLISAAAASTDVPVTSATKESDLTATDYNQIIIEETEQPWVLGMMPQAFIPFTPQFLTITVDGEDRLWGVATVVTPTKTVPDGHLIADLEYFCMGARGDIYRGMGYPNIIKTTYLVDPGAVYDVLDIHYFYTGSNESVQKSEKTITLVAVDDGSHTAMNALIGAINTASGLTIATL 333 T 3.4 FTP pdbpercent T Viruses T 7qoi 3 BA,CA,HB,IB,L,M,RA,SA,XB,YB Bg,Bh,Dg,Dh,Ag,Ah,Cg,Ch,Eg,Eh A0A385DTA3_9CAUD Portal vertex capsid protein gp57 MAGQQGIYCAPDNIVPNRDRVDVGCAPDGAMQLWVMEYEVTGIGKGCAMCKAINPQQAEMLLKSNGIYNGSSYLYKVTRIEQVIVPPCNGLMAEQVVTYKDVVS 104 T 0.092 DUF2931 pdb T Viruses T 7qoi 5 CC,DC,EC,FC,GC,HC,IC,JC,KC,LC,MC,NC FA,FB,FC,FD,FE,FF,FG,FH,FI,FJ,FK,FL A0A385DT68_9CAUD Portal protein gp20 MADFLNFPRQMLPFSKKTKQWRKDCLLWANQKTFFNYSLVRKSVIHKKINYDLLNGRLHMSDLELVLNPDGIKAAYIPDRLQHYPIMNSKLNVLRGEESKRVFDFKVVVTNPNAISEIEDNKKNELLQRLQEMITDTSISEDEYNIKLEKLNDYYTYEWQDIREVRANELLNHYIKEYDIPLIFNNGFMDAMTCGEEIYQCDIVGGEPVIERVNPLKIRIFKSGYSNKVEDADMIILEDYWSPGRVIDTYYDVLSPKDIKYIETMPDYIGQGAVDQMDNIDERYGFVNQNMIGDEITVRDGTYFFDPANLFTEGIANSLLPYDLAGNLRVLRLYWKSKRKILKVKSYDPETGEEEWNFYPENYVVNKEAGEEVQSFWVNEAWEGTMIGNEIFVNMRPRLIQYNRLNNPSRCHFGIVGSIYNLNDSRPFSLVDMMKPYNYLYDAIHDRLNKAIASNWGSILELDLSKVPKGWDVGKWMYYARVNHIAVIDSFKEGTIGASTGKLAGALNNAGKGMIETNIGNYIQQQINLLEFIKMEMADVAGISKQREGQISQRETVGGVERATLQSSHITEWLFTIHDDVKKRALECFLETAKVALKGRNKKFQYILSDTSTRVMEIDGDEFAEADYGLVVDNSNGTQELQQKLDTLAQAALQTQTLSFSTITKLYTSSSLAEKQRLIEKDEKQIRERQAQAQKEQLEAQQQIAAMQQQQKEAELLQKEEANIRDNQTKIIIAQIQSEGGPDEEDGIMIDDYSPEAKANLAEKIREFDEKLKLDKDKLKLDKKKAETDASIKRQALRKKSSTTNK 806 T 0.21 RbsD_FucU pdbpssm T Viruses T 7qoi 6 OC,PC,QC,RC,SC,TC,UC,VC,WC,XC,YC,ZC FM,FN,FO,FP,FQ,FR,FS,FT,FU,FV,FW,FX A0A385DT91_9CAUD Ring protein 1 gp43 MVNNINWVKLPVILDRLLRHPLLTDLNLETAIQYTLDFISAMGLPNVYVDKIETIDIKEYRGELPCDLISINQVRLHKNGIALRAMTDNFNAYPTHDHKEGDWYERGEPSFKTQGRVIFTSIKHEKVDISYKAIMLDDEGLPLIPDNPIFLKTLELYIKKEWFTILFDMGKISPAVLNNTQQEYAFKAGQCNNEFVIPSVSEMEAITNMWNQLIPRVTEFRRGFKNLGDKEYIRVH 236 T 3.5 PriX pdbhh T Viruses T 7qoi 7 AD,BD,CD,DD,ED,FD,GD,HD,ID,JD,KD,LD FY,FZ,GA,GB,GC,GD,GE,GF,GG,GH,GI,GJ A0A385DT87_9CAUD Ring protein 2 gp40 MTYNELIYMVLDELKLSSDDSYYTPDHVIFLLVKYRSFLLKQRYSDIKKQIPDSDYQSICLDLIEVPAISGEPCEGSSYLRSKNKVPTTMMIGNPRVYPMDFYQGEITYISRDRMRYVGYNKFLRNIIYCSKAPDGYLYFKSWNPQFLHLEKVSFNAIFEDAKEASEMACPEENGTICKLEDKEFPIEDALVPPLIELVVKELRGPEYSPKDEDNNAKDDLPDAR 225 T 0.18 DUF547 pdbpssm T Viruses T 7qoi 8 AE,BE,CE,DE,EE,FE,GE,HE,IE,JE,MD,ND,OD,PD,QD,RD,SD,TD,UD,VD,WD,XD,YD,ZD IM,IN,IO,IP,IQ,IR,IS,IT,IU,IV,GK,GL,GM,GN,GO,GP,GQ,GR,GS,GT,GU,GV,IK,IL A0A385DV85_9CAUD Cargo protein 1 gp45 MAKKKIKRRGKMPPNIFDTGGQSWGQQSSGQFSNAFKGENLGNSIGSIGGAVGGIAQAGISNAQIADTSGIEAQNKAQKNMVVGASSNDDLMSEWGSWNKVKDDYSWKDVRGGSTGQRVTNTIGAAGQGAAAGASVGGPIGAIVGGVVGLGSAIGGWLGGNRKAKRKAKKLNKEAKEANERALTSFETRADNIDTQNDFNMLANFSAYGGPLEFGSGAIGYEFDNRYLNNQEMSAVAKQRLTSLPNSFQALPEMNTYNAFAEGGGLSREKNYGSKKKPYPSVPSGDFAGPHRSYPIPTKADARDALRLAGLHGNESVRRKVLAKYPSLKAFGGSLFDSVVGNNFNQSFTQGIQGMFQQEPEQTVQAANIAKDGGDIKIKEKNKGKFTAYCGGKVTEACIRKGKNSSNPTTRKRATFAQNARNWNAFGGWLNTQGGDFTNGVTFINEGGSHEENPYQGIQIGVDPEGAPNLVEQGEVVYDDYVFSDRMEIPDDIRKEYKLRGKTFAKAAKSAQRESEERPNDPLSTKGLQAAMERIATAQEEARQRKEAHREGNEYPSMFAYGGDTNPYGLALEDPMSVEELEALMVQSGETGEIAPEGNNGNRQTWTRYAPIIGSGLASLSDLFSKPDYDSADLISGVDLGAEAVGYAPIGNYLSYRPLDRDFYINKMNQQAAATRRGLMNTSGGNRLNAQAGILAADYNYGQNMGNLARQAEEYNQQLRERVEAFNRGTNMFNTETGLKASMFNAESRNAAKRARLGQATTVAQLRQGIKDQDAARRSANITNFLQGLGDMGWENEQANWLDTLAKSGVLKMNTKGEYTGGTKKAKGGKVRTKKKKGLTYG 842 T 0.0068 RTX pdb T Viruses T 7qoj 1 A A A0A385DT68_9CAUD Portal protein gp20 MADFLNFPRQMLPFSKKTKQWRKDCLLWANQKTFFNYSLVRKSVIHKKINYDLLNGRLHMSDLELVLNPDGIKAAYIPDRLQHYPIMNSKLNVLRGEESKRVFDFKVVVTNPNAISEIEDNKKNELLQRLQEMITDTSISEDEYNIKLEKLNDYYTYEWQDIREVRANELLNHYIKEYDIPLIFNNGFMDAMTCGEEIYQCDIVGGEPVIERVNPLKIRIFKSGYSNKVEDADMIILEDYWSPGRVIDTYYDVLSPKDIKYIETMPDYIGQGAVDQMDNIDERYGFVNQNMIGDEITVRDGTYFFDPANLFTEGIANSLLPYDLAGNLRVLRLYWKSKRKILKVKSYDPETGEEEWNFYPENYVVNKEAGEEVQSFWVNEAWEGTMIGNEIFVNMRPRLIQYNRLNNPSRCHFGIVGSIYNLNDSRPFSLVDMMKPYNYLYDAIHDRLNKAIASNWGSILELDLSKVPKGWDVGKWMYYARVNHIAVIDSFKEGTIGASTGKLAGALNNAGKGMIETNIGNYIQQQINLLEFIKMEMADVAGISKQREGQISQRETVGGVERATLQSSHITEWLFTIHDDVKKRALECFLETAKVALKGRNKKFQYILSDTSTRVMEIDGDEFAEADYGLVVDNSNGTQELQQKLDTLAQAALQTQTLSFSTITKLYTSSSLAEKQRLIEKDEKQIRERQAQAQKEQLEAQQQIAAMQQQQKEAELLQKEEANIRDNQTKIIIAQIQSEGGPDEEDGIMIDDYSPEAKANLAEKIREFDEKLKLDKDKLKLDKKKAETDASIKRQALRKKSSTTNK 806 T 0.21 RbsD_FucU pdbpssm T Viruses T 7qoj 2 B B A0A385DT91_9CAUD Ring protein 1 gp43 MVNNINWVKLPVILDRLLRHPLLTDLNLETAIQYTLDFISAMGLPNVYVDKIETIDIKEYRGELPCDLISINQVRLHKNGIALRAMTDNFNAYPTHDHKEGDWYERGEPSFKTQGRVIFTSIKHEKVDISYKAIMLDDEGLPLIPDNPIFLKTLELYIKKEWFTILFDMGKISPAVLNNTQQEYAFKAGQCNNEFVIPSVSEMEAITNMWNQLIPRVTEFRRGFKNLGDKEYIRVH 236 T 3.5 PriX pdbhh T Viruses T 7qoj 3 C C A0A385DT87_9CAUD Ring protein 2 gp40 MTYNELIYMVLDELKLSSDDSYYTPDHVIFLLVKYRSFLLKQRYSDIKKQIPDSDYQSICLDLIEVPAISGEPCEGSSYLRSKNKVPTTMMIGNPRVYPMDFYQGEITYISRDRMRYVGYNKFLRNIIYCSKAPDGYLYFKSWNPQFLHLEKVSFNAIFEDAKEASEMACPEENGTICKLEDKEFPIEDALVPPLIELVVKELRGPEYSPKDEDNNAKDDLPDAR 225 T 0.18 DUF547 pdbpssm T Viruses T 7qoj 4 D D A0A385DV73_9CAUD Ring protein 3 gp35 MTNKEFSDGFSTLLNSFGITPNITLDEYEKSTFLTNAQEQLIIDIYSGRNIIYGKSFEQTEEIRRYLSNLVETYETSTKVTGKLGLSKDSVFFEIPQDTWFITYEVAFLKDSRLGCLDGIEASVVPLPQDDLYRAKDNPFRGPSKDRVLRLDIKSDLAELISKYNVDKYLMRYISQPTPIILVDLPDGLSINGVSTESECELNPVVHRAILERAVQLAIISKTQLTGNKE 230 T 0.22 LAGLIDADG_3 pdb T Viruses T 7qoj 5 E,F E,F A0A385DVC3_9CAUD Ring protein 4/5 gp34 MNVNEFSNEFDVLYNNIMSNAAPGLNEYEKSVLLTKAQEEIVKNYFEPAGNKYGKGLDDSPKRQIDFSELIKVGEGVLNTSAPTITFDKRAKVYDLPADLFLVINEAVDTNAGTKQIVPISYSDYTRLMSRPYKEPVKYQAWRIITTSINNISVELIVNSNETITDYKVRYIRRPAPIITTNLSSEYGDVTINGVSTVSECELNPIIHSEILQRAVELAKAAYQGDLQASVELGQRSE 238 T 16 DUF3206 pdbhh T Viruses T 7qoj 6 G,H G,H A0A385DTH1_9CAUD Tail hub protein A gp38 MHFNELRISQDNRFLIIDVSVDNQDYFEDVLLDSIVIDTQDTFVMNGPSDNPLYIYNVEDAYDLTYSLPEQCNCNPVRVEEDESYCFTYGTQQMKNVRLELNIQDLKVSPCSTMFFVYVKSKGTPSTDTPCGFDKDQILGTVINLQPIYKQTLKYLKEVECDCNIPKGFIDMILKLKAIELCVRTGNYPQAIKYWNKFFIKNNCKSPTSNCGCYG 215 T 3.4 SPOB_a pdbhh T Viruses T 7qoj 7 I I A0A385DVM6_9CAUD Tail hub protein B gp39 MDKMLEISEEAITRYFTTLSQFGYKKYSDVDKIIVLFFMEEMLAGEMSYYVTQDDYRNIVNALYCLAGSTCMIDFPMFESYDTLVHSNNRTFVPRITEDSILRSTEDDNFRVEA 114 T 0.077 DUF5854 unppssm T Viruses T 7qoj 9 M,N M,N A0A385DV85_9CAUD Cargo protein 1 gp45 MAKKKIKRRGKMPPNIFDTGGQSWGQQSSGQFSNAFKGENLGNSIGSIGGAVGGIAQAGISNAQIADTSGIEAQNKAQKNMVVGASSNDDLMSEWGSWNKVKDDYSWKDVRGGSTGQRVTNTIGAAGQGAAAGASVGGPIGAIVGGVVGLGSAIGGWLGGNRKAKRKAKKLNKEAKEANERALTSFETRADNIDTQNDFNMLANFSAYGGPLEFGSGAIGYEFDNRYLNNQEMSAVAKQRLTSLPNSFQALPEMNTYNAFAEGGGLSREKNYGSKKKPYPSVPSGDFAGPHRSYPIPTKADARDALRLAGLHGNESVRRKVLAKYPSLKAFGGSLFDSVVGNNFNQSFTQGIQGMFQQEPEQTVQAANIAKDGGDIKIKEKNKGKFTAYCGGKVTEACIRKGKNSSNPTTRKRATFAQNARNWNAFGGWLNTQGGDFTNGVTFINEGGSHEENPYQGIQIGVDPEGAPNLVEQGEVVYDDYVFSDRMEIPDDIRKEYKLRGKTFAKAAKSAQRESEERPNDPLSTKGLQAAMERIATAQEEARQRKEAHREGNEYPSMFAYGGDTNPYGLALEDPMSVEELEALMVQSGETGEIAPEGNNGNRQTWTRYAPIIGSGLASLSDLFSKPDYDSADLISGVDLGAEAVGYAPIGNYLSYRPLDRDFYINKMNQQAAATRRGLMNTSGGNRLNAQAGILAADYNYGQNMGNLARQAEEYNQQLRERVEAFNRGTNMFNTETGLKASMFNAESRNAAKRARLGQATTVAQLRQGIKDQDAARRSANITNFLQGLGDMGWENEQANWLDTLAKSGVLKMNTKGEYTGGTKKAKGGKVRTKKKKGLTYG 842 T 0.0068 RTX pdb T Viruses T 7qok 1 A A A0A385DVD6_9CAUD MUZZLE PROTEIN MALKKEQHFFKGMQRDLSVSKFNPEYAFDAQNIRITAREHDTLLSVSNEKGNKEIPLQSPSGDPVVIDGVLLGQNVLNNYVTLFTKGTNDNIYRLENKGTYFETLILFSGNLNFSTDYPIESISVYENNNIQKVYWVDGLNQARVINITKDDYNNADDFDFVGTIHTSSKIEVSKVNGSGAFGQGVIQYAFTYYNKYGKETNIFRTSPLLYIAYSDRGASPEETVSCSFQINFTELDSSYDFIRVYSIHRTSIDATPTVRKVADLATDTKLYVDTGTTGEIVDPTLLLYVGGEEIAPYTMTQKDNTLFLGNYTLKRSLISTELKNQIKSDSIVTTILGGLDDAIESEWNVNTQYNSNYDLNYDSRIKGFQKGEIYRLGIQFQDNKGKWSEVVFIGDYECTERFKYTQYDTYGITLIPRFKVVISNSTTIQAIKNLGYINARGVVVFPTLEDRNILCQGILCPTVANYKDRLDNSPFVQSSWFSRPKQATETWKTEYSGTNHLSEFGEVPYFQHNEPIGSASLSEITRWEIQTSLGLVPYYNPSTTNAKDFVDGSPSEFLVDENIVTMHSPDVEFDDRLQNITNGKFKLRIIGTTHLTNTLSDISVITSTPTYGNYATGFYKGKVANMNISTSYYGGRQLSAGLFWSDNVKFQDPSPQDKLERLWMVYPWHRNGSLMNMGVPTEGTRAAALQRKIISNLKFASQNNYLPNQSVWEAEISGDANHTGITPVNSWTEGLVRIPAQANSNLGSLNYYANIDKVLTFNRSEQISEIYKNGYLIYTTKDWITDGKIADLFNNAISQTISVDQVQDWLTRIADTDKYGTEPVSMKYKSNPHLVFAFNYTESGKQLILPMKNNNNGYLAPSANSKPFWNPTAPEGAVYQDSINFTNENRAFFWLAELYRDSVVNRFGGDTEEAILNNTWLPSGDSVIIGDSINIEYTEGDTYYQRYDCLRTFAYTNEDQNSIVDIVSFMCESKVNIDGRYDKNRGQVNNLAVSPTNFNLFNPVYSQKNNFFTFRTIDYERFSINYFPNSITVTKEKSLGEDIDTWTNITLATTLDLDGDKGEIVSLNTYNNEIFCFQRRGLSNILFNSRVQIPTSDGMPIEITNGLKVSGKRYISNTIGCANKWSIAESPSGLYFIDNETNSLYLFNGEIVSLSDKLGFRQWISTHNVHVNWEPVGYNNYRSFYDKNNNDVYFTYKDHCLCYSELINQFTSFMSYEGVPAMFNVSSEFYAFKDGKMWEQFAGDYNMFFGEYKPFSITFVANAEEPNDKIFNTVEFRADSWDSDNLISNKTFDTLDVWNEYQHGTTPLTNLLGHPSPLKKKFRIWRANIPRAIANNRDRIRNTWAYIKLGMNTPNTYRTEFHDAIIHYFA 1371 T 0.014 Phage_stabilise pdbhh T Viruses T 7qok 2 B B Muzzle bound helix XXXXXXXXXXXXXX 14 F F F 7qok 3 C,D C,D A0A385DVC3_9CAUD Ring protein 4/5 gp34 MNVNEFSNEFDVLYNNIMSNAAPGLNEYEKSVLLTKAQEEIVKNYFEPAGNKYGKGLDDSPKRQIDFSELIKVGEGVLNTSAPTITFDKRAKVYDLPADLFLVINEAVDTNAGTKQIVPISYSDYTRLMSRPYKEPVKYQAWRIITTSINNISVELIVNSNETITDYKVRYIRRPAPIITTNLSSEYGDVTINGVSTVSECELNPIIHSEILQRAVELAKAAYQGDLQASVELGQRSE 238 T 16 DUF3206 pdbhh T Viruses T 7qol 1 A,O A,O A0A385DT68_9CAUD Portal protein gp20 MADFLNFPRQMLPFSKKTKQWRKDCLLWANQKTFFNYSLVRKSVIHKKINYDLLNGRLHMSDLELVLNPDGIKAAYIPDRLQHYPIMNSKLNVLRGEESKRVFDFKVVVTNPNAISEIEDNKKNELLQRLQEMITDTSISEDEYNIKLEKLNDYYTYEWQDIREVRANELLNHYIKEYDIPLIFNNGFMDAMTCGEEIYQCDIVGGEPVIERVNPLKIRIFKSGYSNKVEDADMIILEDYWSPGRVIDTYYDVLSPKDIKYIETMPDYIGQGAVDQMDNIDERYGFVNQNMIGDEITVRDGTYFFDPANLFTEGIANSLLPYDLAGNLRVLRLYWKSKRKILKVKSYDPETGEEEWNFYPENYVVNKEAGEEVQSFWVNEAWEGTMIGNEIFVNMRPRLIQYNRLNNPSRCHFGIVGSIYNLNDSRPFSLVDMMKPYNYLYDAIHDRLNKAIASNWGSILELDLSKVPKGWDVGKWMYYARVNHIAVIDSFKEGTIGASTGKLAGALNNAGKGMIETNIGNYIQQQINLLEFIKMEMADVAGISKQREGQISQRETVGGVERATLQSSHITEWLFTIHDDVKKRALECFLETAKVALKGRNKKFQYILSDTSTRVMEIDGDEFAEADYGLVVDNSNGTQELQQKLDTLAQAALQTQTLSFSTITKLYTSSSLAEKQRLIEKDEKQIRERQAQAQKEQLEAQQQIAAMQQQQKEAELLQKEEANIRDNQTKIIIAQIQSEGGPDEEDGIMIDDYSPEAKANLAEKIREFDEKLKLDKDKLKLDKKKAETDASIKRQALRKKSSTTNK 806 T 0.21 RbsD_FucU pdbpssm T Viruses T 7qol 2 B,P B,Q A0A385DT91_9CAUD RING PROTEIN 1 GP43 MVNNINWVKLPVILDRLLRHPLLTDLNLETAIQYTLDFISAMGLPNVYVDKIETIDIKEYRGELPCDLISINQVRLHKNGIALRAMTDNFNAYPTHDHKEGDWYERGEPSFKTQGRVIFTSIKHEKVDISYKAIMLDDEGLPLIPDNPIFLKTLELYIKKEWFTILFDMGKISPAVLNNTQQEYAFKAGQCNNEFVIPSVSEMEAITNMWNQLIPRVTEFRRGFKNLGDKEYIRVH 236 T 3.5 PriX pdbhh T Viruses T 7qol 3 C,Q C,S A0A385DT87_9CAUD Ring protein 2 gp40 MTYNELIYMVLDELKLSSDDSYYTPDHVIFLLVKYRSFLLKQRYSDIKKQIPDSDYQSICLDLIEVPAISGEPCEGSSYLRSKNKVPTTMMIGNPRVYPMDFYQGEITYISRDRMRYVGYNKFLRNIIYCSKAPDGYLYFKSWNPQFLHLEKVSFNAIFEDAKEASEMACPEENGTICKLEDKEFPIEDALVPPLIELVVKELRGPEYSPKDEDNNAKDDLPDAR 225 T 0.18 DUF547 pdbpssm T Viruses T 7qol 4 D,R D,T A0A385DV73_9CAUD Ring protein 3 gp35 MTNKEFSDGFSTLLNSFGITPNITLDEYEKSTFLTNAQEQLIIDIYSGRNIIYGKSFEQTEEIRRYLSNLVETYETSTKVTGKLGLSKDSVFFEIPQDTWFITYEVAFLKDSRLGCLDGIEASVVPLPQDDLYRAKDNPFRGPSKDRVLRLDIKSDLAELISKYNVDKYLMRYISQPTPIILVDLPDGLSINGVSTESECELNPVVHRAILERAVQLAIISKTQLTGNKE 230 T 0.22 LAGLIDADG_3 pdb T Viruses T 7qol 5 E,F,S,T E,F,U,V A0A385DVC3_9CAUD Ring protein 4/5 gp34 MNVNEFSNEFDVLYNNIMSNAAPGLNEYEKSVLLTKAQEEIVKNYFEPAGNKYGKGLDDSPKRQIDFSELIKVGEGVLNTSAPTITFDKRAKVYDLPADLFLVINEAVDTNAGTKQIVPISYSDYTRLMSRPYKEPVKYQAWRIITTSINNISVELIVNSNETITDYKVRYIRRPAPIITTNLSSEYGDVTINGVSTVSECELNPIIHSEILQRAVELAKAAYQGDLQASVELGQRSE 238 T 16 DUF3206 pdbhh T Viruses T 7qol 6 G,H,U,V G,H,W,Z A0A385DTH1_9CAUD TAIL HUB PROTEIN A GP38 MHFNELRISQDNRFLIIDVSVDNQDYFEDVLLDSIVIDTQDTFVMNGPSDNPLYIYNVEDAYDLTYSLPEQCNCNPVRVEEDESYCFTYGTQQMKNVRLELNIQDLKVSPCSTMFFVYVKSKGTPSTDTPCGFDKDQILGTVINLQPIYKQTLKYLKEVECDCNIPKGFIDMILKLKAIELCVRTGNYPQAIKYWNKFFIKNNCKSPTSNCGCYG 215 T 3.4 SPOB_a pdbhh T Viruses T 7qol 7 I,W I,a A0A385DVM6_9CAUD Tail hub protein B gp39 MDKMLEISEEAITRYFTTLSQFGYKKYSDVDKIIVLFFMEEMLAGEMSYYVTQDDYRNIVNALYCLAGSTCMIDFPMFESYDTLVHSNNRTFVPRITEDSILRSTEDDNFRVEA 114 T 0.077 DUF5854 unppssm T Viruses T 7qol 9 AA,BA,M,N e,f,M,N A0A385DV85_9CAUD CARGO PROTEIN C1 GP45 MAKKKIKRRGKMPPNIFDTGGQSWGQQSSGQFSNAFKGENLGNSIGSIGGAVGGIAQAGISNAQIADTSGIEAQNKAQKNMVVGASSNDDLMSEWGSWNKVKDDYSWKDVRGGSTGQRVTNTIGAAGQGAAAGASVGGPIGAIVGGVVGLGSAIGGWLGGNRKAKRKAKKLNKEAKEANERALTSFETRADNIDTQNDFNMLANFSAYGGPLEFGSGAIGYEFDNRYLNNQEMSAVAKQRLTSLPNSFQALPEMNTYNAFAEGGGLSREKNYGSKKKPYPSVPSGDFAGPHRSYPIPTKADARDALRLAGLHGNESVRRKVLAKYPSLKAFGGSLFDSVVGNNFNQSFTQGIQGMFQQEPEQTVQAANIAKDGGDIKIKEKNKGKFTAYCGGKVTEACIRKGKNSSNPTTRKRATFAQNARNWNAFGGWLNTQGGDFTNGVTFINEGGSHEENPYQGIQIGVDPEGAPNLVEQGEVVYDDYVFSDRMEIPDDIRKEYKLRGKTFAKAAKSAQRESEERPNDPLSTKGLQAAMERIATAQEEARQRKEAHREGNEYPSMFAYGGDTNPYGLALEDPMSVEELEALMVQSGETGEIAPEGNNGNRQTWTRYAPIIGSGLASLSDLFSKPDYDSADLISGVDLGAEAVGYAPIGNYLSYRPLDRDFYINKMNQQAAATRRGLMNTSGGNRLNAQAGILAADYNYGQNMGNLARQAEEYNQQLRERVEAFNRGTNMFNTETGLKASMFNAESRNAAKRARLGQATTVAQLRQGIKDQDAARRSANITNFLQGLGDMGWENEQANWLDTLAKSGVLKMNTKGEYTGGTKKAKGGKVRTKKKKGLTYG 842 T 0.0068 RTX pdb T Viruses T 7qol 10 CA P A0A385DVD6_9CAUD MUZZLE PROTEIN MALKKEQHFFKGMQRDLSVSKFNPEYAFDAQNIRITAREHDTLLSVSNEKGNKEIPLQSPSGDPVVIDGVLLGQNVLNNYVTLFTKGTNDNIYRLENKGTYFETLILFSGNLNFSTDYPIESISVYENNNIQKVYWVDGLNQARVINITKDDYNNADDFDFVGTIHTSSKIEVSKVNGSGAFGQGVIQYAFTYYNKYGKETNIFRTSPLLYIAYSDRGASPEETVSCSFQINFTELDSSYDFIRVYSIHRTSIDATPTVRKVADLATDTKLYVDTGTTGEIVDPTLLLYVGGEEIAPYTMTQKDNTLFLGNYTLKRSLISTELKNQIKSDSIVTTILGGLDDAIESEWNVNTQYNSNYDLNYDSRIKGFQKGEIYRLGIQFQDNKGKWSEVVFIGDYECTERFKYTQYDTYGITLIPRFKVVISNSTTIQAIKNLGYINARGVVVFPTLEDRNILCQGILCPTVANYKDRLDNSPFVQSSWFSRPKQATETWKTEYSGTNHLSEFGEVPYFQHNEPIGSASLSEITRWEIQTSLGLVPYYNPSTTNAKDFVDGSPSEFLVDENIVTMHSPDVEFDDRLQNITNGKFKLRIIGTTHLTNTLSDISVITSTPTYGNYATGFYKGKVANMNISTSYYGGRQLSAGLFWSDNVKFQDPSPQDKLERLWMVYPWHRNGSLMNMGVPTEGTRAAALQRKIISNLKFASQNNYLPNQSVWEAEISGDANHTGITPVNSWTEGLVRIPAQANSNLGSLNYYANIDKVLTFNRSEQISEIYKNGYLIYTTKDWITDGKIADLFNNAISQTISVDQVQDWLTRIADTDKYGTEPVSMKYKSNPHLVFAFNYTESGKQLILPMKNNNNGYLAPSANSKPFWNPTAPEGAVYQDSINFTNENRAFFWLAELYRDSVVNRFGGDTEEAILNNTWLPSGDSVIIGDSINIEYTEGDTYYQRYDCLRTFAYTNEDQNSIVDIVSFMCESKVNIDGRYDKNRGQVNNLAVSPTNFNLFNPVYSQKNNFFTFRTIDYERFSINYFPNSITVTKEKSLGEDIDTWTNITLATTLDLDGDKGEIVSLNTYNNEIFCFQRRGLSNILFNSRVQIPTSDGMPIEITNGLKVSGKRYISNTIGCANKWSIAESPSGLYFIDNETNSLYLFNGEIVSLSDKLGFRQWISTHNVHVNWEPVGYNNYRSFYDKNNNDVYFTYKDHCLCYSELINQFTSFMSYEGVPAMFNVSSEFYAFKDGKMWEQFAGDYNMFFGEYKPFSITFVANAEEPNDKIFNTVEFRADSWDSDNLISNKTFDTLDVWNEYQHGTTPLTNLLGHPSPLKKKFRIWRANIPRAIANNRDRIRNTWAYIKLGMNTPNTYRTEFHDAIIHYFA 1371 T 0.014 Phage_stabilise pdbhh T Viruses T 7qol 11 DA R Muzzle bound helix XXXXXXXXXXXXXX 14 F F F 7qoo 15 O X Unknown protein XXXXXXXXXXXXXXX 15 F F F 7qot 2 C,D C,D KNG1_HUMAN Kininogen-1 light chain SDDDWIPDIQIDPNGLSFNPISDFPDTTSPK 31 T 5.6 NUC153 pdbhh F Eukaryota T 7qox 2 C,D D,C KNG1_HUMAN Kininogen-1 light chain TQSDDDWIPDIQIDPNGLSFNPISDFPDT 29 T 4.6 NUC153 pdbhh F Eukaryota T 7qpd 5 E C CALR_HUMAN CRP55,CALREGULIN,ENDOPLASMIC RETICULUM RESIDENT PROTEIN 60,ERP60,HACBP,GRP60 EPAVYFKEQFLDGDGWTSRWIESKHKSDFGKFVLSSGKFYGDEEKDKGLQTSQDARFYALSASFEPFSNKGQTLVVQFTVKHEQNIDCGGGYVKLFPNSLDQTDMHGDSEYNIMFGPDICGPGTKKVHVIFNYKGKNVLINKDIRCKDDEFTHLYTLIVRPDNTYEVKIDNSQVESGSLEDDWDFLPPKKIKDPDASKPEDWDERAKIDDPTDSKPEDWDKPEHIPDPDAKKPEDWDEEMDGEWEPPVIQNPEYKGEWKPRQIDNPDYKGTWIHPEIDNPEYSPDPSIYAYDNFGVLGLDLWQVKSGTIFDNFLITNDEAYAEEFGNETWGVTKAAEKQMKDKQDEEQRLKEEEEDKKRKEEEEAEDKEDDEDKDEDEEDEEDKEEDEEEDVPGQAKDEL 400 T 2.1E-21 Calreticulin pdb F Eukaryota T 7qpk 1 A,B A,B A0A0W0CDE7_CANGB A-region of Awp14 MGSSHHHHHHSSGLVPRGSHMASMTGGQQMGRGSKTYENQKIVIDGVALGTTTFEDDELLVLKNSTLTLNNFMNIKLPAGISLTDNSVLNINTPPDDTPPSDSYDVKRPQYSMVINGKVSIDNGSQFVFDGSSLVYSLGPYASEKFLFDINTGMDGIFISKDSTMRITLPKYLDWGFSHATTKFSGIHIGGTYKAPYNSPLVILGTLEVLRSDSRTDDGYFDDNLFRIDLGPDKIDENGVFTMKNDLSGNIHCQGILSFFADIFKGTDNVFIRTIGFQAISPISPITVDLAEGPVQGNGYLRYNVIISQGQGNGLKLLNLQARLDIGLPIIYIYNSDNYKDLTAKAHDNVIDIIDHSSNKSFSIIGDRKYNITYWYQQYTEIYPSYQYGGYFKVPLFKKSLQLDFIPIIEPDY 413 T 0.19 Hyphal_reg_CWP pdbhh F Eukaryota T 7qpx 2 C,F C,F G4MXW3_MAGO7 AVR-Pik protein METGNKYIEKRAIDLSRERDPNFFDNPGIPVPECFWFMFKNNVRQDDGTCYSSWKMDMKVGPNWVHIKSDDNCNLSGDFPPGWIVLGKKRPGF 93 T 0.086 TMEM18 unp F Eukaryota T 7qq3 29 CA B Myxovalargin A XXAXXXXXXXVXXXXX 16 T 110 IlvB_leader pdbhh F F 7qql 2 B,E,F F,D,E KS6A1_HUMAN S6K-ALPHA-1,90 KDA RIBOSOMAL PROTEIN S6 KINASE 1,P90-RSK 1,P90RSK1,P90S6K,MAP KINASE-ACTIVATED PROTEIN KINASE 1A,MAPK-ACTIVATED PROTEIN KINASE 1A,MAPKAP KINASE 1A,MAPKAPK-1A,RIBOSOMAL S6 KINASE 1,RSK-1 RRVRKLPSTTL 11 T 13 DUF3549 pdbhh F Eukaryota T 7qqn 2 C,D B,D TRPV3_HUMAN TRPV3,VANILLOID RECEPTOR-LIKE 3,VRL-3 EVEEFPETSV 10 T 6 Gemini_V2 pdbhh F Eukaryota T 7qqy 2 B B ECM21_YEAST ECM21 PFITSRPW 8 T 3.8 DUF2183 pdbhh F Eukaryota T 7qrf 2 B C Unknown peptide XXXXX 5 F F F 7qrj 1 A,B,C,D,E,F A,B,C,D,E,F A0A2P1EHJ0_9VIRU Zav_19 protein MSISSLLEKNIYNVHNKSNTLTNVPANPTGNTNTVWSNSNFTPPHLMYGASDITQAIGNISLTTGSFSLSLSGPWASPLVQNVAYTKINNLVNLTFPPFQANATSSAVINSAIGALPADLRPTTNIQVDFEIFVIDDGNRPVNPGLITLLSNGQIVVYKDNNLGQFTTGIGGSGFNPFSITYMV 184 T 0.11 CDC45 pdb T Viruses T 7qrr 1 A,B,C,D,E,F,G,H,I,J,K,L F,A,B,C,D,E,G,H,I,J,K,L A0A1Q1PNC6_9VIRU NMV_189 protein MSVYGPVPTVTTRAFLPRLATAADSITSTTTTIALDPQTEQSYWTRVGDTATIHIHLVGAALPAAAPSTRIYGNFPPLRITPSSALAAQHGVIVPMQYYVAPTLPVGSSAAARIETGFIELGSLLNGAFTPLAANLIGTVGYEFAIDATYAAQ 153 T 29 FANCL_d3 pdbhh T Viruses T 7qrs 2 C,D C,D TAX_HTL1C PROTEIN X-LOR,PROTEIN PX,TRANS-ACTIVATING TRANSCRIPTIONAL REGULATORY PROTEIN OF HTLV-1 KHFRETEV 8 T 41 FeThRed_A pdbhh T Viruses T 7qrt 2 C C TAX_HTL1C PROTEIN X-LOR,PROTEIN PX,TRANS-ACTIVATING TRANSCRIPTIONAL REGULATORY PROTEIN OF HTLV-1 KHFRETEV 8 T 41 FeThRed_A pdbhh T Viruses T 7qs6 2 B B THAN_PODMA Thanatin-like derivative XPITYXNRXTXKCXRY 16 T 2.6 YihI unphh F Eukaryota T 7qs8 2 B,D C,D TAX_HTL1C PROTEIN X-LOR,PROTEIN PX,TRANS-ACTIVATING TRANSCRIPTIONAL REGULATORY PROTEIN OF HTLV-1 KHFRETEV 8 T 41 FeThRed_A pdbhh T Viruses T 7qsa 2 C C POLG_TBEVH NON-STRUCTURAL PROTEIN 5 LRLESSII 8 T 14 Decorin_bind pdbhh T Viruses F 7qsj 1 A,B A,B A0A3P4A4D3_MYCHD Methylmannose polysaccharide hydrolase (MmpH) MVLRDDLDAVPGVPGVLTPEQCRQTAQAIADAQEPSGALPWFEGGHTDPWDHVENAMALTVAGLLEPARAAFDWCRTTQRPDGSWPIQIRNGVVEDANSDSNFCAYVATGVWHHVLITGDRRFAETMWPVVAKAIDFVIDMQLPGGEIAWARSPSGLYEEALLTGCASIYHSIRCALALADYMGEPQPEWEVAVGRLGHAIAEHPEAFVTKDRWSMEWYYPVLGGALRGEAARARINRRWNDFVVPGLGIRCVDDRPWVTGAETCELVLALDAIGDLTRAHEQFAAMHHLREEDGSYWTGLVYDDGKRWPIERTTWTGAAMILAADALSRTTPGNGIFRGVDLPRGLEGEYDCACATSERKLAAALEHHHHHH 373 T 0.0028 Bac_rhamnosid6H unppercent F Bacteria T 7qto 2 C,D C,D Q6DP93_9INFA NS1 ARTIESEV 8 T 34 TEX12 pdbhh T Viruses T 7qtp 2 B B Q6B3P2_9INFA NS1 KMARTIESKV 10 T 10 Tipalpha pdbhh T Viruses T 7qtu 2 B,D,F,H D,C,F,H Q6B3P2_9INFA NS1 KMARTIESKV 10 T 10 Tipalpha pdbhh T Viruses T 7quu 1 A A YFPE_SCHPO Uncharacterized protein C7D4.14c MGWSHPQFEKSSDAVEPSVEKEYKKIISFRDTVFEGKHQQFLVPNNVRLKFLRDR 55 T 9.5 NPCC pdbhh F Eukaryota T 7quu 2 B B RED1_SCHPO NURS complex subunit red1 GAMGTTNQKEAEKAVSQLFEVGVRFNDFIAEGIEPSVVHTLFLKLGLDS 49 T 0.11 T4bSS_IcmS pdb F Eukaryota T 7quv 3 C C Peptide 3 RSPESVAFPMFQSHWYSG 18 T 3.4 Pneumo_matrix pdbhh F T 7qux 2 B D P7C8 CRLYGFKW 8 T 0.33 DUF1281 pdbhh F T 7qv1 14 N I Nascent chain XXXXXXXXXXXXXXXXXXXXXXXX 24 F F F 7qv2 15 O I Nascent chain XXXXXXXXXXXXXXXXXXXXXXXX 24 F F F 7qv3 15 O I Nascent chain XXXXXXXXXXXXXXXXXXXXXXXX 24 F F F 7qvb 1 A,B A,B DNA damage response protein C GAMGMKNAPLTLNFGSVRLPVSADGLLHAPTAQQQLGLTQSWEAALVEHGLPETYRDFGAGPEAAVSVPDFVALAFALDTPEARRWQKRARELLARAMQGDVRVAAQIAERNPEPDARRWLAARLESTGARRELLATVARHGGEGRVYGQLGSISNRTVLGKDSASVRQERGVKATRDGLTSAELLRLAYIDTVTARAIQESEARGNAAILTLHEQVARSERQSWERAGQVQRVG 235 T 32 DUF2789 pdbhh F T 7qwa 1 A,B,C,D,E,F A,B,C,D,E,F CC-Type2-(UgUe)4 XGEIXQXLKEIXKXLKEIXXXLKEIXQXLKGX 32 T 0.011 WXG100 pdbpssm F T 7qwb 1 A,B,C,D,E,F A,B,C,D,E,F CC-Type2-(Ue)4 XGEIXQALKEIXKALKEIXXALKEIXQALKGX 32 T 0.011 WXG100 pdbpssm F T 7qwc 1 A,B,C B,C,E CC-Type1-(UbUc)4 XGELXXIKQELXXIKKELXXIKXELXXIKQGX 32 T 0.0032 DUF5320 pdbhh F T 7qwd 1 A,B,C,D,E A,B,C,D,E CC-Type2-(Ug)4 XGEIAQXLKEIAKXLKEIAXXLKEIAQXLKGX 32 T 0.011 WXG100 pdbpssm F T 7qwe 1 A,B,C,D,E,F,G,H,I,J A,B,C,D,E,F,G,H,I,J CC-Type2-(Ug)4 XGEIAQXLKEIAKXLKEIAXXLKEIAQXLKGX 32 T 0.011 WXG100 pdbpssm F T 7qwn 1 A,B,C A,B,C CP125_MYCTU CHOLEST-4-EN-3-ONE 26-MONOOXYGENASE,CHOLEST-4-EN-3-ONE C26-MONOOXYGENASE [(25S)-3-OXOCHOLEST-4-EN-26-OATE FORMING],CHOLESTEROL C26-MONOOXYGENASE,CHOLESTEROL C26-MONOOXYGENASE [(25S)-3BETA-HYDROXYCHOLEST-5-EN-26-OATE FORMING],CYTOCHROME P450 125,STEROID C27-MONOOXYGENASE NGPSPNLPPGFDFTDPAIYAERLPVAEFAELRSAAPIWWNGQDPGKGGGFHDGGFWAITKLNDVKEISRHSDVFSSYENGVIPRFKNDIAREDIEVQRFVMLNMDAPHHTRLRKIISRGFTPRAVGRLHDELQERAQKIAAEAAAAGSGDFVEQVSCELPLQAIAGLLGVPQEDRGKLFHWSNEMTGNEDPEYAHIDPKASSAELIGYAMKMAEEKAKNPADDIVTQLIQADIDGEKLSDDEFGFFVVMLAVAGNETTRNSITQGMMAFAEHPDQWELYKKVRPETAADEIVRWATPVTAFQRTALRDYELSGVQIKKGQRVVMFYRSANFDEEVFQDPFTFNILRNPNPHVGFGGTGAHYCIGANLARMTINLIFNAVADHMPDLKPISAPERLRSGWLNGIKHWQVDYTGRCPVA 417 T 3.4000000000000003E-22 p450 unppssm F Bacteria T 7qwq 3 C s Nascent chain preprolactin XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 112 F F F 7qwr 1 A s nascent chain pre-prolactin XXXXXXXXXXXXXXXXXXXXXXXX 24 F F F 7qws 1 A s Nascent chain tubulin beta XXXXXXXXXXXXXXXXXXXXXXXX 24 F F F 7qy5 2 C,D F,G RED1_SCHPO RNA elimination defective protein Red1 KNEEDESNDSDKEDGEISEDD 21 T 6.2 Vpu pdbhh F Eukaryota T 7qyr 2 I,J,K,L,M,N,O,P K,L,M,N,O,P,Q,T poly-glutamate EEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEE 60 T 1600 Nop14 pdbhh F F 7qzd 2 C,F C,G A0A219T3Y8_MAGOR AVR-PIK PROTEIN METGNKYIEKRAIDLSRERDPNFFDNADIPVPECFWFMFKNNVRQDAGTCYSSWKMDKKVGPNWVHIKSDDNCNLSGDFPPGWIVLGKKRPGF 93 T 2.8 DIM unphh F Eukaryota T 7qzr 4 E,F E,F Q2G0X2_STAA8 MYELOPEROXIDASE INHIBITOR SPIN,PEROXIDASE INHIBITOR MKFKKVLVATAMVGVLATGVVGYGNQADAKVYSQNGLVLHDDANFLEHELSYIDVLLDKNADQATKDNLRSYFADKGLHSIKDIINKAKQDGFDVSKYEHVK 102 T 0.0051 CompInhib_SCIN unphh F Bacteria T 7qzv 1 A A Hm-AMP2 EKRWRRLIFNYFX 13 T 1.8 MCRS_N pdbhh F T 7qzw 1 A A Hm-AMP8 RAVIYKIPYNAIASRWIIAPKKCX 24 T 1.9 TGF_beta pdbhh F T 7r0j 1 A A V2R_HUMAN V2R Cter ESCTTASSSLAKD 13 T 57 DPM3 pdbhh F Eukaryota T 7r0w 5 E,M M,E A0A6P1VG96_9SYNC Cytochrome B6 MAAGVGIFIGYIAVFTGVTLGLLYGLRFVKLI 32 T 0.00039 PetL pdbpercent F Bacteria T 7r1i 1 A,B,C A,B,C CP125_MYCTU CHOLEST-4-EN-3-ONE 26-MONOOXYGENASE,CHOLEST-4-EN-3-ONE C26-MONOOXYGENASE [(25S)-3-OXOCHOLEST-4-EN-26-OATE FORMING],CHOLESTEROL C26-MONOOXYGENASE,CHOLESTEROL C26-MONOOXYGENASE [(25S)-3BETA-HYDROXYCHOLEST-5-EN-26-OATE FORMING],CYTOCHROME P450 125,STEROID C27-MONOOXYGENASE NGPSPNLPPGFDFTDPAIYAERLPVAEFAELRSAAPIWWNGQDPGKGGGFHDGGFWAITKLNDVKEISRHSDVFSSYENGVIPRFKNDIAREDIEVQRFVMLNMDAPHHTRLRKIISRGFTPRAVGRLHDELQERAQKIAAEAAAAGSGDFVEQVSCELPLQAIAGLLGVPQEDRGKLFHWSNEMTGNEDPEYAHIDPKASSAELIGYAMKMAEEKAKNPADDIVTQLIQADIDGEKLSDDEFGFFVVMLAVAGNETTRNSITQGMMAFAEHPDQWELYKKVRPETAADEIVRWATPVTAFQRTALRDYELSGVQIKKGQRVVMFYRSANFDEEVFQDPFTFNILRNPNPHVGFGGTGAHYCIGANLARMTINLIFNAVADHMPDLKPISAPERLRSGWLNGIKHWQVDYTGRCPVAH 418 T 3.4000000000000003E-22 p450 unppssm F Bacteria T 7r1o 2 E,F EEE,FFF Dusquetide RIVPA 5 T 57 VP1_VP3 pdbhh F F 7r1r 2 B,D,F,G D,E,F,P RIR2_ECOLI RIBONUCLEOTIDE REDUCTASE R2 PROTEIN YLVGQIDSEVDTDDLSNFQL 20 T 12 UL11 pdbhh F Bacteria T 7r1v 2 B B Dynobactin A WNSNVHSYRF 10 T 1.3 DUF5504 pdbhh F T 7r1w 6 F G dynobactin A WNSNVHSYRF 10 T 1.3 DUF5504 pdbhh F T 7r2m 2 C,D B,E Vangl2 peptide GSGSGGMRLQSETSVMRLQSETSV 24 T 3.6 Strabismus pdbhh F T 7r2t 2 B B Vangle2 peptide binding motif with the P-1 phosphrylated MRLQSETSV 9 T 0.42 Strabismus pdbhh F T 7r31 1 A,B,C A,B,C Y1513_SYNY3 Membrane-associated protein slr1513 MAKPANKLVIVTEKILLKKIAKIIDESGAKGYTVMNTGGKGSRNVRSSGQPNTSDIEANIKFEILTETREMAEEIADRVAVKYFNDYAGIIYICSAEVLYGHTFAGPEGASAWSHPQFEK 120 T 0.0021 DUF3240 pdbpercent F Bacteria T 7r32 1 A,B,C A,B,C Y1513_SYNY3 Membrane-associated protein slr1513 MAKPANKLVIVTEKILLKKIAKIIDESGAKGYTVMNTGGKGSRNVRSSGQPNTSDIEANIKFEILTETREMAEEIADRVAVKYFNDYAGIIYICSAEVLYGHTFSAWSHPQFEK 114 T 0.011 DUF3240 unppercent F Bacteria T 7r3u 1 A,B,C A,B,C CP125_MYCTU CHOLEST-4-EN-3-ONE 26-MONOOXYGENASE,CHOLEST-4-EN-3-ONE C26-MONOOXYGENASE [(25S)-3-OXOCHOLEST-4-EN-26-OATE FORMING],CHOLESTEROL C26-MONOOXYGENASE,CHOLESTEROL C26-MONOOXYGENASE [(25S)-3BETA-HYDROXYCHOLEST-5-EN-26-OATE FORMING],CYTOCHROME P450 125,STEROID C27-MONOOXYGENASE NGPSPNLPPGFDFTDPAIYAERLPVAEFAELRSAAPIWWNGQDPGKGGGFHDGGFWAITKLNDVKEISRHSDVFSSYENGVIPRFKNDIAREDIEVQRFVMLNMDAPHHTRLRKIISRGFTPRAVGRLHDELQERAQKIAAEAAAAGSGDFVEQVSCELPLQAIAGLLGVPQEDRGKLFHWSNEMTGNEDPEYAHIDPKASSAELIGYAMKMAEEKAKNPADDIVTQLIQADIDGEKLSDDEFGFFVVMLAVAGNETTRNSITQGMMAFAEHPDQWELYKKVRPETAADEIVRWATPVTAFQRTALRDYELSGVQIKKGQRVVMFYRSANFDEEVFQDPFTFNILRNPNPHVGFGGTGAHYCIGANLARMTINLIFNAVADHMPDLKPISAPERLRSGWLNGIKHWQVDYTGRCPVAH 418 T 3.4000000000000003E-22 p450 unppssm F Bacteria T 7r4h 4 E L STING_HUMAN HSTING,ENDOPLASMIC RETICULUM INTERFERON STIMULATOR,ERIS,MEDIATOR OF IRF3 ACTIVATION,HMITA,TRANSMEMBRANE PROTEIN 173 QEPELLISG 9 T 3 SurE pdbhh F Eukaryota T 7r51 2 C,D C,D ARG-ASP RD 2 T 370 EB1 pdbhh F F 7r6k 2 B 7 LOC1_YEAST LOCALIZATION OF ASH1 MRNA PROTEIN 1 MAPKKPSKRQNLRREVAPEVFQDSQARNQLANVPHLTEKSAQRKPSKTKVKKEQSLARLYGAKKDKKGKYSEKDLNIPTLNRAIVPGVKIRRGKKGKKFIADNDTLTLNRLITTIGDKYDDIAESKLEKARRLEEIRELKRKEIERKEALKQDKLEEKKDEIKKKSSVARTIRRKNKRDMLKSEAKASESKTEGRKVKKVSFAQ 204 T 2 PIN_6 pdb F Eukaryota T 7r6k 16 P 5 RRP17_YEAST RRP17 isoform 1 YLTKNERR 8 T 0.53 RyR unp F Eukaryota T 7r6q 14 N h RL35A_YEAST 60S ribosomal protein L35 GKKYQPKVTEKQRKKQIA 18 T 0.52 AgrD pdbhh F Eukaryota T 7r73 1 A G ENV_HV1H2;H6VWK7_9HIV1 Glycoprotein 120 VWKEANTTLFCASDAKAYDTEAHNVWATHACVPTDPNPQEVVLENVTENFNMWKNHMVEQMHEDIISLWDQSLKPCVKLTGGSVITQACPKISFEPIPIHFCAPAGFAILKCNDKKFNGTGPCTNVSTVQCTHGIRPVVSTQLLLNGSLAEEEVVIRSENFTNNVKNIIVQLNESVQINCTRHNNGGSGSGGDIRQAHCNISREKWQNTLKQIVKKLREQFKNKTIAFAPSSGGDPEIVMHSFNCNGEFFYCNTTKLFTSTWNSTWNSTWNNTEGSNSTVITLPCRIRQIINMWQEVGKAMYAPPIQGQIKCSSNITGLLLTRDGGVDTTKETFRPGGGNMKDNWRSELYKYKVVRIE 358 T 4.8E-52 GP120 unp T Viruses T 7r7a 3 C 7 LOC1_YEAST LOCALIZATION OF ASH1 MRNA PROTEIN 1 MAPKKPSKRQNLRREVAPEVFQDSQARNQLANVPHLTEKSAQRKPSKTKVKKEQSLARLYGAKKDKKGKYSEKDLNIPTLNRAIVPGVKIRRGKKGKKFIADNDTLTLNRLITTIGDKYDDIAESKLEKARRLEEIRELKRKEIERKEALKQDKLEEKKDEIKKKSSVARTIRRKNKRDMLKSEAKASESKTEGRKVKKVSFAQ 204 T 2 PIN_6 pdb F Eukaryota T 7r7c 7 G 7 LOC1_YEAST LOCALIZATION OF ASH1 MRNA PROTEIN 1 MAPKKPSKRQNLRREVAPEVFQDSQARNQLANVPHLTEKSAQRKPSKTKVKKEQSLARLYGAKKDKKGKYSEKDLNIPTLNRAIVPGVKIRRGKKGKKFIADNDTLTLNRLITTIGDKYDDIAESKLEKARRLEEIRELKRKEIERKEALKQDKLEEKKDEIKKKSSVARTIRRKNKRDMLKSEAKASESKTEGRKVKKVSFAQ 204 T 2 PIN_6 pdb F Eukaryota T 7r81 81 DC v1 Nascent peptide XXXX 4 F F F 7r8w 2 B B JAK2 pY813 phosphopeptide XELLT 5 T 85 NIPSNAP pdbhh F F 7r8x 2 B C EPOR pY454 phosphopeptide KXLYLVVS 8 T 0.81 Glyco_hydro_47 pdbhh F T 7rae 2 B D NCOA2_HUMAN TIF2 QALLRYLLDKDD 12 T 0.0028 DUF4927 pdb F Eukaryota T 7raf 2 B D NCOA2_HUMAN TIF2 QALLRYLLDKDD 12 T 0.0028 DUF4927 pdb F Eukaryota T 7raj 3 C A CSP_PLAFA ASN-PRO-ASP-PRO-ASN-ALA-ASN-PRO-ASN-VAL-ASP-PRO-ASN-ALA-ASN NPDPNANPNVDPNAN 15 T 0.7 Cas_Cas7 pdbhh F Eukaryota F 7rap 1 A A W5IDB3_LASLA Heterogeneous-backbone analogue of lasiocepsin GLXRKXLCAXAKXKXXCXXAXKLXCKCX 28 T 1.2 Antimicrobial_1 pdbhh F Eukaryota T 7raw 1 A A A0A2N0URA4_9FIRM Dockerin domain-containing protein HHHHHHENLYFQGEETDTKIYFDASNLPAEWGTTKTVYCHLYAVAGDDLPETSWQGKAEKCKKDTATGLYYFDTAKLKSADGTNHGGLKDNADYAVIFSTIDTKSQSHQTCNVTLGKPCLGDTIYLTGGTVENTEDSSKRDFAATWKNNSDNYGPKAAITSLGHVTEGRFPIYLSRAEMVAQAIFNWAVKNPKNYTPETVADICAQVEAEPMDVYNAYAEMYATELADPAAYPDCAPLTTVATLLGVDPSG 251 T 0.0015 CBM26 pdbpercent F Bacteria T 7rbx 1 A,B,C,D A,B,C,D Q2YQA0_BRUA2 ISOCITRATASE,ISOCITRATE LYASE MAHHHHHHMGTLEAQTQGPGSMTDFYSLIPSAPKGRFDGIERAHTAEDVKRLRGSVEIKYSLAEMGANRLWKLIHEEDFVNALGALSGNQAMQMVRAGLKAIYLSGWQVAADANTASAMYPDQSLYPANAGPELAKRINRTLQRADQIETAEGKGLSVDTWFAPIVADAEAGFGGPLDAFEIMKAYIEAGAAGVHFEDQLASEKKCGHLGGKVLIPTAAHIRNLNAARLAADVMGTPTLIVARTDAEAAKLLTSDIDERDQPFVDYEAGRTAEGFYQVKNGIEPCIARAIAYAPYCDLIWMETSKPDLAQARRFAEAVHKAHPGKLLAYNCSPSFNWKKNLDDATIAKFQRELGAMGYKFQFITLAGFHQLNYGMFELARGYKDRQMAAYSELQQAEFAAEADGYTATKHQREVGTGYFDAVSLAITGGQSSTTAMKESTETAQFKPAAE 450 T 2.3E-50 ICL unp F Bacteria T 7rc6 2 B C AERA-DL XAXAXVXYXGXAXVXGXAXGXVXAXAXAXTXAXA 34 T 440 TAA-Trp-ring pdbhh F T 7rcs 3 E,F D,C CSP_PLAFA CS NPDPNANPNVDPNAN 15 T 0.7 Cas_Cas7 pdbhh F Eukaryota F 7rd3 3 E,F C,D CSP_PLAFA CS NPDPNANPNVDPNAN 15 T 0.7 Cas_Cas7 pdbhh F Eukaryota F 7rd4 3 E,F G,I CSP_PLAFA CS NPDPNANPNVDPNAN 15 T 0.7 Cas_Cas7 pdbhh F Eukaryota F 7rd9 1 A C CSP_PLAFA CS NPDPNANPNVDPNAN 15 T 0.7 Cas_Cas7 pdbhh F Eukaryota F 7rda 1 A C CSP_PLAFA CS NPDPNANPNVDPNAN 15 T 0.7 Cas_Cas7 pdbhh F Eukaryota F 7rdh 3 G,H G,H De novo designed protein H3mb MSHHHHHHHHSENLYFQSGGSQHEKFLEWMLRKIEEAIKRGNKISAEFLINLAKNFIHVLGDDEIRRRLERLERQLH 77 T 5.2 YqaH pdbhh F T 7rdr 1 A,B,C A,B,C Circular tendon repeat protein SELAARCLIILFQQLVELARLAIESGDEELLRRVSEWLEEVIKDMRRVVEQALREGNSELAARILIILFQQLVELARLAIESGDEELLRRVSEWLEEVIKDMRRVVEQALREGNSELAARILIILFQQLVELARLAIESGDEELLRRVSEWLEEVIKDMRRVVEQALREGNSELAARILIILFQQLVELARLAIESGDEELLRRVSEWLEEVIKDMRRVVEQALREGNSELAARILIILFQQLVELARLAIESGDEELLRRVSEWLEEVIKDMRRVVEQALREGNSELAARILIILFQQLVELARLAIESGDEELLRRVSEWLEEVIKDMRRVVEQALREGNSELAARILIILFQQLVELARLAIESGDEELLRRVSEWLEEVIKDMRRVVEQALREGNSELACRILIILFQQLVELARLAIESGDEELLRRVSEWLEEVIKDMRRVVEQALREGN 456 T 0.0073 Glyco_hydro_88 pdbpssm F T 7rdv 5 E H PGCA_HUMAN Aggrecan core peptide EGRVRVNSAYQS 12 T 7.3 Peptidase_M15_3 pdbhh F Eukaryota T 7rdw 1 A,B,G,H C,D,M,N ENV_HV1H2 Glycoprotein 120 VWKEATTTLFCASDAKAYDTECHNVWATHACVPTDPNPQEVVLVNVTENFNMWKNDMVEQMHEDIISLWDQCLKPCVKLTNTSVITQACPKVSFEPIPIHYCAPAGFAILKCNNKTFNGTGPCTNVSTVQCTHGIRPVVSTQLLLNGSLAEEEVVIRSVNFTDNAKTIIVQLNTSVEINCTRPNNGGSGSGGNMRQAHCNISRAKWNNTLKQIASKLREQFGNNKTIIFKQSSGGDPEIVTHSFNCGGEFFYCNSTQLFNSTWFNSTWSTEGSNNTEGSDTITLPCRIKQIINMWQKVGKAMYAPPISGQIRCSSNITGLLLTRDGGNSNNESEIFRPGGGDMRDNWRSELYKYKVVKIEPLGSGSGHHHHHH 373 T 2.7999999999999998E-36 GP120 pdb T Viruses T 7re7 5 I,J C,F FETA_HUMAN PHE-MET-ASN-LYS-PHE-ILE-TYR-GLU-ILE FMNKFIYEI 9 T 3 Serum_albumin pdbhh F Eukaryota T 7re8 3 C,D C,F FETA_HUMAN PHE-MET-ASN-LYS-PHE-ILE-TYR-GLU-ILE FMNKFIYEI 9 T 3 Serum_albumin pdbhh F Eukaryota T 7rft 1 A,B A,B A0A2N0URA4_9FIRM Dockerin domain-containing protein GEETDTKIYFDASNLPAEWGTTKTVYCHLYAVAGDDLPETSWQGKAEKCKKDTATGLYYFDTAKLKSADGTNHGGLKDNADYAVIFSTIDTKSQSHQTCNVTLGKPCLGDTIYLTGGTVENTEDSSKRDFAATWKNNSDNYGPKAAITSLGHVTEGRFPIYLSRAEMVAQAIFNWAVKNPKNYTPETVADICAQVEAEPMDVYNAYAEMYATELADPAAYPDCAPLTTVATLLGVDPSG 239 T 0.0014 CBM26 pdbpercent F Bacteria T 7rfv 1 A A G3M192_9CAUD Tailspike protein MANKPTQPLFPLGLETSESSNIKGFNNSGTIEHSPGAVMTFPEDTEVTGLPSSVRYNPDSDEFEGYYENGGWLSLGGGGIRWETLPHAPSSNLLEGRGYLINNTTGTSTVVLPSPTRIGDSVTICDAYGKFATYPLTVSPSGNNLYGSTEDMAITTDNVSATFTWSGPEQGWVITSGVGLGQGRVYSREIFTQILASETSAVTLNTPPTIVDVYADGKRLAESKYSLDGNVITFSPSLPASTELQVIEYT 250 T 0.00013 T4_gp9_10 pdbhh T Viruses T 7rfx 2 B B ORF4B_MERS1 ORF4b RKARKRSHSPTKKLRYVKRRF 21 T 0.081 SUIM_assoc unppercent T Viruses T 7rfy 2 B B ORF4B_MERS1 ORF4b RKARKRSHSPTKKLRYVKRRF 21 T 0.081 SUIM_assoc unppercent T Viruses T 7rfz 2 B B ORF4B_MERS1 ORF4b RKARKRSHSPTKKLRYVKRRF 21 T 0.081 SUIM_assoc unppercent T Viruses T 7rg0 2 B B ORF4B_MERS1 ORF4b RKARKASHSPTKKLRYVKRRF 21 T 0.081 SUIM_assoc unppercent T Viruses T 7rg1 2 B B ORF4B_MERS1 ORF4b RKARKRSASPTKKLRYVKRRF 21 T 0.16 SUIM_assoc unp T Viruses T 7rg2 2 B B ORF4B_MERS1 ORF4b RKARKRSHSPTKKLAYVKRRF 21 T 0.081 SUIM_assoc unppercent T Viruses T 7rg3 2 B,C B,C ORF4B_MERS1 ORF4b RKARKRSHSPTKKLRYVKARF 21 T 0.081 SUIM_assoc unppercent T Viruses T 7rg6 2 B,C C,D ORF4B_BCHK5 ORF4B STRKRRRHPMNKRRYAKRRF 20 T 1.6 DUF1713 pdbhh T Viruses T 7rg8 2 B B HPSE_HUMAN Heparanase 8 kDa subunit MGSSHHHHHHSQDPNSSSQDVVDLDFFTQEPLHLVSPSFLSVTIDANLATDPRFLILLGSPKLRTLARGLSPAYLRFGGTKTDFLIFDPKKE 92 T 3.5 Glyco_hydro_79n unppercent F Eukaryota T 7rh5 7 J,Q U,a A0R1B6_MYCS2 Cytochrome c oxidase subunit CtaJ MSTALTHGLIGGVPLVLFAVLALIFLTRKGPHPDTYKMSDPWTHAPILWAAEEPREHGHGGHGHDSHGVVIGGGASGKW 79 T 0.08 VPDSG-CTERM pdbpssm F Bacteria T 7rh6 10 M,S U,a A0R1B6_MYCS2 Cytochrome c oxidase subunit CtaJ MSTALTHGLIGGVPLVLFAVLALIFLTRKGPHPDTYKMSDPWTHAPILWAAEEPREHGHGGHGHDSHGVVIGGGASGKW 79 T 0.08 VPDSG-CTERM pdbpssm F Bacteria T 7rh7 6 F,P U,a A0R1B6_MYCS2 Cytochrome c oxidase subunit CtaJ MSTALTHGLIGGVPLVLFAVLALIFLTRKGPHPDTYKMSDPWTHAPILWAAEEPREHGHGGHGHDSHGVVIGGGASGKW 79 T 0.08 VPDSG-CTERM pdbpssm F Bacteria T 7ri4 1 A G ESPP_ECOLX EspPbeta9-12 DWKVTARACLXXXXXXXXXXXXXXXXXXXXXXXXXXXXXRMLMSVGLNAEIRDNVRFGLEFEKSAFGKYNVDNAVNANFRYSF 83 T 0.012 OMP_b-brl pdbpercent F Bacteria T 7rih 1 A A D-[I11L]hyen D GXXXGXXXXXXXXXXXXXGXXXXXXXXXXX 30 F F F 7rij 2 B B D-[I11L]hyen D GXXXGXXXXXXXXXXXXXGXXXXXXXXXXX 30 F F F 7rj1 1 A,B,C,D A,B,C,D Q59TS4_CANAL Chorismate mutase SMDFMKPETVLDLANIRQALVRMEDTIVFDLIERSQFFSSPSVYEKNKYNIPNFDGTFLEWALLQLEVAHSQIRRYEAPDETPFFPDQLKTPILPPINYPKILAKYSDEINVNSEIMKFYVDEIVPQVSCGQGDQKENLGSASTCDIECLQAISRRIHFGKFVAEAKYQSDKPLYIKLILDKDVKGIENSITNSAVEQKILERLIVKAESYGVDPSLKFGQNVQSKVKPEVIAKLYKDWIIPLTKKVEIDYLLRRLEDEDVELVEKYKK 269 T 0.18 IL10 pdb F Eukaryota T 7rjf 1 A,B A,B [L47W]MOPD-1 IQIREYKRCGQDEERVRRECKERGERQNCHYVIHKEGNCYVCGIICW 47 T 3.9 DUF2175 pdbhh F T 7rkc 1 A,B A,B D_3_633 SGSGSPELDELWKRVKKLVTELLEQAERAGDPEEIFKLLEVAQQLLWLAEMFLRLAAIQEKATDPEIQELAERVLRLIKRLLEEAERAGDPRRIKELVEVALALAKLLEMFYRLKEIQERATDPEIQELAERVLRLIKKLLKAAEEAGDPRKIYKLVFVALVLLHLLQTFYRLKEIQEKATDPEIQRKAQEVLEKIKRLLEAAERAGDPAKILLYVIRAQLLAMELKFAYRKR 233 T 0.007 PMC2NT pdb F T 7rle 2 B,D B,D CBP_HUMAN HISTONE LYSINE ACETYLTRANSFERASE CREBBP,PROTEIN-LYSINE ACETYLTRANSFERASE CREBBP GNLVPDAASKHKQLSELLRGGSGS 24 T 3.1 SRC-1 pdbhh F Eukaryota T 7rlv 2 B,E,H P,R,Q CSP_PLAVS CS GDRADGQPAGDRADGQPA 18 T 90 RNaseH_C pdbhh F Eukaryota T 7rlw 1 A,F P,R CSP_PLAVS CS GDRAAGQPAGDRAAGQPA 18 T 0.088 X unppercent F Eukaryota T 7rlx 3 C P CSP_PLAVS CS GDRADGQPAGDRAAGQPA 18 T 85 RNaseH_C pdbhh F Eukaryota T 7rly 1 A,H,I P,R,Q CSP_PLAVS CS DRAAGQPAGDRADGQPA 17 T 110 DUF5632 pdbhh F Eukaryota T 7rlz 1 A,F P,R CSP_PLAVB CS GDRAAGQPAGNGAGGQAA 18 T 0.01 DUF2000 unppercent F Eukaryota T 7rm0 3 C,F,I,L P,Q,R,S Q2TM01_PLAVI peptide from Circumsporozoite protein variant VK247 ANGAGNQPGANGAGNQPG 18 T 0.12 Collagen unppercent F Eukaryota F 7rm1 3 E P Q2TM01_PLAVI peptide from Circumsporozoite protein variant VK247 EDGAGNQPGANGAGNQPGANGAGNQPG 27 T 0.33 Collagen unppssm F Eukaryota T 7rm3 3 E,F P,Q Q2TM01_PLAVI peptide from Circumsporozoite protein variant VK247 ANGAGNQPGANGAGNQPGANGAGGQAA 27 T 0.12 Collagen unppercent F Eukaryota F 7rma 2 B C AN13D_HUMAN Ankyrin repeat domain-containing protein 13D RGQQEEEDLQRILQLSLTEH 20 T 0.00092 UIM pdbhh F Eukaryota T 7rmi 3 C S Substance P 6-11 QFFGLMX 7 T 0.00044 Tachykinin pdbhh F T 7rmr 2 B B CYO2_VIOOD D-[I11L]cycloviolacin O2 XXXGXXXXXXXXXXXXXGXXXXXXXXXXXG 30 F F Eukaryota F 7rms 2 B B CYO2_VIOOD D-[I11L]cycloviolacin O2 XXXGXXXXXXXXXXXXXGXXXXXXXXXXXG 30 F F Eukaryota F 7rmx 1 A A Tunable symmetric protein, D_3_212 SGSGSTEEEEALLRWFQTLLAKFDELVKQLGDPRLLEEARRLQERLEEAKKRGDKRTIKQLAALLQMFVLIAQIFQLVEELGDPKLLEQAKRLLERLKEAVERGDEETIKELLDLAHMTYLIAQIFQLVEQLGDPRLLELAKELLKRLKEAQERGDRRTIERLLRLVQMTYLIAQIFQLVRQLGDPRLLETAKTLLTLLKLAFEEGDELLIKSLLTLVAETYRQAAAEQ 229 T 0.012 Tim44 pdb F T 7rmy 1 A A De Novo designed tunable homodimer, D_3-337 SGSGSSEELKKVQKMVSQILATAEAVLKLAKVLGDPKAVELAERILEDAKELAKRAESGDEETLRRAQTLLKVLKMVLEILLLAIKVELAAKELGDPKAVEAAQRILKQALRLLAEIKSGDEETLKRAQELLKVLKMVLRIIYLAIEVEKAAKELGDPTAVEAAQRILELALRLLQKVESGDEDTLRKALELLEVLYMVLRIIRLAIEVEKLAKKAGDPSAVEEAQRILKQALRLLKEISSGDEQTLDEAAKTLSFLAAELEAIAFAIRVKW 272 T 0.016 FANCI_S3 pdb F T 7rn7 3 E,F F,G Ac-VD(Aly)VD-CHO XVDXVD 6 T 320 DUF1543 pdbhh F F 7rn8 3 E,F F,G Ac-VD(Orn)VD-CHO XVDXVD 6 T 200 AAA_16 pdbhh F F 7rn9 3 E,F F,G Ac-VDFVD-CHO XVDFVD 6 T 71 KN_motif pdbhh F F 7rna 3 E,F G,F Ac-ITV(Dab)D-CHO XITVXD 6 T 120 BCL_N pdbhh F F 7rnb 3 E,F F,G Ac-VDRVD-CHO XVDRVD 6 T 180 NSP2-B_epitope pdbhh F F 7rnc 3 E,F F,G Ac-VDVVD-CHO XVDVVD 6 T 150 DUF5614 pdbhh F F 7rnd 3 E,F F,G Ac-VDPVD-CHO XVDPVD 6 T 89 PARP_reg pdbhh F F 7rne 3 E,F F,G Ac-YKPVD-CHO XYKPVD 6 T 96 DUF932 pdbhh F T 7rnf 3 E,F F,G Ac-VDKVD-CHO XVDKVD 6 T 320 DUF1543 pdbhh F F 7rng 3 E,F F,G Ac-ITAKD-CHO XITAKD 6 T 120 Alpha_Helical pdbhh F T 7rnw 2 E,F,G,H X,Y,Z,W ACE-DTY-LEU-GLN-TYR-ALA-VAL-LEU-ARG-HIS-LYS-ARG-ARG-GLU-SEC XXLQYAVLRHKRREX 15 T 9.8 Potass_KdpF pdbhh F T 7ro2 1 A A W0UUV5_9VIRU major capsid protein MSVSVVVPSAKATGAGGKKAKTFKVIKVSTPKVNNVHVPKIKKATRIHDPGAISGSLAKVTFGTNGFDIPTIAIALLIVGVIIGLSGLILSIFATATASAISNPSPGTLAYNLTHPLINGMVSFFSFFPTLYVLLGVTGIVLIAAGIISIIMEKFKT 157 T 0.0039 DUF4064 pdbpercent T Viruses T 7ro3 1 A A W0UUV5_9VIRU major capsid protein MSVSVVVPSAKATGAGGKKAKTFKVIKVSTPKVNNVHVPKIKKATRIHDPGAISGSLAKVTFGTNGFDIPTIAIALLIVGVIIGLSGLILSIFATATASAISNPSPGTLAYNLTHPLINGMVSFFSFFPTLYVLLGVTGIVLIAAGIISIIMEKFKT 157 T 0.0039 DUF4064 pdbpercent T Viruses T 7ro4 1 A A W0UUV5_9VIRU major capsid protein MSVSVVVPSAKATGAGGKKAKTFKVIKVSTPKVNNVHVPKIKKATRIHDPGAISGSLAKVTFGTNGFDIPTIAIALLIVGVIIGLSGLILSIFATATASAISNPSPGTLAYNLTHPLINGMVSFFSFFPTLYVLLGVTGIVLIAAGIISIIMEKFKT 157 T 0.0039 DUF4064 pdbpercent T Viruses T 7ro5 1 A A W0UUV5_9VIRU major capsid protein MSVSVVVPSAKATGAGGKKAKTFKVIKVSTPKVNNVHVPKIKKATRIHDPGAISGSLAKVTFGTNGFDIPTIAIALLIVGVIIGLSGLILSIFATATASAISNPSPGTLAYNLTHPLINGMVSFFSFFPTLYVLLGVTGIVLIAAGIISIIMEKFKT 157 T 0.0039 DUF4064 pdbpercent T Viruses T 7ro6 1 A A W0UUV5_9VIRU major capsid protein MSVSVVVPSAKATGAGGKKAKTFKVIKVSTPKVNNVHVPKIKKATRIHDPGAISGSLAKVTFGTNGFDIPTIAIALLIVGVIIGLSGLILSIFATATASAISNPSPGTLAYNLTHPLINGMVSFFSFFPTLYVLLGVTGIVLIAAGIISIIMEKFKT 157 T 0.0039 DUF4064 pdbpercent T Viruses T 7roa 1 A A Q836L4_ENTFA EntV SDQLEDSEVEAVAKGLEEMYANGVTEDNFKNYVKNNFAQQEISSVEEELNVNISDASTVVQARFNWNALGSCVANKIKDEFFAMISISAIVKAAQKKAWKELAVTVLRFAKANGLKTNAIIVAGQLALWAVQCGLS 136 T 0.00033 Potyvirid-P3 pdb F Bacteria T 7rob 1 A A W0UUV5_9VIRU major capsid protein MSVSVVVPSAKATGAGGKKAKTFKVIKVSTPKVNNVHVPKIKKATRIHDPGAISGSLAKVTFGTNGFDIPTIAIALLIVGVIIGLSGLILSIFATATASAISNPSPGTLAYNLTHPLINGMVSFFSFFPTLYVLLGVTGIVLIAAGIISIIMEKFKT 157 T 0.0039 DUF4064 pdbpercent T Viruses T 7roc 1 A A W0UUV5_9VIRU major capsid protein MSVSVVVPSAKATGAGGKKAKTFKVIKVSTPKVNNVHVPKIKKATRIHDPGAISGSLAKVTFGTNGFDIPTIAIALLIVGVIIGLSGLILSIFATATASAISNPSPGTLAYNLTHPLINGMVSFFSFFPTLYVLLGVTGIVLIAAGIISIIMEKFKT 157 T 0.0039 DUF4064 pdbpercent T Viruses T 7rod 1 A A W0UUV5_9VIRU major capsid protein MSVSVVVPSAKATGAGGKKAKTFKVIKVSTPKVNNVHVPKIKKATRIHDPGAISGSLAKVTFGTNGFDIPTIAIALLIVGVIIGLSGLILSIFATATASAISNPSPGTLAYNLTHPLINGMVSFFSFFPTLYVLLGVTGIVLIAAGIISIIMEKFKT 157 T 0.0039 DUF4064 pdbpercent T Viruses T 7roe 1 A A W0UUV5_9VIRU major capsid protein MSVSVVVPSAKATGAGGKKAKTFKVIKVSTPKVNNVHVPKIKKATRIHDPGAISGSLAKVTFGTNGFDIPTIAIALLIVGVIIGLSGLILSIFATATASAISNPSPGTLAYNLTHPLINGMVSFFSFFPTLYVLLGVTGIVLIAAGIISIIMEKFKT 157 T 0.0039 DUF4064 pdbpercent T Viruses T 7rog 1 A A W0UUV5_9VIRU major capsid protein MSVSVVVPSAKATGAGGKKAKTFKVIKVSTPKVNNVHVPKIKKATRIHDPGAISGSLAKVTFGTNGFDIPTIAIALLIVGVIIGLSGLILSIFATATASAISNPSPGTLAYNLTHPLINGMVSFFSFFPTLYVLLGVTGIVLIAAGIISIIMEKFKT 157 T 0.0039 DUF4064 pdbpercent T Viruses T 7roh 1 A A W0UUV5_9VIRU major capsid protein MSVSVVVPSAKATGAGGKKAKTFKVIKVSTPKVNNVHVPKIKKATRIHDPGAISGSLAKVTFGTNGFDIPTIAIALLIVGVIIGLSGLILSIFATATASAISNPSPGTLAYNLTHPLINGMVSFFSFFPTLYVLLGVTGIVLIAAGIISIIMEKFKT 157 T 0.0039 DUF4064 pdbpercent T Viruses T 7roi 1 A A W0UUV5_9VIRU major capsid protein MSVSVVVPSAKATGAGGKKAKTFKVIKVSTPKVNNVHVPKIKKATRIHDPGAISGSLAKVTFGTNGFDIPTIAIALLIVGVIIGLSGLILSIFATATASAISNPSPGTLAYNLTHPLINGMVSFFSFFPTLYVLLGVTGIVLIAAGIISIIMEKFKT 157 T 0.0039 DUF4064 pdbpercent T Viruses T 7rol 1 A,B A,B Alpha-crystallin B chain peptide KVKVXWDVIEV 11 T 0.074 Ycf34 pdbhh F T 7rov 2 C,D E,F Cyclic peptide MP-9903 AXPLYISYDPVCRA 14 T 0.83 Ribosomal_L33 pdbhh F T 7roy 2 E,F G,H FNIP1_MOUSE Folliculin-interacting protein 1 GRNKSSLLFKESEETRTPNCNCKYCSHPVLG 31 T 0.092 zf_C2H2_13 pdbhh F Eukaryota T 7rps 1 A A W5SFE3_9SPIR Fibronectin-binding lipoprotein FbpB GSTGSDNQYKFKLKNITDSVEQALKIAKQIKDDLDIIEFHRIKLSNHYGIRAEEHEKQTAREELSKFSKDKLEADLKKLLSEIEKSLNAATILITYNDYGGNLQSDLSAKTTLEALKTEVSSLITKIQDFNNKDHQAYPTSYYNDYQTYQALRNPYSKLTLVKDLLTRT 169 T 0.036 Hormone_1 pdbpssm F Bacteria T 7rpy 1 A A A0A412DXQ2_9FIRM Cohesin-containing protein HHHHHHENLYFQGAADTTYVVAGTTNLTGYEWVGTPDAAPENVMTADGSVFTKTFSAVPAGKNYQLKVVANTGDEQKWIGLDGTDNNVTFDVETACDVTVTFDPATNKITVTGDGVKMVTDLEVNSITVVGNGEDNWLNGVAWGVDAEVNHMTQVSDKVYQIKYENIESADDAYQFKFAANDDWAASWGLPEQSATPIGEEFDLTFNGQNMLLNTVSAGFEEDSLVDVTITLDITNFDYSTRSGAKATVKVEPSTP 256 T 0.0024 DUF5121 pdbhh F Bacteria T 7rqa 56 DB,HD 1v,2v P-site Peptidyl-tRNA Analog Peptide MTI 3 T 230 HycA_repressor pdbhh F F 7rqb 56 DB,HD 1v,2v P-site Peptidyl-tRNA Analog Peptide MAI 3 T 170 DUF6117 pdbhh F F 7rqc 56 DB,HD 1v,2v P-site Peptidyl-tRNA Analog Peptide MFI 3 T 120 RmuC pdbhh F F 7rqd 56 DB,HD 1v,2v P-site Peptidyl-tRNA Analog Peptide MTI 3 T 230 HycA_repressor pdbhh F F 7rqe 56 DB,HD 1v,2v P-site Peptidyl-tRNA Analog Peptide MAI 3 T 170 DUF6117 pdbhh F F 7rqq 3 C P CSP_PLAF7 PfCSP peptide NANPNVDP NANPNVDP 8 T 0.069 Cas_Cas7 pdbhh F Eukaryota F 7rqr 3 C,F C,P CSP_PLAF7 PfCSP peptide NANPNVDP NANPNVDP 8 T 0.069 Cas_Cas7 pdbhh F Eukaryota F 7rr3 1 A A M5AAG8_9CAUD Primase MIMEIPAIKALSRYAQWVIWKKERDTKIPYNPNNGKKASSTDPLAWGDIDEAQAGLVRYGANGLGFVLTKSDPFVFIDLDHVLDENKRVKCEWARQLLKEIKSYTEISPSGDGLHVVVSGKLPDYIKHKTKFDDGSALEVYESGRYMTITGEVFDGRDDIKELDLSILGEFAEHKIETKNAPVQIESATTLDDEAIIDLMKRKGQWPDAPADGDDWSSLDMSFANRLAFWCGKDIERMDRIFRQSPLMRQKWDRPTAGSTYGRITLKKACDFVDSVYDPALRNESDCPF 289 T 0.0044 VirE_N pdbhh T Viruses T 7rr4 1 A A M5AAG8_9CAUD Primase MIMEIPAIKALSRYAQWVIWKKAADTKIPYNPNNGKKASSTDPLAWGDIDEAQAGLVRYGANGLGFVLTKSDPFVFIDLDHVLDENKRVKCEWARQLLKEIKSYTEISPSGDGLHVVVSGKLPDYIKHKTKFDDGSALEVYESGRYMTITGEVFDGRDDIKELDLSILGEFAEHKIETKNAPVQIESATTLDDEAIIDLMKRKGQWPDAPADGDDWSSLDMSFANRLAFWCGKDIERMDRIFRQSPLMRQKWDRPTAGSTYGRITLKKACDFVDSVYDPALRNESDCP 288 T 0.0037 VirE_N pdbhh T Viruses T 7rr5 87 IC L1 60S ribosomal protein L1 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 210 F F F 7rrg 5 E C PK3CA_HUMAN PHOSPHATIDYLINOSITOL 4,5-BISPHOSPHATE 3-KINASE CATALYTIC SUBUNIT ALPHA ISOFORM,PTDINS-3-KINASE SUBUNIT ALPHA,PHOSPHATIDYLINOSITOL 4,5-BISPHOSPHATE 3-KINASE 110 KDA CATALYTIC SUBUNIT ALPHA,P110ALPHA,PHOSPHOINOSITIDE-3-KINASE CATALYTIC ALPHA POLYPEPTIDE,SERINE/THREONINE PROTEIN KINASE PIK3CA ALHGGWTTK 9 T 3.1 CFC pdbhh F Eukaryota T 7rro 5 I,J 8,9 CF107_BOVIN Uncharacterized protein C1orf158 homolog MQFLTAVSPQSSSTPSWKIETKYSTRVLTGNWTEERRKFIKATEKTPQTIYRKEYVPFPGHRPDQISRWYSKRTVEGLPYKYLITHHQEPSQRYLISTYDDHYNRHNYHPGLPELRTWNRHKLLWLPEKADFPLLGPPTNYGLYEQLKQKWLPPPEATLRESIYTSSYPRPPAGAMSRREHAIPVPPPRLQPVPHF 196 T 0.031 DUF1143 pdbpercent F Eukaryota T 7rro 13 ZB D SPAG8_BOVIN Sperm associated antigen 8 METSESTDRSQSRCLDLQPSSDGLGSSSDPFSSWDGRHRSALVAATAAASAAATAASTARAAALWTKSPAPYSHGNLLTEPSSDSLTERYTGPRFTHKISHGRLGFQPAYFSHIAWNPYTTNDLSSSRGPIPGSSSGPVPGSSSSPGPDSSSDPGPSSSSGPGGSPGGSGRGPGHGPGPGGGSGQGPGGGSGQGTDLGPAIDSRHSPGHGHGPRFNFSAPVGFRNPRGDLIPNYTGCKHHCHWEPQKQSWKFLKVSEPGARGLWKPPEVEGKSTVLSETLPRGQCLLYNWEEERATNYLDQVPVMQDGSESFFFRHGHRGLLTLQPQSPTSSCTTQKDSYQPPKSHCQPIRGKREAILEMLLRQQICKEVQAEQEPTRKDSEVESVTHHDYKKELVQAGPPAPTKIHDYHTEQPETFWLERAPQLPGVSNIRTLDTPFRKNCSFSTPVPLSLEQPLPFEPESYSQHGEISSLACQGGGQGGGGG 484 T 1.3 DUF1143 pdbhh F Eukaryota T 7rro 15 PD,XC F,E CF161_BOVIN CFAP161 MAQNLYGPRVRIGNWNEDVYLEEEIMKDFLAKRDKGQLLIQRNRRLKENLLRPMQLSVSEDGYIHYGDKVMLVSPDHPETEADLFLPGDLSLCMTPDEIKAHLSNELEVPCGLSAAQTKIPVGRNTFTILCAAGEVIGQVLRYGQNFRLGITGGFDDRMLYLSSDHRTLLKSSKRSWLQEVFLTHEDSYLNCWQAAFPHPQLRLEYEGSPVPANTKILITHCHTNRGLVAHRHLFLRTYFGQEAEVAAHTYLDSHRVEKPKNHWMLVTGAPRKDLSTMLDLPKPPAEDTRALEQEREQVSDPGARSTPDARGCVPQCTLPM 321 T 0.18 DUF1143 pdbhh F Eukaryota T 7rro 19 XE,YE,ZE H1,H2,H3 ODAD1_BOVIN Coiled-coil domain containing 114 MPFGLSAGSTRSEDGSEAFLEGMVDWELSRLQRQCKVMEDERRAYSKEVHQRINKQLEEIQRLEGVRHKLRVQISIAQSQVRRLRDSERLESMGHLLKCQVRVQAEVKELQAQNQALDREIQEWESRNSAHSKNARSPGCVQHDKVKSQRRIKSLENQLDKVICRFDIQLAQNATLREELDLLRIERNRYLNVDRKLQKEIQLLKDSVRNLMVSSTSAYTVREEAKAKLGMLRERAEKEVAQNETEVQILQRQIAHLEQLHHFLKLKNGDRQPDSAIVEKREQRAREVAEGLRKTSQEKLVLRYEDALNKLSQMTGESDPDLLVEKYLELEERNFAEFNFINEQNSELEHLQEEIKEMQEALVSGRRSEEDRRAQQEQQRAELQQRVDDVHSEADDLEARYHNFREQLEKLKTNIQHLFTRAQCDSTLINDLLGIKTHMRDRDISLFLSLIEKRLVQLLTVQAFLETQVVVMFNAALMVLGQSSEDFPKKVAPPQPPDNLEDPPGFEAKDDYPLSKEELLSSVMKAEQHLKELVESIKVESTPSMTSSTQKVSSSSRLVTQRPSQVPGSIMSHRTSGILVSSGGRATSSNVGHVTFGDSSATTGGLMSSRGSIPGRVTFRSPNSSSYLGSTGYVGSSRDHDSFEASKGPGSESSGGLGSSPGPASSPGPASSTGQASSTSKDSQSNY 687 T 2.4E-05 CCDC73 pdbhh F Eukaryota T 7rro 20 AF,BF,CF H4,H5,H6 ODAD3_BOVIN COILED-COIL DOMAIN-CONTAINING PROTEIN 151 MTSPLCWAAASNAMPSQDQISTPSKVKATQVQLKPYRSRGKGLVPVWHSLHSKAGPLHASEGKSAVNMQVAELQRKIQLLEGDRKAFYESTQWNIKKNQETINQLREETRVLQLQLTALLQGDEKVVQAVIREWKSEKPYLKNRTGQQALEHLDYRLNEKVKQLNALRHQLGLRQKWLEELQLQHSLRELEIAEAQDSNTEVAKTMRNLENRLEKARMKAEEAEHITSVYLQLKAYLQEESLHLGNRLDFMEAEVVRTKHELEELHLVNQEALNARDIAKNQLQYLEETVFRERKKRERYLTECKKRAEEKKLQNERMERKTQREHVLLQSDDTLQDSMYSKEEELKRRWSMYQMEVLFGKVKDATGVAETHAVVRRFLAQGDTFTQLEMLKSENEQTLLRLKQEKQRLQQELEDLKYSGEALLVSEQKRQAELQGRLKMEEQRRADAQNQLDRTMRALQITKEGLEHLAGKLNHIVVAGPTYEEGSPGASLDTKGSATPQPQETGRSVGKMDPKVDDYLPNLLGLVEEKLLKLHSQLENHNVPEMLRHIVDLEFYATLEGKLPSYNTRIALPVAGHKDKFFDEEESEEDDSDVVTRAALKMRSQKLIESRSKRRGRSRRS 621 T 0.0014 CCDC73 pdbhh F Eukaryota T 7rro 32 CP,DP,EP l,m,n FLTOP_BOVIN CILIA- AND FLAGELLA-ASSOCIATED PROTEIN 126 MATNYSANQYEKPFSPKYLQNWSLAKPTKERISSHEGYTQIIANDRGHLLPSVPRSKASPWGSFMGTWQMPLKVPPARATLTSRTAAGAASLTRWIQKNPDLLKASNGLRPEIFGKPHDPDSQKKLRKSITKTVQQAPSPTIIPSSPASNLSSPDQLQSSHPSAGHTPGPQSPLNSPKCPPGSPCLPHAGRNLAEV 196 T 0.34 DUF4248 pdbpssm F Eukaryota T 7rsl 1 A,B,C,D,E,F,G,H,I,J A,B,G,H,E,F,I,J,C,D SEI1_YEAST FEW LIPID DROPLETS PROTEIN 1 MKINVSRPLQFLQWSSYIVVAFLIQLLIILPLSILIYHDFYLRLLPADSSNVVPLNTFNILNGVQFGTKFFQSIKSIPVGTDLPQTIDNGLSQLIPMRDNMEYKLDLNLQLYCQSKTDHLNLDNLLIDVYRGPGPLLGAPGGSNSKDEKIFHTSRPIVCLALTDSMSPQEIEQLGPSRLDVYDEEWLNTIRIEDKISLESSYETISVFLKTEIAQRNLIIHPESGIKFRMNFEQGLRNLMLRKRFLSYIIGISIFHCIICVLFFITGCTAFIFVRKGQEKSKKHS 285 T 0.11 Seipin pdbpercent F Eukaryota T 7rsw 1 A,B A,B H0USR8_ROTGA HEMAGGLUTININ MSLGQSDLHIDPTQFIMYSGTISNGISYVNQAPSCGTVLSLKFTPGNSSLIENLHIEPYKVEVLKIEHVGDVSRATLLSDIVSLSTAQKKLLLYGFTQPGVQGLTGDVVSVETKRIPTPTQTNLLTIEDSIQCFTWDMNCANARSTNQDSRLIIYEQEDGRSHHHHHH 168 T 2.3E-05 Rota_VP4_MID unphh T Viruses T 7rsw 2 C C peptide GAAG 4 T 210 Pyr_redox_2 pdbhh F F 7rsx 1 A,B,C,D C,B,A,D ENVELOPE GLYCOPROTEIN GP120 VWKEAKTTLFCASDAKAYEKEVHNVWATHACVPTDPNPQEMVLANVTENFNMWKNDMVEQMHEDIISLWDESLKPCVKLTGGSAITQACPKVSFDPIPLHYCAPAGFAILKCNNKTFNGTGPCRNVSTVQCTHGIKPVVSTQLLLNGSLAEEEIIIRSENLTNNAKTIIVHLNESVNIVCTRPNNGGSGSGGNIRQAHCNINESKWNNTLQKVGEELAKHFPSKTIKFEPSSGGDLEITTHSFNCRGEFFYCNTSDLFNGTYRNGTYNHTGRSSNGTITLQCKIKQIINMWQEVGRAIYAPPIEGEITCNSNITGLLLLRDGGNDDNDTETFRPGGGDMRDNWRSELYKYKVVEIKHHHHHH 362 T 7.9E-36 GP120 pdb F T 7rsy 1 A,B,C,D A,B,C,D HIV-1 gp120 Clade C1086 VWKEAKTTLFCASDAKAYEKEVHNVWATHACVPTDPNPQEMVLANVTENFNMWKNDMVEQMHEDIISLWDESLKPCVKLTGGSAITQACPKVSFDPIPLHYCAPAGFAILKCNNKTFNGTGPCRNVSTVQCTHGIKPVVSTQLLLNGSLAEEEIIIRSENLTNNAKTIIVHLNESVNIVCTRPNNGGSGSGGNIRQAHCNINESKWNNTLQKVGEELAKHFPSKTIKFEPSSGGDLEITTHSFNCRGEFFYCNTSDLFNGTYRNGTYNHTGRSSNGTITLQCKIKQIINMWQEVGRAIYAPPIEGEITCNSNITGLLLLRDGGNDDNDTETFRPGGGDMRDNWRSELYKYKVVEIKHHHHHH 362 T 7.9E-36 GP120 pdb F T 7rsz 1 A,B,C,D B,A,C,D HIV-1 gp120 Clade C1086 VWKEAKTTLFCASDAKAYEKEVHNVWATHACVPTDPNPQEMVLANVTENFNMWKNDMVEQMHEDIISLWDESLKPCVKLTGGSAITQACPKVSFDPIPLHYCAPAGFAILKCNNKTFNGTGPCRNVSTVQCTHGIKPVVSTQLLLNGSLAEEEIIIRSENLTNNAKTIIVHLNESVNIVCTRPNNGGSGSGGNIRQAHCNINESKWNNTLQKVGEELAKHFPSKTIKFEPSSGGDLEITTHSFNCRGEFFYCNTSDLFNGTYRNGTYNHTGRSSNGTITLQCKIKQIINMWQEVGRAIYAPPIEGEITCNSNITGLLLLRDGGNDDNDTETFRPGGGDMRDNWRSELYKYKVVEIKHHHHHH 362 T 7.9E-36 GP120 pdb F T 7rt7 1 A,C,E,G,I,K A,B,E,G,I,K A0A0H2Z8A2_PSEAB RhsP2 MGSSHHHHHHSQDPAGPIVELDAQGNEIYYRTLSEQHLEILRNNFEVPPTSETFISPLQSYSQEYDGKLVRLTASPGTMNELSKIGVTANSGTGLLLPDLPPARKGWKQNNALFKLEALKKPTINEGGGVINTGLGDGKALEIFNKNLIDFEVID 155 T 25 STAT1_TAZ2bind pdbhh F Bacteria T 7rt7 2 B,D,F,H,J,L D,C,F,H,J,L A0A367GXM0_PSEAI RhsI2 MKTIYNFKQRIKEDPEYIRKAHELTLNTTKPKAGLKGTYGLLGSKEWWDNLENGSIPQKEISGTIKKVYLTGQDNTEDFNTIDIETENKTLCTEGTYTNKNTDRKHYEAGKKITIKYAFDPLKKPKPNGDIDYSKIVVEILISE 144 T 0.95 DUF6152 pdbhh F Bacteria T 7rte 4 D D LMBL3_HUMAN H-L(3)MBT-LIKE PROTEIN 3,L(3)MBT-LIKE PROTEIN 3,L3MBT-LIKE 3,MBT-1 KKATATTTWMVPTA 14 T 40 RNF180_C pdbhh F Eukaryota T 7rti 4 D D LMBL3_HUMAN H-L(3)MBT-LIKE PROTEIN 3,L(3)MBT-LIKE PROTEIN 3,L3MBT-LIKE 3,MBT-1 KKATATTWMVPTA 13 T 1.9 ATP-synt_Z pdbhh F Eukaryota T 7ru9 3 D,G D,G BAG6_HUMAN BAG FAMILY MOLECULAR CHAPERONE REGULATOR 6,BCL2-ASSOCIATED ATHANOGENE 6,BAG-6,HLA-B-ASSOCIATED TRANSCRIPT 3,PROTEIN G3,PROTEIN SCYTHE GPGSAETEPWAAAVPPEWVPIIQQDIQSQRKVKPQPPLSDAYLSGMPAKRRKTMQGEGPQLLLSEAVSRAAKAAGARPLTSPESLSRDLEAPEVQESYRQQLRSDIQKRLQEDPNYSPQRFPNAQRAFADDP 132 T 0.02 DUF2939 pdb F Eukaryota T 7rua 3 D,G D,G BAG6_HUMAN BAG FAMILY MOLECULAR CHAPERONE REGULATOR 6,BCL2-ASSOCIATED ATHANOGENE 6,BAG-6,HLA-B-ASSOCIATED TRANSCRIPT 3,PROTEIN G3,PROTEIN SCYTHE GPGSAETEPWAAAVPPEWVPIIQQDIQSQRKVKPQPPLSDAYLSGMPAKRRKTMQGEGPQLLLSEAVSRAAKAAGARPLTSPESLSRDLEAPEVQESYRQQLRSDIQKRLQEDPNYSPQRFPNAQRAFADDP 132 T 0.02 DUF2939 pdb F Eukaryota T 7ruc 3 D,F D,G BAG6_HUMAN BAG FAMILY MOLECULAR CHAPERONE REGULATOR 6,BCL2-ASSOCIATED ATHANOGENE 6,BAG-6,HLA-B-ASSOCIATED TRANSCRIPT 3,PROTEIN G3,PROTEIN SCYTHE GPGSAETEPWAAAVPPEWVPIIQQDIQSQRKVKPQPPLSDAYLSGMPAKRRKTMQGEGPQLLLSEAVSRAAKAAGARPLTSPESLSRDLEAPEVQESYRQQLRSDIQKRLQEDPNYSPQRFPNAQRAFADDP 132 T 0.02 DUF2939 pdb F Eukaryota T 7rva 4 D D UNK-UNK-UNK XXX 3 F F F 7rva 5 E E UNK-UNK-UNK-UNK XXXX 4 F F F 7rwc 5 E P SH3-containing GRB2-like protein 3-interacting protein 1 XXXXXXXX 8 F F F 7rx0 2 B G Zinc finger protein GLI2 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 92 F F F 7rx4 1 A,B A,a AS2 peptide QARILEADARILQAYANILSAHAEILRAE 29 T 4.1 Proho_convert pdbhh F T 7rx5 1 A A F1-N2 nanotube QAEILKADAENNRAYARILEAHAEILKAQ 29 T 35 DUF167 pdbhh F T 7rxq 2 B B CAC1S_RABIT CALCIUM CHANNEL,L TYPE,ALPHA-1 POLYPEPTIDE,ISOFORM 3,SKELETAL MUSCLE,DIHYDROPYRIDINE RECEPTOR ALPHA-1S SUBUNIT,DHPR,VOLTAGE-GATED CALCIUM CHANNEL SUBUNIT ALPHA CAV1.1 EERIFRRTGGLFGQVD 16 T 0.12 CAC1F_C unppercent F Eukaryota T 7ryf 43 QA m RS13_ACIB5 30S ribosomal protein S13 MARIAGVNIPDNKHAVISLTYIFGIGRHTAKNILAAVGITETTKIRELDDAQLDAIRAEVAKVPTEGDLRREISMNIKRLMDLGCYRGLRHRRSLPVRGQRTKTNARTRKGPRKPIKK 118 F F Bacteria T 7ryg 43 QA m RS13_ACIB5 30S ribosomal protein S13 MARIAGVNIPDNKHAVISLTYIFGIGRHTAKNILAAVGITETTKIRELDDAQLDAIRAEVAKVPTEGDLRREISMNIKRLMDLGCYRGLRHRRSLPVRGQRTKTNARTRKGPRKPIKK 118 F F Bacteria T 7ryh 43 QA m RS13_ACIB5 30S ribosomal protein S13 MARIAGVNIPDNKHAVISLTYIFGIGRHTAKNILAAVGITETTKIRELDDAQLDAIRAEVAKVPTEGDLRREISMNIKRLMDLGCYRGLRHRRSLPVRGQRTKTNARTRKGPRKPIKK 118 F F Bacteria T 7rz3 1 A A Xt3a SAVLWNQQYDKTVCNQEGEFCSKSGVDCCAGLSCRKYNLMGYGVCAAQTCSEEGTFCSLSDSDCCSGLKCKRRGHGYGECSK 82 T 0.0031 Toxin_12 pdbpssm F T 7rzd 3 C C KMT2A_HUMAN N-TERMINAL CLEAVAGE PRODUCT OF 320 KDA,P320 EPRSPSHSM 9 T 2.3 DUF5069 pdbhh F Eukaryota T 7rzj 3 C E KMT2A_HUMAN N-TERMINAL CLEAVAGE PRODUCT OF 320 KDA,P320 EPRSPSHSM 9 T 2.3 DUF5069 pdbhh F Eukaryota T 7rzz 3 C C P91820_CAEEL Lateral Signaling Target GSNSSTIAYSKSQHEAPKQLLQLRSEIKPLIPLNQP 36 T 5.7 AgrD pdbhh F Eukaryota T 7s00 1 A,E c,e A0A172JI16_BPPB1 DNA-directed RNA polymerase beta subunit MISNFRKFHGNKNQEKFNENLILNKENESILNYLDPICKTLEIIPEITYLGSSVEPINKVYKFNKEEKTSDIERSELQLIKMSFLIEKDDKKEEINKFIYFPKLIDSQYFIINGNRYYPIYQLLDSGTYRTNKALTLKTLLMPIVLREKKETFDDINGETHTMLNVDLDLFKSKVPFLIYFFSKFGFEGTLEYFGLQDLIHVLMKEDLDQLDEDEINDNVIFMITKNISLVVDKNFFSNKNNQIIIATLLNCFNTRIKIDKIYEKDYWVKKLGGYFTTNNSNKQEKGEGIILSFERILDEWTKKILRTEEKNKEDIYSVVRWMINNYLALVKQDNMNLANKRIRLYEYLLHPLLIKFSKGTYRVLNNRNSNKFEKIKTIFSNIQEGFLVKKIINNELLRYDNSVNSISLFTLILRYTQSGPQSPFSSNSTNNKLRGLHPSYLGRLGLTSTSAGDPGASGSLTPFLELPENSYMHFTEEPEINLNIDDISIDEVIES 496 T 0.0014 RNA_pol_Rpb2_3 pdbpssm T Viruses T 7s01 1 A A A0A172JIC8_BPPB1 DNA-directed RNA polymerase subunit MDILENYVSFDEQARDINIAFDKLFGRDDISHMNNFSINKRSYYNCLDQISDDLNLVLNKYNDLAYSLLEIRYNMATKENYTHMEFYSDIERLFIKNEKLLNVISDIVEEEYDLDLNQASKGKKINIELQVTDNLNKIYLKSSVLMRILIPILCDFNCDDDINEVLVYDIFKEVIKSFDDGKKNALNKLYKIIYSRVFETKYSDVVIWTYLKNMSTDLMIIVKDYFKVIIKKIFPKLKHNSSVISYLDVVIKQKLKYLFTFKYPISYKPLKAETTDDEELSEQERMEINLLRNDQGNSIINECSIKQEIAKIKKKYNVTDEVMKEFINGRELNSIQIYLVKIYYSNKFKVNSNKNDIFYLLYGMTRELGEMNFSIIPEILSCAIAPNVRKMNNRKKLVDKIIHSDKYSYLLKSYLPIKNILDKNNVILQLMTIKNAKFMNKENKEVDFSTDHLAEEVLDMLLCI 464 T 23 GvpK pdbhh T Viruses T 7s01 3 C c A0A172JI16_BPPB1 DNA-directed RNA polymerase beta subunit MISNFRKFHGNKNQEKFNENLILNKENESILNYLDPICKTLEIIPEITYLGSSVEPINKVYKFNKEEKTSDIERSELQLIKMSFLIEKDDKKEEINKFIYFPKLIDSQYFIINGNRYYPIYQLLDSGTYRTNKALTLKTLLMPIVLREKKETFDDINGETHTMLNVDLDLFKSKVPFLIYFFSKFGFEGTLEYFGLQDLIHVLMKEDLDQLDEDEINDNVIFMITKNISLVVDKNFFSNKNNQIIIATLLNCFNTRIKIDKIYEKDYWVKKLGGYFTTNNSNKQEKGEGIILSFERILDEWTKKILRTEEKNKEDIYSVVRWMINNYLALVKQDNMNLANKRIRLYEYLLHPLLIKFSKGTYRVLNNRNSNKFEKIKTIFSNIQEGFLVKKIINNELLRYDNSVNSISLFTLILRYTQSGPQSPFSSNSTNNKLRGLHPSYLGRLGLTSTSAGDPGASGSLTPFLELPENSYMHFTEEPEINLNIDDISIDEVIES 496 T 0.0014 RNA_pol_Rpb2_3 pdbpssm T Viruses T 7s02 3 C C P91820_CAEEL Lateral Signaling Target GSNSSTIAYSKSQHEAPKQLLQLRSEIKPLIPLNQP 36 T 5.7 AgrD pdbhh F Eukaryota T 7s07 3 C C GP42_EBVB9 Soluble gp42 KPNVEVWPVDPPPPVNFNKTAEQEYGDKEVKLP 33 T 5.4 MarB unphh T Viruses T 7s0s 1 A 3 A0QTP4_MYCS2 ribosomal protein bL37 AKRGRKKRDRKHSKANHGKRPNA 23 T 0.16 DUF6254 pdb F Bacteria T 7s1b 7 G C GP42_EBVB9 Soluble gp42 KPNVEVWPVDPPPPVNFNKTAEQEYGDKEVKLP 33 T 5.4 MarB unphh T Viruses T 7s1g 48 VA s nascent peptide chain MFKAF 5 T 40 DUF6435 pdbhh F F 7s1i 48 VA s nascent peptide chain MFKAF 5 T 40 DUF6435 pdbhh F F 7s1k 48 VA s nascent peptide chain MFKAF 5 T 40 DUF6435 pdbhh F F 7s1o 1 A,B A,B POTE1_HUMAN HPOT1,POT1-LIKE TELOMERE END-BINDING PROTEIN SNIEVERCQQLSATILTDHQYLERTPLCAILKQKAPQQYRIRAKLRSYKPRRLFQSVKLHCPKCHLLQEVPHEGDLDIIFQDGATKTPDVKLQNTSLYDSKIWTTKNQKGRKVAVHFVKNNGILPLSNECLLLIEGGTLSEICKLSNKFNSVIPVRSGHEDLELLDLSAPFLIQGTIHHYGCKQCSSLRSIQNLNSLVDKTSWIPSSVAEALGIVPLQYVFVMTFTLDDGTGVLEAYLMDSDKFFQIPASEVLMDDDLQKSVDMIMDMFCPPGIKIDAYPWLECFIKSYNVTNGTDNQICYQIFDTTVAEDVI 313 T 0.032 CDC24_OB3 pdbhh F Eukaryota T 7s1t 1 A,B,C,D A,D,G,J POTE1_HUMAN HPOT1,POT1-LIKE TELOMERE END-BINDING PROTEIN SNIEVERCQQLSATILTDHQYLERTPLCAILKQKAPQQYRIRAKLRSYKPRRLFQSVKLHCPKCHLLQEVPHEGDLDIIFQDGATKTPDVKLQNTSLYDSKIWTTKNQKGRKVAVHFVKNNGILPLSNECLLLIEGGTLSEICKLSNKFNSVIPVRSGHEDLELLDLSAPFLIQGTIHHYGCKQCSSLRSIQNLNSLVDKTSWIPSSVAEALGIVPLQYVFVMTFTLDDGTGVLEAYLMDSDKFFQIPASEVLMDDDLQKSVDMIMDMFCPPGIKIDAYPWLECFIKSYNVTNGTDNQICYQIFDTTVAEDVI 313 T 0.032 CDC24_OB3 pdbhh F Eukaryota T 7s1u 1 A,B A,B POTE1_HUMAN HPOT1,POT1-LIKE TELOMERE END-BINDING PROTEIN QLSATILTDHQYLERTPLCAILKQKAPQQYRIRAKLRSYKPRRLFQSVKLHCPKCHLLQEVPHEGDLDIIFQDGATKTPDVKLQNTSLYDSKIWTTKNQKGRKVAVHFVKNNGILPLSNECLLLIEGGTLSEICKLSNKFNSVIPVRSGHEDLELLDLSAPFLIQGTIHHYGCKQCSSLRSIQNLNSLVDKTSWIPSSVAEALGIVPLQYVFVMTFTLDDGTGVLEAYLMDSDKFFQIPASEVLMDDDLQKSVDMIMDMFCPPGIKIDAYPWLECFIKSYNVTNGTDNQICYQIFDTTVAEDVI 304 T 0.022 Zn_Tnp_IS1 pdbpssm F Eukaryota T 7s2t 2 D,E,F F,G,H ENCB_MYXXD EncB targeting peptide ESHPLTVGSLRR 12 T 0.11 DUF2076 unppercent F Bacteria T 7s3d 6 DA,F,R f,F,Q B4WP24_SYNS7 PHOTOSYSTEM I REACTION CENTER SUBUNIT III, PSAF2 MHKTIRKFFSLLLAAFVWLSVVSPAVAASEGYTDTHLVPCASSPAFNERMQNAPEGYYFDTPYQSYAANLLCGAEGLPHQQLRFDRAIDVLIPFGIFFYVAGFIGWSGRAYLISSNRNSKPEETEIFIDVALAIKSFVQGLLWPLLAVKELTTGELTAPVSEVSVSPR 168 T 0.0004 PSI_PsaF pdbpercent F Bacteria T 7s3d 7 EA,G,S i,I,R B4WP23_SYNS7 PsaI2 MVDATQLEGAYAAAWLPWIMIPMITYILPFPIFAIAFLWIEREGGEGGLDIDVMGSNAMSNEAMGRDISS 70 T 0.46 PSI_8 pdb F Bacteria T 7s4a 2 B,D B,D PALB2_HUMAN Partner and localizer of BRCA2 MHHHHHHSSGVDLGTENLYFQSNMLSLKQLLSFLSITDFQLPDEDFGPLKLEKVKSC 57 T 5 Eaf7 pdbhh F Eukaryota T 7s4g 3 G,H,I,J G,I,J,K LY66D_HUMAN PROTEIN LY6-D,MEGAKARYOCYTE-ENHANCED GENE TRANSCRIPT 1 PROTEIN DCYLGDLCN 9 T 1.1E-05 PLA2_inh unphh F Eukaryota T 7s4m 4 G,I,J D,N,H Unidentified Helix XXXXXXXXXXXXXXXXXXX 19 F F F 7s4o 2 C,D C,D LEU-PRO-ALA-THR-SER-GLY XLPATSGKX 9 T 5.9 Pas_Saposin pdbhh F T 7s4q 2 D,E,F F,G,H ENCC_MYXXD EncC targeting peptide PEKRLTVGSLRR 12 T 4.6 DUF6225 pdbhh F Bacteria T 7s51 2 C,D C,D LEU-PRO-ALA-THR-ALA XLPATAGKX 9 T 29 Pas_Saposin pdbhh F T 7s59 2 C,D 4,2 CCL7_HUMAN;CCL8_HUMAN HC14,MONOCYTE CHEMOATTRACTANT PROTEIN 2,MONOCYTE CHEMOTACTIC PROTEIN 2,MCP-2,SMALL-INDUCIBLE CYTOKINE A8,MONOCYTE CHEMOATTRACTANT PROTEIN 3,MONOCYTE CHEMOTACTIC PROTEIN 3,MCP-3,NC28,SMALL-INDUCIBLE CYTOKINE A7 QPDSVSIPITCCFNVINKKIPKQRLESYRRTTSSHCPREAVIFKTKLDKEICADPTQKWVQDFMKHLDKKTQTPKL 76 T 7.5E-26 IL8 unppssm F Eukaryota T 7s5b 1 A,B A,B Miniprotein Binder SVIEKLRKLEKQARKQGDEVLVMLARMVLEYLEKGWVSEEDADESADRIEEVLKK 55 T 0.89 TyeA pdbhh F T 7s5g 3 C C 8VH-Z02-ALA-DAL-PHE-FTR-PRO-THR-0A1-3WX XXAXFWPTXXX 11 T 0.6 Gla pdbhh F T 7s5h 3 C C Z03-Z02-DNP-DAL-PHE-FTR-PRO-THR-0A1-3WX-Z04-Z05 XXXXFWPTXXX 11 T 0.6 Gla pdbhh F F 7s5j 2 B B A3DCU2_ACET2 CtA peptide LNIGRELTDEELMEMTGGSTFSIQ 24 T 0.012 L_biotic_typeA pdbhh F Bacteria T 7s5u 1 A,B,C,D A,B,D,C KL61_DROME BIPOLAR KINESIN KRP-130 MAQSLQDQTNLHNKLIGEVMKISDQHSQAFVAKLMEQMQQQQLLMSKEIQTNLQVIEENNQRHKAMLDSMQEKFATIIDSSLQSVEEHAKQMHKKLEQLGAMSLPDAEELQNLQEELANERALAQQEDALLESMMMQMEQIKNLRSKNSISMSVHLNKMEESRLTRNHRIDDIKSGIQDYQKLGIEASQSAQAELTSQMEAGMLCLDQGVANCSMLQVHMKNLNQKYEKETNENVGSVRVSGHHHHHH 248 T 0.00078 DUF1340 pdb F Eukaryota T 7s76 1 A,B,C,D,E,F A,B,C,D,E,F CAPSD_HBVD1 CORE ANTIGEN,CORE PROTEIN,HBCAG,P21.5 MGSMDIDPYKEFGATVELLSFLPSDFFPSVRDLLDTAAALYRDALESPEHCSPHHTALRQAILCWGDLMTLATWVGTNLEDPASRDLVVSYVNTNVGLKFRQLLWFHISCLTFGRETVLEYLVSFGVWIRTPPAARPPNAPILSTLPETTVVKLENLYFQ 160 T 3.9E-25 Hepatitis_core unp T Viruses T 7s78 7 DA 5 Unknown-1 XXXXXXXXXXXXXXXX 16 F F F 7s78 8 EA 6 Unknown-2 XXXXXXXXXX 10 F F F 7s79 3 C E KMT2A_HUMAN N-TERMINAL CLEAVAGE PRODUCT OF 320 KDA,P320 EPRXPSHSM 9 T 23 Ribosomal_S12 pdbhh F Eukaryota T 7s7d 3 C E KMT2A_HUMAN N-TERMINAL CLEAVAGE PRODUCT OF 320 KDA,P320 EPRXPSHSM 9 T 2.3 DUF5069 pdbhh F Eukaryota T 7s7e 3 C C DOT1L_HUMAN DOT1-LIKE PROTEIN,HISTONE H3-K79 METHYLTRANSFERASE,H3-K79-HMTASE,LYSINE N-METHYLTRANSFERASE 4 LPASPAHQL 9 T 18 YkpC pdbhh F Eukaryota T 7s7f 3 C C DOT1L_HUMAN DOT1-LIKE PROTEIN,HISTONE H3-K79 METHYLTRANSFERASE,H3-K79-HMTASE,LYSINE N-METHYLTRANSFERASE 4 LPASPAHQL 9 T 18 YkpC pdbhh F Eukaryota T 7s7j 2 B B IST1_HUMAN HIST1,PUTATIVE MAPK-ACTIVATING PROTEIN PM28 TSASEDIDFDDLSRRFEELKKKT 23 T 2.6 TACC_C pdbhh F Eukaryota T 7s8a 3 C C KMT2A_HUMAN N-TERMINAL CLEAVAGE PRODUCT OF 320 KDA,P320 EPRSPSHSM 9 T 2.3 DUF5069 pdbhh F Eukaryota T 7s8e 3 C E KMT2A_HUMAN N-TERMINAL CLEAVAGE PRODUCT OF 320 KDA,P320 EPRSPSHSM 9 T 2.3 DUF5069 pdbhh F Eukaryota T 7s8f 3 C C KMT2A_HUMAN N-TERMINAL CLEAVAGE PRODUCT OF 320 KDA,P320 EPRSPSHSM 9 T 2.3 DUF5069 pdbhh F Eukaryota T 7sa3 9 I N Unknown XXXXXXXXXXXXXXXXXXXXXXX 23 F F F 7sag 1 A A A0A384E130_9METZ Barrettide C NVVPCFCVEDETSGAKTCIPDNCDASRGTNP 31 T 5.1 IL4Ra_N pdbhh F Eukaryota T 7sau 2 C,D,E,F,G C,D,E,F,G A0A085L0W4_9FLAO Gliding motility protein GldL MPLIDVNGKKFKNFLAKLYGFGASIVILGAMFKILHWTGADLMLIIGLSTEAVIFFFSAFEKPAPEYDWTLVYPELAGVEDLDSKNNALVPQGGTSLTQELDNMLKEASIDEELIKSLGDGLRKFGDAALKLNETIDAAEGTQKYTEQITLAAKHMESLNALYAVQLEGTASQMELQNALIEKLGSSIENTEKLSTELSELVTNMSALNKVYGGMLSAMGVSK 223 T 0.0034 DUF489 unppssm F Bacteria T 7sav 1 A A CM3A_CONKI Mu-conotoxin KIIIA CCNCSSKWCRDHSRCCX 17 T 0.55 C5HCH pdbhh F Eukaryota T 7saw 1 A A CM3A_CONKI Mu-conotoxin KIIIA CCNCSSKWCRDHSRCCX 17 T 0.55 C5HCH pdbhh F Eukaryota T 7sax 2 C,D,E,F,G C,D,E,F,G A0A1I6R6J4_9SPHI GldL MAKKTKFKFGINTLINWGATVVIIGLMFKILHLKGGEWMIGVGLAVEALLFFIMGFMQAEQEPDWTRVYPELDEDYNGELPTRSVRAVAQPVATGNTAALDKLLQDAKIDENLIGNLGDGLRTFSDKVASISKVADTAVATNQFADKLNAASTGAAQLSNAFERAASDLQTFNESAADMQQFKEQVSTFNKNLSSLNAIYGNMLSAMNTNRS 212 T 0.0057 DASH_Dam1 pdb F Bacteria T 7saz 2 C,D,E,F,G C,D,E,F,G F9YQB6_CAPCC GldL MAQSNKTTKKIFQMAYGIGASIVILGALFKILHWEIDFGGFKLGGGFLLAFGLITEAIIFFISAFEPVEEGYDWSLVYPELVGGEARQNQLVGRGVVSQLSEEDKAIKESLSEKLDNLLAEAQIDANLMHSLSASIQNFAGAAKEIAPVTDAMVSTHKYGEELSMAAAHLESLNSLYKLQLERTENQVSAQAGVVDNLNSLNEQMMSFKDNLKSLNSVYGGMLSAMGK 228 T 0.0067 DASH_Dad4 pdbpssm F Bacteria T 7sb3 2 D,E H,L Human polyclonal Fab model with polyalanine backbone - Heavy chain XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 122 F F F 7sb4 2 D,E H,L Human polyclonal Fab model with polyalanine backbone - Heavy chain XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 113 F F F 7sb5 2 D,E H,L Human polyclonal Fab model with polyalanine backbone - Heavy chain XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 115 F F F 7sba 2 H H Q6ZEI5_SYNY3 Cas5d MTKIYRCKLTLHDNVFFASREMGILYETEKYFHNWALSYAFFKGTIIPHPYGLVGQNAQTPAYLDRDREQNLLHLNDSGIYVFPAQPIHWSYQINTFKAAQSAYYGRSVQFGGKGATKNYPINYGRAKELAVGSEFLTYIVSQKELDLPVWIRLGKWSSKIRVEVEAIAPDQIKTASGVYVCNHPLNPLDCPANQQILLYNRVVMPPSSLFSQSQLQGDYWQIDRNTFLPQGFHYGATTAIAQDSPQLSLLDTN 254 T 27 PNISR pdbhh F Bacteria T 7sba 3 I I Q6ZEI7_SYNY3 Cas10d MTTLLQTLLIRTLSEQKDYILLEYFQTILPALEEHFGNTSGLGGSFISHQKHFGTQGYDTEKAKKMAQGFAKKGDQTLAAHILNALLTTWNVMQELEFPLNDIERRLLCLGITLHDYDKHCHAQDMAAPEPDNIQEIINICLELGKRLNFDEFWADWRDYIAEISYLAQNTHGKQHTNLISSNWSNAGYPFTIKERKLDHPLRHLLTFGDVAVHLSSPHDLVSSTMGDRLRDLLNRLGIEKRFVYHHLRDTTGILSNAIHNVILRTVQKLDWKPLLFFAQGVIYFAPQDTEIPERNEIKQIVWQGISQELGKKMSAGDVGFKRDGKGLKVSPQTSELLAAADIVRILPQVISVKVNNAKSPATPKRLEKLELGDAEREKLYEVADLRCDRLAELLGLVQKEIFLLPEPFIEWVLKDLELTSVIMPEETQVQSGGVNYGWYRVAAHYVANHATWDLEEFQEFLQGFGDRLATWAEEEGYFAEHQSPTRQIFEDYLDRYLEIQGWESDHQAFIQELENYVNAKTKKSKQPICSLSSGEFPSEDQMDSVVLFKPQQYSNKNPLGGGQIKRGISKIWSLEMLLRQAFWSVPSGKFEDQQPIFIYLYPAYVYAPQVVEAIRELVYGIASVNLWDVRKHWVNNKMDLTSLKSLPWLNEEVEAGTNAQLKYTKEDLPFLATVYTTTREKTDTDAWVKPAFLALLLPYLLGVKAIATRSMVPLYRSDQDFRESIHLDGVAGFWSLLGIPTDLRVEDITPALNKLLAIYTLHLAARSSPPKARWQDLPKTVQEVMTDVLNVFALAEQGLRREKRDRPYESEVTEYWQFAELFSQGNIVMTEKLKLTKRLVEEYRRFYQVELSKKPSTHAILLPLSKALEQILSVPDDWDEEELILQGSGQLQAALDRQEVYTRPIIKDKSVAYETRQLQELEAIQIFMTTCVRDLFGEMCKGDRAILQEQRNRIKSGAEFAYRLLALEAQQNQN 975 T 0.002 HD pdbpssm F Bacteria T 7sba 4 J,K J,K Q6ZEI7_SYNY3 Cas11d MTEKLKLTKRLVEEYRRFYQVELSKKPSTHAILLPLSKALEQILSVPDDWDEEELILQGSGQLQAALDRQEVYTRPIIKDKSVAYETRQLQELEAIQIFMTTCVRDLFGEMCKGDRAILQEQRNRIKSGAEFAYRLLALEAQQNQN 146 T 0.018 RE_TaqI pdbpssm F Bacteria T 7sbb 2 H H Q6ZEI5_SYNY3 Cas5d MTKIYRCKLTLHDNVFFASREMGILYETEKYFHNWALSYAFFKGTIIPHPYGLVGQNAQTPAYLDRDREQNLLHLNDSGIYVFPAQPIHWSYQINTFKAAQSAYYGRSVQFGGKGATKNYPINYGRAKELAVGSEFLTYIVSQKELDLPVWIRLGKWSSKIRVEVEAIAPDQIKTASGVYVCNHPLNPLDCPANQQILLYNRVVMPPSSLFSQSQLQGDYWQIDRNTFLPQGFHYGATTAIAQDSPQLSLLDTN 254 T 27 PNISR pdbhh F Bacteria T 7sbb 3 I I Q6ZEI7_SYNY3 Cas10d MTTLLQTLLIRTLSEQKDYILLEYFQTILPALEEHFGNTSGLGGSFISHQKHFGTQGYDTEKAKKMAQGFAKKGDQTLAAHILNALLTTWNVMQELEFPLNDIERRLLCLGITLHDYDKHCHAQDMAAPEPDNIQEIINICLELGKRLNFDEFWADWRDYIAEISYLAQNTHGKQHTNLISSNWSNAGYPFTIKERKLDHPLRHLLTFGDVAVHLSSPHDLVSSTMGDRLRDLLNRLGIEKRFVYHHLRDTTGILSNAIHNVILRTVQKLDWKPLLFFAQGVIYFAPQDTEIPERNEIKQIVWQGISQELGKKMSAGDVGFKRDGKGLKVSPQTSELLAAADIVRILPQVISVKVNNAKSPATPKRLEKLELGDAEREKLYEVADLRCDRLAELLGLVQKEIFLLPEPFIEWVLKDLELTSVIMPEETQVQSGGVNYGWYRVAAHYVANHATWDLEEFQEFLQGFGDRLATWAEEEGYFAEHQSPTRQIFEDYLDRYLEIQGWESDHQAFIQELENYVNAKTKKSKQPICSLSSGEFPSEDQMDSVVLFKPQQYSNKNPLGGGQIKRGISKIWSLEMLLRQAFWSVPSGKFEDQQPIFIYLYPAYVYAPQVVEAIRELVYGIASVNLWDVRKHWVNNKMDLTSLKSLPWLNEEVEAGTNAQLKYTKEDLPFLATVYTTTREKTDTDAWVKPAFLALLLPYLLGVKAIATRSMVPLYRSDQDFRESIHLDGVAGFWSLLGIPTDLRVEDITPALNKLLAIYTLHLAARSSPPKARWQDLPKTVQEVMTDVLNVFALAEQGLRREKRDRPYESEVTEYWQFAELFSQGNIVMTEKLKLTKRLVEEYRRFYQVELSKKPSTHAILLPLSKALEQILSVPDDWDEEELILQGSGQLQAALDRQEVYTRPIIKDKSVAYETRQLQELEAIQIFMTTCVRDLFGEMCKGDRAILQEQRNRIKSGAEFAYRLLALEAQQNQN 975 T 0.002 HD pdbpssm F Bacteria T 7sbb 4 J,K J,K Q6ZEI7_SYNY3 Cas11d MTEKLKLTKRLVEEYRRFYQVELSKKPSTHAILLPLSKALEQILSVPDDWDEEELILQGSGQLQAALDRQEVYTRPIIKDKSVAYETRQLQELEAIQIFMTTCVRDLFGEMCKGDRAILQEQRNRIKSGAEFAYRLLALEAQQNQN 146 T 0.018 RE_TaqI pdbpssm F Bacteria T 7sbv 2 D,E H,L Human polyclonal Fab model with polyalanine backbone - Heavy chain XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 116 F F F 7sbw 1 A,B H,L Human polyclonal Fab model with polyalanine backbone - Heavy chain XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 119 F F F 7sbx 2 D,E H,L Human polyclonal Fab model with polyalanine backbone - Heavy chain XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 123 F F F 7sby 1 A,B H,L Human polyclonal Fab model with polyalanine backbone - Heavy chain XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 126 F F F 7sc5 1 A,C,E A,C,E Q2N0S6_9HIV1 ENVELOPE GLYCOPROTEIN GP160,GLYCOPROTEIN 120,SURFACE PROTEIN GP120,TRANSMEMBRANE PROTEIN GP41 AENLWVTVYYGVPVWKDAETTLFCASDARAYETEKHNVWATHACVPTDPNPQEIHLENVTEEFNMWKNNMVEQMHTDIISLWDQSLKPCVKLTPLCVTLQCTNVTNNITDDMRGELKNCSFNMTTELRDKKQKVYSLFYRLDVVQINENQGNRSNNSNKEYRLINCNTSAITQACPKVSFEPIPIHYCAPAGFAILKCKDKKFNGTGPCPSVSTVQCTHGIKPVVSTQLLLNGSLAEEEVMIRSENITNNAKNILVQFNTPVQINCTRPNNNTRKSIRIGPGQAFYATGDIIGDIRQAHCNVSKATWNETLGKVVKQLRKHFGNNTIIRFANSSGGDLEVTTHSFNCGGEFFYCNTSGLFNSTWISNTSVQGSNSTGSNDSITLPCRIKQIINMWQRIGQAMYAPPIQGVIRCVSNITGLILTRDGGSTNSTTETFRPGGGDMRDNWRSELYKYKVVKIEPLGVAPTRCKRRVV 474 T 3.9E-50 GP120 pdb T Viruses T 7sc7 8 BA,DB BG,CO P74135_SYNY3 Sll1873 protein MLKKLFGAKKEFYVQLDESQAPAQVEEADVAIVKSEVAPVEKPAPTTSKKTSIKKKSATKAAAPVETPASAPVAPAPKAKVDPSQVAFASGDPIPQNVARRTPGPSLNRFKEMARQVKVKR 121 T 0.28 LZ_Tnp_IS66 pdbpercent F Bacteria T 7sc9 7 AA,DB BF,CO P74135_SYNY3 Sll1873 protein MLKKLFGAKKEFYVQLDESQAPAQVEEADVAIVKSEVAPVEKPAPTTSKKTSIKKKSATKAAAPVETPASAPVAPAPKAKVDPSQVAFASGDPIPQNVARRTPGPSLNRFKEMARQVKVKR 121 T 0.28 LZ_Tnp_IS66 pdbpercent F Bacteria T 7scb 9 CA BI P74135_SYNY3 Sll1873 protein MLKKLFGAKKEFYVQLDESQAPAQVEEADVAIVKSEVAPVEKPAPTTSKKTSIKKKSATKAAAPVETPASAPVAPAPKAKVDPSQVAFASGDPIPQNVARRTPGPSLNRFKEMARQVKVKR 121 T 0.28 LZ_Tnp_IS66 pdbpercent F Bacteria T 7sdp 4 D F Unknown polymer fragment XXX 3 F F F 7sdp 5 E D Unknown polymer fragment XXXX 4 F F F 7seo 3 E,F F,G ACE-VAL-ASP-VAL-DAB-ASP XVDVXD 6 T 250 DUF3563 pdbhh F F 7sfr 51 YA v A0A3E0UTA6_MYCTX peptide AKRGRKKRDRKYSKANHGKRPN 22 T 0.2 DUF6254 pdb F Bacteria T 7sfy 1 A,B,D,E A,B,D,E MS18A_HUMAN FAPP1-ASSOCIATED PROTEIN 1 SNAELFNLESRVEIEKSLTQMEDVLKALQMKLWEAESKLSFATCKS 46 T 1.1 Trimer_CC unppercent F Eukaryota T 7sfy 2 C,F C,F MS18B_HUMAN CANCER/TESTIS ANTIGEN 86,CT86,OPA-INTERACTING PROTEIN 5,OIP-5 QNVPLSEKIAELKEKIVLTHNRLKSLMKILSEVTPDQSKPEN 42 T 0.071 SAND unppssm F Eukaryota T 7sg0 3 C C DQ2-glia-omega1 peptide QPFPQPEQPFP 11 T 4 Statherin pdbhh F F 7sg1 5 G,H C,H DQ2-glia-alpha1a peptide LQPFPQPELPYGSGGS 16 T 5.7 Sod_Fe_N pdbhh F T 7sg2 5 G,H C,H DQ2-glia-omega1 peptide QPFPQPEQPFPGS 13 T 4.7 Statherin pdbhh F T 7sg5 3 C A CSP_PLAF7 PfCSP peptide 21 NPDPNANPNVDPNAN 15 T 0.7 Cas_Cas7 pdbhh F Eukaryota F 7sg6 3 C A CSP_PLAF7 PfCSP peptide 21 NPDPNANPNVDPNAN 15 T 0.7 Cas_Cas7 pdbhh F Eukaryota F 7sgc 4 D G Unidentified polymer XXX 3 F F F 7sgc 5 E D Unidentified polymer XXXX 4 F F F 7sgz 8 H H DDC1_YEAST DNA Damage Checkpoint protein DDC1 MSFKATITESGKQNIWFRAIYVLSTIQDDIKITVTTNELIAWSMNETDTTLCQVRFQKSFFEEYEFKPHEIVFGENGVQVIEDTYGNSHKLYSFRVNGRHLTTISRKPDGDGIKSFTIAVNNTSTCPESLANRLIVVIEMDSLIVKEYCPQFQPIKYDPIIINLKYKRRFLDVFGTAASDRNPQEPLDPKLLDVFTNTERELTSALFNEEVESDIRKRNQLTAADEINYICCNSTLLKNFLDNCNVNVTDEVKLEINVHRLSITAFTKAVYGKNNDLLRNALSMSNTISTLDLEHYCLFTKSIIFKLKDFKNFITIGPSWKTTQDGNDNISLWFCHPGDPILMQMQKPGVKLELVEVTDSNINDDILEGKFIKTAISGSKEEAGLKDNKESCESPLKSKTALKRENLPHSVAGTRNSPLKVSYLTPDNGSTVAKTYRNNTARKLFVEEQSQSTNYEQDKRFRQASSVHMNMNREQSFDIGTTHEVACPRNESNSLKRSIADICNETEDPTQQSTFAKRADTTVTWGKALPAADDEVSCSNIDRKGMLKKEKLKHMQGLLNSQNDTSNHKKQDNKEMEDGLGLTQVEKPRGIFD 593 T 6.4E-09 Rad9 pdbpercent F Eukaryota T 7sh2 8 H H DDC1_YEAST DNA damage checkpoint protein DDC1 MSFKATITESGKQNIWFRAIYVLSTIQDDIKITVTTNELIAWSMNETDTTLCQVRFQKSFFEEYEFKPHEIVFGENGVQVIEDTYGNSHKLYSFRVNGRHLTTISRKPDGDGIKSFTIAVNNTSTCPESLANRLIVVIEMDSLIVKEYCPQFQPIKYDPIIINLKYKRRFLDVFGTAASDRNPQEPLDPKLLDVFTNTERELTSALFNEEVESDIRKRNQLTAADEINYICCNSTLLKNFLDNCNVNVTDEVKLEINVHRLSITAFTKAVYGKNNDLLRNALSMSNTISTLDLEHYCLFTTIEDEKQDKRSHSKRREHMKSIIFKLKDFKNFITIGPSWKTTQDGNDNISLWFCHPGDPILMQMQKPGVKLELVEVTDSNINDDILEGKFIKTAISGSKEEAGLKDNKESCESPLKSKTALKRENLPHSVAGTRNSPLKVSYLTPDNGSTVAKTYRNNTARKLFVEEQSQSTNYEQDKRFRQASSVHMNMNREQSFDIGTTHEVACPRNESNSLKRSIADICNETEDPTQQSTFAKRADTTVTWGKALPAADDEVSCSNIDRKGMLKKEKLKHMQGLLNSQNDTSNHKKQDNKEMEDGLGLTQVEKPRGIFD 612 T 6.8E-09 Rad9 pdbpercent F Eukaryota T 7sh3 1 A A Synthetic VirB8 Miniprotein Binder SGGNAEEITEKATLVGIEAWLLAKDEEQKKKVRTLNRQVKKLLQQNDLDQAKRVLDQLKSVLEDLKS 67 T 0.0067 DUF3375 pdb F T 7siy 2 B Z ZAP70_HUMAN Peptide from Tyrosine-protein kinase ZAP-70 XXTPEPX 7 T 22 FSIP1 pdbhh F Eukaryota F 7sjp 1 A E HTRA1_HUMAN HtrA1-LoopA peptide RKLPFSKREVP 11 T 3.7 Integrin_alpha pdbhh F Eukaryota T 7ska 1 A,C,E A,N,Y Q6TAN8_9HIV1 ENV POLYPROTEIN KLWVTVYYGVPVWKEATTTLFCASDAKAYDTEVHNVWATHACVPTDPNPQEVVLENVTENFNMWKNNMVEQMHEDIISLWDQSLKPCVKLTPLCVTLNCTDLRNVTNINNSSEGMRGEIKNCSFNITTSIRDKVKKDYALFYRLDVVPIDNDNTSYRLINCNTSTITQACPKVSFEPIPIHYCTPAGFAILKCKDKKFNGTGPCKNVSTVQCTHGIRPVVSTQLLLNGSLAEEEVVIRSSNFTDNAKNIIVQLKESVEINCTRPNNNTRKSIHIGPGRAFYTTGEIIGDIRQAHCNISRTKWNNTLNQIATKLKEQFGNNKTIVFNQSSGGDPEIVMHSFNCGGEFFYCNSTQLFNSTWNFNGTWNLTQSNGTEGNDTITLPCRIKQIINMWQEVGKAMYAPPIRGQIRCSSNITGLILTRDGGTNSSGSEIFRPGGGDMRDNWRSELYKYKVVKIEPLGVAPTK 465 T 1.9E-55 GP120 pdbpercent T Viruses T 7skn 1 A,B,C,D A,B,C,D De novo synthetic protein DIG8-CC GHMRIEVRVDNGRVRVRNGTDRPCRVRVTAGGETREYTVNPGTELEVELSPEQQNNAEVEVECGNEKYRFQLG 73 T 0.00047 DUF756 pdb F T 7sko 1 A,B,C,D A,B,C,D De novo synthetic protein DIG8-CC GHMRIEVRVDNGRVRVRNGTDRPCRVRVTAGGETREYTVNPGTELEVELSPEQQNNAEVEVECGNEKYRFQLG 73 T 0.00047 DUF756 pdb F T 7skz 1 A A SPIKE_SARS2 PRO-SER-LYS-ARG-SER-PHE-ILE-GLU-ASP-LEU-LEU-PHE-ASN PSKRSFIEDLLFN 13 T 0.00014 CoV_S2 pdbhh T Viruses T 7sl5 3 C,F,I,L C,F,I,L SPIKE_SARS2 PRO-SER-LYS-ARG-SER-PHE-ILE-GLU-ASP-LEU-LEU-PHE-ASN KPSKRSFIEDLLFNK 15 T 0.0004 CoV_S2 pdbhh T Viruses T 7slw 2 B,D,F,H,J,L G,H,I,J,K,L Chromodomain Y-like protein 2 XFALXS 6 T 260 DUF3827 pdbhh F F 7smc 2 B,D B,D ARI4A_HUMAN ARID DOMAIN-CONTAINING PROTEIN 4A,RETINOBLASTOMA-BINDING PROTEIN 1,RBBP-1 GPETLVCHEVDLDDL 15 T 48 DUF126 pdbhh F Eukaryota T 7smd 2 B B EID1_HUMAN 21 KDA PRB-ASSOCIATED PROTEIN,CREBBP/EP300 INHIBITORY PROTEIN 1,E1A-LIKE INHIBITOR OF DIFFERENTIATION 1,EID-1 LTEELGCDEIIDRE 14 T 0.071 Nse4-Nse3_bdg unphh F Eukaryota T 7sme 2 B B HDAC1_HUMAN HD1,PROTEIN DEACETYLASE HDAC1,PROTEIN DECROTONYLASE HDAC1 RIACEEEFSD 10 T 2.6 RAM pdbhh F Eukaryota T 7smf 2 B,D B,D Histone deacetylase 1 DIYCYEEFSD 10 T 3.9 End_beta_propel pdbhh F T 7smj 1 A A AI-designed TIM-barrel F2N HHHHHHENLYFQSDIAIVDADNPADAIQQVKDLRKYGAKLIAYKSKSSEELKLALKAGADIAIVDADNPADAIQQVKDLRKYGAKLIAYKSKSSEELKLALKAGADIAIVDADNPADAIQQVKDLRKYGAKLIAYKSKSSEELKLALKAGADIAIVDADNPADAIQQVKDLRKYGAKLIAYKSKSSEELKLALKAG 196 T 0.0034 TrkA_N pdb F T 7smk 3 C C CSOCA_HALNC CSOSCA,CARBONIC ANHYDRASE,CA,CARBOXYSOME SHELL PROTEIN CSOS3 MNTRNTRSKQRAPFGVSSSVKPRLDLIEQAPNPAYDRHPACITLPERTCR 50 T 3.3 zf-LYAR pdbhh F Bacteria T 7smu 1 A,B,C,D,E,F D,E,C,B,F,A Consomatin-Ro1 EGYKCVXKTCMPA 13 T 0.6 Urotensin_II pdbhh F T 7snq 2 M,N,O,P,Q,R,S,T,U,V,W,X M,N,O,P,Q,R,S,T,U,V,W,X CPSF6_HUMAN Cleavage and polyadenylation specificity factor subunit 6 PVLFPGQPFGQPPLG 15 T 2.2 MF_alpha pdbhh F Eukaryota T 7snr 1 A,B A,B LUCI_OPLGR 19KOLASE SDNMVFTLEDFVGDWRQTAGYNLDQVLEQGGVSSLFQNLGVSVTPIQRIVLSGENGLKIDIHVIIPYEGLSGDQMGQIEKIFKVVYPVDDHHFKVILHYGTLVIDGVTPNMIDYFGRPYEGIAVFDGKKITVTGTLWNGNKIIDERLINPDGSLLFRVTINGVTGWRLCERILA 174 T 0.0086 Lipocalin_7 pdbhh F Eukaryota T 7sns 1 A,B,C,D A,B,C,D LUCI_OPLGR 19KOLASE SDNMVFTLEDFVGDWRQTAGYNLDQVLEQGGVSSLFQNLGVSVTPIQRIVLSGENGLKIDIHVIIPYEGLSGDQMGQIEKIFKVVYPVDDHHFKVILHYGTLVIDGVTPNMIDYFGRPYEGIAVFDGKKITVTGTLWNGNKIIDERLINPDGSLLFRVTINGVTGWRLCERILA 174 T 0.0086 Lipocalin_7 pdbhh F Eukaryota T 7snt 1 A,B A,B LUCI_OPLGR 19KOLASE SDNMVFTLEDFVGDWRQTAGYNLDQVLEQGGVSSLFQNLGVSVTPIQRIVLSGENGLKIDIHVIIPYEGLSGDQMGQIEKIFKVVYPVDDHHFKVILHYGTLVIDGVTPNMIDYFGRPYEGIAVFDGKKITVTGTLWNGNKIIDERLINPDGSLLFRVTINGVTGWRLCERILA 174 T 0.0086 Lipocalin_7 pdbhh F Eukaryota T 7snv 3 C C CSOCA_HALNC CSOSCA,CARBONIC ANHYDRASE,CA,CARBOXYSOME SHELL PROTEIN CSOS3 MNTRNTRSKQRAPFGVSSSVKPRLDLIEQAPNPAYDRHPACITLPERTCR 50 T 3.3 zf-LYAR pdbhh F Bacteria T 7snw 1 A,B,C A,B,C LUCI_OPLGR 19KOLASE SDNMVFTLEDFVGDWRQTAGYNLDQVLEQGGVSSLFQNLGVSVTPIQRIVLSGENGLKIDIHVIIPYEGLSGDQMGQIEKIFKVVYPVDDHHFKVILHYGTLVIDGVTPNMIDYFGRPYEGIAVFDGKKITVTGTLWNGNKIIDERLINPDGSLLFRVTINGVTGWRLCERILA 174 T 0.0086 Lipocalin_7 pdbhh F Eukaryota T 7snx 1 A A LUCI_OPLGR 19KOLASE SDNMVFTLEDFVGDWEQTAAYNLDQVLEQGGVSSLLQNLAVSVTPIQRIVRSGENALKIDIHVIIPYEGLSADQMAQIEEVFKVVYPVDDHHFKVILPYGTLVIDGVTPNMLNYFGRPYEGIAVFDGKKITVTGTLWNGNKIIDERLITPDGSMLFRVTINSA 163 T 0.026 DUF4950 unp F Eukaryota T 7snx 2 B B LUCI_OPLGR 19KOLASE XVTGYRLFEEIL 12 T 0.24 Lipocalin_7 unphh F Eukaryota T 7sny 1 A A LUCI_OPLGR 19KOLASE MVFTLEDFVGDWEQTAAYNLDQVLEQGGVSSLLQNLAVSVTPIQRIVRSGENALKIDIHVIIPYEGLSADQMAQIEEVFKVVYPVDDHHFKVILPYGTLVIDGVTPNMLNYFGRPYEGIAVFDGKKITVTGTLWNGNKIIDERLITPDGSMLFRVTINSHHHHHH 165 T 0.026 DUF4950 unp F Eukaryota T 7som 4 VE,WE,XE e,f,g Unknown protein MEGAAGPSGFRNVEPLSRQERAAARDKDLLEKSRLQARNRGGPLKQPENVVGNPVMPARNAPAFCDEYDRFNRDVAGEMNAKKQQNLQKKEEVYAVKRAEQYHRERSNWETQAQAAAREAARLEASRTTGTGAKRNQGSESYNIISLNYNNSSGGQQLAAKDTAVKEARQARAVNLYSKSHSVSHNIITGEPIKFPTAGKE 201 T 0.36 VIR_N pdbpssm F T 7som 6 BF,CF,JF k,l,s Unknown protein MPPQLGREVQERVKVYGPLNELTYEGRLLTQTLQDELNRSISAPAGPRSPWYEGDPELESMRERVRQQRAIREAQRRRDHAALTASIQKRNLQEEQRRDAMLGSLLGDVIGGLTDPNSPLAEAEAALSHADKVRRKKKESLHNEWSTQVFDTIQGRLQAAVDARDPAAIESRLKTQYDQYLHTTNTKVAVFRDVIIEQDYNPLAAADAAIRVPTGDIRDPLKRDVLKGEYERRLMTGGRGGGGASPTGRGGAAAAGAGSIYGPLGKETLGTQQWGELAVKATPYGHCTDGQGGYVARPLSGSAVALRASRVPMDHYDYPVGNAAAAAEVPPGKRIVPGPEQRRGRQDLFDVVQHTVHLKPQGYTGGDQWLEHKGKGNAPGPEQRRGRRDLADVLQQKAVADGPRGTSAPARGDQLQHKEQGDAWLDAKGKRRVEGPEMRRGRQGLYETLQQTSNPYQGGNKVGDAWLEHKGRKVQPRPEPEAAAALSAVPPLPTVRPPRVGDDKKYAVNIEAAMGQMTVKDGAKVTGW 528 T 0.42 Histone pdbpssm F T 7som 7 DF,EF,FF m,n,o A0A2K3DLJ2_CHLRE FAP65 MSERPHQSGPRSWAEDCNQYRTTRGSKSYSTLEDAGRVPERYSRTNYVPFLAERHPLYSYNDLGEDGKGKVRLDPATQADRFSRHGWGDVSLLKQEGLAGQPHPRSSEAGPTRSGRLRPGSPRGADGRNGLYGVLQMTEAGGTDSWVGHPQIDPTKGKRAVAPPPDPKGRRDLFDVLHARSPGMPADDSWLGHQKIDPARGKAHPPGPEQSRGRRDLTELFTMNILHDPRRLELLQKGADKHGDAWCGNILIDPARGKKPVEDVAAAGQNLHGATFKPLPAGTPLPDAPRRHTRPAPAPASDAYAAEVIRGEAAGDDWGPRTKRSVPDMPKPNAFDGRTDLYAHMQYRPLSNGEQGKYAKAFDDRGTRGRRQLHTPGDADPAKEALLTWKPEMRVGQFVKNGGLAQENRVRGHTLRATAGR 421 T 12 zf-C2H2_7 pdbhh F Eukaryota T 7som 8 GF,HF,IF p,q,r A0A2K3DKW3_CHLRE FAP70 MFRQEEQPKTGVRQFGTHTTGKVDHMLGTHATVRPEYKDPPPKRTVPTSQLEAVRNIETQYIKARKAAEDARERQGTSHLYAAGKGWGH 89 T 14 DUF6422 pdbhh F Eukaryota T 7som 16 NG A1 FAP239 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 48 F F F 7som 17 OG,QG A2,A4 FAP388 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 77 F F F 7som 18 PG A3 FAP424 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 47 F F F 7sq1 2 B,D,H C,E,G Q2N0S6_9HIV1 ENV POLYPROTEIN ENLWVTVYYGVPVWKDAETTLFCASDAKAYETEKHNVWATHACVPTDPNPQEIHLENVTEEFNMWKNNMVEQMHEDIISLWDQSLKPCVKLTPLCVTLQCTNVTNNITDDMRGELKNCSFNMTTELRDKKQKVYSLFYRLDVVQINENQGNRSNNSNKEYRLINCNTSAITQACPKVSFEPIPIHYCAPAGFAILKCKDKKFNGTGPCPSVSTVQCTHGIKPVVSTQLLLNGSLAEEEVIIRSENITNNAKNILVQLNTPVQINCTRPNNNTVKSIRIGPGQAFYYTGDIIGDIRQAHCNVSKATWNETLGKVVKQLRKHFGNNTIIRFAQSSGGDLEVTTHSFNCGGEFFYCNTSGLFNSTWISNTSVQGSNSTGSNDSITLPCRIKQIINMWQRIGQAMYAPPIQGVIRCVSNITGLILTRDGGSTNSTTETFRPGGGDMRDNWRSELYKYKVVKIEPLGVAPTRCKRRV 472 T 3.6999999999999997E-54 GP120 pdbpercent T Viruses T 7sq3 1 A A Designed trefoil knot protein, variant 1 GSSMGSDEQRRELEEKIKFKLAELASKSEEERKEIKLRVIAYVLVQLEDLQKNLSDEQRRELEEKIKFKLAELASKSEEERKEIKLRVIAYVLVQLEDLQKNLSDEQRRELEEKIKFKLAELASKSEEERKEIKLRVIAYVLVQLEDLQKNLS 153 T 0.071 CID pdbpercent F T 7sq4 1 A A Designed trefoil knot protein, variant 2 GSSMGSDEQRRELEEKIKWKLAELASKSEEERKEIKLRVIAYVLVQLEDLQKNLSDEQRRELEEKIKWKLAELASKSEEERKEIKLRVIAYVLVQLEDLQKNLSDEQRRELEEKIKWKLAELASKSEEERKEIKLRVIAYVLVQLEDLQKNLS 153 T 0.19 Colicin pdbpssm F T 7sq5 1 A A Designed trefoil knot protein, variant 3 GSSMGSDEQRRELEEKIKLKLEELKTKSEEERKEIKLRVIAYVLVQLEDLQKNLSDEQRRELEEKIKAKLEELKTKSEEERKEIKLRVIAYVLVQLEDLQKNLSDEQRRELEEKIKWKLEELKTKSEEERKEIKLRVIAYVLVQLEDLQKNLS 153 T 0.039 DUF4854 pdbpercent F T 7sqa 2 C,D C,D NCOR2_HUMAN N-COR2,CTG REPEAT PROTEIN 26,SMAP270,SILENCING MEDIATOR OF RETINOIC ACID AND THYROID HORMONE RECEPTOR,SMRT,T3 RECEPTOR-ASSOCIATING FACTOR,TRAC,THYROID-,RETINOIC-ACID-RECEPTOR-ASSOCIATED COREPRESSOR TNMGLEAIIRKALMGKYDQWEE 22 T 3.5 RHH_7 pdbhh F Eukaryota T 7sqc 3 AA,BA,CA,DA,EA,FA,GA,HA,IA,JA,Q,R,S,T,U,V,W,X,Y,Z 1P,1Q,1R,1S,1T,1U,1V,1W,1X,1Y,1F,1G,1H,1I,1J,1K,1L,1M,1N,1O A8IVW2_CHLRE FLAGELLA ASSOCIATED PROTEIN MPSPKRSGLGSGRLIGGQASTTSLGSPGAGTSQFQINHENTLRKNRNHFQGQIEQYSIDYHSSHNKALAELPVLQEQHALEIEEYQNAIEETVTRHLGRIAQLRQDYSLLIKEASRRHMELQQRRFAGARVDIPQVAQQPIAGLPALSAQPQIVAPSAVPNGSLAASRSGNVSTSSEQAGSHAMGQVNPNAYLRTLNNDAAAAMSQYNTRPLPATPGGIAALSISTPASPLTARLATPSEATARNSDFARLEAMHEKYMGRTKSQHEQGIAARINEVSNWQQTFAACMAALAQHQSSALAHLSSCVEEAEAWHAQQRGALSAAHDEVMTEAERLSRQLEAAQTAAAAKLNGLLASFLERVLPSGEVGLAEAQASYTRSAANLRNEHVAALEAAEAALRALVPRHAAMRQHMSASYSSGLASHEAALAAAGGHYDSRGIPELRRQYQDAESRHRDTLAAIRAEHLKGLAGSRDGWMGEAAALLEEYRARMQELKQQYMLAYDVNLTEV 507 T 0.022 AAA_13 pdb F Eukaryota T 7sqc 4 KA,LA 2A,2B A0A2K3DQM4_CHLRE FAP81 MSSAHILSTFQSTFPGLYQAPKKGEDEPPPEAPAPEPVTQHDDEPDQYSTRIAGITSKFERMRASADEMEQYLRSAAEDAKEAEARALAKADEDFTPAWRNVGLPLKPSHLHLDHGAMAGARLVNPKAIVQEYQAIKGREVLNPPRIAEYADTSAKPNYLKSTHAMEERKMRTMSPDRTARIQALSARHLAWQTLTPEEVAAKMEEAEQRRRQLGLKMPRAQFELEQKIIQSMHHKLTFLRNPRHPLPPAVKTLMELRPDANRWVGPRTTVLEGVKPPIKADMTSRPDQVFVVEPAEVTFTNYAVGRAYEQVVRVRNVTAVSRSLRIFPPASQYFHASLPRFPGEVGVLAPGMAAEVTLRFCPDSLGDYEDAIAVDATHSRQTVPLRARRPPPSLTLPEEIDMGQVVIGNVKTEQVTFKNMGGAGRFRIVPEAHWPDFAMDAPTDRAVVGQFKIWPLYFEMAAGEQLGLNVSYEPTEWGNTEERLVLVCDNCQVKTFSLSGNAVGVDVLLHSVDGRMLEPRELDLPLWFGECAPGAGFSKTVSVRNTTKLPFAFEWGLTKFPQVQNRRRANEPLQTEAQYDEEQDDEGHVLLVDNKSLRGTSPLRLGTGGGGAAAAPPAVGGGAGGGAGGGAGGVSSSAAGVNGSVAQAPGGGVAGAKPPGPMKALAGAENAGTPWGVHCGNEAHGPDPLALGAVVEDLFRVVPRSGVLQPGEVMEFLVTFTPPGQARYERWAQLRVDRKPISVPSGASPMVRGSGSGTGRSHAAIAASATCDVLVAEVGLEGLGCPVQLSAAPRLVSLPGKLMPAEGTTRHVTLRNPTRAQVVVRATVDNPAIAVSPSEFRMPSLGAISLAVTVRAPPDAAPGPLSGRVLLEVEHGPPVPIEVRAAVGSSYARLITPRINFGDVPLSGSSEQRLVIRNMSATCPTPWSIRELTPALVAAEKSRLLRAQLLQSSRMLDPQAAAAAIDALVQEEEEAEAAAAAREEEEAVGYRGASVTGHHARFAEAQRSPAASTSGALVALPASAAERRAVFAAGRHTDSSLAQALPPPDTTHVTFEPSSGVLEPNQELTVRVTCHALTDGRHRSIIQLRSGAPHAGGGPDGGLHMECLEAFACVVTPACVVDRPVMDLGVTFVGVQVRQTLYLTNLSQLPVLYRWTAEAEDEGSQTAGLAELKIKPDHGELEPGEDVEIQVRYTPRYPGPCVMYGVCELEGAPEPLGFRVSSAIHGLDVTYDLLTQEQYDDYMAHDQAAAALGSTGPKGAAAMAVAAAAAGGANTGSGGYYDSESGEVAGAGGEGGGPSRAEEIAAALDKVNFMRVGRHGKAEVDVEAFSGLTHLPPDVLSRVASHRHASTSAAGSKAASRRASARPGAQARPRSGAAAGGGVAAWPPPTPQRHLVADFGHNVPLGETRQMYLVVTNRTAMHTSIRTWLERFGVADASRFVRGTESGAAPPGGGADKAGGAEGGAGKGGASRRQTKDEDAHHPQGPKLSRYSKYTPIKLAGTDAEHRAPFRADKGNEMMATRRLQEEADEALGNKGLAVSVTPPESTLEPWSRLVLTVSCFNDMCGAYMDMMHVKVGDLPARDIPVLVGVSGTPLVVQRERVLVRGLRARSWRTDLEWGQVPQGVEQTRTFYVFNTGSLDMHLAWEARRYHDYVDLERLPPTTDVGAGGTGQHDSLWGGAPGTLRDTRAGMKLFDVKLQPDERAGCVRLATERHADPTDDVPFRVEPEEQVIKGNTTAKFTVTFCASESRRHGGYLHGTQRVFSPESPLELRVWTAGENADRVGALLSGTFHPYAGAPPTPLQPLRVDLGAQAQACRLEPDGQTDLSWVVTSIQQPGSHAAFVRSVTLSNTAHCPQVFSLDVEGPWDMVAASPSVPQDPVAYRGTSTLLGPAAASGRLGTSAADGGLTFLPPGESVDVTLRFSPGKGDMEALPVRFAAMQAKQRVIETVNDYKNTGALCITFANGDSQSLPLVAEMLHPRLEVKPRKLDFKKVHLQSPKEMFVMLSNPTNVDAAWAVTVEGHKPRFPTLPGAGAAASAAAAKEAKEEAAAAAAAAGGGGSASAQPTPRSASGGNLAGEASAPDSRAASAVSGASRPATVDGGAAGAAAPAGGVPPPPKLPGAGGPGTLPGVTGIIAEARIGPYVVKPASGVLSGRGLGMPRSQRISITFAPTEAEAYEGELIFAVLRGKQCSVDVDGEGSIEETDETKGNLFVI 2215 T 0.0019 PapD-like pdbhh F Eukaryota T 7sqc 6 OA,PA 2H,2I A0A2K3E6N2_CHLRE FAP297 MPPLPTLWDPAGAVDKLPQPFRMIDKILADIVEQVVDMIGTRESQRRAEDASRVDLTFAPHAVMEVAPETCCFLPIGIAGIAAVAMPDGEVQVRSARDPSICFSDRTHTSPVTAMEAGMAVCTSGRLLAAASRTALTLHEVDIKTCEISLLATVPLPADPDDTSPPVRLHWSDNLGHLAVCRRSGALALFTLSLPPPNVSLESVGAFVALSLKAFGETRVEELLRVPAATVRAFCLGSAAVGPSNLVWQLQQRPDKSPHSDRYHKSARGAYVWWEGANRLLLLDFEAAAGAAAAAGGAGGAVVPPPEVEAAVKAGAAGTSKPSTPATKAASSKSVAPPGEASSLLTASASAAAAAYAGPERPPEVPQIAPMARDWLLPHDVTAAATTSDHKTMAWGLADGSVVIWDDRSCCSTKVLPRLKGGITALSWVNGVAHKLVCASAGGHIFIADVIKPEDSSQKPYEFPQAIHEVHTLPNEPFALCICRGHTSSDGEIASTHRGGRGGPSILSTMSGHGPGGGGHDVMRVPPQRPRVFWYNVLEEKPVAELMGPRAEQGFGLACCVPPPPSLAPPPPRPDTAATDAGAADGKSAAASAAATPAPGAAPKGKGGAAAPAPPSGGGGGGAAPAASQEAPSEPAMSEAQRALIKAALDAMGANSTTPVLVPVKLHVPAGQVVSYPACVFRDTYLLAGGDVVDKVVRSNFADDDAVPQRVTQLYMYKVDALLRHLLPEDESTSRLGKVVLDRLLADLEAPKMNRKKGKRVKMDVEDPDAPKLSSAMRKPDPFDTTGSRPGSRAANRHITFGGGDADGLFDDVLEGGRAKPKKETKVFPKETKKGLKTGPGDVKMIDKNASGRPRLAPLDLEKAKEAPLPFSERSQSPPWHHTNPLARIHPDWEEAPVLVRIMDRIGSKGGGRKRRDKRLEALTTELMTKYSKEAGAKPNLLVPT 945 T 0.0003 Lgl_C pdbhh F Eukaryota T 7sqc 14 NC,OC,PC,QC 8A,8B,8C,8D A8IKV8_CHLRE FAP105 MSGVRACLQPNEGPACPIGQTYGEVGGSSPNFRGNFCDAGRKHVSGPASELATQGFSRWEQSSGNRDPPLRRHAEQPASTSYAEPSVMGGHASYYGRRAVGEGEDGTAYRRTMKAVPQPVRQDRPEGKKAIPEPYGAPPAHPRGTRPPPDDVRFREAESAPEPNLPRLGRPDGISGLRESGDQQYQFESSLGRKIRVGQDSYRGAGRAGDRSLVYGAPARTEDDPTYFRSMKDSPTFTRFCNSLPAKPAVSPHQRREEGMRRAAEEERRREAALVSTLDIQGVPDDD 287 T 0.41 BRD4_CDT pdb F Eukaryota T 7sqc 16 AD,BD,XC,YC,ZC A3,A4,A0,A1,A2 FAP219 MDDTGSVIDDLPPPSPNRSLVATPTPAIPGRTGKLDYGLHESAMRVPVLDKVKEARKATIESKPDILGVRGPVWNETVALNPGKHHGKFSHNLLSNTLTPELINSTDVRKLTGTTAARGDPAAATLDRSLSPAAGGPSGWNTSTTLPSSNDRQRQLEAGLNASLAATARRRASPTPHYVDPVARQTAYSETIRAIKANSGADMGELTARYGPDGAEAMAAILAMPAKESRPRIRTTRADLQAVAALDAFSAGRDDAEEEEQGEAVPLSSPMPGAER 276 T 0.075 Lipase_chap pdbpssm F T 7sqc 17 CD,DD,ED B0,B1,B2 CFA99_CHLRE CILIA- AND FLAGELLA-ASSOCIATED PROTEIN 99 MESRPGSASSYHSHAAGNTQSRPGSATSVESSNRLSGKPLGAEKLMAACEKIYKSFNPAQVTLDTHVDNCIGQLSVHNSFDDSFIRQVVYGTVRYRRLLGALMDSFYHYNGGAASREDVDMYKLYSYLTIFRLEELNFSNFQRLVDAMTPQKMYVLLKYMFNPQYMREVVREDWLKLYDKEFVDEIIDRLLSWKSESKEMLGRLEYKVTLSRKEDDEKETLGTGYAAARSTTVPQPFNLTQPKPRPLPVEDPPPPPIRAKPAPRPREGPTKEEVALAAAREANRAAAERKAAKAAPFKLRVLERPTNIDKIREELEAERTRELTFKGIRAAPPPPVPNAQVRLNAAAILREDALYRRKQQEEADALKRYEAELRDASSFKAWQNAMLEQDEAARAASVERRRQEMAAAQENAIRARMAAQEANAELARAAKEEAKRIEEDLKRDREEQARLNALRRDAVVEARQNVQAAVEKMSQERRLAAEEERRKQQEDARARAEVAAREMAERRDIILQLKALEKVPKQRVKEFDPTETGPDHGLLETMSLVELRERLNVAKRRQREEEERQRAEILRQKQERESALLEKAANIQRVRRVAAAQAAQRRATSAETIQRKNTEVSKAREADVLQLADKLDAKRAALAAERARLAAEQKRTRFEQMQAAAGAAVVEETKFRELRAGAQREAKTRQENALASATVYEATKARQQNVRLKNVRQELKAKDDFMRAYDEKLAALRGQAGAESAADLARRTQMAQTQRAAEATVRNRTTTTAYRPYEGGSTSMQARLAALGQGMELED 795 T 0.038 DUF1948 pdbhh F Eukaryota T 7sqc 20 SD,TD,UD,VD E0,E1,E2,E3 A0A2K3DQN7_CHLRE FAP108 MPLYFEEVAPDPKAKKERDAKQQRPAILVERKGPPPAPMHLESQVIPTLIRKVGDWKTGRISQAMCEAYLDRHTLVFDRELLTKLFKEADYQKEGSLDTRALTIAIAGRFPKREHTPEWRLLTALLLGLPELVLTTDAEVTTLRTTHERPVGGGTYNSGNFWDSPPPPLPPVRRRTGSGRSTVGKVTAHEPSPEWLDTLNRTAAAASMSAGGSPSASMAGSFAGAASLNASMLRTGSVGAMDPGGAGVVGTTGGLKQTTQIADEARLNAALMGGAASTFATQREFADWSRGLEVMPRLAADTAGPGPGSEFGGGVRTATHLGSPKAPVRVWAAPLPPSAISLPSSALRTLRETVRSTASTKPDFVKGVKPLDSHELDLKKTLGEPLDVGMSLARVEPVRDTKVLPNADYVTWGDYAANCRTGPTGWYSKHPTAQAQDTGEHKYPWC 446 T 0.61 EF-hand_8 pdbhh F Eukaryota T 7sqc 30 OI,PI,QI,RI S0,S1,S2,S3 A8J870_CHLRE FLAGELLAR ASSOCIATED PROTEIN MAAKGKQQWDFLKADANTPASPAHYYEPLNAKKEGEFKPGWNTKRRGPAWEAERQAAIMTKEQKNIGCVALRSERLNNAQQQSGFNPIAHTERAADGSWVPATNAWMHQKVGVKQQDPRAAAADTLKHQAEGASRAAAIAEMRKERIAAGGASRPAAGGGVKDALTWG 168 T 0.17 Nop25 pdbpssm F Eukaryota T 7sqc 31 SI S5 Unknown protein XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 38 F F F 7sqc 36 LL,ML,NL,OL W3,W4,W5,W6 Unknown protein XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 68 F F F 7sqc 37 PL,RL W7,W9 Unknown protein XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 149 F F F 7sqc 38 QL W8 Unknown protein XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 51 F F F 7sqk 1 A A HAUS1_HUMAN COILED-COIL DOMAIN-CONTAINING PROTEIN 5,ENHANCER OF INVASION-CLUSTER,HEI-C MEPQEERETQVAAWLKKIFGDHPIPQYEVNPRTTEILHHLSERNRVRDRDVYLVIEDLKQKASEYESEAKYLQDLLMESVNFSPANLSSTGSRYLNALVDSAVALETKDTSLASFIPAVNDLTSDLFRTKSKSEEIKIELEKLEKNLTATLVLEKCLQEDVKKAELHLSTERAKVDNRRQNMDFLKAKSEEFRFGIKAAEEQLSARGMDASLSHQSLVALSEKLARLKQQTIPLKKKLESYLDLMPNPSLAQVKIEEAKRELDSIEAELTRRVDMMEL 278 T 0.00056 DUF3496 pdb F Eukaryota T 7sqq 1 A,B,C,D,E,F,G,H,I,J,K,L,M,N,O,P,Q,R,S,T,U,V,W,X A,B,C,D,E,F,G,H,I,J,K,L,M,N,O,P,Q,R,S,T,V,W,X,Y GP105_BP201 Chimallin SNAMIRDTATNTTQTQAAPQQAPAQQFTQAPQEKPMQSTQSQPTPSYAGTGGINSQFTRSGNVQGGDARASEALTVFTRLKEQAVAQQDLADDFSILRFDRDQHQVGWSSLVIAKQISLNGQPVIAVRPLILPNNSIELPKRKTNIVNGMQTDVIESDIDVGTVFSAQYFNRLSTYVQNTLGKPGAKVVLAGPFPIPADLVLKDSELQLRNLLIKSVNACDDILALHSGERPFTIAGLKGQQGETLAAKVDIRTQPLHDTVGNPIRADIVVTTQRVRRNGQQENEFYETDVKLNQVAMFTNLERTPQAQAQTLFPNQQQVATPAPWVASVVITDVRNADGIQANTPEMYWFALSNAFRSTHGHAWARPFLPMTGVAKDMKDIGALGWMSALRNRIDTKAANFDDAQFGQLMLSQVQPNPVFQIDLNRMGETAQMDSLQLDAAGGPNAQKAAATIIRQINNLGGGGFERFFDHTTQPILERTGQVIDLGNWFDGDEKRDRRDLDNLAALNAAEGNENEFWGFYGAQLNPNLHPDLRNRQSRNYDRQYLGSTVTYTGKAERCTYNAKFIEALDRYLAEAGLQITMDNTSVLNSGQRFMGNSVIGNNMVSGQAQVHSAYAGTQGFNTQYQTGPSSFY 634 T 6.2 TGBp3 pdbhh T Viruses T 7sqr 1 A,B,C,D,E,F,G,H,I,J,K,L A,B,C,D,E,F,G,H,I,J,K,L GP105_BP201 Chimallin SNAMIRDTATNTTQTQAAPQQAPAQQFTQAPQEKPMQSTQSQPTPSYAGTGGINSQFTRSGNVQGGDARASEALTVFTRLKEQAVAQQDLADDFSILRFDRDQHQVGWSSLVIAKQISLNGQPVIAVRPLILPNNSIELPKRKTNIVNGMQTDVIESDIDVGTVFSAQYFNRLSTYVQNTLGKPGAKVVLAGPFPIPADLVLKDSELQLRNLLIKSVNACDDILALHSGERPFTIAGLKGQQGETLAAKVDIRTQPLHDTVGNPIRADIVVTTQRVRRNGQQENEFYETDVKLNQVAMFTNLERTPQAQAQTLFPNQQQVATPAPWVASVVITDVRNADGIQANTPEMYWFALSNAFRSTHGHAWARPFLPMTGVAKDMKDIGALGWMSALRNRIDTKAANFDDAQFGQLMLSQVQPNPVFQIDLNRMGETAQMDSLQLDAAGGPNAQKAAATIIRQINNLGGGGFERFFDHTTQPILERTGQVIDLGNWFDGDEKRDRRDLDNLAALNAAEGNENEFWGFYGAQLNPNLHPDLRNRQSRNYDRQYLGSTVTYTGKAERCTYNAKFIEALDRYLAEAGLQITMDNTSVLNSGQRFMGNSVIGNNMVSGQAQVHSAYAGTQGFNTQYQTGPSSFY 634 T 6.2 TGBp3 pdbhh T Viruses T 7sqs 1 A,B,C,D,E A,E,D,C,B GP105_BP201 Chimallin SNAMIRDTATNTTQTQAAPQQAPAQQFTQAPQEKPMQSTQSQPTPSYAGTGGINSQFTRSGNVQGGDARASEALTVFTRLKEQAVAQQDLADDFSILRFDRDQHQVGWSSLVIAKQISLNGQPVIAVRPLILPNNSIELPKRKTNIVNGMQTDVIESDIDVGTVFSAQYFNRLSTYVQNTLGKPGAKVVLAGPFPIPADLVLKDSELQLRNLLIKSVNACDDILALHSGERPFTIAGLKGQQGETLAAKVDIRTQPLHDTVGNPIRADIVVTTQRVRRNGQQENEFYETDVKLNQVAMFTNLERTPQAQAQTLFPNQQQVATPAPWVASVVITDVRNADGIQANTPEMYWFALSNAFRSTHGHAWARPFLPMTGVAKDMKDIGALGWMSALRNRIDTKAANFDDAQFGQLMLSQVQPNPVFQIDLNRMGETAQMDSLQLDAAGGPNAQKAAATIIRQINNLGGGGFERFFDHTTQPILERTGQVIDLGNWFDGDEKRDRRDLDNLAALNAAEGNENEFWGFYGAQLNPNLHPDLRNRQSRNYDRQYLGSTVTYTGKAERCTYNAKFIEALDRYLAEAGLQITMDNTSVLNSGQRFMGNSVIGNNMVSGQAQVHSAYAGTQGFNTQYQTGPSSFY 634 T 6.2 TGBp3 pdbhh T Viruses T 7sqt 1 A,B,C,D,E,F,G,H,I,J,K,L,M,N,O,P,Q,R,S,T,U,V,W,X A,B,C,D,E,F,G,H,I,J,K,L,M,N,O,P,Q,R,S,T,V,W,X,Y A0A482GDX1_9CAUD CHIMALLIN SNAMGLDVRNNGNDNVEIRAAETRTAQRADEALETAADFAGQPKVTHTMRTINRTLSRRISRNTGSEQVLNLRRLMEKYLEDTRFKDDFIFVAVDPNQYSVPYPTLVVMSGAKVGDHNHFFGYVLPLVAGLAPLPRREEQGPHGNILVPRTWVDNLNGTFINEVMAAMYAAIGGKSNGTARIAGLAVVTNEITAESAHLATTLLSAADNAIQTAIEIRLGDKLGLPQFNLGMMASDQPISSVQYNTSGMQDSDIVGNPVRSDITVTISNRIRQAMSDYDSQQRLVATTGYIDLTYSPQNPTFNQGPVLVNGYPVPPTVQYQPRYVMTSAYPLELDAFTPNTFVLGLIGTIATLNSGMAWAQSLISNAARGIGPHNPGALAMVLDPEVTAPLDLSTQTNEQIYKFLQQVLYPSLLISIDVPEEGEYSWLLRMIPAAEKIYTGKVEGEVREISEGYKALYRAFDDVTLGCFSKKYQYGLPLVYATGNRIPLGHYNHQDGHRHDIRDMDDLYMMNITNPDTVEAWEDSFDRTDMTMSQRVVARHEIIDRVLSGSWEQTGWAMRYDFDPLALQALIEAAADAGFTIRPENIQHLAGTAVRGNMAARARGLGNISGNIYARSDRPNVGVNNMGGAFNLF 634 T 0.16 PSII_Pbs31 pdbpercent T Viruses T 7squ 1 A,B,C,D,E,F,G,H,I,J,K,L A,B,C,D,E,F,G,H,I,J,K,L A0A482GDX1_9CAUD Chimallin SNAMGLDVRNNGNDNVEIRAAETRTAQRADEALETAADFAGQPKVTHTMRTINRTLSRRISRNTGSEQVLNLRRLMEKYLEDTRFKDDFIFVAVDPNQYSVPYPTLVVMSGAKVGDHNHFFGYVLPLVAGLAPLPRREEQGPHGNILVPRTWVDNLNGTFINEVMAAMYAAIGGKSNGTARIAGLAVVTNEITAESAHLATTLLSAADNAIQTAIEIRLGDKLGLPQFNLGMMASDQPISSVQYNTSGMQDSDIVGNPVRSDITVTISNRIRQAMSDYDSQQRLVATTGYIDLTYSPQNPTFNQGPVLVNGYPVPPTVQYQPRYVMTSAYPLELDAFTPNTFVLGLIGTIATLNSGMAWAQSLISNAARGIGPHNPGALAMVLDPEVTAPLDLSTQTNEQIYKFLQQVLYPSLLISIDVPEEGEYSWLLRMIPAAEKIYTGKVEGEVREISEGYKALYRAFDDVTLGCFSKKYQYGLPLVYATGNRIPLGHYNHQDGHRHDIRDMDDLYMMNITNPDTVEAWEDSFDRTDMTMSQRVVARHEIIDRVLSGSWEQTGWAMRYDFDPLALQALIEAAADAGFTIRPENIQHLAGTAVRGNMAARARGLGNISGNIYARSDRPNVGVNNMGGAFNLF 634 T 0.16 PSII_Pbs31 pdbpercent T Viruses T 7sqv 1 A,B,C,D A,C,B,D A0A482GDX1_9CAUD Chimallin SNAMGLDVRNNGNDNVEIRAAETRTAQRADEALETAADFAGQPKVTHTMRTINRTLSRRISRNTGSEQVLNLRRLMEKYLEDTRFKDDFIFVAVDPNQYSVPYPTLVVMSGAKVGDHNHFFGYVLPLVAGLAPLPRREEQGPHGNILVPRTWVDNLNGTFINEVMAAMYAAIGGKSNGTARIAGLAVVTNEITAESAHLATTLLSAADNAIQTAIEIRLGDKLGLPQFNLGMMASDQPISSVQYNTSGMQDSDIVGNPVRSDITVTISNRIRQAMSDYDSQQRLVATTGYIDLTYSPQNPTFNQGPVLVNGYPVPPTVQYQPRYVMTSAYPLELDAFTPNTFVLGLIGTIATLNSGMAWAQSLISNAARGIGPHNPGALAMVLDPEVTAPLDLSTQTNEQIYKFLQQVLYPSLLISIDVPEEGEYSWLLRMIPAAEKIYTGKVEGEVREISEGYKALYRAFDDVTLGCFSKKYQYGLPLVYATGNRIPLGHYNHQDGHRHDIRDMDDLYMMNITNPDTVEAWEDSFDRTDMTMSQRVVARHEIIDRVLSGSWEQTGWAMRYDFDPLALQALIEAAADAGFTIRPENIQHLAGTAVRGNMAARARGLGNISGNIYARSDRPNVGVNNMGGAFNLF 634 T 0.16 PSII_Pbs31 pdbpercent T Viruses T 7st7 59 GB h Viomycin XXSSXX 6 T 1500 Sid-5 pdbhh F F 7st8 3 C S ASTL_HUMAN SAS1B, OOCYTE ASTACIN,OVASTACIN,ZP2-PROTEINASE MGSSHHHHHHSSGLVPRGSHMASGPRPRGRGSHAHSTGRSPAPASLSLQRLLEALSAESRSPDPSGSSAGGQPVPAGPGESPHGWESPALKKLSAEASARQPQTLASSPRSRPGAGAPGVAQEQSWLAGVSTKPTVPSSEAGIQPVPVQGSPALPGGCVPRNHFKGMSED 170 T 240 GP63 pdbhh F Eukaryota T 7st9 7 G G DDC1_YEAST DNA damage checkpoint protein 1 MDYKDDDDKDYKDDDDKDYKDDDDKLEVLFQGPGMSFKATITESGKQNIWFRAIYVLSTIQDDIKITVTTNELIAWSMNETDTTLCQVRFQKSFFEEYEFKPHEIVFGENGVQVIEDTYGNSHKLYSFRVNGRHLTTISRKPDGDGIKSFTIAVNNTSTCPESLANRLIVVIEMDSLIVKEYCPQFQPIKYDPIIINLKYKRRFLDVFGTAASDRNPQEPLDPKLLDVFTNTERELTSALFNEEVESDIRKRNQLTAADEINYICCNSTLLKNFLDNCNVNVTDEVKLEINVHRLSITAFTKAVYGKNNDLLRNALSMSNTISTLDLEHYCLFTTIEDEKQDKRSHSKRREHMKSIIFKLKDFKNFITIGPSWKTTQDGNDNISLWFCHPGDPILMQMQKPGVKLELVEVTDSNINDDILEGKFIKTAISGSKEEAGLKDNKESCESPLKSKTALKRENLPHSVAGTRNSPLKVSYLTPDNGSTVAKTYRNNTARKLFVEEQSQSTNYEQDKRFRQASSVHMNMNREQSFDIGTTHEVACPRNESNSLKRSIADICNETEDPTQQSTFAKRADTTVTWGKALPAADDEVSCSNIDRKGMLKKEKLKHMQGLLNSQNDTSNHKKQDNKEMEDGLGLTQVEKPRGIFD 646 T 2.8E-09 Rad9 pdbpssm F Eukaryota T 7stb 7 G G DDC1_YEAST DNA damage checkpoint protein 1 MDYKDDDDKDYKDDDDKDYKDDDDKLEVLFQGPGMSFKATITESGKQNIWFRAIYVLSTIQDDIKITVTTNELIAWSMNETDTTLCQVRFQKSFFEEYEFKPHEIVFGENGVQVIEDTYGNSHKLYSFRVNGRHLTTISRKPDGDGIKSFTIAVNNTSTCPESLANRLIVVIEMDSLIVKEYCPQFQPIKYDPIIINLKYKRRFLDVFGTAASDRNPQEPLDPKLLDVFTNTERELTSALFNEEVESDIRKRNQLTAADEINYICCNSTLLKNFLDNCNVNVTDEVKLEINVHRLSITAFTKAVYGKNNDLLRNALSMSNTISTLDLEHYCLFTTIEDEKQDKRSHSKRREHMKSIIFKLKDFKNFITIGPSWKTTQDGNDNISLWFCHPGDPILMQMQKPGVKLELVEVTDSNINDDILEGKFIKTAISGSKEEAGLKDNKESCESPLKSKTALKRENLPHSVAGTRNSPLKVSYLTPDNGSTVAKTYRNNTARKLFVEEQSQSTNYEQDKRFRQASSVHMNMNREQSFDIGTTHEVACPRNESNSLKRSIADICNETEDPTQQSTFAKRADTTVTWGKALPAADDEVSCSNIDRKGMLKKEKLKHMQGLLNSQNDTSNHKKQDNKEMEDGLGLTQVEKPRGIFD 646 T 2.8E-09 Rad9 pdbpssm F Eukaryota T 7stf 2 B C KRAS G12V (7-16) VVVGAVGVGK 10 T 0.0021 AAA_18 pdbhh F F 7su9 5 E C KRAS-G12D-9mer with A18L substitution GADGVGKSL 9 T 4.8E-05 Thymidylate_kin pdbhh F T 7sua 1 A A A0A237U7Y1_ACIBA DUF4175 domain-containing protein SNARAIARPTRSEYLARINEENRLKHEIQELTQALALEKQNTVTLVAQAQQQAKAKPIVRSQPEKSLESTDQNTLALNIQFYDPKQLLSSVNQSVSVPYFKLCQLFLNKSIELCTKHYHLKATDIDVVDEFHAEGATLAISTSHPHAVECLLMVGTVFQLLSDVLYKRYREDKRFALQTRSAVCNAVEAMQIDAKEAAQRLAQHLHAKESALYLDNEQLKAIQDSYQLVAMPNPSNVMTRHAFMINGMNAECAELAQNIRTEILMGKKSIPQNDSPSSAAS 281 T 0.0013 DUF4175 unp F Bacteria T 7suk 29 CA LV NOL10_YEAST ESSENTIAL NUCLEAR PROTEIN 2 VLKSTSANDVSVYQVSGTNVSRSLPDWIAKKRKRQLKNDLEYQNRVELIQDFEFSEASNKIKVSRDGQYCMATGTYKPQIHVYDFANLSLKFDRHTDAENVDFTILSDDWTKSVHLQNDRSIQFQNKGGLHYTTRIPKFGRSLVYNKVNCDLYVGASGNELYRLNLEKGRFLNPFKLDTEGVNHVSINEVNGLLAAGTETNVVEFWDPRSRSRVSKLYLENNIDNRPFQVTTTSFRNDGLTFACGTSNGYSYIYDLRTSEPSIIKDQGYGFDIKKIIWLDNVGTENKIVTCDKRIAKIWDRLDGKAYASMEPSVDINDIEHVPGTGMFFTANESIPMHTYYIPSLGPSPRWCSFLDSITEEL 362 T 0.00022 ANAPC4_WD40 unppercent F Eukaryota T 7suk 46 WA SS UTP14_YEAST U3 small nucleolar RNA-associated protein 14 QRIQQRHDRKAAYEISRQEVSKWNDIVQQNRRADHLIFPLNKPTEHNHASAFTRTQDVPQTELQEKVDQVLQESNXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXNVIINEKVNKKNLKYQSSAVPFPFENREQYERSLRMPIGQEWTSRASHQELIKPRIMTKPGQVIDPLKAP 197 T 2.1E-40 Utp14 pdbpercent F Eukaryota T 7suk 47 XA ST NOP14_YEAST Nucleolar complex protein 14 MAGSQLKNLKAALKARGLTGQTNVKSKNKKNSKRQAKEYDREEKKKAIAEIREEFNPFEIKAARNKRRDGLPSKTADRIAVGKPGISKQIGEEQRKRAFEARKMMKNKRGGVIDKRFGERDKLLTEEEKMLERFTRERQSQSKRNANLFNLEDDEDDGDMFGDGLTHLGQSLSLEDELANDEEDFLASKRFNEDDAELQQPQRKKTKAEVMKEVIAKSKFYKQERQKAQGIMEDQIDNLDDNFEDVMSELMMTQPKKNPMEPKTDLDKEYDIKVKEXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXKEKLGKFTAVLLRHIIFLSNQNYLKNVQSFKRTQNALISILKSLSEKYNRELSEECRDYINEMQARYKKNHFDALSNGDLVFFSIIGILFSTSDQYHLVITPALILMSQFLEQIKFNSLKRIAFGAVLVRIVSQYQRISKRYIPEVVYFFQKILLTFIVEKENQEKPLDFENIRLDSYELGLPLDVDFTKKRSTIIPLHTLSTMDTEAHPVDQCVSVLLNVMESLDATISTVWKSLPAFNEIILPIQQLLSAYTSKYSDFEKPRNILNKVEKLTKFTEHIPLALQNHKPVSIPTHAPKYEENFNPDKKSYDPDRTRSEINKMKAQLKKERKFTMKEIRKDAKFEARQRIEEKNKESSDYHAKMAHIVNTINTEEGAEKNKYERERKLR 806 T 1.7E-134 Nop14 pdbpssm F Eukaryota T 7suo 2 C,D C,D NCAP_SARS2 Nucleoprotein APRITFGGPSD 11 T 7 Tymo_coat pdbhh T Viruses T 7sv7 2 B B Cystic fibrosis transmembrane conductance regulator XXXXXXXXXXXXXXXXX 17 F F F 7svd 2 B B Cystic fibrosis transmembrane conductance regulator XXXXXXXXXXXXXXXXX 17 F F F 7svr 2 B B Cystic fibrosis transmembrane conductance regulator XXXXXXXXXXXXXXXXXXX 19 F F F 7svu 4 D,F,H,J,L,N,Q,T,V a,b,c,d,e,f,h,j,k A0A979HMQ2_9CYAN TnsB-CTD IEVWDYEQLREEYGF 15 T 1.1 ODC_AZ pdbhh F Bacteria T 7svv 4 D,F,H,J,L,N,P,R,T,V a,b,c,d,e,f,g,h,i,j TnsBctd IEVWDYEQLREEYGF 15 T 1.1 ODC_AZ pdbhh F T 7swl 2 G G polyleucine LLLLLLLLLLLLLLLLLLLLLLLL 24 T 69 DAG1 pdbhh F F 7sx3 2 B B NALF1_HUMAN Transmembrane protein FAM155A MTRGAWMCRQYDDGLKIWLAAPRENEKPFIDSERAQKWRLSLASLLFFTVLLSDHLWFCAEAKLTRARDKEHQQQQRQQQQQQQQQRQRQQQQQQRRQQEPSWPALLASMGESSPAAQAHRLLSASSSPTLPPSPGDGGGGGGKGNRGKDDRGKALFLGNSAKPVWRLETCYPQGASSGQCFTVENADAVCARNWSRGAAGGDGQEVRSKHPTPLWNLSDFYLSFCNSYTLWELFSGLSSPNTLNCSLDVVLKEGGEMTTCRQCVEAYQDYDHHAQEKYEEFESVLHKYLQSEEYSVKSCPEDCKIVYKAWLCSQYFEVTQFNCRKTIPCKQYCLEVQTRCPFILPDNDEVIYGGLSSFICTGLYETFLTNDEPECCDVRREEKSNNPSKGTVEKSGSCHRTSLTVSSATRLCNSRLKLCVLVLILLHTVLTASAAQNTAGLSFGGINTLEENSTNEEGGSGGSDYKDDDDKGNSDYKDDDDK 483 F F Eukaryota T 7sx4 2 B B NALF1_HUMAN Transmembrane protein FAM155A MTRGAWMCRQYDDGLKIWLAAPRENEKPFIDSERAQKWRLSLASLLFFTVLLSDHLWFCAEAKLTRARDKEHQQQQRQQQQQQQQQRQRQQQQQQRRQQEPSWPALLASMGESSPAAQAHRLLSASSSPTLPPSPGDGGGGGGKGNRGKDDRGKALFLGNSAKPVWRLETCYPQGASSGQCFTVENADAVCARNWSRGAAGGDGQEVRSKHPTPLWNLSDFYLSFCNSYTLWELFSGLSSPNTLNCSLDVVLKEGGEMTTCRQCVEAYQDYDHHAQEKYEEFESVLHKYLQSEEYSVKSCPEDCKIVYKAWLCSQYFEVTQFNCRKTIPCKQYCLEVQTRCPFILPDNDEVIYGGLSSFICTGLYETFLTNDEPECCDVRREEKSNNPSKGTVEKSGSCHRTSLTVSSATRLCNSRLKLCVLVLILLHTVLTASAAQNTAGLSFGGINTLEENSTNEEGGSGGSDYKDDDDKGNSDYKDDDDK 483 F F Eukaryota T 7sxb 1 A A A0A2D1LW19_HELBK Transforming growth factor mimic GSGTGCPPLPDDGIVFYEYYGYAGDRHTVGPVVTKDSSGNYPSPTHARRRCRALSQEADPGEFVAICYKSGTTGESHWEYYKNIGKCPDP 90 T 32 DUF5678 pdbhh F Eukaryota T 7sxf 2 B B Axin peptide LLPQKFAEELIHRLEAV 17 T 4.5 CAP_N pdbhh F T 7sxg 2 B B Axin peptide LLPQKFAEELIHRLEAVQ 18 T 4.4 CAP_N pdbhh F T 7sxh 2 B B axin peptide PQKFAEELIHRLEAVQ 16 T 3 CAP_N pdbhh F T 7sxi 1 A A SDS3_MOUSE SUPPRESSOR OF DEFECTIVE SILENCING 3 PROTEIN HOMOLOG SNAQRFEARIEDGKLYYDKRWYHKSQAIYLESKDNQKLSCVISSVGANEIWVRKTSDSTKMRIYVGQLQRGLFVIRRRS 79 T 0.044 Fascin pdb F Eukaryota T 7sxj 2 B B axin peptide EPQKFAEELIHRLEAVQ 17 T 6.9 DUF1690 pdbhh F T 7sxk 1 A,B,C,D,E,F,G,H,I,J,K,L b,a,l,k,j,i,h,c,d,e,f,g Q8H9R8_9CAUD Portal protein MAKRRKIKPMDDEQVLRHLDQLVNDALDFNSSELSKQRSEALKYYFGEPFGNERPGKSGIVSRDVQETVDWIMPSLMKVFTSGGQVVKYEPDTAEDVEQAEQETEYVNYLFMRKNEGFKVMFDWFQDTLMMKTGVVKVYVEEVLKPTFERFSGLSEDMVADILSDPDTSILAQSVDDDGTYTIKIRKDKKKREIKVLCVKPENFLVDRLATCIDDARFLCHREKYTVSDLRLLGVPEDVIEELPYDEYEFSDSQPERLVRDNFDMTGQLQYNSGDDAEANREVWASECYTLLDVDGDGISELRRILYVGDYIISNEPWDCRPFADLNAYRIAHKFHGMSVYDKIRDIQEIRSVLMRNIMDNIYRTNQGRSVVLDGQVNLEDLLTNEAAGIVRVKSMNSITPLETPQLSGEVYGMLDRLEADRGKRTGITDRTRGLDQNTLHSNQAAMSVNQLMTAAEQQIDLIARMFAETGVKRLFQLLHDHAIKYQNQEEVFQLRGKWVAVNPANWRERSDLTVTVGIGNMNKDQQMLHLMRIWEMAQAVVGGGGLGVLVSEQNLYNILKEVTENAGYKDPDRFWTNPNSPEALQAKAIREQKEAQPKPEDIKAQADAQRAQSDALAKQAEAQMKQVEAQIRLAEIELKKQEAVLQQREMALKEAELQLERDRFTWERARNEAEYHLEATQARAAYIGDGKVPETKKPTKAVRR 705 T 0.044 Sema4F_C pdbpercent T Viruses T 7sxn 1 A AAA Orb2A residues 1-9 MYNKFVNFI MYNKFVNFI 9 T 0.12 DUF5505 pdbhh F T 7sxo 2 G G endogenous substrate XXXXXXXXXXXX 12 F F F 7sya 1 A,B,C,D,E,F,G,H,I,J,K,L a,b,c,d,e,f,g,h,i,j,k,l Q8H9R8_9CAUD Portal protein MAKRRKIKPMDDEQVLRHLDQLVNDALDFNSSELSKQRSEALKYYFGEPFGNERPGKSGIVSRDVQETVDWIMPSLMKVFTSGGQVVKYEPDTAEDVEQAEQETEYVNYLFMRKNEGFKVMFDWFQDTLMMKTGVVKVYVEEVLKPTFERFSGLSEDMVADILSDPDTSILAQSVDDDGTYTIKIRKDKKKREIKVLCVKPENFLVDRLATCIDDARFLCHREKYTVSDLRLLGVPEDVIEELPYDEYEFSDSQPERLVRDNFDMTGQLQYNSGDDAEANREVWASECYTLLDVDGDGISELRRILYVGDYIISNEPWDCRPFADLNAYRIAHKFHGMSVYDKIRDIQEIRSVLMRNIMDNIYRTNQGRSVVLDGQVNLEDLLTNEAAGIVRVKSMNSITPLETPQLSGEVYGMLDRLEADRGKRTGITDRTRGLDQNTLHSNQAAMSVNQLMTAAEQQIDLIARMFAETGVKRLFQLLHDHAIKYQNQEEVFQLRGKWVAVNPANWRERSDLTVTVGIGNMNKDQQMLHLMRIWEMAQAVVGGGGLGVLVSEQNLYNILKEVTENAGYKDPDRFWTNPNSPEALQAKAIREQKEAQPKPEDIKAQADAQRAQSDALAKQAEAQMKQVEAQIRLAEIELKKQEAVLQQREMALKEAELQLERDRFTWERARNEAEYHLEATQARAAYIGDGKVPETKKPTKAVRR 705 T 0.044 Sema4F_C pdbpercent T Viruses T 7sz4 1 A,B,C,D,E,F,G,H,I,J,K,L k,j,i,h,l,a,b,c,d,e,f,g Q8H9R8_9CAUD Portal protein MAKRRKIKPMDDEQVLRHLDQLVNDALDFNSSELSKQRSEALKYYFGEPFGNERPGKSGIVSRDVQETVDWIMPSLMKVFTSGGQVVKYEPDTAEDVEQAEQETEYVNYLFMRKNEGFKVMFDWFQDTLMMKTGVVKVYVEEVLKPTFERFSGLSEDMVADILSDPDTSILAQSVDDDGTYTIKIRKDKKKREIKVLCVKPENFLVDRLATCIDDARFLCHREKYTVSDLRLLGVPEDVIEELPYDEYEFSDSQPERLVRDNFDMTGQLQYNSGDDAEANREVWASECYTLLDVDGDGISELRRILYVGDYIISNEPWDCRPFADLNAYRIAHKFHGMSVYDKIRDIQEIRSVLMRNIMDNIYRTNQGRSVVLDGQVNLEDLLTNEAAGIVRVKSMNSITPLETPQLSGEVYGMLDRLEADRGKRTGITDRTRGLDQNTLHSNQAAMSVNQLMTAAEQQIDLIARMFAETGVKRLFQLLHDHAIKYQNQEEVFQLRGKWVAVNPANWRERSDLTVTVGIGNMNKDQQMLHLMRIWEMAQAVVGGGGLGVLVSEQNLYNILKEVTENAGYKDPDRFWTNPNSPEALQAKAIREQKEAQPKPEDIKAQADAQRAQSDALAKQAEAQMKQVEAQIRLAEIELKKQEAVLQQREMALKEAELQLERDRFTWERARNEAEYHLEATQARAAYIGDGKVPETKKPTKAVRR 705 T 0.044 Sema4F_C pdbpercent T Viruses T 7sz6 1 A,B,C,D,E,F,G,H,I,J,K k,j,i,a,b,c,d,e,f,h,g Q8H9R8_9CAUD Portal protein MAKRRKIKPMDDEQVLRHLDQLVNDALDFNSSELSKQRSEALKYYFGEPFGNERPGKSGIVSRDVQETVDWIMPSLMKVFTSGGQVVKYEPDTAEDVEQAEQETEYVNYLFMRKNEGFKVMFDWFQDTLMMKTGVVKVYVEEVLKPTFERFSGLSEDMVADILSDPDTSILAQSVDDDGTYTIKIRKDKKKREIKVLCVKPENFLVDRLATCIDDARFLCHREKYTVSDLRLLGVPEDVIEELPYDEYEFSDSQPERLVRDNFDMTGQLQYNSGDDAEANREVWASECYTLLDVDGDGISELRRILYVGDYIISNEPWDCRPFADLNAYRIAHKFHGMSVYDKIRDIQEIRSVLMRNIMDNIYRTNQGRSVVLDGQVNLEDLLTNEAAGIVRVKSMNSITPLETPQLSGEVYGMLDRLEADRGKRTGITDRTRGLDQNTLHSNQAAMSVNQLMTAAEQQIDLIARMFAETGVKRLFQLLHDHAIKYQNQEEVFQLRGKWVAVNPANWRERSDLTVTVGIGNMNKDQQMLHLMRIWEMAQAVVGGGGLGVLVSEQNLYNILKEVTENAGYKDPDRFWTNPNSPEALQAKAIREQKEAQPKPEDIKAQADAQRAQSDALAKQAEAQMKQVEAQIRLAEIELKKQEAVLQQREMALKEAELQLERDRFTWERARNEAEYHLEATQARAAYIGDGKVPETKKPTKAVRR 705 T 0.044 Sema4F_C pdbpercent T Viruses T 7szi 2 D D A0A2X1SF68_KLEPN TraN CSGGQNTHC 9 T 1.9 DUF220 pdbhh F Bacteria T 7t0l 3 C,F C,F PHE-ARG-TYR-ASN-GLY-LEU-ILE-HIS-ARG peptide FRYNGLIHR 9 T 0.8 Ribosomal_L28e pdbhh F T 7t0o 2 D B BG505 SOSIP.664 gp140 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 39 F F F 7t0o 3 E,F F,J BG505 SOSIP.664 gp140 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 38 F F F 7t0o 6 M,N,O K,N,R BG505 SOSIP.664 gp140 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 35 F F F 7t0v 2 G G polyvaline VVVVVVVVVVVVVVVVVVVVVVV 23 T 38 RCR pdbhh F F 7t0y 2 B,D B,D RRP1B_HUMAN RRP1-LIKE PROTEIN B GKKVTFGLNRNMTAEFKKTDKSILVSPTGPSRVAFDPEQKPLHGVLK 47 T 30 PP1_bind pdbhh F Eukaryota T 7t11 5 E P Octreotide XCFXKTCX 8 T 0.0019 Urotensin_II pdbhh F F 7t1n 2 B B HEXIM Arginine Rich Motif GISYGRQLGKKKHRRRAHQ 19 T 0.02 Tat pdb F T 7t1u 2 C,D D,E Synthetic phosphopeptide EPQXEEI 7 T 7.3 WBP-1 pdbhh F F 7t26 1 A A ACB1_BPFBB Acb1 SGLYVAAKFSESTLDALEELQRSLKLPNPVPRDKLHTTIVYSRVNVPYKVASGSFEIADKGKLTVFETQSGNRALVLEMDSDYLSARHSYAKALGASYDYPDYRPHITLSYNIGVLNFSGEYKVPVVLDREYSEELDLEWSDKD 144 T 0.00041 2_5_RNA_ligase2 pdbhh T Viruses T 7t27 1 A A ACB1_BPFBB Acb1 SGLYVAAKFSESTLDALEELQRSLKLPNPVPRDKLHTTIVYSRVNVPYKVASGSFEIADKGKLTVFETQSGNRALVLEMDSDYLSARHSYAKALGASYDYPDYRPHITLSYNIGVLNFSGEYKVPVVLDREYSEELDLEWSDKD 144 T 0.00041 2_5_RNA_ligase2 pdbhh T Viruses T 7t2f 1 A,B A,B HEEH mini protein HEEH_TK_rd5_0341 SGLVPRGSHMDLEELEEDLKQALREGRKVNILGIEVTTEEQARRLIEFLRRFI 53 T 0.015 SepF pdb F T 7t2r 4 D,I D,I I4BYB2_ACEMN COENZYME F420-REDUCING HYDROGENASE, ALPHA SUBUNIT MTEVFKLEINPVTRIEGHGKITVMLDESGHVRETRFHVTQYRGFEVFTHGRDFREMPVITPRICGICPVSHHLASAKACDEILGVTITPAAHKLRELMHMGQIVQSHALSFFHLSSPDILWGFDAPVKIRNVAGLVDRYPELAKKGIMLRKFGQEIIKTLGGKKIHPWHSIPGGVNRSLTPQERDAIAAQLPEMKSIAMEAIKLIKDYLQEGGEELKEFATLDTAYMGLVRDGYLELYDGEVRIKAPRGRILDQFDPKDYLDHIGEHVEPWSYLKFPFYKALGFPHGSYRVGPLARLNAADAVSTPEASKEFALYKEMGEDGIVPYTLYYHYARLIEALYGLERIEQLLADPDITSSDLRVTSKEINPEGIGVIEAPRGTLIHHYQVNESGVITKVNLIVATGHNNFAMNKGVEMVAKKYITGTNVPEGVFNRLEHVIRAYDPCLSCSTHAVGKMPLKLELVGPTGEILKEVTRD 475 T 6.4E-19 NiFeSe_Hases pdb F Bacteria T 7t2u 2 C,E E,F NEMO_HUMAN NF-KAPPA-B ESSENTIAL MODULATOR,NEMO,FIP-3,IKB KINASE-ASSOCIATED PROTEIN 1,IKKAP1,INHIBITOR OF NUCLEAR FACTOR KAPPA-B KINASE SUBUNIT GAMMA,I-KAPPA-B KINASE SUBUNIT GAMMA,IKK-GAMMA,IKKG,IKB KINASE SUBUNIT GAMMA,NF-KAPPA-B ESSENTIAL MODIFIER KLAQLQVAYH 10 T 0.00027 Tropomyosin unppercent F Eukaryota T 7t30 4 D,I D,I I4BYB2_ACEMN COENZYME F420-REDUCING HYDROGENASE, ALPHA SUBUNIT MTEVFKLEINPVTRIEGHGKITVMLDESGHVRETRFHVTQYRGFEVFTHGRDFREMPVITPRICGICPVSHHLASAKACDEILGVTITPAAHKLRELMHMGQIVQSHALSFFHLSSPDILWGFDAPVKIRNVAGLVDRYPELAKKGIMLRKFGQEIIKTLGGKKIHPWHSIPGGVNRSLTPQERDAIAAQLPEMKSIAMEAIKLIKDYLQEGGEELKEFATLDTAYMGLVRDGYLELYDGEVRIKAPRGRILDQFDPKDYLDHIGEHVEPWSYLKFPFYKALGFPHGSYRVGPLARLNAADAVSTPEASKEFALYKEMGEDGIVPYTLYYHYARLIEALYGLERIEQLLADPDITSSDLRVTSKEINPEGIGVIEAPRGTLIHHYQVNESGVITKVNLIVATGHNNFAMNKGVEMVAKKYITGTNVPEGVFNRLEHVIRAYDPCLSCSTHAVGKMPLKLELVGPTGEILKEVTRD 475 T 6.4E-19 NiFeSe_Hases pdb F Bacteria T 7t3h 1 A A A0A1C0U7H2_9GAMM TRP-ASN-SER-ASN-VAL-HIS-SER-TYR-ARG-PHE WNSNVHSYRF 10 T 1.3 DUF5504 pdbhh F Bacteria T 7t3i 2 G G substrate peptide XXXXXXXXXXXXXXXXXXXXXXXXXXX 27 F F F 7t3j 5 J,K J,K AcrIF24 MNAIHIGPFSITPAARGLHYGGLPHHQWTLYYGPREMAIKTLPDSYTSSEVRDEFSDIIAEFVIDARHRYAPDVLELVNSDGDAVLARVAVSRLPEALSGCIPDDRFPYWLLTASRPRLGLPVTLNEYTALAVELSAPPLAWITGLLPGEVLTHDAEEWRPPTSWELRHVVGEGSFTGVSGAAAAALLGMSATNFRKYTAGDSAANRQKISFAAWHYLLDRLGVKRAS 228 T 0.00054 DUF4447 pdbhh F T 7t3k 5 J,K J,K AcrIF24 MNAIHIGPFSITPAARGLHYGGLPHHQWTLYYGPREMAIKTLPDSYTSSEVRDEFSDIIAEFVIDARHRYAPDVLELVNSDGDAVLARVAVSRLPEALSGCIPDDRFPYWLLTASRPRLGLPVTLNEYTALAVELSAPPLAWITGLLPGEVLTHDAEEWRPPTSWELRHVVGEGSFTGVSGAAAAALLGMSATNFRKYTAGDSAANRQKISFAAWHYLLDRLGVKRAS 228 T 0.00054 DUF4447 pdbhh F T 7t3l 5 J,K J,K A0A8G3G219_PSEAI AcrIF24 MNAIHIGPFSITPAARGLHYGGLPHHQWTLYYGPREMAIKTLPDSYTSSEVRDEFSDIIAEFVIDARHRYAPDVLELVNSDGDAVLARVAVSRLPEALSGCIPDDRFPYWLLTASRPRLGLPVTLNEYTALAVELSAPPLAWITGLLPGEVLTHDAEEWRPPTSWELRHVVGEGSFTGVSGAAAAALLGMSATNFRKYTAGDSAANRQKISFAAWHYLLDRLGVKRAS 228 T 0.00054 DUF4447 pdbhh F Bacteria T 7t4q 3 C C UL128_HCMVM UL128 MSPKNLTPFLTALWLLLGHSRVPRVRAEECCEFINVNHPPERCYDFKMCNRFTVALRCPDGEVCYSPEKTAEIRGIVTTMTHSLTRQVVHNKLTSCNYNPLYLEADGRIRCGKVNDKAQYLLGAAGSVPYRWINLEYDKITRIVGLDQYLESVKKHKRLDVCRAKMGYMLQ 171 T 11 SH3_19 pdbhh T Viruses T 7t4q 5 E E U131A_HCMVM PROTEIN UL131A,UL131A MRLCRVWLSVCLCAVVLGQCQRETAEKNDYYRVPHYWDACSRALPDQTRYKYVEQLVDLTLNYHYDASHGLDNFDVLKRINVTEVSLLISDFRRQNRRGGTNKRTTFNAAGSLAPHARSLEFSVRLFAN 129 T 0.94 DDDD pdbpercent T Viruses T 7t4r 4 D,M D,M UL128_HCMVM UL128 MSPKNLTPFLTALWLLLGHSRVPRVRAEECCEFINVNHPPERCYDFKMCNRFTVALRCPDGEVCYSPEKTAEIRGIVTTMTHSLTRQVVHNKLTSCNYNPLYLEADGRIRCGKVNDKAQYLLGAAGSVPYRWINLEYDKITRIVGLDQYLESVKKHKRLDVCRAKMGYMLQ 171 T 11 SH3_19 pdbhh T Viruses T 7t4r 6 F,O F,O U131A_HCMVM PROTEIN UL131A,UL131A MRLCRVWLSVCLCAVVLGQCQRETAEKNDYYRVPHYWDACSRALPDQTRYKYVEQLVDLTLNYHYDASHGLDNFDVLKRINVTEVSLLISDFRRQNRRGGTNKRTTFNAAGSLAPHARSLEFSVRLFAN 129 T 0.94 DDDD pdbpercent T Viruses T 7t4s 3 C C UL128_HCMVM UL128 MSPKNLTPFLTALWLLLGHSRVPRVRAEECCEFINVNHPPERCYDFKMCNRFTVALRCPDGEVCYSPEKTAEIRGIVTTMTHSLTRQVVHNKLTSCNYNPLYLEADGRIRCGKVNDKAQYLLGAAGSVPYRWINLEYDKITRIVGLDQYLESVKKHKRLDVCRAKMGYMLQ 171 T 11 SH3_19 pdbhh T Viruses T 7t4s 5 E E U131A_HCMVM PROTEIN UL131A,UL131A MRLCRVWLSVCLCAVVLGQCQRETAEKNDYYRVPHYWDACSRALPDQTRYKYVEQLVDLTLNYHYDASHGLDNFDVLKRINVTEVSLLISDFRRQNRRGGTNKRTTFNAAGSLAPHARSLEFSVRLFAN 129 T 0.94 DDDD pdbpercent T Viruses T 7t55 2 B,D C,D A3DCU2_ACET2 PCAT1 peptide substrate SNAMSEAKKLNIGRELTDEELMEMTGGSTFSIQCQKDYTYKPSLPVVKYGVVIDEPEVVIKYGVGPIVGIKYGVEPIGPIQPMYGIKPVETLK 93 T 0.017 L_biotic_typeA unphh F Bacteria T 7t56 2 B,D C,D A3DCU2_ACET2 PCAT1 peptide substrate SNAMSEAKKLNIGRELTDEELMEMTGGSTFSIQCQKDYTYKPSLPVVKYGVVIDEPEVVIKYGVGPIVGIKYGVEPIGPIQPMYGIKPVETLK 93 T 0.017 L_biotic_typeA unphh F Bacteria T 7t57 2 B,D C,D A3DCU2_ACET2 PCAT1 peptide substrate SNAMSEAKKLNIGRELTDEELMEMTGGSTFSIQCQKDYTYKPSLPVVKYGVVIDEPEVVIKYGVGPIVGIKYGVEPIGPIQPMYGIKPVETLK 93 T 0.017 L_biotic_typeA unphh F Bacteria T 7t5m 3 E,F E,F I27RA_HUMAN IL-27 RECEPTOR SUBUNIT ALPHA,IL-27R SUBUNIT ALPHA,IL-27R-ALPHA,IL-27RA,CYTOKINE RECEPTOR WSX-1,CYTOKINE RECEPTOR-LIKE 1,TYPE I T-CELL CYTOKINE RECEPTOR,TCCR,ZCYTOR1 FLPTPEELGLLGPPRPQVLA 20 T 1.9 COX5A pdbhh F Eukaryota T 7t5p 1 A A SIMC1_HUMAN PLATFORM ELEMENT FOR INHIBITION OF AUTOLYTIC DEGRADATION AYLQDMPRSPGDVPQSPSDVSPSPDAPQSPGGMPHLPGDVLHSPGDMPHSSGDVTHSPRDIPHLPGDRPDFTQNDVQNRDMPMDISALSSPSCSPRPQSETPLEKVPWLSVMETPARKEISLSEPAKPGSAHVQSRTPQGGLYNRPCLHRLKYFLRPPVHHLFFQTLIPDKDTRENKGQRLEPIPHRRLRMVTNTIEENFPLGTVQFLMDFVSPQHYPPREIVAHIIQKILLSGSETVDVLKEAYMLLMKIQQLHPANAKTVEWDWKLLTYVMEEEGQTLPGRVLFLRYVVQTLEDDFQQTLRRQRQHLQQSIANMVLSCDKQPHNVRDVIKWLVKAVTEDGLTQPPNGNQTSSGTGILKASSSHPSSQPNLTKNTNQLIVCQLQRMLSIAVEVDRTPTCSSNKIAEMMFGFVLDIPERSQREMFFTTMESHLLRCKVLEIIFLHSCETPTRLPLSLAQALYFLNNSTSLLKCQSDKSQWQTWDELVERLQFLLSSYQHVLREHLRSSVIDRKDLIIKRIKPKPQQGDDITVVDVEKQIEAFRSRLIQMLGEPLVPQLQDKVHLLKLLLFYAADLNPDAEPFQKGWSGS 589 T 0.06 Anticodon_2 unppssm F Eukaryota T 7t5v 1 A A A0A1X1LKI5_ECOLX HELIX-TURN-HELIX TRANSCRIPTIONAL REGULATOR,TRANSCRIPTIONAL REGULATOR,XRE FAMILY TRANSCRIPTIONAL REGULATOR SNADDLREPEERHLDDAFFRGYKNLEPEAKAQLRKMLDTFKKDF 44 T 0.012 Metal_resist pdbpssm F Bacteria T 7t5w 1 A,B,C,D A,B,C,D A0A1X1LKI5_ECOLX HELIX-TURN-HELIX TRANSCRIPTIONAL REGULATOR,TRANSCRIPTIONAL REGULATOR,XRE FAMILY TRANSCRIPTIONAL REGULATOR SNADDLREPEERHLDDAFFRGYKNLEPEAKAQLRKILDTFKKDF 44 T 0.013 Metal_resist pdbpssm F Bacteria T 7t69 1 A A Q709D8_FUSOX SECRETED IN XYLEM 1 PROTEIN GPMQEAAVREPQIFFNLTYTEYLDKVAASHGSPPDKSDLPWNDTMGSFPGNETDDGVQTETGSSLSRRGHIVNLRKREPFGEESRNDRVTQDMLQALHDLCVERFGTGYRAVSGLCYTDRRATRKIECNKPSVRERDRSVTRACPKGQECTTFNAYNFRNRHHQVTFPVCGPRIEVKDRHDIGIHTEWQGTWYPESPKSPGTYDYFAQMAGTLNGYFGYDGVYSDGYKTSSHGYGHSWSCINCPRGKVTITNTYRATWAFGYTSPHS 267 T 0.28 DAP_epimerase pdb F Eukaryota T 7t6a 1 A,B A,B Q2A0P0_FUSOX SECRETED IN XYLEM 4 PROTEIN SAHTESVCVHAGTATGADLHWLNAICTGKSTYTVNCAPAGNKNAGSTHTGTCPAGQDCFQLEQVGNFWGDREPDATCSPSNTVFDAVDDKEATHVNGKVVTRAGKPGIGRKLIRLKAQVYRRDGHYGQTSRMGFFRNGKEVYHIDNVASMEPTWNFDPSSDQSFSFFFTPGPNAFRIQGTLNLAS 185 T 0.32 Phage_CI_repr unppercent F Eukaryota T 7t6a 2 C,D C,D Q2A0P0_FUSOX SECRETED IN XYLEM 4 PROTEIN GPMLPKGEEGDIIGTFNFSSSDSQPLKIHWVDTPDSSGSNLVPR 44 T 0.32 Phage_CI_repr unppercent F Eukaryota T 7t6e 1 A,AA,AB,B,BA,BB,C,CA,CB,D,DA,DB,E,EA,EB,F,FA,FB,G,GA,GB,H,HA,HB,I,IA,J,JA,K,KA,L,LA,M,MA,N,NA,O,OA,P,PA,Q,QA,R,RA,S,SA,T,TA,U,UA,V,VA,W,WA,X,XA,Y,YA,Z,ZA A,0,q,B,1,r,C,2,s,D,3,t,E,4,u,F,5,v,G,6,w,H,7,x,I,8,J,9,K,a,L,b,M,c,N,d,O,e,P,f,Q,g,R,h,S,i,T,j,U,k,V,l,W,m,X,n,Y,o,Z,p NBD-ffsy peptide XXXXX 5 F F F 7t6g 1 A A B1Q143_ANCCA Truncated Ac-AIP-2 TPEEHDLLMDLMGDPKKAEE 20 T 1.1 B3GALT2_N pdbhh F Eukaryota T 7t6t 4 D L Synthetic peptide MLFII 5 T 66 Herpes_U15 pdbhh F F 7t6u 5 E L Synthetic peptide QKFTSWFX 8 T 1.7 DUF4518 pdbhh F T 7t6v 6 F L Synthetic peptide MLFII 5 T 66 Herpes_U15 pdbhh F F 7t70 2 C,D C,D R1AB_SARS2 Nonstructural protein 4/5 TSAVLQSGFRKM 12 T 8.5 IQ pdbhh T Viruses T 7t73 3 C,E,F A,C,E HIV Envelope ApexGT2.2MUT gp120 MGILPSPGMPALLSLVSLLSVLLMGCVAETGAENLWVTVYYGVPVWKDAETTLFCASDAKAYETEKHNVWATHACVPTDPNPQEIHLENVTEEFNMWKNNMVEQMHEDIISLWDQSLKPCVKLTPLCVTLQCTNVTNNITDDMRGELKNCSFNATTELRNKRQKVYSLFYRLDIVPMGENSANYRLIDCNTSAITQACPKVSFEPIPIHYCAPAGFAILKCKDKKFNGTGPCPSVSTVQCTHGIKPVVSTQLLLNGSLAEEEVIIRSENITNNAKNILVQLNTPVQINCTRPNNNTVKSIRIGPGQAFYYTGDIIGDIRQAHCNVSKATWNETLGKVVKQLRKHFGNNTIIRFAQSSGGDLEVTTHSFNCGGEFFYCNTSGLFNSTWISNTSVQGSNSTGSNDSITLPCRIKQIINMWQRIGQAMYAPPIQGVIRCVSNITGLILTRDGGSTNSTTETFRPGGGDMRDNWRSELYKYKVVKIEPLGVAPTRCKRRVVGRRRRRR 504 T 3.9999999999999995E-54 GP120 pdbpercent F T 7t74 1 A,G,K A,C,E HIV Envelope ApexGT2 gp120 MGILPSPGMPALLSLVSLLSVLLMGCVAETGAENLWVTVYYGVPVWKDAETTLFCASDAKAYETEKHNVWATHACVPTDPNPQEIHLENVTEEFNMWKNNMVEQMHEDIISLWDQSLKPCVKLTPLCVTLQCTNVTNNITDDMRGELKNCSFNATTELRNKRQKVYSLFYRLDIVPMGENSTNYRLINCNTSAITQACPKVSFEPIPIHYCAPAGFAILKCKDKKFNGTGPCPSVSTVQCTHGIKPVVSTQLLLNGSLAEEEVIIRSENITNNAKNILVQLNTPVQINCTRPNNNTVKSIRIGPGQAFYYTGDIIGDIRQAHCNVSKATWNETLGKVVKQLRKHFGNNTIIRFAQSSGGDLEVTTHSFNCGGEFFYCNTSGLFNSTWISNTSVQGSNSTGSNDSITLPCRIKQIINMWQRIGQAMYAPPIQGVIRCVSNITGLILTRDGGSTNSTTETFRPGGGDMRDNWRSELYKYKVVKIEPLGVAPTRCKRRVVGRRRRRR 504 T 3.9E-54 GP120 pdbpercent F T 7t75 1 A A HIV Envelope ApexGT2 gp120 MGILPSPGMPALLSLVSLLSVLLMGCVAETGAENLWVTVYYGVPVWKDAETTLFCASDAKAYETEKHNVWATHACVPTDPNPQEIHLENVTEEFNMWKNNMVEQMHEDIISLWDQSLKPCVKLTPLCVTLQCTNVTNNITDDMRGELKNCSFNATTELRNKRQKVYSLFYRLDIVPMGENSTNYRLINCNTSAITQACPKVSFEPIPIHYCAPAGFAILKCKDKKFNGTGPCPSVSTVQCTHGIKPVVSTQLLLNGSLAEEEVIIRSENITNNAKNILVQLNTPVQINCTRPNNNTVKSIRIGPGQAFYYTGDIIGDIRQAHCNVSKATWNETLGKVVKQLRKHFGNNTIIRFAQSSGGDLEVTTHSFNCGGEFFYCNTSGLFNSTWISNTSVQGSNSTGSNDSITLPCRIKQIINMWQRIGQAMYAPPIQGVIRCVSNITGLILTRDGGSTNSTTETFRPGGGDMRDNWRSELYKYKVVKIEPLGVAPTRCKRRVVGRRRRRR 504 T 3.9E-54 GP120 pdbpercent F T 7t76 1 A,B,D A,C,E HIV Envelope ApexGT3 gp120 MGILPSPGMPALLSLVSLLSVLLMGCVAETGAENLWVTVYYGVPVWKDAETTLFCASDAKAYETEKHNVWATHACVPTDPNPQEIHLENVTEEFNMWKNNMVEQMHEDIISLWDQSLKPCVKLTPLCVTLQCTNVTNNITDDMRGELKNCSFNATTELRNKRVKRYSLFYRLDIVQIDSNRAKSHYRLINCNTSAITQACPKVSFEPIPIHYCAPAGFAILKCKDKKFNGTGPCPSVSTVQCTHGIKPVVSTQLLLNGSLAEEEVIIRSENITNNAKNILVQLNTPVQINCTRPNNNTVKSIRIGPGQAFYYTGDIIGDIRQAHCNVSKATWNETLGKVVKQLRKHFGNNTIIRFAQSSGGDLEVTTHSFNCGGEFFYCNTSGLFNSTWISNTSVQGSNSTGSNDSITLPCRIKQIINMWQRIGQAMYAPPIQGVIRCVSNITGLILTRDGGSTNSTTETFRPGGGDMRDNWRSELYKYKVVKIEPLGVAPTRCKRRVVGRRRRRR 506 T 3.9E-54 GP120 pdbpercent F T 7t77 1 A,C,E A,C,E HIV Envelope ApexGT3.N130 gp120 MGILPSPGMPALLSLVSLLSVLLMGCVAETGAENLWVTVYYGVPVWKDAETTLFCASDAKAYETEKHNVWATHACVPTDPNPQEIHLENVTEEFNMWKNNMVEQMHEDIISLWDQSLKPCVKLTPLCVTLNCTNVTNNITDDMRGELKNCSFNATTELRNKRVKRYSLFYRLDIVQIDSNRTKSHYRLINCNTSAITQACPKVSFEPIPIHYCAPAGFAILKCKDKKFNGTGPCPSVSTVQCTHGIKPVVSTQLLLNGSLAEEEVIIRSENITNNAKNILVQLNTPVQINCTRPNNNTVKSIRIGPGQAFYYTGDIIGDIRQAHCNVSKATWNETLGKVVKQLRKHFGNNTIIRFAQSSGGDLEVTTHSFNCGGEFFYCNTSGLFNSTWISNTSVQGSNSTGSNDSITLPCRIKQIINMWQRIGQAMYAPPIQGVIRCVSNITGLILTRDGGSTNSTTETFRPGGGDMRDNWRSELYKYKVVKIEPLGVAPTRCKRRVVGRRRRRR 506 T 3.9E-54 GP120 pdbpercent F T 7t7w 1 A A Lt-MAP4 peptide LKKLWRFLKKL 11 T 1.2 PurL_C pdbhh F F 7t86 3 I,J,K,L G,I,J,K Phospho-SD Peptide SDSDSDSDSDSD 12 T 23 DUF4692 pdbhh F F 7t8m 2 C,D C,D R1AB_SARS2 Nonstructural protein 5/6 GVTFQSAVKR 10 T 6.9 PhnI pdbhh T Viruses T 7t8n 1 A,B AAA,BBB PGAA_ECOLI PGA EXPORT PROTEIN,POLY-BETA-1,6-GLCNAC EXPORT PROTEIN DANLTPDIRADIHAELVRLSFMPTRSESERYAIADRALAQYAALEILWHDNPDRTAQYQRIQVDHLGALLTRDRYKDVISHYQRLKKTGQIIPPWGQYWVASAYLKDHQPKKAQSIMTELFYHKETIAPDLSDEELADLFYSHLESEN 148 T 0.029 TPR_19 unppercent F Bacteria T 7t8r 2 B B R1AB_SARS2 Nonstructural protein 7/8 NRATLQAI 8 T 23 CDC4_D pdbhh T Viruses T 7t8y 2 B B BE2-LEU-PRO-ALA-THR-ALA-ALA XLPATAA 7 T 42 Thiopep_pre pdbhh F F 7t8y 3 C D ACE-ALA-ZGL-LYS-DAL-DAL XAXKXX 6 T 640 Phage_Treg pdbhh F F 7t8z 2 B B BE2-LEU-PRO-ALA-THR-ALA-ALA XLPATAA 7 T 42 Thiopep_pre pdbhh F F 7t8z 3 C D ACE-ALA-ZGL-LYS-DAL-DAL XAXKXX 6 T 640 Phage_Treg pdbhh F F 7t9a 1 A,E,I A,C,E HIV Envelope ApexGT2 gp120 AENLWVTVYYGVPVWKDAETTLFCASDAKAYETEKHNVWATHACVPTDPNPQEIHLENVTEEFNMWKNNMVEQMHEDIISLWDQSLKPCVKLTPLCVTLQCTNVTNNITDDMRGELKNCSFNATTELRNKRQKVYSLFYRLDIVPMGENSTNYRLINCNTSAITQACPKVSFEPIPIHYCAPAGFAILKCKDKKFNGTGPCPSVSTVQCTHGIKPVVSTQLLLNGSLAEEEVIIRSENITNNAKNILVQLNTPVQINCTRPNNNTVKSIRIGPGQAFYYTGDIIGDIRQAHCNVSKATWNETLGKVVKQLRKHFGNNTIIRFAQSSGGDLEVTTHSFNCGGEFFYCNTSGLFNSTWISNTSVQGSNSTGSNDSITLPCRIKQIINMWQRIGQAMYAPPIQGVIRCVSNITGLILTRDGGSTNSTTETFRPGGGDMRDNWRSELYKYKVVKIEPLGVAPTRCKRRVVGRRRRRR 473 T 3.5E-54 GP120 pdbpercent F T 7t9b 1 A,E,I A,C,E HIV-1 Envelope ApexGT5 gp120 AENLWVTVYYGVPVWKDAETTLFCASDAKAYETEKHNVWATHACVPTDPNPQEIHLENVTEEFNMWKNNMVEQMHEDIISLWDQSLKPCVKLTPLCVTLQCTNVTNNITDDMRGELKNCSFNATTELRNKRQKVYSLFYRLDIVPMVDLWTNYRLINCNTSAITQACPKVSFEPIPIHYCAPAGFAILKCKDKKFNGTGPCPSVSTVQCTHGIKPVVSTQLLLNGSLAEEEVIIRSENITNNAKNILVQLNTPVQINCTRPNNNTVKSIRIGPGQAFYYTGDIIGDIRQAHCNVSKATWNETLGKVVKQLRKHFGNNTIIRFAQSSGGDLEVTTHSFNCGGEFFYCNTSGLFNSTWISNTSVQGSNSTGSNDSITLPCRIKQIINMWQRIGQAMYAPPIQGVIRCVSNITGLILTRDGGSTNSTTETFRPGGGDMRDNWRSELYKYKVVKIEPLGVAPTRCKRRVVGRRRRRR 473 T 3.5E-54 GP120 pdbpercent F T 7t9i 5 E X GNAS2_HUMAN ADENYLATE CYCLASE-STIMULATING G ALPHA PROTEIN GGSLEVLFQGPSGNSKTEDQRNEEKAQREANKKIEKQLQKDKQVYRATHRLLLLGADNSGKSTIVKQMRILHGGSGGSGGTSGIFETKFQVDKVNFHMFDVGGQRDERRKWIQCFNDVTAIIFVVDSSDYNRLQEALNLFKSIWNNRWLRTISVILFLNKQDLLAEKVLAGKSKLEDYFPEFARYTTPEDATPEPGEDPRVTRAKYFIRDEFLRISTASGDGRHYCYPHFTCAVDTENARRIFNDCRDIIQRMHLRQYELL 261 T 5E-10 G-alpha pdb F Eukaryota T 7t9n 5 E X GNAS2_HUMAN ADENYLATE CYCLASE-STIMULATING G ALPHA PROTEIN GGSLEVLFQGPSGNSKTEDQRNEEKAQREANKKIEKQLQKDKQVYRATHRLLLLGADNSGKSTIVKQMRILHGGSGGSGGTSGIFETKFQVDKVNFHMFDVGGQRDERRKWIQCFNDVTAIIFVVDSSDYNRLQEALNLFKSIWNNRWLRTISVILFLNKQDLLAEKVLAGKSKLEDYFPEFARYTTPEDATPEPGEDPRVTRAKYFIRDEFLRISTASGDGRHYCYPHFTCAVDTENARRIFNDCRDIIQRMHLRQYELL 261 T 5E-10 G-alpha pdb F Eukaryota T 7t9w 1 A,B,C,D,E,F,G,H,I,J,K,L,M,N,O,P A,B,C,D,E,F,G,H,I,J,K,L,M,N,O,P R1AB_SARS2 NON-STRUCTURAL PROTEIN 3,NSP3,PL2-PRO,PAPAIN-LIKE PROTEINASE,PL-PRO SEEVVENPTIQKDVLECNVKTTEVVGDIILKPANNSLKITEEVGHTDLMAAYVDNSSLTIKKPNELSRVLGLKTLATHGLAAVNSVPWDTIANYAKPFLNKVVST 105 T 33 PHB_acc_N pdbhh T Viruses T 7t9y 2 C,D C,D R1AB_SARS2 Nonstructural protein 8/9 SAVKLQNNEL 10 T 7.9 Phospho_p8 pdbhh T Viruses T 7ta3 1 A,C A,C Alpha-peptide-3 ECGWRIGEAGTDPNLNHQQFRAKILSIWEECX 32 T 3 CTK3 pdbhh F T 7ta4 2 C,D C,D R1AB_SARS2 Nonstructural protein 9/10 ATVRLQAGNA 10 T 1.4 CoV_NSP9 pdbhh T Viruses T 7ta6 2 I,J,K,L,M,N,O,P I,J,K,L,M,N,O,P Alpha/Beta-peptide-1 XLGWCIGEXGTDPNLNHXQFRXKILXCWX 29 T 1.5 Poty_coat pdbhh F T 7ta7 2 C,D C,D R1AB_SARS2 Nonstructural protein 10/11 REPMLQSADAQ 11 T 60 LD_cluster3 pdbhh T Viruses T 7tau 8 EA X Unknown fragment XXXXXXXXXXXXXXX 15 F F F 7taw 5 J,K J,K AcrIF24 MNAIHIGPFSITPAARGLHYGGLPHHQWTLYYGPREMAIKTLPDSYTSSEVRDEFSDIIAEFVIDARHRYAPDVLELVNSDGDAVLARVAVSRLPEALSGCIPDDRFPYWLLTASRPRLGLPVTLNEYTALAVELSAPPLAWITGLLPGEVLTHDAEEWRPPTSWELRHVVGEGSFTGVSGAAAAALLGMSATNFRKYTAGDSAANRQKISFAAWHYLLDRLGVKRAS 228 T 0.00054 DUF4447 pdbhh F T 7tax 5 J,K J,K AcrIF24 MNAIHIGPFSITPAARGLHYGGLPHHQWTLYYGPREMAIKTLPDSYTSSEVRDEFSDIIAEFVIDARHRYAPDVLELVNSDGDAVLARVAVSRLPEALSGCIPDDRFPYWLLTASRPRLGLPVTLNEYTALAVELSAPPLAWITGLLPGEVLTHDAEEWRPPTSWELRHVVGEGSFTGVSGAAAAALLGMSATNFRKYTAGDSAANRQKISFAAWHYLLDRLGVKRAS 228 T 0.00054 DUF4447 pdbhh F T 7tb1 2 C,D C,D ALA-CYS-SER-SER-ILE-TRP-CYS-PRO-ASP-GLY ACSSIWCPDG 10 T 0.52 CENP-U pdbhh F T 7tb2 2 B B R1AB_SARS2 Nonstructural protein 12/13 PHTVLQAV 8 T 11 ATXN-1_C pdbhh T Viruses T 7tb9 1 A A CEMP1_HUMAN CEMP1-p1 MGTSSTDSQQAGHRRCSTSN 20 T 12 DUF983 pdbhh F Eukaryota T 7tbi 2 D,E,F,G B1,B2,B3,B4 Nup53/Nup59 R3 RKAKLLPMEEALLP 14 T 9.2 POX pdbhh F T 7tbi 3 H,I,TB,UB C2,C3,C1,C4 Nup145N/Nup100/Nup116 R3 KLVINKDMRTDLFSPPN 17 T 4.6 DUF4616 pdbhh F T 7tbi 5 N,O,P,Q E1,E2,E3,E4 Nup53/Nup59 R2 DPTIAAADKIFSNWLASQ 18 T 1.5 DUF3986 pdbhh F T 7tbi 7 T,U G1,G2 Nic96 R2 DVDTYLSNLQTKTTLSMIADGLERSARDFDAFLEENVTLEWEAQRKRIYQHFG 53 T 0.16 X pdbpssm F T 7tbi 8 V,W H1,H2 Nup145N/Nup100/Nup116 R2 EDSILQPGAFSAN 13 T 1 Hydrolase_2 pdbhh F T 7tbi 10 AA,Z J2,J1 Nic96 R2 SGTGLGEVDVDTYLSNLQTKTTLSMIADGLERSARDFDAFLEENVTLEWEAQRKRIYQHFGIK 63 T 0.19 UPF0172 pdbpssm F T 7tbi 11 BA,CA K1,K2 Nup145N/Nup100/Nup116 R1 ILPMYKLSP 9 T 2.6 AIMP2_LysRS_bd pdbhh F T 7tbi 12 DA,EA L1,L2 Nup53/Nup59 R1 FG 2 T 140 DUF5754 pdbhh F F 7tbi 14 JA,KA,LA,MA N1,N2,N3,N4 Nup57 SEALQQEIAKIDEEIQKCIRDKEAVDAFLPAHGEQLAAIPTDVNFVTRKSEGAHNALSSDILAIDQLRELVKQDADNARLSFKAIDNLKLPMQYHQAGLWSKQMGGAGTAGASGASADADGQSNADLISYFSKTADEMEEMMKKFEKTITEIEAHLTGVEAHAMAMQNVAAQSRNAAQGGVDERVYELAAVLREFEESILKVAGVVGGVKEGVTELQLRDFM 222 T 0.0012 AAA_13 pdbpssm F T 7tbi 16 RA,SA,TA,UA P1,P2,P3,P4 Nic96 R1 ALFDSLLARNKKQAEGETALGELPSLQLGLADLRQRLRKL 40 T 7.9 TACI-CRD2 pdbhh F T 7tbi 24 HB,IB W1,W2 Nup120 MACLSRIDANLLQYYEKPEPNNTVDLYVSNNSNNNGLKEGDKSISTPVPQPYGSEYSNCLLLSNSEYICYHFSSRSTLLTFYPLSDAYHGKTINIHLPNASMNQRYTLTIQEVEQQLLVNVILKDGSFLTLQLPLSFLFSSANTLNGEWFHLQNPYDFTVRVPHFLFYVSPQFSVVFLEDGGLLGLKKVDGVHYEPLLFNDNSYLKSLTRFFSRSSKSDYDSVISCKLFHERYLIVLTQNCHLKIWDLTSFTLIQDYDMVSQSDSDPSHFRKVEAVGEYLSLYNNTLVTLLPLENGLFQMGTLLVDSSGILTYTFQNNIPTNLSASAIWSIVDLVLTRPLELNVEASYLNLIVLWKSGTASKLQILNVNDESFKNYEWIESVNKSLVDLQSEHDLDIVTKTGDVERGFCNLKSRYGTQIFERAQQILSENKIIMAHNEDEEYLANLETILRDVKTAFNEASSITLYGDEIILVNCFQPYNHSLYKLNTTVENWFYNMHSETDGSELFKYLRTLNGFASTLSNDVLRSISKKFLDIITGELPDSMTTVEKFTDIFKNCLENQFEITNLKILFDELNSFDIPVVLNDLINNQMKPGIFWKKDFISAIKFDGFTSIISLESLHQLLSIHYRITLQVLLTFVLFDLDTEIFGQHISTLLDLHYKQFLLLNLYRQDKCLLAEVLLKDSSEFSFGVKFFNYGQLIAYIDSLNSNVYNASITENSFFMTFFRSYIIENTSHKNIRFFLENVECPFYLRHNEVQEFMFAMTLFSCGNFDQSYEIFQLHDYPEAINDKLPTFLEDLKSENYHGDSIWKDLLCTFTVPYRHSAFYYQLSLLFDRNNSQEFALKCISKSAEYSLKEIQIEELQDFKEKQHIHYLNLLIHFRMFEEVLDVLRLGHECLSDTVRTNFLQLLLQEDIYSRDFFSTLLRLCNAHSDNGELYLRTVDIKIVDSILSQNLRSGDWECFKKLYCFRMLNKSERAAAEVLYQYILMQADLDVIRKRKCYLMVINVLSSFDSAYDQWILNGSKVVTLTDLRDELRGL 1037 T 0.086 TPR_1 pdbpssm F T 7tbi 26 LB,MB Y1,Y2 Nup159 YAESGIQTDLSESSKENEVQTDAIPVKHNSTQTVKKEAVDNGLQTEPVETCNFSVQTFEGDENYLAEQCKPKQLKEYYTSAKVSNIPFVSQNSTLRLIESTFQTVEAEFTVLMENIRNMDTFFTDQSSIPLVKRTVRSINNLYTWRIPEAEILLNIQNNIKCEQMQITNANIQDLKEKVTDYVRKDIAQITEDVANAKEEYLFLMHFDDASSGYVKDLSTHQFRMQKTLRQKLFDVSAKINHTEELLNILKLFTVKNKRLDDNPLVAKLAKESLARDGLLKEIKLLREQVSRLQLEEKGKKASSFDASSSITKDMKGFKVVEVGLAMNTKKQIGDFFKNL 340 T 0.025 Lzipper-MIP1 pdbpercent F T 7tbi 27 NB,OB Z1,Z2 Nup82 MSQSSRLSALPIFQASLSASQSPRYIFSSQNGTRIVFIQDNIIRWYNVLTDSLYHSLNFSRHLVLDDTFHVISSTSGDLLCLFNDNEIFVMEVPWGYSNVEDVSIQDAFQIFHYSIDEEEVGPKSSIKKVLFHPKSYRDSCIVVLKEDDTITMFDILNSQEKPIVLNKPNNSFGLDARVNDITDLEFSKDGLTLYCLNTTEGGDIFAFYPFLPSVLLLNEKDLNLILNKSLVMYESLDSTTDVIVKRNVIKQLQFVSKLHENWNSRFGKVDIQKEYRLAKVQGPFTINPFPGELYDYTATNIATILIDNGQNEIVCVSFDDGSLILLFKDLEMSMSWDVDNYVYNNSLVLIERVKLQREIKSLITLPEQLGKLYVISDNIIQQVNFMSWASTLSKCINESDLNPLAGLKFESKLEDIATIERIPNLAYINWNDQSNLALMSNKTLTFQNISSDMKPQSTAAETSISTEKSDTVGDGFKMSFTQPINEILILNDNFQKACISPCERIIPSADRQIPLKNEASENQLEIFTDISKEFLQRIVKAQTLGVSIHNRIHEQQFELTRQLQSTCKIISKDDDLRRKFEAQNKKWDAQLSRQSELMERFSKLSKKLSQIAESNKFKEKKISHGEMKWFKEIRNQILQFNSFVHSQKSLQQDLSYLKSELTRIEAETIKVDKKSQNEWDELRKMLEIDSKIIKECNEELLQVSQEFTTKTQ 713 T 2.4E-12 Nup88 pdbpercent F T 7tbj 4 G,H,I,J,K,L B1,B2,B3,B4,B5,B6 NUP53 R3 RKAKLLPMEEALLP 14 T 9.2 POX pdbhh F T 7tbj 5 M,N,O,P,Q,R C1,C2,C3,C4,C5,C6 NUP98 R3 HKKLVINKDMRTDLFSPPN 19 T 5.9 MethyTransf_Reg pdbhh F T 7tbj 7 AA,BA,CA,DA,EA,FA,Z E2,E3,E4,E5,E6,E7,E1 NUP53 R2 APPVRSIY 8 T 5.2 DUF502 pdbhh F T 7tbj 9 IA,JA G1,G2 NUP93 R2 DVDTYLSNLQTKTTLSMIADGLERSARDFDAFLEENVTLEWEAQRKRIYQHFG 53 T 0.16 X pdbpssm F T 7tbj 10 KA,LA H1,H2 NUP98 R2 EDSILQPGAFSAN 13 T 1 Hydrolase_2 pdbhh F T 7tbj 12 RA,SA,TA,UA,VA J1,J2,J3,J4,J5 NUP93 R2 SGTGLGEVDVDTYLSNLQTKTTLSMIADGLERSARDFDAFLEENVTLEWEAQRKRIYQHFGIK 63 T 0.19 UPF0172 pdbpssm F T 7tbj 13 AB,WA,XA,YA,ZA K5,K1,K2,K3,K4 NUP98 R1 ILPMYKLSP 9 T 2.6 AIMP2_LysRS_bd pdbhh F T 7tbj 14 BB,CB,DB,EB,FB L1,L2,L3,L4,L5 NUP53 R1 FG 2 T 140 DUF5754 pdbhh F F 7tbj 16 KB,LB,MB,NB N1,N2,N3,N4 NUP58 SEALQQEIAKIDEEIQKCIRDKEAVDAFLPAHGEQLAAIPTDVNFVTRKSEGAHNALSSDILAIDQLRELVKQDADNARLSFKAIDNLKLPMQYHQAGLWSKQMGGAGTAGASGASADADGQSNADLISYFSKTADEMEEMMKKFEKTITEIEAHLTGVEAHAMAMQNVAAQSRNAAQGGVDERVYELAAVLREFEESILKVAGVVGGVKEGVTELQLRDFM 222 T 0.0012 AAA_13 pdbpssm F T 7tbj 18 SB,TB,UB,VB P1,P2,P3,P4 NUP54 Ferrodoxin-like domain DEDGLISLIFNKKESDIRGQQQQLVESLHKVLGGHQTLTVNVEGVKTKADNQTEVIIYVVERSPNGTSRRVGASALFSYFEQAHIKANMQSLGVTGAMAQTELSPVQIKQLIQNPL 116 T 0.018 Glutaminase pdb F T 7tbj 20 AC,BC,CC,DC R1,R2,R3,R4 NUP93 R1 ALFDSLLARNKKQAEGETALGELPSLQLGLADLRQRLRKL 40 T 7.9 TACI-CRD2 pdbhh F T 7tbj 30 OD,PD,QD,RD b1,b2,b3,b4 NUP37 SNQYQLPLNVRPYTTTWCSQSPSCSNLLAIGHDTGITIYCASEEQTPGSTGLTLQELFTIQTGLPTLHLSFSSSCSYSENLHDGDGNVNSSPVYSLFLACVCQDNTVRLIITKNETIITQHVLGGKSGHHNFVNDIDIADVYSADNRLAEQVIASVGDDCTLIIWRLTDEGPILAGYPLSSPGISVQFRPSNPNQLIVGERNGNIRIFDWTLNLSAEENSQTELVKNPWLLTLNTLPLVNTCHSSGIASSLANVRWIGSDGSGILAMCKSGAWLRWNLFANNDYNEISDSTMKLGPKNLLPNVQGISLFPSLLGACPHPRYMDYFATAHSQHGLIQLINTYEKDSNSIPIQLGMPIVDFCWHQDGSHLAIATEGSVLLTRLMGFT 385 T 0.00016 WD40 pdbpssm F T 7tbk 4 G,H,I,J,K,L B1,B2,B3,B4,B5,B6 NUP53 R3 RKAKLLPMEEALLP 14 T 9.2 POX pdbhh F T 7tbk 5 M,N,O,P,Q,R C1,C2,C3,C4,C5,C6 NUP98 R3 HKKLVINKDMRTDLFSPPN 19 T 5.9 MethyTransf_Reg pdbhh F T 7tbk 7 AA,BA,CA,DA,EA,FA,Z E2,E3,E4,E5,E6,E7,E1 NUP53 R2 APPVRSIY 8 T 5.2 DUF502 pdbhh F T 7tbk 9 IA,JA G1,G2 NUP93 R2 DVDTYLSNLQTKTTLSMIADGLERSARDFDAFLEENVTLEWEAQRKRIYQHFG 53 T 0.16 X pdbpssm F T 7tbk 10 KA,LA H1,H2 NUP98 R2 EDSILQPGAFSAN 13 T 1 Hydrolase_2 pdbhh F T 7tbk 12 RA,SA,TA,UA,VA J1,J2,J3,J4,J5 NUP93 R2 SGTGLGEVDVDTYLSNLQTKTTLSMIADGLERSARDFDAFLEENVTLEWEAQRKRIYQHFGIK 63 T 0.19 UPF0172 pdbpssm F T 7tbk 13 AB,WA,XA,YA,ZA K5,K1,K2,K3,K4 NUP98 R1 ILPMYKLSP 9 T 2.6 AIMP2_LysRS_bd pdbhh F T 7tbk 14 BB,CB,DB,EB,FB L1,L2,L3,L4,L5 NUP53 R1 FG 2 T 140 DUF5754 pdbhh F F 7tbk 16 KB,LB,MB,NB N1,N2,N3,N4 NUP58 SEALQQEIAKIDEEIQKCIRDKEAVDAFLPAHGEQLAAIPTDVNFVTRKSEGAHNALSSDILAIDQLRELVKQDADNARLSFKAIDNLKLPMQYHQAGLWSKQMGGAGTAGASGASADADGQSNADLISYFSKTADEMEEMMKKFEKTITEIEAHLTGVEAHAMAMQNVAAQSRNAAQGGVDERVYELAAVLREFEESILKVAGVVGGVKEGVTELQLRDFM 222 T 0.0012 AAA_13 pdbpssm F T 7tbk 18 SB,TB,UB,VB P1,P2,P3,P4 NUP54 Ferrodoxin-like domain DEDGLISLIFNKKESDIRGQQQQLVESLHKVLGGHQTLTVNVEGVKTKADNQTEVIIYVVERSPNGTSRRVGASALFSYFEQAHIKANMQSLGVTGAMAQTELSPVQIKQLIQNPL 116 T 0.018 Glutaminase pdb F T 7tbk 20 AC,BC,CC,DC R1,R2,R3,R4 NUP93 R1 ALFDSLLARNKKQAEGETALGELPSLQLGLADLRQRLRKL 40 T 7.9 TACI-CRD2 pdbhh F T 7tbk 28 GD,HD,ID,JD Z1,Z2,Z3,Z4 NUP160 MACLSRIDANLLQYYEKPEPNNTVDLYVSNNSNNNGLKEGDKSISTPVPQPYGSEYSNCLLLSNSEYICYHFSSRSTLLTFYPLSDAYHGKTINIHLPNASMNQRYTLTIQEVEQQLLVNVILKDGSFLTLQLPLSFLFSSANTLNGEWFHLQNPYDFTVRVPHFLFYVSPQFSVVFLEDGGLLGLKKVDGVHYEPLLFNDNSYLKSLTRFFSRSSKSDYDSVISCKLFHERYLIVLTQNCHLKIWDLTSFTLIQDYDMVSQSDSDPSHFRKVEAVGEYLSLYNNTLVTLLPLENGLFQMGTLLVDSSGILTYTFQNNIPTNLSASAIWSIVDLVLTRPLELNVEASYLNLIVLWKSGTASKLQILNVNDESFKNYEWIESVNKSLVDLQSEHDLDIVTKTGDVERGFCNLKSRYGTQIFERAQQILSENKIIMAHNEDEEYLANLETILRDVKTAFNEASSITLYGDEIILVNCFQPYNHSLYKLNTTVENWFYNMHSETDGSELFKYLRTLNGFASTLSNDVLRSISKKFLDIITGELPDSMTTVEKFTDIFKNCLENQFEITNLKILFDELNSFDIPVVLNDLINNQMKPGIFWKKDFISAIKFDGFTSIISLESLHQLLSIHYRITLQVLLTFVLFDLDTEIFGQHISTLLDLHYKQFLLLNLYRQDKCLLAEVLLKDSSEFSFGVKFFNYGQLIAYIDSLNSNVYNASITENSFFMTFFRSYIIENTSHKNIRFFLENVECPFYLRHNEVQEFMFAMTLFSCGNFDQSYEIFQLHDYPEAINDKLPTFLEDLKSENYHGDSIWKDLLCTFTVPYRHSAFYYQLSLLFDRNNSQEFALKCISKSAEYSLKEIQIEELQDFKEKQHIHYLNLLIHFRMFEEVLDVLRLGHECLSDTVRTNFLQLLLQEDIYSRDFFSTLLRLCNAHSDNGELYLRTVDIKIVDSILSQNLRSGDWECFKKLYCFRMLNKSERAAAEVLYQYILMQADLDVIRKRKCYLMVINVLSSFDSAYDQWILNGSKVVTLTDLRDELRGL 1037 T 0.086 TPR_1 pdbpssm F T 7tbl 4 G,H,I,J,K,L B1,B2,B3,B4,B5,B6 NUP53 R3 RKAKLLPMEEALLP 14 T 9.2 POX pdbhh F T 7tbl 5 M,N,O,P,Q,R C1,C2,C3,C4,C5,C6 NUP98 R3 HKKLVINKDMRTDLFSPPN 19 T 5.9 MethyTransf_Reg pdbhh F T 7tbl 7 AA,BA,CA,DA,EA,FA,Z E2,E3,E4,E5,E6,E7,E1 NUP53 R2 APPVRSIY 8 T 5.2 DUF502 pdbhh F T 7tbl 9 IA,JA G1,G2 NUP93 R2 DVDTYLSNLQTKTTLSMIADGLERSARDFDAFLEENVTLEWEAQRKRIYQHFG 53 T 0.16 X pdbpssm F T 7tbl 10 KA,LA H1,H2 NUP98 R2 EDSILQPGAFSAN 13 T 1 Hydrolase_2 pdbhh F T 7tbl 12 RA,SA,TA,UA,VA J1,J2,J3,J4,J5 NUP93 R2 SGTGLGEVDVDTYLSNLQTKTTLSMIADGLERSARDFDAFLEENVTLEWEAQRKRIYQHFGIK 63 T 0.19 UPF0172 pdbpssm F T 7tbl 13 AB,WA,XA,YA,ZA K5,K1,K2,K3,K4 NUP98 R1 ILPMYKLSP 9 T 2.6 AIMP2_LysRS_bd pdbhh F T 7tbl 14 BB,CB,DB,EB,FB L1,L2,L3,L4,L5 NUP53 R1 FG 2 T 140 DUF5754 pdbhh F F 7tbl 16 KB,LB,MB,NB N1,N2,N3,N4 NUP58 SEALQQEIAKIDEEIQKCIRDKEAVDAFLPAHGEQLAAIPTDVNFVTRKSEGAHNALSSDILAIDQLRELVKQDADNARLSFKAIDNLKLPMQYHQAGLWSKQMGGAGTAGASGASADADGQSNADLISYFSKTADEMEEMMKKFEKTITEIEAHLTGVEAHAMAMQNVAAQSRNAAQGGVDERVYELAAVLREFEESILKVAGVVGGVKEGVTELQLRDFM 222 T 0.0012 AAA_13 pdbpssm F T 7tbl 18 SB,TB,UB,VB P1,P2,P3,P4 NUP54 Ferrodoxin-like domain DEDGLISLIFNKKESDIRGQQQQLVESLHKVLGGHQTLTVNVEGVKTKADNQTEVIIYVVERSPNGTSRRVGASALFSYFEQAHIKANMQSLGVTGAMAQTELSPVQIKQLIQNPL 116 T 0.018 Glutaminase pdb F T 7tbl 20 AC,BC,CC,DC R1,R2,R3,R4 NUP93 R1 ALFDSLLARNKKQAEGETALGELPSLQLGLADLRQRLRKL 40 T 7.9 TACI-CRD2 pdbhh F T 7tbl 28 GD,HD,ID,JD Z1,Z2,Z3,Z4 NUP160 MACLSRIDANLLQYYEKPEPNNTVDLYVSNNSNNNGLKEGDKSISTPVPQPYGSEYSNCLLLSNSEYICYHFSSRSTLLTFYPLSDAYHGKTINIHLPNASMNQRYTLTIQEVEQQLLVNVILKDGSFLTLQLPLSFLFSSANTLNGEWFHLQNPYDFTVRVPHFLFYVSPQFSVVFLEDGGLLGLKKVDGVHYEPLLFNDNSYLKSLTRFFSRSSKSDYDSVISCKLFHERYLIVLTQNCHLKIWDLTSFTLIQDYDMVSQSDSDPSHFRKVEAVGEYLSLYNNTLVTLLPLENGLFQMGTLLVDSSGILTYTFQNNIPTNLSASAIWSIVDLVLTRPLELNVEASYLNLIVLWKSGTASKLQILNVNDESFKNYEWIESVNKSLVDLQSEHDLDIVTKTGDVERGFCNLKSRYGTQIFERAQQILSENKIIMAHNEDEEYLANLETILRDVKTAFNEASSITLYGDEIILVNCFQPYNHSLYKLNTTVENWFYNMHSETDGSELFKYLRTLNGFASTLSNDVLRSISKKFLDIITGELPDSMTTVEKFTDIFKNCLENQFEITNLKILFDELNSFDIPVVLNDLINNQMKPGIFWKKDFISAIKFDGFTSIISLESLHQLLSIHYRITLQVLLTFVLFDLDTEIFGQHISTLLDLHYKQFLLLNLYRQDKCLLAEVLLKDSSEFSFGVKFFNYGQLIAYIDSLNSNVYNASITENSFFMTFFRSYIIENTSHKNIRFFLENVECPFYLRHNEVQEFMFAMTLFSCGNFDQSYEIFQLHDYPEAINDKLPTFLEDLKSENYHGDSIWKDLLCTFTVPYRHSAFYYQLSLLFDRNNSQEFALKCISKSAEYSLKEIQIEELQDFKEKQHIHYLNLLIHFRMFEEVLDVLRLGHECLSDTVRTNFLQLLLQEDIYSRDFFSTLLRLCNAHSDNGELYLRTVDIKIVDSILSQNLRSGDWECFKKLYCFRMLNKSERAAAEVLYQYILMQADLDVIRKRKCYLMVINVLSSFDSAYDQWILNGSKVVTLTDLRDELRGL 1037 T 0.086 TPR_1 pdbpssm F T 7tbl 34 AE f NUP42 IIATDNVLFTPRDKLTVEELEQFQSKKFTLGKIPLKPPPLELLNV 45 T 1.8 Arcadin_1 pdbhh F T 7tbl 40 GE k2 NUP62 CCS2 QHADEEREKTYKLAENIDAQLKRMAQDLKEVIEHLNT 37 T 0.006 Tektin pdbpercent F T 7tbl 41 HE l1 NUP214 CCS1 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 92 F F F 7tbl 42 IE l2 NUP214 CCS2 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 39 F F F 7tbl 43 JE m1 NUP88 CCS1 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 88 F F F 7tbl 44 KE m2 NUP88 CCS2 XXXXXXXXXXXXXXXXXXXX 20 F F F 7tbm 4 G,H,I,J,K,L B1,B2,B3,B4,B5,B6 NUP53 R3 RKAKLLPMEEALLP 14 T 9.2 POX pdbhh F T 7tbm 5 M,N,O,P,Q,R C1,C2,C3,C4,C5,C6 NUP98 R3 HKKLVINKDMRTDLFSPPN 19 T 5.9 MethyTransf_Reg pdbhh F T 7tbm 7 AA,BA,CA,DA,EA,FA,Z E2,E3,E4,E5,E6,E7,E1 NUP53 R2 APPVRSIY 8 T 5.2 DUF502 pdbhh F T 7tbm 9 IA,JA G1,G2 NUP93 R2 DVDTYLSNLQTKTTLSMIADGLERSARDFDAFLEENVTLEWEAQRKRIYQHFG 53 T 0.16 X pdbpssm F T 7tbm 10 KA,LA H1,H2 NUP98 R2 EDSILQPGAFSAN 13 T 1 Hydrolase_2 pdbhh F T 7tbm 12 RA,SA,TA,UA,VA J1,J2,J3,J4,J5 NUP93 R2 SGTGLGEVDVDTYLSNLQTKTTLSMIADGLERSARDFDAFLEENVTLEWEAQRKRIYQHFGIK 63 T 0.19 UPF0172 pdbpssm F T 7tbm 13 AB,WA,XA,YA,ZA K5,K1,K2,K3,K4 NUP98 R1 ILPMYKLSP 9 T 2.6 AIMP2_LysRS_bd pdbhh F T 7tbm 14 BB,CB,DB,EB,FB L1,L2,L3,L4,L5 NUP53 R1 FG 2 T 140 DUF5754 pdbhh F F 7tbm 16 KB,LB,MB,NB N1,N2,N3,N4 NUP58 SEALQQEIAKIDEEIQKCIRDKEAVDAFLPAHGEQLAAIPTDVNFVTRKSEGAHNALSSDILAIDQLRELVKQDADNARLSFKAIDNLKLPMQYHQAGLWSKQMGGAGTAGASGASADADGQSNADLISYFSKTADEMEEMMKKFEKTITEIEAHLTGVEAHAMAMQNVAAQSRNAAQGGVDERVYELAAVLREFEESILKVAGVVGGVKEGVTELQLRDFM 222 T 0.0012 AAA_13 pdbpssm F T 7tbm 18 SB,TB,UB,VB P1,P2,P3,P4 NUP54 Ferrodoxin-like domain DEDGLISLIFNKKESDIRGQQQQLVESLHKVLGGHQTLTVNVEGVKTKADNQTEVIIYVVERSPNGTSRRVGASALFSYFEQAHIKANMQSLGVTGAMAQTELSPVQIKQLIQNPL 116 T 0.018 Glutaminase pdb F T 7tbm 20 AC,BC,CC,DC R1,R2,R3,R4 NUP93 R1 ALFDSLLARNKKQAEGETALGELPSLQLGLADLRQRLRKL 40 T 7.9 TACI-CRD2 pdbhh F T 7tbm 28 GD,HD,ID,JD Z1,Z2,Z3,Z4 NUP160 MACLSRIDANLLQYYEKPEPNNTVDLYVSNNSNNNGLKEGDKSISTPVPQPYGSEYSNCLLLSNSEYICYHFSSRSTLLTFYPLSDAYHGKTINIHLPNASMNQRYTLTIQEVEQQLLVNVILKDGSFLTLQLPLSFLFSSANTLNGEWFHLQNPYDFTVRVPHFLFYVSPQFSVVFLEDGGLLGLKKVDGVHYEPLLFNDNSYLKSLTRFFSRSSKSDYDSVISCKLFHERYLIVLTQNCHLKIWDLTSFTLIQDYDMVSQSDSDPSHFRKVEAVGEYLSLYNNTLVTLLPLENGLFQMGTLLVDSSGILTYTFQNNIPTNLSASAIWSIVDLVLTRPLELNVEASYLNLIVLWKSGTASKLQILNVNDESFKNYEWIESVNKSLVDLQSEHDLDIVTKTGDVERGFCNLKSRYGTQIFERAQQILSENKIIMAHNEDEEYLANLETILRDVKTAFNEASSITLYGDEIILVNCFQPYNHSLYKLNTTVENWFYNMHSETDGSELFKYLRTLNGFASTLSNDVLRSISKKFLDIITGELPDSMTTVEKFTDIFKNCLENQFEITNLKILFDELNSFDIPVVLNDLINNQMKPGIFWKKDFISAIKFDGFTSIISLESLHQLLSIHYRITLQVLLTFVLFDLDTEIFGQHISTLLDLHYKQFLLLNLYRQDKCLLAEVLLKDSSEFSFGVKFFNYGQLIAYIDSLNSNVYNASITENSFFMTFFRSYIIENTSHKNIRFFLENVECPFYLRHNEVQEFMFAMTLFSCGNFDQSYEIFQLHDYPEAINDKLPTFLEDLKSENYHGDSIWKDLLCTFTVPYRHSAFYYQLSLLFDRNNSQEFALKCISKSAEYSLKEIQIEELQDFKEKQHIHYLNLLIHFRMFEEVLDVLRLGHECLSDTVRTNFLQLLLQEDIYSRDFFSTLLRLCNAHSDNGELYLRTVDIKIVDSILSQNLRSGDWECFKKLYCFRMLNKSERAAAEVLYQYILMQADLDVIRKRKCYLMVINVLSSFDSAYDQWILNGSKVVTLTDLRDELRGL 1037 T 0.086 TPR_1 pdbpssm F T 7tbm 30 OD,PD,QD,RD b1,b2,b3,b4 NUP37 SNQYQLPLNVRPYTTTWCSQSPSCSNLLAIGHDTGITIYCASEEQTPGSTGLTLQELFTIQTGLPTLHLSFSSSCSYSENLHDGDGNVNSSPVYSLFLACVCQDNTVRLIITKNETIITQHVLGGKSGHHNFVNDIDIADVYSADNRLAEQVIASVGDDCTLIIWRLTDEGPILAGYPLSSPGISVQFRPSNPNQLIVGERNGNIRIFDWTLNLSAEENSQTELVKNPWLLTLNTLPLVNTCHSSGIASSLANVRWIGSDGSGILAMCKSGAWLRWNLFANNDYNEISDSTMKLGPKNLLPNVQGISLFPSLLGACPHPRYMDYFATAHSQHGLIQLINTYEKDSNSIPIQLGMPIVDFCWHQDGSHLAIATEGSVLLTRLMGFT 385 T 0.00016 WD40 pdbpssm F T 7tbm 37 CE k2 NUP62 CCS2 QHADEEREKTYKLAENIDAQLKRMAQDLKEVIEHLNT 37 T 0.006 Tektin pdbpercent F T 7tbm 38 DE l1 NUP214 CCS1 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 92 F F F 7tbm 39 EE l2 NUP214 CCS2 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 39 F F F 7tbm 40 FE m1 NUP88 CCS1 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 88 F F F 7tbm 41 GE m2 NUP88 CCS2 XXXXXXXXXXXXXXXXXXXX 20 F F F 7tbt 2 B B R1AB_SARS2 Nonstructural protein 13/14 NVATLQAENV 10 T 34 Shugoshin_N pdbhh T Viruses T 7tc4 2 C,D C,D R1AB_SARS2 Nonstructural protein 15/16 FYPKLQSSQ 9 T 9.2 HSR pdbhh T Viruses T 7tcr 2 C,D C,D A0A2D2CY73_METTR Methanobactin biosynthesis cassette protein MbnC MSLLPTAPVRIDADLYDDLANPARQSLYPRDSRGFIRIDISLRAYWHTLFDTCPRLLELSGPSGGAIFLPFMAWARENNLAFDWSFFLWVYVWLQQSEFRERLDEDQLLPVMTASATRWLMIDRDIDACQIVLGSRSLAGAAVVGAKIDSIHCRLEQVQQVAFAAPLPLPDGEFGYFLTPGFEIDHFPGWRPLPR 195 T 34 IgG_binding_B pdbhh F Bacteria T 7tcu 2 C,D C,D A0A2D2CY73_METTR Methanobactin biosynthesis cassette protein MbnC MSLLPTAPVRIDADLYDDLANPARQSLYPRDSRGFIRIDISLRAYWHTLFDTCPRLLELSGPSGGAIFLPFMAWARENNLAFDWSFFLWVYVWLQQSEFRERLDEDQLLPVMTASATRWLMIDRDIDACQIVLGSRSLAGAAVVGAKIDSIHCRLEQVQQVAFAAPLPLPDGEFGYFLTPGFEIDHFPGWRPLPR 195 T 34 IgG_binding_B pdbhh F Bacteria T 7tcw 2 C,D C,D A0A2D2CY73_METTR Methanobactin biosynthesis cassette protein MbnC MSLLPTAPVRIDADLYDDLANPARQSLYPRDSRGFIRIDISLRAYWHTLFDTCPRLLELSGPSGGAIFLPFMAWARENNLAFDWSFFLWVYVWLQQSEFRERLDEDQLLPVMTASATRWLMIDRDIDACQIVLGSRSLAGAAVVGAKIDSIHCRLEQVQQVAFAAPLPLPDGEFGYFLTPGFEIDHFPGWRPLPR 195 T 34 IgG_binding_B pdbhh F Bacteria T 7tcx 2 C,D C,D A0A2D2CY73_METTR Methanobactin biosynthesis cassette protein MbnC MSLLPTAPVRIDADLYDDLANPARQSLYPRDSRGFIRIDISLRAYWHTLFDTCPRLLELSGPSGGAIFLPFMAWARENNLAFDWSFFLWVYVWLQQSEFRERLDEDQLLPVMTASATRWLMIDRDIDACQIVLGSRSLAGAAVVGAKIDSIHCRLEQVQQVAFAAPLPLPDGEFGYFLTPGFEIDHFPGWRPLPR 195 T 34 IgG_binding_B pdbhh F Bacteria T 7tdr 1 A A Q5MIU2_AEDAL 34k2 salivary protein GNPTPKSCTVSEEDLTTIRNAIQKASRASLDDVNLDEDLIAKCPLLKTITASLKSVASEIATLKDTGISEEQVDELKQSYEQQVNEIVKSRDIFEKQSGGDVMKEQGAMINRMTELQVQVAQLQQQIGEQTSRMYDDMAELIFQRLAMNSTDSIRNYTAHMMEQKLHTLMTKLETNYRIFLGALRYLDHLGDQPLIDKVFDGILKRLDEMSLETNKERENGKYVLVNLLCWTVNNRFLTEKYRKKQLELFRIALKFYPKTGNKEANEADIRGRQFCDANFPVNVITWFAVSRAAEGWGLRGTLAAA 306 T 0.00044 TINF2_N pdbpssm F Eukaryota T 7tds 1 A A Q5MIU2_AEDAL 34k2 salivary protein GNPTPKSCTVSEEDLTTIRNAIQKASRASLDDVNLDEDLIAKCPLLKTITASLKSVASEIATLKDTGISEEQVDELKQSYEQQVNEIVKSRDIFEKQSGGDVMKEQGAMINRMTELQVQVAQLQQQIGEQTSRMYDDMAELIFQRLAMNSTDSIRNYTAHMMEQKLHTLMTKLETNYRIFLGALRYLDHLGDQPLIDKVFDGILKRLDEMSLETNKERENGKYVLVNLLCWTVNNRFLTEKYRKKQLELFRIALKFYPKTGNKEANEADIRGRQFCDANFPVNVITWFAVSRAAEGWGLRGTLAAA 306 T 0.00044 TINF2_N pdbpssm F Eukaryota T 7tdz 3 D,DA s,S Q9PVZ2_XENLA NUCLEOPORIN CAN MEDDTDLPPERETKDFQFRQLKKVRLFDYPADLPKQRSNLLVISNKYGLLFVGGFMGLKVFHTKDILVTVKPKENANKTVVGPQGIHVPMNSPIHHLALSSDNLTLSVCMTSAEQGSSVSFYDVRTLLNESKQNKMPFASCKLLRDPSSSVTDLQWNPTLPSMVAVCLSDGSISVLQVTDTVSVFANLPATLGVTSVCWSPKGKQLAVGKQNGTVVQYLPSLQEKKVIPCPSFYDSDNPVKVLDVLWLSTYVFTVVYAAADGSLEASPQLVIVTLPKKEDKRAERFLNFTETCYSICSERQHHFFLNYIEDWEILLAASAASVDVGVIARPPDQVGWEQWLLEDSSRAEMPMTENNDDTLPMGVALDYTCQLEVFISESQILPPVPVLLLLSTDGVLCPFHVVNLNQGVKPLTTSPEQLSLDGEREMKVVGGTAVSTPPAPLTSVSAPAPPASAAPRSAAPPPYPFGLSTASSGAPTPVLNPPASLAPAATPTKTTSQPAAAATSIFQPAGPAAGSLQPPSLPAFSFSSANNAANASAPSSFPFGAAMVSSNTAKVSAPPAMSFQPAMGTRPFSLATPVTVQAATAPGFTPTPSTVKVNLKDKFNASDTPPPATISSAAALSFTPTSKPNATVPVKSQPTVIPSQASVQPNRPFAVEAPQAPSSVSIASVQKTVRVNPPATKITPQPQRSVALENQAKVTKESDSILNGIREEIAHFQKELDDLKARTSRACFQVGSEEEKRQLRTESDGLHSFFLEIKETTESLRGEFSAMKIKNLEGFASIEDVQQRNKLKQDPKYLQLLYKKPLDPKSETQMQEIRRLNQYVKNAVQDVNDVLDLEWDQYLEEKQKKKGIIIPERETLFNSLANHQEIINQQRPKLEQLVENLQKLRLYNQISQWNVPDSSTKSFDVELENMQKTLSQTAIDTQTKPQAKLPAKISPVKQSQLRNFLSKRKTPPVRSLAPANLSRSAFLAPSFFEDLDDVSSTSSLSDMADNDNRNPPPKEIERQETPPPESTPVRVPKHAPVARTTSVQPGLGTASLPFQSGLHPATSTPVAPSQSIRVIPQGADSTMLATKTVKHGAPNITAAQKAAVAAMRRQTASQIPAASLTESTLQTVPQVVNVKELKNNGPGPTIPTVIGPTVPQSAAQVIHQVLATVGSVSARQAAPAAPLKNPPASASSIAPQTWQGSAPNKPAAQAIPKSDPSASQAPAPSVSQVNKPVSFSPAAGGFSFSNVTSAPVTSALGSSSAGCAATARDSNQASSYMFGGTGKSLGSEGSFSFASLKPASSSSSSSVVEPTMSKPSVVTAASTTATVTSTTAASSKPGEGLFQGFSGGETLGSFSGLRVGQADEASKVEVAKTPTAAQPVKLPSNPVLFSFAGAPQPAKVGEAPSTTSSTSASLFGNVQLASAGSTASAFTQSGSKPAFTFGIPQSTSTTAGASSAIPASFQSLLVSAAPATTTPSAPINSGLDVKQPIKPLSEPADSSSSQQQTLTTQSAAEQVPTVTPAATTATALPPPVPTIPSTAEAKIEGAAAPAIPASVISSQTVPFTSTVLASQTPLASTPAGGPTSQVPVLVTTAPPVTTESAQTVSLTGQPVAGSSAFAQSTVTAASTPVFGQALASGAAPSPFAQPTSSSVSTSANSSTGFGTSAFGATGGNGGFGQPSFGQAPLWKGPATSQSTLPFSQPTFGTQPAFGQPAASTATSSAGSLFGCTSSASSFSFGQASNTSGTSTSGVLFGQSSAPVFGQSAAFPQAAPAFGSASVSTTTTASFGFGQPAGFASGTSGSLFNPSQSGSTSVFGQQPASSSGGLFGAGSGGASTVGLFSGLGAKPSQEAANKNPFGSPGSSGFGSAGASNSSNLFGNSGAKAFGFGGTSFGDKPSATFSAGGSVASQGFSFNSPTKTGGFGAAPVFGSPPTFGGSPGFGGSPAFGTAAAFSNTLGSTGGKVFGEGTSAATTGGFGFGSNSSTAAFGSLATQNTPTFGSISQQSPGFGGQSSGFSGFGAGPGAAAGNTGGFGFGVSNPTSPGFGCWRS 2037 T 0.00071 NUP214 pdbpercent F Eukaryota T 7tem 1 A,B A,B A0A5P8YGV9_YERPE Putative exported protein YPO2471 SNAMKLLNTLVCIIGLTSFSSSAKLVNAEHLDALYQKVTVANKTELGLIHIYSEFPDYRWVKDPIEGVSAIDDVARAAIFYQRQYQATGSAADLEKVKSLVEFILYQRADNGYFYNFIYPDHSINKEYKTSVAEPNWWTWRALWALTQVYPTLVKTDNALAQRTRETIFATIDVIYKDFNFKQTRGEKEGVAVPEWLPHTAGDQASVLLMALSDAQALEAKPEIEKMMRSLAAGIMLMQVKDTSSPVNGAFLSWQNLWHGYGNSQAYALLVAGNRLGDRDMIKAAFNELDHFHPWLISNGLLNEFTVRQQGEKVTLIEQKKFSQIAYIIRPMVFANIKAWEISRDAVYLERAVDLSLWFFKNNPAQAQMYYPVTGIAFDGIDSATTVNKNSGAESTIEALLTLQLIESIPDAKRMLESALEKRNIKQ 427 T 0.15 Glyco_hydro_127 unppercent F Bacteria T 7tf6 2 D,E,I,J,K,M,P,Q,U,V,W,X D,E,I,J,K,M,P,Q,U,V,W,X Q53687_STAAU GLUTAMINE SYNTHETASE,GLUTAMINE SYNTHETASE REPRESSOR,HTH-TYPE TRANSCRIPTIONAL REGULATOR GLNR,MERR FAMILY TRANSCRIPTIONAL REGULATOR,GLUTAMINE SYNTHETASE REPRESSOR,TRANSCRIPTIONAL REGULATOR (NITROGEN METABOLISM),TRANSCRIPTIONAL REGULATOR,MERR FAMILY,MERR FAMILY PROTEIN, FEMC, FACTOR INVOLVED IN METHICILLIN RESISTANCE PINRGDLSRFI 11 T 0.75 EST1 pdbhh F Bacteria T 7tf9 2 AA,BA,D,E,J,K,L,M,U,V,W,X,Y,Z a,b,D,E,J,K,L,M,U,V,W,X,Y,Z C-tail peptide of Glutamine synthetase repressor QLPRF 5 T 31 LIM_bind pdbhh F F 7tfa 1 A,C,D,G,H,I,M,N,O,P,U,V A,C,D,G,H,I,M,N,O,P,U,V GlnR C-tail peptide LIQGELSRFF 10 T 3.5 DUF3146 pdbhh F T 7tfb 1 A,AA,C,E,G,H,I,M,O,P,Q,R,W,X A,a,C,E,G,H,I,M,O,P,Q,R,W,X GlnR C-tail peptide LIQGELSRFF 10 T 3.5 DUF3146 pdbhh F T 7tfc 2 AA,BA,D,E,J,K,L,M,U,V,W,X,Y,Z a,b,D,E,J,K,L,M,U,V,W,X,Y,Z GLNR_BACSU GlnR C-tail peptide TFRQGDMSRF 10 T 0.086 Dfp1_Him1_M unp F Bacteria T 7tfn 1 A,B,C A,B,C Q2N0S6_9HIV1 Envelope glycoprotein BG505 SOSIP.664 - gp120 AENLWVTVYYGVPVWKDAETTLFCASDAKAYETKKHNVWATHACVPTDPNPQEIHLENVTEEFNMWKNNMVEQMHTDIISLWDQSLKPCVKLTPLCVTLQCTNVTNNITDDMRGELKNCSFNMTTELRDKKQKVYSLFYRLDVVQINENQGNRSNNSNKEYRLINCNTSAITQACPKVSFEPIPIHYCAPAGFAILKCKDKKFNGTGPCPSVSTVQCTHGIKPVVSTQLLLNGSLAEEEVMIRSENITNNAKNILVQFNTPVQINCTRPNNNTRKSIRIGPGQAFYATGDIIGDIRQAHCNVSKATWNETLGKVVKQLRKHFGNNTIIRFANSSGGDLEVTTHSFNCGGEFFYCNTSGLFNSTWISNTSVQGSNSTGSNDSITLPCRIKQIINMWQRIGQAMYAPPIQGVIRCVSNITGLILTRDGGSTNSTTETFRPGGGDMRDNWRSELYKYKVVKIEPLGVAPTRCKRRVVGRRRRRR 481 T 3.4E-54 GP120 pdbpercent T Viruses T 7tfo 1 A,B,C A,B,C A0A6H1VH54_9PLVG Envelope glycoprotein BG505 SOSIP.664 - gp120 AENLWVTVYYGVPVWKDAETTLFCASDAKAYETKKHNVWATHACVPTDPNPQEIHLENVTEEFNMWKNNMVEQMHTDIISLWDQSLKPCVKLTPLCVTLQCTNVTNNITDDMRGELKNCSFNMTTELRDKKQKVYSLFYRLDVVQINENQGNRSNNSNKEYRLINCNTSAITQACPKVSFEPIPIHYCAPAGFAILKCKDKKFNGTGPCPSVSTVQCTHGIKPVVSTQLLLNGSLAEEEVMIRSENITNNAKNILVQFNTPVQINCTRPNNNTRKSIRIGPGQAFYATGDIIGDIRQAHCNVSKATWNETLGKVVKQLRKHFGNNTIIRFANSSGGDLEVTTHSFNCGGEFFYCNTSGLFNSTWISNTSVQGSNSTGSNDSITLPCRIKQIINMWQRIGQAMYAPPIQGVIRCVSNITGLILTRDGGSTNSTTETFRPGGGDMRDNWRSELYKYKVVKIEPLGVAPTRCKRRVVGRRRRRR 481 T 3.4E-54 GP120 pdbpercent T Viruses T 7tgg 2 B a Q74D22_GEOSL Geopilin domain 2 protein MKKIITIVAMLLAMQGIAIAAGKIPTTTMGGKDFTFKPSTNVSVSYFTTNGATSTAGTVNTDYAVNTKNSSGNRVFTSTNNTSNIWYIENDAWKGKAVSDSDVTALGTGDVGKSDFSGTEWKSQ 124 T 0.02 Mfp-3 pdbhh F Bacteria T 7tgh 12 L,W 3G,3g Q23F81_TETTS UQCRB MVRLEKILWEQLVNVKAFSRQRVIGAPSKWYNENRTEWFKVAQHNAFNTGFSGVILRALEPLLAKFIYRWRLDIAHQRGLTLEDSLLFMDRELRRCYFFETVARQNLHPYTVLFMKKRRARYYKVERGLRGFYVPDWVRKEAEERQLSETVDNIFNWENFVYREYMSDMTPIGRWTSLSKITPLDMFQYYGLFRNEAWDRFFYNEAFYESYSEKEKQEANGNPFGKFNLQTADGRAQFEKEVNTFIERYPFAVTKPGQKFDFTRFYALEDLANKRDTSKYDPALLESVKNELKQSAALPADNGANKTKKSKPILPDWLQPKFGKAFQA 328 T 0.69 DUF6322 pdb F Eukaryota T 7tgh 13 M,X 3H,3h I7M484_TETTS Transmembrane protein, putative MNVTGAGLTHVKDFHSDEMRVFRGGLRHIADKQGNLIYGSVNSSVRYYHDKMSYERGFIQHSRSPSNQFINFHFMLGGFRTYVLERFFKQVWYRRNIRTFWFPVLISYTSGCITMRMYDNNCYDYFYFSD 130 T 1.6 DUF5320 pdbpssm F Eukaryota T 7tgh 14 N,Y 3I,3i I7MM45_TETTS Transmembrane protein, putative MVYGKLIFNNIKEYTPSWIKTIPYSQVTKPILRKQPQIVGKINADPKVKKFWVFLRENVQYYPFLWQFFILGTSFVWFHVCYDPWLAIYQANNAHRSLETALTKEKAHKKKLAEQEESE 119 T 2 Selenoprotein_S pdbhh F Eukaryota T 7tgh 15 O,Z 3J,3j I7MFL6_TETTS Transmembrane protein, putative MYLPTFYKLFHETNAFRLKRYVGYGPLLLTWSIWTLYPALYNMIYSDFIPPERGVPKRIVDA 62 T 1.5 DUF5621 pdbhh F Eukaryota T 7tgh 16 P 3M UNK1 MESRSYMFSLAKKRSTLAA 19 T 5.9 Gryzun-like pdbhh F T 7tgh 17 AA 3l UNK2 AAAAAAAAEAAAAAAAAAAAAAAA 24 T 830 DUF936 pdbhh F F 7tgh 18 BA 3m UNK3 AAARAYKFALAKARAAA 17 T 25 GspH pdbhh F T 7tgh 20 DA 4L Q950Z5_TETTH Ymf58 MLTWISFWSLIFWLILIILVLKPKNFISILFMSELTWLALYCLSLLFGAIYCDITLLSISFFILGVAGLEFSFGILIAILYKNLNESLNTDLNNNNNNQNIFDKNFKTPLEKINWQ 116 T 0.0017 Oxidored_q2 pdbhh F Eukaryota T 7tgh 22 FA 5B Q951C2_TETTH Ymf57 MLKNKLIKFKFFRFVQSGFYVDFIFKKFSEMFIRNIFIYSSIFFGEKFMIEYLTKKTIDSFIFNNNRFNFINLVESKYFLQILTLILYLFFITIFILFYI 100 T 29 HEPN_AbiV pdbhh F Eukaryota T 7tgh 23 GA 6 Q950Y2_TETTH Ymf62 MFLITITSYFSNIIEFNSYIINLIDFITPLFFIENFVIQFFILYLFYLLIVNNNLYYILLYIFLEIVFFGLFLCLYQLELFTGFLWVAEFAIVFIAVVLLFYLNIDGLHLKYNHNINNVLYYTPSLVLFLIFFNIDYFSELELFLPLELSFIDIYDDYYEGFNNSIMNDFTPLTLSYYSINSAEFIIIGLLLLLGSVACVNLYKSNKNYTIVKQSNLLTMFDFFKDFINFSFIRKQDLNNQTNFNPSLRSIKKKY 255 T 8.3 DUF2070 pdbpssm F Eukaryota T 7tgh 26 JA A6 I7M2Y3_TETTS NADH dehydrogenase, putative MNHYWGSSNTIPASSTQNNNYFSGGGNNVTIRGNEIMERLPSQTPSQNMVQASMKTLRFYRKFCRLIPFILRIHNIGTKFTAQQAMINFGNYIRERNHYRDPGLIDHRIQLGYELLYEAEMHFSQHTILMQYLSPYNTPLSDRGYSYLEKVKYGNKSKFLQGFYKGNKPTEF 172 T 0.00098 Complex1_LYR_1 pdbhh F Eukaryota T 7tgh 27 KA A7 I7MIJ7_TETTS NDUA7 MRKALERFNEIIFNPAIRWYQLPKPTVRRTRYPAPGSEPINREVHQIDYKTAFRDSPHNIRYHHEIHTSDQTYHSSYDPVGETTTERLVRYGYLNKDQVNNAEAVAAAAKEFQEKEKRSPSNNIIIDEISNSDKPITKENRESVAHHVRQQFEFFREVNAEEVWSVSIEEKYNPELYIYKTYDMAADDPVWRQVKLDLEWTFENIAERRESLGYMPTFKGDPNFWQALDNSFSPENIAQVQSSIGDKVTNIDTKALALNHQTEEYHKTSKLVYPIRTNLVVE 282 T 1.1 Synaptobrevin pdb F Eukaryota T 7tgh 32 PA AM I7M2U4_TETTS NDUA13 MQFFRPDFIATQVLRRADMAHSPFHKAIHDLEDKRSKLFPDRRRIPGRKAKLLLAASLLLQMWGVGKIIEIKKFMKRRDIELKGLQRKAAPFMQSMNDVRHLALRERNDMLYNELLSVHGEEYAQKMQKRFHQTDIWAPFRHRYAYMYNSSNKNVKDYKQVTLSRYINGFDKFNV 175 T 0.00014 GRIM-19 pdbpercent F Eukaryota T 7tgh 34 RA B8 I7M855_TETTS NDUB8 MALRRVLKNQFNLIHKGQAQAVRGGHGWDRPDVPLSFNPLYVHKRELSIFDTNMWMYDQVYPEYVISYNEIHLVDQWKGLKESFSQSAYWWAMMAMVFGFYFINTTPRQLGIDTNDLKGFLGEYYGQYKKRSGIRSNFLGLDVTGENSIIQPNYDRKNGIRDVIDSLNADAGKRKLINLEAKNFIERVEKECEQRILKKGGATQSHH 207 T 0.23 Spore_YhaL pdbpercent F Eukaryota T 7tgh 35 SA BL Q23KG0_TETTS NDUB10 MAFGGFRQTDNSLIIDDRRKIILNTRSLNDFQQKIYLRNFFTNYRPDLSSYDYFAFKEKLRIGELFLNEYRKRINNEVRRAAILTPTSSLREKMNHKIADQILDLSSPHVRGAHFQAVRSWTDASKIVNYVEEKQTKINKYGLQFPLLGNMTEEQCASKEDEVYQRLLKEMQKPPKKASEPVEESSDE 188 T 0.12 CMV_1a pdb F Eukaryota T 7tgh 36 TA C UNK4 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 209 F F F 7tgh 40 XA TD Q22DC2_TETTS Transmembrane protein, putative MNLPWFVRWGTDVALFFIPAYTFANYPTTFFVFAAEKRRQRRRKDFSDVKLRDDAAFSVDQVKQLQTKLHLKQ 73 T 1.6 HIND pdbhh F Eukaryota T 7tgh 45 CB A1 I7MI60_TETTS NDUA1 MVNTAYPTPLKTILKTTPAFVVYFVFGLGFSTVIYDVVYHPKDRIERFYFRSSKFERLSRKRDEKLRHYFKPAIEWQPWYNTSTNNNTRPLLRY 94 T 0.15 KCT2 pdbhh F Eukaryota T 7tgh 48 FB T5 I7LT77_TETTS Transmembrane protein, putative MFLYKKILSIYKQSFSFFLSFNFSFFLYALLAIFLLINFCQHIHKFLYYCKEKIQKEMQNAYPEITDQHREFLKKQGLKVYEPKPLPDQINPFSKTYWITNAFIIGVSFLARRHALKVGAPRIFWSGCIVGVPLAAIISRGKSDQLDELVGARKTLEQKLEYAPITRRAWERALATNQEYQNEIKTQIQDLQAEIAAKKVAAKLE 205 T 0.02 FlxA pdb F Eukaryota T 7tgh 49 GB R UNK5 MNIAWKELENDAFKAKDIAKFSFSNASNLANFVAESQALATNHFNTALNNGFNVFFAVAGLLVVGVLVYIFFNSVGGMIIRSRIKAAQPNPNQVKVLVMPFVALGVSLVISRAGINGDDFGYKG 124 T 0.012 DUF3611 pdb F T 7tgh 57 OB B9 Q233X7_TETTS NDUB9 MSKAYYFVKNFSWAEVSNLLCYGTKYPTVLNHQQKVTRLYRATLRRVYAHQVEGYKTDFKQYNENITDIGKDFNKMLALKPESLELQAYFKKYEDLQEELFDPAMIIDESRPYAASSGRYYIFDDYLLKFDPFGFYSPKLLSENRPEEAMPFYEDYPQNDSHWNLWEQFPEDFEDSNAEREAILKSNKH 189 T 0.005 Complex1_LYR pdbpssm F Eukaryota T 7tgh 59 QB A8 I7MMF4_TETTS NDUA8 MSILNLIKNVLNMLINIYIFVQYFKQLKYNNKVGLIYQNDIYECLYVYMNESNIFIQCQEYVSVFFNAIRKEKEKEKDQFDRQIDKQRKKQRLLKKANQILKRISKMNTKSFEVLIHSQYAFDVCREQVYNFEDCRQTDTPLPKDPIHCKAQAKEVLSCYKEAEKMDPICLSSFNDSRECMFKSDGNLYNCKTWINQYVTCQKNPAAFAEFLEASTAEQLKSKKFDFVKNRGHSDKYL 238 T 0.002 Cmc1 pdbpercent F Eukaryota T 7tgh 60 RB TB Q22T55_TETTS Transmembrane protein, putative MFWRNVVRGLNCQQALRRQNFAKNITTTDIPKDSHHFAAKRSGFTQTEQAPFAYNDVYQYPKDYKPWNYNYKGNGVLLALFLGSAFSLVAYERSYASKTGRYQRKVQQNYYQI 113 T 0.055 Ncstrn_small pdbpercent F Eukaryota T 7tgh 63 UB B6 Q231G0_TETTS NDUB6 MLLIEMAFNAMKMKIFSLRKIKVKSKEQYLYNYQQKLLILGQGKEKNNKQYKKDIEMGGFQKYPIPRYLHVGQWIVNKNWKWNTFHMFFPTAILCFMVWRNSMISTAKPPNYGEYVDPQSPVAPKAIKY 129 T 0.0021 TMEM117 pdb F Eukaryota T 7tgh 64 VB BM Q22Z32_TETTS Transmembrane protein, putative MNPRNIFNLAKKVQNFNSITQKAFKRFGGAAAHHDDHHDDHHGHGGHGYEVHLVKDKNLIGNKSFKDDLVAVYGFTDVNDHHHHDETDPYHHLRGVPTLSFERMYFADAYYHDDTHEGLMNEPHGYLTMDDPMDLRPNYEKSALELLFLVSGGAILALMLGYQGLNLANPAESLFSLNTAAEEIEDKIRQIRIDNDKLLQRKAQLEEELASLNN 214 T 0.022 MctB pdbhh F Eukaryota T 7tgh 65 WB C4 Q22W63_TETTS NDUC2 MSSMLIWGACFGLFTRAAACKASMIPLTTSPWKYPKYMIVSAVTFYYFDWYRRMALEQLCYNEEKLERYQIRAKLQSLKIGEELSDAYRESFFEHAVQKNNI 102 T 0.0018 NDUF_C2 pdbhh F Eukaryota T 7tgh 66 XB AN Q24F24_TETTS Transmembrane protein, putative MELNSSAKEDSHYVGVLGYPSQHDPHTLHPKKHDSTFTKVYACRDMLWDHHWEVRNTLYAGFKGALLGVAYASGFGLISKTVPSIVLKKMFRFVRNNNFGHIRIMQDLLTPYALTGFGLGSVYYLYQHNVWENRSNKWLAEVLSNALFFQVATAVCVNPGFHIYGMVGGILFGTLKYAFYNSSFFQEKESIGSYTTFGDLSEEERKKQEYKDYIQFLGNYHKVRNGQLVDL 231 T 9.3 ENOD93 pdbhh F Eukaryota T 7tgh 67 YB B4 NDUB4 AAPPAAFFLYFFVPDNFPSAQSGFRTASRNPFQVQFVFAYDNWEYKYCGQWWSMGSLAVNVLFFVVPLFLWLILQTQSDQSSNRDDNSLTFYFSNAGFFFFQIYTNTG 108 T 0.021 DUF1579 pdbpercent F T 7tgh 68 ZB T4 I7MIE0_TETTS NDUTT4 MGGDHHHEDSHHKSNVDQHELKAEMIKELSHYYDHHDLSLFGKVQHFVEHLLEEKHHAKINTSNFDQKKLENFSESKQISRTVFALKKIKTFNHDFFTSEEEMILEPLPLGILTYGLKYAFAGVDAALLTYFWRNWNFNVRTIGLLGGLVGIQMATLHIPNLVNEVVIQTPRRRALAKKYISAYGPQFFHDIVNPKYDIEHLRHLQNKLNPY 212 T 0.17 PfUIS3 pdb F Eukaryota T 7tgh 69 AC T8 Q22SC4_TETTS NDUTT8 MTHQFENVLLSNRKNLTPQESVQKVINYALLQDAKQRSRTLRHIKASWVIPALLFTYPAWYLAKGAVNGVWSNIHPTDKVTLSFANIGRPFRLIYRPEIFLRDQQAKFIQLEKEHIEKSKKGEFVETTSPLVLWN 135 T 4 DUF1852 pdbhh F Eukaryota T 7tgh 70 BC B2 I7MG29_TETTS NDUB2 MSLRKGTSIFSRQFKKAFNDAKYQNLTAAQGETYSHLGWISNVDLRLGRAIFTFGVVGIAFCIYLEPSYFHETFGHMSQPPKYDLIDSNINGVEKKLNKQILHREHNEHKLDGFVSMFKGSDVAKN 126 T 6 Biopterin_H pdbhh F Eukaryota T 7tgh 71 CC T3 I7LUQ4_TETTS NDUTT3 MSGLLRNFEKLVCQSQLSKAGHKLLLRSPNSTLHPTAFYYKRNSSQRLANEMDVFQLGLAAAALTRQANNYAQLLDQVDKEAVREEVQERITQNHSDLNVYFGEILSLFKIGKKECPVQTVADISYVLAFGPIQVPNAAAIITENLLPVLKEKLDYASIHNLQDILSAFVKLNYVSDKELLKRLITALSQKDFPNQLQPVTNHAWNIDQYEYSDCNSWNIVSCGDNTFEKYIHEGGCENSLAKAKFAVHELLDHISFNFVNPFLFRENRINHRFAKRNADLDHEVLMQTLSKLQEIVPETSEAIATIKARL 311 T 0.034 Baculo_F pdb F Eukaryota T 7tgh 72 DC P1 Q24C39_TETTS Transmembrane protein, putative MIARRLFKRSLYYIPRAGFGGGDIRHKFSNEITDDDYDYQRAMHVKPPKEESLFQLTNILSSVPVFKTRFFLDFIARNLDTNSAVSTSDFVAPPRVHENSFFVYHSRELGNVIRKYRSLESIVLPGALLTFTYPLFAAFVAIPSYYFMFNAKIYEMSRRFVVRMDVLPHLEMISVQRIGAFGILYTKLHRIQDLEYVPFDQVKEQENYLWAIGGHGVDNQLIFKDRSTGEFFYFERQGVWDAKGLNHPLLN 251 T 0.7 TMEM70 pdbhh F Eukaryota T 7tgh 73 EC B3 A4VD20_TETTS Transmembrane protein, putative MNSPQKVAQGAGRKLFKHYINENIKSNNEQKLFFYRVNRWRWNTKDNTTAPKFLRLKYPLLVTGVCLFAYDWTYGFTQVDAHH 83 T 0.086 RGS pdbpssm F Eukaryota T 7tgh 74 FC TA I7MAF0_TETTS NDUTT10 SPAPPTIGQVELEPFKFNHERDQLIYGYTMEELYGKKFGLKHSATVLREIKKDTIMMILFIIGGFTYCYHMRETRFQLDDDFNEYVNTNKQTFRPIPDHVKL 102 T 0.028 Viral_Beta_CD pdbpssm F Eukaryota T 7tgh 76 HC TC Q22E95_TETTS NDUTT12 MVYQGFKVLRRNPTFYNPRSAGMVALSYFAYSYYVNKYYKPQNSNFEEYNSSHPHNHDEKVRQYHEKTNQAIRDAVLEKRAEHDQRLREEAKL 93 T 0.015 Hrs_helical pdbhh F Eukaryota T 7tgh 77 IC P2 Q23KE0_TETTS NDUPH2 MFNILKGAQLSFRSITNKSVNNYYNIMRQVSLDSNPIVLYQSSTFTGNGLQEFYENADALTKYLKLVPFFLEKNLYDHPKQFVIKMEFHPQNKVLSLDCLTHQGVLKKTVNLENLIPVPYEDYVQFCRRKLFNAPLFLDTEMIYFNTFQNEFYVFDKNAKWNEEGINHPELDISKLYNEKAWFDSLRII 189 T 0.26 TMEM70 pdbhh F Eukaryota T 7tgh 78 JC A3 I7M9B3_TETTS Transmembrane protein, putative MSNNNQGDFFVDKYNFSRRVVDHRQPYDLNFSINNPVGSRVWFKAWKQKAIGNFLNLVGVHYAFYGAGFCLLFVLADAWGREKYAQPYKSQILHGRQPFGHTFVQNYRNQATDLGRWNHNFACYEKQPGCGRDFD 135 T 8.3 DUF983 pdbhh F Eukaryota T 7tgh 79 KC T9 Q23B10_TETTS NDUTT9 MFLNPVKDSEFDDEVKGFVPSEGEVRFVANKNKECGYYLQGIEQCRRKMVQLAGDSSSQFHSLGFLPCKRLVDAHYRCMTDDKFGSTIEEVPEIGLDSAQKFFDCTFQQLKPMQSCRRFFDQVVRDVYRANGSQL 135 T 0.22 COX17 pdb F Eukaryota T 7tgh 80 LC TE I7MIK1_TETTS Transmembrane protein, putative MDKYIQQAKCAYNFSLKAVRFVGPLNIVFAGVAFLMFYENNYKKLYLNPRYSYTMPYLQSAKITKNLYEKL 71 T 0.21 YpmT pdbhh F Eukaryota T 7tgh 81 MC T7 Q22HE4_TETTS Transmembrane protein, putative MTNFGSPFRNTDSGIVIRDPENEKRLKLAFQNFWKSKQEDKEFQAQIKTAVSKDTVNFMFYASPLFGALLGKTYIDMFCNPRYFYFRAFTLSMFALAGYCVGNGFRNRYEHSLYTRNYHLFPKDLQDALVNGDARYCISWWKQ 143 T 0.053 AHH pdbpssm F Eukaryota T 7th0 1 A A RPNA_ECOLI Recombination-promoting nuclease RpnA MTIAERLRQEGEQSKALHIAKIMLESGVPLADIMRFTGLSEEELAAASQ 49 T 0.00037 DUF2802 pdbhh F Bacteria T 7tho 5 I,J M,N Eptifibatide XXGDWPCX 8 T 5.3 Ferlin_C pdbhh F T 7tit 3 M,N,O,P M,N,O,P tropomyosin model XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 135 F F F 7tj7 3 S,T,U,V S,T,U,V tropomyosin model XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 135 F F F 7tjl 1 A A De novo designed protein, SEWN0.1 AGPEEHKARVEEYMRRALQATTEPEKKYWEEEAKKEIEQAMYADALINPIRFTEKAAKYIKTYGFRGQEAYDQVKKEMFEKLYKYFMEKLKSE 93 T 0.088 DUF5759 pdb F T 7tl6 1 A,B A,B METRN_MOUSE HYPOXIA/REOXYGENATION REGULATORY FACTOR GYSEDRCSWRGSGLTQEPGSVGQLTLDCTEGAIEWLYPAGALRLTLGGPDPGTRPSIVCLRPERPFAGAQVFAERMTGNLELLLAEGPDLAGGRCMRWGPRERRALFLQATPHRDISRRVAAFRFELHEDQRAE 134 T 32 MORN_2 pdbhh F Eukaryota T 7tl7 2 E,F,G,H a,b,c,d peptide Sa-D2 XXRYEXYKXECPKCX 15 T 0.045 DUF983 pdbhh F T 7tl8 2 B B Peptide Sa-D3 XXQVTVWWAXPWEDC 15 T 1.8 HRCT1 pdbhh F T 7tlh 1 A,B,C,D B,C,A,D METRN_MOUSE HYPOXIA/REOXYGENATION REGULATORY FACTOR MSPQAQGLGVDGACRPCSDAELLLAACTSDFVIHGTIHGVAHDTELQESVITVVVARVIRQTLPLFKEGSSEGQGRASIRTLLRCGVRPGPGSFLFMGWSRFGEAWLGCAPRFQEFSRVYSAALTTHLNPCEMALD 136 T 8.8E-05 TIMP pdbhh F Eukaryota T 7tlj 4 D,H D,H 14KD_CERSP 14 kDa peptide of ubiquinol-cytochrome c2 oxidoreductase complex MFSFIDDIPSFEQIKARVRDDLRKHGWEKRWNDSRLVQKSRELLNDEELKIDPATWIWKRMPSREEVAARRQRDFETVWKYRYRLGGFASGALLALALAGIFSTGNFGGSSDAGNRPSVVYPIE 124 T 0.052 MtrF pdbpercent F Bacteria T 7tls 1 A A helical peptide LXAXLXQXL 9 T 12 SpoVAB pdbhh F F 7tlu 1 A A helical peptide LXSXLXQXL 9 T 13 SpoVAB pdbhh F F 7tlw 1 A A METRL_MOUSE SUBFATIN LMSGQRGLDLHVLSAPCRPCSDTEVLLAICTSDFVVRGFIEDVTHVPEQQVSVIYLRVNRLHRQKSRVFQPAPEDSGHWLGHVTTLLQCGVRPGHGEFLFTGHVHFGEAQLGCAPRFSDFQRMYRKAEEMGINPCEINME 140 T 7.8E-05 TIMP pdbhh F Eukaryota T 7tm1 1 A,B,C A,B,C helical peptide LXMXLXQXL 9 T 4.8 SpoVAB pdbhh F F 7tm2 1 A A helical peptide LXAXLXQXL 9 T 12 SpoVAB pdbhh F F 7tm9 1 A,B,C,D,E,F,G,H A,B,C,D,E,F,G,H A0A0H3GK94_KLEPH Bacterial alkaline phosphatase MAHHHHHHSPVIHAETTAAPVLENRAAQGDITTPGGARRLTGDQTEALRASLINKPAKNVILLIGDGMGDSEITAARNYAEGAGGFFKGIDALPLTGQYTHYSLDKKTGKPDYVTDSAASATAWTTGVKTYNGALGVDIHENAHQTILELAKAAGLATGNVSTAELQDATPAALVAHVTSRKCYGPTVTSEKCPSNALEKGGKGSITEQLLNARPDVTLGGGAKTFAETATAGEWQGKTLREQAQARGYQIVTDAASLAAATEASQDKPLLGLFADGNMPVRWEGPKASYHGNIDKPPVTCTPNPKRDASVPTLAQMTEKAIDLLSRNEKGFFLQVEGASIDKQDHAANPCGQIGETVDLDEAVQKALEFARKDGNTLVIVTADHAHASQIIPADSKAPGLTQALNTHDGAVMVMSYGNSEEESMEHTGTQLRIAAYGPHAANVVGLTDQTDLFTTMKAALSLK 464 T 2.1E-09 Alk_phosphatase unppssm F Bacteria T 7tma 1 A A helical peptide LXAMLXQXL 9 T 18 DUF5586 pdbhh F F 7tme 1 A A helical peptide LXADLXQXL 9 T 7.9 COE1_HLH pdbhh F F 7tmh 1 A A helical peptide LXSXLMQXL 9 T 5.3 DUF2811 pdbhh F F 7tmi 1 A A helical peptide LXAHLXQXL 9 T 12 CoV_NSP4_C pdbhh F F 7tmj 1 A A helical peptide LXAXLMQXL 9 T 21 DUF6439 pdbhh F F 7tmk 1 A,B A,C helical peptide LXAXLMQXL 9 T 21 DUF6439 pdbhh F F 7tml 1 A A helical peptide LXAXLXQXL 9 T 12 SpoVAB pdbhh F F 7to7 2 C,F C,F 1xAcK.4xE (monoAcK.4xE) EEALLLAXLYHFGEE 15 T 7.1 Gal_mutarotas_2 pdbhh F T 7to8 2 C C 2xAcK.1 (diAcK.1) AQRSLXLLXHLYHG 14 T 6.2 WhiA_N pdbhh F T 7to9 2 C C 2xAcK.4xE (diAcK.4xE) EEAQRSLXLLXHLYHGEE 18 T 10 WhiA_N pdbhh F T 7toa 2 C D 3xAcK.1 (triAcK.1) RSLXLLXHLXH 11 T 7.7 Bindin pdbhh F F 7tod 1 A A A0A2S6F4N3_LEGPN SETA SDEKIKTAHDLIDEIIQDVIQLDGKLGLLGGNTRQLEDGRVINIPNGAAMIFDDYKKYKQGELTAESALESMIKIAKLSNQLNRHTFFNQRQPETGQFYKKVAAIDLQ 108 T 0.36 CUB_2 unppercent F Bacteria T 7tok 1 A,B A,B A0A5P6A8B9_FLAJO Acetylxylan esterase I PPEPGLAQNTLRQIIKVSLGGKQIRMRFSNLFSDQPAVLKSVSVANVTEAPAVDIKTQKILSFKGSPQVTLGADEVMYSDAFDFELQPGQLLAITIHYGEISSNVSGHPGSRTTSYILQGDHINNESFAGAVKTDHWYSIMGVDISSVKN 150 T 2.3 PCuAC pdbhh F Bacteria T 7too 4 D AGR ALS-FTD-associated dipeptide repeat protein, GR20 GRGRGRGRGRGRGRGRGRGRGRGRGRGRGRGRGRGRGRGR 40 T 6400 Exonuc_VII_L pdbhh F F 7too 15 O AL12 Ribosomal protein L12 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 155 F F F 7top 14 N AL12 Ribosomal protein L12 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 155 F F F 7toq 47 UA ALP0 60S acidic ribosomal protein P0 RATWKSNYFLKIIQLLDTMMRKAIRGH 27 T 2.7 Holin_SPP1 pdbhh F T 7tor 48 VA ALP0 60S acidic ribosomal protein P0 RATWKSNYFLKIIQLLDTMMRKAIRGH 27 T 2.7 Holin_SPP1 pdbhh F T 7tor 83 EC,FC GR1,GR2 GR20, ALS/FTD dipeptide repeat protein GRGRGRGRGRGRGRGRGRGRGRGRGRGRGRGRGRGRGRGR 40 T 6400 Exonuc_VII_L pdbhh F F 7tpp 3 C C FA5_HUMAN ACTIVATED PROTEIN C COFACTOR,PROACCELERIN,LABILE FACTOR AQLRQFYVAAQGISWSYRPEPTNSSLNLSVTSFKKIVYREYEPYFKKEKPQSTISGLLGPTLYAEVGDIIKVHFKNKADKPLSIHPQGIRYSKLSEGASYLDHTFPAEKMDDAVAPGREYTYEWSISEDSGPTHDDPPCLTHIYYSHENLIEDFNSGLIGPLLICKKGTLTEGGTQKTFDKQIVLLFAVFDESKSWSQSSSLMYTVNGYVNGTMPDITVCAHDHISWHLLGMSSGPELFSIHFNGQVLEQNHHKVSAITLVSATSTTANMTVGPEGKWIISSLTPKHLQAGMQAYIDIKNCPKKTRNLKKITREQRRHMKRWEYFIAAEEVIWDYAPVIPANMDKKYRSQHLDNFSNQIGKHYKKVMYTQYEDESFTKHTVNPNMKEDGILGPIIRAQVRDTLKIVFKNMASRPYSIYPHGVTFSPYEDEVNSSFTSGRNNTMIRAVQPGETYTYKWNILEFDEPTENDAQCLTRPYYSDVDIMRDIASGLIGLLLICKSRSLDRRGIQRAADIEQQAVFAVFDENKSWYLEDNINKFCENPDEVKRDDPKFYESNIMSTINGYVPESITTLGFCFDDTVQWHFCSVGTQNEILTIHFTGHSFIYGKRHEDTLTLFPMRGESVTVTMDNVGTWMLTSMNSSPRSKKLRLKFRDVKCIPDDDEDSYEIFEPPESTVMATRKMHDRLEPEDEESDADYDYQNRLAAALGIR 709 F F Eukaryota T 7tpq 2 B C FA5_HUMAN ACTIVATED PROTEIN C COFACTOR,PROACCELERIN,LABILE FACTOR AQLRQFYVAAQGISWSYRPEPTNSSLNLSVTSFKKIVYREYEPYFKKEKPQSTISGLLGPTLYAEVGDIIKVHFKNKADKPLSIHPQGIRYSKLSEGASYLDHTFPAEKMDDAVAPGREYTYEWSISEDSGPTHDDPPCLTHIYYSHENLIEDFNSGLIGPLLICKKGTLTEGGTQKTFDKQIVLLFAVFDESKSWSQSSSLMYTVNGYVNGTMPDITVCAHDHISWHLLGMSSGPELFSIHFNGQVLEQNHHKVSAITLVSATSTTANMTVGPEGKWIISSLTPKHLQAGMQAYIDIKNCPKKTRNLKKITREQRRHMKRWEYFIAAEEVIWDYAPVIPANMDKKYRSQHLDNFSNQIGKHYKKVMYTQYEDESFTKHTVNPNMKEDGILGPIIRAQVRDTLKIVFKNMASRPYSIYPHGVTFSPYEDEVNSSFTSGRNNTMIRAVQPGETYTYKWNILEFDEPTENDAQCLTRPYYSDVDIMRDIASGLIGLLLICKSRSLDRRGIQRAADIEQQAVFAVFDENKSWYLEDNINKFCENPDEVKRDDPKFYESNIMSTINGYVPESITTLGFCFDDTVQWHFCSVGTQNEILTIHFTGHSFIYGKRHEDTLTLFPMRGESVTVTMDNVGTWMLTSMNSSPRSKKLRLKFRDVKCIPDDDEDSYEIFEPPESTVMATRKMHDRLEPEDEESDADYDYQNRLAAALGIR 709 F F Eukaryota T 7tpt 9 AA,BA,CA,DA,EA,FA,GA,HA,IA,JA,V,W,X,Y,Z f,g,h,i,j,k,l,m,n,o,a,b,c,d,e Phalloidin WXAXCPA 7 T 3.6 DUF6083 pdbhh F F 7tqs 1 A L pAbC-3 light chain XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 96 F F F 7tqs 2 B H pAbC-3 heavy chain XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 106 F F F 7tqt 1 A L pAbC-5 light chain XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 98 F F F 7tqt 2 B H pAbC-5 heavy chain XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 108 F F F 7tqu 1 A H pAbC-1 heavy chain XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 112 F F F 7tqu 2 B L pAbC-1 light chain XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 102 F F F 7tr6 2 B,C,D,E,F D,E,F,G,H Q8U332_PYRFU Cas11a GGWIRNIGRYLSYLVDDTFEEYAYDVVDGIAKARTQEELLEGVYKALRLAPKLKKKAESKGCPPPRIPSPEDIEALEEKVEQLSNPKDLRKLAVSLALWAFASWNNCP 108 T 0.00011 Cas_Csa5 pdbhh F Archaea T 7tr8 3 C,D,E,F,G D,E,F,G,H Q8U332_PYRFU Cas11a GGWIRNIGRYLSYLVDDTFEEYAYDVVDGIAKARTQEELLEGVYKALRLAPKLKKKAESKGCPPPRIPSPEDIEALEEKVEQLSNPKDLRKLAVSLALWAFASWNNCP 108 T 0.00011 Cas_Csa5 pdbhh F Archaea T 7tr9 2 B,C,D,E,F D,E,F,G,H Q8U332_PYRFU Cas11a GGWIRNIGRYLSYLVDDTFEEYAYDVVDGIAKARTQEELLEGVYKALRLAPKLKKKAESKGCPPPRIPSPEDIEALEEKVEQLSNPKDLRKLAVSLALWAFASWNNCP 108 T 0.00011 Cas_Csa5 pdbhh F Archaea T 7tra 3 C,D,N,O,P C,D,E,F,G Q8U332_PYRFU Cas11a GGWIRNIGRYLSYLVDDTFEEYAYDVVDGIAKARTQEELLEGVYKALRLAPKLKKKAESKGCPPPRIPSPEDIEALEEKVEQLSNPKDLRKLAVSLALWAFASWNNCP 108 T 0.00011 Cas_Csa5 pdbhh F Archaea T 7tsq 2 C,D C,D CDND2_ENTCL CGAS/DNCV-LIKE NUCLEOTIDYLTRANSFERASE,CD-NTASE038 KPAEPQKTGRFA 12 T 16 Rrp44_CSD1 pdbhh F Bacteria T 7tsx 2 C,D C,D CDND2_ENTCL CGAS/DNCV-LIKE NUCLEOTIDYLTRANSFERASE,CD-NTASE038 KPAEPQKTGRFA 12 T 16 Rrp44_CSD1 pdbhh F Bacteria T 7tta 1 A A A0A2P9IBF7_9ACTN Putative cytochrome P450 hydroxylase EHRVLHLRDRLDLAAELKLLCERGPLVRIPLEDGSAVHWFALGYDVVREVLGSEKFDKRVIGTHFNHQEMALPGNLLQLDPPEHTRLRRMVAPAYSVRRMQALEPRVQAIVDDHLDTMASTGPPVEFLREVAGPMAARVACEFLGIPLDDRGELIRLTAHRGGKRRRVLNGHAYLAYMRELAARLRRDPGDGMLGMVARDHGADISDEELAGLCAVVMNSSVEQTESCLAAGTLLLLEHPEQFALLRERPELGEQAVEEIVRYLSVFEGLDPRTATEDVEIGGQVIKKGEAVFCSLLAANRADPALDGFDITRKESRHVAFGHGIHHCLGAPLARMELRIAFTTLVSRFPSLRTAVPAEEIRFRPPSSNVFTLLELPLTW 380 T 1.4E-27 p450 unppssm F Bacteria T 7ttb 1 A A A0A2P9IBF7_9ACTN Putative cytochrome P450 hydroxylase MVAPEHRVLHLRDRLDLAAELKLLCERGPLVRIPLEDGSAVHWFALGYDVVREVLGSEKFDKRVIGTHFNHQEMALPGNLLQLDPPEHTRLRRMVAPAFYSVRRMQALEPRVQAIVDDHLDTMASTGPPVEFLREVAGPMAARVACEFLGIPLDDRGELIRLTAHRGGKRRRVLNGHAYLAYMRELAARLRRDPGDGMLGMVARDHGADISDEELAGLCAVVMNSSVEQTESCLAAGTLLLLEHPEQFALLRERPELGEQAVEEIVRYLSVFEGLDPRTATEDVEIGGQVIKKGEAVFCSLLAANRADPALDGFDITRKESRHVAFGHGIHHCLGAPLARMELRIAFTTLVSRFPSLRTAVPAEEIRFRPPSSNVFTLLELPLTW 385 T 1.4E-27 p450 unppssm F Bacteria T 7tto 1 A A A0A2P9IBF7_9ACTN cytochrome P450 hydroxylase MVAPEHRVLHLRDRLDLAAELKLLCERGPLVRIPLEDGSAVHWFALGYDVVREVLGSEKFDKRVIGTHFNHQEMALPGNLLQLDPPEHTRLRRMVAPAYSVRRMQALEPRVQAIVDDHLDTMASTGPPVEFLREVAGPMAARVACEFLGIPLDDRGELIRLTAHRGGKRRRVLNGHAYLAYMRELAARLRRDPGDGMLGMVARDHGADISDEELAGLCAVVMNSSVEQTESCLAAGTLLLLEHPEQFALLRERPELGEQAVEEIVRYLSVFEGLDPRTATEDVEIGGQVIKKGEAVFCSLLAANRADPALDGFDITRKESRHVAFGHGIHHCLGAPLARMELRIAFTTLVSRFPSLRTAVPAEEIRFRPPSSNVFTLLELPLTW 384 T 1.4E-27 p450 unppssm F Bacteria T 7ttp 1 A A A0A2P9IBF7_9ACTN cytochrome P450 hydroxylase MVAPEHRVLHLRDRLDLAAELKLLCERGPLVRIPLEDGSAVHWFALGYDVVREVLGSEKFDKRVIGTHFNHQEMALPGNLLQLDPPEHTRLRRMVAPAYSVRRMQALEPRVQAIVDDHLDTMASTGPPVEFLREVAGPMAARVACEFLGIPLDDRGELIRLTAHRGGKRRRVLNGHAYLAYMRELAARLRRDPGDGMLGMVARDHGADISDEELAGLCAVVMNSSVEQTESCLAAGTLLLLEHPEQFALLRERPELGEQAVEEIVRYLSVFEGLDPRTATEDVEIGGQVIKKGEAVFCSLLAANRADPALDGFDITRKESRHVAFGHGIHHCLGAPLARMELRIAFTTLVSRFPSLRTAVPAEEIRFRPPSSNVFTLLELPLTW 384 T 1.4E-27 p450 unppssm F Bacteria T 7ttq 1 A A A0A2P9IBF7_9ACTN cytochrome P450 hydroxylase MVAPEHRVLHLRDRLDLAAELKLLCERGPLVRIPLEDGSAVHWFALGYDVVREVLGSEKFDKRVIGTHFNHQEMALPGNLLQLDPPEHTRLRRMVAPAYSVRRMQALEPRVQAIVDDHLDTMASTGPPVEFLREVAGPMAARVACEFLGIPLDDRGELIRLTAHRGGKRRRVLNGHAYLAYMRELAARLRRDPGDGMLGMVARDHGADISDEELAGLCAVVMNSSVEQTESCLAAGTLLLLEHPEQFALLRERPELGEQAVEEIVRYLSVFEGLDPRTATEDVEIGGQVIKKGEAVFCSLLAANRADPALDGFDITRKESRHVAFGHGIHHCLGAPLARMELRIAFTTLVSRFPSLRTAVPAEEIRFRPPSSNVFTLLELPLTW 384 T 1.4E-27 p450 unppssm F Bacteria T 7tud 3 C P EEFGRC peptide EEFGRC 6 T 0.41 CHGN pdbhh F F 7tud 4 D Q GL dipeptide GL 2 T 470 Tachykinin pdbhh F F 7tuj 1 A A SLX4_HUMAN BTB/POZ DOMAIN-CONTAINING PROTEIN 12 AQMPSAGGAQKPEGLETPKGANRKKNLPPKVPITPMPQYSIMETPVLKKELDRFGVRPLPKRQMVLKLKEIFQYTHQTLDSDSEDE 86 T 1.9 Endonuc-dimeris pdbhh F Eukaryota T 7tv0 2 E,F E,G VEMP_SARS2 Envelope small membrane protein XPSFYVYSRVXN 12 T 0.56 CoV_E pdbhh T Viruses T 7tv5 1 A A W5IDB3_LASLA Lasiocepsin GLPRKILCAIAKKKGKCKGALKLVCKCX 28 T 0.035 Defensin_2 pdbpssm F Eukaryota T 7tv6 1 A A W5IDB3_LASLA Lasiocepsin heterogeneous-backbone proteomimetic analogue GLXRKXLCAXAKXKGKCKGAXKLXCKCX 28 T 1.3 Antimicrobial_1 unphh F Eukaryota T 7tv7 1 A A W5IDB3_LASLA Lasiocepsin heterogeneous-backbone proteomimetic analogue GLXRKXLCAXAKXKGXCXGAXKLXCKCX 28 T 1.3 Antimicrobial_1 unphh F Eukaryota T 7tv8 1 A A W5IDB3_LASLA Lasiocepsin heterogeneous-backbone proteomimetic analogue GLXRKXLCAXAKXKXKCKXAXKLXCKCX 28 T 1.2 Antimicrobial_1 pdbhh F Eukaryota T 7tvh 1 A,B,C,D A,B,C,D TSE1_PSEAE TYPE VI SECRETION EXPORTED 1 MDSLDQCIVNACKNSWDKSYLAGTPNKDNCSGFVQSVAAELGVPMPRGNANAMVDGLEQSWTKLASGAEAAQKAAQGFLVIAGLKGRTYGHVAVVISGPLYRQKYPMCWSGSIAGAVGQSQGLKSVGQVWNRTDRDRLNYYVYSLASCSLPRASLEHHHHHH 162 T 3.2E-05 Amidase_6 unphh F Bacteria T 7txf 2 F,G,H F,G,H CDKB_CONVX VX20.2,VXXIIB TRMCGSMSCPRNGCTCVYHWRRGHGCSCPG 30 T 6.1 IGFL pdbhh F Eukaryota T 7txj 2 C A A7WKI9_9VIRU MCP1 MAGKKRRLSQASVLRYYAKRFTMNVGTTAHVLGKEVAGNPWVAKAIDKLSYQETYNWISDYQASHLAKQVAKQVAEKYGIPPTFQGLLMAYAEKVVANYILDYKGESLTQMHDNYLYELMQKMPIAPTGTSSGYIYVFIGKDGKTHTVDMSKVLTDIEDALLKRA 165 T 0.2 RP1-2 pdb T Viruses T 7txj 3 D a A7WKJ0_9VIRU MCP2 MAGRQAHRKFDVRNDTSTRWKGKLYGIFVNYMGEDYAKEFVEQAYSNYEKVFVNIYTKIHNQLRTTLTSSAGAGATFPLWQIINEAIYAVYLTHKETASFLYAKYVARGIQPNVVKKILAETGNALKGIVPAVAQELGETVLDESNVISVVDDIVRKNPALPNSYAGIILQEARISTTPHYEGTEGFSSMESAYSALEEIEKGL 204 T 0.0031 Dynein_light pdb T Viruses T 7txu 2 E,F,G,H,I,J,K,L E,F,G,H,I,J,K,L 16x(Asp-Arg) XXXXXXXXXXXXXXXX 16 T 22 TFIIA pdbhh F F 7txv 2 E,F,G,H,I,J,K,L E,F,G,H,I,J,K,L 16x(Asp-Arg) XXXXXXXXXXXXXXXX 16 T 22 TFIIA pdbhh F F 7tyd 2 B,D B,D Binder GGGDRRKEMDKVYRTAFKRITSTPDKEKRKEVVKEATEQLRRIAKDEEEKKKAAYMILFLKTLG 64 T 0.055 UQCC3 pdb F T 7tz3 1 A A Iturin lipopeptide NXXQPXSX 8 T 4.1 PglL_A pdbhh F F 7tzk 2 C,D C,D B7ULW4_ECO27 T3SS secreted effector NleH homolog PPELPSVDYNSL 12 T 6.1 Se-cys_synth_N pdbhh F Bacteria T 7u05 1 A,M M,m Trafficking protein particle complex II-specific subunit 130 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 242 F F F 7u05 13 AA,BA N,n Trafficking protein particle complex II-specific subunit 65 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 210 F F F 7u06 12 X,Y M,m Trafficking protein particle complex II-specific subunit 130 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 293 F F F 7u06 13 AA,Z N,n Trafficking protein particle complex II-specific subunit 65 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 210 F F F 7u09 3 C A SARS-CoV-2 S fusion peptide PSKRSFIEDLLFNK 14 T 0.00028 CoV_S2 pdbhh F T 7u0a 1 A A SARS-CoV-2 S fusion peptide PSKRSFIEDLLFNK 14 T 0.00028 CoV_S2 pdbhh F T 7u0e 3 C,D C,D SARS-CoV-2 S fusion peptide PKRSFIEDLLFNK 13 T 5.4E-05 CoV_S2 pdbhh F T 7u21 3 C,F C,F PGM5_HUMAN PGM5 peptide (465-473) (H5Y) AVGSYVYSV 9 T 7.6 TraV pdbhh F Eukaryota F 7u2j 56 DB,ID 1z,2z P-site Peptidyl-tRNA fMAC-NH-tRNAmet Peptide-part MAC 3 T 51 META pdbhh F F 7u4t 6 J T RNK_HUMAN RNASE KAPPA MGWLRPGPRPLCPPARASWAFSHRFPSPLAPRRSPTPFFMASLLCCGPKLAACGIVLSAWGVIMLIMLGIFFNVHSAVLIEDVPFTEKDFENGPQNIYNLYEQVSYNCFIAAGLYLLLGGFSFCQVRLNKRKEYMVR 137 T 0.0026 DUF2650 pdbpercent F Eukaryota T 7u4t 14 CA,DA,EA Z,X,Y Q5ZWW6_LEGPH SidK MSFIKVGIKMGGLTSEQYHSQVVGKIGYIARCMQTIDPENNLKKIREDYQDVLIWAEKNYRFEEILEASKSGKCPNDLDALSRRSLILQELLRLVSSISPFKMKLDLIESQYEKMKQHVNLWKSDYHVKLNQLNQLTDYLKNAAPTPKNNFLRAMTSVLQMQIAQYGITEDNEGINQLFKLGLHLLAMANEKIDEQYHLFKGYVKDQPEESPFEGILPAEDQKILVKTMIDYAMPKLSSKVLQDKLSALSSSDVLTKTLLDSIDRIVKENEKLNALSKVKLGKFGLDIREIEVIYSQALKISPQDALQYTAQQCDAQLLSMAFPDSQNYIIESISNKKVKTIAELIHSKEFIYQIIKTEVFKQVDPNEKIRLQAATELYQLLGRIMDKQINLFTKMNLEQINEYIQTKTKAILDKIPERVELLTFMGFEIPTFKGIETLMTDISHSQDNETLAIAQEFYTNIKNAKNQLLGDKLIEDITPQDVEKFFNQCSQYGSEAAEKLADNRPVLTKIADILTAIARWAISLIGFNTPPQFLAPTRTCVDQVSDEITKIKLKLEDTLGSLQKVQEESLSL 573 T 0.15 Chalcone pdb F Bacteria T 7u5d 4 D A Cas8/5 MVTIMHIEELLDIEDHGERDRQLRRYLAPYSAEIGVDGAEKMALVVLLNLTLKRDRVESLCDEGLARQLLSDEGHITNCLHTVRWLHTHNLKYPDARVSGERLIINAPPLIPGVISSAGLPMRMGWAHDSSDINLAKLFGTSFRYRDDSTNLALQLVARSKTWEQALIGLGLTQQQLDIWCQLLASNLENNTFPTVVSPFSKQVRFLYQGNYCVVTPVVSHALLAQLQNVVHEKKLQCTYIHHDHPASVGSLVGALGGKVAVLDYPPPVSPDKARSFSQARKHRLANGQSLFDRSVFNDHVFIDALKHVISRPGLTRKQQRQLRLSALRYLRRQLAIWLGPIIEWRDEIVSSGRGEPGNLPSGGLELELITQPKKMLPELMLQVAGRFHLELQNHSAGRRFAFHPALMAPIKSQILWLLRQLADDEEKDEPHPPTSCYYLHLSGLTVYDASALANPYLCGIPSLSALAGFCHDYERRLQSLIGQSVYFRGLAWYLGRYSLVTGKHLPEPSKSADPKSVSAIRRPGLLDGRYCDLGMDLIIEVHIPTGGSLPFTTCLDLLRVALPARFAGGCLHPPSLYEEYNWCTVYQDKSTLFTVLSRLPRYGCWIYPSDADLRSFEELSEALALDRRLRPVATGFVFLEEPVERAGSIEGQHVYAESAIGTALCINPVEMRLAGKKRFFGAGFWQLNDAKGAILMNGSANTG 704 T 0.00011 Cas_Csy2 pdb F T 7u5e 4 D A Cas8/5 MVTIMHIEELLDIEDHGERDRQLRRYLAPYSAEIGVDGAEKMALVVLLNLTLKRDRVESLCDEGLARQLLSDEGHITNCLHTVRWLHTHNLKYPDARVSGERLIINAPPLIPGVISSAGLPMRMGWAHDSSDINLAKLFGTSFRYRDDSTNLALQLVARSKTWEQALIGLGLTQQQLDIWCQLLASNLENNTFPTVVSPFSKQVRFLYQGNYCVVTPVVSHALLAQLQNVVHEKKLQCTYIHHDHPASVGSLVGALGGKVAVLDYPPPVSPDKARSFSQARKHRLANGQSLFDRSVFNDHVFIDALKHVISRPGLTRKQQRQLRLSALRYLRRQLAIWLGPIIEWRDEIVSSGRGEPGNLPSGGLELELITQPKKMLPELMLQVAGRFHLELQNHSAGRRFAFHPALMAPIKSQILWLLRQLADDEEKDEPHPPTSCYYLHLSGLTVYDASALANPYLCGIPSLSALAGFCHDYERRLQSLIGQSVYFRGLAWYLGRYSLVTGKHLPEPSKSADPKSVSAIRRPGLLDGRYCDLGMDLIIEVHIPTGGSLPFTTCLDLLRVALPARFAGGCLHPPSLYEEYNWCTVYQDKSTLFTVLSRLPRYGCWIYPSDADLRSFEELSEALALDRRLRPVATGFVFLEEPVERAGSIEGQHVYAESAIGTALCINPVEMRLAGKKRFFGAGFWQLNDAKGAILMNGSANTG 704 T 0.00011 Cas_Csy2 pdb F T 7u60 5 I,J M,N ARG-GLY-ASP-DPN-VAL RGDXV 5 T 23 DUF5414 pdbhh F F 7u6d 1 A A IM459 SLEQEWXKIECEVYGKCPPKKAXYDWFERQLK 32 T 0.3 DUF2161 pdbhh F T 7u6e 4 E,F G,H IM462 XSLEEEWAQIECEVYGRCPPSES 23 T 1.5 DUF6058 pdbhh F T 7u6v 3 G P C-terminal domain (CTD) from the Ribosomal P-stalk GFGLFD 6 T 7 TMEM65 pdbhh F F 7u7n 4 D D IL27A_HUMAN IL-27 SUBUNIT ALPHA,IL-27-A,IL27-A,INTERLEUKIN-30,P28 FPRPPGRPQLSLQELRREFTVSLHLARKLLSEVRGQAHRFAESHLPGVNLYLLPLGEQLPDVSLTFQAWRRLSDPERLCFISTTLQPFHALLGGLGTQGRWTNMERMQLWAMRLDLRDLQRHLRFQVLAAGFNLPEEEEEEEEEEEEERKGLLPGALGSALQGPAQVSWPQLLSTYRLLHSLELVLSRAVRELLLLSKAGHSVWPLGFPTLSPQP 215 F F Eukaryota T 7u8o 8 P,Q,R Q,R,S Q5ZWW6_LEGPH Bacterial effector protein SidK MSFIKVGIKMGGLTSEQYHSQVVGKIGYIARCMQTIDPENNLKKIREDYQDVLIWAEKNYRFEEILEASKSGKCPNDLDALSRRSLILQELLRLVSSISPFKMKLDLIESQYEKMKQHVNLWKSDYHVKLNQLNQLTDYLKNAAPTPKNNFLRAMTSVLQMQIAQYGITEDNEGINQLFKLGLHLLAMANEKIDEQYHLFKGYVKDQPEESPFEGILPAEDQKILVKTMIDYAMPKLSSKVLQDKLSALSSSDVLTKTLLDSIDRIVKENEKLNALSKVKLGKFGLDIREIEVIYSQALKISPQDALQYTAQQCDAQLLSMAFPDSQNYIIESISNK 337 T 0.15 Chalcone unp F Bacteria T 7u8o 16 Z f A0A480L8C4_PIG Ribonuclease kappa MASLLCCGPKLAACGIVLSAWGVIMLIMLGIFFNVHSAVLIEDVPFTEKDFENGPQDIYKLYEQVSYNCFIAAGLYLLLGGFSFCQVRLNKRKEYMVR 98 T 0.0082 DUF2650 unp F Eukaryota T 7u8p 8 P,Q,R Q,R,S Q5ZWW6_LEGPH Bacterial effector protein SidK MSFIKVGIKMGGLTSEQYHSQVVGKIGYIARCMQTIDPENNLKKIREDYQDVLIWAEKNYRFEEILEASKSGKCPNDLDALSRRSLILQELLRLVSSISPFKMKLDLIESQYEKMKQHVNLWKSDYHVKLNQLNQLTDYLKNAAPTPKNNFLRAMTSVLQMQIAQYGITEDNEGINQLFKLGLHLLAMANEKIDEQYHLFKGYVKDQPEESPFEGILPAEDQKILVKTMIDYAMPKLSSKVLQDKLSALSSSDVLTKTLLDSIDRIVKENEKLNALSKVKLGKFGLDIREIEVIYSQALKISPQDALQYTAQQCDAQLLSMAFPDSQNYIIESISNK 337 T 0.15 Chalcone unp F Bacteria T 7u8p 15 Y f A0A480L8C4_PIG Ribonuclease kappa MASLLCCGPKLAACGIVLSAWGVIMLIMLGIFFNVHSAVLIEDVPFTEKDFENGPQDIYKLYEQVSYNCFIAAGLYLLLGGFSFCQVRLNKRKEYMVR 98 T 0.0082 DUF2650 unp F Eukaryota T 7u8q 8 P,Q,R Q,R,S Q5ZWW6_LEGPH Bacterial effector protein SidK MSFIKVGIKMGGLTSEQYHSQVVGKIGYIARCMQTIDPENNLKKIREDYQDVLIWAEKNYRFEEILEASKSGKCPNDLDALSRRSLILQELLRLVSSISPFKMKLDLIESQYEKMKQHVNLWKSDYHVKLNQLNQLTDYLKNAAPTPKNNFLRAMTSVLQMQIAQYGITEDNEGINQLFKLGLHLLAMANEKIDEQYHLFKGYVKDQPEESPFEGILPAEDQKILVKTMIDYAMPKLSSKVLQDKLSALSSSDVLTKTLLDSIDRIVKENEKLNALSKVKLGKFGLDIREIEVIYSQALKISPQDALQYTAQQCDAQLLSMAFPDSQNYIIESISNK 337 T 0.15 Chalcone unp F Bacteria T 7u8q 15 Y f A0A480L8C4_PIG Ribonuclease kappa MASLLCCGPKLAACGIVLSAWGVIMLIMLGIFFNVHSAVLIEDVPFTEKDFENGPQDIYKLYEQVSYNCFIAAGLYLLLGGFSFCQVRLNKRKEYMVR 98 T 0.0082 DUF2650 unp F Eukaryota T 7u8r 8 P,Q,R Q,R,S Q5ZWW6_LEGPH Bacterial effector protein SidK MSFIKVGIKMGGLTSEQYHSQVVGKIGYIARCMQTIDPENNLKKIREDYQDVLIWAEKNYRFEEILEASKSGKCPNDLDALSRRSLILQELLRLVSSISPFKMKLDLIESQYEKMKQHVNLWKSDYHVKLNQLNQLTDYLKNAAPTPKNNFLRAMTSVLQMQIAQYGITEDNEGINQLFKLGLHLLAMANEKIDEQYHLFKGYVKDQPEESPFEGILPAEDQKILVKTMIDYAMPKLSSKVLQDKLSALSSSDVLTKTLLDSIDRIVKENEKLNALSKVKLGKFGLDIREIEVIYSQALKISPQDALQYTAQQCDAQLLSMAFPDSQNYIIESISNK 337 T 0.15 Chalcone unp F Bacteria T 7u8r 15 Y f A0A480L8C4_PIG Ribonuclease kappa MASLLCCGPKLAACGIVLSAWGVIMLIMLGIFFNVHSAVLIEDVPFTEKDFENGPQDIYKLYEQVSYNCFIAAGLYLLLGGFSFCQVRLNKRKEYMVR 98 T 0.0082 DUF2650 unp F Eukaryota T 7u9e 1 A A P230_PLAF7 Gametocyte surface protein P230 SVLQSGALPSVGVDELDKIDLSYETTESGDTAVSEDSYDKYASQNTNKEYVCDFTDQLKPTESGPKVKKCEVKVNEPLIKVKIICPLKGSVEKLYDNIEYVPKKSPYVVLTKEETKLKEKLLSKLIYGLLISPTVNEKENNFKEGVIEFTLPPVVHKATVFYFICDNSKTEDDNKKGNRGIVEVYVEPYGN 191 T 1.1 s48_45 unppercent F Eukaryota T 7u9k 2 C F DAL-DAL XX 2 F F F 7u9w 1 A A P230_PLAF7 Gametocyte surface protein P230 SVLQSGALPSVGVDELDKIDLSYETTESGDTAVSEDSYDKYASQNTNKEYVCDFTDQLKPTESGPKVKKCEVKVNEPLIKVKIICPLKGSVEKLYDNIEYVPKKSPYVVLTKEETKLKEKLLSKLIYGLLISPTVNEKENNFKEGVIEFTLPPVVHKATVFYFICDNSKTEDDNKKGNRGIVEVYVEPYGN 191 T 1.1 s48_45 unppercent F Eukaryota T 7ua2 1 A A P230_PLAF7 Gametocyte surface protein P230 SVLQSGALPSVGVDELDKIDLSYETTESGDTAVSEDSYDKYASQNTNKEYVCDFTDQLKPTESGPKVKKCEVKVNEPLIKVKIICPLKGSVEKLYDNIEYVPKKSPYVVLTKEETKLKEKLLSKLIYGLLISPTVNEKENNFKEGVIEFTLPPVVHKATVFYFICDNSKTEDDNKKGNRGIVEVYVEPYGN 191 T 1.1 s48_45 unppercent F Eukaryota T 7ua8 1 A,C A,B P230_PLAF7 Gametocyte surface protein P230 SVLQSGALPSVGVDELDKIDLSYETTESGDTAVSEDSYDKYASQNTNKEYVCDFTDQLKPTESGPKVKKCEVKVNEPLIKVKIICPLKGSVEKLYDNIEYVPKKSPYVVLTKEETKLKEKLLSKLIYGLLISPTVNEKENNFKEGVIEFTLPPVVHKATVFYFICDNSKTEDDNKKGNRGIVEVYVEPYGN 191 T 1.1 s48_45 unppercent F Eukaryota T 7uav 1 A,B,C,D,E A,B,C,D,E TAD1_CLOBO TAD1 SMKELSTIQKREKLNTVERIGSEGPGGAYHEYVIKSNSMDSQGNYDVYETIKFQKGARKEEKSQHGVIDSDLLEIVRDRLKSFQAGPFSSRENACALTHVEEALMWMNRRVEDRIERNVLGTNTK 125 T 0.023 HlyU pdb F Bacteria T 7uaw 1 A,B A,B TAD1_CLOBO ABC transporter ATPase SMKELSTIQKREKLNTVERIGSEGPGGAYHEYVIKSNSMDSQGNYDVYETIKFQKGARKEEKSQHGVIDSDLLEIVRDRLKSFQAGPFSSRENACALTHVEEALMWMNRRVEDRIERNVLGTNTK 125 T 0.023 HlyU pdb F Bacteria T 7uba 1 A A A7TRP1_VANPO HORMA domain-containing protein QSPDIECECDLLCPITSTRIKQCKNCRKFVHSLCYGNKPGPKVDKCISCVYGPMFDPSSSEFKDLMMLRKCYRFLSRNKGFPPSIKEFTNSIMEEGQVTLENIERINFCISTLSSDGILNFSQCNKQRDASQDGSASKATRIQGNKVSIDEEGIFVPKIGELLKGREYMCCFIYNSDNSHACYLDVSPESKRQIENWIDQVKSIRNDFEPNSS 213 T 0.06 PHD_2 pdb F Eukaryota T 7ubc 1 A A Cyclic peptide D9.16 DPR-MAA-ALA-DVA-MLE-LEU-LEU-PRO-DLE XXAXXLLPX 9 T 15 PelD_GGDEF pdbhh F F 7ubd 1 A A Cyclic peptide D8.31 DAL-DPR-MLU-DVA-DAL-DPR-MLU-DVA XXXXXXXX 8 F F F 7ube 1 A A Cyclic peptide D8.21 DVA-MLE-DPR-LEU-DVA-MLE-DPR-LEU XXXLXXXL 8 F F F 7ubf 1 A A Cyclic peptide D8.21 DVA-MLE-DPR-LEU-DVA-MLE-DPR-LEU XXXLXXXL 8 F F F 7ubg 1 A A Cyclic peptide D9.16 DPR-MAA-ALA-DVA-MLE-LEU-LEU-PRO-DLE XXAXXLLPX 9 T 15 PelD_GGDEF pdbhh F F 7ubh 1 A A Cyclic peptide D8.31 DAL-DPR-MLU-DVA-DAL-DPR-MLU-DVA XXXXXXXX 8 F F F 7ubi 1 A A Cyclic peptide D8.21 DVA-MLE-DPR-LEU-DVA-MLE-DPR-LEU XXXLXXXL 8 F F F 7ubs 1 A,B,C,D A,B,C,D P230_PLAF7 Gametocyte surface protein P230 SVLQSGALPSVGVDELDKIDLSYETTESGDTAVSEDSYDKYASQNTNKEYVCDFTDQLKPTESGPKVKKCEVKVNEPLIKVKIICPLKGSVEKLYDNIEYVPKKSPYVVLTKEETKLKEKLLSKLIYGLLISPTVNEKENNFKEGVIEFTLPPVVHKATVFYFICDNSKTEDDNKKGNRGIVEVYVEPYGN 191 T 1.1 s48_45 unppercent F Eukaryota T 7ubu 2 B,E P,Q H32_MAIZE Histone H3.2 SARTKQTARXSTGGKAPRKQLATKAARKSAPAT 33 T 0.41 PAF pdbpercent F Eukaryota T 7ucf 4 D G Q2N0S6_9HIV1 ENV POLYPROTEIN MDAMKRGLCCVLLLCGAVFVSPAGAGENLWVTVYYGVPVWKDAETTLFCASDAKAYETEKHNVWATHACVPTDPNPQEIHLENVTEEFNMWKNNMVEQMHTDIISLWDQSLKPCVKLTPLCVTLQCTNVTNNITDDMRGELKNCSFNMTTELRDKKQKVYSLFYRLDVVQINENQGNRSNNSNKEYRLINCNTSAITQACPKVSFEPIPIHYCAPAGFAILKCKDKKFNGTGPCPSVSTVQCTHGIKPVVSTQLLLNGSLAEEEVMIRSENITNNAKNILVQFNTPVQINCTRPNNNTRKSIRIGPGQAFYATGDIIGDIRQAHCNVSKATWNETLGKVVKQLRKHFGNNTIIRFANSSGGDLEVTTHSFNCGGEFFYCNTSGLFNSTWISNTSVQGSNSTGSNDSITLPCRIKQIINMWQRIGQAMYAPPIQGVIRCVSNITGLILTRDGGSTNSTTETFRPGGGDMRDNWRSELYKYKVVKIEPLGVAPTRCKRRVVGR 501 T 3.4E-54 GP120 pdbpercent T Viruses T 7ucp 1 A A computationally designed cyclic peptide D8.3.p2 LVXXLIXX 8 T 0.94 Adeno_PX pdbhh F F 7ucq 1 A,C,E,G A,B,C,D P230_PLAF7 Gametocyte surface protein P230 SVLQSGALPSVGVDELDKIDLSYETTESGDTAVSEDSYDKYASQNTNKEYVCDFTDQLKPTESGPKVKKCEVKVNEPLIKVKIICPLKGSVEKLYDNIEYVPKKSPYVVLTKEETKLKEKLLSKLIYGLLISPTVNEKENNFKEGVIEFTLPPVVHKATVFYFICDNSKTEDDNKKGNRGIVEVYVEPYGN 191 T 1.1 s48_45 unppercent F Eukaryota T 7udi 1 A,B A,B DNA damage response protein DdrC MKNAPLTLNFGSVRLPVSADGLLHAPTAQQQLGLTQSWEAALVEHGLPETYRDFGAGPEAAVSVPDFVALAFALDTPEARRWQKRARELLARAMQGDVRVAAQIAERNPEPDARRWLAARLESTGARRELMATVARHGGEGRVYGQLGSISNRTVLGKDSASVRQERGVKATRDGLTSAELLRMAYIDTVTARAIQESEARGNAAILTLHEQVARSERQSWERAGQVQRVG 231 T 21 KilA-N pdbhh F T 7udj 1 A G 4xPAW peptide AWPAWPAWPAWP 12 T 3.7 RRN9 pdbhh F F 7udj 2 B H De novo designed helical repeat protein RPB_PEW3_R4 KKEAEEVAAHVEQIAFIAKEQGNEEVAKLAKRLAETIKRLNEGTEEEVKRLLEAAEVAAHVLQIAFIAHEQGNEEVAKLALELAESILRLIEGTEEEVKRLLEAAEVAAHVLQIAFIAHEQGNEEVAKLALELAESILRLIEGTEEEVKELLERAEEAAHVLQHAFIATEQGNEEDAKEALRKAEEILRRNA 192 T 0.097 HemY_N pdb F T 7udk 1 A A Designed helical repeat protein (DHR) RPB_LRP2_R4 DREEAAFLAASILIQHAHEQGKDDRELEKILEIAIRILEKNGVDREEAAFLAASILIQHAHEQGKDDRELEKILEIAIRILEKNGVDREEAAFLAASILIQHAHEQGKDDRELEKILEIAIRILEKNGVDREEAAFLAASILIQHAHEQGKDDRELEKILEIAIRILEKNGV 172 T 0.032 Hormone_recep pdb F T 7udk 2 B B 4xLRP LRPLRPLRPLRP 12 T 9.2 RhoGEF67_u2 pdbhh F F 7udl 1 A A Designed helical repeat protein (DHR) RPB_PLP1_R6 PEEERIKYVITVVEQIAKDAHRNGQEELAKLAERTAEEAKKATERGEEETLRIVYVIVVVLQIALEAHRNGQEELAKLALRTAEEAIKATERGEEETLRIVYVIVVVLQIALEAHRNGQEELAKLALRTAEEAIKATERGEEETLRIVYVIVVVLQIALEAHRNGQEELAKLALRTAEEAIKATERGEEETLRIVYVIVVVLQIALEAHRNGQEELAKLALRTAEEAIKATERGEEETERIVYDIVVVLQEALEAHRNGEEERAKKALDEARRRIEATERGE 282 T 0.019 SMBP pdb F T 7udl 2 B,C B,D 6xPLP Peptide PLPPLPPLPPLPPLPPLPPLP 21 T 630 DUF4795 pdbhh F F 7udm 1 A,B A,B Designed helical repeat protein (DHR) RPB_PLP1_R6 APEEERIKYVITVVEQIAKDAHRNGQEELAKLAERTAEEAKKATERGEEETLRIVYVIVVVLQIALEAHRNGQEELAKLALRTAEEAIKATERGEEETLRIVYVIVVVLQIALEAHRNGQEELAKLALRTAEEAIKATERGEEETLRIVYVIVVVLQIALEAHRNGQEELAKLALRTAEEAIKATERGEEETLRIVYVIVVVLQIALEAHRNGQEELAKLALRTAEEAIKATERGEEETERIVYDIVVVLQEALEAHRNGEEERAKKALDEARRRIEATERGE 283 T 0.019 SMBP pdb F T 7udm 2 C C 6xPLP PLPPLPPLPPLPPLPPLP 18 T 60 DUF5558 pdbhh F F 7udn 1 A,B A,B Designed helical repeat protein (DHR) RPB_PLP1_R6 APEEERIKYVITVVEQIAKDAHRNGQEELAKLAERTAEEAKKATERGEEETLRIVYVIVVVLQIALEAHRNGQEELAKLALRTAEEAIKATERGEEETLRIVYVIVVVLQIALEAHRNGQEELAKLALRTAEEAIKATERGEEETLRIVYVIVVVLQIALEAHRNGQEELAKLALRTAEEAIKATERGEEETLRIVYVIVVVLQIALEAHRNGQEELAKLALRTAEEAIKATERGEEETERIVYDIVVVLQEALEAHRNGEEERAKKALDEARRRIEATERGE 283 T 0.019 SMBP pdb F T 7udo 1 A A Designed helical repeat protein (DHR) RPB_LRP2_R4 DREEAAFLAASILIQHAHEQGKDDRELEKILEIAIRILEKNGVDREEAAFLAASILIQHAHEQGKDDRELEKILEIAIRILEKNGVDREEAAFLAASILIQHAHEQGKDDRELEKILEIAIRILEKNGVDREEAAFLAASILIQHAHEQGKDDRELEKILEIAIRILEKNGV 172 T 0.032 Hormone_recep pdb F T 7udv 1 A,B,C,D,E,F,G,H,I,J A,B,C,D,E,F,G,H,I,J De novo designed proton channel LLQL DSLKWIVFLLFLIVLLQLAIVFLLRG 26 T 0.0062 RCR pdbhh F T 7udw 1 A,B,C,D,E,F,G,H,I,J,K,L,M,N,O,P,Q,R,S,T A,B,C,D,E,F,G,H,I,J,K,L,M,N,O,P,Q,R,S,T De novo designed pentameric proton channel QQLL DSQKWIVFLQFLIVLLLLAIVFLLRG 26 T 0.0078 RCR pdbhh F T 7udx 1 A,B,C,D,E,F,G,H,I,J,K,L,M,N,O A,B,C,D,E,F,G,H,I,J,K,L,M,N,O De novo designed pentameric proton channel QLQL DSQKWIVFLLFLIVLLQLAIVFLLRG 26 T 0.0068 RCR pdbhh F T 7udy 1 A,B,C,D,E,F,G,H,I,J,K,L,M,N,O A,B,C,D,E,F,G,H,I,J,K,L,M,N,O Designed channel QLLL DSQKWIVFLLFLIVLLLLAIVFLLRG 26 T 0.011 RCR pdbhh F T 7udz 1 A,B,C,D,E,F,G,H,I,J A,B,C,D,E,F,G,H,I,J De novo designed pentameric proton channel LQLL DSLKWIVFLQFLIVLLLLAIVFLLRG 26 T 0.0065 RCR pdbhh F T 7ue2 1 A A RPB_PLP3_R6 MDEEREKLKEKLKEVLRRAKEAKKKGDKEKLIELAYEAAALAAWIIHKDSNDDEIVELAKEALKLVLEAAKEAKKNGDKEKLIKLAYLAAAVAAWIIHTDGDDDEIVELAKEALKLVLEAAKEAKKNGDKEKLIKLAYLAAAVAAWIITTDGDDDEIVELAKEALKLVLEAAKEAKKNGDKEKLIKLAYLAAAVAAWIIHTDGDDDEIVELAKEALKLVLEAAKEAKKNGDKEKLIKLAYLAAAVAAWIITTDGDDEEIVELAKEALKLVKEAAEEAEKQGDEELREKLRYLSEAVREWIERND 304 T 0.002 SMBP pdb F T 7ue2 2 B B PLPx6 peptide PLPPLPPLPPLPPLPPLPP 19 T 72 DUF4485 pdbhh F F 7ueg 1 A,B,C,D,E,F A,E,B,C,D,F A3MUL8_PYRCJ Pilin MARKKNYRPLIALAALAVAALAMATLTFTNLTYWLINATLPPAMKYPGTDTTITRSDSSGYNRYVYVSYYYDPSTGYNVTRISIVGFTGDPTNYTNVLQLCNKYYSGTLYAKLVAVGTVGTTNYESYIKDFRVYFVNPTTTPNYVQFQGTSVTQSATGSVSIGPGQCATVGAYVLVDPSLPTSARDGKTVIATYQVNVVFSTSP 204 T 0.19 DUF3254 pdb F Archaea T 7uek 1 A A OT3 MHHHHHHENLYFQSDAICIYLDESATWKDMKKAMEILYKLGVKKIVVLFKYDEKLIKVAAKVLHDLGAEEAIIILIFDIDDEDEFKKQVKKALELMKKLGVDHRIIALRMTDEEKFKKLAKIAAELGADAICIYLDESATWKDMKKAMEILYKLGVKKIVVLFKYDEKLIKVAAKVLHDLGAEEAIIILIFDIDDEDEFKKQVKKALELMKKLGVDHRIIALRMTDEEKFKKLAKIAAELGA 242 T 0.00023 DeoC pdb F T 7ufn 3 E,F E,F CSP_PLAFW PfCSP peptide 21 NPDPNANPNVDPNAN 15 T 0.7 Cas_Cas7 pdbhh F Eukaryota F 7ufo 3 C A CSP_PLAFW PfCSP peptide 21 NPDPNANPNVDPNAN 15 T 0.7 Cas_Cas7 pdbhh F Eukaryota F 7ufq 2 B,E A,C CSP_PLAFA PfCSP peptide 21 NPDPNANPNVDPNAN 15 T 0.7 Cas_Cas7 pdbhh F Eukaryota F 7ug2 1 A A TRI75_MOUSE Tripartite motif-containing protein 75 GPGGVTLREQAEAQRSQLTSECEKLMRFLDQEERAAFSRLEDEEMRLEKRLLDNIAALE 59 T 0.00063 DUF3583 unphh F Eukaryota T 7ug7 60 HB B Argyrin B XXWXGXXX 8 T 2.1 Glypican pdbhh F F 7ugb 2 B I ISG20_HUMAN ESTROGEN-REGULATED TRANSCRIPT 45 PROTEIN,PROMYELOCYTIC LEUKEMIA NUCLEAR BODY-ASSOCIATED PROTEIN ISG20 XIRARRGLPRLAVSD 15 T 0.00026 DNA_pol_B_exo2 unphh F Eukaryota T 7ugc 1 A A A0A827X9M7_ECOLX VWA DOMAIN PROTEIN INTERACTING WITH AAA ATPASE MLTLDTLNVMLAVSEEGLIEEMIIALLASPQLAVFFEKFPRLKAAITDDVPRWREALRSRLKDARVPPELTEEVMCYQQSQLLSTPQFIVQLPQILDLLHRLNSPWAEQARQLVDANSTITSALHTLFLQRWRLSLIVQATTLNQQLLEEEREQLLSEVQERMTLSGQLEPILADNNTAAGRLWDMSAGQL 191 T 0.35 RHH_3 unppssm F Bacteria T 7ugn 1 A,B,C A,B,C Q2N0S5_9HIV1 ENV POLYPROTEIN ENLWVTVYYGVPVWKDAETTLFCASDAKAYKKHNVWATHACVPTDPNPQEIHLENVTEEFNMWKNNMVEQMHTDIISLWDQSLKPCVKLTPLCVTLQCTNVTNNIDDMRGELKNCSFNMTTELRDKRQKVHALFYKLDIVPINENQNTSYRLINCNTAAITQACPKVSFEPIPIHYCAPAGFAILKCKDKKFNGTGPCPSVSTVQCTHGIKPVVSTQLLLNGSLAEEEVMIRSEDIRNNAKNILVQFNTPVQINCTRPNNNTRKSIRIGPGQWFYATGDIIGDIRQAHCNVSKATWNETLGKVVKQLRKHFGNNTIIRFANSSGGDLEVTTHSFNCGGEFFYCDTSGLFNSTWISNDSITLPCRIKQIINMWQRIGQAMYAPPIQGVIRCVSNITGLILTRDGGSTDSTTETFRPSGGDMRDNWRSELYKYKVVKIEPLGVAPTRCKRRVV 451 T 2.5E-53 GP120 pdbpercent T Viruses T 7ugo 1 A,B,C A,B,C Q2N0S5_9HIV1 ENV POLYPROTEIN ENLWVTVYYGVPVWKDAETTLFCASDAKAYETKKHNVWATHACVPTDPNPQEIHLENVTEEFNMWKNNMVEQMHTDIISLWDQSLKPCVKLTPLCVTLQCTNVTNNITDDMRGELKNCSFNMTTELRDKRQKVHALFYKLDIVPINENQNTSYRLINCNTAAITQACPKVSFEPIPIHYCAPAGFAILKCKDKKFNGTGPCPSVSTVQCTHGIKPVVSTQLLLNGSLAEEEVMIRSEDIRNNAKNILVQFNTPVQINCTRPNNNTRKSIRIGPGQWFYATGDIIGDIRQAHCNVSKATWNETLGKVVKQLRKHFGNNTIIRFANSSGGDLEVTTHSFNCGGEFFYCDTSGLFNSTWISNTSVQGSNSTGSNDSITLPCRIKQIINMWQRIGQAMYAPPIQGVIRCVSNITGLILTRDGGSTDSTTETFRPSGGDMRDNWRSELYKYKVVKIEPLGVAPTRCKRRVV 466 T 3.5E-54 GP120 pdbpercent T Viruses T 7ugp 1 A,B,C A,B,C Q2N0S5_9HIV1 ENV POLYPROTEIN ENLWVTVYYGVPVWKDAETTLFCASDKKHNVWATHACVPTDPNPQEIHLENVTEEFNMWKNNMVEQMHTDIISLWDQSLKPCVKLTPLCVTLQCTNVTNNIGELKNCSFNMTTELRDKRQKVHALFYKLDIVPINENQNTSYRLINCNTAAITQACPKVSFEPIPIHYCAPAGFAILKCKDKKFNGTGPCPSVSTVQCTHGIKPVVSTQLLLNGSLAEEEVMIRSENITNNAKNILVQFNTPVQINCTRPNNNTRKSIRIGPGQWFYATGDIIGDIRQAHCNVSKATWNETLGKVVKQLRKHFGNNTIIRFANSSGGDLEVTTHSFNCGGEFFYCDTSGLFNSTWISNDSITLPCRIKQIINMWQRIGQAMYAPPIQGVIRCVSNITGLILTRDGGSTDSTTETFRPSGGDMRDNWRSELYKYKVVKIEPLGVAPTRCKRRVV 443 T 1.5999999999999998E-49 GP120 unp T Viruses T 7ugw 4 F E evybactin XTXTHXXFGXSX 12 T 9 RNA_pol pdbhh F T 7uhb 2 B K Multivalent miniprotein inhibitor AHB2-2GS-SB175 ELEEQVMHVLDQVSELAHELLHKLTGEELERAAYFNWWATEMMLELIKSDDEREIREIEEEARRILEHLEELARKGGSEALEELEKALRELKKSTDELERSTEELEKNPSEDALVENNRLIVENNKIIVEVLRIIAKVLKLEHHHHHH 148 T 0.0012 Syntaxin-6_N pdbpssm F T 7uhc 2 B,D,F K,C,E Multivalent miniprotein inhibitor AHB2-2GS-SB175 ELEEQVMHVLDQVSELAHELLHKLTGEELERAAYFNWWATEMMLELIKSDDEREIREIEEEARRILEHLEELARKGGSEALEELEKALRELKKSTDELERSTEELEKNPSEDALVENNRLIVENNKIIVEVLRIIAKVLKLEHHHHHH 148 T 0.0012 Syntaxin-6_N pdbpssm F T 7uhe 2 B,D B,D TAF2_YEAST TAFII-150,TBP-ASSOCIATED FACTOR 150 KDA,TBP-ASSOCIATED FACTOR 2,TSM-1 SRSFMVKIRTKN 12 T 2 DUF3970 pdbhh F Eukaryota T 7uhy 6 I,I2 I,I Unknown XXXXXXXXXXXXXXX 15 F F F 7uhy 7 J,J2 J,J Unknown XXXXXXXX 8 F F F 7ui9 5 E a MED1_YEAST MEDIATOR COMPLEX SUBUNIT 1 MVEGDSYVETLDSMIELFKDYKPGSITLENITRLCQTLGLESFTEELSNELSRLSTASKIIVIDVDYNKKQDRIQDVKLVLASNFDNFDYFNQRDGEHEKSNILLNSLTKYPDLKAFHNNLKFLYLLDAYSHIESDSTSHNNGSSDKSLDSSNASFNNQGKLDLFKYFTELSHYIRQCFQDNCCDFKVRTNLNDKFGIYILTQGINGKEVPLAKIYLEENKSDSQYRFYEYIYSQETKSWINESAENFSNGISLVMEIVANAKESNYTDLIWFPEDFISPELIIDKVTCSSNSSSSPPIIDLFSNNNYNSRIQLMNDFTTKLINIKKFDISNDNLDLISEILKWVQWSRIVLQNVFKLVSTPSSNSNSSELEPDYQAPFSTSTKDKNSSTSNTEPIPRSNRHGSVVEASRRRRSSTNKSKRPSITEAMMLKEEGLQQFNLHEILSEPAIEEENGDSIKEHSTTMDGANDLGFTASVSNQENAGTDIVMEDHGVLQGTSQNYGTATADDADIEMKDVSSKPSKPESSVLQLIVSEDHIILDTISECNLYDDVKCWSKFIEKFQDIVS 566 T 7.5E-16 Med1 pdbhh F Eukaryota T 7uid 2 C,D B,D Thyclotide XXXXXXXXXX 10 F F F 7uif 17 Q a MED1_YEAST MEDIATOR COMPLEX SUBUNIT 1 MVEGDSYVETLDSMIELFKDYKPGSITLENITRLCQTLGLESFTEELSNELSRLSTASKIIVIDVDYNKKQDRIQDVKLVLASNFDNFDYFNQRDGEHEKSNILLNSLTKYPDLKAFHNNLKFLYLLDAYSHIESDSTSHNNGSSDKSLDSSNASFNNQGKLDLFKYFTELSHYIRQCFQDNCCDFKVRTNLNDKFGIYILTQGINGKEVPLAKIYLEENKSDSQYRFYEYIYSQETKSWINESAENFSNGISLVMEIVANAKESNYTDLIWFPEDFISPELIIDKVTCSSNSSSSPPIIDLFSNNNYNSRIQLMNDFTTKLINIKKFDISNDNLDLISEILKWVQWSRIVLQNVFKLVSTPSSNSNSSELEPDYQAPFSTSTKDKNSSTSNTEPIPRSNRHGSVVEASRRRRSSTNKSKRPSITEAMMLKEEGLQQFNLHEILSEPAIEEENGDSIKEHSTTMDGANDLGFTASVSNQENAGTDIVMEDHGVLQGTSQNYGTATADDADIEMKDVSSKPSKPESSVLQLIVSEDHIILDTISECNLYDDVKCWSKFIEKFQDIVS 566 T 7.5E-16 Med1 pdbhh F Eukaryota T 7uig 1 A a MED1_YEAST MEDIATOR COMPLEX SUBUNIT 1 MVEGDSYVETLDSMIELFKDYKPGSITLENITRLCQTLGLESFTEELSNELSRLSTASKIIVIDVDYNKKQDRIQDVKLVLASNFDNFDYFNQRDGEHEKSNILLNSLTKYPDLKAFHNNLKFLYLLDAYSHIESDSTSHNNGSSDKSLDSSNASFNNQGKLDLFKYFTELSHYIRQCFQDNCCDFKVRTNLNDKFGIYILTQGINGKEVPLAKIYLEENKSDSQYRFYEYIYSQETKSWINESAENFSNGISLVMEIVANAKESNYTDLIWFPEDFISPELIIDKVTCSSNSSSSPPIIDLFSNNNYNSRIQLMNDFTTKLINIKKFDISNDNLDLISEILKWVQWSRIVLQNVFKLVSTPSSNSNSSELEPDYQAPFSTSTKDKNSSTSNTEPIPRSNRHGSVVEASRRRRSSTNKSKRPSITEAMMLKEEGLQQFNLHEILSEPAIEEENGDSIKEHSTTMDGANDLGFTASVSNQENAGTDIVMEDHGVLQGTSQNYGTATADDADIEMKDVSSKPSKPESSVLQLIVSEDHIILDTISECNLYDDVKCWSKFIEKFQDIVS 566 T 7.5E-16 Med1 pdbhh F Eukaryota T 7uii 1 A a CanA MTTQSPLNSFYATGTAQAVSEPIDVESHLGSITPAAGAQGSDDIGYAIVWIKDQVNDVKLKVTLANAEQLKPYFKYLQIQITSGYETNSTALGNFSETKAVISLDNPSAVIVLDKEDIAVLYPDKTGYTNTSIWVPGEPDKIIVYNETKPVAILNFKAFYEAKEGMLFDSLPVIFNFQVLQVG 183 T 6.4 Hormone_3 pdbhh F T 7uik 7 H n MED14_YEAST GLUCOSE REPRESSION REGULATORY PROTEIN 1,MEDIATOR COMPLEX SUBUNIT 14 MQLVVLTDVVERLHKNFESENFKIIALQPNEISFKYLSNNDEDDKDCTIKISTNDDSIKNLTVQLSPSNPQHIIQPFLDNSKMDYHFIFSYLQFTSSLFKALKVILNERGGKFHESGSQYSTMVNIGLHNLNEYQIVYYNPQAGTKITICIELKTVLHNGRDKIQFHIHFADVAHITTKSPAYPMMHQVRNQVFMLDTKRLGTPESVKPANASHAIRLGNGVACDPSEIEPILMEIHNILK 241 T 0.092 CDT1 unp F Eukaryota T 7uil 7 M a MED1_YEAST MEDIATOR COMPLEX SUBUNIT 1 MVEGDSYVETLDSMIELFKDYKPGSITLENITRLCQTLGLESFTEELSNELSRLSTASKIIVIDVDYNKKQDRIQDVKLVLASNFDNFDYFNQRDGEHEKSNILLNSLTKYPDLKAFHNNLKFLYLLDAYSHIESDSTSHNNGSSDKSLDSSNASFNNQGKLDLFKYFTELSHYIRQCFQDNCCDFKVRTNLNDKFGIYILTQGINGKEVPLAKIYLEENKSDSQYRFYEYIYSQETKSWINESAENFSNGISLVMEIVANAKESNYTDLIWFPEDFISPELIIDKVTCSSNSSSSPPIIDLFSNNNYNSRIQLMNDFTTKLINIKKFDISNDNLDLISEILKWVQWSRIVLQNVFKLVSTPSSNSNSSELEPDYQAPFSTSTKDKNSSTSNTEPIPRSNRHGSVVEASRRRRSSTNKSKRPSITEAMMLKEEGLQQFNLHEILSEPAIEEENGDSIKEHSTTMDGANDLGFTASVSNQENAGTDIVMEDHGVLQGTSQNYGTATADDADIEMKDVSSKPSKPESSVLQLIVSEDHIILDTISECNLYDDVKCWSKFIEKFQDIVS 566 T 7.5E-16 Med1 pdbhh F Eukaryota T 7uio 39 NA,ZB Aa,Ba MED1_YEAST MEDIATOR COMPLEX SUBUNIT 1 MVEGDSYVETLDSMIELFKDYKPGSITLENITRLCQTLGLESFTEELSNELSRLSTASKIIVIDVDYNKKQDRIQDVKLVLASNFDNFDYFNQRDGEHEKSNILLNSLTKYPDLKAFHNNLKFLYLLDAYSHIESDSTSHNNGSSDKSLDSSNASFNNQGKLDLFKYFTELSHYIRQCFQDNCCDFKVRTNLNDKFGIYILTQGINGKEVPLAKIYLEENKSDSQYRFYEYIYSQETKSWINESAENFSNGISLVMEIVANAKESNYTDLIWFPEDFISPELIIDKVTCSSNSSSSPPIIDLFSNNNYNSRIQLMNDFTTKLINIKKFDISNDNLDLISEILKWVQWSRIVLQNVFKLVSTPSSNSNSSELEPDYQAPFSTSTKDKNSSTSNTEPIPRSNRHGSVVEASRRRRSSTNKSKRPSITEAMMLKEEGLQQFNLHEILSEPAIEEENGDSIKEHSTTMDGANDLGFTASVSNQENAGTDIVMEDHGVLQGTSQNYGTATADDADIEMKDVSSKPSKPESSVLQLIVSEDHIILDTISECNLYDDVKCWSKFIEKFQDIVS 566 T 7.5E-16 Med1 pdbhh F Eukaryota T 7uiq 2 C,D C,D TIAM1_MOUSE TIAM-1 RTLDSHASRMTQLKKQAAL 19 T 0.67 Gas_vesicle_C pdbhh F Eukaryota T 7uir 2 C,D C,D TIAM1_MOUSE TIAM-1 RTLDSHASRMTQLKKQAAL 19 T 0.67 Gas_vesicle_C pdbhh F Eukaryota T 7uit 1 A,AA,AB,B,BA,BB,C,CA,CB,D,DA,DB,E,EA,EB,F,FA,FB,G,GA,GB,H,HA,HB,I,IA,IB,J,JA,JB,K,KA,KB,L,LA,LB,M,MA,MB,N,NA,NB,O,OA,OB,P,PA,PB,Q,QA,QB,R,RA,RB,S,SA,SB,T,TA,TB,U,UA,V,VA,W,WA,X,XA,Y,YA,Z,ZA B,b,1,C,c,2,D,d,3,E,e,4,F,f,5,G,g,6,H,h,7,I,i,8,J,j,9,K,k,AA,L,l,BA,M,m,CA,N,n,DA,O,o,EA,P,p,FA,Q,q,A,R,r,JA,S,s,GA,T,t,HA,U,u,IA,V,v,W,w,X,x,Y,y,Z,z,a,0 Peptide 2 XLKAIAQEFKAIAKKFKAIAXEFKAIAQKX 30 T 12 DUF5741 pdbhh F T 7uj4 1 A,B A,B MEN1_HUMAN Isoform 2 of Menin MGLKTAQKTLFPLRSIDDVVRLFAAELGREEPDLVLLSLVLGFVEHFLAVNRVIPTNVPELTFQPSPAPDPPGGLTYFPVADLSIIAALYARFTAQIRGAVDLSLYPREGGVSSRELVKKVSDVIWNSLSRSYFKDRAHIQSLFSFITGTKLDSSGVAFAVVGACQALGLRDVHLALSEDHAWVVFGPNGEQTAEVTWHGKGNEDRRGQTVNAGVAERSWLYLKGSYMRCDRKMEVAFMVCAINPSIDLHTDSLELLQLQQKLLWLLYDLGHLERYPMALGNLADLEELEPTPGRPDPLTLYHKGIASAKTYYRDEHIYPYMYLAGYHCRNRNVREALQAWADTATVIQDYNYCREDEEIYKEFFEVANDVIPNLLKEAASLLEAGEERPGEQSQGTQSQGSALQDPECFAHLLRFYDGICKWEEGSPTPVLHVGWATFLVQSLGRFEGQVRQKVRITFQSEKMKGMKELLVATKINSSAIKLQLTAQ 488 T 2.4E-11 Menin pdb F Eukaryota T 7ujd 6 F Z ACY-PHE-PRO-ASP-VAL-SAR-LEU-HIS-ARG-TYR-TRP-GLY-TRP-ASP-CYS-GLY-NH2 GFPDVXLHRYWGWDCGX 17 T 1.3 DUF6172 pdbhh F T 7ukn 2 B B UL145_HCMVM H-Box Motif of pUL145 NAVQLLCARTRDG 13 T 0.71 DUF5500 unphh T Viruses T 7um0 1 A A A0A172JIC8_BPPB1 DNA-directed RNA polymerase subunit MDILENYVSFDEQARDINIAFDKLFGRDDISHMNNFSINKRSYYNCLDQISDDLNLVLNKYNDLAYSLLEIRYNMATKENYTHMEFYSDIERLFIKNEKLLNVISDIVEEEYDLDLNQASKGKKINIELQVTDNLNKIYLKSSVLMRILIPILCDFNCDDDINEVLVYDIFKEVIKSFDDGKKNALNKLYKIIYSRVFETKYSDVVIWTYLKNMSTDLMIIVKDYFKVIIKKIFPKLKHNSSVISYLDVVIKQKLKYLFTFKYPISYKPLKAETTDDEELSEQERMEINLLRNDQGNSIINECSIKQEIAKIKKKYNVTDEVMKEFINGRELNSIQIYLVKIYYSNKFKVNSNKNDIFYLLYGMTRELGEMNFSIIPEILSCAIAPNVRKMNNRKKLVDKIIHSDKYSYLLKSYLPIKNILDKNNVILQLMTIKNAKFMNKENKEVDFSTDHLAEEVLDMLLCI 464 T 23 GvpK pdbhh T Viruses T 7um0 3 C c A0A172JI16_BPPB1 DNA-directed RNA polymerase beta subunit MISNFRKFHGNKNQEKFNENLILNKENESILNYLDPICKTLEIIPEITYLGSSVEPINKVYKFNKEEKTSDIERSELQLIKMSFLIEKDDKKEEINKFIYFPKLIDSQYFIINGNRYYPIYQLLDSGTYRTNKALTLKTLLMPIVLREKKETFDDINGETHTMLNVDLDLFKSKVPFLIYFFSKFGFEGTLEYFGLQDLIHVLMKEDLDQLDEDEINDNVIFMITKNISLVVDKNFFSNKNNQIIIATLLNCFNTRIKIDKIYEKDYWVKKLGGYFTTNNSNKQEKGEGIILSFERILDEWTKKILRTEEKNKEDIYSVVRWMINNYLALVKQDNMNLANKRIRLYEYLLHPLLIKFSKGTYRVLNNRNSNKFEKIKTIFSNIQEGFLVKKIINNELLRYDNSVNSISLFTLILRYTQSGPQSPFSSNSTNNKLRGLHPSYLGRLGLTSTSAGDPGASGSLTPFLELPENSYMHFTEEPEINLNIDDISIDEVIES 496 T 0.0014 RNA_pol_Rpb2_3 pdbpssm T Viruses T 7um1 1 A A A0A172JIC8_BPPB1 DNA-directed RNA polymerase subunit MDILENYVSFDEQARDINIAFDKLFGRDDISHMNNFSINKRSYYNCLDQISDDLNLVLNKYNDLAYSLLEIRYNMATKENYTHMEFYSDIERLFIKNEKLLNVISDIVEEEYDLDLNQASKGKKINIELQVTDNLNKIYLKSSVLMRILIPILCDFNCDDDINEVLVYDIFKEVIKSFDDGKKNALNKLYKIIYSRVFETKYSDVVIWTYLKNMSTDLMIIVKDYFKVIIKKIFPKLKHNSSVISYLDVVIKQKLKYLFTFKYPISYKPLKAETTDDEELSEQERMEINLLRNDQGNSIINECSIKQEIAKIKKKYNVTDEVMKEFINGRELNSIQIYLVKIYYSNKFKVNSNKNDIFYLLYGMTRELGEMNFSIIPEILSCAIAPNVRKMNNRKKLVDKIIHSDKYSYLLKSYLPIKNILDKNNVILQLMTIKNAKFMNKENKEVDFSTDHLAEEVLDMLLCI 464 T 23 GvpK pdbhh T Viruses T 7um1 3 C c A0A172JI16_BPPB1 DNA-directed RNA polymerase beta subunit MISNFRKFHGNKNQEKFNENLILNKENESILNYLDPICKTLEIIPEITYLGSSVEPINKVYKFNKEEKTSDIERSELQLIKMSFLIEKDDKKEEINKFIYFPKLIDSQYFIINGNRYYPIYQLLDSGTYRTNKALTLKTLLMPIVLREKKETFDDINGETHTMLNVDLDLFKSKVPFLIYFFSKFGFEGTLEYFGLQDLIHVLMKEDLDQLDEDEINDNVIFMITKNISLVVDKNFFSNKNNQIIIATLLNCFNTRIKIDKIYEKDYWVKKLGGYFTTNNSNKQEKGEGIILSFERILDEWTKKILRTEEKNKEDIYSVVRWMINNYLALVKQDNMNLANKRIRLYEYLLHPLLIKFSKGTYRVLNNRNSNKFEKIKTIFSNIQEGFLVKKIINNELLRYDNSVNSISLFTLILRYTQSGPQSPFSSNSTNNKLRGLHPSYLGRLGLTSTSAGDPGASGSLTPFLELPENSYMHFTEEPEINLNIDDISIDEVIES 496 T 0.0014 RNA_pol_Rpb2_3 pdbpssm T Viruses T 7um2 3 C C SARS-CoV-2 Spike-derived peptide S417-425 K417T mutant (TIADYNYKL) TIADYNYKL 9 T 0.22 bCoV_S1_RBD pdbhh F T 7uma 2 B C HIS-HIS-HIS-HIS EHHHHHH 7 T 6500 zf_CCCH_4 pdbhh F F 7unf 13 U n RNK_HUMAN RNASE KAPPA MGWLRPGPRPLCPPARASWAFSHRFPSPLAPRRSPTPFFMASLLCCGPKLAACGIVLSAWGVIMLIMLGIFFNVHSAVLIEDVPFTEKDFENGPQNIYNLYEQVSYNCFIAAGLYLLLGGFSFCQVRLNKRKEYMVR 137 T 0.0026 DUF2650 pdbpercent F Eukaryota T 7ung 5 I,J 8,9 CF107_HUMAN CILIA- AND FLAGELLA-ASSOCIATED PROTEIN 107 MFLTAVNPQPLSTPSWQIETKYSTKVLTGNWMEERRKFTRDTDKTPQSIYRKEYIPFPDHRPDQISRWYGKRKVEGLPYKHLITHHQEPPHRYLISTYDDHYNRHGYNPGLPPLRTWNGQKLLWLPEKSDFPLLAPPTNYGLYEQLKQRQLTPKAGLKQSTYTSSYPRPPLCAMSWREHAVPVPPHRLHPFPHF 194 T 0.14 DUF1143 pdbpercent F Eukaryota T 7ung 13 UB D SPAG8_HUMAN HSD-1,SPERM MEMBRANE PROTEIN 1,SMP-1,SPERM MEMBRANE PROTEIN BS-84 METNESTEGSRSRSRSLDIQPSSEGLGPTSEPFPSSDDSPRSALAAATAAAAAAASAAAATAAFTTAKAAALSTKTPAPCSEFMEPSSDPSLLGEPCAGPGFTHNIAHGSLGFEPVYVSCIAQDTCTTTDHSSNPGPVPGSSSGPVLGSSSGAGHGSGSGSGPGCGSVPGSGSGPGPGSGPGSGPGHGSGSHPGPASGPGPDTGPDSELSPCIPPGFRNLVADRVPNYTSWSQHCPWEPQKQPPWEFLQVLEPGARGLWKPPDIKGKLMVCYETLPRGQCLLYNWEEERATNHLDQVPSMQDGSESFFFRHGHRGLLTMQLKSPMPSSTTQKDSYQPPGNVYWPLRGKREAMLEMLLQHQICKEVQAEQEPTRKLFEVESVTHHDYRMELAQAGTPAPTKPHDYRQEQPETFWIQRAPQLPGVSNIRTLDTPFRKNCSFSTPVPLSLGKLLPYEPENYPYQLGEISSLPCPGGRLGGGGGRMTPF 485 T 0.027 PIP49_C pdbpssm F Eukaryota T 7ung 15 ED,QC F,E CF161_HUMAN Cilia- and flagella-associated protein 161 MAQNVYGPGVRIGNWNEDVYLEEELMKDFLEKRDKGKLLIQRSRRLKQNLLRPMQLSVTEDGYIHYGDKVMLVNPDDPDTEADVFLRGDLSLCMTPDEIQSHLKDELEVPCGLSAVQAKTPIGRNTFIILSVHRDATGQVLRYGQDFCLGITGGFDNKMLYLSSDHRTLLKSSKRSWLQEVYLTDEVSHVNCWQAAFPDPQLRLEYEGFPVPANAKILINHCHTNRGLAAHRHLFLSTYFGKEAEVVAHTYLDSHRVEKPRNHWMLVTGNPRDASSSMLDLPKPPTEDTRAMEQAMGLDTQ 301 T 0.028 zf-RING_5 pdbpssm F Eukaryota T 7ung 32 IP,JP,KP l,m,n FLTOP_HUMAN CILIA- AND FLAGELLA-ASSOCIATED PROTEIN 126 MATNYSANQYEKAFSSKYLQNWSPTKPTKESISSHEGYTQIIANDRGHLLPSVPRSKANPWGSFMGTWQMPLKIPPARVTLTSRTTAGAASLTKWIQKNPDLLKASNGLCPEILGKPHDPDSQKKLRKKSITKTVQQARSPTIIPSSPAANLNSPDELQSSHPSAGHTPGPQRPAKS 177 T 38 Scm3 pdbhh F Eukaryota T 7unh 1 A,B A,B SP2 designed chlorophyll dimer protein SSDEEFKFLATEAKMLITAAERLAGTDPELQEMVALIKKELEQAERTFRNGDKSEAQRQLEFVLTAARAVMNVAAAANAAGTDPELIEMVLRILKQLKEAIRTFQNGDQEEAETQLRFVLRAAIAVAVVAAALVLAGTDPELQEMVKQILEELKQAIETFARGDKEKALTQLLFVAWAAHAVAMIAAAANLAGTDPRLQQQVKEILEKLKEAIETFQKGDEEQAFRQLAEVLAEAALVALRAALTN 246 T 0.018 Cas_DxTHG pdb F T 7uni 1 A,B,C,D A,C,B,D SP2-ZnPPaM designed chlorophyll dimer protein SGSGSSDEEFKFLATEAKMLITAAERLAGTDPELQEMVALIKKELEQAERTFRNGDKSEAQRQLEFVLTAARAVMNVAAAANAAGTDPELIEMVLRILKQLKEAIRTFQNGDQEEAETQLRFVLRAAIAVAVVAAALVLAGTDPELQEMVKQILEELKQAIETFARGDKEKALTQLLFVAWAAHAVAMIAAAANLAGTDPRLQQQVKEILEKLKEAIETFQKGDEEQAFRQLAEVLAEAALVALRAALTN 250 T 0.015 Vps35 pdb F T 7unx 1 A A A0A8E4SKK8_MYXXA Xanthusin-1 NAPEFTQSVCERNSDCDHFCGEGFGHCIRGMYCACM 36 T 0.2 Gamma-thionin pdbhh F Bacteria T 7uny 1 A,B A,D Q8IM47_PLAF7 Cysteine-rich small secreted protein CSS GTQDEKSVKNICVCDFTDKLNFLPLEKTKILCELKPQYGEDIKIIANKEYEINCMNNSKVFCPLKDTFINNTNIKLYSPKLHFEIKDITHKGKNAALYYLKIDEEASDIFFSCSIKPKQVSGLLEGEVRVNLKKHINEEYSIFNEEEDVHVCDFSKGNLDITPSAGFYLKNSRNVSCIYRVIPNKLFLIKLPKLDIVTEKLLPSIVNCLSEFSFINFTLKHVQEGDNYISFNVIFGEFKKHFNLACSLDLSDFQQEPCNLGKTANITFIFSKLENLYFQGDYKDDDDKH 289 T 2 RnlA_toxin_N unppssm F Eukaryota T 7unz 1 A,B B,D Q8IM47_PLAF7 Cysteine-rich small secreted protein CSS, putative GTQDEKSVKNICVCDFTDKLNFLPLEKTKILCELKPQYGEDIKIIANKEYEINCMNNSKVFCPLKDTFINNTNIKLYSPKLHFEIKDITHKGKNAALYYLKIDEEASDIFFSCSIKPKQVSGLLEGEVRVNLKKHINEEYSIFNEEEDVHVCDFSKGNLDITPSAGFYLKNSRNVSCIYRVIPNKLFLIKLPKLDIVTEKLLPSIVNCLSEFSFINFTLKHVQEGDNYISFNVIFGEFKKHFNLACSLDLSDFQQEPCNLGKTANITFIFSKLENLYFQ 279 T 1.9 RnlA_toxin_N pdbpssm F Eukaryota T 7uo3 2 B B HIS-HIS-HIS-HIS EHHHHHH 7 T 6500 zf_CCCH_4 pdbhh F F 7uo8 2 B B HIS-HIS-HIS-HIS-HIS HHHHHH 6 T 6100 zf_CCCH_4 pdbhh F F 7uoa 2 B B MTP-1 YIRLYDYHNC 10 T 2.7 TTR-52 pdbhh F T 7upo 1 A A DHT03 protein A GSSPEEEKLKELLKELKKVLDRLKKILERNDEEIKKSDELDDESLLEDIVELLKEIIKLWKILVELSDILLKLIS 75 T 0.01 DUF713 pdb F T 7upo 2 B B DHT03 protein B SSPVDEIDKEVKKLEEEAKKSQEEVERLKQEVEKASKAGLDHEGDSRIFKKIHDVVTKQIKVIIRLIEVYVRLVEIIL 78 T 0.0017 GAS pdb F T 7upo 3 C C DHT03 protein C GSKQKEAIKVYLELLEVHSRVLKALIEQIKLFIELIKRPDEDLADKVRKSSEELKKIIKEVEKILRKVDDILYKVKS 77 T 0.00071 ALIX_LYPXL_bnd pdb F T 7upp 1 A A DHT03 protein A SEKEKVEELAQRIREQLPDTELAREAQELADEARKSDDSEALKVVYLALRIVQQLPDTELAREALELAKEAVKSTDSEALKVVELALKIVQQLPDTELAKEALKLAKEAVKSTDSEALKVVELALEIVQQLPDTELAKEALELAEEAVKSTDSEALKVVKLALEIVQQLPDTELAREALELAKEAVKSTDSEALKVVYLALRIVQQLPDTELARLALELAKKAVEMTAQEVLEIARAALKAAQAFPNTELAELMLRLAEVAARVMKELERNDEEIKKSDELDDESLLEDIVELLKEIIKLWKILVEVSDVMLKLIS 316 T 0.0023 DCB pdb F T 7upp 2 B B DHT03 protein B SEKEKVEELAQRIREQLPDTELAREAQELADEARKSDDSEALKVVYLALRIVQQLPDTELAREALELAKEAVKSTDSEALKVVELALKIVQQLPDTELAKEALELAKEAVKSTDSEALKVVELALEIVQQLPDTELAKEALKLAKEAVKSTDSEALKVVYLALRIVQQLPDTELAREALELAKEAVKSTDSEQLEVVRLALEIVQLAPDTRLARAALKLAKEAVKSTDQEELKKVKAILRVASEVLKLEEEAKKSQEEVERLKQEVEKASKAGLDHEGDSRIFKKIHDVVTKQIKVILRLIAVYAELVAIIG 312 T 0.0023 DCB pdb F T 7upp 3 C C DHT03 protein C GSKQKEAIKVYLELLEVHSRVLKALIEQIKLFIELIMEPDEDLADKVRKSSEELKKIIKEVEKILRKVDDILEKVKS 77 T 0.0002 Anticodon_2 pdb F T 7upq 1 A,B,C,D A,D,G,J DHT03 protein A SEKEKVEELAQRIREQLPDTELAREAQELADEARKSDDSEALKVVYLALRIVQQLPDTELAREALELAKEAVKSTDSEALKVVYLALRIVQQLPDTELARLALELAKKAVEMTAQEVLEIARAALKAAQAFPNTELAELMLRLAEVAARVMKELERNDEEIKKSDELDDESLLEDIVELLKEIIKLWKILVEVSDVMLKLIS 202 T 0.0022 DCB pdb F T 7upq 2 E,F,G,H B,E,H,K DHT03 protein B GPVDEIDKEVKKLEEEAKKSQEEVERLKQEVEKASKAGLDHEGDSRIFKKIHDVVTKQIKVILRLIAVYAELVAIIG 77 T 0.0035 GAS pdb F T 7upq 3 I,J,K,L C,F,I,L DHT03 protein C GSKQKEAIKVYLELLEVHSRVLKALIEQIKLFIELIKRPDEDLADKVRKSSEELKKIIKEVEKILRKVDDILYKVKS 77 T 0.00071 ALIX_LYPXL_bnd pdb F T 7upr 2 G G Unknown peptide substrate XXXXXXXXXX 10 F F F 7ups 1 A,B,C,D A,B,C,D DOTY_LEGPH DotY (Lpg0294) SNATRDALLKAMQVGETSIEAAEYMATRFEQILTKAKLLPECNDMLEKIKEYAQFVKFKLLSSAQVWSGQERPTSDYQNTQENKAEFLASHLEGLPSGLKLEVAIGDDAKILRGFSSNGKMVEGDQLKTMDGLLEGWLAKNSLAISGGAVVKIDNTGNQTKVDPQEIRQLINDSEKGVAKYFADKGVGMEVAQRTYQEPKALETKREEIRQEIES 215 T 0.019 GPW_gp25 unppercent F Bacteria T 7upt 2 G G Unknown peptide substrate XXXXXXXXXX 10 F F F 7uq2 1 A,B,C,D,E,F A,B,C,D,E,F Y06G_BPT4 Vs.4 SMIEDIKGYKPHTEEKIGKVNAIKDAEVRLGLIFDALYDEFWEALDNCEDCEFAKNYAESLDQLTIAKTKLKEASMWACRAVFQPEEKY 89 T 0.057 DUF1631 unppssm T Viruses T 7ur1 3 C C SARS-CoV-2 Spike-derived peptide S1215-1224 (YIWLGFIAGL) YIWLGFIAGL 10 T 0.24 MtrB pdbhh F T 7ur6 1 A,E,I G,A,F C6G0D7_9HIV1 ENV POLYPROTEIN GPAENLWVTVYYGVPVWKEAKTTLFCASDAKAYEKEVHNVWATHACVPTDPNPQEMVLENVTENFNMWKNDMVDQMHEDVISLWDQSLKPCVKLTPLCVTLNCTNTTVSNGSSNSNANFEEMKNCSFNATTEIKDKKKNEYALFYKLDIVPLNNSSGKYRLINCNTSACTQICPKVTFEPIPIHYCAPAGYAILKCNNKTFNGTGPCNNVSTVQCTHGIKPVVSTQLLLNGSLAEKEIIIRSENLTNNAKTIIVHLNESVGIVCTRPSNMTRKSIRIGPGQTFYALGDIIGDIRQPHCNISKQNWNRTLQQVGRKLAEHFPNRNITFNHSSGGDLEITTHSFNCRGEFFYCNTSGLFNGTYHPNGTYNETAVNSSDTITLQCRIKQIINMWQEVGRCMYAPPIAGNITCNSNITGLLLTRDGGINQTGEEIFRPGGGDMRDNWRSELYKYKVVEIKPLGIAPTKCKRRVVERRRRRR 477 T 1.6E-54 GP120 pdbpssm T Viruses T 7ur7 1 A A 17_bp_sh3 MSEVKELLEEFLKRNKPVRIHHKNGEEIKVRITHIGEDTVEFELNGRTHRINIKDILDVKEWLEHHHHHH 70 T 0.016 DUF2642 pdbhh F T 7ur8 1 A A 170_h_ob MSGDRTRELKVIDYREYDNTVYFILRDGDKIYTIEVSPEEAKKLKPGDWVIVNEDGKLLHVQGSLEHHHHHH 72 T 0.0032 Prot_ATP_OB_N pdbhh F T 7urf 2 B B SHH_HUMAN SHH-N peptide CGPGRGF 7 T 0.019 HH_signal pdbhh F Eukaryota F 7urg 1 A,B A,B M1PRZ0_9CAUD Ribonucleotide reductase MSKPPKELIARTGRVQSWIDDPTSRLPVSCTVFVVEDTMEGENGIEASWRFVSHALRYGAGVAVHLSKLRPKGAENGKGLVASGPVSFAKIYSTLNEILRRGGVYKNGAVVCHLDLSHPDVLEFITASRSELPWVKRCVNINDHWWKEATPTVKNALLEGIKRGDIWLNKTKVDRNGNRIRGNVCLEVYLPSRGTCLLQHVNLGGCELDEIRGAFAQGMSELCELHGKTNVGESGEYLPSETDRQVGLGMLGLANLLRTQGVTYNDFGRALEALNSGRPYPSTPGYVIAQELKAGIQAAAEIAKANKMERAFAIAPTASCSYRYTDLDGYTTCPEIAPPIARQVDRDSGTFGVQSFDYGPVEIASEVGWESYKRVVDGIIRLLDSTGLLHGYSFNSWSDVVTYDEQFIEDWLASPQTSLYYSLQVMGDVQDKSDAYAALDDGDVTAYLESLLNDPVGASPPLAPDCNCGE 470 T 1.5E-35 Ribonuc_red_lgC pdbpercent T Viruses T 7urp 1 A A A0A2N0UYJ0_9FIRM Ribonucleases G and E AVDNLTINATSNICQANGSGTFNVGDKVSVYYLLDTKDAQLEEVQWALTYDKNLLTLDSLTMPEIADGMVNMDDVSGNASNLALYDFAGGKKLVEAVFTVNGTGTTNVDLNVVDLTLGKLNPATGTVDADSEYEAVVNGDMANDLFDHINSDAKVEAYVE 160 T 0.015 Cohesin unppssm F Bacteria T 7us2 2 G P Substrate XXXXXXXXXXXXXX 14 F F F 7uso 3 E,F F,G Peptide Inhibitor AcITVKD-CHO XITVKD 6 T 91 Ribosomal_TL5_C pdbhh F T 7usp 3 E,F F,G Peptide Inhibitor AcITV(Orn)D-CHO ITVXD 5 T 90 BCL_N pdbhh F F 7usq 3 E,F F,G Peptide Inhibitor AcDVPD-CHO XDVPD 5 T 200 Rotavirus_VP7 pdbhh F F 7ust 2 B A P230_PLAF7 Gametocyte surface protein P230 GASTNKEYVCDFTDQLKPTESGPKVKKCEVKVNEPLIKVKIICPLKGSVEKLYDNIEYVPKKSPYVVLTKEETKLKEKLLSKLIYGLLISPTVNEKENNFKEGVIEFTLPPVVHKATVFYFICDNSKTEDDNKKGNRGIVEVYVEPYG 148 T 0.71 DUF2129 pdbpercent F Eukaryota T 7usv 1 A,B A,B P230_PLAF7 Gametocyte surface protein P230 GASTNKEYVCDFTDQLKPTESGPKVKKCEVKVNEPLIKVKIICPLKGSVEKLYDNIEYVPKKSPYVVLTKEETKLKEKLLSKLIYGLLISPTVNEKENNFKEGVIEFTLPPVVHKATVFYFICDNSKTEDDNKKGNRGIVEVYVEPYG 148 T 0.71 DUF2129 pdbpercent F Eukaryota T 7utd 1 A,C,E,G,I,K,M,O A,C,E,G,I,K,M,O A0QUM7_MYCS2 Hydrogenase-2, large subunit LDLFVSPLGRVEGDLDVRVTINDGVVTSAWTEAAMFRGFEIILRGKDPQAGLIVCPRICGICGGSHLYKSAYALDTAWRTHMPPNATLIRNICQACETLQSIPRYFYALFAIDLTNKNYAKSKLYDEAVRRFAPYVGTSYQPGVVLSAKPVEVYAIFGGQWPXSSFMVPGGVMSAPTLSDVTRAIAILEHWNDNWLEKQWLGCSVDRWLENKTWNDVLAWVDENESQYNSDCGFFIRYCLDVGLDKYGQGVGNYLATGTYFEPSLYENPTIEGRNAALIGRSGVFADGRYFEFDQANVTEDVTHSFYEGNRPLHPFEGETIPVNPEDGRRQGKYSWAKSPRYAVPGLGNVPLETGPLARRMAASAPDAETHQDDDPLFADIYNAIGPSVMVRQLARMHEGPKYYKWVRQWLDDLELKESFYTKPVEYAEGKGFGSTEAARGALSDWIVIEDSKIKNYQVVTPTAWNIGPRDASEVLGPIEQALVGSPIVDAEDPVELGHVARSFDSCLVCTVH 513 T 4.1E-19 NiFeSe_Hases pdb F Bacteria T 7utd 3 Q,R,S,T Q,R,S,T A0QUM5_MYCS2 Type 2 [NiFe]-hydrogenase Huc membrane adapter subunit SPVDGIRRRLDDPQVAEALNSLLDHADLLAVLVKGLDGFVRRGDDIANNLTSAIGELKAL 60 T 0.12 CompInhib_SCIN pdb F Bacteria T 7utj 2 G,H,I,J,K,L G,H,I,K,L,Z CTNA1_HUMAN ALPHA E-CATENIN,CADHERIN-ASSOCIATED PROTEIN,RENAL CARCINOMA ANTIGEN NY-REN-13 GPHMTLAVERLLEPLVTQVTTLVNTNSKGPSNKKRGRSKKAHVLAASVEQATENFLEKGDKIAKESQFLKEELVAAVEDVRKQGDLMKAAAGEFADDPCSSVKRGNMVRAARALLSAVTRLLILADMADVYKLLVQLKVVEDGILKLRNAGNEQDLGIQYKALKPEVDKLNIMAAKRQQELKDVGHRDQMAAARGILQKNVPILYTASQACLQHPDVAAYKANRDLIYKQLQQAVTGISNAAQATASDDASQHQGGGGGELAYALNNFDKQIIVDPLSFSEERFRPSLEERLESIISGAALMADSSCTRDDRRERIVAECNAVRQALQDLLSEYMGNAGRKERSDALNSAIDKMTKKTRDLRRQLRKAVMDHVSDSFLETNVPLLVLIEAAKNGNEKEVKEYAQVFREHANKLIEVANLACSISNNEEGVKLVRMSASQLEALCPQVINAALALAAKPQSKLAQENMDLFKEQWEKQVRVLTDAVDDITSIDDFLAVSENHILEDVNKCVIALQEKDVDGLDRTAGAIRGRAARVIHVVTSEMDNYEPGVYTEKVLEATKLLSNTVMPRFTEQVEAAVEALSSDPAQPMDENEFIDASRLVYDGIRDIRKAVLMIRTPEELDDSDFETEDFDVRSRTSVQTEDDQLIAGQSARAIMAQLPQEQKAKIAEQVASFQEEKSKLDAEVSKWDDSGNDIIVLAKQMCMIMMEMTDFTRGKGPLKNTSDVISAAKKIAEAGSRMDKLGRTIADHCPDSACKQDLLAYLQRIALYCHQLNICSKVKAEVQNLGGELVVSGVDSAMSLIQAAKNLMNAVVQTVKASYVASTKYQKSQGMASLNLPAVSWKMKAPEKKPLVKREKQDETQTKIKRASQKKHVNPVQALSEFKAMDSI 889 T 2.9E-97 Vinculin unp F Eukaryota T 7utz 5 E X GNAS2_HUMAN ADENYLATE CYCLASE-STIMULATING G ALPHA PROTEIN GGSLEVLFQGPSGNSKTEDQRNEEKAQREANKKIEKQLQKDKQVYRATHRLLLLGADNSGKSTIVKQMRILHGGSGGSGGTSGIFETKFQVDKVNFHMFDVGGQRDERRKWIQCFNDVTAIIFVVDSSDYNRLQEALNLFKSIWNNRWLRTISVILFLNKQDLLAEKVLAGKSKLEDYFPEFARYTTPEDATPEPGEDPRVTRAKYFIRDEFLRISTASGDGRHYCYPHFTCAVDTENARRIFNDCRDIIQRMHLRQYELL 261 T 5E-10 G-alpha pdb F Eukaryota T 7uuq 1 A,B,C,D,E,F,G A,B,C,D,E,F,G Pyrene-containing peptide fibril XYSPTSPS 8 T 0.045 RNA_pol_Rpb1_R pdbhh F F 7uur 1 A,C C,F A0QUM7_MYCS2 Hydrogenase-2, large subunit LDLFVSPLGRVEGDLDVRVTINDGVVTSAWTEAAMFRGFEIILRGKDPQAGLIVCPRICGICGGSHLYKSAYALDTAWRTHMPPNATLIRNICQACETLQSIPRYFYALFAIDLTNKNYAKSKLYDEAVRRFAPYVGTSYQPGVVLSAKPVEVYAIFGGQWPXSSFMVPGGVMSAPTLSDVTRAIAILEHWNDNWLEKQWLGCSVDRWLENKTWNDVLAWVDENESQYNSDCGFFIRYCLDVGLDKYGQGVGNYLATGTYFEPSLYENPTIEGRNAALIGRSGVFADGRYFEFDQANVTEDVTHSFYEGNRPLHPFEGETIPVNPEDGRRQGKYSWAKSPRYAVPGLGNVPLETGPLARRMAASAPDAETHQDDDPLFADIYNAIGPSVMVRQLARMHEGPKYYKWVRQWLDDLELKESFYTKPVEYAEGKGFGSTEAARGALSDWIVIEDSKIKNYQVVTPTAWNIGPRDASEVLGPIEQALVGSPIVDAEDPVELGHVARSFDSCLVCTVH 513 T 4.1E-19 NiFeSe_Hases pdb F Bacteria T 7uus 1 A,C,E,G,I,K,M,O A,C,E,G,I,K,M,O A0QUM7_MYCS2 Hydrogenase-2, large subunit TELDLFVSPLGRVEGDLDVRVTINDGVVTSAWTEAAMFRGFEIILRGKDPQAGLIVCPRICGICGGSHLYKSAYALDTAWRTHMPPNATLIRNICQACETLQSIPRYFYALFAIDLTNKNYAKSKLYDEAVRRFAPYVGTSYQPGVVLSAKPVEVYAIFGGQWPXSSFMVPGGVMSAPTLSDVTRAIAILEHWNDNWLEKQWLGCSVDRWLENKTWNDVLAWVDENESQYNSDCGFFIRYCLDVGLDKYGQGVGNYLATGTYFEPSLYENPTIEGRNAALIGRSGVFADGRYFEFDQANVTEDVTHSFYEGNRPLHPFEGETIPVNPEDGRRQGKYSWAKSPRYAVPGLGNVPLETGPLARRMAASAPDAETHQDDDPLFADIYNAIGPSVMVRQLARMHEGPKYYKWVRQWLDDLELKESFYTKPVEYAEGKGFGSTEAARGALSDWIVIEDSKIKNYQVVTPTAWNIGPRDASEVLGPIEQALVGSPIVDAEDPVELGHVARSFDSCLVCTVH 515 T 4.1E-19 NiFeSe_Hases pdb F Bacteria T 7uv1 1 A A Q8L5L6_ANAOC Vicilin-like protein GLGFALAKIDPELKQCKHQCKVQRQYDEQQKEQCVKECEKYYKEKKGREREHEHRD 56 T 0.00024 Vicilin_N pdb F Eukaryota T 7uv2 1 A A Q8L5L5_ANAOC Vicilin-like protein GVDEPSTHEPAEKHLSQCMRQCERQEGGQQKQLCRFRCQERYKKERGQHNYKREDD 56 T 0.0011 Vicilin_N pdb F Eukaryota T 7uv3 1 A A VCL_PISVE 7S GLOBULIN,7S SEED STORAGE PROTEIN,7S VICILIN-LIKE PROTEIN PIS V 3,VICILIN PIS V 3 GKTDPELKQCKHQCKVQRQYDEEQKEQCAKGCEKYYKEKKGREQEELE 48 T 0.0024 Vicilin_N pdb F Eukaryota T 7uva 2 B,E B,E KDM2A_MOUSE F-BOX AND LEUCINE-RICH REPEAT PROTEIN 11,F-BOX/LRR-REPEAT PROTEIN 11,JMJC DOMAIN-CONTAINING HISTONE DEMETHYLATION PROTEIN 1A,[HISTONE-H3]-LYSINE-36 DEMETHYLASE 1A MQVHLTHFELEGLRCLVDKLESLPLHKKCVPTGIEDEDALIADVKILLEELASSDPKLALTGVPIVQWP 69 T 0.0031 JHD pdbhh F Eukaryota T 7uve 2 B B B4I1C5_DROSE peptide H3K9me2K14ac TKQTARKSTGGXAPRKQ 17 T 230 WW pdbhh F Eukaryota T 7uvg 1 A A Coh5 HHHHHHENLYFQGVTATSNLFPEKQVTLSADKKTVKVTYMFQSKDKDMLDFQWDMNYDANVLKPTANTTRAKSFEYPKIGSYVWNSLPGVIKANGNTLSLYDTTSKEIVFASAEFEVIDPEATATTVNLDVQVLRLSKVDPATDMEIGDEEVSVADKSIVDQEVFDKYVVANNTVTDPDGSEE 183 T 0.0095 Cohesin pdbpercent F T 7uvh 3 C,F C,F P230_PLAF7 Gametocyte surface protein P230 VGVDELDKIDLSYETTESGDTAVSEDSYDKYASQNTNKEYVCDFTDQLKPTESGPKVKKCEVKVNEPLIKVKIICPLKGSVEKLYDNIEYVPKKSPYVVLTKEETKLKEKLLSKLIYGLLISPTVNEKENNFKEGVIEFTLPPVVHKATVFYFICDNSKTEDDNKKGNRGIVEVYVEPYGGSLKENLYFQGWSHPQFEK 199 T 0.12 tRNA_edit pdbpercent F Eukaryota T 7uvi 3 C,F C,F P230_PLAF7 Gametocyte surface protein P230 VGVDELDKIDLSYETTESGDTAVSEDSYDKYASQNTNKEYVCDFTDQLKPTESGPKVKKCEVKVNEPLIKVKIICPLKGSVEKLYDNIEYVPKKSPYVVLTKEETKLKEKLLSKLIYGLLISPTVNEKENNFKEGVIEFTLPPVVHKATVFYFICDNSKTEDDNKKGNRGIVEVYVEPYGGSLKENLYFQGWSHPQFEK 199 T 0.12 tRNA_edit pdbpercent F Eukaryota T 7uvo 3 C C P230_PLAF7 Gametocyte surface protein P230 VGVDELDKIDLSYETTESGDTAVSEDSYDKYASQNTNKEYVCDFTDQLKPTESGPKVKKCEVKVNEPLIKVKIICPLKGSVEKLYDNIEYVPKKSPYVVLTKEETKLKEKLLSKLIYGLLISPTVNEKENNFKEGVIEFTLPPVVHKATVFYFICDNSKTEDDNKKGNRGIVEVYVEPYGGSLKENLYFQGWSHPQFEK 199 T 0.12 tRNA_edit pdbpercent F Eukaryota T 7uvq 1 A A P230_PLAF7 Gametocyte surface protein P230 VGVDELDKIDLSYETTESGDTAVSEDSYDKYASQNTNKEYVCDFTDQLKPTESGPKVKKCEVKVNEPLIKVKIICPLKGSVEKLYDNIEYVPKKSPYVVLTKEETKLKEKLLSKLIYGLLISPTVNEKENNFKEGVIEFTLPPVVHKATVFYFICDNSKTEDDNKKGNRGIVEVYVEPYGGSLKENLYFQGWSHPQFEK 199 T 0.12 tRNA_edit pdbpercent F Eukaryota T 7uvs 3 C,F C,F P230_PLAF7 Gametocyte surface protein P230 VGVDELDKIDLSYETTESGDTAVSEDSYDKYASQNTNKEYVCDFTDQLKPTESGPKVKKCEVKVNEPLIKVKIICPLKGSVEKLYDNIEYVPKKSPYVVLTKEETKLKEKLLSKLIYGLLISPTVNEKENNFKEGVIEFTLPPVVHKATVFYFICDNSKTEDDNKKGNRGIVEVYVEPYGGSLKENLYFQGWSHPQFEK 199 T 0.12 tRNA_edit pdbpercent F Eukaryota T 7uvv 43 QA m RS13_ACIB5 30S ribosomal protein S13 MARIAGVNIPDNKHAVISLTYIFGIGRHTAKNILAAVGITETTKIRELDDAQLDAIRAEVAKVPTEGDLRREISMNIKRLMDLGCYRGLRHRRSLPVRGQRTKTNARTRKGPRKPIKK 118 F F Bacteria T 7uvw 43 QA m RS13_ACIB5 30S ribosomal protein S13 MARIAGVNIPDNKHAVISLTYIFGIGRHTAKNILAAVGITETTKIRELDDAQLDAIRAEVAKVPTEGDLRREISMNIKRLMDLGCYRGLRHRRSLPVRGQRTKTNARTRKGPRKPIKK 118 F F Bacteria T 7uvx 43 QA m RS13_ACIB5 30S ribosomal protein S13 MARIAGVNIPDNKHAVISLTYIFGIGRHTAKNILAAVGITETTKIRELDDAQLDAIRAEVAKVPTEGDLRREISMNIKRLMDLGCYRGLRHRRSLPVRGQRTKTNARTRKGPRKPIKK 118 F F Bacteria T 7uvy 43 QA m RS13_ACIB5 30S ribosomal protein S13 MARIAGVNIPDNKHAVISLTYIFGIGRHTAKNILAAVGITETTKIRELDDAQLDAIRAEVAKVPTEGDLRREISMNIKRLMDLGCYRGLRHRRSLPVRGQRTKTNARTRKGPRKPIKK 118 F F Bacteria T 7uvz 43 QA m RS13_ACIB5 30S ribosomal protein S13 MARIAGVNIPDNKHAVISLTYIFGIGRHTAKNILAAVGITETTKIRELDDAQLDAIRAEVAKVPTEGDLRREISMNIKRLMDLGCYRGLRHRRSLPVRGQRTKTNARTRKGPRKPIKK 118 F F Bacteria T 7uw1 43 QA m RS13_ACIB5 30S ribosomal protein S13 MARIAGVNIPDNKHAVISLTYIFGIGRHTAKNILAAVGITETTKIRELDDAQLDAIRAEVAKVPTEGDLRREISMNIKRLMDLGCYRGLRHRRSLPVRGQRTKTNARTRKGPRKPIKK 118 F F Bacteria T 7uw9 6 J b V-type proton ATPase subunit AP1 fragment XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 32 F F F 7uw9 10 V r V-type proton ATPase subunit AP2 fragment XXXXXXXXXXXXXXXXXXXXXXXX 24 F F F 7uwa 5 I b V-type proton ATPase subunit AP1 fragment XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 32 F F F 7uwa 9 U r V-type proton ATPase subunit AP2 fragment XXXXXXXXXXXXXXXXXXXXXXXX 24 F F F 7uwb 10 R b V-type proton ATPase subunit AP1 fragment XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 32 F F F 7uwb 15 EA r V-type proton ATPase subunit AP2 fragment XXXXXXXXXXXXXXXXXXXXXXXX 24 F F F 7uwc 10 R b V-type proton ATPase subunit AP1 fragment XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 32 F F F 7uwc 15 EA r V-type proton ATPase subunit AP2 fragment XXXXXXXXXXXXXXXXXXXXXXXX 24 F F F 7uwd 10 R b V-type proton ATPase subunit AP1 fragment XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 32 F F F 7uwd 15 EA r V-type proton ATPase subunit AP2 fragment XXXXXXXXXXXXXXXXXXXXXXXX 24 F F F 7uwi 2 B F Helicon Polypeptide FP01567 ATHRCEWAALHCELVX 16 T 11 Stonin2_N pdbhh F T 7uwo 2 B B Helicon Polypeptide FP05874 PAVMECYEAAFICHYV 16 T 3.1 DUF6117 pdbhh F T 7uwy 1 A A De novo designed small beta-barrel protein 29_bp_sh3 SEVETVLRKAAERNKTVDIHTKSGTTVRVNVKRVDSKSVKVERNGQDLEISLDQITHVDGW 61 T 0.0011 ROF pdbhh F T 7uwz 1 A A De novo designed small beta-barrel protein 33_bp_sh3 MDGFDRGADVTYTDSDGSKKTYKVLSYSGDKVTVQDSDGRTLTFDARLLRVKKWLEHHHHHH 62 T 0.012 DUF2835 pdb F T 7ux5 2 C,D,F,H,J,L B,D,F,H,J,L Helicon FP28136 DPALWQCVFAARYCYEE 17 T 0.27 Zea_mays_MuDR pdbhh F T 7uxe 1 A,B,C,D,E,F,G,H,I,J A,B,C,D,E,F,G,H,I,J A0A2K8I4H6_9CAUD Small terminase MTKFYSPDDLVTPQEFADPHFAAINQKRFDLYIDLRVQGYSSWRVFRAIWGEEHMDGPAQARIFAMESNPYYRKQFKAKLNATKTSDLWNPKTALHELLQMVRDPTVKDSSRLSAIKELNVLAEITFVDESGKTRIGRGLADFYASEAEAQTATVAAAAEANSYVPEGEEGDFPSPTPEPTEEDRANPI 189 T 0.047 DUF2992 pdbpssm T Viruses T 7uxi 2 B B FP19711 DPAWWVCAIAAIECSDV 17 T 3 Ytca pdbhh F T 7uxj 2 E,F,G,H E,F,G,H FP29102 XPECHIEAYWCI 12 T 2.6 DUF6390 pdbhh F T 7uxk 2 B B FP24322 XFECLDAFFSC 11 T 1.9 Mif2_N pdbhh F T 7uxm 2 D,E,F D,E,F FP29092 XDPANQDCHVAAWHCWQR 18 T 5.3 Phage_Cox pdbhh F T 7uxn 2 B B FP29103 XPDCHIRAYVCH 12 T 2.8 DUF3051 pdbhh F T 7uxo 2 B B FP30790 XDPAAADCQWAAFLCRVYX 19 T 9.6 Poty_PP pdbhh F T 7uxp 2 C,D C,D FP28132 XDPALWQCVFAARSCYEE 18 T 0.37 Zea_mays_MuDR pdbhh F T 7uxq 2 C,D C,D FP28135 XDPALWMCVFAARQCYESX 19 T 2.2 TMEM220 pdbhh F T 7uy2 2 C,D C,D Helicon FP06649 FTDCQLAAAVCMTY 14 T 12 ODAPH pdbhh F T 7uy5 4 D E RFA2_TETTS Telomerase holoenzyme Teb2 subunit MSNRVQGGFDNNSGNNQSAQKQQAEKIPQITVPLNCFMINQIVKAAKENPQAHSGNHYEWYGAFENAIITAKFEFLQSINDSPKIMGKLSDSTGCIEVVIQKSKMSDELPEFVQAYEIELQNNGNRHKYVRAMLKMRKNAQIQLLYFSIVNDANEISRHGLDLCLRYLQRKHGIEDFMHMTNDKAHNNHNASAQKVHYQIDRNQQPKEQVLELMRQILKHNPNDQIPKSKIIEFFQSQLNQVQINQILQQLVSANEIFSVGSDNYLLNV 269 T 0.012 HSM3_N pdb F Eukaryota T 7uy5 6 F G TAP50_TETTS P50 MKLLLQNQNIFQKLKNTLNGCIKKFYDTYQDLEQMQKFEMIVEDKLLFRYSCSQSEMFSAQIQAHYLEKRVLQLTDGNVKYIVNFRDKGVLDKANFFDTPNNSLVIIRQWSYEIYYTKNTFQINLVIDEMRCIDIITTIFYCKLELDFTQGIKGISKSSSFSNQIYEYSAQYYKAIQLLKKLLINDSYISELYNSTKSKQQPRLFIFQSFKPKMNLAEQNLSRQFEQCQQDDFGDGCLLQIVNYTHQSLKQIENKNNSNQIVNGQNEISKKKRVLKSNEDLYKISLQKQLKIFQEEEIELHSQSTIRNQTNQQLETFESDTSKRNSEKILHSINELNTSKQKVNQMNSSQHQIQKLENNNLNKNILNQINENDIKNELEERQQQHLTQSFNSKAQLKKIITLKKNQDILLFKPQEQEGSKKY 422 T 0.093 DUF6235 pdb F Eukaryota T 7uy5 9 I I TAP75_TETTS P75 MEIEEDLNLKILEDVKKLYLQSFDYIKNGISSSLPSDKKFLADDDIDLSRITFLYKFISVNPTLLLINEKTQAKRRIFQGEYLYGKKKIQFNIIAKNLEIERELIQFFKKPYQCYIMHNVQVFQMLNKNKNNNVVEFMDSEDLQSSVDCQLYYLIDESSHVLEDDSMDFISTLTRLSDSFNSNEFVFETNYSIQISQMPKPLNTTHFKLLQPKVVNSFEGVILQVQEGKNILQIEELIDQVYLNSRRDRFYILKVANGKNYMDFIEVYLVYDNEDQEAKQQLQFYLKPFQRILIFQSLKHFTKNLKLFMISFFYSSGVQPNNSNVKNFLVSHKGVEFFSRFDIQKNELLCKDLIKSYNKLPLSNISKLLEDEGVMIRSNMKFQVRVKKVKYFKIRLNCLNCKQEWTVGLKNCINCKGQQSYISYNIQVLVQDQHFLEQQAYIYLYDDLAAQFFNITESEKKELHLHLTKNETFIQLYYSFNKDYPLSIIKFKDKIFNKDITNCIVAYPFADIDNKIFNSQQQIIQDENLRIESEKFIQNFTEDNNLQESKLYYEKFKSKNKQQIFVNGTYISTNYSQGQKICLKPIPCLKVMYVFPQEDIKLSALKIIEEINQLKIQIDQLN 622 T 0.08 CDC24_OB3 pdbhh F Eukaryota T 7uy5 10 J K TAP19_TETTS P19 MQQPKRNFDLYKLITDKQIDFQVADLIQDEQSSFVSVRIYGQFKCFVPKSTIQEQLDKIKNLSSKELAKNKIFKFLSEYNKNNQKQDELSHDYYGYFKVQQHQFILNLENAQREASLAVDDFYFINGRIYKTNHDILILQAHHVYQMQKPTLQLLQAASEINQN 164 T 0.53 TMF_DNA_bd pdb F Eukaryota T 7uy5 11 K J TAP45_TETTS P45 MEDNFELVFLKELPSLPDFSKVCFTGLILSFSNFPSSEQNQQKDVPHKIAIIQDSTGEAELFLDMYKFCQEEISVFKAITGIGVLKKKNIGAGQVCKIIVERFRIIHSADEEMLQYLLIQKYKLSKTLNEQQQIKQKEQQINQQKIDKVVQDKESKEHLLWKQQQIPQIKSNQENINTLKYKELIAGELMRITHKLLIQKLQQQQPANNNKQINEMDVESNELAEKKEVIIKIQEIAKDQQLYDTLSIQYQVDQKEQYYAKIAQSLEDFVSISALKMVSYIYPNISYQVSIGFFQNILDIATKTVKDRGALGCNYKYLKDKLTKALNLQQISYPLISESYISYLVHLFQDFNIIEIENEHKFYYKQAFQYDDS 373 T 12 Ten1 pdbhh F Eukaryota T 7uy6 4 D E RFA2_TETTS Telomerase holoenzyme Teb2 subunit MSNRVQGGFDNNSGNNQSAQKQQAEKIPQITVPLNCFMINQIVKAAKENPQAHSGNHYEWYGAFENAIITAKFEFLQSINDSPKIMGKLSDSTGCIEVVIQKSKMSDELPEFVQAYEIELQNNGNRHKYVRAMLKMRKNAQIQLLYFSIVNDANEISRHGLDLCLRYLQRKHGIEDFMHMTNDKAHNNHNASAQKVHYQIDRNQQPKEQVLELMRQILKHNPNDQIPKSKIIEFFQSQLNQVQINQILQQLVSANEIFSVGSDNYLLNV 269 T 0.012 HSM3_N pdb F Eukaryota T 7uy6 6 F G TAP50_TETTS P50 MKLLLQNQNIFQKLKNTLNGCIKKFYDTYQDLEQMQKFEMIVEDKLLFRYSCSQSEMFSAQIQAHYLEKRVLQLTDGNVKYIVNFRDKGVLDKANFFDTPNNSLVIIRQWSYEIYYTKNTFQINLVIDEMRCIDIITTIFYCKLELDFTQGIKGISKSSSFSNQIYEYSAQYYKAIQLLKKLLINDSYISELYNSTKSKQQPRLFIFQSFKPKMNLAEQNLSRQFEQCQQDDFGDGCLLQIVNYTHQSLKQIENKNNSNQIVNGQNEISKKKRVLKSNEDLYKISLQKQLKIFQEEEIELHSQSTIRNQTNQQLETFESDTSKRNSEKILHSINELNTSKQKVNQMNSSQHQIQKLENNNLNKNILNQINENDIKNELEERQQQHLTQSFNSKAQLKKIITLKKNQDILLFKPQEQEGSKKY 422 T 0.093 DUF6235 pdb F Eukaryota T 7uy7 1 A A TAP75_TETTS P75 MEIEEDLNLKILEDVKKLYLQSFDYIKNGISSSLPSDKKFLADDDIDLSRITFLYKFISVNPTLLLINEKTQAKRRIFQGEYLYGKKKIQFNIIAKNLEIERELIQFFKKPYQCYIMHNVQVFQMLNKNKNNNVVEFMDSEDLQSSVDCQLYYLIDESSHVLEDDSMDFISTLTRLSDSFNSNEFVFETNYSIQISQMPKPLNTTHFKLLQPKVVNSFEGVILQVQEGKNILQIEELIDQVYLNSRRDRFYILKVANGKNYMDFIEVYLVYDNEDQEAKQQLQFYLKPFQRILIFQSLKHFTKNLKLFMISFFYSSGVQPNNSNVKNFLVSHKGVEFFSRFDIQKNELLCKDLIKSYNKLPLSNISKLLEDEGVMIRSNMKFQVRVKKVKYFKIRLNCLNCKQEWTVGLKNCINCKGQQSYISYNIQVLVQDQHFLEQQAYIYLYDDLAAQFFNITESEKKELHLHLTKNETFIQLYYSFNKDYPLSIIKFKDKIFNKDITNCIVAYPFADIDNKIFNSQQQIIQDENLRIESEKFIQNFTEDNNLQESKLYYEKFKSKNKQQIFVNGTYISTNYSQGQKICLKPIPCLKVMYVFPQEDIKLSALKIIEEINQLKIQIDQLN 622 T 0.08 CDC24_OB3 pdbhh F Eukaryota T 7uy7 2 B B TAP45_TETTS P45 MEDNFELVFLKELPSLPDFSKVCFTGLILSFSNFPSSEQNQQKDVPHKIAIIQDSTGEAELFLDMYKFCQEEISVFKAITGIGVLKKKNIGAGQVCKIIVERFRIIHSADEEMLQYLLIQKYKLSKTLNEQQQIKQKEQQINQQKIDKVVQDKESKEHLLWKQQQIPQIKSNQENINTLKYKELIAGELMRITHKLLIQKLQQQQPANNNKQINEMDVESNELAEKKEVIIKIQEIAKDQQLYDTLSIQYQVDQKEQYYAKIAQSLEDFVSISALKMVSYIYPNISYQVSIGFFQNILDIATKTVKDRGALGCNYKYLKDKLTKALNLQQISYPLISESYISYLVHLFQDFNIIEIENEHKFYYKQAFQYDDS 373 T 12 Ten1 pdbhh F Eukaryota T 7uy7 3 C C TAP19_TETTS P19 MQQPKRNFDLYKLITDKQIDFQVADLIQDEQSSFVSVRIYGQFKCFVPKSTIQEQLDKIKNLSSKELAKNKIFKFLSEYNKNNQKQDELSHDYYGYFKVQQHQFILNLENAQREASLAVDDFYFINGRIYKTNHDILILQAHHVYQMQKPTLQLLQAASEINQN 164 T 0.53 TMF_DNA_bd pdb F Eukaryota T 7uy7 4 D D TAP50_TETTS P50 MKLLLQNQNIFQKLKNTLNGCIKKFYDTYQDLEQMQKFEMIVEDKLLFRYSCSQSEMFSAQIQAHYLEKRVLQLTDGNVKYIVNFRDKGVLDKANFFDTPNNSLVIIRQWSYEIYYTKNTFQINLVIDEMRCIDIITTIFYCKLELDFTQGIKGISKSSSFSNQIYEYSAQYYKAIQLLKKLLINDSYISELYNSTKSKQQPRLFIFQSFKPKMNLAEQNLSRQFEQCQQDDFGDGCLLQIVNYTHQSLKQIENKNNSNQIVNGQNEISKKKRVLKSNEDLYKISLQKQLKIFQEEEIELHSQSTIRNQTNQQLETFESDTSKRNSEKILHSINELNTSKQKVNQMNSSQHQIQKLENNNLNKNILNQINENDIKNELEERQQQHLTQSFNSKAQLKKIITLKKNQDILLFKPQEQEGSKKY 422 T 0.093 DUF6235 pdb F Eukaryota T 7uyg 1 A A Q5ZTB4_LEGPH LotA GPMAKTIKATGDGACLFNAVSIGLSVEILSGRLDSQLDTPGYQALLDEFAKHHPQFNPKSWKTLKEWLAYYNDTRDIELILAPVLFNLNQKYQDHLDEEILNELTNLVWKNKANIENGQAWFQLQNTGDLGEALFPKLENLDLKKDRAPLLDKLREILKDYKLELTRENVKQFLTEKAKELLSALKKKISSDPHAFQRGYSCDELKGMTDALAISLVENREEDITDNRIKIRLENQEEHWNVLCNEEDSERFLDSTPSRLKMTSLEAYRGDKQVSAPTSEQLDLIEEPGVFLRERT 296 T 0.066 OTU pdbhh F Bacteria T 7uyh 2 B A Q5ZTB4_LEGPH LotA GPMAKTIKATGDGAALFNAVSIGLSVEILSGRLDSQLDTPGYQALLDEFAKHHPQFNPKSWKTLKEWLAYYNDTRDIELILAPVLFNLNQKYQDHLDEEILNELTNLVWKNKANIENGQAWFQLQNTGDLGEALFPKLENLDLKKDRAPLLDKLREILKDYKLELTRENVKQFLTEKAKELLSALKKKISSDPHAFQRGYSCDELKGMTDALAISLVENREEDITDNRIKIRLENQEEHWNVLCNEEDSERFLDSTPSRLKMTSLEAYRGDKQVSAPT 278 T 0.093 OTU pdbhh F Bacteria T 7uyj 2 C,D C,D Helicon FP06652 DPAIVQCAWAALYCDMQ 17 T 1.6 UPF0139 pdbhh F T 7uyk 2 C,D D,C Helicon FP06655 DPAMQRCFSAAVYCAIS 17 T 1.3 CCSAP pdbhh F T 7uym 4 D P CSP_PLAF7 CS NANPNANPNANP 12 T 0.48 Cas_Cas7 pdbhh F Eukaryota F 7uyx 1 A,B,C,D A,B,C,D A0A4D6BFJ2_9CAUD Bacteriophage PA1C gp2 SNAMTAVNYPFVDTMDKFDKITKGLIFEHQAEGESETMISHELSILDNDGVVHSLHFSQITSLIDTITGKHPSLELPPQLFLITQYLLEDLKEVGEKGFVITEYFIDVLPTGNKAIFRGTLAHKSTVDGHPDFDPSSTISKKEFEFSLNQFSILQQIALSHCIANLHEECAGFRGTFDVEYTFHWTPFAFNVKFSE 196 T 0.03 HTH_5 unppssm T Viruses T 7uz1 1 A,B A,B A0A0E3K5E4_SACSO GLYCOSIDE HYDROLASE FAMILY 1 PROTEIN MYSFPNSFRFGWSQAGFQSEMGTPGSEDPNTDWYKWVHDPENMAAGLVSGDLPENGPGYWGNYKTFHDNAQKMGLKIARLNVEWSRIFPNPLPRPQNFDESKQDVTEVEINENELKRLDEYANKDALNHYREIFKDLKSRGLYFILNMYHWPLPLWLHDPIRVRRGDFTGPSGWLSTRTVYEFARFSAYIAWKFDDLVDEYSTMNEPNVVGGLGYVGVKSGFPPGYLSFELSRRHMYNIIQAHARAYDGIKSVSKKPVGIIYANSSFQPLTDKDMEAVEMAENDNRWWFFDAIIRGEITRGNEKIVRDDLKGRLDWIGVNYYTRTVVKRTEKGYVSLGGYGHGCERNSVSLAGLPTSDFGWEFFPEGLYDVLTKYWNRYHLYMYVTENGIADDADYQRPYYLVSHVYQVHRAINSGADVRGYLHWSLADNYEWASGFSMRFGLLKVDYNTKRLYWRPSALVYREIATNGAITDEIEHLNSVPPVKPLRH 489 T 1.3E-42 Glyco_hydro_1 unppercent F Archaea T 7uz2 1 A,B A,B A0A0E3K5E4_SACSO GLYCOSIDE HYDROLASE FAMILY 1 PROTEIN MYSFPNSFRFGWSQAGFQSEMGTPGSEDPNTDWYKWVHDPENMAAGLVSGDLPENGPGYWGNYKTFHDNAQKMGLKIARLNVEWSRIFPNPLPRPQNFDESKQDVTEVEINENELKRLDEYANKDALNHYREIFKDLKSRGLYFILNMYHWPLPLWLHDPIRVRRGDFTGPSGWLSTRTVYEFARFSAYIAWKFDDLVDEYSTMNEPNVVGGLGYVGVKSGFPPGYLSFELSRRHMYNIIQAHARAYDGIKSVSKKPVGIIYANSSFQPLTDKDMEAVEMAENDNRWWFFDAIIRGEITRGNEKIVRDDLKGRLDWIGVNYYTRTVVKRTEKGYVSLGGYGHGCERNSVSLAGLPTSDFGWEFFPEGLYDVLTKYWNRYHLYMYVTENGIADDADYQRPYYLVSHVYQVHRAINSGADVRGYLHWSLADNYEWASGFSMRFGLLKVDYNTKRLYWRPSALVYREIATNGAITDEIEHLNSVPPVKPLRH 489 T 1.3E-42 Glyco_hydro_1 unppercent F Archaea T 7uzl 1 A A Cyclic peptide D9.16 DPR-MAA-ALA-DVA-MLE-LEU-LEU-PRO-DLE XXAXXLLPX 9 T 15 PelD_GGDEF pdbhh F F 7v0e 1 A,B,C,D,E,F,G,H A,B,C,D,E,F,G,H DNM3B_HUMAN DNMT3B,DNA METHYLTRANSFERASE HSAIIIB,DNA MTASE HSAIIIB,M.HSAIIIB AARRRPIRVLSLFDGIATGYLVLKELGIKVGKYVASEVCEESIAVGTVKHEGNIKYVNDVRNITKKNIEEWGPFDLVIGGSPCNDLSNVNPARKGLYEGTGRLFFEFYHLLNYSRPKEGDDRPFFWMFENVVAMKVGDKRDISRFLECNPVMIDAIKVSAAHRARYFWGNLPGMNRPVIASKNDKLELQDCLEYNRIAKLKKVQTITTKSNSIKQGKNQLFPVVMNGKEDVLWCTELERIFGFPVHYTDVSNMGRGARQKLLGRSWSVPVIRHLFAPLKDYFACE 285 T 6.1E-16 DNA_methylase unppercent F Eukaryota T 7v0n 2 E,G,I,K E,G,I,K IgG 21 Fab heavy chain XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 235 F F F 7v0n 3 F,H,J,L F,H,J,L IgG 21 Fab light chain XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 216 F F F 7v0o 2 E,G,I,K E,G,I,K IgG 94 Fab heavy chain XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 222 F F F 7v0o 3 F,H,J,L F,H,J,L IgG 94 Fab light chain XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 211 F F F 7v0p 2 E,G,I,K E,G,I,K IgG 106 Fab heavy chain XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXAAA 229 T 15000 EF-hand_5 pdbhh F F 7v0p 3 F,H,J,L F,H,J,L IgG 106 Fab light chain XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 212 F F F 7v1a 2 B B ASP-ILE-ASP-GLN-MET-PHE-SER-THR-LEU-LEU-GLY-GLU-MK8-ASP-LEU-LEU-MK8-GLN-SER DIDQMFSTLLGEXDLLXQS 19 T 4.8 Caskin-tail pdbhh F T 7v1y 2 E,F,G,H E,F,G,H ALA-ALA-B3S AAX 3 T 1000 DUF3774 pdbhh F F 7v3v 7 M H CDC7_YEAST Cell division control protein 7 MTSKTKNIDDIPPEIKEEMIQLYHDLPGIENEYKLIDKIGEGTFSSVYKAKDITGKITKKFASHFWNYGSNYVALKKIYVTSSPQRIYNELNLLYIMTGSSRVAPLCDAKRVRDQVIAVLPYYPHEEFRTFYRDLPIKGIKKYIWELLRALKFVHSKGIIHRDIKPTNFLFNLELGRGVLVDFGLAEAQMDYKSMISSQNDYDNYANTNHDGGYSMRNHEQFCPCIMRNQYSPNSHNQTPPMVTIQNGKVVHLNNVNGVDLTKGYPKNETRRIKRANRAGTRGFRAPEVLMKCGAQSTKIDIWSVGVILLSLLGRRFPMFQSLDDADSLLELCTIFGWKELRKCAALHGLGFEASGLIWDKPNGYSNGLKEFVYDLLNKECTIGTFPEYSVAFETFGFLQQELHDRMSIEPQLPDPKTNMDAVDAYELKKYQEEIWSDHYWCFQVLEQCFEMDPQKRSSAEDLLKTPFFNELNENTYLLDGESTDEDDVVSSSEADLLDKDVLLISE 507 T 7.6E-21 Pkinase pdbpssm F Eukaryota T 7v4w 3 C C MUC1_HUMAN MUC1-NT,MUC1-ALPHA RPAPGSTAPPAHG 13 T 43 TOH_N pdbhh F Eukaryota T 7v5e 1 A A ILE-GLN-CYS-CYS-ARG-CYS-GLN-SER-TRP-PRO-TYR-MET-CYS-SER-VAL-PHE-CYS-CYS IQCCRCQSWPYMCSVFCC 18 T 0.32 zf-CW pdbhh F T 7v5f 1 A A Wisotide ADCTEYCSNSCPFCNGQPLYQLCCINNCCPS 31 T 0.11 Radical_SAM_2 pdbhh F T 7v64 3 C C MUC1_HUMAN MUC1-NT,MUC1-ALPHA RPAPGSTAPPAHG 13 T 43 TOH_N pdbhh F Eukaryota T 7v7k 3 C C MUC1_HUMAN MUC1-NT,MUC1-ALPHA RPAPGSTAPPAHG 13 T 43 TOH_N pdbhh F Eukaryota T 7v8o 1 A,B A,B A0A1L1QK40_9PSEU Cyclohexanone Monooxygenase from Thermocrispum municipale MSTTQTPDLDAIVIGAGFGGIYMLHKLRNDLGLSVRVFEKGGGVGGTWYWNKYPGAKSDTEGFVYRYSFDKELLREYDWTTRYLDQPDVLAYLEHVVERYDLARDIQLNTEVTDAIFDEETELWRVTTAGGETLTARFLVTALGLLSRSNIPDIPGRDSFAGRLVHTNAWPEDLDITGKRVGVIGTGSTGTQFIVAAAKMAEQLTVFQRTPQYCVPSGNGPMDPDEVARIKQNFDSIWDQVRSSTVAFGFEESTVEAMSVSESERQRVFQQAWDKGNGFRFMFGTFCDIATNPEANAAAAAFIRSKIAEIVKDPETARKLTPTDLYAKRPLCNEGYYETYNRDNVSLVSLKETPIEEIVPQGVRTSDGVVHELDVLVFATGFDAVDGNYRAMNLRGRDGRHINEHWTEGPTSYLGVTKAGFPNMFMILGPNGPFTNTPPSIEAQVEWISDLIDKATREGLTTVEPTADAEREWTETCAEIANMTLFPKADSWIFGANIPGKRHAVMFYLGGLGNYRRQLADVADGGYRGFQLRGERAQAVA 541 T 0.11 Pyr_redox_2 unppercent F Bacteria T 7v8q 3 G,H,I G,H,I MUC1_HUMAN MUC1 PEPTIDE RPAPGSTAPPAHG 13 T 43 TOH_N pdbhh F Eukaryota T 7v8r 1 A A A0A1L1QK40_9PSEU Cyclohexanone Monooxygenase from Thermocrispum municipale MSTTQTPDLDAIVIGAGFGGIYMLHKLRNDLGLSVRVFEKGGGVGGTWYWNKYPGAKSDTEGFVYRYSFDKELLREYDWTTRYLDQPDVLAYLEHVVERYDLARDIQLNTEVTDAIFDEETELWRVTTAGGETLTARFLVTALGLLSRSNIPDIPGRDSFAGRLVHTNAWPEDLDITGKRVGVIGTGSTGTQFIVAAAKMAEQLTVFQRTPQYCVPSGNGPMDPDEVARIKQNFDSIWDQVRSSTVAFGFEESTVEAMSVSESERQRVFQQAWDKGNGFRFMFGTFCDIATNPEANAAAAAFIRSKIAEIVKDPETARKLTPTDLYAKRPLCNEGYYETYNRDNVSLVSLKETPIEEIVPQGVRTSDGVVHELDVLVFATGFDAVDGNYRAMNLRGRDGRHINEHWTEGPTSYLGVTKAGFPNMFMILGPNGPFTNTPPSIEAQVEWISDLIDKATREGLTTVEPTADAEREWTETCAEIANMTLFPKADSWIFGANIPGKRHAVMFYLGGLGNYRRQLADVADGGYRGFQLRGERAQAVA 541 T 0.11 Pyr_redox_2 unppercent F Bacteria T 7v8s 1 A,B A,B A0A1L1QK40_9PSEU Cyclohexanone Monooxygenase from Thermocrispum municipale MSTTQTPDLDAIVIGAGFGGIYMLHKLRNDLGLSVRVFEKGGGVGGTWYWNKYPGAKSDTEGFVYRYSFDKELLREYDWTTRYLDQPDVLAYLEHVVERYDLARDIQLNTEVTDAIFDEETELWRVTTAGGETLTARFLVTALGLLSRSNIPDIPGRDSFAGRLVHTNAWPEDLDITGKRVGVIGTGSTGTQFIVAAAKMAEQLTVFQRTPQYCVPSGNGPMDPDEVARIKQNFDSIWDQVRSSTVAFGFEESTVEAMSVSESERQRVFQQAWDKGNGFRFMFGTFCDIATNPEANAAAAAFIRSKIAEIVKDPETARKLTPTDLYAKRPLCNEGYYETYNRDNVSLVSLKETPIEEIVPQGVRTSDGVVHELDVLVFATGFDAVDGNYRAMNLRGRDGRHINEHWTEGPTSYLGVTKAGFPNMFMILGPNGPFTNTPPSIEAQVEWISDLIDKATREGLTTVEPTADAEREWTETCAEIANMTLFPKADSWIFGANIPGKRHAVMFYLGGLGNYRRQLADVADGGYRGFQLRGERAQAVA 541 T 0.11 Pyr_redox_2 unppercent F Bacteria T 7v93 1 A A cas12c2 MKIEEGKGHHHHHHMTKHSIPLHAFRNSGADARKWKGRIALLAKRGKETMRTLQFPLEMSEPEAAAINTTPFAVAYNAIEGTGKGTLFDYWAKLHLAGFRFFPSGGAATIFRQQAVFEDASWNAAFCQQSGKDWPWLVPSKLYERFTKAPREVAKKDGSKKSIEFTQENVANESHVSLVGASITDKTPEDQKEFFLKMAGALAEKFDSWKSANEDRIVAMKVIDEFLKSEGLHLPSLENIAVKCSVETKPDNATVAWHDAPMSGVQNLAIGVFATCASRIDNIYDLNGGKLSKLIQESATTPNVTALSWLFGKGLEYFRTTDIDTIMQDFNIPASAKESIKPLVESAQAIPTMTVLGKKNYAPFRPNFGGKIDSWIANYASRLMLLNDILEQIEPGFELPQALLDNETLMSGIDMTGDELKELIEAVYAWVDAAKQGLATLLGRGGNVDDAVQTFEQFSAMMDTLNGTLNTISARYVRAVEMAGKDEARLEKLIECKFDIPKWCKSVPKLVGISGGLPKVEEEIKVMNAAFKDVRARMFVRFEEIAAYVASKGAGMDVYDALEKRELEQIKKLKSAVPERAHIQAYRAVLHRIGRAVQNCSEKTKQLFSSKVIEMGVFKNPSHLNNFIFNQKGAIYRSPFDRSRHAPYQLHADKLLKNDWLELLAEISATLMASESTEQMEDALRLERTRLQLQLSGLPDWEYPASLAKPDIEVEIQTALKMQLAKDTVTSDVLQRAFNLYSSVLSGLTFKLLRRSFSLKMRFSVADTTQLIYVPKVCDWAIPKQYLQAEGEIGIAARVVTESSPAKMVTEVEMKEPKALGHFMQQAPHDWYFDASLGGTQVAGRIVEKGKEVGKERKLVGYRMRGNSAYKTVLDKSLVGNTELSQCSMIIEIPYTQTVDADFRAQVQAGLPKVSINLPVKETITASNKDEQMLFDRFVAIDLGERGLGYAVFDAKTLELQESGHRPIKAITNLLNRTHHYEQRPNQRQKFQAKFNVNLSELRENTVGDVCHQINRICAYYNAFPVLEYMVPDRLDKQLKSVYESVTNRYIWSSTDAHKSARVQFWLGGETWEHPYLKSAKDKKPLVLSPGRGASGKGTSQTCSCCGRNPFDLIKDMKPRAKIAVVDGKAKLENSELKLFERNLESKDDMLARRHRNERAGMEQPLTPGNYTVDEIKALLRANLRRAPKNRRTKDTTVSEYHCVFSDCGKTMHADENAAVNIGGKFIADIEK 1232 T 3.3E-05 RuvC_1 pdbhh F T 7v94 1 A A Cas12c2 MKIEEGKGHHHHHHMTKHSIPLHAFRNSGADARKWKGRIALLAKRGKETMRTLQFPLEMSEPEAAAINTTPFAVAYNAIEGTGKGTLFDYWAKLHLAGFRFFPSGGAATIFRQQAVFEDASWNAAFCQQSGKDWPWLVPSKLYERFTKAPREVAKKDGSKKSIEFTQENVANESHVSLVGASITDKTPEDQKEFFLKMAGALAEKFDSWKSANEDRIVAMKVIDEFLKSEGLHLPSLENIAVKCSVETKPDNATVAWHDAPMSGVQNLAIGVFATCASRIDNIYDLNGGKLSKLIQESATTPNVTALSWLFGKGLEYFRTTDIDTIMQDFNIPASAKESIKPLVESAQAIPTMTVLGKKNYAPFRPNFGGKIDSWIANYASRLMLLNDILEQIEPGFELPQALLDNETLMSGIDMTGDELKELIEAVYAWVDAAKQGLATLLGRGGNVDDAVQTFEQFSAMMDTLNGTLNTISARYVRAVEMAGKDEARLEKLIECKFDIPKWCKSVPKLVGISGGLPKVEEEIKVMNAAFKDVRARMFVRFEEIAAYVASKGAGMDVYDALEKRELEQIKKLKSAVPERAHIQAYRAVLHRIGRAVQNCSEKTKQLFSSKVIEMGVFKNPSHLNNFIFNQKGAIYRSPFDRSRHAPYQLHADKLLKNDWLELLAEISATLMASESTEQMEDALRLERTRLQLQLSGLPDWEYPASLAKPDIEVEIQTALKMQLAKDTVTSDVLQRAFNLYSSVLSGLTFKLLRRSFSLKMRFSVADTTQLIYVPKVCDWAIPKQYLQAEGEIGIAARVVTESSPAKMVTEVEMKEPKALGHFMQQAPHDWYFDASLGGTQVAGRIVEKGKEVGKERKLVGYRMRGNSAYKTVLDKSLVGNTELSQCSMIIEIPYTQTVDADFRAQVQAGLPKVSINLPVKETITASNKDEQMLFDRFVAIDLGERGLGYAVFDAKTLELQESGHRPIKAITNLLNRTHHYEQRPNQRQKFQAKFNVNLSELRENTVGDVCHQINRICAYYNAFPVLEYMVPDRLDKQLKSVYESVTNRYIWSSTDAHKSARVQFWLGGETWEHPYLKSAKDKKPLVLSPGRGASGKGTSQTCSCCGRNPFDLIKDMKPRAKIAVVDGKAKLENSELKLFERNLESKDDMLARRHRNERAGMEQPLTPGNYTVDEIKALLRANLRRAPKNRRTKDTTVSEYHCVFSDCGKTMHADENAAVNIGGKFIADIEK 1232 T 3.3E-05 RuvC_1 pdbhh F T 7v9b 2 B B FOXO3_HUMAN ARG-ARG-ARG-ALA-VAL-SEP-MET-ASP-ASN-SER-ASN RRRAVSMDNSN 11 T 18 Carla_C4 pdbhh F Eukaryota T 7v9x 2 B C RIB86_ECOLX retron St85 family effector protein MNKKFTDEQQQQLIGHLTKKGFYRGANIKITIFLCGGDVANHQSWRHQLSQFLAKFSDVDIFYPEDLFDDLLAGQGQHSLLSLENILAEAVDVIILFPESPGSFTELGAFSNNENLRRKLICIQDAKFKSKRSFINYGPVRLLRKFNSKSVLRCSSNELKEMCDSSIDVARKLRLYKKLMASIKKVRKENKVSKDIGNILYAERFLLPCIYLLDSVNYRTLCELAFKAIKQDDVLSKIIVRSVVSRLINERKILQMTDGYQVTALGASYVRSVFDRKTLDRLRLEIMNFENRRKSTFNYDKIPYAHP 307 T 0.05 Stork_head pdb F Bacteria T 7vac 3 G,H,I G,H,I MUC1_HUMAN MUC1 PEPTIDE RPAPGSTAPPAHG 13 T 43 TOH_N pdbhh F Eukaryota T 7vaz 3 C,F,I G,E,I MUC1_HUMAN MUC1 PEPTIDE RPAPGSTAPPAHG 13 T 43 TOH_N pdbhh F Eukaryota T 7vbl 1 A Q F1S1A8_PIG NADH-UBIQUINONE OXIDOREDUCTASE 49 KDA SUBUNIT,COMPLEX I-49KD ARQWQPDVEWAEQFGGAVMYPTKETAHWKPPPWNDVDPPKDTLV 44 T 13 eIF_4G1 pdbhh F Eukaryota T 7vbp 1 A Q A0A4X1VKC6_PIG NADH-UBIQUINONE OXIDOREDUCTASE 49 KDA SUBUNIT,COMPLEX I-49KD ARQWQPDVEWAEQFGGAVMYPTKETAHWKPPPWNDVDPPK 40 T 14 eIF_4G1 pdbhh F Eukaryota T 7vc0 1 A Q F1S1A8_PIG NADH-UBIQUINONE OXIDOREDUCTASE 49 KDA SUBUNIT,COMPLEX I-49KD ARQWQPDVEWAEQFGGAVMYPTKETAHWKPPPWNDVDPPKDTLV 44 T 13 eIF_4G1 pdbhh F Eukaryota T 7vcf 1 A A YCF78_CHLRE UNCHARACTERIZED MEMBRANE PROTEIN YCF78 MITFTFMSLVTSVKDYVEITHKLIEIEPLKNYTEFGAVFTYFIFSIGEFFKNFFSFSFLNNIWSIPIIIPDIASAMISEVSVLDGYFHNAFTFLETSVNTTTNPSLVIFEKFVIGIINSLFLILPTSTSHLITLRRFVMQGLEAGYMAGLGTLAGNFLWLASIILGWRFFVIPWLSLDIFRYLLGFVLLVKYIWDSSKERRMALEDLSKWKIFLLNFLLALTEQSCIYPFISNLSFGPDASILEGFPVDNYPQFLLIHGAYLLGILFGSFSLLQFTCWFWENPAFSIYLWITTKSSLKISTSSYYKILNFTFLYATMLCAIASIPYYGLDYTITNPIGLVPQDRILNQKKSQSDPDKLITETAFLNLNPTDKNSRIRDGVHARRERWKQRLIKYQAFDASTYDQGVYDFLTIEDLNYGFDRFWLRRKMRNHQIRFRLFPGPWMRSLKKQLNNPANPSLETSTKAASGPRVEFFRILFEQFYHPNFHDRAAMQTNPAEARNKFISTSPLASTESKKALNSTFSLGNINNSSTGIEGLVLTNTQATLLPTDLQTKRTIKPGLIYTNSALRKFVRNVNTRLNLKLLNSKETNLTTKYKSQFIYSKRWKSIFSKIQPLQNGTTRKSYQLFRNVAKQILVTPDAKSLKLITINQKLSLKERKLLELRTQYNNNSTLTTTAPLTLVRPLNVYLQKEEAFKRKLRYYGTMPMRKLTVGNQAPYFKALMKRGFYYYKPTLRWRKTLYVASLRRGFRKKSRKQRILVMPSNQQNFNNTLDNTKTNINQNNLANPLGGNEVPMYGADGENSLITKPTHSYTVLGKRASRYRHQIYKDVLQHWYYTPFNRLLMKFDVDAFINRQPKSHFLTKNEERALHIRRFLLSEHYDTLRWYTYMQHYKTMKTNIGGTKSFANRAYNQQFQGTFKKIRHLFAITPKQGDFYTLKFDQPLYNDNKLKDNLYFHEELLTDYYNGTNLQTNQTSNISVNSTTTFIDNSLRTTQLPVPSSSFDIVNQSSTLIGLTTMQNALRKNVVESTLTSLNSDGEAATSQPKLNFVYSELFVKLIKECKKRIHDQTFLKNYITHRIEKREQLNQEQTKELNKRLEKLKVWLNSDKGSISKLQNTPVQDPNISSPDKVLTTAMQKAVNESISLSGIMPSDKIKTTYGNLTNAYTIKTENAILTKLNVINQLTNNETTTQKNTLIKSIGVNKIQTVLQTIITNFKSSLYNQTQLLRVKTDKDLQWWRTKQRVITKRKSARKRDRFKKQIAVVNKKLAALSKKVETEKSNLYQTLYGNYEISDYLLRNVPTGSSAVIDSTVLRKKQDNQAYLPKETNNVQFNSFVDSNNNVWQTFFAKKLRKKISSKGRRYRSLSLARYLTATRKPRLVGLDNLTKIDNITTLQGAFITKEEKQDSLNLTIQRKQELTNSLKKSQIKKRSRHSWKKRSRHQFSRNHYKYRKRHTHGNGKLRVMNKKLKKFKATNELRQWWWNSFLPRYLSNLQVNNSTLTNKNVSFKPLSNTNSVPSTNMASPTTSRNLLDNLNSSNQISTSASMNQNIVTESVKVETNQVYLPEGEKSFDITSMTTTLPFYAGWDESLKKFVVTNRLLSRRDAGLSVNNNPQEINFTNPPIQGLNEGSFLYWQTEMPFNSYNIDQFITTNQSFYAPLGWRRFEFRHSILKTWVNNTKAGNNNIKKKTLIISLKNLQPLKSSQQKQNQIKTKKLVARRIKKRYKLLKQMPNQLMYSPTGPLLTEVLPSHYISVFDQQYRLPRNRYLKRNPLKTLKKTTLLALMDSSKQTNGVNKEFTLRKRVKPRRKYHRKRFIKKDGLIFPRRTKFNTNTTLTGNALITNNVNSIEEDDLRWRPSSRTKQKRKDNTRSSAASKTKSNKRVKTNPLRLRQLRRREFQQVLKPLQRYIPQNGGFTWPGDYLRLEIVEMPKLKSINIKKTSLKQKINVQPVGIMPRKYLIEKHNIKVLKKKLSQAYSTQQLTKVVQEYKNLIQNSPPAI 1995 T 2E-05 Ycf1 pdbhh F Eukaryota T 7vcf 3 C C A0A2K3D4W3_CHLRE Toc52 MADGPSPIRIVLWNDGGESLAAGVEDEEQQQVLHSFADLVGSAIDAVLELPQFRHVEAVTAEAEEDEPGLSIGFDAGSGDGEVDIDNLKGRLDIAGLLLGSAQLPEELAEVAAVEVTDEEEGTTELQFTDEGLVQQLQAVVKRAKLEKRYNDWVAGVAESLGPALDAAAGGVEVTEMPVDPYDVLQAVVAQLIRVAGVSPPAPSLFSRTGALVGGVLGAPRSAVRQVTKRLGRAQRLWWRLEDVVVDGSKLALRLAVKAARPVLVGFVLHRVLKTLDRSRQLEYRLARMGPEEAREAYYEAVLGKDWKQQLQADWDKALEDVDAGLVTDEINHEKRLMTAAQLRRLEVEEWDKQRMKNFYLASFGGLRWFDQMEQALHNPLFIESRGWTDPVQNWVGQNRTYMDDLPAGQYMAGVGNAAIRIKEAELKRKLTDVERAHVLARGGAVAGGLLPQQPTDPATLAVAVGGAFVPSVAGKR 477 T 0.46 AXH pdbpssm F Eukaryota T 7vcf 4 D D A8J5D4_CHLRE Tic13 MSSDVQAKLSGLLGDIGVKCTLAFAGTVAAGAAIVVPSGKQVEAASLDIYGRPPSQLLPNERRAAEFAAGHRRWKGFVDNSIYSWTRTLPGHDNPIVNPYKGPRRPQRPQQKLEEEVEAAAKQE 124 T 3.9 DUF6460 pdbhh F Eukaryota T 7vcf 8 H I A8J6H7_CHLRE Toc39 MGASQESELDFVPRLSFLPIEWRSIGSAFGLKDKSGAAANGRATFTVRQGVDAAELTSTGRVIDGQADVGASLKLNTLAIGVSASNITFHSGLDDPTAAAAQRSSLIPSLKLTAAKQFKRDNYIAVSYDLKHQKPELSACWTGEAGADRATLLVNVDPVMRSVKLAAAVRTPGPEWRKVLYNDETDLLEYPADDGARHTLYVQHEVRGRDLLHATRLGCRLDLGRLVNYVVDFVDYRIEENIPSFVWNVPLLPQLYSLLVPADNDEQVRHRITGWELDVSHDFARSGLLPVVAISKTSKKLLGGGTLTASYDAAAREAGVSLSRKGVSVGARVARAEGAAGGLSAGWGRPSIHVAVEPLGLLQ 363 T 1 Thyroglobulin_1 pdbpssm F Eukaryota T 7vcf 9 I K A0A2K3E4D9_CHLRE Toc10 MKLVKTVSKLAGAAVGMLPAGQAGLAVKVALGVAFAFWWTSGPGADEEMDAKAQQEPDRRSQYTRHYAFKGRGRKEFLRSDMKNDANELVPTRGAAGL 98 T 0.13 DUF6479 pdbpercent F Eukaryota T 7vcf 10 J M A8J1J3_CHLRE Tic12 MDEEPPFNLALNVYKGPASIPHASAEVFGAFFLATNTALLAHMFPGKLFGSELHVRKWDPDYLASCCNEQGMRREALSGKKPNLWLLGGGPRLVNDSWERMWWNNLHWKRWKVPRTGPAFPQDMYWQ 127 T 17 TetR_C_18 pdbhh F Eukaryota T 7vcf 11 K N Unknown fragment KFIFWAAMVYATLYGNYE 18 T 2.7 FixP_N pdbhh F T 7vcf 12 L O A0A2K3DWN5_CHLRE Tic35 MQLGQLRQPLRACQDQRLTRGVPLARRQLVVVSNWNPLGGKGGGNSKDKEDAARRALEQSLGQKKFGADASKKTPAAKPAEPSKPAGEDASKNPLQNLFGGGGPKPPAGGGGGGGGDGGGGFFSGGNAEQPGGEEPIQDELLKLLRGGWVLLSNLALFLVFSSFLHRSLNWFVQTELLVAVGAPQQAGERVVGKFFEAIEWVERNILGWKLPGDEEAEDATSKVYEVLQNYTPAEAAYSFAQLKYKDLTHKERELFHKAYALRHFERRDGRPGDVDAAELQAVKDRLDPLEADRRAYAAAKAAGRLDEYWAAPGREATYQRIVGAPRIA 329 T 0.016 DUF2878 pdbpercent F Eukaryota T 7vcl 2 B B BKRF4_EBVB9 Tegument protein BKRF4 GLPGSDTDESDYSDEDEEIDLEEEYPSDEDPSEGSDSDPSWHPSDSDESDYSESDEDEATPGSQASRSSR 70 T 0.033 Nop14 unppercent T Viruses T 7vcn 1 A,C C,D D4FSQ3_STROR PITA EPQTTLHKTITPISGQDDKYELSLDITSKL 30 T 0.13 PA-IL unppssm F Bacteria T 7vcq 4 J K BKRF4_EBVB9 Tegument protein BKRF4 GPLGSDTDESDYSDEDEEIDLEEEYPSDEDPSEGSDSDPSWHPSDSDESDYSESDEDEATPGSQASRSSR 70 T 0.048 Nop14 unp T Viruses T 7vcr 1 A,C C,D D4FSQ3_STROR PITA EPQTTLHKTITPISGQDDKYELSLDITSKL 30 T 0.13 PA-IL unppssm F Bacteria T 7vd5 21 RA,U w,W A0A679C6E8_9STRA Photosystem II reaction center protein W TEGTNEWFGVDDLRLLAVLFLGHWAILSLWLGSYGDSNEDEDFFGEIDYSAR 52 T 0.0021 PsbW unppssm F Eukaryota T 7vd5 22 SA,V 5,0 Unknown protein 0 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 31 F F F 7vd5 23 TA,W 6,1 Unknown protein 1 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 30 F F F 7vdv 13 P U unknown XXXXXXX 7 F F F 7vdv 19 X M unknown XXXXXXXXXXXXXXXX 16 F F F 7vec 2 M,N,O,P,Q,R,S,T,U,V,W M,N,O,P,Q,R,S,T,U,V,X TX264_HUMAN TEX264 phospho-LIR SSFEELDLY 9 T 2.9 DRMBL pdbhh F Eukaryota T 7veg 1 A,B,C A,B,C peptide PPGPPGPPGPQGFPGPPGPPGP 22 T 0.0013 Collagen pdb F F 7veh 1 A,B,C,D,E,F,G,H A,B,C,D,E,F,G,H A0A3A9QXE8_MORCA AcrIF13 AGSMKLLNIKINEFAVTANTEAGDELYLQLPHTPDSQHSINHEPLDDDDFVKEVQEICDEYFGKGDRTLARLSYAGGQAYDSYTEEDGVYTTNTGDQFVEHSYADYYNVEVYCKADLV 118 T 0.019 DUF1882 unppssm F Bacteria T 7vf2 2 B B ZC3HD_HUMAN Zinc finger CCCH domain-containing protein 13 PVATATATTVPATLAATTAAAATSFSTSAITISTSATPTNTTNNTFANEDSHRKCHRTRVEKVETPHVTIEDAQHRKPMDQKRSSSLGSNRSNRSHTSGRLRSPSNDSAHRSGDDQSGRKRVLHSGSRDREKTKSLEITGERKSRIDQLKRGEPSRSTSSDRQDSRSHSSRRSSPESDRQVHSRSGSFDSRDRLQERDRYEHDRERERERRDTRQREWDRDADKDWPRNRDRDRLRERERERERDKRRDLDRERERLISDSVERDRDRDRDRTFESSQIESVKRCEAKLEGEHERDLESTSRDSLALDKERMDKDLGSVQGFEETNKSERTESLEGDDESKLDDAHSLGSGAGEGYEPISDDELDEILAGDAEKREDQQDEEKMPDPLDVIDVDWSGLMPKHPKEPREPGAALLKFTPGAVMLRVGISKKLAGSELFAKVKETCQRLLEKPKDADNLFEHELGALNMAALLRKEERASLLSNLGPCCKALCFRRDSAIRKQLVKNEKGTIKQAYTSAPMVDNELLRLSLRLFKRKTTCHAPGHEKTEDNKLSQSSIQQELCVS 563 T 0.35 Fimbrillin_C pdbpercent F Eukaryota T 7vf6 1 A,B A,B A0A7L7SI10_9CAUD PurA-like adenylosuccinate synthetase MGSAIDVIVGGQFGSEAKGRVTLERVQHWADNGHAVASMRVAGPNAGHVVWDQGHRFAMRSLPVGFVDPGTDLYIAAGSEVDIEVLQQEVDLVESYGYEVRDRLYIHPQATWLEPVHRDREASSTLTAKVGSTSKGIGAARSDRIWRVANLVGDNPAFQELGRVSDFTEDLRSELVDGSLALVIEGTQGYGLGLHAGHYPQCTSSDARAIDFLAMAGINPWDLSREDLAAHGFRIHVVIRPFPIRVAGNSGELSGETSWDELGLEAERTTVTNKIRRVGQFDPELVRRAVLANGVNNVKIHLSMADQLIPQLAGLEDLPEGWRESEYAGRLREFIDQIPFNERLVSLGTGPHTRIELFKENLYFQLE 367 T 1.6E-54 Adenylsucc_synt unppssm T Viruses T 7vfi 2 C C H31_MOUSE ARG-ARG-TYR-GLN-LYS-SER-THR-GLU-LEU RRYQKSTEL 9 T 13 SAP30_Sin3_bdg pdbhh F Eukaryota T 7vfl 2 C,D D,G SER-ASP-SER-ASP-SER-ASP-SER-ASP DSDSDSDSD 9 T 2.8 DUF4196 pdbhh F F 7vfm 2 C,D E,G SER-ASP-SER-ASP DSDSD 5 T 110 PNISR pdbhh F F 7vfn 2 C B ASP-SER-ASP DSD 3 T 180 Glyco_trans_2_3 pdbhh F F 7vfx 6 F C Peptide agonist fMIFL MIFL 4 T 130 RAG2_PHD pdbhh F F 7vgm 1 A A Q818B4_BACCR Phenylalanine-4-hydroxylase MTKKTEIPSHLKPFVSTQHYDQYTPVNHAVWRYIMRQNHSFLKDVAHPAYVNGLQSSGINIDAIPKVEEMNECLAPSGWGAVTIDGLIPGVAFFDFQGHGLLPIATDIRKVENIEYTPAPDIVHEAAGHAPILLDPTYAKYVKRFGQIGAKAFSTKEEHDAFEAVRTLTIVKESPTSTPDEVKAAENAVIEKQNLVSGLSEAEQISRLFWWTVEYGLIGNIDDPKIYGAGLLSSVGESKHCLTDAVEKVPFSIEACIGTTYDVTKMQPQLFVCESFEELTDALETFSKTMAFKTGGKEGLEKAIRSENYATAELNSGLQITGTFSETIENDAGELIYMRTNSPTALALHNKQLANHSTSVHSDGFGTPIGLLTENIALENCTDEQLQSLGITIGTIAEFTFASGIHVKGTVTDIVKNDKKIALISFIDCTVTYNARVLFDASWGAFDMAVGSQITSVFPGAADAAAFFPMDEEVHEIPAPLVLNELERMYQTVRDIRSEGILHDAHIDQLIAIQEVLNKFYAKEWLLRLEVLELLLEHNKGHETSAALLHQLSTFTTDEAVTRLINNGLALLPVKDVKNDAKINLEHHHHHH 592 T 7.7E-39 Biopterin_H unppssm F Bacteria T 7vhc 3 G G inhibitor peptide, ALA-ARG-ARG-ARG-ARG-ALA ARRRRAX 7 T 1.3 Carla_C4 pdbhh F F 7vhd 3 G G ARG-ARG-ARG-ARG-ALA RRRRAX 6 T 66 BRD4_CDT pdbhh F F 7vhe 3 G G RRRA peptide RRRAX 5 T 210 DUF3042 pdbhh F F 7vhf 3 G G RRA peptide RRAX 4 T 380 Malate_DH pdbhh F F 7vi4 1 A A TIA1_HUMAN RNA-BINDING PROTEIN TIA-1,T-CELL-RESTRICTED INTRACELLULAR ANTIGEN-1,TIA-1,P40-TIA-1 GYRVTGYETQ 10 T 1.1 DUF3520 pdbhh F Eukaryota T 7vi5 1 A A TIA1_HUMAN RNA-BINDING PROTEIN TIA-1,T-CELL-RESTRICTED INTRACELLULAR ANTIGEN-1,TIA-1,P40-TIA-1 GYRVAGYETQ 10 T 1.3 DUF3520 pdbhh F Eukaryota T 7viv 1 A,B A,B A0A2X0RU36_ASF I73R CDS PROTEIN,I73R PROTEIN METQKLISMVKEALEKYQYPLTAKNIKVVIQKEHNVVLPTGSINSILYSNSELFEKIDKTNTIYPPLWIRKN 72 T 0.012 HARE-HTH pdbpercent T Viruses T 7vlm 1 A A A0A2G3NPZ8_STRMC H2C7 MKIDTTVTEVKENGKTYLRLLKGNEQLKAVSDKAVAGVNLFPGAKIGSFLVRQDNIVVFPDNKGEFDLDFFNLLNDNFETLVEYAKMADCLDIAFDINEKSYFNMIMWLMKNIDENWSQSPYGESFYSSKDIDWGYKPEGSLRVSDHWNFGQDGEHCPTAEPVDGWAVCKFENGKYHLIKKF 182 T 4.2 PhoU_div unppercent F Bacteria T 7vlz 2 B B A0A454B2U5_VIBHA Peptide P1 EPSQQVTEIYQHHA 14 T 16 Inhibitor_I34 pdbhh F Bacteria T 7vlz 3 C C A0A454B2U5_VIBHA Peptide P2 DYAPTKLLPQQP 12 T 9.5 DUF724 pdbhh F Bacteria T 7vmb 2 B B IQEC1_HUMAN ADP-RIBOSYLATION FACTORS GUANINE NUCLEOTIDE-EXCHANGE PROTEIN 100,ADP-RIBOSYLATION FACTORS GUANINE NUCLEOTIDE-EXCHANGE PROTEIN 2,BREFELDIN-RESISTANT ARF-GEF 2 PROTEIN,BRAG2 GPGSEFLSESYELSSDLQDKQVEMLERKYGGRLVTRHAARTIQTAFRQYQMNKNFERLRSSMSENRMSRRIVLS 74 T 0.0085 Protamine_P2 pdbpssm F Eukaryota T 7vmc 3 C C A0A2A3ULE6_ECOLX Contact-dependent inhibitor I MKLTVDSVINEPRSVAITIDGYIPVDIKIIDSKKLPPLYWRGGDGKKNLLELAVLPENGFLSSITLVMIASDSIHKTDSLSVSLPSSECGVPVVNTKLWSHSESDDFSRRFVDDFSLDIEVIISSESMLLTIGENKKVTSWIKCSDNFYLGIDAGRNVVHLYLDKLTPSEVESFFEAVG 179 T 1.4 DUF2283 pdbhh F Bacteria T 7vmt 1 A,B,C,D,E,F A,B,C,D,E,F MGT4A_MOUSE Alpha-1,3-mannosyl-glycoprotein 4-beta-N-acetylglucosaminyltransferase A soluble form GPLGNPPAEVSTSLKVYQGHTLEKTYMGEDFFWAITPTAGDYILFKFDKPVNVESYLFHSGNQEHPGAILLNTTVDVLPLKSDSLEISKETKDKRLEDGYFRIGKFEYGVAEGIVDPGLNPISAFRLSVIQNSAVWAILNEIHIKKVTS 149 T 0.37 NADase_NGA pdbhh F Eukaryota T 7vmw 2 C C substrate peptide XGAHTIX 7 T 160 DUF4399 pdbhh F T 7vny 7 GA Y U5NME9_CERS4 Rsp_7571 Protein-Y PufY MPEVSEFAFRLMMAAVIFVGVGIMFAFAGGHWFVGLVVGGLVAAFFAATPNSN 53 T 0.05 TrbC pdbpercent F Bacteria T 7vop 13 AA,EA a,e Q9PVZ2_XENLA Nucleoporin CAN MEDDTDLPPERETKDFQFRQLKKVRLFDYPADLPKQRSNLLVISNKYGLLFVGGFMGLKVFHTKDILVTVKPKENANKTVVGPQGIHVPMNSPIHHLALSSDNLTLSVCMTSAEQGSSVSFYDVRTLLNESKQNKMPFASCKLLRDPSSSVTDLQWNPTLPSMVAVCLSDGSISVLQVTDTVSVFANLPATLGVTSVCWSPKGKQLAVGKQNGTVVQYLPSLQEKKVIPCPSFYDSDNPVKVLDVLWLSTYVFTVVYAAADGSLEASPQLVIVTLPKKEDKRAERFLNFTETCYSICSERQHHFFLNYIEDWEILLAASAASVDVGVIARPPDQVGWEQWLLEDSSRAEMPMTENNDDTLPMGVALDYTCQLEVFISESQILPPVPVLLLLSTDGVLCPFHVVNLNQGVKPLTTSPEQLSLDGEREMKVVGGTAVSTPPAPLTSVSAPAPPASAAPRSAAPPPYPFGLSTASSGAPTPVLNPPASLAPAATPTKTTSQPAAAATSIFQPAGPAAGSLQPPSLPAFSFSSANNAANASAPSSFPFGAAMVSSNTAKVSAPPAMSFQPAMGTRPFSLATPVTVQAATAPGFTPTPSTVKVNLKDKFNASDTPPPATISSAAALSFTPTSKPNATVPVKSQPTVIPSQASVQPNRPFAVEAPQAPSSVSIASVQKTVRVNPPATKITPQPQRSVALENQAKVTKESDSILNGIREEIAHFQKELDDLKARTSRACFQVGSEEEKRQLRTESDGLHSFFLEIKETTESLRGEFSAMKIKNLEGFASIEDVQQRNKLKQDPKYLQLLYKKPLDPKSETQMQEIRRLNQYVKNAVQDVNDVLDLEWDQYLEEKQKKKGIIIPERETLFNSLANHQEIINQQRPKLEQLVENLQKLRLYNQISQWNVPDSSTKSFDVELENMQKTLSQTAIDTQTKPQAKLPAKISPVKQSQLRNFLSKRKTPPVRSLAPANLSRSAFLAPSFFEDLDDVSSTSSLSDMADNDNRNPPPKEIERQETPPPESTPVRVPKHAPVARTTSVQPGLGTASLPFQSGLHPATSTPVAPSQSIRVIPQGADSTMLATKTVKHGAPNITAAQKAAVAAMRRQTASQIPAASLTESTLQTVPQVVNVKELKNNGPGPTIPTVIGPTVPQSAAQVIHQVLATVGSVSARQAAPAAPLKNPPASASSIAPQTWQGSAPNKPAAQAIPKSDPSASQAPAPSVSQVNKPVSFSPAAGGFSFSNVTSAPVTSALGSSSAGCAATARDSNQASSYMFGGTGKSLGSEGSFSFASLKPASSSSSSSVVEPTMSKPSVVTAASTTATVTSTTAASSKPGEGLFQGFSGGETLGSFSGLRVGQADEASKVEVAKTPTAAQPVKLPSNPVLFSFAGAPQPAKVGEAPSTTSSTSASLFGNVQLASAGSTASAFTQSGSKPAFTFGIPQSTSTTAGASSAIPASFQSLLVSAAPATTTPSAPINSGLDVKQPIKPLSEPADSSSSQQQTLTTQSAAEQVPTVTPAATTATALPPPVPTIPSTAEAKIEGAAAPAIPASVISSQTVPFTSTVLASQTPLASTPAGGPTSQVPVLVTTAPPVTTESAQTVSLTGQPVAGSSAFAQSTVTAASTPVFGQALASGAAPSPFAQPTSSSVSTSANSSTGFGTSAFGATGGNGGFGQPSFGQAPLWKGPATSQSTLPFSQPTFGTQPAFGQPAASTATSSAGSLFGCTSSASSFSFGQASNTSGTSTSGVLFGQSSAPVFGQSAAFPQAAPAFGSASVSTTTTASFGFGQPAGFASGTSGSLFNPSQSGSTSVFGQQPASSSGGLFGAGSGGASTVGLFSGLGAKPSQEAANKNPFGSPGSSGFGSAGASNSSNLFGNSGAKAFGFGGTSFGDKPSATFSAGGSVASQGFSFNSPTKTGGFGAAPVFGSPPTFGGSPGFGGSPAFGTAAAFSNTLGSTGGKVFGEGTSAATTGGFGFGSNSSTAAFGSLATQNTPTFGSISQQSPGFGGQSSGFSGFGAGPGAAAGNTGGFGFGVSNPTSPGFGCWRS 2037 T 0.00071 NUP214 pdbpercent F Eukaryota T 7vor 7 GA,NB Y,y U5NME9_CERS4 Rsp_7571 Protein-Y PufY MPEVSEFAFRLMMAAVIFVGVGIMFAFAGGHWFVGLVVGGLVAAFFAATPNSN 53 T 0.05 TrbC pdbpercent F Bacteria T 7vot 7 GA,NB Y,y U5NME9_CERS4 Rsp_7571 Protein-Y PufY MPEVSEFAFRLMMAAVIFVGVGIMFAFAGGHWFVGLVVGGLVAAFFAATPNSN 53 T 0.05 TrbC pdbpercent F Bacteria T 7vpg 1 A,C,E,G A,C,E,G RAE1L_HUMAN RAE1 PROTEIN HOMOLOG,MRNA-ASSOCIATED PROTEIN MRNP 41 MSLFGTTSGFGTSGTSMFGSATTDNHNPMKDIEVTSSPDDSIGCLSFSPPTLPGNFLIAGSWANDVRCWEVQDSGQTIPKAQQMHTGPVLDVCWSDDGSKVFTASCDKTAKMWDLSSNQAIQIAQHDAPVKTIHWIKAPNYSCVMTGSWDKTLKFWDTRSSNPMMVLQLPERCYCADVIYPMAVVATAERGLIVYQLENQPSEFRRIESPLKHQHRCVAIFKDKQNKPTGFALGSIEGRVAIHYINPPNPAKDNFTFKCHRSNGTNTSAPQDIYAVNGIAFHPVHGTLATVGSDGRFSFWDKDARTKLKTSEQLDQPISACCFNHNGNIFAYASSYDWSKGHEFYNPQKKNYIFLRNAAEELKPRNKKHHHHHHHHHH 378 T 0.00011 WD40 unppercent F Eukaryota T 7vpg 2 B,D,F,H B,D,F,H NUP98_HUMAN Isoform 3 of Nuclear pore complex protein Nup98-Nup96 MHHHHHHHHHHTGTTIKFNPPTGTDTMVKAGVSTNISTKHQCITAMKEYESKSLEELRLEDYQANRK 67 T 0.045 DUF5023 pdbpssm F Eukaryota T 7vph 1 A,C,E,G A,C,E,G RAE1L_HUMAN RAE1 PROTEIN HOMOLOG,MRNA-ASSOCIATED PROTEIN MRNP 41 MSLFGTTSGFGTSGTSMFGSATTDNHNPMKDIEVTSSPDDSIGCLSFSPPTLPGNFLIAGSWANDVRCWEVQDSGQTIPKAQQMHTGPVLDVCWSDDGSKVFTASCDKTAKMWDLSSNQAIQIAQHDAPVKTIHWIKAPNYSCVMTGSWDKTLKFWDTRSSNPMMVLQLPERCYCADVIYPMAVVATAERGLIVYQLENQPSEFRRIESPLKHQHRCVAIFKDKQNKPTGFALGSIEGRVAIHYINPPNPAKDNFTFKCHRSNGTNTSAPQDIYAVNGIAFHPVHGTLATVGSDGRFSFWDKDARTKLKTSEQLDQPISACCFNHNGNIFAYASSYDWSKGHEFYNPQKKNYIFLRNAAEELKPRNKKHHHHHHHHHH 378 T 0.00011 WD40 unppercent F Eukaryota T 7vph 2 B,D,F,H B,D,F,H NUP98_HUMAN Isoform 3 of Nuclear pore complex protein Nup98-Nup96 MHHHHHHHHHHTGTTIKFNPPTGTDTMVKAGVSTNISTKHQCITAMKEYESKSLEELRLEDYQANRK 67 T 0.045 DUF5023 pdbpssm F Eukaryota T 7vpw 1 A B BAP1_HUMAN BRCA1-associated protein 1 (BAP1) IGRLHKQRKPDRRKRSRPY 19 T 0.74 DUF3734 unppercent F Eukaryota T 7vqh 1 A A VAL-ALA-ARG-GLY-TRP-GLY-ARG-LYS-CYS-PRO-LEU-PHE-GLY-LYS-ASN-LYS-SER-ARG (VR18) VARGWGRKCPLFGKNKSR 18 T 0.0017 Flavi_glycoprot pdbhh F T 7vqi 1 A A VAL-ALA-ARG-GLY-TRP-GLY-ARG-LYS-CYS-PRO-LEU-PHE-GLY-LYS-ASN-LYS-SER-ARG (VR18) VARGWGRKCPLFGKNKSR 18 T 0.0017 Flavi_glycoprot pdbhh F T 7vqp 2 B C MED1_HUMAN Mediator of RNA polymerase II transcription subunit 1 KNHPMLMNLLKDN 13 T 3.8 Gb3_synth pdbhh F Eukaryota T 7vrc 1 A,C A,C SNF11_YEAST SWI/SNF COMPLEX COMPONENT SNF11 MSSEIAYSNTNTNTENENRNTGAGVDVNTNANANANATANATANATANATAELNLPTVDEQRQYKVQLLLHINSILLARVIQMNNSLQNNLQNNINNSNNNNIIRIQQLISQFLKRVHANLQCISQINQGVPSAKPLILTPPQLANQQQPPQDILSKLYLLLARVFEIW 169 T 0.045 SSXT pdbhh F Eukaryota T 7vsx 1 A A LUCI_OPLGR QLnK EFFTLEDFVGDWRQTAGYNQDQVLEQGGLSSLFQNLGVSVTPIQRIVLSGENGLKIDIHVIIPYEGLSGDQMGQIEKIFKVVYPVDDHHFKVILHYGTLVIDGVTPNMIDYFGRPYEGIAVFDGKKITVTGTLWNGNKIIDERLINPDGSLLFRVTINGVTGWRLCERILA 171 T 0.023 Lipocalin_7 pdbhh F Eukaryota T 7vt4 1 A,B A,B A0A399DY85_9DEIN Endoglucanase H MGCQSTQLQTPAPDTGGIVELNRQLGRGVNLGNALEAPWEGAWGVRLEEGFFELIREAGFKTIRLPVSWTHHAGRAAPYTIDPAFFSRVDWAVTQATRRGLNIVVNVHHYDELNANPQAEEARYLSIWRQIAERYRNQPGSVYFELLNEPHGRFNDNPQLWNDLLAKALRVVRESNPSRAVIVGPVGWNSLWRLSELRLPDDPNLIVTFHYYDPLEFTHQGAEWLNPVPPTGVVWRENQGAFAAGWQNWSWGSRVGFVGEALEITYQEGWAGFYLHSDAGVEGYDRLAFRTSAPVSLQVSCRRDAPAKAVTTSGGVETVVNLSECGNPSRLTDLILQNNSPNARAAFRLERLELRGPGSPLALLTHQQNAIAQAMEFAQRWAEQNRRPIFVGQFGAYEKGDLDSRVRWTGAVRSELEKRNFSWAYWEFAAGFGIYDRTTRQWRTPLLKALVPEQPKLAAALEHHHHHH 468 T 6.3E-06 Cellulase pdbpercent F Bacteria T 7vt8 1 A A A0A399DY85_9DEIN Endoglucanase H MGCQSTQLQTPAPDTGGIVELNRQLGRGVNLGNALEAPWEGAWGVRLEEGFFELIREAGFKTIRLPVSWTHHAGRAAPYTIDPAFFSRVDWAVTQATRRGLNIVVNVHHYDELNANPQAEEARYLSIWRQIAERYRNQPGSVYFELLNEPHGRFNDNPQLWNDLLAKALRVVRESNPSRAVIVGPVGWNSLWRLSELRLPDDPNLIVTFHYYDPLEFTHQGAEWLNPVPPTGVVWRENQGAFAAGWQNWSWGSRVGFVGEALEITYQEGWAGFYLHSDAGVEGYDRLAFRTSAPVSLQVSCRRDAPAKAVTTSGGVETVVNLSECGNPSRLTDLILQNNSPNARAAFRLERLELRGPGSPLALLTHQQNAIAQAMEFAQRWAEQNRRPIFVGEFGAYEKGDLDSRVRWTGAVRSELEKRNFSWAYWEFAAGFGIYDRTTRQWRTPLLKALVPEQPKLAAALEHHHHHH 468 T 7.6E-06 Cellulase pdb F Bacteria T 7vti 1 A A A0A660UUL5_9BACT Cas13bt3 GGMAQVSKQTSKKRELSIDEYQGARKWCFTIAFNKALVNRDKNDGLFVESLLRHEKYSKHDWYDEDTRALIKCSTQAANAKAEALANYFSAYRHSPGCLTFTAEDELRTIMERAYERAIFECRRRETEVIIEFPSLFEGDRITTAGVVFFVSFFVERRVLDRLYGAVSGLKKNEGQYKLTRKALSMYCLKDSRFTKAWDKRVLLFRDILAQLGRIPAEAYEYYHGEQGDKKRANDNEGTNPKRHKDKFIEFALHYLEAQHSEICFGRRHIVREEAGAGDEHKKHRTKGKVVVDFSKKDEDQSYYISKNNVIVRIDKNAGPRSYRMGLNELKYLVLLSLQGKGDDAIAKLYRYRQHVENILDVVKVTDKDNHVFLPRFVLEQHGIGRKAFKQRIDGRVKHVRGVWEKKKAATNEMTLHEKARDILQYVNENCTRSFNPGEYNRLLVCLVGKDVENFQAGLKRLQLAERIDGRVYSIFAQTSTINEMHQVVCDQILNRLCRIGDQKLYDYVGLGKKDEIDYKQKVAWFKEHISIRRGFLRKKFWYDSKKGFAKLVEEHLESGGGQRDVGLDKKYYHIDAIGRFEGANPALYETLARDRLCLMMAQYFLGSVRKELGNKIVWSNDSIELPVEGSVGNEKSIVFSVSDYGKLYVLDDAEFLGRICEYFMPHEKGKIRYHTVYEKGFRAYNDLQKKCVEAVLAFEEKVVKAKKMSEKEGAHYIDFREILAQTMCKEAEKTAVNKVARAFFAHHLKFVIDEFGLFSDVMKKYGIEKEWKFPVK 777 T 0.026 Perilipin unppercent F Bacteria T 7vtj 3 C C VQIIYK peptide VQIIYK 6 T 74 30K_MP_core pdbhh F F 7vtn 1 A A A0A660UUL5_9BACT Cas13bt3 GGMAQVSKQTSKKRELSIDEYQGARKWCFTIAFNKALVNRDKNDGLFVESLLRHEKYSKHDWYDEDTRALIKCSTQAANAKAEALANYFSAYRHSPGCLTFTAEDELRTIMERAYERAIFECRRRETEVIIEFPSLFEGDRITTAGVVFFVSFFVERRVLDRLYGAVSGLKKNEGQYKLTRKALSMYCLKDSRFTKAWDKRVLLFRDILAQLGRIPAEAYEYYHGEQGDKKRANDNEGTNPKRHKDKFIEFALHYLEAQHSEICFGRRHIVREEAGAGDEHKKHRTKGKVVVDFSKKDEDQSYYISKNNVIVRIDKNAGPRSYRMGLNELKYLVLLSLQGKGDDAIAKLYRYRQHVENILDVVKVTDKDNHVFLPRFVLEQHGIGRKAFKQRIDGRVKHVRGVWEKKKAATNEMTLHEKARDILQYVNENCTRSFNPGEYNRLLVCLVGKDVENFQAGLKRLQLAERIDGRVYSIFAQTSTINEMHQVVCDQILNRLCRIGDQKLYDYVGLGKKDEIDYKQKVAWFKEHISIRRGFLRKKFWYDSKKGFAKLVEEHLESGGGQRDVGLDKKYYHIDAIGRFEGANPALYETLARDRLCLMMAQYFLGSVRKELGNKIVWSNDSIELPVEGSVGNEKSIVFSVSDYGKLYVLDDAEFLGRICEYFMPHEKGKIRYHTVYEKGFRAYNDLQKKCVEAVLAFEEKVVKAKKMSEKEGAHYIDFREILAQTMCKEAEKTAVNKVARAFFAHHLKFVIDEFGLFSDVMKKYGIEKEWKFPVK 777 T 0.026 Perilipin unppercent F Bacteria T 7vu5 1 A,B A,B CD28_HUMAN TP44 GPSKPFWVLVVVGGVLAFYSLLVTVAFIIFWVRSKRSRLLH 41 T 0.0099 WBP-1 unppssm F Eukaryota T 7vu7 1 A,B A,B A0A4Y2M0V6_ARAVE Flagelliform fibroin GGQPSGGVLPGGSYTPAAGGSSRLPSLINGIMSSMQGGGFNYQNFGNVLSQFATGTGTCNSNDLNLLMDALLSALHTLSYQGMGTVPSYPSPSAMSAYSQSVRRCFGY 108 T 0.0068 Spidroin_MaSp pdb F Eukaryota T 7vu9 1 A,B,C,D,E,F A,B,C,D,E,F A0A384E107_9AGAR Lectin (PhoSL) APVPVTKLVCDGDTYKCTAYLDYGDGKWVAQWDTAVFHTT 40 T 0.069 C2-set pdbhh F Eukaryota T 7vul 1 A,B,C A,B,C A0A7S6TZU4_9CAUD P560 DEPOLYMERASE MGSSHHHHHHSSGLVPRGSHMLNNLNQPKGSTIGVLKDGRTIQQAIDGLENPVHYVKDVSITPSALLAVAVEAARLGRTVAFGPGHYTNQGQPFEVDFPLNLDVPVGTFLDFPIIIRGKTVKMVRSVTTNLTAAQCPAGTTVIAGDFSAFPVGSVVGVKLGDNTNGSASYNNEAGWDFTTVAASSNTSITLSTGLRWAFDKPEVFTPEYAVRYSGQLSRSSYFIPGDYTSGLNVGDIIRVENIDGTDGVHGNKEYFEMLKVSSIDSSGITVETRLRYTHVNPWIVKTGLVKGSSVTGGGRLKRLEVRGVDTPKVNSVDVDRLIVGLCYNIDVGEITSRGVGEPSSVNFTFCFGRGFLYNVRASGSVSTTDNSALKLMSCPGLIINNCSPHNSTSTGSQGDYGFYVDAYYSPYWCWNDGMSINGIVTETPRSAVTRALWLFGLRGCSVSNLSGAQVFLQGCAKSVFSNIVTPDNLLELRDLSGCIVSGMANNALVLGCWNSTFDLTLFGIGSGSNLNIALRAGAGVTHPETGVPTTLGKNNTFNVKSFSPSSLAVTLSIAQQERPIFGAGCVDVDSANKSVALGSNVTVPTMLPLALTKGIDSGSGWVGGRTKGGIWFDGNYRDAAVRWNGQYVWVADNGSLKAAPTKPDSDSPSNGVVIGPLE 663 T 0.11 E1_FCCH unppssm T Viruses T 7vuo 1 A A KCNQ1_HUMAN Kv7.1 FNRQIPAAASLIQTAWRCYAAENPDSSTWKIYIRISQLREHHRATIKVIRRMQYFVAKKKFQQARIGSG 69 T 0.014 DUF5546 pdbpssm F Eukaryota T 7vv8 2 B C SIN1_HUMAN TORC2 SUBUNIT MAPKAP1,MITOGEN-ACTIVATED PROTEIN KINASE 2-ASSOCIATED PROTEIN 1,STRESS-ACTIVATED MAP KINASE-INTERACTING PROTEIN 1,SAPK-INTERACTING PROTEIN 1,MSIN1 TSKESLFVRINAAHGFSLIQVDNTKVTMKEILLKAVKRRKGSQKVSGPQYRLEKQSEPNVAVDLDSTLESQSAWEFCLVRENSSRAD 87 T 0.051 E3_UbLigase_RBR unp F Eukaryota T 7vv9 2 B C SIN1_HUMAN TORC2 SUBUNIT MAPKAP1,MITOGEN-ACTIVATED PROTEIN KINASE 2-ASSOCIATED PROTEIN 1,STRESS-ACTIVATED MAP KINASE-INTERACTING PROTEIN 1,SAPK-INTERACTING PROTEIN 1,MSIN1 SKESLFVRINAAHGFSLIQVDNTKVTMKEILLKAVKRRKGSQKVSGPQYRLEKQSEPNVAVDLDSTLESQSAWEFCLVRENSSRAD 86 T 0.051 E3_UbLigase_RBR unp F Eukaryota T 7vvd 1 A,C A,D KCNQ1_HUMAN IKS PRODUCING SLOW VOLTAGE-GATED POTASSIUM CHANNEL SUBUNIT ALPHA KVLQT1,KQT-LIKE 1,VOLTAGE-GATED POTASSIUM CHANNEL SUBUNIT KV7.1,IKS PRODUCING SLOW VOLTAGE-GATED POTASSIUM CHANNEL SUBUNIT ALPHA KVLQT1,KQT-LIKE 1,VOLTAGE-GATED POTASSIUM CHANNEL SUBUNIT KV7.1 FNRQIPAAASLIQTAWRCYAAENPDSSTWKIYIRISQLREHHRATIKVIRRMQYFVAKKKFQQARIGSG 69 T 0.014 DUF5546 pdbpssm F Eukaryota T 7vvg 2 B C SIN1_HUMAN TORC2 SUBUNIT MAPKAP1,MITOGEN-ACTIVATED PROTEIN KINASE 2-ASSOCIATED PROTEIN 1,STRESS-ACTIVATED MAP KINASE-INTERACTING PROTEIN 1,SAPK-INTERACTING PROTEIN 1,MSIN1 TSKESLFVRINAAHGFSLIQVDNTKVTMKEILLKAVKRRKGSQKVSGPQYRLEKQSEPNVAVDLDSTLESQSAWEFCLVRENSSRAD 87 T 0.051 E3_UbLigase_RBR unp F Eukaryota T 7vvh 1 A A KCNQ1_HUMAN IKS PRODUCING SLOW VOLTAGE-GATED POTASSIUM CHANNEL SUBUNIT ALPHA KVLQT1,KQT-LIKE 1,VOLTAGE-GATED POTASSIUM CHANNEL SUBUNIT KV7.1,IKS PRODUCING SLOW VOLTAGE-GATED POTASSIUM CHANNEL SUBUNIT ALPHA KVLQT1,KQT-LIKE 1,VOLTAGE-GATED POTASSIUM CHANNEL SUBUNIT KV7.1 FNRQIPAAASLIQTAWRCYAAENPDSSTWKIYIRISQLREHHRATIKVIRRMQYFVAKKKFQQARIGSG 69 T 0.014 DUF5546 pdbpssm F Eukaryota T 7vwl 1 A Q A0A4X1VKC6_PIG NADH-UBIQUINONE OXIDOREDUCTASE 49 KDA SUBUNIT,COMPLEX I-49KD ARQWQPDVEWAEQFGGAVMYPTKETAHWKPPPWNDVDPPK 40 T 14 eIF_4G1 pdbhh F Eukaryota T 7vwo 2 G,H,I H,D,J VPB43_MYCTU Antitoxin VapB43 YRVQPSGKGGLRPGVDLSSNAALAEAMN 28 T 0.81 TetR_C_6 unp F Bacteria T 7vwv 1 A,B A,B A0A2X0RU36_ASF I73R CDS PROTEIN,I73R PROTEIN MAHHHHHHEFMETQKLISMVKEALEKYQYPLTAKNIKVVIQKEHNVVLPTGSINSILYSNSELFEKIDKTNTIYPPLWIRKN 82 T 0.012 HARE-HTH unppercent T Viruses T 7vxs 1 A Q F1S1A8_PIG NADH-UBIQUINONE OXIDOREDUCTASE 49 KDA SUBUNIT,COMPLEX I-49KD ARQWQPDVEWAEQFGGAVMYPTKETAHWKPPPWNDVDPPKDTLV 44 T 13 eIF_4G1 pdbhh F Eukaryota T 7vxy 3 E E Peptide inhibitor XKXR 4 T 170 FAS_I_H pdbhh F F 7vy2 7 GA,NB U,u A0A7Z6QU05_CERSP protein-U MPEVSEFAFRLMMAAVIFVGVGIMFAFAGGHWFVGLVVGGLVAAFFAATPNSN 53 T 0.05 TrbC pdbpercent F Bacteria T 7vy9 1 A Q F1S1A8_PIG NADH-UBIQUINONE OXIDOREDUCTASE 49 KDA SUBUNIT,COMPLEX I-49KD ARQWQPDVEWAEQFGGAVMYPTKETAHWKPPPWNDVDPPKDTLV 44 T 13 eIF_4G1 pdbhh F Eukaryota T 7vye 1 A Q A0A8W4F811_PIG COMPLEX I-49KD,MITOCHONDRIAL,NADH-UBIQUINONE OXIDOREDUCTASE 49 KDA SUBUNIT ARQWQPDVEWAEQFGGAVMYPTKETAHWKPPPWNDVDPPK 40 T 14 eIF_4G1 pdbhh F Eukaryota T 7vyg 1 A Q F1S1A8_PIG NADH-UBIQUINONE OXIDOREDUCTASE 49 KDA SUBUNIT,COMPLEX I-49KD ARQWQPDVEWAEQFGGAVMYPTKETAHWKPPPWNDVDPPKDTLV 44 T 13 eIF_4G1 pdbhh F Eukaryota T 7vyi 1 A Q A0A4X1VKC6_PIG COMPLEX I-49KD,NADH DEHYDROGENASE [UBIQUINONE] IRON-SULFUR PROTEIN 2,MITOCHONDRIAL,NADH-UBIQUINONE OXIDOREDUCTASE 49 KDA SUBUNIT ARQWQPDVEWAEQFGGAVMYPTKETAHWKPPPWNDVDPPK 40 T 14 eIF_4G1 pdbhh F Eukaryota T 7vys 1 A Q F1S1A8_PIG NADH-UBIQUINONE OXIDOREDUCTASE 49 KDA SUBUNIT,COMPLEX I-49KD ARQWQPDVEWAEQFGGAVMYPTKETAHWKPPPWNDVDPPKDTLV 44 T 13 eIF_4G1 pdbhh F Eukaryota T 7vyx 1 A,B A,B Selenomethionine (SeMet)-labeled Cas12c1 D969A mutant MQTKKTHLHLISAKASRKYRRTIACLSDTAKKDLERRKQSGAADPAQELSCLKTIKFKLEVPEGSKLPSFDRISQIYNALETIEKGSLSYLLFALILSGFRIFPNSSAAKTFASSSCYKNDQFASQIKEIFGEMVKNFIPSELESILKKGRRKNNKDWTEENIKRVLNSEFGRKNSEGSSALFDSFLSKFSQELFRKFDSWNEVNKKYLEAAELLDSMLASYGPFDSVCKMIGDSDSRNSLPDKSTIAFTNNAEITVDIESSVMPYMAIAALLREYRQSKSKAAPVAYVQSHLTTTNGNGLSWFFKFGLDLIRKAPVSSKQSTSDGSKSLQELFSVPDDKLDGLKFIKEACEALPEASLLCGEKGELLGYQDFRTSFAGHIDSWVANYVNRLFELIELVNQLPESIKLPSILTQKNHNLVASLGLQEAEVSHSLELFEGLVKNVRQTLKKLAGIDISSSPNEQDIKEFYAFSDVLNRLGSIRNQIENAVQTAKKDKIDLESAIEWKEWKKLKKLPKLNGLGGGVPKQQELLDKALESVKQIRHYQRIDFERVIQWAVNEHCLETVPKFLVDAEKKKINKESSTDFAAKENAVRFLLEGIGAAARGKTDSVSKAAYNWFVVNNFLAKKDLNRYFINCQGCIYKPPYSKRRSLAFALRSDNKDTIEVVWEKFETFYKEISKEIEKFNIFSQEFQTFLHLENLRMKLLLRRIQKPIPAEIAFFSLPQEYYDSLPPNVAFLALNQEITPSEYITQFNLYSSFLNGNLILLRRSRSYLRAKFSWVGNSKLIYAAKEARLWKIPNAYWKSDEWKMILDSNVLVFDKAGNVLPAPTLKKVCEREGDLRLFYPLLRQLPHDWCYRNPFVKSVGREKNVIEVNKEGEPKVASALPGSLFRLIGPAPFKSLLDDCFFNPLDKDLRECMLIVDQEISQKVEAQKVEASLESCTYSIAVPIRYHLEEPKVSNQFENVLAIAQGEAGLAYAVFSLKSIGEAETKPIAVGTIRIPSIRRLIHSVSTYRKKKQRLQNFKQNYDSTAFIMRENVTGDVCAKIVGLMKEFNAFPVLEYDVKNLESGSRQLSAVYKAVNSHFLYFKEPGRDALRKQLWYGGDSWTIDGIEIVTRERKEDGKEGVEKIVPLKVFPGRSVSARFTSKTCSCCGRNVFDWLFTEKKAKTNKKFNVNSKGELTTADGVIQLFEADRSKGPKFYARRKERTPLTKPIAKGSYSLEEIERRVRTNLRRAPKSKQSRDTSQSQYFCVYKDCALHFSGMQADENAAINIGRRFLTALRKNRRSDFPSNVKISDRLLDNLEHHHHHH 1310 T 8.6E-05 RuvC_1 pdbhh F T 7vz8 1 A Q A0A8W4F811_PIG NADH DEHYDROGENASE [UBIQUINONE] IRON-SULFUR PROTEIN 2,MITOCHONDRIAL,NADH-UBIQUINONE OXIDOREDUCTASE 49 KDA SUBUNIT ARQWQPDVEWAEQFGGAVMYPTKETAHWKPPPWNDVDPPK 40 T 14 eIF_4G1 pdbhh F Eukaryota T 7vze 2 E,F,G,H E,F,G,H VE6_HPV16 the PDZ-binding motif of HPV16 E6 TRRETQL 7 T 0.34 FpoO unphh T Viruses F 7vzg 3 C,I E,e G2LK98_CHLTF PscE MTAILLACLFVLGGYAALWGIIKFVVANTKDIAAN 35 T 3.1 Maff2 pdbhh F Bacteria T 7vzg 4 D,J F,f G2LEN5_CHLTF PscF MWNVVGQIISVLCFFILTVGTLFGIVYVSHLLSRG 35 T 1.1 TssO pdbhh F Bacteria T 7vzg 5 E,K G,g G2LJ20_CHLTF PscG DISKVAWAWFGVLLAICLIGAFGNYVPKLFVKMLMFLN 38 T 0.32 DUF5383 pdbhh F Bacteria T 7vzg 6 F,L H,h undefined polypeptide XXXXXXXXXXXXXXXXXXX 19 F F F 7vzg 9 N D G2LHG2_CHLTF PscD' MARTPEEIVKRYKEANIWLRHWKQQIGLAKDEEQREMFTQYYEERVQEIAALEEPYRAALKILNQQESQR 70 T 0.15 Elongin_A pdb F Bacteria T 7vzm 1 A A A0A1A9KGY0_9PSED AcrIE4-F7 GMSTQYTYQQIAEDFRLWSEYVDTAGEMSKDEFNSLSTEDKVRLQVEAFGEEKSPKFSTKVTTKPDFDGFQFYIEAGRDFDGDAYTEAYGVAVPTNIAARIQAQAAELNAGEWLLVEHEA 120 T 0.02 G2F pdbpssm F Bacteria T 7vzr 3 C,I E,e G2LK98_CHLTF PscE MTAILLACLFVLGGYAALWGIIKFVVANTKDIAAN 35 T 3.1 Maff2 pdbhh F Bacteria T 7vzr 4 D,J F,f G2LEN5_CHLTF PscF MWNVVGQIISVLCFFILTVGTLFGIVYVSHLLSRG 35 T 1.1 TssO pdbhh F Bacteria T 7vzr 5 E,K G,g G2LJ20_CHLTF PscG MEGVAMEDISKVAWAWFGVLLAICLIGAFGNYVPKLFVKMLMFLN 45 T 0.46 DUF5383 pdbhh F Bacteria T 7vzr 6 F,L H,h undefined polypeptide XXXXXXXXXXXXXXXXXXX 19 F F F 7w0n 6 G,H D,E ELA_HUMAN PROTEIN ELABELA,ELA,PROTEIN TODDLER QRPVNLTMRRKLRKHNCLQRRCMPLHSRVPFP 32 T 0.031 DUF5527 unppssm F Eukaryota T 7w0o 4 D D ELA_HUMAN PROTEIN ELABELA,ELA,PROTEIN TODDLER QRPVNLTMRRKLRKHNCLQRRCMPLHSRVPFP 32 T 0.031 DUF5527 unppssm F Eukaryota T 7w0p 6 F D ELA_HUMAN PROTEIN ELABELA,ELA,PROTEIN TODDLER QRPVNLTMRRKLRKHNCLQRRCMPLHSRVPFP 32 T 0.031 DUF5527 unppssm F Eukaryota T 7w0q 2 B B POLG_CXB3N peptide VGTTLEALFQ 10 T 0.39 E2F_TDP unppercent T Viruses T 7w0s 2 B,D,F A,D,F POLG_CXB3N peptide VGTTLEALFQ 10 T 0.39 E2F_TDP unppercent T Viruses T 7w0t 2 D,E,F A,D,E POLG_CXB3N peptide VGTTLEALFQ 10 T 0.39 E2F_TDP unppercent T Viruses T 7w39 34 VA v Substrate XXXXXXKXXX 10 T 4100 EF-hand_5 pdbhh F F 7w3a 21 IA v Substrate XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 34 F F F 7w3b 32 TA v Substrate XXXXXXXXXXXXXXXXXXXXXXXXXXXX 28 F F F 7w3c 30 RA v Substrate XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 36 F F F 7w3f 30 RA v Substrate XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 36 F F F 7w3g 33 UA v Substrate XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 34 F F F 7w3h 31 SA v Substrate XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 36 F F F 7w3s 1 A,B,C,D A,B,C,D A0A2S6F197_LEGPN Type IV secretion protein Dot MKAIPPKIWFETQLKGSGLDKKFQIDELIETQSSVRVFANKKYLPDTETINEALTKVTAVNVSGDKSGYFQNGLPFPNEAGYFEKIPVGHPELLSPIERLTGSKKIVSSHSLVTASGGYPLTNPLLPYRKPIRVSIFSLAGPSFENNYLHYRLFLLDSVQKIIDSPLFSHLHDGLPIQFDEAKKELGEYDTNKLMARIRLGFPYLARFSSGGFYPSFSKSNAIIFLSEAYFRYQLEDVSLLLASVNQTGKETGKAALLKATAVGMGFFAKIDCGYDIQHIIFPYYLRAYKKLLSEHKFPWIAKIEFPIFNEIQQEQFDSIFEDYDGPTKVYRSTRDVLEFREEEIEKYLPAAINPSDAFALTGNEWGYGSVESMIGNNSSIRFDQVHHMNPLILDPSHHVEAQINKDHGVELT 413 T 0.0071 DUF4804 unppssm F Bacteria T 7w40 6 F E Bombesin XQWAVXHFX 9 T 0.14 Bombesin pdbhh F T 7w43 1 A,B,C,D,E,F,G,H,I,J,K,L A,B,C,D,E,F,G,H,I,J,K,L YJOB_BACSU Uncharacterized ATPase YjoB GSMTNIPFIYQYEEKENERAAAGYGTFGYLITRIEETLYDQYGVFYELYASDDPNTEYWELLVEDVRSGSLEPEHVAYIFEKLEKKTFAYDEDEKEPDYTVHKSIRNSVYAYPEKGVAFARIPYFQDGSIMSFDCLFAVNDEKMRAFLEGVRPRLWEKSKR 161 T 0.049 zf-NOSIP pdb F Bacteria T 7w54 1 A,B A,B Q5ZTB4_LEGPH Lpg2248 MAKTIKATGDGACLFNAVSIGLSVEILSGRLDSQLDTPGYQALLDEFAKHHPQFNPKSWKTLKEWLAYYNDTRDIELILAPVLFNLNQKYQDHLDEEILNELTNLVWKNKANIENGQAWFQLQNTGDLGEALFPKLENLDLKKDRAPLLDKLREILKDYKLELTRENVKQFLTEKAKELLSALKKKISSDPHAFQRGYSCDELKGMTDALAISLVENREEDITDNRIKIRLENQEEHWNVLCNEEDSERFLDSTPSRLKMTSLEAYRGDKQVSAPTSEQLDLIEEPGVFLRERTIDNPGLGNCAFYAFAIGLVNIIQEEAKYNRRTMFDRWVGLDRSISGQYDEILKLNLEDPDKELLDRLQSSLRIVTYQYQIRELRNVCVFRNGNYNRLTGNSNFVNFAALYYGDPLDTDSRFNPFADSVPILIKMANIDRDSVHPGHENDVLVPLFLDLLYGDTTNPADITLETEPKSDSPIITAMNNITQDFFWGTHLDLNYLAEAFEVNLHVLRNNSPIQEFVDIP 521 T 0.18 DUF2754 pdbhh F Bacteria T 7w56 3 C C NMS_HUMAN Neuromedin-S ILQRGSGTAAVDFTKKDHTATWGRPFFLFRPRNQ 34 T 0.00047 NMU unphh F Eukaryota T 7w57 3 C C NMS_HUMAN Neuromedin-S ILQRGSGTAAVDFTKKDHTATWGRPFFLFRPRNQ 34 T 0.00047 NMU unphh F Eukaryota T 7w5r 2 C,H H,X LEU-TYR-ASP-VAL-ALA LYDVA 5 T 76 GIT1_C pdbhh F F 7w5z 1 A,GB U1,u1 Unknown peptide XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 98 F F F 7w5z 3 C,IB U3,u3 Unknown peptide XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 34 F F F 7w5z 4 D,JB U4,u4 Unknown peptide XXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 30 F F F 7w5z 9 I,OB C3,c3 Q950Y6_TETTH Ymf68 MLICNFLMYSNFSRIYWFDFNGTVNENLPLNYNVLKICRNEINKLEKLNENNLGTQKNPIKLNLSFEDKHYNTNNLVLDLNSYETFNSKNFISSIFDKTFESLNTVLMAPIYSFLEFKLKLSSTKINTNHYYVINGKLYITYNDSFKLFTTINDYFNDLNELSNTKLFFLYRSFNIYNIKLNSLVDFVFLKLILFIHLLYLKSTNYNRFDYRLKQTDWGFYINNNSNYIQNIFSGLKYIWRGLRFWIIGLLLGLSSIYYLMYVRLLPFNKIIFAWILVAMFLYWLLSGFVFFVKKYQYSKFTAAIQRFWKRTYIIFWVIEAGTFSVFFYLTLNASSEPVYMYDQIKIYKTHLFSWRWFLIKLLPSVSIILLGYYLQLTLKWNLFNKQNTIVLLITLLLLYILWLEFYQFYHILSFYGNINWAFDYDEYIWTLELDTRRTRLANNYIAICLFAKFWHFVFIFLFWVFFVLRINELGRIRYPLLVANVQNFIIIYIMSWAYMYPWLKFIFRKYLDVPYYWFYLNGRELGIRVFFTDLKLFFYGITNRLFDFNPSSIKFEKYPFYYWINSSQLTEFNQYRKFVIRDSIIYSLNNYII 594 T 0.29 DUF3408 pdbpssm F Eukaryota T 7w5z 11 K,QB 6A,6a W7XCY5_TETTS Transmembrane protein, putative MIWKYLQRTNRGNIIQAGLQHRKFENLPFKQNFDNLTKAYDLRMWYISNSPHEAKNLEYVNELEALHNELNYQNSRQFLFRTVSFLLGWALFYQFYELPKTYDWQDTQEPKHQVPAYGDLEEGGDEGGDD 130 T 0.025 SpoU_methylas_C pdbpssm F Eukaryota T 7w5z 12 L,RB 6B,6b Q24I72_TETTS Cytochrome c oxidase subunit 6B MSSAVEKKDLPADYGKMPAGYNFLTRGKDWREYDKDFILRTDAVWEKFQLEHFFRNYMKCFFFDHGLKKYQMFEPEDMYTVVFEGWALDDLITFPGFTPTGRTNSYQIGLSPRQRTVVPTQTFYQMQDYYMLCGLRFERWFRCDLVYHDQRHTKFDQVKNQKNYKTYPCYREYYEAQYACQDDMFDFLMELAYARRAADNFESDFASHELTTLPTFYDTPKAAERKTYTY 230 T 35 SNN_linker pdbhh F Eukaryota T 7w5z 13 M,SB 6L,6l I7LVX0_TETTS Cytochrome c oxidase subunit 6B-like MEVKYRGPSDDKLECEFLENNLLSCLREKSVQDNVAKMTCRPEFLVWFFLECPTKAAVYHDPKGLRNIFIQDKIKQKGSDDGVLSKDD 88 T 1.3 Defensin_4 pdbhh F Eukaryota T 7w5z 14 N,TB 6C,6c Q23DS4_TETTS Transmembrane protein, putative MGSVWFRNRYWWYRSLYDDYVAREAKLAFGIAAFIWLPHYYWGIHLNRAFEVNFSHRNYAHEWGPRRNRLAHSLEFEQFDMILENWQDLEDEYAQRGDGMLKK 103 T 7.9 NIPSNAP pdbhh F Eukaryota T 7w5z 15 O,UB 7A,7a I7MGF9_TETTS Transmembrane protein, putative MNRFFKVSSKYQYYKYLEQYDAAFLRKYQSETHWYLGRRGAWKNLVIKYAGDHISLEEEHNVKYKTHLSFVYLSYRLAWVLFAYVLIYNHFLLGDIGKTFNVGEWDHRLKPSAERDYPTRYESLYILDRTQKW 133 T 0.0061 IRK pdbpercent F Eukaryota T 7w5z 16 P,VB 7C,7c W7X287_TETTS Cytochrome c oxidase subunit 7C MISKYRYLHCARKLVKQSVQAFGGGHHHHEYDWRDDPKVNKDIEEDIRDRGWHPETYDFPYTKKHDDWVFDVTMPSQNYQTDLTVNIHPENKKMHVMKQVMRQSYWDAEHDMAHEYDYESEDLDFQCESFKSQHFRKKGPISQYLILGLLPILYFGTEFFYNHYPDEDYWRVAHPPPLDYPDTDDTDDTETFKDYKSFTGRRMVDTGIVDPLWYDIREGKKVYYDWAGVNQPMEDI 236 T 0.14 Tctex-1 pdb F Eukaryota T 7w5z 22 BC,V t2,T2 W7XDM6_TETTS Cytochrome c oxidase small TIM subunit 2 MSDKRKIQHEGIIALINYSTLCAQKCDVLKGHDDKITDTEEQCLRVCAEKIRQTFEFTNDIYLKNPNLTKPN 72 T 0.00018 zf-Tim10_DDP pdb F Eukaryota T 7w5z 23 CC,W t3,T3 Q231A8_TETTS Cytochrome c oxidase small TIM subunit 3 MSTRKIFDSEEQSFIRLVDKFYLGLSLTKLCAQSCNLLRNDISGSALTQKEKDCLSICYNNIEKTQSAFYAKVKTTMNLPAVEDDGEEGGDDE 93 T 0.0022 zf-Tim10_DDP pdb F Eukaryota T 7w5z 24 DC,X t4,T4 Q22A35_TETTS Cytochrome c oxidase small TIM subunit 4 MEQNTTQVFSDLAYKVCFKVINDKNKPFVLHDEQRLANCLTRYVEAFNVTSEYFFRERAGETKVTEKQ 68 T 0.0086 zf-Tim10_DDP pdb F Eukaryota T 7w5z 25 EC,Y t5,T5 Q22N23_TETTS Cytochrome c oxidase small TIM subunit 5 MEDNYAADVQRQFNRTAFDSLYKICYNSLVQKNGSTIDFQKQIDCHQRLIQVFAKIAPIVVKVEQDAASSGGAAAGGEDEE 81 T 11 MetOD2 pdbhh F Eukaryota T 7w5z 26 FC,Z t6,T6 Q233U0_TETTS Cytochrome c oxidase small TIM subunit 6 MDPVLGDVIATRIYKACFKHVYGKNMKAYSEKDEAKFDQCLTSYVESYKSVTNHFITYLGQLPKKGLSLDGS 72 T 0.088 zf-Tim10_DDP pdb F Eukaryota T 7w5z 29 CA,IC AC,ac Q24C97_TETTS Cytochrome c oxidase acyl carrier-like subunit MMQNLKKFMSKTIQVQPVSFNQIPKAFYNFPEYRTGGVQANPGITAKRIIKCIGERLRKYDPARWENVPITFKTHFRDENGYSDVATSIQIHDALEREFGIDIKDRLALVTDVETAFYIVMSHHDPL 127 T 0.16 DUF1493 pdbhh F Eukaryota T 7w5z 30 DA,JC Y7,y7 Q950Y7_TETTH Ymf67 MTALFLHILWSISYIIINILYIFLSLLLSNNNEKIKQYNSNYFIKILLVLFYNKNLSFYKNLLSEDEISKIEFERLKNYPTLVLIHSNLNKLEKRNKIINSFINFKTKYRFYKFISTNFNLQTIIKNCNDKIIFSTLLYIVNLNYSFFYKTIKNTDLIVYLLANKFSILNDNIIVSKFNISKFNDYIKYINNTNSIDTYLENQIILGLNNNTNSNITKNINTKLLNSYSNLKNLVNITNNTFYLKKINDNYNTVINSEFLTYLKSNYKISFSASNIVKYLSDKSVNNSVILYLRKNKIFNKSRYSRNRQTYRTGAYWCLYVNIIAVVAFYFWFYKFTMNFGYLWWLLYSLILSFFFSRALKHRFYNPLNVMTEFKNGFMWFIIILINIFKPLLKLLENNYINLYNHLVIKYYQSFICNTLINKKKLEFNYILSSFKFIKELNNIIIISLNKLF 453 T 0.0058 NUFIP1 pdbpercent F Eukaryota T 7w5z 31 EA,KC Y0,y0 Q950Y0_TETTH Ymf70 MFRWLFLYWYNSTDTPSAIAKVNLWSYINLRLFKARLSSSIAYYILGLNNLELKKLKIFYKNTYFDYIYLKSIPCLFLIIFFTNLYLFL 89 T 37 DUF5784 pdbhh F Eukaryota T 7w5z 32 FA,LC Y5,y5 Q951A7_TETTH Ymf75 MFLGIFKDVIKLLNKKVVPVYFWFFLYCFLSTMDTNIFVSSCSFLKVEVFGKDENTTLVLLFYVFYSLFNFYLSRIKNKNNYLVRKHLYTTELLIELILFKYKLIILKFSSIKYILNFNVRKFILFNLFLINNYKAYKINTFFLYIYIYLNNLNIIWYPIFKAYSIFGYYKSTRLNFIDTKNENIKRIKY 190 T 1.8 PDH_E1_M pdb F Eukaryota T 7w5z 33 GA,MC A,a Q22PJ5_TETTS Transmembrane protein, putative MLSKVTRRFLNYNQIYCFASQHGAEHHKLTASDEAYLNEVRQRYVTPDMEKWAYLDYKKHPSTTLSHYDHKSKDYVESERDDYNADVATNSHNKLIDDFKRNLQMQRKVHDILQKMDRPYLRGVPGVTKNISAGLQDYSAPVSKKSQSDPNDFYRDAYRNENRWIDQSVFTPKTSKMTHYDVEWPKELASRPVTKKFHHDKGYKYDVTTPYDQRYNYVADRLGHPEILGNPFERLMRLEGDIYHPNYLDQPFVKVPNANPNASLNFEEGEVLYENTRLLEWAKFWNYSVVVGYLWCAYFVPYNIFFKTHMPLEHAYDNLFFPYFQHTHFLWDNNALHIPTVGGVAIYATYIALSYINNIWKDYVVRAQFSKDKELLFVTRVSPFGTTEEEVYEVAHLEHLPPSVRSGVKDLSAQDADGLVDVTCMSSQRSLVFYKGDQYWNPKVYNDFINQTSNLWTRNYTGYNRLEVQNSVEQVKIGFSHSSQPKLEKK 490 T 0.015 TMEM70 pdbhh F Eukaryota T 7w5z 34 HA,NC B,b Q22FX8_TETTS Protein phosphatase 2C, putative MFRRIISNGALLSTQTQRWQDLSKFACLRASLNKESEKAFQELAKKNNVSPQELVELSKIVSMNLDVLKQNINSEQFLLEKESTLKRYRQSSIGTRGHLQTVNEAVNTKYPTLAEGLGQVAGYKEAYQALREIFVHPSISVNNLRQGSYGQQFAVDFRTRADEYVKALLKDHSSNPQAVQTIQEIQHTLHQIIKNYEQNPASIYARILTVLQTRGVNTLPVSKTADQKAVATIQKTSTPSLTIDQLTVPVQERVQTQTVFDAELAFIKEANEMIQQNTGNLPWDGGKKKIFQGQANKYLETPYYLLAALSGLGLLYFLYSGDAKYKTLVLTPVVGIAAFVLLRRNQILNRVPTLTELFLHKDGKFVDAVVSVNGQLISKNDIPVSTLKLYRGDHTVKVNLNDFEDASAKKFLAQQSGQEGVINVHFSKLRNLAARNGQVLNLGDTEVVVPFENQANRIILKQIFKGVEVLPSS 473 T 0.011 Rh5 pdb F Eukaryota T 7w5z 38 LA,QC F,f Q23DG8_TETTS Transmembrane protein, putative MRYLKIEKEKLVSCKKQEQEVQRIRRRKGNQKLNSIAKQQRVKRRDYQQNIKQNKEVKNPKKLIKQQIINKVKKRKKMFRGLTKFNKVFALNSFKNSLVAVPKANLNHVQNMLEENLKYDAQKYNDEVAVIQKTSRIYKPTYTIEFNREGEVLVYSADPIKNSVVYFKYPYVLYEAAIPLFIWAWIYNPLELSKNAVNSLLIYPNIAWIPRMWYWRSLQYKIQKMYLLRGGKVAKIETQSLAGDRFTSWVETYQFHPLTQDQKNFDNQDNAEFLEDEGQLKYELGVQLDNLQEMGTTSQDIVINFMKEGTVHHPELFEAIVKGYNIDTSDYVINTANNLRAREGNHNH 348 T 0.023 TMEM70 pdbhh F Eukaryota T 7w5z 39 MA,RC G,g Q23DZ5_TETTS Cytochrome c oxidase subunit TT7 MFLNRLVKETSKAKRLFSMAQNNFARAGPYNPNRYKDYYIPRTLPKNEEIVEFVQSQHSVPASPIRNQRHINPVRESGPLPSYDGTYTMEDIRAVFYNTTVGRDYCYCQMDPEEIMRRVPGITRKEAEFITKLGLSPQEQVDFAYIAYNIGLDIFYFTNQMFVARQVVTNSKGEKVEVLWNAQCYEDIAQLNVGFAPVLESVDYHWEIFLWADPPIKPNNDFDLNVPCTWFEYEQEWWMESCIQEDQFNLPEDERPYNTPRNPHCRKELWRSQDALQEEELMVNENWYPKNTQYNIYNQPDFIKPKSGSGAAADDIRI 318 T 5.7 IMS_HHH pdbhh F Eukaryota T 7w5z 41 OA,TC I,i I7LY65_TETTS Cytochrome c oxidase subunit TT9 MVYHLFERICNPDNFKLSGEAARVRTLIAAGFSKEEAEQVAWLQNHQVNGKILGLFTGGFALYCCNNYFHYFERYFPRLRYQPFTKFLAQAATVYFFFKIGDYYFTSRRYGSNDARMNGLMYSNTYYSTNKEALIQNFEPLNRKFTEEEVEQFLRNEGRSQEEKRNWIYNPHIHGSTEGEWKADIHEKFDSGKAPWEREHVKAKILETNKAKIDAGEEIQLKPFKTLNHLDKTGLLHRLHPFIWTNNWTLLG 252 T 0.19 Bac_luciferase pdbpssm F Eukaryota T 7w5z 42 PA,UC J,j I7MD70_TETTS Cytochrome c oxidase subunit TT10 MSSFIQYEFLKIYQGNQKIKNYYKRKRLIFQQKKVLKKKQKEIQMSTNNLRLKPWFHWTDEERSHAIFSAYEKRILKSEDLPSFLRANRINNVSTWVFPLIALPLFNQSIFKLGFAQRILLTRPAIEWHCFKIATVAASWLAWLNFSPFYRKLENEKEYLLDTLESRIGINVLDLNDALPRWTTSQEYNRRTQQLYNQRNGFFAGLLYPQEESSRPLVDIASFPKNLHKEKLTK 234 T 2.9 TFA2_Winged_2 pdbhh F Eukaryota T 7w5z 43 QA,VC K,k W7X4J9_TETTS Cytochrome c oxidase subunit TT11 MFGRLVLKQTRRTLFNPVLKNTFCIYQAYQNPLRHINTGHNPNNVYEDIVMLGDYPVQNRTHDKVISQTYVPAIANIAFTHLSKKYPQAGLKVDQLNTLKEKTWNDLGVNIEHEKQEILVELSEQIFVKESKLRWVHEQRQRLAHTTYVFSGLEFQNVKVGFFIDSYNFLLQELAHRSNLYQSKDIVGEKSFHEKHLEQQTAPYSGVKSLEEPVSQNKSFINSLMRAIHNH 231 T 27 FliD_C pdbhh F Eukaryota T 7w5z 44 RA,WC L,l I7M3P9_TETTS Cytochrome c oxidase subunit TT12 IKGNQKKQKGKNQSNNNNNIREEGKQIKEMILPHNNRQLARQYFDSLPENDINRKYYEGLKYETPKTFFGRFLNQFNIDAKLDTLSKFYTYQKTIRATQAELQEDRKSYLTNSLLFTAVSWFSIYQFARKGAVLPVLREYGRYFGTHRLFRQYLHTLVLPLLYTEYALNQKYYTHMEHLWTVHVNRLNQKILEDPLYTFYPQELNVPKHNIIVPTIFRDTPQ 222 T 18 Plk4_PB2 pdbhh F Eukaryota T 7w5z 45 SA,XC M,m W7WZP1_TETTS Transmembrane protein, putative MSLSLFGVKNNWHKNGIWWFSKILNKTVGEERYDALRVQRRIWSMRFYYARQQCLYELFVDHPDLAQWTGTYPKVDSSHGFPFYSTYEMYRDFQENTLNSDGSFAQWITLVCGIYVIHVIYNYMIPYYWVSTPLKNDEFTRLRMKDYIASTVLEEVYGISYAEWGWLPHDFAYNRMRGLAGYMHPDDPRAMCTSTFHRKHKYIEHEVEKVGDYHHMTYPK 220 T 61 Spore_YabQ pdbhh F Eukaryota T 7w5z 46 TA,YC N,n I7LZX8_TETTS Transmembrane protein, putative MKKGTASEEELKKLYDPNTFYEHGDNPAFKQFMNIAVENLREGKLTDHRTYVVDTYKKWMYARNWDDFLQRDCKAITFPRAFALWIVGTLGMATASKWCRQILPVGSHGITKISQTQFFHQFGPLGTLGAVGFYGLTAYLYYKTTIFTVKKFYSHCILQEREWIFEQERQNPGYGEYFFKDVPLSAEEHFNDLARGEMAKKKFEKPNHEF 210 T 8.8 DUF4500 pdbhh F Eukaryota T 7w5z 47 UA,ZC O,o Q23F08_TETTS Cytochrome c oxidase subunit TT15 MKEKIFNELTRKMKRKEISAKIQREENKQILIRQRNNKKYIQSIQGIQQERKKGKLYLVEMATQNVEEMDTIQKMNYEATVNMGRQDLITREYTFYSDYEFIPIQEDRKQQMEDALNNLHKIIHPTVTQLKKKANVQEIQDRVFRKLQGWEGELNTCVFSAKNVRDSNFCADRFTNRINTEGVEFVKQILREY 193 T 0.13 DUF3221 pdbpssm F Eukaryota T 7w5z 48 AD,VA p,P I7M8Y9_TETTS Cytochrome c oxidase subunit TT16 MNNTFKFLHQVISKLTLKAQVPNYGQYSHSLKRPINPKVVVFGNSSRAYELISSQFRNFNHVNGLELKGQEDNIQANKVAQSVLSINDGFQDGYYITDFPQNSKQAERLDLITDGVNLALYIKDPSDKVTVTRQQEAIDYYRKTGALVEFEVDPRGDLEEQVKQLSNQVLNGYKH 175 T 0.11 PRORP pdbhh F Eukaryota T 7w5z 49 BD,WA q,Q Q23D87_TETTS Transmembrane protein, putative MDNNYHFWGNGDRQDVSLSYEDYYSILDCLLDEKLSPQGLMKFKNLHEVSMYGVSYVPLYCFPVAYGISHMLTGKVRRGHSGYRNLFSLMSVVLPFTCWYAYTTPIPRRLYTEIICSNNADGAYVRNRIKQQKPGIWRKLSQQLYNKNFRFPELNQDLTATEFPLDYVAPHKF 173 T 0.16 DUF2206 pdb F Eukaryota T 7w5z 50 CD,XA r,R I7MKT6_TETTS Cytochrome c oxidase subunit TT18 MVFEFLFYNQQHKTRNGYFINHDNLMLASLEERKKLIFYFIANQVPEKLDPVDRVKFNEELSDNLSTKARLIGSLTGLIGLVGFPYISTRIYSRPVLNIGLSLLICPFLYYVGNQLTYSVWEPKFIANNNTVCELSKKYNFTVFDFAQAKKEAHLKALRTELVSDNLLYSPGI 173 T 0.048 DUF1689 pdbhh F Eukaryota T 7w5z 51 DD,YA s,S Q230X6_TETTS Cytochrome c oxidase subunit TT19 MAIRNFVFKISNQIQNLAAKRSLAYLNQIDSQSVPSRATINMKDQVTQMQREIDNMANVIRAQIPDEDRAEFEILKKYYVTGQHDSLVDPQDVLLQLDRIQVLKNLKMIELNEEAYDPELVRLEKLKARVLLEEEGALLEYAHFISKRPYNKPYEKWGVSEEHVKQQILG 170 T 0.35 APG6_N pdb F Eukaryota T 7w5z 52 ED,ZA t,T Q23VY4_TETTS Transmembrane protein, putative MGFETVVPAPPTRDDELRMIKATEEQFLQQPRYKLYMNEAHRIAKMNHGDRHNNIRAHFWSNFALGLLITGPIFIIPFGKAFRNLRSGVPYYFRPKYVFTQKNQYNQDRNWGAMKKQIPLWLGLSTAYAYWFTDFSINDDEWLEKGKVIYPHQTIKVL 158 T 0.091 MASE1 pdbpercent F Eukaryota T 7w5z 53 AB,FD U,u Q22DP8_TETTS Transmembrane protein, putative MSCTTRRFIDEKEKLEYSRGYNQQELEASKLRKDFVKKYIVDFDTTLYKTQVERDWAYIAKREYRYEVQLKSIGYGGALANAVLLWRIYANKKMVFWPIPIVGALGYLYFQPVFFQKSNKRFFDMCNVGEEYYLGRERNKILRECNKILNVEDF 154 T 0.1 DUF559 pdb F Eukaryota T 7w5z 54 BB,GD V,v I7MFV5_TETTS Cytochrome c oxidase subunit TT22 MGKDQLDFSHFDKAFENKYDIVAPEFGDLHQKRAEFIAKNQGTYRPVPLVPNNIKGLIPKTCRLPATRNWYRRTSSFERNGFFNIHTPVLNTKMIPWLLFIVLTWGWSSFQIGGYNYERFDDNGERRNTLYWKLSPVEFPQSKLWNRPS 149 T 0.056 TOM6p pdb F Eukaryota T 7w5z 55 CB,HD W,w Q23TE5_TETTS Transmembrane protein, putative MVFHYTNFVQETNAWWLRRVRPVYCTVLAYYGWWLYDRYYLFGKNATQDIRKDTTEVWEKRAALNKRNWGYNAHYKPELERSMKKVLYADPNYKFPIEWPERYMAETKTLEQVMDEEENWEYYK 124 T 4.1 GTA_holin_3TM pdbhh F Eukaryota T 7w5z 56 DB,ID X,x Q22W32_TETTS Transmembrane protein, putative MEPFGTDERNWTHEEKDIITRFLKYDKHVNLKTAEMVYSAEVESAYFGKAGALAGGVISALFFNFPIVRNLPIIRRSVIGVLPFLYCYTWGKNTQEELRWLKTFAAYQRFVVYHGQHCKLWV 122 T 3.1 Pepsin-I3 pdbhh F Eukaryota T 7w5z 57 EB,JD Y,y I7M9E7_TETTS Cytochrome c oxidase subunit TT25 MAQTAHQNRYQGGLCYAQCNELFSFWNPSIQQCWKGCDFGVGRVNDPEGRIEAQQMCKRWAAELYWTYKGELDTIKDLRVHADMYPTTPQNVYRACLAGVRRQKF 105 T 0.59 BSMAP pdbhh F Eukaryota T 7w5z 58 FB,KD Z,z I7LTF1_TETTS Cytochrome c oxidase subunit TT26 MSSDPFKKVERDYHNERSVHKHFASYPLKFWWGLNKFETIQGIHSILGNAADLVVSTLSFIPGVQGRNNASYIENSIRVTRFRGFDDKTQ 90 T 0.14 DUF5493 pdbpssm F Eukaryota T 7w62 1 A,B,C,D,E,F,G,H,I,J,K,L,M,N B,G,A,D,E,F,C,H,I,J,K,L,M,N D7DTD6_METV3 lectin MAQNDNYIYSTEVGGVGGTPFTFMQESGTITSIKFNWSDQYKLLHHIEVKFINNANIYATGDPKGNHEVILEIDDDETIIGSVIGYKKGNDGRCTGVKLTTSKGKSIMAGYFEESLITTYTGKLAGIKGGAGSDIDRLGLIFLKK 145 T 0.00015 Jacalin pdbpssm F Archaea T 7w63 1 A,B,C A,B,C TCPB_VIBCH Toxin-coregulated pilus biosynthesis protein B GGELMIKSSNAFDVIELSSQIQRYASLSKINNRTNPILKDNKAKEFKDADLKWLKLENCPTAGDVPTTGNNNDLQDQFIACDADYRKGDLSYFGSQFEFSTYVHPSNPEIQRQIKQVVSYFQYRGMERAFIGDAAGYVISEAKKKGFSAQDYRIVLIEPDRVGYFESNAISYEEFIENPSARENFLLKATKDRTLALAVSLAQTGEIAMQRDGSVAFLEDSELCWDTAAGSAKSCLSVRYDTVGNKTELDLKQIDVVSAKGLSFESDGKTKTPVVSTYETFQDGGRAKTINAIECPTGLNNRFAAVVSSFSTAGQNANFSSESAKDSQGTTQKDGSKGPHALLSGISLNWTLTNKVWDVTASIGIESGILPTSGIDSGSLLRNPKSLSFIAFQWCEN 397 T 0.0013 DUF1494 unphh F Bacteria T 7w64 1 A,B,C,D,E,F A,B,C,D,E,F TCPB_VIBCH Toxin-coregulated pilus biosynthesis protein B GGELMIKSSNAFDVIELSSQIQRYASLSKINNRTNPILKDNKAKEFKDADLKWLKLENCPTAGDVPTTGNNNDLQDQFIACDADYRKGDLSYFGSQFEFSTYVHPSNPEIQRQIKQVVSYFQYRGMERAFIGDAAGYVISEAKKKGFSAQDYRIVLIEPDRVGYFESNAISYEEFIENPSARENFLLKATKDRTLALAVSLAQTGEIAMQRDGSVAFLEDSELCWDTAAGSAKSCLSVRYDTVGNKTELDLKQIDVVSAKGLSFESDGKTKTPVVSTYETFQDGGRAKTINAIECPTGLNNRFAAVVSSFSTAGQNANFSSESAKDSQGTTQKDGSKGPHALLSGISLNWTLTNKVWDVTASIGIESGILPTSGIDSGSLLRNPKSLSFIAFQWCEN 397 T 0.0013 DUF1494 unphh F Bacteria T 7w65 1 A,B,C A,B,C TCPB_VIBCH Toxin-coregulated pilus biosynthesis protein B GGELMIKSSNAFDVIELSSQIQRYASLSKINNRTNPILKDNKAKEFKDADLKWLKLENCPTAGDVPTTGNNNDLQDQFIACDADYRKGDLSYFGSQFEFSTYVHPSNPEIQRQIKQVVSYFQYRGMERAFIGDAAGYVISEAKKKGFSAQDYRIVLIEPDRVGYFESNAISYEEFIENPSARENFLLKATKDRTLALAVSLAQTGEIAMQRDGSVAFLEDSELCWDTAAGSAKSCLSVRYDTVGNKTELDLKQIDVVSAKGLSFESDGKTKTPVVSTYETFQDGGRAKTINAIECPTGLNNRFAAVVSSFSTAGQNANFSSESAKDSQGTTQKDGSKGPHALLSGISLNWTLTNKVWDVTASIGIESGILPTSGIDSGSLLRNPKSLSFIAFQWCEN 397 T 0.0013 DUF1494 unphh F Bacteria T 7w67 3 C F RBBP5_HUMAN RBBP-5,RETINOBLASTOMA-BINDING PROTEIN RBQ-3 SAFAPDFKELDENVEYEERESEFDIED 27 T 0.014 DUF2457 unppercent F Eukaryota T 7w6a 3 C F RBBP5_HUMAN RBBP-5,RETINOBLASTOMA-BINDING PROTEIN RBQ-3 SAFAPDFKELDENVEYEERESEFDIED 27 T 0.014 DUF2457 unppercent F Eukaryota T 7w6i 3 C F RBBP5_HUMAN RBBP-5,RETINOBLASTOMA-BINDING PROTEIN RBQ-3 SAFAPDFKELDENVEYEERESEFDIED 27 T 0.014 DUF2457 unppercent F Eukaryota T 7w6j 3 C F RBBP5_HUMAN RBBP-5,RETINOBLASTOMA-BINDING PROTEIN RBQ-3 SAFAPDFKELDENVEYEERESEFDIED 27 T 0.014 DUF2457 unppercent F Eukaryota T 7w6l 3 D,F D,F RBBP5_HUMAN RBBP-5,RETINOBLASTOMA-BINDING PROTEIN RBQ-3 SAFAPDFKELDENVEYEERESEFDIED 27 T 0.014 DUF2457 unppercent F Eukaryota T 7w75 2 C,D,E,F C,D,E,F BRE1_KLULC RING-TYPE E3 UBIQUITIN TRANSFERASE BRE1 MNDHFVKRPKLELSDPSEPLTQKDVIAFQKEALFRCLNKWRVKANQLVEENEVLAAGLSKTTESVSGCCSSIVVLARSVVEDCSDEQDKRFLQQLINTEDEHTLTQIISNNSARICELILKTSGSNISDNIGRLQELESLTLTLQKLLKSSENKLKKATEYYENIIAQYDRQDSESVSRVFNTADDDSNVKKEKQSSTGASSVNDE 206 T 7.8E-05 HCR unphh F Eukaryota T 7w76 2 C,D,E,F C,D,E,F BRE1_KLULC RING-TYPE E3 UBIQUITIN TRANSFERASE BRE1 MNDHFVKRPKLELSDPSEPLTQKDVIAFQKEALFRCLNKWRVKANQLVEENEVLAAGLSKTTESVSGCCSSIVVLARSVVEDCSDEQDKRFLQQLINTEDEHTLTQIISNNSARICELILKTSGSNISDNIGRLQELESLTLTLQKLLKSSENKLKKATEYYENIIAQYDRQDSESVSRVFNTADDDSNVKKEKQSSTGASSVNDE 206 T 7.8E-05 HCR unphh F Eukaryota T 7w7g 4 D D NALF1_MOUSE Transmembrane protein FAM155A MTRGAWMCRQYDDGLKIWLAAPRENEKPFIDSERAQKWRLSLASLLFFTVLLSDHLWFCAEAKLTRTRDKEHHQQQQQQQQQQQQQQQQQQQQQQRQQQRQRQQQRQRQQEPSWPALLASMGESSPAAQAHRLLSASSSPTLPPSPGGGGGSKGNRGKNNRSRALFLGNSAKPVWRLETCYPQGASSGQCFTVESADAVCARNWSRGAAAGEEQSSRGSRPTPLWNLSDFYLSFCNSYTLWELFSGLSSPSTLNCSLDVVLTEGGEMTTCRQCIEAYQDYDHHAQEKYEEFESVLHKYLQSDEYSVKSCPEDCKIVYKAWLCSQYFEVTQFNCRKTIPCKQYCLEVQTRCPFILPDNDEVIYGGLSSFICTGLYETFLTNDEPECCDIRSEEQTAPRPKGTVDRRDSCPRTSLTVSSATRLCPGRLKLCVLVLILLHTVLTASAAQNSTGLGLGGLPTLEDNSTRED 467 F F Eukaryota T 7w8j 2 B,D,F,H B,D,F,H I6NWZ0_9RHOB N,N-dimethylformamidase small subunit MTEASESCVRDPSNYRDRSADWYAFYDERRRKEIIDIIDEHPEIVEEHAANPFGYRKHPSPYLQRVHNYFRMQPTFGRYYIYSEREWDAYRIATIREFGELPELGDERFKTEEEAMHAVFLRRIEDVRAELA 132 T 0.18 UPF0158 pdb F Bacteria T 7w8k 1 A A drp1 CPPCHGRPTCDSFTNCWELLTCPPC 25 T 0.23 CCAP pdbhh F T 7w8o 1 A A drp2-a GCPPCESCHSGESTFWCYWEALCPPC 26 T 0.32 FlpD pdbhh F T 7w8r 1 A A drp2-b GCPPCESCHSGESTFWCYWEALCPPC 26 T 0.32 FlpD pdbhh F T 7w8t 1 A A DRP3 GCPPCASGCSPETGEFCWREDDCPPC 26 T 1.3 DNA_ligase_ZBD pdbhh F T 7w8z 1 A A drp4 SCPPCHGRPTCTKPGDNATPEKLAKYQACWELLTCPPC 38 T 0.14 Hormone_3 pdb F T 7w96 1 A A drp6 SCPPCMEVSSCDEETGECEIGSRCPPC 27 T 0.083 zinc_ribbon_4 pdbhh F T 7wa4 1 A A GIGAN_ARATH Protein GIGANTEA GSMASSSSSERWIDGLQFSSLLWPPPRDPQQHKDQVVAYVEYFGQFTSEQFPDDIAELVRHQYPSTEKRLLDDVLAMFVLHHPEHGHAVILPIISCLIDGSLVYSKEAHPFASFISLVCPSSENDYSEQWALACGEILRILTHYNRPIYKTEQQNGDTERNCLSKATTSGSPTSEPKAGSPTQHERKPLRPLSPWISDILLAAPLGIRSDYFRWCSGVMGKYAAGELKPPTIASRGSGKHPQLMPSTPRWAVANGAGVILSVCDDEVARYETATLTAVAVPALLLPPPTTSLDEHLVAGLPALEPYARLFHRYYAIATPSATQRLLLGLLEAPPSWAPDALDAAVQLVELLRAAEDYASGVRLPRNWMHLHFLRAIGIAMSMRAGVAADAAAALLFRILSQPALLFPPLSQVEGVEIQHAPIGGYSSNYRKQIEVPAAEATIEATAQGIASMLCAHGPEVEWRICTIWEAAYGLIPLNSSAVDLPEIIVATPLQPPILSWNLYIPLLKVLEYLPRGSPSEACLMKIFVATVETILSRTFPPESSRELTRKARSSFTTRSATKNLAMSELRAMVHALFLESCAGVELASRLLFVVLTVCVSHEAQSSGSKRPRSEYASTTENIEANQPVSNNQTANRKSRNVKGQGPVAAFDSYVLAAVCALACEVQLYPMISGGGNFSNSAVAGTITKPVKINGSSKEYGAGIDSAISHTRRILAILEALFSLKPSSVGTPWSYSSSEIVAAAMVAAHISELFRRSKALTHALSGLMRCKWDKEIHKRASSLYNLIDVHSKVVASIVDKAEPLEAYLKNTPVQKD 815 T 16 AvrL567-A pdbhh F Eukaryota T 7wae 2 E,F,G,H J,K,L,M 4x(beta-Asp-Arg) DDDD 4 T 160 LicD pdbhh F F 7waf 2 E,F,G,H I,J,K,L 4x(beta-Asp-Arg) DDDD 4 T 160 LicD pdbhh F F 7wap 1 A,B,C,D,E,F,G,H,I,J,K,L,M,N B,G,A,D,E,F,C,H,I,J,K,L,M,N D7DTD6_METV3 Mevo lectin MAQNDNYIYSTEVGGVGGTPFTFMQESGTITSIKFNWSDQYKLLHHIEVKFINNANIYATGDPKGNHEVILEIDDDETIIGSVIGYKKGNDGRCTGVKLTTSKGKSIMAGYFEESLITTYTGKLAGIKGGAGSAIDRLGLIFLKK 145 T 0.00015 Jacalin unppssm F Archaea T 7wbb 2 F H substrate XXXXXXXXXXXXXXXXXXXXXXX 23 F F F 7wbg 3 C,F C,F ARG-ARG-ARG-GLU-GLN-THR-ASP-TYR RRREQTDY 8 T 15 HEPN_SAV_6107 pdbhh F T 7wbk 1 A,B A,B Q5ZZD0_LEGPH Lpg0081 MKAIPPKIWFETQLKGSGLDKKFQIDELIETQSSVRVFANKKYLPDTETINEALTKVTAVNVSGDKSGYFQNGLPFPNEAGYFEKIPVGHPELLSPIERLTGSKKIVSSHSLVTASGGYPLTNPLLPYRKPIRVSIFSLAGPSFENNYLHYRLFLLDSVQKIIDSPLFSHLHDGLPIQFDEAKKELGEYDTNKLMARIRLGFPYLARFSSGGFYPSFSKSNAIIFLSEAYFRYQLEDVSLLLASVNQTGKETGKAALLKATAVGMGFFAKIDCGYDIQHIIFPYYLRAYKKLLSEHKFPWIAKIEFPIFNEIQQEQFDSIFEDYDGPTKVYRSTRDVLEFREEEIEKYLPAAINPSDAFALTGNEWGYGSVESMIGNNSSIRFDQVHHMNPLILDPSHHVEAQINKDHGVELTVN 415 T 0.0071 DUF4804 unppssm F Bacteria T 7wbm 1 A,B A,B Q5ZZD0_LEGPH Lpg0081 MKAIPPKIWFETQLKGSGLDKKFQIDELIETQSSVRVFANKKYLPDTETINEALTKVTAVNVSGDKSGYFQNGLPFPNEAGYFEKIPVGHPELLSPIERLTGSKKIVSSHSLVTASGGYPLTNPLLPYRKPIRVSIFSLAGPSFENNYLHYRLFLLDSVQKIIDSPLFSHLHDGLPIQFDEAKKELGEYDTNKLMARIRLGFPYLARFSSGGFYPSFSKSNAIIFLSEAYFRYQLEDVSLLLASVNQTGKETGKAALLKATAVGMGFFAKIDCGYDIQHIIFPYYLRAYKKLLSEHKFPWIAKIEFPIFNEIQQEQFDSIFEDYDGPTKVYRSTRDVLEFREEEIEKYLPAAINPSDAFALTGNEWGYGSVESMIGNNSSIRFDQVHHMNPLILDPSHHVEAQINKDHGVELTVN 415 T 0.0071 DUF4804 unppssm F Bacteria T 7wcy 3 C,F C,F Q8MWF9_CRYPV SER-VAL-PHE-ALA-ILE-PHE-ALA-ALA-LEU SVFAIFAAL 9 T 3.2 DUF2547 pdbhh F Eukaryota T 7we3 1 A A DRP8II GCPPCPREHELVAVPCEGLNNCWFVEACPPC 31 T 0.94 Late_protein_L1 pdbhh F T 7we6 4 Q,R U,V A0A8G3G219_PSEAI AcrIF24 MNAIHIGPFSITPAARGLHYGGLPHHQWTLYYGPREMAIKTLPDSYTSSEVRDEFSDIIAEFVIDARHRYAPDVLELVNSDGDAVLARVAVSRLPEALSGCIPDDRFPYWLLTASRPRLGLPVTLNEYTALAVELSAPPLAWITGLLPGEVLTHDAEEWRPPTSWELRHVVGEGSFTGVSGAAAAALLGMSATNFRKYTAGDSAANRQKISFAAWHYLLDRLGVKRAS 228 T 0.00054 DUF4447 pdbhh F Bacteria T 7weg 2 C,D C,D FCSD2_MOUSE FCHSD2 GPGSTEKMEDVEITLV 16 T 6.9 ChW pdbhh F Eukaryota T 7wei 1 A A drp8I GCPPCPREHELVAVPCEGLNNCWFVEACPPC 31 T 0.94 Late_protein_L1 pdbhh F T 7wek 2 C,D C,D CAMP3_MOUSE MARSHALIN,PROTEIN NEZHA NSEVKMTSFAERKKQLVKAEAESGLGSPTS 30 T 16 CEP44 pdbhh F Eukaryota T 7wex 1 A A Q82A10_STRAW Cytochrome P450 hydroxylase MGSSHHHHHHSSGLVPRGSHMDPAEGLLADPYAVYDRLRDTAPVHRIAGTDGKPAWLVTRYDDVREGLANPLLSLDKKHALPGNYRGLALPPALDANLLNMDAPDHTRIRRLVGRAFTLRRVEQLREPVRETAHRLLDALGTHGSTDLIASYAAPLPITVICDLLGVPDEHRRDFRAWTDPLVTPDPARPDVARESVVSLLGFFTGLLADKRKNPADDLLSDLIAVQEEGDRLTEDELMSLAFLILFAGYENTVHLIGNAVLALLRHPEQLAALREDPARLPDAVGEFARYEGPALLAIRRFPVRDVTIGGVTVPAGETVLLSLSAANRDPSRFPDPDRLDLGRDAAGHLALGHGVHYCLGAPLARLETEVALAALLERFPDLALAETEPRRRPSLRARGLLALPVTY 408 T 2.1E-33 p450 pdbpercent F Bacteria T 7wff 9 I b PNSB2_ARATH PROTEIN PNSB2,NAD(P)H DEHYDROGENASE SUBUNIT 45,NDH-DEPENDENT CYCLIC ELECTRON FLOW 2 MASLISFSLLPKPKAVRSSISAPQTQTINTEKLEDKFGRKGIKFSESNNIPMVELKVRNGSSLKLSLSDAHVLSYKPKVYWKDEGFEEVLYTVDGDESRGGVGVVIVNGEEPKGGSSVISGCDWSVKDTDSDAIDALQIELSCTAGVLDITYIVSLYPVSMATALVVKNNGRKPVTLKPGIMSYLRFKKRSGAGIQGLKGCSYCPNPPLSSPFELLSPSEAMKAESSGWFGSEEGEKPGIWAVEDSVITLLEKKMSRIYGAPPAERLKAVYNTPPSKFETIDQGRGLFFRMIRIGFEEMYVGSPGSMWDKYGKQHYFVCTGPTSMLVPVDVASGETWRGAMVIEHDNL 348 T 0.33 Aldose_epim pdbpssm F Eukaryota T 7wff 11 K d B3H6Z4_ARATH NDH dependent flow 6 MAEAFTSFTFTNLHIPSSYNHSPKQNSGPNHGYWLSNVNEKRERNLMRGSLCVRKALPHDLPLMAVMVQQIEGMRDIITEKHVWHLSDKAIKNVYMFYIMFTCWGCLYFGSAKDPFYDSEEYRGDGGDGTGYWVYETVCISPFLILLGKKEKNLEMHTNYN 161 T 7.1 NAD_kinase pdbhh F Eukaryota T 7wff 12 L e PNSB5_ARATH PROTEIN PNSB5,NAD(P)H DEHYDROGENASE 18 MATVTILSPKSIPKVTDSKFGARVSDQIVNVVKCGKSGRRLKLAKLVSAAGLSQIEPDINEDPIGQFETNSIEMEDFKYGYYDGAHTYYEGEVQKGTFWGAIADDIAAVDQTNGFQGLISCMFLPAIALGMYFDAPGEYLFIGAALFTVVFCIIEMDKPDQPHNFEPQIYKLERGARDKLINDYNTMSIWDFNDKYGDVWDFTIEKDDIATR 212 T 0.028 DUF3098 pdbpssm F Eukaryota T 7wfg 9 I T NdhT XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 122 F F F 7wg5 27 OA b PNSB2_ARATH PROTEIN PNSB2,NAD(P)H DEHYDROGENASE SUBUNIT 45,NDH-DEPENDENT CYCLIC ELECTRON FLOW 2 MASLISFSLLPKPKAVRSSISAPQTQTINTEKLEDKFGRKGIKFSESNNIPMVELKVRNGSSLKLSLSDAHVLSYKPKVYWKDEGFEEVLYTVDGDESRGGVGVVIVNGEEPKGGSSVISGCDWSVKDTDSDAIDALQIELSCTAGVLDITYIVSLYPVSMATALVVKNNGRKPVTLKPGIMSYLRFKKRSGAGIQGLKGCSYCPNPPLSSPFELLSPSEAMKAESSGWFGSEEGEKPGIWAVEDSVITLLEKKMSRIYGAPPAERLKAVYNTPPSKFETIDQGRGLFFRMIRIGFEEMYVGSPGSMWDKYGKQHYFVCTGPTSMLVPVDVASGETWRGAMVIEHDNL 348 T 0.33 Aldose_epim pdbpssm F Eukaryota T 7wg5 29 QA d B3H6Z4_ARATH NDH dependent flow 6 MAEAFTSFTFTNLHIPSSYNHSPKQNSGPNHGYWLSNVNEKRERNLMRGSLCVRKALPHDLPLMAVMVQQIEGMRDIITEKHVWHLSDKAIKNVYMFYIMFTCWGCLYFGSAKDPFYDSEEYRGDGGDGTGYWVYETVCISPFLILLGKKEKNLEMHTNYN 161 T 7.1 NAD_kinase pdbhh F Eukaryota T 7wg5 30 RA e PNSB5_ARATH PROTEIN PNSB5,NAD(P)H DEHYDROGENASE 18 MATVTILSPKSIPKVTDSKFGARVSDQIVNVVKCGKSGRRLKLAKLVSAAGLSQIEPDINEDPIGQFETNSIEMEDFKYGYYDGAHTYYEGEVQKGTFWGAIADDIAAVDQTNGFQGLISCMFLPAIALGMYFDAPGEYLFIGAALFTVVFCIIEMDKPDQPHNFEPQIYKLERGARDKLINDYNTMSIWDFNDKYGDVWDFTIEKDDIATR 212 T 0.028 DUF3098 pdbpssm F Eukaryota T 7wg5 44 FB T NdhT XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 122 F F F 7whp 1 A,B A,B Q914N6_CPVBM Structural protein VP3 MWHYTSINNDTRVALDPKPNQIRTITKPNTVPQLGTDYLYTFNSQRRSHTLRLLGPFQYFNFSETDRGHPLFRLPLKYPSKAIPADELIDNLHSWMRSVHLLHVRSEDNTLRYNWMLGVYARSTNYTTPVGQLVVNAPAILNYSNPQDAFNSVFVALGIDYIDIPITNSNIFDDSSTPYNVRIWHAPTMTEVNHILALMRKSTLVSTHSSWHWNVLHTFHYRSESDMIDHFAAKILEDWRQKEKLDKGALVEADRVIQRLIPLSSSTYVQRLAAIGALYPNEFTENVLDLSRLSTALLQLSDTYYQHANDQLRRLYRRMYNDSRTLYMTQRHQELLLAQITADPNILLYPYTYIFTTIPTSMNYISNTGQGRIKHSLTVTGATEHDTVADIVLGQTGEDVITISMVEPMSIAVEDMYGYVLDTPTRDIWPADEQIEQKGDAVALYDTKTSRALGMFNNTVRIDDLLSPLLSLVYRTYIKGDTMTMTQGSLDHLTLCAAVDSDITFVGNRMIAPLPEGYIPKPMHRNNSTMKMLSLYVALKKLENFATNSYLMAPDTSIILLGAEREPAVNILRRFNRNVSNVRIIGMGDRAVEPNIRVRVPFPIDKNISADFIICDINSYEDQSFESMFSETISVVTTCASAATRALVKINHPSEYMINSVIERLSQLGGVFYHTALLKTASQNPYSYETYIYITPIAAAVRFPFYSNSAMINRYMTAVADDEMPIIPSIHTVIKGHSNTYSPGLFCGCVDVQSAPLALSQLKSYCSEATTWRVDSDDNLVNIIARIDPARIALEFRTRSNTSAYHEYQRYVPNGLGFKVRKTREFRYMHREVTFIHKLMMYALIREQISLTENMTQVVSIGGRNLADISVVPLNMKYVVIDPATRIETLTQEKKNIEVQSRPFQFDAANMDLENNSIYLFIAVIMNEPNGAATPARMQMDKIRNVATAMLTRTNCVAYISFYEAGIITRLDQSTAHKTIRVEEGRLKVANYVPVDTLVEADVTLMLRDIGITHEIIRPSTPELIDACSNYGIRLGSTGGAVLDVFNHYSPVIKLVRS 1058 T 0.52 DUF5705 pdbpssm T Viruses T 7whu 2 E,F,G,H E,F,G,H Ecotin Peptide VSSPVSTM 8 T 8.2 DUF6342 pdbhh F F 7wj2 3 C C GAG_HV1H2 8-mer peptide LYNTVATL 8 T 0.15 Gag_p17 pdbhh T Viruses T 7wj3 3 C C 4-mer lipopeptide GAAL 4 T 210 Spore_GerAC pdbhh F F 7wjt 1 A,B,C,D A,B,C,D TM266_MOUSE Isoform 2 of Transmembrane protein 266 GGSVKLEMEMVTQQYEKAKAIQDEQLERLTQICQEQGFEIRQLRAHLAQQDLDLAAEREAALQA 64 T 0.00011 VGPC1_C unphh F Eukaryota T 7wkd 6 F C TRH QHX 3 T 64 PEPcase_2 pdbhh F F 7wko 1 A A M1R2X3_9CAUD Csy1 MGSSHHHHHHSSGRENLYFQGMIKEMIEDFISKGGLIFTHSGRYTNTNNSCFIFNKNDIGVDTKVDMYTPKSAGIKNEEGENLWQVLNKANMFYRIYSGELGEELQYLLKSCCTAKEDVTTLPQIYFKNGEGYDILVPIGNAHNLISGTEYLWEHKYYNTFTQKLGGSNPQNCTHACNKMRGGFKQFNCTPPQVEDNYNA 200 T 0.0033 Cas_Csy1 unppercent T Viruses T 7wko 2 B B M1QWL5_9CAUD Csy2 MGSSHHHHHHSSGRENLYFQGMRKFIIVKNVKVDGINAKSSDITVGMPPATTFCGLGETMSIKTGIVVKAVSYGSVKFEVRGSRFNTSVTKFAWQDRGNGGKANNNSPIQPKPLADGVFTLCFEVEWEDCAEVLVDKVTNFINTARIAGGTIASFNKPFVKVAKDAEELASVKNAMMPCYVVVDCGVEVNIFEDAVNRKLQPMVNGYKKLEKIVDNKHMRDKFTPAYLATPTYTMIGYKMVSNVDNFDQALWQYGENTKVKTIGGIYND 269 T 6.1999999999999986E-24 Cas_Csy2 unppssm T Viruses T 7wku 2 G,H,I,J,K,L H,I,J,K,L,M N-[(5-METHYLISOXAZOL-3-YL)CARBONYL]ALANYL-L-VALYL-N~1~-((1R,2Z)-4-(BENZYLOXY)-4-OXO-1-{[(3R)-2-OXOPYRROLIDIN-3-YL]METHYL}BUT-2-ENYL)-L-LEUCINAMIDE XAVLXX 6 T 1700 zf-H2C2_5 pdbhh F F 7wlp 2 B,C,D B,D,C BKRF4_EBVG Tegument protein BKRF4 MRRLLSDEEEETSQSSSYTLGSQASQSIQEEDVSDTDESDYSDEDEEIDLEEEYPSDEDPSEGSDSDPSWHPSDSDESDYSESDEDEA 88 T 0.033 Nop14 unppercent T Viruses T 7wmc 2 B,C,E C,E,D Peptide1 GFXRGXWPCG 10 T 0.22 DUF1677 pdbhh F T 7wmp 1 A,B,C,D,E,F,G,H,I,J,K,L a,b,c,d,e,f,g,h,i,j,k,l PORTL_BPKHP HEAD-TO-TAIL CONNECTOR GP8,PUTATIVE PORTAL PROTEIN ORF17 MDFTTLQNDFTNDYQKALIANNEFLEAKKYYNGNQLPQDVLNIILERGQTPIIENMFKVIVNKILGYKIESISEIRLSPKQEEDRALSDLLNSLLQVFIQQENYDKSMIERDKNLLIGGLGVIQLWVSQDKDKNVEIEIKAIKPESFVIDYFSTDKNALDARRFHKMLEVSEQEALLLFGDSVIVNYSNVNHERIASVIESWYKEYNEETQSYEWNRYLWNRNTGIYKSEKKPFKNGACPFIVSKLYTDELNNYYGLFRDIKPMQDFINYAENRMGNMMGSFKAMFEEDAVVDVAEFVETMSLDNAIAKVRPNALKDHKIQFMNNQADLSALSQKAEQKRQLLRLLAGLNDESLGMAVNRQSGVAIAQRKESGLMGLQTFLKATDDMDRLIFRLAVSFICEYFTKEQVFKIVDKKLGDRYFKINSNDDNKIRPLKFDLILKSQLKTESRDEKWYNWNELLKILAPIRPDLVPSLVPLMLNDMDSPITNDVLEAIQNANALQQQNAEANAPYNQQIQALQIQKLQAEIMELQAKAHKYAEQGALSQTTNESEKINQAVAITEMQQQNANNANNEESNNKPKKKLKTSDKTTWRKYPSAQNLDY 602 T 0.056 LUC7 pdb T Viruses T 7wmp 2 M,N,O,P,Q,R,S,T,U,V,W,X m,n,o,p,q,r,s,t,u,v,w,x I7HHN3_BPKHP Adaptor protein gp12 MIEVSEVIAKVRERLNDNEVGNYEILDSVLVENINQALLKICLEFRLKKAITRSLITEEERFLTLNNLLGIESVKLDKKEIESRNTIEKDTGELELLILSDRISVTPFKIGELEVVYYTYEEIRNILETIKLPKICLDVLVYSVLCNLLEIPNNETNFSVLANYKQLLKLAKDNLTNYLSLMYSKNIHFSKVVRV 195 T 0.0012 GST_C_6 pdbpssm T Viruses T 7wmp 3 AA,BA,CA,DA,EA,FA,GA,HA,IA,JA,Y,Z C,D,E,F,G,H,I,J,K,L,A,B I7HFX1_BPKHP Nozzle protein gp25 MDTTRFIRNFILFKDALQKQNFNNKDLNTTSMQAALQSEQLALSEESQYLQSEQVRAKMQIDFLGMQANLQNAKAETLNKLIQCQAMLKSLKDNAMINRANALVSLLQVQANAANGITTSNFEAAFKIIAQIGSEYNQITLNNGNVSVQEKEQTNELKTILNSLSKELEKLNQQSEVNSIQIFSDKLEVLKDAPARLWGFSTLSNAKEGFYNEANEQIASGSVCLFRSDKVRKHTITFKAINTKTSLSKNITISVIANKLKERMS 265 T 0.019 PKD_3 pdbhh T Viruses T 7wn0 1 A A Q8IDM6_PLAF7 Equilibrative nucleoside/nucleobase transporter MSTGKESSKAYADIESRGDYKDDGKKGSTLSSKQHFMLSLTFILIGLSSLNVWNTALGLNINFKYNTFQITGLVCSSIVALFVEIPKIMLPFLLGGLSILCAGFQISHSFFTDTQFDTYCLVAFIVIGVVAGLAQTIAFNIGSTMEDNMGGYMSAGIGISGVFIFVINLLLDQFVSPEKHYGVNKAKLLALYIICELCLILAIVFCVCNLDLTNKNNKKDEENKENNATLSYMELFKDSYKAILTMFLVNWLTLQLFPGVGHKKWQESHNISDYNVTIIVGMFQVFDFLSRYPPNLTHIKIFKNFTFSLNKLLVANSLRLLFIPWFILNACVDHPFFKNIVQQCVCMAMLAFTNGWFNTVPFLVFVKELKKAKKKKEIEIISTFLVIAMFVGLFCGIWTTYIYNLFNIVLPKPDLPPIDVTQ 422 T 1.5000000000000002E-22 Nucleoside_tran unppercent F Eukaryota T 7wn1 1 A A Q8IDM6_PLAF7 NUCLEOSIDE TRANSPORTER 1 MSTGKESSKAYADIESRGDYKDDGKKGSTLSSKQHFMLSLTFILIGLSSLNVWNTALGLNINFKYNTFQITGLVCSSIVALFVEIPKIMLPFLLGGLSILCAGFQISHSFFTDTQFDTYCLVAFIVIGVVAGLAQTIAFNIGSTMEDNMGGYMSAGIGISGVFIFVINLLLDQFVSPEKHYGVNKAKLLALYIICELCLILAIVFCVCNLDLTNKNNKKDEENKENNATLSYMELFKDSYKAILTMFLVNWLTLQLFPGVGHKKWQESHNISDYNVTIIVGMFQVFDFLSRYPPNLTHIKIFKNFTFSLNKLLVANSLRLLFIPWFILNACVDHPFFKNIVQQCVCMAMLAFTNGWFNTVPFLVFVKELKKAKKKKEIEIISTFLVIAMFVGLFCGIWTTYIYNLFNIVLPKPDLPPIDVTQ 422 T 1.5000000000000002E-22 Nucleoside_tran unppercent F Eukaryota T 7wqa 2 B B Z-VAD(OMe)-FMK XVAXX 5 T 1100 RE_HindIII pdbhh F F 7wrk 1 A A Q5SH57_THET8 hypothetical protein TTHA1873 MGNYLEDCATVDVQARPTAYALAISSLGEFNSLTGGTSTDPVAEGNDYYYRFEIRAWEGSSGPQTNVTLNVTRTLGNSTFAGSGTKGVDFEVELDPDGPFGPASYAPVLSADVQVLAWGPTGVQLRYLPSLAPGATLRFSLRANAVNGTNTTVQADATSTEAPGPYTVFETTTIIP 176 T 0.021 CRISPR_assoc pdb F Bacteria T 7wrw 1 A,B,C,D,E,F B,C,F,D,E,A Q9RW32_DEIRA HerA MTGNDVQGAEKADAIGMVLGTEDVTPTVFWFAVSHGASVGLDDLVVVETRKPDGTPVRFYGLVDNVRKRHEGVTFESDVEDVVAGLLPASVSYAARVLVTRVDPENFIPPQPGDHVRHAAGRELAMALSADKMEEAAFPGGLLADGQPLPLNFRFINGESGGHINISGISGVATKTSYALFLLHSIFRSGVMDRTAQGSGGRQSGTAGGRALIFNVKGEDLLFLDKPNARMVEKEDKVVRAKGLSADRYALLGLPAEPFRDVQLLAPPRAGAAGTAIVPQTDQRSEGVTPFVFTIREFCARRMLPYVFSDASASLNLGFVIGNIEEKLFRLAAAQTGKGTGLIVHDWQFEDSETPPENLDFSELGGVNLQTFEQLISYLEYKLLEEREGEGDPKWVLKQSPGTLRAFTRRLRGVQKYLSPLIRGDLTPEQAEGYRPDPLRRGIQLTVVDIHALSAHAQMFVVGVLLREVFEYKERVGRQDTVFVVLDELNKYAPREGDSPIKDVLLDIAERGRSLGIILIGAQQTASEVERRIVSNAAIRVVGRLDLAEAERPEYRFLPQSFRGRAGILQPGTMLVSQPDVPNPVLVNYPFPAWATRRDEVDDLGGKAAAEVGAGLLR 618 F F Bacteria T 7wrx 1 A,B,C,D,E,F,G,H,I,J,K,L H,A,B,C,D,E,F,G,I,J,K,L Q9RW32_DEIRA HerA MTGNDVQGAEKADAIGMVLGTEDVTPTVFWFAVSHGASVGLDDLVVVETRKPDGTPVRFYGLVDNVRKRHEGVTFESDVEDVVAGLLPASVSYAARVLVTRVDPENFIPPQPGDHVRHAAGRELAMALSADKMEEAAFPGGLLADGQPLPLNFRFINGESGGHINISGISGVATKTSYALFLLHSIFRSGVMDRTAQGSGGRQSGTAGGRALIFNVKGEDLLFLDKPNARMVEKEDKVVRAKGLSADRYALLGLPAEPFRDVQLLAPPRAGAAGTAIVPQTDQRSEGVTPFVFTIREFCARRMLPYVFSDASASLNLGFVIGNIEEKLFRLAAAQTGKGTGLIVHDWQFEDSETPPENLDFSELGGVNLQTFEQLISYLEYKLLEEREGEGDPKWVLKQSPGTLRAFTRRLRGVQKYLSPLIRGDLTPEQAEGYRPDPLRRGIQLTVVDIHALSAHAQMFVVGVLLREVFEYKERVGRQDTVFVVLDELNKYAPREGDSPIKDVLLDIAERGRSLGIILIGAQQTASEVERRIVSNAAIRVVGRLDLAEAERPEYRFLPQSFRGRAGILQPGTMLVSQPDVPNPVLVNYPFPAWATRRDEVDDLGGKAAAEVGAGLLR 618 F F Bacteria T 7wt3 3 C,F C,F 4-mer lipopeptide XGANF 5 T 140 Pentapeptide pdbhh F F 7wt5 3 C,F C,F RDRP_I97A1 8-mer model peptide RAGFVANF 8 T 6.1 Thiol_cytolys_C pdbhh T Viruses T 7wu2 1 A A GNAS2_HUMAN Guanine nucleotide-binding protein G(s) subunit alpha isoforms short MGHHHHHHENLYFQGIEKQLQKDKQVYRATHRLLLLGADNSGKSTIVKQMRILHGGSGGSGGTSGIFETKFQVDKVNFHMFDVGGQRDERRKWIQCFNDVTAIIFVVDSSDYNRLQEALNDFKSIWNNRWLRTISVILFLNKQDLLAEKVLAGKSKIEDYFPEFARYTTPEDATPEPGEDPRVTRAKYFIRDEFLRISTASGDGRHYCYPHFTCAVDTENARRIFNDCRDIIQRMHLRQYELL 243 T 4.6E-09 G-alpha pdb F Eukaryota T 7wu3 1 A A GNAS2_HUMAN Guanine nucleotide-binding protein G(s) subunit alpha isoforms short MGHHHHHHENLYFQGIEKQLQKDKQVYRATHRLLLLGADNSGKSTIVKQMRILHGGSGGSGGTSGIFETKFQVDKVNFHMFDVGGQRDERRKWIQCFNDVTAIIFVVDSSDYNRLQEALNDFKSIWNNRWLRTISVILFLNKQDLLAEKVLAGKSKIEDYFPEFARYTTPEDATPEPGEDPRVTRAKYFIRDEFLRISTASGDGRHYCYPHFTCAVDTENARRIFNDCRDIIQRMHLRQYELL 243 T 4.6E-09 G-alpha pdb F Eukaryota T 7wug 1 A 5 VID28_YEAST GLUCOSE-INDUCED DEGRADATION PROTEIN 5 MTVAYSLENLKKISNSLVGDQLAKVDYFLAPKCQIFQCLLSIEQSDGVELKNAKLDLLYTLLHLEPQQRDIVGTYYFDIVSAIYKSMSLASSFTKNNSSTNYKYIKLLNLCAGVYPNCGFPDLQYLQNGFIQLVNHKFLRSKCKIDEVVTIIELLKLFLLVDEKNCSDFNKSKFMEEEREVTETSHYQDFKMAESLEHIIVKISSKYLDQISLKYIVRLKVSRPASPSSVKNDPFDNKGVDCTRAIPKKINISNMYDSSLLSLALLLYLRYHYMIPGDRKLRNDATFKMFVLGLLKSNDVNIRCVALKFLLQPYFTEDKKWEDTRTLEKILPYLVKSFNYDPLPWWFDPFDMLDSLIVLYNEITPMNNPVLTTLAHTNVIFCILSRFAQCLSLPQHNEATLKTTTKFIKICASFAASDEKYRLLLLNDTLLLNHLEYGLESHITLIQDFISLKDEIKETTTESHSMCLPPIYDHDFVAAWLLLLKSFSRSVSALRTTLKRNKIAQLLLQILSKTYTLTKECYFAGQDFMKPEIMIMGITLGSICNFVVEFSNLQSFMLRNGIIDIIEKMLTDPLFNSKKAWDDNEDERRIALQGIPVHEVKANSLWVLRHLMYNCQNEEKFQLLAKIPMNLILDFINDPCWAVQAQCFQLLRNLTCNSRKIVNILLEKFKDVEYKIDPQTGNKISIGSTYLFEFLAKKMRLLNPLDTQQKKAMEGILYIIVNLAAVNENKKQLVIEQDEILNIMSEILVETTTDSSSYGNDSNLKLACLWVLNNLLWNSSVSHYTQYAIENGLEPGHSPSDSENPQSTVTIGYNESVAGGYSRGKYYDEPDGDDSSSNANDDEDDDNDEGDDEGDEFVRTPAAKGSTSNVQVTRATVERCRKLVEVGLYDLVRKNITDESLSVREKARTLLYHMDLLLKVK 921 T 0.0017 HEAT_2 unppercent F Eukaryota T 7wug 5 E Y YD176_YEAST UNCHARACTERIZED PROTEIN YDL176W,Y55_G0042020.MRNA.1.CDS.1 MATGRIQFAVSTPCNTKGKPSGYRLFEFKNDRLALVPSERGCTKVDVNANIQAFCYLRPNGRDTSISPDATHILDSCDYMVLAKSNGFIEIISNYQYKIKNGLRLAPSYILRCTPEDFESNFFSDYMIAGLEYSQGLLYCCMCSGRIYVFVMNLPTDYIQYKNMYNPMFPDCFFKVHHDNNTTHSSEEEKLFEGSTRYTGRSCSKHICYFLLPIEPSHLRSSPVVSSFCNMYQGLPIYRPSMYLHIERGISTFHINPLDRFCFMTVSPRSPLFIRKIILPLTYVTFLSTFISLKNSIQGDTCGEILSWDNVAQQNGFGSLFSWISNKFTFDTDIINSTIWDDIVKYSGTGMLDSGIVWKQRQGHAKDDIYELFHTQDMLGSSRRNSSFSTASSEPRPLSRRRRESFQALTRDAFRERMDVPCSTKWELDSFIRGLRRNTFMVDFEIVEKISHRNGNDGVNEDDNTTDESDETMTSFLTDNYKKMDIVCIDHFVTLSAFRPRYYDEPIIKIDSLSNKNGSENGTNEEEWAESQMKVDGQVIDDETAQFKQALGNLCSFKKLFMLDDSLCFILDTHGVLLINRFEIKNTKNLLRNSKDTIRIIPHDFGLINDTIVIINDIDVGTDNVCALTFHLVVTSMAGEITVLKGEFFKNCRLGRIKLCDSLKLNRKDRFVDKLALIDYDGLNAQKRRLDYDEKDLYTFIVKKVKRD 708 T 14 BBS1 pdbhh F Eukaryota T 7wul 1 A X 3-mer peptide RDG 3 T 170 GRDA pdbhh F F 7wum 1 A X 3-mer peptide RDG 3 T 170 GRDA pdbhh F F 7wun 1 A X 3-mer peptide RSG 3 T 140 TbpB_A pdbhh F F 7wuw 1 A,B A,B B4XYC0_STREG AZI28 MASWSHPQFEKGGTHVAETSAPTRSEPDTRVLTLPGTASAPEFRLIDIDGLLNNRATTDVRDLGSGRLNAWGNSFPAAELPAPGSLITVAGIPFTWANAHARGDNIRCEGQVVDIPPGQYDWIYLLAASERRSEDTIWAHYDDGHADPLRVGISDFLDGTPAFGELSAFRTSRMHYPHHVQEGLPTTMWLTRVGMPRHGVARSLRLPRSVAMHVFALTLRTAAAVRLAEGATT 233 T 0.11 Rib unppercent F Bacteria T 7wux 1 A,B A,B B4XYC0_STREG AZI28 MASWSHPQFEKGGTHVAETSAPTRSEPDTRVLTLPGTASAPEFRLIDIDGLLNNRATTDVRDLGSGRLNAWGNSFPAAELPAPGSLITVAGIPFTWANAHARGDNIRCEGQVVDIPPGQYDWIYLLAASERRSEDTIWAHYDDGHADPLRVGISDFLDGTPAFGELSAFRTSRMHYPHHVQEGLPTTMWLTRVGMPRHGVARSLRLPRSVAMHVFALTLRTAAAVRLAEGATT 233 T 0.11 Rib unppercent F Bacteria T 7wvu 5 E L FME-LEU-PHE MLF 3 T 140 DUF3719 pdbhh F F 7wvv 1 A L FME-LEU-PHE-ILE-ILE MLFII 5 T 66 Herpes_U15 pdbhh F F 7wwn 1 A A Q5SH57_THET8 hypothetical protein TTHA1873 MGNYLEDCATVDVQARPTAYALAISSLGEFNSLTGGTSTDPVAEGNDYYYRFEIRAWEGSSGPQTNVTLNVTRTLGNSTFAGSGTKGVDFEVELDPDGPFGPASYAPVLSADVQVLAWGPTGVQLRYLPSLAPGATLRFSLRANAVNGTNTTVQADATSTEAPGPYTVFETTTIIP 176 T 0.021 CRISPR_assoc pdb F Bacteria T 7wwo 1 A,B A,B Q5SH57_THET8 hypothetical protein TTHA1873 MGNYLEDCATVDVQARPTAYALAISSLGEFNSLTGGTSTDPVAEGNDYYYRFEIRAWEGSSGPQTNVTLNVTRTLGNSTFAGSGTKGVDFEVELDPDGPFGPASYAPVLSADVQVLAWGPTGVQLRYLPSLAPGATLRFSLRANAVNGTNTTVQADATSTEAPGPYTVFETTTIIP 176 T 0.021 CRISPR_assoc pdb F Bacteria T 7wwq 2 B B UFD1_HUMAN UBIQUITIN FUSION DEGRADATION PROTEIN 1,UB FUSION PROTEIN 1 IPNYEFKLGKITFIRN 16 T 8.1 Ribonuc_2-5A pdbhh F Eukaryota T 7wwu 1 A A M1R2X3_9CAUD Csy1 MGSSHHHHHHSSGRENLYFQGMIKEMIEDFISKGGLIFTHSGRYTNTNNSCFIFNKNDIGVDTKVDMYTPKSAGIKNEEGENLWQVLNKANMFYRIYSGELGEELQYLLKSCCTAKEDVTTLPQIYFKNGEGYDILVPIGNAHNLISGTEYLWEHKYYNTFTQKLGGSNPQNCTHACNKMRGGFKQFNCTPPQVEDNYNA 200 T 0.0033 Cas_Csy1 unppercent T Viruses T 7wwu 2 B B M1QWL5_9CAUD Csy2 MGSSHHHHHHSSGRENLYFQGMRKFIIVKNVKVDGINAKSSDITVGMPPATTFCGLGETMSIKTGIVVKAVSYGSVKFEVRGSRFNTSVTKFAWQDRGNGGKANNNSPIQPKPLADGVFTLCFEVEWEDCAEVLVDKVTNFINTARIAGGTIASFNKPFVKVAKDAEELASVKNAMMPCYVVVDCGVEVNIFEDAVNRKLQPMVNGYKKLEKIVDNKHMRDKFTPAYLATPTYTMIGYKMVSNVDNFDQALWQYGENTKVKTIGGIYND 269 T 6.1999999999999986E-24 Cas_Csy2 unppssm T Viruses T 7wwv 1 A A M1R2X3_9CAUD Csy1 MGSSHHHHHHSSGRENLYFQGMIKEMIEDFISKGGLIFTHSGRYTNTNNSCFIFNKNDIGVDTKVDMYTPKSAGIKNEEGENLWQVLNKANMFYRIYSGELGEELQYLLKSCCTAKEDVTTLPQIYFKNGEGYDILVPIGNAHNLISGTEYLWEHKYYNTFTQKLGGSNPQNCTHACNKMRGGFKQFNCTPPQVEDNYNA 200 T 0.0033 Cas_Csy1 unppercent T Viruses T 7wwv 2 B B M1QWL5_9CAUD Csy2 MGSSHHHHHHSSGRENLYFQGMRKFIIVKNVKVDGINAKSSDITVGMPPATTFCGLGETMSIKTGIVVKAVSYGSVKFEVRGSRFNTSVTKFAWQDRGNGGKANNNSPIQPKPLADGVFTLCFEVEWEDCAEVLVDKVTNFINTARIAGGTIASFNKPFVKVAKDAEELASVKNAMMPCYVVVDCGVEVNIFEDAVNRKLQPMVNGYKKLEKIVDNKHMRDKFTPAYLATPTYTMIGYKMVSNVDNFDQALWQYGENTKVKTIGGIYND 269 T 6.1999999999999986E-24 Cas_Csy2 unppssm T Viruses T 7wxx 2 B B Peptide Inhibitor XXGLVXX 7 T 360 I_LWEQ pdbhh F F 7wyg 1 A,B A,B CYPC_BACSU Cytochrome P450 152A1 HMDEQIPHDKSLDNSLTLLKEGYLFIKNRTERYNSDLFQARLLGKNFICMTGAEAAKVFYDTDRFQRQNALPKRVQKSIFGVNAIHGMDGSAHIHRKMLFLSLMTPPHQKRLAELMTEEWKAAVTRWEKADEVVLFEEAKEILCRVACYWAGVPLKETEVKERADDFIDMVDAFGAVGPRHWKGRRARPRAEEWIEVMIEDARAGLLKTTSGTALHEMAFHTQEDGSQLDSRMAAIELINVLRPIVAISYFLVFSALALHEHPKYKEWLRSGNSREREMFVQEVRRYYPFIPFLGALVKKDFVWNNCEFKKGTSVLLDLYGTNHDPRLWDHPDEFRPERFAEREENLFDMIPQGGGHAEKGHRCPGEGITIEVMKASLDFLVHQIEYDVPEQSLHYSLARMPSLPESGFVMSGIRRKS 418 T 0.34 p450 pdb F Bacteria T 7wzz 3 C C LYS-ALA-GLY-GLN-VAL-VAL-THR-ILE-TRP KAGQVVTIW 9 T 0.18 MepB pdbhh F T 7x00 3 C C VAL-SER-PHE-ILE-GLU-PHE-VAL-GLY-TRP VSFIEFVGW 9 T 0.037 EBV-NA3 pdbhh F T 7x0y 2 E,F E,F CIB1 fragment XXXXXXXX 8 F F F 7x14 2 B B MIGA2_MOUSE MIGA2 phospho FFAT motif SEDSFFSATE 10 T 0.91 Miga pdbhh F Eukaryota T 7x1b 3 C C LYS-ALA-GLY-GLN-VAL-VAL-THR-ILE KAGQVVTI 8 T 2.8 DUF1989 pdbhh F T 7x1c 3 C C VAL-SER-PHE-ILE-GLU-PHE-VAL-ILE VSFIEFVI 8 T 10 Mob_synth_C pdbhh F F 7x1t 3 C B mini-G alpha q protein MGSTVSAEDKAAAERSKMIDKNLREDGEKARRTLRLLLLGADNSGKSTIVKQMRILHGGSGGSGGTSGIFETKFQVDKVNFHMFDVGGERDERRKWIQCFNDVTAIIFVVDSSDYNRLQEALNDFKSIWNNRWLRTISVILFLNKQDLLAEKVLAGKSKIEDYFPEFARYTTPEDATPEPGEDPRVTRAKYFIRKEFVDISTASGDGRHICYPHFTCAVDTENARRIFNDCKDIILQMNLREYNLV 246 T 2.6E-10 G-alpha pdb F T 7x1t 6 F E Taltirelin XDHPX 5 T 150 NADH-G_4Fe-4S_3 pdbhh F F 7x1u 3 C B mini-G alpha q prtoein MGSTVSAEDKAAAERSKMIDKNLREDGEKARRTLRLLLLGADNSGKSTIVKQMRILHGGSGGSGGTSGIFETKFQVDKVNFHMFDVGGERDERRKWIQCFNDVTAIIFVVDSSDYNRLQEALNDFKSIWNNRWLRTISVILFLNKQDLLAEKVLAGKSKIEDYFPEFARYTTPEDATPEPGEDPRVTRAKYFIRKEFVDISTASGDGRHICYPHFTCAVDTENARRIFNDCKDIILQMNLREYNLV 246 T 2.6E-10 G-alpha pdb F T 7x1u 6 F E Endogenous Peptide Agonist TRH QHPX 4 T 120 TRH pdbhh F F 7x2e 2 B B CDHR2_MOUSE Cadherin-related family member 2 QQKKNLSFTNPGLDTTDL 18 T 5.2 ARF7EP_C pdbhh F Eukaryota T 7x31 1 A A A0A2D0TCG3_NEIME Anti-CRISPR protein (AcrIIC1) MANKTYKIGKNAGYDGCGLCLAAISENEAIKVKYLRDICPDYDGDDKAEDWLRWGTDSRVKAAALEMEQYAYTSVGMASCWEFVEL 86 T 6.9 WIYLD pdbhh F Bacteria T 7x3k 2 B B SSZ1_YEAST DNAK-RELATED PROTEIN SSZ1,HEAT SHOCK PROTEIN 70 HOMOLOG SSZ1,PLEIOTROPIC DRUG RESISTANCE PROTEIN 13 MSSPVIGITFGNTSSSIAYINPKNDVDVIANPDGERAIPSALSYVGEDEYHGGQALQQLIRNPKNTIINFRDFIGLPFDKCDVSKCANGAPAVEVDGKVGFVISRGEGKEEKLTVDEVVSRHLNRLKLAAEDYIGSAVKEAVLTVPTNFSEEQKTALKASAAKIGLQIVQFINEPSAALLAHAEQFPFEKDVNVVVADFGGIRSDAAVIAVRNGIFTILATAHDLSLGGDNLDTELVEYFASEFQKKYQANPRKNARSLAKLKANSSITKKTLSNATSATISIDSLADGFDYHASINRMRYELVANKVFAQFSSFVDSVIAKAELDPLDIDAVLLTGGVSFTPKLTTNLEYTLPESVEILGPQNKNASNNPNELAASGAALQARLISDYDADELAEALQPVIVNTPHLKKPIGLIGAKGEFHPVLLAETSFPVQKKLTLKQAKGDFLIGVYEGDHHIEEKTLEPIPKEENAEEDDESEWSDDEPEVVREKLYTLGTKLMELGIKNANGVEIIFNINKDGALRVTARDLKTGNAVKGEL 538 T 1.1E-15 HSP70 pdbpssm F Eukaryota T 7x45 1 A,B A,B C0LEE1_CTEID Interferon gamma MDSWLNMMLLCGLLLIASLQTTNAFRFRRSKSEMTHLETNIHSLQEHYKTRGTEWVSKSVFVPHLNQLNSKASCTCQALLLERMLNIYEELFQDMKSEHKEGRKDLDHLMDEVKKLRGNYKEEHKVWKELQEMNSVKVKNGTIRGGALNDFLMVFDRASTEKHKKVQ 167 T 0.00078 IFN-gamma pdbpercent F Eukaryota T 7x4b 1 A,B A,B A0A2D0TCG3_NEIME Anti-CRISPR protein (AcrIIC1) MANKTYKIGKNAGYDGCGLCLAAISENEAIKVKYLRDICPDYDGDDKAEDWLRWGTDSRVKAAALEMEQYAYTSVGMASCWEFVEL 86 T 6.9 WIYLD pdbhh F Bacteria T 7x5c 1 A A TAP75_TETTS Telomerase-associated protein p75OB1 MEIEEDLNLKILEDVKKLYLQSFDYIKNGISSGGSGGSIDLSRITFLYKFISVNPTLLLINEKTQAKRRIFQGEYLYGKKKIQFNIIAKNLEIERELIQFFKKPYQCYIMHNVQVFQMLNKNKNNNVVEFMDSEDLQSSVDSQLYYLIDESSHVLEDDSMDFISTLTRLSDS 172 T 0.1 Clusterin pdb F Eukaryota T 7x5c 2 B B TAP50_TETTS Telomerase associated protein p50PBM QDDFGDGCLLQIVN 14 T 2.2 FAM47 pdbhh F Eukaryota T 7x5q 2 G H ASP-GLY-ALA-ASN-SER-ASP DGANSD 6 T 8.1 DUF5327 pdbhh F F 7x5v 2 E E R1DBK9_EMIHU ion channel MIAAIHNARRKKREAAA 17 T 6 RNF111_N pdbhh F Eukaryota T 7x6r 3 C C Actinomycin D TXPXXXTXPXX 11 T 61 DUF5572 pdbhh F F 7x6r 4 D D DSN-ALA-N2C-MVA-DSN-ALA-NCY-MVA XAXXXAXX 8 T 190 RSF pdbhh F F 7x6z 2 B B R1AB_SARS2 peptide PHTVLQ 6 T 3.8 ATXN-1_C pdbhh T Viruses T 7x7n 2 D,E,F,G,H,I D,E,H,I,F,G Synthetic peptide SIH-5 DKEWILQKIYEIMRLLDELXDXEASMRVSDLIYEFMKK 38 T 0.11 ER pdbhh F T 7x88 2 B B Histone H3K27ac(24-27) peptide AARX 4 T 260 G2BR pdbhh F F 7x8b 2 B,D B,D H3K27ac(24-27) peptide AARX 4 T 260 G2BR pdbhh F F 7x8f 2 B,D B,D H3K27ac(24-27) peptide AARX 4 T 260 G2BR pdbhh F F 7x8g 2 B,D B,D H3K27ac(24-27) peptide AARX 4 T 260 G2BR pdbhh F F 7x8u 1 A A Q941Q8_SOLLC SW-5B NLR IMMUNE RECEPTOR SMAENEIEEMLEHLRRIKSGGDLDWLDILRIEELEMVLRVFRTFTKYNDVLLPDSLVELTKRAKLIGEILHRLFGRIPHKCKTNLNLERLESHLLEFFQGNTASLSHNYELNNFDLSKYMDCLENFLNDVLMMFLQKDRFFHSREQLAKHRSIKELKIVQKKIRFLKYIYATEINGYVDYEKQECLENRIQFMTNTVGQYCLAVLDYVTEGKLNEENDNFSKPPYLLSLIVLVELEMKKIFHGEVK 246 T 0.07 Dna2 pdb F Eukaryota T 7x8x 3 AB,BB,CA,CB,DA,DB,EA,FA,GA,HA,IA,JA,KA,LA,MA,NA,OA,PA,QA,RA,SA,TA,UA,VA,WA,XA,YA,ZA 1,2,c,3,e,4,f,g,h,i,j,k,l,m,n,o,p,q,r,s,t,u,v,w,x,y,z,0 4-[[3,5-bis(fluoranyl)phenyl]methyl]-N-[(4-bromophenyl)methyl]piperazine-1-carboxamide XLL 3 T 1400 EF-hand_1 pdbhh F F 7x97 3 C G DSN-ALA-N2C-MVA-DSN-ALA-NCY-MVA XAXXXAXX 8 T 190 RSF pdbhh F F 7x97 4 D C Actinomycin D TXPXXXTXPXX 11 T 61 DUF5572 pdbhh F F 7x9d 2 B,C B,C DNM3L_HUMAN DNA (cytosine-5)-methyltransferase 3-like GHMFETVPVWRRQPVRVLSLFEDIKKELTSLGFLESGSDPGQLKHVVDVTDTVRKDVEEWGPFDLVYGATPPLGHTCDRPPSWYLFQFHRLLQYARPKPGSPRPFFWMFVDNLVLNKEDLDVASRFLEMEPVTIPDVHGGSLQNAVRVWSNIPAIRSRHWALVSEEELSLLAQNKQSSKLAAKWPTKLVKNCFLPLREYFKYFS 204 F F Eukaryota T 7x9f 3 C,H G,C DSN-ALA-N2C-MVA-DSN-ALA-NCY-MVA XAXXXAXX 8 T 190 RSF pdbhh F F 7x9f 4 D,G H,A THR-DVA-PRO-SAR-MVA-PXF-THR-DVA-PRO-SAR-MVA TXPXXXTXPXX 11 T 61 DUF5572 pdbhh F F 7xam 1 A 3 A0QTP4_MYCS2 50S ribosomal protein bL37 MAKRGRKKRDRKHSKANHGKRPNA 24 T 0.18 DUF6254 pdb F Bacteria T 7xau 6 F F Octreotide XCFXKTCX 8 T 0.0019 Urotensin_II pdbhh F F 7xav 6 F F Lanreotide XCYXKVCTX 9 T 0.021 Urotensin_II pdbhh F T 7xbj 1 A,B A,B Q306L3_XENNE 40kDa insecticidal toxin MVIKPVTTPSVIQLTPDDRVTPDDKGEYQPVEKQIAGDIIRVLEFKQTNESHTGLYGIAYRAKKVIIAYALAVSGIHNVSQLPEDYYKNKDNTGRIYQEYMSNLLSALLGENGDQISKDMANDFTQNELEFGGQRLKNTWDIPDLENKLLEDYSDEDKLLALYFFASQELPMEANQQSNAANFFKVIDFLLILSAVTSLGKRIFSKNFYNGLETKSLENYIERKKLSKPFFRPPQKLPDGRTGYLAGPTKAPKLPTTSSTATTSTAASSNWRVSLQKLRDNPSRNTFMKMDDAAKRKYSSFIKEVQKGNDPRAAAASIGTKSGSNFEKLQGRDLYSIRLSQEHRVTFSINNTDQIMEIQSVGTHYQNI 368 T 0.00062 HigB-like_toxin pdbhh F Bacteria T 7xbk 2 J L Unknown peptide XXXXXXXXXXXXXXXXX 17 F F F 7xbm 1 A,B A,B PIKC_STRVZ CYTOCHROME P450 MONOOXYGENASE PICK,NARBOMYCIN C-12 HYDROXYLASE,PIKROMYCIN SYNTHASE CYP107L1 MGSSHHHHHHSSGLVPRGSHMRRTQQGTTASPPVLDLGALGQDFAADPYPTYARLRAEGPAHRVRTPEGDEVWLVVGYDRARAVLADPRFSKDWRNSTTPLTEAEAALNHNMLESDPPRHTRLRKLVAREFTMRRVELLRPRVQEIVDGLVDAMLAAPDGRADLMESLAWPLPITVISELLGVPEPDRAAFRVWTDAFVFPDDPAQAQTAMAEMSGYLSRLIDSKRGQDGEDLLSALVRTSDEDGSRLTSEELLGMAXILLVAGHETTVNLIANGMYALLSHPDQLAALRADMTLLDGAVEEMLRYEGPVESATYRFPVEPVDLDGTVIPAGDTVLVVLADAHRTPERFPDPHRFDIRRDTAGHLAFGHGIHFCIGAPLARLEARIAVRALLERCPDLALDVSPGELVWYPNPMIRGLKALPIRWRRGREAGRRTGLEHHHHHH 444 T 4.6999999999999995E-33 p450 unppssm F Bacteria T 7xbn 1 A,B A,B PIKC_STRVZ CYTOCHROME P450 MONOOXYGENASE PICK,NARBOMYCIN C-12 HYDROXYLASE,PIKROMYCIN SYNTHASE CYP107L1 MGSSHHHHHHSSGLVPRGSHMRRTQQGTTASPPVLDLGALGQDFAADPYPTYARLRAEGPAHRVRTPEGDEVWLVVGYDRARAVLADPRFSKDWRNSTTPLTEAEAALNHNMLESDPPRHTRLRKLVAREFTMRRVELLRPRVQEIVDGLVDAMLAAPDGRADLMESLAWPLPITVISELLGVPEPDRAAFRVWTDAFVFPDDPAQAQTAMAEMSGYLSRLIDSKRGQDGEDLLSALVRTSDEDGSRLTSEELLGMAXILLVAGHETTVNLIANGMYALLSHPDQLAALRADMTLLDGAVEEMLRYEGPVESATYRFPVEPVDLDGTVIPAGDTVLVVLADAHRTPERFPDPHRFDIRRDTAGHLAFGHGIHFCIGAPLARLEARIAVRALLERCPDLALDVSPGELVWYPNPMIRGLKALPIRWRRGREAGRRTGLEHHHHHH 444 T 4.6999999999999995E-33 p450 unppssm F Bacteria T 7xbo 1 A,B A,B PIKC_STRVZ CYTOCHROME P450 MONOOXYGENASE PICK,NARBOMYCIN C-12 HYDROXYLASE,PIKROMYCIN SYNTHASE CYP107L1 MGSSHHHHHHSSGLVPRGSHMRRTQQGTTASPPVLDLGALGQDFAADPYPTYARLRAEGPAHRVRTPEGDEVWLVVGYDRARAVLADPRFSKDWRNSTTPLTEAEAALNHNMLESDPPRHTRLRKLVAREFTMRRVELLRPRVQEIVDGLVDAMLAAPDGRADLMESLAWPLPITVISELLGVPEPDRAAFRVWTDAFVFPDDPAQAQTAMAEMSGYLSRLIDSKRGQDGEDLLSALVRTSDEDGSRLTSEELLGMAXILLVAGHETTVNLIANGMYALLSHPDQLAALRADMTLLDGAVEEMLRYEGPVESATYRFPVEPVDLDGTVIPAGDTVLVVLADAHRTPERFPDPHRFDIRRDTAGHLAFGHGIHFCIGAPLARLEARIAVRALLERCPDLALDVSPGELVWYPNPMIRGLKALPIRWRRGREAGRRTGLEHHHHHH 444 T 4.6999999999999995E-33 p450 unppssm F Bacteria T 7xc2 2 B,D,F,H,J D,B,F,H,J A0A5B0N367_PUCGR Avirulence factor VNFPFPKKMITESNSKDIREYLASTFPFEQQSTILDSVKSIAKVQIDDRKAFDLQLKFRQENLAELKDQIILSLGANNGNQNWQKLLDYTNKLDELSNTKISPEEFIEEIQKVLYKVKLESTSTSKLYSQFNLSIQDFALQIIHSKYKSNQISQNDLLKLITEDEMLKILAKTKVLTYKMKYFDSASKMGINKYISTEMMDLDWQFSHYKTFNDALKKNKASDSSYLGWLTHGYSIKYGLSPNNERSMFFQDGRKYAELYAFSKSPHRKIIPGEHLKDLLAKINKSKGIFLDQNALLDKRIYAFHELNTLETHFPGITSSFTDDLKSNYRKKMESVSLTCQVLQEIGNIHRFIESKVPYHSSTEYGLFSIPKIFSIPIDYKHGEKENLVSYVDFLYSTAHERILQDNSINQLCLDPLQESLNRIKSNIPVFFNL 434 T 0.31 Type2_restr_D3 pdbpssm F Eukaryota T 7xcb 1 A A IL9_MOUSE IL-9,CYTOKINE P40,T-CELL GROWTH FACTOR P40 CSTTWGIRDTNYLIENLKDDPPSKCSCSGNVTSCLCLSVPTDDCTTPCYREGLLQLTNATQKSRLLPVFHRVKRIVEVLKNITCPSFSCEKPCNQTMAGNTLSFLKSLLGTFQKTEMQRQ 120 T 0.0041 Dynamin_M unppssm F Eukaryota T 7xdi 2 D D A0A5Q0V0G9_9VIRU C131 MGTKIIINVIFFDIILALLMMSFASIQPPSIANPPTVAQAQAQANITWNLTVGSINWEWLWPVFYFVDWLIWIVTTIFAVVAFIFNVFTTSLSLLASVPIVGPFLLMFAVIINFVLIWEVVKLIRGYDNPG 131 T 8.1E-12 C166 pdbpercent T Viruses T 7xdi 3 E E A0A5Q0V0F6_9VIRU B210 MKWPLLLFTVLLIIGFTLIARAGTISLLSTPPVNPPAYSYFYIEFQFLPTNNTPQPYAIFVGPNPNNLTEVAEGYTLSNGTGYARVPVINAQTEYVDIVWVNQNYTMFEIFPQIQNATTTVTLSANNNQGFSFSLPTWVSWVIGAVLMLIFMGVGWKFMGPAGLAIFGIFGLFIAMFFGLLPSYLMYVILFIVAIVGARILTKQLGGGEE 210 T 0.01 DUF5489 pdb T Viruses T 7xdi 4 F F A0A5Q0V0A2_9VIRU VP4 MKRVFLLYIIGILLTLFLPLIQTQSAVSLPPLYVEDAVNAEIQQLWSKSPTGVYAFHEAPSVNNSFWPDDNAKFLESIAPWWQSYSSYVNSTLQFLQQSDVNGLFIKRFEYPLNPLQSITIGNLSGYTNGFYDIVGNPLLNSMRIATYYNPTLAVTYLFGNVVQYPNGILVNIEQGLENPITDGGFGGTGGQNPPWESLNSSSLVNDSIVSIVNNAKTYLNLTGPTFFGTPSEELQYNFPIVNVLPHYLAFQNVNGILGQYNYQGKFIPFNVTLVLQSSSINRIYLEFIWENSTSGTYVLTDIPVYFTANGQWQQVVVTVPASAWPKYWNLGALSAVPLLIGIGLDLPGSSPSQTGPTGVYVGDIATNYPTTFGPQFNVTNKGSYVVFNESWKSDSLGATFWIAYVLGQGNAIEVLASAPVNQSWIYVGYNGLATIGTGYTILETPSGILKNYQNSGNISWTYLGPNFGKWMLLSTNYAPNWIGDFQMLFIFPMAGTSNPYMDTLNNAVYMGDPTEVRNTLYFGNYTTLPGYFQWVQIAYQNDGNTSGVFGFFLIPSVDYLVNPSVIVNDMFPSSLTAYSPSSIPNYWWEAVWGENYYEGEIIYALALLGKYGNSQALQMAQQAWLSYYNQLKAYNGATYTSSLARFIMATILLYNITGNTQYSNAYTQLANWLLQYQNQSKYAYVYIPMWYHKDVDVPSVNGFATYGYIINRTAQMDVGTVISGTSIGLNFFEDIPLNTSYGIYLLTNGTGKLPFTYQNVLNVSGTFITYLYMNGGGTATTANITITVQIAYNGNVLQTIGTAAVDNVPIQPGGISGSPPFYPVKIVVPVLTTVNAPPGSTLIIGWNIKAPQTVYVLIDSTNGPSNVTIPLSWPNPFYGLFTIPKIYNPNPGVHNYPQPYFLDISAMAGQAMMALYAVTKNITYLLDAQLVMNAIHYGPVPMPTYGILGVPNPPVEPRLWVYANYSTVDADYYTYKSELVSEFGDAIGNNTLASLAISRVWQRTSYTYPTSYIYYVARYGSGLQMNSETQPWGDVATQFYVNTWSPSNLDLFWASLPNNNYITNQTWNGTALFIHLYAYQQSQVQLIFLTTTVNFNVLVNGNYTNYEANHQIMQIAPTLEPGPNTIIIIPNPKNQVSQNTNISTTTTTSPLSNAISGLGITLTQNELMLLGFVIYFVIIMVTYGVSRNKTITVLSSIVAVAIVYALALWPTYMAFILGAVGFFMLFYSISRREEE 1236 T 0.0037 VKOR pdbpercent T Viruses T 7xdj 3 G,J,O,P L,P,D,H DSN-ALA-N2C-MVA-DSN-ALA-NCY-MVA XAXXXAXX 8 T 190 RSF pdbhh F F 7xdj 4 K,L,M,N C,G,K,O THR-DVA-PRO-SAR-MVA-PXZ-THR-DVA-PRO-SAR-MVA TXPXXXTXPXX 11 T 61 DUF5572 pdbhh F F 7xds 1 A,B A,B A0A5B0N367_PUCGR AvrSr35 HHHHHHSSGVDLGTENLYFQSNAMRNFAADRVHGVESVISGSKSSSNPMALSKSMDKPDTSDLVDSNVQAKNDGSRYEEDFTAKYSEQVDHVSKILKEIEEQEPGTIIIDHKAFPIQDKSPKQVVNFPFPKKMITESNSKDIREYLASTFPFEQQSTILDSVKSIAKVQIDDRKAFDLQLKFRQENLAELKDQIILSLGANNGNQNWQKLLDYTNKLDELSNTKISPEEFIEEIQKVLYKVKLESTSTSKLYSQFNLSIQDFALQIIHSKYKSNQISQNDLLKLITEDEMLKILAKTKVLTYKMKYFDSASKMGINKYISTEMMDLDWQFSHYKTFNDALKKNKASDSSYLGWLTHGYSIKYGLSPNNERSMFFQDGRKYAELYAFSKSPHRKIIPGEHLKDLLAKINKSKGIFLDQNALLDKRIYAFHELNTLETHFPGITSSFTDDLKSNYRKKMESVSLTCQVLQEIGNIHRFIESKVPYHSSTEYGLFSIPKIFSIPIDYKHGEKENLVSYVDFLYSTAHERILQDNSINQLCLDPLQESLNRIKSNIPVFFNLASHSSPIKPSNVHEGKL 575 T 0.44 Type2_restr_D3 pdbpssm F Eukaryota T 7xe0 2 B,G,H,I,J B,D,F,H,J A0A5B0N367_PUCGR AvrSr35 HHHHHHSSGVDLGTENLYFQSNAMRNFAADRVHGVESVISGSKSSSNPMALSKSMDKPDTSDLVDSNVQAKNDGSRYEEDFTAKYSEQVDHVSKILKEIEEQEPGTIIIDHKAFPIQDKSPKQVVNFPFPKKMITESNSKDIREYLASTFPFEQQSTILDSVKSIAKVQIDDRKAFDLQLKFRQENLAELKDQIILSLGANNGNQNWQKLLDYTNKLDELSNTKISPEEFIEEIQKVLYKVKLESTSTSKLYSQFNLSIQDFALQIIHSKYKSNQISQNDLLKLITEDEMLKILAKTKVLTYKMKYFDSASKMGINKYISTEMMDLDWQFSHYKTFNDALKKNKASDSSYLGWLTHGYSIKYGLSPNNERSMFFQDGRKYAELYAFSKSPHRKIIPGEHLKDLLAKINKSKGIFLDQNALLDKRIYAFHELNTLETHFPGITSSFTDDLKSNYRKKMESVSLTCQVLQEIGNIHRFIESKVPYHSSTEYGLFSIPKIFSIPIDYKHGEKENLVSYVDFLYSTAHERILQDNSINQLCLDPLQESLNRIKSNIPVFFNLASHSSPIKPSNVHEGKL 575 T 0.44 Type2_restr_D3 pdbpssm F Eukaryota T 7xeb 2 C,D,G,H C,D,G,H GLY-PRO-HYP peptide GPP 3 T 75 NapE pdbhh F F 7xeb 3 E,F,I,J E,F,I,J GLY-PRO-HYP-GLY-PRO-HYP peptide GPPGPP 6 T 0.79 DUF374 pdbhh F F 7xf2 1 A,B A,B K7WK08_9VIRU VP51 MSASLILDEYLKKTASAVLDVADSFEKIKGEIQSPEEAAALSVALYGAPPKPSASAVASIITGERTSLNDKYLSDNVLLKMSVARVGQENNRKRADQAADEIRTIMEDITGSLSGAYRQYSPLEEENKVHIGIMNNKTPSIVCGYYTMDTSISSEPLSLTDFQNPTVIANVTKRMESIFSKVDSARSTRFDAFVNGVANNMDIKSSIDWANMVENVIKLPDSTPNPCSVDTIVSRDASVVKTAVNDIYASVGKSYCRPATQLTFMSEIEKLRKAAVVCFEALMSDTRERAFVEFLFYVSFKEDASNTNSKLFVQNKLSSMSGNPRQPIKLVRRSAEETLFGLCFMFKVMPPEFMNCIFNFPTIPHSTQYHGLYGTCLTPLLRKYGSSFEKSWAHFEEILSERANAVKKFGVNDTRIDCLDAVANLTGPVYVLILDLVRTLSAQRSCSTKFLREIKENYLLWNRFVS 466 T 0.068 7TMR-HDED pdbpercent T Viruses T 7xfr 1 A,C A,C WIPI2_HUMAN WIPI-2,WIPI49-LIKE PROTEIN 2 GPGSGQLLFANFNQDNTSLAVGSKSGYKFFSLSSVDKLEQIYECTDTEDVCIVERLFSSSLVAIVSLKAPRKLKVCHFKKGTEICNYSYSNTILAVKLNRQRLIVCLEESLYIHNIRDMKVLHTIRETPPNPAGLCALSINNDNCYLAYPGSATIGEVQVFDTINLRAANMIPAHDSPLAALAFDASGTKLATASEKGTVIRVFSIPEGQKLFEFRRGVKRCVSICSLAFSMDGMFLSASSNTETVHIFKLETVKEQGRAFATVRLPFCGHKNICSLATIQKIPRLLVGAADGYLYMYNLDPQEGGECALMKQHRLDGSLE 321 T 0.00025 WD40 pdbpercent F Eukaryota T 7xgj 2 D,E,F D,E,F GZS-ASN-ASP-ALA-LEU-IML-EOE-NH2 XNDALXXX 8 T 330 DUF1911 pdbhh F F 7xhf 2 C,D C,D USP10/6-21 PQYIFGDFSPDEFNQF 16 T 1.2 Methyltrans_RNA pdbhh F T 7xhg 2 E,F,G F,G,E Caprin-1(369-378) PYNFIQDSML 10 T 4E-05 Caprin-1_C pdbhh F T 7xhn 1 A,H o,O CENPO_HUMAN CENP-O,INTERPHASE CENTROMERE COMPLEX PROTEIN 36 MEQANPLRPDGESKGGVLAHLERLETQVSRSRKQSEELQSVQAQEGALGTKIHKLRRLRDELRAVVRHRRASVKACIANVEPNQTVEINEQEALEEKLENVKAILQAYHFTGLSGKLTSRGVCVCISTAFEGNLLDSYFVDLVIQKPLRIHHHSVPVFIPLEEIAAKYLQTNIQHFLFSLCEYLNAYSGRKYQADRLQSDFAALLTGPLQRNPLCNLLSFTYKLDPGGQSFPFCARLLYKDLTATLPTDVTVTCQGVEVLSTSWEEQRASHETLFCTKPLHQVFASFTRKGEKLDMSLVS 300 F F Eukaryota T 7xho 8 H O CENPO_HUMAN CENP-O,INTERPHASE CENTROMERE COMPLEX PROTEIN 36 MEQANPLRPDGESKGGVLAHLERLETQVSRSRKQSEELQSVQAQEGALGTKIHKLRRLRDELRAVVRHRRASVKACIANVEPNQTVEINEQEALEEKLENVKAILQAYHFTGLSGKLTSRGVCVCISTAFEGNLLDSYFVDLVIQKPLRIHHHSVPVFIPLEEIAAKYLQTNIQHFLFSLCEYLNAYSGRKYQADRLQSDFAALLTGPLQRNPLCNLLSFTYKLDPGGQSFPFCARLLYKDLTATLPTDVTVTCQGVEVLSTSWEEQRASHETLFCTKPLHQVFASFTRKGEKLDMSLVS 300 F F Eukaryota T 7xhs 1 A A A0A2S8QTL8_PHOLU Cro/Cl family transcriptional regulator MINDMHPSLIKDKDIVDDVMLRSCKIIAMKVMPDKVMQVMVTVLMHDGVCEEMLLKWNLLDNRGMAIYKVLMEALCAKKDVKISTVGKVGPLGCDYINCVEISM 104 T 7.9 HU-CCDC81_bac_1 pdbhh F Bacteria T 7xi1 1 A A anti-CRISPR protein AcrIF24 MNAIHIGPFSITPAARGLHYGGLPHHQWTLYYGPREMAIKTLPDSYTSSEVRDEFSDIIAEFVIDARHRYAPDVLELVNSDGDAVLARVAVSRLPEALSGCIPDDRFPYWLLTASRPRLGLPVTLNEYTALAVELSAPPLAWITGLLPGEVLTHDAEEWRPPTSWELRHVVGEGSFTGVSGAAAAALLGMSATNFRKYTAGDSAANRQKISFAAWHYLLDRLGVKRASLEHHHHHH 236 T 0.00065 DUF4447 pdbhh F T 7xjg 2 B,J C,J RIB86_ECOLX retron St85 family effector protein MNKKFTDEQQQQLIGHLTKKGFYRGANIKITIFLCGGDVANHQSWRHQLSQFLAKFSDVDIFYPEDLFDDLLAGQGQHSLLSLENILAEAVDVIILFPESPGSFTELGAFSNNENLRRKLICIQDAKFKSKRSFINYGPVRLLRKFNSKSVLRCSSNELKEMCDSSIDVARKLRLYKKLMASIKKVRKENKVSKDIGNILYAERFLLPCIYLLDSVNYRTLCELAFKAIKQDDVLSKIIVRSVVSRLINERKILQMTDGYQVTALGASYVRSVFDRKTLDRLRLEIMNFENRRKSTFNYDKIPYAHP 307 T 0.05 Stork_head pdb F Bacteria T 7xjj 1 A A A0A1W2PP38_HUMAN;GNAO_HUMAN G protein subunit alpha o1,Guanine nucleotide-binding protein G(o) subunit alpha MGCTLSAEERAALERSKAIEKNLKEDGISAAKDVKLLLLGADNSGKSTIVKQMKIIHGGSGGSGGTTGIVETHFTFKNLHFRLFDVGGQRSERKKWIHCFEDVTAIIFCVDLSDYNRMHESLMLFDSICNNKFFIDTSIILFLNKKDLFGEKIKKSPLTICFPEYTGPNTYEDAAAYIQAQFESKNRSPNKEIYCHMTCATDTNNAQVIFDAVTDIIIANNLRGCGLY 228 T 1.2E-123 G-alpha unp F Eukaryota T 7xjk 2 B B Guanine nucleotide-binding protein G(q) MGSTVSAEDKAAAERSKMIDKNLREDGEKARRTLRLLLLGADNSGKSTIVKQMRILHGGSGGSGGTSGIFETKFQVDKVNFHMFDVGGQRDERRKWIQCFNDVTAIIFVVDSSDYNRLQEALNDFKSIWNNRWLRTISVILFLNKQDLLAEKVLAGKSKIEDYFPEFARYTTPEAATPEPGDDPRVTRAKYFIRKEFVDISTASGDGRHICYPHFTCAVDTENARRIFNDCKDIILQMNLREYNLV 246 T 2.5E-10 G-alpha pdb F T 7xjl 2 B B Guanine nucleotide-binding protein G(q) subunit alpha GPMGSTVSAEDKAAAERSKMIDKNLREDGEKARRTLRLLLLGADNSGKSTIVKQMRILHGGSGGSGGTSGIFETKFQVDKVNFHMFDVGGQRDERRKWIQCFNDVTAIIFVVDSSDYNRLQEALNDFKSIWNNRWLRTISVILFLNKQDLLAEKVLAGKSKIEDYFPEFARYTTPEAATPEPGDDPRVTRAKYFIRKEFVDISTASGDGRHICYPHFTCAVDTENARRIFNDCKDIILQMNLREYNLV 248 T 2.5E-10 G-alpha pdb F T 7xjo 2 C,D C,D RYH-KFB-GLU-ASP-DAB-LEU-EME-EOE-NH2 XXEDXLXXX 9 T 680 MRI pdbhh F F 7xl3 5 F G Q9HTR9_PSEAE Transcriptional factor SutA GAMGMSEEELEQDELDGADEDDGEELAAADDGEADSGDGDEAPAPGKKAKAAVVEEELPSVEAKQKERDALAKAMEEFLSRGGKVQEIEPNVVADPPKKPDSKYGSRPI 109 T 0.014 fvmX3 unppssm F Bacteria T 7xl4 5 F G Q9HTR9_PSEAE Transcriptional factor SutA GAMGMSEEELEQDELDGADEDDGEELAAADDGEADSGDGDEAPAPGKKAKAAVVEEELPSVEAKQKERDALAKAMEEFLSRGGKVQEIEPNVVADPPKKPDSKYGSRPI 109 T 0.014 fvmX3 unppssm F Bacteria T 7xl7 1 A,B A,B A0A2J0R8J6_SALTM Uncharacterized protein MKEGFYWIQHNGRVQVAYYTHGVTEDLETGQTIIGVWHLTQGDDICHNGEAEILAGPLEPPI 62 T 4.9 Cuticle_2 pdbhh F Bacteria T 7xml 2 B,D C,D GP60_BPSP1 PEIP MLNQVEVLREEYVEGYVVQMWRRNPSNAPVIEVFTEDNLEEGIIPEYVTANDDTFDRIVDAVEFGYLEELELV 73 T 0.12 NtrY_N pdb T Viruses T 7xmw 1 A,B A,B U2Q5N5_LEPWF AcrVIA2 HHHHHAMWKCKKCGCDRFYQDITGGISEVLEMDKDGEVLDEIDDVEYGDFSCAKCDNSSSKIQEIAYWDEIN 72 T 0.011 CpXC unphh F Bacteria T 7xna 2 B B CYN 154806 XXXYXKTCXX 10 T 0.27 Urotensin_II pdbhh F F 7xnj 1 A,B,C,D A,B,C,D A6V4P9_PSEA7 Stress Response Facilitator A, SrfA MAESQDKYTRRTGRTWADDQATYNRLREEADAARQKLRESGYSGAEYDQLRQAAFDLNRKANQYWEQMLSDLRQED 76 T 0.051 Flg_hook pdb F Bacteria T 7xnm 2 C,D C,D ILE-LEU-ALA-PRO-PRO-GLU-ARG ILAPPER 7 T 43 DUF5543 pdbhh F T 7xno 4 D,H,L B,F,J SAIA_LATSK Sakacin-A immunity factor MKHHHHHHHGAAGTSLYKKAGENLYFQGSMKADYKKINSILTYTSTALKNPKIIKDKDLVVLLTIIQEEAKQNRIFYDYKRKFRPAVTRFTIDNNFEIPDCLVKLLSAVETPKAWSGFS 119 T 0.06 DUF5112 pdb F Bacteria T 7xom 2 O U Polyalanine model of UDP-glucuronosyltransferase 1A (UGT1A) XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 32 F F F 7xox 6 F L cck-8 FDMWGMYDEAYGWMDF 16 T 0.0021 Gastrin pdbhh F T 7xp4 1 A A GNAS2_HUMAN Guanine nucleotide-binding protein G(t) subunit alpha-3 MDYKDDDDKENLYFQSNSKTEDQRNEEKAQREANKKIEKQLQKDKQVYRATHRLLLLGADNSGKSTIVKQMRILHGGSGGSGGTSGIFETKFQVDKVNFHMFDVGGQRDERRKWIQCFNDVTAIIFVVDSSDYNRLQEALNDFKSIWNNRWLRTISVILFLNKQDLLAEKVLAGKSKIEDYFPEFARYTTPEDATPEPGEDPRVTRAKYFIRDEFLRISTASGDGRHYCYPHFTCAVDTQNVKFVFDAVTDIIIKENLKDCGLF 264 T 5.1E-10 G-alpha pdb F Eukaryota T 7xp5 2 B A GNAS2_HUMAN Guanine nucleotide-binding protein G(t) subunit alpha-3 MDYKDDDDKENLYFQSNSKTEDQRNEEKAQREANKKIEKQLQKDKQVYRATHRLLLLGADNSGKSTIVKQMRILHGGSGGSGGTSGIFETKFQVDKVNFHMFDVGGQRDERRKWIQCFNDVTAIIFVVDSSDYNRLQEALNDFKSIWNNRWLRTISVILFLNKQDLLAEKVLAGKSKIEDYFPEFARYTTPEDATPEPGEDPRVTRAKYFIRDEFLRISTASGDGRHYCYPHFTCAVDTQNVKFVFDAVTDIIIKENLKDCGLF 264 T 5.1E-10 G-alpha pdb F Eukaryota T 7xp6 1 A A GNAS2_HUMAN Guanine nucleotide-binding protein G(t) subunit alpha-3 MDYKDDDDKENLYFQSNSKTEDQRNEEKAQREANKKIEKQLQKDKQVYRATHRLLLLGADNSGKSTIVKQMRILHGGSGGSGGTSGIFETKFQVDKVNFHMFDVGGQRDERRKWIQCFNDVTAIIFVVDSSDYNRLQEALNDFKSIWNNRWLRTISVILFLNKQDLLAEKVLAGKSKIEDYFPEFARYTTPEDATPEPGEDPRVTRAKYFIRDEFLRISTASGDGRHYCYPHFTCAVDTQNVKFVFDAVTDIIIKENLKDCGLF 264 T 5.1E-10 G-alpha pdb F Eukaryota T 7xpk 2 B,D B,D A0A0E0IIA9_ORYNI Alpha-aminoacylpeptide hydrolase PPKRRAISAIRKFPRDCG 18 T 5.8 DUF3343 pdbhh F Eukaryota T 7xql 1 A,B B,A Q5ZSR1_LEGPH ANK_REP_REGION DOMAIN-CONTAINING PROTEIN MLTPPPDSKISTTDKSLDKLSAPLDMLKQMNESTMEQTKLDELRKKMSLQAEILNKAKADNDMFFRLLIELMSLKLQGELFKEQLSKISKESGYDSAQSALIQATNSEGQSPLQYALQKQDFSTAKYFLDNGAKAGPIEKAVFEIALDSKAAKEFGFPPLPPEKEKLHPVKNFGLVLGIKTTSVDGTPSQFGHIAPTYQLMTDSVSHFAKSHPGNKNFQEIANAFQFSNEASAFKFSTPQRNPEAGNDLARRIQGGELTTIPVSCKGHAMGLSYVPDGPGSKSGYLVYTNRGLGAKSSEHGTHIFRIEDSSKITPEFINNMTSGHSNGASHDEIMSQIKAAAGNKEPIHHIKQKGQKNDNCTIANSKSNIEGILLCQKAREVGGFDKLTESDMDSVKKEYKEFTKHMRVEKVNELAKALKENPQDPDLNNLTKEYLKQHPNADPKLKQTLETALKQASESSMTLSQPGKTI 471 T 0.00015 Shigella_OspC pdbhh F Bacteria T 7xqp 14 N O A0A2K1JDE1_PHYPA PsaO FNRDWLRKDLSVIGFGLIGWLAPSSLPVINGNSLTGLFLGSIGPELAHFPTGPALTSPFWLWMVTWHVGLFIVLTLGQIGFKGRQDGYW 89 T 0.1 Plasmid_RAQPRD unppercent F Eukaryota T 7xr2 1 A,B A,B E9LEU6_9REOV VP3 MASTTRLVNDRKQLEQQVKDDARILADARGLNITTVANDSATGGQAIRNVGPNDEATIKALDNVIKQIEALSVIVNRSEKADDAQILGPNTYKQLLEHLFSPEENVYILLPIQAYTGGVIDRRDASFSNFAYSIASKLMMELSAATHNKIFTDYTRIAASALGPEISTEGMPLFSLIESLELTEAETSRLPVIQDSMVIQKSTATVGNAQQGISTINIKRVPFVGSAFQQVIDQLLWEYSTTSLTTKEQRRQRITEMVNDRRIMIQKLTLAEKPQVMRHVTTEINNDLFFKMSPVAQLYIYHLDRAFLDGVGFTPLAEKQQQLQLQLKTNILTANLIRSAINGMNTESNLEVAIKMMQAAQLHRASIEIAFPMNVSLSPEIIVQCFIVWMSIPEQLLSDRSNFIIAAVIWAGFSADDSYADIMRRSARASDRQNYDIIKAALSSRKFKLPRASTTLFDENEPVVRRYQIGRVYAPFPVDRYGSPVYSNCTKVELASDYNAEGFTIRKDDFRALQAVLRIDEDRAADMFTTLRIMISSIPAVWYDAEVVHYPHTAVELEQLAAYGLTGAYPRTNHSVDTIVKTVNNISATYSTIAQMLSTIDLDPTRYGTSESIDKFKIAWENVESVLNMEGNDFVKTIMYAYEDNFPKKDFYMMLKQIASDGQGAHPIAAAIDQLRTIVYREPERFGYIDSVILTHNPDVDTAYNRFFHLHPIVTNQPSNTIKNAQLWNEMRLEQQVEHIKAGPVRIIGPFHVTYNYLSEEEDMPATSHIIMKDNMILNDHLTFNFVKRERRNNKKRVSSFRYKAVEMYVAVRISRFQLEVLRDLHDLVRSRTYLDVSKSPLATTPIRVVEYVR 854 T 6.7 DUF4982 pdbhh T Viruses T 7xr2 2 C,D 1,2 G9BDA7_9REOV VP11 MNWSKAINFQPFMLETRPPLTTIPIMDQLVEIGERSNQKWSMTDRLFFAIRKINPIFVTSSQIPSKFDYTILQMPTQLIASLKETLLFLAFSYYLREYQDKVGQMKFYPVAMKNMIPIVNYLKDRVHNNFDTTLEQAYRQNVVHTLSASDAFDLLSGMIATTRLDLIQRTRICPELLNVLNKMSFILIYAPNRPSILSWKNQS 203 T 0.11 MgtE_N unppercent T Viruses T 7xr2 3 E,F,G,H,I,J,K,L,M,N,O,P,Q a,b,c,d,e,f,g,h,i,j,k,l,m G9BDA8_9REOV VP12 MNLEINNFAPAISSIGSQLCSLSAQKLLTCRKQYGNGAKSFEEFYAEIGGIIGMMGINSQTPSGIREAIYRLYQSAFLFGDIFPESFGIQNTQNIKPPPGFTAPAKKLEVVLPQGGAFDLIYNNGEIRVTTTRNVQAGDLVCTVTFPIQGSVIATRNCHVNEIGGQLTTTRPEIIASVPMPARTVIVASFDAIEIGYGEGDDLFAIGIAILSNRFNGQITPMSRHNYMTQMFANLPANMSERDSSAVLHFAQAAPVVLGMMERLTGAPKWVLDY 274 T 0.082 SopE_GEF pdb T Viruses T 7xr3 1 A,B,C,D,E,F,G,H,I,J A,B,C,D,E,F,G,H,I,J E9LEU6_9REOV VP3 MASTTRLVNDRKQLEQQVKDDARILADARGLNITTVANDSATGGQAIRNVGPNDEATIKALDNVIKQIEALSVIVNRSEKADDAQILGPNTYKQLLEHLFSPEENVYILLPIQAYTGGVIDRRDASFSNFAYSIASKLMMELSAATHNKIFTDYTRIAASALGPEISTEGMPLFSLIESLELTEAETSRLPVIQDSMVIQKSTATVGNAQQGISTINIKRVPFVGSAFQQVIDQLLWEYSTTSLTTKEQRRQRITEMVNDRRIMIQKLTLAEKPQVMRHVTTEINNDLFFKMSPVAQLYIYHLDRAFLDGVGFTPLAEKQQQLQLQLKTNILTANLIRSAINGMNTESNLEVAIKMMQAAQLHRASIEIAFPMNVSLSPEIIVQCFIVWMSIPEQLLSDRSNFIIAAVIWAGFSADDSYADIMRRSARASDRQNYDIIKAALSSRKFKLPRASTTLFDENEPVVRRYQIGRVYAPFPVDRYGSPVYSNCTKVELASDYNAEGFTIRKDDFRALQAVLRIDEDRAADMFTTLRIMISSIPAVWYDAEVVHYPHTAVELEQLAAYGLTGAYPRTNHSVDTIVKTVNNISATYSTIAQMLSTIDLDPTRYGTSESIDKFKIAWENVESVLNMEGNDFVKTIMYAYEDNFPKKDFYMMLKQIASDGQGAHPIAAAIDQLRTIVYREPERFGYIDSVILTHNPDVDTAYNRFFHLHPIVTNQPSNTIKNAQLWNEMRLEQQVEHIKAGPVRIIGPFHVTYNYLSEEEDMPATSHIIMKDNMILNDHLTFNFVKRERRNNKKRVSSFRYKAVEMYVAVRISRFQLEVLRDLHDLVRSRTYLDVSKSPLATTPIRVVEYVR 854 T 6.7 DUF4982 pdbhh T Viruses T 7xr3 2 K Z G9BD97_9REOV VP1 MRIMAQRLKELQREIDKKKKERIAEAYLSSVEVTNSSPSLSKQDDALTLPKVSPFLDSTPFTTLHNSLYGQQIHSIDDELAQICKLEYELQTQIADEQITALKHFLTIRTGSPQEIQYVDKEWMKSNQHVPSFLGDVKLMFGDTAGKFRSTSKSVDSIHSITSDVQVTRKKQTRSQIRNSYRVQKKHKVQQPLKPNTLYVYKYKGLPRVVLRFVPKVDTTSNSNSSSASDSKKDKDAFSCDDLSPTWKYILTEAKRAFPDRSYSDCIHPMTWEEWLEENQDHVKVLTQYAHQLDYVTLLQDFNLYVSGGASRVRNIDMSTLPTSINVLDHFELYGDASMKEYVRSGEWYGLLREIEQEGMTVNESEKVFANPDTYVLNVKKYFLRRFQQEIASTGMTPLTDELLNIMFVHWNIIVTAEPKLQVIKDDLLKYYSRYGVDATFDYNMKRSEMTVVTRGHLLAHKVLECALRIVETIYTYDIQDETFKDILIDLGRLIMRDPIYGTTTVRDATTVMKQLMYTQGTQFRRIMFKKYDYSNFNEKLVLKGEQMTNEPPTLLATTHYEEMDKKRIDALIKANQRAGNILSQSSIERCRYTDSLDLVGDANRYFSALTTLEAVAGFASSDLLSGFIDSNESIEFTGTAHLRKLLYHSVREQITTLNTSTVPRPSLPKVLLSSAKDTASASIEPLTFRIYKTTPEYDGESLNLVESTVEMSTRQKKPNLMKAAEILRSTVTTNQEMIISGGTRAVQGGKGARAVYPTKQPYHIAGSLLFHKVDTIVNANKKYRGVSNKYGQGISNAIPHIGVPEIIAVSSDGMAICLALDVSAFDVAQKYTEADIELAMRDGFLDSEISMISGETVLERMNPADLANNLLTNTPPRYKYQTALGDIIILQHDNRSGVPWTGTQNDLVNVSNHHMAYDEYKKRVAELQRQGKISIDVNDKHHIVRVFGDDSTFIMTYDEPPSAEEVHLMCATFVESYQDTAGTLGFAINARKGMIGRYGSEYLKNSAIYGNIKSVNQVKFRGSEKSASYHFGVSEKVSMIRDITDLTITRGCDETRKWKYNLMMLPVDLTTRAGAFRMHNLCSIMTGVGKMYLGGTLNNKLIASYHGSSFGWNFDDNLIKTANSIGAISDSSYDAISTKITNLADFKDSQQRITRDIITSGRLPQHLNRYGKSNILRHILASAAMGPLSQIEKNVNAYNVVMGILNGKLEAPTVLERLNMGFKYVVMSDLKQDDYSPYSCQGLQYRRMLVHWGLNDSRITSFDPKGKLQHLLAKNSQILPIHFDIEFVYRLYLQAGTMGFLQVMSYYQLPDTLTHEMLAAVVALELQLGNDKYAVDMGVYSSQAGQIRINDALMDSIIQHRRGPPLPIIDRTLNRLLLHTYMLMFGLMGKSIDSTKIDPTLSWRAILESNDQRIAQLSELLTAV 1425 T 0.021 FAM220 pdb T Viruses T 7xrw 1 A A C1JEX5_TRYBB Repressor activator protein 1 GPGSEATEEIAALDQPFEKCFIPTEALGSDREGLDRTQLERQLPFRNYPIKLNVSKSGIFCQFPTVSDAKRFYEEGTVEILNRSLPIKPVFEKRNETVAPAERKRRRSVSPGGVHPQTAAVSALSRR 127 T 0.012 VIR_N unp F Eukaryota T 7xs0 2 B B UNK-UNK-UNK XXX 3 F F F 7xsj 3 C C APBA1_RAT ADAPTER PROTEIN X11ALPHA,NEURON-SPECIFIC X11 PROTEIN,NEURONAL MUNC18-1-INTERACTING PROTEIN 1,MINT-1 GPGSEFYRQEALGARLHHYDERSDGESDSPEKEAEFAPYPRMDSYEQEEDIDQIVAEVKQSMSSQSLDKAAEDMPEAEQDLER 83 T 0.1 ACP_PD pdb F Eukaryota T 7xtg 2 B,F,J B,F,J SAIA_LATSK Sakacin-A immunity factor ADYKKINSILTYTSTALKNPKIIKDKDLVVLLTIIQEEAKQNRIFYDYKRKFRPAVTRFTIDNNFEIPDCLVKLLSAVETPKAWSGFS 88 F F Bacteria T 7xtl 1 A,B A,B MGT4A_HUMAN MGAT4A MGSSHHHHHHSSGLVPRGSHMASKIHVNPPAEVSTSLKVYQGHTLEKTYMGEDFFWAITPIAGDYILFKFDKPVNVESYLFHSGNQEHPGDILLNTTVEVLPFKSEGLEISKETKDKRLEDGYFRIGKFENGVAEGMVDPSLNPISAFRLSVIQNSAVWAILNEIHIKKATN 172 T 0.58 NADase_NGA pdbhh F Eukaryota T 7xuv 2 B B RMI1_HUMAN BLM-ASSOCIATED PROTEIN OF 75 KDA,BLAP75,FAAP75 SGSDEELLASLDENDELTANND 22 T 23 DUF4293 pdbhh F Eukaryota T 7xv3 3 C A Engineered G protein subunit S (mini-Gs) MGCTLSAEDKAAVERSKMIEKQLQKDKQVYRATHRLLLLGADNSGKSTIVKQMRIYHVNGYSEEECKQYKAVVYSNTIQSIIAIIRAMGRLKIDFGDSARADDARQLFVLAGAAEEGFMTAELAGVIKRLWKDSGVQACFNRSREYQLNDSAAYYLNDLDRIAQPNYIPTQQDVLRTRVKTSGIFETKFQVDKVNFHMFDVGAQRDERRKWIQCFNDVTAIIFVVDSSDYNRLQEALNDFKSIWNNRWLRTISVILFLNKQDLLAEKVLAGKSKIEDYFPEFARYTTPEDATPEPGEDPRVTRAKYFIRDEFLRISTASGDGRHYCYPHFTCSVDTENARRIFNDCRDIIQRMHLRQYELL 361 F F T 7xv4 2 B B ATRIP_HUMAN ATM AND RAD3-RELATED-INTERACTING PROTEIN GDFTADDLEELDTLASQ 17 T 1.6 Med21 unppercent F Eukaryota T 7xv7 1 A,B B,A ZY11B_HUMAN Protein zyg-11 homolog B GYINVGAQLGTELFIVRQLLQIVKQKTNQNSVDTTLKFTLSALWNLTDESPTTCRHFIENQGLELFMRVLESFPTESSIQQKVLGLLNNIAEVQELHSELMWKDFIDHISSLLHSVEVEVSYFAAGIIAHLISRGEQAWTLSRSQRNSLLDDLHSAILKWPTPECEMVAYRSFNPFFPLLGCFTTPGVQLWAVWAMQHVCSKNPSRYCSMLIEEGGLQHLYNIKDHEHTDPHVQQIAVAILDSLEKHIVR 250 T 0.00014 DUF4704 pdb F Eukaryota T 7xva 2 B B C9JGY8_HUMAN Juxtaposed with another zinc finger protein 1 QQPTYVALSYINRFMTDAARREQES 25 T 9.1 Herpes_U34 pdbhh F Eukaryota T 7xvg 2 B B A0A5B0N367_PUCGR AvrSr35 HHHHHHSSGVDLGTENLYFQSNANSKDIREYLASTFPFEQQSTILDSVKSIAKVQIDDRKAFDLQLKFRQENLAELKDQIILSLGANNGNQNWQKLLDYTNKLDELSNTKISPEEFIEEIQKVLYKVKLESTSTSKLYSQFNLSIQDFALQIIHSKYKSNQISQNDLLKLITEDEMLKILAKTKVLTYKMKYFDSASKMGINKYISTEMMDLDWQFSHYKTFNDALKKNKASDSSYLGWLTHGYSIKYGLSPNNERSMFFQDGRKYAELYAFSKSPHRKIIPGEHLKDLLAKINKSKGIFLDQNALLDKRIYAFHELNTLETHFPGITSSFTDDLKSNYRKKMESVSLTCQVLQEIGNIHRFIESKVPYHSSTEYGLFSIPKIFSIPIDYKHGEKENLVSYVDFLYSTAHERILQDNSINQLCLDPLQESLNRIKSNIPVF 441 T 0.32 Type2_restr_D3 pdbpssm F Eukaryota T 7xw9 6 F L TRH peptide QHPX 4 T 120 TRH pdbhh F F 7xwo 6 F D HIS-LYS-THR-ASP-SER-PHE-VAL-GLY-LEU-MET-NH2 HKTDSFVGLMA 11 T 0.046 Tachykinin pdbhh F T 7xx2 2 B B A0A5B0N367_PUCGR AvrSr35 HHHHHHSSGVDLGTENLYFQSNAMRNFAADRVHGVESVISGSKSSSNPMALSKSMDKPDTSDLVDSNVQAKNDGSRYEEDFTAKYSEQVDHVSKILKEIEEQEPGTIIIDHKAFPIQDKSPKQVVNFPFPKKMITESNSKDIREYLASTFPFEQQSTILDSVKSIAKVQIDDRKAFDLQLKFRQENLAELKDQIILSLGANNGNQNWQKLLDYTNKLDELSNTKISPEEFIEEIQKVLYKVKLESTSTSKLYSQFNLSIQDFALQIIHSKYKSNQISQNDLLKLITEDEMLKILAKTKVLTYKMKYFDSASKMGINKYISTEMMDLDWQFSHYKTFNDALKKNKASDSSYLGWLTHGYSIKYGLSPNNERSMFFQDGAKYAELYAFSKSPHRKIIPGEHLKDLLAKINKSKGIFLDQNALLDKRIYAFHELNTLETHFPGITSSFTDDLKSNYRKKMESVSLTCQVLQEIGNIHRFIESKVPYHSSTEYGLFSIPKIFSIPIDYKHGEKENLVSYVDFLYSTAHERILQDNSINQLCLDPLQESLNRIKSNIPVFFNLASHSSPIKPSNVHEGKL 575 T 0.44 Type2_restr_D3 unppssm F Eukaryota T 7xxf 7 KA,LA,MA,NA,OA,PA,QA,RA,SA,TA,UA a,b,c,d,e,f,g,h,i,j,k Light-harvesting protein LH1 Gamma-like MAMVWMWILIAPAIGIVLLSRQ 22 T 0.59 DUF1514 pdbhh F T 7xya 9 J G A0A2R3ITY7_PSEAI AlpA MFQSTEQALAVAYWMFEQQPGPRSSTAMVIDSLRERFDRRFIERLPSGLSPHEWQAQAVMTVRFAQRQLAAHPLELAVVRAEFARGRDFVLGLAALRDWLKPAAGPIEQRAALALLMRMFRRPPSSIREIERLSGLSKSTLHRWDKEWRERVAALLRQALLRLEEPMAQVGIVCEH 176 T 0.00097 HTH_IclR pdbpercent F Bacteria T 7xyb 5 F G A0A2R3ITY7_PSEAI AlpA MFQSTEQALAVAYWMFEQQPGPRSSTAMVIDSLRERFDRRFIERLPSGLSPHEWQAQAVMTVRFAQRQLAAHPLELAVVRAEFARGRDFVLGLAALRDWLKPAAGPIEQRAALALLMRMFRRPPSSIREIERLSGLSKSTLHRWDKEWRERVAALLRQALLRLEEPMAQVGIVCEH 176 T 0.00097 HTH_IclR pdbpercent F Bacteria T 7xys 1 A,B,C,D A,B,C,D ZER1_HUMAN HZYG,ZYG-11 HOMOLOG B-LIKE PROTEIN,ZYG11B-LIKE PROTEIN SFLHVGKMGFVVTMLKLIQKKLLDKTCDQVMEFSWSALWNITDETPDNCEMFLNFNGMKLFLDCLKEFPEKQELHRNMLGLLGNVAEVKELRPQLMTSQFISVFSNLLESKADGIEVSYNACGVLSHIMFDGPEAWGVCEPQREEVEERMWAAIQSWDINSRRNINYRSFEPILRLLPQGISPVSQHWATWALYNLVSVYPDKYCPLLIKEGGMPLLRDIIKMATARQETKEMARKVIEHCSNFKEENMDTSR 253 T 0.0027 Arm_3 pdbpercent F Eukaryota T 7xyt 1 A,B,C,D A,B,D,C ZER1_HUMAN HZYG,ZYG-11 HOMOLOG B-LIKE PROTEIN,ZYG11B-LIKE PROTEIN AFLHVGKMGFVVTMLKLIQKKLLDKTCDQVMEFSWSALWNITDETPDNCEMFLNFNGMKLFLDCLKEFPEKQELHRNMLGLLGNVAEVKELRPQLMTSQFISVFSNLLESKADGIEVSYNACGVLSHIMFDGPEAWGVCEPQREEVEERMWAAIQSWDINSRRNINYRSFEPILRLLPQGISPVSQHWATWALYNLVSVYPDKYCPLLIKEGGMPLLRDIIKMATARQETKEMARKVIEHCSNFKEENMDTSR 253 T 0.0027 Arm_3 pdbpercent F Eukaryota T 7xyu 1 A,B,C,D B,A,C,D ZER1_HUMAN HZYG,ZYG-11 HOMOLOG B-LIKE PROTEIN,ZYG11B-LIKE PROTEIN TFLHKMGFVVTMLKLIQKKLLDKTCDQVMEFSWSALWNITDETPDNCEMFLNFNGMKLFLDCLKEFPEKQELHRNMLGLLGNVAEVKELRPQLMTSQFISVFSNLLESKADGIEVSYNACGVLSHIMFDGPEAWGVCEPQREEVEERMWAAIQSWDINSRRNINYRSFEPILRLLPQGISPVSQHWATWALYNLVSVYPDKYCPLLIKEGGMPLLRDIIKMATARQETKEMARKVIEHCSNFKEENMDTSR 251 T 0.0027 Arm_3 pdbpercent F Eukaryota T 7xyv 1 A,B A,B ZY11B_HUMAN Protein zyg-11 homolog B SFLHVGAQLGTELFIVRQLLQIVKQKTNQNSVDTTLKFTLSALWNLTDESPTTCRHFIENQGLELFMRVLESFPTESSIQQKVLGLLNNIAEVQELHSELMWKDFIDHISSLLHSVEVEVSYFAAGIIAHLISRGEQAWTLSRSQRNSLLDDLHSAILKWPTPECEMVAYRSFNPFFPLLGCFTTPGVQLWAVWAMQHVCSKNPSRYCSMLIEEGGLQHLYNIKDHEHTDPHVQQIAVAILDSLEKHIVR 250 T 0.00014 DUF4704 pdb F Eukaryota T 7xyw 1 A,B B,A ZY11B_HUMAN Protein zyg-11 homolog B AFLHVGAQLGTELFIVRQLLQIVKQKTNQNSVDTTLKFTLSALWNLTDESPTTCRHFIENQGLELFMRVLESFPTESSIQQKVLGLLNNIAEVQELHSELMWKDFIDHISSLLHSVEVEVSYFAAGIIAHLISRGEQAWTLSRSQRNSLLDDLHSAILKWPTPECEMVAYRSFNPFFPLLGCFTTPGVQLWAVWAMQHVCSKNPSRYCSMLIEEGGLQHLYNIKDHEHTDPHVQQIAVAILDSLEKHIVR 250 T 0.00014 DUF4704 pdb F Eukaryota T 7xyx 1 A,B B,A ZY11B_HUMAN Protein zyg-11 homolog B CFLHVGAQLGTELFIVRQLLQIVKQKTNQNSVDTTLKFTLSALWNLTDESPTTCRHFIENQGLELFMRVLESFPTESSIQQKVLGLLNNIAEVQELHSELMWKDFIDHISSLLHSVEVEVSYFAAGIIAHLISRGEQAWTLSRSQRNSLLDDLHSAILKWPTPECEMVAYRSFNPFFPLLGCFTTPGVQLWAVWAMQHVCSKNPSRYCSMLIEEGGLQHLYNIKDHEHTDPHVQQIAVAILDSLEKHIVR 250 T 0.00014 DUF4704 pdb F Eukaryota T 7xzi 1 A 3 A0A2K3D4W3_CHLRE Ctap3 MADGPSPIRIVLWNDGGESLAAGVEDEEQQQVLHSFADLVGSAIDAVLELPQFRHVEAVTAEAEEDEPGLSIGFDAGSGDGEVDIDNLKGRLDIAGLLLGSAQLPEELAEVAAVEVTDEEEGTTELQFTDEGLVQQLQAVVKRAKLEKRYNDWVAGVAESLGPALDAAAGGVEVTEMPVDPYDVLQAVVAQLIRVAGVSPPAPSLFSRTGALVGGVLGAPRSAVRQVTKRLGRAQRLWWRLEDVVVDGSKLALRLAVKAARPVLVGFVLHRVLKTLDRSRQLEYRLARMGPEEAREAYYEAVLGKDWKQQLQADWDKALEDVDAGLVTDEINHEKRLMTAAQLRRLEVEEWDKQRMKNFYLASFGGLRWFDQMEQALHNPLFIESRGWTDPVQNWVGQNRTYMDDLPAGQYMAGVGNAAIRIKEAELKRKLTDVERAHVLARGGAVAGGLLPQQPTDPATLAVAVGGAFVPSVAGKR 477 T 0.46 AXH pdbpssm F Eukaryota T 7xzi 2 B 4 A8J6H7_CHLRE OMP85 DOMAIN-CONTAINING PROTEIN MGASQESELDFVPRLSFLPIEWRSIGSAFGLKDKSGAAANGRATFTVRQGVDAAELTSTGRVIDGQADVGASLKLNTLAIGVSASNITFHSGLDDPTAAAAQRSSLIPSLKLTAAKQFKRDNYIAVSYDLKHQKPELSACWTGEAGADRATLLVNVDPVMRSVKLAAAVRTPGPEWRKVLYNDETDLLEYPADDGARHTLYVQHEVRGRDLLHATRLGCRLDLGRLVNYVVDFVDYRIEENIPSFVWNVPLLPQLYSLLVPADNDEQVRHRITGWELDVSHDFARSGLLPVVAISKTSKKLLGGGTLTASYDAAAREAGVSLSRKGVSVGARVARAEGAAGGLSAGWGRPSIHVAVEPLGLLQ 363 T 1 Thyroglobulin_1 pdbpssm F Eukaryota T 7xzi 3 C 5 Ctap5 MQLGQLRQPLRACQDQRLTRGVPLARRQLVVVSNWNPLGGKGGGNAEQPGGEEPIQDELLKLLRGGWVLLSNLALFLVFSSFLHRSLNWFVQTELLVAVGAPQQAGERVVGKFFEAIEWVERNILGWKLPGDEEAEDATSKVYEVLQNYTPAEAAYSFAQLKYKDLTHKERELFHKAYALRHFERRDGRPGDVDAAELQAVKDRLDPLEADRRAYAAAKAAGRLDEYWAAPGREATYQRIVGAPRIAARQCEMASMLKGLQAVLPAMELLAQLQVAQFVYAASKASKSRQQDDFKLQLQTFYGNVLDEQCQLRCMLLNVQLPMALVTVFVPQYCFCLLDRVVLPRRECGSASTLGSLLRACGRPGSACHAGGCSAQRRDPLTI 383 T 0.023 DUF2878 pdbpercent F T 7xzi 6 F A YCF78_CHLRE UNCHARACTERIZED MEMBRANE PROTEIN YCF78 MITFTFMSLVTSVKDYVEITHKLIEIEPLKNYTEFGAVFTYFIFSIGEFFKNFFSFSFLNNIWSIPIIIPDIASAMISEVSVLDGYFHNAFTFLETSVNTTTNPSLVIFEKFVIGIINSLFLILPTSTSHLITLRRFVMQGLEAGYMAGLGTLAGNFLWLASIILGWRFFVIPWLSLDIFRYLLGFVLLVKYIWDSSKERRMALEDLSKWKIFLLNFLLALTEQSCIYPFISNLSFGPDASILEGFPVDNYPQFLLIHGAYLLGILFGSFSLLQFTCWFWENPAFSIYLWITTKSSLKISTSSYYKILNFTFLYATMLCAIASIPYYGLDYTITNPIGLVPQDRILNQKKSQSDPDKLITETAFLNLNPTDKNSRIRDGVHARRERWKQRLIKYQAFDASTYDQGVYDFLTIEDLNYGFDRFWLRRKMRNHQIRFRLFPGPWMRSLKKQLNNPANPSLETSTKAASGPRVEFFRILFEQFYHPNFHDRAAMQTNPAEARNKFISTSPLASTESKKALNSTFSLGNINNSSTGIEGLVLTNTQATLLPTDLQTKRTIKPGLIYTNSALRKFVRNVNTRLNLKLLNSKETNLTTKYKSQFIYSKRWKSIFSKIQPLQNGTTRKSYQLFRNVAKQILVTPDAKSLKLITINQKLSLKERKLLELRTQYNNNSTLTTTAPLTLVRPLNVYLQKEEAFKRKLRYYGTMPMRKLTVGNQAPYFKALMKRGFYYYKPTLRWRKTLYVASLRRGFRKKSRKQRILVMPSNQQNFNNTLDNTKTNINQNNLANPLGGNEVPMYGADGENSLITKPTHSYTVLGKRASRYRHQIYKDVLQHWYYTPFNRLLMKFDVDAFINRQPKSHFLTKNEERALHIRRFLLSEHYDTLRWYTYMQHYKTMKTNIGGTKSFANRAYNQQFQGTFKKIRHLFAITPKQGDFYTLKFDQPLYNDNKLKDNLYFHEELLTDYYNGTNLQTNQTSNISVNSTTTFIDNSLRTTQLPVPSSSFDIVNQSSTLIGLTTMQNALRKNVVESTLTSLNSDGEAATSQPKLNFVYSELFVKLIKECKKRIHDQTFLKNYITHRIEKREQLNQEQTKELNKRLEKLKVWLNSDKGSISKLQNTPVQDPNISSPDKVLTTAMQKAVNESISLSGIMPSDKIKTTYGNLTNAYTIKTENAILTKLNVINQLTNNETTTQKNTLIKSIGVNKIQTVLQTIITNFKSSLYNQTQLLRVKTDKDLQWWRTKQRVITKRKSARKRDRFKKQIAVVNKKLAALSKKVETEKSNLYQTLYGNYEISDYLLRNVPTGSSAVIDSTVLRKKQDNQAYLPKETNNVQFNSFVDSNNNVWQTFFAKKLRKKISSKGRRYRSLSLARYLTATRKPRLVGLDNLTKIDNITTLQGAFITKEEKQDSLNLTIQRKQELTNSLKKSQIKKRSRHSWKKRSRHQFSRNHYKYRKRHTHGNGKLRVMNKKLKKFKATNELRQWWWNSFLPRYLSNLQVNNSTLTNKNVSFKPLSNTNSVPSTNMASPTTSRNLLDNLNSSNQISTSASMNQNIVTESVKVETNQVYLPEGEKSFDITSMTTTLPFYAGWDESLKKFVVTNRLLSRRDAGLSVNNNPQEINFTNPPIQGLNEGSFLYWQTEMPFNSYNIDQFITTNQSFYAPLGWRRFEFRHSILKTWVNNTKAGNNNIKKKTLIISLKNLQPLKSSQQKQNQIKTKKLVARRIKKRYKLLKQMPNQLMYSPTGPLLTEVLPSHYISVFDQQYRLPRNRYLKRNPLKTLKKTTLLALMDSSKQTNGVNKEFTLRKRVKPRRKYHRKRFIKKDGLIFPRRTKFNTNTTLTGNALITNNVNSIEEDDLRWRPSSRTKQKRKDNTRSSAASKTKSNKRVKTNPLRLRQLRRREFQQVLKPLQRYIPQNGGFTWPGDYLRLEIVEMPKLKSINIKKTSLKQKINVQPVGIMPRKYLIEKHNIKVLKKKLSQAYSTQQLTKVVQEYKNLIQNSPPAI 1995 T 2E-05 Ycf1 pdbhh F Eukaryota T 7xzi 8 H C A8J1J3_CHLRE Tic15 MDEEPPFNLALNVYKGPASIPHASAEVFGAFFLATNTALLAHMFPGKLFGSELHVRKWDPDYLASCCNEQGMRREALSGKKPNLWLLGGGPRLVNDSWERMWWNNLHWKRWKVPRTGPAFPQDMYWQ 127 T 17 TetR_C_18 pdbhh F Eukaryota T 7xzi 13 M U A8J5D4_CHLRE TIC13 MSSDVQAKLSGLLGDIGVKCTLAFAGTVAAGAAIVVPSGKQVEAASLDIYGRPPSQLLPNERRAAEFAAGHRRWKGFVDNSIYSWTRTLPGHDNPIVNPYKGPRRPQRPQQKLEEEVEAAAKQE 124 T 3.9 DUF6460 pdbhh F Eukaryota T 7xzi 14 N X Unknown peptide AAAAAFAAFAGFAFAAFAAAG 21 T 12 DUF6520 pdbhh F F 7xzj 1 A 3 A0A2K3D4W3_CHLRE Ctap3 MADGPSPIRIVLWNDGGESLAAGVEDEEQQQVLHSFADLVGSAIDAVLELPQFRHVEAVTAEAEEDEPGLSIGFDAGSGDGEVDIDNLKGRLDIAGLLLGSAQLPEELAEVAAVEVTDEEEGTTELQFTDEGLVQQLQAVVKRAKLEKRYNDWVAGVAESLGPALDAAAGGVEVTEMPVDPYDVLQAVVAQLIRVAGVSPPAPSLFSRTGALVGGVLGAPRSAVRQVTKRLGRAQRLWWRLEDVVVDGSKLALRLAVKAARPVLVGFVLHRVLKTLDRSRQLEYRLARMGPEEAREAYYEAVLGKDWKQQLQADWDKALEDVDAGLVTDEINHEKRLMTAAQLRRLEVEEWDKQRMKNFYLASFGGLRWFDQMEQALHNPLFIESRGWTDPVQNWVGQNRTYMDDLPAGQYMAGVGNAAIRIKEAELKRKLTDVERAHVLARGGAVAGGLLPQQPTDPATLAVAVGGAFVPSVAGKR 477 T 0.46 AXH pdbpssm F Eukaryota T 7xzj 2 B 4 A8J6H7_CHLRE Toc39 MGASQESELDFVPRLSFLPIEWRSIGSAFGLKDKSGAAANGRATFTVRQGVDAAELTSTGRVIDGQADVGASLKLNTLAIGVSASNITFHSGLDDPTAAAAQRSSLIPSLKLTAAKQFKRDNYIAVSYDLKHQKPELSACWTGEAGADRATLLVNVDPVMRSVKLAAAVRTPGPEWRKVLYNDETDLLEYPADDGARHTLYVQHEVRGRDLLHATRLGCRLDLGRLVNYVVDFVDYRIEENIPSFVWNVPLLPQLYSLLVPADNDEQVRHRITGWELDVSHDFARSGLLPVVAISKTSKKLLGGGTLTASYDAAAREAGVSLSRKGVSVGARVARAEGAAGGLSAGWGRPSIHVAVEPLGLLQ 363 T 1 Thyroglobulin_1 pdbpssm F Eukaryota T 7xzj 5 E A YCF78_CHLRE UNCHARACTERIZED MEMBRANE PROTEIN YCF78 MITFTFMSLVTSVKDYVEITHKLIEIEPLKNYTEFGAVFTYFIFSIGEFFKNFFSFSFLNNIWSIPIIIPDIASAMISEVSVLDGYFHNAFTFLETSVNTTTNPSLVIFEKFVIGIINSLFLILPTSTSHLITLRRFVMQGLEAGYMAGLGTLAGNFLWLASIILGWRFFVIPWLSLDIFRYLLGFVLLVKYIWDSSKERRMALEDLSKWKIFLLNFLLALTEQSCIYPFISNLSFGPDASILEGFPVDNYPQFLLIHGAYLLGILFGSFSLLQFTCWFWENPAFSIYLWITTKSSLKISTSSYYKILNFTFLYATMLCAIASIPYYGLDYTITNPIGLVPQDRILNQKKSQSDPDKLITETAFLNLNPTDKNSRIRDGVHARRERWKQRLIKYQAFDASTYDQGVYDFLTIEDLNYGFDRFWLRRKMRNHQIRFRLFPGPWMRSLKKQLNNPANPSLETSTKAASGPRVEFFRILFEQFYHPNFHDRAAMQTNPAEARNKFISTSPLASTESKKALNSTFSLGNINNSSTGIEGLVLTNTQATLLPTDLQTKRTIKPGLIYTNSALRKFVRNVNTRLNLKLLNSKETNLTTKYKSQFIYSKRWKSIFSKIQPLQNGTTRKSYQLFRNVAKQILVTPDAKSLKLITINQKLSLKERKLLELRTQYNNNSTLTTTAPLTLVRPLNVYLQKEEAFKRKLRYYGTMPMRKLTVGNQAPYFKALMKRGFYYYKPTLRWRKTLYVASLRRGFRKKSRKQRILVMPSNQQNFNNTLDNTKTNINQNNLANPLGGNEVPMYGADGENSLITKPTHSYTVLGKRASRYRHQIYKDVLQHWYYTPFNRLLMKFDVDAFINRQPKSHFLTKNEERALHIRRFLLSEHYDTLRWYTYMQHYKTMKTNIGGTKSFANRAYNQQFQGTFKKIRHLFAITPKQGDFYTLKFDQPLYNDNKLKDNLYFHEELLTDYYNGTNLQTNQTSNISVNSTTTFIDNSLRTTQLPVPSSSFDIVNQSSTLIGLTTMQNALRKNVVESTLTSLNSDGEAATSQPKLNFVYSELFVKLIKECKKRIHDQTFLKNYITHRIEKREQLNQEQTKELNKRLEKLKVWLNSDKGSISKLQNTPVQDPNISSPDKVLTTAMQKAVNESISLSGIMPSDKIKTTYGNLTNAYTIKTENAILTKLNVINQLTNNETTTQKNTLIKSIGVNKIQTVLQTIITNFKSSLYNQTQLLRVKTDKDLQWWRTKQRVITKRKSARKRDRFKKQIAVVNKKLAALSKKVETEKSNLYQTLYGNYEISDYLLRNVPTGSSAVIDSTVLRKKQDNQAYLPKETNNVQFNSFVDSNNNVWQTFFAKKLRKKISSKGRRYRSLSLARYLTATRKPRLVGLDNLTKIDNITTLQGAFITKEEKQDSLNLTIQRKQELTNSLKKSQIKKRSRHSWKKRSRHQFSRNHYKYRKRHTHGNGKLRVMNKKLKKFKATNELRQWWWNSFLPRYLSNLQVNNSTLTNKNVSFKPLSNTNSVPSTNMASPTTSRNLLDNLNSSNQISTSASMNQNIVTESVKVETNQVYLPEGEKSFDITSMTTTLPFYAGWDESLKKFVVTNRLLSRRDAGLSVNNNPQEINFTNPPIQGLNEGSFLYWQTEMPFNSYNIDQFITTNQSFYAPLGWRRFEFRHSILKTWVNNTKAGNNNIKKKTLIISLKNLQPLKSSQQKQNQIKTKKLVARRIKKRYKLLKQMPNQLMYSPTGPLLTEVLPSHYISVFDQQYRLPRNRYLKRNPLKTLKKTTLLALMDSSKQTNGVNKEFTLRKRVKPRRKYHRKRFIKKDGLIFPRRTKFNTNTTLTGNALITNNVNSIEEDDLRWRPSSRTKQKRKDNTRSSAASKTKSNKRVKTNPLRLRQLRRREFQQVLKPLQRYIPQNGGFTWPGDYLRLEIVEMPKLKSINIKKTSLKQKINVQPVGIMPRKYLIEKHNIKVLKKKLSQAYSTQQLTKVVQEYKNLIQNSPPAI 1995 T 2E-05 Ycf1 pdbhh F Eukaryota T 7xzj 8 H X Unknown peptide AAAAFAFFAGFAFAAFAAAAAAAA 24 T 2.5 DUF6134 pdbhh F F 7xzq 2 B B thiopeptide TP1 XWGFIYKTLKXXGXXXXX 18 T 2 FeoC pdbhh F T 7xzr 2 C,D C,D thiopeptide TP15 XWTIRTRGRIATXXXXXX 18 T 0.72 SWIM pdbhh F T 7y0s 2 C,D C,D I7X-TYR-TYR XYY 3 T 400 TP6A_N pdbhh F F 7y0t 2 C,D C,D I7X-PHE-PHE XFF 3 T 280 SdpA pdbhh F F 7y0u 2 C,D C,D I7X-PHE-PHE XFF 3 T 280 SdpA pdbhh F F 7y1a 3 M a LRH XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 87 F F F 7y1a 4 N A LRH XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 150 F F F 7y1c 3 E Y phage tail tubular protein B MALVSQSIKNLKGGISQQPEILRYPEQGTLQVNGWSSETEGLQKRPPMVFIKSLGPRGYLGEDPYIHLINRDEYEQYYAVFTGNDVRVFDLSGYEYQVRGDRSYVTVNNPKDNLRMVTVADYTFIVNRTRQVRENQNRTNGGTFRDNVDAIINVRGGQYGRKLEVNINGVWVSHQLPPGDNAKEDPPKVDAQAIAEAIATLLRTAHPTWTFNVGTGFIHCIAPADTTIDILETKDGYADQLINPVTHYVQSFSKLPLNAPDGYMVKIVGDTSKTADQYYVKYDKSQKVWKETVGWNISVGLEYHTMPWTLVRAADGNFDLGYHEWKDRRAGDDDTNPQPSFVNSTITDVFFFRNRLGFISGENIVMSRTSKYFEFYPPSVANYTDDDPLDVAVSHNRVSVLKYAVSFAEELLLWSDEAQFVLSANGVLSAKTAQLDLTTQFDVSDRARPYGIGRNIYYASPRSSFTSIMRYYAVQDVSSVKNAEDMTAHVPNYIPNGVYSINGSGTENFACVLTKGAPSKVFIYKFLYMDENIRQQSWSHWDFGDGVEVMAANCINSTMYMLMRNGYNVWIAAVDFKKESTDFPFEPYRFHVDAKRSYHISETAYDIETNQTVVNVKDIYGASFAKGTVAICESDGKITEYEPTGNSWDSTPDIRISGDVSGKNIVIGFLYDFQYVFSRFLIKQEQNDGTTSTEDSGRLQLRRAWVNYQNTGAFTVSVDNGSREFNYLVNARVGSTGLRLGQKATTTGQYRFPVTGNALYQKVSLSSFNASPVSIIGCGWEGNYSRRANGI 791 T 0.041 Phage_stabilise pdbhh F T 7y1f 2 B F Dynorphin YGGFLRRIRPKLK 13 T 0.025 Op_neuropeptide pdbhh F T 7y22 3 E Y phage tail tubular protein B MEVQGSLGRQIQGISQQPASVRLPGQCTDAINCSMDVVEGTKSRPGTVHIARLGDLGLIQDNTNIHHYRRGDDVEEYWMITNPLGIPDIFDKQGRKCTVTETEGAASYFNSNNPRVDYKFFTVGDTTFVVNRTKIVRARADKTPAVGGTALVFSAYGQYGTNYQIIINGVKAAEYKTASGGSASDVETIRTEVIAEQLYTNLLTWAGASDYSISRMGTTIVISSLSGASFTVDTEDGSKGKDLVAIQYKVTSTDLLPSKAPVGYLVQVWPTGSKPESRYWLKAEAADGNLVTWQETLGADEVLGFDGSTMPYIIERTNIVGGIAQFTIKQGYWDDRAVGDELTNPMPSFVDQSLSDIFMVQNRLCLAAGESCIMSRTSYFFQFFRQTVLSAVDTDPIDVFADASEVYALKHAKVLDGDTVLFSDNAQFILPGDKPLTKATALLRPTTTFEVDTNVAPVVTGEAVMFATKDGAYSNIREFYTDSYSDTKKAQPVTSHVNKLIRGGIYHMASSTNFNRLFALSEDNRSRVFVYDWLWQGTDKVQSAWHKWEFYGATIGGLYYSGETLYLIIKRNDGVFLEAMYMGDPLLSGSDQVRMDRTVTVSLTWDEATLSWKSSPLPWVPTQVEMLEAVLTNGDPAYLGGAFLFEYDANTRILSTKYGLGDTSQIWAAKVGQMYKVEFVPTDVIIRDSQDRVSYQDVPVIGLVHLNLDRYPDFTVEITNRKSGAVRVAKASNRVGGARNNVVGYVKPTSGTFSFPLRALSTDVEYRIISISPHTFQLRDIEWSGSYNPTRKRV 794 T 0.017 Phage_stabilise pdbhh F T 7y24 5 E C Octreotide XCFXKTCT 8 T 0.0016 Urotensin_II pdbhh F F 7y26 2 B B Engineered Guanine nucleotide-binding protein G(q) subunit alpha TVSAEDKAAAERSKMIDKNLREDGEKARRTLRLLLLGADNSGKSTIVKQMRILHGGSGGSGGTSGIFETKFQVDKVNFHMFDVGGQRDERRKWIQCFNDVTAIIFVVDSSDYNRLQEALNDFKSIWNNRWLRTISVILFLNKQDLLAEKVLAGKSKIEDYFPEFARYTTPEDATPEPGEDPRVTRAKYFIRKEFVDISTASGDGRHICYPHFTCAVDTENARRIFNDCKDIILQMNLREYNLV 243 T 2.5E-10 G-alpha pdb F T 7y26 3 C C Octreotide XCFXKTCT 8 T 0.0016 Urotensin_II pdbhh F F 7y27 2 B B Engineered Guanine nucleotide-binding protein G(q) subunit alpha TVSAEDKAAAERSKMIDKNLREDGEKARRTLRLLLLGADNSGKSTIVKQMRILHGGSGGSGGTSGIFETKFQVDKVNFHMFDVGGQRDERRKWIQCFNDVTAIIFVVDSSDYNRLQEALNDFKSIWNNRWLRTISVILFLNKQDLLAEKVLAGKSKIEDYFPEFARYTTPEDATPEPGEDPRVTRAKYFIRKEFVDISTASGDGRHICYPHFTCAVDTENARRIFNDCKDIILQMNLREYNLV 243 T 2.5E-10 G-alpha pdb F T 7y39 1 A,B A,B ZFAN1_HUMAN ZINC FINGER AN1-TYPE-CONTAINING PROTEIN 1 GGSGAKNSETAAKVALMKLKMHADGDKSLPQTERIYFQVFLPKGSKEKSKPMFFCHRWSIGKAIDFAASLARLKNDNNKFTAKKLRLCHITSGEALPLDHTLETWIAKEDCPLYNGGNIILEYLNDEEQFCKNVESYLE 139 T 0.036 EndoU_bacteria pdbpercent F Eukaryota T 7y3f 9 I K Unknown XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 82 F F F 7y3f 13 M,N,Q 2,3,6 IsiA XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 325 F F F 7y3f 14 O,P 4,5 ISIA_NOSS1 CP43' MQTYDNPNIKYDWWAGNARFANLSGLFIGAHVAQAALTTLWAGAFTWFEISRYKPEIPMGEQGLILLPHLATLGFGVGVSGQVVNTYPYFVIGALHLISSAVLGAGALFHTFKGPRNLKNTTGSARKFHFEWNDPKQLGLILGHHLLFLGMAALLLVGKAMFWGGLYDATTQVVRVVNHPTLNPFVIYGYQTHFASVNNLEDLVGGHIYVGLILIGGGIWHIVKEPLPWAKKLLIFSGEAILSYSLGGIALAGFVAAYFCAVNTLAYPVEFYGAPLELKFGVTPYFADTVKLADGGYSARAWLANAHFFLAFFFLQGHLWHALRAIGVDFRQIEKSLNAISSAE 344 T 1.4E-06 PSII pdb F Bacteria T 7y3j 3 C A A4_HUMAN ALA-LEU-VAL-PHE-PHE-ALA-PRO-ALA-VAL-GLY-SER KLVFFAPDVGS 11 T 0.01 Beta-APP pdbhh F Eukaryota T 7y3t 1 A,B,C,D,E,F,G G,A,B,C,D,E,F phage major capsid protein MSTPNVLTNVAVSHSGEVDSLLIEKFNGKVREQYLKGENLLSHFQVETVTGTNTVSNKYLGETEIQVLAPGQSPAATPTKADKNQVVIDTTVIARNTVAMLHDVQGDIDSLKPKIAVNQAKQLKRLEDEMVVQQLMLGGISNTKAQRTNPRVPGHGFSINVNITADTAETSPQYLAAAIEYALEQQLEQEVDISDLVILMPWKFFNALRDMDRIVDRSYTLADESTVQGFALKSFNVPVVPSNRFPKFSQGAAHHKLSNADNGFRYDTTAPMAGAVAVIFSMDALLVGRTIELTGDIFWEKKEKTFYIDTYLAEGAIPDRWEAVSVVTTARNATTGDPDGTGADDTVVTKRANRKVILTKAVS 363 F F T 7y41 1 A 3 A0QTP4_MYCS2 50S ribosomal subunit bL37 MAKRGRKKRDRKHSKANHGKRPNA 24 T 0.18 DUF6254 pdb F Bacteria T 7y43 1 A A KAT6A_HUMAN MOZ,YBF2/SAS3,SAS2 AND TIP60 PROTEIN 3,MYST-3,MONOCYTIC LEUKEMIA ZINC FINGER PROTEIN,RUNT-RELATED TRANSCRIPTION FACTOR-BINDING PROTEIN 2,ZINC FINGER PROTEIN 220 SMVKLANPLYTEWILEAIKKVKKQKQRPSEERICNAVSSSHGLDRKTVLEQLELSVKDGTILKVSNKGLNSYKDPDNPGRIALPKP 86 T 0.0018 Linker_histone pdb F Eukaryota T 7y4a 2 B,D,F,H B,D,F,H ELMO1_HUMAN PROTEIN CED-12 HOMOLOG GMPPPADIVKVAIEWPGAYPKLMEIDQKKPLSAIIKEVCDGWSLANHEYFALQHADSSNFYITEKNRNEIKNGTILRLTTSPA 83 T 0.0014 FERM_N pdb F Eukaryota T 7y4h 1 A A AcvX MRAKGISYDTGFVKNGATSRKRFDPDVVERELRIIRDDLHCTAVRVMGGDPERIEVAAAHAADLGLEVWFSPYPLELTAEEMLSLFADCAERAERLRRRGAEVVFVVGAELSLMNPGFLPGDSTDERVALLRRPDRVREQLGEVSARVNAFLGKAVQLVRERFDGKVTYASVPFERVDWAPFDIVSMDLYRSAEIADRFTDGVRDLVAQGKPVAITEFGAAGYQGAGDRGALALEIVEYGKDGPVRLKGDHARDEPGQAAYVRELLEAFDAGGVDGAFVFTFALYDHVHRPDGDPRDDLDLASYGIVKVYEDRLGATYPDMPWEPKAAFTTLAEYYRG 338 T 0.42 Cellulase pdb F T 7y4l 1 A,E AA,EA A0A5J4YXP2_PORPP Linker4 MAFVTGGLVGSSSAPALRTVCNASQSKLRMAASAADVVNAAYPKNIKNKAPVISFDGKKGVKLEMVTLQTFAGDDSEDTLFDYSSGKFMPQKPADMGIAWPSGDGRQAEMKGGKGSFNQPDLRKYGPFPDFLKRSMDL 138 T 26 GRP pdbhh F Eukaryota T 7y4l 4 K,U KA,UA A0A5J4YJY8_PORPP CaRSPs1 MAAFVSGGCGVGGQRRAWPAKGAAVARTHACPTTMVVLPVARAGLAATAKKNQYMGTSVAPEIVLTDKGSDMSRKVKTEDKKVAADQAAAMGILANMSLYASLNPVKRMTYKAKEQAPAYVKKTGNPVEDFYPSSWRNMAPVISLSANRVAVAFEKIDAASNGVKANSNNKPFWKSGATTKNYVAPEAPAQSEPETLDDAAYQRYFPARIRNKAPAMEFRRPSFANTEDPSAYFMLQKETVPLRMALAEKLLTKLGRKGDG 261 T 85 DUF6243 pdbhh F Eukaryota T 7y4l 5 SW,W cA,WA A0A5J4YX67_PORPP CaRSPs2 MWEQQRPRRCEAPAAPSSRPAERRAAARRSRAQLRMKQDDYEQWKTEFAGGFPGGEAFYKKWIEEGAKGDVPALEEELQPRSPNKKPTIYEEQMISNRGQQKGVDPTWKTLLAGGFPGGEFFFKKWIGEGAQGEVPNLDADLQPGSGSAKKTGKKEDADKSSPGGIMTPGRIMVPSGLGEEEETVDKEAPRPELYNKYFSADRLHKAPEILFEYNKTKYDRVGVRYTEVTSKASERFFPKSRMNRAPVIEISYREGAVSTASVSLSMPEISGPPALPFPVPKGDVTTTMVTDPTTGRLKLEFKVDGAAVSAYSDPRA 317 T 10 DUF5767 pdbhh F Eukaryota T 7y4l 6 AA,AB,AC,AD,AI,IL,MR,NQ,WK,XJ AB,AC,AD,AE,AJ,A2,A6,A5,Y1,YK A0A5J4YX19_PORPP R-phycoerythrin gamma chain, chloroplastic MAAFVSGFHGVQVGAPAENKLVCRAAKPAQLTMLTGYDSKSSPNFPNRAATRERRTVSFNARVARNKSQAKKILEKADEFFARSVTMQYKAFACPNGVYDIQCTEGTVKGAAYEKRAMAVSAAFRAKQASPAAKARALFENRRHAIIASHECQHEEDLFVRFPKLSAAYMMGKTEAMRTCSRYVVPDSLEEEYMAASVDRQMKERACPGGVYASSCVEGNAKGQAEQARVAALATAFRSAQKSASKTTAERYSSAAYGRDHFAHGCSYEESVFNTYPATAAAMRSKSYNY 290 T 0.14 Amidohydro_1 pdbpercent F Eukaryota T 7y4l 12 AG,LV,MV,PO,TZ,UZ AH,w8,x8,A4,wF,xF A0A5J4YZM7_PORPP R-phycoerythrin gamma chain, chloroplastic MAAFVGSAASAFTGASAVKANEKRSVCSLQMVAMPQTGLVNSKFSARMAKKTAKQTKNKVDEYMARSVQRQYKQAAVATGVYGTQCTEGTVKGAAEASRSAALSRQFRIKQRSAFSKAHDLFEFRKHAIIAAAGCSYEEKMVTRFPKLAAAMVLGQTEMMRTCSRYVVPESVEEEYMAASVDKQMKRRGAPGGVYSLSCAEGVAKGQAEIARVSALGAAYRAASKSASAVTAERYNSMAYGRVHFAHGCSYEEQQFNKYPAAAAAMRSDSYGY 273 T 1.6 rRNA_methylase pdbpercent F Eukaryota T 7y4l 13 BG,NV,QO,VZ BH,y8,B4,yF A0A5J4YZH3_PORPP R-phycoerythrin gamma chain, chloroplastic METAFVSGFMGKAAVAKFGATAVCDKTARRSSSSNSQVHMVTGAVSSVNMRRFQRVPKVSGFSAKVTKKNVNKALDKADMFFAKSVTMEGKAAAIPYGVYGIQCMEGSAKGMAHEKRAMALSAAFRMNQRSAAEKTGAMYENRRLALILAQNDHQEKQYIKYPKLAAAALMASTEVTRACQRYAVPESIEEEFLAASVDKVNKMRGTTASGVYKSSCVEGNAKGQAEQARVAALAVAFRSAQKSASQFAAERYAQSKYGRDLFSSTHFEEGYANTYPAMAAAKRASSYGY 290 T 0.011 THF_DHG_CYH pdb F Eukaryota T 7y4l 14 BW,MH,OW,ZH M9,MI,Z9,ZI A0A5J4YNU6_PORPP R-phycoerythrin gamma chain, chloroplastic MAAFVVGAQVSSGFMGCAAPRKEQTRVGGAAKAAVSMRIIIQDKNSYRKYQTNTSTSKWDKLLNTKPMKRQVQPNPPTNETRALNLGNTFRSPAFKFLGTLKRSKDPSGLRLGFYGRKADDFMARSIAMQAKASAAGSGVYTTQCSEGASKGMAENARTASLAKQFRQAQRSAREMSFDYYEGRKYAMKAVGHICNYEEKIFQQYNKTAAAYVMGKQETLLSCDRYAQPANKAEEYIQKSVQMQMKKRSIPYGVYTTSCADGTVKGMAENARVAKESANFRARQMSAGAKAAARFNARRVANDWHNNGCNYEEKLTSRFPAAASSVRPTTNRY 333 T 0.022 APC_u5 pdb F Eukaryota T 7y4l 16 LM,MN 23,Z3 A0A5J4YTV6_PORPP Lrc4 MAFVACGPLRAGEGGARLGARKAACSMQLAPPGIPPGEDARNNQSLRQYVARPVETYQKRSFATPLPLTWTGETETVGAFDVVVPPQEKDLPVSGEATSAFVKYSDMVRAERKAALQALLSASAAGEGRPTCGAEGRKFVSNANPVLVNGVKCVEYWRK 159 T 14 CRM1_repeat unphh F Eukaryota T 7y4l 17 MM,NN 33,a3 A0A5J4Z2M2_PORPP LRC5 MAFVSGAGVAVPAGAKASAPLCALRMSGYGDYSYSTDRTKGHVNQYYVDKARSRSDWGNRNVLPASEGDAVLGRTAKGAVAVPEFGIPQLDDPVLGFGPDSMVDPRIAEADGAVWRWDAGFVDESMTLASCADISDEAVADEAFAKFRGSVLAERGAMITKAESATASVITSLRDGLYSGEAQLLTASGQRLANVAGQEKIATISGYTWDGQPQTEIPGKPFVKSIGAMDYMDGVEGGDVVAAKVGAFWKPKAPKEVPYKRPMGANTPELPYNTVPRLVQAAGLAVQE 288 T 14 DUF5953 pdbhh F Eukaryota T 7y4l 30 MAA,NAA Y3,53 A0A5J4Z365_PORPP LPP2 MVVVCASRKSGRAVVPAMVPALRVELVVSKQMVMGLAALFFQVQRTALMAKLELPKFSMPSMPSMPKLTVPKLQMPKLGGGEKKDKAAKPSPSAPKTTIRPSGGVKVRAAVGNKSNVDAPSFKGSNMELADSGADYKAFPKRRMPGANMQGFLDMAKGMKPK 162 T 0.14 CCDC71L unppercent F Eukaryota T 7y5e 3 AB,JY,LT,NN,OL,WO,Y,ZB,ZD,ZE AD,Y7,A4,AQ,AO,A1,YA,AE,AG,AH A0A5J4YX19_PORPP R-phycoerythrin gamma chain, chloroplastic MAAFVSGFHGVQVGAPAENKLVCRAAKPAQLTMLTGYDSKSSPNFPNRAATRERRTVSFNARVARNKSQAKKILEKADEFFARSVTMQYKAFACPNGVYDIQCTEGTVKGAAYEKRAMAVSAAFRAKQASPAAKARALFENRRHAIIASHECQHEEDLFVRFPKLSAAYMMGKTEAMRTCSRYVVPDSLEEEYMAASVDRQMKERACPGGVYASSCVEGNAKGQAEQARVAALATAFRSAQKSASKTTAERYSSAAYGRDHFAHGCSYEESVFNTYPATAAAMRSKSYNY 290 T 0.14 Amidohydro_1 pdbpercent F Eukaryota T 7y5e 4 DA,Z EB,AB A0A5J4YXP2_PORPP Linker4 MAFVTGGLVGSSSAPALRTVCNASQSKLRMAASAADVVNAAYPKNIKNKAPVISFDGKKGVKLEMVTLQTFAGDDSEDTLFDYSSGKFMPQKPADMGIAWPSGDGRQAEMKGGKGSFNQPDLRKYGPFPDFLKRSMDL 138 T 26 GRP pdbhh F Eukaryota T 7y5e 5 JA,TA KB,UB A0A5J4YJY8_PORPP CaRSPs1 MAAFVSGGCGVGGQRRAWPAKGAAVARTHACPTTMVVLPVARAGLAATAKKNQYMGTSVAPEIVLTDKGSDMSRKVKTEDKKVAADQAAAMGILANMSLYASLNPVKRMTYKAKEQAPAYVKKTGNPVEDFYPSSWRNMAPVISLSANRVAVAFEKIDAASNGVKANSNNKPFWKSGATTKNYVAPEAPAQSEPETLDDAAYQRYFPARIRNKAPAMEFRRPSFANTEDPSAYFMLQKETVPLRMALAEKLLTKLGRKGDGSEAEEEKVKPQKKAAKKDAKDDAKDDE 288 T 89 DUF6243 pdbhh F Eukaryota T 7y5e 6 BAA,VA cB,WB A0A5J4YX67_PORPP CaRSP2 MWEQQRPRRCEAPAAPSSRPAERRAAARRSRAQLRMKQDDYEQWKTEFAGGFPGGEAFYKKWIEEGAKGDVPALEEELQPRSPNKKPTIYEEQMISNRGQQKGVDPTWKTLLAGGFPGGEFFFKKWIGEGAQGEVPNLDADLQPGSGSAKKTGKKEDADKSSPGGIMTPGRIMVPSGLGEEEETVDKEAPRPELYNKYFSADRLHKAPEILFEYNKTKYDRVGVRYTEVTSKASERFFPKSRMNRAPVIEISYREGAVSTASVSLSMPEISGPPALPFPVPKGDVTTTMVTDPTTGRLKLEFKVDGAAVSAYSDPRATEAYNKMIRK 327 T 0.016 Fz pdbpssm F Eukaryota T 7y5e 7 GAA,LY,MY,ZA aC,A8,a8,AC LRH XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 237 F F F 7y5e 13 DCA,ECA,QEA,REA,YJ,ZG wF,xF,wK,xK,AM,AJ A0A5J4YZM7_PORPP R-phycoerythrin gamma chain, chloroplastic MAAFVGSAASAFTGASAVKANEKRSVCSLQMVAMPQTGLVNSKFSARMAKKTAKQTKNKVDEYMARSVQRQYKQAAVATGVYGTQCTEGTVKGAAEASRSAALSRQFRIKQRSAFSKAHDLFEFRKHAIIAAAGCSYEEKMVTRFPKLAAAMVLGQTEMMRTCSRYVVPESVEEEYMAASVDKQMKRRGAPGGVYSLSCAEGVAKGQAEIARVSALGAAYRAASKSASAVTAERYNSMAYGRVHFAHGCSYEEQQFNKYPAAAAAMRSDSYGY 273 T 1.6 rRNA_methylase pdbpercent F Eukaryota T 7y5e 14 AH,FCA,SEA,ZJ BJ,yF,yK,BM A0A5J4YZH3_PORPP R-phycoerythrin gamma chain, chloroplastic METAFVSGFMGKAAVAKFGATAVCDKTARRSSSSNSQVHMVTGAVSSVNMRRFQRVPKVSGFSAKVTKKNVNKALDKADMFFAKSVTMEGKAAAIPYGVYGIQCMEGSAKGMAHEKRAMALSAAFRMNQRSAAEKTGAMYENRRLALILAQNDHQEKQYIKYPKLAAAALMASTEVTRACQRYAVPESIEEEFLAASVDKVNKMRGTTASGVYKSSCVEGNAKGQAEQARVAALAVAFRSAQKSASQFAAERYAQSKYGRDLFSSTHFEEGYANTYPAMAAAKRASSYGY 290 T 0.011 THF_DHG_CYH pdb F Eukaryota T 7y5e 28 AX,BW,HFA,MJ n6,N6,nL,NL Psb34 XXXXXXXXXXXXXXXXXXXXXXXXXXXXX 29 F F F 7y5e 30 CX,DW,JFA,OJ q6,Q6,qL,QL A0A5J4Z679_PORPP PsbQ' MAFVSGFAGAQIASGSSAQVCRAPAAVRMSAAGEEMSRRDLMAGVATAAAGLLVIPGAAMAGDAPKQSFFGGSSASSPFVYNMKQTGEILYKPLNDEDLQFHKNVLEKSRGELDRTSEQIARKSWDDMRGVIRNQMYNMRHSQLRLIESVESAEKQKAAKKNYNDLKKSLEEMDLAARNKKQEDARKFRASALKAFDSFTTSVGI 205 T 0.00097 TAT_signal unppercent F Eukaryota T 7y5e 32 FW,QJ S6,SL LPP1 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 119 F F F 7y5e 36 HX,JW,OFA,UJ w6,W6,wL,WL PsbW XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 46 F F F 7y5e 53 LL,UQ ON,O2 A0A5J4YUC8_PORPP Photosystem I subunit O YEISEGSSFDANPLVIGLALIGWVVPSSVPSNIPLLDGKGLTPAFVASISDNLSRWPQGPQLADPFWLLMGMWHVGLFATLIFGTVGYNLRK 92 T 35 Mif2_N pdbhh F Eukaryota T 7y5e 54 ML,VQ RN,R2 A0A5J4YR43_PORPP PsaR DNYPSSEVLGLGKNIPSALYVLISIACFAIGVTSVAKSNLITPLTPESINPQYVVGSLLLPISWGAHTAAFIQKVNKK 78 T 0.022 Antimicrobial22 pdbpercent F Eukaryota T 7y5e 55 NL,WQ ZN,Z2 LPS1 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 66 F F F 7y5e 65 XQ,YR 23,Z3 A0A5J4YTV6_PORPP Lrc4 MAFVACGPLRAGEGGARLGARKAACSMQLAPPGIPPGEDARNNQSLRQYVARPVETYQKRSFATPLPLTWTGETETVGAFDVVVPPQEKDLPVSGEATSAFVKYSDMVRAERKAALQALLSASAAGEGRPTCGAEGRKFVSNANPVLVNGVKCVEYWRK 159 T 14 CRM1_repeat unphh F Eukaryota T 7y5e 66 YQ,ZR 33,a3 A0A5J4Z2M2_PORPP LRC5 MAFVSGAGVAVPAGAKASAPLCALRMSGYGDYSYSTDRTKGHVNQYYVDKARSRSDWGNRNVLPASEGDAVLGRTAKGAVAVPEFGIPQLDDPVLGFGPDSMVDPRIAEADGAVWRWDAGFVDESMTLASCADISDEAVADEAFAKFRGSVLAERGAMITKAESATASVITSLRDGLYSGEAQLLTASGQRLANVAGQEKIATISGYTWDGQPQTEIPGKPFVKSIGAMDYMDGVEGGDVVAAKVGAFWKPKAPKEVPYKRPMGANTPELPYNTVPRLVQAAGLAVQE 288 T 14 DUF5953 pdbhh F Eukaryota T 7y5e 74 AV,CHA,DHA,NV M5,Z9,M9,Z5 A0A5J4YNU6_PORPP R-phycoerythrin gamma chain, chloroplastic MAAFVVGAQVSSGFMGCAAPRKEQTRVGGAAKAAVSMRIIIQDKNSYRKYQTNTSTSKWDKLLNTKPMKRQVQPNPPTNETRALNLGNTFRSPAFKFLGTLKRSKDPSGLRLGFYGRKADDFMARSIAMQAKASAAGSGVYTTQCSEGASKGMAENARTASLAKQFRQAQRSAREMSFDYYEGRKYAMKAVGHICNYEEKIFQQYNKTAAAYVMGKQETLLSCDRYAQPANKAEEYIQKSVQMQMKKRSIPYGVYTTSCADGTVKGMAENARVAKESANFRARQMSAGAKAAARFNARRVANDWHNNGCNYEEKLTSRFPAAASSVRPTTNRY 333 T 0.022 APC_u5 pdb F Eukaryota T 7y5e 80 EHA,FHA Y3,53 A0A5J4Z365_PORPP LPP2 MVVVCASRKSGRAVVPAMVPALRVELVVSKQMVMGLAALFFQVQRTALMAKLELPKFSMPSMPSMPKLTVPKLQMPKLGGGEKKDKAAKPSPSAPKTTIRPSGGVKVRAAVGNKSNVDAPSFKGSNMELADSGADYKAFPKRRMPGANMQGFLDMAKGMKPK 162 T 0.14 CCDC71L unppercent F Eukaryota T 7y5e 81 GHA,HHA,IHA,JHA 46,16,4L,1L CNT XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 118 F F F 7y5x 2 B B PSN2_HUMAN PS-2,AD3LP,AD5,E5-1,STM-2 MLTFMASDSEEEVCDERTSLMSAESPTPRSCQEGRQGPEDGENTAQWRSQENEEDGEEDPDRYVCSGVPGRPPGLEEELTLKYGAKHVIMLFVPVTLCMIVVVATIKSVRFYTEKNGQLIYTPFTEDTPSVGQRLLNSVLNTLIMISVIVVMTIFLVVLYKYRCYKFIHGWLIMSSLMLLFLFTYIYLGEVLKTYNVAMDYPTLLLTVWNFGAVGMVCIHWKGPLVLQQAYLIMISALMALVFIKYLPEWSAWVILGAISVYDLVAVLCPKGPLRMLVETAQERNEPIFPALIYSSAMVWTVGMAKLDPSSQGALQLPYDPEMEEDSYDSFGEPSYPEVFEPPLTGYPGEELEEEEERGVKLGLGDFIFYSVLVGKAAATGSGDWNTTLACFVAILIGLCLTLLLLAVFKKALPALPISITFGLIFYFSTDNLVRPFMDTLASHQLYI 448 T 1.5E-46 Presenilin pdb F Eukaryota T 7y5z 2 B B PSN2_HUMAN PS-2,AD3LP,AD5,E5-1,STM-2 MLTFMASDSEEEVCDERTSLMSAESPTPRSCQEGRQGPEDGENTAQWRSQENEEDGEEDPDRYVCSGVPGRPPGLEEELTLKYGAKHVIMLFVPVTLCMIVVVATIKSVRFYTEKNGQLIYTPFTEDTPSVGQRLLNSVLNTLIMISVIVVMTIFLVVLYKYRCYKFIHGWLIMSSLMLLFLFTYIYLGEVLKTYNVAMDYPTLLLTVWNFGAVGMVCIHWKGPLVLQQAYLIMISALMALVFIKYLPEWSAWVILGAISVYDLVAVLCPKGPLRMLVETAQERNEPIFPALIYSSAMVWTVGMAKLDPSSQGALQLPYDPEMEEDSYDSFGEPSYPEVFEPPLTGYPGEELEEEEERGVKLGLGDFIFYSVLVGKAAATGSGDWNTTLACFVAILIGLCLTLLLLAVFKKALPALPISITFGLIFYFSTDNLVRPFMDTLASHQLYI 448 T 1.5E-46 Presenilin pdb F Eukaryota T 7y65 4 D L C5apep peptide XKPXXX 6 T 120 DUF4150 pdbhh F F 7y66 6 F E BM213 peptide XFKPLAAXR 9 T 20 T3SS_HrpK1 pdbhh F T 7y67 6 F L C089 peptide XKPXWX 6 T 1.4 YycI pdbhh F F 7y7a 1 A,AD,AMB,BGB,IXA,LF,LI,MBA,MIA,MW,NA,OAB,OIB,WR,YT,ZDB,ZEA,ZFB,ZJA,ZY A1,A4,An,Aj,Ac,Y5,A8,AM,AR,AI,A2,Ae,Al,YD,AF,Yg,AP,Yi,AS,AK A0A5J4YX19_PORPP R-phycoerythrin gamma chain, chloroplastic MAAFVSGFHGVQVGAPAENKLVCRAAKPAQLTMLTGYDSKSSPNFPNRAATRERRTVSFNARVARNKSQAKKILEKADEFFARSVTMQYKAFACPNGVYDIQCTEGTVKGAAYEKRAMAVSAAFRAKQASPAAKARALFENRRHAIIASHECQHEEDLFVRFPKLSAAYMMGKTEAMRTCSRYVVPDSLEEEYMAASVDRQMKERACPGGVYASSCVEGNAKGQAEQARVAALATAFRSAQKSASKTTAERYSSAAYGRDHFAHGCSYEESVFNTYPATAAAMRSKSYNY 290 T 0.14 Amidohydro_1 pdbpercent F Eukaryota T 7y7a 9 EXA,FXA,IIA,INA,JH,JIA,JNA,KH,LOB,LP,MOA,NB wb,xb,wQ,wT,w6,xQ,xT,x6,Ap,AC,AV,A3 A0A5J4YZM7_PORPP R-phycoerythrin gamma chain, chloroplastic MAAFVGSAASAFTGASAVKANEKRSVCSLQMVAMPQTGLVNSKFSARMAKKTAKQTKNKVDEYMARSVQRQYKQAAVATGVYGTQCTEGTVKGAAEASRSAALSRQFRIKQRSAFSKAHDLFEFRKHAIIAAAGCSYEEKMVTRFPKLAAAMVLGQTEMMRTCSRYVVPESVEEEYMAASVDKQMKRRGAPGGVYSLSCAEGVAKGQAEIARVSALGAAYRAASKSASAVTAERYNSMAYGRVHFAHGCSYEEQQFNKYPAAAAAMRSDSYGY 273 T 1.6 rRNA_methylase pdbpercent F Eukaryota T 7y7a 10 GXA,KIA,KNA,LH,MOB,MP,NOA,OB yb,yQ,yT,y6,Bp,BC,BV,B3 A0A5J4YZH3_PORPP R-phycoerythrin gamma chain, chloroplastic METAFVSGFMGKAAVAKFGATAVCDKTARRSSSSNSQVHMVTGAVSSVNMRRFQRVPKVSGFSAKVTKKNVNKALDKADMFFAKSVTMEGKAAAIPYGVYGIQCMEGSAKGMAHEKRAMALSAAFRMNQRSAAEKTGAMYENRRLALILAQNDHQEKQYIKYPKLAAAALMASTEVTRACQRYAVPESIEEEFLAASVDKVNKMRGTTASGVYKSSCVEGNAKGQAEQARVAALAVAFRSAQKSASQFAAERYAQSKYGRDLFSSTHFEEGYANTYPAMAAAKRASSYGY 290 T 0.011 THF_DHG_CYH pdb F Eukaryota T 7y7a 35 II,IOB O7,Oo A0A5J4YUC8_PORPP Photosystem I subunit O YEISEGSSFDANPLVIGLALIGWVVPSSVPSNIPLLDGKGLTPAFVASISDNLSRWPQGPQLADPFWLLMGMWHVGLFATLIFGTVGYNLRK 92 T 35 Mif2_N pdbhh F Eukaryota T 7y7a 36 JI,JOB R7,Ro A0A5J4YR43_PORPP PsaR MAFINGAALGGGAKVAFSGKAVASRRVVAASTANKRSVVVKMADNYPSSEVLGLGKNIPSALYVLISIACFAIGVTSVAKSNLITPLTPESINPQYVVGSLLLPISWGAHTAAFIQKVNKK 121 T 0.093 Antimicrobial22 pdbpssm F Eukaryota T 7y7a 37 KI,KOB Z7,Zo LPS1 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 66 F F F 7y7a 51 KL,KT,LK,LS,MEA,NDA,NLB,OKB,VTA,WSA n9,nE,N9,NE,nO,NO,nm,Nm,nZ,NZ Psb34 XXXXXXXXXXXXXXXXXXXXXXXXXXXXX 29 F F F 7y7a 53 ML,MT,NK,NS,OEA,PDA,PLB,QKB,XTA,YSA q9,qE,Q9,QE,qO,QO,qm,Qm,qZ,QZ A0A5J4Z679_PORPP PsbQ' MAFVSGFAGAQIASGSSAQVCRAPAAVRMSAAGEEMSRRDLMAGVATAAAGLLVIPGAAMAGDAPKQSFFGGSSASSPFVYNMKQTGEILYKPLNDEDLQFHKNVLEKSRGELDRTSEQIARKSWDDMRGVIRNQMYNMRHSQLRLIESVESAEKQKAAKKNYNDLKKSLEEMDLAARNKKQEDARKFRASALKAFDSFTTSVGI 205 T 0.00097 TAT_signal unppercent F Eukaryota T 7y7a 55 ATA,OT,PK,PS,RDA,SKB SZ,sE,S9,SE,SO,Sm LPP1 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 119 F F F 7y7a 59 CUA,ETA,RL,ST,TEA,TK,TS,ULB,VDA,WKB wZ,WZ,w9,wE,wO,W9,WE,wm,WO,Wm PsbW XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 46 F F F 7y7a 63 GUA,HUA,VL,WL,WT,XEA,XT,YEA,YLB,ZLB pZ,PZ,p9,P9,pE,pO,PE,PO,pm,Pm CNT XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 118 F F F 7y7a 64 BM,DQA,XL,ZPA EA,EW,AA,AW A0A5J4YXP2_PORPP Linker4 DVVNAAYPKNIKNKAPVISFDGKKGVKLEMVTLQTFAGDDSEDTLFDYSSGKFMPQKPADMGIAWPSGDGRQAEMKGGKGSFNQPDLRKYGPFPDFLKRSMDL 103 T 23 ISAV_HA pdbhh F Eukaryota T 7y7a 65 HM,JQA,RM,TQA KA,KW,UA,UW A0A5J4YJY8_PORPP CaRSP1 VVLPVARAGLAATAKKNQYMGTSVAPEIVLTDKGSDMSRKVKTEDKKVAADQAAAMGILANMSLYASLNPVKRMTYKAKEQAPAYVKKTGNPVEDFYPSSWRNMAPVISLSANRVAVAFEKIDAASNGVKANSNNKPFWKSGATTKNYVAPEAPAQSEPETLDDAAYQRYFPARIRNKAPAMEFRRPSFANTEDPSAYFMLQKETVPLRMALAEKLLTKLGRKGDG 226 T 100 TBC1D23_C pdbhh F Eukaryota T 7y7a 66 BRA,TM,VQA,ZM cW,WA,WW,cA A0A5J4YX67_PORPP CaRSP2 GEEEETVDKEAPRPELYNKYFSADRLHKAPEILFEYNKTKYDRVGVRYTEVTSKASERFFPKSRMNRAPVIEISYREGAVSTASVSLSMPEISGPPALPFPVPKGDVTTTMVTDPTTGRLKLEFKVDGAAVSAYSDPRA 139 T 2.8 DUF5767 pdbhh F Eukaryota T 7y7a 68 FN,IO,IYA,LZA 2B,ZB,2d,Zd A0A5J4YTV6_PORPP Lrc4 MAFVACGPLRAGEGGARLGARKAACSMQLAPPGIPPGEDARNNQSLRQYVARPVETYQKRSFATPLPLTWTGETETVGAFDVVVPPQEKDLPVSGEATSAFVKYSDMVRAERKAALQALLSASAAGEGRPTCGAEGRKFVSNANPVLVNGVKCVEYWRK 159 T 14 CRM1_repeat unphh F Eukaryota T 7y7a 69 GN,JO,JYA,MZA 3B,aB,3d,ad A0A5J4Z2M2_PORPP LRC5 MAFVSGAGVAVPAGAKASAPLCALRMSGYGDYSYSTDRTKGHVNQYYVDKARSRSDWGNRNVLPASEGDAVLGRTAKGAVAVPEFGIPQLDDPVLGFGPDSMVDPRIAEADGAVWRWDAGFVDESMTLASCADISDEAVADEAFAKFRGSVLAERGAMITKAESATASVITSLRDGLYSGEAQLLTASGQRLANVAGQEKIATISGYTWDGQPQTEIPGKPFVKSIGAMDYMDGVEGGDVVAAKVGAFWKPKAPKEVPYKRPMGANTPELPYNTVPRLVQAAGLAVQE 288 T 14 DUF5953 pdbhh F Eukaryota T 7y7a 77 JP,KP,MAB,NAB YB,5B,5d,Yd A0A5J4Z365_PORPP LPP2 MVVVCASRKSGRAVVPAMVPALRVELVVSKQMVMGLAALFFQVQRTALMAKLELPKFSMPSMPSMPKLTVPKLQMPKLGGGEKKDKAAKPSPSAPKTTIRPSGGVKVRAAVGNKSNVDAPSFKGSNMELADSGADYKAFPKRRMPGANMQGFLDMAKGMKPK 162 T 0.14 CCDC71L unppercent F Eukaryota T 7y7a 78 AV,IRA,JRA,MCA,NCA,VRA,WRA,ZU aG,AX,aX,AN,aN,AY,aY,AG LRH XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 237 F F F 7y7a 79 ADB,AFB,HVA,LW,NCB,NEB,UUA,YV Zf,Zh,Za,ZH,Mf,Mh,Ma,MH A0A5J4YNU6_PORPP R-phycoerythrin gamma chain, chloroplastic MAAFVVGAQVSSGFMGCAAPRKEQTRVGGAAKAAVSMRIIIQDKNSYRKYQTNTSTSKWDKLLNTKPMKRQVQPNPPTNETRALNLGNTFRSPAFKFLGTLKRSKDPSGLRLGFYGRKADDFMARSIAMQAKASAAGSGVYTTQCSEGASKGMAENARTASLAKQFRQAQRSAREMSFDYYEGRKYAMKAVGHICNYEEKIFQQYNKTAAAYVMGKQETLLSCDRYAQPANKAEEYIQKSVQMQMKKRSIPYGVYTTSCADGTVKGMAENARVAKESANFRARQMSAGAKAAARFNARRVANDWHNNGCNYEEKLTSRFPAAASSVRPTTNRY 333 T 0.022 APC_u5 pdb F Eukaryota T 7y7b 21 U O PsaO FEISDGVEFDLNPLVLAISFLGWSLPGLLPSNIPLYGGKGLTTALFAEIGEHLQTFPAPPPIGDPFWVILFIWHSGLFATMIFGTIGYNGYGPKSTTKY 99 T 34 G2F pdbhh F T 7y7b 22 V R PsaR MVRALCFLALIASAAAFSTAPGLALRSSVRPATSTKTPMKMAGYSPVPSPDNTKETYWETKAPSSQVLGIGKDVSSGNYIVASVVAAVVGAACTGQCIPLTVSPNPVFILGSFLLPYSWALHVAAWIQRNNGK 133 T 0.042 MRP_L53 pdb F T 7y7b 23 W X Unk1 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 164 F F F 7y7b 24 X Z ACPI-S MAQAAPPSKLPENMAKKNALKVNKEQWGIEEAIKVDAKAAAPAPKAAAPAPKKAAPKKGAAPAEATVGFSGVPSDFCRPAPTAFPAEPAGMTVFGARGPRAEGHRDKFGSRHAAVSLACAALLWQPISQAGMYSIDSGSLAKKSFSEMEVPGFGDAKKVPTIESFFPFTKNGFDASPALFGKDSMIVFENPLGKCGAYASSCHTFLDEMGDMLKATPQEMPRSKAAPTYSFPWMYDHAAWKK 242 T 0.53 KAR9 pdbpssm F T 7y7i 7 K,L K,L A0A3Q3AQL2_CHICK Myb-like domain-containing protein SSNGIYTRSGRLVKPPLSFWCGEREFVDRELNVTIQKGGTDYLS 44 T 0.8 DUF4764 pdbhh F T 7y8a 21 U O PsaO MKVAFLVLLAAATANAFAPTAAFLPKAHGIAASKPAMALRAAPRAVAKPLAVQAKFEISDGVEFDLNPLVLAISFLGWSLPGLLPSNIPLYGGKGLTTALFAEIGEHLQTFPAPPPIGDPFWVILFIWHSGLFATMIFGTIGYNGYGPKSTTKY 154 T 43 DMP1 pdbhh F T 7y8a 22 V R PsaR MVRALCFLALIASAAAFSTAPGLALRSSVRPATSTKTPMKMAGYSPVPSPDNTKETYWETKAPSSQVLGIGKDVSSGNYIVASVVAAVVGAACTGQCIPLTVSPNPVFILGSFLLPYSWALHVAAWIQRNNGK 133 T 0.042 MRP_L53 pdb F T 7y8a 23 W X Unk1 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 164 F F F 7y8a 24 X Z ACPI-S MAQAAPPSKLPENMAKKNALKVNKEQWGIEEAIKVDAKAAAPAPKAAAPAPKKAAPKKGAAPAEATVGFSGVPSDFCRPAPTAFPAEPAGMTVFGARGPRAEGHRDKFGSRHAAVSLACAALLWQPISQAGMYSIDSGSLAKKSFSEMEVPGFGDAKKVPTIESFFPFTKNGFDASPALFGKDSMIVFENPLGKCGAYASSCHTFLDEMGDMLKATPQEMPRSKAAPTYSFPWMYDHAAWKK 242 T 0.53 KAR9 pdbpssm F T 7y8f 2 B,D C,D Grip peptide AILHRLLQ 8 T 0.0019 SRC-1 pdbhh F T 7y8g 2 B,D C,D Grip peptide AILHRLLQ 8 T 0.0019 SRC-1 pdbhh F T 7y8r 5 I Z Unknown XXXXXXXXXXXXXXXXXXXXXXXXXXXXX 29 F F F 7y8r 12 P a Unknown XXXXXXXXXXXX 12 F F F 7y8r 17 V T Unkown XXXXXXXXXXXXXXXXXXXXXX 22 F F F 7y8r 19 X V Unknown XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 102 F F F 7y8r 22 AA W Unknown XXXXXXXXXXXXXXXXXXXXXXXX 24 F F F 7y8w 2 E,F,K,L,O,P,S,T E,F,K,L,O,P,S,T SAO1_CAEEL Isoform b of Suppressor of aph-1 GPGSEFMQHANVATDQVVMKSVECQTEPVE 30 T 14 XRN_N pdbhh F Eukaryota T 7y9c 2 C,D C,D SLC39A5 GHQGHSHGHQGGY 13 T 1.9 UreE_C pdbhh F F 7y9x 1 A B A0A401FT52_9DELT CHAT domain-containing protein MSNPIRDIQDRLKTAKFDNKDDMMNLASSLYKYEKQLMDSSEATLCQQGLSNRPNSFSQLSQFRDSDIQSKAGGQTGKFWQNEYEACKNFQTHKERRETLEQIIRFLQNGAEEKDADDLLLKTLARAYFHRGLLYRPKGFSVPARKVEAMKKAIAYCEIILDKNEEESEALRIWLYAAMELRRCGEEYPENFAEKLFYLANDGFISELYDIRLFLEYTEREEDNNFLDMILQENQDRERLFELCLYKARACFHLNQLNDVRIYGESAIDNAPGAFADPFWDELVEFIRMLRNKKSELWKEIAIKAWDKCREKEMKVGNNIYLSWYWARQRELYDLAFMAQDGIEKKTRIADSLKSRTTLRIQELNELRKDAHRKQNRRLEDKLDRIIEQENEARDGAYLRRNPPCFTGGKREEIPFARLPQNWIAVHFYLNELESHEGGKGGHALIYDPQKAEKDQWQDKSFDYKELHRKFLEWQENYILNEEGSADFLVTLCREIEKAMPFLFKSEVIPEDRPVLWIPHGFLHRLPLHAAMKSGNNSNIEIFWERHASRYLPAWHLFDPAPYSREESSTLLKNFEEYDFQNLENGEIEVYAPSSPKKVKEAIRENPAILLLLCHGEADMTNPFRSCLKLKNKDMTIFDLLTVEDVRLSGSRILLGACESDMVPPLEFSVDEHLSVSGAFLSHKAGEIVAGLWTVDSEKVDECYSYLVEEKDFLRNLQEWQMAETENFRSENDSSLFYKIAPFRIIGFPAE 751 T 1.7E-07 CHAT pdbpercent F Bacteria T 7y9y 4 D B A0A401FT52_9DELT CHAT domain-containing protein MSNPIRDIQDRLKTAKFDNKDDMMNLASSLYKYEKQLMDSSEATLCQQGLSNRPNSFSQLSQFRDSDIQSKAGGQTGKFWQNEYEACKNFQTHKERRETLEQIIRFLQNGAEEKDADDLLLKTLARAYFHRGLLYRPKGFSVPARKVEAMKKAIAYCEIILDKNEEESEALRIWLYAAMELRRCGEEYPENFAEKLFYLANDGFISELYDIRLFLEYTEREEDNNFLDMILQENQDRERLFELCLYKARACFHLNQLNDVRIYGESAIDNAPGAFADPFWDELVEFIRMLRNKKSELWKEIAIKAWDKCREKEMKVGNNIYLSWYWARQRELYDLAFMAQDGIEKKTRIADSLKSRTTLRIQELNELRKDAHRKQNRRLEDKLDRIIEQENEARDGAYLRRNPPCFTGGKREEIPFARLPQNWIAVHFYLNELESHEGGKGGHALIYDPQKAEKDQWQDKSFDYKELHRKFLEWQENYILNEEGSADFLVTLCREIEKAMPFLFKSEVIPEDRPVLWIPHGFLHRLPLHAAMKSGNNSNIEIFWERHASRYLPAWHLFDPAPYSREESSTLLKNFEEYDFQNLENGEIEVYAPSSPKKVKEAIRENPAILLLLCHGEADMTNPFRSCLKLKNKDMTIFDLLTVEDVRLSGSRILLGACESDMVPPLEFSVDEHLSVSGAFLSHKAGEIVAGLWTVDSEKVDECYSYLVEEKDFLRNLQEWQMAETENFRSENDSSLFYKIAPFRIIGFPAE 751 T 1.7E-07 CHAT pdbpercent F Bacteria T 7yae 6 F D DPN-CYS-PHE-DTR-LYS-THR-CYS-THO XCFXKTCX 8 T 0.0019 Urotensin_II pdbhh F F 7yca 21 U O A0A096P9N0_OSTTA PsaO MLALAPINVQRRSPLGVSARGRKTQSKARFAVKVNAANADLKFDDDWKKSNVAVHLASLFGWVIPSASPCPAFPDNASLFKVFSDRISENLAHFPTGPSADDPIWLYMLTWHMGLFACMMFGQIGVQARKQGYFGN 136 T 48 YkpC pdbhh F Eukaryota T 7ycx 7 G H INT8_HUMAN INT8,PROTEIN KAONASHI-1 MSAEAADREAATSSRPCTPPQTCWFEFLLEESLLEKHLRKPCPDPAPVQLIVQFLEQASKPSVNEQNQVQPPPDNKRNRILKLLALKVAAHLKWDLDILEKSLSVPVLNMLLNELLCISKVPPGTKHVDMDLATLPPTTAMAVLLYNRWAIRTIVQSSFPVKQAKPGPPQLSVMNQMQQEKELTENILKVLKEQAADSILVLEAALKLNKDLYVHTMRTLDLLAMEPGMVNGETESSTAGLKVKTEEMQCQVCYDLGAAYFQQGSTNSAVYENAREKFFRTKELIAEIGSLSLHCTIDEKRLAGYCQACDVLVPSSDSTSQQLTPYSQVHICLRSGNYQEVIQIFIEDNLTLSLPVQFRQSVLRELFKKAQQGNEALDEICFKVCACNTVRDILEGRTISVQFNQLFLRPNKEKIDFLLEVCSRSVNLEKASESLKGNMAAFLKNVCLGLEDLQYVFMISSHELFITLLKDEERKLLVDQMRKRSPRVNLCIKPVTSFYDIPASASVNIGQLEHQLILSVDPWRIRQILIELHGMTSERQFWTVSNKWEVPSVYSGVILGIKDNLTRDLVYILMAKGLHCSTVKDFSHAKQLFAACLELVTEFSPKLRQVMLNEMLLLDIHTHEAGTGQAGERPPSDLISRVRGYLEMRLPDIPLRQVIAEECVAFMLNWRENEYLTLQVPAFLLQSNPYVKLGQLLAATCKELPGPKESRRTAKDLWEVVVQICSVSSQHKRGNDGRVSLIKQRESTLGIMYRSELLSFIKKLREPLVLTIILSLFVKLHNVREDIVNDITAEHISIWPSSIPNLQSVDFEAVAITVKELVRYTLSINPNNHSWLIIQADIYFATNQYSAALHYYLQAGAVCSDFFNKAVPPDVYTDQVIKRMIKCCSLLNCHTQVAILCQFLREIDYKTAFKSLQEQNSHDAMDSYYDYIWDVTILEYLTYLHHKRGETDKRQIAIKAIGQTELNASNPEEVLQLAAQRRKKKFLQAMAKLYF 995 T 0.0069 TPR_12 pdbpercent F Eukaryota T 7ycx 10 J M Unknown2 XXXXXXXXXXXXXXXXX 17 F F F 7ycx 13 M U Unknown XXXXXXXXXXXXXXXXXXXXXXXXXXX 27 F F F 7ycx 27 AA e NELFA_HUMAN NELF-A,WOLF-HIRSCHHORN SYNDROME CANDIDATE 2 PROTEIN MASMRESDTGLWLHNKLGATDELWAPPSIASLLTAAVIDNIRLCFHGLSSAVKLKLLLGTLHLPRRTVDEMKGALMEIIQLASLDSDPWVLMVADILKSFPDTGSLNLELEEQNPNVQDILGELREKVGECEASAMLPLECQYLNKNALTTLAGPLTPPVKHFQLKRKPKSATLRAELLQKSTETAQQLKRSAGVPFHAKGRGLLRKMDTTTPLKGIPKQAPFRSPTAPSVFSPTGNRTPIPPSRTLLRKERGVKLLDISELDMVGAGREAKRRRKTLDAEVVEKPAKEETVVENATPDYAAGLVSTQKLGSLNNEPALPSTSYLPSTPSVVPASSYIPSSETPPAPSSREASRPPEEPSAPSPTLPAQFKQRAPMYNSGLSPATPTPAAPTSPLTPTTPPAVAPTTQTPPVAMVAPQTQAPAQQQPKKNLSLTREQMFAAQEMFKTANKVTRPEKALILGFMAGSRENPCQEQGDVIQIKLSEHTEDLPKADGQGSTTMLVDTVFEMNYATGQWTRFKKYKPMTNVS 528 T 0.008 PRCC unppercent F Eukaryota T 7yd4 1 A A P95206_MYCTU SECRETORY PROTEIN EPTGALPPMTSSGSGPVIGDGDAALRQRISQQLFSFGDPTVQEVDGSDAAQFITAAAAVADRDVASVFLPLQRVLGCQQNTAGSGAGFGARAYRRTDGQWGGAMLVVAKSTVSDVDALKACVKSGWRKATAGTPTSMCNNGWTYPPFADTRRGEEGYFVLLAGTASDFCSAPNANYRTTASSWPG 185 T 14 DUF983 pdbhh F Bacteria T 7ydp 3 C A engineered miniGas MGCTLSAEDKAAVERSKMIEKQLQKDKQVYRATHRLLLLGADNSGKSTIVKQMRIYHVNGYSEEECKQYKAVVYSNTIQSIIAIIRAMGRLKIDFGDSARADDARQLFVLAGAAEEGFMTAELAGVIKRLWKDSGVQACFNRSREYQLNDSAAYYLNDLDRIAQPNYIPTQQDVLRTRVKTSGIFETKFQVDKVNFHMFDVGAQRDERRKWIQCFNDVTAIIFVVDSSDYNRLQEALNDFKSIWNNRWLRTISVILFLNKQDLLAEKVLAGKSKIEDYFPEFARYTTPEDATPEPGEDPRVTRAKYFIRDEFLRISTASGDGRHYCYPHFTCSVDTENARRIFNDCRDIIQRMHLRQYELL 361 F F T 7yed 1 A,B,C,D,E,F,G,H,I,J,U,V,W,X,Y 1,2,3,4,5,A,B,C,D,E,a,b,c,d,e C9E874_9VIRU LAMBDA 1 MKRIPRKTKGKSSGKGNDSTSRSDDGSSQLRDKQSNKANPATAEPGTSNCEHYKARPGIASVQKATESAELPMKNNDEGTPDKRGNTKGALVNEHVEARDEADDATKKQAKDTEKAKAQVTYSDTGINNANELSRSGNVDNEGGSNQKPMSTRIAEATSAIVSKHPARVGLPPTASSGHGYQCHVCSAVLFSPLDLDAHVASHGLHGNMTLTSSEIQRHITEFISSWQNHPIVQVSADVENRKTAQLLHADTPRLVTWDAGLCTSFKIVPIVPAQVPQDVLAYTFFTSSYAIQSPFPEAAVSRIVVHTRWASNVDFDRDSSVIMAPPTENNIHLFKQLLNTETLSVRGANPLMFRANVLHMLLEFVLDNLYLNRHTGFSQDHTPFTEGANLRSLPGPDAEKWYSIMYPTRMGTPNVSKICNFVASCVRNRVGRFDRAQMMNGAMSEWVDVFETSDALTVSIRGRWMARLARMNINPTEIEWALTECAQGYVTVTSPYAPSVNRLMPYRISNAERQISQIIRVMNIGNNATVIQPVLQDISVLLQRISPLQIDPTIISNTMSTVSESTTQTLSPASSILGKLRPSNSDFSSFRVALAGWLYNGVVTTVIDDSSYPKDGGSVTSLENLWDFFILALALPLTTDPCAPVKAFMTLANMMVGFETIPMDNQIYTQSRRASAFSTPHTWPRCFMNIQLISPIDAPILRQWAEIIHRYWPNPSQIRYGTPNVFGSANLFTPPEVLLLPIDHQPANVTTPTLDFTNELTNWRARVCELMKNLVDNQRYQPGWTQSLVSSMRGTLGKLKLIKSMTPMYLQQLAPVELAVIAPMLPFPPFQVPYVRLDRDRVPTMVGVTRQSRDTITQPALSLSTTNTTVGVPLALDARAITVALLSGKYPPDLVTNVWYADAIYPMYADTEVFSNLQRDVITCEAVQTLVTLVAQISETQYPVDRYLDWIPSLRASAATAATFAEWVNTSMKTAFDLSDMLLEPLLSGDPRMTQLAIQYQQYNGRTFNVIPEMPGSVIADCVQLTAEVFNHEYNLFGIARGDIIIGRVQSTHLWSPLAPPPDLVFDRDTPGVHIFGRDCRISFGMNGAAPMIRDETGMMVPFEGNWIFPLALWQMNTRYFNQQFDAWIKTGELRIRIEMGAYPYMLHYYDPRQYANAWNLTSAWLEEITPTSIPSVPFMVPISSDHDISSAPAVQYIISTEYNDRSLFCTNSSSPQTIAGPDKHIPVERYNILTNPDAPPTQIQLPEVVDLYNVVTRYAYETPPITAVVMGVP 1275 T 0.0075 zf_C2H2_6 pdbhh T Viruses T 7yeq 1 A A A0A2X0RVH4_ASF CP312R CDS PROTEIN,CP312R PROTEIN MTTHIFHADDLLQALQQAKAEKNFSSVFSLDWDKLRTAKRNTTVKYVTVNVIVKGKKAPLMFNFQNEKHVGTIPPSTDEEVIRMNAENPKFLVKKRDRDPCLQFNKYKISPPLEDDGLTVKKNEQGEEIYPGDEEKSKLFQIIELLEEAFEDAVQKGPEAMKTKHVIKLIQRKISNSAVKNADKPLPNPIARIRIKINPATSILTPILLDKNKPITLQNGKTSFEELKDEDGVKANPDNIHKLIESHSIHDGIINARSICISNMGISFPLCLEMGVVKVFEKNNGIDVNSIYGSDDISTLVNQIAIA 307 T 61 DUF5721 pdbhh T Viruses T 7yev 1 A,B,C,D,E,F,G,H,I,J,R,S,T,U,V 1,2,3,4,5,A,B,C,D,E,a,b,c,d,e C9E874_9VIRU LAMBDA 1 MKRIPRKTKGKSSGKGNDSTSRSDDGSSQLRDKQSNKANPATAEPGTSNCEHYKARPGIASVQKATESAELPMKNNDEGTPDKRGNTKGALVNEHVEARDEADDATKKQAKDTEKAKAQVTYSDTGINNANELSRSGNVDNEGGSNQKPMSTRIAEATSAIVSKHPARVGLPPTASSGHGYQCHVCSAVLFSPLDLDAHVASHGLHGNMTLTSSEIQRHITEFISSWQNHPIVQVSADVENRKTAQLLHADTPRLVTWDAGLCTSFKIVPIVPAQVPQDVLAYTFFTSSYAIQSPFPEAAVSRIVVHTRWASNVDFDRDSSVIMAPPTENNIHLFKQLLNTETLSVRGANPLMFRANVLHMLLEFVLDNLYLNRHTGFSQDHTPFTEGANLRSLPGPDAEKWYSIMYPTRMGTPNVSKICNFVASCVRNRVGRFDRAQMMNGAMSEWVDVFETSDALTVSIRGRWMARLARMNINPTEIEWALTECAQGYVTVTSPYAPSVNRLMPYRISNAERQISQIIRVMNIGNNATVIQPVLQDISVLLQRISPLQIDPTIISNTMSTVSESTTQTLSPASSILGKLRPSNSDFSSFRVALAGWLYNGVVTTVIDDSSYPKDGGSVTSLENLWDFFILALALPLTTDPCAPVKAFMTLANMMVGFETIPMDNQIYTQSRRASAFSTPHTWPRCFMNIQLISPIDAPILRQWAEIIHRYWPNPSQIRYGTPNVFGSANLFTPPEVLLLPIDHQPANVTTPTLDFTNELTNWRARVCELMKNLVDNQRYQPGWTQSLVSSMRGTLGKLKLIKSMTPMYLQQLAPVELAVIAPMLPFPPFQVPYVRLDRDRVPTMVGVTRQSRDTITQPALSLSTTNTTVGVPLALDARAITVALLSGKYPPDLVTNVWYADAIYPMYADTEVFSNLQRDVITCEAVQTLVTLVAQISETQYPVDRYLDWIPSLRASAATAATFAEWVNTSMKTAFDLSDMLLEPLLSGDPRMTQLAIQYQQYNGRTFNVIPEMPGSVIADCVQLTAEVFNHEYNLFGIARGDIIIGRVQSTHLWSPLAPPPDLVFDRDTPGVHIFGRDCRISFGMNGAAPMIRDETGMMVPFEGNWIFPLALWQMNTRYFNQQFDAWIKTGELRIRIEMGAYPYMLHYYDPRQYANAWNLTSAWLEEITPTSIPSVPFMVPISSDHDISSAPAVQYIISTEYNDRSLFCTNSSSPQTIAGPDKHIPVERYNILTNPDAPPTQIQLPEVVDLYNVVTRYAYETPPITAVVMGVP 1275 T 0.0075 zf_C2H2_6 pdbhh T Viruses T 7yez 1 A,B,C,D,E,F,G,H,I,J,R,S,T,U,V 1,2,3,4,5,A,B,C,D,E,a,b,c,d,e C9E874_9VIRU LAMBDA 1 MKRIPRKTKGKSSGKGNDSTSRSDDGSSQLRDKQSNKANPATAEPGTSNCEHYKARPGIASVQKATESAELPMKNNDEGTPDKRGNTKGALVNEHVEARDEADDATKKQAKDTEKAKAQVTYSDTGINNANELSRSGNVDNEGGSNQKPMSTRIAEATSAIVSKHPARVGLPPTASSGHGYQCHVCSAVLFSPLDLDAHVASHGLHGNMTLTSSEIQRHITEFISSWQNHPIVQVSADVENRKTAQLLHADTPRLVTWDAGLCTSFKIVPIVPAQVPQDVLAYTFFTSSYAIQSPFPEAAVSRIVVHTRWASNVDFDRDSSVIMAPPTENNIHLFKQLLNTETLSVRGANPLMFRANVLHMLLEFVLDNLYLNRHTGFSQDHTPFTEGANLRSLPGPDAEKWYSIMYPTRMGTPNVSKICNFVASCVRNRVGRFDRAQMMNGAMSEWVDVFETSDALTVSIRGRWMARLARMNINPTEIEWALTECAQGYVTVTSPYAPSVNRLMPYRISNAERQISQIIRVMNIGNNATVIQPVLQDISVLLQRISPLQIDPTIISNTMSTVSESTTQTLSPASSILGKLRPSNSDFSSFRVALAGWLYNGVVTTVIDDSSYPKDGGSVTSLENLWDFFILALALPLTTDPCAPVKAFMTLANMMVGFETIPMDNQIYTQSRRASAFSTPHTWPRCFMNIQLISPIDAPILRQWAEIIHRYWPNPSQIRYGTPNVFGSANLFTPPEVLLLPIDHQPANVTTPTLDFTNELTNWRARVCELMKNLVDNQRYQPGWTQSLVSSMRGTLGKLKLIKSMTPMYLQQLAPVELAVIAPMLPFPPFQVPYVRLDRDRVPTMVGVTRQSRDTITQPALSLSTTNTTVGVPLALDARAITVALLSGKYPPDLVTNVWYADAIYPMYADTEVFSNLQRDVITCEAVQTLVTLVAQISETQYPVDRYLDWIPSLRASAATAATFAEWVNTSMKTAFDLSDMLLEPLLSGDPRMTQLAIQYQQYNGRTFNVIPEMPGSVIADCVQLTAEVFNHEYNLFGIARGDIIIGRVQSTHLWSPLAPPPDLVFDRDTPGVHIFGRDCRISFGMNGAAPMIRDETGMMVPFEGNWIFPLALWQMNTRYFNQQFDAWIKTGELRIRIEMGAYPYMLHYYDPRQYANAWNLTSAWLEEITPTSIPSVPFMVPISSDHDISSAPAVQYIISTEYNDRSLFCTNSSSPQTIAGPDKHIPVERYNILTNPDAPPTQIQLPEVVDLYNVVTRYAYETPPITAVVMGVP 1275 T 0.0075 zf_C2H2_6 pdbhh T Viruses T 7yf0 1 A,B,C,D,E,F,G,H,I,J,R,S,T,U,V 1,2,3,4,5,A,B,C,D,E,a,b,c,d,e C9E874_9VIRU LAMBDA 1 MKRIPRKTKGKSSGKGNDSTSRSDDGSSQLRDKQSNKANPATAEPGTSNCEHYKARPGIASVQKATESAELPMKNNDEGTPDKRGNTKGALVNEHVEARDEADDATKKQAKDTEKAKAQVTYSDTGINNANELSRSGNVDNEGGSNQKPMSTRIAEATSAIVSKHPARVGLPPTASSGHGYQCHVCSAVLFSPLDLDAHVASHGLHGNMTLTSSEIQRHITEFISSWQNHPIVQVSADVENRKTAQLLHADTPRLVTWDAGLCTSFKIVPIVPAQVPQDVLAYTFFTSSYAIQSPFPEAAVSRIVVHTRWASNVDFDRDSSVIMAPPTENNIHLFKQLLNTETLSVRGANPLMFRANVLHMLLEFVLDNLYLNRHTGFSQDHTPFTEGANLRSLPGPDAEKWYSIMYPTRMGTPNVSKICNFVASCVRNRVGRFDRAQMMNGAMSEWVDVFETSDALTVSIRGRWMARLARMNINPTEIEWALTECAQGYVTVTSPYAPSVNRLMPYRISNAERQISQIIRVMNIGNNATVIQPVLQDISVLLQRISPLQIDPTIISNTMSTVSESTTQTLSPASSILGKLRPSNSDFSSFRVALAGWLYNGVVTTVIDDSSYPKDGGSVTSLENLWDFFILALALPLTTDPCAPVKAFMTLANMMVGFETIPMDNQIYTQSRRASAFSTPHTWPRCFMNIQLISPIDAPILRQWAEIIHRYWPNPSQIRYGTPNVFGSANLFTPPEVLLLPIDHQPANVTTPTLDFTNELTNWRARVCELMKNLVDNQRYQPGWTQSLVSSMRGTLGKLKLIKSMTPMYLQQLAPVELAVIAPMLPFPPFQVPYVRLDRDRVPTMVGVTRQSRDTITQPALSLSTTNTTVGVPLALDARAITVALLSGKYPPDLVTNVWYADAIYPMYADTEVFSNLQRDVITCEAVQTLVTLVAQISETQYPVDRYLDWIPSLRASAATAATFAEWVNTSMKTAFDLSDMLLEPLLSGDPRMTQLAIQYQQYNGRTFNVIPEMPGSVIADCVQLTAEVFNHEYNLFGIARGDIIIGRVQSTHLWSPLAPPPDLVFDRDTPGVHIFGRDCRISFGMNGAAPMIRDETGMMVPFEGNWIFPLALWQMNTRYFNQQFDAWIKTGELRIRIEMGAYPYMLHYYDPRQYANAWNLTSAWLEEITPTSIPSVPFMVPISSDHDISSAPAVQYIISTEYNDRSLFCTNSSSPQTIAGPDKHIPVERYNILTNPDAPPTQIQLPEVVDLYNVVTRYAYETPPITAVVMGVP 1275 T 0.0075 zf_C2H2_6 pdbhh T Viruses T 7yf2 2 C,D C,D SLC39A5 peptide GHQGHSHGHQGGY 13 T 1.9 UreE_C pdbhh F F 7yf3 2 C,D C,D S100A9 peptide RGHGHSHGKGS 11 T 1.3 Urease_linker pdbhh F F 7yf4 2 C,D C,D SLC39A5 mutant peptide GHQGHAHGHQGG 12 T 4.3 DUF6488 pdbhh F F 7yf6 2 C C Macrocyclic Peptide FVPVLWLXX 9 T 0.47 BshC pdbhh F T 7yfe 1 A,B,C,D,E,F,G,H,I,J,U,V,W,X,Y 1,2,3,4,5,A,B,C,D,E,a,b,c,d,e C9E874_9VIRU LAMBDA 1 MKRIPRKTKGKSSGKGNDSTSRSDDGSSQLRDKQSNKANPATAEPGTSNCEHYKARPGIASVQKATESAELPMKNNDEGTPDKRGNTKGALVNEHVEARDEADDATKKQAKDTEKAKAQVTYSDTGINNANELSRSGNVDNEGGSNQKPMSTRIAEATSAIVSKHPARVGLPPTASSGHGYQCHVCSAVLFSPLDLDAHVASHGLHGNMTLTSSEIQRHITEFISSWQNHPIVQVSADVENRKTAQLLHADTPRLVTWDAGLCTSFKIVPIVPAQVPQDVLAYTFFTSSYAIQSPFPEAAVSRIVVHTRWASNVDFDRDSSVIMAPPTENNIHLFKQLLNTETLSVRGANPLMFRANVLHMLLEFVLDNLYLNRHTGFSQDHTPFTEGANLRSLPGPDAEKWYSIMYPTRMGTPNVSKICNFVASCVRNRVGRFDRAQMMNGAMSEWVDVFETSDALTVSIRGRWMARLARMNINPTEIEWALTECAQGYVTVTSPYAPSVNRLMPYRISNAERQISQIIRVMNIGNNATVIQPVLQDISVLLQRISPLQIDPTIISNTMSTVSESTTQTLSPASSILGKLRPSNSDFSSFRVALAGWLYNGVVTTVIDDSSYPKDGGSVTSLENLWDFFILALALPLTTDPCAPVKAFMTLANMMVGFETIPMDNQIYTQSRRASAFSTPHTWPRCFMNIQLISPIDAPILRQWAEIIHRYWPNPSQIRYGTPNVFGSANLFTPPEVLLLPIDHQPANVTTPTLDFTNELTNWRARVCELMKNLVDNQRYQPGWTQSLVSSMRGTLGKLKLIKSMTPMYLQQLAPVELAVIAPMLPFPPFQVPYVRLDRDRVPTMVGVTRQSRDTITQPALSLSTTNTTVGVPLALDARAITVALLSGKYPPDLVTNVWYADAIYPMYADTEVFSNLQRDVITCEAVQTLVTLVAQISETQYPVDRYLDWIPSLRASAATAATFAEWVNTSMKTAFDLSDMLLEPLLSGDPRMTQLAIQYQQYNGRTFNVIPEMPGSVIADCVQLTAEVFNHEYNLFGIARGDIIIGRVQSTHLWSPLAPPPDLVFDRDTPGVHIFGRDCRISFGMNGAAPMIRDETGMMVPFEGNWIFPLALWQMNTRYFNQQFDAWIKTGELRIRIEMGAYPYMLHYYDPRQYANAWNLTSAWLEEITPTSIPSVPFMVPISSDHDISSAPAVQYIISTEYNDRSLFCTNSSSPQTIAGPDKHIPVERYNILTNPDAPPTQIQLPEVVDLYNVVTRYAYETPPITAVVMGVP 1275 T 0.0075 zf_C2H2_6 pdbhh T Viruses T 7yfs 1 A A noursin APSNVLSTLLHGRACV 16 T 0.85 DUF765 pdbhh F T 7yfu 1 A,B A,B FTM_MOUSE NEPHROCYSTIN-8,RPGR-INTERACTING PROTEIN 1-LIKE PROTEIN,RPGRIP1-LIKE PROTEIN GPGSRDVEMEEMIEQLQEKVHELERQNEVLKNRLISAKQQLQVQG 45 T 0.00078 EzrA unphh F Eukaryota T 7yfv 1 A,B,C,D A,B,C,D FTM_MOUSE NEPHROCYSTIN-8,RPGR-INTERACTING PROTEIN 1-LIKE PROTEIN,RPGRIP1-LIKE PROTEIN GPGSVSRVSREELEDRFLRLHDENILLKQHARKQEDKIKRMATKLIRLVNDKKRYERVGG 60 T 0.00014 WEMBL unphh F Eukaryota T 7yfw 1 A,B,C a,b,c Pam3 fiber proreins MASINLPFSLSGSKRIPTSEELADGYQCGPLDVELDNWLMWWLTGQVDGVIEGAGLTTDDTDLARLYKAIQSMTSGNLRTVVLTAASGNLPIPSDVSVLNWVRAVGGGGAGGNSNTGNSKASGGGGGAGFDRFNVAVTPGSNVPYTVGAAGAVNGLGAGYNGGAGGSTAILGTTAGGGAGGLGVNNNATAVQVNGGTTSGTTPEISYPGGLGTEGIVGTGGGSVLSQPTQRAFTNAGNNNPANSWGGGGPGGSDFGGAWQPGGVGKQGIIIVQYFSRFAP 280 T 180 DUF777 pdbhh F T 7yfz 1 A,B,E,F,H,I,K,L,N,O,P,Q A,B,E,F,H,I,K,L,O,P,Q,R Pam3 baseplate wedge gp22 MTYGVQPTGYVKKPLAVHLAEIEASMVDLFGPGVIQTEQSPLGQLNGLYADLSYDLDERGEDLYQSFDPEQAEGSRLDILARYRLLSRRAGESDESFRRAITNVDRARIDLSDLSTALSAINGVSWSRVYVNEDATTDADGIPPNTVSVAVIGGDDDEVAQLVRRYVVPGVGMYGNTTIETTIGGFCRRIRVIRPVLIPTSVEIDVQSRPLKNGCPPPSVNAMAAGLYTELTGPDRPGNGEDGTVYLFRKIMERLYPNVEVVDVRLSQAPAAPTTPPLVMSFFQMMSFNADDILVEIVP 299 F F T 7yfz 4 T,U,V,W,X,Y a,b,c,d,e,f Pam3 tube initiator gp17 MIIAFSSAIGPVPLTVVISEKHTSKVELTTNPIESGADVTDHAYVKGKEIELEVADRNAAATWAALVAFQESRVPFVLMTGLSMYRNMIITEIDATRNAQHSKILKGTVRLREVKIVETGTAEDSSGKDGTDKNKSSNPSKDKAADAKTADKANSGVNAGDKGGTTVAAPRAQSLLKGVFGGSSASGGAAP 191 T 2.4E-05 Phage_P2_GpU pdbhh F T 7yfz 5 AA,BA,CA,DA,EA,Z i,j,k,l,m,h Pam3 plug gp18 MIELEVLDESKQKFSVILNDRRVTIELWYNTTNDRWSFSLALDGDNVVTGRRLVTGVDLLAPFGLGIGALFLLSENGEPPTRANLPLGLVKLYHATQEEIDAAISA 106 T 3.8 RC-P840_PscD pdbhh F T 7yg3 3 C B ARG-GLN-ASP-ILE-LEU-ASP-LEU-TRP-ILE RQDILDLWI 9 T 0.0037 F-protein pdbhh F T 7yg4 1 A A VIR_HUMAN Protein virilizer homolog MASVKLTELLDLYREDRGAKWVTALEEIPSLIIKGLSYLQLKNTKQDSLGQLVDWTMQALNLQVALRQPIALNVRQLKAGTKLVSSLAECGAQGVTGLLQAGVISGLFELLFADHVSSSLKLNAFKALDSVISMTEGMEAFLRGRQNEKSGYQKLLELILLDQTVRVVTAGSAILQKCHFYEVLSEIKRLGDHLAEKTSSLPNHSEPDHDTDAGLERTNPEYENEVEASMDMDLLESSNISEGEIERLINLLEEVFHLMETAPHTMIQQPVKSFPTMARITGPPERDDPYPVLFRYLHSHHFLELVTLLLSIPVTSAHPGVLQATKDVLKFLAQSQKGLLFFMSEYEATNLLIRALCHFYDQDEEEGLQSDGVIDDAFALWLQDSTQTLQCITELFSHFQRCTASEETDHSDLLGTLHNLYLITFNPVGRSAVGHVFSLEKNLQSLITLMEYYSKEALGDSKSKKSVAYNYACILILVVVQSSSDVQMLEQHAASLLKLCKADENNAKLQELGKWLEPLKNLRFEINCIPNLIEYVKQNIDNLMTPEGVGLTTALRVLCNVACPPPPVEGQQKDLKWNLAVIQLFSAEGMDTFIRVLQKLNSILTQPWRLHVNMGTTLHRVTTISMARCTLTLLKTMLTELLRGGSFEFKDMRVPSALVTLHMLLCSIPLSGRLDSDEQKIQNDIIDILLTFTQGVNEKLTISEETLANNTWSLMLKEVLSSILKVPEGFFSGLILLSELLPLPLPMQTTQVIEPHDISVALNTRKLWSMHLHVQAKLLQEIVRSFSGTTCQPIQHMLRRICVQLCDLASPTALLIMRTVLDLIVEDLQSTSEDKEKQYTSQTTRLLALLDALASHKACKLAILHLINGTIKGDERYAEIFQDLLALVRSPGDSVIRQQCVEYVTSILQSLCDQDIALILPSSSEGSISELEQLSNSLPNKELMTSICDCLLATLANSESSYNCLLTCVRTMMFLAEHDYGLFHLKSSLRKNSSALHSLLKRVVSTFSKDTGELASSFLEFMRQILNSDTIGCCGDDNGLMEVEGAHTSRTMSINAAELKQLLQSKEESPENLFLELEKLVLEHSKDDDNLDSLLDSVVGLKQMLES 1107 T 3 T3SS_ATPase_C pdb F Eukaryota T 7yh8 1 A,C A,C L-19437 LPVEKIIREAKKILDELLKRGLIDPELARIAREVLERARKLGNEEAARFVLELIERLRRELS 62 T 0.05 DUF2095 pdbhh F T 7yh8 2 B,D B,D D-Pep-1 XXXXXXXXXXXXXXXXXXX 19 F F F 7yhr 1 A A A0A239N0M2_9PSED Anti-CRISPR protein Type I-C5 MSKVTLNGQQIDFDAAVNLMDAELREELHSAQEWTNDQEFLDAYVQAHAAKFDGEEFQVA 60 T 0.1 TubC_N pdb F Bacteria T 7yhs 4 I J AcrIF4 MMTISKTDIDCYLQTYVVIDPVSNGWQWGIDENGVGGALHHGRVEMVEGENGYFGLRGATHPTEKEAMAAALGYLWRCRQDLVAIARNDAIEAEKYRAKA 100 T 1.7 TAL_effector pdbhh F T 7yi7 2 B B HPSE_HUMAN Heparanase 8 kDa subunit QDVVDLDFFTQEPLHLVSPSFLSVTIDANLATDPRFLILLGSPKLRTLARGLSPAYLRFGGTKTDFLIFDPKKE 74 T 3.5 Glyco_hydro_79n unppercent F Eukaryota T 7yil 1 A,B A,B Y248_METJA GINS FVITMYESLKNYFFEEIKNDKLLKLPDDFYDDIREYIKNIKDDIELERVKYYFKELRKLRIYKALYLDNERENLLPEELNIIHAIENIVVELKIE 95 T 0.041 Peptidase_M3_N unppercent F Archaea T 7yiu 2 B E SPTC1_HUMAN LONG CHAIN BASE BIOSYNTHESIS PROTEIN 1,LCB 1,SERINE-PALMITOYL-COA TRANSFERASE 1,SPT 1,SPT1 MATATEQWVLVEMVQALYEAPAYHLILEGILILWIIRLLFSKTYKLQERS 50 T 0.55 Mucin15 pdbhh F Eukaryota T 7yiy 2 B E SPTC1_HUMAN LONG CHAIN BASE BIOSYNTHESIS PROTEIN 1,LCB 1,SERINE-PALMITOYL-COA TRANSFERASE 1,SPT 1,SPT1 MATATEQWVLVEMVQALYEAPAYHLILEGILILWIIRLLFSKTYKLQERS 50 T 0.55 Mucin15 pdbhh F Eukaryota T 7yj1 2 B E SPTC1_HUMAN LONG CHAIN BASE BIOSYNTHESIS PROTEIN 1,LCB 1,SERINE-PALMITOYL-COA TRANSFERASE 1,SPT 1,SPT1 MATATEQWVLVEMVQALYEAPAYHLILEGILILWIIRLLFSKTYKLQERS 50 T 0.55 Mucin15 pdbhh F Eukaryota T 7yj2 2 B E SPTC1_HUMAN LONG CHAIN BASE BIOSYNTHESIS PROTEIN 1,LCB 1,SERINE-PALMITOYL-COA TRANSFERASE 1,SPT 1,SPT1 MATATEQWVLVEMVQALYEAPAYHLILEGILILWIIRLLFSKTYKLQERS 50 T 0.55 Mucin15 pdbhh F Eukaryota T 7yjc 2 B B HPSE_HUMAN Heparanase 8 kDa subunit QDVVDLDFFTQEPLHLVSPSFLSVTIDANLATDPRFLILLGSPKLRTLARGLSPAYLRFGGTKTDFLIFDPKKE 74 T 3.5 Glyco_hydro_79n unppercent F Eukaryota T 7yji 1 A A A0A2S6F9N6_LEGPN T4SS effector Lpg1083 MGHHHHHHMALIDQITTINKNEFTDDFLRKYFELGFGSLSKHDIDLLVYYLVKEHSDLFNGKTNYEISSLLTITERKLQSIQMESYLRYENNSISKNLEELSVKITKGEIKPEVEGDKIRVLIDSPVLRRDLEYSITSLGHIVDYSFNKNILSLRLSNFFEVFGNLNIENGKELKTQVIDFFREQNKWDKEILIEIENKSWWIKQFNTLQAAVKKEAAALIFHSIISMVKSHIGI 235 T 0.017 Sigma70_r4_2 unppercent F Bacteria T 7yjm 4 D E LCB1_ARATH atLCB1 MASNLVEMFNAALNWVTMILESPSARVVLFGVPIRGHFFVEGLLGVVIIILLTRKSYKPPKR 62 T 0.0009 RELT pdb F Eukaryota T 7yjn 4 D E LCB1_ARATH ATLCB1,PROTEIN EMBRYO DEFECTIVE 2779,PROTEIN FUMONISIN B1 RESISTANT 11 MASNLVEMFNAALNWVTMILESPSARVVLFGVPIRGHFFVEGLLGVVIIILLTRKSYKPPKR 62 T 0.0009 RELT pdb F Eukaryota T 7yjo 4 D E LCB1_ARATH ATLCB1,PROTEIN EMBRYO DEFECTIVE 2779,PROTEIN FUMONISIN B1 RESISTANT 11 MASNLVEMFNAALNWVTMILESPSARVVLFGVPIRGHFFVEGLLGVVIIILLTRKSYKPPKR 62 T 0.0009 RELT pdb F Eukaryota T 7yk3 2 B,D B,D DARG_MYCTU DARG,ANTITOXIN DARG,MACRO DOMAIN-CONTAINING PROTEIN MTWGRAVILEAMRRYLQQRRAMEPWEDPAGISHLEIQKLMYFANEADPDLALDFTPGRYGPYSERVRHLLQGMEGAFTVGLGDGTARVLANQPISLTTKGTDAITDYLATDAAADRVSAAVDTVLRVIEGFEGPYGVELLASTHWVATREGAKEPATAAAAVRKWTKRKGRIYSDDRIGVALDRILMTA 189 T 1.7E-05 DUF4065 pdbhh F Bacteria T 7yk5 3 Q,R,S,T i,k,j,l PYCO1 SSU binding motif KWSPRGGS 8 T 5.2 DUF2415 pdbhh F T 7yk5 4 AA,BA,U,V,W,X,Y,Z b,a,h,g,f,e,d,c PYCO1 LSU binding motif AAEWGSMNQ 9 T 0.025 Intimin_C pdbhh F T 7ykd 1 A L RARR2_HUMAN CHEMERIN,RAR-RESPONSIVE PROTEIN TIG2,TAZAROTENE-INDUCED GENE 2 PROTEIN YFPGQFAFS 9 T 1.6 BTK pdbhh F Eukaryota T 7ym8 1 A B miniGsq MGHHHHHHHHLEVLFQGPIEKQLQKDKQVYRATHRLLLLGADNSGKSTIVKQMRILHGGSGGSGGTSGIFETKFQVDKVNFHMFDVGGQRDERRKWIQCFNDVTAIIFVVDSSDYNRLQEALNDFKSIWNNRWLRTISVILFLNKQDLLAEKVLAGKSKIEDYFPEFARYTTPEDATPEPGEDPRVTRAKYFIRDEFLRISTASGDGRHYCYPHFTCAVDTENARRIFNDCKDIILQMNLREYNLV 246 T 5.9E-09 G-alpha pdb F T 7ymh 2 B B miniGsq MGHHHHHHHHLEVLFQGPIEKQLQKDKQVYRATHRLLLLGADNSGKSTIVKQMRILHGGSGGSGGTSGIFETKFQVDKVNFHMFDVGGQRDERRKWIQCFNDVTAIIFVVDSSDYNRLQEALNDFKSIWNNRWLRTISVILFLNKQDLLAEKVLAGKSKIEDYFPEFARYTTPEDATPEPGEDPRVTRAKYFIRDEFLRISTASGDGRHYCYPHFTCAVDTENARRIFNDCKDIILQMNLREYNLV 246 T 5.9E-09 G-alpha pdb F T 7ymk 2 B,D C,D Grip peptide AILHRLLQ 8 T 0.0019 SRC-1 pdbhh F T 7ynd 3 C C A0A401FT52_9DELT CHAT domain-containing protein MSNPIRDIQDRLKTAKFDNKDDMMNLASSLYKYEKQLMDSSEATLCQQGLSNRPNSFSQLSQFRDSDIQSKAGGQTGKFWQNEYEACKNFQTHKERRETLEQIIRFLQNGAEEKDADDLLLKTLARAYFHRGLLYRPKGFSVPARKVEAMKKAIAYCEIILDKNEEESEALRIWLYAAMELRRCGEEYPENFAEKLFYLANDGFISELYDIRLFLEYTEREEDNNFLDMILQENQDRERLFELCLYKARACFHLNQLNDVRIYGESAIDNAPGAFADPFWDELVEFIRMLRNKKSELWKEIAIKAWDKCREKEMKVGNNIYLSWYWARQRELYDLAFMAQDGIEKKTRIADSLKSRTTLRIQELNELRKDAHRKQNRRLEDKLDRIIEQENEARDGAYLRRNPPCFTGGKREEIPFARLPQNWIAVHFYLNELESHEGGKGGHALIYDPQKAEKDQWQDKSFDYKELHRKFLEWQENYILNEEGSADFLVTLCREIEKAMPFLFKSEVIPEDRPVLWIPHGFLHRLPLHAAMKSGNNSNIEIFWERHASRYLPAWHLFDPAPYSREESSTLLKNFEEYDFQNLENGEIEVYAPSSPKKVKEAIRENPAILLLLCHGEADMTNPFRSCLKLKNKDMTIFDLLTVEDVRLSGSRILLGACESDMVPPLEFSVDEHLSVSGAFLSHKAGEIVAGLWTVDSEKVDECYSYLVEEKDFLRNLQEWQMAETENFRSENDSSLFYKIAPFRIIGFPAE 751 T 1.7E-07 CHAT pdbpercent F Bacteria T 7yoj 1 A A A0A399WQY8_9BACT REVERSE TRANSCRIPTASE DOMAIN-CONTAINING PROTEIN MAKATKEVKSKRVEALRQVAYQRLERLERKAQKIGAHLRKPGKAADLQSLHYLLHKVEVEYHDIARNLEKDPTWTPKPKMRREKRAIVPESGPAAPLPTTAKGEPGRPANRHIPPPVPLDSARIPEDQQSMGQGSGGRSWCSAPFVEVKLPPTQWSNVREKLLKFRIEDDADIVRRWAEAKFGSIETARDGLRASAEIGTSPDVWRSFISRAISNGKKDFEPLLSLDDDELTADATAERVVRRWHQIDWVGRMLDSILETVPSGVSKDTFRSRVESRLKTFHSSVNSFELKKRKDGTVERKRKHTNPQFPYLSPSAVSIDPDVVTMEAVELLQMQPEERFAKDPNDANGRMRLRVLQAELGKARREALGRRGEKAPPWSGRKVFRGTTTRKREACLVWDKEAQADGLYFALVMSGGPKIDDKRFVYMDGQPLQSDWQLHNGVAGKAKSCRAMPLILKHDFLRWYHRHIKNHDVNAPLEKRCVHTTTQFVFVEPDEKKGLQPRLFIRPVFKFYDPVYEVPDSHSIDKKPDCRYLIGIARGVNYPYRAAVYDCETNSIIADKFVDGRKADWERIRNELAYHQRRRDLLRNSRASSAAIQREIRAIARIRKRERGLNKVETVESIARLVDWAEENLGKCNYCFVLADLSSNLNLGRNNRVKHIAAIKEALINQMRKRGYRFKKSGKVDGVREESAWYTSAVAPSGWWAKKEEVDGAWKADKTRPLARKIGSYYCCEEIDGLHLRGVLKGLGRAKRLVLQSDDPSAPTRRRGFGSELFWDPYCTELCGHAFPQGVVLDADFIGAFNIALRPLVREELGKKAKAVDLADRHQTLNPTVALRCGVTAYEFVEVGGDPRGGLRKILLNPAEAVI 867 T 0.0053 RuvC_1 unphh F Bacteria T 7ypx 1 A,B,C A,B,C A0A9E7DT93_9CAUD Pam3 tail fiber proreins HHHHHHSSGMASINLPFSLSGSKRIPTSEELADGYQCGPLDVELDNWLMWWLTGQVDGVIEGAGLTTDDTDLARLYKAIQSMTSGNLRTVVLTAASGNLPIPSDVSVLNWVRAVGGGGAGGNSNTGNSKASGGGGGAGFDRFNVAVTPGSNVPYTVGAAGAVNGLGAGYNGGAGGSTAILGTTAGGGAGGLGVNNNATAVQVNGGTTSGTTPEISYPGGLGTEGIVGTGGGSVLSQPTQRAFTNAGNNNPANSWGGGGPGGSDFGGAWQPGGVGKQGIIIVQYFSRFAP 289 T 180 DUF777 unphh T Viruses T 7ypx 2 D,E,F a,b,c A0A9E7J192_9CAUD tail fiber chaperone MTDKHYARVVDGLVVETKTLPADFNLDDLFGPDHGWVEAPLEVEQGWRKVGAKFAPAPPPERDPASILAGLKAEASRHIFATISATAQSNLLLAVGLASAKAPSARTPEERDLLNVADEGRAWIDAVRARVHALAEHDGVTPKGEDRWPAPSEAVLEMAAKF 162 T 0.48 DUF6276 unppercent T Viruses T 7yqk 8 L K TP53B_HUMAN UDR motif of 53BP1 KAADISLDNLVEGKRKRR 18 T 4.2 FYTT pdbhh F Eukaryota T 7ytq 2 E E SER-TRP SW 2 T 48 Melittin pdbhh F F 7yuz 2 B I AP8784 XIXXXXXWXXX 11 T 0.79 RFamide_26RFa pdbhh F F 7yv1 4 D I LUNA18 XIXXXXXPXXX 11 T 0.59 YvrJ pdbhh F F 7yx8 2 C,D F,H PSGL-1-like bis-T glycopeptide TEAQTTPPPA 10 T 10 CFAP91 pdbhh F F 7yxb 3 E,F G,H CLIP peptide AFAPVSKMRMATPLLMQAGN 20 T 21 DUF3440 pdbhh F T 7yxc 2 B R NR0B2_HUMAN ORPHAN NUCLEAR RECEPTOR SHP,SMALL HETERODIMER PARTNER,NUCLEAR RECEPTOR SUBFAMILY 0 GROUP B MEMBER 2 RPAILYALLSS 11 T 7.3 GTA_holin_3TM pdbhh F Eukaryota T 7yxd 2 B,D,F,H C,F,J,N NR0B2_HUMAN ORPHAN NUCLEAR RECEPTOR SHP,SMALL HETERODIMER PARTNER,NUCLEAR RECEPTOR SUBFAMILY 0 GROUP B MEMBER 2 RPAILYALLSSS 12 T 6.2 NR_Repeat pdbhh F Eukaryota T 7yxf 1 A,B,C A,B,C CP125_MYCTU CHOLEST-4-EN-3-ONE 26-MONOOXYGENASE,CHOLEST-4-EN-3-ONE C26-MONOOXYGENASE [(25S)-3-OXOCHOLEST-4-EN-26-OATE FORMING],CHOLESTEROL C26-MONOOXYGENASE,CHOLESTEROL C26-MONOOXYGENASE [(25S)-3BETA-HYDROXYCHOLEST-5-EN-26-OATE FORMING],CYTOCHROME P450 125,STEROID C27-MONOOXYGENASE NGPSPNLPPGFDFTDPAIYAERLPVAEFAELRSAAPIWWNGQDPGKGGGFHDGGFWAITKLNDVKEISRHSDVFSSYENGVIPRFKNDIAREDIEVQRFVMLNMDAPHHTRLRKIISRGFTPRAVGRLHDELQERAQKIAAEAAAAGSGDFVEQVSCELPLQAIAGLLGVPQEDRGKLFHWSNEMTGNEDPEYAHIDPKASSAELIGYAMKMAEEKAKNPADDIVTQLIQADIDGEKLSDDEFGFFVVMLAVAGNETTRNSITQGMMAFAEHPDQWELYKKVRPETAADEIVRWATPVTAFQRTALRDYELSGVQIKKGQRVVMFYRSANFDEEVFQDPFTFNILRNPNPHVGFGGTGAHYCIGANLARMTINLIFNAVADHMPDLKPISAPERLRSGWLNGIKHWQVDYTGRCPVAH 418 T 3.4000000000000003E-22 p450 unppssm F Bacteria T 7yxn 2 C,D R,S NR0B2_HUMAN ORPHAN NUCLEAR RECEPTOR SHP,SMALL HETERODIMER PARTNER,NUCLEAR RECEPTOR SUBFAMILY 0 GROUP B MEMBER 2 RPAILYALLSS 11 T 7.3 GTA_holin_3TM pdbhh F Eukaryota T 7yxo 2 B,D,F B,D,F NR0B2_HUMAN ORPHAN NUCLEAR RECEPTOR SHP,SMALL HETERODIMER PARTNER,NUCLEAR RECEPTOR SUBFAMILY 0 GROUP B MEMBER 2 RPAILYALLSS 11 T 7.3 GTA_holin_3TM pdbhh F Eukaryota T 7yxp 2 B B NR0B2_HUMAN SHP NR Box 1 Peptide SRPAILYALLSSS 13 T 8.2 NR_Repeat pdbhh F Eukaryota T 7yxr 2 C,D R,S NR0B2_HUMAN ORPHAN NUCLEAR RECEPTOR SHP,SMALL HETERODIMER PARTNER,NUCLEAR RECEPTOR SUBFAMILY 0 GROUP B MEMBER 2 RPAILYALLSSS 12 T 6.2 NR_Repeat pdbhh F Eukaryota T 7yzj 3 C E Antigenic peptide KPLEEVLNL 9 T 5.2 IL2 pdbhh F T 7z0f 4 D P CENPJ_HUMAN CENP-J,CENTROSOMAL P4.1-ASSOCIATED PROTEIN,LAG-3-ASSOCIATED PROTEIN,LYST-INTERACTING PROTEIN 1 MVNIEERPIKAAIGERKQTFEDYLEEQIQLEEQELKQKQLKEAEGPLPIKAKPKQPFLKRGEGLARFTNAKSKFQKGKE 79 T 0.11 DUF4122 pdbpercent F Eukaryota T 7z0g 4 D,H P,U CENPJ_HUMAN CENP-J,CENTROSOMAL P4.1-ASSOCIATED PROTEIN,LAG-3-ASSOCIATED PROTEIN,LYST-INTERACTING PROTEIN 1 MVNIEERPIKAAIGERKQTFEDYLEEQIQLEEQELKQKQLKEAEGPLPIKAKPKQPFLKRGEGLARFTNAKSKFQKGKE 79 T 0.11 DUF4122 pdbpercent F Eukaryota T 7z0h 19 S X Unknown RNA Polymerase III chain XXXXXXXXXXXXX 13 F F F 7z0o 3 D D RRN5_YEAST RNA polymerase I-specific transcription initiation factor RRN5 SMEHQQLRKYVELYNKEVEEFYNGAASGRPAEFHPSKVHVKSIHEKAGTANAGVEISSVGVDWDSEEKNTFFWCLSRYSIHRVDEWRSLLPRKSAMEILGYYRLLRRASASARSRKAGDDGAPIAYEMSAEWVALETKLSETVMAITEGAAEVADEEGHCEGLIDYESWKRRWVAIYSHSRIAEIRPLPRHALPLSRSATQTLERCVSRYTRTLLWCTALAGMASRSVSARAAESRGHKSLPTVVTRRQVERALCTEARSRDLHVLPRRIVLTLRKWELDYPREGKLFRTKEMAHLFLQSQLSRRDAPPVHQDENQENQENQENQEQDNTASEGESEAERDEIDEADLFRSALHENQLLKWLSK 364 T 0.0022 Myb_DNA-binding unppercent F Eukaryota T 7z0q 3 C G CLIP peptide PVSKMRMATPLLMQAGN 17 T 55 DUF3440 pdbhh F T 7z11 2 G G peptide substrate XXXXXXXXXXXXXXXXXXXX 20 F F F 7z14 5 F,G F,G Consensus short-chain short-chain alpha-neurotoxin ScNtx MICYNQQSSQPPTTKTCSETSCYKKTWRDHRGTIIERGCGCPKVKPGIKLHCCRTDKCNN 60 F F T 7z2z 22 V X Unknown RNA polymerase III chain XXXXXXXXXXX 11 F F F 7z30 19 S X Unknown RNA Polymerase III chain XXXXXXXXXXXXXXXXX 17 F F F 7z31 19 S X Unknown RNA polymerase III chain XXXXXXXXXXXXX 13 F F F 7z34 52 EB 0 Unknown peptide XXXXXXXXXXXXXXXXXXX 19 F F F 7z34 53 FB oC Unknown protein XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 170 F F F 7z36 2 C,E C,S SMRCD_HUMAN SMARCAD1 CUE1 domain MGSSHHHHHHSQDPNSSSENLYFQGLSELEDLKDAKLQTLKELFPQRSDNDLLKLIESTSTMDGAIAAALLMF 73 T 0.00029 CUE pdbpssm F Eukaryota T 7z3n 8 H D G0RZX9_CHATD Putative heat shock protein MAESASKAAPGERVVIGITFGNSNSSIAHTVDDKAEVIANEDGDRQIPTILSYVDGDEYYGQQAKNFLVRNPKNTVAYFRDILGQDFKSVDPTHNHASAHPQEAGDNVVFTIKDKAEEDAEPSTLTVSEIATRYLRRLVGAASEYLGKKVTSAVITIPTNFTEKQKAALIAAAAAADLEVLQLISEPAAAVLAYDARPEATISDKIIVVADLGGSRSDVTVLASRSGMYTILATVHDYEYHGIALDKVLIDHFSKEFLKKNPGAKDPRENPRSLAKLRLEAESTKRALSRSTNASFSVESLIDGLDFASTINRLRYETIARTVFEGFNRLVESAVKKAGLDPLDVDEVIMSGGTSNTPRIAANFRYIFPESTRILAPSTDPSALNPSELQARGAALQASLIQEFETEDIEQSTHAAVTTMPHVTNAIGVVSVSESGEEKFVPIIAPETAVPARRTVHLDAPKEGGDVLVKVVEGSTHINVIKPEPKAKEDGETKEKTEDADDDGDFDDDDEEEEEEEEEEEKREKVWKIGSTLAEAAVRGVKKGAKVEVTINVNTDLTVIVTAREVGGKGGVRGTLSA 578 T 0.06 DUF3221 pdb F Eukaryota T 7z3n 53 BB NC Nascent chain XXXXXXXXXXXXXXXXXXXXXXXXXXX 27 F F F 7z3o 8 H D G0RZX9_CHATD Putative heat shock protein MAESASKAAPGERVVIGITFGNSNSSIAHTVDDKAEVIANEDGDRQIPTILSYVDGDEYYGQQAKNFLVRNPKNTVAYFRDILGQDFKSVDPTHNHASAHPQEAGDNVVFTIKDKAEEDAEPSTLTVSEIATRYLRRLVGAASEYLGKKVTSAVITIPTNFTEKQKAALIAAAAAADLEVLQLISEPAAAVLAYDARPEATISDKIIVVADLGGSRSDVTVLASRSGMYTILATVHDYEYHGIALDKVLIDHFSKEFLKKNPGAKDPRENPRSLAKLRLEAESTKRALSRSTNASFSVESLIDGLDFASTINRLRYETIARTVFEGFNRLVESAVKKAGLDPLDVDEVIMSGGTSNTPRIAANFRYIFPESTRILAPSTDPSALNPSELQARGAALQASLIQEFETEDIEQSTHAAVTTMPHVTNAIGVVSVSESGEEKFVPIIAPETAVPARRTVHLDAPKEGGDVLVKVVEGSTHINVIKPEPKAKEDGETKEKTEDADDDGDFDDDDEEEEEEEEEEEKREKVWKIGSTLAEAAVRGVKKGAKVEVTINVNTDLTVIVTAREVGGKGGVRGTLSA 578 T 0.06 DUF3221 pdb F Eukaryota T 7z3o 53 BB NC Nascent chain XXXXXXXXXXXXXXXXXXXXXXXXXXX 27 F F F 7z42 4 G,H,J,K Y,G,X,I RPB1_HUMAN DNA-directed RNA polymerase II subunit RPB1 YSPTSPSYSPTSPSYSPTSPSYSPTSPS 28 T 3.8E-05 RNA_pol_Rpb1_R pdbhh F Eukaryota F 7z43 7 G,H XXX,YYY SER-TYR-SER-PRO-THR-SEP-PRO YSPTSPSYSPTSPSYSPTSPSYSPTSPS 28 T 3.8E-05 RNA_pol_Rpb1_R pdbhh F F 7z44 1 A A A0A0B4N229_9CAUD Portal protein MAKQKYSEEVLDELRVDLQRRFNYAQGYVDMAVKGYAREAWEYFYGNLPAPVTAGSSSWVDRTVWESVNGTLQDIINVFCSGDEAVTFVADNQQDSDAADVATKLVNQILLRDNPGYNIISSAAQECLVTRNSFIKYYWDEQTSTQTEEAEGVPPEALAAYVQGLEAGGLKNLEVFTEENEDGTVDVKVTYEQTVKRVKVEYVPSEQIFVDEHATSFADAQYFCHRVRRSKEDLVAMGFPKDEIEAFNDWTDTMDTTQSTVAWSRTDWRQDIDADIGTDTEDIASMVWVYEHYIRTGVLDKNKESKLYQVIQAGEHILHTEEVTHIPFVTFCPYPIPGSFYGQSVYDITKDIQDLRTALVRGYIDNVNNANYGRYKALVGAYDRRSLLDNRPGGVVEMERQDAIDLFPYHNLPQGIDGLLGMSEELKETRTGVTKLGMGINPDVFKNDNAYATVGLMMNAAQNRLRMVCRNIAHNGMVELMRGIYNLIRENGEVPIEVQTPRGMIQVNPKQLPARHNLQVVVAISPNEKAERAQKLISLKQLIAADAQLAPLFGLEQDRYMTAQIFELMGIKDTHKYLLPLEQYQPPEPSPMEILQLEMTKAQVENVQASSQKMIADAFDQRERTTFEQQKAADELSLRQEELQFKQENAADAMTLENRKEDNNATLEQAKHKLALMQQQVRQYESVLKELQMVMSHQVDQEKIVQQARVQDKTLELQKKEANVTKKEQQASLKDSRIPGKRLGSKK 747 T 18 GAGA_bind pdb T Viruses T 7z47 1 A,B A,B A0A0B4N231_9CAUD Adaptor protein MAMPDVQYPINTYGWLKKAVALWADRDDDEFVNQIPNFINFAEKEIYRNLRIPPLEKEVYLDIKDGVAYIPPDYLEAQWMMRAKDGTIFQVTSPEEISYRRQHGTINPSHWNNQPVNFARFGSRFIFYPSIEADTPYYPDDGSPLIPAENSVILSYYADPPEFHEDTDTSTILTIAPELLLYFTLRHACLFVQDDNGVQKWSALGKAILDEMVEQNKKQEYSGSPIAIPNNMTRLQSSLPDIYGIRTSRV 250 T 0.025 TraD pdbpercent T Viruses T 7z47 3 D,E,F D,E,F A0A0B4N0B9_9CAUD Putative tail fiber MIVYNNQAPDAVNNVGQFGATEGSIGAYKQAAEYAADSKYWALLAESKFGTIDDLIAEVERLYQQGVLMKQDIEDLKQDFKDQDARLMSLIAQTNAAVSDANNAVALINQKLIEVQNQLDVLLGMSVDVTTLPPGTPATGSFNPNTGVISLGIPEGEPGKDGSVKDLDTAPTGVPELGDLGFYVDKDDNTVHKTTLENIANLTPSVRSVSVNGGPALDGEVALTINKETVGLGNVLNVAQYSRQEINDKFDKTTKTYQSKAEAYADAQYRQVGEKVLVWEATKYEFYTVAANKTLTPVKTEGRILTVNSRSPDSSGNIDITIPTGNPSLYLGEMVMFPYDPSKNISYPGVLPADGRLVSKESASDLGPSLVSGQLPVVSETEWQSGAKQYFSWGKLADGITDADSTNFINIRLPDWTGGEAIRAPDSDKDSQYNGSVQAQKPYVVTVNNQAPDEITGNVNISRSILGAASSGANSDITSLSGLTTPLSISQGGTGAKDAASARSNLGLGSVSTLDNVPIASGGTGAGDAAGARFNLGLGNSATMNTGTNSDNVLKVGDFGIGRPDGALVFDTTSQDQLLAGLDTYGLCVFRNNQQIAAPWDIWNYSSNLFFRAGDTYSMISIPFESAGKIKVFGGASGSGWKTSRTVYDTVNTTVDVNGFIKAASPIVKVFHDGSFETNEQSDGVSVKKISTGVYLISGCLGLNSDAGWGGVDGGFEIPIDRNKQPRVWLDYEVKEDGSLLIKTYHRTHSTSPAFARNELEGFSDGDPVDIPKDAFISVRVEMPSK 786 T 0.92 Laminin_II pdb T Viruses T 7z47 4 G,H,I I,H,C A0A0B4N235_9CAUD Putative structural protein MAIETNAVVITDLNPLYPRDRDYIYEGAAQIRLIKQTLQNTFPNVTEPVDIDSDTFKIMSEKLKFTGDAMDVGGLMIKNVTPGTGDKDVVTKGQMEAFMKNWMENKLYRIGSYYITEEDINPGDSISLGFGSWAKVTGVIMGTGVVNPDGSVPNAQRVEFQAGGTGGRVFNTIRTENVPLMTVNGSSFSLSSNTHSHNMVFGRGDASGHNSSPNWYSPGGGYSQRTDNDTHTHTISGSVSLGRDDISRQPINTLPPFRAAHIWRRIS 267 T 0.16 YadA_stalk pdb T Viruses T 7z4a 1 A,B A,N A0A0B4N231_9CAUD Adaptor protein MAMPDVQYPINTYGWLKKAVALWADRDDDEFVNQIPNFINFAEKEIYRNLRIPPLEKEVYLDIKDGVAYIPPDYLEAQWMMRAKDGTIFQVTSPEEISYRRQHGTINPSHWNNQPVNFARFGSRFIFYPSIEADTPYYPDDGSPLIPAENSVILSYYADPPEFHEDTDTSTILTIAPELLLYFTLRHACLFVQDDNGVQKWSALGKAILDEMVEQNKKQEYSGSPIAIPNNMTRLQSSLPDIYGIRTSRV 250 T 0.025 TraD pdbpercent T Viruses T 7z4a 3 D,E,F O,R,S A0A0B4N0B9_9CAUD Putative tail fiber MIVYNNQAPDAVNNVGQFGATEGSIGAYKQAAEYAADSKYWALLAESKFGTIDDLIAEVERLYQQGVLMKQDIEDLKQDFKDQDARLMSLIAQTNAAVSDANNAVALINQKLIEVQNQLDVLLGMSVDVTTLPPGTPATGSFNPNTGVISLGIPEGEPGKDGSVKDLDTAPTGVPELGDLGFYVDKDDNTVHKTTLENIANLTPSVRSVSVNGGPALDGEVALTINKETVGLGNVLNVAQYSRQEINDKFDKTTKTYQSKAEAYADAQYRQVGEKVLVWEATKYEFYTVAANKTLTPVKTEGRILTVNSRSPDSSGNIDITIPTGNPSLYLGEMVMFPYDPSKNISYPGVLPADGRLVSKESASDLGPSLVSGQLPVVSETEWQSGAKQYFSWGKLADGITDADSTNFINIRLPDWTGGEAIRAPDSDKDSQYNGSVQAQKPYVVTVNNQAPDEITGNVNISRSILGAASSGANSDITSLSGLTTPLSISQGGTGAKDAASARSNLGLGSVSTLDNVPIASGGTGAGDAAGARFNLGLGNSATMNTGTNSDNVLKVGDFGIGRPDGALVFDTTSQDQLLAGLDTYGLCVFRNNQQIAAPWDIWNYSSNLFFRAGDTYSMISIPFESAGKIKVFGGASGSGWKTSRTVYDTVNTTVDVNGFIKAASPIVKVFHDGSFETNEQSDGVSVKKISTGVYLISGCLGLNSDAGWGGVDGGFEIPIDRNKQPRVWLDYEVKEDGSLLIKTYHRTHSTSPAFARNELEGFSDGDPVDIPKDAFISVRVEMPSK 786 T 0.92 Laminin_II pdb T Viruses T 7z4a 4 G,H J,K A0A0B4N229_9CAUD Portal protein MAKQKYSEEVLDELRVDLQRRFNYAQGYVDMAVKGYAREAWEYFYGNLPAPVTAGSSSWVDRTVWESVNGTLQDIINVFCSGDEAVTFVADNQQDSDAADVATKLVNQILLRDNPGYNIISSAAQECLVTRNSFIKYYWDEQTSTQTEEAEGVPPEALAAYVQGLEAGGLKNLEVFTEENEDGTVDVKVTYEQTVKRVKVEYVPSEQIFVDEHATSFADAQYFCHRVRRSKEDLVAMGFPKDEIEAFNDWTDTMDTTQSTVAWSRTDWRQDIDADIGTDTEDIASMVWVYEHYIRTGVLDKNKESKLYQVIQAGEHILHTEEVTHIPFVTFCPYPIPGSFYGQSVYDITKDIQDLRTALVRGYIDNVNNANYGRYKALVGAYDRRSLLDNRPGGVVEMERQDAIDLFPYHNLPQGIDGLLGMSEELKETRTGVTKLGMGINPDVFKNDNAYATVGLMMNAAQNRLRMVCRNIAHNGMVELMRGIYNLIRENGEVPIEVQTPRGMIQVNPKQLPARHNLQVVVAISPNEKAERAQKLISLKQLIAADAQLAPLFGLEQDRYMTAQIFELMGIKDTHKYLLPLEQYQPPEPSPMEILQLEMTKAQVENVQASSQKMIADAFDQRERTTFEQQKAADELSLRQEELQFKQENAADAMTLENRKEDNNATLEQAKHKLALMQQQVRQYESVLKELQMVMSHQVDQEKIVQQARVQDKTLELQKKEANVTKKEQQASLKDSRIPGKRLGSKK 747 T 18 GAGA_bind pdb T Viruses T 7z4b 2 RC,SC,TC DR,DS,DT A0A0B4N235_9CAUD Putative structural protein MAIETNAVVITDLNPLYPRDRDYIYEGAAQIRLIKQTLQNTFPNVTEPVDIDSDTFKIMSEKLKFTGDAMDVGGLMIKNVTPGTGDKDVVTKGQMEAFMKNWMENKLYRIGSYYITEEDINPGDSISLGFGSWAKVTGVIMGTGVVNPDGSVPNAQRVEFQAGGTGGRVFNTIRTENVPLMTVNGSSFSLSSNTHSHNMVFGRGDASGHNSSPNWYSPGGGYSQRTDNDTHTHTISGSVSLGRDDISRQPINTLPPFRAAHIWRRIS 267 T 0.16 YadA_stalk pdb T Viruses T 7z4b 3 UC,VC DU,DV A0A0B4N231_9CAUD Adaptor MAMPDVQYPINTYGWLKKAVALWADRDDDEFVNQIPNFINFAEKEIYRNLRIPPLEKEVYLDIKDGVAYIPPDYLEAQWMMRAKDGTIFQVTSPEEISYRRQHGTINPSHWNNQPVNFARFGSRFIFYPSIEADTPYYPDDGSPLIPAENSVILSYYADPPEFHEDTDTSTILTIAPELLLYFTLRHACLFVQDDNGVQKWSALGKAILDEMVEQNKKQEYSGSPIAIPNNMTRLQSSLPDIYGIRTSRV 250 T 0.025 TraD pdbpercent T Viruses T 7z4b 5 XC,YC,ZC DX,DY,DZ A0A0B4N0B9_9CAUD Putative tail fiber MIVYNNQAPDAVNNVGQFGATEGSIGAYKQAAEYAADSKYWALLAESKFGTIDDLIAEVERLYQQGVLMKQDIEDLKQDFKDQDARLMSLIAQTNAAVSDANNAVALINQKLIEVQNQLDVLLGMSVDVTTLPPGTPATGSFNPNTGVISLGIPEGEPGKDGSVKDLDTAPTGVPELGDLGFYVDKDDNTVHKTTLENIANLTPSVRSVSVNGGPALDGEVALTINKETVGLGNVLNVAQYSRQEINDKFDKTTKTYQSKAEAYADAQYRQVGEKVLVWEATKYEFYTVAANKTLTPVKTEGRILTVNSRSPDSSGNIDITIPTGNPSLYLGEMVMFPYDPSKNISYPGVLPADGRLVSKESASDLGPSLVSGQLPVVSETEWQSGAKQYFSWGKLADGITDADSTNFINIRLPDWTGGEAIRAPDSDKDSQYNGSVQAQKPYVVTVNNQAPDEITGNVNISRSILGAASSGANSDITSLSGLTTPLSISQGGTGAKDAASARSNLGLGSVSTLDNVPIASGGTGAGDAAGARFNLGLGNSATMNTGTNSDNVLKVGDFGIGRPDGALVFDTTSQDQLLAGLDTYGLCVFRNNQQIAAPWDIWNYSSNLFFRAGDTYSMISIPFESAGKIKVFGGASGSGWKTSRTVYDTVNTTVDVNGFIKAASPIVKVFHDGSFETNEQSDGVSVKKISTGVYLISGCLGLNSDAGWGGVDGGFEIPIDRNKQPRVWLDYEVKEDGSLLIKTYHRTHSTSPAFARNELEGFSDGDPVDIPKDAFISVRVEMPSK 786 T 0.92 Laminin_II pdb T Viruses T 7z4b 7 DD,ED ED,EE A0A0B4N229_9CAUD Portal protein MAKQKYSEEVLDELRVDLQRRFNYAQGYVDMAVKGYAREAWEYFYGNLPAPVTAGSSSWVDRTVWESVNGTLQDIINVFCSGDEAVTFVADNQQDSDAADVATKLVNQILLRDNPGYNIISSAAQECLVTRNSFIKYYWDEQTSTQTEEAEGVPPEALAAYVQGLEAGGLKNLEVFTEENEDGTVDVKVTYEQTVKRVKVEYVPSEQIFVDEHATSFADAQYFCHRVRRSKEDLVAMGFPKDEIEAFNDWTDTMDTTQSTVAWSRTDWRQDIDADIGTDTEDIASMVWVYEHYIRTGVLDKNKESKLYQVIQAGEHILHTEEVTHIPFVTFCPYPIPGSFYGQSVYDITKDIQDLRTALVRGYIDNVNNANYGRYKALVGAYDRRSLLDNRPGGVVEMERQDAIDLFPYHNLPQGIDGLLGMSEELKETRTGVTKLGMGINPDVFKNDNAYATVGLMMNAAQNRLRMVCRNIAHNGMVELMRGIYNLIRENGEVPIEVQTPRGMIQVNPKQLPARHNLQVVVAISPNEKAERAQKLISLKQLIAADAQLAPLFGLEQDRYMTAQIFELMGIKDTHKYLLPLEQYQPPEPSPMEILQLEMTKAQVENVQASSQKMIADAFDQRERTTFEQQKAADELSLRQEELQFKQENAADAMTLENRKEDNNATLEQAKHKLALMQQQVRQYESVLKELQMVMSHQVDQEKIVQQARVQDKTLELQKKEANVTKKEQQASLKDSRIPGKRLGSKK 747 T 18 GAGA_bind pdb T Viruses T 7z4f 1 A,B,C A,B,C A0A0B4N235_9CAUD Putative structural protein MAIETNAVVITDLNPLYPRDRDYIYEGAAQIRLIKQTLQNTFPNVTEPVDIDSDTFKIMSEKLKFTGDAMDVGGLMIKNVTPGTGDKDVVTKGQMEAFMKNWMENKLYRIGSYYITEEDINPGDSISLGFGSWAKVTGVIMGTGVVNPDGSVPNAQRVEFQAGGTGGRVFNTIRTENVPLMTVNGSSFSLSSNTHSHNMVFGRGDASGHNSSPNWYSPGGGYSQRTDNDTHTHTISGSVSLGRDDISRQPINTLPPFRAAHIWRRIS 267 T 0.16 YadA_stalk pdb T Viruses T 7z4f 2 D,E,F D,E,F A0A0B4N0B9_9CAUD Putative tail fiber MIVYNNQAPDAVNNVGQFGATEGSIGAYKQAAEYAADSKYWALLAESKFGTIDDLIAEVERLYQQGVLMKQDIEDLKQDFKDQDARLMSLIAQTNAAVSDANNAVALINQKLIEVQNQLDVLLGMSVDVTTLPPGTPATGSFNPNTGVISLGIPEGEPGKDGSVKDLDTAPTGVPELGDLGFYVDKDDNTVHKTTLENIANLTPSVRSVSVNGGPALDGEVALTINKETVGLGNVLNVAQYSRQEINDKFDKTTKTYQSKAEAYADAQYRQVGEKVLVWEATKYEFYTVAANKTLTPVKTEGRILTVNSRSPDSSGNIDITIPTGNPSLYLGEMVMFPYDPSKNISYPGVLPADGRLVSKESASDLGPSLVSGQLPVVSETEWQSGAKQYFSWGKLADGITDADSTNFINIRLPDWTGGEAIRAPDSDKDSQYNGSVQAQKPYVVTVNNQAPDEITGNVNISRSILGAASSGANSDITSLSGLTTPLSISQGGTGAKDAASARSNLGLGSVSTLDNVPIASGGTGAGDAAGARFNLGLGNSATMNTGTNSDNVLKVGDFGIGRPDGALVFDTTSQDQLLAGLDTYGLCVFRNNQQIAAPWDIWNYSSNLFFRAGDTYSMISIPFESAGKIKVFGGASGSGWKTSRTVYDTVNTTVDVNGFIKAASPIVKVFHDGSFETNEQSDGVSVKKISTGVYLISGCLGLNSDAGWGGVDGGFEIPIDRNKQPRVWLDYEVKEDGSLLIKTYHRTHSTSPAFARNELEGFSDGDPVDIPKDAFISVRVEMPSK 786 T 0.92 Laminin_II pdb T Viruses T 7z4f 3 G,H I,H A0A0B4N231_9CAUD Adaptor protein MAMPDVQYPINTYGWLKKAVALWADRDDDEFVNQIPNFINFAEKEIYRNLRIPPLEKEVYLDIKDGVAYIPPDYLEAQWMMRAKDGTIFQVTSPEEISYRRQHGTINPSHWNNQPVNFARFGSRFIFYPSIEADTPYYPDDGSPLIPAENSVILSYYADPPEFHEDTDTSTILTIAPELLLYFTLRHACLFVQDDNGVQKWSALGKAILDEMVEQNKKQEYSGSPIAIPNNMTRLQSSLPDIYGIRTSRV 250 T 0.025 TraD pdbpercent T Viruses T 7z4f 5 J,K K,J A0A0B4N229_9CAUD Portal protein MAKQKYSEEVLDELRVDLQRRFNYAQGYVDMAVKGYAREAWEYFYGNLPAPVTAGSSSWVDRTVWESVNGTLQDIINVFCSGDEAVTFVADNQQDSDAADVATKLVNQILLRDNPGYNIISSAAQECLVTRNSFIKYYWDEQTSTQTEEAEGVPPEALAAYVQGLEAGGLKNLEVFTEENEDGTVDVKVTYEQTVKRVKVEYVPSEQIFVDEHATSFADAQYFCHRVRRSKEDLVAMGFPKDEIEAFNDWTDTMDTTQSTVAWSRTDWRQDIDADIGTDTEDIASMVWVYEHYIRTGVLDKNKESKLYQVIQAGEHILHTEEVTHIPFVTFCPYPIPGSFYGQSVYDITKDIQDLRTALVRGYIDNVNNANYGRYKALVGAYDRRSLLDNRPGGVVEMERQDAIDLFPYHNLPQGIDGLLGMSEELKETRTGVTKLGMGINPDVFKNDNAYATVGLMMNAAQNRLRMVCRNIAHNGMVELMRGIYNLIRENGEVPIEVQTPRGMIQVNPKQLPARHNLQVVVAISPNEKAERAQKLISLKQLIAADAQLAPLFGLEQDRYMTAQIFELMGIKDTHKYLLPLEQYQPPEPSPMEILQLEMTKAQVENVQASSQKMIADAFDQRERTTFEQQKAADELSLRQEELQFKQENAADAMTLENRKEDNNATLEQAKHKLALMQQQVRQYESVLKELQMVMSHQVDQEKIVQQARVQDKTLELQKKEANVTKKEQQASLKDSRIPGKRLGSKK 747 T 18 GAGA_bind pdb T Viruses T 7z4o 4 G,H JJJ,KKK SER-TYR-SER-PRO-THR-SEP-PRO-SER-TYR-SER YSPTSPSYSPTSPSYSPTSPSYSPTSPS 28 T 3.8E-05 RNA_pol_Rpb1_R pdbhh F F 7z4s 2 C,D C,D Macrocyclic peptide inhibitor XXFHXLNLGYRPGCX 15 T 2.5 DUF3228 pdbhh F T 7z50 5 I,J T,W Hybrid insulin peptide LQTLALEVEDDPCGG 15 T 0.9 DUF2405 pdbhh F T 7z53 3 E,F,K,L,Q,R,W,X E,F,K,L,Q,R,W,X Q2G0X2_STAA8 Myeloperoxidase inhibitor SPIN ANFLEHELSYIDVLLDKNADQATKDNLRSYFADKGLHSIKDIINKAKQDGFDVSKYEHV 59 T 0.046 Drf_FH3 unppssm F Bacteria T 7z5y 3 C C UDP-MurNAc-pentapeptide AXCXX 5 T 130 zf-CCHC pdbhh F F 7z5z 3 C C UDP-MurNAc-pentapeptide AXCXX 5 T 130 zf-CCHC pdbhh F F 7z6a 2 B C UDP-MurNAc-pentapeptide AXCXX 5 T 130 zf-CCHC pdbhh F F 7z6k 3 C C UDP-MurNAc-pentapeptide AXCXX 5 T 130 zf-CCHC pdbhh F F 7z6m 1 A A A0A0H3LM39_BORBR Putative membrane protein MGSSHHHHHHSSGLVPRGSHMNQPSSLAADLRGAWHAQAQSHPLITLGLAASAAGVVLLLVAGIVNALTGENRVHVGYAVLGGAAGFAATALGALMALGLRAISARTQDAMLGFAAGMMLAASAFSLILPGLDAAGTIVGPGPAAAAVVALGLGLGVLLMLGLDYFTPHEHERTGHQGPEAARVNRVWLFVLTIILHNLPEGMAIGVSFATGDLRIGLPLTSAIAIQDVPEGLAVALALRAVGLPIGRAVLVAVASGLMEPLGALVGVGISSGFALAYPISMGLAAGAMIFVVSHEVIPETHRNGHETTATVGLMAGFALMMFLDTALG 329 T 1.1E-26 Zip unp F Bacteria T 7z6n 1 A,B A,B A0A0H3LM39_BORBR Putative membrane protein MGSSHHHHHHSSGLVPRGSHMNQPSSLAADLRGAWHAQAQSHPLITLGLAASAAGVVLLLVAGIVNALTGENRVHVGYAVLGGAAGFAATALGALMALGLRAISARTQDAMLGFAAGMMLAASAFSLILPGLDAAGTIVGPGPAAAAVVALGLGLGVLLMLGLDYFTPHEHERTGHQGPEAARVNRVWLFVLTIILHNLPEGMAIGVSFATGDLRIGLPLTSAIAIQDVPEGLAVALALRAVGLPIGRAVLVAVASGLMEPLGALVGVGISSGFALAYPISMGLAAGAMIFVVSHEVIPETHRNGHETTATVGLMAGFALMMFLDTALG 329 T 1.1E-26 Zip unp F Bacteria T 7z6q 1 A,K A,a Q8KAY0_CHLTE Photosystem P840 reaction center, large subunit MAEQVKPAGVKPKGTVPPPKGNAPAPKANGAPGGASVIKEQDAAKMRRFLFQRTETRSTKWYQIFDTEKLDDEQVVGGHLALLGVLGFIMGIYYISGIQVFPWGAPGFHDNWFYLTIKPRMVSLGIDTYSTKTADLEAAGARLLGWAAFHFLVGSVLIFGGWRHWTHNLTNPFTGRCGNFRDFRFLGKFGDVVFNGTSAKSYKEALGPHAVYMSLLFLGWGIVMWAILGFAPIPDFQTINSETFMSFVFAVIFFALGIYWWNNPPNAAIHLNDDMKAAFSVHLTAIGYINIALGCIAFVAFQQPSFAPYYKELDKLVFYLYGEPFNRVSFNFVEQGGKVISGAKEFADFPAYAILPKSGEAFGMARVVTNLIVFNHIICGVLYVFAGVYHGGQYLLKIQLNGMYNQIKSIWITKGRDQEVQVKILGTVMALCFATMLSVYAVIVWNTICELNIFGTNITMSFYWLKPLPIFQWMFADPSINDWVMAHVITAGSLFSLIALVRIAFFAHTSPLWDDLGLKKNSYSFPCLGPVYGGTCGVSIQDQLWFAMLWGIKGLSAVCWYIDGAWIASMMYGVPAADAKAWDSIAHLHHHYTSGIFYYFWTETVTIFSSSHLSTILMIGHLVWFISFAVWFEDRGSRLEGADIQTRTIRWLGKKFLNRDVNFRFPVLTISDSKLAGTFLYFGGTFMLVFLFLANGFYQTNSPLPPPVSHAAVSGQQMLAQLVDTLMKMIA 731 F F Bacteria T 7z6u 2 B D Pimtide ARKRRRHPSGPPTA 14 T 1.5 DUF3019 pdbhh F T 7z8e 2 B B GLY-SER-ASP-VAL-ALA GSDVA 5 T 190 TatD_DNase pdbhh F F 7z8e 3 C D SER-SER SS 2 T 860 GoLoco pdbhh F F 7z8f 12 Y Y A0A4X1TB62_PIG DYNACTIN SUBUNIT 4 MASLLQSERVLYLVQGEKKVRAPLSQLYFCRYCSELRSLECVSHEVDSHYCPSCLENMPSAEAKLKKNRCANCFDCPGCMHTLSTRATSISTQLPDDPAKTAVKKAYYLACGFCRWTSRDVGMADKSVASGGWQEPDHPHTQRMNKLIEYYQQLAQKEKVERDRKKLARRRNYMPLAFSQHTIHVVDKYGLGTRLQRPRAGTTITALAGLSLKEGEDQKEIKIEPAQAVDEVEPLPEDYYTRPVNLTEVTTLQQRLLQPDFQPICASQLYPRHKHLLIKRSLRCRQCEHNLSKPEFNPTSIKFKIQLVAVNYIPEVRIMSIPNLRYMKESQVLLTLTNPVENLTHVTLLECEEGDPDDTNSTAKVSVPPTELVLAGKDAAAEYDELAEPQDFPDDPDVVAFRKANKVGVFIKVTPQREEGDVTVCFKLKHDFKNLAAPIRPVEEADPGAEVSWLTQHVELSLGPLLP 467 T 3.9000000000000004E-28 Dynactin_p62 pdbpercent F Eukaryota T 7z8m 4 D Y A0A4X1TB62_PIG DYNACTIN SUBUNIT 4 MASLLQSERVLYLVQGEKKVRAPLSQLYFCRYCSELRSLECVSHEVDSHYCPSCLENMPSAEAKLKKNRCANCFDCPGCMHTLSTRATSISTQLPDDPAKTAVKKAYYLACGFCRWTSRDVGMADKSVASGGWQEPDHPHTQRMNKLIEYYQQLAQKEKVERDRKKLARRRNYMPLAFSQHTIHVVDKYGLGTRLQRPRAGTTITALAGLSLKEGEDQKEIKIEPAQAVDEVEPLPEDYYTRPVNLTEVTTLQQRLLQPDFQPICASQLYPRHKHLLIKRSLRCRQCEHNLSKPEFNPTSIKFKIQLVAVNYIPEVRIMSIPNLRYMKESQVLLTLTNPVENLTHVTLLECEEGDPDDTNSTAKVSVPPTELVLAGKDAAAEYDELAEPQDFPDDPDVVAFRKANKVGVFIKVTPQREEGDVTVCFKLKHDFKNLAAPIRPVEEADPGAEVSWLTQHVELSLGPLLP 467 T 3.9000000000000004E-28 Dynactin_p62 pdbpercent F Eukaryota T 7z8o 2 B B Stapled peptide XCPYVAGXXTCLXX 14 T 0.61 Paired_CXXCH_1 pdbhh F T 7zak 3 C C Synthetic peptide KNLEKYKGKFVREID 15 T 0.42 DUF5678 pdbhh F T 7zax 2 B B THAN_PODMA Thanatin-like derivative XPITYXNRXTXKCXRY 16 T 2.6 YihI unphh F Eukaryota T 7zb0 2 E,F,G,H E,F,G,H 15mer GFPWXIXXXXXXVIG 15 T 0.038 DUF2897 pdbhh F T 7zb1 2 E,F,G,H E,F,G,H 18mer WXIXXXXXXVXXSXMSTE 18 T 0.31 DUF2897 pdbhh F T 7zbq 1 A A Q8GF97_PHOLU TccC3 MSTTSTNLQKKSFTLYRADNRSFEEMQSKFPEGFKAWTPLDTKMARQFASIFIGQKDTSNLPKETVKNISTWGAKPKLKDLSNYIKYTKDKSTVWVSTAINTEAGGQSSGAPLHKIDMDLYEFAIDGQKLNPLPEGRTKNMVPSLLLDTPQIETSSIIALNHGPVNDAEISFLTTIPLKNVKPHKRGTLEVLFQ 194 T 0.087 UFC1 pdb F Bacteria T 7zcu 3 C S Q6N9P5_RHOPA LIGHT-HARVESTING PROTEIN B-800-850 SUBUNIT GAMMA MSEEYKGHSGHPLILKQEGEYKGYSGEPLILKQEGEYKGYSGTPLILEQKGEYQSFSGTPLILKQEGEYRGFSGAPLILKQDGEYKSFSGYPLLLNI 97 T 0.18 DUF3823 pdb F Bacteria T 7zcx 1 A AAA SLAA_SULAC SURFACE LAYER LARGE PROTEIN MNKLVGLLVSSLFLASILIGIAPAITTTALTPPVSAGGIQAYLLTGSGAPASGLVLFVVNVSNIQVSSSNVTNVISTVVSNIQINAKTENAQTGATTGSVTVRFPTSGYNAYYDSVDKVVFVVVSFLYPYTTTSVNIPLSYLSKYLPGLLTAQPYDETGAQVTSVSSTPFGSLIDTSTGQQILGTNPVLTSYNSYTTQANTNMQEGVVSGTLTSFTLGGQSFSGSTVPVILYAPFIFSNSPYQAGLYNPMQVNGNLGSLSSEAYYHPVIWGRALINTTLIDTYASGSVPFTFQLNYSVPGPLTINMAQLAWIASINNLPTSFTYLSYKFSNGYESFLGIISNSTQLTAGALTINPSGNFTINGKKFYVYLLVVGSTNSTTPVEYVTKLVVEYPSSTNFLPQGVTVTTSSNKYTLPVYEIGGPAGTTITLTGNWYSTPYTVQITVGSTPTLTNYVSQILLKAVAYEGINVSTTQSPYYSTAILSTPPSEISITGSSTITAQGKLTATSASATVNLLTNATLTYENIPLTQYSFNGIIVTPGYAAINGTTAMAYVIGALYNKTSDYVLSFAGSQEPMQVMNNNLTEVTTLAPFGLTLLAPSVPATETGTSPLQLEFFTVPSTSYIALVDFGLWGNLTSVTVSAYDTVNNKLSVNLGYFYGIVIPPSISTAPYNYQNFICPNNYVTVTIYDPDAVLDPYPSGSFTTSSLPLKYGNMNITGAVIFPGSSVYNPSGVFGYSNFNKGAAVTTFTYTAQSGPFSPVALTGNTNYLSQYADNNPTDNYYFIQTVNGMPVLMGGLSIVASPVSASLPSSTSSPGFMYLLPSAAQVPSPLPGMATPNYNLNIYITYKIDGATVGNNMINGLYVASQNTLIYVVPNGSFVGSNIKLTYTTTDYAVLHYFYSTGQYKVFKTVSVPNVTANLYFPSSTTPLYQLSVPLYLSEPYYGSPLPTYIGLGTNGTSLWNSPNYVLFGVSAVQQYLGFIKSISVTLSNGTTVVIPLTTSNMQTLFPQLVGQELQACNGTFQFGISITGLEKLLNLNVQQLNNSILSVTYHDYVTGETLTATTKLVALSTLSLVAKGAGVVEFLLTAYPYTGNITFAPPWFIAENVVKQPFMTYSDLQFAKTNPSAILSLSTVNITVVGLGGKASVYYNSTSGQTVITNIYGQTVATLSGNVLPTLTELAAGNGTFTGSLQFTIVPNNTVVQIPSSLTKTSFAVYTNGSLAIVLNGKAYSLGPAGLFLLPFVTYTGSAIGANATAIITVSDGVGTSTTQVPITAENFTPIRLAPFQVPAQVPLPNAPKLKYEYNGSIVITPQQQVLKIYVTSILPYPQEFQIQAFVYEASQFNVHTGSPTAAPVYFSYSAVRAYPALGIGTSVPNLLVYVQLQGISNLPAGKYVIVLSAVPFAGGPVLSEYPAQLIFTNVTLTQ 1424 T 1.2 Tcp10_C pdbpssm F Archaea T 7zdi 3 C S Q6N9P5_RHOPA PucA-LH2-gamma MSEEYKGHSGHPLILKQEGEYKGYSGEPLILKQEGEYKGYSGTPLILEQKGEYQSFSGTPLILKQEGEYRGFSGAPLILKQDGEYKSFSGYPLLLNI 97 T 0.18 DUF3823 pdb F Bacteria T 7ze3 3 C S Q6N9P5_RHOPA PucA-LH2-gamma MSEEYKGHSGHPLILKQEGEYKGYSGEPLILKQEGEYKGYSGTPLILEQKGEYQSFSGTPLILKQEGEYRGFSGAPLILKQDGEYKSFSGYPLLLNI 97 T 0.18 DUF3823 pdb F Bacteria T 7ze8 3 S S Q6N9P5_RHOPA PucA-LH2-gamma MSEEYKGHSGHPLILKQEGEYKGYSGEPLILKQEGEYKGYSGTPLILEQKGEYQSFSGTPLILKQEGEYRGFSGAPLILKQDGEYKSFSGYPLLLNI 97 T 0.18 DUF3823 pdb F Bacteria T 7zed 2 B B THAN_PODMA Thanatin-like derivative XPITYXNRXTXKCXRY 16 T 2.6 YihI unphh F Eukaryota T 7zfr 3 C C Synthetic peptide IEFVFKNKAKEL 12 T 6.8 DUF4566 pdbhh F T 7zg0 1 A,B A,B IL27A_MOUSE IL-27 SUBUNIT ALPHA,IL-27-A,IL27-A,P28 MGILPSPGMPALLSLVSLLSVLLMGCVAETGFPTDPLSLQELRREFTVSLYLARKLLSEVQGYVHSFAESRLPGVNLDLLPLGYHLPNVSLTFQAWHHLSDSERLCFLATTLRPFPAMLGGLGTQGTWTSSEREQLWAMRLDLRDLHRHLRFQVLAAGFKCSKEEEDKEEEEEEEEEEKKLPLGALGGPNQVSSQVSWPQLLYTYQLLHSLELVLSRAVRDLLLLSLPRRPGSAWDSGTKHHHHHH 246 T 0.077 XK-related unp F Eukaryota T 7zgl 1 A,B,C A,B,C CP125_MYCTU CHOLEST-4-EN-3-ONE 26-MONOOXYGENASE,CHOLEST-4-EN-3-ONE C26-MONOOXYGENASE [(25S)-3-OXOCHOLEST-4-EN-26-OATE FORMING],CHOLESTEROL C26-MONOOXYGENASE,CHOLESTEROL C26-MONOOXYGENASE [(25S)-3BETA-HYDROXYCHOLEST-5-EN-26-OATE FORMING],CYTOCHROME P450 125,STEROID C27-MONOOXYGENASE NGPSPNLPPGFDFTDPAIYAERLPVAEFAELRSAAPIWWNGQDPGKGGGFHDGGFWAITKLNDVKEISRHSDVFSSYENGVIPRFKNDIAREDIEVQRFVMLNMDAPHHTRLRKIISRGFTPRAVGRLHDELQERAQKIAAEAAAAGSGDFVEQVSCELPLQAIAGLLGVPQEDRGKLFHWSNEMTGNEDPEYAHIDPKASSAELIGYAMKMAEEKAKNPADDIVTQLIQADIDGEKLSDDEFGFFVVMLAVAGNETTRNSITQGMMAFAEHPDQWELYKKVRPETAADEIVRWATPVTAFQRTALRDYELSGVQIKKGQRVVMFYRSANFDEEVFQDPFTFNILRNPNPHVGFGGTGAHYCIGANLARMTINLIFNAVADHMPDLKPISAPERLRSGWLNGIKHWQVDYTGRCPVAH 418 T 3.4000000000000003E-22 p450 unppssm F Bacteria T 7zgv 1 A A A0A2I5TBB8_SERS3 Serratia NucC KEEKLTMTNQAKKLSRINGREFLKQSFNLQQQLLASQLNLSRTITHDGTMGEVNESYFLSIIRQYLPERYSVDRGVVVDSEGQTSDQIDAVIFDRHYTPTLLDQQGHRFIPAEAVYAVLEVKPTINKTYLEYAADKAASVRKLYRTSTVIKNIYGTAKPVEHFPIVAGIVAIDVEWQDGLGKAFTENLQAVSSDENRKLDCGLAVSGACFDSYDEEIKIRSGENALIFFLFRLLGKLQSLGTVPAIDWRVYIDSLE 256 T 0.1 UPF0102 unppercent F Bacteria T 7zgw 1 A,B,C,D,E,F A,B,D,C,E,F A0A2I5TBB8_SERS3 Serratia NucC KEEKLTMTNQAKKLSRINGREFLKQSFNLQQQLLASQLNLSRTITHDGTMGEVNESYFLSIIRQYLPERYSVDRGVVVDSEGQTSDQIDAVIFDRHYTPTLLDQQGHRFIPAEAVYAVLEVKPTINKTYLEYAADKAASVRKLYRTSTVIKNIYGTAKPVEHFPIVAGIVAIDVEWQDGLGKAFTENLQAVSSDENRKLDCGLAVSGACFDSYDEEIKIRSGENALIFFLFRLLGKLQSLGTVPAIDWRVYIDSLE 256 T 0.1 UPF0102 unppercent F Bacteria T 7zhj 1 A,D,E,G,K,L,O,P,S,T,X,Y R,Q,S,N,M,O,J,L,U,K,P,T FIBL2_BPT5 L-shaped tail fiber protein p132 MSTENRVIDLVVDENVPYGLLMQFMDVDDSVYPSTSKPVDLTDFSLRGSIKSSLEDGAETVASFTTAIVDAAQGVASISLPVSAVTTIASKASKERDRYNPRQRLAGYYDVIITRTAVGSAASSFRIMEGKVYISDGVTQ 140 T 3.3E-05 BppU_N pdbhh T Viruses T 7zhj 2 B,H,Q I,H,G TAIL1_BPT5 TAIL PROTEIN P140 MFYSLMRESKIVIEYDGRGYHFDALSNYDASTSFQEFKTLRRTIHNRTNYADSIINAQDPSSISLAINFSTTLIESNFFDWMGFTREGNSLFLPRNTPNIEPIMFNMYIINHNNSCIYFENCYVSTVDFSLDKSIPILNVGIESGKFSEVSTFRDGYTITQGEVLPYSAPAVYTNSSPLPALISASMSFQQQCSWREDRNIFDINKIYTNKRAYVNEMNASATLAFYYVKRLVGDKFLNLDPETRTPLIIKNKYVSITFPLARISKRLNFSDLYQVEYDVIPTADSDPVEINFFGERK 298 T 0.023 DUF4965 pdbpssm T Viruses T 7zhj 5 AA,BA,Z g,e,f Q7Y5E2_BPT5Z Pore-forming tail tip protein pb2 MTDKLIRELLIDVKQKGATRTAKSIENVSDALENAAAASELTNEQLGKMPRTLYSIERAADRAAKSLTKMQASRGMAGITKSIDGIGDKLDYLAIQLIEVTDKLEIGFDGVSRSVKAMGNDVAAATEKVQDRLYDTNRALGGTSKGFNDTAGAAGRASRALGNTSGSARGATRDFAAMAKIGGRLPIMYAALASNVFVLQTAFESLKVGDQLNRLEQFGTIVGTMTGTPVQTLALSLQNATNGAISFEEAMRQASSASAYGFDSEQLEQFGLVARRAAAVLGVDMTDALNRVIKGVSKQEIELLDELGVTIRLNDAYENYVKQLNATSTGIKYTVDSLTTYQKQQAYANEVIAESTRRFGYLDDALKATSWEQFAANANSALRSLQQSAATYLNPVMDTLNTFLYQTKSSQMRVSAMARSASAKTTPAENVTALIENAVGAREDLDTYLKESEERVKKAQELKQQLDDLKAKQAATAPIANALTAGGIGGDESNKLVVQLTNELARQNKEIEERTKTEKVLRQAVQDTGEALLRNGKLAEQLGAKMKYADTAVPGDKGVFEVDPNNLKAVSEIQKNFDFLKKSSSDTANNIRMAASSITNAKKASSDLNSVVKAVEDTSKVTGQSADTLVKNLNLGFSSLDQMKAAQKGLSEYVTAMDKSEQNALEVAKRKDEVYNQTKDKAKAEAAAREVLLRQQQEQLTAAKALLAINPNDPEALKQVAKIETEILNTKAQGFENAKKTKDYTDKILGVDREIALLNDRTMTSTQYRLAQLRLELQLEQEKTELYSKQADGQAKVEQSRRAQAQISREIWEAEKQGTASHVSALMDALEVSQTQRNVTGQSQILTERLSILQQQLELSKGNTEEELKYRNEIYKTSAALEQLKKQRESQMQQQVGSSVGATYTSTTGLIGEDKDFADMQNRMASYDQAISKLSELNSEATAVAQSMGNLTNAMIQFSQGSLDTTSMIASGMQTVASMIQYSTSQQVSAIDQAIAAEQKRDGKSEASKAKLKKLEAEKLKIQQDAAKKQIIIQTAVAVMQAATAVPYPFSIPLMVAAGLAGALALAQASSASGMSSIADSGADTTQYLTLGERQKNVDVSMQASSGELSYLRGDKGIGNANSFVPRAEGGMMYPGVSYQMGEHGTEVVTPMVPMKATPNDQLSDGSKTTSGRPIILNISTMDAASFRDFASNNSTAFRDAVELALNENGTTLKSLGNS 1219 T 0.11 Asp-Al_Ex unppssm T Viruses T 7zhl 1 A,B A,B Q8ZRL0_SALTY RHS repeat protein GASTATVGRWMGPAEYQQMLDTGTVVQSSTGTTHVAYPADIDAFGKQAKNGAMYVEFDVPEKSLVPTNEGWAKIVGPDSIEGRLAKRKGLPVPEMPTAENITVRGEKINGEVEAKC 116 T 0.69 Ntox47 unphh F Bacteria T 7zhm 1 A,B A,B Q8ZRL0_SALTY TYPE IV SECRETION PROTEIN RHS MTATVGRWMGPAEYQQMLDTGTVVQSSTGTTHVAYPADIDAFGKQAKNGAMYVEFDVPEKSLVPTNEGWAKIVGPDSIEGRLAKRKGLPVPEMPTAENITVRGEKINGEVEAKC 114 T 0.69 Ntox47 unphh F Bacteria T 7zhm 2 C,D C,D A0A0H3TET1_SALTM Immunity protein TriTu MLNKFKLWVSKHTDYTVIHNENDLSYSIIIDFEDDRYISRFTVWDDLSCMSEVMDVDTGLYKLNKRNEFSTFDELLDIFDDFMISIK 87 T 0.21 Spindle_Spc25 pdb F Bacteria T 7zic 1 A,B,C A,B,C CP125_MYCTU CHOLEST-4-EN-3-ONE 26-MONOOXYGENASE,CHOLEST-4-EN-3-ONE C26-MONOOXYGENASE [(25S)-3-OXOCHOLEST-4-EN-26-OATE FORMING],CHOLESTEROL C26-MONOOXYGENASE,CHOLESTEROL C26-MONOOXYGENASE [(25S)-3BETA-HYDROXYCHOLEST-5-EN-26-OATE FORMING],CYTOCHROME P450 125,STEROID C27-MONOOXYGENASE NGPSPNLPPGFDFTDPAIYAERLPVAEFAELRSAAPIWWNGQDPGKGGGFHDGGFWAITKLNDVKEISRHSDVFSSYENGVIPRFKNDIAREDIEVQRFVMLNMDAPHHTRLRKIISRGFTPRAVGRLHDELQERAQKIAAEAAAAGSGDFVEQVSCELPLQAIAGLLGVPQEDRGKLFHWSNEMTGNEDPEYAHIDPKASSAELIGYAMKMAEEKAKNPADDIVTQLIQADIDGEKLSDDEFGFFVVMLAVAGNETTRNSITQGMMAFAEHPDQWELYKKVRPETAADEIVRWATPVTAFQRTALRDYELSGVQIKKGQRVVMFYRSANFDEEVFQDPFTFNILRNPNPHVGFGGTGAHYCIGANLARMTINLIFNAVADHMPDLKPISAPERLRSGWLNGIKHWQVDYTGRCPVAH 418 T 3.4000000000000003E-22 p450 unppssm F Bacteria T 7zjj 1 A A CspZ GAMGRLNQRNINELKIFVEKAKYYSIKLDAIYSEYTGAYNDIMTYIMTYSEGTSSDKSKVNQAISILKKDNKIVNKFKELEKIIEEYKPMFLSKLIDDFAIELDQAVDNDVSNARHVADSYEKLRKSVALAYIESFDVISSKFVDSKFVEASKKFVNKAKEFVEENDLIALKCIVKTIGDMVNDREINSRSRYNNFYKKEADFLGAAVELEGAYKAIKQTLL 222 T 3.4 Pepsin-I3 pdbhh F T 7zjk 1 A,B A,B CspZ GAMGRLNQRNINELKIFFEKAKYYSIKLDAIYNEYTEAYNDIMTYSEVNNVTDSDKSKVNQAISILKKDNKIVNKFKELEKIIEEYKPIFLSKLIDDFAIELDQAVDNDVSNARHVADSYKKLRKSVVLAYIESFDVISSKFVDSKFVEASKKFVNKAKEFVEENDLIALECIVKTIGDMVNDREINSRSRYDNFYKKEADFLGAAVELEGAYKAIKQTLL 221 T 0.0086 LEF-9 pdbpercent F T 7zjm 1 A A CspZ GAMGRLNQRNINELKIFFEKAKYYSIKLDAIYNEYTEAYNDIMTYSEVNNVTDSDKSKVNQAISILKKDNKIVNKFKELEKIIEEYKPIFLSKLIDDFAIELDQAVDNDVSNARHVADSYKKLRKSVVLAYIESFDVISSKFVDSKFVEASKKFVNKAKEFVEENDLIALECIVKTIGDMVNDREINSRSRYDNFYKKEADFLGAAVELEGAYKAIKQTLL 221 T 0.0086 LEF-9 pdbpercent F T 7zjs 3 E,F E,F SGO1_HUMAN SEROLOGICALLY DEFINED BREAST CANCER ANTIGEN NY-BR-85,SHUGOSHIN-LIKE 1 SNDAYNFNLEE 11 T 0.26 Menin unp F Eukaryota T 7zjy 1 A A L7IQQ2_MAGOP MAX effector protein GPHMADCTLGCKYLENNRWVSVSKSANIGDTLYIMGHSTKIGRGCKPETTEWSDAEIYSW 60 T 0.031 TGBp3 unppssm F Eukaryota T 7zk0 1 A A L7JNQ8_MAGOP MAX effector protein GPHMGKHGRDDYDCTVIFRNNHAPERQPIVVHTYYSRDLPIELDGVRHTIQLSGCTPEQSQIPQGYSVEHMTYKNYLRQEILNERPFWP 89 T 1.4 Sec-ASP3 pdbhh F Eukaryota T 7zkd 1 A A L7ISI9_MAGOP MAX effector protein GPHMNNVMASSSSDTDSDSSPDRGLSRMCCVYKIHPGGNIWSTKKGEQAWFRRRFSKYEVMAYDRCNLEWGFSGKPRGLTFEFLWDKEAAADGTC 95 T 5.1 PetN unphh F Eukaryota T 7zkp 13 M C Q6CG31_YARLI assembly factor CIA84 MPKNALLRSARQVAISRVFATSRASHVVSHAPILASVRPRSNPAPYRRNFSSSRALRNDYGLDTAERSLKESLVPFNGAPVDRKVVRDQLMELISVSPGQVFPISVIPVVKSAYYELFRENERVLSAGDTKTLFGAVAGNNPEDVQDLPFVLAVYHQAEQAAETNRDSRDNILLLGKYFLFQDRLDNFWKLLEAQIKTHDDVDAGFVKQLLELISVDPHLTLGNVARVLQLKTDNHVSSSDELRNALSATLEQLYYKENEGSEFFLSLVENHILDSKDFTPSDSVVAMILNTCVNEGREDLGQSVLRNVVSRVGNLSPGQEDPQNCWGFWSSVAMDLHGSKTDVKAFISRLEALPHRTKATWDILIRYAVFKADLAGRNDLLQVRALLAEMQKVGFEPDAETYFDAYRSSKSIKPDVVHLFEAELDIEKDTSIFAIEMDKALKNHDTLEALSIFYESFEQGAQWENKRLHMEAMTELLIQYAGLNDTSVADILQLVQRIEPICAQGRIPYSAETAIAQNVLQRHSDTANFYTFMNRQYGNTADKVTKQDPQIRPHTYQVIHDYIYSCESERADLAWEMYGLLHKFYVVPFADYYKAIKFFAQDVKRQDYALLTFQQIRKNHDLHGQPAATSEMVAFLFHEFAKTKYKRGIKRLHEVVALETSFDVNRDVLNEMMAAYVSVEDLNRVQDCWAQLQQLPPSIGANNRSVDVLLSYFKDNIHYTERTWQGIPEFGLLPTLENYEQYLINNCRTGNYRRALEITKNMEIDSGLKPTAKIIAAVYNYTFTEQRKLEVEQWAEKAHPEMWLELKEGDKLKSLCLPANSDNDNVESLLKQASADMDEEMSGGIVKVESV 852 T 0.00062 PPR_2 unppssm F Eukaryota T 7zkq 4 D C Q6CG31_YARLI complex I assembly factor CIA84 MPKNALLRSARQVAISRVFATSRASHVVSHAPILASVRPRSNPAPYRRNFSSSRALRNDYGLDTAERSLKESLVPFNGAPVDRKVVRDQLMELISVSPGQVFPISVIPVVKSAYYELFRENERVLSAGDTKTLFGAVAGNNPEDVQDLPFVLAVYHQAEQAAETNRDSRDNILLLGKYFLFQDRLDNFWKLLEAQIKTHDDVDAGFVKQLLELISVDPHLTLGNVARVLQLKTDNHVSSSDELRNALSATLEQLYYKENEGSEFFLSLVENHILDSKDFTPSDSVVAMILNTCVNEGREDLGQSVLRNVVSRVGNLSPGQEDPQNCWGFWSSVAMDLHGSKTDVKAFISRLEALPHRTKATWDILIRYAVFKADLAGRNDLLQVRALLAEMQKVGFEPDAETYFDAYRSSKSIKPDVVHLFEAELDIEKDTSIFAIEMDKALKNHDTLEALSIFYESFEQGAQWENKRLHMEAMTELLIQYAGLNDTSVADILQLVQRIEPICAQGRIPYSAETAIAQNVLQRHSDTANFYTFMNRQYGNTADKVTKQDPQIRPHTYQVIHDYIYSCESERADLAWEMYGLLHKFYVVPFADYYKAIKFFAQDVKRQDYALLTFQQIRKNHDLHGQPAATSEMVAFLFHEFAKTKYKRGIKRLHEVVALETSFDVNRDVLNEMMAAYVSVEDLNRVQDCWAQLQQLPPSIGANNRSVDVLLSYFKDNIHYTERTWQGIPEFGLLPTLENYEQYLINNCRTGNYRRALEITKNMEIDSGLKPTAKIIAAVYNYTFTEQRKLEVEQWAEKAHPEMWLELKEGDKLKSLCLPANSDNDNVESLLKQASADMDEEMSGGIVKVESV 852 T 0.00062 PPR_2 unppssm F Eukaryota T 7zkr 2 B B Pen3-ortho XDAXYTWECLAWPX 14 T 3.9 Stealth_CR1 pdbhh F T 7zkx 1 A A SRPK2_HUMAN SFRS PROTEIN KINASE 2,SERINE/ARGININE-RICH PROTEIN-SPECIFIC KINASE 2,SR-PROTEIN-SPECIFIC KINASE 2 PVKIGDLFNGRYHVIRKLGWGHFSTVWLCWDMQGKRFVAMKVVKSAQHYTETALDEIKLLKCVRESDPSDPNKDMVVQLIDDFKISGMNGIHVCMVFEVLGHHLLKWIIKSNYQGLPVRCVKSIIRQVLQGLDYLHSKCKIIHTDIKPENILMCVDDAYVRRMAAEATEWQKAGAPPPSGSAVSTAPQQKPIGKISKNKKKKLKKKQKRQAELLEKRLQEIEELEREAERKIIEENITSAAPSNDQDGEYCPEVKLKTTGLEEAAEAETAKDNGEAEDQEEKEDAEKENIEKDEDDVDQELANIDPTWIESPKTNGHIENGPFSLEQQLDDEDDDEEDCPNPEEYNLDEPNAESDYTYSSSYEQFNGELPNGRHKIPESQFPEFSTSLFSGSLEPVACGSVLSEGSPLTEQEESSPSHDRSRTVSASSTGDLPKAKTRAADLLVNPLDPRNADKIRVKIADLGNACWVHKHFTEDIQTRQYRSIEVLIGAGYSTPADIWSTACMAFELATGDYLFEPHSGEDYSRDEDHIAHIIELLGSIPRHFALSGKYSREFFNRRGELRHITKLKPWSLFDVLVEKYGWPHEDAAQFTDFLIPMLEMVPEKRASAGECLRHPWLNS 619 T 5.4E-20 Pkinase unppercent F Eukaryota T 7zl3 3 C C Protein transport protein Sec61 subunit beta XXXXXXXXXXXXXXXXXXXXXXXXXXXXX 29 F F F 7zl3 4 D E Cyclic depsipeptide signal peptide mimic XXXXXL 6 T 2200 WW pdbhh F F 7zl7 2 B,D D,B Pen8-ortho XDACYTWEXLAWPX 14 T 0.32 DUF1666 pdbhh F T 7zlg 5 E P Synthetic octapeptide WEHI 1886493 GSWAKWS 7 T 1.6 TMEM131_like pdbhh F F 7zlj 5 E P synthetic octapeptide WEHI 1886493 GSWAKWS 7 T 1.6 TMEM131_like pdbhh F F 7zlt 1 A,B,C A,B,C CP125_MYCTU CHOLEST-4-EN-3-ONE 26-MONOOXYGENASE,CHOLEST-4-EN-3-ONE C26-MONOOXYGENASE [(25S)-3-OXOCHOLEST-4-EN-26-OATE FORMING],CHOLESTEROL C26-MONOOXYGENASE,CHOLESTEROL C26-MONOOXYGENASE [(25S)-3BETA-HYDROXYCHOLEST-5-EN-26-OATE FORMING],CYTOCHROME P450 125,STEROID C27-MONOOXYGENASE NGPSPNLPPGFDFTDPAIYAERLPVAEFAELRSAAPIWWNGQDPGKGGGFHDGGFWAITKLNDVKEISRHSDVFSSYENGVIPRFKNDIAREDIEVQRFVMLNMDAPHHTRLRKIISRGFTPRAVGRLHDELQERAQKIAAEAAAAGSGDFVEQVSCELPLQAIAGLLGVPQEDRGKLFHWSNEMTGNEDPEYAHIDPKASSAELIGYAMKMAEEKAKNPADDIVTQLIQADIDGEKLSDDEFGFFVVMLAVAGNETTRNSITQGMMAFAEHPDQWELYKKVRPETAADEIVRWATPVTAFQRTALRDYELSGVQIKKGQRVVMFYRSANFDEEVFQDPFTFNILRNPNPHVGFGGTGAHYCIGANLARMTINLIFNAVADHMPDLKPISAPERLRSGWLNGIKHWQVDYTGRCPVAH 418 T 3.4000000000000003E-22 p450 unppssm F Bacteria T 7zlv 1 A,B,C i,h,j FIBC_BPT5 TAIL PROTEIN PB4 MISNNAPAKMVLNSVLTGYTLAYIQHSIYSDYDVIGRSFWLKEGSNVTRRDFTGIDTFSVTINNLKPTTTYEVQGAFYDSIIDSELLNAQIGINLSDKQTFKMKSAPRITGARCESEPVDVGVGAPIVYIDTTGEADYCTIELKDNSNANNPWVKYYVGALMPTIMFGGVPIGSYKVRISGQISLPDGVTIDSSGYYEYPNVFEVRYNFVPPAAPINIVFKAARIADGKERYDLRVQWDWNRGAGANVREFVLSYIDSAEFVRTGWTKAQKINVGAAQSATIISFPWKVEHKFKVSSIAWGPDAQDVTDSAVQTFILNESTPLDNSFVNETGIEVNYAYIKGKIKDGSTWKQTFLIDAATGAINIGLLDAEGKAPISFDPVKKIVNVDGSVITKTINAANFVMTNLTGQDNPAIYTQGKTWGDTKSGIWMGMDNVTAKPKLDIGNATQYIRYDGNILRISSEVVIGTPNGDIDIQTGIQGKQTVFIYIIGTSLPAKPTSPAYPPSGWSKTPPNRTSNTQNIYCSTGTLDPVTNQLVSGTSWSDVVQWSGTEGVDGRPGATGQRGPGMYSLAIANLTAWNDSQANSFFTSNFGSGPVKYDVLTEYKSGAPGTAFTRQWNGSAWTSPAMVLHGDMIVNGTVTASKIVANNAFLSQIGVNIIYDRAAALSSNPEGSYKMKIDLQNGYIHIR 688 T 0.27 fn3 pdb T Viruses T 7zlz 1 A,B,C A,B,C CP125_MYCTU CHOLEST-4-EN-3-ONE 26-MONOOXYGENASE,CHOLEST-4-EN-3-ONE C26-MONOOXYGENASE [(25S)-3-OXOCHOLEST-4-EN-26-OATE FORMING],CHOLESTEROL C26-MONOOXYGENASE,CHOLESTEROL C26-MONOOXYGENASE [(25S)-3BETA-HYDROXYCHOLEST-5-EN-26-OATE FORMING],CYTOCHROME P450 125,STEROID C27-MONOOXYGENASE NGPSPNLPPGFDFTDPAIYAERLPVAEFAELRSAAPIWWNGQDPGKGGGFHDGGFWAITKLNDVKEISRHSDVFSSYENGVIPRFKNDIAREDIEVQRFVMLNMDAPHHTRLRKIISRGFTPRAVGRLHDELQERAQKIAAEAAAAGSGDFVEQVSCELPLQAIAGLLGVPQEDRGKLFHWSNEMTGNEDPEYAHIDPKASSAELIGYAMKMAEEKAKNPADDIVTQLIQADIDGEKLSDDEFGFFVVMLAVAGNETTRNSITQGMMAFAEHPDQWELYKKVRPETAADEIVRWATPVTAFQRTALRDYELSGVQIKKGQRVVMFYRSANFDEEVFQDPFTFNILRNPNPHVGFGGTGAHYCIGANLARMTINLIFNAVADHMPDLKPISAPERLRSGWLNGIKHWQVDYTGRCPVAH 418 T 3.4000000000000003E-22 p450 unppssm F Bacteria T 7zm7 8 H 9 G0SG48_CHATD SUBUNIT NDUFS5 OF NADH-UBIQUINONE OXIDOREDUCTASE (COMPLEX I) MASGYGLNGGPSRCFPFWQELLACYVTNSSEDNPDGKNKCIPVMEDYYECLHHRKEAARVRALQAAYREAEAKKLQENPPTAGQIRNLGLLNKEEDTKKVHCATATQQWMKMCFALSAKMPCLRGIEVFLTTRPDNEKIPEYPHPEGTSARVICGQHLIRSPSLASNLSNSAITPPSSFSPRKANPLVSVYLPSVPGTPFTINYKISQVPPEPCKYLFFRLYINARPMVSWGIDPHSRPYGKVIKSLWLPSDDRYRGLVGFEKRSFVFLPGEEFKSVAEDGGLIEVQVFRAKDRRARTPKLETFRFRDNYGIAAPSIGLLDKPQNAFFYDWLLIDPKDEPFAKFRMHYRSWRNLKSLNLIPSSEWELLLAVSPKALRTAASTGKIEKPTSPAFSDSDSDDSLCSATDSDECVFDDHSKKTKSNRSKESPFAFLNSPPERFRAMAPSSEKLPQPSKLLRDSQRAPYQSRPLPELPVEAGVNSTGLNPPSVKVAADLRRKPSATSMESNAVSITPSLLRCMEEGTLDLEKAEVGIAKLVKVAASGSEPSSSSSSATQLTVSAVREVELRPPQPERKGGLPMDYSFSDYEKSSQSSFGDDERMSNISDMEEKPFPAPPTCYLPTTGSGLERELAMFDSPSPSPVPYSAMEPPVTPSPSSTPYKTKLGRKPLLFSRRLGLFSPRKSLPSDFLLGQAKKLVVEEETTNSNVSLAFGELTVTDMRAESPSPAPKGKPLRRFSTIRVEEISIKEKRPLFNSLRRIASASPRKLAGRVLSMDLGKKGGEEKEG 785 T 0.0017 COX6B unppssm F Eukaryota T 7zm7 30 EA Z G0SEF0_CHATD SUBUNIT NDUFA7 OF NADH-UBIQUINONE OXIDOREDUCTASE (COMPLEX I) MASKAAAAAASNAVSITKKYTVQSTGIWERIRRALVIDPNRSNGVPLNPYNRNPSPGDNPPLEYTDPVTIPAGDIADNPYWKRDFRRNYPRPSVIAQAQQVALLSVGSAAQPRVELIGEEGTKALVAAEEEGKEKGVAKYLEEKGAEEAKRVLALTGGLPPTPSGQTMVTGQWDVHKYGLAEEQSYGGSYPCRSFV 196 T 0.004 CI-B14_5a pdbhh F Eukaryota T 7zm7 34 IA d G0SEZ1_CHATD SUBUNIT NDUFB10 OF NADH-UBIQUINONE OXIDOREDUCTASE (COMPLEX I) MPTPESEAFLAKKPQVPPTFDGVDYEDNKRLKQAQDAIIREQWVQVMMGRLVREELSKCYYREGVNHLEKCGKLRERYLQLLANAKVKGYLFEQQNYWSKENQQQ 105 T 0.00014 NDUFB10 unppercent F Eukaryota T 7zm7 35 JA e SUBUNIT NDUFB2 OF NADH-UBIQUINONE OXIDOREDUCTASE (COMPLEX I) MAGGGQHVSRVHRFLATGLGASMWFWIFYRAKKDGPVLLGWKHPWD 46 T 0.00013 NADH_B2 pdb F T 7zm7 37 LA g G0RZZ2_CHATD SUBUNIT NDUFA3 OF NADH-UBIQUINONE OXIDOREDUCTASE (COMPLEX I) MSATTPRFWSTPLKYCRWAARERPALFFSVVIGALGPVTLATVPPLRRLIGDVDAAPIPLTYPIPPGPRKQLKGYDDDTEDN 82 T 0.11 NADHdh_A3 pdbpercent F Eukaryota T 7zm7 39 NA i G0S569_CHATD SUBUNIT NDUFB6 OF NADH-UBIQUINONE OXIDOREDUCTASE (COMPLEX I) MGGGPKIPYPKHVWSPAGGWYAQPANWKQNTAIFGLVIFGITAMVWKYSAEHEVRHKMPEPDRFYPSRYWVKQIKDYERAQKEKQQNNTEASS 93 T 0.0065 Elongin_A pdbpercent F Eukaryota T 7zm7 41 PA n G0S086_CHATD SUBUNIT NDUFB5 OF NADH-UBIQUINONE OXIDOREDUCTASE (COMPLEX I) MLALRQRAALLARRVRPTVVVPRNARTYASSHDHDHHDHHHDHGHNVEEPLGAAFYIAVGGIASSFVIYNISRPGPNGEPSSLHKWFSKISDYKDEWETRNTLMAAALEQAAHDKHLLLTAERSRHIELKYPEVFSHGSPFNVPAGFYPNLDHVIEHYRKQHLEEEERKAKKLAAAAAAASEAR 184 T 0.061 Glyco_transf_36 unppercent F Eukaryota T 7zm8 8 H 9 G0SG48_CHATD SUBUNIT NDUFS5 OF NADH-UBIQUINONE OXIDOREDUCTASE (COMPLEX I) MASGYGLNGGPSRCFPFWQELLACYVTNSSEDNPDGKNKCIPVMEDYYECLHHRKEAARVRALQAAYREAEAKKLQENPPTAGQIRNLGLLNKEEDTKKVHCATATQQWMKMCFALSAKMPCLRGIEVFLTTRPDNEKIPEYPHPEGTSARVICGQHLIRSPSLASNLSNSAITPPSSFSPRKANPLVSVYLPSVPGTPFTINYKISQVPPEPCKYLFFRLYINARPMVSWGIDPHSRPYGKVIKSLWLPSDDRYRGLVGFEKRSFVFLPGEEFKSVAEDGGLIEVQVFRAKDRRARTPKLETFRFRDNYGIAAPSIGLLDKPQNAFFYDWLLIDPKDEPFAKFRMHYRSWRNLKSLNLIPSSEWELLLAVSPKALRTAASTGKIEKPTSPAFSDSDSDDSLCSATDSDECVFDDHSKKTKSNRSKESPFAFLNSPPERFRAMAPSSEKLPQPSKLLRDSQRAPYQSRPLPELPVEAGVNSTGLNPPSVKVAADLRRKPSATSMESNAVSITPSLLRCMEEGTLDLEKAEVGIAKLVKVAASGSEPSSSSSSATQLTVSAVREVELRPPQPERKGGLPMDYSFSDYEKSSQSSFGDDERMSNISDMEEKPFPAPPTCYLPTTGSGLERELAMFDSPSPSPVPYSAMEPPVTPSPSSTPYKTKLGRKPLLFSRRLGLFSPRKSLPSDFLLGQAKKLVVEEETTNSNVSLAFGELTVTDMRAESPSPAPKGKPLRRFSTIRVEEISIKEKRPLFNSLRRIASASPRKLAGRVLSMDLGKKGGEEKEG 785 T 0.0017 COX6B unppssm F Eukaryota T 7zm8 21 U d G0SEZ1_CHATD SUBUNIT NDUFB10 OF NADH-UBIQUINONE OXIDOREDUCTASE (COMPLEX I) MPTPESEAFLAKKPQVPPTFDGVDYEDNKRLKQAQDAIIREQWVQVMMGRLVREELSKCYYREGVNHLEKCGKLRERYLQLLANAKVKGYLFEQQNYWSKENQQQ 105 T 0.00014 NDUFB10 unppercent F Eukaryota T 7zm8 22 V e SUBUNIT NDUFB2 OF NADH-UBIQUINONE OXIDOREDUCTASE (COMPLEX I) MAGGGQHVSRVHRFLATGLGASMWFWIFYRAKKDGPVLLGWKHPWD 46 T 0.00013 NADH_B2 pdb F T 7zm8 23 W g G0RZZ2_CHATD SUBUNIT NDUFA3 OF NADH-UBIQUINONE OXIDOREDUCTASE (COMPLEX I) MSATTPRFWSTPLKYCRWAARERPALFFSVVIGALGPVTLATVPPLRRLIGDVDAAPIPLTYPIPPGPRKQLKGYDDDTEDN 82 T 0.11 NADHdh_A3 pdbpercent F Eukaryota T 7zm8 24 X i G0S569_CHATD SUBUNIT NDUFB6 OF NADH-UBIQUINONE OXIDOREDUCTASE (COMPLEX I) MGGGPKIPYPKHVWSPAGGWYAQPANWKQNTAIFGLVIFGITAMVWKYSAEHEVRHKMPEPDRFYPSRYWVKQIKDYERAQKEKQQNNTEASS 93 T 0.0065 Elongin_A pdbpercent F Eukaryota T 7zm8 26 Z n G0S086_CHATD SUBUNIT NDUFB5 OF NADH-UBIQUINONE OXIDOREDUCTASE (COMPLEX I) MLALRQRAALLARRVRPTVVVPRNARTYASSHDHDHHDHHHDHGHNVEEPLGAAFYIAVGGIASSFVIYNISRPGPNGEPSSLHKWFSKISDYKDEWETRNTLMAAALEQAAHDKHLLLTAERSRHIELKYPEVFSHGSPFNVPAGFYPNLDHVIEHYRKQHLEEEERKAKKLAAAAAAASEAR 184 T 0.061 Glyco_transf_36 unppercent F Eukaryota T 7zmb 8 H 9 G0SG48_CHATD SUBUNIT NDUFS5 OF NADH-UBIQUINONE OXIDOREDUCTASE (COMPLEX I) MASGYGLNGGPSRCFPFWQELLACYVTNSSEDNPDGKNKCIPVMEDYYECLHHRKEAARVRALQAAYREAEAKKLQENPPTAGQIRNLGLLNKEEDTKKVHCATATQQWMKMCFALSAKMPCLRGIEVFLTTRPDNEKIPEYPHPEGTSARVICGQHLIRSPSLASNLSNSAITPPSSFSPRKANPLVSVYLPSVPGTPFTINYKISQVPPEPCKYLFFRLYINARPMVSWGIDPHSRPYGKVIKSLWLPSDDRYRGLVGFEKRSFVFLPGEEFKSVAEDGGLIEVQVFRAKDRRARTPKLETFRFRDNYGIAAPSIGLLDKPQNAFFYDWLLIDPKDEPFAKFRMHYRSWRNLKSLNLIPSSEWELLLAVSPKALRTAASTGKIEKPTSPAFSDSDSDDSLCSATDSDECVFDDHSKKTKSNRSKESPFAFLNSPPERFRAMAPSSEKLPQPSKLLRDSQRAPYQSRPLPELPVEAGVNSTGLNPPSVKVAADLRRKPSATSMESNAVSITPSLLRCMEEGTLDLEKAEVGIAKLVKVAASGSEPSSSSSSATQLTVSAVREVELRPPQPERKGGLPMDYSFSDYEKSSQSSFGDDERMSNISDMEEKPFPAPPTCYLPTTGSGLERELAMFDSPSPSPVPYSAMEPPVTPSPSSTPYKTKLGRKPLLFSRRLGLFSPRKSLPSDFLLGQAKKLVVEEETTNSNVSLAFGELTVTDMRAESPSPAPKGKPLRRFSTIRVEEISIKEKRPLFNSLRRIASASPRKLAGRVLSMDLGKKGGEEKEG 785 T 0.0017 COX6B unppssm F Eukaryota T 7zmb 30 EA Z G0SEF0_CHATD SUBUNIT NDUFA7 OF NADH-UBIQUINONE OXIDOREDUCTASE (COMPLEX I) MASKAAAAAASNAVSITKKYTVQSTGIWERIRRALVIDPNRSNGVPLNPYNRNPSPGDNPPLEYTDPVTIPAGDIADNPYWKRDFRRNYPRPSVIAQAQQVALLSVGSAAQPRVELIGEEGTKALVAAEEEGKEKGVAKYLEEKGAEEAKRVLALTGGLPPTPSGQTMVTGQWDVHKYGLAEEQSYGGSYPCRSFV 196 T 0.004 CI-B14_5a pdbhh F Eukaryota T 7zmb 34 IA d G0SEZ1_CHATD SUBUNIT NDUFB10 OF NADH-UBIQUINONE OXIDOREDUCTASE (COMPLEX I) MPTPESEAFLAKKPQVPPTFDGVDYEDNKRLKQAQDAIIREQWVQVMMGRLVREELSKCYYREGVNHLEKCGKLRERYLQLLANAKVKGYLFEQQNYWSKENQQQ 105 T 0.00014 NDUFB10 unppercent F Eukaryota T 7zmb 35 JA e SUBUNIT NDUFB2 OF NADH-UBIQUINONE OXIDOREDUCTASE (COMPLEX I) MAGGGQHVSRVHRFLATGLGASMWFWIFYRAKKDGPVLLGWKHPWD 46 T 0.00013 NADH_B2 pdb F T 7zmb 37 LA g G0RZZ2_CHATD SUBUNIT NDUFA3 OF NADH-UBIQUINONE OXIDOREDUCTASE (COMPLEX I) MSATTPRFWSTPLKYCRWAARERPALFFSVVIGALGPVTLATVPPLRRLIGDVDAAPIPLTYPIPPGPRKQLKGYDDDTEDN 82 T 0.11 NADHdh_A3 pdbpercent F Eukaryota T 7zmb 39 NA i G0S569_CHATD SUBUNIT NDUFB6 OF NADH-UBIQUINONE OXIDOREDUCTASE (COMPLEX I) MGGGPKIPYPKHVWSPAGGWYAQPANWKQNTAIFGLVIFGITAMVWKYSAEHEVRHKMPEPDRFYPSRYWVKQIKDYERAQKEKQQNNTEASS 93 T 0.0065 Elongin_A pdbpercent F Eukaryota T 7zmb 41 PA n G0S086_CHATD SUBUNIT NDUFB5 OF NADH-UBIQUINONE OXIDOREDUCTASE (COMPLEX I) MLALRQRAALLARRVRPTVVVPRNARTYASSHDHDHHDHHHDHGHNVEEPLGAAFYIAVGGIASSFVIYNISRPGPNGEPSSLHKWFSKISDYKDEWETRNTLMAAALEQAAHDKHLLLTAERSRHIELKYPEVFSHGSPFNVPAGFYPNLDHVIEHYRKQHLEEEERKAKKLAAAAAAASEAR 184 T 0.061 Glyco_transf_36 unppercent F Eukaryota T 7zme 8 H 9 G0SG48_CHATD SUBUNIT NDUFS5 OF NADH-UBIQUINONE OXIDOREDUCTASE (COMPLEX I) MASGYGLNGGPSRCFPFWQELLACYVTNSSEDNPDGKNKCIPVMEDYYECLHHRKEAARVRALQAAYREAEAKKLQENPPTAGQIRNLGLLNKEEDTKKVHCATATQQWMKMCFALSAKMPCLRGIEVFLTTRPDNEKIPEYPHPEGTSARVICGQHLIRSPSLASNLSNSAITPPSSFSPRKANPLVSVYLPSVPGTPFTINYKISQVPPEPCKYLFFRLYINARPMVSWGIDPHSRPYGKVIKSLWLPSDDRYRGLVGFEKRSFVFLPGEEFKSVAEDGGLIEVQVFRAKDRRARTPKLETFRFRDNYGIAAPSIGLLDKPQNAFFYDWLLIDPKDEPFAKFRMHYRSWRNLKSLNLIPSSEWELLLAVSPKALRTAASTGKIEKPTSPAFSDSDSDDSLCSATDSDECVFDDHSKKTKSNRSKESPFAFLNSPPERFRAMAPSSEKLPQPSKLLRDSQRAPYQSRPLPELPVEAGVNSTGLNPPSVKVAADLRRKPSATSMESNAVSITPSLLRCMEEGTLDLEKAEVGIAKLVKVAASGSEPSSSSSSATQLTVSAVREVELRPPQPERKGGLPMDYSFSDYEKSSQSSFGDDERMSNISDMEEKPFPAPPTCYLPTTGSGLERELAMFDSPSPSPVPYSAMEPPVTPSPSSTPYKTKLGRKPLLFSRRLGLFSPRKSLPSDFLLGQAKKLVVEEETTNSNVSLAFGELTVTDMRAESPSPAPKGKPLRRFSTIRVEEISIKEKRPLFNSLRRIASASPRKLAGRVLSMDLGKKGGEEKEG 785 T 0.0017 COX6B unppssm F Eukaryota T 7zme 21 U d G0SEZ1_CHATD SUBUNIT NDUFB10 OF NADH-UBIQUINONE OXIDOREDUCTASE (COMPLEX I) MPTPESEAFLAKKPQVPPTFDGVDYEDNKRLKQAQDAIIREQWVQVMMGRLVREELSKCYYREGVNHLEKCGKLRERYLQLLANAKVKGYLFEQQNYWSKENQQQ 105 T 0.00014 NDUFB10 unppercent F Eukaryota T 7zme 22 V e SUBUNIT NDUFB2 OF NADH-UBIQUINONE OXIDOREDUCTASE (COMPLEX I) MAGGGQHVSRVHRFLATGLGASMWFWIFYRAKKDGPVLLGWKHPWD 46 T 0.00013 NADH_B2 pdb F T 7zme 23 W g G0RZZ2_CHATD SUBUNIT NDUFA3 OF NADH-UBIQUINONE OXIDOREDUCTASE (COMPLEX I) MSATTPRFWSTPLKYCRWAARERPALFFSVVIGALGPVTLATVPPLRRLIGDVDAAPIPLTYPIPPGPRKQLKGYDDDTEDN 82 T 0.11 NADHdh_A3 pdbpercent F Eukaryota T 7zme 24 X i G0S569_CHATD SUBUNIT NDUFB6 OF NADH-UBIQUINONE OXIDOREDUCTASE (COMPLEX I) MGGGPKIPYPKHVWSPAGGWYAQPANWKQNTAIFGLVIFGITAMVWKYSAEHEVRHKMPEPDRFYPSRYWVKQIKDYERAQKEKQQNNTEASS 93 T 0.0065 Elongin_A pdbpercent F Eukaryota T 7zme 26 Z n G0S086_CHATD SUBUNIT NDUFB5 OF NADH-UBIQUINONE OXIDOREDUCTASE (COMPLEX I) MLALRQRAALLARRVRPTVVVPRNARTYASSHDHDHHDHHHDHGHNVEEPLGAAFYIAVGGIASSFVIYNISRPGPNGEPSSLHKWFSKISDYKDEWETRNTLMAAALEQAAHDKHLLLTAERSRHIELKYPEVFSHGSPFNVPAGFYPNLDHVIEHYRKQHLEEEERKAKKLAAAAAAASEAR 184 T 0.061 Glyco_transf_36 unppercent F Eukaryota T 7zmg 8 H 9 G0SG48_CHATD SUBUNIT NDUFS5 OF NADH-UBIQUINONE OXIDOREDUCTASE (COMPLEX I) MASGYGLNGGPSRCFPFWQELLACYVTNSSEDNPDGKNKCIPVMEDYYECLHHRKEAARVRALQAAYREAEAKKLQENPPTAGQIRNLGLLNKEEDTKKVHCATATQQWMKMCFALSAKMPCLRGIEVFLTTRPDNEKIPEYPHPEGTSARVICGQHLIRSPSLASNLSNSAITPPSSFSPRKANPLVSVYLPSVPGTPFTINYKISQVPPEPCKYLFFRLYINARPMVSWGIDPHSRPYGKVIKSLWLPSDDRYRGLVGFEKRSFVFLPGEEFKSVAEDGGLIEVQVFRAKDRRARTPKLETFRFRDNYGIAAPSIGLLDKPQNAFFYDWLLIDPKDEPFAKFRMHYRSWRNLKSLNLIPSSEWELLLAVSPKALRTAASTGKIEKPTSPAFSDSDSDDSLCSATDSDECVFDDHSKKTKSNRSKESPFAFLNSPPERFRAMAPSSEKLPQPSKLLRDSQRAPYQSRPLPELPVEAGVNSTGLNPPSVKVAADLRRKPSATSMESNAVSITPSLLRCMEEGTLDLEKAEVGIAKLVKVAASGSEPSSSSSSATQLTVSAVREVELRPPQPERKGGLPMDYSFSDYEKSSQSSFGDDERMSNISDMEEKPFPAPPTCYLPTTGSGLERELAMFDSPSPSPVPYSAMEPPVTPSPSSTPYKTKLGRKPLLFSRRLGLFSPRKSLPSDFLLGQAKKLVVEEETTNSNVSLAFGELTVTDMRAESPSPAPKGKPLRRFSTIRVEEISIKEKRPLFNSLRRIASASPRKLAGRVLSMDLGKKGGEEKEG 785 T 0.0017 COX6B unppssm F Eukaryota T 7zmg 30 EA Z G0SEF0_CHATD SUBUNIT NDUFA7 OF NADH-UBIQUINONE OXIDOREDUCTASE (COMPLEX I) MASKAAAAAASNAVSITKKYTVQSTGIWERIRRALVIDPNRSNGVPLNPYNRNPSPGDNPPLEYTDPVTIPAGDIADNPYWKRDFRRNYPRPSVIAQAQQVALLSVGSAAQPRVELIGEEGTKALVAAEEEGKEKGVAKYLEEKGAEEAKRVLALTGGLPPTPSGQTMVTGQWDVHKYGLAEEQSYGGSYPCRSFV 196 T 0.004 CI-B14_5a pdbhh F Eukaryota T 7zmg 34 IA d G0SEZ1_CHATD SUBUNIT NDUFB10 OF NADH-UBIQUINONE OXIDOREDUCTASE (COMPLEX I) MPTPESEAFLAKKPQVPPTFDGVDYEDNKRLKQAQDAIIREQWVQVMMGRLVREELSKCYYREGVNHLEKCGKLRERYLQLLANAKVKGYLFEQQNYWSKENQQQ 105 T 0.00014 NDUFB10 unppercent F Eukaryota T 7zmg 35 JA e SUBUNIT NDUFB2 OF NADH-UBIQUINONE OXIDOREDUCTASE (COMPLEX I) MAGGGQHVSRVHRFLATGLGASMWFWIFYRAKKDGPVLLGWKHPWD 46 T 0.00013 NADH_B2 pdb F T 7zmg 37 LA g G0RZZ2_CHATD SUBUNIT NDUFA3 OF NADH-UBIQUINONE OXIDOREDUCTASE (COMPLEX I) MSATTPRFWSTPLKYCRWAARERPALFFSVVIGALGPVTLATVPPLRRLIGDVDAAPIPLTYPIPPGPRKQLKGYDDDTEDN 82 T 0.11 NADHdh_A3 pdbpercent F Eukaryota T 7zmg 39 NA i G0S569_CHATD SUBUNIT NDUFB6 OF NADH-UBIQUINONE OXIDOREDUCTASE (COMPLEX I) MGGGPKIPYPKHVWSPAGGWYAQPANWKQNTAIFGLVIFGITAMVWKYSAEHEVRHKMPEPDRFYPSRYWVKQIKDYERAQKEKQQNNTEASS 93 T 0.0065 Elongin_A pdbpercent F Eukaryota T 7zmg 41 PA n G0S086_CHATD SUBUNIT NDUFB5 OF NADH-UBIQUINONE OXIDOREDUCTASE (COMPLEX I) MLALRQRAALLARRVRPTVVVPRNARTYASSHDHDHHDHHHDHGHNVEEPLGAAFYIAVGGIASSFVIYNISRPGPNGEPSSLHKWFSKISDYKDEWETRNTLMAAALEQAAHDKHLLLTAERSRHIELKYPEVFSHGSPFNVPAGFYPNLDHVIEHYRKQHLEEEERKAKKLAAAAAAASEAR 184 T 0.061 Glyco_transf_36 unppercent F Eukaryota T 7zmh 8 H 9 G0SG48_CHATD SUBUNIT NDUFS5 OF NADH-UBIQUINONE OXIDOREDUCTASE (COMPLEX I) MASGYGLNGGPSRCFPFWQELLACYVTNSSEDNPDGKNKCIPVMEDYYECLHHRKEAARVRALQAAYREAEAKKLQENPPTAGQIRNLGLLNKEEDTKKVHCATATQQWMKMCFALSAKMPCLRGIEVFLTTRPDNEKIPEYPHPEGTSARVICGQHLIRSPSLASNLSNSAITPPSSFSPRKANPLVSVYLPSVPGTPFTINYKISQVPPEPCKYLFFRLYINARPMVSWGIDPHSRPYGKVIKSLWLPSDDRYRGLVGFEKRSFVFLPGEEFKSVAEDGGLIEVQVFRAKDRRARTPKLETFRFRDNYGIAAPSIGLLDKPQNAFFYDWLLIDPKDEPFAKFRMHYRSWRNLKSLNLIPSSEWELLLAVSPKALRTAASTGKIEKPTSPAFSDSDSDDSLCSATDSDECVFDDHSKKTKSNRSKESPFAFLNSPPERFRAMAPSSEKLPQPSKLLRDSQRAPYQSRPLPELPVEAGVNSTGLNPPSVKVAADLRRKPSATSMESNAVSITPSLLRCMEEGTLDLEKAEVGIAKLVKVAASGSEPSSSSSSATQLTVSAVREVELRPPQPERKGGLPMDYSFSDYEKSSQSSFGDDERMSNISDMEEKPFPAPPTCYLPTTGSGLERELAMFDSPSPSPVPYSAMEPPVTPSPSSTPYKTKLGRKPLLFSRRLGLFSPRKSLPSDFLLGQAKKLVVEEETTNSNVSLAFGELTVTDMRAESPSPAPKGKPLRRFSTIRVEEISIKEKRPLFNSLRRIASASPRKLAGRVLSMDLGKKGGEEKEG 785 T 0.0017 COX6B unppssm F Eukaryota T 7zmh 21 U d G0SEZ1_CHATD SUBUNIT NDUFB10 OF NADH-UBIQUINONE OXIDOREDUCTASE (COMPLEX I) MPTPESEAFLAKKPQVPPTFDGVDYEDNKRLKQAQDAIIREQWVQVMMGRLVREELSKCYYREGVNHLEKCGKLRERYLQLLANAKVKGYLFEQQNYWSKENQQQ 105 T 0.00014 NDUFB10 unppercent F Eukaryota T 7zmh 22 V e SUBUNIT NDUFB2 OF NADH-UBIQUINONE OXIDOREDUCTASE (COMPLEX I) MAGGGQHVSRVHRFLATGLGASMWFWIFYRAKKDGPVLLGWKHPWD 46 T 0.00013 NADH_B2 pdb F T 7zmh 23 W g G0RZZ2_CHATD SUBUNIT NDUFA3 OF NADH-UBIQUINONE OXIDOREDUCTASE (COMPLEX I) MSATTPRFWSTPLKYCRWAARERPALFFSVVIGALGPVTLATVPPLRRLIGDVDAAPIPLTYPIPPGPRKQLKGYDDDTEDN 82 T 0.11 NADHdh_A3 pdbpercent F Eukaryota T 7zmh 24 X i G0S569_CHATD SUBUNIT NDUFB6 OF NADH-UBIQUINONE OXIDOREDUCTASE (COMPLEX I) MGGGPKIPYPKHVWSPAGGWYAQPANWKQNTAIFGLVIFGITAMVWKYSAEHEVRHKMPEPDRFYPSRYWVKQIKDYERAQKEKQQNNTEASS 93 T 0.0065 Elongin_A pdbpercent F Eukaryota T 7zmh 26 Z n G0S086_CHATD SUBUNIT NDUFB5 OF NADH-UBIQUINONE OXIDOREDUCTASE (COMPLEX I) MLALRQRAALLARRVRPTVVVPRNARTYASSHDHDHHDHHHDHGHNVEEPLGAAFYIAVGGIASSFVIYNISRPGPNGEPSSLHKWFSKISDYKDEWETRNTLMAAALEQAAHDKHLLLTAERSRHIELKYPEVFSHGSPFNVPAGFYPNLDHVIEHYRKQHLEEEERKAKKLAAAAAAASEAR 184 T 0.061 Glyco_transf_36 unppercent F Eukaryota T 7zmu 2 B B non-natural peptide 2 SXXXXKX 7 T 100 EF-hand_5 pdbhh F F 7zmw 2 B B non-natural peptide 1 XSXXXXKX 8 T 120 DUF3890 pdbhh F F 7zn2 2 B,C,D j,h,i Q7Y5E2_BPT5Z Pore-forming tail tip protein pb2 MTDKLIRELLIDVKQKGATRTAKSIENVSDALENAAAASELTNEQLGKMPRTLYSIERAADRAAKSLTKMQASRGMAGITKSIDGIGDKLDYLAIQLIEVTDKLEIGFDGVSRSVKAMGNDVAAATEKVQDRLYDTNRALGGTSKGFNDTAGAAGRASRALGNTSGSARGATRDFAAMAKIGGRLPIMYAALASNVFVLQTAFESLKVGDQLNRLEQFGTIVGTMTGTPVQTLALSLQNATNGAISFEEAMRQASSASAYGFDSEQLEQFGLVARRAAAVLGVDMTDALNRVIKGVSKQEIELLDELGVTIRLNDAYENYVKQLNATSTGIKYTVDSLTTYQKQQAYANEVIAESTRRFGYLDDALKATSWEQFAANANSALRSLQQSAATYLNPVMDTLNTFLYQTKSSQMRVSAMARSASAKTTPAENVTALIENAVGAREDLDTYLKESEERVKKAQELKQQLDDLKAKQAATAPIANALTAGGIGGDESNKLVVQLTNELARQNKEIEERTKTEKVLRQAVQDTGEALLRNGKLAEQLGAKMKYADTAVPGDKGVFEVDPNNLKAVSEIQKNFDFLKKSSSDTANNIRMAASSITNAKKASSDLNSVVKAVEDTSKVTGQSADTLVKNLNLGFSSLDQMKAAQKGLSEYVTAMDKSEQNALEVAKRKDEVYNQTKDKAKAEAAAREVLLRQQQEQLTAAKALLAINPNDPEALKQVAKIETEILNTKAQGFENAKKTKDYTDKILGVDREIALLNDRTMTSTQYRLAQLRLELQLEQEKTELYSKQADGQAKVEQSRRAQAQISREIWEAEKQGTASHVSALMDALEVSQTQRNVTGQSQILTERLSILQQQLELSKGNTEEELKYRNEIYKTSAALEQLKKQRESQMQQQVGSSVGATYTSTTGLIGEDKDFADMQNRMASYDQAISKLSELNSEATAVAQSMGNLTNAMIQFSQGSLDTTSMIASGMQTVASMIQYSTSQQVSAIDQAIAAEQKRDGKSEASKAKLKKLEAEKLKIQQDAAKKQIIIQTAVAVMQAATAVPYPFSIPLMVAAGLAGALALAQASSASGMSSIADSGADTTQYLTLGERQKNVDVSMQASSGELSYLRGDKGIGNANSFVPRAEGGMMYPGVSYQMGEHGTEVVTPMVPMKATPNDQLSDGSKTTSGRPIILNISTMDAASFRDFASNNSTAFRDAVELALNENGTTLKSLGNS 1219 T 0.11 Asp-Al_Ex unppssm T Viruses T 7zn2 3 F,G,H f,e,g FIBC_BPT5 PB4,TAIL PROTEIN PB4 MISNNAPAKMVLNSVLTGYTLAYIQHSIYSDYDVIGRSFWLKEGSNVTRRDFTGIDTFSVTINNLKPTTTYEVQGAFYDSIIDSELLNAQIGINLSDKQTFKMKSAPRITGARCESEPVDVGVGAPIVYIDTTGEADYCTIELKDNSNANNPWVKYYVGALMPTIMFGGVPIGSYKVRISGQISLPDGVTIDSSGYYEYPNVFEVRYNFVPPAAPINIVFKAARIADGKERYDLRVQWDWNRGAGANVREFVLSYIDSAEFVRTGWTKAQKINVGAAQSATIISFPWKVEHKFKVSSIAWGPDAQDVTDSAVQTFILNESTPLDNSFVNETGIEVNYAYIKGKIKDGSTWKQTFLIDAATGAINIGLLDAEGKAPISFDPVKKIVNVDGSVITKTINAANFVMTNLTGQDNPAIYTQGKTWGDTKSGIWMGMDNVTAKPKLDIGNATQYIRYDGNILRISSEVVIGTPNGDIDIQTGIQGKQTVFIYIIGTSLPAKPTSPAYPPSGWSKTPPNRTSNTQNIYCSTGTLDPVTNQLVSGTSWSDVVQWSGTEGVDGRPGATGQRGPGMYSLAIANLTAWNDSQANSFFTSNFGSGPVKYDVLTEYKSGAPGTAFTRQWNGSAWTSPAMVLHGDMIVNGTVTASKIVANNAFLSQIGVNIIYDRAAALSSNPEGSYKMKIDLQNGYIHIR 688 T 0.27 fn3 pdb T Viruses T 7zn2 4 BA,CA,GA,HA,J,M,N,P,T,U,X,Y U,K,P,T,R,Q,S,N,M,O,J,L FIBL2_BPT5 L-shaped tail fiber protein p132 MSTENRVIDLVVDENVPYGLLMQFMDVDDSVYPSTSKPVDLTDFSLRGSIKSSLEDGAETVASFTTAIVDAAQGVASISLPVSAVTTIASKASKERDRYNPRQRLAGYYDVIITRTAVGSAASSFRIMEGKVYISDGVTQ 140 T 3.3E-05 BppU_N pdbhh T Viruses T 7zn2 5 K,Q,Z I,H,G TAIL1_BPT5 TAIL PROTEIN P140 MFYSLMRESKIVIEYDGRGYHFDALSNYDASTSFQEFKTLRRTIHNRTNYADSIINAQDPSSISLAINFSTTLIESNFFDWMGFTREGNSLFLPRNTPNIEPIMFNMYIINHNNSCIYFENCYVSTVDFSLDKSIPILNVGIESGKFSEVSTFRDGYTITQGEVLPYSAPAVYTNSSPLPALISASMSFQQQCSWREDRNIFDINKIYTNKRAYVNEMNASATLAFYYVKRLVGDKFLNLDPETRTPLIIKNKYVSITFPLARISKRLNFSDLYQVEYDVIPTADSDPVEINFFGERK 298 T 0.023 DUF4965 pdbpssm T Viruses T 7zn4 2 C,D,E f,e,g FIBC_BPT5 TAIL PROTEIN PB4 MISNNAPAKMVLNSVLTGYTLAYIQHSIYSDYDVIGRSFWLKEGSNVTRRDFTGIDTFSVTINNLKPTTTYEVQGAFYDSIIDSELLNAQIGINLSDKQTFKMKSAPRITGARCESEPVDVGVGAPIVYIDTTGEADYCTIELKDNSNANNPWVKYYVGALMPTIMFGGVPIGSYKVRISGQISLPDGVTIDSSGYYEYPNVFEVRYNFVPPAAPINIVFKAARIADGKERYDLRVQWDWNRGAGANVREFVLSYIDSAEFVRTGWTKAQKINVGAAQSATIISFPWKVEHKFKVSSIAWGPDAQDVTDSAVQTFILNESTPLDNSFVNETGIEVNYAYIKGKIKDGSTWKQTFLIDAATGAINIGLLDAEGKAPISFDPVKKIVNVDGSVITKTINAANFVMTNLTGQDNPAIYTQGKTWGDTKSGIWMGMDNVTAKPKLDIGNATQYIRYDGNILRISSEVVIGTPNGDIDIQTGIQGKQTVFIYIIGTSLPAKPTSPAYPPSGWSKTPPNRTSNTQNIYCSTGTLDPVTNQLVSGTSWSDVVQWSGTEGVDGRPGATGQRGPGMYSLAIANLTAWNDSQANSFFTSNFGSGPVKYDVLTEYKSGAPGTAFTRQWNGSAWTSPAMVLHGDMIVNGTVTASKIVANNAFLSQIGVNIIYDRAAALSSNPEGSYKMKIDLQNGYIHIR 688 T 0.27 fn3 pdb T Viruses T 7znz 1 A A B2UR61_AKKM8 FucOB, a GH95 family alpha-1,2-fucosidase KPSASNLIWSDEPAVVVYPQEDKNSEGSFGKYRKPASVWEAEGYPIGNGRVGAMIFSAPGRERLALNEISLWSGGANPGGGYGYGPDAGTNQFGNYLPFGDLFVDFKKGDQPASLSVEDFTRSLDLRDGIHKVNYKADGVTYDREAFSSTPANVLVLNYKASKPGQFSADFSVNSQLGADISAKGSVITWKGMLKNGMNYEGRVLIRPKGGTLSASGDKISVKNADSCMVVIAMETDYLMDYKKDWKGESPSRKLDRYAAKAASADYAALKQAHISQYKSMFDRVKVNFGKTEEDVAKLPTPKRLEAYKKNPADPDLEETMFQFGRYLLLSSSRPGTLPANLQGLWNDYVKPPWACDYHNNINVQMAYWGAEPANLSECHEALVNYVEAMAPGCRDASQANKGFNTKDGKPVRGWTVRTSQNIFGGNGWQWNIPGAAWYALHIWEHYAFTGDRKYLEKQAYPLMKEICHFWEDHLKELGAGGEGFKTNGKDPSEEEKKDLADVKAGTLVAPNGWSPEHGPREDGVMHDQQLIAELFSNTIKAARILGKDAAWAKSLEGKLKRLAGNKIGKEGNLQEWMIDRIPKTDHRHTSHLFAVFPGNQISKLKTPKLAEAARLSLEWRGTTGDSRRSWTWPWRTALWARLGEGNKAHEMVQGLLKFNTLPNMLTTHPPMQMDGNFGIVGGICEMLVQSHAGGLDIMPSPVEAWPEGSVKGLKARGNVTVDFSWKDGKVSNVKLYSAQPKVLPVRVNGKMTRMKTLPLK 761 T 4.8E-18 Glyco_hyd_65N_2 pdbpercent F Bacteria T 7zo0 1 A A B2UR61_AKKM8 GH95 family alpha-1,2-fucosidase MHHHHHHENLYFQGSGADKPSASNLIWSDEPAVVVYPQEDKNSEGSFGKYRKPASVWEAEGYPIGNGRVGAMIFSAPGRERLALNEISLWSGGANPGGGYGYGPDAGTNQFGNYLPFGDLFVDFKKGDQPASLSVEDFTRSLDLRDGIHKVNYKADGVTYDREAFSSTPANVLVLNYKASKPGQFSADFSVNSQLGADISAKGSVITWKGMLKNGMNYEGRVLIRPKGGTLSASGDKISVKNADSCMVVIAMETDYLMDYKKDWKGESPSRKLDRYAAKAASADYAALKQAHISQYKSMFDRVKVNFGKTEEDVAKLPTPKRLEAYKKNPADPDLEETMFQFGRYLLLSSSRPGTLPANLQGLWNDYVKPPWACDYHNNINVQMAYWGAEPANLSECHEALVNYVEAMAPGCRDASQANKGFNTKDGKPVRGWTVRTSQNIFGGNGWQWNIPGAAWYALHIWEHYAFTGDRKYLEKQAYPLMKEICHFWEDHLKELGAGGEGFKTNGKDPSEEEKKDLADVKAGTLVAPNGWSPAHGPREDGVMHDQQLIAELFSNTIKAARILGKDAAWAKSLEGKLKRLAGNKIGKEGNLQEWMIDRIPKTDHRHTSHLFAVFPGNQISKLKTPKLAEAARLSLEWRGTTGDSRRSWTWPWRTALWARLGEGNKAHEMVQGLLKFNTLPNMLTTHPPMQMDGNFGIVGGICEMLVQSHAGGLDIMPSPVEAWPEGSVKGLKARGNVTVDFSWKDGKVSNVKLYSAQPKVLPVRVNGKMTRMKTLPLKSGAGSSQPAAR 790 T 1.3E-18 Glyco_hyd_65N_2 pdbpercent F Bacteria T 7zol 3 C A A0A975BRS1_9BACT TPR-CHAT MSSAFSGLKIPELSVDPAEVFKSDNPQLVSVLLDEFELQEQRPFFSGLIPEKQINIALKKSPQLKKLACHLLEAYEINGRRWKHADRRRVLEKAIRLLEKVSNELKGDIQKLENNVKESGKDSEELNKTREKHGEILADMGRAYLHRAKII 151 T 0.0032 TFR_dimer pdb F Bacteria T 7zpq 81 CC CB CUE3_YEAST CUE DOMAIN-CONTAINING PROTEIN 3,COUPLING OF UBIQUITIN CONJUGATION TO ER DEGRADATION PROTEIN 3 MLSRYNRVIEINGGNADISLPIVKFPPFKLRAQLIEKDPVVWLHLIETYVTYFEYLMQGANVELLDESTLDHLRLFLRTYLHEIADEEGKLLSLGINHDVSEQLYLLKGWIFSLIKKCGLLHLQIFGDSLWNLIKVYVRRNPDSIRGLIDGSLKPRINTQRVQLDKSYQVQQHLKQLIESGKFKRIDLRCVEDLLSAKSMQPNKFAENFFTANWIEILEALWAKGQGRGHKEARELIIISLFSVSADRLLKITKELGISNFETLALYPLLGTMLINEGVHKRLPDLKSKLLFLNLGG 297 T 0.21 DUF4919 pdbpercent F Eukaryota T 7zpy 2 B B Peptide inhibitor (ASP-TYR-ASN-PRO-TYR-LEU-LEU-TYR-LEU-LYS) DYNPYLLYLK 10 T 3.2 Flu_PB1 pdbhh F T 7zq6 32 FA z Nascent chain AGP 3 T 160 DUF2894 pdbhh F F 7zqb 1 A,B,C i,h,j FIBC_BPT5 TAIL PROTEIN PB4 MISNNAPAKMVLNSVLTGYTLAYIQHSIYSDYDVIGRSFWLKEGSNVTRRDFTGIDTFSVTINNLKPTTTYEVQGAFYDSIIDSELLNAQIGINLSDKQTFKMKSAPRITGARCESEPVDVGVGAPIVYIDTTGEADYCTIELKDNSNANNPWVKYYVGALMPTIMFGGVPIGSYKVRISGQISLPDGVTIDSSGYYEYPNVFEVRYNFVPPAAPINIVFKAARIADGKERYDLRVQWDWNRGAGANVREFVLSYIDSAEFVRTGWTKAQKINVGAAQSATIISFPWKVEHKFKVSSIAWGPDAQDVTDSAVQTFILNESTPLDNSFVNETGIEVNYAYIKGKIKDGSTWKQTFLIDAATGAINIGLLDAEGKAPISFDPVKKIVNVDGSVITKTINAANFVMTNLTGQDNPAIYTQGKTWGDTKSGIWMGMDNVTAKPKLDIGNATQYIRYDGNILRISSEVVIGTPNGDIDIQTGIQGKQTVFIYIIGTSLPAKPTSPAYPPSGWSKTPPNRTSNTQNIYCSTGTLDPVTNQLVSGTSWSDVVQWSGTEGVDGRPGATGQRGPGMYSLAIANLTAWNDSQANSFFTSNFGSGPVKYDVLTEYKSGAPGTAFTRQWNGSAWTSPAMVLHGDMIVNGTVTASKIVANNAFLSQIGVNIIYDRAAALSSNPEGSYKMKIDLQNGYIHIR 688 T 0.27 fn3 pdb T Viruses T 7zqb 2 AA,BA,D,G,H,J,N,O,R,S,V,W P,T,R,Q,S,N,M,O,J,L,U,K FIBL2_BPT5 L-shaped tail fiber protein p132 MSTENRVIDLVVDENVPYGLLMQFMDVDDSVYPSTSKPVDLTDFSLRGSIKSSLEDGAETVASFTTAIVDAAQGVASISLPVSAVTTIASKASKERDRYNPRQRLAGYYDVIITRTAVGSAASSFRIMEGKVYISDGVTQ 140 T 3.3E-05 BppU_N pdbhh T Viruses T 7zqb 3 E,K,T I,H,G TAIL1_BPT5 TAIL PROTEIN P140 MFYSLMRESKIVIEYDGRGYHFDALSNYDASTSFQEFKTLRRTIHNRTNYADSIINAQDPSSISLAINFSTTLIESNFFDWMGFTREGNSLFLPRNTPNIEPIMFNMYIINHNNSCIYFENCYVSTVDFSLDKSIPILNVGIESGKFSEVSTFRDGYTITQGEVLPYSAPAVYTNSSPLPALISASMSFQQQCSWREDRNIFDINKIYTNKRAYVNEMNASATLAFYYVKRLVGDKFLNLDPETRTPLIIKNKYVSITFPLARISKRLNFSDLYQVEYDVIPTADSDPVEINFFGERK 298 T 0.023 DUF4965 pdbpssm T Viruses T 7zqb 6 CA,DA,EA f,g,e Q7Y5E2_BPT5Z Pore-forming tail tip protein pb2 MTDKLIRELLIDVKQKGATRTAKSIENVSDALENAAAASELTNEQLGKMPRTLYSIERAADRAAKSLTKMQASRGMAGITKSIDGIGDKLDYLAIQLIEVTDKLEIGFDGVSRSVKAMGNDVAAATEKVQDRLYDTNRALGGTSKGFNDTAGAAGRASRALGNTSGSARGATRDFAAMAKIGGRLPIMYAALASNVFVLQTAFESLKVGDQLNRLEQFGTIVGTMTGTPVQTLALSLQNATNGAISFEEAMRQASSASAYGFDSEQLEQFGLVARRAAAVLGVDMTDALNRVIKGVSKQEIELLDELGVTIRLNDAYENYVKQLNATSTGIKYTVDSLTTYQKQQAYANEVIAESTRRFGYLDDALKATSWEQFAANANSALRSLQQSAATYLNPVMDTLNTFLYQTKSSQMRVSAMARSASAKTTPAENVTALIENAVGAREDLDTYLKESEERVKKAQELKQQLDDLKAKQAATAPIANALTAGGIGGDESNKLVVQLTNELARQNKEIEERTKTEKVLRQAVQDTGEALLRNGKLAEQLGAKMKYADTAVPGDKGVFEVDPNNLKAVSEIQKNFDFLKKSSSDTANNIRMAASSITNAKKASSDLNSVVKAVEDTSKVTGQSADTLVKNLNLGFSSLDQMKAAQKGLSEYVTAMDKSEQNALEVAKRKDEVYNQTKDKAKAEAAAREVLLRQQQEQLTAAKALLAINPNDPEALKQVAKIETEILNTKAQGFENAKKTKDYTDKILGVDREIALLNDRTMTSTQYRLAQLRLELQLEQEKTELYSKQADGQAKVEQSRRAQAQISREIWEAEKQGTASHVSALMDALEVSQTQRNVTGQSQILTERLSILQQQLELSKGNTEEELKYRNEIYKTSAALEQLKKQRESQMQQQVGSSVGATYTSTTGLIGEDKDFADMQNRMASYDQAISKLSELNSEATAVAQSMGNLTNAMIQFSQGSLDTTSMIASGMQTVASMIQYSTSQQVSAIDQAIAAEQKRDGKSEASKAKLKKLEAEKLKIQQDAAKKQIIIQTAVAVMQAATAVPYPFSIPLMVAAGLAGALALAQASSASGMSSIADSGADTTQYLTLGERQKNVDVSMQASSGELSYLRGDKGIGNANSFVPRAEGGMMYPGVSYQMGEHGTEVVTPMVPMKATPNDQLSDGSKTTSGRPIILNISTMDAASFRDFASNNSTAFRDAVELALNENGTTLKSLGNS 1219 T 0.11 Asp-Al_Ex unppssm T Viruses T 7zqj 3 C F Y3403_MYCTU Uncharacterized protein Rv3403c MTMITPPTF 9 T 0.47 Thi4 unppercent F Bacteria F 7zqp 2 B,C,D j,h,i TMP_BPT5 TAIL PROTEIN PB2 MTDKLIRELLIDVKQKGATRTAKSIENVSDALENAAAASELTNEQLGKMPRTLYSIERAADRAAKSLTKMQASRGMAGITKSIDGIGDKLDYLAIQLIEVTDKLEIGFDGVSRSVKAMGNDVAAATEKVQDRLYDTNRALGGTSKGFNDTAGAAGRASRALGNTSGSARGATRDFAAMAKIGGRLPIMYAALASNVFVLQTAFESLKVGDQLNRLEQFGTIVGTMTGTPVQTLALSLQNATNGAISFEEAMRQASSASAYGFDSEQLEQFGLVARRAAAVLGVDMTDALNRVIKGVSKQEIELLDELGVTIRLNDAYENYVKQLNATSTGIKYTVDSLTTYQKQQAYANEVIAESTRRFGYLDDALKATSWEQFAANANSALRSLQQSAATYLNPVMDTLNTFLYQTKSSQMRVSAMARSASAKTTPAENVTALIENAVGAREDLDTYLKESEERVKKAQELKQQLDDLKAKQAATAPIANALTAGGIGGDESNKLVVQLTNELARQNKEIEERTKTEKVLRQAVQDTGEALLRNGKLAEQLGAKMKYADTAVPGDKGVFEVDPNNLKAVSEIQKNFDFLKKSSSDTANNIRMAASSITNAKKASSDLNSVVKAVEDTSKVTGQSADTLVKNLNLGFSSLDQMKAAQKGLSEYVTAMDKSEQNALEVAKRKDEVYNQTKDKAKAEAAAREVLLRQQQEQLTAAKALLAINPNDPEALKQVAKIETEILNTKAQGFENAKKTKDYTDKILGVDREIALLNDRTMTSTQYRLAQLRLELQLEQEKTELYSKQADGQAKVEQSRRAQAQISREIWEAEKQGTASHVSALMDALEVSQTQRNVTGQSQILTERLSILQQQLELSKGNTEEELKYRNEIYKTSAALEQLKKQRESQMQQQVGSSVGATYTPTTGLIGEDKDFADMQNRMASYDQAISKLSELNSEATAVAQSMGNLTNAMIQFSQGSLDTTSMIASGMQTVASMIQYSTSQQVSAIDQAIAAEQKRDGKSEASKAKLKKLEAEKLKIQQDAAKKQIIIQTAVAVMQAATAVPYPFSIPLMVAAGLAGALALAQASSASGMSSIADSGADTTQYLTLGERQKNVDVSMQASSGELSYLRGDKGIGNANSFVPRAEGGMMYPGVSYQMGEHGTEVVTPMVPMKATPNDQLSDGSKTTSGRPIILNISTMDAASFRDFASNNSTAFRDAVELALNENGTTLKSLGNS 1219 T 0.083 Asp-Al_Ex pdbpssm T Viruses T 7zqr 1 A,B,C A,B,C CP125_MYCTU CHOLEST-4-EN-3-ONE 26-MONOOXYGENASE,CHOLEST-4-EN-3-ONE C26-MONOOXYGENASE [(25S)-3-OXOCHOLEST-4-EN-26-OATE FORMING],CHOLESTEROL C26-MONOOXYGENASE,CHOLESTEROL C26-MONOOXYGENASE [(25S)-3BETA-HYDROXYCHOLEST-5-EN-26-OATE FORMING],CYTOCHROME P450 125,STEROID C27-MONOOXYGENASE NGPSPNLPPGFDFTDPAIYAERLPVAEFAELRSAAPIWWNGQDPGKGGGFHDGGFWAITKLNDVKEISRHSDVFSSYENGVIPRFKNDIAREDIEVQRFVMLNMDAPHHTRLRKIISRGFTPRAVGRLHDELQERAQKIAAEAAAAGSGDFVEQVSCELPLQAIAGLLGVPQEDRGKLFHWSNEMTGNEDPEYAHIDPKASSAELIGYAMKMAEEKAKNPADDIVTQLIQADIDGEKLSDDEFGFFVVMLAVAGNETTRNSITQGMMAFAEHPDQWELYKKVRPETAADEIVRWATPVTAFQRTALRDYELSGVQIKKGQRVVMFYRSANFDEEVFQDPFTFNILRNPNPHVGFGGTGAHYCIGANLARMTINLIFNAVADHMPDLKPISAPERLRSGWLNGIKHWQVDYTGRCPVAH 418 T 3.4000000000000003E-22 p450 unppssm F Bacteria T 7zrp 2 B,D B,D KCC2D_HUMAN CAM KINASE II SUBUNIT DELTA,CAMK-II SUBUNIT DELTA FNARRKLKGAILTTMLATRNFS 22 T 0.11 SBP_bac_3 unppercent F Eukaryota T 7zrq 2 B B KCC2D_HUMAN CAM KINASE II SUBUNIT DELTA,CAMK-II SUBUNIT DELTA FNARRKLKGAILTTMLATRNFS 22 T 0.11 SBP_bac_3 unppercent F Eukaryota T 7zrs 81 CC CB CUE3_YEAST CUE DOMAIN-CONTAINING PROTEIN 3,COUPLING OF UBIQUITIN CONJUGATION TO ER DEGRADATION PROTEIN 3 MLSRYNRVIEINGGNADISLPIVKFPPFKLRAQLIEKDPVVWLHLIETYVTYFEYLMQGANVELLDESTLDHLRLFLRTYLHEIADEEGKLLSLGINHDVSEQLYLLKGWIFSLIKKCGLLHLQIFGDSLWNLIKVYVRRNPDSIRGLIDGSLKPRINTQRVQLDKSYQVQQHLKQLIESGKFKRIDLRCVEDLLSAKSMQPNKFAENFFTANWIEILEALWAKGQGRGHKEARELIIISLFSVSADRLLKITKELGISNFETLALYPLLGTMLINEGVHKRLPDLKSKLLFLNLGG 297 T 0.21 DUF4919 pdbpercent F Eukaryota T 7zru 1 A A KKX29_PANIM POTASSIUM CHANNEL-BLOCKING TOXIN 6,PI6 VDACYEACMHHHMNSDDCIEACKNPVPP 28 T 0.048 Thionin pdb F Eukaryota T 7zrv 2 D,E E,F de novo designed binder ETGASSTNMLEALQQRLQFYHGQVARAALENNSGKARRFGRIVKQYEDAIKLYKAGKPVPYDELPVPPGFGGSENLYFQ 79 T 3.7 DUF5327 pdbhh F T 7zs9 16 P Q T2FA_YEAST TFIIF-ALPHA,TFIIF LARGE SUBUNIT,TRANSCRIPTION FACTOR G 105 KDA SUBUNIT,P105 GPGMSRRNPPGSRNGGGPTNASPFIKRDRMRRNFLRMRMGQNGSNSSSPGVPNGDNSRGSLVKKDDPEYAEEREKMLLQIGVEADAGRSNVKVKDEDPNEYNEFPLRAIPKEDLENMRTHLLKFQSKKKINPVTDFHLPVRLHRKDTRNLQFQLTRAEIVQRQKEISEYKKKAEQERSTPNSGGMNKSGTVSLNNTVKDGSQTPTVDSVTKDNTANGVNSSIPTVTGSSVPPASPTTVSAVESNGLSNGSTSAANGLDGNASTANLANGRPLVTKLEDAGPAEDPTKVGMVKYDGKEVTNEPEFEEGTMDPLADVAPDGGGRAKRGNLRRKTRQLKVLDENAKKLRFEEFYPWVMEDFDGYNTWVGSYEAGNSDSYVLLSVEDDGSFTMIPADKVYKFTARNKYATLTIDEAEKRMDKKSGEVPRWLMKHLDNIGTTTTRYDRTRRKLKAVADQQAMDEDDRDDNSEVELDYDEEFADDEEAPIIDGNEQENKESEQRIKKEMLQANAMGLRDEEAPSENEEDELFGEKKIDEDGERIKKALQKTELAALYSSDENEINPYLSESDIENKENESPVKKEEDSDTLSKSKRSSPKKQQKKATNAHVHKEPTLRVKSIKNCVIILKGDKKILKSFPEGEWNPQTTKAVDSSNNASNTVPSPIKQEEGLNSTVAEREETPAPTITEKDIIEAIGDGKVNIKEFGKFIRRKYPGAENKKLMFAIVKKLCRKVGNDHMELKKE 738 T 9.4E-36 TFIIF_alpha pdbhh F Eukaryota T 7zs9 19 S U TOA1_YEAST Transcription initiation factor IIA large subunit MSNAEASRVYEIIVESVVNEVREDFENAGIDEQTLQDLKNIWQKKLTETKVTTFSWDNQFNEGNINGVQNDLNFNLATPGVNSSEFNIKEENTGSALLDTDEVGSELDDSDDDYLISEGEEDGPDENLMLCLYDKVTRTKARWKCSLKDGVVTINRNDYTFQKAQVEAEWV 171 F F Eukaryota T 7zsa 16 P Q T2FA_YEAST TFIIF-ALPHA,TFIIF LARGE SUBUNIT,TRANSCRIPTION FACTOR G 105 KDA SUBUNIT,P105 GPGMSRRNPPGSRNGGGPTNASPFIKRDRMRRNFLRMRMGQNGSNSSSPGVPNGDNSRGSLVKKDDPEYAEEREKMLLQIGVEADAGRSNVKVKDEDPNEYNEFPLRAIPKEDLENMRTHLLKFQSKKKINPVTDFHLPVRLHRKDTRNLQFQLTRAEIVQRQKEISEYKKKAEQERSTPNSGGMNKSGTVSLNNTVKDGSQTPTVDSVTKDNTANGVNSSIPTVTGSSVPPASPTTVSAVESNGLSNGSTSAANGLDGNASTANLANGRPLVTKLEDAGPAEDPTKVGMVKYDGKEVTNEPEFEEGTMDPLADVAPDGGGRAKRGNLRRKTRQLKVLDENAKKLRFEEFYPWVMEDFDGYNTWVGSYEAGNSDSYVLLSVEDDGSFTMIPADKVYKFTARNKYATLTIDEAEKRMDKKSGEVPRWLMKHLDNIGTTTTRYDRTRRKLKAVADQQAMDEDDRDDNSEVELDYDEEFADDEEAPIIDGNEQENKESEQRIKKEMLQANAMGLRDEEAPSENEEDELFGEKKIDEDGERIKKALQKTELAALYSSDENEINPYLSESDIENKENESPVKKEEDSDTLSKSKRSSPKKQQKKATNAHVHKEPTLRVKSIKNCVIILKGDKKILKSFPEGEWNPQTTKAVDSSNNASNTVPSPIKQEEGLNSTVAEREETPAPTITEKDIIEAIGDGKVNIKEFGKFIRRKYPGAENKKLMFAIVKKLCRKVGNDHMELKKE 738 T 9.4E-36 TFIIF_alpha pdbhh F Eukaryota T 7zsa 19 S U TOA1_YEAST Transcription initiation factor IIA large subunit MSNAEASRVYEIIVESVVNEVREDFENAGIDEQTLQDLKNIWQKKLTETKVTTFSWDNQFNEGNINGVQNDLNFNLATPGVNSSEFNIKEENTGSALLDTDEVGSELDDSDDDYLISEGEEDGPDENLMLCLYDKVTRTKARWKCSLKDGVVTINRNDYTFQKAQVEAEWV 171 F F Eukaryota T 7zsb 16 P Q T2FA_YEAST TFIIF-ALPHA,TFIIF LARGE SUBUNIT,TRANSCRIPTION FACTOR G 105 KDA SUBUNIT,P105 GPGMSRRNPPGSRNGGGPTNASPFIKRDRMRRNFLRMRMGQNGSNSSSPGVPNGDNSRGSLVKKDDPEYAEEREKMLLQIGVEADAGRSNVKVKDEDPNEYNEFPLRAIPKEDLENMRTHLLKFQSKKKINPVTDFHLPVRLHRKDTRNLQFQLTRAEIVQRQKEISEYKKKAEQERSTPNSGGMNKSGTVSLNNTVKDGSQTPTVDSVTKDNTANGVNSSIPTVTGSSVPPASPTTVSAVESNGLSNGSTSAANGLDGNASTANLANGRPLVTKLEDAGPAEDPTKVGMVKYDGKEVTNEPEFEEGTMDPLADVAPDGGGRAKRGNLRRKTRQLKVLDENAKKLRFEEFYPWVMEDFDGYNTWVGSYEAGNSDSYVLLSVEDDGSFTMIPADKVYKFTARNKYATLTIDEAEKRMDKKSGEVPRWLMKHLDNIGTTTTRYDRTRRKLKAVADQQAMDEDDRDDNSEVELDYDEEFADDEEAPIIDGNEQENKESEQRIKKEMLQANAMGLRDEEAPSENEEDELFGEKKIDEDGERIKKALQKTELAALYSSDENEINPYLSESDIENKENESPVKKEEDSDTLSKSKRSSPKKQQKKATNAHVHKEPTLRVKSIKNCVIILKGDKKILKSFPEGEWNPQTTKAVDSSNNASNTVPSPIKQEEGLNSTVAEREETPAPTITEKDIIEAIGDGKVNIKEFGKFIRRKYPGAENKKLMFAIVKKLCRKVGNDHMELKKE 738 T 9.4E-36 TFIIF_alpha pdbhh F Eukaryota T 7zsb 19 S U TOA1_YEAST Transcription initiation factor IIA large subunit MSNAEASRVYEIIVESVVNEVREDFENAGIDEQTLQDLKNIWQKKLTETKVTTFSWDNQFNEGNINGVQNDLNFNLATPGVNSSEFNIKEENTGSALLDTDEVGSELDDSDDDYLISEGEEDGPDENLMLCLYDKVTRTKARWKCSLKDGVVTINRNDYTFQKAQVEAEWV 171 F F Eukaryota T 7zsd 2 B P C2D1_DROME de novo designed binder ETGASSTNMLEALQQRLQFYHGQVARAALENNSGKARRFGRIVKQYEDAIKLYKAGKPVPYDELPVPPGFGGSENLYFQ 79 T 3.7 DUF5327 pdbhh F Eukaryota T 7zss 2 D,E,F D,P,h C2D1_DROME de novo designed binder ETGASSTNMLEALQQRLQFYHGQVARAALENNSGKARRFGRIVKQYEDAIKLYKAGKPVPYDELPVPPGFGGSENLYFQ 79 T 3.7 DUF5327 pdbhh F Eukaryota T 7zsu 1 A,B,C A,B,C CP125_MYCTU CHOLEST-4-EN-3-ONE 26-MONOOXYGENASE,CHOLEST-4-EN-3-ONE C26-MONOOXYGENASE [(25S)-3-OXOCHOLEST-4-EN-26-OATE FORMING],CHOLESTEROL C26-MONOOXYGENASE,CHOLESTEROL C26-MONOOXYGENASE [(25S)-3BETA-HYDROXYCHOLEST-5-EN-26-OATE FORMING],CYTOCHROME P450 125,STEROID C27-MONOOXYGENASE NGPSPNLPPGFDFTDPAIYAERLPVAEFAELRSAAPIWWNGQDPGKGGGFHDGGFWAITKLNDVKEISRHSDVFSSYENGVIPRFKNDIAREDIEVQRFVMLNMDAPHHTRLRKIISRGFTPRAVGRLHDELQERAQKIAAEAAAAGSGDFVEQVSCELPLQAIAGLLGVPQEDRGKLFHWSNEMTGNEDPEYAHIDPKASSAELIGYAMKMAEEKAKNPADDIVTQLIQADIDGEKLSDDEFGFFVVMLAVAGNETTRNSITQGMMAFAEHPDQWELYKKVRPETAADEIVRWATPVTAFQRTALRDYELSGVQIKKGQRVVMFYRSANFDEEVFQDPFTFNILRNPNPHVGFGGTGAHYCIGANLARMTINLIFNAVADHMPDLKPISAPERLRSGWLNGIKHWQVDYTGRCPVAH 418 T 3.4000000000000003E-22 p450 unppssm F Bacteria T 7zt0 1 A,B A,B CP125_MYCTU CHOLEST-4-EN-3-ONE 26-MONOOXYGENASE,CHOLEST-4-EN-3-ONE C26-MONOOXYGENASE [(25S)-3-OXOCHOLEST-4-EN-26-OATE FORMING],CHOLESTEROL C26-MONOOXYGENASE,CHOLESTEROL C26-MONOOXYGENASE [(25S)-3BETA-HYDROXYCHOLEST-5-EN-26-OATE FORMING],CYTOCHROME P450 125,STEROID C27-MONOOXYGENASE NGPSPNLPPGFDFTDPAIYAERLPVAEFAELRSAAPIWWNGQDPGKGGGFHDGGFWAITKLNDVKEISRHSDVFSSYENGVIPRFKNDIAREDIEVQRFVMLNMDAPHHTRLRKIISRGFTPRAVGRLHDELQERAQKIAAEAAAAGSGDFVEQVSCELPLQAIAGLLGVPQEDRGKLFHWSNEMTGNEDPEYAHIDPKASSAELIGYAMKMAEEKAKNPADDIVTQLIQADIDGEKLSDDEFGFFVVMLAVAGNETTRNSITQGMMAFAEHPDQWELYKKVRPETAADEIVRWATPVTAFQRTALRDYELSGVQIKKGQRVVMFYRSANFDEEVFQDPFTFNILRNPNPHVGFGGTGAHYCIGANLARMTINLIFNAVADHMPDLKPISAPERLRSGWLNGIKHWQVDYTGRCPVAH 418 T 3.4000000000000003E-22 p450 unppssm F Bacteria T 7zta 55 CB PQK1 MAAAPQK nascent peptide MAAAPQK 7 T 12 BLM_N pdbhh F F 7ztc 2 I,J,K,L W,X,Y,Z Non-muscle tropomyosin 1.6 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 135 F F F 7ztd 2 I,J,K,L P,Q,R,S Non-muscle tropomyosin 3.2 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 135 F F F 7zuc 3 C,F C,F LEU-LEU-LEU-GLY-ILE-GLY-ILE-LEU-VAL LLLGIGILV 9 T 1.4 UAF_Rrn10 pdbhh F F 7zud 2 B M CPSF6_HUMAN CLEAVAGE AND POLYADENYLATION SPECIFICITY FACTOR 68 KDA SUBUNIT,CPSF 68 KDA SUBUNIT,CLEAVAGE FACTOR IM COMPLEX 68 KDA SUBUNIT,CFIM68,PRE-MRNA CLEAVAGE FACTOR IM 68 KDA SUBUNIT,PROTEIN HPBRII-4/7 PVLFPGQPFGQPP 13 T 1.4 MF_alpha pdbhh F Eukaryota T 7zuw 81 CC CB CUE3_YEAST CUE3 isoform 1 MLSRYNRVIEINGGNADISLPIVKFPPFKLRAQLIEKDPVVWLHLIETYVTYFEYLMQGANVELLDESTLDHLRLFLRTYLHEIADEEGKLLSLGINHDVSEQLYLLKGWIFSLIKKCGLLHLQIFGDSLWNLIKVYVRRNPDSIRGLIDGSLKPRINTQRVQLDKSYQVQQHLKQLIESGKFKRIDLRCVEDLLSAKSMQPNKFAENFFTANWIEILEALWAKGQGRGHKEARELIIISLFSVSADRLLKITKELGISNFETLALYPLLGTMLINEGVHKRLPDLKSKLLFLNLGG 297 T 0.21 DUF4919 pdbpercent F Eukaryota T 7zv1 1 A,B,C A,B,C POLG_AIVA8 P2A GAASATPDVDPDDRVYIVRAQRPTYVHWAIRKVAPDGSAKQISLSRSGIQALVALEPPEGEPYMEILPSHWTLAELQLGNKWEYSATNNCTHFVSSITGESLPNTGFSMALGIGALTAIAASAAVAVKALPGIRRQ 136 T 0.0007 Calici_PP_N pdbhh T Viruses T 7zv5 2 B B inhibitor TRIP5 XGFX 4 T 87 YcgL pdbhh F F 7zv6 1 A,B,C B,A,C POLG_AIVA8 P2A GPGGAASATPDVDPDDRVYIVRAQRPTYVHWAIRKVAPDGSAKQISLSRSGIQALVALEPPEGEPYLEILPSHWTLAELQLGNKWEYSATNNCTHFVSSITGESLPNTGFSLALGIGALTAIAASAAVAVKALPGIRRQ 139 T 0.00087 Calici_PP_N pdbhh T Viruses T 7zv7 2 B B inhibitor 57 XGFX 4 T 87 YcgL pdbhh F F 7zv8 2 C,D C,D inhibitor 58 GFX 3 T 40 HORMA pdbhh F F 7zvc 2 B C GLY-GLY-GLY-GLY GGGG 4 T 90 E3_UbLigase_R4 pdbhh F F 7zvi 2 B E A4ZF88_9CAUD Sri GAMDPMVTKEFLKIKLECSDMYAQKLIDEAQGDENKLYDLFIQKLAERHTRPAIVEY 57 T 0.42 DUF3173 unphh T Viruses T 7zvo 1 A A Q8A921_BACTN Beta-galactosidase MGSSHHHHHHSSGPQQGLRYEAETATLKGKFRKKEHRKQTGVFFDKGKGNSIEWNISTGLAQVYALRFKYMNTTGKPMPVLMKFIDSKGVVLKEDILTFPETPDKWKMMSTTTGTFINAGHYKVLLSAENMDGLAFDALDI 141 T 0.023 GH115_C pdbhh F Bacteria T 7zw0 37 KA sj YIQ1_YEAST Uncharacterized protein YIL161W MDTKLSVTGAKKSQGKASGLGNEGTPIGNEESTNKAKNGNKKRNKNRNRNKKTETKEQNEPKPVTGGEEVRVEKSQAKNRRRKNNNGANKKNTLHYSKEINVEERKQIAKRQEEIEQCIHTLSDFKLFKKGKHVTSYGYRISPMTDSGKISLKILFNIPLDYPKAPIKLTMKSNEEVSSYMDTVIANFNWKARQLVKEDWRILSQINYLVSELEILKMENYKQIDKLRNSFYKTI 235 T 0.0044 RWD pdbpercent F Eukaryota T 7zwc 3 C U TF2AA_HUMAN GENERAL TRANSCRIPTION FACTOR IIA SUBUNIT 1,TFIIAL,TRANSCRIPTION INITIATION FACTOR TFIIA 42 KDA SUBUNIT,TFIIA-42 MANSANTNTVPKLYRSVIEDVINDVRDIFLDDGVDEQVLMELKTLWENKLMQSRAVDGFHSEEQQLLLQVQQQHQPQQQQHHHHHHHQQAQPQQTVPQQAQTQQVLIPASQQATAPQVIVPDSKLIQHMNASNMSAAATAATLALPAGVTPVQQILTNSGQLLQVVRAANGAQYIFQPQQSVVLQQQVIPQMQPGGVQAPVIQQVLAPLPGGISPQTGVIIQPQQILFTGNKTQVIPTTVAAPTPAQAQITATGQQQPQAQPAQTQAPLVLQVDGTGDTSSEEDEDEEEDYDDDEEEDKEKDGAEDGQVEEEPLNSEDDVSDEEGQELFDTENVVVCQYDKIHRSKNKWKFHLKDGIMNLNGRDYIFSKAIGDAEW 376 F F Eukaryota T 7zwd 19 S U TF2AA_HUMAN GENERAL TRANSCRIPTION FACTOR IIA SUBUNIT 1,TFIIAL,TRANSCRIPTION INITIATION FACTOR TFIIA 42 KDA SUBUNIT,TFIIA-42 MANSANTNTVPKLYRSVIEDVINDVRDIFLDDGVDEQVLMELKTLWENKLMQSRAVDGFHSEEQQLLLQVQQQHQPQQQQHHHHHHHQQAQPQQTVPQQAQTQQVLIPASQQATAPQVIVPDSKLIQHMNASNMSAAATAATLALPAGVTPVQQILTNSGQLLQVVRAANGAQYIFQPQQSVVLQQQVIPQMQPGGVQAPVIQQVLAPLPGGISPQTGVIIQPQQILFTGNKTQVIPTTVAAPTPAQAQITATGQQQPQAQPAQTQAPLVLQVDGTGDTSSEEDEDEEEDYDDDEEEDKEKDGAEDGQVEEEPLNSEDDVSDEEGQELFDTENVVVCQYDKIHRSKNKWKFHLKDGIMNLNGRDYIFSKAIGDAEW 376 F F Eukaryota T 7zwj 1 A A Triculamin SKKSKPGDGIRGKGVRG 17 T 5.7 CP_ATPgrasp_1 pdbhh F T 7zwk 3 C C (1R,2R,3S,4R,6S)-4,6-diamino-2,3-dihydroxycyclohexyl 2,6-diamino-2,6-dideoxy-alpha-D-glucopyranoside XGGGXKK 7 T 49 PTase_Orf2 pdbhh F F 7zwn 2 B,C B,C ALA-TRP-VAL-ILE-PRO-ALA AWVIPA 6 T 0.39 DUF3950 pdbhh F F 7zwo 2 B B ALA-TRP-VAL-ILE-PRO-ALA AWVIPA 6 T 0.39 DUF3950 pdbhh F F 7zwp 2 B B ALA-TRP-VAL-ILE-PRO-ALA AWVIPA 6 T 0.39 DUF3950 pdbhh F F 7zwr 2 B B ALA-TRP-VAL-ILE-PRO-ALA AWVIPA 6 T 0.39 DUF3950 pdbhh F F 7zws 2 B B ALA-TRP-VAL-ILE-PRO-ALA AWVIPA 6 T 0.39 DUF3950 pdbhh F F 7zwu 2 B B ALA-TRP-VAL-ILE-PRO-ALA AWVIPA 6 T 0.39 DUF3950 pdbhh F F 7zwv 2 B B ALA-TRP-VAL-ILE-PRO-ALA AWVIPA 6 T 0.39 DUF3950 pdbhh F F 7zwx 2 B B ALA-TRP-VAL-ILE-PRO-ALA AWVIPA 6 T 0.39 DUF3950 pdbhh F F 7zwy 2 B B ALA-TRP-VAL-ILE-PRO-ALA AWVIPA 6 T 0.39 DUF3950 pdbhh F F 7zwz 2 B B ALA-TRP-VAL-ILE-PRO-ALA AWVIPA 6 T 0.39 DUF3950 pdbhh F F 7zx4 2 C,D,E C,D,E DLGP5_HUMAN DAP-5,DISCS LARGE HOMOLOG 7,DISKS LARGE-ASSOCIATED PROTEIN DLG7,HEPATOMA UP-REGULATED PROTEIN,HURP YRHISFGGNLITFSPLQPGEF 21 T 17 DUF4722 pdbhh F Eukaryota T 7zx7 19 S U TF2AA_HUMAN GENERAL TRANSCRIPTION FACTOR IIA SUBUNIT 1,TFIIAL,TRANSCRIPTION INITIATION FACTOR TFIIA 42 KDA SUBUNIT,TFIIA-42 MANSANTNTVPKLYRSVIEDVINDVRDIFLDDGVDEQVLMELKTLWENKLMQSRAVDGFHSEEQQLLLQVQQQHQPQQQQHHHHHHHQQAQPQQTVPQQAQTQQVLIPASQQATAPQVIVPDSKLIQHMNASNMSAAATAATLALPAGVTPVQQILTNSGQLLQVVRAANGAQYIFQPQQSVVLQQQVIPQMQPGGVQAPVIQQVLAPLPGGISPQTGVIIQPQQILFTGNKTQVIPTTVAAPTPAQAQITATGQQQPQAQPAQTQAPLVLQVDGTGDTSSEEDEDEEEDYDDDEEEDKEKDGAEDGQVEEEPLNSEDDVSDEEGQELFDTENVVVCQYDKIHRSKNKWKFHLKDGIMNLNGRDYIFSKAIGDAEW 376 F F Eukaryota T 7zx8 19 S U TF2AA_HUMAN GENERAL TRANSCRIPTION FACTOR IIA SUBUNIT 1,TFIIAL,TRANSCRIPTION INITIATION FACTOR TFIIA 42 KDA SUBUNIT,TFIIA-42 MANSANTNTVPKLYRSVIEDVINDVRDIFLDDGVDEQVLMELKTLWENKLMQSRAVDGFHSEEQQLLLQVQQQHQPQQQQHHHHHHHQQAQPQQTVPQQAQTQQVLIPASQQATAPQVIVPDSKLIQHMNASNMSAAATAATLALPAGVTPVQQILTNSGQLLQVVRAANGAQYIFQPQQSVVLQQQVIPQMQPGGVQAPVIQQVLAPLPGGISPQTGVIIQPQQILFTGNKTQVIPTTVAAPTPAQAQITATGQQQPQAQPAQTQAPLVLQVDGTGDTSSEEDEDEEEDYDDDEEEDKEKDGAEDGQVEEEPLNSEDDVSDEEGQELFDTENVVVCQYDKIHRSKNKWKFHLKDGIMNLNGRDYIFSKAIGDAEW 376 F F Eukaryota T 7zxd 1 A,B,C A,B,C CP125_MYCTU CHOLEST-4-EN-3-ONE 26-MONOOXYGENASE,CHOLEST-4-EN-3-ONE C26-MONOOXYGENASE [(25S)-3-OXOCHOLEST-4-EN-26-OATE FORMING],CHOLESTEROL C26-MONOOXYGENASE,CHOLESTEROL C26-MONOOXYGENASE [(25S)-3BETA-HYDROXYCHOLEST-5-EN-26-OATE FORMING],CYTOCHROME P450 125,STEROID C27-MONOOXYGENASE NGPSPNLPPGFDFTDPAIYAERLPVAEFAELRSAAPIWWNGQDPGKGGGFHDGGFWAITKLNDVKEISRHSDVFSSYENGVIPRFKNDIAREDIEVQRFVMLNMDAPHHTRLRKIISRGFTPRAVGRLHDELQERAQKIAAEAAAAGSGDFVEQVSCELPLQAIAGLLGVPQEDRGKLFHWSNEMTGNEDPEYAHIDPKASSAELIGYAMKMAEEKAKNPADDIVTQLIQADIDGEKLSDDEFGFFVVMLAVAGNETTRNSITQGMMAFAEHPDQWELYKKVRPETAADEIVRWATPVTAFQRTALRDYELSGVQIKKGQRVVMFYRSANFDEEVFQDPFTFNILRNPNPHVGFGGTGAHYCIGANLARMTINLIFNAVADHMPDLKPISAPERLRSGWLNGIKHWQVDYTGRCPVAH 418 T 3.4000000000000003E-22 p450 unppssm F Bacteria T 7zxe 5 E U TF2AA_HUMAN GENERAL TRANSCRIPTION FACTOR IIA SUBUNIT 1,TFIIAL,TRANSCRIPTION INITIATION FACTOR TFIIA 42 KDA SUBUNIT,TFIIA-42 MANSANTNTVPKLYRSVIEDVINDVRDIFLDDGVDEQVLMELKTLWENKLMQSRAVDGFHSEEQQLLLQVQQQHQPQQQQHHHHHHHQQAQPQQTVPQQAQTQQVLIPASQQATAPQVIVPDSKLIQHMNASNMSAAATAATLALPAGVTPVQQILTNSGQLLQVVRAANGAQYIFQPQQSVVLQQQVIPQMQPGGVQAPVIQQVLAPLPGGISPQTGVIIQPQQILFTGNKTQVIPTTVAAPTPAQAQITATGQQQPQAQPAQTQAPLVLQVDGTGDTSSEEDEDEEEDYDDDEEEDKEKDGAEDGQVEEEPLNSEDDVSDEEGQELFDTENVVVCQYDKIHRSKNKWKFHLKDGIMNLNGRDYIFSKAIGDAEW 376 F F Eukaryota T 7zxk 1 A,C A,C IL27A_HUMAN IL-27 SUBUNIT ALPHA,IL-27-A,IL27-A,INTERLEUKIN-30,P28 FPRPPGRPQLSLQELRREFTVSLHLARKLLSEVRGQAHRFAESHLPGVNLYLLPLGEQLPDVSLTFQAWRRLSDPERLCFISTTLQPFHALLGGLGTQGRWTNMERMQLWAMRLDLRDLQRHLRFQVLAAGFNLPEEEEEEEEEEEEERKGLLPGALGSALQGPAQVSWPQLLSTYRLLHSLELVLSRAVRELLLLSKAGHSVWPLGFPTLSPQP 215 F F Eukaryota T 7zxy 5 E,M E,M A0A6P1VG96_9SYNC Cytochrome B6 MAAGVGIFIGYIAVFTGVTLGLLYGLRFVKLI 32 T 0.00039 PetL pdbpercent F Bacteria T 7zy4 2 C,D C,D FIP1_HUMAN hFip1 SNAMSAGEVERLVSELSGGTGGDEEEEWLYGDENEVER 38 T 0.002 DUF5404 pdbhh F Eukaryota T 7zzz 1 A,B,C,D,E,F,G,J,K,L J,D,E,F,G,H,I,A,B,C A0A9E7D7B0_9VIRU Major capsid protein P5 MKIATITGVTKSPELQVTKAIGALILSSDVALSALTTEKISIYIERGNGSNVILANKVLLKDFILASTYGTENTQSDADNAMIALCELADEGSIYLADKESIKITLEDLISDKRYDLHGIEEPQQTNNLFFFEQKSVASEEFNKKIDVQGFDLAIMTVDDSVSDLSYQYSNGQVVKYLPFELQTLSRDIDPIQAVLSDGKVVQGLTDRLTLPLVAVVGIEINKSQGSIINFVVRCLKTV 239 T 34 DUF5053 pdbhh T Viruses. T 7zzz 2 H P A0A9E7A4L7_9VIRU Spike protein P13 N-terminal, capsid internal domain MNFIQYIDDSYAVKVKEINSSEGFYINGIQTPFFILSVFIGNKRVTGVEFNNYDSLPMLSVINDLGNIDLNVIPQNYFATAFTEIYFNIPF 91 T 15 GAPES2 pdbhh T Viruses. T 7zzz 3 I X Unknown vertex protein XXXXX 5 F F F 8a01 1 A,B,C C,B,A A0A9E7D7B0_9VIRU Major capsid protein P5 MKIATITGVTKSPELQVTKAIGALILSSDVALSALTTEKISIYIERGNGSNVILANKVLLKDFILASTYGTENTQSDADNAMIALCELADEGSIYLADKESIKITLEDLISDKRYDLHGIEEPQQTNNLFFFEQKSVASEEFNKKIDVQGFDLAIMTVDDSVSDLSYQYSNGQVVKYLPFELQTLSRDIDPIQAVLSDGKVVQGLTDRLTLPLVAVVGIEINKSQGSIINFVVRCLKTV 239 T 34 DUF5053 pdbhh T Viruses. T 8a02 1 A,B,C F,E,D A0A9E7D7B0_9VIRU Major capsid protein P5 MKIATITGVTKSPELQVTKAIGALILSSDVALSALTTEKISIYIERGNGSNVILANKVLLKDFILASTYGTENTQSDADNAMIALCELADEGSIYLADKESIKITLEDLISDKRYDLHGIEEPQQTNNLFFFEQKSVASEEFNKKIDVQGFDLAIMTVDDSVSDLSYQYSNGQVVKYLPFELQTLSRDIDPIQAVLSDGKVVQGLTDRLTLPLVAVVGIEINKSQGSIINFVVRCLKTV 239 T 34 DUF5053 pdbhh T Viruses. T 8a03 1 A,B,C L,K,J A0A9E7D7B0_9VIRU Major capsid protein P5 MKIATITGVTKSPELQVTKAIGALILSSDVALSALTTEKISIYIERGNGSNVILANKVLLKDFILASTYGTENTQSDADNAMIALCELADEGSIYLADKESIKITLEDLISDKRYDLHGIEEPQQTNNLFFFEQKSVASEEFNKKIDVQGFDLAIMTVDDSVSDLSYQYSNGQVVKYLPFELQTLSRDIDPIQAVLSDGKVVQGLTDRLTLPLVAVVGIEINKSQGSIINFVVRCLKTV 239 T 34 DUF5053 pdbhh T Viruses. T 8a04 1 A,B,C I,H,G A0A9E7D7B0_9VIRU Major capsid protein P5 MKIATITGVTKSPELQVTKAIGALILSSDVALSALTTEKISIYIERGNGSNVILANKVLLKDFILASTYGTENTQSDADNAMIALCELADEGSIYLADKESIKITLEDLISDKRYDLHGIEEPQQTNNLFFFEQKSVASEEFNKKIDVQGFDLAIMTVDDSVSDLSYQYSNGQVVKYLPFELQTLSRDIDPIQAVLSDGKVVQGLTDRLTLPLVAVVGIEINKSQGSIINFVVRCLKTV 239 T 34 DUF5053 pdbhh T Viruses. T 8a05 1 A P A0A9E7A4L7_9VIRU Spike protein P13 N-terminal, capsid internal domain MNFIQYIDDSYAVKVKEINSSEGFYINGIQTPFFILSVFIGNKRVTGVEFNNYDSLPMLSVINDLGNIDLNVIPQNYFATAFTEIYFNIPF 91 T 15 GAPES2 pdbhh T Viruses. T 8a05 2 B X Unknown vertex protein XXXXX 5 F F F 8a06 1 A E A0A222NP85_9VIRU Penton protein P12 DFSTIPIDYVKAKDPNTIDFCLSYLELYHTTKAVKACTPFSFILGSDAGMQRATETTESLYWGKVILDINPNLSPLVNTTIVLEIESMLSSNSINRSENKRITRYIEKENFVNESSERFEFFKSMELSHLSTAYDVYVTFIGFKIDL 147 T 0.028 McrBC pdbpssm T Viruses T 8a09 1 A,B A,B A hexameric barrel state of a de novo coiled-coil assembly: CC-Type2-(QgLaId)4 XGEIAQQLKEIAKQLKEIAWQLKEIAQQLKGX 32 T 0.0058 WXG100 pdbpssm F T 8a0j 1 A,B A,B G0V1V5_TRYCI Uncharacterized protein TCIL3000_11_11110 KAFLALPRGEEQRMRFVDEFLSGAWVRFYSFTTDDVVAMYYSLQPGRYGAFFATEQGVGTAVVDVHSKLVLYVPCMDKDSMNRIQPHPHVLTYFEEDVQLLNISDAQKVLGSVLTGIMNFVQEIARQRGEGLPPPAVHAAYLHERDKTAVPSNTKFAYVRKVFPDPSGSFVLFRLSNLRSQVICNVLMDIRWQSDRQNNVGQRYYVLADGTAEPFTVDHTGILFEVDQVVRNNFRR 236 T 0.31 PH_BEACH pdb F Eukaryota T 8a0k 1 A,B,C,D A,B,C,D Q38DT1_TRYB2 Protein kinase, putative SSVPPTPEERHMLLNGDWIRYYHFYPMEEEGGDSVAVTYHIQPGRTGVTFFNHSFSVHSAVLSVLEHIVYVVDRVDIEEDNDVARILSLAQALNEEKKIYDVLQLVETHDTHMLKQRRSPGIMSVYCPPQTAFQCNGDPFVFVRWYRFHMENSMSGFMLSNGAVQVFVGGKYELRWLDDNRKFIVRSNGVCEVLDEEKFPLSEELNQMLYGGV 213 T 0.024 DUF4704 pdb F Eukaryota T 8a12 3 C E A0A2I0BQX1_PLAFO Myosin essential light chain ELC MASDMEEKFREAFILFSSCSDHIEMYKFFELMNSFGIILTNDEKAALPNDINMDYWLNFAKKHYNYEQPFKHINNVNEQNTNVQIKIDNFLGIMKALDTRLTESDLNILLQITNPENKSTLNLKTVSQKLTESI 134 T 0.024 Na_Ca_ex_C pdbpercent F Eukaryota T 8a19 2 B B 6-(2-methoxyethoxy)-11,15-dimethyl-8-oxa-2,11,15,19,21,23-hexazatetracyclo[15.6.1.13,7.020,24]pentacosa-1(23),3(25),4,6,17,20(24),21-heptaen-10-one XPXLXLLXXXX 11 T 50 COX16 pdbhh F F 8a1a 2 B B 6-(2-methoxyethoxy)-11,15-dimethyl-8-oxa-2,11,15,19,21,23-hexazatetracyclo[15.6.1.13,7.020,24]pentacosa-1(23),3(25),4,6,17,20(24),21-heptaen-10-one XPXLXLLXXXX 11 T 50 COX16 pdbhh F F 8a22 21 U Am uL18m PLKPLVFRPLKLFFWATNRHVHAKVVRFSSPFNEGVPVVDISTFDAFNKLKGSAPVVAPRSLECYKEVAKMVKEETARQNINEVTLHLNSHASDRGVREVVRELKNLGLLVKKV 114 T 0.0038 Ribosomal_L18p pdbhh F T 8a22 33 GA Ay bL31m LVRPLTKAMTVVLSNGATLRLPTVYARAKPWFPVMDLHSHNVWKHKIKTDFQLESEKNITPDFSNFYNKFGK 72 T 0.002 Ribosomal_L31 pdbhh F T 8a22 39 MA AE mL40 EGNTRLQKVVSFFVPEVEKKEEEEKLATQYKRWKVAQVHAWNHDIAVKHRLQTEAIASLPQRLKEQALKPDYSPIPLNRKLLFHTPPESYRD 92 T 7.3E-05 MRP-L28 pdbhh F T 8a22 43 QA AI mL63 VVFKTTGGKAWNPPGGLKPLTNTQKRSRKENLQILLRNLSVLKLAAENQPEVTVNLFSPLKFMH 64 T 0.0048 L31 pdbhh F T 8a22 45 SA AK mL87 RPIMHKNWDWEFVVGAKAGRKPAIQRPKPHQWYYCNPKYSAEDPLPTKIFPPHAPPTAESLDDWAKFRKLCPKDPVEAKKFRKHFVRFLNQRNYDWRTAFERGLAKEVAVAKAAQRAEDETKRQEAWHAYRTAVFESAL 139 T 6 Serglycin pdbhh F T 8a22 46 TA,UA,VA AL,AM,AN mL116 GTIFNTGVPGPRPEVAQKLSTEYQGHILRMISLAESASELDEVLWSSKKHLRPVHIARSCLKLEYLRTKEKGREVSEPIKNLASELENYVELYSTKFTIGQVSQLVRGLSSIRRNIQPDLLLKLAAVVVADDGRQVQLANEMDCRDLFFGFFSQGFDNELFWKRLSESVLPRLPYFNADVVSTVLRVVSGLRFLHNTEFAHATMTALVPKVGDLSPARLADAFFSASLLDPTDVSGLNAKLEERFLREFTSFPIKDTVTMFQTVTVRRHSTPELAAQVAPLVAAQAHQLPVRHLRRALEGMVTAGWKDTAEIPLYAILAKQAARLVLGKQSAATSAILGKHVDNQGYQRTPVQLLRQLARIFANTGLKAGPGANQPLAPYFAALQRELEGRLAELDEQVTDDFAESFKKVGIAEGARVQI 420 T 9.8 FAST_1 pdbhh F T 8a22 47 WA AO mL118 YTWHFLSRQRVEAVNKATDILELEDIMRLEGNKYDYIAIRAFLKRVCILLQERADALGLPPSNEGLLVRFDEPERARYEALVSQVCDVVSARAKWFDPSNAAAVAYCLTRWLGRAEAPLIEQLLRRVVARLPEAKSKDVQYALDATLESAAAPHLEHLREPMLRAAGAFLGAKLPTGRVPPEVVAKITRLLVNHWDQPDEELLEAIVTDIAVRLEIYSPTALGRTLLALSKVPALTGAAFKRSRSSFLPEGVNVPSGADVAVPLADACLAHVAAHAAEHANEHDLIKFLGAISKLASPGRAATAGADAGAEATESGAAWAKRNSASLAWFALEQRLAPSTRGSFEGNQFPFVIKLVSAAARPPPAVTKFISSTVAKE 377 T 3.4 TAN pdbpercent F T 8a22 48 XA Xa mL120 IEEYYVVPPACPPPPHNPKTLKYVPKLNRTQIIARVYEAKTPTELENSALGKKFFSEFAAVAKLVRLSQLRRANVYNSRDDAMCVSIYNTSVRVADKLLHLSSDEELCGLIWALSQLPYPEYENLVDRSLQILLEEDKPLKTGSSLAVSRAAAGLASLGRWDASTWEVLVPLLRKNVQEGKEVELSNLALGLYDARETV 199 T 8.9 4HB pdbhh F T 8a22 49 YA Xb mL121 ITTLEFYEQNKEKLSFLVGNDVYSEFEEKWKPKPKVFDPEIPIAEAMWPEGRSEKLQSVIVKKLSFSTESAGSVVAGMKHGYCPDSVDVIVSRVEQLKAFEGYFPGFKTEEFVSVNPRLLNFSTDRIYWAMLTFQDMFSSAEVGPLMANIGHFIIENPIRCAKNLFCLATELESQLSMKLDVTAVKQDSWFTLSSEETIRERVAALATIFGSKTAGDILLRDINYLRLDSKDVNRDAVKIREHF 244 T 0.054 DUF732 pdbpssm F T 8a22 50 ZA Xc mL122 RKNIVWPEQLEEQKRNKQYEYQWVISNGKKFIARRESTSTKWEIWKSIEKVTPGQKP 57 T 0.0053 HTH_56 pdb F T 8a22 52 BB Xe mL124 FHTGVNLVQPIDTSKLTRQIKKLTLLHEAALTVLQYSNYCNPEQATEILRRLPFLMRHEESRVLKGQTLDPKLPPMFHGLLHVMGDRFVQVFSDCNLRQIERGAWALAAARHQHDGVALALSEKLKQLTQELLDLNAKPFNTRVTKPTPEQLNSGIFASRVLVPESVNQLPVKAVLPEFNALAGIAWALATVAGEHSAAAAKAALEQLAEKFGALQVDPKPLPDADSLCRLAWAFAKAGVHNPAAVDKLFHLAEERLKSQLQAHDPASGPLRPRCTYRYKTVRGWVDQHFPRKPRDSSYLGDTAPKIIPRDFEIDSLGSLLSAAALLRDQVPVERLQTILNLAAQHTAASSVAGGALQPLMVTYEEVTRVLAACEQLGFRSSTLVTPLLHGLPMAALSAEALSQLAAAATLHHVRSRTVYLRIVRAFNAKLSVSPTLVAGAGIGAEGKKEGEAAAALGAQLLLAVTKAGLPANASVSRIASLV 483 T 11 GYR pdb F T 8a22 53 CB Xf mL125 RPALTPSFSRVSDPWTGEKEAKYAAPYRIPEEVWKNSGAPKILFQDPWNSPDYDEVRKKHAVLVHDYLKQQSQPINVQTILEGVNKTHGLVLGTIEYVTSLLENMLWHDMAYVVKPVFSSPRKAKLSKIPLLYGANKYQQVFRGTPKEVAERYEAARAKHIKVAFTRLRTSKTPQPFRRRTDEYSHVQASQSALGLAAAAA 201 T 1.6 HARE-HTH pdbhh F T 8a22 54 DB Xg mL126 SVKYIPNHAATPNKYKDAQQKVLWDRAKKLGKKPEYKVPNIKDTQTVFEIGKLTKLCLEHWKPMHFAAALGHVINVWTTQALKSGRYGGKSFTVRELLGFRSLPYGVNSITAVLPLQSPEDFLSQPLAKQPFSFKPVSVREEVKKIIASNPGLLIHNWSLKIEGQPNHPITDEDRAAAVIAICTSSFRARFNEAGDVAVALVLSRLARCGYWLPPLYELIAPFAAFQGARIDHSSPAVIANVLLVLARAKGQAEMGQPTALQIRAIAPALEQKCLQRLGELLPSLEALVISDTLAATALLSSPEARALLAQIKAEVLARNFLGFESRDIIACFKELVANVYQPLQLSADLPAPGELRDELPGGEKVLDEQLLAALSGAVVEGGALXXXXXXXXXXXXXXXXXXXXXXXXX 410 T 15 hDGE_amylase pdbhh F T 8a22 56 FB Xi mL128 KFQSRAEKKYRIMDEKVGKPRFQA 24 T 11 HJURP_C pdbhh F T 8a22 57 GB Xj mL129 AKRLLREAPPFTEQVDEKNFDDKDYLGAGAMSNEQLRKELEKVEPGEAKWDEESPLIPPPQPRQYRNKGHQ 71 T 1.5 La_HTH_kDCL pdbhh F T 8a22 62 LB Ba bS1m VSLPMTKAELRASLLYKMRKDTLTKVIDSSLSTSVLTIEEKEKIDRTLYAHIETENNHIDPGLHCLIRALRRSNLTMPILKGQQIQAKVIQKTDEVMLLDPGFYNLSEVPVNYLTTAHIVRKVDDSPRENLYDVRPGDVVKVLVDDVYTPYGDMQLDVPQQDPRLILNQVWDELHLKMKKKELVRGRILNECKSGYAVGVAGFVALLPYANTSREVANRVGEAQSFQIKSMSEPHRRRIVLQ 242 T 0.00036 MRP-S35 pdbhh F T 8a22 69 SB Bh uS8m AAQAYFDLRYHVKKQGLLTVNRAASIINSIFPEFSHESHRNQLAVPLPRKEIPTYIMQNAKVQPWALLPTKAAAYAQYPNFFRSSSLFFGSLNREIVNRRPYSLLPADKLSMDLAQVCTNLGILNGWDIVQKREKLKDLDFVWPANELPRDHHEVKLFKHLHLRLALKWEQHKPLWEDGSMVKDQREYRDQQQVQQQQPLPHLPLAPLFGPLPLTVRNLSKASQPVLLYPLQLRELAQRMPSGLFLLYHHELGVITDAQAFLFDVPVVALAHVGLPVSMAAAVNGAVNRTFRAELGKPLREVTKLKDWSLSATIAAQVRERRQQLLERAEQTKRERKQIQDLVTVRVGKFKAEVDKEDSSLALQDELLAWQLKE 374 T 59 OPA1_C pdbhh F T 8a22 82 FC Bu mS23 AFSRYFVQKFKQSYTRKYMRDMESGAFSFPKCHDILGKYRPDVLFAAAAAPLKLELPEQAVYKKLYRDFPELRKDAVDLSSLEAPLAKQFALKHLVLSAEIAANSPRTRHILRRDLEADPAYERLKEEFMPRIAELRKQQEQTASLQQLQADEEEHLKLALTYVAAQ 167 T 64 UIM pdbhh F T 8a22 85 IC Bx mS31 SSAARWRAAIAQRLGVEAAAAAQALAALLGQGDLALTVLAAASEADVLNITELLENNSVDEAVTNARKVAIVSGHGLFLATATSEDLAALSDVEAGELAALMGKVHVVGLPLADALLGSDSLTHDQLLTLTRSEKQALLWRLASVGKLREGRAKAVAALRKAALDRAAAAAEASEGLLSAAAMMKLEHDIAEFDLVRERYLPGPGLPEGVQEAFAPSGLPSAFSRDEQALYDAYFGLRSHAASAQPEPLEGPSAAQLHSSFLDGFQCREEDSQMEELPESFGQWVANIKGLIVKAPVPLLGLLAKFVTAKIDGADARDASETQSRLRLLAAEIATDIARRREARLAVSPWWQRASAPIDALAISSIDHPSSDPLVQLLEVLLGHSGADEFGSWISAVAMRPVSPYEILADEHRLMDLERYLSMTSASELHLELAATPLPWASPAVHVPPAAFLEEMRAKFNNYLLATGLSPLSAAEWSAYKDWALEEFAEKRALGEEALLQEGHSGFFNPKADEIYLRALLEATIPPEAPLREQAVRYLETVNMNKTWTFLKKKHMVQRLAELSRHLTEHPPVEEQGSPFAALFAVGPGAKPTPLVPKLSKRLPAHGPESLDLPELPEIFR 621 T 43 MRP-S31 pdbhh F T 8a22 91 OC BD mS45 PSVNDLASLLSLSEQYRGADVLAEGAALPGTGFANARGTFLPHELPTAIEYLKELDPEAEMKLEQMEAMYKLLYSRNESEREVGRQMMYDLLKLSGHPFRELELCNWDYMAAFLDARVAGRVFHRGSGERLVHRTATFPAFEGYPLAEVDQTTEGEVSKLNREESKRQDNAMFQDFRKKLLFNLGMVGEQLWEPVQGVLSANLRSALDRPLVVYDITAATGETVYPPKFVAEVDGTRRALNEQERAYQAKRKPGPRLPYYMRRIARKEEL 270 T 0.19 TOM6p pdbpssm F T 8a22 92 PC BE mS106 IAPSQLDKLEKFVHVRPPKTDYEEDIKQAISSVTDNEGLKKCLDLFLTNHAAQTWVGKHEYANAGQVSDFIKVCEKVGSSAPLVTLWQSAYRYGVDPTVPLLRSSAAACAALGEGADAALVLLYGSCYIVNVPEDLAAAVRTALEAHEKAQEGNAEALAKVQKYREALDRL 171 T 0.019 PfaD_N pdb F T 8a22 93 QC BF mS107 MATIPKGLDIDPESPMLYHYFKSIHPHQVSFRIKKRKQLQHLWELCKLYENKMDTLASAAMLGQLFRLQKRNNPDYSVELANQIFEHCVKRLSFTIRFATYQEIVPVLFTLARMNVSIVPSDTLLLDPTHRVSREFVHLFLKRAVRNHVHIRVVNPRQMARVLWATAKLFPEDQRMDPRVQDAVDKLARSSVKRLSELHPGSLSIYASAFAKLSPAPTSQEGPLKDVDVSSWDATITGVKSSLLDLDSKELAFVARARTLKVFQGISREILLRVGDLNHEQFTVRNVFHVLGAYIRAQIQDPLVAKVLAENITGRIQDVYAEELIALVRAAERLDGFKNPDLTAAVLRRAREVDLPEETQKDYAKRLQSA 370 T 43 TYA pdbhh F T 8a22 94 RC Ya uS4m-2 (fragment) ADLVRHLQSSGSKLQKLANLTSASCSYRDISVSLFGLQARQLGCTKPFVYTFSASEQAQSKSKPFDGKLRLPDVSQLSDTITVSGDAAPATLNLDQKYMDKISCWTQTIPSHLEMSYQNLTSVKLFPPVDASYPTYVDFEHATRLLDHFTSRKRLNYQRKMIKKRDKFQIKSWDHHAGEA 180 T 23 KN_motif pdbhh F T 8a22 95 SC Yb mS108 HRFRNNKFLRLEPDLDPKVYGQTETLQKQVDDNFSLLLAKHRLDMKAAAA 50 T 6 SMAP pdbhh F T 8a22 97 UC Yd mS110 TTRRRKLLGSRYGARLAKKNRQQFERARVILDCYSDEELRPDSPPVAVKEVTINTLQTMRFLFPQTSKEHLDIKQNIASYKIFSLNRELLSLLPK 95 T 1.3 DUF3135 pdbhh F T 8a22 98 VC Ye uS3m-2 (fragment) GRHLLPATAIRVRLNRGFESAWYTDVSYREMIKKDFLLAKLASSFVNRSSRASLRQIFPGGKDFPNFRTSRIFMQHLPYKSYASTFSYVAPKDGPQAKYGLFQSKL 106 T 20 YebO pdbhh F T 8a22 99 WC Yf mS111 PFNVSDANPKDVEFLQVLLSKFLPDADKATVYRTGQEPRRLRLGDLPATSQFMESFVSEKLPKEPLYDMPSWLANNMPQYDAQPKSPHYHWSSWMRQHLSLDLQRLYAAFAEYMASEPHRLGIVRQANFELARLWDWQHRRVAAGLSPDL 150 T 8.8 DUF2497 pdbhh F T 8a22 100 XC Yg mS112 SADVYKEFFKMARVAVRTMKEPTKSIMKDLQRNARRSENIQRDNNLDKGVYMRFLRQRAGLSVPPIK 67 T 9.2 CCDC53 pdbhh F T 8a22 101 YC Yh mS113 GADGVLRRHNEVRGALRFFLDSWYKNKTSGTIADKNQVMFDYLKYKGVTEIAQHLHAPAPQVFRK 65 T 0.12 Nudix_N pdbpercent F T 8a22 103 AD Yj mS115 EYNGQGYVFSLLQRPPAPTLELLAEYLTVKYQDVIAQRDFVTHILGRMSVLERGGELPAADAAASGTWTGGAKRRLSPQEIRDINGELNRLFDADLNEYVSLAQRLATENVLSPADLATCLQAARSKAQTSSFASLAAPGSSNVDRNILAQVLQGKQDVSALAAAAAAAAASGPEGARVAWDEALQVGKYGAWATKAKAWAADDIAARREKGQQISPEQEAALVCLWDNPLSYDAAAGLWHQYAEKAGAVSAPSLADVISADQAIQAAKAAAAADPASLPAVKATAEKAAQVQEAVKKLYLGFAARQGSTSGAVTVDGVPLPFADVVKANAELDVASPAALAAAFQPLELGELLACHWEAVSRTFMWEDMYQLMLETAKEIEVNGA 386 T 0.15 Ykof pdbpercent F T 8a22 104 BD Yk mS116 GPLPEDVFLVAPKVAAAVQQTQAQLIDLLAPYGYSFDAFSEAVLEDLSKTKELCVKARFVLWEARVLEALEAVRPFVSGPVFRTESEAAALT 92 T 5 POTRA_TamA_1 pdbhh F T 8a22 105 CD Yl uS7m-2 (fragment) TVVLAPSKYDSQLKIPLKPTEMDEFEELRSFVDISIEKEADYVMNKFVGRLIKGGEKATAQQVLLRTLLHTRRLMQEGNITSLK 84 T 0.13 Ribosomal_S7 pdb F T 8a22 106 DD Ua Unknown XXXXXXXXXXXXXXXXXRXXXXXXXXXXXXXX 32 T 6600 zf-C2H2 pdbhh F F 8a22 107 ED Ub mL105 RDEIIKLLESRKDMDVNGYVMYCREELGKLTVPRPRAPPVSPKHEDYKTFVDEERVTYMRMKQHEKISLFLTEEEKNTVTTKGKDILDDKRFIQTIASRTGFYIAEEVRDCLSEFFNFRDSSRRLLTYYAD 131 T 0.025 DUF5863 pdb F T 8a22 108 FD Ud Unknown XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 43 F F F 8a22 109 GD Ue Unknown XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 47 F F F 8a22 110 HD Uf Unknown XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 81 F F F 8a22 111 ID Ug Unknown XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 72 F F F 8a22 112 JD Uh Unknown XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 48 F F F 8a22 113 KD Ui Unknown XXXXXXXXXXXXXXXXXXXXXXXXXX 26 F F F 8a22 114 LD Uj Unknown XXXXXXXXX 9 F F F 8a22 115 MD Uk Unknown XXXXXXXXXXXXXXXXXXXXXXX 23 F F F 8a22 116 ND Ul Unknown XXXXXXXXXXXXXXXX 16 F F F 8a22 117 OD Um Unknown XXXXXXXXXXX 11 F F F 8a3g 1 A,B A,B apCC-Tet* XGQLEEIAQQLEEIAKQLKKIAWQLKKIAQGX 32 T 0.01 WXG100 pdb F T 8a3i 1 A,B A,B apCC-Tet*3 XGQLEEIAKQLQQIAWQLKKIAQGX 25 T 0.68 DUF5320 pdbhh F T 8a3j 1 A,C,E,G A,C,E,G apCC-Tet*3-A XGQLEEIAKQLEEIAWQLEEIAQGX 25 T 0.063 DUF5320 pdbhh F T 8a3j 2 B,D,F,H B,D,F,H apCC-Tet*3-B XGQLKKIAKQLKKIAYQLKKIAQGX 25 T 1.2 DUF5320 pdbhh F T 8a3t 7 J S HSL1_YEAST HSL1 isoform 1 MTGHVSKTSHVPKGRPSSLAKKAAKRAMAKVNSNPKRASGHLERVVQSVNDATKRLSQPDSTVSVATKSSKRKSRDTVGPWKLGKTLGKGSSGRVRLAKNMETGQLAAIKIVPKKKAFVHCSNNGTVPNSYSSSMVTSNVSSPSIASREHSNHSQTNPYGIEREIVIMKLISHTNVMALFEVWENKSELYLVLEYVDGGELFDYLVSKGKLPEREAIHYFKQIVEGVSYCHSFNICHRDLKPENLLLDKKNRRIKIADFGMAALELPNKLLKTSCGSPHYASPEIVMGRPYHGGPSDVWSCGIVLFALLTGHLPFNDDNIKKLLLKVQSGKYQMPSNLSSEARDLISKILVIDPEKRITTQEILKHPLIKKYDDLPVNKVLRKMRKDNMARGKSNSDLHLLNNVSPSIVTLHSKGEIDESILRSLQILWHGVSRELITAKLLQKPMSEEKLFYSLLLQYKQRHSISLSSSSENKKSATESSVNEPRIEYASKTANNTGLRSENNDVKTLHSLEIHSEDTSTVNQNNAITGVNTEINAPVLAQKSQFSINTLSQPESDKAEAEAVTLPPAIPIFNASSSRIFRNSYTSISSRSRRSLRLSNSRLSLSASTSRETVHDNEMPLPQLPKSPSRYSLSRRAIHASPSTKSIHKSLSRKNIAATVAARRTLQNSASKRSLYSLQSISKRSLNLNDLLVFDDPLPSKKPASENVNKSEPHSLESDSDFEILCDQILFGNALDRILEEEEDNEKERDTQRQRQNDTKSSADTFTISGVSTNKENEGPEYPTKIEKNQFNMSYKPSENMSGLSSFPIFEKENTLSSSYLEEQKPKRAALSDITNSFNKMNKQEGMRIEKKIQREQLQKKNDRPSPLKPIQHQELRVNSLPNDQGKPSLSLDPRRNISQPVNSKVESLLQGLKFKKEPASHWTHERGSLFMSEHVEDEKPVKASDVSIESSYVPLTTVATSSRDPSVLAESSTIQKPMLSLPSSFLNTSMTFKNLSQILADDGDDKHLSVPQNQSRSVAMSHPLRKQSAKISLTPRSNLNANLSVKRNQGSPGSYLSNDLDGISDMTFAMEIPTNTFTAQAIQLMNNDTDNNKINTSPKASSFTKEKVIKSAAYISKEKEPDNSDTNYIPDYTIPNTYDEKAINIFEDAPSDEGSLNTSSSESDSRASVHRKAVSIDTMATTNVLTPATNVRVSLYWNNNSSGIPRETTEEILSKLRLSPENPSNTHMQKRFSSTRGSRDSNALGISQSLQSMFKDLEEDQDGHTSQADILESSMSYSKRRPSEESVNPKQRVTMLFDEEEEESKKVGGGKIKEEHTKLDNKISEESSQLVLPVVEKKENANNTENNYSKIPKPSTIKVTKDTAMESNTQTHTKKPILKSVQNVEVEEAPSSDKKNWFVKLFQNFSSHNNATKASKNHVTNISFDDAHMLTLNEFNKNSIDYQLKNLDHKFGRKVVEYDCKFVKGNFKFKIKITSTPNASTVITVKKRSKHSNTSSNKAFEKFNDDVERVIRNAGRS 1518 T 3.1E-06 Pkinase pdbpssm F Eukaryota T 8a44 2 B B A0A1P8P1S7_HUMAN DUFFY ANTIGEN/CHEMOKINE RECEPTOR MGNALHRAELSPSTENSSQLDFEDVWNSSYGVNDSFPDGDYDANLEAAAPAHSANLLDDS 60 T 0.099 DUF4120 pdbpssm F Eukaryota T 8a49 2 B,C C,D ENDOS_STRP1 Secreted endoglycosidase EndoS MIPEKIPMKPLHGPLYGGYFRTWHDKTSDPTEKDKVNSMGELPKEVDLAFIFHDWTKDYSLFWKELATKHVPKLNKQGTRVIRTIPWRFLAGGDNSGIAEDTSKYPNTPEGNKALAKAIVDEYVYKYNLDGLDVAVLHDSIPKVDKKEDTAGVERSIQVFEEIGKLIGPKGVDKSRLFIMDSTYMADKNPLIERGAPYINLLLVQVYGSQGEKGGWEPVSNRPEKTMEERWQGYSKYIRPEQYMIGFSFYEENAQEGNLWYDINSRKDEDKANGINTDITGTRAERYARWQPKTGGVKGGIFSYAIDRDGVAHQPKKYAKQKEFKDATDNIFHSDYSVSKALKTVMLKDKSYDLIDEKDFPDKALREAVMAQVGTRKGDLERFNGTLRLDNPAIQSLEGLNKFKKLAQLDLIGLSRITKLDRSVLPANMKPGKDTLETVLETYKKDNKEEPATIPPVSLKVSGLTGLKELDLSGFDRETLAGLDAATLTSLEKVDISGNKLDLAPGTENRQIFDTMLSTISNHVGSNEQTVKFDKQKPTGHYPDTYGKTSLRLPVANEKVDLQSQLLFGTVTNQGTLINSEADYKAYQNHKIAGRSFVDSNYHYNNFKVSYENYTVKVTDSTLGTTTDKTLATDKEETYKVDFFSPADKTKAVHTAKVIVGDEKTMMVNLAEGATVIGGSADPVNARKVFDGQLGSETDNISLGWDSKQSIIFKLKEDGLIKHWRFFNDSARNPETTNKPIQEASLQIFNIKDYNLDNLLENPNKFDDEKYWITVDTYSAQGERATAFSNTLNNITSKYWRVVFDTKGDRYSSPVVPELQILGYPLPNADTIMKTVTTAKELSQQKDKFSQKMLDELKIKEMALETSLNSKIFDVTAINANAGVLKDCIEKRQLLKKLLEHHHHHH 906 T 0.00021 LRR_4 pdbpssm F Bacteria T 8a4o 1 A D I2G262_USTHO Effector protein Uvi2 MGHHHHHHHSMDITFTADKFARRAEEAAPVAVKPPRNPEFGIFLNNRYLLHNGEGLPKPKDVKETYPECKWRKYGQWAWLDENNVQCYLGPSYKYHAYSPAKNFDPVPSIQRGACADTANPQDFPQGIPRYTISVPYLYFNNFYDRRCKVRALVKVPQTDKEKEHWIQAWVVEHNGGNWSTKSGDLGPNGPQEGIMLDTKLYPKFLNSGDKDIGVLPNKVEWFFLDINTIG 231 T 4 DUF3868 unphh F Eukaryota T 8a50 1 A,B A,B HSF2B_HUMAN Heat shock factor 2-binding protein EFVKVRKKDLERLTTEVMQIRDFLPRILNGEV 32 T 0.021 Exonuc_VII_L unp F Eukaryota T 8a51 1 A A HSF2B_HUMAN Heat shock factor 2-binding protein EFVKVRKKDLERLTTEVMQIRDFLPRILNGEV 32 T 0.021 Exonuc_VII_L unp F Eukaryota T 8a57 13 M H RL3_LISMO 50S ribosomal protein L3 MTKGILGRKVGMTQVFTENGELIPVTVIEAAQNVVLQKKTVETDGYEAVQIGFEDKRAILSNKPEQGHVAKANTTPKRFIREFRDVNLDEYEIGAEVKVDVFAEGDIIDATGVSKGKGFQGVIKRHGQSRGPMAHGSRYHRRPGSMGPVAPNRVFKNKLLPGRMGGEQITIQNLEIVKVDVEKNVLLVKGNVPGAKKALVQIKTATKAK 209 F F Bacteria T 8a5a 5 E X IES4_YEAST Ino eighty subunit 4 MSQESSVLSESQEQLANNPKIEDTSPPSANSRDNSKPVLPWDYKNKAIEIKSFSGYKVNFTGWIRRDVREERQRGSEFTASDVKGSDDKATRKKEPADEDPEVKQLEKEGEDGLDS 116 T 0.29 INO80_Ies4 pdbhh F Eukaryota T 8a5b 2 E,F,G,H E,F,G,H MG-101 XLLX 4 T 1700 EF-hand_1 pdbhh F F 8a5i 13 M H RL3_LISMO 50S ribosomal protein L3 MTKGILGRKVGMTQVFTENGELIPVTVIEAAQNVVLQKKTVETDGYEAVQIGFEDKRAILSNKPEQGHVAKANTTPKRFIREFRDVNLDEYEIGAEVKVDVFAEGDIIDATGVSKGKGFQGVIKRHGQSRGPMAHGSRYHRRPGSMGPVAPNRVFKNKLLPGRMGGEQITIQNLEIVKVDVEKNVLLVKGNVPGAKKALVQIKTATKAK 209 F F Bacteria T 8a5l 2 B B POLG_HE71 2BC peptide TIEALFQ TIEALFQ 7 T 29 BBP1_N pdbhh T Viruses T 8a5m 2 C,D C,E Q80J95_9CALI MNV1-NS6 peptide LEALEFQ LEALEFQ 7 T 9.3 CheZ pdbhh T Viruses F 8a5o 5 E X IES4_YEAST Ino eighty subunit 4 MSQESSVLSESQEQLANNPKIEDTSPPSANSRDNSKPVLPWDYKNKAIEIKSFSGYKVNFTGWIRRDVREERQRGSEFTASDVKGSDDKATRKKEPADEDPEVKQLEKEGEDGLDS 116 T 0.29 INO80_Ies4 pdbhh F Eukaryota T 8a60 2 B B LLP_BPT5 Lytic conversion lipoprotein MKKLFLAMAVVLLSACSTFGPKDIKCEAYYMQDHVKYKANVFDRKGDMFLVSPIMAYGSFWAPVSYFTEGNTCEGVFHHHHHH 83 T 0.00082 LPAM_1 unphh T Viruses T 8a62 2 B B FOXO1_HUMAN FORKHEAD BOX PROTEIN O1A,FORKHEAD IN RHABDOMYOSARCOMA PRSCTWPLPRX 11 T 12 FOXP-CC pdbhh F Eukaryota T 8a63 13 M H RL3_LISMO 50S ribosomal protein L3 MTKGILGRKVGMTQVFTENGELIPVTVIEAAQNVVLQKKTVETDGYEAVQIGFEDKRAILSNKPEQGHVAKANTTPKRFIREFRDVNLDEYEIGAEVKVDVFAEGDIIDATGVSKGKGFQGVIKRHGQSRGPMAHGSRYHRRPGSMGPVAPNRVFKNKLLPGRMGGEQITIQNLEIVKVDVEKNVLLVKGNVPGAKKALVQIKTATKAK 209 F F Bacteria T 8a65 2 B B FOXO1_HUMAN FORKHEAD BOX PROTEIN O1A,FORKHEAD IN RHABDOMYOSARCOMA RSCTWPLP 8 T 8.6 PCSK9_C1 pdbhh F Eukaryota T 8a68 2 B B RAF1_HUMAN PROTO-ONCOGENE C-RAF,CRAF,RAF-1 QRSTSTPNV 9 T 58 NB pdbhh F Eukaryota T 8a6f 2 B P RAF1_HUMAN PROTO-ONCOGENE C-RAF,CRAF,RAF-1 QRSTSTPNVHX 11 T 68 ALC pdbhh F Eukaryota T 8a6h 2 B P RAF1_HUMAN PROTO-ONCOGENE C-RAF,CRAF,RAF-1 QRSTSTPNVHX 11 T 68 ALC pdbhh F Eukaryota T 8a6i 1 A A TADBP_HUMAN TDP-43 GGMNFGAFSINPAMMAAAQAALQSSWGMMGMLASQQNQSGPS 42 T 0.0067 Glucosaminidase pdb F Eukaryota T 8a82 2 B B K7XCU4_SERPL OocQ MSEYLINSGEFNMIVCPADKAYYILNDDRASTETLQEFLDGEKVQYHRLKPLWFKYRADESWQDLNKKEYRLGKELSEAELIDRFVLKAFNFGSLVAVRDSQTGAVKIFKRDKLKMSVR 119 T 0.0091 NOGCT unppssm F Bacteria T 8a8c 2 B B RBP5_BPT5 RBP-PB5,TAIL PROTEIN PB5 MSFFAGKLNNKSILSLRRGSGGDTNQHINPDSQTIFHSDMSHVIITETHSTGLRLDQGAGDYYWSEMPSRVTQLHNNDPNRVVLTEIEFSDGSRHMLSGMSMGVGAKAYGIINPQIMSQGGLKTQITASADLSLDVGYFNTGTSGTIPQKLRDGTGCQHMFGAFSGRRGFASSAMYLGGAALYKSAWSGSGYVVADAGTLTIPSDYVRHPGARNFGFNAIYVRGRSCNRVLYGMEGPNYTTGGAVQGASSSGALNFTYNPSNPESPKYSVGFARADPTNYAYWESMGDPNDSANGPIGIYSEHLGIYPSKITWYVTNLVYNGSGYNIDGGLFNGNDIKLSPREFIIKGVNVNNTSWKFINFIEKNFNVGNRADFRDVGCNLSKDSPSTGISGIATFGLPTTESNNAPSIKGGNVGGLHANVVSIYNFLPSASWYVSSNPPKIGNNYGDVWSENLLPLRLLGGSGSTILSGNIVFQGNGSVHVGTVGLDLNSSRNGAIVCTMEFIDDTWLSAGGIGCFNPTEMLSQGAEYGDSRFRIGGNTINKKLHQILSLPAGEYVPFFTIKGTVVNACKLQAAAYNPTPYWVSGLPGSVGQTGYYTLTYYMRNDGNNNISIWLDSSMSNIIGMKACLPNIKLIIQRLTHHHHHH 646 T 0.18 NPM1-C unppercent T Viruses T 8a8u 2 G G Bound polypeptide XXXXXXXXXXXXXXXXXXXXXXX 23 F F F 8a8v 2 G G Bound polypeptide XXXXXXXXXXXXXXXXXXXXXXX 23 F F F 8a8w 2 G G Bound polypeptide XXXXXXXXXXXXXXXXXXXXXXXXX 25 F F F 8a8x 2 B,D B,D Q80J95_9CALI MNV1-NS3 C term peptide HDDFGLQ 7 T 4 DUF4175 pdbhh T Viruses T 8a9a 1 A,B B,A Y213_MYCPN UNCHARACTERIZED PROTEIN MG075 HOMOLOG NKTHQVEHESEQSDFQDIRFGLNSVKLPKAQPAAATRITVENGTDKLVNYKSSPQQLFLAKNALKDKLQGEFDKFLSDAKAFPALTADLQEWVDQQLFNPNQSFFDLSAPRSNFTLSSDKKASLDFIFRFTNFTESVQLLKLPEGVSVVVDSKQSFDYYVNASAQKLLVLPLSLPDYTLGLNYMFDHITLNGKVVNKFSFNPFKTNLNLAFSNVYNGVDVFEAQKNLVGKGKYLNTHVKAEDVKKDVNANIKNQFDIAKIIAELMGKALKEFGNQQEGQPLSFLKVMDKVKEDFEKLFNLVRPGLGKFVKDLIQSSSQAENKITVYKLIFDNKKTILNLLKELSIPELNSSLGLVDVLFDGITDSDGLYERLQSFKDLIVPAVKTNEKTAALSPLIEELLTQKDTYVFDLIQKHKGILTNLLKNFLADFQKSTPFMADQVAIFTELFDNEGAFDLFGEADFVDKIAELFLTKRTVKNGEKIETKDSLLVTSLKSLLGEKVAALGDLLDSYIFKNELLNRSVEVAKAEAKDTKGATDYKKEQAKALKKLFKHIGENTLSKTNLDKITLKEVKNTENVELEETETTLKVKKLDVEYKVELGNFEIKNGLIKAMLEFLPDTKDLETTLDKLLFKGESYKAMKDKYIKEGFPGYGWAKGVVPGAFESIENTFKSAIDKTKSIRDLFGDMLFGNDLSSVKETDSFITLGGSFDIKYGGENLNVLPAYYSLINSEIGYQIIGVDTTIDATKVKVELKNKEYKGKSPAINGQVKLSQSFFNVWTNMFDSITKQIFQKKYEFKDNIQVFARNEDNTSRLELDISDPEQRVIPFAFVDGFGIQLKAVDKNITKEAGNTEPKSPVIQLYEALNKEKDQKQQSKQSPKQLDTKTQLGYLLKLGDNWSKDDYKSLIDDTIINNNYLEASFNSKITVDRLGIPIDLWLFKIWPKFNLEIPMQGSLQLYSSSVIFPYGIYDTSVQDAAKIVKRLNFTDMGFKLNDPKPNFWFVGFKHHHHH 1007 T 0.0075 IFN-gamma pdbpercent F Bacteria T 8a9b 1 A B Y213_MYCPN UNCHARACTERIZED PROTEIN MG075 HOMOLOG NKTHQVEHESEQSDFQDIRFGLNSVKLPKAQPAAATRITVENGTDKLVNYKSSPQQLFLAKNALKDKLQGEFDKFLSDAKAFPALTADLQEWVDQQLFNPNQSFFDLSAPRSNFTLSSDKKASLDFIFRFTNFTESVQLLKLPEGVSVVVDSKQSFDYYVNASAQKLLVLPLSLPDYTLGLNYMFDHITLNGKVVNKFSFNPFKTNLNLAFSNVYNGVDVFEAQKNLVGKGKYLNTHVKAEDVKKDVNANIKNQFDIAKIIAELMGKALKEFGNQQEGQPLSFLKVMDKVKEDFEKLFNLVRPGLGKFVKDLIQSSSQAENKITVYKLIFDNKKTILNLLKELSIPELNSSLGLVDVLFDGITDSDGLYERLQSFKDLIVPAVKTNEKTAALSPLIEELLTQKDTYVFDLIQKHKGILTNLLKNFLADFQKSTPFMADQVAIFTELFDNEGAFDLFGEADFVDKIAELFLTKRTVKNGEKIETKDSLLVTSLKSLLGEKVAALGDLLDSYIFKNELLNRSVEVAKAEAKDTKGATDYKKEQAKALKKLFKHIGENTLSKTNLDKITLKEVKNTENVELEETETTLKVKKLDVEYKVELGNFEIKNGLIKAMLEFLPDTKDLETTLDKLLFKGESYKAMKDKYIKEGFPGYGWAKGVVPGAFESIENTFKSAIDKTKSIRDLFGDMLFGNDLSSVKETDSFITLGGSFDIKYGGENLNVLPAYYSLINSEIGYQIIGVDTTIDATKVKVELKNKEYKGKSPAINGQVKLSQSFFNVWTNMFDSITKQIFQKKYEFKDNIQVFARNEDNTSRLELDISDPEQRVIPFAFVDGFGIQLKAVDKNITKEAGNTEPKSPVIQLYEALNKEKDQKQQSKQSPKQLDTKTQLGYLLKLGDNWSKDDYKSLIDDTIINNNYLEASFNSKITVDRLGIPIDLWLFKIWPKFNLEIPMQGSLQLYSSSVIFPYGIYDTSVQDAAKIVKRLNFTDMGFKLNDPKPNFWFVGFKHHHHH 1007 T 0.0075 IFN-gamma pdbpercent F Bacteria T 8a9g 2 C,D C,D GCR_HUMAN GR,NUCLEAR RECEPTOR SUBFAMILY 3 GROUP C MEMBER 1 KTIVPATLPQLTP 13 T 5.7 DUF2064 pdbhh F Eukaryota T 8a9l 2 B B Unknown fragment XXXXXXXXX 9 F F F 8a9l 3 C C Unknown fragment XXXXVXX 7 T 3900 zf-CCHC_2 pdbhh F F 8aaa 2 B B Stapled peptide ACMFVPCAVRHALGLCAX 18 T 3.4 DUF22 pdbhh F T 8aac 1 A,AA,AB,AC,AD,AE,AF,B,BA,BB,BC,BD,BE,BF,C,CA,CB,CC,CD,CE,CF,D,DA,DB,DC,DD,DE,DF,E,EA,EB,EC,ED,EE,EF,F,FA,FB,FC,FD,FE,FF,G,GA,GB,GC,GD,GE,GF,H,HA,HB,HC,HD,HE,HF,I,IA,IB,IC,ID,IE,IF,J,JA,JB,JC,JD,JE,JF,K,KA,KB,KC,KD,KE,KF,L,LA,LB,LC,LD,LE,LF,M,MA,MB,MC,MD,ME,MF,N,NA,NB,NC,ND,NE,NF,O,OA,OB,OC,OD,OE,OF,P,PA,PB,PC,PD,PE,PF,Q,QA,QB,QC,QD,QE,QF,R,RA,RB,RC,RD,RE,RF,S,SA,SB,SC,SD,SE,SF,T,TA,TB,TC,TD,TE,TF,U,UA,UB,UC,UD,UE,UF,V,VA,VB,VC,VD,VE,VF,W,WA,WB,WC,WD,WE,WF,X,XA,XB,XC,XD,XE,XF,Y,YA,YB,YC,YD,YE,Z,ZA,ZB,ZC,ZD,ZE 1A,JA,RB,ZC,iA,qB,yC,AC,1B,RC,aA,iB,qC,zA,BA,JB,1C,aB,iC,rA,zB,BB,JC,SA,2A,jA,rB,zC,BC,KA,SB,aC,2B,rC,3A,CA,KB,SC,bA,jB,2C,3B,CB,KC,TA,bB,jC,sA,3C,CC,LA,TB,bC,kA,sB,4A,DA,LB,TC,cA,kB,sC,4B,DB,LC,UA,cB,kC,tA,4C,DC,MA,UB,cC,lA,tB,5A,EA,MB,UC,dA,lB,tC,5B,EB,MC,VA,dB,lC,uA,5C,EC,NA,VB,dC,mA,uB,6A,FA,NB,VC,eA,mB,uC,6B,FB,NC,WA,eB,mC,vA,6C,FC,OA,WB,eC,nA,vB,7A,GA,OB,WC,fA,nB,vC,7B,GB,OC,XA,fB,nC,wA,7C,GC,PA,XB,fC,oA,wB,8A,HA,PB,XC,gA,oB,wC,8B,HB,PC,YA,gB,oC,xA,8C,HC,QA,YB,gC,pA,xB,AA,IA,QB,YC,hA,pB,xC,AB,IB,QC,ZA,hB,pC,yA,IC,RA,ZB,hC,qA,yB A0A3S9H6T3_9VIRU C protein MGTFIELVKNMKGYKELLLPMEMVPLPAVVLKHVKLILTSQKEHQPWMTEMALKADQCLIHKATLDLAGKATSNEAKPLIEAMQQIILAMTRELWGQIQRHHYGIVQVEHYVKQITLWQDTPQAFRGDQPKPPSFRSDGPTRGQGSFRPFFRGRGRGRGRGRGSQSPARKGPLPK 175 T 0.0012 API5 unphh T Viruses T 8aaf 53 BB 1 CAT-tailed nascent peptide XXXXXXXXXXXXXXXXXX 18 F F F 8aca 1 A,B,C A,B,C Q9RWM2_DEIRA DR_0644, only-Cu Superoxide Dismutase MKKLALIALPLVLASCTMAGPTEGTYTLAPQAVVKPAGPVYAPAGTAKISETLGVTRTTITLTGMAPYAIYVAHYHKMGTAAPMGSAPATNTNMAMSSTDATATTTASTSTTSTDTTVAASTDMTTTVTMAPVTAAPNPCNSDGPAIMESRMIAQASADGKVTLTGIVPTALIRDAAYINVHHGRDFSGALADSGVICTPITMTMR 206 T 0.028 LPAM_1 pdbhh F Bacteria T 8ack 2 C,D P,C PCP ERWGHDFIK 9 T 0.19 RPN1_RPN2_N pdbhh F T 8acq 2 D,E,F H,I,L Q9RWM2_DEIRA DR_0644, only-Cu Superoxide Dismutase MKKLALIALPLVLASCTMAGPTEGTYTLAPQAVVKPAGPVYAPAGTAKISETLGVTRTTITLTGMAPYAIYVAHYHKMGTAAPMGSAPATNTNMAMSSTDATATTTASTSTTSTDTTVAASTDMTTTVTMAPVTAAPNPCNSDGPAIMESRMIAQASADGKVTLTGIVPTALIRDAAYINVHHGRDFSGALADSGVICTPITMTMR 206 T 0.028 LPAM_1 pdbhh F Bacteria T 8ad9 2 C,D C,D Cyclomarin A XXAXVXX 7 T 23 CedA pdbhh F F 8ada 1 A,B A,B Y2667_MYCTU Uncharacterized protein Rv2667 MPEPTPTAYPVRLDELINAIKRVHSDVLDQLSDAVLAAEHLGEIADHLIGHFVDQARRSGASWSDIGKSMGV 72 T 0.047 HTH_AsnC-type unppercent F Bacteria T 8adb 1 A A D6YWY5_WADCW Wc-VDT1 KMPEEEQDSLAAFSRIEANITQYDPLLDNAGKSACTCICLKAAEMLLEASPDQVNAGLIDDILVEGVADYNRFKVGGVVEHTSVENYELNTFELKRLEFRDVDNPFSAEGNPYAGTLDSFAKMMEKASDSKDLPKPVALVMTKSNMTITIVIRPDGKYWLFDPHGTNGKGAYIESCNTDELIKKIKEIFPKTSYPGMTEDENLGFNSFEAYAVRR 215 T 0.00012 Herpes_teg_N pdb F Bacteria T 8adc 1 A,B B,A D6YWY5_WADCW Viral deubiquitinating enzyme KMPEEEQDSLAAFSRIEANITQYDPLLDNAGKSACTCICLKAAEMLLEASPDQVNAGLIDDILVEGVADYNRFKVGGVVEHTSVENYELNTFELKRLEFRDVDNPFSAEGNPYAGTLDSFAKMMEKASDSKDLPKPVALVMTKSNMTITIVIRPDGKYWLFDPHGTNGKGAYIESCNTDELIKKIKEIFPKTSYPGMTEDENLGFNSFEAYAVRR 215 T 0.00012 Herpes_teg_N pdb F Bacteria T 8adg 6 F F Darobactin 22 WNXTKRW 7 T 33 RNA_capsid pdbhh F T 8adi 6 F F Darobactin 9 XNWSKSW 7 T 0.76 DUF3309 pdbhh F F 8adm 2 B P UBP8_HUMAN DEUBIQUITINATING ENZYME 8,UBIQUITIN ISOPEPTIDASE Y,HUBPY,UBIQUITIN THIOESTERASE 8,UBIQUITIN-SPECIFIC-PROCESSING PROTEASE 8 RSYSSPDI 8 T 5.8 DUF3912 pdbhh F Eukaryota T 8adn 1 A,B 3,4 Proteasome Inhibitor 31-Like MDFQDYIQSLKKDFKLVKINDHTYILHKNKKTLELTPDKMYNIQEINDILNVSYPDISYDRNLEDLGSKNKGILNGFGNVGEDDLHPQIGRRKGKKKGAIFSPEEFKEEEDSDGIDKTDIFPLKKKRDPDSDHFKKTGGDDDNPFLY 147 T 0.047 Cap4_SAVED pdbpssm F T 8ae5 2 B,D C,D MCP1A_MACPC Macrocypin-1a MGFEDGFYTILHLAEGQHPNSKIPGGMYASSKDGKDVPVTAEPLGPQSKIRWWIARDPQAGDDMYTITEFRIDNSIPGQWSRSPVETEVPVYLYDRIKAEETGYTCAWRIQPADHGADGVYHIVGNVRIGSTDWADLREEYGEPQVYMKPVPVIPNVYIPRWFILGYEELEHHHHHH 177 T 0.04 Inhibitor_I48 unppercent F Eukaryota T 8af9 1 A,B,C,D,E,F,G,H,I,J,K,L A,B,C,D,E,F,G,H,I,J,K,L Q2YPZ9_BRUA2 NyxB, T4SS effector protein from Brucella MNTQATIDTAAVAPLNFDPNAWHHSQMTTLEAIELSRSGGHPYSSPNVPKGFNTVVGFFFDTYDWYPAAYDDEEGNAMKDRELIQYEDWCAKYARTLGLEVKEVEAPAALKVHGIMALKAYPEALLEIRLIEMP 134 T 0.0056 Glyco_hydro_114 pdb F Bacteria T 8afi 2 C,D,E,F,G,H,I,J F,H,B,J,N,P,D,L C9JNW8_HUMAN Ubiquitin-like-conjugating enzyme ATG3 YSDELEAIIEEDDGDGGWVDTYHG 24 T 0.24 SDA1 unppercent F Eukaryota T 8afr 2 B C Pimtide ARKRRRHPSGPPTA 14 T 1.5 DUF3019 pdbhh F T 8afz 3 C C MPRI_HUMAN CI MAN-6-P RECEPTOR,CI-MPR,M6PR,300 KDA MANNOSE 6-PHOSPHATE RECEPTOR,MPR 300,INSULIN-LIKE GROWTH FACTOR 2 RECEPTOR,INSULIN-LIKE GROWTH FACTOR II RECEPTOR,IGF-II RECEPTOR,M6P/IGF2 RECEPTOR,M6P/IGF2R SNVSYKYSKVNKEEETDENETEWLMEEIQLP 31 T 12 TMEM154 unphh F Eukaryota T 8agc 9 I P PEPTIDE YAXATSA 7 T 410 AAA_12 pdbhh F F 8agd 2 D,E,F H,I,L Q9RWM2_DEIRA SOD,DR_2577 MKKLALIALPLVLASCTMAGPTEGTYTLAPQAVVKPAGPVYAPAGTAKISETLGVTRTTITLTGMAPYAIYVAHYHKMGTAAPMGSAPATNTNMAMSSTDATATTTASTSTTSTDTTVAASTDMTTTVTMAPVTAAPNPCNSDGPAIMESRMIAQASADGKVTLTGIVPTALIRDAAYINVHHGRDFSGALADSGVICTPITMTMR 206 T 0.028 LPAM_1 pdbhh F Bacteria T 8age 9 I P PEPTIDE YANATSA 7 T 31 DUF1830 pdbhh F F 8agt 53 AB 1 CAT tailed nascent peptide XXXXXXXXXXXXXXXXXX 18 F F F 8agu 53 AB 1 CAT-tailed nascent peptide XXXXXXXXXXXXXXXXXX 18 F F F 8agv 52 AB 1 CAT-tailed nascent peptide XXXXXXXXXXXXXXXXXX 18 F F F 8agw 51 ZA 1 CAT-tailed nascent peptide XXXXXXXXXXXXXXXXX 17 F F F 8agx 52 AB 1 CAT-tailed nascent peptide XXXXXXXXXXXXXXXXXX 18 F F F 8agz 53 BB 1 CAT-tailed nascent chain XXXXXXXXXXXXXXXXXX 18 F F F 8aif 1 A,B,C A,B,C A0A164X7F2_BACIU YqxM protein required for localization of TasA to extracellular matrix DKRWDQSDLHISDQTDTKGTVCSPFALFAVLENTGEKLKKSKWKWELHKLENARKPLKDGNVIEKGFVSNQIGDSLYKIETKKKMKPGIYAFKVYKPAGYPANGSTFEWSEPMRLAKCDE 120 T 0.002 Herpes_PAP unp F Bacteria T 8ail 2 E,F,G,H,I,J,K,L O,E,F,J,N,C,K,D A0A0N9SK00_9CAUD Bacillus phage VMY22 p56 MEGFKDSYTLIYVTRDEEGKMFDIKLENQTKEECEIIYGMITDEILIWNMILEGMF 56 T 0.011 DUF5406 unppercent T Viruses T 8aiw 2 B B O25273_HELPY Cag pathogenicity island protein (Cag19) MKCFLSIFSFLTFCGLSLNGTEVVITLEPALKAIQADAQAKQKTAQAELKAIEAQSSAKEKAIQAQIEGELRTQLATMSAMLKGANGVINGVNGMTGGFFAGSDILLGVMEGYSSALSALGGNVKMIVEKQKINTQTEIQNMQIALQKNNEIIKLKMNQQNALLEALKNSFEPSVTLKTQMEMLSQALGSSSDNAQYIAYNTIGIKAFEETLKGFETWLKVAMQKATLIDYNSLTGQALFQSAIYAPALSFFSSMGAPFGIIETFTLAPTKCPYLDGLKISACLMEQVIQNYRMIVALIQNKLSDADFQNIAYLNGINGEIKTLKGSVDLNALIEVAILNAENHLNYIENLEKKADLWEEQLKLERETTARNIASSKVIVK 381 T 0.15 Bin3 pdbpssm F Bacteria T 8aj8 2 B,D,F,H B,D,F,H PI3R6_MOUSE PHOSPHOINOSITIDE 3-KINASE GAMMA ADAPTER PROTEIN OF 87 KDA,P84 PI3K ADAPTER PROTEIN,P84 PIKAP,P87 PI3K ADAPTER PROTEIN,P87PIKAP MESSDVELDFQRSVQAVLRELNTPNPALQSNQGMWRWSLHKKVERNPGKSSILVRILLRELEKAESEDGRRVIIPLLLTLMSVLTKATGIPEDLYHRAYTFCTRLLTLPAPYSTVALDCAIRLKTETAVPGTLYQRTVIAEQNLISELYPYQERVFLFVDPELVSASVCSALLLEIQAAQEQQTPEACMRHVVSHALQAALGEACHTGALNRKLQASSRRVLEYYFHAVVAAIEQVASEDSPSRLGHLEKMEEIYCSLLGPATTRRHCVGDLLQDRLPSIPLPSPYITFHLWTDQEQLWKELVLFLRPRSQLRLSADLDALDLQGFRLDRDLARVSTDSGIERDLPLGSDELPDPSSSEMERAALQRKGGIKKRVWPPDFFMPGSWDGPPGLHRRTGRPSGDGELLPGVSRVHTARVLVLGDDRMLGRLAQAYYRLRKRETKKFCLTPRLSLQLYYIPVLAPQVTGQDPEASRKPELGELASFLGRVDPWYESTVNTLCPAILKLAEMPPYLDTSRTVDPFILDVITYYVRMGTQPIYFQLYKVKIFTSLSHDPTEDIFLTELKVKIQDSKSPKEGSSPRRRGAAEGTGAELSMCYQKALLSHRPREVTVSLRATGLVLKAIPAGDTEVSGFFHCTSPNAASATDCSCLHVSVTEVVKSSNLAGRSFTTSTNTFRTSSIQVQSQDQRLLTLWLDKDGRRTFRDVVRFEVSPCPEPCSRTQKSKTSALNSHGQETEKNMAKPNSLLMPINTFSGIIQ 756 T 1.3999999999999997E-73 PI3K_1B_p101 pdbpssm F Eukaryota T 8ajm 2 B B DCA12_HUMAN CENTROSOME-RELATED PROTEIN TCC52,TESTIS CANCER CENTROSOME-RELATED PROTEIN,WD REPEAT-CONTAINING PROTEIN 40A MDWSHPQFEKSAVDENLYFQGGGRMARKVVSRKRKAPASPGAGSDAQGPQFGWDHSLHKRKRLPPVKRSLVYYLKNREVRLQNETSYSRVLHGYAAQQLPSLLKEREFHLGTLNKVFASQWLNHRQVVCGTKCNTLFVVDVQTSQITKIPILKDREPGGVTQQGCGIHAIELNPSRTLLATGGDNPNSLAIYRLPTLDPVCVGDDGHKDWIFSIAWISDTMAVSGSRDGSMGLWEVTDDVLTKSDARHNVSRVPVYAHITHKALKDIPKEDTNPDNCKVRALAFNNKNKELGAVSLDGYFHLWKAENTLSKLLSTKLPYCRENVCLAYGSEWSVYAVGSQAHVSFLDPRQPSYNVKSVCSRERGSGIRSVSFYEHIITVGTGQGSLLFYDIRAQRFLEERLSACYGSKPRLAGENLKLTTGKGWLNHDETWRNYFSDIDFFPNAVYTHCYDSSGTKLFVAGGPLPSGLHGNYAGLWS 477 T 0.0099 WD40_like unppssm F Eukaryota T 8ajn 2 B B DCA12_HUMAN CENTROSOME-RELATED PROTEIN TCC52,TESTIS CANCER CENTROSOME-RELATED PROTEIN,WD REPEAT-CONTAINING PROTEIN 40A MDWSHPQFEKSAVDENLYFQGGGRMARKVVSRKRKAPASPGAGSDAQGPQFGWDHSLHKRKRLPPVKRSLVYYLKNREVRLQNETSYSRVLHGYAAQQLPSLLKEREFHLGTLNKVFASQWLNHRQVVCGTKCNTLFVVDVQTSQITKIPILKDREPGGVTQQGCGIHAIELNPSRTLLATGGDNPNSLAIYRLPTLDPVCVGDDGHKDWIFSIAWISDTMAVSGSRDGSMGLWEVTDDVLTKSDARHNVSRVPVYAHITHKALKDIPKEDTNPDNCKVRALAFNNKNKELGAVSLDGYFHLWKAENTLSKLLSTKLPYCRENVCLAYGSEWSVYAVGSQAHVSFLDPRQPSYNVKSVCSRERGSGIRSVSFYEHIITVGTGQGSLLFYDIRAQRFLEERLSACYGSKPRLAGENLKLTTGKGWLNHDETWRNYFSDIDFFPNAVYTHCYDSSGTKLFVAGGPLPSGLHGNYAGLWS 477 T 0.0099 WD40_like unppssm F Eukaryota T 8ajo 2 B B DCA12_HUMAN CENTROSOME-RELATED PROTEIN TCC52,TESTIS CANCER CENTROSOME-RELATED PROTEIN,WD REPEAT-CONTAINING PROTEIN 40A MDWSHPQFEKSAVDENLYFQGGGRMARKVVSRKRKAPASPGAGSDAQGPQFGWDHSLHKRKRLPPVKRSLVYYLKNREVRLQNETSYSRVLHGYAAQQLPSLLKEREFHLGTLNKVFASQWLNHRQVVCGTKCNTLFVVDVQTSQITKIPILKDREPGGVTQQGCGIHAIELNPSRTLLATGGDNPNSLAIYRLPTLDPVCVGDDGHKDWIFSIAWISDTMAVSGSRDGSMGLWEVTDDVLTKSDARHNVSRVPVYAHITHKALKDIPKEDTNPDNCKVRALAFNNKNKELGAVSLDGYFHLWKAENTLSKLLSTKLPYCRENVCLAYGSEWSVYAVGSQAHVSFLDPRQPSYNVKSVCSRERGSGIRSVSFYEHIITVGTGQGSLLFYDIRAQRFLEERLSACYGSKPRLAGENLKLTTGKGWLNHDETWRNYFSDIDFFPNAVYTHCYDSSGTKLFVAGGPLPSGLHGNYAGLWS 477 T 0.0099 WD40_like unppssm F Eukaryota T 8ajy 1 A,C A,C A0AEF6_RUMFL Cell-wall anchoring protein MLTDRGMTYDLDPKDGSSAATKPVLEVTKKVFDTAADAAGQTVTVEFKVSGAEGKYATTGYHIYWDERLEVVATKTGAYAKKGAALEDSSLAKAENNGNGVFVASGADDDFGADGVMWTVELKVPADAKAGDVYPIDVAYQWDPSKGDLFTDNKDSAQGKLMQAYFFTQGIKSSSNPSTDEYLVKANATYADGYIAIKAGEPE 203 T 0.00027 Cohesin pdb F Bacteria T 8ak1 2 B B O25273_HELPY Cag pathogenicity island protein (Cag19) MKCFLSIFSFLTFCGLSLNGTEVVITLEPALKAIQADAQAKQKTAQAELKAIEAQSSAKEKAIQAQIEGELRTQLATMSAMLKGANGVINGVNGMTGGFFAGSDILLGVMEGYSSALSALGGNVKMIVEKQKINTQTEIQNMQIALQKNNEIIKLKMNQQNALLEALKNSFEPSVTLKTQMEMLSQALGSSSDNAQYIAYNTIGIKAFEETLKGFETWLKVAMQKATLIDYNSLTGQALFQSAIYAPALSFFSSMGAPFGIIETFTLAPTKCPYLDGLKISACLMEQVIQNYRMIVALIQNKLSDADFQNIAYLNGINGEIKTLKGSVDLNALIEVAILNAENHLNYIENLEKKADLWEEQLKLERETTARNIASSKVIVK 381 T 0.15 Bin3 pdbpssm F Bacteria T 8akn 6 F A DROS_DROME Drosocin1 GKPRPYSPRPTSHPRPIRV 19 T 0.0059 DIM unppercent F Eukaryota T 8ako 2 B B ESPK_MYCTU ESX-1 secretion-associated protein EspK GDALRLARRIAAALNASDNNAGDYGFFWITAVTTDGSIVVANSYGLAYIPDGMELPNKVYLASADHAIPVDEIARCATYPVLAVQAWAAFHDMTLRAVIGTAEQLASSDPGVAKIVLEPDDIPESGKMTGRSRLEVVDPSAAAQLADTTDQRLLDLLPPAPVDVNPPGDERHMLWFELMKPMTSTATGREAAHLRAFRAYAAHSQEIALHQAHTATDAAVQRVAVADWLYWQYVTGLLDRALAAAC 246 T 0.0019 DUF5632 pdbpssm F Bacteria T 8akp 1 A A catalytic domain of G7048 AVSKGFNYGATKADGSSKYQADFKKDFAAAKALVEGGSGFTSARLYTMIQGGTTNTPIEAIPAAIEEKTELLLGLWASGGNMDNEIAALKSAISQYGDDFANLVVGISVGSEDMYRNSVTGSKSNAGPGVEPEELVSYIQQVRSTIAGTGLSDASIGHVDTWDSWTNSSNSDVVNHLDWLGFDGYPYYQLTMENGIENAKKLFDESVEKTKSVANGKEVWITETGWPVTGPQEGDATASPANAKTYWDEVGCPLFGNTNTWWYMLEDEGASPSFGVVKSDLKTPQFDLSC 290 T 0.0016 Glyco_hydro_17 pdb F T 8am9 56 DB A DROS_DROME Drosocin1 GKPRPYSPRPTSHPRPIRV 19 T 0.0059 DIM unppercent F Eukaryota T 8amo 1 A A CP143_MYCTU Putative cytochrome P450 143 MHHHHHHTTPGEDHAGSFYLPRLEYSTLPMAVDRGVGWKTLRDAGPVVFMNGWYYLTRREDVLAALRNPKVFSSRKALQPPGNPLPVVPLAFDPPEHTRYRRILQPYFSPAALSKALPSLRRHTVAMIDAIAGRGECEAMADLANLFPFQLFLVLYGLPLEDRDRLIGWKDAVIAMSDRPHPTEADVAAARELLEYLTAMVAERRRNPGPDVLSQVQIGEDPLSEIEVLGLSHLLILAGLDTVTAAVGFSLLELARRPQLRAMLRDNPKQIRVFIEEIVRLEPSAPVAPRVTTEPVTVGGMTLPAGSPVRLCMAAVNRDGSDAMSTDELVMDGKVHRHWGFGGGPHRCLGSHLARLELTLLVGEWLNQIPDFELAPDYAPEIRFPSKSFALKNLPLRWS 399 T 5.8E-32 p450 unppercent F Bacteria T 8an9 2 E,F,G E,F,G Mixed-chirality peptide FHP5 XKLLKLLKLLXX 12 T 16 Sec16 pdbhh F F 8ana 6 F A DROS_DROME Drosocin1 GKPRPYSPRPTSHPRPIRV 19 T 0.0059 DIM unppercent F Eukaryota T 8ano 2 E,F,G E,H,I Mixed-chirality fucosylated peptide FHP8 KXXXKLLKLLLX 12 T 16 Sec16 pdbhh F F 8anr 2 C,D C,D Fucosylated mixed-chirality linear peptide FHP30 KXLXKXLXLXLX 12 T 16 Sec16 pdbhh F F 8aoo 2 E,F,G,H E,F,H,I Fucosylated mixed-chirality peptide FHP31 XKXLXLXKXLXX 12 F F F 8aop 1 A,B,C A,B,C A4TVL0_9PROT CULT DOMAIN-CONTAINING PROTEIN MPLDAGGQNSTQMVLAPGASIFRCRQCGQTISRRDWLLPMGGDHEHVVFNPAGMIFRVWCFSLAQGLRLIGAPSGEFSWFKGYDWTIALCGQCGSHLGWHYEGGSQPQTFFGLIKDRLAEGPAD 124 F F Bacteria T 8aoq 1 A,B,C A,B,C A4TVL0_9PROT Cereblon isoform 4 MPLDAGGQNSTQMVLAPGASIFRCRQCGQTISRRDWLLPMGGDHEHVVFNPAGMIFRVWCFSLAQGLRLIGAPSGEFSWFKGYDWTIALCGQCGSHLGWHYEGGSQPQTFFGLIKDRLAEGPAD 124 F F Bacteria T 8ap6 1 A,LB A,a ATP6_TRYBB F-ATPASE PROTEIN 6 MFLFFFCDLFWLRLLLCMYYCVWSRLCFIVYFNCLMLIFDFLLFCLFDLYLFVGLCLFLLLWFMLFNLYSLILYYCITYLNLYLLFCIVFLLYIAFLFLFCFLCDFFLFNNLLVGDSFMDVFFIRFLLCFLECFSLLCRCLSTFLRLFCNLLSSHFLLLMFFDFFYFIFVFFFYGVFCYWFILFIFVFCFCLLFYVFLYLLDLFAAILQLFIFCNMILQLIMDFLLFLLFV 231 T 0.43 ATP-synt_A unppssm F Eukaryota T 8ap6 3 F,MB C,c Q585K5_TRYB2 subunit-8 MLRRLGANVSNMARPMNKYAVTVSPRRHLEPMSTWYLASWAMVWYYAFFFWMPMVWTDIMVPSFVYNKLPVIHFLQEKRAEQKLRRVLDETYTEWTEELDQAHVTDAITRSLNI 114 T 10 bVLRF1 pdbhh F Eukaryota T 8ap6 4 I,NB D,d Q57ZW9_TRYB2 subunit-d MRRVSSPNITIQSVRWISGVSPLLYFPPTTTSTTNREDQINKNTNIAIQMIKRYKGEVPPHYTRKSSATIEQVEKEIDALLGGAEKLRKTSTDDQPMDKLTLMERCLRHALWSYHKEEGRYDFDQIGRWVVYTPEDEVKLAQLKREVEAKEKLAALRKRREEEGLPGGPVPRINWPQEYSSFIDREPVVAKRIRYDTLASTTLERDEKQIESTLQQYRRASQDKRLDDLVDLLERFKPVLAREAIMQRLTIKHLEGQLGVWRYMDWCPEVRDRAELEVDITGWQWWSPLEERRLLPVRLRSVNEVREIMSKTQAKKSAEAAERNPIVTQTSTGDNARDRLLKEVLALQARINQRDEVEPSQTEQKKKAHH 370 T 0.0093 SAM_Exu pdbpercent F Eukaryota T 8ap6 6 L,OB E,e Q38CI8_TRYB2 ATPTB1 XQGSWSVLKKNCSNFFPGLLAFAQQTQEAYGIWLRIYNRQQKYGPTDFVEQSETFSPDYHKRFHSQDKNMWVDKELCTEVSQKEVARLMTYKLDMWRMAHCAGALLATGGYAIPFGLFWLANDTWVPSSFNLTGEELRAWREAQDLYRYRSAPSYLTDTKWHFDFHAYPWNETQERAWDDLFEKNDVRRDPKVVRPAAEMYDGFIKFELIRRKSLRHLCRSMNIPTFPMLARLCNGTRVRDYWNLAWCEDYMVITQRLHESMTDEELYDYAWRRYLAPYDKNLNREQLMERVEDYFEFLGPDFVAHGKAPNLVILTNYVLGYYNDPAYLEGDISELDKNDYDHLASWGKDAFLRRLEFENGPLRDQVEAHTQRLLAERAAIAKGDNAAAVEGRHTA 396 F F Eukaryota T 8ap6 7 O,PB F,f Q57ZE2_TRYB2 subunit-f MVLFSTYRSSRLVSKEFLHGPVMRFRALGEYYFQRAWNGTLNWALPGEYRLYAVMIPFIYFYHRWHNDHTLDRDHVEKAMIMRWGGTLEDVRKLSAKDQLRVRCFTDIEKLYSAYGPKDTYLQPPGDTLPGKDFYRKAGGAQAHH 145 T 0.39 DUF3094 pdbhh F Eukaryota T 8ap6 8 QB,R g,G A0A3L6KRX7_9TRYP ATPTB3 MSKQLTFISAGATAAVLQSASAIVSKVAGGRVQTKTAKEAGRHAVVVGPETPIGVHTAVTEVPKSAQDPLFSGVSTVVVRAVLPRAAPDSVQLRDALDVYASAGIDTKEEVRSATEAFKKSAEVAVGKAKAKGVKRIVLVVKQASKHNCINELFKKISTETIESAGLTTEVVGTAAVANQLIVNPESLGVVLLNDVAATEQIELAFAGVVGGVSRVYHTVEGGKISAGHSFKSVALAVAQELRELGLSSEADKVEAAASKNPRAVVSAL 269 F F Eukaryota T 8ap6 10 RB,U h,H Q389Z3_TRYB2 ATPTB4 MRRTFISFSAASAAAAAPVTSTKMQTLHKLLTGEVSFKNKAPVKDCNIVHQFGENWATELSAYAKTLPAEQQKIIVRQIARVKLTRYTVAELAAYCGDGPALLDETARAANIEQGVAFVKAKGVEAFEKYVAEESTNANWKPEEAKKFIEDVKAKAK 157 T 0.042 TAT_signal pdb F Eukaryota T 8ap6 12 SB,X i,I Q57ZM4_TRYB2 subunit-i/j MVYTRWKCDRLPVFQLKLFTQEYPMHAAVGIFTIIFLWKHMSHCSEETERKYGWWAGYPYWRDPIARRNETKYKQMIINNDVDITHPKWTGCSVEQLEELSRVV 104 T 0.18 FPN1 pdb F Eukaryota T 8ap6 14 AA,TB J,j D0A5R7_TRYB9 ATPTB6 MTKYELKMQYFDEWMIRWRKFQTESDWEIEKGRQWWRRFNMAVSGALFCGLVLYTSGTATLKRQYGLPHFFDIGVDGQAKETMLKTLTSRWRYTPQGYGRVLITGVPTYILFVTLEHYRERRRMQQYLQQNTVFGEQMRRLLSTGKIEEYLPVNIKATLPASQQAIYNY 169 T 0.14 GT87 pdbpssm F Eukaryota T 8ap6 15 BA,CA,EA,FA,HA,IA J1,J2,K1,K2,L1,L2 ATP18_TRYBB ATP SYNTHASE F1 SUBUNIT P18 MMRRVYSPVFCSVAAARFAATSAAKKYDLFGYEVDTNTAPWIEKIKKCKYYDEAGEVLVNMNVSNCPPDIATYNATLQCIYQSPSKQSTPVDNESKFCAMMDLLEEMQHRNRLKPNEESWTWVMKECVKSGQFRLGYCIQQVMETECKGCPADLVKANEANAQKAKTEGKEHPGHLSQQAGLFDVKVE 188 T 1.1E-05 RPM2 pdbhh F Eukaryota T 8ap6 16 DA,UB K,k Q57VT0_TRYB2 subunit-k MLRRSSAALIRRTPVRHSGGELFVRPKLEEIPPADQCRGFFGPLNDSLKFLRLLDIKWMMNRAVAMRREYLIATPTLFTFIWMFTWKGAVIYFWGDRAPPRRMDWNTEETGRLPLGFKPTPAPL 124 T 0.28 DUF1048 pdbpercent F Eukaryota T 8ap6 17 GA,VB L,l Q387J1_TRYB2 subunit-e MSAKAAPKTLHQVRNVAYFFAAWLGVQKGYIEKSANDRLWVEHQRKVRQQNVERQQALDSIKLMQQGVRATTPGQLEGVPAELQQLAEAFTK 92 T 0.051 DUF3708 pdb F Eukaryota T 8ap6 18 JA,WB M,m C9ZJA0_TRYB9 subunit-g MSSTKCAVACKIMTPLCNAASKVQARSAKKLAALTDAGIQKTISEHNANGTDAAVSSTKRYLAEQRQLFHYRVVRFFDECHYIISGEYFAQYTKVNLIWDLRFLTKLVVLFLIGTVLGRQSIFPPIDPDSPLVEALVTKVNPNY 144 T 0.076 CbtA pdbpercent F Eukaryota T 8ap6 20 MA,XB N,n Q582T1_TRYB2 ATPTB11 MLRKTPLFAMATTRKALVGNGPTFSTGGECMNTCDIQNAFPMNDRGVRSSSPFQEPNTAIYDSYLAWTYFQPMDVHIEKLPAPEAKYYQRHTKKPWDVSSTELTEIQSRKKYFQTLGYLVAFIYLYFLMPKEKSFSGLSGPDGHWIMLPKGRPELF 156 T 0.091 FCP1_C pdb F Eukaryota T 8ap6 21 NA,YB O,o Q57Z84_TRYB2 ATPTB12 MSSGFHFHDVSNDAIKGMPPSEALHKHLENAQLAHRICLAKALKAGEPPVEKCALTWGEVLIRYQAWSEYRPPFQDSVAQAKYKKYWSKKRQEEDDKNPFK 101 T 6.5 DUF1539 pdbhh F Eukaryota T 8ap6 23 QA,ZB P,p C9ZLR9_TRYB9 subunit-b MLRRLVPRVMMAPMGGATALCTSRGYNMLVFRDPKRRPQLSEEERAKVVVNQAEWPEEFKDFDPDDPYKNSPEIIKGMSSWNLFLWGVECAFIYQFYELVFPKSI 105 T 0.25 Mus7 unp F Eukaryota T 8ap6 24 AC,TA q,Q Q583U4_TRYB2 ATPEG3 MTENIEAVMSDFWSNPADHFRPNLKALTLYAERQHYVDRWLHVKERWLAPWYLPWWSPLFQLGTWYSQRSRNLFLVENHLSYRPYKFRRNDEDRNNPY 98 T 32 CbiN pdbhh F Eukaryota T 8ap6 25 BC,WA r,R ATPEG4 MLLGGFVPRRFSQFNRDPCWMFFIFSVGFWLGEYPAMMIKYNARDLVYDPHRYVWSHHDDHH 62 T 0.029 DUF5357 pdb F T 8ap7 1 A,P A,a ATP6_TRYBB F-ATPASE PROTEIN 6 MFLFFFCDLFWLRLLLCMYYCVWSRLCFIVYFNCLMLIFDFLLFCLFDLYLFVGLCLFLLLWFMLFNLYSLILYYCITYLNLYLLFCIVFLLYIAFLFLFCFLCDFFLFNNLLVGDSFMDVFFIRFLLCFLECFSLLCRCLSTFLRLFCNLLSSHFLLLMFFDFFYFIFVFFFYGVFCYWFILFIFVFCFCLLFYVFLYLLDLFAAILQLFIFCNMILQLIMDFLLFLLFV 231 T 0.43 ATP-synt_A unppssm F Eukaryota T 8ap7 2 B,Q C,c Q585K5_TRYB2 subunit-8 MLRRLGANVSNMARPMNKYAVTVSPRRHLEPMSTWYLASWAMVWYYAFFFWMPMVWTDIMVPSFVYNKLPVIHFLQEKRAEQKLRRVLDETYTEWTEELDQAHVTDAITRSLNI 114 T 10 bVLRF1 pdbhh F Eukaryota T 8ap7 3 C,R D,d Q57ZW9_TRYB2 subunit-d MRRVSSPNITIQSVRWISGVSPLLYFPPTTTSTTNREDQINKNTNIAIQMIKRYKGEVPPHYTRKSSATIEQVEKEIDALLGGAEKLRKTSTDDQPMDKLTLMERCLRHALWSYHKEEGRYDFDQIGRWVVYTPEDEVKLAQLKREVEAKEKLAALRKRREEEGLPGGPVPRINWPQEYSSFIDREPVVAKRIRYDTLASTTLERDEKQIESTLQQYRRASQDKRLDDLVDLLERFKPVLAREAIMQRLTIKHLEGQLGVWRYMDWCPEVRDRAELEVDITGWQWWSPLEERRLLPVRLRSVNEVREIMSKTQAKKSAEAAERNPIVTQTSTGDNARDRLLKEVLALQARINQRDEVEPSQTEQKKKAHH 370 T 0.0093 SAM_Exu pdbpercent F Eukaryota T 8ap7 5 E,T F,f Q57ZE2_TRYB2 subunit-f MVLFSTYRSSRLVSKEFLHGPVMRFRALGEYYFQRAWNGTLNWALPGEYRLYAVMIPFIYFYHRWHNDHTLDRDHVEKAMIMRWGGTLEDVRKLSAKDQLRVRCFTDIEKLYSAYGPKDTYLQPPGDTLPGKDFYRKAGGAQAHH 145 T 0.39 DUF3094 pdbhh F Eukaryota T 8ap7 6 F,U I,i Q57ZM4_TRYB2 subunit-i/j MVYTRWKCDRLPVFQLKLFTQEYPMHAAVGIFTIIFLWKHMSHCSEETERKYGWWAGYPYWRDPIARRNETKYKQMIINNDVDITHPKWTGCSVEQLEELSRVV 104 T 0.18 FPN1 pdb F Eukaryota T 8ap7 7 G,V J,j D0A5R7_TRYB9 ATPTB6 MTKYELKMQYFDEWMIRWRKFQTESDWEIEKGRQWWRRFNMAVSGALFCGLVLYTSGTATLKRQYGLPHFFDIGVDGQAKETMLKTLTSRWRYTPQGYGRVLITGVPTYILFVTLEHYRERRRMQQYLQQNTVFGEQMRRLLSTGKIEEYLPVNIKATLPASQQAIYNY 169 T 0.14 GT87 pdbpssm F Eukaryota T 8ap7 8 H,W K,k Q57VT0_TRYB2 subunit-k MLRRSSAALIRRTPVRHSGGELFVRPKLEEIPPADQCRGFFGPLNDSLKFLRLLDIKWMMNRAVAMRREYLIATPTLFTFIWMFTWKGAVIYFWGDRAPPRRMDWNTEETGRLPLGFKPTPAPL 124 T 0.28 DUF1048 pdbpercent F Eukaryota T 8ap7 9 I,X L,l Q387J1_TRYB2 subunit-e MSAKAAPKTLHQVRNVAYFFAAWLGVQKGYIEKSANDRLWVEHQRKVRQQNVERQQALDSIKLMQQGVRATTPGQLEGVPAELQQLAEAFTK 92 T 0.051 DUF3708 pdb F Eukaryota T 8ap7 10 J,Y M,m C9ZJA0_TRYB9 subunit-g MSSTKCAVACKIMTPLCNAASKVQARSAKKLAALTDAGIQKTISEHNANGTDAAVSSTKRYLAEQRQLFHYRVVRFFDECHYIISGEYFAQYTKVNLIWDLRFLTKLVVLFLIGTVLGRQSIFPPIDPDSPLVEALVTKVNPNY 144 T 0.076 CbtA pdbpercent F Eukaryota T 8ap7 11 K,Z N,n Q582T1_TRYB2 ATPTB11 MLRKTPLFAMATTRKALVGNGPTFSTGGECMNTCDIQNAFPMNDRGVRSSSPFQEPNTAIYDSYLAWTYFQPMDVHIEKLPAPEAKYYQRHTKKPWDVSSTELTEIQSRKKYFQTLGYLVAFIYLYFLMPKEKSFSGLSGPDGHWIMLPKGRPELF 156 T 0.091 FCP1_C pdb F Eukaryota T 8ap7 12 AA,L o,O Q57Z84_TRYB2 ATPTB12 MSSGFHFHDVSNDAIKGMPPSEALHKHLENAQLAHRICLAKALKAGEPPVEKCALTWGEVLIRYQAWSEYRPPFQDSVAQAKYKKYWSKKRQEEDDKNPFK 101 T 6.5 DUF1539 pdbhh F Eukaryota T 8ap7 13 BA,M p,P C9ZLR9_TRYB9 subunit-b MLRRLVPRVMMAPMGGATALCTSRGYNMLVFRDPKRRPQLSEEERAKVVVNQAEWPEEFKDFDPDDPYKNSPEIIKGMSSWNLFLWGVECAFIYQFYELVFPKSI 105 T 0.25 Mus7 unp F Eukaryota T 8ap7 14 CA,N q,Q Q583U4_TRYB2 ATPEG3 MTENIEAVMSDFWSNPADHFRPNLKALTLYAERQHYVDRWLHVKERWLAPWYLPWWSPLFQLGTWYSQRSRNLFLVENHLSYRPYKFRRNDEDRNNPY 98 T 32 CbiN pdbhh F Eukaryota T 8ap7 15 DA,O r,R ATPEG4 MLLGGFVPRRFSQFNRDPCWMFFIFSVGFWLGEYPAMMIKYNARDLVYDPHRYVWSHHDDHH 62 T 0.029 DUF5357 pdb F T 8ap8 2 B h Q389Z3_TRYB2 ATPTB4 MRRTFISFSAASAAAAAPVTSTKMQTLHKLLTGEVSFKNKAPVKDCNIVHQFGENWATELSAYAKTLPAEQQKIIVRQIARVKLTRYTVAELAAYCGDGPALLDETARAANIEQGVAFVKAKGVEAFEKYVAEESTNANWKPEEAKKFIEDVKAKAK 157 T 0.042 TAT_signal pdb F Eukaryota T 8ap8 4 D c Q585K5_TRYB2 subunit-8 MLRRLGANVSNMARPMNKYAVTVSPRRHLEPMSTWYLASWAMVWYYAFFFWMPMVWTDIMVPSFVYNKLPVIHFLQEKRAEQKLRRVLDETYTEWTEELDQAHVTDAITRSLNI 114 T 10 bVLRF1 pdbhh F Eukaryota T 8ap8 5 E d Q57ZW9_TRYB2 subunit-d MRRVSSPNITIQSVRWISGVSPLLYFPPTTTSTTNREDQINKNTNIAIQMIKRYKGEVPPHYTRKSSATIEQVEKEIDALLGGAEKLRKTSTDDQPMDKLTLMERCLRHALWSYHKEEGRYDFDQIGRWVVYTPEDEVKLAQLKREVEAKEKLAALRKRREEEGLPGGPVPRINWPQEYSSFIDREPVVAKRIRYDTLASTTLERDEKQIESTLQQYRRASQDKRLDDLVDLLERFKPVLAREAIMQRLTIKHLEGQLGVWRYMDWCPEVRDRAELEVDITGWQWWSPLEERRLLPVRLRSVNEVREIMSKTQAKKSAEAAERNPIVTQTSTGDNARDRLLKEVLALQARINQRDEVEPSQTEQKKKAHH 370 T 0.0093 SAM_Exu pdbpercent F Eukaryota T 8apa 6 J,K,M J1,K1,L1 ATP18_TRYBB ATP SYNTHASE F1 SUBUNIT P18 MMRRVYSPVFCSVAAARFAATSAAKKYDLFGYEVDTNTAPWIEKIKKCKYYDEAGEVLVNMNVSNCPPDIATYNATLQCIYQSPSKQSTPVDNESKFCAMMDLLEEMQHRNRLKPNEESWTWVMKECVKSGQFRLGYCIQQVMETECKGCPADLVKANEANAQKAKTEGKEHPGHLSQQAGLFDVKVE 188 T 1.1E-05 RPM2 pdbhh F Eukaryota T 8apa 7 JA,L l,L Q387J1_TRYB2 subunit-e MSAKAAPKTLHQVRNVAYFFAAWLGVQKGYIEKSANDRLWVEHQRKVRQQNVERQQALDSIKLMQQGVRATTPGQLEGVPAELQQLAEAFTK 92 T 0.051 DUF3708 pdb F Eukaryota T 8apa 8 KA,N m,M C9ZJA0_TRYB9 subunit-g MSSTKCAVACKIMTPLCNAASKVQARSAKKLAALTDAGIQKTISEHNANGTDAAVSSTKRYLAEQRQLFHYRVVRFFDECHYIISGEYFAQYTKVNLIWDLRFLTKLVVLFLIGTVLGRQSIFPPIDPDSPLVEALVTKVNPNY 144 T 0.076 CbtA pdbpercent F Eukaryota T 8apa 11 Z a ATP6_TRYBB F-ATPASE PROTEIN 6 MFLFFFCDLFWLRLLLCMYYCVWSRLCFIVYFNCLMLIFDFLLFCLFDLYLFVGLCLFLLLWFMLFNLYSLILYYCITYLNLYLLFCIVFLLYIAFLFLFCFLCDFFLFNNLLVGDSFMDVFFIRFLLCFLECFSLLCRCLSTFLRLFCNLLSSHFLLLMFFDFFYFIFVFFFYGVFCYWFILFIFVFCFCLLFYVFLYLLDLFAAILQLFIFCNMILQLIMDFLLFLLFV 231 T 0.43 ATP-synt_A unppssm F Eukaryota T 8apa 12 AA c Q585K5_TRYB2 subunit-8 MLRRLGANVSNMARPMNKYAVTVSPRRHLEPMSTWYLASWAMVWYYAFFFWMPMVWTDIMVPSFVYNKLPVIHFLQEKRAEQKLRRVLDETYTEWTEELDQAHVTDAITRSLNI 114 T 10 bVLRF1 pdbhh F Eukaryota T 8apa 13 BA d Q57ZW9_TRYB2 subunit-d MRRVSSPNITIQSVRWISGVSPLLYFPPTTTSTTNREDQINKNTNIAIQMIKRYKGEVPPHYTRKSSATIEQVEKEIDALLGGAEKLRKTSTDDQPMDKLTLMERCLRHALWSYHKEEGRYDFDQIGRWVVYTPEDEVKLAQLKREVEAKEKLAALRKRREEEGLPGGPVPRINWPQEYSSFIDREPVVAKRIRYDTLASTTLERDEKQIESTLQQYRRASQDKRLDDLVDLLERFKPVLAREAIMQRLTIKHLEGQLGVWRYMDWCPEVRDRAELEVDITGWQWWSPLEERRLLPVRLRSVNEVREIMSKTQAKKSAEAAERNPIVTQTSTGDNARDRLLKEVLALQARINQRDEVEPSQTEQKKKAHH 370 T 0.0093 SAM_Exu pdbpercent F Eukaryota T 8apa 15 DA f Q57ZE2_TRYB2 subunit-f MVLFSTYRSSRLVSKEFLHGPVMRFRALGEYYFQRAWNGTLNWALPGEYRLYAVMIPFIYFYHRWHNDHTLDRDHVEKAMIMRWGGTLEDVRKLSAKDQLRVRCFTDIEKLYSAYGPKDTYLQPPGDTLPGKDFYRKAGGAQAHH 145 T 0.39 DUF3094 pdbhh F Eukaryota T 8apa 17 FA h Q389Z3_TRYB2 ATPTB4 MRRTFISFSAASAAAAAPVTSTKMQTLHKLLTGEVSFKNKAPVKDCNIVHQFGENWATELSAYAKTLPAEQQKIIVRQIARVKLTRYTVAELAAYCGDGPALLDETARAANIEQGVAFVKAKGVEAFEKYVAEESTNANWKPEEAKKFIEDVKAKAK 157 T 0.042 TAT_signal pdb F Eukaryota T 8apa 18 GA i Q57ZM4_TRYB2 subunit-i/j MVYTRWKCDRLPVFQLKLFTQEYPMHAAVGIFTIIFLWKHMSHCSEETERKYGWWAGYPYWRDPIARRNETKYKQMIINNDVDITHPKWTGCSVEQLEELSRVV 104 T 0.18 FPN1 pdb F Eukaryota T 8apa 19 HA j D0A5R7_TRYB9 ATPTB6 MTKYELKMQYFDEWMIRWRKFQTESDWEIEKGRQWWRRFNMAVSGALFCGLVLYTSGTATLKRQYGLPHFFDIGVDGQAKETMLKTLTSRWRYTPQGYGRVLITGVPTYILFVTLEHYRERRRMQQYLQQNTVFGEQMRRLLSTGKIEEYLPVNIKATLPASQQAIYNY 169 T 0.14 GT87 pdbpssm F Eukaryota T 8apa 20 IA k Q57VT0_TRYB2 subunit-k MLRRSSAALIRRTPVRHSGGELFVRPKLEEIPPADQCRGFFGPLNDSLKFLRLLDIKWMMNRAVAMRREYLIATPTLFTFIWMFTWKGAVIYFWGDRAPPRRMDWNTEETGRLPLGFKPTPAPL 124 T 0.28 DUF1048 pdbpercent F Eukaryota T 8apa 21 LA n Q582T1_TRYB2 ATPTB11 MLRKTPLFAMATTRKALVGNGPTFSTGGECMNTCDIQNAFPMNDRGVRSSSPFQEPNTAIYDSYLAWTYFQPMDVHIEKLPAPEAKYYQRHTKKPWDVSSTELTEIQSRKKYFQTLGYLVAFIYLYFLMPKEKSFSGLSGPDGHWIMLPKGRPELF 156 T 0.091 FCP1_C pdb F Eukaryota T 8apa 22 MA o Q57Z84_TRYB2 ATPTB12 MSSGFHFHDVSNDAIKGMPPSEALHKHLENAQLAHRICLAKALKAGEPPVEKCALTWGEVLIRYQAWSEYRPPFQDSVAQAKYKKYWSKKRQEEDDKNPFK 101 T 6.5 DUF1539 pdbhh F Eukaryota T 8apa 23 NA p C9ZLR9_TRYB9 subunit-b MLRRLVPRVMMAPMGGATALCTSRGYNMLVFRDPKRRPQLSEEERAKVVVNQAEWPEEFKDFDPDDPYKNSPEIIKGMSSWNLFLWGVECAFIYQFYELVFPKSI 105 T 0.25 Mus7 unp F Eukaryota T 8apa 24 OA q Q583U4_TRYB2 ATPEG3 MTENIEAVMSDFWSNPADHFRPNLKALTLYAERQHYVDRWLHVKERWLAPWYLPWWSPLFQLGTWYSQRSRNLFLVENHLSYRPYKFRRNDEDRNNPY 98 T 32 CbiN pdbhh F Eukaryota T 8apa 25 PA r ATPEG4 MLLGGFVPRRFSQFNRDPCWMFFIFSVGFWLGEYPAMMIKYNARDLVYDPHRYVWSHHDDHH 62 T 0.029 DUF5357 pdb F T 8apb 6 J,K,M J1,K1,L1 ATP18_TRYBB ATP SYNTHASE F1 SUBUNIT P18 MMRRVYSPVFCSVAAARFAATSAAKKYDLFGYEVDTNTAPWIEKIKKCKYYDEAGEVLVNMNVSNCPPDIATYNATLQCIYQSPSKQSTPVDNESKFCAMMDLLEEMQHRNRLKPNEESWTWVMKECVKSGQFRLGYCIQQVMETECKGCPADLVKANEANAQKAKTEGKEHPGHLSQQAGLFDVKVE 188 T 1.1E-05 RPM2 pdbhh F Eukaryota T 8apb 7 JA,L l,L Q387J1_TRYB2 subunit-e MSAKAAPKTLHQVRNVAYFFAAWLGVQKGYIEKSANDRLWVEHQRKVRQQNVERQQALDSIKLMQQGVRATTPGQLEGVPAELQQLAEAFTK 92 T 0.051 DUF3708 pdb F Eukaryota T 8apb 8 KA,N m,M C9ZJA0_TRYB9 subunit-g MSSTKCAVACKIMTPLCNAASKVQARSAKKLAALTDAGIQKTISEHNANGTDAAVSSTKRYLAEQRQLFHYRVVRFFDECHYIISGEYFAQYTKVNLIWDLRFLTKLVVLFLIGTVLGRQSIFPPIDPDSPLVEALVTKVNPNY 144 T 0.076 CbtA pdbpercent F Eukaryota T 8apb 11 Z a ATP6_TRYBB F-ATPASE PROTEIN 6 MFLFFFCDLFWLRLLLCMYYCVWSRLCFIVYFNCLMLIFDFLLFCLFDLYLFVGLCLFLLLWFMLFNLYSLILYYCITYLNLYLLFCIVFLLYIAFLFLFCFLCDFFLFNNLLVGDSFMDVFFIRFLLCFLECFSLLCRCLSTFLRLFCNLLSSHFLLLMFFDFFYFIFVFFFYGVFCYWFILFIFVFCFCLLFYVFLYLLDLFAAILQLFIFCNMILQLIMDFLLFLLFV 231 T 0.43 ATP-synt_A unppssm F Eukaryota T 8apb 12 AA c Q585K5_TRYB2 subunit-8 MLRRLGANVSNMARPMNKYAVTVSPRRHLEPMSTWYLASWAMVWYYAFFFWMPMVWTDIMVPSFVYNKLPVIHFLQEKRAEQKLRRVLDETYTEWTEELDQAHVTDAITRSLNI 114 T 10 bVLRF1 pdbhh F Eukaryota T 8apb 13 BA d Q57ZW9_TRYB2 subunit-d MRRVSSPNITIQSVRWISGVSPLLYFPPTTTSTTNREDQINKNTNIAIQMIKRYKGEVPPHYTRKSSATIEQVEKEIDALLGGAEKLRKTSTDDQPMDKLTLMERCLRHALWSYHKEEGRYDFDQIGRWVVYTPEDEVKLAQLKREVEAKEKLAALRKRREEEGLPGGPVPRINWPQEYSSFIDREPVVAKRIRYDTLASTTLERDEKQIESTLQQYRRASQDKRLDDLVDLLERFKPVLAREAIMQRLTIKHLEGQLGVWRYMDWCPEVRDRAELEVDITGWQWWSPLEERRLLPVRLRSVNEVREIMSKTQAKKSAEAAERNPIVTQTSTGDNARDRLLKEVLALQARINQRDEVEPSQTEQKKKAHH 370 T 0.0093 SAM_Exu pdbpercent F Eukaryota T 8apb 15 DA f Q57ZE2_TRYB2 subunit-f MVLFSTYRSSRLVSKEFLHGPVMRFRALGEYYFQRAWNGTLNWALPGEYRLYAVMIPFIYFYHRWHNDHTLDRDHVEKAMIMRWGGTLEDVRKLSAKDQLRVRCFTDIEKLYSAYGPKDTYLQPPGDTLPGKDFYRKAGGAQAHH 145 T 0.39 DUF3094 pdbhh F Eukaryota T 8apb 17 FA h Q389Z3_TRYB2 ATPTB4 MRRTFISFSAASAAAAAPVTSTKMQTLHKLLTGEVSFKNKAPVKDCNIVHQFGENWATELSAYAKTLPAEQQKIIVRQIARVKLTRYTVAELAAYCGDGPALLDETARAANIEQGVAFVKAKGVEAFEKYVAEESTNANWKPEEAKKFIEDVKAKAK 157 T 0.042 TAT_signal pdb F Eukaryota T 8apb 18 GA i Q57ZM4_TRYB2 subunit i/j MVYTRWKCDRLPVFQLKLFTQEYPMHAAVGIFTIIFLWKHMSHCSEETERKYGWWAGYPYWRDPIARRNETKYKQMIINNDVDITHPKWTGCSVEQLEELSRVV 104 T 0.18 FPN1 pdb F Eukaryota T 8apb 19 HA j D0A5R7_TRYB9 ATPTB6 MTKYELKMQYFDEWMIRWRKFQTESDWEIEKGRQWWRRFNMAVSGALFCGLVLYTSGTATLKRQYGLPHFFDIGVDGQAKETMLKTLTSRWRYTPQGYGRVLITGVPTYILFVTLEHYRERRRMQQYLQQNTVFGEQMRRLLSTGKIEEYLPVNIKATLPASQQAIYNY 169 T 0.14 GT87 pdbpssm F Eukaryota T 8apb 20 IA k Q57VT0_TRYB2 subunit-k MLRRSSAALIRRTPVRHSGGELFVRPKLEEIPPADQCRGFFGPLNDSLKFLRLLDIKWMMNRAVAMRREYLIATPTLFTFIWMFTWKGAVIYFWGDRAPPRRMDWNTEETGRLPLGFKPTPAPL 124 T 0.28 DUF1048 pdbpercent F Eukaryota T 8apb 21 LA n Q582T1_TRYB2 ATPTB11 MLRKTPLFAMATTRKALVGNGPTFSTGGECMNTCDIQNAFPMNDRGVRSSSPFQEPNTAIYDSYLAWTYFQPMDVHIEKLPAPEAKYYQRHTKKPWDVSSTELTEIQSRKKYFQTLGYLVAFIYLYFLMPKEKSFSGLSGPDGHWIMLPKGRPELF 156 T 0.091 FCP1_C pdb F Eukaryota T 8apb 22 MA o Q57Z84_TRYB2 ATPTB12 MSSGFHFHDVSNDAIKGMPPSEALHKHLENAQLAHRICLAKALKAGEPPVEKCALTWGEVLIRYQAWSEYRPPFQDSVAQAKYKKYWSKKRQEEDDKNPFK 101 T 6.5 DUF1539 pdbhh F Eukaryota T 8apb 23 NA p C9ZLR9_TRYB9 ATPTB14 MLRRLVPRVMMAPMGGATALCTSRGYNMLVFRDPKRRPQLSEEERAKVVVNQAEWPEEFKDFDPDDPYKNSPEIIKGMSSWNLFLWGVECAFIYQFYELVFPKSI 105 T 0.25 Mus7 unp F Eukaryota T 8apb 24 OA q Q583U4_TRYB2 ATPEG3 MTENIEAVMSDFWSNPADHFRPNLKALTLYAERQHYVDRWLHVKERWLAPWYLPWWSPLFQLGTWYSQRSRNLFLVENHLSYRPYKFRRNDEDRNNPY 98 T 32 CbiN pdbhh F Eukaryota T 8apb 25 PA r ATPEG4 MLLGGFVPRRFSQFNRDPCWMFFIFSVGFWLGEYPAMMIKYNARDLVYDPHRYVWSHHDDHH 62 T 0.029 DUF5357 pdb F T 8apc 6 J,K,M J1,K1,L1 ATP18_TRYBB ATP SYNTHASE F1 SUBUNIT P18 MMRRVYSPVFCSVAAARFAATSAAKKYDLFGYEVDTNTAPWIEKIKKCKYYDEAGEVLVNMNVSNCPPDIATYNATLQCIYQSPSKQSTPVDNESKFCAMMDLLEEMQHRNRLKPNEESWTWVMKECVKSGQFRLGYCIQQVMETECKGCPADLVKANEANAQKAKTEGKEHPGHLSQQAGLFDVKVE 188 T 1.1E-05 RPM2 pdbhh F Eukaryota T 8apc 7 JA,L l,L Q387J1_TRYB2 subunit-e MSAKAAPKTLHQVRNVAYFFAAWLGVQKGYIEKSANDRLWVEHQRKVRQQNVERQQALDSIKLMQQGVRATTPGQLEGVPAELQQLAEAFTK 92 T 0.051 DUF3708 pdb F Eukaryota T 8apc 8 KA,N m,M C9ZJA0_TRYB9 subunit-g MSSTKCAVACKIMTPLCNAASKVQARSAKKLAALTDAGIQKTISEHNANGTDAAVSSTKRYLAEQRQLFHYRVVRFFDECHYIISGEYFAQYTKVNLIWDLRFLTKLVVLFLIGTVLGRQSIFPPIDPDSPLVEALVTKVNPNY 144 T 0.076 CbtA pdbpercent F Eukaryota T 8apc 11 Z a ATP6_TRYBB F-ATPASE PROTEIN 6 MFLFFFCDLFWLRLLLCMYYCVWSRLCFIVYFNCLMLIFDFLLFCLFDLYLFVGLCLFLLLWFMLFNLYSLILYYCITYLNLYLLFCIVFLLYIAFLFLFCFLCDFFLFNNLLVGDSFMDVFFIRFLLCFLECFSLLCRCLSTFLRLFCNLLSSHFLLLMFFDFFYFIFVFFFYGVFCYWFILFIFVFCFCLLFYVFLYLLDLFAAILQLFIFCNMILQLIMDFLLFLLFV 231 T 0.43 ATP-synt_A unppssm F Eukaryota T 8apc 12 AA c Q585K5_TRYB2 subunit-8 MLRRLGANVSNMARPMNKYAVTVSPRRHLEPMSTWYLASWAMVWYYAFFFWMPMVWTDIMVPSFVYNKLPVIHFLQEKRAEQKLRRVLDETYTEWTEELDQAHVTDAITRSLNI 114 T 10 bVLRF1 pdbhh F Eukaryota T 8apc 13 BA d Q57ZW9_TRYB2 subunit-d MRRVSSPNITIQSVRWISGVSPLLYFPPTTTSTTNREDQINKNTNIAIQMIKRYKGEVPPHYTRKSSATIEQVEKEIDALLGGAEKLRKTSTDDQPMDKLTLMERCLRHALWSYHKEEGRYDFDQIGRWVVYTPEDEVKLAQLKREVEAKEKLAALRKRREEEGLPGGPVPRINWPQEYSSFIDREPVVAKRIRYDTLASTTLERDEKQIESTLQQYRRASQDKRLDDLVDLLERFKPVLAREAIMQRLTIKHLEGQLGVWRYMDWCPEVRDRAELEVDITGWQWWSPLEERRLLPVRLRSVNEVREIMSKTQAKKSAEAAERNPIVTQTSTGDNARDRLLKEVLALQARINQRDEVEPSQTEQKKKAHH 370 T 0.0093 SAM_Exu pdbpercent F Eukaryota T 8apc 15 DA f Q57ZE2_TRYB2 subunit-f MVLFSTYRSSRLVSKEFLHGPVMRFRALGEYYFQRAWNGTLNWALPGEYRLYAVMIPFIYFYHRWHNDHTLDRDHVEKAMIMRWGGTLEDVRKLSAKDQLRVRCFTDIEKLYSAYGPKDTYLQPPGDTLPGKDFYRKAGGAQAHH 145 T 0.39 DUF3094 pdbhh F Eukaryota T 8apc 17 FA h Q389Z3_TRYB2 ATPTB4 MRRTFISFSAASAAAAAPVTSTKMQTLHKLLTGEVSFKNKAPVKDCNIVHQFGENWATELSAYAKTLPAEQQKIIVRQIARVKLTRYTVAELAAYCGDGPALLDETARAANIEQGVAFVKAKGVEAFEKYVAEESTNANWKPEEAKKFIEDVKAKAK 157 T 0.042 TAT_signal pdb F Eukaryota T 8apc 18 GA i Q57ZM4_TRYB2 subunit-i/j MVYTRWKCDRLPVFQLKLFTQEYPMHAAVGIFTIIFLWKHMSHCSEETERKYGWWAGYPYWRDPIARRNETKYKQMIINNDVDITHPKWTGCSVEQLEELSRVV 104 T 0.18 FPN1 pdb F Eukaryota T 8apc 19 HA j D0A5R7_TRYB9 ATPTB6 MTKYELKMQYFDEWMIRWRKFQTESDWEIEKGRQWWRRFNMAVSGALFCGLVLYTSGTATLKRQYGLPHFFDIGVDGQAKETMLKTLTSRWRYTPQGYGRVLITGVPTYILFVTLEHYRERRRMQQYLQQNTVFGEQMRRLLSTGKIEEYLPVNIKATLPASQQAIYNY 169 T 0.14 GT87 pdbpssm F Eukaryota T 8apc 20 IA k Q57VT0_TRYB2 subunit-k MLRRSSAALIRRTPVRHSGGELFVRPKLEEIPPADQCRGFFGPLNDSLKFLRLLDIKWMMNRAVAMRREYLIATPTLFTFIWMFTWKGAVIYFWGDRAPPRRMDWNTEETGRLPLGFKPTPAPL 124 T 0.28 DUF1048 pdbpercent F Eukaryota T 8apc 21 LA n Q582T1_TRYB2 ATPTB11 MLRKTPLFAMATTRKALVGNGPTFSTGGECMNTCDIQNAFPMNDRGVRSSSPFQEPNTAIYDSYLAWTYFQPMDVHIEKLPAPEAKYYQRHTKKPWDVSSTELTEIQSRKKYFQTLGYLVAFIYLYFLMPKEKSFSGLSGPDGHWIMLPKGRPELF 156 T 0.091 FCP1_C pdb F Eukaryota T 8apc 22 MA o Q57Z84_TRYB2 ATPTB12 MSSGFHFHDVSNDAIKGMPPSEALHKHLENAQLAHRICLAKALKAGEPPVEKCALTWGEVLIRYQAWSEYRPPFQDSVAQAKYKKYWSKKRQEEDDKNPFK 101 T 6.5 DUF1539 pdbhh F Eukaryota T 8apc 23 NA p C9ZLR9_TRYB9 subunit-b MLRRLVPRVMMAPMGGATALCTSRGYNMLVFRDPKRRPQLSEEERAKVVVNQAEWPEEFKDFDPDDPYKNSPEIIKGMSSWNLFLWGVECAFIYQFYELVFPKSI 105 T 0.25 Mus7 unp F Eukaryota T 8apc 24 OA q Q583U4_TRYB2 ATPEG3 MTENIEAVMSDFWSNPADHFRPNLKALTLYAERQHYVDRWLHVKERWLAPWYLPWWSPLFQLGTWYSQRSRNLFLVENHLSYRPYKFRRNDEDRNNPY 98 T 32 CbiN pdbhh F Eukaryota T 8apc 25 PA r ATPEG4 MLLGGFVPRRFSQFNRDPCWMFFIFSVGFWLGEYPAMMIKYNARDLVYDPHRYVWSHHDDHH 62 T 0.029 DUF5357 pdb F T 8apd 1 A,M L,l Q387J1_TRYB2 subunit-e MSAKAAPKTLHQVRNVAYFFAAWLGVQKGYIEKSANDRLWVEHQRKVRQQNVERQQALDSIKLMQQGVRATTPGQLEGVPAELQQLAEAFTK 92 T 0.051 DUF3708 pdb F Eukaryota T 8apd 2 B,N M,m C9ZJA0_TRYB9 subunit-g MSSTKCAVACKIMTPLCNAASKVQARSAKKLAALTDAGIQKTISEHNANGTDAAVSSTKRYLAEQRQLFHYRVVRFFDECHYIISGEYFAQYTKVNLIWDLRFLTKLVVLFLIGTVLGRQSIFPPIDPDSPLVEALVTKVNPNY 144 T 0.076 CbtA pdbpercent F Eukaryota T 8apd 3 C a ATP6_TRYBB F-ATPASE PROTEIN 6 MFLFFFCDLFWLRLLLCMYYCVWSRLCFIVYFNCLMLIFDFLLFCLFDLYLFVGLCLFLLLWFMLFNLYSLILYYCITYLNLYLLFCIVFLLYIAFLFLFCFLCDFFLFNNLLVGDSFMDVFFIRFLLCFLECFSLLCRCLSTFLRLFCNLLSSHFLLLMFFDFFYFIFVFFFYGVFCYWFILFIFVFCFCLLFYVFLYLLDLFAAILQLFIFCNMILQLIMDFLLFLLFV 231 T 0.43 ATP-synt_A unppssm F Eukaryota T 8apd 4 D c Q585K5_TRYB2 subunit-8 MLRRLGANVSNMARPMNKYAVTVSPRRHLEPMSTWYLASWAMVWYYAFFFWMPMVWTDIMVPSFVYNKLPVIHFLQEKRAEQKLRRVLDETYTEWTEELDQAHVTDAITRSLNI 114 T 10 bVLRF1 pdbhh F Eukaryota T 8apd 5 E d Q57ZW9_TRYB2 subunit-d MRRVSSPNITIQSVRWISGVSPLLYFPPTTTSTTNREDQINKNTNIAIQMIKRYKGEVPPHYTRKSSATIEQVEKEIDALLGGAEKLRKTSTDDQPMDKLTLMERCLRHALWSYHKEEGRYDFDQIGRWVVYTPEDEVKLAQLKREVEAKEKLAALRKRREEEGLPGGPVPRINWPQEYSSFIDREPVVAKRIRYDTLASTTLERDEKQIESTLQQYRRASQDKRLDDLVDLLERFKPVLAREAIMQRLTIKHLEGQLGVWRYMDWCPEVRDRAELEVDITGWQWWSPLEERRLLPVRLRSVNEVREIMSKTQAKKSAEAAERNPIVTQTSTGDNARDRLLKEVLALQARINQRDEVEPSQTEQKKKAHH 370 T 0.0093 SAM_Exu pdbpercent F Eukaryota T 8apd 7 G f Q57ZE2_TRYB2 subunit-f MVLFSTYRSSRLVSKEFLHGPVMRFRALGEYYFQRAWNGTLNWALPGEYRLYAVMIPFIYFYHRWHNDHTLDRDHVEKAMIMRWGGTLEDVRKLSAKDQLRVRCFTDIEKLYSAYGPKDTYLQPPGDTLPGKDFYRKAGGAQAHH 145 T 0.39 DUF3094 pdbhh F Eukaryota T 8apd 9 I h Q389Z3_TRYB2 ATPTB4 MRRTFISFSAASAAAAAPVTSTKMQTLHKLLTGEVSFKNKAPVKDCNIVHQFGENWATELSAYAKTLPAEQQKIIVRQIARVKLTRYTVAELAAYCGDGPALLDETARAANIEQGVAFVKAKGVEAFEKYVAEESTNANWKPEEAKKFIEDVKAKAK 157 T 0.042 TAT_signal pdb F Eukaryota T 8apd 10 J i Q57ZM4_TRYB2 subunit-i/j MVYTRWKCDRLPVFQLKLFTQEYPMHAAVGIFTIIFLWKHMSHCSEETERKYGWWAGYPYWRDPIARRNETKYKQMIINNDVDITHPKWTGCSVEQLEELSRVV 104 T 0.18 FPN1 pdb F Eukaryota T 8apd 11 K j D0A5R7_TRYB9 ATPTB6 MTKYELKMQYFDEWMIRWRKFQTESDWEIEKGRQWWRRFNMAVSGALFCGLVLYTSGTATLKRQYGLPHFFDIGVDGQAKETMLKTLTSRWRYTPQGYGRVLITGVPTYILFVTLEHYRERRRMQQYLQQNTVFGEQMRRLLSTGKIEEYLPVNIKATLPASQQAIYNY 169 T 0.14 GT87 pdbpssm F Eukaryota T 8apd 12 L k Q57VT0_TRYB2 subunit-k MLRRSSAALIRRTPVRHSGGELFVRPKLEEIPPADQCRGFFGPLNDSLKFLRLLDIKWMMNRAVAMRREYLIATPTLFTFIWMFTWKGAVIYFWGDRAPPRRMDWNTEETGRLPLGFKPTPAPL 124 T 0.28 DUF1048 pdbpercent F Eukaryota T 8apd 13 O n Q582T1_TRYB2 ATPTB11 MLRKTPLFAMATTRKALVGNGPTFSTGGECMNTCDIQNAFPMNDRGVRSSSPFQEPNTAIYDSYLAWTYFQPMDVHIEKLPAPEAKYYQRHTKKPWDVSSTELTEIQSRKKYFQTLGYLVAFIYLYFLMPKEKSFSGLSGPDGHWIMLPKGRPELF 156 T 0.091 FCP1_C pdb F Eukaryota T 8apd 14 P o Q57Z84_TRYB2 ATPTB12 MSSGFHFHDVSNDAIKGMPPSEALHKHLENAQLAHRICLAKALKAGEPPVEKCALTWGEVLIRYQAWSEYRPPFQDSVAQAKYKKYWSKKRQEEDDKNPFK 101 T 6.5 DUF1539 pdbhh F Eukaryota T 8apd 15 Q p C9ZLR9_TRYB9 subunit-b MLRRLVPRVMMAPMGGATALCTSRGYNMLVFRDPKRRPQLSEEERAKVVVNQAEWPEEFKDFDPDDPYKNSPEIIKGMSSWNLFLWGVECAFIYQFYELVFPKSI 105 T 0.25 Mus7 unp F Eukaryota T 8apd 16 R q Q583U4_TRYB2 ATPEG3 MTENIEAVMSDFWSNPADHFRPNLKALTLYAERQHYVDRWLHVKERWLAPWYLPWWSPLFQLGTWYSQRSRNLFLVENHLSYRPYKFRRNDEDRNNPY 98 T 32 CbiN pdbhh F Eukaryota T 8apd 17 S r ATPEG4 MLLGGFVPRRFSQFNRDPCWMFFIFSVGFWLGEYPAMMIKYNARDLVYDPHRYVWSHHDDHH 62 T 0.029 DUF5357 pdb F T 8apd 23 CA,DA,EA J1,K1,L1 ATP18_TRYBB ATP SYNTHASE F1 SUBUNIT P18 MMRRVYSPVFCSVAAARFAATSAAKKYDLFGYEVDTNTAPWIEKIKKCKYYDEAGEVLVNMNVSNCPPDIATYNATLQCIYQSPSKQSTPVDNESKFCAMMDLLEEMQHRNRLKPNEESWTWVMKECVKSGQFRLGYCIQQVMETECKGCPADLVKANEANAQKAKTEGKEHPGHLSQQAGLFDVKVE 188 T 1.1E-05 RPM2 pdbhh F Eukaryota T 8ape 6 J,K,M J1,K1,L1 ATP18_TRYBB ATP SYNTHASE F1 SUBUNIT P18 MMRRVYSPVFCSVAAARFAATSAAKKYDLFGYEVDTNTAPWIEKIKKCKYYDEAGEVLVNMNVSNCPPDIATYNATLQCIYQSPSKQSTPVDNESKFCAMMDLLEEMQHRNRLKPNEESWTWVMKECVKSGQFRLGYCIQQVMETECKGCPADLVKANEANAQKAKTEGKEHPGHLSQQAGLFDVKVE 188 T 1.1E-05 RPM2 pdbhh F Eukaryota T 8ape 7 JA,L l,L Q387J1_TRYB2 subunit-e MSAKAAPKTLHQVRNVAYFFAAWLGVQKGYIEKSANDRLWVEHQRKVRQQNVERQQALDSIKLMQQGVRATTPGQLEGVPAELQQLAEAFTK 92 T 0.051 DUF3708 pdb F Eukaryota T 8ape 8 KA,N m,M C9ZJA0_TRYB9 subunit-g MSSTKCAVACKIMTPLCNAASKVQARSAKKLAALTDAGIQKTISEHNANGTDAAVSSTKRYLAEQRQLFHYRVVRFFDECHYIISGEYFAQYTKVNLIWDLRFLTKLVVLFLIGTVLGRQSIFPPIDPDSPLVEALVTKVNPNY 144 T 0.076 CbtA pdbpercent F Eukaryota T 8ape 11 Z a ATP6_TRYBB F-ATPASE PROTEIN 6 MFLFFFCDLFWLRLLLCMYYCVWSRLCFIVYFNCLMLIFDFLLFCLFDLYLFVGLCLFLLLWFMLFNLYSLILYYCITYLNLYLLFCIVFLLYIAFLFLFCFLCDFFLFNNLLVGDSFMDVFFIRFLLCFLECFSLLCRCLSTFLRLFCNLLSSHFLLLMFFDFFYFIFVFFFYGVFCYWFILFIFVFCFCLLFYVFLYLLDLFAAILQLFIFCNMILQLIMDFLLFLLFV 231 T 0.43 ATP-synt_A unppssm F Eukaryota T 8ape 12 AA c Q585K5_TRYB2 subunit-8 MLRRLGANVSNMARPMNKYAVTVSPRRHLEPMSTWYLASWAMVWYYAFFFWMPMVWTDIMVPSFVYNKLPVIHFLQEKRAEQKLRRVLDETYTEWTEELDQAHVTDAITRSLNI 114 T 10 bVLRF1 pdbhh F Eukaryota T 8ape 13 BA d Q57ZW9_TRYB2 subunit-d MRRVSSPNITIQSVRWISGVSPLLYFPPTTTSTTNREDQINKNTNIAIQMIKRYKGEVPPHYTRKSSATIEQVEKEIDALLGGAEKLRKTSTDDQPMDKLTLMERCLRHALWSYHKEEGRYDFDQIGRWVVYTPEDEVKLAQLKREVEAKEKLAALRKRREEEGLPGGPVPRINWPQEYSSFIDREPVVAKRIRYDTLASTTLERDEKQIESTLQQYRRASQDKRLDDLVDLLERFKPVLAREAIMQRLTIKHLEGQLGVWRYMDWCPEVRDRAELEVDITGWQWWSPLEERRLLPVRLRSVNEVREIMSKTQAKKSAEAAERNPIVTQTSTGDNARDRLLKEVLALQARINQRDEVEPSQTEQKKKAHH 370 T 0.0093 SAM_Exu pdbpercent F Eukaryota T 8ape 15 DA f Q57ZE2_TRYB2 subunit-f MVLFSTYRSSRLVSKEFLHGPVMRFRALGEYYFQRAWNGTLNWALPGEYRLYAVMIPFIYFYHRWHNDHTLDRDHVEKAMIMRWGGTLEDVRKLSAKDQLRVRCFTDIEKLYSAYGPKDTYLQPPGDTLPGKDFYRKAGGAQAHH 145 T 0.39 DUF3094 pdbhh F Eukaryota T 8ape 17 FA h Q389Z3_TRYB2 ATPTB4 MRRTFISFSAASAAAAAPVTSTKMQTLHKLLTGEVSFKNKAPVKDCNIVHQFGENWATELSAYAKTLPAEQQKIIVRQIARVKLTRYTVAELAAYCGDGPALLDETARAANIEQGVAFVKAKGVEAFEKYVAEESTNANWKPEEAKKFIEDVKAKAK 157 T 0.042 TAT_signal pdb F Eukaryota T 8ape 18 GA i Q57ZM4_TRYB2 subunit-i/j MVYTRWKCDRLPVFQLKLFTQEYPMHAAVGIFTIIFLWKHMSHCSEETERKYGWWAGYPYWRDPIARRNETKYKQMIINNDVDITHPKWTGCSVEQLEELSRVV 104 T 0.18 FPN1 pdb F Eukaryota T 8ape 19 HA j D0A5R7_TRYB9 ATPTB6 MTKYELKMQYFDEWMIRWRKFQTESDWEIEKGRQWWRRFNMAVSGALFCGLVLYTSGTATLKRQYGLPHFFDIGVDGQAKETMLKTLTSRWRYTPQGYGRVLITGVPTYILFVTLEHYRERRRMQQYLQQNTVFGEQMRRLLSTGKIEEYLPVNIKATLPASQQAIYNY 169 T 0.14 GT87 pdbpssm F Eukaryota T 8ape 20 IA k Q57VT0_TRYB2 subunit-k MLRRSSAALIRRTPVRHSGGELFVRPKLEEIPPADQCRGFFGPLNDSLKFLRLLDIKWMMNRAVAMRREYLIATPTLFTFIWMFTWKGAVIYFWGDRAPPRRMDWNTEETGRLPLGFKPTPAPL 124 T 0.28 DUF1048 pdbpercent F Eukaryota T 8ape 21 LA n Q582T1_TRYB2 ATPTB11 MLRKTPLFAMATTRKALVGNGPTFSTGGECMNTCDIQNAFPMNDRGVRSSSPFQEPNTAIYDSYLAWTYFQPMDVHIEKLPAPEAKYYQRHTKKPWDVSSTELTEIQSRKKYFQTLGYLVAFIYLYFLMPKEKSFSGLSGPDGHWIMLPKGRPELF 156 T 0.091 FCP1_C pdb F Eukaryota T 8ape 22 MA o Q57Z84_TRYB2 ATPTB12 MSSGFHFHDVSNDAIKGMPPSEALHKHLENAQLAHRICLAKALKAGEPPVEKCALTWGEVLIRYQAWSEYRPPFQDSVAQAKYKKYWSKKRQEEDDKNPFK 101 T 6.5 DUF1539 pdbhh F Eukaryota T 8ape 23 NA p C9ZLR9_TRYB9 subunit-b MLRRLVPRVMMAPMGGATALCTSRGYNMLVFRDPKRRPQLSEEERAKVVVNQAEWPEEFKDFDPDDPYKNSPEIIKGMSSWNLFLWGVECAFIYQFYELVFPKSI 105 T 0.25 Mus7 unp F Eukaryota T 8ape 24 OA q Q583U4_TRYB2 ATPEG3 MTENIEAVMSDFWSNPADHFRPNLKALTLYAERQHYVDRWLHVKERWLAPWYLPWWSPLFQLGTWYSQRSRNLFLVENHLSYRPYKFRRNDEDRNNPY 98 T 32 CbiN pdbhh F Eukaryota T 8ape 25 PA r ATPEG4 MLLGGFVPRRFSQFNRDPCWMFFIFSVGFWLGEYPAMMIKYNARDLVYDPHRYVWSHHDDHH 62 T 0.029 DUF5357 pdb F T 8apf 3 C,D,F J1,K1,L1 ATP18_TRYBB ATP SYNTHASE F1 SUBUNIT P18 MMRRVYSPVFCSVAAARFAATSAAKKYDLFGYEVDTNTAPWIEKIKKCKYYDEAGEVLVNMNVSNCPPDIATYNATLQCIYQSPSKQSTPVDNESKFCAMMDLLEEMQHRNRLKPNEESWTWVMKECVKSGQFRLGYCIQQVMETECKGCPADLVKANEANAQKAKTEGKEHPGHLSQQAGLFDVKVE 188 T 1.1E-05 RPM2 pdbhh F Eukaryota T 8apf 4 CA,E l,L Q387J1_TRYB2 subunit-e MSAKAAPKTLHQVRNVAYFFAAWLGVQKGYIEKSANDRLWVEHQRKVRQQNVERQQALDSIKLMQQGVRATTPGQLEGVPAELQQLAEAFTK 92 T 0.051 DUF3708 pdb F Eukaryota T 8apf 5 DA,G m,M C9ZJA0_TRYB9 subunit-g MSSTKCAVACKIMTPLCNAASKVQARSAKKLAALTDAGIQKTISEHNANGTDAAVSSTKRYLAEQRQLFHYRVVRFFDECHYIISGEYFAQYTKVNLIWDLRFLTKLVVLFLIGTVLGRQSIFPPIDPDSPLVEALVTKVNPNY 144 T 0.076 CbtA pdbpercent F Eukaryota T 8apf 8 S a ATP6_TRYBB F-ATPASE PROTEIN 6 MFLFFFCDLFWLRLLLCMYYCVWSRLCFIVYFNCLMLIFDFLLFCLFDLYLFVGLCLFLLLWFMLFNLYSLILYYCITYLNLYLLFCIVFLLYIAFLFLFCFLCDFFLFNNLLVGDSFMDVFFIRFLLCFLECFSLLCRCLSTFLRLFCNLLSSHFLLLMFFDFFYFIFVFFFYGVFCYWFILFIFVFCFCLLFYVFLYLLDLFAAILQLFIFCNMILQLIMDFLLFLLFV 231 T 0.43 ATP-synt_A unppssm F Eukaryota T 8apf 9 T c Q585K5_TRYB2 subunit-8 MLRRLGANVSNMARPMNKYAVTVSPRRHLEPMSTWYLASWAMVWYYAFFFWMPMVWTDIMVPSFVYNKLPVIHFLQEKRAEQKLRRVLDETYTEWTEELDQAHVTDAITRSLNI 114 T 10 bVLRF1 pdbhh F Eukaryota T 8apf 10 U d Q57ZW9_TRYB2 subunit-d MRRVSSPNITIQSVRWISGVSPLLYFPPTTTSTTNREDQINKNTNIAIQMIKRYKGEVPPHYTRKSSATIEQVEKEIDALLGGAEKLRKTSTDDQPMDKLTLMERCLRHALWSYHKEEGRYDFDQIGRWVVYTPEDEVKLAQLKREVEAKEKLAALRKRREEEGLPGGPVPRINWPQEYSSFIDREPVVAKRIRYDTLASTTLERDEKQIESTLQQYRRASQDKRLDDLVDLLERFKPVLAREAIMQRLTIKHLEGQLGVWRYMDWCPEVRDRAELEVDITGWQWWSPLEERRLLPVRLRSVNEVREIMSKTQAKKSAEAAERNPIVTQTSTGDNARDRLLKEVLALQARINQRDEVEPSQTEQKKKAHH 370 T 0.0093 SAM_Exu pdbpercent F Eukaryota T 8apf 12 W f Q57ZE2_TRYB2 subunit-f MVLFSTYRSSRLVSKEFLHGPVMRFRALGEYYFQRAWNGTLNWALPGEYRLYAVMIPFIYFYHRWHNDHTLDRDHVEKAMIMRWGGTLEDVRKLSAKDQLRVRCFTDIEKLYSAYGPKDTYLQPPGDTLPGKDFYRKAGGAQAHH 145 T 0.39 DUF3094 pdbhh F Eukaryota T 8apf 14 Y h Q389Z3_TRYB2 ATPTB4 MRRTFISFSAASAAAAAPVTSTKMQTLHKLLTGEVSFKNKAPVKDCNIVHQFGENWATELSAYAKTLPAEQQKIIVRQIARVKLTRYTVAELAAYCGDGPALLDETARAANIEQGVAFVKAKGVEAFEKYVAEESTNANWKPEEAKKFIEDVKAKAK 157 T 0.042 TAT_signal pdb F Eukaryota T 8apf 15 Z i Q57ZM4_TRYB2 subunit-i/j MVYTRWKCDRLPVFQLKLFTQEYPMHAAVGIFTIIFLWKHMSHCSEETERKYGWWAGYPYWRDPIARRNETKYKQMIINNDVDITHPKWTGCSVEQLEELSRVV 104 T 0.18 FPN1 pdb F Eukaryota T 8apf 16 AA j D0A5R7_TRYB9 ATPTB6 MTKYELKMQYFDEWMIRWRKFQTESDWEIEKGRQWWRRFNMAVSGALFCGLVLYTSGTATLKRQYGLPHFFDIGVDGQAKETMLKTLTSRWRYTPQGYGRVLITGVPTYILFVTLEHYRERRRMQQYLQQNTVFGEQMRRLLSTGKIEEYLPVNIKATLPASQQAIYNY 169 T 0.14 GT87 pdbpssm F Eukaryota T 8apf 17 BA k Q57VT0_TRYB2 subunit-k MLRRSSAALIRRTPVRHSGGELFVRPKLEEIPPADQCRGFFGPLNDSLKFLRLLDIKWMMNRAVAMRREYLIATPTLFTFIWMFTWKGAVIYFWGDRAPPRRMDWNTEETGRLPLGFKPTPAPL 124 T 0.28 DUF1048 pdbpercent F Eukaryota T 8apf 18 EA n Q582T1_TRYB2 ATPTB11 MLRKTPLFAMATTRKALVGNGPTFSTGGECMNTCDIQNAFPMNDRGVRSSSPFQEPNTAIYDSYLAWTYFQPMDVHIEKLPAPEAKYYQRHTKKPWDVSSTELTEIQSRKKYFQTLGYLVAFIYLYFLMPKEKSFSGLSGPDGHWIMLPKGRPELF 156 T 0.091 FCP1_C pdb F Eukaryota T 8apf 19 FA o Q57Z84_TRYB2 ATPTB12 MSSGFHFHDVSNDAIKGMPPSEALHKHLENAQLAHRICLAKALKAGEPPVEKCALTWGEVLIRYQAWSEYRPPFQDSVAQAKYKKYWSKKRQEEDDKNPFK 101 T 6.5 DUF1539 pdbhh F Eukaryota T 8apf 20 GA p C9ZLR9_TRYB9 subunit-b MLRRLVPRVMMAPMGGATALCTSRGYNMLVFRDPKRRPQLSEEERAKVVVNQAEWPEEFKDFDPDDPYKNSPEIIKGMSSWNLFLWGVECAFIYQFYELVFPKSI 105 T 0.25 Mus7 unp F Eukaryota T 8apf 21 HA q Q583U4_TRYB2 ATPEG3 MTENIEAVMSDFWSNPADHFRPNLKALTLYAERQHYVDRWLHVKERWLAPWYLPWWSPLFQLGTWYSQRSRNLFLVENHLSYRPYKFRRNDEDRNNPY 98 T 32 CbiN pdbhh F Eukaryota T 8apf 22 IA r ATPEG4 MLLGGFVPRRFSQFNRDPCWMFFIFSVGFWLGEYPAMMIKYNARDLVYDPHRYVWSHHDDHH 62 T 0.029 DUF5357 pdb F T 8apg 1 A,M L,l Q387J1_TRYB2 subunit-e MSAKAAPKTLHQVRNVAYFFAAWLGVQKGYIEKSANDRLWVEHQRKVRQQNVERQQALDSIKLMQQGVRATTPGQLEGVPAELQQLAEAFTK 92 T 0.051 DUF3708 pdb F Eukaryota T 8apg 2 B,N M,m C9ZJA0_TRYB9 subunit-g MSSTKCAVACKIMTPLCNAASKVQARSAKKLAALTDAGIQKTISEHNANGTDAAVSSTKRYLAEQRQLFHYRVVRFFDECHYIISGEYFAQYTKVNLIWDLRFLTKLVVLFLIGTVLGRQSIFPPIDPDSPLVEALVTKVNPNY 144 T 0.076 CbtA pdbpercent F Eukaryota T 8apg 3 C a ATP6_TRYBB F-ATPASE PROTEIN 6 MFLFFFCDLFWLRLLLCMYYCVWSRLCFIVYFNCLMLIFDFLLFCLFDLYLFVGLCLFLLLWFMLFNLYSLILYYCITYLNLYLLFCIVFLLYIAFLFLFCFLCDFFLFNNLLVGDSFMDVFFIRFLLCFLECFSLLCRCLSTFLRLFCNLLSSHFLLLMFFDFFYFIFVFFFYGVFCYWFILFIFVFCFCLLFYVFLYLLDLFAAILQLFIFCNMILQLIMDFLLFLLFV 231 T 0.43 ATP-synt_A unppssm F Eukaryota T 8apg 4 D c Q585K5_TRYB2 subunit-8 MLRRLGANVSNMARPMNKYAVTVSPRRHLEPMSTWYLASWAMVWYYAFFFWMPMVWTDIMVPSFVYNKLPVIHFLQEKRAEQKLRRVLDETYTEWTEELDQAHVTDAITRSLNI 114 T 10 bVLRF1 pdbhh F Eukaryota T 8apg 5 E d Q57ZW9_TRYB2 subunit-d MRRVSSPNITIQSVRWISGVSPLLYFPPTTTSTTNREDQINKNTNIAIQMIKRYKGEVPPHYTRKSSATIEQVEKEIDALLGGAEKLRKTSTDDQPMDKLTLMERCLRHALWSYHKEEGRYDFDQIGRWVVYTPEDEVKLAQLKREVEAKEKLAALRKRREEEGLPGGPVPRINWPQEYSSFIDREPVVAKRIRYDTLASTTLERDEKQIESTLQQYRRASQDKRLDDLVDLLERFKPVLAREAIMQRLTIKHLEGQLGVWRYMDWCPEVRDRAELEVDITGWQWWSPLEERRLLPVRLRSVNEVREIMSKTQAKKSAEAAERNPIVTQTSTGDNARDRLLKEVLALQARINQRDEVEPSQTEQKKKAHH 370 T 0.0093 SAM_Exu pdbpercent F Eukaryota T 8apg 7 G f Q57ZE2_TRYB2 subunit-f MVLFSTYRSSRLVSKEFLHGPVMRFRALGEYYFQRAWNGTLNWALPGEYRLYAVMIPFIYFYHRWHNDHTLDRDHVEKAMIMRWGGTLEDVRKLSAKDQLRVRCFTDIEKLYSAYGPKDTYLQPPGDTLPGKDFYRKAGGAQAHH 145 T 0.39 DUF3094 pdbhh F Eukaryota T 8apg 9 I h Q389Z3_TRYB2 ATPTB4 MRRTFISFSAASAAAAAPVTSTKMQTLHKLLTGEVSFKNKAPVKDCNIVHQFGENWATELSAYAKTLPAEQQKIIVRQIARVKLTRYTVAELAAYCGDGPALLDETARAANIEQGVAFVKAKGVEAFEKYVAEESTNANWKPEEAKKFIEDVKAKAK 157 T 0.042 TAT_signal pdb F Eukaryota T 8apg 10 J i Q57ZM4_TRYB2 subunit-i/j MVYTRWKCDRLPVFQLKLFTQEYPMHAAVGIFTIIFLWKHMSHCSEETERKYGWWAGYPYWRDPIARRNETKYKQMIINNDVDITHPKWTGCSVEQLEELSRVV 104 T 0.18 FPN1 pdb F Eukaryota T 8apg 11 K j D0A5R7_TRYB9 ATPTB6 MTKYELKMQYFDEWMIRWRKFQTESDWEIEKGRQWWRRFNMAVSGALFCGLVLYTSGTATLKRQYGLPHFFDIGVDGQAKETMLKTLTSRWRYTPQGYGRVLITGVPTYILFVTLEHYRERRRMQQYLQQNTVFGEQMRRLLSTGKIEEYLPVNIKATLPASQQAIYNY 169 T 0.14 GT87 pdbpssm F Eukaryota T 8apg 12 L k Q57VT0_TRYB2 subunit-k MLRRSSAALIRRTPVRHSGGELFVRPKLEEIPPADQCRGFFGPLNDSLKFLRLLDIKWMMNRAVAMRREYLIATPTLFTFIWMFTWKGAVIYFWGDRAPPRRMDWNTEETGRLPLGFKPTPAPL 124 T 0.28 DUF1048 pdbpercent F Eukaryota T 8apg 13 O n Q582T1_TRYB2 ATPTB11 MLRKTPLFAMATTRKALVGNGPTFSTGGECMNTCDIQNAFPMNDRGVRSSSPFQEPNTAIYDSYLAWTYFQPMDVHIEKLPAPEAKYYQRHTKKPWDVSSTELTEIQSRKKYFQTLGYLVAFIYLYFLMPKEKSFSGLSGPDGHWIMLPKGRPELF 156 T 0.091 FCP1_C pdb F Eukaryota T 8apg 14 P o Q57Z84_TRYB2 ATPTB12 MSSGFHFHDVSNDAIKGMPPSEALHKHLENAQLAHRICLAKALKAGEPPVEKCALTWGEVLIRYQAWSEYRPPFQDSVAQAKYKKYWSKKRQEEDDKNPFK 101 T 6.5 DUF1539 pdbhh F Eukaryota T 8apg 15 Q p C9ZLR9_TRYB9 subunit-b MLRRLVPRVMMAPMGGATALCTSRGYNMLVFRDPKRRPQLSEEERAKVVVNQAEWPEEFKDFDPDDPYKNSPEIIKGMSSWNLFLWGVECAFIYQFYELVFPKSI 105 T 0.25 Mus7 unp F Eukaryota T 8apg 16 R q Q583U4_TRYB2 ATPEG3 MTENIEAVMSDFWSNPADHFRPNLKALTLYAERQHYVDRWLHVKERWLAPWYLPWWSPLFQLGTWYSQRSRNLFLVENHLSYRPYKFRRNDEDRNNPY 98 T 32 CbiN pdbhh F Eukaryota T 8apg 17 S r ATPEG4 MLLGGFVPRRFSQFNRDPCWMFFIFSVGFWLGEYPAMMIKYNARDLVYDPHRYVWSHHDDHH 62 T 0.029 DUF5357 pdb F T 8apg 20 V,W,X J1,K1,L1 ATP18_TRYBB ATP SYNTHASE F1 SUBUNIT P18 MMRRVYSPVFCSVAAARFAATSAAKKYDLFGYEVDTNTAPWIEKIKKCKYYDEAGEVLVNMNVSNCPPDIATYNATLQCIYQSPSKQSTPVDNESKFCAMMDLLEEMQHRNRLKPNEESWTWVMKECVKSGQFRLGYCIQQVMETECKGCPADLVKANEANAQKAKTEGKEHPGHLSQQAGLFDVKVE 188 T 1.1E-05 RPM2 pdbhh F Eukaryota T 8aph 1 A,M L,l Q387J1_TRYB2 subunit-e MSAKAAPKTLHQVRNVAYFFAAWLGVQKGYIEKSANDRLWVEHQRKVRQQNVERQQALDSIKLMQQGVRATTPGQLEGVPAELQQLAEAFTK 92 T 0.051 DUF3708 pdb F Eukaryota T 8aph 2 B,N M,m C9ZJA0_TRYB9 subunit-g MSSTKCAVACKIMTPLCNAASKVQARSAKKLAALTDAGIQKTISEHNANGTDAAVSSTKRYLAEQRQLFHYRVVRFFDECHYIISGEYFAQYTKVNLIWDLRFLTKLVVLFLIGTVLGRQSIFPPIDPDSPLVEALVTKVNPNY 144 T 0.076 CbtA pdbpercent F Eukaryota T 8aph 3 C a ATP6_TRYBB F-ATPASE PROTEIN 6 MFLFFFCDLFWLRLLLCMYYCVWSRLCFIVYFNCLMLIFDFLLFCLFDLYLFVGLCLFLLLWFMLFNLYSLILYYCITYLNLYLLFCIVFLLYIAFLFLFCFLCDFFLFNNLLVGDSFMDVFFIRFLLCFLECFSLLCRCLSTFLRLFCNLLSSHFLLLMFFDFFYFIFVFFFYGVFCYWFILFIFVFCFCLLFYVFLYLLDLFAAILQLFIFCNMILQLIMDFLLFLLFV 231 T 0.43 ATP-synt_A unppssm F Eukaryota T 8aph 4 D c Q585K5_TRYB2 subunit-8 MLRRLGANVSNMARPMNKYAVTVSPRRHLEPMSTWYLASWAMVWYYAFFFWMPMVWTDIMVPSFVYNKLPVIHFLQEKRAEQKLRRVLDETYTEWTEELDQAHVTDAITRSLNI 114 T 10 bVLRF1 pdbhh F Eukaryota T 8aph 5 E d Q57ZW9_TRYB2 subunit-d MRRVSSPNITIQSVRWISGVSPLLYFPPTTTSTTNREDQINKNTNIAIQMIKRYKGEVPPHYTRKSSATIEQVEKEIDALLGGAEKLRKTSTDDQPMDKLTLMERCLRHALWSYHKEEGRYDFDQIGRWVVYTPEDEVKLAQLKREVEAKEKLAALRKRREEEGLPGGPVPRINWPQEYSSFIDREPVVAKRIRYDTLASTTLERDEKQIESTLQQYRRASQDKRLDDLVDLLERFKPVLAREAIMQRLTIKHLEGQLGVWRYMDWCPEVRDRAELEVDITGWQWWSPLEERRLLPVRLRSVNEVREIMSKTQAKKSAEAAERNPIVTQTSTGDNARDRLLKEVLALQARINQRDEVEPSQTEQKKKAHH 370 T 0.0093 SAM_Exu pdbpercent F Eukaryota T 8aph 7 G f Q57ZE2_TRYB2 subunit-f MVLFSTYRSSRLVSKEFLHGPVMRFRALGEYYFQRAWNGTLNWALPGEYRLYAVMIPFIYFYHRWHNDHTLDRDHVEKAMIMRWGGTLEDVRKLSAKDQLRVRCFTDIEKLYSAYGPKDTYLQPPGDTLPGKDFYRKAGGAQAHH 145 T 0.39 DUF3094 pdbhh F Eukaryota T 8aph 9 I h Q389Z3_TRYB2 ATPTB4 MRRTFISFSAASAAAAAPVTSTKMQTLHKLLTGEVSFKNKAPVKDCNIVHQFGENWATELSAYAKTLPAEQQKIIVRQIARVKLTRYTVAELAAYCGDGPALLDETARAANIEQGVAFVKAKGVEAFEKYVAEESTNANWKPEEAKKFIEDVKAKAK 157 T 0.042 TAT_signal pdb F Eukaryota T 8aph 10 J i Q57ZM4_TRYB2 subunit-i/j MVYTRWKCDRLPVFQLKLFTQEYPMHAAVGIFTIIFLWKHMSHCSEETERKYGWWAGYPYWRDPIARRNETKYKQMIINNDVDITHPKWTGCSVEQLEELSRVV 104 T 0.18 FPN1 pdb F Eukaryota T 8aph 11 K j D0A5R7_TRYB9 ATPTB6 MTKYELKMQYFDEWMIRWRKFQTESDWEIEKGRQWWRRFNMAVSGALFCGLVLYTSGTATLKRQYGLPHFFDIGVDGQAKETMLKTLTSRWRYTPQGYGRVLITGVPTYILFVTLEHYRERRRMQQYLQQNTVFGEQMRRLLSTGKIEEYLPVNIKATLPASQQAIYNY 169 T 0.14 GT87 pdbpssm F Eukaryota T 8aph 12 L k Q57VT0_TRYB2 subunit-k MLRRSSAALIRRTPVRHSGGELFVRPKLEEIPPADQCRGFFGPLNDSLKFLRLLDIKWMMNRAVAMRREYLIATPTLFTFIWMFTWKGAVIYFWGDRAPPRRMDWNTEETGRLPLGFKPTPAPL 124 T 0.28 DUF1048 pdbpercent F Eukaryota T 8aph 13 O n Q582T1_TRYB2 ATPTB11 MLRKTPLFAMATTRKALVGNGPTFSTGGECMNTCDIQNAFPMNDRGVRSSSPFQEPNTAIYDSYLAWTYFQPMDVHIEKLPAPEAKYYQRHTKKPWDVSSTELTEIQSRKKYFQTLGYLVAFIYLYFLMPKEKSFSGLSGPDGHWIMLPKGRPELF 156 T 0.091 FCP1_C pdb F Eukaryota T 8aph 14 P o Q57Z84_TRYB2 ATPTB12 MSSGFHFHDVSNDAIKGMPPSEALHKHLENAQLAHRICLAKALKAGEPPVEKCALTWGEVLIRYQAWSEYRPPFQDSVAQAKYKKYWSKKRQEEDDKNPFK 101 T 6.5 DUF1539 pdbhh F Eukaryota T 8aph 15 Q p C9ZLR9_TRYB9 subunit-b MLRRLVPRVMMAPMGGATALCTSRGYNMLVFRDPKRRPQLSEEERAKVVVNQAEWPEEFKDFDPDDPYKNSPEIIKGMSSWNLFLWGVECAFIYQFYELVFPKSI 105 T 0.25 Mus7 unp F Eukaryota T 8aph 16 R q Q583U4_TRYB2 ATPEG3 MTENIEAVMSDFWSNPADHFRPNLKALTLYAERQHYVDRWLHVKERWLAPWYLPWWSPLFQLGTWYSQRSRNLFLVENHLSYRPYKFRRNDEDRNNPY 98 T 32 CbiN pdbhh F Eukaryota T 8aph 17 S r ATPEG4 MLLGGFVPRRFSQFNRDPCWMFFIFSVGFWLGEYPAMMIKYNARDLVYDPHRYVWSHHDDHH 62 T 0.029 DUF5357 pdb F T 8aph 20 V,W,X J1,K1,L1 ATP18_TRYBB ATP SYNTHASE F1 SUBUNIT P18 MMRRVYSPVFCSVAAARFAATSAAKKYDLFGYEVDTNTAPWIEKIKKCKYYDEAGEVLVNMNVSNCPPDIATYNATLQCIYQSPSKQSTPVDNESKFCAMMDLLEEMQHRNRLKPNEESWTWVMKECVKSGQFRLGYCIQQVMETECKGCPADLVKANEANAQKAKTEGKEHPGHLSQQAGLFDVKVE 188 T 1.1E-05 RPM2 pdbhh F Eukaryota T 8apj 1 A,M L,l Q387J1_TRYB2 subunit-e MSAKAAPKTLHQVRNVAYFFAAWLGVQKGYIEKSANDRLWVEHQRKVRQQNVERQQALDSIKLMQQGVRATTPGQLEGVPAELQQLAEAFTK 92 T 0.051 DUF3708 pdb F Eukaryota T 8apj 2 B,N M,m C9ZJA0_TRYB9 subunit-g MSSTKCAVACKIMTPLCNAASKVQARSAKKLAALTDAGIQKTISEHNANGTDAAVSSTKRYLAEQRQLFHYRVVRFFDECHYIISGEYFAQYTKVNLIWDLRFLTKLVVLFLIGTVLGRQSIFPPIDPDSPLVEALVTKVNPNY 144 T 0.076 CbtA pdbpercent F Eukaryota T 8apj 3 C a ATP6_TRYBB F-ATPASE PROTEIN 6 MFLFFFCDLFWLRLLLCMYYCVWSRLCFIVYFNCLMLIFDFLLFCLFDLYLFVGLCLFLLLWFMLFNLYSLILYYCITYLNLYLLFCIVFLLYIAFLFLFCFLCDFFLFNNLLVGDSFMDVFFIRFLLCFLECFSLLCRCLSTFLRLFCNLLSSHFLLLMFFDFFYFIFVFFFYGVFCYWFILFIFVFCFCLLFYVFLYLLDLFAAILQLFIFCNMILQLIMDFLLFLLFV 231 T 0.43 ATP-synt_A unppssm F Eukaryota T 8apj 4 D c Q585K5_TRYB2 subunit-8 MLRRLGANVSNMARPMNKYAVTVSPRRHLEPMSTWYLASWAMVWYYAFFFWMPMVWTDIMVPSFVYNKLPVIHFLQEKRAEQKLRRVLDETYTEWTEELDQAHVTDAITRSLNI 114 T 10 bVLRF1 pdbhh F Eukaryota T 8apj 5 E d Q57ZW9_TRYB2 subunit-d MRRVSSPNITIQSVRWISGVSPLLYFPPTTTSTTNREDQINKNTNIAIQMIKRYKGEVPPHYTRKSSATIEQVEKEIDALLGGAEKLRKTSTDDQPMDKLTLMERCLRHALWSYHKEEGRYDFDQIGRWVVYTPEDEVKLAQLKREVEAKEKLAALRKRREEEGLPGGPVPRINWPQEYSSFIDREPVVAKRIRYDTLASTTLERDEKQIESTLQQYRRASQDKRLDDLVDLLERFKPVLAREAIMQRLTIKHLEGQLGVWRYMDWCPEVRDRAELEVDITGWQWWSPLEERRLLPVRLRSVNEVREIMSKTQAKKSAEAAERNPIVTQTSTGDNARDRLLKEVLALQARINQRDEVEPSQTEQKKKAHH 370 T 0.0093 SAM_Exu pdbpercent F Eukaryota T 8apj 7 G f Q57ZE2_TRYB2 subunit-f MVLFSTYRSSRLVSKEFLHGPVMRFRALGEYYFQRAWNGTLNWALPGEYRLYAVMIPFIYFYHRWHNDHTLDRDHVEKAMIMRWGGTLEDVRKLSAKDQLRVRCFTDIEKLYSAYGPKDTYLQPPGDTLPGKDFYRKAGGAQAHH 145 T 0.39 DUF3094 pdbhh F Eukaryota T 8apj 9 I h Q389Z3_TRYB2 ATPTB4 MRRTFISFSAASAAAAAPVTSTKMQTLHKLLTGEVSFKNKAPVKDCNIVHQFGENWATELSAYAKTLPAEQQKIIVRQIARVKLTRYTVAELAAYCGDGPALLDETARAANIEQGVAFVKAKGVEAFEKYVAEESTNANWKPEEAKKFIEDVKAKAK 157 T 0.042 TAT_signal pdb F Eukaryota T 8apj 10 J i Q57ZM4_TRYB2 subunit-i/j MVYTRWKCDRLPVFQLKLFTQEYPMHAAVGIFTIIFLWKHMSHCSEETERKYGWWAGYPYWRDPIARRNETKYKQMIINNDVDITHPKWTGCSVEQLEELSRVV 104 T 0.18 FPN1 pdb F Eukaryota T 8apj 11 K j D0A5R7_TRYB9 ATPTB6 MTKYELKMQYFDEWMIRWRKFQTESDWEIEKGRQWWRRFNMAVSGALFCGLVLYTSGTATLKRQYGLPHFFDIGVDGQAKETMLKTLTSRWRYTPQGYGRVLITGVPTYILFVTLEHYRERRRMQQYLQQNTVFGEQMRRLLSTGKIEEYLPVNIKATLPASQQAIYNY 169 T 0.14 GT87 pdbpssm F Eukaryota T 8apj 12 L k Q57VT0_TRYB2 subunit-k MLRRSSAALIRRTPVRHSGGELFVRPKLEEIPPADQCRGFFGPLNDSLKFLRLLDIKWMMNRAVAMRREYLIATPTLFTFIWMFTWKGAVIYFWGDRAPPRRMDWNTEETGRLPLGFKPTPAPL 124 T 0.28 DUF1048 pdbpercent F Eukaryota T 8apj 13 O n Q582T1_TRYB2 ATPTB11 MLRKTPLFAMATTRKALVGNGPTFSTGGECMNTCDIQNAFPMNDRGVRSSSPFQEPNTAIYDSYLAWTYFQPMDVHIEKLPAPEAKYYQRHTKKPWDVSSTELTEIQSRKKYFQTLGYLVAFIYLYFLMPKEKSFSGLSGPDGHWIMLPKGRPELF 156 T 0.091 FCP1_C pdb F Eukaryota T 8apj 14 P o Q57Z84_TRYB2 ATPTB12 MSSGFHFHDVSNDAIKGMPPSEALHKHLENAQLAHRICLAKALKAGEPPVEKCALTWGEVLIRYQAWSEYRPPFQDSVAQAKYKKYWSKKRQEEDDKNPFK 101 T 6.5 DUF1539 pdbhh F Eukaryota T 8apj 15 Q p C9ZLR9_TRYB9 subunit-b MLRRLVPRVMMAPMGGATALCTSRGYNMLVFRDPKRRPQLSEEERAKVVVNQAEWPEEFKDFDPDDPYKNSPEIIKGMSSWNLFLWGVECAFIYQFYELVFPKSI 105 T 0.25 Mus7 unp F Eukaryota T 8apj 16 R q Q583U4_TRYB2 ATPEG3 MTENIEAVMSDFWSNPADHFRPNLKALTLYAERQHYVDRWLHVKERWLAPWYLPWWSPLFQLGTWYSQRSRNLFLVENHLSYRPYKFRRNDEDRNNPY 98 T 32 CbiN pdbhh F Eukaryota T 8apj 17 S r ATPEG4 MLLGGFVPRRFSQFNRDPCWMFFIFSVGFWLGEYPAMMIKYNARDLVYDPHRYVWSHHDDHH 62 T 0.029 DUF5357 pdb F T 8apj 20 V,W,X J1,K1,L1 ATP18_TRYBB ATP SYNTHASE F1 SUBUNIT P18 MMRRVYSPVFCSVAAARFAATSAAKKYDLFGYEVDTNTAPWIEKIKKCKYYDEAGEVLVNMNVSNCPPDIATYNATLQCIYQSPSKQSTPVDNESKFCAMMDLLEEMQHRNRLKPNEESWTWVMKECVKSGQFRLGYCIQQVMETECKGCPADLVKANEANAQKAKTEGKEHPGHLSQQAGLFDVKVE 188 T 1.1E-05 RPM2 pdbhh F Eukaryota T 8apk 1 A,M L,l Q387J1_TRYB2 subunit-e MSAKAAPKTLHQVRNVAYFFAAWLGVQKGYIEKSANDRLWVEHQRKVRQQNVERQQALDSIKLMQQGVRATTPGQLEGVPAELQQLAEAFTK 92 T 0.051 DUF3708 pdb F Eukaryota T 8apk 2 B,N M,m C9ZJA0_TRYB9 subunit-g MSSTKCAVACKIMTPLCNAASKVQARSAKKLAALTDAGIQKTISEHNANGTDAAVSSTKRYLAEQRQLFHYRVVRFFDECHYIISGEYFAQYTKVNLIWDLRFLTKLVVLFLIGTVLGRQSIFPPIDPDSPLVEALVTKVNPNY 144 T 0.076 CbtA pdbpercent F Eukaryota T 8apk 3 C a ATP6_TRYBB F-ATPASE PROTEIN 6 MFLFFFCDLFWLRLLLCMYYCVWSRLCFIVYFNCLMLIFDFLLFCLFDLYLFVGLCLFLLLWFMLFNLYSLILYYCITYLNLYLLFCIVFLLYIAFLFLFCFLCDFFLFNNLLVGDSFMDVFFIRFLLCFLECFSLLCRCLSTFLRLFCNLLSSHFLLLMFFDFFYFIFVFFFYGVFCYWFILFIFVFCFCLLFYVFLYLLDLFAAILQLFIFCNMILQLIMDFLLFLLFV 231 T 0.43 ATP-synt_A unppssm F Eukaryota T 8apk 4 D c Q585K5_TRYB2 subunit-8 MLRRLGANVSNMARPMNKYAVTVSPRRHLEPMSTWYLASWAMVWYYAFFFWMPMVWTDIMVPSFVYNKLPVIHFLQEKRAEQKLRRVLDETYTEWTEELDQAHVTDAITRSLNI 114 T 10 bVLRF1 pdbhh F Eukaryota T 8apk 5 E d Q57ZW9_TRYB2 subunit-d MRRVSSPNITIQSVRWISGVSPLLYFPPTTTSTTNREDQINKNTNIAIQMIKRYKGEVPPHYTRKSSATIEQVEKEIDALLGGAEKLRKTSTDDQPMDKLTLMERCLRHALWSYHKEEGRYDFDQIGRWVVYTPEDEVKLAQLKREVEAKEKLAALRKRREEEGLPGGPVPRINWPQEYSSFIDREPVVAKRIRYDTLASTTLERDEKQIESTLQQYRRASQDKRLDDLVDLLERFKPVLAREAIMQRLTIKHLEGQLGVWRYMDWCPEVRDRAELEVDITGWQWWSPLEERRLLPVRLRSVNEVREIMSKTQAKKSAEAAERNPIVTQTSTGDNARDRLLKEVLALQARINQRDEVEPSQTEQKKKAHH 370 T 0.0093 SAM_Exu pdbpercent F Eukaryota T 8apk 7 G f Q57ZE2_TRYB2 subunit-f MVLFSTYRSSRLVSKEFLHGPVMRFRALGEYYFQRAWNGTLNWALPGEYRLYAVMIPFIYFYHRWHNDHTLDRDHVEKAMIMRWGGTLEDVRKLSAKDQLRVRCFTDIEKLYSAYGPKDTYLQPPGDTLPGKDFYRKAGGAQAHH 145 T 0.39 DUF3094 pdbhh F Eukaryota T 8apk 9 I h Q389Z3_TRYB2 ATPTB4 MRRTFISFSAASAAAAAPVTSTKMQTLHKLLTGEVSFKNKAPVKDCNIVHQFGENWATELSAYAKTLPAEQQKIIVRQIARVKLTRYTVAELAAYCGDGPALLDETARAANIEQGVAFVKAKGVEAFEKYVAEESTNANWKPEEAKKFIEDVKAKAK 157 T 0.042 TAT_signal pdb F Eukaryota T 8apk 10 J i Q57ZM4_TRYB2 subunit-i/j MVYTRWKCDRLPVFQLKLFTQEYPMHAAVGIFTIIFLWKHMSHCSEETERKYGWWAGYPYWRDPIARRNETKYKQMIINNDVDITHPKWTGCSVEQLEELSRVV 104 T 0.18 FPN1 pdb F Eukaryota T 8apk 11 K j D0A5R7_TRYB9 ATPTB6 MTKYELKMQYFDEWMIRWRKFQTESDWEIEKGRQWWRRFNMAVSGALFCGLVLYTSGTATLKRQYGLPHFFDIGVDGQAKETMLKTLTSRWRYTPQGYGRVLITGVPTYILFVTLEHYRERRRMQQYLQQNTVFGEQMRRLLSTGKIEEYLPVNIKATLPASQQAIYNY 169 T 0.14 GT87 pdbpssm F Eukaryota T 8apk 12 L k Q57VT0_TRYB2 subunit-k MLRRSSAALIRRTPVRHSGGELFVRPKLEEIPPADQCRGFFGPLNDSLKFLRLLDIKWMMNRAVAMRREYLIATPTLFTFIWMFTWKGAVIYFWGDRAPPRRMDWNTEETGRLPLGFKPTPAPL 124 T 0.28 DUF1048 pdbpercent F Eukaryota T 8apk 13 O n Q582T1_TRYB2 ATPTB11 MLRKTPLFAMATTRKALVGNGPTFSTGGECMNTCDIQNAFPMNDRGVRSSSPFQEPNTAIYDSYLAWTYFQPMDVHIEKLPAPEAKYYQRHTKKPWDVSSTELTEIQSRKKYFQTLGYLVAFIYLYFLMPKEKSFSGLSGPDGHWIMLPKGRPELF 156 T 0.091 FCP1_C pdb F Eukaryota T 8apk 14 P o Q57Z84_TRYB2 ATPTB12 MSSGFHFHDVSNDAIKGMPPSEALHKHLENAQLAHRICLAKALKAGEPPVEKCALTWGEVLIRYQAWSEYRPPFQDSVAQAKYKKYWSKKRQEEDDKNPFK 101 T 6.5 DUF1539 pdbhh F Eukaryota T 8apk 15 Q p C9ZLR9_TRYB9 subunit-b MLRRLVPRVMMAPMGGATALCTSRGYNMLVFRDPKRRPQLSEEERAKVVVNQAEWPEEFKDFDPDDPYKNSPEIIKGMSSWNLFLWGVECAFIYQFYELVFPKSI 105 T 0.25 Mus7 unp F Eukaryota T 8apk 16 R q Q583U4_TRYB2 ATPEG3 MTENIEAVMSDFWSNPADHFRPNLKALTLYAERQHYVDRWLHVKERWLAPWYLPWWSPLFQLGTWYSQRSRNLFLVENHLSYRPYKFRRNDEDRNNPY 98 T 32 CbiN pdbhh F Eukaryota T 8apk 17 S r ATPEG4 MLLGGFVPRRFSQFNRDPCWMFFIFSVGFWLGEYPAMMIKYNARDLVYDPHRYVWSHHDDHH 62 T 0.029 DUF5357 pdb F T 8apk 23 CA,DA,EA J1,K1,L1 ATP18_TRYBB ATP SYNTHASE F1 SUBUNIT P18 MMRRVYSPVFCSVAAARFAATSAAKKYDLFGYEVDTNTAPWIEKIKKCKYYDEAGEVLVNMNVSNCPPDIATYNATLQCIYQSPSKQSTPVDNESKFCAMMDLLEEMQHRNRLKPNEESWTWVMKECVKSGQFRLGYCIQQVMETECKGCPADLVKANEANAQKAKTEGKEHPGHLSQQAGLFDVKVE 188 T 1.1E-05 RPM2 pdbhh F Eukaryota T 8apn 21 U Am uL18m PLKPLVFRPLKLFFWATNRHVHAKVVRFSSPFNEGVPVVDISTFDAFNKLKGSAPVVAPRSLECYKEVAKMVKEETARQNINEVTLHLNSHASDRGVREVVRELKNLGLLVKKV 114 T 0.0038 Ribosomal_L18p pdbhh F T 8apn 33 GA Ay bL31m LVRPLTKAMTVVLSNGATLRLPTVYARAKPWFPVMDLHSHNVWKHKIKTDFQLESEKNITPDFSNFYNKFGK 72 T 0.002 Ribosomal_L31 pdbhh F T 8apn 39 MA AE mL40 EGNTRLQKVVSFFVPEVEKKEEEEKLATQYKRWKVAQVHAWNHDIAVKHRLQTEAIASLPQRLKEQALKPDYSPIPLNRKLLFHTPPESYRD 92 T 7.3E-05 MRP-L28 pdbhh F T 8apn 43 QA AI mL63 VVFKTTGGKAWNPPGGLKPLTNTQKRSRKENLQILLRNLSVLKLAAENQPEVTVNLFSPLKFMH 64 T 0.0048 L31 pdbhh F T 8apn 45 SA AK mL87 RPIMHKNWDWEFVVGAKAGRKPAIQRPKPHQWYYCNPKYSAEDPLPTKIFPPHAPPTAESLDDWAKFRKLCPKDPVEAKKFRKHFVRFLNQRNYDWRTAFERGLAKEVAVAKAAQRAEDETKRQEAWHAYRTAVFESAL 139 T 6 Serglycin pdbhh F T 8apn 46 TA AL mL116 NTGVPGPRPEVAQKLSTEYQGHILRMISLAESASELDEVLWSSKKHLRPVHIARSCLKLEYLRTKEKGREVSEPIKNLASELENYVELYSTKFTIGQVSQLVRGLSSIRRNIQPDLLLKLAAVVVADDGRQVQLANEMDCRDLFFGFFSQGFDNELFWKRLSESVLPRLPYFNADVVSTVLRVVSGLRFLHNTEFAHATMTALVPKVGDLSPARLADAFFSASLLDPTDVSGLNAKLEERFLREFTSFPIKDTVTMFQTVTVRRHSTPELAAQVAPLVAAQAHQLPVRHLRRALEGMVTAGWKDTAEIPLYAILAKQAARLVLTPVQLLRQLARIFANTGLKAGPGANQPLAPYFAALQRELEGRLAELDEQVTDDFAESFKKVGIAEGARVQI 394 T 0.24 TadB_TadC_N pdb F T 8apn 47 UA AM mL116 TIFNTGVPGPRPEVAQKLSTEYQGHILRMISLAESASELDEVLWSSKKHLRPVHIARSCLKLEYLRTKEKGREVSEPIKNLASELENYVELYSTKFTIGQVSQLVRGLSSIRRNIQPDLLLKLAAVVVADDGRQVQLANEMDCRDLFFGFFSQGFDNELFWKRLSESVLPRLPYFNADVVSTVLRVVSGLRFLHNTEFAHATMTALVPKVGDLSPARLADAFFSASLLDPTDVSGLNAKLEERFLREFTSFPIKDTVTMFQTVTVRRHSTPELAAQVAPLVAAQAHQLPVRHLRRALEGMVTAGWKDTAEIPLYAILAKQAARLVLGKQSAATSAILGKHVDNQGYQRTPVQLLRQLARIFANTGLKAGPGANQPLAPYFAALQRELEGRLAELDEQVTDDFAESFKKVGIAEGARVQI 419 T 9.8 FAST_1 pdbhh F T 8apn 48 VA AN mL116 GTIFNTGVPGPRPEVAQKLSTEYQGHILRMISLAESASELDEVLWSSKKHLRPVHIARSCLKLEYLRTKEKGREVSEPIKNLASELENYVELYSTKFTIGQVSQLVRGLSSIRRNIQPDLLLKLAAVVVADDGRQVQLANEMDCRDLFFGFFSQGFDNELFWKRLSESVLPRLPYFNADVVSTVLRVVSGLRFLHNTEFAHATMTALVPKVGDLSPARLADAFFSASLLDPTDVSGLNAKLEERFLREFTSFPIKDTVTMFQTVTVRRHSTPELAAQVAPLVAAQAHQLPVRHLRRALEGMVTAGWKDTAEIPLYAILAKQAARLVLGKQSAATSAILGKHVDNQGYQRTPVQLLRQLARIFANTGLKAGPGANQPLAPYFAALQRELEGRLAELDEQVTDDFAESFKKVGIAEGARVQI 420 T 9.8 FAST_1 pdbhh F T 8apn 49 WA AO mL118 YTWHFLSRQRVEAVNKATDILELEDIMRLEGNKYDYIAIRAFLKRVCILLQERADALGLPPSNEGLLVRFDEPERARYEALVSQVCDVVSARAKWFDPSNAAAVAYCLTRWLGRAEAPLIEQLLRRVVARLPEAKSKDVQYALDATLESAAAPHLEHLREPMLRAAGAFLGAKLPTGRVPPEVVAKITRLLVNHWDQPDEELLEAIVTDIAVRLEIYSPTALGRTLLALSKVPALTGAAFKRSRSSFLPEGVNVPSGADVAVPLADACLAHVAAHAAEHANEHDLIKFLGAISKLASPGRAATAGADAGAEATESGAAWAKRNSASLAWFALEQRLAPSTRGSFEGNQFPFVIKLVSAAARPPPAVTKFISSTVAKE 377 T 3.4 TAN pdbpercent F T 8apn 50 XA Xa mL120 IEEYYVVPPACPPPPHNPKTLKYVPKLNRTQIIARVYEAKTPTELENSALGKKFFSEFAAVAKLVRLSQLRRANVYNSRDDAMCVSIYNTSVRVADKLLHLSSDEELCGLIWALSQLPYPEYENLVDRSLQILLEEDKPLKTGSSLAVSRAAAGLASLGRWDASTWEVLVPLLRKNVQEGKEVELSNLALGLYDARETV 199 T 8.9 4HB pdbhh F T 8apn 51 YA Xb mL121 ITTLEFYEQNKEKLSFLVGNDVYSEFEEKWKPKPKVFDPEIPIAEAMWPEGRSEKLQSVIVKKLSFSTESAGSVVAGMKHGYCPDSVDVIVSRVEQLKAFEGYFPGFKTEEFVSVNPRLLNFSTDRIYWAMLTFQDMFSSAEVGPLMANIGHFIIENPIRCAKNLFCLATELESQLSMKLDVTAVKQDSWFTLSSEETIRERVAALATIFGSKTAGDILLRDINYLRLDSKDVNRDAVKIREHF 244 T 0.054 DUF732 pdbpssm F T 8apn 52 ZA Xc mL122 RKNIVWPEQLEEQKRNKQYEYQWVISNGKKFIARRESTSTKWEIWKSIEKVTPGQKP 57 T 0.0053 HTH_56 pdb F T 8apn 54 BB Xe mL124 FHTGVNLVQPIDTSKLTRQIKKLTLLHEAALTVLQYSNYCNPEQATEILRRLPFLMRHEESRVLKGQTLDPKLPPMFHGLLHVMGDRFVQVFSDCNLRQIERGAWALAAARHQHDGVALALSEKLKQLTQELLDLNAKPFNTRVTKPTPEQLNSGIFASRVLVPESVNQLPVKAVLPEFNALAGIAWALATVAGEHSAAAAKAALEQLAEKFGALQVDPKPLPDADSLCRLAWAFAKAGVHNPAAVDKLFHLAEERLKSQLQAHDPASGPLRPRCTYRYKTVRGWVDQHFPRKPRDSSYLGDTAPKIIPRDFEIDSLGSLLSAAALLRDQVPVERLQTILNLAAQHTAASSVAGGALQPLMVTYEEVTRVLAACEQLGFRSSTLVTPLLHGLPMAALSAEALSQLAAAATLHHVRSRTVYLRIVRAFNAKLSVSPTLVAGAGIGAEGKKEGEAAAALGAQLLLAVTKAGLPANASVSRIASLV 483 T 11 GYR pdb F T 8apn 55 CB Xf mL125 RPALTPSFSRVSDPWTGEKEAKYAAPYRIPEEVWKNSGAPKILFQDPWNSPDYDEVRKKHAVLVHDYLKQQSQPINVQTILEGVNKTHGLVLGTIEYVTSLLENMLWHDMAYVVKPVFSSPRKAKLSKIPLLYGANKYQQVFRGTPKEVAERYEAARAKHIKVAFTRLRTSKTPQPFRRRTDEYSHVQASQSALGLAAAAA 201 T 1.6 HARE-HTH pdbhh F T 8apn 56 DB Xg mL126 SVKYIPNHAATPNKYKDAQQKVLWDRAKKLGKKPEYKVPNIKDTQTVFEIGKLTKLCLEHWKPMHFAAALGHVINVWTTQALKSGRYGGKSFTVRELLGFRSLPYGVNSITAVLPLQSPEDFLSQPLAKQPFSFKPVSVREEVKKIIASNPGLLIHNWSLKIEGQPNHPITDEDRAAAVIAICTSSFRARFNEAGDVAVALVLSRLARCGYWLPPLYELIAPFAAFQGARIDHSSPAVIANVLLVLARAKGQAEMGQPTALQIRAIAPALEQKCLQRLGELLPSLEALVISDTLAATALLSSPEARALLAQIKAEVLARNFLGFESRDIIACFKELVANVYQPLQLSADLPAPGELRDELPGGEKVLDEQLLAALSGAVVEGGALXXXXXXXXXXXXXXXXXXXXXXXXX 410 T 15 hDGE_amylase pdbhh F T 8apn 58 FB Xi mL128 KFQSRAEKKYRIMDEKVGKPRFQA 24 T 11 HJURP_C pdbhh F T 8apn 59 GB Xj mL129 AKRLLREAPPFTEQVDEKNFDDKDYLGAGAMSNEQLRKELEKVEPGEAKWDEESPLIPPPQPRQYRNKGHQ 71 T 1.5 La_HTH_kDCL pdbhh F T 8apn 64 LB Ba bS1m VSLPMTKAELRASLLYKMRKDTLTKVIDSSLSTSVLTIEEKEKIDRTLYAHIETENNHIDPGLHCLIRALRRSNLTMPILKGQQIQAKVIQKTDEVMLLDPGFYNLSEVPVNYLTTAHIVRKVDDSPRENLYDVRPGDVVKVLVDDVYTPYGDMQLDVPQQDPRLILNQVWDELHLKMKKKELVRGRILNECKSGYAVGVAGFVALLPYANTSREVANRVGEAQSFQIKSMSEPHRRRIVLQ 242 T 0.00036 MRP-S35 pdbhh F T 8apn 71 SB Bh uS8m AAQAYFDLRYHVKKQGLLTVNRAASIINSIFPEFSHESHRNQLAVPLPRKEIPTYIMQNAKVQPWALLPTKAAAYAQYPNFFRSSSLFFGSLNREIVNRRPYSLLPADKLSMDLAQVCTNLGILNGWDIVQKREKLKDLDFVWPANELPRDHHEVKLFKHLHLRLALKWEQHKPLWEDGSMVKDQREYRDQQQVQQQQPLPHLPLAPLFGPLPLTVRNLSKASQPVLLYPLQLRELAQRMPSGLFLLYHHELGVITDAQAFLFDVPVVALAHVGLPVSMAAAVNGAVNRTFRAELGKPLREVTKLKDWSLSATIAAQVRERRQQLLERAEQTKRERKQIQDLVTVRVGKFKAEVDKEDSSLALQDELLAWQLKE 374 T 59 OPA1_C pdbhh F T 8apn 84 FC Bu mS23 AFSRYFVQKFKQSYTRKYMRDMESGAFSFPKCHDILGKYRPDVLFAAAAAPLKLELPEQAVYKKLYRDFPELRKDAVDLSSLEAPLAKQFALKHLVLSAEIAANSPRTRHILRRDLEADPAYERLKEEFMPRIAELRKQQEQTASLQQLQADEEEHLKLALTYVAAQ 167 T 64 UIM pdbhh F T 8apn 87 IC Bx mS31 SSAARWRAAIAQRLGVEAAAAAQALAALLGQGDLALTVLAAASEADVLNITELLENNSVDEAVTNARKVAIVSGHGLFLATATSEDLAALSDVEAGELAALMGKVHVVGLPLADALLGSDSLTHDQLLTLTRSEKQALLWRLASVGKLREGRAKAVAALRKAALDRAAAAAEASEGLLSAAAMMKLEHDIAEFDLVRERYLPGPGLPEGVQEAFAPSGLPSAFSRDEQALYDAYFGLRSHAASAQPEPLEGPSAAQLHSSFLDGFQCREEDSQMEELPESFGQWVANIKGLIVKAPVPLLGLLAKFVTAKIDGADARDASETQSRLRLLAAEIATDIARRREARLAVSPWWQRASAPIDALAISSIDHPSSDPLVQLLEVLLGHSGADEFGSWISAVAMRPVSPYEILADEHRLMDLERYLSMTSASELHLELAATPLPWASPAVHVPPAAFLEEMRAKFNNYLLATGLSPLSAAEWSAYKDWALEEFAEKRALGEEALLQEGHSGFFNPKADEIYLRALLEATIPPEAPLREQAVRYLETVNMNKTWTFLKKKHMVQRLAELSRHLTEHPPVEEQGSPFAALFAVGPGAKPTPLVPKLSKRLPAHGPESLDLPELPEIFR 621 T 43 MRP-S31 pdbhh F T 8apn 92 NC BC mS45 PSVNDLASLLSLSEQYRGADVLAEGAALPGTGFANARGTFLPHELPTAIEYLKELDPEAEMKLEQMEAMYKLLYSRNESEREVGRQMMYDLLKLSGHPFRELELCNWDYMAAFLDARVAGRVFHRGSGERLVHRTATFPAFEGYPLAEVDQTTEGEVSKLNREESKRQDNAMFQDFRKKLLFNLGMVGEQLWEPVQGVLSANLRSALDRPLVVYDITAATGETVYPPKFVAEVDGTRRALNEQERAYQAKRKPGPRLPYYMRRIARKEEL 270 T 0.19 TOM6p pdbpssm F T 8apn 94 PC BE mS106 IAPSQLDKLEKFVHVRPPKTDYEEDIKQAISSVTDNEGLKKCLDLFLTNHAAQTWVGKHEYANAGQVSDFIKVCEKVGSSAPLVTLWQSAYRYGVDPTVPLLRSSAAACAALGEGADAALVLLYGSCYIVNVPEDLAAAVRTALEAHEKAQEGNAEALAKVQKYREALDRL 171 T 0.019 PfaD_N pdb F T 8apn 95 QC BF mS107 MATIPKGLDIDPESPMLYHYFKSIHPHQVSFRIKKRKQLQHLWELCKLYENKMDTLASAAMLGQLFRLQKRNNPDYSVELANQIFEHCVKRLSFTIRFATYQEIVPVLFTLARMNVSIVPSDTLLLDPTHRVSREFVHLFLKRAVRNHVHIRVVNPRQMARVLWATAKLFPEDQRMDPRVQDAVDKLARSSVKRLSELHPGSLSIYASAFAKLSPAPTSQEGPLKDVDVSSWDATITGVKSSLLDLDSKELAFVARARTLKVFQGISREILLRVGDLNHEQFTVRNVFHVLGAYIRAQIQDPLVAKVLAENITGRIQDVYAEELIALVRAAERLDGFKNPDLTAAVLRRAREVDLPEETQKDYAKRLQSA 370 T 43 TYA pdbhh F T 8apn 96 RC Ya uS4m-2 ADLVRHLQSSGSKLQKLANLTSASCSYRDISVSLFGLQARQLGCTKPFVYTFSASEQAQSKSKPFDGKLRLPDVSQLSDTITVSGDAAPATLNLDQKYMDKISCWTQTIPSHLEMSYQNLTSVKLFPPVDASYPTYVDFEHATRLLDHFTSRKRLNYQRKMIKKRDKFQIKSWDHHAGEA 180 T 23 KN_motif pdbhh F T 8apn 97 SC Yb mS108 HRFRNNKFLRLEPDLDPKVYGQTETLQKQVDDNFSLLLAKHRLDMKAAAA 50 T 6 SMAP pdbhh F T 8apn 99 UC Yd mS110 TTRRRKLLGSRYGARLAKKNRQQFERARVILDCYSDEELRPDSPPVAVKEVTINTLQTMRFLFPQTSKEHLDIKQNIASYKIFSLNRELLSLLPK 95 T 1.3 DUF3135 pdbhh F T 8apn 100 VC Ye uS3m-2 GRHLLPATAIRVRLNRGFESAWYTDVSYREMIKKDFLLAKLASSFVNRSSRASLRQIFPGGKDFPNFRTSRIFMQHLPYKSYASTFSYVAPKDGPQAKYGLFQSKL 106 T 20 YebO pdbhh F T 8apn 101 WC Yf mS111 PFNVSDANPKDVEFLQVLLSKFLPDADKATVYRTGQEPRRLRLGDLPATSQFMESFVSEKLPKEPLYDMPSWLANNMPQYDAQPKSPHYHWSSWMRQHLSLDLQRLYAAFAEYMASEPHRLGIVRQANFELARLWDWQHRRVAAGLSPDL 150 T 8.8 DUF2497 pdbhh F T 8apn 102 XC Yg mS112 SADVYKEFFKMARVAVRTMKEPTKSIMKDLQRNARRSENIQRDNNLDKGVYMRFLRQRAGLSVPPIK 67 T 9.2 CCDC53 pdbhh F T 8apn 103 YC Yh mS113 GADGVLRRHNEVRGALRFFLDSWYKNKTSGTIADKNQVMFDYLKYKGVTEIAQHLHAPAPQVFRK 65 T 0.12 Nudix_N pdbpercent F T 8apn 105 AD Yj mS115 EYNGQGYVFSLLQRPPAPTLELLAEYLTVKYQDVIAQRDFVTHILGRMSVLERGGELPAADAAASGTWTGGAKRRLSPQEIRDINGELNRLFDADLNEYVSLAQRLATENVLSPADLATCLQAARSKAQTSSFASLAAPGSSNVDRNILAQVLQGKQDVSALAAAAAAAAASGPEGARVAWDEALQVGKYGAWATKAKAWAADDIAARREKGQQISPEQEAALVCLWDNPLSYDAAAGLWHQYAEKAGAVSAPSLADVISADQAIQAAKAAAAADPASLPAVKATAEKAAQVQEAVKKLYLGFAARQGSTSGAVTVDGVPLPFADVVKANAELDVASPAALAAAFQPLELGELLACHWEAVSRTFMWEDMYQLMLETAKEIEVNGA 386 T 0.15 Ykof pdbpercent F T 8apn 106 BD Yk mS116 GPLPEDVFLVAPKVAAAVQQTQAQLIDLLAPYGYSFDAFSEAVLEDLSKTKELCVKARFVLWEARVLEALEAVRPFVSGPVFRTESEAAALT 92 T 5 POTRA_TamA_1 pdbhh F T 8apn 107 CD Yl uS7m-2 TVVLAPSKYDSQLKIPLKPTEMDEFEELRSFVDISIEKEADYVMNKFVGRLIKGGEKATAQQVLLRTLLHTRRLMQEGNITSLK 84 T 0.13 Ribosomal_S7 pdb F T 8apn 110 FD Ub mL105 RDEIIKLLESRKDMDVNGYVMYCREELGKLTVPRPRAPPVSPKHEDYKTFVDEERVTYMRMKQHEKISLFLTEEEKNTVTTKGKDILDDKRFIQTIASRTGFYIAEEVRDCLSEFFNFRDSSRRLLTYYA 130 T 0.025 DUF5863 pdb F T 8apn 111 GD Ua Unknown XXXXXXXXXXXXXXXXXRXXXXXXXXXXXXXX 32 T 6600 zf-C2H2 pdbhh F F 8apn 112 HD Ud Unknown XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 43 F F F 8apn 113 ID Ue Unknown XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 47 F F F 8apn 114 JD Uf Unknown XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 73 F F F 8apn 115 KD Ug Unknown XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 63 F F F 8apn 116 LD Uh Unknown XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 48 F F F 8apn 117 MD Ui Unknown XXXXXXXXXXXXXXXXXXXXXXXXXX 26 F F F 8apn 118 ND Uj Unknown XXXXXXXXX 9 F F F 8apn 119 OD Uk Unknown XXXXXXXXXXXXXXXXXXXXXXX 23 F F F 8apn 120 PD Ul Unknown XXXXXXXXXXXXXXXX 16 F F F 8apn 121 QD Um Unknown XXXXXXXXXXX 11 F F F 8apo 21 U Am uL18m PLKPLVFRPLKLFFWATNRHVHAKVVRFSSPFNEGVPVVDISTFDAFNKLKGSAPVVAPRSLECYKEVAKMVKEETARQNINEVTLHLNSHASDRGVREVVRELKNLGLLVKKV 114 T 0.0038 Ribosomal_L18p pdbhh F T 8apo 33 GA Ay bL31m LVRPLTKAMTVVLSNGATLRLPTVYARAKPWFPVMDLHSHNVWKHKIKTDFQLESEKNITPDFSNFYNKFGK 72 T 0.002 Ribosomal_L31 pdbhh F T 8apo 39 MA AE mL40 EGNTRLQKVVSFFVPEVEKKEEEEKLATQYKRWKVAQVHAWNHDIAVKHRLQTEAIASLPQRLKEQALKPDYSPIPLNRKLLFHTPPESYRD 92 T 7.3E-05 MRP-L28 pdbhh F T 8apo 43 QA AI mL63 VVFKTTGGKAWNPPGGLKPLTNTQKRSRKENLQILLRNLSVLKLAAENQPEVTVNLFSPLKFMH 64 T 0.0048 L31 pdbhh F T 8apo 45 SA AK mL87 RPIMHKNWDWEFVVGAKAGRKPAIQRPKPHQWYYCNPKYSAEDPLPTKIFPPHAPPTAESLDDWAKFRKLCPKDPVEAKKFRKHFVRFLNQRNYDWRTAFERGLAKEVAVAKAAQRAEDETKRQEAWHAYRTAVFESAL 139 T 6 Serglycin pdbhh F T 8apo 46 TA,UA,VA AL,AM,AN mL116 GTIFNTGVPGPRPEVAQKLSTEYQGHILRMISLAESASELDEVLWSSKKHLRPVHIARSCLKLEYLRTKEKGREVSEPIKNLASELENYVELYSTKFTIGQVSQLVRGLSSIRRNIQPDLLLKLAAVVVADDGRQVQLANEMDCRDLFFGFFSQGFDNELFWKRLSESVLPRLPYFNADVVSTVLRVVSGLRFLHNTEFAHATMTALVPKVGDLSPARLADAFFSASLLDPTDVSGLNAKLEERFLREFTSFPIKDTVTMFQTVTVRRHSTPELAAQVAPLVAAQAHQLPVRHLRRALEGMVTAGWKDTAEIPLYAILAKQAARLVLGKQSAATSAILGKHVDNQGYQRTPVQLLRQLARIFANTGLKAGPGANQPLAPYFAALQRELEGRLAELDEQVTDDFAESFKKVGIAEGARVQI 420 T 9.8 FAST_1 pdbhh F T 8apo 47 WA AO mL118 YTWHFLSRQRVEAVNKATDILELEDIMRLEGNKYDYIAIRAFLKRVCILLQERADALGLPPSNEGLLVRFDEPERARYEALVSQVCDVVSARAKWFDPSNAAAVAYCLTRWLGRAEAPLIEQLLRRVVARLPEAKSKDVQYALDATLESAAAPHLEHLREPMLRAAGAFLGAKLPTGRVPPEVVAKITRLLVNHWDQPDEELLEAIVTDIAVRLEIYSPTALGRTLLALSKVPALTGAAFKRSRSSFLPEGVNVPSGADVAVPLADACLAHVAAHAAEHANEHDLIKFLGAISKLASPGRAATAGADAGAEATESGAAWAKRNSASLAWFALEQRLAPSTRGSFEGNQFPFVIKLVSAAARPPPAVTKFISSTVAKE 377 T 3.4 TAN pdbpercent F T 8apo 48 XA Xa mL120 IEEYYVVPPACPPPPHNPKTLKYVPKLNRTQIIARVYEAKTPTELENSALGKKFFSEFAAVAKLVRLSQLRRANVYNSRDDAMCVSIYNTSVRVADKLLHLSSDEELCGLIWALSQLPYPEYENLVDRSLQILLEEDKPLKTGSSLAVSRAAAGLASLGRWDASTWEVLVPLLRKNVQEGKEVELSNLALGLYDARETV 199 T 8.9 4HB pdbhh F T 8apo 49 YA Xb mL121 ITTLEFYEQNKEKLSFLVGNDVYSEFEEKWKPKPKVFDPEIPIAEAMWPEGRSEKLQSVIVKKLSFSTESAGSVVAGMKHGYCPDSVDVIVSRVEQLKAFEGYFPGFKTEEFVSVNPRLLNFSTDRIYWAMLTFQDMFSSAEVGPLMANIGHFIIENPIRCAKNLFCLATELESQLSMKLDVTAVKQDSWFTLSSEETIRERVAALATIFGSKTAGDILLRDINYLRLDSKDVNRDAVKIREHF 244 T 0.054 DUF732 pdbpssm F T 8apo 50 ZA Xc mL122 RKNIVWPEQLEEQKRNKQYEYQWVISNGKKFIARRESTSTKWEIWKSIEKVTPGQKP 57 T 0.0053 HTH_56 pdb F T 8apo 52 BB Xe mL124 FHTGVNLVQPIDTSKLTRQIKKLTLLHEAALTVLQYSNYCNPEQATEILRRLPFLMRHEESRVLKGQTLDPKLPPMFHGLLHVMGDRFVQVFSDCNLRQIERGAWALAAARHQHDGVALALSEKLKQLTQELLDLNAKPFNTRVTKPTPEQLNSGIFASRVLVPESVNQLPVKAVLPEFNALAGIAWALATVAGEHSAAAAKAALEQLAEKFGALQVDPKPLPDADSLCRLAWAFAKAGVHNPAAVDKLFHLAEERLKSQLQAHDPASGPLRPRCTYRYKTVRGWVDQHFPRKPRDSSYLGDTAPKIIPRDFEIDSLGSLLSAAALLRDQVPVERLQTILNLAAQHTAASSVAGGALQPLMVTYEEVTRVLAACEQLGFRSSTLVTPLLHGLPMAALSAEALSQLAAAATLHHVRSRTVYLRIVRAFNAKLSVSPTLVAGAGIGAEGKKEGEAAAALGAQLLLAVTKAGLPANASVSRIASLV 483 T 11 GYR pdb F T 8apo 53 CB Xf mL125 RPALTPSFSRVSDPWTGEKEAKYAAPYRIPEEVWKNSGAPKILFQDPWNSPDYDEVRKKHAVLVHDYLKQQSQPINVQTILEGVNKTHGLVLGTIEYVTSLLENMLWHDMAYVVKPVFSSPRKAKLSKIPLLYGANKYQQVFRGTPKEVAERYEAARAKHIKVAFTRLRTSKTPQPFRRRTDEYSHVQASQSALGLAAAAA 201 T 1.6 HARE-HTH pdbhh F T 8apo 54 DB Xg mL126 SVKYIPNHAATPNKYKDAQQKVLWDRAKKLGKKPEYKVPNIKDTQTVFEIGKLTKLCLEHWKPMHFAAALGHVINVWTTQALKSGRYGGKSFTVRELLGFRSLPYGVNSITAVLPLQSPEDFLSQPLAKQPFSFKPVSVREEVKKIIASNPGLLIHNWSLKIEGQPNHPITDEDRAAAVIAICTSSFRARFNEAGDVAVALVLSRLARCGYWLPPLYELIAPFAAFQGARIDHSSPAVIANVLLVLARAKGQAEMGQPTALQIRAIAPALEQKCLQRLGELLPSLEALVISDTLAATALLSSPEARALLAQIKAEVLARNFLGFESRDIIACFKELVANVYQPLQLSADLPAPGELRDELPGGEKVLDEQLLAALSGAVVEGGALXXXXXXXXXXXXXXXXXXXXXXXXX 410 T 15 hDGE_amylase pdbhh F T 8apo 56 FB Xi mL128 KFQSRAEKKYRIMDEKVGKPRFQA 24 T 11 HJURP_C pdbhh F T 8apo 57 GB Xj mL129 AKRLLREAPPFTEQVDEKNFDDKDYLGAGAMSNEQLRKELEKVEPGEAKWDEESPLIPPPQPRQYRNKGHQ 71 T 1.5 La_HTH_kDCL pdbhh F T 8apo 62 LB Ba bS1m VSLPMTKAELRASLLYKMRKDTLTKVIDSSLSTSVLTIEEKEKIDRTLYAHIETENNHIDPGLHCLIRALRRSNLTMPILKGQQIQAKVIQKTDEVMLLDPGFYNLSEVPVNYLTTAHIVRKVDDSPRENLYDVRPGDVVKVLVDDVYTPYGDMQLDVPQQDPRLILNQVWDELHLKMKKKELVRGRILNECKSGYAVGVAGFVALLPYANTSREVANRVGEAQSFQIKSMSEPHRRRIVLQ 242 T 0.00036 MRP-S35 pdbhh F T 8apo 69 SB Bh uS8m AAQAYFDLRYHVKKQGLLTVNRAASIINSIFPEFSHESHRNQLAVPLPRKEIPTYIMQNAKVQPWALLPTKAAAYAQYPNFFRSSSLFFGSLNREIVNRRPYSLLPADKLSMDLAQVCTNLGILNGWDIVQKREKLKDLDFVWPANELPRDHHEVKLFKHLHLRLALKWEQHKPLWEDGSMVKDQREYRDQQQVQQQQPLPHLPLAPLFGPLPLTVRNLSKASQPVLLYPLQLRELAQRMPSGLFLLYHHELGVITDAQAFLFDVPVVALAHVGLPVSMAAAVNGAVNRTFRAELGKPLREVTKLKDWSLSATIAAQVRERRQQLLERAEQTKRERKQIQDLVTVRVGKFKAEVDKEDSSLALQDELLAWQLKE 374 T 59 OPA1_C pdbhh F T 8apo 82 FC Bu mS23 AFSRYFVQKFKQSYTRKYMRDMESGAFSFPKCHDILGKYRPDVLFAAAAAPLKLELPEQAVYKKLYRDFPELRKDAVDLSSLEAPLAKQFALKHLVLSAEIAANSPRTRHILRRDLEADPAYERLKEEFMPRIAELRKQQEQTASLQQLQADEEEHLKLALTYVAAQ 167 T 64 UIM pdbhh F T 8apo 85 IC Bx mS31 SSAARWRAAIAQRLGVEAAAAAQALAALLGQGDLALTVLAAASEADVLNITELLENNSVDEAVTNARKVAIVSGHGLFLATATSEDLAALSDVEAGELAALMGKVHVVGLPLADALLGSDSLTHDQLLTLTRSEKQALLWRLASVGKLREGRAKAVAALRKAALDRAAAAAEASEGLLSAAAMMKLEHDIAEFDLVRERYLPGPGLPEGVQEAFAPSGLPSAFSRDEQALYDAYFGLRSHAASAQPEPLEGPSAAQLHSSFLDGFQCREEDSQMEELPESFGQWVANIKGLIVKAPVPLLGLLAKFVTAKIDGADARDASETQSRLRLLAAEIATDIARRREARLAVSPWWQRASAPIDALAISSIDHPSSDPLVQLLEVLLGHSGADEFGSWISAVAMRPVSPYEILADEHRLMDLERYLSMTSASELHLELAATPLPWASPAVHVPPAAFLEEMRAKFNNYLLATGLSPLSAAEWSAYKDWALEEFAEKRALGEEALLQEGHSGFFNPKADEIYLRALLEATIPPEAPLREQAVRYLETVNMNKTWTFLKKKHMVQRLAELSRHLTEHPPVEEQGSPFAALFAVGPGAKPTPLVPKLSKRLPAHGPESLDLPELPEIFR 621 T 43 MRP-S31 pdbhh F T 8apo 91 OC BD mS45 PSVNDLASLLSLSEQYRGADVLAEGAALPGTGFANARGTFLPHELPTAIEYLKELDPEAEMKLEQMEAMYKLLYSRNESEREVGRQMMYDLLKLSGHPFRELELCNWDYMAAFLDARVAGRVFHRGSGERLVHRTATFPAFEGYPLAEVDQTTEGEVSKLNREESKRQDNAMFQDFRKKLLFNLGMVGEQLWEPVQGVLSANLRSALDRPLVVYDITAATGETVYPPKFVAEVDGTRRALNEQERAYQAKRKPGPRLPYYMRRIARKEEL 270 T 0.19 TOM6p pdbpssm F T 8apo 92 PC BE mS106 IAPSQLDKLEKFVHVRPPKTDYEEDIKQAISSVTDNEGLKKCLDLFLTNHAAQTWVGKHEYANAGQVSDFIKVCEKVGSSAPLVTLWQSAYRYGVDPTVPLLRSSAAACAALGEGADAALVLLYGSCYIVNVPEDLAAAVRTALEAHEKAQEGNAEALAKVQKYREALDRL 171 T 0.019 PfaD_N pdb F T 8apo 93 QC BF mS107 MATIPKGLDIDPESPMLYHYFKSIHPHQVSFRIKKRKQLQHLWELCKLYENKMDTLASAAMLGQLFRLQKRNNPDYSVELANQIFEHCVKRLSFTIRFATYQEIVPVLFTLARMNVSIVPSDTLLLDPTHRVSREFVHLFLKRAVRNHVHIRVVNPRQMARVLWATAKLFPEDQRMDPRVQDAVDKLARSSVKRLSELHPGSLSIYASAFAKLSPAPTSQEGPLKDVDVSSWDATITGVKSSLLDLDSKELAFVARARTLKVFQGISREILLRVGDLNHEQFTVRNVFHVLGAYIRAQIQDPLVAKVLAENITGRIQDVYAEELIALVRAAERLDGFKNPDLTAAVLRRAREVDLPEETQKDYAKRLQSA 370 T 43 TYA pdbhh F T 8apo 94 RC Ya uS4m-2 ADLVRHLQSSGSKLQKLANLTSASCSYRDISVSLFGLQARQLGCTKPFVYTFSASEQAQSKSKPFDGKLRLPDVSQLSDTITVSGDAAPATLNLDQKYMDKISCWTQTIPSHLEMSYQNLTSVKLFPPVDASYPTYVDFEHATRLLDHFTSRKRLNYQRKMIKKRDKFQIKSWDHHAGEA 180 T 23 KN_motif pdbhh F T 8apo 95 SC Yb mS108 HRFRNNKFLRLEPDLDPKVYGQTETLQKQVDDNFSLLLAKHRLDMKAAAA 50 T 6 SMAP pdbhh F T 8apo 97 UC Yd mS110 TTRRRKLLGSRYGARLAKKNRQQFERARVILDCYSDEELRPDSPPVAVKEVTINTLQTMRFLFPQTSKEHLDIKQNIASYKIFSLNRELLSLLPK 95 T 1.3 DUF3135 pdbhh F T 8apo 98 VC Ye uS3m-2 GRHLLPATAIRVRLNRGFESAWYTDVSYREMIKKDFLLAKLASSFVNRSSRASLRQIFPGGKDFPNFRTSRIFMQHLPYKSYASTFSYVAPKDGPQAKYGLFQSKL 106 T 20 YebO pdbhh F T 8apo 99 WC Yf mS111 PFNVSDANPKDVEFLQVLLSKFLPDADKATVYRTGQEPRRLRLGDLPATSQFMESFVSEKLPKEPLYDMPSWLANNMPQYDAQPKSPHYHWSSWMRQHLSLDLQRLYAAFAEYMASEPHRLGIVRQANFELARLWDWQHRRVAAGLSPDL 150 T 8.8 DUF2497 pdbhh F T 8apo 100 XC Yg mS112 SADVYKEFFKMARVAVRTMKEPTKSIMKDLQRNARRSENIQRDNNLDKGVYMRFLRQRAGLSVPPIK 67 T 9.2 CCDC53 pdbhh F T 8apo 101 YC Yh mS113 GADGVLRRHNEVRGALRFFLDSWYKNKTSGTIADKNQVMFDYLKYKGVTEIAQHLHAPAPQVFRK 65 T 0.12 Nudix_N pdbpercent F T 8apo 103 AD Yj mS115 EYNGQGYVFSLLQRPPAPTLELLAEYLTVKYQDVIAQRDFVTHILGRMSVLERGGELPAADAAASGTWTGGAKRRLSPQEIRDINGELNRLFDADLNEYVSLAQRLATENVLSPADLATCLQAARSKAQTSSFASLAAPGSSNVDRNILAQVLQGKQDVSALAAAAAAAAASGPEGARVAWDEALQVGKYGAWATKAKAWAADDIAARREKGQQISPEQEAALVCLWDNPLSYDAAAGLWHQYAEKAGAVSAPSLADVISADQAIQAAKAAAAADPASLPAVKATAEKAAQVQEAVKKLYLGFAARQGSTSGAVTVDGVPLPFADVVKANAELDVASPAALAAAFQPLELGELLACHWEAVSRTFMWEDMYQLMLETAKEIEVNGA 386 T 0.15 Ykof pdbpercent F T 8apo 104 BD Yk mS116 GPLPEDVFLVAPKVAAAVQQTQAQLIDLLAPYGYSFDAFSEAVLEDLSKTKELCVKARFVLWEARVLEALEAVRPFVSGPVFRTESEAAALT 92 T 5 POTRA_TamA_1 pdbhh F T 8apo 105 CD Yl uS7m-2 TVVLAPSKYDSQLKIPLKPTEMDEFEELRSFVDISIEKEADYVMNKFVGRLIKGGEKATAQQVLLRTLLHTRRLMQEGNITSLK 84 T 0.13 Ribosomal_S7 pdb F T 8apo 109 GD Ub mL105 RDEIIKLLESRKDMDVNGYVMYCREELGKLTVPRPRAPPVSPKHEDYKTFVDEERVTYMRMKQHEKISLFLTEEEKNTVTTKGKDILDDKRFIQTIASRTGFYIAEEVRDCLSEFFNFRDSSRRLLTYYAD 131 T 0.025 DUF5863 pdb F T 8apo 110 HD Ua Unknown XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 32 F F F 8apo 111 ID Ud Unknown XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 44 F F F 8apo 112 JD Ue Unknown XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 47 F F F 8apo 113 KD Uf Unknown XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 73 F F F 8apo 114 LD Ug Unknown XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 63 F F F 8apo 115 MD Uh Unknown XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 48 F F F 8apo 116 ND Ui Unknown XXXXXXXXXXXXXXXXXXXXXXXXXX 26 F F F 8apo 117 OD Uj Unknown XXXXXXXXX 9 F F F 8apo 118 PD Uk Unknown XXXXXXXXXXXXXXXXXXXXXXX 23 F F F 8apo 119 QD Ul Unknown XXXXXXXXXXXXXXXX 16 F F F 8apo 120 RD Um Unknown XXXXXXXXXXX 11 F F F 8aqa 3 C C (1R,2R,3S,4R,6S)-4,6-diamino-2,3-dihydroxycyclohexyl 2,6-diamino-2,6-dideoxy-alpha-D-glucopyranoside XGXXK 5 T 950 NACHT pdbhh F F 8aqb 3 C C (1R,2R,3S,4R,6S)-4,6-diamino-2,3-dihydroxycyclohexyl 2,6-diamino-2,6-dideoxy-alpha-D-glucopyranoside XGXKW 5 T 68 DUF3402 pdbhh F F 8aqk 3 C C (1R,2R,3S,4R,6S)-4,6-diamino-2,3-dihydroxycyclohexyl 2,6-diamino-2,6-dideoxy-alpha-D-glucopyranoside XGXKX 5 T 1300 LPAM_2 pdbhh F F 8aqm 2 C,D C,D NCOR2_HUMAN N-COR2,CTG REPEAT PROTEIN 26,SMAP270,SILENCING MEDIATOR OF RETINOIC ACID AND THYROID HORMONE RECEPTOR,SMRT,T3 RECEPTOR-ASSOCIATING FACTOR,TRAC,THYROID-,RETINOIC-ACID-RECEPTOR-ASSOCIATED COREPRESSOR HASTNMGLEAIIRKALMGKYDQW 23 T 15 PPS_PS pdbhh F Eukaryota T 8aqn 2 C,D C,D NCOR2_HUMAN N-COR2,CTG REPEAT PROTEIN 26,SMAP270,SILENCING MEDIATOR OF RETINOIC ACID AND THYROID HORMONE RECEPTOR,SMRT,T3 RECEPTOR-ASSOCIATING FACTOR,TRAC,THYROID-,RETINOIC-ACID-RECEPTOR-ASSOCIATED COREPRESSOR HASTNMGLEAIIRKALMGKYDQW 23 T 15 PPS_PS pdbhh F Eukaryota T 8ar0 1 A A TLR2_HUMAN TOLL/INTERLEUKIN-1 RECEPTOR-LIKE PROTEIN 4 MSVSECHRTALVSGMCCALFLLILLTGVLCHRFHGLWYMKMMWAWLQAKR 50 T 0.00085 LRRCT unp F Eukaryota T 8ar2 1 A A TLR5_HUMAN TOLL/INTERLEUKIN-1 RECEPTOR-LIKE PROTEIN 3 MEEVLKSLKFSLFIVCTVTLTLFLMTILTVTKFRGFCFICYKTAQRLVFK 50 T 0.13 CoV_E pdb F Eukaryota T 8ar3 1 A A TLR9_HUMAN Toll-like receptor 9 MEALSWDCFALSLLAVALGLGVPMLHHLCGWDLWYCFHLCLAWLPWRGRQ 50 T 0.0096 Tlr3_TMD pdbhh F Eukaryota T 8are 2 B B PHRE_BACSU PHOSPHATASE REGULATOR E,PHRE SRNVT 5 T 0.048 4HB_MCP_1 unppssm F Bacteria F 8arn 2 C,D C,D Endogenous tetrapeptide (SER-ASN-SER-SER) SNSS 4 T 440 GerPB pdbhh F F 8as2 2 B B C-C chemokine receptor type 5 APERASSVYTRSTGEQEISVGL 22 T 150 Pardaxin pdbhh F T 8as3 2 B B C-C chemokine receptor type 5 APERASSVYTRSTGEQEISVG 21 T 120 Pardaxin pdbhh F T 8asi 4 D,H D,H Q3J2Z2_CERS4 Cytochrome b-c1 subunit IV MFSFIDDIPSFEQIKARVRDDLRKHGWEKRWNDSRLVQKSRELLNDEELKIDPATWIWKRMPSREEVAARRQRDFETVWKYRYRLGGFASGALLALALAGIFSTGNFGGSSDAGNRPSVVYPIE 124 T 0.052 MtrF pdbpercent F Bacteria T 8asj 4 D,H D,H Q3J2Z2_CERS4 Cytochrome b-c1 subunit IV MFSFIDDIPSFEQIKARVRDDLRKHGWEKRWNDSRLVQKSRELLNDEELKIDPATWIWKRMPSREEVAARRQRDFETVWKYRYRLGGFASGALLALALAGIFSTGNFGGSSDAGNRPSVVYPIE 124 T 0.052 MtrF pdbpercent F Bacteria T 8at2 1 A B A0A8J1L9M8_XENLA LOC495502 PROTEIN MDEKSTKIIMWLKKMFGDKPLPPYEVNTRTMEILYQLAEWNEARDKDLSLVTEDLKLKSAEVKAEAKYLQDLLTEGLGPSYTNLSRMGNNYLNQIVDSCLALELKNSSLSSYIPAVNDLSSELVAIELNNQEMEAELTSLRKKLTEALVLEKSLERDLKKAEEQCNFEKAKVEIRSQNMKKLKDKSEEYKYKIHAAKDQLSSAGMEEPLTHRSLVSLSETLTELKAQSMAAKEKLNSYLDLAPNPSLVKVKIEEAKRELKATEVELTTKVNMMEFVVPEPSKRRLK 286 T 0.0091 ATG16 pdbpssm F Eukaryota T 8at3 1 A A A0A8J1L9M8_XENLA LOC495502 PROTEIN MDEKSTKIIMWLKKMFGDKPLPPYEVNTRTMEILYQLAEWNEARDKDLSLVTEDLKLKSAEVKAEAKYLQDLLTEGLGPSYTNLSRMGNNYLNQIVDSCLALELKNSSLSSYIPAVNDLSSELVAIELNNQEMEAELTSLRKKLTEALVLEKSLERDLKKAEEQCNFEKAKVEIRSQNMKKLKDKSEEYKYKIHAAKDQLSSAGMEEPLTHRSLVSLSETLTELKAQSMAAKEKLNSYLDLAPNPSLVKVKIEEAKRELKATEVELTTKVNMMEFVVPEPSKRRLK 286 T 0.0091 ATG16 pdbpssm F Eukaryota T 8at3 7 G G B1H1T5_XENLA LOC100158301 PROTEIN MTGGKELGAAVELYERLQMLSCPCLEGVYLTDPQSIYELLCTPSSHRLDILQWLCSRIYPPVQEQLSSLKESQTDTKVKEIAKLCFDLMLCHFDDLDLIRGHASPFKQISFIGQLLDVIQYPDTISSNVILESLSHSTEKNVVTCIRENEELLKELFSSPHFQATLSPECNPWPADFKPLLNAEESLQKRATQSSKGKDMSNSVEALLEISSSLKALKEECVDLCSSVTDGDKVIQSLRLALTDFHQLTIAFNQIYANEFQEHCGHPAPHMSPMGPFFQFVHQSLSTCFKELESIAQFTETSENIVDVVRERHQSKEKWAGSTISTLCEKMKELRQSYEAFQQSSLQD 348 T 0.1 L27 pdbpercent F Eukaryota T 8at4 1 A A A0A8J1L9M8_XENLA LOC495502 PROTEIN MDEKSTKIIMWLKKMFGDKPLPPYEVNTRTMEILYQLAEWNEARDKDLSLVTEDLKLKSAEVKAEAKYLQDLLTEGLGPSYTNLSRMGNNYLNQIVDSCLALELKNSSLSSYIPAVNDLSSELVAIELNNQEMEAELTSLRKKLTEALVLEKSLERDLKKAEEQCNFEKAKVEIRSQNMKKLKDKSEEYKYKIHAAKDQLSSAGMEEPLTHRSLVSLSETLTELKAQSMAAKEKLNSYLDLAPNPSLVKVKIEEAKRELKATEVELTTKVNMMEFVVPEPSKRRLK 286 T 0.0091 ATG16 pdbpssm F Eukaryota T 8at4 7 G G B1H1T5_XENLA LOC100158301 PROTEIN MTGGKELGAAVELYERLQMLSCPCLEGVYLTDPQSIYELLCTPSSHRLDILQWLCSRIYPPVQEQLSSLKESQTDTKVKEIAKLCFDLMLCHFDDLDLIRGHASPFKQISFIGQLLDVIQYPDTISSNVILESLSHSTEKNVVTCIRENEELLKELFSSPHFQATLSPECNPWPADFKPLLNAEESLQKRATQSSKGKDMSNSVEALLEISSSLKALKEECVDLCSSVTDGDKVIQSLRLALTDFHQLTIAFNQIYANEFQEHCGHPAPHMSPMGPFFQFVHQSLSTCFKELESIAQFTETSENIVDVVRERHQSKEKWAGSTISTLCEKMKELRQSYEAFQQSSLQD 348 T 0.1 L27 pdbpercent F Eukaryota T 8au0 1 A,B,C A,B,C SUN1_HUMAN PROTEIN UNC-84 HOMOLOG A,SAD1/UNC-84 PROTEIN-LIKE 1 GSMSGVEQQVASLSGQCHHHGENLRELTTLLQKLQARVDQME 42 T 0.00057 Trimer_CC pdb F Eukaryota T 8au1 1 A,B,C,D,E,F,G,H,I,J,K,L,M,N,O,P,Q,R A,B,C,D,E,F,G,H,I,J,K,L,M,N,O,P,Q,R A0A7U0GB71_9CAUD Putative tail sheath protein MSEQITGSTPRIYYRGTKDSSVTRSTGSTTTLPLHRPLIMFFGQKGPTVPTWIDPVKFEDIYGSETTNLSGVYCTHSTPFIKEAIAAGNQFMALRLEPSDIPDVATLGLSVDWVKTKIDDYERNDDGTYKLDTNGDKIPLATQIDGIKFRFVLEKIETNESGVSQYKKRTAKAGTIGTEATPSTITPLADFRCRFKSSLGANTALRIWAPTINSAQAADADLQARIKSFLYRFQILTRADKASSPTIFETIYNEPSLSVGFGENLVDPQTEVVYDFVERIDSRYNDEDPSTYLMSPLDTPYLYQANIDSVLTAIQELEAPFDTVSADEDDLYQINLFGAQTVEGVPYHAVQILGVLDGGVTLTETATNYLQGGGDGTLGNDSFNAAAYAVLSNLSNNAAFNITNYARYPFNAFWDSGFDLKTKQTIPQLIGLRADTWIALSTQDISSDFNSNEEEESIALSLMSRVSAFPDSSDFGTPAFRGMIVGGAGYYTETTRKLPVPLTLDRFRAYCRYAGASDGVLKPEYAVDEGDARKVQVVKSINNLDKSWRVRRAQWNNNLVYVEDYDTNSQFYPGQQSFYSEQGSVLKAAIVGLCVANLNRFAFEAWRDLTGTQKLTDDQLIERSDDAVSTRGTGAFDDRLIFTPHSEITQADKERGYSWSMRIDFGANAFRTVMDMSSVAYTREELANG 689 T 0.17 XFP_N pdbpercent T Viruses T 8auv 28 BA Z A0A1S3Y4M0_TOBAC 40S RIBOSOMAL PROTEIN S11-LIKE MAAEGRTLSTKEADIQMMLAAEVHLGTKNCDFQMERYVFKRRNDGIYIINLGKTWEKLQMAARVIVAIENPKDIIVQSARPYGQRAVLKFAQYTGANAIAGRHTPGTFTNQLQTSYSEPRLLILTDPRTDHQPIKEAALGNIPTIAFCDTDSPMRYVDIGIPANNKGKHSIGVLFWLLARMVLQMRGSINQGHKWDVMVDLFFYREPEEAKEQQEEEAPAIDYADYSAGGDWSSSQIPEAQWTGDAAPSGPVVASGWSGEGVAEGGGWDTAAAPVPVPVSDAAPTAGGGWDTAAAPVPVPVSDAAPTAGGGWDTAAAPVPVPVSDAAPTAGATGWE 336 T 3E-13 Ribosomal_S2 pdb F Eukaryota T 8av0 2 B P RAF1_HUMAN PROTO-ONCOGENE C-RAF,CRAF,RAF-1 RSTSTPNVA 9 T 56 NB pdbhh F Eukaryota T 8avr 2 B,D B,D D-Aureocin A53 XXXXXXXXXXXXXGXXXXXXXXXXXGXXXXXXXXGXXXXXXXXXXXXXXGX 51 F F F 8avs 2 C,D C,D BACTERIOCIN D-AUREOCIN A53 XXXXXXXXXXXXXGXXXXXXXXXXXGXXXXXXXXGXXXXXXXXXXXXXXGX 51 F F F 8avt 1 A,B C,D BACTERIOCIN D-AUREOCIN A53 XXXXXXXXXXXXXGXXXXXXXXXXXGXXXXXXXXGXXXXXXXXXXXXXXGX 51 F F F 8avu 2 B B BACTERIOCIN D-AUREOCIN A53 XXXXXXXXXXXXXGXXXXXXXXXXXGXXXXXXXXGXXXXXXXXXXXXXXGX 51 F F F 8azw 44 RA l nascent chain XXXXXXXXXXXXXXXXXXXXXXXX 24 F F F 8b0b 2 B BBB HPSE_HUMAN Heparanase 8 kDa subunit QDVVDLDFFTQEPLHLVSPSFLSVTIDANLATDPRFLILLGSPKLRTLARGLSPAYLRFGGTKTDFLIFDPKKE 74 T 3.5 Glyco_hydro_79n unppercent F Eukaryota T 8b0c 2 B BBB HPSE_HUMAN Heparanase 8 kDa subunit QDVVDLDFFTQEPLHLVSPSFLSVTIDANLATDPRFLILLGSPKLRTLARGLSPAYLRFGGTKTDFLIFDPKKE 74 T 3.5 Glyco_hydro_79n unppercent F Eukaryota T 8b0f 7 G G CD59_HUMAN 1F5 ANTIGEN,20 KDA HOMOLOGOUS RESTRICTION FACTOR,HRF-20,HRF20,MAC-INHIBITORY PROTEIN,MAC-IP,MEM43 ANTIGEN,MEMBRANE ATTACK COMPLEX INHIBITION FACTOR,MACIF,MEMBRANE INHIBITOR OF REACTIVE LYSIS,MIRL,PROTECTIN MGIQGGSVLFGLLLVLAVFCHSGHSLQCYNCPNPTADCKTAVNCSSDFDACLITKAGLQVYNKCWKFEHCNFNDVTTRLRENELTYYCCKKDLCNFNEQLENGGTSLSEKTVLLLVTPFL 120 T 0.00021 UPAR_LY6 pdb F Eukaryota T 8b0h 1 A G CD59_HUMAN 1F5 ANTIGEN,20 KDA HOMOLOGOUS RESTRICTION FACTOR,HRF-20,HRF20,MAC-INHIBITORY PROTEIN,MAC-IP,MEM43 ANTIGEN,MEMBRANE ATTACK COMPLEX INHIBITION FACTOR,MACIF,MEMBRANE INHIBITOR OF REACTIVE LYSIS,MIRL,PROTECTIN MGIQGGSVLFGLLLVLAVFCHSGHMLQCYNCPNPTADCKTAVNCSSDFDACLITKAGLQVYNKCWKFEHCNFNDVTTRLRENELTYYCCKKDLCNFNEQLENGGTSLSEKTVLLLVTPFLAAAWSLHP 128 T 0.00025 UPAR_LY6 pdb F Eukaryota T 8b0p 2 B B Pam3-SKKKK SKKKK 5 T 230 BCCIP pdbhh F F 8b0u 2 B,D D,C B2V8L8_SULSY CalpT10 STSQKATYTDDFVLYRGDDFIEIIIDEKYLNKKVKILLDNDTIFNGILKDTSIFIPVKEQIDLEELAKHISILPEG 76 T 0.0095 LSM pdb F Bacteria T 8b14 2 B B RBP5_BPT5 pb5 bacteriophage T5 receptor binding protein MSFFAGKLNNKSILSLRRGSGGDTNQHINPDSQTIFHSDMSHVIITETHSTGLRLDQGAGDYYWSEMPSRVTQLHNNDPNRVVLTEIEFSDGSRHMLSGMSMGVGAKAYGIINPQIMSQGGLKTQITASADLSLDVGYFNTGTSGTIPQKLRDGTGCQHMFGAFSGRRGFASSAMYLGGAALYKSAWSGSGYVVADAGTLTIPSDYVRHPGARNFGFNAIYVRGRSCNRVLYGMEGPNYTTGGAVQGASSSGALNFTYNPSNPESPKYSVGFARADPTNYAYWESMGDPNDSANGPIGIYSEHLGIYPSKITWYVTNLVYNGSGYNIDGGLFNGNDIKLSPREFIIKGVNVNNTSWKFINFIEKNFNVGNRADFRDVGCNLSKDSPSTGISGIATFGLPTTESNNAPSIKGGNVGGLHANVVSIYNFLPSASWYVSSNPPKIGNNYGDVWSENLLPLRLLGGSGSTILSGNIVFQGNGSVHVGTVGLDLNSSRNGAIVCTMEFIDDTWLSAGGIGCFNPTEMLSQGAEYGDSRFRIGGNTINKKLHQILSLPAGEYVPFFTIKGTVVNACKLQAAAYNPTPYWVSGLPGSVGQTGYYTLTYYMRNDGNNNISIWLDSSMSNIIGMKACLPNIKLIIQRLT 640 T 0.18 NPM1-C unppercent T Viruses T 8b1r 4 D,E P,Q GP59_BPT7 GENE PRODUCT 5.9,GP5.9 MSRDLVTIPRDVWNDIQGYIDSLERENDSLKNQLMEADEYVAELEEKLNGTS 52 T 0.0021 Phage_GP20 pdbpssm T Viruses T 8b1x 1 A A P3-7_2 KKPGASLAALQALQALQAAQAAKKY 25 T 5.4 Asr pdbhh F T 8b2e 1 A A Muramidase LVLPGLDALQTRNALAIIAEAKKENVGPHGCQAAITTGLTESSLRILANNAVPPSLQYPHDGLGSDHDSIGIFQQRASIYKDIRCDMDAACSASQFFKVMKGVSGWQTLDVATLCQRVQKSAYPAAYQKFTALAVGVCKAGGL 143 T 66 Lys pdbhh F T 8b2f 2 C,D H,T GLY-GLY-GLY GGG 3 T 79 FTCD_C pdbhh F F 8b2k 2 B B RND3_HUMAN PROTEIN MEMB,RHO FAMILY GTPASE 3,RHO-RELATED GTP-BINDING PROTEIN RHO8,RND3 TDLRKDKAKSCTVM 14 T 25 DUF3012 pdbhh F Eukaryota T 8b2m 1 A A A0A1D3UV35_TANFO Tannerella forsythia Potempin A (PotA) MKQQIILWIGVLLLLIGGVGCKKDQSSCCDKEIIKDVSELTGIISYNTEVKRWYISVSDANSYDNVTLYFPCNLDSKYMKEKEKVIFSGQISKSTLKITLPAGTTSYCINLMSINKIN 118 T 0.00053 DUF4971 pdbhh F Bacteria T 8b2n 2 B,D B,D G8UM88_TANFA Tannerella forsythia potempin A (PotA) DQSSCCDKEIIKDVSELTGIISYNTEVKRWYISVSDANSYDNVTLYFPCNLDSKYMKEKEKVIFSGQISKSTLKITLPAGTTSYCINLMSINKIN 95 T 0.0008 DUF4971 unphh F Bacteria T 8b2q 2 B I G8UM88_TANFA Tannerella forsythia potempin A (PotA) QSSCCDKEIIKDVSELTGIISYNTEVKRWYISVSDANSYDNVTLYFPCNLDSKYMKEKEKVIFSGQISKSTLKITLPAGTTSYCINLMSINKIN 94 T 0.002 DUF4999 unphh F Bacteria T 8b2r 2 B B A0A219T3Y8_MAGOR AVR-PIK PROTEIN ( PIKMPROTEIN,PIKP PROTEIN ),AVRPI7 PROTEIN GPMRAIDLSRERDPNFFDNADIPVPECFWFMFKNNVRQDAGTCYSSWKMDKKVGPNWVHIKSDDNCNLSGDFPPGWIVLGKKRPGF 86 T 0.1 TMEM18 unp F Eukaryota T 8b2x 1 A M B8GLG4_THISH Type I-G CRISPR Cascade large subunit CSX17 MDKDMHINEIVLRGCAPTPLAAYLKALGVLRLVCEQVDATAKGWWQDECFMLRTRLDDNDLRRFFIEDYRPTPMLSPWNGGSGFYRKGNETAWSTLEKIITTQAERWRPFRDTAEVMADALEHLKLTEKPAELDKRALLARLRATLDDEFLPWLDAAVLLTDDKPDYPPLLGTGGNDGRLDFTSNYMQRLLEMFDPVTGKAQGDVGNKLESALFARPVPGMTALAIGQFSPGAAGGPNSSTGFDSGAQVNIWDYVLMLEGALLFAATATRRLESADPSALSYPFTVRPSGGGSGAVALGDERPARAEIWMPLWERPASLPELRVLLGEGRVTLNGRLPRDGLDFARAVAKLGTDRGVRAFQRYAFMMRSGKAYLATPLNRFHVHRNPKADLIDQLERGDWLRRFRRAARSTHAPARLQGLAHRLDDALFDLVRVADPRRVQEVLKVLGEVQFYLALSPSLREQVRPVPRLDAHWVEAARDDSHEFRVAAALAGLDDGLPMGVHLAPIDPVKRNVWAPESRLAVWGQGNLSDNLAQVLQRRLLTASRTDLNDKPLSGRCPADEGAVAAFLAGDADERRIAELMAGLACARLPARLPLRQRGASEASSLPMIYALLKPLFVPDAQLREAAVLTPDGCLPLPPALPRLLRAGPAGVGRAVDLARRRRRASGLADAGWRLTPPYPDGGRLLAALMIPVEIRVIKGFIKRLADHKSDEPATQDAS 720 T 0.16 DUF2795 pdbpercent F Bacteria T 8b3a 1 A,AA,B,BA,C,CA,D,DA,E,F,G,H,I,J,K,L,M,N,O,P,Q,R,S,T,U,V,W,X,Y,Z A,a,B,b,C,c,D,d,E,F,G,H,I,J,K,L,M,N,O,P,Q,R,S,T,U,V,W,X,Y,Z ACE-LEU-HIS-LEU-HIS-LEU-ARG-LEU-NH2 XLHLHLRLX 9 T 1.9 Peptidase_M74 pdbhh F F 8b3p 2 B,IA,M,TA,X FFF,III,GGG,JJJ,HHH G9P_BPF1 COAT PROTEIN C,POLYPEPTIDE II,G9P MSVLVYSFASFVLGWCLRSGITYFTRLMETSS 32 T 0.35 LapA_dom pdbhh T Viruses T 8b48 1 A,B,C,D A,B,C,D A0A6G1IIU9_9PLEO Carbohydrate esterase family 15 protein QAPSCPNLPASINYAANPKLPDPFLALSGTRLSKKDQWPCRKEEIRQLFQRYSYGTFPPRPESVTAAMSGNALKITVSEGSKSMSFSVNIKLPSSGAAPYPAIIAYGSASLPIPNTVATITYQNFEMAADNGRGKGKFYEFYGSNHNAGGMIAAAWGVDRIIDALEMTPAAKIDPKRVGVTGCSRNGKGSMIAGAFVDRIALALPQEGGQSAAGCWRIADEIQKNGTKVETAHQIVNGDSWFSTDFSKYVDTVPTLPWDNHMLHALYAYPPRGLLIIENTAIDYLGPTSNYHCATAGRKVHEALGVKDYFGFSQNSHSDHCGFPKAQQPELTAFIERFLLAKDTKTDVWKTDGKFTIDERRWIDWAVPSLSGLEQKLISEEDLNSAVDHHHHHH 394 F F Eukaryota T 8b4z 1 A,B A,B D0FZL0_RMBV1 Major capsid protein A MSGDNGVYSGSAAYNTATAPKVPVSRATFFQNTKSKDFDFKFADGADAIANVLQQMEHGVAQHQLGDMNVRTDGLATVSAVLNGRKRKIANQYMMHFDLFGRAARSTVRMESRIQSFGEGKDVDNFMAKFHNQLSGVYERRSEGVANFGRILATDTDLGGTSGLSVVFNGLLRGLHHVSTVPTPNVANLPIRNNRDGAGAVVGRGDMPGREFMDSSRILPPRSSRWYGAPGQPIVPPAPNNPPAHVAPMETVMAGLQKTVMNELNRVIVSIADVPKLPAHRIRNLIAVLAAVSKPNLGFDANRLEDHSCFTKGWLGFNDILLFPLTVDLFDRVVANEAGVNDAGFIVPNAAPPQFLQNTNQQVIDFRGVGVGQAGDIPALRLAQSWSDAIGFLLDTIGGEAQLAMGLNDMVAQCFHMHGAQTTMLSTPIISRADFGVYHNVVTNMYRRLAYMYTRLIRTNAAAGGGAMLDRQHYQWPTHAKVGFHDDTAVNAAAAAARIHDGLRQPLLDEAFGAGVVQPGNMDLVGAGIDFTRDLTSSLGKAYPEHRPIGADDNKRDLGDFTAGTVDAAASGYEWDNYVYRLFGNMSAMRSKAEFDRLLATFPSSTLSELFIWMGNVGFADTWEERWGYDAAPLCSIPIPAGHDRSMLRNWSWVNVHNVHSVTGTSENVVLAGYVGLSRTHDYIMDTRSTPATSQGRRLAAMFYYTNADKMLSLTFGLAGQLRAAADTTVAKFQICPHTIARAQGYIMTDNDPLSDELKGTDFVTEQFSLAGLTNLYLGYFDGLATRLGIYDLRYTYSEYAECRVELHGIQRNFLTDRLDAFVSYKCLHPIMFEYYMCGANISGGILNGDKAYEQVEMGNIRAYDAMFDTSAARDFNFVGVRGASQQIAAVGGFHIQYKMEVEIQRPGDGTEASRFNVYERYLNNYLRMSDCAPTSVLNAVSPLFWMAGTTRVVLCEAANGYKPMAYDISQTSFWNRENGLWAFTWGESEKTHRPNAIPHGTRRLGNSEVLMNSRFSKILDKKGITKLETRVGGRKRGDNNDDFVAADTRMFIIQDVAGGEHAAYSSLRDPGFALVRAAHTWDTFVQNPRMLLLERGYGNTGFTDTYSAAGIRRTNGHISLRLSALTDDFEFTMHPLARAEYKETSRVSLTSMIYVGTAGKDLSLPTGTVEDIIGAVDGMRRVVRTIGGQTIKTAPVVPPTEQRDMVQEERVGTPVKNAGNANPAADSDNATEGVVEPKN 1240 T 0.31 NTP_transf_3 pdb T Viruses T 8b58 2 C,D C,D Cyclosporin A XXXXXXXXVXA 11 T 9.7 DUF6090 pdbhh F F 8b59 1 A,B,C D,E,C D0FZL2_RMBV1 RnMBV1 Crown protein MGITYRDAQIFSACVEALSARNNRITLTSFPLTAGQGQAPTATPAWYPVDLFVADATAVYGRRQLFAWTVDKVRPTRNVAFVTDRVAMDFSAALLSLMAELEAVAPDVYAAIHGGATPGADLGDRITQLENRRVGCLAYVMATVVRAPITHNVRSFSAMLASDPQAHAALLAYLTPNSAGQLDGAPIYFRRSDVDLRNNHLALHAEVVPGLPNMVPLTKAMVEVALANVEWWSDPLGYDSLTSFGGLELLSLCDALAVCELSVAYGLKESGYCYLRFAGGCPLAEVILARLGYNPPLGVAVGWALYNGIKLDWYSKVISVGHNMRLHVCDTAGEANACLIDVLTGEYDGMPVGGVDTVSCWVEQLDLLAAAAGVGRNLSNLHCGVQTPPRTINTTRRRLLASLVRTLIADPTLTDEELLHGAVRGTLNGLPRDRALWRCLQVVNTTVREFLAQDLDAMVRDRRECTTYASRAAFAERCAMSGNASGLVGRQYSDMPAALEGEARACGLSAIDAIEIVRVVASGEPIRVLLDQHGRPATRPNGRLTADELRRCRPLVVGQGGQVGFLPFVPFIVGGVGATVAAASGLALATFATVTGAGAVALGGLGLGAGVAALSITVGQLSYQVTRRALTTILPGGREFGLDDLTRVLGGMVGRYISFVDTWFAHGRGDVFQETDAVPAGTVVFVLPNIEYEVLELRERALGRWSTLLVTTPNGVIAMRANGALPLRVVDAREAGQTFEWSTAARRRFTRAQANAINMMVTASKRVPGLKGSIDAAPSQGTGGSGTDLAGILQRLSALEQTSVPRAEFDALQGRVAACEAKITELEADRVPRIDFTELRDRVHHIDGIGLSCLAHLARDLGITVPHNVRTFRQMRANVGEVIWARFVDAVAESFSPMGGRPIFVRTDPAQPRNNHVSLVDEPTTTGFNGTVTPAMRRLTVADLTGDLVDTEWFSWTPYDASGPLGGTIEGIEAYLTDFTSKLKAELEATPTRTELGVAVGTRAPPLSDRLAAVERVIGMQEGNQVWRSNELRELWVAIDSIVTGRGQREFTTATIKWPAAFPSAVATAGRSFGQPGLAGYGELCTLARQLNALVAGVRNGVVSGMTRNGAGVLQLSTISSATGNLTSDQQAVLRACFFPATPRVGEYQIVYPVGGTMGLTRVDPSTNSSIGQYTRESLVAARNAMPRFAVHTTTPDTVGVAWDNQSAAGLPMGAAPVLTVSVNQLSGVPVTEADKQRWDAKQDKFKIVNTDDRVAALSWVDSVDGFAAPGSDMLLDYQAPAGTGSLPFGSKYAMAVAIGGSLGSQLSEAQVSAARVVLGNGVWRDAVIDVLRKLHNVMYGGKYGRIDDIAAMRSYLNDGTGLLPGSEPIVDVGGAEGNACARATILLRGFSSTMVGVDLKIQMLVELYGAEPATAALLYRGWTMQ 1426 T 0.13 DUF3375 pdb T Viruses T 8b5a 2 B B H4K20ApmTri HRXVLRDNY 9 T 7.8 Ribosomal_S13_N pdbhh F T 8b5b 2 D,E D,E H4K5acK8ApmTri XSGRGXGGXGLGK 13 T 7.5 DUF6272 pdbhh F T 8b5c 2 B B H4K5/8ApmTri XSGRGXGGXGLGK 13 T 10 ATPase_2 pdbhh F T 8b5r 2 G I I3 sequence being threaded through the p97 channel XXXXXXXXXXXXXXXXXXXXXX 22 F F F 8b6e 1 A,B A,B sCTP-23166 MGHHHHHHHHHHMAGYSRAVRCVETGVEYPSLSAAAKAMDLFGPQNIYKAIRLGKLAGGYHWVYVD 66 T 0.0024 NUMOD1 pdbhh F T 8b6f 1 A A0 Q22E24_TETTS Lipid-A-disaccharide synthase MLTHISRRYFSFTGRKTIFVAAGSPSHDLQAANFMRDLKKKSNNNYDFVGIGGPLMQAEGLNQSYADINKFIDKPFFPLKNFIRFHVARCYHPYMAPLHFFNKQVLNQVDKSSLLKDQVELSIPSAIITFGNEFFMKKLYVRLCDQYELHNKIRPPTFFYDRSHINQRFEFQDYLDHFFYTIPMKQINFQSFTYPSTCVGHEGVGRAIQYLFQNSKQYANVKSLVTANGLKIASNPKQHREIIEKLVEEQRGIQRARLGINESKNVFLLAPGNTKAEINFAVNLLSRSLEEFFKKPQLTNVSRDHFTIIITADNAQNAEFVNQAVSNTKYLKTLQTIVTTGEKEKFGAMCAADVGIPLNGELVSECAALQLPSVIISNMNLFYAYITQLYNNFYSDINFAIQGEAYHELVSTAANPYKLSDEIFDLYSDPKLRYHFAERYQNVVHEMIPQANSQDNIVTTDVATLHGVEVQERAFTYETIAAKVLKAARAYESLDKNIPNHQIDQHRKEKLIKAAF 516 T 8.5E-42 LpxB pdbpssm F Eukaryota T 8b6f 5 E A4 I7LUQ4_TETTS RNase III domain-containing protein MSGLLRNFEKLVCQSQLSKAGHKLLLRSPNSTLHPTAFYYKRNSSQRLANEMDVFQLGLAAAALTRQANNYAQLLDQVDKEAVREEVQERITQNHSDLNVYFGEILSLFKIGKKECPVQTVADISYVLAFGPIQVPNAAAIITENLLPVLKEKLDYASIHNLQDILSAFVKLNYVSDKELLKRLITALSQKDFPNQLQPVTNHAWNIDQYEYSDCNSWNIVSCGDNTFEKYIHEGGCENSLAKAKFAVHELLDHISFNFVNPFLFRENRINHRFAKRNADLDHEVLMQTLSKLQEIVPETSEAIATIKARL 311 T 0.034 Baculo_F pdb F Eukaryota T 8b6f 6 F A5 I7MIJ7_TETTS 37S ribosomal protein S25, mitochondrial MRKALERFNEIIFNPAIRWYQLPKPTVRRTRYPAPGSEPINREVHQIDYKTAFRDSPHNIRYHHEIHTSDQTYHSSYDPVGETTTERLVRYGYLNKDQVNNAEAVAAAAKEFQEKEKRSPSNNIIIDEISNSDKPITKENRESVAHHVRQQFEFFREVNAEEVWSVSIEEKYNPELYIYKTYDMAADDPVWRQVKLDLEWTFENIAERRESLGYMPTFKGDPNFWQALDNSFSPENIAQVQSSIGDKVTNIDTKALALNHQTEEYHKTSKLVYPIRTNLVVE 282 T 1.1 Synaptobrevin pdb F Eukaryota T 8b6f 7 G A6 Q24C39_TETTS Transmembrane protein, putative MIARRLFKRSLYYIPRAGFGGGDIRHKFSNEITDDDYDYQRAMHVKPPKEESLFQLTNILSSVPVFKTRFFLDFIARNLDTNSAVSTSDFVAPPRVHENSFFVYHSRELGNVIRKYRSLESIVLPGALLTFTYPLFAAFVAIPSYYFMFNAKIYEMSRRFVVRMDVLPHLEMISVQRIGAFGILYTKLHRIQDLEYVPFDQVKEQENYLWAIGGHGVDNQLIFKDRSTGEFFYFERQGVWDAKGLNHPLLN 251 T 0.7 TMEM70 pdbhh F Eukaryota T 8b6f 9 I A8 NDUTT15 QRGRDYTPSNKKYLQPWELERKEYVELSLAIQSAYSCKMLSEILKDNLYMLTDYQLSFAMFHLWNHEIPIDNYFYNVISPILKEYITRFDRECNKSLAEIATFLGRMNVQDDAALWKVIETKLVQERLYRYIPLNDLIDLAHGMATANRGSQEFYNIVENVIIKHRLRLIPDKIAVAKDCFTARKIGSPLLYQVLENPQAEAHELAGLKEHEQLKIS 217 T 0.32 DUF6386 pdbpssm F T 8b6f 10 J A9 Q24F24_TETTS Transmembrane protein, putative MELNSSAKEDSHYVGVLGYPSQHDPHTLHPKKHDSTFTKVYACRDMLWDHHWEVRNTLYAGFKGALLGVAYASGFGLISKTVPSIVLKKMFRFVRNNNFGHIRIMQDLLTPYALTGFGLGSVYYLYQHNVWENRSNKWLAEVLSNALFFQVATAVCVNPGFHIYGMVGGILFGTLKYAFYNSSFFQEKESIGSYTTFGDLSEEERKKQEYKDYIQFLGNYHKVRNGQLVDL 231 T 9.3 ENOD93 pdbhh F Eukaryota T 8b6f 20 T AJ Q950Y2_TETTH Ymf62 MFLITITSYFSNIIEFNSYIINLIDFITPLFFIENFVIQFFILYLFYLLIVNNNLYYILLYIFLEIVFFGLFLCLYQLELFTGFLWVAEFAIVFIAVVLLFYLNIDGLHLKYNHNINNVLYYTPSLVLFLIFFNIDYFSELELFLPLELSFIDIYDDYYEGFNNSIMNDFTPLTLSYYSINSAEFIIIGLLLLLGSVACVNLYKSNKNYTIVKQSNLLTMFDFFKDFINFSFIRKQDLNNQTNFNPSLRSIKKKY 255 T 8.3 DUF2070 pdbpssm F Eukaryota T 8b6f 27 AA AQ Q233X7_TETTS NADH dehydrogenase [ubiquinone] 1 alpha subcomplex subunit 10, mitochondrial MSKAYYFVKNFSWAEVSNLLCYGTKYPTVLNHQQKVTRLYRATLRRVYAHQVEGYKTDFKQYNENITDIGKDFNKMLALKPESLELQAYFKKYEDLQEELFDPAMIIDESRPYAASSGRYYIFDDYLLKFDPFGFYSPKLLSENRPEEAMPFYEDYPQNDSHWNLWEQFPEDFEDSNAEREAILKSNKH 189 T 0.005 Complex1_LYR pdbpssm F Eukaryota T 8b6f 29 CA AS I7M2Y3_TETTS NADH dehydrogenase, putative MNHYWGSSNTIPASSTQNNNYFSGGGNNVTIRGNEIMERLPSQTPSQNMVQASMKTLRFYRKFCRLIPFILRIHNIGTKFTAQQAMINFGNYIRERNHYRDPGLIDHRIQLGYELLYEAEMHFSQHTILMQYLSPYNTPLSDRGYSYLEKVKYGNKSKFLQGFYKGNKPTEF 172 T 0.00098 Complex1_LYR_1 pdbhh F Eukaryota T 8b6f 35 IA AY Q950Z5_TETTH Ymf58 MLTWISFWSLIFWLILIILVLKPKNFISILFMSELTWLALYCLSLLFGAIYCDITLLSISFFILGVAGLEFSFGILIAILYKNLNESLNTDLNNNNNNQNIFDKNFKTPLEKINWQ 116 T 0.0017 Oxidored_q2 pdbhh F Eukaryota T 8b6f 37 KA B0 I7MI60_TETTS Transmembrane protein, putative MVNTAYPTPLKTILKTTPAFVVYFVFGLGFSTVIYDVVYHPKDRIERFYFRSSKFERLSRKRDEKLRHYFKPAIEWQPWYNTSTNNNTRPLLRY 94 T 0.15 KCT2 pdbhh F Eukaryota T 8b6f 38 LA B1 Q22E95_TETTS ATP synthase subunit e, mitochondrial MVYQGFKVLRRNPTFYNPRSAGMVALSYFAYSYYVNKYYKPQNSNFEEYNSSHPHNHDEKVRQYHEKTNQAIRDAVLEKRAEHDQRLREEAKL 93 T 0.015 Hrs_helical pdbhh F Eukaryota T 8b6f 39 MA B2 W7X4R4_TETTS GRAM domain protein MSYSGYSLNGGVHPCLPFYERMLQCAKSEALPIKMCTAQTEDYLECHHRKKQYALNYAIKKELNNIRIVALPRYDEENDTFVPFSQATADHIFQ 94 F F Eukaryota T 8b6f 40 NA B3 A4VD20_TETTS Transmembrane protein, putative MNSPQKVAQGAGRKLFKHYINENIKSNNEQKLFFYRVNRWRWNTKDNTTAPKFLRLKYPLLVTGVCLFAYDWTYGFTQVDAHH 83 T 0.086 RGS pdbpssm F Eukaryota T 8b6f 41 OA B4 Q22DC2_TETTS Transmembrane protein, putative MNLPWFVRWGTDVALFFIPAYTFANYPTTFFVFAAEKRRQRRRKDFSDVKLRDDAAFSVDQVKQLQTKLHLKQ 73 T 1.6 HIND pdbhh F Eukaryota T 8b6f 42 PA B5 I7MIK1_TETTS Transmembrane protein, putative MDKYIQQAKCAYNFSLKAVRFVGPLNIVFAGVAFLMFYENNYKKLYLNPRYSYTMPYLQSAKITKNLYEKL 71 T 0.21 YpmT pdbhh F Eukaryota T 8b6f 44 RA BA I7MIE0_TETTS Transmembrane protein MGGDHHHEDSHHKSNVDQHELKAEMIKELSHYYDHHDLSLFGKVQHFVEHLLEEKHHAKINTSNFDQKKLENFSESKQISRTVFALKKIKTFNHDFFTSEEEMILEPLPLGILTYGLKYAFAGVDAALLTYFWRNWNFNVRTIGLLGGLVGIQMATLHIPNLVNEVVIQTPRRRALAKKYISAYGPQFFHDIVNPKYDIEHLRHLQNKLNPY 212 T 0.17 PfUIS3 pdb F Eukaryota T 8b6f 45 SA BB Q22Z32_TETTS Transmembrane protein, putative MNPRNIFNLAKKVQNFNSITQKAFKRFGGAAAHHDDHHDDHHGHGGHGYEVHLVKDKNLIGNKSFKDDLVAVYGFTDVNDHHHHDETDPYHHLRGVPTLSFERMYFADAYYHDDTHEGLMNEPHGYLTMDDPMDLRPNYEKSALELLFLVSGGAILALMLGYQGLNLANPAESLFSLNTAAEEIEDKIRQIRIDNDKLLQRKAQLEEELASLNN 214 T 0.022 MctB pdbhh F Eukaryota T 8b6f 46 TA BC I7M855_TETTS NDUB8 MALRRVLKNQFNLIHKGQAQAVRGGHGWDRPDVPLSFNPLYVHKRELSIFDTNMWMYDQVYPEYVISYNEIHLVDQWKGLKESFSQSAYWWAMMAMVFGFYFINTTPRQLGIDTNDLKGFLGEYYGQYKKRSGIRSNFLGLDVTGENSIIQPNYDRKNGIRDVIDSLNADAGKRKLINLEAKNFIERVEKECEQRILKKGGATQSHH 207 T 0.23 Spore_YhaL pdbpercent F Eukaryota T 8b6f 47 UA BD I7LT77_TETTS Transmembrane protein, putative MFLYKKILSIYKQSFSFFLSFNFSFFLYALLAIFLLINFCQHIHKFLYYCKEKIQKEMQNAYPEITDQHREFLKKQGLKVYEPKPLPDQINPFSKTYWITNAFIIGVSFLARRHALKVGAPRIFWSGCIVGVPLAAIISRGKSDQLDELVGARKTLEQKLEYAPITRRAWERALATNQEYQNEIKTQIQDLQAEIAAKKVAAKLE 205 T 0.02 FlxA pdb F Eukaryota T 8b6f 48 VA BE Q23KE0_TETTS NDUPH2 MFNILKGAQLSFRSITNKSVNNYYNIMRQVSLDSNPIVLYQSSTFTGNGLQEFYENADALTKYLKLVPFFLEKNLYDHPKQFVIKMEFHPQNKVLSLDCLTHQGVLKKTVNLENLIPVPYEDYVQFCRRKLFNAPLFLDTEMIYFNTFQNEFYVFDKNAKWNEEGINHPELDISKLYNEKAWFDSLRII 189 T 0.26 TMEM70 pdbhh F Eukaryota T 8b6f 49 WA BF Q23KG0_TETTS NDUB10 MAFGGFRQTDNSLIIDDRRKIILNTRSLNDFQQKIYLRNFFTNYRPDLSSYDYFAFKEKLRIGELFLNEYRKRINNEVRRAAILTPTSSLREKMNHKIADQILDLSSPHVRGAHFQAVRSWTDASKIVNYVEEKQTKINKYGLQFPLLGNMTEEQCASKEDEVYQRLLKEMQKPPKKASEPVEESSDE 188 T 0.12 CMV_1a pdb F Eukaryota T 8b6f 50 XA BG I7M2U4_TETTS NDUA13 MQFFRPDFIATQVLRRADMAHSPFHKAIHDLEDKRSKLFPDRRRIPGRKAKLLLAASLLLQMWGVGKIIEIKKFMKRRDIELKGLQRKAAPFMQSMNDVRHLALRERNDMLYNELLSVHGEEYAQKMQKRFHQTDIWAPFRHRYAYMYNSSNKNVKDYKQVTLSRYINGFDKFNV 175 T 0.00014 GRIM-19 pdbpercent F Eukaryota T 8b6f 51 YA BH Q951B2_TETTH NADH dehydrogenase subunit 2 MSIFSNIWINNDLNSYGLSILLLNIINYLIVFMLILSVILLTNLSKFKSLNQFKEFNSYNFILYSLIFSLLSMAGIPPLLGFTGKFLAILYSSFKSQYLLILFMTILNIFGMYFYIQNLRFVVKKNKSSILNYKNYYVNINYSITLNIILLNFFNFFGILFLSDLIIILNYISSYIYI 178 F F Eukaryota T 8b6f 55 CB BL Q22HE4_TETTS Transmembrane protein, putative MTNFGSPFRNTDSGIVIRDPENEKRLKLAFQNFWKSKQEDKEFQAQIKTAVSKDTVNFMFYASPLFGALLGKTYIDMFCNPRYFYFRAFTLSMFALAGYCVGNGFRNRYEHSLYTRNYHLFPKDLQDALVNGDARYCISWWKQ 143 T 0.053 AHH pdbpssm F Eukaryota T 8b6f 56 DB BM I7M9B3_TETTS Transmembrane protein, putative MSNNNQGDFFVDKYNFSRRVVDHRQPYDLNFSINNPVGSRVWFKAWKQKAIGNFLNLVGVHYAFYGAGFCLLFVLADAWGREKYAQPYKSQILHGRQPFGHTFVQNYRNQATDLGRWNHNFACYEKQPGCGRDFD 135 T 8.3 DUF983 pdbhh F Eukaryota T 8b6f 57 EB BN Q22SC4_TETTS PH domain-containing protein MTHQFENVLLSNRKNLTPQESVQKVINYALLQDAKQRSRTLRHIKASWVIPALLFTYPAWYLAKGAVNGVWSNIHPTDKVTLSFANIGRPFRLIYRPEIFLRDQQAKFIQLEKEHIEKSKKGEFVETTSPLVLWN 135 T 4 DUF1852 pdbhh F Eukaryota T 8b6f 58 FB BO Q23B10_TETTS NADH dehydrogenase [ubiquinone] 1 alpha subcomplex subunit 8 MFLNPVKDSEFDDEVKGFVPSEGEVRFVANKNKECGYYLQGIEQCRRKMVQLAGDSSSQFHSLGFLPCKRLVDAHYRCMTDDKFGSTIEEVPEIGLDSAQKFFDCTFQQLKPMQSCRRFFDQVVRDVYRANGSQLI 136 T 0.23 COX17 pdb F Eukaryota T 8b6f 59 GB BP Q231G0_TETTS NDUB6 MLLIEMAFNAMKMKIFSLRKIKVKSKEQYLYNYQQKLLILGQGKEKNNKQYKKDIEMGGFQKYPIPRYLHVGQWIVNKNWKWNTFHMFFPTAILCFMVWRNSMISTAKPPNYGEYVDPQSPVAPKAIKY 129 T 0.0021 TMEM117 pdb F Eukaryota T 8b6f 60 HB BQ I7MAF0_TETTS NADH dehydrogenase [ubiquinone] 1 beta subcomplex subunit 4 MLARTLKNYMRVQQNLRFSRANIEKSPAPPTIGQVELEPFKFNHERDQLIYGYTMEELYGKKFGLKHSATVLREIKKDTIMMILFIIGGFTYCYHMRETRFQLDDDFNEYVNTNKQTFRPIPDHVKL 127 T 0.1 Viral_Beta_CD pdbpssm F Eukaryota T 8b6f 62 JB BS I7MG29_TETTS NDUB4 MSLRKGTSIFSRQFKKAFNDAKYQNLTAAQGETYSHLGWISNVDLRLGRAIFTFGVVGIAFCIYLEPSYFHETFGHMSQPPKYDLIDSNINGVEKKLNKQILHREHNEHKLDGFVSMFKGSDVAKN 126 T 6 Biopterin_H pdbhh F Eukaryota T 8b6f 63 KB BT NDUB15 NYHYDFCGRAYMGNPAVQSPPKEFFNYHYVPDNYPDALSGFRIAYRDPFEVQHVFAYENWEYQYDGQWWSMGSLACNVLFFCTPLFLYLILQVEELNSEKRGGTSNKFYHNNAGFFHFQIYDNKQ 125 T 1.5 DUF3930 pdbhh F T 8b6f 64 LB BU NDUTT16 ASQLQREQKLVQSLQQESLQPHLFKIIVDSQSDLVCEADRREYIKHYTRANEKSSTSQLLQVGALLGYIYAVGRYVSNPSTRKFSYGLAALLGSFSLLNPSKNLHHNHSLREIYSKYNISTNPQALEILKSRIY 134 T 21 ApoO pdbhh F T 8b6f 65 MB BV NDUTT17 MNISYTGLKLEDYSDEVIRKYKFPNSNELERFLNREQTLTVQQHKSAIKLAQQDFFAVAGLLSVGSLSYIFYNSVGGKVIRDRIRASRTKAQKVIVLALAFVANAAALIIARNAITAHNFGWQAA 125 T 0.32 P_C10 pdbhh F T 8b6f 67 OB BX Q22T55_TETTS Transmembrane protein, putative MFWRNVVRGLNCQQALRRQNFAKNITTTDIPKDSHHFAAKRSGFTQTEQAPFAYNDVYQYPKDYKPWNYNYKGNGVLLALFLGSAFSLVAYERSYASKTGRYQRKVQQNYYQI 113 T 0.055 Ncstrn_small pdbpercent F Eukaryota T 8b6f 68 PB BY Q951C2_TETTH Ymf57 MLKNKLIKFKFFRFVQSGFYVDFIFKKFSEMFIRNIFIYSSIFFGEKFMIEYLTKKTIDSFIFNNNRFNFINLVESKYFLQILTLILYLFFITIFILFYI 100 T 29 HEPN_AbiV pdbhh F Eukaryota T 8b6f 69 QB BZ Q22W63_TETTS Complex I-MNLL MSSMLIWGACFGLFTRAAACKASMIPLTTSPWKYPKYMIVSAVTFYYFDWYRRMALEQLCYNEEKLERYQIRAKLQSLKIGEELSDAYRESFFEHAVQKNNI 102 T 0.0018 NDUF_C2 pdbhh F Eukaryota T 8b6g 1 A CH I7LX66_TETTS Diphthamide synthesis protein MLTQRFYMIQFTKEEQSSEEKYLKTREREEDRKKELMHPQKVLNKKENKRKALLSKNQQNKKLIKYLNLNKRQEKLININQEEMSILPPLQYTYSNEESLELLIHSIKGNKDCNSERKAFNLCRSTVLGKHVEPEKCLDKALVFVNCFQKVRRDESAACQSAFNSTLECGKKYSESTISLGSSCQSQLDAYLNCK 195 T 0.0012 CHCH pdbpercent F Eukaryota T 8b6g 2 B CM Q22HD6_TETTS Transmembrane protein, putative MARLWWTLDPSKYYLKQISSGGRNEILFTVLGVTAAYWYFGNKRCEHYWRRQIDNCQSWSRAQNINGNNLTVKQYF 76 T 0.0044 PriCT_1 pdb F Eukaryota T 8b6g 3 C CL W7XBF5_TETTS Transposase MKLDQIISYYITPVRRFDKNLTAEQIYEQYQQAAQFNEIDAFTNIRFHRKFKEYIQTQEQSDYLYEKAKQISTLAQKMFEKKFPEYYTQ 89 T 0.13 Cdc6_C pdb F Eukaryota T 8b6g 5 E CI Q22YL0_TETTS DUF4885 domain-containing protein MFSDFNMYEAKVFLKAVADAQNTFRQTAQQENQLARYESQSQSLLNGSTSGAISITGDNIQQGRNFKALKEVKLFQYSNEIFKKYLAGFDSFSGDYTAFKKFLNESVKKIEQDA 114 T 0.0045 gp37_C pdb F Eukaryota T 8b6g 7 G CF Q248F8_TETTS Transmembrane protein, putative MIKYLLHQLFIYIYVAEVLLGCIFAFAETVFFHSDQDEDYFLQIKQIQIKNQKRFRNNQKKSRSFKKKIINQQLVSKMVRLNLKSNVDQNEYPFLAKWDKDMRQNYEEYQNRIDATTYHLQRSQRGIAVFGEWMYPRYFQKDILELEVLRRKQQLGKIYPEEVSSYTQINPDIANDLNLTFNAKLLWPVRGMTVGAGFFAFAHLFNLPYSFRLGLFVLPTAVELAFTWGNKTSQFKSIEFMDYLLQYRVSKALLEKNAKHFAEKKAAYQKEINSSQSVQDLYNQLITLVSEQAPSE 296 T 16 NADH_dh_m_C1 pdbhh F Eukaryota T 8b6g 8 H CG I7MEX7_TETTS SDHTT3 MSLVSLFKNTFLKSRVIGLSFQAQRVMAQMAKTDFENPDEHFLLNDAMKYNELVFYGRLAENWSINPELFGKAELAKYNEAKQTLIDFNQYHALVQNLHEFYWELKTIYLELSRGVATSNFHNKREVTHSIIESDIKNSIHKYIQLIDDLKDYPEWQHKVREEIGYYAHMIYTSVNHDGNFPEIFKEFNKVDSLYYFK 198 T 0.0012 MiaE_2 pdb F Eukaryota T 8b6g 9 I CK Q24CW6_TETTS Transmembrane protein, putative MLDDTKYIQMAQKFPRNVSVQLNKKLFVTRTWFRNYYFVGVFGIFAYFIYNQPKIFAPFSGYPTTVAYKAQPDFLNDQVIFYSQQRQNTLKNF 93 T 9.1 DUF108 pdbhh F Eukaryota T 8b6g 11 K CJ Q23S01_TETTS Transmembrane protein, putative MNHSCQKVFEGFVSALYDTSYFFRNFGPFKATIHYATYANYLAQNWAPRVSYIETSTPAYTLAKNKYAVYIVYGLIGGALIHNYMLDNKAAQKSQQYYLKHRD 103 T 11 Fzo_mitofusin pdbhh F Eukaryota T 8b6g 12 L CN W7XF00_TETTS Transmembrane protein, putative MRRIFWNFKTAFVGLPMFSLAPKNILVYPIVVGVPLYTFIVLQNSVRGFAYFDEYDSDVKEN 62 T 6 PPI_Ypi1 pdbhh F Eukaryota T 8b6g 13 M CC Q23RH8_TETTS Cytochrome b-c1 complex subunit 8 MRTKLYNAAYFLLNNNESFGHSFGIRLKIVGLNTWIVGYAVSRYYFSSLRVKAAQDERFE 60 T 4.8 eIF3_p135 pdbhh F Eukaryota T 8b6g 14 N CO SDHTT11 LPIRNIQFARYHYLAAVTVFTYFATRCCLLDYKKYYPLASVKK 43 T 12 AcylCoA_DH_N pdbhh F T 8b6g 15 O CD SDHD MFKELIHIFRTYYITFRYLKKSNINFLKNLSYTLIAYYLIINFQ 44 T 17 DUF1869 pdbhh F T 8b6h 3 C,CB DC,Dc Q950Y6_TETTH Ymf68 MLICNFLMYSNFSRIYWFDFNGTVNENLPLNYNVLKICRNEINKLEKLNENNLGTQKNPIKLNLSFEDKHYNTNNLVLDLNSYETFNSKNFISSIFDKTFESLNTVLMAPIYSFLEFKLKLSSTKINTNHYYVINGKLYITYNDSFKLFTTINDYFNDLNELSNTKLFFLYRSFNIYNIKLNSLVDFVFLKLILFIHLLYLKSTNYNRFDYRLKQTDWGFYINNNSNYIQNIFSGLKYIWRGLRFWIIGLLLGLSSIYYLMYVRLLPFNKIIFAWILVAMFLYWLLSGFVFFVKKYQYSKFTAAIQRFWKRTYIIFWVIEAGTFSVFFYLTLNASSEPVYMYDQIKIYKTHLFSWRWFLIKLLPSVSIILLGYYLQLTLKWNLFNKQNTIVLLITLLLLYILWLEFYQFYHILSFYGNINWAFDYDEYIWTLELDTRRTRLANNYIAICLFAKFWHFVFIFLFWVFFVLRINELGRIRYPLLVANVQNFIIIYIMSWAYMYPWLKFIFRKYLDVPYYWFYLNGRELGIRVFFTDLKLFFYGITNRLFDFNPSSIKFEKYPFYYWINSSQLTEFNQYRKFVIRDSIIYSLNNYII 594 T 0.29 DUF3408 pdbpssm F Eukaryota T 8b6h 4 D,DB DD,Dd Q23FF5_TETTS Cytochrome C oxidase subunit Vb protein MKKQKRTQGKQNTKQIKQEKLSSKRKANNQKEGKKKVKQEDYKEIKQKGKRMLSKIVKASFSSKGFNLANAVNTVKSTLNAPIKHIKRNIEPTGSNYSRMTNTTEEAFDEVSHEWQALVTSNPFDLNVFNYLENTQTSNFGTVDNPLVVFTSETPFRYVGCTGQMNEDDYEGHELLFFLLREGSLQRCMGCGQVFKLVRLRNEYSPEMDYYLSNFHPYEMQEMGESDTTVLMSPYKYASHYEYTQFETPSNMVYSMVNPDEHDRLLVDPAYRMERTKALEEKYKVYTSSLREVEKQFEERYGRAGQINISKVTYSTLIDVEKAVLKMDRLFRKVAKFENRAFIDRANHSRREKRMLERAQQRWDSNYSFFTGSLTEEEQKYRDYYETELEAYPEDEGIEQQLDQQEVLLSGRYDPKLYDFQEGYTKNPEDDQTSLIEKKAFKFRYRLANETSETFQRRNNRMVERQIKRFQQPQYKHAFEQLQKNIAISSNSGNALHSEYGYLELLSNESVQLYKDYYESDAEEDFKVFENLSSKEKLVMIANFENNLLPKYDRSEVHLIPKRQWEPAFGVWENFLYDITEYASFIAPRGKEIAADYQIQSAIPLTKEELIEAGLYKETIEKKVEPKLEAKKQTKSE 637 F F Eukaryota T 8b6h 5 E,EB DE,De W7XCY5_TETTS Transmembrane protein, putative MIWKYLQRTNRGNIIQAGLQHRKFENLPFKQNFDNLTKAYDLRMWYISNSPHEAKNLEYVNELEALHNELNYQNSRQFLFRTVSFLLGWALFYQFYELPKTYDWQDTQEPKHQVPAYGDLEEGGDEGGDD 130 T 0.025 SpoU_methylas_C pdbpssm F Eukaryota T 8b6h 6 F,FB DF,Df Q24I72_TETTS Structural protein MSSAVEKKDLPADYGKMPAGYNFLTRGKDWREYDKDFILRTDAVWEKFQLEHFFRNYMKCFFFDHGLKKYQMFEPEDMYTVVFEGWALDDLITFPGFTPTGRTNSYQIGLSPRQRTVVPTQTFYQMQDYYMLCGLRFERWFRCDLVYHDQRHTKFDQVKNQKNYKTYPCYREYYEAQYACQDDMFDFLMELAYARRAADNFESDFASHELTTLPTFYDTPKAAERKTYTY 230 T 35 SNN_linker pdbhh F Eukaryota T 8b6h 7 G,GB DG,Dg Q23DS4_TETTS Transmembrane protein, putative MGSVWFRNRYWWYRSLYDDYVAREAKLAFGIAAFIWLPHYYWGIHLNRAFEVNFSHRNYAHEWGPRRNRLAHSLEFEQFDMILENWQDLEDEYAQRGDGMLKK 103 T 7.9 NIPSNAP pdbhh F Eukaryota T 8b6h 8 H,HB DH,Dh I7MGF9_TETTS Transmembrane protein, putative XNRFFKVSSKYQYYKYLEQYDAAFLRKYQSETHWYLGRRGAWKNLVIKYAGDHISLEEEHNVKYKTHLSFVYLSYRLAWVLFAYVLIYNHFLLGDIGKTFNVGEWDHRLKPSAERDYPTRYESLYILDRTQKW 133 T 0.0061 IRK pdbpercent F Eukaryota T 8b6h 9 I,IB DI,Di W7X287_TETTS NADH dehydrogenase [ubiquinone] 1 beta subcomplex subunit 8, mitochondrial MISKYRYLHCARKLVKQSVQAFGGGHHHHEYDWRDDPKVNKDIEEDIRDRGWHPETYDFPYTKKHDDWVFDVTMPSQNYQTDLTVNIHPENKKMHVMKQVMRQSYWDAEHDMAHEYDYESEDLDFQCESFKSQHFRKKGPISQYLILGLLPILYFGTEFFYNHYPDEDYWRVAHPPPLDYPDTDDTDDTETFKDYKSFTGRRMVDTGIVDPLWYDIREGKKVYYDWAGVNQPMEDI 236 T 0.14 Tctex-1 pdb F Eukaryota T 8b6h 10 J,JB DJ,Dj W7WZP1_TETTS Transmembrane protein, putative MSLSLFGVKNNWHKNGIWWFSKILNKTVGEERYDALRVQRRIWSMRFYYARQQCLYELFVDHPDLAQWTGTYPKVDSSHGFPFYSTYEMYRDFQENTLNSDGSFAQWITLVCGIYVIHVIYNYMIPYYWVSTPLKNDEFTRLRMKDYIASTVLEEVYGISYAEWGWLPHDFAYNRMRGLAGYMHPDDPRAMCTSTFHRKHKYIEHEVEKVGDYHHMTYPK 220 T 61 Spore_YabQ pdbhh F Eukaryota T 8b6h 12 L,LB DM,Dm Q22PJ5_TETTS Transmembrane protein, putative MLSKVTRRFLNYNQIYCFASQHGAEHHKLTASDEAYLNEVRQRYVTPDMEKWAYLDYKKHPSTTLSHYDHKSKDYVESERDDYNADVATNSHNKLIDDFKRNLQMQRKVHDILQKMDRPYLRGVPGVTKNISAGLQDYSAPVSKKSQSDPNDFYRDAYRNENRWIDQSVFTPKTSKMTHYDVEWPKELASRPVTKKFHHDKGYKYDVTTPYDQRYNYVADRLGHPEILGNPFERLMRLEGDIYHPNYLDQPFVKVPNANPNASLNFEEGEVLYENTRLLEWAKFWNYSVVVGYLWCAYFVPYNIFFKTHMPLEHAYDNLFFPYFQHTHFLWDNNALHIPTVGGVAIYATYIALSYINNIWKDYVVRAQFSKDKELLFVTRVSPFGTTEEEVYEVAHLEHLPPSVRSGVKDLSAQDADGLVDVTCMSSQRSLVFYKGDQYWNPKVYNDFINQTSNLWTRNYTGYNRLEVQNSVEQVKIGFSHSSQPKLEKK 490 T 0.015 TMEM70 pdbhh F Eukaryota T 8b6h 13 M,MB DN,Dn Q950Y7_TETTH Ymf67 MTALFLHILWSISYIIINILYIFLSLLLSNNNEKIKQYNSNYFIKILLVLFYNKNLSFYKNLLSEDEISKIEFERLKNYPTLVLIHSNLNKLEKRNKIINSFINFKTKYRFYKFISTNFNLQTIIKNCNDKIIFSTLLYIVNLNYSFFYKTIKNTDLIVYLLANKFSILNDNIIVSKFNISKFNDYIKYINNTNSIDTYLENQIILGLNNNTNSNITKNINTKLLNSYSNLKNLVNITNNTFYLKKINDNYNTVINSEFLTYLKSNYKISFSASNIVKYLSDKSVNNSVILYLRKNKIFNKSRYSRNRQTYRTGAYWCLYVNIIAVVAFYFWFYKFTMNFGYLWWLLYSLILSFFFSRALKHRFYNPLNVMTEFKNGFMWFIIILINIFKPLLKLLENNYINLYNHLVIKYYQSFICNTLINKKKLEFNYILSSFKFIKELNNIIIISLNKLF 453 T 0.0058 NUFIP1 pdbpercent F Eukaryota T 8b6h 14 N,NB DO,Do Q22FX8_TETTS Protein phosphatase 2C, putative MFRRIISNGALLSTQTQRWQDLSKFACLRASLNKESEKAFQELAKKNNVSPQELVELSKIVSMNLDVLKQNINSEQFLLEKESTLKRYRQSSIGTRGHLQTVNEAVNTKYPTLAEGLGQVAGYKEAYQALREIFVHPSISVNNLRQGSYGQQFAVDFRTRADEYVKALLKDHSSNPQAVQTIQEIQHTLHQIIKNYEQNPASIYARILTVLQTRGVNTLPVSKTADQKAVATIQKTSTPSLTIDQLTVPVQERVQTQTVFDAELAFIKEANEMIQQNTGNLPWDGGKKKIFQGQANKYLETPYYLLAALSGLGLLYFLYSGDAKYKTLVLTPVVGIAAFVLLRRNQILNRVPTLTELFLHKDGKFVDAVVSVNGQLISKNDIPVSTLKLYRGDHTVKVNLNDFEDASAKKFLAQQSGQEGVINVHFSKLRNLAARNGQVLNLGDTEVVVPFENQANRIILKQIFKGVEVLPSS 473 T 0.011 Rh5 pdb F Eukaryota T 8b6h 17 Q,QB DR,Dr Q23DG8_TETTS Transmembrane protein, putative MRYLKIEKEKLVSCKKQEQEVQRIRRRKGNQKLNSIAKQQRVKRRDYQQNIKQNKEVKNPKKLIKQQIINKVKKRKKMFRGLTKFNKVFALNSFKNSLVAVPKANLNHVQNMLEENLKYDAQKYNDEVAVIQKTSRIYKPTYTIEFNREGEVLVYSADPIKNSVVYFKYPYVLYEAAIPLFIWAWIYNPLELSKNAVNSLLIYPNIAWIPRMWYWRSLQYKIQKMYLLRGGKVAKIETQSLAGDRFTSWVETYQFHPLTQDQKNFDNQDNAEFLEDEGQLKYELGVQLDNLQEMGTTSQDIVINFMKEGTVHHPELFEAIVKGYNIDTSDYVINTANNLRAREGNHNH 348 T 0.023 TMEM70 pdbhh F Eukaryota T 8b6h 19 S,SB DT,Dt Q23DZ5_TETTS NADH dehydrogenase [ubiquinone] 1 alpha subcomplex subunit 8, mitochondrial MFLNRLVKETSKAKRLFSMAQNNFARAGPYNPNRYKDYYIPRTLPKNEEIVEFVQSQHSVPASPIRNQRHINPVRESGPLPSYDGTYTMEDIRAVFYNTTVGRDYCYCQMDPEEIMRRVPGITRKEAEFITKLGLSPQEQVDFAYIAYNIGLDIFYFTNQMFVARQVVTNSKGEKVEVLWNAQCYEDIAQLNVGFAPVLESVDYHWEIFLWADPPIKPNNDFDLNVPCTWFEYEQEWWMESCIQEDQFNLPEDERPYNTPRNPHCRKELWRSQDALQEEELMVNENWYPKNTQYNIYNQPDFIKPKSGSGAAADDIRI 318 T 5.7 IMS_HHH pdbhh F Eukaryota T 8b6h 23 W,WB DX,Dx I7LY65_TETTS COXTT9 MVYHLFERICNPDNFKLSGEAARVRTLIAAGFSKEEAEQVAWLQNHQVNGKILGLFTGGFALYCCNNYFHYFERYFPRLRYQPFTKFLAQAATVYFFFKIGDYYFTSRRYGSNDARMNGLMYSNTYYSTNKEALIQNFEPLNRKFTEEEVEQFLRNEGRSQEEKRNWIYNPHIHGSTEGEWKADIHEKFDSGKAPWEREHVKAKILETNKAKIDAGEEIQLKPFKTLNHLDKTGLLHRLHPFIWTNNWTLLG 252 T 0.19 Bac_luciferase pdbpssm F Eukaryota T 8b6h 24 X,XB DY,Dy I7MD70_TETTS COXTT10 MSSFIQYEFLKIYQGNQKIKNYYKRKRLIFQQKKVLKKKQKEIQMSTNNLRLKPWFHWTDEERSHAIFSAYEKRILKSEDLPSFLRANRINNVSTWVFPLIALPLFNQSIFKLGFAQRILLTRPAIEWHCFKIATVAASWLAWLNFSPFYRKLENEKEYLLDTLESRIGINVLDLNDALPRWTTSQEYNRRTQQLYNQRNGFFAGLLYPQEESSRPLVDIASFPKNLHKEKLTK 234 T 2.9 TFA2_Winged_2 pdbhh F Eukaryota T 8b6h 25 Y,YB DZ,Dz W7X4J9_TETTS 39S ribosomal protein L9, mitochondrial MFGRLVLKQTRRTLFNPVLKNTFCIYQAYQNPLRHINTGHNPNNVYEDIVMLGDYPVQNRTHDKVISQTYVPAIANIAFTHLSKKYPQAGLKVDQLNTLKEKTWNDLGVNIEHEKQEILVELSEQIFVKESKLRWVHEQRQRLAHTTYVFSGLEFQNVKVGFFIDSYNFLLQELAHRSNLYQSKDIVGEKSFHEKHLEQQTAPYSGVKSLEEPVSQNKSFINSLMRAIHNH 231 T 27 FliD_C pdbhh F Eukaryota T 8b6h 26 Z,ZB EA,Ea I7M3P9_TETTS COXTT12,Transmembrane protein,Transmembrane protein MYVLFVCLIDSMNVEEGKQIKEMILPHNNRQLARQYFDSLPENDINRKYYEGLKYETPKTFFGRFLNQFNIDAKLDTLSKFYTYQKTIRATQAELQEDRKSYLTNSLLFTAVSWFSIYQFARKGAVLPVLREYGRYFGTHRLFRQYLHTLVLPLLYTEYALNQKYYTHMEHLWTVHVNRLNQKILEDPLYTFYPQELNVPKHNIIVPTIFRDTPQ 215 T 17 Plk4_PB2 pdbhh F Eukaryota T 8b6h 27 AA,AC EB,Eb I7LZX8_TETTS Transmembrane protein, putative MKKGTASEEELKKLYDPNTFYEHGDNPAFKQFMNIAVENLREGKLTDHRTYVVDTYKKWMYARNWDDFLQRDCKAITFPRAFALWIVGTLGMATASKWCRQILPVGSHGITKISQTQFFHQFGPLGTLGAVGFYGLTAYLYYKTTIFTVKKFYSHCILQEREWIFEQERQNPGYGEYFFKDVPLSAEEHFNDLARGEMAKKKFEKPNHEF 210 T 8.8 DUF4500 pdbhh F Eukaryota T 8b6h 28 BA,BC EC,Ec COXTT27 MSALLKEILALTVKSEAALWKGAEQKVLSGLNNLAKTELVQITHHFGVNKQGSEALWSQLDKAAVGAFPELSVDETLQLIDGFGECPDSYTLSHDLNQRLLVSWEQLGKLNFQKLKETNPYFASDIVNQLDAAAAEFIKVRPAAESEAGGFLNSLGVSSSFNTTKNDIYVVQSASGKKLNNKEQREAYVLEKAQKYLKEDPQSKILDIIAQK 212 T 1.3 A_thal_3526 pdb F T 8b6h 29 CA,CC ED,Ed Q951A7_TETTH Ymf75 MFLGIFKDVIKLLNKKVVPVYFWFFLYCFLSTMDTNIFVSSCSFLKVEVFGKDENTTLVLLFYVFYSLFNFYLSRIKNKNNYLVRKHLYTTELLIELILFKYKLIILKFSSIKYILNFNVRKFILFNLFLINNYKAYKINTFFLYIYIYLNNLNIIWYPIFKAYSIFGYYKSTRLNFIDTKNENIKRIKY 190 T 1.8 PDH_E1_M pdb F Eukaryota T 8b6h 30 DA,DC EE,Ee Q23F08_TETTS Mobilization protein MKEKIFNELTRKMKRKEISAKIQREENKQILIRQRNNKKYIQSIQGIQQERKKGKLYLVEMATQNVEEMDTIQKMNYEATVNMGRQDLITREYTFYSDYEFIPIQEDRKQQMEDALNNLHKIIHPTVTQLKKKANVQEIQDRVFRKLQGWEGELNTCVFSAKNVRDSNFCADRFTNRINTEGVEFVKQILREY 193 T 0.13 DUF3221 pdbpssm F Eukaryota T 8b6h 32 FA,FC EG,Eg COXTT28 MAARDFEYNNQDVNQLNGAFISLVEDEKIGFWVGVGGFAYSQFIMRKFVKSTNIFASVTSLFAGAALANLYTHQSRASYARVAARANRNASLALNKLMEY 100 T 0.02 DUF1689 pdbhh F T 8b6h 33 GA,GC EH,Eh Q23D87_TETTS Transmembrane protein, putative XDNNYHFWGNGDRQDVSLSYEDYYSILDCLLDEKLSPQGLMKFKNLHEVSMYGVSYVPLYCFPVAYGISHMLTGKVRRGHSGYRNLFSLMSVVLPFTCWYAYTTPIPRRLYTEIICSNNADGAYVRNRIKQQKPGIWRKLSQQLYNKNFRFPELNQDLTATEFPLDYVAPHKF 173 T 0.16 DUF2206 pdb F Eukaryota T 8b6h 34 HA,HC EI,Ei I7MKT6_TETTS Transmembrane protein MVFEFLFYNQQHKTRNGYFINHDNLMLASLEERKKLIFYFIANQVPEKLDPVDRVKFNEELSDNLSTKARLIGSLTGLIGLVGFPYISTRIYSRPVLNIGLSLLICPFLYYVGNQLTYSVWEPKFIANNNTVCELSKKYNFTVFDFAQAKKEAHLKALRTELVSDNLLYSPGI 173 T 0.048 DUF1689 pdbhh F Eukaryota T 8b6h 35 IA,IC EV,Ev I7LVX0_TETTS Decapping nuclease XEVKYRGPSDDKLECEFLENNLLSCLREKSVQDNVAKMTCRPEFLVWFFLECPTKAAVYHDPKGLRNIFIQDKIKQKGSDDGVLSKDD 88 T 1.3 Defensin_4 pdbhh F Eukaryota T 8b6h 36 JA,JC EK,Ek Q230X6_TETTS Complex III subunit VII MAIRNFVFKISNQIQNLAAKRSLAYLNQIDSQSVPSRATINMKDQVTQMQREIDNMANVIRAQIPDEDRAEFEILKKYYVTGQHDSLVDPQDVLLQLDRIQVLKNLKMIELNEEAYDPELVRLEKLKARVLLEEEGALLEYAHFISKRPYNKPYEKWGVSEEHVKQQILG 170 T 0.35 APG6_N pdb F Eukaryota T 8b6h 37 KA,KC EL,El Q23VY4_TETTS Transmembrane protein, putative MGFETVVPAPPTRDDELRMIKATEEQFLQQPRYKLYMNEAHRIAKMNHGDRHNNIRAHFWSNFALGLLITGPIFIIPFGKAFRNLRSGVPYYFRPKYVFTQKNQYNQDRNWGAMKKQIPLWLGLSTAYAYWFTDFSINDDEWLEKGKVIYPHQTIKVL 158 T 0.091 MASE1 pdbpercent F Eukaryota T 8b6h 38 LA,LC EM,Em Q22DP8_TETTS Transmembrane protein, putative MSCTTRRFIDEKEKLEYSRGYNQQELEASKLRKDFVKKYIVDFDTTLYKTQVERDWAYIAKREYRYEVQLKSIGYGGALANAVLLWRIYANKKMVFWPIPIVGALGYLYFQPVFFQKSNKRFFDMCNVGEEYYLGRERNKILRECNKILNVEDF 154 T 0.1 DUF559 pdb F Eukaryota T 8b6h 39 MA,MC EN,En I7MFV5_TETTS COXTT22 MGKDQLDFSHFDKAFENKYDIVAPEFGDLHQKRAEFIAKNQGTYRPVPLVPNNIKGLIPKTCRLPATRNWYRRTSSFERNGFFNIHTPVLNTKMIPWLLFIVLTWGWSSFQIGGYNYERFDDNGERRNTLYWKLSPVEFPQSKLWNRPS 149 T 0.056 TOM6p pdb F Eukaryota T 8b6h 40 NA,NC EO,Eo Q23TE5_TETTS Transmembrane protein, putative MVFHYTNFVQETNAWWLRRVRPVYCTVLAYYGWWLYDRYYLFGKNATQDIRKDTTEVWEKRAALNKRNWGYNAHYKPELERSMKKVLYADPNYKFPIEWPERYMAETKTLEQVMDEEENWEYYK 124 T 4.1 GTA_holin_3TM pdbhh F Eukaryota T 8b6h 41 OA,OC EP,Ep Q24C97_TETTS Phage protein MMQNLKKFMSKTIQVQPVSFNQIPKAFYNFPEYRTGGVQANPGITAKRIIKCIGERLRKYDPARWENVPITFKTHFRDENGYSDVATSIQIHDALEREFGIDIKDRLALVTDVETAFYIVMSHHDPL 127 T 0.16 DUF1493 pdbhh F Eukaryota T 8b6h 42 PA,PC EQ,Eq Q22W32_TETTS Transmembrane protein, putative XEPFGTDERNWTHEEKDIITRFLKYDKHVNLKTAEMVYSAEVESAYFGKAGALAGGVISALFFNFPIVRNLPIIRRSVIGVLPFLYCYTWGKNTQEELRWLKTFAAYQRFVVYHGQHCKLWV 122 T 3.1 Pepsin-I3 pdbhh F Eukaryota T 8b6h 43 QA,QC ER,Er I7M9E7_TETTS Lysozyme MAQTAHQNRYQGGLCYAQCNELFSFWNPSIQQCWKGCDFGVGRVNDPEGRIEAQQMCKRWAAELYWTYKGELDTIKDLRVHADMYPTTPQNVYRACLAGVRRQKF 105 T 0.59 BSMAP pdbhh F Eukaryota T 8b6h 44 RA,RC ES,Es Q950Y0_TETTH Ymf70 MFRWLFLYWYNSTDTPSAIAKVNLWSYINLRLFKARLSSSIAYYILGLNNLELKKLKIFYKNTYFDYIYLKSIPCLFLIIFFTNLYLFL 89 T 37 DUF5784 pdbhh F Eukaryota T 8b6h 46 TA,TC EU,Eu I7LTF1_TETTS ABC transporter MSSDPFKKVERDYHNERSVHKHFASYPLKFWWGLNKFETIQGIHSILGNAADLVVSTLSFIPGVQGRNNASYIENSIRVTRFRGFDDKTQ 90 T 0.14 DUF5493 pdbpssm F Eukaryota T 8b6h 47 UA,UC EJ,Ej I7M8Y9_TETTS YflT domain-containing protein MNNTFKFLHQVISKLTLKAQVPNYGQYSHSLKRPINPKVVVFGNSSRAYELISSQFRNFNHVNGLELKGQEDNIQANKVAQSVLSINDGFQDGYYITDFPQNSKQAERLDLITDGVNLALYIKDPSDKVTVTRQQEAIDYYRKTGALVEFEVDPRGDLEEQVKQLSNQVLNGYKH 175 T 0.11 PRORP pdbhh F Eukaryota T 8b6h 48 VA,VC EW,Ew Q22N23_TETTS Cullin domain-containing protein MEDNYAADVQRQFNRTAFDSLYKICYNSLVQKNGSTIDFQKQIDCHQRLIQVFAKIAPIVVKVEQDAASSGGAAAGGEDEE 81 T 11 MetOD2 pdbhh F Eukaryota T 8b6j 7 G,R G,g Q23F81_TETTS UQCRTT1 MVRLEKILWEQLVNVKAFSRQRVIGAPSKWYNENRTEWFKVAQHNAFNTGFSGVILRALEPLLAKFIYRWRLDIAHQRGLTLEDSLLFMDRELRRCYFFETVARQNLHPYTVLFMKKRRARYYKVERGLRGFYVPDWVRKEAEERQLSETVDNIFNWENFVYREYMSDMTPIGRWTSLSKITPLDMFQYYGLFRNEAWDRFFYNEAFYESYSEKEKQEANGNPFGKFNLQTADGRAQFEKEVNTFIERYPFAVTKPGQKFDFTRFYALEDLANKRDTSKYDPALLESVKNELKQSAALPADNGANKTKKSKPILPDWLQPKFGKAFQA 328 T 0.69 DUF6322 pdb F Eukaryota T 8b6j 8 H,S H,h I7M484_TETTS Transmembrane protein, putative MNVTGAGLTHVKDFHSDEMRVFRGGLRHIADKQGNLIYGSVNSSVRYYHDKMSYERGFIQHSRSPSNQFINFHFMLGGFRTYVLERFFKQVWYRRNIRTFWFPVLISYTSGCITMRMYDNNCYDYFYFSD 130 T 1.6 DUF5320 pdbpssm F Eukaryota T 8b6j 9 I,T I,i I7MM45_TETTS Transmembrane protein, putative MVYGKLIFNNIKEYTPSWIKTIPYSQVTKPILRKQPQIVGKINADPKVKKFWVFLRENVQYYPFLWQFFILGTSFVWFHVCYDPWLAIYQANNAHRSLETALTKEKAHKKKLAEQEESE 119 T 2 Selenoprotein_S pdbhh F Eukaryota T 8b6j 10 J,U J,j UQCRTT3/UP1 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 66 F F F 8b6j 11 K,V K,k I7MFL6_TETTS Transmembrane protein, putative MYLPTFYKLFHETNAFRLKRYVGYGPLLLTWSIWTLYPALYNMIYSDFIPPERGVPKRIVDA 62 T 1.5 DUF5621 pdbhh F Eukaryota T 8b6j 12 W,X l,L UQCRTT2 MAPVFLKALRYVIYSYPLYVCYLIKQAQINAQGSEKEEEHH 41 T 2.8 DUF5392 pdbhh F T 8b6l 4 D D Signal peptide mix XXXXXXXXXXXXXXXXXXXXXX 22 F F F 8b7y 52 ZA z Myxovalargin B XXAXXXXXXXVXXXXX 16 F F F 8b8f 1 A A B0D650_LACBS N-terminal beta-trefoil domain of the lectin LBL from Laccaria bicolor MSNEYNPPLGIAFRLCGLASDRVLFSRVSPSPEVFHHPKSEVYPDQWFVAIPGSGQNAGCYAIKSKNTGKVLFSRMSPDPRVGHIDGDGKYPDNWFKFEAGSGKYAGYFRLRAVASDTVLVSRTSTGTDTQVINYPATSAKYDDQYFTILFD 152 T 0.001 RicinB_lectin_2 pdb F Eukaryota T 8b8w 2 C,D C,D NCOR2_HUMAN N-COR2,CTG REPEAT PROTEIN 26,SMAP270,SILENCING MEDIATOR OF RETINOIC ACID AND THYROID HORMONE RECEPTOR,SMRT,T3 RECEPTOR-ASSOCIATING FACTOR,TRAC,THYROID-,RETINOIC-ACID-RECEPTOR-ASSOCIATED COREPRESSOR HASTNMGLEAIIRKALMGKYDQW 23 T 15 PPS_PS pdbhh F Eukaryota T 8b8x 2 C,D C,D NCOR2_HUMAN N-COR2,CTG REPEAT PROTEIN 26,SMAP270,SILENCING MEDIATOR OF RETINOIC ACID AND THYROID HORMONE RECEPTOR,SMRT,T3 RECEPTOR-ASSOCIATING FACTOR,TRAC,THYROID-,RETINOIC-ACID-RECEPTOR-ASSOCIATED COREPRESSOR HASTNMGLEAIIRKALMGKYDQW 23 T 15 PPS_PS pdbhh F Eukaryota T 8b8y 2 C,D C,D NCOR2_HUMAN N-COR2,CTG REPEAT PROTEIN 26,SMAP270,SILENCING MEDIATOR OF RETINOIC ACID AND THYROID HORMONE RECEPTOR,SMRT,T3 RECEPTOR-ASSOCIATING FACTOR,TRAC,THYROID-,RETINOIC-ACID-RECEPTOR-ASSOCIATED COREPRESSOR HASTNMGLEAIIRKALMGKYDQW 23 T 15 PPS_PS pdbhh F Eukaryota T 8b8z 2 C,D C,D NCOR2_HUMAN N-COR2,CTG REPEAT PROTEIN 26,SMAP270,SILENCING MEDIATOR OF RETINOIC ACID AND THYROID HORMONE RECEPTOR,SMRT,T3 RECEPTOR-ASSOCIATING FACTOR,TRAC,THYROID-,RETINOIC-ACID-RECEPTOR-ASSOCIATED COREPRESSOR HASTNMGLEAIIRKALMGKYDQW 23 T 15 PPS_PS pdbhh F Eukaryota T 8b90 2 C,D C,D NCOR2_HUMAN N-COR2,CTG REPEAT PROTEIN 26,SMAP270,SILENCING MEDIATOR OF RETINOIC ACID AND THYROID HORMONE RECEPTOR,SMRT,T3 RECEPTOR-ASSOCIATING FACTOR,TRAC,THYROID-,RETINOIC-ACID-RECEPTOR-ASSOCIATED COREPRESSOR HASTNMGLEAIIRKALMGKYDQW 23 T 15 PPS_PS pdbhh F Eukaryota T 8b91 2 C,D C,D NCOR2_HUMAN N-COR2,CTG REPEAT PROTEIN 26,SMAP270,SILENCING MEDIATOR OF RETINOIC ACID AND THYROID HORMONE RECEPTOR,SMRT,T3 RECEPTOR-ASSOCIATING FACTOR,TRAC,THYROID-,RETINOIC-ACID-RECEPTOR-ASSOCIATED COREPRESSOR HASTNMGLEAIIRKALMGKYDQW 23 T 15 PPS_PS pdbhh F Eukaryota T 8b92 2 C,D C,D NCOR2_HUMAN N-COR2,CTG REPEAT PROTEIN 26,SMAP270,SILENCING MEDIATOR OF RETINOIC ACID AND THYROID HORMONE RECEPTOR,SMRT,T3 RECEPTOR-ASSOCIATING FACTOR,TRAC,THYROID-,RETINOIC-ACID-RECEPTOR-ASSOCIATED COREPRESSOR HASTNMGLEAIIRKALMGKYDQW 23 T 15 PPS_PS pdbhh F Eukaryota T 8b93 2 C,D C,D NCOR2_HUMAN N-COR2,CTG REPEAT PROTEIN 26,SMAP270,SILENCING MEDIATOR OF RETINOIC ACID AND THYROID HORMONE RECEPTOR,SMRT,T3 RECEPTOR-ASSOCIATING FACTOR,TRAC,THYROID-,RETINOIC-ACID-RECEPTOR-ASSOCIATED COREPRESSOR HASTNMGLEAIIRKALMGKYDQW 23 T 15 PPS_PS pdbhh F Eukaryota T 8b94 2 C,D C,D NCOR2_HUMAN N-COR2,CTG REPEAT PROTEIN 26,SMAP270,SILENCING MEDIATOR OF RETINOIC ACID AND THYROID HORMONE RECEPTOR,SMRT,T3 RECEPTOR-ASSOCIATING FACTOR,TRAC,THYROID-,RETINOIC-ACID-RECEPTOR-ASSOCIATED COREPRESSOR HASTNMGLEAIIRKALMGKYDQW 23 T 15 PPS_PS pdbhh F Eukaryota T 8b95 2 C,D C,D NCOR2_HUMAN N-COR2,CTG REPEAT PROTEIN 26,SMAP270,SILENCING MEDIATOR OF RETINOIC ACID AND THYROID HORMONE RECEPTOR,SMRT,T3 RECEPTOR-ASSOCIATING FACTOR,TRAC,THYROID-,RETINOIC-ACID-RECEPTOR-ASSOCIATED COREPRESSOR HASTNMGLEAIIRKALMGKYDQW 23 T 15 PPS_PS pdbhh F Eukaryota T 8b97 1 A A B0D650_LACBS Beta-trefoil domain of the LBL lectin MSNEYNPPLGIAFRLCGLASDRVLFSRVSPSPEVFHHPKSEVYPDQWFVAIPGSGQNAGCYAIKSKNTGKVLFSRMSPDPRVGHIDGDGKYPDNWFKFEAGSGKYAGYFRLRAVASDTVLVSRTSTGTDTQVINYPATSAKYDDQYFTILFD 152 T 0.001 RicinB_lectin_2 pdb F Eukaryota T 8b9u 2 C C (MLE)V(MAA)(E9M)G XVXXG 5 T 26 eIF2_C pdbhh F F 8b9z 26 AA b Q9W380_DROME UNCHARACTERIZED PROTEIN,ISOFORM A,ISOFORM B SLLKRAWNEIPDIVGGSALALAGIVMATIGVANYYAKDGDNRRYKLGYVVYRHDDPRALKVRNDED 66 T 0.038 NADHdh_A3 pdbhh F Eukaryota T 8ba0 26 AA b Q9W380_DROME UNCHARACTERIZED PROTEIN,ISOFORM A,ISOFORM B SLLKRAWNEIPDIVGGSALALAGIVMATIGVANYYAKDGDNRRYKLGYVVYRHDDPRALKVRNDED 66 T 0.038 NADHdh_A3 pdbhh F Eukaryota T 8bac 2 B BBB HPSE_HUMAN Heparanase 8 kDa subunit QDVVDLDFFTQEPLHLVSPSFLSVTIDANLATDPRFLILLGSPKLRTLARGLSPAYLRFGGTKTDFLIFDPKKE 74 T 3.5 Glyco_hydro_79n unppercent F Eukaryota T 8bat 1 A A B3E6E9_TRIL1 Geobacter lovleyi NADAR MGSSHHHHHHSSGLVPRGSHMAERPVYIPNISGTNLVKTQYVDFKWFPGMAIVQKQKSIESLHEAAKKLLNITNLLEISSKSKTTLGVDLSAFNLMITTIKYNKTFSVESAFQSSKVFEKGGPYLDLLDKTSREAKKDGRLQTSGRLKCFKFFGIEWGLEPQTAFYDWLYINALKKNSDYAEQVMEYSAFTDIEFNPERSINCQAYSAALYVSLCHRDLLEYATSSQTAFLEVVTGAPISNARQDDIVQGALKF 254 T 9.4 Phage_30_3 pdbhh F Bacteria T 8bbt 1 A A A0A0B4VFQ3_9VIRU MOBP MGYLCNDYGYEPNVDYPNASHAGLYDRSKQPYVDTAIGPKTTIQFDHVFIKSDFKTWLAHNQDEAILLIRLYELGLLLQGRSDSFLEFYNNTTYITRTDSKQPFLNKYGKLVDTTSVTCLDIFLSVVLFALNQIDSLICDFKNTPWINLSKEHKKIYELVRGIFGICYGEKDYNRFEYCPFDANSTASALNVNATLNAKKTIELITCGLIRALIAYANLVTAFSADKTALLHEILLTKVCC 241 T 0.51 DUF3746 pdb T Viruses T 8bc5 1 A A A0A0B4VFQ3_9VIRU MOBP MGYLCNDYGYEPNVDYPNASHAGLYDRSKQPYVDTAIGPKTTIQFDHVFIKSDFKTWLAHNQDEAILLIRLYELGLLLQGRSDSFLEFYNNTTYITRTDSKQPMMNKYGKLVDTTSVTCLDIFLSVVLFALNQIDSMICDFKNTPWINLSKEHKKIYELVRGIFGICYGEKDYNRFEYCPFDANSTASALNVNATLNAKKTIELITCGLIRALIAYANLVTAFSADKTALLHEILLTKVCC 241 T 0.51 DUF3746 pdb T Viruses T 8bc6 1 A,B,C A,B,C A4TVL0_9PROT Cereblon isoform 4 MPLDAGGQNSTQMVLAPGASIFRCRQCGQTISRRDWLLPMGGDHEHVVFNPAGMIFRVWCFSLAQGLRLIGAPSGEFSWFKGYDWTIALCGQCGSHLGWHYEGGSQPQTFFGLIKDRLAEGPAD 124 F F Bacteria T 8bc6 2 D,E D,E GLN-MET-GLN-SNN QMQD 4 T 120 BTD pdbhh F F 8bc7 1 A,B,C A,B,C A4TVL0_9PROT Cereblon isoform 4 MPLDAGGQNSTQMVLAPGASIFRCRQCGQTISRRDWLLPMGGDHEHVVFNPAGMIFRVWCFSLAQGLRLIGAPSGEFSWFKGYDWTIALCGQCGSHLGWHYEGGSQPQTFFGLIKDRLAEGPAD 124 F F Bacteria T 8bc7 2 D D PHE-PHE-GLU-GLN-MET-GLN-QCI KFFEQMQX 8 T 3.5 FAM110_C pdbhh F T 8bck 1 A A A0A0B4VFQ3_9VIRU MOBP MGYLCNDYGYEPNVDYPNASHAGLYDRSKQPYVDTAIGPKTTIQFDHVFIKSDFKTWLAHNQDEAILLIRLYELGLLLQGRSDSFLEFYNNTTYITRTDSKQPFLNKYGKLVDTTSVTCLDIFLSVVLFALNQIDSLICDFKNTPWINLSKEHKKIYELVRGIFGICYGEKDYNRFEYCPFDANSTASALNVNATLNAKKTIELITCGLIRALIAYANLVTAFSADKTALLHEILLTKVCC 241 T 0.51 DUF3746 pdb T Viruses T 8bcl 1 A A A0A0B4VFQ3_9VIRU MOBP MGYLCNDYGYEPNVDYPNASHAGLYDRSKQPYVDTAIGPKTTIQFDHVFIKSDFKTWLAHNQDEAILLIRLYELGLLLQGRSDSFLEFYNNTTYITRTDSKQPFLNKYGKLVDTTSVTCLDIFLSVVLFALNQIDSLICDFKNTPWINLSKEHKKIYELVRGIFGICYGEKDYNRFEYCPFDANSTASALNVNATLNAKKTIELITCGLIRALIAYANLVTAFSADKTALLHEILLTKVCC 241 T 0.51 DUF3746 pdb T Viruses T 8bcs 1 A A CC-HP1.0 XGELEALAKKLKALAWKLKALSKEPSAQELEALAQELEALAKKLKALAQGX 51 T 0.0097 LIN9_C pdb F T 8bct 1 A,C,E,H D,B,H,F 26alpha XGELEALGKKFKALAWKVKALSKEPSAQELEALTQEAEALGKKIKALAQGX 51 T 0.021 Seryl_tRNA_N pdb F T 8bct 2 B,D,F,G G,E,A,C 26beta XGELEALAKKTKALTWKFKALSKEPSAQELEALTQECEALGKKLKALAQGX 51 T 0.026 FlgN pdb F T 8bd1 2 B B A0A0L8UU71_VIBPH RHSPI MISLSDIENLIQHIWEEPIFSDVTSKKVVVSLYGTLSKKIPDKFIIIEEVFPKDELEDIWSNYEEYLDEYLIFPFLGTLGEAVICIGYGNDNKGKIFYFDFDFGACELDGDNLEAFLEKLLESGSTENLYFQ 132 T 0.00035 SUKH_6 unppercent F Bacteria T 8bd5 1 A A A0A8X6EH11_9CYAN ShCas12k MGSSHHHHHHSGGGSGGSAWSHPQFEKGGGSGGGSGGSAWSHPQFEKSGGGENLYFQSNASQITIQARLISFESNRQQLWKLMADLNTPLINELLCQLGQHPDFEKWQQKGKLPSTVVSQLCQPLKTDPRFAGQPSRLYMSAIHIVDYIYKSWLAIQKRLQQQLDGKTRWLEMLNSDAELVELSGDTLEAIRVKAAEILAIAMPASESDSASPKGKKGKKEKKPSSSSPKRSLSKTLFDAYQETEDIKSRSAISYLLKNGCKLTDKEEDSEKFAKRRRQVEIQIQRLTEKLISRMPKGRDLTNAKWLETLLTATTTVAEDNAQAKRWQDILLTRSSSLPFPLVFETNEDMVWSKNQKGRLCVHFNGLSDLIFEVYCGNRQLHWFQRFLEDQQTKRKSKNQHSSGLFTLRNGHLVWLEGEGKGEPWNLHHLTLYCCVDNRLWTEEGTEIVRQEKADEITKFITNMKKKSDLSDTQQALIQRKQSTLTRINNSFERPSQPLYQGQSHILVGVSLGLEKPATVAVVDAIANKVLAYRSIKQLLGDNYELLNRQRRQQQYLSHERHKAQKNFSPNQFGASELGQHIDRLLAKAIVALARTYKAGSIVLPKLGDMREVVQSEIQAIAEQKFPGYIEGQQKYAKQYRVNVHRWSYGRLIQSIQSKAAQTGIVIEEGKQPIRGSPHDKAKELALSAYNLRLTRRS 698 T 0.0027 RuvC_1 unphh F Bacteria T 8bd6 1 A A A0A8X6EH11_9CYAN Cas12k MGSSHHHHHHSGGGSGGSAWSHPQFEKGGGSGGGSGGSAWSHPQFEKSGGGENLYFQSNASQITIQARLISFESNRQQLWKLMADLNTPLINELLCQLGQHPDFEKWQQKGKLPSTVVSQLCQPLKTDPRFAGQPSRLYMSAIHIVDYIYKSWLAIQKRLQQQLDGKTRWLEMLNSDAELVELSGDTLEAIRVKAAEILAIAMPASESDSASPKGKKGKKEKKPSSSSPKRSLSKTLFDAYQETEDIKSRSAISYLLKNGCKLTDKEEDSEKFAKRRRQVEIQIQRLTEKLISRMPKGRDLTNAKWLETLLTATTTVAEDNAQAKRWQDILLTRSSSLPFPLVFETNEDMVWSKNQKGRLCVHFNGLSDLIFEVYCGNRQLHWFQRFLEDQQTKRKSKNQHSSGLFTLRNGHLVWLEGEGKGEPWNLHHLTLYCCVDNRLWTEEGTEIVRQEKADEITKFITNMKKKSDLSDTQQALIQRKQSTLTRINNSFERPSQPLYQGQSHILVGVSLGLEKPATVAVVDAIANKVLAYRSIKQLLGDNYELLNRQRRQQQYLSHERHKAQKNFSPNQFGASELGQHIDRLLAKAIVALARTYKAGSIVLPKLGDMREVVQSEIQAIAEQKFPGYIEGQQKYAKQYRVNVHRWSYGRLIQSIQSKAAQTGIVIEEGKQPIRGSPHDKAKELALSAYNLRLTRRS 698 T 0.0027 RuvC_1 unphh F Bacteria T 8be6 3 C P SOS1-HRas-peptidomimetic2 XHPWSVAX 8 T 0.079 DUF3019 pdbhh F T 8be7 3 C P SOS1-HRas-peptidomimetic3 KXHPWSVAX 9 T 0.03 DUF3019 pdbhh F T 8be8 3 C P SOS1-HRas-peptidomimetic4 KXHPWSVAX 9 T 0.03 DUF3019 pdbhh F T 8be9 3 C P SOS1-HRas-peptidomimetic5 KXHPWSVAX 9 T 1 DUF3019 pdbhh F T 8bea 3 C P SOS1-HRas-peptidomimetic10 XXHPWSV 7 T 1.6 DUF3019 pdbhh F T 8bef 18 R u Q8VZ65_ARATH Uncharacterized protein At1g67785 MVKVLTYFGMTLAAFAFWQSMDKVHVWIALHQDEKQERMEKEAEVRRVRAELLRKAREEDPLA 63 T 0.01 DUF6082 pdb F Eukaryota T 8bef 19 S v UMP2_ARATH Uncharacterized protein At2g27730, mitochondrial MATRNALRIVSRRFSSGKVLSEEERAAENVFIKKMEQEKLQKLARQGPGEQAAGSASEAKVAGATASASAESGPKVSEDKNRNYAVVAGVVAIVGSIGWYLKAGGKKQPEVQE 113 T 0.01 IATP pdbhh F Eukaryota T 8beh 4 D c Q8VZT9_ARATH Transmembrane protein MGGGDHGHGAEGGDFRAKVWSMTGGPNCRPKHWRRNTAIAMFGVFLVCIPIAKLSAKLEQRPHMPVRPIPSQIWCKNFGTKDDYEKEH 88 T 1.8 Chordopox_A13L pdbhh F Eukaryota T 8beh 9 I l NDUB8_ARATH NADH dehydrogenase [ubiquinone] 1 beta subcomplex subunit 8, mitochondrial MAGRLSGVASRIMGGNGVVARSVGSSLRQRAGMGLPVGKHIVPDKPLSVNDELMWDNGTAFPEPCIDRIADTVGKYEALAWLSGGLGFFVGLGLLAVLNDKASKVPFTPRVYPYDNLRVELGGEP 125 T 0.00028 NDUF_B8 pdbhh F Eukaryota T 8beh 13 M p NDBAB_ARATH NADH dehydrogenase [ubiquinone] 1 beta subcomplex subunit 10-B MGRKKGLPEFEESAPDGFDPENPYKDPVAMVEMREHIVREKWIQIEKAKILREKVKWCYRVEGVNHYQKCRHLVQQYLDSTRGVGWGKDHRPISLHGPKPEAVEAE 106 T 0.00012 NDUFB10 pdb F Eukaryota T 8bel 7 G,N J,T UCRY_ARATH COMPLEX III SUBUNIT 10,COMPLEX III SUBUNIT XI,UBIQUINOL-CYTOCHROME C OXIDOREDUCTASE SUBUNIT 10 MAGTSGLLNAVKPKIQTIDIQAAAGWGIAAAAGAIWVVQPFGWIKKTFIDPPPTEEK 57 T 0.0016 QCR10 unppssm F Eukaryota T 8bf1 2 B B PRGC1_HUMAN PGC-1-ALPHA,PPAR-GAMMA COACTIVATOR 1-ALPHA,PPARGC-1-ALPHA,LIGAND EFFECT MODULATOR 6 QEAEEPSLLKKLLLAPANT 19 T 3.3 Apo-CIII pdbhh F Eukaryota T 8bfc 2 B B RND3_HUMAN PROTEIN MEMB,RHO FAMILY GTPASE 3,RHO-RELATED GTP-BINDING PROTEIN RHO8,RND3 TDLRKDKAKSCTVM 14 T 25 DUF3012 pdbhh F Eukaryota T 8bfd 1 A A 310HD-U2U5 XGEXXXXKEXXXXKEXXXXKEXXXXKXXXWKGX 33 T 100 DUF4699 pdbhh F F 8bfd 2 B B D-310HD-U2U5 XGXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGX 33 F F F 8bfe 1 A,B,C,D,E,F,G,H,I,J,K,L,M,N,O,P A,B,C,D,E,F,G,H,I,J,K,L,M,N,O,P CC-TypeN-LaUbUcLd XGELXXLKQELXXLKWELXXLKEELXXLKYGX 32 T 0.0005 DUF5320 pdbhh F T 8bfj 2 B B GGNB2_HUMAN LARYNGEAL CARCINOMA-RELATED PROTEIN 1,PROTEIN ZNF403 DEEIFISQDEIQSFMANNQSFYSNREQYRQHLKEKF 36 T 0.012 Clr5 pdbpssm F Eukaryota T 8bfk 1 A,B,C,D,E,F,G,H,I,J,K,L,M,N,O,P,Q,R A,B,C,D,E,F,G,H,I,J,K,L,M,N,O,P,Q,R A0A7U0GBC4_9CAUD TAIL INNER TUBE AKKDPNTIMSANSSYANGANGSVTNLEIPAAFGYTPDFRYYHAAADYTRRPTIAFLMELPNCFKDTDDAAKWGGSLKALIEMHSRTIDGLDYTLEVEHVETPFGGGGEMMQTLSKVRRARSVPVFTWVEKIGMPVSRFWNNYILYFMGEPNSNVAGIIGKGGITPAATYPDYNTFSVLFVEPDPTERYALRSTLITNMQPTGQGPEMRMSKDQTSSPEQLQISQTFTGLQMVGRGVDKLGQMMLDRASQTGIDLNAQPAFLSDREADVAARTDGYIDQLVSSLSKPGVAI 290 T 0.48 Phage_T4_gp19 pdbhh T Viruses T 8bfl 1 A,AA,B,BA,C,CA,D,DA,E,EA,F,FA,G,GA,H,HA,I,IA,J,JA,K,KA,L,LA,M,MA,N,NA,O,OA,P,PA,Q,R,S,T,U,V,W,X,Y,Z A,a,B,b,C,c,D,d,E,e,F,f,G,g,H,h,I,i,J,j,K,k,L,l,M,m,N,n,O,o,P,p,Q,R,S,T,U,V,W,X,Y,Z A0A7U0GBA8_9CAUD Major head protein HEFAELFYRTYIVTPDQAGFQLSIRRNLVWEGWTGEGLSGEKQEISKRNILQGLLDYTTLETNSTELIPVIQSGENDEQFIDPSVLPAQTVKQGKDTFDTNFLKFSENGEGFNLLMLAQTPSRLKKGSMTFTDSLDSRIALKQLLISVTKGGTTELFALDVNRDQYAAYTATREYNFRLMQLKFHTSLGLGEESTTVAGAESALLKDLFDLGYRIELDVKVDGEMNVENGNGDTSLRALRLARVFDKEGKEIALTDSRVSAALSGLTVTGVGYSLEARLTNINQLEMGLLIDSDVQKQGFMIPTLPPLVIVKPAMVEDDKTYPRLEALTTAYRIQQMRNNAVTTLLNRADTLKSYLGVGVPHPIESNLGLEGVGQYYVRPYYNEATIDVLNDLNNLTSAAKQTDIQGLIVSKINEMVYTADQLTGYTAALEAAFSGRSPKPHVAIGTDMRLPQYIQINGDDRTVGIGYDYTIARISDLRMKDKIVMTFILPNESEPHPLQHGVLGFIPEYLVDFNMIRNQRIGREIRLTPRYRYFNFLPIMLVINVINLEEAIAQRTALDVNETQVTPAS 570 T 0.26 NigD_N unppssm T Viruses T 8bfp 1 A,AA,B,BA,C,CA,D,DA,E,EA,F,FA,G,GA,H,HA,I,IA,J,K,L,M,N,O,P,Q,R,S,T,U,V,W,X,Y,Z A,b,B,c,C,d,D,e,E,f,G,g,H,h,I,i,J,j,K,L,M,N,O,P,Q,R,S,T,U,V,W,X,Y,Z,a A0A7U0GBA8_9CAUD Major head protein HEFAELFYRTYIVTPDQAGFQLSIRRNLVWEGWTGEGLSGEKQEISKRNILQGLLDYTTLETNSTELIPVIQSGENDEQFIDPSVLPAQTVKQGKDTFDTNFLKFSENGEGFNLLMLAQTPSRLKKGSMTFTDSLDSRIALKQLLISVTKGGTTELFALDVNRDQYAAYTATREYNFRLMQLKFHTSLGLGEESTTVAGAESALLKDLFDLGYRIELDVKVDGEMNVENGNGDTSLRALRLARVFDKEGKEIALTDSRVSAALSGLTVTGVGYSLEARLTNINQLEMGLLIDSDVQKQGFMIPTLPPLVIVKPAMVEDDKTYPRLEALTTAYRIQQMRNNAVTTLLNRADTLKSYLGVGVPHPIESNLGLEGVGQYYVRPYYNEATIDVLNDLNNLTSAAKQTDIQGLIVSKINEMVYTADQLTGYTAALEAAFSGRSPKPHVAIGTDMRLPQYIQINGDDRTVGIGYDYTIARISDLRMKDKIVMTFILPNESEPHPLQHGVLGFIPEYLVDFNMIRNQRIGREIRLTPRYRYFNFLPIMLVINVINLEEAIAQRTALDVNETQVTPAS 570 T 0.26 NigD_N unppssm T Viruses T 8bft 2 B B OBG_ECOLI GTP-BINDING PROTEIN OBG LEEIAEEDDEDWDDDWDEDD 20 T 0.38 DUF1967 unppercent F Bacteria T 8bgm 1 A,C A,C A0A5P3XKM0_PARBF ORFX1 MHHHHHHENLYFQGNREFPFHFNDGNVSMNGLFCLKKIKTQYHPNYDYFKIKFCEGFLSIKNKVKDDLCEYDLKNIESVIALKREYSKENNLKNKESAIFMNIGNKGIHNKYDLYVVNVDINNILDENYMLKGILNDKLKILFLGNERKLLRIKN 155 T 0.013 RTBV_P12 unp F Bacteria T 8bi7 2 B B E2AK2_HUMAN INTERFERON-INDUCED, DOUBLE-STRANDED RNA-ACTIVATED PROTEIN KINASE,EUKARYOTIC TRANSLATION INITIATION FACTOR 2-ALPHA KINASE 2,EIF-2A PROTEIN KINASE 2,INTERFERON-INDUCIBLE RNA-DEPENDENT PROTEIN KINASE,P1/EIF-2A PROTEIN KINASE,PROTEIN KINASE RNA-ACTIVATED,PKR,PROTEIN KINASE R,TYROSINE-PROTEIN KINASE EIF2AK2,P68 KINASE KSPEKNERHTC 11 T 1.3 VEK-30 pdbhh F Eukaryota T 8bi8 1 A,B A,B NOS1_HUMAN CONSTITUTIVE NOS,NC-NOS,NOS TYPE I,NEURONAL NOS,N-NOS,NNOS,PEPTIDYL-CYSTEINE S-NITROSYLASE NOS1,BNOS QTHLETTFTGDGTPKTIRVTQXG 23 T 13 DUF5377 pdbhh F Eukaryota T 8bi9 1 A,B,C,D B,A,C,D NOS1_HUMAN CONSTITUTIVE NOS,NC-NOS,NOS TYPE I,NEURONAL NOS,N-NOS,NNOS,PEPTIDYL-CYSTEINE S-NITROSYLASE NOS1,BNOS QTHLETTFWGDGEPKTIRVTQXG 23 T 9.5 hemP pdbhh F Eukaryota T 8bip 47 UA 8 Nascent peptide chain XXXXXXXXXXXXXXXXXXXXXXX 23 F F F 8bmw 1 A,B,C,D,E A,B,C,D,E A0A157T170_SACSO CRISPR-associated small subunit protein (Type III-D) MRVKHYIQREFNYSVSSQDLLDIATRIAISAIKPKPKSNKPEPYVDSSTINSLLSFLQSRRNVNELLLYIMRQAGRDEIDEETGKLLLASLKDRELKDAVNLLGYVKWVYDTLTGLKVNYNNVKGVKTFKELVNILSKV 139 T 0.081 HTH_33 pdb F Archaea T 8bon 2 B,C,F E,D,F Macrocyclic peptide S1B3inL1 YRRPREQIIIGSLWVFXGX 19 T 3.4 RIC1 pdbhh F T 8boz 2 C,E,F,H,J,L,N,P B,D,F,H,J,L,N,P A0A2G9AAX8_ECOLX Lipoprotein MLKEWMIFTCSLLTLAGASLPLSGCISRGQESISEGAAFGAGILREPGATKKADTKDLNVPPPVYGPPQVIFRIDDNRYFTLENYTHCENGQTFYNNKAKNIHVKILDASGYLFKGRLFWLSTRDDFLAFPATLNTRHASCMGSNKGCMNAVIVTTDGGKRRSGVPYGSYTQNPTGATRDYDMLVMNDGFYLLRYRGGQGRFSPVILRWILSTEDSSGVVRSEDAYELFRPGEEVPSTGFYKIDLSRFYPKNNVMEMQCDRTLEPVQPSESKIQ 274 T 0.012 BNR unphh F Bacteria T 8bpn 1 A,B,C,D A,B,C,D W0DP94_9GAMM Twin-arginine translocation signal domain-containing protein MSYYHHHHHHDYDIPTTENLYFQGAMGKYVKVQDFYDQLGKYVLVAPGKFSGTVAATDLSTGWTMAWLAAWNYGDTCPIMHHMAAFPSPDPYKEFEFVVNTQGGKNLFIYGVPVTVEDPGEGMKIYRIKYDGTRMNLQRDAAEVSGLGLGVHVTITPEADGYAVGDGQKDICAEFDRETDMVRYAWAFDWDPNVKDLKRAWLDGGTMTIKRLKPTLPGGRYDLQGSKGNKIDWELVPGGELAIEDGKVSGDRPLHSVANDALVFDPRGKWAVASMRLPGVCVVFDRENQVPVAVLAGPKGTPSQFQLVKVDDDTWTVDIPEVISAGHQAGFSPDGQSFLFMNSLRQNNIMVWDSSNHDDPTTWEKKAVVESPDWRGAYPNTQHMVFTPDAKKIYVTMWWPSPTPNGIAVIDAVNWEVLKEVDLGPDMHTLAITYDGKFVVGTLSGYQNTASAIVVMETETDEVLGFLPSPMGHHDNVIVPRTLEDLRISRSTTT 494 T 0.0037 Cytochrom_D1 unppercent F Bacteria T 8bpo 4 D D1 Nascent polypeptide-associated complex subunit alpha N-terminal region XXXXXXXXXXLXXXXXPXXXXXXXXXXXX 29 T 4700 zf-RING_11 pdbhh F F 8bpx 29 CA c Q8VZT9_ARATH Transmembrane protein MGGGDHGHGAEGGDFRAKVWSMTGGPNCRPKHWRRNTAIAMFGVFLVCIPIAKLSAKLEQRPHMPVRPIPSQIWCKNFGTKDDYEKEH 88 T 1.8 Chordopox_A13L pdbhh F Eukaryota T 8bpx 37 KA l NDUB8_ARATH NADH dehydrogenase [ubiquinone] 1 beta subcomplex subunit 8, mitochondrial MAGRLSGVASRIMGGNGVVARSVGSSLRQRAGMGLPVGKHIVPDKPLSVNDELMWDNGTAFPEPCIDRIADTVGKYEALAWLSGGLGFFVGLGLLAVLNDKASKVPFTPRVYPYDNLRVELGGEP 125 T 0.00028 NDUF_B8 pdbhh F Eukaryota T 8bpx 41 OA p NDBAB_ARATH NADH dehydrogenase [ubiquinone] 1 beta subcomplex subunit 10-B MGRKKGLPEFEESAPDGFDPENPYKDPVAMVEMREHIVREKWIQIEKAKILREKVKWCYRVEGVNHYQKCRHLVQQYLDSTRGVGWGKDHRPISLHGPKPEAVEAE 106 T 0.00012 NDUFB10 pdb F Eukaryota T 8bpx 43 QA u Q8VZ65_ARATH Uncharacterized protein At1g67785 MVKVLTYFGMTLAAFAFWQSMDKVHVWIALHQDEKQERMEKEAEVRRVRAELLRKAREEDPLA 63 T 0.01 DUF6082 pdb F Eukaryota T 8bpx 44 RA v UMP2_ARATH Uncharacterized protein At2g27730, mitochondrial MATRNALRIVSRRFSSGKVLSEEERAAENVFIKKMEQEKLQKLARQGPGEQAAGSASEAKVAGATASASAESGPKVSEDKNRNYAVVAGVVAIVGSIGWYLKAGGKKQPEVQE 113 T 0.01 IATP pdbhh F Eukaryota T 8bpx 57 EB,OB AJ,BJ UCRY_ARATH COMPLEX III SUBUNIT 10,COMPLEX III SUBUNIT XI,UBIQUINOL-CYTOCHROME C OXIDOREDUCTASE SUBUNIT 10 MAGTSGLLNAVKPKIQTIDIQAAAGWGIAAAAGAIWVVQPFGWIKKTFIDPPPTEEK 57 T 0.0016 QCR10 unppssm F Eukaryota T 8bq5 29 CA c Q8VZT9_ARATH Transmembrane protein MGGGDHGHGAEGGDFRAKVWSMTGGPNCRPKHWRRNTAIAMFGVFLVCIPIAKLSAKLEQRPHMPVRPIPSQIWCKNFGTKDDYEKEH 88 T 1.8 Chordopox_A13L pdbhh F Eukaryota T 8bq5 37 KA l NDUB8_ARATH NADH dehydrogenase [ubiquinone] 1 beta subcomplex subunit 8, mitochondrial MAGRLSGVASRIMGGNGVVARSVGSSLRQRAGMGLPVGKHIVPDKPLSVNDELMWDNGTAFPEPCIDRIADTVGKYEALAWLSGGLGFFVGLGLLAVLNDKASKVPFTPRVYPYDNLRVELGGEP 125 T 0.00028 NDUF_B8 pdbhh F Eukaryota T 8bq5 41 OA p NDBAB_ARATH NADH dehydrogenase [ubiquinone] 1 beta subcomplex subunit 10-B MGRKKGLPEFEESAPDGFDPENPYKDPVAMVEMREHIVREKWIQIEKAKILREKVKWCYRVEGVNHYQKCRHLVQQYLDSTRGVGWGKDHRPISLHGPKPEAVEAE 106 T 0.00012 NDUFB10 pdb F Eukaryota T 8bq5 43 QA u Q8VZ65_ARATH Uncharacterized protein At1g67785 MVKVLTYFGMTLAAFAFWQSMDKVHVWIALHQDEKQERMEKEAEVRRVRAELLRKAREEDPLA 63 T 0.01 DUF6082 pdb F Eukaryota T 8bq5 44 RA v UMP2_ARATH Uncharacterized protein At2g27730, mitochondrial MATRNALRIVSRRFSSGKVLSEEERAAENVFIKKMEQEKLQKLARQGPGEQAAGSASEAKVAGATASASAESGPKVSEDKNRNYAVVAGVVAIVGSIGWYLKAGGKKQPEVQE 113 T 0.01 IATP pdbhh F Eukaryota T 8bq5 57 EB,OB AJ,BJ UCRY_ARATH COMPLEX III SUBUNIT 10,COMPLEX III SUBUNIT XI,UBIQUINOL-CYTOCHROME C OXIDOREDUCTASE SUBUNIT 10 MAGTSGLLNAVKPKIQTIDIQAAAGWGIAAAAGAIWVVQPFGWIKKTFIDPPPTEEK 57 T 0.0016 QCR10 unppssm F Eukaryota T 8bq6 29 CA c Q8VZT9_ARATH Transmembrane protein MGGGDHGHGAEGGDFRAKVWSMTGGPNCRPKHWRRNTAIAMFGVFLVCIPIAKLSAKLEQRPHMPVRPIPSQIWCKNFGTKDDYEKEH 88 T 1.8 Chordopox_A13L pdbhh F Eukaryota T 8bq6 37 KA l NDUB8_ARATH NADH dehydrogenase [ubiquinone] 1 beta subcomplex subunit 8, mitochondrial MAGRLSGVASRIMGGNGVVARSVGSSLRQRAGMGLPVGKHIVPDKPLSVNDELMWDNGTAFPEPCIDRIADTVGKYEALAWLSGGLGFFVGLGLLAVLNDKASKVPFTPRVYPYDNLRVELGGEP 125 T 0.00028 NDUF_B8 pdbhh F Eukaryota T 8bq6 41 OA p NDBAB_ARATH NADH dehydrogenase [ubiquinone] 1 beta subcomplex subunit 10-B MGRKKGLPEFEESAPDGFDPENPYKDPVAMVEMREHIVREKWIQIEKAKILREKVKWCYRVEGVNHYQKCRHLVQQYLDSTRGVGWGKDHRPISLHGPKPEAVEAE 106 T 0.00012 NDUFB10 pdb F Eukaryota T 8bq6 43 QA u Q8VZ65_ARATH Uncharacterized protein At1g67785 MVKVLTYFGMTLAAFAFWQSMDKVHVWIALHQDEKQERMEKEAEVRRVRAELLRKAREEDPLA 63 T 0.01 DUF6082 pdb F Eukaryota T 8bq6 44 RA v UMP2_ARATH Uncharacterized protein At2g27730, mitochondrial MATRNALRIVSRRFSSGKVLSEEERAAENVFIKKMEQEKLQKLARQGPGEQAAGSASEAKVAGATASASAESGPKVSEDKNRNYAVVAGVVAIVGSIGWYLKAGGKKQPEVQE 113 T 0.01 IATP pdbhh F Eukaryota T 8bq6 57 EB,OB AJ,BJ UCRY_ARATH COMPLEX III SUBUNIT 10,COMPLEX III SUBUNIT XI,UBIQUINOL-CYTOCHROME C OXIDOREDUCTASE SUBUNIT 10 MAGTSGLLNAVKPKIQTIDIQAAAGWGIAAAAGAIWVVQPFGWIKKTFIDPPPTEEK 57 T 0.0016 QCR10 unppssm F Eukaryota T 8bqv 2 B C Unknown protein XXXXXXX 7 F F F 8bqw 2 C,D B,D Unknown protein XXXXXXXXX 9 F F F 8br3 24 X LM0 S7XVN9_SPRLO Transposase MYFIKPGCLIKKKFSIYTSIVISVIDNNSVVIQSYDKSDNGDVISIDREVINVSKIVPIGNIDIKNKSKKEIDGILVEENRNLKNKDDVLLMNDFERFKEQLKKEVEDMVIEEMA 115 T 0.00024 Ribosomal_L14e pdbhh F Eukaryota T 8br8 46 TA SA RSSA_GIAIC 40S ribosomal protein SA MSTEKTSQASKEYQLKEADVKKMLVATTHLGVRNIDRRMQFYIFDRQKDGTFVFNLQKVWAKIVFAARILVTIDDPAEIAVVANRPDAQRAILKFCKYTHATAFPGRFIPGNFTNRMNPNYCEPRLLLVNDPVVDRQAILEASYVNIPTISLCNSDANLKFIDVAIPCNNKTPMSIGLIYWLLAREVLRLKGSISRTEEWDVKPDLFVALPEEIPDEEESEDFYDDDEEEDEEFSAGNGNLFDEY 245 T 1.1E-11 Ribosomal_S2 pdb F Eukaryota T 8brm 45 SA SA A0A644FB17_GIAIC Ribosomal protein SA MSTEKTSQASKEYQLKEADVKKMLVATTHLGVRNIDRRMQFYIFDRQKDGTFVFNLQKVWAKIVFAARILVTIDDPAEIAVVANRPDAQRAILKFCKYTHATAFPGRFIPGNFTNRMNPNYCEPRLLLVNDPVVDRQAILEASYVNIPTISLCNSDANLKFIDVAIPCNNKTPMSIGLIYWLLAREVLRLKGSISRTEEWDVKPDLFVALPEEIPDEEESEDFYDDDEEEDEEFSAGNGNLFDEY 245 T 1.1E-11 Ribosomal_S2 pdb F Eukaryota T 8bro 1 A A B4EH86_BURCJ Lectin GHMPLLSASIVSAPVVTSETYVDIPGLYLDVAKAGIRDGKLQVILNVPTPYATGNNFPGIYFAIATNQGVVADGCFTYSSKVPESTGRMPFTLVATIDVGSGVTFVKGQWKSVRGSAMHIDSYASLSAIWGTAA 134 T 0.002 DUF1543 unppercent F Bacteria T 8bsi 46 TA SA A0A644FB17_GIAIC 40S ribosomal protein SA MSTEKTSQASKEYQLKEADVKKMLVATTHLGVRNIDRRMQFYIFDRQKDGTFVFNLQKVWAKIVFAARILVTIDDPAEIAVVANRPDAQRAILKFCKYTHATAFPGRFIPGNFTNRMNPNYCEPRLLLVNDPVVDRQAILEASYVNIPTISLCNSDANLKFIDVAIPCNNKTPMSIGLIYWLLAREVLRLKGSISRTEEWDVKPDLFVALPEEIPDEEESEDFYDDDEEEDEEFSAGNGNLFDEY 245 T 1.1E-11 Ribosomal_S2 pdb F Eukaryota T 8bsj 45 SA SA A0A644FB17_GIAIC 40S ribosomal protein SA MSTEKTSQASKEYQLKEADVKKMLVATTHLGVRNIDRRMQFYIFDRQKDGTFVFNLQKVWAKIVFAARILVTIDDPAEIAVVANRPDAQRAILKFCKYTHATAFPGRFIPGNFTNRMNPNYCEPRLLLVNDPVVDRQAILEASYVNIPTISLCNSDANLKFIDVAIPCNNKTPMSIGLIYWLLAREVLRLKGSISRTEEWDVKPDLFVALPEEIPDEEESEDFYDDDEEEDEEFSAGNGNLFDEY 245 T 1.1E-11 Ribosomal_S2 pdb F Eukaryota T 8bss 2 B B THAN_PODMA Thanatin-like derivative VPIIYXNRXTXKCXXY 16 T 2.1 Fuz_longin_3 pdbhh F Eukaryota T 8bt6 1 A A A0A151HKA5_TOXGO Putative anonymous antigen-1 GMAIGKVKRNPDAGVATAVQSVIQHQTFKRMLLFGLRSLADFCSPSNQLYQENALDALDRGVLSAIQTAVTTFSDDDDLMLCASRVLWAMSVAIKEEMDPAHIARVHSEGSPVIVAVVNSSPTDPQTIEDSMNFVDNLKRAGAPVDGASLAGGMLSIFTKTALDMKTAKRVTAALAIAAETAEGSTALYNAGGTSVLLTYCLDQGDLSDAGVEMVEGAFDTVRYMAGYQCTDATTLPQCIALMDKYRGRKSASAKGSSALAAMIGPEQLQKCLNTLKTAEAGSAEYDEALVTLGSMSYISSFTDEIVRAGGVPLLIELINSGLPQMEGNPEKIASMISGAAKMLARIASNPVNVDAIVQAGGVATLCTAVSYCTESMEALGALCMALVPLASRESLAHEIVQYQTFATVLPILYQNVESPEIAALAMELVATGSQHEEIQEHMLQNQAAEICSLCCQYHTADASYQQHAISALNRLVPRLTTLHGVSEYGGIQGVIASLNANVNNEQVALLAVQLLDNFSEVSDAKTYMSDGTCVDAVLAAMLEHEGNDLLISAGVHCLARIATEDDCARHLNVLDTAIQTARGNPDGVYRVLAAISGLSRVPSLRQIFEEKNASDTILAGISSWIECSRFEGQNRIIKAALKTVKNMKISGDGDLTSCFAAMCDVACLPQVKRVVELEEPDNNILVADTAAFRDLAATMRITGAENLERCIESVLRVMRKYPDSRRAQLNCLETLNYLAQCDGGEGVAILSRTGGLNAVVQYLTRAPMYLDAQIAGFTVLATSAKIDSNVGETLRKCNCLQALKVAMRTHAKSKELKRTIAPLVALLMPTDALETEIQELLNECASACEKNNFPHLHENLAALNELLISSEGAKIAARLGIGAHMCKYQEYISAHEQDALAVTDYDILGKDLFDATVSECAHAMEQVASTRSGRNALIKAGNVATLISLYESLKAPQSQYSEEAAIHCLEALRILLKSDKRSAELAFERNFVSTLCVGIDSFPHSAPVLGATCACLAAMATTPERVQMLTAQPAFESLLQKLVFVIQNDPSKDNKLVAMRALQELVEITNDATMANKIAEAGAVTALFRIIDEYGDDEQLTVQAAEVLALLGAFEDLRRFYDNDVRFPAQVLTAALTKQKNNETAVVHLLDVLNKLATSEDRAVLRELGVMEQVADAMRVHSESEAVTRLGGELFAKMGADEQIKSLMLQIIETVESGAEDTAQTVDILCGRLAVFLAAPLEDPRDALQHTEKCLGSLVATLQTYPGSERLEGNVALVCRRLCDRCFDDADDPYGAWAVAASGMLAQFAGMVAGETVLANKKFLGPAYRTFTACCANAYCMPTMVEVAPSFLPQTYTLLEMHKNDAETVARVLEFLRYFAEDPTACGLIVQNMSGSSGDVVALTVLLMQQHQNNDAVVCAGMEFLGALAYTLSQAGYEPLPTLADGSVLRDCDALMGSNSSSARQLAHMHMIEKMLLSKAYNDALIQEQALKKLTMSLKAEDDKKRFSDEERAGLYAAMACVLLAAGGAGLTGEMEKFNGFEVVLQAIEEFGENPTVIKEVNRALQGLSMADVNMTARTVKEAVPKLCTEATTAIQTDAECADTFCDLMLQLVSQEGNGRQLLQVYGLEETLQGVENLAAYYGEDFGTQLSEKVAMIRQAMEDDQPREKTCKDVYDLLNSRVQQGLSVAISEVAILQEEVEFLVSQMGMYNQEQLDHQTAMGADHQYGNMAFELLAATSANVKLLQANEFSKMELALIKGQADPEIVLYAVKALTAFCKFPPAAQDTARIQGCPALVTEACSKINKSGLPNERKEEHLCARYFLVERTAINRNLYNKTPIMTELINSWNDYDKGAYTTTLLRFVFRAMRRVVSDAHVEELLKANVLQRLIGIISDVNADMALLPDVLFLLGSLAVVPEIKTKIGELNGIAACTDLLQRALPKPNTAPVVTNVCLAFANICIGHKKNTEIFSKLGGPALNVKVLNDRGHEYDVCNAASVLLCNLLYKNESMKKLLGTNGAPAALVKGLSNYDGSEEKTAIRCLESVFKAISNLSLYTPNIQPFLDAGIENAYSTWLSNLSETFPDAQLETGCRTLVNLVMENEENNMRKFGVCLLPCMAVAKQGRTDTKALLLLLDIEASLCRLKENAEAFAANGGIETTIRLIHQFDYDVGLLTLGIHLLGIQSAVKDSIQRMMDADVFSILVGCVEVDAEGNEVTDLVVGGLRCTRRIVRSEELAFEYCNAGGIATIANVICKSINQPMVMLEACRVLLGLLFYTTRSQADRQAAVEALHAQCQQRAEQMHAQAQADYEAGVVSEPPPEEMEVPEPDPDELANAAYGGWYQMGMDEVMIDAILQAVCACAAVEAHAKQLRLQRVCLGLAAYFASEQMGTSSLVGSGIEQVLTQIMTNFAGEGTTMQLSCVIINSIAMTSGDMYEEIKTSALLSALKTSVGKMATKKPEEKALKETCAATLEAASSGEDPFDAFSKTVTELDFKFTEWNVDPYPNGVHDLPSNVKEALRKGGKLKVFLPEKEKEEIRWRSSQDLNVFEWCMGNDQDYNNRIPIVRIRNVAKGLVHPALKAAAKKEPRKVAAKFTMCLFGPPNDDFPEGVELPMVAKSQKERDAFVEMMVQWRDAATYNFHHHHHH 2646 T 0.00073 Arm_2 unppercent F Eukaryota T 8btd 45 SA SA A0A644FB17_GIAIC Ribosomal protein SA MSTEKTSQASKEYQLKEADVKKMLVATTHLGVRNIDRRMQFYIFDRQKDGTFVFNLQKVWAKIVFAARILVTIDDPAEIAVVANRPDAQRAILKFCKYTHATAFPGRFIPGNFTNRMNPNYCEPRLLLVNDPVVDRQAILEASYVNIPTISLCNSDANLKFIDVAIPCNNKTPMSIGLIYWLLAREVLRLKGSISRTEEWDVKPDLFVALPEEIPDEEESEDFYDDDEEEDEEFSAGNGNLFDEY 245 T 1.1E-11 Ribosomal_S2 pdb F Eukaryota T 8btk 14 N BK Nascent chain XXXXXXXXXXXXXXXXXXXXXXXXXXXXX 29 F F F 8btr 45 SA SA A0A644FB17_GIAIC Ribosomal protein SA MSTEKTSQASKEYQLKEADVKKMLVATTHLGVRNIDRRMQFYIFDRQKDGTFVFNLQKVWAKIVFAARILVTIDDPAEIAVVANRPDAQRAILKFCKYTHATAFPGRFIPGNFTNRMNPNYCEPRLLLVNDPVVDRQAILEASYVNIPTISLCNSDANLKFIDVAIPCNNKTPMSIGLIYWLLAREVLRLKGSISRTEEWDVKPDLFVALPEEIPDEEESEDFYDDDEEEDEEFSAGNGNLFDEY 245 T 1.1E-11 Ribosomal_S2 pdb F Eukaryota T 8bv1 2 G,H,I,J,K,L N,G,K,H,L,I P4 peptide inhibitor of histone chaperone ASF1 XEKXARLARRIAX 13 T 3.5 LsmAD pdbhh F T 8bvf 2 C,D C,D FTSZ_CORGL Cell division protein FtsZ DDLDVPSFLQ 10 T 2.2 DUF4809 pdbhh F Bacteria T 8bvp 1 A A Q5ZSL3_LEGPH Restriction endonuclease GGAMSIPCKWLKKDKGDYSIPFPTGTTSIPEETIPSAIVLQPVANENTVISGYKLKDTVSSPEKAQEVNNKTVSPRTPKIIVKHDNSLQSLTIMDIYSQKPIQFDESKVDEIIHSLETKKVNLEKAIEDNNAELSKIKKQKSKLAYLTRLYKENKENIQDYCTLNEYIEAHLFNPKFLSRHEKALNNFKALKSQFTGPVNLKELEKLTDKLTGIKEYSYDFHSNSLPYDLEHDKSFRNFYDFDGLKESIESIIKELEVLNSIRQAVSDKYPNSFKALNETEEHDDKLKFINIIFNDGFSTTYDQQTFIKALSALDIEKAIDAYTNVKNKLENTQDIIANKEGCRNKLISELQTLIANKQEPYLSANEKLGGFYSKRKLSASEGFHLAYQANRRDPIKPEVIENIITKMKPIDEDTHLDIHIRPPDCGVFITPEDIKKFQEAGIKVNITIHEYKQNYTRRYLQQYTHDLMRQANSVQFFNAEDRENAIIAATYGDCDKRNTTEPTGVAKKIREVGEDFDLDKYPVQKYDLKGKSGLTVASQKL 542 T 0.42 Epimerase_2 unphh F Bacteria T 8bvq 6 F G Darobactin-B WNXTKRF 7 T 11 TMP pdbhh F T 8bvw 29 CA U TF2AA_HUMAN GENERAL TRANSCRIPTION FACTOR IIA SUBUNIT 1,TFIIAL,TRANSCRIPTION INITIATION FACTOR TFIIA 42 KDA SUBUNIT,TFIIA-42 MANSANTNTVPKLYRSVIEDVINDVRDIFLDDGVDEQVLMELKTLWENKLMQSRAVDGFHSEEQQLLLQVQQQHQPQQQQHHHHHHHQQAQPQQTVPQQAQTQQVLIPASQQATAPQVIVPDSKLIQHMNASNMSAAATAATLALPAGVTPVQQILTNSGQLLQVVRAANGAQYIFQPQQSVVLQQQVIPQMQPGGVQAPVIQQVLAPLPGGISPQTGVIIQPQQILFTGNKTQVIPTTVAAPTPAQAQITATGQQQPQAQPAQTQAPLVLQVDGTGDTSSEEDEDEEEDYDDDEEEDKEKDGAEDGQVEEEPLNSEDDVSDEEGQELFDTENVVVCQYDKIHRSKNKWKFHLKDGIMNLNGRDYIFSKAIGDAEW 376 F F Eukaryota T 8bvw 33 GA Y Unassigned peptide, likely XPB XXXXXXXX 8 F F F 8bvw 34 HA Z Unassigned peptide, likely TFIIE-beta XXXXXXXXXXXXXXXXXXX 19 F F F 8bwy 18 R,S V,x Docking complex 1/2 protein XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 130 F F F 8bx8 4 D D I7M008_TETTS Dynein intermediate chain 2 MPPKQTKVVASRKTVMPISRAGRAQIRRKDSNTQNNMNDQGMEDEEIDQQREGMKNQYEQLTAQELNEDMPSKMLEPKNPQAPKNITVYDYYTRKFKTDELVDQMIVHFSMDGDYIWKESNEYKTQEEIRDTKKALIKEAMRKQESEEPGANHDEEAIKQTLRNKFNYNTRECQTINPSIRERGVSTEPPPSDTICGNITQWEIFDAYYAEIMKDHQIENKKKKEVDQDKKQDQSMYSTSFKRCCKIMERMVVQNDQEDKYHDYRYYWSQGDNLEAGKNEGHLLPIWRFSNEKQRKKNVTSICWNPLYPDLFAVSLGSYDFTKQRMGLICLYSLKNTTHPEYAFNCEAGVMCLDFHPKSAALLAVGLYDGTVLVYDIRNKHKKPIYQSTVRNQKHTDPVWQVKWNPDTSKNYNFYSISSDGRVMNWILMKNKLEPEEVILLRLVGKNEEESTLIGLACGLCFDFNKFEPHIFLVGTEEGKIHKCSRAYSGQYQETYNGHLLAVYKVKWNNFHPRTFISASADWTVRIWDSKYTSQIICFDLSMMVVDAVWAPYSSTVFACATMDKVQVYDLNVDKLNKLAEQKIVKQPKLTNLSFNYKDPILLVGDSHGGVTLVKLSPNLCKSGPEIKQTEDKKAMEEFKNVKIEDYEREKMENLLA 657 T 0.0046 WD40 pdb F Eukaryota T 8bya 7 G G p27 KIP1 C-terminus AGSVEQTPKK 10 T 52 DUF1850 pdbhh F T 8byq 29 CA U TF2AA_HUMAN GENERAL TRANSCRIPTION FACTOR IIA SUBUNIT 1,TFIIAL,TRANSCRIPTION INITIATION FACTOR TFIIA 42 KDA SUBUNIT,TFIIA-42 MANSANTNTVPKLYRSVIEDVINDVRDIFLDDGVDEQVLMELKTLWENKLMQSRAVDGFHSEEQQLLLQVQQQHQPQQQQHHHHHHHQQAQPQQTVPQQAQTQQVLIPASQQATAPQVIVPDSKLIQHMNASNMSAAATAATLALPAGVTPVQQILTNSGQLLQVVRAANGAQYIFQPQQSVVLQQQVIPQMQPGGVQAPVIQQVLAPLPGGISPQTGVIIQPQQILFTGNKTQVIPTTVAAPTPAQAQITATGQQQPQAQPAQTQAPLVLQVDGTGDTSSEEDEDEEEDYDDDEEEDKEKDGAEDGQVEEEPLNSEDDVSDEEGQELFDTENVVVCQYDKIHRSKNKWKFHLKDGIMNLNGRDYIFSKAIGDAEW 376 F F Eukaryota T 8bz1 19 S U TF2AA_HUMAN GENERAL TRANSCRIPTION FACTOR IIA SUBUNIT 1,TFIIAL,TRANSCRIPTION INITIATION FACTOR TFIIA 42 KDA SUBUNIT,TFIIA-42 MANSANTNTVPKLYRSVIEDVINDVRDIFLDDGVDEQVLMELKTLWENKLMQSRAVDGFHSEEQQLLLQVQQQHQPQQQQHHHHHHHQQAQPQQTVPQQAQTQQVLIPASQQATAPQVIVPDSKLIQHMNASNMSAAATAATLALPAGVTPVQQILTNSGQLLQVVRAANGAQYIFQPQQSVVLQQQVIPQMQPGGVQAPVIQQVLAPLPGGISPQTGVIIQPQQILFTGNKTQVIPTTVAAPTPAQAQITATGQQQPQAQPAQTQAPLVLQVDGTGDTSSEEDEDEEEDYDDDEEEDKEKDGAEDGQVEEEPLNSEDDVSDEEGQELFDTENVVVCQYDKIHRSKNKWKFHLKDGIMNLNGRDYIFSKAIGDAEW 376 F F Eukaryota T 8c0o 1 A,AA,AB,AC,AD,AE,AF,B,BA,BB,BC,BD,BE,BF,C,CA,CB,CC,CD,CE,CF,D,DA,DB,DC,DD,DE,DF,E,EA,EB,EC,ED,EE,EF,F,FA,FB,FC,FD,FE,FF,G,GA,GB,GC,GD,GE,GF,H,HA,HB,HC,HD,HE,HF,I,IA,IB,IC,ID,IE,IF,J,JA,JB,JC,JD,JE,JF,K,KA,KB,KC,KD,KE,KF,L,LA,LB,LC,LD,LE,LF,M,MA,MB,MC,MD,ME,MF,N,NA,NB,NC,ND,NE,NF,O,OA,OB,OC,OD,OE,OF,P,PA,PB,PC,PD,PE,PF,Q,QA,QB,QC,QD,QE,QF,R,RA,RB,RC,RD,RE,RF,S,SA,SB,SC,SD,SE,SF,T,TA,TB,TC,TD,TE,TF,U,UA,UB,UC,UD,UE,UF,V,VA,VB,VC,VD,VE,VF,W,WA,WB,WC,WD,WE,WF,X,XA,XB,XC,XD,XE,XF,Y,YA,YB,YC,YD,YE,Z,ZA,ZB,ZC,ZD,ZE 1A,JA,RB,ZC,iA,qB,yC,AC,1B,RC,aA,iB,qC,zA,BA,JB,1C,aB,iC,rA,zB,BB,JC,SA,2A,jA,rB,zC,BC,KA,SB,aC,2B,rC,3A,CA,KB,SC,bA,jB,2C,3B,CB,KC,TA,bB,jC,sA,3C,CC,LA,TB,bC,kA,sB,4A,DA,LB,TC,cA,kB,sC,4B,DB,LC,UA,cB,kC,tA,4C,DC,MA,UB,cC,lA,tB,5A,EA,MB,UC,dA,lB,tC,5B,EB,MC,VA,dB,lC,uA,5C,EC,NA,VB,dC,mA,uB,6A,FA,NB,VC,eA,mB,uC,6B,FB,NC,WA,eB,mC,vA,6C,FC,OA,WB,eC,nA,vB,7A,GA,OB,WC,fA,nB,vC,7B,GB,OC,XA,fB,nC,wA,7C,GC,PA,XB,fC,oA,wB,8A,HA,PB,XC,gA,oB,wC,8B,HB,PC,YA,gB,oC,xA,8C,HC,QA,YB,gC,pA,xB,AA,IA,QB,YC,hA,pB,xC,AB,IB,QC,ZA,hB,pC,yA,IC,RA,ZB,hC,qA,yB A0A3S9H6T3_9VIRU C protein MGTFIELVKNMKGYKELLLPMEMVPLPAVVLKHVKLILTSQKEHQPWMTEMALKADQCLIHKATLDLAGKATSNEAKPLIEAMQQIILAMTRELWGQIQRHHYGIVQVEHYVKQITLWQDTPQAFRGDQPKPPSFRSDGPTRGQGSFRPFFRGRGRGRGRGRGSQSPARKGPLPK 175 T 0.0012 API5 unphh T Viruses T 8c17 2 B B Stapled peptide GFSPXDFHXDIXCDVXRGX 19 T 23 DUF5510 pdbhh F T 8c1n 2 C C POLG_FMDVS P3B-1,GENOME-LINKED PROTEIN VPG1 GPYAGPLERQRPLKVRAKLPRQE 23 T 5.1 RNase_HII pdbhh T Viruses T 8c29 17 OA,R u,U A9NJW3_PICSI Photosystem II 5 kDa protein, chloroplastic MASLSLCAPCNISSASSLAAGYNKVPCKSVRGGAQVGQVFMVNKPFKASQDWAVHDENVTMKKKEDDQERMQRRRMMFTAAAAAVSAAASQGMMAMAAGEKPTGPEPKRGTPEAKKLYARVCVTMPTASVCHN 133 T 0.0024 PsbQ pdb F Eukaryota T 8c2d 2 B PPP Pyrin pS208 peptide RLRRNASSAGRLQGLAGGA 19 T 55 SfsA_N pdbhh F T 8c2p 2 B B POLG_FMDVS P3B-3,GENOME-LINKED PROTEIN VPG3 GPYEGPVKKPVALKVKAKNLIVTE 24 T 44 DUF2111 pdbhh T Viruses T 8c3e 1 A A Engineered protein LCB2 GSSDDEDSVRYLLYMAELRYEQGNPEKAKKILEMAEFIAKRNNNEELERLVREVKKRL 58 T 0.0011 TPR_6 pdb F T 8c3h 1 A,B,C A,B,C A4TVL0_9PROT Cereblon isoform 4 MPLDAGGQNSTQMVLAPGASIFRCRQCGQTISRRDWLLPMGGDHEHVVFNPAGMIFRVWCFSLAQGLRLIGAPSGEFSWFKGYDWTIALCGQCGSHLGWHYEGGSQPQTFFGLIKDRLAEGPAD 124 F F Bacteria T 8c3l 1 A,C A,C Q8XAD6_ECO57 Phage repressor protein CI MQKKEIRRLRLKEWFKDKTLPPKEKSYLSQLMSGRASFGEKAARRIEQTYGMPEGYLGSSHHHHHH 66 T 0.0032 HTH_3 unppssm F Bacteria T 8c3w 1 A A dnHEM1 MVSLDQAILILVVAAKLGTTVEEAVKRALWLKTKLGVSLDQALRILSAAANTGTTVEEAVKRALKLKTKLGVSLEAALAILSAAAQLGTTVEEAVKRALKLKTKLGVDLETAALALLTAAKLGTTVEEAVKRALKLKTKLGVSLIEALHILLTAAVLGTTVEEAVYRALKLKTKLGVSLLQAAAILILAARLGTTVEEAVKRALKLKTKLGGGSGGSHHWGSGSHHHHHH 230 T 0.0037 RuvA_C pdb F T 8c4a 1 A A A0A7J6JYP1_TOXGO Putative anonymous antigen-1 KRNPDAGVATAVQSVIQHQTFKRMLLFGLRSLADFCSPSNQLYQENALDALDRGVLSAIQTAVTTFSDDDDLMLCASRVLWAMSVAIKEEMDPAHIARVHSEGSPVIVAVVNSSPTDPQTIEDSMNFVDNLKRAGAPVDGASLAGGMLSIFTKTALDMKTAKRVTAALAIAAETAEGSTALYNAGGTSVLLTYCLDQGDLSDAGVEMVEGAFDTVRYMAGYQCTDATTLPQCIALMDKYRGRKSASAKGSSALAAMIGPEQLQKCLNTLKTAEAGSAEYDEALVTLGSMSYISSFTDEIVRAGGVPLLIELINSGLPQMEGNPEKIASMISGAAKMLARIASNPVNVDAIVQAGGVATLCTAVSYCTESMEALGALCMALVPLASRESLAHEIVQYQTFATVLPILYQNVESPEIAALAMELVATGSQHEEIQEHMLQNQAAEICSLCCQYHTADASYQQHAISALNRLVPRLTTLHGVSEYGGIQGVIASLNANVNNEQVALLAVQLLDNFSEVSDAKTYMSDGTCVDAVLAAMLEHEGNDLLISAGVHCLARIATEDDCARHLNVLDTAIQTARGNPDGVYRVLAAISGLSRVPSLRQIFEEKNASDTILAGISSWIECSRFEGQNRIIKAALKTVKNMKISGDGDLTSCFAAMCDVACLPQVKRVVELEEPDNNILVADTAAFRDLAATMRITGAENLERCIESVLRVMRKYPDSRRAQLNCLETLNYLAQCDGGEGVAILSRTGGLNAVVQYLTRAPMYLDAQIAGFTVLATSAKIDSNVGETLRKCNCLQALKVAMRTHAKSKELKRTIAPLVALLMPTDALETEIQELLNECASACEKNNFPHLHENLAALNELLISSEGAKIAARLGIGAHMCKYQEYISAHEQDALAVTDYDILGKDLFDATVSECAHAMEQVASTRSGRNALIKAGNVATLISLYESLKAPQSQYSEEAAIHCLEALRILLKSDKRSAELAFERNFVSTLCVGIDSFPHSAPVLGATCACLAAMATTPERVQMLTAQPAFESLLQKLVFVIQNDPSKDNKLVAMRALQELVEITNDATMANKIAEAGAVTALFRIIDEYGDDEQLTVQAAEVLALLGAFEDLRRFYDNDVRFPAQVLTAALTKQKNNETAVVHLLDVLNKLATSEDRAVLRELGVMEQVADAMRVHSESEAVTRLGGELFAKMGADEQIKSLMLQIIETVESGAEDTAQTVDILCGRLAVFLAAPLEDPRDALQHTEKCLGSLVATLQTYPGSERLEGNVALVCRRLCDRCFDDADDPYGAWAVAASGMLAQFAGMVAGETVLANKKFLGPAYRTFTACCANAYCMPTMVEVAPSFLPQTYTLLEMHKNDAETVARVLEFLRYFAEDPTACGLIVQNMSGSSGDVVALTVLLMQQHQNNDAVVCAGMEFLGALAYTLSQAGYEPLPTLADGSVLRDCDALMGSNSSSARQLAHMHMIEKMLLSKAYNDALIQEQALKKLTMSLKAEDDKKRFSDEERAGLYAAMACVLLAAGGAGLTGEMEKFNGFEVVLQAIEEFGENPTVIKEVNRALQGLSMADVNMTARTVKEAVPKLCTEATTAIQTDAECADTFCDLMLQLVSQEGNGRQLLQVYGLEETLQGVENLAAYYGEDFGTQLSEKVAMIRQAMEDDQPREKTCKDVYDLLNSRVQQGLSVAISEVAILQEEVEFLVSQMGMYNQEQLDHQTAMGADHQYGNMAFELLAATSANVKLLQANEFSKMELALIKGQADPEIVLYAVKALTAFCKFPPAAQDTARIQGCPALVTEACSKINKSGLPNERKEEHLCARYFLVERTAINRNLYNKTPIMTELINSWNDYDKGAYTTTLLRFVFRAMRRVVSDAHVEELLKANVLQRLIGIISDVNADMALLPDVLFLLGSLAVVPEIKTKIGELNGIAACTDLLQRALPKPNTAPVVTNVCLAFANICIGHKKNTEIFSKLGGPALNVKVLNDRGHEYDVCNAASVLLCNLLYKNESMKKLLGTNGAPAALVKGLSNYDGSEEKTAIRCLESVFKAISNLSLYTPNIQPFLDAGIENAYSTWLSNLSETFPDAQLETGCRTLVNLVMENEENNMRKFGVCLLPCMAVAKQGRTDTKALLLLLDIEASLCRLKENAEAFAANGGIETTIRLIHQFDYDVGLLTLGIHLLGIQSAVKDSIQRMMDADVFSILVGCVEVDAEGNEVTDLVVGGLRCTRRIVRSEELAFEYCNAGGIATIANVICKSINQPMVMLEACRVLLGLLFYTTRSQADRQAAVEALHAQCQQRAEQMHAQAQADYEAGVVSEPPPEEMEVPEPDPDELANAAYGGWYQMGMDEVMIDAILQAVCACAAVEAHAKQLRLQRVCLGLAAYFASEQMGTSSLVGSGIEQVLTQIMTNFAGEGTTMQLSCVIINSIAMTSGDMYEEIKTSALLSALKTSVGKMATKKPEEKALKETCAATLEAASSGEDPFDAFSKTVTELDFKFTEWNVDPYP 2498 T 0.0018 Arm unppercent F Eukaryota T 8c6j 48 VA j STEEP_HUMAN STEEP MPKVVSRSVVCSDTRDREEYDDGEKPLHVYYCLCGQMVLVLDCQLEKLPMRPRDRSRVIDAAKHAHKFCNTEDEETMYLRRPEGIERQYRKKCAKCGLPLFYQSQPKNAPVTFIVDGAVVKFGQGFGKTNIYTQKQEPPKKVMMTKRTKDMGKFSSVTVSTIDEEEEEIEAREVADSYAQNAKVIEKQLERKGMSKRRLQELAELEAKKAKMKGTLIDNQFK 222 T 0.1 R_equi_Vir pdbpercent F Eukaryota T 8c8q 9 I I COX9_SCHPO CYTOCHROME C OXIDASE POLYPEPTIDE VIIA MAVGPVTGMFKRRIVTDFSVTMILGTLGACYWWFGYHKPAARQREEFYVKLAAEKNAE 58 T 0.00028 COX6C pdbpercent F Eukaryota T 8c8q 13 M M Unknown polypeptide XXXXXXXXXXXXXXXXXXXXXXXXXX 26 F F F 8c8t 1 A,D,E A,F,G Q2N0S5_9HIV1 ENV POLYPROTEIN LWVTVYYGVPVWKDAETTLFCASDAKAYETEKHNVWATHACVPTDPNPQEIHLENVTEEFNMWKNNMVEQMHTDIISLWDQSLKPCVKLTPLCVTLQCTNVTNNITDDMRGELKNCSFNMTTELRDKKQKVYSLFYRLDVVQINENQGNRSNNSNKEYRLINCNTSAITQACPKVSFEPIPIHYCAPAGFAILKCKDKKFNGTGPCPSVSTVQCTHGIKPVVSTQLLLNGSLAEEEVMIRSENITNNAKNILVQFNTPVQINCTRPNNNTRKSIRIGPGQAFYATGDIIGDIRQAHCNVSKATWNETLGKVVKQLRKHFGNNTIIRFANSSGGDLEVTTHSFNCGGEFFYCNTSGLFNSTWISNTSVQGSNSTGSNDSITLPCRIKQIINMWQRIGQAMYAPPIQGVIRCVSNITGLILTRDGGSTNSTTETFRPGGGDMRDNWRSELYKYKVVKIEPLGVAPTRCKRR 469 T 5.6E-50 GP120 pdb T Viruses T 8cah 8 H j RNA recognition motif (unknown) XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 77 F F F 8cav 2 C,D D,G D1A2F9_THECD CuvA MSALLEPRDAGATNLDALAAIKWEAPAHQAGTCTVCHWGYTILCDDFSTRS 51 T 0.091 DUF5973 pdb F Bacteria T 8cd8 2 B,D B,D GLY-SER-SER GSS 3 T 220 Peptidase_S30 pdbhh F F 8cdp 1 A A Q57XL7_TRYB2 Guide_RNA_associated_protein_-_putative MKHHHHHHSAGLEVLFQGPDSMQNQGSVSQGALNMRDQQAAAAENVTPERVWALWNEGNLFSLSLAQLQGFLSRCGVRTDPAAKKAAVVRQVEEYLHSKDTTVKGGGQGAASPQQHQQHGQQGGYGRWNQASVMQPETLLDLSQAGFYEGAANMVPKAFQLLVSDTAPDVVVSRVNTTAFPGFPSNTECYTLGASEKDVAIRSRYSKVLQWCCLNMSNLQMDGELYVDFGKLLLKPSVMRKNRRIVSSYTLQQRLQVNHPYTWVPTLPESCLSKIQEQFLQPEGFAPIGKGVQLTYSGTIKRSKDQLHVDLDNKGKVLAVNSAWVNLQTAWCTHAKGPDVRLLLRSRPPIRRQDVELFASTPIIKLADDDVADVLPPEHGQLVYLSEDETRLFERVSDRGVTITVREVKRQPLIILRDEEEDPRVEYSLSAHIPANAAKATDVRAVGLTAFELAGRLAGLVAEDFVREYGCEAKL 475 T 0.012 HeH pdbpercent F Eukaryota T 8cdp 2 B B Q586X1_TRYB2 Mitochondrial guide RNA binding complex subunit 2 MQSFSAAAPAASGDFSHITRNTVWGLWNEGNLFSLSVPELAFFLQEHCRVANVDPRAKKSALVRQVEEILSAEQQASATVPQEDNPHAIVVTDYDRAEDALEEADEYGDWGAEPGFEDRRELDFMELSPGRMGERYDPLSPRAFQLLHSETATDVGIASIDPSKLPGQSKVKNALAAIHVAPNDANKMRFRMAFEWCLMNIWNMNMPGELNIGAGKALYYRSVAKQNRNVMPLWTVQKHLYAQHPYAWFAIASESNVAAMESLAAALNMSIQQERTTSYKVTIRRMAEFFDCELNGQLKCTMMNKPWDRFFVSHYIRSKMPDLRYVVRARHPIKKRIADAYLEADILRSTRDSVQSVLSPELGDVVYCCERVVRKWAKKTATGVTLQLVETKRTPLIITKAGDEGERLEYEWIVPLPQQAERIDIAALTDELWEYGNKLAAALEEGMEELMVHTMTAVSAY 461 T 0.54 ARMET_C pdbhh F Eukaryota T 8cdq 3 C C Q8IJM4_PLAF7 Myosin essential light chain ELC MASDMEEKFREAFILFSSCSDHIEMYKFFELMNSFGIILTNDEKAALPNDINMDYWLNFAKKHYNYEQPFKHINNVNEQNTNVQIKIDNFLGIMKALDTRLTESDLNILLQITNPENKSTLNLKTVSQKLTESI 134 T 0.024 Na_Ca_ex_C pdbpercent F Eukaryota T 8cei 1 A,B,C,D A,B,C,D SUCD_CLOK5 Succinate-semialdehyde dehydrogenase (acetylating) MSNEVSIKELIEKAKVAQKKLEAYSQEQVDVLVKALGKVVYDNAEMFAKEAVEETEMGVYEDKVAKCHLKSGAIWNHIKDKKTVGIIKEEPERALVYVAKPKGVVAATTPITNPVVTPMCNAMAAIKGRNTIIVAPHPKAKKVSAHTVELMNAELKKLGAPENIIQIVEAPSREAAKELMESADVVIATGGAGRVKAAYSSGRPAYGVGPGNSQVIVDKGYDYNKAAQDIITGRKYDNGIICSSEQSVIAPAEDYDKVIAAFVENGAFYVEDEETVEKFRSTLFKDGKINSKIIGKSVQIIADLAGVKVPEGTKVIVLKGKGAGEKDVLCKEKMCPVLVALKYDTFEEAVEIAMANYMYEGAGHTAGIHSDNDENIRYAGTVLPISRLVVNQPATTAGGSFNNGFNPTTTLGCGSWGRNSISENLTYEHLINVSRIGYFNKEAKVPSYEEIWG 453 T 0.001 Aldedh pdb F Bacteria T 8cej 1 A,B,C,D A,B,C,D SUCD_CLOK5 Succinate-semialdehyde dehydrogenase (acetylating) MSNEVSIKELIEKAKVAQKKLEAYSQEQVDVLVKALGKVVYDNAEMFAKEAVEETEMGVYEDKVAKCHLKSGAIWNHIKDKKTVGIIKEEPERALVYVAKPKGVVAATTPITNPVVTPMCNAMAAIKGRNTIIVAPHPKAKKVSAHTVELMNAELKKLGAPENIIQIVEAPSREAAKELMESADVVIATGGAGRVKAAYSSGRPAYGVGPGNSQVIVDKGYDYNKAAQDIITGRKYDNGIICSSEQSVIAPAEDYDKVIAAFVENGAFYVEDEETVEKFRSTLFKDGKINSKIIGKSVQIIADLAGVKVPEGTKVIVLKGKGAGEKDVLCKEKMCPVLVALKYDTFEEAVEIAMANYMYEGAGHTAGIHSDNDENIRYAGTVLPISRLVVNQPATTAGGSFNNGFNPTTTLGCGSWGRNSISENLTYEHLINVSRIGYFNKEAKVPSYEEIWG 453 T 0.001 Aldedh pdb F Bacteria T 8cek 1 A,B,C,D A,B,C,D SUCD_CLOK5 Succinate-semialdehyde dehydrogenase (acetylating) MSNEVSIKELIEKAKVAQKKLEAYSQEQVDVLVKALGKVVYDNAEMFAKEAVEETEMGVYEDKVAKCHLKSGAIWNHIKDKKTVGIIKEEPERALVYVAKPKGVVAATTPITNPVVTPMCNAMAAIKGRNTIIVAPHPKAKKVSAHTVELMNAELKKLGAPENIIQIVEAPSREAAKELMESADVVIATGGAGRVKAAYSSGRPAYGVGPGNSQVIVDKGYDYNKAAQDIITGRKYDNGIICSSEQSVIAPAEDYDKVIAAFVENGAFYVEDEETVEKFRSTLFKDGKINSKIIGKSVQIIADLAGVKVPEGTKVIVLKGKGAGEKDVLCKEKMCPVLVALKYDTFEEAVEIAMANYMYEGAGHTAGIHSDNDENIRYAGTVLPISRLVVNQPATTAGGSFNNGFNPTTTLGCGSWGRNSISENLTYEHLINVSRIGYFNKEAKVPSYEEIWG 453 T 0.001 Aldedh pdb F Bacteria T 8cen 27 AA U Transcription initiation factor IIA large subunit MSNAEASRVYEIIVESVVNEVREDFENAGIDEQTLQDLKNIWQKKLTETKVTTFSWDNQFNEGNINGVQNDLNFNLATPGVNSSEFNIKEENTGNEGLILPNINSNNNIPHSGETNINTNTVEATNNSGATLNTNTSGNTNADVTSQPKIEVKPEIELTINNANITTVENIDDESEKKDDEEKEEDVEKTRKEKEQIEQVKLQAKKEKRSALLDTDEVGSELDDSDDDYLISEGEEDGPDENLMLCLYDKVTRTKARWKCSLKDGVVTINRNDYTFQKAQVEAEWV 286 F F T 8cen 46 TA p Mediator of RNA polymerase II transcription subunit 1 MVEGDSYVETLDSMIELFKDYKPGSITLENITRLCQTLGLESFTEELSNELSRLSTASKIIVIDVDYNKKQDRIQDVKLVLASNFDNFDYFNQRDGEHEKSNILLNSLTKYPDLKAFHNNLKFLYLLDAYSHIESDSTSHNNGSSDKSLDSSNASFNNQGKLDLFKYFTELSHYIRQCFQDNCCDFKVRTNLNDKFGIYILTQGINGKEVPLAKIYLEENKSDSQYRFYEYIYSQETKSWINESAENFSNGISLVMEIVANAKESNYTDLIWFPEDFISPELIIDKVTCSSNSSSSPPIIDLFSNNNYNSRIQLMNDFTTKLINIKKFDISNDNLDLISEILKWVQWSRIVLQNVFKLVSTPSSNSNSSELEPDYQAPFSTSTKDKNSSTSNTEPIPRSNRHGSVVEASRRRRSSTNKSKRPSITEAMMLKEEGLQQFNLHEILSEPAIEEENGDSIKEHSTTMDGANDLGFTASVSNQENAGTDIVMEDHGVLQGTSQNYGTATADDADIEMKDVSSKPSKPESSVLQLIVSEDHIILDTISECNLYDDVKCWSKFIEKFQDIVS 566 T 7.5E-16 Med1 pdbhh F T 8ceo 27 AA U A0A6A5Q2T8_YEASX TOA1 isoform 1 MSNAEASRVYEIIVESVVNEVREDFENAGIDEQTLQDLKNIWQKKLTETKVTTFSWDNQFNEGNINGVQNDLNFNLATPGVNSSEFNIKEENTGNEGLILPNINSNNNIPHSGETNINTNTVEATNNSGATLNTNTSGNTNADVTSQPKIEVKPEIELTINNANITTVENIDDESEKKDDEEKEEDVEKTRKEKEQIEQVKLQAKKEKRSALLDTDEVGSELDDSDDDYLISEGEEDGPDENLMLCLYDKVTRTKARWKCSLKDGVVTINRNDYTFQKAQVEAEWV 286 F F Eukaryota T 8ceo 46 TA p A0A8H4BU46_YEASX Mediator of RNA polymerase II transcription subunit 1 MVEGDSYVETLDSMIELFKDYKPGSITLENITRLCQTLGLESFTEELSNELSRLSTASKIIVIDVDYNKKQDRIQDVKLVLASNFDNFDYFNQRDGEHEKSNILLNSLTKYPDLKAFHNNLKFLYLLDAYSHIESDSTSHNNGSSDKSLDSSNASFNNQGKLDLFKYFTELSHYIRQCFQDNCCDFKVRTNLNDKFGIYILTQGINGKEVPLAKIYLEENKSDSQYRFYEYIYSQETKSWINESAENFSNGISLVMEIVANAKESNYTDLIWFPEDFISPELIIDKVTCSSNSSSSPPIIDLFSNNNYNSRIQLMNDFTTKLINIKKFDISNDNLDLISEILKWVQWSRIVLQNVFKLVSTPSSNSNSSELEPDYQAPFSTSTKDKNSSTSNTEPIPRSNRHGSVVEASRRRRSSTNKSKRPSITEAMMLKEEGLQQFNLHEILSEPAIEEENGDSIKEHSTTMDGANDLGFTASVSNQENAGTDIVMEDHGVLQGTSQNYGTATADDADIEMKDVSSKPSKPESSVLQLIVSEDHIILDTISECNLYDDVKCWSKFIEKFQDIVS 566 T 7.5E-16 Med1 pdbhh F Eukaryota T 8cep 15 O,P,Q,R,S V,B,C,G,I Capreomycin IA XXXXXS 6 T 2200 zf-H2C2_2 pdbhh F F 8ceu 31 EA,FA,GA,HA,IA,JA,KA,LA V,C,D,E,F,G,H,I Capreomycin IA XXXXXS 6 T 2200 zf-H2C2_2 pdbhh F F 8cir 2 C,D C,D C3P EDGGSWKYPDAFELSG 16 T 0.28 DUF817 pdbhh F T 8cis 2 B B C3P EDGGSWKYPDAFELSG 16 T 0.28 DUF817 pdbhh F T 8cis 3 C C C3S1 EDGGSSWEYIWTLPSG 16 T 0.91 BNR pdbhh F T 8cis 4 D D Unknown peptide AAAAAG 6 T 300 DD_K pdbhh F F 8cj1 2 I,J,K,L I,J,K,L c3u_3 chimera inhibitor of histone chaperone ASF1 XARRIX 6 T 200 WhiA_N pdbhh F F 8cj2 2 E,F,G,H E,F,G,H c3u_5 chimera inhibitor of histone chaperone ASF1 XEKXAXXXRIX 11 T 69 YcbB pdbhh F T 8cj3 2 B B c3u_7 chimera inhibitor of histone chaperone ASF1 XEKXARLXXXAX 12 T 20 SEEK1 pdbhh F T 8cjd 1 A,B A,B A0A861B9Z9_9CYAN AetF GASGSGSGMLEVCIIGFGFSAIPLVRELARTQTEFQIISAESGSVWDRLSESGRLDFSLVSSFQTSFYSFDLVRDYEKDYYPTAKQFYEMHERWRSVYEEKIIRDFVTKIENFKDYSLISTRSGKTYEAKHVVLATGFDRLMNTFLSNFDNHVSNKTFVFDTMGDSANLLIAKLIPNNNKIILRTNGFTALDQEVQVLGKPFTLDQLESPNFRYVSSELYDRLMMSPVYPRTVNPAVSYNQFPLIRRDFSWVDSKSSPPNGLIAIKYWPIDQYYYHFNDDLENYISKGYLLNDIAMWLHTGKVILVPSDTPINFDKKTITYAGIERSFHQYVKGDAEQPRLPTILINGETPFEYLYRDTFMGVIPQRLNNIYFLGYTRPFTGGLANITEMQSLFIHKLITQPQFHQKIHQNLSKRITAYNQHYYGAAKPRKHDHTVPFGFYTEDIARLIGIHYQPNECRSVRDLLFYYAFPNNAFKYRLKGEYAVDGVDELIQKVNDKHDHYAQVFVQALSIRNMNSDEAAEWDHSARRFSFNDMRHKEGYRAFLDTYLKAYRQVENISVDDTVVDEEWNFMVKEACQVRDKVAPNIEEKTHYSKDEDVNKGIRLILSILDSDISSLPDSNGSRGSGNLKEGDRLCKFEAQSIEFIRRLLQPKNYELLFIRES 663 T 0.0036 Pyr_redox_2 pdb F Bacteria T 8cje 1 A,B,C,D A,B,C,D A0A861B9Z9_9CYAN AetF GASGSGSGMLEVCIIGFGFSAIPLVRELARTQTEFQIISAESGSVWDRLSESGRLDFSLVSSFQTSFYSFDLVRDYEKDYYPTAKQFYEMHERWRSVYEEKIIRDFVTKIENFKDYSLISTRSGKTYEAKHVVLATGFDRLMNTFLSNFDNHVSNKTFVFDTMGDSANLLIAKLIPNNNKIILRTNGFTALDQEVQVLGKPFTLDQLESPNFRYVSSELYDRLMMSPVYPRTVNPAVSYNQFPLIRRDFSWVDSKSSPPNGLIAIKYWPIDQYYYHFNDDLENYISKGYLLNDIAMWLHTGKVILVPSDTPINFDKKTITYAGIERSFHQYVKGDAEQPRLPTILINGETPFEYLYRDTFMGVIPQRLNNIYFLGYTRPFTGGLANITEMQSLFIHKLITQPQFHQKIHQNLSKRITAYNQHYYGAAKPRKHDHTVPFGFYTEDIARLIGIHYQPNECRSVRDLLFYYAFPNNAFKYRLKGEYAVDGVDELIQKVNDKHDHYAQVFVQALSIRNMNSDEAAEWDHSARRFSFNDMRHKEGYRAFLDTYLKAYRQVENISVDDTVVDEEWNFMVKEACQVRDKVAPNIEEKTHYSKDEDVNKGIRLILSILDSDISSLPDSNGSRGSGNLKEGDRLCKFEAQSIEFIRRLLQPKNYELLFIRES 663 T 0.0036 Pyr_redox_2 pdb F Bacteria T 8cjf 1 A,B A,B A0A861B9Z9_9CYAN AetF GASGSGSGMLEVCIIGFGFSAIPLVRELARTQTEFQIISAESGSVWDRLSESGRLDFSLVSSFQTSFYSFDLVRDYEKDYYPTAKQFYEMHERWRSVYEEKIIRDFVTKIENFKDYSLISTRSGKTYEAKHVVLATGFDRLMNTFLSNFDNHVSNKTFVFDTMGDSANLLIAKLIPNNNKIILRTNGFTALDQEVQVLGKPFTLDQLESPNFRYVSSELYDRLMMSPVYPRTVNPAVSYNQFPLIRRDFSWVDSKSSPPNGLIAIKYWPIDQYYYHFNDDLENYISKGYLLNDIAMWLHTGKVILVPSDTPINFDKKTITYAGIERSFHQYVKGDAEQPRLPTILINGETPFEYLYRDTFMGVIPQRLNNIYFLGYTRPFTGGLANITEMQSLFIHKLITQPQFHQKIHQNLSKRITAYNQHYYGAAKPRKHDHTVPFGFYTEDIARLIGIHYQPNECRSVRDLLFYYAFPNNAFKYRLKGEYAVDGVDELIQKVNDKHDHYAQVFVQALSIRNMNSDEAAEWDHSARRFSFNDMRHKEGYRAFLDTYLKAYRQVENISVDDTVVDEEWNFMVKEACQVRDKVAPNIEEKTHYSKDEDVNKGIRLILSILDSDISSLPDSNGSRGSGNLKEGDRLCKFEAQSIEFIRRLLQPKNYELLFIRES 663 T 0.0036 Pyr_redox_2 pdb F Bacteria T 8cjg 1 A,B A,B A0A861B9Z9_9CYAN AetF GASGSGSGMLEVCIIGFGFSAIPLVRELARTQTEFQIISAESGSVWDRLSESGRLDFSLVSSFQTSFYSFDLVRDYEKDYYPTAKQFYEMHERWRSVYEEKIIRDFVTKIENFKDYSLISTRSGKTYEAKHVVLATGFDRLMNTFLSNFDNHVSNKTFVFDTMGDSANLLIAKLIPNNNKIILRTNGFTALDQEVQVLGKPFTLDQLESPNFRYVSSELYDRLMMSPVYPRTVNPAVSYNQFPLIRRDFSWVDSKSSPPNGLIAIKYWPIDQYYYHFNDDLENYISKGYLLNDIAMWLHTGKVILVPSDTPINFDKKTITYAGIERSFHQYVKGDAEQPRLPTILINGETPFEYLYRDTFMGVIPQRLNNIYFLGYTRPFTGGLANITEMQSLFIHKLITQPQFHQKIHQNLSKRITAYNQHYYGAAKPRKHDHTVPFGFYTEDIARLIGIHYQPNECRSVRDLLFYYAFPNNAFKYRLKGEYAVDGVDELIQKVNDKHDHYAQVFVQALSIRNMNSDEAAEWDHSARRFSFNDMRHKEGYRAFLDTYLKAYRQVENISVDDTVVDEEWNFMVKEACQVRDKVAPNIEEKTHYSKDEDVNKGIRLILSILDSDISSLPDSNGSRGSGNLKEGDRLCKFEAQSIEFIRRLLQPKNYELLFIRES 663 T 0.0036 Pyr_redox_2 pdb F Bacteria T 8cjz 2 H h Spike Base Protein MAIGDIQTSVAFDRQVGRFPPRAEVVTPSNSEEFTSGVSVFSNDGGDISVVPLLPYGSAAIVVTVAAGGFVPFMVRKVNATGTTSTSIVAVW 92 T 0.05 CarboxypepD_reg pdb F T 8cjz 3 I,J,K,L,M,N,O B,C,c,F,D,A,E Capsid Decoration Protein MIMDKENTFSYKQAITGTAVSTNVIDLGVSRDIGKGVPVPIIIQVVEDFADATSLTATLQTSETENFSSATTLATSGAVPVADLTAGKQLAVQYMPLGTQRYLRVNYTVSGTATAGAVTAGVVMSHQQND 130 T 0.0098 SSURE pdbpercent F T 8ck1 1 A A Tail Nozzle MASNNYQPASSYIQPSFAGGELAPSLQGRVDLARYAISLKTCRNFVVQPYGGASNRPGFRFNTACKYKNYATRLIPFSFNTEQTYVIEIGHQYMRFHRDGAPVLDGGEPVEVATSWHRDDIFEIKYVQSADVLTLVHPDYKPRQLKRYSETDWVLDFFDNEFGPLQDQNVDESITIISNGVVDLVELTASEAIFSEAMVGTTIKLQQVSSGEVAAWQNRSAVEQGDLAYVDERTYKATSLSGGVDNTLTGDNTPAHTEGEQWDGPRTTIQGVTETLGVKWAYLHSGFGYVRITEHRDDTHIVGRVIGRLPEEIRTEGTYRWSFAAWDSDRGYPGTASYYQQRLVFANSRAEPQAFWMSETGIFNGFKVSFPIEADDAITFTLASRQVNEIRHLIPLGSLLALTSGAEWMISDNDQGLAPDTVSADVQGYRGASDVTPLLIGSSALYVQARGTVIRDLAYSFELDGYTGDDLTIFSNHLLKDYTIKDWAYAQEPDSVVWLVRSDGALLSMTYQREQQVVAWARHDTVDGEFESVAVIAEGSRDVPYAIVKRQVGGETVRYIEYLDSRRFSHVEDFFCVDSGLTYDGRSSTGALLTIGGGTNWTTDEDLTLTASASSFSPSDVGRRVRVYTGDKFADVDVDAYVSATSVAVSAVRIVPEELRGVQGDRWGFMAKTLTGLDHLEGKTVSILADGNVHAPEVVTGGQVTLDYSAAVVHVGLPIESDIETLPISSSGATVRDSHKAIVGVGIQLEKSRGVFAARSRRDFTSSDLIELKQRDAEDWGEATGLETGLVELGIPTSWDKDGSLFIRQSDPLPLTILSIIPRVVMGGKG 830 T 0.0073 Phage_stabilise pdbhh F T 8ck1 3 E,F E,F Connector Protein MPSKVDICNRALSNTGTDITIASLTEKSKEARLCQQWYDATLASLLRTYQWAFAQRRVTLALIGVGPAGWRHKYRYPTDAITIHDVFTADTYPDGASEFTDGRYRQIFQIASDGEGGRLVLANCEDAMCRYTSDIEDPNLMPPDFSTALEMMLAKNIAMPMTGNPGLMTVLAQQAASLVSDAIARDQNEGYRNPLPYASWTRANIGDSYPDDDHLPHRGGRR 222 F F T 8cka 1 A,B A,B HPI_DEIRA Hexagonally packed intermediate-layer surface protein MKKNIALMALTGILTLASCGQNGTGTTPTADACATANTCSVTVNISGVSSADFDVTMDGKTTSMTLSNGQKLPVAKTGTVTLTPKAKDGYTTPAAQSTTISSTNLTPSVNFAYTTVPSTGNGNGNGGTTPTQPFTLNITSPTNGAAATTGTPIRVVFTSSVALSSATCKIGNSAAVNAQVSSTGGYCDVTPTTAGGGLITVTGTANGQTVSSTVTVDVKAPVVDNRYGTVTPAGDQELTLTNEGIVKDADNGWRRLGQGVSTPSDPNGNVDIYVKGTVNFSVNAAAGSKVEVFLARTTGSDVPTNDDVQAGDVLRSVASTSGTETFSLDSRRLAEFDGVRKWIVVRINGTQVTYQPVIADNKGPQQPDPELNGVQNAYSNILNNYNNSGLTYVRGDVNVFTGNPSLQDREFGQAPLGSSFVQRRPSGFESIRYYLVPETAFGNKALQESDEMLRAKAIKSVATVVSAPVLEPGTVKATSFSRVIGSGATSTVTPKAQDNVTYRVYAISRDQLGNETASATYELVRFDNVGPTITGSVIRDTSDLPFASQEPERCLSDIATITLGGITDNAGGVGLNPGQGLTFTLGGRQIQAGQFDTNQLADGEYTIGFNSLTDALGNPVVSAPTNAKVYIDNTDPTVNFNRAVMQGTFASGERVSVESDASDGGCGVYETRLFWDTDNGVVDDATTTPAIGHPVQFARQRVTDGAKADSLNAGWNALQLPNGAGAVYLRALVVDRAGNATISTTPIVVNAKITNQARPLLGGFDAFKRNASAQFMSNSNAISGVNGTAVTPNTTANSALDNILSLDSVGTLTTNAYLPRGATETAITEKIRNVGAYGRFDATQWNRIRDYQLNTDPTLRSAYVNAGNLANQRGNNWRIRTPWVELGSSDTANTQQKFDFNSDLLNDFYFGRTFGNNDNVNLFSYDQFNGIVSGTAGAYSFYGETVQK 948 T 0.0072 Big_7 pdbpssm F Bacteria T 8cl1 2 B B CPSF6_HUMAN CLEAVAGE AND POLYADENYLATION SPECIFICITY FACTOR 68 KDA SUBUNIT,CPSF 68 KDA SUBUNIT,PRE-MRNA CLEAVAGE FACTOR IM 68 KDA SUBUNIT,PROTEIN HPBRII-4/7 PVLFPGQPFGQPPLG 15 T 2.2 MF_alpha pdbhh F Eukaryota T 8cl3 2 B B SC24C_HUMAN SEC24-RELATED PROTEIN C GPLLPGQSFGGPSVS 15 T 5.7 LT-IIB pdbhh F Eukaryota T 8cli 3 C C TF3C2_HUMAN TF3C-BETA,TRANSCRIPTION FACTOR IIIC 110 KDA SUBUNIT,TFIIIC 110 KDA SUBUNIT,TFIIIC110,TRANSCRIPTION FACTOR IIIC SUBUNIT BETA MHHHHHHENLYFQGMDTCGVGYVALGEAGPVGNMTVVDSPGQEVLNQLDVKTSSEMTSAEASVEMSLPTPLPGFEDSPDQRRLPPEQESLSRLEQPDLSSEMSKVSKPRASKPGRKRGGRTRKGPKRPQQPNPPSAPLVPGLLDQSNPLSTPMPKKRGRKSKAELLLLKLSKDLDRPESQSPKRPPEDFETPSGERPRRRAAQVALLYLQELAEELSTALPAPVSCPEGPKVSSPTKPKKIRQPAACPGGEEVDGAPRDEDFFLQVEAEDVEESEGPSESSSEPEPVVPRSTPRGSTSGKQKPHCRGMAPNGLPNHIMAPVWKCLHLTKDFREQKHSYWEFAEWIPLAWKWHLLSELEAAPYLPQEEKSPLFSVQREGLPEDGTLYRINRFSSITAHPERWDVSFFTGGPLWALDWCPVPEGAGASQYVALFSSPDMNETHPLSQLHSGPGLLQLWGLGTLQQESCPGNRAHFVYGIACDNGCIWDLKFCPSGAWELPGTPRKAPLLPRLGLLALACSDGKVLLFSLPHPEALLAQQPPDAVKPAIYKVQCVATLQVGSMQATDPSECGQCLSLAWMPTRPHQHLAAGYYNGMVVFWNLPTNSPLQRIRLSDGSLKLYPFQCFLAHDQAVRTLQWCKANSHFLVSAGSDRKIKFWDLRRPYEPINSIKRFLSTELAWLLPYNGVTVAQDNCYASYGLCGIHYIDAGYLGFKAYFTAPRKGTVWSLSGSDWLGTIAAGDISGELIAAILPDMALNPINVKRPVERRFPIYKADLIPYQDSPEGPDHSSASSGVPNPPKARTYTETVNHHYLLFQDTDLGSFHDLLRREPMLRMQEGEGHSQLCLDRLQLEAIHKVRFSPNLDSYGWLVSGGQSGLVRIHFVRGLASPLGHRMQLESRAHFNAMFQPSSPTRRPGFSPTSHRLLPTP 925 T 0.0008 WD40 unppercent F Eukaryota T 8clj 3 C,H C,H TF3C2_HUMAN TF3C-BETA,TRANSCRIPTION FACTOR IIIC 110 KDA SUBUNIT,TFIIIC 110 KDA SUBUNIT,TFIIIC110,TRANSCRIPTION FACTOR IIIC SUBUNIT BETA MHHHHHHENLYFQGMDTCGVGYVALGEAGPVGNMTVVDSPGQEVLNQLDVKTSSEMTSAEASVEMSLPTPLPGFEDSPDQRRLPPEQESLSRLEQPDLSSEMSKVSKPRASKPGRKRGGRTRKGPKRPQQPNPPSAPLVPGLLDQSNPLSTPMPKKRGRKSKAELLLLKLSKDLDRPESQSPKRPPEDFETPSGERPRRRAAQVALLYLQELAEELSTALPAPVSCPEGPKVSSPTKPKKIRQPAACPGGEEVDGAPRDEDFFLQVEAEDVEESEGPSESSSEPEPVVPRSTPRGSTSGKQKPHCRGMAPNGLPNHIMAPVWKCLHLTKDFREQKHSYWEFAEWIPLAWKWHLLSELEAAPYLPQEEKSPLFSVQREGLPEDGTLYRINRFSSITAHPERWDVSFFTGGPLWALDWCPVPEGAGASQYVALFSSPDMNETHPLSQLHSGPGLLQLWGLGTLQQESCPGNRAHFVYGIACDNGCIWDLKFCPSGAWELPGTPRKAPLLPRLGLLALACSDGKVLLFSLPHPEALLAQQPPDAVKPAIYKVQCVATLQVGSMQATDPSECGQCLSLAWMPTRPHQHLAAGYYNGMVVFWNLPTNSPLQRIRLSDGSLKLYPFQCFLAHDQAVRTLQWCKANSHFLVSAGSDRKIKFWDLRRPYEPINSIKRFLSTELAWLLPYNGVTVAQDNCYASYGLCGIHYIDAGYLGFKAYFTAPRKGTVWSLSGSDWLGTIAAGDISGELIAAILPDMALNPINVKRPVERRFPIYKADLIPYQDSPEGPDHSSASSGVPNPPKARTYTETVNHHYLLFQDTDLGSFHDLLRREPMLRMQEGEGHSQLCLDRLQLEAIHKVRFSPNLDSYGWLVSGGQSGLVRIHFVRGLASPLGHRMQLESRAHFNAMFQPSSPTRRPGFSPTSHRLLPTP 925 T 0.0008 WD40 unppercent F Eukaryota T 8cll 3 C,F C,H TF3C2_HUMAN TF3C-BETA,TRANSCRIPTION FACTOR IIIC 110 KDA SUBUNIT,TFIIIC 110 KDA SUBUNIT,TFIIIC110,TRANSCRIPTION FACTOR IIIC SUBUNIT BETA MHHHHHHENLYFQGMDTCGVGYVALGEAGPVGNMTVVDSPGQEVLNQLDVKTSSEMTSAEASVEMSLPTPLPGFEDSPDQRRLPPEQESLSRLEQPDLSSEMSKVSKPRASKPGRKRGGRTRKGPKRPQQPNPPSAPLVPGLLDQSNPLSTPMPKKRGRKSKAELLLLKLSKDLDRPESQSPKRPPEDFETPSGERPRRRAAQVALLYLQELAEELSTALPAPVSCPEGPKVSSPTKPKKIRQPAACPGGEEVDGAPRDEDFFLQVEAEDVEESEGPSESSSEPEPVVPRSTPRGSTSGKQKPHCRGMAPNGLPNHIMAPVWKCLHLTKDFREQKHSYWEFAEWIPLAWKWHLLSELEAAPYLPQEEKSPLFSVQREGLPEDGTLYRINRFSSITAHPERWDVSFFTGGPLWALDWCPVPEGAGASQYVALFSSPDMNETHPLSQLHSGPGLLQLWGLGTLQQESCPGNRAHFVYGIACDNGCIWDLKFCPSGAWELPGTPRKAPLLPRLGLLALACSDGKVLLFSLPHPEALLAQQPPDAVKPAIYKVQCVATLQVGSMQATDPSECGQCLSLAWMPTRPHQHLAAGYYNGMVVFWNLPTNSPLQRIRLSDGSLKLYPFQCFLAHDQAVRTLQWCKANSHFLVSAGSDRKIKFWDLRRPYEPINSIKRFLSTELAWLLPYNGVTVAQDNCYASYGLCGIHYIDAGYLGFKAYFTAPRKGTVWSLSGSDWLGTIAAGDISGELIAAILPDMALNPINVKRPVERRFPIYKADLIPYQDSPEGPDHSSASSGVPNPPKARTYTETVNHHYLLFQDTDLGSFHDLLRREPMLRMQEGEGHSQLCLDRLQLEAIHKVRFSPNLDSYGWLVSGGQSGLVRIHFVRGLASPLGHRMQLESRAHFNAMFQPSSPTRRPGFSPTSHRLLPTP 925 T 0.0008 WD40 unppercent F Eukaryota T 8cmn 2 B,C B,C N-[2-(2-methyl-1,3-dioxolan-2-yl)phenyl]-2-{[5-(trifluoromethyl)pyridin-2-yl]amino}pyridine-4-carboxamide XXXXXXXXXX 10 F F F 8cob 2 B,D,F B,D,F DNA excision repair protein ERCC-6-like 2 SSPGQLTLLQCGFSK 15 T 8.7 TAL_effector pdbhh F T 8coy 2 C,D C,D peptido-mimetic inhibitor XITAXDE 7 T 250 Big_3_4 pdbhh F T 8cpn 1 A A PolB16 intein MKTEFSGDTDAVHGKTHVFIRSIKNGSHMQEAKIDIKSLYDSLAKKYDVQHKNSYEVIYPKGYEIKVLGNKYVKLVAMSRHKTQKHLVKIVVKSEKTIDSLDPIRQKSLLKKQDEVVVTTDHICMVYNDDHFFENVNAKNLKVGNYVSVYDEASDKEVIGEIASIEDLGMTDDYVYDCEVDDDSHAFYASNILVHASQFCNGTKLGG 207 T 0.0063 CathepsinC_exc pdb F T 8cpo 1 A A PolB16 Intein Cys-less MKTEFSGDTDAVHGKTHVFIRSIKNGSHMQEAKIDIKSLYDSLAKKYDVQHKNSYEVIYPKGYEIKVLGNKYVKLVAMSRHKTQKHLVKIVVKSEKTIDSLDPIRQKSLLKKQDEVVVTTDHIAMVYNDDHFFENVNAKNLKVGNYVSVYDEASDKEVIGEIASIEDLGMTDDYVYDAEVDDDSHAFYASNILVHASQFCNGTKLGG 207 T 0.0059 CathepsinC_exc pdb F T 8cqy 2 B B ADA17_HUMAN ADAM 17,SNAKE VENOM-LIKE PROTEASE,TNF-ALPHA CONVERTASE,TNF-ALPHA-CONVERTING ENZYME RQNRVDSKETEC 12 T 10 Spp-24 pdbhh F Eukaryota T 8crx 51 YA V 50S ribosomal protein bL37 MGKTGRKRRARRKKGANHGKRPNA 24 T 21 Protamine_3 pdbhh F T 8ct8 1 A,B A,B UEX_DROME PUTATIVE METAL TRANSPORTER UEX GPLGSVNIISGALELRKKTVADVMTHINDAFMLSLDALLDFETVSEIMNSGYSRIPVYDGDRKNIVTLLYIKDLAFVDTDDNTPLKTLCEFYQNPVHFVFEDYTLDIMFNQFKEGTIGHIAFVHRVNNEGDGDPFYETVGLVTLEDVIEELIQAEIVDELE 161 T 0.0059 CBS pdbpercent F Eukaryota T 8cto 1 A A Cyclic peptide D8.31 DAL-DPR-MLU-DVA-DAL-DPR-MLU-DVA XXXXXXXX 8 F F F 8cuf 1 A,B A,B Synthetic epi-Novo29 (2R,3S) FXXSXALL 8 T 19 Penaeidin pdbhh F F 8cug 1 A,B A,B Synthetic epi-Novo29 (2R,3S) FXXSXALL 8 T 19 Penaeidin pdbhh F F 8cuk 1 A,B,C A,B,C PEP5_YEAST CARBOXYPEPTIDASE Y-DEFICIENT PROTEIN 5,HISTONE E3 LIGASE PEP5,RING-TYPE E3 UBIQUITIN TRANSFERASE PEP5,VACUOLAR BIOGENESIS PROTEIN END1,VACUOLAR MORPHOGENESIS PROTEIN 1,VACUOLAR PROTEIN SORTING-ASSOCIATED PROTEIN 11,VACUOLAR PROTEIN-TARGETING PROTEIN 11 MKHHHHHHHGAAGTSLYKKAGENLYFQGSMSLSSWRQFQLFENIPIRDPNFGGDSLLYSDPTLCAATIVDPQTLIIAVNSNIIKVVKLNQSQVIHEFQSFPHDFQITFLKVINGEFLVALAESIGKPSLIRVYKLEKLPNREQLYHSQVELKNGNNTYPISVVSISNDLSCIVVGFINGKIILIRGDISRDRGSQQRIIYEDPSKEPITALFLNNDATACFAATTSRILLFNTTGRNRGRPSLVLNSKNGLDLNCGSFNPATNEFICCLSNFIEFFSSSGKKHQFAFDLSLRKRIFCVDKDHILIVTEETGVPTTSISVNELSPTIINRIFIIDAKNKIISLNFVVSSAIIDIFSTSQSGKNITYLLTSEGVMHRITPK 379 T 3.2E-05 WD40_like pdbhh F Eukaryota T 8cun 1 A A Cyclic peptide D8.21 DVA-MLE-DPR-LEU-DVA-MLE-DPR-LEU XXXLXXXL 8 F F F 8cv4 2 C,D,E C,D,E Peptide 4.2F WXYWRXYVLKIC 12 T 1.7 Plk4_PB1 pdbhh F T 8cv5 2 B B Peptide 4.2E WYDVFLTRXYGXXKVAC 17 T 2.1 DUF756 pdbhh F T 8cv6 2 B B Peptide 4.2E WYDVFLTRXYGXXKVAC 17 T 2.1 DUF756 pdbhh F T 8cv7 2 C,D C,D Peptide 2.2E WYGYWXVPXRKC 12 T 0.61 DUF3793 pdbhh F T 8cvj 56 DB,ID 1z,2z P-site Peptidyl-tRNA fMSEAC-NH-tRNAmet Peptide-part MSEAC 5 T 67 DUF1163 pdbhh F F 8cvk 56 DB,ID 1z,2z P-site Peptidyl-tRNA fMRC-NH-tRNAmet Peptide-part MRC 3 T 24 SICA_beta pdbhh F F 8cvl 56 DB,ID 1z,2z P-site Peptidyl-tRNA fMTHSMRC-NH-tRNAmet Peptide-part MTHSMRC 7 T 0.00014 Ery_res_leader2 pdbhh F T 8cvm 30 DA V 50S ribosomal protein bL37 MGKTGRKRRARRKKGANHGKRPNA 24 T 21 Protamine_3 pdbhh F T 8cwa 1 A A Cyclic peptide D8.21 DVA-MLE-DPR-LEU-DVA-MLE-DPR-LEU XXXLXXXL 8 F F F 8cww 1 A P Meiosis-specific protein HOP1 SNASNNPVTGICSCECGLEVPKAATVLKTCKSCRKTLHGICYGNFLHSSIEKCFTCIFGPSLDTKWSKFQDLMMIRKVFRFLVRKKKGFPASITELIDSFINVEDQNNEVKERVAFALFVFFLDETLCLDNGGKPSQTIRYVTSSVLVDVKGIVIPNTRKQLNVNHEYKWHFTTSSPKAESFYQEVLPNSRKQVESWLQDITNLRKVYSEALS 213 T 0.091 DUF928 pdb F T 8cwx 1 A A A0A7Y7E8Q0_STRMO Lanthipeptide Natural Product mSmoAc FAADAWAAQDMAXGNPLXXXFCCXVQCG 28 T 3.7 Baculo_LEF5_C pdbhh F Bacteria T 8cxp 1 A A A0A649YC68_9PICO Capsid protein VP1 STDNAETGVIEAGNTDTDFSGELAAPGSNHTNVKFLFDRSRLLNVIKVLEKDAVFPRPFPTQEGAQQDDGYFCLLTPRPTVASRPATRFGLYANPSGSGVLANTSLDFNFYSLACFTYFRSDLEVTVVSLEPDLEFAVGWFPSGSEYQASSFVYDQLHVPFHFTGRTPRAFASKGGKVSFVLPWNSVSSVLPVRWGGASKLSSATRGLPAHADWGTIYAFVPRPNEKKSTAVKHVAVYIRYKNARAWCPSMLPFRSYKQKMLM 263 F T Viruses T 8cyk 1 A,B B,A HALC1_878 MSGMKKLYEYTVTTLDEFLEKLKEFILNTSKDKIYKLTITNPKLIKDIGKAIAKAAEIADVDPKEIEEMIKAVEENELTKLVITIEQTDDKYVIKVELENEDGLVHSFEIYFKNKEEMEKFLELLEKLISKLSGS 135 T 0.0082 SUKH_5 pdb F T 8cz9 2 B C JAK2 pY813 phosphopeptide PDXELLTE 8 T 1.6 LIN9_C pdbhh F T 8czf 2 B B DF2 peptide XSYIDKIADLIRKVAEEINSKLEX 24 T 1.8 ZapA pdbhh F T 8czg 2 E,F,G,H E,F,G,H dF3 peptide XSLLEKLAEELRQLADELNKKFEKX 25 T 0.11 Bclx_interact pdb F T 8czh 2 B B DM2 peptide XAPYLEQVARTLRKIGEEINEALRX 25 T 0.023 Bclx_interact pdb F T 8czk 2 C,D C,D Deb-Erk peptide GFLXEY 6 T 1.6 DUF5918 pdbhh F T 8d02 1 A A Q81TN4_BACAN EXOSPORIUM PROTEIN MFSSDCEFTKIDCEAKPASTLPAFGFAFNASAPQFASLFTPLLLPSVSPNPNITVPVINDTVSVGDGIRILRAGIYQISYTLTISLDNSPVAPEAGRFFLSLGTPANIIPGSGTAVRSNVIGTGEVDVSSGVILINLNPGDLIRIVPVELIGTVDIRAAALTVAQIS 167 T 0.0004 BclA_C pdbpssm F Bacteria T 8d03 1 A A HALC2_068 MSGMIKVPEDLERIGRELRARGLDTKRLLEEGPKLYPELSIPDLMAIALYDHLNLDPEFLYRLLQQSRGS 70 T 3.9 Mut7-C pdbhh F T 8d04 1 A,B,C,D,E,F B,A,E,F,C,D HALC2_062 MSGMARVEYSYEKLNDTHYKLKLKVTYEYRKSPEARRLAEDLVQAFVDALSSLPFITVEYEVEEVEVEGS 70 T 10 DUF1307 pdbhh F T 8d05 1 A A HALC2_065 MSGSEEEKPIVIDLNKTIERDGRKVKLVRATITVDPETNTITIDIEYEGGPITKEDLLEAFKLAASKLGS 70 T 0.0011 RNase_PH_C pdb F T 8d06 1 A,B,C,D,E,F,G,H,I,J,K,L A,B,C,D,E,F,H,G,I,L,K,J HALC3_104 MSGKRIDEIESKLKHLEEFTTHLIKLMETMLELLKLVSDGKSDSEEYKELLEKAEEYLKQATEAAKKIGS 70 T 0.012 ISG65-75 pdb F T 8d07 1 A,B,C,D,E,F D,A,B,C,E,F HALC3_109 MSGREEIEEAVKEAELKVLAIVLVALRSVSHYEPLSRLYESFLDALKKALSEEELKEVEKEAERIEKKGS 70 T 0.02 Ku_PK_bind pdb F T 8d08 1 A,B,C,D A,B,C,D HALC4_135 MSGMEKFKEQLLEEVKKIVLETMTKVMEHLEKWFVTLAEIIITKSEEKLEELKETMEKSIEELRKEAEGS 70 T 0.0057 DUF3884 pdb F T 8d09 1 A,B A,B HALC4_136 MSGMSPYKKAIEITKRLLELLLSNPELAKKNLGGIATLISLLALISALDGTLDEKDIEPYIKKLEESLGS 70 T 0.14 SRP54_N pdb F T 8d0y 5 E G BG505SOSIPv8 gp120 ENLWVTVYYGVPVWKDAETTLFCASDAKAYETKKHNVWATHCCVPTDPNPQEIHLENVTEEFNMWKNNMVEQMHEDIISLWDQSLKPCVKLTPLCVTLQCTNVTNNITDDMRGELKNCSFNMTTELRDKKQKVYSLFYRLDVVQINSNKEYRLINCNTSAITQACPKVSFEPIPIHYCAPAGFAILKCKDKKFNGTGPCPNVSTVQCTHGIKPVVSTQLLLNGSLAEEEVIIRSENITNNAKNILVQLNTPVQINCTRPNNNTVKSIRIGPGQWFYYTGDIIGDIRQAHCNVSKATWNETLGKVVKQLRKHFGNNTIIRFANSSGGDLEVTTHSFNCGGEFFYCNTSGLFNSTWISGSNDSITLPCRIKQIINMWQRIGQAMYAPPIQGVIRCVSNITGLILTRDGGSTNSTTETFRPGGGDMRDNWRSELYKYKVVKIEPLGVAPTRCKRRVVG 455 T 3.3E-54 GP120 pdbpercent F T 8d35 2 C C TRM1_HUMAN TRNA 2,2-DIMETHYLGUANOSINE-26 METHYLTRANSFERASE,TRNA(GUANINE-26,N(2)-N(2)) METHYLTRANSFERASE,TRNA(M(2,2)G26)DIMETHYLTRANSFERASE EPRLQANFTIR 11 T 18 Chisel pdbhh F Eukaryota T 8d3r 2 B,C B,C DPOG2_HUMAN DNA POLYMERASE GAMMA ACCESSORY 55 KDA SUBUNIT,P55,MITOCHONDRIAL DNA POLYMERASE ACCESSORY SUBUNIT,MTPOLB,POLG-BETA MRSRVAVRACHKVCRCLLSGFGGRVDAGQPELLTERSSPKGGHVKSHAELEGNGEHPEAPGSGEGSEALLEICQRRHFLSGSKQQLSRDSLLSGCHPGFGPLGVELRKNLAAEWWTSVVVFREQVFPVDALHHKPGPLLPGDSAFRLVSAETLREILQDKELSKEQLVAFLENVLKTSGKLRENLLHGALEHYVNCLDLVNKRLPYGLAQIGVCFHPVFDTKQIRNGVKSIGEKTEASLVWFTPPRTSNQWLDFWLRHRLQWWRKFAMSPSNFSSSDCQDEEGRKGNKLYYNFPWGKELIETLWNLGDHELLHMYPGNVSKLHGRDGRKNVVPCVLSVNGDLDRGMLAYLYDSFQLTENSFTRKKNLHRKVLKLHPCLAPIKVALDVGRGPTLELRQVCQGLFNELLENGISVWPGYLETMQSSLEQLYSKYDEMSILFTVLVTETTLENGLIHLRSRDTTMKEMMHISKLKDFLIKYISSAKNV 485 F F Eukaryota T 8d42 2 B,C B,C DPOG2_HUMAN DNA POLYMERASE GAMMA ACCESSORY 55 KDA SUBUNIT,P55,MITOCHONDRIAL DNA POLYMERASE ACCESSORY SUBUNIT,MTPOLB,POLG-BETA MRSRVAVRACHKVCRCLLSGFGGRVDAGQPELLTERSSPKGGHVKSHAELEGNGEHPEAPGSGEGSEALLEICQRRHFLSGSKQQLSRDSLLSGCHPGFGPLGVELRKNLAAEWWTSVVVFREQVFPVDALHHKPGPLLPGDSAFRLVSAETLREILQDKELSKEQLVAFLENVLKTSGKLRENLLHGALEHYVNCLDLVNKRLPYGLAQIGVCFHPVFDTKQIRNGVKSIGEKTEASLVWFTPPRTSNQWLDFWLRHRLQWWRKFAMSPSNFSSSDCQDEEGRKGNKLYYNFPWGKELIETLWNLGDHELLHMYPGNVSKLHGRDGRKNVVPCVLSVNGDLDRGMLAYLYDSFQLTENSFTRKKNLHRKVLKLHPCLAPIKVALDVGRGPTLELRQVCQGLFNELLENGISVWPGYLETMQSSLEQLYSKYDEMSILFTVLVTETTLENGLIHLRSRDTTMKEMMHISKLKDFLIKYISSAKNV 485 F F Eukaryota T 8d5e 3 C P TF2B_MOUSE GENERAL TRANSCRIPTION FACTOR TFIIB,RNA POLYMERASE II ALPHA INITIATION FACTOR TGAASFDEF 9 T 4.8 DUF2852 pdbhh F Eukaryota T 8d5f 3 C P TF2B_MOUSE GENERAL TRANSCRIPTION FACTOR TFIIB,RNA POLYMERASE II ALPHA INITIATION FACTOR TGAARFDEF 9 T 13 DUF4295 pdbhh F Eukaryota T 8d5j 3 C C PRP19_MOUSE NUCLEAR MATRIX PROTEIN 200,PRP19/PSO4 HOMOLOG,RING-TYPE E3 UBIQUITIN TRANSFERASE PRP19,SENESCENCE EVASION FACTOR KYLQVASHV 9 T 2.1 DUF2894 unppercent F Eukaryota T 8d5k 3 C C PRP19_MOUSE NUCLEAR MATRIX PROTEIN 200,PRP19/PSO4 HOMOLOG,RING-TYPE E3 UBIQUITIN TRANSFERASE PRP19,SENESCENCE EVASION FACTOR KYRQVASHV 9 T 2.1 DUF2894 unppercent F Eukaryota T 8d5n 2 B,D E,B Dense granule protein 6, HF10 peptide HPGSVNEFDFGCGGSG 16 T 1 Polysacc_synt_4 pdbhh F T 8d5q 4 D E Dense granule protein 6, HF10 peptide HPGSVNEFDF 10 T 1.4 CITED pdbhh F T 8d6v 3 BA,CA,DA,EA,FA,GA,HA d,e,f,g,h,i,j ARC_MYCTU AAA ATPASE FORMING RING-SHAPED COMPLEXES,ARC,MYCOBACTERIAL PROTEASOME ATPASE GQYL 4 T 62 SHIPPO-rpt pdbhh F Bacteria F 8d6w 3 BA,CA,DA,EA,FA,GA,HA d,e,f,g,h,i,j ARC_MYCTU AAA ATPASE FORMING RING-SHAPED COMPLEXES,ARC,MYCOBACTERIAL PROTEASOME ATPASE GQYL 4 T 62 SHIPPO-rpt pdbhh F Bacteria F 8d6x 4 HA,IA,JA,KA,LA,MA,NA d,e,f,g,h,i,j ARC_MYCTU AAA ATPASE FORMING RING-SHAPED COMPLEXES,ARC,MYCOBACTERIAL PROTEASOME ATPASE GQYL 4 T 62 SHIPPO-rpt pdbhh F Bacteria F 8d6y 4 HA,IA,JA,KA,LA,MA,NA d,e,f,g,h,i,j ARC_MYCTU AAA ATPASE FORMING RING-SHAPED COMPLEXES,ARC,MYCOBACTERIAL PROTEASOME ATPASE GQYL 4 T 62 SHIPPO-rpt pdbhh F Bacteria F 8d7m 2 C,D C,D Period circadian protein homolog 2 peptide GKAESVASLT 10 T 80 Tmemb_14 pdbhh F T 8d7n 2 C,D C,D Period circadian protein homolog 2 peptide GKAESVASLTSQ 12 T 13 BRX_N pdbhh F T 8d7o 2 C,D C,D Period circadian protein homolog 2 peptide GKAESVASLTSQCSYA 16 T 18 Rota_NSP4 pdbhh F T 8d7p 2 C,D D,C Period circadian protein peptide VAERDSVMLGEIAPHHDY 18 T 12 DUF2718 pdbhh F T 8d85 2 B D IL27A_HUMAN IL-27 SUBUNIT ALPHA,IL-27-A,IL27-A,INTERLEUKIN-30,P28 FPRPPGRPQLSLQELRREFTVSLHLARKLLSEVRGQAHRFAESHLPGVNLYLLPLGEQLPDVSLTFQAWRRLSDPERLCFISTTLQPFHALLGGLGTQGRWTNMERMQLWAMRLDLRDLQRHLRFQVLAAGFNLPEEEEEEEEEEEEERKGLLPGALGSALQGPAQVSWPQLLSTYRLLHSLELVLSRAVRELLLLSKAGHSVWPLGFPTLSPQPEQKLISEEDLGGEQKLISEEDLHHHHHH 243 T 5.7 Myc-LZ pdb F Eukaryota T 8d8i 2 B B NCOR1_HUMAN N-COR,N-COR1 THRLITLADHIAQIITQDFA 20 T 25 Es2 pdbhh F Eukaryota T 8d8j 1 A 0 RT22_YEAST Probable S-adenosyl-L-methionine-dependent RNA methyltransferase RSM22, mitochondrial MMKRCFSILPQNVRFSSKFTSLNLPKLDLADFIDSNKRGINVLPSYRDETASTTQATNSKELRLLSKTLQGQSYRDQLELNPDVSKAINNNIMAVHIPNNLRRVATNYYKEIQEPNSLHRPCRTKMEVDAHIASIFLQNYGSIFQSLKELQKRVGPDNFKPQRILDVGYGPATGIVALNDILGPNYRPDLKDAVILGNAEMQERAKIILSRQLNEVVDTVEENVSTEKEQETDRRNKNFQEDEHIGEVMTKKINIMTNLRSSIPASKEYDLIILTHQLLHDGNQFPIQVDENIEHYLNILAPGGHIVIIERGNPMGFEIIARARQITLRPENFPDEFGKIPRPWSRGVTVRGKKDAELGNISSNYFLKVIAPCPHQRKCPLQVGNPNFYTHKEGKDLKFCNFQKSIKRPKFSIELKKGKLLATSWDGSQGNASRLKGTGRRNGRDYEILNYSYLIFERSHKDENTLKEIKKLRNENVNGKYDIGSLGDDTQNSWPRIINDPVKRKGHVMMDLCAPSGELEKWTVSRSFSKQIYHDARKSKKGDLWASAAKTQIKGLGDLNVKKFHKLEKERIKQLKKEERQKARKAMESYNELEDSLQFDDHQFSNFEVMKKLSTFHGNDFLQHVNRK 628 T 2.3E-26 Rsm22 pdbpercent F Eukaryota T 8d8j 2 B 5 RT13_YEAST MITOCHONDRIAL SMALL RIBOSOMAL SUBUNIT PROTEIN MS44,YMS-A MGTITVVINEGPILLIRALHRATTNKKMFRSTVWRRFASTGEIAKAKLDEFLIYHKTDAKLKPFIYRPKNAQILLTKDIRDPKTREPLQPRPPVKPLSKQTLNDFIYSVEPNSTELLDWFKEWTGTSIRKRAIWTYISPIHVQKMLTASFFKIGKYAHMVGLLYGIEHKFLKAQNPSVFDIEHFFNTNIMCALHRNRLKDYKDAEIAQRKLQVAWKKVLNRKNNTGLANILVATLGRQIGFTPELTGLQPVDISLPDIPNSSSGAELKDLLSKYEGIYLIARTLLDIDQHNAQYLELQEFIRQYQNALSESSDPYDTHLKALGLLETPPPQESTEKEEK 339 T 32 NMU unphh F Eukaryota T 8d8j 8 H V RTPT_YEAST MITOCHONDRIAL SMALL RIBOSOMAL SUBUNIT PROTEIN MS26 MGKGAAKYGFKSGVFPTTRSILKSPTTKQTDIINKVKSPKPKGVLGIGYAKGVKHPKGSHRLSPKVNFIDVDNLIAKTVAEPQSIKSSNGSAQKVRLQKAELRRKFLIEAFRKEEARLLHKHEYLQKRTKELEKAKELELEKLNKEKSSDLTIMTLDKMMSQPLLRNRSPEESELLKLKRNYNRSLLNFQAHKKKLNELLNLYHVANEFIVTESQLLKKIDKVFNDETEEFTDAYDVTSNFTQFGNRKLLLSGNTTLQTQINNAIMGSLSNEKFFDISLVDSYLNKDLKNISNKIDSKLNPTSNGAGNNGNNNNTTNL 318 T 0.003 MRP-S26 unppssm F Eukaryota T 8d8k 1 A 0 RT22_YEAST Probable S-adenosyl-L-methionine-dependent RNA methyltransferase RSM22, mitochondrial MMKRCFSILPQNVRFSSKFTSLNLPKLDLADFIDSNKRGINVLPSYRDETASTTQATNSKELRLLSKTLQGQSYRDQLELNPDVSKAINNNIMAVHIPNNLRRVATNYYKEIQEPNSLHRPCRTKMEVDAHIASIFLQNYGSIFQSLKELQKRVGPDNFKPQRILDVGYGPATGIVALNDILGPNYRPDLKDAVILGNAEMQERAKIILSRQLNEVVDTVEENVSTEKEQETDRRNKNFQEDEHIGEVMTKKINIMTNLRSSIPASKEYDLIILTHQLLHDGNQFPIQVDENIEHYLNILAPGGHIVIIERGNPMGFEIIARARQITLRPENFPDEFGKIPRPWSRGVTVRGKKDAELGNISSNYFLKVIAPCPHQRKCPLQVGNPNFYTHKEGKDLKFCNFQKSIKRPKFSIELKKGKLLATSWDGSQGNASRLKGTGRRNGRDYEILNYSYLIFERSHKDENTLKEIKKLRNENVNGKYDIGSLGDDTQNSWPRIINDPVKRKGHVMMDLCAPSGELEKWTVSRSFSKQIYHDARKSKKGDLWASAAKTQIKGLGDLNVKKFHKLEKERIKQLKKEERQKARKAMESYNELEDSLQFDDHQFSNFEVMKKLSTFHGNDFLQHVNRK 628 T 2.3E-26 Rsm22 pdbpercent F Eukaryota T 8d8k 2 B 5 RT13_YEAST MITOCHONDRIAL SMALL RIBOSOMAL SUBUNIT PROTEIN MS44,YMS-A MGTITVVINEGPILLIRALHRATTNKKMFRSTVWRRFASTGEIAKAKLDEFLIYHKTDAKLKPFIYRPKNAQILLTKDIRDPKTREPLQPRPPVKPLSKQTLNDFIYSVEPNSTELLDWFKEWTGTSIRKRAIWTYISPIHVQKMLTASFFKIGKYAHMVGLLYGIEHKFLKAQNPSVFDIEHFFNTNIMCALHRNRLKDYKDAEIAQRKLQVAWKKVLNRKNNTGLANILVATLGRQIGFTPELTGLQPVDISLPDIPNSSSGAELKDLLSKYEGIYLIARTLLDIDQHNAQYLELQEFIRQYQNALSESSDPYDTHLKALGLLETPPPQESTEKEEK 339 T 32 NMU unphh F Eukaryota T 8d8k 14 N V RTPT_YEAST MITOCHONDRIAL SMALL RIBOSOMAL SUBUNIT PROTEIN MS26 MGKGAAKYGFKSGVFPTTRSILKSPTTKQTDIINKVKSPKPKGVLGIGYAKGVKHPKGSHRLSPKVNFIDVDNLIAKTVAEPQSIKSSNGSAQKVRLQKAELRRKFLIEAFRKEEARLLHKHEYLQKRTKELEKAKELELEKLNKEKSSDLTIMTLDKMMSQPLLRNRSPEESELLKLKRNYNRSLLNFQAHKKKLNELLNLYHVANEFIVTESQLLKKIDKVFNDETEEFTDAYDVTSNFTQFGNRKLLLSGNTTLQTQINNAIMGSLSNEKFFDISLVDSYLNKDLKNISNKIDSKLNPTSNGAGNNGNNNNTTNL 318 T 0.003 MRP-S26 unppssm F Eukaryota T 8d8k 34 HA c unknown protein sequence XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 94 F F F 8d8l 1 A 0 RT22_YEAST Probable S-adenosyl-L-methionine-dependent RNA methyltransferase RSM22, mitochondrial MMKRCFSILPQNVRFSSKFTSLNLPKLDLADFIDSNKRGINVLPSYRDETASTTQATNSKELRLLSKTLQGQSYRDQLELNPDVSKAINNNIMAVHIPNNLRRVATNYYKEIQEPNSLHRPCRTKMEVDAHIASIFLQNYGSIFQSLKELQKRVGPDNFKPQRILDVGYGPATGIVALNDILGPNYRPDLKDAVILGNAEMQERAKIILSRQLNEVVDTVEENVSTEKEQETDRRNKNFQEDEHIGEVMTKKINIMTNLRSSIPASKEYDLIILTHQLLHDGNQFPIQVDENIEHYLNILAPGGHIVIIERGNPMGFEIIARARQITLRPENFPDEFGKIPRPWSRGVTVRGKKDAELGNISSNYFLKVIAPCPHQRKCPLQVGNPNFYTHKEGKDLKFCNFQKSIKRPKFSIELKKGKLLATSWDGSQGNASRLKGTGRRNGRDYEILNYSYLIFERSHKDENTLKEIKKLRNENVNGKYDIGSLGDDTQNSWPRIINDPVKRKGHVMMDLCAPSGELEKWTVSRSFSKQIYHDARKSKKGDLWASAAKTQIKGLGDLNVKKFHKLEKERIKQLKKEERQKARKAMESYNELEDSLQFDDHQFSNFEVMKKLSTFHGNDFLQHVNRK 628 T 2.3E-26 Rsm22 pdbpercent F Eukaryota T 8d8l 2 B 5 RT13_YEAST MITOCHONDRIAL SMALL RIBOSOMAL SUBUNIT PROTEIN MS44,YMS-A MGTITVVINEGPILLIRALHRATTNKKMFRSTVWRRFASTGEIAKAKLDEFLIYHKTDAKLKPFIYRPKNAQILLTKDIRDPKTREPLQPRPPVKPLSKQTLNDFIYSVEPNSTELLDWFKEWTGTSIRKRAIWTYISPIHVQKMLTASFFKIGKYAHMVGLLYGIEHKFLKAQNPSVFDIEHFFNTNIMCALHRNRLKDYKDAEIAQRKLQVAWKKVLNRKNNTGLANILVATLGRQIGFTPELTGLQPVDISLPDIPNSSSGAELKDLLSKYEGIYLIARTLLDIDQHNAQYLELQEFIRQYQNALSESSDPYDTHLKALGLLETPPPQESTEKEEK 339 T 32 NMU unphh F Eukaryota T 8d8l 13 M V RTPT_YEAST MITOCHONDRIAL SMALL RIBOSOMAL SUBUNIT PROTEIN MS26 MGKGAAKYGFKSGVFPTTRSILKSPTTKQTDIINKVKSPKPKGVLGIGYAKGVKHPKGSHRLSPKVNFIDVDNLIAKTVAEPQSIKSSNGSAQKVRLQKAELRRKFLIEAFRKEEARLLHKHEYLQKRTKELEKAKELELEKLNKEKSSDLTIMTLDKMMSQPLLRNRSPEESELLKLKRNYNRSLLNFQAHKKKLNELLNLYHVANEFIVTESQLLKKIDKVFNDETEEFTDAYDVTSNFTQFGNRKLLLSGNTTLQTQINNAIMGSLSNEKFFDISLVDSYLNKDLKNISNKIDSKLNPTSNGAGNNGNNNNTTNL 318 T 0.003 MRP-S26 unppssm F Eukaryota T 8d8l 34 HA c unknown protein sequence XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 94 F F F 8d9o 1 A A Reaction Center Maquette GSPELRQEHQQLAQEFQQLLQEIQQLGRELLKGELQGIKQLREASEKARNPEKKSVLQKILEDEEKHIELLETLQQTGQEAQQLLQELQQTGQELWQLGGSGGPELRQKHQQLAQKIQQLLQKHQQLGAKILEDEEKHIELLETILGGSGGDELRELLKGELQGIKQYRELQQLGQKAQQLVQKLQQTGQKLWQLG 196 T 0.0003 Rubrerythrin pdbpercent F T 8d9p 1 A A Reaction center maquette GSPELRQEHQQLAQEFQQLLQEIQQLGRELLKGELQGIKQLREASEKARNPEKKSVLQKILEDEEKHIELLETLQQTGQEAQQLLQELQQTGQELWQLGGSGGPELRQKHQQLAQKIQQLLQKHQQLGAKILEDEEKHIELLETILGGSGGDELRELLKGELQGIKQYRELQQLGQKAQQLVQKLQQTGQKLWQLG 196 T 0.0003 Rubrerythrin pdbpercent F T 8dc2 1 A A CasLambda MASHKKTESNQIIKTFSFKIKNANGLSLDVLNDAITEYQNYYNICSDWIKDHLTMKISELYKYIPNEKKNSGYALTLISDEWKDKPMYMMFKKGYPANNRDNAIYETLNTCNTEHYTGNILNFSDTYYRRFGYVASAISNYVTKISKMSTGSRSKNISNDSDVDTIMEQVIYEMEHNGWTSVKDWENQMEYLESKTDSNPNFVYRMTTLYEFYKSHIDEVNSKMETMSIDSLIKFGGCRRKDSKKSMYIMGGSNTPFDITQIGGNSLNIKFSKNLNVDVFGRYDVIKDNTLLVDIINGHGASFVLKIINDEIYIDINVSVPFDKKIATTNKVVGIDVNIKHMLLATNILDDGNVKGYVNIYKEVINDSDFKKVCNSTVMQYFTDFSKFVTFCPLEFDFLFSRVCNQKGIYNDNSAMEKSFSDVLNKLKWNFIETGDNTKRIYIENVMKLRSQMKAYAIVKNAYYKQQSEYDFGKSEEFIQEHPFSNTDKGIEILNKLDNISKKILGCRNNIIQYSYNLFEINGYDMVSLEKLTSSQFKKKPFPTVNSLLKYHKILGCTQEEMEKKDIYSVIKKGYYDIIFDNDVVTDAKLSAKGELSKFKDDFFNLMIKSIHFADIKDYFITLSNNGTAGVSLVPSYFTSQMDSIDHKIYFVQDNKSGKLKLANKHKVRSSQEKHINGLNADYNAARNIAYIMENTDCRNMFMKQSRTDKSLYNKPSYETFIKTQGSAVAKLKKEGFVKILDEASVGSSGHHHHHH 756 T 12 OrfB_IS605 pdbhh F T 8dcn 3 C,F C,F A8DS70_CLODI ADP-RIBOSYLTRANSFERASE BINDING COMPONENT,CDTB MKIPTDQEIMDAHKIYFADLNFNPSTGNTYINGMYFAPTQTNKEALDYIQKYRVEATLQYSGFKDIGTKDKEMRNYLGDPNQPKTNYVNLRSYFTGGENIMTYKKLRIYAITPDDRELLVLSVDHHHHHH 130 T 2.1E-05 Fve unphh F Bacteria T 8ddc 1 A A IL2RG_MOUSE INTERLEUKIN-2 RECEPTOR SUBUNIT GAMMA,IL-2 RECEPTOR SUBUNIT GAMMA,IL-2R SUBUNIT GAMMA,IL-2RG,GAMMAC,P64 EENPSLFALEAVLIPVGTVGLIITLIFVYFWLER 34 T 0.29 TMEM154 pdbhh F Eukaryota T 8ddc 2 B B IL7RA_MOUSE IL-7 RECEPTOR SUBUNIT ALPHA,IL-7R SUBUNIT ALPHA,IL-7R-ALPHA,IL-7RA GGWDPVLPSVTILSLFSVFLLVILAHVLWKK 31 T 0.0029 IFNGR1 unphh F Eukaryota T 8ddd 1 A A IL2RG_MOUSE INTERLEUKIN-2 RECEPTOR SUBUNIT GAMMA,IL-2 RECEPTOR SUBUNIT GAMMA,IL-2R SUBUNIT GAMMA,IL-2RG,GAMMAC,P64 EENPSLFALEAVLIPVGTVGLIITLIFVYFWLER 34 T 0.29 TMEM154 pdbhh F Eukaryota T 8ddd 2 B B IL9R_MOUSE IL-9 RECEPTOR,IL-9R QWSASILVVVPIFLLLTGFVHLLFKLSPRLK 31 T 0.021 DUF2207 unppercent F Eukaryota T 8ddf 1 A A PHE-TRP-PHE FWF 3 T 22 Pannexin_like pdbhh F F 8ddf 2 B B DPN-DTY-DPN XXX 3 F F F 8ddg 1 A A PHE-TYR-PHE FYF 3 T 33 Pox_G5 pdbhh F F 8ddh 1 A A PHE-TYR-PHE FYF 3 T 33 Pox_G5 pdbhh F F 8ddq 2 B,D,F,H E,F,G,H Unidentified segment at the N-terminus of TRPM3 XXXXXXXXXXXXXXXXX 17 F F F 8ddr 2 B,D,F,H E,F,G,H Unidentified segment at the N-terminus of TRPM3 XXXXXXXXXXXXXXXXX 17 F F F 8dds 2 B,D,F,H E,F,G,H Unidentified segment at the N-terminus of TRPM3 XXXXXXXXXXXXXXXXX 17 F F F 8ddt 2 B,D,F,H E,F,G,H Unidentified segment at the N-terminus of TRPM3 XXXXXXXXXXXXXXXXX 17 F F F 8ddu 2 B,D,F,H E,F,G,H Unidentified segment at the N-terminus of TRPM3 XXXXXXXXXXXXXXXXX 17 F F F 8ddv 2 B,D,F,H E,F,G,H Unidentified segment at the N-terminus of TRPM3 XXXXXXXXXXXXXXXXX 17 F F F 8ddw 2 B,D,F,H E,F,G,H Unidentified segment at the N-terminus of TRPM3 XXXXXXXXXXXXXXXXX 17 F F F 8ddx 2 B,D,F,H E,F,G,H Unidentified segment at the N-terminus of TRPM3 XXXXXXXXXXXXXXXXX 17 F F F 8des 2 B E A0A6G9L8Z7_9VIRU Putative DNA binding protein MARSRRRMSKRSSRRSFRKYAKTHKRNFKARSMRGGIRL 39 T 31 ELFV_dehydrog_N pdbhh T Viruses T 8deu 2 D D CASP peptide CTNEDGKPC 9 T 0.49 DUF6440 pdbhh F T 8dev 2 D D Colistin XXTXXXXLXXT 11 T 92 DUF111 pdbhh F F 8dfo 6 M M AcrIC4 MDNKITPADEEKIREWLNCEEASVDNDGDVWVAVPMTGHWLSDEQKAKYIEWRGDET 57 T 0.15 LEA_3 pdbpercent F T 8dfs 6 M M ACR30_BPD31 GENE PRODUCT 30,GP30, ACRIF2 MIAQQHKDTVAACEAAEAIAIAKDQVWDGEGYTKYTFDDNSVLIQSGTTQYAMDADDADSIKGYADWLDDEARSAEASEIERLLESVEEE 90 T 0.13 Transglycosylas pdb T Viruses T 8dft 1 A,B,C,D,E,F,G,H,I,J,K,L,M,N,O,P,Q,R,S,T A,B,C,D,E,F,G,H,I,J,K,L,M,N,O,P,Q,R,S,T A3MU74_PYRCJ Pilin protein TSVEFWQNIASGVGKWLRAIFAIAFWSSLILLTFYAIMTQVAPSKVFRLGALVDLIESVKTVLLGIFVFTASVTGIIAGVAAIANAFGASFAVSPIDVVNALIFQPIVDMVK 112 T 0.0039 TrbC unppssm F Archaea T 8dfu 1 A,AA,B,BA,C,CA,D,DA,E,F,G,H,I,J,K,L,M,N,O,P,Q,R,S,T,U,V,W,X,Y,Z B,0,A,1,C,2,D,3,E,F,G,H,I,J,K,L,M,N,O,P,Q,R,S,T,U,V,W,X,Y,Z A0A401HBH5_AERPX Pilin protein AADIVQMVEDLTGKLTALAWALFLLSWSIGWTLRGSPIPSSRIKRVGNSLIEDSMWAALWLALGTTVFAVIVRLAGIVNEVLLG 84 T 0.036 DUF6010 unppercent F Archaea T 8dgh 2 B B CNGB1_HUMAN CYCLIC NUCLEOTIDE-GATED CATION CHANNEL 4,CNG CHANNEL 4,CNG-4,CNG4,CYCLIC NUCLEOTIDE-GATED CATION CHANNEL GAMMA,CYCLIC NUCLEOTIDE-GATED CATION CHANNEL MODULATORY SUBUNIT,CYCLIC NUCLEOTIDE-GATED CHANNEL BETA-1,CNG CHANNEL BETA-1,GLUTAMIC ACID-RICH PROTEIN,GARP KLAHLRARLKEL 12 T 0.68 DUF5320 pdbhh F Eukaryota T 8dgk 2 B B CNGB1_HUMAN CYCLIC NUCLEOTIDE-GATED CATION CHANNEL 4,CNG CHANNEL 4,CNG-4,CNG4,CYCLIC NUCLEOTIDE-GATED CATION CHANNEL GAMMA,CYCLIC NUCLEOTIDE-GATED CATION CHANNEL MODULATORY SUBUNIT,CYCLIC NUCLEOTIDE-GATED CHANNEL BETA-1,CNG CHANNEL BETA-1,GLUTAMIC ACID-RICH PROTEIN,GARP NDRLQELVKLFK 12 T 1.8 UBA_6 pdbhh F Eukaryota T 8dgm 2 B B PEAK1_HUMAN PSEUDOPODIUM-ENRICHED ATYPICAL KINASE 1,SUGEN KINASE 269,TYROSINE-PROTEIN KINASE SGK269 PPPLPKKMIIRANTEPISKD 20 T 13 DUF4628 pdbhh F Eukaryota T 8dgn 2 B B Phosphorylated PEAK2 (pS826) peptide PPPLPQKKIVSRAASSPDGF 20 T 100 Fmp27_WPPW pdbhh F T 8dgo 2 C C Phosphorylated PEAK3 (pY24) peptide XXSNLGQ 7 T 0.43 LIME1 pdbhh F T 8dgp 2 E,F,G,H E,F,G,H Phosphorylated PEAK3 (pS69) peptide PLPPPLPKKILTRTQSLPTRR 21 T 3.7 NINJA_B pdbhh F T 8dgz 2 C,D C,D Ac-Asp-Glu-Val-Asp-Aldehyde XDEVX 5 T 570 Helicase_RecD pdbhh F F 8di2 1 A A Site 2 binding peptide IM459N21 XSLEQEWXKIECEVYGKCPPKX 22 T 2.5 DUF5385 pdbhh F T 8dj6 2 E,F,G,H E,F,G,H Imub-peptide XQLPLWGX 8 T 6 Plug_translocon pdbhh F T 8djq 2 C,D,G,H E,F,G,H DPO3A_MYCTU DNA polymerase III subunit alpha peptide QFDLFG 6 T 26 Biliv-reduc_cat pdbhh F Bacteria F 8dk4 2 B B PRGC1_HUMAN PGC-1-ALPHA,PPAR-GAMMA COACTIVATOR 1-ALPHA,PPARGC-1-ALPHA,LIGAND EFFECT MODULATOR 6 AEEPSLLKKLLLAPA 15 T 5.4 DUF1467 pdbhh F Eukaryota T 8dk9 2 C,D,G,H E,F,G,H DPO41_MYCTU POL IV 1 QESLFA 6 T 20 DUF6248 pdbhh F Bacteria T 8dkn 2 B C NCOR1_HUMAN N-COR,N-COR1 NLGLEDIIRKALM 13 T 6.9 DUF1244 pdbhh F Eukaryota T 8dkv 2 B C NCOR1_HUMAN N-COR,N-COR1 SNLGLEDIIRKALM 14 T 9.2 DUF1244 pdbhh F Eukaryota T 8dl7 4 D F Minihepcidin PR73 XTHXXRCRXXXX 12 T 18 zf-U11-48K pdbhh F F 8dml 1 A,C,E,G A,C,E,G Q87GI4_VIBPA VtrA MTAKDDYPSLSFQQDYVYIFSSDFQLSEELGVALINALSAKEIVPERLYVMLNDKTISFSFISKNKKSKNRVLSTEKKLNYKHISEYIVNEIEY 94 T 0.014 NUP214 pdb F Bacteria T 8dml 2 B,D,F,H B,D,F,H Q87GI3_VIBPA VtrC MGSSHHHHHHSQDPVHFYETSYKYQAADSTYMHDVAINVSIKGNHFTSDIIIRELVKSENKNYYNVIGHGDIIQKNTHQYYLNFDNIDVYTGTNKANMKPYKEPTSISSLINKSNNIRVVYLSEEYVVVEFFFYDGQIITLHRY 144 T 0.17 Gp13-like unppercent F Bacteria T 8dnq 2 C,D C,D Cyclic peptide 2.2B XWYSXKYAXWWTVYPCX 17 T 2.2 Trp_leader1 pdbhh F T 8dnx 4 D D Cotransin analogue peptide inhibitor XXLXLXX 7 T 720 Skp1 pdbhh F F 8dny 4 D D Decatransin peptide inhibitor AXXXXXXXXX 10 T 1300 CTV_P6 pdbhh F F 8dnz 4 D D Apratoxin F peptide inhibitor XXXXX 5 T 290 HHH_9 pdbhh F F 8do8 2 B,D B,D ATG13_HUMAN Autophagy-related protein 13 METDLNSQDRKDLDKFIKFFALKTVQVIVQARLGEKICTRSSSSPTGSDWFNLAIKDIPEVTHEAKKALAGQLPAVGRSMCVEISLKTSEGDSMELEIWCLEMNEKCDKEIKVSYTVYNRLSLLLKSLLAITRVTPAYRLSRKQGHEYVILYRIYFGEVQLSGLGEGFQTVRVGTVGTPVGTITLSCAYRINLAFMS 197 T 7.3E-08 ATG13 pdb F Eukaryota T 8doa 1 A A HEEH mini-protein TK_rd5_0958 MGSSHHHHHHSSGLVPRGSHMDIEEIEKKARKILEKGDSIEIAGFEVRDEEDLKKILEWLRRHG 64 T 0.027 TFIIE_beta pdbpssm F T 8dpk 1 A,B A,D RESC5 MGSSHHHHHHSSGLVPRGSHMNNALHAFVRSPHYRTIPSAGPNGIVVNRDMLVHQFRDFYKTLQHCSLVDKVHLMSERPSVEALRVADQMVSIGATFLEMPLTGMEHRATEFMESMRYVRGAGGPSTLASYLQDTENCRCNSGDVVCLPNGIAVGHGPRTNAVAHTTLKQLFEVKDDQFSFDVFTLEQEGDAPPLGDYFGFAGSNVLLTWKDEHGLLAVDQYQQKQPHTEMNVVYLEPGCHFLSFYGVDHTIDVLVQKGYERSMDSIAAAGLNPIPVQWSEMDKLGISMRAAVLPLKFFK 300 T 0.27 ADI pdb F T 8dpy 1 A,B A,B beta sheet-forming peptide with flexible linker TYRVXTWETX 10 T 46 DUF3768 pdbhh F T 8dqv 1 A,C A,C A0QUM7_MYCS2 Hydrogenase-2, large subunit LDLFVSPLGRVEGDLDVRVTINDGVVTSAWTEAAMFRGFEIILRGKDPQAGLIVCPRICGICGGSHLYKSAYALDTAWRTHMPPNATLIRNICQACETLQSIPRYFYALFAIDLTNKNYAKSKLYDEAVRRFAPYVGTSYQPGVVLSAKPVEVYAIFGGQWPXSSFMVPGGVMSAPTLSDVTRAIAILEHWNDNWLEKQWLGCSVDRWLENKTWNDVLAWVDENESQYNSDCGFFIRYCLDVGLDKYGQGVGNYLATGTYFEPSLYENPTIEGRNAALIGRSGVFADGRYFEFDQANVTEDVTHSFYEGNRPLHPFEGETIPVNPEDGRRQGKYSWAKSPRYAVPGLGNVPLETGPLARRMAASAPDAETHQDDDPLFADIYNAIGPSVMVRQLARMHEGPKYYKWVRQWLDDLELKESFYTKPVEYAEGKGFGSTEAARGALSDWIVIEDSKIKNYQVVTPTAWNIGPRDASEVLGPIEQALVGSPIVDAEDPVELGHVARSFDSCLVCTVH 513 T 4.1E-19 NiFeSe_Hases pdb F Bacteria T 8dqw 6 F G DDC1_YEAST DDC1 isoform 1 MDYKDDDDKDYKDDDDKDYKDDDDKLEVLFQGPGMSFKATITESGKQNIWFRAIYVLSTIQDDIKITVTTNELIAWSMNETDTTLCQVRFQKSFFEEYEFKPHEIVFGENGVQVIEDTYGNSHKLYSFRVNGRHLTTISRKPDGDGIKSFTIAVNNTSTCPESLANRLIVVIEMDSLIVKEYCPQFQPIKYDPIIINLKYKRRFLDVFGTAASDRNPQEPLDPKLLDVFTNTERELTSALFNEEVESDIRKRNQLTAADEINYICCNSTLLKNFLDNCNVNVTDEVKLEINVHRLSITAFTKAVYGKNNDLLRNALSMSNTISTLDLEHYCLFTTIEDEKQDKRSHSKRREHMKSIIFKLKDFKNFITIGPSWKTTQDGNDNISLWFCHPGDPILMQMQKPGVKLELVEVTDSNINDDILEGKFIKTAISGSKEEAGLKDNKESCESPLKSKTALKRENLPHSVAGTRNSPLKVSYLTPDNGSTVAKTYRNNTARKLFVEEQSQSTNYEQDKRFRQASSVHMNMNREQSFDIGTTHEVACPRNESNSLKRSIADICNETEDPTQQSTFAKRADTTVTWGKALPAADDEVSCSNIDRKGMLKKEKLKHMQGLLNSQNDTSNHKKQDNKEMEDGLGLTQVEKPRGIFD 646 T 2.8E-09 Rad9 pdbpssm F Eukaryota T 8dst 1 A,B,C,D,E,F A,B,C,D,E,F NBD-ffsy peptide XXXXX 5 F F F 8dsx 1 A,B A,B EA22_LAMBD Protein ea22 HHHHHHMSNEVREDGNQFLVVRHPGKTPVIKHCTGDLEEFLRQLIEQDPLVTIDIITHRYYGVGGQWVQDAGEYLHMMSDAGIRIKGEIETAV 93 T 0.11 SNAD4 pdb T Viruses T 8dt0 1 A,B A,B Scaffolding protein functional sites EEEELEELAKELEKILRDEEGHLRKLKEALAEGLGDAEEAAELFRAESIDEMKHAEELAKLLKKGGLDPELRELLEELAELELVAINQYREAAEAAAEAAENGSEEARAAAREALEEALALELDGAKLARAALEAVEKLL 140 T 0.037 Ferritin pdb F T 8dtl 1 A,D C,D Insulin mimetic peptide S597 SLEEEWAQIECEVYGRGCPSESFYDWFERQL 31 T 0.78 BioT2 pdbhh F T 8dtm 2 C C Insulin mimetic peptide S597 component 2 SLEEEWAQIECEVYGRGCPS 20 T 1.9 DUF5385 pdbhh F T 8dto 1 A,E,I E,A,I CH848.3.D0949.10.17chim.6R.DS.SOSIP.664_N133D_N138T gp120 MGSLQPLATLYLLGMLVASVLAAENLWVTVYYGVPVWKEAKTTLFCASDARAYEKEVHNVWATHACVPTDPSPQELVLGNVTENFNMWKNDMVDQMHEDIISLWDQSLKPCVKLTPLCVTLICSDATVKTGTVEEMKNCSFNTTTEIRDKEKKEYALFYKPDIVPLSETNNTSEYRLINCNTSACTQACPKVTFEPIPIHYCAPAGYAILKCNDETFNGTGPCSNVSTVQCTHGIRPVVSTQLLLNGSLAEKEIVIRSENLTNNAKIIIVHLHTPVEIVCTRPNNNTRKSVRIGPGQTFYATGDIIGDIKQAHCNISEEKWNDTLQKVGIELQKHFPNKTIKYNQSAGGDMEITTHSFNCGGEFFYCNTSNLFNGTYNGTYISTNSSANSTSTITLQCRIKQIINMWQGVGRCMYAPPIAGNITCRSNITGLLLTRDGGTNSNETETFRPAGGDMRDNWRSELYKYKVVKIEPLGVAPTRCKRRVVG 487 T 5E-50 GP120 pdb F T 8dtu 1 A B Nanobody 5344N74D QVQLVK 6 T 49 Mitoc_mL59 pdbhh F F 8duz 3 E,F E,F Mimetic peptide GPIPVLDENGLFAPGPC 17 T 0.58 FAD-oxidase_C pdbhh F T 8dwc 1 A A PENK_BOVIN Proenkephalin-A VGRPEWWMDYQKRYG 15 T 7.2 DUF1694 pdbhh F Eukaryota T 8dwg 1 A A PENK_BOVIN Proenkephalin-A VGRPEWWMDYQKRYG 15 T 7.2 DUF1694 pdbhh F Eukaryota T 8dyn 1 A A Cloacaenodin GHSVDRIPEYFGPPGLPGPVLFYS 24 T 6.7 DREPP pdbhh F T 8dz8 1 A,B,C,D,E,F,G,H A,B,C,D,E,F,G,H neoleukin-4 GPKKKIQIMAEEALKDALSILNIVKTNSPPAEEQLERFAKRFERNLWGIARLFESGDQKDEAEKAKRMIEWMKRIKTTASEDEQEEMANAIITILQSWFFS 101 T 0.044 DUF2264 pdb F T 8e07 2 B B HPSE_HUMAN Heparanase 8 kDa subunit MGSSHHHHHHSQDPNSSSQDVVDLDFFTQEPLHLVSPSFLSVTIDANLATDPRFLILLGSPKLRTLARGLSPAYLRFGGTKTDFLIFDPKKE 92 T 3.5 Glyco_hydro_79n unppercent F Eukaryota T 8e08 2 B B HPSE_HUMAN Heparanase 8 kDa subunit MGSSHHHHHHSQDPNSSSQDVVDLDFFTQEPLHLVSPSFLSVTIDANLATDPRFLILLGSPKLRTLARGLSPAYLRFGGTKTDFLIFDPKKE 92 T 3.5 Glyco_hydro_79n unppercent F Eukaryota T 8e0l 1 A,B,C A,B,C BGL06 EGSDDLLLKLLELLVEQARVSAEFARRQGDEKMLEEVARKAEEVARKAEEIARKARKEGNLELALKALEILVRAAHVLAEIARERGNEELLKKAWKLAKEALRQVKEIAEQAQKEGNLELAIIALHISVRIAEVLLETRPDDREEIRKQQEEFEELIKRLEKQVG 165 T 0.058 PLU-1 pdb F T 8e0m 1 A,B,C,D,E,F,G,H,I,J,K,L A,B,C,D,E,F,G,H,I,J,K,L BGL15 SLKEKIEKLVEELIRHTEELRELLEKLVKHGGASEEYLLELLENLVRLAHVIAEVAREQGNEELLEEAARLAEEAARQAEELAREARREGNLELALKALQILVNAAYVLAEIARDRGNEELLEKAERLAREALRQVREISKRLQKEGNIELALKANRLLIDALRVLVRIMRHR 173 T 0.02 Ferritin pdbpssm F T 8e0n 1 A,B,C,D,E,F A,B,C,D,E,F BGL18 EGSPRLVLRALENMVRAAHTLAEIARDNGNEEWLERAARLAEEVARRAEELAREARKEGNLELALKALQILVNAAYVLAEIARDRGNEELLEKAARLAEEAARQAEEIARQARKEGNFELALEALEILNEAARVLARIAHHRGNQELLEKAWRLTHRSAKWSREIAEQARKEGE 174 T 0.039 PLU-1 pdbpercent F T 8e0o 1 A A hetBGL03-15-18a GSLELELQNLELLVHIAEVLARLARRTGNEEALEHAARVAEEVAKQAEEIAREARYRGDLRLALEALRIMVEAARVLAEIARERGNEELLQKAEELAREALRQVREISKRLQEEGNIELALKANRLLIDALEVLVRIMRHR 141 T 0.0019 HisKA_3 pdb F T 8e0o 2 B B hetBGL03-15-18b SSLEEKIEELVKELIKHTEELRRLLEKLVKEGGASEEYLLELLENLVRLARVIAEVAREQGNEELLEEAARLAEEAARQAEELAREARYEGDLELALKALQILVNAARVLAEIARDRGNEELLQKAAELAKEAARQAEEIAKEARERGNFELALEALEILNEAARVLARIAHHRGNQELLEEAWRLTHRSAKWSREIAEQARKEGE 206 T 0.027 DUF3584 pdb F T 8e0o 3 C C hetBGL03-15-18c SPRLVLRALENMVRAAHTLAEIARDNGNEEWLERAARLAEEVARRAEELAREAREKGDLELALKALQILVNAAYVLAEIARDRGNEELLKKAHELARKAAEEAQKIAEQARYEGNLELFNKALRILLEAIRVLIEHDDSEEAARELIRRLEELLEQSRRSMKG 163 T 0.014 Adaptin_N pdb F T 8e12 1 A,B,C A,B,C BGL14 SAEEELKKLLEENIKLIEELLEEVKHNDPELLLSVLEVLVRSVHVIAEVAREQGNEELLERAARLAEEAAYQAEEVAREARKRGNLELALKALQILVNAAYVLAEIARDRGNEELLQKAHELAREALRQVKEILEQARKEGNLELVIIALRLHTEIMRVLVEIWRHR 167 T 0.0031 Abdominal-A pdb F T 8e13 3 C C PHE-ALA-LYS-LYS-LYS-TYR-CYS-LEU FAKKKYCL 8 T 1 MIER1_beta_C pdbhh F T 8e1d 1 A B MITF_HUMAN CLASS E BASIC HELIX-LOOP-HELIX PROTEIN 32,BHLHE32 GSRASCMQMDDVIDDIISLESSYNEEILGLMDPA 34 T 0.61 MITF_TFEB_C_3_N pdbhh F Eukaryota T 8e1p 5 I,J,L E,G,M BG505-SOSIP.v4.1-GT1.2gp120 AENLWVTVYYGVPVWKDAETTLFCASDAKAYETKKHNVWATHACVPTDPNPQEIHLENVTEEFNMWKNNMVEQMHTDIISLWDQSLKPCVKLTPLCVTLQCTNVTNNITDDMRGELKNCSFNMTTELRDKRQKVHALFYKLDIVPINENQNTSYRLINCNTAAITQACPKVSFEPIPIHYCAPAGFAILKCKDKKFNGTGPCPSVSTVQCTHGIKPVVSTQLLLNGSLAEEEVMIRSEDIRDNAKNILVQFNTPVQINCTRPNNNTRKSIRIGPGQWFYATGDIIGDIRQAHCNVSKATWNETLGKVVKQLRKHFGNNTIIRFANSSGGDLEVTTHSFNCGGEFFYCDTSGLFNSTWISNTSVQGSNSTGSNDSITLPCRIKQIINMWQRIGQAMYAPPIQGVIRCVSNITGLILTRDGGSTDSTTETFRPSGGDMRDNWRSELYKYKVVKIEPLGVAPTRCKRRVVGRRRRRR 474 T 5.1E-50 GP120 pdb F T 8e1u 1 A,B,C,D,E,F,G,H A,B,C,D,E,F,G,H A0A160VN62_PROFR PPi-dependent PEPCK MSVVERRQINAAINLRLSLLGLPHPDSNAESPDAILVEPLLARQRELSRRLKDRLSAPDLRIQRFLDDYLADCDEHPQLPRTTLVLDEPGLARGLSLPVDGDEFHSDIVASYRLVNGVLHNPKHDRRTTAGVFHISTGGLPIPQDKVEVDKNVYARILARAFQAPDEELALPYTANLPEQAHCWASLLMRPTVLPAVPGRTTEKSYEVHFIVPGGLMCNLDFVEGIFGNAGDPYLPENDASLDPDSWTGHTGCVILAPHLTTMTKKSLGMPHYDDATERQRRDGQCWRHEDDLYNDGKAFKVCARDERGVIVTVIADNYFGYCKKEVKTQISYSANLLGGAEEEHSGGAEVYPAWNLNQDFTDRTPDDFTLADVISTNRELLDVRPEGYAVYKPEPNIVFIPEHSHYSMRTQTISWTAHGAEQTIKLLAGKHYLSPDGYRIHAKHREMDATQWHLIGTSSRAVTCHKPATVSGGGKSEISKSISDAFVFGNAFSHDIDSAMDQVQALFDTDFTNRFADASRNGTDHRPVLSIDRSLGSVIKLLTPSIQYNDEYNAFLEGIEPDVKELAFTVKRYYLPEWGEDWRSHFTVGIMNGRHGNMVRLDGKKIITNMLRVGFREDGSWRLFTLRPDYSPAVKVQTEDDITASTVTPPWEDAEGLPRKYVTNCEHLLFQRPDDAIHRGYDKQAEFDLASGTDTFISNFEPLTHEQARDLLTDVQAYSEFTKPVRKLIERVAAMPDDQSPEFWVCSDDPRHLPDGGRSKNPRYLQVRPTDSNPELTTVADVAGKLARKLPLAGHAPQPIDVVAAGRRNNPPEDKVPALCAYNPLHYMELPELFMEYISSMTGKSPSTTGAGSEGALTKGPFNALPAVYDLNAAVLSYALTDYDGWLSSAGYIGPNARVDHDISMLIPELFSHMGPNDRNTKRLISEGYLEKMQDFDFDGHRVLASRLGYRINDRFVTHYFGRIFLHPDVVFSEEMLRPELQDEKIFADSIDVIVKTHQRVAQMYFDDGTVSLACPPIRALLEIMAHGASAEGWTLDSPEFRKLFERESVLASDWYAARLDAKQAEDVKQTEEGVERLKEYIERPDSGSVSARLHLADRLRELEAQLTYERSPEYRRSLVGTLGRQPRFV 1131 T 2.7 DUF5788 pdbpercent F Bacteria T 8e2i 3 E,F C,D Baculoviral IAP repeat-containing protein 6 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 57 F F F 8e2j 2 C,D C,D Baculoviral IAP repeat-containing protein 6 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 57 F F F 8e41 11 K B VIRGINIAMYCIN S1 XTXPXXX 7 T 260 zf-C2H2_jaz pdbhh F F 8e55 1 A,B,C,D A,B,C,D SG135 DREIKEEARKLIREAIELLQKGDPRAKEILRQAILILLAIRLLEEMEENIEKAEKLGNEELSELAKRAIKLVREALELLKEGDPRAEEILKLALKIIKAILLLLEMYENIKQAEELGDEDLSELAKIAIRLVRQALKLLQEGDPRAEEILEIALRIIKLILQLLFLKQRIEEAKKKGDQQFVFEAEEKIRRIVEELFKLLEG 202 T 0.019 IF-2B pdbpercent F T 8e5f 1 A A A3MW92_PYRCJ c-type cytochrome MKKFPALITTLLLLAVFVAATYGPPYSYNHPTNCISCHSNSTGTANSQALSGLTSGPAAGACDPSQQECVWSHQVLKGTDVWKKCINCHVAIWNSINSGPGNVHSGLLNSYGCACHAVAHVGYGNPTDGYTACIYFYVPRLSTATPGYFGAKPTLDFRNVYICFKGTPEGTYTFSGNAPTSLMQLLESKGEVTVKALLVGYDKYANGTVKAKSSAADFLETDFFSALEQAGIFRYEWGTASGAVLKNPSVRTHPLTEEAPNGETIVMGVFDIHTGDFILVAPYAPYSRAPYYLPVAVNPGVAACFNCHFVYQGQLGTAKVMEVGGVWKIGIPADVLNSLTDPHKIVMPAAQAAGGGVAPNLSLVALLATATLLGGAFLALRRRAQ 385 T 0.0015 Cytochrom_NNT pdbhh F Archaea T 8e73 10 J,T K,W QCR10 (UCRY) MAGLPARLRIQPADVKAAAMWGVAAATGGLYLVQVSILVLPPVKVVFHFYLVSGFRICLDVKDLRTMPCSAPRIWRLYILI 81 T 0.063 QCR10 pdbhh F T 8e73 24 GA A7 A0A1S3UVC7_VIGRR NDUA7 MAKSASNSLVQTLKRYIKKPWEITGPCADPEYRSAVPLATEYRLQCPATTKEKPCIPNSLPETVYDIKYFSRDQRRNRPPIRRTVLKKADVEKLAKEQTFAVSDFPPVYLNSAVEEDINAIGGGYQG 127 T 0.0011 CI-B14_5a pdb F Eukaryota T 8e73 34 QA B4 A0A1S3ULL3_VIGRR NDUB4 MGGGMEANKNKFIEDWGTARENLEFNFRWTRRNLALVGIFGIAIPVLVYKGIVREFHMQDEDNGRPYRKFM 71 T 0.0032 NDUF_B4 unppercent F Eukaryota T 8e73 36 SA B8 A0A1S3UJ95_VIGRR NDUB8 MAGRLTNAASRILGGNGVVYRSVASSLRLRSGMGLPVGKHYIPDKPLPMNEELLWDNGTPFPEPCIDRIADTVGKYEALAWLCGGLSFFASLGLLAVWNDKASKIPFTPKVYPYDNLRVELGGEP 125 T 0.00063 NDUF_B8 pdbhh F Eukaryota T 8e73 41 XA C2 NDUC2 MVLSATTIGALLGLGTQMYSNALRKLPYMRHPWEHVVGMGLGAVFVNQLLKWEAQVEQDLDKMLEKAKAANERRYIDGDDDI 82 T 0.036 NDUF_C2 pdbpercent F T 8e73 42 YA P2 A0A0L9V7A4_PHAAN NDUP2 MAARVAARYGSRRLFSSGSGKILSEEEKAAENAYFKKAEQDKLEKLARKGPQPEASSGGSVIDAKPSGSGHTGASAERVSTDKHRNYAVVAGTITILGALGWYLKGTAKKPEVQD 115 T 0.0082 IATP unphh F Eukaryota T 8e73 55 LB P4 A0A1S3UND4_VIGRR NDUP4 MVRVASYFAMTLGAFVFWQSMDKVHVWIALHQDEKQERLEKEAEIRRVREELLKQQANQKG 61 T 0.63 DUF3935 pdbhh F Eukaryota T 8e73 56 MB C1 A0A1S3TTD7_VIGRR NDUB6 MGGGGGDHGHGNGDFRTKVWSMTGGPYCRPKHWKRNTAIAMFGVVLVCIPIAMKSAELEQRPHHPVRPIPSQLWCKNFGTKDYEQSE 87 T 1.9 Chordopox_A13L pdbhh F Eukaryota T 8e7s 21 QA,U v,V COX26_YEAST Cytochrome c oxidase subunit 26, mitochondrial MFFSQVLRSSARAAPIKRYTGGRIGESWVITEGRRLIPEIFQWSAVLSVCLGWPGAVYFFSKARKA 66 T 0.077 Phage_holin_Dp1 unppercent F Eukaryota T 8e8i 3 C C PHE-VAL-LYS-LYS-LYS-TYR-CYS-LEU FVKKKYCL 8 T 4.1 MIER1_beta_C pdbhh F T 8e90 1 A,B A,B MEN1_HUMAN Isoform 2 of Menin MGLKTAQKTLFPLRSIDDVVRLFAAELGREEPDLVLLSLVLGFVEHFLAVNRVIPTNVPELTFQPSPAPDPPGGLTYFPVADLSIIAALYARFTAQIRGAVDLSLYPREGGVSSRELVKKVSDVIWNSLSRSYFKDRAHIQSLFSFITGTKLDSSGVAFAVVGACQALGLRDVHLALSEDHAWVVFGPNGEQTAEVTWHGKGNEDRRGQTVNAGVAERSWLYLKGSYMRCDRKMEVAFMVCAINPSIDLHTDSLELLQLQQKLLWLLYDLGHLERYPMALGNLADLEELEPTPGRPDPLTLYHKGIASAKTYYRDEHIYPYIYLAGYHCRNRNVREALQAWADTATVIQDYNYCREDEEIYKEFFEVANDVIPNLLKEAASLLEAGEERPGEQSQGTQSQGSALQDPECFAHLLRFYDGICKWEEGSPTPVLHVGWATFLVQSLGRFEGQVRQKVRITFQSEKMKGMKELLVATKINSSAIKLQLTAQS 489 T 4.5E-12 Menin pdb F Eukaryota T 8ea3 5 P O A0A8M0FGU0_9CYAN Cas12k MSQITIQARLISFESNRQQLWKLMADLNTPLINELLCQLGQHPDFEKWQQKGKLPSTVVSQLCQPLKTDPRFAGQPSRLYMSAIHIVDYIYKSWLAIQKRLQQQLDGKTRWLEMLNSDAELVELSGDTLEAIRVKAAEILAIAMPASESDSASPKGKKGKKEKKPSSSSPKRSLSKTLFDAYQETEDIKSRSAISYLLKNGCKLTDKEEDSEKFAKRRRQVEIQIQRLTEKLISRMPKGRDLTNAKWLETLLTATTTVAEDNAQAKRWQDILLTRSSSLPFPLVFETNEDMVWSKNQKGRLCVHFNGLSDLIFEVYCGNRQLHWFQRFLEDQQTKRKSKNQHSSGLFTLRNGHLVWLEGEGKGEPWNLHHLTLYCCVDNRLWTEEGTEIVRQEKADEITKFITNMKKKSDLSDTQQALIQRKQSTLTRINNSFERPSQPLYQGQSHILVGVSLGLEKPATVAVVDAIANKVLAYRSIKQLLGDNYELLNRQRRQQQYLSHERHKAQKNFSPNQFGASELGQHIDRLLAKAIVALARTYKAGSIVLPKLGDMREVVQSEIQAIAEQKFPGYIEGQQKYAKQYRVNVHRWSYGRLIQSIQSKAAQTGIVIEEGKQPIRGSPHDKAKELALSAYNLRLTRRS 639 T 0.0027 RuvC_1 pdbhh F Bacteria T 8ea4 2 N O A0A8M0FGU0_9CYAN Cas12k MSQITIQARLISFESNRQQLWKLMADLNTPLINELLCQLGQHPDFEKWQQKGKLPSTVVSQLCQPLKTDPRFAGQPSRLYMSAIHIVDYIYKSWLAIQKRLQQQLDGKTRWLEMLNSDAELVELSGDTLEAIRVKAAEILAIAMPASESDSASPKGKKGKKEKKPSSSSPKRSLSKTLFDAYQETEDIKSRSAISYLLKNGCKLTDKEEDSEKFAKRRRQVEIQIQRLTEKLISRMPKGRDLTNAKWLETLLTATTTVAEDNAQAKRWQDILLTRSSSLPFPLVFETNEDMVWSKNQKGRLCVHFNGLSDLIFEVYCGNRQLHWFQRFLEDQQTKRKSKNQHSSGLFTLRNGHLVWLEGEGKGEPWNLHHLTLYCCVDNRLWTEEGTEIVRQEKADEITKFITNMKKKSDLSDTQQALIQRKQSTLTRINNSFERPSQPLYQGQSHILVGVSLGLEKPATVAVVDAIANKVLAYRSIKQLLGDNYELLNRQRRQQQYLSHERHKAQKNFSPNQFGASELGQHIDRLLAKAIVALARTYKAGSIVLPKLGDMREVVQSEIQAIAEQKFPGYIEGQQKYAKQYRVNVHRWSYGRLIQSIQSKAAQTGIVIEEGKQPIRGSPHDKAKELALSAYNLRLTRRS 639 T 0.0027 RuvC_1 pdbhh F Bacteria T 8eas 1 A A VMA22_YEAST Vacuolar ATPase assembly protein VMA22 MSETRMAQNMDTTDEQYLRLIELLSNYDSTLEQLQKGFQDGYIQLSRSNYYNKDSLRGNYGEDYWDETYIGQLMATVEEKNSKVVVEIVKRKAQDKQEKKEEEDNKLTQRKKGTKPEKQKTQSHKLKQDYDPILMFGGVLSVPSSLRQSQTSFKGCIPLIAQLINYKNEILTLVETLSEQE 181 T 0.00073 ATP-synt_D unppercent F Eukaryota T 8eat 1 A A VMA22_YEAST Vacuolar ATPase assembly protein VMA22 MSETRMAQNMDTTDEQYLRLIELLSNYDSTLEQLQKGFQDGYIQLSRSNYYNKDSLRGNYGEDYWDETYIGQLMATVEEKNSKVVVEIVKRKAQDKQEKKEEEDNKLTQRKKGTKPEKQKTQSHKLKQDYDPILMFGGVLSVPSSLRQSQTSFKGCIPLIAQLINYKNEILTLVETLSEQE 181 T 0.00073 ATP-synt_D unppercent F Eukaryota T 8eav 1 A A YAR027W or YAR028W XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 138 F F F 8eav 2 B Q YAR027W or YAR028W XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 178 F F F 8eav 3 C K YAR027W or YAR028W XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 172 F F F 8eav 4 D L YAR027W or YAR028W XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 168 F F F 8eav 5 E k subunit from the c ring of yeast VO complex XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 145 F F F 8eav 6 F,G,H,I,J,K,L,M M,B,C,D,E,F,G,H subunit from the c ring of yeast VO complex XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 159 F F F 8eav 7 N I YAR027W or YAR028W XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 187 F F F 8eav 8 O J YAR027W or YAR028W XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 162 F F F 8eav 9 P N YAR027W or YAR028W XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 177 F F F 8eav 10 Q O YAR027W or YAR028W XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 156 F F F 8eav 11 R P YAR027W or YAR028W XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 135 F F F 8eb0 1 A A RN216_HUMAN RING FINGER PROTEIN 216,RING-TYPE E3 UBIQUITIN TRANSFERASE RNF216,TRIAD DOMAIN-CONTAINING PROTEIN 3,UBIQUITIN-CONJUGATING ENZYME 7-INTERACTING PROTEIN 1,ZINC FINGER PROTEIN INHIBITING NF-KAPPA-B GPGQLIECRCCYGEFPFEELTQCADAHLFCKECLIRYAQEAVFGSGKLELSCMEGSCTCSFPTSELEKVLPQTILYKYYERKAEEEVAAAYADELVRCPSCSFPALLDSDVKRFSCPNPHCRKETCRKCQGLWKEHNGLTCEELAEKDDIKYRTSIEEKMTAARIRKCHKCGTGLIKSEGANRMSCRCGAQMCYLCRVSINGYDHFCQHPRSPGAPCQECSRCSLWTDPTEDDEKLIEEIQKEAEEEQKRKNGENTFKRIGPPLEKPVEKVQRVEAL 277 T 0.00058 zf-RING_2 unppercent F Eukaryota T 8eb9 1 A A C9WMH0_FUSOX Secreted in xylem Six8 GSDTSGILLASITGAGSAFQAYAGCYLTAFRNDPRTLTLRMDKTRGERISNVLVILSGGALSHAVEEVVQIAPGAVRNLATLGASTVQFLHNFRS 95 T 4.3 LppA unphh F Eukaryota T 8ebb 1 A,B A,B C9WMG8_FUSOX Secreted in xylem Six6 SDTLPVSTCPAGQKYDRSVCYKADKIRSFCVANPRSNREKITDTPCQPREICVQRNLSNGKSFAKCIPIVDLVEWKTSANGNKEGCTTTSVNPAGYHHLGTIVYDINKNPIEVDKISYFGEPGNVNEGIGGSTSYFSSDNFQFSKSRYMKTCIFSGGYGNLNAYTWSWES 170 T 0.16 Agglutinin unppssm F Eukaryota T 8ebb 2 C C C9WMG8_FUSOX Secreted in xylem Six6 GPMGPLAQTESESADVAEHTINYIDIAPEEFEPPKANLSSLVENLYFQ 48 T 10 Mfp-3 unphh F Eukaryota T 8ebl 2 C,D D,C GLU-ASP-SER-HIS-LYS-GLU-SER-ASN-ASP-CYS-SER-CYS-GLY-GLY EDSHKESNDCSCGG 14 T 0.68 zf-CSL pdbhh F T 8ebm 2 C,D C,D ASN-GLN-ARG-PHE-GLY-SER-ASN-ASN-THR-SER-GLY-SER NQRFGSNNTSGS 12 T 14 Ribosomal_S24e pdbhh F T 8ec0 20 T V COX26_YEAST Cytochrome c oxidase subunit 26, mitochondrial MFFSQVLRSSARAAPIKRYTGGRIGESWVITEGRRLIPEIFQWSAVLSVCLGWPGAVYFFSKARKA 66 T 0.077 Phage_holin_Dp1 unppercent F Eukaryota T 8ec3 1 A A G9BXS5_BORHE Fibronectin-binding protein GSTGSEYYDQLKKAEKDIDSAFKILEKLKKDRDQVELQGTMRMSGHSTSEDRATAQAKLNQFSKAKLVQELKDLLEKIDKNAKLTIDNAVEDFSKFSSETPQSNYVTEADKSLYLAKDKLYDLIKAVESSANTYDAYAKRTGIGHGSKFSEVENHLKDAKSLIKKALK 168 T 0.013 Zw10 unppercent F Bacteria T 8ec5 3 C C peptide RARARARARARAFVKKKYCL RARARARARARAFVKKKYCL 20 T 0.23 TCP pdbhh F T 8ec9 1 A,B,C,D,E,F A,B,C,D,E,F ORN-LYS-LEU-VAL-PHI-PHE-ALA-GLU-ORN-CYS-ILE-ILE-SAR-CYS-MET-VAL XKLVXFAEXCIIXCMV 16 T 1.5 Beta-APP pdbhh F T 8eca 1 A,B,C,D,E,F A,B,C,D,E,F ORN-LYS-LEU-VAL-PHE-PHE-ALA-GLU-ORN-CYS-ILE-ILE-SAR-CYS-MET XKLVFFAEXCIIXCMV 16 T 1.5 Beta-APP pdbhh F T 8eci 1 A,B 1,2 A0A3G2KE53_9CAUD GP7 MATGRTTEGRTVTVTTASNTTITGAAGTFVASDVGRTITRAGIPAGTTITAVASGTSATISAAATTSATSAATLGSLNGQSQGLVGWSPETDTEAGAYSVAATNAGTVTPDRLTNAFTPVSQRGRG 126 T 14 DUF5979 unppssm T Viruses T 8ed7 2 B,D,F,H E,F,G,H Unidentified segment at the N-terminus of TRPM3 XXXXXXXXXXXXXXXXX 17 F F F 8ed8 2 B,D,F,H E,F,G,H Unidentified segment at the N-terminus of TRPM3 XXXXXXXXXXXXXXXXX 17 F F F 8ed9 2 B,D,F,H E,F,G,H Unidentified segment at the N-terminus of TRPM3 XXXXXXXXXXXXXXXXX 17 F F F 8edv 1 A,B,C,D A,B,C,D Q21096_CAEEL MITOGUARDIN 2 (MIGA2) SSMNPIEADITSGQELCTELRKTIEKVHHNLEMVKNRSSKDMERSIKIEGILKGLKQVEDEIVLLVPQMEDFRDDNMEFYSVSGGSGYAGSVRTGRSRTLSVLSDDSFRSAVEEFACDIDDIDFVSDAANLDKNELRFLDEGMQAALNGEVKYRKSRMEFCKCDSETDFAAKLYCVRQALTNALKDEHKRVWLAKCGRTLLADFIRHTKQDPVKFFNAYDEMLEYVSNDRNEEQLRQDVEGRGVCETGFYDVAIDFIILDAFEDLKSPPSAVYSVTKNYFMSMSMKYSTLNTIIWSIIKSKRQRLKNPDGFIAKFYNISETVMPAITLGFLGTDERLGELCQYFKEQVVQFVLDVFNTQKVCYRSLEEMSEDVWIVMRNRLEAVQTRMSNEL 392 T 3.9E-06 Miga unppercent F Eukaryota T 8eex 2 B B A0A401FT52_9DELT Csx29 MSNPIRDIQDRLKTAKFDNKDDMMNLASSLYKYEKQLMDSSEATLCQQGLSNRPNSFSQLSQFRDSDIQSKAGGQTGKFWQNEYEACKNFQTHKERRETLEQIIRFLQNGAEEKDADDLLLKTLARAYFHRGLLYRPKGFSVPARKVEAMKKAIAYCEIILDKNEEESEALRIWLYAAMELRRCGEEYPENFAEKLFYLANDGFISELYDIRLFLEYTEREEDNNFLDMILQENQDRERLFELCLYKARACFHLNQLNDVRIYGESAIDNAPGAFADPFWDELVEFIRMLRNKKSELWKEIAIKAWDKCREKEMKVGNNIYLSWYWARQRELYDLAFMAQDGIEKKTRIADSLKSRTTLRIQELNELRKDAHRKQNRRLEDKLDRIIEQENEARDGAYLRRNPPCFTGGKREEIPFARLPQNWIAVHFYLNELESHEGGKGGHALIYDPQKAEKDQWQDKSFDYKELHRKFLEWQENYILNEEGSADFLVTLCREIEKAMPFLFKSEVIPEDRPVLWIPHGFLHRLPLHAAMKSGNNSNIEIFWERHASRYLPAWHLFDPAPYSREESSTLLKNFEEYDFQNLENGEIEVYAPSSPKKVKEAIRENPAILLLLCHGEADMTNPFRSCLKLKNKDMTIFDLLTVEDVRLSGSRILLGACESDMVPPLEFSVDEHLSVSGAFLSHKAGEIVAGLWTVDSEKVDECYSYLVEEKDFLRNLQEWQMAETENFRSENDSSLFYKIAPFRIIGFPAE 751 T 1.7E-07 CHAT pdbpercent F Bacteria T 8eey 1 A E A0A401FT41_9DELT Csx30 MNTTTYNTTTDALLEWGKVYFQKEDFSEFLDNLEAYISDAGDSLKDELESGVEKLVLGIKSAEAVIFGEAVIGTTPENEAWYDAEESFLTLDCAVWLSQALDRVVRRQDASLADSLIARLDEAINRVAEKLYADNLSPLRFSSLNEIRRSALEATDEKYHYLFPWHGAACDVDENILLILTEEYHLIGADKAGANLSEELRGDLPFIFAELERDEVLRAYVEKENALSLALENTMREHWAFGLLEAARDEGYNHPYPADVGMRIHQVARAVFSQTNLSPAERLAVAIAGACFTPEISEDRRLEILLDCEERVCEIEAPTGDDTSVRVIKDLKALADHRVRHEIPAESLVSLWFEQIEAAGTDFDTKTPMDELVLRMLSDNVITLSVDRKAASQTETDDVKPQKGKIIPFPVPDIANDEVEYQKAVGGGSANDSKVKFPGLLEIQGCRDGDKAILLEDTDDAAANHRKLFSILKAGKLNSAFFIQSDDGEWVESESKPTMEDNRIILHDSHHSSFVWILDTGSMQLRQSVKCVKDALNKKTGSAKKLKPKTMIVWVTIPQEG 561 T 12 TFIIA_gamma_N pdbhh F Bacteria T 8eey 3 C B A0A401FT52_9DELT Csx29 MSNPIRDIQDRLKTAKFDNKDDMMNLASSLYKYEKQLMDSSEATLCQQGLSNRPNSFSQLSQFRDSDIQSKAGGQTGKFWQNEYEACKNFQTHKERRETLEQIIRFLQNGAEEKDADDLLLKTLARAYFHRGLLYRPKGFSVPARKVEAMKKAIAYCEIILDKNEEESEALRIWLYAAMELRRCGEEYPENFAEKLFYLANDGFISELYDIRLFLEYTEREEDNNFLDMILQENQDRERLFELCLYKARACFHLNQLNDVRIYGESAIDNAPGAFADPFWDELVEFIRMLRNKKSELWKEIAIKAWDKCREKEMKVGNNIYLSWYWARQRELYDLAFMAQDGIEKKTRIADSLKSRTTLRIQELNELRKDAHRKQNRRLEDKLDRIIEQENEARDGAYLRRNPPCFTGGKREEIPFARLPQNWIAVHFYLNELESHEGGKGGHALIYDPQKAEKDQWQDKSFDYKELHRKFLEWQENYILNEEGSADFLVTLCREIEKAMPFLFKSEVIPEDRPVLWIPHGFLHRLPLHAAMKSGNNSNIEIFWERHASRYLPAWHLFDPAPYSREESSTLLKNFEEYDFQNLENGEIEVYAPSSPKKVKEAIRENPAILLLLCHGEADMTNPFRSCLKLKNKDMTIFDLLTVEDVRLSGSRILLGACESDMVPPLEFSVDEHLSVSGAFLSHKAGEIVAGLWTVDSEKVDECYSYLVEEKDFLRNLQEWQMAETENFRSENDSSLFYKIAPFRIIGFPAE 751 T 1.7E-07 CHAT pdbpercent F Bacteria T 8ef4 1 A A HIRV2_HIRME Bivalirudin XPRPGGGGNGDFEEIPEEYL 20 T 0.00018 Hirudin pdbhh F Eukaryota T 8efq 4 D P DAMGO YXGX 4 T 63 MurT_C pdbhh F F 8egr 3 I,J J,I A0A1S6L1H8_9CAUD gp16, tail stem protein LSKHTTTLYEIIESELQRLGLNEFVNNDRIHFNDSKHAFMQKMLYFDDDVKQIVDHMFFKGFMFNDERIDRYFKESFTLRFLYREIGRQTVESFASQVLYITMTHEDYIYRVYGSDMYKYIEQVTDTQSQDLGKAIENAIEQGQTKDRQQDKGHEEYKDYEDTITKSFDDNRTAESTLPQSKVNIDVDNTVLDYADTNTISRDKNTSETVSEKTGTKDNTFDSLRNGESDTKRNTQSQNEMNRTGLTKQYLIDNLQKLYSMRDTIFKTYDKECFLHIW 278 T 0.076 MreB_Mbl unp T Viruses T 8egr 5 M,N,O,P,Q,R,S,T,U,V,W,X M,N,O,P,Q,R,S,T,U,V,W,X A0A1S6L1H7_9CAUD gp20, portal-proximal core protein MAEEEKIIKEEPTNEETEQPEKIESAEDVVTEPEKEVTEEKSEAFVQLEQRISSLEQRLNNLESQPQPTQESSDPNFEDKTVPTEVDDNQETDGIESSEEIKQMLNL 107 T 0.11 PspB pdbhh T Viruses T 8egs 1 A,B J,I A0A1S6L1H8_9CAUD Lower collar protein MQKMLYFDDDVKQIVDHMFFKGFMFNDERIDRYFKESFTLRFLYREIGRQTVESFASQVLYITMTHEDYIYRVYGSDMYKYIEQVTDTQSQDLGKAIENAIEQGQTKDRQQDKGHEEYKDYEDTITKSFDDNRTAESTLPQSKVNIDVDNTVLDYADTNTISRDKNTSETVSEKTGTKDNTFDSLRNGESDTKRNTQSQNEMNRTGLTKQYLIDNLQKLYSMRDTIFKTYDKECFLHIW 239 T 0.076 MreB_Mbl pdb T Viruses T 8egt 1 A,B,G,H H,G,F,E A0A1S6L1I6_9CAUD gp19, capsid lining protein MANFDGNEMRGMTHANYEDSRLNKSRELNANMSIGTSKSEDEYGRQVHSLTKQSYSDDSVQEA 63 T 11 Dehydrin pdbhh T Viruses T 8egt 2 C,D,E,F A,B,C,D A0A1S6L1I0_9CAUD Major capsid protein MADKKTDIPTLIADSTKASLQDFNHDYGKQWTFGENWSNVNTMFETYVNKYLFPKINETLLIDIALGNRFNWLAKEQDFIGQYSEEYVIMDTIPIEMNLSKSEELMLKRNYPQMATRLYGSGIVKKQKFTLNNNDVRFNFQTLGDATNYALGVLRKKISDINVQEEKEIRAMMVDYAINQLQDSNRRTASSKEDLTERVFEAILNMQNNSAKYNEVHKASGGSVGQYTTVSKLSDIAILTTDSLKSYLLDTKIANTFQMAGIDFTDHIISFDDLGGVYKTTKDVTLANEDTINYLRAFGDYQAMIGDVIPTGSVFTFNVSDLKEFKGNIEEIKPQGELFAFIFDINALKYKRNTKGMLKEPFYNGEFDEVTHWIHYYSFKAMSPFFNKILITEAPKEQPDAGATE 405 T 0.085 Ribosomal_L22 pdb T Viruses T 8ehb 1 A,B,C,D,E,F A,B,C,D,E,F G8ULV2_TANFA Putative lipoprotein GPLGSPEFAEKESHASCSCECVEEKIPIVTLKNENAHFRYMKRRNDFALEIENKELVRGLYLIPRGCDIPKKYKEDGLPVIISGEVFDCSEYIKPWIKRDPVYFIKLSTIKKK 113 T 0.00065 DUF4971 unphh F Bacteria T 8ehc 1 A,B A,B A0A1D3UL35_TANFO Potempin E (PotE) ANPEQAILGKWELINSGGRPIIPTGYREFLPSGIVHKYDYTKEQYTSFQCEYSILNDTVLLMCNYRYKYLFYRDKMQLFPLDLIAIRDLTEIYQRKK 97 T 0.13 DUF5640 pdbhh F Bacteria T 8ehd 1 A A G8UII1_TANFA Potempin E (PotE) MKQQIILWIGVLLLLIGGVGCENGQLHSPPANPEQAILGKWELINSGGRPIIPTGYREFLPSGIVHKYDYTKEQYTSFQCEYSILNDTVLLMCNYRYKYLFYRDKMQLFPLDLIAIRDLTEIYQRKK 127 T 0.00069 DUF4971 pdbhh F Bacteria T 8ehe 2 B B A0A1D3UUC0_TANFO Potempin C (PotC) MKQKIILWISTLLLLTAGAGCKKETLPPNQAKGKVLGPTGPCQGYALYIEVENPKGIGLEGKGIPAGSGRTWNYRNAISVPLFNRIGLPVELMEEGTWLHFEYREMTEEEKNRKLFQPDEPVICLMNQIPPPANTYMITKIIAHKPLKINPS 152 T 0.0004 DUF4969 pdbhh F Bacteria T 8eio 2 B B Cystic fibrosis transmembrane conductance regulator XXXXXXXXXXXXXXXXX 17 F F F 8eiq 2 B B Cystic fibrosis transmembrane conductance regulator XXXXXXXXXXXXXXXX 16 F F F 8eit 1 A A A modified Guanine nucleotide-binding protein G(q) subunit alpha MGSTLSAEDKAAVERSKMIDRNLREDGEKARRTLRLLLLGADNSGKSTIVKQMRILHTSGIFETKFQVDKVNFHMFDVGGQRDERRKWIQCFNDVTAIIFVVDSSDYNRLQEALNDFKSIWNNRWLRTISVILFLNKQDLLAEKVLAGKSKIEDYFPEFARYTTPEDATPEPGEDPRVTRAKYFIRKEFVDISTASGDGRHICYPHFTCAVDTENARRIFNDCKDIILQMNLREYNLV 238 T 2.8E-10 G-alpha pdb F T 8ej5 2 B,C,D B,C,D A0A1S6L1I2_9CAUD gp1, tail tip protein VTNEKGQAYTEMLQLFNLLQQWNDFYTAENANNLLVACQQLLINYNEPVIKFINDENEDKSLLQYLAGDDGLAQWQFYKGFYNNYNVHIF 90 T 0.012 GSG-1 unppssm T Viruses T 8ejc 1 A A A modified Guanine nucleotide-binding protein G(q) subunit alpha MGSTLSAEDKAAVERSKMIDRNLREDGEKARRTLRLLLLGADNSGKSTIVKQMRILHTSGIFETKFQVDKVNFHMFDVGGQRDERRKWIQCFNDVTAIIFVVDSSDYNRLQEALNDFKSIWNNRWLRTISVILFLNKQDLLAEKVLAGKSKIEDYFPEFARYTTPEDATPEPGEDPRVTRAKYFIRKEFVDISTASGDGRHICYPHFTCAVDTENARRIFNDCKDIILQMNLREYNLV 238 T 2.8E-10 G-alpha pdb F T 8ejk 1 A A A modified Guanine nucleotide-binding protein G(q) subunit alpha MGSTLSAEDKAAVERSKMIDRNLREDGEKARRTLRLLLLGADNSGKSTIVKQMRILHTSGIFETKFQVDKVNFHMFDVGGQRDERRKWIQCFNDVTAIIFVVDSSDYNRLQEALNDFKSIWNNRWLRTISVILFLNKQDLLAEKVLAGKSKIEDYFPEFARYTTPEDATPEPGEDPRVTRAKYFIRKEFVDISTASGDGRHICYPHFTCAVDTENARRIFNDCKDIILQMNLREYNLV 238 T 2.8E-10 G-alpha pdb F T 8ejl 2 D,E Y,Z CPSF6_HUMAN CLEAVAGE AND POLYADENYLATION SPECIFICITY FACTOR 68 KDA SUBUNIT,CPSF 68 KDA SUBUNIT,PRE-MRNA CLEAVAGE FACTOR IM 68 KDA SUBUNIT,PROTEIN HPBRII-4/7 GTPVLFPGQPFGQPPLG 17 T 2.2 MlaD pdbhh F Eukaryota T 8ekf 3 C CCC PfCSP peptide NPNA-3 NANPNANPNANP 12 T 0.48 Cas_Cas7 pdbhh F F 8elg 3 C C NQK-OC43 peptide NQKLIANAF 9 T 0.78 CoV_S2 pdbhh F T 8em1 1 A,B A,B A0A2N8KYF9_9BURK PaqCI, DNA Unbound MPYDHNAEADFAASEVARMLVADPGLCYDAASLPASISASASYEPSAAGWPKADGLVSVLEGGTSTQRAIALEYKRPQEGIHGLLTAIGQAHGYLHKGYSGAAIVIPGRYSSHPTPAEYVRDVLNAISGSRAIAVFSYSPPDTTSPTPFAGRIQCVRPLVFDAGRVHLRPANQGPKTQWVHMREGSTTRDAFFRFLQVAKRLSADPTAPRPTLRSELVAAIGRLAPGRDPIEYITNTADNKFLTKVWQFFWLEWLATPAVLTPWKLEAGVYSAPGARTRILREDGTDFSQLWEGRVNSLKETIAGMLNRGEISEAQGWEAFVGGISATGGGQDKQGVRARAHSYREDIDSALAQLRWIEDDGLPTDQGYRFMTICERYGGANSRAAIDYMGATLIQTGRYASFLHYINRLSERKFAENPLAYTKPGPGGMPVFTEESYWEYLQDLETKLTDELRVMRKVSGRARPRVRTTFQVELTLLRNYGFVSSTRHRLGVGIPIDWEQVVQALNVDL 510 T 0.25 DUF5343 unppercent F Bacteria T 8em8 2 B U unidentified peptide fragment XXXX 4 F F F 8emb 1 A,B,C,D,E,F A,B,C,D,E,F RPOC2_THEVB RNAP SUBUNIT BETA',RNA POLYMERASE SUBUNIT BETA',TRANSCRIPTASE SUBUNIT BETA' GSHMATEKVTKDVASDLAGQVKFVNLDAEEKRDRQGTTTRIAPKGGLIWVLSGEVYNLPPGAEPVVKNGDRIEAGAVMAETTVKTEHGGVVRLPEQQDSKGGREVEIITASVMLDKAKVLKETQQGREHYIIETATGQRFSLKAAPGTKVANGQVVAELIDDRYHTTTGGILKYADIEVAKKGKAKQGYEVLKGGTLLWIPEETHEVNKDISLLMVEDNQYVEAGTEVVKDIFCQNSGVVEVIQKNDILREIIIKPGELHLVDDPEAARLKHGTLARPGEEVLPGLVVDTLSQVDYLEDTPEGPAILMRPVQEFSVPDEPSVPSQDSSDGSGQSIRLRAVQRLPYKHDERVKSVDGVDLLRTQLVLEIGSEAPQLAADIEIVTDEVDPEAQRLQLVILESLIIRRDIAADQTQGSTFTSLLVKDGDHIGPGAVIARTDIKAKQAGEVQGIVRSGESVRRILVVTDSDRLRVETNGAKPTVKVGDLVRPGDEMAKGVTAPETAAVMAVADDHVILRLARPYLVSPGAVLQIEEGDLVQRGDNLALLVFERAKTG 553 T 0.017 RNA_pol_Rpb1_5 unppercent F Bacteria T 8eno 3 E E C1DH13_AZOVD nitrogenase-associated factor T MSWRILLCHKHPVSARLRFLIPTGGGVVLPQTLPRLAVIAEDQEAPVQCHPASALRALQETMALGWQLELIGEFRLNMEVPGQIMPIYLAALAGHELPPPPEGTRWIELTQSIGMPWLDRELLRRVYEELIG 132 T 0.034 RcnB pdb F Bacteria T 8env 2 AA,FA,G,L,Q,V a,f,G,L,Q,V A0A6G9LFR0_9CAUD Structural protein gp33 KIPLTAVPNQAISFNAGSSYWKIRLYQNMDMMNADISRDGVIVCHGVRCFGGIPLLQYSHQYRPDYGNFVFDRDADWTLFGDGINLFYLDGAEFAEYQALAT 102 T 2 FAIM1 pdbhh T Viruses T 8env 4 CA,HA,I,N,S,X c,h,I,N,S,X A0A5C1KAX6_9CAUD Ripcord gp36 MINVSGFGTGIVIVSASSFPMGFSLSKFADDESPISSKELEPFGYEMLYDGGLFAFDKAAPLEVSVSVIAGSEDDINLRILLNSKKGSFRFLPGIIPDMTTLVATLPDGGRTVLSNGTILKGPAIDTIQNTGRRKGNTYTFVFGSYLGAQTA 152 T 0.72 DUF3277 pdbhh T Viruses T 8eoi 8 H I EMC10_HUMAN HEMATOPOIETIC SIGNAL PEPTIDE-CONTAINING MEMBRANE DOMAIN-CONTAINING PROTEIN 1 GTVGLLLEHSFEIDDSANFRKRGSLLWNQQDGTLSLSQRQLSEEERGRLRDVAALNGLYRVRIPRRPGALDGLEAGGYVSSFVPACSLVESHLSDQLTLHVDVAGNVVGVSVVTHPGGCRGHEVEDVDLELFNTSVQLQPPTTAPGPETAAFIERLE 157 T 0.0033 2OG-FeII_Oxy_5 unppssm F Eukaryota T 8eon 1 A,F,K,P,U,Z G,L,Q,V,a,f A0A6G9LFR0_9CAUD Baseplate component gp33 KIPLTAVPNQAISFNAGSSYWKIRLYQNMDMMNADISRDGVIVCHGVRCFGGIPLLQYSHQYRPDYGNFVFDRDADWTLFGDGINLFYLDGAEFAEYQALAT 102 T 2 FAIM1 pdbhh T Viruses T 8eon 3 BA,C,H,M,R,W h,I,N,S,X,c A0A5C1KAX6_9CAUD Baseplate component gp36 MINVSGFGTGIVIVSASSFPMGFSLSKFADDESPISSKELEPFGYEMLYDGGLFAFDKAAPLEVSVSVIAGSEDDINLRILLNSKKGSFRFLPGIIPDMTTLVATLPDGGRTVLSNGTILKGPAIDTIQNTGRRKGNTYTFVFGSYLGAQTA 152 T 0.72 DUF3277 pdbhh T Viruses T 8eon 6 EA,FA,GA A,B,C A0A2K8HNS1_9CAUD Baseplate component gp37 MLGIFTSLLSSRSFSIVDQNTNQLVAADLRISRVNTRFSSVGQRHMLEDGTTKMDSRTIHPMEIIVEVFCPSIDVVDQINQLLLDRDTLYKVITRGMVFERMMCTSEALNQTPDMISATPARLTFSQVLVQNPKPIMFRNAGDSSMIDRGLALAEDVVGSAGDLFDYAVNGV 172 T 0.023 GGGtGRT pdbpercent T Viruses T 8eon 7 HA,KA,OA D,k,o A0A2K8I4C0_9CAUD Baseplate component gp38 MNSFLKSILNTPTLTIRDDVTKLPVWKSLQVKKVEIYSPASVVSKPLATKDQTEAQVYTEALDIDVKNGKIIQPVRLRINAICPDLSTVESIMNAFNDNTSTFAITSKSILADKMAIMTLDVDQSPDMLNAAEINMEFEQVEPPVLNKFDPAFPQDSPTYGVQIQSLSDANLLDLGAIGDSISSAAKSLYNRV 193 T 47 Apo-CII pdbhh T Viruses T 8eon 8 IA,LA,MA E,l,m A0A2K8IA76_9CAUD Baseplate hub gp41 MKKRILRVTFNMPYGPEVIREDLDVRVRIMKAALRIQNRATMEIFGLTTQLRESLLSQFTAWKHRQRQVGREDELMIKVSVEAGYSDQGREQVSRVFVGEVAIVDIISPPPDIGIRIQCYTRQIDRTKTIRNMPPANTTFVKFVEWGANEMGLNFICDTSYNDQVLKNPGRSITVASAILASIQDMYMPDVAAFVDDDILIVKDRDKVIRPDEVTNVNSFVGIPSWSEWGVEFQCLFEPSIRVAGGVAVESLMNPSVNGNYVITALEYDLASRDRPFYIKVMGSPAA 287 T 0.0001 Phage_GPD pdbhh T Viruses T 8epx 1 A,B,C,D A,B,C,D A0A2N8KYF9_9BURK Type IIS Restriction Endonuclease PaqCI MPYDHNAEADFAASEVARMLVADPGLCYDAASLPASISASASYEPSAAGWPKADGLVSVLEGGTSTQRAIALEYKRPQEGIHGLLTAIGQAHGYLHKGYSGAAIVIPGRYSSHPTPAEYVRDVLNAISGSRAIAVFSYSPPDTTSPTPFAGRIQCVRPLVFDAGRVHLRPANQGPKTQWVHMREGSTTRDAFFRFLQVAKRLSADPTAPRPTLRSELVAAIGRLAPGRDPIEYITNTADNKFLTKVWQFFWLEWLATPAVLTPWKLEAGVYSAPGARTRILREDGTDFSQLWEGRVNSLKETIAGMLNRGEISEAQGWEAFVGGISATGGGQDKQGVRARAHSYREDIDSALAQLRWIEDDGLPTDQGYRFMTICERYGGANSRAAIDYMGATLIQTGRYASFLHYINRLSERKFAENPLAYTKPGPGGMPVFTEESYWEYLQDLETKLTDELRVMRKVSGRARPRVRTTFQVELTLLRNYGFVSSTRHRLGVGIPIDWEQVVQALNVDL 510 T 0.25 DUF5343 unppercent F Bacteria T 8eq5 2 B B SPRE2_HUMAN SPRED-2 STIHNEAELGDDDVFTTATDSSSNSSQKRE 30 T 150 Senescence_reg pdbhh F Eukaryota T 8eqi 2 C,D F,G Cyclopeptide des4.2.0 XYXXSXRV 8 T 18 MAP1B_neuraxin pdbhh F F 8equ 3 D,E,G,H D,E,G,H Saposin A, polyalanine model XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 79 F F F 8er8 1 A A Acheta domesticus segmented densovirus major capsid protein TKEGYGKHITSMHVRNIFNQGNQVIRNIVKQQRYELLDFTGTEAGTTNLPKIIPYQCIWWRGLQNAANVNQTINNMIALNTISYGVRFLKAKLCIEVYAVTRKRLIQTGATSYYTDDFEQGQNLFIGWADRKAESIPITTPADLDETKLTVANTTLFDANNDNITKEEVPTREKWCHTWDLDVLNHNYLWEPNNLDSQWTLIPGAQAVQPTATPIGPTYQEIVIATKAIGANESALVTTIQDRRSYPRLMLSQPQIKDETDTMKFKYQIRISTELEMEHHIKPDIANPWLTRQTLPLPALSGDGTTRYVPCVPYETHVSQRNWNHVGEYL 330 T 51 YppF pdbhh F T 8erk 1 A A Acheta domesticus segmented densovirus major capsid protein EGYGKHITSMHVRNIFNQGNQVIRNIVKQQRYELLDFTGTEAGTTNLPKIIPYQCIWWRGLQNAANVNQTINNMIALNTISYGVRFLKAKLCIEVYAVTRKRLIQTGATSYYTDDFEQGQNLFIGWADRKAESIPITTPADLDETKLTVANTTLFDANNDNITKEEVPTREKWCHTWDLDVLNHNYLWEPNNLDSQWTLIPGAQAVQPTATPIGPTYQEIVIATKAIGANESALVTTIQDRRSYPRLMLSQPQIKDETDTMKFKYQIRISTELEMEHHIKPDIANPWLTRQTLPLPALSGDGTTRYVPCVPYETHVSQ 318 T 38 Pox_Rif unphh F T 8ese 1 A X B3KT69_HUMAN VPS35 endosomal protein-sorting factor-like EFASCRLEAVPLEFGDYHPLKPI 23 T 0.65 CytochromB561_N unppercent F Eukaryota T 8esw 33 GA V3 NADH dehydrogenase [ubiquinone] flavoprotein 3 XXXXXXXXXXXXXXXXXXXXXXXXXXX 27 F F F 8esw 40 NA A3 Q9W380_DROME UNCHARACTERIZED PROTEIN,ISOFORM A,ISOFORM B MSASAARGSTSLLKRAWNEIPDIVGGSALALAGIVMATIGVANYYAKDGDNRRYKLGYVVYRHDDPRALKVRNDEDD 77 T 0.08 NADHdh_A3 pdbhh F Eukaryota T 8esz 32 FA V3 NADH dehydrogenase [ubiquinone] flavoprotein 3 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 37 F F F 8esz 36 JA A3 Q9W380_DROME UNCHARACTERIZED PROTEIN,ISOFORM A,ISOFORM B MSASAARGSTSLLKRAWNEIPDIVGGSALALAGIVMATIGVANYYAKDGDNRRYKLGYVVYRHDDPRALKVRNDEDD 77 T 0.08 NADHdh_A3 pdbhh F Eukaryota T 8eth 5 E 5 NSA1_SCHPO Ribosome biogenesis protein nsa1 MKLLLGDEIGQLKFIEIKKGTDTSNPESEAPVIQKFGELDREKGVLFMLKHEMNVFVARKNGTIECWNVNQEPPILSSLWQLDSSLLETASIVSMKYSNGWLMLALSDGNLLFRHIESSKLRKLQLHGPLSAVELHPRIPGIIAAGGKENDVCLYSCNPTCKSNIDELELWRTENVVKVFQGKNVKNDSLNLRVRVWITGIVFTEDIINVIDGKSEDDESLCFHFATITHYGQLRFYDTKHGRRPVSTFDVSTSPLSHVGLLPSIKLLYFADKRAQISIFDHSKKKVIGRFQGVKGAPSSIHCLGNVVAITGLDRNVRIFDADRKPLANAYIKALPTSIIVINERDAEIIKKEEELEAAKEEEEEIWRNMEQLEDTEDKKPSKRIKL 387 T 0.13 SopA pdbpssm F Eukaryota T 8eti 5 E 5 NSA1_SCHPO Ribosome biogenesis protein nsa1 MKLLLGDEIGQLKFIEIKKGTDTSNPESEAPVIQKFGELDREKGVLFMLKHEMNVFVARKNGTIECWNVNQEPPILSSLWQLDSSLLETASIVSMKYSNGWLMLALSDGNLLFRHIESSKLRKLQLHGPLSAVELHPRIPGIIAAGGKENDVCLYSCNPTCKSNIDELELWRTENVVKVFQGKNVKNDSLNLRVRVWITGIVFTEDIINVIDGKSEDDESLCFHFATITHYGQLRFYDTKHGRRPVSTFDVSTSPLSHVGLLPSIKLLYFADKRAQISIFDHSKKKVIGRFQGVKGAPSSIHCLGNVVAITGLDRNVRIFDADRKPLANAYIKALPTSIIVINERDAEIIKKEEELEAAKEEEEEIWRNMEQLEDTEDKKPSKRIKL 387 T 0.13 SopA pdbpssm F Eukaryota T 8ets 2 B R ARP5_YEAST ACTIN-LIKE PROTEIN ARP5 KAVVIDDPPLRQTPEPFDEQSAYNPQSPIAIDFGSSKLRAGFVNHATPTHIFPNALTKFRDRKLNKNFTFVGNDTLLDQAVRSQSRSPFDGPFVTNWNLTEEILDYTFHHLGVVPDNGIPNPILLTERLATVQSQRTNWYQILFETYNVPGVTFGIDSLFSFYNYNPSGNKTGLVISCGHEDTNVIPVVDGAGILTDAKRINWGGHQAVDYLNDLMALKYPYFPTKMSYLQYETMYKDYCYVSRNYDEDIEKILTLENLDTNDVVVEAPFTEVLQPQKTEEELRIQAEKRKETGKRLQEQARLKRMEKLVQKQEEFEYFSKVRDQLIDEPKKKVLSVLQNAGFDDERDFKKYLHSLEQSLKKAQMVEAEDDSHLDEMNEDKTAQKFDLLDIADEDLNEDQIKEKRKQRFLKASQDARQKAKEEKERVAKEEEEKKLKEQQWRETDLNGWIKDKRLKLNKLIKRRKEKLKLRDEMKDRKSQVSQNRMKNLASLAEDNVKQGAKRNRHQATIDNDPNDTFGANDEDWLIYTDITQNPEAFEEALEYEYKDIVELERLLLEHDPNFTEEDTLEAQYDWRNSILHLFLRGPRPHDSENIHEQHQMHLNVERIRVPEVIFQPTMGGQDQAGICELSETILLKKFGSQPGKLSQTSIDMVNNVLITGGNAKVPGLKERIVKEFTGFLPTGTNITVNMSSDPSLDAWKGMAALARNEEQYRKTVISKKEYEEYGPEYIKEHKLGNTKYFED 744 T 9.499999999999999E-43 Actin pdbpercent F Eukaryota T 8etu 1 A R ARP5_YEAST ACTIN-LIKE PROTEIN ARP5 KAVVIDDPPLRQTPEPFDEQSAYNPQSPIAIDFGSSKLRAGFVNHATPTHIFPNALTKFRDRKLNKNFTFVGNDTLLDQAVRSQSRSPFDGPFVTNWNLTEEILDYTFHHLGVVPDNGIPNPILLTERLATVQSQRTNWYQILFETYNVPGVTFGIDSLFSFYNYNPSGNKTGLVISCGHEDTNVIPVVDGAGILTDAKRINWGGHQAVDYLNDLMALKYPYFPTKMSYLQYETMYKDYCYVSRNYDEDIEKILTLENLDTNDVVVEAPFTEVLQPQKTEEELRIQAEKRKETGKRLQEQARLKRMEKLVQKQEEFEYFSKVRDQLIDEPKKKVLSVLQNAGFDDERDFKKYLHSLEQSLKKAQMVEAEDDSHLDEMNEDKTAQKFDLLDIADEDLNEDQIKEKRKQRFLKASQDARQKAKEEKERVAKEEEEKKLKEQQWRETDLNGWIKDKRLKLNKLIKRRKEKLKLRDEMKDRKSQVSQNRMKNLASLAEDNVKQGAKRNRHQATIDNDPNDTFGANDEDWLIYTDITQNPEAFEEALEYEYKDIVELERLLLEHDPNFTEEDTLEAQYDWRNSILHLFLRGPRPHDSENIHEQHQMHLNVERIRVPEVIFQPTMGGQDQAGICELSETILLKKFGSQPGKLSQTSIDMVNNVLITGGNAKVPGLKERIVKEFTGFLPTGTNITVNMSSDPSLDAWKGMAALARNEEQYRKTVISKKEYEEYGPEYIKEHKLGNTKYFED 744 T 9.499999999999999E-43 Actin pdbpercent F Eukaryota T 8etw 2 B R ARP5_YEAST ACTIN-LIKE PROTEIN ARP5 KAVVIDDPPLRQTPEPFDEQSAYNPQSPIAIDFGSSKLRAGFVNHATPTHIFPNALTKFRDRKLNKNFTFVGNDTLLDQAVRSQSRSPFDGPFVTNWNLTEEILDYTFHHLGVVPDNGIPNPILLTERLATVQSQRTNWYQILFETYNVPGVTFGIDSLFSFYNYNPSGNKTGLVISCGHEDTNVIPVVDGAGILTDAKRINWGGHQAVDYLNDLMALKYPYFPTKMSYLQYETMYKDYCYVSRNYDEDIEKILTLENLDTNDVVVEAPFTEVLQPQKTEEELRIQAEKRKETGKRLQEQARLKRMEKLVQKQEEFEYFSKVRDQLIDEPKKKVLSVLQNAGFDDERDFKKYLHSLEQSLKKAQMVEAEDDSHLDEMNEDKTAQKFDLLDIADEDLNEDQIKEKRKQRFLKASQDARQKAKEEKERVAKEEEEKKLKEQQWRETDLNGWIKDKRLKLNKLIKRRKEKLKLRDEMKDRKSQVSQNRMKNLASLAEDNVKQGAKRNRHQATIDNDPNDTFGANDEDWLIYTDITQNPEAFEEALEYEYKDIVELERLLLEHDPNFTEEDTLEAQYDWRNSILHLFLRGPRPHDSENIHEQHQMHLNVERIRVPEVIFQPTMGGQDQAGICELSETILLKKFGSQPGKLSQTSIDMVNNVLITGGNAKVPGLKERIVKEFTGFLPTGTNITVNMSSDPSLDAWKGMAALARNEEQYRKTVISKKEYEEYGPEYIKEHKLGNTKYFED 744 T 9.499999999999999E-43 Actin pdbpercent F Eukaryota T 8eu5 1 A A A0A891H5C3_9VIRU Capsid protein EGYGKHITSMHVRNIFNQGNQVIRNIVKQQRYELLDFTGTEAGTTNLPKIIPYQCIWWRGLQNAANVNQTINNMIALNTISYGVRFLKAKLCIEVYAVTRKRLIQTGATSYYTDDFEQGQNLFIGWADRKAESIPITTPADLDETKLTVANTTLFDANNDNITKEEVPTREKWCHTWDLDVLNHNYLWEPNNLDSQWTLIPGAQAVQPTATPIGPTYQEIVIATKAIGANESALVTTIQDRRSYPRLMLSQPQIKDETDTMKFKYQIRISTELEMEHHIKPDIANPWLTRQTLPLPALSGDGTTRYVPCVPYETHVSQ 318 T 38 Pox_Rif unphh T Viruses T 8eu6 1 A A A0A891H5C3_9VIRU Capsid protein EGYGKHITSMHVRNIFNQGNQVIRNIVKQQRYELLDFTGTEAGTTNLPKIIPYQCIWWRGLQNAANVNQTINNMIALNTISYGVRFLKAKLCIEVYAVTRKRLIQTGATSYYTDDFEQGQNLFIGWADRKAESIPITTPADLDETKLTVANTTLFDANNDNITKEEVPTREKWCHTWDLDVLNHNYLWEPNNLDSQWTLIPGAQAVQPTATPIGPTYQEIVIATKAIGANESALVTTIQDRRSYPRLMLSQPQIKDETDTMKFKYQIRISTELEMEHHIKPDIANPWLTRQTLPLPALSGDGTTRYVPCVPYETHVSQ 318 T 38 Pox_Rif unphh T Viruses T 8eu7 1 A A A0A891H5C3_9VIRU Capsid protein EGYGKHITSMHVRNIFNQGNQVIRNIVKQQRYELLDFTGTEAGTTNLPKIIPYQCIWWRGLQNAANVNQTINNMIALNTISYGVRFLKAKLCIEVYAVTRKRLIQTGATSYYTDDFEQGQNLFIGWADRKAESIPITTPADLDETKLTVANTTLFDANNDNITKEEVPTREKWCHTWDLDVLNHNYLWEPNNLDSQWTLIPGAQAVQPTATPIGPTYQEIVIATKAIGANESALVTTIQDRRSYPRLMLSQPQIKDETDTMKFKYQIRISTELEMEHHIKPDIANPWLTRQTLPLPALSGDGTTRYVPCVPYETHVSQ 318 T 38 Pox_Rif unphh T Viruses T 8eu9 2 B R ARP5_YEAST ACTIN-LIKE PROTEIN ARP5 MSSRDASLTPLKAVVIDDPPLRQTPEPFDEQSAYNPQSPIAIDFGSSKLRAGFVNHATPTHIFPNALTKFRDRKLNKNFTFVGNDTLLDQAVRSQSRSPFDGPFVTNWNLTEEILDYTFHHLGVVPDNGIPNPILLTERLATVQSQRTNWYQILFETYNVPGVTFGIDSLFSFYNYNPSGNKTGLVISCGHEDTNVIPVVDGAGILTDAKRINWGGHQAVDYLNDLMALKYPYFPTKMSYLQYETMYKDYCYVSRNYDEDIEKILTLENLDTNDVVVEAPFTEVLQPQKTEEELRIQAEKRKETGKRLQEQARLKRMEKLVQKQEEFEYFSKVRDQLIDEPKKKVLSVLQNAGFDDERDFKKYLHSLEQSLKKAQMVEAEDDSHLDEMNEDKTAQKFDLLDIADEDLNEDQIKEKRKQRFLKASQDARQKAKEEKERVAKEEEEKKLKEQQWRETDLNGWIKDKRLKLNKLIKRRKEKLKLRDEMKDRKSQVSQNRMKNLASLAEDNVKQGAKRNRHQATIDNDPNDTFGANDEDWLIYTDITQNPEAFEEALEYEYKDIVELERLLLEHDPNFTEEDTLEAQYDWRNSILHLFLRGPRPHDSENIHEQHQMHLNVERIRVPEVIFQPTMGGQDQAGICELSETILLKKFGSQPGKLSQTSIDMVNNVLITGGNAKVPGLKERIVKEFTGFLPTGTNITVNMSSDPSLDAWKGMAALARNEEQYRKTVISKKEYEEYGPEYIKEHKLGNTKYFED 755 T 2.8E-42 Actin pdbpercent F Eukaryota T 8euf 2 B R ARP5_YEAST ACTIN-LIKE PROTEIN ARP5 MSSRDASLTPLKAVVIDDPPLRQTPEPFDEQSAYNPQSPIAIDFGSSKLRAGFVNHATPTHIFPNALTKFRDRKLNKNFTFVGNDTLLDQAVRSQSRSPFDGPFVTNWNLTEEILDYTFHHLGVVPDNGIPNPILLTERLATVQSQRTNWYQILFETYNVPGVTFGIDSLFSFYNYNPSGNKTGLVISCGHEDTNVIPVVDGAGILTDAKRINWGGHQAVDYLNDLMALKYPYFPTKMSYLQYETMYKDYCYVSRNYDEDIEKILTLENLDTNDVVVEAPFTEVLQPQKTEEELRIQAEKRKETGKRLQEQARLKRMEKLVQKQEEFEYFSKVRDQLIDEPKKKVLSVLQNAGFDDERDFKKYLHSLEQSLKKAQMVEAEDDSHLDEMNEDKTAQKFDLLDIADEDLNEDQIKEKRKQRFLKASQDARQKAKEEKERVAKEEEEKKLKEQQWRETDLNGWIKDKRLKLNKLIKRRKEKLKLRDEMKDRKSQVSQNRMKNLASLAEDNVKQGAKRNRHQATIDNDPNDTFGANDEDWLIYTDITQNPEAFEEALEYEYKDIVELERLLLEHDPNFTEEDTLEAQYDWRNSILHLFLRGPRPHDSENIHEQHQMHLNVERIRVPEVIFQPTMGGQDQAGICELSETILLKKFGSQPGKLSQTSIDMVNNVLITGGNAKVPGLKERIVKEFTGFLPTGTNITVNMSSDPSLDAWKGMAALARNEEQYRKTVISKKEYEEYGPEYIKEHKLGNTKYFED 755 T 2.8E-42 Actin pdbpercent F Eukaryota T 8eup 5 E 5 NSA1_SCHPO Ribosome biogenesis protein nsa1 MKLLLGDEIGQLKFIEIKKGTDTSNPESEAPVIQKFGELDREKGVLFMLKHEMNVFVARKNGTIECWNVNQEPPILSSLWQLDSSLLETASIVSMKYSNGWLMLALSDGNLLFRHIESSKLRKLQLHGPLSAVELHPRIPGIIAAGGKENDVCLYSCNPTCKSNIDELELWRTENVVKVFQGKNVKNDSLNLRVRVWITGIVFTEDIINVIDGKSEDDESLCFHFATITHYGQLRFYDTKHGRRPVSTFDVSTSPLSHVGLLPSIKLLYFADKRAQISIFDHSKKKVIGRFQGVKGAPSSIHCLGNVVAITGLDRNVRIFDADRKPLANAYIKALPTSIIVINERDAEIIKKEEELEAAKEEEEEIWRNMEQLEDTEDKKPSKRIKL 387 T 0.13 SopA pdbpssm F Eukaryota T 8euy 5 E 5 NSA1_SCHPO Ribosome biogenesis protein nsa1 MKLLLGDEIGQLKFIEIKKGTDTSNPESEAPVIQKFGELDREKGVLFMLKHEMNVFVARKNGTIECWNVNQEPPILSSLWQLDSSLLETASIVSMKYSNGWLMLALSDGNLLFRHIESSKLRKLQLHGPLSAVELHPRIPGIIAAGGKENDVCLYSCNPTCKSNIDELELWRTENVVKVFQGKNVKNDSLNLRVRVWITGIVFTEDIINVIDGKSEDDESLCFHFATITHYGQLRFYDTKHGRRPVSTFDVSTSPLSHVGLLPSIKLLYFADKRAQISIFDHSKKKVIGRFQGVKGAPSSIHCLGNVVAITGLDRNVRIFDADRKPLANAYIKALPTSIIVINERDAEIIKKEEELEAAKEEEEEIWRNMEQLEDTEDKKPSKRIKL 387 T 0.13 SopA pdbpssm F Eukaryota T 8ev3 5 E 5 NSA1_SCHPO Ribosome biogenesis protein nsa1 MKLLLGDEIGQLKFIEIKKGTDTSNPESEAPVIQKFGELDREKGVLFMLKHEMNVFVARKNGTIECWNVNQEPPILSSLWQLDSSLLETASIVSMKYSNGWLMLALSDGNLLFRHIESSKLRKLQLHGPLSAVELHPRIPGIIAAGGKENDVCLYSCNPTCKSNIDELELWRTENVVKVFQGKNVKNDSLNLRVRVWITGIVFTEDIINVIDGKSEDDESLCFHFATITHYGQLRFYDTKHGRRPVSTFDVSTSPLSHVGLLPSIKLLYFADKRAQISIFDHSKKKVIGRFQGVKGAPSSIHCLGNVVAITGLDRNVRIFDADRKPLANAYIKALPTSIIVINERDAEIIKKEEELEAAKEEEEEIWRNMEQLEDTEDKKPSKRIKL 387 T 0.13 SopA pdbpssm F Eukaryota T 8ev3 41 OA T RL21A_SCHPO RPL21 KAHFVSTENNEPVTLHPVA 19 T 0.16 MIase unppercent F Eukaryota T 8ew5 1 A A A0A6J3L7M6_9HYME Flightin MWADEEPAPWDIEETPAEQAPEASAESAAPAAEAATGEKPKIKLEKIEPPHYNHHWVRPLFLNYAYYLYEYRKNYYNDVIDYLNQREKGIFREPPRAQEWAERAMRTYDEKNTDKSFKRSADMKYIINMRHEPRYYSYHTRAYYSLKYQKIL 152 T 33 DUF3579 pdbhh F Eukaryota T 8ewy 2 C,D C,D INLR1_MOUSE IFN-LAMBDA R1,CYTOKINE RECEPTOR CLASS-II MEMBER 12,CYTOKINE RECEPTOR FAMILY 2 MEMBER 12,CRF2-12,INTERLEUKIN-28 RECEPTOR SUBUNIT ALPHA,IL-28 RECEPTOR SUBUNIT ALPHA,IL-28R-ALPHA,IL-28RA GPRMKQLEDKVEELLSKNYHLENEVARLKKLVGERKIMKGNPWFQGVKTPRALDFSEYRYPVATFQPSGPEFSDDLILCPQKELT 85 F F Eukaryota T 8ez9 1 A,D R,Q unknown region of DNA-PKcs XXXXXXXXXXXXXXXXXXXX 20 F F F 8eza 5 F,M R,Q PRKDC_HUMAN DNA-dependent protein kinase catalytic subunit -- Unknown region XXXXXXXXXXXXXXXXXXXX 20 F F F 8f0l 3 E,F P,Q CD3E_HUMAN T-CELL SURFACE ANTIGEN T3/LEU-4 EPSILON CHAIN QDGNEEMGGITQT 13 T 0.0084 Ig_3 unp F Eukaryota T 8f0z 2 B B H101 XXXXXXXXXXXXXXXXXXX 19 F F F 8f10 2 B B H102 XXXXXXXXXXXXXXXXXXX 19 F F F 8f12 2 B B H103 XXXXXXXXXXXXXXXXXXX 19 F F F 8f13 2 B B H103 XXXXXXXXXXXXXXXXXXX 19 F F F 8f14 2 B B all-D Helicon Polypeptide H201 XXXXXXXXXXXXXXXXX 17 F F F 8f15 2 D,E,F D,E,F all-D Helicon Polypeptide H202 XXXXXXXXXXXXXXXXX 17 F F F 8f16 2 C,D C,D all-D Helicon Polypeptide H203 XXXXXXXXXXXXXXXXX 17 F F F 8f17 2 C,D C,D all-D Helicon Polypeptide H204 XXXXXXXXXXXXXXXXX 17 F F F 8f24 1 A,B,C,D,E,F C,A,E,D,B,F Mirror-image RNA 0G-XEC-0G-0U-0A-0C-0A-0C 0GX0G0U0A0C0A0C 15 T 0.36 CXCXC pdbhh F T 8f2f 1 A A CLML_MESEU CALCIUM CHANNEL TOXIN-LIKE PEPTIDE-1 GCNRLNKKCNSDADCCRYGERCISTGVNYYCRPDVGPX 38 T 2.4E-05 Toxin_12 unphh F Eukaryota T 8f3a 1 A,B,C A,B,C IQN17 RMKQIEDKIEEIESKQKKIENEIARIKKLLQLTVWGIKQLQARILX 46 T 0.00044 GP41 pdbhh F T 8f3b 1 A,B,C A,B,C IQN22 GRMKQIEDKIEEIESKQKKIENEIARIKKLLQLTVWGIKQLQARILAVERYX 52 T 5.9E-05 GP41 pdbhh F T 8f3k 1 A A A0A220S190_9NEIS ACRIIC5Nch MTIKEDGMSETQYFVSHDGNRHDLFDTLEQAEHYILKKNGWTDGEIAEKWAFVKKEARKYGGDPFSSNGRHSLWFITELKLSDGVIMEVDGQLFDDYVESISAERGTEEFAETKRRLVGYYLGW 124 T 0.088 DUF4761 pdbhh F Bacteria T 8f4b 2 B B Cyclic peptide inhibitor 1 (CPI1) XFWGNLHWYYEQFDSTCX 18 T 2.5 UreE_C pdbhh F T 8f4x 1 A,AA,AB,B,BA,BB,C,CA,CB,D,DA,DB,E,EA,EB,F,FA,FB,G,GA,GB,H,HA,HB,I,IA,J,JA,K,KA,L,LA,M,MA,N,NA,O,OA,P,PA,Q,QA,R,RA,S,SA,T,TA,U,UA,V,VA,W,WA,X,XA,Y,YA,Z,ZA 0,Q,q,1,R,r,2,S,s,3,T,t,4,U,u,5,V,v,6,W,w,7,X,x,8,Y,9,Z,A,a,B,b,C,c,D,d,E,e,F,f,G,g,H,h,I,i,J,j,K,k,L,l,M,m,N,n,O,o,P,p RC_I_1-H11 MEEERRRHLAAAEARFLLELGRPDEVLRLLERLLEEGDPALFAALRELLESGDPLARLIAETVFRRL 67 T 0.00063 TPR_14 pdb F T 8f53 1 A,AA,AB,B,BA,BB,C,CA,CB,D,DA,DB,E,EA,EB,F,FA,FB,G,GA,GB,H,HA,HB,I,IA,J,JA,K,KA,L,LA,M,MA,N,NA,O,OA,P,PA,Q,QA,R,RA,S,SA,T,TA,U,UA,V,VA,W,WA,X,XA,Y,YA,Z,ZA A,Wq,R,F,Ae,Ex,K,D,S,U,I,Dy,J1,N,T,Od,X,Cz,Tc,Bb,0,Ba,Gf,2,B,Lj,G,Qn,L,Vr,V,Zt,Z,E,Id,J,Nh,O,Sl,Y,Xp,Dc,Yb,Fg,C,Kk,H,Po,M,Us,W,Hu,Ca,P,He,Gv,Mi,Q,Rm,Fw RC_I_2 MMEAMVKYLAEKAGISEVEAAEIVLKAVKISGGDVVKSIELVDLFIEILNKGRE 54 T 0.16 DUF3606 pdbhh F T 8f54 1 A,AA,AB,B,BA,BB,C,CA,CB,D,DA,DB,E,EA,EB,F,FA,FB,G,GA,GB,H,HA,HB,I,IA,J,JA,K,KA,L,LA,M,MA,N,NA,O,OA,P,PA,Q,QA,R,RA,S,SA,T,TA,U,UA,V,VA,W,WA,X,XA,Y,YA,Z,ZA L,l,3,o,G,d,p,n,4,A,h,e,q,R,5,S,J,f,r,K,7,C,H,6,D,M,T,I,E,u,B,v,s,w,i,W,k,x,U,X,m,y,g,Y,t,z,N,Z,O,0,V,a,P,1,F,b,Q,2,j,c RC_I_1 PDEDLKAELAATEAIWLLRQGRPEEVWKLMQRLYEKGDPALWAVLRALLRSGDEIAILIAWNFMQRI 67 T 0.91 DUF1841 pdbhh F T 8f7n 1 A A A0A8B3MS64_RHIML Methyl-accepting chemotaxis protein GLLQGRMEISNSVLKTLSGFKDVYAQMNNFLQQTTDESRRMLKDAIVTQKEVLAETAAQVAGGNGEDELAAAIAATSDIETRIDGLWTLHEGEQKLRAETRADLERLAAEQAKINEEANRLQYAVRKDENAAKTMLRNAEKLMRASRFYAEFATEVSGAITVEEKLKVAEGHFPAIGRTQRDIFVLLPKGEKSLAETVNSASGAIGALIKTPPGPETLAGLSKYVDRFRTASFRLEAASVGKMREATQIFSELDGKIAGTESVLTATRRLSTSLTDIQIAAAAFLGTTSEESRKKLLDRFLAVQSNLTTLRGIASGMSFFDQAAGALLPIIDGMKKDGLALVEITDKRTVEFEAAGAAINEIWSDLTGFAEQQKVAAGSERAEANQ 386 T 0.0028 Lipoprotein_6 unppercent F Bacteria T 8f7r 2 B,H P,Q endomorphin YPWFX 5 T 22 CHASE7 pdbhh F F 8f7s 2 B,G P,Q DA2D_PHYBI [D-ALA2]-DELTORPHIN II YXFEVVGX 8 T 2.2 DapB_C unppercent F Eukaryota T 8f7w 2 B P PDYN_HUMAN Dynorphin YGGFLRRI 8 T 0.53 Op_neuropeptide pdbhh F Eukaryota T 8f7x 2 B P PNOC_HUMAN ORPHANIN FQ,PPNOC FGGFTGARKSARKL 14 T 1.3 Lem_TRP pdbhh F Eukaryota T 8f86 7 K K SIR6_HUMAN NAD-DEPENDENT PROTEIN DEACETYLASE SIRTUIN-6,PROTEIN MONO-ADP-RIBOSYLTRANSFERASE SIRTUIN-6,REGULATORY PROTEIN SIR2 HOMOLOG 6,HSIRT6,SIR2-LIKE PROTEIN 6 CSVNYAAGLSPYADKGKCGLPEIFDPPEELERKVWELARLVWQSSSVVFHTGAGISTASGIPDFRGPHGVWTMEERGLAPKFDTTFESARPTQTHMALVQLERVGLLRFLVSQNVDGLHVRSGFPRDKLAELHGNMFVEECAKCKTQYVRDTVVGTMGLKATGRLCTVAKARGLRACRGELRDTILDWEDSLPDRDLALADEASRNADLSITLGTSLQIRPSGNLPLATKRRGGRLVIVNLQPTKHDRHADLRIHGYVDEVMTRLMKHLGLEIPAWDGPRVLERALPPLPRPPTPKLEPKEESPTRINGSIPAGPKQEPCAQHNGSEPASPKRERPTSPAPHRPPKRVKAKAVPS 355 T 3.2E-07 SIR2 unppercent F Eukaryota T 8f8e 1 A,B A,B DCAF1_HUMAN HIV-1 VPR-BINDING PROTEIN,VPRBP,SERINE/THREONINE-PROTEIN KINASE VPRBP,VPR-INTERACTING PROTEIN GASAFRPISVFREANEDESGFTCCAFSARERFLMLGTCTGQLKLYNVFSGQEEASYNCHNSAITHLEPSRDGSLLLTSATWSQPLSALWGMKSVFDMKHSFTEDHYVEFSKHSQDRVIGTKGDIAHIYDIQTGNKLLTLFNPDLANNYKRNCATFNPTDDLVLNDGVLWDVRSAQAIHKFDKFNMNISGVFHPNGLEVIINTEIWDLRTFHLLHTVPALDQCRVVFNHTGTVMYGAMLQADDEDDLMEERMKSPFGSSFRTFNATDYKPIATIDVKRNIFDLCTDTKDCYLAVIENQGSMDALNMDTVCRLYEVG 315 F F Eukaryota T 8f8y 2 C,D E,F VRK1_HUMAN VACCINIA-RELATED KINASE 1 PRVKAAQAGRQS 12 T 34 KGG pdbhh F Eukaryota T 8fae 2 B,D,E C,E,A O40222_9HIV1 Envelope glycoprotein gp120 EKLWVTVYYGVPVWKEATTTLFCASDAKAYDTEVHNVWATHACVPTDPNPQEVVLENVTENFNMWKNNMVEQMHEDIISLWDESLKPCVKLTPLCVTLNCTDLRNVTNINNSSEGMRGEIKNCSFNITTSIKDKVKKDYALFYKLDVVPIDNDNTSYRLINCNTSTITQACPKVSFEPIPIHYCTPAGFAILKCKDKKFNGTGPCKNVSTVQCTHGIKPVVSTQLLLNGSLAEEEVVIRSSNFTDNAKNIIVQLKESVEINCTRPNNNTRKSIHIGPGKAFYTTGDIIGDIRQAHCNISRTKWNNTLNQIATKLKEQFGNNKTIVFNQSSGGDPEIVMHSFNCGGEFFYCNSTQLFNSTWNFNGTWNLTQSNGTEGNDTITLPCKIKQIINMWQEVGKAMYAPPIRGQIRCSSNITGLILTRDGGNNHNNDTETFRPGGGDMRDNWRSELYKYKVVKIEPLGVAPTKAKRRV 472 T 1.9E-55 GP120 pdbpercent T Viruses T 8fbc 1 A,B A,B A0A3C0KFZ6_9BURK Cytochrome P450 MGLGSFHFDPYSPAIDADPFPSYKRLRDEFPCFWSEEAQMWILSRYSDIVTAGQDWQTYSSASGNLMTELPGRAGATLGSSDPPKHDRLRGLIQHAFMKRNLMALEEPIRDVAKQVFAQVKGVKEFDFKDVSSQFTVKVLMAALGLPMGEDALVPEHEVRENAVLMVQSDARTRAKGPEHIAAYNWMQDYASKVIAMRRASPQNDLISNFALAEIDGDRLDDREVLLTTTTLIMAGVESLGGFMMMFAYNLATFDEARRAVVANPALLPDAIEESLRFNTSAQRFRRRLMKDVTLHGQTMKEGDFVCLAYGSGNRDERQYPNPDVYDIARKPRGHLGFGGGVHACLGTAIARLAVKIAFEEFHQVVPDYRRVADQLPWMPSSTFRSPLVLQLKAQ 395 T 9.7E-32 p450 unppercent F Bacteria T 8fbd 1 A,C A,C A0A126JJ68_CLOBO Neurotoxin complex component Orf-X1 MELKQAFVFEFDENLSSSSGSIHLEKVKQNCSPNYDYFKITFIDGYLYIKNKSGVILDKYDLKNVISLVALKRDYLSLSLSNNKQIKKFKNIKNKHLKNKFNLYVINEDIEKRITKNGILEEVILNKMLLSILLGNEENLLQIS 144 T 0.021 Glyco_hydro_39 unppercent F Bacteria T 8fbe 1 A,B A,B O52975_CLOBO Neurotoxin complex component Orf-X1 SELKQAFVFEFDENLSSSSGSIHLEKVKQNSSPNYDYFKITFIDGYLYIKNKSGVILDKYDLKNVISLVALKRDYLSLSLSNNKQIKKFKNIKNKHLKNKFNLYVINEDIEKRITKNGILEEVILNKMLLSILLGNEENLLQIS 144 T 31 DUF3161 unphh F Bacteria T 8fbi 1 A A KWOCA_39 MPETFEAIARAIEVAREVEKVAQRAEEEGNPDLRDSAKELARAVDEAIEEAKKQGNPELVEWVARAAKVAAEVIKVAIQAEKEGNRDLFRAALELVRAVIEAIEEAVKQGNPELVEWVARAAKVAAEVIKVAIQAEKEGARDLFRLALELVRAVIEAIEFAVKLGDPEMVERAARIAKTAAELIKRAIRAKKEGDKDQEREAKKRVTRLIIELTLMVLKASLDLLRRILEELKEMLERLEKNPDKDVIVKVLKVIVKAIEASVDNQRVSADNQKMLAELAGSWSGGGSEQKLISEEDLGGS 301 T 0.0025 EST1_DNA_bind pdbpercent F T 8fbn 1 A,B,C,D,E,F A,B,C,D,E,F KWOCA_73 ALEKDRRALEALKRAQEAEKKGDVEEAVRAAQEAVRAAKESGASWILRLVAEQALRIAKEAEKQGNVEVAVKAARVAVEAAKQAGDNDVLRKVAEQALRIAKEAEKQGNVDVAAKAAQVAAEAAKQAGDKDMLEKVAKVAEQIAKAAEKEGDKKVSIDATRIALEASLAALEIILEELKEMLERLEKNPDKDVIVKVLKVIVKAIEASVKNQKISAKNQKALAELA 226 T 0.00058 Syntaxin-6_N pdbpercent F T 8fbo 1 A,B,C A,B,C KWOCA_102 MTEEKIEEARQSIKEAERSLREGNPEKALDAVARALSLVNELERLARKTGSTEVLIEAARLAIEVARVALKVGSPEMAQLAVELALRLVQELERQARKTGSTEVLIEAARLAIEVARVAFKVGSPETAREAARTALELVEELERQARKTGSEEVLERAARLAEEVARVAEEIGDPELARKAMKVAIRLTEELLKKSLRELRRILEELKEMLERLEKNPDKDVIVKVLKVIVKAIEASVENQRISADNQRALARLAGSWSGGGSEQKLISEEDLGGS 276 T 0.024 Fmp27_WPPW pdb F T 8fbw 3 C,F E,F SIV V2 peptide LKSDKKIEYNETWYSRD 17 T 13 YlaC pdbhh F T 8fck 1 A A A0A8J1L9M8_XENLA HAUS augmin-like complex subunit 1 MDEKSTKIIMWLKKMFGDKPLPPYEVNTRTMEILYQLAEWNEARDKDLSLVTEDLKLKSAEVKAEAKYLQDLLTEGLGPSYTNLSRMGNNYLNQIVDSCLALELKNSSLSSYIPAVNDLSSELVAIELNNQEMEAELTSLRKKLTEALVLEKSLEQDLKKAEEQCNFEKAKVEIRSQNMKKLKDKSEEYKYKIHAAKDQLSSAGMEEPLTHRSLVSLSETLTELKAQSMAAKEKLNSYLDLAPNPSLVKVKIEEAKRELKATEVELTTKVNMMEFVVPEPSKRRLK 286 T 0.23 DUF16 pdb F Eukaryota T 8fck 7 G G B1H1T5_XENLA LOC100158301 PROTEIN MTGGKELGAAVELYERLQMLSCPCLEGVYLTDPQSIYELLCTPSSHRLDILQWLCSRIYPPVQEQLSSLKESQTDTKVKEIAKLCFDLMLCHFDDLDLIRGHASPFKQISFIGQLLDVIQYPDTISSNVILESLSHSTEKNVVTCIRENEELLKELFSSPHFQATLSPECNPWPADFKPLLNAEESLQKRATQSSKGKDMSNSVEALLEISSSLKALKEECVDLCSSVTDGDKVIQSLRLALTDFHQLTIAFNQIYANEFQEHCGHPAPHMSPMGPFFQFVHQSLSTCFKELESIAQFTETSENIVDVVRERHQSKEKWAGSTISTLCEKMKELRQSYEAFQQSSLQD 348 T 0.1 L27 pdbpercent F Eukaryota T 8fck 8 H H HAUS8_XENLA HEC1/NDC80-INTERACTING CENTROSOME-ASSOCIATED PROTEIN 1,SARCOMA ANTIGEN NY-SAR-48 HOMOLOG MSEAGVAPIEDGSQNSSGGSSGDAALKKSKGGAKVVKSRYMQIGRSKVSKNSLANTTVCSGGKVPERGSGGTPTRRSLAPHKAKITAAVPLPALDGSIFTKEDLQSTLLDGHRIARPDLDLSVINDRTLQKITPRPVVTSEQKKPKRDTTPVNLVPEDMVEMIESQTLLLTYLTIKMQKNLFRLEEKAERNLLLVNDQKDQLQETIHMMKRDLTLLQREERLRDLIEKQDEVLTPVVTSKDPFKDNYTTFATALDSTRHQLAIKNIHITGNRHRYLEELQKHLAITKSLLEEIMPSHASENAESFDTIKDLENIVLKTDEELARSFRQILDLSFKVNKEISLQSQKAVEETCESALVRQWYFDGSLP 367 F F Eukaryota T 8fed 10 K K A0QWR2_MYCS2 Transmembrane protein MSKWLLRGVVFATAMVIVRLLQGALVNASPGNAIWFSTGLLVLYAIGVAVWGVLDGRGDARSNPDPDRRADLAMTWLLAGLAAGILSGAVSWFIGLFYKSIYTESLLNEITTFAAFTALLTFLVAVAGVTIGRWTIDRKAPPVTRTRHGLAADDDRADTDVFAAVSANGAQEHTDTTQTTPLENPDQPRQS 191 T 0.0021 DUF3611 pdbpssm F Bacteria T 8fg0 3 E,F P,Q CSP_PLAF7 CS,PFCSP QGHNMPNDPNRNVD 14 T 0.26 DUF3533 unppercent F Eukaryota T 8fis 2 C,E,F C,F,G Q2N0S6_9HIV1 ENV POLYPROTEIN AENLWVTVYYGVPVWKDAETTLFCASDAKAYETEKHNVWATHACVPTDPNPQEIHLENVTEEFNMWKNNMVEQMHTDIISLWDQSLKPCVKLTPLCVTLQCTNVTNNITDDMRGELKNCSFNMTTELRDKKQKVYSLFYRLDVVQINENQGNRSNNSNKEYRLINCNTSACTQACPKVSFEPIPIHYCAPAGFAILKCKDKKFNGTGPCPSVSTVQCTHGIKPVVSTQLLLNGSLAEEEVMIRSENITNNAKNILVQFNTPVQINCTRPNNNTRKSIRIGPGQAFYATGDIIGDIRQAHCNVSKATWNETLGKVVKQLRKHFGNNTIIRFANSSGGDLEVTTHSFNCGGEFFYCNTSGLFNSTWISNTSVQGSNSTGSNDSITLPCRIKQIINMWQRIGQCMYAPPIQGVIRCVSNITGLILTRDGGSTNSTTETFRPGGGDMRDNWRSELYKYKVVKIEPLGVAPTRCKRRVVGRRRRRR 481 T 3.5E-54 GP120 pdbpercent T Viruses T 8fjk 4 M,N,OA,PA,QA M,N,k,l,m CAPSD_AQRVC ATP-DEPENDENT DNA HELICASE VP3 TASPADTNVVPAKDAPTTNSPPSTTSPNQAAADANQQQAGIVSSQSGPNAVGDSAPSTSVNNDGDIITRPTSDSIAAVANATKPAAVVSDPQSM 94 T 0.068 DUF5888 pdb T Viruses T 8fjl 4 KA,LA,M,MA,N k,l,M,m,N CAPSD_AQRVC ATP-DEPENDENT DNA HELICASE VP3 TASPADTNVVPAKDAPTTNSPPSTTSPNQAAADANQQQAGIVSSQSGPNAVGDSAPSTSVNNDGDIITRPTSDSIAAVANATKPAAVVSDPQSM 94 T 0.068 DUF5888 pdb T Viruses T 8fjo 1 A A B2HHT9_MYCMM Cytochrome P450 124A1, Cyp124A1 MDLSTNLNTGLLPRVNGTPPPEVPLADIELGSLEFWGRDDDFRDGAFATLRREAPISFWPPIELAGLTAGKGHWALTKHDDIHFASRHPEIFHSSPNIVIHDQTPELAEYFGSMIVLDDPRHQRLRSIVSRAFTPKVVARIEASVRERAHRLVAAMIENHPDGQADLVSELAGPLPLQIICDMMGIPEEDHEQIFHWTNVILGFGDPDLTTDFDEFLQVSMAIGGYATALADDRRVNHHGDLTTSLVEAEVDGERLSSSEIAMFFILLVVAGNETTRNAISHGMLALSRYPDERAKWWSDFDGLAATAVEEIVRWASPVVYMRRTLSQDVDLRGTKMAAGDKVTLWYCSANRDEEKFADPWTFDVTRNPNPQVGFGGGGAHFCLGANLARREIRVVFDELRRQMPDVVATEEPARLLSQFIHGIKRLPVAWSRHHHHHH 439 T 1.3E-22 p450 unppssm F Bacteria T 8fk5 2 C,E,F C,G,I Q2N0S6_9HIV1 ENV POLYPROTEIN AENLWVTVYYGVPVWKDAETTLFCASDAKAYETEKHNVWATHACVPTDPNPQEIHLENVTEEFNMWKNNMVEQMHTDIISLWDQSLKPCVKLTPLCVTLQCTNVTNNITDDMRGELKNCSFNMTTELRDKKQKVYSLFYRLDVVQINENQGNRSNNSNKEYRLINCNTSACTQACPKVSFEPIPIHYCAPAGFAILKCKDKKFNGTGPCPSVSTVQCTHGIKPVVSTQLLLNGSLAEEEVMIRSENITNNAKNILVQFNTPVQINCTRPNNNTRKSIRIGPGQAFYATGDIIGDIRQAHCNVSKATWNETLGKVVKQLRKHFGNNTIIRFANSSGGDLEVTTHSFNCGGEFFYCNTSGLFNSTWISNTSVQGSNSTGSNDSITLPCRIKQIINMWQRIGQCMYAPPIQGVIRCVSNITGLILTRDGGSTNSTTETFRPGGGDMRDNWRSELYKYKVVKIEPLGVAPTRCKRRVVGRRRRRR 481 T 3.5E-54 GP120 pdbpercent T Viruses T 8fk7 1 A,B,C,D,E,F,G,H,I,J,K,L,M,N,O,P,Q,R,S,T A,B,C,D,E,F,G,H,I,J,K,L,M,N,O,P,Q,R,S,T A3MVU7_PYRCJ Flagellin MPMKTKGLEPIVAAVLLIVVAVIGAVLVYLWFSGYVTRATSQAEQLSAAEQLKIEAVSKTGTTVSVNVRNVGEVPVKIASAYVLNATTLTMICGGSLTSPQQIDPGTIQTINVPGTCNLIAGARYIVKVVTARGTEAAATFISP 144 T 0.00034 Pilin_N unppssm F Archaea T 8fkp 47 UA SZ NOP16_HUMAN HBV PRE-S2 TRANS-REGULATED PROTEIN 3 MPKAKGKTRRQKFGYSVNRKRLNRNARRKAAPRIECSHIRHAWDHAKSVRQNLAEMGLAVDPNRAVPLRKRKVKAMEVDIEERPKELVRKPYVLNDLEAEASLPEKKGNTLSRDLIDYVRYMVENHGEDYKAMARDEKNYYQDTPKQIRSKINVYKRFYPAEWQDFLDSLQKRKMEVE 178 T 2.5E-17 Nop16 unppssm F Eukaryota T 8fkq 43 QA SZ NOP16_HUMAN HBV PRE-S2 TRANS-REGULATED PROTEIN 3 MPKAKGKTRRQKFGYSVNRKRLNRNARRKAAPRIECSHIRHAWDHAKSVRQNLAEMGLAVDPNRAVPLRKRKVKAMEVDIEERPKELVRKPYVLNDLEAEASLPEKKGNTLSRDLIDYVRYMVENHGEDYKAMARDEKNYYQDTPKQIRSKINVYKRFYPAEWQDFLDSLQKRKMEVE 178 T 2.5E-17 Nop16 unppssm F Eukaryota T 8fkr 51 YA SZ NOP16_HUMAN HBV PRE-S2 TRANS-REGULATED PROTEIN 3 MPKAKGKTRRQKFGYSVNRKRLNRNARRKAAPRIECSHIRHAWDHAKSVRQNLAEMGLAVDPNRAVPLRKRKVKAMEVDIEERPKELVRKPYVLNDLEAEASLPEKKGNTLSRDLIDYVRYMVENHGEDYKAMARDEKNYYQDTPKQIRSKINVYKRFYPAEWQDFLDSLQKRKMEVE 178 T 2.5E-17 Nop16 unppssm F Eukaryota T 8fks 47 UA SZ NOP16_HUMAN HBV PRE-S2 TRANS-REGULATED PROTEIN 3 MPKAKGKTRRQKFGYSVNRKRLNRNARRKAAPRIECSHIRHAWDHAKSVRQNLAEMGLAVDPNRAVPLRKRKVKAMEVDIEERPKELVRKPYVLNDLEAEASLPEKKGNTLSRDLIDYVRYMVENHGEDYKAMARDEKNYYQDTPKQIRSKINVYKRFYPAEWQDFLDSLQKRKMEVE 178 T 2.5E-17 Nop16 unppssm F Eukaryota T 8fkt 53 AB SZ NOP16_HUMAN HBV PRE-S2 TRANS-REGULATED PROTEIN 3 MPKAKGKTRRQKFGYSVNRKRLNRNARRKAAPRIECSHIRHAWDHAKSVRQNLAEMGLAVDPNRAVPLRKRKVKAMEVDIEERPKELVRKPYVLNDLEAEASLPEKKGNTLSRDLIDYVRYMVENHGEDYKAMARDEKNYYQDTPKQIRSKINVYKRFYPAEWQDFLDSLQKRKMEVE 178 T 2.5E-17 Nop16 unppssm F Eukaryota T 8fku 50 XA SZ NOP16_HUMAN HBV PRE-S2 TRANS-REGULATED PROTEIN 3 MPKAKGKTRRQKFGYSVNRKRLNRNARRKAAPRIECSHIRHAWDHAKSVRQNLAEMGLAVDPNRAVPLRKRKVKAMEVDIEERPKELVRKPYVLNDLEAEASLPEKKGNTLSRDLIDYVRYMVENHGEDYKAMARDEKNYYQDTPKQIRSKINVYKRFYPAEWQDFLDSLQKRKMEVE 178 T 2.5E-17 Nop16 unppssm F Eukaryota T 8fkv 56 DB SZ NOP16_HUMAN HBV PRE-S2 TRANS-REGULATED PROTEIN 3 MPKAKGKTRRQKFGYSVNRKRLNRNARRKAAPRIECSHIRHAWDHAKSVRQNLAEMGLAVDPNRAVPLRKRKVKAMEVDIEERPKELVRKPYVLNDLEAEASLPEKKGNTLSRDLIDYVRYMVENHGEDYKAMARDEKNYYQDTPKQIRSKINVYKRFYPAEWQDFLDSLQKRKMEVE 178 T 2.5E-17 Nop16 unppssm F Eukaryota T 8fkw 53 AB SZ NOP16_HUMAN HBV PRE-S2 TRANS-REGULATED PROTEIN 3 MPKAKGKTRRQKFGYSVNRKRLNRNARRKAAPRIECSHIRHAWDHAKSVRQNLAEMGLAVDPNRAVPLRKRKVKAMEVDIEERPKELVRKPYVLNDLEAEASLPEKKGNTLSRDLIDYVRYMVENHGEDYKAMARDEKNYYQDTPKQIRSKINVYKRFYPAEWQDFLDSLQKRKMEVE 178 T 2.5E-17 Nop16 unppssm F Eukaryota T 8fkx 53 AB SZ NOP16_HUMAN HBV PRE-S2 TRANS-REGULATED PROTEIN 3 MPKAKGKTRRQKFGYSVNRKRLNRNARRKAAPRIECSHIRHAWDHAKSVRQNLAEMGLAVDPNRAVPLRKRKVKAMEVDIEERPKELVRKPYVLNDLEAEASLPEKKGNTLSRDLIDYVRYMVENHGEDYKAMARDEKNYYQDTPKQIRSKINVYKRFYPAEWQDFLDSLQKRKMEVE 178 T 2.5E-17 Nop16 unppssm F Eukaryota T 8fky 59 GB SZ NOP16_HUMAN HBV PRE-S2 TRANS-REGULATED PROTEIN 3 MPKAKGKTRRQKFGYSVNRKRLNRNARRKAAPRIECSHIRHAWDHAKSVRQNLAEMGLAVDPNRAVPLRKRKVKAMEVDIEERPKELVRKPYVLNDLEAEASLPEKKGNTLSRDLIDYVRYMVENHGEDYKAMARDEKNYYQDTPKQIRSKINVYKRFYPAEWQDFLDSLQKRKMEVE 178 T 2.5E-17 Nop16 unppssm F Eukaryota T 8fl1 2 C,E,F C,G,I Q2N0S6_9HIV1 ENV POLYPROTEIN AENLWVTVYYGVPVWKDAETTLFCASDAKAYETEKHNVWATHACVPTDPNPQEIHLENVTEEFNMWKNNMVEQMHTDIISLWDQSLKPCVKLTPLCVTLQCTNVTNNITDDMRGELKNCSFNMTTELRDKKQKVYSLFYRLDVVQINENQGNRSNNSNKEYRLINCNTSACTQACPKVSFEPIPIHYCAPAGFAILKCKDKKFNGTGPCPSVSTVQCTHGIKPVVSTQLLLNGSLAEEEVMIRSENITNNAKNILVQFNTPVQINCTRPNNNTRKSIRIGPGQAFYATGDIIGDIRQAHCNVSKATWNETLGKVVKQLRKHFGNNTIIRFANSSGGDLEVTTHSFNCGGEFFYCNTSGLFNSTWISNTSVQGSNSTGSNDSITLPCRIKQIINMWQRIGQCMYAPPIQGVIRCVSNITGLILTRDGGSTNSTTETFRPGGGDMRDNWRSELYKYKVVKIEPLGVAPTRCKRRVVGRRRRRR 481 T 3.5E-54 GP120 pdbpercent T Viruses T 8flo 1 A A B2HHT9_MYCMM CYP124A1 MDLSTNLNTGLLPRVNGTPPPEVPLADIELGSLEFWGRDDDFRDGAFATLRREAPISFWPPIELAGLTAGKGHWALTKHDDIHFASRHPEIFHSSPNIVIHDQTPELAEYFGSMIVLDDPRHQRLRSIVSRAFTPKVVARIEASVRERAHRLVAAMIENHPDGQADLVSELAGPLPLQIICDMMGIPEEDHEQIFHWTNVILGFGDPDLTTDFDEFLQVSMAIGGYATALADDRRVNHHGDLTTSLVEAEVDGERLSSSEIAMFFILLVVAGNETTRNAISHGMLALSRYPDERAKWWSDFDGLAATAVEEIVRWASPVVYMRRTLSQDVDLRGTKMAAGDKVTLWYCSANRDEEKFADPWTFDVTRNPNPQVGFGGGGAHFCLGANLARREIRVVFDELRRQMPDVVATEEPARLLSQFIHGIKRLPVAWSRHHHHHH 439 T 1.3E-22 p450 unppssm F Bacteria T 8flp 1 A A Alpha-conotoxin LvIC analogue GCCANPVCNGKHCX 14 T 0.016 Toxin_8 pdbhh F T 8flt 5 E P M-PTH(1-14) XVXEIQLMHQXAKW 14 T 0.0055 Parathyroid pdbhh F T 8flw 4 E,G,H C,G,I Q2N0S6_9HIV1 ENV POLYPROTEIN AENLWVTVYYGVPVWKDAETTLFCASDAKAYETEKHNVWATHACVPTDPNPQEIHLENVTEEFNMWKNNMVEQMHTDIISLWDQSLKPCVKLTPLCVTLQCTNVTNNITDDMRGELKNCSFNMTTELRDKKQKVYSLFYRLDVVQINENQGNRSNNSNKEYRLINCNTSACTQACPKVSFEPIPIHYCAPAGFAILKCKDKKFNGTGPCPSVSTVQCTHGIKPVVSTQLLLNGSLAEEEVMIRSENITNNAKNILVQFNTPVQINCTRPNNNTRKSIRIGPGQAFYATGDIIGDIRQAHCNVSKATWNETLGKVVKQLRKHFGNNTIIRFANSSGGDLEVTTHSFNCGGEFFYCNTSGLFNSTWISNTSVQGSNSTGSNDSITLPCRIKQIINMWQRIGQCMYAPPIQGVIRCVSNITGLILTRDGGSTNSTTETFRPGGGDMRDNWRSELYKYKVVKIEPLGVAPTRCKRRVVGRRRRRR 481 T 3.5E-54 GP120 pdbpercent T Viruses T 8flx 1 A A LK031 PELFLQDLRSLVEAARILARLARQRGDEHALERAARWAEQAARQAEKLARQARKEGNLELALKALQILVNAAYVLAEIARDRGNEELLEYAARLAEEAARQAAEIWAEAARRGNQQLRTKAAHILLRAAEVLLEIARDRGNQELLEKAQRIVEAVAAAQQVAALALRLAEELDSEEAKKAVRAIAEAAAAALLAALQGKDEVAKLALKVLKEAIELAKENRSEEALKVVLEIARAAAAAARAAEEGKTEVAKLALKVLEEAIELAKENRSEEALKVVLEIARAALAAAQAAEEGKSDEARDALRRLEEAIEEAKENRSKESLEKVREEAKEAEQQAEDAREGKGWSENLYFQ 352 T 0.014 Phage-MuB_C pdbpercent F T 8fn4 1 A 1 Q57XL7_TRYB2 RNA-editing substrate-binding complex protein 1 (RESC1) MLRLLRRSIVGSTFNIMVRRQNQGSVSQGALNMRDQQAAAAENVTPERVWALWNEGNLFSLSLAQLQGFLSRCGVRTDPAAKKAAVVRQVEEYLHSKDTTVKGGGQGAASPQQHQQHGQQGGYGRWNQASVMQPETLLDLSQAGFYEGAANMVPKAFQLLVSDTAPDVVVSRVNTTAFPGFPSNTECYTLGASEKDVAIRSRYSKVLQWCCLNMSNLQMDGELYVDFGKLLLKPSVMRKNRRIVSSYTLQQRLQVNHPYTWVPTLPESCLSKIQEQFLQPEGFAPIGKGVQLTYSGTIKRSKDQLHVDLDNKGKVLAVNSAWVNLQTAWCTHAKGPDVRLLLRSRPPIRRQDVELFASTPIIKLADDDVADVLPPEHGQLVYLSEDETRLFERVSDRGVTITVREVKRQPLIILRDEEEDPRVEYSLSAHIPANAAKATDVRAVGLTAFELAGRLAGLVAEDFVREYGCEAKL 473 T 0.058 HeH unppercent F Eukaryota T 8fn4 2 B 2 B6SBL9_9TRYP RNA-editing substrate-binding complex protein 2 (RESC2) MLRARLKIFSALNGATSAFSRAVAPLQIATRQQSFSAAAPAASGDFSHITRNTVWGLWNEGNLFSLSVPELAFFLQEHCRVANVDPRAKKSALVRQVEEILSAEQQASATVPQEDNPHAIVVTDYDRAEDALEEADEYGDWGAEPGFEDRRELDFMELSPGRMGERYDPLSPRAFQLLHSETATDVGIASIDPSKLPGQSKVKNALAAIHVAPNDANKMRFRMAFEWCLMNIWNMNMPGELNIGAGKALYYRSVAKQNRNVMPLWTVQKHLYAQHPYAWFAIASESNVAAMESLAAALNMSIQQERTTSYKVTIRRMAEFFDCELNGQLKCTMMNKPWDRFFVSHYIRSKMPDLRYVVRARHPIKKRIADAYLEADILRSTRDSVQSVLSPELGDVVYCCERVVRKWAKKTATGVTLQLVETKRTPLIITKAGDEGERLEYEWIVPLPQQAERIDIAALTDELWEYGNKLAAALEEGMEELMVHTMTAVSAY 492 T 0.7 ARMET_C pdbhh F Eukaryota T 8fn4 3 C 3 Q381A0_TRYB2 RNA-editing substrate-binding complex protein 3 (RESC3) MSNPFEKVARGIAFKMRSKVHKQGYSNTVMAQQARRLSPTGLLAMERLTELTALQQRHQCTFDPALRSKATQILRTLPLLSIDEDPYFTHTQRALRLAAYFGAVDLPVTYALINQHTKNAFMLDAFSMASFFYTLAKLKHPQTKEIVGILLPRLREVAPELIAREAVHILRLLCSIQMADAQLVKVVTETVVATAADVPLRDARQCAFILSETFPEEAQRILGAVEHRLCDDIDMNADANEVKTTILDVCRVVSATCKGPRRLLNSVARRSMELLPQLTPLDVAFVLKAFHLSSYRHLRLLRVLSSSLAASFPTSNVTKEHGLAASIVVQSLAHFYLSGCEEVVVTLVNASVNVLEGLNLALTLLACVRLRCVSPGVDPAVDALCSGAPMRRYVHNAHSMQVTSRILYGLAHAGRCRSDEEVAIVLPLLKSVVRTPGALRDDCRGFLLDAVTALGADGECSNDALQEQVRKVYERLSQDGGK 482 T 11 DUF6489 pdbhh F Eukaryota T 8fn4 4 D 4 Q384R6_TRYB2 RNA-editing substrate-binding complex protein 4 (RESC4) MNGRLYCLIRRITSPPVATRLIKEELCLSMAAIARLPLRRDQLAHVTNTEAITTRAQRISHLCTPTELGMIAEGAEALSCNRFDLADALIDGAYESVRRAASSTRLSHVSAIARYSASIKTYGNETITTLLKAGASLLQKNDSVPVLKSFLGVAQSHLTDGEMRVLIDEMCAKATEEQRLCINSIGTQSLAKDAAKCGEETLTKGNEDGDETAVDDEETQAWDMLRARQWMLQLVRCGKPPTAAEAVQAMELYAHFAVRDFVLHEKIEDLVLLVLPTGNKFHLNEMHKIVLRSPNLFPRVRNTLGQDHSGVSDVHRADRGVEWSDDPASSLTTTYTTSRAYSMLLLGQRLSEDIMFDVVQEQSETIPVDVAAQAACLFAEKGDIPEGVILRLSAELEHISPQGVTAFVRAARRDSSGALLPHYAAVLNRFTERDLCDTPLETLLQMCEVFALPAPRGTSEGDNDSINESQSKFQKALIVRLFSVIQGSRDVPFLCKVAKAVRAFDANDELIQFVCSSICAQGALSECEALIAFDMIRCCDFVYEPLLDAMEPVFRRLVESVSAMLEGKSTINDVEVRRCACFATLQSEFDCPDFETLASLLVHTVEKNVTGCPVELIPSVGLLCVRTRRTSALYIVGNKLEGNMQQLSDDAIGELARLLVGTENLATKELAVEFQSVVVSRLLRQQSLPPDVVALSAVVWLRQGDKVGTIDERSVDYIIKWMYAIGSSVYTDLCLAVHLSASVESLSNALIDDLPRRLELLTTNEMANAIFGLGEVSDMGARLSHQLVAERCSDYVVDHSQEFWSGKVIARLLYGFSRMHCTKRSLYNVFATRLAHRPVFSLLDQEAISFAIAAFGRVKYLDKKLFDRFTRWILDHSKDLNAAELLLTIRGVSRVMLLNDQLYDDLGSKAAEKVKEFPIESQCVLLSSFGSLGVEHERLASRMVSSIAENREELTDATKAVDVITSLWSMNYDVEDDKHVAQLADWVVQRAEELTDESIGKLCLVLSDTNWRHVPLVRAIAEQSVRLQGQQSISPKCCREVLDVLGTFMIHHQGARENLSALGRSISKERIQLSEEEEQHLQLLLRR 1087 T 0.017 MOR2-PAG1_C pdb F Eukaryota T 8fn4 5 E 5 Q389F5_TRYB2 RNA-editing substrate-binding complex protein 5 (RESC5) MLRHTSRNNALHAFVRSPHYRTIPSAGPNGIVVNRDMLVHQFRDFYKTLQHCSLVDKVHLMSERPSVEALRVADQMVSIGATFLEMPLTGMEHRATEFMESMRYVRGAGGPSTLASYLQDTENCRCNSGDVVCLPNGIAVGHGPRTNAVAHTTLKQLFEVKDDQFSFDVFTLEQEGDAPPLGDYFGFAGSNVLLTWKDEHGLLAVDQYQQKQPHTEMNVVYLEPGCHFLSFYGVDHTIDVLVQKGYERSMDSIAAAGLNPIPVQWSEMDKLGISMRAAVLPLKFFKANVGGMLSRNKSRGARWQTHQLQKGSGSGSASSGASAAGSSGASASSGASAAGSSGASAGHHHHHHHHHHSGSEDQVDPRLIDGKASAWSHPQFEKGGGSGGGSGGSAWSHPQFEK 402 T 0.29 ADI unp F Eukaryota T 8fn6 2 B 1 Q57XL7_TRYB2 RNA-editing substrate-binding complex protein 1 (RESC1) MLRLLRRSIVGSTFNIMVRRQNQGSVSQGALNMRDQQAAAAENVTPERVWALWNEGNLFSLSLAQLQGFLSRCGVRTDPAAKKAAVVRQVEEYLHSKDTTVKGGGQGAASPQQHQQHGQQGGYGRWNQASVMQPETLLDLSQAGFYEGAANMVPKAFQLLVSDTAPDVVVSRVNTTAFPGFPSNTECYTLGASEKDVAIRSRYSKVLQWCCLNMSNLQMDGELYVDFGKLLLKPSVMRKNRRIVSSYTLQQRLQVNHPYTWVPTLPESCLSKIQEQFLQPEGFAPIGKGVQLTYSGTIKRSKDQLHVDLDNKGKVLAVNSAWVNLQTAWCTHAKGPDVRLLLRSRPPIRRQDVELFASTPIIKLADDDVADVLPPEHGQLVYLSEDETRLFERVSDRGVTITVREVKRQPLIILRDEEEDPRVEYSLSAHIPANAAKATDVRAVGLTAFELAGRLAGLVAEDFVREYGCEAKL 473 T 0.058 HeH unppercent F Eukaryota T 8fn6 3 C 2 B6SBL9_9TRYP RNA-editing substrate-binding complex protein 2 (RESC2) MLRARLKIFSALNGATSAFSRAVAPLQIATRQQSFSAAAPAASGDFSHITRNTVWGLWNEGNLFSLSVPELAFFLQEHCRVANVDPRAKKSALVRQVEEILSAEQQASATVPQEDNPHAIVVTDYDRAEDALEEADEYGDWGAEPGFEDRRELDFMELSPGRMGERYDPLSPRAFQLLHSETATDVGIASIDPSKLPGQSKVKNALAAIHVAPNDANKMRFRMAFEWCLMNIWNMNMPGELNIGAGKALYYRSVAKQNRNVMPLWTVQKHLYAQHPYAWFAIASESNVAAMESLAAALNMSIQQERTTSYKVTIRRMAEFFDCELNGQLKCTMMNKPWDRFFVSHYIRSKMPDLRYVVRARHPIKKRIADAYLEADILRSTRDSVQSVLSPELGDVVYCCERVVRKWAKKTATGVTLQLVETKRTPLIITKAGDEGERLEYEWIVPLPQQAERIDIAALTDELWEYGNKLAAALEEGMEELMVHTMTAVSAY 492 T 0.7 ARMET_C pdbhh F Eukaryota T 8fn6 4 D 3 Q381A0_TRYB2 RNA-editing substrate-binding complex protein 3 (RESC3) MSNPFEKVARGIAFKMRSKVHKQGYSNTVMAQQARRLSPTGLLAMERLTELTALQQRHQCTFDPALRSKATQILRTLPLLSIDEDPYFTHTQRALRLAAYFGAVDLPVTYALINQHTKNAFMLDAFSMASFFYTLAKLKHPQTKEIVGILLPRLREVAPELIAREAVHILRLLCSIQMADAQLVKVVTETVVATAADVPLRDARQCAFILSETFPEEAQRILGAVEHRLCDDIDMNADANEVKTTILDVCRVVSATCKGPRRLLNSVARRSMELLPQLTPLDVAFVLKAFHLSSYRHLRLLRVLSSSLAASFPTSNVTKEHGLAASIVVQSLAHFYLSGCEEVVVTLVNASVNVLEGLNLALTLLACVRLRCVSPGVDPAVDALCSGAPMRRYVHNAHSMQVTSRILYGLAHAGRCRSDEEVAIVLPLLKSVVRTPGALRDDCRGFLLDAVTALGADGECSNDALQEQVRKVYERLSQDGGK 482 T 11 DUF6489 pdbhh F Eukaryota T 8fn6 5 E 4 Q384R6_TRYB2 RNA-editing substrate-binding complex protein 4 (RESC4) MNGRLYCLIRRITSPPVATRLIKEELCLSMAAIARLPLRRDQLAHVTNTEAITTRAQRISHLCTPTELGMIAEGAEALSCNRFDLADALIDGAYESVRRAASSTRLSHVSAIARYSASIKTYGNETITTLLKAGASLLQKNDSVPVLKSFLGVAQSHLTDGEMRVLIDEMCAKATEEQRLCINSIGTQSLAKDAAKCGEETLTKGNEDGDETAVDDEETQAWDMLRARQWMLQLVRCGKPPTAAEAVQAMELYAHFAVRDFVLHEKIEDLVLLVLPTGNKFHLNEMHKIVLRSPNLFPRVRNTLGQDHSGVSDVHRADRGVEWSDDPASSLTTTYTTSRAYSMLLLGQRLSEDIMFDVVQEQSETIPVDVAAQAACLFAEKGDIPEGVILRLSAELEHISPQGVTAFVRAARRDSSGALLPHYAAVLNRFTERDLCDTPLETLLQMCEVFALPAPRGTSEGDNDSINESQSKFQKALIVRLFSVIQGSRDVPFLCKVAKAVRAFDANDELIQFVCSSICAQGALSECEALIAFDMIRCCDFVYEPLLDAMEPVFRRLVESVSAMLEGKSTINDVEVRRCACFATLQSEFDCPDFETLASLLVHTVEKNVTGCPVELIPSVGLLCVRTRRTSALYIVGNKLEGNMQQLSDDAIGELARLLVGTENLATKELAVEFQSVVVSRLLRQQSLPPDVVALSAVVWLRQGDKVGTIDERSVDYIIKWMYAIGSSVYTDLCLAVHLSASVESLSNALIDDLPRRLELLTTNEMANAIFGLGEVSDMGARLSHQLVAERCSDYVVDHSQEFWSGKVIARLLYGFSRMHCTKRSLYNVFATRLAHRPVFSLLDQEAISFAIAAFGRVKYLDKKLFDRFTRWILDHSKDLNAAELLLTIRGVSRVMLLNDQLYDDLGSKAAEKVKEFPIESQCVLLSSFGSLGVEHERLASRMVSSIAENREELTDATKAVDVITSLWSMNYDVEDDKHVAQLADWVVQRAEELTDESIGKLCLVLSDTNWRHVPLVRAIAEQSVRLQGQQSISPKCCREVLDVLGTFMIHHQGARENLSALGRSISKERIQLSEEEEQHLQLLLRR 1087 T 0.017 MOR2-PAG1_C pdb F Eukaryota T 8fn6 6 F 5 Q389F5_TRYB2 RNA-editing substrate-binding complex protein 5 (RESC5) MLRHTSRNNALHAFVRSPHYRTIPSAGPNGIVVNRDMLVHQFRDFYKTLQHCSLVDKVHLMSERPSVEALRVADQMVSIGATFLEMPLTGMEHRATEFMESMRYVRGAGGPSTLASYLQDTENCRCNSGDVVCLPNGIAVGHGPRTNAVAHTTLKQLFEVKDDQFSFDVFTLEQEGDAPPLGDYFGFAGSNVLLTWKDEHGLLAVDQYQQKQPHTEMNVVYLEPGCHFLSFYGVDHTIDVLVQKGYERSMDSIAAAGLNPIPVQWSEMDKLGISMRAAVLPLKFFKANVGGMLSRNKSRGARWQTHQLQKGSGSGSASSGASAAGSSGASASSGASAAGSSGASAGHHHHHHHHHHSGSEDQVDPRLIDGKASAWSHPQFEKGGGSGGGSGGSAWSHPQFEK 402 T 0.29 ADI unp F Eukaryota T 8fnc 3 C 5 Q389F5_TRYB2 Mitochondrial RNA binding protein MLRHTSRNNALHAFVRSPHYRTIPSAGPNGIVVNRDMLVHQFRDFYKTLQHCSLVDKVHLMSERPSVEALRVADQMVSIGATFLEMPLTGMEHRATEFMESMRYVRGAGGPSTLASYLQDTENCRCNSGDVVCLPNGIAVGHGPRTNAVAHTTLKQLFEVKDDQFSFDVFTLEQEGDAPPLGDYFGFAGSNVLLTWKDEHGLLAVDQYQQKQPHTEMNVVYLEPGCHFLSFYGVDHTIDVLVQKGYERSMDSIAAAGLNPIPVQWSEMDKLGISMRAAVLPLKFFKANVGGMLSRNKSRGARWQTHQLQK 310 T 0.29 ADI pdb F Eukaryota T 8fnc 5 E 7 Q384B4_TRYB2 RxLR effector protein MRSSRGILFLSGAFAIRGMSAYHSYQRLDTVSHTSKVYSLQMQRQTVHFTPITRLGVEATANPTTATNATGQTGDGDGATALDVAMRVNKLKRLHQTGGGPSGKKQVELDAWRDLNNLTEAQINSAEGKAVSLLLNSWAYFAKYWEKGAEGPSASLSEVTPSNDSSSAGEHGTQ 174 T 0.14 LppC unppssm F Eukaryota T 8fnc 7 G 10 Q57VS6_TRYB2 RAP domain-containing protein MRRRVVLCCQDVGSLLSSKHSVHSGIGYHERVFSRNLLYRRYPVVTVLPKAGFTVLDTKRWIASSGPPVTGSPLSPVTNPSLNVGTGGGEAVAMEGPLPVSYSPGSGVNGSLPVTSTAITAHCDVLSECVAKADELAVQLKAQNALSASAEILTQEGMEEFVEELKTSATNEMTALVKQMQTTPLLQRAGMHELRRTLYYTTSLKERDWLEEKQYTAAMRMLTVEVLRRDGDGVLSADDVLYVTTHVVTANFYNRHLWNRMEKSLLKFSNYENIDMSSVKAFSTRLFKTRRGCAKETLDIRRKVLLAMSRRVGVLANDFDLPSLLGVLQCYTVHDLTPFHLEPLAIRATNHVGDFTPHECATLAHVLRKWRTMRLEVCERLVERICTSDQLTHHMANAAMIAIRTCFNQVSDGGRNAMNAEPTRQKLRAMGEQIGCRLDEVEYPALPVILSILDVVVTLKIYVPKKCLQVIFSQANDMVAIVMEQKDDLVDPKTGKRVRPITAEEGRQLQALLSHYGNDLAPELSQRMKEAFREGVLPDEASL 543 T 0.13 DUF3646 pdb F Eukaryota T 8fnf 3 C 5 Q389F5_TRYB2 Mitochondrial RNA binding protein MLRHTSRNNALHAFVRSPHYRTIPSAGPNGIVVNRDMLVHQFRDFYKTLQHCSLVDKVHLMSERPSVEALRVADQMVSIGATFLEMPLTGMEHRATEFMESMRYVRGAGGPSTLASYLQDTENCRCNSGDVVCLPNGIAVGHGPRTNAVAHTTLKQLFEVKDDQFSFDVFTLEQEGDAPPLGDYFGFAGSNVLLTWKDEHGLLAVDQYQQKQPHTEMNVVYLEPGCHFLSFYGVDHTIDVLVQKGYERSMDSIAAAGLNPIPVQWSEMDKLGISMRAAVLPLKFFKANVGGMLSRNKSRGARWQTHQLQK 310 T 0.29 ADI pdb F Eukaryota T 8fnf 5 E 7 Q384B4_TRYB2 RxLR effector protein MRSSRGILFLSGAFAIRGMSAYHSYQRLDTVSHTSKVYSLQMQRQTVHFTPITRLGVEATANPTTATNATGQTGDGDGATALDVAMRVNKLKRLHQTGGGPSGKKQVELDAWRDLNNLTEAQINSAEGKAVSLLLNSWAYFAKYWEKGAEGPSASLSEVTPSNDSSSAGEHGTQ 174 T 0.14 LppC unppssm F Eukaryota T 8fnf 7 G 10 Q57VS6_TRYB2 RAP domain-containing protein MRRRVVLCCQDVGSLLSSKHSVHSGIGYHERVFSRNLLYRRYPVVTVLPKAGFTVLDTKRWIASSGPPVTGSPLSPVTNPSLNVGTGGGEAVAMEGPLPVSYSPGSGVNGSLPVTSTAITAHCDVLSECVAKADELAVQLKAQNALSASAEILTQEGMEEFVEELKTSATNEMTALVKQMQTTPLLQRAGMHELRRTLYYTTSLKERDWLEEKQYTAAMRMLTVEVLRRDGDGVLSADDVLYVTTHVVTANFYNRHLWNRMEKSLLKFSNYENIDMSSVKAFSTRLFKTRRGCAKETLDIRRKVLLAMSRRVGVLANDFDLPSLLGVLQCYTVHDLTPFHLEPLAIRATNHVGDFTPHECATLAHVLRKWRTMRLEVCERLVERICTSDQLTHHMANAAMIAIRTCFNQVSDGGRNAMNAEPTRQKLRAMGEQIGCRLDEVEYPALPVILSILDVVVTLKIYVPKKCLQVIFSQANDMVAIVMEQKDDLVDPKTGKRVRPITAEEGRQLQALLSHYGNDLAPELSQRMKEAFREGVLPDEASL 543 T 0.13 DUF3646 pdb F Eukaryota T 8fni 3 C 5 Q389F5_TRYB2 RNA-editing substrate-binding complex protein 5 (RESC5) MLRHTSRNNALHAFVRSPHYRTIPSAGPNGIVVNRDMLVHQFRDFYKTLQHCSLVDKVHLMSERPSVEALRVADQMVSIGATFLEMPLTGMEHRATEFMESMRYVRGAGGPSTLASYLQDTENCRCNSGDVVCLPNGIAVGHGPRTNAVAHTTLKQLFEVKDDQFSFDVFTLEQEGDAPPLGDYFGFAGSNVLLTWKDEHGLLAVDQYQQKQPHTEMNVVYLEPGCHFLSFYGVDHTIDVLVQKGYERSMDSIAAAGLNPIPVQWSEMDKLGISMRAAVLPLKFFKANVGGMLSRNKSRGARWQTHQLQKGSGSGSASSGASAAGSSGASASSGASAAGSSGASAGHHHHHHHHHHSGSEDQVDPRLIDGKASAWSHPQFEKGGGSGGGSGGSAWSHPQFEK 402 T 0.29 ADI unp F Eukaryota T 8fni 5 E 7 Q384B4_TRYB2 RNA-editing substrate-binding complex protein 7 (RESC7) MRSSRGILFLSGAFAIRGMSAYHSYQRLDTVSHTSKVYSLQMQRQTVHFTPITRLGVEATANPTTATNATGQTGDGDGATALDVAMRVNKLKRLHQTGGGPSGKKQVELDAWRDLNNLTEAQINSAEGKAVSLLLNSWAYFAKYWEKGAEGPSASLSEVTPSNDSSSAGEHGTQ 174 T 0.14 LppC unppssm F Eukaryota T 8fni 7 G 9 Q585T1_TRYB2 RNA-editing substrate-binding complex protein 9 (RESC9) MLLPTLERLLERCGRPIFSNVEDVRMVMASLLDISAYVDRASTKVIAKPLRRFCHKDPDTVASVMEAVPIDAAEPTHGRRAAMLLRCLPKHSCDEVIWERAVAATLAGLKSRKWDLHDYRVAMAHAGRGGRHAPALAAAAEEFVSSSARTASQSELPALLVILTSLPELKRSPCLQVAADRIVQLSEILSPAAIGQICASVNKVSFRHTAMAIALQEEAIRFAEESDLFSAVQLFSFICQQEKEAISPDAVKCLAERVIEGKDLDQETVSVLCRALRSIPRPHRPELLREIGEMMEFLGGEVKELLELPVAKGGLKGDVSAGDIQSFISKFLSLDGLLPADHDRPGTYMAAIVACVDYITERLEDIVSDENPPFSIIPHLLNINMEETRRCGQAIIREAAEQGIHFPTLQVFRFLLALGDHNMRDQRVYRHLRNEFAKTASDIPMIQLCAALKCFVRGLMQNVETQSLDEQVEHELEKEDMDAFLRFCVENLRRGFADGMEVKCVMAATESLYQLGYTSTEFYEQVARYLGSKCSSASASVNSSETATAVCLALGEDILDRHPDVHTFLLEVEKSGLKGEASLSPTEWMNKNDPANFITPLTEIQQEGWNIINRMVETRAADTEKLTALANEYVAILKSTRVDDLKYFFGVFEEKVFKQDRILKQCLDYLVESNAAVKLSATSIGAMLNSLAAIRFTYHRSVKQFMIAISTEQWSEMDASPLVKIVSAMAKLSLRLPQVLVHVGDRLLDVYTFLSPLDTALVINSLQSIGYGNDEVLMMLMRHAASSARRWDEVSLTLLFGASGVHRLLRNVEVAAPLLEQAAGKTSSPHLRQRIAASLRRSALPRALVQSSTSLLTGGAHEVVNNPPLQLV 872 T 0.028 AAA_assoc pdb F Eukaryota T 8fni 8 H 10 Q57VS6_TRYB2 RNA-editing substrate-binding complex protein 10 (RESC10) MRRRVVLCCQDVGSLLSSKHSVHSGIGYHERVFSRNLLYRRYPVVTVLPKAGFTVLDTKRWIASSGPPVTGSPLSPVTNPSLNVGTGGGEAVAMEGPLPVSYSPGSGVNGSLPVTSTAITAHCDVLSECVAKADELAVQLKAQNALSASAEILTQEGMEEFVEELKTSATNEMTALVKQMQTTPLLQRAGMHELRRTLYYTTSLKERDWLEEKQYTAAMRMLTVEVLRRDGDGVLSADDVLYVTTHVVTANFYNRHLWNRMEKSLLKFSNYENIDMSSVKAFSTRLFKTRRGCAKETLDIRRKVLLAMSRRVGVLANDFDLPSLLGVLQCYTVHDLTPFHLEPLAIRATNHVGDFTPHECATLAHVLRKWRTMRLEVCERLVERICTSDQLTHHMANAAMIAIRTCFNQVSDGGRNAMNAEPTRQKLRAMGEQIGCRLDEVEYPALPVILSILDVVVTLKIYVPKKCLQVIFSQANDMVAIVMEQKDDLVDPKTGKRVRPITAEEGRQLQALLSHYGNDLAPELSQRMKEAFREGVLPDEASL 543 T 0.13 DUF3646 pdb F Eukaryota T 8fni 9 I 11 Q57WL2_TRYB2 RNA-editing substrate-binding complex protein 11 (RESC11) MYRLYRRTVGYQSLHQRLSACHVMCRHVSTDNSDGTTPPKPRRSGIRRVVPSDEEMAELHDLEQEVASTTSSRSKQSALSGVMVEPMRFSTSGGSGMEGDGDDLGELEAEGDEEGVGTNSLAEAENVYKRHNDGGALEKQGLAIPPSGKPTDPLLANRDDEGEGGAVPLSQAEEMTVSRSTLERQACVRSLSLEELVEAVTLYLRATKNPRLVSADEEHIFFPVLMERLNEFHVSQLLDVVECHWARSTLVRYGTTFKDMVRDRIALIATAAAKSASKRPAAAGKSGNDNRDGGAVEEEADDYDEQGDAVYVHEAEEKTSDLIILRAAEEMSPETVLRCIIVMGMSAGRRKRDLQFFQAMGMFLVHHINHYKDPHELVRVLTAFARAKIVPPKRFLALLGRRFAVLNKRKKLGSLPSYRAFVNLYKMGHDQMNTFRFLADCILETIDSNIKAEKKRLRLAQLQSSSNITAATTNENGATNESGCGGSTSSSNPTVTNITGAGDLKATHTSEGASDVAFIGDLDPHLLQNLRARERFKRLTELKPSMFTKLLLVLARFGAPHQQYLRPTTVPLILPTLRAFPPPSFTRLLRAMSLFRTTDLDLIEPVIDFMADSLGPTNVVPADVLQMVRLVAPPDVPVPRNLVKLISLCEAVYSSSASFSHSDGKSSDSADAAACAMTTLSPIRPGDMCAVAVVLLKIQMKDDVPLEALDPLTRLMEFFAERMYLLMKLHIVSLTHVDVFTDLCRQQQHPDVSGHIERLCAERRRVNDAEGDDEYYSQLDIDVRETLHRILIVNDYNTYGQYRPTPGVLQVDFKQALTEVSAFDVLEAADLFAQAFSNALKPAVERHLSRSIIAKLDGGGEEVITEGNSIVLRPPRELLLTREDLGKFVCLLQRTPLRRVRASPVVWRFVEEKAKKLGMDDVLRVVENKLATAV 934 T 0.27 DUF440 pdbpssm F Eukaryota T 8fnk 3 C 5 Q389F5_TRYB2 RNA-editing substrate-binding complex protein 5 (RESC5) MLRHTSRNNALHAFVRSPHYRTIPSAGPNGIVVNRDMLVHQFRDFYKTLQHCSLVDKVHLMSERPSVEALRVADQMVSIGATFLEMPLTGMEHRATEFMESMRYVRGAGGPSTLASYLQDTENCRCNSGDVVCLPNGIAVGHGPRTNAVAHTTLKQLFEVKDDQFSFDVFTLEQEGDAPPLGDYFGFAGSNVLLTWKDEHGLLAVDQYQQKQPHTEMNVVYLEPGCHFLSFYGVDHTIDVLVQKGYERSMDSIAAAGLNPIPVQWSEMDKLGISMRAAVLPLKFFKANVGGMLSRNKSRGARWQTHQLQKGSGSGSASSGASAAGSSGASASSGASAAGSSGASAGHHHHHHHHHHSGSEDQVDPRLIDGKASAWSHPQFEKGGGSGGGSGGSAWSHPQFEK 402 T 0.29 ADI unp F Eukaryota T 8fnk 5 E 7 Q384B4_TRYB2 RNA-editing substrate-binding complex protein 7 (RESC7) MRSSRGILFLSGAFAIRGMSAYHSYQRLDTVSHTSKVYSLQMQRQTVHFTPITRLGVEATANPTTATNATGQTGDGDGATALDVAMRVNKLKRLHQTGGGPSGKKQVELDAWRDLNNLTEAQINSAEGKAVSLLLNSWAYFAKYWEKGAEGPSASLSEVTPSNDSSSAGEHGTQ 174 T 0.14 LppC unppssm F Eukaryota T 8fnk 7 G 9 Q585T1_TRYB2 RNA-editing substrate-binding complex protein 9 (RESC9) MLLPTLERLLERCGRPIFSNVEDVRMVMASLLDISAYVDRASTKVIAKPLRRFCHKDPDTVASVMEAVPIDAAEPTHGRRAAMLLRCLPKHSCDEVIWERAVAATLAGLKSRKWDLHDYRVAMAHAGRGGRHAPALAAAAEEFVSSSARTASQSELPALLVILTSLPELKRSPCLQVAADRIVQLSEILSPAAIGQICASVNKVSFRHTAMAIALQEEAIRFAEESDLFSAVQLFSFICQQEKEAISPDAVKCLAERVIEGKDLDQETVSVLCRALRSIPRPHRPELLREIGEMMEFLGGEVKELLELPVAKGGLKGDVSAGDIQSFISKFLSLDGLLPADHDRPGTYMAAIVACVDYITERLEDIVSDENPPFSIIPHLLNINMEETRRCGQAIIREAAEQGIHFPTLQVFRFLLALGDHNMRDQRVYRHLRNEFAKTASDIPMIQLCAALKCFVRGLMQNVETQSLDEQVEHELEKEDMDAFLRFCVENLRRGFADGMEVKCVMAATESLYQLGYTSTEFYEQVARYLGSKCSSASASVNSSETATAVCLALGEDILDRHPDVHTFLLEVEKSGLKGEASLSPTEWMNKNDPANFITPLTEIQQEGWNIINRMVETRAADTEKLTALANEYVAILKSTRVDDLKYFFGVFEEKVFKQDRILKQCLDYLVESNAAVKLSATSIGAMLNSLAAIRFTYHRSVKQFMIAISTEQWSEMDASPLVKIVSAMAKLSLRLPQVLVHVGDRLLDVYTFLSPLDTALVINSLQSIGYGNDEVLMMLMRHAASSARRWDEVSLTLLFGASGVHRLLRNVEVAAPLLEQAAGKTSSPHLRQRIAASLRRSALPRALVQSSTSLLTGGAHEVVNNPPLQLV 872 T 0.028 AAA_assoc pdb F Eukaryota T 8fnk 8 H 10 Q57VS6_TRYB2 RNA-editing substrate-binding complex protein 10 (RESC10) MRRRVVLCCQDVGSLLSSKHSVHSGIGYHERVFSRNLLYRRYPVVTVLPKAGFTVLDTKRWIASSGPPVTGSPLSPVTNPSLNVGTGGGEAVAMEGPLPVSYSPGSGVNGSLPVTSTAITAHCDVLSECVAKADELAVQLKAQNALSASAEILTQEGMEEFVEELKTSATNEMTALVKQMQTTPLLQRAGMHELRRTLYYTTSLKERDWLEEKQYTAAMRMLTVEVLRRDGDGVLSADDVLYVTTHVVTANFYNRHLWNRMEKSLLKFSNYENIDMSSVKAFSTRLFKTRRGCAKETLDIRRKVLLAMSRRVGVLANDFDLPSLLGVLQCYTVHDLTPFHLEPLAIRATNHVGDFTPHECATLAHVLRKWRTMRLEVCERLVERICTSDQLTHHMANAAMIAIRTCFNQVSDGGRNAMNAEPTRQKLRAMGEQIGCRLDEVEYPALPVILSILDVVVTLKIYVPKKCLQVIFSQANDMVAIVMEQKDDLVDPKTGKRVRPITAEEGRQLQALLSHYGNDLAPELSQRMKEAFREGVLPDEASL 543 T 0.13 DUF3646 pdb F Eukaryota T 8fnk 9 I 11 Q57WL2_TRYB2 RNA-editing substrate-binding complex protein 11 (RESC11) MYRLYRRTVGYQSLHQRLSACHVMCRHVSTDNSDGTTPPKPRRSGIRRVVPSDEEMAELHDLEQEVASTTSSRSKQSALSGVMVEPMRFSTSGGSGMEGDGDDLGELEAEGDEEGVGTNSLAEAENVYKRHNDGGALEKQGLAIPPSGKPTDPLLANRDDEGEGGAVPLSQAEEMTVSRSTLERQACVRSLSLEELVEAVTLYLRATKNPRLVSADEEHIFFPVLMERLNEFHVSQLLDVVECHWARSTLVRYGTTFKDMVRDRIALIATAAAKSASKRPAAAGKSGNDNRDGGAVEEEADDYDEQGDAVYVHEAEEKTSDLIILRAAEEMSPETVLRCIIVMGMSAGRRKRDLQFFQAMGMFLVHHINHYKDPHELVRVLTAFARAKIVPPKRFLALLGRRFAVLNKRKKLGSLPSYRAFVNLYKMGHDQMNTFRFLADCILETIDSNIKAEKKRLRLAQLQSSSNITAATTNENGATNESGCGGSTSSSNPTVTNITGAGDLKATHTSEGASDVAFIGDLDPHLLQNLRARERFKRLTELKPSMFTKLLLVLARFGAPHQQYLRPTTVPLILPTLRAFPPPSFTRLLRAMSLFRTTDLDLIEPVIDFMADSLGPTNVVPADVLQMVRLVAPPDVPVPRNLVKLISLCEAVYSSSASFSHSDGKSSDSADAAACAMTTLSPIRPGDMCAVAVVLLKIQMKDDVPLEALDPLTRLMEFFAERMYLLMKLHIVSLTHVDVFTDLCRQQQHPDVSGHIERLCAERRRVNDAEGDDEYYSQLDIDVRETLHRILIVNDYNTYGQYRPTPGVLQVDFKQALTEVSAFDVLEAADLFAQAFSNALKPAVERHLSRSIIAKLDGGGEEVITEGNSIVLRPPRELLLTREDLGKFVCLLQRTPLRRVRASPVVWRFVEEKAKKLGMDDVLRVVENKLATAV 934 T 0.27 DUF440 pdbpssm F Eukaryota T 8fof 1 A A BP-ffsy XXXXX 5 F F F 8fr8 8 H 3 A0QTP4_MYCS2 50S ribosomal protein eL31 AKRGRKKRDRKHSKANHGKRPNA 23 T 0.16 DUF6254 pdb F Bacteria T 8frs 2 G,H,I f,g,h A0A2K8HLV9_9CAUD Structural protein gp24 MFQKQVYRQYTPGFPGDLIEDGPKRARPGRIMSLSAVNPAATATGPNRASRAFGYAGDVSALGEGQPKTIAARASEVVIGGANFFGVLGHPKHYALFGSAGDSLAPSYDLPDGAEGEFFDMATGLVVEIFNGAAAALDLDYGDLVAYVPNNLATADDALGLPAGALVGFKTGSMPTGLVQIPNARIVNAISLPAQSAGNLVAGVTIVQLTQ 211 T 69 CBFB_NFYA pdbhh T Viruses T 8fs3 8 H H A0A8H4BUG7_YEASX DDC1 isoform 1 MSFKATITESGKQNIWFRAIYVLSTIQDDIKITVTTNELIAWSMNETDTTLCQVRFQKSFFEEYEFKPHEIVFGENGVQVIEDTYGNSHKLYSFRVNGRHLTTISRKPDGDGIKSFTIAVNNTSTCPESLANRLIVVIEMDSLIVKEYCPQFQPIKYDPIIINLKYKRRFLDVFGTAASDRNPQEPLDPKLLDVFTNTERELTSALFNEEVESDIRKRNQLTAADEINYICCNSTLLKNFLDNCNVNVTDEVKLEINVHRLSITAFTKAVYGKNNDLLRNALSMSNTISTLDLEHYCLFTTIEDEKQDKRSHSKRREHMKSIIFKLKDFKNFITIGPSWKTTQDGNDNISLWFCHPGDPILMQMQKPGVKLELVEVTDSNINDDILEGKFIKTAISGSKEEAGLKDNKESCESPLKSKTALKRENLPHSVAGTRNSPLKVSYLTPDNGSTVAKTYRNNTARKLFVEEQSQSTNYEQDKRFRQASSVHMNMNREQSFDIGTTHEVACPRNESNSLKRSIADICNETEDPTQQSTFAKRADTTVTWGKALPAADDEVSCSNIDRKGMLKKEKLKHMQGLLNSQNDTSNHKKQDNKEMEDGLGLTQVEKPRGIFD 612 T 6.8E-09 Rad9 pdbpercent F Eukaryota T 8fs4 8 H H A0A8H4BUG7_YEASX DDC1 isoform 1 MSFKATITESGKQNIWFRAIYVLSTIQDDIKITVTTNELIAWSMNETDTTLCQVRFQKSFFEEYEFKPHEIVFGENGVQVIEDTYGNSHKLYSFRVNGRHLTTISRKPDGDGIKSFTIAVNNTSTCPESLANRLIVVIEMDSLIVKEYCPQFQPIKYDPIIINLKYKRRFLDVFGTAASDRNPQEPLDPKLLDVFTNTERELTSALFNEEVESDIRKRNQLTAADEINYICCNSTLLKNFLDNCNVNVTDEVKLEINVHRLSITAFTKAVYGKNNDLLRNALSMSNTISTLDLEHYCLFTTIEDEKQDKRSHSKRREHMKSIIFKLKDFKNFITIGPSWKTTQDGNDNISLWFCHPGDPILMQMQKPGVKLELVEVTDSNINDDILEGKFIKTAISGSKEEAGLKDNKESCESPLKSKTALKRENLPHSVAGTRNSPLKVSYLTPDNGSTVAKTYRNNTARKLFVEEQSQSTNYEQDKRFRQASSVHMNMNREQSFDIGTTHEVACPRNESNSLKRSIADICNETEDPTQQSTFAKRADTTVTWGKALPAADDEVSCSNIDRKGMLKKEKLKHMQGLLNSQNDTSNHKKQDNKEMEDGLGLTQVEKPRGIFD 612 T 6.8E-09 Rad9 pdbpercent F Eukaryota T 8fs5 8 H H A0A8H4BUG7_YEASX DDC1 isoform 1 MSFKATITESGKQNIWFRAIYVLSTIQDDIKITVTTNELIAWSMNETDTTLCQVRFQKSFFEEYEFKPHEIVFGENGVQVIEDTYGNSHKLYSFRVNGRHLTTISRKPDGDGIKSFTIAVNNTSTCPESLANRLIVVIEMDSLIVKEYCPQFQPIKYDPIIINLKYKRRFLDVFGTAASDRNPQEPLDPKLLDVFTNTERELTSALFNEEVESDIRKRNQLTAADEINYICCNSTLLKNFLDNCNVNVTDEVKLEINVHRLSITAFTKAVYGKNNDLLRNALSMSNTISTLDLEHYCLFTTIEDEKQDKRSHSKRREHMKSIIFKLKDFKNFITIGPSWKTTQDGNDNISLWFCHPGDPILMQMQKPGVKLELVEVTDSNINDDILEGKFIKTAISGSKEEAGLKDNKESCESPLKSKTALKRENLPHSVAGTRNSPLKVSYLTPDNGSTVAKTYRNNTARKLFVEEQSQSTNYEQDKRFRQASSVHMNMNREQSFDIGTTHEVACPRNESNSLKRSIADICNETEDPTQQSTFAKRADTTVTWGKALPAADDEVSCSNIDRKGMLKKEKLKHMQGLLNSQNDTSNHKKQDNKEMEDGLGLTQVEKPRGIFD 612 T 6.8E-09 Rad9 pdbpercent F Eukaryota T 8fs6 8 H H A0A8H4BUG7_YEASX DDC1 isoform 1 MSFKATITESGKQNIWFRAIYVLSTIQDDIKITVTTNELIAWSMNETDTTLCQVRFQKSFFEEYEFKPHEIVFGENGVQVIEDTYGNSHKLYSFRVNGRHLTTISRKPDGDGIKSFTIAVNNTSTCPESLANRLIVVIEMDSLIVKEYCPQFQPIKYDPIIINLKYKRRFLDVFGTAASDRNPQEPLDPKLLDVFTNTERELTSALFNEEVESDIRKRNQLTAADEINYICCNSTLLKNFLDNCNVNVTDEVKLEINVHRLSITAFTKAVYGKNNDLLRNALSMSNTISTLDLEHYCLFTTIEDEKQDKRSHSKRREHMKSIIFKLKDFKNFITIGPSWKTTQDGNDNISLWFCHPGDPILMQMQKPGVKLELVEVTDSNINDDILEGKFIKTAISGSKEEAGLKDNKESCESPLKSKTALKRENLPHSVAGTRNSPLKVSYLTPDNGSTVAKTYRNNTARKLFVEEQSQSTNYEQDKRFRQASSVHMNMNREQSFDIGTTHEVACPRNESNSLKRSIADICNETEDPTQQSTFAKRADTTVTWGKALPAADDEVSCSNIDRKGMLKKEKLKHMQGLLNSQNDTSNHKKQDNKEMEDGLGLTQVEKPRGIFD 612 T 6.8E-09 Rad9 pdbpercent F Eukaryota T 8fs7 8 H H A0A8H4BUG7_YEASX DDC1 isoform 1 MSFKATITESGKQNIWFRAIYVLSTIQDDIKITVTTNELIAWSMNETDTTLCQVRFQKSFFEEYEFKPHEIVFGENGVQVIEDTYGNSHKLYSFRVNGRHLTTISRKPDGDGIKSFTIAVNNTSTCPESLANRLIVVIEMDSLIVKEYCPQFQPIKYDPIIINLKYKRRFLDVFGTAASDRNPQEPLDPKLLDVFTNTERELTSALFNEEVESDIRKRNQLTAADEINYICCNSTLLKNFLDNCNVNVTDEVKLEINVHRLSITAFTKAVYGKNNDLLRNALSMSNTISTLDLEHYCLFTTIEDEKQDKRSHSKRREHMKSIIFKLKDFKNFITIGPSWKTTQDGNDNISLWFCHPGDPILMQMQKPGVKLELVEVTDSNINDDILEGKFIKTAISGSKEEAGLKDNKESCESPLKSKTALKRENLPHSVAGTRNSPLKVSYLTPDNGSTVAKTYRNNTARKLFVEEQSQSTNYEQDKRFRQASSVHMNMNREQSFDIGTTHEVACPRNESNSLKRSIADICNETEDPTQQSTFAKRADTTVTWGKALPAADDEVSCSNIDRKGMLKKEKLKHMQGLLNSQNDTSNHKKQDNKEMEDGLGLTQVEKPRGIFD 612 T 6.8E-09 Rad9 pdbpercent F Eukaryota T 8fs8 8 H H A0A8H4BUG7_YEASX DDC1 isoform 1 MSFKATITESGKQNIWFRAIYVLSTIQDDIKITVTTNELIAWSMNETDTTLCQVRFQKSFFEEYEFKPHEIVFGENGVQVIEDTYGNSHKLYSFRVNGRHLTTISRKPDGDGIKSFTIAVNNTSTCPESLANRLIVVIEMDSLIVKEYCPQFQPIKYDPIIINLKYKRRFLDVFGTAASDRNPQEPLDPKLLDVFTNTERELTSALFNEEVESDIRKRNQLTAADEINYICCNSTLLKNFLDNCNVNVTDEVKLEINVHRLSITAFTKAVYGKNNDLLRNALSMSNTISTLDLEHYCLFTTIEDEKQDKRSHSKRREHMKSIIFKLKDFKNFITIGPSWKTTQDGNDNISLWFCHPGDPILMQMQKPGVKLELVEVTDSNINDDILEGKFIKTAISGSKEEAGLKDNKESCESPLKSKTALKRENLPHSVAGTRNSPLKVSYLTPDNGSTVAKTYRNNTARKLFVEEQSQSTNYEQDKRFRQASSVHMNMNREQSFDIGTTHEVACPRNESNSLKRSIADICNETEDPTQQSTFAKRADTTVTWGKALPAADDEVSCSNIDRKGMLKKEKLKHMQGLLNSQNDTSNHKKQDNKEMEDGLGLTQVEKPRGIFD 612 T 6.8E-09 Rad9 pdbpercent F Eukaryota T 8fuc 3 C F Contaminant peptide KKLARE KKLARE 6 T 58 Reoviridae_Vp9 pdbhh F F 8fuv 2 B,C,D,E,F,G T,A,B,C,D,E A0A2K8HPF4_9CAUD Tail fiber protein gp32 FGSICAFTASRTFPNGFTVTEEFADADPIDSPPFAAADTGAGLNGDMVVWNRANILEVVVNVIPNTEGERNLAVLLDANRTGKDKSGARDVVGLVVAMPDGSKITCTNGTPIDGVLINAVASVGRLKTKPYRFRFEKVIKAGTS 144 T 0.06 Arch_flagellin unp T Viruses T 8fvh 1 A,B,C,D,E,F C,A,B,D,E,F A0A2K8I4A6_9CAUD E217 collar protein gp28 IPGANLLRMAFGVIGTQIVRYRKFEQRVKNDQAQYVSMFGEPFDLAASVQRVRRDQYAQFNLEFQRNYVMIFANFDMVDLDRNMAGDQFLWTGRVFQLESQGSWFYQDGWGVCLAVDIGAAKA 123 T 0.00015 Phage_H_T_join pdbhh T Viruses T 8fvh 2 G,H,I,J,K,L M,P,S,V,Y,b A0A2K8HWZ4_9CAUD E217 gateway protein gp29 MFDGELIAKLVVELNAAMTSAQEALQFPDFEVVQKAQPTQQGTSTRPTIFFQKLFDIPRGWPATDWHLDNTARKYVEITRQHVETTFQISSLHWQNPEITHVVTASDIANYVRAYFQARSTIERVKELDFLILRVSQISNEAFENDNHQFEFHPSFDMVVTYNQYIRLYENAAYSADGVLIG 182 T 0.11 HAD_2 pdb T Viruses T 8fw5 2 B B NPRL2_HUMAN GENE 21 PROTEIN,G21 PROTEIN,NITROGEN PERMEASE REGULATOR 2-LIKE PROTEIN,NPR2-LIKE PROTEIN,TUMOR SUPPRESSOR CANDIDATE 4 MGYPYDVPDYADLNGGGGGSTMGSGCRIECIFFSEFHPTLGPKITYQVPEDFISRELFDTVQVYIITKPELQNKLITVTAMEKKLIGCPVCIEHKKYSRNALLFNLGFVCDAQAKTCALEPIVKKLAGYLTTLELESSFVSMEESKQKLVPIMTILLEELNASGRCTLPIDESNTIHLKVIEQRPDPPVAQEYDVPVFTKDKEDFFNSQWDLTTQQILPYIDGFRHIQKISAEADVELNLVRIAIQNLLYYGVVTLVSILQYSNVYCPTPKVQDLVDDKSLQEACLSYVTKQGHKRASLRDVFQLYCSLSPGTTVRDLIGRHPQQLQHVDERKLIQFGLMKNLIRRLQKYPVRVTREEQSHPARLYTGCHSYDEICCKTGMSYHELDERLENDPNIIICWK 401 T 2.3E-23 NPR2 unppercent F Eukaryota T 8fw5 3 C C NPRL3_HUMAN -14 GENE PROTEIN,ALPHA-GLOBIN REGULATORY ELEMENT-CONTAINING GENE PROTEIN,NITROGEN PERMEASE REGULATOR 3-LIKE PROTEIN,PROTEIN CGTHBA MGYPYDVPDYADLNGGGGGSTMRDNTSPISVILVSSGSRGNKLLFRYPFQRSQEHPASQTSKPRSRYAASNTGDHADEQDGDSRFSDVILATILATKSEMCGQKFELKIDNVRFVGHPTLLQHALGQISKTDPSPKREAPTMILFNVVFALRANADPSVINCLHNLSRRIATVLQHEERRCQYLTREAKLILALQDEVSAMADGNEGPQSPFHHILPKCKLARDLKEAYDSLCTSGVVRLHINSWLEVSFCLPHKIHYAASSLIPPEAIERSLKAIRPYHALLLLSDEKSLLGELPIDCSPALVRVIKTTSAVKNLQQLAQDADLALLQVFQLAAHLVYWGKAIIIYPLCENNVYMLSPNASVCLYSPLAEQFSHQFPSHDLPSVLAKFSLPVSLSEFRNPLAPAVQETQLIQMVVWMLQRRLLIQLHTYVCLMASPSEEEPRPREDDVPFTARVGGRSLSTPNALSFGSPTSSDDMTLTSPSMDNSSAELLPSGDSPLNQRMTENLLASLSEHERAAILSVPAAQNPEDLRMFARLLHYFRGRHHLEEIMYNENTRRSQLLMLFDKFRSVLVVTTHEDPVIAVFQALLP 590 T 6.3E-10 NPR3 unppssm F Eukaryota T 8fw5 7 G G Schizosaccharomyces pombe LAM2, Human LAMTOR2 ortholog MGSSHHHHHHSLEVLFQGPGSMIKPKKLSSLMKQAVEETVPSIMVFTTTGSLLAYVSFEDPKDGLKRLDLAKRVRSIAALAGNMYSLYTATNPSPLVAESTDDVIAHQRDVLFETIIEFERGKLLIAAISIDGAEDKLYSKDPLLLGIVGTENAKEGMMQIKSELLKECITNELSTLGKPV 181 T 0.86 Robl_LC7 pdbpercent F T 8fw5 9 I I Schizosaccharomyces pombe LAM4, Human LAMTOR5 ortholog MDSQLSENLLKCVNETYRGAMLVRNGLPIATAGDVNAEEQRVICEWNSNAVSEVLHLHDSNTKILIATKESCVLGLIYRNT 81 T 0.016 LAMTOR5 pdbpssm F T 8fxi 2 G,H G,H 4x(beta-Asp-Arg) XXXX 4 T 160 LicD pdbhh F F 8fxq 2 C,D W,Y ALA-CYS-VAL-LYS ACVK 4 T 85 LRRC37AB_C pdbhh F F 8fy9 2 B,C,E,F B,C,E,F Cas1 MAGPIIAGKSESSELPRVEDRATFIYIEHAKINRVDSAVTVAEAKGVVRIPAAMIGVLLLGPGTDISHRAVELLGDTGTALVWVGEQGVRYYASGRALARSTRFLVKQAELVTNERSRLRVARRMYQMRFPTEDVSKLTMQQLRSHEGARVRRKYRELSKKYNVPWKKRVYNPDDFAGGDPINQALSAAHVALYGLVHSVVAALGLSPGLGFVHTGHDRSFIYDVADLYKAEITVPIAFAVAAEAEEGQDIGQLARLRTRDAFVDGKILKRMVKDLQTLLEIPEEGQIEAEPLSLWDDKEKLVPYGVNYSEVTSCP 316 T 5.3E-07 Cas_Cas1 pdb F T 8fya 2 B,C,E,F B,C,E,F Cas1 MAGPIIAGKSESSELPRVEDRATFIYIEHAKINRVDSAVTVAEAKGVVRIPAAMIGVLLLGPGTDISHRAVELLGDTGTALVWVGEQGVRYYASGRALARSTRFLVKQAELVTNERSRLRVARRMYQMRFPTEDVSKLTMQQLRSHEGARVRRKYRELSKKYNVPWKKRVYNPDDFAGGDPINQALSAAHVALYGLVHSVVAALGLSPGLGFVHTGHDRSFIYDVADLYKAEITVPIAFAVAAEAEEGQDIGQLARLRTRDAFVDGKILKRMVKDLQTLLEIPEEGQIEAEPLSLWDDKEKLVPYGVNYSEVTSCP 316 T 5.3E-07 Cas_Cas1 pdb F T 8fyb 2 B,C,E,F B,C,E,F Cas1 MAGPIIAGKSESSELPRVEDRATFIYIEHAKINRVDSAVTVAEAKGVVRIPAAMIGVLLLGPGTDISHRAVELLGDTGTALVWVGEQGVRYYASGRALARSTRFLVKQAELVTNERSRLRVARRMYQMRFPTEDVSKLTMQQLRSHEGARVRRKYRELSKKYNVPWKKRVYNPDDFAGGDPINQALSAAHVALYGLVHSVVAALGLSPGLGFVHTGHDRSFIYDVADLYKAEITVPIAFAVAAEAEEGQDIGQLARLRTRDAFVDGKILKRMVKDLQTLLEIPEEGQIEAEPLSLWDDKEKLVPYGVNYSEVTSCP 316 T 5.3E-07 Cas_Cas1 pdb F T 8fyc 4 E,F,G,H B,C,E,F Cas1 MAGPIIAGKSESSELPRVEDRATFIYIEHAKINRVDSAVTVAEAKGVVRIPAAMIGVLLLGPGTDISHRAVELLGDTGTALVWVGEQGVRYYASGRALARSTRFLVKQAELVTNERSRLRVARRMYQMRFPTEDVSKLTMQQLRSHEGARVRRKYRELSKKYNVPWKKRVYNPDDFAGGDPINQALSAAHVALYGLVHSVVAALGLSPGLGFVHTGHDRSFIYDVADLYKAEITVPIAFAVAAEAEEGQDIGQLARLRTRDAFVDGKILKRMVKDLQTLLEIPEEGQIEAEPLSLWDDKEKLVPYGVNYSE 311 T 1.7E-08 Cas_Cas1 pdbpssm F T 8fyd 3 C,D,F,G B,C,E,F Cas1 MAGPIIAGKSESSELPRVEDRATFIYIEHAKINRVDSAVTVAEAKGVVRIPAAMIGVLLLGPGTDISHRAVELLGDTGTALVWVGEQGVRYYASGRALARSTRFLVKQAELVTNERSRLRVARRMYQMRFPTEDVSKLTMQQLRSHEGARVRRKYRELSKKYNVPWKKRVYNPDDFAGGDPINQALSAAHVALYGLVHSVVAALGLSPGLGFVHTGHDRSFIYDVADLYKAEITVPIAFAVAAEAEEGQDIGQLARLRTRDAFVDGKILKRMVKDLQTLLEIPEEGQIEAEPLSLWDDKEKLVPYGVNYSEVTSCP 316 T 5.3E-07 Cas_Cas1 pdb F T 8fzm 2 B,D B,D Bimax2 SRRRRRRKRKREWDDDDDPPKKRRRLD 27 T 0.68 Med24_N pdbhh F T 8g21 1 A A RELN_HUMAN Reelin STRKQNYMMNFSRQHGLRHFYNRRRRSLRRYP 32 T 33 MazG_C pdbhh F Eukaryota T 8g2z 1 A,KB,NA,P 0A,3A,2A,1A I7LUL4_TETTS RIB27A MSQYAYDFNPAKTQSLQSRPIEHNKFSAWTGDQLYRTSYGTHWTSKPQEPKTHVPPGYAGYIPGLKPNNHYGASYGEIAKNCLSNPKVAQNPFKLASTGFNYQRHDFRDPSLTATTHKFGAQTLLKNHPSIDQKSNQWQSQTHDSFRNPLHKPNPTYRETDKDLQTQKYFTKTSGFQQNHTTFDRTGWVPEKVLHADRTTSEYRIHFNKQVPFHRDTVLFKERRLPPKEYNYKYMG 236 T 0.00015 DUF2475 pdbpercent F Eukaryota T 8g2z 7 G,V 0G,1G Q237T1_TETTS CFAP107 MNAQTIQDNVNKYRFGVLIGNFAEEKFGMDMAQRQIDERLPNSTMKDSYGLKNSALNCEPSKLTPIDKEFNQHVIFNTQGVQQHILFGHGVKQTDYNKREYGTSYDLSFNQKIKPQTQIYSKYTPDALSASRTFHKDTIFEKDYQKHIPELGSKPTVPKDKARPYNEFTKTYDSTHMKIPLRK 183 T 0.058 DUF1143 pdbhh F Eukaryota T 8g2z 18 X 1I A4VD56_TETTS CFAP143 MELNQTTLESYNKQLQKGQGTLIGNWWEERELRDVTGIGRSAHNYAKLKSTISQTGAQSLYKESPQTESVNDTNERTMGKKFNHIPNTTNSEYGKGFNKADQLPRTGPVHLRTQQQMINHIKQELNEIENQKEIQRNIRYFQTTTQTEFGPKEHAMAGCTVGRRVMRTQNGQPITPDNRDEDILVDHGFLERQPFLTDEELKNQLPQGESYLTQQPITYWTEKTTDGFGCVYQSKSNPNDPKSTFKLNNQFLKTFHDYSHVRK 263 T 0.044 DA1-like pdb F Eukaryota T 8g2z 19 Y 1J I7MLS4_TETTS CFAP21A MAANRTKDLFPGFKTAGASTKLPDESAMNCIKPKENLNPGTPEHIKKYRKSYKNQPGSTILHYGIYDDQKPPETFVYGKKIEGSDHVQQVMDSGKTDGIKQMINEIKEAKYHSRQVEPLGKRMERNYEFPQEVHQDNFKFGVATINSENTAKEVMFPQKPAVNDQAAHDQYVRSHGNYEAGEQKNRNYNWKVDPQQHRFGKTDKIASNEVVYCLNQEALQDQFPKTTIVQKKQEDFRNFKEDHLGLPKNLGQTNAKNNPDMVFGARLGGPDEWNAGKCISGEATLKEVQTDKDLGKTNRFGFRNITKDGDENRVFGVPTIRDDIQKPLMKSIADPNNYGDEKPAVNLLFPKKYDYMGVEQDDFKIKRHKYEIKDIFEKIGYKYKIGKFEGIFKRAQEIENSTDNKVSCSSFLQAIQEMDHIE 422 T 0.21 DUF4483 unphh F Eukaryota T 8g2z 31 QB,VA 3I,2I I7M2G0_TETTS STPG2 MADKVEEEQAPPKIYTYIDLLKFENPEIYSSFGKMRLSHKTTAPKYGFGTADRNKQAKVFQNKELAKTNFAGKSSPGPAYDVRDTDYFTYQKAPKWKIGSEVRNTLNTGSKHDFYLRKDVDFDPLEADIFRRPKAPTVRIGLELRFPNDPKRHKGTPGPQYNPTLRHEIPNPPKFSFGFRREIQGFSPLVANSSTPQLVGPGSYIQKNVPNTSKIRNEPKWSFPNAERFSGFQADLSNAHYTKPRAIGTQYDSRKQTLPMFSFGKSTRESKRGTFKDMMSTQEVRIRISMPKF 293 T 0.039 SHIPPO-rpt unppercent F Eukaryota T 8g2z 33 BC 4F I7M9I4_TETTS CFAP129 MISKVSGYQGFSSYGDKPYPSLKPGRQVLSPNDVWEQTQRTRDDASMYENHQHYKTVYKVDVSNAVQPKVYQNHTHVKKGLQTQYQLQATGKSVLSYGSDRKDQTFDPNNETSKLKTGVEHWKSNYNANIKDPYSYSKASRPEWSYHLKPHQVDSKIGPTEYKTTFGEFGTKPTDKLNEYGNEIINKHEDPLKMGTTKSTFHIPNYTGFIPAARTVGKSLEHAAALNSRIDKSRATIVDNYHTKIPGYAGHQPKAPVNQRGFLREHCFSTVSSLKL 276 T 0.0059 ZinT unp F Eukaryota T 8g2z 36 HC,IC,JC,KC 5A,5B,5C,5D Q236L2_TETTS OJ2 MPPIDTKKSQITDFSQSTRLQYLGDKKSQKSAAFFRDTRVSSCSYISLKGGVPRAIPYYGSPTKTYADTMSKSSYISSLNHDTYRVRPYQHVGMSQKLLEPYHPHSYRNRLPVPDAPPQFSNASQIEVGDRSEVNHRRFVSQSKNVYGNFGKFDPVSNPGILASKTKWHHHLQSK 175 T 31 DUF6014 pdbhh F Eukaryota T 8g2z 38 PC,QC,RC 5I,5J,5K OJ3 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 168 F F F 8g2z 39 VC 6F Q231B2_TETTS SB1 MIRDFVLQQTTEPEKKFNSTVLIGNWYEERCDPNREQSKFYNERKFADNNYQKYTLSENKFQDTSNSWLNFQENKPEVKNDQFITMNMQEYKKPSEQKRNSELKPFIVKKSHFDKNPHELEEYREKWTKSAHTFDRMYLGTQKPN 145 T 0.00073 DUF1143 pdbhh F Eukaryota T 8g2z 40 WC 6G Q24GM1_TETTS STPG1A MQCLLLVFQFNQNYTVKKIAEKEILSNFYLQSRLIDQQNSSYLNMSQSLPSVQKSASMTLMPIMEMYNISTRQHAAWGLDGYEVPKKYFDHLKVVQDRHFEEISKSGKATKNNKIITKRGSYLEDEIKFRGQNPGPQKYDVTYKWVSDAEIEKGKKLPKNTKKNTFIEQIFLEQQRRGIPGPGKYNILKTDEQVKAEAEKMNKKLKYGERSNYLQEYEYLSSTLPGPGNYNPRPILPKIHKDNMSPDKWIAFHKAKLSKTAKSSLPDVGTYKMNYPLDYATFGKMLVKTQEEGGNKKSSVRYMGTEERFKDPKKTKSKTSQIVPGPGQYPLVAKWQGKEQKKDSKDKNWMDSITTGISKSIYYS 364 T 0.13 SHIPPO-rpt pdbhh F Eukaryota T 8g2z 41 XC 6H Q231B6_TETTS Nebulin MTDNPQQPHKSKQEIQREQRKELARELRKAHFDLGFKEGFDDETRYREFYKWYDLEQSEKTKQEMLKLRNDLRSTHYILGTDDPNKLFVSTATQSFVKPVNPQVSQLSVETKNDLRSHHFNLGHYNDKVLSDYKLNYDQKQIDPETLKDRKEQINFLRKHNHDFGDKNNYHSSMYNENFNKSYDPHFLKQGKSKEEIHQQIVDLRKTNLVMGNNNPQFTSEAMSEFNNKPQAFRTQVDLGLKKSHFKLGEDPSLYETTTAKTYQGKQMFQHDPEKIKALSKDLRAEHFKLGNDPQSYTSEAAAKFKEFDKNSLTQQPDMSYLYRSHFNLEGFGGSNPVQHYVSNYKQNYEPKAAQKSEATRNDRADRGSHIVFGSDKIDDQFKSEAQKNFVNFGRQAPSALEKEVQADLRRHHYQFGTDQPEMISEMKKTFNDKTKESSQSKLDPNLIKDLRSNHFEYGTMGNEYTTTMQDIGRYQCQPSRLNPELAKDLRSHHFRPGDLEKYYDTTYRLAFIDFKAV 518 T 130 Malate_DH unphh F Eukaryota T 8g2z 42 BD,CD 8L,8N B5B6_fMIP XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 507 F F F 8g2z 43 DD 8P CFAP112A XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 529 F F F 8g2z 44 ED 8R B2B3_fMIP XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXAYAAAAYAYAAAAYAAAAYAAAAAAAAAAAAYAYAYAAAYAAAAAYAAAAAAYAAAAAAAYYAYAAAAYAAAAAAAAAAAAAAAAAAAAAYAAAYYYYAAAAAYAAYAAAYAAAAAAAYAAAYAAAAAAAAAAAAAAAAAAAAAAAYYAAAAAYAAAYAAAAAAAAAAAYAAAAAAYAXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 361 T 6100 Foie-gras_1 pdbhh F F 8g3d 1 A,KB,NA,P 0A,3A,2A,1A I7LUL4_TETTS RIB27A MSQYAYDFNPAKTQSLQSRPIEHNKFSAWTGDQLYRTSYGTHWTSKPQEPKTHVPPGYAGYIPGLKPNNHYGASYGEIAKNCLSNPKVAQNPFKLASTGFNYQRHDFRDPSLTATTHKFGAQTLLKNHPSIDQKSNQWQSQTHDSFRNPLHKPNPTYRETDKDLQTQKYFTKTSGFQQNHTTFDRTGWVPEKVLHADRTTSEYRIHFNKQVPFHRDTVLFKERRLPPKEYNYKYMG 236 T 0.00015 DUF2475 pdbpercent F Eukaryota T 8g3d 7 G,V 0G,1G Q237T1_TETTS CFAP107 MNAQTIQDNVNKYRFGVLIGNFAEEKFGMDMAQRQIDERLPNSTMKDSYGLKNSALNCEPSKLTPIDKEFNQHVIFNTQGVQQHILFGHGVKQTDYNKREYGTSYDLSFNQKIKPQTQIYSKYTPDLSASARTFHKDTIYEKDYQKHIPELGSKPTVPKDKARPYNEFTKTYDSTHMKIPLRK 183 T 0.058 DUF1143 pdbhh F Eukaryota T 8g3d 18 X 1I A4VD56_TETTS CFAP143 MELNQTTLESYNKQLQKGQGTLIGNWWEERELRDVTGIGRSAHNYAKLKSTISQTGAQSLYKESPQTESVNDTNERTMGKKFNHIPNTTNSEYGKGFNKADQLPRTGPVHLRTQQQMINHIKQELNEIENQKEIQRNIRYFQTTTQTEFGPKEHAMAGCTVGRRVMRTQNGQPITPDNRDEDILVDHGFLERQPFLTDEELKNQLPQGESYLTQQPITYWTEKTTDGFGCVYQSKSNPNDPKSTFKLNNQFLKTFHDYSHVRK 263 T 0.044 DA1-like pdb F Eukaryota T 8g3d 19 Y 1J I7MLS4_TETTS CFAP21A MAANRTKDLFPGFKTAGASTKLPDESAMNCIKPKENLNPGTPEHIKKYRKSYKNQPGSTILHYGIYDDQKPPETFVYGKKIEGSDHVQQVMDSGKTDGIKQMINEIKEAKYHSRQVEPLGKRMERNYEFPQEVHQDNFKFGVATINSENTAKEVMFPQKPAVNDQAAHDQYVRSHGNYEAGEQKNRNYNWKVDPQQHRFGKTDKIASNEVVYCLNQEALQDQFPKTTIVQKKQEDFRNFKEDHLGLPKNLGQTNAKNNPDMVFGARLGGPDEWNAGKCISGEATLKEVQTDKDLGKTNRFGFRNITKDGDENRVFGVPTIRNDIQKPLMKSIADPNNYGDEKPAVNLLFPKKYDYMGVEQDDFKIKRHKYEIKDIFEKIGYKYKIGKFEGIFKRAQEIENSTDNKVSCSSFLQAIQEMDHIE 422 T 0.21 DUF4483 pdbhh F Eukaryota T 8g3d 31 QB,VA 3I,2I I7M2G0_TETTS SPERM-TAIL PG-RICH REPEAT PROTEIN MADKVEEEQAPPKIYTYIDLLKFENPEIYSSFGKMRLSHKTTAPKYGFGTADRNKQAKVFQNKELAKTNFAGKSSPGPAYDVRDTDYFTYQKAPKWKIGSEVRNTLNTGSKHDFYLRKDVDFDPLEADIFRRPKAPTVRIGLELRFPNDPKRHKGTPGPQYNPTLRHEIPNPPKFSFGFRREIQGFSPLVANSSTPQLVGPGSYIQKNVPNTSKIRNEPKWSFPNAERFSGFQADLSNAHYTKPRAIGTQYDSRKQTLPMFSFGKSTRESKRGTFKDMMSTQEVRIRISMPKF 293 T 0.039 SHIPPO-rpt unppercent F Eukaryota T 8g3d 33 BC 4F I7M9I4_TETTS CFAP129 MISKVSGYQGFSSYGDKPYPSLKPGRQVLSPNDVWEQTQRTRDDASMYENHQHYKTVYKVDVSNAVQPKDYQNHTHVKKGLQTQYQLQATGKSVLSYGSDRKDQTFDPNNETSKLKTGVEHWKSNYNANIKDPYSYSKASRPEWSYHLKPHQVDSKIGPTEYKTTFGEFGTKPTDKLNEYGNEIINKHEDPLKMGTTKSTFHIPNYTGFIPAARTVGKSLEHAAALNSRIDKSRATIVDNYHTKIPGYAGHQPKAPVNQRGFLREHCFSTVSSLKL 276 T 0.0059 ZinT pdb F Eukaryota T 8g3d 36 HC,IC,JC,KC 5A,5B,5C,5D Q236L2_TETTS OJ2 MPPIDTKKSQITDFSQSTRLQYLGDKKSQKSAAFFRDTRVSSCSYISLKGGVPRAIPYYGSPTKTYADTMSKSSYISSLNHDTYRVRPYQHVGMSQKLLEPYHPHSYRNRLPVPDAPPQFSNASQIEVGDRSEVNHRRFVSQSKNVYGNFGKFDPVSNPGILASKTKWHHHLQSK 175 T 31 DUF6014 pdbhh F Eukaryota T 8g3d 38 PC,QC,RC 5I,5J,5K OJ3 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 168 F F F 8g3d 39 VC 6F Q231B2_TETTS SB1 MIRDFVLQQTTEPEKKFNSTVLIGNWYEERCDPNREQSKFYNERKFADNNYQKYTLSENKFQDTSNSWLNFQENKPEVKNDQFITMNMQEYKKPSEQKRNSELKPFIVKKSHFDKNPHELEEYREKWTKSAHTFDRMYLGTQKPN 145 T 0.00073 DUF1143 pdbhh F Eukaryota T 8g3d 40 WC 6G Q24GM1_TETTS SPERM-TAIL PG-RICH REPEAT PROTEIN MQCLLLVFQFNQNYTVKKIAEKEILSNFYLQSRLIDQQNSSYLNMSQSLPSVQKSASMTLMPIMEMYNISTRQHAAWGLDGYEVPKKYFDHLKVVQDRHFEEISKSGKATKNNKIITKRGSYLEDEIKFRGQNPGPQKYDVTYKWVSDAEIEKGKKLPKNTKKNTFIEQIFLEQQRRGIPGPGKYNILKTDEQVKAEAEKMNKKLKYGERSNYLQEYEYLSSTLPGPGNYNPRPILPKIHKDNMSPDKWIAFHKAKLSKTAKSSLPDVGTYKMNYPLDYATFGKMLVKTQEEGGNKKSSVRYMGTEERFKDPKKTKSKTSQIVPGPGQYPLVAKWQGKEQKKDSKDKNWMDSITTGISKSIYYS 364 T 0.13 SHIPPO-rpt pdbhh F Eukaryota T 8g3d 41 XC 6H Q231B6_TETTS Nebulin MTDNPQQPHKSKQEIQREQRKELARELRKAHFDLGFKEGFDDETRYREFYKWYDLEQSEKTKQEMLKLRNDLRSTHYILGTDDPNKLFVSTATQSFVKPVNPQVSQLSVETKNDLRSHHFNLGHYNDKVLSDYKLNYDQKQIDPETLKDRKEQINFLRKHNHDFGDKNNYHSSMYNENFNKSYDPQFLKQGKSKEEIHMQIVDLRKTNLVMGNNNPQFTSEAMSEFNNKPQAFRTQVDLGLKKSHFKLGEDPSLYETTTAKTYQGKQMFQHDPEKIKALSKDLRAEHFKLGNDPQSYTSEAAAKFKEFDKNSLTQQPDMSYLYRSHFNLEGFGGSNPVQHYVSNYKQNYEPKAAQKSEATRNDRADRGSHIVFGSDKIDDQFKSEAQKNFVNFGRQAPSALEKEVQADLRRHHYQFGTDQPEMISEMKKTFNDKTKESSQSKLDPNLIKDLRSNHFEYGTMGNEYTTTMQDIGRYQCQPSRLNPELAKDLRSHHFRPGDLEKYYDTTYRLAFIDFKAV 518 T 130 Malate_DH pdbhh F Eukaryota T 8g3d 42 BD,CD 8L,8N B5B6_fMIP XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 507 F F F 8g3d 43 DD 8P CFAP112A XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 529 F F F 8g3d 44 ED 8R B2B3_fMIP XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXAYAAAAYAYAAAAYAAAAYAAAAAAAAAAAAYAYAYAAAYAAAAAYAAAAAAYAAAAAAAYYAYAAAAYAAAAAAAAAAAAAAAAAAAAAYAAAYYYYAAAAAYAAYAAAYAAAAAAAYAAAYAAAAAAAAAAAAAAAAAAAAAAAYYAAAAAYAAAYAAAAAAAAAAAYAAAAAAYAXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 361 T 6100 Foie-gras_1 pdbhh F F 8g57 1 A K SIR6_HUMAN NAD-DEPENDENT PROTEIN DEACETYLASE SIRTUIN-6,PROTEIN MONO-ADP-RIBOSYLTRANSFERASE SIRTUIN-6,REGULATORY PROTEIN SIR2 HOMOLOG 6,HSIRT6,SIR2-LIKE PROTEIN 6 GSMSVNYAAGLSPYADKGKCGLPEIFDPPEELERKVWELARLVWQSSNVVFHTGAGISTASGIPDFRGPHGVWTMEERGLAPKFDTTFESARPTQTHMALVQLERVGLLRFLVSQNVDGLHVRSGFPRDKLAELHGNMFVEECAKCKTQYVRDTVVGTMGLKATGRLCTVAKARGLRACRGELRDTILDWEDSLPDRDLALADEASRNADLSITLGTSLQIRPSGNLPLATKRRGGRLVIVNLQPTKHDRHADLRIHGYVDEVMTRLMKHLGLEIPAWDGPRVLERALPPLPRPPTPKLEPKEESPTRINGSIPAGPKQEPCAQHNGSEPASPKRERPTSPAPHRPPKRVKAKAVPSKLN 360 T 3.2E-07 SIR2 unppercent F Eukaryota T 8g59 4 D A GNAI1_HUMAN;GNAQ_HUMAN ADENYLATE CYCLASE-INHIBITING G ALPHA PROTEIN,GUANINE NUCLEOTIDE-BINDING PROTEIN ALPHA-Q MGCTLSAEDKAAVERSKMIDRNLREDGEKAAREVKLLLLGAGESGKSTIVKQMKIIHEAGYSEEECKQYKAVVYSNTIQSIIAIIRAMGRLKIDFGDSARADDARQLFVLAGAAEEGFMTAELAGVIKRLWKDSGVQACFNRSREYQLNDSAAYYLNDLDRIAQPNYIPTQQDVLRTRVKTTGIVETHFTFKDLHFKMFDVGAQRSERKKWIHCFEGVTAIIFCVALSDYDLVLAEDEEMNRMHESMKLFDSICNNKWFTDTSIILFLNKKDLFEEKIKKSPLTICYPEYAGSNTYEEAAAYIQCQFEDLNKRKDTKEIYTHFTCSTDTENIRFVFAAVKDTILQLNLKEYNLV 354 T 6.200000000000001E-119 G-alpha unp F Eukaryota T 8g85 1 A,C,E G,D,J Q2N0S6_9HIV1 ENV POLYPROTEIN AENLWVTVYYGVPVWKDAETTLFCASDAKAYETEKHNVWATHACVPTDPNPQEIHLENVTEEFNMWKNNMVEQMHTDIISLWDQSLKPCVKLTPLCVTLQCTNVTNNITDDMRGELKNCSFNMTTELRDKKQKVYSLFYRLDVVQINENQGNRSNNSNKEYRLINCNTSACTQACPKVSFEPIPIHYCAPAGFAILKCKDKKFNGTGPCPSVSTVQCTHGIKPVVSTQLLLNGSLAEEEVMIRSENITNNAKNILVQFNTPVQINCTRPNNNTRKSIRIGPGQAFYATGDIIGDIRQAHCNVSKATWNETLGKVVKQLRKHFGNNTIIRFANSSGGDLEVTTHSFNCGGEFFYCNTSGLFNSTWISNTSVQGSNSTGSNDSITLPCRIKQIINMWQRIGQCMYAPPIQGVIRCVSNITGLILTRDGGSTNSTTETFRPGGGDMRDNWRSELYKYKVVKIEPLGVAPTRCKRRVVGRRRRRR 481 T 3.5E-54 GP120 pdbpercent T Viruses T 8g8k 1 A A ACEAB_MYCTU ICL,ISOCITRASE,ISOCITRATASE EVPRKLLEEWLAMWSGHYQLKDKLRVQLRPQRAGSEVLELGIHGESDDKLANVIFQPIQDRRGRTILLVRDQNTFGAELRQKRLMTLIHLWLVHRFKAQAVHYVTPTDDNLYQTSKMKSHGIFTEVNQEVGEIIVAEVNHPRIAELLTPDRVALRKLITKEA 162 T 9.4 Lentiviral_Tat pdbhh F Bacteria T 8g9w 2 C,D,H E,G,P Q2N0S6_9HIV1 ENV POLYPROTEIN AENLWVTVYYGVPVWKDAETTLFCASDAKAYETEKHNVWATHACVPTDPNPQEIHLENVTEEFNMWKNNMVEQMHTDIISLWDQSLKPCVKLTPLCVTLQCTNVTNNITDDMRGELKNCSFNMTTELRDKKQKVYSLFYRLDVVQINENQGNRSNNSNKEYRLINCNTSACTQACPKVSFEPIPIHYCAPAGFAILKCKDKKFNGTGPCPSVSTVQCTHGIKPVVSTQLLLNGSLAEEEVMIRSENITNNAKNILVQFNTPVQINCTRPNNNTRKSIRIGPGQAFYATGDIIGDIRQAHCNVSKATWNETLGKVVKQLRKHFGNNTIIRFANSSGGDLEVTTHSFNCGGEFFYCNTSGLFNSTWISNTSVQGSNSTGSNDSITLPCRIKQIINMWQRIGQCMYAPPIQGVIRCVSNITGLILTRDGGSTNSTTETFRPGGGDMRDNWRSELYKYKVVKIEPLGVAPTRCKRRVVGRRRRRR 481 T 3.5E-54 GP120 pdbpercent T Viruses T 8g9x 2 C,D,H E,G,P Q2N0S6_9HIV1 ENV POLYPROTEIN AENLWVTVYYGVPVWKDAETTLFCASDAKAYETEKHNVWATHACVPTDPNPQEIHLENVTEEFNMWKNNMVEQMHTDIISLWDQSLKPCVKLTPLCVTLQCTNVTNNITDDMRGELKNCSFNMTTELRDKKQKVYSLFYRLDVVQINENQGNRSNNSNKEYRLINCNTSACTQACPKVSFEPIPIHYCAPAGFAILKCKDKKFNGTGPCPSVSTVQCTHGIKPVVSTQLLLNGSLAEEEVMIRSENITNNAKNILVQFNTPVQINCTRPNNNTRKSIRIGPGQAFYATGDIIGDIRQAHCNVSKATWNETLGKVVKQLRKHFGNNTIIRFANSSGGDLEVTTHSFNCGGEFFYCNTSGLFNSTWISNTSVQGSNSTGSNDSITLPCRIKQIINMWQRIGQCMYAPPIQGVIRCVSNITGLILTRDGGSTNSTTETFRPGGGDMRDNWRSELYKYKVVKIEPLGVAPTRCKRRVVGRRRRRR 481 T 3.5E-54 GP120 pdbpercent T Viruses T 8g9y 2 C,D,H E,G,P Q2N0S6_9HIV1 ENV POLYPROTEIN AENLWVTVYYGVPVWKDAETTLFCASDAKAYETEKHNVWATHACVPTDPNPQEIHLENVTEEFNMWKNNMVEQMHTDIISLWDQSLKPCVKLTPLCVTLQCTNVTNNITDDMRGELKNCSFNMTTELRDKKQKVYSLFYRLDVVQINENQGNRSNNSNKEYRLINCNTSACTQACPKVSFEPIPIHYCAPAGFAILKCKDKKFNGTGPCPSVSTVQCTHGIKPVVSTQLLLNGSLAEEEVMIRSENITNNAKNILVQFNTPVQINCTRPNNNTRKSIRIGPGQAFYATGDIIGDIRQAHCNVSKATWNETLGKVVKQLRKHFGNNTIIRFANSSGGDLEVTTHSFNCGGEFFYCNTSGLFNSTWISNTSVQGSNSTGSNDSITLPCRIKQIINMWQRIGQCMYAPPIQGVIRCVSNITGLILTRDGGSTNSTTETFRPGGGDMRDNWRSELYKYKVVKIEPLGVAPTRCKRRVVGRRRRRR 481 T 3.5E-54 GP120 pdbpercent T Viruses T 8ga8 3 C J SAP30_YEAST Transcriptional regulatory protein SAP30 MARPVNTNAETESRGRPTQGGGYASNNNGSCNNNNGSNNNNNNNNNNNNNSNNSNNNNGPTSSGRTNGKQRLTAAQQQYIKNLIETHITDNHPDLRPKSHPMDFEEYTDAFLRRYKDHFQLDVPDNLTLQGYLLGSKLGAKTYSYKRNTQGQHDKRIHKRDLANVVRRHFDEHSIKETDCIPQFIYKVKNQKKKFKMEFRG 201 T 0.041 NAM-associated pdb F Eukaryota T 8gai 2 B,D B,D Bimax2 QSGSRRRRRRKRKREWDDDDDPPKKRRRLD 30 T 0.85 Med24_N pdbhh F T 8gaj 2 B,D B,D THAN_PODMA Thanatin GSKKPVPIIYCNRRTGKCQRM 21 T 2.6 YihI pdbhh F Eukaryota T 8gak 2 B,D B,D Thanatin GSKPVPIIACNRKTGKCTRI 20 T 4.5 Fuz_longin_3 pdbhh F T 8gal 2 B,D B,D Thanatin GSKPVPIIACNRKTGKCRRI 20 T 4.7 Fuz_longin_3 pdbhh F T 8gap 3 C E RFA2_TETTS Telomerase holoenzyme Teb2 subunit MSNRVQGGFDNNSGNNQSAQKQQAEKIPQITVPLNCFMINQIVKAAKENPQAHSGNHYEWYGAFENAIITAKFEFLQSINDSPKIMGKLSDSTGCIEVVIQKSKMSDELPEFVQAYEIELQNNGNRHKYVRAMLKMRKNAQIQLLYFSIVNDANEISRHGLDLCLRYLQRKHGIEDFMHMTNDKAHNNHNASAQKVHYQIDRNQQPKEQVLELMRQILKHNPNDQIPKSKIIEFFQSQLNQVQINQILQQLVSANEIFSVGSDNYLLNV 269 T 0.012 HSM3_N pdb F Eukaryota T 8gap 5 E G TAP50_TETTS P50 MKLLLQNQNIFQKLKNTLNGCIKKFYDTYQDLEQMQKFEMIVEDKLLFRYSCSQSEMFSAQIQAHYLEKRVLQLTDGNVKYIVNFRDKGVLDKANFFDTPNNSLVIIRQWSYEIYYTKNTFQINLVIDEMRCIDIITTIFYCKLELDFTQGIKGISKSSSFSNQIYEYSAQYYKAIQLLKKLLINDSYISELYNSTKSKQQPRLFIFQSFKPKMNLAEQNLSRQFEQCQQDDFGDGCLLQIVNYTHQSLKQIENKNNSNQIVNGQNEISKKKRVLKSNEDLYKISLQKQLKIFQEEEIELHSQSTIRNQTNQQLETFESDTSKRNSEKILHSINELNTSKQKVNQMNSSQHQIQKLENNNLNKNILNQINENDIKNELEERQQQHLTQSFNSKAQLKKIITLKKNQDILLFKPQEQEGSKKY 422 T 0.093 DUF6235 pdb F Eukaryota T 8gas 3 C,G,K G,D,J Q2N0S6_9HIV1 Envelope glycoprotein gp120 AENLWVTVYYGVPVWKDAETTLFCASDAKAYETEKHNVWATHACVPTDPNPQEIHLENVTEEFNMWKNNMVEQMHTDIISLWDQSLKPCVKLTPLCVTLQCTNVTNNITDDMRGELKNCSFNMTTELRDKKQKVYSLFYRLDVVQINENQGNRSNNSNKEYRLINCNTSACTQACPKVSFEPIPIHYCAPAGFAILKCKDKKFNGTGPCPSVSTVQCTHGIKPVVSTQLLLNGSLAEEEVMIRSENITNNAKNILVQFNTPVQINCTRPNNNTRKSIRIGPGQAFYATGDIIGDIRQAHCNVSKATWNETLGKVVKQLRKHFGNNTIIRFANSSGGDLEVTTHSFNCGGEFFYCNTSGLFNSTWISNTSVQGSNSTGSNDSITLPCRIKQIINMWQRIGQCMYAPPIQGVIRCVSNITGLILTRDGGSTNSTTETFRPGGGDMRDNWRSELYKYKVVKIEPLGVAPTRCKRRVVGRRRRRR 481 T 3.5E-54 GP120 pdbpercent T Viruses T 8gbs 2 D C GAS VESICLE STRUCTURAL PROTEIN C, GVPC XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 33 F F F 8gch 1 A E CTRA_BOVIN GAMMA-CHYMOTRYPSIN A CGVPAIQPVLSGL 13 T 2 CaM_bind pdbhh F Eukaryota T 8gch 4 D C GLY ALA TRP PEPTIDE GAW 3 T 26 B_solenoid_dext pdbhh F F 8gdi 1 A A B2HHT9_MYCMM CYP124A1 GLLPRVNGTPPPEVPLADIELGSLEFWGRDDDFRDGAFATLRREAPISFWPPIELAGLTAGKGHWALTKHDDIHFASRHPEIFHSSPNIVIHDQTPELAEYFGSMIVLDDPRHQRLRSIVSRAFTPKVVARIEASVRERAHRLVAAMIENHPDGQADLVSELAGPLPLQIICDMMGIPEEDHEQIFHWTNVILGFGDPDLTTDFDEFLQVSMAIGGYATALADDRRVNHHGDLTTSLVEAEVDGERLSSSEIAMFFILLVVAGNETTRNAISHGMLALSRYPDERAKWWSDFDGLAATAVEEIVRWASPVVYMRRTLSQDVDLRGTKMAAGDKVTLWYCSANRDEEKFADPWTFDVTRNPNPQVGFGGGGAHFCLGANLARREIRVVFDELRRQMPDVVATEEPARLLSQFIHGIKRLPVAWS 423 T 1.3E-22 p450 unppssm F Bacteria T 8gh7 2 C,D D,H BIR3 inhibitor MAA-CHG-PRO-ZHW XXPX 4 T 1300 NHL pdbhh F F 8ght 1 A,B A,B A0A0H3LM39_BORBR Putative membrane protein GSHMNQPSSLAADLRGAWHAQAQSHPLITLGLAASAAGVVLLLVAGIVNALTGENRVHVGYAVLGGAAGFAATALGALMALGLRAISARTQDAMLGFAAGMMLAASAFSLILPGLDAAGTIVGPGPAAAAVVALGLGLGVLLMLGLDYFTPHEHERTGHQGPEAARVNRVWLFVLTIILHNLPEGMAIGVSFATGDLRIGLPLTSAIAIQDVPEGLAVALALRAVGLPIGRAVLVAVASGLMEPLGALVGVGISSGFALAYPISMGLAAGAMIFVVSHEVIPETHRNGHETTATVGLMAGFALMMFLDTALG 312 T 1.1E-26 Zip pdb F Bacteria T 8gi1 1 A,B,C,D,E,F,G,H A,B,C,D,E,F,G,H E6K399_9BACT Accessory protein Csx28 MDYMELAKEAFSIICTFIAAYVAYYYAIKQLHQKSVENIEYAKYQAVLQAHKSLYKLLRFTTNTENEDSILIWEKTKDGKQEATYYFRKENIRKFIKELSKEIYNEGCGIFMSKEALSLISEYRNIVYGFMLSAQNNPQETIRITNRESVERMKKIHQNLSIEIRQAINLKKRDLRFENLYFQ 183 T 0.0028 DUF6019 pdbpercent F Bacteria T 8gi5 1 A,B,C,D A,B,C,D Pyrene peptide XYSPTSPS 8 T 0.045 RNA_pol_Rpb1_R pdbhh F F 8gih 1 A,B,C,D,E,F A,B,C,D,E,F L7R9I1_HBV CORE ANTIGEN,CORE PROTEIN,HBCAG,P21.5 MGSMDIDPYKEFGATVELLSFLPSDFFPSVRDLLDTAAALYRDALESPEHCSPHHTALRQAILCWGDLMTLATWVGTNLEDPASRDLVVSYVNTNVGLKFRQLLWFHISCLTFGRETVLEYLVSFGVWIRTPPAARPPNAPILSTLPETTVVKLENLYFQ 160 T 3.9E-25 Hepatitis_core unp T Viruses T 8giu 1 A,B Z,Y G1JWB5_9CAUD gp_4 (capsid accessory protein) MANRTVSPSTQGVRPAMRQMYNGRNVATRPIPLIVDTSEIRAIMAAAADARPKTSAVNFPQSGPRPAGAAVVFGTKVSGAPGNVVSNNAATFAPLTGTQNFE 102 T 64 CLP1_N pdbhh T Viruses T 8giu 2 C,D,E,F,G,H,I E,F,A,B,C,D,G G1JWD4_9CAUD Capsid protein MATKELKIGGVPVFPIFGGTAPVRQEGIMTQGDLVTVTSDGIDLNALWNSFAESIAIYNEAMDNLIQLLTYPVTVPVEPVVQIGETTFEEATELGVPRGAGLPIEVFQMGYDLRHYDKRNAYSWMFLADADGRQVEAIHDAVLWADKRLVFRKVMEALFDNRTRRANIRNQAYNVYPLYNGDGVPPPRFKNNVFDETHSHYVISHNSVVDSSDLEDLMELLAEHGYSPQAGTQFLLLANKAETDAIRQFRRGVVNNNGATAGYDFIPSPTQPAMMLPNAEGLLGNQPAPTFGGLAVIGSYGFWNIVEEDYIPPGYLVGVGYGGAFNLGNPVGLRQHANPAMQGLRIIAGNYQRYPLVDGFYARSFGTGVRQRGGAAIMQIKASGAYECPPIYKKGGGFLV 400 T 0.78 DUF5309 pdbhh T Viruses T 8giu 3 J,K,L,M,N,O,P 1,2,3,4,5,6,7 G1JWD3_9CAUD gp_22 (Minor Capsid Protein) MALKTKPRWDKYDGYVGNYRGVLGEDIDLDTEANRVLAVGTNSNGAIVVGAGQTGIKGLMIVAVGADIHGAMLDGGINNHAGDPQDVGKHGEITNFQPTVFGRTFGVAISATEGNVKLAVNGVDTGNIAYDTSAANLKSGIVAVDDGFTADDFTVTGTAPNFTIVTTRTDVTITASGEGVTVTEATSVAAAGTNYYGHADGTVNAVKGSDGVYVGHTQEADRLIVNVKDEED 232 T 14 DUF5114 pdbhh T Viruses T 8gjs 2 B B ACE-LEU-THR-PHE-ALA-GLU-TYR-TRP-ALA-GLN-LEU-DAL-ALA-ALA-ALA-ALA-ALA-DAL XLTFXEYWAQLXAAAAAX 18 T 0.61 Abi_alpha pdbhh F T 8gjv 1 A,B,C,D A,B,C,D Intermediate compound 10 for maxamycins synthesis XXXXXXXXX 9 F F F 8glv 9 JD,KW,MMA 5E,Gr,OQ A0A2K3E5X9_CHLRE FAP34 MLNVTGGRRPVASWRTPPGFLERLADAWPAVLDGAVEQAGGDPARVTRDSFLAALREALPGLSAAEDDYARQVSLSVIQQVRGSNVFFPDLDYLQAALLQGRVPPQELDQPRSTLSLATFTTTTRSGTKSLDLFKTTGVTWKIPKGFLNRYNDCNHEVLRRAAALVGARHDGARDVVAGVWGRVDVPTFVEACRQVLGEISADEEEYLIALASEQVQDGTAYIRDLPFLDKCIQNGKTPTSIKGPELLPSIFLNDTTSGKTDGMTLRHTGGRIF 274 T 0.2 FliX pdb F Eukaryota T 8glv 11 LD,MD,ND,NMA,OD,OW,PW,QW 5H,5I,5J,OV,5K,Gv,Gw,Gx FLTOP_CHLRE CILIA- AND FLAGELLA-ASSOCIATED PROTEIN 126,FLAGELLUM-ASSOCIATED PROTEIN 126 MSRSYPGEQVEHAFNSKRLKNWEVPAVDKSQAISTSTGTRFGTLQPRSGRTQFIVDDNGHLKSGVPKLEKSAFNFTQTTPVFMDSAPRWPKENPTWPKNMKATMGYKGIQSNYLPTNTVTLKAVEVPGTTERNFNFM 137 T 6.8 DUF3697 pdbhh F Eukaryota T 8glv 17 GE,PX,VMA 6E,HN,Om A0A2K3E1X6_CHLRE FAP143 MAEETQPYTSYNKQDEVPTLIGNWVEERELKELTGVTRNLAASQALKDTSDGTSPTRSLGDALTATHPRVIEHVQAQTHAADWQSTVQATYRPPSDATRNAAAYVNTSKMGPRERMLHEQLMREAQDLPPELQATLTGPAVPVTTASTYGADFHQHDLTGIVVGAKVMKDRDGRPAVRDPTFLAETQMMKKDAADRLMGETARQSGARDTTMLPNPDVPVTIYTEAVANKTYGGVFPGTTTLNTAAPFGKSTNFSKPMSDYSKVVVDE 268 T 0.093 DUF1143 pdbhh F Eukaryota T 8glv 19 IE,JE,RX,SX,XMA 6G,6H,HP,HQ,Op A0A2K3DZF0_CHLRE FAP166 MSLTLNGLDESMRRMQGYEVTRAPEDVGNSIPNFKEGIFTYKGSRQAPWKSEQTHSFSLPNAYTARVLNGTIVHTGGATEMAITTHHTVERPMMPPGTIRGSTWVKPQYIPTDDPALDELHAVAYVVSPQLPALMDACNSYHLHSADGWITTAGFMTAARRAGLTLSRAEYLALERALTKDTMGRINYLQLEALVQAVTAADQTGEGGAEPAAE 214 T 0.028 EF-hand_11 pdbhh F Eukaryota T 8glv 21 AY,BNA,BY,CNA,SE,TE,UE,ZX HZ,Oy,Ha,Oz,6Q,6R,6S,HY A0A2K3DTN6_CHLRE FAP339 MDLKQQVKNYTMTIRNTRPPTMIKEQDKSEFSHFRALQVLANGDEVPYEATLRNVIHDGARQPKLPPRQTQKHPGYIRNESGGFFTS 87 T 0.092 DUF3337 pdbpercent F Eukaryota T 8glv 24 EF,FF,LY,MY,NNA 7D,7E,Hk,Hl,PK A8JF23_CHLRE Flagellar associated protein MATNSTGPWATGTFSPNSTGTVTQYNHPMFVSQRLTGNFTSQFEMNSLPSHKYETLPIRSGHLPGYQGHVPGGVGAIAQRKPAAAMHTMTHLATSGSLPKGSPQTDMSLVDLRPEQRSMAKVYMYAEGAKTSFLKFPTPKTFDHRN 146 T 0.74 DUF2475 pdbhh F Eukaryota T 8glv 25 GF,OY 7G,Hn A8HSW0_CHLRE Flagellar associated protein MAAYAHNDGAPDISQAFQNTVLVKNWYEDRFQSQVASATGRTLRELPTHERVVHKAVPPGHPGLFQTTKQAAEEKLLTTPPPAKVKKPSMYTEANVAERLQTYGLADNIHYTIGPNAATEASWAPVHNLTTTNKEFYEIKPEAARAADPDTFRASGPSPFAKTGFCAKSVKGEASDETTVAGGKGARGEITRRPGESGNPYGVSVFVDEYGKWGSAIQGMPLTETRARMQTKYFP 235 T 0.25 DUF1143 pdbpercent F Eukaryota T 8glv 27 IF,JF,QY 7I,7J,Hp A0A2K3DZI2_CHLRE FAP129 MVHKGPNQAGNKGLLTYNNAVGIPGYTGFMPSTNALALPVKGFEHTGRPAASAEVEKLTVKSVDPRKTSQYADDYHKKPADTKAFSKTGGGYWISQRVLPPHTAFTATTTYRAETLNAEPNTAAILDRSQGLASTLVGYEAARQAGEVRRSISADPRARAEDTARGIGTQTVLGRPGSGGNTSILATVAAAAPSSPQAGSSVMVSSARRPATVPTKYGELPGYQTTYGAATDKMARMQADNELNGTGSFAPSNMGDPRFKTLPRVMNPGMGRNYSSYVAEYGGDGHDPMARQAANKDTMTRISVTRDLAGGTTRNVSHIPRYTGHIPASEYATPEARAQGEAAEPRPDHKSQALTYTLDQYPRGRLPGYTGFKAQAPANIDAGLKHSMKLPCHSTTSGDATLRGTQFGVPHQDHTHYINSRAGLNSFFSNSVVGTEFVSDNGLFNAQVYYKEAKSQGALGIKTAQPSKLTHYGAPFRAAASMV 483 T 0.0018 SPATA48 pdb F Eukaryota T 8glv 28 KF,ONA,RY,SY 7K,PQ,Hq,Hr A0A2K3CNK8_CHLRE FAP21 MSLTTQSLRRTNYEAEMTQPQIPPAGITGKLHETAKDALTWNDERPSTPDDIKKYRQSTVHEPGKIVRHPGHADDPVPQGPFGVKSAASGGQNINEALKNYPDSELARWKLEQAEGVYASAQREPLGAGYVRGHRLPEGLGSERPFGVTYDARGKDLSRQAAAVIFPTDRPAEEDAATRAMYTRSHQDFQPGEQRRRDYNWDAAGIDPAQHRFGAVDRNGVGDGVRKALQPGLDPSLQAPKVLPKLHEDFKATATDYLGRPRQLGTGDRPQLAPDHAFGQPSMRKGREPGVGELLTGRFGADEQQPDADLGKSLREGYRNQPKPGDEGRAFGVPTIRTDVRLPRLRSVANACNYGNEPDAGQVLRPPRAADLGISDEAFVALRPKSELRQLVDEAGLALSDADFEAAWALAAEADGGAAAAGEGGGAAEGPEGRACVDTFFRARHHLLAQTLQIEPTF 458 T 0.035 DUF6395 pdbpssm F Eukaryota T 8glv 29 LF,MF,UY 7M,7N,Ht A8IXN7_CHLRE Flagellar associated protein MSILGPADRRPELALTGTTISHLKTWRTEYLDEYSDIKLAAGVPEQRMEMAGITAHIGTITGRHTHMHKETTRLPTGHPPSSTYRAQDAVPIGTMTRGTGTITKLGDSCLYDKEQTWAHWRVAVDGKPADTRRKYRGVS 139 T 0.12 Autoind_bind pdbpercent F Eukaryota T 8glv 30 NF,PNA,WY 7P,PV,Hv A8JC52_CHLRE Flagellar associated protein MQGDRWSRNCGSGGVGHSGTVNEYRSGVLIGNFVENAAKTTGRMGETILSHTGPGAQTGIPTTTQKRSYTAEGKTGEYLVEASTRHDLNQPGVKGELLTRHGRFDEPPVQCLGTTYQLTYGRADGTDRRVQSYLWHGRKQVDYFVPHSTGGPSTLSLTARKQQEWGTQGATDAYLTTKMAATQPAALATAENPTRTQTLRPLGDSGLMPQPGQKPKGFARDELDKPHHRTGLRVNYRS 238 T 3.1 DUF1143 pdbhh F Eukaryota T 8glv 31 OF,QNA,URA,VRA,XY 7Q,PW,zn,zo,Hw A0A2K3D7C7_CHLRE FAP306 MDATTKTLKSTTRVDNSTNPNFKHTSTFHTRGQWTPESPPPLTSTYTIFHGERPELPRYVPKYAVSPETAALTSRHGSSPYSFRATAERAGSTPDGRATYRFSGLPAGVSPYSTGTKLSSSTLGSSGLPPVQYKSYLTEYVDEYREPLEQLDTQRSLTLKYGTTGGYRTTQRSTRSDGQPKYQTRVVAF 189 T 76 DUF4851 pdbhh F Eukaryota T 8glv 32 OZ,QF,RNA ID,7Y,Pe A8HPK6_CHLRE Flagellar associated protein MGAANENIHMTDGIRRETMKKETLARERSLAAQSPYMAQVATYRARNPPLDHSRLMQDPKVQDWASIAGTRRSLATNVPDGGPRVNVNLLKYKRDADFISTTPYDGGPSYNAETCMQNWAEDRRDKHYKSGFHPKELRRSTRYDSEYSARFKPTSADYVGRLTHTYNTTSRFEGLTRVGTNGIAAPVLPKRSADTSGEHVFYAKDGYGPTPWMDHTAPTARGRFWVGTAPHVAHDTITHSTLRSEPLEFQQRCPTEDARSKILMGNKPLTHESDRTLRIRDDLVATNTFTRTWRTMYQSDHVDFSRRPATVR 312 T 0.034 DUF1143 pdbpercent F Eukaryota T 8glv 33 RF,RZ,SF,SNA,SZ 8B,IG,8C,Pi,IH A8ICC1_CHLRE Flagellar associated protein MNNNKLDEAAILAGCKGVFSKTSYITHTGQEGKAEEYEKKGGHRSAFAGKQLATAPLKEGKTVDVYFTKKHDWISDKDPYVDRIRYKDSNQEKKKGFYTSDFSKRDEFTNTIRTEQWREQLKGENTHAKKALDMFAEATGLEASQLRTSRKMEPEVFMYDQVFEKEDPGFDGASRTHRDTKNKTMLSRDRANGEMMTTTALAFQAPDEHHKPEHARKPLVRETFFRKTNVFFPEGCAADPST 242 F F Eukaryota T 8glv 35 JH,KT,LJA,MG,QJA,XJA,ZJA AY,Fb,MM,AB,MR,MY,Ma A8IPZ5_CHLRE Outer dynein arm-docking complex subunit 1 MAQKSTLKLPRLRTKEELLKTSPELCKLLGEDSDDGRSMSPFTAPPPAGTVKPPSRGLPAVSTKATKGPGMDTPRGLGEEELTEEELLRLELEKIKNERQVLLDSIKLVKAQAGTAGGEAQQNDIKALRRELELKKAKLNELHEDVRRKENVLNKQRDDTTDASRLTPGELSEEQAYIQQLQDEMKQIDEELVEAEAKNRLYYLLGERTRREHLAMDMKVRASQQLKKDSADDLYTLTAHFNEMRAAKEQAERELARMKRMLEETRVDWQKKLRERRREVRELKKRQQKQLERERKMREKQLERERQERELQAKLKMEQDSYEMRVAALAPKVEAMEHSWNRIRTISGADTPEEVLAYWEGLKAKEEQMRSLVSLAEQRESSAKSEIAALLENRSGMYEKGSAAAADVGEGSEERATLITEVERNMEGAKGKFNKLRSVCIGAEQGLRSLQERLMIALEEIHPDQLRASHMKGGHDAKARGKGAASAGARRGSAHAHTPDRNKRGPATGSRSQSPALVPHSPAGDKPSSPLHGTSPEHGHEPIPEGAEELAGEAEMVSPLGADGNTIDDEHFFPELPELLTSVTDRLNRVLVLAAELDAQEPAGAGEDGLPLSGEPGADGAEGAAPASPSRGAPEGLSESERTLVKGMNRRTWTGAPLLETINASPSEAALTLNIKRKKGKKKEQQVQPDLNRILGYTGSDVEEEEPESEEETEEEANKDDGVVDRDYIKLRALKMSQRLANQQRAIKV 749 T 0.00034 CALCOCO1 pdbhh F Eukaryota T 8glv 38 PG AE A0A2K3CYR5_CHLRE IC97/Casc1 N-terminal domain-containing protein MAPKDAAKGKKKKKTKEELEEERRQAEEAARLAEEERLRAEEAERQRLAELERQRLELLGQFLDAEKARLDAQLSELDPLLRQFEHERSRSRAAAREAAEWERFLRCVDVPHPRQRVPLAEFLRRMHEAATKDVTGSPDGRDLRAAFLAVEACRTVILEARQELLAARHSSELAAPPAAAAAAAGGGGGASAEEAQAAAGSAARGAAGGAAVAEWAEALESDLRTLYGIVNARIDRLTAAVLHHCDEYANDKNEIQLGHVAPMPAWWPTAPGSGSGSAGSQQQQQQGGGGEQQQLAEGAGEGPGGAFKWGVWVNTAKNPRLKAVEMPQLGVTLEIPKQIALANIALRVQQRSGPGVDEYFSRCANAWMAVGGLLAVDLLAMPPGAKKVRGWTLRQVTPLALNVQRVPYPIPPAGADPATWASEEEPPPLGVTAPLPPDVVLLEDPLQVAWWDEAHSIWNTDGISDVAFDGASKTFSFHTTHLAPLALVMRRTRLLPYAGWAVRPTGGRNGNGAAISLDVGLDAPVVIEVSKGAAWLSSPAWPQLASLIGQPMPPLDLLQALSDRGLHLLPEDRDAEAAGVTAKARDTEEAMCRDLALLGGVFLMASSRWNQTAGPEEALARLSEVTDWEEGGRTAPHHLARIFDKEKEDGERRVLVVMRRGAKGVAFSDALNRRPEYPALPGVGSVEAVKECELSIWGEVHASVLTLLRGQFSAPGAADSPLALRLAAAPESLELCRTTSPLFTATLADTMLALRLFSFS 760 T 0.69 Casc1_N pdb F Eukaryota T 8glv 40 RG AG A8J3B6_CHLRE IC140 MEDASAGPPPVDDGEVPAAPADSSPLDDAPASSGAEPGDGGYDEGEPLDNEQAGPADVEGLDDAGEPGAAPEDGEEGTGGEGEAGAGAEAPGDGESPEAAAEAEAAAAAPPPPVELEPLPEDYVPVRDLPPIPEPFARNEEGKPVPLDGVESLFLTGTTIELVGLKELGAANVMGEVSRDELLKDIQFRGAISDFHAYKAKIQAADYEPLLVRFNEDDVYGDGNNFELAVTAAAAAVWRGIGEEVARRAALLELEAAHAAAEKAKPRSKRVRKVKPWQSMGSEVDIEEASVRPRRDPIRLVVQRRRREFNQPNAKLADKDAHELWNSSQMECRPFKDPNFDMRRMEQDVAVQAVAPLRDAATQSTGTVPPRPAVTQTEPLDLPPEAKQDLVRRPRNAPGSVADFLERVRDQCEVALVQNEITNIFRDDLSSLNDEADGGGGGSRKETLVSEAQSFTHLTYSKNKVVSAIQWLPHRKGVVAVACTEAQSHAERVARMGRTAPAHILLWNFRDPIHPELVLQSPWEVFSFQFNPLQPDLLTGGCYNGQVVLWDLSSEADRLSRRAGGGAGAGAAKSSDGAAAGAGGKGADSTPPSTALPGGGGGGGGVDSTSGSSADGDAHIPVIKHRFMTDTQFSHHQVVTDLQWLPGVEISHRGKVTKLGEGSKECNFFATIAADGKVLFWDVRVEKLLKKGKKADELLDLVWKPIHSVHLISLIGMDLGGTKLAFDFRKLEQGMFYAGSFDGELVYADFVKPEGEENPDYAKSCLQAHVGPVIALERSPFFDDIVLTCGDWQWQIWQEGQSTPLFQSGYAQDYYTAACWSPTRPAVLYLADQSGSLEVWDLLDRSHEPSIRVTLAATPIMSLSFNPMPTSASAAQQAAQQLLAVGDATGVLRIMELPRNLRRPVHNEKKLMGTWLERQQARLADVGARQPVRTSARKEAEERKKEAESAALAEAAAKEAAAKDAAAAAAAGMPLPTANERKKDKGPPPPEFDEKAEQEYLKLEARFKAQLGLMPAEANGGPGH 1024 T 0.02 WD40 pdbpssm F Eukaryota T 8glv 50 EIA,LH,MH,XT,YFA Lp,Aa,Ab,Fo,Kt DYI2_CHLRE IC78 MPALSPAKKGTDKGKTGKKTGKQEQNAQDYIPPPPPMPGDEAFAMPIREIVKPDNQLWLSEADLNEEVAKMLTANNPAAPKNIVRFNMKDKVFKLEPMVEQTVVHYATDGWLLHKSSDEAKRQMDMEKMEQEASARFQADIDRASHEHKDHGDVEPPDDSRQLRNQFNFSERAAQTLNYPLRDRETFTEPPPTATVSGACTQWEIYDEYIKDLERQRIDEAMKSKGGKKAAAAARAAGAAHRQRNEHVPTLQSPTLMHSLGTLDRMVNQNMYEEVAMDFKYWDDASDAFRPGEGSLLPLWRFVSDKSKRRQVTSVCWNPLYDDMFAVGYGSYEFLKQASGLINIYSLKNPSHPEYTFHTESGVMCVHFHPEFANLLAVGCYDGSVLVYDVRLKKDEPIYQASVRTGKLNDPVWQIYWQPDDAQKSLQFVSISSDGAVNLWTLTKSELIPECLMKLRVVRAGETREEEDPNASGPAGGCCMDFCKMPGQESIYLVGTEEGAIHRCSKAYSSQYLSTYVSHHLAVYAVHWNNIHPSMFLSASCRLDHQAVGLCHDPKRAVMNFDLNDSIGDVSWAALQPTVFAAVTDDGRVHVFDLAQNKLLPLCSQKVVKKAKLTKLVFNPKHPIVLVGDDKGCVTSLKLSPNLRITSKPEKGQKFEDLEVAKLDGVVEIARKSDADLAKNAAH 683 T 0.035 HTH_8 pdb F Eukaryota T 8glv 51 BI,FIA,IU,NH,ZFA Aq,Lq,Fz,Ac,Ku A8IJZ3_CHLRE WD_REPEATS_REGION domain-containing protein MEIYHQYIKLRKQFGRFPKFGDEGSEMLADIRPNEDHGKEYIPRNPVTTVTQCVPEMSEHEANTNAVILVNKAMSHVEGGWPKDVDYTEAEHTIRYRKKVEKDEDYIRTVVQLGSSVEDLIKQNNAVDIYQEYFTNVTMDHTSEAPHVKTVTVFKDPNNIKRSASYVNWHPDGSVPKVVVAYSILQFQQQPAGMPLSSYIWDVNNPNTPEYEMVPTSQICCAKFNLKDNNLVGAGQYNGQLAYFDVRKGNGPVEATPIDISHRDPIYDFAWLQSKTGTECMTVSTDGNVLWWDLRKMNECVENMPLKEKNSETTVGGVCLEYDTNAGPTNFMVGTEQGQIFSCNRKAKNPVDRVKYVLSGHHGPIYGLRRNPFNSKYFLSIGDWTARVWVEDTAVKTPILTTKYHPTYLTGGTWSPSRPGVFFTIKMDGAMDVWDLYYKHNEPTLTVQVSDLALTAFAVQESGGTVAVGTSDGCTSVLQLSTGLSEASPAEKANINAMFERETTREKNLEKAIKEAKVKARKEQARRDEVKDNVTEEQLKALEDEFFKTTDPAVGGGYGAGEGAAAE 567 T 0.35 Stap_Strp_toxin pdb F Eukaryota T 8glv 56 AKA,DW,HM,IM,MJA,WJA,YJA Mb,Gk,Cc,Cd,MN,MX,MZ ODA1_CHLRE DOCKING COMPLEX COMPONENT 2 MPSADATRGGGSAGSMGKGTLGAGDTLGHKSVLDKQRAAIEKLRAQNEQLKTELLLENKFSVRPGDPFAQALINRLQDEGDMLARKIVLEMRKTKMLDQQLSEMGSTLTTTRNNMGGIFSAKEQSTAVQKRIKLLENRLEKAYVKYNQSITHNKQLRESINNLRRERIMFESIQSNLERELAKLKRDMADMIQQANGAFEAREKAIGEMNALKAQADKEQQGFEEEWRQLTTIIEEDKKERERARAQELAMRERETQELLKMGTLSSAEKKKRITKGSWNVGYNKAMAQNVAAEKVEMYGQAFKRIQDATGIEDIDQLVNTFLAAEDQNYTLFNYVNEVNQEIEKLEDQINIMRGEINKYRETGRELDMTKSRELTEEEARLAASEAQSQLYEKRTDSALSMTTALKAGINDLFERIGCNTPAVRDLLGEEGVTEANLTAYLGIIEQRTNEILQIYAKRKAQQGTDGLAEALLAQPLTQPGNRIIIEPPSTTQEEEVEGLEPEPVEEDRPLTREHLESKVQRTLPRKLETAIKVRPAGADATGGKRGSPTRR 552 F F Eukaryota T 8glv 65 DP,OJA Do,MP A0A2K3DCF8_CHLRE FAP44 MAEPGEDSLPVDGLAEVNEQPASNPEQQAIVDVAAPAESAPDSDGDGVAETSEPADADEPAQSGSGEEAAIADETGSKPPAEAVATGTPETMPEEQPAEEQEPELRTDAPAADAEATDAPEEQPQEASATEAAPGAEAVEDVGAAAASNQDEKCTPEGPCSAVPDGEPRQADAEAPVPTPAAAAAAAAAASAAADQQLAGKPTTDAADGLTAEPTDRVPVPDPQAVGAQDGPAGEQADAEGAASGGPLAASAEEAANAGHAAGPEDRAALEAAAVAELDAASAGAGDAAAADGAACAPATPDSQAEDQPQPHAVAEAVTAPAAAPPAAPGSRTSSARSAPVAATVAAAEPAPRPPSATPPAEPRPQSGSSRTLPPPAPPPSLPPASAASSGVAQLLSVHGLDTHRRNNLVLLDEDTAASCIAGQLVLLSLSTGARRYLPGRDGGGVGAVAVHPSRTLLAVGEKARPGPASAGPAVYIYSYPGLEVVKVLRGGTERAYSALAFDGERGDTLASVGHFPDFLLTLWDWRQEAIVLRAKAFSQDVYGVAFSPYFEGQLTTSGQGHIRFWRMASTFTGLKLQGAIGKFGNVELSDVAAFVELPDGKVLSSTETGELLLWDGGLIKVVLTRPGSRPCHDGPIEALLLDRPAGRVLSAGADGRVRMWDFGAVNDAEPREDSHSLELSPLDEVVVAEGAALSALLADSGGRRWVVADKAGNVYTVALPPAGPVGKGAVVTRVASHPAGAVAGLQLSARTHTALVASADGCLRALDYVSGAVLAEAATPQRITAFTPLPAASPACPGGAMTAATGYRDGVVRLHARCAEGLALVGVAKAHKGAVAALAVSADGGRLVSAGEDGSVFFFDLTAQPQPGTGVPGMPACGLLAPRAFIKLPSGSGTVTCGVWEAAEGGGVLLGTNRGTILSVPLPPPDLNTHHSYEWAAGTSAVSSYQLVVPKPKRPKKKKGKNDGEEGDKEGGEQDEGQGGEDKGGEQADGEGGSKEGGEEGRAAEEEEEEEADDEADGGAGGPSSTTGELISLTLAPNEPGALLVTAGGVGHAARKAWRVRMGEPLAAPLLEGFASAPVTCLAHAGPEGRLALLGSGDGLVRLQALEEPFGSAAPGALPLWEAPLHDMQSGRVSGLGLSHDGAYLVTAAADGALHLLALALPPELAPPPTTQPGDEPLPGPAALPLRPPDVLAAAAYTLEEEKQQAERDQQVREAEEKKLSVRQRLGLIRAEFEALLAENEAAPEALRLPRADLEVDPGLRALMEAEALRREEVARLELAWESERQRLGLAKLRRYFLDGLESERVVLHSLRGSSTVTTFRVAKLSDETRAELAAMRQAARAAAAASAAAGGEGGAGGRDTDARGKGSDTGGGPGGDAAARARLAEATAALEEGTASGKLNKADLRRLARKRREAEWAAFNGTRPDDTYDSPADLAAIEEARRTIGDFKLKSDPNYVVPEEERLTPQRKRLAMLELEEALHDIAAAFNAKFFALRDVKRKVLADVRVKLAALAELAAAAGAATGGADPDATAAAYLAPFSGLPSGLLPEEEPAEAREAVTDADLAAFAARKAEDERKAAAAAAGGLGGFAGAAAGPKKPAAGGAAPAGGALAGGAAGSGSVAHGAGGPSAGGQQGLTAAEEALAKMMAAVPQSELEKGLAAYNRRRVEHMRSKLSEEITAMLDAFDDAHSALKAEKLGLEADVKAGQMRLLVGLQELQLLREFDKRESVLLAKRQAKLDDKQEIVDKIAECTDKLETKRLELEGLVARRAAVVAELDAVVPESDPFREALVRVFHRRIKRSKKKAGGGGGEDDYDSEEDEEDEDMGDDEVDDDDDGGEEVCPPGCDQSVYERVCDLREKRLDEEDMIAEFTKTIEVLRKEKEALAKKQRLVEQGLAAVNADMAEFQKEKQGRLNQVEVIVALRMHQIEYLLDGCLPDDLSACLVFSASQLRRLQARVDELEEEKAGLRAAHKELRRQHAALLRDKADKEARVAELEARAHDVQMLKFGQVIDLELLDRVSSSRGTEELREDLKKQELAYARELAEWDAKINARMDELVVLTRENTACLNAVSELTAAQRRLESGLTATRKGLFADPVQQRRAEVEERDALVALVNAQAAELDRLKGQLLALRRKDTSMYA 2141 T 0.026 Macoilin pdbhh F Eukaryota T 8glv 69 LQ EM DRC6_CHLRE FLAGELLAR ASSOCIATED PROTEIN 169 MAPKKKGGGKKKKKDDGAEPPHDGSWERAVESGTWEKPVTDLPDANTWPTWGALRERVLTACREIKINNTASLRDAFANELVKLSPPELTLIDLRGSSNLHNFNLSPMTTCPKLTDLDLSECAGLDYVLLQSQTVRSVNLRKNPAITKALIHCPRLNKLSITDCPALETLMLWTDELTELDLTGCNNLSVVKLQCPNLLDSKIPPLKVAPQHVKPSHPPIASLLKENLTTAAHKAAADKEALAGVKDTSDSIIPHVFRPF 260 T 1.4E-05 FBXL18_C pdbhh F Eukaryota T 8glv 100 MAA Ib UNKNOWN XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 34 F F F 8glv 108 HEA KC UNKNOWN XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 72 F F F 8glv 114 DHA LO A0A2K3D574_CHLRE Flagellar associated protein MDGNFVADVRLDDSDEVLQLPIVKSKVKKLLQGAVKKIALLGAPVIPSSDDSLEQFLQSAGRFFGKDPAKWDQVGETKVDVVEEAGKRTTKLTGVFTGTELALMVENPYYDERLPAREDKPELNFSQKRTPIRTDDEWQELIAEQPWTASKRRQLLTAYLHAKVEAEEPVLATEGAQGFWELAINKDHHADFRLDRLAALLNRLSSPSLEVATTTAAAIWGLATTGLSRKNLADLDIVSLLLSNIKRSFKMPVIPDEATLAAAAAAAKAAGKDGDAAAAAAAAAAAGGDEGGGVGGKPAAGALPEAQRNKYQSFLLGALSVLLIDRNCRRAYLQQEPEFGTLFVLARNLDGYEPGHAAARREAAAKLLTTMVQRDADARRSLIASGALRNVISLLNPKGPGENMIQFCAASLLATLVLDDDAMELIRDRGEAPLMFEACIVLLQSTLGKLKREVQRFYGQLTPEEAASTPPFDVELGVRLGEAASQAMWGSAHYCVMMDPIQVKMDHIQQLGVMGNDCYTTVALPLSRIAHCITASLATLAANPDAALLIMTSPNDVALVFLMSMLDCVETENFEQAGHVKASACAGVAFLACHPIGAEGDECMFGPFRQKLLGLGAFGALLRAALSSVLESDCDRIIQQAAAIGLMYLSTMAGAVDAAELAMYAALLTDSDNSEMIEFLMAGMWILLRDGNNRKVLGTSFNPSPANALAKNMINKLNDAITLHEINDEVAAKTAKLKQTMAGRGGEDEGGEDSGAVTAEPSALESMAAASASAPSPPPPAVEGEASATAAAGAAEGGEPGAAPGEGVGAGAEGGAGGEGGLPGEVGDAGAGPAPGEDGGEEGLLSPEASGVNVAMEPEPAAAAPPPPAAAAPADEGEEQLERDPDEAFAMPPDTNMPDGEKFYENKDTAFPSPMLMKREESADRVRRKAEAVKGRMKQLEKRFDKQLKDNWGLETLVSVGESWLPAMLEQDEVGEATDVPVLKLFEFLVASICMFMVDDDGVPERRELDVFRLSAPEGARNKTWWTVDVRAPEADGTVDSDTERALRILLQILGMHLSAAWKSMQLGVLTLWNACCRHPNMERHVVERGVALKLLMVVNNPMWPPSLREISAGCLEFFQERWSNLATFGAGATLLPGGLPAEVSGVSAGGVPPEGVVPYIAAMVGLVNTGVPLMEYRGCHGLARMTYTAPYACPEPKPFLKEAKAVAAALGGVEALVALMKRLNRRYQDLCGPAGLGGSLRATGGAAGGSGGAPPPGGGGGAAAGGGGGGNSNRGPTGQGEEPENPAMFERDMQNLEAVQDIYFVCMAALLNLSVLRGNQVPIAKRGLLVLLGTNTVFYNRVVVLRANLNATRPGGAHAPSAADALAREEQLLHLCSAIIQNIAQHPQNRTRMYKAELKGSVALDKVIEAATDVDEETRTAASFLPTIPSTRSMSPSAVPSAASLGRGGRSAGAAGGSLHASASAARVGAGGGGSTSPTRAATTGRLAQNAKMTQNGQVINGGVDTALAGSVRPKVVFPPICERGAGGDAITLQRYGPGGAGSPGVMRMGSQHSGRSGSPGTADTSGMDYNEGGGAGGAGMSHEAITDSRYRFLTWIDNTFHDLEAGANVPFSKKGGADDRSLASGSGRTYRKALWDEHGDWLPNEPESAKALNKLLARPMSHLWQDMPEHRARQGRQRWEPTVSEYRELQGAKPLTRPAAKLLSTLPPRDQEDLMVAASQMIGLMPEDYDSDEDEPLGPRGDGDGGFDGGAGGAAGPSGVEASGASGSGGHGAHHRKAGGAAGAAGAGGAAGGAAGGAAGGKKAGPPPPSNMAAMLAGEATMEIAKPTVSASLNWNDGPPNRPRTAERDNGRVGLTVLAAPPEALQATAARKAAEAAAASGTHFGAVGMDISDEALAAAAGSSTAAKADAVPLKVCLGPKRPRQIITFEDRIVIDNDNRPTLTLFEHVEGSRVSDGLFPSYILPNGKRAHMYYNGGTLLDEVGVEAVIPPPRPSTVPQALQQTMPLANVLNLIAKPPGSAPPFIPYKPVPRLVPLPPEHTLTVKRPDIHAAEAFGDLREDNLQLVIQAKKIIKTQTTTRVENIEVKQQEEREPWTLPSSIFKNRVKECDARAFFDSHTVEEKMFERDWQRACAKEKFTSMMSRENKANKEGKDEKVAIKEVHDVLLKYWPQVVGAFVYYATGGSSDPYHMSLNAFTTFLDECCIADSESQYCKRSDCDTVFIVCNFQPDKKSAEAQVNMENAMMRYEFLEAIVRLAISKYGKGQATDDLPTAVTMLLEKNIIPNLVPGAVIQSNTFRSERLYHEEVDLVFKKHSVLLKALYSRYRLKPVGGGLRPKVLKLDGWQQFMNDASLVDSQFTLQDASLAYLWARMYTIDEIKDYARYTCLSFTDFLEALGRVADMKALPAASDLDLAGYDSVLEWALDKERMEGGPDKGGQGQGGDGGEGGAGGGGATLDIFRPRPSAGFSAPKTRPLYVKLEMFLDLVFRRLYWDPSQPEVPFNYDGLLKLVKKIDKELGP 2520 T 4.7E-05 KAP pdbhh F Eukaryota T 8glv 119 THA Le A4PET3_CHLRE Subunit of axonemal inner dynein MATLTYTVFSLGEAQLHQLHTSNGKLFVMGEVAVELFQESPTAFLQELRKNKLPKLQSANRDVLHTVAELHLPVESSANSQGVCLLPAATVETLLVDKRRMELVQPFKLALLKLASQEAARLMAAGEYELALPVALDAVQQGQALFKPAPALQLFPLYLLAAQANLGLRRAKQCEDFLALASWLAMKEPGLTTSIMKSQLSRLYGQLYAFQSKHAEALHAFAEDVYYCSLEYGPEDVRTSLGYYNMGKVFQSSAELDKAASCNDQVVAIWAAALNAVVLGLADGGGAAQPAALPVGRLQLMEVVDMLTDIARSRAAALGSGHVTVGEAHLVTALACIQLEERGRAGEELEAAAATFGEDDVERLRLVEMARVMLNALTGG 380 T 0.01 TPR_12 pdb F Eukaryota T 8glv 127 ASA,BSA,CSA,DSA,ESA,FSA,GSA zt,zu,zv,zw,zx,zy,zz A8J0T8_CHLRE FAP1 MSGPIYPSTLRYKDRLDCGKDDAFTYNRLYTATQGSDVWARLTVDASVRQSSARSRGSFQEGQAMVRHSFKNSGFDSNTCPAVLTHSTFKAGLYSYGVPIEEQHPITARRFKQKQPLEFMTQIRPHTETTNQALRMLGTYVSDQPHADRLGMFIPAGCPGGKPAYHPDVTTGGFGLLPTLPRRGMGATLTDGRNLK 196 T 17 DBB pdbhh F Eukaryota T 8gnn 4 D D RAD17_HUMAN HRAD17,RF-C/ACTIVATOR 1 HOMOLOG TDWVDPSFDDF 11 T 0.29 DUF4088 pdbhh F Eukaryota T 8go8 2 B,D V,U C5AR1_HUMAN C5A ANAPHYLATOXIN CHEMOTACTIC RECEPTOR,C5A-R,C5AR ESKSFTRSTVDTMAQKTQAV 20 T 24 DUF4355 pdbhh F Eukaryota T 8goc 4 F,K,L G,U,V V2R_HUMAN V2R,AVPR V2,ANTIDIURETIC HORMONE RECEPTOR,RENAL-TYPE ARGININE VASOPRESSIN RECEPTOR ARGRTPPSLGPQDESCTTASSSLAKDTSS 29 T 21 DUF6352 pdbhh F Eukaryota T 8gok 1 A A Q5ZTB4_LEGPH Legionella OTU-deubiquitinase A OTU1 domain GIPATGDGACLFNAVSIGLSVEILSGRLDSQLDTPGYQALLDEFAKHHPQFNPKSWKTLKEWLAYYNDTRDIELILAPVLFNLNQKYQDHLDEEILNELTNLVWKNKANIENGQAWFQLQNTGDLGEALFPKLENLDLKKDRAPLLDKLREILKDYKLELTRENVKQFLTEKAKELLSALKKKISSDPHAFQRGYSCDELKGMTDALAISLVENREEDITDNRIKIRLENQEEHWNVLCNEEDSERFLDSTPSRLKMTSLEAYRGDKQVSAPT 273 T 0.07 OTU pdbhh F Bacteria T 8goo 4 F,K,L G,U,V C5A ANAPHYLATOXIN CHEMOTACTIC RECEPTOR,C5A-R,C5AR ESKSFTRSTVDTMAQKTQAV 20 T 24 DUF4355 pdbhh F T 8gp3 2 B,D V,U CXCR4_HUMAN CXC-R4,CXCR-4,FB22,FUSIN,HM89,LCR1,LEUKOCYTE-DERIVED SEVEN TRANSMEMBRANE DOMAIN RECEPTOR,LESTR,LIPOPOLYSACCHARIDE-ASSOCIATED PROTEIN 3,LAP-3,LPS-ASSOCIATED PROTEIN 3,NPYRL,STROMAL CELL-DERIVED FACTOR 1 RECEPTOR,SDF-1 RECEPTOR GHSSVSTESESSSFHSS 17 T 55 DUF5582 pdbhh F Eukaryota T 8gpn 7 K K MEN1_HUMAN Isoform 2 of Menin SMGLKAAQKTLFPLRSIDDVVRLFAAELGREEPDLVLLSLVLGFVEHFLAVNRVIPTNVPELTFQPSPAPDPPGGLTYFPVADLSIIAALYARFTAQIRGAVDLSLYPREGGVSSRELVKKVSDVIWNSLSRSYFKDRAHIQSLFSFITGTKLDSSGVAFAVVGACQALGLRDVHLALSEDHAWVVFGPNGEQTAEVTWHGKGNEDRRGQTVNAGVAERSWLYLKGSYMRCDRKMEVAFMVCAINPSIDLHTDSLELLQLQQKLLWLLYDLGHLERYPMALGNLADLEELEPTPGRPDPLTLYHKGIASAKTYYRDEHIYPYMYLAGYHCRNRNVREALQAWADTATVIQDYNYCREDEEIYKEFFEVANDVIPNLLKEAASLLEAGEERPGEQSQGTQSQGSALQDPECFAHLLRFYDGICKWEEGSPTPVLHVGWATFLVQSLGRFEGQVRQKVRIVSREAEAAEAEEPWGEEAREGRRRGPRRESKPEEPPPPKKPALDKGLGTGQGAVSGPPRKPPGTVAGTARGPEGGSTAQVPAPTASPPPEGPVLTFQSEKMKGMKELLVATKINSSAIKLQLTAQSQVQMKKQKVSTPSDYTLSFLKRQRKGL 611 T 2.2E-24 Menin unppssm F Eukaryota T 8gqa 2 B B precursor peptide analog MslAdeltaW21 CLGVGSCNDFAGCGYAIVCF 20 T 0.37 CCAP pdbhh F T 8gqv 3 C,F C,F As64 ALLRSATYY 9 T 9.3 BetaGal_dom3 pdbhh F T 8gqw 3 C,F C,F Hu64 ALLRTATYY 9 T 15 BetaGal_dom3 pdbhh F F 8gre 2 C C F-box protein UCC1 MNQSDSSLMDLPLEIHLSLLEYVPNELRAVNKYFYVLHNHSYKEKSLAWIAEDNYIWAVVKHSLCLYVKSLDPLRQHAREIIQETKEPGFNVPLCMTKYIADSWYIVYNALQYPGKIINMGWDKYTKSQDLNGSDSTSNFNSRPKERTLMQSLTALPVNFWSRKKDEPTPVNVWFYVKNAHVARYIPKIITEIGICNYGPKQIVASAGYINELITSEGIYCVNLGHLPRLYDEQIFEGTGTTHLPLELKAIDRTDSDVCINSDLVLLGYDFIPYQISKPWLLFRIEPVNSIEAIFNYSECSFSYQFAWSLACLQSEEKISFPRDTIIGHGLPYKPSKLIRIFVYKHPEQKQDLGQEIALPNWNTPYLRR 369 T 0.058 Elongin_A pdbhh F T 8grf 2 C C F-box protein UCC1 MNQSDSSLMDLPLEIHLSLLEYVPNELRAVNKYFYVLHNHSYKEKSLAWIAEDNYIWAVVKHSLCLYVKSLDPLRQHAREIIQETKEPGFNVPLCMTKYIADSWYIVYNALQYPGKIINMGWDKYTKSQDLNGSDSTSNFNSRPKERTLMQSLTALPVNFWSRKKDEPTPVNVWFYVKNAHVARYIPKIITEIGICNYGPKQIVASAGYINELITSEGIYCVNLGHLPRLYDEQIFEGTGTTHLPLELKAIDRTDSDVCINSDLVLLGYDFIPYQISKPWLLFRIEPVNSIEAIFNYSECSFSYQFAWSLACLQSEEKISFPRDTIIGHGLPYKPSKLIRIFVYKHPEQKQDLGQEIALPNWNTPYLRR 369 T 0.058 Elongin_A pdbhh F T 8grq 2 B,F B,F A0A0P9AXL3_DROAN Histone H4 LRDNIQGITKPAIRRLARRGGVKRISGLIYEETRGVLKVFLENVIRDAVTYTEHAKRKTVTAMDVVYALKRQGRTLYGFG 80 F F Eukaryota T 8gs2 4 D B A0A401FT52_9DELT CHAT domain-containing protein MSNPIRDIQDRLKTAKFDNKDDMMNLASSLYKYEKQLMDSSEATLCQQGLSNRPNSFSQLSQFRDSDIQSKAGGQTGKFWQNEYEACKNFQTHKERRETLEQIIRFLQNGAEEKDADDLLLKTLARAYFHRGLLYRPKGFSVPARKVEAMKKAIAYCEIILDKNEEESEALRIWLYAAMELRRCGEEYPENFAEKLFYLANDGFISELYDIRLFLEYTEREEDNNFLDMILQENQDRERLFELCLYKARACFHLNQLNDVRIYGESAIDNAPGAFADPFWDELVEFIRMLRNKKSELWKEIAIKAWDKCREKEMKVGNNIYLSWYWARQRELYDLAFMAQDGIEKKTRIADSLKSRTTLRIQELNELRKDAHRKQNRRLEDKLDRIIEQENEARDGAYLRRNPPCFTGGKREEIPFARLPQNWIAVHFYLNELESHEGGKGGHALIYDPQKAEKDQWQDKSFDYKELHRKFLEWQENYILNEEGSADFLVTLCREIEKAMPFLFKSEVIPEDRPVLWIPHGFLHRLPLHAAMKSGNNSNIEIFWERHASRYLPAWHLFDPAPYSREESSTLLKNFEEYDFQNLENGEIEVYAPSSPKKVKEAIRENPAILLLLCHGEADMTNPFRSCLKLKNKDMTIFDLLTVEDVRLSGSRILLGACESDMVPPLEFSVDEHLSVSGAFLSHKAGEIVAGLWTVDSEKVDECYSYLVEEKDFLRNLQEWQMAETENFRSENDSSLFYKIAPFRIIGFPAE 751 T 1.7E-07 CHAT pdbpercent F Bacteria T 8gtb 1 A,B,C,D,E,F,G,H,I,J,K,L,M,N,O,P,Q,R B,A,C,D,E,F,I,G,K,M,O,Q,J,H,L,N,P,R A0A4Y6EGR9_9CAUD Major tail protein MLKGKDGVVKNASTGDSIGHLQSWALDTQRDEVSGWGMGDDAERAFTTVGRASGNFEVYLDPADPSDDLEPGDLVDLELYPGGESTGSGYRSVAGALILSTAESASKDGIPMLTVNWRTSGALPQKATVS 130 T 0.11 tRNA_anti-codon pdbpercent T Viruses T 8gtc 1 A,B,C,D,E,F A,B,C,D,E,F A0A4Y6EGR9_9CAUD Major tail protein MLKGKDGVVKNASTGDSIGHLQSWALDTQRDEVSGWGMGDDAERAFTTVGRASGNFEVYLDPADPSDDLEPGDLVDLELYPGGESTGSGYRSVAGALILSTAESASKDGIPMLTVNWRTSGALPQKATVS 130 T 0.11 tRNA_anti-codon pdbpercent T Viruses T 8gtd 2 B,D,F,H,J,L,N,P,R,T,V,X M,N,O,P,Q,R,S,T,U,V,W,X A0A4Y6E755_9CAUD Head-to-tail joining protein MTVSIHPPATLVAGDSWAWEAGAVFEDHPDPWAASYVLRPEAGGDPVTVSGGLEVLAPVFRLPASVTADLPPGEWTWFAVAVDATTDARAVLAQGRVTVIPDPLAGTEDRRTPARRILAAIEATLEGRATKDADTYSIEGRSITRTPLPDLLRLRAVYAEQVARETGRSPYRQRRVSF 178 T 0.057 DUF6148 unppercent T Viruses T 8gtf 1 A,B,C,D,E,F M,N,O,P,Q,X A0A4Y6E757_9CAUD Head-to-tail joining protein MIESLADWSIFTDPDVFGEPVTWTTPPLPDPVPAIFTDASEDRPATLGPGVLTIAPTLTLGAAQLPFSPARNHRCTVRGITYRVAEVQPDGSGGLRLLLERV 102 T 0.0008 Phage_attach unppercent T Viruses T 8gtf 2 G,H,I,J,K,L e,f,g,h,i,j A0A4Y6EGR9_9CAUD Major tail protein MLKGKDGVVKNASTGDSIGHLQSWALDTQRDEVSGWGMGDDAERAFTTVGRASGNFEVYLDPADPSDDLEPGDLVDLELYPGGESTGSGYRSVAGALILSTAESASKDGIPMLTVNWRTSGALPQKATVS 130 T 0.11 tRNA_anti-codon pdbpercent T Viruses T 8gtf 3 M,N,O,P,Q,R k,l,m,n,o,p A0A4Y6E8T3_9CAUD Terminator protein MSEAIIAAARGRLISPPFSDATGDVYRTPEAALPAIIVELDYTDAERISMGGGFIASAELRVEILAKRDDWSLLTPTPANTAEGMARLAALVRTAILAPPSDLSGLAWSIAPAGYEFETERGETPLARATQSFALQILQP 140 F T Viruses T 8gue 1 A,B A,B PIKC_STRVZ CYTOCHROME P450 MONOOXYGENASE PICK,NARBOMYCIN C-12 HYDROXYLASE,PIKROMYCIN SYNTHASE CYP107L1 MRRTQQGTTASPPVLDLGALGQDFAADPYPTYARLRAEGPAHRVRTPEGDEVWLVVGYDRARAVLADPRFSKDWRNSTTPLTEAEAALNHNMLESDPPRHTRLRKLVAREFTMRRVELLRPRVQEIVDGLVDAMLAAPDGRADLMESLAWPLPITVISELLGVPEPDRAAFRVWTDAFVFPDDPAQAQTAMAEMSGYLSRLIDSKRGQDGEDLLSALVRTSDEDGSRLTSEELLGMAXILLVAGHETTVNLIANGMYALLSHPDQLAALRADMTLLDGAVEEMLRYEGPVESATYRFPVEPVDLDGTVIPAGDTVLVVLADAHRTPERFPDPHRFDIRRDTAGHLAFGHGIHFCIGAPLARLEARIAVRALLERCPDLALDVSPGELVWYPNPMIRGLKALPIRWRRGREAGRRTGLEHHHHHH 424 T 3.9E-36 p450 pdbpssm F Bacteria T 8gvn 1 A A TRP-LEU-ARG-ARG-ILE-LYS-ALA-TRP-LEU-ARG-ARG-ILE-LYS-ALA WLRRIKAWLRRIKA 14 T 1.6 DUF3349 pdbhh F T 8gwa 4 F,G A,a Q8KAY0_CHLTE Photosystem P840 reaction center, large subunit MAEQVKPAGVKPKGTVPPPKGNAPAPKANGAPGGASVIKEQDAAKMRRFLFQRTETRSTKWYQIFDTEKLDDEQVVGGHLALLGVLGFIMGIYYISGIQVFPWGAPGFHDNWFYLTIKPRMVSLGIDTYSTKTADLEAAGARLLGWAAFHFLVGSVLIFGGWRHWTHNLTNPFTGRCGNFRDFRFLGKFGDVVFNGTSAKSYKEALGPHAVYMSLLFLGWGIVMWAILGFAPIPDFQTINSETFMSFVFAVIFFALGIYWWNNPPNAAIHLNDDMKAAFSVHLTAIGYINIALGCIAFVAFQQPSFAPYYKELDKLVFYLYGEPFNRVSFNFVEQGGKVISGAKEFADFPAYAILPKSGEAFGMARVVTNLIVFNHIICGVLYVFAGVYHGGQYLLKIQLNGMYNQIKSIWITKGRDQEVQVKILGTVMALCFATMLSVYAVIVWNTICELNIFGTNITMSFYWLKPLPIFQWMFADPSINDWVMAHVITAGSLFSLIALVRIAFFAHTSPLWDDLGLKKNSYSFPCLGPVYGGTCGVSIQDQLWFAMLWGIKGLSAVCWYIDGAWIASMMYGVPAADAKAWDSIAHLHHHYTSGIFYYFWTETVTIFSSSHLSTILMIGHLVWFISFAVWFEDRGSRLEGADIQTRTIRWLGKKFLNRDVNFRFPVLTISDSKLAGTFLYFGGTFMLVFLFLANGFYQTNSPLPPPVSHAAVSGQQMLAQLVDTLMKMIA 731 F F Bacteria T 8gwa 5 K E unkown protein XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 58 F F F 8gxq 26 Z DQ TF2AA_HUMAN GENERAL TRANSCRIPTION FACTOR IIA SUBUNIT 1,TFIIAL,TRANSCRIPTION INITIATION FACTOR TFIIA 42 KDA SUBUNIT,TFIIA-42 MANSANTNTVPKLYRSVIEDVINDVRDIFLDDGVDEQVLMELKTLWENKLMQSRAVDGFHSEEQQLLLQVQQQHQPQQQQHHHHHHHQQAQPQQTVPQQAQTQQVLIPASQQATAPQVIVPDSKLIQHMNASNMSAAATAATLALPAGVTPVQQILTNSGQLLQVVRAANGAQYIFQPQQSVVLQQQVIPQMQPGGVQAPVIQQVLAPLPGGISPQTGVIIQPQQILFTGNKTQVIPTTVAAPTPAQAQITATGQQQPQAQPAQTQAPLVLQVDGTGDTSSEEDEDEEEDYDDDEEEDKEKDGAEDGQVEEEPLNSEDDVSDEEGQELFDTENVVVCQYDKIHRSKNKWKFHLKDGIMNLNGRDYIFSKAIGDAEW 376 F F Eukaryota T 8gxs 21 U DQ TF2AA_HUMAN GENERAL TRANSCRIPTION FACTOR IIA SUBUNIT 1,TFIIAL,TRANSCRIPTION INITIATION FACTOR TFIIA 42 KDA SUBUNIT,TFIIA-42 MANSANTNTVPKLYRSVIEDVINDVRDIFLDDGVDEQVLMELKTLWENKLMQSRAVDGFHSEEQQLLLQVQQQHQPQQQQHHHHHHHQQAQPQQTVPQQAQTQQVLIPASQQATAPQVIVPDSKLIQHMNASNMSAAATAATLALPAGVTPVQQILTNSGQLLQVVRAANGAQYIFQPQQSVVLQQQVIPQMQPGGVQAPVIQQVLAPLPGGISPQTGVIIQPQQILFTGNKTQVIPTTVAAPTPAQAQITATGQQQPQAQPAQTQAPLVLQVDGTGDTSSEEDEDEEEDYDDDEEEDKEKDGAEDGQVEEEPLNSEDDVSDEEGQELFDTENVVVCQYDKIHRSKNKWKFHLKDGIMNLNGRDYIFSKAIGDAEW 376 F F Eukaryota T 8gy7 1 A A GNAI3_HUMAN;GNAS1_HUMAN G(I) ALPHA-3,ADENYLATE CYCLASE-STIMULATING G ALPHA PROTEIN,EXTRA LARGE ALPHAS PROTEIN,XLALPHAS MGCTLSAEDKAAVERSKMIEKQLQKDKQVYRATHRLLLLGADNSGKSTIVKQMRIYHVNGYSEEECKQYKAVVYSNTIQSIIAIIRAMGRLKIDFGDSARADDARQLFVLAGAAEEGFMTAELAGVIKRLWKDSGVQACFNRSREYQLNDSAAYYLNDLDRIAQPNYIPTQQDVLRTRVKTSGIFETKFQVDKVNFHMFDVGAQRDERRKWIQCFNDVTAIIFVVDSSDYNRLQEALNDFKSIWNNRWLRTISVILFLNKQDLLAEKVLAGKSKIEDYFPEFARYTTPEDATPEPGEDPRVTRAKYFIRDEFLRISTASGDGRHYCYPHFTCSVDTENARRIFNDCRDIIQRMHLRQYELL 361 F F Eukaryota T 8gym 3 C,GF 2f,2F Q248F8_TETTS Transmembrane protein, putative MIKYLLHQLFIYIYVAEVLLGCIFAFAETVFFHSDQDEDYFLQIKQIQIKNQKRFRNNQKKSRSFKKKIINQQLVSKMVRLNLKSNVDQNEYPFLAKWDKDMRQNYEEYQNRIDATTYHLQRSQRGIAVFGEWMYPRYFQKDILELEVLRRKQQLGKIYPEEVSSYTQINPDIANDLNLTFNAKLLWPVRGMTVGAGFFAFAHLFNLPYSFRLGLFVLPTAVELAFTWGNKTSQFKSIEFMDYLLQYRVSKALLEKNAKHFAEKKAAYQKEINSSQSVQDLYNQLITLVSEQAPSE 296 T 16 NADH_dh_m_C1 pdbhh F Eukaryota T 8gym 4 D,HF 2g,2G I7MEX7_TETTS SDHTT3 MSLVSLFKNTFLKSRVIGLSFQAQRVMAQMAKTDFENPDEHFLLNDAMKYNELVFYGRLAENWSINPELFGKAELAKYNEAKQTLIDFNQYHALVQNLHEFYWELKTIYLELSRGVATSNFHNKREVTHSIIESDIKNSIHKYIQLIDDLKDYPEWQHKVREEIGYYAHMIYTSVNHDGNFPEIFKEFNKVDSLYYFK 198 T 0.0012 MiaE_2 pdb F Eukaryota T 8gym 5 E,IF 2h,2H I7LX66_TETTS Diphthamide synthesis protein MLTQRFYMIQFTKEEQSSEEKYLKTREREEDRKKELMHPQKVLNKKENKRKALLSKNQQNKKLIKYLNLNKRQEKLININQEEMSILPPLQYTYSNEESLELLIHSIKGNKDCNSERKAFNLCRSTVLGKHVEPEKCLDKALVFVNCFQKVRRDESAACQSAFNSTLECGKKYSESTISLGSSCQSQLDAYLNCK 195 T 0.0012 CHCH pdbpercent F Eukaryota T 8gym 6 F,JF 2i,2I Q22YL0_TETTS DUF4885 domain-containing protein MFSDFNMYEAKVFLKAVADAQNTFRQTAQQENQLARYESQSQSLLNGSTSGAISITGDNIQQGRNFKALKEVKLFQYSNEIFKKYLAGFDSFSGDYTAFKKFLNESVKKIEQDA 114 T 0.0045 gp37_C pdb F Eukaryota T 8gym 7 G,KF 2j,2J Q23S01_TETTS Transmembrane protein, putative MNHSCQKVFEGFVSALYDTSYFFRNFGPFKATIHYATYANYLAQNWAPRVSYIETSTPAYTLAKNKYAVYIVYGLIGGALIHNYMLDNKAAQKSQQYYLKHRD 103 T 11 Fzo_mitofusin pdbhh F Eukaryota T 8gym 8 H,LF 2k,2K Q24CW6_TETTS Transmembrane protein, putative MLDDTKYIQMAQKFPRNVSVQLNKKLFVTRTWFRNYYFVGVFGIFAYFIYNQPKIFAPFSGYPTTVAYKAQPDFLNDQVIFYSQQRQNTLKNF 93 T 9.1 DUF108 pdbhh F Eukaryota T 8gym 9 I,MF 2l,2L W7XBF5_TETTS Transposase MKLDQIISYYITPVRRFDKNLTAEQIYEQYQQAAQFNEIDAFTNIRFHRKFKEYIQTQEQSDYLYEKAKQISTLAQKMFEKKFPEYYTQ 89 T 0.13 Cdc6_C pdb F Eukaryota T 8gym 10 J,NF 2m,2M Q22HD6_TETTS Transmembrane protein, putative MARLWWTLDPSKYYLKQISSGGRNEILFTVLGVTAAYWYFGNKRCEHYWRRQIDNCQSWSRAQNINGNNLTVKQYF 76 T 0.0044 PriCT_1 pdb F Eukaryota T 8gym 11 K,OF 2n,2N W7XF00_TETTS Transmembrane protein, putative MRRIFWNFKTAFVGLPMFSLAPKNILVYPIVVGVPLYTFIVLQNSVRGFAYFDEYDSDVKEN 62 T 6 PPI_Ypi1 pdbhh F Eukaryota T 8gym 12 L,PF 2o,2O SDHTT11 MGLPIRNIQFARYHYLAAVTVFTYFATRCCLLDYKKYYPLASVKKI 46 T 15 DUF4519 pdbhh F T 8gym 15 O,SF 4a,4A Q24C97_TETTS Phage protein MMQNLKKFMSKTIQVQPVSFNQIPKAFYNFPEYRTGGVQANPGITAKRIIKCIGERLRKYDPARWENVPITFKTHFRDENGYSDVATSIQIHDALEREFGIDIKDRLALVTDVETAFYIVMSHHDPL 127 T 0.16 DUF1493 pdbhh F Eukaryota T 8gym 17 Q,UF 5t,5T Q22N23_TETTS Cullin domain-containing protein MEDNYAADVQRQFNRTAFDSLYKICYNSLVQKNGSTIDFQKQIDCHQRLIQVFAKIAPIVVKVEQDAASSGGAAAGGEDEE 81 T 11 MetOD2 pdbhh F Eukaryota T 8gym 18 R,VF 6a,6A W7XCY5_TETTS Transmembrane protein, putative MIWKYLQRTNRGNIIQAGLQHRKFENLPFKQNFDNLTKAYDLRMWYISNSPHEAKNLEYVNELEALHNELNYQNSRQFLFRTVSFLLGWALFYQFYELPKTYDWQDTQEPKHQVPAYGDLEEGGDEGGDD 130 T 0.025 SpoU_methylas_C pdbpssm F Eukaryota T 8gym 19 S,WF 6b,6B Q24I72_TETTS Structural protein MSSAVEKKDLPADYGKMPAGYNFLTRGKDWREYDKDFILRTDAVWEKFQLEHFFRNYMKCFFFDHGLKKYQMFEPEDMYTVVFEGWALDDLITFPGFTPTGRTNSYQIGLSPRQRTVVPTQTFYQMQDYYMLCGLRFERWFRCDLVYHDQRHTKFDQVKNQKNYKTYPCYREYYEAQYACQDDMFDFLMELAYARRAADNFESDFASHELTTLPTFYDTPKAAERKTYTY 230 T 35 SNN_linker pdbhh F Eukaryota T 8gym 20 T,XF 6c,6C Q23DS4_TETTS Transmembrane protein, putative MGSVWFRNRYWWYRSLYDDYVAREAKLAFGIAAFIWLPHYYWGIHLNRAFEVNFSHRNYAHEWGPRRNRLAHSLEFEQFDMILENWQDLEDEYAQRGDGMLKK 103 T 7.9 NIPSNAP pdbhh F Eukaryota T 8gym 21 U,YF 6l,6L I7LVX0_TETTS Decapping nuclease MEVKYRGPSDDKLECEFLENNLLSCLREKSVQDNVAKMTCRPEFLVWFFLECPTKAAVYHDPKGLRNIFIQDKIKQKGSDDGVLSKDD 88 T 1.3 Defensin_4 pdbhh F Eukaryota T 8gym 23 AG,W 7A,7a I7MGF9_TETTS Transmembrane protein, putative MNRFFKVSSKYQYYKYLEQYDAAFLRKYQSETHWYLGRRGAWKNLVIKYAGDHISLEEEHNVKYKTHLSFVYLSYRLAWVLFAYVLIYNHFLLGDIGKTFNVGEWDHRLKPSAERDYPTRYESLYILDRTQKW 133 T 0.0061 IRK pdbpercent F Eukaryota T 8gym 24 BG,X 7C,7c W7X287_TETTS NADH dehydrogenase [ubiquinone] 1 beta subcomplex subunit 8, mitochondrial MISKYRYLHCARKLVKQSVQAFGGGHHHHEYDWRDDPKVNKDIEEDIRDRGWHPETYDFPYTKKHDDWVFDVTMPSQNYQTDLTVNIHPENKKMHVMKQVMRQSYWDAEHDMAHEYDYESEDLDFQCESFKSQHFRKKGPISQYLILGLLPILYFGTEFFYNHYPDEDYWRVAHPPPLDYPDTDDTDDTETFKDYKSFTGRRMVDTGIVDPLWYDIREGKKVYYDWAGVNQPMEDI 236 T 0.14 Tctex-1 pdb F Eukaryota T 8gym 26 DG,Z A,a Q22PJ5_TETTS Transmembrane protein, putative MLSKVTRRFLNYNQIYCFASQHGAEHHKLTASDEAYLNEVRQRYVTPDMEKWAYLDYKKHPSTTLSHYDHKSKDYVESERDDYNADVATNSHNKLIDDFKRNLQMQRKVHDILQKMDRPYLRGVPGVTKNISAGLQDYSAPVSKKSQSDPNDFYRDAYRNENRWIDQSVFTPKTSKMTHYDVEWPKELASRPVTKKFHHDKGYKYDVTTPYDQRYNYVADRLGHPEILGNPFERLMRLEGDIYHPNYLDQPFVKVPNANPNASLNFEEGEVLYENTRLLEWAKFWNYSVVVGYLWCAYFVPYNIFFKTHMPLEHAYDNLFFPYFQHTHFLWDNNALHIPTVGGVAIYATYIALSYINNIWKDYVVRAQFSKDKELLFVTRVSPFGTTEEEVYEVAHLEHLPPSVRSGVKDLSAQDADGLVDVTCMSSQRSLVFYKGDQYWNPKVYNDFINQTSNLWTRNYTGYNRLEVQNSVEQVKIGFSHSSQPKLEKK 490 T 0.015 TMEM70 pdbhh F Eukaryota T 8gym 27 AA,EG b,B Q22FX8_TETTS Protein phosphatase 2C, putative MFRRIISNGALLSTQTQRWQDLSKFACLRASLNKESEKAFQELAKKNNVSPQELVELSKIVSMNLDVLKQNINSEQFLLEKESTLKRYRQSSIGTRGHLQTVNEAVNTKYPTLAEGLGQVAGYKEAYQALREIFVHPSISVNNLRQGSYGQQFAVDFRTRADEYVKALLKDHSSNPQAVQTIQEIQHTLHQIIKNYEQNPASIYARILTVLQTRGVNTLPVSKTADQKAVATIQKTSTPSLTIDQLTVPVQERVQTQTVFDAELAFIKEANEMIQQNTGNLPWDGGKKKIFQGQANKYLETPYYLLAALSGLGLLYFLYSGDAKYKTLVLTPVVGIAAFVLLRRNQILNRVPTLTELFLHKDGKFVDAVVSVNGQLISKNDIPVSTLKLYRGDHTVKVNLNDFEDASAKKFLAQQSGQEGVINVHFSKLRNLAARNGQVLNLGDTEVVVPFENQANRIILKQIFKGVEVLPSS 473 T 0.011 Rh5 pdb F Eukaryota T 8gym 31 EA,IG c3,C3 Q950Y6_TETTH Ymf68 MLICNFLMYSNFSRIYWFDFNGTVNENLPLNYNVLKICRNEINKLEKLNENNLGTQKNPIKLNLSFEDKHYNTNNLVLDLNSYETFNSKNFISSIFDKTFESLNTVLMAPIYSFLEFKLKLSSTKINTNHYYVINGKLYITYNDSFKLFTTINDYFNDLNELSNTKLFFLYRSFNIYNIKLNSLVDFVFLKLILFIHLLYLKSTNYNRFDYRLKQTDWGFYINNNSNYIQNIFSGLKYIWRGLRFWIIGLLLGLSSIYYLMYVRLLPFNKIIFAWILVAMFLYWLLSGFVFFVKKYQYSKFTAAIQRFWKRTYIIFWVIEAGTFSVFFYLTLNASSEPVYMYDQIKIYKTHLFSWRWFLIKLLPSVSIILLGYYLQLTLKWNLFNKQNTIVLLITLLLLYILWLEFYQFYHILSFYGNINWAFDYDEYIWTLELDTRRTRLANNYIAICLFAKFWHFVFIFLFWVFFVLRINELGRIRYPLLVANVQNFIIIYIMSWAYMYPWLKFIFRKYLDVPYYWFYLNGRELGIRVFFTDLKLFFYGITNRLFDFNPSSIKFEKYPFYYWINSSQLTEFNQYRKFVIRDSIIYSLNNYII 594 T 0.29 DUF3408 pdbpssm F Eukaryota T 8gym 34 HA,LG f,F Q23DG8_TETTS Transmembrane protein, putative MRYLKIEKEKLVSCKKQEQEVQRIRRRKGNQKLNSIAKQQRVKRRDYQQNIKQNKEVKNPKKLIKQQIINKVKKRKKMFRGLTKFNKVFALNSFKNSLVAVPKANLNHVQNMLEENLKYDAQKYNDEVAVIQKTSRIYKPTYTIEFNREGEVLVYSADPIKNSVVYFKYPYVLYEAAIPLFIWAWIYNPLELSKNAVNSLLIYPNIAWIPRMWYWRSLQYKIQKMYLLRGGKVAKIETQSLAGDRFTSWVETYQFHPLTQDQKNFDNQDNAEFLEDEGQLKYELGVQLDNLQEMGTTSQDIVINFMKEGTVHHPELFEAIVKGYNIDTSDYVINTANNLRAREGNHNH 348 T 0.023 TMEM70 pdbhh F Eukaryota T 8gym 36 JA,NG g,G Q23DZ5_TETTS NADH dehydrogenase [ubiquinone] 1 alpha subcomplex subunit 8, mitochondrial MFLNRLVKETSKAKRLFSMAQNNFARAGPYNPNRYKDYYIPRTLPKNEEIVEFVQSQHSVPASPIRNQRHINPVRESGPLPSYDGTYTMEDIRAVFYNTTVGRDYCYCQMDPEEIMRRVPGITRKEAEFITKLGLSPQEQVDFAYIAYNIGLDIFYFTNQMFVARQVVTNSKGEKVEVLWNAQCYEDIAQLNVGFAPVLESVDYHWEIFLWADPPIKPNNDFDLNVPCTWFEYEQEWWMESCIQEDQFNLPEDERPYNTPRNPHCRKELWRSQDALQEEELMVNENWYPKNTQYNIYNQPDFIKPKSGSGAAADDIRI 318 T 5.7 IMS_HHH pdbhh F Eukaryota T 8gym 38 LA,PG i,I I7LY65_TETTS COXTT9 MVYHLFERICNPDNFKLSGEAARVRTLIAAGFSKEEAEQVAWLQNHQVNGKILGLFTGGFALYCCNNYFHYFERYFPRLRYQPFTKFLAQAATVYFFFKIGDYYFTSRRYGSNDARMNGLMYSNTYYSTNKEALIQNFEPLNRKFTEEEVEQFLRNEGRSQEEKRNWIYNPHIHGSTEGEWKADIHEKFDSGKAPWEREHVKAKILETNKAKIDAGEEIQLKPFKTLNHLDKTGLLHRLHPFIWTNNWTLLG 252 T 0.19 Bac_luciferase pdbpssm F Eukaryota T 8gym 39 MA,QG j,J I7MD70_TETTS COXTT10 MSSFIQYEFLKIYQGNQKIKNYYKRKRLIFQQKKVLKKKQKEIQMSTNNLRLKPWFHWTDEERSHAIFSAYEKRILKSEDLPSFLRANRINNVSTWVFPLIALPLFNQSIFKLGFAQRILLTRPAIEWHCFKIATVAASWLAWLNFSPFYRKLENEKEYLLDTLESRIGINVLDLNDALPRWTTSQEYNRRTQQLYNQRNGFFAGLLYPQEESSRPLVDIASFPKNLHKEKLTK 234 T 2.9 TFA2_Winged_2 pdbhh F Eukaryota T 8gym 40 NA,RG k,K W7X4J9_TETTS 39S ribosomal protein L9, mitochondrial MFGRLVLKQTRRTLFNPVLKNTFCIYQAYQNPLRHINTGHNPNNVYEDIVMLGDYPVQNRTHDKVISQTYVPAIANIAFTHLSKKYPQAGLKVDQLNTLKEKTWNDLGVNIEHEKQEILVELSEQIFVKESKLRWVHEQRQRLAHTTYVFSGLEFQNVKVGFFIDSYNFLLQELAHRSNLYQSKDIVGEKSFHEKHLEQQTAPYSGVKSLEEPVSQNKSFINSLMRAIHNH 231 T 27 FliD_C pdbhh F Eukaryota T 8gym 41 OA,SG l,L I7M3P9_TETTS Ubiquinol-cytochrome c reductase complex ubiquinone-binding protein QP-C IKGNQKKQKGKNQSNNNNNIREEGKQIKEMILPHNNRQLARQYFDSLPENDINRKYYEGLKYETPKTFFGRFLNQFNIDAKLDTLSKFYTYQKTIRATQAELQEDRKSYLTNSLLFTAVSWFSIYQFARKGAVLPVLREYGRYFGTHRLFRQYLHTLVLPLLYTEYALNQKYYTHMEHLWTVHVNRLNQKILEDPLYTFYPQELNVPKHNIIVPTIFRDTPQ 222 T 18 Plk4_PB2 pdbhh F Eukaryota T 8gym 42 PA,TG m,M W7WZP1_TETTS Transmembrane protein, putative MSLSLFGVKNNWHKNGIWWFSKILNKTVGEERYDALRVQRRIWSMRFYYARQQCLYELFVDHPDLAQWTGTYPKVDSSHGFPFYSTYEMYRDFQENTLNSDGSFAQWITLVCGIYVIHVIYNYMIPYYWVSTPLKNDEFTRLRMKDYIASTVLEEVYGISYAEWGWLPHDFAYNRMRGLAGYMHPDDPRAMCTSTFHRKHKYIEHEVEKVGDYHHMTYPK 220 T 61 Spore_YabQ pdbhh F Eukaryota T 8gym 46 TA,XG n,N I7LZX8_TETTS Transmembrane protein, putative MKKGTASEEELKKLYDPNTFYEHGDNPAFKQFMNIAVENLREGKLTDHRTYVVDTYKKWMYARNWDDFLQRDCKAITFPRAFALWIVGTLGMATASKWCRQILPVGSHGITKISQTQFFHQFGPLGTLGAVGFYGLTAYLYYKTTIFTVKKFYSHCILQEREWIFEQERQNPGYGEYFFKDVPLSAEEHFNDLARGEMAKKKFEKPNHEF 210 T 8.8 DUF4500 pdbhh F Eukaryota T 8gym 47 UA,YG o,O Q23F08_TETTS Mobilization protein MKEKIFNELTRKMKRKEISAKIQREENKQILIRQRNNKKYIQSIQGIQQERKKGKLYLVEMATQNVEEMDTIQKMNYEATVNMGRQDLITREYTFYSDYEFIPIQEDRKQQMEDALNNLHKIIHPTVTQLKKKANVQEIQDRVFRKLQGWEGELNTCVFSAKNVRDSNFCADRFTNRINTEGVEFVKQILREY 193 T 0.13 DUF3221 pdbpssm F Eukaryota T 8gym 48 VA,ZG p,P I7M8Y9_TETTS YflT domain-containing protein MNNTFKFLHQVISKLTLKAQVPNYGQYSHSLKRPINPKVVVFGNSSRAYELISSQFRNFNHVNGLELKGQEDNIQANKVAQSVLSINDGFQDGYYITDFPQNSKQAERLDLITDGVNLALYIKDPSDKVTVTRQQEAIDYYRKTGALVEFEVDPRGDLEEQVKQLSNQVLNGYKH 175 T 0.11 PRORP pdbhh F Eukaryota T 8gym 49 AH,WA Q,q Q23D87_TETTS Transmembrane protein, putative MDNNYHFWGNGDRQDVSLSYEDYYSILDCLLDEKLSPQGLMKFKNLHEVSMYGVSYVPLYCFPVAYGISHMLTGKVRRGHSGYRNLFSLMSVVLPFTCWYAYTTPIPRRLYTEIICSNNADGAYVRNRIKQQKPGIWRKLSQQLYNKNFRFPELNQDLTATEFPLDYVAPHKF 173 T 0.16 DUF2206 pdb F Eukaryota T 8gym 50 BH,XA R,r I7MKT6_TETTS Transmembrane protein MVFEFLFYNQQHKTRNGYFINHDNLMLASLEERKKLIFYFIANQVPEKLDPVDRVKFNEELSDNLSTKARLIGSLTGLIGLVGFPYISTRIYSRPVLNIGLSLLICPFLYYVGNQLTYSVWEPKFIANNNTVCELSKKYNFTVFDFAQAKKEAHLKALRTELVSDNLLYSPGI 173 T 0.048 DUF1689 pdbhh F Eukaryota T 8gym 51 CH,YA S,s Q230X6_TETTS Complex III subunit VII MAIRNFVFKISNQIQNLAAKRSLAYLNQIDSQSVPSRATINMKDQVTQMQREIDNMANVIRAQIPDEDRAEFEILKKYYVTGQHDSLVDPQDVLLQLDRIQVLKNLKMIELNEEAYDPELVRLEKLKARVLLEEEGALLEYAHFISKRPYNKPYEKWGVSEEHVKQQILG 170 T 0.35 APG6_N pdb F Eukaryota T 8gym 54 BB,FH sc,SC Q23RH8_TETTS Cytochrome b-c1 complex subunit 8 MRTKLYNAAYFLLNNNESFGHSFGIRLKIVGLNTWIVGYAVSRYYFSSLRVKAAQDERFE 60 T 4.8 eIF3_p135 pdbhh F Eukaryota T 8gym 55 CB,GH sd,SD SDHD MFKELIHIFRTYFITFRYLKKSNINFLKNLSYTLIAYYLIINFM 44 T 17 DUF1869 pdbhh F T 8gym 56 DB,HH t,T Q23VY4_TETTS Transmembrane protein, putative MGFETVVPAPPTRDDELRMIKATEEQFLQQPRYKLYMNEAHRIAKMNHGDRHNNIRAHFWSNFALGLLITGPIFIIPFGKAFRNLRSGVPYYFRPKYVFTQKNQYNQDRNWGAMKKQIPLWLGLSTAYAYWFTDFSINDDEWLEKGKVIYPHQTIKVL 158 T 0.091 MASE1 pdbpercent F Eukaryota T 8gym 57 EB,IH u,U Q22DP8_TETTS Transmembrane protein, putative MSCTTRRFIDEKEKLEYSRGYNQQELEASKLRKDFVKKYIVDFDTTLYKTQVERDWAYIAKREYRYEVQLKSIGYGGALANAVLLWRIYANKKMVFWPIPIVGALGYLYFQPVFFQKSNKRFFDMCNVGEEYYLGRERNKILRECNKILNVEDF 154 T 0.1 DUF559 pdb F Eukaryota T 8gym 58 FB,JH v,V I7MFV5_TETTS COXTT22 MGKDQLDFSHFDKAFENKYDIVAPEFGDLHQKRAEFIAKNQGTYRPVPLVPNNIKGLIPKTCRLPATRNWYRRTSSFERNGFFNIHTPVLNTKMIPWLLFIVLTWGWSSFQIGGYNYERFDDNGERRNTLYWKLSPVEFPQSKLWNRPS 149 T 0.056 TOM6p pdb F Eukaryota T 8gym 59 GB,KH vb,VB Q23FF5_TETTS Cytochrome C oxidase subunit Vb protein MKKQKRTQGKQNTKQIKQEKLSSKRKANNQKEGKKKVKQEDYKEIKQKGKRMLSKIVKASFSSKGFNLANAVNTVKSTLNAPIKHIKRNIEPTGSNYSRMTNTTEEAFDEVSHEWQALVTSNPFDLNVFNYLENTQTSNFGTVDNPLVVFTSETPFRYVGCTGQMNEDDYEGHELLFFLLREGSLQRCMGCGQVFKLVRLRNEYSPEMDYYLSNFHPYEMQEMGESDTTVLMSPYKYASHYEYTQFETPSNMVYSMVNPDEHDRLLVDPAYRMERTKALEEKYKVYTSSLREVEKQFEERYGRAGQINISKVTYSTLIDVEKAVLKMDRLFRKVAKFENRAFIDRANHSRREKRMLERAQQRWDSNYSFFTGSLTEEEQKYRDYYETELEAYPEDEGIEQQLDQQEVLLSGRYDPKLYDFQEGYTKNPEDDQTSLIEKKAFKFRYRLANETSETFQRRNNRMVERQIKRFQQPQYKHAFEQLQKNIAISSNSGNALHSEYGYLELLSNESVQLYKDYYESDAEEDFKVFENLSSKEKLVMIANFENNLLPKYDRSEVHLIPKRQWEPAFGVWENFLYDITEYASFIAPRGKEIAADYQIQSAIPLTKEELIEAGLYKETIEKKVEPKLEAKKQTKSE 637 F F Eukaryota T 8gym 60 HB,LH w,W Q23TE5_TETTS Transmembrane protein, putative MVFHYTNFVQETNAWWLRRVRPVYCTVLAYYGWWLYDRYYLFGKNATQDIRKDTTEVWEKRAALNKRNWGYNAHYKPELERSMKKVLYADPNYKFPIEWPERYMAETKTLEQVMDEEENWEYYK 124 T 4.1 GTA_holin_3TM pdbhh F Eukaryota T 8gym 61 IB,MH x,X Q22W32_TETTS Transmembrane protein, putative MEPFGTDERNWTHEEKDIITRFLKYDKHVNLKTAEMVYSAEVESAYFGKAGALAGGVISALFFNFPIVRNLPIIRRSVIGVLPFLYCYTWGKNTQEELRWLKTFAAYQRFVVYHGQHCKLWV 122 T 3.1 Pepsin-I3 pdbhh F Eukaryota T 8gym 62 JB,NH y,Y I7M9E7_TETTS Lysozyme MAQTAHQNRYQGGLCYAQCNELFSFWNPSIQQCWKGCDFGVGRVNDPEGRIEAQQMCKRWAAELYWTYKGELDTIKDLRVHADMYPTTPQNVYRACLAGVRRQKF 105 T 0.59 BSMAP pdbhh F Eukaryota T 8gym 63 KB,OH y0,Y0 Q950Y0_TETTH Ymf70 MFRWLFLYWYNSTDTPSAIAKVNLWSYINLRLFKARLSSSIAYYILGLNNLELKKLKIFYKNTYFDYIYLKSIPCLFLIIFFTNLYLFL 89 T 37 DUF5784 pdbhh F Eukaryota T 8gym 64 LB,PH y5,Y5 Q951A7_TETTH Ymf75 MFLGIFKDVIKLLNKKVVPVYFWFFLYCFLSTMDTNIFVSSCSFLKVEVFGKDENTTLVLLFYVFYSLFNFYLSRIKNKNNYLVRKHLYTTELLIELILFKYKLIILKFSSIKYILNFNVRKFILFNLFLINNYKAYKINTFFLYIYIYLNNLNIIWYPIFKAYSIFGYYKSTRLNFIDTKNENIKRIKY 190 T 1.8 PDH_E1_M pdb F Eukaryota T 8gym 65 MB,QH y7,Y7 Q950Y7_TETTH Ymf67 MTALFLHILWSISYIIINILYIFLSLLLSNNNEKIKQYNSNYFIKILLVLFYNKNLSFYKNLLSEDEISKIEFERLKNYPTLVLIHSNLNKLEKRNKIINSFINFKTKYRFYKFISTNFNLQTIIKNCNDKIIFSTLLYIVNLNYSFFYKTIKNTDLIVYLLANKFSILNDNIIVSKFNISKFNDYIKYINNTNSIDTYLENQIILGLNNNTNSNITKNINTKLLNSYSNLKNLVNITNNTFYLKKINDNYNTVINSEFLTYLKSNYKISFSASNIVKYLSDKSVNNSVILYLRKNKIFNKSRYSRNRQTYRTGAYWCLYVNIIAVVAFYFWFYKFTMNFGYLWWLLYSLILSFFFSRALKHRFYNPLNVMTEFKNGFMWFIIILINIFKPLLKLLENNYINLYNHLVIKYYQSFICNTLINKKKLEFNYILSSFKFIKELNNIIIISLNKLF 453 T 0.0058 NUFIP1 pdbpercent F Eukaryota T 8gym 66 NB,RH z,Z I7LTF1_TETTS ABC transporter MSSDPFKKVERDYHNERSVHKHFASYPLKFWWGLNKFETIQGIHSILGNAADLVVSTLSFIPGVQGRNNASYIENSIRVTRFRGFDDKTQ 90 T 0.14 DUF5493 pdbpssm F Eukaryota T 8gym 67 OB,SH z1,Z1 COXTT28 MAARDFEYNNQDVNQLNGAFISLVEDEKIGFWVGVGGFAYSQFIMRKFVKSTNIFASVTSLFAGAALANLYTHQSRASYARVAARANRNASLALNKLMEY 100 T 0.02 DUF1689 pdbhh F T 8gym 69 QB,UH 2b,2B Q951B2_TETTH NADH dehydrogenase subunit 2 MSIFSNIWINNDLNSYGLSILLLNIINYLIVFMLILSVILLTNLSKFKSLNQFKEFNSYNFILYSLIFSLLSMAGIPPLLGFTGKFLAILYSSFKSQYLLILFMTILNIFGMYFYIQNLRFVVKKNKSSILNYKNYYVNINYSITLNIILLNFFNFFGILFLSDLIIILNYISSYIYI 178 F F Eukaryota T 8gym 70 RB,VH 4l,4L Q950Z5_TETTH Ymf58 MLTWISFWSLIFWLILIILVLKPKNFISILFMSELTWLALYCLSLLFGAIYCDITLLSISFFILGVAGLEFSFGILIAILYKNLNESLNTDLNNNNNNQNIFDKNFKTPLEKINWQ 116 T 0.0017 Oxidored_q2 pdbhh F Eukaryota T 8gym 71 SB,WH 5b,5B Q951C2_TETTH Ymf57 MLKNKLIKFKFFRFVQSGFYVDFIFKKFSEMFIRNIFIYSSIFFGEKFMIEYLTKKTIDSFIFNNNRFNFINLVESKYFLQILTLILYLFFITIFILFYI 100 T 29 HEPN_AbiV pdbhh F Eukaryota T 8gym 72 TB,XH a1,A1 I7MI60_TETTS Transmembrane protein, putative MVNTAYPTPLKTILKTTPAFVVYFVFGLGFSTVIYDVVYHPKDRIERFYFRSSKFERLSRKRDEKLRHYFKPAIEWQPWYNTSTNNNTRPLLRY 94 T 0.15 KCT2 pdbhh F Eukaryota T 8gym 74 VB,ZH a3,A3 I7M9B3_TETTS Transmembrane protein, putative MSNNNQGDFFVDKYNFSRRVVDHRQPYDLNFSINNPVGSRVWFKAWKQKAIGNFLNLVGVHYAFYGAGFCLLFVLADAWGREKYAQPYKSQILHGRQPFGHTFVQNYRNQATDLGRWNHNFACYEKQPGCGRDFD 135 T 8.3 DUF983 pdbhh F Eukaryota T 8gym 76 BI,XB A6,a6 I7M2Y3_TETTS NADH dehydrogenase, putative MNHYWGSSNTIPASSTQNNNYFSGGGNNVTIRGNEIMERLPSQTPSQNMVQASMKTLRFYRKFCRLIPFILRIHNIGTKFTAQQAMINFGNYIRERNHYRDPGLIDHRIQLGYELLYEAEMHFSQHTILMQYLSPYNTPLSDRGYSYLEKVKYGNKSKFLQGFYKGNKPTEF 172 T 0.00098 Complex1_LYR_1 pdbhh F Eukaryota T 8gym 77 CI,YB A7,a7 I7MIJ7_TETTS 37S ribosomal protein S25, mitochondrial MRKALERFNEIIFNPAIRWYQLPKPTVRRTRYPAPGSEPINREVHQIDYKTAFRDSPHNIRYHHEIHTSDQTYHSSYDPVGETTTERLVRYGYLNKDQVNNAEAVAAAAKEFQEKEKRSPSNNIIIDEISNSDKPITKENRESVAHHVRQQFEFFREVNAEEVWSVSIEEKYNPELYIYKTYDMAADDPVWRQVKLDLEWTFENIAERRESLGYMPTFKGDPNFWQALDNSFSPENIAQVQSSIGDKVTNIDTKALALNHQTEEYHKTSKLVYPIRTNLVVE 282 T 1.1 Synaptobrevin pdb F Eukaryota T 8gym 83 EC,II am,AM I7M2U4_TETTS NDUA13 MQFFRPDFIATQVLRRADMAHSPFHKAIHDLEDKRSKLFPDRRRIPGRKAKLLLAASLLLQMWGVGKIIEIKKFMKRRDIELKGLQRKAAPFMQSMNDVRHLALRERNDMLYNELLSVHGEEYAQKMQKRFHQTDIWAPFRHRYAYMYNSSNKNVKDYKQVTLSRYINGFDKFNV 175 T 0.00014 GRIM-19 pdbpercent F Eukaryota T 8gym 84 FC,JI an,AN Q24F24_TETTS Transmembrane protein, putative MELNSSAKEDSHYVGVLGYPSQHDPHTLHPKKHDSTFTKVYACRDMLWDHHWEVRNTLYAGFKGALLGVAYASGFGLISKTVPSIVLKKMFRFVRNNNFGHIRIMQDLLTPYALTGFGLGSVYYLYQHNVWENRSNKWLAEVLSNALFFQVATAVCVNPGFHIYGMVGGILFGTLKYAFYNSSFFQEKESIGSYTTFGDLSEEERKKQEYKDYIQFLGNYHKVRNGQLVDL 231 T 9.3 ENOD93 pdbhh F Eukaryota T 8gym 85 GC,KI b2,B2 I7MG29_TETTS NDUB2 MSLRKGTSIFSRQFKKAFNDAKYQNLTAAQGETYSHLGWISNVDLRLGRAIFTFGVVGIAFCIYLEPSYFHETFGHMSQPPKYDLIDSNINGVEKKLNKQILHREHNEHKLDGFVSMFKGSDVAKN 126 T 6 Biopterin_H pdbhh F Eukaryota T 8gym 86 HC,LI b3,B3 A4VD20_TETTS Transmembrane protein, putative MNSPQKVAQGAGRKLFKHYINENIKSNNEQKLFFYRVNRWRWNTKDNTTAPKFLRLKYPLLVTGVCLFAYDWTYGFTQVDAHH 83 T 0.086 RGS pdbpssm F Eukaryota T 8gym 87 IC,MI b4,B4 NDUB4 MALRRILANQAKLNQKLAQTNRNYHYDFCGRAYMGNPAVQSPPKEFFNYHYVPDNYPDALSGFRIAYRDPFEVQHVFAYENWEYQYDGQWWSMGSLACNVLFFCTPLFLYLILQVEELNSEKRGGTSNKFYHNNAGFFHFQIYDNKQ 147 T 1.9 DUF3930 pdbhh F T 8gym 88 JC,NI b6,B6 Q231G0_TETTS NDUB6 MLLIEMAFNAMKMKIFSLRKIKVKSKEQYLYNYQQKLLILGQGKEKNNKQYKKDIEMGGFQKYPIPRYLHVGQWIVNKNWKWNTFHMFFPTAILCFMVWRNSMISTAKPPNYGEYVDPQSPVAPKAIKY 129 T 0.0021 TMEM117 pdb F Eukaryota T 8gym 90 LC,PI b8,B8 I7M855_TETTS NDUB8 MALRRVLKNQFNLIHKGQAQAVRGGHGWDRPDVPLSFNPLYVHKRELSIFDTNMWMYDQVYPEYVISYNEIHLVDQWKGLKESFSQSAYWWAMMAMVFGFYFINTTPRQLGIDTNDLKGFLGEYYGQYKKRSGIRSNFLGLDVTGENSIIQPNYDRKNGIRDVIDSLNADAGKRKLINLEAKNFIERVEKECEQRILKKGGATQSHH 207 T 0.23 Spore_YhaL pdbpercent F Eukaryota T 8gym 91 MC,QI b9,B9 Q233X7_TETTS NADH dehydrogenase [ubiquinone] 1 alpha subcomplex subunit 10, mitochondrial MSKAYYFVKNFSWAEVSNLLCYGTKYPTVLNHQQKVTRLYRATLRRVYAHQVEGYKTDFKQYNENITDIGKDFNKMLALKPESLELQAYFKKYEDLQEELFDPAMIIDESRPYAASSGRYYIFDDYLLKFDPFGFYSPKLLSENRPEEAMPFYEDYPQNDSHWNLWEQFPEDFEDSNAEREAILKSNKH 189 T 0.005 Complex1_LYR pdbpssm F Eukaryota T 8gym 92 NC,RI bl,BL Q23KG0_TETTS NDUB10 MAFGGFRQTDNSLIIDDRRKIILNTRSLNDFQQKIYLRNFFTNYRPDLSSYDYFAFKEKLRIGELFLNEYRKRINNEVRRAAILTPTSSLREKMNHKIADQILDLSSPHVRGAHFQAVRSWTDASKIVNYVEEKQTKINKYGLQFPLLGNMTEEQCASKEDEVYQRLLKEMQKPPKKASEPVEESSDE 188 T 0.12 CMV_1a pdb F Eukaryota T 8gym 93 OC,SI bm,BM Q22Z32_TETTS Transmembrane protein, putative MNPRNIFNLAKKVQNFNSITQKAFKRFGGAAAHHDDHHDDHHGHGGHGYEVHLVKDKNLIGNKSFKDDLVAVYGFTDVNDHHHHDETDPYHHLRGVPTLSFERMYFADAYYHDDTHEGLMNEPHGYLTMDDPMDLRPNYEKSALELLFLVSGGAILALMLGYQGLNLANPAESLFSLNTAAEEIEDKIRQIRIDNDKLLQRKAQLEEELASLNN 214 T 0.022 MctB pdbhh F Eukaryota T 8gym 94 PC,TI c4,C4 Q22W63_TETTS Complex I-MNLL MSSMLIWGACFGLFTRAAACKASMIPLTTSPWKYPKYMIVSAVTFYYFDWYRRMALEQLCYNEEKLERYQIRAKLQSLKIGEELSDAYRESFFEHAVQKNNI 102 T 0.0018 NDUF_C2 pdbhh F Eukaryota T 8gym 105 AD,EJ n6,N6 Q950Y2_TETTH Ymf62 MFLITITSYFSNIIEFNSYIINLIDFITPLFFIENFVIQFFILYLFYLLIVNNNLYYILLYIFLEIVFFGLFLCLYQLELFTGFLWVAEFAIVFIAVVLLFYLNIDGLHLKYNHNINNVLYYTPSLVLFLIFFNIDYFSELELFLPLELSFIDIYDDYYEGFNNSIMNDFTPLTLSYYSINSAEFIIIGLLLLLGSVACVNLYKSNKNYTIVKQSNLLTMFDFFKDFINFSFIRKQDLNNQTNFNPSLRSIKKKY 255 T 8.3 DUF2070 pdbpssm F Eukaryota T 8gym 106 BD,FJ p1,P1 Q24C39_TETTS Transmembrane protein, putative MIARRLFKRSLYYIPRAGFGGGDIRHKFSNEITDDDYDYQRAMHVKPPKEESLFQLTNILSSVPVFKTRFFLDFIARNLDTNSAVSTSDFVAPPRVHENSFFVYHSRELGNVIRKYRSLESIVLPGALLTFTYPLFAAFVAIPSYYFMFNAKIYEMSRRFVVRMDVLPHLEMISVQRIGAFGILYTKLHRIQDLEYVPFDQVKEQENYLWAIGGHGVDNQLIFKDRSTGEFFYFERQGVWDAKGLNHPLLN 251 T 0.7 TMEM70 pdbhh F Eukaryota T 8gym 107 CD,GJ p2,P2 Q23KE0_TETTS NDUPH2 MFNILKGAQLSFRSITNKSVNNYYNIMRQVSLDSNPIVLYQSSTFTGNGLQEFYENADALTKYLKLVPFFLEKNLYDHPKQFVIKMEFHPQNKVLSLDCLTHQGVLKKTVNLENLIPVPYEDYVQFCRRKLFNAPLFLDTEMIYFNTFQNEFYVFDKNAKWNEEGINHPELDISKLYNEKAWFDSLRII 189 T 0.26 TMEM70 pdbhh F Eukaryota T 8gym 114 JD,NJ,VD,YJ qG,QG,qg,Qg Q23F81_TETTS Sulphotransf domain-containing protein MVRLEKILWEQLVNVKAFSRQRVIGAPSKWYNENRTEWFKVAQHNAFNTGFSGVILRALEPLLAKFIYRWRLDIAHQRGLTLEDSLLFMDRELRRCYFFETVARQNLHPYTVLFMKKRRARYYKVERGLRGFYVPDWVRKEAEERQLSETVDNIFNWENFVYREYMSDMTPIGRWTSLSKITPLDMFQYYGLFRNEAWDRFFYNEAFYESYSEKEKQEANGNPFGKFNLQTADGRAQFEKEVNTFIERYPFAVTKPGQKFDFTRFYALEDLANKRDTSKYDPALLESVKNELKQSAALPADNGANKTKKSKPILPDWLQPKFGKAFQA 328 T 0.69 DUF6322 pdb F Eukaryota T 8gym 115 KD,OJ,WD,ZJ qH,QH,qh,Qh I7M484_TETTS Transmembrane protein, putative MNVTGAGLTHVKDFHSDEMRVFRGGLRHIADKQGNLIYGSVNSSVRYYHDKMSYERGFIQHSRSPSNQFINFHFMLGGFRTYVLERFFKQVWYRRNIRTFWFPVLISYTSGCITMRMYDNNCYDYFYFSD 130 T 1.6 DUF5320 pdbpssm F Eukaryota T 8gym 116 AK,LD,PJ,XD Qi,qI,QI,qi I7MM45_TETTS Transmembrane protein, putative MVYGKLIFNNIKEYTPSWIKTIPYSQVTKPILRKQPQIVGKINADPKVKKFWVFLRENVQYYPFLWQFFILGTSFVWFHVCYDPWLAIYQANNAHRSLETALTKEKAHKKKLAEQEESE 119 T 2 Selenoprotein_S pdbhh F Eukaryota T 8gym 117 BK,MD,QJ,YD Qj,qJ,QJ,qj I7MFL6_TETTS Transmembrane protein, putative MYLPTFYKLFHETNAFRLKRYVGYGPLLLTWSIWTLYPALYNMIYSDFIPPERGVPKRIVDA 62 T 1.5 DUF5621 pdbhh F Eukaryota T 8gym 118 CK,ND,RJ,ZD Ql,qL,QL,ql UQCRTT2 MAPVFLKALRYVIYSYPLYVCYLIKQAQINAQGSEKEEEHH 41 T 2.8 DUF5392 pdbhh F T 8gym 119 AE,DK,KL,LL,ML,OD qm,Qm,u2,U2,QM,qM Unknown peptide XXXXXXXXXXXXXXXXX 17 F F F 8gym 124 FE,IK s5,S5 W7X4R4_TETTS GRAM domain protein MSYSGYSLNGGVHPCLPFYERMLQCAKSEALPIKMCTAQTEDYLECHHRKKQYALNYAIKKELNNIRIVALPRYDEENDTFVPFSQATADHIFQ 94 F F Eukaryota T 8gym 128 JE,MK t1,T1 Q22E24_TETTS Lipid-A-disaccharide synthase MLTHISRRYFSFTGRKTIFVAAGSPSHDLQAANFMRDLKKKSNNNYDFVGIGGPLMQAEGLNQSYADINKFIDKPFFPLKNFIRFHVARCYHPYMAPLHFFNKQVLNQVDKSSLLKDQVELSIPSAIITFGNEFFMKKLYVRLCDQYELHNKIRPPTFFYDRSHINQRFEFQDYLDHFFYTIPMKQINFQSFTYPSTCVGHEGVGRAIQYLFQNSKQYANVKSLVTANGLKIASNPKQHREIIEKLVEEQRGIQRARLGINESKNVFLLAPGNTKAEINFAVNLLSRSLEEFFKKPQLTNVSRDHFTIIITADNAQNAEFVNQAVSNTKYLKTLQTIVTTGEKEKFGAMCAADVGIPLNGELVSECAALQLPSVIISNMNLFYAYITQLYNNFYSDINFAIQGEAYHELVSTAANPYKLSDEIFDLYSDPKLRYHFAERYQNVVHEMIPQANSQDNIVTTDVATLHGVEVQERAFTYETIAAKVLKAARAYESLDKNIPNHQIDQHRKEKLIKAAF 516 T 8.5E-42 LpxB pdbpssm F Eukaryota T 8gym 130 LE,OK t3,T3 I7LUQ4_TETTS RNase III domain-containing protein MSGLLRNFEKLVCQSQLSKAGHKLLLRSPNSTLHPTAFYYKRNSSQRLANEMDVFQLGLAAAALTRQANNYAQLLDQVDKEAVREEVQERITQNHSDLNVYFGEILSLFKIGKKECPVQTVADISYVLAFGPIQVPNAAAIITENLLPVLKEKLDYASIHNLQDILSAFVKLNYVSDKELLKRLITALSQKDFPNQLQPVTNHAWNIDQYEYSDCNSWNIVSCGDNTFEKYIHEGGCENSLAKAKFAVHELLDHISFNFVNPFLFRENRINHRFAKRNADLDHEVLMQTLSKLQEIVPETSEAIATIKARL 311 T 0.034 Baculo_F pdb F Eukaryota T 8gym 131 ME,PK t4,T4 I7MIE0_TETTS Transmembrane protein MGGDHHHEDSHHKSNVDQHELKAEMIKELSHYYDHHDLSLFGKVQHFVEHLLEEKHHAKINTSNFDQKKLENFSESKQISRTVFALKKIKTFNHDFFTSEEEMILEPLPLGILTYGLKYAFAGVDAALLTYFWRNWNFNVRTIGLLGGLVGIQMATLHIPNLVNEVVIQTPRRRALAKKYISAYGPQFFHDIVNPKYDIEHLRHLQNKLNPY 212 T 0.17 PfUIS3 pdb F Eukaryota T 8gym 132 NE,QK t5,T5 I7LT77_TETTS Transmembrane protein, putative MFLYKKILSIYKQSFSFFLSFNFSFFLYALLAIFLLINFCQHIHKFLYYCKEKIQKEMQNAYPEITDQHREFLKKQGLKVYEPKPLPDQINPFSKTYWITNAFIIGVSFLARRHALKVGAPRIFWSGCIVGVPLAAIISRGKSDQLDELVGARKTLEQKLEYAPITRRAWERALATNQEYQNEIKTQIQDLQAEIAAKKVAAKLE 205 T 0.02 FlxA pdb F Eukaryota T 8gym 134 PE,SK t7,T7 Q22HE4_TETTS Transmembrane protein, putative MTNFGSPFRNTDSGIVIRDPENEKRLKLAFQNFWKSKQEDKEFQAQIKTAVSKDTVNFMFYASPLFGALLGKTYIDMFCNPRYFYFRAFTLSMFALAGYCVGNGFRNRYEHSLYTRNYHLFPKDLQDALVNGDARYCISWWKQ 143 T 0.053 AHH pdbpssm F Eukaryota T 8gym 135 QE,TK t8,T8 Q22SC4_TETTS PH domain-containing protein MTHQFENVLLSNRKNLTPQESVQKVINYALLQDAKQRSRTLRHIKASWVIPALLFTYPAWYLAKGAVNGVWSNIHPTDKVTLSFANIGRPFRLIYRPEIFLRDQQAKFIQLEKEHIEKSKKGEFVETTSPLVLWN 135 T 4 DUF1852 pdbhh F Eukaryota T 8gym 136 RE,UK t9,T9 Q23B10_TETTS NADH dehydrogenase [ubiquinone] 1 alpha subcomplex subunit 8 MFLNPVKDSEFDDEVKGFVPSEGEVRFVANKNKECGYYLQGIEQCRRKMVQLAGDSSSQFHSLGFLPCKRLVDAHYRCMTDDKFGSTIEEVPEIGLDSAQKFFDCTFQQLKPMQSCRRFFDQVVRDVYRANGSQLI 136 T 0.23 COX17 pdb F Eukaryota T 8gym 137 SE,VK ta,TA I7MAF0_TETTS NADH dehydrogenase [ubiquinone] 1 beta subcomplex subunit 4 MLARTLKNYMRVQQNLRFSRANIEKSPAPPTIGQVELEPFKFNHERDQLIYGYTMEELYGKKFGLKHSATVLREIKKDTIMMILFIIGGFTYCYHMRETRFQLDDDFNEYVNTNKQTFRPIPDHVKL 127 T 0.1 Viral_Beta_CD pdbpssm F Eukaryota T 8gym 138 TE,WK tb,TB Q22T55_TETTS Transmembrane protein, putative MFWRNVVRGLNCQQALRRQNFAKNITTTDIPKDSHHFAAKRSGFTQTEQAPFAYNDVYQYPKDYKPWNYNYKGNGVLLALFLGSAFSLVAYERSYASKTGRYQRKVQQNYYQI 113 T 0.055 Ncstrn_small pdbpercent F Eukaryota T 8gym 139 UE,XK tc,TC Q22E95_TETTS ATP synthase subunit e, mitochondrial MVYQGFKVLRRNPTFYNPRSAGMVALSYFAYSYYVNKYYKPQNSNFEEYNSSHPHNHDEKVRQYHEKTNQAIRDAVLEKRAEHDQRLREEAKL 93 T 0.015 Hrs_helical pdbhh F Eukaryota T 8gym 140 VE,YK td,TD Q22DC2_TETTS Transmembrane protein, putative MNLPWFVRWGTDVALFFIPAYTFANYPTTFFVFAAEKRRQRRRKDFSDVKLRDDAAFSVDQVKQLQTKLHLKQ 73 T 1.6 HIND pdbhh F Eukaryota T 8gym 141 WE,ZK te,TE I7MIK1_TETTS Transmembrane protein, putative MDKYIQQAKCAYNFSLKAVRFVGPLNIVFAGVAFLMFYENNYKKLYLNPRYSYTMPYLQSAKITKNLYEKL 71 T 0.21 YpmT pdbhh F Eukaryota T 8gym 142 AL,XE TF,tf NDUTT15 MNNLKGSNCLVQNVAFNFSQRGRDYTPSNKKYLQPWELERKEYVELSLAIQSAYSCKMLSEILKDNLYMLTDYQLSFAMFHLWNHEIPIDNYFYNVISPILKEYITRFDRECNKSLAEIATFLGRMNVQDDALWKVIETKLVQERLYRYIPLNDLIDLAHGMATANRGSQEFYNIVENVIIKHRLRLIPDKIAVAKDCFTARKIGSPLLYQVLENPQAEAHELAGLKEHEQLKISG 236 T 0.83 FAST_1 pdbhh F T 8gym 143 BL,YE TG,tg NDUTT16 MASQLQREQKLVQSLQQESLQPHLFKIIVDSQSDLVCEADRREYIKHYTRANEKSSTSQLLQVGALLGYIYAVGRYVSNPSTRKFSYGLAALLGSFSLLNPSKNLHHNHSLREIYSKYNISTNPQALEILKSRIY 135 T 22 ApoO pdbhh F T 8gym 144 CL,ZE TH,th NDUTT17 MNISYTGLKLEDYSDEVIRKYKFPNSNELERFLNREQTLTVQQHKSAIKLAQQDFFAVAGLLSVGSLSYIFYNSVGGKVIRDRIRASMPFPKRVLVQVLPFVALGTALIISRRGIEGHNHGYKQ 124 T 0.12 EMC3_TMCO1 pdbpercent F T 8gym 149 HL,IL C,c COXTT3 MSALLKEILALTVKSEAALWKGAEQKVLSGLNNLAKTELVQITHHFGVNKQGSEALWSQLDKAAVGAFPELSVDETLQLIDGFGECPDSYTLSHDLNQRLLVSWEQLGKLNFQKLKETNPYFASDIVNQLDAAAAEFIKVRPAAESEAGGFLNSLGVSSSFNTTKNDIYVVQSASGKKLNNKEQREAYVLEKAQKYLKEDPQSKILDIIAQK 212 T 1.3 A_thal_3526 pdb F T 8gym 150 JL,NL u1,U1 Unknown peptide XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 92 F F F 8gz0 1 A,B A,B Q5SH57_THET8 hypothetical protein TTHA1873 MGNYLEDCATVDVQARPTAYALAISSLGEFNSLTGGTSTDPVAEGNDYYYRFEIRAWEGSSGPQTNVTLNVTRTLGNSTFAGSGTKGVDFEVELDPDGPFGPASYAPVLSADVQVLAWGPTGVQLRYLPSLAPGATLRFSLRANAVNGTNTTVQADATSTEAPGPYTVFETTTIIP 176 T 0.021 CRISPR_assoc pdb F Bacteria T 8gz3 3 C L 7D5 Fab light chain XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 211 F F F 8gz3 4 D H 7D5 Fab heavy chain XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 215 F F F 8gze 2 C,D E,D S39A7_HUMAN HISTIDINE-RICH MEMBRANE PROTEIN KE4,REALLY INTERESTING NEW GENE 5 PROTEIN,SOLUTE CARRIER FAMILY 39 MEMBER 7,ZRT-,IRT-LIKE PROTEIN 7,ZIP7 GHTHE 5 T 1.9 Zip unppercent F Eukaryota F 8gzu 1 A,BC,BO,YL 00,55,A,a Q22PJ5_TETTS Transmembrane protein, putative MLSKVTRRFLNYNQIYCFASQHGAEHHKLTASDEAYLNEVRQRYVTPDMEKWAYLDYKKHPSTTLSHYDHKSKDYVESERDDYNADVATNSHNKLIDDFKRNLQMQRKVHDILQKMDRPYLRGVPGVTKNISAGLQDYSAPVSKKSQSDPNDFYRDAYRNENRWIDQSVFTPKTSKMTHYDVEWPKELASRPVTKKFHHDKGYKYDVTTPYDQRYNYVADRLGHPEILGNPFERLMRLEGDIYHPNYLDQPFVKVPNANPNASLNFEEGEVLYENTRLLEWAKFWNYSVVVGYLWCAYFVPYNIFFKTHMPLEHAYDNLFFPYFQHTHFLWDNNALHIPTVGGVAIYATYIALSYINNIWKDYVVRAQFSKDKELLFVTRVSPFGTTEEEVYEVAHLEHLPPSVRSGVKDLSAQDADGLVDVTCMSSQRSLVFYKGDQYWNPKVYNDFINQTSNLWTRNYTGYNRLEVQNSVEQVKIGFSHSSQPKLEKK 490 T 0.015 TMEM70 pdbhh F Eukaryota T 8gzu 2 B,CC,CO,ZL 01,56,B,b Q22FX8_TETTS Protein phosphatase 2C, putative MFRRIISNGALLSTQTQRWQDLSKFACLRASLNKESEKAFQELAKKNNVSPQELVELSKIVSMNLDVLKQNINSEQFLLEKESTLKRYRQSSIGTRGHLQTVNEAVNTKYPTLAEGLGQVAGYKEAYQALREIFVHPSISVNNLRQGSYGQQFAVDFRTRADEYVKALLKDHSSNPQAVQTIQEIQHTLHQIIKNYEQNPASIYARILTVLQTRGVNTLPVSKTADQKAVATIQKTSTPSLTIDQLTVPVQERVQTQTVFDAELAFIKEANEMIQQNTGNLPWDGGKKKIFQGQANKYLETPYYLLAALSGLGLLYFLYSGDAKYKTLVLTPVVGIAAFVLLRRNQILNRVPTLTELFLHKDGKFVDAVVSVNGQLISKNDIPVSTLKLYRGDHTVKVNLNDFEDASAKKFLAQQSGQEGVINVHFSKLRNLAARNGQVLNLGDTEVVVPFENQANRIILKQIFKGVEVLPSS 473 T 0.011 Rh5 pdb F Eukaryota T 8gzu 3 BM,C,DC,EO c,02,57,C COXTT3 MSALLKEILALTVKSEAALWKGAEQKVLSGLNNLAKTELVQITHHFGVNKQGSEALWSQLDKAAVGAFPELSVDETLQLIDGFGECPDSYTLSHDLNQRLLVSWEQLGKLNFQKLKETNPYFASDIVNQLDAAAAEFIKVRPAAESEAGGFLNSLGVSSSFNTTKNDIYVVQSASGKKLNNKEQREAYVLEKAQKYLKEDPQSKILDIIAQK 212 T 1.3 A_thal_3526 pdb F T 8gzu 6 F,HC,HM,KO 05,60,f,F Q23DG8_TETTS Transmembrane protein, putative MRYLKIEKEKLVSCKKQEQEVQRIRRRKGNQKLNSIAKQQRVKRRDYQQNIKQNKEVKNPKKLIKQQIINKVKKRKKMFRGLTKFNKVFALNSFKNSLVAVPKANLNHVQNMLEENLKYDAQKYNDEVAVIQKTSRIYKPTYTIEFNREGEVLVYSADPIKNSVVYFKYPYVLYEAAIPLFIWAWIYNPLELSKNAVNSLLIYPNIAWIPRMWYWRSLQYKIQKMYLLRGGKVAKIETQSLAGDRFTSWVETYQFHPLTQDQKNFDNQDNAEFLEDEGQLKYELGVQLDNLQEMGTTSQDIVINFMKEGTVHHPELFEAIVKGYNIDTSDYVINTANNLRAREGNHNH 348 T 0.023 TMEM70 pdbhh F Eukaryota T 8gzu 7 G,IC,JM,MO 06,61,g,G Q23DZ5_TETTS NADH dehydrogenase [ubiquinone] 1 alpha subcomplex subunit 8, mitochondrial MFLNRLVKETSKAKRLFSMAQNNFARAGPYNPNRYKDYYIPRTLPKNEEIVEFVQSQHSVPASPIRNQRHINPVRESGPLPSYDGTYTMEDIRAVFYNTTVGRDYCYCQMDPEEIMRRVPGITRKEAEFITKLGLSPQEQVDFAYIAYNIGLDIFYFTNQMFVARQVVTNSKGEKVEVLWNAQCYEDIAQLNVGFAPVLESVDYHWEIFLWADPPIKPNNDFDLNVPCTWFEYEQEWWMESCIQEDQFNLPEDERPYNTPRNPHCRKELWRSQDALQEEELMVNENWYPKNTQYNIYNQPDFIKPKSGSGAAADDIRI 318 T 5.7 IMS_HHH pdbhh F Eukaryota T 8gzu 9 I,KC,LM,OO 08,63,i,I I7LY65_TETTS COXTT9 MVYHLFERICNPDNFKLSGEAARVRTLIAAGFSKEEAEQVAWLQNHQVNGKILGLFTGGFALYCCNNYFHYFERYFPRLRYQPFTKFLAQAATVYFFFKIGDYYFTSRRYGSNDARMNGLMYSNTYYSTNKEALIQNFEPLNRKFTEEEVEQFLRNEGRSQEEKRNWIYNPHIHGSTEGEWKADIHEKFDSGKAPWEREHVKAKILETNKAKIDAGEEIQLKPFKTLNHLDKTGLLHRLHPFIWTNNWTLLG 252 T 0.19 Bac_luciferase pdbpssm F Eukaryota T 8gzu 10 J,LC,MM,PO 09,64,j,J I7MD70_TETTS COXTT10 MSSFIQYEFLKIYQGNQKIKNYYKRKRLIFQQKKVLKKKQKEIQMSTNNLRLKPWFHWTDEERSHAIFSAYEKRILKSEDLPSFLRANRINNVSTWVFPLIALPLFNQSIFKLGFAQRILLTRPAIEWHCFKIATVAASWLAWLNFSPFYRKLENEKEYLLDTLESRIGINVLDLNDALPRWTTSQEYNRRTQQLYNQRNGFFAGLLYPQEESSRPLVDIASFPKNLHKEKLTK 234 T 2.9 TFA2_Winged_2 pdbhh F Eukaryota T 8gzu 14 N,QN,SP,TB 0D,4A,4a,48 Q24C97_TETTS Phage protein MMQNLKKFMSKTIQVQPVSFNQIPKAFYNFPEYRTGGVQANPGITAKRIIKCIGERLRKYDPARWENVPITFKTHFRDENGYSDVATSIQIHDALEREFGIDIKDRLALVTDVETAFYIVMSHHDPL 127 T 0.16 DUF1493 pdbhh F Eukaryota T 8gzu 15 KN,O,OP,UB y7,0E,Y7,49 Q950Y7_TETTH Ymf67 MTALFLHILWSISYIIINILYIFLSLLLSNNNEKIKQYNSNYFIKILLVLFYNKNLSFYKNLLSEDEISKIEFERLKNYPTLVLIHSNLNKLEKRNKIINSFINFKTKYRFYKFISTNFNLQTIIKNCNDKIIFSTLLYIVNLNYSFFYKTIKNTDLIVYLLANKFSILNDNIIVSKFNISKFNDYIKYINNTNSIDTYLENQIILGLNNNTNSNITKNINTKLLNSYSNLKNLVNITNNTFYLKKINDNYNTVINSEFLTYLKSNYKISFSASNIVKYLSDKSVNNSVILYLRKNKIFNKSRYSRNRQTYRTGAYWCLYVNIIAVVAFYFWFYKFTMNFGYLWWLLYSLILSFFFSRALKHRFYNPLNVMTEFKNGFMWFIIILINIFKPLLKLLENNYINLYNHLVIKYYQSFICNTLINKKKLEFNYILSSFKFIKELNNIIIISLNKLF 453 T 0.0058 NUFIP1 pdbpercent F Eukaryota T 8gzu 16 JN,NP,P,WB y5,Y5,0F,50 Q951A7_TETTH Ymf75 MFLGIFKDVIKLLNKKVVPVYFWFFLYCFLSTMDTNIFVSSCSFLKVEVFGKDENTTLVLLFYVFYSLFNFYLSRIKNKNNYLVRKHLYTTELLIELILFKYKLIILKFSSIKYILNFNVRKFILFNLFLINNYKAYKINTFFLYIYIYLNNLNIIWYPIFKAYSIFGYYKSTRLNFIDTKNENIKRIKY 190 T 1.8 PDH_E1_M pdb F Eukaryota T 8gzu 17 IN,MP,Q,XB y0,Y0,0G,51 Q950Y0_TETTH Ymf70 MFRWLFLYWYNSTDTPSAIAKVNLWSYINLRLFKARLSSSIAYYILGLNNLELKKLKIFYKNTYFDYIYLKSIPCLFLIIFFTNLYLFL 89 T 37 DUF5784 pdbhh F Eukaryota T 8gzu 18 MN,QP,R,YB z1,Z1,0H,52 COXTT28 MAARDFEYNNQDVNQLNGAFISLVEDEKIGFWVGVGGFAYSQFIMRKFVKSTNIFASVTSLFAGAALANLYTHQSRASYARVAARANRNASLALNKLMEY 100 T 0.02 DUF1689 pdbhh F T 8gzu 19 BN,EP,S,ZB u1,U1,0I,53 Unknown peptide XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 92 F F F 8gzu 20 AC,CG,CN,FP,HI,QF,T,VH 54,qm,u2,U2,Qm,qM,0J,QM unknown peptide XXXXXXXXXXXXXXXXX 17 F F F 8gzu 21 MC,NM,QO,U 65,k,K,10 W7X4J9_TETTS 39S ribosomal protein L9, mitochondrial MFGRLVLKQTRRTLFNPVLKNTFCIYQAYQNPLRHINTGHNPNNVYEDIVMLGDYPVQNRTHDKVISQTYVPAIANIAFTHLSKKYPQAGLKVDQLNTLKEKTWNDLGVNIEHEKQEILVELSEQIFVKESKLRWVHEQRQRLAHTTYVFSGLEFQNVKVGFFIDSYNFLLQELAHRSNLYQSKDIVGEKSFHEKHLEQQTAPYSGVKSLEEPVSQNKSFINSLMRAIHNH 231 T 27 FliD_C pdbhh F Eukaryota T 8gzu 22 NC,OM,RO,V 66,l,L,11 I7M3P9_TETTS Ubiquinol-cytochrome c reductase complex ubiquinone-binding protein QP-C IKGNQKKQKGKNQSNNNNNIREEGKQIKEMILPHNNRQLARQYFDSLPENDINRKYYEGLKYETPKTFFGRFLNQFNIDAKLDTLSKFYTYQKTIRATQAELQEDRKSYLTNSLLFTAVSWFSIYQFARKGAVLPVLREYGRYFGTHRLFRQYLHTLVLPLLYTEYALNQKYYTHMEHLWTVHVNRLNQKILEDPLYTFYPQELNVPKHNIIVPTIFRDTPQ 222 T 18 Plk4_PB2 pdbhh F Eukaryota T 8gzu 23 OC,PM,SO,W 67,m,M,12 W7WZP1_TETTS Transmembrane protein, putative MSLSLFGVKNNWHKNGIWWFSKILNKTVGEERYDALRVQRRIWSMRFYYARQQCLYELFVDHPDLAQWTGTYPKVDSSHGFPFYSTYEMYRDFQENTLNSDGSFAQWITLVCGIYVIHVIYNYMIPYYWVSTPLKNDEFTRLRMKDYIASTVLEEVYGISYAEWGWLPHDFAYNRMRGLAGYMHPDDPRAMCTSTFHRKHKYIEHEVEKVGDYHHMTYPK 220 T 61 Spore_YabQ pdbhh F Eukaryota T 8gzu 24 PC,TM,WO,X 68,n,N,13 I7LZX8_TETTS Transmembrane protein, putative MKKGTASEEELKKLYDPNTFYEHGDNPAFKQFMNIAVENLREGKLTDHRTYVVDTYKKWMYARNWDDFLQRDCKAITFPRAFALWIVGTLGMATASKWCRQILPVGSHGITKISQTQFFHQFGPLGTLGAVGFYGLTAYLYYKTTIFTVKKFYSHCILQEREWIFEQERQNPGYGEYFFKDVPLSAEEHFNDLARGEMAKKKFEKPNHEF 210 T 8.8 DUF4500 pdbhh F Eukaryota T 8gzu 25 QC,UM,XO,Y 69,o,O,14 Q23F08_TETTS Mobilization protein MKEKIFNELTRKMKRKEISAKIQREENKQILIRQRNNKKYIQSIQGIQQERKKGKLYLVEMATQNVEEMDTIQKMNYEATVNMGRQDLITREYTFYSDYEFIPIQEDRKQQMEDALNNLHKIIHPTVTQLKKKANVQEIQDRVFRKLQGWEGELNTCVFSAKNVRDSNFCADRFTNRINTEGVEFVKQILREY 193 T 0.13 DUF3221 pdbpssm F Eukaryota T 8gzu 26 RC,VM,YO,Z 70,p,P,15 I7M8Y9_TETTS YflT domain-containing protein MNNTFKFLHQVISKLTLKAQVPNYGQYSHSLKRPINPKVVVFGNSSRAYELISSQFRNFNHVNGLELKGQEDNIQANKVAQSVLSINDGFQDGYYITDFPQNSKQAERLDLITDGVNLALYIKDPSDKVTVTRQQEAIDYYRKTGALVEFEVDPRGDLEEQVKQLSNQVLNGYKH 175 T 0.11 PRORP pdbhh F Eukaryota T 8gzu 27 AA,SC,WM,ZO 16,71,q,Q Q23D87_TETTS Transmembrane protein, putative MDNNYHFWGNGDRQDVSLSYEDYYSILDCLLDEKLSPQGLMKFKNLHEVSMYGVSYVPLYCFPVAYGISHMLTGKVRRGHSGYRNLFSLMSVVLPFTCWYAYTTPIPRRLYTEIICSNNADGAYVRNRIKQQKPGIWRKLSQQLYNKNFRFPELNQDLTATEFPLDYVAPHKF 173 T 0.16 DUF2206 pdb F Eukaryota T 8gzu 28 AP,BA,TC,XM R,17,72,r I7MKT6_TETTS Transmembrane protein MVFEFLFYNQQHKTRNGYFINHDNLMLASLEERKKLIFYFIANQVPEKLDPVDRVKFNEELSDNLSTKARLIGSLTGLIGLVGFPYISTRIYSRPVLNIGLSLLICPFLYYVGNQLTYSVWEPKFIANNNTVCELSKKYNFTVFDFAQAKKEAHLKALRTELVSDNLLYSPGI 173 T 0.048 DUF1689 pdbhh F Eukaryota T 8gzu 29 BP,CA,UC,YM S,18,73,s Q230X6_TETTS Complex III subunit VII MAIRNFVFKISNQIQNLAAKRSLAYLNQIDSQSVPSRATINMKDQVTQMQREIDNMANVIRAQIPDEDRAEFEILKKYYVTGQHDSLVDPQDVLLQLDRIQVLKNLKMIELNEEAYDPELVRLEKLKARVLLEEEGALLEYAHFISKRPYNKPYEKWGVSEEHVKQQILG 170 T 0.35 APG6_N pdb F Eukaryota T 8gzu 30 CP,DA,VC,ZM T,19,74,t Q23VY4_TETTS Transmembrane protein, putative MGFETVVPAPPTRDDELRMIKATEEQFLQQPRYKLYMNEAHRIAKMNHGDRHNNIRAHFWSNFALGLLITGPIFIIPFGKAFRNLRSGVPYYFRPKYVFTQKNQYNQDRNWGAMKKQIPLWLGLSTAYAYWFTDFSINDDEWLEKGKVIYPHQTIKVL 158 T 0.091 MASE1 pdbpercent F Eukaryota T 8gzu 32 AN,DP,FA,WC u,U,20,75 Q22DP8_TETTS Transmembrane protein, putative MSCTTRRFIDEKEKLEYSRGYNQQELEASKLRKDFVKKYIVDFDTTLYKTQVERDWAYIAKREYRYEVQLKSIGYGGALANAVLLWRIYANKKMVFWPIPIVGALGYLYFQPVFFQKSNKRFFDMCNVGEEYYLGRERNKILRECNKILNVEDF 154 T 0.1 DUF559 pdb F Eukaryota T 8gzu 33 DN,GA,GP,XC v,21,V,76 I7MFV5_TETTS COXTT22 MGKDQLDFSHFDKAFENKYDIVAPEFGDLHQKRAEFIAKNQGTYRPVPLVPNNIKGLIPKTCRLPATRNWYRRTSSFERNGFFNIHTPVLNTKMIPWLLFIVLTWGWSSFQIGGYNYERFDDNGERRNTLYWKLSPVEFPQSKLWNRPS 149 T 0.056 TOM6p pdb F Eukaryota T 8gzu 34 FN,HA,JP,YC w,22,W,77 Q23TE5_TETTS Transmembrane protein, putative MVFHYTNFVQETNAWWLRRVRPVYCTVLAYYGWWLYDRYYLFGKNATQDIRKDTTEVWEKRAALNKRNWGYNAHYKPELERSMKKVLYADPNYKFPIEWPERYMAETKTLEQVMDEEENWEYYK 124 T 4.1 GTA_holin_3TM pdbhh F Eukaryota T 8gzu 35 GN,IA,KP,ZC x,23,X,78 Q22W32_TETTS Transmembrane protein, putative MEPFGTDERNWTHEEKDIITRFLKYDKHVNLKTAEMVYSAEVESAYFGKAGALAGGVISALFFNFPIVRNLPIIRRSVIGVLPFLYCYTWGKNTQEELRWLKTFAAYQRFVVYHGQHCKLWV 122 T 3.1 Pepsin-I3 pdbhh F Eukaryota T 8gzu 36 AD,HN,JA,LP 79,y,24,Y I7M9E7_TETTS Lysozyme MAQTAHQNRYQGGLCYAQCNELFSFWNPSIQQCWKGCDFGVGRVNDPEGRIEAQQMCKRWAAELYWTYKGELDTIKDLRVHADMYPTTPQNVYRACLAGVRRQKF 105 T 0.59 BSMAP pdbhh F Eukaryota T 8gzu 37 BD,KA,LN,PP 80,25,z,Z I7LTF1_TETTS ABC transporter MSSDPFKKVERDYHNERSVHKHFASYPLKFWWGLNKFETIQGIHSILGNAADLVVSTLSFIPGVQGRNNASYIENSIRVTRFRGFDDKTQ 90 T 0.14 DUF5493 pdbpssm F Eukaryota T 8gzu 40 ED,EM,HO,NA 83,c3,C3,28 Q950Y6_TETTH Ymf68 MLICNFLMYSNFSRIYWFDFNGTVNENLPLNYNVLKICRNEINKLEKLNENNLGTQKNPIKLNLSFEDKHYNTNNLVLDLNSYETFNSKNFISSIFDKTFESLNTVLMAPIYSFLEFKLKLSSTKINTNHYYVINGKLYITYNDSFKLFTTINDYFNDLNELSNTKLFFLYRSFNIYNIKLNSLVDFVFLKLILFIHLLYLKSTNYNRFDYRLKQTDWGFYINNNSNYIQNIFSGLKYIWRGLRFWIIGLLLGLSSIYYLMYVRLLPFNKIIFAWILVAMFLYWLLSGFVFFVKKYQYSKFTAAIQRFWKRTYIIFWVIEAGTFSVFFYLTLNASSEPVYMYDQIKIYKTHLFSWRWFLIKLLPSVSIILLGYYLQLTLKWNLFNKQNTIVLLITLLLLYILWLEFYQFYHILSFYGNINWAFDYDEYIWTLELDTRRTRLANNYIAICLFAKFWHFVFIFLFWVFFVLRINELGRIRYPLLVANVQNFIIIYIMSWAYMYPWLKFIFRKYLDVPYYWFYLNGRELGIRVFFTDLKLFFYGITNRLFDFNPSSIKFEKYPFYYWINSSQLTEFNQYRKFVIRDSIIYSLNNYII 594 T 0.29 DUF3408 pdbpssm F Eukaryota T 8gzu 41 EN,FD,HP,OA vb,84,VB,29 Q23FF5_TETTS Cytochrome C oxidase subunit Vb protein MKKQKRTQGKQNTKQIKQEKLSSKRKANNQKEGKKKVKQEDYKEIKQKGKRMLSKIVKASFSSKGFNLANAVNTVKSTLNAPIKHIKRNIEPTGSNYSRMTNTTEEAFDEVSHEWQALVTSNPFDLNVFNYLENTQTSNFGTVDNPLVVFTSETPFRYVGCTGQMNEDDYEGHELLFFLLREGSLQRCMGCGQVFKLVRLRNEYSPEMDYYLSNFHPYEMQEMGESDTTVLMSPYKYASHYEYTQFETPSNMVYSMVNPDEHDRLLVDPAYRMERTKALEEKYKVYTSSLREVEKQFEERYGRAGQINISKVTYSTLIDVEKAVLKMDRLFRKVAKFENRAFIDRANHSRREKRMLERAQQRWDSNYSFFTGSLTEEEQKYRDYYETELEAYPEDEGIEQQLDQQEVLLSGRYDPKLYDFQEGYTKNPEDDQTSLIEKKAFKFRYRLANETSETFQRRNNRMVERQIKRFQQPQYKHAFEQLQKNIAISSNSGNALHSEYGYLELLSNESVQLYKDYYESDAEEDFKVFENLSSKEKLVMIANFENNLLPKYDRSEVHLIPKRQWEPAFGVWENFLYDITEYASFIAPRGKEIAADYQIQSAIPLTKEELIEAGLYKETIEKKVEPKLEAKKQTKSE 637 F F Eukaryota T 8gzu 42 EK,PA 2B,2b Q951B2_TETTH NADH dehydrogenase subunit 2 MSIFSNIWINNDLNSYGLSILLLNIINYLIVFMLILSVILLTNLSKFKSLNQFKEFNSYNFILYSLIFSLLSMAGIPPLLGFTGKFLAILYSSFKSQYLLILFMTILNIFGMYFYIQNLRFVVKKNKSSILNYKNYYVNINYSITLNIILLNFFNFFGILFLSDLIIILNYISSYIYI 178 F F Eukaryota T 8gzu 44 JI,RA 2F,2f Q248F8_TETTS Transmembrane protein, putative MIKYLLHQLFIYIYVAEVLLGCIFAFAETVFFHSDQDEDYFLQIKQIQIKNQKRFRNNQKKSRSFKKKIINQQLVSKMVRLNLKSNVDQNEYPFLAKWDKDMRQNYEEYQNRIDATTYHLQRSQRGIAVFGEWMYPRYFQKDILELEVLRRKQQLGKIYPEEVSSYTQINPDIANDLNLTFNAKLLWPVRGMTVGAGFFAFAHLFNLPYSFRLGLFVLPTAVELAFTWGNKTSQFKSIEFMDYLLQYRVSKALLEKNAKHFAEKKAAYQKEINSSQSVQDLYNQLITLVSEQAPSE 296 T 16 NADH_dh_m_C1 pdbhh F Eukaryota T 8gzu 45 KI,SA 2G,2g I7MEX7_TETTS SDHTT3 MSLVSLFKNTFLKSRVIGLSFQAQRVMAQMAKTDFENPDEHFLLNDAMKYNELVFYGRLAENWSINPELFGKAELAKYNEAKQTLIDFNQYHALVQNLHEFYWELKTIYLELSRGVATSNFHNKREVTHSIIESDIKNSIHKYIQLIDDLKDYPEWQHKVREEIGYYAHMIYTSVNHDGNFPEIFKEFNKVDSLYYFK 198 T 0.0012 MiaE_2 pdb F Eukaryota T 8gzu 46 LI,TA 2H,2h I7LX66_TETTS Diphthamide synthesis protein MLTQRFYMIQFTKEEQSSEEKYLKTREREEDRKKELMHPQKVLNKKENKRKALLSKNQQNKKLIKYLNLNKRQEKLININQEEMSILPPLQYTYSNEESLELLIHSIKGNKDCNSERKAFNLCRSTVLGKHVEPEKCLDKALVFVNCFQKVRRDESAACQSAFNSTLECGKKYSESTISLGSSCQSQLDAYLNCK 195 T 0.0012 CHCH pdbpercent F Eukaryota T 8gzu 47 MI,UA 2I,2i Q22YL0_TETTS DUF4885 domain-containing protein MFSDFNMYEAKVFLKAVADAQNTFRQTAQQENQLARYESQSQSLLNGSTSGAISITGDNIQQGRNFKALKEVKLFQYSNEIFKKYLAGFDSFSGDYTAFKKFLNESVKKIEQDA 114 T 0.0045 gp37_C pdb F Eukaryota T 8gzu 48 NI,VA 2J,2j Q23S01_TETTS Transmembrane protein, putative MNHSCQKVFEGFVSALYDTSYFFRNFGPFKATIHYATYANYLAQNWAPRVSYIETSTPAYTLAKNKYAVYIVYGLIGGALIHNYMLDNKAAQKSQQYYLKHRD 103 T 11 Fzo_mitofusin pdbhh F Eukaryota T 8gzu 49 OI,WA 2K,2k Q24CW6_TETTS Transmembrane protein, putative MLDDTKYIQMAQKFPRNVSVQLNKKLFVTRTWFRNYYFVGVFGIFAYFIYNQPKIFAPFSGYPTTVAYKAQPDFLNDQVIFYSQQRQNTLKNF 93 T 9.1 DUF108 pdbhh F Eukaryota T 8gzu 50 PI,XA 2L,2l W7XBF5_TETTS Transposase MKLDQIISYYITPVRRFDKNLTAEQIYEQYQQAAQFNEIDAFTNIRFHRKFKEYIQTQEQSDYLYEKAKQISTLAQKMFEKKFPEYYTQ 89 T 0.13 Cdc6_C pdb F Eukaryota T 8gzu 51 QI,YA 2M,2m Q22HD6_TETTS Transmembrane protein, putative MARLWWTLDPSKYYLKQISSGGRNEILFTVLGVTAAYWYFGNKRCEHYWRRQIDNCQSWSRAQNINGNNLTVKQYF 76 T 0.0044 PriCT_1 pdb F Eukaryota T 8gzu 52 RI,ZA 2N,2n W7XF00_TETTS Transmembrane protein, putative MRRIFWNFKTAFVGLPMFSLAPKNILVYPIVVGVPLYTFIVLQNSVRGFAYFDEYDSDVKEN 62 T 6 PPI_Ypi1 pdbhh F Eukaryota T 8gzu 53 AB,SI 2o,2O SDHTT11 MGLPIRNIQFARYHYLAAVTVFTYFATRCCLLDYKKYYPLASVKKI 46 T 15 DUF4519 pdbhh F T 8gzu 54 BB,GD,QL,TN 30,85,6a,6A W7XCY5_TETTS Transmembrane protein, putative MIWKYLQRTNRGNIIQAGLQHRKFENLPFKQNFDNLTKAYDLRMWYISNSPHEAKNLEYVNELEALHNELNYQNSRQFLFRTVSFLLGWALFYQFYELPKTYDWQDTQEPKHQVPAYGDLEEGGDEGGDD 130 T 0.025 SpoU_methylas_C pdbpssm F Eukaryota T 8gzu 55 CB,HD,RL,UN 31,86,6b,6B Q24I72_TETTS Structural protein MSSAVEKKDLPADYGKMPAGYNFLTRGKDWREYDKDFILRTDAVWEKFQLEHFFRNYMKCFFFDHGLKKYQMFEPEDMYTVVFEGWALDDLITFPGFTPTGRTNSYQIGLSPRQRTVVPTQTFYQMQDYYMLCGLRFERWFRCDLVYHDQRHTKFDQVKNQKNYKTYPCYREYYEAQYACQDDMFDFLMELAYARRAADNFESDFASHELTTLPTFYDTPKAAERKTYTY 230 T 35 SNN_linker pdbhh F Eukaryota T 8gzu 56 DB,ID,SL,VN 32,87,6c,6C Q23DS4_TETTS Transmembrane protein, putative MGSVWFRNRYWWYRSLYDDYVAREAKLAFGIAAFIWLPHYYWGIHLNRAFEVNFSHRNYAHEWGPRRNRLAHSLEFEQFDMILENWQDLEDEYAQRGDGMLKK 103 T 7.9 NIPSNAP pdbhh F Eukaryota T 8gzu 57 EB,JD,TL,WN 33,88,6l,6L I7LVX0_TETTS Decapping nuclease MEVKYRGPSDDKLECEFLENNLLSCLREKSVQDNVAKMTCRPEFLVWFFLECPTKAAVYHDPKGLRNIFIQDKIKQKGSDDGVLSKDD 88 T 1.3 Defensin_4 pdbhh F Eukaryota T 8gzu 58 FB,KD,VL,YN 34,89,7a,7A I7MGF9_TETTS Transmembrane protein, putative MNRFFKVSSKYQYYKYLEQYDAAFLRKYQSETHWYLGRRGAWKNLVIKYAGDHISLEEEHNVKYKTHLSFVYLSYRLAWVLFAYVLIYNHFLLGDIGKTFNVGEWDHRLKPSAERDYPTRYESLYILDRTQKW 133 T 0.0061 IRK pdbpercent F Eukaryota T 8gzu 59 GB,LD,WL,ZN 35,90,7c,7C W7X287_TETTS NADH dehydrogenase [ubiquinone] 1 beta subcomplex subunit 8, mitochondrial MISKYRYLHCARKLVKQSVQAFGGGHHHHEYDWRDDPKVNKDIEEDIRDRGWHPETYDFPYTKKHDDWVFDVTMPSQNYQTDLTVNIHPENKKMHVMKQVMRQSYWDAEHDMAHEYDYESEDLDFQCESFKSQHFRKKGPISQYLILGLLPILYFGTEFFYNHYPDEDYWRVAHPPPLDYPDTDDTDDTETFKDYKSFTGRRMVDTGIVDPLWYDIREGKKVYYDWAGVNQPMEDI 236 T 0.14 Tctex-1 pdb F Eukaryota T 8gzu 68 PB,PL,SN,UD 44,5t,5T,99 Q22N23_TETTS Cullin domain-containing protein MEDNYAADVQRQFNRTAFDSLYKICYNSLVQKNGSTIDFQKQIDCHQRLIQVFAKIAPIVVKVEQDAASSGGAAAGGEDEE 81 T 11 MetOD2 pdbhh F Eukaryota T 8gzu 69 FK,VB 4L,4l Q950Z5_TETTH Ymf58 MLTWISFWSLIFWLILIILVLKPKNFISILFMSELTWLALYCLSLLFGAIYCDITLLSISFFILGVAGLEFSFGILIAILYKNLNESLNTDLNNNNNNQNIFDKNFKTPLEKINWQ 116 T 0.0017 Oxidored_q2 pdbhh F Eukaryota T 8gzu 70 GC,GK 5b,5B Q951C2_TETTH Ymf57 MLKNKLIKFKFFRFVQSGFYVDFIFKKFSEMFIRNIFIYSSIFFGEKFMIEYLTKKTIDSFIFNNNRFNFINLVESKYFLQILTLILYLFFITIFILFYI 100 T 29 HEPN_AbiV pdbhh F Eukaryota T 8gzu 71 HK,VD A1,a1 I7MI60_TETTS Transmembrane protein, putative MVNTAYPTPLKTILKTTPAFVVYFVFGLGFSTVIYDVVYHPKDRIERFYFRSSKFERLSRKRDEKLRHYFKPAIEWQPWYNTSTNNNTRPLLRY 94 T 0.15 KCT2 pdbhh F Eukaryota T 8gzu 73 JK,XD A3,a3 I7M9B3_TETTS Transmembrane protein, putative MSNNNQGDFFVDKYNFSRRVVDHRQPYDLNFSINNPVGSRVWFKAWKQKAIGNFLNLVGVHYAFYGAGFCLLFVLADAWGREKYAQPYKSQILHGRQPFGHTFVQNYRNQATDLGRWNHNFACYEKQPGCGRDFD 135 T 8.3 DUF983 pdbhh F Eukaryota T 8gzu 75 LK,ZD A6,a6 I7M2Y3_TETTS NADH dehydrogenase, putative MNHYWGSSNTIPASSTQNNNYFSGGGNNVTIRGNEIMERLPSQTPSQNMVQASMKTLRFYRKFCRLIPFILRIHNIGTKFTAQQAMINFGNYIRERNHYRDPGLIDHRIQLGYELLYEAEMHFSQHTILMQYLSPYNTPLSDRGYSYLEKVKYGNKSKFLQGFYKGNKPTEF 172 T 0.00098 Complex1_LYR_1 pdbhh F Eukaryota T 8gzu 76 AE,MK a7,A7 I7MIJ7_TETTS 37S ribosomal protein S25, mitochondrial MRKALERFNEIIFNPAIRWYQLPKPTVRRTRYPAPGSEPINREVHQIDYKTAFRDSPHNIRYHHEIHTSDQTYHSSYDPVGETTTERLVRYGYLNKDQVNNAEAVAAAAKEFQEKEKRSPSNNIIIDEISNSDKPITKENRESVAHHVRQQFEFFREVNAEEVWSVSIEEKYNPELYIYKTYDMAADDPVWRQVKLDLEWTFENIAERRESLGYMPTFKGDPNFWQALDNSFSPENIAQVQSSIGDKVTNIDTKALALNHQTEEYHKTSKLVYPIRTNLVVE 282 T 1.1 Synaptobrevin pdb F Eukaryota T 8gzu 82 GE,SK am,AM I7M2U4_TETTS NDUA13 MQFFRPDFIATQVLRRADMAHSPFHKAIHDLEDKRSKLFPDRRRIPGRKAKLLLAASLLLQMWGVGKIIEIKKFMKRRDIELKGLQRKAAPFMQSMNDVRHLALRERNDMLYNELLSVHGEEYAQKMQKRFHQTDIWAPFRHRYAYMYNSSNKNVKDYKQVTLSRYINGFDKFNV 175 T 0.00014 GRIM-19 pdbpercent F Eukaryota T 8gzu 83 HE,TK an,AN Q24F24_TETTS Transmembrane protein, putative MELNSSAKEDSHYVGVLGYPSQHDPHTLHPKKHDSTFTKVYACRDMLWDHHWEVRNTLYAGFKGALLGVAYASGFGLISKTVPSIVLKKMFRFVRNNNFGHIRIMQDLLTPYALTGFGLGSVYYLYQHNVWENRSNKWLAEVLSNALFFQVATAVCVNPGFHIYGMVGGILFGTLKYAFYNSSFFQEKESIGSYTTFGDLSEEERKKQEYKDYIQFLGNYHKVRNGQLVDL 231 T 9.3 ENOD93 pdbhh F Eukaryota T 8gzu 84 IE,UK b2,B2 I7MG29_TETTS NDUB2 MSLRKGTSIFSRQFKKAFNDAKYQNLTAAQGETYSHLGWISNVDLRLGRAIFTFGVVGIAFCIYLEPSYFHETFGHMSQPPKYDLIDSNINGVEKKLNKQILHREHNEHKLDGFVSMFKGSDVAKN 126 T 6 Biopterin_H pdbhh F Eukaryota T 8gzu 85 JE,VK b3,B3 A4VD20_TETTS Transmembrane protein, putative MNSPQKVAQGAGRKLFKHYINENIKSNNEQKLFFYRVNRWRWNTKDNTTAPKFLRLKYPLLVTGVCLFAYDWTYGFTQVDAHH 83 T 0.086 RGS pdbpssm F Eukaryota T 8gzu 86 KE,WK b4,B4 NDUB4 MALRRILANQAKLNQKLAQTNRNYHYDFCGRAYMGNPAVQSPPKEFFNYHYVPDNYPDALSGFRIAYRDPFEVQHVFAYENWEYQYDGQWWSMGSLACNVLFFCTPLFLYLILQVEELNSEKRGGTSNKFYHNNAGFFHFQIYDNKQ 147 T 1.9 DUF3930 pdbhh F T 8gzu 87 LE,XK b6,B6 Q231G0_TETTS NDUB6 MLLIEMAFNAMKMKIFSLRKIKVKSKEQYLYNYQQKLLILGQGKEKNNKQYKKDIEMGGFQKYPIPRYLHVGQWIVNKNWKWNTFHMFFPTAILCFMVWRNSMISTAKPPNYGEYVDPQSPVAPKAIKY 129 T 0.0021 TMEM117 pdb F Eukaryota T 8gzu 89 NE,ZK b8,B8 I7M855_TETTS NDUB8 MALRRVLKNQFNLIHKGQAQAVRGGHGWDRPDVPLSFNPLYVHKRELSIFDTNMWMYDQVYPEYVISYNEIHLVDQWKGLKESFSQSAYWWAMMAMVFGFYFINTTPRQLGIDTNDLKGFLGEYYGQYKKRSGIRSNFLGLDVTGENSIIQPNYDRKNGIRDVIDSLNADAGKRKLINLEAKNFIERVEKECEQRILKKGGATQSHH 207 T 0.23 Spore_YhaL pdbpercent F Eukaryota T 8gzu 90 AL,OE B9,b9 Q233X7_TETTS NADH dehydrogenase [ubiquinone] 1 alpha subcomplex subunit 10, mitochondrial MSKAYYFVKNFSWAEVSNLLCYGTKYPTVLNHQQKVTRLYRATLRRVYAHQVEGYKTDFKQYNENITDIGKDFNKMLALKPESLELQAYFKKYEDLQEELFDPAMIIDESRPYAASSGRYYIFDDYLLKFDPFGFYSPKLLSENRPEEAMPFYEDYPQNDSHWNLWEQFPEDFEDSNAEREAILKSNKH 189 T 0.005 Complex1_LYR pdbpssm F Eukaryota T 8gzu 91 BL,PE BL,bl Q23KG0_TETTS NDUB10 MAFGGFRQTDNSLIIDDRRKIILNTRSLNDFQQKIYLRNFFTNYRPDLSSYDYFAFKEKLRIGELFLNEYRKRINNEVRRAAILTPTSSLREKMNHKIADQILDLSSPHVRGAHFQAVRSWTDASKIVNYVEEKQTKINKYGLQFPLLGNMTEEQCASKEDEVYQRLLKEMQKPPKKASEPVEESSDE 188 T 0.12 CMV_1a pdb F Eukaryota T 8gzu 92 CL,QE BM,bm Q22Z32_TETTS Transmembrane protein, putative MNPRNIFNLAKKVQNFNSITQKAFKRFGGAAAHHDDHHDDHHGHGGHGYEVHLVKDKNLIGNKSFKDDLVAVYGFTDVNDHHHHDETDPYHHLRGVPTLSFERMYFADAYYHDDTHEGLMNEPHGYLTMDDPMDLRPNYEKSALELLFLVSGGAILALMLGYQGLNLANPAESLFSLNTAAEEIEDKIRQIRIDNDKLLQRKAQLEEELASLNN 214 T 0.022 MctB pdbhh F Eukaryota T 8gzu 93 DL,RE C4,c4 Q22W63_TETTS Complex I-MNLL MSSMLIWGACFGLFTRAAACKASMIPLTTSPWKYPKYMIVSAVTFYYFDWYRRMALEQLCYNEEKLERYQIRAKLQSLKIGEELSDAYRESFFEHAVQKNNI 102 T 0.0018 NDUF_C2 pdbhh F Eukaryota T 8gzu 104 CF,XI n6,N6 Q950Y2_TETTH Ymf62 MFLITITSYFSNIIEFNSYIINLIDFITPLFFIENFVIQFFILYLFYLLIVNNNLYYILLYIFLEIVFFGLFLCLYQLELFTGFLWVAEFAIVFIAVVLLFYLNIDGLHLKYNHNINNVLYYTPSLVLFLIFFNIDYFSELELFLPLELSFIDIYDDYYEGFNNSIMNDFTPLTLSYYSINSAEFIIIGLLLLLGSVACVNLYKSNKNYTIVKQSNLLTMFDFFKDFINFSFIRKQDLNNQTNFNPSLRSIKKKY 255 T 8.3 DUF2070 pdbpssm F Eukaryota T 8gzu 105 DF,YI p1,P1 Q24C39_TETTS Transmembrane protein, putative MIARRLFKRSLYYIPRAGFGGGDIRHKFSNEITDDDYDYQRAMHVKPPKEESLFQLTNILSSVPVFKTRFFLDFIARNLDTNSAVSTSDFVAPPRVHENSFFVYHSRELGNVIRKYRSLESIVLPGALLTFTYPLFAAFVAIPSYYFMFNAKIYEMSRRFVVRMDVLPHLEMISVQRIGAFGILYTKLHRIQDLEYVPFDQVKEQENYLWAIGGHGVDNQLIFKDRSTGEFFYFERQGVWDAKGLNHPLLN 251 T 0.7 TMEM70 pdbhh F Eukaryota T 8gzu 106 EF,ZI p2,P2 Q23KE0_TETTS NDUPH2 MFNILKGAQLSFRSITNKSVNNYYNIMRQVSLDSNPIVLYQSSTFTGNGLQEFYENADALTKYLKLVPFFLEKNLYDHPKQFVIKMEFHPQNKVLSLDCLTHQGVLKKTVNLENLIPVPYEDYVQFCRRKLFNAPLFLDTEMIYFNTFQNEFYVFDKNAKWNEEGINHPELDISKLYNEKAWFDSLRII 189 T 0.26 TMEM70 pdbhh F Eukaryota T 8gzu 113 CI,LF,QH,XF Qg,qG,QG,qg Q23F81_TETTS Sulphotransf domain-containing protein MVRLEKILWEQLVNVKAFSRQRVIGAPSKWYNENRTEWFKVAQHNAFNTGFSGVILRALEPLLAKFIYRWRLDIAHQRGLTLEDSLLFMDRELRRCYFFETVARQNLHPYTVLFMKKRRARYYKVERGLRGFYVPDWVRKEAEERQLSETVDNIFNWENFVYREYMSDMTPIGRWTSLSKITPLDMFQYYGLFRNEAWDRFFYNEAFYESYSEKEKQEANGNPFGKFNLQTADGRAQFEKEVNTFIERYPFAVTKPGQKFDFTRFYALEDLANKRDTSKYDPALLESVKNELKQSAALPADNGANKTKKSKPILPDWLQPKFGKAFQA 328 T 0.69 DUF6322 pdb F Eukaryota T 8gzu 114 DI,MF,RH,YF Qh,qH,QH,qh I7M484_TETTS Transmembrane protein, putative MNVTGAGLTHVKDFHSDEMRVFRGGLRHIADKQGNLIYGSVNSSVRYYHDKMSYERGFIQHSRSPSNQFINFHFMLGGFRTYVLERFFKQVWYRRNIRTFWFPVLISYTSGCITMRMYDNNCYDYFYFSD 130 T 1.6 DUF5320 pdbpssm F Eukaryota T 8gzu 115 EI,NF,SH,ZF Qi,qI,QI,qi I7MM45_TETTS Transmembrane protein, putative MVYGKLIFNNIKEYTPSWIKTIPYSQVTKPILRKQPQIVGKINADPKVKKFWVFLRENVQYYPFLWQFFILGTSFVWFHVCYDPWLAIYQANNAHRSLETALTKEKAHKKKLAEQEESE 119 T 2 Selenoprotein_S pdbhh F Eukaryota T 8gzu 116 AG,FI,OF,TH qj,Qj,qJ,QJ I7MFL6_TETTS Transmembrane protein, putative MYLPTFYKLFHETNAFRLKRYVGYGPLLLTWSIWTLYPALYNMIYSDFIPPERGVPKRIVDA 62 T 1.5 DUF5621 pdbhh F Eukaryota T 8gzu 117 BG,GI,PF,UH ql,Ql,qL,QL UQCRTT2 MAPVFLKALRYVIYSYPLYVCYLIKQAQINAQGSEKEEEHH 41 T 2.8 DUF5392 pdbhh F T 8gzu 122 EJ,HG S5,s5 W7X4R4_TETTS GRAM domain protein MSYSGYSLNGGVHPCLPFYERMLQCAKSEALPIKMCTAQTEDYLECHHRKKQYALNYAIKKELNNIRIVALPRYDEENDTFVPFSQATADHIFQ 94 F F Eukaryota T 8gzu 128 NG,VI sc,SC Q23RH8_TETTS Cytochrome b-c1 complex subunit 8 MRTKLYNAAYFLLNNNESFGHSFGIRLKIVGLNTWIVGYAVSRYYFSSLRVKAAQDERFE 60 T 4.8 eIF3_p135 pdbhh F Eukaryota T 8gzu 129 OG,WI sd,SD SDHD MFKELIHIFRTYFITFRYLKKSNINFLKNLSYTLIAYYLIINFM 44 T 17 DUF1869 pdbhh F T 8gzu 130 IJ,PG T1,t1 Q22E24_TETTS Lipid-A-disaccharide synthase MLTHISRRYFSFTGRKTIFVAAGSPSHDLQAANFMRDLKKKSNNNYDFVGIGGPLMQAEGLNQSYADINKFIDKPFFPLKNFIRFHVARCYHPYMAPLHFFNKQVLNQVDKSSLLKDQVELSIPSAIITFGNEFFMKKLYVRLCDQYELHNKIRPPTFFYDRSHINQRFEFQDYLDHFFYTIPMKQINFQSFTYPSTCVGHEGVGRAIQYLFQNSKQYANVKSLVTANGLKIASNPKQHREIIEKLVEEQRGIQRARLGINESKNVFLLAPGNTKAEINFAVNLLSRSLEEFFKKPQLTNVSRDHFTIIITADNAQNAEFVNQAVSNTKYLKTLQTIVTTGEKEKFGAMCAADVGIPLNGELVSECAALQLPSVIISNMNLFYAYITQLYNNFYSDINFAIQGEAYHELVSTAANPYKLSDEIFDLYSDPKLRYHFAERYQNVVHEMIPQANSQDNIVTTDVATLHGVEVQERAFTYETIAAKVLKAARAYESLDKNIPNHQIDQHRKEKLIKAAF 516 T 8.5E-42 LpxB pdbpssm F Eukaryota T 8gzu 132 KJ,RG T3,t3 I7LUQ4_TETTS RNase III domain-containing protein MSGLLRNFEKLVCQSQLSKAGHKLLLRSPNSTLHPTAFYYKRNSSQRLANEMDVFQLGLAAAALTRQANNYAQLLDQVDKEAVREEVQERITQNHSDLNVYFGEILSLFKIGKKECPVQTVADISYVLAFGPIQVPNAAAIITENLLPVLKEKLDYASIHNLQDILSAFVKLNYVSDKELLKRLITALSQKDFPNQLQPVTNHAWNIDQYEYSDCNSWNIVSCGDNTFEKYIHEGGCENSLAKAKFAVHELLDHISFNFVNPFLFRENRINHRFAKRNADLDHEVLMQTLSKLQEIVPETSEAIATIKARL 311 T 0.034 Baculo_F pdb F Eukaryota T 8gzu 133 LJ,SG T4,t4 I7MIE0_TETTS Transmembrane protein MGGDHHHEDSHHKSNVDQHELKAEMIKELSHYYDHHDLSLFGKVQHFVEHLLEEKHHAKINTSNFDQKKLENFSESKQISRTVFALKKIKTFNHDFFTSEEEMILEPLPLGILTYGLKYAFAGVDAALLTYFWRNWNFNVRTIGLLGGLVGIQMATLHIPNLVNEVVIQTPRRRALAKKYISAYGPQFFHDIVNPKYDIEHLRHLQNKLNPY 212 T 0.17 PfUIS3 pdb F Eukaryota T 8gzu 134 MJ,TG T5,t5 I7LT77_TETTS Transmembrane protein, putative MFLYKKILSIYKQSFSFFLSFNFSFFLYALLAIFLLINFCQHIHKFLYYCKEKIQKEMQNAYPEITDQHREFLKKQGLKVYEPKPLPDQINPFSKTYWITNAFIIGVSFLARRHALKVGAPRIFWSGCIVGVPLAAIISRGKSDQLDELVGARKTLEQKLEYAPITRRAWERALATNQEYQNEIKTQIQDLQAEIAAKKVAAKLE 205 T 0.02 FlxA pdb F Eukaryota T 8gzu 136 OJ,VG T7,t7 Q22HE4_TETTS Transmembrane protein, putative MTNFGSPFRNTDSGIVIRDPENEKRLKLAFQNFWKSKQEDKEFQAQIKTAVSKDTVNFMFYASPLFGALLGKTYIDMFCNPRYFYFRAFTLSMFALAGYCVGNGFRNRYEHSLYTRNYHLFPKDLQDALVNGDARYCISWWKQ 143 T 0.053 AHH pdbpssm F Eukaryota T 8gzu 137 PJ,WG T8,t8 Q22SC4_TETTS PH domain-containing protein MTHQFENVLLSNRKNLTPQESVQKVINYALLQDAKQRSRTLRHIKASWVIPALLFTYPAWYLAKGAVNGVWSNIHPTDKVTLSFANIGRPFRLIYRPEIFLRDQQAKFIQLEKEHIEKSKKGEFVETTSPLVLWN 135 T 4 DUF1852 pdbhh F Eukaryota T 8gzu 138 QJ,XG T9,t9 Q23B10_TETTS NADH dehydrogenase [ubiquinone] 1 alpha subcomplex subunit 8 MFLNPVKDSEFDDEVKGFVPSEGEVRFVANKNKECGYYLQGIEQCRRKMVQLAGDSSSQFHSLGFLPCKRLVDAHYRCMTDDKFGSTIEEVPEIGLDSAQKFFDCTFQQLKPMQSCRRFFDQVVRDVYRANGSQLI 136 T 0.23 COX17 pdb F Eukaryota T 8gzu 139 RJ,YG TA,ta I7MAF0_TETTS NADH dehydrogenase [ubiquinone] 1 beta subcomplex subunit 4 MLARTLKNYMRVQQNLRFSRANIEKSPAPPTIGQVELEPFKFNHERDQLIYGYTMEELYGKKFGLKHSATVLREIKKDTIMMILFIIGGFTYCYHMRETRFQLDDDFNEYVNTNKQTFRPIPDHVKL 127 T 0.1 Viral_Beta_CD pdbpssm F Eukaryota T 8gzu 140 SJ,ZG TB,tb Q22T55_TETTS Transmembrane protein, putative MFWRNVVRGLNCQQALRRQNFAKNITTTDIPKDSHHFAAKRSGFTQTEQAPFAYNDVYQYPKDYKPWNYNYKGNGVLLALFLGSAFSLVAYERSYASKTGRYQRKVQQNYYQI 113 T 0.055 Ncstrn_small pdbpercent F Eukaryota T 8gzu 141 AH,TJ tc,TC Q22E95_TETTS ATP synthase subunit e, mitochondrial MVYQGFKVLRRNPTFYNPRSAGMVALSYFAYSYYVNKYYKPQNSNFEEYNSSHPHNHDEKVRQYHEKTNQAIRDAVLEKRAEHDQRLREEAKL 93 T 0.015 Hrs_helical pdbhh F Eukaryota T 8gzu 142 BH,UJ td,TD Q22DC2_TETTS Transmembrane protein, putative MNLPWFVRWGTDVALFFIPAYTFANYPTTFFVFAAEKRRQRRRKDFSDVKLRDDAAFSVDQVKQLQTKLHLKQ 73 T 1.6 HIND pdbhh F Eukaryota T 8gzu 143 CH,VJ te,TE I7MIK1_TETTS Transmembrane protein, putative MDKYIQQAKCAYNFSLKAVRFVGPLNIVFAGVAFLMFYENNYKKLYLNPRYSYTMPYLQSAKITKNLYEKL 71 T 0.21 YpmT pdbhh F Eukaryota T 8gzu 144 DH,WJ tf,TF NDUTT15 MNNLKGSNCLVQNVAFNFSQRGRDYTPSNKKYLQPWELERKEYVELSLAIQSAYSCKMLSEILKDNLYMLTDYQLSFAMFHLWNHEIPIDNYFYNVISPILKEYITRFDRECNKSLAEIATFLGRMNVQDDALWKVIETKLVQERLYRYIPLNDLIDLAHGMATANRGSQEFYNIVENVIIKHRLRLIPDKIAVAKDCFTARKIGSPLLYQVLENPQAEAHELAGLKEHEQLKISG 236 T 0.83 FAST_1 pdbhh F T 8gzu 145 EH,XJ tg,TG NDUTT16 MASQLQREQKLVQSLQQESLQPHLFKIIVDSQSDLVCEADRREYIKHYTRANEKSSTSQLLQVGALLGYIYAVGRYVSNPSTRKFSYGLAALLGSFSLLNPSKNLHHNHSLREIYSKYNISTNPQALEILKSRIY 135 T 22 ApoO pdbhh F T 8gzu 146 FH,YJ th,TH NDUTT17 MNISYTGLKLEDYSDEVIRKYKFPNSNELERFLNREQTLTVQQHKSAIKLAQQDFFAVAGLLSVGSLSYIFYNSVGGKVIRDRIRASMPFPKRVLVQVLPFVALGTALIISRRGIEGHNHGYKQ 124 T 0.12 EMC3_TMCO1 pdbpercent F T 8h0g 1 A A VGF_HUMAN VGF-DERIVED PEPTIDE SAQEEAEAEERRLQEQEELENYIEHVLLRRP 31 T 9.3 TSKS pdbhh F Eukaryota T 8h0i 2 C,E,G,I C,E,G,I Viral infectivity factor MGSSHHHHHHSQDPMENRWQVMIVWQVDRMRINTWKRLVKHHMYISRKAKDWFYRHHYESTNPKISSEVHIPLGDAKLVITTYWGLHTGERDWHLGQGVSIEWRKKRYSTQVDPDLADQLIHLHYFDEASEGSQIKPPLPSVRKLTEDRWNK 152 T 0.052 Vif pdbpercent F T 8h0u 1 A A VGF_HUMAN VGF-DERIVED PEPTIDE SAQEEAEAEERRLQEQEELENYIEHVLLRRP 31 T 9.3 TSKS pdbhh F Eukaryota T 8h2i 8 IC,JC,KC bI,bJ,bK Q98573_PBCV1 REDUCTIVE DEHALOGENASE SUBUNIT A MAMKTQRKENVLFQNVKPREIPLVDNPFSTYPYKHVITETQPTQAKNQAIWGLVQMGLSGEAAAMYGDVVVQKTTRACRKSEGGFKDVNTELWGTSPYLGRGDGEVYNMPASNQLLRGFESSLRGSRVRTQIDDKSFIPYTWQMIDVPLAAAKTSFIAGLDTRQQLAYGNP 171 T 3.2 B_solenoid_ydck pdbhh T Viruses T 8h2i 9 LC,MC,NC,OC bL,bM,bN,bO O41054_PBCV1 DUF2268 DOMAIN-CONTAINING PROTEIN MFSAFRDTASIGFSDTHQDEKTLRFLKKQISQFIKHLKEYYPNNELTKKLVMKYSDVQLLPYTKGATKDTYTSGLFDHTTGVIKIAPRDGLGNVRDEQSLNKSICHELAHGTRVKYPGESSHSDEWKDAWKTFLKIAADELGWKIEVPCSSVSFYGLTKDDCENCVWDQDPETCPKTAKLA 181 T 0.002 WLM pdbhh T Viruses T 8h2i 10 PC bP Q98576_PBCV1 NADH-UBIQUINONE OXIDOREDUCTASE SUBUNIT MDSRLSAAYAIRAARISMIPGGVDGLVINYAEGGEPAWVQYPLKKQKPLPNNLCYTPTLEDIARKREAVIAKYTKQPLETGTTFTHVLNASHLNEQYTRVKKSALPDKEFPIIETEKYPEPPILWETTIGAPSRLFDRSDGVKYVR 146 T 8.9 XRN1_DBM pdbhh T Viruses T 8h2i 11 QC bQ Q84523_PBCV1 PSBP DOMAIN-CONTAINING PROTEIN MILVGIAVLILLAVFAILYYKQKEKFVVVGKFVEPIPSNPGQDFTLLPMDQTYTFADPVPDTATAFDVVLSRFTDKKAPADLLKGATFPEAAPYTDSEVENISKLALSRVKGPDAPVLSFISVEYAAKGVDNKKNTHYDIAFMVYDQVKNFSLKLVLVAVLDAKNKLWIKKFSSFNSFTPKDKGPKGVENIDETPLAEFIPDFVQFSRLYKDNANV 216 T 3.7 Mid2 pdbhh T Viruses T 8h2i 14 AD,BD,CD,DD,ED,TC,UC,VC,WC,XC,YC,ZC ca,cb,cc,cd,ce,bT,bU,bV,bW,bX,bY,bZ Q84666_PBCV1 SECRETED PROTEIN MDMHMIVKVVAILAVLFLVYKLWESMNKPNASPLKIQNPYEKYMNSAEGGEYDAEDDDIYYPETDAEDDDIYTGETDDMYDGEDDDIYVQEGDDIEDAEDEPYDDSADMEQDVPKVQQPMMPLLTPSSQLLPKPSPEAADFAQFAPKNLQAQNFLTATQWIGVNTQGSSLKNANYDLRADPIIPKADVGPWMMSSVDPNIYQKPLFG 207 T 0.14 FeoB_associated pdbhh T Viruses T 8h2i 15 FD cf Q84459_PBCV1 P12 MGNGPPMERAVSSDDILTYYNTFIFFIYFNFTNENIYIIYTIYMKVQNTIVYIVLLLIVVVIIWNFTRKEGWSDYNAPNDFMKIYYSNIVEDKKLAEKYPFFGTGPFTGLRCRKPNNVGCNTTWVSGQLVELTPKLKEQIECKFGIQYVKT 151 T 0.042 ID pdbpercent T Viruses T 8h2i 16 GD cg Q98473_PBCV1 NVEALA FAMILY PROTEIN MHKITPFLIAAVVAVIVLAVWLFKKDNKKETWFSRDLNYGKANSKIWNATVAKGLKGIANENAEIRKMYPYLGYGDFTGAICKGPNNQGCTYYANYTR 98 T 0.0083 DUF4381 pdbpssm T Viruses T 8h2i 17 HD ch Q98534_PBCV1 ENTRY/FUSION COMPLEX COMPONENT MWLFFFALAVIYMIYKRDVFKKIAVNLKMNGVSIPFVDKYSKQYPTYTKNALFHVTRFNNAYQKTFEYKNISIDTINNLFSIRDDVLYNISEIKLRLPNDLTQEKEINYMYEKTDQRLMEYITDVKSRFHINIYPGTMSSAFEARNYRASNDIVF 155 T 0.19 CortBP2 unppercent T Viruses T 8h2i 18 ID ci Q98530_PBCV1 GLYCINE-RICH PROTEIN MQGGLFGTIKLMIMLFSYFAAYQLGKMQERPQSQWPKAKAGQNKYMVGDWAAWKPIYMGVLGVAVLLTLLGPGGVGGGMGGMFGGGGGYGGYY 93 T 0.00098 DUF2062 pdb T Viruses T 8h2i 19 JD,KD,LD,MD cj,ck,cl,cm Q84629_PBCV1 P17 MGAFTSFVLMLLFTGIILIATNELTYNRPREIQYRYLPRDLDSFIRTQEMPSAIFSSMWDVDTRRGGDGGPNPPGIRQSN 80 T 0.13 Ac110_PIF pdbhh T Viruses T 8h2i 20 ND,OD cn,co Q84533_PBCV1 TRANSMEMBRANE PROTEIN MTTITYDTDLLPPPELKVPSLDQALAPESVKNDDPFLDLSYFPVPKGFDNVGSLELNNLSTAEDVATLQNQLNKLAEEKHKRSTWKGLTFRIAVQDMWEALTGIPTDIYQNSGRVSLKELLTRDDRLRGLGLIFFLVAVVSIFFLAAG 148 T 0.19 BMP2K_C pdbpercent T Viruses T 8h2i 21 AE,BE,CE,DE,EE,FE,GE,HE,IE,JE,KE,LE,ME,NE,PD,QD,RD,SD,TD,UD,VD,WD,XD,YD,ZD cA,cB,cC,cD,cE,cF,cG,cH,cI,cJ,cK,cL,cM,cN,cp,cq,cr,cs,ct,cu,cv,cw,cx,cy,cz Q84602_PBCV1 A2M DOMAIN-CONTAINING PROTEIN MFLYYKMNEVLNVNNGSSINVIKILKTPGPDIIRPGKTYKKKDVFTPKFFKNGNVMYTCNTFSLNVPVNNSIATIDFAEHVNGAVFKIEYNRVNFIAPSLYPISGLGTAVVFDLQKGEKASQRKITEFGNKDIRIADEISDIAADDHSVLITTKLMSESTPGELSRDVVLNGEIATGRINMNTGFVSDIIHTKKISIIDDGIVDYYVKINVPAKYSHGIVEVVSGTFLNDIMIHLVRNKNKWALMDSKMYIVNNTNGFVIAKNPDTAMGVSLLEWPRGAVVFPPYYNFKKFKNVNKWSISQQIASCVNTSVKIPGGEYSWKIRLFFGPIWQVQESINSVKTVPHGNDYMYLYKEEIVKYKYQRNTKRNFHEYDYDYKF 378 T 0.11 Fer4_16 pdbpssm T Viruses T 8h2i 23 BF,HF,PE,TF db,dh,cP,dt Q89349_PBCV1 P21 MSWFDPNWNPFFNRSIIVSGNLVVAGGVIEGNGGGLTNSAFSNPDTIRTDLTGNVTSNGTVRSSLIQTNSIVASGDAYANVWNGNAAFITGISANTTLVGNVRISISGNLTGNFAVSTTTVSNTMTANVLIGTTLTVNNLSATRANVQVLTGNVTGNVRLIANVVNVAAYTVGNLTTSSNVSTTNLNVTSNINGNVRSSTANVQGRLSATTPTTGSIISSSFIANSFTSSGNVQIGDFFGNVRASVGTFDMIASNVFSNVANIIEIRSDIISNIANVGNGTFVFVNASNITTTNITGNITSNVASFSNVDSIFGNVGNLRANTLTVTYANITGTLASGNVAVGNLISADNTSEFANITSFTTNGLVVNGNIIAAIGNGNVFLANVITANSLTLQSSSSTAASNVTTTRWIQSTTSTSNVMTIGNITSGNVISSNAAYFANVISNTITVMSAMTVSGNVSFGNVSFGNVSYGNITANVTTSTGNVSVDNLTSNVVNVVFCNASSVTANTLTSIALTGNTYVYSANTGVLTANVGNVFGTLTVGVANVTSMVSNALFANTAVVSNLSVSNLTVTISSDISNISITGNTSGNVFAANIATIGTLNTANVVANLVASNVMNAWVTSNIRTLITGNANVSTTTSNVITTGGFTITGNITSGLLTSNVIAGNITMYNTSNTTLFTSNTSNIANFFAGNMTAVNTIVSNLEIQGNSVVITQTTPFKVTSVLLANVLFANTIISNTFTTTANVVGNISTDIANIGIANVNFLNTTDFAVNTANIVNYTPTSNINVTGNLTLGNANIVNFYATSANIVGNITANNAVITFLTTPFYKGSGTVSAGFTSIISANALSNVIVFGNLSANIVVSNTLLSNSINVATVFSNVVNIGQVTSTGTTLVNSINFNISNVDVSGNVLVTRDVYTTNISVTTANIANVTFFSNVAIGNLISTRNLTAANLISSSDLFNSGPYTSSQNVTIDTLNFTAGNLGTVAARTTITTPTLFATDVNFTQDALVAGELITQNFYGNITSANIGSRITIGNANVTQTNITGNVVIPRTSNAAGFNSRLLISNNTTVTSNIVANSLISTGNIITNLLQTTGQVSFASLAVEYIDVANLAVRNVVTIGGNLTVSNVANLFSITANTVNMSTVTTNTLSANTITGTSNVLVAGNIIGNCFGNVLVNRGVVTGNVFADTITVPLNAVGVLTGNALIVPNTALHSVAITGSSAFTSNIRTLNTPNAFIWNTSVPSSGLLDARRLRFSQYITSNRIVEMRYFYITNALPQKYDSGGLNARYDLSFGTTLPTSQGQTWHHFLNTLYIGAQYNTNFSINCLIINATTTGLEVIIYNPFGIAASLSSSTPELYFYITSIATSTQ 1369 T 47 GMP_PDE_delta pdbhh T Viruses T 8h2j 1 A,B,C,D,E,F A,B,C,D,E,F Q6PVL0_9CAUD p26 MDNQHKKIKGYRDLSQEEIDMMNRVKELGSQFEKLIQDVSDHLRGQYNASLHNRDEITRIANAEPGRWLAIGKTDIQTGMMAIIRAIAQPDSF 93 T 1.6 S36_mt pdbhh T Viruses T 8h2x 1 A,B,C,D,E,F A,B,C,D,E,F Q6PVL0_9CAUD p26 MDNQHKKIKGYRDLSQEEIDMMNRVKELGSQFEKLIQDVSDHLRGQYNASLHNRDEITRIANAEPGRWLAIGKTDIQTGMMAIIRAIAQPDSF 93 T 1.6 S36_mt pdbhh T Viruses T 8h39 1 A,B,C,D,E,F A,B,C,D,E,F Q6PVL0_9CAUD p26 MDNQHKKIKGYRDLSQEEIDMMNRVKELGSQFEKLIQDVSDHLRGQYNASLHNRDEITRIANAEPGRWLAIGKTDIQTGMMAIIRAIAQPDSF 93 T 1.6 S36_mt pdbhh T Viruses T 8h4i 1 A A engineered mini Galpha-S subunit MGCTLSAEDKAAVERSKMIEKQLQKDKQVYRATHRLLLLGADNSGKSTIVKQMRIYHVNGYSEEECKQYKAVVYSNTIQSIIAIIRAMGRLKIDFGDSARADDARQLFVLAGAAEEGFMTAELAGVIKRLWKDSGVQACFNRSREYQLNDSAAYYLNDLDRIAQPNYIPTQQDVLRTRVKTSGIFETKFQVDKVNFHMFDVGAQRDERRKWIQCFNDVTAIIFVVDSSDYNRLQEALNDFKSIWNNRWLRTISVILFLNKQDLLAEKVLAGKSKIEDYFPEFARYTTPEDATPEPGEDPRVTRAKYFIRDEFLRISTASGDGRHYCYPHFTCSVDTENARRIFNDCRDIIQRMHLRQYELL 361 F F T 8h7a 1 A,B,E,F A,B,E,F KAT6A_HUMAN Histone acetyltransferase KAT6A SMVKLANPLYTEWILEAIKKVKKQKQRPSEERICNAVSSSHGLDRKTVLEQLELSVKDGTILKVSNKGLNSYKDPDNPGRIALPKP 86 T 0.0018 Linker_histone pdb F Eukaryota T 8h7g 4 D D SP20H_HUMAN P38-INTERACTING PROTEIN,P38IP MQQALELALDRAEYVIESARQRPPKRKYLSSGRKSVFQKLYDLYIEECEKEPEVKKLRRNVNLLEKLVMQETLSCLVVNLYPGNEGYSLMLRGKNGSDSETIRLPYEEGELLEYLDAEELPPILVDLLEKSQVNIFHCGCVIAEIRDYRQSSNMKSPGYQSRHILLRPTMQTLICDVHSITSDNHKWTQEDKLLLESQLILATAEPLCLDPSIAVTCTANRLLYNKQKMNTRPMKRCFKRYSRSSLNRQQDLSHCPPPPQLRLLDFLQKRKERKAGQHYDLKISKAGNCVDMWKRSPCNLAIPSEVDVEKYAKVEKSIKSDDSQPTVWPAHDVKDDYVFECEAGTQYQKTKLTILQSLGDPLYYGKIQPCKADEESDSQMSPSHSSTDDHSNWFIIGSKTDAERVVNQYQELVQNEAKCPVKMSHSSSGSASLSQVSPGKETDQTETVSVQSSVLGKGVKHRPPPIKLPSSSGNSSSGNYFTPQQTSSFLKSPTPPPSSKPSSIPRKSSVDLNQVSMLSPAALSPASSSQRTTATQVMANSAGLNFINVVGSVCGAQALMSGSNPMLGCNTGAITPAGINLSGLLPSGGLLPNALPSAMQAASQAGVPFGLKNTSSLRPLNLLQLPGGSLIFNTLQQQQQQLSQFTPQQPQQPTTCSPQQPGEQGSEQGSTSQEQALSAQQAAVINLTGVGSFMQSQAAVLSQLGSAENRPEQSLPQQRFQLSSAFQQQQQQIQQLRFLQHQMAMAAAAAQTAQLHHHRHTGSQSKSKMKRGTPTTPKF 779 T 6.5E-20 Spt20 pdb F Eukaryota T 8h7g 6 F G TADA1_HUMAN SPT3-ASSOCIATED FACTOR 42,STAF42,TRANSCRIPTIONAL ADAPTER 1-LIKE PROTEIN MDYKDHDGDYKDHDIDYKDDDDKGGSGGSLEVLFQGPLDMATFVSELEAAKKNLSEALGDNVKQYWANLKLWFKQKISKEEFDLEAHRLLTQDNVHSHNDFLLAILTRCQILVSTPDGAGSLPWPGGSAAKPGKPKGKKKLSSVRQKFDHRFQPQNPLSGAQQFVAKDPQDDDDLKLCSHTMMLPTRGQLEGRMIVTAYEHGLDNVTEEAVSAVVYAVENHLKDILTSVVSRRKAYRLRDGHFKYAFGSNVTPQPYLKNSVVAYNNLIESPPAFTAPCAGQNPASHPPPDDAEQQAALLLACSGDTLPASLPPVNMYDLFEALQVHREVIPTHTVYALNIERIITKLWHPNHEELQQDKVHRQRLAAKEGLLLC 374 T 3.7E-15 SAGA-Tad1 pdb F Eukaryota T 8h7g 13 M X Unassigned sequence XXXXXXXXXXXXXXXXXXX 19 F F F 8h89 2 J,K,L,M,N,O,P,Q,R J,K,L,M,N,O,P,Q,R A0A345GTT2_9CAUD GP1 MALIQSDFAQGIRMTPVPDCAGDVTACRFDITLKNAPAAGDIIELGVLPGNAVPVEAILDVDDLDTGGAPTITLDVGIMSGPVGKNDPARTCGNELFAASTVGQAGGVVRATASSAFRIQKAEDHRSVGVKVAAGPATGAAGKTIALILFYVQGTSQ 157 T 81 DUF6476 pdbhh T Viruses T 8h8h 1 A,B,C,D,E,F,G,H A,B,C,D,E,F,G,H Q88GF9_PSEPK H-NS family protein MvaT MSRLAEFRAAEKALQEQMAQLEALKKDAGLKREIEFEQKLVGLMKSYDKSLEHHHHHH 58 T 0.00012 Histone_HNS unppercent F Bacteria T 8ha0 1 A A Guanine nucleotide-binding protein g(s) subunit alpha MGCTLSAEDKAAVERSKMIEKQLQKDKQVYRATHRLLLLGADNSGKSTIVKQMRIYHVNGYSEEECKQYKAVVYSNTIQSIIAIIRAMGRLKIDFGDSARADDARQLFVLAGAAEEGFMTAELAGVIKRLWKDSGVQACFNRSREYQLNDSAAYYLNDLDRIAQPNYIPTQQDVLRTRVKTSGIFETKFQVDKVNFHMFDVGAQRDERRKWIQCFNDVTAIIFVVDSSDYNRLQEALNDFKSIWNNRWLRTISVILFLNKQDLLAEKVLAGKSKIEDYFPEFARYTTPEDATPEPGEDPRVTRAKYFIRDEFLRISTASGDGRHYCYPHFTCSVDTENARRIFNDCRDIIQRMHLRQYELL 361 F F T 8haf 1 A A Guanine nucleotide-binding protein G(s) subunit alpha-1 MGCTLSAEDKAAVERSKMIEKQLQKDKQVYRATHRLLLLGADNSGKSTIVKQMRIYHVNGYSEEECKQYKAVVYSNTIQSIIAIIRAMGRLKIDFGDSARADDARQLFVLAGAAEEGFMTAELAGVIKRLWKDSGVQACFNRSREYQLNDSAAYYLNDLDRIAQPNYIPTQQDVLRTRVKTSGIFETKFQVDKVNFHMFDVGAQRDERRKWIQCFNDVTAIIFVVDSSDYNRLQEALNDFKSIWNNRWLRTISVILFLNKQDLLAEKVLAGKSKIEDYFPEFARYTTPEDATPEPGEDPRVTRAKYFIRDEFLRISTASGDGRHYCYPHFTCSVDTENARRIFNDCRDIIQRMHLRQYELL 361 F F T 8hao 1 A,C A,C Guanine nucleotide-binding protein G(s) subunit alpha MGCTLSAEDKAAVERSKMIEKQLQKDKQVYRATHRLLLLGADNSGKSTIVKQMRIYHVNGYSEEECKQYKAVVYSNTIQSIIAIIRAMGRLKIDFGDSARADDARQLFVLAGAAEEGFMTAELAGVIKRLWKDSGVQACFNRSREYQLNDSAAYYLNDLDRIAQPNYIPTQQDVLRTRVKTSGIFETKFQVDKVNFHMFDVGAQRDERRKWIQCFNDVTAIIFVVDSSDYNRLQEALNDFKSIWNNRWLRTISVILFLNKQDLLAEKVLAGKSKIEDYFPEFARYTTPEDATPEPGEDPRVTRAKYFIRDEFLRISTASGDGRHYCYPHFTCSVDTENARRIFNDCRDIIQRMHLRQYELL 361 F F T 8hbr 1 A,B,C,D,E,F A,B,C,D,E,F A0A2K1IUB4_PHYPA TOG domain-containing protein GSHMLWSQAMESVRASDFDLAYADILGSNDELLLVRLMSRTGPVLEQLSDATLTHLMGNLKHFLQQQSFLECVIPWIQQVADLVLSNGPNALGLTGDSKKDLVFALQEAASMDHAQSWMAAKIVELAEQLRSAWL 135 T 0.084 Dna2 pdb F Eukaryota T 8hcr 8 H,Q I,U CYTOCHROME AA3 SUBUNIT CtaJ MSAMEIHLFFVGIPLLLVVVLSVLIWSRKGPHPATYKLSEPWTHPPILWAATDVVGSAHGGHGHDASEFTVGGGASGTW 79 T 0.2 ASFV_J13L pdbhh F T 8hdj 1 A,C,E,G A,C,E,G Periplasmic domain of RsgI2 SSQEYAYIDVDIN 13 T 2.1 Peptidase_M15_4 pdbhh F T 8hdj 2 B,D,F,H B,D,F,H RSGI2_ACET2 Anti-sigma-I factor RsgI2 PSIGLVIDKKEKVIDAKPLNNDAKPILDEAAPKDMPLYDALSKILDISKKNGYINSADNIVLFSASINSGRNNVSESDKGIQEIISTLKDVAKDAGVKFEIIPSTEEDRQKALDQNLSMGRYAIYVKAVEEGVNLNLEDARNLSVSEILGKVNIGKFAISDT 162 T 0.0038 Spore_III_AB pdbpercent F Bacteria T 8hdr 2 C,D,E,F,G,H A,B,C,D,E,F Pam3 connector protein MIDVAIAIDAESVEVTWRNRSGGSYDSRGNATGASWADTQIRAAIQPVSGRELQDLPEGVRSKVTLVAWTRSEVAENDQIIYLGDAYRVYAARPRPMDGFTRIALGKVSP 110 T 0.00037 Phage_H_T_join pdbhh F T 8hdr 3 I,J,K,L,M,N G,H,I,J,K,L Pam3 terminator protein MRRITGITVIKDHQSEDRPALPYGVVELANFRDLHQQVRTIHYEDIEDSDNGEGFPEVQATPEVEQEWVFLVQVYGPGGLDYLRKVAAAFHVNQVNDLPGSLVIHEVAQINSIPEFLGERWEKRAQTNITLRGMSTDGFKVDVIEQHVINVTGERA 156 T 7.9 DUF3168 pdbhh F T 8hdr 4 O,P,Q,R,S,T,U,V,W,X,Y,Z M,N,O,P,Q,R,S,T,U,V,W,X Pam3 sheath protein MAKLPYSRVTNVTLTRTDNFPTRRGFGTQLILTHTAVSGQVDATKRTKLYASLAEVEADYPANTSVYKAALSAFSQNPRPIRLKVGYAATPTGGDDAAKKADFITSLGAILNYDQAFYQITLDAALRDQPYLDGLVEWVEAQPKIAMIDSNAAGHEDPANTTVIAARHKGTVERTAVFYHTDSTEYLAASMAAYMSTRVFDDANSAYTLKFKKAPGVRAIDKGSAVVTAITGFVEQTGQSESAGHCANTLIDIGDQEFLVEGSTLTQNVFLDEIHATDWIIARTEEEMLSLFLNNDRVPFTDQGMQQLASVPRAIMQLAARAGIVALDLNPLTGAYEPAYTITVPSVFDIPESQRKARIAPAIQVRFRYAGAVHYSVINYTMTF 384 T 3.3E-20 DUF3383 pdbpssm F T 8hdt 2 G,H,I,J,K,L,M G,H,I,J,K,L,M Cement MAPYNETYASDYAFAYEGMVSDIAPADIISRTVETSAGIGFGKIVAQGTSDRGCKADVSAVSPTAPPLGITVRSQATENLTLDKYPRYDGAAIMRKGVIWVLVTDAGGVVAGDPVWLKKSDGTFSNADVGSSGGLRLAGCRWDTSAANGALARMRVDFDVPPVAGA 166 T 5.7E-05 DUF2190 pdbhh F T 8hdu 1 A,B,C,D,E A,B,C,D,E De novo design cavitated protein MGSSHHHHHHSSGLVPRGSHMASMTGGQQMGRGSEDEKKKIEELLKKAKEMLKKYASNIDKFIAALRRVVQALYDAGAYQVVIRMYQAALAGQIDREHLRFLIETLQRIMANAPSEMTRMAALLLRLLALLALLTGDLLLVILLAAMIILLFAGYGEVVVKIFKIIREMPDKEEALKKAVELAIKMVEEFRKKQGLEHHHHHH 203 T 0.06 DUF5344 pdb F T 8hdv 1 A,B A,B De novo design cavitated protein MGSSHHHHHHSSGLVPRGSHMASMTGGQQMGRGSEDVIKQALKRVQQYIQQAPNGYRDVIQQILQTVLKILKLMGMPEVEAVLIVAYVAEMLVLAAKYGYIDELLKLAKEALEADDVDKMIEIFLKMLKIMFLALALDPEGLKKLKELKKNGSEEVRKLIEEVIKQLKQQRQQQALEHHHHHH 183 T 0.059 DUF3103 pdb F T 8hdw 2 C,D,E,F,G,H,I,J,K,L,M,N M,N,O,P,Q,R,S,T,U,V,W,X Pam3 sheath protein MAKLPYSRVTNVTLTRTDNFPTRRGFGTQLILTHTAVSGQVDATKRTKLYASLAEVEADYPANTSVYKAALSAFSQNPRPIRLKVGYAATPTGGDDAAKKADFITSLGAILNYDQAFYQITLDAALRDQPYLDGLVEWVEAQPKIAMIDSNAAGHEDPANTTVIAARHKGTVERTAVFYHTDSTEYLAASMAAYMSTRVFDDANSAYTLKFKKAPGVRAIDKGSAVVTAITGFVEQTGQSESAGHCANTLIDIGDQEFLVEGSTLTQNVFLDEIHATDWIIARTEEEMLSLFLNNDRVPFTDQGMQQLASVPRAIMQLAARAGIVALDLNPLTGAYEPAYTITVPSVFDIPESQRKARIAPAIQVRFRYAGAVHYSVINYTMTF 384 T 3.3E-20 DUF3383 pdbpssm F T 8he0 2 B B HIF1A_HUMAN HIF-1-ALPHA,HIF1-ALPHA GSQRKRKMEHDGSLFQAVGIGTLLQQPDDHAATTSLSWKRVKG 43 T 33 PheRS_DBD2 pdbhh F Eukaryota T 8he3 2 B B Hypoxia-inducible factor 1-alpha GSQRKRKMEWKRVKG 15 T 3 Nucleos_tra2_N pdbhh F T 8hep 1 A A H6SHX8_ACETH Anti-sigma factor MYGYICVDIN 10 T 0.014 Pox_F11 unppssm F Bacteria T 8hep 2 B B H6SHX8_ACETH Anti-sigma factor PSVELVIDETCRVLEVRPQNKDGEQLISGLELLDKNVEDVVYELINRSISFGFVKADDNRKIVLISGALNDKRNELKTKKENDEAELTELLDNIKARVDRIDNIKVRTITATSRERKDALKYGLSMGKYCLYLEAQELNGSITIDEVHDMSISDMIEKLEHHHHHH 166 T 0.0087 SesA pdb F Bacteria T 8heq 1 A A RSGI2_ACET2 Anti-sigma-I factor RsgI2 MYAYIDVDIN 10 T 2 DUF4179 unppssm F Bacteria T 8heq 2 B B RSGI2_ACET2 Anti-sigma-I factor RsgI2 PSIGLVIDKKEKVIDAKPLNNDAKPILDEAAPKDMPLYDALSKILDISKKNGYINSADNIVLFSASINSGRNNVSESDKGIQEIISTLKDVAKDAGVKFEIIPSTEEDRQKALDQNLSMGRYAIYVKAVEEGVNLNLEDARNLSVSEILGKLEHHHHHH 159 T 0.0077 Spore_III_AB unppercent F Bacteria T 8her 1 A A H6SHY0_ACETH Anti-sigma factor MYAYVGIDIN 10 T 0.86 Peptidase_M23_N pdbhh F Bacteria T 8her 2 B B H6SHY0_ACETH Anti-sigma factor PSIELWINYNNKIAEAKALNGDAETVLEGLELKEKTVAEAVNEIVQKSMELGFISREKENIILISTACDLKAGEGSENKDVQNKIGQLFDDVNKAVSDLKNSGITTRILNLTLEERESSKEENISMGRYAVYLKAKEQNVNLTIDEIKDADLLELIAKLEHHHHHH 166 T 0.0064 Gypsy pdb F Bacteria T 8hev 2 B,D,F,H,J,L,N,P,R,T,V,X R,D,E,F,W,N,P,S,X,O,Q,T Unknown peptide XXXXXXXXXXXXXXX 15 F F F 8hf2 1 A,B,C,D,E A,B,C,D,E A0A654EJS8_ARATH WEITSING METVSAVNQTLPISGGEPVKFTTYSAAVHKVLVMVNAGILGLLQLVSQQSSVLETHKAAFLCFCVFILFYAVLRVREAMDVRLQPGLVPRLIGHGSHLFGGLAALVLVSVVSTAFSIVLFLLWFIWLSAVVYLETNKPSACPPQLPPV 148 T 0.0024 Frag1 pdb F Eukaryota T 8hif 3 GE,ZD y1,y2 Q5YFC8_9VIRU VP137 MDCATYATRKDKGWELNENRCVWAASVKPTSGAIMTNVGVHGKSGNAVLMTPKRRPHAQNHAGYKIKYCKQVPLIPLHGGDYILNHWETRGVDRMRIPGIQHAPPPPAPSGMQNAYSTHPDAYRTPLLADSHALSRMPVVQVHGPQVAPKNSHFTVAPEKHGPVEDMNAIINALPTKVDAVKLEYSASKTNRTNKRPGDGGAPPPKNLSKCHQNKLKTFARTANSGANPFRPATAAPQGLSKQPVRKPFASARNANSGANPFRPPLAHQGLSKAHVVKTAVSVANRSAGAEPFVTRNDPRALAMELANNKTISVTLGLRHWKTVSAAPPEKMSKSGVCKIATNVYNRDGGANPFLVKYEPDSLAVCPMETVEIAAVPSKRPWEGSANPRRPEQISFGMSDKPKFVNDKIGIVLRGPTLAPTLDRTATHTVTRPRALGSFHSTAGPAKHAASIMAECKDESR 461 T 87 TAL_effector pdbhh T Viruses T 8hif 5 BE y5 Q5YFK6_9VIRU VP59 MDSQGFWAILAFTPVLMILSLKGEGLLAMVGLLVLTVTLLASREKNDRPRLSCRGKIGRKVSGFENAGHVRDSHHVIYKRPPVNEYCAETREDNSLYVPEYCGQNWKNGVLSGMGTHHDAYRNLAVNMMTLRRESAVSAGWAHSYL 146 T 4.2 DAG1 pdbhh T Viruses T 8hif 6 CE y6 Q5YFC6_9VIRU VP139 MNAYQNDKLHLCAPRPDLVRAAMSAMVRETGCTPNVNIREMAISAGVMLTKIRANPGMLRYGMTATQTVIYNLKELFAAHAARGVVFKTPAIHPAHPSQWKGF 103 T 2.7 DUF3285 pdbhh T Viruses T 8hif 7 HE y3 Q5YFQ1_9VIRU Penton protein (VP14) MYRGFSLKLPNNYRSGQVTTEHRLPASNSHARWPVEVSFYSAVLHVPAKHQHKFPPVLELKLHNMTTGSMAAHRGSGHHFTFMFHAQSSPTEAVYSCVPVPIVFSDYQSNVIASVDMGEHDTAEKLHFYGSIRNCDNGCTY 141 T 5.7 DUF1848 pdbhh T Viruses T 8hj4 3 C C V9H5N5_9NEIS Phage protein SMNNSIKFHVSYDGTARALFNTKEQAEKYCLVEEINDEMNGYKRKSWEEKLREENCASVQDWVEKNYTSSYSDLFNICEIEVSSAGQLVKIDNTEVDDFVENCYGFTLEDDLEEFNKAKQYLQKFYAECEN 131 T 0.028 LZ3wCH pdb F Bacteria T 8hjc 1 A A Bidentatide CLESGTSCIPGAQHNCCSGVCVPIVTIFYGVCY 33 T 0.0013 Tryp_inh pdb F T 8hjd 1 A A Gly-bidentatide CLESGTSCIPGAQHNCCSGVCVPIVTIFYGVCY 33 T 0.0013 Tryp_inh pdb F T 8hkw 2 C,D C,D TP53B_HUMAN P53BP1 SGKRKLITSEEERSPAKRGRKS 22 T 35 GMAP pdbhh F Eukaryota T 8hlo 2 B C MICA1_HUMAN MOLECULE INTERACTING WITH CASL PROTEIN 1,MICAL-1,NEDD9-INTERACTING PROTEIN WITH CALPONIN HOMOLOGY AND LIM DOMAINS GPGSEPPPKPPRS 13 T 2.9 Dscam_C pdbhh F Eukaryota T 8hn9 2 C C CCNE2 peptide SPVKLKTFKXIPM 13 T 0.36 DUF3754 pdbhh F T 8hns 1 A,B A,B anti-CRISPR protein AcrIIC4 SMKITSSNFATIATSENFAKLSVLPKNHREPIKGLFKSAVEQFSSARDFFKNENYSKELAEKFNKEAVNEAVEKLQKAIDLAEKQGIQF 89 T 1.5 Nif11 pdbhh F T 8hnt 3 C C anti-CRISPR protein AcrIIC4 SMKITSSNFATIATSENFAKLSVLPKNHREPIKGLFKSAVEQFSSARDFFKNENYSKELAEKFNKEAVNEAVEKLQKAIDLAEKQGIQF 89 T 1.5 Nif11 pdbhh F T 8hnv 5 E E anti-CRISPR protein AcrIIC4 SMKITSSNFATIATSENFAKLSVLPKNHREPIKGLFKSAVEQFSSARDFFKNENYSKELAEKFNKEAVNEAVEKLQKAIDLAEKQGIQF 89 T 1.5 Nif11 pdbhh F T 8hpo 5 E I SAP30_YEAST Transcriptional regulatory protein SAP30 MARPVNTNAETESRGRPTQGGGYASNNNGSCNNNNGSNNNNNNNNNNNNNSNNSNNNNGPTSSGRTNGKQRLTAAQQQYIKNLIETHITDNHPDLRPKSHPMDFEEYTDAFLRRYKDHFQLDVPDNLTLQGYLLGSKLGAKTYSYKRNTQGQHDKRIHKRDLANVVRRHFDEHSIKETDCIPQFIYKVKNQKKKFKMEFRG 201 T 0.041 NAM-associated pdb F Eukaryota T 8hsv 2 B,D E,F D3ZVH5_RAT peptide from E3 ubiquitin-protein ligase Mdm2 DLDDGVSDHSADCLDQDS 18 T 27 Raf1_N pdbhh F Eukaryota T 8hvp 2 C I INHIBITOR VAL-SER-GLN-ASN-LEU-PSI(CH(OH)-CH2)-VAL-ILE-VAL (U-85548E) VSQNXIV 7 T 120 Diphtheria_R pdbhh F T 8hzw 1 A A noursinH11W APSNVLSXLLWGRACV 16 T 0.91 DUF765 pdbhh F T 8i03 8 I I RXT3_SCHPO Transcriptional regulatory protein rxt3 MEEKTPENEQSKKTFDPKDSMKIEETSTNGSSQPSQPSNIKLSIGSILESSNDNGDPEYSENGMGNMNMNTLPMATSTPMSYTKQPSEAKYPNSVWERKGVSDQEENTSSVKRQKTLPTQSSGEEEAKYSHPGAPTATSADSISMESRPSNLSTSLSKTTSYPQFQVRQFVSPIISIDNSALEPFLNRYPASESLFPVTEYEYTPWLEFPLLYSSIGKFVRVTIDIKWLNAAINPRLCRREIWGTDVYTDDSDIATILAHCGCFSLLKPVRKIAVVDLYILPPLVHYKGTRKNQIESRSWSSRQDGISLKIKEVTWKPACASIFENSIHTLTLEERLQARLELSRSSTFKI 351 T 8.9E-16 Rxt3 unppssm F Eukaryota T 8i0n 4 G,H U,V C5AR1_HUMAN C5A ANAPHYLATOXIN CHEMOTACTIC RECEPTOR,C5A-R,C5AR ESKSFTRSTVDTMAQKTQAV 20 T 24 DUF4355 pdbhh F Eukaryota T 8i0q 4 G,H U,V CXCR4_HUMAN CXC-R4,CXCR-4,FB22,FUSIN,HM89,LCR1,LEUKOCYTE-DERIVED SEVEN TRANSMEMBRANE DOMAIN RECEPTOR,LESTR,LIPOPOLYSACCHARIDE-ASSOCIATED PROTEIN 3,LAP-3,LPS-ASSOCIATED PROTEIN 3,NPYRL,STROMAL CELL-DERIVED FACTOR 1 RECEPTOR,SDF-1 RECEPTOR GHSSVSTESESSSFHSS 17 T 55 DUF5582 pdbhh F Eukaryota T 8i0z 4 F,K,L G,U,V C5AR1_HUMAN C5A ANAPHYLATOXIN CHEMOTACTIC RECEPTOR,C5A-R,C5AR ESKSFTRSTVDTMAQKTQAV 20 T 24 DUF4355 pdbhh F Eukaryota T 8i10 4 F,K,L G,U,V V2R_HUMAN V2R,AVPR V2,ANTIDIURETIC HORMONE RECEPTOR,RENAL-TYPE ARGININE VASOPRESSIN RECEPTOR ARGRTPPSLGPQDESCTTASSSLAKDTSS 29 T 21 DUF6352 pdbhh F Eukaryota T 8i60 2 C,D A,B ALA-ARG-KCR-SER-ALA-PRO ATKAARXSAPATG 13 T 95 OxoGdeHyase_C pdbhh F T 8i87 2 B,D,E,H B,D,F,T A0A316E3U6_9FLAO Piwi domain-containing protein MKELIYIEEPKILFAHGQKCTDARDGLALFGPLNNLYGIKSGVIGTKQGLKIFRDYLDHIQKPIYNSNSITRPMFPGFEAVFDCKWESTGITFKEVTNEDIGKFLYNSSTHKRTYDLVSLFIDKIISANKNEDENVDVWFVIVPDEIYKYCRPNSVLPKEMVQTKALMSKSKAKSFRYEPSLFPDINIELKEQEKEAETYNYDAQFHDQFKARLLKHTIPTQIFRESTLAWRDFKNAFGLPIRDFSKIEGHLAWTISTAAFYKAGGKPWKLSDVRNGVCYLGLVYKKVEKSKNPRNACCAAQMFLDNGDGTVFKGEVGPWYNPKNGQYHLEPKEAKALLSQSLQSYKEQIGEYPKEVFIHAKTRFNHQEWDAFLEVTPKETNLVGVTISKTKPLKLYKTEGDYTILRGNAYVVNERSAFLWTVGYVPKIQTALSMEVPNPLFIEINKGEADIKQVLKDILSLTKLNYNACIFADGEPVTLRFADKIGEILTASTDIKTPPLAFKYYI 507 T 0.28 TPR_10 pdbpercent F Bacteria T 8i88 2 B B A0A316E3U6_9FLAO Piwi domain-containing protein MKELIYIEEPKILFAHGQKCTDARDGLALFGPLNNLYGIKSGVIGTKQGLKIFRDYLDHIQKPIYNSNSITRPMFPGFEAVFDCKWESTGITFKEVTNEDIGKFLYNSSTHKRTYDLVSLFIDKIISANKNEDENVDVWFVIVPDEIYKYCRPNSVLPKEMVQTKALMSKSKAKSFRYEPSLFPDINIELKEQEKEAETYNYDAQFHDQFKARLLKHTIPTQIFRESTLAWRDFKNAFGLPIRDFSKIEGHLAWTISTAAFYKAGGKPWKLSDVRNGVCYLGLVYKKVEKSKNPRNACCAAQMFLDNGDGTVFKGEVGPWYNPKNGQYHLEPKEAKALLSQSLQSYKEQIGEYPKEVFIHAKTRFNHQEWDAFLEVTPKETNLVGVTISKTKPLKLYKTEGDYTILRGNAYVVNERSAFLWTVGYVPKIQTALSMEVPNPLFIEINKGEADIKQVLKDILSLTKLNYNACIFADGEPVTLRFADKIGEILTASTDIKTPPLAFKYYI 507 T 0.28 TPR_10 pdbpercent F Bacteria T 8i9r 48 WA CX G0SGJ0_CHATD 60S ribosomal subunit-like protein MGKTRTIKNKHAEPSKKKAKKAGDGGVKKTKDRAGSKSKAKVPAIEVKGKPNLGQDKKKQKRVYSEKELGIPQLNTVTPVGVTKPKGKKKGKVFVDDRESMNTILAMVEAEKQGQIESKIMRARQLEEIREARRIEAEKKEAERKALLENTKEQLRKKRKKNKKGGDQKSSEDEGPSLKELTSTGSKAVKSKKKKKVSFATPE 203 T 0.32 LptF_LptG pdbpercent F Eukaryota T 8i9t 23 W CX G0SGJ0_CHATD 60S ribosomal subunit-like protein MGKTRTIKNKHAEPSKKKAKKAGDGGVKKTKDRAGSKSKAKVPAIEVKGKPNLGQDKKKQKRVYSEKELGIPQLNTVTPVGVTKPKGKKKGKVFVDDRESMNTILAMVEAEKQGQIESKIMRARQLEEIREARRIEAEKKEAERKALLENTKEQLRKKRKKNKKGGDQKSSEDEGPSLKELTSTGSKAVKSKKKKKVSFATPE 203 T 0.32 LptF_LptG pdbpercent F Eukaryota T 8i9v 24 X CX G0SGJ0_CHATD 60S ribosomal subunit-like protein MGKTRTIKNKHAEPSKKKAKKAGDGGVKKTKDRAGSKSKAKVPAIEVKGKPNLGQDKKKQKRVYSEKELGIPQLNTVTPVGVTKPKGKKKGKVFVDDRESMNTILAMVEAEKQGQIESKIMRARQLEEIREARRIEAEKKEAERKALLENTKEQLRKKRKKNKKGGDQKSSEDEGPSLKELTSTGSKAVKSKKKKKVSFATPE 203 T 0.32 LptF_LptG pdbpercent F Eukaryota T 8i9w 24 X CX G0SGJ0_CHATD 60S ribosomal subunit-like protein MGKTRTIKNKHAEPSKKKAKKAGDGGVKKTKDRAGSKSKAKVPAIEVKGKPNLGQDKKKQKRVYSEKELGIPQLNTVTPVGVTKPKGKKKGKVFVDDRESMNTILAMVEAEKQGQIESKIMRARQLEEIREARRIEAEKKEAERKALLENTKEQLRKKRKKNKKGGDQKSSEDEGPSLKELTSTGSKAVKSKKKKKVSFATPE 203 T 0.32 LptF_LptG pdbpercent F Eukaryota T 8i9x 25 Y CX G0SGJ0_CHATD 60S ribosomal subunit-like protein MGKTRTIKNKHAEPSKKKAKKAGDGGVKKTKDRAGSKSKAKVPAIEVKGKPNLGQDKKKQKRVYSEKELGIPQLNTVTPVGVTKPKGKKKGKVFVDDRESMNTILAMVEAEKQGQIESKIMRARQLEEIREARRIEAEKKEAERKALLENTKEQLRKKRKKNKKGGDQKSSEDEGPSLKELTSTGSKAVKSKKKKKVSFATPE 203 T 0.32 LptF_LptG pdbpercent F Eukaryota T 8i9y 25 Y CX G0SGJ0_CHATD 60S ribosomal subunit-like protein MGKTRTIKNKHAEPSKKKAKKAGDGGVKKTKDRAGSKSKAKVPAIEVKGKPNLGQDKKKQKRVYSEKELGIPQLNTVTPVGVTKPKGKKKGKVFVDDRESMNTILAMVEAEKQGQIESKIMRARQLEEIREARRIEAEKKEAERKALLENTKEQLRKKRKKNKKGGDQKSSEDEGPSLKELTSTGSKAVKSKKKKKVSFATPE 203 T 0.32 LptF_LptG pdbpercent F Eukaryota T 8i9z 26 Z CX G0SGJ0_CHATD 60S ribosomal subunit-like protein MGKTRTIKNKHAEPSKKKAKKAGDGGVKKTKDRAGSKSKAKVPAIEVKGKPNLGQDKKKQKRVYSEKELGIPQLNTVTPVGVTKPKGKKKGKVFVDDRESMNTILAMVEAEKQGQIESKIMRARQLEEIREARRIEAEKKEAERKALLENTKEQLRKKRKKNKKGGDQKSSEDEGPSLKELTSTGSKAVKSKKKKKVSFATPE 203 T 0.32 LptF_LptG pdbpercent F Eukaryota T 8ia0 26 Z CX G0SGJ0_CHATD 60S ribosomal subunit-like protein MGKTRTIKNKHAEPSKKKAKKAGDGGVKKTKDRAGSKSKAKVPAIEVKGKPNLGQDKKKQKRVYSEKELGIPQLNTVTPVGVTKPKGKKKGKVFVDDRESMNTILAMVEAEKQGQIESKIMRARQLEEIREARRIEAEKKEAERKALLENTKEQLRKKRKKNKKGGDQKSSEDEGPSLKELTSTGSKAVKSKKKKKVSFATPE 203 T 0.32 LptF_LptG pdbpercent F Eukaryota T 8ia4 2 C Q peptide IELSG 5 T 64 NlpE_C pdbhh F F 8ia5 2 B B 9-mer peptide DLENLYFQG 9 T 5.7 DUF1563 pdbhh F T 8ia8 1 A L ALA-SER-LYS-LEU-GLY-LEU-ALA-ARG WWGKKYRASKLGLAR 15 T 13 GXWXG pdbhh F T 8ibl 1 A,B A,B W0TJ64_9PSEU CUTINASE GPNPYERGPDPTEDSIEAIRGPFSVATERVSSFASGFGGGTIYYPRETDEGTFGAVAVAPGFTASQGSMSWYGERVASHGFIVFTIDTNTRLDAPGQRGRQLLAALDYLVERSDRKVRERLDPNRLAVMGHAMGGGGSLEATVMRPSLKASIPLTPWHLDKTWGQVQVPTFIIGAELDTIAPVSTHAKPFYESLPSSLPKAYMELCGATHFAPNIPNTTIAKYVISWLKRFVDEDTRYSQFLCPNPTDRAICEYRSTCPYKLN 263 F F Bacteria T 8ibm 1 A,B A,B W0TJ64_9PSEU CUTINASE GPNPYERGPDPTEDSIEAIRGPFSVATERVSSFASGFGGGTIYYPRETDEGTFGAVAVAPGFTASQGSMSWYGERVASHGFIVFTIDTNTRLDAPGQRGRQLLAALDYLVERSDRKVRERLDPNRLAVMGHAMGGGGSLEATVMRPSLKASIPLTPWHLDKTWGQVQVPTFIIGAELDTIAPVSTHAKPFYESLPSSLPKAYMELCGATHFAPNIPNTTIAKYVISWLKRFVDEDTRYSQFLCPNPTDRAICEYRSTCPYKLN 263 F F Bacteria T 8ig0 1 A,B,C,D A,B,C,D MEN1_HUMAN Menin MGLKAAQKTLFPLRSIDDVVRLFAAELGREEPDLVLLSLVLGFVEHFLAVNRVIPTNVPELTFQPSPAPDPPGGLTYFPVADLSIIAALYARFTAQIRGAVDLSLYPREGGVSSRELVKKVSDVIWNSLSRSYFKDRAHIQSLFSFITGTKLDSSGVAFAVVGACQALGLRDVHLALSEDHAWVVFGPNGEQTAEVTWHGKGNEDRRGQTVNAGVAERSWLYLKGSYMRCDRKMEVAFMVCAINPSIDLHTDSLELLQLQQKLLWLLYDLGHLERYPMALGNLADLEELEPTPGRPDPLTLYHKGIASAKTYYRDEHIYPYMYLAGYHCRNRNVREALQAWADTATVIQDYNYCREDEEIYKEFFEVANDVIPNLLKEAASLLEAGEERPGEQSQGTQSQGSALQDPECFAHLLRFYDGICKWEEGSPTPVLHVGWATFLVQSLGRFEGQVRQKVRIVSGTVAGTARGPEGGSTAQVPAPTASPPPEGPVLTFQSEKMKGMKELLVATKINSSAIKLQLTAQSQVQMKKQKVSTPSDYTLSFLKRQRKGL 550 T 2.2E-24 Menin unppssm F Eukaryota T 8igg 1 A,B,C,D E,A,B,C CHMA_BP201 CHMA,PHAGE NUCLEUS ENCLOSURE PROTEIN,PHUN,GENE PRODUCT 105,GP105 MIRDTATNTTQTQAAPQQAPAQQFTQAPQEKPMQSTQSQPTPSYAGTGGINSQFTRSGNVQGGDARASEALTVFTRLKEQAVAQQDLADDFSILRFDRDQHQVGWSSLVIAKQISLNGQPVIAVRPLILPNNSIELPKRKTNIVNGMQTDVIESDIDVGTVFSAQYFNRLSTYVQNTLGKPGAKVVLAGPFPIPADLVLKDSELQLRNLLIKSVNACDDILALHSGERPFTIAGLKGQQGETLAAKVDIRTQPLHDTVGNPIRADIVVTTQRVRRNGQQENEFYETDVKLNQVAMFTNLERTPQAQAQTLFPNQQQVATPAPWVASVVITDVRNADGIQANTPEMYWFALSNAFRSTHGHAWARPFLPMTGVAKDMKDIGALGWMSALRNRIDTKAANFDDAQFGQLMLSQVQPNPVFQIDLNRMGETAQMDSLQLDAAGGPNAQKAAATIIRQINNLGGGGFERFFDHTTQPILERTGQVIDLGNWFDGDEKRDRRDLDNLAALNAAEGNENEFWGFYGAQLNPNLHPDLRNRQSRNYDRQYLGSTVTYTGKAERCTYNAKFIEALDRYLAEAGLQITMDNTSVLNSGQRFMGNSVIGNNMVSGQAQVHSAYAGTQGFNTQYQTGPSSFYALEHHHHHH 640 T 6.2 TGBp3 pdbhh T Viruses T 8igl 1 A,B A,B A0A2X0THU5_ASF CP2475L CDS PROTEIN,CP2475L PROTEIN,POLYPROTEIN PP220,PROTEIN CP2475L MHHHHHHHHHHGSDYKDHDGDYKDHDIDYKDDDDKELENLYFQGAGSMKIFLFHETVITGLNLLSAIYVLLNNFRNNIKGLDLDTIQKSIIEWLRETQAANVNRANLIDWLGRKHGAISEIRNPGLVIKEINMRLSMVYPDPTTEAAAAAQDRNLTTETLFAWIVPYVGIPAGGGVRPEQELAARYLVDNQRIMQLLLTNIFEMTSSFNKMVQVRFPETSTAQVHLDFTGLISLIDSLMADTKYFLDLLRPHIDKNIIQYYENRSNPGSFYWLEEHLIDKLIKPPTDAGGRPLPGGELGLEGVNQIINKTYTLLTKPYNVLQLRGGAQRRDAANIQINNNPQSSERFEQYGRVFSRLVFYDALENNSGLRVEQVALGDFRLSNLIRTNNAQEENTLSYWDNIALRTYANVNDAANNLRRYRLYGSDYGIQNNRSMMMVFNQLIASYITRFYDAPSGKIYLNLINAFANGNFSQAVMEMGYAHPDLARNNNVFGHRGDPTEQSVLLLSLGLILQRLIKDTNRQGLSQHLISTLTEIPIYLKENYRANLPLFNKMFNILISQGELLKQFIQYTNVQLARPNLTALLGANNDSVIYYNNNNVPATGLSVGQAALRGIGGVFRPNVTLMPLGDAQNNTSDVVRKRLVAVIDGIIRGSHTLADSAMEVLHELTDHPIYLETEEHFIQNYMSRYNKEPLMPFSLSLYYLHDLRIENNEVYDPLLYPNLESGSPEFKLLYGTRKLLGNDPVQLSDMPGVQLIMKNYNETVVAREQITPTRFEHFYTHAIQALRFIINIRSFKTVMMYNENTFGGVNLISENRDDKPIITAGIGMNAVYSLRKTLQDVISFVESSYQEEQINHIHKIVSPKGQTRTLGSNRERERIFNLFD 883 T 9.3 DUF3888 unp T Viruses T 8im8 1 A,B,C,D A,B,C,D AMY1_ECOLI 1,4-ALPHA-D-GLUCAN GLUCANOHYDROLASE MKLAACFLTLLPGFAVAASWTSPGFPAFSEQGTGTFVSHAQLPKGTRPLTLNFDQQCWQPADAIKLNQMLSLQPCSNTPPQWRLFRDGEYTLQIDTRSGTPTLMISIQNAAEPVASLVRECPKWDGLPLTVDVSATFPEGAAVRDYYSQQIAIVKNGQIMLQPAATSNGLLLLERAETDTSAPFDWHNATVYFVLTDRFENGDPSNDQSYGRHKDGMAEIGTFHGGDLRGLTNKLDYLQQLGVNALWISAPFEQIHGWVGGGTKGDFPHYAYHGYYTQDWTNLDANMGNEADLRTLVDSAHQRGIRILFDVVMNHTGYATLADMQEYQFGALYLSGDEVKKSLGERWSDWKPAAGQTWHSFNDYINFSDKTGWDKWWGKNWIRTDIGDYDNPGFDDLTMSLAFLPDIKTESTTASGLPVFYKNKMDTHAKAIDGYTPRDYLTHWLSQWVRDYGIDGFRVDTAKHVELPAWQQLKTEASAALREWKKANPDKALDDKPFWMTGEAWGHGVMQSDYYRHGFDAMINFDYQEQAAKAVDCLAQMDTTWQQMAEKLQGFNVLSYLSSHDTRLFREGGDKAAELLLLAPGAVQIFYGDESSRPFGPTGSDPLQGTRSDMNWQDVSGKSAASVAHWQKISQFRARHPAIGAGKQTTLLLKQGYGFVREHGDDKVLVVWAGQQ 676 T 6.800000000000001E-27 Alpha-amylase pdb F Bacteria T 8iqb 1 A,B A,B A0A0C5B022_ASF ASFVPRIMPOL GSMREESWEEHDTIQLTAQRKYLAEVQALETLLARELSVFLTEPGSKKTNIINRITGKTYALPSTELLRFYEHLEQCRKQGALMYFLERQGTYSGLMLDYDLKLNTNAAPSLESSVLSRLCHRIFVHIKNSSVLPEGSHKIHFFFTLKPEAVQGKYGFHVLIPGLKMAASTKKSIIASLQHDATVQKILHEQGVANPESCLDPHSASVPSLLYGSSKLNHRPYQLKTGFELVFDSSDPDYIPIHQIKNIESYNLVSELSLTNEQGSLVRPVYCAADIAAEKEEEIPA 287 T 0.0044 PPL4 unppercent T Viruses T 8iqc 1 A,B A,B A0A0C5B022_ASF Putative primase C962R GSLAEVQALETLLARELSVFLTEPGSKKTNIINRITGKTYALPSTELLRFYEHLEQCRKQGALMYFLERQGTYSGLMLDYDLKLNTNAAPSLESSVLSRLCHRIFVHIKNSSVLPEGSHKIHFFFTLKPEAVQGKYGFHVLIPGLKMAASTKKSIIASLQHDATVQKILHEQGVANPESCLDPHSASVPSLLYGSSKLNHRPYQLKTGFELVFDSSDPDYIPIHQIKNIESYNLVSELSLTNEQGSLVRPVYCA 254 T 0.0044 PPL4 unppercent T Viruses T 8iqd 1 A,B,C,D A,B,C,D A0A0C5B022_ASF Putative primase C962R GSLAEVQALETLLARELSVFLTEPGSKKTNIINRITGKTYALPSTELLRFYEHLEQCRKQGALMYFLERQGTYSGLMLDYDLKLNTNAAPSLESSVLSRLCHRIFVHIKNSSVLPEGSHKIHFFFTLKPEAVQGKYGFHVLIPGLKMAASTKKSIIASLQHDATVQKILHEQGVANPESCLDPHSASVPSLLYGSSKLNHRPYQLKTGFELVFDSSDPDYIPIHQIKNIESYNLVSELSLTNEQGSLVRPVYCA 254 T 0.0044 PPL4 unppercent T Viruses T 8itg 2 B B A0A385ZG42_9ACTN Tricyclic peptide MS-271 MSAVYEPPMLQEVGDFDELTKCLGVGSCNDFAGCGYAIVCFG 42 T 0.12 DUF5972 pdb F Bacteria T 8iyj 5 I 8 CF107_MOUSE Cilia- and flagella-associated protein 107 MAMLSTSVVPEAFSTPGWQIEKKYSTKVLLGNWVEERGKFTKAIDHTPQCIYRKEYVPMPDHRPDFVSRWYSKSKMEGLPYKHLITHHQEPSHRYLISTYDDHYNRHNYNPGLPALRTWNGQKLLWLPEKSDFPLVAPPTNYGLLEQLQQKWLASKTSLKESIYTTSYPRLPVCAMSRREHAIPVPHPRLQPIPRF 196 T 0.025 DUF1143 pdbpssm F Eukaryota T 8iyj 13 AC,BH D,K3 SPAG8_MOUSE SPERM MEMBRANE PROTEIN 1,SMP-1,SPERM MEMBRANE PROTEIN BS-84 METTESTEGSLSRSCDVQPSSERLDTPSEPVPSSSSSPRSTAPAEAPAQYSVLTEPSSDSLYGAPCPPAHHRGHGFGFQPFYVSCIPQDPCNMADLSSRADPTSSYPCHSSVHGSGSGTCGLGQSSEPSQGSGPTSGPAPASVPSLVSGPDSASGPDSSASGPALASGPGPADPGQGPKFSTCIPQGYRCIPVDLAPDYNAWCQHLHWKPQRSWEPLQVSEPGVRGPYKPPEPGALGPCEPCEPCEPPEAESEETLCKARPRGQCLLYNWEEERATNQLDQIPPLQDGSESYFFRHGHQGLLTTQPQSPMSSSTTQRDSYQLPRHICQPLRGKREAMLEMLLRHQICKEVQAEQEPARKLFETESVTHHDYRVELVRAAPPASTKPHDYRQEQPETFWIQRAARLPGVSDIRTLDTPFRKNCSFSTPVPLSLGQPLPYELESGPHQVGVISSLACQGGGQGCGRTKTTPI 470 T 2.3 DUF1143 pdbhh F Eukaryota T 8iyj 15 HJ,ND,XC N2,F,E CF161_MOUSE Cilia- and flagella-associated protein 161 MAQNVYGPGVRMGNWNEDVYLEEERMRHFLEKREKGELLIQRNRRVKKNILRPMQLSVSEDGYVHYGDKVIIVNPDQVLGEEAGKFMRGDLSLCMSPDEVKAQLSDDLEIPCGVSAVQTIAPMGRNTFTILSDGANSCEMGQVVVYGQNFCLGIAAGLEGKMLYLTSDHRTLLKSSLKSGLQEVTLTDEVTHLNCWQAAFLDPQLRLEYEGFPVRANEKIVIYHRHTNRALAVHRNLFLRTYFGKEMEVVAHTYLDSHKVEKPKNQWMLVTGNPRNKSNTMLDISKPITEDTRALEQAMGINT 303 T 0.066 DUF1143 pdbhh F Eukaryota T 8iyj 38 MT Z1 TEPP_MOUSE Testis, prostate and placenta-expressed protein MAQIIDLVPWDECSAHLYASPAVLLPLERVRHPLAGVKHQLYHPALPSLRRMDMDTVKGCLSDEHCQSSTYFSKDDFNKAHFTLLGVPNKPLQCLDFTATGQKLCHKYRGGKMIPIAPGINRVDWPCFTRAIEDWSKFVSRSEEFKLPCANKRVEGFSGYAVRYLKPEVTQNWRYCLNQNPSLDRYGQKPLPFDSLNAFRRFGSHYSRINYLTPWH 216 T 0.012 PAN_4 unppssm F Eukaryota T 8iyj 40 ST,TT,UT,VT a1,a2,a3,a4 CJ082_MOUSE Uncharacterized protein C10orf82 homolog MESPKTFMRKLPITPGYCGFIPWLSCQESSSEDRMNPCVKAFQERTQRYKEDQQGLNCSVANTPPLKPICSEDTVLWVLHEYAKKYHPLTLECKNEKKPLQEPPIPGWAGYLPRARVTEFGYATRYTIMAKKCYKDFLDLVEQAKRAQLKPYEQTYDVRAAQPLSPSSKILQLQGLSPAFPEFSGPGQTPPSEDPQAPRPCGCAQWSSQSCSRNVYGEPPSLAKAFAES 229 T 0.0034 DUF2475 pdbhh F Eukaryota T 8iyj 45 OU,VU,WU,XU i1,l,m,n FLTOP_MOUSE CILIA- AND FLAGELLA-ASSOCIATED PROTEIN 126 MATNYSANQYEKAYLPTYLQNWSPARPTKEKIAAHEGYTQIIANDRGHLLPSVPRSKASPWGSFMGTWQMPLKIPPAKVTLTARTTTAADNLTKWIHKNPDLLNACNGLRPEISGKPFDPDSQTKQKKSVTKTVQQAPNPTIIPSSPVIQGDNPDEPQSSHPSAGHTPGPQTPVNSPNNPPPSPCKSTK 189 T 0.2 DUF4248 pdb F Eukaryota T 8j07 11 CA,DA 1Y,1Z CF107_HUMAN Cilia- and flagella-associated protein 107 MFLTAVNPQPLSTPSWQIETKYSTKVLTGNWMEERRKFTRDTDKTPQSIYRKEYIPFPDHRPDQISRWYGKRKVEGLPYKHLITHHQEPPHRYLISTYDDHYNRHGYNPGLPPLRTWNGQKLLWLPEKSDFPLLAPPTNYGLYEQLKQRQLTPKAGLKQSTYTSSYPRPPLCAMSWREHAVPVPPHRLHPFPHF 194 T 0.14 DUF1143 pdbpercent F Eukaryota T 8j07 13 FA,GA,HA,IA,JA,KA,LA 2A,2B,2C,2D,2E,2F,2G FLTOP_HUMAN CILIA- AND FLAGELLA-ASSOCIATED PROTEIN 126 MATNYSANQYEKAFSSKYLQNWSPTKPTKESISSHEGYTQIIANDRGHLLPSVPRSKANPWGSFMGTWQMPLKIPPARVTLTSRTTAGAASLTKWIQKNPDLLKASNGLCPEILGKPHDPDSQKKLRKKSITKTVQQARSPTIIPSSPAANLNSPDELQSSHPSAGHTPGPQRPAKS 177 T 38 Scm3 pdbhh F Eukaryota T 8j07 15 OA,PA,QA,RA 2L,2M,2N,2O CF161_HUMAN Cilia- and flagella-associated protein 161 MAQNVYGPGVRIGNWNEDVYLEEELMKDFLEKRDKGKLLIQRSRRLKQNLLRPMQLSVTEDGYIHYGDKVMLVNPDDPDTEADVFLRGDLSLCMTPDEIQSHLKDELEVPCGLSAVQAKTPIGRNTFIILSVHRDATGQVLRYGQDFCLGITGGFDNKMLYLSSDHRTLLKSSKRSWLQEVYLTDEVSHVNCWQAAFPDPQLRLEYEGFPVPANAKILINHCHTNRGLAAHRHLFLSTYFGKEAEVVAHTYLDSHRVEKPRNHWMLVTGNPRDASSSMLDLPKPPTEDTRAMEQAMGLDTQ 301 T 0.028 zf-RING_5 pdbpssm F Eukaryota T 8j07 23 AC,BC,CC,DC,EC,FC,GC,HC,IC,JC,KC,LC,MC 4A,4B,4C,4D,4E,4F,4G,4H,4I,4J,4K,4L,4M F166B_HUMAN Protein FAM166B MAVASTFIPGLNPQNPHYIPGYTGHCPLLRFSVGQTYGQVTGQLLRGPPGLAWPPVHRTLLPPIRPPRSPEVPRESLPVRRGQERLSSSMIPGYTGFVPRAQFIFAKNCSQVWAEALSDFTHLHEKQGSEELPKEAKGRKDTEKDQVPEPEGQLEEPTLEVVEQASPYSMDDRDPRKFFMSGFTGYVPCARFLFGSSFPVLTNQALQEFGQKHSPGSAQDPKHLPPLPRTYPQNLGLLPNYGGYVPGYKFQFGHTFGHLTHDALGLSTFQKQLLA 275 F F Eukaryota T 8j07 30 DD,ED,FD 5H,5I,5J SPAG8_HUMAN HSD-1,SPERM MEMBRANE PROTEIN 1,SMP-1,SPERM MEMBRANE PROTEIN BS-84 METNESTEGSRSRSRSLDIQPSSEGLGPTSEPFPSSDDSPRSALAAATAAAAAAASAAAATAAFTTAKAAALSTKTPAPCSEFMEPSSDPSLLGEPCAGPGFTHNIAHGSLGFEPVYVSCIAQDTCTTTDHSSNPGPVPGSSSGPVLGSSSGAGHGSGSGSGPGCGSVPGSGSGPGPGSGPGSGPGHGSGSHPGPASGPGPDTGPDSELSPCIPPGFRNLVADRVPNYTSWSQHCPWEPQKQPPWEFLQVLEPGARGLWKPPDIKGKLMVCYETLPRGQCLLYNWEEERATNHLDQVPSMQDGSESFFFRHGHRGLLTMQLKSPMPSSTTQKDSYQPPGNVYWPLRGKREAMLEMLLQHQICKEVQAEQEPTRKLFEVESVTHHDYRMELAQAGTPAPTKPHDYRQEQPETFWIQRAPQLPGVSNIRTLDTPFRKNCSFSTPVPLSLGKLLPYEPENYPYQLGEISSLPCPGGRLGGGGGRMTPF 485 T 0.027 PIP49_C pdbpssm F Eukaryota T 8j07 35 FE 7 DRC7_HUMAN COILED-COIL DOMAIN-CONTAINING PROTEIN 135,COILED-COIL DOMAIN-CONTAINING PROTEIN LOBO HOMOLOG MEVLREKVEEEEEAEREEAAEWAEWARMEKMMRPVEVRKEEITLKQETLRDLEKKLSEIQITVSAELPAFTKDTIDISKLPISYKTNTPKEEHLLQVADNFSRQYSHLCPDRVPLFLHPLNECEVPKFVSTTLRPTLMPYPELYNWDSCAQFVSDFLTMVPLPDPLKPPSHLYSSTTVLKYQKGNCFDFSTLLCSMLIGSGYDAYCVNGYGSLDLCHMDLTREVCPLTVKPKETIKKEEKVLPKKYTIKPPRDLCSRFEQEQEVKKQQEIRAQEKKRLREEEERLMEAEKAKPDALHGLRVHSWVLVLSGKREVPENFFIDPFTGHSYSTQDEHFLGIESLWNHKNYWINMQDCWNCCKDLIFDLGDPVRWEYMLLGTDKSQLSLTEEDDSGINDEDDVENLGKEDEDKSFDMPHSWVEQIEISPEAFETRCPNGKKVIQYKRAKLEKWAPYLNSNGLVSRLTTYEDLQCTNILEIKEWYQNREDMLELKHINKTTDLKTDYFKPGHPQALRVHSYKSMQPEMDRVIEFYETARVDGLMKREETPRTMTEYYQGRPDFLSYRHASFGPRVKKLTLSSAESNPRPIVKITERFFRNPAKPAEEDVAERVFLVAEERIQLRYHCREDHITASKREFLRRTEVDSKGNKIIMTPDMCISFEVEPMEHTKKLLYQYEAMMHLKREEKLSRHQVWESELEVLEILKLREEEEAAHTLTISIYDTKRNEKSKEYREAMERMMHEEHLRQVETQLDYLAPFLAQLPPGEKLTCWQAVRLKDECLSDFKQRLINKANLIQARFEKETQELQKKQQWYQENQVTLTPEDEDLYLSYCSQAMFRIRILEQRLNRHKELAPLKYLALEEKLYKDPRLGELQKIFA 874 T 0.00016 Peptidase_C93 pdbhh F Eukaryota T 8j07 48 CJA,IV,IW,RIA r,Q,R,q ROP1L_HUMAN ROPN1-LIKE PROTEIN,AKAP-ASSOCIATED SPERM PROTEIN MPLPDTMFCAQQIHIPPELPDILKQFTKAAIRTQPADVLRWSAGYFSALSRGDPLPVKDRMEMPTATQKTDTGLTQGLLKVLHKQCHHKRYVELTDLEQKWKNLCLPKEKFKALLQLDPCENKIKWINFLALGCSMLGGSLNTALKHLCEILTDDPEGGPARIPFKTFSYVYRYLARLDSDVSPLETESYLASLKENIDARKNGMIGLSDFFFPKRKLLESIENSEDVGH 230 T 0.0089 RIIa unppercent F Eukaryota T 8j07 57 IDA a0 Unknown XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 72 F F F 8j07 87 NGA k2 DNAI4_HUMAN WD REPEAT-CONTAINING PROTEIN 78 MTPGKHSGASARAANGGAWGYRDFRGGQKKGWCTTPQLVATMPVSPAGSHKQQNFGLNNATQPKKSISFFATMKATSVKGYTGANQSRMAVSKTVLIPPELKTVEKPNPNIKTTQVFDINGTDVTPRPLYHPDPLTGTAKPSKLLTSQEGSLGSEFISSYSLYQNTINPSTLGQFTRSVLGSSTVSKSSVSASESIAEDLEEPSYKRERLTSFTDLQVIRAAPEKIVTKEDLEKNIEIILTETETLRFFDLPTVMVSVESEEAEKVTQRNKNYEVLCRNRLGNDLYVERMMQTFNGAPKNKDVQCDKIIMEDKGIMSTAWDLYDSYNAMELVSLSVKQSVVESSSKANVLPKDQDQRLPGSTTEKNSETSSLMDIENVILAKIHEDEEDHSDAILKSDKFHQDLFFMERVLMENIFQPKLAAYRQLPVLKEPEPEEPEDVLESAKHEEVEEESKKEEEEEIHAEESTIPANLERLWSFSCDLTKGLNVSSLAWNKTNPDLLAVGYGHFGFKEQKRGLACCWSIKNPMWPERIYQSPYGVTAVDFSIGAPNLLAVGYHNGTIAIYNVRSNSNVPVLDSSESPQKHLGPVWQLQWIEQDRGTTGDGKREILVSISADGRISKWVIRKGLDCYDLMRLKRTTAASNKKGGEKEKKDEALISRQAPGMCFAFHPKDTNIYLAGTEEGHIHKCSCSYNEQYLDTYRGHKGPVYKVTWNPFCHDVFLSCSADWGVIIWQQENVKPSLSFYPATSVVYDVAWSPKSSYIFAAANENRVEIWDLHISTLDPLIVNTANPGIKFTTILFAKQTDCLLVGDSDGQVSVYELRNMPTVLETGRGDIMDTLLGSKSNQSA 848 T 0.23 WD40 pdb F Eukaryota T 8j07 96 EHA,PJA,TIA,ZHA m1,s1,q1,o1 DNAI1_HUMAN AXONEMAL DYNEIN INTERMEDIATE CHAIN 1 MIPASAKAPHKQPHKQSISIGRGTRKRDEDSGTEVGEGTDEWAQSKATVRPPDQLELTDAELKEEFTRILTANNPHAPQNIVRYSFKEGTYKPIGFVNQLAVHYTQVGNLIPKDSDEGRRQHYRDELVAGSQESVKVISETGNLEEDEEPKELETEPGSQTDVPAAGAAEKVTEEELMTPKQPKERKLTNQFNFSERASQTYNNPVRDRECQTEPPPRTNFSATANQWEIYDAYVEELEKQEKTKEKEKAKTPVAKKSGKMAMRKLTSMESQTDDLIKLSQAAKIMERMVNQNTYDDIAQDFKYYDDAADEYRDQVGTLLPLWKFQNDKAKRLSVTALCWNPKYRDLFAVGYGSYDFMKQSRGMLLLYSLKNPSFPEYMFSSNSGVMCLDIHVDHPYLVAVGHYDGNVAIYNLKKPHSQPSFCSSAKSGKHSDPVWQVKWQKDDMDQNLNFFSVSSDGRIVSWTLVKRKLVHIDVIKLKVEGSTTEVPEGLQLHPVGCGTAFDFHKEIDYMFLVGTEEGKIYKCSKSYSSQFLDTYDAHNMSVDTVSWNPYHTKVFMSCSSDWTVKIWDHTIKTPMFIYDLNSAVGDVAWAPYSSTVFAAVTTDGKAHIFDLAINKYEAICNQPVAAKKNRLTHVQFNLIHPIIIVGDDRGHIISLKLSPNLRKMPKEKKGQEVQKGPAVEIAKLDKLLNLVREVKIKT 699 T 0.0027 WD40 pdb F Eukaryota T 8j07 97 AIA,FHA,QJA,UIA o2,m2,s2,q2 DNAI2_HUMAN AXONEMAL DYNEIN INTERMEDIATE CHAIN 2 MEIVYVYVKKRSEFGKQCNFSDRQAELNIDIMPNPELAEQFVERNPVDTGIQCSISMSEHEANSERFEMETRGVNHVEGGWPKDVNPLELEQTIRFRKKVEKDENYVNAIMQLGSIMEHCIKQNNAIDIYEEYFNDEEAMEVMEEDPSAKTINVFRDPQEIKRAATHLSWHPDGNRKLAVAYSCLDFQRAPVGMSSDSYIWDLENPNKPELALKPSSPLVTLEFNPKDSHVLLGGCYNGQIACWDTRKGSLVAELSTIESSHRDPVYGTIWLQSKTGTECFSASTDGQVMWWDIRKMSEPTEVVILDITKKEQLENALGAISLEFESTLPTKFMVGTEQGIVISCNRKAKTSAEKIVCTFPGHHGPIYALQRNPFYPKNFLTVGDWTARIWSEDSRESSIMWTKYHMAYLTDAAWSPVRPTVFFTTRMDGTLDIWDFMFEQCDPTLSLKVCDEALFCLRVQDNGCLIACGSQLGTTTLLEVSPGLSTLQRNEKNVASSMFERETRREKILEARHREMRLKEKGKAEGRDEEQTDEELAVDLEALVSKAEEEFFDIIFAELKKKEADAIKLTPVPQQPSPEEDQVVEEGEEAAGEEGDEEVEEDLA 605 T 0.088 DUF4795 pdb F Eukaryota T 8j07 99 AKA,EIA,JHA,RJA,YIA u6,o6,m6,s6,q6 ODAD1_HUMAN COILED-COIL DOMAIN-CONTAINING PROTEIN 114 MEGERRAYSKEVHQRINKQLEEIRRLEEVRGDLQVQISAAQNQVKRLRDSQRLENMDRLLKGRAQVQAEIEELQEQTRALDKQIQEWETRIFTHSKNVRSPGFILDQKVKIRRRIRILENQLDRVTCHFDNQLVRNAALREELDLLRIDRNRYLNVDRKLKKEIHHLHHLVSTLILSSTSAYAVREEAKAKMGLLRERAEKEEAQSEMEAQVLQRQILHLEQLHHFLKLKNNDRQPDPDVLEKREKQAGEVAEGVWKTSQERLVLCYEDALNKLSQLMGESDPDLLVQKYLEIEERNFAEFNFINEQNLELEHVQEEIKEMQEALVSARASKDDQHLLQEQQQKVLQQRMDKVHSEAERLEARFQDVRGQLEKLKADIQLLFTKAHCDSSMIDDLLGVKTSMGDRDMGLFLSLIEKRLVELLTVQAFLHAQSFTSLADAALLVLGQSLEDLPKKMAPLQPPDTLEDPPGFEASDDYPMSREELLSQVEKLVELQEQAEAQRQKDLAAAAAKLDGTLSVDLASTQRAGSSTVLVPTRHPHAIPGSILSHKTSRDRGSLGHVTFGGLSSSTGHLPSHITHGDPNTGHVTFGSTSASSGGHVTFRPVSASSYLGSTGYVGSSRGGENTEGGVESGGTASDSSGGLGSSRDHVSSTGPASSTGPGSSTSKDSRG 670 T 2.4E-05 CCDC73 pdbhh F Eukaryota T 8j07 101 AJA,CKA,FIA,LHA,TJA q8,u8,o8,m8,s8 ODAD3_HUMAN COILED-COIL DOMAIN-CONTAINING PROTEIN 151 MTSPLCRAASANALPPQDQASTPSSRVKGREASGKPSHLRGKGTAQAWTPGRSKGGSFHRGAGKPSVHSQVAELHKKIQLLEGDRKAFFESSQWNIKKNQETISQLRKETKALELKLLDLLKGDEKVVQAVIREWKWEKPYLKNRTGQALEHLDHRLREKVKQQNALRHQVVLRQRRLEELQLQHSLRLLEMAEAQNRHTEVAKTMRNLENRLEKAQMKAQEAEHITSVYLQLKAYLMDESLNLENRLDSMEAEVVRTKHELEALHVVNQEALNARDIAKNQLQYLEETLVRERKKRERYISECKKRAEEKKLENERMERKTHREHLLLQSDDTIQDSLHAKEEELRQRWSMYQMEVIFGKVKDATGTDETHSLVRRFLAQGDTFAQLETLKSENEQTLVRLKQEKQQLQRELEDLKYSGEATLVSQQKLQAEAQERLKKEERRHAEAKDQLERALRAMQVAKDSLEHLASKLIHITVEDGRFAGKELDPQADNYVPNLLGLVEEKLLKLQAQLQGHDVQEMLCHIANREFLASLEGRLPEYNTRIALPLATSKDKFFDEESEEEDNEVVTRASLKIRSQKLIESHKKHRRSRRS 595 T 0.0023 CCDC73 pdbhh F Eukaryota T 8j07 106 IKA w LRC34_HUMAN Leucine-rich repeat-containing protein 34 MAAQPPRPVGERSMGSSREAARAPARSPAWASTQASTPGAALAVQRESPESGLQKHYSNLCMEKSQKINPFILHILQEVDEEIKKGLAAGITLNIAGNNRLVPVERVTGEDFWILSKILKNCLYINGLDVGYNLLCDVGAYYAAKLLQKQLNLIYLNLMFNDIGPEGGELIAKVLHKNRTLKYLRMTGNKIENKGGMFFAAMLQINSSLEKLDLGDCDLGMQSVIAFATVLTQNQAIKAINLNRPILYSEQEESTVHVGRMLKENHCLVALHMCKHDIKNSGIQQLCDALYLNSSLRYLDVSCNKITHDGMVYLADVLKSNTTLEVIDLSFNRIENAGANYLSETLTSHNRSLKALSVVSNNIEGEGLVALSQSMKTNLTFSHIYIWGNKFDEATCIAYSDLIQMGCLKPDNTDVEPFVVDGRVYLAEVSNGLKKHYYWTSTYGESYDHSSNAGFALVPVGQQP 464 T 7.1E-08 FBXL18_C pdbhh F Eukaryota T 8j62 2 B,D,H,J C,E,G,I Viral infectivity factor MGHHHHHHSQDPMENRWQVMIVWQVDRMRINTWKRLVKHHMYISRKAKDWFYRHHYESTNPKISSEVHIPLGDAKLVITTYWGLHTGERDWHLGQGVSIEWRKKRYSTQVDPDLADQLIHLHYFDEASEGSQIKPPLPSVRKLTEDRWNK 150 T 0.21 Vif pdb F T 8j8p 2 B A A0A0L8RF82_SACEU CDC73-like protein SGSAGNGLVPSDPVLAETMKNERVVQDHNSALRGARPINFGYLIKDAELKLVQSIKGSLRGSKLPPGHKGAHGRVSKTNGS 81 T 5.6 CDC73_N unppercent F Eukaryota T 8j8p 4 D R A0A0L8RIY1_SACEU RTF1-like protein SKSDPFSRLKTRTKVYYQEIQKEENAKAKEMAQQEKLQEDRETKERREKELLLAQFRRLGGLERMIGELDIKFDFKF 77 T 0.095 Tom37_C pdbpercent F Eukaryota T 8j8q 3 C R A0A0L8RIY1_SACEU RTF1-like protein SKSDPFSRLKTRTKVYYQEIQKEENAKAKEMAQQEKLQEDRETKERREKELLLAQFRRLGGLERMIGELDIKFDFKF 77 T 0.095 Tom37_C pdbpercent F Eukaryota T 8j8q 4 D A A0A0L8RF82_SACEU CDC73-like protein SGSAGNGLVPSDPVLAETMKNERVVQDHNSALRGARPINFGYLIKDAELKLVQSIKGSLRGSKLPPGHKGAHGRVSKTNGS 81 T 5.6 CDC73_N unppercent F Eukaryota T 8jaj 1 A,B,C,D,E,F,G,H,I,J,K,L a,b,c,d,e,f,g,h,i,j,k,l Q71TB2_BPP1 THE TAIL SHEATH PROTEIN MSQYSIQQSLGNASGVAVSPINADATLSTGVALNSSLWAGIGVFARGKPFTVLAVTESNYEDVLGEPLKPSSGSQFEPIRHVYEAIQQTSGYVVRAVPDDAKFPIIMFDESGEPAYSALPYGSEIELDSGEAFAIYVDDGDPCISPTRELTIETATADSAGNERFLLKLTQTTSLGVVTTLETHTVSLAEEAKDDMGRLCYLPTALEARSKYLRAVVNEELISTAKVTNKKSLAFTGGTNGDQSKISTAAYLRAVKVLNNAPYMYTAVLGLGCYDNAAITALGKICADRLIDGFFDVKPTLTYAEALPAVEDTGLLGTDYVSCSVYHYPFSCKDKWTQSRVVFGLSGVAYAAKARGVKKNSDVGGWHYSPAGEERAVIARASIQPLYPEDTPDEEAMVKGRLNKVSVGTSGQMIIDDALTCCTQDNYLHFQHVPSLMNAISRFFVQLARQMKHSPDGITAAGLTKGMTKLLDRFVASGALVAPRDPDADGTEPYVLKVTQAEFDKWEVVWACCPTGVARRIQGVPLLIK 529 T 0.034 Phage_sheath_1 pdbpercent T Viruses T 8jan 1 A,B,C,D,E,F,G,H,I,J,K,L a,b,c,d,e,f,g,h,i,j,k,l Q71TM5_BPP1 TAIL TUBE PROTEIN MGHNNTKGNRKFIKGRYTANAAKGERLVSSEFLLTFAGHEDISVLVRTSQIPEMTREDVEDYGPNGVKFNQHGPIRNSGEIQVQCVETIEGDILQFIKDRIAAKDYVDITMAATPESKSSGVNAVTKAATTIEMLDCKIYSDAIDFSTEDVTAAVRPSLRIVYNWIEWD 169 T 0.0002 Phage_T4_gp19 pdbhh T Viruses T 8jan 2 M,N,O,P,Q,R,S,T,U,V,W,X m,n,o,p,q,r,s,t,u,v,w,x Q71TB2_BPP1 TAIL SHEATH PROTEIN MSQYSIQQSLGNASGVAVSPINADATLSTGVALNSSLWAGIGVFARGKPFTVLAVTESNYEDVLGEPLKPSSGSQFEPIRHVYEAIQQTSGYVVRAVPDDAKFPIIMFDESGEPAYSALPYGSEIELDSGEAFAIYVDDGDPCISPTRELTIETATADSAGNERFLLKLTQTTSLGVVTTLETHTVSLAEEAKDDMGRLCYLPTALEARSKYLRAVVNEELISTAKVTNKKSLAFTGGTNGDQSKISTAAYLRAVKVLNNAPYMYTAVLGLGCYDNAAITALGKICADRLIDGFFDVKPTLTYAEALPAVEDTGLLGTDYVSCSVYHYPFSCKDKWTQSRVVFGLSGVAYAAKARGVKKNSDVGGWHYSPAGEERAVIARASIQPLYPEDTPDEEAMVKGRLNKVSVGTSGQMIIDDALTCCTQDNYLHFQHVPSLMNAISRFFVQLARQMKHSPDGITAAGLTKGMTKLLDRFVASGALVAPRDPDADGTEPYVLKVTQAEFDKWEVVWACCPTGVARRIQGVPLLIK 529 T 0.034 Phage_sheath_1 pdbpercent T Viruses T 8jan 3 AA,BA,CA,DA,Y,Z A,B,C,D,y,z Q71T90_BPP1 TAIL TERMINATOR PROTEIN MILNNQEWLLAIFKKKGLTPTGKLEFATIDGIDSALAQALNEAFDSQVVSFNDRINQSFREFLKRTPRDRITLGTFSDVKEWLSSFEADRAGRKDTASAGPVNKLAMPLVNLSRSPAFSIYEGELCRDNYDEGHVTNENDEIEALVSTIPFSLEYSLWIASDEKESLGMVTTALAFWLRMYASLGQASFTHIANVGGYEIPVTCYIEGQKSIAFQDLTTGTADNRLFAVGLNLTVVAELPILAYMQQTTGTITVKAKILEE 261 T 0.87 T4-gp15_tss pdbhh T Viruses T 8jjs 2 B I MAA-ILE-SAR-SAR-7T2-SAR-IAE-LEU-MEA-MLE-7TK XIXXXXXLXXX 11 T 0.49 CT47 pdbhh F F 8jpa 1 A,B A,B De novo design cavitated protein MGSSHHHHHHSSGLVPRGSHMASMTGGQQMGRGSEDVIKQALKRVQQYIQQAPNGYRDVIQQILQTVLKILKLMGMPEVEAVLIVAYVAEMLVLAAKYGYIDELLKLAKEALEADDVDKMIEIFLKMLKIMFLALALDPEGLKKLKELKKNGSEEVRKLIEEVIKQLKQQRQQQALEHHHHHH 183 T 0.059 DUF3103 pdb F T 8jtk 2 B A Q2NK94_AYWBP Sequence-variable mosaic (SVM) signal sequence domain-containing protein GPAPNEEFVGDMRIVNVNLSNIDILKKHETFKKYFDFTLTGPRYNGNIAEFAMIWKIKNPPLNLLGVFFDDGTRDDEDDKYILEELKQIGNGAKNMYIFWQYEQK 105 T 8.7 Sigma_reg_N unppercent F Bacteria T 8jtl 2 C,D D,C Q6YQ57_ONYPE Sequence-variable mosaic (SVM) signal sequence domain-containing protein GPAPHEERVGDMRIVNITFSDINSIKNFQPFSQYFDFTLTGPRYNGNIAQFAMIWKIKNPPHNLLGVFFDNNTRDDEDDKYTLEELKQMGNGAKNMYIFWQYEQK 105 T 4.5 DUF5454 pdbhh F Bacteria T 8ju8 1 A A de novo designed protein MWGKVVVIGSGEYGKRAAQRVADLLDPRIDVYLIFDAKSTDEIRKMIKDHGADAVIVIGAPLGTAFAIAKAAAELGAAVIVIIPRRPGVREAARRFGEEARKYGGRVEVLLGATVEEAVAFARRVVQQFFALEHHHHHH 139 T 0.0022 GFO_IDH_MocA pdb F T 8kme 4 D 4 SEL2770 XXXKLPX 7 T 9.9 TagF_N pdbhh F F 8lpr 2 B P METHOXYSUCCINYL-ALA-ALA-PRO-PHENYLALANINE BORONIC ACID INHIBITOR XAAPX 5 T 170 DUF3054 pdbhh F F 8oep 2 B,D B,D VE6_HPV18 Protein E6 RQERLQRRRETQV 13 T 0.19 Mu-like_Com unphh T Viruses T 8ofg 2 C C GLU-ARG-LEU-LEU-GLY-GLY-TRP-LYS ERLLGGWK 8 T 0.66 hSac2 pdbhh F T 8og5 1 A A DCAF1_HUMAN HIV-1 VPR-BINDING PROTEIN,VPRBP,SERINE/THREONINE-PROTEIN KINASE VPRBP,VPR-INTERACTING PROTEIN GGGREPKQRRQAPINFTSRLNRRASFPKYGGVDGGCFDRHLIFSRFRPISVFREANEDESGFTCCAFSARERFLMLGTCTGQLKLYNVFSGQEEASYNCHNSAITHLEPSRDGSLLLTSATWSQPLSALWGMKSVFDMKHSFTEDHYVEFSKHSQDRVIGTKGDIAHIYDIQTGNKLLTLFNPDLANNYKRNCATFNPTDDLVLNDGVLWDVRSALAIHKFDKFNMNISGVFHPNGLEVIINTEIWDLRTFHLLHTVPALDQCRVVFNHTGTVMYGAMLQADDEDDLMEERMKSPFGSSFRTFNATDYKPIATIDVKRNIFDLCTDTKDCYLAVIENQGSMDALNMDTVCRLYEVGRQRLAEDEDEE 367 T 0.0019 ANAPC4_WD40 pdbpercent F Eukaryota T 8og6 1 A A DCAF1_HUMAN HIV-1 VPR-BINDING PROTEIN,VPRBP,SERINE/THREONINE-PROTEIN KINASE VPRBP,VPR-INTERACTING PROTEIN GGGREPKQRRQAPINFTSRLNRRASFPKYGGVDGGCFDRHLIFSRFRPISVFREANEDESGFTCCAFSARERFLMLGTCTGQLKLYNVFSGQEEASYNCHNSAITHLEPSRDGSLLLTSATWSQPLSALWGMKSVFDMKHSFTEDHYVEFSKHSQDRVIGTKGDIAHIYDIQTGNKLLTLFNPDLANNYKRNCATFNPTDDLVLNDGVLWDVRSALAIHKFDKFNMNISGVFHPNGLEVIINTEIWDLRTFHLLHTVPALDQCRVVFNHTGTVMYGAMLQADDEDDLMEERMKSPFGSSFRTFNATDYKPIATIDVKRNIFDLCTDTKDCYLAVIENQGSMDALNMDTVCRLYEVGRQRLAEDEDEE 367 T 0.0019 ANAPC4_WD40 pdbpercent F Eukaryota T 8og7 1 A A DCAF1_HUMAN HIV-1 VPR-BINDING PROTEIN,VPRBP,SERINE/THREONINE-PROTEIN KINASE VPRBP,VPR-INTERACTING PROTEIN GGGREPKQRRQAPINFTSRLNRRASFPKYGGVDGGCFDRHLIFSRFRPISVFREANEDESGFTCCAFSARERFLMLGTCTGQLKLYNVFSGQEEASYNCHNSAITHLEPSRDGSLLLTSATWSQPLSALWGMKSVFDMKHSFTEDHYVEFSKHSQDRVIGTKGDIAHIYDIQTGNKLLTLFNPDLANNYKRNCATFNPTDDLVLNDGVLWDVRSALAIHKFDKFNMNISGVFHPNGLEVIINTEIWDLRTFHLLHTVPALDQCRVVFNHTGTVMYGAMLQADDEDDLMEERMKSPFGSSFRTFNATDYKPIATIDVKRNIFDLCTDTKDCYLAVIENQGSMDALNMDTVCRLYEVGRQRLAEDEDEE 367 T 0.0019 ANAPC4_WD40 pdbpercent F Eukaryota T 8og8 1 A A DCAF1_HUMAN HIV-1 VPR-BINDING PROTEIN,VPRBP,SERINE/THREONINE-PROTEIN KINASE VPRBP,VPR-INTERACTING PROTEIN GGGREPKQRRQAPINFTSRLNRRASFPKYGGVDGGCFDRHLIFSRFRPISVFREANEDESGFTCCAFSARERFLMLGTCTGQLKLYNVFSGQEEASYNCHNSAITHLEPSRDGSLLLTSATWSQPLSALWGMKSVFDMKHSFTEDHYVEFSKHSQDRVIGTKGDIAHIYDIQTGNKLLTLFNPDLANNYKRNCATFNPTDDLVLNDGVLWDVRSALAIHKFDKFNMNISGVFHPNGLEVIINTEIWDLRTFHLLHTVPALDQCRVVFNHTGTVMYGAMLQADDEDDLMEERMKSPFGSSFRTFNATDYKPIATIDVKRNIFDLCTDTKDCYLAVIENQGSMDALNMDTVCRLYEVGRQRLAEDEDEE 367 T 0.0019 ANAPC4_WD40 pdbpercent F Eukaryota T 8og9 1 A A DCAF1_HUMAN HIV-1 VPR-BINDING PROTEIN,VPRBP,SERINE/THREONINE-PROTEIN KINASE VPRBP,VPR-INTERACTING PROTEIN GGGREPKQRRQAPINFTSRLNRRASFPKYGGVDGGCFDRHLIFSRFRPISVFREANEDESGFTCCAFSARERFLMLGTCTGQLKLYNVFSGQEEASYNCHNSAITHLEPSRDGSLLLTSATWSQPLSALWGMKSVFDMKHSFTEDHYVEFSKHSQDRVIGTKGDIAHIYDIQTGNKLLTLFNPDLANNYKRNCATFNPTDDLVLNDGVLWDVRSALAIHKFDKFNMNISGVFHPNGLEVIINTEIWDLRTFHLLHTVPALDQCRVVFNHTGTVMYGAMLQADDEDDLMEERMKSPFGSSFRTFNATDYKPIATIDVKRNIFDLCTDTKDCYLAVIENQGSMDALNMDTVCRLYEVGRQRLAEDEDEE 367 T 0.0019 ANAPC4_WD40 pdbpercent F Eukaryota T 8oga 1 A P DCAF1_HUMAN HIV-1 VPR-BINDING PROTEIN,VPRBP,SERINE/THREONINE-PROTEIN KINASE VPRBP,VPR-INTERACTING PROTEIN GGGRFRPISVFREANEDESGFTCCAFSARERFLMLGTCTGQLKLYNVFSGQEEASYNCHNSAITHLEPSRDGSLLLTSATWSQPLSALWGMKSVFDMKHSFTEDHYVEFSKHSQDRVIGTKGDIAHIYDIQTGNKLLTLFNPDLANNYKRNCATFNPTDDLVLNDGVLWDVRSALAIHKFDKFNMNISGVFHPNGLEVIINTEIWDLRTFHLLHTVPALDQCRVVFNHTGTVMYGAMLQADDEDDLMEERMKSPFGSSFRTFNATDYKPIATIDVKRNIFDLCTDTKDCYLAVIENQGSMDALNMDTVCRLYEVGRQR 318 T 0.069 ANAPC4_WD40 pdb F Eukaryota T 8ogb 1 A A DCAF1_HUMAN HIV-1 VPR-BINDING PROTEIN,VPRBP,SERINE/THREONINE-PROTEIN KINASE VPRBP,VPR-INTERACTING PROTEIN GGGREPKQRRQAPINFTSRLNRRASFPKYGGVDGGCFDRHLIFSRFRPISVFREANEDESGFTCCAFSARERFLMLGTCTGQLKLYNVFSGQEEASYNCHNSAITHLEPSRDGSLLLTSATWSQPLSALWGMKSVFDMKHSFTEDHYVEFSKHSQDRVIGTKGDIAHIYDIQTGNKLLTLFNPDLANNYKRNCATFNPTDDLVLNDGVLWDVRSALAIHKFDKFNMNISGVFHPNGLEVIINTEIWDLRTFHLLHTVPALDQCRVVFNHTGTVMYGAMLQADDEDDLMEERMKSPFGSSFRTFNATDYKPIATIDVKRNIFDLCTDTKDCYLAVIENQGSMDALNMDTVCRLYEVGRQRLAEDEDEE 367 T 0.0019 ANAPC4_WD40 pdbpercent F Eukaryota T 8ogc 1 A A DCAF1_HUMAN HIV-1 VPR-BINDING PROTEIN,VPRBP,SERINE/THREONINE-PROTEIN KINASE VPRBP,VPR-INTERACTING PROTEIN GGGREPKQRRQAPINFTSRLNRRASFPKYGGVDGGCFDRHLIFSRFRPISVFREANEDESGFTCCAFSARERFLMLGTCTGQLKLYNVFSGQEEASYNCHNSAITHLEPSRDGSLLLTSATWSQPLSALWGMKSVFDMKHSFTEDHYVEFSKHSQDRVIGTKGDIAHIYDIQTGNKLLTLFNPDLANNYKRNCATFNPTDDLVLNDGVLWDVRSALAIHKFDKFNMNISGVFHPNGLEVIINTEIWDLRTFHLLHTVPALDQCRVVFNHTGTVMYGAMLQADDEDDLMEERMKSPFGSSFRTFNATDYKPIATIDVKRNIFDLCTDTKDCYLAVIENQGSMDALNMDTVCRLYEVGRQRLAEDEDEE 367 T 0.0019 ANAPC4_WD40 pdbpercent F Eukaryota T 8oij 1 A,B A,B SMG_DROME Protein Smaug GGSGGSGGSGGSLFCEQVTTVTNLFEKWNDCERTVVMYALLKRLRYPSLKFLQYSIDSNLTQNLGTSQTNLSSVVIDINANNPVYLQNLLNAYKTARKEDILHEVLNMLPLLKPGNEEAKLIYLTLIPVAVKDTMQQIVPTELVQQIFSYLLIHPAITSEDRRSLNIWLRHLEDHIQ 177 T 0.067 DUF6179 pdbpssm F Eukaryota T 8oij 2 C,D C,D SMO_DROME DSMO,SMOH,SMOOTH SVPSYGEDELQQAMRLLNAASRQRTEAANEDFGGT 35 T 1.1 DUF1635 pdbhh F Eukaryota T 8oik 1 A,B,C A,B,C SMAG1_HUMAN SMAUG 1,HSMAUG1,STERILE ALPHA MOTIF DOMAIN-CONTAINING PROTEIN 4A,SAM DOMAIN-CONTAINING PROTEIN 4A MKHHHHHHPMSDYDIPTTENLYFQSMFRDQVGVLAGWFKGWNECEQTVALLSLLKRVSQTQARFLQLCLEHSLADCAELHVLEREANSPGIINQWQQESKDKVISLLLTHLPLLKPGNLDAKVEYMKLLPKILAHSIEHNQHIEESRQLLSYALIHPATSLEDRSALAMWLNHLEDRTST 180 T 0.22 MerR-DNA-bind pdbpercent F Eukaryota T 8oin 33 LA Bc I3LN63_PIG mL54 MAARRLFGAARSWAAWRAWELSDAAVSGRLHVRNYAKRPVIKGGKGGKGAVVGEALKDPEVCTDPFRLTTHAMGVNIYKEGQDVVLKPDSEYPEWLFEMNVGPPKKLEELDPETREYWRLLRKHNIWRHNRLSKNRKF 138 F F Eukaryota T 8oiq 33 LA Bc I3LN63_PIG mL54 MAARRLFGAARSWAAWRAWELSDAAVSGRLHVRNYAKRPVIKGGKGGKGAVVGEALKDPEVCTDPFRLTTHAMGVNIYKEGQDVVLKPDSEYPEWLFEMNVGPPKKLEELDPETREYWRLLRKHNIWRHNRLSKNRKF 138 F F Eukaryota T 8oir 33 LA Bc RM54_HUMAN MRP-L54,MITOCHONDRIAL LARGE RIBOSOMAL SUBUNIT PROTEIN ML54 MATKRLFGATRTWAGWGAWELLNPATSGRLLARDYAKKPVMKGAKSGKGAVTSEALKDPDVCTDPVQLTTYAMGVNIYKEGQDVPLKPDAEYPEWLFEMNLGPPKTLEELDPESREYWRRLRKQNIWRHNRLSKNKRL 138 F F Eukaryota T 8oit 33 LA Bc RM54_HUMAN MRP-L54,MITOCHONDRIAL LARGE RIBOSOMAL SUBUNIT PROTEIN ML54 MATKRLFGATRTWAGWGAWELLNPATSGRLLARDYAKKPVMKGAKSGKGAVTSEALKDPDVCTDPVQLTTYAMGVNIYKEGQDVPLKPDAEYPEWLFEMNLGPPKTLEELDPESREYWRRLRKQNIWRHNRLSKNKRL 138 F F Eukaryota T 8onu 2 B B THAN_PODMA Thanatin-like derivative XPITYXNRXTXKCXRY 16 T 2.6 YihI unphh F Eukaryota T 8oo5 1 A P DCAF1_HUMAN HIV-1 VPR-BINDING PROTEIN,VPRBP,SERINE/THREONINE-PROTEIN KINASE VPRBP,VPR-INTERACTING PROTEIN GGGRFRPISVFREANEDESGFTCCAFSARERFLMLGTCTGQLKLYNVFSGQEEASYNCHNSAITHLEPSRDGSLLLTSATWSQPLSALWGMKSVFDMKHSFTEDHYVEFSKHSQDRVIGTKGDIAHIYDIQTGNKLLTLFNPDLANNYKRNCATFNPTDDLVLNDGVLWDVRSALAIHKFDKFNMNISGVFHPNGLEVIINTEIWDLRTFHLLHTVPALDQCRVVFNHTGTVMYGAMLQADDEDDLMEERMKSPFGSSFRTFNATDYKPIATIDVKRNIFDLCTDTKDCYLAVIENQGSMDALNMDTVCRLYEVGRQR 318 T 0.069 ANAPC4_WD40 pdb F Eukaryota T 8ood 1 A A DCAF1_HUMAN HIV-1 VPR-BINDING PROTEIN,VPRBP,SERINE/THREONINE-PROTEIN KINASE VPRBP,VPR-INTERACTING PROTEIN GGGREPKQRRQAPINFTSRLNRRASFPKYGGVDGGCFDRHLIFSRFRPISVFREANEDESGFTCCAFSARERFLMLGTCTGQLKLYNVFSGQEEASYNCHNSAITHLEPSRDGSLLLTSATWSQPLSALWGMKSVFDMKHSFTEDHYVEFSKHSQDRVIGTKGDIAHIYDIQTGNKLLTLFNPDLANNYKRNCATFNPTDDLVLNDGVLWDVRSALAIHKFDKFNMNISGVFHPNGLEVIINTEIWDLRTFHLLHTVPALDQCRVVFNHTGTVMYGAMLQADDEDDLMEERMKSPFGSSFRTFNATDYKPIATIDVKRNIFDLCTDTKDCYLAVIENQGSMDALNMDTVCRLYEVGRQRLAEDEDEE 367 T 0.0019 ANAPC4_WD40 pdbpercent F Eukaryota T 8opz 1 A A Tailspike depolymerase (APK16_gp47) from Acinetobacter phage APK16 GSEVAAAQTQYYLKYFNPDIVYPKNARIMLDTGVVVMSMVDGNSTNPNSNMTGWVRVNSASLIFDQSGKTQQEINDSQKQKLPSLKDYGAVSGQDSTAAIKAAIAAEDFLYFGDIGDNFIVSEQIDLRDGCYYVSNGAKFTAALGIEGSQPYTPKSIINASGKVGINISGLVRTHIDHNIFSALGDANSKPTISGFLADAAIDCDFGKWESVGSVNYYYTPNFKEYGIVDLRNSIDCYIEADVNGRWTEETTASTPSTVGIMGSNNKGCYLKGRAKNCYWSGILWEGEDCVVDGPHVRNTKGSNLNLAGKNTAAYNVDLYGSEQGNISIGEGATQAENCNVVGGVAGNAKFANCHLHSVTKNCHVKLFHYGWGQTASAVSDATSGIRCQGTGNTIDSEFDVTYGGLTVKGDAVNVYCSTLTNPEATNIKVNVVGIGARVQIRAPYTIVNAKITGATGDAVVLGERCKGSIVEEVTAIKCGRPLQYAPKTTDANDYAGVIIGRINDVECTNRSVFYGQKIVHSQRKIERIYAQETAFVLDQVLEAIEVYTNDSGVTGANKLASAIRHISADSFGTSYGLDLVASTISKNNLANSKTKVRAGHIEVEPAVAGAASHIVLYAANGTKWKLEPTGSASAANWVAV 641 T 0.39 YmcE_antitoxin pdbpssm F T 8oq0 1 A A Tailspike protein GSEEAAQVARSADKVIDASGLTQQDINDRLAITYPTAVGLVGKPNLKDADVIYVQCYSNIFDGGDGYYRVSADTTTVADGAYVIRINPNLIATMLNTTGSVDVARFGAVMNADVSPFIEKAFKYFRDVCLTKPYKLNTVVGIPDQNNYSKNVYYLRGLGDPEITVDCPSAVFTSASAKLDPTSTVNKFTAKIDVSNISFIGTTVANSVVFNGDRLYNINVHHNNFKGNITIFKAYVKREVGRQYTQSVSINHNHLTGVYRVIESDKSYNLDFSYNMCEACIGGIYVGVDAPWDPNNISLTIHRNLWEGSGMLLKTNGGIIGGTISANYFENNTFNDAGIEKCLISINRTGTGAGYASGLVISGNTFSGNGAIPDFVDVRYVNQSTESSSTSKTANVKPVVFIGNWSNSYLMTNFAGALLINNRCSNRNTMFNAYSPQEGRVTFASGYLDKPLSSMLSGNLLNLITLDTRPCFTAGYINTNFKTTFDVNVLFKTSGGINTASCSFKLDVFVYTPLGAGTPPKSNLKAVMSAFMQSDTNDIISTGVNETMKSVIGATPTMAVVNNGDGTYGIRLSPFTNASSPNWGAITSARIEYTYQGTLIASHTSTYSTANLLTIT 616 T 9.1E-05 Pectate_lyase_3 pdbhh F T 8ou0 2 B D A0A3S5ZPV0_BOVIN Stabilizer of axonemal microtubules 1 MAPTKGKCVCELCSCGRHHCPHLPTKIYDKTEKPCLLSEYTENYPVYHSYLPRESFKPKMDYQRACTPMEGLTTSRRDFGPHKVLPVKIHQPNPFVPSEENMDLQTTYKQDYNPYPLCRVDPFKPRDSKYPCGDKMESLPTYKADYLPWNQPRRELLRPPHHYRPASTKFDSRTTQQDDYSMKGLVNTRSCKPPAVPKLCNVPLEDLTNYKMSYVAHPLEKRFVHESEKFRPCEIPFESLTTHKESYRGLMGEPAKSLKPPARPYGLDTPFSNTTEFRDKYQAWPTPQVFSKPPSMYVPPEEKMDLLTTVQTHYTYPKGAPAESCRPALSVKKGGRFEGSTTTKEDYKQWASTRTEPAKPIPQLNLPTEPLDCLTTARAHYVPHLPMMTKSCKPVWSGPQGNIPVEGQTTYTISFTPKEMSRCLASYPEPPGYIFEEIDALGHRIYRPVSQTGSRRSSRFSVGDSENPNQQELTVSA 477 T 0.0011 STOP pdb F Eukaryota T 8owi 1 A,B A,B CENPE_HUMAN CENTROMERE PROTEIN E,CENP-E,KINESIN-7,KINESIN-RELATED PROTEIN CENPE GPSPYKEEIEDLKMKLVKIDLEKMKNAKEFEKEISATKATVEYQKEVIRLLRENLRRSQQAQDTSVISEHTDPQPSNKPLTCGGGSGIVQNTKALILKSEHIRLEKEISKLKQQNEQLIKQKNELLSNNQHLSNEVKTWKERTLKREAHK 150 T 0.0014 ZapB pdb F Eukaryota T 8p26 1 A,B,C,D,E,F,G,H,I,J A,B,C,D,E,F,G,H,I,J F4KC77_ARATH U2 small nuclear ribonucleoprotein auxiliary factor-like protein GSMASFEKFEPIFGEVVPERSDPGSGLLRRCLFHVYASDSYNLTVHVTDFISGVWTTILSVSQLDDMRDTVGIGGSWSEFVDYTVASLKSDNVKLLLGETSVSNGVKTARLVSQKAKGMPRINVPLTKMVESSASEAMANLSLELFRAFKSKQHLQGEVSFSAAATDEKDKRDATYNQLERYSRKLDVMAPSTNNRQDSPANQSAREANTKNPVKRVPAHRRTRKRGALLQDSEEEDG 238 T 0.003 PAXX unphh F Eukaryota T 8p3l 1 A,B,C,D A,D,G,J W0DP94_9GAMM THIOCYANATE DEHYDROGENASE MSYYHHHHHHDYDIPTTENLYFQGAMGKYVKVQDFYDQLGKYVLVAPGKFSGTVAATDLSTGWTMAWLAAWNYGDTCPIMHHMAAFPSPDPYKEFEFVVNTQGGKNLFIYGVPVAVEDPGEGMKIYRIKYDGTRMNLQRDAAEVSGLGLGVHVTITPEADGYAVGDGQKDICAEFDRETDMVRYAWAFDWDPNVKDLKRAWLDGGTMTIKRLKPTLPGGRYDLQGSKGNKIDWELVPGGELAIEDGKVSGDRPLHSVANDALVFDPRGKWAVASMRLPGVCVVFDRENQVPVAVLAGPKGTPSQFQLVKVDDDTWTVDIPEVISAGHQAGFSPDGQSFLFMNSLRQNNIMVWDSSNHDDPTTWEKKAVVESPDWRGAYPNTFHMVFTPDAKKIYVTMWWPSPTPNGIAVIDAVNWEVLKEVDLGPDMHTLAITYDGKFVVGTLSGYQNTASAIVVMETETDEVLGFLPSPMGHHDNVIVPRTLEDLRISRSTTT 494 T 0.0037 Cytochrom_D1 unppercent F Bacteria T 8p3m 1 A,B,C,D,E,F,G,H,I,J,K,L,M,N,O,P A,D,G,J,M,P,S,V,Y,2,5,8,x,e,h,k W0DP94_9GAMM THIOCYANATE DEHYDROGENASE MSYYHHHHHHDYDIPTTENLYFQGAMGKYVKVQDFYDQLGKYVLVAPGKFSGTVAATDLSTGWTMAWLAAWNYGDTCPIMHHMAAFPSPDPYKEFEFVVNTQGGKNLFIYGVPVTVEDPGEGMKIYRIKYDGTRMNLQRDAAEVSGLGLGVHVTITPEADGYAVGDGQKDICAEFDRETDMVRYAWAFDWDPNVKDLKRAWLDGGTMTIKRLKPTLPGGRYDLQGSAGNKIDWELVPGGELAIEDGKVSGDRPLHSVANDALVFDPRGKWAVASMRLPGVCVVFDRENQVPVAVLAGPKGTPSQFQLVKVDDDTWTVDIPEVISAGHQAGFSPDGQSFLFMNSLRQNNIMVWDSSNHDDPTTWEKKAVVESPDWRGAYPNTFHMVFTPDAKKIYVTMWWPSPTPNGIAVIDAVNWEVLKEVDLGPDMHTLAITYDGKFVVGTLSGYQNTASAIVVMETETDEVLGFLPSPMGHHDNVIVPRTLEDLRISRSTTT 494 T 0.0037 Cytochrom_D1 unppercent F Bacteria T 8p5d 24 X LM0 S7XVN9_SPRLO Transposase MYFIKPGCLIKKKFSIYTSIVISVIDNNSVVIQSYDKSDNGDVISIDREVINVSKIVPIGNIDIKNKSKKEIDGILVEENRNLKNKDDVLLMNDFERFKEQLKKEVEDMVIEEMA 115 T 0.00024 Ribosomal_L14e pdbhh F Eukaryota T 8p60 24 TC,X KM0,LM0 S7XVN9_SPRLO Transposase MYFIKPGCLIKKKFSIYTSIVISVIDNNSVVIQSYDKSDNGDVISIDREVINVSKIVPIGNIDIKNKSKKEIDGILVEENRNLKNKDDVLLMNDFERFKEQLKKEVEDMVIEEMA 115 T 0.00024 Ribosomal_L14e pdbhh F Eukaryota T 8p6j 2 C,D,E,H,I,J AAA,DDD,EEE,HHH,JJJ,III collagen II-27 Toolkit peptide (JDM238) GPPGPPGPPGVPGEAGPPGPPGPP 24 T 0.00056 Collagen pdb F F 8pch 2 B P CATH_PIG CATHEPSIN H EPQNCSAT 8 T 0.4 SCAN unp F Eukaryota T 8pe9 1 A A DDR1_HUMAN EPITHELIAL DISCOIDIN DOMAIN RECEPTOR 1,CD167 ANTIGEN-LIKE FAMILY MEMBER A,CELL ADHESION KINASE,DISCOIDIN RECEPTOR TYROSINE KINASE,HGK2,MAMMARY CARCINOMA KINASE 10,MCK-10,PROTEIN-TYROSINE KINASE 3A,PROTEIN-TYROSINE KINASE RTK-6,TRK E,TYROSINE KINASE DDR,TYROSINE-PROTEIN KINASE CAK RDGLLSYTAPVGQTMYLSEAVYLNDSTYDGHTVGGLQYGGLGQLADGVVGLDDFRKSQELRVWPGYDYVGWSNHSFSSGYVEMEFEFDRLRAFQAMQVHCNNMHTLGARLPGGVECRFRRGPAMAWEGEPMRHNLGGNLGDPRARAVSVPLGGRVARFLQCRFLFAGPWLLFSEISFISDVVN 183 T 0.011 Lamprin pdb F Eukaryota T 8pfc 1 A,C,E,G,I,K,M,O A,C,E,G,I,K,M,O Q2NK94_AYWBP Sequence-variable mosaic (SVM) signal sequence domain-containing protein GPAPNEEFVGDMRIVNVNLSNIDILKKHETFKKYFDFTLTGPRYNGNIAEFAMIWKIKNPPLNLLGVFFDDGTRDDEDDKYILEELKQIGNGAKNMYIFWQYEQK 105 T 8.7 Sigma_reg_N unppercent F Bacteria T 8pfd 1 A A Q2NK94_AYWBP Sequence-variable mosaic (SVM) signal sequence domain-containing protein GPAPNEEFVGDMRIVNVNLSNIDILKKHETFKKYFDFTLTGPRYNGNIAEFAMIWKIKNPPLNLLGVFFDDGTRDDEDDKYILEELKQIGNGAKNMYIFWQYEQK 105 T 8.7 Sigma_reg_N unppercent F Bacteria T 8pfm 2 S S Q84626_PBCV1 Paramecium bursaria chlorella virus 1 (PBCV-1) penton protein. VETTQHFVSIESSNRPDPANTTPANYSIQLPQRYRNIWSAMLVNIALPAVSPPQKYVYLDIDKLNSIDSTSPSGGVNFALAKIPLSIAGTGNVFFADTMTSSFPNVPLQNPVATMDKLNIKLKDANGNVLTIPAGNEHSFMIQLTCGDYIPRGGGSTITQNGRVLGG 167 T 1.8 DUF2433 unphh T Viruses T 8pfn 2 GC K Q84626_PBCV1 Paramecium bursaria chlorella virus 1 (PBCV-1) penton protein. VETTQHFVSIESSNRPDPANTTPANYSIQLPQRYRNIWSAMLVNIALPAVSPPQKYVYLDIDKLNSIDSTSPSGGVNFALAKIPLSIAGTGNVFFADTMTSSFPNVPLQNPVATMDKLNIKLKDANGNVLTIPAGNEHSFMIQLTCGDYIPRGGGSTITQNGRVLGG 167 T 1.8 DUF2433 unphh T Viruses T 8phq 1 A,B,BA,C,CA,CB,DA,DB,EB,J,K,KA,L,LA,LB,MA,MB,NB,S,T,TA,U,UA,UB,VA,VB,WB AA,AB,BB,AC,BC,CC,BD,CD,CE,AJ,AK,BK,AL,BL,CL,BM,CM,CN,AS,AT,BT,AU,BU,CU,BV,CV,CW Major capsid protein MELFDENYYAKAVANIIGEVKDPIMYKWFSPDQIEDVDLQMGYQKTVKWDAFLNANPTTIANEVNTISTIGFSSEVVRLNYLKLQYKFRHLKQTSEKFYTSDSYIGDINNNLLPFAQAYKLASSEIIKLINHFVLTGTVSIQKDGKNQKRLLPNMYGLLNMPEQIKEEVASGDKDKMDKIFEKIEAGLSKLELGDEFSTPMMVIVDPATSLKLVKPYAAAQGAASSCEKWEDVLIQTIKAINNREDVYIETSNLLKHKILIYPLNSELIKFKPSKYMLPTPNEQVDKDSTDVAHSYIDFVLGGLLATRKTILQVNIKQS 319 T 0.12 DUF6260 pdbhh F T 8phr 1 A,B,BA,C,CA,DA,J,K,KA,L,MA,OA,S,T,U A,B,b,C,c,d,J,K,k,L,m,o,S,T,U Major capsid protein MELFDENYYAKAVANIIGEVKDPIMYKWFSPDQIEDVDLQMGYQKTVKWDAFLNANPTTIANEVNTISTIGFSSEVVRLNYLKLQYKFRHLKQTSEKFYTSDSYIGDINNNLLPFAQAYKLASSEIIKLINHFVLTGTVSIQKDGKNQKRLLPNMYGLLNMPEQIKEEVASGDKDKMDKIFEKIEAGLSKLELGDEFSTPMMVIVDPATSLKLVKPYAAAQGAASSCEKWEDVLIQTIKAINNREDVYIETSNLLKHKILIYPLNSELIKFKPSKYMLPTPNEQVDKDSTDVAHSYIDFVLGGLLATRKTILQVNIKQS 319 T 0.12 DUF6260 pdbhh F T 8phs 1 A,AA,AB,B,BB,G,H,HA,I,IA,IB,JA,JB,KB,P,Q,QA,R,RA,RB,SA,SB,TB,Y,Z,ZA AB,BD,CD,AC,CE,AJ,AK,BK,AL,BL,CL,BM,CM,CN,AS,AT,BT,AU,BU,CU,BV,CV,CW,BB,BC,CC Major capsid protein MELFDENYYAKAVANIIGEVKDPIMYKWFSPDQIEDVDLQMGYQKTVKWDAFLNANPTTIANEVNTISTIGFSSEVVRLNYLKLQYKFRHLKQTSEKFYTSDSYIGDINNNLLPFAQAYKLASSEIIKLINHFVLTGTVSIQKDGKNQKRLLPNMYGLLNMPEQIKEEVASGDKDKMDKIFEKIEAGLSKLELGDEFSTPMMVIVDPATSLKLVKPYAAAQGAASSCEKWEDVLIQTIKAINNREDVYIETSNLLKHKILIYPLNSELIKFKPSKYMLPTPNEQVDKDSTDVAHSYIDFVLGGLLATRKTILQVNIKQS 319 T 0.12 DUF6260 pdbhh F T 8phu 1 A,AA,AB,AF,AJ,AK,B,BB,BF,BG,BK,CF,CG,CH,DC,DG,DH,DL,EC,ED,EH,EL,EM,FC,FD,FE,FL,FM,FN,G,GD,GE,GI,GM,GN,H,HA,HE,HI,HJ,HN,I,IA,IB,II,IJ,IK,JA,JB,JF,JJ,JK,KB,KF,KG,KK,LF,LG,LH,MC,MG,MH,ML,NC,ND,NH,NL,NM,OC,OD,OE,OK,OL,OM,P,PD,PE,PI,PK,PM,Q,QA,QE,QI,QJ,R,RA,RB,RH,RI,RJ,SA,SB,SF,SH,SJ,TB,TF,TG,UE,UF,UG,UK,VC,VE,VG,VK,VL,WC,WD,WK,WL,WM,XB,XC,XD,XH,XL,XM,Y,YB,YD,YH,YI,YM,Z,ZA,ZH,ZI,ZJ AB,BD,CD,GJ,KM,LM,AC,CE,GK,HK,LN,GL,HL,IL,DJ,HM,IM,MS,DK,EK,IN,MT,NT,DL,EL,FL,MU,NU,OU,AJ,EM,FM,JS,NV,OV,AK,BK,FN,JT,KT,OW,AL,BL,CL,JU,KU,LU,BM,CM,GS,KV,LV,CN,GT,HT,LW,GU,HU,IU,DS,HV,IV,NB,DT,ET,IW,NC,OC,DU,EU,FU,MB,ND,OD,AS,EV,FV,KB,MC,OE,AT,BT,FW,KC,LC,AU,BU,CU,JB,KD,LD,BV,CV,HB,JC,LE,CW,HC,IC,GB,HD,ID,MJ,EB,GC,IE,MK,NK,EC,FC,ML,NL,OL,DB,ED,FD,JJ,NM,OM,BB,DC,FE,JK,KK,ON,BC,CC,JL,KL,LL Major capsid protein MELFDENYYAKAVANIIGEVKDPIMYKWFSPDQIEDVDLQMGYQKTVKWDAFLNANPTTIANEVNTISTIGFSSEVVRLNYLKLQYKFRHLKQTSEKFYTSDSYIGDINNNLLPFAQAYKLASSEIIKLINHFVLTGTVSIQKDGKNQKRLLPNMYGLLNMPEQIKEEVASGDKDKMDKIFEKIEAGLSKLELGDEFSTPMMVIVDPATSLKLVKPYAAAQGAASSCEKWEDVLIQTIKAINNREDVYIETSNLLKHKILIYPLNSELIKFKPSKYMLPTPNEQVDKDSTDVAHSYIDFVLGGLLATRKTILQVNIKQS 319 T 0.12 DUF6260 pdbhh F T 8pkh 1 A,AA,AB,AC,B,BA,BB,BC,C,CA,CB,CC,D,DA,DB,DC,E,EA,EB,EC,F,FA,FB,G,GA,GB,H,HA,HB,I,IA,IB,J,JA,JB,K,KA,KB,L,LA,LB,M,MA,MB,N,NA,NB,O,OA,OB,P,PA,PB,Q,QA,QB,R,RA,RB,S,SA,SB,T,TA,TB,U,UA,UB,V,VA,VB,W,WA,WB,X,XA,XB,Y,YA,YB,Z,ZA,ZB AA,BA,CA,DA,AB,BB,CB,DB,AC,BC,CC,DC,AD,BD,CD,DD,AE,BE,CE,DE,AF,BF,CF,AG,BG,CG,AH,BH,CH,AI,BI,CI,AJ,BJ,CJ,AK,BK,CK,AL,BL,CL,AM,BM,CM,AN,BN,CN,AO,BO,CO,AP,BP,CP,AQ,BQ,CQ,AR,BR,CR,AS,BS,CS,AT,BT,CT,AU,BU,CU,AV,BV,CV,AW,BW,CW,AX,BX,CX,AY,BY,CY,AZ,BZ,CZ Major capsid protein MELFDENYYAKAVANIIGEVKDPIMYKWFSPDQIEDVDLQMGYQKTVKWDAFLNANPTTIANEVNTISTIGFSSEVVRLNYLKLQYKFRHLKQTSEKFYTSDSYIGDINNNLLPFAQAYKLASSEIIKLINHFVLTGTVSIQKDGKNQKRLLPNMYGLLNMPEQIKEEVASGDKDKMDKIFEKIEAGLSKLELGDEFSTPMMVIVDPATSLKLVKPYAAAQGAASSCEKWEDVLIQTIKAINNREDVYIETSNLLKHKILIYPLNSELIKFKPSKYMLPTPNEQVDKDSTDVAHSYIDFVLGGLLATRKTILQVNIKQS 319 T 0.12 DUF6260 pdbhh F T 8s9i 2 B G SSB_BPT4 SSB PROTEIN,GP32,HELIX-DESTABILIZING PROTEIN AATAAKKADKVADDLDAFNVDDF 23 T 0.34 Dehydrin unppercent T Viruses T 8s9s 9 I 10 EMC10_HUMAN HEMATOPOIETIC SIGNAL PEPTIDE-CONTAINING MEMBRANE DOMAIN-CONTAINING PROTEIN 1 MAAASAGATRLLLLLLMAVAAPSRARGSGCRAGTGARGAGAEGREGEACGTVGLLLEHSFEIDDSANFRKRGSLLWNQQDGTLSLSQRQLSEEERGRLRDVAALNGLYRVRIPRRPGALDGLEAGGYVSSFVPACSLVESHLSDQLTLHVDVAGNVVGVSVVTHPGGCRGHEVEDVDLELFNTSVQLQPPTTAPGPETAAFIERLEMEQAQKAKNPQEQKSFFAKYWMYIIPVVLFLMMSGAPDTGGQGGGGGGGGGGGSGR 262 T 0.0033 2OG-FeII_Oxy_5 pdbpssm F Eukaryota T 8sah 1 A A HD_HUMAN HUNTINGTON DISEASE PROTEIN,HD PROTEIN MVSPDKDWYVHLVKSQCWTRSDSALLEGAELVNRIPAEDMNAFMMNSEFNLSLLAPCLSLGMSEISGGQKSALFEAAREVTLARVSGTVQQLPAVHHVFQPELPAEPAAYWSKLNDLFGDAALYQSLPTLARALAQYLVVVSKLPSHLHLPPEKEKDIVKFVVATLEALSWHLIHEQIPLSLDLQAGLDCCCLALQLPGLWSVVSSTEFVTHACSLIHCVHFILEAVAVQPGEQLLSPERRTNTPKAISEEEEEVDPNTQNPKYITAACEMVAEMVESLQSVLALGHKRNSGVPAFLTPLLRNIIISLARLPLVNSYTRVPPLVWKLGWSPKPGGDFGTAFPEIPVEFLQEKEVFKEFIYRINTLGWTSRTQFEETWATLLGVLVTQPLVMEQEESPPEEDTERTQINVLAVQAITSLVLSAMTVPVAGNPAVSCLEQQPRNKPLKALDTRFGRKLSIIRGIVEQEIQAMVSKRENIATHHLYQAWDPVPSLSPATTGALISHEKLLLQINPERELGSMSYKLGQVSIHSVWLGNSITPLREEEWDEEEEEEADAPAPSSPPTSPVNSRKHRAGVDIHSCSQFLLELYSRWILPSSSARRTPAILISEVVRSLLVVSDLFTERNQFELMYVTLTELRRVHPSEDEILAQYLVPATCKAAAVLGMDKAVAEPVSRLLESTLRSSHLPSRVGALHGILYVLECDLLDDTAKQLIPVISDYLLSNLKGIAHCVNIHSQQHVLVMCATAFYLIENYPLDVGPEFSASIIQMCGVMLSGSEESTPSIIYHCALRGLERLLLSEQLSRLDAESLVKLSVDRVNVHSPHRAMAALGLMLTCMYTGKEKVSPGRTSDPNPAAPDSESVIVAMERVSVLFDRIRKGFPCEARVVARILPQFLDDFFPPQDIMNKVIGEFLSNQQPYPQFMATVVYKVFQTLHSTGQSSMVRDWVMLSLSNFTQRAPVAMATWSLSCFFVSASTSPWVAAILPHVISRMGKLEQVDVNLFCLVATDFYRHQIEEELDRRAFQSVLEVVAAPGSPYHRLLTCLRNVGGSGDYKDDDDK 1057 T 0.06 Spidroin_MaSp pdbpercent F Eukaryota T 8san 1 A,E,I A,E,I A0A1W6IM54_9HIV1 CH848.0836.10 gp120 AENLWVTVYYGVPVWKEAKTTLFCASDAKAYKKEVHNVWATHACVPTDPSPQELFLKNVTENFNMWKNDMVDQMHEDIISLWDQSLKPCVKLTPLCVTLICSNATVKNSTVEEMKNCSFNTTTEIRDKEKKEYALFYRPDIVPLNNETSNTSEYRLINCNTSACTQACPKVTFEPIPIHYCAPAGYAILKCNDETFNGTGPCSNVSTVQCTHGIRPVVSTQLLLNGSLAEKGIVIRSENLTNNAKIIIVHLHTPVEIVCTRPNNNTRKSVRIGPGQTFYATGDIIGDIRQAHCNISESKWNETLQKVGKELQKHFPNKTIKYAQSAGGDMEITTHSFNCGGEFFYCNTAKLFNGTYNGTDISTNSSTNSNPTITLQCRIKQIINMWQGVGRCMYAPPIAGNITCKSNITGLLLTRDGGTNSSGKEEIFRPAGGDMRDNWRSELYKYKVVKIEPLGVAPTRCKRR 464 T 3.5E-53 GP120 pdbpssm T Viruses T 8sar 1 A,E,I A,F,K A0A1W6IPB2_9HIV1 CH848.10.17 gp120 AENLWVTVYYGVPVWKEAKTTLFCASDARAYEKEVHNVWATHACVPTDPSPQELVLGNVTENFNMWKNDMVDQMHEDIISLWDQSLKPCVKLTPLCVTLICSNATVKNGTVEEMKNCSFNTTTEIRDKEKKEYALFYKPDIVPLSETNNTSEYRLINCNTSACTQACPKVTFEPIPIHYCAPAGYAILKCNDETFNGTGPCSNVSTVQCTHGIRPVVSTQLLLNGSLAEKEIVIRSENLTNNAKIIIVHLHTPVEIVCTRPNNNTRKSVRIGPGQTFYATGDIIGDIKQAHCNISEEKWNDTLQKVGIELQKHFPNKTIKYNQSAGGDMEITTHSFNCGGEFFYCNTSNLFNGTYNGTYISTNSSANSTSTITLQCRIKQIINMWQGVGRCMYAPPIAGNITCRSNITGLLLTRDGGTNSNETETFRPAGGDMRDNWRSELYKYKVVKIEPLGVAPTRCKRRV 463 T 2.5E-52 GP120 pdbpssm T Viruses T 8sas 1 A,E,I A,F,K A0A1W6IPB2_9HIV1 CH848.10.17 gp120 AENLWVTVYYGVPVWKEAKTTLFCASDARAYEKEVHNVWATHACVPTDPSPQELVLGNVTENFNMWKNDMVDQMHEDIISLWDQSLKPCVKLTPLCVTLICSNATVKNGTVEEMKNCSFNTTTEIRDKEKKEYALFYKPDIVPLSETNNTSEYRLINCNTSACTQACPKVTFEPIPIHYCAPAGYAILKCNDETFNGTGPCSNVSTVQCTHGIRPVVSTQLLLNGSLAEKEIVIRSENLTNNAKIIIVHLHTPVEIVCTRPNNNTRKSVRIGPGQTFYATGDIIGDIKQAHCNISEEKWNDTLQKVGIELQKHFPNKTIKYNQSAGGDMEITTHSFNCGGEFFYCNTSNLFNGTYNGTYISTNSSANSTSTITLQCRIKQIINMWQGVGRCMYAPPIAGNITCRSNITGLLLTRDGGTNSNETETFRPAGGDMRDNWRSELYKYKVVKIEPLGVAPTRCKRRV 463 T 2.5E-52 GP120 pdbpssm T Viruses T 8sat 1 A,E,I A,E,I A0A1W6IPB2_9HIV1 CH848.10.17 gp120 AENLWVTVYYGVPVWKEAKTTLFCASDARAYEKEVHNVWATHACVPTDPSPQELVLGNVTENFNMWKNDMVDQMHEDIISLWDQSLKPCVKLTPLCVTLICSNATVKNGTVEEMKNCSFNTTTEIRDKEKKEYALFYKPDIVPLSETNNTSEYRLINCNTSACTQACPKVTFEPIPIHYCAPAGYAILKCNDETFNGTGPCSNVSTVQCTHGIRPVVSTQLLLNGSLAEKEIVIRSENLTNNAKIIIVHLHTPVEIVCTRPNNNTRKSVRIGPGQTFYATGDIIGDIKQAHCNISEEKWNDTLQKVGIELQKHFPNKTIKYNQSAGGDMEITTHSFNCGGEFFYCNTSNLFNGTYNGTYISTNSSANSTSTITLQCRIKQIINMWQGVGRCMYAPPIAGNITCRSNITGLLLTRDGGTNSNETETFRPAGGDMRDNWRSELYKYKVVKIEPLGVAPTRCKRRVVGRRRRRR 471 T 2.2E-52 GP120 pdbpssm T Viruses T 8sau 1 A,E,I A,F,K A0A1W6IPB2_9HIV1 CH848.10.17 gp120 AENLWVTVYYGVPVWKEAKTTLFCASDARAYEKEVHNVWATHACVPTDPSPQELVLGNVTENFNMWKNDMVDQMHEDIISLWDQSLKPCVKLTPLCVTLICSNATVKNGTVEEMKNCSFNTTTEIRDKEKKEYALFYKPDIVPLSETNNTSEYRLINCNTSACTQACPKVTFEPIPIHYCAPAGYAILKCNDETFNGTGPCSNVSTVQCTHGIRPVVSTQLLLNGSLAEKEIVIRSENLTNNAKIIIVHLHTPVEIVCTRPNNNTRKSVRIGPGQTFYATGDIIGDIKQAHCNISEEKWNDTLQKVGIELQKHFPNKTIKYNQSAGGDMEITTHSFNCGGEFFYCNTSNLFNGTYNGTYISTNSSANSTSTITLQCRIKQIINMWQGVGRCMYAPPIAGNITCRSNITGLLLTRDGGTNSNETETFRPAGGDMRDNWRSELYKYKVVKIEPLGVAPTRCKRRV 463 T 2.5E-52 GP120 pdbpssm T Viruses T 8saw 1 A,E,I A,E,K A0A1W6IPB2_9HIV1 CH848.3.D0949.10.17chim.6R.SOSIP.664 gp120A AENLWVTVYYGVPVWKEAKTTLFCASDARAYEKEVHNVWATHACVPTDPSPQELVLGNVTENFNMWKNDMVDQMHEDIISLWDQSLKPCVKLTPLCVTLICSNATVKNGTVEEMKNCSFNTTTEIRDKEKKEYALFYKPDIVPLSETNNTSEYRLINCNTSACTQACPKVTFEPIPIHYCAPAGYAILKCNDETFNGTGPCSNVSTVQCTHGIRPVVSTQLLLNGSLAEKEIVIRSENLTNNAKIIIVHLHTPVEIVCTRPNNNTRKSVRIGPGQTFYATGDIIGDIKQAHCNISEEKWNDTLQKVGIELQKHFPNKTIKYNQSAGGDMEITTHSFNCGGEFFYCNTSNLFNGTYNGTYISTNSSANSTSTITLQCRIKQIINMWQGVGRCMYAPPIAGNITCRSNITGLLLTRDGGTNSNETETFRPAGGDMRDNWRSELYKYKVVKIEPLGVAPTRCKRRVVGRRRRRR 471 T 2.2E-52 GP120 pdbpssm T Viruses T 8sax 1 A,E,I A,E,I A0A1W6IPB2_9HIV1 CH848.10.17.SOSIP gp120 AENLWVTVYYGVPVWKEAKTTLFCASDARAYEKEVHNVWATHACVPTDPSPQELVLGNVTENFNMWKNDMVDQMHEDIISLWDQSLKPCVKLTPLCVTLICSDATVKTGTVEEMKNCSFNTTTEIRDKEKKEYALFYKPDIVPLSETNNTSEYRLINCNTSACTQACPKVTFEPIPIHYCAPAGYAILKCNDETFNGTGPCSNVSTVQCTHGIRPVVSTQLLLNGSLAEKEIVIRSENLTNNAKIIIVHLHTPVEIVCTRPNNNTRKSVRIGPGQTFYATGDIIGDIKQAHCNISEEKWNDTLQKVGIELQKHFPNKTIKYNQSAGGDMEITTHSFNCGGEFFYCNTSNLFNGTYNGTYISTNSSANSTSTITLQCRIKQIINMWQGVGRCMYAPPIAGNITCRSNITGLLLTRDGGTNSNETETFRPAGGDMRDNWRSELYKYKVVKIEPLGVAPTRCKRRVVGRRRRRR 471 T 3.3999999999999995E-53 GP120 pdbpssm T Viruses T 8say 1 A,E,I A,F,K A0A1W6IPB2_9HIV1 CH848.10.17 gp120 AENLWVTVYYGVPVWKEAKTTLFCASDARAYEKEVHNVWATHACVPTDPSPQELVLGNVTENFNMWKNDMVDQMHEDIISLWDQSLKPCVKLTPLCVTLICSNATVKNGTVEEMKNCSFNTTTEIRDKEKKEYALFYKPDIVPLSETNNTSEYRLINCNTSACTQACPKVTFEPIPIHYCAPAGYAILKCNDETFNGTGPCSNVSTVQCTHGIRPVVSTQLLLNGSLAEKEIVIRSENLTNNAKIIIVHLHTPVEIVCTRPNNNTRKSVRIGPGQTFYATGDIIGDIKQAHCNISEEKWNDTLQKVGIELQKHFPNKTIKYNQSAGGDMEITTHSFNCGGEFFYCNTSNLFNGTYNGTYISTNSSANSTSTITLQCRIKQIINMWQGVGRCMYAPPIAGNITCRSNITGLLLTRDGGTNSNETETFRPAGGDMRDNWRSELYKYKVVKIEPLGVAPTRCKRRV 463 T 2.5E-52 GP120 pdbpssm T Viruses T 8saz 1 A,E,I A,E,K A0A1W6IPB2_9HIV1 ENV POLYPROTEIN AENLWVTVYYGVPVWKEAKTTLFCASDARAYEKEVHNVWATHACVPTDPSPQELVLGNVTENFNMWKNDMVDQMHEDIISLWDQSLKPCVKLTPLCVTLICSNATVKNGTVEEMKNCSFNTTTEIRDKEKKEYALFYKPDIVPLSETNNTSEYRLINCNTSACTQACPKVTFEPIPIHYCAPAGYAILKCNDETFNGTGPCSNVSTVQCTHGIRPVVSTQLLLNGSLAEKEIVIRSENLTNNAKIIIVHLHTPVEIVCTRPNNNTRKSVRIGPGQTFYATGDIIGDIKQAHCNISEEKWNDTLQKVGIELQKHFPNKTIKYNQSAGGDMEITTHSFNCGGEFFYCNTSNLFNGTYNGTYISTNSSANSTSTITLQCRIKQIINMWQGVGRCMYAPPIAGNITCRSNITGLLLTRDGGTNSNETETFRPAGGDMRDNWRSELYKYKVVKIEPLGVAPTRCKRRVVGRRRRRR 471 T 2.2E-52 GP120 pdbpssm T Viruses T 8sb0 1 A,E,I A,F,K A0A1W6IPB2_9HIV1 CH848.10.17.SOSIP gp120 AENLWVTVYYGVPVWKEAKTTLFCASDARAYEKEVHNVWATHACVPTDPSPQELVLGNVTENFNMWKNDMVDQMHEDIISLWDQSLKPCVKLTPLCVTLICSNATVKNGTVEEMKNCSFNTTTEIRDKEKKEYALFYKPDIVPLSETNNTSEYRLINCNTSACTQACPKVTFEPIPIHYCAPAGYAILKCNDETFNGTGPCSNVSTVQCTHGIRPVVSTQLLLNGSLAEKEIVIRSENLTNNAKIIIVHLHTPVEIVCTRPNNNTRKSVRIGPGQTFYATGDIIGDIKQAHCNISEEKWNDTLQKVGIELQKHFPNKTIKYNQSAGGDMEITTHSFNCGGEFFYCNTSNLFNGTYNGTYISTNSSANSTSTITLQCRIKQIINMWQGVGRCMYAPPIAGNITCRSNITGLLLTRDGGTNSNETETFRPAGGDMRDNWRSELYKYKVVKIEPLGVAPTRCKRRVVGRRRRRR 471 T 2.2E-52 GP120 pdbpssm T Viruses T 8sb1 1 A,E,I A,F,K A0A1W6IPB2_9HIV1 CH848.10.17 gp120 AENLWVTVYYGVPVWKEAKTTLFCASDARAYEKEVHNVWATHACVPTDPSPQELVLGNVTENFNMWKNDMVDQMHEDIISLWDQSLKPCVKLTPLCVTLICSNATVKNGTVEEMKNCSFNTTTEIRDKEKKEYALFYKPDIVPLSETNNTSEYRLINCNTSACTQACPKVTFEPIPIHYCAPAGYAILKCNDETFNGTGPCSNVSTVQCTHGIRPVVSTQLLLNGSLAEKEIVIRSENLTNNAKIIIVHLHTPVEIVCTRPNNNTRKSVRIGPGQTFYATGDIIGDIKQAHCNISEEKWNDTLQKVGIELQKHFPNKTIKYNQSAGGDMEITTHSFNCGGEFFYCNTSNLFNGTYNGTYISTNSSANSTSTITLQCRIKQIINMWQGVGRCMYAPPIAGNITCRSNITGLLLTRDGGTNSNETETFRPAGGDMRDNWRSELYKYKVVKIEPLGVAPTRCKRRVVGRRRRRR 471 T 2.2E-52 GP120 pdbpssm T Viruses T 8sb2 1 A,E,I A,F,K A0A1W6IPB2_9HIV1 CH848.10.17.SOSIP gp120 AENLWVTVYYGVPVWKEAKTTLFCASDARAYEKEVHNVWATHACVPTDPSPQELVLGNVTENFNMWKNDMVDQMHEDIISLWDQSLKPCVKLTPLCVTLICSNATVKNGTVEEMKNCSFNTTTEIRDKEKKEYALFYKPDIVPLSETNNTSEYRLINCNTSACTQACPKVTFEPIPIHYCAPAGYAILKCNDETFNGTGPCSNVSTVQCTHGIRPVVSTQLLLNGSLAEKEIVIRSENLTNNAKIIIVHLHTPVEIVCTRPNNNTRKSVRIGPGQTFYATGDIIGDIKQAHCNISEEKWNDTLQKVGIELQKHFPNKTIKYNQSAGGDMEITTHSFNCGGEFFYCNTSNLFNGTYNGTYISTNSSANSTSTITLQCRIKQIINMWQGVGRCMYAPPIAGNITCRSNITGLLLTRDGGTNSNETETFRPAGGDMRDNWRSELYKYKVVKIEPLGVAPTRCKRRVVGRRRRRR 471 T 2.2E-52 GP120 pdbpssm T Viruses T 8sb3 1 A,E,I A,F,K A0A1W6IPB2_9HIV1 CH848.10.17 gp120 AENLWVTVYYGVPVWKEAKTTLFCASDARAYEKEVHNVWATHACVPTDPSPQELVLGNVTENFNMWKNDMVDQMHEDIISLWDQSLKPCVKLTPLCVTLICSNATVKNGTVEEMKNCSFNTTTEIRDKEKKEYALFYKPDIVPLSETNNTSEYRLINCNTSACTQACPKVTFEPIPIHYCAPAGYAILKCNDETFNGTGPCSNVSTVQCTHGIRPVVSTQLLLNGSLAEKEIVIRSENLTNNAKIIIVHLHTPVEIVCTRPNNNTRKSVRIGPGQTFYATGDIIGDIKQAHCNISEEKWNDTLQKVGIELQKHFPNKTIKYNQSAGGDMEITTHSFNCGGEFFYCNTSNLFNGTYNGTYISTNSSANSTSTITLQCRIKQIINMWQGVGRCMYAPPIAGNITCRSNITGLLLTRDGGTNSNETETFRPAGGDMRDNWRSELYKYKVVKIEPLGVAPTRCKRRVVGRRRRRR 471 T 2.2E-52 GP120 pdbpssm T Viruses T 8sb4 1 A,E,I A,F,K A0A1W6IPB2_9HIV1 CH848.10.17 gp120 AENLWVTVYYGVPVWKEAKTTLFCASDARAYEKEVHNVWATHACVPTDPSPQELVLGNVTENFNMWKNDMVDQMHEDIISLWDQSLKPCVKLTPLCVTLICSNATVKNGTVEEMKNCSFNTTTEIRDKEKKEYALFYKPDIVPLSETNNTSEYRLINCNTSACTQACPKVTFEPIPIHYCAPAGYAILKCNDETFNGTGPCSNVSTVQCTHGIRPVVSTQLLLNGSLAEKEIVIRSENLTNNAKIIIVHLHTPVEIVCTRPNNNTRKSVRIGPGQTFYATGDIIGDIKQAHCNISEEKWNDTLQKVGIELQKHFPNKTIKYNQSAGGDMEITTHSFNCGGEFFYCNTSNLFNGTYNGTYISTNSSANSTSTITLQCRIKQIINMWQGVGRCMYAPPIAGNITCRSNITGLLLTRDGGTNSNETETFRPAGGDMRDNWRSELYKYKVVKIEPLGVAPTRCKRRV 463 T 2.5E-52 GP120 pdbpssm T Viruses T 8sb5 1 A,E,I A,F,K A0A1W6IPB2_9HIV1 CH848.10.17.SOSIP gp120 AENLWVTVYYGVPVWKEAKTTLFCASDARAYEKEVHNVWATHACVPTDPSPQELVLGNVTENFNMWKNDMVDQMHEDIISLWDQSLKPCVKLTPLCVTLICSNATVKNGTVEEMKNCSFNTTTEIRDKEKKEYALFYKPDIVPLSETNNTSEYRLINCNTSACTQACPKVTFEPIPIHYCAPAGYAILKCNDETFNGTGPCSNVSTVQCTHGIRPVVSTQLLLNGSLAEKEIVIRSENLTNNAKIIIVHLHTPVEIVCTRPNNNTRKSVRIGPGQTFYATGDIIGDIKQAHCNISEEKWNDTLQKVGIELQKHFPNKTIKYNQSAGGDMEITTHSFNCGGEFFYCNTSNLFNGTYNGTYISTNSSANSTSTITLQCRIKQIINMWQGVGRCMYAPPIAGNITCRSNITGLLLTRDGGTNSNETETFRPAGGDMRDNWRSELYKYKVVKIEPLGVAPTRCKRRVVGRRRRRR 471 T 2.2E-52 GP120 pdbpssm T Viruses T 8sb6 2 D,E D,E H4_HUMAN Histone H4 SGRGXGGKGLGXGGA 15 T 11 Shadoo unppercent F Eukaryota T 8shi 3 C,F C,F VAL-ARG-SER-ARG-ARG-ABA-LEU-ARG-LEU VRSRRXLRL 9 T 2 DUF1331 pdbhh F F 8sk7 3 C,F,I X,Y,Z HA_20 minibinder (RFdiffusion-designed) MEKEKELKEYAEKIKKEIGDIESVEVKDGKILVKAKKITDKTVDAIMKLTVKAARLGFKVEVELV 65 T 7.2 DUF5320 pdbhh F T 8sl0 1 A A GSDM_VITXG BGSDM,BACTERIAL GASDERMIN SGLCSDPAITYLKRLGYNVVRLPREGIQPLHLLGQQRGTVEYLGSLEKLITQPPSEPPAITRDQAAAGINGQKTENLSFSIGINILKSVLAQFGAGAGIEAQYNQARKVRFEFSNVLADSVEPLAVGQFLKMAEVDADNPVLKQYVLGNGRLYVITQVIKSNEFTVAAEKSGGGSIQLDVPEIQKVVGGKLKVEASVSSQSTVTYKGEKQLVFGFKCFEIGVKNGEITLFASQLVPR 237 T 0.00027 Gasdermin pdbpercent F Bacteria T 8smq 1 A,B,C,D A,B,C,D Q182N1_CLOD6 hypothetical protein CD630_25440 SNADKILDLSFKKIETDLSSKITYEDTGVKIETDSSKSDKERYLYIYQNIKENWSMYNNFYIEIQNKNKSSQKINLSIQSKNMFEFRLKEGSEVFLEGKNIIYSDKIKEGCIEVPGEFEGKIYVNFNSLINEESNVVLDSNMLSNIVSWGITFIPSDEEHNIVIIKKISLLSE 173 T 1.6E-05 Agarase_CBM pdbhh F Bacteria T 8snb 11 X,Y 1l,1m A0A7M7GGC2_STRPU Tex26(LOC100888047) MPNSGFQGQINQSFYGTHKAHLVPLGDQYVSGNLPNPAFWRRFIRDPVSSIENPSGTRVLTTTEVLAPVWQSNNCAANLSKNDRATSNTLPRLHTSSTWTTNEGSDHAPLRPQVPSSARGRFRDAHIPLVALNSLAPFATTYKPTCGYFFSRSTNNKKKQMGIPATDLVKYRYYVK 176 T 18 DUF983 pdbhh F Eukaryota T 8snb 12 AA,BA,CA,Z 1p,1q,1r,1o A0A7M7NFX5_STRPU Meiosis-specific nuclear structural protein 1 MSQQQYHWEALRKQRVIDRRLAAMKKMTDEERQMEELSTEGMARDELQSRQMDHAKRRELAEQIQHRDRRGIRNSYKEREQTIQKLVSERTWHDNLIAKMDQSEKDDLLKDLLRDHAQKSKTLRDRGVYRGDAKFFIDPQYN 142 T 0.19 Rubella_Capsid unppercent F Eukaryota T 8snb 14 MA 2G A0A7M7RF95_STRPU CFAP107 MAHGDPQKWNLPGWRIEQRYAGNVLIGNWSEERQKFGRGGEKHTSTHRMDYLNNRNFAPDVMTRRAAKMRNEGLDQTLLFAHHNKNLKNNLISWYDEQFNKRERSGGDQLPELRHWDGQKLAWEPEKTDHPVKGAPTNFGLKDRLQEKWKTEEADKKLSDYSTTYGLDYKNKPRAALVTEHFAPQRAQSSRMHPVNKINKDTNLRSTSILQTPQQIHMRTRNEAVSRSGPAPVSV 235 T 0.0066 DUF1143 unppssm F Eukaryota T 8snb 15 NA,OA,PA 2J,2K,2L A0A7M7NA77_STRPU Cilia- and flagella-associated protein 126 MSSHFSANQYKQAFDSRRLQNSQIPQTYKERPSSYEGFTQIIANDRGHLKQGVPRSKDSPWGGFVGTWEMPKKIPGNVTTYMSRGDPAIDNIQKTRAEHNEYMRQAVSPDKTLAMEPKPQVTKVAEEDRPGNPSPNDAIPA 141 T 48 Scm3 pdbhh F Eukaryota T 8snb 18 TA,UA 2V,2W A0A7M7RAY9_STRPU Cilia- and flagella-associated protein 161 MSVRSYNPSVRVGNWNEDTCLEEDMVKDFLEKKEKGELLIQKASNLRSTILKPSDLSVTVDGFVHFGDTVVIMNEAAADQVRTQPGVEPRQANVLSVNMSETKMHETMRFEGTCTASASKSLNPCVRNTFVIAPAQDGIPPGSPLTYGQHFRLCTLPGVGGNLVLQSDRVSFHASAEKSRKQLISFVDEVQSPYLTEWRILCFNPQIRMESEGLPVPANQRIIFNHCKSNEDLCVVSGMSVRTPFGREYEVVAHTDLDSHRAEKDVNHWIIKTGEPAQPTTLAKTLPVGDQQ 292 T 0.1 DUF1143 pdbhh F Eukaryota T 8snb 29 CC 4Y A0A7M7RHW6_STRPU HeLo_like_N(LOC577943) MTDTVPVPAVRPPVDKPMRLVGVSHSNNSYSLVDHASANDQLHYIFLVVKQHIAGILADVQYYKIAELKNEIIALGTDVAALVRDRSFEESRDKYHSHFWRADDEAENKRLIVKLSEVAYALLDYKQNCCTQSRGPYALDEAKLEILCKKHLYELQNFRRELAQFVNTARDRNAQVTRLSAPLQSQLEWYQYKSPLDDPSIRRPLPYECTLTSRETIRPRTEPPEVDGHISGVKHVPSQWDVPNLSGKPGAANSGTHGNLTSQGRYGGRHMYERNINERRKPCEQEIQHVSSRYKHYHGLCE 302 T 0.18 DUF4208 pdb F Eukaryota T 8snb 34 AD,BD 6Q,6R A0A7M7THB5_STRPU RIB35 MSVTSANPATPYTRAEFFNACTSVLEGLNCLQRQQSVIDERSWEVLNRLGQMVSDLRPEVKGNKEYWVDGPLVKFLTANVLKAKSVLQEMKRTCSQKSAATNGPEYIERHLLQHVEDIRTSLEELETYHQQTLYRTDTTGIDAGVGQRMHTITQGPTEMTAGGLTQTPPAEISLEPKGSGTFMKDLNNAKIATNDSYPGTSSQPMTGMWSSGLPHHPETSKPGVFEHIQEMTNIYPEHDPNWPKREQKPKLPFERTQPPLVYPGIRDMHNYSNLPQFTGKDLPMPKIPNGEMGLRAPHVPHWDSTNHYSY 310 T 0.032 CCCAP pdbpercent F Eukaryota T 8snb 38 ND,OD 7M,7N A0A7M7THD0_STRPU SAXO3(LOC115918676) MTGADRRFDLHQTSSGRGLDYRPEYYFPASDFKTTINNPLPPQLAKQDEIIKPFQTTTGGAHDYKYHGGLMANPQHHKAPGHWNMHYNKDLREKLQQRGWRKPLTMGNQESEVQAQYKGDQMQMGVDFDNRLSGNPQPSDLQTHHQNCPAPVRDSVPKYKPTLVRDDGALQLLDIYVPTSHHVHKRFTRHELDDYPKKDAATYWRCEDYTQAWGHGTKHNPLPKGAEIHQRAPMVDEMVFKTAIKEPARWPERFKRVPHAGMKTTMTSSYKTPSDPKMTELFSCPVDTPWVIPEAGPIQTFSVPNMYTTEYKTYASGKPITV 322 T 27 DUF4632 pdbhh F Eukaryota T 8snb 39 PD,QD 7Q,7R A0A7M7N5A5_STRPU TEPP protein MPTVEVPYYVPQYPTFRRAQLAAVKEGLYHPSLPTFRRMDMDTAAHRLPDEHCRTTTGVGPADFQNATATYFQPPANTYNGANITDTGRLLRETMKDDVKSLRLDWAKAKDIKELPQIKNTGQLRFSGYAVRYQKPAISGSWRYTFTQEPRLDQYGQRPVPANIYSRYRDTFPQYSRNMSTDAFR 185 T 1.7 Cofac_haem_bdg unppercent F Eukaryota T 8snb 41 TD,UD 7Y,7Z A0A7M7N7W6_STRPU Sperm-associated antigen 8 MATLNPARTLNNSGGRCLMENWVEERQVFQTGLDSAGVNSTESYTSNSSLPYKDGHKGILTRELDTAVEKESNSMGSYQRPAQCGVRTVGRKKELMERALYAKVSQELQEEINEPSPVEEYKSVTQKDFYDDEFESELPAPLYEHNVNTEQPITFWSHHKEKIPGVSQIKTLDTPFKKNSAFSKPIAEGTDQPQPYEQETHPFL 204 T 0.00018 Apis_Csd pdb F Eukaryota T 8snb 50 QE 9J A0A7M7RFU3_STRPU Tex49_homologue(LOC580808) MTAQCMGNFGAHKWRYPEFMADRYGRSNGSVDPESRHKQYDAGVSDQTVWVNKRYIPTDLRSTAPVPRRKLLASARLPHVERDWVPLGEASCRQISRAMEERQLYVSQPSTRSEDWSTLRQILPSKGLPLRDSPPNWGTGNAYAPPMLGARQRRFPHINSPMTRYTDNMHTTHKLFKLH 179 T 0.7 ARL6IP6 unppssm F Eukaryota T 8snb 52 VE 9T A0A7M7RHE9_STRPU SPATA45 MDPQKNYEMNNQRESWCAVELSPLQDWCKSERKHHGENFKSSVFNAKQGQPESEARCTFEVNDKTHREKRHFPNKTSYSHLAI 83 T 0.048 ESF1 unppssm F Eukaryota T 8soi 3 D D ATG13_HUMAN Autophagy-related protein 13 DLGTFYREFQNPPQLSSLSIDIGAQSMAEDLDSLPEKLAVHEKNVREFDAFVETLQGSDEA 61 T 12 DUF2315 pdbhh F Eukaryota T 8sqz 3 E,F E,F ATG13_HUMAN Autophagy-related protein 13 HDVLETIFVRKVGAFVNKPINQVTLTSLDIPFAMFAPKNLELEDTDPMVNPPDSPETESPLQGSLHSDGSSGGSSGNTHDDFVMIDFKPAFSKDDILPMDLGTFYREFQNPPQLSSLSIDIGAQSMAEDLDSLPEKLAVHEKNVREFDAFVETLQ 155 T 22 BLI1 unphh F Eukaryota T 8srm 3 E,F E,F ATG13_HUMAN Autophagy-related protein 13 KPAFSKDDILPMDLGTFYREFQNPPQLSSLSIDIGAQSMAEDLDSLPEKLAVHEKNVREFDAFVETLQGSDEA 73 T 11 DUF1244 pdbhh F Eukaryota T 8srq 3 D D ATG13_HUMAN Autophagy-related protein 13 DLGTFYREFQNPPQLSSLSIDIGAQSMAEDLDSLPEKLAVHEKNVREFDAFVETLQGSDEA 61 T 12 DUF2315 pdbhh F Eukaryota T 8srz 1 A,B A,B PROT2_LYSEN TRYPSIN-LIKE PROTEASE 2 SVQADYSRAEALAAWTRLSDEFIGNCYVSVRPRHAPAWEVVVASAAGSLRLEAFKRAHDHDFLDRLAVAIGNWEQKAQRPDHEIAQMLDQVG 92 T 3.3 Anillin_N pdbhh F Bacteria T 8ss1 1 A,B A,B A0A2U1VUZ9_9PROT Serine protease SNAERLAAWTRLPWEGLRYSYNRERRGTAARSCPQLEADVALKAETQPSEIPLERQLILEACREAERFGFLHELSIAIVEMERLNKRPEAEVEEIAKLWQ 100 T 0.85 EAD7 unphh F Bacteria T 8sv0 2 B,C C,E protein VII PGGFKRRRL 9 T 5.3 Nha1_C pdbhh F T 8sw1 2 B B Polyglutamine peptide XXXXXXXXXXXXXXXXXXX 19 F F F 8sw7 1 A,C,F A,C,F BG505 Boost 2 gp120 MDAMKRGLCCVLLLCGAVFVSPSQEIHARFRRGARAENLWVTVYYGVPVWKDAETTLFCASDAKAYETKKHNVWATHCCVPTDPNPQEIHLENVTEEFNMWKNNMVEQMHTDIISLWDQSLKPCVKLTPLCVTLQCTNVTNNITDDMRGELKNCSFNMTTELRDKKQKVYSLFYRLDVVQINENQGNRSNNSNKEYRLINCNTSAITQACPKVSFEPIPIHYCAPAGFAILKCKDKKFNGTGPCTNVSTVQCTHGIKPVVSTQLLLNGSLAEEEVIIRSENITNNAKNILVQLNESVQINCTRPNNNTRKSIRIGPGQWFYATGDIIGDIRQAHCNVSKATWNETLGKVVKQLRKHFGNNTIIRFANSSGGDLEVTTHSFNCGGEFFYCNTSGLFNSTWISNTSVQGSNSTGSNDSITLPCRIKQIINMWQRIGQAMYAPPIQGVIRCVSNITGLILTRDGGSTNSTTETFRPGGGDMRDNWRSELYKYKVVKIEPLGVAPTRCKRRVVGRRRRRR 516 T 2.8E-49 GP120 pdb F T 8sw7 3 G H FP1 heavy chain XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 125 F F F 8sw7 4 H L FP1 light chain XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 106 F F F 8swd 1 A,B,C,D A,B,C,D Q0PAA2_CAMJE CIAD MHHHHHHSSGVDLWSHPQFEKGTENLYFQSNIMNLEDLAKKTISEVSSIMEEQRRQNEILKEQELNRKTEIKDELPPMEFVCEELDTPQDLEDKISMAKFEEEQKIQNNIEISTQENKEFKKEEPFLQNEILNPSVMTEVQTLNEDIFLKHLRERILVLFEGLNSIKKDDLENRLNLTINFLEFLLANIEDKLKK 195 T 0.31 YebO unp F Bacteria T 8t2u 3 C C Eptifibatide XXGDWPCX 8 T 5.3 Ferlin_C pdbhh F T 8t61 1 A A Designed peptide BH33 RHYYKFNSTGRHYHYY 16 T 0.091 Phage_fiber pdb F T 8t62 1 A A Designed peptide BH21 TMIEDPEAGHFHTSSA 16 T 5.2 MPLKIP pdbhh F T 8t63 1 A A Designed peptide PH1 WHMWNTVPNAKQVIAA 16 T 7.7 DUF5820 pdbhh F T 8t8b 56 DB,HD 1v,2v P-site Peptidyl-tRNA Analog Peptide MAI 3 T 170 DUF6117 pdbhh F F 8t8c 56 DB,HD 1v,2v P-site Peptidyl-tRNA Analog Peptide MFI 3 T 120 RmuC pdbhh F F 8tfv 1 A A THAN_PODMA PROTEIN (THANATIN) GSKKPVPIIYCNRRTGKCQRM 21 T 2.6 YihI pdbhh F Eukaryota T 9lpr 2 B P METHOXYSUCCINYL-ALA-ALA-PRO-LEUCINE BORONIC ACID INHIBITOR XAAPX 5 T 500 Suv3_C_1 pdbhh F F # Virus entities: 1197