Output of CFTR R domain pattern search (filtered, SwissProt)

Because the matches from UniProt included many entries annotated as 'Theoretical protein', we narrowed the protein set to the well-annotated SwissProt.

PaPbPcPdPePfPgPhP: the 'primitive' pattern, where
P: the motif [RK][RK]X[ST] (X = any aa)
U: XXXX (any four aa) in place of [RK][RK]X[ST]
a=9, b=15, c=13, d=11, e=24, f=30, g=26, h=17: spacers (any aa)
(num1:num2): class and penalty score of the match
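
As a rough illustration of the P motif defined above, the following Python sketch scans one sequence line for [RK][RK]X[ST]. The function name find_p_motifs and the use of a regular expression are assumptions made for this example, not the search tool that produced this listing. The subtraction of 1 reflects only what the listing shows (the CFTR_BOVIN fragment begins at residue 656 while its first P-site is reported as 655). The sketch does not model the spacers a to h, the U substitution, or the class and penalty scoring.

```python
import re

# P motif from the legend: [RK][RK]X[ST], with X standing for any residue.
# A zero-width lookahead is used so that overlapping motifs are also found.
P_MOTIF = re.compile(r"(?=[RK][RK].[ST])")


def find_p_motifs(fragment, first_residue):
    """Return the sequence position of the first residue of each P motif.

    fragment      -- residues as printed in the listing; the spaces that
                     group residues in tens are ignored
    first_residue -- sequence number of the first residue in the fragment
    (Hypothetical helper, written only for this illustration.)
    """
    residues = fragment.replace(" ", "")
    return [first_residue + m.start() for m in P_MOTIF.finditer(residues)]


# First printed line of the CFTR_BOVIN hit (residues 656 to 705).
bovine = "RRNSIITETL RRFSLEGDTS VSWNETKKPS FKQTGEFGEK RKNSILSSIN"
starts = find_p_motifs(bovine, 656)
print(starts)                   # motif start positions
print([s - 1 for s in starts])  # same sites in the listing's P-site numbering
```

For this line the second printout reproduces the first four P-sites listed for CFTR_BOVIN (655, 665, 681, 695).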



Name: CFTR_BOVIN [UniProt] [InterPro]
Description: Cystic fibrosis transmembrane conductance regulator (CFTR) (cAMP-dependent chloride channel).
Keywords: Phosphorylation, Transmembrane, Chloride, ATP-binding, Chloride channel, Glycoprotein, Ionic channel, Repeat, Transport
PaPbPcPdPePfPgPhP (1:0) P-sites: 655,665,681,695,707,732,763,791,809
656RRNSIITETL RRFSLEGDTS VSWNETKKPS FKQTGEFGEK RKNSILSSIN 705
706SIRKFSVVQK TSLQMNGIEG AADAPLERRL SLVPHSEPGE GILPRSNAVN 755
756SGPTFLGGRR QSVLNLMTGS SVNQGQSIHR KTATSTRKMS LAPQASLAEI 805
806DIYSRRLS
Name: CFTR_RABIT [UniProt] [InterPro]
Description: Cystic fibrosis transmembrane conductance regulator (CFTR) (cAMP-dependent chloride channel).
Keywords: Ionic channel, Transport, Repeat, Chloride channel, Glycoprotein, ATP-binding, Chloride, Phosphorylation, Transmembrane
PaPbPcPdPePfPgPhP (1:0) P-sites: 626,636,652,666,678,703,734,761,779
627RRNSILTETL RRFSLEGDAS VSWNDTRKQS FKQNGELGEK RKNSILNPVN 676
677SMRKFSIVLK TPLQMNGIEE DSDATIERRL SLVPDSEQGE AILPRSNMIN 726
727TGPMLQGCRR QSVLNLMTHS VSQGPSIYRR TTTSTRKMSL APQTNLTEMD 776
777IYSRRLS
Name: CFTR_SHEEP [UniProt] [InterPro]
Description: Cystic fibrosis transmembrane conductance regulator (CFTR) (cAMP-dependent chloride channel).
Keywords: Phosphorylation, Repeat, Transport, Chloride, 3D-structure, ATP-binding, Chloride channel, Glycoprotein, Ionic channel, Polymorphism, Transmembrane
PaPbPcPdPePfPgPhP (1:0) P-sites: 655,665,681,695,707,732,763,791,809
656RRNSIITETL RRFSLEGDTS VSWNETKKPS FKQTGEFGEK RKNSILNSIN 705
706SIRKFSVVQK TSLQMNGIDG ASDEPLERRL SLVPHSEPGE GILPRSNAVN 755
756SGPTFLGGRR QSVLNLMTCS SVNQGQSIHR KTATSTRKMS LAPQASLAEI 805
806DIYSRRLS
Name: CFTR_MACMU [UniProt] [InterPro]
Description: Cystic fibrosis transmembrane conductance regulator (CFTR) (cAMP-dependent chloride channel) (Fragment).
Keywords: Transport, Repeat, Ionic channel, ATP-binding, Chloride, Chloride channel, Phosphorylation, Transmembrane
PaPbPcPdPePfP (1:2) P-sites: 56,66,82,96,108,133,164
57RRNSILTETL RRFSLEGDAG VSWTETKKQS FKQTGEFGEK RKNSILNPIN 106
107SIRKFSIVQK TPLQMNGIEE DSDEPLERRL SLVPYSEQGE VILPRISVIS 156
157TGPTLQARRR QS
Name: ATRX_CAEEL [UniProt] [InterPro]
Description: Transcriptional regulator ATRX homolog (X-linked nuclear protein-1).
Keywords: ATP-binding, DNA damage, DNA repair, DNA-binding, Helicase, Hydrolase, Nuclear protein
PaUbPcUdPeUfUgPhP (1:4) P-sites: 100,120,147,229,247
101RKKSKSKKKV DQKKKEKSKK KRTTSSSEDE DSDEEREQKS KKKSKKTKKQ 150
151TSSESSEESE EERKVKKSKK NKEKSVKKRA ETSEESDEDE KPSKKSKKGL 200
201KKKAKSESES ESEDEKEVKK SKKKSKKVVK KESESEDEAP EKKKTEKRKR 250
251S
Name: EBP2_SCHPO [UniProt] [InterPro]
Description: Probable rRNA processing protein ebp2.
Keywords: Nuclear protein, Ribosome biogenesis, Coiled coil, Hypothetical protein, Complete proteome
PbUcPdPeP (1:5) P-sites: 194,225,237,263
195KKLSQQAKKQ RELKKFGKQV QLAKQEERQR EKKETLEKIN LLKRKHTGGD 244
245LTTEDDFDIA LSSASADTFK KGS
Name: FHL1_YEAST [UniProt] [InterPro]
Description: Pre-rRNA processing protein FHL1.
Keywords: Transcription regulation, Transcription, DNA-binding, Nuclear protein, Complete proteome
PbUcUdPePfUgP (1:5) P-sites: 474,520,546,608
475RKYSTAKGMS LSEIYAGIRE LFPYYKYCPD GWQSSVRHNL SLNKSFRKVS 524
525KEGKGWLWGL DEEYIAERER QKKKQSEIAV AKAQAAQLKL EQQQHKLQQV 574
575PQRGKKDIVS QRSNVNARKQ NISQTLAANR AASNRKNT
Name: FUTSC_DROME [UniProt] [InterPro]
Description: Microtubule-associated protein futsch.
Keywords: Cytoskeleton, Repeat, Microtubule, Developmental protein
PaUbUcPdPeP (1:5) P-sites: 3709,3746,3759,3783
3710RRESVAEKSP LPSKEASRPT SVAESVKDEA EKSKEESRRE SVAEKSSLAS 3759
3760KKASRPASVA ESVKDEAEKS KEESRRES
Name: GTSE1_MOUSE [UniProt] [InterPro]
Description: G2 and S phase expressed protein 1 (Gtse-1) (B99 protein).
Keywords: Microtubule, Phosphorylation
PbUcUdUePfPgP (1:5) P-sites: 430,489,521,547
431RRDSYLSCKT EAVSTTTNPF KVPQFSVGES PGGVTPKFSR THRLQSWTPA 480
481SRVVSSTPVR RSSGTTPQGL PGSMRTPLST RRMSVLPTPA SRRLSSLPLM 530
531APQSMPRALV SPLCVPARRL S
Name: RS18_MYCPE [UniProt] [InterPro]
Description: 30S ribosomal protein S18.
Keywords: rRNA-binding, RNA-binding, Ribonucleoprotein, Ribosomal protein, Complete proteome
PaPbPcUdUeUfP (1:5) P-sites: 30,41,57,122
31KKTTTTKTST AKKATTASVE KTEVKETKKS SDNKKEFNPQ DRPFAKYNRG 80
81YPNKQGRRKK FCKLCAKGQE HVDYKDVELL YKYLTPNLKI ASRKIT
Name: RT04_MARPO [UniProt] [InterPro]
Description: Mitochondrial ribosomal protein S4.
Keywords: Ribonucleoprotein, Mitochondrion, Ribosomal protein
PaUbPcPdUeP (1:5) P-sites: 19,39,52,77
20KKLTLKQKFL ISELQKQKKN KKQSDFSIQL QTIKKLSLFY GNLPIKKMQR 69
70AKTHTYIDKK NS
Name: RU17_HUMAN [UniProt] [InterPro]
Description: U1 small nuclear ribonucleoprotein 70 kDa (U1 snRNP 70 kDa) (snRNP70) (U1-70K).
Keywords: Nuclear protein, Direct protein sequencing, RNA-binding, mRNA processing, Phosphorylation, Ribonucleoprotein, Alternative splicing
PaPbPcUdUeUfP (1:5) P-sites: 262,273,289,353
263RRRSRSRDKE ERRRSRERSK DKDRDRKRRS SRSRERARRE RERKEELRGG 312
313GGDMAEPSEA GDAPPDDGPP GELGPDGPDG PEEKGRDRDR ERRRS
Name: CFTR_XENLA [UniProt] [InterPro]
Description: Cystic fibrosis transmembrane conductance regulator (CFTR) (cAMP-dependent chloride channel).
Keywords: Chloride, Phosphorylation, Repeat, Ionic channel, Chloride channel, ATP-binding, Glycoprotein, Transmembrane, Transport
PaPbUcPdPePfPgPhP (2:1) P-sites: 654,664,694,706,733,764,792,811
655RRNSIITETL RRCSIDSDPS AVRNEVKNKS FKQVADFTEK RKSSIINPRK 704
705SSRKFSLMQK SQPQMSGIEE EDMPAEQGER KLSLVPESEQ GEASLPRSNF 754
755LNTGPTFQGR RRQSVLNLMT RTSISQGSNA FATRNASVRK MSVNSYSNSS 804
805FDLDIYNRRL S
Name: ALK1_YEAST [UniProt] [InterPro]
Description: DNA damage-responsive protein ALK1.
Keywords: Complete proteome, DNA damage
PaPbPcUdPePfUgUhP (2:3) P-sites: 89,97,115,141,167,254
90RKTTSNSKKR WSLLSNHSAV SSSKSKKRWS VLSSSFTSES HKDRESRNVL 139
140QQKRKSLQSY SSLDTVASNS SISASSSLKR SSTGLSLRQL FTKIGINDDI 189
190SQPGIGIPQG KENLPPTMGK KNSSIASTSS ENRLRTPLKP LVNHSKRPTS 239
240QPQQQQPLYN ASLSSRRSS
Name: CYAA_YEAST [UniProt] [InterPro]
Description: Adenylate cyclase (EC 4.6.1.1) (ATP pyrophosphate-lyase) (Adenylyl cyclase).
Keywords: Lyase, Leucine-rich repeat, Complete proteome, cAMP biosynthesis, Repeat, Metal-binding, Magnesium
PaPbUcPdUePfUgP (2:4) P-sites: 164,174,208,230,292
165RKPSLAETLF KRFSGSNSHD GNKSGKESKV ANLSLSTVNP APANRKPSKD 214
215STLSNHLADN VPSTLRRKVS SLVRGSSVHD INNGIADKQI RPKAVAQSEN 264
265TLHSSDVPNS KRSHRKSFLL GSTSSSSSRR GS
Name: CYP1_BRUMA [UniProt] [InterPro]
Description: Peptidylprolyl isomerase 1 (EC 5.2.1.8) (Peptidylprolyl cis-trans isomerase 1) (PPIase 1) (Cyclophilin) (BmCYP-1).
Keywords: 3D-structure, Rotamase, Isomerase
PaPbUcPdUeUfPgP (2:4) P-sites: 704,714,748,796,825
705RRRSRSNGRR RRSSSRRSRS RDRRHKSRSR SRGYVRRFEG WSRSRRPTRR 754
755ELYDERMRRE RERRRSFDRY SDRRRTRSRS ARRDSDRHSR RSRKRSPSSS 804
805SSSSESSSSD SRSTASSSAS SKRSS
Name: DBH2_STRCO [UniProt] [InterPro]
Description: DNA-binding protein HU 2.
Keywords: Complete proteome, DNA condensation, DNA-binding
PaUbUcUdUePfPgPhP (2:4) P-sites: 70,133,163,190,210
71KKTSVPRFRA GQGFKDLVSG SKKLPKNDIA VKKAPKGSLS GPPPTISKAA 120
121GKKAAAKKAT GAAKKTTGAA KKTSAAAKKT TAKKTTGAAK TTAKKTTAKK 170
171SAAKTTTAAA KKTAAKKAPA KKATAKKAPA KKSTARKTTA KKAT
Name: DBH2_STRLI [UniProt] [InterPro]
Description: DNA-binding protein HU 2.
Keywords: DNA-binding, DNA condensation
PaUbUcUdUePfPgPhP (2:4) P-sites: 70,133,163,190,210
71KKTSVPRFRA GQGFKDLVSG SKKLPKNDIA VKKAPKGSLS GPPPTISKAA 120
121GKKAAAKKAT GAAKKTTGAA KKTSAAAKKT TAKKTTGAAK TTAKKTTAKK 170
171SAAKTTTAAA KKTAAKKAPA KKATAKKAPA KKSTARKTTA KKAT
Name: ADR1_YEAST [UniProt] [InterPro]
Description: Regulatory protein ADR1.
Keywords: 3D-structure, Activator, Complete proteome, DNA-binding, Metal-binding, Nuclear protein, Phosphorylation, Repeat, Transcription, Transcription regulation, Zinc-finger
PaPbUcPdP (2:5) P-sites: 166,176,208,222
167KKVSRTITKA RKNSASSVKF QTPTYGTPDN GNFLNRTTAN TRRKASPEAN 216
217VKRKYLKKLT
Name: ANK3_HUMAN [UniProt] [InterPro]
Description: Ankyrin 3 (ANK-3) (Ankyrin G).
Keywords: Alternative splicing, ANK repeat, Cytoskeleton, Repeat
PaUbPcPdP (2:5) P-sites: 1417,1439,1455,1466
1418RKTTKGLPQT AVCNLNITLP AHKKETESDQ DDEIEKTDRR QSFASLALRK 1467
1468RYS
Name: BLM_DROME [UniProt] [InterPro]
Description: Bloom's syndrome protein homolog (EC 3.6.1.-) (Dmblm) (Mutagen-sensitive protein 309) (RecQ helicase homolog).
Keywords: ATP-binding, DNA replication, DNA-binding, Helicase, Hydrolase, Nuclear protein, Repeat
PcPdUePfUgP (2:5) P-sites: 71,83,109,171
72KKSSRDPRTA KLKKHTYLDL SVSPLAKLSA KKYARDSPKK PTSLDLSVSP 121
122LAELLAKKSD RDSPKKPVQN ENSYTYRGLS ESPVENKSIG DTLRKPPQKE 171
172RKTS
Name: CCNB3_MOUSE [UniProt] [InterPro]
Description: G2/mitotic-specific cyclin B3.
Keywords: Ubl conjugation, Meiosis, Nuclear protein, Cell division, Alternative splicing, Cell cycle, Cyclin
PbUcPdUePfUgP (2:5) P-sites: 215,246,271,335
216RKVTLTSSPL WLKNKHVVQE EKPVIQEKSS FKRISLVSNV VTTKEKPPVK 265
266KPHFRKKKPT TEMKSLLQEP SLEEKYNTQE DASILKKPQV LQENTNNKDA 315
316TLTEPVTFKG KHSANEATHT KKPS
Name: IF2_COXBU [UniProt] [InterPro]
Description: Translation initiation factor IF-2.
Keywords: Protein biosynthesis, Initiation factor, GTP-binding, Complete proteome
PaPbUcPdUeUfP (2:5) P-sites: 59,69,99,149
60KKRSKIVLKR KKLSVVKSGK KSVNVEIRSK RTYTKPVVEQ KRETEPAPTQ 109
110EVPPTSDTTN LNEKAEVNVA TLEKAVEAEV KEEAKKTPSE KKET
Name: NPH1_ARATH [UniProt] [InterPro]
Description: Nonphototropic hypocotyl protein 1 (EC 2.7.1.37) (Phototropin).
Keywords: Serine/threonine-protein kinase, Flavoprotein, FMN, Repeat, Transferase, ATP-binding, Kinase, Photoreceptor, Phosphorylation, Membrane
PaUbPcPdP (2:5) P-sites: 372,390,406,416
373RRMSENVVPS GRRNSGGGRR NSMQRINEIP EKKSRKSSLS FMGIKKKS
Name: PHI0_HOLTU [UniProt] [InterPro]
Description: Sperm-specific protein Phi-0.
Keywords: Spermatogenesis, Differentiation, Chromosomal protein, DNA-binding, Nucleosome core, Nuclear protein
PaPbUcPdUeP (2:5) P-sites: 3,15,48,71
4RRQTKKARKP AARRRSAAKR AAPAAKKAAS RRRPKSAKKA KPAARRRSSV 53
54KPKAAKAAAQ VRRRSRRIRR AS
Name: PRP4B_HUMAN [UniProt] [InterPro]
Description: Serine/threonine-protein kinase PRP4 homolog (EC 2.7.1.37) (PRP4 pre-mRNA processing factor 4 homolog) (PRP4 kinase).
Keywords: mRNA processing, Transferase, Serine/threonine-protein kinase, Kinase, ATP-binding, Phosphorylation, mRNA splicing, Nuclear protein
PbUcPdUePfP (2:5) P-sites: 406,439,461,491
407RRLSSPRTRP RDDILSRRER SKDASPINRW SPTRRRSRSP IRRRSRSPLR 456
457RSRSPRRRSR SPRRRDRGRR SRSRLRRRSR SRGGRRRRS
Name: PRP4B_MOUSE [UniProt] [InterPro]
Description: Serine/threonine-protein kinase PRP4 homolog (EC 2.7.1.37) (PRP4 pre-mRNA processing factor 4 homolog) (Pre-mRNA protein kinase).
Keywords: Kinase, Serine/threonine-protein kinase, mRNA processing, ATP-binding, Phosphorylation, Nuclear protein, mRNA splicing, Transferase
PbUcPdUePfP (2:5) P-sites: 406,439,461,491
407RRLSSPRTRP RDDILGRCER SKDASPINRW SPTRRRSRSP IRRRSRSPLR 456
457RSRSPRRRSR SPRRRDRSRR SKSRLRRRSR SRGGHRRRS
Name: SFR12_HUMAN [UniProt] [InterPro]
Description: Splicing factor, arginine/serine-rich 12 (Serine-arginine-rich splicing regulatory protein 86) (SRrp86) (Splicing regulatory protein 508) (SRrp508).
Keywords: mRNA processing, Direct protein sequencing, Alternative splicing, mRNA splicing, Nuclear protein, Spliceosome
PaUbPcPdP (2:5) P-sites: 209,231,247,260
210RKRSQSKHRS RSHNRSRSRQ KDRRRSKSPH KKRSKSRERR KSRSRSHSRD 259
260KRKDT
Name: SON_MOUSE [UniProt] [InterPro]
Description: SON protein.
Keywords: Alternative splicing, Repeat, Nuclear protein, DNA-binding, RNA-binding
PaUbPcPdUeP (2:5) P-sites: 1801,1821,1833,1855
1802KRDSSLRSRS KRSKSSEHKS RKRTSESRSR ARKRSSKSKS HRSQTRSRSR 1851
1852SRRRRRSS
PaUbPcUdPeUfPgPhP (4:3) P-sites: 1875,1891,1921,1975,2003,2025
1876RKRSPKHRSK SRERKRKRSS SRDNRKAARA RSRTPSRRSR SHTPSRRRRS 1925
1926ISVGRRRSFS ISPSRRSRTP SRRSRTPSRR SRTPSRRSRT PSRRSRTPSR 1975
1976RRRSRSAVRR RSFSISPVRL RRSRTPLRRR FSRSPIRRKR SRSSERGRSP 2025
2026KRLT
Name: TRA2A_HUMAN [UniProt] [InterPro]
Description: Transformer-2 protein homolog (TRA-2 alpha).
Keywords: RNA-binding, Alternative splicing, Nuclear protein, mRNA splicing, mRNA processing, Phosphorylation
PaUbPcPdP (2:5) P-sites: 60,78,92,105
61RRHSHRRYTR SRSHSHSHRR RSRSRSYTPE YRRRRSRSHS PMSNRRRHT
Name: YCF1_PINTH [UniProt] [InterPro]
Description: Hypothetical 205.3 kDa protein ycf1 (ORF 1756).
Keywords: Hypothetical protein, Chloroplast
PcUdPeUfPgUhP (2:5) P-sites: 1370,1400,1450,1502
1371KKKTDLITNQ QGIDETRREN VGIMDSPKIE KRISQLDLNF WLFPELSGIK 1420
1421NIYYETKSKF IPGNSLLREE RERKKIEEEE RKETTNVLEQ IIGIRSNVKN 1470
1471KQVEDGQDKN GQVEDQDGQD QDGQVEDQQT DGKKKT
Name: YQGF_BACSU [UniProt] [InterPro]
Description: Hypothetical protein yqgF.
Keywords: Peptidoglycan synthesis, Cell wall, Hypothetical protein, Transmembrane, Complete proteome
PaPbUcPdP (2:5) P-sites: 106,117,151,164
107KKLSDMIKVD TKKVTERDKK DYWILTRPKE AKKLISSKER QQVEDKKISD 156
157DDLYQLQLKR IT
Name: MLH_TETTH [UniProt] [InterPro]
Description: Micronuclear linker histone polyprotein (MIC LH) [Contains: Micronuclear linker histone-alpha; Micronuclear linker histone-beta; Micronuclear linker histone-delta; Micronuclear linker histone-gamma].
Keywords: Phosphorylation, Repeat, Nuclear protein, DNA-binding, Direct protein sequencing, Chromosomal protein
PaPbPcPdPePfPgPhP (4:0) P-sites: 242,250,265,281,297,321,348,374,391
243RKASNSKGRK NSTSNKRNSS SSSKRSSSSK NKKSSSSKNK KSSSSKGRKS 292
293SSSRGRKASS SKNRKSSKSK DRKSSSSKGR KSSSSSKSNK RKASSSRGRK 342
343SSSSKGRKSS KSQERKNSHA DTSKQMEDEG QKRRQSSSSA KRDESSKKSR 392
393RNS
PcUdPeUfPgPhP (4:4) P-sites: 498,525,572,595,613
499KKNSKSNTRS KSKSKSASKS RKNASKSKKD TTNHGRQTRS KSRSESKSKS 548
549EAPNKPSNKM EVIEQPKEES SDRKRRESRS QSAKKTSDKK SKNRSDSKKM 598
599TAEDPKKNNA EDSKGKKKS
Name: SGS3_DROER [UniProt] [InterPro]
Description: Salivary glue protein Sgs-3 precursor.
Keywords: Repeat, Signal
PaPbPcPdPePfPgPhP (4:0) P-sites: 81,90,108,122,137,162,193,223,243
82KRPTARPTTR RTTVRATTKR ATTRRTTKRA TTRRTTVRAT TKRATTRRTT 131
132TKRAPTRRTT TKRATTRRNP TRRTTTRRAP TKRATTKRAT TRRNPTKRKT 181
182TRRTTVRATK TTKRATTKRA PTKRATTKRA PTKRVTTKRA PTKRATTKRA 231
232PTKRATTKRA PTKRAT
Name: CFTR_MOUSE [UniProt] [InterPro]
Description: Cystic fibrosis transmembrane conductance regulator (CFTR) (cAMP-dependent chloride channel).
Keywords: Glycoprotein, Ionic channel, Phosphorylation, Chloride, Alternative splicing, 3D-structure, ATP-binding, Chloride channel, Repeat, Transmembrane, Transport
PaPbUcPdPePfPgPhP (4:1) P-sites: 656,666,694,706,728,759,786,804
657RRSSILTETL RRFSVDDSSA PWSKPKQSFR QTGEVGEKRK NSILNSFSSV 706
707RKISIVQKTP LCIDGESDDL QEKRLSLVPD SEQGEAALPR SNMIATGPTF 756
757PGRRRQSVLD LMTFTPNSGS SNLQRTRTSI RKISLVPQIS LNEVDVYSRR 806
807LS
Name: CFTR_RAT [UniProt] [InterPro]
Description: Cystic fibrosis transmembrane conductance regulator (CFTR) (cAMP-dependent chloride channel) (Fragments).
Keywords: Chloride channel, Glycoprotein, Transmembrane, Chloride, ATP-binding, Ionic channel, Phosphorylation, Repeat, Transport
PaPbUcPdPePfPgPhP (4:1) P-sites: 192,202,230,242,264,295,322,340
193RRSSILTETL RRFSVDDAST TWNKAKQSFR QTGEFGEKRK NSILSSFSSV 242
243KKISIVQKTP LSIEGESDDL QERRLSLVPD SEHGEAALPR SNMITAGPTF 292
293PGRRRQSVLD LMTFTPSSVS SSLQRTRASI RKISLAPRIS LKEEDIYSRR 342
343LS
Name: CFTR_HUMAN [UniProt] [InterPro]
Description: Cystic fibrosis transmembrane conductance regulator (CFTR) (cAMP-dependent chloride channel).
Keywords: Transmembrane, Transport, Chloride channel, Repeat, Phosphorylation, Glycoprotein, ATP-binding, Chloride, 3D-structure, Disease mutation, Ionic channel, Polymorphism
PaUbUcPdPePfPgPhP (4:2) P-sites: 656,696,708,733,764,791,809
657RRNSILTETL HRFSLEGDAP VSWTETKKQS FKQTGEFGEK RKNSILNPIN 706
707SIRKFSIVQK TPLQMNGIEE DSDEPLERRL SLVPDSEQGE AILPRISVIS 756
757TGPTLQARRR QSVLNLMTHS VNQGQNIHRK TTASTRKVSL APQANLTELD 806
807IYSRRLS
Name: CFTR_SQUAC [UniProt] [InterPro]
Description: Cystic fibrosis transmembrane conductance regulator (CFTR) (cAMP-dependent chloride channel).
Keywords: Transmembrane, Phosphorylation, Repeat, Glycoprotein, ATP-binding, Chloride, Chloride channel, Ionic channel, Transport
PaPbPcPdPeUfUgPhP (4:2) P-sites: 657,667,686,701,714,798,818
658RRNSILTETF RRCSVSSGDG AGLGSYSETR KASFKQPPPE FNEKRKSSLI 707
708VNPITSNKKF SLVQTAMSYP QTNGMEDATS EPGERHFSLI PENELGEPTK 757
758PRSNIFKSEL PFQAHRRQSV LALMTHSSTS PNKIHARRSA VRKMSMLSQT 807
808NFASSEIDIY SRRLS
Name: SFR11_HUMAN [UniProt] [InterPro]
Description: Splicing factor arginine/serine-rich 11 (Arginine-rich 54 kDa nuclear protein) (p54).
Keywords: mRNA splicing, RNA-binding, Repeat, Nuclear protein, mRNA processing
PaPbPcPdPeUfPgUhP (4:2) P-sites: 245,256,273,283,291,344,397
246RRHSRSRSRS RRRRTPSSSR HRRSRSRSRR RSHSKSRSRR RSKSPRRRRS 295
296HSRERGRRSR STSKTRDKKK EDKEKKRSKT PPKSYSTARR SRSASRERRR 345
346RRSRSGTRSP KKPRSPKRKL SRSPSPRRHK KEKKKDKDKE RSRDERERST 395
396SKKKKS
Name: CYLC1_HUMAN [UniProt] [InterPro]
Description: Cylicin-1 (Cylicin I) (Multiple-band polypeptide I) (Fragment).
Keywords: Structural protein, Cytoskeleton, Spermatogenesis, Differentiation, Sperm, Repeat
PaPbPcPdPeUfUgUhP (4:3) P-sites: 239,253,273,290,306,411
240KKSSDAESED SKDAKKDSKK VKKNVKKDDK KKDVKKDTES TDAESGDSKD 289
290ERKDTKKDKK KLKKDDKKKD TKKYPESTDT ESGDAKDARN DSRNLKKASK 339
340NDDKKKDAKK ITFSTDSESE LESKESQKDE KKDKKDSKTD NKKSVKNDEE 389
390STDADSEPKG DSKKGKKDEK KGKKDS
Name: CYLC2_BOVIN [UniProt] [InterPro]
Description: Cylicin-2 (Cylicin II) (Multiple-band polypeptide II).
Keywords: Direct protein sequencing, Differentiation, Cytoskeleton, Repeat, Structural protein, Spermatogenesis, Sperm
PaUbPcUdPePfPgUhP (4:3) P-sites: 144,164,192,220,247,303
145RKLSKAKEEK PDEKKDLKKE RKDSKKGKES ATESEDEKAG AEKGAKKDRK 194
195GSKKGKETPS DSGSEKGDAK KDSKKSKKDS KGKESATESE GEKGDAKKDD 244
245KKGKKGSKKG KESATESEGE KGDAKKDDKK GKKGSKKGKE SATESEGEKG 294
295DAKKDDKKGK KGS
Name: GIN4_YEAST [UniProt] [InterPro]
Description: Serine/threonine-protein kinase GIN4 (EC 2.7.1.37).
Keywords: ATP-binding, Transferase, Kinase, Complete proteome, Serine/threonine-protein kinase
PaPbPcPdUePfP (4:3) P-sites: 439,449,467,479,507,536
440KRLSKNFSSK KKLSTIVNQS SPTPASRNKR ASVINVEKNQ KRASIFSTTK 489
490KNKRSSRSIK RMSLIPSMKR ESVTTKLMST YAKLAEDDDW EYIEKETKRT 539
540S
Name: RSP7_CAEEL [UniProt] [InterPro]
Description: Probable splicing factor, arginine/serine-rich 7 (p54).
Keywords: Nuclear protein, RNA-binding, Phosphorylation, mRNA splicing, mRNA processing, Alternative splicing
PaPbPcUdPePfUgP (4:3) P-sites: 279,288,302,330,352,417
280RRRSPSPRRR RDSRDRDRDR DRDRRRSRDR RSRSRDRDRD RDRKRSRSRD 329
330RKRRSRSRDN KDRDRKRSRS RDRRRRSKSR DRKRERSRSR SKDRKRDKKR 379
380SRSRSPEKRR DKEDRKTEKK ENENESSLRE KLLEKKAARK DS
Name: SON_HUMAN [UniProt] [InterPro]
Description: SON protein (SON3) (Negative regulatory element-binding protein) (NRE-binding protein) (DBP-5) (Bax antagonist selected in saccharomyces 1) (BASS1).
Keywords: DNA-binding, Alternative splicing, Nuclear protein, RNA-binding, Repeat
PaUbPcPdUePfPgP (4:3) P-sites: 1816,1837,1848,1870,1905,1936
1817KRDSSLRSRS KRSKSSEHKS RKRTSESRSR ARKRSSKSKS HRSQTRSRSR 1866
1867SRRRRRSSRS RSKSRGRRSV SKEKRKRSPK HRSKSRERKR KRSSSRDNRK 1916
1917TVRARSRTPS RRSRSHTPSR RRRS
Name: ATRX_DROME [UniProt] [InterPro]
Description: Transcriptional regulator ATRX homolog (X-linked nuclear protein) (dXNP) (d-xnp).
Keywords: ATP-binding, DNA damage, DNA repair, DNA-binding, Helicase, Hydrolase, Nuclear protein
PaPbUcUdPePfP (4:4) P-sites: 249,263,307,332,359
250KRVSLPKTKP AQKPKKMSSD SEEAATTSKK SRQRRSKSES EADSDYEAPA 299
300AEEEEEEERK SSGDEEEAAN SSDSEVMPQR KRRRKKSESD KGSSDFEPEE 349
350KQKKKGRKRI KKTS
Name: BOI2_YEAST [UniProt] [InterPro]
Description: BOI2 protein (BEB1 protein).
Keywords: Complete proteome, Cytoskeleton, Phosphorylation, SH3 domain
PaPbPcUdUePfUgUhP (4:4) P-sites: 626,632,644,687,776
627RRNSSLKRSS SASRTSSFKK SSFMLSPFRQ QFTDNAARSS SPEENPITSM 676
677PSEKNSSPIV DKKSSKKSRS KRRSVSAKEA EIFTETVKDD KNKRSASEAI 726
727KGETLKGKSL RQMTARPVAK KKQTSAFIEG LRSISVKEAM KDADFSGWMS 776
777KKGS
Name: CD2L7_HUMAN [UniProt] [InterPro]
Description: Cell division cycle 2-related protein kinase 7 (EC 2.7.1.37) (CDC2-related protein kinase 7) (CrkRS).
Keywords: Nuclear protein, ATP-binding, Kinase, Serine/threonine-protein kinase, Transferase
PcPdPeUfPgUhP (4:4) P-sites: 202,216,224,270,328
203KRETPKSYKT VDSPKRRSRS PHRKWSDSSK QDDSPSGASY GQDYDLSPSR 252
253SHTSSNYDSY KKSPGSTSRR QSVSPPYKEP SAYQSSTRSP SPYSRRQRSV 302
303SPYSRRRSSS YERSGSYSGR SPSPYGRRRS
Name: CFTR_CAVPO [UniProt] [InterPro]
Description: Cystic fibrosis transmembrane conductance regulator (CFTR) (cAMP-dependent chloride channel) (Fragment).
Keywords: Chloride channel, Glycoprotein, Ionic channel, Transport, ATP-binding, Chloride, Phosphorylation, Repeat, Transmembrane
PcPdPeUfPgP (4:4) P-sites: 61,71,87,138,168
62RRNSILTETL RRFSLEGDPS VSFNETKKQS FKQTGEFGEK RKNSILNQFN 111
112SITKFSIVPK TPLQISGIEE DSDDPVERRL SLVPDSEQSD GLLRKHVIHT 161
162GPTFQGSRRQ S
Name: DGKZ_HUMAN [UniProt] [InterPro]
Description: Diacylglycerol kinase, zeta (EC 2.7.1.107) (Diglyceride kinase) (DGK-zeta) (DAG kinase zeta).
Keywords: Nuclear protein, Alternative splicing, Multigene family, ANK repeat, Metal-binding, Kinase, Repeat, Phosphorylation, Phorbol-ester binding, Transferase, Zinc-finger, Zinc
PaPbPcUdPeUfUgUhP (4:4) P-sites: 20,33,48,72,184
21RRPSSVGLPT GKARRRSPAG QASSSLAQRR RSSAQLQGCL LSCGVRAQGS 70
71SRRRSSTVPP SCNPRFIVDK VLTPQPTTVG AQLLGAPLLL TGLVGMNEEE 120
121GVQEDVVAEA SSAIQPGTKT PGPPPPRGAQ PLLPLPRYVR RASSHCCPAD 170
171AVYDHALWGL HGYYRRLS
Name: DOT6_YEAST [UniProt] [InterPro]
Description: Disrupter of telomere silencing protein 6.
Keywords: Transcription, Nuclear protein, DNA-binding, Complete proteome, Transcription regulation
PbUcPdPeUfPgUhP (4:4) P-sites: 278,309,318,364,419
279RRSSFAYPQQ VAITTTPSSP NSSHVLLSSK SRRGSLANWS RRSSFNVSSN 328
329NTSRRSSMIL APNSVSNIFN VNNSGSNTAS TSNTNSRRES VIKKEFQQRL 378
379NNLSNSGGPT SNNGPIFPNS YTFMDLPHSS SVSSSSTLHK SKRGS
Name: FNBP3_HUMAN [UniProt] [InterPro]
Description: Formin-binding protein 3 (Huntingtin yeast partner A) (Huntingtin-interacting protein HYPA/FBP11) (Fas-ligand associated factor 1) (NY-REN-6 antigen) (HSPC225).
Keywords: Antigen, mRNA processing, Repeat, mRNA splicing, 3D-structure, Alternative splicing
PaPbUcUdPeUfPgP (4:4) P-sites: 811,822,872,919,944
812KKHSKKSKKH HRKRSRSRSG SDSDDDDSHS KKKRQRSESR SASEHSSSAE 861
862SERSYKKSKK HKKKSKKRRH KSDSPESDAE REKDKKEKDR ESEKDRTRQR 911
912SESKHKSPKK KTGKDSGNWD TSGSELSEGE LEKRRRT
Name: FNBP3_MOUSE [UniProt] [InterPro]
Description: Formin-binding protein 3 (Formin-binding protein 11) (FBP 11).
Keywords: mRNA splicing, Repeat, mRNA processing, Alternative splicing
PaPbUcUdPeUfPgP (4:4) P-sites: 807,818,868,915,940
808KKHSKKSKKH HRKRSRSRSG SESDDDDSHS KKKRQRSESH SASERSSSAE 857
858SERSYKKSKK HKKKSKKRRH KSDSPESDTE REKDKKEKDR DSEKDRSRQR 907
908SESKHKSPKK KTGKDSGNWD TSGSELSEGE LEKRRRT
Name: FXL10_HUMAN [UniProt] [InterPro]
Description: F-box/LRR-repeat protein 10 (F-box and leucine-rich repeat protein 10) (F-box protein FBL10) (JEMMA protein) (CXXC finger protein 2) (Protein containing CXXC domain 2).
Keywords: Coiled coil, Alternative splicing, Ubl conjugation pathway, Repeat, Leucine-rich repeat, Zinc-finger
PaPbPcUdPeUfP (4:4) P-sites: 764,774,791,815,867
765KRRSECEEAP RRRSDEHSKK VPPDGLLRRK SDDVHLRKKR KYEKPQELSG 814
815RKRASSLQTS PGSSSHLSPR PPLGSSLSPW WRSSLTYFQQ QLKPGKEDKL 864
865FRKKRRS
Name: FXL10_MOUSE [UniProt] [InterPro]
Description: F-box/LRR-repeat protein 10 (F-box and leucine-rich repeat protein 10) (F-box protein FBL10).
Keywords: Zinc-finger, Coiled coil, Alternative splicing, Repeat, Ubl conjugation pathway, Leucine-rich repeat
PaPbPcUdPeUfP (4:4) P-sites: 737,747,764,788,840
738KRRSECEEAP RRRSDEHPKK VPADGILRRK SDDVHLRRKR KYEKPQELSG 787
788RKRASSLQTS PGSSSHLSPR PPLGSSLSPW WRSSLTYFQQ QLKPGKEDKL 837
838FRKKRRS
Name: KCC4_YEAST [UniProt] [InterPro]
Description: Probable serine/threonine-protein kinase KCC4 (EC 2.7.1.37).
Keywords: Complete proteome, ATP-binding, Cell division, Cell cycle, Serine/threonine-protein kinase, Kinase, Cell shape, Transferase
PaUbPcPdUePfP (4:4) P-sites: 385,409,424,446,479
386KRSSTLSSSS SLLLNNRSIQ STPRRRTSKR HSREFSSSRK RSSFLLSSNP 435
436TDSSPIPLRS SKRITHINVA SANTQATPSG VPNPHKRNSK KRSSKRLS
Name: NBP1_YEAST [UniProt] [InterPro]
Description: NAP1-binding protein.
Keywords: Complete proteome
PcUdPePfPgP (4:4) P-sites: 30,54,75,110,137
31RKRSALRSRR KQMRPTGKSV LKRPRKVTDR KTEEKIRTNR RKTPKRRLTK 80
81IFQTIRDVFS NDNENMSKMQ NVCGDMTRIL KKRSQGRPSY MDTDTAKSRI 130
131LRSDAFKRKI S
Name: ORK1_DROME [UniProt] [InterPro]
Description: Open rectifier potassium channel protein 1 (Two pore domain potassium channel Ork1).
Keywords: Ion transport, Transmembrane, Transport, Ionic channel, Glycoprotein, Potassium transport
PaPbPcUdUeUfUgPhP (4:4) P-sites: 523,533,550,649,663
524KKQSPGAGRV KKFSMPDGLR RLFPFQKKRP SQDLERKLSV VSVPEGVISQ 573
574QARSPLDYYS NTVTAASSQS YLRNGRGPPP PFESNGSLAS GGGGLTNMGF 623
624QMEDGATPPS ALGGGAYQRK AAAGKRRRES IYTQNQAPSA RRGS
Name: PMD1_YEAST [UniProt] [InterPro]
Description: Negative regulator of sporulation PMD1.
Keywords: Sporulation, Complete proteome, Meiosis, Kelch repeat, Repeat
PaUbPcUdPeUfUgPhP (4:4) P-sites: 813,834,866,950,970
814RRASTVGTTT NSSVDDGFSS IRRASHPLQS YIIAKSSPSS ISKASPAEKA 863
864FSRRKSSALR FIASPNQSRQ TSFASTASTA SVVSSTSGRR RNSNQISHLG 913
914SSASLPNSPI LPVLNIPLPP QEKIPLEPLP PVPKAPSRRS SSLAEYVQFG 963
964RDSPVASRRS S
Name: RNPC2_HUMAN [UniProt] [InterPro]
Description: RNA-binding region containing protein 2 (Hepatocellular carcinoma protein 1) (Splicing factor HCC1).
Keywords: Nuclear protein, Activator, Transcription regulation, Transcription, mRNA processing, RNA-binding, mRNA splicing, Alternative splicing, Polymorphism, Phosphorylation, Repeat
PbPcPdPeUfP (4:4) P-sites: 34,46,62,70,121
35RKKSKSRSRS HERKRSKSKE RKRSRDRERK KSKSRERKRS RSKERRRSRS 84
85RSRDRRFRGR YRSPYSGPKF NSAIRGKIGL PHSIKLSRRR S
Name: RNPC2_MOUSE [UniProt] [InterPro]
Description: RNA-binding region containing protein 2 (Coactivator of activating protein-1 and estrogen receptors) (Coactivator of AP-1 and ERs) (Transcription coactivator CAPER).
Keywords: Activator, Repeat, Nuclear protein, Phosphorylation, Alternative splicing, mRNA processing, mRNA splicing, Transcription, RNA-binding, Transcription regulation
PbPcPdPeUfP (4:4) P-sites: 34,46,62,70,121
35RKKSKSRSRS HERKRSKSKE RKRSRDRERK KSKSRERKRS RSKERRRSRS 84
85RSRDRRFRGR YRSPYSGPKF NSAIRGKIGL PHSIKLSRRR S
Name: SFR12_MOUSE [UniProt] [InterPro]
Description: Splicing factor, arginine/serine-rich 12 (Serine-arginine-rich splicing regulatory protein 86) (SRrp86).
Keywords: Spliceosome, mRNA processing, mRNA splicing, Alternative splicing, Nuclear protein
PaUbPcPdPeUfUgUhP (4:4) P-sites: 212,234,250,263,367
213RKRSQSKHRS RSHNRSRSRQ KDRRRSKSPH KKRSKSRERR KSRSRSRSRD 262
263KRKDTREKVK ERVKEKEREK EREREKDREK DKERGKNKDK DREKEKDHEK 312
313ERDKEKEKEQ DKDKEREKDR SKETDEKRKK EKKSRTPPRS YNASRRSRST 362
363SRERRRRRS
Name: SGS3_DROYA [UniProt] [InterPro]
Description: Salivary glue protein Sgs-3 precursor.
Keywords: Signal, Repeat
PaUbPcPdPeUfP (4:4) P-sites: 106,126,136,146,197
107RRPTTRSTTT RHTTTTTTTT RRPTTTTTTT RRPTTTTTTT RRPTTTTTTT 156
157RLPTTRSTTT RHTTKSTTSK RPTHETTTTS KRPTQETTTT TRRAT
Name: VE2_HPV14 [UniProt] [InterPro]
Description: Regulatory protein E2.
Keywords: Repressor, DNA replication, Early protein, Activator, Transcription regulation, DNA-binding, Trans-acting factor, Transcription, Nuclear protein
PbPcPdUeUfUgPhP (4:4) P-sites: 235,250,265,344,358
236RKQSQQANTK GRRYGRRPSS RTRRTTETRQ RRRSRSKSRS RSRSRSRLRS 285
286RSRSQSSERR SRYRSRSRSR QKEVSRITTT TRGRGRGSSS TSSKRSQRAR 335
336GRGRGGSRGR RSSSTSPTSS KRSRRES
Name: YBF4_YEAST [UniProt] [InterPro]
Description: Hypothetical 59.2 kDa protein in PTC3-SAS3 intergenic region.
Keywords: DNA-binding, Complete proteome, Hypothetical protein, Nuclear protein
PaUbPcPdPeP (4:4) P-sites: 276,294,304,314,342
277RRNSFIPSTQ IPHSTTKTRK NSHSVISSRR SSFNMMHSRR SSFNSHAPTE 326
327PISRRASLVV SPYMSPRRLS