Gene Details:
- Gene ID: Thecc1EG026389t4
- Gene Name: TCM_026389
- Gene Family: CPP Family
- Description: CPP Family protein
- Species: Theobroma cacao
- Source: CPP family gene from PlantTFDB
Protein Features:
Annotation Proteins:
- Refseq: XP_007030619.2 — PREDICTED: protein tesmin/TSO1-like CXC 2 isoform X3
- TrEMBL: A0A061F265 — A0A061F265_THECC; Tesmin/TSO1-like CXC domain-containing protein, putative isoform 4
- STRING: EOY11118 — (Theobroma cacao)
Family Introduction:
- CPP-like (cysteine-rich polycomb-like protein) proteins are members of a small transcription factor family whose typical feature is the presence of one or two similar Cys-rich domains termed the CXC domain. Members of this family are widely present in plants and animals but absent in yeast, and the CXC domains of CPP-like proteins are highly conserved across species from amoebae through plants to mammals (Andersen et al. 2007; Riechmann et al. 2000). CPP-like genes play an important role in the development of reproductive tissue and the control of cell division in plants.
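Because the CXC domain is described above as Cys-rich, a simple sliding-window count of cysteines can give a rough first look at where such regions might lie in a protein sequence. The sketch below is illustrative only: the function name `cysteine_rich_windows`, the window length, and the `min_cys` threshold are assumptions chosen for the example, not values taken from PlantTFDB or the cited papers, and this is not a substitute for a proper domain search.

```python
def cysteine_rich_windows(protein, window=40, min_cys=6):
    """Return (start, end, cys_count) for every window holding >= min_cys cysteines.

    start is 0-based and end is exclusive. The window size and threshold
    are illustrative assumptions, not a published CXC-domain definition.
    """
    hits = []
    # If the sequence is shorter than the window, scan it once as a whole.
    last_start = max(len(protein) - window + 1, 1)
    for start in range(last_start):
        segment = protein[start:start + window]
        count = segment.count("C")
        if count >= min_cys:
            hits.append((start, start + len(segment), count))
    return hits


if __name__ == "__main__":
    # A fragment copied from the protein sequence in this record.
    fragment = "CKRCNCKRSKCLKLYCDCFAAGLYCIEPCSCQDCF"
    for start, end, count in cysteine_rich_windows(fragment, window=20, min_cys=5):
        print(start, end, count)
```

Overlapping windows are reported individually; merging adjacent hits into a single region is left out to keep the sketch short.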
Literature:
- Molecular evolution of the CPP-like gene family in plants: insights from comparative genomics of Arabidopsis and rice. DOI: 10.1007/s00239-008-9143-z ; PMID: 18696028
Sequences:
CDS Sequence:
- >Thecc1EG026389t4|Theobroma_cacao|CPP|Thecc1EG026389t4
ATGGATACACCAGATAAAACCCAGATCACTCCAACTCCTTCTCTTTCCAAATTTGAGGATTCACCTGTCTTTAAGTACATCAACAGTCTTTCACCTATTGAGCTAGCCAAGTTCAGGCAGACAGATAATGCTTTCAATTCGCTAGCTTTTTTGTCCCCTTCATCATTGTTTCCTTCCCCGCAGATCAGTTGCCACAGGGAATCCAGGTTTTCGGTTAAGAGGCATCATTTCTCAGCAGCTTCAAACTCTAGTGTCCTGCAGAGTAGTAATGACTTAAATACTGATGAAGGAGCTTCAAAAGCCATTGAGCAATCTTACTTGTATGATGAACAACCGGGATGTCTCAATAGTGGCAGTTCATCCAAAGGAGTTAGTAGTGATCAACTCGATGACCAATCAGATTTAGCAATCGAGCTGCCAAGAACCTTGAAATATGATTGTGGGAGCCCTGATGGTAACTTGGAGCCTTGTGATGAGATACTGAAAAATACAAGTGAGAAAGTGGCAGGACATGAAGCATCTCCTTTCCAACACAATAAAGATGAAGGGGAAGAGAGACAAATGTCCTTTGAAAATGAAAGAGATCTGCGTAAAATACGTCGAATTATGAGGAGTGAAGAATCAGCAGGATGTGACTGGGTGGCAATTGTTTCTGATGTTGCTGATTTGTTGACCACGAATTCATCCATCATTTATGAGAATATTGAGGGTCAGGATCGGAGAACAGCAGACCCTGGGACAACCTCTTTTATATCAACTATCCTGCAGTTCCCGCTAGATAATTCCAATAATTTAGAAAATACTGAAACTGGTGATCCTAGTGGTTCCTGCAAACAAAGCAAGTTAGGAGTGCCAGTAACAGATCAGACACCTGCCATCCTGTCTACTTGTCTACTGGACAAGCTAGTTGTCAGTGATTCAGGTTTGAACAAGGATGATAAGGGGGAGAAGTGTAACCAGTCCAGCCATCAGCAGCGTAGCATACGACGGCGCTGTCTAGTATTTGAGAAGTCACCTGGTTTTGGTTTGCACTTGAATTCTCTTGCAAATACCTCAAACGATCAGAGTCCTCTTAGTAAATTAACTCCAAGCACTATGAAGAGGGATGAGGTTCCTCATCACAACAAAGCTGTGGTTACAGACAATTCTCCTGAGACACCTGCTACTGTCAGTGGCAATGAGACTGACCTTAATAGTCCTGAAAAGAAGAGGACAAAGTTTGAACATGTTGAGGAGAATGCAGCATGCAAGCGCTGTAACTGTAAGAGGTCAAAATGCTTGAAGCTTTATTGCGACTGCTTTGCTGCTGGTCTGTACTGTATTGAGCCTTGTTCATGCCAAGATTGCTTCAATAAGCCAATTCATGAAAACAAGGTTCTAGAGACTCGCAGACAGATTGAATCTCGCAACCCACTAGCATTTGCTCCCAAAGTGATTAGAAGCACGGACAGTGTTTCAGACTCCGGGGGTGAAACTAACAAAACTCCTGCTTCAGCCAGGCATAAAAGAGGATGCAATTGTAAAAAATCAAGTTGCTTAAAGAAATACTGCGAATGTTTTCAGGCTGGTGTTGGATGCTCCCCCAGCTGTAGATGTGAAGGTTGTAAAAATAGATTTGGTCGGAAGGGTGGTGAGTCTTTATCTGGTACTGTTTGGTTGAATACAAATATTAATGTTTGGAACTGA
Protein Sequence:
- >Thecc1EG026389t4|Theobroma_cacao|CPP|Thecc1EG026389t4
MDTPDKTQITPTPSLSKFEDSPVFKYINSLSPIELAKFRQTDNAFNSLAFLSPSSLFPSPQISCHRESRFSVKRHHFSAASNSSVLQSSNDLNTDEGASKAIEQSYLYDEQPGCLNSGSSSKGVSSDQLDDQSDLAIELPRTLKYDCGSPDGNLEPCDEILKNTSEKVAGHEASPFQHNKDEGEERQMSFENERDLRKIRRIMRSEESAGCDWVAIVSDVADLLTTNSSIIYENIEGQDRRTADPGTTSFISTILQFPLDNSNNLENTETGDPSGSCKQSKLGVPVTDQTPAILSTCLLDKLVVSDSGLNKDDKGEKCNQSSHQQRSIRRRCLVFEKSPGFGLHLNSLANTSNDQSPLSKLTPSTMKRDEVPHHNKAVVTDNSPETPATVSGNETDLNSPEKKRTKFEHVEENAACKRCNCKRSKCLKLYCDCFAAGLYCIEPCSCQDCFNKPIHENKVLETRRQIESRNPLAFAPKVIRSTDSVSDSGGETNKTPASARHKRGCNCKKSSCLKKYCECFQAGVGCSPSCRCEGCKNRFGRKGGESLSGTVWLNTNINVWN*
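The CDS and protein records above appear to be related by translation with the standard genetic code, with the trailing `*` marking the stop codon. As a consistency check, the minimal sketch below translates a CDS string codon by codon. The function name `translate` and the use of `X` for unrecognized codons are assumptions made for this example. The codon table is the standard nuclear code (NCBI translation table 1).

```python
from itertools import product

# Standard genetic code (NCBI table 1), with codons enumerated in TCAG order.
_BASES = "TCAG"
_AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(_BASES, repeat=3), _AMINO)
}


def translate(cds):
    """Translate a CDS in frame 0; stop codons become '*', unknown codons 'X'."""
    cds = cds.upper().replace("U", "T")
    protein = []
    # Ignore any incomplete trailing codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        protein.append(CODON_TABLE.get(cds[i:i + 3], "X"))
    return "".join(protein)


if __name__ == "__main__":
    # First 24 and last 12 nucleotides of the CDS in this record.
    print(translate("ATGGATACACCAGATAAAACCCAG"))  # MDTPDKTQ
    print(translate("GTTTGGAACTGA"))              # VWN*
```

Both fragments match the start (`MDTPDKTQ`) and end (`VWN*`) of the protein sequence listed in this record. Translating the full CDS and comparing it against the full protein line would extend the same check to the whole entry.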