Gene Details:
- Gene ID: 491858
- Gene Name: ARALYDRAFT_491858
- Gene Family: CPP Family
- Description: CPP Family protein
- Species: Arabidopsis lyrata
- Source: CPP family gene from PlantTFDB
Protein Features:
Annotation Proteins:
- Refseq: XP_020874783.1 — protein tesmin/TSO1-like CXC 5
- Swissprot: Q9SZD1 — TCX5_ARATH; Protein tesmin/TSO1-like CXC 5
- TrEMBL: D7MD61 — D7MD61_ARALL; Uncharacterized protein
- STRING: fgenesh2_kg.7__1291__AT4G29000.1 — (Arabidopsis lyrata)
Family Introduction:
- CPP-like (cystein-rich polycomb-like protein) proteins are members of a small transcription factor family whose typical feature is the existence of one or two similar Cys-rich domains termed the CXC domain. The members of this family are widely present in plants and animals but absent in yeast, and the CXC domains of CPP-like proteins show high conservation across species from amoebae though plants to mammals (Andersen et al. 2007; Riechmann et al. 2000). CPP-like genes play an important role in development of reproductive tissue and control of cell division in plants.
Literature:
- Molecular evolution of the CPP-like gene family in plants: insights from comparative genomics of Arabidopsis and rice. DOI: 10.1007/s00239-008-9143-z ; PMID: 18696028
Sequences:
CDS Sequence:
- >491858|Arabidopsis_lyrata|CPP|491858
ATGGGTGAAGACGGCGGAGGAGGCGAATTCCCGCCAAAAAAGGACGGTGTTGAAGAGGGTTTTCCGACGAAGAAGCCAGCAAGACAGCTTGACTTCACTGGTGGCTCTGATGAACAATCACTGTCTAAAGCAGCGGCTCCTACCGTCGTAGCTACGGCGGTGAAACCCGTCGTTACTAGCTCCATCCCTTCGACGATACGGCCGGGTGTGACCATTGCAATCGGTCAGGTACGACCGACGTTACCAATGGCGACGACGTCAAATCCGCCGTCACAATCACAAATCCTTAACGCGCCAATCAGGCATCCAAAACCGGAATCTCCAAAAGCTAGAGGTCCAAGGCCTATTGTGGAAGGTAGAGATGGAACTCCTCAGAAGAAGAAGCAGTGTAACTGTAAACACTCACGCTGCTTGAAACTGTACTGTGAATGCTTTGCATCCGGAACATATTGTGACGGTTGTAACTGTGTAAATTGTTTCAACAATGTTGATAATGAACCTGCAAGACGAGAAGCTGTGGAAGCAACTCTGGAGCGGAATCCATTTGCGTTCAGGCCTAAAATTGCTAGCAGTCCACATGGTGTGCGGGATAAAAGGGAGGATATTGGTGAAGTTGTGTTGTTAGGGAAACATAACAAAGGATGCCACTGCAAGAAATCGGGATGCCTTAAGAAGTATTGTGAGTGCTTTCAAGCAAACATTCTTTGTTCTGAGAACTGCAAATGCTTGGATTGCAAAAACTTTGAAGGAAGTGAAGAGAGACAAGCTCTGTTTCATGGTGAACATGCCAACCACATGGCTTATCTTCAACAGGCAGCAAATGCAGCCATTACTGGAGCTGTTGGTTCCTCCGGCTTTGCTCCTTCTCCAGCACCTAAGAGAAGGAAAGGCCAAGAGATTTTGTTCAACCAAGCAACCAAAGATTCATCTAGACTTGGCCAGTTTCCACAGGTAAATAGTGGAAGAGCTAGTGGACCAACCTCAGGCTCATCTCCGTCACCTGTTTCTCGTGCTGGTGGCAATGCATCGTCAGCTCCATCAAAGTTTGTATATAGGTCCCTTTTAGCAGATATAATCCAACCGCATGATGTTAGAGCACTTTGCTCTGTTTTAGTCGCTGTAGCTGGAGAAGCAGCAAAGACATCAACAGATAAAAGAAATGAAATAGAAAATCGTGTGGAAGATCAAACAGAAACTTCTTTAGCTTCTTCTGCACAAGATCAGCCCCAAGGCGATAATGATGCAGCTGATATGGAGATGGTTGCAACTGATGGGAATCAAGCTGATAAGTCTGGAGCAGAGGAATCCAATTCAGATGGTGCTGATGCCTCGAAAGGGAATCCACTATCTCCAGCAACTTTGGCTTTGATGTGCGATGAACAAGACACTATCTTTATGGTGGCAGCACCTTCGCCTAATGGTGCAGTCGACCCCGGTGGCCGCAGAACGAACTCACAGGGCCAGTCAGAGATTTACGCAGAGCAGGAGAGACTGGTATTAACCAAATTCAGAGACTGCCTTAGTCGACTTATCTCTTACGCAGAAATAAAAGAATCAAAGTGTTTATCTTTAGCAAGAATGCACATACAGCCATCTGCAACCGCGACTGTGAAAACCGAGAATGGGGTTCAACAGCAAGTACCAATTGTGAATGGAGCTTCGCGAACCAATTCTCAACCTACACTCAACAAACCGCAGCCTATGCAGCTGATAAACACAACATCAGCATCAGCAGCAGCAGCAGCTGCGACAAACACTCATCATCTTCATAAACCTCCAGCTTTATCAGAGAAGAAAGACCCCTGA ATGTCCACCACCAACTCGACCGACGCCGACACGCCACCACAACAACCACCGCGTCCCACGATCACCCTCCCACCACGTCCCTCCGTGGAAGCATTCTTCGCCGGCGCCGCCAGTCCTGGACCTATGACACTAGTTTCGAGCTTCTTCGCCACCGAATCCGCCACCTTCTCTCAGCTTCTCGCCGGTGCCATGGCTTCTCCTCTCGCTTTCTCAACCGCCGTCGATAGTAAAGAAGATGATGGGACCTACAGGTTCAAACAGAGTAGACCTATGAACTTGGTCATTGCTCGTTCTCCAGTTTTCACTATTCCTCCTGGTTTGAGTCCTTCTGGTTTTCTTAACTCACCTGGTTTCTTTTCTCCTCAGAGTCCCTTTGGGATGTCACACCAACAGGCTTTAGCACAAGTTACAGCTCAAGCTGTCCTAGCACAATCTCATATGCACATGCAACCTGATTACCAGCCAGTCTCACTAGATTACCAGCCGGTCTCACTTGAAGCTCCAACCGAACCACCAGTAGAGAACCCATCTTTTACTCGAAATGAAACTTCAGAGGTGCAAGTAGTAGCACCTATATCAGAACCTAGAAATGCTCAAACGGAAACGTCAGAGCTTTCGTATTCTGATAAGAAACAGCAACCATCTTCATTACCAATTGACAAACCTGCAGATGATGGCTACAACTGGCGTAAGTATGGACAGAAGCAGGTTAAGGGGAGCGAGTATCCGCGGAGCTACTACAAATGCACGCATCTGAATTGTCCCGTGAAGAAAAAAGTTGAGCGTGCTCCTGATGGTCATATAACTGAAATTATCTATAAAGGCCAGCATAACCATGAGAAGCCGCAAGCAAATCGGCGCATCAAAGATAATAGTGACGTGAATGGAAATGCTAATGTTCAACCTAAGTCTGACTCAAACTCACAAGGTTGGTTTGCAAATTCAAACAAAACTAGTGAAAGTGTGCCTGATTGTTCTGTGGTTGAGAGTGACCAGACCTCTAACCAAGGCGCTCCTAGGCCATTGCCTGGATTGAGTGAGAGTGAGGAAGTTGGTGATGCAGGCAATAAGGAAGAGGGAGATGACTGTGAGCCAAACCCTAAGAGAAGGAGCATTGAACCTGTGGTTCCTGAAGTACCTCTATCTCAGAAGACTGTCACAGAACCCAAAATCATTGTGCAAACAAGAAGTGAAGTTGATCTTTTGGATGACGGCTACAGGTGGCGAAAGTATGGTCAGAAAGTGGTGAAGGGAAATCCTCATCCAAGGAGTTATTACAAGTGCACTAGTGCAGGCTGCAACGTGCGTAAACATGTCGAGAGAGCATCAACAGACCCAAAAGCTGTCATAACTACATATGAGGGAAAACATAACCACGACGTCCCTGCCGCTAGACACAGCAGCCACAACACAGCCAGTAGCAATTCGATGCCATCAAAACCGCAGCCTGTGGCAGCAGAGAAGCATCCTTTGCTTAAAGATATGGAGTTCGGAAACAACAATCAGAGACCTGTACATTTACGCTTAAAAGAAGAGCAAATCATCGTATAA
Protein Sequence:
- >491858|Arabidopsis_lyrata|CPP|491858
MGEDGGGGEFPPKKDGVEEGFPTKKPARQLDFTGGSDEQSLSKAAAPTVVATAVKPVVTSSIPSTIRPGVTIAIGQVRPTLPMATTSNPPSQSQILNAPIRHPKPESPKARGPRPIVEGRDGTPQKKKQCNCKHSRCLKLYCECFASGTYCDGCNCVNCFNNVDNEPARREAVEATLERNPFAFRPKIASSPHGVRDKREDIGEVVLLGKHNKGCHCKKSGCLKKYCECFQANILCSENCKCLDCKNFEGSEERQALFHGEHANHMAYLQQAANAAITGAVGSSGFAPSPAPKRRKGQEILFNQATKDSSRLGQFPQVNSGRASGPTSGSSPSPVSRAGGNASSAPSKFVYRSLLADIIQPHDVRALCSVLVAVAGEAAKTSTDKRNEIENRVEDQTETSLASSAQDQPQGDNDAADMEMVATDGNQADKSGAEESNSDGADASKGNPLSPATLALMCDEQDTIFMVAAPSPNGAVDPGGRRTNSQGQSEIYAEQERLVLTKFRDCLSRLISYAEIKESKCLSLARMHIQPSATATVKTENGVQQQVPIVNGASRTNSQPTLNKPQPMQLINTTSASAAAAAATNTHHLHKPPALSEKKDP*