Gene Details:
- Gene ID: Glyma.17G208500.1.p
- Gene Name: GLYMA_17G208500
- Gene Family: CPP Family
- Description: CPP Family protein
- Species: Glycine max
- Source: CPP family gene from PlantTFDB
Protein Features:
Annotation Proteins:
- Refseq: XP_025982093.1 — cysteine-rich polycomb-like protein isoform X1
- Refseq: XP_025982094.1 — cysteine-rich polycomb-like protein isoform X1
- TrEMBL: A0A0R0FP02 — A0A0R0FP02_SOYBN; Uncharacterized protein
- TrEMBL: A0A445G9H3 — A0A445G9H3_GLYSO; Protein tesmin/TSO1-like CXC 3 isoform A
- STRING: GLYMA17G31399.1 — (Glycine max)
Family Introduction:
- CPP-like (cystein-rich polycomb-like protein) proteins are members of a small transcription factor family whose typical feature is the existence of one or two similar Cys-rich domains termed the CXC domain. The members of this family are widely present in plants and animals but absent in yeast, and the CXC domains of CPP-like proteins show high conservation across species from amoebae though plants to mammals (Andersen et al. 2007; Riechmann et al. 2000). CPP-like genes play an important role in development of reproductive tissue and control of cell division in plants.
Literature:
- Molecular evolution of the CPP-like gene family in plants: insights from comparative genomics of Arabidopsis and rice. DOI: 10.1007/s00239-008-9143-z ; PMID: 18696028
Sequences:
CDS Sequence:
- >Glyma.17G208500.1.p|Glycine_max|CPP|Glyma.17G208500.1.p
ATGATGGATTCTCCAGAACCTTCCAAGAACAACAATGGCAGTTCTTCTTCTGCCTCCACTCTCAACAACAACAACAACAACGATGATGCTCCTTCATCGGAATCCCCTCAAGTTCAAGAATCCCCCTTTCTTAGATTTGTGAAGACTTTATCTCCAATTCCCACCAAGGCTTCTCATATGACACAAGGTTGTGTAGGGCTCAGTTCTCCACCCCTTGTGTTCAAGTCTCCACGTATTAGTCACCGTGAAACACAGCTCACGAAAAGGCCCCAAGGCACTCAGTCATTCGGTGGAGTAATACCTCAAAGTGTAAATGAAGGCAACAGGCTTGGTGAAGCTCCTGGAGATTCCAGGACATCAAATTCTCATCAGTCACTGCCAGAGAGGTTTATAAATGACACTCAGCAGGTTTTTGACTTCAAGAATGATGAAAATACTCAATACTACAGCTCTCCATCATGCATTGATAAATATTTGGTGGATCCTGGGGATATTGATCAAATGTATTCAGCTGACCAGGATGTGCAACAACAATCTACAGATGCAGCTGAAACATCACTAAGTGACCAAACTCATTCAAAGAATAATATATTAAATTTTGACAGGAAAGATGGTCCTGGTGATAAAGTGGAAGAATCTTTGCCTTTGTCTGAAGATTTTAACAAGGTTCATCTAGAAAAGGCAGCATATGGTGAAGAACCTGAAAAGATGGAAGGGGAGAAAAATGATGTTGAATGGTCTTCTCAAGAGCCTGCAAAGTTAGAGTCCATTTTAGCTGCAGATGGTTTTGATAAACGATATAGTCATGGTCCACTTCCTCAGTGGATGCCTAATCCCCTGCAGGATGTTAAAGGATGTGAAGATTACAATGAGATGGTGCCAACCTCACATGTAACTGCAGAGAATATCTTGCAAGATGGTTCCGAGGCTACTCTAAAGCACCATGGCATTCGTAGACGCTGTCTACAATTTGGTGAAGCTGCCTCAAATGCTCTTGGAAGAAATGTGAAACTGAATGCAGCTTCAAACACAATGATAACAGTCAAGCCGTCTGAACTTGTCACTTCCTTGTGTCCTCGGCGAGGTAGTGGAAACTTCCCTTCAACAAGTCCCAAGCCATCAGGTATCGGGTTGCATTTAAATAGCATCATAAATGCTATTCCAATTGATCAGGCTGCTACTACTGGTGTGAGATTATCAGATAGTTCACAGGGGATGAAATCCACATCTTCAATAAGGTTACAAAGAATGGAGAATGTGAAGAGATCCATACTTTCATCTAATGTCGATGGGCGATCATTAGTTGATACAAGAACTGAGAGCCATGAAATTGATGATACAGTGGCTACAGATACTGGAAATTCTGAGGACCTCAACCAACCTCCTAGCCCCTGCAAGAAAAAGAAGAAAACATCAGTCACTGCTGATGACAATGGCTGTAAACGGTGCAACTGTAAGAAGAGTAAATGTTTAAAACTTTACTGTGATTGTTTTGCTGCTGGAACGTACTGTACCGACCCTTGTGCTTGCCAAGGCTGCTTAAACAGACCAGAATATGTAGAGACAGTTGTCGAGACTAAACAACAAATTGAATCCCGTAATCCAATTGCATTTGCTCCCAAGATTGTTCAGCCTACCACTGATATTTCTTCACATATGGATGATGAAAATCTGACAACACCGTCATCGGCAAGGCACAAAAGAGGTTGCAATTGCAAAAGGTCAATGTGTCTGAAAAAATATTGTGAATGTTATCAGGCTAATGTTGGATGCTCTAGTGGATGCCGATGTGAGGGATGTAAGAATGTCCATGGCAAGAAAGAAGATTATGTTGCATTTGGACATACTTCGAGTAAAGAAAGGGTGAGCAGTATTGTTGAAGAAGGATCAGACTGCACTTTTCACAATAAACTGGAAATGGTGGCTAGCAAGACTGTTTATGATCTACACTGCCTCTCACCTATAACGCCATCATTGCAATGTTCTGACCAAGGCAAGGAGGATGCTAAATCCAGAGTTATTTCTGGAAACTATCTACCATCCCCTGAATCTGATGTCAATATGTTGGCATCTTGTACAAACTACACTAAGTCTTCTGAAAATTTGCATGGCAGTGAAGCACTTCTGGACACAAATGAGATGTTGGGAAATACCCCCTATGATTCCCAAATTGAATGCAGCGATGCTGCCTTACTTCAGCTTACTCCTCTGCCTAATCCTGAGCAGTCTGGCACTTCATCATTCTCATCTGTACCAAATGAGTGTGCAAAGATTACTCACTCCAGACTCTCCCATGGATGTATTCGCCAGTTACCTGGCGGTTCTCTTCGTTGGCGTAGTTCCCCGCTTACCCCGAGCACTAGAGTAGGTGAAGCACAATATTTACAGTGTTCTGAATCAGATAGCAAGCTCTTTGACATTCTAGAAAATGAAACCCCTGATATACTGAAGGAAGCTTCAACTCCTATGACGTCTGTCAAAGTAAATTCCCCAACCCAAAAACGAGTTTCTCCTCCCCAGAGTTGTCATATTGGGATTGGATCAAGCTCCTCTGGAGGATTGAGAAGTGGCCGGAAATTTATATTGAAAGCTGTACCTACTTTCCCTTCATTGTCTCCTTGCATTAATTCCAAAAGCAATGGCGATGAGGATTCTTGTAACAGTCCGAGCAAGTCACCTCTCAAAGCTAATGAGTGCCCTCAACGTGAGAGTTAA
Protein Sequence:
- >Glyma.17G208500.1.p|Glycine_max|CPP|Glyma.17G208500.1.p
MMDSPEPSKNNNGSSSSASTLNNNNNNDDAPSSESPQVQESPFLRFVKTLSPIPTKASHMTQGCVGLSSPPLVFKSPRISHRETQLTKRPQGTQSFGGVIPQSVNEGNRLGEAPGDSRTSNSHQSLPERFINDTQQVFDFKNDENTQYYSSPSCIDKYLVDPGDIDQMYSADQDVQQQSTDAAETSLSDQTHSKNNILNFDRKDGPGDKVEESLPLSEDFNKVHLEKAAYGEEPEKMEGEKNDVEWSSQEPAKLESILAADGFDKRYSHGPLPQWMPNPLQDVKGCEDYNEMVPTSHVTAENILQDGSEATLKHHGIRRRCLQFGEAASNALGRNVKLNAASNTMITVKPSELVTSLCPRRGSGNFPSTSPKPSGIGLHLNSIINAIPIDQAATTGVRLSDSSQGMKSTSSIRLQRMENVKRSILSSNVDGRSLVDTRTESHEIDDTVATDTGNSEDLNQPPSPCKKKKKTSVTADDNGCKRCNCKKSKCLKLYCDCFAAGTYCTDPCACQGCLNRPEYVETVVETKQQIESRNPIAFAPKIVQPTTDISSHMDDENLTTPSSARHKRGCNCKRSMCLKKYCECYQANVGCSSGCRCEGCKNVHGKKEDYVAFGHTSSKERVSSIVEEGSDCTFHNKLEMVASKTVYDLHCLSPITPSLQCSDQGKEDAKSRVISGNYLPSPESDVNMLASCTNYTKSSENLHGSEALLDTNEMLGNTPYDSQIECSDAALLQLTPLPNPEQSGTSSFSSVPNECAKITHSRLSHGCIRQLPGGSLRWRSSPLTPSTRVGEAQYLQCSESDSKLFDILENETPDILKEASTPMTSVKVNSPTQKRVSPPQSCHIGIGSSSSGGLRSGRKFILKAVPTFPSLSPCINSKSNGDEDSCNSPSKSPLKANECPQRES*