Information report for Thecc1EG026383t1
Gene Details
Functional Annotation
- Refseq: XP_007030607.2 — PREDICTED: trihelix transcription factor GTL2
- TrEMBL: A0A061F9D3 — A0A061F9D3_THECC; Duplicated homeodomain-like superfamily protein, putative
- STRING: EOY11109 — (Theobroma cacao)
- GO:0045893 — Biological Process — positive regulation of transcription, DNA-templated
- GO:0005634 — Cellular Component — nucleus
- GO:0001158 — Molecular Function — enhancer sequence-specific DNA binding
- GO:0005516 — Molecular Function — calmodulin binding
Family Introduction
- GT factors constitute a plant-specific transcription factor family whose DNA-binding domain binds GT elements. The DNA-binding domain of GT factors, rich in basic and acidic amino acids and in proline and glutamine residues, adopts a characteristic trihelix (helix-loop-helix-loop-helix) structure that determines specific binding to GT elements; GT factors are therefore also called trihelix transcription factors. GT elements are highly degenerate cis-elements with A/T-rich core sequences (Villain et al. 1996; Wang et al. 2004). Interactions between GT factors and GT elements have been implicated in the complex transcriptional regulation of many plant genes.
Homologs
- Actinidia chinensis: Achn009891
- Citrus sinensis: orange1.1g007139m
- Fragaria vesca: mrna16528.1-v1.0-hybrid
- Glycine max: Glyma.19G243500.2.p, Glyma.19G243500.1.p
- Gossypium arboreum: Cotton_A_14977_BGI-A2_v1.0
- Gossypium hirsutum: Gh_A11G2296, Gh_D11G2607
- Juglans regia: WALNUT_00017100-RA, WALNUT_00032082-RA
- Malus domestica: MDP0000209313
- Manihot esculenta: Manes.09G154300.1.p, Manes.08G132700.1.p
- Populus trichocarpa: Potri.013G039100.2, Potri.005G051700.1, Potri.013G039100.1
- Prunus mume: XP_008218241.1
- Prunus persica: Prupe.6G341000.1.p
- Ricinus communis: 30128.m008610
- Ziziphus jujuba: XP_015875344.1
Sequences
CDS Sequence:
- >Thecc1EG026383t1|Theobroma_cacao|Trihelix|Thecc1EG026383t1
ATGTTTGATGGAGTACCAGACCAGTTTCACCAATTCATAGCCTCATCAGCAGCAGCAGCAGCAGCAGCAGCAGTAGCAGCTGCAAGAACCACAACCCTCCCTCTCCCTCTCTCTTTCCCTCCCCTTCATCTTGCTAACTCTTCAAATGGTTTCACTTCCTTTGACACTTTGTACACTTCCAACTCACATAACCAAGTACCCCCTCAGCTGCAGCAGCAGCAGCCTCACTTTTTGCACCCATTGCACCCCCAACATCAAACCCAGAAGAATGAAGAGAAAGAAGAGAACACCGGCTTGGTACGTATGAACATGGAGATTGAAAGAGAGAGATCCATGCCTGAATCGATTGATAATCATCACCACCACCATCATCCTTGGTCTAATGATGAAGTCCTTGCACTGTTGAGGATCAGATCTAGCATTGAAAATTGGTTCCCTGAATTCACTTGGGAACACGTCTCAAGGAAGTTGGCAGAGCTTGGGTTCAAGAGGAGTGCAGAGAAGTGCAAAGAGAAATTTGAAGAAGAAAGCAGATATTTCAACAGCATCAATTGTAGCAAAAACTACAGGCTCTTTAGTGAACTTGAAGAGCTTTGCCAGGGTGAGAATCCTCCTCCTCCTCATCATAACCAGCAGGTTGTTGGCGCAACTGAAAAAAACAAAAACGTGGAAAAGTCAAGGGAAGATGAAGACAATATGGGTCAGAATTTGGAAGATGATTCAAGAAATATTGATGAATATCAAACTACAGCAGGAAATAATGCACCAGAGGATAATGAAAGAGTGGTTGAGAACAAAGCAGATAACAAGAACAGCAGCAACAGGAAGAGGAAAAGACAGAAGAAATTTGAGATGATCAAAGGGTTTTGCGAAGATATTGTGAACAAGTTGATGAATCAACAGGAAGAGATGCACAATAAACTGCTTGAAGACATGGTGAAGAGAGATGAAGAGAAAGTTGCAAGAGAAGAAGCCTGGAAGAAACAAGAGCTAGATAGGATTAACCAGGAGCTAGAGCTTAGGGCAAAAGAACAAGCTATTGCAGGTGATAGACAGGCTACCATAATCAAATTCTTGAGTAAATTCGCATCAACTGGCTCCTCTAAATGTTTTCGAAGGAGTAATGAAGCTCTTTTTAAGGTACCAAATGATTCAAATCCTCCTAGTACTTCATCTTCTCTAGTGCCAGCACAAAACCCTAATCCTATAGTCAATGCCCAAAGCCAGGGGGATCAAGTTTCTAGTACTACTTTATCTACGATGGTCCTTGGTCATCAAAACTCGGGTTCTTGTCCAACTGACAACAATCAAATCAAGGCCACTTCAATGACAGAAAACCAAGCCCCTGAAAACCCTAATCCAAAAACACTCACTTCATCAGCTCTAGCCCTAGCTCCCAAAAACCCTAATCCTGTCAACGCTCAAAGCAATCCATCACCACCTACTTCATCAGTGACTGTAAATAAAGCTCCCCTAACCCCTACATCCAATGACAAAGAAGACCTCGGGAAGAGATGGCCAAGAGATGAAGTGTTGGCGTTAATAAACCTAAGATGCAGTCTCTATAACAATGGCGATCACGATAAGGAAGGAGCAGCCATAAAAGCTCCTCTTTGGGAAAGAATCTCGCAAGGGATGTCAGAGTTGGGGTACAAGAGAAGTGCCAAGAGGTGCAAAGAGAAATGGGAGAACATAAACAAGTACTTCAGGAAGACCAAAGATGTTAATAAAAAGAGGTCCCTTGACTCTAGGACATGTCCTTATTTTCATCAATTAAGCACTTTGTACAATCAAGGAACACTTATAGCACCCTCTGAAGGGCTGGAAAACCGCCCGGCTTTGCCCGAAAACCACTCGGCTGCTCTGCCGGAATCCGGTAACGATAACTCATCTCAAAGGGGACCAGCTAAGGATTCTACTGTGCATTTTTCTGAAGGTGAGACAAACATGGTTCAAGTACCAGCTTTTGAATTTGAATTCTGA
Protein Sequence:
- >Thecc1EG026383t1|Theobroma_cacao|Trihelix|Thecc1EG026383t1
MFDGVPDQFHQFIASSAAAAAAAAVAAARTTTLPLPLSFPPLHLANSSNGFTSFDTLYTSNSHNQVPPQLQQQQPHFLHPLHPQHQTQKNEEKEENTGLVRMNMEIERERSMPESIDNHHHHHHPWSNDEVLALLRIRSSIENWFPEFTWEHVSRKLAELGFKRSAEKCKEKFEEESRYFNSINCSKNYRLFSELEELCQGENPPPPHHNQQVVGATEKNKNVEKSREDEDNMGQNLEDDSRNIDEYQTTAGNNAPEDNERVVENKADNKNSSNRKRKRQKKFEMIKGFCEDIVNKLMNQQEEMHNKLLEDMVKRDEEKVAREEAWKKQELDRINQELELRAKEQAIAGDRQATIIKFLSKFASTGSSKCFRRSNEALFKVPNDSNPPSTSSSLVPAQNPNPIVNAQSQGDQVSSTTLSTMVLGHQNSGSCPTDNNQIKATSMTENQAPENPNPKTLTSSALALAPKNPNPVNAQSNPSPPTSSVTVNKAPLTPTSNDKEDLGKRWPRDEVLALINLRCSLYNNGDHDKEGAAIKAPLWERISQGMSELGYKRSAKRCKEKWENINKYFRKTKDVNKKRSLDSRTCPYFHQLSTLYNQGTLIAPSEGLENRPALPENHSAALPESGNDNSSQRGPAKDSTVHFSEGETNMVQVPAFEFEF*
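The CDS and protein records above share a pipe-delimited FASTA header, and the protein should correspond to the CDS translated with the standard genetic code. The following is a minimal sketch of how that relationship could be checked, not the database's own pipeline. The function names and the header field names (`gene_id`, `species`, `family`, `transcript_id`) are hypothetical labels chosen for illustration; the source does not name these fields.

```python
# Standard genetic code (NCBI translation table 1), with bases ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def parse_header(header):
    """Split a pipe-delimited FASTA header into named fields.

    The field names are assumptions made for this sketch only.
    """
    fields = header.strip().lstrip(">").split("|")
    names = ["gene_id", "species", "family", "transcript_id"]
    return dict(zip(names, fields))


def translate(cds):
    """Translate a CDS in frame 0; stop codons become '*', unknown codons 'X'."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(
        CODON_TABLE.get(cds[i:i + 3], "X") for i in range(0, len(cds), 3)
    )


if __name__ == "__main__":
    header = ">Thecc1EG026383t1|Theobroma_cacao|Trihelix|Thecc1EG026383t1"
    print(parse_header(header))
    # First eight codons of the CDS listed above.
    print(translate("ATGTTTGATGGAGTACCAGACCAG"))
```

To check the full record, pass the complete CDS string to `translate` and compare the result with the listed protein sequence, including the trailing `*` that marks the stop codon.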