Information report for Thecc1EG035565t1
Gene Details
|
|
Functional Annotation
- Refseq: XP_017981065.1 — PREDICTED: trihelix transcription factor GT-2
- Swissprot: Q39117 — TGT2_ARATH; Trihelix transcription factor GT-2
- TrEMBL: A0A061FJ90 — A0A061FJ90_THECC; Duplicated homeodomain-like superfamily protein isoform 1
- STRING: EOY16707 — (Theobroma cacao)
- GO:0010192 — Biological Process — mucilage biosynthetic process
- GO:0044212 — Molecular Function — transcription regulatory region DNA binding
Family Introduction
- GT factors constitute a plant-specific transcription factor family with a DNA-binding domain that binds GT elements. The DNA-binding domain of GT factor, rich in basic and acidic amino acids and proline and glutamine residues, features a typical trihelix (helix-loop-helix-loop-helix) structure that determines the specific binding of GT elements; thus GT factors are also called trihelix transcription factors. GT elements are highly degenerate cis-elements with A/T-rich core sequences (Villain et al. 1996; Wang et al. 2004). Interaction between GT factors and GT elements has been implicated in the complex transcriptional regulation of many plant genes.
Literature and News
Homologs
- Cajanus cajan: C.cajan_43791
- Cicer arietinum: XP_004496473.1
- Citrus sinensis: orange1.1g006925m, orange1.1g006924m
- Glycine max: Glyma.20G166600.1.p
- Gossypium arboreum: Cotton_A_12729_BGI-A2_v1.0
- Gossypium hirsutum: Gh_A05G2066, Gh_D05G2306
- Juglans regia: WALNUT_00024662-RA, WALNUT_00012588-RA
- Lotus japonicus: Lj0g3v0303659.1
- Manihot esculenta: Manes.18G091600.1.p, Manes.02G179100.1.p
- Medicago truncatula: Medtr1g098900.1
- Populus trichocarpa: Potri.002G068600.1, Potri.005G191900.1
- Prunus persica: Prupe.8G028700.1.p
- Ricinus communis: 29576.m000224
Sequences
CDS Sequence:
- >Thecc1EG035565t1|Theobroma_cacao|Trihelix|Thecc1EG035565t1
ATGCTGGGTTGTGGTGACACAAGCGTCAGTGTCCTCGGAAGCAGCAGCGGCGGTGGCGGTGGCGACGTGGCTGCTGCTGCTGCTGTTGCTACCACGGTTAGCAGCAGCGGAGCTCTCGATGGAAGGAGTGAAGCTGCTGCCAACATGCTTGGCTCCAACGGCAATAATAACAATAACAACAACAACACTAATAATAATTCAGGTGACGATGATAGAGGTAGGGTTGATGAAGGTGACCGCTCTTTCGGAGGCAACCGATGGCCGAGGCAGGAGACTTTGGCACTCTTGAAAATAAGGTCCGACATGGATGTTACCTTTCGTGACGCTAGCGTGAAGGGTCCATTGTGGGAAGAGGTTTCCAGGAAACTAGCAGAGCTTGGTTATCATCGAAGTGCCAAGAAATGCAAGGAGAAATTCGAGAACGTGTACAAGTATCACAAGAGAACCAAAGATGGGCGAACTGGTAAATCAGATGGCAAGGCTTATCGGTTTTTCGATCAATTAGAAGCCCTCGAAAACATTTCTTCCATTCAATCGCCGGCTGCACCACCACCACCATCACCACAATTAAAGCCTCAACACCAAACGGTAATGCCAGCAGCCAATCCTCCCAGTCTGTCCCATATCACTATTCCATCAACAACACTCGCCAGTTTGCCACAAAACATTGTACCACCAAATGCAAGCTTTACAGTCCCGTCCTTCCCATCAACAAACCCTACAATTCAACCCCCACCACCCACAACAAACCCTACAATTCCTTCTTTTCCTAACATTTCCGCAGATCTAATGTCGAATTCAACGTCTTCTTCGACCTCATCAGATTTAGAATTAGAAGGCCGGAGGAAAAGGAAAAGAAAGTGGAAGGATTTTTTCGAGAGGCTAATGAAGGAGGTGATTCAGAAGCAGGAAGATATGCAGAAGAAATTCTTGGAAGCAATAGAGAAACGTGAGCACGAAAGACTGGTCCGTGAAGATGCTTGGAGGATGCAAGAAATGGCAAGAATAAATAGAGAACGCGAGATATTAGCCCAAGAAAGATCCATAGCTGCTGCGAAGGATGCAGCAGTAATGGCGTTCTTGCAAAAATTATCTGAGCAGCGAAATCCGGGGCAGGCACAAAATAATCCACTGCCGTCTCAGCAACCGCAACCGCCTCCACAGGCTCCACCACAGCCAGTACCAGCAGTGGCAACAGCAGCGCCTCCAGCCGCAACAGCAGCACCAGTGCCAGCGCCTGCTCCACCACTATTACCGCTGCCAATGGTGAATTTAGACGTGTCAAAGACTGATAATGGCGATCAAAGTTACACACCATCAAGCTCTTCAAGATGGCCTAAGGTTGAGGTTGAAGCATTGATTAAGCTAAGGACTAGTCTTGATGCTAAATACCAAGAAAATGGTCCTAAAGGACCATTATGGGAGGAGATATCAGCTGCAATGAAAAAGCTTGGTTACAATCGCAACGCCAAAAGATGCAAAGAGAAATGGGAGAACATAAATAAGTACTTCAAAAAGGTGAAAGAGAGCAACAAGAAAAGGCCTGAGGATTCAAAAACATGTCCCTACTTTCACCAACTTGATGCTTTATATAGAGAAAAGAACAAACTTGACAACTCCTCCAATGAATTAAAACCCGAGAACTCAGTCCCATTGCTGGTACGCCCAGAGCAGCAATGGCCCCCTCCTCCTTCGGAGCCCGATGACCACCAACATGATCATGCCACGGAAGACATGGAAAGCGAGCAAAATCAAGACGAAGACGAAAAAGACGGCGATGATGAGGAGGAAGATGAAGGGGGTGACTATGAGATTGTAGCCAGCAAACCGGTTTCAATGGGCACAGCAGCTATCTGCCCGGCTAGTGGGTCAGGATCCGGCAACGGTGCATTGGAGTGGAGGCATTTGAATTGA
Protein Sequence:
- >Thecc1EG035565t1|Theobroma_cacao|Trihelix|Thecc1EG035565t1
MLGCGDTSVSVLGSSSGGGGGDVAAAAAVATTVSSSGALDGRSEAAANMLGSNGNNNNNNNNTNNNSGDDDRGRVDEGDRSFGGNRWPRQETLALLKIRSDMDVTFRDASVKGPLWEEVSRKLAELGYHRSAKKCKEKFENVYKYHKRTKDGRTGKSDGKAYRFFDQLEALENISSIQSPAAPPPPSPQLKPQHQTVMPAANPPSLSHITIPSTTLASLPQNIVPPNASFTVPSFPSTNPTIQPPPPTTNPTIPSFPNISADLMSNSTSSSTSSDLELEGRRKRKRKWKDFFERLMKEVIQKQEDMQKKFLEAIEKREHERLVREDAWRMQEMARINREREILAQERSIAAAKDAAVMAFLQKLSEQRNPGQAQNNPLPSQQPQPPPQAPPQPVPAVATAAPPAATAAPVPAPAPPLLPLPMVNLDVSKTDNGDQSYTPSSSSRWPKVEVEALIKLRTSLDAKYQENGPKGPLWEEISAAMKKLGYNRNAKRCKEKWENINKYFKKVKESNKKRPEDSKTCPYFHQLDALYREKNKLDNSSNELKPENSVPLLVRPEQQWPPPPSEPDDHQHDHATEDMESEQNQDEDEKDGDDEEEDEGGDYEIVASKPVSMGTAAICPASGSGSGNGALEWRHLN*