Information report for Thecc1EG014612t1
Gene Details
|
|
Functional Annotation
- Refseq: XP_007037950.2 — PREDICTED: ribonuclease J isoform X1
- Swissprot: Q84W56 — RNJ_ARATH; Ribonuclease J
- TrEMBL: A0A061FYV1 — A0A061FYV1_THECC; RNA-metabolising metallo-beta-lactamase family protein
- STRING: EOY22451 — (Theobroma cacao)
- GO:0009658 — Biological Process — chloroplast organization
- GO:0009942 — Biological Process — longitudinal axis specification
- GO:0060918 — Biological Process — auxin transport
- GO:0009507 — Cellular Component — chloroplast
- GO:0016021 — Cellular Component — integral component of membrane
- GO:0003677 — Molecular Function — DNA binding
Family Introduction
- GT factors constitute a plant-specific transcription factor family with a DNA-binding domain that binds GT elements. The DNA-binding domain of GT factor, rich in basic and acidic amino acids and proline and glutamine residues, features a typical trihelix (helix-loop-helix-loop-helix) structure that determines the specific binding of GT elements; thus GT factors are also called trihelix transcription factors. GT elements are highly degenerate cis-elements with A/T-rich core sequences (Villain et al. 1996; Wang et al. 2004). Interaction between GT factors and GT elements has been implicated in the complex transcriptional regulation of many plant genes.
Literature and News
Homologs
- Gossypium arboreum: Cotton_A_16240_BGI-A2_v1.0, Cotton_A_19924_BGI-A2_v1.0
- Gossypium hirsutum: Gh_D07G0523, Gh_A07G0459, Gh_D03G1080, Gh_A03G0459
- Manihot esculenta: Manes.14G052700.1.p, Manes.14G052700.2.p
- Populus trichocarpa: Potri.012G095800.1, Potri.015G093600.1
- Prunus mume: XP_008239449.1, XP_016651115.1
- Prunus persica: Prupe.5G152800.1.p, Prupe.5G152800.3.p
- Ricinus communis: 30147.m014528
- Sesamum indicum: XP_011073024.1
- Vitis vinifera: GSVIVT01008515001
- Ziziphus jujuba: XP_015881808.1
Sequences
CDS Sequence:
- >Thecc1EG014612t1|Theobroma_cacao|Trihelix|Thecc1EG014612t1
ATGAAAAAGGTCAAAAAAAAAAGAAGAAAAAATCTCTACACGTGCGGTAAGAACACACGTGGGAGAAATTTGTCTTGTTTCAAAAATGATAAGCAGGATATTTCTCCACCTTACAAAAATCTCCACACTTTCACTCACTCACTCAATTTCCCGCTCCCAGAAGCTCAATCATCTCCTCTCCTCCTTGTCTCTAAACTCCTTCACAAAAACGCAAAATGTTCAGGGTTTAATCTCCACTGTTGTTTTTTTTTCTTTGTAAACCTAAAAGAGCAGATGCAGTTAGGGTTTTTGGGGGGTTTATCTTTCTCCTACTCCCTCTACTTCACTTCTTTTAAGCCAATAAAAGCACCCACAAAAATGGCAGCTTCCACTGCTCACTCGCTTTGTCCGTACGGCCTCTACTGCCGGCCAAACCCTAGACACCGTTACATTTCTTGCTCCGTCGGCTCTCCTACTCCTTTAGGTACACGAAGAACTAAAGTACCACGTAAAAAGTCAGGACGATTGGATGGGGCTAGAAAGAGTATGGAAGACTCTGTCCAACGCAAGATGGAACAGTTTTATGAAGGGACTGCAGGGCCACCACTTCGTGTCCTTCCAATAGGTGGCTTGGGTGAAATCGGGATGAATTGCATGCTCGTTGGGAATTATGATCGCTATATTCTAATAGATGCCGGTGTGATGTTCCCGGACTATGATGAGCTTGGAGTCCAAAAGATTATACCTGATACGACATTTATTAAGAAATGGAGCCACAAAATTGAAGCAGTCGTGATAACACATGGCCATGAAGACCACATTGGTGCGTTGCCTTGGGTTATCCCAGCTTTGGATTCCCATACTCCAATATATGCATCATCCTTTACAATGGAGCTGATCAAAAAGCGTTTGAAGGAGAATGGGATTTTTGTTCCATCTAGGCTTAAGATATTTAAAACCAGGAAGAGATTTATGGCTGGGCCATTTGAAATAGAGCCTCTCAGAGTGACCCATTCTATTCCTGACTGTTGTGGATTAGTTCTTCGCTGTGCTGATGGTACAATTCTTCACACTGGGGACTGGAAGATTGATGAATCACCCTTGGATGGGAAAATTTTTGATCGACAGTTTCTAGAGGATCTCTCTAAAGAAGGAGTAACACTGATGATGAGTGACTCTACTAATGTATTGTCACCTGGAAGGACAATCAGCGAAAGTTCAGTAGCAGATGCATTGTTGAGACATATTTCAGCTGCTAAAGGAAGGATTATTACTACCCAATTTGCGTCAAACATACACCGGCTTGGAAGTGTAAAAGCTGCTGCAGATTTAACTGGCAGAAAGTTGGTATTTGTTGGCATGTCATTGAGGACATATCTAGATGCAGCTTGGAAGGATGGAAAAGCACCAATTGATCCATCAACTCTGGTGAAAGTGGAAGATATTGATGCCTATGCTCCAAAGGATTTGATAATTGTCACAACTGGATCCCAAGCAGAGCCACGTGCTGCCTTGAATCTTGCATCTTATGGAAGTAGTCATTCCTTCAAACTGAACAAGGAAGATGTGATTCTCTACTCAGCTAAGGTAATCCCTGGTAATGAATCTCGGGTAATGAAGATGCTAAACCGCATATCAGAGATTGGATCAACTATAGTGATGGGTAAGAATGAGGGGCTACACACTTCTGGTCATGGCTATCGTGGAGAACTGGAGGAAGTACTTAAAATTGTGAAGCCGCAACATTTTTTACCCATACATGGAGAGCTATTGTTCCTGAAAGAGCATGAGCTACTTGGGAAATCAACTGGCATTCGACACACCACCGTTATAAAGAATGGAGAGATGCTTGGTGTTTCTCATTTGAGGAATAGAAGAGTTCTATCTAATGGTTTTAGTTCCCTTGGGAAGGAGAATTTGCAGTTGATGTACAGTGATGGTGATAAAGCATATGGCACATCAACTGAACTTTGCATTGATGAGAGACTAAGAATTGCATCTGATGGCATTATAGTGGTCAGCATGGAAATTTTACGCCCCCAAAAGATAGATGGCATAATGGAAAATAGCTTAAAAGGGAAGATAAGAATTACTACACGCTGCTTATGGCTTGACAAAGGGAAGCTTTTAGATGCACTTCATAAAGCGGCACATGCTGCACTCTCTAGCTGTCCTGTGAATTGTCCTTTAGGTCACATGGAAAGAACTGTGTCAGAGGTATTGAGGAAGATGGTAAGGAAGTACAGTGGTAAGAGGCCTGAAGTCATTGCCATTGCTTTGGAGAACCCAGCAGGAGTTTTCTCTGATGAGCTAAATGAAAGGCTATCTGGCAATTACAATGTCGGTTTTGAGATACCAACGTTGAGAAAAGTGGTAGATGGACATCCAAAAAGGAGCCAGCCAAACAAGATAAAAGCAGAAGATGACAGTAATTTGCATTTAGAGAACACCTCAGAACAAAGTTTGGAAGTTAGTGATGGTGAAGTTGAAAAGCTCTTACCTGAGGAAGACACCACTACTTCAAGTCCTGATTCTCTGGAAAGGCATACACCCAATTCTGAGGGTTCTGATGAATTCTGGAAATCATTCATCACGTCATCGTCACCTGTTAACAATTTGGTAAATGATAACAATGGATTGGTCCCAAAAAAGGAATATAAGTCACAACTTAAGAGTGATGGCACTGCAAGCAGTGGAGATGACTCAGAAATGCCCAGCTCTCAACCTAAGTCGTCCAAGCCTGCCAAACGGAACAAATGGAAACCTGAGGAGGTTAAGAAGCTGATTAAAATGCGTGGGAAATTGCATAGCAGATTTCAGGTTGTGAAGGGGAGAATGGCCCTCTGGGAAGAAATATCCACAAGCTTGATGGCTGAAGGAATTAGTCGAAGCCCTGGGCAATGTAAATCTCTTTGGACATCTCTGGTTCAGAAATACGAGGAAAGTAAGGGTGAGAAAAAAAGCCATAAAGAATGGCCATATTTTGAGGACATGAGTAAAGTTTTTTCTGATTTTGAGGCAACCGCGACAAAATGA
Protein Sequence:
- >Thecc1EG014612t1|Theobroma_cacao|Trihelix|Thecc1EG014612t1
MKKVKKKRRKNLYTCGKNTRGRNLSCFKNDKQDISPPYKNLHTFTHSLNFPLPEAQSSPLLLVSKLLHKNAKCSGFNLHCCFFFFVNLKEQMQLGFLGGLSFSYSLYFTSFKPIKAPTKMAASTAHSLCPYGLYCRPNPRHRYISCSVGSPTPLGTRRTKVPRKKSGRLDGARKSMEDSVQRKMEQFYEGTAGPPLRVLPIGGLGEIGMNCMLVGNYDRYILIDAGVMFPDYDELGVQKIIPDTTFIKKWSHKIEAVVITHGHEDHIGALPWVIPALDSHTPIYASSFTMELIKKRLKENGIFVPSRLKIFKTRKRFMAGPFEIEPLRVTHSIPDCCGLVLRCADGTILHTGDWKIDESPLDGKIFDRQFLEDLSKEGVTLMMSDSTNVLSPGRTISESSVADALLRHISAAKGRIITTQFASNIHRLGSVKAAADLTGRKLVFVGMSLRTYLDAAWKDGKAPIDPSTLVKVEDIDAYAPKDLIIVTTGSQAEPRAALNLASYGSSHSFKLNKEDVILYSAKVIPGNESRVMKMLNRISEIGSTIVMGKNEGLHTSGHGYRGELEEVLKIVKPQHFLPIHGELLFLKEHELLGKSTGIRHTTVIKNGEMLGVSHLRNRRVLSNGFSSLGKENLQLMYSDGDKAYGTSTELCIDERLRIASDGIIVVSMEILRPQKIDGIMENSLKGKIRITTRCLWLDKGKLLDALHKAAHAALSSCPVNCPLGHMERTVSEVLRKMVRKYSGKRPEVIAIALENPAGVFSDELNERLSGNYNVGFEIPTLRKVVDGHPKRSQPNKIKAEDDSNLHLENTSEQSLEVSDGEVEKLLPEEDTTTSSPDSLERHTPNSEGSDEFWKSFITSSSPVNNLVNDNNGLVPKKEYKSQLKSDGTASSGDDSEMPSSQPKSSKPAKRNKWKPEEVKKLIKMRGKLHSRFQVVKGRMALWEEISTSLMAEGISRSPGQCKSLWTSLVQKYEESKGEKKSHKEWPYFEDMSKVFSDFEATATK*