Information report for WALNUT_00019222-RA
Gene Details
|
|
Functional Annotation
- Refseq: XP_018844915.1 — PREDICTED: trihelix transcription factor GTL1-like isoform X1
- TrEMBL: A0A2I4GLZ0 — A0A2I4GLZ0_JUGRE; trihelix transcription factor GTL1-like isoform X1
- STRING: EOY23769 — (Theobroma cacao)
Family Introduction
- GT factors constitute a plant-specific transcription factor family with a DNA-binding domain that binds GT elements. The DNA-binding domain of GT factor, rich in basic and acidic amino acids and proline and glutamine residues, features a typical trihelix (helix-loop-helix-loop-helix) structure that determines the specific binding of GT elements; thus GT factors are also called trihelix transcription factors. GT elements are highly degenerate cis-elements with A/T-rich core sequences (Villain et al. 1996; Wang et al. 2004). Interaction between GT factors and GT elements has been implicated in the complex transcriptional regulation of many plant genes.
Literature and News
Homologs
- Citrus sinensis: orange1.1g003766m, orange1.1g005124m, orange1.1g005262m
- Fragaria vesca: mrna01828.1-v1.0-hybrid
- Gossypium arboreum: Cotton_A_17373_BGI-A2_v1.0
- Gossypium hirsutum: Gh_A02G0934, Gh_D12G2385, Gh_A12G2206
- Malus domestica: MDP0000451038
- Manihot esculenta: Manes.01G001900.1.p, Manes.01G001900.2.p
- Prunus mume: XP_008239287.1, XP_008239289.1
- Prunus persica: Prupe.5G137600.1.p, Prupe.5G137600.2.p
- Pyrus bretschneideri: Pbr038634.1
- Ricinus communis: 30068.m002635
- Ziziphus jujuba: XP_015896308.1
Sequences
CDS Sequence:
- >WALNUT_00019222-RA|Juglans_regia|Trihelix|WALNUT_00019222-RA
ATGCAAGAAGGAGGAGGAGTATCTAAATCCCAGTACGGAGTGTCAGCCGAGATGGCCAGTAGTACTCCTGCCATGGCTGCTACTGCTGCTGTAGCAACATCAGTAGGTCAGCCGGTGGGTATGGAAATGGAAAGCGAGCAAGCCCAGCTGGTAGAAGCCGCCTCGCCTATTAGCAGCCGGCCTCCTGCCTCTGCAGCTATGAATTTGGATGAGCTGGTGCCTCTACAGAGCGGTGGTGCGGGTGCAGATGAGGATGCACTCGCTGCTTCTGCCGCTGCCGGAGATGGAGGAGGTGTTGGCGGCGGAAATCGGTGGCCTCGTCAAGAGACCTTAGCACTTCTCAAGATTAGGTCGGAGATGGATGCAGTCTTCCGTGATGCCACACTCAAGGGTCCCCTCTGGGAAGATGTTTCTAGGAAGCTGGCGGAGCTGGGATACAAAAGGAATGCAAAGAAATGCAAGGAAAAATTCGAGAACGTCCACAAGTACTACAAGCGAACAAAGGAAGGCCGAGCTGGTCGTCAAGATGGGAAAAGCTACAAGTTTTTCAGCGAGCTTGAGGCTCTCCATAGCTCAGCTGCCAGTAGTAGCGTCGCCGGCAATAACTCCTCGGCTTCCCTCGTAGTACCGCCGGCGACAGCTACCCTCACAATTAACCCAGTGGTTGCTCCTGTTTCCACTGGGACCGGCATCGGTGGCATGATCAGCAGCCCCATGCCGATCTCATCCAATATCAGACTTCCCCAAGTCCTTCCAGGTCTCGGCATGTTTCCTCCTGATCCTAGTGCGGCAGTGCTTGCACCGGCTGTCGTTTCTGGATCTGTCATAGCACCAGCAGCCAAGCCTATCGGTGTCAGCTTCTCATCTAACAGCTCTTCATCTTCTCCGGGCTCCGATGAAGACGATGATGACGAGGAGGAGAGCCTAGAGGTGGAGCCATCGAACGTAGGAAGCAGCCGCAAGCGTAAACGGGGGTCCACAAAAGCTGGCGGCAGCAGTGACACCCGCAGAATGATGGAGTTCTTCGAGAATCTAATGAAACAGGTAATGCACAAGCAAGAAGTCATGCAGCAGAGATTTCTGGAGGCAATAGAGAAGAGAGAGCAAGACAGGATGATAAGAGAAGAAGCTTGGAAGAGACAAGAGATGGCTAGGCTGGCCCGCGAGCACGAGCTCATGGCTCAGGACCGGGCCATCTCTGCTTCCAGGGATGCCGCTATAGTCGCTTTTCTACAAAAGATTACTGGCCAGCCCATCCAACAACCTACACCTGCAAGCAACAGTCCTGTTGCCCCACCACCGGCTGTGCCACCTTCTCATGTCCCCTCGCCAGTGCTGGCGGCAACGCAACCACCACAACCGTCACAACAGCAGCAGCAACAACAACAACAAAAGCAACAGCAGCAACAACAGCAGCAAGCTCTACAACATTCTCATCGACATACCCAAGTAGTGATGGCAATCCCAGAACAACAGGTACTCCCACAGGAAATAATTAGTGGCGGAGGAGGAAGCTTTGAACCTGCTTCCTCGAGATGGCCTAAGGTAGAGGTTCTTGCACTTATAAAGTTGAGGAGTGGACTTGAATCGAGGTATCAAGAGGCAGGGCCCAAGGGCCCACTTTGGGAAGAAATTTCCGCAGGGATGCAGCGGATGGGCTACAAGAGGAATGCCAAGAGGTGCAAGGAAAAATGGGAGAACATAAATAAGTACTACAAGAAGGTTAAGGAAAGCAACAAGAAACGTCCCGAAGATGCCAAAACCTGTCCTTATTTTCACGAACTAGATGCTCTTTATCGGAAAAAGATACTTGGCAGCTCTAGCAGTGGTGGCAGCGGCAACAGCAGTTTCCTTGATCAGAACAGGCCGCAGCTGCAGCAGCAACAACCAGAGAACATAGAATTGGATCCAAGCGTTGCCCCAGAAATTGTATCACTGCCACAACCACAAACTACACTTGCCAGTACGGAGTCAGATCATGACAAAAATGGAGCTAGTACAGATGTACAGGCCCAGGCAAGCAACATTGCTTTACCAGAAGGCCTCTTTGGAGAAGCAATTGGAGGAGCTCCCAAAAAGCCAGAAGACATTGTGAAGGAGTTAATGGAGGAGCGGGAGCAACATCATCAACCACAACAACGACTAATTCTAGATGAATATGATAAAATTGAGGATGCTGATAGCGACAATAATGATCAAGAGGAAGACGAAGATGAAGACGAGGACGAAGACTTGGAAGAGGAGGGGAAAATGGATTATAAGATAGAAGAGCAATGGCTCTCTACCTGGATCTTTTCTATTACCAAGCTTACACAGAACAGAAGGCCACGGAGAAAGGCCAGCAACTCCACCTATTTAGGTTTAGCAACAACATCTTTCAAGGAGCTTGTAGCCATCAACACTCGGCCTGAACTATCTCTCAAGATTGCACCTGTATTAGCTCTTCCCATCTCTTCAAAAACAGCTCTATCAACATTTAATTTCAAAACATCAGGATCCAATGGCTTTCATTTGTAG
Protein Sequence:
- >WALNUT_00019222-RA|Juglans_regia|Trihelix|WALNUT_00019222-RA
MQEGGGVSKSQYGVSAEMASSTPAMAATAAVATSVGQPVGMEMESEQAQLVEAASPISSRPPASAAMNLDELVPLQSGGAGADEDALAASAAAGDGGGVGGGNRWPRQETLALLKIRSEMDAVFRDATLKGPLWEDVSRKLAELGYKRNAKKCKEKFENVHKYYKRTKEGRAGRQDGKSYKFFSELEALHSSAASSSVAGNNSSASLVVPPATATLTINPVVAPVSTGTGIGGMISSPMPISSNIRLPQVLPGLGMFPPDPSAAVLAPAVVSGSVIAPAAKPIGVSFSSNSSSSSPGSDEDDDDEEESLEVEPSNVGSSRKRKRGSTKAGGSSDTRRMMEFFENLMKQVMHKQEVMQQRFLEAIEKREQDRMIREEAWKRQEMARLAREHELMAQDRAISASRDAAIVAFLQKITGQPIQQPTPASNSPVAPPPAVPPSHVPSPVLAATQPPQPSQQQQQQQQQKQQQQQQQQALQHSHRHTQVVMAIPEQQVLPQEIISGGGGSFEPASSRWPKVEVLALIKLRSGLESRYQEAGPKGPLWEEISAGMQRMGYKRNAKRCKEKWENINKYYKKVKESNKKRPEDAKTCPYFHELDALYRKKILGSSSSGGSGNSSFLDQNRPQLQQQQPENIELDPSVAPEIVSLPQPQTTLASTESDHDKNGASTDVQAQASNIALPEGLFGEAIGGAPKKPEDIVKELMEEREQHHQPQQRLILDEYDKIEDADSDNNDQEEDEDEDEDEDLEEEGKMDYKIEEQWLSTWIFSITKLTQNRRPRRKASNSTYLGLATTSFKELVAINTRPELSLKIAPVLALPISSKTALSTFNFKTSGSNGFHL