Gene Details:

  • Gene ID: Aradu.3MA41
  • Gene Family: Trihelix Family
  • Description: Trihelix Family protein
  • Species: Arachis duranensis
  • Source: Trihelix family gene from PlantTFDB

Protein Features:

Annotation Proteins:

  • Refseq:  XP_020999670.1  — uncharacterized protein LOC107490890
  • TrEMBL:  A0A445CZW2  — A0A445CZW2_ARAHY; Uncharacterized protein
  • STRING:  XP_007152366.1  — (Phaseolus vulgaris)

Family Introduction:

  • GT factors constitute a plant-specific transcription factor family with a DNA-binding domain that binds GT elements. The DNA-binding domain of GT factor, rich in basic and acidic amino acids and proline and glutamine residues, features a typical trihelix (helix-loop-helix-loop-helix) structure that determines the specific binding of GT elements; thus GT factors are also called trihelix transcription factors. GT elements are highly degenerate cis-elements with A/T-rich core sequences (Villain et al. 1996; Wang et al. 2004). Interaction between GT factors and GT elements has been implicated in the complex transcriptional regulation of many plant genes.

Literature:

Sequences:

CDS Sequence:
  • >Aradu.3MA41|Arachis_duranensis|Trihelix|Aradu.3MA41
    ATGGGTGATAAGGGAGACTCATCCAAGAAACAACCACAACAATCTCACCACCCTAAACAGCAGCAGCAGATTTCATCTTCTCCAAAATCCCTACCACAAGCTTCTGCTGAGGAACCAATATCAGAAACAAGGACAATCCATCACCAACAGCAACAGTCACCAGTTGTTGTTGTGAGTGGAGCAACTCCATTCATCTCATCAAGCCCTCTCTATGTCTCAAGTGGTGCAAGTTCTTCACCCTTTGAGCAGCAACAACAGCAACAATTTGAGGCACTGAATGTTAAGAGGCCAAGGTACAGTACTGGACAATGGAAGCTTCTTCCATCACCATCCTCACAGCAGCATCAGAAACAGATGGCGGCCATGCTCCCAAGCGAATCAAGCCCTTCACCATCAGCAAATCCACCACAACCCTTACAAACATATGCTGCTGCTGCAGCAGCAGCAGCTGCATCATCTTCATCAGACACAGCATCATCTCCAAGTCACTCACCTATGCCTTTGCTCTCAGGTGGTTCTGGCCATGAAGGGAGCAAGCCAACATCAGAAGGAGAACAACCACCACAGCAACTTCATCACCAGCAACTAAGAAAAGGGAAGTATGTGAGTCCAGTTTGGAAGCCTAATGAGATGCTATGGCTTGCAAGGGCATGGAAAGCACAGTACCAAGGTGGTGGTTCTGATGGTTCATCTTCAAGAACAGAACAACAGCAAGAATTGGGAGGAATGAGTAGAGGGAAAACTAGGGCTGACAAAGATAGAGAAGTAGCTGAGTTTCTTCAGAGGCATGGGGTCAGCAGAGATGCTAAGACAGCAGGGACCAAATGGGATAACATGTTAGGTGAATTCAGGAAAGTGTATGAATGGGAGAGAGGTGGTGAGAGAGAACAAGTTGGAAAGAGCTACTTTAGGCTCTCACCATATGAGAGGAAGCTTCATAGGTTGCCAGCTTCCTTTGATGAGGAGGTTTTTGAGGAGCTTTCACAGTTCATGGGGTCCAGAATGAGGTCCTCCTCTCATGGTGGCGGCAGAGGTGTAGATGATGGTAGGACAGCATCACATGCTGCTGTTGCTACAGTGAGACCTCTGCCACTGCCTCCACCTAGGCCTTTCAAGGATGATGATCTTCCTCTCTCAGCCAGGACAACAAAGCAATTGGGTGGTAATGAAGCTTTCTTTCATGGTCCCAACAGAGGTAGCTTATTAGGGTTGGACCCTCATCAGCATAATGTATTGGACATTCAAGGTGCTTCATCTTCTTCATCTTCCAGAGAGCTGAGAAGAATTGGGAAGATAAGGATGACATGGGAAGAATGTGTGAGCCTTTGGGCTGAAGAAGGTGAAGTTCATAGAGGGAGAGTGAGGGTTCAAGCTTCAAGCTTTTTGAATGCTGATGAACTCACTTACTTCGATGAATCCATGGTTCCATGTCTCATGGAGTCCTTCGAAGATGGTCCCCTAAGAGGCTATTCGGTTGATAGATTTGTTTCAGGTCAGCAAGTCAAAGTTTTTGGGAGAAGAAAATCGTCTTCGGCTTCGTCACCTCCTTCTTCTGGTTTTAACGAAAGACTTCAATTTCCATCCAAATCACCTTCCATAAGATCAGCGAATACTACAACAACAGTAGAGTTTAGGGACCCAAGTGAATACTACGTGGAATGTTTGCAACAGAGAATAATGTCATCACTACAACAACACTCACTTCCAACCTTATTCGAATTGAAACGCTACTTGCAAGAGCCACCTCCACAGGACTTACGCTTCCCACTCCGTAAACAGGTTTACGACGACTTACCATCTTCCAAAGACCTCTTCTTCGCTCCTTCAACTCACCCGCCTTTCTTGGATTCTAGAACCTTCCTCTACGACGTCGTTTCCCCTCTCATCAGATCCAACAACCCTACCATTCTCCCGTCTTCTAGAGACTCGTTCATTCCCCTTTGGGACCATTGCATTAACTCAATTCTTTCCTTCTTCTGCAACCAAGATTTCATTCTAATCAGAAAGCCAACATATACTACTACTAGTTCTACTGCGTTGCAAGATCAATGGCCCAACGTTACTGGTTTCGTTAACAACTATTGCTTATGGCGAGGGGAAGAAACTGATCAACTACGAGAGGGACAACAAGATCCGTCTTCCACCATTGTTGAGAAGCTTCTGTGGACCTACGCGGATCTACCTTACATTCTCGGCTACTACGCCATTGGAACCAAGGTAACACTCTGCGCGTTAAGTAGGTCAACGCAATCAGACCAAGGAGAAACGAGAATTGTACGAACGGATCTACAACAACTTAACCTAACTACACCTACAGAACGAATCAAAGCTTTGGTTCCGTGTTTCAGAATCGCTACACTGTTACCGTTACTGAGCAAGATCTGTAACAACAGCAAGGGACTCGGGATCTTCACGTACAGCGATTTCGAGAGATTTGACCACGGAAACGGCGTCGTAACGGAGATGACGCCGAACACGTGCACTAGAATGTTCTCCGATAAGAGAAAATGGCTGGCGGCGAAGGAGGTTTACGAGATCTTGAAACACCGAATCCCTCACGCGGAGACTCTGGTTCAGTGTTGCGAAAACGACATGAGTTTAGTGTTCAAGCCGAGAGGTTGCAGAACGAAGCCTTCAAACTGCGAGGAGTTAGTTGAGGCGTTGAAGTGCGTGACGAAAGCGCTGGTGGCGCTGCACGACTTGTCGTTTATGCATAGGGATTTGGGATGGGAGAAAGTGTTGAGGAGGAGTGATAGTAGGGAGAGGGAGAGGGAGAGTGAGTGGTTTGTGTGTGGGTTTGAGGAGGCGAGTGGGGCTCCGGAGCTGAACAGGCGCGTGAGAGTTGGGAGGGAGGGGGAGTGGGCGCCGGAAATGGAGAGAGGTTTGCATTCCGTTAAAGTGGATGTGTGGGGGGTTGGGTGGTTGATAAGGACCTGTGGTTTGGTGGTTCCCAAAATGGTGAAGGAGTTGGAGAGGATGTGTATGGAGAATAATCCCGAGCATCGGCCTACGGCGGCTGACTGTTATCACCACCTGCTTCAGCTTCAGTCTTCTCTCTCTCACCACCAACCGCCTCCTGCTGCCGCTCATTTGATGTGA
Protein Sequence:
  • >Aradu.3MA41|Arachis_duranensis|Trihelix|Aradu.3MA41
    MGDKGDSSKKQPQQSHHPKQQQQISSSPKSLPQASAEEPISETRTIHHQQQQSPVVVVSGATPFISSSPLYVSSGASSSPFEQQQQQQFEALNVKRPRYSTGQWKLLPSPSSQQHQKQMAAMLPSESSPSPSANPPQPLQTYAAAAAAAAASSSSDTASSPSHSPMPLLSGGSGHEGSKPTSEGEQPPQQLHHQQLRKGKYVSPVWKPNEMLWLARAWKAQYQGGGSDGSSSRTEQQQELGGMSRGKTRADKDREVAEFLQRHGVSRDAKTAGTKWDNMLGEFRKVYEWERGGEREQVGKSYFRLSPYERKLHRLPASFDEEVFEELSQFMGSRMRSSSHGGGRGVDDGRTASHAAVATVRPLPLPPPRPFKDDDLPLSARTTKQLGGNEAFFHGPNRGSLLGLDPHQHNVLDIQGASSSSSSRELRRIGKIRMTWEECVSLWAEEGEVHRGRVRVQASSFLNADELTYFDESMVPCLMESFEDGPLRGYSVDRFVSGQQVKVFGRRKSSSASSPPSSGFNERLQFPSKSPSIRSANTTTTVEFRDPSEYYVECLQQRIMSSLQQHSLPTLFELKRYLQEPPPQDLRFPLRKQVYDDLPSSKDLFFAPSTHPPFLDSRTFLYDVVSPLIRSNNPTILPSSRDSFIPLWDHCINSILSFFCNQDFILIRKPTYTTTSSTALQDQWPNVTGFVNNYCLWRGEETDQLREGQQDPSSTIVEKLLWTYADLPYILGYYAIGTKVTLCALSRSTQSDQGETRIVRTDLQQLNLTTPTERIKALVPCFRIATLLPLLSKICNNSKGLGIFTYSDFERFDHGNGVVTEMTPNTCTRMFSDKRKWLAAKEVYEILKHRIPHAETLVQCCENDMSLVFKPRGCRTKPSNCEELVEALKCVTKALVALHDLSFMHRDLGWEKVLRRSDSRERERESEWFVCGFEEASGAPELNRRVRVGREGEWAPEMERGLHSVKVDVWGVGWLIRTCGLVVPKMVKELERMCMENNPEHRPTAADCYHHLLQLQSSLSHHQPPPAAAHLM