Gene Details:

  • Gene ID: AA4G00072
  • Gene Family: Trihelix Family
  • Description: Trihelix Family protein
  • Species: Aethionema arabicum
  • Source: Trihelix family gene from PlantTFDB

Protein Features:

Annotation Proteins:

  • Refseq:  XP_010444239.1  — PREDICTED: uncharacterized protein LOC104726958
  • Swissprot:  Q84W56  — RNJ_ARATH; Ribonuclease J
  • TrEMBL:  A0A1J3H5W3  — A0A1J3H5W3_NOCCA; Ribonuclease J
  • STRING:  Bostr.0568s0468.1.p  — (Boechera stricta)

Gene Ontology:

  • GO:0009658  — Biological Process — chloroplast organization
  • GO:0009942  — Biological Process — longitudinal axis specification
  • GO:0060918  — Biological Process — auxin transport
  • GO:0009507  — Cellular Component — chloroplast
  • GO:0003677  — Molecular Function — DNA binding

Family Introduction:

  • GT factors constitute a plant-specific transcription factor family with a DNA-binding domain that binds GT elements. The DNA-binding domain of GT factor, rich in basic and acidic amino acids and proline and glutamine residues, features a typical trihelix (helix-loop-helix-loop-helix) structure that determines the specific binding of GT elements; thus GT factors are also called trihelix transcription factors. GT elements are highly degenerate cis-elements with A/T-rich core sequences (Villain et al. 1996; Wang et al. 2004). Interaction between GT factors and GT elements has been implicated in the complex transcriptional regulation of many plant genes.

Literature:

Sequences:

CDS Sequence:
  • >AA4G00072|Aethionema_arabicum|Trihelix|AA4G00072
    ATGAAATCTGCATCTTTGCAAGGTTTCTCTTACCCTTCTTCCTCACATTCTTCTTCTATACACTCAAATGTTCACAGACCAGCTTCTTCTCCCTTTAAAATGGCGTCGTTTAGTGCCCTTTCAATTTGTCCCTACACTTTCACCTTCCGCCCAATCTCTCGAAATAGGTCTACCGTTTCGTGCTCTGTCACTGCTCCTGCTACTGGCACATCTTCCTCTAAGACACCTCGAAGAAGATCGAATCGACTAGAAGGAGCTGGAAAAAGTATGGAGGATTCAGTAAAACGTAAACTAGAACAGTTTTATGAAGGAACAGATGGGCCCCCACTCCGAGTTCTTCCTATAGGTGGTCTTGGTGAAATTGGAATGAACTGTATGCTTGTTGGAAACTATGATCGATACATTCTAATCGACGCTGGTGTAATGTTTCCTGATTATGATGAGCCTGGAGTCCAGAAAATTATGCCAGACACAGGATTTATCAGAAGATGGAAACACAAAATTGAAGCTGTAGTTATAACTCATGGTCATGAAGATCACATTGGTGCGTTGCCTTGGGTTATACCTGCATTGGACTCCAATACACCGATATTTGCATCATCCTTTACCATGGAGCTTATAAAGAAGCGGTTGAAGGAGAATGGGATCTTTGTTCAATCTAGACTCAAGATATTTAAAACTCGAAGTAGATTCATGGCTGGACCATTTGAAATAGAACCGATAACAGTTACTCATTCTATTCCTGATTGTTCTGGCTTAGTCCTTCGCTGTGCTGACGGTAATATTCTCCACACTGGAGATTGGAAGATCGATGAATCGCCATTAGATGGAAATGTATTTGATCGTGTAGCTTTAGAAGAAATCTCAAAGGAAGGAATTACGTTGATGATGAGTGATTCAACAAATGTGTTGTCGCCGGGAAGGACGCTTAGTGAAAAAGTGGTAGCAGATGCTTTGGTAAGGAATATAATGGCGGCCAAAGGAAGAGTTATCACAACTCAATTTGCCTCTAATATACATCGTTTGGGAAGTATAAAAGCTGCTGCTGATTTAACTGGTCGAAAATTGGTATTTGTTGGTATGTCCTTAAGAACATATCTAGACGCAGCTTGGAGGGATGGAAAGGCTTCAATTGACCCATCTAGTCTGGTGAAAATCGAAGATATTGAAGCATATGCTCCTAAAGATTTGCTGATTGTCACAACTGGGTCACAAGCAGAACCACGTGCTGCTCTGAATCTTGCATCATATGGAACTAGTCACGCATTCAAACTTACCAAAGAAGACGTAATTCTTTACTCAGCAAAGGTGATCCCAGGCAATGAATCACGAGTAATGAAAATGATGAACCGAATAGCAGATATTGGTCCAAATATAATCATGGGTAGGAATGAAATGCTTCACACTTCTGGTCATGCCTACCGTGGAGAATTGGAAGAGGTTCTTAAAATAGTGAAACCGCAACATTTTCTACCCGTACATGGAGAACTATTGTTTCTTAAGGAGCATGAGTTGCTAGGGAAATCTACTGGCATTCGACACACTACTGTTATAAAGAATGGAGAAATGCTTGGAGTTTCTCACTTGAGAAACAGAAGAGTTTTATCCAATGGATTTAGCTTACTTGGGAGGGAGAATTTACAGCTAATGTACAGCGATGGCGATAAAGCATTTGGAACATCAAGTGAACTTTGCATTGATGAAAGACTAAGGATTGCATCTGATGGTATTATAGTTCTAAGTATGGAAATCATGCGTCCAAATAGCGATGAAAGCCGCACAGAAAACAGTATAAAAGGAAAGATAAGAATCACGACACGGTGTATGTGGCTTGACAAAGGAAAGCTTTTAGATGCACTACATAAAGCTGCTCATGCTTCTTTATCAAGTTGTCCTGTGAATTGTCCTTTATCTCATATGGAAAGAACAGTTTCCGAAGTTTTAAGGAAAATAGTGAGGAAATATAGCGGTAAAAGACCAGAAGTTATCGCTATAGCCACAGAAAATCCCTTGGCTGTCCAAGCTGAGGAGGTCAATGCAAGACTGTCTGGTGAGGTTAATGTTGGCTCTGGAGTTGCAGCTTTAAGGAAAGTTGTCGAAGGACATTCAAAAAGAAACCGATCAAAGAAAGTACCGAAAGAAACGGATCCCACTTCAAAAGATGAGATTGTTGATAGTGCAAGACTACTAGCTGAGGAAGAAACCGAAACCTCAGTGTCAACTTACAGAGAAGATGATAAACTCACTATAAATACCGAAGATTCAGATGATTTTTGGAAATCTTTCATCACTCCATCATCTCCTGATGAAAACAAAACTGCGAGTACAGTAACGGATCCATCGGAGGCTAAAACAGAGGATAGTAAAGACGATGATCTATCTGATGCTACAGATGCTGAACAGAAGTCGTCGAAACGAGTGAGGAGAAACAAATGGAAACCGGAGGAGATTAAGAAAGTAATTAGAATGCGAGGAGAGTTGCATAGCAGATTTCAGGTAGTGAAAGGAAGAATGGCTTTATGGGAAGAGATTTCTTCAAATTTAATGGCTGAAGGAATCAACAGAAGCCCAGGACAGTGTAAATCTCTCTGGTCATCTCTTATTCAGAAATACGAGAAGATCTTTGCTGATTTGATTCTGAACTTGGAGCCTGTAATGTCGCTTTTGCAGGAATGTAAGACCGAGGAAAGAAGCAAGACGAATTGGGCACATTTTGAGGATATGAACAACATCTTATCTGAGTTAGACACACCTGCAACAACC
Protein Sequence:
  • >AA4G00072|Aethionema_arabicum|Trihelix|AA4G00072
    MKSASLQGFSYPSSSHSSSIHSNVHRPASSPFKMASFSALSICPYTFTFRPISRNRSTVSCSVTAPATGTSSSKTPRRRSNRLEGAGKSMEDSVKRKLEQFYEGTDGPPLRVLPIGGLGEIGMNCMLVGNYDRYILIDAGVMFPDYDEPGVQKIMPDTGFIRRWKHKIEAVVITHGHEDHIGALPWVIPALDSNTPIFASSFTMELIKKRLKENGIFVQSRLKIFKTRSRFMAGPFEIEPITVTHSIPDCSGLVLRCADGNILHTGDWKIDESPLDGNVFDRVALEEISKEGITLMMSDSTNVLSPGRTLSEKVVADALVRNIMAAKGRVITTQFASNIHRLGSIKAAADLTGRKLVFVGMSLRTYLDAAWRDGKASIDPSSLVKIEDIEAYAPKDLLIVTTGSQAEPRAALNLASYGTSHAFKLTKEDVILYSAKVIPGNESRVMKMMNRIADIGPNIIMGRNEMLHTSGHAYRGELEEVLKIVKPQHFLPVHGELLFLKEHELLGKSTGIRHTTVIKNGEMLGVSHLRNRRVLSNGFSLLGRENLQLMYSDGDKAFGTSSELCIDERLRIASDGIIVLSMEIMRPNSDESRTENSIKGKIRITTRCMWLDKGKLLDALHKAAHASLSSCPVNCPLSHMERTVSEVLRKIVRKYSGKRPEVIAIATENPLAVQAEEVNARLSGEVNVGSGVAALRKVVEGHSKRNRSKKVPKETDPTSKDEIVDSARLLAEEETETSVSTYREDDKLTINTEDSDDFWKSFITPSSPDENKTASTVTDPSEAKTEDSKDDDLSDATDAEQKSSKRVRRNKWKPEEIKKVIRMRGELHSRFQVVKGRMALWEEISSNLMAEGINRSPGQCKSLWSSLIQKYEKIFADLILNLEPVMSLLQECKTEERSKTNWAHFEDMNNILSELDTPATT