Gene Details:
- Gene ID: AA19G00150
- Gene Family: bZIP Family
- Description: bZIP Family protein
- Species: Aethionema arabicum
- Source: bZIP family gene from PlantTFDB
Protein Features:
- SMART: SM00338
- Gene3D: G3DSA:1.20.5.170
- PROSITE profile: PS50217
- Pfam: PF00170
- SuperFamily: SSF57959
- PROSITE pattern: PS00036
- InterPro: IPR004827
Annotation Proteins:
- Refseq: NP_171893.1 — G-box binding factor 4
- Swissprot: P42777 — GBF4_ARATH; G-box-binding factor 4
- TrEMBL: A0A384LG29 — A0A384LG29_ARATH; GBF4
- TrEMBL: Q2HIT6 — Q2HIT6_ARATH; At1g03970
- STRING: AT1G03970.1 — (Arabidopsis thaliana)
Gene Ontology:
- GO:0006355 — Biological Process — regulation of transcription, DNA-templated
- GO:0005634 — Cellular Component — nucleus
- GO:0003700 — Molecular Function — transcription factor activity, sequence-specific DNA binding
- GO:0043565 — Molecular Function — sequence-specific DNA binding
Family Introduction:
- The bZIP domain consists of two structural features located on a contiguous alpha-helix: first, a basic region of ~ 16 amino acid residues containing a nuclear localization signal followed by an invariant N-x7-R/K motif that contacts the DNA; and, second, a heptad repeat of leucines or other bulky hydrophobic amino acids positioned exactly nine amino acids towards the C-terminus, creating an amphipathic helix. To bind DNA, two subunits adhere via interactions between the hydrophobic sides of their helices, which creates a superimposing coiled-coil structure. The ability to form homo- and heterodimers is influenced by the electrostatic attraction and repulsion of polar residues flanking the hydrophobic interaction surface of the helices.
- Plant bZIP proteins preferentially bind to DNA sequences with an ACGT core. Binding specificity is regulated by flanking nucleotides. Plant bZIPs preferentially bind to the A-box (TACGTA), C-box (GACGTC) and G-box (CACGTG), but there are also examples of nonpalindromic binding sites.
Literature:
Sequences:
CDS Sequence:
- >AA19G00150|Aethionema_arabicum|bZIP|AA19G00150
ATGGCGTCTTTTAAGGTGATGTCTTCTTCAAACTCAAGAGATTCGGATCTATCTCGTCGTAATTCCTCTTCTGCTTCATCTTCTCCTTCTATAAAATCGCATCTCCTTAGACGAGATTCAAACACCGATCATTCAAGGATAAGTTTCGCTAACACCGGAAGAAACCAAAGAAGAATCTCACCTACTTCAATTTCTGATCCACCGCCGCCATTGAATGATCAAAAGCTTGAAGCTATGGCTGCTAAAGCAACAGAGAAGAAGAGTGTAGACGATGTTTGGAGAGAGATTGTATCAGGAGAGAGAAAAAAAATGAAGGAAGAAACACAAGAAGATATAATGACGCTTGAAGATTTCTTAGCTAAAGCAGCTATGGATGAAACAGTGGGAACAAGAGGAGATTCAAATGATATTGATGTTAAAATACCACTCAACAATGGCGGTGGAAGCGGTTACACGTTTGATTTTCCAATGCAACCGCACGGTTATGGTAATGGTTTAGGTGGTTCTGGGTCTGGTTCAGGCTCAGGTTCAGGTTTCGGTAGAGGAAAGAGAGGGAGAGTAATGGTTGAGCCATTAGATAAAGCTGCAGCTCAGAGACAGAGAAGGATGATAAAGAATCGTGAGTCTGCAGCGAGGTCTAGGGAGCGAAAACAAGCTTATCAAGCTGAGTTAGAGACATTAGCTGCAAAATTAGAGGAAGAGAATGAGAAACTTTGTAAGGAAATAGCAGAGAAGAGTAAAGAGAGATATAAGAAACTTATGGAGCAAGTGGTACCTGTTGTTGAGAAATCAAAGCCATTGCGATTTTTACGGAGGAGTCAGTCCTTGGAATGG
Protein Sequence:
- >AA19G00150|Aethionema_arabicum|bZIP|AA19G00150
MASFKVMSSSNSRDSDLSRRNSSSASSSPSIKSHLLRRDSNTDHSRISFANTGRNQRRISPTSISDPPPPLNDQKLEAMAAKATEKKSVDDVWREIVSGERKKMKEETQEDIMTLEDFLAKAAMDETVGTRGDSNDIDVKIPLNNGGGSGYTFDFPMQPHGYGNGLGGSGSGSGSGSGFGRGKRGRVMVEPLDKAAAQRQRRMIKNRESAARSRERKQAYQAELETLAAKLEEENEKLCKEIAEKSKERYKKLMEQVVPVVEKSKPLRFLRRSQSLEW