Information report for 912691
Gene Details
Functional Annotation
- Refseq: XP_002866995.1 — G-box-binding factor 1
- Swissprot: P42774 — GBF1_ARATH; G-box-binding factor 1
- TrEMBL: D7MBE7 — D7MBE7_ARALL; G-box binding factor 1
- STRING: scaffold_700433.1 — (Arabidopsis lyrata)
- GO:0006355 — Biological Process — regulation of transcription, DNA-templated
- GO:0010310 — Biological Process — regulation of hydrogen peroxide metabolic process
- GO:0010629 — Biological Process — negative regulation of gene expression
- GO:0090342 — Biological Process — regulation of cell aging
- GO:0005737 — Cellular Component — cytoplasm
- GO:0003700 — Molecular Function — transcription factor activity, sequence-specific DNA binding
- GO:0043565 — Molecular Function — sequence-specific DNA binding
- GO:0044212 — Molecular Function — transcription regulatory region DNA binding
Family Introduction
- The bZIP domain consists of two structural features located on a contiguous alpha-helix: first, a basic region of ~ 16 amino acid residues containing a nuclear localization signal followed by an invariant N-x7-R/K motif that contacts the DNA; and, second, a heptad repeat of leucines or other bulky hydrophobic amino acids positioned exactly nine amino acids towards the C-terminus, creating an amphipathic helix. To bind DNA, two subunits adhere via interactions between the hydrophobic sides of their helices, which creates a superimposing coiled-coil structure. The ability to form homo- and heterodimers is influenced by the electrostatic attraction and repulsion of polar residues flanking the hydrophobic interaction surface of the helices.
- Plant bZIP proteins preferentially bind to DNA sequences with an ACGT core. Binding specificity is regulated by flanking nucleotides. Plant bZIPs preferentially bind to the A-box (TACGTA), C-box (GACGTC) and G-box (CACGTG), but there are also examples of nonpalindromic binding sites.
Literature and News
Gene Resources
Homologs
- Arabidopsis thaliana: AT4G36730.1, AT4G36730.2
- Brassica napus: GSBRNA2T00121501001, GSBRNA2T00131046001, GSBRNA2T00027216001, GSBRNA2T00058500001
- Brassica oleracea: XP_013630450.1, XP_013630449.1, XP_013630448.1, XP_013601841.1, XP_013601847.1
- Brassica rapa: XP_009142677.1, XP_009109337.1, XP_009109336.1
- Raphanus raphanistrum: RrC131_p9, RrC5538_p3
Sequences
CDS Sequence:
- >912691|Arabidopsis_lyrata|bZIP|912691
ATGGGAACGAGCGAAGACAAGATGCCATTTAAGCCTACCAAACCAACATCTTCGGCTCAGGAAGTTCCTCCCACACCGTATCCAGATTGGTCAAATTCAATGCAGGCTTATTATGGCGGAGGAGGTACGCCAAATCCTTTTTTCCCATCCCCAGTTGGATCTCCTAGTCCCCACGCTTATATGTGGGGCGCTCAACACCATATGATGCCGCCTTATGGGACCCCAGTACCGTACCCAGCAATGTATCCCCCGGGAGCAGTCTATTCTCATCCTAGCATGCCCATGCCTCCTAATTCTGGTCCAACCAACAAGGAGACTGTGAAGGACCAAGCTTCTGGCAAGAAGTCAAAGGGGAGCTCGAAAAAAAAGGGTGAAGGAGGTGACAAAGCGCTCTCTGGTTCAGGGAACGATGGTGTCTCTCATAGTGATGACAGTGTCACAGCGGGTTCATCTGATGAAAATGATGACAATGCCAATCAACAGGAACAAGGTTCAGTTAGAAAGCCGAGCTTTGGACAGATGCTTGCGGACGCAAGTTCTCAAAGTACTACTGGTGAAATCCAAGGTTCGGTGCCCATGAAGCCGGTAGCCCCGGGGACTAATCTGAATATCGGGATGGACTTATGGTCTTCCCAAGCTGGTGTACCTGTGAAGGATGAACGAGAGCTCAAGAGGCAGAAGAGGAAACAGTCTAACCGTGAATCTGCTAGGCGGTCTAGATTGCGGAAGCAGGCGGAATGCGAACAACTTCAACAGAGAGTAGAGAGTTTGTCGAACGAGAATCAAAGCCTGAGAGATGAGCTACAAAGACTCTCAAGCGAATGTGAAAAGCTCAAGTCTGAGAACAACTCAATCCAGGATGAGTTGCAGAGAGTGCTTGGAGCAGAGGCTGTAGCTAATCTAGAGCAGAATGCTGCTGACGGTGAAGGAAAAAATTAA ATGGACGCAATCTTCTTAACGGATGATCCGAATACCCGGAAACAATTAATTGGGTCGCTAGCTCATTCTTTCGGATGCATCTACGTTTCTCTCTGGTCCTATTATTTTCCTCGACCGTCTAACTACTTGATATCATTCGATGGATATTACAATGAAGCATCTAACGAGCCTTCTACGTCTACTGGAAGTTTAGCGAGAAGATTGTTCCATGAGTATCGCCAATCCGTCATTCCTCTCCAAAATGGACATATACCAAGCATGGCGTTCATGAATAATCTCCCATACCTAGAGATTCAGACACAAGATATTCAAAGACTTGCTTCTAACGACGCACAGCGTCTCTTCTATCAGGAAGCAAGGATTCAGACGGTGATATTCATGGGTTGTCGGAGCGGGGAGATCGAACTCGGATTGACGTATGATGCTGCAAATATGAAAGTAGAAGCAAGTCTTCGAGATTGGTTCCCTGAAGATTTCAGTAGAAAAACTTCTCCGGTCAACTCAGACTATCTCCGGCCACAGCCTCCTCCGTCTTCATCCTCTTCTTCTCTTAGATCACTAGACAGTCCCCAAAACGCCTCCGAATATTCCTCTCTCTTATTCCCACTCATCCCTAAACCTTCAACGACGACTGACGCCGTTAACGTTCCGTTGCATACGCTGCTAGCTCCGGTCACCACAGCAGAAACAACGACCAACATGATCCATCAACAACAACAAGAGCCTTTGTTTCGCAACCGTGAACGTGAGGAGGAAGTAATGACGCAAGCTATCTTAGCGGTTTTATCGATGTCTTCAAGTCCTTCGTCGCCGCAGCGAAAAGGAAAGGCCACCGCTTTTAAGAGATACTACTGCGTGGCTAGCGGCGGCGGTGGGAGCGGTAGAGCACCGCAACCGCCGAGTGTACGGAGGCAAAGTATGATGAAAAGAGCTATTTCGTTCTACAATAGGCTTAACATTAACTGGAGAGAGCGTTTTCCCGCTACTGGCGGCGGCAGTGATGGAATCGGTGGAAGCGGTGGCGGCGGTGGCGGGCGTGGGCCAACCGCAACGCAGTTGCATCATATGATATCGGAGAGGAAACGGCGAGAAAAGCTTAATGAGAGCTTTCAAGCATTAAGATCTCTCCTTCCTCCTGGAACTAAGAAAGATAAAGCATCGGTCCTCACCATTGCAAGGGATCATCTAACTTCTTTGCAAGGTGACATTTCGAAACTACTAGAGAGAAATCGAGAGCTGGAGGCTAAGCTAGCGGGGGAAAGAGAGATGGAAATTTTTTTACAAGCCGATGAGAGGTTTAACGTTCGTATAATACATATACCCGAATCCACATCCAGAGAAAGGGTTTTGGATCTAAGAATTGCTCACAGAGGAGACAACATTGGGGCTGATGATTTGATCATAAGGCTTCTAGAGTTCTTAAAGCATATCAACAATGTGAGTTTAGTATCAATCGACGGTAAAACCCGAGCTAGAGAAGATGGAGTTACTTCGGTCATTCTCGTGAGCTTAAGGCTCAAGATTGAGGGTGAATGCGACGAATCAGCCTTCCAAGAAGCAATCAGAAGAGTTGTTGCTGACTTGGCTCACTGA
Protein Sequence:
- >912691|Arabidopsis_lyrata|bZIP|912691
MGTSEDKMPFKPTKPTSSAQEVPPTPYPDWSNSMQAYYGGGGTPNPFFPSPVGSPSPHAYMWGAQHHMMPPYGTPVPYPAMYPPGAVYSHPSMPMPPNSGPTNKETVKDQASGKKSKGSSKKKGEGGDKALSGSGNDGVSHSDDSVTAGSSDENDDNANQQEQGSVRKPSFGQMLADASSQSTTGEIQGSVPMKPVAPGTNLNIGMDLWSSQAGVPVKDERELKRQKRKQSNRESARRSRLRKQAECEQLQQRVESLSNENQSLRDELQRLSSECEKLKSENNSIQDELQRVLGAEAVANLEQNAADGEGKN*