Gene Details:

  • Gene ID: Glyma.19G252600.4.p
  • Gene Name: GLYMA_19G252600, Glyma19g44190.2, LOC100781875
  • Gene Family: bZIP Family
  • Description: bZIP Family protein
  • Species: Glycine max
  • Source: bZIP family gene from PlantTFDB

Protein Features:

Annotation Proteins:

  • Refseq:  XP_003554746.1  — common plant regulatory factor 1 isoform X2
  • Refseq:  XP_028218712.1  — common plant regulatory factor 1-like isoform X2
  • Swissprot:  Q99089  — CPRF1_PETCR; Common plant regulatory factor 1
  • TrEMBL:  A0A445FLD8  — A0A445FLD8_GLYSO; Common plant regulatory factor 1 isoform A
  • TrEMBL:  K7N091  — K7N091_SOYBN; BZIP transcription factor
  • STRING:  GLYMA19G44190.2  — (Glycine max)

Gene Ontology:

  • GO:0006355  — Biological Process — regulation of transcription, DNA-templated
  • GO:0003700  — Molecular Function — transcription factor activity, sequence-specific DNA binding
  • GO:0043565  — Molecular Function — sequence-specific DNA binding

Family Introduction:

  • The bZIP domain consists of two structural features located on a contiguous alpha-helix: first, a basic region of ~ 16 amino acid residues containing a nuclear localization signal followed by an invariant N-x7-R/K motif that contacts the DNA; and, second, a heptad repeat of leucines or other bulky hydrophobic amino acids positioned exactly nine amino acids towards the C-terminus, creating an amphipathic helix. To bind DNA, two subunits adhere via interactions between the hydrophobic sides of their helices, which creates a superimposing coiled-coil structure. The ability to form homo- and heterodimers is influenced by the electrostatic attraction and repulsion of polar residues flanking the hydrophobic interaction surface of the helices.
  • Plant bZIP proteins preferentially bind to DNA sequences with an ACGT core. Binding specificity is regulated by flanking nucleotides. Plant bZIPs preferentially bind to the A-box (TACGTA), C-box (GACGTC) and G-box (CACGTG), but there are also examples of nonpalindromic binding sites.

Literature:

Sequences:

CDS Sequence:
  • >Glyma.19G252600.4.p|Glycine_max|bZIP|Glyma.19G252600.4.p
    ATGGGAAACAGTGAGGAAGAGAAATCTACCAAGACTGAAAAACCTTCTTCACCTGTAACAGTGGATCAAGCCAATCAGACCAACCAGACCAATATTCATGTCTATCCTGATTGGGCAGCCATGCAGGCATATTATGGGCCAAGAGTCACCATGCCACCATACTACAACTCAGCTGTGGCTTCTGGTCACGCTCCTCACCCATACATGTGGGGACCACCACAGCCTATGATGCCACCTTATGGGCCTCCTTATGCAGCAATTTATCCACATGGAGGGGTTTATACTCACCCTGCAGTTCCTATTGGGCCACATACACATAGTCAAGGAGTTCCATCTTCACCCGCCGCTGGGACTCCTTTAAGCATAGAGACACCACCCAAATCATCTGGAAATACTGATCAGGGTTTAATGAAGAAATTGAAAGAGTTTGATGGACTTGCAATGTCAATAGGAAATGGCCATGCTGAAAGTGCAGAGCCTGGAGGTGAAAACAGGCTGTCAGAGAGTGTGGATACTGAGGGTTCCAGTGATGGAAGTGATGGCAACACTTCAGGGGCTAATCAAACAAGAAGGAAAAGAAGCCGTGAGGGAACACCAACCACTGATGGAGAAGGGAAAACTGAGATGCAAGGCAGTCCAATTTCCAAAGAGACTGCAGCTTCTAATAAGATGTTGGCAGTTGTCACTGCTGGTGTTGCAGGAACAATAGTTGGACCTGTAGTTTCTTCAGGTATGACCACCACGCTGGAGCTGAGAAATCCTTCCAGTGTTCATTCTAAAGCAAGTGCCCCACAACCTTGTCCAGTATTGCCTGCAGAAACTTGGTTACAGAATGAGCGTGAGCTGAAACGTGAGAGGCGGAAACAATCAAATCGAGAATCTGCTAGAAGGTCCAGACTAAGGAAGCAGGCTGAAACTGAAGAACTGGCACGGAAAGTTGAATCCTTGAATGCTGAGAATGCAACACTGAAATCAGAAATAAATCGACTGACCGAAAGTTCTGAAAAAATGAGGGTGGAAAATGCTACATTAAGGGGAAAACTTAAAAATGCTCAACTGAGACAAACACAAGAGATAACTTTGAACATAATTGACAGCCAGAGGGCTACACCTATAAGTACAGAAAACTTACTATCGAGAGTTAATAATAATTCCGGTTCTAATGATAGAACTGTGGAGGATGAGAATGGTTTTTGCGAAAATAAACCAAACTCTGGTGCAAAGCTGCATCAACTACTGGACACAAGTCCTAGAGCTGATGCTGTGGCAGCTGGTTGA
Protein Sequence:
  • >Glyma.19G252600.4.p|Glycine_max|bZIP|Glyma.19G252600.4.p
    MGNSEEEKSTKTEKPSSPVTVDQANQTNQTNIHVYPDWAAMQAYYGPRVTMPPYYNSAVASGHAPHPYMWGPPQPMMPPYGPPYAAIYPHGGVYTHPAVPIGPHTHSQGVPSSPAAGTPLSIETPPKSSGNTDQGLMKKLKEFDGLAMSIGNGHAESAEPGGENRLSESVDTEGSSDGSDGNTSGANQTRRKRSREGTPTTDGEGKTEMQGSPISKETAASNKMLAVVTAGVAGTIVGPVVSSGMTTTLELRNPSSVHSKASAPQPCPVLPAETWLQNERELKRERRKQSNRESARRSRLRKQAETEELARKVESLNAENATLKSEINRLTESSEKMRVENATLRGKLKNAQLRQTQEITLNIIDSQRATPISTENLLSRVNNNSGSNDRTVEDENGFCENKPNSGAKLHQLLDTSPRADAVAAG*