Information report for 107704
Gene Details
Functional Annotation
- Refseq: XP_002499671.1 — hypothetical protein MICPUN_107704
- TrEMBL: C1DYD9 — C1DYD9_MICCC; Uncharacterized protein
- STRING: XP_002499671.1 — (Micromonas sp. RCC299)
- GO:0006355 — Biological Process — regulation of transcription, DNA-templated
- GO:0007623 — Biological Process — circadian rhythm
- GO:0009416 — Biological Process — response to light stimulus
- GO:0003700 — Molecular Function — transcription factor activity, sequence-specific DNA binding
- GO:0008270 — Molecular Function — zinc ion binding
- GO:0043565 — Molecular Function — sequence-specific DNA binding
Family Introduction
- GATA factors were first identified as proteins that interact with conserved WGATAR (W = T or A; R = G or A) motifs involved in erythroid-specific gene expressionin vertebrates.
- GATA factors are characterised by the presence of conserved, type-IV zinc-finger motifs Animal factors typically contain two C-x2-Cx17-C-x2-C zinc-finger domains. The majority of known fungal GATA factors contain a single C-x2-C-x17-C-x2-C finger with greatest similarity to the carboxyl (C) terminal finger of animal GATA factors.Several examples of fungal GATA factors containing a variant C-x2-C-x18-C-x2-C DNA-binding domain are also known.
- Examples of both C-x2-C-x17-Cx2-C (Type IVa) and C-x2-C-x18-C-x2-C (Type IVb) GATA factors are found within fungi; animals onlycontain the former configuration, and plants only the latter. Plant GATA factors typically contain a single zinc finger. The Arabidopsis type-IV zinc-finger proteins may represent the previously defined family of nuclear GATA-binding proteins implicated in light-responsive transcription.
Literature and News
Gene Resources
Sequences
CDS Sequence:
- >107704|Micromonas_sp._RCC299|GATA|107704
ATGGCCGAAGGACCTCTCGCCTGGCTCCACCCCTTCGACTCGGGTTTGCACTCGTTTCCCAGCCTCGACGCGGGCACCGATCTCCTCGGCCTTCCCCCGTCCAACTGCGGCGTCGCCGCCAAGATGGCCTTCAGCGAGCTGGCGGCGATGGACCTGCTCCGTACTCCCTCGGGATTCGACACGCGCGCGGCGGACGCCCAGCCGCCGGAGACGTCGCGACGCGTCACCGGCGCCGACGCGACGTCGGCGGGTGAAAAGCCGTTCGCCTTCGACGCGCAAAGGGCCCACGCCGGAGCGCCCTCGACCTCTTATGTGCCCTCCGAGGTTTTCTCCCCCGTCAACTTCGGCGACGTCGACGTCGCAGACGCGCTCTTCACCAAGGGACCGCTCCGCACGCTCTCCGCGCGCACCTTCGACGCACTCCTGAACGATACGAGCCCGAGAGAGAACGAGGAGTGGACGGCGCTCCAGAACCTCCCCTCGCTGCCCAGGGACCTCACCACGGTGGACCTCGGCGTCGGGTGCGACCCGGACGACGACGACGCCAGCTGTGCGCCCGTCGCTTTTGGCGACGACCGACCCGACGACGTCGCGCTGATGGACAACGGCGACGAGGACGGGGTGCGCCTCGCCCGCGCGCGCGCCGCGACGGCCAGGGCGCGCGCCAAGGAGGCGCGGTCCATCGCCACGCAGAAGACGGCGACGGCGACCGCGCTCACCATCCAGGCGAAGGAGATGCTGAACGCGGCGGCGACGGTCGGCGACGCCGATCGCGACCCCGCCGGCGTCGCCAGCGGCTTGCGAACGCCGACGGCGATGTCGGAGCGCGGTACGTACTGGGGCGGGTACTTCGAGGGGTTCGACGCGACCACGGCGCTCTCGGGCGATGCGGTGAGCTCGCCGGCGGCGGTGGGGACGTGCGGGACCACCGCGGAGCACGTCCCCACCGCGGAGGTGCGATCCTCGTCGAGGAGGTCCACCAAGCGGGACGCGCCCTTCGACGGACCGAACGCCTCCGACGACGAGGAGTCCACCCCTTCGCGCGGCGGCGGCGGTTCGTCGGCCAAGAGGAGGAGGAAGGGCGAGGCCAACGGGGTGATCCAGGTCAATCCTCGACCCATGGCGACCGACAGCATCATCACCGCGAGTGGCAAGAAGATGAAGCGGGGATGCCTCAACTGCGGCCAGCAGAAGACGCCACAGTGGCGCATGGGACCCGAGGGACCCAAGACTCTGTGCAACGCGTGCGGCGTGAGGTTTCGAAAGGGACTCCCCCTCGACGGACCGTGA ATGGAATACAGCGAAAACACCACCGCTATCGCCACGGCGGAACCTTCTCCCTCCGCCGTGTCTCCCACCTCATCATCTTCACCACCCTCCTCTCCACCGGTGATTCTCAGCCCATGCGCCGCCTGCAAGATCCTCCGCCGCCGCTGCGCCGAGAAATGCGTCTTGGCCCCTTACTTTCCGCCCACAGAACCACTCAAATTCACCATCGCCCACCGGGTTTTCGGAGCAAGCAACATTATCAAGTTGTTGCAGGAACTTCCTGAGGCTCAAAGAGCAGATGCTGTGAACAGCATGGTGTATGAAGCCAACGCTAGGCTGAGGGATCCGGTCTACGGCTGTGCCGGTGCGATTTGCCAGCTTCAATACCAGATCAGTGAGCTCCAAGCTGAACTGGCCAAGGCTCAAGCTGAGATCCTGAACATGCAATGCCAGAACGCCAATTTGTTTGAGTTGATTGGCAAGCAAGTGGCCGCAGAAACTGATCGCCAGCCGCAAACACTACCTTCAGTTCATGAGAACACGACGTTTTTTTTCGATGAAAGTATTCTTGGAGTGGCTTGGGAGCCCCTCTGGACATGA
Protein Sequence:
- >107704|Micromonas_sp._RCC299|GATA|107704
MAEGPLAWLHPFDSGLHSFPSLDAGTDLLGLPPSNCGVAAKMAFSELAAMDLLRTPSGFDTRAADAQPPETSRRVTGADATSAGEKPFAFDAQRAHAGAPSTSYVPSEVFSPVNFGDVDVADALFTKGPLRTLSARTFDALLNDTSPRENEEWTALQNLPSLPRDLTTVDLGVGCDPDDDDASCAPVAFGDDRPDDVALMDNGDEDGVRLARARAATARARAKEARSIATQKTATATALTIQAKEMLNAAATVGDADRDPAGVASGLRTPTAMSERGTYWGGYFEGFDATTALSGDAVSSPAAVGTCGTTAEHVPTAEVRSSSRRSTKRDAPFDGPNASDDEESTPSRGGGGSSAKRRRKGEANGVIQVNPRPMATDSIITASGKKMKRGCLNCGQQKTPQWRMGPEGPKTLCNACGVRFRKGLPLDGP*