Information report for EuGene.0200010371
Gene Details
|
|
Functional Annotation
- Refseq: XP_001416350.1 — predicted protein
- TrEMBL: A4RTA3 — A4RTA3_OSTLU; Uncharacterized protein
- STRING: ABO94643 — (Ostreococcus ’lucimarinus')
- GO:0006355 — Biological Process — regulation of transcription, DNA-templated
- GO:0003700 — Molecular Function — transcription factor activity, sequence-specific DNA binding
- GO:0008270 — Molecular Function — zinc ion binding
- GO:0043565 — Molecular Function — sequence-specific DNA binding
- GO:0044212 — Molecular Function — transcription regulatory region DNA binding
Family Introduction
- GATA factors were first identified as proteins that interact with conserved WGATAR (W = T or A; R = G or A) motifs involved in erythroid-specific gene expressionin vertebrates.
- GATA factors are characterised by the presence of conserved, type-IV zinc-finger motifs Animal factors typically contain two C-x2-Cx17-C-x2-C zinc-finger domains. The majority of known fungal GATA factors contain a single C-x2-C-x17-C-x2-C finger with greatest similarity to the carboxyl (C) terminal finger of animal GATA factors.Several examples of fungal GATA factors containing a variant C-x2-C-x18-C-x2-C DNA-binding domain are also known.
- Examples of both C-x2-C-x17-Cx2-C (Type IVa) and C-x2-C-x18-C-x2-C (Type IVb) GATA factors are found within fungi; animals onlycontain the former configuration, and plants only the latter. Plant GATA factors typically contain a single zinc finger. The Arabidopsis type-IV zinc-finger proteins may represent the previously defined family of nuclear GATA-binding proteins implicated in light-responsive transcription.
Literature and News
Gene Resources
Homologs
- Brassica napus: GSBRNA2T00028419001, GSBRNA2T00144343001
- Brassica oleracea: XP_013585555.1
- Brassica rapa: XP_009129774.1, XP_009112489.1, XP_009151727.1
- Cicer arietinum: XP_004494549.1, XP_012569576.1
- Medicago truncatula: Medtr7g112330.1, Medtr5g020230.1
- Musa acuminata: GSMUA_Achr11P17770_001, Ma11_g16340
- Nicotiana tabacum: XP_016478905.1
- Raphanus raphanistrum: RrC3297_p1
- Solanum lycopersicum: Solyc09g075610.2.1
- Solanum tuberosum: PGSC0003DMP400055095
Sequences
CDS Sequence:
- >EuGene.0200010371|Ostreococcus_sp._RCC809|GATA|EuGene.0200010371
ATGACGCAGGCGATGGACGCGTCGACGTCGGGGCAGTTGTTGCCCGGCGTGGCGGGGAAGCGGTGCGCGCACTGTAACACGCACACGACGCCGTTGTGGCGGAACGGACCGGATGGGCCGAAGACGTTGTGTAACGCGTGCGGGGTGAGAGATAATAGAAGACACGCCAAGGCGAATCGAGTGCAGAAACCGTCTGCGCCCAAGGCGCCCAAGGCGAGCAAGTCTAACGGGAAGGGTGGCGATAAGAGGAAGCGCGGGGATGCCGCTTCACCGGGAAGAGGGGGTAAGAAAGACGCGAAGAAAGCAAAGCCGGCGCGCAACTACTTTGCGCAAAAGGTGGACATTCATGTGCCGAGCTTTCACGAGGTCGCGGATTATGAAATGTCGCACGCGGGGGGATTCAGACAGCCGAACGCGTACCTGCGCGAAAACGTTGCCGATCACCAGCTCAACCGTTACGGACCGGGAACCGCCGCGCCGATGTATGAAGCGACGCCGGCCGACTTTGAGTGGCTCGAACACATGAACTCGGAGCCACTCGAAGGGAAGGGGACGGTGTGCGCGACGACGCCGAACGCGCAGTACATGCGCCCGGAACACTTGGAGAAGCTCTTTGACACGTTTGAAGAAACCTCTTGGGCTTCATCCGCGATTCCGACGCAAGAACAGGCGGCACAGGTTGTGCTCGGGAGCGTGTTCGGTGCGGTGAACACGCCGGAGAGCAAAATGGAGGATGTTTTGCACTGGGCCTCAAACGCGCAGTTGGGCGACGGACAGCTCGTCAATTTGAAGTCTGAGGTCGAGGGTTGGAATCCGTTGGATTTGCACGACGAAAACTCGAACCAAAGCTCGTCCGAGACGGCCGCGCCGTTGACACCGAGCGGAAGCGCAGAGTTTTTGCTCAAGCGTACGTCTCAGTCTGGAAGCATTCCGTCTGAAAACGGCGATGATGACTCATCGACGCAATCCGCCGACGATTTGATCGAACGTCGGCTCGCGTCGAACACGGATCGTCAGCAGTCGGAAAACTTAAAGAGCAGTGATCTCGACACCCGCGCCCAAGTGGGAAAACTGCGTCGCGCTGGATATTTCTTTTCCAGACGCGTCGCGCAGGTGTCGCGCCCCCGGATTGGTGGCAAGAGTGAGCTCAAGAACGCAATCCAAACCATTAACCAAATCAACGAGAAAATTTTCAAGATATGCAAACACGCCCCTTCTTTTGAGGTGATTTGCAAAGTCTACCGTTACTGGCTAAAAAAGCGTTGGAAGAATGGAGGAAAACCTCTTCTGAAGCGTTTTGATCCGGTGCCGCCGTTGCGGCTCAGAGAGCGTCCAGAGGTCGCGACGGAATCAGAGCAACTCGCTTATTTATTCTGTTTCAATCTCGAGGCGGTTTTCAGGCAGCAACAGGAACGAGCAGTCAGAGTGGAACAGGCCGAGGCGCAGAAAAAGCGTCGCCGCCGTCCTTGTTTCAATCCCGCCGCGACGAAGCGTCGACGTCGCGTCCAAGCCGCGATTGACGACGCGGTGTCAAAGCCTCTGACCATCTCATTCGAACCGCTCGACAAACTCTTCGCGCACGGCTGGGAAGTGGTCGAGTACCAACCCAAGCCGGTCGTTCCGGTCAAGGAACAGAAGACGCCGAAAAAGGAACCAACCGTCAAGAAGCAGAGCAAACCCGCCGCCGACGCCGCTCGGTTGCCGACGCGTTCGCCGAAGTCCGTCAAGGCTGAAAAGCTCGAGTCTCCCGCGGAGAATGCGGTCTTCGACGATAGCGTTGGTCGCCGATCGGAGGCAAAGTCCGTCAAATCACCGATGGGCGCCGCCGCGGCAAAGGCGGGTGCGCTCGTCAAGAACTTTGTCTCTTCGGTGGGCTGCGCCTTCGGTTACGGCAACAACGGGAAAAATAATCACGAAATGGTAAATGGATCTTCCGGGGATCCCAACGCGCCGTCGACACCTACGACGAAGCGTCGCTCGGCGCGTGTAAAATCAAGCGACTGGACCGTGGCGAGATTGCCTTAA
Protein Sequence:
- >EuGene.0200010371|Ostreococcus_sp._RCC809|GATA|EuGene.0200010371
MTQAMDASTSGQLLPGVAGKRCAHCNTHTTPLWRNGPDGPKTLCNACGVRDNRRHAKANRVQKPSAPKAPKASKSNGKGGDKRKRGDAASPGRGGKKDAKKAKPARNYFAQKVDIHVPSFHEVADYEMSHAGGFRQPNAYLRENVADHQLNRYGPGTAAPMYEATPADFEWLEHMNSEPLEGKGTVCATTPNAQYMRPEHLEKLFDTFEETSWASSAIPTQEQAAQVVLGSVFGAVNTPESKMEDVLHWASNAQLGDGQLVNLKSEVEGWNPLDLHDENSNQSSSETAAPLTPSGSAEFLLKRTSQSGSIPSENGDDDSSTQSADDLIERRLASNTDRQQSENLKSSDLDTRAQVGKLRRAGYFFSRRVAQVSRPRIGGKSELKNAIQTINQINEKIFKICKHAPSFEVICKVYRYWLKKRWKNGGKPLLKRFDPVPPLRLRERPEVATESEQLAYLFCFNLEAVFRQQQERAVRVEQAEAQKKRRRRPCFNPAATKRRRRVQAAIDDAVSKPLTISFEPLDKLFAHGWEVVEYQPKPVVPVKEQKTPKKEPTVKKQSKPAADAARLPTRSPKSVKAEKLESPAENAVFDDSVGRRSEAKSVKSPMGAAAAKAGALVKNFVSSVGCAFGYGNNGKNNHEMVNGSSGDPNAPSTPTTKRRSARVKSSDWTVARLP*