Information report for 341076
Gene Details
Functional Annotation
- TrEMBL: D7L7U3 — D7L7U3_ARALL; Uncharacterized protein
- STRING: fgenesh1_pg.C_scaffold_3000839 — (Arabidopsis lyrata)
- GO:0006352 — Biological Process — DNA-templated transcription, initiation
- GO:0006355 — Biological Process — regulation of transcription, DNA-templated
- GO:0003677 — Molecular Function — DNA binding
- GO:0008270 — Molecular Function — zinc ion binding
- GO:0017025 — Molecular Function — TBP-class protein binding
Family Introduction
- MYB factors represent a family of proteins that include the conserved MYB DNA-binding domain.The first MYB gene identified was the ‘oncogene’ v-Myb derived from the avian myeloblastosis virus . Evidence obtained from sequence comparisons indicates that v-Myb may have originated from a vertebrate gene, which mutated once it became part of the virus. Many vertebrates contain three genes related to v-Myb c-Myb, A-Myb and B-Myb and other similar genes have been identified in insects, plants, fungi and slime moulds. The encoded proteins are crucial to the control of proliferation and differentiation in a number of cell types, and share the conserved MYB DNA-binding domain. This domain generally comprises up to three imperfect repeats, each forming a helix-turn-helix structure of about 53 amino acids. Three regularly spaced tryptophan residues, which form a tryptophan cluster in the three-dimensional helix-turn-helix structure, are characteristic of a MYB repeat. The three repeats in c-Myb are referred to as R1, R2 and R3; and repeats from other MYB proteins are categorised according to their similarity to either R1, R2 or R3.
- In contrast to animals, plants contain a MYB-protein subfamily that is characterised by the R2R3-type MYB domain. MYB proteins can be classified into three subfamilies depending on the number of adjacent repeats in the MYB domain (one, two or three). We refer to MYB-like proteins with one repeat as ‘MYB1R factors’, with two as ‘R2R3-type MYB’ factors, and with three repeats as ‘MYB3R’ factors.
Literature and News
Gene Resources
Homologs
- Arabidopsis thaliana: AT3G09370.2, AT3G09370.1, AT3G09370, AT3G09360, AT2G45100
- Brassica napus: GSBRNA2T00111710001, GSBRNA2T00080677001, GSBRNA2T00073268001
- Brassica oleracea: XP_013619277.1, XP_013638377.1
- Brassica rapa: XP_009146933.1, XP_009123357.1
- Raphanus raphanistrum: RrC641_p4
Sequences
CDS Sequence:
- >341076|Arabidopsis_lyrata|MYB|341076
ATGGTGTGGTGTAACCACTGCGTGAAGAATGTTCCCGGAATACGCCCTTATGATGGTGCATTGGCATGTAATTTATGTGGGAGGATATTGGAGAACTTCAATTTTTCTACAGAAGTTACATTCGTTAAGAATGCTGCTGGACAGAGCCAAGCCTCAGGTAACATAGTGAGCAGTGTTCAAAGTGGGATTCCAAGCTCACGTGAAAGGAGATATAGAATAGCTAGAGATGAGTTTACGAATTTGAGAGATGCATTGGGAATTGGTGATGAGAGAGCTGATGTGATTGACATGGCTGTTCTATTCTTTAAATCGGCAGTTGAGCAGAATTTCACTAAAGGCCGCAGAACTGAACTAGTACAGGCTTCCTGTCTCTACTTGACTTGCAGGGAATTGAACGTTCCGTTTCTTCTTATTGATTTCTCAAGCTACCTTCGAGTTAGCGTTTATGAGTTAGGTTCTGTGTACTTGCAACTCTGTGAAATGCTGTACATTGCGGATAATCAAAATTATGAAAAGCTTGTTGACCCTTCAATTTTCATCGATCGATTCTCAAACATCTTATTGAAAGGAACACATAATAAAGCTGTTGTGAAAACAGCTATAGCCATTATAGCTAGTATGAAGCGAGATTGGATACAGACCGGCCGGAAGCCTAGTGGAATATGTGGAGCAGCACTTTACACAGCTGCCCTTTCTCATGGTATCAAGTGCTCTAAGTCAGATATTGTAAACATTGTGCATATATGTGAAGCAACTCTAACCAAACGATTGATTGAGTTTGGAAATACGGAGTCTGGAAATTTAAATGTTGATGAGATCACAGAAAGAGAATCTCATAAAAGATCTTCTACCATGAAACCAACCTCAAACAAAGAGGCGGTGCTCTGTATGCATCAGGATAGTAAACCTTTTGGTTATGGACTATGTAAAGACTGCTACGAAGATTTCATAAATGTTTCTGGTGGACTTGTTGGTGGGTCCGACCCTCCTGCTTTCCAGCGCGCAGAGAATGAGCGAATGGAAAAAGCAGCTAGAGAAGAAAACGAGGGAGGAATTAGTAGCCTAAACCACGATGAACAACTATATGATCTTATACTAAAAATCTCTTGTGCCGAACTTTTAACAGTCAGACTATTGCAGTATGAGCAAAAGCGAAAAACTATTTTCTGTTCTTCACTGTCGTTTGAGCGCTTGAGAATCTATTTGTGCTGTGTCGCAGAGAAAGGTGAAAGAAATAAAGATGGGGATGAAGAACATGCTGATACTTCGGATGAATCTGACAACTTTTCTGACATCAGTGATGATGAGGTAGATGGCTATATCAACAACGAGGAGGAAACGCACTATAAGACGATCACATGGACAGAAATGAACAAAGATTATCTTGAGGAGCAAGCCGCTAAGGAAGCAGCTCTGAAGGCGGCTAGTGAAGCTTTAAAAGCTAGCAACTCTAATTGCCCAGAAGATGCAAGAAAGGCTTTCGAAGCTGCCAAAGCTGATGCGGCAAAATCTAGAAAGGAAAAGCAACAAAAAAAAGCTGAGGAAGCAAAGAACGCGGCTCCCCCAGCCACAGCAATGGAAGCGGTTCGCCGAACGCTTGAGAAAAAGAGACTAAGTTTGGTGATCAATTACGATGTTTTGGAGGAGCTATTTGATACATCCAGCAATGAACATGAGAAAGGTGAAAATGAAGATGAAGCTGAAGAGGATGAAGAAGAAGGCAGTGTAGAATCATACGACATGAACACAGATTTTCAAAACGGGGAGAAATTCTATGAAGAGGACGAGGGAGAAGAAGAGGATGTAAAATGTGTCGGTCCGGACGATTTTACATGTAGCATGGTTTTACAGATGGATCTTCAGGAAGAAGCAGGTGAAGTGAAAGTTGAGGATCAGTGTGTAGAAAACAAGCAATCAACACCTGCTTCGTGCTCTTCTGTATCTGAAGGTAGTGCTGGTAGTTCTCACAAGTCACCTACAATTGCAAGTCCTCCTGCCACAGTTTCACCAACTCATAGATACCTCGGGAGGACGAGTGGCCCTATTAGGCGAGCGAAAGGTGGATGGACTCCGGCAGAGGATGAGACATTGAGACGAGCAGTTGGCACGTATAAAGGGAAGAGCTGGAAGAATATAGCGAAATTTTTCCCTGATAGAACTGAAGTTCAATGCCTGCACCGGTGGCAGAAAGTTCTAAATCCAGACCTTATTAAGGGACCTTGGACACAAGAGGAGGATGAGAAAATCGTTGAACTCGTTGAAAAATACGGGCCTGCGAAATGGTCTGTTATCGCACAGTCTTTACCAGGTCGAATTGGGAAGCAATGTCGAGAAAGGTGGCACAACCATTTGAATCCTGATATTAACAAGGATGCTTGGACCTCAGAGGAAGAAGTAGCTCTCATGAATGCTCATCGAAGCCACGGAAACAAATGGGCTGAAATTGCTAAGGTCTTACCTGGCAGGACTGATAATGCAATAAAGAACCATTGGAATAGTTCACTGAAAAAGAAGTCAGAATTCTACTCAATGACTGGTAGGTTACCACCACCAACAACAGCAAAGAACGGTGTTCCTGATAGTGTAACTAAACGTTCATCGTCCTCTCAGAAAAGGGTTTTTGGTTCAGTTACTCAAACCTCATCAGGAACTACAGATAAAAACAATCCCGATGAAGACAGAAATGGCCAAATAAACTCTACTGTTCCGGTCGAAGAAGTAGTAGCTGCTTCACGAATGACTGGTGTTAATGAGTATGCTCGTTCTCCTCAGTTGCCTAACCCAGAGCCATTGCCAGAGAATGGTGGAGCTGCAAATAATGGTTATCATCTGTACTACACGCCTCAAATAGAGTATTACATGGCGTCAGAAGTGGATACGCAGCGTATGTATGGGTATGAATGTGGTTGCAGTCCAAGCGCATCACCAGTTAGCTTCTTCACTCCACCGCCATGTAGAAATGCGTACAGTAACGGTTCAACTCCAAGAAGTCCTGAATCTTACCTGAGAGAAGCTGCTAGAACTTACCCAAACACACCATCCATTTTCAGGAAAAGACGACCCAGGGTTGTTGTTGAGGATAACAACAATGCCGAGAAAACAGATGAAGCTAAAGAGGTTGATCAAAAGGTGAATGATGGTAAGGATAGTTCAGAAAGCCCTAACTGTGAAGAGATTCAAAAGAATGGATCAAATGCTTACAATCTATCTCCTCCGTATCGGATAAGATCGAAAAGAACAGCAGTTTTCAAATCAAGACAGCTTGAGTTTATATCTGCAGAGGAAGAGAAAGCTGATGATGAAACCAAATCATCTGAGAAAGATATGTTGATTGATGGAGATTCTCAACTCCTAATCCCTAGCGAGGAGATATATGGAGGCTCAGAGTACAAAATCGTAGAGTATGAGAGAACTGTCTATGTGAGGTTTGAGGCGAAGAATGTGACTTACAAGCTGCATAAAGGTAGATGGCGAAAGGCAGACTTGGCGATAATGAATCATGGATTGAATCTTCCTTATCTATATCATTTTGTGATAGAGAATGTTTTCTACCGTTATAATAAAAGGAGGATCGATTGGTATGACTCCAAAGAAAGATCATGGACAACTTTGAAGGGTTTGGAAAGATTACCTAGCACTTTATCACGTTCTAATCGTCTTAAATTGGCGATTATGGTGGAAAAATGGTACTTTTGTGGGAAGAGTATGTGTTTGTTAACAACCATCAAGAGACGATAA
Protein Sequence:
- >341076|Arabidopsis_lyrata|MYB|341076
MVWCNHCVKNVPGIRPYDGALACNLCGRILENFNFSTEVTFVKNAAGQSQASGNIVSSVQSGIPSSRERRYRIARDEFTNLRDALGIGDERADVIDMAVLFFKSAVEQNFTKGRRTELVQASCLYLTCRELNVPFLLIDFSSYLRVSVYELGSVYLQLCEMLYIADNQNYEKLVDPSIFIDRFSNILLKGTHNKAVVKTAIAIIASMKRDWIQTGRKPSGICGAALYTAALSHGIKCSKSDIVNIVHICEATLTKRLIEFGNTESGNLNVDEITERESHKRSSTMKPTSNKEAVLCMHQDSKPFGYGLCKDCYEDFINVSGGLVGGSDPPAFQRAENERMEKAAREENEGGISSLNHDEQLYDLILKISCAELLTVRLLQYEQKRKTIFCSSLSFERLRIYLCCVAEKGERNKDGDEEHADTSDESDNFSDISDDEVDGYINNEEETHYKTITWTEMNKDYLEEQAAKEAALKAASEALKASNSNCPEDARKAFEAAKADAAKSRKEKQQKKAEEAKNAAPPATAMEAVRRTLEKKRLSLVINYDVLEELFDTSSNEHEKGENEDEAEEDEEEGSVESYDMNTDFQNGEKFYEEDEGEEEDVKCVGPDDFTCSMVLQMDLQEEAGEVKVEDQCVENKQSTPASCSSVSEGSAGSSHKSPTIASPPATVSPTHRYLGRTSGPIRRAKGGWTPAEDETLRRAVGTYKGKSWKNIAKFFPDRTEVQCLHRWQKVLNPDLIKGPWTQEEDEKIVELVEKYGPAKWSVIAQSLPGRIGKQCRERWHNHLNPDINKDAWTSEEEVALMNAHRSHGNKWAEIAKVLPGRTDNAIKNHWNSSLKKKSEFYSMTGRLPPPTTAKNGVPDSVTKRSSSSQKRVFGSVTQTSSGTTDKNNPDEDRNGQINSTVPVEEVVAASRMTGVNEYARSPQLPNPEPLPENGGAANNGYHLYYTPQIEYYMASEVDTQRMYGYECGCSPSASPVSFFTPPPCRNAYSNGSTPRSPESYLREAARTYPNTPSIFRKRRPRVVVEDNNNAEKTDEAKEVDQKVNDGKDSSESPNCEEIQKNGSNAYNLSPPYRIRSKRTAVFKSRQLEFISAEEEKADDETKSSEKDMLIDGDSQLLIPSEEIYGGSEYKIVEYERTVYVRFEAKNVTYKLHKGRWRKADLAIMNHGLNLPYLYHFVIENVFYRYNKRRIDWYDSKERSWTTLKGLERLPSTLSRSNRLKLAIMVEKWYFCGKSMCLLTTIKRR*