Information report for Thecc1EG016909t1
Gene Details
|
|
Functional Annotation
- Refseq: XP_017974434.1 — PREDICTED: transcription factor MYB98
- TrEMBL: A0A061EBY4 — A0A061EBY4_THECC; Myb domain protein 118, putative
- STRING: EOY02441 — (Theobroma cacao)
- GO:0010439 — Biological Process — regulation of glucosinolate biosynthetic process
- GO:0045893 — Biological Process — positive regulation of transcription, DNA-templated
- GO:1904095 — Biological Process — negative regulation of endosperm development
- GO:2000692 — Biological Process — negative regulation of seed maturation
- GO:0005634 — Cellular Component — nucleus
- GO:0043565 — Molecular Function — sequence-specific DNA binding
Family Introduction
- MYB factors represent a family of proteins that include the conserved MYB DNA-binding domain.The first MYB gene identified was the ‘oncogene’ v-Myb derived from the avian myeloblastosis virus . Evidence obtained from sequence comparisons indicates that v-Myb may have originated from a vertebrate gene, which mutated once it became part of the virus. Many vertebrates contain three genes related to v-Myb c-Myb, A-Myb and B-Myb and other similar genes have been identified in insects, plants, fungi and slime moulds. The encoded proteins are crucial to the control of proliferation and differentiation in a number of cell types, and share the conserved MYB DNA-binding domain. This domain generally comprises up to three imperfect repeats, each forming a helix-turn-helix structure of about 53 amino acids. Three regularly spaced tryptophan residues, which form a tryptophan cluster in the three-dimensional helix-turn-helix structure, are characteristic of a MYB repeat. The three repeats in c-Myb are referred to as R1, R2 and R3; and repeats from other MYB proteins are categorised according to their similarity to either R1, R2 or R3.
- In contrast to animals, plants contain a MYB-protein subfamily that is characterised by the R2R3-type MYB domain. MYB proteins can be classified into three subfamilies depending on the number of adjacent repeats in the MYB domain (one, two or three). We refer to MYB-like proteins with one repeat as ‘MYB1R factors’, with two as ‘R2R3-type MYB’ factors, and with three repeats as ‘MYB3R’ factors.
Literature and News
Gene Resources
Homologs
- Citrullus lanatus: Cla019199
- Citrus sinensis: orange1.1g044021m
- Gossypium arboreum: Cotton_A_01493_BGI-A2_v1.0
- Gossypium hirsutum: Gh_A07G2035, Gh_D12G2613, Gh_A12G2485
- Juglans regia: WALNUT_00010733-RA, WALNUT_00014765-RA
- Malus domestica: MDP0000215359
- Manihot esculenta: Manes.15G149100.1.p, Manes.14G064900.1.p
- Populus trichocarpa: Potri.001G347200.1
- Prunus mume: XP_008239795.1
- Prunus persica: Prupe.5G172400.1.p
- Vitis vinifera: GSVIVT01007731001
Sequences
CDS Sequence:
- >Thecc1EG016909t1|Theobroma_cacao|MYB|Thecc1EG016909t1
ATGGAGTTTGAAACAAGCCTGAGAGAGGATTTCCCTTTCCTTTCAAACCTTTTCTCTGATAATTCTCCCCTGAAACCTGACTTCGGAAATGGTTTTCCCTTGGATGGTTCTTCCTCTTCCAAAGGGTTGCTTCACAACTTTATCCTTCCTGATCATAATCACTACTCCCCCAGTGTCAACAATGGTTCTCTCTTGAATCCATACCATTTTCATCAATTTCCCGCTGAAGGGTCATCAAGGAATCCCTTCTTCGGGTTTTCCTCGACATGCACCGATGCTTTCGAGCCTTATGTCAATGGATTTTCTAATGATCTCAATGCTTACATCCCTTCCTTGCCTTTTGCACCAGATGGTGTCAATAACAATGGCTTTTTGCAGGGTTTTCAGAGTGAAGGCTGCTGGGATTTTTCTCAGAAAGTTCCACCTCAACCCCTGCAGCCTGAAACTCAGACTACTTATCAGCCTCTGATTTTTCAGGATCAACATGGAGCGGTGACTGCTAAGGTGGCTGATGAAGTCTCATGCGTCACCGGAGATCAAAATGGGTATCAGGACAAAGTTGATCAGAAGAAGAACAAAAGGTTCCTGACCAGGAGATGTTCCAAAGCTCCTAAGAAATCTAATATCATCAAAGGCCAATGGACTCCTCAAGAGGACAGTTATATCTATGAAGATCTGATTATGCCAATCTCTTTTCCTAGACTGCTAGTGCAGTTGGTAACAAGGCATGGAACCAAGAAATGGTCTCTGATTGCAAAGATGCTGAATGGGAGAGTGGGAAAGCAGTGTAGAGAGAGATGGCATAACCATCTAAGGCCTGATATCAAGAAGGATTCGTGGAGCGAAGATGAAGATAGGATTCTGATTGAAGCACATAAAGAAATAGGGAACAAGTGGGCTGAGATTGCCAGGAGACTACCAGGACGCACCGAGAACACTATAAAGAATCACTGGAATGCAACCAAGCGAAGGCAATACTCAAGGCGCAAAGGCAAAGACTCCAACCCCAAGGGCGCCCTTTTGCAAAGCTACATCAAGTCTGTGAGTTCAACCTCGACCCGAAAGGACAAAGGGAAAAGGGTCATGGAAGCCAATGCTCAGATGCTGATAAACAACAACCCAGCAGCAAAAAATCCGCAAGTAGTTCAAGTCCAAAGCTCAGATTTCAACCCAAAGGATTGGCCAGTTGCGGTTTACAATGCCCAAGCTGATCATCATCAGCCCATGAATTTCTCTTTCGACGCAAGTGTTTTCGGTGAAAGCTGCACCGCCAGCTTTGAATCGATGCTCGAAGAGGTTCCTTCTGGTTCTCTTGTTGAAGAGAGCAATGCCGTCATGGATTTTGAGCTGCCCCTGGAAATGGATTCCGCGAAGAAGGAGCTGGATTTGCTGGAGATGATCTCTCAAGGAAATCTCTAA
Protein Sequence:
- >Thecc1EG016909t1|Theobroma_cacao|MYB|Thecc1EG016909t1
MEFETSLREDFPFLSNLFSDNSPLKPDFGNGFPLDGSSSSKGLLHNFILPDHNHYSPSVNNGSLLNPYHFHQFPAEGSSRNPFFGFSSTCTDAFEPYVNGFSNDLNAYIPSLPFAPDGVNNNGFLQGFQSEGCWDFSQKVPPQPLQPETQTTYQPLIFQDQHGAVTAKVADEVSCVTGDQNGYQDKVDQKKNKRFLTRRCSKAPKKSNIIKGQWTPQEDSYIYEDLIMPISFPRLLVQLVTRHGTKKWSLIAKMLNGRVGKQCRERWHNHLRPDIKKDSWSEDEDRILIEAHKEIGNKWAEIARRLPGRTENTIKNHWNATKRRQYSRRKGKDSNPKGALLQSYIKSVSSTSTRKDKGKRVMEANAQMLINNNPAAKNPQVVQVQSSDFNPKDWPVAVYNAQADHHQPMNFSFDASVFGESCTASFESMLEEVPSGSLVEESNAVMDFELPLEMDSAKKELDLLEMISQGNL*