Information report for Thecc1EG021168t4
Gene Details
Functional Annotation
- Refseq: XP_007035525.2 — PREDICTED: protein ALWAYS EARLY 3 isoform X3
- Refseq: XP_007035526.2 — PREDICTED: protein ALWAYS EARLY 3 isoform X1
- Swissprot: Q6A332 — ALY3_ARATH; Protein ALWAYS EARLY 3
- TrEMBL: A0A061EW03 — A0A061EW03_THECC; Always early, putative isoform 4
- STRING: EOY06452 — (Theobroma cacao)
- GO:0005730 — Cellular Component — nucleolus
- GO:0016592 — Cellular Component — mediator complex
- GO:0003677 — Molecular Function — DNA binding
Family Introduction
- A novel myb-like gene (AtmybL2) was isolated from an Arabidopsis thaliana cDNA library. The single-copy gene was localised to chromosome I, and its transcript is found preferentially in leaves. The predicted gene product consists of a conserved N-terminal myb domain, known to be involved in DNA binding, and a unique proline-rich C-terminal region. Remarkably, the myb domain includes only one of the two or three tryptophan repeats typical of other myb-like proteins.
Gene Resources
Homologs
- Citrus sinensis: orange1.1g001272m, orange1.1g001101m, orange1.1g001099m
- Gossypium arboreum: Cotton_A_04877_BGI-A2_v1.0
- Gossypium hirsutum: Gh_A02G0465, Gh_D02G0520
- Manihot esculenta: Manes.15G014400.2.p, Manes.15G014400.1.p
- Populus trichocarpa: Potri.008G216600.1
- Ricinus communis: 29726.m004015
Sequences
CDS Sequence:
- >Thecc1EG021168t4|Theobroma_cacao|MYB_related|Thecc1EG021168t4
ATGGCGCCATCTAGAAAATCTAAAAGTGTAAATAAGAAGTTTTCTTATGTTAATGAGGTTGCTTCTAGTAAAGATGGAGATAGTAGTGCTAAGAGAAGCGGGCAACGGAAAAGGAAGTTGTCTGACATGTTAGGGCCTCAATGGACTAAGGAAGAGCTTGAGCGTTTCTATGAAGCGTATCGCAAGTATGGGAAAGATTGGAAGAAGGTTGCTACTGTGGTACGAAATCGACAACCCTCTAAAAGTTTAGATAAGTCATTTCCTGATCTTTTGCAATTTCATTCAGCTGCATCAAGTTATGGTTGCTTGTCATTGTTGAAGAGGAGACGCTCTGAAAGTAGGCCCCGTGCTGTTGGAAAAAGGACTCCTCGTGTTCCTATTTCTTTTTCTCATGACAAAAACAAAGGAGAAAGGTACTTTTCACCTATTAGGCAGGGCATGAAACTAAAGGTGGATACCGTTGATGATGATGTTGCTCATGAGATAGCATTAGTTTTGACGGAGGCATCACAAAGAGGTGGATCTCCTCAAGTTTCTCGAACACCAAACAGAAAAGCAGAGGCATCTTCACCTATCCTCAACAGTGAAAGGATGAATGCTGAGTCAGAAACTACTAGTGCCAAGATTCATGGTAGTGAAATGGATGAGGATGCTTGTGAATTGAGCTTAGGAAGCACTGAAGCTGATAATGCTGATTATGCTAGAGGTAAAAATTATTCAATGAATATAGAAGGGACTGGTACCATTGAAGTTCAACAGAAGGGAAAAAGATACTACAGAAGGAAGCCAGGGGTTGAGGAAAGTGTAAACAATCATCTGGAAGACACAAAAGAAGCCTGTAGTGGGACGGAAGAAGATCAAAAGTTATGTGATTTCAAGGGAAAGTTTGAAGCAGAGGTTGCAGATACCAAACCTTCTAGAGGCTCCATCAAGGGTCTAAGGAAAAGAAGTAAAAAAGTGTTGTTTGGGAGAGTTGAAGACACTTCCTTTGATGCCCTGCAAACTCTAGCAGATCTGTCCTTGATGATGCCAGAAACTGCTGCTGATACTGAGTCATCTGTGCAGTTCAAGGAAGAGAAAAATGAAGTTGTTGAGAAGACTAAACTGAAAGGAAACCATCCTGTTTCTGGAGCTAAAGGCACTGCCCCCAAAACATGTAAACAGGGAAAAGTTTTTGGTCATGATGTTCGTGCTATTCCCGAGGCAAAGGAGGAAACACACCCAGGTAATGTTGGAATGCGGAAAAGGAGACAGAAGTCCTCACCATATAAATTGCAGATTCCAAAAGATGAAACTGATGCTGATTCTCATTTGGGTGAATCTCGAAACATTGAGGCTTTAGATGAGGTAAAGAATTTTCCAAGCAAAGGTAAACGCTCTAATAATGTTGCACATTCAAAGCAAGGGAAATCAGTGAGACCTCCAGAGCATCGTTCCTCAAGTACTGATCATGGAAGGGACTTGAACAATTCAACTCCATCTACCATACAGGTTTCACCTGTTAACCAGGTCAACCTACCCACAAAAGTCAGGAGTAAGAGAAAGATAGATGCACAGAAACAAGTGATTGGGAAGGATATAAAGTCCTCTGATGGTATTGTGAAGGGAAAATTTAGTGTTCCAGTTAGTTTATTCCATGACAGAGCACTCAATCTGAAGGAAAAGCTTTGTAACTTCCTATGTCCATATCAAGCACGGAGATGGTGTACCTTTGAGTGGTTCTGTAGTACAATTGATTATCCATGGTTTGCTAAAAGGGAGTTTGTGGAGTATTTGGATCATGTAGGATTGGGTCATGTTCCAAGATTAACTCGTGTTGAATGGGGTGTCATAAGGAGTTCCCTTGGCAAGCCACGAAGGTTTTCTGAGCAATTTTTGAAGGAAGAAAGAGAGAAGCTTTATCAATATCGGGAATCTGTTAGAACGCATTATGCTGAACTCCGTGCTGGTATTGGTGAAGGACTTCCAACTGATTTAGCTCGACCTCTATCAGTTGGACA
GCGTGTTATTGCTATTCATCCAAAAACTAGAGAGATTCATGATGGAAATGTGTTAATTGTTGACCATAGTAGGTACCGGATTCAATTTGACAGCACTGAGCTAGGAGTGGAATCTGTCATGGATATTGATTGTATGGCTTTAAATCCATTGGAAAATTTGCCTGCTTCCCTTGTGAGACAAAATGCTGCTGTCAGGAAATTTTTTGAAAACTACAATGAGCTCAAAATGAACGGGCAGCCAAAAGAAAGCAAGATGGAAGAGAACATCAAATTTGCTCCGTGTGAGGAGAATGCCAATAGTCCCTCTCGAACTTCCCCATCAACTTTCAGTGTTGGCAATTTATCACAACCTGTTAAGGTAAGTTATGACTGCAAAATGGTTCTCGTTTCATGA
Protein Sequence:
- >Thecc1EG021168t4|Theobroma_cacao|MYB_related|Thecc1EG021168t4
MAPSRKSKSVNKKFSYVNEVASSKDGDSSAKRSGQRKRKLSDMLGPQWTKEELERFYEAYRKYGKDWKKVATVVRNRQPSKSLDKSFPDLLQFHSAASSYGCLSLLKRRRSESRPRAVGKRTPRVPISFSHDKNKGERYFSPIRQGMKLKVDTVDDDVAHEIALVLTEASQRGGSPQVSRTPNRKAEASSPILNSERMNAESETTSAKIHGSEMDEDACELSLGSTEADNADYARGKNYSMNIEGTGTIEVQQKGKRYYRRKPGVEESVNNHLEDTKEACSGTEEDQKLCDFKGKFEAEVADTKPSRGSIKGLRKRSKKVLFGRVEDTSFDALQTLADLSLMMPETAADTESSVQFKEEKNEVVEKTKLKGNHPVSGAKGTAPKTCKQGKVFGHDVRAIPEAKEETHPGNVGMRKRRQKSSPYKLQIPKDETDADSHLGESRNIEALDEVKNFPSKGKRSNNVAHSKQGKSVRPPEHRSSSTDHGRDLNNSTPSTIQVSPVNQVNLPTKVRSKRKIDAQKQVIGKDIKSSDGIVKGKFSVPVSLFHDRALNLKEKLCNFLCPYQARRWCTFEWFCSTIDYPWFAKREFVEYLDHVGLGHVPRLTRVEWGVIRSSLGKPRRFSEQFLKEEREKLYQYRESVRTHYAELRAGIGEGLPTDLARPLSVGQRVIAIHPKTREIHDGNVLIVDHSRYRIQFDSTELGVESVMDIDCMALNPLENLPASLVRQNAAVRKFFENYNELKMNGQPKESKMEENIKFAPCEENANSPSRTSPSTFSVGNLSQPVKVSYDCKMVLVS*
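The CDS and protein records above can be cross-checked by translating the nucleotide sequence with the standard genetic code. The following is a minimal sketch, not part of the database record: the function name `translate` and the table construction are illustrative assumptions, and it presumes the CDS is in frame, uses only A/C/G/T, and that any line breaks in the sequence are removed before translation.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered with
# bases T, C, A, G at each position; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame CDS into a protein string (hypothetical helper).

    Whitespace and line breaks are stripped first. A codon containing any
    character other than A, C, G or T raises KeyError.
    """
    seq = "".join(cds.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))


# First 27 nucleotides of the CDS listed above.
print(translate("ATGGCGCCATCTAGAAAATCTAAAAGT"))  # MAPSRKSKS
```

Applied to the full CDS, the result should end in `*`, corresponding to the terminal TGA stop codon and the trailing `*` in the protein record.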