Information report for Thecc1EG021168t3
Gene Details
Functional Annotation
- Refseq: XP_007035526.2 — PREDICTED: protein ALWAYS EARLY 3 isoform X1
- Swissprot: Q6A332 — ALY3_ARATH; Protein ALWAYS EARLY 3
- TrEMBL: A0A061EP68 — A0A061EP68_THECC; Always early, putative isoform 3
- STRING: EOY06452 — (Theobroma cacao)
- GO:0005730 — Cellular Component — nucleolus
- GO:0016592 — Cellular Component — mediator complex
- GO:0003677 — Molecular Function — DNA binding
Family Introduction
- A novel myb-like gene (AtmybL2) was isolated from an Arabidopsis thaliana cDNA library. The single-copy gene was localised to chromosome I, and a gene-specific transcript is found preferentially in leaves. The predicted gene product consists of a conserved N-terminal myb domain, known to be involved in DNA binding, and a unique proline-rich C-terminal region. Remarkably, the myb domain includes only one of the two or three tryptophan repeats typically found in other myb-like proteins.
Literature and News
Gene Resources
Homologs
- Citrullus lanatus: Cla021725
- Citrus sinensis: orange1.1g001101m, orange1.1g001099m, orange1.1g001272m
- Gossypium arboreum: Cotton_A_04877_BGI-A2_v1.0
- Gossypium hirsutum: Gh_A02G0465, Gh_D02G0520
- Manihot esculenta: Manes.15G014400.2.p, Manes.15G014400.1.p
- Populus trichocarpa: Potri.008G216600.1
- Prunus mume: XP_016647952.1
- Prunus persica: Prupe.1G139300.6.p, Prupe.1G139300.1.p
- Ricinus communis: 29726.m004015
Sequences
CDS Sequence:
- >Thecc1EG021168t3|Theobroma_cacao|MYB_related|Thecc1EG021168t3
ATGGCGCCATCTAGAAAATCTAAAAGTGTAAATAAGAAGTTTTCTTATGTTAATGAGGTTGCTTCTAGTAAAGATGGAGATAGTAGTGCTAAGAGAAGCGGGCAACGGAAAAGGAAGTTGTCTGACATGTTAGGGCCTCAATGGACTAAGGAAGAGCTTGAGCGTTTCTATGAAGCGTATCGCAAGTATGGGAAAGATTGGAAGAAGGTTGCTACTGTGGTACGAAATCGATCTGTGGAAATGGTAGAAGCTCTGTACACTATGAATAGGGCCTACTTATCTCTCCCGGAAGGCACTGCTTCTGTGGTTGGACTCATAGCGATGATGACTGATCACTATTGTGTTATGGGAGGAAGTGATAGTGAACAAGAAAGCAATGAGGGCGTGGGAGCTTCTCGGAAACCTCAGAAGCGTAGTAGGGGAAAACTTCGAGATCAACCCTCTAAAAGTTTAGATAAGTCATTTCCTGATCTTTTGCAATTTCATTCAGCTGCATCAAGTTATGGTTGCTTGTCATTGTTGAAGAGGAGACGCTCTGAAAGTAGGCCCCGTGCTGTTGGAAAAAGGACTCCTCGTGTTCCTATTTCTTTTTCTCATGACAAAAACAAAGGAGAAAGGTACTTTTCACCTATTAGGCAGGGCATGAAACTAAAGGTGGATACCGTTGATGATGATGTTGCTCATGAGATAGCATTAGTTTTGACGGAGGCATCACAAAGAGGTGGATCTCCTCAAGTTTCTCGAACACCAAACAGAAAAGCAGAGGCATCTTCACCTATCCTCAACAGTGAAAGGATGAATGCTGAGTCAGAAACTACTAGTGCCAAGATTCATGGTAGTGAAATGGATGAGGATGCTTGTGAATTGAGCTTAGGAAGCACTGAAGCTGATAATGCTGATTATGCTAGAGGTAAAAATTATTCAATGAATATAGAAGGGACTGGTACCATTGAAGTTCAACAGAAGGGAAAAAGATACTACAGAAGGAAGCCAGGGGTTGAGGAAAGTGTAAACAATCATCTGGAAGACACAAAAGAAGCCTGTAGTGGGACGGAAGAAGATCAAAAGTTATGTGATTTCAAGGGAAAGTTTGAAGCAGAGGTTGCAGATACCAAACCTTCTAGAGGCTCCATCAAGGGTCTAAGGAAAAGAAGTAAAAAAGTGTTGTTTGGGAGAGTTGAAGACACTTCCTTTGATGCCCTGCAAACTCTAGCAGATCTGTCCTTGATGATGCCAGAAACTGCTGCTGATACTGAGTCATCTGTGCAGTTCAAGGAAGAGAAAAATGAAGTTGTTGAGAAGACTAAACTGAAAGGAAACCATCCTGTTTCTGGAGCTAAAGGCACTGCCCCCAAAACATGTAAACAGGGAAAAGTTTTTGGTCATGATGTTCGTGCTATTCCCGAGGCAAAGGAGGAAACACACCCAGGTAATGTTGGAATGCGGAAAAGGAGACAGAAGTCCTCACCATATAAATTGCAGATTCCAAAAGATGAAACTGATGCTGATTCTCATTTGGGTGAATCTCGAAACATTGAGGCTTTAGATGAGGTAAAGAATTTTCCAAGCAAAGGTAAACGCTCTAATAATGTTGCACATTCAAAGCAAGGGAAATCAGTGAGACCTCCAGAGCATCGTTCCTCAAGTACTGATCATGGAAGGGACTTGAACAATTCAACTCCATCTACCATACAGGTTTCACCTGTTAACCAGGTCAACCTACCCACAAAAGTCAGGAGTAAGAGAAAGATAGATGCACAGAAACAAGTGATTGGGAAGGATATAAAGTCCTCTGATGGTATTGTGAAGGGAAAATTTAGTGTTCCAGTTAGTTTATTCCATGACAGAGCACTCAATCTGAAGGAAAAGCTTTGTAACTTCCTATGTCCATATCAAGCACGGAGATGGTGTACCTTTGAGTGGTTCTGTAGTACAATTGATTATCCATGGTTTGCTAAAAGGGAGTTTGTGGAGTATTTGGATCATGTAGGATTGGGTCA
TGTTCCAAGATTAACTCGTGTTGAATGGGGTGTCATAAGGAGTTCCCTTGGCAAGCCACGAAGGTTTTCTGAGCAATTTTTGAAGGAAGAAAGAGAGAAGCTTTATCAATATCGGGAATCTGTTAGAACGCATTATGCTGAACTCCGTGCTGGTATTGGTGAAGGACTTCCAACTGATTTAGCTCGACCTCTATCAGTTGGACAGCGTGTTATTGCTATTCATCCAAAAACTAGAGAGATTCATGATGGAAATGTGTTAATTGTTGACCATAGTAGGTACCGGATTCAATTTGACAGCACTGAGCTAGGAGTGGAATCTGTCATGGATATTGATTGTATGGCTTTAAATCCATTGGAAAATTTGCCTGCTTCCCTTGTGAGACAAAATGCTGCTGTCAGGAAATTTTTTGAAAACTACAATGAGCTCAAAATGAACGGGCAGCCAAAAGAAAGCAAGATGGAAGAGAACATCAAATTTGCTCCGTGTGAGGAGAATGCCAATAGTCCCTCTCGAACTTCCCCATCAACTTTCAGTGTTGGCAATTTATCACAACCTGTTAAGGTTGATCCATCAAGTCCTAATTTACAACTTAAAGTTGGGCCTATGGAAACTGTTTATACTCAGCAGGCAGTAAATTCCCAGCTTTCTGCTCTGGCGCTGATACAGGCGAGGGAAGCTGATGTTGAAGCTCTTTCTCAGTTGACTCGTGCTCTTGACAAAAAAGCATTTGCAGGAGGCTGTGGTCTCTGA
Protein Sequence:
- >Thecc1EG021168t3|Theobroma_cacao|MYB_related|Thecc1EG021168t3
MAPSRKSKSVNKKFSYVNEVASSKDGDSSAKRSGQRKRKLSDMLGPQWTKEELERFYEAYRKYGKDWKKVATVVRNRSVEMVEALYTMNRAYLSLPEGTASVVGLIAMMTDHYCVMGGSDSEQESNEGVGASRKPQKRSRGKLRDQPSKSLDKSFPDLLQFHSAASSYGCLSLLKRRRSESRPRAVGKRTPRVPISFSHDKNKGERYFSPIRQGMKLKVDTVDDDVAHEIALVLTEASQRGGSPQVSRTPNRKAEASSPILNSERMNAESETTSAKIHGSEMDEDACELSLGSTEADNADYARGKNYSMNIEGTGTIEVQQKGKRYYRRKPGVEESVNNHLEDTKEACSGTEEDQKLCDFKGKFEAEVADTKPSRGSIKGLRKRSKKVLFGRVEDTSFDALQTLADLSLMMPETAADTESSVQFKEEKNEVVEKTKLKGNHPVSGAKGTAPKTCKQGKVFGHDVRAIPEAKEETHPGNVGMRKRRQKSSPYKLQIPKDETDADSHLGESRNIEALDEVKNFPSKGKRSNNVAHSKQGKSVRPPEHRSSSTDHGRDLNNSTPSTIQVSPVNQVNLPTKVRSKRKIDAQKQVIGKDIKSSDGIVKGKFSVPVSLFHDRALNLKEKLCNFLCPYQARRWCTFEWFCSTIDYPWFAKREFVEYLDHVGLGHVPRLTRVEWGVIRSSLGKPRRFSEQFLKEEREKLYQYRESVRTHYAELRAGIGEGLPTDLARPLSVGQRVIAIHPKTREIHDGNVLIVDHSRYRIQFDSTELGVESVMDIDCMALNPLENLPASLVRQNAAVRKFFENYNELKMNGQPKESKMEENIKFAPCEENANSPSRTSPSTFSVGNLSQPVKVDPSSPNLQLKVGPMETVYTQQAVNSQLSALALIQAREADVEALSQLTRALDKKAFAGGCGL*
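The two records above share one pipe-delimited FASTA header. The sketch below shows one possible way to read such records and run a simple length check between a CDS and its protein. It is an illustration only: the helper names `parse_fasta` and `cds_matches_protein` are hypothetical, the reading of the header as identifier, species, family, identifier is an assumption inferred from the records shown here, and the toy record is made up rather than taken from this gene. The check counts the trailing `*` of the protein as the stop codon.

```python
def parse_fasta(text):
    """Parse FASTA text into a list of (header_fields, sequence) tuples."""
    records = []
    header, chunks = None, []
    for raw in text.splitlines():
        line = raw.strip()
        # Tolerate the "- " list marker this page puts before each header.
        if line.startswith("- >"):
            line = line[2:]
        if line.startswith(">"):
            # A new header closes the previous record, if any.
            if header is not None:
                records.append((header, "".join(chunks)))
            # Assumed layout: id|species|family|id
            header, chunks = line[1:].split("|"), []
        elif line and header is not None:
            # Sequence may be wrapped over several lines; join them.
            chunks.append(line)
    if header is not None:
        records.append((header, "".join(chunks)))
    return records


def cds_matches_protein(cds, protein):
    """Length check only: one codon per residue, '*' counted as the stop codon."""
    return len(cds) % 3 == 0 and len(cds) // 3 == len(protein)


# Toy record in the same header layout (not the real sequence).
toy = "- >GeneX|Some_species|MYB_related|GeneX\nATGGCG\nCCATGA\n"
(fields, seq), = parse_fasta(toy)
print(fields[0], fields[2], len(seq))
print(cds_matches_protein(seq, "MAP*"))
```

Because the parser joins wrapped sequence lines, a CDS split over two lines, as it is above, is read as a single sequence. The length check does not translate codons, so it can flag truncated or frame-shifted records but cannot confirm that the protein is the correct translation.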