Information report for Thecc1EG026624t2
Gene Details
|
|
Functional Annotation
- Refseq: XP_017976648.1 — PREDICTED: protein ALWAYS EARLY 2 isoform X2
- Swissprot: Q6A333 — ALY2_ARATH; Protein ALWAYS EARLY 2
- TrEMBL: A0A061F434 — A0A061F434_THECC; DIRP,Myb-like DNA-binding domain, putative isoform 2
- STRING: EOY11452 — (Theobroma cacao)
- GO:0005654 — Cellular Component — nucleoplasm
- GO:0003677 — Molecular Function — DNA binding
Family Introduction
- A novel myb-like gene (AtmybL2) was isolated from an Arabidopsis thaliana cDNA library. The single copy gene was localised on chromosome I. A gene specific transcript is preferentially found in leaves. The predicted gene product consists of a conservative N-terminal myb-domain known to be involved in DNA-binding and a unique proline-rich C-terminal part. Remarkably, the myb-domain includes only one of the typical two or three tryptophan repeats found in other myb-like proteins.
Literature and News
Gene Resources
Homologs
- Gossypium arboreum: Cotton_A_10975_BGI-A2_v1.0, Cotton_A_27916_BGI-A2_v1.0
- Gossypium hirsutum: Gh_A11G2639, Gh_A13G0889, Gh_Sca005054G02, Gh_D13G1130
- Ziziphus jujuba: XP_015875086.1, XP_015875085.1
Sequences
CDS Sequence:
- >Thecc1EG026624t2|Theobroma_cacao|MYB_related|Thecc1EG026624t2
ATGGCGCCAACAAGAAAATCTAAAAGTGTGAACAAACGATATTCGAGTGTATATGAAGTCTCTCCTGATAAAGATGCCGGTAATTCAAGCAAAAATAAGCCAAAGAAGAAATTAGCTGACAAGTTAGGATCTCAATGGAGCAAGGAAGAAATTGAGCGTTTTTATAAAGCTTATCGAGAGTATGGGAAAGATTGGAAGAAGGTGGCTGCTGCTGTGCATAATAGATCCACTGAAATGGTGGAGGCCCTTTACCTTATGAACCGGGCATATTTATCCCTGCCAGATGGAACAGCTTCTGTGATTGGCCTTATAGCAATGATGACCGATCATTATAGTGTCCTGAGAGGGAGCGATTGTGAAAGAGAGAGTAATGAGCCTTCTGAAATACCCCAGAAAGCTCAAAAGCGCAAGCGGGCAAAGGTTCATCTTGGTACCTCAAAAGAAGGTGTTGTACAGCCTCAATCAATTGCATCTAGTCAGGGATGCCTATCTTTGTTGAAGAGGGCAGGCCTTAATGGTATTCATCCTCATGCAGTAAGGAAAAGAACTCCTCGGGTTCCTGTTTCATATTCATATAGGAGAAATGACACTGAAAGTTACATTCCACCAAACAAAAGAGTTAAGAAGTCAGACGCTGATGATAATGATGCTGAACATGTTGCCGCATTGACGTTGACTGGGGCATTGCAAAGGGGAGGCTCCCCTCAGGTTTCTCAGACACCTTACAAAAGAGCTGAATGCAGAAGATCCTCACCCGTTCAGAGCTATGATAGGACGTCACCACAACCGGAGACCACTAAGGCCAAGCTTGATGATTCTTCCTACGAATGCTGGATGGAAGGCAGGCCTAGGGGCACGGAACCTGTAATTGGAACTCATGCCAGAGATGCAGACCCCTTGATGGATATGGAAGTTGTTGGTACCATTGAAGGTCATCGAAAGGGAAAGAAATTTTACAGGAAGAAAATGAAAGTTGAAGAAACCAAGAACAATCTCTCTGATGATGGTGGAGAAGCGTGTAGTGGCACTGAAGAAAGAATTAGAGGTAGTACTCTCAAAGGGAAAGTTGATATGGAGATTACCAGTGCAAAAAGTGAACAACTTTCACCATGGAGTCAGAGGAAGAGAAGTAACAAGAAGCTTGTCTTCGGAGATGAAAGCTCTTCCATTGATGCTCTGCTGACATTAGCCAATTTGTCAACGTCAATGTTGCCAACATCAATAATGGAATCTGAATCATCTGTCAAATTGAAAGAGAATAGAATTACACTTGAATCTGTTGACAAGTCTAGTGCCCCTGAAGCTGCATCTACAAGTCATCACAGAGATAATATTAAGCACCTACGGCCAAACGAAAAGGTGCTCGACTCAATCACTGGTGCAGAGGAAGCTACCACTAGGAAACTAAAAGTTGGAAGGAATTCAGCTATGGATGATAATGTTGTTTCTGAGGCAAAACAAAAGCCAGAACCTACCAATAACTCTTGGAAAAGAAAACGCAAATCCTTCAGTTCAAAGCTGCAGATTTCTAATGCAGAAGCTTCAATGGATTCTCATCTCCAACAATCTTTTGATAATGAGGACATGGGTGAAGAAGACAATAAATATCTCACTAAAGGTAAATGTGGTGCTCAATCTTCTGTTCAATCAAGACAATGGAAGTCATTCAGAGTGTCAGAGGATTCCTCTACTAATGATGATCCAAAAATGGCTGGAATTGATTCAGTGGTGTTGACTTCACAAGTTCCTGCACCAAACCCTGTTAGCGTACCACCTAAGCATCAAAGTAGACGTAAAATGAACCTGAGGAGAGCCTTCCTTTCAACAGATAGAAGTTCTTCCAAGTGCACATTGAAAAATCAACCAATCAAGCAGTCTGTCACACAAGACAGACTAAAGGAACAGCTCTCTTCCTGCCTATCATCTAATCTGGCACGAAGATGGTGCAGTTTTGAATGGTTTTACAGTGCTATTGATTATGCTTGGTTTGCTAAAAGGGAGTTTGTTGAGTACCTAAATCATGTCGGACTGGGTCATGTTCCAAGGCTTACTCGTGTTGAGTGGGGTGTCATAAGAAGTTCCCTTGGCAAACCTCGGAGGTTTTCTGAACGCTTTTTACATGAAGAAAGGGAAAAACTTAAACATTATCGGGAGTCTGTGAGACAACATTATTCTCAGCTTCGCGTTGGTGCTAGGGAAGGACTTCCAACGGATCTGGCATATCCTTTATCAGTTGGACAACAAGTAATTGCCATTCATCCCAAAACGAGGGAAGCTCATGATGGAAAAGTACTTACTGTGGACCATGATAGGTGCAGGGTTCAGTTTGATAGTCCTGAACTAGGGGTTGAATTTGTCATGGATATTGATTGCATGCCATTAAATCCGTTGGAAAATATGCCGGAAGCACTTAGGAGACAGAACCTTGCTTTTGATAAATTCTCTGTGACACCTAAACCGTCTCAAGTGAATAGCCATTCAGATTTTGGTGGGTCCACGGTATTCACTTCAAGTGGGCATCTGGAGAATGGAACCAGCCCTGTGAACATGTCGGCAAATCAGATAAAGGTGGATGCCAACCGTAACATTTTGCATGCTGAGGCAGCTGTTCCTTATGTTGTTAGTGCACATCAAGCAGCCTATGGTCAACCACTTACCATGGCACATATCAAAGGGAGGGAAACTGATACACGAGCTATGTCTGAATTGAACGGTGCTCTTGACAAAAAGGAAGCTTTATTGATGGAGCTCAGAAACACGAACAATGACATATCAGAAAATCAAAATGGAGAAAGTTGTTTAAAAGATTCTGAACCTTTCAAGAAGCATATTGCCACGGCTTCTTCTGCTTTAGTTAACTTGAGACAACGAAATGCTTACCCAGCAAACCCCCTGTCACCTTGGCAGAAACCCCCAACCAATTCCAACTTCTTTGGTGGCTTGAAAAGTTATGTTGACAGTTCTCTTGTCTCACCAGAATCAGGATCTGGTGTGGGTGAAATTGTTCAAGGCTCAAGACTAAAGGCGCATGCTATGGTGGATGCTGCTATGAAGGCCATGTCATCAATGAAGGAAGGCGAAGATGCATTTATGAGGATTGGAGAAGCTTTGGACTCTTTAGATAAACGGCAATTCACATATGACATTAGGATGCCGGTGATCAAGTCACGAGAGCAGGAGAATGGCAGTATGGATTATCGCAATCACTTGGTTTCCTGTACATCAAAACCGGTGGCTGCCGGTTGGGCAACTAATCCCAAGTCGCAGGAGGCTTCTGACAAAAACGAGGAACAAGGTCCTTCAGAGCTGATCGCATCATGTGTTGCTACTTTGCTCATGATACAGACATGTACAGAGCGACAATATCCGCCAGCAGACGTGGCTCAAATAATCGATTCAGCTGTTACAAGCTTGCATCCATGTTTTCCCCAGAACCTGCCAATTTACCGAGAAATACAAATGTGCATGGGGAGGATTAAGACTCAAATATTAGCTTTGATACCCACTTGA
Protein Sequence:
- >Thecc1EG026624t2|Theobroma_cacao|MYB_related|Thecc1EG026624t2
MAPTRKSKSVNKRYSSVYEVSPDKDAGNSSKNKPKKKLADKLGSQWSKEEIERFYKAYREYGKDWKKVAAAVHNRSTEMVEALYLMNRAYLSLPDGTASVIGLIAMMTDHYSVLRGSDCERESNEPSEIPQKAQKRKRAKVHLGTSKEGVVQPQSIASSQGCLSLLKRAGLNGIHPHAVRKRTPRVPVSYSYRRNDTESYIPPNKRVKKSDADDNDAEHVAALTLTGALQRGGSPQVSQTPYKRAECRRSSPVQSYDRTSPQPETTKAKLDDSSYECWMEGRPRGTEPVIGTHARDADPLMDMEVVGTIEGHRKGKKFYRKKMKVEETKNNLSDDGGEACSGTEERIRGSTLKGKVDMEITSAKSEQLSPWSQRKRSNKKLVFGDESSSIDALLTLANLSTSMLPTSIMESESSVKLKENRITLESVDKSSAPEAASTSHHRDNIKHLRPNEKVLDSITGAEEATTRKLKVGRNSAMDDNVVSEAKQKPEPTNNSWKRKRKSFSSKLQISNAEASMDSHLQQSFDNEDMGEEDNKYLTKGKCGAQSSVQSRQWKSFRVSEDSSTNDDPKMAGIDSVVLTSQVPAPNPVSVPPKHQSRRKMNLRRAFLSTDRSSSKCTLKNQPIKQSVTQDRLKEQLSSCLSSNLARRWCSFEWFYSAIDYAWFAKREFVEYLNHVGLGHVPRLTRVEWGVIRSSLGKPRRFSERFLHEEREKLKHYRESVRQHYSQLRVGAREGLPTDLAYPLSVGQQVIAIHPKTREAHDGKVLTVDHDRCRVQFDSPELGVEFVMDIDCMPLNPLENMPEALRRQNLAFDKFSVTPKPSQVNSHSDFGGSTVFTSSGHLENGTSPVNMSANQIKVDANRNILHAEAAVPYVVSAHQAAYGQPLTMAHIKGRETDTRAMSELNGALDKKEALLMELRNTNNDISENQNGESCLKDSEPFKKHIATASSALVNLRQRNAYPANPLSPWQKPPTNSNFFGGLKSYVDSSLVSPESGSGVGEIVQGSRLKAHAMVDAAMKAMSSMKEGEDAFMRIGEALDSLDKRQFTYDIRMPVIKSREQENGSMDYRNHLVSCTSKPVAAGWATNPKSQEASDKNEEQGPSELIASCVATLLMIQTCTERQYPPADVAQIIDSAVTSLHPCFPQNLPIYREIQMCMGRIKTQILALIPT*