Information report for Thecc1EG029045t1
Gene Details
Functional Annotation
- Refseq: XP_017978893.1 — PREDICTED: beta-galactosidase 17
- Swissprot: Q93Z24 — BGA17_ARATH; Beta-galactosidase 17
- TrEMBL: A0A061GB64 — A0A061GB64_THECC; Beta-galactosidase
- STRING: EOY27120 — (Theobroma cacao)
- GO:0005975 — Biological Process — carbohydrate metabolic process
- GO:0016021 — Cellular Component — integral component of membrane
- GO:0003676 — Molecular Function — nucleic acid binding
- GO:0004565 — Molecular Function — beta-galactosidase activity
- GO:0046872 — Molecular Function — metal ion binding
Family Introduction
- Genes encoding C2H2 zinc-finger domains have been characterized in a wide variety of eukaryotes, including plants. The canonical zinc-finger (ZF) sequence, CX2-4CX3FX5LX2HX3-5H, contains two cysteines and two histidines that coordinate a zinc atom, forming a compact nucleic acid-binding domain. Most such proteins characterized to date are DNA-binding transcription factors, and many have been shown to play crucial roles in the development of plants, animals and fungi.
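The canonical pattern quoted above can be written as a regular expression and used to scan a protein sequence. The following is a minimal sketch, not the method this database uses for family assignment: the function name `find_c2h2_fingers` is hypothetical, and `X` is assumed to mean any single residue. Real C2H2 fingers often deviate from the consensus, so a strict match like this will miss divergent fingers.

```python
import re

# CX2-4CX3FX5LX2HX3-5H, with X treated as "any residue" (an assumption).
C2H2_CANONICAL = re.compile(r"C.{2,4}C.{3}F.{5}L.{2}H.{3,5}H")


def find_c2h2_fingers(protein):
    """Return (start, end, text) for each non-overlapping canonical match.

    Positions are 0-based and the end is exclusive, as in Python slicing.
    """
    return [(m.start(), m.end(), m.group())
            for m in C2H2_CANONICAL.finditer(protein)]


# Short fragment copied from the protein sequence in this report.
fragment = "ATNRFVCEICNKGFQRDQNLQLHRRGHNLPWKLK"
hits = find_c2h2_fingers(fragment)
for start, end, text in hits:
    print(start, end, text)
```

On this fragment the pattern matches the stretch CEICNKGFQRDQNLQLHRRGH, in which the two cysteines and two histidines sit at the spacing given by the consensus.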
Gene Resources
Homologs
- Actinidia chinensis: Achn198751
- Citrus sinensis: orange1.1g007898m
- Gossypium arboreum: Cotton_A_05439_BGI-A2_v1.0, Cotton_A_18922_BGI-A2_v1.0
- Gossypium hirsutum: Gh_A05G1117, Gh_A10G0519, Gh_D05G1287, Gh_D10G0557
- Lotus japonicus: Lj3g3v0323160.2, Lj3g3v0323160.1
- Manihot esculenta: Manes.04G140300.1.p
- Populus trichocarpa: Potri.003G040100.1, Potri.003G036900.5, Potri.003G036900.4, Potri.003G036900.2, Potri.003G036900.1
Sequences
CDS Sequence:
- >Thecc1EG029045t1|Theobroma_cacao|C2H2|Thecc1EG029045t1
ATGATGAAAGGTTTGTTATTCCACGATCAACAGCAACAAGTTCTGGAAGAGAATATGTCAAATTTGACTTCTGCATCTGGTGAAGCAAGTGTTTCTTCAGGCAATAGAGCCGAAGCTGCCACCAACTATCCTCAACAATACTTTAGTACTCCACCACCTGAAACTCAGCCAGCTAAGAAAAAGAGAAACCTGCCAGGCAACCCAGACCCAGATGCAGAAGTGATAGCTTTGTCTCCTAAAACACTCATGGCAACAAATAGATTTGTGTGTGAGATCTGCAACAAAGGGTTTCAAAGAGACCAGAACCTCCAGCTCCACAGAAGAGGGCATAACTTGCCGTGGAAGTTAAAGCAAAGAACAAGTAAAGAGGTGAGGAAGAAGGTGTATGTCTGTCCAGAACCCAGCTGTGTGCATCATGACCCATCAAGGGCCCTTGGGGACTTGACAGGAATCAAGAAGCATTTTTGCAGAAAGCATGGTGAGAAGAAATGGAAATGTGATAAGTGTTCAAAAAGGTACGCAGTTCAATCGGATTGGAAGGCTCACTCCAAGACTTGTGGCACTAGAGAATACAGATGTGACTGTGGAACCCTCTTCTCAAGGAGAGATAGCTTCATCACTCACAGAGCCTTCTGTGATGCTTTAGCAGAGGAGAGCGCAAGAGCAATCACGGGAGCTAACCCACTTCTCTCCTCGCATCAACCAGGAGCATCAGCATCTCACATTAATTTACAAGTTCCCCAATTCAATGCCCAAGACATACAAGCATTTTCACTTAAGAAAGAGCAACAAAGTTTCAGTCTAAGGCCAGAGATTCCTCCATGGCTCTCTAGCCAACCAATGCTAGGGGCTGGTCCGGGCCCACCACCACAGCCTATAGATCTTTCCTCATCATCATCATCAATCTTCTCCGCAAGATTAGATCATCATCATCAAGAATTCACACAAACAACACACCATCAGGACTTAACACATCATGTAAACCCCAACCCTAACCCTACTAGTCTTGGCCCCACTCTTCCTGCCTACCATCCAACAACAGTACCATCCCCACACATGTCAGCAACTGCATTACTTCAGAAAGCAGCCCAGATGGGTGCAACCATGAGCAGCAAAACTGGCTCATCATCAGCACCAGCTACTGCTGCAGCTGCCTCCTTGATCAGACCCCACCAACAAGCTCACGTGTCTGCTGATTCTGCTGGCAGTAACAATAACACAACAACAGCTGTTTTTGGCCTCAACTTGTCTTCACGTGAAGAACTGGCTGTGATCATGCTGGCTCAGACGAAGGATGGGAGGTTCATAAATGAAACATTCTCATCAACAACAACTACACCAACAACAACGACAAATGCTGCTGCCGCCGCTAGGAATGATCACGAAACTGGCGGTATTCAAGGTGAAGGCTTGACGAGAGATTTCTTGGGTCTTAGAGCTTTCTCTCATAGCGATATTCTCAATCTTGCTGGTCTTAGTAACTGCATGAACACTTCGCATGAACAACGCAATCAGTCGCAAAAACCATGGCAAGTTGTTGATTCACCAAAATTTGAAGCAGTCCATATAGCTCTACTTGCTGCACCATCTGCGGGGACAGGGTCTGGGTACTTTTACTTGCCAGGAACTACTGATAGCTTAAAAGCAGAAGAAATGAATCACAGTAAAAAGCTAGCTGCAACGGAGTCAACCTCCAAATCAACTGCGGCGATGGCGAGAAAGCGAAGCAGCAAAACGACGTTGATCTTCTTCGTTTTGCTTTCCATTGTAGCTTTCGTTGCTTTCGTTCCCGTCTTCGCTTCTCTCCCTTCTCTTTCCTCTCACTCTCACGATCTTCACCTCCATCTTCGTCTTCATCAGCGTCAGCATCGTCTCGAAAAAAGTGATGCTAGAAAGTTTGAAATTGCGGAGGATATGTTTTGGAAAGATGGAAAGCCTTTTCAGATAATTGGTGGTGACTTGCATTATTTCCGCATTCTTCCTGAGTACTGGGAAGATAG
GCTTTTAAGAGCAAAAGCACTGGGACTGAATACCATTCAAACTTATATTCCTTGGAATTTGCATGAACCAGAGCCTGGCAAACTGGTTTTTGAAGGCATTGCAGATCTAGTATCATTTCTCAAACTTTGCCAGAAGTTAGGTCTCCTTGTTATGCTTCGAGCTGGGCCTTATATTTGTGCAGAGTGGGATCTTGGAGGATTCCCAGCTTGGTTACTTGCCATAGAACCAGATATCAGACTAAGGTCATCAGATCCTGCTTACCTCCAATTGGTTGAAGGATGGTGGGGAGTCCTACTTCCAAAAGTAGCTCCTCTTCTTTATGGTAATGGAGGTCCTATTATAATGGTGCAGATAGAAAATGAATTTGGGTCATATGGAGATGATAAAGCTTATCTTCGTCACCTGGTGAAGTTGGCTAGAGGACATCTTGGGGAAGACATTATTTTGTATACTACAGATGGAGGTTCTCGAGAAACTCTTGAAAAAGGAACCCTTGTAGGAGATGATGTCTTTTCCGCTGTTGACTTCACTACTGGGGATGATCCTTGGCCCATATTTGAGTTACAAAAGGAGTTCAATTCCCCTGGGAAATCACCACCACTTTCTTCGGAGTTTTATACAGGTTGGCTTACACATTGGGGTGAGAAGATTGCAAGGACAGATGCAGATTTTACCGCAGCTGCCTTGGAAAAGATTTTGTCACGAAATGGTTCTGTCGTGCTTTATATGGCACATGGTGGAACAAACTTTGGATTTTATAATGGGGCAAATACAGGTGCTGATGAGTCAGATTACAAGCCTGATCTAACTTCCTATGATTATGATGCGCCAATTACGGAGTCTGGTGATGTGGACAATGCAAAATTCAAAGCCATAAGGAGAGTGGTGGGGAAATATAGTTCAGTATCTCTTCCTTCATTTCCTTCCAGTAATAAAAAGACAGGATATGGTTTTATCCAGTTACAAAAAACAAGAAGTTTATTTGATTTACTTGATGGGTTTGATTCTGCACACATTGTTGAAGCTGAAAATCCAACTGCAATGGAGTATTTCTACCAGATGTTTGGATTTCTATTATATGTATCTGAATATGCATCGAAAGCTGGTGGAAATAAGCTATTTATACCAAAGGTGCATGACAGAGCTCAAGTGTTCATATCATGCCCTTCTAGAGCTGATGGTGGACGAGTATCATATGTTGGTACAATTGAAAGATGGTCAAATCAAGCAATTTACCTTCCTAATGCTAAATGTGTTTCTAACACCAGCTTATTTATTTTGGTTGAAAACATGGGCCGTGTAAATTATGGACCATACTTGTTTGACAGGAAGGGAATTTTGTCTTCTGTTTATGTAGATGGGAGAGTTTTGAACAGATGGAAAATGATCCCAATTCCTTTCCAAAACCTGAATGAGGTGCCAAAGTTCAATCCTGTCATTCAAGTTGCATCTGAATTCCCTAAAGTATCCATCCGCAAAAAGTTAGAGCACAAGTCAGAGGATGTTTTAGAAGGACCATCATTCTACACTGGTCATTTCTCTATTGATAAAACTAGTGAAGTTACAGATACATTCATTTCGTTTAGAGCCTGGGGTAAAGGGATTGCTTTTGTTAATGAATTCAACATCGGAAGATATTGGCCAACTTCAGGACCACAATGCAACCTTTATATCCCTGCTCCAATCCTTCGGCATGGGGAAAATGTTTTGGTGATATTCGAGTTAGAATCACCAAACCCTGAGCTTGTGGTTGATTCAGTTGATCAGCAAGATTTCAATTGTGGATCAAGTAAAGCAAGTGTGCGTCAACTTTAA
Protein Sequence:
- >Thecc1EG029045t1|Theobroma_cacao|C2H2|Thecc1EG029045t1
MMKGLLFHDQQQQVLEENMSNLTSASGEASVSSGNRAEAATNYPQQYFSTPPPETQPAKKKRNLPGNPDPDAEVIALSPKTLMATNRFVCEICNKGFQRDQNLQLHRRGHNLPWKLKQRTSKEVRKKVYVCPEPSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKRYAVQSDWKAHSKTCGTREYRCDCGTLFSRRDSFITHRAFCDALAEESARAITGANPLLSSHQPGASASHINLQVPQFNAQDIQAFSLKKEQQSFSLRPEIPPWLSSQPMLGAGPGPPPQPIDLSSSSSSIFSARLDHHHQEFTQTTHHQDLTHHVNPNPNPTSLGPTLPAYHPTTVPSPHMSATALLQKAAQMGATMSSKTGSSSAPATAAAASLIRPHQQAHVSADSAGSNNNTTTAVFGLNLSSREELAVIMLAQTKDGRFINETFSSTTTTPTTTTNAAAAARNDHETGGIQGEGLTRDFLGLRAFSHSDILNLAGLSNCMNTSHEQRNQSQKPWQVVDSPKFEAVHIALLAAPSAGTGSGYFYLPGTTDSLKAEEMNHSKKLAATESTSKSTAAMARKRSSKTTLIFFVLLSIVAFVAFVPVFASLPSLSSHSHDLHLHLRLHQRQHRLEKSDARKFEIAEDMFWKDGKPFQIIGGDLHYFRILPEYWEDRLLRAKALGLNTIQTYIPWNLHEPEPGKLVFEGIADLVSFLKLCQKLGLLVMLRAGPYICAEWDLGGFPAWLLAIEPDIRLRSSDPAYLQLVEGWWGVLLPKVAPLLYGNGGPIIMVQIENEFGSYGDDKAYLRHLVKLARGHLGEDIILYTTDGGSRETLEKGTLVGDDVFSAVDFTTGDDPWPIFELQKEFNSPGKSPPLSSEFYTGWLTHWGEKIARTDADFTAAALEKILSRNGSVVLYMAHGGTNFGFYNGANTGADESDYKPDLTSYDYDAPITESGDVDNAKFKAIRRVVGKYSSVSLPSFPSSNKKTGYGFIQLQKTRSLFDLLDGFDSAHIVEAENPTAMEYFYQMFGFLLYVSEYASKAGGNKLFIPKVHDRAQVFISCPSRADGGRVSYVGTIERWSNQAIYLPNAKCVSNTSLFILVENMGRVNYGPYLFDRKGILSSVYVDGRVLNRWKMIPIPFQNLNEVPKFNPVIQVASEFPKVSIRKKLEHKSEDVLEGPSFYTGHFSIDKTSEVTDTFISFRAWGKGIAFVNEFNIGRYWPTSGPQCNLYIPAPILRHGENVLVIFELESPNPELVVDSVDQQDFNCGSSKASVRQL*
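The CDS and protein records above can be cross-checked by translating the CDS with the standard genetic code. The sketch below is illustrative only: the function name `translate` is hypothetical, and the standard nuclear code is assumed. Whitespace is removed before translation, so a sequence wrapped across several lines, as the CDS record is here, is read as one continuous string.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {"".join(codon): aa
               for codon, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds):
    """Translate a CDS in frame 1; unknown codons become 'X', stops '*'."""
    seq = "".join(cds.split()).upper()   # drop line breaks and spaces
    usable = len(seq) - len(seq) % 3     # ignore a trailing partial codon
    return "".join(CODON_TABLE.get(seq[i:i + 3], "X")
                   for i in range(0, usable, 3))


# First 27 nucleotides and last 15 nucleotides of the CDS in this report.
print(translate("ATGATGAAAGGTTTGTTATTCCACGAT"))
print(translate("GTGCGTCAACTTTAA"))
```

Comparing `translate(full_cds)` with the protein record, including the trailing `*`, gives a quick consistency check between the two sequences.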