•The protein sequences are composed of only 20 standard amino acids, yet there are 64 different codons.
• 61 codons encoding for amino acids plus three stop codons.
So there exists the “degeneracy” of the codons, and codon usage in organisms is not average or random, which leads to codon usage bias(CUB). There are many reasons for the existence of CUB, such as the following.
Origin of the genetic code system.
Mutation, selection, and random drift.
GC content, RNA structure, protein structure, etc.
There are many theories explaining the origin of codons. In general, codons undergo adaptive selection to achieve a required level of optimization that meets protein synthesis requirements throughout the life of an organism.
Affect the translation elongation speed and efficiency.
Affect Initiation(Ramp).
Affect translation fidelity. Regulates co-translation protein folding and protein function.
Determining mRNA level.
Heterologous gene expression (Synthetic biology).
Codon optimization.
Development of attenuated viral vaccines.
Gene therapy.
To study CUB, many scientists have proposed indexes for evaluating it. You can easily calculate the following indexes in cubat.