Genetic Code
| Alanine | Ala | A | GCU, GCC, GCA, GCG |
| Arginine | Arg | R | CGU, CGC, CGA, CGG, AGA, AGG |
| Asparagine | Asn | N | AAU, AAC |
| Aspartic acid | Asp | D | GAU, GAC |
| Cysteine | Cys | C | UGU, UGC |
| Glutamine | Gln | Q | CAA, CAG |
| Glutamic Acid | Glu | E | GAA, GAG |
| Glycine | Gly | G | GGU, GGC, GGA, GGG |
| Histidine | His | H | CAU, CAC |
| Isoleucine | Ile | I | AUU, AUC, AUA |
| Leucine | Leu | L | UUA, UUG, CUU, CUC, CUA, CUG |
| Lysine | Lys | K | AAA, AAG |
| Methionine | Met | M | AUG |
| Phenylalanine | Phe | F | UUU, UUC |
| Proline | Pro | P | CCU, CCC, CCA, CCG |
| Serine | Ser | S | UCU, UCC, UCA, UCG, AGU,AGC |
| Threonine | Thr | T | ACU, ACC, ACA, ACG |
| Tryptophan | Trp | W | UGG |
| Tyrosine | Tyr | Y | UAU, UAC |
| Valine | Val | V | GUU, GUC, GUA, GUG |
| Start | AUG* | ||
| Stop | UAG (amber), UGA (opal), UAA (ochre) |
*AUG is the most common start codon. Alternative start codons include CUG in eukaryotes and GUG in prokaryotes.
| Second Position |
|||||||
| U | C | A | G | ||||
| U | UUU (Phe) UUC (Phe) UUA (Leu) UUG (Leu) |
UCU (Ser) UCC (Ser) UCA (Ser) UCG (Ser) |
UAU (Tyr) UAC (Tyr) UAA (Stop) UAG (Stop) |
UGU (Cys) UGC (Cys) UGA (Stop) UGG (Trp) |
U C A G |
||
| First Position (5’end) |
C | CUU (Leu) CUC (Leu) CUA (Leu) CUG (Leu) |
CCU (Pro) CCC (Pro) CCA (Pro) CCG (Pro) |
CAU (His) CAC (His) CAA (Gln) CAG (Gln) |
CGU (Arg) CGC (Arg) CGA (Arg) CGG (Arg) |
U C A G |
Third Position (3’end) |
| A | AUU (Ile) AUC (Ile) AUA (Ile) AUG (Met/Start) |
ACU (Thr) ACC (Thr) ACA (Thr) ACG (Thr) |
AAU (Asn) AAC (Asn) AAA (Lys) AAG (Lys) |
AGU (Ser) AGC (Ser) AGA (Arg) AGG (Arg) |
U C A G |
||
| G | GUU (Val) GUC (Val) GUA (Val) GUG (Val) |
GCU (Ala) GCC (Ala) GCA (Ala) GCG (Ala) |
GAU (Asp) GAC (Asp) GAA (Glu) GAG (Glu) |
GGU (Gly) GGC (Gly) GGA (Gly) GGG (Gly) |
U C A G |
| A | Adenine |
| C | Cytosine |
| G | Guanine |
| T | Thymine |
| U | Uracil |
| B | C, G, or T |
| D | A, G, or T |
| H | A, C, or T |
| K | G or T |
| M | A or C |
| N | A, T, C, or G |
| R | A or G |
| S | C or G |
| V | A, C, or G |
| W | A or T |
| Y | C or T |
| Tag | Protein Sequence | DNA Sequence* |
|---|---|---|
| FLAG | DYKDDDDK | GAC TAC AAA GAC GAT GAC GAC AAG |
| HA | YPYDVPDYA | TAC CCA TAC GAT GTT CCA GAT TAC GCT |
| His | HHHHHH | CAC CAC CAC CAC CAC CAC |
| Myc | EQKLISEEDL | GAA CAA AAA CTC ATC TCA GAA GAG GAT CTG |
| V5 | GKPIPNPLLGLDST | |
| Xpress | DLDDDDK or DLYDDDDK | |
| Thrombin | LVPRGS | |
| BAD (Biotin Acceptor Domain) | GLNDIFEAQKIEWHE | |
| Factor Xa | IEGR or IDGR | |
| VSVG | YTDIEMNRLGK | |
| SV40 NLS | PKKKRKV or PKKKRKVG | |
| Protein C | EDQVDPRLIDGK | |
| S Tag | KETAAAKFERQHMDS | |
| OneStrap | SAWSHPQFEK2GGSAWSHPQFEK | |
| SB1 | PRPSNKRLQQ |
*Due to the degenerate nature of the genetic code, the given sequence is one of a number of sequences that can encode the same epitope tag.