IUPAC codes

The IUPAC Code is a set of symbols encoding each subset of the four nucleotides. Here is the complete list of these symbols together with the subsets they encode.
CodeCorresponding className
A-Adenine
C-Cytosine
G-Guanine
T-Thymine
U-Uracil
R[GA]purine
Y[TC]Pyrimidine
K[GT]Keto
M[AC]Amino
S[GC]?
W[AT]?
B[GTC]?
D[GAT]?
H[ACT]?
V[GCA]?
N[ACGT]Any

Gregory Kucherov
Last modified: Wed Jul 26 15:38:31 MEST 2000