Ligand Expo Downloads
Chemical component dictionaries
- mmCIF format dictionary (text |
gzip)
- mmCIF format variants dictionary (text |
gzip)
- SDF/MOL format file of dictionary ideal geometries (gzip)
- PDBML format dictionary (tar.gz)
- PDBML format variants dictionary (tar.gz)
SMILES/InChI data files:
Chemical component coordinate data files1:
- Experimental coordinates for non-polymer molecules and non-standard amino acids and
nucleotides are bundled in single data files (SDF and mmCIF) or tar file bundles (PDB and PDBML/XML)
(SDF/MOL (gz) (>45MB) |
mmCIF (gz) (>98MB) |
PDBML/XML (tar.gz) (>160MB) |
PDB (tar.gz) (>104MB)
)
Downloading experimental coordinates for a particular chemical component1:
Since all chemical component dictionary entries follow a specific
naming convention, you can easily skip searching and download the
coordinates by entering the correct URL. The URL template is:
-
http://ligand-expo.rcsb.org/files/<HASH>/<CC_ID>/<FORMAT>/
where:
- <HASH>
-
- is the first character of the component identifier code,
- <CC_ID>
-
- is the 3-letter-code component identifier,
- <FORMAT>
-
- is one of icif, isdf, ipdb, ixml, or ccif
for mmCIF, SDF/MOL, PDB, PDBML/XML, or mmCIF definition formats, respectively.
For example, all experimental coordinate files in SDF/MOL format for ATP can
be found in the URL:
http://ligand-expo.rcsb.org/files/A/ATP/isdf/
[1] Notes of files names: to uniquely identity experimental coordinate files the following
underscore delimited coding scheme has been used:
PDB ID _ Component ID _ Model No. _ Chain ID _ Residue No. _ mmCIF Sequence No. _ mmCIF Asym ID _ Disorder Flag _
For example, the file name: 1pk8_ATP_1_B_801__R_D_.icif
| Identifier | Value |
| PDB ID | 1PK8 |
| Component/residue code | ATP |
| Model number | 1 (default) |
| Chain ID | B |
| Residue number | 801 |
| mmCIF sequence number | not applicable |
| mmCIF Asym ID | R |
| Disorder Flag | D (modeled with disorder) |