glycosylation
Manual
This is the manual for GLYCOSYLATION, a program to be used in conjunction with SASREF for rigid body modeling of glycosylated structures. Glycan groups are added onto PDB structures for improved modeling against experimental scattering data of glycosylated proteins.
Introduction
GLYCOSYLATION adds glycan groups to asparagine (ASN) or serine (SER) residues of protein structures. The glycan groups are selected from a database of 50 precomputed structures found on glycosylated proteins ranging from 320 to 6820 Da. The structures were generated using Sweet2 which models the 3D structure of saccharides. The program can be run automatically, where the residues and glycan structures are determined by the program, given the number of glycans to be added and the total molecular weight of the glycans. Alternatively, expert mode can be used to specify exactly which glycan goes to which residue. This mode offers no detection of buried residues. NOTE: Only ASN residues are considered in the automatic mode.
Running
Usage:
$ glycosylation [OPTIONS] <MODEL>
Here, the MODEL is the protein structure in PDB or mmCIF format to be glycosylated.
Command-Line Arguments and Options
GLYCOSYLATION recognizes the following command-line options. Mandatory arguments to long options are mandatory for short options too.
Short Option | Long Option | Default | Description |
---|---|---|---|
--seed=<INT> | Set the seed for the random number generator | ||
-k | -keepASN | False | (optional) whether to keep the ASN residue from the glycan structure. This means there will be an additional amino acid residue to the corresponding protein residue. This is meant to be used for modeling the glycan as a separate rigid body. |
-n | -nsugars<number of glycans> | N/A | The number of glycans to be added to the structure. |
-m | -mw<total molecular weight> | N/A | The total molecular weight of the glycan groups to be added. |
-e | -expert | N/A | Run in expert mode. NOTE: only the number of glycans needs to be specified at the command line, not the MW. |
-o | --output=<FILE> | write output to FILE. The output format depends on the FILE extension, supported are PDB and mmCIF format. | |
-v | -version | N/A | Print version information and exit. |
-h | --help | N/A | Print a summary of arguments, options, and exit. |
Output files
The output file contains the atoms in PDB or mmCIF format with a new header. The header is formatted as follows:
Parent PDB file: <input pdb file>
Number of glycans requested: <number of glycans>
Total molecular weight requested: <total molecular weight>
Keep ASN residues from glycans <T or F>
Glycan ASN chain Glycan ID name MW
-----------------------------------------------------------
1 60G B 6 complex_5_1 1141.0
2 78 F 28 hm_6_1 1155.0
3 204B F 8 complex_6_2 1141.0
4 78 H 27 hm_5_2 992.9
Total molecular weight of glycans added: 4429.8
Examples
Automatic
Glycosylate 1XMN.pdb with 4 glycans about 1000 Da each.
$ glycosylation 1XMN.pdb gly_1XMN.pdb -n 4 -mw 4000
Glyco data successfully loaded.
input PDB FILE: 1XMN.pdb
output PDB FILE: gly_auto_1XMN.pdb
IGNORED 1XMN.pdb A (less than 5 residues)
Found ( 28 ) non-buried ASN residues out of ( 42 ) ASN residues.
Calculated approximately 1000.0000000000000 Daltons per glycosylation.
Found 7 glyco structures available for glycosylation.
CA ASN B 60G
Placing glycan complex_5_1 on residue CA ASN B 60G
CA ASN F 78
Placing glycan hm_6_1 on residue CA ASN F 78
CA ASN B 62
WARNING: Glyco molecule moved more than 8 Angstroms. Could be a missed buried residue. Reinitializing with another ASN residue.
CA ASN H 143
WARNING: Glyco molecule moved more than 8 Angstroms. Could be a missed buried residue. Reinitializing with another ASN residue.
CA ASN B 78
WARNING: Glyco molecule moved more than 8 Angstroms. Could be a missed buried residue. Reinitializing with another ASN residue.
CA ASN H 60G
WARNING: Glyco molecule moved more than 8 Angstroms. Could be a missed buried residue. Reinitializing with another ASN residue.
CA ASN B 95
WARNING: Glyco molecule moved more than 8 Angstroms. Could be a missed buried residue. Reinitializing with another ASN residue.
CA ASN F 60G
WARNING: Glyco molecule moved more than 8 Angstroms. Could be a missed buried residue. Reinitializing with another ASN residue.
CA ASN B 147D
WARNING: Glyco molecule moved more than 8 Angstroms. Could be a missed buried residue. Reinitializing with another ASN residue.
CA ASN F 204B
Placing glycan complex_6_2 on residue CA ASN F 204B
CA ASN B 159
WARNING: Glyco molecule moved more than 8 Angstroms. Could be a missed buried residue. Reinitializing with another ASN residue.
CA ASN H 78
Placing glycan hm_5_2 on residue CA ASN H 78
Pdb file written: gly_1XMN.pdb
Expert
Glycosylate 1XMN.pdb with 4 glycans and run in expert mode.
$> glycosylation 1XMN.pdb gly_1XMN.pdb -n 4 -e
Entering expert mode.
Glyco data successfully loaded.
input PDB FILE: 1XMN.pdb
output PDB FILE: gly_1XMN.pdb
Found ( 105 ) non-buried ASN residues out of ( 105 ) ASN residues.
Choose from the following ASN and SER residues:
< 1> CA SER B 20
< 2> CA SER B 27
< 3> CA SER B 36A
< 4> CA SER B 45
< 5> CA SER B 48
< 6> CA ASN B 60G
< 7> CA ASN B 62
< 8> CA SER B 72
< 9> CA ASN B 78
< 10> CA SER B 83
< 11> CA ASN B 95
< 12> CA ASN B 98
< 13> CA SER B 115
< 14> CA SER B 129B
< 15> CA ASN B 143
.
.
.
<105> CA SER H 214
Enter residue choices one at a time:
6
7
37
51
For each of the chosen ASN residues, choose a
sugar by first giving molecular weight range
then selecting from the available structures.
Enter approximate MW (Da):
1000
Found 7 glyco structures available for glycosylation.
Structure < 1>
Molecular weight: 1141.0
Name:complex_5_1
A-D-MANP-(1-6)+
|
A-D-MANP-(1-3)-B-D-MANP-(1-4)-B-D-GLCPNAC-(1-4)-B-D-GLCPNAC-(1-4)-ASN
|
B-D-XYLP-(1-2)+
Structure < 2>
Molecular weight: 1141.0
Name:complex_6_2
A-D-MANP-(1-6)+
|
A-D-MANP-(1-3)-B-D-MANP-(1-4)-B-D-GLCPNAC-(1-4)-B-D-GLCPNAC-(1-4)-ASN
|
B-D-XYLP-(1-2)+
Structure < 3>
Molecular weight: 846.7
Name:hm_4_1
a-D-Manp-(1-3)-b-D-Manp-(1-4)-b-D-GlcpNAc-(1-4)-b-D-GlcpNAc-(1-4)-Asn
Structure < 4>
Molecular weight: 1008.9
Name:hm_5_1
A-D-MANP-(1-6)+
|
B-D-MANP-(1-4)-B-D-GLCPNAC-(1-4)-B-D-GLCPNAC-(1-4)-ASN
|
A-D-MANP-(1-3)+
Structure < 5>
Molecular weight: 992.9
Name:hm_5_2
a-L-Fucp-(1-6)+
|
b-D-GlcpNAc-(1-4)-Asn
|
a-D-Manp-(1-6)-b-D-Manp-(1-4)-b-D-GlcpNAc-(1-4)+
Structure < 6>
Molecular weight: 1155.0
Name:hm_6_1
A-L-FUCP-(1-6)+
|
A-D-MANP-(1-6)+ B-D-GLCPNAC-(1-4)-ASN
| |
B-D-MANP-(1-4)-B-D-GLCPNAC-(1-4)+
|
A-D-MANP-(1-3)+
Structure < 7>
Molecular weight: 1171.0
Name:hm_6_2
A-D-MANP-(1-3)-A-D-MANP-(1-6)+
|
B-D-MANP-(1-4)-B-D-GLCPNAC-(1-4)-B-D-GLCPNAC-(1-4)-ASN
|
A-D-MANP-(1-3)+
Choose from the following structures:
1
Repeat selection process for other glycans…
Placing glycan complex_5_1 on residue CA ASN B 60G
Placing glycan complex_5_1 on residue CA ASN B 62
WARNING: Glyco molecule moved more than 8 Angstroms. Could be a missed buried residue.
Placing glycan complex_5_1 on residue CA ASN D 78
Placing glycan complex_5_1 on residue CA ASN D 204B
Pdb file written: gly_1XMN.pdb