Manual

The following sections briefly describe the method implemented in GLYCOSYLATION, how to run GLYCOSYLATION from the command-line and describe the required input and the produced output file.

Introduction

GLYCOSYLATION adds glycan groups to asparagine (ASN) or serine (SER) residues of protein structures. The glycan groups are selected from a database of 50 precomputed structures found on glycosylated proteins ranging from 320 to 6820 Da. The structures were generated using Sweet2 which models the 3D structure of saccharides.

The program can be run automatically, where the residues and glycan structures are determined by the program, given the number of glycans to be added and the total molecular weight of the glycans.

Alternatively, expert mode can be used to specify exactly which glycan goes to which residue. This mode offers no detection of buried residues. NOTE: Only ASN residues are considered in the automatic mode.

Running glycosylation

Command-Line

Usage:

$ glycosylation [OPTIONS] <MODEL>

GLYCOSYLATION accepts absolute as well as relative paths to the atomic coordinate MODEL in .pdb or .cif format.

Arguments and Options

GLYCOSYLATION recognizes the following command-line options. Mandatory arguments to long options are mandatory for short options too.

Short Option Long Option Default Description
  --seed=<INT>   Set the seed for the random number generator
-k -keepASN False (optional) whether to keep the ASN residue from the glycan structure. This means there will be an additional amino acid residue to the corresponding protein residue. This is meant to be used for modeling the glycan as a separate rigid body.
-n -nsugars=<number of glycans> N/A The number of glycans to be added to the structure.
-m -mw=<total molecular weight> N/A The total molecular weight of the glycan groups to be added.
-e -expert N/A Run in expert mode. NOTE: only the number of glycans needs to be specified at the command line, not the MW.
-o --output=<FILE>   Write output to FILE. The output format depends on the FILE extension, supported are .pdb and .cif.
-v -version N/A Print version information and exit.
-h --help N/A Print a summary of arguments, options, and exit.

Graphical Interface

Figure 1: first page of the GLYCOSYLATION wizard when started from the ATSAS Application Launcher.

As an alternative to usage from the command-line, GLYCOSYLATION may also be run as a wizard from the ATSAS Application Launcher.

This graphical interface allows convenient configuration of the arguments and options outlined in the command-line section. After completion of the calculations, the produced output files can be saved.

Note that the wizard only supports the automatic mode of GLYCOSYLATION. To manually insert glycans, use the command-line with the --expert option.

glycosylation Input Files

GLYCOSYLATION accepts atomic coordinate data in .pdb or .cif format as input. This may be either a relative or absolute file path.

glycosylation Output files

The created output file contains the same atomic positions as the initial input file, but with glycans and an application-specific header added.

The output format is determined by the extension of the output file name. Supported are the .pdb or .cif formats.

Examples

Random addition of glycans

Glycosylate 1XMN.pdb with 4 glycans about 1000 Da each.

Note the implied format conversion; this is not required, but possible and may be convenient for some use cases.

$ glycosylation 1XMN.cif -n 4 -mw 4000 -o 1xmn_gly.pdb
 input PDB FILE: 1XMN.cif
 output PDB FILE: 1xmn_gly.pdb
 Glyco data successfully loaded.
 Found (          42 ) ASN residues.
 Found           7  glyco structures available for glycosylation.

The created 1xmn_gly.pdb contains details about the glycans added:

REMARK 265 atsas_software_runtime
REMARK 265 ASN           chain_id      glyan_id      name          mw
REMARK 265 91            F             26            hm_4_1        846.7
REMARK 265 151D          B             7             hm_6_1        1155.0
REMARK 265 143           B             6             hm_6_2        1171.0
REMARK 265 95            F             27            hm_5_1        1008.9

User-defined addition of glycans

Glycosylate 1XMN.pdb with 4 glycans about 1000 Da each, with manual selection of glycans and their attachment position.

First select the locations for the glycans from the list of all possible locations, then provide an MW estimate for each, get a list of sugars of that weight to choose from. Continue until all glycans are placed.

$ glycosylation 1XMN.cif -n 4 -mw 4000 -o 1xmn_gly.pdb --expert

 Entering expert mode.
 Glyco data successfully loaded.
 input PDB FILE: 1XMN.pdb
 output PDB FILE: gly_1XMN.pdb
 Found (         105 ) non-buried ASN residues out of (         105 ) ASN residues.
 Choose from the following ASN and SER residues:
<  1>  CA SER  A    4E
<  2>  CA SER  A   19
<  3>  CA SER  A   31I
   :
<107>  CA ASN  H  217
<108>  CA SER  H  226
Enter residue choices one at a time:
Enter residue choice (1/4):
6
Enter residue choice (2/4):
7
Enter residue choice (3/4):
37
Enter residue choice (4/4):
57
 For each of the chosen ASN residues, choose a  
 sugar by first giving molecular weight range   
 then selecting from the available structures.
 
 For glycan:           1 CASERB          22
 Enter approximate MW (Da):
1000
 Found           7  glyco structures available for glycosylation.
[...]

Repeat selection process for other glycans. The header now contains:

REMARK 265 atsas_software_runtime
REMARK 265 ASN           chain_id      glyan_id      name          mw            
REMARK 265 22A           B             6             complex_5_1   1141.0        
REMARK 265 31            B             7             complex_5_1   1141.0        
REMARK 265 53G           D             37            complex_5_1   1141.0        
REMARK 265 19            E             57            complex_5_1   1141.0