NMR structure calculation with ARIAweb


Link to this tutorial online: http://aria.pasteur.fr/tutorial

In this practical, we will calculate the structure of a GTPase-binding domain from the Mus musculus Sec5 protein (Mott et al., 2003, Structure 278(19):17053-9) with ARIA2. The data comprise two NOESY spectra and torsion angles restraints from Talos-N predictions.

Links:

IMPORTANT For this tutorial, you will use the ARIAweb server, for which you can either Register or use anonymously. Please go first to the login page and choose how to use the server. https://ariaweb.pasteur.fr/login


Once you have access to ARIAweb, here is how your homepage looks like.

Figure 1: ARIAweb homepage

Using a NEF data file


IMPORTANT For this tutorial, you first need to download the corresponding NEF data file input_new.nef (right-click "Save link As...").

The NEF data file contains the following data:


1. Create a Project


Before all, you will need to create a new project.

  1. Click on the button.

  2. Specify the "Project name", e.g. 1hk6_run1

  3. Click the button under "NEF project" to select the input_new.nef file you've downloaded.

  4. Click on the button.
Figure 1: Creating a project with NEF

ARIAweb will thus read the content of the NEF data file and will be ready for preparing a new structure calculation.

2. Prepare your ARIA structure calculation project

Now that you have created your ARIAweb project and uploaded your NEF data file, you can prepare your ARIA structure calculation.

Click the button under Structure calculation (Figure 2).

Figure 2: Project menu

You now have a new form where you will be able to validate or modify all the parameters for your ARIA run. The Structure Calculation form is designed to guide the user through the main categories of parameters that need to be checked (if default values are used) or specified (if a user wants to change with customized values). The 4 main categories of parameters are:

Figure 3: SETUP page of the structure calculation form

Start by giving a name to this ARIA job, e.g. 1hk6_run1.

Click the button to move to the next section of the form.

In the DATA section, you will see that the Molecular system is already pre-filled with the sequence defined in the NEF file (nef_molecular_system) and the format is set to NEF.

Click to move to the Spectra definitions.

Adding NOE peak-list and chemical shifts

There are 2 spectra (or peak-lists) in the NEF data file.

Leave all parameters as default (such as the assignment tolerances for the 3 dimensions of the NOE peak-lists).

Figure 4: Spectra parameters

Adding other data

In addition to NOE data, we will use dihedral angle restraint (derived from Talos-N predictions using xplor-NIH).

To do so, activate the sliders on the left for Dihedral angles (Figure 5).

Click the button to move to Dihedral angles.

Select NEF as format and the nef_dihedral_restraint_list_1 as NEF name (Figure 5).

Figure 5: Dihedral angle restraints data and parameters

Finally, click the button again.

For the practical, we use all parameters as default for PROCOL and STRUCTURE GENERATION.

Continue clicking until you see the button.

Figure 6: A Structure calculation form ready to be saved

Clicking will save (no surprise here 😉) your parameters and you will be directed to the page where can submit the ARIA calculation job.

Figure 7: List of structure calculations ready to be submitted

Click the button to submit you structure calculation. You will be redirected to a Results page where you can see the status of your ARIA job. See here for an explanation on the status icon.

Depending on the server load the calculation might take ~30 minutes to hours... For the rest of the practical you will not have to wait until the calculation is finished. We have pre-calculated results that will allow you to visualise and analyse the results of an ARIA job.

3. Results

If your job is finished (status icon ), click on the button to start the job analysis and visualize the results.

Results archive can be downloaded using .

Tutorial results are already available so you don't have to wait. Click HERE to go to the results page.

The visualisation page allows to view an analysed the results of a structure calculation job. Final structure ensemble generated by ARIAweb are displayed interactively, with various representations. Final restraints statistics, structure quality checks and bundle RMSD are shown to help the user interprets the reliability of the results. On top of that, more graphs, restraints validation and PDB files can be downloaded directly (Figure 8).

Figure 8: Screenshot of a typical Visualisation page, where main components are marked in red.

Results for all ARIA iterations can be shown by selecting an iteration in the Iterations tab. The main component of the Visualisation page is the NGL viewer. By default, several representations of the structure ensemble generated by ARIA are shown:

quizz
Is the NOE violation statistics for residues better at iteration 2 (IT 2) or the last iteration (IT 8) ?
(Hint: Look at the color of side-chains)

The Representation tab (below the NGL viewer) allows to change the visibility, styling and coloring of a selected representation. By default, all structures in the bundle are shown; use the filter_frames button to select individual models for display).

The add_circle button allows to upload another PDB file to superimpose on the ARIA structure ensemble (use the layers button to superimpose on a selected structure).

Below, Restraints statistics recapitulate the number of restraints and the trend since the previous iteration for:

  1. Restraints used for structure calculation (the more, the better)
  2. Unsatisfied (i.e. consistently violated) restraints (less is better)
  3. Merged (identical) restraints
quizz
How many NOE cross-peaks remain unassigned ?

How many restraints were used for structure calculation ?

The Contribution histogram gives count of restraints (from each input spectra) that have 1 (i.e. unambiguous) or more (i.e. ambiguous) assignment possibilities (or "contributions"). Ultimately, ARIA tries to reduce the ambiguity in NOE assignments, producing more unambiguous restraints. Truly ambiguous assignments can remain due to spectral overlap or chemical shifts degeneracy.

The Ensemble RMSD graph shows, for each iteration, the RMSD of the structure ensemble generated by ARIA ("bundle"), computed as the mean RMSD when superimposing of the ensemble average (after iterative superimposition). Low RMSD (< 1-2 ) is good indicator of convergence of the NOE assignment and structure calculation process by ARIA.

quizz
How the ensemble precision evolves during the ARIA iterative process ?

Is there a convergence at the 1st iteration ? At the last iteration ?

What is the precision of the final structure ensemble ?
(Hint: Look at the "Summary table" under More results)

The Quality checks panel summarizes the results of 3 main structural quality validators: WHAT-IF, Procheck and Molprobity. Quality scores are shown on a slider from bad to good values. Bad quality score values indicate that the input data may contain errors/inconsistencies and that ARIA was not able to produce a high quality model. We provide here some indicators on how to judge the quality checks:

  1. Procheck Ramachandran percentage: for typical NMR structures deposited in the PDB, 80% of the dihedral angles lie within the preferred region of the Ramachandran plot. For high-resolution NMR structures, a higher percentage is expected (90%).

  2. WHAT-IF Z-scores: WHAT-IF results are presented in the form of overall Z-scores. In general, structures with Z-scores between -2 and +2 are considered to be within a normal range and are thus good structures, while structures with Z-scores lower than -2 should be inspected further. Useful indicators of good quality are Backbone conformation and Packing quality. The bump-score also reports the number of van der Waals violations per 100 residues.

  3. WHAT-IF profiles: recently, some studies have stressed that global structural indicators are not sufficient to detect errors in structures and suggested examining parameters on a per-residue basis. Such profiles for the WHAT-IF scores are produced by ARIA in the form of a PDF file. Thus, poor quality regions can be precisely identified.

  4. Molprobity clashscore. this reports the number of overlaps >0.4 per thousand atoms. For typical NMR structures deposited in the PDB, this score is generally high (>10). From our experience, the application of the log-harmonic potential along with automated weight estimation significantly improves this situation (see the More results panels).

quizz
Compare the overall quality scores (Quality checks) for the last iteration (IT8) and for the water refinement (REFINE).

The More results panel provides a link to a standard summary table of structural and restraints statistics (iteration 8 and refine)

quizz
Take a look at the restraint violations statistics for NOE and dihedrals angles restraints in the "Summary table"

Additional results are also accessible:

quizz
Look at the "Per residue WHATCHECK graph". Do you spot regions of lower quality ?


4. Updated NEF file and mmCIF coordinates

When the calculation is finished, ARIA exports a new NEF file containing the cross-peaks assignments and restraints used for calculation. You can access those files in the ARIA run ZIP file, in the More results section (archive Full ARIA run archive).

After unpacking the archive, you can the find the following files:

run1/structures/it8/aria2_run1_it8_deposition.nef
run1/structures/it8/aria2_run1_it8_full.nef
run1/structures/it8/fitted_1hk6_run1.cif
run1/structures/refine/fitted_1hk6_run1_water.cif

Coordinate files of final ensembles (it8 and refine) are present in mmCIF format for PDB deposition.

NEF files (with restraints used for the last iteration and solvent refinement) are exported in 2 flavors:


5. New calculation with tuned parameters

As you may have concluded, the previous calculation did not yield satisfactory results in terms of convergence, restraints statistics and structure quality.

On ARIAweb, one can setup a new calculation starting from the same input data (i.e. NEF file), but with different, optimised parameters. Among the many parameters that can be changed in ARIAweb, we suggest the following:

As an example, you can visualise HERE the results of a new structure calculation using the same NEF file (input_new.nef), but with tuned parameters.
quizz
How improved are the results for this new calculation ?


To explore the variety of parameters on ARIAweb, do not hesitate to activate the sliders on the different forms . By default, only basic options are displayed in each form. Enable the Expert Mode to see more options (User menu ).



We invite you to read the following book chapter to learn more about ARIA and on how to judge the quality and reliability of structures determined with ARIA from NMR data.

Bardiaux B, Malliavin T, Nilges M. ARIA for solution and solid-state NMR. Methods Mol Biol. 2012;831:453-83. https://doi.org/10.1007/978-1-61779-480-3_23

Back to top | Home | PDF version


© 2022 Benjamin Bardiaux (Institut Pasteur/CNRS)