In this tutorial we will
To this end we will use the software CYANA, NMR2, and excel (or similar)
Content
|
Please follow the following steps carefully (exact Linux commands are given below; you may copy them to a terminal):
cd "download path"/cyana-3.98.15
#optional
sudo xattr -rd com.apple.quarantine cyanaexe.macarm-gfortran
./cyana
CYANA 3.98.15 (macarm-gfortran)
Copyright (c) 2002-22 Peter Guentert. All rights reserved.
___________________________________________________________________
Demo license valid for specific sequences until 2024-12-31.
Library file "/Users/julienorts/Downloads/cyana-3.98.15/lib/cyana.lib" read, 41 residue types.
Hint: More information on the CYANA commands etc. is in the CYANA 3.0 Reference Manual.
Remark: CYANA is a proprietary software. For any installation problem, contact Peter Güntert, the author of CYANA.
You need also a molecular viewer capable of saving mol2 file
Download the zip file for the workshop: workshopData.zip
Open the simulation_data.xlsx in part1/task1_to_3 directory. These are for us the experimetal data!
We will start with the structure calculation of our ligand (drug-like) molecule.
From the NOESY spectra, we obtain the cross-relaxation rates using the intensity
(volume) of the crosspeaks at a known (experimentally set) mixing time.
These are directly related to the interatomic distances, which will determine the conformation of the molecule.
The NOESY buildup curves. "Intramolecular NOE"
In the table "Intramolecular NOE", we have the intensity at different mixing times (up to 0.1s) given at the first line.
Plot those, notice the scale, and shape.
The NOESY spectrum contain also diagonal crosspeaks. These correspond to the (non-equilibrium Z-) magnetisation decay of
the spin-themselfs: autorelaxation, or in other words, decay with the T1-type (R1-type) relaxation time (rate).
Rii = ρi = b2/dij6 (J(0) + 3J(ω) + 6J(2ω)) #contribution of one neighbor spin "j" in distance "d".
And the crossrelaxation rate between H-spins "i" and "j" is:
Rij = σij = b2/dij6 (-J(0) + 6J(2ω))
where "b" is the dipol-dipol interaction strength, and "J(ω)" is the spectral density at angular frequency "ω".
The spectral density J is the Fourier transformation of the rotational correlation function, shows the distribution of frequencies of the molecular rotational motion.
Notice the scale and shape.
How would you obtain the R1 relaxation rate?
The initial rate of the NOESY crosspeak, is directly proportional to the distance between the respective spins (protons).
The other dependency comes from the rotational correlation time of the molecule, which is dependent on temperature,
solvent viscosity, solvation shell of the molecule, shape of the molecule, and in the case of a small molecule partly
bound to a larger protein, the effective correlation time is also modulated by this partial bounding - the chemical
exchange. Therefore, it is practical to leave these dependencies aside, and calibrate the relation between the NOE
buildup rate (cross-relaxation rate) and the interproton distance using a known distance.
Fortunately, there are many proton pairs in the molecule with fixed distance, simply due to the covalent structure.
In a separate column, assuming a linear buildup in time: chose now
the first mixing time to get the initial cross-relaxation rate.
Note, that this cross-relaxation rate is not normalized (it is not in [s-1]), but since we will do the referencing using a know distance, we do not have to normalize.
For the reference pair of protons. We have now also the reference sigma sigmaRef.
Use the formula
rij = rRef * (sigmaRef/sigmaij)^(1/6). Vogeli 2014, eq. 63b
to calculate the other distances between other atoms (in a separate column).
In order to calculate the structure (conformation) of the molecule, we need to know the interatomic distances.
This is something what we have already. As these distances come with some inaccuracies, we have to input two distances -
a lower and upper bound in which we believe that the true distance resides.
For practical purpose, we just create two new columns, where the distances derived above are multiplied by factors of
0.8 and 1.2 respectively. In fact, just the upper bound will be enough in our exercises.
Create the lower and upper interatomic distances as separate columns.
There is no closed-form formula to calculate the conformation (structure) from a set of distances.
The setup starts with defining an energy penalty for every experimental distance not fulfilled by the molecular conformation. These are also called distance restraints.
Starting from one chosen conformation, and trying to minimize the structure (using steepest descent or other local method) to fulfill
the distances measured by NOE (or any other means) would fail: the structure would end-up in a local minimum. Instead we
have to search for a global minimum. A commonly used algorithm for a global minimum is called simulated annealing, where
the molecule is heated up such that high-energy barriers (due to van-der Waals clashes) can be surpassed. By a
subsequent cooling, the imposed distance restraints will drive the molecule towards the conformation with minimal
violation of the distance restraints. Many attempts will nevertheless end up in different local minima, and hence, only a
subset of resulting conformers, the lowest-energy conformers will be likely to represent the global minimum.
In practice, we have to input the knowledge about the covalent (bonding) structure of the molecule, and the distance
restraints. The bonding structure can be as simple as the chain of aminoacids, as the standard programs would have
libraries of the actual atomic bonding (topology) for those. For an unknown molecule, we have to supply a full topology
ourselves. These would be different for different programs.
We will use program specialized structure calculation from NMR restraints: CYANA by Prof. Dr. Peter Günter.
CYANA can obtain the bonding topology from a .mol2 molecular structure file, converting it into its own (library)
format, a .lib file. This library file with contain information about one molecule, but since biopolymers - proteins
contain chain (sequence) of building blocks like aminoacid residues of nucleotides, there has to be also information
about the sequence of those building block. In our case it contains only one record: the name of our ligand molecule.
We will use a ready mol.lib file in our exercise. Besides of the physical atoms H, there are also pseudoatoms Q created to replace the chemically equivalent H atoms.
We complete the information by a .seq file with a "sequence" containing only one line the name and number of the residue (MOL 999)
The other information: the distance restraints are obtained by CYANA from a separate text file, where
the pairs of atoms are identified by three and three columns, and the distance in Ånsgrom.
ResidueNumber1 residueName1 atomName1 residueNumber2 residueName2 atomName2 distance.
In our first calculation of a single molecule (residue), obviously only the atomName1 and atomNam2 would be different.
Further instructions for CYANA are read from the .cya file.
Note that the atom numbers and names have to exactly match the MOL.lib file.
Note also, that the distance has to be in a numerical format using "dot" as a decimal separator and not a "comma"!
Navigate to the task_4_to_5 directory in the terminal (command line) and start the CYANA:
$ cyana
In the cyana prompt, call the instruction in req.cya (leaving out the .cya) to perform the structure calculation:
cyana> reg
In less than a minute, the calculation is ready.
We have now the structure file: demo.pdb
and the overview file about the calculation: demo.ovw
Close CYANA
cyana> exit
From the theoretical introduction about NOE, we know that the existence of crosspeak between two spins does not have to
be caused by the direct through-space transfer of magnetisation between them.
Instead, magnetisation transfer via an third nucleus can occur. This is called spin diffusion.
Compare the structures
Instead of the first mixing time in Task 1,
choose the last mixing time and proceed all the way to calculate the structure in CYANA.
Note the differences in extracted distances and in the resulting structures.
In this short exercise, we will calculate the protein structure using ready distances stored in the final_protein.upl
file. We do not need any extra library file, as this time, the sequence file (demoShort.seq) contains only standard
aminoacid residues.
Look at the structures using chimera molecular viewer.
Calculate the structure of the protein-ligand complex.
cat final_protein.upl mol.upl intermolecular.upl > complex.upl
Note! if using VMD as a molecular viewer, it will refuse to recognize large parts of the secondary structure!
Try to explain what can be causing it.
Check the median of the distance restraints placed into .upl
It should be around 4.2 for intramolecular distances, and 4.4 for the intermolecular distances.
What do you get? Discuss how robust the calculation of the protein-ligand structure is.
NMR2 runs via the platform SAMSON.