<?xml version="1.0"?>
<feed xmlns="http://www.w3.org/2005/Atom" xml:lang="en">
	<id>https://chemwiki.ch.ic.ac.uk/api.php?action=feedcontributions&amp;feedformat=atom&amp;user=Af706</id>
	<title>ChemWiki - User contributions [en]</title>
	<link rel="self" type="application/atom+xml" href="https://chemwiki.ch.ic.ac.uk/api.php?action=feedcontributions&amp;feedformat=atom&amp;user=Af706"/>
	<link rel="alternate" type="text/html" href="https://chemwiki.ch.ic.ac.uk/wiki/Special:Contributions/Af706"/>
	<updated>2026-06-09T21:13:47Z</updated>
	<subtitle>User contributions</subtitle>
	<generator>MediaWiki 1.43.8</generator>
	<entry>
		<id>https://chemwiki.ch.ic.ac.uk/index.php?title=AndyForesterFulvene&amp;diff=106406</id>
		<title>AndyForesterFulvene</title>
		<link rel="alternate" type="text/html" href="https://chemwiki.ch.ic.ac.uk/index.php?title=AndyForesterFulvene&amp;diff=106406"/>
		<updated>2010-03-16T19:51:19Z</updated>

		<summary type="html">&lt;p&gt;Af706: /* Setting up the IRC file */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;=Fulvene TD-DFT Excited State IRC=&lt;br /&gt;
&lt;br /&gt;
==Getting Force constants and a path==&lt;br /&gt;
&lt;br /&gt;
Using initial coordinates from David, performed a calculation to retrieve forces using:&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;#p freq=projected b3lyp/6-31G(d) td=(root=1)&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
This will give the steepest descent as an imaginary frequency and also allow me to read in (RCFC) to the IRC calculation.&lt;br /&gt;
&lt;br /&gt;
==Creating the IRC input file==&lt;br /&gt;
&lt;br /&gt;
===Internal Coordinate path vector===&lt;br /&gt;
&lt;br /&gt;
The path vector MUST be in internal coordinates - they are printed as Cartesian coordinates in the log file so must be converted:&lt;br /&gt;
&lt;br /&gt;
* Open the frequency calculation log in Gaussview 5&lt;br /&gt;
* Visualise the frequency that corresponds to the required path&lt;br /&gt;
* Allow manual displacement of the frequency in Gaussian 5&lt;br /&gt;
* Save structure at 0 displacement and turn off (write Cartesians) by going though calculation setup (edit)&lt;br /&gt;
* Displace structure 0.1 using the manual displacement slider&lt;br /&gt;
* Save structure again without writing Cartesians&lt;br /&gt;
&lt;br /&gt;
This gives 2 sets of internal coordinates - the difference is the vector we require in internal coordinates.&lt;br /&gt;
&lt;br /&gt;
* Check that the atoms are matched in each set of coordinates - the NoSym option in calculation setup can be used to avoid changing the coordinates at all.&lt;br /&gt;
* Paste coordinates into excel and take one from the other.&lt;br /&gt;
* This is our new vector!&lt;br /&gt;
&lt;br /&gt;
The next step is to make the vector in a format that Gaussian can read. This is VERY important since the vector is not checked by gaussian. The format is &#039;&#039;8F10.6&#039;&#039; for each value - 8 numbers, 10 Characters, 6 decimal places. Place the vector into this format in a text file (or use the script!). There should be 3N-6 internal coordinates.&lt;br /&gt;
&lt;br /&gt;
eg:&lt;br /&gt;
&lt;br /&gt;
 -0.010890 -0.010890 -0.010890 -0.010890 -0.010890 -0.010890 -0.010890 -0.010890&lt;br /&gt;
  0.000330  0.000330  0.000330  0.000330  0.000330  0.000330  0.000330  0.000330&lt;br /&gt;
  0.012430  0.012430  0.012430  0.012430  0.012430  0.012430  0.012430  0.012430&lt;br /&gt;
  0.012430  0.012430  0.012430  0.012430&lt;br /&gt;
&lt;br /&gt;
The route IRC=ReadVector will make Gaussian read the vector - it should be placed at the end of the input file.&lt;br /&gt;
&lt;br /&gt;
===Setting up the IRC file===&lt;br /&gt;
&lt;br /&gt;
IRC calculations also require initial force constants to proceed. You must provide these to the calculation in some way, options defined in the route section:&lt;br /&gt;
&lt;br /&gt;
* &#039;&#039;IRC=RCFC&#039;&#039; reads the force constants from the checkpoint file from the preceding frequency calculation (used to identify the required frequency corresponding to the reaction coordinate)&lt;br /&gt;
* &#039;&#039;IRC=CalcFC&#039;&#039; recalculates the force constants at the beginning of the IRC calculation&lt;br /&gt;
&lt;br /&gt;
The best way to do this is to retrieve them from the previous calculation of the frequencies (the chk file). Also because there are no analytical second derivatives for the TD-DFT IRC we need to use an algorithm that doesn&#039;t require them... this is specified using the &#039;&#039;IRC=GradientOnly&#039;&#039;. The route section now looks like:&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;# TD(root=1) b3lyp/6-31G(d) IRC=(RCFC,GradientOnly,ReadVector)&#039;&#039;&lt;/div&gt;</summary>
		<author><name>Af706</name></author>
	</entry>
	<entry>
		<id>https://chemwiki.ch.ic.ac.uk/index.php?title=AndyForesterFulvene&amp;diff=106405</id>
		<title>AndyForesterFulvene</title>
		<link rel="alternate" type="text/html" href="https://chemwiki.ch.ic.ac.uk/index.php?title=AndyForesterFulvene&amp;diff=106405"/>
		<updated>2010-03-16T19:50:57Z</updated>

		<summary type="html">&lt;p&gt;Af706: /* Internal Coordinate path vector */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;=Fulvene TD-DFT Excited State IRC=&lt;br /&gt;
&lt;br /&gt;
==Getting Force constants and a path==&lt;br /&gt;
&lt;br /&gt;
Using initial coordinates from David, performed a calculation to retrieve forces using:&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;#p freq=projected b3lyp/6-31G(d) td=(root=1)&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
This will give the steepest descent as an imaginary frequency and also allow me to read in (RCFC) to the IRC calculation.&lt;br /&gt;
&lt;br /&gt;
==Creating the IRC input file==&lt;br /&gt;
&lt;br /&gt;
===Internal Coordinate path vector===&lt;br /&gt;
&lt;br /&gt;
The path vector MUST be in internal coordinates - they are printed as Cartesian coordinates in the log file so must be converted:&lt;br /&gt;
&lt;br /&gt;
* Open the frequency calculation log in Gaussview 5&lt;br /&gt;
* Visualise the frequency that corresponds to the required path&lt;br /&gt;
* Allow manual displacement of the frequency in Gaussian 5&lt;br /&gt;
* Save structure at 0 displacement and turn off (write Cartesians) by going though calculation setup (edit)&lt;br /&gt;
* Displace structure 0.1 using the manual displacement slider&lt;br /&gt;
* Save structure again without writing Cartesians&lt;br /&gt;
&lt;br /&gt;
This gives 2 sets of internal coordinates - the difference is the vector we require in internal coordinates.&lt;br /&gt;
&lt;br /&gt;
* Check that the atoms are matched in each set of coordinates - the NoSym option in calculation setup can be used to avoid changing the coordinates at all.&lt;br /&gt;
* Paste coordinates into excel and take one from the other.&lt;br /&gt;
* This is our new vector!&lt;br /&gt;
&lt;br /&gt;
The next step is to make the vector in a format that Gaussian can read. This is VERY important since the vector is not checked by gaussian. The format is &#039;&#039;8F10.6&#039;&#039; for each value - 8 numbers, 10 Characters, 6 decimal places. Place the vector into this format in a text file (or use the script!). There should be 3N-6 internal coordinates.&lt;br /&gt;
&lt;br /&gt;
eg:&lt;br /&gt;
&lt;br /&gt;
 -0.010890 -0.010890 -0.010890 -0.010890 -0.010890 -0.010890 -0.010890 -0.010890&lt;br /&gt;
  0.000330  0.000330  0.000330  0.000330  0.000330  0.000330  0.000330  0.000330&lt;br /&gt;
  0.012430  0.012430  0.012430  0.012430  0.012430  0.012430  0.012430  0.012430&lt;br /&gt;
  0.012430  0.012430  0.012430  0.012430&lt;br /&gt;
&lt;br /&gt;
The route IRC=ReadVector will make Gaussian read the vector - it should be placed at the end of the input file.&lt;br /&gt;
&lt;br /&gt;
===Setting up the IRC file===&lt;br /&gt;
&lt;br /&gt;
IRC calculations also require initial force constants to proceed. You must provide these to the calculation in some way, options defined in the route section:&lt;br /&gt;
&lt;br /&gt;
* &#039;&#039;IRC=RCFC&#039;&#039; reads the force constants from the checkpoint file from the preceding frequency calculation (used to identify the required frequency corresponding to the reaction coordinate)&lt;br /&gt;
* &#039;&#039;IRC=CalcFC&#039;&#039; recalculates the force constants at the beginning of the IRC calculation&lt;br /&gt;
&lt;br /&gt;
The best way to do this is to retrieve them from the previous calculation of the frequencies (the chk file). Also because there are no analytical second derivatives for the TD-DFT IRC we need to use an algorithm that doesn&#039;t require them... this is specified using the &#039;&#039;IRC=GradientOnly&#039;&#039;. The route section now looks like:&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;# TD(root=1) b3lyp/sto-3g IRC=(RCFC,GradientOnly,ReadVector)&#039;&#039;&lt;/div&gt;</summary>
		<author><name>Af706</name></author>
	</entry>
	<entry>
		<id>https://chemwiki.ch.ic.ac.uk/index.php?title=AndyForesterFulvene&amp;diff=106401</id>
		<title>AndyForesterFulvene</title>
		<link rel="alternate" type="text/html" href="https://chemwiki.ch.ic.ac.uk/index.php?title=AndyForesterFulvene&amp;diff=106401"/>
		<updated>2010-03-16T19:45:51Z</updated>

		<summary type="html">&lt;p&gt;Af706: /* Creating the IRC input file */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;=Fulvene TD-DFT Excited State IRC=&lt;br /&gt;
&lt;br /&gt;
==Getting Force constants and a path==&lt;br /&gt;
&lt;br /&gt;
Using initial coordinates from David, performed a calculation to retrieve forces using:&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;#p freq=projected b3lyp/6-31G(d) td=(root=1)&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
This will give the steepest descent as an imaginary frequency and also allow me to read in (RCFC) to the IRC calculation.&lt;br /&gt;
&lt;br /&gt;
==Creating the IRC input file==&lt;br /&gt;
&lt;br /&gt;
===Internal Coordinate path vector===&lt;br /&gt;
&lt;br /&gt;
The path vector MUST be in internal coordinates - they are printed as Cartesian coordinates in the log file so must be converted:&lt;br /&gt;
&lt;br /&gt;
* Open the frequency calculation log in Gaussview 5&lt;br /&gt;
* Visualise the frequency that corresponds to the required path&lt;br /&gt;
* Allow manual displacement of the frequency in Gaussian 5&lt;br /&gt;
* Save structure at 0 displacement and turn off (write Cartesians) by going though calculation setup (edit)&lt;br /&gt;
* Displace structure 0.1 using the manual displacement slider&lt;br /&gt;
* Save structure again without writing Cartesians&lt;br /&gt;
&lt;br /&gt;
This gives 2 sets of internal coordinates - the difference is the vector we require in internal coordinates.&lt;br /&gt;
&lt;br /&gt;
* Check that the atoms are matched in each set of coordinates - the NoSym option in calculation setup can be used to avoid changing the coordinates at all.&lt;br /&gt;
* Paste coordinates into excel and take one from the other.&lt;br /&gt;
* This is our new vector!&lt;br /&gt;
&lt;br /&gt;
The next step is to make the vector in a format that Gaussian can read. This is VERY important since the vector is not checked by gaussian. The format is &#039;&#039;8F10.6&#039;&#039; for each value - 8 numbers, 10 Characters, 6 decimal places. Place the vector into this format in a text file (or use the script!). There should be 3N-6 internal coordinates.&lt;br /&gt;
&lt;br /&gt;
eg:&lt;br /&gt;
&lt;br /&gt;
 -0.010890 -0.010890 -0.010890 -0.010890 -0.010890 -0.010890 -0.010890 -0.010890&lt;br /&gt;
  0.000330  0.000330  0.000330  0.000330  0.000330  0.000330  0.000330  0.000330&lt;br /&gt;
  0.012430  0.012430  0.012430  0.012430  0.012430  0.012430  0.012430  0.012430&lt;br /&gt;
  0.012430  0.012430  0.012430  0.012430&lt;br /&gt;
&lt;br /&gt;
The route IRC=ReadVector will make Gaussian read the vector - it should be placed at the end of the input file.&lt;/div&gt;</summary>
		<author><name>Af706</name></author>
	</entry>
	<entry>
		<id>https://chemwiki.ch.ic.ac.uk/index.php?title=AndyForesterFulvene&amp;diff=106397</id>
		<title>AndyForesterFulvene</title>
		<link rel="alternate" type="text/html" href="https://chemwiki.ch.ic.ac.uk/index.php?title=AndyForesterFulvene&amp;diff=106397"/>
		<updated>2010-03-16T19:35:33Z</updated>

		<summary type="html">&lt;p&gt;Af706: New page: =Fulvene TD-DFT Excited State IRC=  ==Getting Force constants and a path==  Using initial coordinates from David, performed a calculation to retrieve forces using:  &amp;#039;&amp;#039;#p freq=projected b3l...&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;=Fulvene TD-DFT Excited State IRC=&lt;br /&gt;
&lt;br /&gt;
==Getting Force constants and a path==&lt;br /&gt;
&lt;br /&gt;
Using initial coordinates from David, performed a calculation to retrieve forces using:&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;#p freq=projected b3lyp/6-31G(d) td=(root=1)&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
This will give the steepest descent as an imaginary frequency and also allow me to read in (RCFC) to the IRC calculation.&lt;br /&gt;
&lt;br /&gt;
==Creating the IRC input file==&lt;/div&gt;</summary>
		<author><name>Af706</name></author>
	</entry>
	<entry>
		<id>https://chemwiki.ch.ic.ac.uk/index.php?title=AndyForesterPYP&amp;diff=105961</id>
		<title>AndyForesterPYP</title>
		<link rel="alternate" type="text/html" href="https://chemwiki.ch.ic.ac.uk/index.php?title=AndyForesterPYP&amp;diff=105961"/>
		<updated>2010-03-16T13:26:08Z</updated>

		<summary type="html">&lt;p&gt;Af706: /* S1 Frequency Calculations */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;=3PHY=&lt;br /&gt;
&lt;br /&gt;
Back to Main - https://www.ch.ic.ac.uk/wiki/index.php/AndyForester&lt;br /&gt;
&lt;br /&gt;
==First thoughts==&lt;br /&gt;
&lt;br /&gt;
The 3PHY NMR structure was used as an initial structure and processed like 1W7S. NMR experiments like this lead to a huge amount of information about through-space distances between atoms&lt;br /&gt;
and also bonded distances, but the sheer size of the data set means it can be fit lots of ways (fitting is done iteratively starting from a guess stucture). The result is a set of possible structures, in this case there are 26 different possible stuctures - im not sure which one the wiki starts with!&lt;br /&gt;
&lt;br /&gt;
The file is slightly different to Mikes example... overall -4 charge due to different protonation state (pH7) but charges are the same except for around the chromophore. I included the chromophore + connecting residue when fitting charges since the chromophore is bound to a residue in the primary sequence via a sulphur in is sidechain - this will change the charge distribution of the residue so should be refitted with the chromophore attached. The file you provided uses standard amber charges for this residue and some estimated charges from ESP calculations (+ some fudging to get an integer charge!) for the chromophore.&lt;br /&gt;
&lt;br /&gt;
==The Plan==&lt;br /&gt;
&lt;br /&gt;
* Find and Characterise a Ground State minimum&lt;br /&gt;
* Vertical excitation to the S1 PES - frequency analysis of imaginary modes&lt;br /&gt;
* IRC along S1 PES to S1 minimum.&lt;br /&gt;
&lt;br /&gt;
==Ground State calculations==&lt;br /&gt;
&lt;br /&gt;
A ground state minimum was found using ONIOM B3LYP 6-31G* : Amber with Electronic Embedding and characterised with a frequency calculation.&lt;br /&gt;
&lt;br /&gt;
* Ground state ONIOM optimisation and frequency analysis &#039;&#039;&#039;INPUT&#039;&#039;&#039; - [[Media:3PHY_ONIOM_Optimisation_Frequency_EE.gjf]]&lt;br /&gt;
&lt;br /&gt;
==S1 Frequency and IRC calculations==&lt;br /&gt;
&lt;br /&gt;
The aim of this part of the project is to follow an excited state reaction path using an IRC having simulated a vertical excitation from S0 → S1 (geometry is the same). To do this we cannot use a ground state theory like DFT cf. the density is the ground state density. Methods that can do excited states are TD-DFT, CIS and CASSCF (there are many - we are interested in these). So we will be using the ONIOM method with an excited state-capable theory on the model high level region and looking at the effects of Electronic embedding on the IRC of the S1 surface.&lt;br /&gt;
&lt;br /&gt;
The IRC keyword requests that a reaction path be followed by integrating the intrinsic reaction coordinate, ONIOM calculations use the first-order Euler integration for the predictor step along with the HPC corrector step. It is a practical choice for calculations on large molecules. &lt;br /&gt;
&lt;br /&gt;
It is assumed that the IRC will be started from a transition state structure (1st order saddle point) and thus have only 1 imaginary frequency. The IRC can then follow this transition vector either in the forward or backward direction. Vertical excitation from a S0 minimum will be highly unlikely to lead to a 1st order saddle point on the S1 PES and will almost certainly result in being on the &amp;quot;side of a hill&amp;quot; on the S1 PES - this leads to complications with the following IRC calculation.&lt;br /&gt;
&lt;br /&gt;
There will be multiple imaginary frequencies at this non-stationary point on the S1 surface, the IRC will follow the first one it finds which is not necessarily the vector we want to follow! We need to explicitly define a specific vector for the IRC to follow which represents the reaction path we are investigating. The best way to do this is to run a frequency calculation at this point and visualize the imaginary frequencies to identify the imaginary frequency which represents the reaction, the frequency will be accompanied by a vector which can be read into the IRC calculation.&lt;br /&gt;
&lt;br /&gt;
The standard frequency route will be meaningless on the S1 surface since the calculation assumes a harmonic function (untrue on the side of a hill). This is overcome by using &#039;&#039;freq=project&#039;&#039; in the route section. This invokes computation of the projected frequencies for vibrations perpendicular to the IRC path.&lt;br /&gt;
&lt;br /&gt;
To read the vector into the IRC calculation use &#039;&#039;IRC=ReadVector&#039;&#039; in the route section with the vector listed after the molecule specification in rows containing 8 values per row with 10 characters (inc spaces) making up each value.&lt;br /&gt;
&lt;br /&gt;
IRC calculations also require initial force constants to proceed. You must provide these to the calculation in some way, options defined in the route section:&lt;br /&gt;
&lt;br /&gt;
* &#039;&#039;IRC=RCFC&#039;&#039; reads the force constants from the checkpoint file from the preceding frequency calculation (used to identify the required frequency corresponding to the reaction coordinate)&lt;br /&gt;
* &#039;&#039;IRC=CalcFC&#039;&#039; recalculates the force constants at the beginning of the IRC calculation&lt;br /&gt;
&lt;br /&gt;
=== S1 Frequency Calculations===&lt;br /&gt;
&lt;br /&gt;
Using TD-DFT with the route section stating:&lt;br /&gt;
&lt;br /&gt;
* Electronic Embedding - &#039;&#039;#p freq=projected oniom(b3lyp/6-31g(d) TD=(Root=1):amber=softfirst)=embed geom=connectivity&#039;&#039;&lt;br /&gt;
* Mechanical Embedding - &#039;&#039;#p freq=projected oniom(b3lyp/6-31g(d) TD=(Root=1):amber=softfirst) geom=connectivity&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
These calculations require minimum 14.5Gbytes memory to run - they are also extremely time consuming, taking xxx to run.&lt;/div&gt;</summary>
		<author><name>Af706</name></author>
	</entry>
	<entry>
		<id>https://chemwiki.ch.ic.ac.uk/index.php?title=AndyForesterPYP&amp;diff=105957</id>
		<title>AndyForesterPYP</title>
		<link rel="alternate" type="text/html" href="https://chemwiki.ch.ic.ac.uk/index.php?title=AndyForesterPYP&amp;diff=105957"/>
		<updated>2010-03-16T13:25:52Z</updated>

		<summary type="html">&lt;p&gt;Af706: /* S1 Frequency Calculations */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;=3PHY=&lt;br /&gt;
&lt;br /&gt;
Back to Main - https://www.ch.ic.ac.uk/wiki/index.php/AndyForester&lt;br /&gt;
&lt;br /&gt;
==First thoughts==&lt;br /&gt;
&lt;br /&gt;
The 3PHY NMR structure was used as an initial structure and processed like 1W7S. NMR experiments like this lead to a huge amount of information about through-space distances between atoms&lt;br /&gt;
and also bonded distances, but the sheer size of the data set means it can be fit lots of ways (fitting is done iteratively starting from a guess stucture). The result is a set of possible structures, in this case there are 26 different possible stuctures - im not sure which one the wiki starts with!&lt;br /&gt;
&lt;br /&gt;
The file is slightly different to Mikes example... overall -4 charge due to different protonation state (pH7) but charges are the same except for around the chromophore. I included the chromophore + connecting residue when fitting charges since the chromophore is bound to a residue in the primary sequence via a sulphur in is sidechain - this will change the charge distribution of the residue so should be refitted with the chromophore attached. The file you provided uses standard amber charges for this residue and some estimated charges from ESP calculations (+ some fudging to get an integer charge!) for the chromophore.&lt;br /&gt;
&lt;br /&gt;
==The Plan==&lt;br /&gt;
&lt;br /&gt;
* Find and Characterise a Ground State minimum&lt;br /&gt;
* Vertical excitation to the S1 PES - frequency analysis of imaginary modes&lt;br /&gt;
* IRC along S1 PES to S1 minimum.&lt;br /&gt;
&lt;br /&gt;
==Ground State calculations==&lt;br /&gt;
&lt;br /&gt;
A ground state minimum was found using ONIOM B3LYP 6-31G* : Amber with Electronic Embedding and characterised with a frequency calculation.&lt;br /&gt;
&lt;br /&gt;
* Ground state ONIOM optimisation and frequency analysis &#039;&#039;&#039;INPUT&#039;&#039;&#039; - [[Media:3PHY_ONIOM_Optimisation_Frequency_EE.gjf]]&lt;br /&gt;
&lt;br /&gt;
==S1 Frequency and IRC calculations==&lt;br /&gt;
&lt;br /&gt;
The aim of this part of the project is to follow an excited state reaction path using an IRC having simulated a vertical excitation from S0 → S1 (geometry is the same). To do this we cannot use a ground state theory like DFT cf. the density is the ground state density. Methods that can do excited states are TD-DFT, CIS and CASSCF (there are many - we are interested in these). So we will be using the ONIOM method with an excited state-capable theory on the model high level region and looking at the effects of Electronic embedding on the IRC of the S1 surface.&lt;br /&gt;
&lt;br /&gt;
The IRC keyword requests that a reaction path be followed by integrating the intrinsic reaction coordinate, ONIOM calculations use the first-order Euler integration for the predictor step along with the HPC corrector step. It is a practical choice for calculations on large molecules. &lt;br /&gt;
&lt;br /&gt;
It is assumed that the IRC will be started from a transition state structure (1st order saddle point) and thus have only 1 imaginary frequency. The IRC can then follow this transition vector either in the forward or backward direction. Vertical excitation from a S0 minimum will be highly unlikely to lead to a 1st order saddle point on the S1 PES and will almost certainly result in being on the &amp;quot;side of a hill&amp;quot; on the S1 PES - this leads to complications with the following IRC calculation.&lt;br /&gt;
&lt;br /&gt;
There will be multiple imaginary frequencies at this non-stationary point on the S1 surface, the IRC will follow the first one it finds which is not necessarily the vector we want to follow! We need to explicitly define a specific vector for the IRC to follow which represents the reaction path we are investigating. The best way to do this is to run a frequency calculation at this point and visualize the imaginary frequencies to identify the imaginary frequency which represents the reaction, the frequency will be accompanied by a vector which can be read into the IRC calculation.&lt;br /&gt;
&lt;br /&gt;
The standard frequency route will be meaningless on the S1 surface since the calculation assumes a harmonic function (untrue on the side of a hill). This is overcome by using &#039;&#039;freq=project&#039;&#039; in the route section. This invokes computation of the projected frequencies for vibrations perpendicular to the IRC path.&lt;br /&gt;
&lt;br /&gt;
To read the vector into the IRC calculation use &#039;&#039;IRC=ReadVector&#039;&#039; in the route section with the vector listed after the molecule specification in rows containing 8 values per row with 10 characters (inc spaces) making up each value.&lt;br /&gt;
&lt;br /&gt;
IRC calculations also require initial force constants to proceed. You must provide these to the calculation in some way, options defined in the route section:&lt;br /&gt;
&lt;br /&gt;
* &#039;&#039;IRC=RCFC&#039;&#039; reads the force constants from the checkpoint file from the preceding frequency calculation (used to identify the required frequency corresponding to the reaction coordinate)&lt;br /&gt;
* &#039;&#039;IRC=CalcFC&#039;&#039; recalculates the force constants at the beginning of the IRC calculation&lt;br /&gt;
&lt;br /&gt;
=== S1 Frequency Calculations===&lt;br /&gt;
&lt;br /&gt;
Using TD-DFT with the route section stating:&lt;br /&gt;
&lt;br /&gt;
* Electronic Embedding - &#039;&#039;#p freq=projected oniom(b3lyp/6-31g(d) TD=(Root=1):amber=softfirst)=embed geom=connectivity&#039;&#039;&lt;br /&gt;
* Mechanical Embedding - &#039;&#039;#p freq=projected oniom(b3lyp/6-31g(d) TD=(Root=1):amber=softfirst) geom=connectivity&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
These calculations require minimum 1.45Gbytes memory to run - they are also extremely time consuming, taking xxx to run.&lt;/div&gt;</summary>
		<author><name>Af706</name></author>
	</entry>
	<entry>
		<id>https://chemwiki.ch.ic.ac.uk/index.php?title=AndyForesterPYP&amp;diff=105951</id>
		<title>AndyForesterPYP</title>
		<link rel="alternate" type="text/html" href="https://chemwiki.ch.ic.ac.uk/index.php?title=AndyForesterPYP&amp;diff=105951"/>
		<updated>2010-03-16T13:24:58Z</updated>

		<summary type="html">&lt;p&gt;Af706: /* S1 Frequency */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;=3PHY=&lt;br /&gt;
&lt;br /&gt;
Back to Main - https://www.ch.ic.ac.uk/wiki/index.php/AndyForester&lt;br /&gt;
&lt;br /&gt;
==First thoughts==&lt;br /&gt;
&lt;br /&gt;
The 3PHY NMR structure was used as an initial structure and processed like 1W7S. NMR experiments like this lead to a huge amount of information about through-space distances between atoms&lt;br /&gt;
and also bonded distances, but the sheer size of the data set means it can be fit lots of ways (fitting is done iteratively starting from a guess stucture). The result is a set of possible structures, in this case there are 26 different possible stuctures - im not sure which one the wiki starts with!&lt;br /&gt;
&lt;br /&gt;
The file is slightly different to Mikes example... overall -4 charge due to different protonation state (pH7) but charges are the same except for around the chromophore. I included the chromophore + connecting residue when fitting charges since the chromophore is bound to a residue in the primary sequence via a sulphur in is sidechain - this will change the charge distribution of the residue so should be refitted with the chromophore attached. The file you provided uses standard amber charges for this residue and some estimated charges from ESP calculations (+ some fudging to get an integer charge!) for the chromophore.&lt;br /&gt;
&lt;br /&gt;
==The Plan==&lt;br /&gt;
&lt;br /&gt;
* Find and Characterise a Ground State minimum&lt;br /&gt;
* Vertical excitation to the S1 PES - frequency analysis of imaginary modes&lt;br /&gt;
* IRC along S1 PES to S1 minimum.&lt;br /&gt;
&lt;br /&gt;
==Ground State calculations==&lt;br /&gt;
&lt;br /&gt;
A ground state minimum was found using ONIOM B3LYP 6-31G* : Amber with Electronic Embedding and characterised with a frequency calculation.&lt;br /&gt;
&lt;br /&gt;
* Ground state ONIOM optimisation and frequency analysis &#039;&#039;&#039;INPUT&#039;&#039;&#039; - [[Media:3PHY_ONIOM_Optimisation_Frequency_EE.gjf]]&lt;br /&gt;
&lt;br /&gt;
==S1 Frequency and IRC calculations==&lt;br /&gt;
&lt;br /&gt;
The aim of this part of the project is to follow an excited state reaction path using an IRC having simulated a vertical excitation from S0 → S1 (geometry is the same). To do this we cannot use a ground state theory like DFT cf. the density is the ground state density. Methods that can do excited states are TD-DFT, CIS and CASSCF (there are many - we are interested in these). So we will be using the ONIOM method with an excited state-capable theory on the model high level region and looking at the effects of Electronic embedding on the IRC of the S1 surface.&lt;br /&gt;
&lt;br /&gt;
The IRC keyword requests that a reaction path be followed by integrating the intrinsic reaction coordinate, ONIOM calculations use the first-order Euler integration for the predictor step along with the HPC corrector step. It is a practical choice for calculations on large molecules. &lt;br /&gt;
&lt;br /&gt;
It is assumed that the IRC will be started from a transition state structure (1st order saddle point) and thus have only 1 imaginary frequency. The IRC can then follow this transition vector either in the forward or backward direction. Vertical excitation from a S0 minimum will be highly unlikely to lead to a 1st order saddle point on the S1 PES and will almost certainly result in being on the &amp;quot;side of a hill&amp;quot; on the S1 PES - this leads to complications with the following IRC calculation.&lt;br /&gt;
&lt;br /&gt;
There will be multiple imaginary frequencies at this non-stationary point on the S1 surface, the IRC will follow the first one it finds which is not necessarily the vector we want to follow! We need to explicitly define a specific vector for the IRC to follow which represents the reaction path we are investigating. The best way to do this is to run a frequency calculation at this point and visualize the imaginary frequencies to identify the imaginary frequency which represents the reaction, the frequency will be accompanied by a vector which can be read into the IRC calculation.&lt;br /&gt;
&lt;br /&gt;
The standard frequency route will be meaningless on the S1 surface since the calculation assumes a harmonic function (untrue on the side of a hill). This is overcome by using &#039;&#039;freq=project&#039;&#039; in the route section. This invokes computation of the projected frequencies for vibrations perpendicular to the IRC path.&lt;br /&gt;
&lt;br /&gt;
To read the vector into the IRC calculation use &#039;&#039;IRC=ReadVector&#039;&#039; in the route section with the vector listed after the molecule specification in rows containing 8 values per row with 10 characters (inc spaces) making up each value.&lt;br /&gt;
&lt;br /&gt;
IRC calculations also require initial force constants to proceed. You must provide these to the calculation in some way, options defined in the route section:&lt;br /&gt;
&lt;br /&gt;
* &#039;&#039;IRC=RCFC&#039;&#039; reads the force constants from the checkpoint file from the preceding frequency calculation (used to identify the required frequency corresponding to the reaction coordinate)&lt;br /&gt;
* &#039;&#039;IRC=CalcFC&#039;&#039; recalculates the force constants at the beginning of the IRC calculation&lt;br /&gt;
&lt;br /&gt;
=== S1 Frequency Calculations===&lt;br /&gt;
&lt;br /&gt;
Using TD-DFT with the route section stating:&lt;br /&gt;
&lt;br /&gt;
* Electronic Embedding - &#039;&#039;#p freq=projected oniom(b3lyp/6-31g(d) TD=(Root=1):amber=softfirst)=embed geom=connectivity&#039;&#039;&lt;br /&gt;
* Mechanical Embedding - &#039;&#039;#p freq=projected oniom(b3lyp/6-31g(d) TD=(Root=1):amber=softfirst) geom=connectivity&#039;&#039;&lt;/div&gt;</summary>
		<author><name>Af706</name></author>
	</entry>
	<entry>
		<id>https://chemwiki.ch.ic.ac.uk/index.php?title=AndyForesterPYP&amp;diff=105947</id>
		<title>AndyForesterPYP</title>
		<link rel="alternate" type="text/html" href="https://chemwiki.ch.ic.ac.uk/index.php?title=AndyForesterPYP&amp;diff=105947"/>
		<updated>2010-03-16T13:24:31Z</updated>

		<summary type="html">&lt;p&gt;Af706: /* S1 Frequency */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;=3PHY=&lt;br /&gt;
&lt;br /&gt;
Back to Main - https://www.ch.ic.ac.uk/wiki/index.php/AndyForester&lt;br /&gt;
&lt;br /&gt;
==First thoughts==&lt;br /&gt;
&lt;br /&gt;
The 3PHY NMR structure was used as an initial structure and processed like 1W7S. NMR experiments like this lead to a huge amount of information about through-space distances between atoms&lt;br /&gt;
and also bonded distances, but the sheer size of the data set means it can be fit lots of ways (fitting is done iteratively starting from a guess stucture). The result is a set of possible structures, in this case there are 26 different possible stuctures - im not sure which one the wiki starts with!&lt;br /&gt;
&lt;br /&gt;
The file is slightly different to Mikes example... overall -4 charge due to different protonation state (pH7) but charges are the same except for around the chromophore. I included the chromophore + connecting residue when fitting charges since the chromophore is bound to a residue in the primary sequence via a sulphur in is sidechain - this will change the charge distribution of the residue so should be refitted with the chromophore attached. The file you provided uses standard amber charges for this residue and some estimated charges from ESP calculations (+ some fudging to get an integer charge!) for the chromophore.&lt;br /&gt;
&lt;br /&gt;
==The Plan==&lt;br /&gt;
&lt;br /&gt;
* Find and Characterise a Ground State minimum&lt;br /&gt;
* Vertical excitation to the S1 PES - frequency analysis of imaginary modes&lt;br /&gt;
* IRC along S1 PES to S1 minimum.&lt;br /&gt;
&lt;br /&gt;
==Ground State calculations==&lt;br /&gt;
&lt;br /&gt;
A ground state minimum was found using ONIOM B3LYP 6-31G* : Amber with Electronic Embedding and characterised with a frequency calculation.&lt;br /&gt;
&lt;br /&gt;
* Ground state ONIOM optimisation and frequency analysis &#039;&#039;&#039;INPUT&#039;&#039;&#039; - [[Media:3PHY_ONIOM_Optimisation_Frequency_EE.gjf]]&lt;br /&gt;
&lt;br /&gt;
==S1 Frequency and IRC calculations==&lt;br /&gt;
&lt;br /&gt;
The aim of this part of the project is to follow an excited state reaction path using an IRC having simulated a vertical excitation from S0 → S1 (geometry is the same). To do this we cannot use a ground state theory like DFT cf. the density is the ground state density. Methods that can do excited states are TD-DFT, CIS and CASSCF (there are many - we are interested in these). So we will be using the ONIOM method with an excited state-capable theory on the model high level region and looking at the effects of Electronic embedding on the IRC of the S1 surface.&lt;br /&gt;
&lt;br /&gt;
The IRC keyword requests that a reaction path be followed by integrating the intrinsic reaction coordinate, ONIOM calculations use the first-order Euler integration for the predictor step along with the HPC corrector step. It is a practical choice for calculations on large molecules. &lt;br /&gt;
&lt;br /&gt;
It is assumed that the IRC will be started from a transition state structure (1st order saddle point) and thus have only 1 imaginary frequency. The IRC can then follow this transition vector either in the forward or backward direction. Vertical excitation from a S0 minimum will be highly unlikely to lead to a 1st order saddle point on the S1 PES and will almost certainly result in being on the &amp;quot;side of a hill&amp;quot; on the S1 PES - this leads to complications with the following IRC calculation.&lt;br /&gt;
&lt;br /&gt;
There will be multiple imaginary frequencies at this non-stationary point on the S1 surface, the IRC will follow the first one it finds which is not necessarily the vector we want to follow! We need to explicitly define a specific vector for the IRC to follow which represents the reaction path we are investigating. The best way to do this is to run a frequency calculation at this point and visualize the imaginary frequencies to identify the imaginary frequency which represents the reaction, the frequency will be accompanied by a vector which can be read into the IRC calculation.&lt;br /&gt;
&lt;br /&gt;
The standard frequency route will be meaningless on the S1 surface since the calculation assumes a harmonic function (untrue on the side of a hill). This is overcome by using &#039;&#039;freq=project&#039;&#039; in the route section. This invokes computation of the projected frequencies for vibrations perpendicular to the IRC path.&lt;br /&gt;
&lt;br /&gt;
To read the vector into the IRC calculation use &#039;&#039;IRC=ReadVector&#039;&#039; in the route section with the vector listed after the molecule specification in rows containing 8 values per row with 10 characters (inc spaces) making up each value.&lt;br /&gt;
&lt;br /&gt;
IRC calculations also require initial force constants to proceed. You must provide these to the calculation in some way, options defined in the route section:&lt;br /&gt;
&lt;br /&gt;
* &#039;&#039;IRC=RCFC&#039;&#039; reads the force constants from the checkpoint file from the preceding frequency calculation (used to identify the required frequency corresponding to the reaction coordinate)&lt;br /&gt;
* &#039;&#039;IRC=CalcFC&#039;&#039; recalculates the force constants at the beginning of the IRC calculation&lt;br /&gt;
&lt;br /&gt;
=== S1 Frequency===&lt;br /&gt;
&lt;br /&gt;
Using TD-DFT with the route section stating:&lt;br /&gt;
&lt;br /&gt;
* Electronic Embedding - &#039;&#039;#p freq=projected oniom(b3lyp/6-31g(d) TD=(Root=1):amber=softfirst)=embed geom=connectivity&#039;&#039;&lt;br /&gt;
* Mechanical Embedding - &#039;&#039;#p freq=projected oniom(b3lyp/6-31g(d) TD=(Root=1):amber=softfirst) geom=connectivity&#039;&#039;&lt;/div&gt;</summary>
		<author><name>Af706</name></author>
	</entry>
	<entry>
		<id>https://chemwiki.ch.ic.ac.uk/index.php?title=AndyForesterPYP&amp;diff=105943</id>
		<title>AndyForesterPYP</title>
		<link rel="alternate" type="text/html" href="https://chemwiki.ch.ic.ac.uk/index.php?title=AndyForesterPYP&amp;diff=105943"/>
		<updated>2010-03-16T13:24:16Z</updated>

		<summary type="html">&lt;p&gt;Af706: /* S1 Frequency and IRC calculations */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;=3PHY=&lt;br /&gt;
&lt;br /&gt;
Back to Main - https://www.ch.ic.ac.uk/wiki/index.php/AndyForester&lt;br /&gt;
&lt;br /&gt;
==First thoughts==&lt;br /&gt;
&lt;br /&gt;
The 3PHY NMR structure was used as an initial structure and processed like 1W7S. NMR experiments like this lead to a huge amount of information about through-space distances between atoms&lt;br /&gt;
and also bonded distances, but the sheer size of the data set means it can be fit lots of ways (fitting is done iteratively starting from a guess stucture). The result is a set of possible structures, in this case there are 26 different possible stuctures - im not sure which one the wiki starts with!&lt;br /&gt;
&lt;br /&gt;
The file is slightly different to Mikes example... overall -4 charge due to different protonation state (pH7) but charges are the same except for around the chromophore. I included the chromophore + connecting residue when fitting charges since the chromophore is bound to a residue in the primary sequence via a sulphur in is sidechain - this will change the charge distribution of the residue so should be refitted with the chromophore attached. The file you provided uses standard amber charges for this residue and some estimated charges from ESP calculations (+ some fudging to get an integer charge!) for the chromophore.&lt;br /&gt;
&lt;br /&gt;
==The Plan==&lt;br /&gt;
&lt;br /&gt;
* Find and Characterise a Ground State minimum&lt;br /&gt;
* Vertical excitation to the S1 PES - frequency analysis of imaginary modes&lt;br /&gt;
* IRC along S1 PES to S1 minimum.&lt;br /&gt;
&lt;br /&gt;
==Ground State calculations==&lt;br /&gt;
&lt;br /&gt;
A ground state minimum was found using ONIOM B3LYP 6-31G* : Amber with Electronic Embedding and characterised with a frequency calculation.&lt;br /&gt;
&lt;br /&gt;
* Ground state ONIOM optimisation and frequency analysis &#039;&#039;&#039;INPUT&#039;&#039;&#039; - [[Media:3PHY_ONIOM_Optimisation_Frequency_EE.gjf]]&lt;br /&gt;
&lt;br /&gt;
==S1 Frequency and IRC calculations==&lt;br /&gt;
&lt;br /&gt;
The aim of this part of the project is to follow an excited state reaction path using an IRC having simulated a vertical excitation from S0 → S1 (geometry is the same). To do this we cannot use a ground state theory like DFT cf. the density is the ground state density. Methods that can do excited states are TD-DFT, CIS and CASSCF (there are many - we are interested in these). So we will be using the ONIOM method with an excited state-capable theory on the model high level region and looking at the effects of Electronic embedding on the IRC of the S1 surface.&lt;br /&gt;
&lt;br /&gt;
The IRC keyword requests that a reaction path be followed by integrating the intrinsic reaction coordinate, ONIOM calculations use the first-order Euler integration for the predictor step along with the HPC corrector step. It is a practical choice for calculations on large molecules. &lt;br /&gt;
&lt;br /&gt;
It is assumed that the IRC will be started from a transition state structure (1st order saddle point) and thus have only 1 imaginary frequency. The IRC can then follow this transition vector either in the forward or backward direction. Vertical excitation from a S0 minimum will be highly unlikely to lead to a 1st order saddle point on the S1 PES and will almost certainly result in being on the &amp;quot;side of a hill&amp;quot; on the S1 PES - this leads to complications with the following IRC calculation.&lt;br /&gt;
&lt;br /&gt;
There will be multiple imaginary frequencies at this non-stationary point on the S1 surface, the IRC will follow the first one it finds which is not necessarily the vector we want to follow! We need to explicitly define a specific vector for the IRC to follow which represents the reaction path we are investigating. The best way to do this is to run a frequency calculation at this point and visualize the imaginary frequencies to identify the imaginary frequency which represents the reaction, the frequency will be accompanied by a vector which can be read into the IRC calculation.&lt;br /&gt;
&lt;br /&gt;
The standard frequency route will be meaningless on the S1 surface since the calculation assumes a harmonic function (untrue on the side of a hill). This is overcome by using &#039;&#039;freq=project&#039;&#039; in the route section. This invokes computation of the projected frequencies for vibrations perpendicular to the IRC path.&lt;br /&gt;
&lt;br /&gt;
To read the vector into the IRC calculation use &#039;&#039;IRC=ReadVector&#039;&#039; in the route section with the vector listed after the molecule specification in rows containing 8 values per row with 10 characters (inc spaces) making up each value.&lt;br /&gt;
&lt;br /&gt;
IRC calculations also require initial force constants to proceed. You must provide these to the calculation in some way, options defined in the route section:&lt;br /&gt;
&lt;br /&gt;
* &#039;&#039;IRC=RCFC&#039;&#039; reads the force constants from the checkpoint file from the preceding frequency calculation (used to identify the required frequency corresponding to the reaction coordinate)&lt;br /&gt;
* &#039;&#039;IRC=CalcFC&#039;&#039; recalculates the force constants at the beginning of the IRC calculation&lt;br /&gt;
&lt;br /&gt;
=== S1 Frequency===&lt;br /&gt;
&lt;br /&gt;
Using TD-DFT with the route section stating:&lt;br /&gt;
&lt;br /&gt;
* Electronic Embedding - &#039;&#039;#p freq=projected oniom(b3lyp/6-31g(d) TD=(Root=1):amber=softfirst)=embed geom=connectivity&#039;&#039;&lt;br /&gt;
* Mechanical Embedding - &#039;&#039;#p freq=projected oniom(b3lyp/6-31g(d) TD=(Root=1):amber=softfirst)=embed geom=connectivity&#039;&#039;&lt;/div&gt;</summary>
		<author><name>Af706</name></author>
	</entry>
	<entry>
		<id>https://chemwiki.ch.ic.ac.uk/index.php?title=AndyForesterPYP&amp;diff=105940</id>
		<title>AndyForesterPYP</title>
		<link rel="alternate" type="text/html" href="https://chemwiki.ch.ic.ac.uk/index.php?title=AndyForesterPYP&amp;diff=105940"/>
		<updated>2010-03-16T13:22:22Z</updated>

		<summary type="html">&lt;p&gt;Af706: /* Ground State calculations */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;=3PHY=&lt;br /&gt;
&lt;br /&gt;
Back to Main - https://www.ch.ic.ac.uk/wiki/index.php/AndyForester&lt;br /&gt;
&lt;br /&gt;
==First thoughts==&lt;br /&gt;
&lt;br /&gt;
The 3PHY NMR structure was used as an initial structure and processed like 1W7S. NMR experiments like this lead to a huge amount of information about through-space distances between atoms&lt;br /&gt;
and also bonded distances, but the sheer size of the data set means it can be fit lots of ways (fitting is done iteratively starting from a guess stucture). The result is a set of possible structures, in this case there are 26 different possible stuctures - im not sure which one the wiki starts with!&lt;br /&gt;
&lt;br /&gt;
The file is slightly different to Mikes example... overall -4 charge due to different protonation state (pH7) but charges are the same except for around the chromophore. I included the chromophore + connecting residue when fitting charges since the chromophore is bound to a residue in the primary sequence via a sulphur in is sidechain - this will change the charge distribution of the residue so should be refitted with the chromophore attached. The file you provided uses standard amber charges for this residue and some estimated charges from ESP calculations (+ some fudging to get an integer charge!) for the chromophore.&lt;br /&gt;
&lt;br /&gt;
==The Plan==&lt;br /&gt;
&lt;br /&gt;
* Find and Characterise a Ground State minimum&lt;br /&gt;
* Vertical excitation to the S1 PES - frequency analysis of imaginary modes&lt;br /&gt;
* IRC along S1 PES to S1 minimum.&lt;br /&gt;
&lt;br /&gt;
==Ground State calculations==&lt;br /&gt;
&lt;br /&gt;
A ground state minimum was found using ONIOM B3LYP 6-31G* : Amber with Electronic Embedding and characterised with a frequency calculation.&lt;br /&gt;
&lt;br /&gt;
* Ground state ONIOM optimisation and frequency analysis &#039;&#039;&#039;INPUT&#039;&#039;&#039; - [[Media:3PHY_ONIOM_Optimisation_Frequency_EE.gjf]]&lt;br /&gt;
&lt;br /&gt;
==S1 Frequency and IRC calculations==&lt;br /&gt;
&lt;br /&gt;
The aim of this part of the project is to follow an excited state reaction path using an IRC having simulated a vertical excitation from S0 → S1 (geometry is the same). To do this we cannot use a ground state theory like DFT cf. the density is the ground state density. Methods that can do excited states are TD-DFT, CIS and CASSCF (there are many - we are interested in these). So we will be using the ONIOM method with an excited state-capable theory on the model high level region and looking at the effects of Electronic embedding on the IRC of the S1 surface.&lt;br /&gt;
&lt;br /&gt;
The IRC keyword requests that a reaction path be followed by integrating the intrinsic reaction coordinate, ONIOM calculations use the first-order Euler integration for the predictor step along with the HPC corrector step. It is a practical choice for calculations on large molecules. &lt;br /&gt;
&lt;br /&gt;
It is assumed that the IRC will be started from a transition state structure (1st order saddle point) and thus have only 1 imaginary frequency. The IRC can then follow this transition vector either in the forward or backward direction. Vertical excitation from a S0 minimum will be highly unlikely to lead to a 1st order saddle point on the S1 PES and will almost certainly result in being on the &amp;quot;side of a hill&amp;quot; on the S1 PES - this leads to complications with the following IRC calculation.&lt;br /&gt;
&lt;br /&gt;
There will be multiple imaginary frequencies at this non-stationary point on the S1 surface, the IRC will follow the first one it finds which is not necessarily the vector we want to follow! We need to explicitly define a specific vector for the IRC to follow which represents the reaction path we are investigating. The best way to do this is to run a frequency calculation at this point and visualize the imaginary frequencies to identify the imaginary frequency which represents the reaction, the frequency will be accompanied by a vector which can be read into the IRC calculation.&lt;br /&gt;
&lt;br /&gt;
The standard frequency route will be meaningless on the S1 surface since the calculation assumes a harmonic function (untrue on the side of a hill). This is overcome by using &#039;&#039;freq=project&#039;&#039; in the route section. This invokes computation of the projected frequencies for vibrations perpendicular to the IRC path.&lt;br /&gt;
&lt;br /&gt;
To read the vector into the IRC calculation use &#039;&#039;IRC=ReadVector&#039;&#039; in the route section with the vector listed after the molecule specification in rows containing 8 values per row with 10 characters (inc spaces) making up each value.&lt;br /&gt;
&lt;br /&gt;
IRC calculations also require initial force constants to proceed. You must provide these to the calculation in some way, options defined in the route section:&lt;br /&gt;
&lt;br /&gt;
* &#039;&#039;IRC=RCFC&#039;&#039; reads the force constants from the checkpoint file from the preceding frequency calculation (used to identify the required frequency corresponding to the reaction coordinate)&lt;br /&gt;
* &#039;&#039;IRC=CalcFC&#039;&#039; recalculates the force constants at the beginning of the IRC calculation&lt;/div&gt;</summary>
		<author><name>Af706</name></author>
	</entry>
	<entry>
		<id>https://chemwiki.ch.ic.ac.uk/index.php?title=AndyForesterPYP&amp;diff=105932</id>
		<title>AndyForesterPYP</title>
		<link rel="alternate" type="text/html" href="https://chemwiki.ch.ic.ac.uk/index.php?title=AndyForesterPYP&amp;diff=105932"/>
		<updated>2010-03-16T13:17:48Z</updated>

		<summary type="html">&lt;p&gt;Af706: /* S1 Frequency and IRC calculations */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;=3PHY=&lt;br /&gt;
&lt;br /&gt;
Back to Main - https://www.ch.ic.ac.uk/wiki/index.php/AndyForester&lt;br /&gt;
&lt;br /&gt;
==First thoughts==&lt;br /&gt;
&lt;br /&gt;
The 3PHY NMR structure was used as an initial structure and processed like 1W7S. NMR experiments like this lead to a huge amount of information about through-space distances between atoms&lt;br /&gt;
and also bonded distances, but the sheer size of the data set means it can be fit lots of ways (fitting is done iteratively starting from a guess stucture). The result is a set of possible structures, in this case there are 26 different possible stuctures - im not sure which one the wiki starts with!&lt;br /&gt;
&lt;br /&gt;
The file is slightly different to Mikes example... overall -4 charge due to different protonation state (pH7) but charges are the same except for around the chromophore. I included the chromophore + connecting residue when fitting charges since the chromophore is bound to a residue in the primary sequence via a sulphur in is sidechain - this will change the charge distribution of the residue so should be refitted with the chromophore attached. The file you provided uses standard amber charges for this residue and some estimated charges from ESP calculations (+ some fudging to get an integer charge!) for the chromophore.&lt;br /&gt;
&lt;br /&gt;
==The Plan==&lt;br /&gt;
&lt;br /&gt;
* Find and Characterise a Ground State minimum&lt;br /&gt;
* Vertical excitation to the S1 PES - frequency analysis of imaginary modes&lt;br /&gt;
* IRC along S1 PES to S1 minimum.&lt;br /&gt;
&lt;br /&gt;
==Ground State calculations==&lt;br /&gt;
&lt;br /&gt;
A ground state minimum was found using ONIOM B3LYP 6-31G* : Amber with Electronic Embedding and characterised with a frequency calculation.&lt;br /&gt;
&lt;br /&gt;
* Ground state ONIOM optimisation and frequency analysis INPUT - [[Media:3PHY_ONIOM_Optimisation_Frequency_EE.gjf]]&lt;br /&gt;
&lt;br /&gt;
==S1 Frequency and IRC calculations==&lt;br /&gt;
&lt;br /&gt;
The aim of this part of the project is to follow an excited state reaction path using an IRC having simulated a vertical excitation from S0 → S1 (geometry is the same). To do this we cannot use a ground state theory like DFT cf. the density is the ground state density. Methods that can do excited states are TD-DFT, CIS and CASSCF (there are many - we are interested in these). So we will be using the ONIOM method with an excited state-capable theory on the model high level region and looking at the effects of Electronic embedding on the IRC of the S1 surface.&lt;br /&gt;
&lt;br /&gt;
The IRC keyword requests that a reaction path be followed by integrating the intrinsic reaction coordinate, ONIOM calculations use the first-order Euler integration for the predictor step along with the HPC corrector step. It is a practical choice for calculations on large molecules. &lt;br /&gt;
&lt;br /&gt;
It is assumed that the IRC will be started from a transition state structure (1st order saddle point) and thus have only 1 imaginary frequency. The IRC can then follow this transition vector either in the forward or backward direction. Vertical excitation from a S0 minimum will be highly unlikely to lead to a 1st order saddle point on the S1 PES and will almost certainly result in being on the &amp;quot;side of a hill&amp;quot; on the S1 PES - this leads to complications with the following IRC calculation.&lt;br /&gt;
&lt;br /&gt;
There will be multiple imaginary frequencies at this non-stationary point on the S1 surface, the IRC will follow the first one it finds which is not necessarily the vector we want to follow! We need to explicitly define a specific vector for the IRC to follow which represents the reaction path we are investigating. The best way to do this is to run a frequency calculation at this point and visualize the imaginary frequencies to identify the imaginary frequency which represents the reaction, the frequency will be accompanied by a vector which can be read into the IRC calculation.&lt;br /&gt;
&lt;br /&gt;
The standard frequency route will be meaningless on the S1 surface since the calculation assumes a harmonic function (untrue on the side of a hill). This is overcome by using &#039;&#039;freq=project&#039;&#039; in the route section. This invokes computation of the projected frequencies for vibrations perpendicular to the IRC path.&lt;br /&gt;
&lt;br /&gt;
To read the vector into the IRC calculation use &#039;&#039;IRC=ReadVector&#039;&#039; in the route section with the vector listed after the molecule specification in rows containing 8 values per row with 10 characters (inc spaces) making up each value.&lt;br /&gt;
&lt;br /&gt;
IRC calculations also require initial force constants to proceed. You must provide these to the calculation in some way, options defined in the route section:&lt;br /&gt;
&lt;br /&gt;
* &#039;&#039;IRC=RCFC&#039;&#039; reads the force constants from the checkpoint file from the preceding frequency calculation (used to identify the required frequency corresponding to the reaction coordinate)&lt;br /&gt;
* &#039;&#039;IRC=CalcFC&#039;&#039; recalculates the force constants at the beginning of the IRC calculation&lt;/div&gt;</summary>
		<author><name>Af706</name></author>
	</entry>
	<entry>
		<id>https://chemwiki.ch.ic.ac.uk/index.php?title=AndyForesterPYP&amp;diff=105030</id>
		<title>AndyForesterPYP</title>
		<link rel="alternate" type="text/html" href="https://chemwiki.ch.ic.ac.uk/index.php?title=AndyForesterPYP&amp;diff=105030"/>
		<updated>2010-03-15T16:09:48Z</updated>

		<summary type="html">&lt;p&gt;Af706: /* S1 Frequency and IRC calculations */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;=3PHY=&lt;br /&gt;
&lt;br /&gt;
Back to Main - https://www.ch.ic.ac.uk/wiki/index.php/AndyForester&lt;br /&gt;
&lt;br /&gt;
==First thoughts==&lt;br /&gt;
&lt;br /&gt;
The 3PHY NMR structure was used as an initial structure and processed like 1W7S. NMR experiments like this lead to a huge amount of information about through-space distances between atoms&lt;br /&gt;
and also bonded distances, but the sheer size of the data set means it can be fit lots of ways (fitting is done iteratively starting from a guess stucture). The result is a set of possible structures, in this case there are 26 different possible stuctures - im not sure which one the wiki starts with!&lt;br /&gt;
&lt;br /&gt;
The file is slightly different to Mikes example... overall -4 charge due to different protonation state (pH7) but charges are the same except for around the chromophore. I included the chromophore + connecting residue when fitting charges since the chromophore is bound to a residue in the primary sequence via a sulphur in is sidechain - this will change the charge distribution of the residue so should be refitted with the chromophore attached. The file you provided uses standard amber charges for this residue and some estimated charges from ESP calculations (+ some fudging to get an integer charge!) for the chromophore.&lt;br /&gt;
&lt;br /&gt;
==The Plan==&lt;br /&gt;
&lt;br /&gt;
* Find and Characterise a Ground State minimum&lt;br /&gt;
* Vertical excitation to the S1 PES - frequency analysis of imaginary modes&lt;br /&gt;
* IRC along S1 PES to S1 minimum.&lt;br /&gt;
&lt;br /&gt;
==Ground State calculations==&lt;br /&gt;
&lt;br /&gt;
A ground state minimum was found using ONIOM B3LYP 6-31G* : Amber with Electronic Embedding and characterised with a frequency calculation.&lt;br /&gt;
&lt;br /&gt;
* Ground state ONIOM optimisation and frequency analysis INPUT - [[Media:3PHY_ONIOM_Optimisation_Frequency_EE.gjf]]&lt;br /&gt;
&lt;br /&gt;
==S1 Frequency and IRC calculations==&lt;br /&gt;
&lt;br /&gt;
The aim of this part of the project is to follow an excited state reaction path using an IRC having simulated a vertical excitation from S0 → S1 (geometry is the same). To do this we cannot use a ground state theory like DFT cf. the density is the ground state density. Methods that can do excited states are TD-DFT, CIS and CASSCF (there are many - we are interested in these). So we will be using the ONIOM method with an excited state-capable theory on the model high level region and looking at the effects of Electronic embedding on the IRC of the S1 surface.&lt;br /&gt;
&lt;br /&gt;
The IRC keyword requests that a reaction path be followed by integrating the intrinsic reaction coordinate, ONIOM calculations use the first-order Euler integration for the predictor step along with the HPC corrector step. It is a practical choice for calculations on large molecules. &lt;br /&gt;
&lt;br /&gt;
It is assumed that the IRC will be started from a transition state structure (1st order saddle point) and thus have only 1 imaginary frequency. The IRC can then follow this transition vector either in the forward or backward direction. Vertical excitation from a S0 minimum will be highly unlikely to lead to a 1st order saddle point on the S1 PES and will almost certainly result in being on the &amp;quot;side of a hill&amp;quot; on the S1 PES - this leads to complications with the following IRC calculation.&lt;br /&gt;
&lt;br /&gt;
There will be multiple imaginary frequencies at this non-stationary point on the S1 surface, the IRC will follow the first one it finds which is not necessarily the vector we want to follow! We need to explicitly define a specific vector for the IRC to follow which represents the reaction path we are investigating. The best way to do this is to run a frequency calculation at this point and visualize the imaginary frequencies to identify the imaginary frequency which represents the reaction, the frequency will be accompanied by a vector which can be read into the IRC calculation.&lt;br /&gt;
&lt;br /&gt;
The standard frequency route will be meaningless on the S1 surface since the calculation assumes a harmonic function (untrue on the side of a hill). This is overcome by using &#039;&#039;freq=project&#039;&#039; in the route section. &lt;br /&gt;
&lt;br /&gt;
To read the vector into the IRC calculation use &#039;&#039;IRC=ReadVector&#039;&#039; in the route section with the vector listed after the molecule specification in rows containing 8 values per row with 10 characters (inc spaces) making up each value.&lt;br /&gt;
&lt;br /&gt;
IRC calculations also require initial force constants to proceed. You must provide these to the calculation in some way, options defined in the route section:&lt;br /&gt;
&lt;br /&gt;
* &#039;&#039;IRC=RCFC&#039;&#039; reads the force constants from the checkpoint file from the preceding frequency calculation (used to identify the required frequency corresponding to the reaction coordinate)&lt;br /&gt;
* &#039;&#039;IRC=CalcFC&#039;&#039; recalculates the force constants at the beginning of the IRC calculation&lt;/div&gt;</summary>
		<author><name>Af706</name></author>
	</entry>
	<entry>
		<id>https://chemwiki.ch.ic.ac.uk/index.php?title=AndyForesterPYP&amp;diff=103737</id>
		<title>AndyForesterPYP</title>
		<link rel="alternate" type="text/html" href="https://chemwiki.ch.ic.ac.uk/index.php?title=AndyForesterPYP&amp;diff=103737"/>
		<updated>2010-03-12T14:00:57Z</updated>

		<summary type="html">&lt;p&gt;Af706: /* S1 Frequency calculation */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;=3PHY=&lt;br /&gt;
&lt;br /&gt;
Back to Main - https://www.ch.ic.ac.uk/wiki/index.php/AndyForester&lt;br /&gt;
&lt;br /&gt;
==First thoughts==&lt;br /&gt;
&lt;br /&gt;
The 3PHY NMR structure was used as an initial structure and processed like 1W7S. NMR experiments like this lead to a huge amount of information about through-space distances between atoms&lt;br /&gt;
and also bonded distances, but the sheer size of the data set means it can be fit lots of ways (fitting is done iteratively starting from a guess stucture). The result is a set of possible structures, in this case there are 26 different possible stuctures - im not sure which one the wiki starts with!&lt;br /&gt;
&lt;br /&gt;
The file is slightly different to Mikes example... overall -4 charge due to different protonation state (pH7) but charges are the same except for around the chromophore. I included the chromophore + connecting residue when fitting charges since the chromophore is bound to a residue in the primary sequence via a sulphur in is sidechain - this will change the charge distribution of the residue so should be refitted with the chromophore attached. The file you provided uses standard amber charges for this residue and some estimated charges from ESP calculations (+ some fudging to get an integer charge!) for the chromophore.&lt;br /&gt;
&lt;br /&gt;
==The Plan==&lt;br /&gt;
&lt;br /&gt;
* Find and Characterise a Ground State minimum&lt;br /&gt;
* Vertical excitation to the S1 PES - frequency analysis of imaginary modes&lt;br /&gt;
* IRC along S1 PES to S1 minimum.&lt;br /&gt;
&lt;br /&gt;
==Ground State calculations==&lt;br /&gt;
&lt;br /&gt;
A ground state minimum was found using ONIOM B3LYP 6-31G* : Amber with Electronic Embedding and characterised with a frequency calculation.&lt;br /&gt;
&lt;br /&gt;
* Ground state ONIOM optimisation and frequency analysis INPUT - [[Media:3PHY_ONIOM_Optimisation_Frequency_EE.gjf]]&lt;br /&gt;
&lt;br /&gt;
==S1 Frequency and IRC calculations==&lt;br /&gt;
&lt;br /&gt;
The aim of this part of the project is to follow an excited state reaction path using an IRC having simulated a vertical excitation from S0 → S1 (geometry is the same). To do this we cannot use a ground state theory like DFT cf. the density is the ground state density. Methods that can do excited states are TD-DFT, CIS and CASSCF (there are many - we are interested in these). So we will be using the ONIOM method with an excited state-capable theory on the model high level region and looking at the effects of Electronic embedding on the IRC of the S1 surface.&lt;br /&gt;
&lt;br /&gt;
The IRC keyword requests that a reaction path be followed by integrating the intrinsic reaction coordinate, ONIOM calculations use the first-order Euler integration for the predictor step along with the HPC corrector step. It is a practical choice for calculations on large molecules. &lt;br /&gt;
&lt;br /&gt;
It is assumed that the IRC will be started from a transition state structure (1st order saddle point) and thus have only 1 imaginary frequency. The IRC can then follow this transition vector either in the forward or backward direction. Vertical excitation from a S0 minimum will be highly unlikely to lead to a 1st order saddle point on the S1 PES and will almost certainly result in being on the &amp;quot;side of a hill&amp;quot; on the S1 PES - this leads to complications with the following IRC calculation.&lt;br /&gt;
&lt;br /&gt;
There will be multiple imaginary frequencies at this non-stationary point on the S1 surface, the IRC will follow the first one it finds which is not necessarily the vector we want to follow! We need to explicitly define a specific vector for the IRC to follow which represents the reaction path we are investigating. The best way to do this is to run a frequency calculation at this point and visualize the imaginary frequencies to identify the imaginary frequency which represents the reaction, the frequency will be accompanied by a vector which can be read into the IRC calculation.&lt;br /&gt;
&lt;br /&gt;
To read the vector into the IRC calculation use &#039;&#039;IRC=ReadVector&#039;&#039; in the route section with the vector listed after the molecule specification in rows containing 8 values per row with 10 characters (inc spaces) making up each value.&lt;br /&gt;
&lt;br /&gt;
IRC calculations also require initial force constants to proceed. You must provide these to the calculation in some way, options defined in the route section:&lt;br /&gt;
&lt;br /&gt;
* &#039;&#039;IRC=RCFC&#039;&#039; reads the force constants from the checkpoint file from the preceding frequency calculation (used to identify the required frequency corresponding to the reaction coordinate)&lt;br /&gt;
* &#039;&#039;IRC=CalcFC&#039;&#039; recalculates the force constants at the beginning of the IRC calculation&lt;/div&gt;</summary>
		<author><name>Af706</name></author>
	</entry>
	<entry>
		<id>https://chemwiki.ch.ic.ac.uk/index.php?title=AndyForesterPYP&amp;diff=103735</id>
		<title>AndyForesterPYP</title>
		<link rel="alternate" type="text/html" href="https://chemwiki.ch.ic.ac.uk/index.php?title=AndyForesterPYP&amp;diff=103735"/>
		<updated>2010-03-12T13:55:58Z</updated>

		<summary type="html">&lt;p&gt;Af706: /* S1 Frequency calculation */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;=3PHY=&lt;br /&gt;
&lt;br /&gt;
Back to Main - https://www.ch.ic.ac.uk/wiki/index.php/AndyForester&lt;br /&gt;
&lt;br /&gt;
==First thoughts==&lt;br /&gt;
&lt;br /&gt;
The 3PHY NMR structure was used as an initial structure and processed like 1W7S. NMR experiments like this lead to a huge amount of information about through-space distances between atoms&lt;br /&gt;
and also bonded distances, but the sheer size of the data set means it can be fit lots of ways (fitting is done iteratively starting from a guess stucture). The result is a set of possible structures, in this case there are 26 different possible stuctures - im not sure which one the wiki starts with!&lt;br /&gt;
&lt;br /&gt;
The file is slightly different to Mikes example... overall -4 charge due to different protonation state (pH7) but charges are the same except for around the chromophore. I included the chromophore + connecting residue when fitting charges since the chromophore is bound to a residue in the primary sequence via a sulphur in is sidechain - this will change the charge distribution of the residue so should be refitted with the chromophore attached. The file you provided uses standard amber charges for this residue and some estimated charges from ESP calculations (+ some fudging to get an integer charge!) for the chromophore.&lt;br /&gt;
&lt;br /&gt;
==The Plan==&lt;br /&gt;
&lt;br /&gt;
* Find and Characterise a Ground State minimum&lt;br /&gt;
* Vertical excitation to the S1 PES - frequency analysis of imaginary modes&lt;br /&gt;
* IRC along S1 PES to S1 minimum.&lt;br /&gt;
&lt;br /&gt;
==Ground State calculations==&lt;br /&gt;
&lt;br /&gt;
A ground state minimum was found using ONIOM B3LYP 6-31G* : Amber with Electronic Embedding and characterised with a frequency calculation.&lt;br /&gt;
&lt;br /&gt;
* Ground state ONIOM optimisation and frequency analysis INPUT - [[Media:3PHY_ONIOM_Optimisation_Frequency_EE.gjf]]&lt;br /&gt;
&lt;br /&gt;
==S1 Frequency calculation==&lt;br /&gt;
&lt;br /&gt;
The aim of this part of the project is to follow an excited state reaction path using an IRC having simulated a vertical excitation from S0 → S1 (geometry is the same). To do this we cannot use a ground state theory like DFT cf. the density is the ground state density. Methods that can do excited states are TD-DFT, CIS and CASSCF (there are many - we are interested in these). So we will be using the ONIOM method with an excited state-capable theory on the model high level region and looking at the effects of Electronic embedding on the IRC of the S1 surface.&lt;br /&gt;
&lt;br /&gt;
The IRC keyword requests that a reaction path be followed by integrating the intrinsic reaction coordinate, ONIOM calculations use the first-order Euler integration for the predictor step along with the HPC corrector step. It is a practical choice for calculations on large molecules. &lt;br /&gt;
&lt;br /&gt;
It is assumed that the IRC will be started from a transition state structure (1st order saddle point) and thus have only 1 imaginary frequency. The IRC can then follow this transition vector either in the forward or backward direction. Vertical excitation from a S0 minimum will be highly unlikely to lead to a 1st order saddle point on the S1 PES and will almost certainly result in being on the &amp;quot;side of a hill&amp;quot; on the S1 PES - this leads to complications with the following IRC calculation.&lt;br /&gt;
&lt;br /&gt;
There will be multiple imaginary frequencies at this non-stationary point on the S1 surface, the IRC will follow the first one it finds which is not necessarily the vector we want to follow! We need to explicitly define a specific vector for the IRC to follow which represents the reaction path we are investigating. The best way to do this is to visualize the imaginary frequencies and visually identify the frequency which represents the reaction, the frequency will be accompanied by a vector which can be read into the IRC calculation.&lt;br /&gt;
&lt;br /&gt;
To read the vector into the IRC calculation use &#039;&#039;IRC=ReadVector&#039;&#039; in the route section with the vector listed after the molecule specification in rows containing 8 values per row with 10 characters (inc spaces) making up each value.&lt;br /&gt;
&lt;br /&gt;
IRC calculations also require initial force constants to proceed. You must provide these to the calculation in some way, options defined in the route section:&lt;br /&gt;
&lt;br /&gt;
* &#039;&#039;IRC=RCFC&#039;&#039; reads the force constants from the checkpoint file from the preceding frequency calculation (used to identify the required frequency corresponding to the reaction coordinate)&lt;br /&gt;
* &#039;&#039;IRC=CalcFC&#039;&#039; recalculates the force constants at the beginning of the IRC calculation&lt;/div&gt;</summary>
		<author><name>Af706</name></author>
	</entry>
	<entry>
		<id>https://chemwiki.ch.ic.ac.uk/index.php?title=AndyForesterPYP&amp;diff=103734</id>
		<title>AndyForesterPYP</title>
		<link rel="alternate" type="text/html" href="https://chemwiki.ch.ic.ac.uk/index.php?title=AndyForesterPYP&amp;diff=103734"/>
		<updated>2010-03-12T13:55:48Z</updated>

		<summary type="html">&lt;p&gt;Af706: /* S1 Frequency calculation */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;=3PHY=&lt;br /&gt;
&lt;br /&gt;
Back to Main - https://www.ch.ic.ac.uk/wiki/index.php/AndyForester&lt;br /&gt;
&lt;br /&gt;
==First thoughts==&lt;br /&gt;
&lt;br /&gt;
The 3PHY NMR structure was used as an initial structure and processed like 1W7S. NMR experiments like this lead to a huge amount of information about through-space distances between atoms&lt;br /&gt;
and also bonded distances, but the sheer size of the data set means it can be fit lots of ways (fitting is done iteratively starting from a guess stucture). The result is a set of possible structures, in this case there are 26 different possible stuctures - im not sure which one the wiki starts with!&lt;br /&gt;
&lt;br /&gt;
The file is slightly different to Mikes example... overall -4 charge due to different protonation state (pH7) but charges are the same except for around the chromophore. I included the chromophore + connecting residue when fitting charges since the chromophore is bound to a residue in the primary sequence via a sulphur in is sidechain - this will change the charge distribution of the residue so should be refitted with the chromophore attached. The file you provided uses standard amber charges for this residue and some estimated charges from ESP calculations (+ some fudging to get an integer charge!) for the chromophore.&lt;br /&gt;
&lt;br /&gt;
==The Plan==&lt;br /&gt;
&lt;br /&gt;
* Find and Characterise a Ground State minimum&lt;br /&gt;
* Vertical excitation to the S1 PES - frequency analysis of imaginary modes&lt;br /&gt;
* IRC along S1 PES to S1 minimum.&lt;br /&gt;
&lt;br /&gt;
==Ground State calculations==&lt;br /&gt;
&lt;br /&gt;
A ground state minimum was found using ONIOM B3LYP 6-31G* : Amber with Electronic Embedding and characterised with a frequency calculation.&lt;br /&gt;
&lt;br /&gt;
* Ground state ONIOM optimisation and frequency analysis INPUT - [[Media:3PHY_ONIOM_Optimisation_Frequency_EE.gjf]]&lt;br /&gt;
&lt;br /&gt;
==S1 Frequency calculation==&lt;br /&gt;
&lt;br /&gt;
The aim of this part of the project is to follow an excited state reaction path using an IRC having simulated a vertical excitation from S0 → S1 (geometry is the same). To do this we cannot use a ground state theory like DFT cf. the density is the ground state density. Methods that can do excited states are TD-DFT, CIS and CASSCF (there are many - we are interested in these). So we will be using the ONIOM method with an excited state-capable theory on the model high level region and looking at the effects of Electronic embedding on the IRC of the S1 surface.&lt;br /&gt;
&lt;br /&gt;
The IRC keyword requests that a reaction path be followed by integrating the intrinsic reaction coordinate, ONIOM calculations use the first-order Euler integration for the predictor step along with the HPC corrector step. It is a practical choice for calculations on large molecules. &lt;br /&gt;
&lt;br /&gt;
It is assumed that the IRC will be started from a transition state structure (1st order saddle point) and thus have only 1 imaginary frequency. The IRC can then follow this transition vector either in the forward or backward direction. Vertical excitation from a S0 minimum will be highly unlikely to lead to a 1st order saddle point on the S1 PES and will almost certainly result in being on the &amp;quot;side of a hill&amp;quot; on the S1 PES - this leads to complications with the following IRC calculation.&lt;br /&gt;
&lt;br /&gt;
There will be multiple imaginary frequencies at this non-stationary point on the S1 surface, the IRC will follow the first one it finds which is not necessarily the vector we want to follow! We need to explicitly define a specific vector for the IRC to follow which represents the reaction path we are investigating. The best way to do this is to visualize the imaginary frequencies and visually identify the frequency which represents the reaction, the frequency will be accompanied by a vector which can be read into the IRC calculation.&lt;br /&gt;
&lt;br /&gt;
To read the vector into the IRC calculation use &#039;&#039;IRC=ReadVector&#039;&#039; in the route section with the vector listed after the molecule specification in rows containing 8 values per row with 10 characters (inc spaces) making up each value.&lt;br /&gt;
&lt;br /&gt;
IRC calculations also require initial force constants to proceed. You must provide these to the calculation in some way, options defined in the route section:&lt;br /&gt;
&lt;br /&gt;
* &#039;&#039;IRC=RCFC&#039;&#039; reads the force constants from the checkpoint file from the preceding frequency calculation (used to identify the required frequency corresponding to the reaction coordinate)&lt;br /&gt;
* &#039;&#039;&#039;IRC=CalcFC&#039;&#039; recalculates the force constants at the beginning of the IRC calculation&lt;/div&gt;</summary>
		<author><name>Af706</name></author>
	</entry>
	<entry>
		<id>https://chemwiki.ch.ic.ac.uk/index.php?title=AndyForesterPYP&amp;diff=103733</id>
		<title>AndyForesterPYP</title>
		<link rel="alternate" type="text/html" href="https://chemwiki.ch.ic.ac.uk/index.php?title=AndyForesterPYP&amp;diff=103733"/>
		<updated>2010-03-12T13:55:38Z</updated>

		<summary type="html">&lt;p&gt;Af706: /* S1 Frequency calculation */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;=3PHY=&lt;br /&gt;
&lt;br /&gt;
Back to Main - https://www.ch.ic.ac.uk/wiki/index.php/AndyForester&lt;br /&gt;
&lt;br /&gt;
==First thoughts==&lt;br /&gt;
&lt;br /&gt;
The 3PHY NMR structure was used as an initial structure and processed like 1W7S. NMR experiments like this lead to a huge amount of information about through-space distances between atoms&lt;br /&gt;
and also bonded distances, but the sheer size of the data set means it can be fit lots of ways (fitting is done iteratively starting from a guess stucture). The result is a set of possible structures, in this case there are 26 different possible stuctures - im not sure which one the wiki starts with!&lt;br /&gt;
&lt;br /&gt;
The file is slightly different to Mikes example... overall -4 charge due to different protonation state (pH7) but charges are the same except for around the chromophore. I included the chromophore + connecting residue when fitting charges since the chromophore is bound to a residue in the primary sequence via a sulphur in is sidechain - this will change the charge distribution of the residue so should be refitted with the chromophore attached. The file you provided uses standard amber charges for this residue and some estimated charges from ESP calculations (+ some fudging to get an integer charge!) for the chromophore.&lt;br /&gt;
&lt;br /&gt;
==The Plan==&lt;br /&gt;
&lt;br /&gt;
* Find and Characterise a Ground State minimum&lt;br /&gt;
* Vertical excitation to the S1 PES - frequency analysis of imaginary modes&lt;br /&gt;
* IRC along S1 PES to S1 minimum.&lt;br /&gt;
&lt;br /&gt;
==Ground State calculations==&lt;br /&gt;
&lt;br /&gt;
A ground state minimum was found using ONIOM B3LYP 6-31G* : Amber with Electronic Embedding and characterised with a frequency calculation.&lt;br /&gt;
&lt;br /&gt;
* Ground state ONIOM optimisation and frequency analysis INPUT - [[Media:3PHY_ONIOM_Optimisation_Frequency_EE.gjf]]&lt;br /&gt;
&lt;br /&gt;
==S1 Frequency calculation==&lt;br /&gt;
&lt;br /&gt;
The aim of this part of the project is to follow an excited state reaction path using an IRC having simulated a vertical excitation from S0 → S1 (geometry is the same). To do this we cannot use a ground state theory like DFT cf. the density is the ground state density. Methods that can do excited states are TD-DFT, CIS and CASSCF (there are many - we are interested in these). So we will be using the ONIOM method with an excited state-capable theory on the model high level region and looking at the effects of Electronic embedding on the IRC of the S1 surface.&lt;br /&gt;
&lt;br /&gt;
The IRC keyword requests that a reaction path be followed by integrating the intrinsic reaction coordinate, ONIOM calculations use the first-order Euler integration for the predictor step along with the HPC corrector step. It is a practical choice for calculations on large molecules. &lt;br /&gt;
&lt;br /&gt;
It is assumed that the IRC will be started from a transition state structure (1st order saddle point) and thus have only 1 imaginary frequency. The IRC can then follow this transition vector either in the forward or backward direction. Vertical excitation from a S0 minimum will be highly unlikely to lead to a 1st order saddle point on the S1 PES and will almost certainly result in being on the &amp;quot;side of a hill&amp;quot; on the S1 PES - this leads to complications with the following IRC calculation.&lt;br /&gt;
&lt;br /&gt;
There will be multiple imaginary frequencies at this non-stationary point on the S1 surface, the IRC will follow the first one it finds which is not necessarily the vector we want to follow! We need to explicitly define a specific vector for the IRC to follow which represents the reaction path we are investigating. The best way to do this is to visualize the imaginary frequencies and visually identify the frequency which represents the reaction, the frequency will be accompanied by a vector which can be read into the IRC calculation.&lt;br /&gt;
&lt;br /&gt;
To read the vector into the IRC calculation use &#039;&#039;IRC=ReadVector&#039;&#039; in the route section with the vector listed after the molecule specification in rows containing 8 values per row with 10 characters (inc spaces) making up each value.&lt;br /&gt;
&lt;br /&gt;
IRC calculations also require initial force constants to proceed. You must provide these to the calculation in some way, options defined in the route section:&lt;br /&gt;
&lt;br /&gt;
* &#039;&#039;IRC=RCFC&#039;&#039; reads the force constants from the checkpoint file from the preceding frequency calculation (used to identify the required frequency corresponding to the reaction coordinate)&lt;br /&gt;
* &#039;&#039;&#039;IRC=CalcFC&#039;&#039;&#039; recalculates the force constants at the beginning of the IRC calculation&lt;/div&gt;</summary>
		<author><name>Af706</name></author>
	</entry>
	<entry>
		<id>https://chemwiki.ch.ic.ac.uk/index.php?title=AndyForesterPYP&amp;diff=103732</id>
		<title>AndyForesterPYP</title>
		<link rel="alternate" type="text/html" href="https://chemwiki.ch.ic.ac.uk/index.php?title=AndyForesterPYP&amp;diff=103732"/>
		<updated>2010-03-12T13:55:06Z</updated>

		<summary type="html">&lt;p&gt;Af706: /* S1 Frequency calculation */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;=3PHY=&lt;br /&gt;
&lt;br /&gt;
Back to Main - https://www.ch.ic.ac.uk/wiki/index.php/AndyForester&lt;br /&gt;
&lt;br /&gt;
==First thoughts==&lt;br /&gt;
&lt;br /&gt;
The 3PHY NMR structure was used as an initial structure and processed like 1W7S. NMR experiments like this lead to a huge amount of information about through-space distances between atoms&lt;br /&gt;
and also bonded distances, but the sheer size of the data set means it can be fit lots of ways (fitting is done iteratively starting from a guess stucture). The result is a set of possible structures, in this case there are 26 different possible stuctures - im not sure which one the wiki starts with!&lt;br /&gt;
&lt;br /&gt;
The file is slightly different to Mikes example... overall -4 charge due to different protonation state (pH7) but charges are the same except for around the chromophore. I included the chromophore + connecting residue when fitting charges since the chromophore is bound to a residue in the primary sequence via a sulphur in is sidechain - this will change the charge distribution of the residue so should be refitted with the chromophore attached. The file you provided uses standard amber charges for this residue and some estimated charges from ESP calculations (+ some fudging to get an integer charge!) for the chromophore.&lt;br /&gt;
&lt;br /&gt;
==The Plan==&lt;br /&gt;
&lt;br /&gt;
* Find and Characterise a Ground State minimum&lt;br /&gt;
* Vertical excitation to the S1 PES - frequency analysis of imaginary modes&lt;br /&gt;
* IRC along S1 PES to S1 minimum.&lt;br /&gt;
&lt;br /&gt;
==Ground State calculations==&lt;br /&gt;
&lt;br /&gt;
A ground state minimum was found using ONIOM B3LYP 6-31G* : Amber with Electronic Embedding and characterised with a frequency calculation.&lt;br /&gt;
&lt;br /&gt;
* Ground state ONIOM optimisation and frequency analysis INPUT - [[Media:3PHY_ONIOM_Optimisation_Frequency_EE.gjf]]&lt;br /&gt;
&lt;br /&gt;
==S1 Frequency calculation==&lt;br /&gt;
&lt;br /&gt;
The aim of this part of the project is to follow an excited state reaction path using an IRC having simulated a vertical excitation from S0 → S1 (geometry is the same). To do this we cannot use a ground state theory like DFT cf. the density is the ground state density. Methods that can do excited states are TD-DFT, CIS and CASSCF (there are many - we are interested in these). So we will be using the ONIOM method with an excited state-capable theory on the model high level region and looking at the effects of Electronic embedding on the IRC of the S1 surface.&lt;br /&gt;
&lt;br /&gt;
The IRC keyword requests that a reaction path be followed by integrating the intrinsic reaction coordinate, ONIOM calculations use the first-order Euler integration for the predictor step along with the HPC corrector step. It is a practical choice for calculations on large molecules. &lt;br /&gt;
&lt;br /&gt;
It is assumed that the IRC will be started from a transition state structure (1st order saddle point) and thus have only 1 imaginary frequency. The IRC can then follow this transition vector either in the forward or backward direction. Vertical excitation from a S0 minimum will be highly unlikely to lead to a 1st order saddle point on the S1 PES and will almost certainly result in being on the &amp;quot;side of a hill&amp;quot; on the S1 PES - this leads to complications with the following IRC calculation.&lt;br /&gt;
&lt;br /&gt;
There will be multiple imaginary frequencies at this non-stationary point on the S1 surface, the IRC will follow the first one it finds which is not necessarily the vector we want to follow! We need to explicitly define a specific vector for the IRC to follow which represents the reaction path we are investigating. The best way to do this is to visualize the imaginary frequencies and visually identify the frequency which represents the reaction, the frequency will be accompanied by a vector which can be read into the IRC calculation.&lt;br /&gt;
&lt;br /&gt;
To read the vector into the IRC calculation use &#039;&#039;IRC=ReadVector&#039;&#039; in the route section with the vector listed after the molecule specification in rows containing 8 values per row with 10 characters (inc spaces) making up each value.&lt;br /&gt;
&lt;br /&gt;
IRC calculations require initial force constants to proceed. You must provide these to the calculation in some way, options defined in the route section:&lt;br /&gt;
&lt;br /&gt;
* &#039;&#039;IRC=RCFC&#039;&#039; reads the force constants from the checkpoint file from the preceding frequency calculation (used to identify the required frequency corresponding to the reaction coordinate)&lt;br /&gt;
* &#039;&#039;&#039;IRC=CalcFC&#039;&#039;&#039; recalculates the force constants at the beginning of the IRC calculation&lt;/div&gt;</summary>
		<author><name>Af706</name></author>
	</entry>
	<entry>
		<id>https://chemwiki.ch.ic.ac.uk/index.php?title=AndyForesterPYP&amp;diff=103730</id>
		<title>AndyForesterPYP</title>
		<link rel="alternate" type="text/html" href="https://chemwiki.ch.ic.ac.uk/index.php?title=AndyForesterPYP&amp;diff=103730"/>
		<updated>2010-03-12T13:54:24Z</updated>

		<summary type="html">&lt;p&gt;Af706: /* S1 Frequency calculation */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;=3PHY=&lt;br /&gt;
&lt;br /&gt;
Back to Main - https://www.ch.ic.ac.uk/wiki/index.php/AndyForester&lt;br /&gt;
&lt;br /&gt;
==First thoughts==&lt;br /&gt;
&lt;br /&gt;
The 3PHY NMR structure was used as an initial structure and processed like 1W7S. NMR experiments like this lead to a huge amount of information about through-space distances between atoms&lt;br /&gt;
and also bonded distances, but the sheer size of the data set means it can be fit lots of ways (fitting is done iteratively starting from a guess stucture). The result is a set of possible structures, in this case there are 26 different possible stuctures - im not sure which one the wiki starts with!&lt;br /&gt;
&lt;br /&gt;
The file is slightly different to Mikes example... overall -4 charge due to different protonation state (pH7) but charges are the same except for around the chromophore. I included the chromophore + connecting residue when fitting charges since the chromophore is bound to a residue in the primary sequence via a sulphur in is sidechain - this will change the charge distribution of the residue so should be refitted with the chromophore attached. The file you provided uses standard amber charges for this residue and some estimated charges from ESP calculations (+ some fudging to get an integer charge!) for the chromophore.&lt;br /&gt;
&lt;br /&gt;
==The Plan==&lt;br /&gt;
&lt;br /&gt;
* Find and Characterise a Ground State minimum&lt;br /&gt;
* Vertical excitation to the S1 PES - frequency analysis of imaginary modes&lt;br /&gt;
* IRC along S1 PES to S1 minimum.&lt;br /&gt;
&lt;br /&gt;
==Ground State calculations==&lt;br /&gt;
&lt;br /&gt;
A ground state minimum was found using ONIOM B3LYP 6-31G* : Amber with Electronic Embedding and characterised with a frequency calculation.&lt;br /&gt;
&lt;br /&gt;
* Ground state ONIOM optimisation and frequency analysis INPUT - [[Media:3PHY_ONIOM_Optimisation_Frequency_EE.gjf]]&lt;br /&gt;
&lt;br /&gt;
==S1 Frequency calculation==&lt;br /&gt;
&lt;br /&gt;
The aim of this part of the project is to follow an excited state reaction path using an IRC having simulated a vertical excitation from S0 → S1 (geometry is the same). To do this we cannot use a ground state theory like DFT cf. the density is the ground state density. Methods that can do excited states are TD-DFT, CIS and CASSCF (there are many - we are interested in these). So we will be using the ONIOM method with an excited state-capable theory on the model high level region and looking at the effects of Electronic embedding on the IRC of the S1 surface.&lt;br /&gt;
&lt;br /&gt;
The IRC keyword requests that a reaction path be followed by integrating the intrinsic reaction coordinate, ONIOM calculations use the first-order Euler integration for the predictor step along with the HPC corrector step. It is a practical choice for calculations on large molecules. &lt;br /&gt;
&lt;br /&gt;
It is assumed that the IRC will be started from a transition state structure (1st order saddle point) and thus have only 1 imaginary frequency. The IRC can then follow this transition vector either in the forward or backward direction. Vertical excitation from a S0 minimum will be highly unlikely to lead to a 1st order saddle point on the S1 PES and will almost certainly result in being on the &amp;quot;side of a hill&amp;quot; on the S1 PES - this leads to complications with the following IRC calculation.&lt;br /&gt;
&lt;br /&gt;
There will be multiple imaginary frequencies at this non-stationary point on the S1 surface, the IRC will follow the first one it finds which is not necessarily the vector we want to follow! We need to define a specific vector for the IRC to follow which represents the reaction path we are investigating. The best way to do this is to visualize the imaginary frequencies and visually identify the frequency which represents the reaction, the frequency will be accompanied by a vector which can be read into the IRC calculation.&lt;br /&gt;
&lt;br /&gt;
To read the vector into the IRC calculation use &#039;&#039;IRC=ReadVector&#039;&#039; in the route section with the vector listed after the molecule specification in rows containing 8 values per row with 10 characters (inc spaces) making up each value.&lt;br /&gt;
&lt;br /&gt;
IRC calculations require initial force constants to proceed. You must provide these to the calculation in some way, options defined in the route section:&lt;br /&gt;
&lt;br /&gt;
* &#039;&#039;IRC=RCFC&#039;&#039; reads the force constants from the checkpoint file from the preceding frequency calculation (used to identify the required frequency corresponding to the reaction coordinate)&lt;br /&gt;
* &#039;&#039;&#039;IRC=CalcFC&#039;&#039;&#039; recalculates the force constants at the beginning of the IRC calculation&lt;/div&gt;</summary>
		<author><name>Af706</name></author>
	</entry>
	<entry>
		<id>https://chemwiki.ch.ic.ac.uk/index.php?title=AndyForesterPYP&amp;diff=103728</id>
		<title>AndyForesterPYP</title>
		<link rel="alternate" type="text/html" href="https://chemwiki.ch.ic.ac.uk/index.php?title=AndyForesterPYP&amp;diff=103728"/>
		<updated>2010-03-12T13:54:03Z</updated>

		<summary type="html">&lt;p&gt;Af706: /* S1 Frequency calculation */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;=3PHY=&lt;br /&gt;
&lt;br /&gt;
Back to Main - https://www.ch.ic.ac.uk/wiki/index.php/AndyForester&lt;br /&gt;
&lt;br /&gt;
==First thoughts==&lt;br /&gt;
&lt;br /&gt;
The 3PHY NMR structure was used as an initial structure and processed like 1W7S. NMR experiments like this lead to a huge amount of information about through-space distances between atoms&lt;br /&gt;
and also bonded distances, but the sheer size of the data set means it can be fit lots of ways (fitting is done iteratively starting from a guess stucture). The result is a set of possible structures, in this case there are 26 different possible stuctures - im not sure which one the wiki starts with!&lt;br /&gt;
&lt;br /&gt;
The file is slightly different to Mikes example... overall -4 charge due to different protonation state (pH7) but charges are the same except for around the chromophore. I included the chromophore + connecting residue when fitting charges since the chromophore is bound to a residue in the primary sequence via a sulphur in is sidechain - this will change the charge distribution of the residue so should be refitted with the chromophore attached. The file you provided uses standard amber charges for this residue and some estimated charges from ESP calculations (+ some fudging to get an integer charge!) for the chromophore.&lt;br /&gt;
&lt;br /&gt;
==The Plan==&lt;br /&gt;
&lt;br /&gt;
* Find and Characterise a Ground State minimum&lt;br /&gt;
* Vertical excitation to the S1 PES - frequency analysis of imaginary modes&lt;br /&gt;
* IRC along S1 PES to S1 minimum.&lt;br /&gt;
&lt;br /&gt;
==Ground State calculations==&lt;br /&gt;
&lt;br /&gt;
A ground state minimum was found using ONIOM B3LYP 6-31G* : Amber with Electronic Embedding and characterised with a frequency calculation.&lt;br /&gt;
&lt;br /&gt;
* Ground state ONIOM optimisation and frequency analysis INPUT - [[Media:3PHY_ONIOM_Optimisation_Frequency_EE.gjf]]&lt;br /&gt;
&lt;br /&gt;
==S1 Frequency calculation==&lt;br /&gt;
&lt;br /&gt;
The aim of this part of the project is to follow an excited state reaction path using an IRC having simulated a vertical excitation from S0 → S1 (geometry is the same). To do this we cannot use a ground state theory like DFT cf. the density is the ground state density. Methods that can do excited states are TD-DFT, CIS and CASSCF (there are many - we are interested in these). So we will be using the ONIOM method with an excited state-capable theory on the model high level region and looking at the effects of Electronic embedding on the IRC of the S1 surface.&lt;br /&gt;
&lt;br /&gt;
The IRC keyword requests that a reaction path be followed by integrating the intrinsic reaction coordinate, ONIOM calculations the first-order Euler integration for the predictor step along with the HPC corrector step. It is a practical choice for calculations on large molecules. &lt;br /&gt;
&lt;br /&gt;
It is assumed that the IRC will be started from a transition state structure (1st order saddle point) and thus have only 1 imaginary frequency. The IRC can then follow this transition vector either in the forward or backward direction. Vertical excitation from a S0 minimum will be highly unlikely to lead to a 1st order saddle point on the S1 PES and will almost certainly result in being on the &amp;quot;side of a hill&amp;quot; on the S1 PES - this leads to complications with the following IRC calculation.&lt;br /&gt;
&lt;br /&gt;
There will be multiple imaginary frequencies at this non-stationary point on the S1 surface, the IRC will follow the first one it finds which is not necessarily the vector we want to follow! We need to define a specific vector for the IRC to follow which represents the reaction path we are investigating. The best way to do this is to visualize the imaginary frequencies and visually identify the frequency which represents the reaction, the frequency will be accompanied by a vector which can be read into the IRC calculation.&lt;br /&gt;
&lt;br /&gt;
To read the vector into the IRC calculation use &#039;&#039;IRC=ReadVector&#039;&#039; in the route section with the vector listed after the molecule specification in rows containing 8 values per row with 10 characters (inc spaces) making up each value.&lt;br /&gt;
&lt;br /&gt;
IRC calculations require initial force constants to proceed. You must provide these to the calculation in some way, options defined in the route section:&lt;br /&gt;
&lt;br /&gt;
* &#039;&#039;IRC=RCFC&#039;&#039; reads the force constants from the checkpoint file from the preceding frequency calculation (used to identify the required frequency corresponding to the reaction coordinate)&lt;br /&gt;
* &#039;&#039;&#039;IRC=CalcFC&#039;&#039;&#039; recalculates the force constants at the beginning of the IRC calculation&lt;/div&gt;</summary>
		<author><name>Af706</name></author>
	</entry>
	<entry>
		<id>https://chemwiki.ch.ic.ac.uk/index.php?title=AndyForesterPYP&amp;diff=103724</id>
		<title>AndyForesterPYP</title>
		<link rel="alternate" type="text/html" href="https://chemwiki.ch.ic.ac.uk/index.php?title=AndyForesterPYP&amp;diff=103724"/>
		<updated>2010-03-12T13:48:52Z</updated>

		<summary type="html">&lt;p&gt;Af706: /* Ground State calculations */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;=3PHY=&lt;br /&gt;
&lt;br /&gt;
Back to Main - https://www.ch.ic.ac.uk/wiki/index.php/AndyForester&lt;br /&gt;
&lt;br /&gt;
==First thoughts==&lt;br /&gt;
&lt;br /&gt;
The 3PHY NMR structure was used as an initial structure and processed like 1W7S. NMR experiments like this lead to a huge amount of information about through-space distances between atoms&lt;br /&gt;
and also bonded distances, but the sheer size of the data set means it can be fit lots of ways (fitting is done iteratively starting from a guess stucture). The result is a set of possible structures, in this case there are 26 different possible stuctures - im not sure which one the wiki starts with!&lt;br /&gt;
&lt;br /&gt;
The file is slightly different to Mikes example... overall -4 charge due to different protonation state (pH7) but charges are the same except for around the chromophore. I included the chromophore + connecting residue when fitting charges since the chromophore is bound to a residue in the primary sequence via a sulphur in is sidechain - this will change the charge distribution of the residue so should be refitted with the chromophore attached. The file you provided uses standard amber charges for this residue and some estimated charges from ESP calculations (+ some fudging to get an integer charge!) for the chromophore.&lt;br /&gt;
&lt;br /&gt;
==The Plan==&lt;br /&gt;
&lt;br /&gt;
* Find and Characterise a Ground State minimum&lt;br /&gt;
* Vertical excitation to the S1 PES - frequency analysis of imaginary modes&lt;br /&gt;
* IRC along S1 PES to S1 minimum.&lt;br /&gt;
&lt;br /&gt;
==Ground State calculations==&lt;br /&gt;
&lt;br /&gt;
A ground state minimum was found using ONIOM B3LYP 6-31G* : Amber with Electronic Embedding and characterised with a frequency calculation.&lt;br /&gt;
&lt;br /&gt;
* Ground state ONIOM optimisation and frequency analysis INPUT - [[Media:3PHY_ONIOM_Optimisation_Frequency_EE.gjf]]&lt;br /&gt;
&lt;br /&gt;
==S1 Frequency calculation==&lt;br /&gt;
&lt;br /&gt;
The aim of this part of the project is to follow an excited state reaction path using an IRC having simulated a vertical excitation from S0 → S1 (geometry is the same). To do this we cannot use a ground state theory like DFT cf. the density is the ground state density. Methods that can do excited states are TD-DFT, CIS and CASSCF (there are many - we are interested in these). So we will be using the ONIOM method with an excited state-capable theory on the model high level region and looking at the effects of Electronic embedding on the IRC of the S1 surface.&lt;br /&gt;
&lt;br /&gt;
The IRC keyword requests that a reaction path be followed by integrating the intrinsic reaction coordinate, ONIOM calculations the first-order Euler integration for the predictor step along with the HPC corrector step. It is a practical choice for calculations on large molecules. &lt;br /&gt;
&lt;br /&gt;
It is assumed that the IRC will be started from a transition state structure (1st order saddle point) and thus have only 1 imaginary frequency. The IRC can then follow this transition vector either in the forward or backward direction. Vertical excitation from a S0 minimum will be highly unlikely to lead to a 1st order saddle point on the S1 PES and will almost certainly result in being on the &amp;quot;side of a hill&amp;quot; on the S1 PES - this leads to complications with the following IRC calculation.&lt;br /&gt;
&lt;br /&gt;
There will be multiple imaginary frequencies at this non-stationary point on the S1 surface, the IRC will follow the first one it finds which is not necessarily the vector we want to follow! We need to define a specific vector for the IRC to follow which represents the reaction path we are investigating. The best way to do this is to visualize the imaginary frequencies and visually identify the frequency which represents the reaction, the frequency will be accompanied by a vector which can be read into the IRC calculation.&lt;br /&gt;
&lt;br /&gt;
IRC calculations require initial force constants to proceed. You must provide these to the calculation in some way:&lt;br /&gt;
&lt;br /&gt;
* IRC=RCFC reads the force constants from the checkpoint file from the preceding frequency calculation (used to identify the required frequency corresponding to the reaction coordinate)&lt;br /&gt;
* IRC=CalcFC recalculates the force constants at the beginning of the IRC calculation&lt;/div&gt;</summary>
		<author><name>Af706</name></author>
	</entry>
	<entry>
		<id>https://chemwiki.ch.ic.ac.uk/index.php?title=AndyForesterPYP&amp;diff=103713</id>
		<title>AndyForesterPYP</title>
		<link rel="alternate" type="text/html" href="https://chemwiki.ch.ic.ac.uk/index.php?title=AndyForesterPYP&amp;diff=103713"/>
		<updated>2010-03-12T13:18:35Z</updated>

		<summary type="html">&lt;p&gt;Af706: /* Ground State calculations */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;=3PHY=&lt;br /&gt;
&lt;br /&gt;
Back to Main - https://www.ch.ic.ac.uk/wiki/index.php/AndyForester&lt;br /&gt;
&lt;br /&gt;
==First thoughts==&lt;br /&gt;
&lt;br /&gt;
The 3PHY NMR structure was used as an initial structure and processed like 1W7S. NMR experiments like this lead to a huge amount of information about through-space distances between atoms&lt;br /&gt;
and also bonded distances, but the sheer size of the data set means it can be fit lots of ways (fitting is done iteratively starting from a guess stucture). The result is a set of possible structures, in this case there are 26 different possible stuctures - im not sure which one the wiki starts with!&lt;br /&gt;
&lt;br /&gt;
The file is slightly different to Mikes example... overall -4 charge due to different protonation state (pH7) but charges are the same except for around the chromophore. I included the chromophore + connecting residue when fitting charges since the chromophore is bound to a residue in the primary sequence via a sulphur in is sidechain - this will change the charge distribution of the residue so should be refitted with the chromophore attached. The file you provided uses standard amber charges for this residue and some estimated charges from ESP calculations (+ some fudging to get an integer charge!) for the chromophore.&lt;br /&gt;
&lt;br /&gt;
==The Plan==&lt;br /&gt;
&lt;br /&gt;
* Find and Characterise a Ground State minimum&lt;br /&gt;
* Vertical excitation to the S1 PES - frequency analysis of imaginary modes&lt;br /&gt;
* IRC along S1 PES to S1 minimum.&lt;br /&gt;
&lt;br /&gt;
==Ground State calculations==&lt;br /&gt;
&lt;br /&gt;
A ground state minimum was found using ONIOM B3LYP 6-31G* : Amber with Mechanical and Electronic Embedding (Separately) and characterised with a frequency calculation.&lt;br /&gt;
&lt;br /&gt;
* Ground state ONIOM optimisation and frequency analysis INPUT - [[Media:3PHY_ONIOM_Optimisation_Frequency_EE.gjf]]&lt;/div&gt;</summary>
		<author><name>Af706</name></author>
	</entry>
	<entry>
		<id>https://chemwiki.ch.ic.ac.uk/index.php?title=AndyForesterPYP&amp;diff=103711</id>
		<title>AndyForesterPYP</title>
		<link rel="alternate" type="text/html" href="https://chemwiki.ch.ic.ac.uk/index.php?title=AndyForesterPYP&amp;diff=103711"/>
		<updated>2010-03-12T13:18:23Z</updated>

		<summary type="html">&lt;p&gt;Af706: /* Ground State calculations */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;=3PHY=&lt;br /&gt;
&lt;br /&gt;
Back to Main - https://www.ch.ic.ac.uk/wiki/index.php/AndyForester&lt;br /&gt;
&lt;br /&gt;
==First thoughts==&lt;br /&gt;
&lt;br /&gt;
The 3PHY NMR structure was used as an initial structure and processed like 1W7S. NMR experiments like this lead to a huge amount of information about through-space distances between atoms&lt;br /&gt;
and also bonded distances, but the sheer size of the data set means it can be fit lots of ways (fitting is done iteratively starting from a guess stucture). The result is a set of possible structures, in this case there are 26 different possible stuctures - im not sure which one the wiki starts with!&lt;br /&gt;
&lt;br /&gt;
The file is slightly different to Mikes example... overall -4 charge due to different protonation state (pH7) but charges are the same except for around the chromophore. I included the chromophore + connecting residue when fitting charges since the chromophore is bound to a residue in the primary sequence via a sulphur in is sidechain - this will change the charge distribution of the residue so should be refitted with the chromophore attached. The file you provided uses standard amber charges for this residue and some estimated charges from ESP calculations (+ some fudging to get an integer charge!) for the chromophore.&lt;br /&gt;
&lt;br /&gt;
==The Plan==&lt;br /&gt;
&lt;br /&gt;
* Find and Characterise a Ground State minimum&lt;br /&gt;
* Vertical excitation to the S1 PES - frequency analysis of imaginary modes&lt;br /&gt;
* IRC along S1 PES to S1 minimum.&lt;br /&gt;
&lt;br /&gt;
==Ground State calculations==&lt;br /&gt;
&lt;br /&gt;
A ground state minimum was found using ONIOM B3LYP 6-31G* : Amber with Mechanical and Electronic Embedding (Separately) and characterised with a frequency calculation.&lt;br /&gt;
&lt;br /&gt;
Ground state ONIOM optimisation and frequency analysis INPUT - [[Media:3PHY_ONIOM_Optimisation_Frequency_EE.gjf]]&lt;/div&gt;</summary>
		<author><name>Af706</name></author>
	</entry>
	<entry>
		<id>https://chemwiki.ch.ic.ac.uk/index.php?title=File:3PHY_ONIOM_Optimisation_Frequency_EE.gjf&amp;diff=103708</id>
		<title>File:3PHY ONIOM Optimisation Frequency EE.gjf</title>
		<link rel="alternate" type="text/html" href="https://chemwiki.ch.ic.ac.uk/index.php?title=File:3PHY_ONIOM_Optimisation_Frequency_EE.gjf&amp;diff=103708"/>
		<updated>2010-03-12T13:17:06Z</updated>

		<summary type="html">&lt;p&gt;Af706: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;&lt;/div&gt;</summary>
		<author><name>Af706</name></author>
	</entry>
	<entry>
		<id>https://chemwiki.ch.ic.ac.uk/index.php?title=AndyForesterPYP&amp;diff=103707</id>
		<title>AndyForesterPYP</title>
		<link rel="alternate" type="text/html" href="https://chemwiki.ch.ic.ac.uk/index.php?title=AndyForesterPYP&amp;diff=103707"/>
		<updated>2010-03-12T13:16:09Z</updated>

		<summary type="html">&lt;p&gt;Af706: /* First thoughts */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;=3PHY=&lt;br /&gt;
&lt;br /&gt;
Back to Main - https://www.ch.ic.ac.uk/wiki/index.php/AndyForester&lt;br /&gt;
&lt;br /&gt;
==First thoughts==&lt;br /&gt;
&lt;br /&gt;
The 3PHY NMR structure was used as an initial structure and processed like 1W7S. NMR experiments like this lead to a huge amount of information about through-space distances between atoms&lt;br /&gt;
and also bonded distances, but the sheer size of the data set means it can be fit lots of ways (fitting is done iteratively starting from a guess stucture). The result is a set of possible structures, in this case there are 26 different possible stuctures - im not sure which one the wiki starts with!&lt;br /&gt;
&lt;br /&gt;
The file is slightly different to Mikes example... overall -4 charge due to different protonation state (pH7) but charges are the same except for around the chromophore. I included the chromophore + connecting residue when fitting charges since the chromophore is bound to a residue in the primary sequence via a sulphur in is sidechain - this will change the charge distribution of the residue so should be refitted with the chromophore attached. The file you provided uses standard amber charges for this residue and some estimated charges from ESP calculations (+ some fudging to get an integer charge!) for the chromophore.&lt;br /&gt;
&lt;br /&gt;
==The Plan==&lt;br /&gt;
&lt;br /&gt;
* Find and Characterise a Ground State minimum&lt;br /&gt;
* Vertical excitation to the S1 PES - frequency analysis of imaginary modes&lt;br /&gt;
* IRC along S1 PES to S1 minimum.&lt;br /&gt;
&lt;br /&gt;
==Ground State calculations==&lt;br /&gt;
&lt;br /&gt;
A ground state minimum was found using ONIOM B3LYP 6-31G* : Amber with Mechanical and Electronic Embedding (Separately) and characterised with a frequency calculation.&lt;/div&gt;</summary>
		<author><name>Af706</name></author>
	</entry>
	<entry>
		<id>https://chemwiki.ch.ic.ac.uk/index.php?title=AndyForester&amp;diff=103703</id>
		<title>AndyForester</title>
		<link rel="alternate" type="text/html" href="https://chemwiki.ch.ic.ac.uk/index.php?title=AndyForester&amp;diff=103703"/>
		<updated>2010-03-12T13:11:53Z</updated>

		<summary type="html">&lt;p&gt;Af706: /* TAO tools */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;=HBDI Calculations=&lt;br /&gt;
==Wed 21st Oct 2009==&lt;br /&gt;
&lt;br /&gt;
- Started project work&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
- Gaussview and Gaussian working&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
- Working out how to use the command inputs&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
- Trying to sort VPN out&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
- Working to get the project proposal written&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
&lt;br /&gt;
==Tue 27th Oct 2009==&lt;br /&gt;
&lt;br /&gt;
Drew up the HBDI molecule in Gaussview, the molecule will be optimised at the HF 3-21G level on the mac (locally). &amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
DFT B3LYP 6-31G (Becke threeparameter hybrid exchange functional5 and the Lee-Yang-Parr correlation functional) method optimisation was run on the resulting file, the MP2 6-31G method optimisation. Each of these had a corresponding freq calculation to check that minima were reached. All calculations were successful, with no negative vibrations.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;jmol&amp;gt;&lt;br /&gt;
  &amp;lt;jmolAppletButton&amp;gt;&lt;br /&gt;
    &amp;lt;uploadedFileContents&amp;gt;HBDI_MP2_6-31G_FREQ.mol&amp;lt;/uploadedFileContents&amp;gt;&lt;br /&gt;
    &amp;lt;text&amp;gt;HBDI optimised at the MP2 6-31G level&amp;lt;/text&amp;gt;&lt;br /&gt;
    &amp;lt;color&amp;gt;#FFFFFF&amp;lt;/color&amp;gt;&lt;br /&gt;
    &amp;lt;size&amp;gt;600&amp;lt;/size&amp;gt;&lt;br /&gt;
    &amp;lt;/jmolAppletButton&amp;gt;&lt;br /&gt;
&amp;lt;/jmol&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
&lt;br /&gt;
==Wed 28th Oct 2009==&lt;br /&gt;
&lt;br /&gt;
Ran DFT B3YLP 6-311G calculations on HBDI (Opt and Freq) since these were the calculations used in a published study of HBDI.&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
Optimisation job finshed successfully, the frequency calculation did not finish succesfully. Error was:&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
&lt;br /&gt;
 ****Warning!!: The largest alpha MO coefficient is  0.17588883D+02&lt;br /&gt;
 Symmetrizing basis deriv contribution to polar:&lt;br /&gt;
 IMax=3 JMax=2 DiffMx= 0.00D+00&lt;br /&gt;
 G2DrvN: will do    29 centers at a time, making    1 passes doing MaxLOS=1.&lt;br /&gt;
 Calling FoFCou, ICntrl=  3107 FMM=F I1Cent=   0 AccDes= 0.00D+00.&lt;br /&gt;
 FoFDir/FoFCou used for L=0 through L=1.&lt;br /&gt;
 End of G2Drv Frequency-dependent properties file   721 does not exist.&lt;br /&gt;
 End of G2Drv Frequency-dependent properties file   722 does not exist.&lt;br /&gt;
          IDoAtm=1111111111111111111111111111&lt;br /&gt;
          Differentiating once with respect to electric field.&lt;br /&gt;
                with respect to dipole field.&lt;br /&gt;
          Differentiating once with respect to nuclear coordinates.&lt;br /&gt;
          Keep R1 ints in memory in canonical form, NReq=447529180.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
==Thurs 29th Oct 2009==&lt;br /&gt;
&lt;br /&gt;
===Learning to use ONIOM===&lt;br /&gt;
&lt;br /&gt;
The example on chem wiki was attempted. Drawn up in Gaussview and optimised initially using molecular mechanics AMBER force field.&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
The ONIOM calculation was set up in Gaussview using DFT B3LYP 6-311G(d):AMBER methods with the benzene ring as the high level and chain at low level. Electronic embedding was NOT used in this initial calculation. &amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;jmol&amp;gt;&lt;br /&gt;
  &amp;lt;jmolAppletButton&amp;gt;&lt;br /&gt;
    &amp;lt;uploadedFileContents&amp;gt;ONIOM_B3LYP_6-311G_AMBER.mol&amp;lt;/uploadedFileContents&amp;gt;&lt;br /&gt;
    &amp;lt;text&amp;gt;Test Molecule optimised using B3LYP 6-311G(d):AMBER&amp;lt;/text&amp;gt;&lt;br /&gt;
    &amp;lt;color&amp;gt;#FFFFFF&amp;lt;/color&amp;gt;&lt;br /&gt;
    &amp;lt;size&amp;gt;600&amp;lt;/size&amp;gt;&lt;br /&gt;
  &amp;lt;/jmolAppletButton&amp;gt;&lt;br /&gt;
&amp;lt;/jmol&amp;gt;&lt;br /&gt;
&lt;br /&gt;
This conformation is NOT the lowest energy conformation, the initial AMBER MM calculation did not find the global minimum probably due to the drawn up starting geometry.&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
The molecule was drawn up again and DFT B3LYP 6-311G(d):AMBER and pure B3LYP 6-311G(d) calculations run. Absolute energy differences between the 2 conformations was very small at the ONIOM(DFT B3LYP 6-311G(d):AMBER) level. -232.27532671 vs -232.27315709 au respectively, a 0.00271au (4.359 744 17×10-18 J = 1au) difference.&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;jmol&amp;gt;&lt;br /&gt;
  &amp;lt;jmolAppletButton&amp;gt;&lt;br /&gt;
    &amp;lt;uploadedFileContents&amp;gt;ONIOM_B3LYP_6-311G_AMBER2.mol&amp;lt;/uploadedFileContents&amp;gt;&lt;br /&gt;
    &amp;lt;text&amp;gt;Test Molecule optimised using B3LYP 6-311G(d):AMBER&amp;lt;/text&amp;gt;&lt;br /&gt;
    &amp;lt;color&amp;gt;#FFFFFF&amp;lt;/color&amp;gt;&lt;br /&gt;
    &amp;lt;size&amp;gt;600&amp;lt;/size&amp;gt;&lt;br /&gt;
  &amp;lt;/jmolAppletButton&amp;gt;&lt;br /&gt;
&amp;lt;/jmol&amp;gt;&lt;br /&gt;
&lt;br /&gt;
It has already become obvious that using ONIOM decreases calculation time considerably. The table below shows a &amp;quot;CPU time&amp;quot; comparison on the above molecule using the different approaches.&lt;br /&gt;
&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot; border=&amp;quot;1&amp;quot;&lt;br /&gt;
| align=&amp;quot;center&amp;quot; style=&amp;quot;background:#f0f0f0;&amp;quot;|&#039;&#039;&#039;Calculation&#039;&#039;&#039;&lt;br /&gt;
| align=&amp;quot;center&amp;quot; style=&amp;quot;background:#f0f0f0;&amp;quot;|&#039;&#039;&#039;Time (1st attempt)&#039;&#039;&#039;&lt;br /&gt;
| align=&amp;quot;center&amp;quot; style=&amp;quot;background:#f0f0f0;&amp;quot;|&#039;&#039;&#039;Time (2nd attempt)&#039;&#039;&#039;&lt;br /&gt;
|-&lt;br /&gt;
|   ONIOM(DFT B3LYP 6-311G(d):AMBER)||4 minutes 22.0 seconds||8 minutes  5.5 seconds&lt;br /&gt;
|-&lt;br /&gt;
|   DFT 3LYP 6-311G(d)||2 hours 12 minutes 21.9 seconds||2 hours 30 minutes  0.8 seconds&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
&lt;br /&gt;
==Mon 2nd Nov 2009==&lt;br /&gt;
&lt;br /&gt;
Having obtained optimized structures of HBDI at various theory levels and basis set variations, these structures will now be compared to previous literature. Extensive studies have been performed using HF and B3LYP methods previously. It has been noted in Jaspers recent review that HF does not model frequencies very accurately - my computations agree HF 6-31G frequencies are very far out. B3LYP frequencies are much closer to the experimental values.&lt;br /&gt;
&lt;br /&gt;
==Tues 3rd Nov 2009==&lt;br /&gt;
&lt;br /&gt;
Tabulated frerquency comparison&lt;br /&gt;
&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot; border=&amp;quot;1&amp;quot;&lt;br /&gt;
| align=&amp;quot;center&amp;quot; style=&amp;quot;background:#f0f0f0;&amp;quot;|&lt;br /&gt;
| align=&amp;quot;center&amp;quot; style=&amp;quot;background:#f0f0f0;&amp;quot;|&#039;&#039;&#039;A.Esposito HF 6-31G(d)&#039;&#039;&#039;&lt;br /&gt;
| align=&amp;quot;center&amp;quot; style=&amp;quot;background:#f0f0f0;&amp;quot;|&#039;&#039;&#039;A.Esposito B3LYP 6-31G(d)&#039;&#039;&#039;&lt;br /&gt;
| align=&amp;quot;center&amp;quot; style=&amp;quot;background:#f0f0f0;&amp;quot;|&#039;&#039;&#039;X.He B3LYP 6-31G(d)&#039;&#039;&#039;&lt;br /&gt;
| align=&amp;quot;center&amp;quot; style=&amp;quot;background:#f0f0f0;&amp;quot;|&#039;&#039;&#039;HF 6-31G(d)&#039;&#039;&#039;&lt;br /&gt;
| align=&amp;quot;center&amp;quot; style=&amp;quot;background:#f0f0f0;&amp;quot;|&#039;&#039;&#039;B3LYP 6-31G(d)&#039;&#039;&#039;&lt;br /&gt;
|-&lt;br /&gt;
| C=O||1987||1802||1802||1986||1801&lt;br /&gt;
|-&lt;br /&gt;
| C=C||1880||1705||1704||1880||1704&lt;br /&gt;
|-&lt;br /&gt;
| Phenol||1809||1665||1663||1812||1666&lt;br /&gt;
|-&lt;br /&gt;
| Phenol||1781||1635||1631||1777||1635&lt;br /&gt;
|-&lt;br /&gt;
| C=N||1844||1620||1618||1844||1618&lt;br /&gt;
|-&lt;br /&gt;
| Phenol||1696||1562||1558||1694||1562&lt;br /&gt;
|-&lt;br /&gt;
| C-N||1607||1483||N/A||1606||1482&lt;br /&gt;
|-&lt;br /&gt;
| Phenol||1602||1487||N/A||1603||1488&lt;br /&gt;
|-&lt;br /&gt;
| Total Energy(au)||||||||-720.043||-724.465&lt;br /&gt;
|-&lt;br /&gt;
| &lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
==Thurs 5th Nov 2009==&lt;br /&gt;
&lt;br /&gt;
Having got Jaspers input files, several things were noticed:&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
* Solvent models were used - should be in vacum.... I have re run these in G09 with No solvent on both DFT 6-311G(d,p) and HF 6-31+G(d) methods/basis sets for comparison&lt;br /&gt;
* The models in his Biophys paper using other residues in contact COO- for example are not present. He has not reported the pure HBDI calculation results.&lt;br /&gt;
* More than just HBDI is likely to be required, meaning larger than expected high level region in the GFP calculation. (3 layer ONIOM DFT:HF:AMBER)?&lt;br /&gt;
&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
Other things to do:&lt;br /&gt;
&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
* Work on a guide to loading .pdb files into Gaussview 5 - how much information is maintained?&lt;br /&gt;
* Are residues defined still in Gaussview input files&lt;br /&gt;
* AMBER MM requires definitions of charges etc... how easy is it to import from .pdb and get a file that can be run in opt etc...&lt;br /&gt;
&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
* Find a model system which is larger than HBDI but incorporates this molecule &amp;lt;=500 atoms (Limit of size for pure DFT calculation)&lt;br /&gt;
* Possibilities include a system alread developed by someone else online&lt;br /&gt;
* A &amp;quot;custom&amp;quot; system made by me to model this (time intensive)&lt;br /&gt;
* &amp;quot;strip&amp;quot; back the GFP pdb file to 500 atoms surrounding the HBDI chromophore.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
==Fri 6th Nov 2009==&lt;br /&gt;
&lt;br /&gt;
Yesterday started by trying to follow the tutorial on loading the GFP molecule from .pdb, this resulted in a problem with Gaussviews addition of hydrogens. It over protonated the nitrogens and carbon in the chromophore leading to over valent parts. I then tried several other molecules (PYP, a scorpion toxin and a strand of RNA) which had no problems with hydrogen addition.&lt;br /&gt;
&lt;br /&gt;
I tried to use ChemBio 3D to load the .pdb and correctly assign hydrogens/connectivity. This resulted in the same protonation problems!&lt;br /&gt;
&lt;br /&gt;
==Mon 9th Nov 2009==&lt;br /&gt;
&lt;br /&gt;
TINKER - http://dasher.wustl.edu/tinker/ was installed with the Force Field Explorer (FFE) interface on the mac. This was used to load the same .pdb file and resulted in the same protonation around the chromophore. So far the SAME protonation pattern (incorrect) around the chromophore has been found using:&lt;br /&gt;
&lt;br /&gt;
*Gaussview&lt;br /&gt;
*ChemBio 3D&lt;br /&gt;
*TINKER&lt;br /&gt;
&lt;br /&gt;
Looking into how the .pdb file format works, the molecule is built from a series of residues, it becomes clear as you look at the sequence that is building the GFP protein chromophore (residues 65,66,67 in 1GFL) are not then reproducing the internal cyclisation properly leaving protons attached and thus atoms hypervalent. This accounts for the consistent error loading from the .pdb. As a result must all molecules expressed as a .pdb file must be pure protein structures...?&lt;br /&gt;
&lt;br /&gt;
===Comparison of the 2 available wild type GFP structures===&lt;br /&gt;
&lt;br /&gt;
The 2 structures being considered here are:&lt;br /&gt;
&lt;br /&gt;
*1GFL - Yang et al. -http://www.pdb.org/pdb/explore/explore.do?structureId=1GFL&lt;br /&gt;
*1W7S - Jaspers stucture - http://www.pdb.org/pdb/explore/explore.do?structureId=1W7S&lt;br /&gt;
&lt;br /&gt;
These 2 structures differ in that 1GFL contains 2 GFP molecules, whereas 1W7S contains 4. These were both stripped down to 1 unit of GFP, using the &amp;quot;pdb secondary structure&amp;quot; function in Gaussview 5. 1GFL had chain B removed and 1W7S had chains B,C and D removed. Next the residues were investigated that created the chromophore. The results are not consistent because GFL uses 3 residues (65,66,67) to form the chromophore. 1W7S has residue 66 (labelled CSY) as the entire chromophore! Both get protonated the same (incorrectly) which is odd considering how they molecules are built.&lt;br /&gt;
&lt;br /&gt;
The next test was to &amp;quot;modify bond&amp;quot; in Gaussview 5 and create a &amp;quot;cleaned&amp;quot; chromophore structure (no bonding information from the residue) by manually changing the bonds. The chromophore in each .pdb file was altered to look like HBDI in terms of double single bonds and the hydrogens taken out (makes no difference as these are created each time the .pdb is loaded). When the new .pdb files (1GFL_cleaned.pdb and 1W7S_cleaned.pdb) were loaded they were correctly protonated! Success.&lt;br /&gt;
&lt;br /&gt;
===1W7S improvement===&lt;br /&gt;
&lt;br /&gt;
Chain A (not cleaned) of the 1W7S .pdb was submitted to run a AMBER calculation, as expected it failed with the lines:&lt;br /&gt;
&lt;br /&gt;
 Missing atomic parameters for atom   474 IAtTyp=    20000000&lt;br /&gt;
 Missing atomic parameters.&lt;br /&gt;
&lt;br /&gt;
This error says that the atom type of atom 474 is not defined. One way to counter this is to locate the atom and then change its definition manually (as shown in the example https://www.ch.ic.ac.uk/wiki/index.php/PYP_:_Photoactive_yellow_protein - time intensive!)&lt;br /&gt;
&lt;br /&gt;
Instead the molecule was cleaned (manually changing the bonding in the chromophore), after the bond updates the structure was re submitted, again failing but with a different message:&lt;br /&gt;
&lt;br /&gt;
 Include all MM classes&lt;br /&gt;
  Bondstretch undefined between atoms    473    478 CC-N* [H,H]  &lt;br /&gt;
  Bondstretch undefined between atoms    475    476 CC-CC [H,H]  &lt;br /&gt;
  Bondstretch undefined between atoms    475    486 CC-CM [H,H]  &lt;br /&gt;
  Bondstretch undefined between atoms    476    477 CC-O [H,H]  &lt;br /&gt;
  Bondstretch undefined between atoms    476    478 CC-N* [H,H]  &lt;br /&gt;
  Bondstretch undefined between atoms    486   2559 CM-HC [H,H]  &lt;br /&gt;
  Angle bend  undefined between atoms    473    474    475 CC-NB-CC [H,H,H]  &lt;br /&gt;
  Angle bend  undefined between atoms    473    478    479 CC-N*-CT [H,H,H]  &lt;br /&gt;
  Angle bend  undefined between atoms    473    478    476 CC-N*-CC [H,H,H]  &lt;br /&gt;
  Angle bend  undefined between atoms    473    482    483 CC-CT-N [H,H,H]  &lt;br /&gt;
  Angle bend  undefined between atoms    473    482   3857 CC-CT-H1 [H,H,H]  &lt;br /&gt;
  Angle bend  undefined between atoms    474    475    476 NB-CC-CC [H,H,H]  &lt;br /&gt;
  Angle bend  undefined between atoms    474    475    486 NB-CC-CM [H,H,H]  &lt;br /&gt;
  Angle bend  undefined between atoms    474    473    478 NB-CC-N* [H,H,H]  &lt;br /&gt;
  Angle bend  undefined between atoms    475    476    477 CC-CC-O [H,H,H]  &lt;br /&gt;
  Angle bend  undefined between atoms    475    476    478 CC-CC-N* [H,H,H]  &lt;br /&gt;
  Angle bend  undefined between atoms    475    486    487 CC-CM-CA [H,H,H]  &lt;br /&gt;
  Angle bend  undefined between atoms    475    486   2559 CC-CM-HC [H,H,H]  &lt;br /&gt;
  Angle bend  undefined between atoms    476    478    479 CC-N*-CT [H,H,H]  &lt;br /&gt;
  Angle bend  undefined between atoms    476    475    486 CC-CC-CM [H,H,H]  &lt;br /&gt;
  Angle bend  undefined between atoms    477    476    478 O-CC-N* [H,H,H]  &lt;br /&gt;
  Angle bend  undefined between atoms    478    479    480 N*-CT-C [H,H,H]  &lt;br /&gt;
  Angle bend  undefined between atoms    478    473    482 N*-CC-CT [H,H,H]  &lt;br /&gt;
  Angle bend  undefined between atoms    486    487    488 CM-CA-CA [H,H,H]  &lt;br /&gt;
  Angle bend  undefined between atoms    486    487    489 CM-CA-CM [H,H,H]  &lt;br /&gt;
  Angle bend  undefined between atoms    487    486   2559 CA-CM-HC [H,H,H]  &lt;br /&gt;
  Angle bend  undefined between atoms    488    487    489 CA-CA-CM [H,H,H]  &lt;br /&gt;
  Angle bend  undefined between atoms    490    492    491 CA-C-CM [H,H,H]  &lt;br /&gt;
  Angle bend  undefined between atoms    491    492    493 CM-C-OH [H,H,H]  &lt;br /&gt;
  MM function not complete&lt;br /&gt;
&lt;br /&gt;
This is a better error message since all the atoms are now defined, only parameters are missing. For parameters that involve only atoms in the chromophore at the high level in the ONIOM calculation, the values can be set to 0 in the input file. This is because the ONIOM extrapolation will cancel MM from this high level region, To do this the keyword &amp;quot;Amber=Hardfirst&amp;quot; and wildcards for the missing parameters which are zero valued are placed in the input file.&lt;br /&gt;
&lt;br /&gt;
[[Image:Atoms involving undefined parameters.png]]&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
&lt;br /&gt;
The above screeshot shows highlighted the atoms listed in the error... most of them if not all will be involved in the high level calculation and thus may not require parameters defining (other than to be set to 0). The reason the parameters are undefined is that the AMBER force field is very specific to protein bonds and thus the organic chromophore has bonds that are not found in this parameter set.&lt;br /&gt;
&lt;br /&gt;
The atoms in the chromophore were frozen in the atom group editor and the input file had a parameter set added to the end setting chromophore parameters to 0:&lt;br /&gt;
&lt;br /&gt;
 HrmStr1 CC N* 0.0 0.0&lt;br /&gt;
 HrmStr1 CC CC 0.0 0.0&lt;br /&gt;
 HrmStr1 CC CM 0.0 0.0&lt;br /&gt;
 HrmStr1 CC O 0.0 0.0&lt;br /&gt;
 HrmStr1 CM HC 0.0 0.0&lt;br /&gt;
 HrmBnd1 CC NB CC 0.0 0.0&lt;br /&gt;
 HrmBnd1 CC N* CC 0.0 0.0&lt;br /&gt;
 HrmBnd1 CC CT N 0.0 0.0&lt;br /&gt;
 HrmBnd1 CC CT H1 0.0 0.0&lt;br /&gt;
 HrmBnd1 NB CC CC 0.0 0.0&lt;br /&gt;
 HrmBnd1 NB CC CM 0.0 0.0&lt;br /&gt;
 HrmBnd1 NB CC N* 0.0 0.0&lt;br /&gt;
 HrmBnd1 CC CC O  0.0 0.0&lt;br /&gt;
 HrmBnd1 CC CC N* 0.0 0.0&lt;br /&gt;
 HrmBnd1 CC CM CA 0.0 0.0&lt;br /&gt;
 HrmBnd1 CC CM HC 0.0 0.0&lt;br /&gt;
 HrmBnd1 CC CC CM 0.0 0.0&lt;br /&gt;
 HrmBnd1 O CC N*  0.0 0.0&lt;br /&gt;
 HrmBnd1 N* CT C  0.0 0.0&lt;br /&gt;
 HrmBnd1 N* CC CT 0.0 0.0&lt;br /&gt;
 HrmBnd1 CM CA CA 0.0 0.0&lt;br /&gt;
 HrmBnd1 CA CM HC 0.0 0.0&lt;br /&gt;
 HrmBnd1 CA C CM 0.0 0.0&lt;br /&gt;
 HrmBnd1 CM C OH 0.0 0.0&lt;br /&gt;
 HrmBnd1 CC N* CT 0.0 0.0&lt;br /&gt;
 HrmBnd1 CM CA CM 0.0 0.0&lt;br /&gt;
&lt;br /&gt;
This file then ran successfully for an AMBER single point energy calculation. It was then submitted for AMBER optimisation using &amp;quot;amber=hardfirst geom=connectivity&amp;quot; and returned a successful minimum with Forces and Displacements converged:&lt;br /&gt;
&lt;br /&gt;
         Item               Value     Threshold  Converged?&lt;br /&gt;
  Maximum Force            0.000089     0.000450     YES&lt;br /&gt;
  RMS     Force            0.000004     0.000300     YES&lt;br /&gt;
  Maximum Displacement     0.000089     0.001800     YES&lt;br /&gt;
  RMS     Displacement     0.000004     0.001200     YES&lt;br /&gt;
  Predicted change in Energy=-1.156135D-07&lt;br /&gt;
  Optimization completed.&lt;br /&gt;
     -- Stationary point found.&lt;br /&gt;
&lt;br /&gt;
===Cleaned .pdb files and TINKER===&lt;br /&gt;
&lt;br /&gt;
*The files that had been cleaned were opened in TINKER, it is necessary to convert the .pdb to a .xyz file to run any calculations within TINKER. This is currently failing due to an &amp;quot;atom missing on SER-228&amp;quot; - checking this there is no SER residue at 228 (in fact a GLU) so a bit puzzled. This is the 1W7S cleaned version.&lt;br /&gt;
&lt;br /&gt;
*PYP was loaded in its .pdb format to TINKER and was immediately successful in being converted to .xyz then a MINIMISE algorithm ran successfully with the energy at its X-ray crystal structure as 261KJ/mol reduced to -3670.5355KJ/mol in 1645 iterations - this took a while (45mins locally)!&lt;br /&gt;
&lt;br /&gt;
*Interestingly the .xyz output from the minimization can be converted back to .pdb (loses hydrogens minimization) and this PYP file was loaded into Gaussview 5 - the file then ran successfully for an AMBER single point energy calculation! cf. the first attempt to get PYP to run was unsuccessful, and the tutorial underwent much more hassle to run this calculation.&lt;br /&gt;
&lt;br /&gt;
===What .pdb information is retained?===&lt;br /&gt;
&lt;br /&gt;
The only molecule that has been run with AMBER opt (GFP) has lost its secondary structure information but RETAINED its residue information.&lt;br /&gt;
&lt;br /&gt;
==Fri 13th Nov 2009==&lt;br /&gt;
&lt;br /&gt;
===Conversion from.xyz to .com===&lt;br /&gt;
A protein with 2 residues (Glu-Asp) was built using TINKER, saved as:&lt;br /&gt;
&lt;br /&gt;
*.xyz&lt;br /&gt;
*.pdb&lt;br /&gt;
*.com&lt;br /&gt;
&lt;br /&gt;
The initial format was .xyz (created in TINEKR as a native format), converted to .pdb using TINKER and then loaded into Gaussview and saved as a .com.&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
&lt;br /&gt;
.xyz&lt;br /&gt;
&lt;br /&gt;
    36  Protein built with TINKER/Force Field Explorer&lt;br /&gt;
     1  CT     0.000000    0.000000    0.000000   340     2     4     5     6&lt;br /&gt;
     2  C      0.000000    0.000000    1.510000   342     1     3     7&lt;br /&gt;
     3  O      1.028938    0.000000    2.165506   343     2&lt;br /&gt;
     4  HC     1.056270    0.000000   -0.341166   341     1&lt;br /&gt;
     5  HC    -0.497647    0.931694   -0.341166   341     1&lt;br /&gt;
     6  HC    -0.497647   -0.931694   -0.341166   341     1&lt;br /&gt;
     7  N     -1.236201    0.000000    2.027114   232     2     8    11&lt;br /&gt;
     8  CT    -1.446961    0.000000    3.471822   233     7     9    12    13&lt;br /&gt;
     9  C     -2.509557   -0.992751    3.878558   234     8    10    22&lt;br /&gt;
    10  O     -3.569672   -1.098634    3.284129   236     9&lt;br /&gt;
    11  H     -2.025518    0.000000    1.381066   235     7&lt;br /&gt;
    12  H1    -0.503221   -0.281395    3.983946   237     8&lt;br /&gt;
    13  CT    -1.910491    1.397334    3.923704   238     8    14    18    19&lt;br /&gt;
    14  CT    -2.132799    1.397334    5.447574   240    13    15    20    21&lt;br /&gt;
    15  C     -2.585014    2.781238    5.848089   242    14    16    17&lt;br /&gt;
    16  O2    -2.816736    2.973609    7.061266   243    15&lt;br /&gt;
    17  O2    -2.693194    3.629064    4.935955   243    15&lt;br /&gt;
    18  HC    -2.861794    1.653482    3.412322   239    13&lt;br /&gt;
    19  HC    -1.132567    2.145528    3.664589   239    13&lt;br /&gt;
    20  HC    -1.170306    1.191123    5.960588   241    14&lt;br /&gt;
    21  HC    -2.946052    0.685840    5.701535   241    14&lt;br /&gt;
    22  N     -2.153997   -1.723291    4.944151   210     9    23    26&lt;br /&gt;
    23  CT    -3.047986   -2.739703    5.491225   211    22    24    27    28&lt;br /&gt;
    24  C     -3.120370   -2.663723    6.997574   212    23    25    34&lt;br /&gt;
    25  O     -2.128241   -2.535414    7.695870   214    24&lt;br /&gt;
    26  H     -1.239994   -1.547921    5.361576   213    22&lt;br /&gt;
    27  H1    -4.072297   -2.585746    5.092246   215    23&lt;br /&gt;
    28  CT    -2.526745   -4.139104    5.114946   216    23    29    32    33&lt;br /&gt;
    29  C     -3.476741   -5.157722    5.698062   218    28    30    31&lt;br /&gt;
    30  O2    -3.212872   -6.362275    5.493304   219    29&lt;br /&gt;
    31  O2    -4.454666   -4.718803    6.341112   219    29&lt;br /&gt;
    32  HC    -1.530243   -4.298013    5.577366   217    28&lt;br /&gt;
    33  HC    -2.540086   -4.252948    4.010880   217    28&lt;br /&gt;
    34  N     -4.374888   -2.752228    7.460123   344    24    35    36&lt;br /&gt;
    35  H     -5.140865   -2.858197    6.794954   345    34&lt;br /&gt;
    36  H     -4.534830   -2.711583    8.466684   345    34&lt;br /&gt;
&lt;br /&gt;
.pdb&lt;br /&gt;
&lt;br /&gt;
 HEADER    Protein built with TINKER/Force Field Explorer&lt;br /&gt;
 COMPND&lt;br /&gt;
 SOURCE&lt;br /&gt;
 ATOM      1  CH3 ACE     1       0.000   0.000   0.000&lt;br /&gt;
 ATOM      2  C   ACE     1       0.000   0.000   1.510&lt;br /&gt;
 ATOM      3  O   ACE     1       1.029   0.000   2.166&lt;br /&gt;
 ATOM      4  H1  ACE     1       1.056   0.000  -0.341&lt;br /&gt;
 ATOM      5  H2  ACE     1      -0.498   0.932  -0.341&lt;br /&gt;
 ATOM      6  H3  ACE     1      -0.498  -0.932  -0.341&lt;br /&gt;
 ATOM      7  N   GLU     2      -1.236   0.000   2.027&lt;br /&gt;
 ATOM      8  CA  GLU     2      -1.447   0.000   3.472&lt;br /&gt;
 ATOM      9  C   GLU     2      -2.510  -0.993   3.879&lt;br /&gt;
 ATOM     10  O   GLU     2      -3.570  -1.099   3.284&lt;br /&gt;
 ATOM     11  CB  GLU     2      -1.910   1.397   3.924&lt;br /&gt;
 ATOM     12  CG  GLU     2      -2.133   1.397   5.448&lt;br /&gt;
 ATOM     13  CD  GLU     2      -2.585   2.781   5.848&lt;br /&gt;
 ATOM     14  OE1 GLU     2      -2.817   2.974   7.061&lt;br /&gt;
 ATOM     15  OE2 GLU     2      -2.693   3.629   4.936&lt;br /&gt;
 ATOM     16  H   GLU     2      -2.026   0.000   1.381&lt;br /&gt;
 ATOM     17  HA  GLU     2      -0.503  -0.281   3.984&lt;br /&gt;
 ATOM     18  HB2 GLU     2      -2.862   1.653   3.412&lt;br /&gt;
 ATOM     19  HB3 GLU     2      -1.133   2.146   3.665&lt;br /&gt;
 ATOM     20  HG2 GLU     2      -1.170   1.191   5.961&lt;br /&gt;
 ATOM     21  HG3 GLU     2      -2.946   0.686   5.702&lt;br /&gt;
 ATOM     22  N   ASP     3      -2.154  -1.723   4.944&lt;br /&gt;
 ATOM     23  CA  ASP     3      -3.048  -2.740   5.491&lt;br /&gt;
 ATOM     24  C   ASP     3      -3.120  -2.664   6.998&lt;br /&gt;
 ATOM     25  O   ASP     3      -2.128  -2.535   7.696&lt;br /&gt;
 ATOM     26  CB  ASP     3      -2.527  -4.139   5.115&lt;br /&gt;
 ATOM     27  CG  ASP     3      -3.477  -5.158   5.698&lt;br /&gt;
 ATOM     28  OD1 ASP     3      -3.213  -6.362   5.493&lt;br /&gt;
 ATOM     29  OD2 ASP     3      -4.455  -4.719   6.341&lt;br /&gt;
 ATOM     30  H   ASP     3      -1.240  -1.548   5.362&lt;br /&gt;
 ATOM     31  HA  ASP     3      -4.072  -2.586   5.092&lt;br /&gt;
 ATOM     32  HB2 ASP     3      -1.530  -4.298   5.577&lt;br /&gt;
 ATOM     33  HB3 ASP     3      -2.540  -4.253   4.011&lt;br /&gt;
 ATOM     34  N   NH2     4      -4.375  -2.752   7.460&lt;br /&gt;
 ATOM     35  H1  NH2     4      -5.141  -2.858   6.795&lt;br /&gt;
 ATOM     36  H2  NH2     4      -4.535  -2.712   8.467&lt;br /&gt;
 END&lt;br /&gt;
&lt;br /&gt;
.com&lt;br /&gt;
&lt;br /&gt;
 %chk=File_Test_Glu-Asp.chk&lt;br /&gt;
 # hf/3-21g geom=connectivity&lt;br /&gt;
 &lt;br /&gt;
 Protein built with TINKER/Force Field Explorer&lt;br /&gt;
 &lt;br /&gt;
 0 1&lt;br /&gt;
 C(PDBName=CH3,ResName=ACE,ResNum=1)                   0.00000000    0.00000000    0.00000000&lt;br /&gt;
 C(PDBName=C,ResName=ACE,ResNum=1)                     0.00000000    0.00000000    1.51000000&lt;br /&gt;
 O(PDBName=O,ResName=ACE,ResNum=1)                     1.02900000    0.00000000    2.16600000&lt;br /&gt;
 H(PDBName=H1,ResName=ACE,ResNum=1)                    1.05600000    0.00000000   -0.34100000&lt;br /&gt;
 H(PDBName=H2,ResName=ACE,ResNum=1)                   -0.49800000    0.93200000   -0.34100000&lt;br /&gt;
 H(PDBName=H3,ResName=ACE,ResNum=1)                   -0.49800000   -0.93200000   -0.34100000&lt;br /&gt;
 N(PDBName=N,ResName=GLU,ResNum=2)                    -1.23600000    0.00000000    2.02700000&lt;br /&gt;
 C(PDBName=CA,ResName=GLU,ResNum=2)                   -1.44700000    0.00000000    3.47200000&lt;br /&gt;
 C(PDBName=C,ResName=GLU,ResNum=2)                    -2.51000000   -0.99300000    3.87900000&lt;br /&gt;
 O(PDBName=O,ResName=GLU,ResNum=2)                    -3.57000000   -1.09900000    3.28400000&lt;br /&gt;
 C(PDBName=CB,ResName=GLU,ResNum=2)                   -1.91000000    1.39700000    3.92400000&lt;br /&gt;
 C(PDBName=CG,ResName=GLU,ResNum=2)                   -2.13300000    1.39700000    5.44800000&lt;br /&gt;
 C(PDBName=CD,ResName=GLU,ResNum=2)                   -2.58500000    2.78100000    5.84800000&lt;br /&gt;
 O(PDBName=OE1,ResName=GLU,ResNum=2)                  -2.81700000    2.97400000    7.06100000&lt;br /&gt;
 O(PDBName=OE2,ResName=GLU,ResNum=2)                  -2.69300000    3.62900000    4.93600000&lt;br /&gt;
 H(PDBName=H,ResName=GLU,ResNum=2)                    -2.02600000    0.00000000    1.38100000&lt;br /&gt;
 H(PDBName=HA,ResName=GLU,ResNum=2)                   -0.50300000   -0.28100000    3.98400000&lt;br /&gt;
 H(PDBName=HB2,ResName=GLU,ResNum=2)                  -2.86200000    1.65300000    3.41200000&lt;br /&gt;
 H(PDBName=HB3,ResName=GLU,ResNum=2)                  -1.13300000    2.14600000    3.66500000&lt;br /&gt;
 H(PDBName=HG2,ResName=GLU,ResNum=2)                  -1.17000000    1.19100000    5.96100000&lt;br /&gt;
 H(PDBName=HG3,ResName=GLU,ResNum=2)                  -2.94600000    0.68600000    5.70200000&lt;br /&gt;
 N(PDBName=N,ResName=ASP,ResNum=3)                    -2.15400000   -1.72300000    4.94400000&lt;br /&gt;
 C(PDBName=CA,ResName=ASP,ResNum=3)                   -3.04800000   -2.74000000    5.49100000&lt;br /&gt;
 C(PDBName=C,ResName=ASP,ResNum=3)                    -3.12000000   -2.66400000    6.99800000&lt;br /&gt;
 O(PDBName=O,ResName=ASP,ResNum=3)                    -2.12800000   -2.53500000    7.69600000&lt;br /&gt;
 C(PDBName=CB,ResName=ASP,ResNum=3)                   -2.52700000   -4.13900000    5.11500000&lt;br /&gt;
 C(PDBName=CG,ResName=ASP,ResNum=3)                   -3.47700000   -5.15800000    5.69800000&lt;br /&gt;
 O(PDBName=OD1,ResName=ASP,ResNum=3)                  -3.21300000   -6.36200000    5.49300000&lt;br /&gt;
 O(PDBName=OD2,ResName=ASP,ResNum=3)                  -4.45500000   -4.71900000    6.34100000&lt;br /&gt;
 H(PDBName=H,ResName=ASP,ResNum=3)                    -1.24000000   -1.54800000    5.36200000&lt;br /&gt;
 H(PDBName=HA,ResName=ASP,ResNum=3)                   -4.07200000   -2.58600000    5.09200000&lt;br /&gt;
 H(PDBName=HB2,ResName=ASP,ResNum=3)                  -1.53000000   -4.29800000    5.57700000&lt;br /&gt;
 H(PDBName=HB3,ResName=ASP,ResNum=3)                  -2.54000000   -4.25300000    4.01100000&lt;br /&gt;
 N(PDBName=N,ResName=NH2,ResNum=4)                    -4.37500000   -2.75200000    7.46000000&lt;br /&gt;
 H(PDBName=H1,ResName=NH2,ResNum=4)                   -5.14100000   -2.85800000    6.79500000&lt;br /&gt;
 H(PDBName=H2,ResName=NH2,ResNum=4)                   -4.53500000   -2.71200000    8.46700000&lt;br /&gt;
 &lt;br /&gt;
 1 2 1.0 4 1.0 5 1.0 6 1.0&lt;br /&gt;
 2 3 2.0 7 1.5&lt;br /&gt;
 3&lt;br /&gt;
 4&lt;br /&gt;
 5&lt;br /&gt;
 6&lt;br /&gt;
 7 16 1.0 8 1.0&lt;br /&gt;
 8 9 1.0 11 1.0 17 1.0&lt;br /&gt;
 9 10 2.0 22 1.5&lt;br /&gt;
 10&lt;br /&gt;
 11 18 1.0 12 1.0 19 1.0&lt;br /&gt;
 12 21 1.0 13 1.0 20 1.0&lt;br /&gt;
 13 14 2.0 15 2.0&lt;br /&gt;
 14&lt;br /&gt;
 15&lt;br /&gt;
 16&lt;br /&gt;
 17&lt;br /&gt;
 18&lt;br /&gt;
 19&lt;br /&gt;
 20&lt;br /&gt;
 21&lt;br /&gt;
 22 23 1.0 30 1.0&lt;br /&gt;
 23 26 1.0 24 1.0 31 1.0&lt;br /&gt;
 24 34 1.5 25 2.0&lt;br /&gt;
 25&lt;br /&gt;
 26 27 1.0 33 1.0 32 1.0&lt;br /&gt;
 27 28 2.0 29 2.0&lt;br /&gt;
 28&lt;br /&gt;
 29&lt;br /&gt;
 30&lt;br /&gt;
 31&lt;br /&gt;
 32&lt;br /&gt;
 33&lt;br /&gt;
 34 35 1.0 36 1.0&lt;br /&gt;
 35&lt;br /&gt;
 36&lt;br /&gt;
&lt;br /&gt;
The issue going from .xyz straight to .com is that ALL protein information would be lost ie. residue information that is written into the .pdb at .xyz to .pdb comversion.&lt;br /&gt;
&lt;br /&gt;
===PYP files as downloaded compared to run through TINKER===&lt;br /&gt;
&lt;br /&gt;
*TINKER removes much of the header information&lt;br /&gt;
*TINKER only preserves the 1st model of a .pdb file (may be multiple models)&lt;br /&gt;
*TINKER removes far right 3 columns of information (not essential to the .pdb file)&lt;br /&gt;
*A virtual residue in the original .pdb comprising the chromophore was omitted from the TINKER output (number 126)&lt;br /&gt;
&lt;br /&gt;
===What each TINKER search algorithm does===&lt;br /&gt;
&lt;br /&gt;
There are several programs within TINKER that may be able to help locate a global minimum (not just one near the published crystal structure) by way of searching the potential energy surface. All of these are MM methods for use in TINKER.&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
&#039;&#039;&#039;SCAN&#039;&#039;&#039;&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
A program for general conformational search of an entire potential energy surface via a basin hopping method. The program takes as input a TINKER .xyz coordinates file which is then minimized to find the first local minimum for a search list. A series of activations along various normal modes from this initial minimum are used as seed points for additional minimizations. Whenever a previously unknown local minimum is located it is added to the search list. When all minima on the search list have been subjected to the normal mode activation without locating additional new minima, the program terminates. The individual local minima are written to cycle files as they are discovered. While the SCAN program can be used on standard undeformed potential energy surfaces, we have found it to be most useful for quickly ‘‘scanning’’ a smoothed energy surface to enumerate the major basins of attraction spaning the entire surface.&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;SNIFFER&#039;&#039;&#039;&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
A program that implements the Sniffer global optimization algorithm of Butler and Slaminka, a discrete version of Griewank’s global search trajectory method. The program takes an input TINKER .xyz coordinates file and shakes it vigorously via a modified dynamics trajectory before, hopefully, settling into a low lying minimum. Some trial and error is often required as the current implementation is sensitive to various parameters and tolerances that govern the computation. At present, these parameters are not user accessible, and must be altered in the source code. However, this method can do a good job of quickly optimizing conformation within a limited range of convergence.&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;PSS&#039;&#039;&#039;&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
Implements our version of a potential smoothing and search algorithm for the global optimization of molecular conformation. An initial structure in .xyz format is first minimized in Cartesian coordinates on a series of increasingly smoothed potential energy surfaces. Then the smoothing procedure is reversed with minimization on each successive surface starting from the coordinates of the minimum on the previous surface. A local search procedure is used during the backtracking to explore for alternative minima better than the one found during the current minimization. The final result is usually a very low energy conformation or, in favorable cases, the global energy minimum conformation. The minimum energy coordinate sets found on each surface during both the forward smoothing and backtracking procedures are placed in sequentially numbered cycle files.&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
&lt;br /&gt;
TINKER also contains programs that are able to perform MM optimisations:&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
&#039;&#039;&#039;MINIMIZE&#039;&#039;&#039;&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;The MINIMIZE program performs a limited memory L-BFGS minimization of an input structure over Cartesian coordinates using a modified version of the algorithm of Jorge Nocedal. The method requires only the potential energy and gradient at each step along the minimization pathway. It requires storage space proportional to the number of atoms in the structure. The MINIMIZE procedure is recommended for preliminary minimization of trial structures to an rms gradient of 1.0 to 0.1 kcal/mole/Å. It has a relatively fast cycle time and is tolerant of poor initial structures, but converges in a slow, linear fashion near the minimum. The user supplies the name of the TINKER .xyz coordinates file and a target rms gradient value at which the minimization will terminate. Output consists of minimization statistics written to the screen or redirected to an output file, and the new coordinates written to updated .xyz files or to cycle files.&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;OPTIMIZE&#039;&#039;&#039;&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;The OPTIMIZE program performs a optimally conditioned variable metric minimization of an input structure over Cartesian coordinates using an algorithm due to William Davidon. The method does not perform line searches, but requires computation of energies and gradients as well as storage for an estimate of the inverse Hessian matrix. The program operates on Cartesian coordinates from a TINKER .xyz file. OPTIMIZE will typically converge somewhat faster and more completely than MINIMIZE. However, the need to store and manipulate a full inverse Hessian estimate limits its use to structures containing less than a few hundred atoms on workstation class machines. As with the other minimizers, OPTIMIZE needs input coordinates and an rms gradient cutoff criterion. The output coordinates are saved in updated .xyz files or as cycle files.&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;NEWTON&#039;&#039;&#039;&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt; A truncated Newton minimization method which requires potential energy, gradient and Hessian information. This procedure has significant advantages over standard Newton methods, and is able to minimize very large structures completely. Several options are provided with respect to minimization method and preconditioning of the Newton equations. The default options are recommended unless the user is familiar with the math involved. This program operates in Cartesian coordinate space and is fairly tolerant of poor input structures. Typical algorithm iteration times are longer than with nonlinear conjugate gradient or variable metric methods, but many fewer iterations are required for complete minimization. NEWTON is usually the best choice for minimizations to the 0.01 to 0.000001 kcal/mole/Å level of rms gradient convergence. Tests for directions of negative curvature can be removed, allowing NEWTON to be used for optimization to conformational transition state structures (this only works if the starting point is very close to the transition state). Input consists of a TINKER .xyz coordinates file; output is an updated set of minimized coordinates and minimization statistics.&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
&lt;br /&gt;
==Tues 17th Nov 2009==&lt;br /&gt;
&lt;br /&gt;
Comparing 2 .pdb files:&lt;br /&gt;
&lt;br /&gt;
*As downloaded from PDB&lt;br /&gt;
*Converted to .xyz in TINKER then back to .pdb&lt;br /&gt;
&lt;br /&gt;
These files were then cleaned up to compare in a terminal using the &amp;quot;diff&amp;quot; command:&lt;br /&gt;
&lt;br /&gt;
*Header information and end information removed&lt;br /&gt;
*3 columns to the right in PDB version were removed to become consistent with the TINEKR file&lt;br /&gt;
*The first column &amp;quot;ATOM&amp;quot; was removed effectively making the line numbers equal to the atom numbers in both files&lt;br /&gt;
&lt;br /&gt;
==Wed 18th Nov 2009==&lt;br /&gt;
&lt;br /&gt;
1st diff was on line 52, a HYDROGEN ATOM:&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
&#039;&#039;52  HE2 HIS A   3    13.349 -10.803   9.527&#039;&#039;&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
added to the TINKER output, not in PDB file.&lt;br /&gt;
&lt;br /&gt;
2nd diff was on line 657, a HYDROGEN ATOM:&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
&#039;&#039;657  HE2 GLU A  46	 -0.867  -5.216  -0.822&#039;&#039;&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
added to the PDB file, not in TINKER output&lt;br /&gt;
&lt;br /&gt;
3rd diff was on line 1007, a HYDROGEN ATOM (number 1008):&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
&#039;&#039;5HG  CYS A  69	 -8.909  -5.194   5.526&#039;&#039;&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
added to the TINKER output, not in PDB file.&lt;br /&gt;
&lt;br /&gt;
4th diff was  on line 1621, a HYDROGEN ATOM (number 1623):&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
&#039;&#039;HE2 HIS A 108	  4.663   7.459   6.879&#039;&#039;&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
added to the TINKER output, not in PDB file.&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Following these reconciliations, both files were exactly the same with 1911 atoms present in each. 3 atoms were removed from the TINKER file and 1 was removed from the PDB file.&lt;br /&gt;
&lt;br /&gt;
*NOTE - loading the original .pdb from PDB to Gaussview then converting to a .gjf results in an error &amp;quot;1 or more PDB atom names has a &#039; or * character which can cause problems for Gaussian&amp;quot; - this is because the PDB file has extra atoms on the end of the file to define a highlighted chromophore... After these were removed in a txt editor the .gjf save went smoothly. Infact after this removal the file will run an AMBEr single point energy calculation fine - cf running when the extra info is in the bottom.&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
&lt;br /&gt;
All differences in files were due to protonation states of atoms... not the AA backbone.&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
&lt;br /&gt;
===TINKER vs PDB===&lt;br /&gt;
[[Image:TINKER PROTON.png]]&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
[[Image:PDB_PROTON.png]]&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
The highlighted fragments represent where the respective file version is protonated and the other is not.&lt;br /&gt;
&lt;br /&gt;
==Fri 20th Nov 2009==&lt;br /&gt;
&lt;br /&gt;
In an attempt to load the 1W7S structure to TINKER and successfully convert it it a .xyz format the following procedure was undertaken:&lt;br /&gt;
&lt;br /&gt;
*&lt;br /&gt;
&lt;br /&gt;
==Mon 23rd Nov 2009==&lt;br /&gt;
&lt;br /&gt;
===Protonation of GFP at various pH conditions===&lt;br /&gt;
&lt;br /&gt;
With 230 amino acid residues, each possessing a unique pKa in GFP it is obvious that there is a huge spectrum of protonation states available at a single solvation pH. Since protons cannot be detected using X-ray diffraction, protonation states must be uncovered via a difference method. Several questions arise when thinking about protonation of the protein:&lt;br /&gt;
&lt;br /&gt;
*Which residues have the ability to be protonated/deprotonated&lt;br /&gt;
*Is the PDB structure protonated for a specific pH (Bio pH)&lt;br /&gt;
*How does the protonation of the protein change through a variety of solvation pHs&lt;br /&gt;
*Does the protonation state affect the chromophore photophysics (electronic embedding)&lt;br /&gt;
&lt;br /&gt;
Answering these questions:&lt;br /&gt;
&lt;br /&gt;
*Polar amino acids are those that can be protonated/deprotonated, these are identified by Gaussview and shown in the atom list of the molecule with the option of setting them to either protonated/deprotonated.&lt;br /&gt;
*Dont know if the current structure is protonated to any specific pH... this is unclear until protonate at certain pHs&lt;br /&gt;
*The protonation will be tested across a range of pHs to see how protonation changes&lt;br /&gt;
&lt;br /&gt;
==Tues 24th Nov 2009==&lt;br /&gt;
&lt;br /&gt;
The pKa value of a residue will indicate protonation state as a function of pH. If you know the pH of your solution then if the pKa of a given functional group is lower than this pH then the functional group will be unprotonated. If the pKa is higher than the pH then the functional group will be protonated. &#039;&#039;&#039;Beware that pKa values can be distorted by local conformational effects.&#039;&#039;&#039; The structure is protonated as though it were in pH 7 as it is loaded into Gaussview. Since hydrogens are defined in the PDB file it could be that the author added protons under these conditions.&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Histadine has 2 protonation points on the imidizole R chain:&lt;br /&gt;
&lt;br /&gt;
{|&lt;br /&gt;
| align=&amp;quot;center&amp;quot; style=&amp;quot;background:#f0f0f0;&amp;quot;|&#039;&#039;&#039;Yes-Yes&#039;&#039;&#039;&lt;br /&gt;
| align=&amp;quot;center&amp;quot; style=&amp;quot;background:#f0f0f0;&amp;quot;|&#039;&#039;&#039;Yes-No&#039;&#039;&#039;&lt;br /&gt;
| align=&amp;quot;center&amp;quot; style=&amp;quot;background:#f0f0f0;&amp;quot;|&#039;&#039;&#039;No-Yes&#039;&#039;&#039;&lt;br /&gt;
|-&lt;br /&gt;
|&amp;lt;jmol&amp;gt;&lt;br /&gt;
  &amp;lt;jmolApplet&amp;gt;&lt;br /&gt;
    &amp;lt;uploadedFileContents&amp;gt;Protein_yes_yes.mol&amp;lt;/uploadedFileContents&amp;gt;&lt;br /&gt;
    &amp;lt;color&amp;gt;#FFFFFF&amp;lt;/color&amp;gt;&lt;br /&gt;
    &amp;lt;size&amp;gt;400&amp;lt;/size&amp;gt;&lt;br /&gt;
    &amp;lt;text&amp;gt;Yes Yes&amp;lt;/text&amp;gt;&lt;br /&gt;
  &amp;lt;/jmolApplet&amp;gt;&lt;br /&gt;
&amp;lt;/jmol&amp;gt;||&amp;lt;jmol&amp;gt;&lt;br /&gt;
  &amp;lt;jmolApplet&amp;gt;&lt;br /&gt;
    &amp;lt;uploadedFileContents&amp;gt;Protein_yes_no.mol&amp;lt;/uploadedFileContents&amp;gt;&lt;br /&gt;
    &amp;lt;color&amp;gt;#FFFFFF&amp;lt;/color&amp;gt;&lt;br /&gt;
    &amp;lt;size&amp;gt;400&amp;lt;/size&amp;gt;&lt;br /&gt;
    &amp;lt;text&amp;gt;Yes Yes&amp;lt;/text&amp;gt;&lt;br /&gt;
  &amp;lt;/jmolApplet&amp;gt;&lt;br /&gt;
&amp;lt;/jmol&amp;gt;||&amp;lt;jmol&amp;gt;&lt;br /&gt;
  &amp;lt;jmolApplet&amp;gt;&lt;br /&gt;
    &amp;lt;uploadedFileContents&amp;gt;Protein_no_yes.mol&amp;lt;/uploadedFileContents&amp;gt;&lt;br /&gt;
    &amp;lt;color&amp;gt;#FFFFFF&amp;lt;/color&amp;gt;&lt;br /&gt;
    &amp;lt;size&amp;gt;400&amp;lt;/size&amp;gt;&lt;br /&gt;
    &amp;lt;text&amp;gt;Yes Yes&amp;lt;/text&amp;gt;&lt;br /&gt;
  &amp;lt;/jmolApplet&amp;gt;&lt;br /&gt;
&amp;lt;/jmol&amp;gt;&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
The imidazole sidechain of histidine has a pKa of approximately 6, and overall, the amino acid has a pKa of 7.6. This means that at physiologically relevant pH values, relatively small shifts in pH will change its average charge. Below a pH of 6, the imidazole ring is mostly protonated as described by the Henderson–Hasselbalch equation. When protonated, the imidazole ring bears two NH bonds and has a positive charge. The positive charge is equally distributed between both nitrogens and can be represented with two equally important resonance structures.&lt;br /&gt;
&lt;br /&gt;
===Protonation===&lt;br /&gt;
&lt;br /&gt;
At pH 7 the PDB file protonations are correct for all residues. Moving to pH 5 the histadine residues become protonated (pKa 6)- this is the only change. At more alkaline pHs it would take pH 9.1 to make any effect (deprotonating CYS and then TYR at 9.7)&lt;br /&gt;
&lt;br /&gt;
===Mini-Biomolecule===&lt;br /&gt;
&lt;br /&gt;
To test ONIOM calculations vs pure DFT methods and other calculation types on a bio system that is smaller than GFP (computationally less expensive, can streamline the real calcs)&lt;br /&gt;
&lt;br /&gt;
Trying a molecule analogous to the example given in the ONIOM tutorial...&lt;br /&gt;
&lt;br /&gt;
&amp;lt;jmol&amp;gt;&lt;br /&gt;
  &amp;lt;jmolApplet&amp;gt;&lt;br /&gt;
    &amp;lt;uploadedFileContents&amp;gt;Biomolecule1.mol&amp;lt;/uploadedFileContents&amp;gt;&lt;br /&gt;
    &amp;lt;color&amp;gt;#FFFFFF&amp;lt;/color&amp;gt;&lt;br /&gt;
    &amp;lt;size&amp;gt;400&amp;lt;/size&amp;gt;&lt;br /&gt;
    &amp;lt;text&amp;gt;Yes Yes&amp;lt;/text&amp;gt;&lt;br /&gt;
  &amp;lt;/jmolApplet&amp;gt;&lt;br /&gt;
&amp;lt;/jmol&amp;gt;&lt;br /&gt;
&lt;br /&gt;
==Thurs 26th Nov 2009==&lt;br /&gt;
&lt;br /&gt;
===Checking the GFP input will run===&lt;br /&gt;
&lt;br /&gt;
Procedure to get 1WS7 to run an AMBER energy calculation:&lt;br /&gt;
&lt;br /&gt;
*Load the PDB file without water&lt;br /&gt;
*Try opt amber but will fail due to missing parameters (all in the chromophore)&lt;br /&gt;
*Set these parameters to 0 since they cancel in ONIOM&lt;br /&gt;
*Freeze the atotms involving missing paramters&lt;br /&gt;
*Energy AMBER will now run for the file - NO CHARGES - 9.07881632au&lt;br /&gt;
*Open the Atom Editor, Edit, MM charges, Assign all atoms&lt;br /&gt;
*Manually edit any missing charges&lt;br /&gt;
*Energy AMBER will now run including charges (basic AMBER94 charges) - 0.260391165au&lt;br /&gt;
&lt;br /&gt;
Charges in this file did not sum to an integer number - the only residue with a non integer charge was no.64 with a charge of 0.61160000&lt;br /&gt;
&lt;br /&gt;
 MM sanity checks:&lt;br /&gt;
 All charges sum to:               -5.38840000&lt;br /&gt;
 Charges of atoms sum to:          -5.38840000&lt;br /&gt;
 Layer 1 (S) has     34 atoms and charge   0.61160000 sum=   0.61160000&lt;br /&gt;
 Layer 2 (R) adds  3574 atoms and charge  -6.00000000 sum=  -5.38840000&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
 WARNING -- The following centers have non-zero partial&lt;br /&gt;
 charges, but zero Vanderwaals parameters. These centers&lt;br /&gt;
 can collapse onto other centers with non-zero charge&lt;br /&gt;
 Center    Charge&lt;br /&gt;
   1833  0.423900&lt;br /&gt;
   1890  0.410200&lt;br /&gt;
   2029  0.427500&lt;br /&gt;
   2043  0.427500&lt;br /&gt;
   2080  0.410200&lt;br /&gt;
   2089  0.399200&lt;br /&gt;
   2123  0.410200&lt;br /&gt;
   2179  0.410200&lt;br /&gt;
   2186  0.410200&lt;br /&gt;
   2260  0.410200&lt;br /&gt;
   2287  0.410200&lt;br /&gt;
   2294  0.410200&lt;br /&gt;
   2347  0.427500&lt;br /&gt;
   2369  0.399200&lt;br /&gt;
   2469  0.427500&lt;br /&gt;
   2508  0.399200&lt;br /&gt;
   2551  0.410200&lt;br /&gt;
   2619  0.399200&lt;br /&gt;
   2639  0.410200&lt;br /&gt;
   2714  0.410200&lt;br /&gt;
   2924  0.399200&lt;br /&gt;
   2939  0.399200&lt;br /&gt;
   2950  0.427500&lt;br /&gt;
   2981  0.399200&lt;br /&gt;
   3180  0.427500&lt;br /&gt;
   3233  0.399200&lt;br /&gt;
   3262  0.410200&lt;br /&gt;
   3361  0.399200&lt;br /&gt;
   3377  0.427500&lt;br /&gt;
   3384  0.410200&lt;br /&gt;
   3397  0.427500&lt;br /&gt;
   3418  0.427500&lt;br /&gt;
   3562  0.410200&lt;br /&gt;
   3593  0.410200&lt;br /&gt;
   3608  0.410200&lt;br /&gt;
&lt;br /&gt;
* oniom(pm6:amber=hardfirst) geom=connectivity was run successfully, resulting in an energy of 0.210085730788au&lt;br /&gt;
&lt;br /&gt;
This still resulted in some charge problems:&lt;br /&gt;
&lt;br /&gt;
 MM sanity checks:&lt;br /&gt;
 All charges sum to:               &#039;&#039;&#039;-5.38840000&#039;&#039;&#039;&lt;br /&gt;
 Charges of atoms sum to:           &#039;&#039;&#039;0.79320000&#039;&#039;&#039;&lt;br /&gt;
 Residue  63 PDB Number    &#039;&#039;&#039;64_A PHE has&#039;&#039;&#039;     1 atoms, and a charge of   0.59730000&lt;br /&gt;
 Residue  64 PDB Number    &#039;&#039;&#039;66_A CSY has&#039;&#039;&#039;    34 atoms, and a charge of   0.61160000&lt;br /&gt;
 Residue  65 PDB Number    &#039;&#039;&#039;68_A VAL has&#039;&#039;&#039;     1 atoms, and a charge of  -0.41570000&lt;br /&gt;
 WARNING -- The following centers have non-zero partial&lt;br /&gt;
 charges, but zero Vanderwaals parameters. These centers&lt;br /&gt;
 can collapse onto other centers with non-zero charge&lt;br /&gt;
 Center    Charge&lt;br /&gt;
   &#039;&#039;&#039;3608  0.410200&#039;&#039;&#039;&lt;br /&gt;
 MMInit generated parameter data with length LenPar=       40470.&lt;br /&gt;
 Standard basis: Dummy (5D, 7F)&lt;br /&gt;
 There are    36 symmetry adapted basis functions of A   symmetry.&lt;br /&gt;
 Integral buffers will be    262144 words long.&lt;br /&gt;
 Raffenetti 1 integral format.&lt;br /&gt;
 Two-electron integral symmetry is turned on.&lt;br /&gt;
    36 basis functions,    36 primitive gaussians,    36 cartesian basis functions&lt;br /&gt;
    27 alpha electrons       27 beta electrons&lt;br /&gt;
       nuclear repulsion energy       200.9124124811 Hartrees.&lt;br /&gt;
 NAtoms= 3608 NActive=   36 NUniq=   36 SFac= 7.50D-01 NAtFMM=   80 NAOKFM=F Big=F&lt;br /&gt;
 AMBER calculation of energy.&lt;br /&gt;
 Energy=    0.110474559     NIter=   0.&lt;br /&gt;
 Dipole moment=       3.390901       5.889448      -2.505476&lt;br /&gt;
 ONIOM: saving gridpoint  1&lt;br /&gt;
 ONIOM: restoring gridpoint  3&lt;br /&gt;
 ONIOM: calculating energy.&lt;br /&gt;
 ONIOM: gridpoint  1 method:  low   system:  model energy:     0.110474559177&lt;br /&gt;
 ONIOM: gridpoint  2 method:  high  system:  model energy:     0.060169124949&lt;br /&gt;
 ONIOM: gridpoint  3 method:  low   system:  real  energy:     0.260391165016&lt;br /&gt;
 ONIOM: extrapolated energy =       0.210085730788&lt;br /&gt;
 ONIOM: calculating electric field derivatives.&lt;br /&gt;
 ONIOM: Integrating ONIOM file  4 number   619&lt;br /&gt;
 ONIOM: Dipole        =-2.76174164D+01-1.34762322D+02 3.99239712D+01&lt;br /&gt;
 ONIOM: Dipole moment (Debye):&lt;br /&gt;
    X=   -70.1965    Y=  -342.5316    Z=   101.4766  Tot=   364.0782&lt;br /&gt;
 ONIOM: Integrating ONIOM file  5 number   695&lt;br /&gt;
&lt;br /&gt;
An ONIOM calculation was run using a DFT method on the high level and AMBER on the low level:  &#039;&#039;oniom(b3lyp/3-21g:amber=hardfirst) geom=connectivity&#039;&#039; which ran successfully...&lt;br /&gt;
&lt;br /&gt;
===Charges===&lt;br /&gt;
&lt;br /&gt;
Charges in the protein are important for the calculation that will be run on GFP eventually since they provide the electrostatic field or &amp;quot;atmosphere&amp;quot; which surrounds the chromophore, polarizing the electron density used by the DFT method to evaluate the chromophore. Gaussian offers 2 types of influence which the MM portion can exert on the QM part:&lt;br /&gt;
&lt;br /&gt;
* Mechanical Embedding&lt;br /&gt;
* Electronic Embedding&lt;br /&gt;
&lt;br /&gt;
Mechanical embedding does not use any electronic components to affect the central high region. Electronic embedding does use the charge generated in the MM region to polarize the high region.&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
At pH 7 the GFP molecule is in a 6- charge state according to assigning AMBER charges and summing the residue charges (available in the out file). 3 options were considered at this stage for treating charges:&lt;br /&gt;
&lt;br /&gt;
* Ignore charges on the entire molecule&lt;br /&gt;
* Use AMBER94 assigned partial charges on the low region&lt;br /&gt;
* Use Qeq to assign charges based on an overall charge of -6&lt;br /&gt;
&lt;br /&gt;
The first option can be discarded since the electronic atmosphere is of particular interest here. The second 2 options are to be investigated. See below (on a small Biomolecule)&lt;br /&gt;
&lt;br /&gt;
ON a 4 residue chain (in mini biomolecule folder) - As in the GFP tutorial on the wiki, a energy calculation was submitted with the keyword amber=qeq to get back the charges across the molecule. (not entirely sure what this does yet). AS a test of the effect of charges on a molecule a test molecule was compiled in TINKER consisting of 4 residues with 0 overall charges....&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;jmol&amp;gt;&lt;br /&gt;
  &amp;lt;jmolAppletButton&amp;gt;&lt;br /&gt;
    &amp;lt;uploadedFileContents&amp;gt;Protein1_AMBER_OPT_qeq.mol&amp;lt;/uploadedFileContents&amp;gt;&lt;br /&gt;
    &amp;lt;text&amp;gt;Test Protien&amp;lt;/text&amp;gt;&lt;br /&gt;
    &amp;lt;color&amp;gt;#FFFFFF&amp;lt;/color&amp;gt;&lt;br /&gt;
    &amp;lt;size&amp;gt;600&amp;lt;/size&amp;gt;&lt;br /&gt;
    &amp;lt;/jmolAppletButton&amp;gt;&lt;br /&gt;
&amp;lt;/jmol&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
&lt;br /&gt;
{|&lt;br /&gt;
|No Charges|| 0.01940311au&lt;br /&gt;
|-&lt;br /&gt;
|Amber Charges|| -0.18117151au&lt;br /&gt;
|-&lt;br /&gt;
|Qeq Charges|| -0.23038199au&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
Clearly the Qeq shows the lowest energy - the considerable differences in energy show that charges are an important part of the input.&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
&lt;br /&gt;
===Testing Charges and EE on a small system===&lt;br /&gt;
&lt;br /&gt;
&amp;lt;jmol&amp;gt;&lt;br /&gt;
  &amp;lt;jmolAppletButton&amp;gt;&lt;br /&gt;
    &amp;lt;uploadedFileContents&amp;gt;Chromopre_molecule_B3LYP_6-31G(d).mol&amp;lt;/uploadedFileContents&amp;gt;&lt;br /&gt;
    &amp;lt;script&amp;gt;spin on&amp;lt;/script&amp;gt;&lt;br /&gt;
    &amp;lt;text&amp;gt;Test Protien&amp;lt;/text&amp;gt;&lt;br /&gt;
    &amp;lt;color&amp;gt;#FFFFFF&amp;lt;/color&amp;gt;&lt;br /&gt;
    &amp;lt;size&amp;gt;600&amp;lt;/size&amp;gt;&lt;br /&gt;
    &amp;lt;/jmolAppletButton&amp;gt;&lt;br /&gt;
&amp;lt;/jmol&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Energies of the optimisations using various charge approximations and with/without EE&lt;br /&gt;
{|class=&amp;quot;wikitable&amp;quot; border=&amp;quot;1&amp;quot;&lt;br /&gt;
!Method||Mechanical Embedding||Electronic Embedding||Computational Time (mechanical embedding)||Computational Time (electronic embedding)&lt;br /&gt;
|-&lt;br /&gt;
|B3LYP 6-31G*||-1406.04||N/A||11 hours 18 minutes 44.7 seconds||11 hours 18 minutes 44.7 seconds&lt;br /&gt;
|-&lt;br /&gt;
|ONIOM B3LYP 6-31G*:AMBER (No Charges)||-783.2225||-783.2223||2 hours 13 minutes  4.1 seconds||&lt;br /&gt;
|-&lt;br /&gt;
|ONIOM B3LYP 6-31G*:AMBER (Amber Charges)|| || ||  ||&lt;br /&gt;
|-&lt;br /&gt;
|ONIOM B3LYP 6-31G*:AMBER (Qeq Charges)||-783.2467|| ||2 hours  6 minutes  8.6 seconds ||&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
NB. &lt;br /&gt;
* B3LYP 6-31G* and ONIOM B3LYP 6-31G*:AMBER (No Charges) were from the same starting geometry.&lt;br /&gt;
* ONIOM B3LYP 6-31G*:AMBER (Amber Charges) and ONIOM B3LYP 6-31G*:AMBER (Qeq Charges) were from the same starting geometry.&lt;br /&gt;
* All Electronic embedding is from the same starting point - Can compare times.&lt;br /&gt;
&lt;br /&gt;
Dipole moments using various charge approximations and with/without EE&lt;br /&gt;
{|class=&amp;quot;wikitable&amp;quot; border=&amp;quot;1&amp;quot;&lt;br /&gt;
!Method||Dipole moment (No EE)||Dipole moment (With EE)&lt;br /&gt;
|-&lt;br /&gt;
|B3LYP 6-31G*||7.4914||N/A&lt;br /&gt;
|-&lt;br /&gt;
|ONIOM B3LYP 6-31G*:AMBER (No Charges)||3.2328||3.2241&lt;br /&gt;
|-&lt;br /&gt;
|ONIOM B3LYP 6-31G*:AMBER (Amber Charges)||5.2794||5.2865&lt;br /&gt;
|-&lt;br /&gt;
|ONIOM B3LYP 6-31G*:AMBER (Qeq Charges)||8.173||8.1654&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
==2nd Dec 2009==&lt;br /&gt;
&lt;br /&gt;
A single point energy of 1w7s was run with the additional qeq keyword to return partial charges (overall charge set to -6)&lt;br /&gt;
This new molecule was submitted for AMBER opt but failed after around 700 iterations... this could be due to the error:&lt;br /&gt;
&lt;br /&gt;
 WARNING -- The following centers have non-zero partial&lt;br /&gt;
 charges, but zero Vanderwaals parameters. These centers&lt;br /&gt;
 can collapse onto other centers with non-zero charge&lt;br /&gt;
&lt;br /&gt;
When the atoms defined in this error had their partial charges set to 0.00, the result was a fully converged AMBER optimisation:&lt;br /&gt;
&lt;br /&gt;
*Frozen chromophore.&lt;br /&gt;
*pH 7 giving a charge of -6&lt;br /&gt;
*Charges assigned using Qeq method&lt;br /&gt;
*Zero Van der Waals parameter atoms partial charges set to 0.00&lt;br /&gt;
&lt;br /&gt;
Looking at what atoms are not defined in terms of Van der Waals paramters it is ALWAYS an aloholic proton that is not parameterised...&lt;br /&gt;
&lt;br /&gt;
Tutorial used AMBER charges then ran Qeq for the missing charges... i think this is what she thought she did... gaussian &amp;quot;amber=qeq&amp;quot; will overwrite all charges.. need to use &amp;quot;amber=uncharged&amp;quot;&lt;br /&gt;
so that we dont overwrite the amber charges...&lt;br /&gt;
&lt;br /&gt;
In effect full Qeq charges were assigned in the tutorial. Also overall charge is 0 - will check this.&lt;br /&gt;
&lt;br /&gt;
===The GFP tutorial===&lt;br /&gt;
&lt;br /&gt;
*AMBER charges (AMBER94 charges) were assigned to the GFP molecule in the tutorial&lt;br /&gt;
*she notices correctly that some are missing in the chromophore and so decides to assign them via Qeq&lt;br /&gt;
*Runs Qeq on the GFP molecule with a &#039;&#039;&#039;0 charge&#039;&#039;&#039; which is wrong! When run the AMBER charges and look at the log file we can see that the overall charge should be &#039;&#039;&#039;+4&#039;&#039;&#039; for that protonation state.&lt;br /&gt;
*Since Qeq is an equilibrium method, partial charges are solved with the constraint that their sum must equal the overall charge.&lt;br /&gt;
*These Qeq charges should therefore not fit with an integer sum when pasted into the AMBER charges.&lt;br /&gt;
&lt;br /&gt;
===Qeq versus AMBER94 charges===&lt;br /&gt;
&lt;br /&gt;
Ogawa et al. state:&lt;br /&gt;
&lt;br /&gt;
 Geometry optimizations of a crambin protein were performed using force fields combined with the&lt;br /&gt;
 CQEq. The CQEq charges were found to be more comparable to the Mulliken charges obtained by the ab initio molecular orbital&lt;br /&gt;
 calculation than the fixed charges used in the AMBER force field. Furthermore, the CQEq charges give more realistic charge redistribution&lt;br /&gt;
 between amino acids in the crambin.&lt;br /&gt;
&lt;br /&gt;
Mini Biomolecule calculations:&lt;br /&gt;
&lt;br /&gt;
{|class=&amp;quot;wikitable&amp;quot; border=&amp;quot;1&amp;quot;&lt;br /&gt;
!Method||Dipole moment (No EE)||Dipole moment (With EE)&lt;br /&gt;
|-&lt;br /&gt;
|B3LYP 6-31G*||7.4914||N/A&lt;br /&gt;
|-&lt;br /&gt;
|ONIOM B3LYP 6-31G*:AMBER (No Charges)||3.2328||3.2241&lt;br /&gt;
|-&lt;br /&gt;
|ONIOM B3LYP 6-31G*:AMBER (Amber Charges)||5.2794||5.2865&lt;br /&gt;
|-&lt;br /&gt;
|ONIOM B3LYP 6-31G*:AMBER (Qeq Charges)||8.173||8.1654&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
Clearly the Qeq charges are in better agreement with the DFT method than no or AMBER charges.&lt;br /&gt;
&lt;br /&gt;
These frequencies are calulated using mechanical embedding!&lt;br /&gt;
&lt;br /&gt;
{|class=&amp;quot;wikitable&amp;quot; border=&amp;quot;1&amp;quot;&lt;br /&gt;
!Method||C=O||C=C&lt;br /&gt;
|-&lt;br /&gt;
|B3LYP 6-31G*||1799.65||1701.86&lt;br /&gt;
|-&lt;br /&gt;
|ONIOM B3LYP 6-31G*:AMBER (No Charges)||1802.25||1703.63&lt;br /&gt;
|-&lt;br /&gt;
|ONIOM B3LYP 6-31G*:AMBER (Amber Charges)||1803.37||1701.67&lt;br /&gt;
|-&lt;br /&gt;
|ONIOM B3LYP 6-31G*:AMBER (Qeq Charges)||1800.18||1702.86&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
As a result it seems obvious that a Qeq charge assignment in constraint by the correct total charge gives results most comparable to the pure QM method (both dipoles and frequencies)&lt;br /&gt;
&lt;br /&gt;
==3rd Dec 2009==&lt;br /&gt;
===ESP Charges===&lt;br /&gt;
&lt;br /&gt;
ESP calculates the expectation values of the electrostatic potential of a molecule on a uniform distribution of points. The resultant ESP surface is then fit to atom centered charges that best reproduce the electron distribution in a least-squares method. The set of points defining the default surface is generated according to the algorithm of Connolly.&lt;br /&gt;
&lt;br /&gt;
ESP charges thus require a QM calculation to create the molecular electrostatic potential used to fit atom centered charges. The size of GFP means that this is not an efficient method for use in this case?&lt;br /&gt;
&lt;br /&gt;
===Qeq Charges===&lt;br /&gt;
&lt;br /&gt;
The charge equilibration (QEq) method treats the atomic charge variation through the equilibration of electronegativities at atomic sites. The advantage of the QEq method is that the polarization effect in MM is taken into account by variation of the charge distribution in the molecules. The atomic charge variation relates to the intramolecular geometrical changes.&lt;br /&gt;
&lt;br /&gt;
===AMBER charges===&lt;br /&gt;
&lt;br /&gt;
==8th Dec 2009==&lt;br /&gt;
&lt;br /&gt;
A input creation page is now avaliable:&lt;br /&gt;
&lt;br /&gt;
https://www.ch.ic.ac.uk/wiki/index.php/AndyForesterGFP&lt;br /&gt;
&lt;br /&gt;
The final input file had protein information pasted in using text to columns in excel and now displays PDB residue information and PDB secondary structure information (nice cartoon pictures in ChemBio3D!)&lt;br /&gt;
&lt;br /&gt;
Interesting is a paper that uses TD-DFT on GFP (in my papers folder on the mac), the group uses a 3 stage minimisation method from the 1GFL PDB structure:&lt;br /&gt;
&lt;br /&gt;
* Freeze all atoms in the backbone of the protein and optimise after adding hydrogens&lt;br /&gt;
* Optimise the chromophore at a semi-empirical level&lt;br /&gt;
* They cut out a chromophore section but i could then perform a full QM/MM geometry optimisation&lt;br /&gt;
&lt;br /&gt;
===A more charged mini molecule to test electronic embedding===&lt;br /&gt;
&lt;br /&gt;
&amp;lt;jmol&amp;gt;&lt;br /&gt;
  &amp;lt;jmolAppletButton&amp;gt;&lt;br /&gt;
    &amp;lt;uploadedFileContents&amp;gt;More_charge_B3LYP_6_31G(d).mol&amp;lt;/uploadedFileContents&amp;gt;&lt;br /&gt;
    &amp;lt;text&amp;gt;The molecule with 2+ charge&amp;lt;/text&amp;gt;&lt;br /&gt;
    &amp;lt;color&amp;gt;#FFFFFF&amp;lt;/color&amp;gt;&lt;br /&gt;
    &amp;lt;size&amp;gt;600&amp;lt;/size&amp;gt;&lt;br /&gt;
    &amp;lt;/jmolAppletButton&amp;gt;&lt;br /&gt;
&amp;lt;/jmol&amp;gt;&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
&lt;br /&gt;
The molecule was submitted with 3 different job lines:&lt;br /&gt;
&lt;br /&gt;
* b3lyp 6-31g(d)&lt;br /&gt;
* oniom(b3lyp 6-31g(d):amber=(hardfirst,qeq))&lt;br /&gt;
* oniom(b3lyp 6-31g(d):amber=(hardfirst,qeq))=embedcharge&lt;br /&gt;
&lt;br /&gt;
Opt and Freq jobs were completed successfully. The charge breakdown input was:&lt;br /&gt;
&lt;br /&gt;
* Low Real 2 1&lt;br /&gt;
* High Model 0 1&lt;br /&gt;
* Low Model 0 1&lt;br /&gt;
&lt;br /&gt;
The aim of the calculations was to test electronic embedding versus the mechanical embedding. A paper suggests that there is a large difference in accuracy between the two methods. Qeq charge equilibration was used in both cases as it has already been proved to be much more effective than AMBER assigned charges. &#039;&#039;&#039;It has consistently overestimated the dipole moment though&#039;&#039;&#039;.&lt;br /&gt;
&lt;br /&gt;
{|class=&amp;quot;wikitable&amp;quot; border=&amp;quot;1&amp;quot;&lt;br /&gt;
!Method||Dipole moment&lt;br /&gt;
|-&lt;br /&gt;
|B3LYP 6-31G*||8.02&lt;br /&gt;
|-&lt;br /&gt;
|ONIOM B3LYP 6-31G*:AMBER (ME)||12.22&lt;br /&gt;
|-&lt;br /&gt;
|ONIOM B3LYP 6-31G*:AMBER (EE)||12.23&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
These results are odd since we expect the EE to have some effect on the charge distribution in the molecule (it is polarising the QM region) comparing the frequencies as a further check:&lt;br /&gt;
&lt;br /&gt;
{|class=&amp;quot;wikitable&amp;quot; border=&amp;quot;1&amp;quot;&lt;br /&gt;
!Method||C=O||C=C&lt;br /&gt;
|-&lt;br /&gt;
|B3LYP 6-31G*||1764||1636&lt;br /&gt;
|-&lt;br /&gt;
|ONIOM B3LYP 6-31G*:AMBER (ME)||1797||1697&lt;br /&gt;
|-&lt;br /&gt;
|ONIOM B3LYP 6-31G*:AMBER (EE)||1797||1697&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
Again the EE has made no difference to the frequency of various strong vibrations in the molecule. The conclusion is thus either:&lt;br /&gt;
&lt;br /&gt;
* EE has no real effect on the moleucule&lt;br /&gt;
* EE is not working properly&lt;br /&gt;
&lt;br /&gt;
Assuming the latter further investigation discovered that the meechanism of EE is determined using scaling parameters for MM charges during electronic embedding in the QM calculations. As a default the charges are scaled 555500 using the nomenclature ScaleCharge=ijklmn where n is closest to the QM region (massively anti intuitive). The input is read left to right and not all scaling factors need be defined. Further to this, the value given is then multiplied by 0.2 to arrive at the actual scaling coefficient. The result is thus for an atom:&lt;br /&gt;
&lt;br /&gt;
* 1 bond from the QM region (n) = 0.2n           Default is 0&lt;br /&gt;
* 2 bonds from the QM region (m) = 0.2m       Default is 0&lt;br /&gt;
* 3 bonds from the QM region (l) = 0.2l            Default is 5&lt;br /&gt;
* 4 bonds from the QM region (k) = 0.2k          Default is 5&lt;br /&gt;
* 5 bonds from the QM region (j) = 0.2j            Default is 5&lt;br /&gt;
* 6 bonds from the QM region (i) = 0.2i            Default is 5&lt;br /&gt;
&lt;br /&gt;
So all the atoms (and partial charges) in the MM region in the test molecule were being scaled to 0 when performing the EE. This explains the fact that no difference was uncovered in the previous calculation set.&lt;br /&gt;
&lt;br /&gt;
A new calculation was submitted with the command scalecharge=555 so that all charges in the MM part will be scaled to 1 (0.2xn,m,l)&lt;br /&gt;
&lt;br /&gt;
===Charges...again===&lt;br /&gt;
&lt;br /&gt;
So since i deleted the charges on the atoms where they can collapse, the overall charge is now -19.6 from the original -6 minus the 13.6 in charge deleted. REALLY ANNOYING! This is a huge difference in charges so will have to be solved. One easy option would be to set the involved atoms to 0 then run QEQ on the atoms with unassigned charges... essentially distributing the -6 charge through all atoms except those with 0 VdW parameters.&lt;br /&gt;
&lt;br /&gt;
==Tues 15th Dec 2009==&lt;br /&gt;
&lt;br /&gt;
The more charged molecule failed once the charges were scaled to 1 since SCF convergence was not attained.&lt;br /&gt;
&lt;br /&gt;
==Mon 25th Jan 2010==&lt;br /&gt;
&lt;br /&gt;
TAO tool publication was very good read... has a good suggestion of an optimisation procedure:&lt;br /&gt;
&lt;br /&gt;
*Take the crystal structure&lt;br /&gt;
*Add Hydrogens&lt;br /&gt;
*Determine protonation&lt;br /&gt;
*MM geometry optimisation&lt;br /&gt;
*(MD?)&lt;br /&gt;
*(Look at several samples of the MD)&lt;br /&gt;
*Optimise a final structure&lt;br /&gt;
&lt;br /&gt;
==Thurs 28th Jan 2010==&lt;br /&gt;
&lt;br /&gt;
Tao Tools requires preprocess within AMBER tools to run the scripts correctly. Does not have HIS on the residue database... Amber defines the protonation state within its residue naming scheme... HIS is either HIE, HIP, HID depending on its protonation state. [http://www.poissonboltzmann.org/pdb2pqr/documentation/programmers-guide/naming-scheme]&lt;br /&gt;
&lt;br /&gt;
==Thurs 4th Jan 2010==&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;Preprocessing&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
To get TAO tools and also for completeness it is necessary to preprocess the PDB file before attempting to run ONIOM calculations, this includes:&lt;br /&gt;
&lt;br /&gt;
* Adding hydrogens&lt;br /&gt;
* Create a custom residue file for the chromophore&lt;br /&gt;
* Convert from PDB to gjf ONIOM input&lt;br /&gt;
&lt;br /&gt;
Ambertools are extremely helpful with this preprocessing especially since amber will be the low level theory used throughout this project. Within ambertools are a number of useful programs:&lt;br /&gt;
&lt;br /&gt;
* antechamber - has a parameterlookup function within amber and GAFF parameter library files.&lt;br /&gt;
* xLeap - builds the protein from its constituent residues - adds hydrogens according to library files - allows a parm file to be built to add custom residues.&lt;br /&gt;
&lt;br /&gt;
I will be working on using these 2 programs to create a full MM input file which is fully defined and correct.&lt;br /&gt;
&lt;br /&gt;
===xLeap===&lt;br /&gt;
&lt;br /&gt;
To load xLeap we must use X11 to get the graphical interface and the ssh -Y command to allow tunneling back to X11. Also the command line to start xLeap should be:  &#039;&#039;xleap -s -f /apps/ambertools/amber11/dat/leap/cmd/leaprc.ff03&#039;&#039;. This loads the ff03 parameter file and results in the start up of the graphical interface with:&lt;br /&gt;
&lt;br /&gt;
 Welcome to LEaP!&lt;br /&gt;
 Sourcing leaprc: /amber/dat/leap/cmd/leaprc&lt;br /&gt;
 Log file: ./leap.log&lt;br /&gt;
 Loading parameters: /amber/dat/leap/parm/parmME.dat&lt;br /&gt;
 Loading library: /amber/dat/leap/lib/all_nucleic94.lib&lt;br /&gt;
 Loading library: /amber/dat/leap/lib/all_aminoME.lib&lt;br /&gt;
 Loading library: /amber/dat/leap/lib/all_aminoctME.lib&lt;br /&gt;
 Loading library: /amber/dat/leap/lib/all_aminontME.lib&lt;br /&gt;
 Loading library: /amber/dat/leap/lib/ions94.lib&lt;br /&gt;
 Loading library: /amber/dat/leap/lib/water.lib&lt;br /&gt;
&lt;br /&gt;
This shows that the residue libraries have been loaded. We are then free to load a pdb file which will be read by xLeap and built according to the residues! In addition to the residue names xleap also requires information about the chain connectivity. Residues can be connected to other residues from both sides, only one side or not at all. xleap is fairly simple in that it expects that all residues in the pdb file are connected in the order in which they are listed unless they are separated by a &amp;quot;TER&amp;quot; card. In most cases where a single protein chains is being investigated like here - the TER card should end the residue information.&lt;br /&gt;
&lt;br /&gt;
To load a pdb use (ignore the &amp;lt;&amp;gt;) &#039;&#039;&amp;lt;name for use in xLeap&amp;gt;=loadpdb &amp;quot;&amp;lt;name of file&amp;gt;&amp;quot;&#039;&#039;. Make sure that the file is in your PATH else fully define the path in the filename.&lt;br /&gt;
&lt;br /&gt;
The pdb should then be read into xLeap and built. At the end of the output we get (I loaded ChainA of the 1W7S without any water or hydrogens):&lt;br /&gt;
&lt;br /&gt;
 Leap added 1765 missing atoms according to residue templates:&lt;br /&gt;
       1 Heavy&lt;br /&gt;
       1764 H / lone pairs&lt;br /&gt;
 The file contained 21 atoms not in residue templates&lt;br /&gt;
&lt;br /&gt;
This shows that Leap added hydrogens automatically and also a missing &amp;quot;heavy&amp;quot; atom (one which is not a H or lp):&lt;br /&gt;
&lt;br /&gt;
 Added missing heavy atom: .R&amp;lt;CHIE 229&amp;gt;.A&amp;lt;OXT 18&amp;gt;&lt;br /&gt;
&lt;br /&gt;
For amino acids, the traditional order of atom records within a residue is N, CA, C, O, followed by the sidechain atoms (CB, CG1, CG2 . . . ) in order first of increasing remoteness, and then branch. The extra oxygen at the carboxyl terminal is called &amp;quot; OXT&amp;quot;, and appears after all sidechain atoms.&lt;br /&gt;
&lt;br /&gt;
Now that we have established that there are 21 atoms that are in the non standard residue region we need to define a custom residue which describes this region (chromophore). This involves creating a prep file....&lt;br /&gt;
&lt;br /&gt;
I first drew up the undefined section of GFP (chromophore) in Gaussview5 and saved as a.pdb then:&lt;br /&gt;
&lt;br /&gt;
* Load in antechamber and convert to mol2 format using &#039;&#039;antechamber -i &amp;lt;input&amp;gt; -fi pdb -o &amp;lt;output&amp;gt; -fo mol2 -c bcc&#039;&#039;&lt;br /&gt;
* Use parmchk to make sure the molecule was defined in the GAFF force field - kicks out a .frcmod file with missing parameters (including dihedrals) - nice :D!&lt;br /&gt;
* Manually add the &amp;quot;1CSY&amp;quot; where a standalone &amp;quot;0&amp;quot; is in the mol2 file so Leap can read it&lt;br /&gt;
* Load the gaff parameter file to Leap using &#039;&#039;source leaprc.gaff&#039;&#039;&lt;br /&gt;
* Load the chromophore_CSY_full.mol2 to Leap using &#039;&#039;CSY=loadmol2 &amp;quot;ChromophoreGFP_CSY_full.mol2&amp;quot;&#039;&#039;&lt;br /&gt;
* Check the residue using &#039;&#039;check CSY&#039;&#039; - Notice missing parameters are there.&lt;br /&gt;
* Load the ChromophoreGFP_CSY_full.frcmod file created by parmchk using &#039;&#039;loadamberparams ChromophoreGFP_CSY_full.frcmod&#039;&#039;&lt;br /&gt;
* Now check the CSY residue again using &#039;&#039;check CSY&#039;&#039; - No Problems! woo hoo! Now I can create amber parameters :D&lt;br /&gt;
&lt;br /&gt;
==Tues 9th Feb 2010==&lt;br /&gt;
&lt;br /&gt;
===RESP Fitting===&lt;br /&gt;
&lt;br /&gt;
To get charges that are like those of the standard amber force field an RESP fitting method has to be used. This involves 3 steps:&lt;br /&gt;
&lt;br /&gt;
* First, the molecule studied is optimized to determine a stable minimum (using Gaussian).&lt;br /&gt;
* Then, this minimized structure is used to calculate a Molecular Electrostatic Potential (MEP) on a three-dimensional grid (using Gaussian).&lt;br /&gt;
* Finally, this grid is exported into the &amp;quot;RESP program&amp;quot; which is used to fit atom-centered charges to the MEP&lt;br /&gt;
&lt;br /&gt;
To assist in this procedure - I am using a program called RED - [http://q4md-forcefieldtools.org/RED/] which can be used in all 3 stages...&lt;br /&gt;
&lt;br /&gt;
* Ante_RED is used to prepare Gaussian input files for the RESP fit&lt;br /&gt;
* RED III is used to fit atom centered charges&lt;br /&gt;
&lt;br /&gt;
The procedure used to create the CSY file for RESP fitting is as follows:&lt;br /&gt;
&lt;br /&gt;
* Cut the CSY residue from the 1W7S protein structure and cap terminal ends with methyl groups&lt;br /&gt;
* Optimise this in Gaussian at PM6 level&lt;br /&gt;
* Run the optimised CSY geometry through ante_RED to create the Gaussian input file involving minimisation at HF level and calculation of the MEP - also get the .P2N file required for RED III&lt;br /&gt;
* Run this Gaussian input file&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;Input file for optimisation:&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;#P hf/6-31G* Opt=(Tight,CalcFC) Freq SCF(Conver=8) Test&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
This will run a HF level optimisation and frequency analysis.  &#039;&#039;Opt=tight&#039;&#039; tightens the cutoffs on forces and step size that are used to determine convergence. An optimization with Opt=Tight will take several more steps than with the default cutoffs. For molecular systems with very small force constants (low frequency vibrational modes), this may be necessary to ensure adequate convergence and reliability of frequencies computed in a subsequent job step. This option can only be used with Berny optimizations. &#039;&#039;Opt = CalcFC&#039;&#039; causes the calculation of an analytical Hessian (in HF). &#039;&#039;SCF(Conver=8)&#039;&#039; sets the SCF convergence criterion to 10^-8.&lt;br /&gt;
&lt;br /&gt;
===Parameter files for use in Gaussian (Gaussian only uses amber94)===&lt;br /&gt;
&lt;br /&gt;
Using external parameter files. It is possible to include MM function entries within an external file by replacing the entries within the input file with the usual Gaussian include file mechanism: e.g., @oplsaa.prm. Parameter files for various force fields are included within the main Gaussian directory (i.e., $g09root/g09). If you want to specify the entire force field via an external file, use an input file like the following:&lt;br /&gt;
&lt;br /&gt;
# UFF=SoftOnly&lt;br /&gt;
&lt;br /&gt;
Read-in force field example&lt;br /&gt;
&lt;br /&gt;
molecule specification&lt;br /&gt;
&lt;br /&gt;
@$g09root/g09/oplsaa.prm&lt;br /&gt;
&lt;br /&gt;
Note that the choice of a molecular mechanics keyword is arbitrary as the entire force field depends on the read-in functions and parameters and not on any internal ones (in other words, specifying Amber would yield the same result as UFF).&lt;br /&gt;
&lt;br /&gt;
==Tues 16th Feb 2010==&lt;br /&gt;
===RESP Fitting===&lt;br /&gt;
&lt;br /&gt;
Following optimisation of the fragment for RESP fitting as seen before. We now have to fit the RESP to a MEP, this involves a few calculations:&lt;br /&gt;
&lt;br /&gt;
* ESP calcs in gaussian&lt;br /&gt;
* Fitting calcs by resp in ambertools&lt;br /&gt;
&lt;br /&gt;
The result is a .mol2 containing the fitted charges. The process has 2  input files:&lt;br /&gt;
&lt;br /&gt;
* .log&lt;br /&gt;
* .p2n&lt;br /&gt;
&lt;br /&gt;
The p2n is created by ante_RED and controls the RESP fitting process. (See RED site). To get a fit that excludes the capping methyls we add a REMARK line:&lt;br /&gt;
&lt;br /&gt;
 REMARK INTRA-MCC 0.0 |  35  36  37  38  39  40  41  42 | R&lt;br /&gt;
&lt;br /&gt;
This must be formatted EXACTLY as shown here (2 spaces between the numbers). The numbers correspond to atoms numbers, the 0.0 constrains these atoms to 0.0 charge and the R tells RESP to remove these atoms from the 2nd round fit and the subsequent mol2 output. Leaving a fragment.&lt;br /&gt;
&lt;br /&gt;
The RED program can be run on CX1! I used the following Jobscript (REDjobscript_4):&lt;br /&gt;
&lt;br /&gt;
 #PBS -l ncpus=1 &lt;br /&gt;
 #PBS -l mem=6000mb&lt;br /&gt;
 #PBS -l walltime=01:00:00&lt;br /&gt;
 #PBS -q pqchem&lt;br /&gt;
 #PBS -joe&lt;br /&gt;
 module load ambertools&lt;br /&gt;
 module load gaussian&lt;br /&gt;
 export GAUSS_SCRDIR=$TMPDIR&lt;br /&gt;
 echo $GAUSS_SCRDIR&lt;br /&gt;
 cd $HOME&lt;br /&gt;
 pwd&lt;br /&gt;
 perl /home/af706/RED-vIII.3.pl &amp;gt; /home/af706/RED-vIII.log&lt;br /&gt;
&lt;br /&gt;
So here we are using the chemistry node pqchem:&lt;br /&gt;
&lt;br /&gt;
* loading the ambertools (for RESP) and gaussian modules&lt;br /&gt;
* defining a scratch directory for the RESP program to read gaussian scratch&lt;br /&gt;
* TEMPDIR is used since the scratch must be empty upon calculation start. The temporary directory will be created fresh on each node as the calculation is started, and it&#039;s unlikely that two calculations will run on the same node.&lt;br /&gt;
* Moving to my HOME directory /home/af706 because the script must be run in the same directory as the input files  (.log and .p2n)&lt;br /&gt;
* RUN!&lt;br /&gt;
&lt;br /&gt;
The fitting process runs single point energy calcs and fits the RESP - overall it took 3mins for my fragment.&lt;br /&gt;
&lt;br /&gt;
===Going from xLeap out to Gaussian in===&lt;br /&gt;
&lt;br /&gt;
2 ways to do this:&lt;br /&gt;
&lt;br /&gt;
* Load in Gaussview&lt;br /&gt;
* Use Pengs scripts&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;Gaussview&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
* The file loads and all residues are recognised (apart from the chromophore). The file contains the same number of atoms as the saved Leap file.&lt;br /&gt;
* Charges are assigned from amber94 - not the chromophore still&lt;br /&gt;
* Amber atom types are assigned correctly to ALL atoms&lt;br /&gt;
* All that remains is to add charges from the RESP fit.&lt;br /&gt;
* PDB information is retained in input&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;TAO&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
* Atoms are added so atom count is different to the file leaving Leap&lt;br /&gt;
* Fails to treat terminal residues correctly&lt;br /&gt;
* Lose residue information&lt;br /&gt;
* Charges are missing from chromophore and atoms in terminal residues&lt;br /&gt;
&lt;br /&gt;
==22nd Feb 2010==&lt;br /&gt;
&lt;br /&gt;
Links to HOW TO for Ambertools + Gaussview/ Ambertools + TAO tools + Gaussview&lt;br /&gt;
&lt;br /&gt;
* Ambertools + Gaussview  - https://www.ch.ic.ac.uk/wiki/index.php/AndyForester_Ambertools_Gaussview&lt;br /&gt;
* Ambertools + TAO tools + Gaussview - https://www.ch.ic.ac.uk/wiki/index.php/AndyForester_Ambertools_TAO&lt;br /&gt;
&lt;br /&gt;
* Running Amber only Calculations - https://www.ch.imperial.ac.uk/wiki/index.php/AndyForesterAmber&lt;br /&gt;
&lt;br /&gt;
Running ONIOM calculations - https://www.ch.ic.ac.uk/wiki/index.php/AndyForesterONIOM&lt;br /&gt;
&lt;br /&gt;
==26th Feb 2010==&lt;br /&gt;
&lt;br /&gt;
===Picking out model Modes===&lt;br /&gt;
&lt;br /&gt;
Integers and integer ranges without a keyword are interpreted as mode numbers, although the [not]mode keywords may be used. The keywords atoms and notatoms can be used to define an atom list whose modes should be included/excluded (respectively). Atoms can also be specified by ONIOM layer via the [not]layer keywords, which accept these values: real for the real system, model for the model system in a 2-layer ONIOM, middle for the middle layer in a 3-layer ONIOM, and small for the model layer of a 3-layer ONIOM. Atoms may be similarly included/excluded by residue with residue and notresidue, which accept lists of residue names or numbers. Both keyword sets function as shorthand forms for atom lists.&lt;br /&gt;
&lt;br /&gt;
==TAO tools==&lt;br /&gt;
&lt;br /&gt;
The expected differences to the one without using his tools are:&lt;br /&gt;
&lt;br /&gt;
* Different partial charges since his scripts use all_amino03 to assign charges,&lt;br /&gt;
  Gaussview uses amber94 charges (Net charge is the same for both files - good thing)&lt;br /&gt;
&lt;br /&gt;
* Potential for some changes in atom-types due to different versions of amber being used.&lt;br /&gt;
&lt;br /&gt;
What actually happened:&lt;br /&gt;
&lt;br /&gt;
* Difference in individual partial charges but same overal net charge&lt;br /&gt;
&lt;br /&gt;
* Different atom type assignment led to some more undefined parameters.&lt;br /&gt;
&lt;br /&gt;
* Defining these and running an energy led to significantly different energy than the non tools&lt;br /&gt;
  method:&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;WITHOUT tools&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
 AMBER calculation of energy.&lt;br /&gt;
 Energy per function type:&lt;br /&gt;
 Direct non-bonded                                 122152.193595037&lt;br /&gt;
 Non-bonded pair                                  -122162.319396395&lt;br /&gt;
 Harmonic stretch I                                     0.354921205&lt;br /&gt;
 Harmonic bend I                                        0.989669436&lt;br /&gt;
 Amber torsion                                          1.587390468&lt;br /&gt;
 Improper torsion                                       0.013641893&lt;br /&gt;
 Energy per function class:&lt;br /&gt;
        Coulomb              -9.200657698&lt;br /&gt;
    Vanderwaals              -0.925143661&lt;br /&gt;
     Stretching               0.354921205&lt;br /&gt;
        Bending               0.989669436&lt;br /&gt;
        Torsion               1.587390468&lt;br /&gt;
   Out-of-plane               0.013641893&lt;br /&gt;
 Energy=    -7.18017836     NIter=   0.&lt;br /&gt;
 Dipole moment=     -25.836974    -115.466231     -34.635294&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;WITH TAO tools&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
 AMBER calculation of energy.&lt;br /&gt;
 Energy per function type:&lt;br /&gt;
 Direct non-bonded                                 137343.707022741&lt;br /&gt;
 Non-bonded pair                                  -137347.284026475&lt;br /&gt;
 Harmonic stretch I                                     0.354921205&lt;br /&gt;
 Harmonic bend I                                        1.082982995&lt;br /&gt;
 Amber torsion                                          1.589487307&lt;br /&gt;
 Improper torsion                                       0.013645198&lt;br /&gt;
 Energy per function class:&lt;br /&gt;
        Coulomb              -6.890960120&lt;br /&gt;
    Vanderwaals               3.313956385&lt;br /&gt;
     Stretching               0.354921205&lt;br /&gt;
        Bending               1.082982995&lt;br /&gt;
        Torsion               1.589487307&lt;br /&gt;
   Out-of-plane               0.013645198&lt;br /&gt;
 Energy=   -0.535967030     NIter=   0.&lt;br /&gt;
 Dipole moment=     -29.988370    -117.918540     -32.409355&lt;br /&gt;
&lt;br /&gt;
Obvious where the differences are.&lt;br /&gt;
&lt;br /&gt;
==3PHY==&lt;br /&gt;
&lt;br /&gt;
https://www.ch.ic.ac.uk/wiki/index.php/AndyForesterPYP&lt;/div&gt;</summary>
		<author><name>Af706</name></author>
	</entry>
	<entry>
		<id>https://chemwiki.ch.ic.ac.uk/index.php?title=AndyForesterPYP&amp;diff=103702</id>
		<title>AndyForesterPYP</title>
		<link rel="alternate" type="text/html" href="https://chemwiki.ch.ic.ac.uk/index.php?title=AndyForesterPYP&amp;diff=103702"/>
		<updated>2010-03-12T13:11:31Z</updated>

		<summary type="html">&lt;p&gt;Af706: New page: =3PHY=  Back to Main - https://www.ch.ic.ac.uk/wiki/index.php/AndyForester  ==First thoughts==  The 3PHY NMR structure was used as an initial structure and processed like 1W7S. NMR experim...&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;=3PHY=&lt;br /&gt;
&lt;br /&gt;
Back to Main - https://www.ch.ic.ac.uk/wiki/index.php/AndyForester&lt;br /&gt;
&lt;br /&gt;
==First thoughts==&lt;br /&gt;
&lt;br /&gt;
The 3PHY NMR structure was used as an initial structure and processed like 1W7S. NMR experiments like this lead to a huge amount of information about through-space distances between atoms&lt;br /&gt;
and also bonded distances, but the sheer size of the data set means it can be fit lots of ways (fitting is done iteratively starting from a guess stucture). The result is a set of possible structures, in this case there are 26 different possible stuctures - im not sure which one the wiki starts with!&lt;br /&gt;
&lt;br /&gt;
The file is slightly different to Mikes example... overall -4 charge due to different protonation state (pH7) but charges are the same except for around the chromophore. I included the chromophore + connecting residue when fitting charges since the chromophore is bound to a residue in the primary sequence via a sulphur in is sidechain - this will change the charge distribution of the residue so should be refitted with the chromophore attached. The file you provided uses standard amber charges for this residue and some estimated charges from ESP calculations (+ some fudging to get an integer charge!) for the chromophore.&lt;/div&gt;</summary>
		<author><name>Af706</name></author>
	</entry>
	<entry>
		<id>https://chemwiki.ch.ic.ac.uk/index.php?title=AndyForesterONIOM&amp;diff=103195</id>
		<title>AndyForesterONIOM</title>
		<link rel="alternate" type="text/html" href="https://chemwiki.ch.ic.ac.uk/index.php?title=AndyForesterONIOM&amp;diff=103195"/>
		<updated>2010-03-10T15:32:18Z</updated>

		<summary type="html">&lt;p&gt;Af706: /* An expanded Model */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;=ONIOM calculations on 1W7S=&lt;br /&gt;
&lt;br /&gt;
Back to Main - https://www.ch.ic.ac.uk/wiki/index.php/AndyForester&lt;br /&gt;
&lt;br /&gt;
Using the created input file ONIOM calculations were submitted always using the DFT B3LYP 6-31G* : Amber scheme.&lt;br /&gt;
&lt;br /&gt;
==Minimal model==&lt;br /&gt;
&lt;br /&gt;
Optimisation started from the Amber optimization chk file, this was because we want to compare frequencies at the SAME minimum critical point for each level of theory. &lt;br /&gt;
&lt;br /&gt;
The minimal model includes the planer portion of the chromophore residue (number 64 - CSY) and contains 16 heavy atoms and 8 hydrogens. The smaller the model region, the more efficient the ONIOM calculation becomes since only model is treated with the DFT costly theory. The exact input command line was - &#039;&#039;#p opt oniom(b3lyp/6-31g(d):amber=softfirst) geom=connectivity&#039;&#039;. The time taken to optimise was 2 hours 25 minutes  1.2 seconds and the final energy was -736.838427249468. This optimisation uses the Gaussian default Mechanical Embedding scheme where the MM charges are not included in the high level model - this is cheaper but not as accurate. Another calculation was started from the same inital structure but optimised using Electronic Embedding (polarisation of the wavefunction), this input line was - &lt;br /&gt;
&#039;&#039;#p opt oniom(b3lyp/6-31g(d):amber=softfirst)=embed geom=connectivity&#039;&#039;. The time taken for optimisation was 7 hours 25 minutes 29.3 seconds and the final energy was -736.807010778592.&lt;br /&gt;
&lt;br /&gt;
Frequency calculations were then run on the resulting geometries. For the inputs with water molecules 10A near the chromophore, 3N-6 frequencies were returned. When comparing these frequencies we used the C=O frequency which is discussed in detail in many papers. It involves a C=O stretch mixed with a C=C stretch in the chormophore and has been assigned experimentally to occur at 1680cm-1. Calculations on the HBDI molecule as a chromophore model gave the mode a frequency of 1802cm-1 using B3LYP 6-31G*. This is not good agreement with the experimental value, so something must be missing - the protein electrostatic environment (cf. ian gould saying electrostatics are by far most important term in forcefield)&lt;br /&gt;
&lt;br /&gt;
The table below shows the C=O frequency for the 2 ONIOM calculations.&lt;br /&gt;
&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot; border=&amp;quot;1&amp;quot;&lt;br /&gt;
| align=&amp;quot;center&amp;quot; style=&amp;quot;background:#f0f0f0;&amp;quot;|&#039;&#039;&#039;Calculation&#039;&#039;&#039;&lt;br /&gt;
| align=&amp;quot;center&amp;quot; style=&amp;quot;background:#f0f0f0;&amp;quot;|&#039;&#039;&#039;Time&#039;&#039;&#039;&lt;br /&gt;
| align=&amp;quot;center&amp;quot; style=&amp;quot;background:#f0f0f0;&amp;quot;|&#039;&#039;&#039;C=O Frequency&#039;&#039;&#039;&lt;br /&gt;
|-&lt;br /&gt;
|   ONIOM with ME||4 hours 40 minutes 55.2 seconds||1804&lt;br /&gt;
|-&lt;br /&gt;
|   ONIOM with EM||13 hours 10 minutes 50.4 seconds||1745&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
It is clear that not much changes in comparison to the HBDI model when using ME with the ONIOM scheme, this is to be expected since no real change has been made to the structure or wavefunction across the chromophore (except for the connections to the protein primary sequence). The C=O frequency occuring at 1804cm-1 in the ME scheme is thus to be expected and confirms the hypothesis that the protein provides an electrostatic environment which polarises the wavefunction over the chromophore and affects the frequencies.&lt;br /&gt;
&lt;br /&gt;
Moving to the EE scheme, the MM charges are used to polarise the wavefunction over the DFT region, and results in a significant downshift of the C=O mode to 1745cm-1. This shows that the protein electrostatic environment has had a significant effect on the frequencies of the chromophore region. This frequency is also converging on the experimental value (1680cm-1). It is also noted that 8 modes outline in the HBDI experiments were observed in the ONIOM calculations and all moved towards experimental values.&lt;br /&gt;
&lt;br /&gt;
In conclusion the minimal model has provided an improvement on the frequency calculation of HBDI towards the experimental value calculated for GFP. The next step is to expand the model to include hydrogen bonding interactions from the surrounding protein (cf. hydrogen bonding by DFT, orbital interaction, directional, electrostatics more like a field)&lt;br /&gt;
&lt;br /&gt;
==An expanded Model==&lt;br /&gt;
&lt;br /&gt;
The next step is an expanded model region to include hydrogen bonding interactions between the chromophore and its surrounding residues (in space, not adjacent in sequence). The residues proximal to the chromophore which have some ability to hydrogen bond were detailed in the paper by Jasper van Thor: &lt;br /&gt;
&lt;br /&gt;
Uncovering the hidden ground state of green fluorescent protein, John T. M. Kennis,†‡§ Delmar S. Larsen,†‡ Ivo H. M. van Stokkum,†‡ Mikas Vengris,† Jasper J. van Thor,¶ and Rienk van Grondelle†, Proc Natl Acad Sci U S A. 2004 December 28; 101(52): 17988–17993.&lt;br /&gt;
&lt;br /&gt;
The diagram in this paper shows the interactions. A model was then built around this diagram, trying to keep the model system as small as possible. Residues surrounding the chromophrore with suggested interactions are:&lt;br /&gt;
&lt;br /&gt;
* Glu222&lt;br /&gt;
* Ser202&lt;br /&gt;
* Gln94&lt;br /&gt;
* Arg96&lt;br /&gt;
* Tyr148&lt;br /&gt;
* Ala203&lt;br /&gt;
&lt;br /&gt;
The parts of these residues that were of interest were added to the model system, along with crystalographic water molecule near the phenol hydrogen (with suggested H-bond network).&lt;/div&gt;</summary>
		<author><name>Af706</name></author>
	</entry>
	<entry>
		<id>https://chemwiki.ch.ic.ac.uk/index.php?title=AndyForesterONIOM&amp;diff=103188</id>
		<title>AndyForesterONIOM</title>
		<link rel="alternate" type="text/html" href="https://chemwiki.ch.ic.ac.uk/index.php?title=AndyForesterONIOM&amp;diff=103188"/>
		<updated>2010-03-10T15:26:51Z</updated>

		<summary type="html">&lt;p&gt;Af706: /* An expanded Model */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;=ONIOM calculations on 1W7S=&lt;br /&gt;
&lt;br /&gt;
Back to Main - https://www.ch.ic.ac.uk/wiki/index.php/AndyForester&lt;br /&gt;
&lt;br /&gt;
Using the created input file ONIOM calculations were submitted always using the DFT B3LYP 6-31G* : Amber scheme.&lt;br /&gt;
&lt;br /&gt;
==Minimal model==&lt;br /&gt;
&lt;br /&gt;
Optimisation started from the Amber optimization chk file, this was because we want to compare frequencies at the SAME minimum critical point for each level of theory. &lt;br /&gt;
&lt;br /&gt;
The minimal model includes the planer portion of the chromophore residue (number 64 - CSY) and contains 16 heavy atoms and 8 hydrogens. The smaller the model region, the more efficient the ONIOM calculation becomes since only model is treated with the DFT costly theory. The exact input command line was - &#039;&#039;#p opt oniom(b3lyp/6-31g(d):amber=softfirst) geom=connectivity&#039;&#039;. The time taken to optimise was 2 hours 25 minutes  1.2 seconds and the final energy was -736.838427249468. This optimisation uses the Gaussian default Mechanical Embedding scheme where the MM charges are not included in the high level model - this is cheaper but not as accurate. Another calculation was started from the same inital structure but optimised using Electronic Embedding (polarisation of the wavefunction), this input line was - &lt;br /&gt;
&#039;&#039;#p opt oniom(b3lyp/6-31g(d):amber=softfirst)=embed geom=connectivity&#039;&#039;. The time taken for optimisation was 7 hours 25 minutes 29.3 seconds and the final energy was -736.807010778592.&lt;br /&gt;
&lt;br /&gt;
Frequency calculations were then run on the resulting geometries. For the inputs with water molecules 10A near the chromophore, 3N-6 frequencies were returned. When comparing these frequencies we used the C=O frequency which is discussed in detail in many papers. It involves a C=O stretch mixed with a C=C stretch in the chormophore and has been assigned experimentally to occur at 1680cm-1. Calculations on the HBDI molecule as a chromophore model gave the mode a frequency of 1802cm-1 using B3LYP 6-31G*. This is not good agreement with the experimental value, so something must be missing - the protein electrostatic environment (cf. ian gould saying electrostatics are by far most important term in forcefield)&lt;br /&gt;
&lt;br /&gt;
The table below shows the C=O frequency for the 2 ONIOM calculations.&lt;br /&gt;
&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot; border=&amp;quot;1&amp;quot;&lt;br /&gt;
| align=&amp;quot;center&amp;quot; style=&amp;quot;background:#f0f0f0;&amp;quot;|&#039;&#039;&#039;Calculation&#039;&#039;&#039;&lt;br /&gt;
| align=&amp;quot;center&amp;quot; style=&amp;quot;background:#f0f0f0;&amp;quot;|&#039;&#039;&#039;Time&#039;&#039;&#039;&lt;br /&gt;
| align=&amp;quot;center&amp;quot; style=&amp;quot;background:#f0f0f0;&amp;quot;|&#039;&#039;&#039;C=O Frequency&#039;&#039;&#039;&lt;br /&gt;
|-&lt;br /&gt;
|   ONIOM with ME||4 hours 40 minutes 55.2 seconds||1804&lt;br /&gt;
|-&lt;br /&gt;
|   ONIOM with EM||13 hours 10 minutes 50.4 seconds||1745&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
It is clear that not much changes in comparison to the HBDI model when using ME with the ONIOM scheme, this is to be expected since no real change has been made to the structure or wavefunction across the chromophore (except for the connections to the protein primary sequence). The C=O frequency occuring at 1804cm-1 in the ME scheme is thus to be expected and confirms the hypothesis that the protein provides an electrostatic environment which polarises the wavefunction over the chromophore and affects the frequencies.&lt;br /&gt;
&lt;br /&gt;
Moving to the EE scheme, the MM charges are used to polarise the wavefunction over the DFT region, and results in a significant downshift of the C=O mode to 1745cm-1. This shows that the protein electrostatic environment has had a significant effect on the frequencies of the chromophore region. This frequency is also converging on the experimental value (1680cm-1). It is also noted that 8 modes outline in the HBDI experiments were observed in the ONIOM calculations and all moved towards experimental values.&lt;br /&gt;
&lt;br /&gt;
In conclusion the minimal model has provided an improvement on the frequency calculation of HBDI towards the experimental value calculated for GFP. The next step is to expand the model to include hydrogen bonding interactions from the surrounding protein (cf. hydrogen bonding by DFT, orbital interaction, directional, electrostatics more like a field)&lt;br /&gt;
&lt;br /&gt;
==An expanded Model==&lt;br /&gt;
&lt;br /&gt;
The next step is an expanded model region to include hydrogen bonding interactions between the chromophore and its surrounding residues (in space, not adjacent in sequence). The residues proximal to the chromophore which have some ability to hydrogen bond were detailed in the paper by Jasper van Thor: Uncovering the hidden ground state of green fluorescent protein, John T. M. Kennis,†‡§ Delmar S. Larsen,†‡ Ivo H. M. van Stokkum,†‡ Mikas Vengris,† Jasper J. van Thor,¶ and Rienk van Grondelle†, Proc Natl Acad Sci U S A. 2004 December 28; 101(52): 17988–17993.&lt;/div&gt;</summary>
		<author><name>Af706</name></author>
	</entry>
	<entry>
		<id>https://chemwiki.ch.ic.ac.uk/index.php?title=AndyForesterONIOM&amp;diff=103184</id>
		<title>AndyForesterONIOM</title>
		<link rel="alternate" type="text/html" href="https://chemwiki.ch.ic.ac.uk/index.php?title=AndyForesterONIOM&amp;diff=103184"/>
		<updated>2010-03-10T15:21:58Z</updated>

		<summary type="html">&lt;p&gt;Af706: /* Minimal model */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;=ONIOM calculations on 1W7S=&lt;br /&gt;
&lt;br /&gt;
Back to Main - https://www.ch.ic.ac.uk/wiki/index.php/AndyForester&lt;br /&gt;
&lt;br /&gt;
Using the created input file ONIOM calculations were submitted always using the DFT B3LYP 6-31G* : Amber scheme.&lt;br /&gt;
&lt;br /&gt;
==Minimal model==&lt;br /&gt;
&lt;br /&gt;
Optimisation started from the Amber optimization chk file, this was because we want to compare frequencies at the SAME minimum critical point for each level of theory. &lt;br /&gt;
&lt;br /&gt;
The minimal model includes the planer portion of the chromophore residue (number 64 - CSY) and contains 16 heavy atoms and 8 hydrogens. The smaller the model region, the more efficient the ONIOM calculation becomes since only model is treated with the DFT costly theory. The exact input command line was - &#039;&#039;#p opt oniom(b3lyp/6-31g(d):amber=softfirst) geom=connectivity&#039;&#039;. The time taken to optimise was 2 hours 25 minutes  1.2 seconds and the final energy was -736.838427249468. This optimisation uses the Gaussian default Mechanical Embedding scheme where the MM charges are not included in the high level model - this is cheaper but not as accurate. Another calculation was started from the same inital structure but optimised using Electronic Embedding (polarisation of the wavefunction), this input line was - &lt;br /&gt;
&#039;&#039;#p opt oniom(b3lyp/6-31g(d):amber=softfirst)=embed geom=connectivity&#039;&#039;. The time taken for optimisation was 7 hours 25 minutes 29.3 seconds and the final energy was -736.807010778592.&lt;br /&gt;
&lt;br /&gt;
Frequency calculations were then run on the resulting geometries. For the inputs with water molecules 10A near the chromophore, 3N-6 frequencies were returned. When comparing these frequencies we used the C=O frequency which is discussed in detail in many papers. It involves a C=O stretch mixed with a C=C stretch in the chormophore and has been assigned experimentally to occur at 1680cm-1. Calculations on the HBDI molecule as a chromophore model gave the mode a frequency of 1802cm-1 using B3LYP 6-31G*. This is not good agreement with the experimental value, so something must be missing - the protein electrostatic environment (cf. ian gould saying electrostatics are by far most important term in forcefield)&lt;br /&gt;
&lt;br /&gt;
The table below shows the C=O frequency for the 2 ONIOM calculations.&lt;br /&gt;
&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot; border=&amp;quot;1&amp;quot;&lt;br /&gt;
| align=&amp;quot;center&amp;quot; style=&amp;quot;background:#f0f0f0;&amp;quot;|&#039;&#039;&#039;Calculation&#039;&#039;&#039;&lt;br /&gt;
| align=&amp;quot;center&amp;quot; style=&amp;quot;background:#f0f0f0;&amp;quot;|&#039;&#039;&#039;Time&#039;&#039;&#039;&lt;br /&gt;
| align=&amp;quot;center&amp;quot; style=&amp;quot;background:#f0f0f0;&amp;quot;|&#039;&#039;&#039;C=O Frequency&#039;&#039;&#039;&lt;br /&gt;
|-&lt;br /&gt;
|   ONIOM with ME||4 hours 40 minutes 55.2 seconds||1804&lt;br /&gt;
|-&lt;br /&gt;
|   ONIOM with EM||13 hours 10 minutes 50.4 seconds||1745&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
It is clear that not much changes in comparison to the HBDI model when using ME with the ONIOM scheme, this is to be expected since no real change has been made to the structure or wavefunction across the chromophore (except for the connections to the protein primary sequence). The C=O frequency occuring at 1804cm-1 in the ME scheme is thus to be expected and confirms the hypothesis that the protein provides an electrostatic environment which polarises the wavefunction over the chromophore and affects the frequencies.&lt;br /&gt;
&lt;br /&gt;
Moving to the EE scheme, the MM charges are used to polarise the wavefunction over the DFT region, and results in a significant downshift of the C=O mode to 1745cm-1. This shows that the protein electrostatic environment has had a significant effect on the frequencies of the chromophore region. This frequency is also converging on the experimental value (1680cm-1). It is also noted that 8 modes outline in the HBDI experiments were observed in the ONIOM calculations and all moved towards experimental values.&lt;br /&gt;
&lt;br /&gt;
In conclusion the minimal model has provided an improvement on the frequency calculation of HBDI towards the experimental value calculated for GFP. The next step is to expand the model to include hydrogen bonding interactions from the surrounding protein (cf. hydrogen bonding by DFT, orbital interaction, directional, electrostatics more like a field)&lt;br /&gt;
&lt;br /&gt;
==An expanded Model==&lt;/div&gt;</summary>
		<author><name>Af706</name></author>
	</entry>
	<entry>
		<id>https://chemwiki.ch.ic.ac.uk/index.php?title=AndyForesterONIOM&amp;diff=103177</id>
		<title>AndyForesterONIOM</title>
		<link rel="alternate" type="text/html" href="https://chemwiki.ch.ic.ac.uk/index.php?title=AndyForesterONIOM&amp;diff=103177"/>
		<updated>2010-03-10T15:06:48Z</updated>

		<summary type="html">&lt;p&gt;Af706: /* Minimal model */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;=ONIOM calculations on 1W7S=&lt;br /&gt;
&lt;br /&gt;
Back to Main - https://www.ch.ic.ac.uk/wiki/index.php/AndyForester&lt;br /&gt;
&lt;br /&gt;
Using the created input file ONIOM calculations were submitted always using the DFT B3LYP 6-31G* : Amber scheme.&lt;br /&gt;
&lt;br /&gt;
==Minimal model==&lt;br /&gt;
&lt;br /&gt;
Optimisation started from the Amber optimization chk file, this was because we want to compare frequencies at the SAME minimum critical point for each level of theory. &lt;br /&gt;
&lt;br /&gt;
The minimal model includes the planer portion of the chromophore residue (number 64 - CSY) and contains 16 heavy atoms and 8 hydrogens. The smaller the model region, the more efficient the ONIOM calculation becomes since only model is treated with the DFT costly theory. The exact input command line was - &#039;&#039;#p opt oniom(b3lyp/6-31g(d):amber=softfirst) geom=connectivity&#039;&#039;. The time taken to optimise was 2 hours 25 minutes  1.2 seconds and the final energy was -736.838427249468. This optimisation uses the Gaussian default Mechanical Embedding scheme where the MM charges are not included in the high level model - this is cheaper but not as accurate. Another calculation was started from the same inital structure but optimised using Electronic Embedding (polarisation of the wavefunction), this input line was - &lt;br /&gt;
&#039;&#039;#p opt oniom(b3lyp/6-31g(d):amber=softfirst)=embed geom=connectivity&#039;&#039;. The time taken for optimisation was 7 hours 25 minutes 29.3 seconds and the final energy was -736.807010778592.&lt;br /&gt;
&lt;br /&gt;
Frequency calculations were then run on the resulting geometries. For the inputs with water molecules 10A near the chromophore, 3N-6 frequencies were returned. When comparing these frequencies we used the C=O frequency which is discussed in detail in many papers. It involves a C=O stretch mixed with a C=C stretch in the chormophore and has been assigned experimentally to occur at 1680cm-1. Calculations on the HBDI molecule as a chromophore model gave the mode a frequency of 1802cm-1 using B3LYP 6-31G*. This is not good agreement with the experimental value, so something must be missing - the protein electrostatic environment (cf. ian gould saying electrostatics are by far most important term in forcefield)&lt;br /&gt;
&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot; border=&amp;quot;1&amp;quot;&lt;br /&gt;
| align=&amp;quot;center&amp;quot; style=&amp;quot;background:#f0f0f0;&amp;quot;|&#039;&#039;&#039;Calculation&#039;&#039;&#039;&lt;br /&gt;
| align=&amp;quot;center&amp;quot; style=&amp;quot;background:#f0f0f0;&amp;quot;|&#039;&#039;&#039;Time&#039;&#039;&#039;&lt;br /&gt;
| align=&amp;quot;center&amp;quot; style=&amp;quot;background:#f0f0f0;&amp;quot;|&#039;&#039;&#039;C=O Frequency&#039;&#039;&#039;&lt;br /&gt;
|-&lt;br /&gt;
|   ONIOM with ME||4 hours 40 minutes 55.2 seconds||1804&lt;br /&gt;
|-&lt;br /&gt;
|   ONIOM with EM||13 hours 10 minutes 50.4 seconds||1745&lt;br /&gt;
|}&lt;/div&gt;</summary>
		<author><name>Af706</name></author>
	</entry>
	<entry>
		<id>https://chemwiki.ch.ic.ac.uk/index.php?title=AndyForesterONIOM&amp;diff=103168</id>
		<title>AndyForesterONIOM</title>
		<link rel="alternate" type="text/html" href="https://chemwiki.ch.ic.ac.uk/index.php?title=AndyForesterONIOM&amp;diff=103168"/>
		<updated>2010-03-10T14:43:54Z</updated>

		<summary type="html">&lt;p&gt;Af706: /* Minimal model */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;=ONIOM calculations on 1W7S=&lt;br /&gt;
&lt;br /&gt;
Back to Main - https://www.ch.ic.ac.uk/wiki/index.php/AndyForester&lt;br /&gt;
&lt;br /&gt;
Using the created input file ONIOM calculations were submitted always using the DFT B3LYP 6-31G* : Amber scheme.&lt;br /&gt;
&lt;br /&gt;
==Minimal model==&lt;br /&gt;
&lt;br /&gt;
Optimisation started from the Amber optimization chk file, this was because we want to compare frequencies at the SAME minimum critical point for each level of theory. &lt;br /&gt;
&lt;br /&gt;
The minimal model includes the planer portion of the chromophore residue (number 64 - CSY) and contains 16 heavy atoms and 8 hydrogens. The smaller the model region, the more efficient the ONIOM calculation becomes since only model is treated with the DFT costly theory. The exact input command line was - &#039;&#039;#p opt oniom(b3lyp/6-31g(d):amber=softfirst) geom=connectivity&#039;&#039;. The time taken to optimise was 2 hours 25 minutes  1.2 seconds and the final energy was -736.838427249468. This optimisation uses the Gaussian default Mechanical Embedding scheme where the MM charges are not included in the high level model - this is cheaper but not as accurate. Another calculation was started from the same inital structure but optimised using Electronic Embedding (polarisation of the wavefunction), this input line was - &lt;br /&gt;
&#039;&#039;#p opt oniom(b3lyp/6-31g(d):amber=softfirst)=embed geom=connectivity&#039;&#039;. The time taken for optimisation was 7 hours 25 minutes 29.3 seconds and the final energy was -736.807010778592.&lt;br /&gt;
&lt;br /&gt;
Frequency calculations were then run on the resulting geometries. For the inputs with water molecules 10A near the chromophore, 3N-6 frequencies were returned. When comparing these frequencies we used the C=O frequency which is discussed in detail in many papers. It involves a C=O stretch mixed with a C=C stretch in the chormophore and has been assigned experimentally to occur at 1680-1&lt;br /&gt;
&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot; border=&amp;quot;1&amp;quot;&lt;br /&gt;
| align=&amp;quot;center&amp;quot; style=&amp;quot;background:#f0f0f0;&amp;quot;|&#039;&#039;&#039;Calculation&#039;&#039;&#039;&lt;br /&gt;
| align=&amp;quot;center&amp;quot; style=&amp;quot;background:#f0f0f0;&amp;quot;|&#039;&#039;&#039;Time&#039;&#039;&#039;&lt;br /&gt;
| align=&amp;quot;center&amp;quot; style=&amp;quot;background:#f0f0f0;&amp;quot;|&#039;&#039;&#039;C=O Frequency&#039;&#039;&#039;&lt;br /&gt;
|-&lt;br /&gt;
|   ONIOM with ME||4 hours 40 minutes 55.2 seconds||1804&lt;br /&gt;
|-&lt;br /&gt;
|   ONIOM with EM||13 hours 10 minutes 50.4 seconds||1745&lt;br /&gt;
|}&lt;/div&gt;</summary>
		<author><name>Af706</name></author>
	</entry>
	<entry>
		<id>https://chemwiki.ch.ic.ac.uk/index.php?title=AndyForesterONIOM&amp;diff=103161</id>
		<title>AndyForesterONIOM</title>
		<link rel="alternate" type="text/html" href="https://chemwiki.ch.ic.ac.uk/index.php?title=AndyForesterONIOM&amp;diff=103161"/>
		<updated>2010-03-10T14:27:13Z</updated>

		<summary type="html">&lt;p&gt;Af706: /* Minimal model */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;=ONIOM calculations on 1W7S=&lt;br /&gt;
&lt;br /&gt;
Back to Main - https://www.ch.ic.ac.uk/wiki/index.php/AndyForester&lt;br /&gt;
&lt;br /&gt;
Using the created input file ONIOM calculations were submitted always using the DFT B3LYP 6-31G* : Amber scheme.&lt;br /&gt;
&lt;br /&gt;
==Minimal model==&lt;br /&gt;
&lt;br /&gt;
Optimisation started from the Amber optimization chk file, this was because we want to compare frequencies at the SAME minimum critical point for each level of theory. &lt;br /&gt;
&lt;br /&gt;
The minimal model includes the planer portion of the chromophore residue (number 64 - CSY) and contains 16 heavy atoms and 8 hydrogens. The smaller the model region, the more efficient the ONIOM calculation becomes since only model is treated with the DFT costly theory. The exact input command line was - &#039;&#039;#p opt oniom(b3lyp/6-31g(d):amber=softfirst) geom=connectivity&#039;&#039;. The time taken to optimise was 2 hours 25 minutes  1.2 seconds and the final energy was -736.838427249468. This optimisation uses the Gaussian default Mechanical Embedding scheme where the MM charges are not included in the high level model - this is cheaper but not as accurate. Another calculation was started from the same inital structure but optimised using Electronic Embedding (polarisation of the wavefunction), this input line was - &lt;br /&gt;
&#039;&#039;#p opt oniom(b3lyp/6-31g(d):amber=softfirst)=embed geom=connectivity&#039;&#039;. The time taken for optimisation was 7 hours 25 minutes 29.3 seconds and the final energy was -736.807010778592.&lt;br /&gt;
&lt;br /&gt;
Frequency calculations were then run on the resulting geometries. For the inputs with water molecules 10A near the chromophore, 3N-6 frequencies were returned.&lt;br /&gt;
&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot; border=&amp;quot;1&amp;quot;&lt;br /&gt;
| align=&amp;quot;center&amp;quot; style=&amp;quot;background:#f0f0f0;&amp;quot;|&#039;&#039;&#039;Calculation&#039;&#039;&#039;&lt;br /&gt;
| align=&amp;quot;center&amp;quot; style=&amp;quot;background:#f0f0f0;&amp;quot;|&#039;&#039;&#039;Time&#039;&#039;&#039;&lt;br /&gt;
| align=&amp;quot;center&amp;quot; style=&amp;quot;background:#f0f0f0;&amp;quot;|&#039;&#039;&#039;C=O Frequency&#039;&#039;&#039;&lt;br /&gt;
|-&lt;br /&gt;
|   ONIOM with ME||4 hours 40 minutes 55.2 seconds||1804&lt;br /&gt;
|-&lt;br /&gt;
|   ONIOM with EM||13 hours 10 minutes 50.4 seconds||1745&lt;br /&gt;
|}&lt;/div&gt;</summary>
		<author><name>Af706</name></author>
	</entry>
	<entry>
		<id>https://chemwiki.ch.ic.ac.uk/index.php?title=AndyForesterONIOM&amp;diff=103160</id>
		<title>AndyForesterONIOM</title>
		<link rel="alternate" type="text/html" href="https://chemwiki.ch.ic.ac.uk/index.php?title=AndyForesterONIOM&amp;diff=103160"/>
		<updated>2010-03-10T14:24:27Z</updated>

		<summary type="html">&lt;p&gt;Af706: /* Minimal model */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;=ONIOM calculations on 1W7S=&lt;br /&gt;
&lt;br /&gt;
Back to Main - https://www.ch.ic.ac.uk/wiki/index.php/AndyForester&lt;br /&gt;
&lt;br /&gt;
Using the created input file ONIOM calculations were submitted always using the DFT B3LYP 6-31G* : Amber scheme.&lt;br /&gt;
&lt;br /&gt;
==Minimal model==&lt;br /&gt;
&lt;br /&gt;
Optimisation started from the Amber optimization chk file, this was because we want to compare frequencies at the SAME minimum critical point for each level of theory. &lt;br /&gt;
&lt;br /&gt;
The minimal model includes the planer portion of the chromophore residue (number 64 - CSY) and contains 16 heavy atoms and 8 hydrogens. The smaller the model region, the more efficient the ONIOM calculation becomes since only model is treated with the DFT costly theory. The exact input command line was - &#039;&#039;#p opt oniom(b3lyp/6-31g(d):amber=softfirst) geom=connectivity&#039;&#039;. The time taken to optimise was 2 hours 25 minutes  1.2 seconds and the final energy was -736.838427249468. This optimisation uses the Gaussian default Mechanical Embedding scheme where the MM charges are not included in the high level model - this is cheaper but not as accurate. Another calculation was started from the same inital structure but optimised using Electronic Embedding (polarisation of the wavefunction), this input line was - &lt;br /&gt;
&#039;&#039;#p opt oniom(b3lyp/6-31g(d):amber=softfirst)=embed geom=connectivity&#039;&#039;. The time taken for optimisation was 7 hours 25 minutes 29.3 seconds and the final energy was -736.807010778592.&lt;br /&gt;
&lt;br /&gt;
Frequency calculations were then run on the resulting geometries. For the inputs with water molecules 10A near the chromophore, 3N-6 frequencies were returned.&lt;br /&gt;
&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot; border=&amp;quot;1&amp;quot;&lt;br /&gt;
| align=&amp;quot;center&amp;quot; style=&amp;quot;background:#f0f0f0;&amp;quot;|&#039;&#039;&#039;Calculation&#039;&#039;&#039;&lt;br /&gt;
| align=&amp;quot;center&amp;quot; style=&amp;quot;background:#f0f0f0;&amp;quot;|&#039;&#039;&#039;Time&#039;&#039;&#039;&lt;br /&gt;
| align=&amp;quot;center&amp;quot; style=&amp;quot;background:#f0f0f0;&amp;quot;|&#039;&#039;&#039;C=O Frequency&#039;&#039;&#039;&lt;br /&gt;
|-&lt;br /&gt;
|   ONIOM(DFT B3LYP 6-311G(d):AMBER)||4 minutes 22.0 seconds||8 minutes  5.5 seconds&lt;br /&gt;
|-&lt;br /&gt;
|   DFT 3LYP 6-311G(d)||2 hours 12 minutes 21.9 seconds||2 hours 30 minutes  0.8 seconds&lt;br /&gt;
|}&lt;/div&gt;</summary>
		<author><name>Af706</name></author>
	</entry>
	<entry>
		<id>https://chemwiki.ch.ic.ac.uk/index.php?title=AndyForesterONIOM&amp;diff=103159</id>
		<title>AndyForesterONIOM</title>
		<link rel="alternate" type="text/html" href="https://chemwiki.ch.ic.ac.uk/index.php?title=AndyForesterONIOM&amp;diff=103159"/>
		<updated>2010-03-10T14:23:21Z</updated>

		<summary type="html">&lt;p&gt;Af706: /* Minimal model */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;=ONIOM calculations on 1W7S=&lt;br /&gt;
&lt;br /&gt;
Back to Main - https://www.ch.ic.ac.uk/wiki/index.php/AndyForester&lt;br /&gt;
&lt;br /&gt;
Using the created input file ONIOM calculations were submitted always using the DFT B3LYP 6-31G* : Amber scheme.&lt;br /&gt;
&lt;br /&gt;
==Minimal model==&lt;br /&gt;
&lt;br /&gt;
Optimisation started from the Amber optimization chk file, this was because we want to compare frequencies at the SAME minimum critical point for each level of theory. &lt;br /&gt;
&lt;br /&gt;
The minimal model includes the planer portion of the chromophore residue (number 64 - CSY) and contains 16 heavy atoms and 8 hydrogens. The smaller the model region, the more efficient the ONIOM calculation becomes since only model is treated with the DFT costly theory. The exact input command line was - &#039;&#039;#p opt oniom(b3lyp/6-31g(d):amber=softfirst) geom=connectivity&#039;&#039;. The time taken to optimise was 2 hours 25 minutes  1.2 seconds and the final energy was -736.838427249468. This optimisation uses the Gaussian default Mechanical Embedding scheme where the MM charges are not included in the high level model - this is cheaper but not as accurate. Another calculation was started from the same inital structure but optimised using Electronic Embedding (polarisation of the wavefunction), this input line was - &lt;br /&gt;
&#039;&#039;#p opt oniom(b3lyp/6-31g(d):amber=softfirst)=embed geom=connectivity&#039;&#039;. The time taken for optimisation was 7 hours 25 minutes 29.3 seconds and the final energy was -736.807010778592.&lt;br /&gt;
&lt;br /&gt;
Frequency calculations were then run on the resulting geometries. For the inputs with water molecules 10A near the chromophore, 3N-6 frequencies were returned.&lt;/div&gt;</summary>
		<author><name>Af706</name></author>
	</entry>
	<entry>
		<id>https://chemwiki.ch.ic.ac.uk/index.php?title=AndyForesterONIOM&amp;diff=103155</id>
		<title>AndyForesterONIOM</title>
		<link rel="alternate" type="text/html" href="https://chemwiki.ch.ic.ac.uk/index.php?title=AndyForesterONIOM&amp;diff=103155"/>
		<updated>2010-03-10T14:15:09Z</updated>

		<summary type="html">&lt;p&gt;Af706: /* Minimal model */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;=ONIOM calculations on 1W7S=&lt;br /&gt;
&lt;br /&gt;
Back to Main - https://www.ch.ic.ac.uk/wiki/index.php/AndyForester&lt;br /&gt;
&lt;br /&gt;
Using the created input file ONIOM calculations were submitted always using the DFT B3LYP 6-31G* : Amber scheme.&lt;br /&gt;
&lt;br /&gt;
==Minimal model==&lt;br /&gt;
&lt;br /&gt;
Optimisation started from the Amber optimization chk file, this was because we want to compare frequencies at the SAME minimum critical point for each level of theory. &lt;br /&gt;
&lt;br /&gt;
The minimal model includes the planer portion of the chromophore residue (number 64 - CSY) and contains 16 heavy atoms and 8 hydrogens. The smaller the model region, the more efficient the ONIOM calculation becomes since only model is treated with the DFT costly theory. The exact input command line was - &#039;&#039;#p opt oniom(b3lyp/6-31g(d):amber=softfirst) geom=connectivity&#039;&#039;. The time taken to optimise was 2 hours 25 minutes  1.2 seconds and the final energy was -736.838427249468.&lt;br /&gt;
&lt;br /&gt;
This optimisation uses the Gaussian default Mechanical Embedding scheme where the MM charges are not included in the high level model - this is cheaper but not as accurate.&lt;/div&gt;</summary>
		<author><name>Af706</name></author>
	</entry>
	<entry>
		<id>https://chemwiki.ch.ic.ac.uk/index.php?title=AndyForesterONIOM&amp;diff=103151</id>
		<title>AndyForesterONIOM</title>
		<link rel="alternate" type="text/html" href="https://chemwiki.ch.ic.ac.uk/index.php?title=AndyForesterONIOM&amp;diff=103151"/>
		<updated>2010-03-10T14:05:47Z</updated>

		<summary type="html">&lt;p&gt;Af706: /* ONIOM calculations on 1W7S */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;=ONIOM calculations on 1W7S=&lt;br /&gt;
&lt;br /&gt;
Back to Main - https://www.ch.ic.ac.uk/wiki/index.php/AndyForester&lt;br /&gt;
&lt;br /&gt;
Using the created input file ONIOM calculations were submitted always using the DFT B3LYP 6-31G* : Amber scheme.&lt;br /&gt;
&lt;br /&gt;
==Minimal model==&lt;br /&gt;
&lt;br /&gt;
Optimisation started from the Amber optimization chk file, this was because we want to compare frequencies at the SAME minimum critical point for each level of theory.&lt;/div&gt;</summary>
		<author><name>Af706</name></author>
	</entry>
	<entry>
		<id>https://chemwiki.ch.ic.ac.uk/index.php?title=AndyForester&amp;diff=103148</id>
		<title>AndyForester</title>
		<link rel="alternate" type="text/html" href="https://chemwiki.ch.ic.ac.uk/index.php?title=AndyForester&amp;diff=103148"/>
		<updated>2010-03-10T13:58:33Z</updated>

		<summary type="html">&lt;p&gt;Af706: /* 22nd Feb 2010 */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;=HBDI Calculations=&lt;br /&gt;
==Wed 21st Oct 2009==&lt;br /&gt;
&lt;br /&gt;
- Started project work&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
- Gaussview and Gaussian working&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
- Working out how to use the command inputs&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
- Trying to sort VPN out&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
- Working to get the project proposal written&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
&lt;br /&gt;
==Tue 27th Oct 2009==&lt;br /&gt;
&lt;br /&gt;
Drew up the HBDI molecule in Gaussview, the molecule will be optimised at the HF 3-21G level on the mac (locally). &amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
DFT B3LYP 6-31G (Becke threeparameter hybrid exchange functional5 and the Lee-Yang-Parr correlation functional) method optimisation was run on the resulting file, the MP2 6-31G method optimisation. Each of these had a corresponding freq calculation to check that minima were reached. All calculations were successful, with no negative vibrations.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;jmol&amp;gt;&lt;br /&gt;
  &amp;lt;jmolAppletButton&amp;gt;&lt;br /&gt;
    &amp;lt;uploadedFileContents&amp;gt;HBDI_MP2_6-31G_FREQ.mol&amp;lt;/uploadedFileContents&amp;gt;&lt;br /&gt;
    &amp;lt;text&amp;gt;HBDI optimised at the MP2 6-31G level&amp;lt;/text&amp;gt;&lt;br /&gt;
    &amp;lt;color&amp;gt;#FFFFFF&amp;lt;/color&amp;gt;&lt;br /&gt;
    &amp;lt;size&amp;gt;600&amp;lt;/size&amp;gt;&lt;br /&gt;
    &amp;lt;/jmolAppletButton&amp;gt;&lt;br /&gt;
&amp;lt;/jmol&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
&lt;br /&gt;
==Wed 28th Oct 2009==&lt;br /&gt;
&lt;br /&gt;
Ran DFT B3YLP 6-311G calculations on HBDI (Opt and Freq) since these were the calculations used in a published study of HBDI.&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
Optimisation job finshed successfully, the frequency calculation did not finish succesfully. Error was:&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
&lt;br /&gt;
 ****Warning!!: The largest alpha MO coefficient is  0.17588883D+02&lt;br /&gt;
 Symmetrizing basis deriv contribution to polar:&lt;br /&gt;
 IMax=3 JMax=2 DiffMx= 0.00D+00&lt;br /&gt;
 G2DrvN: will do    29 centers at a time, making    1 passes doing MaxLOS=1.&lt;br /&gt;
 Calling FoFCou, ICntrl=  3107 FMM=F I1Cent=   0 AccDes= 0.00D+00.&lt;br /&gt;
 FoFDir/FoFCou used for L=0 through L=1.&lt;br /&gt;
 End of G2Drv Frequency-dependent properties file   721 does not exist.&lt;br /&gt;
 End of G2Drv Frequency-dependent properties file   722 does not exist.&lt;br /&gt;
          IDoAtm=1111111111111111111111111111&lt;br /&gt;
          Differentiating once with respect to electric field.&lt;br /&gt;
                with respect to dipole field.&lt;br /&gt;
          Differentiating once with respect to nuclear coordinates.&lt;br /&gt;
          Keep R1 ints in memory in canonical form, NReq=447529180.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
==Thurs 29th Oct 2009==&lt;br /&gt;
&lt;br /&gt;
===Learning to use ONIOM===&lt;br /&gt;
&lt;br /&gt;
The example on chem wiki was attempted. Drawn up in Gaussview and optimised initially using molecular mechanics AMBER force field.&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
The ONIOM calculation was set up in Gaussview using DFT B3LYP 6-311G(d):AMBER methods with the benzene ring as the high level and chain at low level. Electronic embedding was NOT used in this initial calculation. &amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;jmol&amp;gt;&lt;br /&gt;
  &amp;lt;jmolAppletButton&amp;gt;&lt;br /&gt;
    &amp;lt;uploadedFileContents&amp;gt;ONIOM_B3LYP_6-311G_AMBER.mol&amp;lt;/uploadedFileContents&amp;gt;&lt;br /&gt;
    &amp;lt;text&amp;gt;Test Molecule optimised using B3LYP 6-311G(d):AMBER&amp;lt;/text&amp;gt;&lt;br /&gt;
    &amp;lt;color&amp;gt;#FFFFFF&amp;lt;/color&amp;gt;&lt;br /&gt;
    &amp;lt;size&amp;gt;600&amp;lt;/size&amp;gt;&lt;br /&gt;
  &amp;lt;/jmolAppletButton&amp;gt;&lt;br /&gt;
&amp;lt;/jmol&amp;gt;&lt;br /&gt;
&lt;br /&gt;
This conformation is NOT the lowest energy conformation, the initial AMBER MM calculation did not find the global minimum probably due to the drawn up starting geometry.&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
The molecule was drawn up again and DFT B3LYP 6-311G(d):AMBER and pure B3LYP 6-311G(d) calculations run. Absolute energy differences between the 2 conformations was very small at the ONIOM(DFT B3LYP 6-311G(d):AMBER) level. -232.27532671 vs -232.27315709 au respectively, a 0.00271au (4.359 744 17×10-18 J = 1au) difference.&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;jmol&amp;gt;&lt;br /&gt;
  &amp;lt;jmolAppletButton&amp;gt;&lt;br /&gt;
    &amp;lt;uploadedFileContents&amp;gt;ONIOM_B3LYP_6-311G_AMBER2.mol&amp;lt;/uploadedFileContents&amp;gt;&lt;br /&gt;
    &amp;lt;text&amp;gt;Test Molecule optimised using B3LYP 6-311G(d):AMBER&amp;lt;/text&amp;gt;&lt;br /&gt;
    &amp;lt;color&amp;gt;#FFFFFF&amp;lt;/color&amp;gt;&lt;br /&gt;
    &amp;lt;size&amp;gt;600&amp;lt;/size&amp;gt;&lt;br /&gt;
  &amp;lt;/jmolAppletButton&amp;gt;&lt;br /&gt;
&amp;lt;/jmol&amp;gt;&lt;br /&gt;
&lt;br /&gt;
It has already become obvious that using ONIOM decreases calculation time considerably. The table below shows a &amp;quot;CPU time&amp;quot; comparison on the above molecule using the different approaches.&lt;br /&gt;
&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot; border=&amp;quot;1&amp;quot;&lt;br /&gt;
| align=&amp;quot;center&amp;quot; style=&amp;quot;background:#f0f0f0;&amp;quot;|&#039;&#039;&#039;Calculation&#039;&#039;&#039;&lt;br /&gt;
| align=&amp;quot;center&amp;quot; style=&amp;quot;background:#f0f0f0;&amp;quot;|&#039;&#039;&#039;Time (1st attempt)&#039;&#039;&#039;&lt;br /&gt;
| align=&amp;quot;center&amp;quot; style=&amp;quot;background:#f0f0f0;&amp;quot;|&#039;&#039;&#039;Time (2nd attempt)&#039;&#039;&#039;&lt;br /&gt;
|-&lt;br /&gt;
|   ONIOM(DFT B3LYP 6-311G(d):AMBER)||4 minutes 22.0 seconds||8 minutes  5.5 seconds&lt;br /&gt;
|-&lt;br /&gt;
|   DFT 3LYP 6-311G(d)||2 hours 12 minutes 21.9 seconds||2 hours 30 minutes  0.8 seconds&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
&lt;br /&gt;
==Mon 2nd Nov 2009==&lt;br /&gt;
&lt;br /&gt;
Having obtained optimized structures of HBDI at various theory levels and basis set variations, these structures will now be compared to previous literature. Extensive studies have been performed using HF and B3LYP methods previously. It has been noted in Jaspers recent review that HF does not model frequencies very accurately - my computations agree HF 6-31G frequencies are very far out. B3LYP frequencies are much closer to the experimental values.&lt;br /&gt;
&lt;br /&gt;
==Tues 3rd Nov 2009==&lt;br /&gt;
&lt;br /&gt;
Tabulated frerquency comparison&lt;br /&gt;
&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot; border=&amp;quot;1&amp;quot;&lt;br /&gt;
| align=&amp;quot;center&amp;quot; style=&amp;quot;background:#f0f0f0;&amp;quot;|&lt;br /&gt;
| align=&amp;quot;center&amp;quot; style=&amp;quot;background:#f0f0f0;&amp;quot;|&#039;&#039;&#039;A.Esposito HF 6-31G(d)&#039;&#039;&#039;&lt;br /&gt;
| align=&amp;quot;center&amp;quot; style=&amp;quot;background:#f0f0f0;&amp;quot;|&#039;&#039;&#039;A.Esposito B3LYP 6-31G(d)&#039;&#039;&#039;&lt;br /&gt;
| align=&amp;quot;center&amp;quot; style=&amp;quot;background:#f0f0f0;&amp;quot;|&#039;&#039;&#039;X.He B3LYP 6-31G(d)&#039;&#039;&#039;&lt;br /&gt;
| align=&amp;quot;center&amp;quot; style=&amp;quot;background:#f0f0f0;&amp;quot;|&#039;&#039;&#039;HF 6-31G(d)&#039;&#039;&#039;&lt;br /&gt;
| align=&amp;quot;center&amp;quot; style=&amp;quot;background:#f0f0f0;&amp;quot;|&#039;&#039;&#039;B3LYP 6-31G(d)&#039;&#039;&#039;&lt;br /&gt;
|-&lt;br /&gt;
| C=O||1987||1802||1802||1986||1801&lt;br /&gt;
|-&lt;br /&gt;
| C=C||1880||1705||1704||1880||1704&lt;br /&gt;
|-&lt;br /&gt;
| Phenol||1809||1665||1663||1812||1666&lt;br /&gt;
|-&lt;br /&gt;
| Phenol||1781||1635||1631||1777||1635&lt;br /&gt;
|-&lt;br /&gt;
| C=N||1844||1620||1618||1844||1618&lt;br /&gt;
|-&lt;br /&gt;
| Phenol||1696||1562||1558||1694||1562&lt;br /&gt;
|-&lt;br /&gt;
| C-N||1607||1483||N/A||1606||1482&lt;br /&gt;
|-&lt;br /&gt;
| Phenol||1602||1487||N/A||1603||1488&lt;br /&gt;
|-&lt;br /&gt;
| Total Energy(au)||||||||-720.043||-724.465&lt;br /&gt;
|-&lt;br /&gt;
| &lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
==Thurs 5th Nov 2009==&lt;br /&gt;
&lt;br /&gt;
Having got Jaspers input files, several things were noticed:&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
* Solvent models were used - should be in vacum.... I have re run these in G09 with No solvent on both DFT 6-311G(d,p) and HF 6-31+G(d) methods/basis sets for comparison&lt;br /&gt;
* The models in his Biophys paper using other residues in contact COO- for example are not present. He has not reported the pure HBDI calculation results.&lt;br /&gt;
* More than just HBDI is likely to be required, meaning larger than expected high level region in the GFP calculation. (3 layer ONIOM DFT:HF:AMBER)?&lt;br /&gt;
&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
Other things to do:&lt;br /&gt;
&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
* Work on a guide to loading .pdb files into Gaussview 5 - how much information is maintained?&lt;br /&gt;
* Are residues defined still in Gaussview input files&lt;br /&gt;
* AMBER MM requires definitions of charges etc... how easy is it to import from .pdb and get a file that can be run in opt etc...&lt;br /&gt;
&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
* Find a model system which is larger than HBDI but incorporates this molecule &amp;lt;=500 atoms (Limit of size for pure DFT calculation)&lt;br /&gt;
* Possibilities include a system alread developed by someone else online&lt;br /&gt;
* A &amp;quot;custom&amp;quot; system made by me to model this (time intensive)&lt;br /&gt;
* &amp;quot;strip&amp;quot; back the GFP pdb file to 500 atoms surrounding the HBDI chromophore.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
==Fri 6th Nov 2009==&lt;br /&gt;
&lt;br /&gt;
Yesterday started by trying to follow the tutorial on loading the GFP molecule from .pdb, this resulted in a problem with Gaussviews addition of hydrogens. It over protonated the nitrogens and carbon in the chromophore leading to over valent parts. I then tried several other molecules (PYP, a scorpion toxin and a strand of RNA) which had no problems with hydrogen addition.&lt;br /&gt;
&lt;br /&gt;
I tried to use ChemBio 3D to load the .pdb and correctly assign hydrogens/connectivity. This resulted in the same protonation problems!&lt;br /&gt;
&lt;br /&gt;
==Mon 9th Nov 2009==&lt;br /&gt;
&lt;br /&gt;
TINKER - http://dasher.wustl.edu/tinker/ was installed with the Force Field Explorer (FFE) interface on the mac. This was used to load the same .pdb file and resulted in the same protonation around the chromophore. So far the SAME protonation pattern (incorrect) around the chromophore has been found using:&lt;br /&gt;
&lt;br /&gt;
*Gaussview&lt;br /&gt;
*ChemBio 3D&lt;br /&gt;
*TINKER&lt;br /&gt;
&lt;br /&gt;
Looking into how the .pdb file format works, the molecule is built from a series of residues, it becomes clear as you look at the sequence that is building the GFP protein chromophore (residues 65,66,67 in 1GFL) are not then reproducing the internal cyclisation properly leaving protons attached and thus atoms hypervalent. This accounts for the consistent error loading from the .pdb. As a result must all molecules expressed as a .pdb file must be pure protein structures...?&lt;br /&gt;
&lt;br /&gt;
===Comparison of the 2 available wild type GFP structures===&lt;br /&gt;
&lt;br /&gt;
The 2 structures being considered here are:&lt;br /&gt;
&lt;br /&gt;
*1GFL - Yang et al. -http://www.pdb.org/pdb/explore/explore.do?structureId=1GFL&lt;br /&gt;
*1W7S - Jaspers stucture - http://www.pdb.org/pdb/explore/explore.do?structureId=1W7S&lt;br /&gt;
&lt;br /&gt;
These 2 structures differ in that 1GFL contains 2 GFP molecules, whereas 1W7S contains 4. These were both stripped down to 1 unit of GFP, using the &amp;quot;pdb secondary structure&amp;quot; function in Gaussview 5. 1GFL had chain B removed and 1W7S had chains B,C and D removed. Next the residues were investigated that created the chromophore. The results are not consistent because GFL uses 3 residues (65,66,67) to form the chromophore. 1W7S has residue 66 (labelled CSY) as the entire chromophore! Both get protonated the same (incorrectly) which is odd considering how they molecules are built.&lt;br /&gt;
&lt;br /&gt;
The next test was to &amp;quot;modify bond&amp;quot; in Gaussview 5 and create a &amp;quot;cleaned&amp;quot; chromophore structure (no bonding information from the residue) by manually changing the bonds. The chromophore in each .pdb file was altered to look like HBDI in terms of double single bonds and the hydrogens taken out (makes no difference as these are created each time the .pdb is loaded). When the new .pdb files (1GFL_cleaned.pdb and 1W7S_cleaned.pdb) were loaded they were correctly protonated! Success.&lt;br /&gt;
&lt;br /&gt;
===1W7S improvement===&lt;br /&gt;
&lt;br /&gt;
Chain A (not cleaned) of the 1W7S .pdb was submitted to run a AMBER calculation, as expected it failed with the lines:&lt;br /&gt;
&lt;br /&gt;
 Missing atomic parameters for atom   474 IAtTyp=    20000000&lt;br /&gt;
 Missing atomic parameters.&lt;br /&gt;
&lt;br /&gt;
This error says that the atom type of atom 474 is not defined. One way to counter this is to locate the atom and then change its definition manually (as shown in the example https://www.ch.ic.ac.uk/wiki/index.php/PYP_:_Photoactive_yellow_protein - time intensive!)&lt;br /&gt;
&lt;br /&gt;
Instead the molecule was cleaned (manually changing the bonding in the chromophore), after the bond updates the structure was re submitted, again failing but with a different message:&lt;br /&gt;
&lt;br /&gt;
 Include all MM classes&lt;br /&gt;
  Bondstretch undefined between atoms    473    478 CC-N* [H,H]  &lt;br /&gt;
  Bondstretch undefined between atoms    475    476 CC-CC [H,H]  &lt;br /&gt;
  Bondstretch undefined between atoms    475    486 CC-CM [H,H]  &lt;br /&gt;
  Bondstretch undefined between atoms    476    477 CC-O [H,H]  &lt;br /&gt;
  Bondstretch undefined between atoms    476    478 CC-N* [H,H]  &lt;br /&gt;
  Bondstretch undefined between atoms    486   2559 CM-HC [H,H]  &lt;br /&gt;
  Angle bend  undefined between atoms    473    474    475 CC-NB-CC [H,H,H]  &lt;br /&gt;
  Angle bend  undefined between atoms    473    478    479 CC-N*-CT [H,H,H]  &lt;br /&gt;
  Angle bend  undefined between atoms    473    478    476 CC-N*-CC [H,H,H]  &lt;br /&gt;
  Angle bend  undefined between atoms    473    482    483 CC-CT-N [H,H,H]  &lt;br /&gt;
  Angle bend  undefined between atoms    473    482   3857 CC-CT-H1 [H,H,H]  &lt;br /&gt;
  Angle bend  undefined between atoms    474    475    476 NB-CC-CC [H,H,H]  &lt;br /&gt;
  Angle bend  undefined between atoms    474    475    486 NB-CC-CM [H,H,H]  &lt;br /&gt;
  Angle bend  undefined between atoms    474    473    478 NB-CC-N* [H,H,H]  &lt;br /&gt;
  Angle bend  undefined between atoms    475    476    477 CC-CC-O [H,H,H]  &lt;br /&gt;
  Angle bend  undefined between atoms    475    476    478 CC-CC-N* [H,H,H]  &lt;br /&gt;
  Angle bend  undefined between atoms    475    486    487 CC-CM-CA [H,H,H]  &lt;br /&gt;
  Angle bend  undefined between atoms    475    486   2559 CC-CM-HC [H,H,H]  &lt;br /&gt;
  Angle bend  undefined between atoms    476    478    479 CC-N*-CT [H,H,H]  &lt;br /&gt;
  Angle bend  undefined between atoms    476    475    486 CC-CC-CM [H,H,H]  &lt;br /&gt;
  Angle bend  undefined between atoms    477    476    478 O-CC-N* [H,H,H]  &lt;br /&gt;
  Angle bend  undefined between atoms    478    479    480 N*-CT-C [H,H,H]  &lt;br /&gt;
  Angle bend  undefined between atoms    478    473    482 N*-CC-CT [H,H,H]  &lt;br /&gt;
  Angle bend  undefined between atoms    486    487    488 CM-CA-CA [H,H,H]  &lt;br /&gt;
  Angle bend  undefined between atoms    486    487    489 CM-CA-CM [H,H,H]  &lt;br /&gt;
  Angle bend  undefined between atoms    487    486   2559 CA-CM-HC [H,H,H]  &lt;br /&gt;
  Angle bend  undefined between atoms    488    487    489 CA-CA-CM [H,H,H]  &lt;br /&gt;
  Angle bend  undefined between atoms    490    492    491 CA-C-CM [H,H,H]  &lt;br /&gt;
  Angle bend  undefined between atoms    491    492    493 CM-C-OH [H,H,H]  &lt;br /&gt;
  MM function not complete&lt;br /&gt;
&lt;br /&gt;
This is a better error message since all the atoms are now defined, only parameters are missing. For parameters that involve only atoms in the chromophore at the high level in the ONIOM calculation, the values can be set to 0 in the input file. This is because the ONIOM extrapolation will cancel MM from this high level region, To do this the keyword &amp;quot;Amber=Hardfirst&amp;quot; and wildcards for the missing parameters which are zero valued are placed in the input file.&lt;br /&gt;
&lt;br /&gt;
[[Image:Atoms involving undefined parameters.png]]&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
&lt;br /&gt;
The above screeshot shows highlighted the atoms listed in the error... most of them if not all will be involved in the high level calculation and thus may not require parameters defining (other than to be set to 0). The reason the parameters are undefined is that the AMBER force field is very specific to protein bonds and thus the organic chromophore has bonds that are not found in this parameter set.&lt;br /&gt;
&lt;br /&gt;
The atoms in the chromophore were frozen in the atom group editor and the input file had a parameter set added to the end setting chromophore parameters to 0:&lt;br /&gt;
&lt;br /&gt;
 HrmStr1 CC N* 0.0 0.0&lt;br /&gt;
 HrmStr1 CC CC 0.0 0.0&lt;br /&gt;
 HrmStr1 CC CM 0.0 0.0&lt;br /&gt;
 HrmStr1 CC O 0.0 0.0&lt;br /&gt;
 HrmStr1 CM HC 0.0 0.0&lt;br /&gt;
 HrmBnd1 CC NB CC 0.0 0.0&lt;br /&gt;
 HrmBnd1 CC N* CC 0.0 0.0&lt;br /&gt;
 HrmBnd1 CC CT N 0.0 0.0&lt;br /&gt;
 HrmBnd1 CC CT H1 0.0 0.0&lt;br /&gt;
 HrmBnd1 NB CC CC 0.0 0.0&lt;br /&gt;
 HrmBnd1 NB CC CM 0.0 0.0&lt;br /&gt;
 HrmBnd1 NB CC N* 0.0 0.0&lt;br /&gt;
 HrmBnd1 CC CC O  0.0 0.0&lt;br /&gt;
 HrmBnd1 CC CC N* 0.0 0.0&lt;br /&gt;
 HrmBnd1 CC CM CA 0.0 0.0&lt;br /&gt;
 HrmBnd1 CC CM HC 0.0 0.0&lt;br /&gt;
 HrmBnd1 CC CC CM 0.0 0.0&lt;br /&gt;
 HrmBnd1 O CC N*  0.0 0.0&lt;br /&gt;
 HrmBnd1 N* CT C  0.0 0.0&lt;br /&gt;
 HrmBnd1 N* CC CT 0.0 0.0&lt;br /&gt;
 HrmBnd1 CM CA CA 0.0 0.0&lt;br /&gt;
 HrmBnd1 CA CM HC 0.0 0.0&lt;br /&gt;
 HrmBnd1 CA C CM 0.0 0.0&lt;br /&gt;
 HrmBnd1 CM C OH 0.0 0.0&lt;br /&gt;
 HrmBnd1 CC N* CT 0.0 0.0&lt;br /&gt;
 HrmBnd1 CM CA CM 0.0 0.0&lt;br /&gt;
&lt;br /&gt;
This file then ran successfully for an AMBER single point energy calculation. It was then submitted for AMBER optimisation using &amp;quot;amber=hardfirst geom=connectivity&amp;quot; and returned a successful minimum with Forces and Displacements converged:&lt;br /&gt;
&lt;br /&gt;
         Item               Value     Threshold  Converged?&lt;br /&gt;
  Maximum Force            0.000089     0.000450     YES&lt;br /&gt;
  RMS     Force            0.000004     0.000300     YES&lt;br /&gt;
  Maximum Displacement     0.000089     0.001800     YES&lt;br /&gt;
  RMS     Displacement     0.000004     0.001200     YES&lt;br /&gt;
  Predicted change in Energy=-1.156135D-07&lt;br /&gt;
  Optimization completed.&lt;br /&gt;
     -- Stationary point found.&lt;br /&gt;
&lt;br /&gt;
===Cleaned .pdb files and TINKER===&lt;br /&gt;
&lt;br /&gt;
*The files that had been cleaned were opened in TINKER, it is necessary to convert the .pdb to a .xyz file to run any calculations within TINKER. This is currently failing due to an &amp;quot;atom missing on SER-228&amp;quot; - checking this there is no SER residue at 228 (in fact a GLU) so a bit puzzled. This is the 1W7S cleaned version.&lt;br /&gt;
&lt;br /&gt;
*PYP was loaded in its .pdb format to TINKER and was immediately successful in being converted to .xyz then a MINIMISE algorithm ran successfully with the energy at its X-ray crystal structure as 261KJ/mol reduced to -3670.5355KJ/mol in 1645 iterations - this took a while (45mins locally)!&lt;br /&gt;
&lt;br /&gt;
*Interestingly the .xyz output from the minimization can be converted back to .pdb (loses hydrogens minimization) and this PYP file was loaded into Gaussview 5 - the file then ran successfully for an AMBER single point energy calculation! cf. the first attempt to get PYP to run was unsuccessful, and the tutorial underwent much more hassle to run this calculation.&lt;br /&gt;
&lt;br /&gt;
===What .pdb information is retained?===&lt;br /&gt;
&lt;br /&gt;
The only molecule that has been run with AMBER opt (GFP) has lost its secondary structure information but RETAINED its residue information.&lt;br /&gt;
&lt;br /&gt;
==Fri 13th Nov 2009==&lt;br /&gt;
&lt;br /&gt;
===Conversion from.xyz to .com===&lt;br /&gt;
A protein with 2 residues (Glu-Asp) was built using TINKER, saved as:&lt;br /&gt;
&lt;br /&gt;
*.xyz&lt;br /&gt;
*.pdb&lt;br /&gt;
*.com&lt;br /&gt;
&lt;br /&gt;
The initial format was .xyz (created in TINEKR as a native format), converted to .pdb using TINKER and then loaded into Gaussview and saved as a .com.&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
&lt;br /&gt;
.xyz&lt;br /&gt;
&lt;br /&gt;
    36  Protein built with TINKER/Force Field Explorer&lt;br /&gt;
     1  CT     0.000000    0.000000    0.000000   340     2     4     5     6&lt;br /&gt;
     2  C      0.000000    0.000000    1.510000   342     1     3     7&lt;br /&gt;
     3  O      1.028938    0.000000    2.165506   343     2&lt;br /&gt;
     4  HC     1.056270    0.000000   -0.341166   341     1&lt;br /&gt;
     5  HC    -0.497647    0.931694   -0.341166   341     1&lt;br /&gt;
     6  HC    -0.497647   -0.931694   -0.341166   341     1&lt;br /&gt;
     7  N     -1.236201    0.000000    2.027114   232     2     8    11&lt;br /&gt;
     8  CT    -1.446961    0.000000    3.471822   233     7     9    12    13&lt;br /&gt;
     9  C     -2.509557   -0.992751    3.878558   234     8    10    22&lt;br /&gt;
    10  O     -3.569672   -1.098634    3.284129   236     9&lt;br /&gt;
    11  H     -2.025518    0.000000    1.381066   235     7&lt;br /&gt;
    12  H1    -0.503221   -0.281395    3.983946   237     8&lt;br /&gt;
    13  CT    -1.910491    1.397334    3.923704   238     8    14    18    19&lt;br /&gt;
    14  CT    -2.132799    1.397334    5.447574   240    13    15    20    21&lt;br /&gt;
    15  C     -2.585014    2.781238    5.848089   242    14    16    17&lt;br /&gt;
    16  O2    -2.816736    2.973609    7.061266   243    15&lt;br /&gt;
    17  O2    -2.693194    3.629064    4.935955   243    15&lt;br /&gt;
    18  HC    -2.861794    1.653482    3.412322   239    13&lt;br /&gt;
    19  HC    -1.132567    2.145528    3.664589   239    13&lt;br /&gt;
    20  HC    -1.170306    1.191123    5.960588   241    14&lt;br /&gt;
    21  HC    -2.946052    0.685840    5.701535   241    14&lt;br /&gt;
    22  N     -2.153997   -1.723291    4.944151   210     9    23    26&lt;br /&gt;
    23  CT    -3.047986   -2.739703    5.491225   211    22    24    27    28&lt;br /&gt;
    24  C     -3.120370   -2.663723    6.997574   212    23    25    34&lt;br /&gt;
    25  O     -2.128241   -2.535414    7.695870   214    24&lt;br /&gt;
    26  H     -1.239994   -1.547921    5.361576   213    22&lt;br /&gt;
    27  H1    -4.072297   -2.585746    5.092246   215    23&lt;br /&gt;
    28  CT    -2.526745   -4.139104    5.114946   216    23    29    32    33&lt;br /&gt;
    29  C     -3.476741   -5.157722    5.698062   218    28    30    31&lt;br /&gt;
    30  O2    -3.212872   -6.362275    5.493304   219    29&lt;br /&gt;
    31  O2    -4.454666   -4.718803    6.341112   219    29&lt;br /&gt;
    32  HC    -1.530243   -4.298013    5.577366   217    28&lt;br /&gt;
    33  HC    -2.540086   -4.252948    4.010880   217    28&lt;br /&gt;
    34  N     -4.374888   -2.752228    7.460123   344    24    35    36&lt;br /&gt;
    35  H     -5.140865   -2.858197    6.794954   345    34&lt;br /&gt;
    36  H     -4.534830   -2.711583    8.466684   345    34&lt;br /&gt;
&lt;br /&gt;
.pdb&lt;br /&gt;
&lt;br /&gt;
 HEADER    Protein built with TINKER/Force Field Explorer&lt;br /&gt;
 COMPND&lt;br /&gt;
 SOURCE&lt;br /&gt;
 ATOM      1  CH3 ACE     1       0.000   0.000   0.000&lt;br /&gt;
 ATOM      2  C   ACE     1       0.000   0.000   1.510&lt;br /&gt;
 ATOM      3  O   ACE     1       1.029   0.000   2.166&lt;br /&gt;
 ATOM      4  H1  ACE     1       1.056   0.000  -0.341&lt;br /&gt;
 ATOM      5  H2  ACE     1      -0.498   0.932  -0.341&lt;br /&gt;
 ATOM      6  H3  ACE     1      -0.498  -0.932  -0.341&lt;br /&gt;
 ATOM      7  N   GLU     2      -1.236   0.000   2.027&lt;br /&gt;
 ATOM      8  CA  GLU     2      -1.447   0.000   3.472&lt;br /&gt;
 ATOM      9  C   GLU     2      -2.510  -0.993   3.879&lt;br /&gt;
 ATOM     10  O   GLU     2      -3.570  -1.099   3.284&lt;br /&gt;
 ATOM     11  CB  GLU     2      -1.910   1.397   3.924&lt;br /&gt;
 ATOM     12  CG  GLU     2      -2.133   1.397   5.448&lt;br /&gt;
 ATOM     13  CD  GLU     2      -2.585   2.781   5.848&lt;br /&gt;
 ATOM     14  OE1 GLU     2      -2.817   2.974   7.061&lt;br /&gt;
 ATOM     15  OE2 GLU     2      -2.693   3.629   4.936&lt;br /&gt;
 ATOM     16  H   GLU     2      -2.026   0.000   1.381&lt;br /&gt;
 ATOM     17  HA  GLU     2      -0.503  -0.281   3.984&lt;br /&gt;
 ATOM     18  HB2 GLU     2      -2.862   1.653   3.412&lt;br /&gt;
 ATOM     19  HB3 GLU     2      -1.133   2.146   3.665&lt;br /&gt;
 ATOM     20  HG2 GLU     2      -1.170   1.191   5.961&lt;br /&gt;
 ATOM     21  HG3 GLU     2      -2.946   0.686   5.702&lt;br /&gt;
 ATOM     22  N   ASP     3      -2.154  -1.723   4.944&lt;br /&gt;
 ATOM     23  CA  ASP     3      -3.048  -2.740   5.491&lt;br /&gt;
 ATOM     24  C   ASP     3      -3.120  -2.664   6.998&lt;br /&gt;
 ATOM     25  O   ASP     3      -2.128  -2.535   7.696&lt;br /&gt;
 ATOM     26  CB  ASP     3      -2.527  -4.139   5.115&lt;br /&gt;
 ATOM     27  CG  ASP     3      -3.477  -5.158   5.698&lt;br /&gt;
 ATOM     28  OD1 ASP     3      -3.213  -6.362   5.493&lt;br /&gt;
 ATOM     29  OD2 ASP     3      -4.455  -4.719   6.341&lt;br /&gt;
 ATOM     30  H   ASP     3      -1.240  -1.548   5.362&lt;br /&gt;
 ATOM     31  HA  ASP     3      -4.072  -2.586   5.092&lt;br /&gt;
 ATOM     32  HB2 ASP     3      -1.530  -4.298   5.577&lt;br /&gt;
 ATOM     33  HB3 ASP     3      -2.540  -4.253   4.011&lt;br /&gt;
 ATOM     34  N   NH2     4      -4.375  -2.752   7.460&lt;br /&gt;
 ATOM     35  H1  NH2     4      -5.141  -2.858   6.795&lt;br /&gt;
 ATOM     36  H2  NH2     4      -4.535  -2.712   8.467&lt;br /&gt;
 END&lt;br /&gt;
&lt;br /&gt;
.com&lt;br /&gt;
&lt;br /&gt;
 %chk=File_Test_Glu-Asp.chk&lt;br /&gt;
 # hf/3-21g geom=connectivity&lt;br /&gt;
 &lt;br /&gt;
 Protein built with TINKER/Force Field Explorer&lt;br /&gt;
 &lt;br /&gt;
 0 1&lt;br /&gt;
 C(PDBName=CH3,ResName=ACE,ResNum=1)                   0.00000000    0.00000000    0.00000000&lt;br /&gt;
 C(PDBName=C,ResName=ACE,ResNum=1)                     0.00000000    0.00000000    1.51000000&lt;br /&gt;
 O(PDBName=O,ResName=ACE,ResNum=1)                     1.02900000    0.00000000    2.16600000&lt;br /&gt;
 H(PDBName=H1,ResName=ACE,ResNum=1)                    1.05600000    0.00000000   -0.34100000&lt;br /&gt;
 H(PDBName=H2,ResName=ACE,ResNum=1)                   -0.49800000    0.93200000   -0.34100000&lt;br /&gt;
 H(PDBName=H3,ResName=ACE,ResNum=1)                   -0.49800000   -0.93200000   -0.34100000&lt;br /&gt;
 N(PDBName=N,ResName=GLU,ResNum=2)                    -1.23600000    0.00000000    2.02700000&lt;br /&gt;
 C(PDBName=CA,ResName=GLU,ResNum=2)                   -1.44700000    0.00000000    3.47200000&lt;br /&gt;
 C(PDBName=C,ResName=GLU,ResNum=2)                    -2.51000000   -0.99300000    3.87900000&lt;br /&gt;
 O(PDBName=O,ResName=GLU,ResNum=2)                    -3.57000000   -1.09900000    3.28400000&lt;br /&gt;
 C(PDBName=CB,ResName=GLU,ResNum=2)                   -1.91000000    1.39700000    3.92400000&lt;br /&gt;
 C(PDBName=CG,ResName=GLU,ResNum=2)                   -2.13300000    1.39700000    5.44800000&lt;br /&gt;
 C(PDBName=CD,ResName=GLU,ResNum=2)                   -2.58500000    2.78100000    5.84800000&lt;br /&gt;
 O(PDBName=OE1,ResName=GLU,ResNum=2)                  -2.81700000    2.97400000    7.06100000&lt;br /&gt;
 O(PDBName=OE2,ResName=GLU,ResNum=2)                  -2.69300000    3.62900000    4.93600000&lt;br /&gt;
 H(PDBName=H,ResName=GLU,ResNum=2)                    -2.02600000    0.00000000    1.38100000&lt;br /&gt;
 H(PDBName=HA,ResName=GLU,ResNum=2)                   -0.50300000   -0.28100000    3.98400000&lt;br /&gt;
 H(PDBName=HB2,ResName=GLU,ResNum=2)                  -2.86200000    1.65300000    3.41200000&lt;br /&gt;
 H(PDBName=HB3,ResName=GLU,ResNum=2)                  -1.13300000    2.14600000    3.66500000&lt;br /&gt;
 H(PDBName=HG2,ResName=GLU,ResNum=2)                  -1.17000000    1.19100000    5.96100000&lt;br /&gt;
 H(PDBName=HG3,ResName=GLU,ResNum=2)                  -2.94600000    0.68600000    5.70200000&lt;br /&gt;
 N(PDBName=N,ResName=ASP,ResNum=3)                    -2.15400000   -1.72300000    4.94400000&lt;br /&gt;
 C(PDBName=CA,ResName=ASP,ResNum=3)                   -3.04800000   -2.74000000    5.49100000&lt;br /&gt;
 C(PDBName=C,ResName=ASP,ResNum=3)                    -3.12000000   -2.66400000    6.99800000&lt;br /&gt;
 O(PDBName=O,ResName=ASP,ResNum=3)                    -2.12800000   -2.53500000    7.69600000&lt;br /&gt;
 C(PDBName=CB,ResName=ASP,ResNum=3)                   -2.52700000   -4.13900000    5.11500000&lt;br /&gt;
 C(PDBName=CG,ResName=ASP,ResNum=3)                   -3.47700000   -5.15800000    5.69800000&lt;br /&gt;
 O(PDBName=OD1,ResName=ASP,ResNum=3)                  -3.21300000   -6.36200000    5.49300000&lt;br /&gt;
 O(PDBName=OD2,ResName=ASP,ResNum=3)                  -4.45500000   -4.71900000    6.34100000&lt;br /&gt;
 H(PDBName=H,ResName=ASP,ResNum=3)                    -1.24000000   -1.54800000    5.36200000&lt;br /&gt;
 H(PDBName=HA,ResName=ASP,ResNum=3)                   -4.07200000   -2.58600000    5.09200000&lt;br /&gt;
 H(PDBName=HB2,ResName=ASP,ResNum=3)                  -1.53000000   -4.29800000    5.57700000&lt;br /&gt;
 H(PDBName=HB3,ResName=ASP,ResNum=3)                  -2.54000000   -4.25300000    4.01100000&lt;br /&gt;
 N(PDBName=N,ResName=NH2,ResNum=4)                    -4.37500000   -2.75200000    7.46000000&lt;br /&gt;
 H(PDBName=H1,ResName=NH2,ResNum=4)                   -5.14100000   -2.85800000    6.79500000&lt;br /&gt;
 H(PDBName=H2,ResName=NH2,ResNum=4)                   -4.53500000   -2.71200000    8.46700000&lt;br /&gt;
 &lt;br /&gt;
 1 2 1.0 4 1.0 5 1.0 6 1.0&lt;br /&gt;
 2 3 2.0 7 1.5&lt;br /&gt;
 3&lt;br /&gt;
 4&lt;br /&gt;
 5&lt;br /&gt;
 6&lt;br /&gt;
 7 16 1.0 8 1.0&lt;br /&gt;
 8 9 1.0 11 1.0 17 1.0&lt;br /&gt;
 9 10 2.0 22 1.5&lt;br /&gt;
 10&lt;br /&gt;
 11 18 1.0 12 1.0 19 1.0&lt;br /&gt;
 12 21 1.0 13 1.0 20 1.0&lt;br /&gt;
 13 14 2.0 15 2.0&lt;br /&gt;
 14&lt;br /&gt;
 15&lt;br /&gt;
 16&lt;br /&gt;
 17&lt;br /&gt;
 18&lt;br /&gt;
 19&lt;br /&gt;
 20&lt;br /&gt;
 21&lt;br /&gt;
 22 23 1.0 30 1.0&lt;br /&gt;
 23 26 1.0 24 1.0 31 1.0&lt;br /&gt;
 24 34 1.5 25 2.0&lt;br /&gt;
 25&lt;br /&gt;
 26 27 1.0 33 1.0 32 1.0&lt;br /&gt;
 27 28 2.0 29 2.0&lt;br /&gt;
 28&lt;br /&gt;
 29&lt;br /&gt;
 30&lt;br /&gt;
 31&lt;br /&gt;
 32&lt;br /&gt;
 33&lt;br /&gt;
 34 35 1.0 36 1.0&lt;br /&gt;
 35&lt;br /&gt;
 36&lt;br /&gt;
&lt;br /&gt;
The issue going from .xyz straight to .com is that ALL protein information would be lost ie. residue information that is written into the .pdb at .xyz to .pdb comversion.&lt;br /&gt;
&lt;br /&gt;
===PYP files as downloaded compared to run through TINKER===&lt;br /&gt;
&lt;br /&gt;
*TINKER removes much of the header information&lt;br /&gt;
*TINKER only preserves the 1st model of a .pdb file (may be multiple models)&lt;br /&gt;
*TINKER removes far right 3 columns of information (not essential to the .pdb file)&lt;br /&gt;
*A virtual residue in the original .pdb comprising the chromophore was omitted from the TINKER output (number 126)&lt;br /&gt;
&lt;br /&gt;
===What each TINKER search algorithm does===&lt;br /&gt;
&lt;br /&gt;
There are several programs within TINKER that may be able to help locate a global minimum (not just one near the published crystal structure) by way of searching the potential energy surface. All of these are MM methods for use in TINKER.&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
&#039;&#039;&#039;SCAN&#039;&#039;&#039;&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
A program for general conformational search of an entire potential energy surface via a basin hopping method. The program takes as input a TINKER .xyz coordinates file which is then minimized to find the first local minimum for a search list. A series of activations along various normal modes from this initial minimum are used as seed points for additional minimizations. Whenever a previously unknown local minimum is located it is added to the search list. When all minima on the search list have been subjected to the normal mode activation without locating additional new minima, the program terminates. The individual local minima are written to cycle files as they are discovered. While the SCAN program can be used on standard undeformed potential energy surfaces, we have found it to be most useful for quickly ‘‘scanning’’ a smoothed energy surface to enumerate the major basins of attraction spaning the entire surface.&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;SNIFFER&#039;&#039;&#039;&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
A program that implements the Sniffer global optimization algorithm of Butler and Slaminka, a discrete version of Griewank’s global search trajectory method. The program takes an input TINKER .xyz coordinates file and shakes it vigorously via a modified dynamics trajectory before, hopefully, settling into a low lying minimum. Some trial and error is often required as the current implementation is sensitive to various parameters and tolerances that govern the computation. At present, these parameters are not user accessible, and must be altered in the source code. However, this method can do a good job of quickly optimizing conformation within a limited range of convergence.&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;PSS&#039;&#039;&#039;&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
Implements our version of a potential smoothing and search algorithm for the global optimization of molecular conformation. An initial structure in .xyz format is first minimized in Cartesian coordinates on a series of increasingly smoothed potential energy surfaces. Then the smoothing procedure is reversed with minimization on each successive surface starting from the coordinates of the minimum on the previous surface. A local search procedure is used during the backtracking to explore for alternative minima better than the one found during the current minimization. The final result is usually a very low energy conformation or, in favorable cases, the global energy minimum conformation. The minimum energy coordinate sets found on each surface during both the forward smoothing and backtracking procedures are placed in sequentially numbered cycle files.&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
&lt;br /&gt;
TINKER also contains programs that are able to perform MM optimisations:&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
&#039;&#039;&#039;MINIMIZE&#039;&#039;&#039;&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;The MINIMIZE program performs a limited memory L-BFGS minimization of an input structure over Cartesian coordinates using a modified version of the algorithm of Jorge Nocedal. The method requires only the potential energy and gradient at each step along the minimization pathway. It requires storage space proportional to the number of atoms in the structure. The MINIMIZE procedure is recommended for preliminary minimization of trial structures to an rms gradient of 1.0 to 0.1 kcal/mole/Å. It has a relatively fast cycle time and is tolerant of poor initial structures, but converges in a slow, linear fashion near the minimum. The user supplies the name of the TINKER .xyz coordinates file and a target rms gradient value at which the minimization will terminate. Output consists of minimization statistics written to the screen or redirected to an output file, and the new coordinates written to updated .xyz files or to cycle files.&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;OPTIMIZE&#039;&#039;&#039;&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;The OPTIMIZE program performs a optimally conditioned variable metric minimization of an input structure over Cartesian coordinates using an algorithm due to William Davidon. The method does not perform line searches, but requires computation of energies and gradients as well as storage for an estimate of the inverse Hessian matrix. The program operates on Cartesian coordinates from a TINKER .xyz file. OPTIMIZE will typically converge somewhat faster and more completely than MINIMIZE. However, the need to store and manipulate a full inverse Hessian estimate limits its use to structures containing less than a few hundred atoms on workstation class machines. As with the other minimizers, OPTIMIZE needs input coordinates and an rms gradient cutoff criterion. The output coordinates are saved in updated .xyz files or as cycle files.&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;NEWTON&#039;&#039;&#039;&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt; A truncated Newton minimization method which requires potential energy, gradient and Hessian information. This procedure has significant advantages over standard Newton methods, and is able to minimize very large structures completely. Several options are provided with respect to minimization method and preconditioning of the Newton equations. The default options are recommended unless the user is familiar with the math involved. This program operates in Cartesian coordinate space and is fairly tolerant of poor input structures. Typical algorithm iteration times are longer than with nonlinear conjugate gradient or variable metric methods, but many fewer iterations are required for complete minimization. NEWTON is usually the best choice for minimizations to the 0.01 to 0.000001 kcal/mole/Å level of rms gradient convergence. Tests for directions of negative curvature can be removed, allowing NEWTON to be used for optimization to conformational transition state structures (this only works if the starting point is very close to the transition state). Input consists of a TINKER .xyz coordinates file; output is an updated set of minimized coordinates and minimization statistics.&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
&lt;br /&gt;
==Tues 17th Nov 2009==&lt;br /&gt;
&lt;br /&gt;
Comparing 2 .pdb files:&lt;br /&gt;
&lt;br /&gt;
*As downloaded from PDB&lt;br /&gt;
*Converted to .xyz in TINKER then back to .pdb&lt;br /&gt;
&lt;br /&gt;
These files were then cleaned up to compare in a terminal using the &amp;quot;diff&amp;quot; command:&lt;br /&gt;
&lt;br /&gt;
*Header information and end information removed&lt;br /&gt;
*3 columns to the right in PDB version were removed to become consistent with the TINEKR file&lt;br /&gt;
*The first column &amp;quot;ATOM&amp;quot; was removed effectively making the line numbers equal to the atom numbers in both files&lt;br /&gt;
&lt;br /&gt;
==Wed 18th Nov 2009==&lt;br /&gt;
&lt;br /&gt;
1st diff was on line 52, a HYDROGEN ATOM:&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
&#039;&#039;52  HE2 HIS A   3    13.349 -10.803   9.527&#039;&#039;&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
added to the TINKER output, not in PDB file.&lt;br /&gt;
&lt;br /&gt;
2nd diff was on line 657, a HYDROGEN ATOM:&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
&#039;&#039;657  HE2 GLU A  46	 -0.867  -5.216  -0.822&#039;&#039;&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
added to the PDB file, not in TINKER output&lt;br /&gt;
&lt;br /&gt;
3rd diff was on line 1007, a HYDROGEN ATOM (number 1008):&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
&#039;&#039;5HG  CYS A  69	 -8.909  -5.194   5.526&#039;&#039;&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
added to the TINKER output, not in PDB file.&lt;br /&gt;
&lt;br /&gt;
4th diff was  on line 1621, a HYDROGEN ATOM (number 1623):&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
&#039;&#039;HE2 HIS A 108	  4.663   7.459   6.879&#039;&#039;&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
added to the TINKER output, not in PDB file.&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Following these reconciliations, both files were exactly the same with 1911 atoms present in each. 3 atoms were removed from the TINKER file and 1 was removed from the PDB file.&lt;br /&gt;
&lt;br /&gt;
*NOTE - loading the original .pdb from PDB to Gaussview then converting to a .gjf results in an error &amp;quot;1 or more PDB atom names has a &#039; or * character which can cause problems for Gaussian&amp;quot; - this is because the PDB file has extra atoms on the end of the file to define a highlighted chromophore... After these were removed in a txt editor the .gjf save went smoothly. Infact after this removal the file will run an AMBEr single point energy calculation fine - cf running when the extra info is in the bottom.&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
&lt;br /&gt;
All differences in files were due to protonation states of atoms... not the AA backbone.&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
&lt;br /&gt;
===TINKER vs PDB===&lt;br /&gt;
[[Image:TINKER PROTON.png]]&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
[[Image:PDB_PROTON.png]]&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
The highlighted fragments represent where the respective file version is protonated and the other is not.&lt;br /&gt;
&lt;br /&gt;
==Fri 20th Nov 2009==&lt;br /&gt;
&lt;br /&gt;
In an attempt to load the 1W7S structure to TINKER and successfully convert it it a .xyz format the following procedure was undertaken:&lt;br /&gt;
&lt;br /&gt;
*&lt;br /&gt;
&lt;br /&gt;
==Mon 23rd Nov 2009==&lt;br /&gt;
&lt;br /&gt;
===Protonation of GFP at various pH conditions===&lt;br /&gt;
&lt;br /&gt;
With 230 amino acid residues, each possessing a unique pKa in GFP it is obvious that there is a huge spectrum of protonation states available at a single solvation pH. Since protons cannot be detected using X-ray diffraction, protonation states must be uncovered via a difference method. Several questions arise when thinking about protonation of the protein:&lt;br /&gt;
&lt;br /&gt;
*Which residues have the ability to be protonated/deprotonated&lt;br /&gt;
*Is the PDB structure protonated for a specific pH (Bio pH)&lt;br /&gt;
*How does the protonation of the protein change through a variety of solvation pHs&lt;br /&gt;
*Does the protonation state affect the chromophore photophysics (electronic embedding)&lt;br /&gt;
&lt;br /&gt;
Answering these questions:&lt;br /&gt;
&lt;br /&gt;
*Polar amino acids are those that can be protonated/deprotonated, these are identified by Gaussview and shown in the atom list of the molecule with the option of setting them to either protonated/deprotonated.&lt;br /&gt;
*Dont know if the current structure is protonated to any specific pH... this is unclear until protonate at certain pHs&lt;br /&gt;
*The protonation will be tested across a range of pHs to see how protonation changes&lt;br /&gt;
&lt;br /&gt;
==Tues 24th Nov 2009==&lt;br /&gt;
&lt;br /&gt;
The pKa value of a residue will indicate protonation state as a function of pH. If you know the pH of your solution then if the pKa of a given functional group is lower than this pH then the functional group will be unprotonated. If the pKa is higher than the pH then the functional group will be protonated. &#039;&#039;&#039;Beware that pKa values can be distorted by local conformational effects.&#039;&#039;&#039; The structure is protonated as though it were in pH 7 as it is loaded into Gaussview. Since hydrogens are defined in the PDB file it could be that the author added protons under these conditions.&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Histadine has 2 protonation points on the imidizole R chain:&lt;br /&gt;
&lt;br /&gt;
{|&lt;br /&gt;
| align=&amp;quot;center&amp;quot; style=&amp;quot;background:#f0f0f0;&amp;quot;|&#039;&#039;&#039;Yes-Yes&#039;&#039;&#039;&lt;br /&gt;
| align=&amp;quot;center&amp;quot; style=&amp;quot;background:#f0f0f0;&amp;quot;|&#039;&#039;&#039;Yes-No&#039;&#039;&#039;&lt;br /&gt;
| align=&amp;quot;center&amp;quot; style=&amp;quot;background:#f0f0f0;&amp;quot;|&#039;&#039;&#039;No-Yes&#039;&#039;&#039;&lt;br /&gt;
|-&lt;br /&gt;
|&amp;lt;jmol&amp;gt;&lt;br /&gt;
  &amp;lt;jmolApplet&amp;gt;&lt;br /&gt;
    &amp;lt;uploadedFileContents&amp;gt;Protein_yes_yes.mol&amp;lt;/uploadedFileContents&amp;gt;&lt;br /&gt;
    &amp;lt;color&amp;gt;#FFFFFF&amp;lt;/color&amp;gt;&lt;br /&gt;
    &amp;lt;size&amp;gt;400&amp;lt;/size&amp;gt;&lt;br /&gt;
    &amp;lt;text&amp;gt;Yes Yes&amp;lt;/text&amp;gt;&lt;br /&gt;
  &amp;lt;/jmolApplet&amp;gt;&lt;br /&gt;
&amp;lt;/jmol&amp;gt;||&amp;lt;jmol&amp;gt;&lt;br /&gt;
  &amp;lt;jmolApplet&amp;gt;&lt;br /&gt;
    &amp;lt;uploadedFileContents&amp;gt;Protein_yes_no.mol&amp;lt;/uploadedFileContents&amp;gt;&lt;br /&gt;
    &amp;lt;color&amp;gt;#FFFFFF&amp;lt;/color&amp;gt;&lt;br /&gt;
    &amp;lt;size&amp;gt;400&amp;lt;/size&amp;gt;&lt;br /&gt;
    &amp;lt;text&amp;gt;Yes Yes&amp;lt;/text&amp;gt;&lt;br /&gt;
  &amp;lt;/jmolApplet&amp;gt;&lt;br /&gt;
&amp;lt;/jmol&amp;gt;||&amp;lt;jmol&amp;gt;&lt;br /&gt;
  &amp;lt;jmolApplet&amp;gt;&lt;br /&gt;
    &amp;lt;uploadedFileContents&amp;gt;Protein_no_yes.mol&amp;lt;/uploadedFileContents&amp;gt;&lt;br /&gt;
    &amp;lt;color&amp;gt;#FFFFFF&amp;lt;/color&amp;gt;&lt;br /&gt;
    &amp;lt;size&amp;gt;400&amp;lt;/size&amp;gt;&lt;br /&gt;
    &amp;lt;text&amp;gt;Yes Yes&amp;lt;/text&amp;gt;&lt;br /&gt;
  &amp;lt;/jmolApplet&amp;gt;&lt;br /&gt;
&amp;lt;/jmol&amp;gt;&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
The imidazole sidechain of histidine has a pKa of approximately 6, and overall, the amino acid has a pKa of 7.6. This means that at physiologically relevant pH values, relatively small shifts in pH will change its average charge. Below a pH of 6, the imidazole ring is mostly protonated as described by the Henderson–Hasselbalch equation. When protonated, the imidazole ring bears two NH bonds and has a positive charge. The positive charge is equally distributed between both nitrogens and can be represented with two equally important resonance structures.&lt;br /&gt;
&lt;br /&gt;
===Protonation===&lt;br /&gt;
&lt;br /&gt;
At pH 7 the PDB file protonations are correct for all residues. Moving to pH 5 the histadine residues become protonated (pKa 6)- this is the only change. At more alkaline pHs it would take pH 9.1 to make any effect (deprotonating CYS and then TYR at 9.7)&lt;br /&gt;
&lt;br /&gt;
===Mini-Biomolecule===&lt;br /&gt;
&lt;br /&gt;
To test ONIOM calculations vs pure DFT methods and other calculation types on a bio system that is smaller than GFP (computationally less expensive, can streamline the real calcs)&lt;br /&gt;
&lt;br /&gt;
Trying a molecule analogous to the example given in the ONIOM tutorial...&lt;br /&gt;
&lt;br /&gt;
&amp;lt;jmol&amp;gt;&lt;br /&gt;
  &amp;lt;jmolApplet&amp;gt;&lt;br /&gt;
    &amp;lt;uploadedFileContents&amp;gt;Biomolecule1.mol&amp;lt;/uploadedFileContents&amp;gt;&lt;br /&gt;
    &amp;lt;color&amp;gt;#FFFFFF&amp;lt;/color&amp;gt;&lt;br /&gt;
    &amp;lt;size&amp;gt;400&amp;lt;/size&amp;gt;&lt;br /&gt;
    &amp;lt;text&amp;gt;Yes Yes&amp;lt;/text&amp;gt;&lt;br /&gt;
  &amp;lt;/jmolApplet&amp;gt;&lt;br /&gt;
&amp;lt;/jmol&amp;gt;&lt;br /&gt;
&lt;br /&gt;
==Thurs 26th Nov 2009==&lt;br /&gt;
&lt;br /&gt;
===Checking the GFP input will run===&lt;br /&gt;
&lt;br /&gt;
Procedure to get 1WS7 to run an AMBER energy calculation:&lt;br /&gt;
&lt;br /&gt;
*Load the PDB file without water&lt;br /&gt;
*Try opt amber but will fail due to missing parameters (all in the chromophore)&lt;br /&gt;
*Set these parameters to 0 since they cancel in ONIOM&lt;br /&gt;
*Freeze the atotms involving missing paramters&lt;br /&gt;
*Energy AMBER will now run for the file - NO CHARGES - 9.07881632au&lt;br /&gt;
*Open the Atom Editor, Edit, MM charges, Assign all atoms&lt;br /&gt;
*Manually edit any missing charges&lt;br /&gt;
*Energy AMBER will now run including charges (basic AMBER94 charges) - 0.260391165au&lt;br /&gt;
&lt;br /&gt;
Charges in this file did not sum to an integer number - the only residue with a non integer charge was no.64 with a charge of 0.61160000&lt;br /&gt;
&lt;br /&gt;
 MM sanity checks:&lt;br /&gt;
 All charges sum to:               -5.38840000&lt;br /&gt;
 Charges of atoms sum to:          -5.38840000&lt;br /&gt;
 Layer 1 (S) has     34 atoms and charge   0.61160000 sum=   0.61160000&lt;br /&gt;
 Layer 2 (R) adds  3574 atoms and charge  -6.00000000 sum=  -5.38840000&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
 WARNING -- The following centers have non-zero partial&lt;br /&gt;
 charges, but zero Vanderwaals parameters. These centers&lt;br /&gt;
 can collapse onto other centers with non-zero charge&lt;br /&gt;
 Center    Charge&lt;br /&gt;
   1833  0.423900&lt;br /&gt;
   1890  0.410200&lt;br /&gt;
   2029  0.427500&lt;br /&gt;
   2043  0.427500&lt;br /&gt;
   2080  0.410200&lt;br /&gt;
   2089  0.399200&lt;br /&gt;
   2123  0.410200&lt;br /&gt;
   2179  0.410200&lt;br /&gt;
   2186  0.410200&lt;br /&gt;
   2260  0.410200&lt;br /&gt;
   2287  0.410200&lt;br /&gt;
   2294  0.410200&lt;br /&gt;
   2347  0.427500&lt;br /&gt;
   2369  0.399200&lt;br /&gt;
   2469  0.427500&lt;br /&gt;
   2508  0.399200&lt;br /&gt;
   2551  0.410200&lt;br /&gt;
   2619  0.399200&lt;br /&gt;
   2639  0.410200&lt;br /&gt;
   2714  0.410200&lt;br /&gt;
   2924  0.399200&lt;br /&gt;
   2939  0.399200&lt;br /&gt;
   2950  0.427500&lt;br /&gt;
   2981  0.399200&lt;br /&gt;
   3180  0.427500&lt;br /&gt;
   3233  0.399200&lt;br /&gt;
   3262  0.410200&lt;br /&gt;
   3361  0.399200&lt;br /&gt;
   3377  0.427500&lt;br /&gt;
   3384  0.410200&lt;br /&gt;
   3397  0.427500&lt;br /&gt;
   3418  0.427500&lt;br /&gt;
   3562  0.410200&lt;br /&gt;
   3593  0.410200&lt;br /&gt;
   3608  0.410200&lt;br /&gt;
&lt;br /&gt;
* oniom(pm6:amber=hardfirst) geom=connectivity was run successfully, resulting in an energy of 0.210085730788au&lt;br /&gt;
&lt;br /&gt;
This still resulted in some charge problems:&lt;br /&gt;
&lt;br /&gt;
 MM sanity checks:&lt;br /&gt;
 All charges sum to:               &#039;&#039;&#039;-5.38840000&#039;&#039;&#039;&lt;br /&gt;
 Charges of atoms sum to:           &#039;&#039;&#039;0.79320000&#039;&#039;&#039;&lt;br /&gt;
 Residue  63 PDB Number    &#039;&#039;&#039;64_A PHE has&#039;&#039;&#039;     1 atoms, and a charge of   0.59730000&lt;br /&gt;
 Residue  64 PDB Number    &#039;&#039;&#039;66_A CSY has&#039;&#039;&#039;    34 atoms, and a charge of   0.61160000&lt;br /&gt;
 Residue  65 PDB Number    &#039;&#039;&#039;68_A VAL has&#039;&#039;&#039;     1 atoms, and a charge of  -0.41570000&lt;br /&gt;
 WARNING -- The following centers have non-zero partial&lt;br /&gt;
 charges, but zero Vanderwaals parameters. These centers&lt;br /&gt;
 can collapse onto other centers with non-zero charge&lt;br /&gt;
 Center    Charge&lt;br /&gt;
   &#039;&#039;&#039;3608  0.410200&#039;&#039;&#039;&lt;br /&gt;
 MMInit generated parameter data with length LenPar=       40470.&lt;br /&gt;
 Standard basis: Dummy (5D, 7F)&lt;br /&gt;
 There are    36 symmetry adapted basis functions of A   symmetry.&lt;br /&gt;
 Integral buffers will be    262144 words long.&lt;br /&gt;
 Raffenetti 1 integral format.&lt;br /&gt;
 Two-electron integral symmetry is turned on.&lt;br /&gt;
    36 basis functions,    36 primitive gaussians,    36 cartesian basis functions&lt;br /&gt;
    27 alpha electrons       27 beta electrons&lt;br /&gt;
       nuclear repulsion energy       200.9124124811 Hartrees.&lt;br /&gt;
 NAtoms= 3608 NActive=   36 NUniq=   36 SFac= 7.50D-01 NAtFMM=   80 NAOKFM=F Big=F&lt;br /&gt;
 AMBER calculation of energy.&lt;br /&gt;
 Energy=    0.110474559     NIter=   0.&lt;br /&gt;
 Dipole moment=       3.390901       5.889448      -2.505476&lt;br /&gt;
 ONIOM: saving gridpoint  1&lt;br /&gt;
 ONIOM: restoring gridpoint  3&lt;br /&gt;
 ONIOM: calculating energy.&lt;br /&gt;
 ONIOM: gridpoint  1 method:  low   system:  model energy:     0.110474559177&lt;br /&gt;
 ONIOM: gridpoint  2 method:  high  system:  model energy:     0.060169124949&lt;br /&gt;
 ONIOM: gridpoint  3 method:  low   system:  real  energy:     0.260391165016&lt;br /&gt;
 ONIOM: extrapolated energy =       0.210085730788&lt;br /&gt;
 ONIOM: calculating electric field derivatives.&lt;br /&gt;
 ONIOM: Integrating ONIOM file  4 number   619&lt;br /&gt;
 ONIOM: Dipole        =-2.76174164D+01-1.34762322D+02 3.99239712D+01&lt;br /&gt;
 ONIOM: Dipole moment (Debye):&lt;br /&gt;
    X=   -70.1965    Y=  -342.5316    Z=   101.4766  Tot=   364.0782&lt;br /&gt;
 ONIOM: Integrating ONIOM file  5 number   695&lt;br /&gt;
&lt;br /&gt;
An ONIOM calculation was run using a DFT method on the high level and AMBER on the low level:  &#039;&#039;oniom(b3lyp/3-21g:amber=hardfirst) geom=connectivity&#039;&#039; which ran successfully...&lt;br /&gt;
&lt;br /&gt;
===Charges===&lt;br /&gt;
&lt;br /&gt;
Charges in the protein are important for the calculation that will be run on GFP eventually since they provide the electrostatic field or &amp;quot;atmosphere&amp;quot; which surrounds the chromophore, polarizing the electron density used by the DFT method to evaluate the chromophore. Gaussian offers 2 types of influence which the MM portion can exert on the QM part:&lt;br /&gt;
&lt;br /&gt;
* Mechanical Embedding&lt;br /&gt;
* Electronic Embedding&lt;br /&gt;
&lt;br /&gt;
Mechanical embedding does not use any electronic components to affect the central high region. Electronic embedding does use the charge generated in the MM region to polarize the high region.&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
At pH 7 the GFP molecule is in a 6- charge state according to assigning AMBER charges and summing the residue charges (available in the out file). 3 options were considered at this stage for treating charges:&lt;br /&gt;
&lt;br /&gt;
* Ignore charges on the entire molecule&lt;br /&gt;
* Use AMBER94 assigned partial charges on the low region&lt;br /&gt;
* Use Qeq to assign charges based on an overall charge of -6&lt;br /&gt;
&lt;br /&gt;
The first option can be discarded since the electronic atmosphere is of particular interest here. The second 2 options are to be investigated. See below (on a small Biomolecule)&lt;br /&gt;
&lt;br /&gt;
ON a 4 residue chain (in mini biomolecule folder) - As in the GFP tutorial on the wiki, a energy calculation was submitted with the keyword amber=qeq to get back the charges across the molecule. (not entirely sure what this does yet). AS a test of the effect of charges on a molecule a test molecule was compiled in TINKER consisting of 4 residues with 0 overall charges....&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;jmol&amp;gt;&lt;br /&gt;
  &amp;lt;jmolAppletButton&amp;gt;&lt;br /&gt;
    &amp;lt;uploadedFileContents&amp;gt;Protein1_AMBER_OPT_qeq.mol&amp;lt;/uploadedFileContents&amp;gt;&lt;br /&gt;
    &amp;lt;text&amp;gt;Test Protien&amp;lt;/text&amp;gt;&lt;br /&gt;
    &amp;lt;color&amp;gt;#FFFFFF&amp;lt;/color&amp;gt;&lt;br /&gt;
    &amp;lt;size&amp;gt;600&amp;lt;/size&amp;gt;&lt;br /&gt;
    &amp;lt;/jmolAppletButton&amp;gt;&lt;br /&gt;
&amp;lt;/jmol&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
&lt;br /&gt;
{|&lt;br /&gt;
|No Charges|| 0.01940311au&lt;br /&gt;
|-&lt;br /&gt;
|Amber Charges|| -0.18117151au&lt;br /&gt;
|-&lt;br /&gt;
|Qeq Charges|| -0.23038199au&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
Clearly the Qeq shows the lowest energy - the considerable differences in energy show that charges are an important part of the input.&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
&lt;br /&gt;
===Testing Charges and EE on a small system===&lt;br /&gt;
&lt;br /&gt;
&amp;lt;jmol&amp;gt;&lt;br /&gt;
  &amp;lt;jmolAppletButton&amp;gt;&lt;br /&gt;
    &amp;lt;uploadedFileContents&amp;gt;Chromopre_molecule_B3LYP_6-31G(d).mol&amp;lt;/uploadedFileContents&amp;gt;&lt;br /&gt;
    &amp;lt;script&amp;gt;spin on&amp;lt;/script&amp;gt;&lt;br /&gt;
    &amp;lt;text&amp;gt;Test Protien&amp;lt;/text&amp;gt;&lt;br /&gt;
    &amp;lt;color&amp;gt;#FFFFFF&amp;lt;/color&amp;gt;&lt;br /&gt;
    &amp;lt;size&amp;gt;600&amp;lt;/size&amp;gt;&lt;br /&gt;
    &amp;lt;/jmolAppletButton&amp;gt;&lt;br /&gt;
&amp;lt;/jmol&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Energies of the optimisations using various charge approximations and with/without EE&lt;br /&gt;
{|class=&amp;quot;wikitable&amp;quot; border=&amp;quot;1&amp;quot;&lt;br /&gt;
!Method||Mechanical Embedding||Electronic Embedding||Computational Time (mechanical embedding)||Computational Time (electronic embedding)&lt;br /&gt;
|-&lt;br /&gt;
|B3LYP 6-31G*||-1406.04||N/A||11 hours 18 minutes 44.7 seconds||11 hours 18 minutes 44.7 seconds&lt;br /&gt;
|-&lt;br /&gt;
|ONIOM B3LYP 6-31G*:AMBER (No Charges)||-783.2225||-783.2223||2 hours 13 minutes  4.1 seconds||&lt;br /&gt;
|-&lt;br /&gt;
|ONIOM B3LYP 6-31G*:AMBER (Amber Charges)|| || ||  ||&lt;br /&gt;
|-&lt;br /&gt;
|ONIOM B3LYP 6-31G*:AMBER (Qeq Charges)||-783.2467|| ||2 hours  6 minutes  8.6 seconds ||&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
NB. &lt;br /&gt;
* B3LYP 6-31G* and ONIOM B3LYP 6-31G*:AMBER (No Charges) were from the same starting geometry.&lt;br /&gt;
* ONIOM B3LYP 6-31G*:AMBER (Amber Charges) and ONIOM B3LYP 6-31G*:AMBER (Qeq Charges) were from the same starting geometry.&lt;br /&gt;
* All Electronic embedding is from the same starting point - Can compare times.&lt;br /&gt;
&lt;br /&gt;
Dipole moments using various charge approximations and with/without EE&lt;br /&gt;
{|class=&amp;quot;wikitable&amp;quot; border=&amp;quot;1&amp;quot;&lt;br /&gt;
!Method||Dipole moment (No EE)||Dipole moment (With EE)&lt;br /&gt;
|-&lt;br /&gt;
|B3LYP 6-31G*||7.4914||N/A&lt;br /&gt;
|-&lt;br /&gt;
|ONIOM B3LYP 6-31G*:AMBER (No Charges)||3.2328||3.2241&lt;br /&gt;
|-&lt;br /&gt;
|ONIOM B3LYP 6-31G*:AMBER (Amber Charges)||5.2794||5.2865&lt;br /&gt;
|-&lt;br /&gt;
|ONIOM B3LYP 6-31G*:AMBER (Qeq Charges)||8.173||8.1654&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
==2nd Dec 2009==&lt;br /&gt;
&lt;br /&gt;
A single point energy of 1w7s was run with the additional qeq keyword to return partial charges (overall charge set to -6)&lt;br /&gt;
This new molecule was submitted for AMBER opt but failed after around 700 iterations... this could be due to the error:&lt;br /&gt;
&lt;br /&gt;
 WARNING -- The following centers have non-zero partial&lt;br /&gt;
 charges, but zero Vanderwaals parameters. These centers&lt;br /&gt;
 can collapse onto other centers with non-zero charge&lt;br /&gt;
&lt;br /&gt;
When the atoms defined in this error had their partial charges set to 0.00, the result was a fully converged AMBER optimisation:&lt;br /&gt;
&lt;br /&gt;
*Frozen chromophore.&lt;br /&gt;
*pH 7 giving a charge of -6&lt;br /&gt;
*Charges assigned using Qeq method&lt;br /&gt;
*Zero Van der Waals parameter atoms partial charges set to 0.00&lt;br /&gt;
&lt;br /&gt;
Looking at what atoms are not defined in terms of Van der Waals paramters it is ALWAYS an aloholic proton that is not parameterised...&lt;br /&gt;
&lt;br /&gt;
Tutorial used AMBER charges then ran Qeq for the missing charges... i think this is what she thought she did... gaussian &amp;quot;amber=qeq&amp;quot; will overwrite all charges.. need to use &amp;quot;amber=uncharged&amp;quot;&lt;br /&gt;
so that we dont overwrite the amber charges...&lt;br /&gt;
&lt;br /&gt;
In effect full Qeq charges were assigned in the tutorial. Also overall charge is 0 - will check this.&lt;br /&gt;
&lt;br /&gt;
===The GFP tutorial===&lt;br /&gt;
&lt;br /&gt;
*AMBER charges (AMBER94 charges) were assigned to the GFP molecule in the tutorial&lt;br /&gt;
*she notices correctly that some are missing in the chromophore and so decides to assign them via Qeq&lt;br /&gt;
*Runs Qeq on the GFP molecule with a &#039;&#039;&#039;0 charge&#039;&#039;&#039; which is wrong! When run the AMBER charges and look at the log file we can see that the overall charge should be &#039;&#039;&#039;+4&#039;&#039;&#039; for that protonation state.&lt;br /&gt;
*Since Qeq is an equilibrium method, partial charges are solved with the constraint that their sum must equal the overall charge.&lt;br /&gt;
*These Qeq charges should therefore not fit with an integer sum when pasted into the AMBER charges.&lt;br /&gt;
&lt;br /&gt;
===Qeq versus AMBER94 charges===&lt;br /&gt;
&lt;br /&gt;
Ogawa et al. state:&lt;br /&gt;
&lt;br /&gt;
 Geometry optimizations of a crambin protein were performed using force fields combined with the&lt;br /&gt;
 CQEq. The CQEq charges were found to be more comparable to the Mulliken charges obtained by the ab initio molecular orbital&lt;br /&gt;
 calculation than the fixed charges used in the AMBER force field. Furthermore, the CQEq charges give more realistic charge redistribution&lt;br /&gt;
 between amino acids in the crambin.&lt;br /&gt;
&lt;br /&gt;
Mini Biomolecule calculations:&lt;br /&gt;
&lt;br /&gt;
{|class=&amp;quot;wikitable&amp;quot; border=&amp;quot;1&amp;quot;&lt;br /&gt;
!Method||Dipole moment (No EE)||Dipole moment (With EE)&lt;br /&gt;
|-&lt;br /&gt;
|B3LYP 6-31G*||7.4914||N/A&lt;br /&gt;
|-&lt;br /&gt;
|ONIOM B3LYP 6-31G*:AMBER (No Charges)||3.2328||3.2241&lt;br /&gt;
|-&lt;br /&gt;
|ONIOM B3LYP 6-31G*:AMBER (Amber Charges)||5.2794||5.2865&lt;br /&gt;
|-&lt;br /&gt;
|ONIOM B3LYP 6-31G*:AMBER (Qeq Charges)||8.173||8.1654&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
Clearly the Qeq charges are in better agreement with the DFT method than no or AMBER charges.&lt;br /&gt;
&lt;br /&gt;
These frequencies are calulated using mechanical embedding!&lt;br /&gt;
&lt;br /&gt;
{|class=&amp;quot;wikitable&amp;quot; border=&amp;quot;1&amp;quot;&lt;br /&gt;
!Method||C=O||C=C&lt;br /&gt;
|-&lt;br /&gt;
|B3LYP 6-31G*||1799.65||1701.86&lt;br /&gt;
|-&lt;br /&gt;
|ONIOM B3LYP 6-31G*:AMBER (No Charges)||1802.25||1703.63&lt;br /&gt;
|-&lt;br /&gt;
|ONIOM B3LYP 6-31G*:AMBER (Amber Charges)||1803.37||1701.67&lt;br /&gt;
|-&lt;br /&gt;
|ONIOM B3LYP 6-31G*:AMBER (Qeq Charges)||1800.18||1702.86&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
As a result it seems obvious that a Qeq charge assignment in constraint by the correct total charge gives results most comparable to the pure QM method (both dipoles and frequencies)&lt;br /&gt;
&lt;br /&gt;
==3rd Dec 2009==&lt;br /&gt;
===ESP Charges===&lt;br /&gt;
&lt;br /&gt;
ESP calculates the expectation values of the electrostatic potential of a molecule on a uniform distribution of points. The resultant ESP surface is then fit to atom centered charges that best reproduce the electron distribution in a least-squares method. The set of points defining the default surface is generated according to the algorithm of Connolly.&lt;br /&gt;
&lt;br /&gt;
ESP charges thus require a QM calculation to create the molecular electrostatic potential used to fit atom centered charges. The size of GFP means that this is not an efficient method for use in this case?&lt;br /&gt;
&lt;br /&gt;
===Qeq Charges===&lt;br /&gt;
&lt;br /&gt;
The charge equilibration (QEq) method treats the atomic charge variation through the equilibration of electronegativities at atomic sites. The advantage of the QEq method is that the polarization effect in MM is taken into account by variation of the charge distribution in the molecules. The atomic charge variation relates to the intramolecular geometrical changes.&lt;br /&gt;
&lt;br /&gt;
===AMBER charges===&lt;br /&gt;
&lt;br /&gt;
==8th Dec 2009==&lt;br /&gt;
&lt;br /&gt;
A input creation page is now avaliable:&lt;br /&gt;
&lt;br /&gt;
https://www.ch.ic.ac.uk/wiki/index.php/AndyForesterGFP&lt;br /&gt;
&lt;br /&gt;
The final input file had protein information pasted in using text to columns in excel and now displays PDB residue information and PDB secondary structure information (nice cartoon pictures in ChemBio3D!)&lt;br /&gt;
&lt;br /&gt;
Interesting is a paper that uses TD-DFT on GFP (in my papers folder on the mac), the group uses a 3 stage minimisation method from the 1GFL PDB structure:&lt;br /&gt;
&lt;br /&gt;
* Freeze all atoms in the backbone of the protein and optimise after adding hydrogens&lt;br /&gt;
* Optimise the chromophore at a semi-empirical level&lt;br /&gt;
* They cut out a chromophore section but i could then perform a full QM/MM geometry optimisation&lt;br /&gt;
&lt;br /&gt;
===A more charged mini molecule to test electronic embedding===&lt;br /&gt;
&lt;br /&gt;
&amp;lt;jmol&amp;gt;&lt;br /&gt;
  &amp;lt;jmolAppletButton&amp;gt;&lt;br /&gt;
    &amp;lt;uploadedFileContents&amp;gt;More_charge_B3LYP_6_31G(d).mol&amp;lt;/uploadedFileContents&amp;gt;&lt;br /&gt;
    &amp;lt;text&amp;gt;The molecule with 2+ charge&amp;lt;/text&amp;gt;&lt;br /&gt;
    &amp;lt;color&amp;gt;#FFFFFF&amp;lt;/color&amp;gt;&lt;br /&gt;
    &amp;lt;size&amp;gt;600&amp;lt;/size&amp;gt;&lt;br /&gt;
    &amp;lt;/jmolAppletButton&amp;gt;&lt;br /&gt;
&amp;lt;/jmol&amp;gt;&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
&lt;br /&gt;
The molecule was submitted with 3 different job lines:&lt;br /&gt;
&lt;br /&gt;
* b3lyp 6-31g(d)&lt;br /&gt;
* oniom(b3lyp 6-31g(d):amber=(hardfirst,qeq))&lt;br /&gt;
* oniom(b3lyp 6-31g(d):amber=(hardfirst,qeq))=embedcharge&lt;br /&gt;
&lt;br /&gt;
Opt and Freq jobs were completed successfully. The charge breakdown input was:&lt;br /&gt;
&lt;br /&gt;
* Low Real 2 1&lt;br /&gt;
* High Model 0 1&lt;br /&gt;
* Low Model 0 1&lt;br /&gt;
&lt;br /&gt;
The aim of the calculations was to test electronic embedding versus the mechanical embedding. A paper suggests that there is a large difference in accuracy between the two methods. Qeq charge equilibration was used in both cases as it has already been proved to be much more effective than AMBER assigned charges. &#039;&#039;&#039;It has consistently overestimated the dipole moment though&#039;&#039;&#039;.&lt;br /&gt;
&lt;br /&gt;
{|class=&amp;quot;wikitable&amp;quot; border=&amp;quot;1&amp;quot;&lt;br /&gt;
!Method||Dipole moment&lt;br /&gt;
|-&lt;br /&gt;
|B3LYP 6-31G*||8.02&lt;br /&gt;
|-&lt;br /&gt;
|ONIOM B3LYP 6-31G*:AMBER (ME)||12.22&lt;br /&gt;
|-&lt;br /&gt;
|ONIOM B3LYP 6-31G*:AMBER (EE)||12.23&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
These results are odd since we expect the EE to have some effect on the charge distribution in the molecule (it is polarising the QM region) comparing the frequencies as a further check:&lt;br /&gt;
&lt;br /&gt;
{|class=&amp;quot;wikitable&amp;quot; border=&amp;quot;1&amp;quot;&lt;br /&gt;
!Method||C=O||C=C&lt;br /&gt;
|-&lt;br /&gt;
|B3LYP 6-31G*||1764||1636&lt;br /&gt;
|-&lt;br /&gt;
|ONIOM B3LYP 6-31G*:AMBER (ME)||1797||1697&lt;br /&gt;
|-&lt;br /&gt;
|ONIOM B3LYP 6-31G*:AMBER (EE)||1797||1697&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
Again the EE has made no difference to the frequency of various strong vibrations in the molecule. The conclusion is thus either:&lt;br /&gt;
&lt;br /&gt;
* EE has no real effect on the moleucule&lt;br /&gt;
* EE is not working properly&lt;br /&gt;
&lt;br /&gt;
Assuming the latter further investigation discovered that the meechanism of EE is determined using scaling parameters for MM charges during electronic embedding in the QM calculations. As a default the charges are scaled 555500 using the nomenclature ScaleCharge=ijklmn where n is closest to the QM region (massively anti intuitive). The input is read left to right and not all scaling factors need be defined. Further to this, the value given is then multiplied by 0.2 to arrive at the actual scaling coefficient. The result is thus for an atom:&lt;br /&gt;
&lt;br /&gt;
* 1 bond from the QM region (n) = 0.2n           Default is 0&lt;br /&gt;
* 2 bonds from the QM region (m) = 0.2m       Default is 0&lt;br /&gt;
* 3 bonds from the QM region (l) = 0.2l            Default is 5&lt;br /&gt;
* 4 bonds from the QM region (k) = 0.2k          Default is 5&lt;br /&gt;
* 5 bonds from the QM region (j) = 0.2j            Default is 5&lt;br /&gt;
* 6 bonds from the QM region (i) = 0.2i            Default is 5&lt;br /&gt;
&lt;br /&gt;
So all the atoms (and partial charges) in the MM region in the test molecule were being scaled to 0 when performing the EE. This explains the fact that no difference was uncovered in the previous calculation set.&lt;br /&gt;
&lt;br /&gt;
A new calculation was submitted with the command scalecharge=555 so that all charges in the MM part will be scaled to 1 (0.2xn,m,l)&lt;br /&gt;
&lt;br /&gt;
===Charges...again===&lt;br /&gt;
&lt;br /&gt;
So since i deleted the charges on the atoms where they can collapse, the overall charge is now -19.6 from the original -6 minus the 13.6 in charge deleted. REALLY ANNOYING! This is a huge difference in charges so will have to be solved. One easy option would be to set the involved atoms to 0 then run QEQ on the atoms with unassigned charges... essentially distributing the -6 charge through all atoms except those with 0 VdW parameters.&lt;br /&gt;
&lt;br /&gt;
==Tues 15th Dec 2009==&lt;br /&gt;
&lt;br /&gt;
The more charged molecule failed once the charges were scaled to 1 since SCF convergence was not attained.&lt;br /&gt;
&lt;br /&gt;
==Mon 25th Jan 2010==&lt;br /&gt;
&lt;br /&gt;
TAO tool publication was very good read... has a good suggestion of an optimisation procedure:&lt;br /&gt;
&lt;br /&gt;
*Take the crystal structure&lt;br /&gt;
*Add Hydrogens&lt;br /&gt;
*Determine protonation&lt;br /&gt;
*MM geometry optimisation&lt;br /&gt;
*(MD?)&lt;br /&gt;
*(Look at several samples of the MD)&lt;br /&gt;
*Optimise a final structure&lt;br /&gt;
&lt;br /&gt;
==Thurs 28th Jan 2010==&lt;br /&gt;
&lt;br /&gt;
Tao Tools requires preprocess within AMBER tools to run the scripts correctly. Does not have HIS on the residue database... Amber defines the protonation state within its residue naming scheme... HIS is either HIE, HIP, HID depending on its protonation state. [http://www.poissonboltzmann.org/pdb2pqr/documentation/programmers-guide/naming-scheme]&lt;br /&gt;
&lt;br /&gt;
==Thurs 4th Jan 2010==&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;Preprocessing&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
To get TAO tools and also for completeness it is necessary to preprocess the PDB file before attempting to run ONIOM calculations, this includes:&lt;br /&gt;
&lt;br /&gt;
* Adding hydrogens&lt;br /&gt;
* Create a custom residue file for the chromophore&lt;br /&gt;
* Convert from PDB to gjf ONIOM input&lt;br /&gt;
&lt;br /&gt;
Ambertools are extremely helpful with this preprocessing especially since amber will be the low level theory used throughout this project. Within ambertools are a number of useful programs:&lt;br /&gt;
&lt;br /&gt;
* antechamber - has a parameterlookup function within amber and GAFF parameter library files.&lt;br /&gt;
* xLeap - builds the protein from its constituent residues - adds hydrogens according to library files - allows a parm file to be built to add custom residues.&lt;br /&gt;
&lt;br /&gt;
I will be working on using these 2 programs to create a full MM input file which is fully defined and correct.&lt;br /&gt;
&lt;br /&gt;
===xLeap===&lt;br /&gt;
&lt;br /&gt;
To load xLeap we must use X11 to get the graphical interface and the ssh -Y command to allow tunneling back to X11. Also the command line to start xLeap should be:  &#039;&#039;xleap -s -f /apps/ambertools/amber11/dat/leap/cmd/leaprc.ff03&#039;&#039;. This loads the ff03 parameter file and results in the start up of the graphical interface with:&lt;br /&gt;
&lt;br /&gt;
 Welcome to LEaP!&lt;br /&gt;
 Sourcing leaprc: /amber/dat/leap/cmd/leaprc&lt;br /&gt;
 Log file: ./leap.log&lt;br /&gt;
 Loading parameters: /amber/dat/leap/parm/parmME.dat&lt;br /&gt;
 Loading library: /amber/dat/leap/lib/all_nucleic94.lib&lt;br /&gt;
 Loading library: /amber/dat/leap/lib/all_aminoME.lib&lt;br /&gt;
 Loading library: /amber/dat/leap/lib/all_aminoctME.lib&lt;br /&gt;
 Loading library: /amber/dat/leap/lib/all_aminontME.lib&lt;br /&gt;
 Loading library: /amber/dat/leap/lib/ions94.lib&lt;br /&gt;
 Loading library: /amber/dat/leap/lib/water.lib&lt;br /&gt;
&lt;br /&gt;
This shows that the residue libraries have been loaded. We are then free to load a pdb file which will be read by xLeap and built according to the residues! In addition to the residue names xleap also requires information about the chain connectivity. Residues can be connected to other residues from both sides, only one side or not at all. xleap is fairly simple in that it expects that all residues in the pdb file are connected in the order in which they are listed unless they are separated by a &amp;quot;TER&amp;quot; card. In most cases where a single protein chains is being investigated like here - the TER card should end the residue information.&lt;br /&gt;
&lt;br /&gt;
To load a pdb use (ignore the &amp;lt;&amp;gt;) &#039;&#039;&amp;lt;name for use in xLeap&amp;gt;=loadpdb &amp;quot;&amp;lt;name of file&amp;gt;&amp;quot;&#039;&#039;. Make sure that the file is in your PATH else fully define the path in the filename.&lt;br /&gt;
&lt;br /&gt;
The pdb should then be read into xLeap and built. At the end of the output we get (I loaded ChainA of the 1W7S without any water or hydrogens):&lt;br /&gt;
&lt;br /&gt;
 Leap added 1765 missing atoms according to residue templates:&lt;br /&gt;
       1 Heavy&lt;br /&gt;
       1764 H / lone pairs&lt;br /&gt;
 The file contained 21 atoms not in residue templates&lt;br /&gt;
&lt;br /&gt;
This shows that Leap added hydrogens automatically and also a missing &amp;quot;heavy&amp;quot; atom (one which is not a H or lp):&lt;br /&gt;
&lt;br /&gt;
 Added missing heavy atom: .R&amp;lt;CHIE 229&amp;gt;.A&amp;lt;OXT 18&amp;gt;&lt;br /&gt;
&lt;br /&gt;
For amino acids, the traditional order of atom records within a residue is N, CA, C, O, followed by the sidechain atoms (CB, CG1, CG2 . . . ) in order first of increasing remoteness, and then branch. The extra oxygen at the carboxyl terminal is called &amp;quot; OXT&amp;quot;, and appears after all sidechain atoms.&lt;br /&gt;
&lt;br /&gt;
Now that we have established that there are 21 atoms that are in the non standard residue region we need to define a custom residue which describes this region (chromophore). This involves creating a prep file....&lt;br /&gt;
&lt;br /&gt;
I first drew up the undefined section of GFP (chromophore) in Gaussview5 and saved as a.pdb then:&lt;br /&gt;
&lt;br /&gt;
* Load in antechamber and convert to mol2 format using &#039;&#039;antechamber -i &amp;lt;input&amp;gt; -fi pdb -o &amp;lt;output&amp;gt; -fo mol2 -c bcc&#039;&#039;&lt;br /&gt;
* Use parmchk to make sure the molecule was defined in the GAFF force field - kicks out a .frcmod file with missing parameters (including dihedrals) - nice :D!&lt;br /&gt;
* Manually add the &amp;quot;1CSY&amp;quot; where a standalone &amp;quot;0&amp;quot; is in the mol2 file so Leap can read it&lt;br /&gt;
* Load the gaff parameter file to Leap using &#039;&#039;source leaprc.gaff&#039;&#039;&lt;br /&gt;
* Load the chromophore_CSY_full.mol2 to Leap using &#039;&#039;CSY=loadmol2 &amp;quot;ChromophoreGFP_CSY_full.mol2&amp;quot;&#039;&#039;&lt;br /&gt;
* Check the residue using &#039;&#039;check CSY&#039;&#039; - Notice missing parameters are there.&lt;br /&gt;
* Load the ChromophoreGFP_CSY_full.frcmod file created by parmchk using &#039;&#039;loadamberparams ChromophoreGFP_CSY_full.frcmod&#039;&#039;&lt;br /&gt;
* Now check the CSY residue again using &#039;&#039;check CSY&#039;&#039; - No Problems! woo hoo! Now I can create amber parameters :D&lt;br /&gt;
&lt;br /&gt;
==Tues 9th Feb 2010==&lt;br /&gt;
&lt;br /&gt;
===RESP Fitting===&lt;br /&gt;
&lt;br /&gt;
To get charges that are like those of the standard amber force field an RESP fitting method has to be used. This involves 3 steps:&lt;br /&gt;
&lt;br /&gt;
* First, the molecule studied is optimized to determine a stable minimum (using Gaussian).&lt;br /&gt;
* Then, this minimized structure is used to calculate a Molecular Electrostatic Potential (MEP) on a three-dimensional grid (using Gaussian).&lt;br /&gt;
* Finally, this grid is exported into the &amp;quot;RESP program&amp;quot; which is used to fit atom-centered charges to the MEP&lt;br /&gt;
&lt;br /&gt;
To assist in this procedure - I am using a program called RED - [http://q4md-forcefieldtools.org/RED/] which can be used in all 3 stages...&lt;br /&gt;
&lt;br /&gt;
* Ante_RED is used to prepare Gaussian input files for the RESP fit&lt;br /&gt;
* RED III is used to fit atom centered charges&lt;br /&gt;
&lt;br /&gt;
The procedure used to create the CSY file for RESP fitting is as follows:&lt;br /&gt;
&lt;br /&gt;
* Cut the CSY residue from the 1W7S protein structure and cap terminal ends with methyl groups&lt;br /&gt;
* Optimise this in Gaussian at PM6 level&lt;br /&gt;
* Run the optimised CSY geometry through ante_RED to create the Gaussian input file involving minimisation at HF level and calculation of the MEP - also get the .P2N file required for RED III&lt;br /&gt;
* Run this Gaussian input file&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;Input file for optimisation:&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;#P hf/6-31G* Opt=(Tight,CalcFC) Freq SCF(Conver=8) Test&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
This will run a HF level optimisation and frequency analysis.  &#039;&#039;Opt=tight&#039;&#039; tightens the cutoffs on forces and step size that are used to determine convergence. An optimization with Opt=Tight will take several more steps than with the default cutoffs. For molecular systems with very small force constants (low frequency vibrational modes), this may be necessary to ensure adequate convergence and reliability of frequencies computed in a subsequent job step. This option can only be used with Berny optimizations. &#039;&#039;Opt = CalcFC&#039;&#039; causes the calculation of an analytical Hessian (in HF). &#039;&#039;SCF(Conver=8)&#039;&#039; sets the SCF convergence criterion to 10^-8.&lt;br /&gt;
&lt;br /&gt;
===Parameter files for use in Gaussian (Gaussian only uses amber94)===&lt;br /&gt;
&lt;br /&gt;
Using external parameter files. It is possible to include MM function entries within an external file by replacing the entries within the input file with the usual Gaussian include file mechanism: e.g., @oplsaa.prm. Parameter files for various force fields are included within the main Gaussian directory (i.e., $g09root/g09). If you want to specify the entire force field via an external file, use an input file like the following:&lt;br /&gt;
&lt;br /&gt;
# UFF=SoftOnly&lt;br /&gt;
&lt;br /&gt;
Read-in force field example&lt;br /&gt;
&lt;br /&gt;
molecule specification&lt;br /&gt;
&lt;br /&gt;
@$g09root/g09/oplsaa.prm&lt;br /&gt;
&lt;br /&gt;
Note that the choice of a molecular mechanics keyword is arbitrary as the entire force field depends on the read-in functions and parameters and not on any internal ones (in other words, specifying Amber would yield the same result as UFF).&lt;br /&gt;
&lt;br /&gt;
==Tues 16th Feb 2010==&lt;br /&gt;
===RESP Fitting===&lt;br /&gt;
&lt;br /&gt;
Following optimisation of the fragment for RESP fitting as seen before. We now have to fit the RESP to a MEP, this involves a few calculations:&lt;br /&gt;
&lt;br /&gt;
* ESP calcs in gaussian&lt;br /&gt;
* Fitting calcs by resp in ambertools&lt;br /&gt;
&lt;br /&gt;
The result is a .mol2 containing the fitted charges. The process has 2  input files:&lt;br /&gt;
&lt;br /&gt;
* .log&lt;br /&gt;
* .p2n&lt;br /&gt;
&lt;br /&gt;
The p2n is created by ante_RED and controls the RESP fitting process. (See RED site). To get a fit that excludes the capping methyls we add a REMARK line:&lt;br /&gt;
&lt;br /&gt;
 REMARK INTRA-MCC 0.0 |  35  36  37  38  39  40  41  42 | R&lt;br /&gt;
&lt;br /&gt;
This must be formatted EXACTLY as shown here (2 spaces between the numbers). The numbers correspond to atoms numbers, the 0.0 constrains these atoms to 0.0 charge and the R tells RESP to remove these atoms from the 2nd round fit and the subsequent mol2 output. Leaving a fragment.&lt;br /&gt;
&lt;br /&gt;
The RED program can be run on CX1! I used the following Jobscript (REDjobscript_4):&lt;br /&gt;
&lt;br /&gt;
 #PBS -l ncpus=1 &lt;br /&gt;
 #PBS -l mem=6000mb&lt;br /&gt;
 #PBS -l walltime=01:00:00&lt;br /&gt;
 #PBS -q pqchem&lt;br /&gt;
 #PBS -joe&lt;br /&gt;
 module load ambertools&lt;br /&gt;
 module load gaussian&lt;br /&gt;
 export GAUSS_SCRDIR=$TMPDIR&lt;br /&gt;
 echo $GAUSS_SCRDIR&lt;br /&gt;
 cd $HOME&lt;br /&gt;
 pwd&lt;br /&gt;
 perl /home/af706/RED-vIII.3.pl &amp;gt; /home/af706/RED-vIII.log&lt;br /&gt;
&lt;br /&gt;
So here we are using the chemistry node pqchem:&lt;br /&gt;
&lt;br /&gt;
* loading the ambertools (for RESP) and gaussian modules&lt;br /&gt;
* defining a scratch directory for the RESP program to read gaussian scratch&lt;br /&gt;
* TEMPDIR is used since the scratch must be empty upon calculation start. The temporary directory will be created fresh on each node as the calculation is started, and it&#039;s unlikely that two calculations will run on the same node.&lt;br /&gt;
* Moving to my HOME directory /home/af706 because the script must be run in the same directory as the input files  (.log and .p2n)&lt;br /&gt;
* RUN!&lt;br /&gt;
&lt;br /&gt;
The fitting process runs single point energy calcs and fits the RESP - overall it took 3mins for my fragment.&lt;br /&gt;
&lt;br /&gt;
===Going from xLeap out to Gaussian in===&lt;br /&gt;
&lt;br /&gt;
2 ways to do this:&lt;br /&gt;
&lt;br /&gt;
* Load in Gaussview&lt;br /&gt;
* Use Pengs scripts&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;Gaussview&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
* The file loads and all residues are recognised (apart from the chromophore). The file contains the same number of atoms as the saved Leap file.&lt;br /&gt;
* Charges are assigned from amber94 - not the chromophore still&lt;br /&gt;
* Amber atom types are assigned correctly to ALL atoms&lt;br /&gt;
* All that remains is to add charges from the RESP fit.&lt;br /&gt;
* PDB information is retained in input&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;TAO&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
* Atoms are added so atom count is different to the file leaving Leap&lt;br /&gt;
* Fails to treat terminal residues correctly&lt;br /&gt;
* Lose residue information&lt;br /&gt;
* Charges are missing from chromophore and atoms in terminal residues&lt;br /&gt;
&lt;br /&gt;
==22nd Feb 2010==&lt;br /&gt;
&lt;br /&gt;
Links to HOW TO for Ambertools + Gaussview/ Ambertools + TAO tools + Gaussview&lt;br /&gt;
&lt;br /&gt;
* Ambertools + Gaussview  - https://www.ch.ic.ac.uk/wiki/index.php/AndyForester_Ambertools_Gaussview&lt;br /&gt;
* Ambertools + TAO tools + Gaussview - https://www.ch.ic.ac.uk/wiki/index.php/AndyForester_Ambertools_TAO&lt;br /&gt;
&lt;br /&gt;
* Running Amber only Calculations - https://www.ch.imperial.ac.uk/wiki/index.php/AndyForesterAmber&lt;br /&gt;
&lt;br /&gt;
Running ONIOM calculations - https://www.ch.ic.ac.uk/wiki/index.php/AndyForesterONIOM&lt;br /&gt;
&lt;br /&gt;
==26th Feb 2010==&lt;br /&gt;
&lt;br /&gt;
===Picking out model Modes===&lt;br /&gt;
&lt;br /&gt;
Integers and integer ranges without a keyword are interpreted as mode numbers, although the [not]mode keywords may be used. The keywords atoms and notatoms can be used to define an atom list whose modes should be included/excluded (respectively). Atoms can also be specified by ONIOM layer via the [not]layer keywords, which accept these values: real for the real system, model for the model system in a 2-layer ONIOM, middle for the middle layer in a 3-layer ONIOM, and small for the model layer of a 3-layer ONIOM. Atoms may be similarly included/excluded by residue with residue and notresidue, which accept lists of residue names or numbers. Both keyword sets function as shorthand forms for atom lists.&lt;br /&gt;
&lt;br /&gt;
==TAO tools==&lt;br /&gt;
&lt;br /&gt;
The expected differences to the one without using his tools are:&lt;br /&gt;
&lt;br /&gt;
* Different partial charges since his scripts use all_amino03 to assign charges,&lt;br /&gt;
  Gaussview uses amber94 charges (Net charge is the same for both files - good thing)&lt;br /&gt;
&lt;br /&gt;
* Potential for some changes in atom-types due to different versions of amber being used.&lt;br /&gt;
&lt;br /&gt;
What actually happened:&lt;br /&gt;
&lt;br /&gt;
* Difference in individual partial charges but same overal net charge&lt;br /&gt;
&lt;br /&gt;
* Different atom type assignment led to some more undefined parameters.&lt;br /&gt;
&lt;br /&gt;
* Defining these and running an energy led to significantly different energy than the non tools&lt;br /&gt;
  method:&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;WITHOUT tools&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
 AMBER calculation of energy.&lt;br /&gt;
 Energy per function type:&lt;br /&gt;
 Direct non-bonded                                 122152.193595037&lt;br /&gt;
 Non-bonded pair                                  -122162.319396395&lt;br /&gt;
 Harmonic stretch I                                     0.354921205&lt;br /&gt;
 Harmonic bend I                                        0.989669436&lt;br /&gt;
 Amber torsion                                          1.587390468&lt;br /&gt;
 Improper torsion                                       0.013641893&lt;br /&gt;
 Energy per function class:&lt;br /&gt;
        Coulomb              -9.200657698&lt;br /&gt;
    Vanderwaals              -0.925143661&lt;br /&gt;
     Stretching               0.354921205&lt;br /&gt;
        Bending               0.989669436&lt;br /&gt;
        Torsion               1.587390468&lt;br /&gt;
   Out-of-plane               0.013641893&lt;br /&gt;
 Energy=    -7.18017836     NIter=   0.&lt;br /&gt;
 Dipole moment=     -25.836974    -115.466231     -34.635294&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;WITH TAO tools&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
 AMBER calculation of energy.&lt;br /&gt;
 Energy per function type:&lt;br /&gt;
 Direct non-bonded                                 137343.707022741&lt;br /&gt;
 Non-bonded pair                                  -137347.284026475&lt;br /&gt;
 Harmonic stretch I                                     0.354921205&lt;br /&gt;
 Harmonic bend I                                        1.082982995&lt;br /&gt;
 Amber torsion                                          1.589487307&lt;br /&gt;
 Improper torsion                                       0.013645198&lt;br /&gt;
 Energy per function class:&lt;br /&gt;
        Coulomb              -6.890960120&lt;br /&gt;
    Vanderwaals               3.313956385&lt;br /&gt;
     Stretching               0.354921205&lt;br /&gt;
        Bending               1.082982995&lt;br /&gt;
        Torsion               1.589487307&lt;br /&gt;
   Out-of-plane               0.013645198&lt;br /&gt;
 Energy=   -0.535967030     NIter=   0.&lt;br /&gt;
 Dipole moment=     -29.988370    -117.918540     -32.409355&lt;br /&gt;
&lt;br /&gt;
Obvious where the differences are.&lt;/div&gt;</summary>
		<author><name>Af706</name></author>
	</entry>
	<entry>
		<id>https://chemwiki.ch.ic.ac.uk/index.php?title=AndyForesterONIOM&amp;diff=103147</id>
		<title>AndyForesterONIOM</title>
		<link rel="alternate" type="text/html" href="https://chemwiki.ch.ic.ac.uk/index.php?title=AndyForesterONIOM&amp;diff=103147"/>
		<updated>2010-03-10T13:58:12Z</updated>

		<summary type="html">&lt;p&gt;Af706: New page: =ONIOM calculations on 1W7S=  Back to Main - https://www.ch.ic.ac.uk/wiki/index.php/AndyForester&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;=ONIOM calculations on 1W7S=&lt;br /&gt;
&lt;br /&gt;
Back to Main - https://www.ch.ic.ac.uk/wiki/index.php/AndyForester&lt;/div&gt;</summary>
		<author><name>Af706</name></author>
	</entry>
	<entry>
		<id>https://chemwiki.ch.ic.ac.uk/index.php?title=AndyForester&amp;diff=98905</id>
		<title>AndyForester</title>
		<link rel="alternate" type="text/html" href="https://chemwiki.ch.ic.ac.uk/index.php?title=AndyForester&amp;diff=98905"/>
		<updated>2010-02-26T15:11:29Z</updated>

		<summary type="html">&lt;p&gt;Af706: /* 22nd Feb 2010 */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;=HBDI Calculations=&lt;br /&gt;
==Wed 21st Oct 2009==&lt;br /&gt;
&lt;br /&gt;
- Started project work&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
- Gaussview and Gaussian working&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
- Working out how to use the command inputs&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
- Trying to sort VPN out&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
- Working to get the project proposal written&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
&lt;br /&gt;
==Tue 27th Oct 2009==&lt;br /&gt;
&lt;br /&gt;
Drew up the HBDI molecule in Gaussview, the molecule will be optimised at the HF 3-21G level on the mac (locally). &amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
DFT B3LYP 6-31G (Becke threeparameter hybrid exchange functional5 and the Lee-Yang-Parr correlation functional) method optimisation was run on the resulting file, the MP2 6-31G method optimisation. Each of these had a corresponding freq calculation to check that minima were reached. All calculations were successful, with no negative vibrations.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;jmol&amp;gt;&lt;br /&gt;
  &amp;lt;jmolAppletButton&amp;gt;&lt;br /&gt;
    &amp;lt;uploadedFileContents&amp;gt;HBDI_MP2_6-31G_FREQ.mol&amp;lt;/uploadedFileContents&amp;gt;&lt;br /&gt;
    &amp;lt;text&amp;gt;HBDI optimised at the MP2 6-31G level&amp;lt;/text&amp;gt;&lt;br /&gt;
    &amp;lt;color&amp;gt;#FFFFFF&amp;lt;/color&amp;gt;&lt;br /&gt;
    &amp;lt;size&amp;gt;600&amp;lt;/size&amp;gt;&lt;br /&gt;
    &amp;lt;/jmolAppletButton&amp;gt;&lt;br /&gt;
&amp;lt;/jmol&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
&lt;br /&gt;
==Wed 28th Oct 2009==&lt;br /&gt;
&lt;br /&gt;
Ran DFT B3YLP 6-311G calculations on HBDI (Opt and Freq) since these were the calculations used in a published study of HBDI.&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
Optimisation job finshed successfully, the frequency calculation did not finish succesfully. Error was:&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
&lt;br /&gt;
 ****Warning!!: The largest alpha MO coefficient is  0.17588883D+02&lt;br /&gt;
 Symmetrizing basis deriv contribution to polar:&lt;br /&gt;
 IMax=3 JMax=2 DiffMx= 0.00D+00&lt;br /&gt;
 G2DrvN: will do    29 centers at a time, making    1 passes doing MaxLOS=1.&lt;br /&gt;
 Calling FoFCou, ICntrl=  3107 FMM=F I1Cent=   0 AccDes= 0.00D+00.&lt;br /&gt;
 FoFDir/FoFCou used for L=0 through L=1.&lt;br /&gt;
 End of G2Drv Frequency-dependent properties file   721 does not exist.&lt;br /&gt;
 End of G2Drv Frequency-dependent properties file   722 does not exist.&lt;br /&gt;
          IDoAtm=1111111111111111111111111111&lt;br /&gt;
          Differentiating once with respect to electric field.&lt;br /&gt;
                with respect to dipole field.&lt;br /&gt;
          Differentiating once with respect to nuclear coordinates.&lt;br /&gt;
          Keep R1 ints in memory in canonical form, NReq=447529180.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
==Thurs 29th Oct 2009==&lt;br /&gt;
&lt;br /&gt;
===Learning to use ONIOM===&lt;br /&gt;
&lt;br /&gt;
The example on chem wiki was attempted. Drawn up in Gaussview and optimised initially using molecular mechanics AMBER force field.&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
The ONIOM calculation was set up in Gaussview using DFT B3LYP 6-311G(d):AMBER methods with the benzene ring as the high level and chain at low level. Electronic embedding was NOT used in this initial calculation. &amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;jmol&amp;gt;&lt;br /&gt;
  &amp;lt;jmolAppletButton&amp;gt;&lt;br /&gt;
    &amp;lt;uploadedFileContents&amp;gt;ONIOM_B3LYP_6-311G_AMBER.mol&amp;lt;/uploadedFileContents&amp;gt;&lt;br /&gt;
    &amp;lt;text&amp;gt;Test Molecule optimised using B3LYP 6-311G(d):AMBER&amp;lt;/text&amp;gt;&lt;br /&gt;
    &amp;lt;color&amp;gt;#FFFFFF&amp;lt;/color&amp;gt;&lt;br /&gt;
    &amp;lt;size&amp;gt;600&amp;lt;/size&amp;gt;&lt;br /&gt;
  &amp;lt;/jmolAppletButton&amp;gt;&lt;br /&gt;
&amp;lt;/jmol&amp;gt;&lt;br /&gt;
&lt;br /&gt;
This conformation is NOT the lowest energy conformation, the initial AMBER MM calculation did not find the global minimum probably due to the drawn up starting geometry.&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
The molecule was drawn up again and DFT B3LYP 6-311G(d):AMBER and pure B3LYP 6-311G(d) calculations run. Absolute energy differences between the 2 conformations was very small at the ONIOM(DFT B3LYP 6-311G(d):AMBER) level. -232.27532671 vs -232.27315709 au respectively, a 0.00271au (4.359 744 17×10-18 J = 1au) difference.&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;jmol&amp;gt;&lt;br /&gt;
  &amp;lt;jmolAppletButton&amp;gt;&lt;br /&gt;
    &amp;lt;uploadedFileContents&amp;gt;ONIOM_B3LYP_6-311G_AMBER2.mol&amp;lt;/uploadedFileContents&amp;gt;&lt;br /&gt;
    &amp;lt;text&amp;gt;Test Molecule optimised using B3LYP 6-311G(d):AMBER&amp;lt;/text&amp;gt;&lt;br /&gt;
    &amp;lt;color&amp;gt;#FFFFFF&amp;lt;/color&amp;gt;&lt;br /&gt;
    &amp;lt;size&amp;gt;600&amp;lt;/size&amp;gt;&lt;br /&gt;
  &amp;lt;/jmolAppletButton&amp;gt;&lt;br /&gt;
&amp;lt;/jmol&amp;gt;&lt;br /&gt;
&lt;br /&gt;
It has already become obvious that using ONIOM decreases calculation time considerably. The table below shows a &amp;quot;CPU time&amp;quot; comparison on the above molecule using the different approaches.&lt;br /&gt;
&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot; border=&amp;quot;1&amp;quot;&lt;br /&gt;
| align=&amp;quot;center&amp;quot; style=&amp;quot;background:#f0f0f0;&amp;quot;|&#039;&#039;&#039;Calculation&#039;&#039;&#039;&lt;br /&gt;
| align=&amp;quot;center&amp;quot; style=&amp;quot;background:#f0f0f0;&amp;quot;|&#039;&#039;&#039;Time (1st attempt)&#039;&#039;&#039;&lt;br /&gt;
| align=&amp;quot;center&amp;quot; style=&amp;quot;background:#f0f0f0;&amp;quot;|&#039;&#039;&#039;Time (2nd attempt)&#039;&#039;&#039;&lt;br /&gt;
|-&lt;br /&gt;
|   ONIOM(DFT B3LYP 6-311G(d):AMBER)||4 minutes 22.0 seconds||8 minutes  5.5 seconds&lt;br /&gt;
|-&lt;br /&gt;
|   DFT 3LYP 6-311G(d)||2 hours 12 minutes 21.9 seconds||2 hours 30 minutes  0.8 seconds&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
&lt;br /&gt;
==Mon 2nd Nov 2009==&lt;br /&gt;
&lt;br /&gt;
Having obtained optimized structures of HBDI at various theory levels and basis set variations, these structures will now be compared to previous literature. Extensive studies have been performed using HF and B3LYP methods previously. It has been noted in Jaspers recent review that HF does not model frequencies very accurately - my computations agree HF 6-31G frequencies are very far out. B3LYP frequencies are much closer to the experimental values.&lt;br /&gt;
&lt;br /&gt;
==Tues 3rd Nov 2009==&lt;br /&gt;
&lt;br /&gt;
Tabulated frerquency comparison&lt;br /&gt;
&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot; border=&amp;quot;1&amp;quot;&lt;br /&gt;
| align=&amp;quot;center&amp;quot; style=&amp;quot;background:#f0f0f0;&amp;quot;|&lt;br /&gt;
| align=&amp;quot;center&amp;quot; style=&amp;quot;background:#f0f0f0;&amp;quot;|&#039;&#039;&#039;A.Esposito HF 6-31G(d)&#039;&#039;&#039;&lt;br /&gt;
| align=&amp;quot;center&amp;quot; style=&amp;quot;background:#f0f0f0;&amp;quot;|&#039;&#039;&#039;A.Esposito B3LYP 6-31G(d)&#039;&#039;&#039;&lt;br /&gt;
| align=&amp;quot;center&amp;quot; style=&amp;quot;background:#f0f0f0;&amp;quot;|&#039;&#039;&#039;X.He B3LYP 6-31G(d)&#039;&#039;&#039;&lt;br /&gt;
| align=&amp;quot;center&amp;quot; style=&amp;quot;background:#f0f0f0;&amp;quot;|&#039;&#039;&#039;HF 6-31G(d)&#039;&#039;&#039;&lt;br /&gt;
| align=&amp;quot;center&amp;quot; style=&amp;quot;background:#f0f0f0;&amp;quot;|&#039;&#039;&#039;B3LYP 6-31G(d)&#039;&#039;&#039;&lt;br /&gt;
|-&lt;br /&gt;
| C=O||1987||1802||1802||1986||1801&lt;br /&gt;
|-&lt;br /&gt;
| C=C||1880||1705||1704||1880||1704&lt;br /&gt;
|-&lt;br /&gt;
| Phenol||1809||1665||1663||1812||1666&lt;br /&gt;
|-&lt;br /&gt;
| Phenol||1781||1635||1631||1777||1635&lt;br /&gt;
|-&lt;br /&gt;
| C=N||1844||1620||1618||1844||1618&lt;br /&gt;
|-&lt;br /&gt;
| Phenol||1696||1562||1558||1694||1562&lt;br /&gt;
|-&lt;br /&gt;
| C-N||1607||1483||N/A||1606||1482&lt;br /&gt;
|-&lt;br /&gt;
| Phenol||1602||1487||N/A||1603||1488&lt;br /&gt;
|-&lt;br /&gt;
| Total Energy(au)||||||||-720.043||-724.465&lt;br /&gt;
|-&lt;br /&gt;
| &lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
==Thurs 5th Nov 2009==&lt;br /&gt;
&lt;br /&gt;
Having got Jaspers input files, several things were noticed:&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
* Solvent models were used - should be in vacum.... I have re run these in G09 with No solvent on both DFT 6-311G(d,p) and HF 6-31+G(d) methods/basis sets for comparison&lt;br /&gt;
* The models in his Biophys paper using other residues in contact COO- for example are not present. He has not reported the pure HBDI calculation results.&lt;br /&gt;
* More than just HBDI is likely to be required, meaning larger than expected high level region in the GFP calculation. (3 layer ONIOM DFT:HF:AMBER)?&lt;br /&gt;
&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
Other things to do:&lt;br /&gt;
&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
* Work on a guide to loading .pdb files into Gaussview 5 - how much information is maintained?&lt;br /&gt;
* Are residues defined still in Gaussview input files&lt;br /&gt;
* AMBER MM requires definitions of charges etc... how easy is it to import from .pdb and get a file that can be run in opt etc...&lt;br /&gt;
&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
* Find a model system which is larger than HBDI but incorporates this molecule &amp;lt;=500 atoms (Limit of size for pure DFT calculation)&lt;br /&gt;
* Possibilities include a system alread developed by someone else online&lt;br /&gt;
* A &amp;quot;custom&amp;quot; system made by me to model this (time intensive)&lt;br /&gt;
* &amp;quot;strip&amp;quot; back the GFP pdb file to 500 atoms surrounding the HBDI chromophore.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
==Fri 6th Nov 2009==&lt;br /&gt;
&lt;br /&gt;
Yesterday started by trying to follow the tutorial on loading the GFP molecule from .pdb, this resulted in a problem with Gaussviews addition of hydrogens. It over protonated the nitrogens and carbon in the chromophore leading to over valent parts. I then tried several other molecules (PYP, a scorpion toxin and a strand of RNA) which had no problems with hydrogen addition.&lt;br /&gt;
&lt;br /&gt;
I tried to use ChemBio 3D to load the .pdb and correctly assign hydrogens/connectivity. This resulted in the same protonation problems!&lt;br /&gt;
&lt;br /&gt;
==Mon 9th Nov 2009==&lt;br /&gt;
&lt;br /&gt;
TINKER - http://dasher.wustl.edu/tinker/ was installed with the Force Field Explorer (FFE) interface on the mac. This was used to load the same .pdb file and resulted in the same protonation around the chromophore. So far the SAME protonation pattern (incorrect) around the chromophore has been found using:&lt;br /&gt;
&lt;br /&gt;
*Gaussview&lt;br /&gt;
*ChemBio 3D&lt;br /&gt;
*TINKER&lt;br /&gt;
&lt;br /&gt;
Looking into how the .pdb file format works, the molecule is built from a series of residues, it becomes clear as you look at the sequence that is building the GFP protein chromophore (residues 65,66,67 in 1GFL) are not then reproducing the internal cyclisation properly leaving protons attached and thus atoms hypervalent. This accounts for the consistent error loading from the .pdb. As a result must all molecules expressed as a .pdb file must be pure protein structures...?&lt;br /&gt;
&lt;br /&gt;
===Comparison of the 2 available wild type GFP structures===&lt;br /&gt;
&lt;br /&gt;
The 2 structures being considered here are:&lt;br /&gt;
&lt;br /&gt;
*1GFL - Yang et al. -http://www.pdb.org/pdb/explore/explore.do?structureId=1GFL&lt;br /&gt;
*1W7S - Jaspers stucture - http://www.pdb.org/pdb/explore/explore.do?structureId=1W7S&lt;br /&gt;
&lt;br /&gt;
These 2 structures differ in that 1GFL contains 2 GFP molecules, whereas 1W7S contains 4. These were both stripped down to 1 unit of GFP, using the &amp;quot;pdb secondary structure&amp;quot; function in Gaussview 5. 1GFL had chain B removed and 1W7S had chains B,C and D removed. Next the residues were investigated that created the chromophore. The results are not consistent because GFL uses 3 residues (65,66,67) to form the chromophore. 1W7S has residue 66 (labelled CSY) as the entire chromophore! Both get protonated the same (incorrectly) which is odd considering how they molecules are built.&lt;br /&gt;
&lt;br /&gt;
The next test was to &amp;quot;modify bond&amp;quot; in Gaussview 5 and create a &amp;quot;cleaned&amp;quot; chromophore structure (no bonding information from the residue) by manually changing the bonds. The chromophore in each .pdb file was altered to look like HBDI in terms of double single bonds and the hydrogens taken out (makes no difference as these are created each time the .pdb is loaded). When the new .pdb files (1GFL_cleaned.pdb and 1W7S_cleaned.pdb) were loaded they were correctly protonated! Success.&lt;br /&gt;
&lt;br /&gt;
===1W7S improvement===&lt;br /&gt;
&lt;br /&gt;
Chain A (not cleaned) of the 1W7S .pdb was submitted to run a AMBER calculation, as expected it failed with the lines:&lt;br /&gt;
&lt;br /&gt;
 Missing atomic parameters for atom   474 IAtTyp=    20000000&lt;br /&gt;
 Missing atomic parameters.&lt;br /&gt;
&lt;br /&gt;
This error says that the atom type of atom 474 is not defined. One way to counter this is to locate the atom and then change its definition manually (as shown in the example https://www.ch.ic.ac.uk/wiki/index.php/PYP_:_Photoactive_yellow_protein - time intensive!)&lt;br /&gt;
&lt;br /&gt;
Instead the molecule was cleaned (manually changing the bonding in the chromophore), after the bond updates the structure was re submitted, again failing but with a different message:&lt;br /&gt;
&lt;br /&gt;
 Include all MM classes&lt;br /&gt;
  Bondstretch undefined between atoms    473    478 CC-N* [H,H]  &lt;br /&gt;
  Bondstretch undefined between atoms    475    476 CC-CC [H,H]  &lt;br /&gt;
  Bondstretch undefined between atoms    475    486 CC-CM [H,H]  &lt;br /&gt;
  Bondstretch undefined between atoms    476    477 CC-O [H,H]  &lt;br /&gt;
  Bondstretch undefined between atoms    476    478 CC-N* [H,H]  &lt;br /&gt;
  Bondstretch undefined between atoms    486   2559 CM-HC [H,H]  &lt;br /&gt;
  Angle bend  undefined between atoms    473    474    475 CC-NB-CC [H,H,H]  &lt;br /&gt;
  Angle bend  undefined between atoms    473    478    479 CC-N*-CT [H,H,H]  &lt;br /&gt;
  Angle bend  undefined between atoms    473    478    476 CC-N*-CC [H,H,H]  &lt;br /&gt;
  Angle bend  undefined between atoms    473    482    483 CC-CT-N [H,H,H]  &lt;br /&gt;
  Angle bend  undefined between atoms    473    482   3857 CC-CT-H1 [H,H,H]  &lt;br /&gt;
  Angle bend  undefined between atoms    474    475    476 NB-CC-CC [H,H,H]  &lt;br /&gt;
  Angle bend  undefined between atoms    474    475    486 NB-CC-CM [H,H,H]  &lt;br /&gt;
  Angle bend  undefined between atoms    474    473    478 NB-CC-N* [H,H,H]  &lt;br /&gt;
  Angle bend  undefined between atoms    475    476    477 CC-CC-O [H,H,H]  &lt;br /&gt;
  Angle bend  undefined between atoms    475    476    478 CC-CC-N* [H,H,H]  &lt;br /&gt;
  Angle bend  undefined between atoms    475    486    487 CC-CM-CA [H,H,H]  &lt;br /&gt;
  Angle bend  undefined between atoms    475    486   2559 CC-CM-HC [H,H,H]  &lt;br /&gt;
  Angle bend  undefined between atoms    476    478    479 CC-N*-CT [H,H,H]  &lt;br /&gt;
  Angle bend  undefined between atoms    476    475    486 CC-CC-CM [H,H,H]  &lt;br /&gt;
  Angle bend  undefined between atoms    477    476    478 O-CC-N* [H,H,H]  &lt;br /&gt;
  Angle bend  undefined between atoms    478    479    480 N*-CT-C [H,H,H]  &lt;br /&gt;
  Angle bend  undefined between atoms    478    473    482 N*-CC-CT [H,H,H]  &lt;br /&gt;
  Angle bend  undefined between atoms    486    487    488 CM-CA-CA [H,H,H]  &lt;br /&gt;
  Angle bend  undefined between atoms    486    487    489 CM-CA-CM [H,H,H]  &lt;br /&gt;
  Angle bend  undefined between atoms    487    486   2559 CA-CM-HC [H,H,H]  &lt;br /&gt;
  Angle bend  undefined between atoms    488    487    489 CA-CA-CM [H,H,H]  &lt;br /&gt;
  Angle bend  undefined between atoms    490    492    491 CA-C-CM [H,H,H]  &lt;br /&gt;
  Angle bend  undefined between atoms    491    492    493 CM-C-OH [H,H,H]  &lt;br /&gt;
  MM function not complete&lt;br /&gt;
&lt;br /&gt;
This is a better error message since all the atoms are now defined, only parameters are missing. For parameters that involve only atoms in the chromophore at the high level in the ONIOM calculation, the values can be set to 0 in the input file. This is because the ONIOM extrapolation will cancel MM from this high level region, To do this the keyword &amp;quot;Amber=Hardfirst&amp;quot; and wildcards for the missing parameters which are zero valued are placed in the input file.&lt;br /&gt;
&lt;br /&gt;
[[Image:Atoms involving undefined parameters.png]]&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
&lt;br /&gt;
The above screeshot shows highlighted the atoms listed in the error... most of them if not all will be involved in the high level calculation and thus may not require parameters defining (other than to be set to 0). The reason the parameters are undefined is that the AMBER force field is very specific to protein bonds and thus the organic chromophore has bonds that are not found in this parameter set.&lt;br /&gt;
&lt;br /&gt;
The atoms in the chromophore were frozen in the atom group editor and the input file had a parameter set added to the end setting chromophore parameters to 0:&lt;br /&gt;
&lt;br /&gt;
 HrmStr1 CC N* 0.0 0.0&lt;br /&gt;
 HrmStr1 CC CC 0.0 0.0&lt;br /&gt;
 HrmStr1 CC CM 0.0 0.0&lt;br /&gt;
 HrmStr1 CC O 0.0 0.0&lt;br /&gt;
 HrmStr1 CM HC 0.0 0.0&lt;br /&gt;
 HrmBnd1 CC NB CC 0.0 0.0&lt;br /&gt;
 HrmBnd1 CC N* CC 0.0 0.0&lt;br /&gt;
 HrmBnd1 CC CT N 0.0 0.0&lt;br /&gt;
 HrmBnd1 CC CT H1 0.0 0.0&lt;br /&gt;
 HrmBnd1 NB CC CC 0.0 0.0&lt;br /&gt;
 HrmBnd1 NB CC CM 0.0 0.0&lt;br /&gt;
 HrmBnd1 NB CC N* 0.0 0.0&lt;br /&gt;
 HrmBnd1 CC CC O  0.0 0.0&lt;br /&gt;
 HrmBnd1 CC CC N* 0.0 0.0&lt;br /&gt;
 HrmBnd1 CC CM CA 0.0 0.0&lt;br /&gt;
 HrmBnd1 CC CM HC 0.0 0.0&lt;br /&gt;
 HrmBnd1 CC CC CM 0.0 0.0&lt;br /&gt;
 HrmBnd1 O CC N*  0.0 0.0&lt;br /&gt;
 HrmBnd1 N* CT C  0.0 0.0&lt;br /&gt;
 HrmBnd1 N* CC CT 0.0 0.0&lt;br /&gt;
 HrmBnd1 CM CA CA 0.0 0.0&lt;br /&gt;
 HrmBnd1 CA CM HC 0.0 0.0&lt;br /&gt;
 HrmBnd1 CA C CM 0.0 0.0&lt;br /&gt;
 HrmBnd1 CM C OH 0.0 0.0&lt;br /&gt;
 HrmBnd1 CC N* CT 0.0 0.0&lt;br /&gt;
 HrmBnd1 CM CA CM 0.0 0.0&lt;br /&gt;
&lt;br /&gt;
This file then ran successfully for an AMBER single point energy calculation. It was then submitted for AMBER optimisation using &amp;quot;amber=hardfirst geom=connectivity&amp;quot; and returned a successful minimum with Forces and Displacements converged:&lt;br /&gt;
&lt;br /&gt;
         Item               Value     Threshold  Converged?&lt;br /&gt;
  Maximum Force            0.000089     0.000450     YES&lt;br /&gt;
  RMS     Force            0.000004     0.000300     YES&lt;br /&gt;
  Maximum Displacement     0.000089     0.001800     YES&lt;br /&gt;
  RMS     Displacement     0.000004     0.001200     YES&lt;br /&gt;
  Predicted change in Energy=-1.156135D-07&lt;br /&gt;
  Optimization completed.&lt;br /&gt;
     -- Stationary point found.&lt;br /&gt;
&lt;br /&gt;
===Cleaned .pdb files and TINKER===&lt;br /&gt;
&lt;br /&gt;
*The files that had been cleaned were opened in TINKER, it is necessary to convert the .pdb to a .xyz file to run any calculations within TINKER. This is currently failing due to an &amp;quot;atom missing on SER-228&amp;quot; - checking this there is no SER residue at 228 (in fact a GLU) so a bit puzzled. This is the 1W7S cleaned version.&lt;br /&gt;
&lt;br /&gt;
*PYP was loaded in its .pdb format to TINKER and was immediately successful in being converted to .xyz then a MINIMISE algorithm ran successfully with the energy at its X-ray crystal structure as 261KJ/mol reduced to -3670.5355KJ/mol in 1645 iterations - this took a while (45mins locally)!&lt;br /&gt;
&lt;br /&gt;
*Interestingly the .xyz output from the minimization can be converted back to .pdb (loses hydrogens minimization) and this PYP file was loaded into Gaussview 5 - the file then ran successfully for an AMBER single point energy calculation! cf. the first attempt to get PYP to run was unsuccessful, and the tutorial underwent much more hassle to run this calculation.&lt;br /&gt;
&lt;br /&gt;
===What .pdb information is retained?===&lt;br /&gt;
&lt;br /&gt;
The only molecule that has been run with AMBER opt (GFP) has lost its secondary structure information but RETAINED its residue information.&lt;br /&gt;
&lt;br /&gt;
==Fri 13th Nov 2009==&lt;br /&gt;
&lt;br /&gt;
===Conversion from.xyz to .com===&lt;br /&gt;
A protein with 2 residues (Glu-Asp) was built using TINKER, saved as:&lt;br /&gt;
&lt;br /&gt;
*.xyz&lt;br /&gt;
*.pdb&lt;br /&gt;
*.com&lt;br /&gt;
&lt;br /&gt;
The initial format was .xyz (created in TINEKR as a native format), converted to .pdb using TINKER and then loaded into Gaussview and saved as a .com.&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
&lt;br /&gt;
.xyz&lt;br /&gt;
&lt;br /&gt;
    36  Protein built with TINKER/Force Field Explorer&lt;br /&gt;
     1  CT     0.000000    0.000000    0.000000   340     2     4     5     6&lt;br /&gt;
     2  C      0.000000    0.000000    1.510000   342     1     3     7&lt;br /&gt;
     3  O      1.028938    0.000000    2.165506   343     2&lt;br /&gt;
     4  HC     1.056270    0.000000   -0.341166   341     1&lt;br /&gt;
     5  HC    -0.497647    0.931694   -0.341166   341     1&lt;br /&gt;
     6  HC    -0.497647   -0.931694   -0.341166   341     1&lt;br /&gt;
     7  N     -1.236201    0.000000    2.027114   232     2     8    11&lt;br /&gt;
     8  CT    -1.446961    0.000000    3.471822   233     7     9    12    13&lt;br /&gt;
     9  C     -2.509557   -0.992751    3.878558   234     8    10    22&lt;br /&gt;
    10  O     -3.569672   -1.098634    3.284129   236     9&lt;br /&gt;
    11  H     -2.025518    0.000000    1.381066   235     7&lt;br /&gt;
    12  H1    -0.503221   -0.281395    3.983946   237     8&lt;br /&gt;
    13  CT    -1.910491    1.397334    3.923704   238     8    14    18    19&lt;br /&gt;
    14  CT    -2.132799    1.397334    5.447574   240    13    15    20    21&lt;br /&gt;
    15  C     -2.585014    2.781238    5.848089   242    14    16    17&lt;br /&gt;
    16  O2    -2.816736    2.973609    7.061266   243    15&lt;br /&gt;
    17  O2    -2.693194    3.629064    4.935955   243    15&lt;br /&gt;
    18  HC    -2.861794    1.653482    3.412322   239    13&lt;br /&gt;
    19  HC    -1.132567    2.145528    3.664589   239    13&lt;br /&gt;
    20  HC    -1.170306    1.191123    5.960588   241    14&lt;br /&gt;
    21  HC    -2.946052    0.685840    5.701535   241    14&lt;br /&gt;
    22  N     -2.153997   -1.723291    4.944151   210     9    23    26&lt;br /&gt;
    23  CT    -3.047986   -2.739703    5.491225   211    22    24    27    28&lt;br /&gt;
    24  C     -3.120370   -2.663723    6.997574   212    23    25    34&lt;br /&gt;
    25  O     -2.128241   -2.535414    7.695870   214    24&lt;br /&gt;
    26  H     -1.239994   -1.547921    5.361576   213    22&lt;br /&gt;
    27  H1    -4.072297   -2.585746    5.092246   215    23&lt;br /&gt;
    28  CT    -2.526745   -4.139104    5.114946   216    23    29    32    33&lt;br /&gt;
    29  C     -3.476741   -5.157722    5.698062   218    28    30    31&lt;br /&gt;
    30  O2    -3.212872   -6.362275    5.493304   219    29&lt;br /&gt;
    31  O2    -4.454666   -4.718803    6.341112   219    29&lt;br /&gt;
    32  HC    -1.530243   -4.298013    5.577366   217    28&lt;br /&gt;
    33  HC    -2.540086   -4.252948    4.010880   217    28&lt;br /&gt;
    34  N     -4.374888   -2.752228    7.460123   344    24    35    36&lt;br /&gt;
    35  H     -5.140865   -2.858197    6.794954   345    34&lt;br /&gt;
    36  H     -4.534830   -2.711583    8.466684   345    34&lt;br /&gt;
&lt;br /&gt;
.pdb&lt;br /&gt;
&lt;br /&gt;
 HEADER    Protein built with TINKER/Force Field Explorer&lt;br /&gt;
 COMPND&lt;br /&gt;
 SOURCE&lt;br /&gt;
 ATOM      1  CH3 ACE     1       0.000   0.000   0.000&lt;br /&gt;
 ATOM      2  C   ACE     1       0.000   0.000   1.510&lt;br /&gt;
 ATOM      3  O   ACE     1       1.029   0.000   2.166&lt;br /&gt;
 ATOM      4  H1  ACE     1       1.056   0.000  -0.341&lt;br /&gt;
 ATOM      5  H2  ACE     1      -0.498   0.932  -0.341&lt;br /&gt;
 ATOM      6  H3  ACE     1      -0.498  -0.932  -0.341&lt;br /&gt;
 ATOM      7  N   GLU     2      -1.236   0.000   2.027&lt;br /&gt;
 ATOM      8  CA  GLU     2      -1.447   0.000   3.472&lt;br /&gt;
 ATOM      9  C   GLU     2      -2.510  -0.993   3.879&lt;br /&gt;
 ATOM     10  O   GLU     2      -3.570  -1.099   3.284&lt;br /&gt;
 ATOM     11  CB  GLU     2      -1.910   1.397   3.924&lt;br /&gt;
 ATOM     12  CG  GLU     2      -2.133   1.397   5.448&lt;br /&gt;
 ATOM     13  CD  GLU     2      -2.585   2.781   5.848&lt;br /&gt;
 ATOM     14  OE1 GLU     2      -2.817   2.974   7.061&lt;br /&gt;
 ATOM     15  OE2 GLU     2      -2.693   3.629   4.936&lt;br /&gt;
 ATOM     16  H   GLU     2      -2.026   0.000   1.381&lt;br /&gt;
 ATOM     17  HA  GLU     2      -0.503  -0.281   3.984&lt;br /&gt;
 ATOM     18  HB2 GLU     2      -2.862   1.653   3.412&lt;br /&gt;
 ATOM     19  HB3 GLU     2      -1.133   2.146   3.665&lt;br /&gt;
 ATOM     20  HG2 GLU     2      -1.170   1.191   5.961&lt;br /&gt;
 ATOM     21  HG3 GLU     2      -2.946   0.686   5.702&lt;br /&gt;
 ATOM     22  N   ASP     3      -2.154  -1.723   4.944&lt;br /&gt;
 ATOM     23  CA  ASP     3      -3.048  -2.740   5.491&lt;br /&gt;
 ATOM     24  C   ASP     3      -3.120  -2.664   6.998&lt;br /&gt;
 ATOM     25  O   ASP     3      -2.128  -2.535   7.696&lt;br /&gt;
 ATOM     26  CB  ASP     3      -2.527  -4.139   5.115&lt;br /&gt;
 ATOM     27  CG  ASP     3      -3.477  -5.158   5.698&lt;br /&gt;
 ATOM     28  OD1 ASP     3      -3.213  -6.362   5.493&lt;br /&gt;
 ATOM     29  OD2 ASP     3      -4.455  -4.719   6.341&lt;br /&gt;
 ATOM     30  H   ASP     3      -1.240  -1.548   5.362&lt;br /&gt;
 ATOM     31  HA  ASP     3      -4.072  -2.586   5.092&lt;br /&gt;
 ATOM     32  HB2 ASP     3      -1.530  -4.298   5.577&lt;br /&gt;
 ATOM     33  HB3 ASP     3      -2.540  -4.253   4.011&lt;br /&gt;
 ATOM     34  N   NH2     4      -4.375  -2.752   7.460&lt;br /&gt;
 ATOM     35  H1  NH2     4      -5.141  -2.858   6.795&lt;br /&gt;
 ATOM     36  H2  NH2     4      -4.535  -2.712   8.467&lt;br /&gt;
 END&lt;br /&gt;
&lt;br /&gt;
.com&lt;br /&gt;
&lt;br /&gt;
 %chk=File_Test_Glu-Asp.chk&lt;br /&gt;
 # hf/3-21g geom=connectivity&lt;br /&gt;
 &lt;br /&gt;
 Protein built with TINKER/Force Field Explorer&lt;br /&gt;
 &lt;br /&gt;
 0 1&lt;br /&gt;
 C(PDBName=CH3,ResName=ACE,ResNum=1)                   0.00000000    0.00000000    0.00000000&lt;br /&gt;
 C(PDBName=C,ResName=ACE,ResNum=1)                     0.00000000    0.00000000    1.51000000&lt;br /&gt;
 O(PDBName=O,ResName=ACE,ResNum=1)                     1.02900000    0.00000000    2.16600000&lt;br /&gt;
 H(PDBName=H1,ResName=ACE,ResNum=1)                    1.05600000    0.00000000   -0.34100000&lt;br /&gt;
 H(PDBName=H2,ResName=ACE,ResNum=1)                   -0.49800000    0.93200000   -0.34100000&lt;br /&gt;
 H(PDBName=H3,ResName=ACE,ResNum=1)                   -0.49800000   -0.93200000   -0.34100000&lt;br /&gt;
 N(PDBName=N,ResName=GLU,ResNum=2)                    -1.23600000    0.00000000    2.02700000&lt;br /&gt;
 C(PDBName=CA,ResName=GLU,ResNum=2)                   -1.44700000    0.00000000    3.47200000&lt;br /&gt;
 C(PDBName=C,ResName=GLU,ResNum=2)                    -2.51000000   -0.99300000    3.87900000&lt;br /&gt;
 O(PDBName=O,ResName=GLU,ResNum=2)                    -3.57000000   -1.09900000    3.28400000&lt;br /&gt;
 C(PDBName=CB,ResName=GLU,ResNum=2)                   -1.91000000    1.39700000    3.92400000&lt;br /&gt;
 C(PDBName=CG,ResName=GLU,ResNum=2)                   -2.13300000    1.39700000    5.44800000&lt;br /&gt;
 C(PDBName=CD,ResName=GLU,ResNum=2)                   -2.58500000    2.78100000    5.84800000&lt;br /&gt;
 O(PDBName=OE1,ResName=GLU,ResNum=2)                  -2.81700000    2.97400000    7.06100000&lt;br /&gt;
 O(PDBName=OE2,ResName=GLU,ResNum=2)                  -2.69300000    3.62900000    4.93600000&lt;br /&gt;
 H(PDBName=H,ResName=GLU,ResNum=2)                    -2.02600000    0.00000000    1.38100000&lt;br /&gt;
 H(PDBName=HA,ResName=GLU,ResNum=2)                   -0.50300000   -0.28100000    3.98400000&lt;br /&gt;
 H(PDBName=HB2,ResName=GLU,ResNum=2)                  -2.86200000    1.65300000    3.41200000&lt;br /&gt;
 H(PDBName=HB3,ResName=GLU,ResNum=2)                  -1.13300000    2.14600000    3.66500000&lt;br /&gt;
 H(PDBName=HG2,ResName=GLU,ResNum=2)                  -1.17000000    1.19100000    5.96100000&lt;br /&gt;
 H(PDBName=HG3,ResName=GLU,ResNum=2)                  -2.94600000    0.68600000    5.70200000&lt;br /&gt;
 N(PDBName=N,ResName=ASP,ResNum=3)                    -2.15400000   -1.72300000    4.94400000&lt;br /&gt;
 C(PDBName=CA,ResName=ASP,ResNum=3)                   -3.04800000   -2.74000000    5.49100000&lt;br /&gt;
 C(PDBName=C,ResName=ASP,ResNum=3)                    -3.12000000   -2.66400000    6.99800000&lt;br /&gt;
 O(PDBName=O,ResName=ASP,ResNum=3)                    -2.12800000   -2.53500000    7.69600000&lt;br /&gt;
 C(PDBName=CB,ResName=ASP,ResNum=3)                   -2.52700000   -4.13900000    5.11500000&lt;br /&gt;
 C(PDBName=CG,ResName=ASP,ResNum=3)                   -3.47700000   -5.15800000    5.69800000&lt;br /&gt;
 O(PDBName=OD1,ResName=ASP,ResNum=3)                  -3.21300000   -6.36200000    5.49300000&lt;br /&gt;
 O(PDBName=OD2,ResName=ASP,ResNum=3)                  -4.45500000   -4.71900000    6.34100000&lt;br /&gt;
 H(PDBName=H,ResName=ASP,ResNum=3)                    -1.24000000   -1.54800000    5.36200000&lt;br /&gt;
 H(PDBName=HA,ResName=ASP,ResNum=3)                   -4.07200000   -2.58600000    5.09200000&lt;br /&gt;
 H(PDBName=HB2,ResName=ASP,ResNum=3)                  -1.53000000   -4.29800000    5.57700000&lt;br /&gt;
 H(PDBName=HB3,ResName=ASP,ResNum=3)                  -2.54000000   -4.25300000    4.01100000&lt;br /&gt;
 N(PDBName=N,ResName=NH2,ResNum=4)                    -4.37500000   -2.75200000    7.46000000&lt;br /&gt;
 H(PDBName=H1,ResName=NH2,ResNum=4)                   -5.14100000   -2.85800000    6.79500000&lt;br /&gt;
 H(PDBName=H2,ResName=NH2,ResNum=4)                   -4.53500000   -2.71200000    8.46700000&lt;br /&gt;
 &lt;br /&gt;
 1 2 1.0 4 1.0 5 1.0 6 1.0&lt;br /&gt;
 2 3 2.0 7 1.5&lt;br /&gt;
 3&lt;br /&gt;
 4&lt;br /&gt;
 5&lt;br /&gt;
 6&lt;br /&gt;
 7 16 1.0 8 1.0&lt;br /&gt;
 8 9 1.0 11 1.0 17 1.0&lt;br /&gt;
 9 10 2.0 22 1.5&lt;br /&gt;
 10&lt;br /&gt;
 11 18 1.0 12 1.0 19 1.0&lt;br /&gt;
 12 21 1.0 13 1.0 20 1.0&lt;br /&gt;
 13 14 2.0 15 2.0&lt;br /&gt;
 14&lt;br /&gt;
 15&lt;br /&gt;
 16&lt;br /&gt;
 17&lt;br /&gt;
 18&lt;br /&gt;
 19&lt;br /&gt;
 20&lt;br /&gt;
 21&lt;br /&gt;
 22 23 1.0 30 1.0&lt;br /&gt;
 23 26 1.0 24 1.0 31 1.0&lt;br /&gt;
 24 34 1.5 25 2.0&lt;br /&gt;
 25&lt;br /&gt;
 26 27 1.0 33 1.0 32 1.0&lt;br /&gt;
 27 28 2.0 29 2.0&lt;br /&gt;
 28&lt;br /&gt;
 29&lt;br /&gt;
 30&lt;br /&gt;
 31&lt;br /&gt;
 32&lt;br /&gt;
 33&lt;br /&gt;
 34 35 1.0 36 1.0&lt;br /&gt;
 35&lt;br /&gt;
 36&lt;br /&gt;
&lt;br /&gt;
The issue going from .xyz straight to .com is that ALL protein information would be lost ie. residue information that is written into the .pdb at .xyz to .pdb comversion.&lt;br /&gt;
&lt;br /&gt;
===PYP files as downloaded compared to run through TINKER===&lt;br /&gt;
&lt;br /&gt;
*TINKER removes much of the header information&lt;br /&gt;
*TINKER only preserves the 1st model of a .pdb file (may be multiple models)&lt;br /&gt;
*TINKER removes far right 3 columns of information (not essential to the .pdb file)&lt;br /&gt;
*A virtual residue in the original .pdb comprising the chromophore was omitted from the TINKER output (number 126)&lt;br /&gt;
&lt;br /&gt;
===What each TINKER search algorithm does===&lt;br /&gt;
&lt;br /&gt;
There are several programs within TINKER that may be able to help locate a global minimum (not just one near the published crystal structure) by way of searching the potential energy surface. All of these are MM methods for use in TINKER.&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
&#039;&#039;&#039;SCAN&#039;&#039;&#039;&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
A program for general conformational search of an entire potential energy surface via a basin hopping method. The program takes as input a TINKER .xyz coordinates file which is then minimized to find the first local minimum for a search list. A series of activations along various normal modes from this initial minimum are used as seed points for additional minimizations. Whenever a previously unknown local minimum is located it is added to the search list. When all minima on the search list have been subjected to the normal mode activation without locating additional new minima, the program terminates. The individual local minima are written to cycle files as they are discovered. While the SCAN program can be used on standard undeformed potential energy surfaces, we have found it to be most useful for quickly ‘‘scanning’’ a smoothed energy surface to enumerate the major basins of attraction spaning the entire surface.&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;SNIFFER&#039;&#039;&#039;&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
A program that implements the Sniffer global optimization algorithm of Butler and Slaminka, a discrete version of Griewank’s global search trajectory method. The program takes an input TINKER .xyz coordinates file and shakes it vigorously via a modified dynamics trajectory before, hopefully, settling into a low lying minimum. Some trial and error is often required as the current implementation is sensitive to various parameters and tolerances that govern the computation. At present, these parameters are not user accessible, and must be altered in the source code. However, this method can do a good job of quickly optimizing conformation within a limited range of convergence.&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;PSS&#039;&#039;&#039;&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
Implements our version of a potential smoothing and search algorithm for the global optimization of molecular conformation. An initial structure in .xyz format is first minimized in Cartesian coordinates on a series of increasingly smoothed potential energy surfaces. Then the smoothing procedure is reversed with minimization on each successive surface starting from the coordinates of the minimum on the previous surface. A local search procedure is used during the backtracking to explore for alternative minima better than the one found during the current minimization. The final result is usually a very low energy conformation or, in favorable cases, the global energy minimum conformation. The minimum energy coordinate sets found on each surface during both the forward smoothing and backtracking procedures are placed in sequentially numbered cycle files.&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
&lt;br /&gt;
TINKER also contains programs that are able to perform MM optimisations:&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
&#039;&#039;&#039;MINIMIZE&#039;&#039;&#039;&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;The MINIMIZE program performs a limited memory L-BFGS minimization of an input structure over Cartesian coordinates using a modified version of the algorithm of Jorge Nocedal. The method requires only the potential energy and gradient at each step along the minimization pathway. It requires storage space proportional to the number of atoms in the structure. The MINIMIZE procedure is recommended for preliminary minimization of trial structures to an rms gradient of 1.0 to 0.1 kcal/mole/Å. It has a relatively fast cycle time and is tolerant of poor initial structures, but converges in a slow, linear fashion near the minimum. The user supplies the name of the TINKER .xyz coordinates file and a target rms gradient value at which the minimization will terminate. Output consists of minimization statistics written to the screen or redirected to an output file, and the new coordinates written to updated .xyz files or to cycle files.&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;OPTIMIZE&#039;&#039;&#039;&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;The OPTIMIZE program performs a optimally conditioned variable metric minimization of an input structure over Cartesian coordinates using an algorithm due to William Davidon. The method does not perform line searches, but requires computation of energies and gradients as well as storage for an estimate of the inverse Hessian matrix. The program operates on Cartesian coordinates from a TINKER .xyz file. OPTIMIZE will typically converge somewhat faster and more completely than MINIMIZE. However, the need to store and manipulate a full inverse Hessian estimate limits its use to structures containing less than a few hundred atoms on workstation class machines. As with the other minimizers, OPTIMIZE needs input coordinates and an rms gradient cutoff criterion. The output coordinates are saved in updated .xyz files or as cycle files.&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;NEWTON&#039;&#039;&#039;&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt; A truncated Newton minimization method which requires potential energy, gradient and Hessian information. This procedure has significant advantages over standard Newton methods, and is able to minimize very large structures completely. Several options are provided with respect to minimization method and preconditioning of the Newton equations. The default options are recommended unless the user is familiar with the math involved. This program operates in Cartesian coordinate space and is fairly tolerant of poor input structures. Typical algorithm iteration times are longer than with nonlinear conjugate gradient or variable metric methods, but many fewer iterations are required for complete minimization. NEWTON is usually the best choice for minimizations to the 0.01 to 0.000001 kcal/mole/Å level of rms gradient convergence. Tests for directions of negative curvature can be removed, allowing NEWTON to be used for optimization to conformational transition state structures (this only works if the starting point is very close to the transition state). Input consists of a TINKER .xyz coordinates file; output is an updated set of minimized coordinates and minimization statistics.&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
&lt;br /&gt;
==Tues 17th Nov 2009==&lt;br /&gt;
&lt;br /&gt;
Comparing 2 .pdb files:&lt;br /&gt;
&lt;br /&gt;
*As downloaded from PDB&lt;br /&gt;
*Converted to .xyz in TINKER then back to .pdb&lt;br /&gt;
&lt;br /&gt;
These files were then cleaned up to compare in a terminal using the &amp;quot;diff&amp;quot; command:&lt;br /&gt;
&lt;br /&gt;
*Header information and end information removed&lt;br /&gt;
*3 columns to the right in PDB version were removed to become consistent with the TINEKR file&lt;br /&gt;
*The first column &amp;quot;ATOM&amp;quot; was removed effectively making the line numbers equal to the atom numbers in both files&lt;br /&gt;
&lt;br /&gt;
==Wed 18th Nov 2009==&lt;br /&gt;
&lt;br /&gt;
1st diff was on line 52, a HYDROGEN ATOM:&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
&#039;&#039;52  HE2 HIS A   3    13.349 -10.803   9.527&#039;&#039;&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
added to the TINKER output, not in PDB file.&lt;br /&gt;
&lt;br /&gt;
2nd diff was on line 657, a HYDROGEN ATOM:&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
&#039;&#039;657  HE2 GLU A  46	 -0.867  -5.216  -0.822&#039;&#039;&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
added to the PDB file, not in TINKER output&lt;br /&gt;
&lt;br /&gt;
3rd diff was on line 1007, a HYDROGEN ATOM (number 1008):&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
&#039;&#039;5HG  CYS A  69	 -8.909  -5.194   5.526&#039;&#039;&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
added to the TINKER output, not in PDB file.&lt;br /&gt;
&lt;br /&gt;
4th diff was  on line 1621, a HYDROGEN ATOM (number 1623):&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
&#039;&#039;HE2 HIS A 108	  4.663   7.459   6.879&#039;&#039;&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
added to the TINKER output, not in PDB file.&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Following these reconciliations, both files were exactly the same with 1911 atoms present in each. 3 atoms were removed from the TINKER file and 1 was removed from the PDB file.&lt;br /&gt;
&lt;br /&gt;
*NOTE - loading the original .pdb from PDB to Gaussview then converting to a .gjf results in an error &amp;quot;1 or more PDB atom names has a &#039; or * character which can cause problems for Gaussian&amp;quot; - this is because the PDB file has extra atoms on the end of the file to define a highlighted chromophore... After these were removed in a txt editor the .gjf save went smoothly. Infact after this removal the file will run an AMBEr single point energy calculation fine - cf running when the extra info is in the bottom.&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
&lt;br /&gt;
All differences in files were due to protonation states of atoms... not the AA backbone.&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
&lt;br /&gt;
===TINKER vs PDB===&lt;br /&gt;
[[Image:TINKER PROTON.png]]&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
[[Image:PDB_PROTON.png]]&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
The highlighted fragments represent where the respective file version is protonated and the other is not.&lt;br /&gt;
&lt;br /&gt;
==Fri 20th Nov 2009==&lt;br /&gt;
&lt;br /&gt;
In an attempt to load the 1W7S structure to TINKER and successfully convert it it a .xyz format the following procedure was undertaken:&lt;br /&gt;
&lt;br /&gt;
*&lt;br /&gt;
&lt;br /&gt;
==Mon 23rd Nov 2009==&lt;br /&gt;
&lt;br /&gt;
===Protonation of GFP at various pH conditions===&lt;br /&gt;
&lt;br /&gt;
With 230 amino acid residues, each possessing a unique pKa in GFP it is obvious that there is a huge spectrum of protonation states available at a single solvation pH. Since protons cannot be detected using X-ray diffraction, protonation states must be uncovered via a difference method. Several questions arise when thinking about protonation of the protein:&lt;br /&gt;
&lt;br /&gt;
*Which residues have the ability to be protonated/deprotonated&lt;br /&gt;
*Is the PDB structure protonated for a specific pH (Bio pH)&lt;br /&gt;
*How does the protonation of the protein change through a variety of solvation pHs&lt;br /&gt;
*Does the protonation state affect the chromophore photophysics (electronic embedding)&lt;br /&gt;
&lt;br /&gt;
Answering these questions:&lt;br /&gt;
&lt;br /&gt;
*Polar amino acids are those that can be protonated/deprotonated, these are identified by Gaussview and shown in the atom list of the molecule with the option of setting them to either protonated/deprotonated.&lt;br /&gt;
*Dont know if the current structure is protonated to any specific pH... this is unclear until protonate at certain pHs&lt;br /&gt;
*The protonation will be tested across a range of pHs to see how protonation changes&lt;br /&gt;
&lt;br /&gt;
==Tues 24th Nov 2009==&lt;br /&gt;
&lt;br /&gt;
The pKa value of a residue will indicate protonation state as a function of pH. If you know the pH of your solution then if the pKa of a given functional group is lower than this pH then the functional group will be unprotonated. If the pKa is higher than the pH then the functional group will be protonated. &#039;&#039;&#039;Beware that pKa values can be distorted by local conformational effects.&#039;&#039;&#039; The structure is protonated as though it were in pH 7 as it is loaded into Gaussview. Since hydrogens are defined in the PDB file it could be that the author added protons under these conditions.&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Histadine has 2 protonation points on the imidizole R chain:&lt;br /&gt;
&lt;br /&gt;
{|&lt;br /&gt;
| align=&amp;quot;center&amp;quot; style=&amp;quot;background:#f0f0f0;&amp;quot;|&#039;&#039;&#039;Yes-Yes&#039;&#039;&#039;&lt;br /&gt;
| align=&amp;quot;center&amp;quot; style=&amp;quot;background:#f0f0f0;&amp;quot;|&#039;&#039;&#039;Yes-No&#039;&#039;&#039;&lt;br /&gt;
| align=&amp;quot;center&amp;quot; style=&amp;quot;background:#f0f0f0;&amp;quot;|&#039;&#039;&#039;No-Yes&#039;&#039;&#039;&lt;br /&gt;
|-&lt;br /&gt;
|&amp;lt;jmol&amp;gt;&lt;br /&gt;
  &amp;lt;jmolApplet&amp;gt;&lt;br /&gt;
    &amp;lt;uploadedFileContents&amp;gt;Protein_yes_yes.mol&amp;lt;/uploadedFileContents&amp;gt;&lt;br /&gt;
    &amp;lt;color&amp;gt;#FFFFFF&amp;lt;/color&amp;gt;&lt;br /&gt;
    &amp;lt;size&amp;gt;400&amp;lt;/size&amp;gt;&lt;br /&gt;
    &amp;lt;text&amp;gt;Yes Yes&amp;lt;/text&amp;gt;&lt;br /&gt;
  &amp;lt;/jmolApplet&amp;gt;&lt;br /&gt;
&amp;lt;/jmol&amp;gt;||&amp;lt;jmol&amp;gt;&lt;br /&gt;
  &amp;lt;jmolApplet&amp;gt;&lt;br /&gt;
    &amp;lt;uploadedFileContents&amp;gt;Protein_yes_no.mol&amp;lt;/uploadedFileContents&amp;gt;&lt;br /&gt;
    &amp;lt;color&amp;gt;#FFFFFF&amp;lt;/color&amp;gt;&lt;br /&gt;
    &amp;lt;size&amp;gt;400&amp;lt;/size&amp;gt;&lt;br /&gt;
    &amp;lt;text&amp;gt;Yes Yes&amp;lt;/text&amp;gt;&lt;br /&gt;
  &amp;lt;/jmolApplet&amp;gt;&lt;br /&gt;
&amp;lt;/jmol&amp;gt;||&amp;lt;jmol&amp;gt;&lt;br /&gt;
  &amp;lt;jmolApplet&amp;gt;&lt;br /&gt;
    &amp;lt;uploadedFileContents&amp;gt;Protein_no_yes.mol&amp;lt;/uploadedFileContents&amp;gt;&lt;br /&gt;
    &amp;lt;color&amp;gt;#FFFFFF&amp;lt;/color&amp;gt;&lt;br /&gt;
    &amp;lt;size&amp;gt;400&amp;lt;/size&amp;gt;&lt;br /&gt;
    &amp;lt;text&amp;gt;Yes Yes&amp;lt;/text&amp;gt;&lt;br /&gt;
  &amp;lt;/jmolApplet&amp;gt;&lt;br /&gt;
&amp;lt;/jmol&amp;gt;&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
The imidazole sidechain of histidine has a pKa of approximately 6, and overall, the amino acid has a pKa of 7.6. This means that at physiologically relevant pH values, relatively small shifts in pH will change its average charge. Below a pH of 6, the imidazole ring is mostly protonated as described by the Henderson–Hasselbalch equation. When protonated, the imidazole ring bears two NH bonds and has a positive charge. The positive charge is equally distributed between both nitrogens and can be represented with two equally important resonance structures.&lt;br /&gt;
&lt;br /&gt;
===Protonation===&lt;br /&gt;
&lt;br /&gt;
At pH 7 the PDB file protonations are correct for all residues. Moving to pH 5 the histadine residues become protonated (pKa 6)- this is the only change. At more alkaline pHs it would take pH 9.1 to make any effect (deprotonating CYS and then TYR at 9.7)&lt;br /&gt;
&lt;br /&gt;
===Mini-Biomolecule===&lt;br /&gt;
&lt;br /&gt;
To test ONIOM calculations vs pure DFT methods and other calculation types on a bio system that is smaller than GFP (computationally less expensive, can streamline the real calcs)&lt;br /&gt;
&lt;br /&gt;
Trying a molecule analogous to the example given in the ONIOM tutorial...&lt;br /&gt;
&lt;br /&gt;
&amp;lt;jmol&amp;gt;&lt;br /&gt;
  &amp;lt;jmolApplet&amp;gt;&lt;br /&gt;
    &amp;lt;uploadedFileContents&amp;gt;Biomolecule1.mol&amp;lt;/uploadedFileContents&amp;gt;&lt;br /&gt;
    &amp;lt;color&amp;gt;#FFFFFF&amp;lt;/color&amp;gt;&lt;br /&gt;
    &amp;lt;size&amp;gt;400&amp;lt;/size&amp;gt;&lt;br /&gt;
    &amp;lt;text&amp;gt;Yes Yes&amp;lt;/text&amp;gt;&lt;br /&gt;
  &amp;lt;/jmolApplet&amp;gt;&lt;br /&gt;
&amp;lt;/jmol&amp;gt;&lt;br /&gt;
&lt;br /&gt;
==Thurs 26th Nov 2009==&lt;br /&gt;
&lt;br /&gt;
===Checking the GFP input will run===&lt;br /&gt;
&lt;br /&gt;
Procedure to get 1WS7 to run an AMBER energy calculation:&lt;br /&gt;
&lt;br /&gt;
*Load the PDB file without water&lt;br /&gt;
*Try opt amber but will fail due to missing parameters (all in the chromophore)&lt;br /&gt;
*Set these parameters to 0 since they cancel in ONIOM&lt;br /&gt;
*Freeze the atotms involving missing paramters&lt;br /&gt;
*Energy AMBER will now run for the file - NO CHARGES - 9.07881632au&lt;br /&gt;
*Open the Atom Editor, Edit, MM charges, Assign all atoms&lt;br /&gt;
*Manually edit any missing charges&lt;br /&gt;
*Energy AMBER will now run including charges (basic AMBER94 charges) - 0.260391165au&lt;br /&gt;
&lt;br /&gt;
Charges in this file did not sum to an integer number - the only residue with a non integer charge was no.64 with a charge of 0.61160000&lt;br /&gt;
&lt;br /&gt;
 MM sanity checks:&lt;br /&gt;
 All charges sum to:               -5.38840000&lt;br /&gt;
 Charges of atoms sum to:          -5.38840000&lt;br /&gt;
 Layer 1 (S) has     34 atoms and charge   0.61160000 sum=   0.61160000&lt;br /&gt;
 Layer 2 (R) adds  3574 atoms and charge  -6.00000000 sum=  -5.38840000&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
 WARNING -- The following centers have non-zero partial&lt;br /&gt;
 charges, but zero Vanderwaals parameters. These centers&lt;br /&gt;
 can collapse onto other centers with non-zero charge&lt;br /&gt;
 Center    Charge&lt;br /&gt;
   1833  0.423900&lt;br /&gt;
   1890  0.410200&lt;br /&gt;
   2029  0.427500&lt;br /&gt;
   2043  0.427500&lt;br /&gt;
   2080  0.410200&lt;br /&gt;
   2089  0.399200&lt;br /&gt;
   2123  0.410200&lt;br /&gt;
   2179  0.410200&lt;br /&gt;
   2186  0.410200&lt;br /&gt;
   2260  0.410200&lt;br /&gt;
   2287  0.410200&lt;br /&gt;
   2294  0.410200&lt;br /&gt;
   2347  0.427500&lt;br /&gt;
   2369  0.399200&lt;br /&gt;
   2469  0.427500&lt;br /&gt;
   2508  0.399200&lt;br /&gt;
   2551  0.410200&lt;br /&gt;
   2619  0.399200&lt;br /&gt;
   2639  0.410200&lt;br /&gt;
   2714  0.410200&lt;br /&gt;
   2924  0.399200&lt;br /&gt;
   2939  0.399200&lt;br /&gt;
   2950  0.427500&lt;br /&gt;
   2981  0.399200&lt;br /&gt;
   3180  0.427500&lt;br /&gt;
   3233  0.399200&lt;br /&gt;
   3262  0.410200&lt;br /&gt;
   3361  0.399200&lt;br /&gt;
   3377  0.427500&lt;br /&gt;
   3384  0.410200&lt;br /&gt;
   3397  0.427500&lt;br /&gt;
   3418  0.427500&lt;br /&gt;
   3562  0.410200&lt;br /&gt;
   3593  0.410200&lt;br /&gt;
   3608  0.410200&lt;br /&gt;
&lt;br /&gt;
* oniom(pm6:amber=hardfirst) geom=connectivity was run successfully, resulting in an energy of 0.210085730788au&lt;br /&gt;
&lt;br /&gt;
This still resulted in some charge problems:&lt;br /&gt;
&lt;br /&gt;
 MM sanity checks:&lt;br /&gt;
 All charges sum to:               &#039;&#039;&#039;-5.38840000&#039;&#039;&#039;&lt;br /&gt;
 Charges of atoms sum to:           &#039;&#039;&#039;0.79320000&#039;&#039;&#039;&lt;br /&gt;
 Residue  63 PDB Number    &#039;&#039;&#039;64_A PHE has&#039;&#039;&#039;     1 atoms, and a charge of   0.59730000&lt;br /&gt;
 Residue  64 PDB Number    &#039;&#039;&#039;66_A CSY has&#039;&#039;&#039;    34 atoms, and a charge of   0.61160000&lt;br /&gt;
 Residue  65 PDB Number    &#039;&#039;&#039;68_A VAL has&#039;&#039;&#039;     1 atoms, and a charge of  -0.41570000&lt;br /&gt;
 WARNING -- The following centers have non-zero partial&lt;br /&gt;
 charges, but zero Vanderwaals parameters. These centers&lt;br /&gt;
 can collapse onto other centers with non-zero charge&lt;br /&gt;
 Center    Charge&lt;br /&gt;
   &#039;&#039;&#039;3608  0.410200&#039;&#039;&#039;&lt;br /&gt;
 MMInit generated parameter data with length LenPar=       40470.&lt;br /&gt;
 Standard basis: Dummy (5D, 7F)&lt;br /&gt;
 There are    36 symmetry adapted basis functions of A   symmetry.&lt;br /&gt;
 Integral buffers will be    262144 words long.&lt;br /&gt;
 Raffenetti 1 integral format.&lt;br /&gt;
 Two-electron integral symmetry is turned on.&lt;br /&gt;
    36 basis functions,    36 primitive gaussians,    36 cartesian basis functions&lt;br /&gt;
    27 alpha electrons       27 beta electrons&lt;br /&gt;
       nuclear repulsion energy       200.9124124811 Hartrees.&lt;br /&gt;
 NAtoms= 3608 NActive=   36 NUniq=   36 SFac= 7.50D-01 NAtFMM=   80 NAOKFM=F Big=F&lt;br /&gt;
 AMBER calculation of energy.&lt;br /&gt;
 Energy=    0.110474559     NIter=   0.&lt;br /&gt;
 Dipole moment=       3.390901       5.889448      -2.505476&lt;br /&gt;
 ONIOM: saving gridpoint  1&lt;br /&gt;
 ONIOM: restoring gridpoint  3&lt;br /&gt;
 ONIOM: calculating energy.&lt;br /&gt;
 ONIOM: gridpoint  1 method:  low   system:  model energy:     0.110474559177&lt;br /&gt;
 ONIOM: gridpoint  2 method:  high  system:  model energy:     0.060169124949&lt;br /&gt;
 ONIOM: gridpoint  3 method:  low   system:  real  energy:     0.260391165016&lt;br /&gt;
 ONIOM: extrapolated energy =       0.210085730788&lt;br /&gt;
 ONIOM: calculating electric field derivatives.&lt;br /&gt;
 ONIOM: Integrating ONIOM file  4 number   619&lt;br /&gt;
 ONIOM: Dipole        =-2.76174164D+01-1.34762322D+02 3.99239712D+01&lt;br /&gt;
 ONIOM: Dipole moment (Debye):&lt;br /&gt;
    X=   -70.1965    Y=  -342.5316    Z=   101.4766  Tot=   364.0782&lt;br /&gt;
 ONIOM: Integrating ONIOM file  5 number   695&lt;br /&gt;
&lt;br /&gt;
An ONIOM calculation was run using a DFT method on the high level and AMBER on the low level:  &#039;&#039;oniom(b3lyp/3-21g:amber=hardfirst) geom=connectivity&#039;&#039; which ran successfully...&lt;br /&gt;
&lt;br /&gt;
===Charges===&lt;br /&gt;
&lt;br /&gt;
Charges in the protein are important for the calculation that will be run on GFP eventually since they provide the electrostatic field or &amp;quot;atmosphere&amp;quot; which surrounds the chromophore, polarizing the electron density used by the DFT method to evaluate the chromophore. Gaussian offers 2 types of influence which the MM portion can exert on the QM part:&lt;br /&gt;
&lt;br /&gt;
* Mechanical Embedding&lt;br /&gt;
* Electronic Embedding&lt;br /&gt;
&lt;br /&gt;
Mechanical embedding does not use any electronic components to affect the central high region. Electronic embedding does use the charge generated in the MM region to polarize the high region.&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
At pH 7 the GFP molecule is in a 6- charge state according to assigning AMBER charges and summing the residue charges (available in the out file). 3 options were considered at this stage for treating charges:&lt;br /&gt;
&lt;br /&gt;
* Ignore charges on the entire molecule&lt;br /&gt;
* Use AMBER94 assigned partial charges on the low region&lt;br /&gt;
* Use Qeq to assign charges based on an overall charge of -6&lt;br /&gt;
&lt;br /&gt;
The first option can be discarded since the electronic atmosphere is of particular interest here. The second 2 options are to be investigated. See below (on a small Biomolecule)&lt;br /&gt;
&lt;br /&gt;
ON a 4 residue chain (in mini biomolecule folder) - As in the GFP tutorial on the wiki, a energy calculation was submitted with the keyword amber=qeq to get back the charges across the molecule. (not entirely sure what this does yet). AS a test of the effect of charges on a molecule a test molecule was compiled in TINKER consisting of 4 residues with 0 overall charges....&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;jmol&amp;gt;&lt;br /&gt;
  &amp;lt;jmolAppletButton&amp;gt;&lt;br /&gt;
    &amp;lt;uploadedFileContents&amp;gt;Protein1_AMBER_OPT_qeq.mol&amp;lt;/uploadedFileContents&amp;gt;&lt;br /&gt;
    &amp;lt;text&amp;gt;Test Protien&amp;lt;/text&amp;gt;&lt;br /&gt;
    &amp;lt;color&amp;gt;#FFFFFF&amp;lt;/color&amp;gt;&lt;br /&gt;
    &amp;lt;size&amp;gt;600&amp;lt;/size&amp;gt;&lt;br /&gt;
    &amp;lt;/jmolAppletButton&amp;gt;&lt;br /&gt;
&amp;lt;/jmol&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
&lt;br /&gt;
{|&lt;br /&gt;
|No Charges|| 0.01940311au&lt;br /&gt;
|-&lt;br /&gt;
|Amber Charges|| -0.18117151au&lt;br /&gt;
|-&lt;br /&gt;
|Qeq Charges|| -0.23038199au&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
Clearly the Qeq shows the lowest energy - the considerable differences in energy show that charges are an important part of the input.&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
&lt;br /&gt;
===Testing Charges and EE on a small system===&lt;br /&gt;
&lt;br /&gt;
&amp;lt;jmol&amp;gt;&lt;br /&gt;
  &amp;lt;jmolAppletButton&amp;gt;&lt;br /&gt;
    &amp;lt;uploadedFileContents&amp;gt;Chromopre_molecule_B3LYP_6-31G(d).mol&amp;lt;/uploadedFileContents&amp;gt;&lt;br /&gt;
    &amp;lt;script&amp;gt;spin on&amp;lt;/script&amp;gt;&lt;br /&gt;
    &amp;lt;text&amp;gt;Test Protien&amp;lt;/text&amp;gt;&lt;br /&gt;
    &amp;lt;color&amp;gt;#FFFFFF&amp;lt;/color&amp;gt;&lt;br /&gt;
    &amp;lt;size&amp;gt;600&amp;lt;/size&amp;gt;&lt;br /&gt;
    &amp;lt;/jmolAppletButton&amp;gt;&lt;br /&gt;
&amp;lt;/jmol&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Energies of the optimisations using various charge approximations and with/without EE&lt;br /&gt;
{|class=&amp;quot;wikitable&amp;quot; border=&amp;quot;1&amp;quot;&lt;br /&gt;
!Method||Mechanical Embedding||Electronic Embedding||Computational Time (mechanical embedding)||Computational Time (electronic embedding)&lt;br /&gt;
|-&lt;br /&gt;
|B3LYP 6-31G*||-1406.04||N/A||11 hours 18 minutes 44.7 seconds||11 hours 18 minutes 44.7 seconds&lt;br /&gt;
|-&lt;br /&gt;
|ONIOM B3LYP 6-31G*:AMBER (No Charges)||-783.2225||-783.2223||2 hours 13 minutes  4.1 seconds||&lt;br /&gt;
|-&lt;br /&gt;
|ONIOM B3LYP 6-31G*:AMBER (Amber Charges)|| || ||  ||&lt;br /&gt;
|-&lt;br /&gt;
|ONIOM B3LYP 6-31G*:AMBER (Qeq Charges)||-783.2467|| ||2 hours  6 minutes  8.6 seconds ||&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
NB. &lt;br /&gt;
* B3LYP 6-31G* and ONIOM B3LYP 6-31G*:AMBER (No Charges) were from the same starting geometry.&lt;br /&gt;
* ONIOM B3LYP 6-31G*:AMBER (Amber Charges) and ONIOM B3LYP 6-31G*:AMBER (Qeq Charges) were from the same starting geometry.&lt;br /&gt;
* All Electronic embedding is from the same starting point - Can compare times.&lt;br /&gt;
&lt;br /&gt;
Dipole moments using various charge approximations and with/without EE&lt;br /&gt;
{|class=&amp;quot;wikitable&amp;quot; border=&amp;quot;1&amp;quot;&lt;br /&gt;
!Method||Dipole moment (No EE)||Dipole moment (With EE)&lt;br /&gt;
|-&lt;br /&gt;
|B3LYP 6-31G*||7.4914||N/A&lt;br /&gt;
|-&lt;br /&gt;
|ONIOM B3LYP 6-31G*:AMBER (No Charges)||3.2328||3.2241&lt;br /&gt;
|-&lt;br /&gt;
|ONIOM B3LYP 6-31G*:AMBER (Amber Charges)||5.2794||5.2865&lt;br /&gt;
|-&lt;br /&gt;
|ONIOM B3LYP 6-31G*:AMBER (Qeq Charges)||8.173||8.1654&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
==2nd Dec 2009==&lt;br /&gt;
&lt;br /&gt;
A single point energy of 1w7s was run with the additional qeq keyword to return partial charges (overall charge set to -6)&lt;br /&gt;
This new molecule was submitted for AMBER opt but failed after around 700 iterations... this could be due to the error:&lt;br /&gt;
&lt;br /&gt;
 WARNING -- The following centers have non-zero partial&lt;br /&gt;
 charges, but zero Vanderwaals parameters. These centers&lt;br /&gt;
 can collapse onto other centers with non-zero charge&lt;br /&gt;
&lt;br /&gt;
When the atoms defined in this error had their partial charges set to 0.00, the result was a fully converged AMBER optimisation:&lt;br /&gt;
&lt;br /&gt;
*Frozen chromophore.&lt;br /&gt;
*pH 7 giving a charge of -6&lt;br /&gt;
*Charges assigned using Qeq method&lt;br /&gt;
*Zero Van der Waals parameter atoms partial charges set to 0.00&lt;br /&gt;
&lt;br /&gt;
Looking at what atoms are not defined in terms of Van der Waals paramters it is ALWAYS an aloholic proton that is not parameterised...&lt;br /&gt;
&lt;br /&gt;
Tutorial used AMBER charges then ran Qeq for the missing charges... i think this is what she thought she did... gaussian &amp;quot;amber=qeq&amp;quot; will overwrite all charges.. need to use &amp;quot;amber=uncharged&amp;quot;&lt;br /&gt;
so that we dont overwrite the amber charges...&lt;br /&gt;
&lt;br /&gt;
In effect full Qeq charges were assigned in the tutorial. Also overall charge is 0 - will check this.&lt;br /&gt;
&lt;br /&gt;
===The GFP tutorial===&lt;br /&gt;
&lt;br /&gt;
*AMBER charges (AMBER94 charges) were assigned to the GFP molecule in the tutorial&lt;br /&gt;
*she notices correctly that some are missing in the chromophore and so decides to assign them via Qeq&lt;br /&gt;
*Runs Qeq on the GFP molecule with a &#039;&#039;&#039;0 charge&#039;&#039;&#039; which is wrong! When run the AMBER charges and look at the log file we can see that the overall charge should be &#039;&#039;&#039;+4&#039;&#039;&#039; for that protonation state.&lt;br /&gt;
*Since Qeq is an equilibrium method, partial charges are solved with the constraint that their sum must equal the overall charge.&lt;br /&gt;
*These Qeq charges should therefore not fit with an integer sum when pasted into the AMBER charges.&lt;br /&gt;
&lt;br /&gt;
===Qeq versus AMBER94 charges===&lt;br /&gt;
&lt;br /&gt;
Ogawa et al. state:&lt;br /&gt;
&lt;br /&gt;
 Geometry optimizations of a crambin protein were performed using force fields combined with the&lt;br /&gt;
 CQEq. The CQEq charges were found to be more comparable to the Mulliken charges obtained by the ab initio molecular orbital&lt;br /&gt;
 calculation than the fixed charges used in the AMBER force field. Furthermore, the CQEq charges give more realistic charge redistribution&lt;br /&gt;
 between amino acids in the crambin.&lt;br /&gt;
&lt;br /&gt;
Mini Biomolecule calculations:&lt;br /&gt;
&lt;br /&gt;
{|class=&amp;quot;wikitable&amp;quot; border=&amp;quot;1&amp;quot;&lt;br /&gt;
!Method||Dipole moment (No EE)||Dipole moment (With EE)&lt;br /&gt;
|-&lt;br /&gt;
|B3LYP 6-31G*||7.4914||N/A&lt;br /&gt;
|-&lt;br /&gt;
|ONIOM B3LYP 6-31G*:AMBER (No Charges)||3.2328||3.2241&lt;br /&gt;
|-&lt;br /&gt;
|ONIOM B3LYP 6-31G*:AMBER (Amber Charges)||5.2794||5.2865&lt;br /&gt;
|-&lt;br /&gt;
|ONIOM B3LYP 6-31G*:AMBER (Qeq Charges)||8.173||8.1654&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
Clearly the Qeq charges are in better agreement with the DFT method than no or AMBER charges.&lt;br /&gt;
&lt;br /&gt;
These frequencies are calulated using mechanical embedding!&lt;br /&gt;
&lt;br /&gt;
{|class=&amp;quot;wikitable&amp;quot; border=&amp;quot;1&amp;quot;&lt;br /&gt;
!Method||C=O||C=C&lt;br /&gt;
|-&lt;br /&gt;
|B3LYP 6-31G*||1799.65||1701.86&lt;br /&gt;
|-&lt;br /&gt;
|ONIOM B3LYP 6-31G*:AMBER (No Charges)||1802.25||1703.63&lt;br /&gt;
|-&lt;br /&gt;
|ONIOM B3LYP 6-31G*:AMBER (Amber Charges)||1803.37||1701.67&lt;br /&gt;
|-&lt;br /&gt;
|ONIOM B3LYP 6-31G*:AMBER (Qeq Charges)||1800.18||1702.86&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
As a result it seems obvious that a Qeq charge assignment in constraint by the correct total charge gives results most comparable to the pure QM method (both dipoles and frequencies)&lt;br /&gt;
&lt;br /&gt;
==3rd Dec 2009==&lt;br /&gt;
===ESP Charges===&lt;br /&gt;
&lt;br /&gt;
ESP calculates the expectation values of the electrostatic potential of a molecule on a uniform distribution of points. The resultant ESP surface is then fit to atom centered charges that best reproduce the electron distribution in a least-squares method. The set of points defining the default surface is generated according to the algorithm of Connolly.&lt;br /&gt;
&lt;br /&gt;
ESP charges thus require a QM calculation to create the molecular electrostatic potential used to fit atom centered charges. The size of GFP means that this is not an efficient method for use in this case?&lt;br /&gt;
&lt;br /&gt;
===Qeq Charges===&lt;br /&gt;
&lt;br /&gt;
The charge equilibration (QEq) method treats the atomic charge variation through the equilibration of electronegativities at atomic sites. The advantage of the QEq method is that the polarization effect in MM is taken into account by variation of the charge distribution in the molecules. The atomic charge variation relates to the intramolecular geometrical changes.&lt;br /&gt;
&lt;br /&gt;
===AMBER charges===&lt;br /&gt;
&lt;br /&gt;
==8th Dec 2009==&lt;br /&gt;
&lt;br /&gt;
A input creation page is now avaliable:&lt;br /&gt;
&lt;br /&gt;
https://www.ch.ic.ac.uk/wiki/index.php/AndyForesterGFP&lt;br /&gt;
&lt;br /&gt;
The final input file had protein information pasted in using text to columns in excel and now displays PDB residue information and PDB secondary structure information (nice cartoon pictures in ChemBio3D!)&lt;br /&gt;
&lt;br /&gt;
Interesting is a paper that uses TD-DFT on GFP (in my papers folder on the mac), the group uses a 3 stage minimisation method from the 1GFL PDB structure:&lt;br /&gt;
&lt;br /&gt;
* Freeze all atoms in the backbone of the protein and optimise after adding hydrogens&lt;br /&gt;
* Optimise the chromophore at a semi-empirical level&lt;br /&gt;
* They cut out a chromophore section but i could then perform a full QM/MM geometry optimisation&lt;br /&gt;
&lt;br /&gt;
===A more charged mini molecule to test electronic embedding===&lt;br /&gt;
&lt;br /&gt;
&amp;lt;jmol&amp;gt;&lt;br /&gt;
  &amp;lt;jmolAppletButton&amp;gt;&lt;br /&gt;
    &amp;lt;uploadedFileContents&amp;gt;More_charge_B3LYP_6_31G(d).mol&amp;lt;/uploadedFileContents&amp;gt;&lt;br /&gt;
    &amp;lt;text&amp;gt;The molecule with 2+ charge&amp;lt;/text&amp;gt;&lt;br /&gt;
    &amp;lt;color&amp;gt;#FFFFFF&amp;lt;/color&amp;gt;&lt;br /&gt;
    &amp;lt;size&amp;gt;600&amp;lt;/size&amp;gt;&lt;br /&gt;
    &amp;lt;/jmolAppletButton&amp;gt;&lt;br /&gt;
&amp;lt;/jmol&amp;gt;&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
&lt;br /&gt;
The molecule was submitted with 3 different job lines:&lt;br /&gt;
&lt;br /&gt;
* b3lyp 6-31g(d)&lt;br /&gt;
* oniom(b3lyp 6-31g(d):amber=(hardfirst,qeq))&lt;br /&gt;
* oniom(b3lyp 6-31g(d):amber=(hardfirst,qeq))=embedcharge&lt;br /&gt;
&lt;br /&gt;
Opt and Freq jobs were completed successfully. The charge breakdown input was:&lt;br /&gt;
&lt;br /&gt;
* Low Real 2 1&lt;br /&gt;
* High Model 0 1&lt;br /&gt;
* Low Model 0 1&lt;br /&gt;
&lt;br /&gt;
The aim of the calculations was to test electronic embedding versus the mechanical embedding. A paper suggests that there is a large difference in accuracy between the two methods. Qeq charge equilibration was used in both cases as it has already been proved to be much more effective than AMBER assigned charges. &#039;&#039;&#039;It has consistently overestimated the dipole moment though&#039;&#039;&#039;.&lt;br /&gt;
&lt;br /&gt;
{|class=&amp;quot;wikitable&amp;quot; border=&amp;quot;1&amp;quot;&lt;br /&gt;
!Method||Dipole moment&lt;br /&gt;
|-&lt;br /&gt;
|B3LYP 6-31G*||8.02&lt;br /&gt;
|-&lt;br /&gt;
|ONIOM B3LYP 6-31G*:AMBER (ME)||12.22&lt;br /&gt;
|-&lt;br /&gt;
|ONIOM B3LYP 6-31G*:AMBER (EE)||12.23&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
These results are odd since we expect the EE to have some effect on the charge distribution in the molecule (it is polarising the QM region) comparing the frequencies as a further check:&lt;br /&gt;
&lt;br /&gt;
{|class=&amp;quot;wikitable&amp;quot; border=&amp;quot;1&amp;quot;&lt;br /&gt;
!Method||C=O||C=C&lt;br /&gt;
|-&lt;br /&gt;
|B3LYP 6-31G*||1764||1636&lt;br /&gt;
|-&lt;br /&gt;
|ONIOM B3LYP 6-31G*:AMBER (ME)||1797||1697&lt;br /&gt;
|-&lt;br /&gt;
|ONIOM B3LYP 6-31G*:AMBER (EE)||1797||1697&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
Again the EE has made no difference to the frequency of various strong vibrations in the molecule. The conclusion is thus either:&lt;br /&gt;
&lt;br /&gt;
* EE has no real effect on the moleucule&lt;br /&gt;
* EE is not working properly&lt;br /&gt;
&lt;br /&gt;
Assuming the latter further investigation discovered that the meechanism of EE is determined using scaling parameters for MM charges during electronic embedding in the QM calculations. As a default the charges are scaled 555500 using the nomenclature ScaleCharge=ijklmn where n is closest to the QM region (massively anti intuitive). The input is read left to right and not all scaling factors need be defined. Further to this, the value given is then multiplied by 0.2 to arrive at the actual scaling coefficient. The result is thus for an atom:&lt;br /&gt;
&lt;br /&gt;
* 1 bond from the QM region (n) = 0.2n           Default is 0&lt;br /&gt;
* 2 bonds from the QM region (m) = 0.2m       Default is 0&lt;br /&gt;
* 3 bonds from the QM region (l) = 0.2l            Default is 5&lt;br /&gt;
* 4 bonds from the QM region (k) = 0.2k          Default is 5&lt;br /&gt;
* 5 bonds from the QM region (j) = 0.2j            Default is 5&lt;br /&gt;
* 6 bonds from the QM region (i) = 0.2i            Default is 5&lt;br /&gt;
&lt;br /&gt;
So all the atoms (and partial charges) in the MM region in the test molecule were being scaled to 0 when performing the EE. This explains the fact that no difference was uncovered in the previous calculation set.&lt;br /&gt;
&lt;br /&gt;
A new calculation was submitted with the command scalecharge=555 so that all charges in the MM part will be scaled to 1 (0.2xn,m,l)&lt;br /&gt;
&lt;br /&gt;
===Charges...again===&lt;br /&gt;
&lt;br /&gt;
So since i deleted the charges on the atoms where they can collapse, the overall charge is now -19.6 from the original -6 minus the 13.6 in charge deleted. REALLY ANNOYING! This is a huge difference in charges so will have to be solved. One easy option would be to set the involved atoms to 0 then run QEQ on the atoms with unassigned charges... essentially distributing the -6 charge through all atoms except those with 0 VdW parameters.&lt;br /&gt;
&lt;br /&gt;
==Tues 15th Dec 2009==&lt;br /&gt;
&lt;br /&gt;
The more charged molecule failed once the charges were scaled to 1 since SCF convergence was not attained.&lt;br /&gt;
&lt;br /&gt;
==Mon 25th Jan 2010==&lt;br /&gt;
&lt;br /&gt;
TAO tool publication was very good read... has a good suggestion of an optimisation procedure:&lt;br /&gt;
&lt;br /&gt;
*Take the crystal structure&lt;br /&gt;
*Add Hydrogens&lt;br /&gt;
*Determine protonation&lt;br /&gt;
*MM geometry optimisation&lt;br /&gt;
*(MD?)&lt;br /&gt;
*(Look at several samples of the MD)&lt;br /&gt;
*Optimise a final structure&lt;br /&gt;
&lt;br /&gt;
==Thurs 28th Jan 2010==&lt;br /&gt;
&lt;br /&gt;
Tao Tools requires preprocess within AMBER tools to run the scripts correctly. Does not have HIS on the residue database... Amber defines the protonation state within its residue naming scheme... HIS is either HIE, HIP, HID depending on its protonation state. [http://www.poissonboltzmann.org/pdb2pqr/documentation/programmers-guide/naming-scheme]&lt;br /&gt;
&lt;br /&gt;
==Thurs 4th Jan 2010==&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;Preprocessing&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
To get TAO tools and also for completeness it is necessary to preprocess the PDB file before attempting to run ONIOM calculations, this includes:&lt;br /&gt;
&lt;br /&gt;
* Adding hydrogens&lt;br /&gt;
* Create a custom residue file for the chromophore&lt;br /&gt;
* Convert from PDB to gjf ONIOM input&lt;br /&gt;
&lt;br /&gt;
Ambertools are extremely helpful with this preprocessing especially since amber will be the low level theory used throughout this project. Within ambertools are a number of useful programs:&lt;br /&gt;
&lt;br /&gt;
* antechamber - has a parameterlookup function within amber and GAFF parameter library files.&lt;br /&gt;
* xLeap - builds the protein from its constituent residues - adds hydrogens according to library files - allows a parm file to be built to add custom residues.&lt;br /&gt;
&lt;br /&gt;
I will be working on using these 2 programs to create a full MM input file which is fully defined and correct.&lt;br /&gt;
&lt;br /&gt;
===xLeap===&lt;br /&gt;
&lt;br /&gt;
To load xLeap we must use X11 to get the graphical interface and the ssh -Y command to allow tunneling back to X11. Also the command line to start xLeap should be:  &#039;&#039;xleap -s -f /apps/ambertools/amber11/dat/leap/cmd/leaprc.ff03&#039;&#039;. This loads the ff03 parameter file and results in the start up of the graphical interface with:&lt;br /&gt;
&lt;br /&gt;
 Welcome to LEaP!&lt;br /&gt;
 Sourcing leaprc: /amber/dat/leap/cmd/leaprc&lt;br /&gt;
 Log file: ./leap.log&lt;br /&gt;
 Loading parameters: /amber/dat/leap/parm/parmME.dat&lt;br /&gt;
 Loading library: /amber/dat/leap/lib/all_nucleic94.lib&lt;br /&gt;
 Loading library: /amber/dat/leap/lib/all_aminoME.lib&lt;br /&gt;
 Loading library: /amber/dat/leap/lib/all_aminoctME.lib&lt;br /&gt;
 Loading library: /amber/dat/leap/lib/all_aminontME.lib&lt;br /&gt;
 Loading library: /amber/dat/leap/lib/ions94.lib&lt;br /&gt;
 Loading library: /amber/dat/leap/lib/water.lib&lt;br /&gt;
&lt;br /&gt;
This shows that the residue libraries have been loaded. We are then free to load a pdb file which will be read by xLeap and built according to the residues! In addition to the residue names xleap also requires information about the chain connectivity. Residues can be connected to other residues from both sides, only one side or not at all. xleap is fairly simple in that it expects that all residues in the pdb file are connected in the order in which they are listed unless they are separated by a &amp;quot;TER&amp;quot; card. In most cases where a single protein chains is being investigated like here - the TER card should end the residue information.&lt;br /&gt;
&lt;br /&gt;
To load a pdb use (ignore the &amp;lt;&amp;gt;) &#039;&#039;&amp;lt;name for use in xLeap&amp;gt;=loadpdb &amp;quot;&amp;lt;name of file&amp;gt;&amp;quot;&#039;&#039;. Make sure that the file is in your PATH else fully define the path in the filename.&lt;br /&gt;
&lt;br /&gt;
The pdb should then be read into xLeap and built. At the end of the output we get (I loaded ChainA of the 1W7S without any water or hydrogens):&lt;br /&gt;
&lt;br /&gt;
 Leap added 1765 missing atoms according to residue templates:&lt;br /&gt;
       1 Heavy&lt;br /&gt;
       1764 H / lone pairs&lt;br /&gt;
 The file contained 21 atoms not in residue templates&lt;br /&gt;
&lt;br /&gt;
This shows that Leap added hydrogens automatically and also a missing &amp;quot;heavy&amp;quot; atom (one which is not a H or lp):&lt;br /&gt;
&lt;br /&gt;
 Added missing heavy atom: .R&amp;lt;CHIE 229&amp;gt;.A&amp;lt;OXT 18&amp;gt;&lt;br /&gt;
&lt;br /&gt;
For amino acids, the traditional order of atom records within a residue is N, CA, C, O, followed by the sidechain atoms (CB, CG1, CG2 . . . ) in order first of increasing remoteness, and then branch. The extra oxygen at the carboxyl terminal is called &amp;quot; OXT&amp;quot;, and appears after all sidechain atoms.&lt;br /&gt;
&lt;br /&gt;
Now that we have established that there are 21 atoms that are in the non standard residue region we need to define a custom residue which describes this region (chromophore). This involves creating a prep file....&lt;br /&gt;
&lt;br /&gt;
I first drew up the undefined section of GFP (chromophore) in Gaussview5 and saved as a.pdb then:&lt;br /&gt;
&lt;br /&gt;
* Load in antechamber and convert to mol2 format using &#039;&#039;antechamber -i &amp;lt;input&amp;gt; -fi pdb -o &amp;lt;output&amp;gt; -fo mol2 -c bcc&#039;&#039;&lt;br /&gt;
* Use parmchk to make sure the molecule was defined in the GAFF force field - kicks out a .frcmod file with missing parameters (including dihedrals) - nice :D!&lt;br /&gt;
* Manually add the &amp;quot;1CSY&amp;quot; where a standalone &amp;quot;0&amp;quot; is in the mol2 file so Leap can read it&lt;br /&gt;
* Load the gaff parameter file to Leap using &#039;&#039;source leaprc.gaff&#039;&#039;&lt;br /&gt;
* Load the chromophore_CSY_full.mol2 to Leap using &#039;&#039;CSY=loadmol2 &amp;quot;ChromophoreGFP_CSY_full.mol2&amp;quot;&#039;&#039;&lt;br /&gt;
* Check the residue using &#039;&#039;check CSY&#039;&#039; - Notice missing parameters are there.&lt;br /&gt;
* Load the ChromophoreGFP_CSY_full.frcmod file created by parmchk using &#039;&#039;loadamberparams ChromophoreGFP_CSY_full.frcmod&#039;&#039;&lt;br /&gt;
* Now check the CSY residue again using &#039;&#039;check CSY&#039;&#039; - No Problems! woo hoo! Now I can create amber parameters :D&lt;br /&gt;
&lt;br /&gt;
==Tues 9th Feb 2010==&lt;br /&gt;
&lt;br /&gt;
===RESP Fitting===&lt;br /&gt;
&lt;br /&gt;
To get charges that are like those of the standard amber force field an RESP fitting method has to be used. This involves 3 steps:&lt;br /&gt;
&lt;br /&gt;
* First, the molecule studied is optimized to determine a stable minimum (using Gaussian).&lt;br /&gt;
* Then, this minimized structure is used to calculate a Molecular Electrostatic Potential (MEP) on a three-dimensional grid (using Gaussian).&lt;br /&gt;
* Finally, this grid is exported into the &amp;quot;RESP program&amp;quot; which is used to fit atom-centered charges to the MEP&lt;br /&gt;
&lt;br /&gt;
To assist in this procedure - I am using a program called RED - [http://q4md-forcefieldtools.org/RED/] which can be used in all 3 stages...&lt;br /&gt;
&lt;br /&gt;
* Ante_RED is used to prepare Gaussian input files for the RESP fit&lt;br /&gt;
* RED III is used to fit atom centered charges&lt;br /&gt;
&lt;br /&gt;
The procedure used to create the CSY file for RESP fitting is as follows:&lt;br /&gt;
&lt;br /&gt;
* Cut the CSY residue from the 1W7S protein structure and cap terminal ends with methyl groups&lt;br /&gt;
* Optimise this in Gaussian at PM6 level&lt;br /&gt;
* Run the optimised CSY geometry through ante_RED to create the Gaussian input file involving minimisation at HF level and calculation of the MEP - also get the .P2N file required for RED III&lt;br /&gt;
* Run this Gaussian input file&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;Input file for optimisation:&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;#P hf/6-31G* Opt=(Tight,CalcFC) Freq SCF(Conver=8) Test&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
This will run a HF level optimisation and frequency analysis.  &#039;&#039;Opt=tight&#039;&#039; tightens the cutoffs on forces and step size that are used to determine convergence. An optimization with Opt=Tight will take several more steps than with the default cutoffs. For molecular systems with very small force constants (low frequency vibrational modes), this may be necessary to ensure adequate convergence and reliability of frequencies computed in a subsequent job step. This option can only be used with Berny optimizations. &#039;&#039;Opt = CalcFC&#039;&#039; causes the calculation of an analytical Hessian (in HF). &#039;&#039;SCF(Conver=8)&#039;&#039; sets the SCF convergence criterion to 10^-8.&lt;br /&gt;
&lt;br /&gt;
===Parameter files for use in Gaussian (Gaussian only uses amber94)===&lt;br /&gt;
&lt;br /&gt;
Using external parameter files. It is possible to include MM function entries within an external file by replacing the entries within the input file with the usual Gaussian include file mechanism: e.g., @oplsaa.prm. Parameter files for various force fields are included within the main Gaussian directory (i.e., $g09root/g09). If you want to specify the entire force field via an external file, use an input file like the following:&lt;br /&gt;
&lt;br /&gt;
# UFF=SoftOnly&lt;br /&gt;
&lt;br /&gt;
Read-in force field example&lt;br /&gt;
&lt;br /&gt;
molecule specification&lt;br /&gt;
&lt;br /&gt;
@$g09root/g09/oplsaa.prm&lt;br /&gt;
&lt;br /&gt;
Note that the choice of a molecular mechanics keyword is arbitrary as the entire force field depends on the read-in functions and parameters and not on any internal ones (in other words, specifying Amber would yield the same result as UFF).&lt;br /&gt;
&lt;br /&gt;
==Tues 16th Feb 2010==&lt;br /&gt;
===RESP Fitting===&lt;br /&gt;
&lt;br /&gt;
Following optimisation of the fragment for RESP fitting as seen before. We now have to fit the RESP to a MEP, this involves a few calculations:&lt;br /&gt;
&lt;br /&gt;
* ESP calcs in gaussian&lt;br /&gt;
* Fitting calcs by resp in ambertools&lt;br /&gt;
&lt;br /&gt;
The result is a .mol2 containing the fitted charges. The process has 2  input files:&lt;br /&gt;
&lt;br /&gt;
* .log&lt;br /&gt;
* .p2n&lt;br /&gt;
&lt;br /&gt;
The p2n is created by ante_RED and controls the RESP fitting process. (See RED site). To get a fit that excludes the capping methyls we add a REMARK line:&lt;br /&gt;
&lt;br /&gt;
 REMARK INTRA-MCC 0.0 |  35  36  37  38  39  40  41  42 | R&lt;br /&gt;
&lt;br /&gt;
This must be formatted EXACTLY as shown here (2 spaces between the numbers). The numbers correspond to atoms numbers, the 0.0 constrains these atoms to 0.0 charge and the R tells RESP to remove these atoms from the 2nd round fit and the subsequent mol2 output. Leaving a fragment.&lt;br /&gt;
&lt;br /&gt;
The RED program can be run on CX1! I used the following Jobscript (REDjobscript_4):&lt;br /&gt;
&lt;br /&gt;
 #PBS -l ncpus=1 &lt;br /&gt;
 #PBS -l mem=6000mb&lt;br /&gt;
 #PBS -l walltime=01:00:00&lt;br /&gt;
 #PBS -q pqchem&lt;br /&gt;
 #PBS -joe&lt;br /&gt;
 module load ambertools&lt;br /&gt;
 module load gaussian&lt;br /&gt;
 export GAUSS_SCRDIR=$TMPDIR&lt;br /&gt;
 echo $GAUSS_SCRDIR&lt;br /&gt;
 cd $HOME&lt;br /&gt;
 pwd&lt;br /&gt;
 perl /home/af706/RED-vIII.3.pl &amp;gt; /home/af706/RED-vIII.log&lt;br /&gt;
&lt;br /&gt;
So here we are using the chemistry node pqchem:&lt;br /&gt;
&lt;br /&gt;
* loading the ambertools (for RESP) and gaussian modules&lt;br /&gt;
* defining a scratch directory for the RESP program to read gaussian scratch&lt;br /&gt;
* TEMPDIR is used since the scratch must be empty upon calculation start. The temporary directory will be created fresh on each node as the calculation is started, and it&#039;s unlikely that two calculations will run on the same node.&lt;br /&gt;
* Moving to my HOME directory /home/af706 because the script must be run in the same directory as the input files  (.log and .p2n)&lt;br /&gt;
* RUN!&lt;br /&gt;
&lt;br /&gt;
The fitting process runs single point energy calcs and fits the RESP - overall it took 3mins for my fragment.&lt;br /&gt;
&lt;br /&gt;
===Going from xLeap out to Gaussian in===&lt;br /&gt;
&lt;br /&gt;
2 ways to do this:&lt;br /&gt;
&lt;br /&gt;
* Load in Gaussview&lt;br /&gt;
* Use Pengs scripts&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;Gaussview&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
* The file loads and all residues are recognised (apart from the chromophore). The file contains the same number of atoms as the saved Leap file.&lt;br /&gt;
* Charges are assigned from amber94 - not the chromophore still&lt;br /&gt;
* Amber atom types are assigned correctly to ALL atoms&lt;br /&gt;
* All that remains is to add charges from the RESP fit.&lt;br /&gt;
* PDB information is retained in input&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;TAO&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
* Atoms are added so atom count is different to the file leaving Leap&lt;br /&gt;
* Fails to treat terminal residues correctly&lt;br /&gt;
* Lose residue information&lt;br /&gt;
* Charges are missing from chromophore and atoms in terminal residues&lt;br /&gt;
&lt;br /&gt;
==22nd Feb 2010==&lt;br /&gt;
&lt;br /&gt;
Links to HOW TO for Ambertools + Gaussview/ Ambertools + TAO tools + Gaussview&lt;br /&gt;
&lt;br /&gt;
* Ambertools + Gaussview  - https://www.ch.ic.ac.uk/wiki/index.php/AndyForester_Ambertools_Gaussview&lt;br /&gt;
* Ambertools + TAO tools + Gaussview - https://www.ch.ic.ac.uk/wiki/index.php/AndyForester_Ambertools_TAO&lt;br /&gt;
&lt;br /&gt;
* Running Amber only Calculations - https://www.ch.imperial.ac.uk/wiki/index.php/AndyForesterAmber&lt;br /&gt;
&lt;br /&gt;
==26th Feb 2010==&lt;br /&gt;
&lt;br /&gt;
===Picking out model Modes===&lt;br /&gt;
&lt;br /&gt;
Integers and integer ranges without a keyword are interpreted as mode numbers, although the [not]mode keywords may be used. The keywords atoms and notatoms can be used to define an atom list whose modes should be included/excluded (respectively). Atoms can also be specified by ONIOM layer via the [not]layer keywords, which accept these values: real for the real system, model for the model system in a 2-layer ONIOM, middle for the middle layer in a 3-layer ONIOM, and small for the model layer of a 3-layer ONIOM. Atoms may be similarly included/excluded by residue with residue and notresidue, which accept lists of residue names or numbers. Both keyword sets function as shorthand forms for atom lists.&lt;br /&gt;
&lt;br /&gt;
==TAO tools==&lt;br /&gt;
&lt;br /&gt;
The expected differences to the one without using his tools are:&lt;br /&gt;
&lt;br /&gt;
* Different partial charges since his scripts use all_amino03 to assign charges,&lt;br /&gt;
  Gaussview uses amber94 charges (Net charge is the same for both files - good thing)&lt;br /&gt;
&lt;br /&gt;
* Potential for some changes in atom-types due to different versions of amber being used.&lt;br /&gt;
&lt;br /&gt;
What actually happened:&lt;br /&gt;
&lt;br /&gt;
* Difference in individual partial charges but same overal net charge&lt;br /&gt;
&lt;br /&gt;
* Different atom type assignment led to some more undefined parameters.&lt;br /&gt;
&lt;br /&gt;
* Defining these and running an energy led to significantly different energy than the non tools&lt;br /&gt;
  method:&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;WITHOUT tools&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
 AMBER calculation of energy.&lt;br /&gt;
 Energy per function type:&lt;br /&gt;
 Direct non-bonded                                 122152.193595037&lt;br /&gt;
 Non-bonded pair                                  -122162.319396395&lt;br /&gt;
 Harmonic stretch I                                     0.354921205&lt;br /&gt;
 Harmonic bend I                                        0.989669436&lt;br /&gt;
 Amber torsion                                          1.587390468&lt;br /&gt;
 Improper torsion                                       0.013641893&lt;br /&gt;
 Energy per function class:&lt;br /&gt;
        Coulomb              -9.200657698&lt;br /&gt;
    Vanderwaals              -0.925143661&lt;br /&gt;
     Stretching               0.354921205&lt;br /&gt;
        Bending               0.989669436&lt;br /&gt;
        Torsion               1.587390468&lt;br /&gt;
   Out-of-plane               0.013641893&lt;br /&gt;
 Energy=    -7.18017836     NIter=   0.&lt;br /&gt;
 Dipole moment=     -25.836974    -115.466231     -34.635294&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;WITH TAO tools&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
 AMBER calculation of energy.&lt;br /&gt;
 Energy per function type:&lt;br /&gt;
 Direct non-bonded                                 137343.707022741&lt;br /&gt;
 Non-bonded pair                                  -137347.284026475&lt;br /&gt;
 Harmonic stretch I                                     0.354921205&lt;br /&gt;
 Harmonic bend I                                        1.082982995&lt;br /&gt;
 Amber torsion                                          1.589487307&lt;br /&gt;
 Improper torsion                                       0.013645198&lt;br /&gt;
 Energy per function class:&lt;br /&gt;
        Coulomb              -6.890960120&lt;br /&gt;
    Vanderwaals               3.313956385&lt;br /&gt;
     Stretching               0.354921205&lt;br /&gt;
        Bending               1.082982995&lt;br /&gt;
        Torsion               1.589487307&lt;br /&gt;
   Out-of-plane               0.013645198&lt;br /&gt;
 Energy=   -0.535967030     NIter=   0.&lt;br /&gt;
 Dipole moment=     -29.988370    -117.918540     -32.409355&lt;br /&gt;
&lt;br /&gt;
Obvious where the differences are.&lt;/div&gt;</summary>
		<author><name>Af706</name></author>
	</entry>
	<entry>
		<id>https://chemwiki.ch.ic.ac.uk/index.php?title=AndyForester&amp;diff=98794</id>
		<title>AndyForester</title>
		<link rel="alternate" type="text/html" href="https://chemwiki.ch.ic.ac.uk/index.php?title=AndyForester&amp;diff=98794"/>
		<updated>2010-02-25T19:38:18Z</updated>

		<summary type="html">&lt;p&gt;Af706: /* 22nd Feb 2010 */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;=HBDI Calculations=&lt;br /&gt;
==Wed 21st Oct 2009==&lt;br /&gt;
&lt;br /&gt;
- Started project work&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
- Gaussview and Gaussian working&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
- Working out how to use the command inputs&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
- Trying to sort VPN out&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
- Working to get the project proposal written&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
&lt;br /&gt;
==Tue 27th Oct 2009==&lt;br /&gt;
&lt;br /&gt;
Drew up the HBDI molecule in Gaussview, the molecule will be optimised at the HF 3-21G level on the mac (locally). &amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
DFT B3LYP 6-31G (Becke threeparameter hybrid exchange functional5 and the Lee-Yang-Parr correlation functional) method optimisation was run on the resulting file, the MP2 6-31G method optimisation. Each of these had a corresponding freq calculation to check that minima were reached. All calculations were successful, with no negative vibrations.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;jmol&amp;gt;&lt;br /&gt;
  &amp;lt;jmolAppletButton&amp;gt;&lt;br /&gt;
    &amp;lt;uploadedFileContents&amp;gt;HBDI_MP2_6-31G_FREQ.mol&amp;lt;/uploadedFileContents&amp;gt;&lt;br /&gt;
    &amp;lt;text&amp;gt;HBDI optimised at the MP2 6-31G level&amp;lt;/text&amp;gt;&lt;br /&gt;
    &amp;lt;color&amp;gt;#FFFFFF&amp;lt;/color&amp;gt;&lt;br /&gt;
    &amp;lt;size&amp;gt;600&amp;lt;/size&amp;gt;&lt;br /&gt;
    &amp;lt;/jmolAppletButton&amp;gt;&lt;br /&gt;
&amp;lt;/jmol&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
&lt;br /&gt;
==Wed 28th Oct 2009==&lt;br /&gt;
&lt;br /&gt;
Ran DFT B3YLP 6-311G calculations on HBDI (Opt and Freq) since these were the calculations used in a published study of HBDI.&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
Optimisation job finshed successfully, the frequency calculation did not finish succesfully. Error was:&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
&lt;br /&gt;
 ****Warning!!: The largest alpha MO coefficient is  0.17588883D+02&lt;br /&gt;
 Symmetrizing basis deriv contribution to polar:&lt;br /&gt;
 IMax=3 JMax=2 DiffMx= 0.00D+00&lt;br /&gt;
 G2DrvN: will do    29 centers at a time, making    1 passes doing MaxLOS=1.&lt;br /&gt;
 Calling FoFCou, ICntrl=  3107 FMM=F I1Cent=   0 AccDes= 0.00D+00.&lt;br /&gt;
 FoFDir/FoFCou used for L=0 through L=1.&lt;br /&gt;
 End of G2Drv Frequency-dependent properties file   721 does not exist.&lt;br /&gt;
 End of G2Drv Frequency-dependent properties file   722 does not exist.&lt;br /&gt;
          IDoAtm=1111111111111111111111111111&lt;br /&gt;
          Differentiating once with respect to electric field.&lt;br /&gt;
                with respect to dipole field.&lt;br /&gt;
          Differentiating once with respect to nuclear coordinates.&lt;br /&gt;
          Keep R1 ints in memory in canonical form, NReq=447529180.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
==Thurs 29th Oct 2009==&lt;br /&gt;
&lt;br /&gt;
===Learning to use ONIOM===&lt;br /&gt;
&lt;br /&gt;
The example on chem wiki was attempted. Drawn up in Gaussview and optimised initially using molecular mechanics AMBER force field.&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
The ONIOM calculation was set up in Gaussview using DFT B3LYP 6-311G(d):AMBER methods with the benzene ring as the high level and chain at low level. Electronic embedding was NOT used in this initial calculation. &amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;jmol&amp;gt;&lt;br /&gt;
  &amp;lt;jmolAppletButton&amp;gt;&lt;br /&gt;
    &amp;lt;uploadedFileContents&amp;gt;ONIOM_B3LYP_6-311G_AMBER.mol&amp;lt;/uploadedFileContents&amp;gt;&lt;br /&gt;
    &amp;lt;text&amp;gt;Test Molecule optimised using B3LYP 6-311G(d):AMBER&amp;lt;/text&amp;gt;&lt;br /&gt;
    &amp;lt;color&amp;gt;#FFFFFF&amp;lt;/color&amp;gt;&lt;br /&gt;
    &amp;lt;size&amp;gt;600&amp;lt;/size&amp;gt;&lt;br /&gt;
  &amp;lt;/jmolAppletButton&amp;gt;&lt;br /&gt;
&amp;lt;/jmol&amp;gt;&lt;br /&gt;
&lt;br /&gt;
This conformation is NOT the lowest energy conformation, the initial AMBER MM calculation did not find the global minimum probably due to the drawn up starting geometry.&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
The molecule was drawn up again and DFT B3LYP 6-311G(d):AMBER and pure B3LYP 6-311G(d) calculations run. Absolute energy differences between the 2 conformations was very small at the ONIOM(DFT B3LYP 6-311G(d):AMBER) level. -232.27532671 vs -232.27315709 au respectively, a 0.00271au (4.359 744 17×10-18 J = 1au) difference.&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;jmol&amp;gt;&lt;br /&gt;
  &amp;lt;jmolAppletButton&amp;gt;&lt;br /&gt;
    &amp;lt;uploadedFileContents&amp;gt;ONIOM_B3LYP_6-311G_AMBER2.mol&amp;lt;/uploadedFileContents&amp;gt;&lt;br /&gt;
    &amp;lt;text&amp;gt;Test Molecule optimised using B3LYP 6-311G(d):AMBER&amp;lt;/text&amp;gt;&lt;br /&gt;
    &amp;lt;color&amp;gt;#FFFFFF&amp;lt;/color&amp;gt;&lt;br /&gt;
    &amp;lt;size&amp;gt;600&amp;lt;/size&amp;gt;&lt;br /&gt;
  &amp;lt;/jmolAppletButton&amp;gt;&lt;br /&gt;
&amp;lt;/jmol&amp;gt;&lt;br /&gt;
&lt;br /&gt;
It has already become obvious that using ONIOM decreases calculation time considerably. The table below shows a &amp;quot;CPU time&amp;quot; comparison on the above molecule using the different approaches.&lt;br /&gt;
&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot; border=&amp;quot;1&amp;quot;&lt;br /&gt;
| align=&amp;quot;center&amp;quot; style=&amp;quot;background:#f0f0f0;&amp;quot;|&#039;&#039;&#039;Calculation&#039;&#039;&#039;&lt;br /&gt;
| align=&amp;quot;center&amp;quot; style=&amp;quot;background:#f0f0f0;&amp;quot;|&#039;&#039;&#039;Time (1st attempt)&#039;&#039;&#039;&lt;br /&gt;
| align=&amp;quot;center&amp;quot; style=&amp;quot;background:#f0f0f0;&amp;quot;|&#039;&#039;&#039;Time (2nd attempt)&#039;&#039;&#039;&lt;br /&gt;
|-&lt;br /&gt;
|   ONIOM(DFT B3LYP 6-311G(d):AMBER)||4 minutes 22.0 seconds||8 minutes  5.5 seconds&lt;br /&gt;
|-&lt;br /&gt;
|   DFT 3LYP 6-311G(d)||2 hours 12 minutes 21.9 seconds||2 hours 30 minutes  0.8 seconds&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
&lt;br /&gt;
==Mon 2nd Nov 2009==&lt;br /&gt;
&lt;br /&gt;
Having obtained optimized structures of HBDI at various theory levels and basis set variations, these structures will now be compared to previous literature. Extensive studies have been performed using HF and B3LYP methods previously. It has been noted in Jaspers recent review that HF does not model frequencies very accurately - my computations agree HF 6-31G frequencies are very far out. B3LYP frequencies are much closer to the experimental values.&lt;br /&gt;
&lt;br /&gt;
==Tues 3rd Nov 2009==&lt;br /&gt;
&lt;br /&gt;
Tabulated frerquency comparison&lt;br /&gt;
&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot; border=&amp;quot;1&amp;quot;&lt;br /&gt;
| align=&amp;quot;center&amp;quot; style=&amp;quot;background:#f0f0f0;&amp;quot;|&lt;br /&gt;
| align=&amp;quot;center&amp;quot; style=&amp;quot;background:#f0f0f0;&amp;quot;|&#039;&#039;&#039;A.Esposito HF 6-31G(d)&#039;&#039;&#039;&lt;br /&gt;
| align=&amp;quot;center&amp;quot; style=&amp;quot;background:#f0f0f0;&amp;quot;|&#039;&#039;&#039;A.Esposito B3LYP 6-31G(d)&#039;&#039;&#039;&lt;br /&gt;
| align=&amp;quot;center&amp;quot; style=&amp;quot;background:#f0f0f0;&amp;quot;|&#039;&#039;&#039;X.He B3LYP 6-31G(d)&#039;&#039;&#039;&lt;br /&gt;
| align=&amp;quot;center&amp;quot; style=&amp;quot;background:#f0f0f0;&amp;quot;|&#039;&#039;&#039;HF 6-31G(d)&#039;&#039;&#039;&lt;br /&gt;
| align=&amp;quot;center&amp;quot; style=&amp;quot;background:#f0f0f0;&amp;quot;|&#039;&#039;&#039;B3LYP 6-31G(d)&#039;&#039;&#039;&lt;br /&gt;
|-&lt;br /&gt;
| C=O||1987||1802||1802||1986||1801&lt;br /&gt;
|-&lt;br /&gt;
| C=C||1880||1705||1704||1880||1704&lt;br /&gt;
|-&lt;br /&gt;
| Phenol||1809||1665||1663||1812||1666&lt;br /&gt;
|-&lt;br /&gt;
| Phenol||1781||1635||1631||1777||1635&lt;br /&gt;
|-&lt;br /&gt;
| C=N||1844||1620||1618||1844||1618&lt;br /&gt;
|-&lt;br /&gt;
| Phenol||1696||1562||1558||1694||1562&lt;br /&gt;
|-&lt;br /&gt;
| C-N||1607||1483||N/A||1606||1482&lt;br /&gt;
|-&lt;br /&gt;
| Phenol||1602||1487||N/A||1603||1488&lt;br /&gt;
|-&lt;br /&gt;
| Total Energy(au)||||||||-720.043||-724.465&lt;br /&gt;
|-&lt;br /&gt;
| &lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
==Thurs 5th Nov 2009==&lt;br /&gt;
&lt;br /&gt;
Having got Jaspers input files, several things were noticed:&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
* Solvent models were used - should be in vacum.... I have re run these in G09 with No solvent on both DFT 6-311G(d,p) and HF 6-31+G(d) methods/basis sets for comparison&lt;br /&gt;
* The models in his Biophys paper using other residues in contact COO- for example are not present. He has not reported the pure HBDI calculation results.&lt;br /&gt;
* More than just HBDI is likely to be required, meaning larger than expected high level region in the GFP calculation. (3 layer ONIOM DFT:HF:AMBER)?&lt;br /&gt;
&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
Other things to do:&lt;br /&gt;
&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
* Work on a guide to loading .pdb files into Gaussview 5 - how much information is maintained?&lt;br /&gt;
* Are residues defined still in Gaussview input files&lt;br /&gt;
* AMBER MM requires definitions of charges etc... how easy is it to import from .pdb and get a file that can be run in opt etc...&lt;br /&gt;
&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
* Find a model system which is larger than HBDI but incorporates this molecule &amp;lt;=500 atoms (Limit of size for pure DFT calculation)&lt;br /&gt;
* Possibilities include a system alread developed by someone else online&lt;br /&gt;
* A &amp;quot;custom&amp;quot; system made by me to model this (time intensive)&lt;br /&gt;
* &amp;quot;strip&amp;quot; back the GFP pdb file to 500 atoms surrounding the HBDI chromophore.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
==Fri 6th Nov 2009==&lt;br /&gt;
&lt;br /&gt;
Yesterday started by trying to follow the tutorial on loading the GFP molecule from .pdb, this resulted in a problem with Gaussviews addition of hydrogens. It over protonated the nitrogens and carbon in the chromophore leading to over valent parts. I then tried several other molecules (PYP, a scorpion toxin and a strand of RNA) which had no problems with hydrogen addition.&lt;br /&gt;
&lt;br /&gt;
I tried to use ChemBio 3D to load the .pdb and correctly assign hydrogens/connectivity. This resulted in the same protonation problems!&lt;br /&gt;
&lt;br /&gt;
==Mon 9th Nov 2009==&lt;br /&gt;
&lt;br /&gt;
TINKER - http://dasher.wustl.edu/tinker/ was installed with the Force Field Explorer (FFE) interface on the mac. This was used to load the same .pdb file and resulted in the same protonation around the chromophore. So far the SAME protonation pattern (incorrect) around the chromophore has been found using:&lt;br /&gt;
&lt;br /&gt;
*Gaussview&lt;br /&gt;
*ChemBio 3D&lt;br /&gt;
*TINKER&lt;br /&gt;
&lt;br /&gt;
Looking into how the .pdb file format works, the molecule is built from a series of residues, it becomes clear as you look at the sequence that is building the GFP protein chromophore (residues 65,66,67 in 1GFL) are not then reproducing the internal cyclisation properly leaving protons attached and thus atoms hypervalent. This accounts for the consistent error loading from the .pdb. As a result must all molecules expressed as a .pdb file must be pure protein structures...?&lt;br /&gt;
&lt;br /&gt;
===Comparison of the 2 available wild type GFP structures===&lt;br /&gt;
&lt;br /&gt;
The 2 structures being considered here are:&lt;br /&gt;
&lt;br /&gt;
*1GFL - Yang et al. -http://www.pdb.org/pdb/explore/explore.do?structureId=1GFL&lt;br /&gt;
*1W7S - Jaspers stucture - http://www.pdb.org/pdb/explore/explore.do?structureId=1W7S&lt;br /&gt;
&lt;br /&gt;
These 2 structures differ in that 1GFL contains 2 GFP molecules, whereas 1W7S contains 4. These were both stripped down to 1 unit of GFP, using the &amp;quot;pdb secondary structure&amp;quot; function in Gaussview 5. 1GFL had chain B removed and 1W7S had chains B,C and D removed. Next the residues were investigated that created the chromophore. The results are not consistent because GFL uses 3 residues (65,66,67) to form the chromophore. 1W7S has residue 66 (labelled CSY) as the entire chromophore! Both get protonated the same (incorrectly) which is odd considering how they molecules are built.&lt;br /&gt;
&lt;br /&gt;
The next test was to &amp;quot;modify bond&amp;quot; in Gaussview 5 and create a &amp;quot;cleaned&amp;quot; chromophore structure (no bonding information from the residue) by manually changing the bonds. The chromophore in each .pdb file was altered to look like HBDI in terms of double single bonds and the hydrogens taken out (makes no difference as these are created each time the .pdb is loaded). When the new .pdb files (1GFL_cleaned.pdb and 1W7S_cleaned.pdb) were loaded they were correctly protonated! Success.&lt;br /&gt;
&lt;br /&gt;
===1W7S improvement===&lt;br /&gt;
&lt;br /&gt;
Chain A (not cleaned) of the 1W7S .pdb was submitted to run a AMBER calculation, as expected it failed with the lines:&lt;br /&gt;
&lt;br /&gt;
 Missing atomic parameters for atom   474 IAtTyp=    20000000&lt;br /&gt;
 Missing atomic parameters.&lt;br /&gt;
&lt;br /&gt;
This error says that the atom type of atom 474 is not defined. One way to counter this is to locate the atom and then change its definition manually (as shown in the example https://www.ch.ic.ac.uk/wiki/index.php/PYP_:_Photoactive_yellow_protein - time intensive!)&lt;br /&gt;
&lt;br /&gt;
Instead the molecule was cleaned (manually changing the bonding in the chromophore), after the bond updates the structure was re submitted, again failing but with a different message:&lt;br /&gt;
&lt;br /&gt;
 Include all MM classes&lt;br /&gt;
  Bondstretch undefined between atoms    473    478 CC-N* [H,H]  &lt;br /&gt;
  Bondstretch undefined between atoms    475    476 CC-CC [H,H]  &lt;br /&gt;
  Bondstretch undefined between atoms    475    486 CC-CM [H,H]  &lt;br /&gt;
  Bondstretch undefined between atoms    476    477 CC-O [H,H]  &lt;br /&gt;
  Bondstretch undefined between atoms    476    478 CC-N* [H,H]  &lt;br /&gt;
  Bondstretch undefined between atoms    486   2559 CM-HC [H,H]  &lt;br /&gt;
  Angle bend  undefined between atoms    473    474    475 CC-NB-CC [H,H,H]  &lt;br /&gt;
  Angle bend  undefined between atoms    473    478    479 CC-N*-CT [H,H,H]  &lt;br /&gt;
  Angle bend  undefined between atoms    473    478    476 CC-N*-CC [H,H,H]  &lt;br /&gt;
  Angle bend  undefined between atoms    473    482    483 CC-CT-N [H,H,H]  &lt;br /&gt;
  Angle bend  undefined between atoms    473    482   3857 CC-CT-H1 [H,H,H]  &lt;br /&gt;
  Angle bend  undefined between atoms    474    475    476 NB-CC-CC [H,H,H]  &lt;br /&gt;
  Angle bend  undefined between atoms    474    475    486 NB-CC-CM [H,H,H]  &lt;br /&gt;
  Angle bend  undefined between atoms    474    473    478 NB-CC-N* [H,H,H]  &lt;br /&gt;
  Angle bend  undefined between atoms    475    476    477 CC-CC-O [H,H,H]  &lt;br /&gt;
  Angle bend  undefined between atoms    475    476    478 CC-CC-N* [H,H,H]  &lt;br /&gt;
  Angle bend  undefined between atoms    475    486    487 CC-CM-CA [H,H,H]  &lt;br /&gt;
  Angle bend  undefined between atoms    475    486   2559 CC-CM-HC [H,H,H]  &lt;br /&gt;
  Angle bend  undefined between atoms    476    478    479 CC-N*-CT [H,H,H]  &lt;br /&gt;
  Angle bend  undefined between atoms    476    475    486 CC-CC-CM [H,H,H]  &lt;br /&gt;
  Angle bend  undefined between atoms    477    476    478 O-CC-N* [H,H,H]  &lt;br /&gt;
  Angle bend  undefined between atoms    478    479    480 N*-CT-C [H,H,H]  &lt;br /&gt;
  Angle bend  undefined between atoms    478    473    482 N*-CC-CT [H,H,H]  &lt;br /&gt;
  Angle bend  undefined between atoms    486    487    488 CM-CA-CA [H,H,H]  &lt;br /&gt;
  Angle bend  undefined between atoms    486    487    489 CM-CA-CM [H,H,H]  &lt;br /&gt;
  Angle bend  undefined between atoms    487    486   2559 CA-CM-HC [H,H,H]  &lt;br /&gt;
  Angle bend  undefined between atoms    488    487    489 CA-CA-CM [H,H,H]  &lt;br /&gt;
  Angle bend  undefined between atoms    490    492    491 CA-C-CM [H,H,H]  &lt;br /&gt;
  Angle bend  undefined between atoms    491    492    493 CM-C-OH [H,H,H]  &lt;br /&gt;
  MM function not complete&lt;br /&gt;
&lt;br /&gt;
This is a better error message since all the atoms are now defined, only parameters are missing. For parameters that involve only atoms in the chromophore at the high level in the ONIOM calculation, the values can be set to 0 in the input file. This is because the ONIOM extrapolation will cancel MM from this high level region, To do this the keyword &amp;quot;Amber=Hardfirst&amp;quot; and wildcards for the missing parameters which are zero valued are placed in the input file.&lt;br /&gt;
&lt;br /&gt;
[[Image:Atoms involving undefined parameters.png]]&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
&lt;br /&gt;
The above screeshot shows highlighted the atoms listed in the error... most of them if not all will be involved in the high level calculation and thus may not require parameters defining (other than to be set to 0). The reason the parameters are undefined is that the AMBER force field is very specific to protein bonds and thus the organic chromophore has bonds that are not found in this parameter set.&lt;br /&gt;
&lt;br /&gt;
The atoms in the chromophore were frozen in the atom group editor and the input file had a parameter set added to the end setting chromophore parameters to 0:&lt;br /&gt;
&lt;br /&gt;
 HrmStr1 CC N* 0.0 0.0&lt;br /&gt;
 HrmStr1 CC CC 0.0 0.0&lt;br /&gt;
 HrmStr1 CC CM 0.0 0.0&lt;br /&gt;
 HrmStr1 CC O 0.0 0.0&lt;br /&gt;
 HrmStr1 CM HC 0.0 0.0&lt;br /&gt;
 HrmBnd1 CC NB CC 0.0 0.0&lt;br /&gt;
 HrmBnd1 CC N* CC 0.0 0.0&lt;br /&gt;
 HrmBnd1 CC CT N 0.0 0.0&lt;br /&gt;
 HrmBnd1 CC CT H1 0.0 0.0&lt;br /&gt;
 HrmBnd1 NB CC CC 0.0 0.0&lt;br /&gt;
 HrmBnd1 NB CC CM 0.0 0.0&lt;br /&gt;
 HrmBnd1 NB CC N* 0.0 0.0&lt;br /&gt;
 HrmBnd1 CC CC O  0.0 0.0&lt;br /&gt;
 HrmBnd1 CC CC N* 0.0 0.0&lt;br /&gt;
 HrmBnd1 CC CM CA 0.0 0.0&lt;br /&gt;
 HrmBnd1 CC CM HC 0.0 0.0&lt;br /&gt;
 HrmBnd1 CC CC CM 0.0 0.0&lt;br /&gt;
 HrmBnd1 O CC N*  0.0 0.0&lt;br /&gt;
 HrmBnd1 N* CT C  0.0 0.0&lt;br /&gt;
 HrmBnd1 N* CC CT 0.0 0.0&lt;br /&gt;
 HrmBnd1 CM CA CA 0.0 0.0&lt;br /&gt;
 HrmBnd1 CA CM HC 0.0 0.0&lt;br /&gt;
 HrmBnd1 CA C CM 0.0 0.0&lt;br /&gt;
 HrmBnd1 CM C OH 0.0 0.0&lt;br /&gt;
 HrmBnd1 CC N* CT 0.0 0.0&lt;br /&gt;
 HrmBnd1 CM CA CM 0.0 0.0&lt;br /&gt;
&lt;br /&gt;
This file then ran successfully for an AMBER single point energy calculation. It was then submitted for AMBER optimisation using &amp;quot;amber=hardfirst geom=connectivity&amp;quot; and returned a successful minimum with Forces and Displacements converged:&lt;br /&gt;
&lt;br /&gt;
         Item               Value     Threshold  Converged?&lt;br /&gt;
  Maximum Force            0.000089     0.000450     YES&lt;br /&gt;
  RMS     Force            0.000004     0.000300     YES&lt;br /&gt;
  Maximum Displacement     0.000089     0.001800     YES&lt;br /&gt;
  RMS     Displacement     0.000004     0.001200     YES&lt;br /&gt;
  Predicted change in Energy=-1.156135D-07&lt;br /&gt;
  Optimization completed.&lt;br /&gt;
     -- Stationary point found.&lt;br /&gt;
&lt;br /&gt;
===Cleaned .pdb files and TINKER===&lt;br /&gt;
&lt;br /&gt;
*The files that had been cleaned were opened in TINKER, it is necessary to convert the .pdb to a .xyz file to run any calculations within TINKER. This is currently failing due to an &amp;quot;atom missing on SER-228&amp;quot; - checking this there is no SER residue at 228 (in fact a GLU) so a bit puzzled. This is the 1W7S cleaned version.&lt;br /&gt;
&lt;br /&gt;
*PYP was loaded in its .pdb format to TINKER and was immediately successful in being converted to .xyz then a MINIMISE algorithm ran successfully with the energy at its X-ray crystal structure as 261KJ/mol reduced to -3670.5355KJ/mol in 1645 iterations - this took a while (45mins locally)!&lt;br /&gt;
&lt;br /&gt;
*Interestingly the .xyz output from the minimization can be converted back to .pdb (loses hydrogens minimization) and this PYP file was loaded into Gaussview 5 - the file then ran successfully for an AMBER single point energy calculation! cf. the first attempt to get PYP to run was unsuccessful, and the tutorial underwent much more hassle to run this calculation.&lt;br /&gt;
&lt;br /&gt;
===What .pdb information is retained?===&lt;br /&gt;
&lt;br /&gt;
The only molecule that has been run with AMBER opt (GFP) has lost its secondary structure information but RETAINED its residue information.&lt;br /&gt;
&lt;br /&gt;
==Fri 13th Nov 2009==&lt;br /&gt;
&lt;br /&gt;
===Conversion from.xyz to .com===&lt;br /&gt;
A protein with 2 residues (Glu-Asp) was built using TINKER, saved as:&lt;br /&gt;
&lt;br /&gt;
*.xyz&lt;br /&gt;
*.pdb&lt;br /&gt;
*.com&lt;br /&gt;
&lt;br /&gt;
The initial format was .xyz (created in TINEKR as a native format), converted to .pdb using TINKER and then loaded into Gaussview and saved as a .com.&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
&lt;br /&gt;
.xyz&lt;br /&gt;
&lt;br /&gt;
    36  Protein built with TINKER/Force Field Explorer&lt;br /&gt;
     1  CT     0.000000    0.000000    0.000000   340     2     4     5     6&lt;br /&gt;
     2  C      0.000000    0.000000    1.510000   342     1     3     7&lt;br /&gt;
     3  O      1.028938    0.000000    2.165506   343     2&lt;br /&gt;
     4  HC     1.056270    0.000000   -0.341166   341     1&lt;br /&gt;
     5  HC    -0.497647    0.931694   -0.341166   341     1&lt;br /&gt;
     6  HC    -0.497647   -0.931694   -0.341166   341     1&lt;br /&gt;
     7  N     -1.236201    0.000000    2.027114   232     2     8    11&lt;br /&gt;
     8  CT    -1.446961    0.000000    3.471822   233     7     9    12    13&lt;br /&gt;
     9  C     -2.509557   -0.992751    3.878558   234     8    10    22&lt;br /&gt;
    10  O     -3.569672   -1.098634    3.284129   236     9&lt;br /&gt;
    11  H     -2.025518    0.000000    1.381066   235     7&lt;br /&gt;
    12  H1    -0.503221   -0.281395    3.983946   237     8&lt;br /&gt;
    13  CT    -1.910491    1.397334    3.923704   238     8    14    18    19&lt;br /&gt;
    14  CT    -2.132799    1.397334    5.447574   240    13    15    20    21&lt;br /&gt;
    15  C     -2.585014    2.781238    5.848089   242    14    16    17&lt;br /&gt;
    16  O2    -2.816736    2.973609    7.061266   243    15&lt;br /&gt;
    17  O2    -2.693194    3.629064    4.935955   243    15&lt;br /&gt;
    18  HC    -2.861794    1.653482    3.412322   239    13&lt;br /&gt;
    19  HC    -1.132567    2.145528    3.664589   239    13&lt;br /&gt;
    20  HC    -1.170306    1.191123    5.960588   241    14&lt;br /&gt;
    21  HC    -2.946052    0.685840    5.701535   241    14&lt;br /&gt;
    22  N     -2.153997   -1.723291    4.944151   210     9    23    26&lt;br /&gt;
    23  CT    -3.047986   -2.739703    5.491225   211    22    24    27    28&lt;br /&gt;
    24  C     -3.120370   -2.663723    6.997574   212    23    25    34&lt;br /&gt;
    25  O     -2.128241   -2.535414    7.695870   214    24&lt;br /&gt;
    26  H     -1.239994   -1.547921    5.361576   213    22&lt;br /&gt;
    27  H1    -4.072297   -2.585746    5.092246   215    23&lt;br /&gt;
    28  CT    -2.526745   -4.139104    5.114946   216    23    29    32    33&lt;br /&gt;
    29  C     -3.476741   -5.157722    5.698062   218    28    30    31&lt;br /&gt;
    30  O2    -3.212872   -6.362275    5.493304   219    29&lt;br /&gt;
    31  O2    -4.454666   -4.718803    6.341112   219    29&lt;br /&gt;
    32  HC    -1.530243   -4.298013    5.577366   217    28&lt;br /&gt;
    33  HC    -2.540086   -4.252948    4.010880   217    28&lt;br /&gt;
    34  N     -4.374888   -2.752228    7.460123   344    24    35    36&lt;br /&gt;
    35  H     -5.140865   -2.858197    6.794954   345    34&lt;br /&gt;
    36  H     -4.534830   -2.711583    8.466684   345    34&lt;br /&gt;
&lt;br /&gt;
.pdb&lt;br /&gt;
&lt;br /&gt;
 HEADER    Protein built with TINKER/Force Field Explorer&lt;br /&gt;
 COMPND&lt;br /&gt;
 SOURCE&lt;br /&gt;
 ATOM      1  CH3 ACE     1       0.000   0.000   0.000&lt;br /&gt;
 ATOM      2  C   ACE     1       0.000   0.000   1.510&lt;br /&gt;
 ATOM      3  O   ACE     1       1.029   0.000   2.166&lt;br /&gt;
 ATOM      4  H1  ACE     1       1.056   0.000  -0.341&lt;br /&gt;
 ATOM      5  H2  ACE     1      -0.498   0.932  -0.341&lt;br /&gt;
 ATOM      6  H3  ACE     1      -0.498  -0.932  -0.341&lt;br /&gt;
 ATOM      7  N   GLU     2      -1.236   0.000   2.027&lt;br /&gt;
 ATOM      8  CA  GLU     2      -1.447   0.000   3.472&lt;br /&gt;
 ATOM      9  C   GLU     2      -2.510  -0.993   3.879&lt;br /&gt;
 ATOM     10  O   GLU     2      -3.570  -1.099   3.284&lt;br /&gt;
 ATOM     11  CB  GLU     2      -1.910   1.397   3.924&lt;br /&gt;
 ATOM     12  CG  GLU     2      -2.133   1.397   5.448&lt;br /&gt;
 ATOM     13  CD  GLU     2      -2.585   2.781   5.848&lt;br /&gt;
 ATOM     14  OE1 GLU     2      -2.817   2.974   7.061&lt;br /&gt;
 ATOM     15  OE2 GLU     2      -2.693   3.629   4.936&lt;br /&gt;
 ATOM     16  H   GLU     2      -2.026   0.000   1.381&lt;br /&gt;
 ATOM     17  HA  GLU     2      -0.503  -0.281   3.984&lt;br /&gt;
 ATOM     18  HB2 GLU     2      -2.862   1.653   3.412&lt;br /&gt;
 ATOM     19  HB3 GLU     2      -1.133   2.146   3.665&lt;br /&gt;
 ATOM     20  HG2 GLU     2      -1.170   1.191   5.961&lt;br /&gt;
 ATOM     21  HG3 GLU     2      -2.946   0.686   5.702&lt;br /&gt;
 ATOM     22  N   ASP     3      -2.154  -1.723   4.944&lt;br /&gt;
 ATOM     23  CA  ASP     3      -3.048  -2.740   5.491&lt;br /&gt;
 ATOM     24  C   ASP     3      -3.120  -2.664   6.998&lt;br /&gt;
 ATOM     25  O   ASP     3      -2.128  -2.535   7.696&lt;br /&gt;
 ATOM     26  CB  ASP     3      -2.527  -4.139   5.115&lt;br /&gt;
 ATOM     27  CG  ASP     3      -3.477  -5.158   5.698&lt;br /&gt;
 ATOM     28  OD1 ASP     3      -3.213  -6.362   5.493&lt;br /&gt;
 ATOM     29  OD2 ASP     3      -4.455  -4.719   6.341&lt;br /&gt;
 ATOM     30  H   ASP     3      -1.240  -1.548   5.362&lt;br /&gt;
 ATOM     31  HA  ASP     3      -4.072  -2.586   5.092&lt;br /&gt;
 ATOM     32  HB2 ASP     3      -1.530  -4.298   5.577&lt;br /&gt;
 ATOM     33  HB3 ASP     3      -2.540  -4.253   4.011&lt;br /&gt;
 ATOM     34  N   NH2     4      -4.375  -2.752   7.460&lt;br /&gt;
 ATOM     35  H1  NH2     4      -5.141  -2.858   6.795&lt;br /&gt;
 ATOM     36  H2  NH2     4      -4.535  -2.712   8.467&lt;br /&gt;
 END&lt;br /&gt;
&lt;br /&gt;
.com&lt;br /&gt;
&lt;br /&gt;
 %chk=File_Test_Glu-Asp.chk&lt;br /&gt;
 # hf/3-21g geom=connectivity&lt;br /&gt;
 &lt;br /&gt;
 Protein built with TINKER/Force Field Explorer&lt;br /&gt;
 &lt;br /&gt;
 0 1&lt;br /&gt;
 C(PDBName=CH3,ResName=ACE,ResNum=1)                   0.00000000    0.00000000    0.00000000&lt;br /&gt;
 C(PDBName=C,ResName=ACE,ResNum=1)                     0.00000000    0.00000000    1.51000000&lt;br /&gt;
 O(PDBName=O,ResName=ACE,ResNum=1)                     1.02900000    0.00000000    2.16600000&lt;br /&gt;
 H(PDBName=H1,ResName=ACE,ResNum=1)                    1.05600000    0.00000000   -0.34100000&lt;br /&gt;
 H(PDBName=H2,ResName=ACE,ResNum=1)                   -0.49800000    0.93200000   -0.34100000&lt;br /&gt;
 H(PDBName=H3,ResName=ACE,ResNum=1)                   -0.49800000   -0.93200000   -0.34100000&lt;br /&gt;
 N(PDBName=N,ResName=GLU,ResNum=2)                    -1.23600000    0.00000000    2.02700000&lt;br /&gt;
 C(PDBName=CA,ResName=GLU,ResNum=2)                   -1.44700000    0.00000000    3.47200000&lt;br /&gt;
 C(PDBName=C,ResName=GLU,ResNum=2)                    -2.51000000   -0.99300000    3.87900000&lt;br /&gt;
 O(PDBName=O,ResName=GLU,ResNum=2)                    -3.57000000   -1.09900000    3.28400000&lt;br /&gt;
 C(PDBName=CB,ResName=GLU,ResNum=2)                   -1.91000000    1.39700000    3.92400000&lt;br /&gt;
 C(PDBName=CG,ResName=GLU,ResNum=2)                   -2.13300000    1.39700000    5.44800000&lt;br /&gt;
 C(PDBName=CD,ResName=GLU,ResNum=2)                   -2.58500000    2.78100000    5.84800000&lt;br /&gt;
 O(PDBName=OE1,ResName=GLU,ResNum=2)                  -2.81700000    2.97400000    7.06100000&lt;br /&gt;
 O(PDBName=OE2,ResName=GLU,ResNum=2)                  -2.69300000    3.62900000    4.93600000&lt;br /&gt;
 H(PDBName=H,ResName=GLU,ResNum=2)                    -2.02600000    0.00000000    1.38100000&lt;br /&gt;
 H(PDBName=HA,ResName=GLU,ResNum=2)                   -0.50300000   -0.28100000    3.98400000&lt;br /&gt;
 H(PDBName=HB2,ResName=GLU,ResNum=2)                  -2.86200000    1.65300000    3.41200000&lt;br /&gt;
 H(PDBName=HB3,ResName=GLU,ResNum=2)                  -1.13300000    2.14600000    3.66500000&lt;br /&gt;
 H(PDBName=HG2,ResName=GLU,ResNum=2)                  -1.17000000    1.19100000    5.96100000&lt;br /&gt;
 H(PDBName=HG3,ResName=GLU,ResNum=2)                  -2.94600000    0.68600000    5.70200000&lt;br /&gt;
 N(PDBName=N,ResName=ASP,ResNum=3)                    -2.15400000   -1.72300000    4.94400000&lt;br /&gt;
 C(PDBName=CA,ResName=ASP,ResNum=3)                   -3.04800000   -2.74000000    5.49100000&lt;br /&gt;
 C(PDBName=C,ResName=ASP,ResNum=3)                    -3.12000000   -2.66400000    6.99800000&lt;br /&gt;
 O(PDBName=O,ResName=ASP,ResNum=3)                    -2.12800000   -2.53500000    7.69600000&lt;br /&gt;
 C(PDBName=CB,ResName=ASP,ResNum=3)                   -2.52700000   -4.13900000    5.11500000&lt;br /&gt;
 C(PDBName=CG,ResName=ASP,ResNum=3)                   -3.47700000   -5.15800000    5.69800000&lt;br /&gt;
 O(PDBName=OD1,ResName=ASP,ResNum=3)                  -3.21300000   -6.36200000    5.49300000&lt;br /&gt;
 O(PDBName=OD2,ResName=ASP,ResNum=3)                  -4.45500000   -4.71900000    6.34100000&lt;br /&gt;
 H(PDBName=H,ResName=ASP,ResNum=3)                    -1.24000000   -1.54800000    5.36200000&lt;br /&gt;
 H(PDBName=HA,ResName=ASP,ResNum=3)                   -4.07200000   -2.58600000    5.09200000&lt;br /&gt;
 H(PDBName=HB2,ResName=ASP,ResNum=3)                  -1.53000000   -4.29800000    5.57700000&lt;br /&gt;
 H(PDBName=HB3,ResName=ASP,ResNum=3)                  -2.54000000   -4.25300000    4.01100000&lt;br /&gt;
 N(PDBName=N,ResName=NH2,ResNum=4)                    -4.37500000   -2.75200000    7.46000000&lt;br /&gt;
 H(PDBName=H1,ResName=NH2,ResNum=4)                   -5.14100000   -2.85800000    6.79500000&lt;br /&gt;
 H(PDBName=H2,ResName=NH2,ResNum=4)                   -4.53500000   -2.71200000    8.46700000&lt;br /&gt;
 &lt;br /&gt;
 1 2 1.0 4 1.0 5 1.0 6 1.0&lt;br /&gt;
 2 3 2.0 7 1.5&lt;br /&gt;
 3&lt;br /&gt;
 4&lt;br /&gt;
 5&lt;br /&gt;
 6&lt;br /&gt;
 7 16 1.0 8 1.0&lt;br /&gt;
 8 9 1.0 11 1.0 17 1.0&lt;br /&gt;
 9 10 2.0 22 1.5&lt;br /&gt;
 10&lt;br /&gt;
 11 18 1.0 12 1.0 19 1.0&lt;br /&gt;
 12 21 1.0 13 1.0 20 1.0&lt;br /&gt;
 13 14 2.0 15 2.0&lt;br /&gt;
 14&lt;br /&gt;
 15&lt;br /&gt;
 16&lt;br /&gt;
 17&lt;br /&gt;
 18&lt;br /&gt;
 19&lt;br /&gt;
 20&lt;br /&gt;
 21&lt;br /&gt;
 22 23 1.0 30 1.0&lt;br /&gt;
 23 26 1.0 24 1.0 31 1.0&lt;br /&gt;
 24 34 1.5 25 2.0&lt;br /&gt;
 25&lt;br /&gt;
 26 27 1.0 33 1.0 32 1.0&lt;br /&gt;
 27 28 2.0 29 2.0&lt;br /&gt;
 28&lt;br /&gt;
 29&lt;br /&gt;
 30&lt;br /&gt;
 31&lt;br /&gt;
 32&lt;br /&gt;
 33&lt;br /&gt;
 34 35 1.0 36 1.0&lt;br /&gt;
 35&lt;br /&gt;
 36&lt;br /&gt;
&lt;br /&gt;
The issue going from .xyz straight to .com is that ALL protein information would be lost ie. residue information that is written into the .pdb at .xyz to .pdb comversion.&lt;br /&gt;
&lt;br /&gt;
===PYP files as downloaded compared to run through TINKER===&lt;br /&gt;
&lt;br /&gt;
*TINKER removes much of the header information&lt;br /&gt;
*TINKER only preserves the 1st model of a .pdb file (may be multiple models)&lt;br /&gt;
*TINKER removes far right 3 columns of information (not essential to the .pdb file)&lt;br /&gt;
*A virtual residue in the original .pdb comprising the chromophore was omitted from the TINKER output (number 126)&lt;br /&gt;
&lt;br /&gt;
===What each TINKER search algorithm does===&lt;br /&gt;
&lt;br /&gt;
There are several programs within TINKER that may be able to help locate a global minimum (not just one near the published crystal structure) by way of searching the potential energy surface. All of these are MM methods for use in TINKER.&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
&#039;&#039;&#039;SCAN&#039;&#039;&#039;&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
A program for general conformational search of an entire potential energy surface via a basin hopping method. The program takes as input a TINKER .xyz coordinates file which is then minimized to find the first local minimum for a search list. A series of activations along various normal modes from this initial minimum are used as seed points for additional minimizations. Whenever a previously unknown local minimum is located it is added to the search list. When all minima on the search list have been subjected to the normal mode activation without locating additional new minima, the program terminates. The individual local minima are written to cycle files as they are discovered. While the SCAN program can be used on standard undeformed potential energy surfaces, we have found it to be most useful for quickly ‘‘scanning’’ a smoothed energy surface to enumerate the major basins of attraction spaning the entire surface.&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;SNIFFER&#039;&#039;&#039;&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
A program that implements the Sniffer global optimization algorithm of Butler and Slaminka, a discrete version of Griewank’s global search trajectory method. The program takes an input TINKER .xyz coordinates file and shakes it vigorously via a modified dynamics trajectory before, hopefully, settling into a low lying minimum. Some trial and error is often required as the current implementation is sensitive to various parameters and tolerances that govern the computation. At present, these parameters are not user accessible, and must be altered in the source code. However, this method can do a good job of quickly optimizing conformation within a limited range of convergence.&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;PSS&#039;&#039;&#039;&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
Implements our version of a potential smoothing and search algorithm for the global optimization of molecular conformation. An initial structure in .xyz format is first minimized in Cartesian coordinates on a series of increasingly smoothed potential energy surfaces. Then the smoothing procedure is reversed with minimization on each successive surface starting from the coordinates of the minimum on the previous surface. A local search procedure is used during the backtracking to explore for alternative minima better than the one found during the current minimization. The final result is usually a very low energy conformation or, in favorable cases, the global energy minimum conformation. The minimum energy coordinate sets found on each surface during both the forward smoothing and backtracking procedures are placed in sequentially numbered cycle files.&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
&lt;br /&gt;
TINKER also contains programs that are able to perform MM optimisations:&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
&#039;&#039;&#039;MINIMIZE&#039;&#039;&#039;&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;The MINIMIZE program performs a limited memory L-BFGS minimization of an input structure over Cartesian coordinates using a modified version of the algorithm of Jorge Nocedal. The method requires only the potential energy and gradient at each step along the minimization pathway. It requires storage space proportional to the number of atoms in the structure. The MINIMIZE procedure is recommended for preliminary minimization of trial structures to an rms gradient of 1.0 to 0.1 kcal/mole/Å. It has a relatively fast cycle time and is tolerant of poor initial structures, but converges in a slow, linear fashion near the minimum. The user supplies the name of the TINKER .xyz coordinates file and a target rms gradient value at which the minimization will terminate. Output consists of minimization statistics written to the screen or redirected to an output file, and the new coordinates written to updated .xyz files or to cycle files.&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;OPTIMIZE&#039;&#039;&#039;&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;The OPTIMIZE program performs a optimally conditioned variable metric minimization of an input structure over Cartesian coordinates using an algorithm due to William Davidon. The method does not perform line searches, but requires computation of energies and gradients as well as storage for an estimate of the inverse Hessian matrix. The program operates on Cartesian coordinates from a TINKER .xyz file. OPTIMIZE will typically converge somewhat faster and more completely than MINIMIZE. However, the need to store and manipulate a full inverse Hessian estimate limits its use to structures containing less than a few hundred atoms on workstation class machines. As with the other minimizers, OPTIMIZE needs input coordinates and an rms gradient cutoff criterion. The output coordinates are saved in updated .xyz files or as cycle files.&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;NEWTON&#039;&#039;&#039;&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt; A truncated Newton minimization method which requires potential energy, gradient and Hessian information. This procedure has significant advantages over standard Newton methods, and is able to minimize very large structures completely. Several options are provided with respect to minimization method and preconditioning of the Newton equations. The default options are recommended unless the user is familiar with the math involved. This program operates in Cartesian coordinate space and is fairly tolerant of poor input structures. Typical algorithm iteration times are longer than with nonlinear conjugate gradient or variable metric methods, but many fewer iterations are required for complete minimization. NEWTON is usually the best choice for minimizations to the 0.01 to 0.000001 kcal/mole/Å level of rms gradient convergence. Tests for directions of negative curvature can be removed, allowing NEWTON to be used for optimization to conformational transition state structures (this only works if the starting point is very close to the transition state). Input consists of a TINKER .xyz coordinates file; output is an updated set of minimized coordinates and minimization statistics.&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
&lt;br /&gt;
==Tues 17th Nov 2009==&lt;br /&gt;
&lt;br /&gt;
Comparing 2 .pdb files:&lt;br /&gt;
&lt;br /&gt;
*As downloaded from PDB&lt;br /&gt;
*Converted to .xyz in TINKER then back to .pdb&lt;br /&gt;
&lt;br /&gt;
These files were then cleaned up to compare in a terminal using the &amp;quot;diff&amp;quot; command:&lt;br /&gt;
&lt;br /&gt;
*Header information and end information removed&lt;br /&gt;
*3 columns to the right in PDB version were removed to become consistent with the TINEKR file&lt;br /&gt;
*The first column &amp;quot;ATOM&amp;quot; was removed effectively making the line numbers equal to the atom numbers in both files&lt;br /&gt;
&lt;br /&gt;
==Wed 18th Nov 2009==&lt;br /&gt;
&lt;br /&gt;
1st diff was on line 52, a HYDROGEN ATOM:&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
&#039;&#039;52  HE2 HIS A   3    13.349 -10.803   9.527&#039;&#039;&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
added to the TINKER output, not in PDB file.&lt;br /&gt;
&lt;br /&gt;
2nd diff was on line 657, a HYDROGEN ATOM:&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
&#039;&#039;657  HE2 GLU A  46	 -0.867  -5.216  -0.822&#039;&#039;&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
added to the PDB file, not in TINKER output&lt;br /&gt;
&lt;br /&gt;
3rd diff was on line 1007, a HYDROGEN ATOM (number 1008):&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
&#039;&#039;5HG  CYS A  69	 -8.909  -5.194   5.526&#039;&#039;&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
added to the TINKER output, not in PDB file.&lt;br /&gt;
&lt;br /&gt;
4th diff was  on line 1621, a HYDROGEN ATOM (number 1623):&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
&#039;&#039;HE2 HIS A 108	  4.663   7.459   6.879&#039;&#039;&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
added to the TINKER output, not in PDB file.&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Following these reconciliations, both files were exactly the same with 1911 atoms present in each. 3 atoms were removed from the TINKER file and 1 was removed from the PDB file.&lt;br /&gt;
&lt;br /&gt;
*NOTE - loading the original .pdb from PDB to Gaussview then converting to a .gjf results in an error &amp;quot;1 or more PDB atom names has a &#039; or * character which can cause problems for Gaussian&amp;quot; - this is because the PDB file has extra atoms on the end of the file to define a highlighted chromophore... After these were removed in a txt editor the .gjf save went smoothly. Infact after this removal the file will run an AMBEr single point energy calculation fine - cf running when the extra info is in the bottom.&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
&lt;br /&gt;
All differences in files were due to protonation states of atoms... not the AA backbone.&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
&lt;br /&gt;
===TINKER vs PDB===&lt;br /&gt;
[[Image:TINKER PROTON.png]]&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
[[Image:PDB_PROTON.png]]&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
The highlighted fragments represent where the respective file version is protonated and the other is not.&lt;br /&gt;
&lt;br /&gt;
==Fri 20th Nov 2009==&lt;br /&gt;
&lt;br /&gt;
In an attempt to load the 1W7S structure to TINKER and successfully convert it it a .xyz format the following procedure was undertaken:&lt;br /&gt;
&lt;br /&gt;
*&lt;br /&gt;
&lt;br /&gt;
==Mon 23rd Nov 2009==&lt;br /&gt;
&lt;br /&gt;
===Protonation of GFP at various pH conditions===&lt;br /&gt;
&lt;br /&gt;
With 230 amino acid residues, each possessing a unique pKa in GFP it is obvious that there is a huge spectrum of protonation states available at a single solvation pH. Since protons cannot be detected using X-ray diffraction, protonation states must be uncovered via a difference method. Several questions arise when thinking about protonation of the protein:&lt;br /&gt;
&lt;br /&gt;
*Which residues have the ability to be protonated/deprotonated&lt;br /&gt;
*Is the PDB structure protonated for a specific pH (Bio pH)&lt;br /&gt;
*How does the protonation of the protein change through a variety of solvation pHs&lt;br /&gt;
*Does the protonation state affect the chromophore photophysics (electronic embedding)&lt;br /&gt;
&lt;br /&gt;
Answering these questions:&lt;br /&gt;
&lt;br /&gt;
*Polar amino acids are those that can be protonated/deprotonated, these are identified by Gaussview and shown in the atom list of the molecule with the option of setting them to either protonated/deprotonated.&lt;br /&gt;
*Dont know if the current structure is protonated to any specific pH... this is unclear until protonate at certain pHs&lt;br /&gt;
*The protonation will be tested across a range of pHs to see how protonation changes&lt;br /&gt;
&lt;br /&gt;
==Tues 24th Nov 2009==&lt;br /&gt;
&lt;br /&gt;
The pKa value of a residue will indicate protonation state as a function of pH. If you know the pH of your solution then if the pKa of a given functional group is lower than this pH then the functional group will be unprotonated. If the pKa is higher than the pH then the functional group will be protonated. &#039;&#039;&#039;Beware that pKa values can be distorted by local conformational effects.&#039;&#039;&#039; The structure is protonated as though it were in pH 7 as it is loaded into Gaussview. Since hydrogens are defined in the PDB file it could be that the author added protons under these conditions.&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Histadine has 2 protonation points on the imidizole R chain:&lt;br /&gt;
&lt;br /&gt;
{|&lt;br /&gt;
| align=&amp;quot;center&amp;quot; style=&amp;quot;background:#f0f0f0;&amp;quot;|&#039;&#039;&#039;Yes-Yes&#039;&#039;&#039;&lt;br /&gt;
| align=&amp;quot;center&amp;quot; style=&amp;quot;background:#f0f0f0;&amp;quot;|&#039;&#039;&#039;Yes-No&#039;&#039;&#039;&lt;br /&gt;
| align=&amp;quot;center&amp;quot; style=&amp;quot;background:#f0f0f0;&amp;quot;|&#039;&#039;&#039;No-Yes&#039;&#039;&#039;&lt;br /&gt;
|-&lt;br /&gt;
|&amp;lt;jmol&amp;gt;&lt;br /&gt;
  &amp;lt;jmolApplet&amp;gt;&lt;br /&gt;
    &amp;lt;uploadedFileContents&amp;gt;Protein_yes_yes.mol&amp;lt;/uploadedFileContents&amp;gt;&lt;br /&gt;
    &amp;lt;color&amp;gt;#FFFFFF&amp;lt;/color&amp;gt;&lt;br /&gt;
    &amp;lt;size&amp;gt;400&amp;lt;/size&amp;gt;&lt;br /&gt;
    &amp;lt;text&amp;gt;Yes Yes&amp;lt;/text&amp;gt;&lt;br /&gt;
  &amp;lt;/jmolApplet&amp;gt;&lt;br /&gt;
&amp;lt;/jmol&amp;gt;||&amp;lt;jmol&amp;gt;&lt;br /&gt;
  &amp;lt;jmolApplet&amp;gt;&lt;br /&gt;
    &amp;lt;uploadedFileContents&amp;gt;Protein_yes_no.mol&amp;lt;/uploadedFileContents&amp;gt;&lt;br /&gt;
    &amp;lt;color&amp;gt;#FFFFFF&amp;lt;/color&amp;gt;&lt;br /&gt;
    &amp;lt;size&amp;gt;400&amp;lt;/size&amp;gt;&lt;br /&gt;
    &amp;lt;text&amp;gt;Yes Yes&amp;lt;/text&amp;gt;&lt;br /&gt;
  &amp;lt;/jmolApplet&amp;gt;&lt;br /&gt;
&amp;lt;/jmol&amp;gt;||&amp;lt;jmol&amp;gt;&lt;br /&gt;
  &amp;lt;jmolApplet&amp;gt;&lt;br /&gt;
    &amp;lt;uploadedFileContents&amp;gt;Protein_no_yes.mol&amp;lt;/uploadedFileContents&amp;gt;&lt;br /&gt;
    &amp;lt;color&amp;gt;#FFFFFF&amp;lt;/color&amp;gt;&lt;br /&gt;
    &amp;lt;size&amp;gt;400&amp;lt;/size&amp;gt;&lt;br /&gt;
    &amp;lt;text&amp;gt;Yes Yes&amp;lt;/text&amp;gt;&lt;br /&gt;
  &amp;lt;/jmolApplet&amp;gt;&lt;br /&gt;
&amp;lt;/jmol&amp;gt;&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
The imidazole sidechain of histidine has a pKa of approximately 6, and overall, the amino acid has a pKa of 7.6. This means that at physiologically relevant pH values, relatively small shifts in pH will change its average charge. Below a pH of 6, the imidazole ring is mostly protonated as described by the Henderson–Hasselbalch equation. When protonated, the imidazole ring bears two NH bonds and has a positive charge. The positive charge is equally distributed between both nitrogens and can be represented with two equally important resonance structures.&lt;br /&gt;
&lt;br /&gt;
===Protonation===&lt;br /&gt;
&lt;br /&gt;
At pH 7 the PDB file protonations are correct for all residues. Moving to pH 5 the histadine residues become protonated (pKa 6)- this is the only change. At more alkaline pHs it would take pH 9.1 to make any effect (deprotonating CYS and then TYR at 9.7)&lt;br /&gt;
&lt;br /&gt;
===Mini-Biomolecule===&lt;br /&gt;
&lt;br /&gt;
To test ONIOM calculations vs pure DFT methods and other calculation types on a bio system that is smaller than GFP (computationally less expensive, can streamline the real calcs)&lt;br /&gt;
&lt;br /&gt;
Trying a molecule analogous to the example given in the ONIOM tutorial...&lt;br /&gt;
&lt;br /&gt;
&amp;lt;jmol&amp;gt;&lt;br /&gt;
  &amp;lt;jmolApplet&amp;gt;&lt;br /&gt;
    &amp;lt;uploadedFileContents&amp;gt;Biomolecule1.mol&amp;lt;/uploadedFileContents&amp;gt;&lt;br /&gt;
    &amp;lt;color&amp;gt;#FFFFFF&amp;lt;/color&amp;gt;&lt;br /&gt;
    &amp;lt;size&amp;gt;400&amp;lt;/size&amp;gt;&lt;br /&gt;
    &amp;lt;text&amp;gt;Yes Yes&amp;lt;/text&amp;gt;&lt;br /&gt;
  &amp;lt;/jmolApplet&amp;gt;&lt;br /&gt;
&amp;lt;/jmol&amp;gt;&lt;br /&gt;
&lt;br /&gt;
==Thurs 26th Nov 2009==&lt;br /&gt;
&lt;br /&gt;
===Checking the GFP input will run===&lt;br /&gt;
&lt;br /&gt;
Procedure to get 1WS7 to run an AMBER energy calculation:&lt;br /&gt;
&lt;br /&gt;
*Load the PDB file without water&lt;br /&gt;
*Try opt amber but will fail due to missing parameters (all in the chromophore)&lt;br /&gt;
*Set these parameters to 0 since they cancel in ONIOM&lt;br /&gt;
*Freeze the atotms involving missing paramters&lt;br /&gt;
*Energy AMBER will now run for the file - NO CHARGES - 9.07881632au&lt;br /&gt;
*Open the Atom Editor, Edit, MM charges, Assign all atoms&lt;br /&gt;
*Manually edit any missing charges&lt;br /&gt;
*Energy AMBER will now run including charges (basic AMBER94 charges) - 0.260391165au&lt;br /&gt;
&lt;br /&gt;
Charges in this file did not sum to an integer number - the only residue with a non integer charge was no.64 with a charge of 0.61160000&lt;br /&gt;
&lt;br /&gt;
 MM sanity checks:&lt;br /&gt;
 All charges sum to:               -5.38840000&lt;br /&gt;
 Charges of atoms sum to:          -5.38840000&lt;br /&gt;
 Layer 1 (S) has     34 atoms and charge   0.61160000 sum=   0.61160000&lt;br /&gt;
 Layer 2 (R) adds  3574 atoms and charge  -6.00000000 sum=  -5.38840000&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
 WARNING -- The following centers have non-zero partial&lt;br /&gt;
 charges, but zero Vanderwaals parameters. These centers&lt;br /&gt;
 can collapse onto other centers with non-zero charge&lt;br /&gt;
 Center    Charge&lt;br /&gt;
   1833  0.423900&lt;br /&gt;
   1890  0.410200&lt;br /&gt;
   2029  0.427500&lt;br /&gt;
   2043  0.427500&lt;br /&gt;
   2080  0.410200&lt;br /&gt;
   2089  0.399200&lt;br /&gt;
   2123  0.410200&lt;br /&gt;
   2179  0.410200&lt;br /&gt;
   2186  0.410200&lt;br /&gt;
   2260  0.410200&lt;br /&gt;
   2287  0.410200&lt;br /&gt;
   2294  0.410200&lt;br /&gt;
   2347  0.427500&lt;br /&gt;
   2369  0.399200&lt;br /&gt;
   2469  0.427500&lt;br /&gt;
   2508  0.399200&lt;br /&gt;
   2551  0.410200&lt;br /&gt;
   2619  0.399200&lt;br /&gt;
   2639  0.410200&lt;br /&gt;
   2714  0.410200&lt;br /&gt;
   2924  0.399200&lt;br /&gt;
   2939  0.399200&lt;br /&gt;
   2950  0.427500&lt;br /&gt;
   2981  0.399200&lt;br /&gt;
   3180  0.427500&lt;br /&gt;
   3233  0.399200&lt;br /&gt;
   3262  0.410200&lt;br /&gt;
   3361  0.399200&lt;br /&gt;
   3377  0.427500&lt;br /&gt;
   3384  0.410200&lt;br /&gt;
   3397  0.427500&lt;br /&gt;
   3418  0.427500&lt;br /&gt;
   3562  0.410200&lt;br /&gt;
   3593  0.410200&lt;br /&gt;
   3608  0.410200&lt;br /&gt;
&lt;br /&gt;
* oniom(pm6:amber=hardfirst) geom=connectivity was run successfully, resulting in an energy of 0.210085730788au&lt;br /&gt;
&lt;br /&gt;
This still resulted in some charge problems:&lt;br /&gt;
&lt;br /&gt;
 MM sanity checks:&lt;br /&gt;
 All charges sum to:               &#039;&#039;&#039;-5.38840000&#039;&#039;&#039;&lt;br /&gt;
 Charges of atoms sum to:           &#039;&#039;&#039;0.79320000&#039;&#039;&#039;&lt;br /&gt;
 Residue  63 PDB Number    &#039;&#039;&#039;64_A PHE has&#039;&#039;&#039;     1 atoms, and a charge of   0.59730000&lt;br /&gt;
 Residue  64 PDB Number    &#039;&#039;&#039;66_A CSY has&#039;&#039;&#039;    34 atoms, and a charge of   0.61160000&lt;br /&gt;
 Residue  65 PDB Number    &#039;&#039;&#039;68_A VAL has&#039;&#039;&#039;     1 atoms, and a charge of  -0.41570000&lt;br /&gt;
 WARNING -- The following centers have non-zero partial&lt;br /&gt;
 charges, but zero Vanderwaals parameters. These centers&lt;br /&gt;
 can collapse onto other centers with non-zero charge&lt;br /&gt;
 Center    Charge&lt;br /&gt;
   &#039;&#039;&#039;3608  0.410200&#039;&#039;&#039;&lt;br /&gt;
 MMInit generated parameter data with length LenPar=       40470.&lt;br /&gt;
 Standard basis: Dummy (5D, 7F)&lt;br /&gt;
 There are    36 symmetry adapted basis functions of A   symmetry.&lt;br /&gt;
 Integral buffers will be    262144 words long.&lt;br /&gt;
 Raffenetti 1 integral format.&lt;br /&gt;
 Two-electron integral symmetry is turned on.&lt;br /&gt;
    36 basis functions,    36 primitive gaussians,    36 cartesian basis functions&lt;br /&gt;
    27 alpha electrons       27 beta electrons&lt;br /&gt;
       nuclear repulsion energy       200.9124124811 Hartrees.&lt;br /&gt;
 NAtoms= 3608 NActive=   36 NUniq=   36 SFac= 7.50D-01 NAtFMM=   80 NAOKFM=F Big=F&lt;br /&gt;
 AMBER calculation of energy.&lt;br /&gt;
 Energy=    0.110474559     NIter=   0.&lt;br /&gt;
 Dipole moment=       3.390901       5.889448      -2.505476&lt;br /&gt;
 ONIOM: saving gridpoint  1&lt;br /&gt;
 ONIOM: restoring gridpoint  3&lt;br /&gt;
 ONIOM: calculating energy.&lt;br /&gt;
 ONIOM: gridpoint  1 method:  low   system:  model energy:     0.110474559177&lt;br /&gt;
 ONIOM: gridpoint  2 method:  high  system:  model energy:     0.060169124949&lt;br /&gt;
 ONIOM: gridpoint  3 method:  low   system:  real  energy:     0.260391165016&lt;br /&gt;
 ONIOM: extrapolated energy =       0.210085730788&lt;br /&gt;
 ONIOM: calculating electric field derivatives.&lt;br /&gt;
 ONIOM: Integrating ONIOM file  4 number   619&lt;br /&gt;
 ONIOM: Dipole        =-2.76174164D+01-1.34762322D+02 3.99239712D+01&lt;br /&gt;
 ONIOM: Dipole moment (Debye):&lt;br /&gt;
    X=   -70.1965    Y=  -342.5316    Z=   101.4766  Tot=   364.0782&lt;br /&gt;
 ONIOM: Integrating ONIOM file  5 number   695&lt;br /&gt;
&lt;br /&gt;
An ONIOM calculation was run using a DFT method on the high level and AMBER on the low level:  &#039;&#039;oniom(b3lyp/3-21g:amber=hardfirst) geom=connectivity&#039;&#039; which ran successfully...&lt;br /&gt;
&lt;br /&gt;
===Charges===&lt;br /&gt;
&lt;br /&gt;
Charges in the protein are important for the calculation that will be run on GFP eventually since they provide the electrostatic field or &amp;quot;atmosphere&amp;quot; which surrounds the chromophore, polarizing the electron density used by the DFT method to evaluate the chromophore. Gaussian offers 2 types of influence which the MM portion can exert on the QM part:&lt;br /&gt;
&lt;br /&gt;
* Mechanical Embedding&lt;br /&gt;
* Electronic Embedding&lt;br /&gt;
&lt;br /&gt;
Mechanical embedding does not use any electronic components to affect the central high region. Electronic embedding does use the charge generated in the MM region to polarize the high region.&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
At pH 7 the GFP molecule is in a 6- charge state according to assigning AMBER charges and summing the residue charges (available in the out file). 3 options were considered at this stage for treating charges:&lt;br /&gt;
&lt;br /&gt;
* Ignore charges on the entire molecule&lt;br /&gt;
* Use AMBER94 assigned partial charges on the low region&lt;br /&gt;
* Use Qeq to assign charges based on an overall charge of -6&lt;br /&gt;
&lt;br /&gt;
The first option can be discarded since the electronic atmosphere is of particular interest here. The second 2 options are to be investigated. See below (on a small Biomolecule)&lt;br /&gt;
&lt;br /&gt;
ON a 4 residue chain (in mini biomolecule folder) - As in the GFP tutorial on the wiki, a energy calculation was submitted with the keyword amber=qeq to get back the charges across the molecule. (not entirely sure what this does yet). AS a test of the effect of charges on a molecule a test molecule was compiled in TINKER consisting of 4 residues with 0 overall charges....&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;jmol&amp;gt;&lt;br /&gt;
  &amp;lt;jmolAppletButton&amp;gt;&lt;br /&gt;
    &amp;lt;uploadedFileContents&amp;gt;Protein1_AMBER_OPT_qeq.mol&amp;lt;/uploadedFileContents&amp;gt;&lt;br /&gt;
    &amp;lt;text&amp;gt;Test Protien&amp;lt;/text&amp;gt;&lt;br /&gt;
    &amp;lt;color&amp;gt;#FFFFFF&amp;lt;/color&amp;gt;&lt;br /&gt;
    &amp;lt;size&amp;gt;600&amp;lt;/size&amp;gt;&lt;br /&gt;
    &amp;lt;/jmolAppletButton&amp;gt;&lt;br /&gt;
&amp;lt;/jmol&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
&lt;br /&gt;
{|&lt;br /&gt;
|No Charges|| 0.01940311au&lt;br /&gt;
|-&lt;br /&gt;
|Amber Charges|| -0.18117151au&lt;br /&gt;
|-&lt;br /&gt;
|Qeq Charges|| -0.23038199au&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
Clearly the Qeq shows the lowest energy - the considerable differences in energy show that charges are an important part of the input.&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
&lt;br /&gt;
===Testing Charges and EE on a small system===&lt;br /&gt;
&lt;br /&gt;
&amp;lt;jmol&amp;gt;&lt;br /&gt;
  &amp;lt;jmolAppletButton&amp;gt;&lt;br /&gt;
    &amp;lt;uploadedFileContents&amp;gt;Chromopre_molecule_B3LYP_6-31G(d).mol&amp;lt;/uploadedFileContents&amp;gt;&lt;br /&gt;
    &amp;lt;script&amp;gt;spin on&amp;lt;/script&amp;gt;&lt;br /&gt;
    &amp;lt;text&amp;gt;Test Protien&amp;lt;/text&amp;gt;&lt;br /&gt;
    &amp;lt;color&amp;gt;#FFFFFF&amp;lt;/color&amp;gt;&lt;br /&gt;
    &amp;lt;size&amp;gt;600&amp;lt;/size&amp;gt;&lt;br /&gt;
    &amp;lt;/jmolAppletButton&amp;gt;&lt;br /&gt;
&amp;lt;/jmol&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Energies of the optimisations using various charge approximations and with/without EE&lt;br /&gt;
{|class=&amp;quot;wikitable&amp;quot; border=&amp;quot;1&amp;quot;&lt;br /&gt;
!Method||Mechanical Embedding||Electronic Embedding||Computational Time (mechanical embedding)||Computational Time (electronic embedding)&lt;br /&gt;
|-&lt;br /&gt;
|B3LYP 6-31G*||-1406.04||N/A||11 hours 18 minutes 44.7 seconds||11 hours 18 minutes 44.7 seconds&lt;br /&gt;
|-&lt;br /&gt;
|ONIOM B3LYP 6-31G*:AMBER (No Charges)||-783.2225||-783.2223||2 hours 13 minutes  4.1 seconds||&lt;br /&gt;
|-&lt;br /&gt;
|ONIOM B3LYP 6-31G*:AMBER (Amber Charges)|| || ||  ||&lt;br /&gt;
|-&lt;br /&gt;
|ONIOM B3LYP 6-31G*:AMBER (Qeq Charges)||-783.2467|| ||2 hours  6 minutes  8.6 seconds ||&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
NB. &lt;br /&gt;
* B3LYP 6-31G* and ONIOM B3LYP 6-31G*:AMBER (No Charges) were from the same starting geometry.&lt;br /&gt;
* ONIOM B3LYP 6-31G*:AMBER (Amber Charges) and ONIOM B3LYP 6-31G*:AMBER (Qeq Charges) were from the same starting geometry.&lt;br /&gt;
* All Electronic embedding is from the same starting point - Can compare times.&lt;br /&gt;
&lt;br /&gt;
Dipole moments using various charge approximations and with/without EE&lt;br /&gt;
{|class=&amp;quot;wikitable&amp;quot; border=&amp;quot;1&amp;quot;&lt;br /&gt;
!Method||Dipole moment (No EE)||Dipole moment (With EE)&lt;br /&gt;
|-&lt;br /&gt;
|B3LYP 6-31G*||7.4914||N/A&lt;br /&gt;
|-&lt;br /&gt;
|ONIOM B3LYP 6-31G*:AMBER (No Charges)||3.2328||3.2241&lt;br /&gt;
|-&lt;br /&gt;
|ONIOM B3LYP 6-31G*:AMBER (Amber Charges)||5.2794||5.2865&lt;br /&gt;
|-&lt;br /&gt;
|ONIOM B3LYP 6-31G*:AMBER (Qeq Charges)||8.173||8.1654&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
==2nd Dec 2009==&lt;br /&gt;
&lt;br /&gt;
A single point energy of 1w7s was run with the additional qeq keyword to return partial charges (overall charge set to -6)&lt;br /&gt;
This new molecule was submitted for AMBER opt but failed after around 700 iterations... this could be due to the error:&lt;br /&gt;
&lt;br /&gt;
 WARNING -- The following centers have non-zero partial&lt;br /&gt;
 charges, but zero Vanderwaals parameters. These centers&lt;br /&gt;
 can collapse onto other centers with non-zero charge&lt;br /&gt;
&lt;br /&gt;
When the atoms defined in this error had their partial charges set to 0.00, the result was a fully converged AMBER optimisation:&lt;br /&gt;
&lt;br /&gt;
*Frozen chromophore.&lt;br /&gt;
*pH 7 giving a charge of -6&lt;br /&gt;
*Charges assigned using Qeq method&lt;br /&gt;
*Zero Van der Waals parameter atoms partial charges set to 0.00&lt;br /&gt;
&lt;br /&gt;
Looking at what atoms are not defined in terms of Van der Waals paramters it is ALWAYS an aloholic proton that is not parameterised...&lt;br /&gt;
&lt;br /&gt;
Tutorial used AMBER charges then ran Qeq for the missing charges... i think this is what she thought she did... gaussian &amp;quot;amber=qeq&amp;quot; will overwrite all charges.. need to use &amp;quot;amber=uncharged&amp;quot;&lt;br /&gt;
so that we dont overwrite the amber charges...&lt;br /&gt;
&lt;br /&gt;
In effect full Qeq charges were assigned in the tutorial. Also overall charge is 0 - will check this.&lt;br /&gt;
&lt;br /&gt;
===The GFP tutorial===&lt;br /&gt;
&lt;br /&gt;
*AMBER charges (AMBER94 charges) were assigned to the GFP molecule in the tutorial&lt;br /&gt;
*she notices correctly that some are missing in the chromophore and so decides to assign them via Qeq&lt;br /&gt;
*Runs Qeq on the GFP molecule with a &#039;&#039;&#039;0 charge&#039;&#039;&#039; which is wrong! When run the AMBER charges and look at the log file we can see that the overall charge should be &#039;&#039;&#039;+4&#039;&#039;&#039; for that protonation state.&lt;br /&gt;
*Since Qeq is an equilibrium method, partial charges are solved with the constraint that their sum must equal the overall charge.&lt;br /&gt;
*These Qeq charges should therefore not fit with an integer sum when pasted into the AMBER charges.&lt;br /&gt;
&lt;br /&gt;
===Qeq versus AMBER94 charges===&lt;br /&gt;
&lt;br /&gt;
Ogawa et al. state:&lt;br /&gt;
&lt;br /&gt;
 Geometry optimizations of a crambin protein were performed using force fields combined with the&lt;br /&gt;
 CQEq. The CQEq charges were found to be more comparable to the Mulliken charges obtained by the ab initio molecular orbital&lt;br /&gt;
 calculation than the fixed charges used in the AMBER force field. Furthermore, the CQEq charges give more realistic charge redistribution&lt;br /&gt;
 between amino acids in the crambin.&lt;br /&gt;
&lt;br /&gt;
Mini Biomolecule calculations:&lt;br /&gt;
&lt;br /&gt;
{|class=&amp;quot;wikitable&amp;quot; border=&amp;quot;1&amp;quot;&lt;br /&gt;
!Method||Dipole moment (No EE)||Dipole moment (With EE)&lt;br /&gt;
|-&lt;br /&gt;
|B3LYP 6-31G*||7.4914||N/A&lt;br /&gt;
|-&lt;br /&gt;
|ONIOM B3LYP 6-31G*:AMBER (No Charges)||3.2328||3.2241&lt;br /&gt;
|-&lt;br /&gt;
|ONIOM B3LYP 6-31G*:AMBER (Amber Charges)||5.2794||5.2865&lt;br /&gt;
|-&lt;br /&gt;
|ONIOM B3LYP 6-31G*:AMBER (Qeq Charges)||8.173||8.1654&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
Clearly the Qeq charges are in better agreement with the DFT method than no or AMBER charges.&lt;br /&gt;
&lt;br /&gt;
These frequencies are calulated using mechanical embedding!&lt;br /&gt;
&lt;br /&gt;
{|class=&amp;quot;wikitable&amp;quot; border=&amp;quot;1&amp;quot;&lt;br /&gt;
!Method||C=O||C=C&lt;br /&gt;
|-&lt;br /&gt;
|B3LYP 6-31G*||1799.65||1701.86&lt;br /&gt;
|-&lt;br /&gt;
|ONIOM B3LYP 6-31G*:AMBER (No Charges)||1802.25||1703.63&lt;br /&gt;
|-&lt;br /&gt;
|ONIOM B3LYP 6-31G*:AMBER (Amber Charges)||1803.37||1701.67&lt;br /&gt;
|-&lt;br /&gt;
|ONIOM B3LYP 6-31G*:AMBER (Qeq Charges)||1800.18||1702.86&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
As a result it seems obvious that a Qeq charge assignment in constraint by the correct total charge gives results most comparable to the pure QM method (both dipoles and frequencies)&lt;br /&gt;
&lt;br /&gt;
==3rd Dec 2009==&lt;br /&gt;
===ESP Charges===&lt;br /&gt;
&lt;br /&gt;
ESP calculates the expectation values of the electrostatic potential of a molecule on a uniform distribution of points. The resultant ESP surface is then fit to atom centered charges that best reproduce the electron distribution in a least-squares method. The set of points defining the default surface is generated according to the algorithm of Connolly.&lt;br /&gt;
&lt;br /&gt;
ESP charges thus require a QM calculation to create the molecular electrostatic potential used to fit atom centered charges. The size of GFP means that this is not an efficient method for use in this case?&lt;br /&gt;
&lt;br /&gt;
===Qeq Charges===&lt;br /&gt;
&lt;br /&gt;
The charge equilibration (QEq) method treats the atomic charge variation through the equilibration of electronegativities at atomic sites. The advantage of the QEq method is that the polarization effect in MM is taken into account by variation of the charge distribution in the molecules. The atomic charge variation relates to the intramolecular geometrical changes.&lt;br /&gt;
&lt;br /&gt;
===AMBER charges===&lt;br /&gt;
&lt;br /&gt;
==8th Dec 2009==&lt;br /&gt;
&lt;br /&gt;
A input creation page is now avaliable:&lt;br /&gt;
&lt;br /&gt;
https://www.ch.ic.ac.uk/wiki/index.php/AndyForesterGFP&lt;br /&gt;
&lt;br /&gt;
The final input file had protein information pasted in using text to columns in excel and now displays PDB residue information and PDB secondary structure information (nice cartoon pictures in ChemBio3D!)&lt;br /&gt;
&lt;br /&gt;
Interesting is a paper that uses TD-DFT on GFP (in my papers folder on the mac), the group uses a 3 stage minimisation method from the 1GFL PDB structure:&lt;br /&gt;
&lt;br /&gt;
* Freeze all atoms in the backbone of the protein and optimise after adding hydrogens&lt;br /&gt;
* Optimise the chromophore at a semi-empirical level&lt;br /&gt;
* They cut out a chromophore section but i could then perform a full QM/MM geometry optimisation&lt;br /&gt;
&lt;br /&gt;
===A more charged mini molecule to test electronic embedding===&lt;br /&gt;
&lt;br /&gt;
&amp;lt;jmol&amp;gt;&lt;br /&gt;
  &amp;lt;jmolAppletButton&amp;gt;&lt;br /&gt;
    &amp;lt;uploadedFileContents&amp;gt;More_charge_B3LYP_6_31G(d).mol&amp;lt;/uploadedFileContents&amp;gt;&lt;br /&gt;
    &amp;lt;text&amp;gt;The molecule with 2+ charge&amp;lt;/text&amp;gt;&lt;br /&gt;
    &amp;lt;color&amp;gt;#FFFFFF&amp;lt;/color&amp;gt;&lt;br /&gt;
    &amp;lt;size&amp;gt;600&amp;lt;/size&amp;gt;&lt;br /&gt;
    &amp;lt;/jmolAppletButton&amp;gt;&lt;br /&gt;
&amp;lt;/jmol&amp;gt;&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
&lt;br /&gt;
The molecule was submitted with 3 different job lines:&lt;br /&gt;
&lt;br /&gt;
* b3lyp 6-31g(d)&lt;br /&gt;
* oniom(b3lyp 6-31g(d):amber=(hardfirst,qeq))&lt;br /&gt;
* oniom(b3lyp 6-31g(d):amber=(hardfirst,qeq))=embedcharge&lt;br /&gt;
&lt;br /&gt;
Opt and Freq jobs were completed successfully. The charge breakdown input was:&lt;br /&gt;
&lt;br /&gt;
* Low Real 2 1&lt;br /&gt;
* High Model 0 1&lt;br /&gt;
* Low Model 0 1&lt;br /&gt;
&lt;br /&gt;
The aim of the calculations was to test electronic embedding versus the mechanical embedding. A paper suggests that there is a large difference in accuracy between the two methods. Qeq charge equilibration was used in both cases as it has already been proved to be much more effective than AMBER assigned charges. &#039;&#039;&#039;It has consistently overestimated the dipole moment though&#039;&#039;&#039;.&lt;br /&gt;
&lt;br /&gt;
{|class=&amp;quot;wikitable&amp;quot; border=&amp;quot;1&amp;quot;&lt;br /&gt;
!Method||Dipole moment&lt;br /&gt;
|-&lt;br /&gt;
|B3LYP 6-31G*||8.02&lt;br /&gt;
|-&lt;br /&gt;
|ONIOM B3LYP 6-31G*:AMBER (ME)||12.22&lt;br /&gt;
|-&lt;br /&gt;
|ONIOM B3LYP 6-31G*:AMBER (EE)||12.23&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
These results are odd since we expect the EE to have some effect on the charge distribution in the molecule (it is polarising the QM region) comparing the frequencies as a further check:&lt;br /&gt;
&lt;br /&gt;
{|class=&amp;quot;wikitable&amp;quot; border=&amp;quot;1&amp;quot;&lt;br /&gt;
!Method||C=O||C=C&lt;br /&gt;
|-&lt;br /&gt;
|B3LYP 6-31G*||1764||1636&lt;br /&gt;
|-&lt;br /&gt;
|ONIOM B3LYP 6-31G*:AMBER (ME)||1797||1697&lt;br /&gt;
|-&lt;br /&gt;
|ONIOM B3LYP 6-31G*:AMBER (EE)||1797||1697&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
Again the EE has made no difference to the frequency of various strong vibrations in the molecule. The conclusion is thus either:&lt;br /&gt;
&lt;br /&gt;
* EE has no real effect on the moleucule&lt;br /&gt;
* EE is not working properly&lt;br /&gt;
&lt;br /&gt;
Assuming the latter further investigation discovered that the meechanism of EE is determined using scaling parameters for MM charges during electronic embedding in the QM calculations. As a default the charges are scaled 555500 using the nomenclature ScaleCharge=ijklmn where n is closest to the QM region (massively anti intuitive). The input is read left to right and not all scaling factors need be defined. Further to this, the value given is then multiplied by 0.2 to arrive at the actual scaling coefficient. The result is thus for an atom:&lt;br /&gt;
&lt;br /&gt;
* 1 bond from the QM region (n) = 0.2n           Default is 0&lt;br /&gt;
* 2 bonds from the QM region (m) = 0.2m       Default is 0&lt;br /&gt;
* 3 bonds from the QM region (l) = 0.2l            Default is 5&lt;br /&gt;
* 4 bonds from the QM region (k) = 0.2k          Default is 5&lt;br /&gt;
* 5 bonds from the QM region (j) = 0.2j            Default is 5&lt;br /&gt;
* 6 bonds from the QM region (i) = 0.2i            Default is 5&lt;br /&gt;
&lt;br /&gt;
So all the atoms (and partial charges) in the MM region in the test molecule were being scaled to 0 when performing the EE. This explains the fact that no difference was uncovered in the previous calculation set.&lt;br /&gt;
&lt;br /&gt;
A new calculation was submitted with the command scalecharge=555 so that all charges in the MM part will be scaled to 1 (0.2xn,m,l)&lt;br /&gt;
&lt;br /&gt;
===Charges...again===&lt;br /&gt;
&lt;br /&gt;
So since i deleted the charges on the atoms where they can collapse, the overall charge is now -19.6 from the original -6 minus the 13.6 in charge deleted. REALLY ANNOYING! This is a huge difference in charges so will have to be solved. One easy option would be to set the involved atoms to 0 then run QEQ on the atoms with unassigned charges... essentially distributing the -6 charge through all atoms except those with 0 VdW parameters.&lt;br /&gt;
&lt;br /&gt;
==Tues 15th Dec 2009==&lt;br /&gt;
&lt;br /&gt;
The more charged molecule failed once the charges were scaled to 1 since SCF convergence was not attained.&lt;br /&gt;
&lt;br /&gt;
==Mon 25th Jan 2010==&lt;br /&gt;
&lt;br /&gt;
TAO tool publication was very good read... has a good suggestion of an optimisation procedure:&lt;br /&gt;
&lt;br /&gt;
*Take the crystal structure&lt;br /&gt;
*Add Hydrogens&lt;br /&gt;
*Determine protonation&lt;br /&gt;
*MM geometry optimisation&lt;br /&gt;
*(MD?)&lt;br /&gt;
*(Look at several samples of the MD)&lt;br /&gt;
*Optimise a final structure&lt;br /&gt;
&lt;br /&gt;
==Thurs 28th Jan 2010==&lt;br /&gt;
&lt;br /&gt;
Tao Tools requires preprocess within AMBER tools to run the scripts correctly. Does not have HIS on the residue database... Amber defines the protonation state within its residue naming scheme... HIS is either HIE, HIP, HID depending on its protonation state. [http://www.poissonboltzmann.org/pdb2pqr/documentation/programmers-guide/naming-scheme]&lt;br /&gt;
&lt;br /&gt;
==Thurs 4th Jan 2010==&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;Preprocessing&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
To get TAO tools and also for completeness it is necessary to preprocess the PDB file before attempting to run ONIOM calculations, this includes:&lt;br /&gt;
&lt;br /&gt;
* Adding hydrogens&lt;br /&gt;
* Create a custom residue file for the chromophore&lt;br /&gt;
* Convert from PDB to gjf ONIOM input&lt;br /&gt;
&lt;br /&gt;
Ambertools are extremely helpful with this preprocessing especially since amber will be the low level theory used throughout this project. Within ambertools are a number of useful programs:&lt;br /&gt;
&lt;br /&gt;
* antechamber - has a parameterlookup function within amber and GAFF parameter library files.&lt;br /&gt;
* xLeap - builds the protein from its constituent residues - adds hydrogens according to library files - allows a parm file to be built to add custom residues.&lt;br /&gt;
&lt;br /&gt;
I will be working on using these 2 programs to create a full MM input file which is fully defined and correct.&lt;br /&gt;
&lt;br /&gt;
===xLeap===&lt;br /&gt;
&lt;br /&gt;
To load xLeap we must use X11 to get the graphical interface and the ssh -Y command to allow tunneling back to X11. Also the command line to start xLeap should be:  &#039;&#039;xleap -s -f /apps/ambertools/amber11/dat/leap/cmd/leaprc.ff03&#039;&#039;. This loads the ff03 parameter file and results in the start up of the graphical interface with:&lt;br /&gt;
&lt;br /&gt;
 Welcome to LEaP!&lt;br /&gt;
 Sourcing leaprc: /amber/dat/leap/cmd/leaprc&lt;br /&gt;
 Log file: ./leap.log&lt;br /&gt;
 Loading parameters: /amber/dat/leap/parm/parmME.dat&lt;br /&gt;
 Loading library: /amber/dat/leap/lib/all_nucleic94.lib&lt;br /&gt;
 Loading library: /amber/dat/leap/lib/all_aminoME.lib&lt;br /&gt;
 Loading library: /amber/dat/leap/lib/all_aminoctME.lib&lt;br /&gt;
 Loading library: /amber/dat/leap/lib/all_aminontME.lib&lt;br /&gt;
 Loading library: /amber/dat/leap/lib/ions94.lib&lt;br /&gt;
 Loading library: /amber/dat/leap/lib/water.lib&lt;br /&gt;
&lt;br /&gt;
This shows that the residue libraries have been loaded. We are then free to load a pdb file which will be read by xLeap and built according to the residues! In addition to the residue names xleap also requires information about the chain connectivity. Residues can be connected to other residues from both sides, only one side or not at all. xleap is fairly simple in that it expects that all residues in the pdb file are connected in the order in which they are listed unless they are separated by a &amp;quot;TER&amp;quot; card. In most cases where a single protein chains is being investigated like here - the TER card should end the residue information.&lt;br /&gt;
&lt;br /&gt;
To load a pdb use (ignore the &amp;lt;&amp;gt;) &#039;&#039;&amp;lt;name for use in xLeap&amp;gt;=loadpdb &amp;quot;&amp;lt;name of file&amp;gt;&amp;quot;&#039;&#039;. Make sure that the file is in your PATH else fully define the path in the filename.&lt;br /&gt;
&lt;br /&gt;
The pdb should then be read into xLeap and built. At the end of the output we get (I loaded ChainA of the 1W7S without any water or hydrogens):&lt;br /&gt;
&lt;br /&gt;
 Leap added 1765 missing atoms according to residue templates:&lt;br /&gt;
       1 Heavy&lt;br /&gt;
       1764 H / lone pairs&lt;br /&gt;
 The file contained 21 atoms not in residue templates&lt;br /&gt;
&lt;br /&gt;
This shows that Leap added hydrogens automatically and also a missing &amp;quot;heavy&amp;quot; atom (one which is not a H or lp):&lt;br /&gt;
&lt;br /&gt;
 Added missing heavy atom: .R&amp;lt;CHIE 229&amp;gt;.A&amp;lt;OXT 18&amp;gt;&lt;br /&gt;
&lt;br /&gt;
For amino acids, the traditional order of atom records within a residue is N, CA, C, O, followed by the sidechain atoms (CB, CG1, CG2 . . . ) in order first of increasing remoteness, and then branch. The extra oxygen at the carboxyl terminal is called &amp;quot; OXT&amp;quot;, and appears after all sidechain atoms.&lt;br /&gt;
&lt;br /&gt;
Now that we have established that there are 21 atoms that are in the non standard residue region we need to define a custom residue which describes this region (chromophore). This involves creating a prep file....&lt;br /&gt;
&lt;br /&gt;
I first drew up the undefined section of GFP (chromophore) in Gaussview5 and saved as a.pdb then:&lt;br /&gt;
&lt;br /&gt;
* Load in antechamber and convert to mol2 format using &#039;&#039;antechamber -i &amp;lt;input&amp;gt; -fi pdb -o &amp;lt;output&amp;gt; -fo mol2 -c bcc&#039;&#039;&lt;br /&gt;
* Use parmchk to make sure the molecule was defined in the GAFF force field - kicks out a .frcmod file with missing parameters (including dihedrals) - nice :D!&lt;br /&gt;
* Manually add the &amp;quot;1CSY&amp;quot; where a standalone &amp;quot;0&amp;quot; is in the mol2 file so Leap can read it&lt;br /&gt;
* Load the gaff parameter file to Leap using &#039;&#039;source leaprc.gaff&#039;&#039;&lt;br /&gt;
* Load the chromophore_CSY_full.mol2 to Leap using &#039;&#039;CSY=loadmol2 &amp;quot;ChromophoreGFP_CSY_full.mol2&amp;quot;&#039;&#039;&lt;br /&gt;
* Check the residue using &#039;&#039;check CSY&#039;&#039; - Notice missing parameters are there.&lt;br /&gt;
* Load the ChromophoreGFP_CSY_full.frcmod file created by parmchk using &#039;&#039;loadamberparams ChromophoreGFP_CSY_full.frcmod&#039;&#039;&lt;br /&gt;
* Now check the CSY residue again using &#039;&#039;check CSY&#039;&#039; - No Problems! woo hoo! Now I can create amber parameters :D&lt;br /&gt;
&lt;br /&gt;
==Tues 9th Feb 2010==&lt;br /&gt;
&lt;br /&gt;
===RESP Fitting===&lt;br /&gt;
&lt;br /&gt;
To get charges that are like those of the standard amber force field an RESP fitting method has to be used. This involves 3 steps:&lt;br /&gt;
&lt;br /&gt;
* First, the molecule studied is optimized to determine a stable minimum (using Gaussian).&lt;br /&gt;
* Then, this minimized structure is used to calculate a Molecular Electrostatic Potential (MEP) on a three-dimensional grid (using Gaussian).&lt;br /&gt;
* Finally, this grid is exported into the &amp;quot;RESP program&amp;quot; which is used to fit atom-centered charges to the MEP&lt;br /&gt;
&lt;br /&gt;
To assist in this procedure - I am using a program called RED - [http://q4md-forcefieldtools.org/RED/] which can be used in all 3 stages...&lt;br /&gt;
&lt;br /&gt;
* Ante_RED is used to prepare Gaussian input files for the RESP fit&lt;br /&gt;
* RED III is used to fit atom centered charges&lt;br /&gt;
&lt;br /&gt;
The procedure used to create the CSY file for RESP fitting is as follows:&lt;br /&gt;
&lt;br /&gt;
* Cut the CSY residue from the 1W7S protein structure and cap terminal ends with methyl groups&lt;br /&gt;
* Optimise this in Gaussian at PM6 level&lt;br /&gt;
* Run the optimised CSY geometry through ante_RED to create the Gaussian input file involving minimisation at HF level and calculation of the MEP - also get the .P2N file required for RED III&lt;br /&gt;
* Run this Gaussian input file&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;Input file for optimisation:&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;#P hf/6-31G* Opt=(Tight,CalcFC) Freq SCF(Conver=8) Test&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
This will run a HF level optimisation and frequency analysis.  &#039;&#039;Opt=tight&#039;&#039; tightens the cutoffs on forces and step size that are used to determine convergence. An optimization with Opt=Tight will take several more steps than with the default cutoffs. For molecular systems with very small force constants (low frequency vibrational modes), this may be necessary to ensure adequate convergence and reliability of frequencies computed in a subsequent job step. This option can only be used with Berny optimizations. &#039;&#039;Opt = CalcFC&#039;&#039; causes the calculation of an analytical Hessian (in HF). &#039;&#039;SCF(Conver=8)&#039;&#039; sets the SCF convergence criterion to 10^-8.&lt;br /&gt;
&lt;br /&gt;
===Parameter files for use in Gaussian (Gaussian only uses amber94)===&lt;br /&gt;
&lt;br /&gt;
Using external parameter files. It is possible to include MM function entries within an external file by replacing the entries within the input file with the usual Gaussian include file mechanism: e.g., @oplsaa.prm. Parameter files for various force fields are included within the main Gaussian directory (i.e., $g09root/g09). If you want to specify the entire force field via an external file, use an input file like the following:&lt;br /&gt;
&lt;br /&gt;
# UFF=SoftOnly&lt;br /&gt;
&lt;br /&gt;
Read-in force field example&lt;br /&gt;
&lt;br /&gt;
molecule specification&lt;br /&gt;
&lt;br /&gt;
@$g09root/g09/oplsaa.prm&lt;br /&gt;
&lt;br /&gt;
Note that the choice of a molecular mechanics keyword is arbitrary as the entire force field depends on the read-in functions and parameters and not on any internal ones (in other words, specifying Amber would yield the same result as UFF).&lt;br /&gt;
&lt;br /&gt;
==Tues 16th Feb 2010==&lt;br /&gt;
===RESP Fitting===&lt;br /&gt;
&lt;br /&gt;
Following optimisation of the fragment for RESP fitting as seen before. We now have to fit the RESP to a MEP, this involves a few calculations:&lt;br /&gt;
&lt;br /&gt;
* ESP calcs in gaussian&lt;br /&gt;
* Fitting calcs by resp in ambertools&lt;br /&gt;
&lt;br /&gt;
The result is a .mol2 containing the fitted charges. The process has 2  input files:&lt;br /&gt;
&lt;br /&gt;
* .log&lt;br /&gt;
* .p2n&lt;br /&gt;
&lt;br /&gt;
The p2n is created by ante_RED and controls the RESP fitting process. (See RED site). To get a fit that excludes the capping methyls we add a REMARK line:&lt;br /&gt;
&lt;br /&gt;
 REMARK INTRA-MCC 0.0 |  35  36  37  38  39  40  41  42 | R&lt;br /&gt;
&lt;br /&gt;
This must be formatted EXACTLY as shown here (2 spaces between the numbers). The numbers correspond to atoms numbers, the 0.0 constrains these atoms to 0.0 charge and the R tells RESP to remove these atoms from the 2nd round fit and the subsequent mol2 output. Leaving a fragment.&lt;br /&gt;
&lt;br /&gt;
The RED program can be run on CX1! I used the following Jobscript (REDjobscript_4):&lt;br /&gt;
&lt;br /&gt;
 #PBS -l ncpus=1 &lt;br /&gt;
 #PBS -l mem=6000mb&lt;br /&gt;
 #PBS -l walltime=01:00:00&lt;br /&gt;
 #PBS -q pqchem&lt;br /&gt;
 #PBS -joe&lt;br /&gt;
 module load ambertools&lt;br /&gt;
 module load gaussian&lt;br /&gt;
 export GAUSS_SCRDIR=$TMPDIR&lt;br /&gt;
 echo $GAUSS_SCRDIR&lt;br /&gt;
 cd $HOME&lt;br /&gt;
 pwd&lt;br /&gt;
 perl /home/af706/RED-vIII.3.pl &amp;gt; /home/af706/RED-vIII.log&lt;br /&gt;
&lt;br /&gt;
So here we are using the chemistry node pqchem:&lt;br /&gt;
&lt;br /&gt;
* loading the ambertools (for RESP) and gaussian modules&lt;br /&gt;
* defining a scratch directory for the RESP program to read gaussian scratch&lt;br /&gt;
* TEMPDIR is used since the scratch must be empty upon calculation start. The temporary directory will be created fresh on each node as the calculation is started, and it&#039;s unlikely that two calculations will run on the same node.&lt;br /&gt;
* Moving to my HOME directory /home/af706 because the script must be run in the same directory as the input files  (.log and .p2n)&lt;br /&gt;
* RUN!&lt;br /&gt;
&lt;br /&gt;
The fitting process runs single point energy calcs and fits the RESP - overall it took 3mins for my fragment.&lt;br /&gt;
&lt;br /&gt;
===Going from xLeap out to Gaussian in===&lt;br /&gt;
&lt;br /&gt;
2 ways to do this:&lt;br /&gt;
&lt;br /&gt;
* Load in Gaussview&lt;br /&gt;
* Use Pengs scripts&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;Gaussview&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
* The file loads and all residues are recognised (apart from the chromophore). The file contains the same number of atoms as the saved Leap file.&lt;br /&gt;
* Charges are assigned from amber94 - not the chromophore still&lt;br /&gt;
* Amber atom types are assigned correctly to ALL atoms&lt;br /&gt;
* All that remains is to add charges from the RESP fit.&lt;br /&gt;
* PDB information is retained in input&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;TAO&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
* Atoms are added so atom count is different to the file leaving Leap&lt;br /&gt;
* Fails to treat terminal residues correctly&lt;br /&gt;
* Lose residue information&lt;br /&gt;
* Charges are missing from chromophore and atoms in terminal residues&lt;br /&gt;
&lt;br /&gt;
==22nd Feb 2010==&lt;br /&gt;
&lt;br /&gt;
Links to HOW TO for Ambertools + Gaussview/ Ambertools + TAO tools + Gaussview&lt;br /&gt;
&lt;br /&gt;
* Ambertools + Gaussview  - https://www.ch.ic.ac.uk/wiki/index.php/AndyForester_Ambertools_Gaussview&lt;br /&gt;
* Ambertools + TAO tools + Gaussview - https://www.ch.ic.ac.uk/wiki/index.php/AndyForester_Ambertools_TAO&lt;br /&gt;
&lt;br /&gt;
* Running Amber only Calculations - https://www.ch.imperial.ac.uk/wiki/index.php/AndyForesterAmber&lt;br /&gt;
&lt;br /&gt;
==TAO tools==&lt;br /&gt;
&lt;br /&gt;
The expected differences to the one without using his tools are:&lt;br /&gt;
&lt;br /&gt;
* Different partial charges since his scripts use all_amino03 to assign charges,&lt;br /&gt;
  Gaussview uses amber94 charges (Net charge is the same for both files - good thing)&lt;br /&gt;
&lt;br /&gt;
* Potential for some changes in atom-types due to different versions of amber being used.&lt;br /&gt;
&lt;br /&gt;
What actually happened:&lt;br /&gt;
&lt;br /&gt;
* Difference in individual partial charges but same overal net charge&lt;br /&gt;
&lt;br /&gt;
* Different atom type assignment led to some more undefined parameters.&lt;br /&gt;
&lt;br /&gt;
* Defining these and running an energy led to significantly different energy than the non tools&lt;br /&gt;
  method:&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;WITHOUT tools&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
 AMBER calculation of energy.&lt;br /&gt;
 Energy per function type:&lt;br /&gt;
 Direct non-bonded                                 122152.193595037&lt;br /&gt;
 Non-bonded pair                                  -122162.319396395&lt;br /&gt;
 Harmonic stretch I                                     0.354921205&lt;br /&gt;
 Harmonic bend I                                        0.989669436&lt;br /&gt;
 Amber torsion                                          1.587390468&lt;br /&gt;
 Improper torsion                                       0.013641893&lt;br /&gt;
 Energy per function class:&lt;br /&gt;
        Coulomb              -9.200657698&lt;br /&gt;
    Vanderwaals              -0.925143661&lt;br /&gt;
     Stretching               0.354921205&lt;br /&gt;
        Bending               0.989669436&lt;br /&gt;
        Torsion               1.587390468&lt;br /&gt;
   Out-of-plane               0.013641893&lt;br /&gt;
 Energy=    -7.18017836     NIter=   0.&lt;br /&gt;
 Dipole moment=     -25.836974    -115.466231     -34.635294&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;WITH TAO tools&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
 AMBER calculation of energy.&lt;br /&gt;
 Energy per function type:&lt;br /&gt;
 Direct non-bonded                                 137343.707022741&lt;br /&gt;
 Non-bonded pair                                  -137347.284026475&lt;br /&gt;
 Harmonic stretch I                                     0.354921205&lt;br /&gt;
 Harmonic bend I                                        1.082982995&lt;br /&gt;
 Amber torsion                                          1.589487307&lt;br /&gt;
 Improper torsion                                       0.013645198&lt;br /&gt;
 Energy per function class:&lt;br /&gt;
        Coulomb              -6.890960120&lt;br /&gt;
    Vanderwaals               3.313956385&lt;br /&gt;
     Stretching               0.354921205&lt;br /&gt;
        Bending               1.082982995&lt;br /&gt;
        Torsion               1.589487307&lt;br /&gt;
   Out-of-plane               0.013645198&lt;br /&gt;
 Energy=   -0.535967030     NIter=   0.&lt;br /&gt;
 Dipole moment=     -29.988370    -117.918540     -32.409355&lt;br /&gt;
&lt;br /&gt;
Obvious where the differences are.&lt;/div&gt;</summary>
		<author><name>Af706</name></author>
	</entry>
	<entry>
		<id>https://chemwiki.ch.ic.ac.uk/index.php?title=AndyForesterAmber&amp;diff=98717</id>
		<title>AndyForesterAmber</title>
		<link rel="alternate" type="text/html" href="https://chemwiki.ch.ic.ac.uk/index.php?title=AndyForesterAmber&amp;diff=98717"/>
		<updated>2010-02-25T16:58:14Z</updated>

		<summary type="html">&lt;p&gt;Af706: /* Frequency */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;=Amber Calculations on 1W7S=&lt;br /&gt;
&lt;br /&gt;
Back to main - https://www.ch.ic.ac.uk/wiki/index.php/AndyForester&lt;br /&gt;
&lt;br /&gt;
Full 1W7S amber input is in &amp;quot;though Leap into gassview&amp;quot; folder on the desktop&lt;br /&gt;
&lt;br /&gt;
Issues with this file are:&lt;br /&gt;
&lt;br /&gt;
* slightly non-0 charge on CSY residue&lt;br /&gt;
* no water molecules anywhere&lt;br /&gt;
&lt;br /&gt;
==Energy calculations==&lt;br /&gt;
&lt;br /&gt;
These run locally in 2 seconds and compute the energy fine - simple summation  of terms&lt;br /&gt;
&lt;br /&gt;
==Optimisation==&lt;br /&gt;
&lt;br /&gt;
These run on CX1 using 6000MB memory and 1 processor in roughly 4 hours and 5000 steps&lt;br /&gt;
&lt;br /&gt;
Attempt to run the calculation on 2 processors witht 7000MB memory - time should be halfed but CPU time remains around 4 hours.&lt;br /&gt;
&lt;br /&gt;
Issues with the optimisation:&lt;br /&gt;
&lt;br /&gt;
* Not a smooth start to the energy profile - jump from -2 to 200 kJ/mol! - this smooths out&lt;br /&gt;
&lt;br /&gt;
===Visualising whats changed===&lt;br /&gt;
&lt;br /&gt;
* VMD allows overlay of the 2 structures (start and finish) so we can see whats moved&lt;br /&gt;
* It also has a RMS fit to calculate some &amp;quot;fit factor&amp;quot;&lt;br /&gt;
* Can also print out a distance matrix using 3rd party script in VMD tcl console. This creates a full matrix/surface.&lt;br /&gt;
* I and Mike wrote scripts to pull the diagonal out of the file so we are comparing each residue with itself.&lt;br /&gt;
* This plots the classic movement per residue - BASED ON CENTER OF MASS.&lt;br /&gt;
&lt;br /&gt;
==Frequency==&lt;br /&gt;
&lt;br /&gt;
The optimised file can be submitted for a frequency calculation using only Amber. Questions to answer about this are:&lt;br /&gt;
&lt;br /&gt;
* Do H and L ONIOM tags get interpreted for Amber only calculations (giving % frequency in the diff regions)?&lt;br /&gt;
* Can this calculation be run in parallel NProc=2?&lt;br /&gt;
* Does the Chk file load in Gaussview? How large is it?&lt;br /&gt;
&lt;br /&gt;
The initial frequency calculation ran successfully but created a 1.6Gb text file which made reading it difficult.&lt;br /&gt;
&lt;br /&gt;
Answer to the ONIOM tag question:&lt;br /&gt;
&lt;br /&gt;
* Without ONIOM tags - (opt freq) no split between model/real vibrations&lt;br /&gt;
* With ONIOM tags - (opt freq) no split between model/real vibrations&lt;br /&gt;
* With ONIOM tags - (freq) split between model/real vibrations&lt;br /&gt;
&lt;br /&gt;
It seems Gaussian does recognize the ONIOM split in pure frequency calculations even if no ONIOM method is being used.&lt;/div&gt;</summary>
		<author><name>Af706</name></author>
	</entry>
	<entry>
		<id>https://chemwiki.ch.ic.ac.uk/index.php?title=AndyForesterAmber&amp;diff=98555</id>
		<title>AndyForesterAmber</title>
		<link rel="alternate" type="text/html" href="https://chemwiki.ch.ic.ac.uk/index.php?title=AndyForesterAmber&amp;diff=98555"/>
		<updated>2010-02-25T13:21:35Z</updated>

		<summary type="html">&lt;p&gt;Af706: /* Optimisation */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;=Amber Calculations on 1W7S=&lt;br /&gt;
&lt;br /&gt;
Back to main - https://www.ch.ic.ac.uk/wiki/index.php/AndyForester&lt;br /&gt;
&lt;br /&gt;
Full 1W7S amber input is in &amp;quot;though Leap into gassview&amp;quot; folder on the desktop&lt;br /&gt;
&lt;br /&gt;
Issues with this file are:&lt;br /&gt;
&lt;br /&gt;
* slightly non-0 charge on CSY residue&lt;br /&gt;
* no water molecules anywhere&lt;br /&gt;
&lt;br /&gt;
==Energy calculations==&lt;br /&gt;
&lt;br /&gt;
These run locally in 2 seconds and compute the energy fine - simple summation  of terms&lt;br /&gt;
&lt;br /&gt;
==Optimisation==&lt;br /&gt;
&lt;br /&gt;
These run on CX1 using 6000MB memory and 1 processor in roughly 4 hours and 5000 steps&lt;br /&gt;
&lt;br /&gt;
Attempt to run the calculation on 2 processors witht 7000MB memory - time should be halfed but CPU time remains around 4 hours.&lt;br /&gt;
&lt;br /&gt;
Issues with the optimisation:&lt;br /&gt;
&lt;br /&gt;
* Not a smooth start to the energy profile - jump from -2 to 200 kJ/mol! - this smooths out&lt;br /&gt;
&lt;br /&gt;
===Visualising whats changed===&lt;br /&gt;
&lt;br /&gt;
* VMD allows overlay of the 2 structures (start and finish) so we can see whats moved&lt;br /&gt;
* It also has a RMS fit to calculate some &amp;quot;fit factor&amp;quot;&lt;br /&gt;
* Can also print out a distance matrix using 3rd party script in VMD tcl console. This creates a full matrix/surface.&lt;br /&gt;
* I and Mike wrote scripts to pull the diagonal out of the file so we are comparing each residue with itself.&lt;br /&gt;
* This plots the classic movement per residue - BASED ON CENTER OF MASS.&lt;br /&gt;
&lt;br /&gt;
==Frequency==&lt;br /&gt;
&lt;br /&gt;
The optimised file can be submitted for a frequency calculation using only Amber. Questions to answer about this are:&lt;br /&gt;
&lt;br /&gt;
* Do H and L ONIOM tags get interpreted for Amber only calculations (giving % frequency in the diff regions)?&lt;br /&gt;
* Can this calculation be run in parallel NProc=2?&lt;br /&gt;
* Does the Chk file load in Gaussview? How large is it?&lt;br /&gt;
&lt;br /&gt;
The initial frequency calculation ran successfully but created a 1.6Gb text file which made reading it difficult.&lt;/div&gt;</summary>
		<author><name>Af706</name></author>
	</entry>
	<entry>
		<id>https://chemwiki.ch.ic.ac.uk/index.php?title=AndyForesterAmber&amp;diff=98549</id>
		<title>AndyForesterAmber</title>
		<link rel="alternate" type="text/html" href="https://chemwiki.ch.ic.ac.uk/index.php?title=AndyForesterAmber&amp;diff=98549"/>
		<updated>2010-02-25T13:14:23Z</updated>

		<summary type="html">&lt;p&gt;Af706: /* Optimisation */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;=Amber Calculations on 1W7S=&lt;br /&gt;
&lt;br /&gt;
Back to main - https://www.ch.ic.ac.uk/wiki/index.php/AndyForester&lt;br /&gt;
&lt;br /&gt;
Full 1W7S amber input is in &amp;quot;though Leap into gassview&amp;quot; folder on the desktop&lt;br /&gt;
&lt;br /&gt;
Issues with this file are:&lt;br /&gt;
&lt;br /&gt;
* slightly non-0 charge on CSY residue&lt;br /&gt;
* no water molecules anywhere&lt;br /&gt;
&lt;br /&gt;
==Energy calculations==&lt;br /&gt;
&lt;br /&gt;
These run locally in 2 seconds and compute the energy fine - simple summation  of terms&lt;br /&gt;
&lt;br /&gt;
==Optimisation==&lt;br /&gt;
&lt;br /&gt;
These run on CX1 using 6000MB memory and 1 processor in roughly 4 hours and 5000 steps&lt;br /&gt;
&lt;br /&gt;
Attempt to run the calculation on 2 processors witht 7000MB memory - time should be halfed but CPU time remains around 4 hours.&lt;br /&gt;
&lt;br /&gt;
Issues with the optimisation:&lt;br /&gt;
&lt;br /&gt;
* Not a smooth start to the energy profile - jump from -2 to 200 kJ/mol! - this smooths out&lt;br /&gt;
&lt;br /&gt;
===Visualising whats changed===&lt;br /&gt;
&lt;br /&gt;
* VMD allows overlay of the 2 structures (start and finish) so we can see whats moved&lt;br /&gt;
* It also has a RMS fit to calculate some &amp;quot;fit factor&amp;quot;&lt;br /&gt;
* Can also print out a distance matrix using 3rd party script in VMD tcl console. This creates a full matrix/surface.&lt;br /&gt;
* I and Mike wrote scripts to pull the diagonal out of the file so we are comparing each residue with itself.&lt;br /&gt;
* This plots the classic movement per residue - BASED ON CENTER OF MASS.&lt;/div&gt;</summary>
		<author><name>Af706</name></author>
	</entry>
	<entry>
		<id>https://chemwiki.ch.ic.ac.uk/index.php?title=AndyForesterAmber&amp;diff=98548</id>
		<title>AndyForesterAmber</title>
		<link rel="alternate" type="text/html" href="https://chemwiki.ch.ic.ac.uk/index.php?title=AndyForesterAmber&amp;diff=98548"/>
		<updated>2010-02-25T13:13:30Z</updated>

		<summary type="html">&lt;p&gt;Af706: /* Amber Calculations on 1W7S */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;=Amber Calculations on 1W7S=&lt;br /&gt;
&lt;br /&gt;
Back to main - https://www.ch.ic.ac.uk/wiki/index.php/AndyForester&lt;br /&gt;
&lt;br /&gt;
Full 1W7S amber input is in &amp;quot;though Leap into gassview&amp;quot; folder on the desktop&lt;br /&gt;
&lt;br /&gt;
Issues with this file are:&lt;br /&gt;
&lt;br /&gt;
* slightly non-0 charge on CSY residue&lt;br /&gt;
* no water molecules anywhere&lt;br /&gt;
&lt;br /&gt;
==Energy calculations==&lt;br /&gt;
&lt;br /&gt;
These run locally in 2 seconds and compute the energy fine - simple summation  of terms&lt;br /&gt;
&lt;br /&gt;
==Optimisation==&lt;br /&gt;
&lt;br /&gt;
These run on CX1 using 6000MB memory and 1 processor in roughly 4 hours and 5000 steps&lt;br /&gt;
&lt;br /&gt;
Issues with the optimisation:&lt;br /&gt;
&lt;br /&gt;
* Not a smooth start to the energy profile - jump from -2 to 200 kJ/mol! - this smooths out&lt;br /&gt;
&lt;br /&gt;
===Visualising whats changed===&lt;br /&gt;
&lt;br /&gt;
* VMD allows overlay of the 2 structures (start and finish) so we can see whats moved&lt;br /&gt;
* It also has a RMS fit to calculate some &amp;quot;fit factor&amp;quot;&lt;br /&gt;
* Can also print out a distance matrix using 3rd party script in VMD tcl console. This creates a full matrix/surface.&lt;br /&gt;
* I and Mike wrote scripts to pull the diagonal out of the file so we are comparing each residue with itself.&lt;br /&gt;
* This plots the classic movement per residue - BASED ON CENTER OF MASS.&lt;/div&gt;</summary>
		<author><name>Af706</name></author>
	</entry>
	<entry>
		<id>https://chemwiki.ch.ic.ac.uk/index.php?title=AndyForesterAmber&amp;diff=98546</id>
		<title>AndyForesterAmber</title>
		<link rel="alternate" type="text/html" href="https://chemwiki.ch.ic.ac.uk/index.php?title=AndyForesterAmber&amp;diff=98546"/>
		<updated>2010-02-25T13:07:48Z</updated>

		<summary type="html">&lt;p&gt;Af706: /* Amber Calculations on 1W7S */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;=Amber Calculations on 1W7S=&lt;br /&gt;
&lt;br /&gt;
Back to main - https://www.ch.ic.ac.uk/wiki/index.php/AndyForester&lt;br /&gt;
&lt;br /&gt;
Full 1W7S amber input is in &amp;quot;though Leap into gassview&amp;quot; folder on the desktop&lt;br /&gt;
&lt;br /&gt;
Issues with this file are:&lt;br /&gt;
&lt;br /&gt;
* slightly non-0 charge on CSY residue&lt;br /&gt;
* no water molecules anywhere&lt;/div&gt;</summary>
		<author><name>Af706</name></author>
	</entry>
	<entry>
		<id>https://chemwiki.ch.ic.ac.uk/index.php?title=AndyForester&amp;diff=98545</id>
		<title>AndyForester</title>
		<link rel="alternate" type="text/html" href="https://chemwiki.ch.ic.ac.uk/index.php?title=AndyForester&amp;diff=98545"/>
		<updated>2010-02-25T13:06:19Z</updated>

		<summary type="html">&lt;p&gt;Af706: /* 22nd Feb 2010 */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;=HBDI Calculations=&lt;br /&gt;
==Wed 21st Oct 2009==&lt;br /&gt;
&lt;br /&gt;
- Started project work&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
- Gaussview and Gaussian working&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
- Working out how to use the command inputs&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
- Trying to sort VPN out&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
- Working to get the project proposal written&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
&lt;br /&gt;
==Tue 27th Oct 2009==&lt;br /&gt;
&lt;br /&gt;
Drew up the HBDI molecule in Gaussview, the molecule will be optimised at the HF 3-21G level on the mac (locally). &amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
DFT B3LYP 6-31G (Becke threeparameter hybrid exchange functional5 and the Lee-Yang-Parr correlation functional) method optimisation was run on the resulting file, the MP2 6-31G method optimisation. Each of these had a corresponding freq calculation to check that minima were reached. All calculations were successful, with no negative vibrations.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;jmol&amp;gt;&lt;br /&gt;
  &amp;lt;jmolAppletButton&amp;gt;&lt;br /&gt;
    &amp;lt;uploadedFileContents&amp;gt;HBDI_MP2_6-31G_FREQ.mol&amp;lt;/uploadedFileContents&amp;gt;&lt;br /&gt;
    &amp;lt;text&amp;gt;HBDI optimised at the MP2 6-31G level&amp;lt;/text&amp;gt;&lt;br /&gt;
    &amp;lt;color&amp;gt;#FFFFFF&amp;lt;/color&amp;gt;&lt;br /&gt;
    &amp;lt;size&amp;gt;600&amp;lt;/size&amp;gt;&lt;br /&gt;
    &amp;lt;/jmolAppletButton&amp;gt;&lt;br /&gt;
&amp;lt;/jmol&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
&lt;br /&gt;
==Wed 28th Oct 2009==&lt;br /&gt;
&lt;br /&gt;
Ran DFT B3YLP 6-311G calculations on HBDI (Opt and Freq) since these were the calculations used in a published study of HBDI.&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
Optimisation job finshed successfully, the frequency calculation did not finish succesfully. Error was:&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
&lt;br /&gt;
 ****Warning!!: The largest alpha MO coefficient is  0.17588883D+02&lt;br /&gt;
 Symmetrizing basis deriv contribution to polar:&lt;br /&gt;
 IMax=3 JMax=2 DiffMx= 0.00D+00&lt;br /&gt;
 G2DrvN: will do    29 centers at a time, making    1 passes doing MaxLOS=1.&lt;br /&gt;
 Calling FoFCou, ICntrl=  3107 FMM=F I1Cent=   0 AccDes= 0.00D+00.&lt;br /&gt;
 FoFDir/FoFCou used for L=0 through L=1.&lt;br /&gt;
 End of G2Drv Frequency-dependent properties file   721 does not exist.&lt;br /&gt;
 End of G2Drv Frequency-dependent properties file   722 does not exist.&lt;br /&gt;
          IDoAtm=1111111111111111111111111111&lt;br /&gt;
          Differentiating once with respect to electric field.&lt;br /&gt;
                with respect to dipole field.&lt;br /&gt;
          Differentiating once with respect to nuclear coordinates.&lt;br /&gt;
          Keep R1 ints in memory in canonical form, NReq=447529180.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
==Thurs 29th Oct 2009==&lt;br /&gt;
&lt;br /&gt;
===Learning to use ONIOM===&lt;br /&gt;
&lt;br /&gt;
The example on chem wiki was attempted. Drawn up in Gaussview and optimised initially using molecular mechanics AMBER force field.&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
The ONIOM calculation was set up in Gaussview using DFT B3LYP 6-311G(d):AMBER methods with the benzene ring as the high level and chain at low level. Electronic embedding was NOT used in this initial calculation. &amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;jmol&amp;gt;&lt;br /&gt;
  &amp;lt;jmolAppletButton&amp;gt;&lt;br /&gt;
    &amp;lt;uploadedFileContents&amp;gt;ONIOM_B3LYP_6-311G_AMBER.mol&amp;lt;/uploadedFileContents&amp;gt;&lt;br /&gt;
    &amp;lt;text&amp;gt;Test Molecule optimised using B3LYP 6-311G(d):AMBER&amp;lt;/text&amp;gt;&lt;br /&gt;
    &amp;lt;color&amp;gt;#FFFFFF&amp;lt;/color&amp;gt;&lt;br /&gt;
    &amp;lt;size&amp;gt;600&amp;lt;/size&amp;gt;&lt;br /&gt;
  &amp;lt;/jmolAppletButton&amp;gt;&lt;br /&gt;
&amp;lt;/jmol&amp;gt;&lt;br /&gt;
&lt;br /&gt;
This conformation is NOT the lowest energy conformation, the initial AMBER MM calculation did not find the global minimum probably due to the drawn up starting geometry.&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
The molecule was drawn up again and DFT B3LYP 6-311G(d):AMBER and pure B3LYP 6-311G(d) calculations run. Absolute energy differences between the 2 conformations was very small at the ONIOM(DFT B3LYP 6-311G(d):AMBER) level. -232.27532671 vs -232.27315709 au respectively, a 0.00271au (4.359 744 17×10-18 J = 1au) difference.&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;jmol&amp;gt;&lt;br /&gt;
  &amp;lt;jmolAppletButton&amp;gt;&lt;br /&gt;
    &amp;lt;uploadedFileContents&amp;gt;ONIOM_B3LYP_6-311G_AMBER2.mol&amp;lt;/uploadedFileContents&amp;gt;&lt;br /&gt;
    &amp;lt;text&amp;gt;Test Molecule optimised using B3LYP 6-311G(d):AMBER&amp;lt;/text&amp;gt;&lt;br /&gt;
    &amp;lt;color&amp;gt;#FFFFFF&amp;lt;/color&amp;gt;&lt;br /&gt;
    &amp;lt;size&amp;gt;600&amp;lt;/size&amp;gt;&lt;br /&gt;
  &amp;lt;/jmolAppletButton&amp;gt;&lt;br /&gt;
&amp;lt;/jmol&amp;gt;&lt;br /&gt;
&lt;br /&gt;
It has already become obvious that using ONIOM decreases calculation time considerably. The table below shows a &amp;quot;CPU time&amp;quot; comparison on the above molecule using the different approaches.&lt;br /&gt;
&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot; border=&amp;quot;1&amp;quot;&lt;br /&gt;
| align=&amp;quot;center&amp;quot; style=&amp;quot;background:#f0f0f0;&amp;quot;|&#039;&#039;&#039;Calculation&#039;&#039;&#039;&lt;br /&gt;
| align=&amp;quot;center&amp;quot; style=&amp;quot;background:#f0f0f0;&amp;quot;|&#039;&#039;&#039;Time (1st attempt)&#039;&#039;&#039;&lt;br /&gt;
| align=&amp;quot;center&amp;quot; style=&amp;quot;background:#f0f0f0;&amp;quot;|&#039;&#039;&#039;Time (2nd attempt)&#039;&#039;&#039;&lt;br /&gt;
|-&lt;br /&gt;
|   ONIOM(DFT B3LYP 6-311G(d):AMBER)||4 minutes 22.0 seconds||8 minutes  5.5 seconds&lt;br /&gt;
|-&lt;br /&gt;
|   DFT 3LYP 6-311G(d)||2 hours 12 minutes 21.9 seconds||2 hours 30 minutes  0.8 seconds&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
&lt;br /&gt;
==Mon 2nd Nov 2009==&lt;br /&gt;
&lt;br /&gt;
Having obtained optimized structures of HBDI at various theory levels and basis set variations, these structures will now be compared to previous literature. Extensive studies have been performed using HF and B3LYP methods previously. It has been noted in Jaspers recent review that HF does not model frequencies very accurately - my computations agree HF 6-31G frequencies are very far out. B3LYP frequencies are much closer to the experimental values.&lt;br /&gt;
&lt;br /&gt;
==Tues 3rd Nov 2009==&lt;br /&gt;
&lt;br /&gt;
Tabulated frerquency comparison&lt;br /&gt;
&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot; border=&amp;quot;1&amp;quot;&lt;br /&gt;
| align=&amp;quot;center&amp;quot; style=&amp;quot;background:#f0f0f0;&amp;quot;|&lt;br /&gt;
| align=&amp;quot;center&amp;quot; style=&amp;quot;background:#f0f0f0;&amp;quot;|&#039;&#039;&#039;A.Esposito HF 6-31G(d)&#039;&#039;&#039;&lt;br /&gt;
| align=&amp;quot;center&amp;quot; style=&amp;quot;background:#f0f0f0;&amp;quot;|&#039;&#039;&#039;A.Esposito B3LYP 6-31G(d)&#039;&#039;&#039;&lt;br /&gt;
| align=&amp;quot;center&amp;quot; style=&amp;quot;background:#f0f0f0;&amp;quot;|&#039;&#039;&#039;X.He B3LYP 6-31G(d)&#039;&#039;&#039;&lt;br /&gt;
| align=&amp;quot;center&amp;quot; style=&amp;quot;background:#f0f0f0;&amp;quot;|&#039;&#039;&#039;HF 6-31G(d)&#039;&#039;&#039;&lt;br /&gt;
| align=&amp;quot;center&amp;quot; style=&amp;quot;background:#f0f0f0;&amp;quot;|&#039;&#039;&#039;B3LYP 6-31G(d)&#039;&#039;&#039;&lt;br /&gt;
|-&lt;br /&gt;
| C=O||1987||1802||1802||1986||1801&lt;br /&gt;
|-&lt;br /&gt;
| C=C||1880||1705||1704||1880||1704&lt;br /&gt;
|-&lt;br /&gt;
| Phenol||1809||1665||1663||1812||1666&lt;br /&gt;
|-&lt;br /&gt;
| Phenol||1781||1635||1631||1777||1635&lt;br /&gt;
|-&lt;br /&gt;
| C=N||1844||1620||1618||1844||1618&lt;br /&gt;
|-&lt;br /&gt;
| Phenol||1696||1562||1558||1694||1562&lt;br /&gt;
|-&lt;br /&gt;
| C-N||1607||1483||N/A||1606||1482&lt;br /&gt;
|-&lt;br /&gt;
| Phenol||1602||1487||N/A||1603||1488&lt;br /&gt;
|-&lt;br /&gt;
| Total Energy(au)||||||||-720.043||-724.465&lt;br /&gt;
|-&lt;br /&gt;
| &lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
==Thurs 5th Nov 2009==&lt;br /&gt;
&lt;br /&gt;
Having got Jaspers input files, several things were noticed:&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
* Solvent models were used - should be in vacum.... I have re run these in G09 with No solvent on both DFT 6-311G(d,p) and HF 6-31+G(d) methods/basis sets for comparison&lt;br /&gt;
* The models in his Biophys paper using other residues in contact COO- for example are not present. He has not reported the pure HBDI calculation results.&lt;br /&gt;
* More than just HBDI is likely to be required, meaning larger than expected high level region in the GFP calculation. (3 layer ONIOM DFT:HF:AMBER)?&lt;br /&gt;
&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
Other things to do:&lt;br /&gt;
&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
* Work on a guide to loading .pdb files into Gaussview 5 - how much information is maintained?&lt;br /&gt;
* Are residues defined still in Gaussview input files&lt;br /&gt;
* AMBER MM requires definitions of charges etc... how easy is it to import from .pdb and get a file that can be run in opt etc...&lt;br /&gt;
&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
* Find a model system which is larger than HBDI but incorporates this molecule &amp;lt;=500 atoms (Limit of size for pure DFT calculation)&lt;br /&gt;
* Possibilities include a system alread developed by someone else online&lt;br /&gt;
* A &amp;quot;custom&amp;quot; system made by me to model this (time intensive)&lt;br /&gt;
* &amp;quot;strip&amp;quot; back the GFP pdb file to 500 atoms surrounding the HBDI chromophore.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
==Fri 6th Nov 2009==&lt;br /&gt;
&lt;br /&gt;
Yesterday started by trying to follow the tutorial on loading the GFP molecule from .pdb, this resulted in a problem with Gaussviews addition of hydrogens. It over protonated the nitrogens and carbon in the chromophore leading to over valent parts. I then tried several other molecules (PYP, a scorpion toxin and a strand of RNA) which had no problems with hydrogen addition.&lt;br /&gt;
&lt;br /&gt;
I tried to use ChemBio 3D to load the .pdb and correctly assign hydrogens/connectivity. This resulted in the same protonation problems!&lt;br /&gt;
&lt;br /&gt;
==Mon 9th Nov 2009==&lt;br /&gt;
&lt;br /&gt;
TINKER - http://dasher.wustl.edu/tinker/ was installed with the Force Field Explorer (FFE) interface on the mac. This was used to load the same .pdb file and resulted in the same protonation around the chromophore. So far the SAME protonation pattern (incorrect) around the chromophore has been found using:&lt;br /&gt;
&lt;br /&gt;
*Gaussview&lt;br /&gt;
*ChemBio 3D&lt;br /&gt;
*TINKER&lt;br /&gt;
&lt;br /&gt;
Looking into how the .pdb file format works, the molecule is built from a series of residues, it becomes clear as you look at the sequence that is building the GFP protein chromophore (residues 65,66,67 in 1GFL) are not then reproducing the internal cyclisation properly leaving protons attached and thus atoms hypervalent. This accounts for the consistent error loading from the .pdb. As a result must all molecules expressed as a .pdb file must be pure protein structures...?&lt;br /&gt;
&lt;br /&gt;
===Comparison of the 2 available wild type GFP structures===&lt;br /&gt;
&lt;br /&gt;
The 2 structures being considered here are:&lt;br /&gt;
&lt;br /&gt;
*1GFL - Yang et al. -http://www.pdb.org/pdb/explore/explore.do?structureId=1GFL&lt;br /&gt;
*1W7S - Jaspers stucture - http://www.pdb.org/pdb/explore/explore.do?structureId=1W7S&lt;br /&gt;
&lt;br /&gt;
These 2 structures differ in that 1GFL contains 2 GFP molecules, whereas 1W7S contains 4. These were both stripped down to 1 unit of GFP, using the &amp;quot;pdb secondary structure&amp;quot; function in Gaussview 5. 1GFL had chain B removed and 1W7S had chains B,C and D removed. Next the residues were investigated that created the chromophore. The results are not consistent because GFL uses 3 residues (65,66,67) to form the chromophore. 1W7S has residue 66 (labelled CSY) as the entire chromophore! Both get protonated the same (incorrectly) which is odd considering how they molecules are built.&lt;br /&gt;
&lt;br /&gt;
The next test was to &amp;quot;modify bond&amp;quot; in Gaussview 5 and create a &amp;quot;cleaned&amp;quot; chromophore structure (no bonding information from the residue) by manually changing the bonds. The chromophore in each .pdb file was altered to look like HBDI in terms of double single bonds and the hydrogens taken out (makes no difference as these are created each time the .pdb is loaded). When the new .pdb files (1GFL_cleaned.pdb and 1W7S_cleaned.pdb) were loaded they were correctly protonated! Success.&lt;br /&gt;
&lt;br /&gt;
===1W7S improvement===&lt;br /&gt;
&lt;br /&gt;
Chain A (not cleaned) of the 1W7S .pdb was submitted to run a AMBER calculation, as expected it failed with the lines:&lt;br /&gt;
&lt;br /&gt;
 Missing atomic parameters for atom   474 IAtTyp=    20000000&lt;br /&gt;
 Missing atomic parameters.&lt;br /&gt;
&lt;br /&gt;
This error says that the atom type of atom 474 is not defined. One way to counter this is to locate the atom and then change its definition manually (as shown in the example https://www.ch.ic.ac.uk/wiki/index.php/PYP_:_Photoactive_yellow_protein - time intensive!)&lt;br /&gt;
&lt;br /&gt;
Instead the molecule was cleaned (manually changing the bonding in the chromophore), after the bond updates the structure was re submitted, again failing but with a different message:&lt;br /&gt;
&lt;br /&gt;
 Include all MM classes&lt;br /&gt;
  Bondstretch undefined between atoms    473    478 CC-N* [H,H]  &lt;br /&gt;
  Bondstretch undefined between atoms    475    476 CC-CC [H,H]  &lt;br /&gt;
  Bondstretch undefined between atoms    475    486 CC-CM [H,H]  &lt;br /&gt;
  Bondstretch undefined between atoms    476    477 CC-O [H,H]  &lt;br /&gt;
  Bondstretch undefined between atoms    476    478 CC-N* [H,H]  &lt;br /&gt;
  Bondstretch undefined between atoms    486   2559 CM-HC [H,H]  &lt;br /&gt;
  Angle bend  undefined between atoms    473    474    475 CC-NB-CC [H,H,H]  &lt;br /&gt;
  Angle bend  undefined between atoms    473    478    479 CC-N*-CT [H,H,H]  &lt;br /&gt;
  Angle bend  undefined between atoms    473    478    476 CC-N*-CC [H,H,H]  &lt;br /&gt;
  Angle bend  undefined between atoms    473    482    483 CC-CT-N [H,H,H]  &lt;br /&gt;
  Angle bend  undefined between atoms    473    482   3857 CC-CT-H1 [H,H,H]  &lt;br /&gt;
  Angle bend  undefined between atoms    474    475    476 NB-CC-CC [H,H,H]  &lt;br /&gt;
  Angle bend  undefined between atoms    474    475    486 NB-CC-CM [H,H,H]  &lt;br /&gt;
  Angle bend  undefined between atoms    474    473    478 NB-CC-N* [H,H,H]  &lt;br /&gt;
  Angle bend  undefined between atoms    475    476    477 CC-CC-O [H,H,H]  &lt;br /&gt;
  Angle bend  undefined between atoms    475    476    478 CC-CC-N* [H,H,H]  &lt;br /&gt;
  Angle bend  undefined between atoms    475    486    487 CC-CM-CA [H,H,H]  &lt;br /&gt;
  Angle bend  undefined between atoms    475    486   2559 CC-CM-HC [H,H,H]  &lt;br /&gt;
  Angle bend  undefined between atoms    476    478    479 CC-N*-CT [H,H,H]  &lt;br /&gt;
  Angle bend  undefined between atoms    476    475    486 CC-CC-CM [H,H,H]  &lt;br /&gt;
  Angle bend  undefined between atoms    477    476    478 O-CC-N* [H,H,H]  &lt;br /&gt;
  Angle bend  undefined between atoms    478    479    480 N*-CT-C [H,H,H]  &lt;br /&gt;
  Angle bend  undefined between atoms    478    473    482 N*-CC-CT [H,H,H]  &lt;br /&gt;
  Angle bend  undefined between atoms    486    487    488 CM-CA-CA [H,H,H]  &lt;br /&gt;
  Angle bend  undefined between atoms    486    487    489 CM-CA-CM [H,H,H]  &lt;br /&gt;
  Angle bend  undefined between atoms    487    486   2559 CA-CM-HC [H,H,H]  &lt;br /&gt;
  Angle bend  undefined between atoms    488    487    489 CA-CA-CM [H,H,H]  &lt;br /&gt;
  Angle bend  undefined between atoms    490    492    491 CA-C-CM [H,H,H]  &lt;br /&gt;
  Angle bend  undefined between atoms    491    492    493 CM-C-OH [H,H,H]  &lt;br /&gt;
  MM function not complete&lt;br /&gt;
&lt;br /&gt;
This is a better error message since all the atoms are now defined, only parameters are missing. For parameters that involve only atoms in the chromophore at the high level in the ONIOM calculation, the values can be set to 0 in the input file. This is because the ONIOM extrapolation will cancel MM from this high level region, To do this the keyword &amp;quot;Amber=Hardfirst&amp;quot; and wildcards for the missing parameters which are zero valued are placed in the input file.&lt;br /&gt;
&lt;br /&gt;
[[Image:Atoms involving undefined parameters.png]]&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
&lt;br /&gt;
The above screeshot shows highlighted the atoms listed in the error... most of them if not all will be involved in the high level calculation and thus may not require parameters defining (other than to be set to 0). The reason the parameters are undefined is that the AMBER force field is very specific to protein bonds and thus the organic chromophore has bonds that are not found in this parameter set.&lt;br /&gt;
&lt;br /&gt;
The atoms in the chromophore were frozen in the atom group editor and the input file had a parameter set added to the end setting chromophore parameters to 0:&lt;br /&gt;
&lt;br /&gt;
 HrmStr1 CC N* 0.0 0.0&lt;br /&gt;
 HrmStr1 CC CC 0.0 0.0&lt;br /&gt;
 HrmStr1 CC CM 0.0 0.0&lt;br /&gt;
 HrmStr1 CC O 0.0 0.0&lt;br /&gt;
 HrmStr1 CM HC 0.0 0.0&lt;br /&gt;
 HrmBnd1 CC NB CC 0.0 0.0&lt;br /&gt;
 HrmBnd1 CC N* CC 0.0 0.0&lt;br /&gt;
 HrmBnd1 CC CT N 0.0 0.0&lt;br /&gt;
 HrmBnd1 CC CT H1 0.0 0.0&lt;br /&gt;
 HrmBnd1 NB CC CC 0.0 0.0&lt;br /&gt;
 HrmBnd1 NB CC CM 0.0 0.0&lt;br /&gt;
 HrmBnd1 NB CC N* 0.0 0.0&lt;br /&gt;
 HrmBnd1 CC CC O  0.0 0.0&lt;br /&gt;
 HrmBnd1 CC CC N* 0.0 0.0&lt;br /&gt;
 HrmBnd1 CC CM CA 0.0 0.0&lt;br /&gt;
 HrmBnd1 CC CM HC 0.0 0.0&lt;br /&gt;
 HrmBnd1 CC CC CM 0.0 0.0&lt;br /&gt;
 HrmBnd1 O CC N*  0.0 0.0&lt;br /&gt;
 HrmBnd1 N* CT C  0.0 0.0&lt;br /&gt;
 HrmBnd1 N* CC CT 0.0 0.0&lt;br /&gt;
 HrmBnd1 CM CA CA 0.0 0.0&lt;br /&gt;
 HrmBnd1 CA CM HC 0.0 0.0&lt;br /&gt;
 HrmBnd1 CA C CM 0.0 0.0&lt;br /&gt;
 HrmBnd1 CM C OH 0.0 0.0&lt;br /&gt;
 HrmBnd1 CC N* CT 0.0 0.0&lt;br /&gt;
 HrmBnd1 CM CA CM 0.0 0.0&lt;br /&gt;
&lt;br /&gt;
This file then ran successfully for an AMBER single point energy calculation. It was then submitted for AMBER optimisation using &amp;quot;amber=hardfirst geom=connectivity&amp;quot; and returned a successful minimum with Forces and Displacements converged:&lt;br /&gt;
&lt;br /&gt;
         Item               Value     Threshold  Converged?&lt;br /&gt;
  Maximum Force            0.000089     0.000450     YES&lt;br /&gt;
  RMS     Force            0.000004     0.000300     YES&lt;br /&gt;
  Maximum Displacement     0.000089     0.001800     YES&lt;br /&gt;
  RMS     Displacement     0.000004     0.001200     YES&lt;br /&gt;
  Predicted change in Energy=-1.156135D-07&lt;br /&gt;
  Optimization completed.&lt;br /&gt;
     -- Stationary point found.&lt;br /&gt;
&lt;br /&gt;
===Cleaned .pdb files and TINKER===&lt;br /&gt;
&lt;br /&gt;
*The files that had been cleaned were opened in TINKER, it is necessary to convert the .pdb to a .xyz file to run any calculations within TINKER. This is currently failing due to an &amp;quot;atom missing on SER-228&amp;quot; - checking this there is no SER residue at 228 (in fact a GLU) so a bit puzzled. This is the 1W7S cleaned version.&lt;br /&gt;
&lt;br /&gt;
*PYP was loaded in its .pdb format to TINKER and was immediately successful in being converted to .xyz then a MINIMISE algorithm ran successfully with the energy at its X-ray crystal structure as 261KJ/mol reduced to -3670.5355KJ/mol in 1645 iterations - this took a while (45mins locally)!&lt;br /&gt;
&lt;br /&gt;
*Interestingly the .xyz output from the minimization can be converted back to .pdb (loses hydrogens minimization) and this PYP file was loaded into Gaussview 5 - the file then ran successfully for an AMBER single point energy calculation! cf. the first attempt to get PYP to run was unsuccessful, and the tutorial underwent much more hassle to run this calculation.&lt;br /&gt;
&lt;br /&gt;
===What .pdb information is retained?===&lt;br /&gt;
&lt;br /&gt;
The only molecule that has been run with AMBER opt (GFP) has lost its secondary structure information but RETAINED its residue information.&lt;br /&gt;
&lt;br /&gt;
==Fri 13th Nov 2009==&lt;br /&gt;
&lt;br /&gt;
===Conversion from.xyz to .com===&lt;br /&gt;
A protein with 2 residues (Glu-Asp) was built using TINKER, saved as:&lt;br /&gt;
&lt;br /&gt;
*.xyz&lt;br /&gt;
*.pdb&lt;br /&gt;
*.com&lt;br /&gt;
&lt;br /&gt;
The initial format was .xyz (created in TINEKR as a native format), converted to .pdb using TINKER and then loaded into Gaussview and saved as a .com.&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
&lt;br /&gt;
.xyz&lt;br /&gt;
&lt;br /&gt;
    36  Protein built with TINKER/Force Field Explorer&lt;br /&gt;
     1  CT     0.000000    0.000000    0.000000   340     2     4     5     6&lt;br /&gt;
     2  C      0.000000    0.000000    1.510000   342     1     3     7&lt;br /&gt;
     3  O      1.028938    0.000000    2.165506   343     2&lt;br /&gt;
     4  HC     1.056270    0.000000   -0.341166   341     1&lt;br /&gt;
     5  HC    -0.497647    0.931694   -0.341166   341     1&lt;br /&gt;
     6  HC    -0.497647   -0.931694   -0.341166   341     1&lt;br /&gt;
     7  N     -1.236201    0.000000    2.027114   232     2     8    11&lt;br /&gt;
     8  CT    -1.446961    0.000000    3.471822   233     7     9    12    13&lt;br /&gt;
     9  C     -2.509557   -0.992751    3.878558   234     8    10    22&lt;br /&gt;
    10  O     -3.569672   -1.098634    3.284129   236     9&lt;br /&gt;
    11  H     -2.025518    0.000000    1.381066   235     7&lt;br /&gt;
    12  H1    -0.503221   -0.281395    3.983946   237     8&lt;br /&gt;
    13  CT    -1.910491    1.397334    3.923704   238     8    14    18    19&lt;br /&gt;
    14  CT    -2.132799    1.397334    5.447574   240    13    15    20    21&lt;br /&gt;
    15  C     -2.585014    2.781238    5.848089   242    14    16    17&lt;br /&gt;
    16  O2    -2.816736    2.973609    7.061266   243    15&lt;br /&gt;
    17  O2    -2.693194    3.629064    4.935955   243    15&lt;br /&gt;
    18  HC    -2.861794    1.653482    3.412322   239    13&lt;br /&gt;
    19  HC    -1.132567    2.145528    3.664589   239    13&lt;br /&gt;
    20  HC    -1.170306    1.191123    5.960588   241    14&lt;br /&gt;
    21  HC    -2.946052    0.685840    5.701535   241    14&lt;br /&gt;
    22  N     -2.153997   -1.723291    4.944151   210     9    23    26&lt;br /&gt;
    23  CT    -3.047986   -2.739703    5.491225   211    22    24    27    28&lt;br /&gt;
    24  C     -3.120370   -2.663723    6.997574   212    23    25    34&lt;br /&gt;
    25  O     -2.128241   -2.535414    7.695870   214    24&lt;br /&gt;
    26  H     -1.239994   -1.547921    5.361576   213    22&lt;br /&gt;
    27  H1    -4.072297   -2.585746    5.092246   215    23&lt;br /&gt;
    28  CT    -2.526745   -4.139104    5.114946   216    23    29    32    33&lt;br /&gt;
    29  C     -3.476741   -5.157722    5.698062   218    28    30    31&lt;br /&gt;
    30  O2    -3.212872   -6.362275    5.493304   219    29&lt;br /&gt;
    31  O2    -4.454666   -4.718803    6.341112   219    29&lt;br /&gt;
    32  HC    -1.530243   -4.298013    5.577366   217    28&lt;br /&gt;
    33  HC    -2.540086   -4.252948    4.010880   217    28&lt;br /&gt;
    34  N     -4.374888   -2.752228    7.460123   344    24    35    36&lt;br /&gt;
    35  H     -5.140865   -2.858197    6.794954   345    34&lt;br /&gt;
    36  H     -4.534830   -2.711583    8.466684   345    34&lt;br /&gt;
&lt;br /&gt;
.pdb&lt;br /&gt;
&lt;br /&gt;
 HEADER    Protein built with TINKER/Force Field Explorer&lt;br /&gt;
 COMPND&lt;br /&gt;
 SOURCE&lt;br /&gt;
 ATOM      1  CH3 ACE     1       0.000   0.000   0.000&lt;br /&gt;
 ATOM      2  C   ACE     1       0.000   0.000   1.510&lt;br /&gt;
 ATOM      3  O   ACE     1       1.029   0.000   2.166&lt;br /&gt;
 ATOM      4  H1  ACE     1       1.056   0.000  -0.341&lt;br /&gt;
 ATOM      5  H2  ACE     1      -0.498   0.932  -0.341&lt;br /&gt;
 ATOM      6  H3  ACE     1      -0.498  -0.932  -0.341&lt;br /&gt;
 ATOM      7  N   GLU     2      -1.236   0.000   2.027&lt;br /&gt;
 ATOM      8  CA  GLU     2      -1.447   0.000   3.472&lt;br /&gt;
 ATOM      9  C   GLU     2      -2.510  -0.993   3.879&lt;br /&gt;
 ATOM     10  O   GLU     2      -3.570  -1.099   3.284&lt;br /&gt;
 ATOM     11  CB  GLU     2      -1.910   1.397   3.924&lt;br /&gt;
 ATOM     12  CG  GLU     2      -2.133   1.397   5.448&lt;br /&gt;
 ATOM     13  CD  GLU     2      -2.585   2.781   5.848&lt;br /&gt;
 ATOM     14  OE1 GLU     2      -2.817   2.974   7.061&lt;br /&gt;
 ATOM     15  OE2 GLU     2      -2.693   3.629   4.936&lt;br /&gt;
 ATOM     16  H   GLU     2      -2.026   0.000   1.381&lt;br /&gt;
 ATOM     17  HA  GLU     2      -0.503  -0.281   3.984&lt;br /&gt;
 ATOM     18  HB2 GLU     2      -2.862   1.653   3.412&lt;br /&gt;
 ATOM     19  HB3 GLU     2      -1.133   2.146   3.665&lt;br /&gt;
 ATOM     20  HG2 GLU     2      -1.170   1.191   5.961&lt;br /&gt;
 ATOM     21  HG3 GLU     2      -2.946   0.686   5.702&lt;br /&gt;
 ATOM     22  N   ASP     3      -2.154  -1.723   4.944&lt;br /&gt;
 ATOM     23  CA  ASP     3      -3.048  -2.740   5.491&lt;br /&gt;
 ATOM     24  C   ASP     3      -3.120  -2.664   6.998&lt;br /&gt;
 ATOM     25  O   ASP     3      -2.128  -2.535   7.696&lt;br /&gt;
 ATOM     26  CB  ASP     3      -2.527  -4.139   5.115&lt;br /&gt;
 ATOM     27  CG  ASP     3      -3.477  -5.158   5.698&lt;br /&gt;
 ATOM     28  OD1 ASP     3      -3.213  -6.362   5.493&lt;br /&gt;
 ATOM     29  OD2 ASP     3      -4.455  -4.719   6.341&lt;br /&gt;
 ATOM     30  H   ASP     3      -1.240  -1.548   5.362&lt;br /&gt;
 ATOM     31  HA  ASP     3      -4.072  -2.586   5.092&lt;br /&gt;
 ATOM     32  HB2 ASP     3      -1.530  -4.298   5.577&lt;br /&gt;
 ATOM     33  HB3 ASP     3      -2.540  -4.253   4.011&lt;br /&gt;
 ATOM     34  N   NH2     4      -4.375  -2.752   7.460&lt;br /&gt;
 ATOM     35  H1  NH2     4      -5.141  -2.858   6.795&lt;br /&gt;
 ATOM     36  H2  NH2     4      -4.535  -2.712   8.467&lt;br /&gt;
 END&lt;br /&gt;
&lt;br /&gt;
.com&lt;br /&gt;
&lt;br /&gt;
 %chk=File_Test_Glu-Asp.chk&lt;br /&gt;
 # hf/3-21g geom=connectivity&lt;br /&gt;
 &lt;br /&gt;
 Protein built with TINKER/Force Field Explorer&lt;br /&gt;
 &lt;br /&gt;
 0 1&lt;br /&gt;
 C(PDBName=CH3,ResName=ACE,ResNum=1)                   0.00000000    0.00000000    0.00000000&lt;br /&gt;
 C(PDBName=C,ResName=ACE,ResNum=1)                     0.00000000    0.00000000    1.51000000&lt;br /&gt;
 O(PDBName=O,ResName=ACE,ResNum=1)                     1.02900000    0.00000000    2.16600000&lt;br /&gt;
 H(PDBName=H1,ResName=ACE,ResNum=1)                    1.05600000    0.00000000   -0.34100000&lt;br /&gt;
 H(PDBName=H2,ResName=ACE,ResNum=1)                   -0.49800000    0.93200000   -0.34100000&lt;br /&gt;
 H(PDBName=H3,ResName=ACE,ResNum=1)                   -0.49800000   -0.93200000   -0.34100000&lt;br /&gt;
 N(PDBName=N,ResName=GLU,ResNum=2)                    -1.23600000    0.00000000    2.02700000&lt;br /&gt;
 C(PDBName=CA,ResName=GLU,ResNum=2)                   -1.44700000    0.00000000    3.47200000&lt;br /&gt;
 C(PDBName=C,ResName=GLU,ResNum=2)                    -2.51000000   -0.99300000    3.87900000&lt;br /&gt;
 O(PDBName=O,ResName=GLU,ResNum=2)                    -3.57000000   -1.09900000    3.28400000&lt;br /&gt;
 C(PDBName=CB,ResName=GLU,ResNum=2)                   -1.91000000    1.39700000    3.92400000&lt;br /&gt;
 C(PDBName=CG,ResName=GLU,ResNum=2)                   -2.13300000    1.39700000    5.44800000&lt;br /&gt;
 C(PDBName=CD,ResName=GLU,ResNum=2)                   -2.58500000    2.78100000    5.84800000&lt;br /&gt;
 O(PDBName=OE1,ResName=GLU,ResNum=2)                  -2.81700000    2.97400000    7.06100000&lt;br /&gt;
 O(PDBName=OE2,ResName=GLU,ResNum=2)                  -2.69300000    3.62900000    4.93600000&lt;br /&gt;
 H(PDBName=H,ResName=GLU,ResNum=2)                    -2.02600000    0.00000000    1.38100000&lt;br /&gt;
 H(PDBName=HA,ResName=GLU,ResNum=2)                   -0.50300000   -0.28100000    3.98400000&lt;br /&gt;
 H(PDBName=HB2,ResName=GLU,ResNum=2)                  -2.86200000    1.65300000    3.41200000&lt;br /&gt;
 H(PDBName=HB3,ResName=GLU,ResNum=2)                  -1.13300000    2.14600000    3.66500000&lt;br /&gt;
 H(PDBName=HG2,ResName=GLU,ResNum=2)                  -1.17000000    1.19100000    5.96100000&lt;br /&gt;
 H(PDBName=HG3,ResName=GLU,ResNum=2)                  -2.94600000    0.68600000    5.70200000&lt;br /&gt;
 N(PDBName=N,ResName=ASP,ResNum=3)                    -2.15400000   -1.72300000    4.94400000&lt;br /&gt;
 C(PDBName=CA,ResName=ASP,ResNum=3)                   -3.04800000   -2.74000000    5.49100000&lt;br /&gt;
 C(PDBName=C,ResName=ASP,ResNum=3)                    -3.12000000   -2.66400000    6.99800000&lt;br /&gt;
 O(PDBName=O,ResName=ASP,ResNum=3)                    -2.12800000   -2.53500000    7.69600000&lt;br /&gt;
 C(PDBName=CB,ResName=ASP,ResNum=3)                   -2.52700000   -4.13900000    5.11500000&lt;br /&gt;
 C(PDBName=CG,ResName=ASP,ResNum=3)                   -3.47700000   -5.15800000    5.69800000&lt;br /&gt;
 O(PDBName=OD1,ResName=ASP,ResNum=3)                  -3.21300000   -6.36200000    5.49300000&lt;br /&gt;
 O(PDBName=OD2,ResName=ASP,ResNum=3)                  -4.45500000   -4.71900000    6.34100000&lt;br /&gt;
 H(PDBName=H,ResName=ASP,ResNum=3)                    -1.24000000   -1.54800000    5.36200000&lt;br /&gt;
 H(PDBName=HA,ResName=ASP,ResNum=3)                   -4.07200000   -2.58600000    5.09200000&lt;br /&gt;
 H(PDBName=HB2,ResName=ASP,ResNum=3)                  -1.53000000   -4.29800000    5.57700000&lt;br /&gt;
 H(PDBName=HB3,ResName=ASP,ResNum=3)                  -2.54000000   -4.25300000    4.01100000&lt;br /&gt;
 N(PDBName=N,ResName=NH2,ResNum=4)                    -4.37500000   -2.75200000    7.46000000&lt;br /&gt;
 H(PDBName=H1,ResName=NH2,ResNum=4)                   -5.14100000   -2.85800000    6.79500000&lt;br /&gt;
 H(PDBName=H2,ResName=NH2,ResNum=4)                   -4.53500000   -2.71200000    8.46700000&lt;br /&gt;
 &lt;br /&gt;
 1 2 1.0 4 1.0 5 1.0 6 1.0&lt;br /&gt;
 2 3 2.0 7 1.5&lt;br /&gt;
 3&lt;br /&gt;
 4&lt;br /&gt;
 5&lt;br /&gt;
 6&lt;br /&gt;
 7 16 1.0 8 1.0&lt;br /&gt;
 8 9 1.0 11 1.0 17 1.0&lt;br /&gt;
 9 10 2.0 22 1.5&lt;br /&gt;
 10&lt;br /&gt;
 11 18 1.0 12 1.0 19 1.0&lt;br /&gt;
 12 21 1.0 13 1.0 20 1.0&lt;br /&gt;
 13 14 2.0 15 2.0&lt;br /&gt;
 14&lt;br /&gt;
 15&lt;br /&gt;
 16&lt;br /&gt;
 17&lt;br /&gt;
 18&lt;br /&gt;
 19&lt;br /&gt;
 20&lt;br /&gt;
 21&lt;br /&gt;
 22 23 1.0 30 1.0&lt;br /&gt;
 23 26 1.0 24 1.0 31 1.0&lt;br /&gt;
 24 34 1.5 25 2.0&lt;br /&gt;
 25&lt;br /&gt;
 26 27 1.0 33 1.0 32 1.0&lt;br /&gt;
 27 28 2.0 29 2.0&lt;br /&gt;
 28&lt;br /&gt;
 29&lt;br /&gt;
 30&lt;br /&gt;
 31&lt;br /&gt;
 32&lt;br /&gt;
 33&lt;br /&gt;
 34 35 1.0 36 1.0&lt;br /&gt;
 35&lt;br /&gt;
 36&lt;br /&gt;
&lt;br /&gt;
The issue going from .xyz straight to .com is that ALL protein information would be lost ie. residue information that is written into the .pdb at .xyz to .pdb comversion.&lt;br /&gt;
&lt;br /&gt;
===PYP files as downloaded compared to run through TINKER===&lt;br /&gt;
&lt;br /&gt;
*TINKER removes much of the header information&lt;br /&gt;
*TINKER only preserves the 1st model of a .pdb file (may be multiple models)&lt;br /&gt;
*TINKER removes far right 3 columns of information (not essential to the .pdb file)&lt;br /&gt;
*A virtual residue in the original .pdb comprising the chromophore was omitted from the TINKER output (number 126)&lt;br /&gt;
&lt;br /&gt;
===What each TINKER search algorithm does===&lt;br /&gt;
&lt;br /&gt;
There are several programs within TINKER that may be able to help locate a global minimum (not just one near the published crystal structure) by way of searching the potential energy surface. All of these are MM methods for use in TINKER.&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
&#039;&#039;&#039;SCAN&#039;&#039;&#039;&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
A program for general conformational search of an entire potential energy surface via a basin hopping method. The program takes as input a TINKER .xyz coordinates file which is then minimized to find the first local minimum for a search list. A series of activations along various normal modes from this initial minimum are used as seed points for additional minimizations. Whenever a previously unknown local minimum is located it is added to the search list. When all minima on the search list have been subjected to the normal mode activation without locating additional new minima, the program terminates. The individual local minima are written to cycle files as they are discovered. While the SCAN program can be used on standard undeformed potential energy surfaces, we have found it to be most useful for quickly ‘‘scanning’’ a smoothed energy surface to enumerate the major basins of attraction spaning the entire surface.&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;SNIFFER&#039;&#039;&#039;&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
A program that implements the Sniffer global optimization algorithm of Butler and Slaminka, a discrete version of Griewank’s global search trajectory method. The program takes an input TINKER .xyz coordinates file and shakes it vigorously via a modified dynamics trajectory before, hopefully, settling into a low lying minimum. Some trial and error is often required as the current implementation is sensitive to various parameters and tolerances that govern the computation. At present, these parameters are not user accessible, and must be altered in the source code. However, this method can do a good job of quickly optimizing conformation within a limited range of convergence.&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;PSS&#039;&#039;&#039;&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
Implements our version of a potential smoothing and search algorithm for the global optimization of molecular conformation. An initial structure in .xyz format is first minimized in Cartesian coordinates on a series of increasingly smoothed potential energy surfaces. Then the smoothing procedure is reversed with minimization on each successive surface starting from the coordinates of the minimum on the previous surface. A local search procedure is used during the backtracking to explore for alternative minima better than the one found during the current minimization. The final result is usually a very low energy conformation or, in favorable cases, the global energy minimum conformation. The minimum energy coordinate sets found on each surface during both the forward smoothing and backtracking procedures are placed in sequentially numbered cycle files.&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
&lt;br /&gt;
TINKER also contains programs that are able to perform MM optimisations:&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
&#039;&#039;&#039;MINIMIZE&#039;&#039;&#039;&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;The MINIMIZE program performs a limited memory L-BFGS minimization of an input structure over Cartesian coordinates using a modified version of the algorithm of Jorge Nocedal. The method requires only the potential energy and gradient at each step along the minimization pathway. It requires storage space proportional to the number of atoms in the structure. The MINIMIZE procedure is recommended for preliminary minimization of trial structures to an rms gradient of 1.0 to 0.1 kcal/mole/Å. It has a relatively fast cycle time and is tolerant of poor initial structures, but converges in a slow, linear fashion near the minimum. The user supplies the name of the TINKER .xyz coordinates file and a target rms gradient value at which the minimization will terminate. Output consists of minimization statistics written to the screen or redirected to an output file, and the new coordinates written to updated .xyz files or to cycle files.&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;OPTIMIZE&#039;&#039;&#039;&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;The OPTIMIZE program performs a optimally conditioned variable metric minimization of an input structure over Cartesian coordinates using an algorithm due to William Davidon. The method does not perform line searches, but requires computation of energies and gradients as well as storage for an estimate of the inverse Hessian matrix. The program operates on Cartesian coordinates from a TINKER .xyz file. OPTIMIZE will typically converge somewhat faster and more completely than MINIMIZE. However, the need to store and manipulate a full inverse Hessian estimate limits its use to structures containing less than a few hundred atoms on workstation class machines. As with the other minimizers, OPTIMIZE needs input coordinates and an rms gradient cutoff criterion. The output coordinates are saved in updated .xyz files or as cycle files.&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;NEWTON&#039;&#039;&#039;&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt; A truncated Newton minimization method which requires potential energy, gradient and Hessian information. This procedure has significant advantages over standard Newton methods, and is able to minimize very large structures completely. Several options are provided with respect to minimization method and preconditioning of the Newton equations. The default options are recommended unless the user is familiar with the math involved. This program operates in Cartesian coordinate space and is fairly tolerant of poor input structures. Typical algorithm iteration times are longer than with nonlinear conjugate gradient or variable metric methods, but many fewer iterations are required for complete minimization. NEWTON is usually the best choice for minimizations to the 0.01 to 0.000001 kcal/mole/Å level of rms gradient convergence. Tests for directions of negative curvature can be removed, allowing NEWTON to be used for optimization to conformational transition state structures (this only works if the starting point is very close to the transition state). Input consists of a TINKER .xyz coordinates file; output is an updated set of minimized coordinates and minimization statistics.&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
&lt;br /&gt;
==Tues 17th Nov 2009==&lt;br /&gt;
&lt;br /&gt;
Comparing 2 .pdb files:&lt;br /&gt;
&lt;br /&gt;
*As downloaded from PDB&lt;br /&gt;
*Converted to .xyz in TINKER then back to .pdb&lt;br /&gt;
&lt;br /&gt;
These files were then cleaned up to compare in a terminal using the &amp;quot;diff&amp;quot; command:&lt;br /&gt;
&lt;br /&gt;
*Header information and end information removed&lt;br /&gt;
*3 columns to the right in PDB version were removed to become consistent with the TINEKR file&lt;br /&gt;
*The first column &amp;quot;ATOM&amp;quot; was removed effectively making the line numbers equal to the atom numbers in both files&lt;br /&gt;
&lt;br /&gt;
==Wed 18th Nov 2009==&lt;br /&gt;
&lt;br /&gt;
1st diff was on line 52, a HYDROGEN ATOM:&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
&#039;&#039;52  HE2 HIS A   3    13.349 -10.803   9.527&#039;&#039;&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
added to the TINKER output, not in PDB file.&lt;br /&gt;
&lt;br /&gt;
2nd diff was on line 657, a HYDROGEN ATOM:&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
&#039;&#039;657  HE2 GLU A  46	 -0.867  -5.216  -0.822&#039;&#039;&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
added to the PDB file, not in TINKER output&lt;br /&gt;
&lt;br /&gt;
3rd diff was on line 1007, a HYDROGEN ATOM (number 1008):&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
&#039;&#039;5HG  CYS A  69	 -8.909  -5.194   5.526&#039;&#039;&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
added to the TINKER output, not in PDB file.&lt;br /&gt;
&lt;br /&gt;
4th diff was  on line 1621, a HYDROGEN ATOM (number 1623):&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
&#039;&#039;HE2 HIS A 108	  4.663   7.459   6.879&#039;&#039;&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
added to the TINKER output, not in PDB file.&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Following these reconciliations, both files were exactly the same with 1911 atoms present in each. 3 atoms were removed from the TINKER file and 1 was removed from the PDB file.&lt;br /&gt;
&lt;br /&gt;
*NOTE - loading the original .pdb from PDB to Gaussview then converting to a .gjf results in an error &amp;quot;1 or more PDB atom names has a &#039; or * character which can cause problems for Gaussian&amp;quot; - this is because the PDB file has extra atoms on the end of the file to define a highlighted chromophore... After these were removed in a txt editor the .gjf save went smoothly. Infact after this removal the file will run an AMBEr single point energy calculation fine - cf running when the extra info is in the bottom.&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
&lt;br /&gt;
All differences in files were due to protonation states of atoms... not the AA backbone.&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
&lt;br /&gt;
===TINKER vs PDB===&lt;br /&gt;
[[Image:TINKER PROTON.png]]&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
[[Image:PDB_PROTON.png]]&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
The highlighted fragments represent where the respective file version is protonated and the other is not.&lt;br /&gt;
&lt;br /&gt;
==Fri 20th Nov 2009==&lt;br /&gt;
&lt;br /&gt;
In an attempt to load the 1W7S structure to TINKER and successfully convert it it a .xyz format the following procedure was undertaken:&lt;br /&gt;
&lt;br /&gt;
*&lt;br /&gt;
&lt;br /&gt;
==Mon 23rd Nov 2009==&lt;br /&gt;
&lt;br /&gt;
===Protonation of GFP at various pH conditions===&lt;br /&gt;
&lt;br /&gt;
With 230 amino acid residues, each possessing a unique pKa in GFP it is obvious that there is a huge spectrum of protonation states available at a single solvation pH. Since protons cannot be detected using X-ray diffraction, protonation states must be uncovered via a difference method. Several questions arise when thinking about protonation of the protein:&lt;br /&gt;
&lt;br /&gt;
*Which residues have the ability to be protonated/deprotonated&lt;br /&gt;
*Is the PDB structure protonated for a specific pH (Bio pH)&lt;br /&gt;
*How does the protonation of the protein change through a variety of solvation pHs&lt;br /&gt;
*Does the protonation state affect the chromophore photophysics (electronic embedding)&lt;br /&gt;
&lt;br /&gt;
Answering these questions:&lt;br /&gt;
&lt;br /&gt;
*Polar amino acids are those that can be protonated/deprotonated, these are identified by Gaussview and shown in the atom list of the molecule with the option of setting them to either protonated/deprotonated.&lt;br /&gt;
*Dont know if the current structure is protonated to any specific pH... this is unclear until protonate at certain pHs&lt;br /&gt;
*The protonation will be tested across a range of pHs to see how protonation changes&lt;br /&gt;
&lt;br /&gt;
==Tues 24th Nov 2009==&lt;br /&gt;
&lt;br /&gt;
The pKa value of a residue will indicate protonation state as a function of pH. If you know the pH of your solution then if the pKa of a given functional group is lower than this pH then the functional group will be unprotonated. If the pKa is higher than the pH then the functional group will be protonated. &#039;&#039;&#039;Beware that pKa values can be distorted by local conformational effects.&#039;&#039;&#039; The structure is protonated as though it were in pH 7 as it is loaded into Gaussview. Since hydrogens are defined in the PDB file it could be that the author added protons under these conditions.&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Histadine has 2 protonation points on the imidizole R chain:&lt;br /&gt;
&lt;br /&gt;
{|&lt;br /&gt;
| align=&amp;quot;center&amp;quot; style=&amp;quot;background:#f0f0f0;&amp;quot;|&#039;&#039;&#039;Yes-Yes&#039;&#039;&#039;&lt;br /&gt;
| align=&amp;quot;center&amp;quot; style=&amp;quot;background:#f0f0f0;&amp;quot;|&#039;&#039;&#039;Yes-No&#039;&#039;&#039;&lt;br /&gt;
| align=&amp;quot;center&amp;quot; style=&amp;quot;background:#f0f0f0;&amp;quot;|&#039;&#039;&#039;No-Yes&#039;&#039;&#039;&lt;br /&gt;
|-&lt;br /&gt;
|&amp;lt;jmol&amp;gt;&lt;br /&gt;
  &amp;lt;jmolApplet&amp;gt;&lt;br /&gt;
    &amp;lt;uploadedFileContents&amp;gt;Protein_yes_yes.mol&amp;lt;/uploadedFileContents&amp;gt;&lt;br /&gt;
    &amp;lt;color&amp;gt;#FFFFFF&amp;lt;/color&amp;gt;&lt;br /&gt;
    &amp;lt;size&amp;gt;400&amp;lt;/size&amp;gt;&lt;br /&gt;
    &amp;lt;text&amp;gt;Yes Yes&amp;lt;/text&amp;gt;&lt;br /&gt;
  &amp;lt;/jmolApplet&amp;gt;&lt;br /&gt;
&amp;lt;/jmol&amp;gt;||&amp;lt;jmol&amp;gt;&lt;br /&gt;
  &amp;lt;jmolApplet&amp;gt;&lt;br /&gt;
    &amp;lt;uploadedFileContents&amp;gt;Protein_yes_no.mol&amp;lt;/uploadedFileContents&amp;gt;&lt;br /&gt;
    &amp;lt;color&amp;gt;#FFFFFF&amp;lt;/color&amp;gt;&lt;br /&gt;
    &amp;lt;size&amp;gt;400&amp;lt;/size&amp;gt;&lt;br /&gt;
    &amp;lt;text&amp;gt;Yes Yes&amp;lt;/text&amp;gt;&lt;br /&gt;
  &amp;lt;/jmolApplet&amp;gt;&lt;br /&gt;
&amp;lt;/jmol&amp;gt;||&amp;lt;jmol&amp;gt;&lt;br /&gt;
  &amp;lt;jmolApplet&amp;gt;&lt;br /&gt;
    &amp;lt;uploadedFileContents&amp;gt;Protein_no_yes.mol&amp;lt;/uploadedFileContents&amp;gt;&lt;br /&gt;
    &amp;lt;color&amp;gt;#FFFFFF&amp;lt;/color&amp;gt;&lt;br /&gt;
    &amp;lt;size&amp;gt;400&amp;lt;/size&amp;gt;&lt;br /&gt;
    &amp;lt;text&amp;gt;Yes Yes&amp;lt;/text&amp;gt;&lt;br /&gt;
  &amp;lt;/jmolApplet&amp;gt;&lt;br /&gt;
&amp;lt;/jmol&amp;gt;&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
The imidazole sidechain of histidine has a pKa of approximately 6, and overall, the amino acid has a pKa of 7.6. This means that at physiologically relevant pH values, relatively small shifts in pH will change its average charge. Below a pH of 6, the imidazole ring is mostly protonated as described by the Henderson–Hasselbalch equation. When protonated, the imidazole ring bears two NH bonds and has a positive charge. The positive charge is equally distributed between both nitrogens and can be represented with two equally important resonance structures.&lt;br /&gt;
&lt;br /&gt;
===Protonation===&lt;br /&gt;
&lt;br /&gt;
At pH 7 the PDB file protonations are correct for all residues. Moving to pH 5 the histadine residues become protonated (pKa 6)- this is the only change. At more alkaline pHs it would take pH 9.1 to make any effect (deprotonating CYS and then TYR at 9.7)&lt;br /&gt;
&lt;br /&gt;
===Mini-Biomolecule===&lt;br /&gt;
&lt;br /&gt;
To test ONIOM calculations vs pure DFT methods and other calculation types on a bio system that is smaller than GFP (computationally less expensive, can streamline the real calcs)&lt;br /&gt;
&lt;br /&gt;
Trying a molecule analogous to the example given in the ONIOM tutorial...&lt;br /&gt;
&lt;br /&gt;
&amp;lt;jmol&amp;gt;&lt;br /&gt;
  &amp;lt;jmolApplet&amp;gt;&lt;br /&gt;
    &amp;lt;uploadedFileContents&amp;gt;Biomolecule1.mol&amp;lt;/uploadedFileContents&amp;gt;&lt;br /&gt;
    &amp;lt;color&amp;gt;#FFFFFF&amp;lt;/color&amp;gt;&lt;br /&gt;
    &amp;lt;size&amp;gt;400&amp;lt;/size&amp;gt;&lt;br /&gt;
    &amp;lt;text&amp;gt;Yes Yes&amp;lt;/text&amp;gt;&lt;br /&gt;
  &amp;lt;/jmolApplet&amp;gt;&lt;br /&gt;
&amp;lt;/jmol&amp;gt;&lt;br /&gt;
&lt;br /&gt;
==Thurs 26th Nov 2009==&lt;br /&gt;
&lt;br /&gt;
===Checking the GFP input will run===&lt;br /&gt;
&lt;br /&gt;
Procedure to get 1WS7 to run an AMBER energy calculation:&lt;br /&gt;
&lt;br /&gt;
*Load the PDB file without water&lt;br /&gt;
*Try opt amber but will fail due to missing parameters (all in the chromophore)&lt;br /&gt;
*Set these parameters to 0 since they cancel in ONIOM&lt;br /&gt;
*Freeze the atotms involving missing paramters&lt;br /&gt;
*Energy AMBER will now run for the file - NO CHARGES - 9.07881632au&lt;br /&gt;
*Open the Atom Editor, Edit, MM charges, Assign all atoms&lt;br /&gt;
*Manually edit any missing charges&lt;br /&gt;
*Energy AMBER will now run including charges (basic AMBER94 charges) - 0.260391165au&lt;br /&gt;
&lt;br /&gt;
Charges in this file did not sum to an integer number - the only residue with a non integer charge was no.64 with a charge of 0.61160000&lt;br /&gt;
&lt;br /&gt;
 MM sanity checks:&lt;br /&gt;
 All charges sum to:               -5.38840000&lt;br /&gt;
 Charges of atoms sum to:          -5.38840000&lt;br /&gt;
 Layer 1 (S) has     34 atoms and charge   0.61160000 sum=   0.61160000&lt;br /&gt;
 Layer 2 (R) adds  3574 atoms and charge  -6.00000000 sum=  -5.38840000&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
 WARNING -- The following centers have non-zero partial&lt;br /&gt;
 charges, but zero Vanderwaals parameters. These centers&lt;br /&gt;
 can collapse onto other centers with non-zero charge&lt;br /&gt;
 Center    Charge&lt;br /&gt;
   1833  0.423900&lt;br /&gt;
   1890  0.410200&lt;br /&gt;
   2029  0.427500&lt;br /&gt;
   2043  0.427500&lt;br /&gt;
   2080  0.410200&lt;br /&gt;
   2089  0.399200&lt;br /&gt;
   2123  0.410200&lt;br /&gt;
   2179  0.410200&lt;br /&gt;
   2186  0.410200&lt;br /&gt;
   2260  0.410200&lt;br /&gt;
   2287  0.410200&lt;br /&gt;
   2294  0.410200&lt;br /&gt;
   2347  0.427500&lt;br /&gt;
   2369  0.399200&lt;br /&gt;
   2469  0.427500&lt;br /&gt;
   2508  0.399200&lt;br /&gt;
   2551  0.410200&lt;br /&gt;
   2619  0.399200&lt;br /&gt;
   2639  0.410200&lt;br /&gt;
   2714  0.410200&lt;br /&gt;
   2924  0.399200&lt;br /&gt;
   2939  0.399200&lt;br /&gt;
   2950  0.427500&lt;br /&gt;
   2981  0.399200&lt;br /&gt;
   3180  0.427500&lt;br /&gt;
   3233  0.399200&lt;br /&gt;
   3262  0.410200&lt;br /&gt;
   3361  0.399200&lt;br /&gt;
   3377  0.427500&lt;br /&gt;
   3384  0.410200&lt;br /&gt;
   3397  0.427500&lt;br /&gt;
   3418  0.427500&lt;br /&gt;
   3562  0.410200&lt;br /&gt;
   3593  0.410200&lt;br /&gt;
   3608  0.410200&lt;br /&gt;
&lt;br /&gt;
* oniom(pm6:amber=hardfirst) geom=connectivity was run successfully, resulting in an energy of 0.210085730788au&lt;br /&gt;
&lt;br /&gt;
This still resulted in some charge problems:&lt;br /&gt;
&lt;br /&gt;
 MM sanity checks:&lt;br /&gt;
 All charges sum to:               &#039;&#039;&#039;-5.38840000&#039;&#039;&#039;&lt;br /&gt;
 Charges of atoms sum to:           &#039;&#039;&#039;0.79320000&#039;&#039;&#039;&lt;br /&gt;
 Residue  63 PDB Number    &#039;&#039;&#039;64_A PHE has&#039;&#039;&#039;     1 atoms, and a charge of   0.59730000&lt;br /&gt;
 Residue  64 PDB Number    &#039;&#039;&#039;66_A CSY has&#039;&#039;&#039;    34 atoms, and a charge of   0.61160000&lt;br /&gt;
 Residue  65 PDB Number    &#039;&#039;&#039;68_A VAL has&#039;&#039;&#039;     1 atoms, and a charge of  -0.41570000&lt;br /&gt;
 WARNING -- The following centers have non-zero partial&lt;br /&gt;
 charges, but zero Vanderwaals parameters. These centers&lt;br /&gt;
 can collapse onto other centers with non-zero charge&lt;br /&gt;
 Center    Charge&lt;br /&gt;
   &#039;&#039;&#039;3608  0.410200&#039;&#039;&#039;&lt;br /&gt;
 MMInit generated parameter data with length LenPar=       40470.&lt;br /&gt;
 Standard basis: Dummy (5D, 7F)&lt;br /&gt;
 There are    36 symmetry adapted basis functions of A   symmetry.&lt;br /&gt;
 Integral buffers will be    262144 words long.&lt;br /&gt;
 Raffenetti 1 integral format.&lt;br /&gt;
 Two-electron integral symmetry is turned on.&lt;br /&gt;
    36 basis functions,    36 primitive gaussians,    36 cartesian basis functions&lt;br /&gt;
    27 alpha electrons       27 beta electrons&lt;br /&gt;
       nuclear repulsion energy       200.9124124811 Hartrees.&lt;br /&gt;
 NAtoms= 3608 NActive=   36 NUniq=   36 SFac= 7.50D-01 NAtFMM=   80 NAOKFM=F Big=F&lt;br /&gt;
 AMBER calculation of energy.&lt;br /&gt;
 Energy=    0.110474559     NIter=   0.&lt;br /&gt;
 Dipole moment=       3.390901       5.889448      -2.505476&lt;br /&gt;
 ONIOM: saving gridpoint  1&lt;br /&gt;
 ONIOM: restoring gridpoint  3&lt;br /&gt;
 ONIOM: calculating energy.&lt;br /&gt;
 ONIOM: gridpoint  1 method:  low   system:  model energy:     0.110474559177&lt;br /&gt;
 ONIOM: gridpoint  2 method:  high  system:  model energy:     0.060169124949&lt;br /&gt;
 ONIOM: gridpoint  3 method:  low   system:  real  energy:     0.260391165016&lt;br /&gt;
 ONIOM: extrapolated energy =       0.210085730788&lt;br /&gt;
 ONIOM: calculating electric field derivatives.&lt;br /&gt;
 ONIOM: Integrating ONIOM file  4 number   619&lt;br /&gt;
 ONIOM: Dipole        =-2.76174164D+01-1.34762322D+02 3.99239712D+01&lt;br /&gt;
 ONIOM: Dipole moment (Debye):&lt;br /&gt;
    X=   -70.1965    Y=  -342.5316    Z=   101.4766  Tot=   364.0782&lt;br /&gt;
 ONIOM: Integrating ONIOM file  5 number   695&lt;br /&gt;
&lt;br /&gt;
An ONIOM calculation was run using a DFT method on the high level and AMBER on the low level:  &#039;&#039;oniom(b3lyp/3-21g:amber=hardfirst) geom=connectivity&#039;&#039; which ran successfully...&lt;br /&gt;
&lt;br /&gt;
===Charges===&lt;br /&gt;
&lt;br /&gt;
Charges in the protein are important for the calculation that will be run on GFP eventually since they provide the electrostatic field or &amp;quot;atmosphere&amp;quot; which surrounds the chromophore, polarizing the electron density used by the DFT method to evaluate the chromophore. Gaussian offers 2 types of influence which the MM portion can exert on the QM part:&lt;br /&gt;
&lt;br /&gt;
* Mechanical Embedding&lt;br /&gt;
* Electronic Embedding&lt;br /&gt;
&lt;br /&gt;
Mechanical embedding does not use any electronic components to affect the central high region. Electronic embedding does use the charge generated in the MM region to polarize the high region.&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
At pH 7 the GFP molecule is in a 6- charge state according to assigning AMBER charges and summing the residue charges (available in the out file). 3 options were considered at this stage for treating charges:&lt;br /&gt;
&lt;br /&gt;
* Ignore charges on the entire molecule&lt;br /&gt;
* Use AMBER94 assigned partial charges on the low region&lt;br /&gt;
* Use Qeq to assign charges based on an overall charge of -6&lt;br /&gt;
&lt;br /&gt;
The first option can be discarded since the electronic atmosphere is of particular interest here. The second 2 options are to be investigated. See below (on a small Biomolecule)&lt;br /&gt;
&lt;br /&gt;
ON a 4 residue chain (in mini biomolecule folder) - As in the GFP tutorial on the wiki, a energy calculation was submitted with the keyword amber=qeq to get back the charges across the molecule. (not entirely sure what this does yet). AS a test of the effect of charges on a molecule a test molecule was compiled in TINKER consisting of 4 residues with 0 overall charges....&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;jmol&amp;gt;&lt;br /&gt;
  &amp;lt;jmolAppletButton&amp;gt;&lt;br /&gt;
    &amp;lt;uploadedFileContents&amp;gt;Protein1_AMBER_OPT_qeq.mol&amp;lt;/uploadedFileContents&amp;gt;&lt;br /&gt;
    &amp;lt;text&amp;gt;Test Protien&amp;lt;/text&amp;gt;&lt;br /&gt;
    &amp;lt;color&amp;gt;#FFFFFF&amp;lt;/color&amp;gt;&lt;br /&gt;
    &amp;lt;size&amp;gt;600&amp;lt;/size&amp;gt;&lt;br /&gt;
    &amp;lt;/jmolAppletButton&amp;gt;&lt;br /&gt;
&amp;lt;/jmol&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
&lt;br /&gt;
{|&lt;br /&gt;
|No Charges|| 0.01940311au&lt;br /&gt;
|-&lt;br /&gt;
|Amber Charges|| -0.18117151au&lt;br /&gt;
|-&lt;br /&gt;
|Qeq Charges|| -0.23038199au&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
Clearly the Qeq shows the lowest energy - the considerable differences in energy show that charges are an important part of the input.&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
&lt;br /&gt;
===Testing Charges and EE on a small system===&lt;br /&gt;
&lt;br /&gt;
&amp;lt;jmol&amp;gt;&lt;br /&gt;
  &amp;lt;jmolAppletButton&amp;gt;&lt;br /&gt;
    &amp;lt;uploadedFileContents&amp;gt;Chromopre_molecule_B3LYP_6-31G(d).mol&amp;lt;/uploadedFileContents&amp;gt;&lt;br /&gt;
    &amp;lt;script&amp;gt;spin on&amp;lt;/script&amp;gt;&lt;br /&gt;
    &amp;lt;text&amp;gt;Test Protien&amp;lt;/text&amp;gt;&lt;br /&gt;
    &amp;lt;color&amp;gt;#FFFFFF&amp;lt;/color&amp;gt;&lt;br /&gt;
    &amp;lt;size&amp;gt;600&amp;lt;/size&amp;gt;&lt;br /&gt;
    &amp;lt;/jmolAppletButton&amp;gt;&lt;br /&gt;
&amp;lt;/jmol&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Energies of the optimisations using various charge approximations and with/without EE&lt;br /&gt;
{|class=&amp;quot;wikitable&amp;quot; border=&amp;quot;1&amp;quot;&lt;br /&gt;
!Method||Mechanical Embedding||Electronic Embedding||Computational Time (mechanical embedding)||Computational Time (electronic embedding)&lt;br /&gt;
|-&lt;br /&gt;
|B3LYP 6-31G*||-1406.04||N/A||11 hours 18 minutes 44.7 seconds||11 hours 18 minutes 44.7 seconds&lt;br /&gt;
|-&lt;br /&gt;
|ONIOM B3LYP 6-31G*:AMBER (No Charges)||-783.2225||-783.2223||2 hours 13 minutes  4.1 seconds||&lt;br /&gt;
|-&lt;br /&gt;
|ONIOM B3LYP 6-31G*:AMBER (Amber Charges)|| || ||  ||&lt;br /&gt;
|-&lt;br /&gt;
|ONIOM B3LYP 6-31G*:AMBER (Qeq Charges)||-783.2467|| ||2 hours  6 minutes  8.6 seconds ||&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
NB. &lt;br /&gt;
* B3LYP 6-31G* and ONIOM B3LYP 6-31G*:AMBER (No Charges) were from the same starting geometry.&lt;br /&gt;
* ONIOM B3LYP 6-31G*:AMBER (Amber Charges) and ONIOM B3LYP 6-31G*:AMBER (Qeq Charges) were from the same starting geometry.&lt;br /&gt;
* All Electronic embedding is from the same starting point - Can compare times.&lt;br /&gt;
&lt;br /&gt;
Dipole moments using various charge approximations and with/without EE&lt;br /&gt;
{|class=&amp;quot;wikitable&amp;quot; border=&amp;quot;1&amp;quot;&lt;br /&gt;
!Method||Dipole moment (No EE)||Dipole moment (With EE)&lt;br /&gt;
|-&lt;br /&gt;
|B3LYP 6-31G*||7.4914||N/A&lt;br /&gt;
|-&lt;br /&gt;
|ONIOM B3LYP 6-31G*:AMBER (No Charges)||3.2328||3.2241&lt;br /&gt;
|-&lt;br /&gt;
|ONIOM B3LYP 6-31G*:AMBER (Amber Charges)||5.2794||5.2865&lt;br /&gt;
|-&lt;br /&gt;
|ONIOM B3LYP 6-31G*:AMBER (Qeq Charges)||8.173||8.1654&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
==2nd Dec 2009==&lt;br /&gt;
&lt;br /&gt;
A single point energy of 1w7s was run with the additional qeq keyword to return partial charges (overall charge set to -6)&lt;br /&gt;
This new molecule was submitted for AMBER opt but failed after around 700 iterations... this could be due to the error:&lt;br /&gt;
&lt;br /&gt;
 WARNING -- The following centers have non-zero partial&lt;br /&gt;
 charges, but zero Vanderwaals parameters. These centers&lt;br /&gt;
 can collapse onto other centers with non-zero charge&lt;br /&gt;
&lt;br /&gt;
When the atoms defined in this error had their partial charges set to 0.00, the result was a fully converged AMBER optimisation:&lt;br /&gt;
&lt;br /&gt;
*Frozen chromophore.&lt;br /&gt;
*pH 7 giving a charge of -6&lt;br /&gt;
*Charges assigned using Qeq method&lt;br /&gt;
*Zero Van der Waals parameter atoms partial charges set to 0.00&lt;br /&gt;
&lt;br /&gt;
Looking at what atoms are not defined in terms of Van der Waals paramters it is ALWAYS an aloholic proton that is not parameterised...&lt;br /&gt;
&lt;br /&gt;
Tutorial used AMBER charges then ran Qeq for the missing charges... i think this is what she thought she did... gaussian &amp;quot;amber=qeq&amp;quot; will overwrite all charges.. need to use &amp;quot;amber=uncharged&amp;quot;&lt;br /&gt;
so that we dont overwrite the amber charges...&lt;br /&gt;
&lt;br /&gt;
In effect full Qeq charges were assigned in the tutorial. Also overall charge is 0 - will check this.&lt;br /&gt;
&lt;br /&gt;
===The GFP tutorial===&lt;br /&gt;
&lt;br /&gt;
*AMBER charges (AMBER94 charges) were assigned to the GFP molecule in the tutorial&lt;br /&gt;
*she notices correctly that some are missing in the chromophore and so decides to assign them via Qeq&lt;br /&gt;
*Runs Qeq on the GFP molecule with a &#039;&#039;&#039;0 charge&#039;&#039;&#039; which is wrong! When run the AMBER charges and look at the log file we can see that the overall charge should be &#039;&#039;&#039;+4&#039;&#039;&#039; for that protonation state.&lt;br /&gt;
*Since Qeq is an equilibrium method, partial charges are solved with the constraint that their sum must equal the overall charge.&lt;br /&gt;
*These Qeq charges should therefore not fit with an integer sum when pasted into the AMBER charges.&lt;br /&gt;
&lt;br /&gt;
===Qeq versus AMBER94 charges===&lt;br /&gt;
&lt;br /&gt;
Ogawa et al. state:&lt;br /&gt;
&lt;br /&gt;
 Geometry optimizations of a crambin protein were performed using force fields combined with the&lt;br /&gt;
 CQEq. The CQEq charges were found to be more comparable to the Mulliken charges obtained by the ab initio molecular orbital&lt;br /&gt;
 calculation than the fixed charges used in the AMBER force field. Furthermore, the CQEq charges give more realistic charge redistribution&lt;br /&gt;
 between amino acids in the crambin.&lt;br /&gt;
&lt;br /&gt;
Mini Biomolecule calculations:&lt;br /&gt;
&lt;br /&gt;
{|class=&amp;quot;wikitable&amp;quot; border=&amp;quot;1&amp;quot;&lt;br /&gt;
!Method||Dipole moment (No EE)||Dipole moment (With EE)&lt;br /&gt;
|-&lt;br /&gt;
|B3LYP 6-31G*||7.4914||N/A&lt;br /&gt;
|-&lt;br /&gt;
|ONIOM B3LYP 6-31G*:AMBER (No Charges)||3.2328||3.2241&lt;br /&gt;
|-&lt;br /&gt;
|ONIOM B3LYP 6-31G*:AMBER (Amber Charges)||5.2794||5.2865&lt;br /&gt;
|-&lt;br /&gt;
|ONIOM B3LYP 6-31G*:AMBER (Qeq Charges)||8.173||8.1654&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
Clearly the Qeq charges are in better agreement with the DFT method than no or AMBER charges.&lt;br /&gt;
&lt;br /&gt;
These frequencies are calulated using mechanical embedding!&lt;br /&gt;
&lt;br /&gt;
{|class=&amp;quot;wikitable&amp;quot; border=&amp;quot;1&amp;quot;&lt;br /&gt;
!Method||C=O||C=C&lt;br /&gt;
|-&lt;br /&gt;
|B3LYP 6-31G*||1799.65||1701.86&lt;br /&gt;
|-&lt;br /&gt;
|ONIOM B3LYP 6-31G*:AMBER (No Charges)||1802.25||1703.63&lt;br /&gt;
|-&lt;br /&gt;
|ONIOM B3LYP 6-31G*:AMBER (Amber Charges)||1803.37||1701.67&lt;br /&gt;
|-&lt;br /&gt;
|ONIOM B3LYP 6-31G*:AMBER (Qeq Charges)||1800.18||1702.86&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
As a result it seems obvious that a Qeq charge assignment in constraint by the correct total charge gives results most comparable to the pure QM method (both dipoles and frequencies)&lt;br /&gt;
&lt;br /&gt;
==3rd Dec 2009==&lt;br /&gt;
===ESP Charges===&lt;br /&gt;
&lt;br /&gt;
ESP calculates the expectation values of the electrostatic potential of a molecule on a uniform distribution of points. The resultant ESP surface is then fit to atom centered charges that best reproduce the electron distribution in a least-squares method. The set of points defining the default surface is generated according to the algorithm of Connolly.&lt;br /&gt;
&lt;br /&gt;
ESP charges thus require a QM calculation to create the molecular electrostatic potential used to fit atom centered charges. The size of GFP means that this is not an efficient method for use in this case?&lt;br /&gt;
&lt;br /&gt;
===Qeq Charges===&lt;br /&gt;
&lt;br /&gt;
The charge equilibration (QEq) method treats the atomic charge variation through the equilibration of electronegativities at atomic sites. The advantage of the QEq method is that the polarization effect in MM is taken into account by variation of the charge distribution in the molecules. The atomic charge variation relates to the intramolecular geometrical changes.&lt;br /&gt;
&lt;br /&gt;
===AMBER charges===&lt;br /&gt;
&lt;br /&gt;
==8th Dec 2009==&lt;br /&gt;
&lt;br /&gt;
A input creation page is now avaliable:&lt;br /&gt;
&lt;br /&gt;
https://www.ch.ic.ac.uk/wiki/index.php/AndyForesterGFP&lt;br /&gt;
&lt;br /&gt;
The final input file had protein information pasted in using text to columns in excel and now displays PDB residue information and PDB secondary structure information (nice cartoon pictures in ChemBio3D!)&lt;br /&gt;
&lt;br /&gt;
Interesting is a paper that uses TD-DFT on GFP (in my papers folder on the mac), the group uses a 3 stage minimisation method from the 1GFL PDB structure:&lt;br /&gt;
&lt;br /&gt;
* Freeze all atoms in the backbone of the protein and optimise after adding hydrogens&lt;br /&gt;
* Optimise the chromophore at a semi-empirical level&lt;br /&gt;
* They cut out a chromophore section but i could then perform a full QM/MM geometry optimisation&lt;br /&gt;
&lt;br /&gt;
===A more charged mini molecule to test electronic embedding===&lt;br /&gt;
&lt;br /&gt;
&amp;lt;jmol&amp;gt;&lt;br /&gt;
  &amp;lt;jmolAppletButton&amp;gt;&lt;br /&gt;
    &amp;lt;uploadedFileContents&amp;gt;More_charge_B3LYP_6_31G(d).mol&amp;lt;/uploadedFileContents&amp;gt;&lt;br /&gt;
    &amp;lt;text&amp;gt;The molecule with 2+ charge&amp;lt;/text&amp;gt;&lt;br /&gt;
    &amp;lt;color&amp;gt;#FFFFFF&amp;lt;/color&amp;gt;&lt;br /&gt;
    &amp;lt;size&amp;gt;600&amp;lt;/size&amp;gt;&lt;br /&gt;
    &amp;lt;/jmolAppletButton&amp;gt;&lt;br /&gt;
&amp;lt;/jmol&amp;gt;&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
&lt;br /&gt;
The molecule was submitted with 3 different job lines:&lt;br /&gt;
&lt;br /&gt;
* b3lyp 6-31g(d)&lt;br /&gt;
* oniom(b3lyp 6-31g(d):amber=(hardfirst,qeq))&lt;br /&gt;
* oniom(b3lyp 6-31g(d):amber=(hardfirst,qeq))=embedcharge&lt;br /&gt;
&lt;br /&gt;
Opt and Freq jobs were completed successfully. The charge breakdown input was:&lt;br /&gt;
&lt;br /&gt;
* Low Real 2 1&lt;br /&gt;
* High Model 0 1&lt;br /&gt;
* Low Model 0 1&lt;br /&gt;
&lt;br /&gt;
The aim of the calculations was to test electronic embedding versus the mechanical embedding. A paper suggests that there is a large difference in accuracy between the two methods. Qeq charge equilibration was used in both cases as it has already been proved to be much more effective than AMBER assigned charges. &#039;&#039;&#039;It has consistently overestimated the dipole moment though&#039;&#039;&#039;.&lt;br /&gt;
&lt;br /&gt;
{|class=&amp;quot;wikitable&amp;quot; border=&amp;quot;1&amp;quot;&lt;br /&gt;
!Method||Dipole moment&lt;br /&gt;
|-&lt;br /&gt;
|B3LYP 6-31G*||8.02&lt;br /&gt;
|-&lt;br /&gt;
|ONIOM B3LYP 6-31G*:AMBER (ME)||12.22&lt;br /&gt;
|-&lt;br /&gt;
|ONIOM B3LYP 6-31G*:AMBER (EE)||12.23&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
These results are odd since we expect the EE to have some effect on the charge distribution in the molecule (it is polarising the QM region) comparing the frequencies as a further check:&lt;br /&gt;
&lt;br /&gt;
{|class=&amp;quot;wikitable&amp;quot; border=&amp;quot;1&amp;quot;&lt;br /&gt;
!Method||C=O||C=C&lt;br /&gt;
|-&lt;br /&gt;
|B3LYP 6-31G*||1764||1636&lt;br /&gt;
|-&lt;br /&gt;
|ONIOM B3LYP 6-31G*:AMBER (ME)||1797||1697&lt;br /&gt;
|-&lt;br /&gt;
|ONIOM B3LYP 6-31G*:AMBER (EE)||1797||1697&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
Again the EE has made no difference to the frequency of various strong vibrations in the molecule. The conclusion is thus either:&lt;br /&gt;
&lt;br /&gt;
* EE has no real effect on the moleucule&lt;br /&gt;
* EE is not working properly&lt;br /&gt;
&lt;br /&gt;
Assuming the latter further investigation discovered that the meechanism of EE is determined using scaling parameters for MM charges during electronic embedding in the QM calculations. As a default the charges are scaled 555500 using the nomenclature ScaleCharge=ijklmn where n is closest to the QM region (massively anti intuitive). The input is read left to right and not all scaling factors need be defined. Further to this, the value given is then multiplied by 0.2 to arrive at the actual scaling coefficient. The result is thus for an atom:&lt;br /&gt;
&lt;br /&gt;
* 1 bond from the QM region (n) = 0.2n           Default is 0&lt;br /&gt;
* 2 bonds from the QM region (m) = 0.2m       Default is 0&lt;br /&gt;
* 3 bonds from the QM region (l) = 0.2l            Default is 5&lt;br /&gt;
* 4 bonds from the QM region (k) = 0.2k          Default is 5&lt;br /&gt;
* 5 bonds from the QM region (j) = 0.2j            Default is 5&lt;br /&gt;
* 6 bonds from the QM region (i) = 0.2i            Default is 5&lt;br /&gt;
&lt;br /&gt;
So all the atoms (and partial charges) in the MM region in the test molecule were being scaled to 0 when performing the EE. This explains the fact that no difference was uncovered in the previous calculation set.&lt;br /&gt;
&lt;br /&gt;
A new calculation was submitted with the command scalecharge=555 so that all charges in the MM part will be scaled to 1 (0.2xn,m,l)&lt;br /&gt;
&lt;br /&gt;
===Charges...again===&lt;br /&gt;
&lt;br /&gt;
So since i deleted the charges on the atoms where they can collapse, the overall charge is now -19.6 from the original -6 minus the 13.6 in charge deleted. REALLY ANNOYING! This is a huge difference in charges so will have to be solved. One easy option would be to set the involved atoms to 0 then run QEQ on the atoms with unassigned charges... essentially distributing the -6 charge through all atoms except those with 0 VdW parameters.&lt;br /&gt;
&lt;br /&gt;
==Tues 15th Dec 2009==&lt;br /&gt;
&lt;br /&gt;
The more charged molecule failed once the charges were scaled to 1 since SCF convergence was not attained.&lt;br /&gt;
&lt;br /&gt;
==Mon 25th Jan 2010==&lt;br /&gt;
&lt;br /&gt;
TAO tool publication was very good read... has a good suggestion of an optimisation procedure:&lt;br /&gt;
&lt;br /&gt;
*Take the crystal structure&lt;br /&gt;
*Add Hydrogens&lt;br /&gt;
*Determine protonation&lt;br /&gt;
*MM geometry optimisation&lt;br /&gt;
*(MD?)&lt;br /&gt;
*(Look at several samples of the MD)&lt;br /&gt;
*Optimise a final structure&lt;br /&gt;
&lt;br /&gt;
==Thurs 28th Jan 2010==&lt;br /&gt;
&lt;br /&gt;
Tao Tools requires preprocess within AMBER tools to run the scripts correctly. Does not have HIS on the residue database... Amber defines the protonation state within its residue naming scheme... HIS is either HIE, HIP, HID depending on its protonation state. [http://www.poissonboltzmann.org/pdb2pqr/documentation/programmers-guide/naming-scheme]&lt;br /&gt;
&lt;br /&gt;
==Thurs 4th Jan 2010==&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;Preprocessing&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
To get TAO tools and also for completeness it is necessary to preprocess the PDB file before attempting to run ONIOM calculations, this includes:&lt;br /&gt;
&lt;br /&gt;
* Adding hydrogens&lt;br /&gt;
* Create a custom residue file for the chromophore&lt;br /&gt;
* Convert from PDB to gjf ONIOM input&lt;br /&gt;
&lt;br /&gt;
Ambertools are extremely helpful with this preprocessing especially since amber will be the low level theory used throughout this project. Within ambertools are a number of useful programs:&lt;br /&gt;
&lt;br /&gt;
* antechamber - has a parameterlookup function within amber and GAFF parameter library files.&lt;br /&gt;
* xLeap - builds the protein from its constituent residues - adds hydrogens according to library files - allows a parm file to be built to add custom residues.&lt;br /&gt;
&lt;br /&gt;
I will be working on using these 2 programs to create a full MM input file which is fully defined and correct.&lt;br /&gt;
&lt;br /&gt;
===xLeap===&lt;br /&gt;
&lt;br /&gt;
To load xLeap we must use X11 to get the graphical interface and the ssh -Y command to allow tunneling back to X11. Also the command line to start xLeap should be:  &#039;&#039;xleap -s -f /apps/ambertools/amber11/dat/leap/cmd/leaprc.ff03&#039;&#039;. This loads the ff03 parameter file and results in the start up of the graphical interface with:&lt;br /&gt;
&lt;br /&gt;
 Welcome to LEaP!&lt;br /&gt;
 Sourcing leaprc: /amber/dat/leap/cmd/leaprc&lt;br /&gt;
 Log file: ./leap.log&lt;br /&gt;
 Loading parameters: /amber/dat/leap/parm/parmME.dat&lt;br /&gt;
 Loading library: /amber/dat/leap/lib/all_nucleic94.lib&lt;br /&gt;
 Loading library: /amber/dat/leap/lib/all_aminoME.lib&lt;br /&gt;
 Loading library: /amber/dat/leap/lib/all_aminoctME.lib&lt;br /&gt;
 Loading library: /amber/dat/leap/lib/all_aminontME.lib&lt;br /&gt;
 Loading library: /amber/dat/leap/lib/ions94.lib&lt;br /&gt;
 Loading library: /amber/dat/leap/lib/water.lib&lt;br /&gt;
&lt;br /&gt;
This shows that the residue libraries have been loaded. We are then free to load a pdb file which will be read by xLeap and built according to the residues! In addition to the residue names xleap also requires information about the chain connectivity. Residues can be connected to other residues from both sides, only one side or not at all. xleap is fairly simple in that it expects that all residues in the pdb file are connected in the order in which they are listed unless they are separated by a &amp;quot;TER&amp;quot; card. In most cases where a single protein chains is being investigated like here - the TER card should end the residue information.&lt;br /&gt;
&lt;br /&gt;
To load a pdb use (ignore the &amp;lt;&amp;gt;) &#039;&#039;&amp;lt;name for use in xLeap&amp;gt;=loadpdb &amp;quot;&amp;lt;name of file&amp;gt;&amp;quot;&#039;&#039;. Make sure that the file is in your PATH else fully define the path in the filename.&lt;br /&gt;
&lt;br /&gt;
The pdb should then be read into xLeap and built. At the end of the output we get (I loaded ChainA of the 1W7S without any water or hydrogens):&lt;br /&gt;
&lt;br /&gt;
 Leap added 1765 missing atoms according to residue templates:&lt;br /&gt;
       1 Heavy&lt;br /&gt;
       1764 H / lone pairs&lt;br /&gt;
 The file contained 21 atoms not in residue templates&lt;br /&gt;
&lt;br /&gt;
This shows that Leap added hydrogens automatically and also a missing &amp;quot;heavy&amp;quot; atom (one which is not a H or lp):&lt;br /&gt;
&lt;br /&gt;
 Added missing heavy atom: .R&amp;lt;CHIE 229&amp;gt;.A&amp;lt;OXT 18&amp;gt;&lt;br /&gt;
&lt;br /&gt;
For amino acids, the traditional order of atom records within a residue is N, CA, C, O, followed by the sidechain atoms (CB, CG1, CG2 . . . ) in order first of increasing remoteness, and then branch. The extra oxygen at the carboxyl terminal is called &amp;quot; OXT&amp;quot;, and appears after all sidechain atoms.&lt;br /&gt;
&lt;br /&gt;
Now that we have established that there are 21 atoms that are in the non standard residue region we need to define a custom residue which describes this region (chromophore). This involves creating a prep file....&lt;br /&gt;
&lt;br /&gt;
I first drew up the undefined section of GFP (chromophore) in Gaussview5 and saved as a.pdb then:&lt;br /&gt;
&lt;br /&gt;
* Load in antechamber and convert to mol2 format using &#039;&#039;antechamber -i &amp;lt;input&amp;gt; -fi pdb -o &amp;lt;output&amp;gt; -fo mol2 -c bcc&#039;&#039;&lt;br /&gt;
* Use parmchk to make sure the molecule was defined in the GAFF force field - kicks out a .frcmod file with missing parameters (including dihedrals) - nice :D!&lt;br /&gt;
* Manually add the &amp;quot;1CSY&amp;quot; where a standalone &amp;quot;0&amp;quot; is in the mol2 file so Leap can read it&lt;br /&gt;
* Load the gaff parameter file to Leap using &#039;&#039;source leaprc.gaff&#039;&#039;&lt;br /&gt;
* Load the chromophore_CSY_full.mol2 to Leap using &#039;&#039;CSY=loadmol2 &amp;quot;ChromophoreGFP_CSY_full.mol2&amp;quot;&#039;&#039;&lt;br /&gt;
* Check the residue using &#039;&#039;check CSY&#039;&#039; - Notice missing parameters are there.&lt;br /&gt;
* Load the ChromophoreGFP_CSY_full.frcmod file created by parmchk using &#039;&#039;loadamberparams ChromophoreGFP_CSY_full.frcmod&#039;&#039;&lt;br /&gt;
* Now check the CSY residue again using &#039;&#039;check CSY&#039;&#039; - No Problems! woo hoo! Now I can create amber parameters :D&lt;br /&gt;
&lt;br /&gt;
==Tues 9th Feb 2010==&lt;br /&gt;
&lt;br /&gt;
===RESP Fitting===&lt;br /&gt;
&lt;br /&gt;
To get charges that are like those of the standard amber force field an RESP fitting method has to be used. This involves 3 steps:&lt;br /&gt;
&lt;br /&gt;
* First, the molecule studied is optimized to determine a stable minimum (using Gaussian).&lt;br /&gt;
* Then, this minimized structure is used to calculate a Molecular Electrostatic Potential (MEP) on a three-dimensional grid (using Gaussian).&lt;br /&gt;
* Finally, this grid is exported into the &amp;quot;RESP program&amp;quot; which is used to fit atom-centered charges to the MEP&lt;br /&gt;
&lt;br /&gt;
To assist in this procedure - I am using a program called RED - [http://q4md-forcefieldtools.org/RED/] which can be used in all 3 stages...&lt;br /&gt;
&lt;br /&gt;
* Ante_RED is used to prepare Gaussian input files for the RESP fit&lt;br /&gt;
* RED III is used to fit atom centered charges&lt;br /&gt;
&lt;br /&gt;
The procedure used to create the CSY file for RESP fitting is as follows:&lt;br /&gt;
&lt;br /&gt;
* Cut the CSY residue from the 1W7S protein structure and cap terminal ends with methyl groups&lt;br /&gt;
* Optimise this in Gaussian at PM6 level&lt;br /&gt;
* Run the optimised CSY geometry through ante_RED to create the Gaussian input file involving minimisation at HF level and calculation of the MEP - also get the .P2N file required for RED III&lt;br /&gt;
* Run this Gaussian input file&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;Input file for optimisation:&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;#P hf/6-31G* Opt=(Tight,CalcFC) Freq SCF(Conver=8) Test&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
This will run a HF level optimisation and frequency analysis.  &#039;&#039;Opt=tight&#039;&#039; tightens the cutoffs on forces and step size that are used to determine convergence. An optimization with Opt=Tight will take several more steps than with the default cutoffs. For molecular systems with very small force constants (low frequency vibrational modes), this may be necessary to ensure adequate convergence and reliability of frequencies computed in a subsequent job step. This option can only be used with Berny optimizations. &#039;&#039;Opt = CalcFC&#039;&#039; causes the calculation of an analytical Hessian (in HF). &#039;&#039;SCF(Conver=8)&#039;&#039; sets the SCF convergence criterion to 10^-8.&lt;br /&gt;
&lt;br /&gt;
===Parameter files for use in Gaussian (Gaussian only uses amber94)===&lt;br /&gt;
&lt;br /&gt;
Using external parameter files. It is possible to include MM function entries within an external file by replacing the entries within the input file with the usual Gaussian include file mechanism: e.g., @oplsaa.prm. Parameter files for various force fields are included within the main Gaussian directory (i.e., $g09root/g09). If you want to specify the entire force field via an external file, use an input file like the following:&lt;br /&gt;
&lt;br /&gt;
# UFF=SoftOnly&lt;br /&gt;
&lt;br /&gt;
Read-in force field example&lt;br /&gt;
&lt;br /&gt;
molecule specification&lt;br /&gt;
&lt;br /&gt;
@$g09root/g09/oplsaa.prm&lt;br /&gt;
&lt;br /&gt;
Note that the choice of a molecular mechanics keyword is arbitrary as the entire force field depends on the read-in functions and parameters and not on any internal ones (in other words, specifying Amber would yield the same result as UFF).&lt;br /&gt;
&lt;br /&gt;
==Tues 16th Feb 2010==&lt;br /&gt;
===RESP Fitting===&lt;br /&gt;
&lt;br /&gt;
Following optimisation of the fragment for RESP fitting as seen before. We now have to fit the RESP to a MEP, this involves a few calculations:&lt;br /&gt;
&lt;br /&gt;
* ESP calcs in gaussian&lt;br /&gt;
* Fitting calcs by resp in ambertools&lt;br /&gt;
&lt;br /&gt;
The result is a .mol2 containing the fitted charges. The process has 2  input files:&lt;br /&gt;
&lt;br /&gt;
* .log&lt;br /&gt;
* .p2n&lt;br /&gt;
&lt;br /&gt;
The p2n is created by ante_RED and controls the RESP fitting process. (See RED site). To get a fit that excludes the capping methyls we add a REMARK line:&lt;br /&gt;
&lt;br /&gt;
 REMARK INTRA-MCC 0.0 |  35  36  37  38  39  40  41  42 | R&lt;br /&gt;
&lt;br /&gt;
This must be formatted EXACTLY as shown here (2 spaces between the numbers). The numbers correspond to atoms numbers, the 0.0 constrains these atoms to 0.0 charge and the R tells RESP to remove these atoms from the 2nd round fit and the subsequent mol2 output. Leaving a fragment.&lt;br /&gt;
&lt;br /&gt;
The RED program can be run on CX1! I used the following Jobscript (REDjobscript_4):&lt;br /&gt;
&lt;br /&gt;
 #PBS -l ncpus=1 &lt;br /&gt;
 #PBS -l mem=6000mb&lt;br /&gt;
 #PBS -l walltime=01:00:00&lt;br /&gt;
 #PBS -q pqchem&lt;br /&gt;
 #PBS -joe&lt;br /&gt;
 module load ambertools&lt;br /&gt;
 module load gaussian&lt;br /&gt;
 export GAUSS_SCRDIR=$TMPDIR&lt;br /&gt;
 echo $GAUSS_SCRDIR&lt;br /&gt;
 cd $HOME&lt;br /&gt;
 pwd&lt;br /&gt;
 perl /home/af706/RED-vIII.3.pl &amp;gt; /home/af706/RED-vIII.log&lt;br /&gt;
&lt;br /&gt;
So here we are using the chemistry node pqchem:&lt;br /&gt;
&lt;br /&gt;
* loading the ambertools (for RESP) and gaussian modules&lt;br /&gt;
* defining a scratch directory for the RESP program to read gaussian scratch&lt;br /&gt;
* TEMPDIR is used since the scratch must be empty upon calculation start. The temporary directory will be created fresh on each node as the calculation is started, and it&#039;s unlikely that two calculations will run on the same node.&lt;br /&gt;
* Moving to my HOME directory /home/af706 because the script must be run in the same directory as the input files  (.log and .p2n)&lt;br /&gt;
* RUN!&lt;br /&gt;
&lt;br /&gt;
The fitting process runs single point energy calcs and fits the RESP - overall it took 3mins for my fragment.&lt;br /&gt;
&lt;br /&gt;
===Going from xLeap out to Gaussian in===&lt;br /&gt;
&lt;br /&gt;
2 ways to do this:&lt;br /&gt;
&lt;br /&gt;
* Load in Gaussview&lt;br /&gt;
* Use Pengs scripts&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;Gaussview&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
* The file loads and all residues are recognised (apart from the chromophore). The file contains the same number of atoms as the saved Leap file.&lt;br /&gt;
* Charges are assigned from amber94 - not the chromophore still&lt;br /&gt;
* Amber atom types are assigned correctly to ALL atoms&lt;br /&gt;
* All that remains is to add charges from the RESP fit.&lt;br /&gt;
* PDB information is retained in input&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;TAO&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
* Atoms are added so atom count is different to the file leaving Leap&lt;br /&gt;
* Fails to treat terminal residues correctly&lt;br /&gt;
* Lose residue information&lt;br /&gt;
* Charges are missing from chromophore and atoms in terminal residues&lt;br /&gt;
&lt;br /&gt;
==22nd Feb 2010==&lt;br /&gt;
&lt;br /&gt;
Links to HOW TO for Ambertools + Gaussview/ Ambertools + TAO tools + Gaussview&lt;br /&gt;
&lt;br /&gt;
* Ambertools + Gaussview  - https://www.ch.ic.ac.uk/wiki/index.php/AndyForester_Ambertools_Gaussview&lt;br /&gt;
* Ambertools + TAO tools + Gaussview - https://www.ch.ic.ac.uk/wiki/index.php/AndyForester_Ambertools_TAO&lt;br /&gt;
&lt;br /&gt;
* Running Amber only Calculations - https://www.ch.imperial.ac.uk/wiki/index.php/AndyForesterAmber&lt;/div&gt;</summary>
		<author><name>Af706</name></author>
	</entry>
	<entry>
		<id>https://chemwiki.ch.ic.ac.uk/index.php?title=AndyForesterAmber&amp;diff=98544</id>
		<title>AndyForesterAmber</title>
		<link rel="alternate" type="text/html" href="https://chemwiki.ch.ic.ac.uk/index.php?title=AndyForesterAmber&amp;diff=98544"/>
		<updated>2010-02-25T13:06:08Z</updated>

		<summary type="html">&lt;p&gt;Af706: New page: =Amber Calculations on 1W7S=  Back to main - https://www.ch.ic.ac.uk/wiki/index.php/AndyForester&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;=Amber Calculations on 1W7S=&lt;br /&gt;
&lt;br /&gt;
Back to main - https://www.ch.ic.ac.uk/wiki/index.php/AndyForester&lt;/div&gt;</summary>
		<author><name>Af706</name></author>
	</entry>
	<entry>
		<id>https://chemwiki.ch.ic.ac.uk/index.php?title=AndyForester&amp;diff=97696</id>
		<title>AndyForester</title>
		<link rel="alternate" type="text/html" href="https://chemwiki.ch.ic.ac.uk/index.php?title=AndyForester&amp;diff=97696"/>
		<updated>2010-02-22T16:17:16Z</updated>

		<summary type="html">&lt;p&gt;Af706: /* 22nd Feb 2010 */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;=HBDI Calculations=&lt;br /&gt;
==Wed 21st Oct 2009==&lt;br /&gt;
&lt;br /&gt;
- Started project work&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
- Gaussview and Gaussian working&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
- Working out how to use the command inputs&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
- Trying to sort VPN out&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
- Working to get the project proposal written&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
&lt;br /&gt;
==Tue 27th Oct 2009==&lt;br /&gt;
&lt;br /&gt;
Drew up the HBDI molecule in Gaussview, the molecule will be optimised at the HF 3-21G level on the mac (locally). &amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
DFT B3LYP 6-31G (Becke threeparameter hybrid exchange functional5 and the Lee-Yang-Parr correlation functional) method optimisation was run on the resulting file, the MP2 6-31G method optimisation. Each of these had a corresponding freq calculation to check that minima were reached. All calculations were successful, with no negative vibrations.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;jmol&amp;gt;&lt;br /&gt;
  &amp;lt;jmolAppletButton&amp;gt;&lt;br /&gt;
    &amp;lt;uploadedFileContents&amp;gt;HBDI_MP2_6-31G_FREQ.mol&amp;lt;/uploadedFileContents&amp;gt;&lt;br /&gt;
    &amp;lt;text&amp;gt;HBDI optimised at the MP2 6-31G level&amp;lt;/text&amp;gt;&lt;br /&gt;
    &amp;lt;color&amp;gt;#FFFFFF&amp;lt;/color&amp;gt;&lt;br /&gt;
    &amp;lt;size&amp;gt;600&amp;lt;/size&amp;gt;&lt;br /&gt;
    &amp;lt;/jmolAppletButton&amp;gt;&lt;br /&gt;
&amp;lt;/jmol&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
&lt;br /&gt;
==Wed 28th Oct 2009==&lt;br /&gt;
&lt;br /&gt;
Ran DFT B3YLP 6-311G calculations on HBDI (Opt and Freq) since these were the calculations used in a published study of HBDI.&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
Optimisation job finshed successfully, the frequency calculation did not finish succesfully. Error was:&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
&lt;br /&gt;
 ****Warning!!: The largest alpha MO coefficient is  0.17588883D+02&lt;br /&gt;
 Symmetrizing basis deriv contribution to polar:&lt;br /&gt;
 IMax=3 JMax=2 DiffMx= 0.00D+00&lt;br /&gt;
 G2DrvN: will do    29 centers at a time, making    1 passes doing MaxLOS=1.&lt;br /&gt;
 Calling FoFCou, ICntrl=  3107 FMM=F I1Cent=   0 AccDes= 0.00D+00.&lt;br /&gt;
 FoFDir/FoFCou used for L=0 through L=1.&lt;br /&gt;
 End of G2Drv Frequency-dependent properties file   721 does not exist.&lt;br /&gt;
 End of G2Drv Frequency-dependent properties file   722 does not exist.&lt;br /&gt;
          IDoAtm=1111111111111111111111111111&lt;br /&gt;
          Differentiating once with respect to electric field.&lt;br /&gt;
                with respect to dipole field.&lt;br /&gt;
          Differentiating once with respect to nuclear coordinates.&lt;br /&gt;
          Keep R1 ints in memory in canonical form, NReq=447529180.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
==Thurs 29th Oct 2009==&lt;br /&gt;
&lt;br /&gt;
===Learning to use ONIOM===&lt;br /&gt;
&lt;br /&gt;
The example on chem wiki was attempted. Drawn up in Gaussview and optimised initially using molecular mechanics AMBER force field.&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
The ONIOM calculation was set up in Gaussview using DFT B3LYP 6-311G(d):AMBER methods with the benzene ring as the high level and chain at low level. Electronic embedding was NOT used in this initial calculation. &amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;jmol&amp;gt;&lt;br /&gt;
  &amp;lt;jmolAppletButton&amp;gt;&lt;br /&gt;
    &amp;lt;uploadedFileContents&amp;gt;ONIOM_B3LYP_6-311G_AMBER.mol&amp;lt;/uploadedFileContents&amp;gt;&lt;br /&gt;
    &amp;lt;text&amp;gt;Test Molecule optimised using B3LYP 6-311G(d):AMBER&amp;lt;/text&amp;gt;&lt;br /&gt;
    &amp;lt;color&amp;gt;#FFFFFF&amp;lt;/color&amp;gt;&lt;br /&gt;
    &amp;lt;size&amp;gt;600&amp;lt;/size&amp;gt;&lt;br /&gt;
  &amp;lt;/jmolAppletButton&amp;gt;&lt;br /&gt;
&amp;lt;/jmol&amp;gt;&lt;br /&gt;
&lt;br /&gt;
This conformation is NOT the lowest energy conformation, the initial AMBER MM calculation did not find the global minimum probably due to the drawn up starting geometry.&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
The molecule was drawn up again and DFT B3LYP 6-311G(d):AMBER and pure B3LYP 6-311G(d) calculations run. Absolute energy differences between the 2 conformations was very small at the ONIOM(DFT B3LYP 6-311G(d):AMBER) level. -232.27532671 vs -232.27315709 au respectively, a 0.00271au (4.359 744 17×10-18 J = 1au) difference.&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;jmol&amp;gt;&lt;br /&gt;
  &amp;lt;jmolAppletButton&amp;gt;&lt;br /&gt;
    &amp;lt;uploadedFileContents&amp;gt;ONIOM_B3LYP_6-311G_AMBER2.mol&amp;lt;/uploadedFileContents&amp;gt;&lt;br /&gt;
    &amp;lt;text&amp;gt;Test Molecule optimised using B3LYP 6-311G(d):AMBER&amp;lt;/text&amp;gt;&lt;br /&gt;
    &amp;lt;color&amp;gt;#FFFFFF&amp;lt;/color&amp;gt;&lt;br /&gt;
    &amp;lt;size&amp;gt;600&amp;lt;/size&amp;gt;&lt;br /&gt;
  &amp;lt;/jmolAppletButton&amp;gt;&lt;br /&gt;
&amp;lt;/jmol&amp;gt;&lt;br /&gt;
&lt;br /&gt;
It has already become obvious that using ONIOM decreases calculation time considerably. The table below shows a &amp;quot;CPU time&amp;quot; comparison on the above molecule using the different approaches.&lt;br /&gt;
&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot; border=&amp;quot;1&amp;quot;&lt;br /&gt;
| align=&amp;quot;center&amp;quot; style=&amp;quot;background:#f0f0f0;&amp;quot;|&#039;&#039;&#039;Calculation&#039;&#039;&#039;&lt;br /&gt;
| align=&amp;quot;center&amp;quot; style=&amp;quot;background:#f0f0f0;&amp;quot;|&#039;&#039;&#039;Time (1st attempt)&#039;&#039;&#039;&lt;br /&gt;
| align=&amp;quot;center&amp;quot; style=&amp;quot;background:#f0f0f0;&amp;quot;|&#039;&#039;&#039;Time (2nd attempt)&#039;&#039;&#039;&lt;br /&gt;
|-&lt;br /&gt;
|   ONIOM(DFT B3LYP 6-311G(d):AMBER)||4 minutes 22.0 seconds||8 minutes  5.5 seconds&lt;br /&gt;
|-&lt;br /&gt;
|   DFT 3LYP 6-311G(d)||2 hours 12 minutes 21.9 seconds||2 hours 30 minutes  0.8 seconds&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
&lt;br /&gt;
==Mon 2nd Nov 2009==&lt;br /&gt;
&lt;br /&gt;
Having obtained optimized structures of HBDI at various theory levels and basis set variations, these structures will now be compared to previous literature. Extensive studies have been performed using HF and B3LYP methods previously. It has been noted in Jaspers recent review that HF does not model frequencies very accurately - my computations agree HF 6-31G frequencies are very far out. B3LYP frequencies are much closer to the experimental values.&lt;br /&gt;
&lt;br /&gt;
==Tues 3rd Nov 2009==&lt;br /&gt;
&lt;br /&gt;
Tabulated frerquency comparison&lt;br /&gt;
&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot; border=&amp;quot;1&amp;quot;&lt;br /&gt;
| align=&amp;quot;center&amp;quot; style=&amp;quot;background:#f0f0f0;&amp;quot;|&lt;br /&gt;
| align=&amp;quot;center&amp;quot; style=&amp;quot;background:#f0f0f0;&amp;quot;|&#039;&#039;&#039;A.Esposito HF 6-31G(d)&#039;&#039;&#039;&lt;br /&gt;
| align=&amp;quot;center&amp;quot; style=&amp;quot;background:#f0f0f0;&amp;quot;|&#039;&#039;&#039;A.Esposito B3LYP 6-31G(d)&#039;&#039;&#039;&lt;br /&gt;
| align=&amp;quot;center&amp;quot; style=&amp;quot;background:#f0f0f0;&amp;quot;|&#039;&#039;&#039;X.He B3LYP 6-31G(d)&#039;&#039;&#039;&lt;br /&gt;
| align=&amp;quot;center&amp;quot; style=&amp;quot;background:#f0f0f0;&amp;quot;|&#039;&#039;&#039;HF 6-31G(d)&#039;&#039;&#039;&lt;br /&gt;
| align=&amp;quot;center&amp;quot; style=&amp;quot;background:#f0f0f0;&amp;quot;|&#039;&#039;&#039;B3LYP 6-31G(d)&#039;&#039;&#039;&lt;br /&gt;
|-&lt;br /&gt;
| C=O||1987||1802||1802||1986||1801&lt;br /&gt;
|-&lt;br /&gt;
| C=C||1880||1705||1704||1880||1704&lt;br /&gt;
|-&lt;br /&gt;
| Phenol||1809||1665||1663||1812||1666&lt;br /&gt;
|-&lt;br /&gt;
| Phenol||1781||1635||1631||1777||1635&lt;br /&gt;
|-&lt;br /&gt;
| C=N||1844||1620||1618||1844||1618&lt;br /&gt;
|-&lt;br /&gt;
| Phenol||1696||1562||1558||1694||1562&lt;br /&gt;
|-&lt;br /&gt;
| C-N||1607||1483||N/A||1606||1482&lt;br /&gt;
|-&lt;br /&gt;
| Phenol||1602||1487||N/A||1603||1488&lt;br /&gt;
|-&lt;br /&gt;
| Total Energy(au)||||||||-720.043||-724.465&lt;br /&gt;
|-&lt;br /&gt;
| &lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
==Thurs 5th Nov 2009==&lt;br /&gt;
&lt;br /&gt;
Having got Jaspers input files, several things were noticed:&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
* Solvent models were used - should be in vacum.... I have re run these in G09 with No solvent on both DFT 6-311G(d,p) and HF 6-31+G(d) methods/basis sets for comparison&lt;br /&gt;
* The models in his Biophys paper using other residues in contact COO- for example are not present. He has not reported the pure HBDI calculation results.&lt;br /&gt;
* More than just HBDI is likely to be required, meaning larger than expected high level region in the GFP calculation. (3 layer ONIOM DFT:HF:AMBER)?&lt;br /&gt;
&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
Other things to do:&lt;br /&gt;
&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
* Work on a guide to loading .pdb files into Gaussview 5 - how much information is maintained?&lt;br /&gt;
* Are residues defined still in Gaussview input files&lt;br /&gt;
* AMBER MM requires definitions of charges etc... how easy is it to import from .pdb and get a file that can be run in opt etc...&lt;br /&gt;
&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
* Find a model system which is larger than HBDI but incorporates this molecule &amp;lt;=500 atoms (Limit of size for pure DFT calculation)&lt;br /&gt;
* Possibilities include a system alread developed by someone else online&lt;br /&gt;
* A &amp;quot;custom&amp;quot; system made by me to model this (time intensive)&lt;br /&gt;
* &amp;quot;strip&amp;quot; back the GFP pdb file to 500 atoms surrounding the HBDI chromophore.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
==Fri 6th Nov 2009==&lt;br /&gt;
&lt;br /&gt;
Yesterday started by trying to follow the tutorial on loading the GFP molecule from .pdb, this resulted in a problem with Gaussviews addition of hydrogens. It over protonated the nitrogens and carbon in the chromophore leading to over valent parts. I then tried several other molecules (PYP, a scorpion toxin and a strand of RNA) which had no problems with hydrogen addition.&lt;br /&gt;
&lt;br /&gt;
I tried to use ChemBio 3D to load the .pdb and correctly assign hydrogens/connectivity. This resulted in the same protonation problems!&lt;br /&gt;
&lt;br /&gt;
==Mon 9th Nov 2009==&lt;br /&gt;
&lt;br /&gt;
TINKER - http://dasher.wustl.edu/tinker/ was installed with the Force Field Explorer (FFE) interface on the mac. This was used to load the same .pdb file and resulted in the same protonation around the chromophore. So far the SAME protonation pattern (incorrect) around the chromophore has been found using:&lt;br /&gt;
&lt;br /&gt;
*Gaussview&lt;br /&gt;
*ChemBio 3D&lt;br /&gt;
*TINKER&lt;br /&gt;
&lt;br /&gt;
Looking into how the .pdb file format works, the molecule is built from a series of residues, it becomes clear as you look at the sequence that is building the GFP protein chromophore (residues 65,66,67 in 1GFL) are not then reproducing the internal cyclisation properly leaving protons attached and thus atoms hypervalent. This accounts for the consistent error loading from the .pdb. As a result must all molecules expressed as a .pdb file must be pure protein structures...?&lt;br /&gt;
&lt;br /&gt;
===Comparison of the 2 available wild type GFP structures===&lt;br /&gt;
&lt;br /&gt;
The 2 structures being considered here are:&lt;br /&gt;
&lt;br /&gt;
*1GFL - Yang et al. -http://www.pdb.org/pdb/explore/explore.do?structureId=1GFL&lt;br /&gt;
*1W7S - Jaspers stucture - http://www.pdb.org/pdb/explore/explore.do?structureId=1W7S&lt;br /&gt;
&lt;br /&gt;
These 2 structures differ in that 1GFL contains 2 GFP molecules, whereas 1W7S contains 4. These were both stripped down to 1 unit of GFP, using the &amp;quot;pdb secondary structure&amp;quot; function in Gaussview 5. 1GFL had chain B removed and 1W7S had chains B,C and D removed. Next the residues were investigated that created the chromophore. The results are not consistent because GFL uses 3 residues (65,66,67) to form the chromophore. 1W7S has residue 66 (labelled CSY) as the entire chromophore! Both get protonated the same (incorrectly) which is odd considering how they molecules are built.&lt;br /&gt;
&lt;br /&gt;
The next test was to &amp;quot;modify bond&amp;quot; in Gaussview 5 and create a &amp;quot;cleaned&amp;quot; chromophore structure (no bonding information from the residue) by manually changing the bonds. The chromophore in each .pdb file was altered to look like HBDI in terms of double single bonds and the hydrogens taken out (makes no difference as these are created each time the .pdb is loaded). When the new .pdb files (1GFL_cleaned.pdb and 1W7S_cleaned.pdb) were loaded they were correctly protonated! Success.&lt;br /&gt;
&lt;br /&gt;
===1W7S improvement===&lt;br /&gt;
&lt;br /&gt;
Chain A (not cleaned) of the 1W7S .pdb was submitted to run a AMBER calculation, as expected it failed with the lines:&lt;br /&gt;
&lt;br /&gt;
 Missing atomic parameters for atom   474 IAtTyp=    20000000&lt;br /&gt;
 Missing atomic parameters.&lt;br /&gt;
&lt;br /&gt;
This error says that the atom type of atom 474 is not defined. One way to counter this is to locate the atom and then change its definition manually (as shown in the example https://www.ch.ic.ac.uk/wiki/index.php/PYP_:_Photoactive_yellow_protein - time intensive!)&lt;br /&gt;
&lt;br /&gt;
Instead the molecule was cleaned (manually changing the bonding in the chromophore), after the bond updates the structure was re submitted, again failing but with a different message:&lt;br /&gt;
&lt;br /&gt;
 Include all MM classes&lt;br /&gt;
  Bondstretch undefined between atoms    473    478 CC-N* [H,H]  &lt;br /&gt;
  Bondstretch undefined between atoms    475    476 CC-CC [H,H]  &lt;br /&gt;
  Bondstretch undefined between atoms    475    486 CC-CM [H,H]  &lt;br /&gt;
  Bondstretch undefined between atoms    476    477 CC-O [H,H]  &lt;br /&gt;
  Bondstretch undefined between atoms    476    478 CC-N* [H,H]  &lt;br /&gt;
  Bondstretch undefined between atoms    486   2559 CM-HC [H,H]  &lt;br /&gt;
  Angle bend  undefined between atoms    473    474    475 CC-NB-CC [H,H,H]  &lt;br /&gt;
  Angle bend  undefined between atoms    473    478    479 CC-N*-CT [H,H,H]  &lt;br /&gt;
  Angle bend  undefined between atoms    473    478    476 CC-N*-CC [H,H,H]  &lt;br /&gt;
  Angle bend  undefined between atoms    473    482    483 CC-CT-N [H,H,H]  &lt;br /&gt;
  Angle bend  undefined between atoms    473    482   3857 CC-CT-H1 [H,H,H]  &lt;br /&gt;
  Angle bend  undefined between atoms    474    475    476 NB-CC-CC [H,H,H]  &lt;br /&gt;
  Angle bend  undefined between atoms    474    475    486 NB-CC-CM [H,H,H]  &lt;br /&gt;
  Angle bend  undefined between atoms    474    473    478 NB-CC-N* [H,H,H]  &lt;br /&gt;
  Angle bend  undefined between atoms    475    476    477 CC-CC-O [H,H,H]  &lt;br /&gt;
  Angle bend  undefined between atoms    475    476    478 CC-CC-N* [H,H,H]  &lt;br /&gt;
  Angle bend  undefined between atoms    475    486    487 CC-CM-CA [H,H,H]  &lt;br /&gt;
  Angle bend  undefined between atoms    475    486   2559 CC-CM-HC [H,H,H]  &lt;br /&gt;
  Angle bend  undefined between atoms    476    478    479 CC-N*-CT [H,H,H]  &lt;br /&gt;
  Angle bend  undefined between atoms    476    475    486 CC-CC-CM [H,H,H]  &lt;br /&gt;
  Angle bend  undefined between atoms    477    476    478 O-CC-N* [H,H,H]  &lt;br /&gt;
  Angle bend  undefined between atoms    478    479    480 N*-CT-C [H,H,H]  &lt;br /&gt;
  Angle bend  undefined between atoms    478    473    482 N*-CC-CT [H,H,H]  &lt;br /&gt;
  Angle bend  undefined between atoms    486    487    488 CM-CA-CA [H,H,H]  &lt;br /&gt;
  Angle bend  undefined between atoms    486    487    489 CM-CA-CM [H,H,H]  &lt;br /&gt;
  Angle bend  undefined between atoms    487    486   2559 CA-CM-HC [H,H,H]  &lt;br /&gt;
  Angle bend  undefined between atoms    488    487    489 CA-CA-CM [H,H,H]  &lt;br /&gt;
  Angle bend  undefined between atoms    490    492    491 CA-C-CM [H,H,H]  &lt;br /&gt;
  Angle bend  undefined between atoms    491    492    493 CM-C-OH [H,H,H]  &lt;br /&gt;
  MM function not complete&lt;br /&gt;
&lt;br /&gt;
This is a better error message since all the atoms are now defined, only parameters are missing. For parameters that involve only atoms in the chromophore at the high level in the ONIOM calculation, the values can be set to 0 in the input file. This is because the ONIOM extrapolation will cancel MM from this high level region, To do this the keyword &amp;quot;Amber=Hardfirst&amp;quot; and wildcards for the missing parameters which are zero valued are placed in the input file.&lt;br /&gt;
&lt;br /&gt;
[[Image:Atoms involving undefined parameters.png]]&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
&lt;br /&gt;
The above screeshot shows highlighted the atoms listed in the error... most of them if not all will be involved in the high level calculation and thus may not require parameters defining (other than to be set to 0). The reason the parameters are undefined is that the AMBER force field is very specific to protein bonds and thus the organic chromophore has bonds that are not found in this parameter set.&lt;br /&gt;
&lt;br /&gt;
The atoms in the chromophore were frozen in the atom group editor and the input file had a parameter set added to the end setting chromophore parameters to 0:&lt;br /&gt;
&lt;br /&gt;
 HrmStr1 CC N* 0.0 0.0&lt;br /&gt;
 HrmStr1 CC CC 0.0 0.0&lt;br /&gt;
 HrmStr1 CC CM 0.0 0.0&lt;br /&gt;
 HrmStr1 CC O 0.0 0.0&lt;br /&gt;
 HrmStr1 CM HC 0.0 0.0&lt;br /&gt;
 HrmBnd1 CC NB CC 0.0 0.0&lt;br /&gt;
 HrmBnd1 CC N* CC 0.0 0.0&lt;br /&gt;
 HrmBnd1 CC CT N 0.0 0.0&lt;br /&gt;
 HrmBnd1 CC CT H1 0.0 0.0&lt;br /&gt;
 HrmBnd1 NB CC CC 0.0 0.0&lt;br /&gt;
 HrmBnd1 NB CC CM 0.0 0.0&lt;br /&gt;
 HrmBnd1 NB CC N* 0.0 0.0&lt;br /&gt;
 HrmBnd1 CC CC O  0.0 0.0&lt;br /&gt;
 HrmBnd1 CC CC N* 0.0 0.0&lt;br /&gt;
 HrmBnd1 CC CM CA 0.0 0.0&lt;br /&gt;
 HrmBnd1 CC CM HC 0.0 0.0&lt;br /&gt;
 HrmBnd1 CC CC CM 0.0 0.0&lt;br /&gt;
 HrmBnd1 O CC N*  0.0 0.0&lt;br /&gt;
 HrmBnd1 N* CT C  0.0 0.0&lt;br /&gt;
 HrmBnd1 N* CC CT 0.0 0.0&lt;br /&gt;
 HrmBnd1 CM CA CA 0.0 0.0&lt;br /&gt;
 HrmBnd1 CA CM HC 0.0 0.0&lt;br /&gt;
 HrmBnd1 CA C CM 0.0 0.0&lt;br /&gt;
 HrmBnd1 CM C OH 0.0 0.0&lt;br /&gt;
 HrmBnd1 CC N* CT 0.0 0.0&lt;br /&gt;
 HrmBnd1 CM CA CM 0.0 0.0&lt;br /&gt;
&lt;br /&gt;
This file then ran successfully for an AMBER single point energy calculation. It was then submitted for AMBER optimisation using &amp;quot;amber=hardfirst geom=connectivity&amp;quot; and returned a successful minimum with Forces and Displacements converged:&lt;br /&gt;
&lt;br /&gt;
         Item               Value     Threshold  Converged?&lt;br /&gt;
  Maximum Force            0.000089     0.000450     YES&lt;br /&gt;
  RMS     Force            0.000004     0.000300     YES&lt;br /&gt;
  Maximum Displacement     0.000089     0.001800     YES&lt;br /&gt;
  RMS     Displacement     0.000004     0.001200     YES&lt;br /&gt;
  Predicted change in Energy=-1.156135D-07&lt;br /&gt;
  Optimization completed.&lt;br /&gt;
     -- Stationary point found.&lt;br /&gt;
&lt;br /&gt;
===Cleaned .pdb files and TINKER===&lt;br /&gt;
&lt;br /&gt;
*The files that had been cleaned were opened in TINKER, it is necessary to convert the .pdb to a .xyz file to run any calculations within TINKER. This is currently failing due to an &amp;quot;atom missing on SER-228&amp;quot; - checking this there is no SER residue at 228 (in fact a GLU) so a bit puzzled. This is the 1W7S cleaned version.&lt;br /&gt;
&lt;br /&gt;
*PYP was loaded in its .pdb format to TINKER and was immediately successful in being converted to .xyz then a MINIMISE algorithm ran successfully with the energy at its X-ray crystal structure as 261KJ/mol reduced to -3670.5355KJ/mol in 1645 iterations - this took a while (45mins locally)!&lt;br /&gt;
&lt;br /&gt;
*Interestingly the .xyz output from the minimization can be converted back to .pdb (loses hydrogens minimization) and this PYP file was loaded into Gaussview 5 - the file then ran successfully for an AMBER single point energy calculation! cf. the first attempt to get PYP to run was unsuccessful, and the tutorial underwent much more hassle to run this calculation.&lt;br /&gt;
&lt;br /&gt;
===What .pdb information is retained?===&lt;br /&gt;
&lt;br /&gt;
The only molecule that has been run with AMBER opt (GFP) has lost its secondary structure information but RETAINED its residue information.&lt;br /&gt;
&lt;br /&gt;
==Fri 13th Nov 2009==&lt;br /&gt;
&lt;br /&gt;
===Conversion from.xyz to .com===&lt;br /&gt;
A protein with 2 residues (Glu-Asp) was built using TINKER, saved as:&lt;br /&gt;
&lt;br /&gt;
*.xyz&lt;br /&gt;
*.pdb&lt;br /&gt;
*.com&lt;br /&gt;
&lt;br /&gt;
The initial format was .xyz (created in TINEKR as a native format), converted to .pdb using TINKER and then loaded into Gaussview and saved as a .com.&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
&lt;br /&gt;
.xyz&lt;br /&gt;
&lt;br /&gt;
    36  Protein built with TINKER/Force Field Explorer&lt;br /&gt;
     1  CT     0.000000    0.000000    0.000000   340     2     4     5     6&lt;br /&gt;
     2  C      0.000000    0.000000    1.510000   342     1     3     7&lt;br /&gt;
     3  O      1.028938    0.000000    2.165506   343     2&lt;br /&gt;
     4  HC     1.056270    0.000000   -0.341166   341     1&lt;br /&gt;
     5  HC    -0.497647    0.931694   -0.341166   341     1&lt;br /&gt;
     6  HC    -0.497647   -0.931694   -0.341166   341     1&lt;br /&gt;
     7  N     -1.236201    0.000000    2.027114   232     2     8    11&lt;br /&gt;
     8  CT    -1.446961    0.000000    3.471822   233     7     9    12    13&lt;br /&gt;
     9  C     -2.509557   -0.992751    3.878558   234     8    10    22&lt;br /&gt;
    10  O     -3.569672   -1.098634    3.284129   236     9&lt;br /&gt;
    11  H     -2.025518    0.000000    1.381066   235     7&lt;br /&gt;
    12  H1    -0.503221   -0.281395    3.983946   237     8&lt;br /&gt;
    13  CT    -1.910491    1.397334    3.923704   238     8    14    18    19&lt;br /&gt;
    14  CT    -2.132799    1.397334    5.447574   240    13    15    20    21&lt;br /&gt;
    15  C     -2.585014    2.781238    5.848089   242    14    16    17&lt;br /&gt;
    16  O2    -2.816736    2.973609    7.061266   243    15&lt;br /&gt;
    17  O2    -2.693194    3.629064    4.935955   243    15&lt;br /&gt;
    18  HC    -2.861794    1.653482    3.412322   239    13&lt;br /&gt;
    19  HC    -1.132567    2.145528    3.664589   239    13&lt;br /&gt;
    20  HC    -1.170306    1.191123    5.960588   241    14&lt;br /&gt;
    21  HC    -2.946052    0.685840    5.701535   241    14&lt;br /&gt;
    22  N     -2.153997   -1.723291    4.944151   210     9    23    26&lt;br /&gt;
    23  CT    -3.047986   -2.739703    5.491225   211    22    24    27    28&lt;br /&gt;
    24  C     -3.120370   -2.663723    6.997574   212    23    25    34&lt;br /&gt;
    25  O     -2.128241   -2.535414    7.695870   214    24&lt;br /&gt;
    26  H     -1.239994   -1.547921    5.361576   213    22&lt;br /&gt;
    27  H1    -4.072297   -2.585746    5.092246   215    23&lt;br /&gt;
    28  CT    -2.526745   -4.139104    5.114946   216    23    29    32    33&lt;br /&gt;
    29  C     -3.476741   -5.157722    5.698062   218    28    30    31&lt;br /&gt;
    30  O2    -3.212872   -6.362275    5.493304   219    29&lt;br /&gt;
    31  O2    -4.454666   -4.718803    6.341112   219    29&lt;br /&gt;
    32  HC    -1.530243   -4.298013    5.577366   217    28&lt;br /&gt;
    33  HC    -2.540086   -4.252948    4.010880   217    28&lt;br /&gt;
    34  N     -4.374888   -2.752228    7.460123   344    24    35    36&lt;br /&gt;
    35  H     -5.140865   -2.858197    6.794954   345    34&lt;br /&gt;
    36  H     -4.534830   -2.711583    8.466684   345    34&lt;br /&gt;
&lt;br /&gt;
.pdb&lt;br /&gt;
&lt;br /&gt;
 HEADER    Protein built with TINKER/Force Field Explorer&lt;br /&gt;
 COMPND&lt;br /&gt;
 SOURCE&lt;br /&gt;
 ATOM      1  CH3 ACE     1       0.000   0.000   0.000&lt;br /&gt;
 ATOM      2  C   ACE     1       0.000   0.000   1.510&lt;br /&gt;
 ATOM      3  O   ACE     1       1.029   0.000   2.166&lt;br /&gt;
 ATOM      4  H1  ACE     1       1.056   0.000  -0.341&lt;br /&gt;
 ATOM      5  H2  ACE     1      -0.498   0.932  -0.341&lt;br /&gt;
 ATOM      6  H3  ACE     1      -0.498  -0.932  -0.341&lt;br /&gt;
 ATOM      7  N   GLU     2      -1.236   0.000   2.027&lt;br /&gt;
 ATOM      8  CA  GLU     2      -1.447   0.000   3.472&lt;br /&gt;
 ATOM      9  C   GLU     2      -2.510  -0.993   3.879&lt;br /&gt;
 ATOM     10  O   GLU     2      -3.570  -1.099   3.284&lt;br /&gt;
 ATOM     11  CB  GLU     2      -1.910   1.397   3.924&lt;br /&gt;
 ATOM     12  CG  GLU     2      -2.133   1.397   5.448&lt;br /&gt;
 ATOM     13  CD  GLU     2      -2.585   2.781   5.848&lt;br /&gt;
 ATOM     14  OE1 GLU     2      -2.817   2.974   7.061&lt;br /&gt;
 ATOM     15  OE2 GLU     2      -2.693   3.629   4.936&lt;br /&gt;
 ATOM     16  H   GLU     2      -2.026   0.000   1.381&lt;br /&gt;
 ATOM     17  HA  GLU     2      -0.503  -0.281   3.984&lt;br /&gt;
 ATOM     18  HB2 GLU     2      -2.862   1.653   3.412&lt;br /&gt;
 ATOM     19  HB3 GLU     2      -1.133   2.146   3.665&lt;br /&gt;
 ATOM     20  HG2 GLU     2      -1.170   1.191   5.961&lt;br /&gt;
 ATOM     21  HG3 GLU     2      -2.946   0.686   5.702&lt;br /&gt;
 ATOM     22  N   ASP     3      -2.154  -1.723   4.944&lt;br /&gt;
 ATOM     23  CA  ASP     3      -3.048  -2.740   5.491&lt;br /&gt;
 ATOM     24  C   ASP     3      -3.120  -2.664   6.998&lt;br /&gt;
 ATOM     25  O   ASP     3      -2.128  -2.535   7.696&lt;br /&gt;
 ATOM     26  CB  ASP     3      -2.527  -4.139   5.115&lt;br /&gt;
 ATOM     27  CG  ASP     3      -3.477  -5.158   5.698&lt;br /&gt;
 ATOM     28  OD1 ASP     3      -3.213  -6.362   5.493&lt;br /&gt;
 ATOM     29  OD2 ASP     3      -4.455  -4.719   6.341&lt;br /&gt;
 ATOM     30  H   ASP     3      -1.240  -1.548   5.362&lt;br /&gt;
 ATOM     31  HA  ASP     3      -4.072  -2.586   5.092&lt;br /&gt;
 ATOM     32  HB2 ASP     3      -1.530  -4.298   5.577&lt;br /&gt;
 ATOM     33  HB3 ASP     3      -2.540  -4.253   4.011&lt;br /&gt;
 ATOM     34  N   NH2     4      -4.375  -2.752   7.460&lt;br /&gt;
 ATOM     35  H1  NH2     4      -5.141  -2.858   6.795&lt;br /&gt;
 ATOM     36  H2  NH2     4      -4.535  -2.712   8.467&lt;br /&gt;
 END&lt;br /&gt;
&lt;br /&gt;
.com&lt;br /&gt;
&lt;br /&gt;
 %chk=File_Test_Glu-Asp.chk&lt;br /&gt;
 # hf/3-21g geom=connectivity&lt;br /&gt;
 &lt;br /&gt;
 Protein built with TINKER/Force Field Explorer&lt;br /&gt;
 &lt;br /&gt;
 0 1&lt;br /&gt;
 C(PDBName=CH3,ResName=ACE,ResNum=1)                   0.00000000    0.00000000    0.00000000&lt;br /&gt;
 C(PDBName=C,ResName=ACE,ResNum=1)                     0.00000000    0.00000000    1.51000000&lt;br /&gt;
 O(PDBName=O,ResName=ACE,ResNum=1)                     1.02900000    0.00000000    2.16600000&lt;br /&gt;
 H(PDBName=H1,ResName=ACE,ResNum=1)                    1.05600000    0.00000000   -0.34100000&lt;br /&gt;
 H(PDBName=H2,ResName=ACE,ResNum=1)                   -0.49800000    0.93200000   -0.34100000&lt;br /&gt;
 H(PDBName=H3,ResName=ACE,ResNum=1)                   -0.49800000   -0.93200000   -0.34100000&lt;br /&gt;
 N(PDBName=N,ResName=GLU,ResNum=2)                    -1.23600000    0.00000000    2.02700000&lt;br /&gt;
 C(PDBName=CA,ResName=GLU,ResNum=2)                   -1.44700000    0.00000000    3.47200000&lt;br /&gt;
 C(PDBName=C,ResName=GLU,ResNum=2)                    -2.51000000   -0.99300000    3.87900000&lt;br /&gt;
 O(PDBName=O,ResName=GLU,ResNum=2)                    -3.57000000   -1.09900000    3.28400000&lt;br /&gt;
 C(PDBName=CB,ResName=GLU,ResNum=2)                   -1.91000000    1.39700000    3.92400000&lt;br /&gt;
 C(PDBName=CG,ResName=GLU,ResNum=2)                   -2.13300000    1.39700000    5.44800000&lt;br /&gt;
 C(PDBName=CD,ResName=GLU,ResNum=2)                   -2.58500000    2.78100000    5.84800000&lt;br /&gt;
 O(PDBName=OE1,ResName=GLU,ResNum=2)                  -2.81700000    2.97400000    7.06100000&lt;br /&gt;
 O(PDBName=OE2,ResName=GLU,ResNum=2)                  -2.69300000    3.62900000    4.93600000&lt;br /&gt;
 H(PDBName=H,ResName=GLU,ResNum=2)                    -2.02600000    0.00000000    1.38100000&lt;br /&gt;
 H(PDBName=HA,ResName=GLU,ResNum=2)                   -0.50300000   -0.28100000    3.98400000&lt;br /&gt;
 H(PDBName=HB2,ResName=GLU,ResNum=2)                  -2.86200000    1.65300000    3.41200000&lt;br /&gt;
 H(PDBName=HB3,ResName=GLU,ResNum=2)                  -1.13300000    2.14600000    3.66500000&lt;br /&gt;
 H(PDBName=HG2,ResName=GLU,ResNum=2)                  -1.17000000    1.19100000    5.96100000&lt;br /&gt;
 H(PDBName=HG3,ResName=GLU,ResNum=2)                  -2.94600000    0.68600000    5.70200000&lt;br /&gt;
 N(PDBName=N,ResName=ASP,ResNum=3)                    -2.15400000   -1.72300000    4.94400000&lt;br /&gt;
 C(PDBName=CA,ResName=ASP,ResNum=3)                   -3.04800000   -2.74000000    5.49100000&lt;br /&gt;
 C(PDBName=C,ResName=ASP,ResNum=3)                    -3.12000000   -2.66400000    6.99800000&lt;br /&gt;
 O(PDBName=O,ResName=ASP,ResNum=3)                    -2.12800000   -2.53500000    7.69600000&lt;br /&gt;
 C(PDBName=CB,ResName=ASP,ResNum=3)                   -2.52700000   -4.13900000    5.11500000&lt;br /&gt;
 C(PDBName=CG,ResName=ASP,ResNum=3)                   -3.47700000   -5.15800000    5.69800000&lt;br /&gt;
 O(PDBName=OD1,ResName=ASP,ResNum=3)                  -3.21300000   -6.36200000    5.49300000&lt;br /&gt;
 O(PDBName=OD2,ResName=ASP,ResNum=3)                  -4.45500000   -4.71900000    6.34100000&lt;br /&gt;
 H(PDBName=H,ResName=ASP,ResNum=3)                    -1.24000000   -1.54800000    5.36200000&lt;br /&gt;
 H(PDBName=HA,ResName=ASP,ResNum=3)                   -4.07200000   -2.58600000    5.09200000&lt;br /&gt;
 H(PDBName=HB2,ResName=ASP,ResNum=3)                  -1.53000000   -4.29800000    5.57700000&lt;br /&gt;
 H(PDBName=HB3,ResName=ASP,ResNum=3)                  -2.54000000   -4.25300000    4.01100000&lt;br /&gt;
 N(PDBName=N,ResName=NH2,ResNum=4)                    -4.37500000   -2.75200000    7.46000000&lt;br /&gt;
 H(PDBName=H1,ResName=NH2,ResNum=4)                   -5.14100000   -2.85800000    6.79500000&lt;br /&gt;
 H(PDBName=H2,ResName=NH2,ResNum=4)                   -4.53500000   -2.71200000    8.46700000&lt;br /&gt;
 &lt;br /&gt;
 1 2 1.0 4 1.0 5 1.0 6 1.0&lt;br /&gt;
 2 3 2.0 7 1.5&lt;br /&gt;
 3&lt;br /&gt;
 4&lt;br /&gt;
 5&lt;br /&gt;
 6&lt;br /&gt;
 7 16 1.0 8 1.0&lt;br /&gt;
 8 9 1.0 11 1.0 17 1.0&lt;br /&gt;
 9 10 2.0 22 1.5&lt;br /&gt;
 10&lt;br /&gt;
 11 18 1.0 12 1.0 19 1.0&lt;br /&gt;
 12 21 1.0 13 1.0 20 1.0&lt;br /&gt;
 13 14 2.0 15 2.0&lt;br /&gt;
 14&lt;br /&gt;
 15&lt;br /&gt;
 16&lt;br /&gt;
 17&lt;br /&gt;
 18&lt;br /&gt;
 19&lt;br /&gt;
 20&lt;br /&gt;
 21&lt;br /&gt;
 22 23 1.0 30 1.0&lt;br /&gt;
 23 26 1.0 24 1.0 31 1.0&lt;br /&gt;
 24 34 1.5 25 2.0&lt;br /&gt;
 25&lt;br /&gt;
 26 27 1.0 33 1.0 32 1.0&lt;br /&gt;
 27 28 2.0 29 2.0&lt;br /&gt;
 28&lt;br /&gt;
 29&lt;br /&gt;
 30&lt;br /&gt;
 31&lt;br /&gt;
 32&lt;br /&gt;
 33&lt;br /&gt;
 34 35 1.0 36 1.0&lt;br /&gt;
 35&lt;br /&gt;
 36&lt;br /&gt;
&lt;br /&gt;
The issue going from .xyz straight to .com is that ALL protein information would be lost ie. residue information that is written into the .pdb at .xyz to .pdb comversion.&lt;br /&gt;
&lt;br /&gt;
===PYP files as downloaded compared to run through TINKER===&lt;br /&gt;
&lt;br /&gt;
*TINKER removes much of the header information&lt;br /&gt;
*TINKER only preserves the 1st model of a .pdb file (may be multiple models)&lt;br /&gt;
*TINKER removes far right 3 columns of information (not essential to the .pdb file)&lt;br /&gt;
*A virtual residue in the original .pdb comprising the chromophore was omitted from the TINKER output (number 126)&lt;br /&gt;
&lt;br /&gt;
===What each TINKER search algorithm does===&lt;br /&gt;
&lt;br /&gt;
There are several programs within TINKER that may be able to help locate a global minimum (not just one near the published crystal structure) by way of searching the potential energy surface. All of these are MM methods for use in TINKER.&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
&#039;&#039;&#039;SCAN&#039;&#039;&#039;&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
A program for general conformational search of an entire potential energy surface via a basin hopping method. The program takes as input a TINKER .xyz coordinates file which is then minimized to find the first local minimum for a search list. A series of activations along various normal modes from this initial minimum are used as seed points for additional minimizations. Whenever a previously unknown local minimum is located it is added to the search list. When all minima on the search list have been subjected to the normal mode activation without locating additional new minima, the program terminates. The individual local minima are written to cycle files as they are discovered. While the SCAN program can be used on standard undeformed potential energy surfaces, we have found it to be most useful for quickly ‘‘scanning’’ a smoothed energy surface to enumerate the major basins of attraction spaning the entire surface.&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;SNIFFER&#039;&#039;&#039;&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
A program that implements the Sniffer global optimization algorithm of Butler and Slaminka, a discrete version of Griewank’s global search trajectory method. The program takes an input TINKER .xyz coordinates file and shakes it vigorously via a modified dynamics trajectory before, hopefully, settling into a low lying minimum. Some trial and error is often required as the current implementation is sensitive to various parameters and tolerances that govern the computation. At present, these parameters are not user accessible, and must be altered in the source code. However, this method can do a good job of quickly optimizing conformation within a limited range of convergence.&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;PSS&#039;&#039;&#039;&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
Implements our version of a potential smoothing and search algorithm for the global optimization of molecular conformation. An initial structure in .xyz format is first minimized in Cartesian coordinates on a series of increasingly smoothed potential energy surfaces. Then the smoothing procedure is reversed with minimization on each successive surface starting from the coordinates of the minimum on the previous surface. A local search procedure is used during the backtracking to explore for alternative minima better than the one found during the current minimization. The final result is usually a very low energy conformation or, in favorable cases, the global energy minimum conformation. The minimum energy coordinate sets found on each surface during both the forward smoothing and backtracking procedures are placed in sequentially numbered cycle files.&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
&lt;br /&gt;
TINKER also contains programs that are able to perform MM optimisations:&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
&#039;&#039;&#039;MINIMIZE&#039;&#039;&#039;&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;The MINIMIZE program performs a limited memory L-BFGS minimization of an input structure over Cartesian coordinates using a modified version of the algorithm of Jorge Nocedal. The method requires only the potential energy and gradient at each step along the minimization pathway. It requires storage space proportional to the number of atoms in the structure. The MINIMIZE procedure is recommended for preliminary minimization of trial structures to an rms gradient of 1.0 to 0.1 kcal/mole/Å. It has a relatively fast cycle time and is tolerant of poor initial structures, but converges in a slow, linear fashion near the minimum. The user supplies the name of the TINKER .xyz coordinates file and a target rms gradient value at which the minimization will terminate. Output consists of minimization statistics written to the screen or redirected to an output file, and the new coordinates written to updated .xyz files or to cycle files.&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;OPTIMIZE&#039;&#039;&#039;&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;The OPTIMIZE program performs a optimally conditioned variable metric minimization of an input structure over Cartesian coordinates using an algorithm due to William Davidon. The method does not perform line searches, but requires computation of energies and gradients as well as storage for an estimate of the inverse Hessian matrix. The program operates on Cartesian coordinates from a TINKER .xyz file. OPTIMIZE will typically converge somewhat faster and more completely than MINIMIZE. However, the need to store and manipulate a full inverse Hessian estimate limits its use to structures containing less than a few hundred atoms on workstation class machines. As with the other minimizers, OPTIMIZE needs input coordinates and an rms gradient cutoff criterion. The output coordinates are saved in updated .xyz files or as cycle files.&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;NEWTON&#039;&#039;&#039;&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt; A truncated Newton minimization method which requires potential energy, gradient and Hessian information. This procedure has significant advantages over standard Newton methods, and is able to minimize very large structures completely. Several options are provided with respect to minimization method and preconditioning of the Newton equations. The default options are recommended unless the user is familiar with the math involved. This program operates in Cartesian coordinate space and is fairly tolerant of poor input structures. Typical algorithm iteration times are longer than with nonlinear conjugate gradient or variable metric methods, but many fewer iterations are required for complete minimization. NEWTON is usually the best choice for minimizations to the 0.01 to 0.000001 kcal/mole/Å level of rms gradient convergence. Tests for directions of negative curvature can be removed, allowing NEWTON to be used for optimization to conformational transition state structures (this only works if the starting point is very close to the transition state). Input consists of a TINKER .xyz coordinates file; output is an updated set of minimized coordinates and minimization statistics.&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
&lt;br /&gt;
==Tues 17th Nov 2009==&lt;br /&gt;
&lt;br /&gt;
Comparing 2 .pdb files:&lt;br /&gt;
&lt;br /&gt;
*As downloaded from PDB&lt;br /&gt;
*Converted to .xyz in TINKER then back to .pdb&lt;br /&gt;
&lt;br /&gt;
These files were then cleaned up to compare in a terminal using the &amp;quot;diff&amp;quot; command:&lt;br /&gt;
&lt;br /&gt;
*Header information and end information removed&lt;br /&gt;
*3 columns to the right in PDB version were removed to become consistent with the TINEKR file&lt;br /&gt;
*The first column &amp;quot;ATOM&amp;quot; was removed effectively making the line numbers equal to the atom numbers in both files&lt;br /&gt;
&lt;br /&gt;
==Wed 18th Nov 2009==&lt;br /&gt;
&lt;br /&gt;
1st diff was on line 52, a HYDROGEN ATOM:&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
&#039;&#039;52  HE2 HIS A   3    13.349 -10.803   9.527&#039;&#039;&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
added to the TINKER output, not in PDB file.&lt;br /&gt;
&lt;br /&gt;
2nd diff was on line 657, a HYDROGEN ATOM:&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
&#039;&#039;657  HE2 GLU A  46	 -0.867  -5.216  -0.822&#039;&#039;&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
added to the PDB file, not in TINKER output&lt;br /&gt;
&lt;br /&gt;
3rd diff was on line 1007, a HYDROGEN ATOM (number 1008):&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
&#039;&#039;5HG  CYS A  69	 -8.909  -5.194   5.526&#039;&#039;&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
added to the TINKER output, not in PDB file.&lt;br /&gt;
&lt;br /&gt;
4th diff was  on line 1621, a HYDROGEN ATOM (number 1623):&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
&#039;&#039;HE2 HIS A 108	  4.663   7.459   6.879&#039;&#039;&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
added to the TINKER output, not in PDB file.&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Following these reconciliations, both files were exactly the same with 1911 atoms present in each. 3 atoms were removed from the TINKER file and 1 was removed from the PDB file.&lt;br /&gt;
&lt;br /&gt;
*NOTE - loading the original .pdb from PDB to Gaussview then converting to a .gjf results in an error &amp;quot;1 or more PDB atom names has a &#039; or * character which can cause problems for Gaussian&amp;quot; - this is because the PDB file has extra atoms on the end of the file to define a highlighted chromophore... After these were removed in a txt editor the .gjf save went smoothly. Infact after this removal the file will run an AMBEr single point energy calculation fine - cf running when the extra info is in the bottom.&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
&lt;br /&gt;
All differences in files were due to protonation states of atoms... not the AA backbone.&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
&lt;br /&gt;
===TINKER vs PDB===&lt;br /&gt;
[[Image:TINKER PROTON.png]]&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
[[Image:PDB_PROTON.png]]&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
The highlighted fragments represent where the respective file version is protonated and the other is not.&lt;br /&gt;
&lt;br /&gt;
==Fri 20th Nov 2009==&lt;br /&gt;
&lt;br /&gt;
In an attempt to load the 1W7S structure to TINKER and successfully convert it it a .xyz format the following procedure was undertaken:&lt;br /&gt;
&lt;br /&gt;
*&lt;br /&gt;
&lt;br /&gt;
==Mon 23rd Nov 2009==&lt;br /&gt;
&lt;br /&gt;
===Protonation of GFP at various pH conditions===&lt;br /&gt;
&lt;br /&gt;
With 230 amino acid residues, each possessing a unique pKa in GFP it is obvious that there is a huge spectrum of protonation states available at a single solvation pH. Since protons cannot be detected using X-ray diffraction, protonation states must be uncovered via a difference method. Several questions arise when thinking about protonation of the protein:&lt;br /&gt;
&lt;br /&gt;
*Which residues have the ability to be protonated/deprotonated&lt;br /&gt;
*Is the PDB structure protonated for a specific pH (Bio pH)&lt;br /&gt;
*How does the protonation of the protein change through a variety of solvation pHs&lt;br /&gt;
*Does the protonation state affect the chromophore photophysics (electronic embedding)&lt;br /&gt;
&lt;br /&gt;
Answering these questions:&lt;br /&gt;
&lt;br /&gt;
*Polar amino acids are those that can be protonated/deprotonated, these are identified by Gaussview and shown in the atom list of the molecule with the option of setting them to either protonated/deprotonated.&lt;br /&gt;
*Dont know if the current structure is protonated to any specific pH... this is unclear until protonate at certain pHs&lt;br /&gt;
*The protonation will be tested across a range of pHs to see how protonation changes&lt;br /&gt;
&lt;br /&gt;
==Tues 24th Nov 2009==&lt;br /&gt;
&lt;br /&gt;
The pKa value of a residue will indicate protonation state as a function of pH. If you know the pH of your solution then if the pKa of a given functional group is lower than this pH then the functional group will be unprotonated. If the pKa is higher than the pH then the functional group will be protonated. &#039;&#039;&#039;Beware that pKa values can be distorted by local conformational effects.&#039;&#039;&#039; The structure is protonated as though it were in pH 7 as it is loaded into Gaussview. Since hydrogens are defined in the PDB file it could be that the author added protons under these conditions.&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Histadine has 2 protonation points on the imidizole R chain:&lt;br /&gt;
&lt;br /&gt;
{|&lt;br /&gt;
| align=&amp;quot;center&amp;quot; style=&amp;quot;background:#f0f0f0;&amp;quot;|&#039;&#039;&#039;Yes-Yes&#039;&#039;&#039;&lt;br /&gt;
| align=&amp;quot;center&amp;quot; style=&amp;quot;background:#f0f0f0;&amp;quot;|&#039;&#039;&#039;Yes-No&#039;&#039;&#039;&lt;br /&gt;
| align=&amp;quot;center&amp;quot; style=&amp;quot;background:#f0f0f0;&amp;quot;|&#039;&#039;&#039;No-Yes&#039;&#039;&#039;&lt;br /&gt;
|-&lt;br /&gt;
|&amp;lt;jmol&amp;gt;&lt;br /&gt;
  &amp;lt;jmolApplet&amp;gt;&lt;br /&gt;
    &amp;lt;uploadedFileContents&amp;gt;Protein_yes_yes.mol&amp;lt;/uploadedFileContents&amp;gt;&lt;br /&gt;
    &amp;lt;color&amp;gt;#FFFFFF&amp;lt;/color&amp;gt;&lt;br /&gt;
    &amp;lt;size&amp;gt;400&amp;lt;/size&amp;gt;&lt;br /&gt;
    &amp;lt;text&amp;gt;Yes Yes&amp;lt;/text&amp;gt;&lt;br /&gt;
  &amp;lt;/jmolApplet&amp;gt;&lt;br /&gt;
&amp;lt;/jmol&amp;gt;||&amp;lt;jmol&amp;gt;&lt;br /&gt;
  &amp;lt;jmolApplet&amp;gt;&lt;br /&gt;
    &amp;lt;uploadedFileContents&amp;gt;Protein_yes_no.mol&amp;lt;/uploadedFileContents&amp;gt;&lt;br /&gt;
    &amp;lt;color&amp;gt;#FFFFFF&amp;lt;/color&amp;gt;&lt;br /&gt;
    &amp;lt;size&amp;gt;400&amp;lt;/size&amp;gt;&lt;br /&gt;
    &amp;lt;text&amp;gt;Yes Yes&amp;lt;/text&amp;gt;&lt;br /&gt;
  &amp;lt;/jmolApplet&amp;gt;&lt;br /&gt;
&amp;lt;/jmol&amp;gt;||&amp;lt;jmol&amp;gt;&lt;br /&gt;
  &amp;lt;jmolApplet&amp;gt;&lt;br /&gt;
    &amp;lt;uploadedFileContents&amp;gt;Protein_no_yes.mol&amp;lt;/uploadedFileContents&amp;gt;&lt;br /&gt;
    &amp;lt;color&amp;gt;#FFFFFF&amp;lt;/color&amp;gt;&lt;br /&gt;
    &amp;lt;size&amp;gt;400&amp;lt;/size&amp;gt;&lt;br /&gt;
    &amp;lt;text&amp;gt;Yes Yes&amp;lt;/text&amp;gt;&lt;br /&gt;
  &amp;lt;/jmolApplet&amp;gt;&lt;br /&gt;
&amp;lt;/jmol&amp;gt;&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
The imidazole sidechain of histidine has a pKa of approximately 6, and overall, the amino acid has a pKa of 7.6. This means that at physiologically relevant pH values, relatively small shifts in pH will change its average charge. Below a pH of 6, the imidazole ring is mostly protonated as described by the Henderson–Hasselbalch equation. When protonated, the imidazole ring bears two NH bonds and has a positive charge. The positive charge is equally distributed between both nitrogens and can be represented with two equally important resonance structures.&lt;br /&gt;
&lt;br /&gt;
===Protonation===&lt;br /&gt;
&lt;br /&gt;
At pH 7 the PDB file protonations are correct for all residues. Moving to pH 5 the histadine residues become protonated (pKa 6)- this is the only change. At more alkaline pHs it would take pH 9.1 to make any effect (deprotonating CYS and then TYR at 9.7)&lt;br /&gt;
&lt;br /&gt;
===Mini-Biomolecule===&lt;br /&gt;
&lt;br /&gt;
To test ONIOM calculations vs pure DFT methods and other calculation types on a bio system that is smaller than GFP (computationally less expensive, can streamline the real calcs)&lt;br /&gt;
&lt;br /&gt;
Trying a molecule analogous to the example given in the ONIOM tutorial...&lt;br /&gt;
&lt;br /&gt;
&amp;lt;jmol&amp;gt;&lt;br /&gt;
  &amp;lt;jmolApplet&amp;gt;&lt;br /&gt;
    &amp;lt;uploadedFileContents&amp;gt;Biomolecule1.mol&amp;lt;/uploadedFileContents&amp;gt;&lt;br /&gt;
    &amp;lt;color&amp;gt;#FFFFFF&amp;lt;/color&amp;gt;&lt;br /&gt;
    &amp;lt;size&amp;gt;400&amp;lt;/size&amp;gt;&lt;br /&gt;
    &amp;lt;text&amp;gt;Yes Yes&amp;lt;/text&amp;gt;&lt;br /&gt;
  &amp;lt;/jmolApplet&amp;gt;&lt;br /&gt;
&amp;lt;/jmol&amp;gt;&lt;br /&gt;
&lt;br /&gt;
==Thurs 26th Nov 2009==&lt;br /&gt;
&lt;br /&gt;
===Checking the GFP input will run===&lt;br /&gt;
&lt;br /&gt;
Procedure to get 1WS7 to run an AMBER energy calculation:&lt;br /&gt;
&lt;br /&gt;
*Load the PDB file without water&lt;br /&gt;
*Try opt amber but will fail due to missing parameters (all in the chromophore)&lt;br /&gt;
*Set these parameters to 0 since they cancel in ONIOM&lt;br /&gt;
*Freeze the atotms involving missing paramters&lt;br /&gt;
*Energy AMBER will now run for the file - NO CHARGES - 9.07881632au&lt;br /&gt;
*Open the Atom Editor, Edit, MM charges, Assign all atoms&lt;br /&gt;
*Manually edit any missing charges&lt;br /&gt;
*Energy AMBER will now run including charges (basic AMBER94 charges) - 0.260391165au&lt;br /&gt;
&lt;br /&gt;
Charges in this file did not sum to an integer number - the only residue with a non integer charge was no.64 with a charge of 0.61160000&lt;br /&gt;
&lt;br /&gt;
 MM sanity checks:&lt;br /&gt;
 All charges sum to:               -5.38840000&lt;br /&gt;
 Charges of atoms sum to:          -5.38840000&lt;br /&gt;
 Layer 1 (S) has     34 atoms and charge   0.61160000 sum=   0.61160000&lt;br /&gt;
 Layer 2 (R) adds  3574 atoms and charge  -6.00000000 sum=  -5.38840000&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
 WARNING -- The following centers have non-zero partial&lt;br /&gt;
 charges, but zero Vanderwaals parameters. These centers&lt;br /&gt;
 can collapse onto other centers with non-zero charge&lt;br /&gt;
 Center    Charge&lt;br /&gt;
   1833  0.423900&lt;br /&gt;
   1890  0.410200&lt;br /&gt;
   2029  0.427500&lt;br /&gt;
   2043  0.427500&lt;br /&gt;
   2080  0.410200&lt;br /&gt;
   2089  0.399200&lt;br /&gt;
   2123  0.410200&lt;br /&gt;
   2179  0.410200&lt;br /&gt;
   2186  0.410200&lt;br /&gt;
   2260  0.410200&lt;br /&gt;
   2287  0.410200&lt;br /&gt;
   2294  0.410200&lt;br /&gt;
   2347  0.427500&lt;br /&gt;
   2369  0.399200&lt;br /&gt;
   2469  0.427500&lt;br /&gt;
   2508  0.399200&lt;br /&gt;
   2551  0.410200&lt;br /&gt;
   2619  0.399200&lt;br /&gt;
   2639  0.410200&lt;br /&gt;
   2714  0.410200&lt;br /&gt;
   2924  0.399200&lt;br /&gt;
   2939  0.399200&lt;br /&gt;
   2950  0.427500&lt;br /&gt;
   2981  0.399200&lt;br /&gt;
   3180  0.427500&lt;br /&gt;
   3233  0.399200&lt;br /&gt;
   3262  0.410200&lt;br /&gt;
   3361  0.399200&lt;br /&gt;
   3377  0.427500&lt;br /&gt;
   3384  0.410200&lt;br /&gt;
   3397  0.427500&lt;br /&gt;
   3418  0.427500&lt;br /&gt;
   3562  0.410200&lt;br /&gt;
   3593  0.410200&lt;br /&gt;
   3608  0.410200&lt;br /&gt;
&lt;br /&gt;
* oniom(pm6:amber=hardfirst) geom=connectivity was run successfully, resulting in an energy of 0.210085730788au&lt;br /&gt;
&lt;br /&gt;
This still resulted in some charge problems:&lt;br /&gt;
&lt;br /&gt;
 MM sanity checks:&lt;br /&gt;
 All charges sum to:               &#039;&#039;&#039;-5.38840000&#039;&#039;&#039;&lt;br /&gt;
 Charges of atoms sum to:           &#039;&#039;&#039;0.79320000&#039;&#039;&#039;&lt;br /&gt;
 Residue  63 PDB Number    &#039;&#039;&#039;64_A PHE has&#039;&#039;&#039;     1 atoms, and a charge of   0.59730000&lt;br /&gt;
 Residue  64 PDB Number    &#039;&#039;&#039;66_A CSY has&#039;&#039;&#039;    34 atoms, and a charge of   0.61160000&lt;br /&gt;
 Residue  65 PDB Number    &#039;&#039;&#039;68_A VAL has&#039;&#039;&#039;     1 atoms, and a charge of  -0.41570000&lt;br /&gt;
 WARNING -- The following centers have non-zero partial&lt;br /&gt;
 charges, but zero Vanderwaals parameters. These centers&lt;br /&gt;
 can collapse onto other centers with non-zero charge&lt;br /&gt;
 Center    Charge&lt;br /&gt;
   &#039;&#039;&#039;3608  0.410200&#039;&#039;&#039;&lt;br /&gt;
 MMInit generated parameter data with length LenPar=       40470.&lt;br /&gt;
 Standard basis: Dummy (5D, 7F)&lt;br /&gt;
 There are    36 symmetry adapted basis functions of A   symmetry.&lt;br /&gt;
 Integral buffers will be    262144 words long.&lt;br /&gt;
 Raffenetti 1 integral format.&lt;br /&gt;
 Two-electron integral symmetry is turned on.&lt;br /&gt;
    36 basis functions,    36 primitive gaussians,    36 cartesian basis functions&lt;br /&gt;
    27 alpha electrons       27 beta electrons&lt;br /&gt;
       nuclear repulsion energy       200.9124124811 Hartrees.&lt;br /&gt;
 NAtoms= 3608 NActive=   36 NUniq=   36 SFac= 7.50D-01 NAtFMM=   80 NAOKFM=F Big=F&lt;br /&gt;
 AMBER calculation of energy.&lt;br /&gt;
 Energy=    0.110474559     NIter=   0.&lt;br /&gt;
 Dipole moment=       3.390901       5.889448      -2.505476&lt;br /&gt;
 ONIOM: saving gridpoint  1&lt;br /&gt;
 ONIOM: restoring gridpoint  3&lt;br /&gt;
 ONIOM: calculating energy.&lt;br /&gt;
 ONIOM: gridpoint  1 method:  low   system:  model energy:     0.110474559177&lt;br /&gt;
 ONIOM: gridpoint  2 method:  high  system:  model energy:     0.060169124949&lt;br /&gt;
 ONIOM: gridpoint  3 method:  low   system:  real  energy:     0.260391165016&lt;br /&gt;
 ONIOM: extrapolated energy =       0.210085730788&lt;br /&gt;
 ONIOM: calculating electric field derivatives.&lt;br /&gt;
 ONIOM: Integrating ONIOM file  4 number   619&lt;br /&gt;
 ONIOM: Dipole        =-2.76174164D+01-1.34762322D+02 3.99239712D+01&lt;br /&gt;
 ONIOM: Dipole moment (Debye):&lt;br /&gt;
    X=   -70.1965    Y=  -342.5316    Z=   101.4766  Tot=   364.0782&lt;br /&gt;
 ONIOM: Integrating ONIOM file  5 number   695&lt;br /&gt;
&lt;br /&gt;
An ONIOM calculation was run using a DFT method on the high level and AMBER on the low level:  &#039;&#039;oniom(b3lyp/3-21g:amber=hardfirst) geom=connectivity&#039;&#039; which ran successfully...&lt;br /&gt;
&lt;br /&gt;
===Charges===&lt;br /&gt;
&lt;br /&gt;
Charges in the protein are important for the calculation that will be run on GFP eventually since they provide the electrostatic field or &amp;quot;atmosphere&amp;quot; which surrounds the chromophore, polarizing the electron density used by the DFT method to evaluate the chromophore. Gaussian offers 2 types of influence which the MM portion can exert on the QM part:&lt;br /&gt;
&lt;br /&gt;
* Mechanical Embedding&lt;br /&gt;
* Electronic Embedding&lt;br /&gt;
&lt;br /&gt;
Mechanical embedding does not use any electronic components to affect the central high region. Electronic embedding does use the charge generated in the MM region to polarize the high region.&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
At pH 7 the GFP molecule is in a 6- charge state according to assigning AMBER charges and summing the residue charges (available in the out file). 3 options were considered at this stage for treating charges:&lt;br /&gt;
&lt;br /&gt;
* Ignore charges on the entire molecule&lt;br /&gt;
* Use AMBER94 assigned partial charges on the low region&lt;br /&gt;
* Use Qeq to assign charges based on an overall charge of -6&lt;br /&gt;
&lt;br /&gt;
The first option can be discarded since the electronic atmosphere is of particular interest here. The second 2 options are to be investigated. See below (on a small Biomolecule)&lt;br /&gt;
&lt;br /&gt;
ON a 4 residue chain (in mini biomolecule folder) - As in the GFP tutorial on the wiki, a energy calculation was submitted with the keyword amber=qeq to get back the charges across the molecule. (not entirely sure what this does yet). AS a test of the effect of charges on a molecule a test molecule was compiled in TINKER consisting of 4 residues with 0 overall charges....&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;jmol&amp;gt;&lt;br /&gt;
  &amp;lt;jmolAppletButton&amp;gt;&lt;br /&gt;
    &amp;lt;uploadedFileContents&amp;gt;Protein1_AMBER_OPT_qeq.mol&amp;lt;/uploadedFileContents&amp;gt;&lt;br /&gt;
    &amp;lt;text&amp;gt;Test Protien&amp;lt;/text&amp;gt;&lt;br /&gt;
    &amp;lt;color&amp;gt;#FFFFFF&amp;lt;/color&amp;gt;&lt;br /&gt;
    &amp;lt;size&amp;gt;600&amp;lt;/size&amp;gt;&lt;br /&gt;
    &amp;lt;/jmolAppletButton&amp;gt;&lt;br /&gt;
&amp;lt;/jmol&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
&lt;br /&gt;
{|&lt;br /&gt;
|No Charges|| 0.01940311au&lt;br /&gt;
|-&lt;br /&gt;
|Amber Charges|| -0.18117151au&lt;br /&gt;
|-&lt;br /&gt;
|Qeq Charges|| -0.23038199au&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
Clearly the Qeq shows the lowest energy - the considerable differences in energy show that charges are an important part of the input.&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
&lt;br /&gt;
===Testing Charges and EE on a small system===&lt;br /&gt;
&lt;br /&gt;
&amp;lt;jmol&amp;gt;&lt;br /&gt;
  &amp;lt;jmolAppletButton&amp;gt;&lt;br /&gt;
    &amp;lt;uploadedFileContents&amp;gt;Chromopre_molecule_B3LYP_6-31G(d).mol&amp;lt;/uploadedFileContents&amp;gt;&lt;br /&gt;
    &amp;lt;script&amp;gt;spin on&amp;lt;/script&amp;gt;&lt;br /&gt;
    &amp;lt;text&amp;gt;Test Protien&amp;lt;/text&amp;gt;&lt;br /&gt;
    &amp;lt;color&amp;gt;#FFFFFF&amp;lt;/color&amp;gt;&lt;br /&gt;
    &amp;lt;size&amp;gt;600&amp;lt;/size&amp;gt;&lt;br /&gt;
    &amp;lt;/jmolAppletButton&amp;gt;&lt;br /&gt;
&amp;lt;/jmol&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Energies of the optimisations using various charge approximations and with/without EE&lt;br /&gt;
{|class=&amp;quot;wikitable&amp;quot; border=&amp;quot;1&amp;quot;&lt;br /&gt;
!Method||Mechanical Embedding||Electronic Embedding||Computational Time (mechanical embedding)||Computational Time (electronic embedding)&lt;br /&gt;
|-&lt;br /&gt;
|B3LYP 6-31G*||-1406.04||N/A||11 hours 18 minutes 44.7 seconds||11 hours 18 minutes 44.7 seconds&lt;br /&gt;
|-&lt;br /&gt;
|ONIOM B3LYP 6-31G*:AMBER (No Charges)||-783.2225||-783.2223||2 hours 13 minutes  4.1 seconds||&lt;br /&gt;
|-&lt;br /&gt;
|ONIOM B3LYP 6-31G*:AMBER (Amber Charges)|| || ||  ||&lt;br /&gt;
|-&lt;br /&gt;
|ONIOM B3LYP 6-31G*:AMBER (Qeq Charges)||-783.2467|| ||2 hours  6 minutes  8.6 seconds ||&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
NB. &lt;br /&gt;
* B3LYP 6-31G* and ONIOM B3LYP 6-31G*:AMBER (No Charges) were from the same starting geometry.&lt;br /&gt;
* ONIOM B3LYP 6-31G*:AMBER (Amber Charges) and ONIOM B3LYP 6-31G*:AMBER (Qeq Charges) were from the same starting geometry.&lt;br /&gt;
* All Electronic embedding is from the same starting point - Can compare times.&lt;br /&gt;
&lt;br /&gt;
Dipole moments using various charge approximations and with/without EE&lt;br /&gt;
{|class=&amp;quot;wikitable&amp;quot; border=&amp;quot;1&amp;quot;&lt;br /&gt;
!Method||Dipole moment (No EE)||Dipole moment (With EE)&lt;br /&gt;
|-&lt;br /&gt;
|B3LYP 6-31G*||7.4914||N/A&lt;br /&gt;
|-&lt;br /&gt;
|ONIOM B3LYP 6-31G*:AMBER (No Charges)||3.2328||3.2241&lt;br /&gt;
|-&lt;br /&gt;
|ONIOM B3LYP 6-31G*:AMBER (Amber Charges)||5.2794||5.2865&lt;br /&gt;
|-&lt;br /&gt;
|ONIOM B3LYP 6-31G*:AMBER (Qeq Charges)||8.173||8.1654&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
==2nd Dec 2009==&lt;br /&gt;
&lt;br /&gt;
A single point energy of 1w7s was run with the additional qeq keyword to return partial charges (overall charge set to -6)&lt;br /&gt;
This new molecule was submitted for AMBER opt but failed after around 700 iterations... this could be due to the error:&lt;br /&gt;
&lt;br /&gt;
 WARNING -- The following centers have non-zero partial&lt;br /&gt;
 charges, but zero Vanderwaals parameters. These centers&lt;br /&gt;
 can collapse onto other centers with non-zero charge&lt;br /&gt;
&lt;br /&gt;
When the atoms defined in this error had their partial charges set to 0.00, the result was a fully converged AMBER optimisation:&lt;br /&gt;
&lt;br /&gt;
*Frozen chromophore.&lt;br /&gt;
*pH 7 giving a charge of -6&lt;br /&gt;
*Charges assigned using Qeq method&lt;br /&gt;
*Zero Van der Waals parameter atoms partial charges set to 0.00&lt;br /&gt;
&lt;br /&gt;
Looking at what atoms are not defined in terms of Van der Waals paramters it is ALWAYS an aloholic proton that is not parameterised...&lt;br /&gt;
&lt;br /&gt;
Tutorial used AMBER charges then ran Qeq for the missing charges... i think this is what she thought she did... gaussian &amp;quot;amber=qeq&amp;quot; will overwrite all charges.. need to use &amp;quot;amber=uncharged&amp;quot;&lt;br /&gt;
so that we dont overwrite the amber charges...&lt;br /&gt;
&lt;br /&gt;
In effect full Qeq charges were assigned in the tutorial. Also overall charge is 0 - will check this.&lt;br /&gt;
&lt;br /&gt;
===The GFP tutorial===&lt;br /&gt;
&lt;br /&gt;
*AMBER charges (AMBER94 charges) were assigned to the GFP molecule in the tutorial&lt;br /&gt;
*she notices correctly that some are missing in the chromophore and so decides to assign them via Qeq&lt;br /&gt;
*Runs Qeq on the GFP molecule with a &#039;&#039;&#039;0 charge&#039;&#039;&#039; which is wrong! When run the AMBER charges and look at the log file we can see that the overall charge should be &#039;&#039;&#039;+4&#039;&#039;&#039; for that protonation state.&lt;br /&gt;
*Since Qeq is an equilibrium method, partial charges are solved with the constraint that their sum must equal the overall charge.&lt;br /&gt;
*These Qeq charges should therefore not fit with an integer sum when pasted into the AMBER charges.&lt;br /&gt;
&lt;br /&gt;
===Qeq versus AMBER94 charges===&lt;br /&gt;
&lt;br /&gt;
Ogawa et al. state:&lt;br /&gt;
&lt;br /&gt;
 Geometry optimizations of a crambin protein were performed using force fields combined with the&lt;br /&gt;
 CQEq. The CQEq charges were found to be more comparable to the Mulliken charges obtained by the ab initio molecular orbital&lt;br /&gt;
 calculation than the fixed charges used in the AMBER force field. Furthermore, the CQEq charges give more realistic charge redistribution&lt;br /&gt;
 between amino acids in the crambin.&lt;br /&gt;
&lt;br /&gt;
Mini Biomolecule calculations:&lt;br /&gt;
&lt;br /&gt;
{|class=&amp;quot;wikitable&amp;quot; border=&amp;quot;1&amp;quot;&lt;br /&gt;
!Method||Dipole moment (No EE)||Dipole moment (With EE)&lt;br /&gt;
|-&lt;br /&gt;
|B3LYP 6-31G*||7.4914||N/A&lt;br /&gt;
|-&lt;br /&gt;
|ONIOM B3LYP 6-31G*:AMBER (No Charges)||3.2328||3.2241&lt;br /&gt;
|-&lt;br /&gt;
|ONIOM B3LYP 6-31G*:AMBER (Amber Charges)||5.2794||5.2865&lt;br /&gt;
|-&lt;br /&gt;
|ONIOM B3LYP 6-31G*:AMBER (Qeq Charges)||8.173||8.1654&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
Clearly the Qeq charges are in better agreement with the DFT method than no or AMBER charges.&lt;br /&gt;
&lt;br /&gt;
These frequencies are calulated using mechanical embedding!&lt;br /&gt;
&lt;br /&gt;
{|class=&amp;quot;wikitable&amp;quot; border=&amp;quot;1&amp;quot;&lt;br /&gt;
!Method||C=O||C=C&lt;br /&gt;
|-&lt;br /&gt;
|B3LYP 6-31G*||1799.65||1701.86&lt;br /&gt;
|-&lt;br /&gt;
|ONIOM B3LYP 6-31G*:AMBER (No Charges)||1802.25||1703.63&lt;br /&gt;
|-&lt;br /&gt;
|ONIOM B3LYP 6-31G*:AMBER (Amber Charges)||1803.37||1701.67&lt;br /&gt;
|-&lt;br /&gt;
|ONIOM B3LYP 6-31G*:AMBER (Qeq Charges)||1800.18||1702.86&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
As a result it seems obvious that a Qeq charge assignment in constraint by the correct total charge gives results most comparable to the pure QM method (both dipoles and frequencies)&lt;br /&gt;
&lt;br /&gt;
==3rd Dec 2009==&lt;br /&gt;
===ESP Charges===&lt;br /&gt;
&lt;br /&gt;
ESP calculates the expectation values of the electrostatic potential of a molecule on a uniform distribution of points. The resultant ESP surface is then fit to atom centered charges that best reproduce the electron distribution in a least-squares method. The set of points defining the default surface is generated according to the algorithm of Connolly.&lt;br /&gt;
&lt;br /&gt;
ESP charges thus require a QM calculation to create the molecular electrostatic potential used to fit atom centered charges. The size of GFP means that this is not an efficient method for use in this case?&lt;br /&gt;
&lt;br /&gt;
===Qeq Charges===&lt;br /&gt;
&lt;br /&gt;
The charge equilibration (QEq) method treats the atomic charge variation through the equilibration of electronegativities at atomic sites. The advantage of the QEq method is that the polarization effect in MM is taken into account by variation of the charge distribution in the molecules. The atomic charge variation relates to the intramolecular geometrical changes.&lt;br /&gt;
&lt;br /&gt;
===AMBER charges===&lt;br /&gt;
&lt;br /&gt;
==8th Dec 2009==&lt;br /&gt;
&lt;br /&gt;
A input creation page is now avaliable:&lt;br /&gt;
&lt;br /&gt;
https://www.ch.ic.ac.uk/wiki/index.php/AndyForesterGFP&lt;br /&gt;
&lt;br /&gt;
The final input file had protein information pasted in using text to columns in excel and now displays PDB residue information and PDB secondary structure information (nice cartoon pictures in ChemBio3D!)&lt;br /&gt;
&lt;br /&gt;
Interesting is a paper that uses TD-DFT on GFP (in my papers folder on the mac), the group uses a 3 stage minimisation method from the 1GFL PDB structure:&lt;br /&gt;
&lt;br /&gt;
* Freeze all atoms in the backbone of the protein and optimise after adding hydrogens&lt;br /&gt;
* Optimise the chromophore at a semi-empirical level&lt;br /&gt;
* They cut out a chromophore section but i could then perform a full QM/MM geometry optimisation&lt;br /&gt;
&lt;br /&gt;
===A more charged mini molecule to test electronic embedding===&lt;br /&gt;
&lt;br /&gt;
&amp;lt;jmol&amp;gt;&lt;br /&gt;
  &amp;lt;jmolAppletButton&amp;gt;&lt;br /&gt;
    &amp;lt;uploadedFileContents&amp;gt;More_charge_B3LYP_6_31G(d).mol&amp;lt;/uploadedFileContents&amp;gt;&lt;br /&gt;
    &amp;lt;text&amp;gt;The molecule with 2+ charge&amp;lt;/text&amp;gt;&lt;br /&gt;
    &amp;lt;color&amp;gt;#FFFFFF&amp;lt;/color&amp;gt;&lt;br /&gt;
    &amp;lt;size&amp;gt;600&amp;lt;/size&amp;gt;&lt;br /&gt;
    &amp;lt;/jmolAppletButton&amp;gt;&lt;br /&gt;
&amp;lt;/jmol&amp;gt;&amp;lt;br style=&amp;quot;clear:both&amp;quot; /&amp;gt;&lt;br /&gt;
&lt;br /&gt;
The molecule was submitted with 3 different job lines:&lt;br /&gt;
&lt;br /&gt;
* b3lyp 6-31g(d)&lt;br /&gt;
* oniom(b3lyp 6-31g(d):amber=(hardfirst,qeq))&lt;br /&gt;
* oniom(b3lyp 6-31g(d):amber=(hardfirst,qeq))=embedcharge&lt;br /&gt;
&lt;br /&gt;
Opt and Freq jobs were completed successfully. The charge breakdown input was:&lt;br /&gt;
&lt;br /&gt;
* Low Real 2 1&lt;br /&gt;
* High Model 0 1&lt;br /&gt;
* Low Model 0 1&lt;br /&gt;
&lt;br /&gt;
The aim of the calculations was to test electronic embedding versus the mechanical embedding. A paper suggests that there is a large difference in accuracy between the two methods. Qeq charge equilibration was used in both cases as it has already been proved to be much more effective than AMBER assigned charges. &#039;&#039;&#039;It has consistently overestimated the dipole moment though&#039;&#039;&#039;.&lt;br /&gt;
&lt;br /&gt;
{|class=&amp;quot;wikitable&amp;quot; border=&amp;quot;1&amp;quot;&lt;br /&gt;
!Method||Dipole moment&lt;br /&gt;
|-&lt;br /&gt;
|B3LYP 6-31G*||8.02&lt;br /&gt;
|-&lt;br /&gt;
|ONIOM B3LYP 6-31G*:AMBER (ME)||12.22&lt;br /&gt;
|-&lt;br /&gt;
|ONIOM B3LYP 6-31G*:AMBER (EE)||12.23&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
These results are odd since we expect the EE to have some effect on the charge distribution in the molecule (it is polarising the QM region) comparing the frequencies as a further check:&lt;br /&gt;
&lt;br /&gt;
{|class=&amp;quot;wikitable&amp;quot; border=&amp;quot;1&amp;quot;&lt;br /&gt;
!Method||C=O||C=C&lt;br /&gt;
|-&lt;br /&gt;
|B3LYP 6-31G*||1764||1636&lt;br /&gt;
|-&lt;br /&gt;
|ONIOM B3LYP 6-31G*:AMBER (ME)||1797||1697&lt;br /&gt;
|-&lt;br /&gt;
|ONIOM B3LYP 6-31G*:AMBER (EE)||1797||1697&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
Again the EE has made no difference to the frequency of various strong vibrations in the molecule. The conclusion is thus either:&lt;br /&gt;
&lt;br /&gt;
* EE has no real effect on the moleucule&lt;br /&gt;
* EE is not working properly&lt;br /&gt;
&lt;br /&gt;
Assuming the latter further investigation discovered that the meechanism of EE is determined using scaling parameters for MM charges during electronic embedding in the QM calculations. As a default the charges are scaled 555500 using the nomenclature ScaleCharge=ijklmn where n is closest to the QM region (massively anti intuitive). The input is read left to right and not all scaling factors need be defined. Further to this, the value given is then multiplied by 0.2 to arrive at the actual scaling coefficient. The result is thus for an atom:&lt;br /&gt;
&lt;br /&gt;
* 1 bond from the QM region (n) = 0.2n           Default is 0&lt;br /&gt;
* 2 bonds from the QM region (m) = 0.2m       Default is 0&lt;br /&gt;
* 3 bonds from the QM region (l) = 0.2l            Default is 5&lt;br /&gt;
* 4 bonds from the QM region (k) = 0.2k          Default is 5&lt;br /&gt;
* 5 bonds from the QM region (j) = 0.2j            Default is 5&lt;br /&gt;
* 6 bonds from the QM region (i) = 0.2i            Default is 5&lt;br /&gt;
&lt;br /&gt;
So all the atoms (and partial charges) in the MM region in the test molecule were being scaled to 0 when performing the EE. This explains the fact that no difference was uncovered in the previous calculation set.&lt;br /&gt;
&lt;br /&gt;
A new calculation was submitted with the command scalecharge=555 so that all charges in the MM part will be scaled to 1 (0.2xn,m,l)&lt;br /&gt;
&lt;br /&gt;
===Charges...again===&lt;br /&gt;
&lt;br /&gt;
So since i deleted the charges on the atoms where they can collapse, the overall charge is now -19.6 from the original -6 minus the 13.6 in charge deleted. REALLY ANNOYING! This is a huge difference in charges so will have to be solved. One easy option would be to set the involved atoms to 0 then run QEQ on the atoms with unassigned charges... essentially distributing the -6 charge through all atoms except those with 0 VdW parameters.&lt;br /&gt;
&lt;br /&gt;
==Tues 15th Dec 2009==&lt;br /&gt;
&lt;br /&gt;
The more charged molecule failed once the charges were scaled to 1 since SCF convergence was not attained.&lt;br /&gt;
&lt;br /&gt;
==Mon 25th Jan 2010==&lt;br /&gt;
&lt;br /&gt;
TAO tool publication was very good read... has a good suggestion of an optimisation procedure:&lt;br /&gt;
&lt;br /&gt;
*Take the crystal structure&lt;br /&gt;
*Add Hydrogens&lt;br /&gt;
*Determine protonation&lt;br /&gt;
*MM geometry optimisation&lt;br /&gt;
*(MD?)&lt;br /&gt;
*(Look at several samples of the MD)&lt;br /&gt;
*Optimise a final structure&lt;br /&gt;
&lt;br /&gt;
==Thurs 28th Jan 2010==&lt;br /&gt;
&lt;br /&gt;
Tao Tools requires preprocess within AMBER tools to run the scripts correctly. Does not have HIS on the residue database... Amber defines the protonation state within its residue naming scheme... HIS is either HIE, HIP, HID depending on its protonation state. [http://www.poissonboltzmann.org/pdb2pqr/documentation/programmers-guide/naming-scheme]&lt;br /&gt;
&lt;br /&gt;
==Thurs 4th Jan 2010==&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;Preprocessing&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
To get TAO tools and also for completeness it is necessary to preprocess the PDB file before attempting to run ONIOM calculations, this includes:&lt;br /&gt;
&lt;br /&gt;
* Adding hydrogens&lt;br /&gt;
* Create a custom residue file for the chromophore&lt;br /&gt;
* Convert from PDB to gjf ONIOM input&lt;br /&gt;
&lt;br /&gt;
Ambertools are extremely helpful with this preprocessing especially since amber will be the low level theory used throughout this project. Within ambertools are a number of useful programs:&lt;br /&gt;
&lt;br /&gt;
* antechamber - has a parameterlookup function within amber and GAFF parameter library files.&lt;br /&gt;
* xLeap - builds the protein from its constituent residues - adds hydrogens according to library files - allows a parm file to be built to add custom residues.&lt;br /&gt;
&lt;br /&gt;
I will be working on using these 2 programs to create a full MM input file which is fully defined and correct.&lt;br /&gt;
&lt;br /&gt;
===xLeap===&lt;br /&gt;
&lt;br /&gt;
To load xLeap we must use X11 to get the graphical interface and the ssh -Y command to allow tunneling back to X11. Also the command line to start xLeap should be:  &#039;&#039;xleap -s -f /apps/ambertools/amber11/dat/leap/cmd/leaprc.ff03&#039;&#039;. This loads the ff03 parameter file and results in the start up of the graphical interface with:&lt;br /&gt;
&lt;br /&gt;
 Welcome to LEaP!&lt;br /&gt;
 Sourcing leaprc: /amber/dat/leap/cmd/leaprc&lt;br /&gt;
 Log file: ./leap.log&lt;br /&gt;
 Loading parameters: /amber/dat/leap/parm/parmME.dat&lt;br /&gt;
 Loading library: /amber/dat/leap/lib/all_nucleic94.lib&lt;br /&gt;
 Loading library: /amber/dat/leap/lib/all_aminoME.lib&lt;br /&gt;
 Loading library: /amber/dat/leap/lib/all_aminoctME.lib&lt;br /&gt;
 Loading library: /amber/dat/leap/lib/all_aminontME.lib&lt;br /&gt;
 Loading library: /amber/dat/leap/lib/ions94.lib&lt;br /&gt;
 Loading library: /amber/dat/leap/lib/water.lib&lt;br /&gt;
&lt;br /&gt;
This shows that the residue libraries have been loaded. We are then free to load a pdb file which will be read by xLeap and built according to the residues! In addition to the residue names xleap also requires information about the chain connectivity. Residues can be connected to other residues from both sides, only one side or not at all. xleap is fairly simple in that it expects that all residues in the pdb file are connected in the order in which they are listed unless they are separated by a &amp;quot;TER&amp;quot; card. In most cases where a single protein chains is being investigated like here - the TER card should end the residue information.&lt;br /&gt;
&lt;br /&gt;
To load a pdb use (ignore the &amp;lt;&amp;gt;) &#039;&#039;&amp;lt;name for use in xLeap&amp;gt;=loadpdb &amp;quot;&amp;lt;name of file&amp;gt;&amp;quot;&#039;&#039;. Make sure that the file is in your PATH else fully define the path in the filename.&lt;br /&gt;
&lt;br /&gt;
The pdb should then be read into xLeap and built. At the end of the output we get (I loaded ChainA of the 1W7S without any water or hydrogens):&lt;br /&gt;
&lt;br /&gt;
 Leap added 1765 missing atoms according to residue templates:&lt;br /&gt;
       1 Heavy&lt;br /&gt;
       1764 H / lone pairs&lt;br /&gt;
 The file contained 21 atoms not in residue templates&lt;br /&gt;
&lt;br /&gt;
This shows that Leap added hydrogens automatically and also a missing &amp;quot;heavy&amp;quot; atom (one which is not a H or lp):&lt;br /&gt;
&lt;br /&gt;
 Added missing heavy atom: .R&amp;lt;CHIE 229&amp;gt;.A&amp;lt;OXT 18&amp;gt;&lt;br /&gt;
&lt;br /&gt;
For amino acids, the traditional order of atom records within a residue is N, CA, C, O, followed by the sidechain atoms (CB, CG1, CG2 . . . ) in order first of increasing remoteness, and then branch. The extra oxygen at the carboxyl terminal is called &amp;quot; OXT&amp;quot;, and appears after all sidechain atoms.&lt;br /&gt;
&lt;br /&gt;
Now that we have established that there are 21 atoms that are in the non standard residue region we need to define a custom residue which describes this region (chromophore). This involves creating a prep file....&lt;br /&gt;
&lt;br /&gt;
I first drew up the undefined section of GFP (chromophore) in Gaussview5 and saved as a.pdb then:&lt;br /&gt;
&lt;br /&gt;
* Load in antechamber and convert to mol2 format using &#039;&#039;antechamber -i &amp;lt;input&amp;gt; -fi pdb -o &amp;lt;output&amp;gt; -fo mol2 -c bcc&#039;&#039;&lt;br /&gt;
* Use parmchk to make sure the molecule was defined in the GAFF force field - kicks out a .frcmod file with missing parameters (including dihedrals) - nice :D!&lt;br /&gt;
* Manually add the &amp;quot;1CSY&amp;quot; where a standalone &amp;quot;0&amp;quot; is in the mol2 file so Leap can read it&lt;br /&gt;
* Load the gaff parameter file to Leap using &#039;&#039;source leaprc.gaff&#039;&#039;&lt;br /&gt;
* Load the chromophore_CSY_full.mol2 to Leap using &#039;&#039;CSY=loadmol2 &amp;quot;ChromophoreGFP_CSY_full.mol2&amp;quot;&#039;&#039;&lt;br /&gt;
* Check the residue using &#039;&#039;check CSY&#039;&#039; - Notice missing parameters are there.&lt;br /&gt;
* Load the ChromophoreGFP_CSY_full.frcmod file created by parmchk using &#039;&#039;loadamberparams ChromophoreGFP_CSY_full.frcmod&#039;&#039;&lt;br /&gt;
* Now check the CSY residue again using &#039;&#039;check CSY&#039;&#039; - No Problems! woo hoo! Now I can create amber parameters :D&lt;br /&gt;
&lt;br /&gt;
==Tues 9th Feb 2010==&lt;br /&gt;
&lt;br /&gt;
===RESP Fitting===&lt;br /&gt;
&lt;br /&gt;
To get charges that are like those of the standard amber force field an RESP fitting method has to be used. This involves 3 steps:&lt;br /&gt;
&lt;br /&gt;
* First, the molecule studied is optimized to determine a stable minimum (using Gaussian).&lt;br /&gt;
* Then, this minimized structure is used to calculate a Molecular Electrostatic Potential (MEP) on a three-dimensional grid (using Gaussian).&lt;br /&gt;
* Finally, this grid is exported into the &amp;quot;RESP program&amp;quot; which is used to fit atom-centered charges to the MEP&lt;br /&gt;
&lt;br /&gt;
To assist in this procedure - I am using a program called RED - [http://q4md-forcefieldtools.org/RED/] which can be used in all 3 stages...&lt;br /&gt;
&lt;br /&gt;
* Ante_RED is used to prepare Gaussian input files for the RESP fit&lt;br /&gt;
* RED III is used to fit atom centered charges&lt;br /&gt;
&lt;br /&gt;
The procedure used to create the CSY file for RESP fitting is as follows:&lt;br /&gt;
&lt;br /&gt;
* Cut the CSY residue from the 1W7S protein structure and cap terminal ends with methyl groups&lt;br /&gt;
* Optimise this in Gaussian at PM6 level&lt;br /&gt;
* Run the optimised CSY geometry through ante_RED to create the Gaussian input file involving minimisation at HF level and calculation of the MEP - also get the .P2N file required for RED III&lt;br /&gt;
* Run this Gaussian input file&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;Input file for optimisation:&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;#P hf/6-31G* Opt=(Tight,CalcFC) Freq SCF(Conver=8) Test&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
This will run a HF level optimisation and frequency analysis.  &#039;&#039;Opt=tight&#039;&#039; tightens the cutoffs on forces and step size that are used to determine convergence. An optimization with Opt=Tight will take several more steps than with the default cutoffs. For molecular systems with very small force constants (low frequency vibrational modes), this may be necessary to ensure adequate convergence and reliability of frequencies computed in a subsequent job step. This option can only be used with Berny optimizations. &#039;&#039;Opt = CalcFC&#039;&#039; causes the calculation of an analytical Hessian (in HF). &#039;&#039;SCF(Conver=8)&#039;&#039; sets the SCF convergence criterion to 10^-8.&lt;br /&gt;
&lt;br /&gt;
===Parameter files for use in Gaussian (Gaussian only uses amber94)===&lt;br /&gt;
&lt;br /&gt;
Using external parameter files. It is possible to include MM function entries within an external file by replacing the entries within the input file with the usual Gaussian include file mechanism: e.g., @oplsaa.prm. Parameter files for various force fields are included within the main Gaussian directory (i.e., $g09root/g09). If you want to specify the entire force field via an external file, use an input file like the following:&lt;br /&gt;
&lt;br /&gt;
# UFF=SoftOnly&lt;br /&gt;
&lt;br /&gt;
Read-in force field example&lt;br /&gt;
&lt;br /&gt;
molecule specification&lt;br /&gt;
&lt;br /&gt;
@$g09root/g09/oplsaa.prm&lt;br /&gt;
&lt;br /&gt;
Note that the choice of a molecular mechanics keyword is arbitrary as the entire force field depends on the read-in functions and parameters and not on any internal ones (in other words, specifying Amber would yield the same result as UFF).&lt;br /&gt;
&lt;br /&gt;
==Tues 16th Feb 2010==&lt;br /&gt;
===RESP Fitting===&lt;br /&gt;
&lt;br /&gt;
Following optimisation of the fragment for RESP fitting as seen before. We now have to fit the RESP to a MEP, this involves a few calculations:&lt;br /&gt;
&lt;br /&gt;
* ESP calcs in gaussian&lt;br /&gt;
* Fitting calcs by resp in ambertools&lt;br /&gt;
&lt;br /&gt;
The result is a .mol2 containing the fitted charges. The process has 2  input files:&lt;br /&gt;
&lt;br /&gt;
* .log&lt;br /&gt;
* .p2n&lt;br /&gt;
&lt;br /&gt;
The p2n is created by ante_RED and controls the RESP fitting process. (See RED site). To get a fit that excludes the capping methyls we add a REMARK line:&lt;br /&gt;
&lt;br /&gt;
 REMARK INTRA-MCC 0.0 |  35  36  37  38  39  40  41  42 | R&lt;br /&gt;
&lt;br /&gt;
This must be formatted EXACTLY as shown here (2 spaces between the numbers). The numbers correspond to atoms numbers, the 0.0 constrains these atoms to 0.0 charge and the R tells RESP to remove these atoms from the 2nd round fit and the subsequent mol2 output. Leaving a fragment.&lt;br /&gt;
&lt;br /&gt;
The RED program can be run on CX1! I used the following Jobscript (REDjobscript_4):&lt;br /&gt;
&lt;br /&gt;
 #PBS -l ncpus=1 &lt;br /&gt;
 #PBS -l mem=6000mb&lt;br /&gt;
 #PBS -l walltime=01:00:00&lt;br /&gt;
 #PBS -q pqchem&lt;br /&gt;
 #PBS -joe&lt;br /&gt;
 module load ambertools&lt;br /&gt;
 module load gaussian&lt;br /&gt;
 export GAUSS_SCRDIR=$TMPDIR&lt;br /&gt;
 echo $GAUSS_SCRDIR&lt;br /&gt;
 cd $HOME&lt;br /&gt;
 pwd&lt;br /&gt;
 perl /home/af706/RED-vIII.3.pl &amp;gt; /home/af706/RED-vIII.log&lt;br /&gt;
&lt;br /&gt;
So here we are using the chemistry node pqchem:&lt;br /&gt;
&lt;br /&gt;
* loading the ambertools (for RESP) and gaussian modules&lt;br /&gt;
* defining a scratch directory for the RESP program to read gaussian scratch&lt;br /&gt;
* TEMPDIR is used since the scratch must be empty upon calculation start. The temporary directory will be created fresh on each node as the calculation is started, and it&#039;s unlikely that two calculations will run on the same node.&lt;br /&gt;
* Moving to my HOME directory /home/af706 because the script must be run in the same directory as the input files  (.log and .p2n)&lt;br /&gt;
* RUN!&lt;br /&gt;
&lt;br /&gt;
The fitting process runs single point energy calcs and fits the RESP - overall it took 3mins for my fragment.&lt;br /&gt;
&lt;br /&gt;
===Going from xLeap out to Gaussian in===&lt;br /&gt;
&lt;br /&gt;
2 ways to do this:&lt;br /&gt;
&lt;br /&gt;
* Load in Gaussview&lt;br /&gt;
* Use Pengs scripts&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;Gaussview&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
* The file loads and all residues are recognised (apart from the chromophore). The file contains the same number of atoms as the saved Leap file.&lt;br /&gt;
* Charges are assigned from amber94 - not the chromophore still&lt;br /&gt;
* Amber atom types are assigned correctly to ALL atoms&lt;br /&gt;
* All that remains is to add charges from the RESP fit.&lt;br /&gt;
* PDB information is retained in input&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;TAO&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
* Atoms are added so atom count is different to the file leaving Leap&lt;br /&gt;
* Fails to treat terminal residues correctly&lt;br /&gt;
* Lose residue information&lt;br /&gt;
* Charges are missing from chromophore and atoms in terminal residues&lt;br /&gt;
&lt;br /&gt;
==22nd Feb 2010==&lt;br /&gt;
&lt;br /&gt;
Links to HOW TO for Ambertools + Gaussview/ Ambertools + TAO tools + Gaussview&lt;br /&gt;
&lt;br /&gt;
* Ambertools + Gaussview  - https://www.ch.ic.ac.uk/wiki/index.php/AndyForester_Ambertools_Gaussview&lt;br /&gt;
* Ambertools + TAO tools + Gaussview - https://www.ch.ic.ac.uk/wiki/index.php/AndyForester_Ambertools_TAO&lt;/div&gt;</summary>
		<author><name>Af706</name></author>
	</entry>
	<entry>
		<id>https://chemwiki.ch.ic.ac.uk/index.php?title=AndyForester_Ambertools_TAO&amp;diff=97695</id>
		<title>AndyForester Ambertools TAO</title>
		<link rel="alternate" type="text/html" href="https://chemwiki.ch.ic.ac.uk/index.php?title=AndyForester_Ambertools_TAO&amp;diff=97695"/>
		<updated>2010-02-22T16:16:50Z</updated>

		<summary type="html">&lt;p&gt;Af706: New page: =Ambertools + TAO + Gaussview=  back to main - https://www.ch.ic.ac.uk/wiki/index.php/AndyForester#Going_from_xLeap_out_to_Gaussian_in&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;=Ambertools + TAO + Gaussview=&lt;br /&gt;
&lt;br /&gt;
back to main - https://www.ch.ic.ac.uk/wiki/index.php/AndyForester#Going_from_xLeap_out_to_Gaussian_in&lt;/div&gt;</summary>
		<author><name>Af706</name></author>
	</entry>
	<entry>
		<id>https://chemwiki.ch.ic.ac.uk/index.php?title=AndyForester_Ambertools_Gaussview&amp;diff=97693</id>
		<title>AndyForester Ambertools Gaussview</title>
		<link rel="alternate" type="text/html" href="https://chemwiki.ch.ic.ac.uk/index.php?title=AndyForester_Ambertools_Gaussview&amp;diff=97693"/>
		<updated>2010-02-22T16:15:53Z</updated>

		<summary type="html">&lt;p&gt;Af706: /* Ambertools + Gaussview */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;=Ambertools + Gaussview=&lt;br /&gt;
&lt;br /&gt;
Back to main - https://www.ch.ic.ac.uk/wiki/index.php/AndyForester#Going_from_xLeap_out_to_Gaussian_in&lt;/div&gt;</summary>
		<author><name>Af706</name></author>
	</entry>
</feed>