Skip to main content

Using Dictionary to edit ptraj input file

Recently I have been working on Linear combination of Distance (LCOD).
One has to extract a set of structures (may vary from 100 to 1000 to any bigger number depending on requirement, accuracy) from a very long trajectory.
If one is looking for very specific orientation, then it is necessary to go through the whole trajectory and choose the required frames and select that geometry, which is very tedious and time taking job.
But more often a random selection from the trajectory is done to take care of all the diversity in the trajectory.
In Amber, one can use ptraj to select a frame from trajectory and created its restart file to launch LCOD calculation. To launch such 100 or 1000 calculation one need to modify or create 100 or 1000 such input files to feed to ptraj with the frame number and the name of the restart file.
I am using a dictionary in python to do so.
Beginning with 2 lists:
one having frame numbers to be selected
another with name of the restart files corresponding to the different frames

I also want to give the input file a name referring to the restart file.

The initial input file know as praj.in looks like:

-----------------------
trajin md11.mdcrd 1111 1111 1
trajout md11_restart.rst restart

-----------------------

Numbers 1111 1111 are the frame numbers. So asking to select frame numbers from 1111 to 1111 i.e. only one frame.

want to create restart files by changing number 11 in second line by sequential numbers from 1 to say 100 or upto 1000.

Here is a script that first creates dictionary using 2 lists matching frame number to the number of restart file, then uses this dictionary to generate different input files which can later on launched to generate the restart files.

------------------------------

#! /usr/bin/python
# code to create dictionary and generate ptraj input files
import os
keys=[]
values=[]
inp=open("list1","r")
inp1=open("list2","r")
for line in inp.readlines():
 for i in line.split():
  keys.append(i)
for line1 in inp1.readlines():
 for j in line1.split():
  values.append(j)
print keys
print values
dictionary=dict(zip(keys,values))
print dictionary
print keys
print values
for keys, values in dictionary.items():
 print keys
 print values
 fin = open("praj.in","r")
 fout = open("praj_"+str(values)+".in", "w")
 for line in fin:
  source="1111 1111"
  final= keys+" "+keys
  source1="md11_"
  final1= "md"+str(values)+"_"
  line=line.replace(source, final)
  print line
  line=line.replace(source1, final1)
  print line
  fout.write(line)
fin.close()
fout.close()
print "\nfiles praj_values.in are created\n"

--------------------------------

So the initial praj.in file was modified and saved by different names as praj_1.in, praj_2.in etc.
One of the modified file "praj_2.in" looks like:

-----------------------------
trajin md11.mdcrd 985 985 1
trajout md2_restart.rst restart

-------------------

Frame no. 985 will be selected as a restart point for the 2nd run out of 100 or 1000's of similar run

Comments

Popular posts from this blog

AutoDock Vina protocol: For Molecular Docking

There are many softwares available for carrying out molecular docking calculations. Many of these softwares are paid, while some of them are free. AutoDock is one such freely available software. Earlier, there was AutoDock 4.0 version available. One of the problem with the software was the space it would occupy, if you attempt to do multi docking. Recently, there is another algorithm made available under autodock. It is called AutoDock Vina. This is more efficient, robust and fast algorithm. There are many tutorials available to learn AutoDock Vina including: http://autodock.scripps.edu/faqs-help/tutorial/using-autodock-4-with-autodocktools/2012_ADTtut.pdf and also a video tutorial http://vina.scripps.edu/tutorial.html    But for some reason, few changes or explanation in little detail is always required to make it understandable for a beginner to follow.  Here is a protocol designed in our group, which our Master's student follow to get acquainted to Molecular Docking s

Material Studio - selective molecule movement

So, working with Material Studio is awesome. You get to handle bigger, complicated and even unimaginable system. But I am facing problems in very minute functioning. I recently faced two problems, but managed to resolve on my own. Somehow, tutorials doesn't help in resolving them and you end up doing some exercise on your own to get the solution. Among these, the problem of bringing two molecular structures in one window and then adjusting one molecule by moving it around, while other structure remains untouched, as it is. Actually, this should not be a problem to do. There is a simple way of doing this. Select the molecule to be moved and then there is a symbol (button) to select, which allows you to selectively move that molecule only. This is the same way it can be done in Material studio too.  But among all the dozens of symbols, the new person handling MS, kind of gets lost. So, what he does is check the tutorials, for any help. But the tutorials don't talk about suc

xleap to generate prmtop and inpcrd files

Now that we have the lib file with the information for all the molecules we need to include and also the corresponding combined pdb file, creating prmtop and inpcrd files should not be a difficult task. So, to begin with we have a lib file name: molecule.lib a pdb file for 4 molecules name: four.pdb The  task is to generate a prmtop and inpcrd file. Initiate xleap using: $AMBERHOME/exe/xleap -s -f $AMBERHOME/dat/leap/cmd/leaprc.ff99 In xleap window > source leaprc.gaff # load the lib file by: > loadoff molecule.lib I am not sure but some times it helps to also load the frcmod files we created for the four molecules the information of which is stored in the lib file. > loadamberparams aldehyde.frcmod # Load the pdb file: > P=loadpdb four.pdb You should see some thing like this in the xleap window now: ------------------- > P=loadpdb four.pdb Loading PDB file: ./four.pdb Warning: name change in pdb file residue 1 ;  this residue is split i