Skip to main content

Using Dictionary to edit ptraj input file

Recently I have been working on Linear combination of Distance (LCOD).
One has to extract a set of structures (may vary from 100 to 1000 to any bigger number depending on requirement, accuracy) from a very long trajectory.
If one is looking for very specific orientation, then it is necessary to go through the whole trajectory and choose the required frames and select that geometry, which is very tedious and time taking job.
But more often a random selection from the trajectory is done to take care of all the diversity in the trajectory.
In Amber, one can use ptraj to select a frame from trajectory and created its restart file to launch LCOD calculation. To launch such 100 or 1000 calculation one need to modify or create 100 or 1000 such input files to feed to ptraj with the frame number and the name of the restart file.
I am using a dictionary in python to do so.
Beginning with 2 lists:
one having frame numbers to be selected
another with name of the restart files corresponding to the different frames

I also want to give the input file a name referring to the restart file.

The initial input file know as praj.in looks like:

-----------------------
trajin md11.mdcrd 1111 1111 1
trajout md11_restart.rst restart

-----------------------

Numbers 1111 1111 are the frame numbers. So asking to select frame numbers from 1111 to 1111 i.e. only one frame.

want to create restart files by changing number 11 in second line by sequential numbers from 1 to say 100 or upto 1000.

Here is a script that first creates dictionary using 2 lists matching frame number to the number of restart file, then uses this dictionary to generate different input files which can later on launched to generate the restart files.

------------------------------

#! /usr/bin/python
# code to create dictionary and generate ptraj input files
import os
keys=[]
values=[]
inp=open("list1","r")
inp1=open("list2","r")
for line in inp.readlines():
 for i in line.split():
  keys.append(i)
for line1 in inp1.readlines():
 for j in line1.split():
  values.append(j)
print keys
print values
dictionary=dict(zip(keys,values))
print dictionary
print keys
print values
for keys, values in dictionary.items():
 print keys
 print values
 fin = open("praj.in","r")
 fout = open("praj_"+str(values)+".in", "w")
 for line in fin:
  source="1111 1111"
  final= keys+" "+keys
  source1="md11_"
  final1= "md"+str(values)+"_"
  line=line.replace(source, final)
  print line
  line=line.replace(source1, final1)
  print line
  fout.write(line)
fin.close()
fout.close()
print "\nfiles praj_values.in are created\n"

--------------------------------

So the initial praj.in file was modified and saved by different names as praj_1.in, praj_2.in etc.
One of the modified file "praj_2.in" looks like:

-----------------------------
trajin md11.mdcrd 985 985 1
trajout md2_restart.rst restart

-------------------

Frame no. 985 will be selected as a restart point for the 2nd run out of 100 or 1000's of similar run

Comments

Popular posts from this blog

Material Studio - selective molecule movement

So, working with Material Studio is awesome. You get to handle bigger, complicated and even unimaginable system. But I am facing problems in very minute functioning. I recently faced two problems, but managed to resolve on my own. Somehow, tutorials doesn't help in resolving them and you end up doing some exercise on your own to get the solution. Among these, the problem of bringing two molecular structures in one window and then adjusting one molecule by moving it around, while other structure remains untouched, as it is. Actually, this should not be a problem to do. There is a simple way of doing this. Select the molecule to be moved and then there is a symbol (button) to select, which allows you to selectively move that molecule only. This is the same way it can be done in Material studio too.  But among all the dozens of symbols, the new person handling MS, kind of gets lost. So, what he does is check the tutorials, for any help. But the tutorials don't talk about suc...

How would COVID bahave in August - few projections

Entering August with COVID by side   Entering in the month of August, COVID is still here by our side. Situation seems grim, worse than earlier, as number of active cases as well as the count of deceased patients are increasing day by day. Yesterday we crossed the mark of 50,000 cases in a day. Closely competing with Brazil with US counts also in range now. On the other side, few other numbers like reduced mortality rate to 2.18%, recovery rate of 64.6% and slightly increased doubling rate of 21 days can reduce the anxiety. My projections of 16.75 lakhs for India might be crossed by coupe of thousands of more cases after today’s count is added. This number would reach to a huge count of 28.5 lakhs by August 15. Maharashtra just doesn’t like competition. Just like what Delhi tried few days back, Andhra Pradesh also managed to cross Maharashtra’s per day count for2 consecutive days. Seems like not a good idea. Maharashtra immediately went to the next zone. The state which w...

Rigorous method to create lib file

Sometimes you totally fail to go ahead with the prepin and frcmod files you have generated from the newly generated PDB file even after you follow the procedure given in previous two posts. And you keep wondering? In such a case you need a more detailed procedure. First take a look at the pdb file. For example: Here is one such pdb file generated from the gabedit that troubled me for some time. --------------- HEADER    PROTEIN COMPND    UNNAMED AUTHOR    GENERATED BY GABEDIT 2.3.8 at Tue Jul 26 16:53:48 2011 ATOM  1      C   C     1       -2.907  -0.997  -0.385  1.00  300.00          C 0.0000   ATOM  2      N   N     1       -2.060  -0.981  -1.189  1.00  300.00 ...