- This topic has 6 replies, 3 voices, and was last updated 12 years, 11 months ago by Anonymous.
January 6, 2011 at 3:48 am #732Anonymous
I’ve build loop model based on Rosetta Loop protocol and I am going to cluster those loops based on cluster protocol. However, I found that the cluster protocol is designed for the whole protein and there is no option for clustering only a part of the protein.
So, I am wondering, is it possible only cluster a part of the protein?
January 6, 2011 at 2:41 pm #4804Anonymous
Did you try using the cluster program anyway? Large identical sections shouldn’t cause it to not function…
January 7, 2011 at 1:34 am #4806Anonymous
yes, it works but didn’t work well because in the same cluster the results are largely different(eg: I use 3A for cutoff). So, it is wise to cluster them just focus on the loop region. Do you have any advices?
Thank you very much
January 7, 2011 at 2:58 pm #4807Anonymous
I guess you could try extracting JUST the loop residues into PDBs with a script and clustering those.
If you have constant sequences, then the loop residues ought to be on constant line numbers, so you can use something like:
for each PDB
$PDB | tail -n > looponly_$PDB
Then you can try clustering those. The new files will have the old file names embedded so you can translate them back…
If you have nonconstant sequences then you can do something slightly fancier with sed/awk/grep where you search for the right residue number in the right column – either way it should be a one-liner to extract the needed residues into their own PDB file.
January 8, 2011 at 8:56 am #4808Anonymous
Thank you very much for your so kind advices. I am still a little bit confused about this command:
- for each PDB
head -1 $PDB | tail -100 > looponly_$PDB
I store above three lines in a file and then run it in the terminal, it is said:
- ./00: line 1: syntax error near unexpected token `PDB’
./00: line 1: `for each PDB’
I also tried the following command in terminal directly:
- head -1 $PDB | tail -10 > looponly_$PDB
and it is said that:
- PDB: Undefined variable.
How to solve this?
- for each PDB
January 8, 2011 at 12:15 pm #4809Anonymous
smlewis’s script is a pseudo code, the PDB variable is undefined.
You can change your script like this:
#for example, my pdb files are in /home/SunH/my_decoys/
foreach PDB ( /home/SunH/my_decoys/* )
echo “Extract loop residues from $PDB”
#read loop residues from each pdb file and then write to a new pdb file
head -1 $PDB | tail -100 > “$PDB”_loop_only
January 8, 2011 at 4:16 pm #4810Anonymous
Yeah…I use tcsh, but most of my lab uses bash, so I just left the loop in pseudocode so albumns could pick his/her favorite shell. Sorry for the confusion.
- You must be logged in to reply to this topic.