How to calculate the pairwise rotamer-to-rotamer energy table between mutable residues allowed to change to all amino acid types

This topic has 7 replies, 2 voices, and was last updated 10 years, 6 months ago by Anonymous.

Viewing 4 reply threads

Author

Posts
- January 23, 2014 at 12:24 am #1808
  Anonymous
  How can I calculate the self-energies and the table of pairwise energies between rotamers of mutable (molten or changeable) residues in a redesign problem where the mutable residues can change to all amino acid types? I basically am trying to figure out how to get the tables of pairwise rotamer-to-rotamer energies between mutable residues. I want the pairwise rotamer-to-rotamer energy table between mutable residues where the residue can change to all possible amino acid types and thus all possible rotamers of all possible amino acid types. I want to include all rotamers of all possible amino acid types in the mutable residues because I am trying to extend the side chain packing (only rotamers of a single amino acid type allowed) to redesign (all rotamers of all amino acid types allowed). I am modifying code from Jason Pacheco of Brown University (I changed the rotamer_set_for_moltenresidue function to FixbbRotamerSets function to try to include all rotamers of all amino acid types but it is giving the following error – AttributeError: ‘RotamerSets’ object has no attribute ‘FixbbRotamerSets’):
  
  import numpy as np
  import optparse # for option sorting
  import rosetta
  import rosetta.core.pack.task
  
  from rosetta import *
  from scipy.sparse import *
  from rosetta.core.pack.rotamer_set import *
  from rosetta.core.pack import *
  from rosetta.core.pack.interaction_graph import *
  from rosetta.core.pack.task import * # for using resfiles
  
  init()
  
  ## FUNCTIONS ############################################
  
  def get_rotamer_etable( ig, rotsets, moltenresI, moltenresJ ):
  “””Computes rotamer-rotamer energy table between two residues.”””
  
  # check edge exists
  if not ig.get_edge_exists(moltenresI, moltenresJ):
  return None
  
  # get etable
  rotsI = rotsets.FixbbRotamerSets()
  rotsJ = rotsets.FixbbRotamerSets()
  n_rots_I = rotsI.num_rotamers()
  n_rots_J = rotsJ.num_rotamers()
  E = np.zeros((n_rots_I,n_rots_J))
  for s_i in range(1, n_rots_I+1):
  for s_j in range(1, n_rots_J+1):
  E[s_i-1,s_j-1] = ig.get_two_body_energy_for_edge(moltenresI, moltenresJ, s_i, s_j)
  return E
  
  Please kindly guide me.
  Thank you so much.
- January 23, 2014 at 4:57 pm #9708
  Anonymous
  The rotsets object you’re getting is an instance of an object somewhere in the RotamerSets hierarchy. The FixbbRotamerSets() call would be to the constructor of the FixbbRotamerSets class to create a new object instance. You don’t need to call it, as rotsets should already be a FixbbRotamerSets object already. (If it’s not of the correct class for some reason, you’ll need to look at how you’re preparing the rotsets object before you pass it to your get_rotamer_etable() function call. You should be able to call the rotamer_set_for_moltenresidue() function directly.
  
  So how do you get all rotamers of all amino acids? That’s all up to how you’re creating the rotsets object. If you’re creating the object with a packertask which allows for design at a given position, the RotamerSet for that position will contain the rotamers for all of the amino acids allowed at that position. If you’ve set the packertask to be repack only at that position, then the rotamerset will only contain rotamers for a single identity.
  
  So what’s a molten residue? To save on storage space, the packer does a bit of index compression. Instead of indexing across the entire pose, it subsets just those residues for which it is considering rotamer changes. So any residue set to NATRO in the packertask won’t be considered a molten residue. Molten residues contain both design and repacking residues. The important thing to remember is that the parameter passed to rotamer_set_for_moltenresidue() is the moltenresID, which is not equivalent to the residue index in the pose. The translation is packertask dependent, but is stored in the RotamerSets object, and can be accessed with the resid_2_moltenres() and moltenres_2_resid() functions.
  
  So you get your RotamerSet object for your desired position (after converting it from absolute pose numbering to the moltenresID). Each of the rotamers in the rotamerset has a particular index for that particular rotamerset object. (The exact numbering depends on the particular packer task you use. Different levels of design and different packer task settings will result in different indexing.) This RotamerSet-specific index is what’s used internally in the interaction graph and the packer to index and label the rotamers and the rotamer interaction energies. (This is the reason for iterating from 1 to rotamerset.num_rotamers() inclusive in the code you provide.) This index is not amino-acid specific. In fact, all possible amino acids allowed at that position by the packer task are represented. For example, indices 1-20 may be for serine, 21 is for alanine, 22 for glycine, and 23-74 for lysine.
  
  Note that this rotamerset index is almost completely unrelated to the value obtained from core.pack.dunbrack.rotamer_from_chi(). Not only does the rotamerset combine multiple residue types into a single ordinal scheme, if you looked you’ll likely notice that there’s multiple rotamerset indexes which correspond to the same amino acid type and core.pack.dunbrack.rotamer_from_chi() vector. The reason for that is the extra rotamer sampling settings like -ex1 and -ex2. These options add subsamples of each of the rotamer bins. so you’ll get several rotamer samples (each with a different rotamerset index) which all fall at different places within the same dunbrack.rotamer_from_chi() bin.
  
  The essential problem you’re running up against is how to take a multidimensional data source (aa identity by chi identity by chi value) and collapse it into a single dimension which you can put as one side of your 2×2 interaction table. There’s no simple way to do it. The packer and the interaction graph choose to do it on an ad hoc basis, specialized for the particular residue and packer task under current consideration. If you’re looking for a more general indexing scheme, I believe you’ll have to come up with one that fits the criteria you need for your application.
- January 28, 2014 at 2:26 pm #9724
  Anonymous
  How can I get the chi angles or rotamer vector of a particular rotamer for a particular molten residue in an instantiation of the RotamerSets class? Is there a method or function with which I can get the chi angles or rotamer vector of a rotamer for any molten residue in the core::pack::rotamer_set::RotamerSets class? I want to get the rotamer vector or chi angles for a rotamer for a molten residue from a public member function in the core::pack::rotamer_set::RotamerSets class. I want to get the rotamer vector or chi angles for a rotamer for a molten residue in the RotamerSets class so that I can tabulate the energy for each of a molten residue’s rotamers defined by its set of chi angles or defined by the rotamer vector.
  
  core::conformation::ResidueCOP core::pack::rotamer_set::RotamerSets::rotamer (uint rotid)
  core::conformation::ResidueCOP core::pack::rotamer_set::RotamerSets::rotamer_for_moltenres (uint moltenres_id, uint rotamerid)
  
  In the core::pack::rotamer_set::RotamerSets class, the public member functions rotamer (uint rotid) and rotamer_for_moltenres (uint moltenres_id, uint rotamerid) give the amino acid type and residue number followed by the pdb coordinates of the atoms, like follows (but they don’t give the rotamer vector or chi angles. Is it possible to calculate the rotamer vector or chi angles from the atom coordinates with a PyRosetta function):
  
  Rotamer 11 of residue 1 is: ASP 3:
  N : 27.727 28.474 33.178
  CA : 26.925 27.439 32.581
  C : 27.786 26.384 31.926
  O : 28.925 26.623 31.562
  CB : 25.9648 28.0374 31.55
  CG : 24.9239 28.9573 32.1737
  OD1: 25.031 29.2359 33.3445
  OD2: 24.0317 29.3726 31.4734
  H : 28.2632 29.087 32.5807
  HA : 26.333 26.9649 33.3643
  1HB : 26.5316 28.603 30.8101
  2HB : 25.4493 27.2338 31.0233
  
  The rotamer sets are created with the command:
  rotsets = RotamerSets()
  (see in code below).
  
  I modified Jason Pacheco’s code to allow the packer task to do design:
  
  pose = pose_from_pdb(in_file)
  nRes = pose.n_residue()
  scorefxn = get_fa_scorefxn() # create_score_function()
  
  task_pack = TaskFactory.create_packer_task(pose)
  parse_resfile(pose, task_pack, resfile)
  print ‘Created packer task from resfile’
  
  print ‘Task pack is ‘,task_pack
  
  pack_scorefxn_pose_handshake( pose, scorefxn )
  pose.update_residue_neighbors()
  scorefxn.setup_for_packing( pose, task_pack.designing_residues(), task_pack.designing_residues() )
  packer_neighbor_graph = create_packer_graph( pose, scorefxn, task_pack )
  rotsets = RotamerSets()
  rotsets.set_task( task_pack )
  rotsets.build_rotamers( pose, scorefxn, packer_neighbor_graph )
  rotsets.prepare_sets_for_packing( pose, scorefxn )
  ig = InteractionGraphFactory.create_interaction_graph( task_pack, rotsets, pose, scorefxn )
  print “built”, rotsets.nrotamers(), “rotamers at”, rotsets.nmoltenres(), “positions.”
  
  rotsets.compute_energies( pose, scorefxn, packer_neighbor_graph, ig )
  
  return pose, task_pack, scorefxn, rotsets, ig
  
  The output spits out the packer task:
  
  Task pack is #Packer_Task
  
  resid pack? design? allowed_aas
  1 FALSE FALSE
  2 FALSE FALSE
  3 TRUE TRUE ALA,CYS,ASP,GLU,PHE,GLY,HIS,HIS_D,ILE,LYS,LEU,MET,ASN,PRO,GLN,ARG,SER,THR,VAL,TRP,TYR
  4 FALSE FALSE
  5 TRUE TRUE ALA,CYS,ASP,GLU,PHE,GLY,HIS,HIS_D,ILE,LYS,LEU,MET,ASN,PRO,GLN,ARG,SER,THR,VAL,TRP,TYR
  6 FALSE FALSE
  7 TRUE TRUE ALA,CYS,ASP,GLU,PHE,GLY,HIS,HIS_D,ILE,LYS,LEU,MET,ASN,PRO,GLN,ARG,SER,THR,VAL,TRP,TYR
  8 FALSE FALSE
  9 FALSE FALSE
  10 FALSE FALSE
  11 FALSE FALSE
  12 FALSE FALSE
  13 FALSE FALSE
  14 FALSE FALSE
  15 FALSE FALSE
  16 FALSE FALSE
  17 FALSE FALSE
  18 FALSE FALSE
  19 FALSE FALSE
  20 TRUE TRUE ALA,CYS,ASP,GLU,PHE,GLY,HIS,HIS_D,ILE,LYS,LEU,MET,ASN,PRO,GLN,ARG,SER,THR,VAL,TRP,TYR
  21 FALSE FALSE
  22 FALSE FALSE
  23 FALSE FALSE
  24 FALSE FALSE
  25 FALSE FALSE
  26 TRUE TRUE ALA,CYS,ASP,GLU,PHE,GLY,HIS,HIS_D,ILE,LYS,LEU,MET,ASN,PRO,GLN,ARG,SER,THR,VAL,TRP,TYR
  27 FALSE FALSE
  28 FALSE FALSE
  29 FALSE FALSE
  30 TRUE TRUE ALA,CYS,ASP,GLU,PHE,GLY,HIS,HIS_D,ILE,LYS,LEU,MET,ASN,PRO,GLN,ARG,SER,THR,VAL,TRP,TYR
  31 FALSE FALSE
  32 FALSE FALSE
  33 FALSE FALSE
  34 TRUE TRUE ALA,CYS,ASP,GLU,PHE,GLY,HIS,HIS_D,ILE,LYS,LEU,MET,ASN,PRO,GLN,ARG,SER,THR,VAL,TRP,TYR
  35 FALSE FALSE
  36 FALSE FALSE
  37 FALSE FALSE
  38 FALSE FALSE
  39 TRUE TRUE ALA,CYS,ASP,GLU,PHE,GLY,HIS,HIS_D,ILE,LYS,LEU,MET,ASN,PRO,GLN,ARG,SER,THR,VAL,TRP,TYR
  40 FALSE FALSE
  41 FALSE FALSE
  42 FALSE FALSE
  43 TRUE TRUE ALA,CYS,ASP,GLU,PHE,GLY,HIS,HIS_D,ILE,LYS,LEU,MET,ASN,PRO,GLN,ARG,SER,THR,VAL,TRP,TYR
  44 FALSE FALSE
  45 FALSE FALSE
  46 FALSE FALSE
  47 FALSE FALSE
  48 FALSE FALSE
  49 FALSE FALSE
  50 FALSE FALSE
  51 FALSE FALSE
  52 TRUE TRUE ALA,CYS,ASP,GLU,PHE,GLY,HIS,HIS_D,ILE,LYS,LEU,MET,ASN,PRO,GLN,ARG,SER,THR,VAL,TRP,TYR
  53 FALSE FALSE
  54 TRUE TRUE ALA,CYS,ASP,GLU,PHE,GLY,HIS,HIS_D,ILE,LYS,LEU,MET,ASN,PRO,GLN,ARG,SER,THR,VAL,TRP,TYR
  55 FALSE FALSE
  56 FALSE FALSE
- January 29, 2014 at 9:16 am #9729
  Anonymous
  Thank you so much. Yes I can use the chi() method and it works on the rotamer object returned by the rotamer_for_moltenres (uint moltenres_id, uint rotamerid) RotamerSets method. The chi() method is giving angles back to me, however, sometimes they are empty. If the chi angles is empty, is it [0 0 0 0]?
- January 30, 2014 at 5:32 am #9735
  Anonymous
  Yes this amino acid was ALA. So are GLY and ALA always only one single rotamer? Is that why they don’t have any chi rotamers?
  
  Rotamer 1 of residue 1 is: ALA 3:
  N : 27.727 28.474 33.178
  CA : 26.925 27.439 32.581
  C : 27.786 26.384 31.926
  O : 28.925 26.623 31.562
  CB : 25.9599 28.0395 31.5693
  H : 28.2632 29.087 32.5807
  HA : 26.333 26.9649 33.3643
  1HB : 25.3554 27.2469 31.1281
  2HB : 25.3088 28.7569 32.0689
  3HB : 26.5224 28.5449 30.786
  
  Chi angles are []
- January 28, 2014 at 4:16 pm #9728
  Anonymous
  If you have a RotamerSets instance, I’d first recommend obtaining the appropriate RotamerSet instance from it for whichever particular residue you want. While the RotamerSets (plural) class has some convenience functions, internally they just obtain the RotamerSet (singular) instance for the residue of interest, and then call the appropriate function on that RotamerSet. Or if they don’t take a residue/moltenres id, they’ll simply report numbers across *all* residue positions. You’ll get more flexibility if you obtain the RotamerSet instance yourself, and then you can call any number of functions on that RotamerSet
  
  For any particular rotamer in the RotamerSet you can get the Residue object which represents that rotamer with the rotamer(i) member function of the rotamer set. The returned Rotamer object is the same class as the Rotamer object one gets from the Pose, so you can use the same methods on it, for example the nchi() and chi() methods. (BTW, the rotamer (uint rotid) and rotamer_for_moltenres (uint moltenres_id, uint rotamerid) RotamerSets methods are giving you the same Residue objects.)
- January 29, 2014 at 8:17 pm #9732
  Anonymous
  What’s the amino acid identity for those residues which give you empty chi() values? (Print out the results of the name() method on your Residue object.) I’m guessing they’re going to be something like Gly or Ala, which don’t have any chi rotamers. Those should return an empty list/vector1 for their chis, not [0 0 0 0] (which would imply four chi angles, each set to zero.)
- January 30, 2014 at 4:04 pm #9736
  Anonymous
  Well, glycine doesn’t have a sidechain, so there are no sidechain rotamers. While alanine has a sidechain, it only has a single heavy atom in the sidechain, so there are no heavy atom chis. Also, even though there’s technically rotational freedom in the methyl group, Rosetta typically does not model that rotational freedom, as the methyl group is rather symmetric and for most purposes that rotation doesn’t matter.
  
  So the upshot is that for glycine and alanine there isn’t any conformational flexibility in the sidechain which Rosetta models, so it doesn’t have any chis, and only has a single “rotamer”.
Author

Posts

Viewing 4 reply threads

You must be logged in to reply to this topic.