A three-dimensional signature search of a motif inside a larger cryo-EM density map can be done using the following SPIDER procedures. Multiple occurrences of the motif and their locations and orientations inside the searched volume can be found. A rotationally invariant mask or an asymmetric mask can be used for calculating locally normalized correlation function. The motif could be obtained from various sources including crystal structure (if known) or a single particle reconstruction (if available). Or, alternatively, if a copy of the searched molecule is recognizable inside the larger volume, it can be cut out and used as the motif. The searched volume and the motif should have the same scaling. Uses Alan Roseman's Fourier formulation to calculate locally normalized correlation function.
The SPIDER procedures for doing the signature search are designed to run on a Linux cluster using PubSub parallel processing scheduler. sigs_pub.spi is the main procedure which calls: sigs_settings.spi to set parameters and another procedure: sigsloop.spi which does the actual computation.
STEP BY STEP INSTRUCTION FOR RUNNING THE PROGRAM :
PHI = 0 - 350 at 10 degrees interval
THETA = 0 - 180 at 10 degrees interval
PSI = 0 - 350 at 10 degrees interval
These values need to be set in the procedure sigs_settings.spi. Usually, a coarse scanning already finds the signatures of the motif at correct x,y,z locations inside the large volume.
UNIX commands:
(Combines the output files. DOC_OUT_* are the ouput files generated by the signature search procedures. This is done in this way to handle more than a million keys (lines in the document file) which cann't be handled by SPIDER document files.)
(ii) sort -nr -k 7 DOC_CUT_COMBINED > DOC_SORTED
(If any of the cross-correlation coefficient heights is written in Scientific format in the merged output file, DOC_CUT_COMBINED then, in LINUX, instead of the above command use :)
(ii) sort -gr -k 7 DOC_CUT_COMBINED > DOC_SORTED
(Sorts the file according to the cross-correlation coefficient heights.)
PERL programs:
(Supresses the peaks inside an area equal to the size of the motif around the highest peak. The procedure continues for the next largest peak after the previous step is executed. The output file will now contain the unique peaks corresponding to the probable presence of the signature of the motif at various locations and in different orientations.)
(ii) make_docfile.perl < DOC_UNIQUE > doc_unique.dat
(Converts the output file from uniq.perl program to SPIDER document file by adding keys and number of registers etc. info.)
REFERENCES: