This shows you the differences between two versions of the page.
Both sides previous revision Previous revision Next revision | Previous revision | ||
pipeline:sort3d:sxrsort3d [2018/02/23 16:55] moriya |
pipeline:sort3d:sxrsort3d [2018/02/23 18:12] moriya |
||
---|---|---|---|
Line 1: | Line 1: | ||
~~NOTOC~~ | ~~NOTOC~~ | ||
- | ===== sxsort3d | + | ===== sxrsort3d |
- | 3D Clustering - SORT3D: Sort 3D heterogeneity | + | 3D Clustering - RSORT3D: Sort out 3D heterogeneity of 2D data whose 3D reconstruction |
\\ | \\ | ||
Line 9: | Line 9: | ||
Usage in command line | Usage in command line | ||
- | | + | |
\\ | \\ | ||
===== Typical usage ===== | ===== Typical usage ===== | ||
- | sxsort3d | + | sxrsort3d.py |
- | mpirun -np 192 sxsort3d.py bdb: | + | mpirun -np 192 sxrsort3d.py bdb: |
\\ | \\ | ||
Line 23: | Line 23: | ||
; stack : Input images stack: (default required string) | ; stack : Input images stack: (default required string) | ||
; outdir : Output directory: There is a log.txt that describes the sequences of computations in the program. (default required string) | ; outdir : Output directory: There is a log.txt that describes the sequences of computations in the program. (default required string) | ||
- | ; mask : 3D mask: File path of the global 3D mask for clustering. | + | ; mask : 3D mask: (default none) |
- | ; focus : Binary | + | ; %%--%%previous_run1 : Directory of 1st sort3d run: (default required string) |
- | ; radius : Particle | + | ; %%--%%previous_run2 : Directory of 2nd sort3d run: (default required string) |
- | ; delta : Angular step for projections | + | ; %%--%%focus : Focus 3D mask: Mask used for focused clustering (default none) |
- | ; CTF : Use CTF: Do a full CTF correction during the alignment. (default False) | + | ; %%--%%radius : Outer radius |
- | ; sym : Point-group symmetry: | + | ; %%--%%delta : Angular step for projections: |
- | ; number_of_images_per_group : Images per group: Critical | + | ; %%--%%CTF : Use CTF: Do a full CTF correction during the alignment. (default False) |
- | ; nxinit : Initial image size for sorting: | + | ; %%--%%sym : Point-group symmetry: (default c1) |
- | ; smallest_group : Smallest group size: Minimum members for identified group. (default 500) | + | ; %%--%%number_of_images_per_group : Images per group: Critical |
- | ; chunk0 | + | ; %%--%%nxinit : Initial image size for sorting: (default 64) |
- | ; chunk1 : Chunk file name for 2nd halfset: Name of chunk file containing particle IDs of 2nd halfset (chunk1) | + | ; %%--%%smallest_group : Smallest group size: Minimum members for identified group. (default 500) |
+ | ; %%--%%chunkdir | ||
\\ | \\ | ||
=== Advanced Parameters === | === Advanced Parameters === | ||
- | ; ir : Inner radius for rotational correlation [Pixels]: Must be bigger than 1. (default 1) | + | ; %%--%%ir : Inner radius for rotational correlation [Pixels]: Must be bigger than 1. (default 1) |
- | ; maxit : Maximum iterations: | + | ; %%--%%maxit : Maximum iterations: (default |
- | ; rs : Step between rings in rotational correlation: | + | ; %%--%%rs : Step between rings in rotational correlation: |
- | ; xr : X search range [Pixels]: The translational search range in the x direction will take place in -xr to +xr range. (default ' | + | ; %%--%%xr : X search range [Pixels]: The translational search range in the x direction will take place in a +/xr range. (default ' |
- | ; yr : Y search range [Pixels]: The translational search range in the y direction | + | ; %%--%%yr : Y search range [Pixels]: The translational search range in the y direction. If omitted it will be set as xr. (default ' |
- | ; ts : Translational search step [Pixels]: The search will be performed in -xr, -xr+ts, 0, xr-ts, xr, can be fractional. (default ' | + | ; %%--%%ts : Translational search step [Pixels]: The search will be performed in -xr, -xr+ts, 0, xr-ts, xr, can be fractional. (default ' |
- | ; an : Local angular search width [Degrees]: This defines the neighbourhood where the local angular search will be performed. (default ' | + | ; %%--%%an : Local angular search width [Degrees]: This defines the neighbourhood where the local angular search will be performed. (default ' |
- | ; center : Centering method: 0 - if you do not want the volume to be centered, 1 - center the volume using the center of gravity. (default 0) | + | ; %%--%%center : Centering method: 0 - if you do not want the volume to be centered, 1 - center the volume using the center of gravity. (default 0) |
- | ; nassign : Number of reassignment iterations: Performed for each angular step. (default 1) | + | ; %%--%%nassign : Number of reassignment iterations: Performed for each angular step. (default 1) |
- | ; nrefine : Number of alignment iterations: Performed for each angular step. (default 0) | + | ; %%--%%nrefine : Number of alignment iterations: Performed for each angular step. (default 0) |
- | ; stoprnct : Assignment convergence threshold [%]: Used to asses convergence of the run. It is the minimum percentage of assignment change required to stop the run. (default 3.0) | + | ; %%--%%stoprnct : Assignment convergence threshold [%]: Used to asses convergence of the run. It is the minimum percentage of assignment change required to stop the run. (default 3.0) |
- | ; function : Reference preparation function: | + | ; %%--%%function : Reference preparation function: |
- | ; independent : Number of independent runs: Number of independent equal-Kmeans. (default 3) | + | ; %%--%%independent : Number of independent runs: Number of independent equal-Kmeans(default 3) |
- | ; low_pass_filter : Low-pass filter frequency [1/Pixels]: Low-pass filter used for the 3D sorting on the original image size. Specify with absolute frequency. (default -1.0) | + | ; %%--%%low_pass_filter : Low-pass filter frequency [1/Pixel]: Low-pass filter used for the 3D sorting on the original image size. (default -1.0) |
- | ; unaccounted : Reconstruct unaccounted images: | + | ; %%--%%unaccounted : Reconstruct unaccounted images: (default False) |
- | ; seed : Random seed: Seed used for the initial random assignment for EQ Kmeans. The program generates a random integer by default. (default -1) | + | ; %%--%%seed : Random seed: Seed used for the initial random assignment for EQ Kmeans. The program generates a random integer by default. (default -1) |
- | ; sausage : Use sausage filter: | + | ; %%--%%group_size_for_unaccounted : Unaccounted particles group size: (default 500) |
- | ; PWadjustment : Power spectrum reference: Text file containing a 1D reference power spectrum | + | ; %%--%%sausage : Use sausage filter: (default False) |
- | ; protein_shape : Protein Shape: It defines protein preferred orientation angles. " | + | ; %%--%%PWadjustment : Power spectrum reference: Text file containing a 1D reference power spectrum. (default none) |
- | ; upscale : Power spectrum adjustment strength: This parameters adjusts how strongly the power spectrum of the volume should be modified to match the reference. A value of 1 brings the volume' | + | ; %%--%%protein_shape : Protein Shape: It defines protein preferred orientation angles. " |
- | ; wn : Target image size [Pixels]: | + | ; %%--%%upscale : Power spectrum adjustment strength: This parameters adjusts how strongly the power spectrum of the volume should be modified to match the reference. A value of 1 brings the volume' |
- | ; interpolation : 3D interpolation method: Method interpolation in 3D. Options are tr1 or 4nn. (default | + | ; %%--%%wn : Target image size [Pixels]: If different than 0, then the images will be rescaled to fit this size. (default 0) |
+ | ; %%--%%interpolation : 3D interpolation method: Method interpolation in 3D. Options are tr1 or 4nn. (default 4nn) | ||
\\ | \\ | ||
Line 66: | Line 68: | ||
\\ | \\ | ||
===== Description ===== | ===== Description ===== | ||
- | The clustering algorithm in the program combines a couple of computational techniques, equal-Kmeans clustering, K-means clustering, and reproducibility | + | sxrsort3d.py finds out stable members by carrying out two-way comparison |
+ | |||
+ | For small tested datasets (real and simulated ribosome data around 10K particles), it gives 70%-90% reproducibility. However, this rate also depends on the choice of number of images per group and number of particles in the smallest | ||
+ | |||
+ | \\ | ||
+ | === Time and Memory === | ||
+ | On lonestar cluster of TACC, using 264 cpus, it takes about 2 hours and 23 minutes to accomplish 95953 128x128 images for one sxsort3d.py independent | ||
\\ | \\ | ||
==== Method ==== | ==== Method ==== | ||
+ | K-means, equal K-means, reproducibility, | ||
\\ | \\ | ||
==== Reference ==== | ==== Reference ==== | ||
- | Described by A.Einstein in his first paper on spectrum of radiation from a house heater kept at room temperature. Journal of Irreproducible Results, 12, 1905, 12-1127. | + | Not published yet. |
\\ | \\ | ||
Line 81: | Line 90: | ||
==== Author / Maintainer ==== | ==== Author / Maintainer ==== | ||
Zhong Huang | Zhong Huang | ||
- | |||
\\ | \\ | ||
Line 89: | Line 97: | ||
\\ | \\ | ||
==== Files ==== | ==== Files ==== | ||
- | sparx/bin/sxsort3d.py | + | sparx/bin/sxrsort3d.py |
\\ | \\ | ||
==== See also ==== | ==== See also ==== | ||
- | [[pipeline: | + | [[pipeline: |
\\ | \\ | ||
==== Maturity ==== | ==== Maturity ==== | ||
- | Alpha:: Under development. Two programs (P1,P2) have been tested on both simulated and experimental ribosome data. For experimental ribosome data, P2 has a reproducible ratio-70-90%. P2 can 100%separate two conformations from the simulated ribosome data that contains 5 conformations. | + | Alpha:: Under development. |
\\ | \\ |