This is an old revision of the document!
3D Clustering - SORT3D DEPTH: Sorting heterogeneous 3D dataset by checking the reproducible members of two independent runs of K-means clustering with minimum group size constraint. Sorting requires the 3D reconstruction parameters have been determined already.
Usage1 in command line
sxrsort3d_depth.py --refinement_dir=refinemen_out_dir --output_dir=master_dir --niter_for_sorting=num_of_sorting_iterations --mask3D=mask3d_file --focus=focus3d_file --radius=outer_radius --sym=symmetry --img_per_group=img_per_grp --minimum_grp_size=minimum_grp_size --depth_order=depth_of_order --noctf=no_ctf --instack=input_stack_file --memory_per_node=memeory_per_node --orientation_groups=number_of_orientation_groups --not_include_unaccounted=not_include_unaccounted --stop_mgskmeans_percentage=MGSKmeans_stop_ratio --swap_ratio=accounted_vs_unaccounted_swap_ratio --notapplybckgnoise=do_not_use_background_noise --do_swap_au=turn_on_swap_accounted_vs_unaccounted
Usage2 in command line
sxrsort3d_depth.py --instack=input_stack_file --output_dir=master_dir --mask3D=mask3d_file --focus=focus3d_file --radius=outer_radius --sym=symmetry --number_of_images_per_group=num_of_images_per_group --minimum_grp_size=minimum_grp_size --depth_order=depth_of_order --nxinit=initial_image_size --noctf=no_ctf --memory_per_node=memeory_per_node --orientation_groups=number_of_orientation_groups --not_include_unaccounted=not_include_unaccounted --stop_mgskmeans_percentage=MGSKmeans_stop_ratio --swap_ratio=accounted_vs_unaccounted_swap_ratio --notapplybckgnoise=do_not_use_background_noise --do_swap_au=turn_on_swap_accounted_vs_unaccounted
sxrsort3d.py exists only in MPI version.
Initiate sorting from a SPHIRE/SPARX refinement: In this mode, one can select arbitrary iteration of a 3D refinement directory. Typically, it is the master directory of a sxmeridien refinement via —niter_for_sorting option.
mpirun -np 176 sxrsort3d.py --refinement_method=SPARX --refinement_dir=meridien_outdir --niter_for_sorting=30 --radius=120 --sym=c5 --number_of_images_per_group=6000 --smallest_group=1500 --nindependent=5 --interpolation=trl --low_pass_filter=0.25
Initiate sorting from a data stack: Currently, this mode is not supported by SPHIRE GUI.
mpirun -np 176 sxrsort3d.py --instack=bdb:data --mask3D=mask3d.hdf --focus=focus3d.hdf -radius=29 --sym=c1 --nxinit=64 --number_of_images_per_group=2000 --nindependent=3 --low_pass_filter=0.25 --interpolation=4nn --comparison_method=cross --Kmeans_lpf=adhoc
Initiate sorting from a relion refinement: For this mode, please provide relion refinement directory. The program will pick up the results of the last iteration and start sorting. Currently, this mode is not supported by SPHIRE GUI.
mpirun -np 160 sxrsort3d.py --refinement_method=relion --refinement_dir=relion_outdir --radius=120 --sym=c5 --nindependent=3 --number_of_images_per_group=6000
NOTE - How to continue sxmeridien refinement using sorting results: Please use –ctrefromsort3d option of sxmeridien, then specify the directory where you wish to continue the refinement to –oldrefdir option and a subset of data to —-subset option. The command will load the refinement information from the directory and continue refinement. Optinally, you can specify the iteration number for continuing refinement using -—ctrefromiter option, which is not necessarily be the same iteration where you used for the 3D sorting. Also, one can modify refinement parameters of the selected iteration through the other options.
mpirun -np 88 sxmeridien.py --ctrefromsort3d --oldrefdir=meridien_outdir --ctrefromiter=20 --subset=Clusters3.txt ''' <<BR>><<BR>>
Please use –masterdir option to specify the output directory. The results will be written here. This directory will be created automatically if it does not exist Here, you can find a log.txt that describes the sequences of computations in the program.
sxrsort3d finds out stable members by carrying out two-way comparison of two independent sxsort3d runs.
For small tested datasets (real and simulated ribosome data around 10K particles), it gives 70%-90% reproducibility. However, this rate also depends on the choice of number of images per group and number of particles in the smallest group.
K-means, equal K-means, reproducibility, two-way comparison.
Not published yet.
Zhong Huang
Category 1:: APPLICATIONS
sxrsort3d.py
beta:: Under development. It has been tested, The test cases/examples are available upon request. Please let us know if there are any bugs.
None so far.