This is an old revision of the document!
From the fairy tale Cinderella
Our binary classification tool (Cinderella) is based on a deep learning network to classify class averages, micrographs or subtomograms into good and bad categories. For class averages, it supports .hdf/.mrcs, for micrographs .mrc format and for subtomograms it expect that they are saved in a .hdf file. Cinderella was written to automate cryo-em data processing. It's open source and easy to use. We provide a pretrained general model for classifying class averages.(see tutorial). But you can easily train it with your own set of classes/micrographs/subtomograms.
Here are a couple of examples for good / bad classes in Cinderella:
Our model was trained on a set of 2D classes from ISAC. During the creation of the training dataset, I tried to ask myself “Which class would I select If I would not know the particle?” to decide which is a good class.
You can easily contribute your own classes!
Right now our model is trained on 19 datasets. But we will increase the number often!
Uploaded: 16. December 2019
Old versions of cinderella and the pretrained model can be found in the archive
Here is the repository of our training data:
Unfortunately, we cannot upload the complete training dataset, as some classes are from projects that are not yet published.
If you want to contribute with your own classes, please upload them here:
Ideally, please upload separate HDF/mrcs files for good and bad classes. You can do this separation with EMAN2's e2display. However, you can also upload the classes without separation and we will try to do it.
conda env remove --name cinderella
After that, create a new virtual environment:
conda create -n cinderella -c anaconda python=3.6 pyqt=5 cudnn=7.1.2 numpy==1.14.5
Activate the environment:
source activate cinderella
Install Cinderella for GPU:
pip install cinderella[gpu]
… or CPU:
pip install cinderella[cpu]