Finds and counts stomata in microscopic images of leaves. This is code for developers who intend to build their own web service or train their own models. If you just want to evaluate images, you can use our existing service running at http://www.stomata.science/
Download the latest version of the code:
mkdir ~/epidermal
git clone https://github.com/SvenTwo/epidermal.git ~/epidermal/
cd ~/epidermal
The project is python2.7-based using caffe for the CNN processing. First install caffe http://caffe.berkeleyvision.org/installation.html
and remember to also install the python bindings (make pycaffe
if you compile yourself). For all basic python dependencies (you may want to enter a virtualenv to install all dependencies in usermode if desired first):
for req in $(cat requirements.txt); do pip install $req; done
pyIMQ for image quality measures is also required and can be found here: https://github.com/danielsnider/PyImageQualityRanking - if pyimq is not installed, the service should run but image quality measures will not be available.
If you prefer to not use the web service, a script can be used for batch processing locally. It can be used to input a set of images and output CSV file with stomata counts, and optionally create heatmaps of each processed image.
Download the pre-trained model weights from here [Download] and unzip the two files (sc_feb2019.caffemodel and sc_feb2019.prototxt).
The processing command allows tweaking various settings such as the input scale and detection threshold. The interface is:
python2.7 process_images.py [-h] --weights-filename WEIGHTS_FILENAME
--proto-filename PROTO_FILENAME
[--gpu-index GPU_INDEX]
[--cnn-top-layer-name CNN_TOP_LAYER_NAME]
[--scale SCALE]
[--heatmap-output-path HEATMAP_OUTPUT_PATH]
[--plot-contours] [--plot-no-peaks]
[--prob-threshold PROB_THRESHOLD]
[--prob-area-threshold PROB_AREA_THRESHOLD]
--csv-output-filename CSV_OUTPUT_FILENAME
[--output-fields {count,image_filename,imq_entropy,imq_hf_entropy,imq_hf_kurtosis,imq_hf_mean,imq_hf_power,imq_hf_skewness,imq_hf_std,imq_hf_threshfreq,margin,positions,scale} [{count,image_filename,imq_entropy,imq_hf_entropy,imq_hf_kurtosis,imq_hf_mean,imq_hf_power,imq_hf_skewness,imq_hf_std,imq_hf_threshfreq,margin,positions,scale} ...]]
[--verbose]
image-paths [image-paths ...]
From the downloaded model, pass the .caffemodel as --weights-filename
and the .prototxt as --proto-filename
. For example:
TODO
The project consists of three different services that communicate via a MongoDb and a locally mounted filesystem:
-
The web service is a python flask-based service that serves the web page, allows uploads and issues processing and training requests via the database.
-
The processing worker is a python-based worker service that listens for images to be processed on the database and processes them as needed. It uses a GPU if available. Multiple processing workers can be run in parallel to allow faster throughput, although that feature hasn't been tested extensively.
-
The training worker is a python-based worker service that listens for training requests issued by the admin interface of the web service. It has to run on a GPU. The training worker is not required to run the service. It would typically run on the same GPU as the processing worker.
All services share a configuration file from the home path of the current user (~/.epidermal). To create a file with default values, either run one of the services or just run from the command line:
python2.7 config.py
Following is the meaning of the fields. Remember to configure an admin password and security cookies before the first start:
Name | Default value | Meaning |
---|---|---|
debug_flask | False | Set to True to enable debug mode, which outputs useful error messages into the browser when something goes wrong. |
db_address | localhost | Hostname for the main image MongoDB connection. If everything is hosted on one machine, this can be a loopback address. |
db_port | 27017 | Port of main image MongoDB address. |
db_name | epidermal | MongoDB database name. |
admin_username | admin | Administrator username. |
admin_password | password | Password for admin access. Configure this before start |
data_path | ~/epidermal/data | Path where files generated by the service and workers are written. |
train_data_path | ~/epidermal/data/train | Intermediate training data path for when model is re-trained. |
caffe_path | ~/caffe/build | Path to find caffe executables for model training. |
caffe_train_baseweights | ~/epidermal/bvlc_reference_caffenet.caffemodel | Path to find initial weights for model training. |
caffe_train_options | --gpu 0 | Additional options passed to caffe training command line. |
model_path | ~/epidermal/data/models | Path to store trained model files. |
server_path | ~/epidermal/data/server | Path to store trained model files. |
server_image_path | ~/epidermal/data/server/images | Path to store trained model files. |
server_heatmap_path | ~/epidermal/data/server/heatmaps | Path to store trained model files. |
image_extensions | .jpg,.jpeg,.png | Comma-separated list of supported image file extensions. |
archive_extensions | .zip | Comma-separated list of supported archive file extensions. |
max_image_file_size | 52428800 | Maximum file size for uploaded images (in bytes) |
worker_gpu_index | 0 | Index of GPU to use by apply worker, unless specified on command line. Set to -1 for CPU processing. |
src_path | ./ | Path to store trained model files. |
APP_SECRET_KEY | Cookie key for user management. Configure this before start | |
APP_SECURITY_REGISTERABLE | True | If users can register on the site. |
APP_SECURITY_CHANGEABLE | True | If users can change their password. |
APP_SECURITY_PASSWORD_SALT | Salt for user password storage. Configure this before start | |
APP_DEFAULT_MAIL_SENDER | [email protected] | E-mails sent from this address. |
APP_SECURITY_EMAIL_SENDER | [email protected] | E-mails sent from this address. |
APP_SECURITY_REGISTERABLE | True | Whether users can create accounts. |
APP_SECURITY_CONFIRMABLE | False | Whether email confirmation is required on new accounts. |
APP_SECURITY_RECOVERABLE | True | Whether users can send their passwords to their registered address. |
APP_MAIL_SERVER | smtp.gmail.com | SMTP server to be used for sending mails to users. |
APP_MAIL_PORT | 465 | SMTP server port to be used for sending mails to users. |
APP_MAIL_USE_SSL | True | Use encryption for user e-mails. |
APP_MAIL_USERNAME | [email protected] | E-mails sent from this address. |
APP_MAIL_PASSWORD | SMTP mail password to be used for sending emails. | |
APP_MONGODB_DB | epidermal | Database name for user management MongoDB. |
APP_MONGODB_HOST | localhost | Database host for user management MongoDB. Usually same as db_address. |
APP_MONGODB_PORT | 27017 | Database port for user management MongoDB. Usually same as db_port. |
plot_path | ~/epidermal/data/plots | Path to store plots of evaluation scripts. |
cnn_path | ~/epidermal/data/cnn | Model files. |
maintenance_text | Set to non-empty to replace all pages with a maintenance message. | |
automatic_deletion_days | 36500 | Number of days after last access after which datasets that have no tags are deleted automatically. |
The web service is located in webapp.py, which can simply be run to serve directly from flask. However, it is recommended to use a more powerful standalone web server to allow multiple processes. The app is available in wsgi.py. To launch e.g. via gunicorn (http://docs.gunicorn.org/en/stable/install.html):
gunicorn --timeout 1200 --workers 3 --bind 0.0.0.0:8000 wsgi
This would launch the service on port 8000. You can then open a browser on http://localhost:8000/ to test if things work. If you want to host publicly, I recommend to forward port 80 to 8000 instead of hosting directly on the low port, so that the app can be run without root privileges. Annotation and custom model training can be done via the admin interface on the web service.
When images are uploaded, the web service puts worker jobs into a MongoDB. The actual computation is handled by a separate process which can be launched as:
python2.7 apply_worker.py
Note that the apply worker needs to have a model saved in the database (either trained via annotations from the web interface or imported from another service).
The service can be run without extra training if only inference should be done. If you want to improve the model using annotated images, launch a train worker using:
python2.7 train_worker.py
As with inference, training jobs can be launched via the web interface.