Zusammenfassung
Endoscopic surgery leads to large amounts of recordings that have to either be stored completely or postprocessed to extract relevant frames. These recordings regularly contain long out-of-body scenes. This paper proposes to apply anomaly detection methods to detect these irrelevant scenes. A conditional generative adversarial networks (GAN) architecture is used to predict future video frames and classify these predictions with an anomaly score. To avoid the successful prediction of anomalous frames due to the good generalization capability of convolutional neural networks (CNNs) we enhance the optimization process with a negative training phase. The experimental results demonstrate promising results for out-of-body sequence detection with the proposed approach. The enhanced GAN training framework can improve the results of the prediction framework by a large margin. The negative training phase reduces the number of false negative (FN) predictions and is shown to counteract a common problem in anomaly detection methods based on convolutional neural networks (CNNs). The good performance in standard metrics also shows the suitability for clinical use.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Literatur
Stanek SR, Tavanapong W, Wong J, et al. Automatic real-time detection of endoscopic procedures using temporal features. Computer methods and programs in biomedicine. 2012;108(2):524–535.
Münzer B, Schoeffmann K, Böszörmenyi L. Relevance segmentation of laparoscopic videos. In: Proc ISM. IEEE; 2013. p. 84–91.
Chandola V, Banerjee A, Kumar V. Anomaly detection: A survey. ACMcomputing surveys (CSUR). 2009;41(3):15.
Kiran B, Thomas D, Parakkal R. An overview of deep learning based methods for unsupervised and semi-supervised anomaly detection in videos. Journal of Imaging. 2018;4(2):36.
Goodfellow I, Pouget-Abadie J, Mirza M, et al. Generative Adversarial Nets. In: Proc NIPS 27. Curran Associates, Inc.; 2014. p. 2672–2680.
Liu W, Luo W, Lian D, et al. Future frame prediction for anomaly detection–a new baseline. In: Proc CVPR; 2018. p. 6536–6545.
Munawar A, Vinayavekhin P, De Magistris G. Limiting the reconstruction capability of generative neural network using negative learning. In: Proc MLSP. IEEE; 2017. p. 1–6.
Isola P, Zhu JY, Zhou T, et al. Image-to-image translation with conditional adversarial networks. In: Proc CVPR; 2017. p. 1125–1134.
Ronneberger O, Fischer P, Brox T. U-net: Convolutional networks for biomedical image segmentation. In: Proc MICCAI. Springer; 2015. p. 234–241.
Mathieu M, Couprie C, LeCun Y. Deep multi-scale video prediction beyond mean square error. arXiv preprint arXiv:151105440. 2015;.
Dosovitskiy A, Fischer P, Ilg E, et al. Flownet: Learning optical flow with convolutional networks. In: Proc ICCV; 2015. p. 2758–2766.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2020 Springer Fachmedien Wiesbaden GmbH, ein Teil von Springer Nature
About this paper
Cite this paper
Reiter, W. (2020). Video Anomaly Detection in Post-Procedural Use of Laparoscopic Videos. In: Tolxdorff, T., Deserno, T., Handels, H., Maier, A., Maier-Hein, K., Palm, C. (eds) Bildverarbeitung für die Medizin 2020. Informatik aktuell. Springer Vieweg, Wiesbaden. https://doi.org/10.1007/978-3-658-29267-6_22
Download citation
DOI: https://doi.org/10.1007/978-3-658-29267-6_22
Published:
Publisher Name: Springer Vieweg, Wiesbaden
Print ISBN: 978-3-658-29266-9
Online ISBN: 978-3-658-29267-6
eBook Packages: Computer Science and Engineering (German Language)