Will the Machine Like Your Image? Automatic Assessment of Beauty in Images with Machine Learning Techniques
Abstract
:1. Introduction
2. How Can We Measure Beauty?
2.1. Philosophical Approaches
2.2. Neuro-Aesthetic Approaches
- The visual areas, the occipital and inferior lateral zones, the insular cortex, and the superior parietal lobule are active for the task of vision and also for the extraction of shapes, colors, movements, and faces.
- The orbitofrontal cortex acts during the evaluation of risks, and also when we feel pleasure. It is clear that humans feel pleasure when they look at beautiful objects [42]. It seems also an important part in the control of our decisions.
- The insular cortex, which controls our emotions, is also an important and always involved part when observing artworks, images, and photographs.
- The areas engaged in cognition (the amygdala) and memory operations (medial parietal areas, prefrontal lobe) are often active in the task of aesthetic assessment.
- The areas in charge of the premotor control (ventral premotor cortex, temporal lobes, hippocampus) are active, specifically in situations of strong empathy and embodiment, which often occur when observing an artwork.
2.3. Experimental Psychology, Psycho-Sociology, and Photography
3. Machine Learning Approaches
3.1. Classical Machine Learning Approaches
3.2. Deep Learning Approaches
- Several solutions have been proposed to allow treating very large images while preserving the fine structure of details: window preselection around points of interest [10,95], parallel processing of randomly-drawn windows [96], use of hierarchical structures [7,97], etc. Despite these solutions, the size of the operational DNN input layers is a limit for the works on aesthetics that handle large images.
3.3. Datasets
- The Photo.net dataset and the DP Challenge dataset [99,100]. This can be considered the earliest attempt to construct large-scale image databases for aesthetic assessment. The Photo.net dataset contains 20,278 images, with a minimum of ten score ratings for every image. The ratings range is from zero to seven, with seven assigned to the most aesthetically-pleasing photos. Typically, images uploaded to Photo.net are evaluated as somewhat pleasing [99]. The DPChallenge dataset is more challenging and provides several ratings. The DPChallenge dataset is composed of 16,509 images and was extended by the Aesthetic Visual Analysis (AVA) dataset, in which several images derived from DPChallenge.com are also included.
- The Chinese University of Hong Kong-PhotoQuality (CUHK-PQ) dataset [81,101]. It is composed of 17,690 images also collected from DPChallenge.com and many photographers. All the images come provided with binary aesthetic labels and are grouped into seven categories: architecture, landscape, humans, animals, plants, static, and night. Usually, the training and test set are selected as random partitions of a 50/50 split, or a ten-fold cross-validation, where the ratio of the positive examples and the negative examples is around 1:3. Many sample images taken from the dataset are available in Figure 4.
- The AVA dataset contains ∼250,000 photos [14]. These were obtained from DPChallenge.com and labeled with scores. Every image received hundreds of votes, in the range one to ten. The average score of an image is commonly taken to be the ground truth. The dataset contains many challenging examples. For the task of binary aesthetic classification, images with an average score higher than a threshold of are treated as positive examples, and images with a score lower than are treated as negative examples. Further, the AVA dataset contains 14 style attributes and 60 category attributes. There are two typical training and test splits used with this dataset: (1) a large-scale standardized partition with ∼230,000 training images and ∼20,000 test images, with a hard threshold of , and (2) an easier partition modeling the one of CUHK-PQ, taking those images whose score ranking is at the top and bottom . This results in ∼25000 images for training and ∼25,000 images for testing. The ratio of the total number of positive examples to that of the negative examples is around 12:5.
3.4. Evaluation Metrics and Comparison of the Methods
4. Analysis of the Works
4.1. Non-Exploited Features
4.2. The Binary Criterion: Ugly vs. Beautiful
4.3. A Continuous Ranking for Evaluation
4.4. Which Beauty? Which Expert?
5. Conclusions
Funding
Conflicts of Interest
References
- Burningham, N.; Pizlo, Z.; Allebach, J.P. Image Quality Metrics. In Encyclopedia of Imaging Science and Technology; American Cancer Society. 2002. Available online: https://onlinelibrary.wiley.com/doi/pdf/10.1002/0471443395.img038 (accessed on 27 June 2019).
- Datta, R.; Li, J.; Wang, J.Z. Learning the Consensus on Visual Quality for Next-generation Image Management. In Proceedings of the 15th MM ’07 ACM International Conference on Multimedia, Augsburg, Germany, 25–29 September 2007; ACM: New York, NY, USA, 2007; pp. 533–536. [Google Scholar] [CrossRef]
- Ke, Y.; Tang, X.; Jing, F. The Design of High-Level Features for Photo Quality Assessment. In Proceedings of the 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR’06), New York, NY, USA, 17–22 June 2006; Volume 1, pp. 419–426. [Google Scholar] [CrossRef]
- Deng, Y.; Loy, C.C.; Tang, X. Image Aesthetic Assessment: An experimental survey. IEEE Signal Process. Mag. 2017, 34, 80–106. [Google Scholar] [CrossRef]
- Lu, X.; Lin, Z.; Jin, H.; Yang, J.; Wang, J.Z. RAPID: Rating Pictorial Aesthetics Using Deep Learning. In Proceedings of the 22nd MM ’14 ACM International Conference on Multimedia; ACM: New York, NY, USA, 2014; pp. 457–466. [Google Scholar] [CrossRef]
- Lu, X.; Lin, Z.; Shen, X.; Mech, R.; Wang, J.Z. Deep Multi-patch Aggregation Network for Image Style, Aesthetics, and Quality Estimation. In Proceedings of the 2015 IEEE International Conference on Computer Vision (ICCV), Santiago, Chile, 7–13 December 2015; pp. 990–998. [Google Scholar] [CrossRef]
- Mai, L.; Jin, H.; Liu, F. Composition-Preserving Deep Photo Aesthetics Assessment. In Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Las Vegas, NV, USA, 26–30 June 2016; pp. 497–506. [Google Scholar] [CrossRef]
- Wang, Z.; Liu, D.; Chang, S.; Dolcos, F.; Beck, D.; Huang, T. Image aesthetics assessment using Deep Chatterjee’s machine. In Proceedings of the 2017 International Joint Conference on Neural Networks (IJCNN), Anchorage, AK, USA, 14–19 May 2017; pp. 941–948. [Google Scholar] [CrossRef]
- Kao, Y.; He, R.; Huang, K. Deep Aesthetic Quality Assessment With Semantic Information. IEEE Trans. Image Process. 2017, 26, 1482–1495. [Google Scholar] [CrossRef] [PubMed]
- Ma, S.; Liu, J.; Chen, C.W. A-Lamp: Adaptive Layout-Aware Multi-patch Deep Convolutional Neural Network for Photo Aesthetic Assessment. In Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Honolulu, HI, USA, 21–26 July 2017; pp. 722–731. [Google Scholar] [CrossRef]
- Talebi, H.; Milanfar, P. NIMA: Neural Image Assessment. IEEE Trans. Image Process. 2018, 27, 3998–4011. [Google Scholar] [CrossRef] [PubMed] [Green Version]
- Datta, R.; Wang, J.Z. ACQUINE: Aesthetic Quality Inference Engine—Real-time Automatic Rating of Photo Aesthetics. In Proceedings of the International Conference on MIR ’10 Multimedia Information Retrieval; ACM: New York, NY, USA, 2010; pp. 421–424. [Google Scholar] [CrossRef]
- Marchesotti, L.; Perronnin, F.; Larlus, D.; Csurka, G. Assessing the aesthetic quality of photographs using generic image descriptors. In Proceedings of the 2011 International Conference on Computer Vision, Barcelona, Spain, 6–13 November 2011; pp. 1784–1791. [Google Scholar] [CrossRef]
- Murray, N.; Marchesotti, L.; Perronnin, F. AVA: A large-scale database for aesthetic visual analysis. In Proceedings of the 2012 IEEE Conference on Computer Vision and Pattern Recognition, Providence, RI, USA, 16–21 June 2012; pp. 2408–2415. [Google Scholar] [CrossRef]
- Schifanella, R.; Redi, M.; Aiello, L.M. An Image is Worth More than a Thousand Favorites: Surfacing the Hidden Beauty of Flickr Pictures. In Proceedings of the ICWSM’15: Proceedings of the 9th AAAI International Conference on Web and Social Media, Oxford, UK, 26–29 May 2015. [Google Scholar]
- Henry, C. Introduction à une Esthétique Scientifique; La Revue Contemporaine: Paris, France, 1885. [Google Scholar]
- Garabedian, C.A. Book Review: Aesthetic Measure. Bull. Am. Math. Soc. 1934, 40, 7–11. [Google Scholar] [CrossRef]
- Birkhoff, G.D. Aesthetic Measure; Harvard University Press: Cambridge, MA, USA, 1933. [Google Scholar]
- Moles, A.A. Théorie de l’information et perception esthétique. Revue Philosophique de la France et de l’Étranger 1957, 147, 233–242. [Google Scholar]
- Bense, M. Einführung in die informationtheoretische Ästhetik. In Ausgewählte Schriften: Band 3 Ästhetik und Texttheorie; J.B. Metzler: Stuttgart, Germany, 1998; pp. 251–417. [Google Scholar]
- Rigau, J.; Feixas, M.; Sbert, M. Informational Aesthetics Measures. IEEE Comput. Graph. Appl. 2008, 28, 24–34. [Google Scholar] [CrossRef] [Green Version]
- Cooper, J.M.; Hutchinson, D.S. Plato: Complete Works; Hackett Publishing: Indianapolis, IN, USA, 1997. [Google Scholar]
- Grimm, F.M.; Diderot, D. Correspondance Littéraire, Philosophique et Critique de Grimm et de Diderot, depuis 1753 jusqu’en 1790; Furne: Paris, France, 1831. [Google Scholar]
- Hegel, G.W.F. Vorlesungen über die Ästhetik; Duncker und Humblot: Berlin, Germany, 1842; Volume 1. [Google Scholar]
- Kant, I. Critique of Judgment; Hackett Publishing: Indianapolis, IN, USA, 1987. [Google Scholar]
- Gombrich, E.H. Art and Illusion: A Study in the Psychology of Pictorial Representations; Princeton University Press: Princeton, NJ, USA, 1960. [Google Scholar]
- Berlyne, D.E. Aesthetics and Psychobiology; Appleton-Century-Crofts: New York, NY, USA, 1971; Volume 336. [Google Scholar]
- Zemach, E.M. La Beauté Réelle: Une Défense du Réalisme Esthétique; Presses Universitaires de Rennes: Rennes, France, 2005. [Google Scholar]
- Pouivet, R. Le Réalisme Esthétique; Presses Universitaires de France: Paris, France, 2015. [Google Scholar]
- Tatarkiewicz, W. History of Aesthetics; Harrrel, J., Ed.; PwN-Polish Scientific Publishers: Mouton, Warsaw, Poland, 1970; Volume I. [Google Scholar]
- Danto, A. The artworld. J. Philos. 1964, 61, 571–584. [Google Scholar] [CrossRef]
- Arnheim, R. Art and Visual Perception: A Psychology of the Creative Eye; University of California Press: Oakland, CA, USA, 1965. [Google Scholar]
- Solso, R.L. Cognition and the Visual Arts; MIT Press: Cambridge, MA, USA, 1996. [Google Scholar]
- Lange, C.G.; James, W. The Emotions; Williams & Wilkins: Philadelphia, PA, USA, 1922; Volume 1. [Google Scholar]
- Sperber, D.; Wilson, D. Relevance theory. In Handbook of Pragmatics; Blackwell: Oxford, UK, 2004; pp. 607–632. [Google Scholar]
- Dessalles, J.L. La Pertinence et ses Origines Cognitives-Nouvelles Théories; Hermes-Science Publications: Paris, France, 2008. [Google Scholar]
- Zeki, S. Inner Vision: An Exploration of Art and the Brain; Oxford University Press: New York, NY, USA, 1999. [Google Scholar]
- Zeki, S. Artistic creativity and the brain. Science 2001, 293, 51–52. [Google Scholar] [CrossRef]
- Brown, S.; Gao, X.; Tisdelle, L.; Eickhoff, S.B.; Liotti, M. Naturalizing aesthetics: brain areas for aesthetic appraisal across sensory modalities. Neuroimage 2011, 58, 250–258. [Google Scholar] [CrossRef]
- Chatterjee, A.; Vartanian, O. Neuroscience of aesthetics. Ann. N. Y. Acad. Sci. 2016, 1369, 172–194. [Google Scholar] [CrossRef]
- Barrett, L.F.; Mesquita, B.; Ochsner, K.N.; Gross, J.J. The experience of emotion. Annu. Rev. Psychol. 2007, 58, 373–403. [Google Scholar] [CrossRef] [PubMed]
- Kühn, S.; Gallinat, J. The neural correlates of subjective pleasantness. NeuroImage 2012, 61, 289–294. [Google Scholar] [CrossRef] [PubMed]
- Cinzia, D.D.; Vittorio, G. Neuroaesthetics: A review. Curr. Opin. Neurobiol. 2009, 19, 682–687. [Google Scholar] [CrossRef] [PubMed]
- Ishizu, T.; Zeki, S. Toward A Brain-Based Theory of Beauty. PLoS ONE 2011, 6, e21852. [Google Scholar] [CrossRef] [PubMed]
- Vartanian, O.; Skov, M. Neural correlates of viewing paintings: Evidence from a quantitative meta-analysis of functional magnetic resonance imaging data. Brain Cogn. 2014, 87, 52–56. [Google Scholar] [CrossRef] [PubMed]
- Changeux, J.P. La beauté Dans le Cerveau; Odile Jacob: Paris, France, 2016. [Google Scholar]
- Changeux, J.P. Beauty in the brain: For a neuroscience of art. Rendiconti Lincei 2012, 23, 315–320. [Google Scholar] [CrossRef]
- Ehtemami, A.; Scott, R.; Bernadin, S. A Survey of FMRI Data Analysis Methods. In Proceedings of the SoutheastCon 2018, St. Petersburg, FL, USA, 19–22 April 2018; pp. 1–7. [Google Scholar]
- Vidal, F. Neuroaesthetics: Getting rid of art and beauty. BioSocieties 2012, 7, 209–213. [Google Scholar] [CrossRef]
- Brown, S.; Dissanayake, E. The arts are more than aesthetics. In Neuroaesthetics; Routledge: Abingdon-on-Thames, UK, 2018; pp. 43–57. [Google Scholar]
- Peterson, B. Learning to See Creatively: Design, Color, and Composition in Photography; Amphoto Books: London, UK, 2015. [Google Scholar]
- Bourdieu, P.; Castel, R.; Schnapper, D.; Boltanski, L.; Lagneau, G.; Chamboredon, J.C. Un Art Moyen: Essai sur les Usages Sociaux de la Photographie; Les éditions de Minuit: Paris, France, 1965. [Google Scholar]
- Mittal, A.; Soundararajan, R.; Bovik, A.C. Making a “Completely Blind” Image Quality Analyzer. IEEE Signal Process. Lett. 2013, 20, 209–212. [Google Scholar] [CrossRef]
- Chandler, D.M. Seven Challenges in Image Quality Assessment: Past, Present, and Future Research. ISRN Signal Process. 2013, 2013, 905685. [Google Scholar] [CrossRef]
- McManus, I.C.; Stöver, K.; Kim, D. Arnheim’s Gestalt Theory of Visual Balance: Examining the Compositional Structure of Art Photographs and Abstract Images. i-Perception 2011, 2, 615–647. [Google Scholar] [CrossRef]
- Amirshahi, S.A.; Hayn-Leichsenring, G.U.; Denzler, J.; Redies, C. Evaluating the Rule of Thirds in Photographs and Paintings. Art Percept. 2014, 2, 163–182. [Google Scholar] [CrossRef] [Green Version]
- Hübner, R.; Fillinger, M.G. Comparison of Objective Measures for Predicting Perceptual Balance and Visual Aesthetic Preference. Front. Psychol. 2016, 7. [Google Scholar] [CrossRef] [PubMed]
- Schweinhart, A.M.; Essock, E.A. Structural Content in Paintings: Artists Overregularize Oriented Content of Paintings Relative to the Typical Natural Scene Bias. Perception 2013, 42, 1311–1332. [Google Scholar] [CrossRef]
- Attewell, D.; Baddeley, R.J. The distribution of reflectances within the visual environment. Vis. Res. 2007, 47, 548–554. [Google Scholar] [CrossRef] [PubMed] [Green Version]
- Mehrabian, A. Individual differences in stimulus screening and arousability. J. Person. 1977, 45, 237–250. [Google Scholar] [CrossRef] [PubMed]
- Smith, D. Color-person-environment relationships. Color Res. Appl. 2008, 33, 312–319. [Google Scholar] [CrossRef] [Green Version]
- Tatler, B.W.; Wade, N.J.; Kwan, H.; Findlay, J.M.; Velichkovsky, B.M. Yarbus, Eye Movements, and Vision. i-Perception 2010, 1, 7–27. [Google Scholar] [CrossRef] [PubMed] [Green Version]
- Huston, J.P.; Nadal, M.; Mora, F.; Agnati, L.F.; Conde, C.J.C. (Eds.) Art, Aesthetics, and the Brain; Oxford University Press: Oxford, UK, 2015. [Google Scholar]
- LeCun, Y.; Bengio, Y.; Hinton, G. Deep learning. Nature 2015, 521, 436. [Google Scholar] [CrossRef] [PubMed]
- Goodfellow, I.; Bengio, Y.; Courville, A. Deep Learning; MIT Press: Cambridge, MA, USA, 2016. [Google Scholar]
- Bodini, M. A review of facial landmark extraction in 2d images and videos using deep learning. Big Data Cognit. Comput. 2019, 3, 14. [Google Scholar] [CrossRef]
- Boccignone, G.; Bodini, M.; Cuculo, V.; Grossi, G. Predictive Sampling of Facial Expression Dynamics Driven by a Latent Action Space. In Proceedings of the 2018 14th International Conference on Signal-Image Technology Internet-Based Systems (SITIS), Las Palmas de Gran Canaria, Spain, 26–29 November 2018; pp. 143–150. [Google Scholar] [CrossRef]
- Datta, R.; Joshi, D.; Li, J.; Wang, J.Z. Studying Aesthetics in Photographic Images Using a Computational Approach. In Computer Vision—ECCV 2006; Springer: Berlin, Germany; Heidelberg, Germany, 2006; pp. 288–301. [Google Scholar]
- Nishiyama, M.; Okabe, T.; Sato, I.; Sato, Y. Aesthetic quality classification of photographs based on color harmony. In Proceedings of the CVPR 2011, Colorado Springs, CO, USA, 20–25 June 2011. [Google Scholar]
- Simond, F.; Arvanitopoulos, N.; Susstrunk, S. Image aesthetics depends on context. In Proceedings of the 2015 IEEE International Conference on Image Processing (ICIP), Quebec City, QC, Canada, 27–30 September 2015. [Google Scholar]
- Kao, Y.; Wang, C.; Huang, K. Visual aesthetic quality assessment with a regression model. In Proceedings of the 2015 IEEE International Conference on Image Processing (ICIP), Quebec City, QC, Canada, 27–30 September 2015. [Google Scholar]
- Deng, J.; Dong, W.; Socher, R.; Li, L.J.; Li, K.; Fei-Fei, L. ImageNet: A large-scale hierarchical image database. In Proceedings of the 2009 IEEE Conference on Computer Vision and Pattern Recognition, Miami, FL, USA, 20–25 June 2009. [Google Scholar]
- Bhattacharya, S.; Sukthankar, R.; Shah, M. A framework for photo-quality assessment and enhancement based on visual aesthetics. In Proceedings of the International Conference on Multimedia—MM10, Firenze, Italy, 25–29 October 2010. [Google Scholar]
- Wu, O.; Hu, W.; Gao, J. Learning to predict the perceived visual quality of photos. In Proceedings of the 2011 International Conference on Computer Vision, Barcelona, Spain, 6–13 November 2011. [Google Scholar]
- Tong, H.; Li, M.; Zhang, H.J.; He, J.; Zhang, C. Classification of Digital Photos Taken by Photographers or Home Users. In Advances in Multimedia Information Processing—PCM 2004; Springer: Berlin, Germany; Heidelberg, Germany, 2004; pp. 198–205. [Google Scholar]
- Sun, X.; Yao, H.; Ji, R.; Liu, S. Photo assessment based on computational visual attention model. In Proceedings of the Seventeen ACM International Conference on Multimedia–MM’09, Beijing, China, 19–24 October 2009. [Google Scholar]
- You, J.; Perkis, A.; Hannuksela, M.M.; Gabbouj, M. Perceptual quality assessment based on visual attention analysis. In Proceedings of the Seventeen ACM International Conference on Multimedia–MM’09, Beijing, China, 19–24 October 2009. [Google Scholar]
- Luo, Y.; Tang, X. Photo and Video Quality Evaluation: Focusing on the Subject. In Lecture Notes in Computer Science; Springer: Berlin, Germany; Heidelberg, Germany, 2008; pp. 386–399. [Google Scholar]
- Lo, L.Y.; Chen, J.C. A statistic approach for photo quality assessment. In Proceedings of the 2012 International Conference on Information Security and Intelligent Control, Yunlin, Taiwan, 14–16 August 2012. [Google Scholar]
- Dhar, S.; Ordonez, V.; Berg, T.L. High level describable attributes for predicting aesthetics and interestingness. In Proceedings of the CVPR 2011, Colorado Springs, CO, USA, 20–25 June 2011. [Google Scholar]
- Tang, X.; Luo, W.; Wang, X. Content-Based Photo Quality Assessment. IEEE Trans. Multimed. 2013, 15, 1930–1943. [Google Scholar] [CrossRef] [Green Version]
- Yeh, M.C.; Cheng, Y.C. Relative features for photo quality assessment. In Proceedings of the 2012 19th IEEE International Conference on Image Processing, Orlando, FL, USA, 30 September–3 October 2012. [Google Scholar]
- Marchesotti, L.; Perronnin, F. Learning beautiful attributes. In Proceedings of the British Machine Vision Conference 2013, Bristol, UK, 9–13 September 2013. [Google Scholar]
- Dong, Z.; Shen, X.; Li, H.; Tian, X. Photo Quality Assessment with DCNN that Understands Image Well. In MultiMedia Modeling; Springer International Publishing: Cham, Switzerland, 2015; pp. 524–535. [Google Scholar]
- Lv, H.; Tian, X. Learning Relative Aesthetic Quality with a Pairwise Approach. In MultiMedia Modeling; Springer International Publishing: Cham, Switzerland, 2016; pp. 493–504. [Google Scholar]
- Bodini, M.; D’Amelio, A.; Grossi, G.; Lanzarotti, R.; Lin, J. Single Sample Face Recognition by Sparse Recovery of Deep-Learned LDA Features. In Advanced Concepts for Intelligent Vision Systems; Springer International Publishing: Cham, Switzerland, 2018; pp. 297–308. [Google Scholar]
- Kong, S.; Shen, X.; Lin, Z.; Mech, R.; Fowlkes, C. Photo Aesthetics Ranking Network with Attributes and Content Adaptation. In Computer Vision—ECCV 2016; Springer International Publishing: Cham, Switzerland, 2016; pp. 662–679. [Google Scholar] [Green Version]
- Peng, K.C.; Chen, T. Toward correlating and solving abstract tasks using convolutional neural networks. In Proceedings of the 2016 IEEE Winter Conference on Applications of Computer Vision (WACV), Lake Placid, NY, USA, 7–9 March 2016. [Google Scholar]
- Wang, W.; Zhao, M.; Wang, L.; Huang, J.; Cai, C.; Xu, X. A multi-scene deep learning model for image aesthetic evaluation. Signal Process. Image Commun. 2016, 47, 511–518. [Google Scholar] [CrossRef]
- Tian, X.; Dong, Z.; Yang, K.; Mei, T. Query-Dependent Aesthetic Model With Deep Learning for Photo Quality Assessment. IEEE Trans. Multimed. 2015, 17, 2035–2048. [Google Scholar] [CrossRef]
- Lu, X.; Lin, Z.; Jin, H.; Yang, J.; Wang, J.Z. Rating Image Aesthetics Using Deep Learning. IEEE Trans. Multimed. 2015, 17, 2021–2034. [Google Scholar] [CrossRef]
- Wang, Z.; Chang, S.; Dolcos, F.; Beck, D.; Liu, D.; Huang, T.S. Brain-inspired deep networks for image aesthetics assessment. arXiv Preprint 2016, arXiv:1601.04155. [Google Scholar]
- Zhang, L. Describing human aesthetic perception by deeply-learned attributes from flickr. arXiv Preprint 2016, arXiv:1605.07699. [Google Scholar]
- Kao, Y.; Huang, K.; Maybank, S. Hierarchical aesthetic quality assessment using deep convolutional neural networks. Signal Process. Image Commun. 2016, 47, 500–510. [Google Scholar] [CrossRef] [Green Version]
- Lu, P.; Peng, X.; Li, R.; Wang, X. Towards aesthetics of image: A Bayesian framework for color harmony modeling. Signal Process. Image Commun. 2015, 39, 487–498. [Google Scholar] [CrossRef]
- Lu, P.; Kuang, Z.; Peng, X.; Li, R. Discovering Harmony: A Hierarchical Colour Harmony Model for Aesthetics Assessment. In Computer Vision—ACCV 2014; Springer International Publishing: Cham, Switzerland, 2015; pp. 452–467. [Google Scholar]
- He, K.; Zhang, X.; Ren, S.; Sun, J. Spatial Pyramid Pooling in Deep Convolutional Networks for Visual Recognition. In Computer Vision – ECCV 2014; Springer International Publishing: Cham, Switzerland, 2014; pp. 346–361. [Google Scholar] [Green Version]
- Yao, L.; Suryanarayan, P.; Qiao, M.; Wang, J.Z.; Li, J. OSCAR: On-Site Composition and Aesthetics Feedback Through Exemplars for Photographers. Int. J. Comput. Vis. 2011, 96, 353–383. [Google Scholar] [CrossRef]
- Joshi, D.; Datta, R.; Fedorovskaya, E.; Luong, Q.T.; Wang, J.; Li, J.; Luo, J. Aesthetics and Emotions in Images. IEEE Signal Process. Mag. 2011, 28, 94–115. [Google Scholar] [CrossRef]
- Datta, R.; Li, J.; Wang, J.Z. Algorithmic inferencing of aesthetics and emotion in natural images: An exposition. In Proceedings of the 2008 15th IEEE International Conference on Image Processing, San Diego, CA, USA, 12–15 October 2008. [Google Scholar]
- Luo, W.; Wang, X.; Tang, X. Content-based photo quality assessment. In Proceedings of the 2011 International Conference on Computer Vision, Barcelona, Spain, 6–13 November 2011. [Google Scholar]
- Wong, L.K.; Low, K.L. Saliency-enhanced image aesthetics class prediction. In Proceedings of the 2009 16th IEEE International Conference on Image Processing (ICIP), Cairo, Egypt, 7–10 November 2009. [Google Scholar]
- Bhattacharya, S.; Sukthankar, R.; Shah, M. A holistic approach to aesthetic enhancement of photographs. ACM Trans. Multimed. Comput. Commun. Appl. 2011, 7S, 1–21. [Google Scholar] [CrossRef] [Green Version]
- Wu, Y.; Bauckhage, C.; Thurau, C. The Good, the Bad, and the Ugly: Predicting Aesthetic Image Labels. In Proceedings of the 2010 20th International Conference on Pattern Recognition, Istanbul, Turkey, 23–26 August 2010. [Google Scholar]
- Lienhard, A.; Ladret, P.; Caplier, A. Low Level Features for Quality Assessment of Facial Images. In Proceedings of the 10th International Conference on Computer Vision Theory and Applications, Berlin, Germany, 11–14 March 2015. [Google Scholar]
- Yin, W.; Mei, T.; Chen, C.W. Assessing photo quality with geo-context and crowdsourced photos. In Proceedings of the 2012 Visual Communications and Image Processing, San Diego, CA, USA, 27–30 November 2012. [Google Scholar]
- Li, C.; Gallagher, A.; Loui, A.C.; Chen, T. Aesthetic quality assessment of consumer photos with faces. In Proceedings of the 2010 IEEE International Conference on Image Processing, Hong Kong, China, 26–29 September 2010. [Google Scholar]
- Sun, R.; Lian, Z.; Tang, Y.; Xiao, J. Aesthetic visual quality evaluation of Chinese handwritings. In Proceedings of the Twenty-Fourth International Joint Conference on Artificial Intelligence, Buenos Aires, Argentina, 25–31 July 2015. [Google Scholar]
- Lo, K.Y.; Liu, K.H.; Chen, C.S. Assessment of photo aesthetics with efficiency. In Proceedings of the 21st International Conference on Pattern Recognition (ICPR2012), Tsukuba, Japan, 11–15 November 2012; pp. 2186–2189. [Google Scholar]
- Marchesotti, L.; Perronnin, F.; Meylan, F. Learning beautiful (and ugly) attributes. BMVC 2013, 7, 1–11. [Google Scholar]
- Su, H.H.; Chen, T.W.; Kao, C.C.; Hsu, W.H.; Chien, S.Y. Scenic photo quality assessment with bag of aesthetics-preserving features. In Proceedings of the 19th ACM international conference on Multimedia—MM’11, Scottsdale, AZ, USA, 28 November–1 December 2011. [Google Scholar]
- Liu, Z.; Wang, Z.; Yao, Y.; Zhang, L.; Shao, L. Deep Active Learning with Contaminated Tags for Image Aesthetics Assessment. IEEE Trans. Image Process. 2018. [Google Scholar] [CrossRef] [PubMed]
- Zhang, L.; Gao, Y.; Zimmermann, R.; Tian, Q.; Li, X. Fusion of Multichannel Local and Global Structural Cues for Photo Aesthetics Evaluation. IEEE Trans. Image Process. 2014, 23, 1419–1429. [Google Scholar] [CrossRef] [PubMed]
- Jin, B.; Segovia, M.V.O.; Susstrunk, S. Image aesthetic predictors based on weighted CNNs. In Proceedings of the 2016 IEEE International Conference on Image Processing (ICIP), Phoenix, AZ, USA, 25–28 September 2016. [Google Scholar]
- Schwarz, K.; Wieschollek, P.; Lensch, H.P.A. Will People Like Your Image? Learning the Aesthetic Space. In Proceedings of the 2018 IEEE Winter Conference on Applications of Computer Vision (WACV), Lake Tahoe, NV, USA, 12–15 March 2018. [Google Scholar]
The Reviewed Methods Evaluated on the AVA Dataset | ||||
---|---|---|---|---|
Method | Dataset | Employed Method | Overall Accuracy | Train/Test Split |
Marchesotti et al. [110] | AVA | Elastic Net | 67.89% | Standard partition |
AVA handcrafted features [14] | AVA | SVM | 68.00% | Standard partition |
Lu et al. [6] | AVA | CNN | 72.85% | Standard partition |
RAPID (full method) [5] | AVA | CNN | 74.46% | Standard partition |
Peng et al. [88] | AVA | CNN | 74.50% | Standard partition |
Kao et al. [94] | AVA | CNN | 74.51% | Standard partition |
Lu et al. [6] | AVA | CNN | 75.41% | Standard partition |
RAPID (improved version) [91] | AVA | CNN | 75.42% | Standard partition |
Kao et al. [71] | AVA | CNN | 76.15% | Standard partition |
Wang et al. [89] | AVA | CNN | 76.94% | Standard partition |
Kong et al. [87] | AVA | CNN | 77.33% | Standard partition |
Wang et al. [92] | AVA | CNN | 78.08% | Standard partition |
Tian et al. [90] | AVA | CNN | 80.38% | 10% subset/20k*2 |
Liu et al. [112] | AVA | CNN | 83.09% | Standard partition |
Zhang et al. [113] | AVA | Bayesian model | 83.24% | 10% subset/12.5k*2 |
Lv et al. [85] | AVA | Ranking model | 84.32% | 10% subset/20k*2 |
Dong et al. [84] | AVA | CNN | 83.52% | 10% subset/19k*2 |
Wang et al. [89] | AVA | CNN | 84.88% | 10% subset/25k*2 |
© 2019 by the author. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
Share and Cite
Bodini, M. Will the Machine Like Your Image? Automatic Assessment of Beauty in Images with Machine Learning Techniques. Inventions 2019, 4, 34. https://doi.org/10.3390/inventions4030034
Bodini M. Will the Machine Like Your Image? Automatic Assessment of Beauty in Images with Machine Learning Techniques. Inventions. 2019; 4(3):34. https://doi.org/10.3390/inventions4030034
Chicago/Turabian StyleBodini, Matteo. 2019. "Will the Machine Like Your Image? Automatic Assessment of Beauty in Images with Machine Learning Techniques" Inventions 4, no. 3: 34. https://doi.org/10.3390/inventions4030034
APA StyleBodini, M. (2019). Will the Machine Like Your Image? Automatic Assessment of Beauty in Images with Machine Learning Techniques. Inventions, 4(3), 34. https://doi.org/10.3390/inventions4030034