A Novel Statistical Learning-Based Methodology for Measuring the Goodness of Energy Profiles of Applications Executing on Multicore Computing Platforms
Abstract
:1. Introduction
- A novel methodology to measure the similarity between an energy profile and the ground truth. To the best of our knowledge, the proposed methodology is the first work that takes into consideration the qualitative differences of the energy consumption trend of the profiles and ranks the energy profiles based on their similarity with the ground truth.
- An experimental validation of the proposed methodology for a diverse set of 235 application energy profiles on modern multicore hybrid heterogeneous computing platforms.
- A comprehensive comparative analysis of the proposed methodology with popular statistical approaches such as correlation, average error, and Euclidean distance, which are commonly used to compare the accuracy and similarity of energy profiles as well as time series of equal lengths in general. We demonstrate that all three statistical approaches fail to capture the qualitative difference of an energy profile and the ground truth, and thus fail to distinguish the energy profiles based on their energy consumption trend. Therefore, they can mislead to consider as similar the energy profile whose energy consumption trend is different from that of the ground truth.
- We demonstrate how the proposed methodology can help in determining whether the energy model that is used to construct the energy profile, includes an extraneous contributor that does not reflect the energy consumption by the application, or it lacks some essential contributor to energy consumption by the application.
- We compare the effectiveness of our proposed method with state-of-the-art statistical approaches for energy optimization. We demonstrate that the use of the state-of-the-art instead of TSM in the energy optimization loop leads to significant energy losses (up to 54% in our case).
2. Materials and Methods
2.1. Goodness Measuring Problem Formulation
2.2. Challenges With State-of-the-Art Practices to Measure the Goodness of Energy Models
2.3. Trend-Based Similarity Measuring Methodology for Energy Profiles
2.3.1. Model Fitting
2.3.2. The Discrepancy Analysis
- The regression models of the energy profile and the ground truth must follow the same orientation. Both regression models must exhibit the same increase and decrease in the range of all the data points.
- The regression models of the energy profile and the ground truth must not intersect at any point.
- The distance between the regression models of the energy profile and the ground truth must be the same for the range of all the data points.
- Opposite: The slopes of the regression models of an energy profile and the ground truth are in the opposite direction. The regression models of an energy profile and the ground truth exhibit opposite behavior such that one of them is increasing at and the other one is decreasing with . Furthermore, the shape of the regression fit is concave up for one of them and concave down for the other one.
- Same: The slopes of the regression models of an energy profile and the ground truth are identically the same and follow the same direction. This class represents the energy profiles which are ideally the same to their corresponding ground truths.
- Similar: The slopes of the regression models of an energy profile and the ground truth are different, however, they follow the same direction. It indicates that the energy profile is neither the same as the ground truth nor in the opposite direction to it.
2.3.3. The Distance Metric
2.4. Experimental Setup
2.4.1. Experimental Platform and Applications
2.4.2. Experimental Methodology to Validate TSM
- Group A (Sets of many energy profiles): Group A comprises of the EPS where there is more than one energy profile of the same application constructed with different approaches such as on-chip power sensors, system-level power measurements provided by power meters, etc.
- Group B (Sets of single energy profiles): Group B comprises of the EPS where only one energy profile is compared with the ground truth.
3. Results and Discussion
3.1. Experiment Results
3.2. Discussion
3.3. Comparison of TSM and State-of-the-Art Statistical Approaches for Energy Optimization
4. Conclusions
Supplementary Materials
Author Contributions
Funding
Conflicts of Interest
Abbreviations
IEA | International Energy Agency |
ICT | Information and communication technology |
TWh | Terra-watt hours |
GPU | Graphics processing unit |
FPGA | Field programmable gate array |
QPI | Quick path interconnect |
NUMA | Non-uniform memory access |
DGEMM | Double-precision general matrix multiplication |
FFT | Fast fourier transform |
MKL | Intel Math Kernel Library |
RAPL | Running average power limit |
NVML | NVIDIA Management Library |
HPC | High performance computing |
TSM | Trend-based similarity measure |
EPS | Set of energy profiles of an application constructed with different energy measurement approaches |
Variable | Unit |
Power | Watt |
Static power | Watt |
Execution time | Second |
Energy | Joule |
Total energy | Joule |
Dynamic energy | Joule |
References
- IEA. Climate Change. Available online: https://www.iea.org/topics/climate-change (accessed on 23 June 2020).
- IEA. Global Energy & CO2 Status Report 2019. Available online: https://www.iea.org/reports/global-energy-and-co2-status-report-2019 (accessed on 17 June 2020).
- IEA. IEA Sets Out Pillars for Success at COP21. Available online: https://www.iea.org/news/iea-sets-out-pillars-for-success-at-cop21 (accessed on 17 June 2020).
- Andrae, A.; Edler, T. On Global Electricity Usage of Communication Technology: Trends to 2030. Challenges 2015, 6, 117–157. [Google Scholar] [CrossRef] [Green Version]
- O’brien, K.; Pietri, I.; Reddy, R.; Lastovetsky, A.; Sakellariou, R. A Survey of Power and Energy Predictive Models in HPC Systems and Applications. ACM Comput. Surv. 2017, 50, 37:1–37:38. [Google Scholar] [CrossRef]
- Fagas, G.; Gallagher, J.P.; Gammaitoni, L.; Paul, D.J. Energy Challenges for ICT. In ICT - Energy Concepts for Energy Efficiency and Sustainability; Fagas, G., Gammaitoni, L., Gallagher, J.P., Paul, D.J., Eds.; IntechOpen: Rijeka, Croatia, 2017; Chapter 1. [Google Scholar]
- Tang, Z.; Qi, L.; Cheng, Z.; Li, K.; Khan, S.U.; Li, K. An Energy-Efficient Task Scheduling Algorithm in DVFS-enabled Cloud Environment. J. Grid Comput. 2016, 14, 55–74. [Google Scholar] [CrossRef]
- Cao, T.; He, Y.; Kondo, M. Demand-Aware Power Management for Power-Constrained HPC Systems. In Proceedings of the 2016 16th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (CCGrid), Cartagena, Colombia, 16–19 May 2016; pp. 21–31. [Google Scholar]
- Arroba, P.; Moya, J.M.; Ayala, J.L.; Buyya, R. Dynamic Voltage and Frequency Scaling-Aware Dynamic Consolidation of Virtual Machines for Energy Efficient Cloud Data Centers. Concurr. Comput. Pract. Exp. 2017, 29, e4067. [Google Scholar] [CrossRef]
- Lang, J.; Rünger, G. An Execution Time and Energy Model for an Energy-Aware Execution of a Conjugate Gradient Method with CPU/GPU Collaboration. J. Parallel Distrib. Comput. 2014, 74, 2884–2897. [Google Scholar] [CrossRef]
- Lastovetsky, A.; Manumachu, R.R. New Model-Based Methods and Algorithms for Performance and Energy Optimization of Data Parallel Applications on Homogeneous Multicore Clusters. IEEE Trans. Parallel Distrib. Syst. 2017, 28, 1119–1133. [Google Scholar] [CrossRef]
- Khaleghzadeh, H.; Fahad, M.; Manumachu, R.R.; Lastovetsky, A. A Novel Data Partitioning Algorithm for Dynamic Energy Optimization on Heterogeneous High-Performance Computing Platforms. Concurr. Comput. Pract. Exp. 2020, e5928. [Google Scholar] [CrossRef]
- Fahad, M.; Shahid, A.; Manumachu, R.R.; Lastovetsky, A. Accurate Energy Modelling of Hybrid Parallel Applications on Modern Heterogeneous Computing Platforms Using System-Level Measurements. IEEE Access 2020, 8, 93793–93829. [Google Scholar] [CrossRef]
- Fahad, M.; Shahid, A.; Manumachu, R.R.; Lastovetsky, A. A Comparative Study of Methods for Measurement of Energy of Computing. Energies 2019, 12, 2204. [Google Scholar] [CrossRef] [Green Version]
- Kuzma, J.W. Basic Statistics for the Health Sciences, 3rd ed.; Mayfield Publishing Company: Mountain View, CA, USA, 1998. [Google Scholar]
- Abramowitz, M. Handbook of Mathematical Functions, With Formulas, Graphs, and Mathematical Tables; Dover Publications, Inc.: Mineola, NY, USA, 1974. [Google Scholar]
- Dargie, W. A Stochastic Model for Estimating the Power Consumption of a Processor. IEEE Trans. Comput. 2015, 64. [Google Scholar] [CrossRef]
- Zhou, Z.; Abawajy, J.H.; Li, F.; Hu, Z.; Chowdhury, M.U.; Alelaiwi, A.; Li, K. Fine-Grained Energy Consumption Model of Servers Based on Task Characteristics in Cloud Data Center. IEEE Access 2018, 6, 27080–27090. [Google Scholar] [CrossRef]
- McCullough, J.C.; Agarwal, Y.; Chandrashekar, J.; Kuppuswamy, S.; Snoeren, A.C.; Gupta, R.K. Evaluating the Effectiveness of Model-Based Power Characterization. In Proceedings of the 2011 USENIX Conference on USENIX Annual Technical Conference, Portland, OR, USA, 15–17 June 2011; p. 12. [Google Scholar]
- Hackenberg, D.; Ilsche, T.; Schöne, R.; Molka, D.; Schmidt, M.; Nagel, W.E. Power Measurement Techniques on Standard Compute Nodes: A Quantitative Comparison. In Proceedings of the 2013 IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS), Austin, TX, USA, 21–23 April 2013; pp. 194–204. [Google Scholar]
- Subramaniam, B.; Feng, W.C. Towards Energy-Proportional Computing for Enterprise-Class Server Workloads. In Proceedings of the 4th ACM/SPEC International Conference on Performance Engineering, Prague, Czech Republic, 21–24 April 2013; p. 1526. [Google Scholar]
- Hackenberg, D.; Schöne, R.; Ilsche, T.; Molka, D.; Schuchart, J.; Geyer, R. An Energy Efficiency Feature Survey of the Intel Haswell Processor. In Proceedings of the 2015 IEEE International Parallel and Distributed Processing Symposium Workshop, Hyderabad, India, 25–29 May 2015; pp. 896–904. [Google Scholar]
- Khan, K.N.; Hirki, M.; Niemi, T.; Nurminen, J.K.; Ou, Z. RAPL in Action: Experiences in Using RAPL for Power Measurements. ACM Trans. Model. Perform. Eval. Comput. Syst. 2018, 3, 9:1–9:26. [Google Scholar] [CrossRef]
- Rico-Gallego, J.A.; Díaz-Martín, J.C.; Lastovetsky, A.L. Extending t-Lop to Model Concurrent MPI Communications in Multicore Clusters. Future Gener. Comput. Syst. 2016, 61, 6682. [Google Scholar] [CrossRef]
- Li, B.; Han, L. Distance Weighted Cosine Similarity Measure for Text Classification. Intelligent Data Engineering and Automated Learning – IDEAL 2013; Yin, H., Tang, K., Gao, Y., Klawonn, F., Lee, M., Weise, T., Li, B., Yao, X., Eds.; Springer: Berlin/Heidelberg, Germany, 2013; pp. 611–618. [Google Scholar]
- Berndt, D.J.; Clifford, J. Using Dynamic Time Warping to Find Patterns in Time Series. In Proceedings of the 3rd International Conference on Knowledge Discovery and Data Mining, Seattle, WA, USA, 31 July–1 August 1994; pp. 359–370. [Google Scholar]
- Nakamura, T.; Taki, K.; Nomiya, H.; Seki, K.; Uehara, K. A Shape-Based Similarity Measure for Time Series Data with Ensemble Learning. Pattern Anal. Appl. 2013, 16, 535–548. [Google Scholar] [CrossRef]
- Box, G.E.P.; Jenkins, G. Time Series Analysis, Forecasting and Control; Holden-Day, Inc.: San Francisco, CA, USA, 1976. [Google Scholar]
- Yu, Q.; Jibin, L.; Jiang, L. An Improved ARIMA-Based Traffic Anomaly Detection Algorithm for Wireless Sensor Networks. Int. J. Distrib. Sens. Netw. 2016, 12, 9653230. [Google Scholar] [CrossRef] [Green Version]
- Piccolo, D. A Distance Measure for Classifying Arima Models. J. Time Ser. Anal. 1990, 11, 153–164. [Google Scholar] [CrossRef]
- Ramoni, M.; Sebastiani, P.; Cohen, P. Bayesian Clustering by Dynamics. Mach. Learn. 2002, 47, 91–121. [Google Scholar] [CrossRef] [Green Version]
- Maharaj, E.A. Cluster of Time Series. J. Classif. 2000, 17, 297–314. [Google Scholar] [CrossRef]
- Anton, H.; Rorres, C. Elementary Linear Algebra: Applications Version, 11th ed.; John Wiley & Sons, Inc.: Hoboken, NJ, USA, 2013. [Google Scholar]
- Warren Liao, T. Clustering of Time Series Data-A Survey. Pattern Recogn. 2005, 38, 18571874. [Google Scholar] [CrossRef]
- Iglesias, F.; Kastner, W. Analysis of Similarity Measures in Times Series Clustering for the Discovery of Building Energy Patterns. Energies 2013, 6, 579–597. [Google Scholar] [CrossRef] [Green Version]
- Sanfeliu, A.; Fu, K. A Distance Measure between Attributed Relational Graphs for Pattern Recognition. IEEE Trans. Syst. Man Cybern. 1983, SMC-13, 353–362. [Google Scholar] [CrossRef]
- Fahad, M.; Shahid, A.; Manumachu, R.R.; Lastovetsky, A. Supplementary Materials: A Novel Statistical Learning-Based Methodology for Measuring the Goodness of Energy Profiles of Applications Executing on Multicore Comptuing Platforms. Available online: https://csgitlab.ucd.ie/Muhammad_Fahad/supplementals-to-the-publications/-/blob/master/2020/MDPI_goodness/supplemental.pdf (accessed on 25 July 2020).
- Rotem, E.; Naveh, A.; Ananthakrishnan, A.; Weissmann, E.; Rajwan, D. Power-Management Architecture of the Intel Microarchitecture Code-Named Sandy Bridge. IEEE Micro 2012, 32, 20–27. [Google Scholar] [CrossRef]
- Heterogeneous Computing Laboratory, University College Dublin. HCLWattsUp: API for Power and Energy Measurements Using WattsUp Pro Meter. Available online: https://csgitlab.ucd.ie/ucd-hcl/hclwattsup (accessed on 23 May 2020).
- Rawlings, J.O.; Pantula, S.G.; Dickey, D.A. Applied Regression Analysis, 2nd ed.; Springer: New York, NY, USA, 1998. [Google Scholar]
- Ostertagová, E. Modelling Using Polynomial Regression. Procedia Eng. 2012, 48, 500–506. [Google Scholar] [CrossRef] [Green Version]
- Shahid, A.; Fahad, M.; Manumachu, R.R.; Lastovetsky, A. Energy of Computing on Multicore CPUs: Predictive Models and Energy Conservation Law. Available online: https://arxiv.org/pdf/1907.02805.pdf (accessed on 15 June 2020).
- Nvidia. NVML Reference Manual. Available online: https://docs.nvidia.com/pdf/NVML_API_Reference_Guide.pdf (accessed on 16 June 2020).
- Corporation, I. Intel Xeon Phi Coprocessor SystemSoftware Developers Guide. Available online: https://software.intel.com/sites/default/files/managed/09/07/xeon-phi-coprocessor-system-software-developers-guide.pdf (accessed on 16 June 2020).
- Corporation, I. Intelo Manycore Platform Software Stack (Intel MPSS). Available online: https://software.intel.com/en-us/articles/intel-manycore-platform-software-stack-mpss (accessed on 16 June 2020).
© 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
Share and Cite
Fahad, M.; Shahid, A.; Manumachu, R.R.; Lastovetsky, A. A Novel Statistical Learning-Based Methodology for Measuring the Goodness of Energy Profiles of Applications Executing on Multicore Computing Platforms. Energies 2020, 13, 3944. https://doi.org/10.3390/en13153944
Fahad M, Shahid A, Manumachu RR, Lastovetsky A. A Novel Statistical Learning-Based Methodology for Measuring the Goodness of Energy Profiles of Applications Executing on Multicore Computing Platforms. Energies. 2020; 13(15):3944. https://doi.org/10.3390/en13153944
Chicago/Turabian StyleFahad, Muhammad, Arsalan Shahid, Ravi Reddy Manumachu, and Alexey Lastovetsky. 2020. "A Novel Statistical Learning-Based Methodology for Measuring the Goodness of Energy Profiles of Applications Executing on Multicore Computing Platforms" Energies 13, no. 15: 3944. https://doi.org/10.3390/en13153944
APA StyleFahad, M., Shahid, A., Manumachu, R. R., & Lastovetsky, A. (2020). A Novel Statistical Learning-Based Methodology for Measuring the Goodness of Energy Profiles of Applications Executing on Multicore Computing Platforms. Energies, 13(15), 3944. https://doi.org/10.3390/en13153944