A transistor operations model for deep learning energy consumption scaling law
dc.contributor.author | Li, Chen | |
dc.contributor.author | Tsourdos, Antonios | |
dc.contributor.author | Guo, Weisi | |
dc.date.accessioned | 2023-01-10T11:16:17Z | |
dc.date.available | 2023-01-10T11:16:17Z | |
dc.date.freetoread | 2023-01-10 | |
dc.date.issued | 2024-01-01 | |
dc.date.pubOnline | 2022-12-14 | |
dc.description.abstract | Deep Neural Networks (DNN) has transformed the automation of a wide range of industries and finds increasing ubiquity in society. The high complexity of DNN models and its widespread adoption has led to global energy consumption doubling every 3-4 months. Current energy consumption measures largely monitor system wide consumption or make linear assumptions of DNN models. The former approach captures other unrelated energy consumption anomalies, whilst the latter does not accurately reflect nonlinear computations. In this paper, we are the first to develop a bottom-up Transistor Operations (TOs) approach to expose the role of non-linear activation functions and neural network structure. As there will be inevitable energy measurement errors at the core level, we statistically model the energy scaling laws as opposed to absolute consumption values. We offer models for both feedforward DNNs and convolution neural networks (CNNs) on a variety of data sets and hardware configurations - achieving a 93.6% - 99.5% precision. This outperforms existing FLOPs-based methods and our TOs method can be further extended to other DNN models. | en_UK |
dc.description.journalName | IEEE Transactions on Artificial Intelligence | |
dc.description.sponsorship | European Union funding: 778305 | en_UK |
dc.format.extent | pp. 192-204 | |
dc.identifier.citation | Li C, Tsourdos A, Guo W. (2024) A transistor operations model for deep learning energy consumption scaling law. IEEE Transactions on Artificial Intelligence, Volume 5, Issue 1, January 2024, pp. 192-204 | en_UK |
dc.identifier.issn | 2691-4581 | |
dc.identifier.issueNo | 1 | |
dc.identifier.uri | https://doi.org/10.1109/TAI.2022.3229280 | |
dc.identifier.uri | https://dspace.lib.cranfield.ac.uk/handle/1826/18923 | |
dc.identifier.volumeNo | 5 | |
dc.language.iso | en | en_UK |
dc.publisher | IEEE | en_UK |
dc.rights | Attribution-NonCommercial 4.0 International | * |
dc.rights.uri | http://creativecommons.org/licenses/by-nc/4.0/ | * |
dc.subject | Energy Consumption | en_UK |
dc.subject | Deep Learning | en_UK |
dc.subject | Model Architecture | en_UK |
dc.subject | Transistor Operations | en_UK |
dc.title | A transistor operations model for deep learning energy consumption scaling law | en_UK |
dc.type | Article | en_UK |
dcterms.dateAccepted | 2022-12-10 |