A transistor operations model for deep learning energy consumption scaling law

dc.contributor.authorLi, Chen
dc.contributor.authorTsourdos, Antonios
dc.contributor.authorGuo, Weisi
dc.date.accessioned2023-01-10T11:16:17Z
dc.date.available2023-01-10T11:16:17Z
dc.date.issued2022-12-14
dc.description.abstractDeep Neural Networks (DNN) has transformed the automation of a wide range of industries and finds increasing ubiquity in society. The high complexity of DNN models and its widespread adoption has led to global energy consumption doubling every 3-4 months. Current energy consumption measures largely monitor system wide consumption or make linear assumptions of DNN models. The former approach captures other unrelated energy consumption anomalies, whilst the latter does not accurately reflect nonlinear computations. In this paper, we are the first to develop a bottom-up Transistor Operations (TOs) approach to expose the role of non-linear activation functions and neural network structure. As there will be inevitable energy measurement errors at the core level, we statistically model the energy scaling laws as opposed to absolute consumption values. We offer models for both feedforward DNNs and convolution neural networks (CNNs) on a variety of data sets and hardware configurations - achieving a 93.6% - 99.5% precision. This outperforms existing FLOPs-based methods and our TOs method can be further extended to other DNN models.en_UK
dc.description.sponsorshipEuropean Union funding: 778305en_UK
dc.identifier.citationLi C, Tsourdos A, Guo W. (2024) A transistor operations model for deep learning energy consumption scaling law. IEEE Transactions on Artificial Intelligence, Volume 5, Issue 1, January 2024, pp. 192-204en_UK
dc.identifier.issn2691-4581
dc.identifier.urihttps://doi.org/10.1109/TAI.2022.3229280
dc.identifier.urihttps://dspace.lib.cranfield.ac.uk/handle/1826/18923
dc.language.isoenen_UK
dc.publisherIEEEen_UK
dc.rightsAttribution-NonCommercial 4.0 International*
dc.rights.urihttp://creativecommons.org/licenses/by-nc/4.0/*
dc.subjectEnergy Consumptionen_UK
dc.subjectDeep Learningen_UK
dc.subjectModel Architectureen_UK
dc.subjectTransistor Operationsen_UK
dc.titleA transistor operations model for deep learning energy consumption scaling lawen_UK
dc.typeArticleen_UK

Files

Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Transistor_operations_model-2022.pdf
Size:
2.45 MB
Format:
Adobe Portable Document Format
Description:
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.63 KB
Format:
Item-specific license agreed upon to submission
Description: