Enhancing performance and interpretability of multivariate time-series model through sparse saliency

Date published

2024-09-24

Free to read from

2025-04-14

Supervisor/s

Journal Title

Journal ISSN

Volume Title

Publisher

IEEE

Department

Type

Conference paper

ISSN

2153-0009

Format

Citation

Kong X, Xing Y, Liu Z, et al., (2024) Enhancing performance and interpretability of multivariate time-series model through sparse saliency. In: 2024 IEEE 27th International Conference on Intelligent Transportation Systems (ITSC), 24-27 September 2024, Edmonton, Canada, pp. 3265-3270

Abstract

Explainable time-series modelling is an essential task for modern intelligent transportation systems (ITS). How-ever, balancing accuracy and interpretability in multivariate time series forecasting presents significant challenges. These challenges arise from the necessity to understand the significance of features and their temporal variations. Factors such as autocorrelation in time series and data processing techniques like sliding windows expand feature sets, thereby complicating pattern recognition using traditional post-hoc explanation methods and making the issue even more complex. To overcome these challenges, in this study, we propose a flexible post-process approach which generates sparse and normalized saliency values based on existing saliency generation methods such as GradientSHAP. Additionally, an optional window aggregation and alignment strategy is introduced to align with the original time series dataset, enhancing the intuitive understanding of feature importance. Furthermore, the potential use of sparse saliency for data augmentation to improve the model is explored. Lastly, we utilize naturalistic data from San Francisco airport to demonstrate our approach for ITS time-series prediction and explanation. The evaluation results indicate that integrating sparse saliency from high-performing models not only boosts the performance of XGBoost models by 10.92% but also simplifies model complexity, facilitating easier interpretation.

Description

Software Description

Software Language

Github

Keywords

4605 Data Management and Data Science, 46 Information and Computing Sciences, Generic health relevance

DOI

Rights

Attribution 4.0 International

Relationships

Relationships

Resources

Funder/s

Engineering and Physical Sciences Research Council (EPSRC)
This project is supported by the Engineering and Physical Sciences Research Council (EPSRC) training grant entitled “DTP 2020-2021 Cranfield University” bearing reference EP/T518104/1.