A sample decreasing threshold greedy‑based algorithm for big data summarisation

dc.contributor.authorLi, Teng
dc.contributor.authorShin, Hyo-Sang
dc.contributor.authorTsourdos, Antonios
dc.date.accessioned2021-03-03T14:50:21Z
dc.date.available2021-03-03T14:50:21Z
dc.date.issued2021-02-09
dc.description.abstractAs the scale of datasets used for big data applications expands rapidly, there have been increased efforts to develop faster algorithms. This paper addresses big data summarisation problems using the submodular maximisation approach and proposes an efficient algorithm for maximising general non-negative submodular objective functions subject to k-extendible system constraints. Leveraging a random sampling process and a decreasing threshold strategy, this work proposes an algorithm, named Sample Decreasing Threshold Greedy (SDTG). The proposed algorithm obtains an expected approximation guarantee of 11+k−ϵ for maximising monotone submodular functions and of k(1+k)2−ϵ in non-monotone cases with expected computational complexity of O(n(1+k)ϵlnrϵ). Here, r is the largest size of feasible solutions, and ϵ∈(0,11+k) is an adjustable designing parameter for the trade-off between the approximation ratio and the computational complexity. The performance of the proposed algorithm is validated and compared with that of benchmark algorithms through experiments with a movie recommendation system based on a real database.en_UK
dc.identifier.citationLi T, Shin H-S & Tsourdos A (2021) A sample decreasing threshold greedy‑based algorithm for big data summarisation, Journal of Big Data, Volume 8, 2021, Article number 30en_UK
dc.identifier.issn2196-1115
dc.identifier.urihttps://doi.org/10.1186/s40537-021-00416-y
dc.identifier.urihttps://dspace.lib.cranfield.ac.uk/handle/1826/16434
dc.language.isoenen_UK
dc.publisherSpringeren_UK
dc.rightsAttribution 4.0 International*
dc.rights.urihttp://creativecommons.org/licenses/by/4.0/*
dc.subjectBig data summarisationen_UK
dc.subjectSubmodular maximisationen_UK
dc.subjectk-extendible system constraintsen_UK
dc.subjectPersonalised recommendationen_UK
dc.titleA sample decreasing threshold greedy‑based algorithm for big data summarisationen_UK
dc.typeArticleen_UK

Files

Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
big_data_summarisation-2021.pdf
Size:
1020.62 KB
Format:
Adobe Portable Document Format
Description:
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.63 KB
Format:
Item-specific license agreed upon to submission
Description: