Tierra: multi-tiered arrays and recency-aware hot data decision



Abstract

Hot data identification plays a critical role in a wide range of computing systems, including memory hierarchy management, database optimization, and large-scale storage infrastructures. Its importance has grown further with the emergence of non-volatile memory (NVM) technologies. However, many existing schemes fail to identify hot and cold data accurately: they incur excessive computational overhead, require large amounts of memory, and achieve limited accuracy, primarily because their data structures record recency and frequency information ineffectively. To date, two representative data structures, bit-array counters and multiple Bloom filters, have been widely adopted. Bit-array counters capture access count (i.e., frequency) information effectively but do not consider recency. Multiple Bloom filters were proposed to record both frequency and recency information and have therefore been widely employed. Nevertheless, many hot data identification schemes based on multiple Bloom filters still suffer from low accuracy due to fundamental limitations of the underlying data structure. To overcome these inherent limitations, this paper proposes Tierra, a novel hot data identification scheme built on a new data structure that employs asymmetric multilevel arrays. These asymmetric arrays improve performance by reducing internal data movement by 3.1×. In addition, Tierra incorporates a recency-aware request screening mechanism based on an enhanced stack distance approximation algorithm, which substantially reduces computational overhead while improving identification accuracy. Comprehensive evaluations using diverse real-world workloads demonstrate that Tierra achieves high accuracy, with an average true identification rate of 99.4%.
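To make the baseline concrete, the following is a minimal sketch of how a multiple-Bloom-filter identifier can record both frequency and recency. It illustrates the prior approach the abstract criticizes, not Tierra's asymmetric multilevel arrays, whose layout the abstract does not specify. All class names, parameters (`num_filters`, `decay_interval`, `hot_threshold`), and the hashing choice are assumptions made for illustration.

```python
import hashlib


class BloomFilter:
    """Tiny Bloom filter; sizes are illustrative assumptions, not tuned values."""

    def __init__(self, num_bits=1024, num_hashes=2):
        self.num_bits = num_bits
        self.num_hashes = num_hashes
        self.bits = bytearray(num_bits)  # one byte per bit, for clarity

    def _positions(self, key):
        # Derive num_hashes bit positions from seeded SHA-256 digests.
        for seed in range(self.num_hashes):
            digest = hashlib.sha256(f"{seed}:{key}".encode()).digest()
            yield int.from_bytes(digest[:8], "big") % self.num_bits

    def add(self, key):
        for pos in self._positions(key):
            self.bits[pos] = 1

    def __contains__(self, key):
        return all(self.bits[pos] for pos in self._positions(key))

    def clear(self):
        self.bits = bytearray(self.num_bits)


class MultiBloomHotness:
    """Hypothetical baseline: several filters, one erased periodically."""

    def __init__(self, num_filters=4, decay_interval=8, hot_threshold=2):
        self.filters = [BloomFilter() for _ in range(num_filters)]
        self.decay_interval = decay_interval
        self.hot_threshold = hot_threshold
        self.requests = 0
        self.next_to_clear = 0  # round-robin index of the filter to erase

    def access(self, key):
        # Frequency: record the access in the first filter that
        # does not already contain the key.
        for bf in self.filters:
            if key not in bf:
                bf.add(key)
                break
        # Recency: every decay_interval requests, erase one filter so
        # that old accesses gradually stop counting.
        self.requests += 1
        if self.requests % self.decay_interval == 0:
            self.filters[self.next_to_clear].clear()
            self.next_to_clear = (self.next_to_clear + 1) % len(self.filters)

    def hotness(self, key):
        # Number of filters that (probably) contain the key.
        return sum(key in bf for bf in self.filters)

    def is_hot(self, key):
        return self.hotness(key) >= self.hot_threshold
```

In this sketch, three accesses to a single key in an otherwise empty identifier yield a hotness of 3, and each periodic erase removes one recorded access. Because Bloom filters admit false positives, the hotness of a key can be overestimated once many distinct keys share the filters, which is one plausible source of the accuracy limits the abstract attributes to this data structure.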
