Time and Location: 5/6/2020, Wed., 4 p.m., Online seminar Speaker: Mr. Wei Zhang Title: A Data Retention Recommender System for HPC Facilities Abstract: The storage system of many high-performance computing (HPC) facilities faces an increasingly challenging goal of meeting unlimited data growth with limited storage capacity growth. The data retention policy plays a vital role in addressing such challenge. However, most existing data retention policies are designed based on and biased towards the temporal properties of files rather than the activities of users. Such biased data retention policies may cause unnecessary data interruption or even data loss to users. In this talk, I will introduce our recent study where we proposed a data retention action recommender system (DataRecommender) that generates user-centric data retention recommendations based on a holistic view of the user activeness. Our evaluation on the recommendation results shows that, as compared to the existing data retention policies, adopting DataRecommender’s data retention recommendation can significantly avoid undesired data loss for active users while maintaining the same ability of cleaning unnecessary files as compared to existing LRU-based data retention policies.