Brief Bio

Guanlin Lu received his PhD degree in computer science in 2012. He now works for EMC Data Domain. He worked with Prof. David H.C. Du. His research interests include storage system design and big data analysis. In particular, he is interested in applying statistical methods to storage system design.

Guanlin Lu received his B.S. degree in Computer Science from Huazhong University of Science and Technology, Wuhan, China, in 2006. Following that, he obtained a Master's degree in Computer Science from the University of Minnesota Twin Cities in 2009.


Guanlin Lu completed a research internship at NEC Laboratories of America in Princeton, New Jersey, during summer 2010.

Before that, Guanlin Lu worked as a software engineer intern in the Symantec Storage Availability Management Group (SAMG) between June 2009 and December 2009, where he worked on a cloud file system project.

Research Work

Guanlin Lu’s research focuses on data de-duplication, including adaptive chunking algorithm design and data pre-processing (e.g., clustering and classification) to improve de-duplication performance. He is particularly interested in using machine learning and clustering techniques to improve the performance of the de-duplication process.

For details about his research projects, click here!

Engineering Work

During his internship with SAMG, he developed sets of custom Solaris MDB debug macros for the Storage Cloud Data Server, Storage Cloud Meta Data Server and Storage Cloud Client modules to facilitate core dump debugging and analysis. He also ported a FiST-based stackable file system from SunOS 5.7 to SunOS 5.10.


Publications

·       Xing Lin, Guanlin Lu, Fred Douglis, Philip Shilane, Grant Wallace, Migratory Compression: Coarse-grained Data Reordering to Improve Compressibility, in the Proceedings of the 12th USENIX Conference on File and Storage Technologies (FAST 2014), February 2014.

·       Guanlin Lu, Youngjin Nam, David H.C. Du, BloomStore: Efficient Bloom-Filter based Key-Value Store for Indexing of Data Deduplication on Flash-Memory, in the Proceedings of the 28th IEEE Symposium on Massive Storage Systems and Technologies (MSST 2012), April 2012.

·       Ramya Prabhakar, Erik Kruus, Guanlin Lu, Cristian Ungureanu, EEffSim: A Discrete Event Simulator for Energy Efficiency in Large-Scale Storage Systems, 2nd Annual International Conference on Energy Aware Computing (ICEAC'11), Istanbul, Turkey, November 2011.

·       Youngjin Nam, Guanlin Lu, Nohhyun Park, Weijun Xiao, David H.C. Du, Chunk Fragmentation Level: An Effective Indicator for Read Performance Degradation in Deduplication Storage, 13th IEEE International Conference on High Performance Computing & Communication (HPCC 2011), Banff, Alberta, Canada, September 2-4, 2011.

·       Youngjin Nam, Guanlin Lu, David H.C. Du, Reliability-aware Deduplication Storage: Assuring Chunk Reliability and Chunk Loss Severity, International Green Computing Conference (IGCC 2011), Orlando, FL, USA.

·       Guanlin Lu, Biplob Debnath, David H.C. Du, Forest-structured Bloom Filters on Flash, in the Proceedings of the 27th IEEE Symposium on Massive Storage Systems and Technologies (MSST), 2011.

·       Guanlin Lu, Yu Jin, David H.C. Du, Frequency Based Chunking Algorithm for Data Deduplication, 18th Annual Meeting of the IEEE International Symposium on Modeling, Analysis and Simulation of Computer and Telecommunication Systems (MASCOTS 2010), Miami, Florida, August 2010. (Extended paper, in the top 16% of submitted papers.)

·       Guanlin Lu, Yu Jin, David H.C. Du, Frequency-Based Chunking for Backup Streams, poster at the 8th USENIX Conference on File and Storage Technologies (FAST 2010), San Jose, CA, February 2010.

·       Chuanyi Liu, Yingpin Lu, Chunhui Shi, Guanlin Lu, David H.C. Du, Dong-Sheng Wang, ADMAD: Application-Driven Metadata Aware De-duplication Archival Storage System, Storage Network Architecture and Parallel I/Os, 2008 (SNAPI '08) 15th IEEE international workshop

·       Pramod Mandagere, Guanlin Lu, David H.C. Du, Data de-duplication using Object Based Storage, research proposal submitted to Seagate, April 2007              


Patents

Co-authored three patents in the domains of data compression and RAID vulnerability.


Awards

ADC Graduate Fellowship in Wireless and Networking Technology, University of Minnesota, 2006

Academic Excellence scholarship, Huazhong University of Science and Technology, 2005


Major Courses taken in his PhD program

Data Communication & Computer Networks       Fall 06
Advanced Algorithm and Data Structure        Fall 06
Modern Operating System                      Spring 08
Principles of Database Systems               Fall 07
Introduction to Parallel Computing           Fall 07
Modern Cryptography                          Fall 07
Artificial Intelligence I                    Spring 07
Artificial Intelligence II                   Spring 08
Machine Learning                             Fall 08
Theory of Statistics I                       Fall 08
Theory of Statistics II                      Spring 09
Advanced Storage System                      Spring 09
Applied Multivariate Statistical Analysis    Spring 10