image: The directory tree replication architecture
Credit: Chenhao ZHANG, Liang WANG, Jing SHANG, Zhiwen XIAO, Limin XIAO, Meng HAN, Bing WEI, Runnan SHEN, Jinquan WANG
Metadata management has always been the focus of distributed file systems research. Metadata replication exhibits a comparative advantage in maintaining locality and balancing system load. However, according to the requirements of POSIX semantics, existing systems use strict consistency to guarantee the consistency and atomicity of metadata operations. The strict consistency impedes I/O, resulting in a performance bottleneck.
To solve the problems, a research team led by Limin Xiao published their new research on 15 Jan 2025 in Frontiers of Computer Science co-published by Higher Education Press and Springer Nature.
The team proposed a low-cost and efficient consistency framework with adaptive synchronization for metadata replication. This synchronization method is verified and tested in the global virtual data space file system. Compared with the existing research results, the proposed method has better metadata operation performance and better scalability.
In the research, they analyzed the consistency and performance requirements of metadata operations and data operations for distributed file systems in the wide area high performance computing environment, and then decoupled directory from file metadata. They propose different synchronization methods for directory trees and file metadata to ensure system performance without sacrificing consistency.
ICGDT, an incremental consistency guaranteed directory tree synchronization method, enables fine-grained tracking of access conflict status for files and directories. By utilizing a decentralized conflict status tree, it allows dynamic switching of the message synchronization channel for files or directories within the directory tree. CCGRI, a method that ensures causal consistency for data access in wide-area DFSes. It converts data update synchronization requests into data index update synchronization requests to reduce the wide-area bandwidth occupied by data replication and delay the actual synchronization of replica data until the replica data is accessed or the wide-area link is idle by lazy replication. They work together to achieve efficient and consistent metadata replication.
The experiment was carried out on a real distributed file system GVDS. The experimental results show that, compared with the existing methods, the proposed method greatly reduces the synchronization delay and has good scalability.
DOI: 10.1007/s11704-023-2772-y
Journal
Frontiers of Computer Science
Method of Research
Experimental study
Subject of Research
Not applicable
Article Title
ICCG: low-cost and efficient consistency with adaptive synchronization for metadata replication
Article Publication Date
15-Jan-2025