25–29 May 2026
Chulalongkorn University
Asia/Bangkok timezone

JWanFS: A WAN-Oriented Distributed File System for Multi-Data Center Collaboration in High Energy Physics

27 May 2026, 17:45
18m
Chulalongkorn University

Chulalongkorn University

Oral Presentation Track 1 - Data and metadata organization, management and access Track 1 - Data and metadata organization, management and access

Speaker

隗立畅 weilc (IHEP)

Description

With the continuous advancement of HEP detectors and online reconstruction capabilities, the scale of experimental data is growing rapidly. The data pattern is increasingly characterized by "massive small files distributed across multiple data centers." On one hand, the surge in small files creates bottlenecks in metadata and directory operations; on the other hand, cross-data center access often relies on complex cross-domain operational strategies, making it difficult to balance performance with scalability.

To address these issues, this paper proposes JWanFS, a distributed file system designed for HEP experiments. It provides a unified namespace and nearest-access capabilities for multi-site users, with optimizations specifically for Wide Area Network (WAN) environments.

The key designs of JWanFS include:

  1. Storage & Interface Optimization: Enhances small file organization and access strategies based on SeaweedFS, and supports multi-protocol access (NFS, S3, XRootD) via a gateway layer to seamlessly integrate with data analysis and AI training workflows.
  2. Metadata Synchronization: Utilizes MongoDB's asynchronous replication (Oplog) mechanism for efficient cross-site metadata distribution and minimizes directory traversal overhead through range query optimization.
  3. Access Acceleration: Combines a "nearest data center" policy with client-side multi-level caching to significantly reduce WAN Round-Trip Time (RTT) and cross-domain jitter.

JWanFS demonstrates stable and efficient throughput and scalability under typical small-file workloads and cross-domain access scenarios. We plan to deploy and iterate the system in further HEP experiments (such as LHAASO and JUNO) to provide reliable and efficient cross-domain storage infrastructure support for the next generation of high energy physics experiments.

Author

Co-authors

LI Haibo lihaibo Dr Yaodong CHENG (Institute of High Energy Physics, Chinese Academy of Sciences) Mr Yuanming Tang (IHEP) Dr Yujiang BI (Institute of High Energy Physics, Chinese Academy of Sciences) Mr Zhuo Meng (IHEP)

Presentation materials

There are no materials yet.