9-15 June 2018
Woodlands Conference Center
America/New_York timezone

Real-time data access log analysis system for the EAST tokamak based on Spark

14 Jun 2018, 14:35
1h 30m
Woodlands Conference Center

159 Visitor Center Dr, Williamsburg, VA 23185
Poster presentation Data Acquisition Poster 2




The volume of experimental data generated by the EAST device keeps growing, so the MDSplus data storage server on EAST needs to be monitored. To make it easier to manage users on the MDSplus server, a real-time log monitoring and analysis system is needed. The system uses Spark Streaming, part of the Spark ecosystem, as its data processing framework, with the MDSplus logs as its real-time streaming data source. It also relies on frameworks such as Flume and Kafka for log monitoring, aggregation and distribution, which gives it the capacity to process MDSplus log data at scale. The system can process tens of millions of raw MDSplus log records on a timescale of seconds, then model the log information and display it on the web. This report introduces the design and implementation of the overall architecture of the Spark-based real-time data access log analysis system. Experimental results show that the system performs steadily and reliably and has important application value for the management of fusion experiment data.
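The processing step described above can be illustrated with a minimal sketch in plain Python. It is not the authors' implementation and does not use Flume, Kafka or Spark; it only mimics the micro-batch idea behind Spark Streaming by grouping log events into fixed time slices and counting accesses per user in each slice. The log line layout (`<epoch_seconds> <user> <tree> <shot>`), the `AccessEvent` type, and all function, user and tree names are assumptions made for illustration, since the abstract does not specify the MDSplus log format.

```python
from collections import Counter
from dataclasses import dataclass
from typing import Iterable, Iterator, List, Optional


@dataclass(frozen=True)
class AccessEvent:
    """One parsed access record (hypothetical layout, not the real MDSplus log)."""
    timestamp: int  # seconds since epoch
    user: str
    tree: str
    shot: int


def parse_line(line: str) -> Optional[AccessEvent]:
    """Parse '<epoch_seconds> <user> <tree> <shot>'; return None if malformed."""
    parts = line.split()
    if len(parts) != 4:
        return None
    try:
        return AccessEvent(int(parts[0]), parts[1], parts[2], int(parts[3]))
    except ValueError:
        return None


def micro_batches(events: Iterable[AccessEvent],
                  batch_seconds: int) -> Iterator[List[AccessEvent]]:
    """Group time-ordered events into consecutive fixed-length batches.

    Empty batches are emitted for time slices that contain no events.
    """
    batch: List[AccessEvent] = []
    batch_start: Optional[int] = None
    for event in events:
        if batch_start is None:
            # Align the first batch to a multiple of the batch length.
            batch_start = event.timestamp - event.timestamp % batch_seconds
        while event.timestamp >= batch_start + batch_seconds:
            yield batch
            batch = []
            batch_start += batch_seconds
        batch.append(event)
    if batch:
        yield batch


def count_by_user(batch: Iterable[AccessEvent]) -> Counter:
    """Per-batch aggregation: number of accesses by each user."""
    return Counter(event.user for event in batch)


def run(lines: Iterable[str], batch_seconds: int = 1) -> List[Counter]:
    """Parse raw lines, drop malformed ones, and aggregate each micro-batch."""
    events = (e for e in map(parse_line, lines) if e is not None)
    return [count_by_user(b) for b in micro_batches(events, batch_seconds)]


if __name__ == "__main__":
    sample = [
        "100 alice tree_a 70001",
        "100 bob tree_a 70001",
        "100 alice tree_b 70002",
        "not a valid record",
        "102 bob tree_a 70003",
    ]
    for index, counts in enumerate(run(sample)):
        print(index, dict(counts))
```

In a deployment like the one the abstract describes, the lines would presumably arrive from a Kafka topic fed by Flume, and the batching and aggregation would be handled by Spark Streaming across a cluster rather than by a single Python process.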

Speaker Feng Wang
Institute IPP Hefei
Country China
Minioral No

Primary authors

Feng WANG
Mr Qihao ZHANG (ASIPP)
Ms Yueting WANG (ASIPP)
Ms Ying CHEN (ASIPP)
Dr Fei Yang (Department of Computer Science, Anhui Medical University)
