Mr Igor Tkachev (Joint Institute for Nuclear Research)

The following parameters were chosen for the Russian Grid segment:
number of busy, free and down CPUs;
number of running and waiting jobs for each virtual organization (VO);
used and available disk space for each VO;
load on the main servers (e.g. the Computing Element);
Round Trip Time (RTT) in the networks between Resource Centers.
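
As an illustration only, the parameters listed above could be held in one record per
site, as in the minimal Python sketch below. The class names, field names and units
(SiteSnapshot, VOUsage, gigabytes, milliseconds) are assumptions made for this example
and are not taken from the RDIG monitoring system itself.

```python
from dataclasses import dataclass, field
from typing import Dict


@dataclass
class VOUsage:
    """Per-VO counters (hypothetical field names)."""
    running_jobs: int = 0
    waiting_jobs: int = 0
    disk_used_gb: float = 0.0
    disk_available_gb: float = 0.0


@dataclass
class SiteSnapshot:
    """One monitoring sample for a single Resource Center (hypothetical layout)."""
    site: str
    busy_cpus: int
    free_cpus: int
    down_cpus: int
    ce_load: float  # load on the Computing Element host
    # RTT in milliseconds to other Resource Centers, keyed by site name
    rtt_ms: Dict[str, float] = field(default_factory=dict)
    # Job and disk counters, keyed by VO name
    vo_usage: Dict[str, VOUsage] = field(default_factory=dict)

    def cpu_utilisation(self) -> float:
        """Fraction of usable (not down) CPUs that are busy."""
        usable = self.busy_cpus + self.free_cpus
        return self.busy_cpus / usable if usable else 0.0


# Example with made-up numbers
snap = SiteSnapshot(site="EXAMPLE-SITE", busy_cpus=30, free_cpus=10,
                    down_cpus=2, ce_load=0.5)
snap.vo_usage["example_vo"] = VOUsage(running_jobs=25, waiting_jobs=4)
print(snap.cpu_utilisation())
```

Down CPUs are left out of the utilisation denominator in this sketch, so the figure
describes only the CPUs that could actually accept jobs.
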
A new monitoring system, based on the current one, is now being developed. It will include
different subsystems such as job monitoring, network monitoring, storage monitoring
and other impro

We have experience in monitoring and accounting of grid sites using LDAP, R-GMA,

An additional task was to build an accounting system. It stores data on resource
utilization at Grid sites by virtual organizations and individual users. The derived
parameters are job count, consumed CPU time, average job waiting time, and used
physical memory. Information is taken from the R-GMA (Relational Grid Monitoring
Architecture) system and stored in a local Oracle database. The web interface allows
users to select and group parameters by different criteria, such as time period,
virtual organization, or Grid site.
The RDIG Monitoring and Accounting web site is now functioning well and is in use.
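
The grouping of accounting parameters described above can be sketched in a few lines
of Python. This is only an in-memory illustration under stated assumptions: the record
fields (vo, site, cpu_seconds, wait_seconds) and the function name summarise are
hypothetical, and the real system works on R-GMA data stored in an Oracle database
rather than on Python dictionaries.

```python
from collections import defaultdict


def summarise(records, keys=("vo",)):
    """Group job records by the given keys and derive per-group totals.

    Each record is a dict with hypothetical fields:
    'vo', 'site', 'cpu_seconds', 'wait_seconds'.
    """
    totals = defaultdict(lambda: {"jobs": 0, "cpu_seconds": 0.0, "wait_seconds": 0.0})
    for rec in records:
        group = tuple(rec[k] for k in keys)
        acc = totals[group]
        acc["jobs"] += 1
        acc["cpu_seconds"] += rec["cpu_seconds"]
        acc["wait_seconds"] += rec["wait_seconds"]

    # Turn the accumulated waiting time into an average per job
    return {
        group: {
            "jobs": acc["jobs"],
            "cpu_seconds": acc["cpu_seconds"],
            "avg_wait_seconds": acc["wait_seconds"] / acc["jobs"],
        }
        for group, acc in totals.items()
    }


# Made-up sample records
records = [
    {"vo": "vo_a", "site": "site_1", "cpu_seconds": 100.0, "wait_seconds": 10.0},
    {"vo": "vo_a", "site": "site_2", "cpu_seconds": 200.0, "wait_seconds": 30.0},
    {"vo": "vo_b", "site": "site_1", "cpu_seconds": 50.0, "wait_seconds": 5.0},
]

by_vo = summarise(records, keys=("vo",))
by_vo_and_site = summarise(records, keys=("vo", "site"))
print(by_vo[("vo_a",)])
```

Passing a different keys tuple, for example ("site",) or ("vo", "site"), changes the
grouping criterion without touching the aggregation logic; a time-period criterion
could be handled the same way if each record carried a period field.
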

The monitoring system we developed makes it possible to watch the operating parameters
of Grid sites in real time. It can also keep track of the history of site usage. The
system is based on the MonALISA package (Monitoring Agents using a Large Integrated
Services Architecture) and our own developments. It provides information on the
resources of computational sites, the activity of virtual organizations, and some
parameters of network channels.

Primary author

Mr Igor Tkachev (Joint Institute for Nuclear Research)


Mr Sergey Belov (Joint Institute for Nuclear Research)

