Speaker
Description
Provide a set of generic keywords that define your contribution (e.g. Data Management, Workflows, High Energy Physics)
LHC, Monitoring, SAM, High Energy Physics
URL for further information:
https://lcg-sam.cern.ch:8443/sam/sam.py
3. Impact
The work covered by this contribution has substantially improved the efficiency with which the LHC experiments use Grid resources. More accurate and prompt discovery of problems allows them to be fixed as soon as they appear, thus increasing the overall reliability of Grid resources from the experiments' point of view. This information also allows the experiment applications to make better decisions whenever they have a choice of resources to use, for example by not sending jobs to problematic or overloaded computing resources.
1. Short overview
This contribution describes how the LHC experiments implement their own Grid resource monitoring, either with internally developed tools or by reusing tools built for Grid operations, such as the Service Availability Monitor (SAM) used for EGEE operations.
4. Conclusions / Future plans
The need to commission the computing resources available to the experiments before the start of LHC data taking in 2008 requires a constant effort to improve the quality of the monitoring information. The work described here is therefore still ongoing, and we foresee increasing use of the SAM framework by the experiments, both by expanding the current tests and by adding new tests for services that are not yet tested with this methodology.