28-R-15 (CERN conferencing service (joining details below))
CERN conferencing service (joining details below)
firstname.lastname@example.org Weekly OSG, EGEE, WLCG infrastructure coordination meeting.
We discuss the weekly running of the production grid infrastructure based on weekly reports from the attendees. The reported issues are discussed, assigned to the relevant teams, followed up and escalated when needed. The meeting is also the forum for the sites to get a summary of the weekly WLCG activities and plans
OSG operations team
EGEE operations team
EGEE ROC managers
WLCG coordination representatives
WLCG Tier-1 representatives
other site representatives (optional)
To dial in to the conference:
a. Dial +41227676000
b. Enter access code 0140768
<big> Experience of countries/regions with the WMS? </big>
In the UK we are still trying to understand when to move to relying on the WMS and how many we require. What are the experiences of other countries/regions?
Here is some background from a GridPP meeting today:
"The RAL WMS lcgwms01 (SL3 host with gLite-WMS-2.4.9-0 and glite-LB-2.3.5-0) became heavily loaded on 22nd and user throughput suffered as a result. The underlying problem was not understood as the service returned to normal without a clear intervention required. This prompted SL to comment on WMS and RB availability in the UK. He noted 5 RBs (3 RAL; 1 Glasgow and 1 IC). He was only aware of the 1 WMS instance at RAL. As of today, the default server in Glasgow is a gLite 3.1 WMS instance (RB to be removed at the end of July and possibly replaced with another WMS). RAL maintains one test instance on SL4 – to be moved to production after further testing. IC has PPS-glite-WMS.i386 3.1.8-1. This WMS is stable with 20-30,000 jobs a day not causing a problem. NGS has an unadvertised WMS hosted at RAL. Grid Ireland run a WMS and has seen “quite a few issues” while working with users to get their apps working via it. Throughput performance of the WMS is good.
Stephen recently noticed that YAIM will soon be configuring UIs to work with service discovery (WMS and LBs will be discoverable through the information system using appropriate UI commands): https://savannah.cern.ch/bugs/?31211.”
<big> WLCG issues coming from ROC reports </big>
<big> End points for FTM service at tier-1 sites </big>
There is a request to know what are the FTM endpoints at the Tier-1 sites.
We can collect these manually now, but how should the list be kept up-to-date?
<big>WLCG Service Interventions (with dates / times where known) </big>
PIC will have a Scheduled downtime on 5-Aug, from 8:00 CEST (UTC+2) to 20:00 CEST (UTC+2). The SRM and CE services will be down for a dCache upgrade and PBS master migration, respectively. The LHCB-DIRAC2 (lhcb.pic.es) server will also be stopped from 9:00-10:00 (UTC+2) for a cold backup of the MySQL DB.