Deployment team
Tuesday 21 April 2009 -
11:00
Monday 20 April 2009
Tuesday 21 April 2009
11:00
Experiment problems/issues
Experiment problems/issues
11:00 - 11:20
Review of weekly issues by experiment/VO - LHCb -- Prep. for STEP09 - CMS Prep. for STEP09 - ATLAS Prep. for STEP09 - Other -- Have any more sites had problems with Fusion user work? -- Any further feedback on the camont proposal? The T1 has raised a few questions that are being answered via email (will share the summary). - Experiment blacklisted sites: review Which sites are currently blacklisted and why? - Site performance -- http://pprc.qmul.ac.uk/~lloyd/gridpp/ukgrid.html -- ECDF has 100% failure recently -- Cambridge has 50% failure recently -- UCL and IC also have poor success (around 10%)
11:20
ROC update
ROC update
11:20 - 11:45
ROC update *************** - Status of central Nagios: https://gridppnagios.physics.ox.ac.uk/nagios/ - Status of Nagios in each Tier-2. From the EGEE ops meeting: http://indico.cern.ch/conferenceDisplay.py?confId=57117. - Nothing really new at the meeting this week - Manchester's accounting problem was escalated due to inactivity on the ticket. Internal escalation (in the ROD model) can be done differently but sites do need to demonstrate that problems are being worked on and provide updates. From the site reports: - Interesting T1 comment this week: CE-host-cert-valid: This is a non-lhc service The service users are also non-lhc so this comment did not seem to provide a "reason". WLCG update ***************** 8th April GDB. Duncan's report: http://www.gridpp.ac.uk/wiki/GDB_8th_April_2009. What are the key T2/GridPP areas to follow up? Ticket status *************** https://gus.fzk.de/download/escalationreports/roc/html/20090420_EscalationReport_ROCs.html 45327: GFAL for biomed at RHUL. still on hold. 46024: Pheno would like usage data at user level -> sites asked to enable DN 47073: ATLAS spacetokens at Cambridge. Needs to be followed up. 47074: ATLASSCRATCHDISK at Oxford. No user response. Close? 47118: LFC at Liverpool. Fixed but slow. Close? 47342: ILC Tier-1 SE problem. Seems stuck. Brian please prompt again. 47393: LHCb prod jobs stall at MAN-HEP. Waiting for James... 47528: Supernemo access to OX-HEP SE. In progress. 47529: Biomed at TCD issue with job cleanup. Comments from dteam?? 47530: Supernemo access to MAN-HEP SE. Now fixed so close? 47653: VOMS host cert update. With Jens. 47677: RAL T1 SRM ATLAS problem. Marked as solved yesterday. Any other issues?
11:45
Quarterly reports
Quarterly reports
11:45 - 11:50
- First look at T2 reports for Q109 (reports should now be online) -- Any completion issues? -- Any urgent matters arising?
11:50
Benchmarking
Benchmarking
11:50 - 12:00
- Following the DB decision we now need to move sites to the new accounting units. - For this we need to gather site benchmarking data - How far has each Tier-2 got in pushing this forward? - What are the plans (which sites have spec2006 available)? (This needs to be done very soon)
12:00
Team updates
Team updates
12:00 - 12:10
- Short update from each team member (1-2mins) -- Current ongoing work -- Current issues and concerns
12:10
AOB
AOB
12:10 - 12:15