Operations team & Sites

Europe/London
EVO - GridPP Operations team meeting

Description
- This is the biweekly ops & sites meeting.
- The intention is to run the meeting in EVO: http://evo.caltech.edu/evoGate/. Join the meeting in the GridPP Community area.
- The phone bridge number is +44 (0)161 306 6802 (CERN number +41 22 76 71400). The phone bridge ID is 115728 with code: 4880.
Apologies: Chris W
Minutes
    • 11:00 11:20
      Experiment problems/issues 20m
      Review of weekly issues by experiment/VO
      - LHCb
      - CMS
      - ATLAS
      - Other
      - T2K
    • 11:20 11:40
      Meetings & updates 20m
      - ROD team update
      - EGI ops: UMD/EMI tarball release for the worker node. Any news? Minutes of the last EGI ops meeting: https://www.egi.eu/indico/getFile.py/access?resId=0&materialId=minutes&confId=766
      - Nagios status
      - Tier-1 update
      - Security update
      -- T2 issues
      -- General notes. There is a GDB next week: https://indico.cern.ch/conferenceDisplay.py?confId=155065 It will use Vidyo. The previous day there is a TEG outcomes workshop: https://indico.cern.ch/conferenceDisplay.py?confId=158775 This will also use Vidyo. Is there any feedback or comment(s) about the TEG meetings last week?
      - Tickets
      Some sites' tracking of their tickets might be in disarray after Thursday's GGUS hiccup.
      BRUNEL (26th)
      https://ggus.eu/ws/ticket_info.php?ticket=78624
      biomed have batch system problems. No site response as of 16.00 today (30th). The ticket was accidentally closed then reopened, so it might have slipped through the cracks.
      CAMBRIDGE
      https://ggus.eu/ws/ticket_info.php?ticket=78629
      lhcb pilots dying with the unhelpful "Got a job held event, reason: Job failed, no reason given by GRAM server". Did Santanu track down the cause? Last update 26th.
      DURHAM
      https://ggus.eu/ws/ticket_info.php?ticket=78456
      Still some errors due to the certificate expiry the other week. Did everything get restarted that needed to be? Or is this something else? Sam mentions LFC_host today.
      https://ggus.eu/ws/ticket_info.php?ticket=78447
      The ticket has changed focus and now involves a missing $VO_ENMR_EU_SW_DIR variable on the cluster. Being followed up today.
      https://ggus.eu/ws/ticket_info.php?ticket=78428
      Again, the ticket has changed focus and now seems to indicate problems with some biomed users not receiving mappings. 31st - Sam has commented but the ticket is not 'waiting for reply'.
      LANCASTER
      https://ggus.eu/ws/ticket_info.php?ticket=78646
      This is a similar ticket to 75960, with biomed complaining about dodgy output for our site from an lcg-infosites command. I don't think Chris' wrapper will work, because his solution is for a dCache SE. I'll be taking it to the storage group. Do other sites see this kind of problem?
    • 11:40 11:55
      Networking update (MM) 15m
      - Report from December networking meeting at CERN
      - Update on perfSONAR plans
      - Latest on GridMon
    • 11:55 12:10
      Site roundtable 15m
      - Current status, concerns and priorities
    • 12:10 12:15
      Hardware purchasing 5m
      - Status and current issues
    • 12:15 12:16
      AOB 1m
      - CHEP early registration closes tonight: http://www.bnl.gov/chep/default.asp#register
      - Next Tuesday there is a TEG summary day at CERN. The ops meeting will be standing items only and somebody else may chair.