Operations team & Sites

Europe/London
EVO - GridPP Operations team meeting

EVO - GridPP Operations team meeting

Description
- This is the weekly GridPP ops & sites meeting - The intention is to run the meeting in Vidyo: https://vidyoportal.cern.ch/flex.html?roomdirect.html&key=zXhsqAxVnaT6 -- The PIN is 1234. To join via phone see http://information-technology.web.cern.ch/services/fe/howto/users-join-vidyo-meeting-phone for dial in numbers. -- The London (UK) service is on +442030510622 -- The meeting extension is 9308582. Apologies:
Minutes
    • 11:00 11:20
      Experiment problems/issues 20m
      Review of weekly issues by experiment/VO - LHCb - CMS The only CMS issue is that Brunel showed up with a site availability < 40% which a closer look revealed was because of bad connectivity in the Phedex links *to* the T1, i.e. copies from Brunel to T1 fail. Actual jobs running at the site were not affected, so annoying that this shows up as low site readiness(*), but Raul is looking into it, to see if this is a consequence/side-effect of the recent CMS induced tuning of his DPM. (*) Difference between left and right plot. Of course Phedex problems don't trigger alarms unless CMS files a ticket, which they didn't. https://twiki.cern.ch/twiki/bin/viewauth/CMS/CompOpsMeeting#Tier_2_level Bristol still has two CMS GGUS tickets: 106554 and 106325 and RAL T1 has one: 106324. - ATLAS - Other
      ATLAS July availability
    • 11:20 11:40
      Meetings & updates 20m
      With reference to: http://www.gridpp.ac.uk/wiki/Operations_Bulletin_Latest - General updates - WLCG ops coordination - Tier-1 status - Storage and data management - Accounting - Documentation - Interoperation - Monitoring - On-duty - Rollout - Security - Services - Tickets - Tools - VOs - Site updates
    • 11:40 11:55
      Multi-core status and actions 15m
      - The next-steps for follow-up as agreed last week: 1) Oxford to resolve lack of jobs (likely a setting) 2) Gathering of links and deployment pages 3) Test torque scripts at Manchester 4) Lancaster to review SGE queue (gets few jobs) 5) Clarify queue recommendations - setup multi-core queue or use existing ones)
      Multicore-update
    • 11:55 12:10
      Site updates 15m
    • 12:10 12:11
      AOB 1m