Core-ops tasks

Europe/London
EVO - GridPP Operations team meeting

EVO - GridPP Operations team meeting

Description
- This a meeting for the review of the GridPP ops core tasks - The intention is to run the meeting in Vidyo. Direct link http://vidyoportal.cern.ch/flex.html?roomdirect.html&key=HRHLwpRDyPg7. Pin 1234. - To join via phone see http://information-technology.web.cern.ch/services/fe/howto/users-join-vidyo-meeting-phone for dial in numbers. -- The London (UK) service is on +442030510622 -- The meeting extension is 9313701. Apologies: Chris W.
    • 11:30 11:40
      Documentation 10m
      https://www.gridpp.ac.uk/wiki/Documentation Done in Q1: - several pages updated - improvements to vomsnooper - website migrated To discuss: - https://www.gridpp.ac.uk/php/KeyDocs.php -- Team changes have left some documents without owners - Blogs being updated infrequently Blogs: http://planet.gridpp.ac.uk - Tier-1 - April 2014 - http://gridpp-ops.blogspot.co.uk/2009/01/openssl-vulnerability.html - but relevant! - http://gridpp-storage.blogspot.co.uk - April 2014 - http://londongrid.blogspot.co.uk - June 2013 x- http://nationalgridservice.blogspot.co.uk - October 2012 (SHA2) - http://northgrid-tech.blogspot.co.uk - April 2014 - http://scotgrid.blogspot.co.uk - March 2014 - http://southgrid.blogspot.co.uk - January 2014
    • 11:40 11:50
      Monitoring 10m
      https://www.gridpp.ac.uk/wiki/Monitoring Done in Q1 14: - Collecting UK feedback - Contributing to WLCG monitoring consolidation group -- https://twiki.cern.ch/twiki/bin/view/LCG/WLCGMonitoringConsolidation - Tested site Nagios - Continued development of graphene scripts What next? - consolidation continues - sharing scripts - where does puppet/config fit
    • 11:50 12:00
      Staged rollout 10m
      https://www.gridpp.ac.uk/wiki/Staged_rollout Done in Q1 14: - Pushing/tracking EMI-2 decommisioning - Updating EGI SR contributions -- where are we https://www.gridpp.ac.uk/wiki/Staged_rollout_emi3? - involvement in middleware readiness work. https://twiki.cern.ch/twiki/bin/view/LCG/MiddlewareReadiness#Volunteer_Sites was approved by the MB yesterday.
    • 12:00 12:10
      Core services 10m
      https://www.gridpp.ac.uk/wiki/Core_Grid_services Done in Q1 2014: - perfSONAR - steady progress with site enablements (sites moved to latest mesh/version) - VOMS - https://voms.gridpp.ac.uk:8443/vomses/ -- Distributed VOMS finally active. 4) We need to remove obsolete VOs: e.g. supernemo and minos. What next? - Review VO activity. Still need to to remove obsolete VOs: e.g. supernemo and minos. Testing -- IPv6 (Glasgow, Oxford, IC...) - good progress
    • 12:10 12:20
      Wider VOs 10m
      https://www.gridpp.ac.uk/wiki/Wider_VO_issues Done Q1 14: - Fixing proxy renewal issues - Neiss.org.uk and ILC issues followed up - Non-LHC VO moves to CVMFS What next? - More communities (hyperk). Document lessons learned? - Test DIRAC server running ... trying to get an update! - Push wider WebDAV usage? https://www.gridpp.ac.uk/wiki/WebDAV#Federated_storage_support - No 'quick start' documentation - Future service requirements (e.g. interest in cloud interfaces/resources)
    • 12:20 12:30
      Regional tools 10m
      Done in Q1 14: - Nagios upgrade - VO Nagios instances deployed - Some progress with DiRAC What next? - Push DiRAC testing! - Make VO Nagios more useful? (Is it being used?)
    • 12:30 12:40
      Interoperation 10m
      https://www.gridpp.ac.uk/wiki/Grid_interoperation Done Q1 14: - Continued representation at EGI ops meetings - Start small movements on engagements with DIRAC and on IAAA. What next? - (DC cloud/technical discussions and ops overlaps) - Use of NGI services (e.g. CA and certwizard)
    • 12:40 12:45
      Security 5m
      https://www.gridpp.ac.uk/wiki/Security Done Q1 14: - Various see security report.... Issues/concerns - Some areas not getting attention such as reviewing cloud approaches - Helping with glexec in WN tarball - Sites not always picking up on pakiti warnings .. Security officer not recruited.
    • 12:45 12:50
      Accounting 5m
      https://www.gridpp.ac.uk/wiki/Accounting Done Q1 14: - HS06 updates What next? - Broaden area to 'New technologies and impacts'? (e.g. use of whole node or cloud scheduling, impacts of many core....) -> testing? - Other metric areas
    • 12:50 12:52
      Ticket follow-up 2m
      https://www.gridpp.ac.uk/wiki/Ticket_follow-up - Only obvious issue is that some sites are slow to follow up on certain tickets.
    • 12:52 13:00
      Summary/conclusions/other 8m
      - Focus for next month - glexec and ARGUS enablement at sites. (ongoing)onfId=280057. - ROD Team number being addressed - WN tarball - need a plan for glexec. - CA move to SHA-2 EMI-3/Middleware readiness - More UK site involvement needed - Focus on batch systems - Focus on multi-core - Focus on glue2 and data validation In need of pusing: - RIPE probes proposal - DiRAC usage
    • 13:00 13:01
      AOB 1m
      - Tentative next meeting date Thursday 22nd May.