UKI Monthly Operations Meeting (TB-SUPPORT)

Europe/London
EVO - GridPP Deployment team meeting

Jeremy Coles
Description
- This is the monthly UKI meeting.
- The intention is to run the meeting in EVO: http://evo.caltech.edu/evoGate/. Join the meeting in the GridPP Community area.
- The UK phone bridge is on +44 (0)161 306 6802. The CERN one is: +41 22 76 71400. The phone bridge ID is 868811 with code: 4880.
- If the CERN phone connection does not work please try Caltech +1 626 395 2112 or DESY +49 40 8998 1346.
- For more information on the UK phone bridge: http://www.ja.net/services/video/agsc/services/evotelephonebridge.html
    • 10:30 10:40
      Security updates 10m
      - News and current concerns from Mingchao
    • 10:40 11:00
      Experiment problems/issues 20m
      CMS: CMS status overview: http://tinyurl.com/dchre7
      LHCb:
      ATLAS:
      You can get the latest experiment-wide information here: https://twiki.cern.ch/twiki/bin/view/LCG/WLCGDailyMeetingsWeek090316
      - Other VO issues
      -- We are losing the opportunity to contribute to (and associate with) worthwhile EGEE activities while CPUs are underutilised.
      -- The GridPP PMB reiterated on Monday a desire to support more EGEE VOs and requested all sites to consider enabling a few new VOs! (Examples: EUMEDGRID, EELA)
    • 11:00 11:05
      Site availability 5m
      We will only visit a couple of issues from this list... the links are included for reference!
      EGEE availability figures for February are recorded in the linked pdf.
      - SAM tests: http://pprc.qmul.ac.uk/~lloyd/gridpp/samtest.html
        Recent performance drops for: Durham, ECDF, Glasgow, Lancaster
      - UK tests: http://pprc.qmul.ac.uk/~lloyd/gridpp/uktest.html
        Failure rates high at: Manchester
      - CMS tests: http://pprc.qmul.ac.uk/~lloyd/gridpp/cms_samtest.html
        Imperial, Oxford, RALPP
      - ATLAS tests: http://pprc.qmul.ac.uk/~lloyd/gridpp/atest.html
        QMUL, BRIS-HEP-01, Oxford
      - LHCb tests: http://pprc.qmul.ac.uk/~lloyd/gridpp/lhcb_samtest.html
        QMUL, RHUL, Durham, Glasgow
      - Accounting: http://www3.egee.cesga.es/gridsite/accounting/CESGA/egee_view.php
        Problems at: IC-LeSC (mid-February), Manchester (mid-January), ECDF (early March), Oxford (end February)
      EGEE-Feb09
    • 11:45 11:55
      ROC/WLCG stuff 10m
      ROC update ***************
      - Today's CERN network interruption.
      - http://gridmap.cern.ch/ has been upgraded and now shows sites sized by the number of logical CPUs (cores). Sites in maintenance are in grey. We need to use this to check the published CPU values [let us know of anything that is wrong]. JC sees: Black (neither) - RAL-LCG2; Glasgow; RALPP; Liverpool. Blue (physical) - Manchester.
      - Reminder that gLite 3.0 support will end in April. How are the CE migrations going for Cambridge, Lancaster, RAL and Imperial?
      - We held a core UKI and DTEAM workshop on Thursday & Friday: http://indico.cern.ch/conferenceDisplay.py?confId=53442
      -- A few topics/actions have UKI-wide implications:
      --- We would like to have Nagios alarms acknowledged or turned off by each site. This will be coordinated by the Tier-2 coordinator in each case. https://gridppnagios.physics.ox.ac.uk/nagios/
      --- There was a question about training needs in the UK. Do any of you have requirements or suggestions?

      T1 news **********
      - The move to the new T1 building is still delayed.
      - The WMS has been suffering since a "mega-patch" was installed last week (problems also now seen at CERN, FZK and GRIF).

      WLCG update ***************
      - The last GDB was on 11th March: http://indico.cern.ch/conferenceDisplay.py?confId=45473
      Topics covered:
      - Accounting policies & other security policies (especially portals)
      - Reporting installed capacity (new information publishing - new YAIM)
      - The ASGC incident
      - Middleware
      - WMS performance
      - CREAM (ALICE, PPS and performance criteria)
      - 64 bit WNs in SL5
      - Multi-user pilot jobs
      gridview-check
    • 11:55 12:05
      Site monitoring & blacklisting 10m
      What are the best links to use to check your site status, and how do you know if your site has been blacklisted by an experiment?
      We are gathering useful links for the blacklisting questions:
      - LHCb: http://lhcb-project-dirac.web.cern.ch/lhcb-project-dirac/lhcbProdnMask.html
      - CMS: Use FCR and the site history for the experiment shown here: http://lhcweb.pic.es/cms/SiteReadinessReports/SiteReadinessReport.html
      - ATLAS: http://panda.cern.ch:25880/server/pandamon/query?dash=prod
        http://gangarobot.cern.ch//blacklist.html
      For site monitoring we have gathered links here: http://www.gridpp.ac.uk/wiki/Links_Monitoring_pages but a simpler summary is expected soon.
      A prototype site view of the dashboard is coming and your feedback is welcomed: http://dashb-siteview.cern.ch/generic/site-monitoring/test.html.
      Finally... if you want to check the Tier-2 MoU resource pledges, these are now also online here: http://gridops.cern.ch/mou/.
    • 12:05 12:10
      AOB 5m
      - Talks from the pre-CHEP WLCG workshop may be of interest: http://indico.cern.ch/conferenceOtherViews.py?view=standard&confId=16861. If you have questions that you would like raised then let us know.
      - GridPP22 1st-2nd April at UCL: http://www.gridpp.ac.uk/gridpp22/. Meeting will focus on service resilience.
      - The next HEPSYSMAN will probably be after HEPiX (Sweden, May).