Help us make Indico better by taking this survey! Aidez-nous à améliorer Indico en répondant à ce sondage !

WLCG-OSG-EGEE Operations meeting

Europe/Zurich
28-R-15 (CERN conferencing service (joining details below))

28-R-15

CERN conferencing service (joining details below)

Nick Thackray
Description
grid-operations-meeting@cern.ch
Weekly OSG, EGEE, WLCG infrastructure coordination meeting.
We discuss the weekly running of the production grid infrastructure based on weekly reports from the attendees. The reported issues are discussed, assigned to the relevant teams, followed up and escalated when needed. The meeting is also the forum for the sites to get a summary of the weekly WLCG activities and plans
Attendees:
  • OSG operations team
  • EGEE operations team
  • EGEE ROC managers
  • WLCG coordination representatives
  • WLCG Tier-1 representatives
  • other site representatives (optional)
  • GGUS representatives
  • VO representatives
  • To dial in to the conference:
    a. Dial +41227676000
    b. Enter access code 0140768

    OR click HERE

    NB: Reports were not received in advance of the meeting from:

  • ROCs: All reports received.
  • VOs:
  • list of actions
    Minutes
      • 16:00 16:01
        Feedback on last meeting's minutes 1m
      • 16:01 16:30
        EGEE Items 29m
        • <big> Grid-Operator-on-Duty handover </big>
          From: Russia / CERN
          To: SouthEast Europe / France


          Issues from Russian ROC::
          1. There are nodes which are not registered or have switch off the monitoring in the GOC DB, but are tested by SAM:
            • prod-bdii.cern.ch
            • se-0-fzk.gridka.de
            • e5grid08.physik.uni-dortmund.de
          2. The site RO-01-ICI (SEE ROC) has a wrong tuned spam filter, so it can not receive COD's e-mails. Ticket #29126 was opened about.
          Issues from CERN ROC::
          1. One site to be raised at operation meeting for suspension:
            VGTU-gLite - Northern ROC.
            https://gus.fzk.de/ws/ticket_info.php?ticket=28603
            https://gus.fzk.de/ws/ticket_info.php?ticket=28596
        • <big> PPS Report & Issues </big>
          PPS reports were not received from these ROCs:
          AP, CE, IT, SEE, SWE

          Issues from EGEE ROCs:
          1. Nothing reported

          Release News:
          1. No release to PPS is scheduled for this week.
          Calls for volunteers:
          1. PPS has started supporting a new VO, blaubert, aimed to the study of financial derivatives in a market based on grid resource trade.
            All PPS sites are kindly invited to start supporting the new VO with resources, in the idea that the more real usage of our system we have, the more effective we are in spotting potential issues with the release.
            The VO is hosted at CNAF and all parameters needed to configure your site can be retrieved directly on the voms server
            https://cert-voms-01.cnaf.infn.it:8443/voms/blaubert/ Many thanks in advance to all adhering sites
          2. In the PPS we’re helping the ATLAS and CMS VOs to try out a solution for allowing VOs to set the relative priorities of their jobs at a site. We’re looking for 1 more volunteer site in PPS, not necessarily experienced with the set-up of this feature. Joining this effort won’t take much work and won’t be for long but it will be relatively high profile and would bring the site "on the spot" with the VOs
            We need to find another site quickly, so a quick response would be very helpful.
        • <big> EGEE issues coming from ROC reports </big>
          1. (ROC France): CEs disappeared from SAM DB. Judit has opened a Savannah bug (https://savannah.cern.ch/bugs/index.php?31229).
          2. (ROC France): IN2P3-CC on 9th of Nov., SiteBDII appears in grey in Gridview, but no downtime and site is OK. It is impacting the overall availability.
            (https://gridview.cern.ch/GRIDVIEW/same_graphs.php?GraphName=sBDII&Information=SiteDetail&DefVO=15&TestVO=-1&DurationOption=hourly&LComponent=-2&NodeID=-1&TestID=-1&StartDay=9&StartMonth=11&StartYear=2007&EndDay=19&EndMonth=11&EndYear=2007&LTier1Site=28&RelOrAvail=Availability&ContAvailFlag=ON&SiteFullName=0
            https://gridview.cern.ch/GRIDVIEW/same_graphs.php?GraphName=IN2P3-CC&Information=SiteDetail&DefVO=15&TestVO=-1&DurationOption=hourly&LComponent=-2&NodeID=-1&TestID=-1&StartDay=9&StartMonth=11&StartYear=2007&EndDay=19&EndMonth=11&EndYear=2007&LTier1Site=28&RelOrAvail=Availability&ContAvailFlag=ON&SiteFullName=0&LTier2Site[]=28)
            Other French have seen the same behaviour. What is grey color for?
        • <big> gLite Release News</big>
          No new updates to production since the last meeting.
        • <big> Request for 2 day delay to release of new CA RPMs</big>
        • <big> Reminder for DN based authentication for VOMS server</big>
      • 16:30 17:00
        WLCG Items 30m
        • <big> WLCG issues coming from ROC reports </big>
          None this week.
        • <big>WLCG Service Interventions (with dates / times where known) </big>
          Link to CIC Portal (broadcasts/news), scheduled downtimes (GOCDB) and CERN IT Status Board
          1. The RAL-LCG2 CMS CASTOR endpoints will be unavailable during upgrades on 2007-11-20 between 08:30 and 17:00 UTC
          2. voms.cern.ch will not provide VOMS proxy until the end of Novemeber 2007 (see broadacast announcement for details).
          3. Site UKI-LT2-UCL-CENTRAL will be down 26-11-2007 00:00 to 27-11-2007 00:00 (UTC)

          Time at WLCG T0 and T1 sites.

        • <big> ATLAS service </big>
        • <big>CMS service</big>
          • No items to raise.
          Speaker: Mr Daniele Bonacorsi (CNAF-INFN BOLOGNA, ITALY)
        • <big> LHCb service </big>
          • No items raised before the meeting.
          Speaker: Dr roberto santinelli (CERN/IT/GD)
        • <big> ALICE service </big>
          • For information: ALICE no longer need the LFC service. All instances of the LFC service for ALICE can be removed.
          Speaker: Dr Patricia Mendez Lorenzo (CERN IT/GD)
        • <big> WLCG Service Coordination </big>
          • Please see the WLCG Service Reliability workshop (home page, agenda), to be held at CERN Nov 26 - 30 2007. We are encouraging (in particular) Tier1 sites to participate(register here). Tier2 sites are naturally very welcome, although a follow-up day, possibly as part of the April 2008 WLCG Collaboration workshop, is currently being studied.

          Deadline for registration: this Thursday

          Speaker: Harry Renshall / Jamie Shiers
      • 17:00 17:30
        OSG Items 30m
        Speaker: Rob Quick (OSG - Indiana University)
      • 17:30 17:35
        Review of action items 5m
        more information
      • 17:35 17:36
        AOB 1m