WLCG-OSG-EGEE Operations meeting

Europe/Zurich
28-R-15 (CERN conferencing service (joining details below))

28-R-15

CERN conferencing service (joining details below)

Maite Barroso Lopez (CERN)
Description
grid-operations-meeting@cern.ch
Weekly OSG, EGEE, WLCG infrastructure coordination meeting.
We discuss the weekly running of the production grid infrastructure based on weekly reports from the attendees. The reported issues are discussed, assigned to the relevant teams, followed up and escalated when needed. The meeting is also the forum for the sites to get a summary of the weekly WLCG activities and plans
Attendees:
  • OSG operations team
  • EGEE operations team
  • EGEE ROC managers
  • WLCG coordination representatives
  • WLCG Tier-1 representatives
  • other site representatives (optional)
  • GGUS representatives
  • VO representatives
  • To dial in to the conference:
    a. Dial +41227676000
    b. Enter access code 0157610

    OR click HERE

    list of actions
    Minutes
  • <big> EGEE issues coming from ROC reports </big>
    Reports were not received from these ROCs:
    1. (CERN ROC): Maintenace day correctly handled by GOC, but timezones in SAM were all wrong. They had the maintenance starting 8 hours earlier than it did, i.e. interpretted GOC time as UTC rather than PST. This lead to SAM reporting filure diring downtime, and affects efficiency stats. https://gusiwr.fzk.de/pages/ticket_details.php?ticket=12884.


    2. (CERN ROC): SAM test in every job on WN seem to take up to 300s at beginning and end of job - this is an enormaous waste of cpu, and makes job turnaround poor. Can we disable it? Do other sites see this, or could it be a local mon box(rgma) problem?


    3. (France ROC): How is a VOMS proxy mapped on a grid node (CE, SE, etc.) using LCMAPS ? Is there an official document that explains this mapping mechanism?


    4. (DECH ROC): 64-bit support: Do others have experience finding workarounds? (in addition to discussion e.g. on LCG Rollout, "Who's planning to move to SL/SLC/CentOS 4.x and when?")


    5. (DECH ROC): Problems with LFC upgrade - Impression: testing/certification of MySQL related middleware features has flaws. Improve MySQL support for the future? Is the current testing of MySQL in PPS enough?


    6. (SE Europe ROC): It seems that CIC daily reports for sites contain incorrect links to SAM failures details as of today: https://gus.fzk.de/pages/ticket_details.php?ticket=20043


    7. (SE Europe ROC): One site in IL reports that they get "submitter proxy expired" ggus ticket https://gus.fzk.de/pages/ticket_details.php?ticket=19854 any ideas?


    8. (UK/I ROC): The site is marked as having failed some replica management tests on 22-03-2007. However, the "details" link does not display any data about this job or the reasons for this job failure.


  • 16:00 16:05
    Feedback on last meeting's minutes 5m
    Minutes
  • 16:30 17:00
    WLCG Items 30m
    Reports were not received from these tier-1 sites: INFN
    Reports were not received from these VOs:

  • 16:55 17:00
    OSG Items 5m
    Item 1
  • 17:00 17:05
    Review of action items 5m
    more information
  • 17:10 17:15
    AOB 5m