Minutes for Regional Operations Meeting DECH (June 30th, 2006)

Attendance:

Peter Kunszt (CSCS);
Andreas Gellrich (DESY);
Peter Wienemann, Hans-Gunther Borrmann (Freiburg);
Sven Hermann <chair>, Clemens Koerdt (FZK);
Christian Peter (ITWM);
Andreas Nowack, Thomas Kress (RWTH);
Ute Karabek, Kläre Cassirer (SCAI).

Missing:

Wuppertal (excused),
Dortmund (excused),
MPPMU,
GSI


1. Introduction


Focus of this meeting (biweekly):
for EGEE-funded and associated sites within region DECH, to improve communication and collaboration with the sites. Discuss site-specific problems, distribute and collect information easily.

Additional communication channels to be added soon (to avoid discussions via the admin lists).

#! Sven to clarify existing lists

#! Everybody: Send major topics in advance (to be included in the agenda)
 

2. Round the Sites (gLite update status, encountered problems, issues to discuss, ops-VO, ..)

*CSCS*
status:    gLite update last week, updating successful, Ops-VO next week
feedback: R-GMA, APEL configuration not straightforward
           
*DESY*
status:    gLite update was already started at the beginning of June, updating was successful, 3 CEs with 250 CPUs up and running, some hardware moves have taken place at the same time, ops-VO operational
Issues:
- Problems with APEL: configuration via YAIM works only for 1 CE per site,
- YAIM FAQ also not helpful in this regard --> Request (YAIM) to provide this in future
- CA-announcements: problems tend to reappear for every new CA update --> Request to bring this forward to the appropriate people

*FZK*
status:    gLite update started on 17.5. and finished on 21.6., ops-VO this week
Issues:   
collecting top 5 MW changes: ---> feedback from sites (#! All)

AG: Are daily site reports necessary now?
SH: No, only one weekly report is necessary, but it can be updated daily if needed; a detailed list of problems is not obligatory but can be helpful; more important is to give a short overview.
           
*GSI*
status: (no site representative present) no gLite update yet (Alice-VO does not want it for the moment)

*ITWM*
status:    update started two weeks ago, updating successful, ops-VO next week
issues:
- usual problems when updating to new major releases, same problems tend to reappear,
- too low priority given to many 'hot' fixes

*MPPMU*
status:    (no site representative present) no gLite update yet (only recently certified site), no ops-VO yet

*RWTH Aachen*
status:    no gLite update yet (waiting for template from Quattor working group), ops-VO operational
issues:   
- dCache related SFT-problems (site opened GGUS ticket: #9885),
- Quattor scripts still not available

AG: Don't expect to have working scripts from CERN available soon. DESY did a quick fix (YAIM component outdated).
SH: Fabric management tools like Quattor are not officially supported by CERN developers. Apparently, it is up to the ROC to provide help with fabric management tools. This was not clear in advance. Better not to wait for complete Quattor scripts from CERN and to start upgrading with what you have (DESY can help). ROC will also address this point (again) in the next ROC managers meeting in order to get feedback from other ROCs/sites (S.H. done).

*SCAI*
status:    current update to gLite 3.0 to be finished by Wednesday 5.7., ops-VO after update
issues:   
- R-GMA configuration problematic, status query changes configuration-entries (GGUS Ticket to be opened)
- LFC for dech-VO: permission problems encountered

AG: The problem is that LFC 2.6. had a different format compared to the new version. Only a complete reinstallation solves this. The database structure has changed considerably. Looking for appropriate conversion scripts...
SH: SCAI, DESY, CSCS and Freiburg should work together on LFC issues
PK: CSCS has delegated problem to VOs.

*Uni Freiburg*
status:    updating starts next week, ops-VO together with update

*Uni Karlsruhe*
status:    (no site representative present) no gLite update or ops-VO yet

*Uni Dortmund*
status:    upgrade to gLite 3.0 performed from June 3rd to 6th, ops-VO operational for weeks.
issues:
- (more MW and rollout feedback was received by mail),
- problems with dependencies in dCache rpm (GGUS ticket 9204)
- proposal for improvement: clearer guidelines needed on what to install and upgrade

*Uni Wuppertal*
status:    (no site representative present) apparently no major problems encountered during update

****

--> #! All:
- please supply ROC with detailed feedback on gLite deployment problems.
- please supply ROC with the top 5 MW changes you would like to have implemented
   

3. ROC-on-duty (EGEE funded effort)

Status:    Workflow is currently being documented. No documents yet.

Handover CSCS - DESY (P.K.): last week rather quiet with some trailing tickets already addressed.

SH: Please contact your ROC or GGUS for feedback on this.   
TK: new support unit for regional CMS related problems in ROC-DECH portal (CMS experts), other support units for different VOs could follow.
SH: Also regional MW Support Unit, announced earlier, for MW related problems.
PK: please use new ROC DECH Wiki (at CSCS) in relation to ROC-DECH-On-Duty


4. Setup of COD-DECH
(EGEE funded effort)
Status: Successful training event held on 21.6. with the COD-CERN team. Next week: first unofficial COD shift for ROC-DECH, with overall responsibility remaining with the Italian COD.

PK: please use CSCS Wiki for this duty as well

5. AOB
 
AG stressed that there are other VOs than LHC that are part of the grid, and that update procedures specifically should respect these VOs' needs. This could also be mentioned at higher levels, as it seems to be a persistent problem in the EGEE/LCG grid (Marcel K, PMB).

-- 
Dr. Sven Hermann sven.hermann@iwr.fzk.de
Forschungszentrum Karlsruhe Tel.: +49-7247-828632
Institute for Scientific Computing / Inst. f. Wissenschaftliches Rechnen
Hermann-von-Helmholtz-Platz 1, 76344 Eggenstein-Leopoldshafen, Germany