WLCG MW Readiness WG 18th meeting Minutes - July 6th 2016
WG twiki
Agenda
Summary
- Old and inactive tickets in JIRA will be closed by the MW Officer after a final verification with the Product owners and/or Volunteer site managers. See here which ones.
- We need to know about upcoming MW product releases. We also have a permanent poll open for new Volunteer sites, especially for MW Readiness verification on CentOS7. See here what we know so far for the near future.
- The agenda topic on the WG Mandate review and meeting frequency was postponed due to lack of participants in this meeting. To attract more people to this effort, we hope to get some publicity at the WLCG Workshop in October.
- Due to summer holidays, WLCG Workshop & CHEP preparations and aftermath, the proposed date for the next meeting is Wed Nov. 2nd @ 4pm CET. Please email the e-group of the WG for comments.
Attendance
- local: Maria Dimou (chair & notes), Maarten Litmaath (ARGUS report), Andrea Manzi (MW Officer), Vincent Brillault (WLCG Security), David Cameron (ATLAS), Andrea Sciabà (CMS).
- remote: Matt Doidge (Lancaster), Di Qing (Triumf).
Minutes of previous meeting
The minutes of the last (17th) meeting HERE are corrected in the Summary and Action List to remove the additional action around the pakiti API documentation, given that the pakiti client won't be used on production systems.
Verification status report
The MWREADY JIRA dashboard shows the latest status info of open tickets. A summary of progress since our last meeting is in the tables below. The following idle tickets on the dashboard are waiting for WG members' comments:
ATLAS workflow Readiness Verification Status:
| MW Product | Version | Volunteer Site(s) | Comments | Verification status |
| dCache | 2.16.x | NDGF | JIRA:MWR-131 | ongoing |
| dCache | 2.13.30 | Triumf | JIRA:MWR-130 | Installed already in prod for Triumf |
| UI bundle | centos7-ui-0.1 | CERN | JIRA:MWR-128, verification on CentOS also for CMS. The version number is just a placeholder. Matt Doidge made the bundle available in CVMFS at the end of May. Cristina Aiftimiei built the rpms for the EMI repo | Please see dedicated discussion slot in this meeting |
| FTS | 3.4.7 | CERN | JIRA:MWR-133, also for CMS | ongoing |
| StoRM | 1.11.11 | CNAF | JIRA:MWR-127 | Completed in prod |
| DPM | 1.8.11 | Edinburgh (UK-SCOTGRID-ECDF) | JIRA:MWR-125 | Completed |
| DPM (srm-less) | 1.8.11 | LAPP Annecy | JIRA:MWR-104, last update in the ticket reports the new DPM 1.8.11 now installed on the LAPP testbed | ongoing |
CMS workflow Readiness Verification Status
Discussions around CentOS 7 UI and WN bundles
- UI bundle on CentOS7 is available in CVMFS and now via RPM (to be tested and pushed to the UMD Preview repo)
- Missing dependencies (cream-cli, wms-cli)
- WN bundle to be prepared. Matt stated that he could work on it; the RPM will then follow (Cristina to be contacted)
- The UI bundle can be tested at CERN, but for the WN one we would need some volunteer sites
- ATLAS already has a testing queue; Edinburgh was interested in this
- It would be great to have both WN and UI ready and released in UMD after the summer
Matt confirmed that Edinburgh is indeed interested in this, because they are moving to large CentOS7 installations on site. Andrea M. will open a JIRA ticket to monitor progress.
From the discussions with the experiments about their workflows: ATLAS has a dedicated twiki linked from their workflow document. CMS and ALICE are in the position described in the Action List below, namely "not yet". LHCb hasn't answered, but they did ask for clients on CentOS7 in the grid application area of CVMFS, in AFS. Indeed, David Smith is working on this port.
Sites that need to move to CentOS7 are interested in getting the MW verified.
WLCG MW Readiness Software Status
Sites' feedback
Special topic
- Major releases coming out this year (that we are aware of)
  - new Cream-CE release (both SL6 and CentOS7) scheduled by the end of the year
  - dCache new golden release 2.16 (already out) (SL6 and CentOS7)
  - DPM 1.9.0 (SL6 and CentOS7) scheduled for November
  - ??
- Looking for CentOS7 volunteers for:
- MW readiness mandate review and products review? Postponed to the next meeting due to lack of participants today.
Report from recent ARGUS meetings
- Argus meeting held May 20
- Argus meeting held June 24
- Next meeting Sep 2
- Main items for MW Readiness:
  - new Argus 1.7 beta rpms were created that fix:
    - the mapping bug that affected simple CMS proxies
    - a few long-standing minor issues, e.g. with the startup scripts
  - the new rpms have been tested on one CentOS7 host in the Argus cluster at CERN
    - the host was repeatedly included in the cluster during a few days
    - its logs were checked for unexpected failures
    - its effects on the shared gridmapdir were checked as well
    - all looked OK!
  - to facilitate the upgrade of the complete cluster, we have asked EGI to copy the new rpms into the UMD Preview repository
    - pending the release notes
    - after a short Staged Rollout phase the release should become official
Maarten foresees smooth operation in the future, as 10% of the ARGUS cluster nodes will point to the UMD Preview repository, so new versions will be tried automatically and then rolled into production.
Actions
Action items Done from past meetings can be found HERE.
- 20160518-02: Expansion of the CentOS7 experiments' intentions to: Pending
- ALICE: Maarten to check and bring the experiment's intentions to the next meeting. So far, ALICE runs on SL6 with binaries built on SL5 and it works, but in the future this might not be the case.
- LHCb: Joel/Stefan to give us experiment intentions
- 20160127-02: David C. and Andrea S. to obtain their experiments' plans concerning EL7 and/or CentOS7. On-going
- ATLAS: Information is collected in this ATLAS twiki. See in particular the statement on ATLAS migration
- CMS: The CMS software built on SLC6 is known to be not binary compatible with an OS other than SLC6. CMS is evaluating a container based approach to allow running SLC6 (or other) binaries on WNs with CC7 or other OS versions. In addition, CMSSW is routinely built on the CC7 architecture as a possible future production architecture. Formal physics validation of CMSSW on CentOS7 hasn't started yet, but CMS is definitely doing more than just building on it.
- 20160127-01: Andrea M., Andrea S., David C., Paul M. to see how the nightly data scratch can be handled so that the Prometheus dCache tests can start (JIRA:MWREADY:36). The last update of this ticket dates from June 2015. If there is no interest currently, we should probably close the ticket and this action. It was decided to close the action and to verify via the JIRA ticket whether Prometheus contains CentOS7 nodes. Prometheus's bio is here, as promised at the meeting. Closed
Next meeting
- Proposed date is Wed Nov. 2nd @ 4pm CET. Objections to the e-group a.s.a.p. please!
AOB
--
MariaDimou - 2016-06-06