StoRM ----- * blocksize mismatch GridFTP (hardcoded) vs. GPFS * SLC4 brings twice the performance of SLC3 * tuning number of streams * CNAF FTS SRM Get timeout 3600 --> 3000 * CMS farm activity saturated number of slots on disk servers * GPFS: high random access latency for software area * GPFS problems due to limited hardware and misconfiguration * better logs needed to distinguish StoRM problems from other problems * better configuration/admin tools needed RAL --- * LHCb RFIO core dumps, not yet understood * CMS tape mounts for skimming halted production * tape servers flaky, probably due to older CASTOR version CASTOR SRM v2.2 --------------- * DB deadlocks * too many DB connections, need more machines and better configuration * CGSI errors unclear * SRM stuck in recv(), cured by timeouts in latest version * timeout on stager calls needed * pinning/GC problem fixed * logging trail being improved * Put/Get processing typically 1-5 s after authentication * moving to SL4, 2.1.7 and new MoU all urgent * more tests to avoid problems in production * test tool to come with release SARA ---- * GSIDCAP server only on SRM node, due to bug that will be fixed * read/write/cache pools separated * queues for GridFTP and GSIDCAP * full pools due to orphaned files removed from PNFS --> FTS timeouts increased, cron job to clean up * slow ATLASDATATAPE --> increased number of movers, extra node * slow staging for LHCb --> more hardware needed * DIRAC staging small (150 MB) files, bad for tape system * SRM reports NEARLINE also for T0D1 when file is only on a write pool --> T0D1 should be made read-write * space token VOMS checking problem fixed * GSIDCAP no longer listening on port 22128, not understood * LFC crashes, fix coming * ATLAS DDM bugs: failures seen as successes and vice versa * LHCb: bringOnline not enough to make status ONLINE, should be fixed * D1 <--> D0 transition function or pinning? --> changeSpaceForFiles not on roadmap, PNFS admin command available * dCache release should highlight configuration changes * should not mix patches with new features (was an accident) * stage tests: - 500 + 50 (different tape) 2 GB files - bringOnline crashes with 500 files --> use "dccp -P" for now - 100 MB/s with pre-stager, else ~60 MB/s DPM at GRIF ----------- * 1.6.10, 64-bit, 100 TB * 250 MB/s transfers without tuning * ATLASGRPDISK needs multiple FQANs --> feature expected in September * XROOTD plugin rpm coming * advanced monitoring tools by Greig Cowan