Big Data processing and analysis challenges in mega-science experiments

Europe/Moscow
NRC KI and JINR

NRC KI and JINR

29.01 NRC KI 30.01 JINR
Alexei Klimentov (Brookhaven National Laboratory (US)), Vladimir Korenkov (Joint Inst. for Nuclear Research (RU))
How to come to NRC KI
how to find LIT JINR
link to hotel aerostar
link to hotel Pekin
link to Mariott (Tverskaya Yamskaya)
Схема проезда к НИЦ КИ
    • 09:45 10:00
      Welcome. Dr.V.Velikhov, NRC-KI vice-director; Dr.V.Demin, NBICS vice-director Room 378 (Bldg.190)

      Room 378

      Bldg.190

      Bldg.190 Room 378
      Introduction and logistics. A.Klimentov
    • 10:00 11:50
      Database Technologies Room 274 (Bldg.190)

      Room 274

      Bldg.190

      Bldg.190 Room 274
      Convener: Dr Maria Grigorieva (NRC KI)

      Topics for discussion:

      1. Our experience of the developing of Hybrid SQL/NoSQL Storage, and NoSQL database integration in BigPanDA monitor

      2. Database performance tests : SQL - NoSQL, NoSQL - NoSQL

      3. Technology evaluation tests results for NoSQL databases: MongoDB, HBase, Cassandra, Dremel, CouchDB, MariaDB

      4. Experience of using Hadoop/Spark/MapReduce in PanDA Infrasctucture. Use cases.

      5. Foreseen performance and possible changes in PanDA Oracle archived database schema during/after the LHC Run2

      6. Query routing strategy in BigPanDA applications (BigPanDA Monitor in particular)

      7. How to implement cross database requests in heterogeneous architecture

      8. Strategies of the data modelling for NoSQL databases 

    • 10:00 11:00
      EOS and Data Store
      Convener: Eygene Ryabinkin (National Research Centre Kurchatov Institute (RU))
    • 10:00 11:50
      PanDA@NRC-KI room 278 (Bldg.190)

      room 278

      Bldg.190

      Bldg.190 Room 278
      Convener: Ruslan Mashinistov (Russian Academy of Sciences (RU))
      • 10:00
        PanDA @ NRC-KI 50m
        PanDA instance installation and commissioning status PanDA for ATLAS PanDA beyond LHC and HEP
        Speaker: Ruslan Mashinistov (Russian Academy of Sciences (RU))
        Slides
      • 10:50
        Discussion 40m
        PanDA beyond ATLAS : ALICE Do we need version w/o VOMS, Grid Services… Do we need Light weight DDM Staging and I/O matters
    • 10:00 11:00
      webFTS 234 (Bldg.190)

      234

      Bldg.190

      Bldg.190 Room 234
      Convener: Mr Andrey Kiryanov (B.P. Konstantinov Petersburg Nuclear Physics Institute - PNPI ()
      • 10:00
        Introduction: current status of FTS3/WebFTS 10m
        Speaker: Oliver Keeble (CERN)
        Slides
      • 10:10
        FTS readiness for transfers to/from non-Grid resources (VOMS/non-VOMS credentials, etc.) 10m
      • 10:20
        Impact of Federated Identity support in WebFTS for non-Grid users 10m
        Slides
      • 10:30
        FTS & HPC: old/new APIs, lightweight clients, Python client library for PanDA integration 10m
      • 10:40
        Usage scenario for Titan: requirements from both sides 10m
        Speakers: Mr Andrey Kiryanov (B.P. Konstantinov Petersburg Nuclear Physics Institute - PNPI (), Danila Oleynik (Joint Inst. for Nuclear Research (RU))
      • 10:50
        Feature requests for FTS/WebFTS 10m
    • 11:00 11:50
      Meeting with Big Data Lab PhD students (round table) 207 (Bldg.190)

      207

      Bldg.190

      Bldg.190 Room 207
      Conveners: Alberto Pace (CERN), Dr Markus Schulz (CERN), Massimo Lamanna (CERN), Prof. Shantenu Jha (Rutgers U), Simon C. LIN (Academia Sinica)
      round table panel and participants list
    • 11:50 13:30
      BigPanDA Room 378 (Bldg.190)

      Room 378

      Bldg.190

      Bldg.190 Room 378
      Convener: Dr Alexei Klimentov (Brookhaven National Laboratory (US))
      • 11:50
        Introduction 5m
      • 11:55
        webFTS highlights 10m
        Speakers: Mr Andrey Kiryanov (B.P. Konstantinov Petersburg Nuclear Physics Institute - PNPI (), Dr Markus Schulz (CERN), Oliver Keeble (CERN)
      • 12:05
        Database session highlights 10m
        Speakers: Gancho Dimitrov (CERN), Dr Maria Grigorieva (NRC KI), Ms Marina Golosova (NRC KI), Dr Mario Lassnig (CERN)
        Slides
      • 12:15
        PanDA session highlights 10m
        Speakers: Danila Oleynik (Joint Inst. for Nuclear Research (RU)), Kaushik De (University of Texas at Arlington (US)), Ruslan Mashinistov (Russian Academy of Sciences (RU))
        Slides
      • 12:25
        PanDA HPC pilot 20m
        Speakers: Danila Oleynik (Joint Inst. for Nuclear Research (RU)), Prof. Shantenu Jha (Rutgers U)
        Slides
        • HPC pilot @NRC KI supercomputer
          Speaker: Alexey Poyda (Kurchatov Institute)
          Slides
      • 12:45
        webFTS integration with PanDA 45m
        Speakers: Mr Andrey Kiryanov (B.P. Konstantinov Petersburg Nuclear Physics Institute - PNPI (), Danila Oleynik (Joint Inst. for Nuclear Research (RU)), Kaushik De (University of Texas at Arlington (US)), Oliver Keeble (CERN)
        • FTS & HPC: lightweight clients, Python client library for PanDA integration
          Slides
        • Usage scenario for Titan: requirements from both sides
    • 13:30 14:30
      Lunch 1h
    • 14:30 16:50
      Modeling Distributed Computing Systems 207 (Bldg.190)

      207

      Bldg.190

      Bldg.190, Room 207
      Conveners: Dr Eugene Burnaev (IPPI RAS), Kaushik De (University of Texas at Arlington (US)), Prof. Shantenu Jha (Rutgers U)
    • 14:30 16:00
      Nano-,bio-,information and cognitive technologies Institute seminar 322 (Bldg.348)

      322

      Bldg.348

      Bldg. 348 Room 322
      • 14:30
        "PanDA,  A New Paradigm for Computing in HEP" 40m
        Speaker: Kaushik De (University of Texas at Arlington (US))
        Slides
      • 15:10
        CERN and Computing 40m
        Speaker: Alberto Pace (CERN)
        Slides
    • 14:30 16:30
      visit to I.V.Kurchatov museum and NRC KI Supercomputing center
      doodle poll to visit I.V.Kurchatov museum
      doodle poll to visit NRC KI Tier1 and supercomputi
    • 17:00 17:20
      Leave to JINR by bus from NRC KI 20m
    • 10:00 10:15
      Welcome from Prof.N.Rusakovich, JINR Chief Scientific Secretary 15m
    • 10:15 11:10
      Modeling in mega-projects
      • 10:15
        NICA mega-project 20m
        Speaker: Prof. O.Rohachevsky (JINR)
      • 10:35
        Modeling of grid-cloud systems 25m
        Speaker: Prof. Gennady Ososkov (JINR)
    • 11:10 11:25
      coffee break 15m
    • 11:25 12:25
      BigPanDA project
      • 11:25
        Project overview : Growing PanDA ecosystem, PanDA beyond ATLAS, prospects for ALICE, COMPASS, NICA. 19m
        Speaker: Kaushik De (University of Texas at Arlington (US))
        Slides
      • 11:40
        Towards an Abstractions-driven and Standards-based Next-generation of Distributed Cyberinfrastructure 20m
        Speaker: Prof. Shantenu Jha (Rutgers University)
      • 12:05
        Network awareness 20m
        Speaker: Artem Petrosyan (Joint Inst. for Nuclear Research (RU))
        Slides
    • 14:00 15:00
      lunch 1h
    • 15:00 15:30
      Monitoring
    • 15:30 16:20
      Reports on Tier1
      • 15:30
        ASGC TW Tier-1 20m
        Speaker: Simon Lin (Academia Sinica (TW))
        Slides
      • 15:50
        NRC KI RF Tier-1 15m
        Speaker: Dr Vassily Velikhov (NRC KI)
      • 16:05
        JINR Tier-1 15m
        Speaker: Dr Tatiana Strizh (JINR)
    • 16:20 16:35
      tea break 15m
    • 16:35 16:55
      Presentation of the heterogeneous cluster of LIT 20m
      Speaker: Dr D.Podgainy (JINR)
    • 16:55 17:30
      Excursion in LIT
    • 18:00 21:00
      Beer party
      doodle poll
    • 09:30 13:00
      Technical splinter meetings
      • 09:30
        Distributed Modeling splinter meeting 1h 40m