Workshop on the future of Big Data management
Europe/London
Lecture Theatre 3 (LT3) in the Blackett Laboratory (Imperial College London)
David Colling
(Imperial College Sci., Tech. & Med. (GB)),
Jens Jensen
(CLRC-RAL),
Wahid Bhimji
(University of Edinburgh (GB))
Description
"Big Data" is now being managed by various academic and industry groups.
This workshop is organised by the LHC community and will bring
together participants from a range of disciplines
working with Big Data. The aim is to explore the current and future
challenges in data processing, storage, transfer and preservation.
The workshop focuses on infrastructure, technologies and
tools, with a view to bringing communities together.
This meeting will seek to achieve the following outputs:
- To build a cross-disciplinary Big Data community that can exchange knowledge and best practice and work together as the field evolves.
- To process the meeting discussion afterwards into a working document that represents the current state of knowledge and future plans of Big Data communities.
The video of the event is now available:
Thu: http://tinyurl.com/BigDataImpThu
Fri: http://tinyurl.com/BigDataImpFri
-
10:00
→
10:30
Coffee 30m
- 10:30 → 10:40
-
10:40
→
13:00
Big data needs of different communities
To establish the requirements driving the later discussion.
Convener: Dr David Colling (Imperial College Sci., Tech. & Med. (GB))
- 10:40
- 11:00
-
11:20
Cloud computing and data intensive research 20m
Speaker: Dr Kenji Takeda (Microsoft Research)
- 11:40
-
12:00
Weather forecasting 20m
Speaker: Baudouin Raoult (European Centre for Medium-Range Weather Forecasts)
-
12:20
PanData and the Research Data Alliance 20m
Speaker: Juan Bicarregui (STFC)
- 12:40
-
13:00
→
14:00
Lunch 1h
-
14:00
→
15:00
Big data needs of different communities
To establish the requirements driving the later discussion.
Convener: Jens Jensen (CLRC-RAL)
- 14:00
- 14:20
- 14:40
-
15:00
→
15:20
Tea 20m
-
15:20
→
17:20
Data Storage: Advanced filesystems and interfaces
- Advances in cluster filesystems: Lustre, Ceph, HDFS, GPFS
- Data access interfaces and protocols
- Storage management interfaces
- Advances in storage hardware
- High-throughput storage strategies, caching
Convener: Shaun De Witt (Unknown)
- 15:20
- 15:40
- 16:00
- 16:20
- 16:40
-
17:00
Discussion: Filesystem needs for different communities 20m
- 19:30 → 21:05
-
09:00
→
11:00
Data Processing: Toolkits, data structures, I/O optimisation
- Analysis packages and tools for data processing
- Data visualisation
- Serialisation formats
- Layout and access optimisations
- Benchmarking
Convener: Wahid Bhimji (University of Edinburgh (GB))
- 09:00
- 09:20
- 09:40
-
09:55
Optimising bioinformatics pipelines for clinical genomics 20m
Speaker: Dr Michael Mueller (Imperial College)
- 10:15
- 10:25
-
10:45
Discussion: Building on strengths of tools for all communities 15m
-
11:00
→
11:30
Coffee 30m
-
11:30
→
12:10
Data Storage: Hardware
- Advances in cluster filesystems: Lustre, Ceph, HDFS, GPFS
- Data access interfaces and protocols
- Storage management interfaces
- Advances in storage hardware
- High-throughput storage strategies, caching
Convener: Wahid Bhimji (University of Edinburgh (GB))
- 11:30
-
11:50
High performance storage solutions 20m
Speaker: James Coomer
-
12:10
→
13:10
Lunch 1h
-
13:10
→
14:55
Data Transfer: Protocols and tools
- File transfer services (FTS, iRODS...)
- Remote access (federated data stores...)
Convener: Roger Jones (Lancaster University (GB))
- 13:15
- 13:35
- 13:55
- 14:15
-
14:35
Discussion: Future of data transfer 15m
-
14:55
→
15:35
Data Management: meta-data, data discovery and preservation
Convener: Richard Bantges
- 14:55
- 15:15
-
15:35
→
15:45
Tea 10m
-
15:45
→
16:45
Data management 2: Open access and preservation
Convener: Jens Jensen (CLRC-RAL)
- 15:55
- 16:15
-
16:35
Discussion 10m
- 16:45 → 17:00