Second International Symposium on Open Search Technology

Europe/Zurich
Web Meeting

Web Meeting

Andreas Wagner (CERN) , Christian Guetl (TU Graz, Austria) , Michael Granitzer (Univ. Passau. Germany) , Stefan Voigt (Open Search Foundation)
Description

12-14 October 2020
Web Meeting hosted by CERN, Geneva, Switzerlan


Dear Experts, Supporters and Friends of Open Internet Search,

The 2nd international Open Search Symposium (OSSYM 2020) was held as a fully web-based, remote event, hosted by CERN. This event allowed us to dive even deeper into the societal, scientific and technical aspects of open and distributed internet search.

As you know the pandemic has tremendously boosted digital activities of individuals, society and economy as a whole. This continues to underpin how important it is to ensure that orientation in the digital sphere, as well as the access to digital knowledge, information and services via the internet remains neutral, open, democratic and in a privacy respecting way.

As you know, the development of a European Open Internet search community and infrastructure involves expertise from many different scientific and technical fields. It requires profound understanding of internet search technologies and new thinking for services and innovative applications, to be built on an open and distributed Internet search infrastructure in and for Europe.

The Third International Symposium on Open Search Technology will take place at CERN from 11-13 October 2021. More information will be published in due time.

On behalf of the symposium/organizing committee,
Dr. Andreas Wagner, CERN
Prof. Dr. Christian Guetl , TU Graz
Prof. Dr. Michael Granitzer, Univ. Passau
Dr. Stefan Voigt, open search foundation

Important Dates

  • Call for Contributions: 
    • 15 February 2020
  • Submissions of abstracts/papers:
    • opening 15 February 2020, closing 6 July 2020
  • Information of acceptance of contribution:
    • 5 August 2020
  • Symposium registration:
    • opening 16 July 2020 @ 20:00, closing 28 September 2020 (or when maximum number of participants is exceeded)
  • Open Search Symposium 2020, Web Meeting hosted by CERN:
    • 12 – 14 October 2020
  • Online publication of proceedings:
    • (Extended abstracts/full Papers) during winter 2020/2021

 

Registration
OSSYM 2020 Registration Form
98 / 150
    • 11:00 14:20
      Symposium Opening & Keynotes: Welcome & Keynotes
      Convener: Session Chair: Stefan Voigt (opensearchfoundation) (opensearchfoundation)
      • 11:00
        Symposium Opening & Keynotes: Welcome & Keynotes 1m
        Speaker: Chair: Stefan Voigt (opensearchfoundation)
      • 11:01
        Welcome and Introduction 14m
        Speaker: Symposium Organizing Committee
      • 11:15
        Policy Keynote 25m
        Speaker: Pearse O'Donohue (Director, Future Networks, DG CONNECT, European Commission)
      • 11:40
        Keynote: "Towards a Free and Open Internet Search Infrastructure: How Law and the legal experts can help?" 25m
        Speaker: Olivia Tambou (Paris Dauphine University - PSL, France) (Paris Dauphine University - PSL, France)
      • 12:05
        Lunch Break 1h 25m
      • 13:30
        Keynote: "Searching, fast and slow - a tech perspective" 25m
        Speaker: Arjen de Vries (Radboud Universiy, The Netherlands)
    • 14:20 15:20
      Break & Individual Networking 1h
    • 15:20 16:40
      Plenary Session: “Open Search Ecosystems”
      Convener: (tba)
      • 15:21
        Keynote: "Keynote: "Beyond Tech: Raising Awareness For The Open Search Foundation Through A Tailor-Made Communication Approach" 24m
        Speakers: Alexander Decker (Technische Hochschule Ingolstadt, Germany) (Open Search Foundation; Technische Hochschule Ingolstadt) , Ms Cornelia Hiemer
      • 15:45
        "The CERN-Solid collaboration project" 15m

        Author: Maria Dimou
        by Maria Dimou – CERN-Solid collaboration manager
        current data thanks to the CERN-Solid development experts.

        The Web was invented at CERN by Sir Tim Berners-Lee. He defined it as a free, open and networked medium. Ever since Sir Tim went to MIT to create the World Wide Web Consortium (W3C), CERN developed a lot of important web-based applications. Still, as an organisation, CERN stayed away from the evolution of the Web, in terms of its standards and philosophy.
        The management consideration was that the laboratory has to operate the LEP accelerator, prepare and build its successor, the LHC, basically to do physics. This was something also said to Sir Tim when he was programming the Delphi experiment RPCs, while the design and need of the web was clear in his mind.
        In the first years of the web, when Mosaic and Netscape were still bleeding edge of web technology, in-house web development was still taking place in the, then called, CERN Web Office.
        Student Heidi Schuster developed pinaweb (Personal Intelligent Newspaper Agent), a programme written in Java, guessing the user’s taste of web pages visited, creating profile per user and proposing most recent appearances on the web on the matters that interest the user subscribing to the pinaweb service. At that time manipulating via the web didn’t exist, so we found this a very clever and convenient application. Surveillance and intrusion were not yet terms we were conscious of.
        Student Darius Kogut wrote Torch,a search engine understanding natural english language rather than keywords linked only with AND and OR operators. The development of this application was giving us intellectual satisfaction, as we were feeling that we were getting to grips with other disciplines, the understanding of rich human language by the search engine.
        In the end, we did purchase the search engine Infoseek, later called Inktomi. They made us a good price offer, which we refused with the argument: "Your business would not exist, had the web not have been invented at CERN". It worked. The price was symbolic.
        The above projects were approved because our evaluation of Lykos, Altavista and the like was leaving a lot to desire. Also because the time was such that companies had not yet made money out of offering, withholding, manipulating information on the web. Google didn't exist yet. The search results one was getting were probably irrelevant or incomplete, still they were what existed and not what the engine would like to show the users according to its estimation of what is appropriate/relevant/desirable for them.
        By 2001 all these creative activities ended. Commercial solutions were adopted for the web matters of the lab.
        It is true, the CERN IT Department has to support computing applications and ensure smooth operation, and for this it is appreciated by the lab.
        The years coming up to the LHC were critical because the network and storage needs were unprecedented, so we were not sure the technology would make the quantum leap before we actually needed it. Luckily it did. Today's CERN experimental data are massive, still Google and facebook stole the first position and store even more.
        Innovative and pioneer work started to be mostly expected to come from the physics arena. Technology transfer, especially usable by medical applications had first priority and most attention.
        Still, there were several possibilities to link our computing developments with W3C standardisation work.
        For example, in the area of the Worldwide LHC Computing Grid (WLCG) project and operations, where the use of the https protocol for data transfer and remote access to storage related naturally with the work done by the W3C Working Groups.
        There are also design concepts in CERN applications, which can contribute good ideas in areas like the Data Catalogue Vocabulary, cross-service inter-operability and Authentication/Authorisation rules and restrictions.
        Those CERN-W3C proposals remained without answer.
        Then, Sir Tim Berners-Lee announced the Solid project in 2019. Private political or financial interests drove the web away from the principles he invented it for, namely universal, educational and free access to information. In Solid, standards will follow the principle of loose connection between Identity, Data and Applications. This will give the user control of his/her data.
        At this point the climat was ripe for the CERN-Solid collaboration to be born this year.
        The CERN development areas, where interesting exchanges can happen, possibly leading to selective adoption of ideas fed in / taken from the Solid specification effort are:
        • The CERN push notifications project, aiming at a set of official channels that users can subscribe to, in order to receive news. The notifications are unilateral and will be archived.
        • Indico, an event management open source platform, with 20 years of operational status.
        • CS3MESH, a pan-European cross-institution mesh that will offer data sharing/co-editing facilities, relying on the federation of different sites by using well-known APIs.
        • InvenioRDM, a Research Data Management, open source platform for persistent registration of research papers and data.
        The common feature of these applications is the user Authentication needed for their workflows for restricted data.
        Ideas are discussed in a dedicated CERN-Solid gitter channel. For now the exchanges focus on Web Access Control (WAC), as implemented in CERN applications. The design of Solid Access Control Lists is attractive, because it refers to users with URIs that can live anywhere on the web.

        At the time of this Abstract submission (June 2020) nothing is fixed in terms of co-development, because, in the CERN case, the data and the applications belong to CERN and not to the users, so the ACLs do exist but give no freedom to the users as to own them and decide where/how their data will be stored, indexed, accessed.

        Still nothing is more useful in the area of web development than collaboration, awareness of possible malicious incentives by service providers and technical, ethical, ideological preservation of the web founding principles. Its birth place has the duty to contribute in this effort.

        References:
        [0] The web original proposal by Sir Tim Berners-Lee https://www.w3.org/History/1989/proposal.html
        [1] Solid announcement in the press https://www.nytimes.com/2019/11/24/opinion/world-wide-web.html
        [2] The CERN-Solid Indico category https://indico.cern.ch/category/11962/
        [3] The Solid project web site https://solidproject.org
        [4] The CERN Web Office (most data missing today) https://weboffice.web.cern.ch/WebOffice/
        [5] The CERN Torch search engine http://cern.ch/dimou/SApaper.html#torch
        [6] CERN-W3C 2014 proposal https://cern.ch/dimou/personal/CERN-W3C_Collaboration.pdf
        [7] CERN-W3C 2017 proposal https://cern.ch/dimou/personal/CERN-W3C_Collaboration_2017_proposal.pdf
        [8] Push notifications proposal in 2003 ttps://cern.ch/dimou/it-us/zephyr.shtml
        [9] Push notification proposal in 2020 https://codimd.web.cern.ch/p/ry5_j4r2U#/
        [10] Linked Data Notifications: https://www.w3.org/TR/ldn/
        [11] The WebSocket Protocol: https://tools.ietf.org/html/rfc6455
        [12] Indico https://getindico.io/
        [13] The Road to the new CERN Identification https://auth.docs.cern.ch/whitepapers/the-road-to-new-auth/
        [14] CS3 MESH https://silo2.sciencedata.dk/sites/cs3mesh4eosc/
        [15] InvenioRDM https://inveniosoftware.org/

        Speaker: Maria Dimou (CERN)
      • 16:00
        "Ecosia - Fair competition in the market of web search engines" 15m
        Speaker: Wolfgang Oels (Ecosia)
      • 16:15
        "Merging Search Engine Needs" 15m
        Speaker: Mark Overmeer (MarkOv Solution)
      • 16:30
        Summary and Wrap-Up Day 1 10m
        Speaker: Stefan Voigt (opensearchfoundation)
    • 11:00 12:35
      Plenary Session: "The Open Web Index"
      Convener: (tba)
      • 11:00
        Opening Day 2 and Results of Day 1 9m
        Speaker: Andreas Wagner (CERN)
      • 11:09
        Plenary Session: "The Open Web Index" 1m
        Speaker: Chair: Wolf-Tilo Balke (University of Braunschweig, Germany)
      • 11:10
        Keynote: "Towards an Open Web Index: Lessons From the Past" 25m
        Speaker: Michael Völske (Bauhaus-Universität Weimar, Germany) (Bauhaus-Universität Weimar)
      • 11:35
        "Geolocated Learning Environments and CapacityBuilding for tailored support in the context of an Open Web Index" 15m
        Speaker: Melanie Platz (Pedagogical University of Tyrol, Austria)
      • 11:50
        "Experiments using a Distributed Web Crawler to Process and Index Web Archives" 15m
        Speaker: Sebastian Nagel (Common Crawl)
      • 12:05
        "Open Search Use Cases for Improving Information Discovery and Information Retrieval In Large and Highly Connected Organizations" 15m
        Speaker: Igor Jakovljevic (CERN & Graz University of Technology, Austria)
      • 12:20
        "Discovery of Software Innovations using Repository Mining" 15m
        Speaker: Tobias Hecking (German Aerospace Center)
    • 12:35 14:00
      Lunch Break 1h 25m
    • 14:00 15:00
      Working Groups (parallel sessions)
      Convener: Chair: Stefan Voigt (opensearchfoundation)
      • 14:00
        Introduction Working Groups 10m
        Speaker: Chair: Kai Erenli (University of Applied Sciences BFI Vienna, Austria)
      • 14:10
        Applications Working Group 50m
        Speaker: Christian Guetl (Graz University of Technology, Austria)
      • 14:10
        Awareness Working Group: 50m
        Speaker: Alexander Decker (Open Search Foundation)
      • 14:10
        Economy Working Group 50m
        Speaker: Olivier Blanchard (Open Search Foundation)
      • 14:10
        Ethics Working Group 50m
        Speakers: Christine Plote (Open Search Foundation) (Open Search Foundation e.V.) , Anton Frank ( Leibniz Supercomputing Centre (LRZ), Germany) ( Leibniz Supercomputing Centre (LRZ), Germany)
      • 14:10
        Tech Working Group: Tech/Tech Experiments 50m
        Speakers: Michael Granitzer (University of Passau, Germany), Dr Stefan Voigt (Open Search Foundation)
    • 15:00 15:30
      Coffee break 30m
    • 15:30 16:30
      Summary Working Groups & Wrap-up of Day 2
      Convener: Chair: Kai Erenli (University of Applied Sciences BFI Vienna, Austria)
    • 16:30 17:30
      Virtual "Cocktails” and Networking
    • 11:00 12:30
      Plenary Session - “Infrastructure, architecture and beyond”
      Convener: Chair: Maria Dimou (CERN)
      • 11:00
        Opening Day 3 and Results of Day 2 9m
        Speaker: Christian Guetl (Graz University of Technology, Austria)
      • 11:09
        Plenary Session - “Infrastructure, architecture and beyond” 1m
        Speaker: Chair: Maria Dimou (CERN)
      • 11:10
        Keynote: "First thoughts on a data lake architecture for an open search infrastructure" 25m
        Speaker: Leon Martin (University of Bamberg, Germany) et al
      • 11:35
        Crypto-securities and the lex rei sitae rule 15m
        Speaker: Sara Sánchez Fernández (IE Law School, IE University, Spain)
      • 11:50
        Reducing Misinformation in Query Autocompletions 15m
        Speaker: Djoerd Hiemstra (Radboud University, The Netherlands)
      • 12:05
        Improving Open Web Index’s Transparency by Leveraging bloxberg’s Blockchain Technology 15m
        Speaker: Iosif Peterfi (Max Planck Digital Library, Germany)
      • 12:20
        Poster Pitches 10m
        Speaker: Chair: Maria Dimou (CERN)
    • 12:30 13:00
      Poster Session
      • 12:30
        MFAing the Quantum Crypto Proof Mobile Apps 30m
        Speaker: Miika Tuisku (CSC IT Center for Science, Finland)
      • 12:30
        Towards Graph Assisted Query Expansion and Graph Navi-Gation Based Relevance Feedback in Open Search Settings 30m
        Speaker: Aleksandar Bobić (CERN & Graz University of Technology, Austria) et al
      • 12:30
        Web Archive Analytics: Infrastructure & Applications @ Webis 30m
        Speaker: Michael Völske (Bauhaus-Universität Weimar, Germany) et al
      • 12:30
        WEBVR - Making Educational Content a Public Experience 30m
        Speaker: Marc Bastian Rieger (University of Koblenz and Landau, Landau, Germany) et al
    • 13:00 14:00
      Lunch Break 1h
    • 14:00 15:15
      Plenary Session - "Towards an European Open Search Infrastructure"
      Convener: Chair: Stefan Voigt (Open Search Foundation )
      • 14:00
        Plenary Session - "Towards an European Open Search Infrastructure" 1m
        Speaker: Chair: Stefan Voigt (Open Search Foundation)
      • 14:01
        Interdisciplinary, interactive Session and next steps 19m
      • 14:20
        Open Search Experiments 20m
      • 14:40
        Open Search Working Groups (Tech, Legal, Ethics, Economy, Awareness, etc) 20m
      • 15:00
        Any other Business 15m
    • 15:15 15:30
      Symposium Wrap-up, way ahead and closing remarks, Outlook OSSYM 2021
      Conveners: Andreas Wagner (CERN), Stefan Voigt (Open Search Foundation)