GSoC 2017 - Big Data Tools for Physics Analysis

Location: 31/S-028, CERN (timezone: Europe/Zurich)
Participants: Krishnan, Danilo, Prasanth, Enric

# Status of tasks

* Task information for JS display
 - Via Spark listeners we can obtain the same information the Spark REST API provides, plus extra information about the RDDs of the application and the dependencies between stages. However, this still does not allow us to link tasks to user code.
 - Given the information at our disposal, we will start with a display that shows an event timeline for tasks: the x axis represents time (refreshed as the application runs) and the y axis corresponds to the executors. Tasks will be drawn as rectangles, much like in the Spark UI, and we will also draw vertical lines to mark the start and end of each stage, derived from the tasks shown in the display (a listener sketch follows this list).
 - ACTION: Krishnan will implement a first prototype of the display described above.
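
As a reference for the first prototype, here is a minimal sketch of a Scala listener that could collect the data such a timeline needs (per-task launch/finish times per executor, stage boundaries, and the extra RDD/dependency information mentioned above). Class and field names are illustrative only, not taken from any existing implementation:

```scala
import scala.collection.mutable
import org.apache.spark.scheduler._

// Illustrative listener: collects what an executor-vs-time task timeline needs.
class TimelineListener extends SparkListener {

  // One rectangle in the display: a task running on a given executor.
  case class TaskEntry(taskId: Long, stageId: Int, executorId: String,
                       launchTime: Long, var finishTime: Long = -1L)

  private val tasks = mutable.Map[Long, TaskEntry]()
  private val stageBoundaries = mutable.Map[Int, (Long, Long)]()

  override def onTaskStart(event: SparkListenerTaskStart): Unit = {
    val info = event.taskInfo
    tasks(info.taskId) =
      TaskEntry(info.taskId, event.stageId, info.executorId, info.launchTime)
  }

  override def onTaskEnd(event: SparkListenerTaskEnd): Unit = {
    tasks.get(event.taskInfo.taskId).foreach(_.finishTime = event.taskInfo.finishTime)
  }

  override def onStageSubmitted(event: SparkListenerStageSubmitted): Unit = {
    val info = event.stageInfo
    // Extra information available through listeners: the RDDs involved in the
    // stage and the ids of its parent stages.
    println(s"Stage ${info.stageId}: rdds=${info.rddInfos.map(_.name)} " +
            s"parents=${info.parentIds}")
  }

  override def onStageCompleted(event: SparkListenerStageCompleted): Unit = {
    val info = event.stageInfo
    stageBoundaries(info.stageId) =
      (info.submissionTime.getOrElse(-1L), info.completionTime.getOrElse(-1L))
    // Here the collected data would be serialised and pushed to the notebook
    // frontend, which groups tasks by executorId on the y axis and draws
    // vertical lines at the stage start/end times.
  }
}
```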

 

* Automatic detection of Spark jobs in a cell
 - Krishnan has implemented a Python listener that automatically places a JS display in the output of the cell that triggered a Spark job.
 - ACTION: Krishnan will implement the listener in both Python and Scala, since the Python-only version listens to all the possible event types and not only the job creation events.
 - ACTION: Krishnan will check whether the listener can be configured by setting the spark.extraListeners property on a Python SparkConf object, just as it can be done via an argument of pyspark. Ideally, in SWAN we would create a SparkConf object with some default configuration that the user can extend; part of that default configuration would be the registration of the listener, so that no extra call from the user is needed (see the configuration sketch after this list).

 - ACTION: Krishnan will make sure that the display is always placed in the right cell, also in special cases such as a restarted kernel or a deleted cell.
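
Since spark.extraListeners is an ordinary Spark property read by the SparkContext at startup, it should be settable on a SparkConf in either Scala or pyspark. A minimal Scala sketch of the default configuration SWAN could pre-build, assuming the illustrative listener above is packaged as sparkmonitor.TimelineListener (a placeholder name):

```scala
import org.apache.spark.{SparkConf, SparkContext}

// Default configuration pre-built by SWAN; the listener class name is a placeholder.
val conf = new SparkConf()
  .setAppName("swan-notebook")
  .set("spark.extraListeners", "sparkmonitor.TimelineListener")

// The user can still extend the configuration before creating the context.
// Spark instantiates and registers the listener itself at startup, so no
// extra call from the user is needed.
val sc = new SparkContext(conf)
```

Note that spark.extraListeners takes a comma-separated list of class names, so a user who wants an additional listener can append to the default value instead of replacing it.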

 

* Synchronous execution - frontend-kernel communication
 - The control channel currently carries only a few message types for the kernel, such as "abort".

 - ACTION: Krishnan will try to use the control channel to send "stop job" messages from the frontend to the kernel. He will implement a solution that inspects only the control channel, not the shell channel, by using lower-level primitives in Jupyter.

 

* Code
 - Krishnan placed his code in this repository: https://github.com/krishnan-r/sparkmonitor

 

# Jupyter Spark roadmap

- The Jupyter developers did not reply to our e-mail; we will continue on our own for now.

# Agenda

* 11:30-12:15  Status of assigned tasks (45m)
  We will discuss the progress on the following tasks:
  - Check whether we can obtain more information via Spark listeners than what the Spark REST API provides.
  - It seems that the Spark REST API only provides code information for stages, and it is not user code but rather a stack trace of a Spark action. Check what we get with more complex programs (a map-reduce instead of just a count).
  - Implement a Spark listener in Scala + Python that is fired when a new job starts. When that happens, the corresponding display should appear in the output of the right cell (the one that triggered the job).
  - Investigate further the frontend -> kernel communication. The kernel can be interrupted and then asked to stop the job, but this sometimes makes the kernel hang. Check whether the control channel can be used for this communication, with a thread inspecting it without inspecting the shell channel; otherwise, try to implement a custom communication.
  - First implementation of the display, using RequireJS.

* 12:15-12:35  Contribution to Jupyter-Spark (20m)
  Discuss the answer of the Jupyter team about our collaboration in the project.

* 12:35-12:55  AOB (20m)