6th Inter-experiment Machine Learning Workshop

Name: 6th Inter-experiment Machine Learning Workshop
Start: 2024-01-29T09:00:00+01:00
End: 2024-02-02T19:00:00+01:00
Location: CERN

29 January 2024 to 2 February 2024

CERN

Europe/Zurich timezone

Contact

iml.coordinators@cern.ch

Thinking like Transformers

1 Feb 2024, 11:00

1h 30m

503/1-001 - Council Chamber (CERN)

503/1-001 - Council Chamber

CERN

162

Show room on map

Tutorials

Dr Gail Weiss (EPFL)

Transformers - the purely attention based NN architecture - have emerged as a powerful tool in sequence processing. But how does a transformer think? When we discuss the computational power of RNNs, or consider a problem that they have solved, it is easy for us to think in terms of automata and their variants (such as counter machines and pushdown automata). But when it comes to transformers, no such intuitive model is available.

In this tutorial I will present a programming language, RASP (Restricted Access Sequence Processing), which we hope will serve the same purpose for transformers as finite state machines do for RNNs. In particular, we will discuss the transformer architecture, identify its base components, and abstract them into a small number of primitives which we will then compose into a small programming language: RASP. We will go through some example programs in the language, and discuss how a given RASP program relates to the transformer architecture.

IML2024_Weiss.mp4

Long addition solution in RASP

RASP 2024 02 01.pdf

6th Inter-experiment Machine Learning Workshop

Contact

Thinking like Transformers

503/1-001 - Council Chamber

CERN

Speaker

Description

Presentation materials

Choose timezone

6th Inter-experiment Machine Learning Workshop

Contact

Speaker

Description

Presentation materials