Edge SpAIce CERN Technical Meeting

Europe/Zurich
40/5-A01 (CERN)

Zoom Meeting ID: 63468087547
Host: Sioni Paris Summers

Attending: Nicolò, Stelios, Noemi, Sioni, Maurizio (Zoom)

  • Currently 1 order of magnitude smaller than target on parameter size and 2 orders of magnitude better than target on pixels / W / s
  • Currently resource limited by LUTs
    • Stelios to look at main consumer of LUTs
  • Can latency matching be automated?
  • Vladimir’s fellow will work on layer IP stitching
    • Not yet shareable - keep an eye on it
  • Pruning from Vladimir’s NGT group
    • Trying 4 different pruning methods
    • Nicolò to follow meeting
    • Potential to feed back to Agenium on methods
  • Maurizio to provide EPFL paper on structured pruning
  • Proposal: look at intermediate-layer knowledge distillation (KD)
  • Proposals for next steps:
    • Study dataflow in more detail
      • Where are bottlenecks?
    • Small reusable blocks?
      • Need some reconfigurability in layer code (e.g. non-constant image dimensions)
      • Save resources at the cost of throughput
    • Different clock frequencies for different layers
      • Yes, it’s possible, but needs split layer IPs
    • Partial reconfiguration
      • Look into how fast that is
  • Nicolò to deliver the CI / reproducibility pipeline
  • Short term plan:
    • Look into some of the ideas in a sandbox environment
    • Meet again in two weeks
    • Bring material (slides, diagrams, anything)
      • Ideally aim to quantify the potential of each idea
      • How much will it realistically impact resources, latency, throughput, etc.?
      • How feasible is the implementation?
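On the question of whether latency matching can be automated, the following is a minimal sketch of one possible approach, not the method used in the project. It assumes the design can be described as a directed acyclic graph of layers with a known fixed latency in clock cycles per layer. The function name `match_latencies`, the layer names, and the latency values are all hypothetical. For every edge it computes how many cycles of delay (for example a FIFO or register chain) would be needed so that all inputs of a layer arrive in the same cycle.

```python
from graphlib import TopologicalSorter  # standard library, Python 3.9+


def match_latencies(latency, edges):
    """Compute per-edge delays that align the inputs of every node.

    latency: dict mapping node name -> latency in cycles (assumed fixed).
    edges:   list of (source, destination) pairs forming a DAG.
    Returns (delays, finish), where delays[(src, dst)] is the number of
    cycles to insert on that edge and finish[node] is the cycle in which
    the node produces its output.
    """
    preds = {node: [] for node in latency}
    for src, dst in edges:
        preds[dst].append(src)

    finish = {}
    delays = {}
    # static_order() yields each node only after all of its predecessors.
    for node in TopologicalSorter(preds).static_order():
        # A node can start once its slowest input has arrived.
        start = max((finish[p] for p in preds[node]), default=0)
        for p in preds[node]:
            # Faster branches are padded up to the slowest one.
            delays[(p, node)] = start - finish[p]
        finish[node] = start + latency[node]
    return delays, finish


# Hypothetical two-branch example; names and numbers are illustrative only.
latency = {"input": 0, "conv_a": 12, "conv_b": 5, "add": 1}
edges = [
    ("input", "conv_a"),
    ("input", "conv_b"),
    ("conv_a", "add"),
    ("conv_b", "add"),
]
delays, finish = match_latencies(latency, edges)
print(delays[("conv_b", "add")])  # cycles of padding needed on the short branch
```

In this toy graph the shorter branch (`conv_b`) needs padding equal to the latency difference between the two branches. A real flow would have to take the per-layer latencies from synthesis reports and handle data-dependent latencies, which this sketch does not attempt.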