Attending: Nicolò, Stelios, Noemi, Sioni, Maurizio (Zoom)
- Currently 1 order of magnitude smaller than target on parameter size, 2 orders of magnitude better than target on pixels / W / s
- Currently resource limited by LUTs
- Stelios to look at main consumer of LUTs
- Can latency matching be automated?
- Vladimir’s fellow will work on layer IP stitching
- Not yet shareable - keep an eye on it
- Pruning from Vladimir’s NGT group
- Trying 4 different pruning methods
- Nicolò to follow meeting
- Potential to feed back to Agenium on methods
- Maurizio to provide EPFL paper on structured pruning
- Propose to look at intermediate-layer KD (knowledge distillation)
- Proposals for next steps:
- Study dataflow in more detail
- Small reusable blocks?
- Need some reconfigurability in layer code (e.g. non-constant image dimensions)
- Save resources at the cost of throughput
- Different clock frequencies for different layers
- Yes, it’s possible, but it needs split layer IPs
- Partial reconfiguration
- Look into how fast that is
- Nicolò to deliver the CI / reproducibility pipeline
- Short term plan:
- look into some of the ideas in sandbox environment
- meet again in two weeks
- bring material (slides, diagrams, anything)
- ideally aim to quantify the potential of each idea
- how much will it realistically impact resources, latency, throughput, etc.?
- how feasible is the implementation?