Edge SpAIce CERN Technical Meeting
Timezone: Europe/Zurich
Location: 40/5-A01 (CERN)
Attending: Nicolò, Stelios, Noemi, Sioni, Maurizio (Zoom)
- Currently 1 order of magnitude below target on parameter size, and 2 orders of magnitude better than target on pixels/W/s
- Currently resource-limited by LUTs
  - Stelios to look at the main consumer of LUTs
 
- Can latency matching be automated?
  - Vladimir’s fellow will work on layer IP stitching
  - Not yet shareable - keep an eye on it
 
- Pruning work from Vladimir’s NGT group
  - Trying 4 different pruning methods
  - Nicolò to follow the meeting
  - Potential to feed back to Agenium on methods
 
- Maurizio to provide EPFL paper on structured pruning
- Proposal to look at knowledge distillation (KD) on intermediate layers
- Proposals for next steps:
- Study dataflow in more detail
  - Where are the bottlenecks?
 
- Small reusable blocks?
  - Need some reconfigurability in layer code (e.g. non-constant image dimensions)
  - Save resources at the cost of throughput
 
- Different clock frequencies for different layers
  - Yes, it’s possible, but it needs split layer IPs
 
- Partial reconfiguration
  - Look into how fast that is
 
- Nicolò to deliver the CI / reproducibility pipeline
- Short-term plan:
  - Look into some of the ideas in a sandbox environment
  - Meet again in two weeks
  - Bring material (slides, diagrams, anything)
  - Ideally aim to quantify the potential of each idea
    - How much will it realistically impact resources, latency, throughput, etc.?
    - How feasible is the implementation?
 
 