- We see a single extra cuda malloc or 3 extra hip malloc calls in Setup - @rfhaque please add the call trees here