Performance Summary
Wall Clock Time ()  
Tensor Core Kernel Efficiency (%)  
GPU Utilization (%)  
Total Iterations  
Profiled Iterations  
Start Iteration
Stop Iteration
Average Iteration Time ()