Mixed-precision fusions (#2401)
* CPUQuantFusion pass and some usions for converting mixed precision sub-graphs to int8 fused ops * - Added unit tests and misc bug fixes for mixed-precision fusions - Adjust fused sum_scale in quantization builders instead of mkldnn primitive creation
Showing
This diff is collapsed.
This diff is collapsed.
Please
register
or
sign in
to comment