Skip to main content

QuantTess.h File

QuantTess Node — fused CVU kernel that quantizes FP32→INT8 then tessellates. More...

Included Headers

#include "builder/NodeContractConfigurable.h" #include "builder/NodeContractProvider.h" #include "builder/Node.h" #include <nlohmann/json.hpp> #include <memory> #include <optional> #include <string> #include <vector>

Namespaces Index

namespacesimaai
namespaceneat
namespacenodes

Classes Index

structQuantTessOptions

Construction options for a QuantTess Node. More...

classQuantTess

Fused CVU kernel Node: Quant (FP32→INT8) followed by Tess. More...

Description

QuantTess Node — fused CVU kernel that quantizes FP32→INT8 then tessellates.

Combines an INT8 quantize and a tessellate layout shuffle into a single CVU pass. Inserted by the route planner before the MLA when the model expects INT8 input and MLA-side tessellation is not part of the compiled MLA kernel — fusing the two stages saves a DDR round-trip versus running Quant and Tess back to back.

See Also

"The dtype contract" page in /concepts/dtype_contract


Generated via doxygen2docusaurus 2.0.0 by Doxygen 1.9.8.