[circle-resizer] Implementation a new tool for model resizing #14791

mbencer · 2025-03-07T07:39:25Z

In the current moment we have no a capabilities for model resizing and dynamic input specialization.
The best place in the whole tool-chain seems to be operating on circle representation (common solution for all front-ends).

What types of API should be exposed?

We need both C++ and Python API. Python API is especially needed for Python-based frontends, like Pytorch->Circle converter.
There are also some C++ tools run from Python as sub-processes but I am against such approach (very hard to debug, need to determine paths for every usage, no versioning control, each step requires saving the whole model to file).

Why a separate tool instead of extending existing API:

I am just following convention from https://github.com/Samsung/ONE/tree/master/compiler where an each step is realized as a separate app/service. What's more the tool should be easy accessible for front-ends.

Initial use-cases:

In the beginning I am focused on LLM Pytoch models. My first candidate to provide e2e support LLama2.

cmd app

The tool should be accessible also as command line app with args:

--input_model
--output_model
--input_shapes
Supported format of new inputs shapes: comma separated. An example for model with 2 inputs: [1,2],[3,4].

Way of tests:

unit tests (c++ and python parts)
e2e for the models, like: Pytorch->Circle conversion to dynamic version, resizing/shape specialization using circle-resizer, numerical results testing on circle-interpreter

Proposition of implementation:

TODO:

changing shapes of inputs on circle::Model flatbuffer representation
shape inference on model with changed shapes using luci/Pass capabilities
parsing new shapes passed by a user
adjust/expand current shape inference capabilities (luci/Pass/CircleShapeInferencePass) to support requires cases
API to exported changed model to stream/file
exposing resizer API using pybind11
setup.py and python wheel packing/distribution

The text was updated successfully, but these errors were encountered:

mbencer · 2025-03-07T07:40:26Z

DRAFT: #14727

mbencer mentioned this issue Mar 7, 2025

[circle-resizer] Introduce a new tool to models resizing #14727

Draft

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[circle-resizer] Implementation a new tool for model resizing #14791

[circle-resizer] Implementation a new tool for model resizing #14791

mbencer commented Mar 7, 2025 •

edited

Loading

mbencer commented Mar 7, 2025

[circle-resizer] Implementation a new tool for model resizing #14791

[circle-resizer] Implementation a new tool for model resizing #14791

Comments

mbencer commented Mar 7, 2025 • edited Loading

What types of API should be exposed?

Why a separate tool instead of extending existing API:

Initial use-cases:

cmd app

Way of tests:

TODO:

mbencer commented Mar 7, 2025

mbencer commented Mar 7, 2025 •

edited

Loading