Runner for structured pruning of Keras model in an iterative way. This API has the following methods:
-
__init__(model, input_specs)
Creates a new
IterativePruningRunner
object.- model
- A baseline model to prune. The mode should be an
instance of
keras.Model
. - input_specs
- A single or a list of
tf.TensorSpec
used to represent model input specifications.
-
ana(eval_fn, excludes=None, forced=False)
Performs model analysis. The analysis result will be saved in the '.vai' directory and this cached result will be used directly in subsequent calls unless
forced
is set to True.- eval_fn
- Callable object that takes a
keras.Model
object as its first argument and returns the evaluation score. - excludes
- A list of layer names or layer instances to be excluded from pruning.
- forced
- When set to True, run model analysis is run instead of using the cached analysis result.
-
prune(ratio=None, threshold=None, spec_path=None, excludes=None, mode='sparse')
Prunes the baseline model and returns a sparse model. The degree of pruning can be specified in three ways: ratio, threshold, or pruning specification. The first method is the preferred method; the latter two are more suitable for experiments with manual tuning.
- ratio
- The expected percentage of FLOPs reduction of baseline model. This is a guidance value and the actual FLOPs reduction may not strictly be equal to this value.
- threshold
- Relative proportion of model performance loss between the baseline model and the pruned model.
- spec_path
- Pruning specification path used to prune the model.
- excludes
- A list of layer name or layer instance to be excluded from pruning.
- mode
- The mode in which the baseline model is pruned to return a sparse model.
-
get_slim_model(spec_path=None)
Gets a slim model from a sparse model. You would generally use the latest pruning specification to do this transformation by default. If the sparse model was not generated from the latest specification, a specification path can be provided explicitly.
- spec_path
- Pruning specification path used to transform a sparse model to a slim model.