Where to grow?¶
Algorithm |
Where to grow |
|---|---|
Predefined stages. |
|
Best loss-improving split or addition. |
|
Largest next-step gradient gain. |
|
Strongest activation-gradient correlations. |
|
All chosen layers at the same time. |
|
Any chosen layer. |
|
Layers lacking novel orthogonal directions. |
|
Largest residual natural-gradient bottleneck. |
|
Neurons with negative splitting criterion. |
|
Best residual-gradient match. |
|
Underlying width-growth choice. |