bugfix: Update modeling_t5.T5Stack.forward() for Gradient Checkpointing
#2 opened by Panda-vid
Update the checkpoint() call so that the arguments for the layer_module object are passed correctly.
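For illustration only, here is a minimal sketch of the kind of argument mismatch a checkpoint() call can run into. It is not the code from this PR. `checkpoint_like` is a hypothetical pure-Python stand-in that mimics only how `torch.utils.checkpoint.checkpoint(function, *args)` forwards its extra arguments positionally, and `layer_forward` with its parameter names is an assumed, simplified signature, not the actual layer_module.forward().

```python
# Hypothetical stand-in for torch.utils.checkpoint.checkpoint(function, *args):
# like the real call, it forwards the extra arguments to `function` positionally.
def checkpoint_like(function, *args):
    return function(*args)


# Hypothetical layer with a simplified, illustrative signature.
def layer_forward(hidden_states, attention_mask=None, position_bias=None,
                  use_cache=False, output_attentions=False):
    # Return the bound values so the effect of each call can be inspected.
    return {
        "hidden_states": hidden_states,
        "attention_mask": attention_mask,
        "position_bias": position_bias,
        "use_cache": use_cache,
        "output_attentions": output_attentions,
    }


# Mismatched call: position_bias is skipped, so every later argument shifts
# one slot to the left and binds to the wrong parameter.
wrong = checkpoint_like(layer_forward, "h", "mask", False, True)

# Matched call: one positional argument per parameter, in signature order.
right = checkpoint_like(layer_forward, "h", "mask", None, False, True)
```

In the mismatched call nothing raises an error; the flags simply bind to the wrong parameters, which is why this class of bug tends to surface only when the layer's signature changes between library versions.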
plenz changed pull request status to closed
The feature only works with older transformers versions.