Implementation highly binded to gpt oss
#3
by
zinccat
- opened
The current implementation is highly binded to gpt oss, with forward using swiglu that is not exposed to user to change, there should be a better way to implement this for wider adoption