Implementation tightly bound to gpt-oss

#3
by zinccat - opened

The current implementation is tightly bound to gpt-oss: the forward pass uses a SwiGLU activation that is not exposed for the user to change. There should be a better way to implement this for wider adoption.
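To illustrate the kind of change I mean, here is a minimal sketch in plain Python, not the actual kernel code. It assumes the forward pass could take the gated activation as a parameter, with SwiGLU kept as the default. All names here (`gated_forward`, `swiglu`, `geglu`, `Activation`) are hypothetical and do not exist in this repository, and `swiglu` is the textbook SiLU(gate) * up form, which may differ from the variant the current forward uses.

```python
import math
from typing import Callable, List, Sequence

# Hypothetical type: an activation takes the gate and up projections
# for one element and returns the gated value.
Activation = Callable[[float, float], float]


def swiglu(gate: float, up: float) -> float:
    """Textbook SwiGLU: SiLU(gate) * up (assumed form, not the repo's)."""
    return gate / (1.0 + math.exp(-gate)) * up


def geglu(gate: float, up: float) -> float:
    """GeGLU alternative: exact (erf-based) GELU(gate) * up."""
    return 0.5 * gate * (1.0 + math.erf(gate / math.sqrt(2.0))) * up


def gated_forward(
    gate_proj: Sequence[float],
    up_proj: Sequence[float],
    activation: Activation = swiglu,  # default keeps SwiGLU behaviour
) -> List[float]:
    """Apply a user-selectable gated activation element-wise."""
    if len(gate_proj) != len(up_proj):
        raise ValueError("gate_proj and up_proj must have the same length")
    return [activation(g, u) for g, u in zip(gate_proj, up_proj)]


# Default call uses SwiGLU; other models pass their own activation.
default_out = gated_forward([0.0, 1.0], [2.0, 2.0])
custom_out = gated_forward([0.0, 1.0], [2.0, 2.0], activation=geglu)
```

With something along these lines, existing callers would be unaffected, while other models could pass their own activation without changing the forward pass itself.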
