This is a GPT2 model uploaded using the KerasNLP library and can be used with JAX, TensorFlow, and PyTorch backends. This model is related to a CausalLM task.

Model config:

  • name: gpt2_backbone
  • trainable: True
  • vocabulary_size: 50257
  • num_layers: 12
  • num_heads: 12
  • hidden_dim: 768
  • intermediate_dim: 3072
  • dropout: 0.1
  • max_sequence_length: 1024

This model card has been generated automatically and should be completed by the model author. See Model Cards documentation for more information.

Downloads last month
12
Inference Providers NEW
This model is not currently available via any of the supported third-party Inference Providers, and the HF Inference API does not support keras-hub models with pipeline type text-generation