Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
CATIE-AQ
/
FAT5-small
like
1
Follow
CATIE
19
Text2Text Generation
Transformers
PyTorch
Safetensors
4 datasets
French
doi:10.57967/hf/4160
flash_t5
custom_code
Carbon Emissions
arxiv:
11 papers
License:
apache-2.0
Model card
Files
Files and versions
Community
1
Train
Use this model
main
FAT5-small
2 contributors
History:
4 commits
bourdoiscatie
SFconvertbot
Adding `safetensors` variant of this model (
#1
)
f97356d
verified
about 15 hours ago
.gitattributes
1.52 kB
initial commit
about 1 month ago
README.md
14.5 kB
Add DOI
about 1 month ago
adamw_scaled.py
12.3 kB
Add FAT5-small
about 1 month ago
attn_ref.py
868 Bytes
Add FAT5-small
about 1 month ago
config.json
1.86 kB
Add FAT5-small
about 1 month ago
configuration_flash_t5.py
3.01 kB
Add FAT5-small
about 1 month ago
cross_entropy_loss.py
16.1 kB
Add FAT5-small
about 1 month ago
custom_heads_flash_t5.py
12.9 kB
Add FAT5-small
about 1 month ago
flash_attention_v2_bias.py
32.9 kB
Add FAT5-small
about 1 month ago
generation_config.json
147 Bytes
Add FAT5-small
about 1 month ago
model.safetensors
587 MB
LFS
Adding `safetensors` variant of this model (#1)
about 15 hours ago
modeling_flash_t5.py
33.1 kB
Add FAT5-small
about 1 month ago
optimizer.pt
pickle
Detected Pickle imports (4)
"collections.OrderedDict"
,
"torch.FloatStorage"
,
"torch.IntStorage"
,
"torch._utils._rebuild_tensor_v2"
What is a pickle import?
1.17 GB
LFS
Add FAT5-small
about 1 month ago
positional_encoding.py
17.6 kB
Add FAT5-small
about 1 month ago
pytorch_model.bin
pickle
Detected Pickle imports (3)
"collections.OrderedDict"
,
"torch.FloatStorage"
,
"torch._utils._rebuild_tensor_v2"
What is a pickle import?
587 MB
LFS
Add FAT5-small
about 1 month ago
rms_norm.py
7.64 kB
Add FAT5-small
about 1 month ago
rng_state.pth
pickle
Detected Pickle imports (7)
"torch._utils._rebuild_tensor_v2"
,
"torch.ByteStorage"
,
"numpy.dtype"
,
"numpy.ndarray"
,
"numpy._core.multiarray._reconstruct"
,
"_codecs.encode"
,
"collections.OrderedDict"
How to fix it?
14.2 kB
LFS
Add FAT5-small
about 1 month ago
scheduler.pt
pickle
Pickle imports
No problematic imports detected
What is a pickle import?
1.26 kB
LFS
Add FAT5-small
about 1 month ago
special_tokens_map.json
5.71 kB
Add FAT5-small
about 1 month ago
tokenizer.json
2.34 MB
Add FAT5-small
about 1 month ago
tokenizer_config.json
52.2 kB
Add FAT5-small
about 1 month ago
trainer_state.json
3.22 MB
Add FAT5-small
about 1 month ago
training_args.bin
pickle
Detected Pickle imports (9)
"transformers.trainer_utils.SchedulerType"
,
"transformers.training_args.TrainingArguments"
,
"accelerate.utils.dataclasses.DistributedType"
,
"accelerate.state.PartialState"
,
"torch.device"
,
"transformers.training_args.OptimizerNames"
,
"transformers.trainer_utils.IntervalStrategy"
,
"transformers.trainer_pt_utils.AcceleratorConfig"
,
"transformers.trainer_utils.HubStrategy"
How to fix it?
5.24 kB
LFS
Add FAT5-small
about 1 month ago