Keep getting 'model_kwargs` are not used by the model: ['token_type_ids']
#60 opened almost 2 years ago
by
uglydumpling
Falcon models slow inference
10
#59 opened almost 2 years ago
by
mikeytrw
I need an API of Falcon
8
#56 opened almost 2 years ago
by
JustMe4Real
Google Colab for Falcon 40B and 7B with Live Response Streaming
3
#55 opened almost 2 years ago
by
gaodrew
can anyone help me get prompt template for Question Answering model
2
#54 opened almost 2 years ago
by
Iamexperimenting
Might be interesting to have a thread on people with Successful Implementations, and on what kind of hardware..
1
#53 opened almost 2 years ago
by
LinuxMagic
Batch inference seems to be done sequentially
3
#50 opened almost 2 years ago
by
yard1
Extracting attention maps
#49 opened almost 2 years ago
by
roeehendel
Error with custom inference loop with past_key_values
26
#48 opened almost 2 years ago
by
dimaischenko

Fix the kv-cache dimensions
1
#47 opened almost 2 years ago
by
cchudant

Multi GPU inference issue
1
#39 opened almost 2 years ago
by
srinivasbilla
Is it on purpose? loss for singlelable and multilable switched.
#36 opened almost 2 years ago
by
rhy2023
Fine-tuning on a new language
4
#35 opened almost 2 years ago
by
AliMirlou

Flash attention
2
#34 opened almost 2 years ago
by
utensil
about evaluating on humaneval
#33 opened almost 2 years ago
by
dongZheX
Finetune on "uncensored" dataset?
1
#32 opened almost 2 years ago
by
sivarajan
Tokenizer Details
#31 opened almost 2 years ago
by
kye

Import dataset and chat with it
2
#27 opened almost 2 years ago
by
phdykd
Working code with full server requirements
2
#24 opened almost 2 years ago
by
gmjolt
Bug: Generate method doesn't work for falcon-7b and falcon-40b in int8 mode.
#22 opened almost 2 years ago
by
avacaondata

It can run with two 4090 or a single 6000 ADA.
5
#20 opened almost 2 years ago
by
znsoft
Finetune wtih QLoRA please
7
#14 opened almost 2 years ago
by
supercharge19
How to set trust_remote_code to true?
13
#9 opened almost 2 years ago
by
gmjolt
[Bug] Does not work
58
#3 opened almost 2 years ago
by
catid
