	If we have a tokenized sequence \(X = (x_0, x_1, \dots, x_t)\), then the perplexity of \(X\) is
	$$\text{PPL}(X) = \exp \left\{ -\frac{1}{t}\sum_i^t \log p_\theta (x_i \mid x_{<i}) \right\}$$
	where \(\log p_\theta (x_i \mid x_{<i})\) is the log-likelihood of the \(i\)-th token conditioned on the preceding tokens \(x_{<i}\) according to our model. Intuitively, perplexity measures how well the model predicts the tokens of a corpus: a perplexity of \(k\) means the model is, on average, as uncertain as if it were choosing uniformly among \(k\) tokens at each step.
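
The formula can be illustrated with a minimal sketch in plain Python. It assumes the per-token log-likelihoods \(\log p_\theta (x_i \mid x_{<i})\) have already been obtained from some model and are passed in as a list of natural-log values. The function name `perplexity` and its argument `token_log_probs` are hypothetical names chosen for this illustration, not part of any library.

```python
import math


def perplexity(token_log_probs):
    """Exponentiated negative mean of per-token log-likelihoods.

    token_log_probs: natural-log probabilities, one per scored token
    (a hypothetical input format assumed for this sketch).
    """
    if not token_log_probs:
        raise ValueError("need at least one token log-probability")
    # Average negative log-likelihood over the scored tokens.
    avg_nll = -sum(token_log_probs) / len(token_log_probs)
    return math.exp(avg_nll)


# A model that assigns probability 1/4 to every token it scores
# behaves like a uniform choice among 4 options.
uniform_log_probs = [math.log(0.25)] * 10
print(perplexity(uniform_log_probs))  # close to 4.0, up to floating-point error
```

Working with log-probabilities and exponentiating only once at the end avoids multiplying many small probabilities together, which would underflow for long sequences.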