tonyc666
/

text_summarization-Q4_K_M-GGUF

Inference Endpoints

Model card Files Files and versions Community

text_summarization-Q4_K_M-GGUF / README.md

tonyc666's picture

Upload README.md with huggingface_hub

7c3187a verified about 2 months ago

|

history blame contribute delete

3.3 kB

	---
	license: apache-2.0
	language:
	- en
	pipeline_tag: summarization
	widget:
	- text: 'Hugging Face: Revolutionizing Natural Language Processing Introduction In
	the rapidly evolving field of Natural Language Processing (NLP), Hugging Face
	has emerged as a prominent and innovative force. This article will explore the
	story and significance of Hugging Face, a company that has made remarkable contributions
	to NLP and AI as a whole. From its inception to its role in democratizing AI,
	Hugging Face has left an indelible mark on the industry. The Birth of Hugging
	Face Hugging Face was founded in 2016 by Clément Delangue, Julien Chaumond, and
	Thomas Wolf. The name Hugging Face was chosen to reflect the company''s mission
	of making AI models more accessible and friendly to humans, much like a comforting
	hug. Initially, they began as a chatbot company but later shifted their focus
	to NLP, driven by their belief in the transformative potential of this technology.
	Transformative Innovations Hugging Face is best known for its open-source contributions,
	particularly the Transformers library. This library has become the de facto standard
	for NLP and enables researchers, developers, and organizations to easily access
	and utilize state-of-the-art pre-trained language models, such as BERT, GPT-3,
	and more. These models have countless applications, from chatbots and virtual
	assistants to language translation and sentiment analysis. '
	example_title: Summarization Example 1
	base_model: Falconsai/text_summarization
	tags:
	- llama-cpp
	- gguf-my-repo
	---

	# tonyc666/text_summarization-Q4_K_M-GGUF
	This model was converted to GGUF format from [`Falconsai/text_summarization`](https://huggingface.co/Falconsai/text_summarization) using llama.cpp via the ggml.ai's [GGUF-my-repo](https://huggingface.co/spaces/ggml-org/gguf-my-repo) space.
	Refer to the [original model card](https://huggingface.co/Falconsai/text_summarization) for more details on the model.

	## Use with llama.cpp
	Install llama.cpp through brew (works on Mac and Linux)

	```bash
	brew install llama.cpp

	```
	Invoke the llama.cpp server or the CLI.

	### CLI:
	```bash
	llama-cli --hf-repo tonyc666/text_summarization-Q4_K_M-GGUF --hf-file text_summarization-q4_k_m.gguf -p "The meaning to life and the universe is"
	```

	### Server:
	```bash
	llama-server --hf-repo tonyc666/text_summarization-Q4_K_M-GGUF --hf-file text_summarization-q4_k_m.gguf -c 2048
	```

	Note: You can also use this checkpoint directly through the [usage steps](https://github.com/ggerganov/llama.cpp?tab=readme-ov-file#usage) listed in the Llama.cpp repo as well.

	Step 1: Clone llama.cpp from GitHub.
	```
	git clone https://github.com/ggerganov/llama.cpp
	```

	Step 2: Move into the llama.cpp folder and build it with `LLAMA_CURL=1` flag along with other hardware-specific flags (for ex: LLAMA_CUDA=1 for Nvidia GPUs on Linux).
	```
	cd llama.cpp && LLAMA_CURL=1 make
	```

	Step 3: Run inference through the main binary.
	```
	./llama-cli --hf-repo tonyc666/text_summarization-Q4_K_M-GGUF --hf-file text_summarization-q4_k_m.gguf -p "The meaning to life and the universe is"
	```
	or
	```
	./llama-server --hf-repo tonyc666/text_summarization-Q4_K_M-GGUF --hf-file text_summarization-q4_k_m.gguf -c 2048
	```