angeloc1
/

llama3dot1DifferentProcesses8

Text Generation

Trained with AutoTrain

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

llama3dot1DifferentProcesses8 / README.md

angeloc1's picture

Update README.md

9449b93 verified 4 months ago

|

history blame contribute delete

1.8 kB

	---
	tags:
	- autotrain
	- text-generation-inference
	- text-generation
	- peft
	- int8
	- BPLLM
	library_name: transformers
	base_model: meta-llama/Meta-Llama-3.1-8B-Instruct
	widget:
	- messages:
	- role: user
	content: What is your favorite condiment?
	license: other
	---

	# Fine-tuned Llama 3.1 8B PEFT int8 for Food Delivery and Reimbursement

	This model was trained for the experiments carried out in the research paper "Conversing with business process-aware Large Language Models: the BPLLM framework".

	It comprises a version of the Llama 3.1 8B model fine-tuned (PEFT with quantization int8) to operate within the context of the Food Delivery and Reimbursement process models (different in terms of activities and events) introduced in the article.

	Further insights can be found in our paper "[Conversing with business process-aware Large Language Models: the BPLLM framework](https://doi.org/10.21203/rs.3.rs-4125790/v1)".

	# Model Trained Using AutoTrain

	This model was trained using AutoTrain. For more information, please visit [AutoTrain](https://hf.co/docs/autotrain).

	# Usage

	```python

	from transformers import AutoModelForCausalLM, AutoTokenizer

	model_path = "PATH_TO_THIS_REPO"

	tokenizer = AutoTokenizer.from_pretrained(model_path)
	model = AutoModelForCausalLM.from_pretrained(
	model_path,
	device_map="auto",
	torch_dtype='auto'
	).eval()

	# Prompt content: "hi"
	messages = [
	{"role": "user", "content": "hi"}
	]

	input_ids = tokenizer.apply_chat_template(conversation=messages, tokenize=True, add_generation_prompt=True, return_tensors='pt')
	output_ids = model.generate(input_ids.to('cuda'))
	response = tokenizer.decode(output_ids[0][input_ids.shape[1]:], skip_special_tokens=True)

	# Model response: "Hello! How can I assist you today?"
	print(response)
	```