athirdpath
/

Llama-3-15b-StockTest

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

Llama-3-15b-StockTest / README.md

athirdpath's picture

Update README.md

6e2fed7 verified 7 months ago

|

history blame contribute delete

1.77 kB

	---
	base_model:
	- athirdpath/Llama-3-15b-Instruct-GLUED-Plus
	- athirdpath/Llama-3-15b-OpenBioLexi-GLUED
	- athirdpath/Llama-3-15b-Instruct-CoT
	- athirdpath/Llama-3-15b-HermesPlaying-GLUED
	- athirdpath/Llama-3-15b-Instruct-GLUED
	library_name: transformers
	tags:
	- mergekit
	- merge

	---

	![image/png](https://cdn-uploads.huggingface.co/production/uploads/633a809fa4a8f33508dce32c/GTFQNTsNVeNeAMw73IiGz.png)

	# Llama-3-15b-StockTest

	This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).

	## Merge Details
	### Merge Method

	This model was merged using the [Model Stock](https://arxiv.org/abs/2403.19522) merge method using [athirdpath/Llama-3-15b-Instruct-GLUED](https://huggingface.co/athirdpath/Llama-3-15b-Instruct-GLUED) as a base.

	### Models Merged

	The following models were included in the merge:
	* [athirdpath/Llama-3-15b-Instruct-GLUED-Plus](https://huggingface.co/athirdpath/Llama-3-15b-Instruct-GLUED-Plus)
	* [athirdpath/Llama-3-15b-OpenBioLexi-GLUED](https://huggingface.co/athirdpath/Llama-3-15b-OpenBioLexi-GLUED)
	* [athirdpath/Llama-3-15b-Instruct-CoT](https://huggingface.co/athirdpath/Llama-3-15b-Instruct-CoT)
	* [athirdpath/Llama-3-15b-HermesPlaying-GLUED](https://huggingface.co/athirdpath/Llama-3-15b-HermesPlaying-GLUED)

	### Configuration

	The following YAML configuration was used to produce this model:

	```yaml
	models:
	- model: athirdpath/Llama-3-15b-Instruct-GLUED-Plus
	- model: athirdpath/Llama-3-15b-Instruct-CoT
	- model: athirdpath/Llama-3-15b-OpenBioLexi-GLUED
	- model: athirdpath/Llama-3-15b-HermesPlaying-GLUED

	merge_method: model_stock
	base_model: athirdpath/Llama-3-15b-Instruct-GLUED
	parameters:
	normalize: true
	int8_mask: true
	dtype: float16
	```