AlexWortega committed
Commit c7ba5ce (1 parent: be7b7b0)
Update README.md

README.md CHANGED
@@ -7,9 +7,11 @@ base_model:
 - TinyLlama/TinyLlama_v1.1
 ---
 
-# Salt
+# Vikhr Salt: Speech And Language Transformer
 
+![Vikhr Salt Logo](IMG_1304_copy.png)
 
+Vikhr Salt is a multimodal model based on a pre-trained large language model, extended with new audio tokens to handle both TTS (text-to-speech) and ASR (automatic speech recognition) tasks. The model incorporates two variants for encoding audio, Encodec and SpeechTokenizer, and achieves stable training by fine-tuning precision settings. This approach allows Vikhr Salt to leverage pre-existing LLM knowledge while effectively generating and understanding speech, marking a step forward in multimodal learning.
 
 ## Model Authors
 
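
Note on the added description: the README says the base LLM is "extended with new audio tokens" but does not say how the token IDs are laid out. The sketch below shows one common way this could be done, appending codec codes after the text vocabulary. It is an illustration only, not the authors' implementation; the class `AudioTokenMap`, its field names, and the sizes used (32000, 1024, 2) are all assumptions made for the example.

```python
from dataclasses import dataclass
from typing import Tuple


@dataclass(frozen=True)
class AudioTokenMap:
    """Hypothetical layout: audio codes are appended after the text vocabulary."""

    text_vocab_size: int  # size of the base LLM vocabulary (assumed value)
    codebook_size: int    # entries per audio codebook (assumed value)
    num_codebooks: int    # number of codebooks kept (assumed value)

    @property
    def total_vocab_size(self) -> int:
        # Text tokens plus one block of IDs per codebook.
        return self.text_vocab_size + self.codebook_size * self.num_codebooks

    def encode(self, codebook: int, code: int) -> int:
        # Map (codebook index, code) to a token ID past the text vocabulary.
        if not 0 <= codebook < self.num_codebooks:
            raise ValueError("codebook index out of range")
        if not 0 <= code < self.codebook_size:
            raise ValueError("code out of range")
        return self.text_vocab_size + codebook * self.codebook_size + code

    def decode(self, token_id: int) -> Tuple[int, int]:
        # Inverse of encode: recover (codebook index, code) from a token ID.
        offset = token_id - self.text_vocab_size
        if not 0 <= offset < self.codebook_size * self.num_codebooks:
            raise ValueError("not an audio token id")
        return divmod(offset, self.codebook_size)


# Assumed sizes, chosen only for illustration.
token_map = AudioTokenMap(text_vocab_size=32000, codebook_size=1024, num_codebooks=2)
audio_id = token_map.encode(codebook=1, code=5)
```

With a layout like this, the model's embedding matrix would need `token_map.total_vocab_size` rows, and both TTS and ASR reduce to ordinary next-token prediction over a mixed sequence of text and audio IDs.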