AlexWortega
commited on
Commit
•
eb9e807
1
Parent(s):
c7ba5ce
Update README.md
Browse files
README.md
CHANGED
@@ -9,9 +9,9 @@ base_model:
|
|
9 |
|
10 |
# Vikhr Salt: Speech And Language Transformer
|
11 |
|
12 |
-
![Vikhr Salt Logo](
|
13 |
|
14 |
-
|
15 |
|
16 |
## Model Authors
|
17 |
|
|
|
9 |
|
10 |
# Vikhr Salt: Speech And Language Transformer
|
11 |
|
12 |
+
![Vikhr Salt Logo](https://huggingface.co/Vikhrmodels/salt-116k/resolve/main/IMG_1304%20copy.png)
|
13 |
|
14 |
+
Vikhr Salt is a multimodal model based on a pre-trained large language model, extended with new audio tokens to handle both TTS (text-to-speech) and ASR (automatic speech recognition) tasks. The model incorporates two variants for encoding audio—Encodec and SpeechTokenizer—and achieves stable training by fine-tuning precision settings. This approach allows Vikhr Salt to leverage pre-existing LLM knowledge while effectively generating and understanding speech, marking a step forward in multimodal learning.
|
15 |
|
16 |
## Model Authors
|
17 |
|