Commit e2d0d1e by sauc-abadal-lloret
Parent(s): e636989

Update README.md

README.md CHANGED
    @@ -29,7 +29,7 @@ In particular, the **ALT-RM** checkpoint collects the feedback by leveraging a [
     ```
     Thus, at inference time, the expected aligned behavior can be attained by conditioning the input with the *'Excellent.'* feedback.
     
    -**Related Models:** [ALT-Quark]().
    +**Related Models:** [ALT-Quark](https://huggingface.co/sauc-abadal-lloret/gpt-j-6b-ALT-Quark-tldr).
     
     # Intended uses & limitations
     This model originates from a research project focused on alignment and is intended primarily for research purposes. Commercial use as an off-the-shelf model is discouraged, as it was not designed with such applications in mind. The model is tailored specifically for the summarization task, having been trained on the TL;DR dataset, though some out-of-distribution generalization may be possible for related datasets.
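The README context in this hunk says the aligned behavior is obtained by conditioning the input on the *'Excellent.'* feedback. As a rough illustration only, the sketch below prepends that feedback string to an input post. The helper name `condition_on_feedback` and the template (feedback, one space, then the post) are assumptions made for this example; the exact prompt format is defined in the part of the README not shown in this diff and takes precedence.

```python
def condition_on_feedback(post: str, feedback: str = "Excellent.") -> str:
    """Prepend a language-feedback string to the input text.

    NOTE: the "<feedback> <post>" layout is a hypothetical template chosen
    for illustration; check the model card for the format used in training.
    """
    return f"{feedback} {post.strip()}"


# Placeholder input; a real TL;DR post would go here.
prompt = condition_on_feedback("POST: some Reddit post text\nTL;DR:")
print(prompt)
```

The resulting string would then be tokenized and passed to the model's generation call in the usual way; keeping the feedback as a parameter makes it easy to compare outputs under different feedback strings.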