dharun2049
committed on
Commit
•
c981ab2
1
Parent(s):
689dad4
Create About
About
ADDED
@@ -0,0 +1,5 @@
+The use of LLMs in daily life is increasing, but English remains the primary language of nearly all base models. In addition, the current state-of-the-art LLMs
+are built on the transformer architecture, which has become the industry standard. However, the cost of its self-attention mechanism grows quadratically with sequence length,
+which makes it computationally inefficient. We are therefore proposing Cauvery 7B, a 7-billion-parameter large language model currently under development
+that does NOT use the transformer architecture and instead uses the retentive network (RetNet) architecture with its retention mechanism. We are
+in our early stages and are looking for investors.
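To illustrate why a retention mechanism can be cheaper than self-attention at inference time, here is a minimal sketch of single-head retention as described in the published RetNet formulation. It is not Cauvery 7B code: the function names, the toy inputs, and the decay value `gamma = 0.9` are assumptions made for illustration, and details such as positional rotation, multi-scale decay, and normalization are left out. The parallel form compares every pair of positions, much like attention, while the recurrent form carries a fixed-size state from token to token and yields the same outputs.

```python
from typing import List

Matrix = List[List[float]]


def retention_parallel(q: Matrix, k: Matrix, v: Matrix, gamma: float) -> Matrix:
    """Parallel form: out[n] = sum over m <= n of gamma**(n-m) * (q[n] . k[m]) * v[m]."""
    d_v = len(v[0])
    out = []
    for n in range(len(q)):
        row = [0.0] * d_v
        for m in range(n + 1):  # causal: only current and earlier positions
            score = sum(a * b for a, b in zip(q[n], k[m])) * gamma ** (n - m)
            for c in range(d_v):
                row[c] += score * v[m][c]
        out.append(row)
    return out


def retention_recurrent(q: Matrix, k: Matrix, v: Matrix, gamma: float) -> Matrix:
    """Recurrent form: S_n = gamma * S_{n-1} + k_n^T v_n, out[n] = q_n S_n."""
    d_k, d_v = len(k[0]), len(v[0])
    state = [[0.0] * d_v for _ in range(d_k)]  # fixed-size state, independent of length
    out = []
    for q_n, k_n, v_n in zip(q, k, v):
        for a in range(d_k):
            for c in range(d_v):
                state[a][c] = gamma * state[a][c] + k_n[a] * v_n[c]
        out.append([sum(q_n[a] * state[a][c] for a in range(d_k)) for c in range(d_v)])
    return out


if __name__ == "__main__":
    # Toy inputs (hypothetical): 3 tokens, key/query dimension 2, value dimension 2.
    q = [[1.0, 0.0], [0.5, 0.5], [0.0, 1.0]]
    k = [[1.0, 2.0], [0.0, 1.0], [1.0, 0.0]]
    v = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
    print(retention_parallel(q, k, v, 0.9))
    print(retention_recurrent(q, k, v, 0.9))
```

The two functions should agree up to floating-point rounding. The recurrent form does a constant amount of work per new token, whereas the parallel form's work grows with the number of earlier tokens, which is the efficiency argument made above.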