bhaskartripathi
commited on
Commit
•
87f7b68
1
Parent(s):
376b347
Update README.md
Browse files
README.md
CHANGED
@@ -26,6 +26,12 @@ library_name: peft
|
|
26 |
**IndicFinGPT** is a specialized transformer model, re-engineered from **EleutherAI's GPT-Neo-125M** architecture, which is a GPT-3 class architecture, designed specifically for the **Indian financial market**. The model has undergone **retraining on its top layers** to enhance its performance in providing insights into the **top 100 companies listed in the NIFTY50 Index, BSE, and NSE exchanges**.
|
27 |
The primary objective of this model is to **serve the unique needs of Indian stock markets** and **investors engaged in chartless trading**. IndicFinGPT aims to provide insights that could **minimize capital loss and drawdowns** while **maximizing financial ratios** such as the **Sharpe, Sortino, Calmar, Omega, and Treynor Ratios**. Additionally, the model is designed to help in **reducing maximum drawdowns** in financial portfolios, offering a robust AI solution tailored to **India’s dynamic financial landscape**.
|
28 |
|
|
|
|
|
|
|
|
|
|
|
|
|
29 |
|
30 |
## First Indic-Stock Small Language Model Focused Top 100 Companies Listed in NSE and BSE Stock Exchanges
|
31 |
|
@@ -35,11 +41,6 @@ The primary objective of this model is to **serve the unique needs of Indian sto
|
|
35 |
</p>
|
36 |
|
37 |
|
38 |
-
## Training Data and Procedure
|
39 |
-
|
40 |
-
**IndicFinGPT 125M** utilizes the **Pile dataset** created by EleutherAI and includes the **top 100 tickers** (by volume and liquidity) from Indian stock markets, covering data from **January 1, 2018, to October 30, 2024**. This dataset encompasses diverse market periods, including **pre-COVID-19 (stable), COVID-19 (volatile), and post-COVID-19 (recovery phase)**. Such comprehensive data exposure allows the model to recognize **problem-solution patterns across various bull and bear runs**.
|
41 |
-
The training data also incorporates **local influences** such as cultural factors and **market-specific volatility**, enhancing its ability to perform **automated technical analysis** for chartless trading. Key capabilities include identifying **classical chart patterns** using technical analysis, conducting **earnings analysis**, interpreting **market sentiment** from multiple sources, and **assessing risks**, all aimed at **improving decision-making for Indian investors**.
|
42 |
-
This model was trained on 310 billion tokens over 692,380 steps. It was trained as a masked autoregressive language model, using cross-entropy loss, F1, Accuracy, Precision, recall,Pattern Detection Rate, and Cross-Entropy Loss.
|
43 |
|
44 |
## Key Highlights
|
45 |
|
@@ -47,7 +48,7 @@ This model was trained on 310 billion tokens over 692,380 steps. It was trained
|
|
47 |
2. Market Sentiment: Built-in understanding of Indian market sentiment and cultural influences
|
48 |
3. Macro-Economic Indicators: Adapted to domestic economic and financial metrics
|
49 |
4. Indian Economic Influences: Awareness of timing, festival impacts, and market-specific volatility
|
50 |
-
|
51 |
|
52 |
## Implementation
|
53 |
|
|
|
26 |
**IndicFinGPT** is a specialized transformer model, re-engineered from **EleutherAI's GPT-Neo-125M** architecture, which is a GPT-3 class architecture, designed specifically for the **Indian financial market**. The model has undergone **retraining on its top layers** to enhance its performance in providing insights into the **top 100 companies listed in the NIFTY50 Index, BSE, and NSE exchanges**.
|
27 |
The primary objective of this model is to **serve the unique needs of Indian stock markets** and **investors engaged in chartless trading**. IndicFinGPT aims to provide insights that could **minimize capital loss and drawdowns** while **maximizing financial ratios** such as the **Sharpe, Sortino, Calmar, Omega, and Treynor Ratios**. Additionally, the model is designed to help in **reducing maximum drawdowns** in financial portfolios, offering a robust AI solution tailored to **India’s dynamic financial landscape**.
|
28 |
|
29 |
+
## Training Data and Procedure
|
30 |
+
|
31 |
+
**IndicFinGPT 125M** utilizes the **Pile dataset** created by EleutherAI and includes the **top 100 tickers** (by volume and liquidity) from Indian stock markets, covering data from **January 1, 2018, to October 30, 2024**. This dataset encompasses diverse market periods, including **pre-COVID-19 (stable), COVID-19 (volatile), and post-COVID-19 (recovery phase)**. Such comprehensive data exposure allows the model to recognize **problem-solution patterns across various bull and bear runs**.
|
32 |
+
The training data also incorporates **local influences** such as cultural factors and **market-specific volatility**, enhancing its ability to perform **automated technical analysis** for chartless trading. Key capabilities include identifying **classical chart patterns** using technical analysis, conducting **earnings analysis**, interpreting **market sentiment** from multiple sources, and **assessing risks**, all aimed at **improving decision-making for Indian investors**.
|
33 |
+
This model was trained on 310 billion tokens over 692,380 steps. It utilized Quantized Low-Rank Adoption as a masked autoregressive language model, utilizing cross-entropy loss, F1, Accuracy, Precision, recall,Pattern Detection Rate, and Cross-Entropy Loss as performance metrics.
|
34 |
+
|
35 |
|
36 |
## First Indic-Stock Small Language Model Focused Top 100 Companies Listed in NSE and BSE Stock Exchanges
|
37 |
|
|
|
41 |
</p>
|
42 |
|
43 |
|
|
|
|
|
|
|
|
|
|
|
44 |
|
45 |
## Key Highlights
|
46 |
|
|
|
48 |
2. Market Sentiment: Built-in understanding of Indian market sentiment and cultural influences
|
49 |
3. Macro-Economic Indicators: Adapted to domestic economic and financial metrics
|
50 |
4. Indian Economic Influences: Awareness of timing, festival impacts, and market-specific volatility
|
51 |
+
|
52 |
|
53 |
## Implementation
|
54 |
|