bhaskartripathi
commited on
Commit
•
4a5a37e
1
Parent(s):
56fca64
Update README.md
Browse files
README.md
CHANGED
@@ -3,200 +3,128 @@ base_model: EleutherAI/gpt-neo-125M
|
|
3 |
library_name: peft
|
4 |
---
|
5 |
|
6 |
-
# Model Card for Model
|
7 |
-
|
8 |
-
<!-- Provide a quick summary of what the model is/does. -->
|
9 |
-
|
10 |
|
|
|
11 |
|
12 |
## Model Details
|
13 |
|
14 |
### Model Description
|
15 |
|
16 |
-
|
17 |
-
|
18 |
-
|
19 |
|
20 |
-
- **Developed by:**
|
21 |
-
- **
|
22 |
-
- **
|
23 |
-
- **
|
24 |
-
- **
|
25 |
-
- **License:** [More Information Needed]
|
26 |
-
- **Finetuned from model [optional]:** [More Information Needed]
|
27 |
|
28 |
-
### Model Sources
|
29 |
-
|
30 |
-
<!-- Provide the basic links for the model. -->
|
31 |
-
|
32 |
-
- **Repository:** [More Information Needed]
|
33 |
-
- **Paper [optional]:** [More Information Needed]
|
34 |
-
- **Demo [optional]:** [More Information Needed]
|
35 |
|
36 |
## Uses
|
37 |
|
38 |
-
<!-- Address questions around how the model is intended to be used, including the foreseeable users of the model and those affected by the model. -->
|
39 |
-
|
40 |
### Direct Use
|
|
|
41 |
|
42 |
-
|
43 |
-
|
44 |
-
[More Information Needed]
|
45 |
-
|
46 |
-
### Downstream Use [optional]
|
47 |
-
|
48 |
-
<!-- This section is for the model use when fine-tuned for a task, or when plugged into a larger ecosystem/app -->
|
49 |
-
|
50 |
-
[More Information Needed]
|
51 |
|
52 |
### Out-of-Scope Use
|
53 |
-
|
54 |
-
<!-- This section addresses misuse, malicious use, and uses that the model will not work well for. -->
|
55 |
-
|
56 |
-
[More Information Needed]
|
57 |
|
58 |
## Bias, Risks, and Limitations
|
59 |
|
60 |
-
|
61 |
-
|
62 |
-
|
63 |
|
64 |
### Recommendations
|
65 |
-
|
66 |
-
<!-- This section is meant to convey recommendations with respect to the bias, risk, and technical limitations. -->
|
67 |
-
|
68 |
-
Users (both direct and downstream) should be made aware of the risks, biases and limitations of the model. More information needed for further recommendations.
|
69 |
|
70 |
## How to Get Started with the Model
|
71 |
-
|
72 |
Use the code below to get started with the model.
|
73 |
|
74 |
-
|
75 |
-
|
76 |
-
## Training Details
|
77 |
-
|
78 |
-
### Training Data
|
79 |
-
|
80 |
-
<!-- This should link to a Dataset Card, perhaps with a short stub of information on what the training data is all about as well as documentation related to data pre-processing or additional filtering. -->
|
81 |
-
|
82 |
-
[More Information Needed]
|
83 |
-
|
84 |
-
### Training Procedure
|
85 |
-
|
86 |
-
<!-- This relates heavily to the Technical Specifications. Content here should link to that section when it is relevant to the training procedure. -->
|
87 |
|
88 |
-
|
|
|
89 |
|
90 |
-
[
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
91 |
|
|
|
|
|
|
|
92 |
|
93 |
-
|
|
|
94 |
|
95 |
-
|
96 |
-
|
97 |
-
#### Speeds, Sizes, Times [optional]
|
98 |
|
99 |
-
|
|
|
100 |
|
101 |
-
|
|
|
|
|
|
|
102 |
|
103 |
## Evaluation
|
104 |
|
105 |
-
<!-- This section describes the evaluation protocols and provides the results. -->
|
106 |
-
|
107 |
### Testing Data, Factors & Metrics
|
108 |
|
109 |
#### Testing Data
|
110 |
-
|
111 |
-
<!-- This should link to a Dataset Card if possible. -->
|
112 |
-
|
113 |
-
[More Information Needed]
|
114 |
-
|
115 |
-
#### Factors
|
116 |
-
|
117 |
-
<!-- These are the things the evaluation is disaggregating by, e.g., subpopulations or domains. -->
|
118 |
-
|
119 |
-
[More Information Needed]
|
120 |
|
121 |
#### Metrics
|
122 |
-
|
123 |
-
|
124 |
-
|
125 |
-
[More Information Needed]
|
126 |
|
127 |
### Results
|
128 |
-
|
129 |
-
[More Information Needed]
|
130 |
-
|
131 |
-
#### Summary
|
132 |
-
|
133 |
-
|
134 |
-
|
135 |
-
## Model Examination [optional]
|
136 |
-
|
137 |
-
<!-- Relevant interpretability work for the model goes here -->
|
138 |
-
|
139 |
-
[More Information Needed]
|
140 |
|
141 |
## Environmental Impact
|
|
|
|
|
|
|
142 |
|
143 |
-
|
144 |
-
|
145 |
-
Carbon emissions can be estimated using the [Machine Learning Impact calculator](https://mlco2.github.io/impact#compute) presented in [Lacoste et al. (2019)](https://arxiv.org/abs/1910.09700).
|
146 |
-
|
147 |
-
- **Hardware Type:** [More Information Needed]
|
148 |
-
- **Hours used:** [More Information Needed]
|
149 |
-
- **Cloud Provider:** [More Information Needed]
|
150 |
-
- **Compute Region:** [More Information Needed]
|
151 |
-
- **Carbon Emitted:** [More Information Needed]
|
152 |
-
|
153 |
-
## Technical Specifications [optional]
|
154 |
|
155 |
### Model Architecture and Objective
|
156 |
-
|
157 |
-
[More Information Needed]
|
158 |
|
159 |
### Compute Infrastructure
|
|
|
160 |
|
161 |
-
|
162 |
-
|
163 |
-
#### Hardware
|
164 |
-
|
165 |
-
[More Information Needed]
|
166 |
-
|
167 |
-
#### Software
|
168 |
-
|
169 |
-
[More Information Needed]
|
170 |
-
|
171 |
-
## Citation [optional]
|
172 |
-
|
173 |
-
<!-- If there is a paper or blog post introducing the model, the APA and Bibtex information for that should go in this section. -->
|
174 |
-
|
175 |
-
**BibTeX:**
|
176 |
-
|
177 |
-
[More Information Needed]
|
178 |
-
|
179 |
-
**APA:**
|
180 |
-
|
181 |
-
[More Information Needed]
|
182 |
-
|
183 |
-
## Glossary [optional]
|
184 |
-
|
185 |
-
<!-- If relevant, include terms and calculations in this section that can help readers understand the model or model card. -->
|
186 |
-
|
187 |
-
[More Information Needed]
|
188 |
-
|
189 |
-
## More Information [optional]
|
190 |
-
|
191 |
-
[More Information Needed]
|
192 |
|
193 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
194 |
|
195 |
-
|
|
|
196 |
|
197 |
## Model Card Contact
|
|
|
198 |
|
199 |
-
[More Information Needed]
|
200 |
### Framework versions
|
|
|
201 |
|
202 |
-
- PEFT 0.13.2
|
|
|
3 |
library_name: peft
|
4 |
---
|
5 |
|
6 |
+
# Model Card for GPT-Neo 125M Market Analysis Model
|
|
|
|
|
|
|
7 |
|
8 |
+
This model is a fine-tuned version of GPT-Neo 125M for financial market analysis and prediction. It specializes in identifying technical patterns, analyzing market sentiment, assessing risk, and generating trading strategy recommendations.
|
9 |
|
10 |
## Model Details
|
11 |
|
12 |
### Model Description
|
13 |
|
14 |
+
The GPT-Neo 125M Market Analysis Model is designed for analyzing stock market data, specifically focusing on the Indian market. It uses fine-tuning through QLoRA (Quantized Low-Rank Adaptation) to adjust the base GPT-Neo 125M model for recognizing market patterns, interpreting sentiment, and providing trading insights.
|
|
|
|
|
15 |
|
16 |
+
- **Developed by:** Bhaskar Tripathi
|
17 |
+
- **Model type:** Causal Language Model (LLM) with financial analysis adaptations
|
18 |
+
- **Language(s) (NLP):** English
|
19 |
+
- **License:** Apache 2.0
|
20 |
+
- **Finetuned from model:** EleutherAI/gpt-neo-125M
|
|
|
|
|
21 |
|
22 |
+
### Model Sources
|
23 |
+
- **Repository:** [Hugging Face Hub Repository](https://huggingface.co/bhaskartripathi/GPT_Neo_Market_Analysis)
|
|
|
|
|
|
|
|
|
|
|
24 |
|
25 |
## Uses
|
26 |
|
|
|
|
|
27 |
### Direct Use
|
28 |
+
The model can be used directly for generating market insights, interpreting technical analysis, and making sentiment-based predictions. It is intended for market analysts, traders, and financial researchers interested in automated market analysis and predictions.
|
29 |
|
30 |
+
### Downstream Use
|
31 |
+
The model can be further fine-tuned for specific financial tasks, integrated into trading bots, or used in financial research applications to provide advanced automated analysis.
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
32 |
|
33 |
### Out-of-Scope Use
|
34 |
+
The model should not be used as the sole basis for making financial decisions. It is not intended for high-frequency trading or as a substitute for human financial advisors. Misuse in making critical financial decisions without human verification could lead to significant financial losses.
|
|
|
|
|
|
|
35 |
|
36 |
## Bias, Risks, and Limitations
|
37 |
|
38 |
+
- The model is specifically tuned for the Indian stock market, and its effectiveness may be limited in other markets.
|
39 |
+
- Predictions are based on historical data and patterns recognized by the model, which may not account for unexpected market events or real-time data changes.
|
40 |
+
- Users should not solely rely on the model for investment decisions; independent verification and diverse sources of market information are recommended.
|
41 |
|
42 |
### Recommendations
|
43 |
+
Users should always verify the model’s outputs against other market data and perform independent analysis to mitigate risks. Financial professionals should be aware of potential biases and use this model as a supplementary tool.
|
|
|
|
|
|
|
44 |
|
45 |
## How to Get Started with the Model
|
|
|
46 |
Use the code below to get started with the model.
|
47 |
|
48 |
+
```python
|
49 |
+
from transformers import AutoTokenizer, AutoModelForCausalLM
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
50 |
|
51 |
+
model = AutoModelForCausalLM.from_pretrained("bhaskartripathi/GPT_Neo_Market_Analysis")
|
52 |
+
tokenizer = AutoTokenizer.from_pretrained("bhaskartripathi/GPT_Neo_Market_Analysis")
|
53 |
|
54 |
+
input_text = '''[INST] Given the following stock market data and technical analysis:
|
55 |
+
Stock: EXAMPLE
|
56 |
+
Date: 2024-01-01
|
57 |
+
Technical Analysis:
|
58 |
+
Current Price: ₹100
|
59 |
+
Daily Range: ₹98 - ₹102
|
60 |
+
Trading Volume: 1,000,000
|
61 |
+
RSI: 55
|
62 |
+
MACD: Bullish
|
63 |
+
Based on this technical analysis, what is the likely price movement for tomorrow and why? [/INST]'''
|
64 |
|
65 |
+
inputs = tokenizer(input_text, return_tensors="pt")
|
66 |
+
outputs = model.generate(**inputs, max_new_tokens=50)
|
67 |
+
result = tokenizer.decode(outputs[0], skip_special_tokens=True)
|
68 |
|
69 |
+
print(result)
|
70 |
+
```
|
71 |
|
72 |
+
## Training Details
|
|
|
|
|
73 |
|
74 |
+
### Training Data
|
75 |
+
The model was fine-tuned using a custom dataset of Indian stock market data, including technical analysis patterns, trading signals, market sentiment, and risk metrics. The dataset included historical market prices, technical indicators, news sentiment, and other financial metrics.
|
76 |
|
77 |
+
### Training Procedure
|
78 |
+
- **Training regime:** Mixed precision training (fp16) with QLoRA for efficient parameter adaptation using 4-bit quantization.
|
79 |
+
- **Hardware Type:** Nvidia T4 GPU
|
80 |
+
- **Hours used:** Approximately 6 hours
|
81 |
|
82 |
## Evaluation
|
83 |
|
|
|
|
|
84 |
### Testing Data, Factors & Metrics
|
85 |
|
86 |
#### Testing Data
|
87 |
+
The model was evaluated on a validation dataset from the Indian stock market, which includes unseen technical analysis data, price movements, and sentiment data.
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
88 |
|
89 |
#### Metrics
|
90 |
+
- **Prediction Accuracy**: Evaluated for market movements.
|
91 |
+
- **Sentiment Correlation**: Assessed for accuracy in sentiment interpretation from news and social media.
|
92 |
+
- **Pattern Recognition Precision**: Accuracy in detecting predefined technical patterns.
|
|
|
93 |
|
94 |
### Results
|
95 |
+
The model performed well on predicting price movements based on technical analysis and sentiment inputs, with high accuracy in identifying well-known technical patterns.
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
96 |
|
97 |
## Environmental Impact
|
98 |
+
- **Hardware Type:** Nvidia T4 GPU
|
99 |
+
- **Hours used:** Approximately 6 hours
|
100 |
+
- **Carbon Emitted:** Estimated using [ML CO2 Impact Calculator](https://mlco2.github.io/impact#compute).
|
101 |
|
102 |
+
## Technical Specifications
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
103 |
|
104 |
### Model Architecture and Objective
|
105 |
+
The model uses the GPT-Neo 125M architecture, fine-tuned using QLoRA for efficient adaptation to financial analysis tasks.
|
|
|
106 |
|
107 |
### Compute Infrastructure
|
108 |
+
The model was fine-tuned using Google Colab Pro with an Nvidia T4 GPU.
|
109 |
|
110 |
+
## Citation
|
111 |
+
If you use this model, please cite:
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
112 |
|
113 |
+
```bibtex
|
114 |
+
@misc{tripathi2024gptneomarket,
|
115 |
+
title={GPT-Neo 125M Market Analysis Model},
|
116 |
+
author={Bhaskar Tripathi},
|
117 |
+
year={2024},
|
118 |
+
url={https://huggingface.co/bhaskartripathi/GPT_Neo_Market_Analysis}
|
119 |
+
}
|
120 |
+
```
|
121 |
|
122 |
+
## More Information
|
123 |
+
For more information, reach out to [Bhaskar Tripathi](https://huggingface.co/bhaskartripathi).
|
124 |
|
125 |
## Model Card Contact
|
126 |
+
For any questions or issues, please contact: [email protected]
|
127 |
|
|
|
128 |
### Framework versions
|
129 |
+
- PEFT 0.13.2
|
130 |
|
|