shellwork committed on
Commit
3797f93
1 Parent(s): 3a4eeb1

Update README.md

  1. README.md +6 -4
README.md CHANGED
@@ -8,11 +8,13 @@ pipeline_tag: question-answering
  tags:
  - biology
  - medical
+ datasets:
+ - shellwork/ChatParts_Dataset
  ---
 
  # shellwork/ChatParts-llama3.1-8b
 
- 🤖 [XJTLU-Software RAG GitHub Repository](https://github.com/shellwork/XJTLU-Software-RAG/tree/main) • 📊 [ChatParts Dataset](https://www.modelscope.cn/datasets/shellwork/ChatParts_Dataset)
+ 🤖 [XJTLU-Software RAG GitHub Repository](https://github.com/shellwork/XJTLU-Software-RAG/tree/main) • 📊 [ChatParts Dataset](https://huggingface.co/datasets/shellwork/ChatParts_Dataset)
 
  **shellwork/ChatParts-llama3.1-8b** is a specialized dialogue model fine-tuned from **Meta-Llama-3.1-8B-Instruct** by the XJTLU-Software iGEM Competition team. This model is tailored for the synthetic biology domain, aiming to assist competition participants and researchers in efficiently collecting and organizing relevant information. It serves as the local model component of the XJTLU-developed Retrieval-Augmented Generation (RAG) software, enhancing search and summarization capabilities within synthetic biology data.
 
@@ -24,7 +26,7 @@ The model is trained on a comprehensive synthetic biology-specific dataset curat
  - **Synthetic Biology Review Papers:** More than 1,000 high-quality review articles providing in-depth insights into various aspects of synthetic biology.
  - **iGEM Parts Registry Documentation:** Detailed documentation of parts used in iGEM projects, facilitating accurate information retrieval.
 
- In total, the dataset comprises over **200,000 question-answer pairs**, meticulously assembled to cover a wide spectrum of synthetic biology topics. For more detailed information about the dataset, please visit our [training data repository](https://github.com/shellwork/XJTLU-Software-RAG/tree/main).
+ In total, the dataset comprises over **200,000 question-answer pairs**, meticulously assembled to cover a wide spectrum of synthetic biology topics. For more detailed information about the dataset, please visit our [training data repository](https://huggingface.co/datasets/shellwork/ChatParts_Dataset).
 
  ## 🛠️ How to Use
 
@@ -81,12 +83,12 @@ print("Context (divided into sentences):\n{}\n".format(result['splited_context']
 
  ## 📄 License
 
- This model is released under the **Llama-3.1 License**. For more details, please refer to the [license information](https://www.modelscope.cn/models/llm-research/meta-llama-3.1-8b-instruct) in the repository.
+ This model is released under the **Llama-3.1 License**. For more details, please refer to the [license information](https://huggingface.co/meta-llama/Llama-3.1-8B-Instruct) in the repository.
 
  ## 🔗 Additional Resources
 
  - **RAG Software:** Explore the full capabilities of our Retrieval-Augmented Generation software [here](https://github.com/shellwork/XJTLU-Software-RAG/tree/main).
- - **Training Data:** Access and review the extensive training dataset [here](https://www.modelscope.cn/datasets/shellwork/ChatParts_Dataset) .
+ - **Training Data:** Access and review the extensive training dataset [here](https://huggingface.co/datasets/shellwork/ChatParts_Dataset) .
  - **Support & Contributions:** For support or to contribute to the project, visit our [GitHub Issues](https://github.com/shellwork/XJTLU-Software-RAG/issues) page.
 
 