Update README.md
README.md
CHANGED
@@ -2,6 +2,7 @@
 license: mit
 datasets:
 - tuandunghcmut/normal_dataset
+- tuandunghcmut/coding-mcq-reasoning
 language:
 - en
 base_model:
@@ -14,13 +15,13 @@ pipeline_tag: text-generation
 
 * This document provides guidance on getting started with `tuandunghcmut/Qwen25_Coder_MultipleChoice`, a model fine-tuned for multiple-choice coding questions.
 
-* Future plans include refactoring the project into a well-structured GitHub repository, expanding the dataset, and retraining the model using distributed training for improved scalability.
-
 * A demonstration notebook is available on Google Colab (click the badge below). Please note that the training code has been omitted from this notebook. It is intended solely for testing and inference using the latest checkpoint.
 [](https://drive.google.com/file/d/1Q4jtRjIkFWIAM82pAg4OBPCLjpQ8ndpI/view?usp=sharing)
 
 * Note: The Qwen2.5 Coder 1.5B-Instruct model might be too small for this task, and the current training dataset may be insufficient. Future iterations will explore using a larger model and more extensive data. However, the current model successfully adheres to the desired YAML format and demonstrates structured reasoning.
 
+* Apologies for the current state of the project. The initial version has some inconsistencies, but future plans include refactoring the code into a more structured format, expanding the dataset, and retraining the model using distributed training for improved scalability. Additionally, we plan to train on a larger, high-quality dataset to enhance performance and ensure better stability.
+
 * The guide below provides an explanation of the code presented in the notebook.
 
 ## Installation
@@ -1243,4 +1244,4 @@ Contributions are welcome! Please feel free to submit a Pull Request.
 
 ## License
 
-This project is licensed under the MIT License - see the LICENSE file for details.
+This project is licensed under the MIT License - see the LICENSE file for details.