tuandunghcmut
/

Qwen25_Coder_MultipleChoice

Text Generation

Model card Files Files and versions

tuandunghcmut commited on Apr 1

Commit

67457f4

·

verified ·

1 Parent(s): 44eb9a4

Update README.md

Files changed (1) hide show

README.md +3 -3

README.md CHANGED Viewed

@@ -18,11 +18,11 @@ pipeline_tag: text-generation
 *   A demonstration notebook is available on Google Colab (click the badge below). Please note that the training code has been omitted from this notebook. It is intended solely for testing and inference using the latest checkpoint.
     [![Open In Colab](https://colab.research.google.com/assets/colab-badge.svg)](https://drive.google.com/file/d/1Q4jtRjIkFWIAM82pAg4OBPCLjpQ8ndpI/view?usp=sharing)
-*   Note: The Qwen2.5 Coder 1.5B-Instruct model might be too small for this task, and the current training dataset may be insufficient. Future iterations will explore using a larger model and more extensive data. However, the current model successfully adheres to the desired YAML format and demonstrates structured reasoning.
-*   Apologies for the current state of the project. The initial version has some inconsistencies, but future plans include refactoring the code into a more structured format, expanding the dataset, and retraining the model using distributed training for improved scalability. Additionally, we plan to train on a larger, high-quality dataset to enhance performance and ensure better stability.
-*   The guide below provides an explanation of the code presented in the notebook.
 ## Installation

 *   A demonstration notebook is available on Google Colab (click the badge below). Please note that the training code has been omitted from this notebook. It is intended solely for testing and inference using the latest checkpoint.
     [![Open In Colab](https://colab.research.google.com/assets/colab-badge.svg)](https://drive.google.com/file/d/1Q4jtRjIkFWIAM82pAg4OBPCLjpQ8ndpI/view?usp=sharing)
+*   Note: The initial training was conducted on a dataset with errors rather than a perfectly preprocessed one—<span style="color:red;">**garbage in, garbage out**</span>. As a result, while the model successfully adheres to the desired YAML format and demonstrates structured reasoning, its performance remains <span style="color:red;">**unstable**</span>. Future iterations will focus on retraining with a <span style="color:red;">**more extensive, high-quality dataset**</span> to improve stability and accuracy.
+*   Apologies for the current state of the project. The initial version has some inconsistencies due to training on the old dataset, [tuandunghcmut/normal_dataset](https://huggingface.co/datasets/tuandunghcmut/normal_dataset). Future plans include refactoring the code into a more structured format, expanding the dataset to the new one, [tuandunghcmut/coding-mcq-reasoning](https://huggingface.co/datasets/tuandunghcmut/coding-mcq-reasoning), and retraining the model using distributed training for improved scalability. Additionally, I plan to train on a larger, high-quality dataset to enhance performance and ensure better stability.
+*   The guide below provides an explanation of the code presented in the notebook. I hope you will understand my ideas and the structure of the code.
 ## Installation