File size: 4,696 Bytes
8cd4774
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
2025-11-27 06:07:31,285 - INFO - === Entrenamiento LoRA para servicio EC2 ===
2025-11-27 06:07:31,285 - INFO - Train dataset: /workspace/data/dataset_sft_EC2_train.jsonl
2025-11-27 06:07:31,285 - INFO - Val dataset  : /workspace/data/dataset_sft_EC2_val.jsonl
2025-11-27 06:07:31,285 - INFO - Output dir   : /workspace/out/starcoder2_7b_lora_ec2
2025-11-27 06:07:31,285 - INFO - Cargando tokenizer...
2025-11-27 06:07:51,430 - INFO - === Entrenamiento LoRA para servicio EC2 ===
2025-11-27 06:07:51,430 - INFO - Train dataset: /workspace/data/dataset_sft_EC2_train.jsonl
2025-11-27 06:07:51,430 - INFO - Val dataset  : /workspace/data/dataset_sft_EC2_val.jsonl
2025-11-27 06:07:51,431 - INFO - Output dir   : /workspace/out/starcoder2_7b_lora_ec2
2025-11-27 06:07:51,431 - INFO - Cargando tokenizer...
2025-11-27 06:07:52,217 - INFO - Cargando datasets y aplicando formato...
2025-11-27 06:07:53,195 - INFO - Configurando quantizaci贸n 4-bit...
2025-11-27 06:12:48,086 - INFO - === Entrenamiento LoRA para servicio EC2 ===
2025-11-27 06:12:48,086 - INFO - Train dataset: /workspace/data/dataset_sft_EC2_train.jsonl
2025-11-27 06:12:48,086 - INFO - Val dataset  : /workspace/data/dataset_sft_EC2_val.jsonl
2025-11-27 06:12:48,086 - INFO - Output dir   : /workspace/out/starcoder2_7b_lora_ec2
2025-11-27 06:12:48,086 - INFO - Cargando tokenizer...
2025-11-27 06:12:48,375 - INFO - Cargando datasets y aplicando formato...
2025-11-27 06:12:48,645 - INFO - bitsandbytes NO disponible. Cargando modelo en bfloat16 sin cuantizaci贸n...
2025-11-27 06:13:41,018 - INFO - Configurando LoRA...
2025-11-27 06:13:41,019 - INFO - Configurando entrenamiento SFT...
2025-11-27 06:15:56,608 - INFO - === Entrenamiento LoRA para servicio EC2 ===
2025-11-27 06:15:56,608 - INFO - Train dataset: /workspace/data/dataset_sft_EC2_train.jsonl
2025-11-27 06:15:56,608 - INFO - Val dataset  : /workspace/data/dataset_sft_EC2_val.jsonl
2025-11-27 06:15:56,608 - INFO - Output dir   : /workspace/out/starcoder2_7b_lora_ec2
2025-11-27 06:15:56,608 - INFO - Cargando tokenizer...
2025-11-27 06:15:57,256 - INFO - Cargando datasets y aplicando formato...
2025-11-27 06:15:57,507 - INFO - bitsandbytes NO disponible. Cargando modelo en bfloat16 sin cuantizaci贸n...
2025-11-27 06:16:03,256 - INFO - Configurando LoRA...
2025-11-27 06:16:03,256 - INFO - Configurando entrenamiento SFT...
2025-11-27 06:18:28,869 - INFO - === Entrenamiento LoRA para servicio EC2 ===
2025-11-27 06:18:28,869 - INFO - Train dataset: /workspace/data/dataset_sft_EC2_train.jsonl
2025-11-27 06:18:28,869 - INFO - Val dataset  : /workspace/data/dataset_sft_EC2_val.jsonl
2025-11-27 06:18:28,869 - INFO - Output dir   : /workspace/out/starcoder2_7b_lora_ec2
2025-11-27 06:18:28,869 - INFO - Cargando tokenizer...
2025-11-27 06:18:29,159 - INFO - Cargando datasets y aplicando formato...
2025-11-27 06:18:29,374 - INFO - bitsandbytes NO disponible. Cargando modelo en bfloat16 sin cuantizaci贸n...
2025-11-27 06:18:33,501 - INFO - Configurando LoRA...
2025-11-27 06:18:33,502 - INFO - Configurando entrenamiento SFT...
2025-11-27 06:21:23,985 - INFO - === Entrenamiento LoRA para servicio EC2 ===
2025-11-27 06:21:23,985 - INFO - Train dataset: /workspace/data/dataset_sft_EC2_train.jsonl
2025-11-27 06:21:23,985 - INFO - Val dataset  : /workspace/data/dataset_sft_EC2_val.jsonl
2025-11-27 06:21:23,985 - INFO - Output dir   : /workspace/out/starcoder2_7b_lora_ec2
2025-11-27 06:21:23,985 - INFO - Cargando tokenizer...
2025-11-27 06:21:24,268 - INFO - Cargando datasets y aplicando formato...
2025-11-27 06:21:24,504 - INFO - bitsandbytes NO disponible. Cargando modelo en bfloat16 sin cuantizaci贸n...
2025-11-27 06:21:28,727 - INFO - Configurando LoRA...
2025-11-27 06:21:28,728 - INFO - Configurando entrenamiento SFT...
2025-11-27 06:21:41,012 - INFO - Iniciando entrenamiento...
2025-11-27 08:48:59,934 - INFO - Entrenamiento finalizado.
2025-11-27 08:48:59,935 - INFO - Duraci贸n total (s): 8838.92
2025-11-27 08:48:59,935 - INFO - Epochs entrenadas : 3.0
2025-11-27 08:48:59,935 - INFO - Global steps      : 2025
2025-11-27 08:48:59,935 - INFO - Evaluando en conjunto de validaci贸n...
2025-11-27 08:50:48,176 - INFO - M茅tricas de evaluaci贸n: {'eval_loss': 0.29000768065452576, 'eval_runtime': 108.2382, 'eval_samples_per_second': 5.543, 'eval_steps_per_second': 0.693, 'eval_entropy': 0.2899935628970464, 'eval_num_tokens': 13708374.0, 'eval_mean_token_accuracy': 0.9388597853978475, 'epoch': 3.0}
2025-11-27 08:50:48,177 - INFO - M茅tricas guardadas en: /workspace/out/starcoder2_7b_lora_ec2/training_summary_ec2.json
2025-11-27 08:50:48,177 - INFO - Guardando modelo y tokenizer LoRA...
2025-11-27 08:50:48,581 - INFO - Guardado completo.