Jerry Huang PRO
jerry128
AI & ML interests
None yet
Recent Activity
updated
a dataset
4 days ago
jerry128/SWE-bench_Verified
published
a dataset
4 days ago
jerry128/SWE-bench_Verified
updated
a dataset
16 days ago
jerry128/swe-smith-tool
Organizations
models
9
jerry128/ToolACE-axolotl
Updated
jerry128/Qwen2.5-7B-Instruct-Setwise-Reranker
Text Generation
•
8B
•
Updated
•
25
jerry128/Qwen2.5-7B-Instruct-MUSIQUE-GRPO-CL-Sorted-by_Hops
Text Generation
•
8B
•
Updated
•
4
jerry128/Qwen2.5-7B-Instruct-MUSIQUE-GRPO-STEP-CL
Text Generation
•
8B
•
Updated
jerry128/Qwen2.5-7B-Instruct-MUSIQUE-GRPO-CL-Shuffled
Text Generation
•
8B
•
Updated
•
4
jerry128/Qwen2.5-7B-Instruct-MUSIQUE-GRPO-Baseline
Text Generation
•
8B
•
Updated
•
4
jerry128/Qwen2.5-7B-Instruct-MUSIQUE-GRPO-CL
Text Generation
•
8B
•
Updated
•
6
jerry128/Qwen2.5-7B-Instruct-HOTPOTQA-GRPO-STEP-CL
Text Generation
•
8B
•
Updated
•
4
jerry128/Qwen2.5-7B-Instruct-HOTPOTQA-GRPO-CL
Text Generation
•
8B
•
Updated
•
5
datasets
153
jerry128/SWE-bench_Verified
Viewer
•
Updated
•
100
•
15
jerry128/swe-smith-tool
Viewer
•
Updated
•
2
•
24
jerry128/taubench-tool-calling-Qwen2.5-7B-Instruct-0.0_range_0-10_user-gpt-4o-llm_1116210635
Viewer
•
Updated
•
10
•
28
jerry128/test
Viewer
•
Updated
•
10
•
27
jerry128/rag-rl-sft-linear
Viewer
•
Updated
•
2.77k
•
10
jerry128/rag-rl-sft-min-max
Viewer
•
Updated
•
3.15k
•
8
jerry128/RAG-RL-MuSiQue-Min-Max-rebuttal-Shuffled
Viewer
•
Updated
•
19.9k
•
8
jerry128/RAG-RL-MuSiQue-Min-Max-rebuttal
Viewer
•
Updated
•
19.9k
•
9
jerry128/RAG-RL-MuSiQue-Linear-rebuttal-Sorted-by-Num-Hops
Viewer
•
Updated
•
19.9k
•
11
jerry128/RAG-RL-MuSiQue-Linear-rebuttal-Shuffled
Viewer
•
Updated
•
19.9k
•
11