·
AI & ML interests
None yet
Organizations
None yet
models
239
gohsyi/Mistral-7B-Instruct-v0.3-ppo4-rwt2.0-math-4shot-epoch3-critic
7B
•
Updated
•
2
gohsyi/Mistral-7B-Instruct-v0.3-ppo4-rwt2.0-math-4shot-epoch3
7B
•
Updated
•
3
gohsyi/Mistral-7B-Instruct-v0.3-ppo-math-4shot-epoch3-critic
7B
•
Updated
•
3
gohsyi/Mistral-7B-Instruct-v0.3-ppo-math-4shot-epoch3
7B
•
Updated
•
2
gohsyi/Mistral-7B-Instruct-v0.3-ppo4-rwt2.0-math-4shot-epoch2-critic
7B
•
Updated
•
3
gohsyi/Mistral-7B-Instruct-v0.3-ppo4-rwt2.0-math-4shot-epoch2
7B
•
Updated
•
3
gohsyi/Mistral-7B-Instruct-v0.3-ppo-math-4shot-epoch2-critic
7B
•
Updated
•
2
gohsyi/Mistral-7B-Instruct-v0.3-ppo-math-4shot-epoch2
7B
•
Updated
•
4
gohsyi/Llama-3.1-8B-Instruct-ppo-math-4shot-epoch3-critic
8B
•
Updated
•
3
gohsyi/Llama-3.1-8B-Instruct-ppo-math-4shot-epoch3
8B
•
Updated
•
4
datasets
33
gohsyi/Mistral-7B-Instruct-v0.3-gsm8k-4shot.jsonl
Viewer
•
Updated
•
29.9k
•
7
gohsyi/gemma-1.1-7b-it-gsm8k-4shot.jsonl
Viewer
•
Updated
•
29.9k
•
8
gohsyi/samples_gsm8k_cot_2024-12-04T19-03-11.038885.jsonl
Viewer
•
Updated
•
2.64k
•
12
gohsyi/meta-llama__Llama-3.1-8B-Instruct
Preview
•
Updated
•
26
Viewer
•
Updated
•
8.79k
•
5
Viewer
•
Updated
•
12.5k
•
25
gohsyi/Llama-3.2-3B-Instruct-ultrafeedback-4k-overweight
Viewer
•
Updated
•
4.1k
•
9
gohsyi/Llama-3.2-3B-Instruct-ultrafeedback-4k-underweight
Viewer
•
Updated
•
4.1k
•
9
gohsyi/Llama-3.1-8B-Instruct-ultrafeedback-4k-overweight
Viewer
•
Updated
•
4.1k
•
2
gohsyi/Llama-3.1-8B-Instruct-ultrafeedback-4k-underweight
Viewer
•
Updated
•
4.1k
•
6