PaperBanana: Automating Academic Illustration for AI Scientists Paper • 2601.23265 • Published Jan 30 • 221
Article BigCodeArena: Judging code generations end to end with code executions Oct 7, 2025 • 22
BigCodeArena: Unveiling More Reliable Human Preferences in Code Generation via Execution Paper • 2510.08697 • Published Oct 9, 2025 • 39
Privacy-Preserving Tabular Synthetic Data Generation Using TabularARGN Paper • 2508.06647 • Published Aug 8, 2025 • 17
TabularARGN: A Flexible and Efficient Auto-Regressive Framework for Generating High-Fidelity Synthetic Data Paper • 2501.12012 • Published Jan 21, 2025 • 9
DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning Paper • 2501.12948 • Published Jan 22, 2025 • 444
Article I Clicked “I Agree”, But What Am I Really Consenting To? Mar 26, 2025 • 24
Article Welcome Gemma 3: Google's all new multimodal, multilingual, long context open LLM Mar 12, 2025 • 492
The Good, The Bad, and The Greedy: Evaluation of LLMs Should Not Ignore Non-Determinism Paper • 2407.10457 • Published Jul 15, 2024 • 24
Portrait4D-v2: Pseudo Multi-View Data Creates Better 4D Head Synthesizer Paper • 2403.13570 • Published Mar 20, 2024 • 3