Minbyul Jeong

Minstar

AI & ML interests

None yet

Recent Activity

liked a dataset 10 days ago

Minbyul/OpenBioRQ

upvoted 2 papers about 15 hours ago

OpenBioRQ: Unsolved Biomedical Research Questions for Agents

Paper • 2606.21959 • Published 11 days ago • 4

Ko-WideSearch: A Korean Breadth-Search Benchmark for Exhaustive Set Enumeration by Web Agents

Paper • 2606.27595 • Published 6 days ago • 6

upvoted 3 papers 23 days ago

A Matter of TASTE: Improving Coverage and Difficulty of Agent Benchmarks

Paper • 2605.28556 • Published May 27 • 73

OCC-RAG: Optimal Cognitive Core for Faithful Question Answering

Paper • 2606.00683 • Published May 30 • 98

GrepSeek: Training Search Agents for Direct Corpus Interaction

Paper • 2605.29307 • Published May 28 • 115

upvoted a paper 30 days ago

The MiniMax-M2 Series: Mini Activations Unleashing Max Real-World Intelligence

Paper • 2605.26494 • Published May 26 • 41

upvoted a paper about 1 month ago

OmniRetrieval: Unified Retrieval across Heterogeneous Knowledge Sources

Paper • 2605.29250 • Published May 28 • 79

upvoted 6 papers about 2 months ago

Teaching Language Models to Think in Code

Paper • 2605.07237 • Published May 11 • 31

The Curious Case of Analogies: Investigating Analogical Reasoning in Large Language Models

Paper • 2511.20344 • Published Nov 25, 2025 • 14

Thinking Sparks!: Emergent Attention Heads in Reasoning Models During Post Training

Paper • 2509.25758 • Published Sep 30, 2025 • 25

Soohak: A Mathematician-Curated Benchmark for Evaluating Research-level Math Capabilities of LLMs

Paper • 2605.09063 • Published May 9 • 82

InteractWeb-Bench: Can Multimodal Agent Escape Blind Execution in Interactive Website Generation?

Paper • 2604.27419 • Published Apr 30 • 13

Co-Evolving Policy Distillation

Paper • 2604.27083 • Published Apr 29 • 68

upvoted a collection about 2 months ago

Korean Medical Dataset

Korean medical data • 23 items • Updated Mar 21, 2025 • 7

upvoted a paper 2 months ago

ASGuard: Activation-Scaling Guard to Mitigate Targeted Jailbreaking Attack

Paper • 2509.25843 • Published Apr 14 • 20

upvoted 2 papers over 1 year ago

System Message Generation for User Preferences using Open-Source Models

Paper • 2502.11330 • Published Feb 17, 2025 • 15

Does Time Have Its Place? Temporal Heads: Where Language Models Recall Time-specific Information

Paper • 2502.14258 • Published Feb 20, 2025 • 26