Sailor2

community

AI & ML interests

Open language models for South-East Asia

Recent Activity

patrickamadeus authored a paper 2 days ago

Seeing Culture: A Benchmark for Visual Reasoning and Grounding

patrickamadeus authored a paper 2 days ago

Vision Language Models are Confused Tourists

patrickamadeus authored a paper 2 days ago

Can Large Language Models Understand, Reason About, and Generate Code-Switched Text?

authored 5 papers 2 days ago

Seeing Culture: A Benchmark for Visual Reasoning and Grounding

Paper • 2509.16517 • Published Sep 20, 2025 • 3

Vision Language Models are Confused Tourists

Paper • 2511.17004 • Published Nov 21, 2025 • 1

Can Large Language Models Understand, Reason About, and Generate Code-Switched Text?

Paper • 2601.07153 • Published Jan 12

M4-RAG: A Massive-Scale Multilingual Multi-Cultural Multimodal RAG

Paper • 2512.05959 • Published Dec 5, 2025

LinguDistill: Recovering Linguistic Ability in Vision-Language Models via Selective Cross-Modal Distillation

Paper • 2604.00829 • Published 4 days ago • 5

submitted a paper to Daily Papers 2 days ago

LinguDistill: Recovering Linguistic Ability in Vision-Language Models via Selective Cross-Modal Distillation

Paper • 2604.00829 • Published 4 days ago • 5

submitted a paper to Daily Papers 5 days ago

Composer 2 Technical Report

Paper • 2603.24477 • Published 10 days ago • 15

authored a paper about 2 months ago

SWE-Universe: Scale Real-World Verifiable Environments to Millions

Paper • 2602.02361 • Published Feb 2 • 60

authored a paper 2 months ago

PingPong: A Natural Benchmark for Multi-Turn Code-Switching Dialogues

Paper • 2601.17277 • Published Jan 24 • 6

authored 2 papers 5 months ago

Diffusion Language Models are Super Data Learners

Paper • 2511.03276 • Published Nov 5, 2025 • 132

Training Optimal Large Diffusion Language Models

Paper • 2510.03280 • Published Sep 28, 2025

authored a paper 5 months ago

Diffusion Language Models are Super Data Learners

Paper • 2511.03276 • Published Nov 5, 2025 • 132

authored a paper 5 months ago

VideoAgentTrek: Computer Use Pretraining from Unlabeled Videos

Paper • 2510.19488 • Published Oct 22, 2025 • 21

authored a paper 6 months ago

Deflanderization for Game Dialogue: Balancing Character Authenticity with Task Execution in LLM-based NPCs

Paper • 2510.13586 • Published Oct 15, 2025 • 1

authored 3 papers 6 months ago

SceMQA: A Scientific College Entrance Level Multimodal Question Answering Benchmark

Paper • 2402.05138 • Published Feb 6, 2024 • 2

Data Interpreter: An LLM Agent For Data Science

Paper • 2402.18679 • Published Feb 28, 2024 • 1

MTSQL-R1: Towards Long-Horizon Multi-Turn Text-to-SQL via Agentic Training

Paper • 2510.12831 • Published Oct 12, 2025 • 5

authored 2 papers 6 months ago

Talk Less, Call Right: Enhancing Role-Play LLM Agents with Automatic Prompt Optimization and Role Prompting

Paper • 2509.00482 • Published Aug 30, 2025

Thai Semantic End-of-Turn Detection for Real-Time Voice Agents

Paper • 2510.04016 • Published Oct 5, 2025 • 4