dmariko (Mariko)

upvoted a collection 10 months ago

smolrx-135M

Collection

3 items • Updated Mar 31, 2025 • 1

upvoted a paper 11 months ago

SmolDocling: An ultra-compact vision-language model for end-to-end multi-modal document conversion

Paper • 2503.11576 • Published Mar 14, 2025 • 141

upvoted a paper over 1 year ago

HelpSteer2-Preference: Complementing Ratings with Preferences

Paper • 2410.01257 • Published Oct 2, 2024 • 24

upvoted 3 collections over 1 year ago

upvoted 3 articles over 1 year ago

SmolLM - blazingly fast and remarkably powerful

Article • Jul 16, 2024 • 441

🧑‍⚖️ "Replacing Judges with Juries" using distilabel

Article • May 3, 2024 • 17

CyberSecEval 2 - A Comprehensive Evaluation Framework for Cybersecurity Risks and Capabilities of Large Language Models

Article • May 24, 2024 • 22

upvoted a paper about 2 years ago

MoE-Mamba: Efficient Selective State Space Models with Mixture of Experts

Paper • 2401.04081 • Published Jan 8, 2024 • 74

upvoted a paper over 2 years ago

ToolLLM: Facilitating Large Language Models to Master 16000+ Real-world APIs

Paper • 2307.16789 • Published Jul 31, 2023 • 101

Llama-3.1-Nemotron-70B

NaturalFunctions

🪐 SmolLM
