17 49 28

ct2

ct-2

AI & ML interests

None yet

Recent Activity

liked a model 1 day ago

sensenova/SenseNova-U1-8B-MoT

liked a model 1 day ago

ideogram-ai/ideogram-4-nf4

upvoted a paper 1 day ago

Kwai Keye-VL-2.0 Technical Report

View all activity

Organizations

None yet

upvoted a paper 1 day ago

Kwai Keye-VL-2.0 Technical Report

Paper • 2606.10651 • Published 3 days ago • 177

upvoted a paper 5 days ago

The Shape of Addition: Geometric Structures of Arithmetic in Large Language Models

Paper • 2606.03645 • Published 14 days ago • 5

upvoted a paper 7 days ago

LongLive-RAG: A General Retrieval-Augmented Framework for Long Video Generation

Paper • 2606.02553 • Published 11 days ago • 19

upvoted a paper 18 days ago

LLMs as Noisy Channels: A Shannon Perspective on Model Capacity and Scaling Laws

Paper • 2605.23901 • Published 21 days ago • 13

upvoted a collection 20 days ago

BitCPM-CANN

Collection

Full-pipeline ternary quantized model trained on CANN. • 12 items • Updated 18 days ago • 27

upvoted 2 papers 21 days ago

Mix-Quant: Quantized Prefilling, Precise Decoding for Agentic LLMs

Paper • 2605.20315 • Published 24 days ago • 28

Enhancing Train-Free Infinite-Frame Generation for Consistent Long Videos

Paper • 2605.18233 • Published 25 days ago • 92

upvoted a paper 23 days ago

Post-Trained MoE Can Skip Half Experts via Self-Distillation

Paper • 2605.18643 • Published 25 days ago • 30

upvoted 2 papers about 1 month ago

Large Language Models Explore by Latent Distilling

Paper • 2604.24927 • Published Apr 27 • 74

StateSMix: Online Lossless Compression via Mamba State Space Models and Sparse N-gram Context Mixing

Paper • 2605.02904 • Published Apr 5 • 8

upvoted a paper about 2 months ago

Attention Sink in Transformers: A Survey on Utilization, Interpretation, and Mitigation

Paper • 2604.10098 • Published Apr 11 • 82

upvoted a collection 2 months ago

Trinity-Large-Thinking

Collection

5 items • Updated Apr 10 • 32

upvoted 8 papers 3 months ago

InCoder-32B: Code Foundation Model for Industrial Scenarios

Paper • 2603.16790 • Published Mar 17 • 312

FineRMoE: Dimension Expansion for Finer-Grained Expert with Its Upcycling Approach

Paper • 2603.13364 • Published Mar 9 • 9

ct2

AI & ML interests

Recent Activity

Organizations

ct-2's activity