Andrew W
Andre3000
AI & ML interests
None yet
Organizations
Pre training
- Rephrasing the Web: A Recipe for Compute and Data-Efficient Language Modeling
  Paper • 2401.16380 • Published • 53
- OPUS: Towards Efficient and Principled Data Selection in Large Language Model Pre-training in Every Iteration
  Paper • 2602.05400 • Published • 352
- The Pile: An 800GB Dataset of Diverse Text for Language Modeling
  Paper • 2101.00027 • Published • 10
Research
models 0
None public yet
datasets 0
None public yet