huckiyang committed · Commit 9af96ea · verified · Parent: a036127

Update README.md (README.md, +38 −29)
---
license: apache-2.0
---

This repo provides the model weights released in the ICLR 2025 paper [Towards Neural Scaling Laws for Time Series Foundation Models](https://arxiv.org/abs/2410.12360).

The models range in size from 1M to 1B parameters and were trained on datasets spanning 10M to 16B time points.

Code: https://github.com/Qingrenn/TSFM-ScalingLaws

Dataset: https://huggingface.co/datasets/Qingren/TSFM-ScalingLaws-Dataset

<p align="center">
<img src="figures/tsfm-scalinglaws.jpg" width="100%">
<br />
<span>
Figure 1: Scaling laws for NLL in relation to model size, compute, and dataset size. The blue lines represent in-distribution (ID) performance, while the red and green lines show out-of-distribution (OOD) performance on the LSF and Monash subsets.
</span>
</p>
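Scaling-law curves like those in Figure 1 are typically obtained by fitting a saturating power law, e.g. L(N) = a·N^(−b) + c, to observed loss values. As a minimal sketch of that fitting procedure — using synthetic numbers, not the paper's measurements — one could use `scipy.optimize.curve_fit`:

```python
import numpy as np
from scipy.optimize import curve_fit

def power_law(n, a, b, c):
    """Saturating power law L(N) = a * N^(-b) + c, a common
    functional form for neural scaling laws (N = model size)."""
    return a * n ** (-b) + c

# Synthetic (model size in millions of params, NLL) pairs for
# illustration only -- these are NOT numbers from the paper.
sizes = np.array([1.0, 10.0, 100.0, 300.0, 1000.0])
nll = power_law(sizes, a=5.0, b=0.3, c=0.8)

# Fit the three parameters from the observed points.
params, _ = curve_fit(power_law, sizes, nll, p0=[1.0, 0.2, 1.0], maxfev=10000)
a_hat, b_hat, c_hat = params
print(f"fitted exponent b = {b_hat:.3f}")
```

On noiseless synthetic data the fit recovers the generating exponent; in practice one would fit measured NLL values at each trained model size and read the scaling exponent off `b_hat`.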

<p align="center">
<img src="figures/subjective_results.jpg" width="100%">
<br />
<span>
Figure 2: Prediction results of models with sizes 1B, 300M, 100M, and 10M.
</span>
</p>

```bibtex
@inproceedings{yaotowards,
  title={Towards Neural Scaling Laws for Time Series Foundation Models},
  author={Yao, Qingren and Yang, Chao-Han Huck and Jiang, Renhe and Liang, Yuxuan and Jin, Ming and Pan, Shirui},
  booktitle={The Thirteenth International Conference on Learning Representations},
  year={2025}
}
```