-
Physics of Language Models: Part 4.1, Architecture Design and the Magic of Canon Layers
Paper • 2512.17351 • Published • 22 -
facebook/PhysicsLM4.2__LlamaCanon-8B-Nemo-1T-lr0.003
Updated • 1 • 4 -
facebook/PhysicsLM4.2__LlamaCanon-1B-Nemo-1T-lr0.002
Updated • 1 • 2 -
facebook/PhysicsLM4.2__LlamaCanon-1B-Nemo-1T-lr0.003
Updated • 1 • 1
Zeyuan Allen-Zhu
zhuzeyuan
AI & ML interests
None yet
Recent Activity
updated
a model
7 days ago
facebook/PhysicsLM4.2__LlamaCanon-8B-Nemo-1T-lr0.003
updated
a model
7 days ago
facebook/PhysicsLM4.2__LlamaCanon-8B-Nemo-1T-lr0.002
updated
a model
7 days ago
facebook/PhysicsLM4.2__LlamaCanon-3B-Nemo-1T-lr0.003