%0 Journal Article %T Cerebras-GPT: Open Compute-Optimal Language Models Trained on the Cerebras Wafer-Scale Cluster %A Dey, Nolan %A Gosal, Gurpreet %A Zhiming, %A Chen, %A Khachane, Hemant %A Marshall, William %A Pathria, Ribhu %A Tom, Marvin %A Hestness, Joel %J Computing Research Repository %V 2023 %N 2304 %D 2023-04-06 %~ DeepDyve